[bionet.molbio.genbank.updates] Database Update

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       ECOFABE      1229 bp ds-DNA             BCT       22-FEB-1990
DEFINITION  E.coli biotin carboxyl carrier protein (fabE) gene, complete cds.
ACCESSION   M32214
KEYWORDS    biotin carboxyl carrier protein.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1229)
  AUTHORS   Alix,J.-H.
  TITLE     A rapid procedure for cloning genes from lambda libraries by
            complementation of E.coli defective mutants: Application to the
            fabE region of the E.coli chromosome
  JOURNAL   DNA 8, 779-789 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        269      739     biotin carboxyl carrier protein (fabE)
BASE COUNT      321 a    324 c    318 g    266 t
ORIGIN      
        1 cgacctcgtc ctccctgacg cagtttttgc gctgcggaaa aggtgacatt ggcgcaacga
       61 aggtatattt tgttttttgc cggaggatag cagcagatcg ctgcacaatg tccgtcaagt
      121 ctaacattga cactctgggg caaaatagac cggcgtcccg gcctgctgga atttatcgct
      181 atgcatacag ctgtcggggc atacgcttta cagacggcgg tgaaacgcct gtcacaatca
      241 cactaaacaa agagtacgga acccactcat ggatattcgt aagattaaaa aactgatcga
      301 gctggttgaa gaatcaggca tctccgaact ggaaatttct gaaggcgaag agtcagtacg
      361 cattagccgt gcagctcctg ccgcaagttt ccctgtgatg caacaagctt acgctgcacc
      421 aatgatgcag cagccagctc aatctaacgc agccgctccg gcgaccgttc cttccatgga
      481 agcgccagca gcagcggaaa tcagtggtca catcgtacgt tccccgatgg ttggtacttt
      541 ctaccgcacc ccaagcccgg acgcaaaagc gttcatcgaa gtgggtcaga aagtcaacgt
      601 gggcgatacc ctgtgcatcg ttgaagccat gaaaatgatg aaccagatcg aagcggacaa
      661 atccggtacc gtgaaagcaa ttctggtcga aagtggacaa ccggtagaat ttgacgagcc
      721 gctggtcgtc atcgagtaac gaggcgaaca tgctggataa aattgttatt gccaaccgcg
      781 gcgagattgc attgcgtatt cttcgtgcct gtaaagaact gggcatcaag actgtcgctg
      841 tgcactccag cgcggatcgc gatctaaaac acgtattact ggcagatgaa acggtctgta
      901 ttggccctgc tccgtcagta aaaagttatc tgaacatccc ggcaatcatc agcgccgctg
      961 aaatcaccgg cgcagtagca atccatccgg gttacggctt cctctccgag aacgccaact
     1021 ttgccgagca ggttgaacgc tccggcttta tcttcattgg cccgaaagca gaaaccattc
     1081 gcctgatggg cgacaaagta tccgcaatcg cggcgatgaa aaaagcgggc gtcccttgcg
     1141 taccgggttc tgacgcgccg ctgggcgacg atatggataa aaaccgtgcc attgctaaac
     1201 gcattggtta tccggtgatt atcaaaccg
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       VACWRA       2955 bp ds-DNA             VRL       13-FEB-1990
DEFINITION  Vaccinia virus abortive late protein gene, complete cds.
ACCESSION   M32064
KEYWORDS    abortive late protein.
SOURCE      Vaccinia virus (strain WR) DNA.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 2955)
  AUTHORS   Pacha,R.F., Meis,R.J. and Condit,R.C.
  TITLE     Structure and expression of the vaccinia virus gene which prevents
            virus-induced breakdown of RNA
  JOURNAL   Unpublished (1990) D. of Biochem., SUNY-Buffalo, Buffalo, NY 14214
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.C.Condit, 12-FEB-1990.
FEATURES       from  to/span     description
    pept        825  <     1 (c) 32 kD protein
    pept       1439      828 (c) 23 kD protein
    pept       1454     2935     56 kD abortive late protein
    mRNA       1028  >  2955     56 kD abortive late protein early mRNA
    mRNA    <  2955     1442 (c) 23 kD protein late mRNA
BASE COUNT     1019 a    536 c    474 g    926 t
ORIGIN      About 120 kb from left end of genome.
        1 tctagattcg tcggtgcact catgtaacca acatacccta ggtcctaaat atttatctcc
       61 ggaattagat tttggataat tcgcgcacca acaatttcta tttcctttat gatcgttaca
      121 aaagacgtat aatgccgtat ccccaaaagt aaaataatca ggacgaataa ttctaataaa
      181 ctcagaacaa tatctcgcat ccatatgttt ggagcaaata tcggaataag tagacatagc
      241 cggtttccgt tttgcacgta accattctaa acaattgggg tttccaggat cgtttctaca
      301 aaatccagtc atgaaatcgt cacaatgttc tgtcttgtaa ttattattaa atatttttgg
      361 acagtgtttg gtatttgtct tagaacaaca ttttgccacg ctatcactat cgcccaggag
      421 ataatccttt tttataaaat gacatcgttg cccggatgct atataatcag tagcgtgttt
      481 taaatcctta atatattcag gagttacctc gttctgataa tagattaatg atccaggacg
      541 aaatttgaaa gaactacatg gttctccatg aattaataca tattgtttag caaattcagg
      601 aactataaaa ctactacaat gatctatcga cataccatct atcaaacaaa acttgggttt
      661 aatttctccc ggagatgttt cataatagta cgtataactt tcttctgcaa acttaacagc
      721 tctattatat tcaggataat taaaacctaa ttccatatat ttgtctcgta tatctgctat
      781 tcctggtgct attttgattc tattaagagt aacagctgcc cccattctta ataatcgtca
      841 gtatttaaac tgttaaatgt tggtatatca acatctacct tatttcccgc agtataaggt
      901 ttgttgcagg tatactgttc aggaatggtt acatttatac ttcttctata gtcctgtctt
      961 tcgatgttca tcacatatgc aaagaacaga ataaacaaaa taatgtaaga aataatatta
     1021 aatatctgtg aattcgtaaa tacattgatt gccataataa ttacagcagc tacaatacac
     1081 acaatagaca ttcccacagt gttgccatta cctccacgat acatttgagt tactaagcaa
     1141 taggtaataa ctaagctagt aagaggcaat agaaaagatg agataaatat catcaatata
     1201 gagattagag gagggctata tagagccaag acgaacaaaa tcaaaccgag taacgttcta
     1261 acatcattat ttttgaagat tcccaaataa tcattcattc ctccataatc gttttgcatc
     1321 atacctccat ctttaggcat aaacgattgc tgctgttcct ctgtaaataa atctttatca
     1381 agcactccag cacccgcaga gaagtcgtca agcatattgt aatatcttaa ataactcatt
     1441 tatatattaa aaaatgtcac tattaaagat ggagtataat ctttatgccg aactaaaaaa
     1501 aatgacttgt ggtcaacccc taagtctttt taacgaagac ggggatttcg tagaagttga
     1561 accgggatca tcctttaagt ttctgatacc taagggattt tacgcctctc cttccgtaaa
     1621 gacgagtcta gtatttgaaa cattaacaac gaccgataat aaaatcacta gtatcaatcc
     1681 aacaaatgcg ccaaagttat atcctcttca acgcaaagtc gtatctgaag tagtttctaa
     1741 tatgaggaaa atgatcgaat caaaacgtcc tctatacatt actcttcact tggcgtgtgg
     1801 atttggtaag actattacca cgtgttatct tatggctaca cacggtagaa aaaccgtcat
     1861 ttgcgtaccc aataaaatgt taatacatca atggaagaca caggtagagg cagtcggatt
     1921 ggaacataag atatccatag atggagtaag tagtctatta aaggaactaa agactcaaag
     1981 tccggatgta ttaatagtag tcagtagaca tctgacaaac gatgcctttt gtaaatatat
     2041 caataagcat tatgatttgt tcatcttgga tgaatcacat acgtataatc tgatgaacaa
     2101 tacagcagtt acaagatttt tagcgtatta tcctccgatg atgtgttatt ttttaactgc
     2161 tacacctaga ccagctaacc gaatttattg taacagtatt attaatattg ccaagttatc
     2221 cgatctaaaa aaaactatct atgcggtaga tagttttttt gagccatatt ccacagacaa
     2281 tattagacat atggtaaaac gactagatgg accatctaat aaatatcata tatataccga
     2341 gaagttatta tctgtagacg agcctagaaa tcaacttatt cttaataccc tggtagaaga
     2401 attcaagtca ggaactatta atcgcatttt agttattact aaactacgtg aacatatggt
     2461 attattctac aaacgattat tagatctttt cggaccagag gttgtattta taggagacgc
     2521 ccaaaataga cgtactccag atatggtcaa atcaatcaag gaactaaata gatttatatt
     2581 cgtatccacc ttattttatt ccggtactgg tttagatatt cctagtttgg attcgttgtt
     2641 catttgctcg gcagtaatca acaatatgca aatagagcaa ttactaggga gggtatgtcg
     2701 agaaacagaa ctattagata ggacggtata tgtatttcct aacacatcca tcaaagaaat
     2761 aaagtacatg ataggaaatt tcatgcaacg aattattagt ctgtctgtag ataaactagg
     2821 atttaaacaa gaaagttatc ggaaacatca agaatccgat cccacttctg tatgtacaac
     2881 atcctccaga gaagaacgtg tattaaatag aatatttaac tcgcaaaatc gttaagaagt
     2941 ttaagcgacg atccg
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       PSECAMAB     3825 bp ds-DNA             BCT       13-FEB-1990
DEFINITION  P.putida putidaredoxin reductase (camA) and putidaredoxin (camB)
            genes, complete cds.
ACCESSION   J05406
KEYWORDS    electron transfer protein; flavoprotein reductase;
            iron-sulfur protein; putidaredoxin; putidaredoxin reductase.
SOURCE      P.putida (strain ATCC 17453) DNA, clone pBS5.
  ORGANISM  Pseudomonas putida
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Acidiphilium cryptum; Pseudomonadaceae.
REFERENCE   1  (bases 1 to 3825)
  AUTHORS   Peterson,J.A., Lorence,M.C. and Amarneh,B.
  TITLE     Putidaredoxin reductase and putidaredoxin: Cloning, sequence
            determination, and heterologous expression of the proteins
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Peterson, 07-FEB-1990.
FEATURES       from  to/span     description
    pept         47     1315     putidaredoxin reductase
    pept       1371     1694     putidaredoxin
    mRNA    <     1     1842     camA/camB mRNA (3' end +/- 11 bp)
    binding      32       36     ribosome binding site
    binding      77      106     FAD binding domain (pot.)
    binding     511      541     NADH binding domain (pot.)
    binding    1358     1363     ribosome binding site
BASE COUNT      855 a   1086 c   1135 g    749 t
ORIGIN      
        1 tccggcgact accaaagcgg tataaacaca tgggagtgcg tgctaagtga acgcaaacga
       61 caacgtggtc atcgtcggta ccggactggc tggcgttgag gtcgccttcg gcctgcgcgc
      121 cagcggctgg gaaggcaata tccggttggt gggggatgcg acggtaattc cccatcacct
      181 accaccgcta tccaaagctt acttggccgg caaagccaca gcggaaagcc tgtacctgag
      241 aaccccagat gcctatgcag cgcagaacat ccaactactc ggaggcacac aggtaacggc
      301 tatcaaccgc gaccgacagc aagtaatcct atcggatggc cgggcactgg attacgaccg
      361 gctggtattg gctaccggag ggcgtccaag acccctaccg gtggccagtg gcgcagttgg
      421 aaaggcgaac aactttcgat acctgcgcac actcgaggac gccgagtgca ttcgccggca
      481 gctgattgcg gataaccgtc tggtggtgat tggtggcggc tacattggcc ttgaagtggc
      541 tgccaccgcc atcaaggcga acatgcacgt caccctgctt gatacggcag cccgggttct
      601 ggagcgggtt accgccccgc cggtatcggc cttttacgag cacctacacc gcgaagccgg
      661 cgttgacata cgaaccggca cgcaggtgtg cgggttcgag atgtcgaccg accaacagaa
      721 ggttactgcc gtcctctgcg aggacggcac aaggctgcca gcggatctgg taatcgccgg
      781 gattggcctg ataccaaact gcgagttggc cagtgcggcc ggcctgcagg ttgataacgg
      841 catcgtgatc aacgaacaca tgcagacctc tgatcccttg atcatggccg tcggcgactg
      901 tgcccgattt cacagtcagc tctatgaccg ctgggtgcgt atcgaatcgg tgcccaatgc
      961 cttggagcag gcacgaaaga tcgccgccat cctctgtggc aaggtgccac gcgatgaggc
     1021 ggcgccctgg ttctggtccg atcagtatga gatcggattg aagatggtcg gactgtccga
     1081 agggtacgac cggatcattg tccgcggctc tttggcgcaa cccgacttca gcgttttcta
     1141 cctgcaggga gaccgggtat tggcggtcga tacagtgaac cgtccagtgg agttcaacca
     1201 gtcaaaacaa ataatcacgg atcgtttgcc ggttgaacca aacctactcg gtgacgaaag
     1261 cgtgccgtta aaggaaatca tcgccgccgc caaagctgaa ctgagtagtg cctgaaatct
     1321 atacccacaa taaatcaccg ttttgcccca tagcgtgtga ggataaacag atgtctaaag
     1381 tagtgtatgt gtcacatgat ggaacgcgtc gcgaactgga tgtggcggat ggcgtcagcc
     1441 tgatgcaggc tgcagtctcc aatggtatct acgatattgt cggtgattgt ggcggcagcg
     1501 ccagctgtgc cacctgccat gtctatgtga acgaagcgtt cacggacaag gtgcccgccg
     1561 ccaacgagcg ggaaatcggc atgctggagt gcgtcacggc cgaactgaag ccgaacagca
     1621 ggctctgctg ccagatcatc atgacgcccg agctggatgg catcgtggtc gatgttcccg
     1681 ataggcaatg gtaaaccaca atggtaaacc actgcgagcc aaaacagccg agcaggagcg
     1741 cagtccggca acaccttatt aagcacatgc cgaaccctat ttgcagcgct tcatgcctgc
     1801 aaagtcccga ttgatgaaat ccgggctcca agcaaggagc ccggaatctc tcaccgccac
     1861 gaaatcaatg gccaatcccg ggacttggtt gagacgtccg tttctccact acgactttgt
     1921 cgcaatgatc tcctcatgct cagatccgaa ggatctctat agatcataca gccaggctgt
     1981 atctggagaa gtccaaggtt catctggtct gtgatgccaa tgactcgccc tcaagcctcg
     2041 ttttgtcagt gagagtagtc acctgatgca acagcgtctc actcaacgcc tgcgccgcgc
     2101 tggatagctg atgcccagcc tggtgcagca ggccgacgcg acgcgacacc tgtggctcgc
     2161 tcagcggtag gcagcgggcg cccagttctt ccatctgctg ccggcacagt tgcggcacgg
     2221 cgctgacgcc gagtgaacgc aaacgacaac gtggtcatcg tcggtaccgg actggctggc
     2281 gttgaggtcg ccttcggcct gcgcgccagc ggctgggaag gcaatatccg gttggtgggg
     2341 gatgcgacgg taattcccca tcacctacca ccgctatcca aagcttactt ggccggcaaa
     2401 gccacagcgg aaagcctgta cctgagaacc ccagatgcct atgcagcgca gaacatccaa
     2461 ctactcggag gcacacaggt aacggctatc aaccgcgacc gacagcaagt aatcctatcg
     2521 gatggccggg cactggatta cgaccggctg gtattggcta ccggagggcg tccaagaccc
     2581 ctaccggtgg ccagtggcgc agttggaaag gcgaacaact ttcgatacct gcgcacactc
     2641 gaggacgccg agtgcattcg ccggcagctg attgcggata accgtctggt ggtgattggt
     2701 ggcggctaca ttggccttga agtggctgcc accgccatca aggcgaacat gcacgtcacc
     2761 ctgcttgata cggcagcccg ggttctggag cgggttaccg ccccgccggt atcggccttt
     2821 tacgagcacc tacaccgcga agccggcgtt gacatacgaa ccggcacgca ggtgtgcggg
     2881 ttcgagatgt cgaccgacca acagaaggtt actgccgtcc tctgcgagga cggcacaagg
     2941 ctgccagcgg atctggtaat cgccgggatt ggcctgatac caaactgcga gttggccagt
     3001 gcggccggcc tgcaggttga taacggcatc gtgatcaacg aacacatgca gacctctgat
     3061 cccttgatca tggccgtcgg cgactgtgcc cgatttcaca gtcagctcta tgaccgctgg
     3121 gtgcgtatcg aatcggtgcc caatgccttg gagcaggcac gaaagatcgc cgccatcctc
     3181 tgtggcaagg tgccacgcga tgaggcggcg ccctggttct ggtccgatca gtatgagatc
     3241 ggattgaaga tggtcggact gtccgaaggg tacgaccgga tcattgtccg cggctctttg
     3301 gcgcaacccg acttcagcgt tttctacctg cagggagacc gggtattggc ggtcgataca
     3361 gtgaaccgtc cagtggagtt caaccagtca aaacaaataa tcacggatcg tttgccggtt
     3421 gaaccaaacc tactcggtga cgaaagcgtg ccgttaaagg aaatcatcgc cgccgccaaa
     3481 gctgaactga gtagtgcctg aatgtctaaa gtagtgtatg tgtcacatga tggaacgcgt
     3541 cgcgaactgg atgtggcgga tggcgtcagc ctgatgcagg ctgcagtctc caatggtatc
     3601 tacgatattg tcggtgattg tggcggcagc gccagctgtg ccacctgcca tgtctatgtg
     3661 aacgaagcgt tcacggacaa ggtgcccgcc gccaacgagc gggaaatcgg catgctggag
     3721 tgcgtcacgg ccgaactgaa gccgaacagc aggctctgct gccagatcat catgacgccc
     3781 gagctggatg gcatcgtggt cgatgttccc gataggcaat ggtaa
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       CHKAK1       6300 bp ds-DNA             VRT       15-MAR-1989
DEFINITION  Chicken cytosolic adenylate kinase (AK1) gene, complete cds and
            flanks.
ACCESSION   D00251
KEYWORDS    AK1 gene; adenylate kinase; cytosolic adenylate kinase.
SOURCE      Chicken liver DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 6300)
  AUTHORS   Suminami,Y., Kishi,F., Torigoe,T. and Nakazawa,A.
  TITLE     Structure and complete nucleotide sequence of the gene encoding
            chicken cytosolic adenylate kinase
  JOURNAL   J. Biochem. 103, 611-617 (1988)
  STANDARD  full staff_entry
COMMENT     The chicken AK1 gene spanned about 6 kbp and consisted of 7 exons.
            The transcription initiation site was determined by the S1 nuclease
            mapping analysis. Evolution of the AK1 gene is discussed in
            comparison with the isozyme AK2.  corresponds to DDBJ (rel4.0)
            entry <CHKAK1>(D00251) dated 12-SEP-1988
FEATURES       from  to/span     description
    pept       2612     2621     adenylate kinase (a part of exon 2)
               4056     4091     adenylate kinase (exon 3)
               4478     4641     adenylate kinase (exon 4)
               4754     4870     adenylate kinase (exon 5)
               5382     5573     adenylate kinase (exon 6)
               5689     5754     adenylate kinase (exon 7)
    pre-msg    1070     5914     AK1 mRNA and introns
    IVS        1121     2578     AK1 intron a
    IVS        2622     4055     AK1 intron b
    IVS        4092     4477     AK1 intron c
    IVS        4642     4753     AK1 intron d
    IVS        4871     5381     AK1 intron e
    IVS        5574     5688     AK1 intron f
    signal      973      976     CAT box
    signal     1041     1047     TATA box
    signal     5896     5901     polyadenylation signal
BASE COUNT      998 a   2011 c   1904 g   1387 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcccgat tgcttcagag tccccttggt ttggggtccg ggctgcactg tgcaccccag
       61 ggctgataca gcatcagggc cgtgtgcacc gctgctgccc atgggtctgc acagccacga
      121 catggcaggt tttagggtgt ctcatcccct cctgcaggac gtgggaaccc tgcatggtcc
      181 caactcagaa cctcccaggc tgcgttccat caccctgaga gcacacgggg ggtaggtgaa
      241 tggtcccctt tggctggcat gggggtcacg ttgctttggg ggcaaaattt tagccccacc
      301 cccccccccc acctgtcccc cagcacaaat cctcacattt tcagtgtcac catccccaga
      361 gcagagacct gagagccacc ccagttagac caaggggatg acactgccaa cgtggtgccg
      421 ggagggggga ggtggggagg aggggggggg caataccaca cagcagctgc cggaaccttt
      481 tgtaacctgg ccatggggcg tggggggatt ttttccctcc tggtcccggt gtttggattc
      541 ccactgtgga gcccacgatg ctcatcactc agcagcccct cacgaaccat tcagaagggt
      601 ttggacgctg agccccattg ggtggcagcc ccgtgcctca gtttccccct cggccgtggc
      661 agcggggggg gggggcgtcc ggtggggtca caccttgggg ggaggggggg gctcagcacc
      721 gtgttttgta tcgatttgta caggaataaa cctcgctgca gtctgttgcc ttggaccgtg
      781 ctgcgggatg catctctggg gtccgggggg gagggaaagg gaccccgtag gacccccccc
      841 catcccgagc ggggctgagg tgggcggttg ggggctgcag gacctcccag ctctacccgt
      901 catccgcggg ctcctcccgt tgcgggggcg cggtaccggg gggctgcggg ggggctgcgg
      961 gcgggcgggc ggcattgccc ccccccacag tgccccccct ccccgctgcc cgctcccggc
     1021 atgccccgcg ccgccagcat tatatatatc tagcggtacg gcagaagctg ccggccagca
     1081 gcagccgccc cccccgtatt ctgcaccgcc ccccccgcag gtcagtgccg ccccccccag
     1141 ccctgctgct ggtttgcacc gcggttcgtg tcccaacccc cccccccaga tctcccccct
     1201 ggtccccccc ggcacagtct cgcggtgacc ttggggaggg cgttccggca tccggcttcg
     1261 gggccccccc cgctgccccc cgcccgacca ttctcgcgtg ggggcgagac gcgatcggtc
     1321 cgtccccccc tctccgtgcg ttgagctgtg gggatggggg tgcggggggg gggatgtgaa
     1381 catcgctgcg ggtgtgcggg acccgggggg ggagaaggga gaggagcggt ggggagcacc
     1441 ggggggcatt gcaggggtgc tgtgctattg cggggggtgc tgagctatcg gggggcgccg
     1501 gggtgcggcc cttgggctgg agtccccaca tgggaggtgg gtgcgggcaa agcgtgcggt
     1561 tggagctcct ggtgggctgc agttctgggg agcaccctgg ggtgcagcac ggcgtgggct
     1621 gtgctgggga gtgctgtgcc gcccgctccg tgggcacagc gccacgctgg atcaggccgt
     1681 gagcccacca agggctgctg ccaggtgtgc tcggtgcccc cgcgccaggg acccccaggt
     1741 tgtgcccttc ctatggggcg atgcagggct cagcctcaaa acgcaagcag gacgtgaccc
     1801 ggtgcagatt tcagcccgtc catacttagc ctctgccctt tcccggctgt aacacgacct
     1861 gatggatggc cacggcccct tcggccccgg tctggccggt ttggggacag cggggtgacg
     1921 tggtgtggct gagcaggtgg ggtgctgcgg ggctgggtgc gggcgggcgg ttatgggtat
     1981 ggccaaaccc acagcgagga actggaaggg cagtggggtg cagcagcccc cccttgggca
     2041 ccgagaacag gctgacccct taaaaagcat cccctccagc acatggggag ctggaggtac
     2101 gtgggatact gcctccgtcc ctcagtgtcc tgctgccatg tgtcaccccg agccaatttc
     2161 cggctgccag gcagcacccc gggcatgggg atgtggtcag ggcaggggag ggtgctggga
     2221 ttgtcaccgt gcgtcagccc agtgccggca cagggcgggt ggctgccggc agcgaggtag
     2281 gcgacggccg caggaagtga ggtcccagct cctggagcca tggaattcag gcagcattgt
     2341 gcctttctcg tgtattttag taattttata tatttaagag ccaagctggg gaacggtgct
     2401 gctagcaggg gggaaacccc gtcccccagc gtggccctag agctcaattt gagatctgct
     2461 ccatcccatc gtggaagcaa agggagagct gttgtcctgg caacgtcctc cccaggccgc
     2521 agtgccgggc cccgtgtgtc cctgccgtgc ctctcactct gctctgcttc ccttgcagcc
     2581 tgaagtgctc ctccacccag acagcagcag catgtcgaca ggtgagtgct ttgcatctgt
     2641 gcctcctgca tccattccct ccatcccctt tgcattcacc cccccttgca cctgatccct
     2701 caatccccct tggcatccat cctctccaac ccctgttgcc tccatccctt ttatctcatt
     2761 gtgtgtactc aatttttcct ccttgtgtcc atctcctgcg tcctttttgc atttgtcctc
     2821 tccgtccctc ttgcatctaa acccttgggc cccctcgtgt ccatcctctg tgtccatctg
     2881 ttccctctgt tcctcttgca tctaatccct tagttccctt tgcatccatc catctgtccc
     2941 cccatcctcc tatgttgtcc atttgtgtcc atcacttgtg cctcgacatg tccatccctt
     3001 ttctccccct tacatctgat ctctcagccc tctgtgcctg tctcccatgt cctctctgta
     3061 tctgtcacct ctgtccctct tgcatctgat ccttaggtcc ccttttgtgt ctgtctcatg
     3121 ttctctttcc atctatctcc tctgtttccc ttgtatccaa gtcctgaatc tctcatccaa
     3181 tccccagagt cccccccttg tgtccatctt ctatgttctc cagtagtgtc tgtctcctct
     3241 gtctccccct cgtgtccatt ccgtgtcctc tttccatccc ttccctccat tccccttacc
     3301 tgtccccccg ttgcctctgt cccatttctc ccccttgctt ccctccagcc cccacgtact
     3361 ccatgtcttc tgcccacccc ccttccccac cagcctcttg gctccctccc atcccggggc
     3421 tcaccggctg ctggaggagc tgaaatttca gaagactgag atctggagat ggcagcacgg
     3481 ccccagggtg gctcatgtta tttttagcgt cctccctctg cgctgtcaca ccccaacgcg
     3541 tggcagggac agatcctgcg gggggggggg gggaatcctg cagggaaaac ccccccgtcc
     3601 tatgggctgg agtgctctga gtggatgggg ctatgccatt tctggggaca gtcagatccc
     3661 acagggttgt gccatgccgt gccatgctgc accaagcatg actaggagct gaccatgggg
     3721 atggttgtgg ctgtccctgc tggttcctat ctctcagtgg ggtcggagaa atgaccgtct
     3781 ccaggatgca gtggagttcc aacccctcag ctcaacccct cgtctgccag gacaccatgg
     3841 ggtctcagtg cacgagcatg tgcagcttca atcctggact gtctaggtga caggatgaca
     3901 ggatgagtgc ttcctactct gccctccctg gggttggggt tggtccccac cactcctcgt
     3961 tcagtcctgc cactggggac agttttctcc ctggtgtaac ttctggcttt tagatttatt
     4021 gttgtttgtt ttttttttct ctctctctcc accagaaaaa ctgaagcacc acaagatcat
     4081 cttcgtggtg ggtaagtggg tgtgggacag aggtggccgg tgtggggtgt gctgggacca
     4141 gcagggtcta gcaaagcttg ggatgccagc atccaatgtg atgtgatggc aaactcctgg
     4201 gggtggtgtc atgggtgctt gatggggcgg tgatgtccag ggtaccatca ggagagtgat
     4261 gtctgggata ttggtcatgg tgcctctggt gccatcagag caatgatgcc caggtgccaa
     4321 tggccaatca gccacaccag cagcccatgc cgccccaaca gctctttcca tccatcctgt
     4381 ctcctctcat caggctttga tgggtctggg ctgccgctcc tcctaccccc actagatggg
     4441 gcatgggaca tgacggtggc cctggtgtga cttgcaggtg gccccggctc agggaagggg
     4501 acgcaatgcg agaagattgt gcacaagtat gggtacactc acctctccac tggggacctg
     4561 ctccgggcag aggtcagctc gggctcagag cggggcaaga agctgcaagc catcatggag
     4621 aagggcgagc tggttcccct ggtgagtttg tggtcatgaa cacagggtga tgtggggaca
     4681 ctgtggtgat gcagcagcag gaatggggca cgtcgtccca catcaccctg acagctgtat
     4741 gccccctccc caggacacgg tgctggacat gctgcgggac gccatgttgg ctaaggcaga
     4801 cacctccaag ggtttcctca ttgacggcta ccctcgcgag gtgaagcagg gagaggagtt
     4861 tgaaaagaag gtgagggccg ttcccctggt gagtgccgtg cggtgttatg ttgtgccacc
     4921 ctccgtctgc ggcctatgcg gtgctggcca atggtgaggc cttgtcggct tcactgcagc
     4981 ctcagcccag ccctggcgtc agccacagca gggcacggtg gcaggggaca tctatccagg
     5041 ctgccctcag gcctggggca gtggcagaga gccaaggcct gtcccctggt gggtgccatg
     5101 ccacagtgtg ttgtgtcatc ctccatctgc ggcctacatg gtgatggcac tatgtgaggc
     5161 ctagtcaggc ttcactgcgg cctacaggcc tacgccctgg ggacccccta tagtgggatg
     5221 cggtggcagg ggacgtctgt ctgggctgca cccagggctg gcagagggtg actggggtga
     5281 tggccacgct gtcccctcct tgccggtgtc acacctccag ccccacctgg aggagacgtg
     5341 gggacaccca gacccacatg gtgacagctg tgtccccaca gattgccccc cccacgctgc
     5401 tgctctacgt ggatgcgggg aaggagacga tggtgaagcg gctgctgaag cggggagaga
     5461 ccagcgggcg ggtggacgac aacgaggaga ccatcaagaa gcgcttggag acgtactaca
     5521 aggctaccga acccgtcatc gccttctaca agggccgggg catcgtccgc caggtgagcg
     5581 gggtacaggg aggggggaag caatacatgg gggtaacaga gctgccagct gccccccctt
     5641 ccccccagca ttgctgaggg aaccagcacc ccgtcactgt ccccacagct gaatgccgag
     5701 ggcaccgtgg atgaggtttt ccagcaggtc tgctcctacc tcgacaagct gtgacggccc
     5761 cgcgcccccc accctgcgct ccccccacac gcgcagggca gagacagcgg aagtggcctt
     5821 atcctgtttt cgtggacaga gccgcgcgaa ggaaatttca aggacattgt gtttggctct
     5881 ttcccgtctc tccccagtaa agttcacttt aatgagccca gactttatct ttttcttctg
     5941 tcgcaggaaa tgagtttttc tttccagaga tttttgtttc tagcctcccc ccaccccccc
     6001 tcagccagcc cctctctgaa gctgataaag ggcaggaagc gggtgtttat cccggctgag
     6061 atgcagcggg cccgggggat gcttggagga aggcaggaga tcccggctgc ttcgcctggg
     6121 gccttggtgt cctggccctg tcctgctccc cccaacccga tctcattcct gtgctcctgt
     6181 ctctgcatgc gcctggggca gcatcccacc ccatccctcc tcctcatccc atcagatggt
     6241 gatttccacc ccttccacca cttcctcgct gcttttgtcc attggggtgt tttgggatcc
//
LOCUS       CHKBGBL3      192 bp ds-DNA             VRT       15-MAR-1990
DEFINITION  Chick 14K beta-galactoside-binding lectin gene, exon 3.
ACCESSION   D00310
KEYWORDS    14K lectin; beta-galactoside-binding lectin;
            chick 14K beta-galactoside-binding lectin; lectin.
SEGMENT     3 of 4
SOURCE      A library of chick genomic fragments in lambda-EMBL3 provided by J.
            D. Engel (Northwestern University).
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 192)
  AUTHORS   Ohyama,Y. and Kasai,K.
  TITLE     Isolation and characterization of the chick 14K beta-galactoside-
            binding lectin gene
  JOURNAL   J. Biochem. 104, 173-177 (1988)
  STANDARD  full staff_entry
COMMENT     The mRNA for the lectin is encoded by four exons. The first exon
            encodes only three amino acid residues and the other three exons
            encode, respectively, one of the three repeating sequences found in
            this lectin.
FEATURES       from  to/span     description
    pept    +    11  +   182     14K beta-galactoside-binding lectin, exon 3
                                 (AA 31 at 12)
    IVS     <     1       10     14K beta-galactoside-binding lectin, intron 2
    IVS         183  >   192     14K beta-galactoside-binding lectin, intron 3
BASE COUNT       49 a     48 c     52 g     43 t
ORIGIN      About 1.2 kb after segment 2.
        1 ttctctgcag ctttgtgatg aatctgggca aggactcaac ccacctgggg cttcacttca
       61 acccccgctt tgatgctcac ggtgatgtga acctcattgt gtgcaactca aagaaaatgg
      121 aagagtgggg taccgagcaa agggagacgg tcttcccttt ccagaaggga gccccaatag
      181 aggtaaagcc tg
//
LOCUS       XELPRL5B      986 bp ss-mRNA            VRT       15-MAR-1990
DEFINITION  X.laevis ribosomal protein L5b 60 S subunit mRNA, complete cds.
ACCESSION   M29033
KEYWORDS    5S ribosomal RNA binding protein; nucleolar protein;
            ribosomal protein.
SOURCE      X.laevis stage 17 embryo, cDNA to mRNA, clone L5b.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 986)
  AUTHORS   Wormington,W.M.
  TITLE     Developmental expression and 5S rRNA-binding activity of Xenopus
            laevis ribosomal protein L5
  JOURNAL   Mol. Cell. Biol. 9, 5281-5288 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Wormington, 14-OCT-1989.
FEATURES       from  to/span     description
    pept         53      943     L5b ribosomal protein
    mRNA    <     1  >   986     L5b mRNA
BASE COUNT      309 a    200 c    252 g    225 t
ORIGIN      
        1 gttagggctg tgactctccg gtcctagccc cgacggctca aaatagagca aaatggggtt
       61 cgtaaaggtc gtcaagaaca aggcttattt taagaggtac caggtcaagt tccgcagaag
      121 gagagagggc aagaccgatt actatgctcg caagcgactg gtgatccagg ataagaacaa
      181 gtacaatact cccaagtaca ggatgattgt acgtgtcacc aacagagaca tcatctgcca
      241 gattgcttat gctaggattg agggagacat gattgtatgt gctgcttatg ctcatgagct
      301 tcccaagtat ggtgtcaaag ttgggctgac aaactatgct gcagcatatt gtactggcct
      361 gctgctggct cgtaggcttc tcaacaaatt cggtctggac aaagtatacg aaggtcaagt
      421 ggaagtaaca ggagatgagt acaatgtaga gagcgttgat ggtgaacctg gtgcctttac
      481 atgctacctt gatgccggcc tcaccagaac caccactgga aacaaagttt ttggtgctct
      541 taagggagct gtggatggag gtttatctat tccacacagt accaagcgat tccctggcta
      601 tgactctgaa agcaaagaat tcaatcctga ggtccaccgc aagcacatct tcgcccagaa
      661 tattgcagag tacatgcgtc ttctgatgga agaagatgaa gatgcatata aaaaacagtt
      721 ctcccagtac ataaagaatg gtgtcgcagc agatcagttg gaagacatat acaaaaaggc
      781 tcatgcaggt attcgggaga atccagtgca cgaaaagaaa cccaagaagg aagtcaaaaa
      841 gaagaggtgg aaccgtgcca aattgtctct ggaacagaaa aaggaccgtg ttgctcagaa
      901 aaaagccagt ttcctcagag ctcaggagaa ggcagacagc taaatggatt atgttgcgat
      961 tttctaagtg ttgtcagcaa aaaaat
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       DRMMTTRND     976 bp ds-DNA             ORG       15-JUN-1989
DEFINITION  D.mauritania Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase
            subunit 2 (3' end) cytochrome oxidase subunit 1 (5' end) genes.
ACCESSION   M18073
KEYWORDS    NADH dehydrogenase; cytochrome oxidase; transfer RNA-Cys;
            transfer RNA-Trp; transfer RNA-Tyr.
SOURCE      D.mauritania (adult, from Mauritius Island) mitochondrial DNA.
  ORGANISM  Mitochondrion Drosophila mauritania
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae; 
            Drosophila mauritiana.
REFERENCE   1  (bases 1 to 976)
  AUTHORS   Satta,Y., Ishiwa,H. and Chigusa,S.I.
  TITLE     Analysis of nucleotide substitutions of mitochondrial DNAs in
            Drosophila melanogaster and its sibling species
  JOURNAL   Mol. Biol. Evol. 4, 638-650 (1987)
  STANDARD  simple staff_entry
COMMENT     Draft entry and printed copy of sequence for [1] kindly provided by
            Y.Satta, 04-FEB-1988.
FEATURES       from  to/span     description
    ORF     <     1      461     NADH dehydrogenase subunit 2 (AA at 1; ta stop
                                 codon)
    ORF         668  >   976     cytochrome oxidase subunit 1 (gta start codon)
    tRNA        461      526     Trp-tRNA
    tRNA        580      519 (c) Cys-tRNA
    tRNA        666      601 (c) Tyr-tRNA
BASE COUNT      348 a    105 c     96 g    427 t
ORIGIN      Unreported.
        1 tctttaataa ttagagaatc aatttgactt atttattttt ttttttattc atttttatca
       61 tttgtattaa catttatatt taatattttt aaattatttc atttaaatca attattttct
      121 tgatttgtaa acagaaaaat tttaaaattt acattattta taaatttttt atcattagga
      181 ggattacctc catttttagg atttttacct aaatgacttg taattcaaca attaacatta
      241 tgtaatcaat attttttatt attattaata ataatatcaa ctttaattac attatttttt
      301 tatttacgaa tctgttattc tgcttttata ataaattatt tcgaaaataa ttgaatcata
      361 aagataaata taattagtag taatactaat atatatttaa ttataacttt tttttcaatt
      421 tttggattat ttataatttc tttattttat tttatatttt aaggctttaa gttaataaaa
      481 ctaataacct tcaaagctat aaataaagaa atttctttaa gccttagtaa aaattactcc
      541 ttcaaaattg cagtttgata tcattattga ctataagacc taataaaatt tgtccttatt
      601 tgattaagaa gaataaatct tataaataga tttacaatct atcgcctaaa cttcagccac
      661 ttaatcagta accgcgacaa tgattatttt ctacaaatca taaagatatt ggaactttat
      721 attttatttt tggagcttga gctggaatag ttggaacatc attaagaatt ttaattcgag
      781 ccgaattagg acatcctgga gcattaattg gagatgacca aatttataat gtaattgtaa
      841 ctgcacatgc ttttattata atttttttta tagttatacc tattataatt ggtggatttg
      901 gaaattgatt agtaccttta atattaggtg ctcctgatat agcattccca cgaataaata
      961 atataagatt ttgatt
//
LOCUS       DROMTTRND     975 bp ds-DNA             ORG       15-JUN-1989
DEFINITION  D.melanogaster Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase
            subunit 2 (3' end) cytochrome oxidase subunit 1 (5' end) genes.
ACCESSION   M18022
KEYWORDS    NADH dehydrogenase; cytochrome oxidase; transfer RNA-Cys;
            transfer RNA-Trp; transfer RNA-Tyr.
SOURCE      D.melanogaster (adult, from Sapporo, Japan) mitochondrial DNA.
  ORGANISM  Mitochondrion Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae; 
            Drosophila melanogaster.
REFERENCE   1  (bases 1 to 975)
  AUTHORS   Satta,Y., Ishiwa,H. and Chigusa,S.I.
  TITLE     Analysis of nucleotide substitutions of mitochondrial DNAs in
            Drosophila melanogaster and its sibling species
  JOURNAL   Mol. Biol. Evol. 4, 638-650 (1987)
  STANDARD  simple staff_entry
COMMENT     Draft entry and printed copy of sequence for [1] kindly provided by
            Y.Satta, 04-FEB-1988.
FEATURES       from  to/span     description
    ORF     <     1      461     NADH dehydrogenase subunit 2 (AA at 1; ta stop
                                 codon)
    ORF         667  >   975     cytochrome oxidase subunit 1 (ata start codon)
    tRNA        461      526     Trp-tRNA
    tRNA        580      519 (c) Cys-tRNA
    tRNA        665      600 (c) Tyr-tRNA
BASE COUNT      345 a    101 c     97 g    432 t
ORIGIN      Unreported.
        1 tctttaataa ttagagaatc aatttgatta atttattttt ttttttattc atttttatca
       61 tttgtattaa catttatatt taatattttt aaattatttc atttaaatca attattttct
      121 tgatttgtta atagaaaaat tttgaaattt acattattta taaatttttt atcattagga
      181 ggattacctc catttttagg atttttacca aaatgacttg taattcaaca attaacatta
      241 tgtaatcaat attttatatt aacaattata ataatatcaa ctttaattac attatttttt
      301 tatttacgaa tttgttattc cgcttttata ataaattatt ttgaaaataa ttgaatcata
      361 aagataaata taaatagtat taattataat atatatataa ttataacttt tttttcaatt
      421 tttggattat ttttaatttc tttattttat tttatatttt aaggctttaa gttaataaaa
      481 ctaataacct tcaaagctat aaataaagaa atttctttaa gccttagtaa aacttactcc
      541 ttcaaaattg cagtttgata tcattattga ctataagacc taattaattt gtccttattt
      601 gattaagaag aataaatctt atatatagat ttacaatcta tcgcctaaac ttcagccact
      661 taatcaataa tcgcgacaat gattattttc tacaaatcat aaagatattg gaactttata
      721 ttttattttt ggagcttgag ctggaatagt tggaacatct ttaagaattt taattcgagc
      781 tgaattagga catcctggag cattaattgg agatgatcaa atttataatg taattgtaac
      841 tgcacatgct tttattataa ttttttttat ggttatacct attataattg gtggatttgg
      901 aaattgatta gtgcctttaa tattaggtgc tcctgatata gcattcccac gaataaataa
      961 tataagattt tgact
//
LOCUS       DRSMTTRND     975 bp ds-DNA             ORG       15-JUN-1989
DEFINITION  D.simulans Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase
            subunit 2 (3' end) cytochrome oxidase subunit 1 (5' end) genes.
ACCESSION   M18072
KEYWORDS    NADH dehydrogenase; cytochrome oxidase; transfer RNA-Cys;
            transfer RNA-Trp; transfer RNA-Tyr.
SOURCE      D.simulans (adult from Nairobi, Kenya) mitochondrial DNA.
  ORGANISM  Mitochondrion Drosophila simulans
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae; 
            Drosophila simulans.
REFERENCE   1  (bases 1 to 975)
  AUTHORS   Satta,Y., Ishiwa,H. and Chigusa,S.I.
  TITLE     Analysis of nucleotide substitutions of mitochondrial DNAs in
            Drosophila melanogaster and its sibling species
  JOURNAL   Mol. Biol. Evol. 4, 638-650 (1987)
  STANDARD  simple staff_entry
COMMENT     Draft entry and printed copy of sequence for [1] kindly provided by
            Y.Satta, 04-FEB-1988.
FEATURES       from  to/span     description
    ORF     <     1      461     NADH dehydrogenase subunit 2 (AA at 1; ta stop
                                 codon)
    ORF         667  >   975     cytochrome oxidase subunit 1 (gtg start codon)
    tRNA        461      526     Trp-tRNA
    tRNA        580      519 (c) Cys-tRNA
    tRNA        665      600 (c) Tyr-tRNA
BASE COUNT      343 a    107 c    102 g    423 t
ORIGIN      Unreported.
        1 tctttaataa ttagagaatc agtttggttt atttattttt ttttttattc atttttatca
       61 tttgtattaa catttatatt taatattttt aaattatttc atttaaatca attattttct
      121 tgatttgtaa atagaaaaat tttaaaattt acattattta taaatttttt atcattggga
      181 ggattacccc catttttagg atttttacca aaatgacttg taatccaaca attaacatta
      241 tgtaatcaat attttttatt attaataata ataatatcaa ctttaattac attatttttt
      301 tatttacgaa tttgttattc tgcttttata ataaattatt tcgaaaataa ttgaatcata
      361 aagataaata taattagtat taatactaat atatatttaa ttataacttt tttttcaatt
      421 tttggattat ttataatttc tttattttat tttatatttt aaggctttaa gttaataaaa
      481 ctaataacct tcaaagctat aaataaagaa atttctttaa gccttagtaa aaattactcc
      541 ttcaaaattg cagtttgata tcattattga ctataagacc taataaaatt gtccttattt
      601 gattaagaag aataaatctt ataaatagat ttacaatcta tcgcctaaac ttcagccact
      661 taatcagtga ccgcgacaat gattattttc tacaaatcat aaagatattg gaactttata
      721 ttttatcttt ggagcttgag ctgggatagt cggaacatca ttaagaattt taattcgagc
      781 cgaattagga catcctggag cattaatcgg agatgatcaa atttataatg taattgtaac
      841 tgcacatgct tttattataa ttttttttat agttatacct attataattg gtggatttgg
      901 aaattgatta gtgcctttaa tattaggtgc ccctgatata gcattcccgc gaataaataa
      961 tataagattt tgatt
//
LOCUS       RICMTB1      2135 bp ds-DNA   circular  ORG       15-MAR-1989
DEFINITION  Oryza sativa (rice) Mitochondrial plasmid-like DNA (B1), complete
            sequence.
ACCESSION   D00293
KEYWORDS    cytoplasmic male sterility; mitochondrial DNA;
            mitochondrial plasmid.
SOURCE      Cytoplasmic male-sterile rice line (A-58CMS), mitochondrial
            plasmid-like DNA (B1).
  ORGANISM  Mitochondrion Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE   1  (bases 1 to 2135)
  AUTHORS   Shikanai,T., Yang,Z.-Q. and Yamada,Y.
  TITLE     Properties of the circular plasmid-like DNA B1 from mitochondria of
            cytoplasmic male-sterile rice
  JOURNAL   Plant Cell Physiol. 28, 1243-1251 (1987)
  STANDARD  full staff_entry
COMMENT     The complete nucleotide sequence of B1 (2,135bp) which is a
            plasmid-like minicircular DNA in the mitochondria of Oryza sativa
            A-58CMS was found to contain sequences homologous to those in the
            1,913bp plasmid-like DNA of maize, and therefore, this sequence may
            have originated from the nuclear genome.  The number 2014 described
            under the result section as the beginning of ORF seems to be wrong
            number. According to the sequence map, correct number should be
            2019.  corresponds to DDBJ (rel4.0) entry <RICMTB1>(D00293) dated
            ORG
FEATURES       from  to/span     description
    ORF        1793     2035     open reading frame
    ORF        2019     1732 (c) open reading frame (comp strand)
BASE COUNT      545 a    494 c    449 g    647 t
ORIGIN      3 bp downstream of EcoRI site.
        1 ttcaggattg gctaaggctc tagccattaa atgtgactgg ggttgatcca agggctggtc
       61 tcatggtttt gactaggtgt ttctcaaatt tctgatcata tttcctgtcg ttataacccc
      121 caaaatacgc taactcgcag ctcgaagttc ccagttacat gggcaaccct actcactata
      181 catagggtgt gaagagacgg acaacaatag tctttctctg ctttcactcg agccaaacta
      241 agtgttttct aggattggcc ccaaagccat actatggtca caacggtaac cccctcgatt
      301 tagtttatcc aatcaatcga gaggcagcat ctctatcaat tactcggatc ggcaacctcg
      361 gttctggaat gtggacttca ctgaccagcc gcctcttagt ccacacccaa cttggaagtg
      421 ttattgattc gttagtaacc ttcgccatcc taatacccga gactgtagtg ttctaagttg
      481 gttaaagccg tcgtctcctg ggatgcccgg ctggtcgggc aagtattagt gaaacggctc
      541 cgcgcaatca ataactaccc tgcttaggta tgagcttcca agtcccgaat ccttttgtct
      601 caagtctgcc ttggtaatgt gaccacacat cctttgatga tggaagtgtt gtttcccaac
      661 aatccacccc ggaatcctag ttggagtaat gtatccctac atactatcgc gctaatgtaa
      721 tcgggttctc agatcatact caatgagggg tggctagtat cccgaaactt tgaagaacat
      781 gagttaagta gttcattcac ttggtttcgc caccagtcaa aacgccctaa aacccagtag
      841 gttttctcga tcaccattct acgattttat ttccacctgt tttagaacac ataaattgtg
      901 ctacccttaa gagtttatca ctaaaatcaa gaactttatt tatgacctgt aagttgcctc
      961 ataaaaagta gatgttagta cccgactaaa aggagggaat caaataaaag gtaaagctct
     1021 caattaaaat aaaaagctat cgtttacttt gttaagggta tattgcccgt agcgaggtta
     1081 aggtaaaccc ttgaccgagt gaaggcgagc ccggtaaagc actaagtgta ttggctagct
     1141 tgtactttat tagctatcaa gtgtttgatg taccaattac tttattgtgt caactagttt
     1201 attgaatgca aaggtaattc cctcgataat tgatttaatt atcaagaaag agaattagtt
     1261 accctccttc cctcgagctg agctatttct tgtgtattca atcgtatgca cgagggctgt
     1321 cctctcctct gctctatctc ccggacaggt atgaaagtcg taattctttg ggtttttgta
     1381 gtcaatggaa tgagcggagg gagacacatc cttccatcca aaggtgtgtg tgtcccgtag
     1441 cgagtggaat ggctggttcg gtctagtcgt atagaacggg ttgagttctt tttactgagt
     1501 cctttcgagg ctggaagtgt ttcttacagt ttccttttcc gaagggaaag tcaagtagta
     1561 taaactagta ctaagtccaa gaagaagggg tgaaaagtga ggagggcgat tagcccgatc
     1621 cacttgaggg gaggaaggcc ttccttccct actaatagta ccaattcata attcctattc
     1681 gtaataaatt catatttatt tggaaaaata tgaaaactaa ttatctttat tctagttccc
     1741 taaaaccctt tccgtgccac tcctttctgt tccactcatc caacattgtt ggatgccaaa
     1801 gactttgacg aagactttcg tacctgtatt tctgtccggt atgacgtcgc tttcgtgtcg
     1861 gcatagcgtg ccccagttgc cacagttgct aggctgttca gagtcctctc ggaccccggc
     1921 taggcatcct cccggtaatg ccaggggctg cttcaagaaa cgggtgatgt gggttccaat
     1981 cattcttggt ttggttcctc ggcaccaagc caatcccatt cctttccgct ggtgaccgcc
     2041 ttcttgttgc ctaccttttt aacccctgta gtaaccccaa gtccaagtac aaggcatacg
     2101 tgctccttta atatgcttgc tcctcttttc ttgaa
//
LOCUS       TOBCPNPLR      93 bp ds-DNA             ORG       15-MAR-1990
DEFINITION  Tobacco chloroplast 23S rDNA and lincomycin resistant mutants.
ACCESSION   X13526
KEYWORDS    23S ribosomal RNA; antibiotic resistance; lincomycin resistance;
            ribosomal RNA.
SOURCE      tobacco (Chloroplast Nicotiana plumbaginifolia).
  ORGANISM  Chloroplast Nicotiana plumbaginifolia
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae; Nicotiana plumbaginifolia.
REFERENCE   1  (bases 1 to 93)
  AUTHORS   Cseploe,A., Etzold,T., Schell,J. and Schreier,P.H.
  TITLE     Point mutations in the 23 S rRNA genes of four lincomycin resistant
            Nicotiana plumbaginifolia mutants could provide new selectable
            markers for chloroplast transformation
  JOURNAL   Mol. Gen. Genet. 214, 295-299 (1988)
  STANDARD  simple automatic
COMMENT        EMBL features not translated to GenBank features:
               key        from     to       description
            
               MUTANT       22     22       g is a in LR400
               MUTANT       48     48       a is g in LR415
               MUTANT       49     49       a is g in LR421
               MUTANT       49     49       a is g in LR446
FEATURES       from  to/span     description
    rRNA    <     1  >    93     23S ribosomal RNA
BASE COUNT       23 a     22 c     25 g     23 t
ORIGIN      
        1 tgaaatagac atgtctgtga agatgcggac tacctgcacc tggacagaaa gaccctatga
       61 agcttcactg ttccctggga ttggctttgg gcc
//
LOCUS       VFACPTG      1409 bp ds-DNA             ORG       15-SEP-1989
DEFINITION  Broad bean (V.faba) chloroplast Leu-tRNA-CAA, Leu-tRNA-UAA and
            Phe-tRNA genes.
ACCESSION   M23771
KEYWORDS    transfer RNA-Leu; transfer RNA-Phe.
SOURCE      Broad bean (V.faba) chloroplast DNA, clone Bam6a.
  ORGANISM  Chloroplast Vicia faba
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea; Vicia faba.
REFERENCE   1  (bases 1 to 1409)
  AUTHORS   Bonnard,G., Weil,J.-H. and Steinmetz,A.
  TITLE     The intergenic region between the Vicia faba chloroplast
            Leu-tRNA-CAA and Leu-tRNA-UAA genes contains a partial copy of the
            split Leu-tRNA-UAA gene
  JOURNAL   Curr. Genet. 9, 417-422 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    tRNA         15       95     Leu-tRNA-CAA
    tRNA        539      573     Leu-tRNA-TAA, exon 1
               1025     1074     Leu-tRNA-TAA, exon 2
    tRNA       1185     1257     Phe-tRNA
BASE COUNT      506 a    240 c    238 g    425 t
ORIGIN      1 bp upstream of BglII site.
        1 agatctcaat attatgcctt gaagaggact cgaacctcca cgctgtttag cacgagattt
       61 tgagtctcgc gtgtctacca tttcaccacc aaggcatcct taaagggaat catattctct
      121 gaatatgata tctatcttat gtcatggttc aatattatta tactccaatt ctaatccaat
      181 tgcaacatag aatacaacat ctcaaataga atacaacatc tcaaatagaa tacggtacga
      241 tataatacag tatctactat ataatacagc gagaatagaa ttctttctcg actaaataaa
      301 tatagcttct agatgaacga aaagctagtc aaagggtcga ttcaaaattg gtagacgcta
      361 cggacttaat tgtattgagc tttggtatgg aaacatagta atgttaagtg aaaacataac
      421 ataatataat acagcatcta ctatataata caacatgtac tatataatac agcgagaaga
      481 gaattctttc tcgactaaat aaatatagct tctagatgaa cgaaaagcaa gtcaaacggg
      541 ggatatggcg aaattggtag acgctacgga cttaattgta ttgagccttg gtatggaaac
      601 atattaagtg aaaactttca aattcagaga aaccctagaa ttttaaatgg gcaatcctga
      661 gccaaatcct tctttccgaa aacaaaaaac taaaagttca gaaaaaaagg ataggtgcag
      721 agactcaatg gaagctattc taacaaatgg agttgacaac attcaattga ttaatgaaga
      781 tttctaactt ctatttgtaa attgtaacta ttttttttag ttttgattcg atcacaattg
      841 aaacgaaaca ttagaatcaa atcaattcca actggaagaa aaaatggtga atattcattg
      901 atcaaatcag tcactccacc aaaatatgat ggatcttttg aataactgat taatcagacg
      961 agaataaaga tagagtccca ttctacatgt caataccgac atcaatgaaa gttttagtaa
     1021 tcggaaaatc cgtcgacttt aaaaatcgtg agggttcaag tccctctatc cccaaaagcc
     1081 gattgaattt gctcattttt atggcccttt cccttttttt tattgagttg acatagactc
     1141 aagtaatttc cctcaaatta ggatggtgcg tcaagatttt aatggtcggg atagctcagc
     1201 cggtagagca aaggactgaa aatccttgtg tcaccagttc aaatctggtt cccggcgact
     1261 cattatgtat gagtatctat tcccatattt cttttcaaaa atcaaatatt gggaggaata
     1321 gatacatatt gattagtggt cttaataatt atacataatt tatctaggcg tacagagata
     1381 tgctctttct agatgaaaaa ggaatcgat
//
LOCUS       WHTCPTAREP    781 bp ds-DNA             ORG       15-MAR-1990
DEFINITION  Wheat chloroplast dispersed repeat (9) DNA for nonfunctional rpl23'
            homologue segment (located in large single-copy region LSCR).
ACCESSION   X12849
KEYWORDS    pseudogene; rbcL gene; repetitive sequence; ribosomal protein;
            ribosomal protein L23; ribulose bisphosphate carboxylase.
SOURCE      wheat (Chloroplast Triticum aestivum).
  ORGANISM  Chloroplast Triticum aestivum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Triticum aestivum.
REFERENCE   1  (bases 1 to 781)
  AUTHORS   Bowman,C.M., Barker,R.F. and Dyer,T.A.
  TITLE     In wheat ctDNA, segments of ribosomal protein genes are dispersed
            repeats, probably conserved by nonreciprocal recombination
  JOURNAL   Curr. Genet. 14, 127-136 (1988)
  STANDARD  simple automatic
COMMENT     *source: clone=pTacE8.  see x12850 for repeat 9 in functional rpl23
            gene; see x12848 for maize chloroplast repeat 9 in rpl23' homologue
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               INVREP      214    228       inverted repeat A
               INVREP      234    249       inverted repeat A'
               RPT         268    276       direct repeat 1
               RPT         297    301       direct repeat 2
               RPT         302    306       direct repeat 2
               RPT         313    318       direct repeat 3
               SITE        377    678       rpl23' nonfunctional segment
               INVREP      377    384       inverted repeat B
               INVREP      385    392       inverted repeat B'
               RPT         606    614       direct repeat 1
               RPT         631    635       direct repeat 2
               RPT         636    640       direct repeat 2
               RPT         645    650       direct repeat 3
FEATURES       from  to/span     description
    pept    <     1       93     rbcL gene product (AA at 1)
BASE COUNT      302 a    105 c    141 g    233 t
ORIGIN      
        1 gcttgcaaat ggagtcctga actagccgca gcttgtgaag tatggaaggc gatcaaattc
       61 gagttcgagc cggtagatac tattgataag tagataaaac taaagataaa gaagagaagg
      121 tataaataaa aaagaaacga aataaaaaga gaaaaaaata agttatgaaa tgcagtaatc
      181 ttctttattc ttctaattga ttgcaattaa actcggctca atcttttttt tctaaaaaag
      241 attgagccga ataaaaatag atcatgatat gatcatgaga cttgacaaat cgagattcgt
      301 ctattctata tatctagaat atatatatta aggtataata caataatgaa atacaaataa
      361 aataataaaa tatagtatta tcatatgata attgaatcaa atacgcagta tttactgaaa
      421 taagtcttcg tttattgggt ttgaatcaat atacttttaa tgtcgaatgg gattcactaa
      481 gactgaaata aagcattggg tcgtactctt ctttggtgtt aaggtggtag ctgtgtatag
      541 ccatcgacta cctagaaagg taaaagaata ggacctattc tgggccatac aatgcattac
      601 agacgtatga tcattaccct tcaaccgggt tattctattc cacttctaga tagagaaaaa
      661 aactaaagga gaatgaatga aaaaagacat agtttggaag ttagaccttt ttataggact
      721 ctctttcaat ttcaaaaaag aggacgtttg aaacttttaa caggcgtaat cgtgagtcaa
      781 c
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       ACYFIMBA     1850 bp ds-DNA             BCT       13-FEB-1990
DEFINITION  A.viscosus fimbrial structural protein type 1 subunit gene,
            complete cds.
ACCESSION   M32067
KEYWORDS    fimbrial structural protein.
SOURCE      A.viscosus (strain T14V) DNA, clone MY3833.
  ORGANISM  Actinomyces viscosus
            Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE   1  (bases 31 to 1850)
  AUTHORS   Yeung,M.K. and Cisar,J.O.
  TITLE     Sequence homology between the subunits of two immunologically and
            functionally distinct types of Actinomyces spp
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 30)
  AUTHORS   Yeung,M.K.
  JOURNAL   Unpublished (1990) Univ. of Texas, San Antonio, Texas 78284
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.K.Yeung, 12-FEB-1990.
FEATURES       from  to/span     description
    pept        124     1725     fimbrial structural protein type 1 subunit
                                 precursor
    sigp        124      213     fimbrial structural protein type 1 subunit
                                 signal peptide
    matp        214     1722     fimbrial structural protein type 1 subunit
BASE COUNT      395 a    714 c    515 g    226 t
ORIGIN      
        1 caagcttgca tgcctgcagg tcgactctag aggaacgcga gaagaaccca cgactccgct
       61 ccgtcgcgcg cgggtcaccc cttctatccc tcattcatcc caacaacaca ggagagaaca
      121 gcaatgcact ccctcaacac gcgccggggc ctcggcctcg ccgctgcgat gacgctcgcc
      181 gccggcgcgc tcgtcgcccc gaccggcgcg gcggcacccg ctgacccgaa cggctccacc
      241 atcgaccccg acgcggccac cacgctgacc gtccacaagt gcgagcagac cgacaccaac
      301 ggcgtcaagg aggggaccgg taacgaggac ccccaggccg agtgcaagcc cgtctccgac
      361 gtcgagttca ccatcaccaa gctgaacgtg gacctgacga cctacgacgg ctggaagacc
      421 ctcgcggacc tcaagggtga cgtggtcaag gccggtgccc tcaagagcac caccgtccag
      481 aagatcacca ccggtgccaa cggtctggcc tccttcaccg acgcccagac cgaggtcggc
      541 gcctacctcg tcagcgagac gcgcaccccc gacaaggtca tcccggccga ggacttcgtc
      601 gtcaccctgc cgatgaccaa cccgcaggac accgccaagt ggaactacaa cgtccacgtc
      661 taccccaaga acaccctctc cggtgtggac aagcaggtga ccgacaagcc ggcccccggc
      721 tccgggcgcg acatcaccta caccatcacc acctccatcc cgaaggtgga ctaccccggc
      781 ggtgcgcgca tcaagcgcta cgaggtcgtt gaccgcctcg acaagcgcat caagaaggaa
      841 gccctgaccc cggtcgtcaa gatcgtcggt cagaacgagg tgaccctggc ggagaccacc
      901 gactacaccc tcatcactgc cgagggtaag gaccacaact gggccaccat ccagctcacc
      961 gaggagggcc gccgcaaggc ctccgaggcg cgctacaacg gcaacggcga gaccaagctc
     1021 caggtgaccc tgaacgccaa gttcgacgcc gccgtcaacc tcgagggcga cctgtccaac
     1081 accgcgggcc tcatccccaa cgacagcccc aacttcacct gggacccgaa caaccccggc
     1141 accaccacgg acatccccgg catccccacc acccccgtgc tctccaagta cggcaaggtg
     1201 gttctcacca agaccggtac ggacgacctg gccgacaaga ccaagtacaa cggcgcccag
     1261 ttccaggtct acgagtgcac caagaccgcc agcggtgcca cgctgcgtga ctccgacccc
     1321 agcacccaga ccgtcgaccc gctgaccatc ggtggggaga agaccttcac caccgccggc
     1381 cagggcaccg tggagatcaa ctacctgcgc gccaacgact acgtcaacgg tgcgaagaag
     1441 gaccagctga ccgacgagga ctactactgc ctcgtggaga ccaaggcccc cgagggctac
     1501 aacctccagg ccgacccgct ccccttccgg gtcctggctg agaaggccga gaagaaggcc
     1561 gcgaccgagg tgaccgtcac cgacatcccg aagaacgccg gcttccgcct gccgctgacc
     1621 ggtgccaacg gcgtcatctt cctgaccatc gccggcgccc tgctggtggc cggtggcgcg
     1681 gtggtcgcct acgccaacaa gcgccgtcac gttgccaagc actgacagcc tctggcagcc
     1741 actggcagcc actgaggtga ccggccccag ccggccgtca tcacgctgac tgcttccttg
     1801 tgggggaggc gggagaaccc gtctccccca cacgggcgtc gcagggatcc
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       BOVTCRA19     123 bp ss-mRNA            MAM       15-MAR-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90028
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA41.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 123)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Department of Veterinary Public Health Obihiro University
            of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
            080
FEATURES       from  to/span     description
    pept    <     1  >   123     T-cell receptor alpha chain, variable joining
                                 region. (AA at 1)
BASE COUNT       23 a     34 c     32 g     34 t
ORIGIN      
        1 tttttccttt ttgcggacca cagactcagg cacttacttc tgtgctgtgc agctggaagg
       61 ggggtgctgc caaccagctc atctttggga ctggcacgct gctttctgtc aagccaagca
      121 caa
//
LOCUS       CATSIS1      1125 bp ds-DNA             MAM       15-SEP-1989
DEFINITION  Feline c-sis proto-oncogene, segment 1.
ACCESSION   M25353
KEYWORDS    proto-oncogene.
SEGMENT     1 of 7
SOURCE      Feline placenta DNA, clone pPHS1.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 1125)
  AUTHORS   Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
            Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
  TITLE     Genetic organization of the c-sis transcription unit
  JOURNAL   Nucleic Acids Res. 15, 959-970 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept       1043  +  1105     c-sis proto-oncogene
    pre-msg      51  >  1125     c-sis, mRNA and introns
    IVS        1106  >  1125     c-sis, intron A
BASE COUNT      171 a    430 c    362 g    162 t
ORIGIN      780 bp upstream of BamHI site.
        1 tagcactctc ccttctcctt tataaaggcc ggaacagctg aaagggtggc aacttctcct
       61 cctgccgccg ggagcagccc gcctgtctcc ccgcgcgccc gcagcctccc ccgctgcctc
      121 cctgagggct cccctctggc cgccagcgcc catctttcat ttccgagatt gcgatatttt
      181 gcgcacacac gcatacacac acgcgcaaaa agggggggaa aaaagggccc accctccagc
      241 ctcgctgcaa agagaagccg gagcagccgc agctcacagc ccagaggacg cccagagcgg
      301 cagcgcgcag acagacggac cgacggactc cgccgcgtcc acctgtctgc ccggcccggc
      361 actgcgcgca cgggcacggc cgagcgcgga gcagccgtgc ccgccgcccg ggcccgccaa
      421 ggcgcacacg ctccggcccc ccccacccgg cccgggcggg agtttgcacc tctccctgcc
      481 cgggtactcg ggccgccgct gcaaagccaa ctttggaaaa agtttttggg gggagacttt
      541 ggccttgagg tgcccggctc tgcactttcc gacttcgggg gcctttctag aaaatgttgc
      601 aaaaaagcta agccagcggg cagaggaaaa cgcctatagc cggcgagtga agacgaacca
      661 tcggctgccg tgttcttttt cctcttggag gttggagtcc cctgggcgcc cccacacggc
      721 tagacgcctc ggctggttcg cgacgcagac ccccggccgt ggatgctcgc tcgggctcgg
      781 gatccgccca ggtagcggcc ggaccctggt cccgcgccca ggccctcccc agccccccaa
      841 cgacggagcc ggggccgggg gcggcggcgc cgggggccat gcgggtgagc cgggcggcgc
      901 ctgcagcggc ctgagcgcct gatcgccgca gacccgagcc gagcccacct ccttccccag
      961 cccccgcccc ccaccctggc cgcgggggcg gcgcgctcgg tccacgcgtc cggggccccg
     1021 cggggccggg cccggagtcg gcatgaatcg ctgctgggcg ctcttcctgt ctctctgctg
     1081 ctacctgcgt ctggtcagcg ccgaggtgag ttgcgacggc ggcgg
//
LOCUS       CATSIS2       137 bp ds-DNA             MAM       15-SEP-1989
DEFINITION  Feline c-sis proto-oncogene, segment 2.
ACCESSION   M25354
KEYWORDS    proto-oncogene.
SEGMENT     2 of 7
SOURCE      Feline placenta DNA, clone pPHS1.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 137)
  AUTHORS   Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
            Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
  TITLE     Genetic organization of the c-sis transcription unit
  JOURNAL   Nucleic Acids Res. 15, 959-970 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    21  +   117     c-sis proto-oncogene
    pre-msg <     1  >   137     c-sis, mRNA and introns
    IVS     <     1       20     c-sis, intron A
    IVS         118  >   137     c-sis, intron B
BASE COUNT       29 a     44 c     34 g     30 t
ORIGIN      About 7.5 kb after segment 1.
        1 ccatggtgtc tccctggcag ggggacccca ttcccgagga actctacaag atgctgagtg
       61 accactcaat ccgctccttc gatgacctcc agcgcctgct acacggagac tccgtaggta
      121 aattgaacct tgttcgg
//
LOCUS       CATSIS3       142 bp ds-DNA             MAM       15-SEP-1989
DEFINITION  Feline c-sis proto-oncogene, segment 3.
ACCESSION   M25355
KEYWORDS    proto-oncogene.
SEGMENT     3 of 7
SOURCE      Feline placenta DNA, clone pPHS1.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 142)
  AUTHORS   Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
            Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
  TITLE     Genetic organization of the c-sis transcription unit
  JOURNAL   Nucleic Acids Res. 15, 959-970 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    21  +   122     c-sis proto-oncogene
    pre-msg <     1  >   142     c-sis, mRNA and introns
    IVS     <     1       20     c-sis, intron B
    IVS         123  >   142     c-sis, intron C
BASE COUNT       31 a     35 c     45 g     31 t
ORIGIN      About 2.4 kb after segment 2.
        1 tgccttgcct tgccttgcag acgaagacag agccgagttg gacctgaatt cgactcgatc
       61 ccattgtgga ggcgagctgg agagcttatc ccgagggaga aggagcctag gtgaggctgc
      121 aggtaagaat cctcctttct cg
//
LOCUS       CATSIS4       246 bp ds-DNA             MAM       15-SEP-1989
DEFINITION  Feline c-sis proto-oncogene, segment 4.
ACCESSION   M25356
KEYWORDS    proto-oncogene.
SEGMENT     4 of 7
SOURCE      Feline placenta DNA, clone pPHS1.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 246)
  AUTHORS   Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
            Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
  TITLE     Genetic organization of the c-sis transcription unit
  JOURNAL   Nucleic Acids Res. 15, 959-970 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    21  +   226     c-sis proto-oncogene
    pre-msg <     1  >   246     c-sis, mRNA and introns
    IVS     <     1       20     c-sis, intron C
    IVS         227  >   246     c-sis, intron D
BASE COUNT       37 a     90 c     80 g     39 t
ORIGIN      About 1.5 kb after segment 3.
        1 tgaggccccc ctcccttcag gttccccgac ggtcgccgag ccagccatga tcgccgagtg
       61 caagacacgc acggaggtgt tcgaggtgtc ccggcgcctc atagaccgca ccaacgccaa
      121 cttcctggtg tggccgccct gcgtggaggt gcagcgctgc tccggctgct gtaacaaccg
      181 caacgtgcag tgccgtccca cccaggtgca gctgcggctt gtccaggtgc gcgggggggc
      241 ccgacc
//
LOCUS       CATSIS5       185 bp ds-DNA             MAM       15-SEP-1989
DEFINITION  Feline c-sis proto-oncogene, segment 5.
ACCESSION   M25357
KEYWORDS    proto-oncogene.
SEGMENT     5 of 7
SOURCE      Feline placenta DNA, clone pPHS1.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 185)
  AUTHORS   Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
            Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
  TITLE     Genetic organization of the c-sis transcription unit
  JOURNAL   Nucleic Acids Res. 15, 959-970 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    21  +   165     c-sis proto-oncogene
    pre-msg <     1  >   185     c-sis, mRNA and introns
    IVS     <     1       20     c-sis, intron D
    IVS         166  >   185     c-sis, intron E
BASE COUNT       40 a     47 c     67 g     31 t
ORIGIN      About 1.4 kb after segment 4.
        1 tcgatggttc tgtcctccag gtgagaaaaa tcgagattgt gcggaagagg ccagtcttta
       61 agaaggccac ggtgaccctg gaggaccacc tggcgtgcaa gtgtgagacg gtggtggctg
      121 cacgacccgt gacccgaagc ccggggagct cccaagagca gcgaggtaag cgctggtccg
      181 gggtc
//
LOCUS       CATSIS6       187 bp ds-DNA             MAM       15-SEP-1989
DEFINITION  Feline c-sis proto-oncogene, segment 6.
ACCESSION   M25358
KEYWORDS    proto-oncogene.
SEGMENT     6 of 7
SOURCE      Feline placenta DNA, clone pPHS1.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 187)
  AUTHORS   Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
            Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
  TITLE     Genetic organization of the c-sis transcription unit
  JOURNAL   Nucleic Acids Res. 15, 959-970 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    21       40     c-sis proto-oncogene
    pre-msg <     1  >   187     c-sis, mRNA and introns
    IVS     <     1       20     c-sis, intron E
    IVS         168  >   187     c-sis, intron F
BASE COUNT       44 a     57 c     64 g     22 t
ORIGIN      About 2.1 kb after segment 6.
        1 tctttctctc gagcagccag gacaccccag actcgggtga ccattcggac ggtgcgagtc
       61 cgccggcccc ccaaggggaa gcaccagaag ttcaagcaca cgcatgacaa gaaggcactg
      121 aaggagaccc tcggagccta ggggcatctg caggagcgtg cgggcaggtg aggccaggcg
      181 gggaacc
//
LOCUS       CATSIS7      1643 bp ds-DNA             MAM       15-SEP-1989
DEFINITION  Feline c-sis proto-oncogene, segment 7.
ACCESSION   M25359
KEYWORDS    proto-oncogene.
SEGMENT     7 of 7
SOURCE      Feline placenta DNA, clone pPHS1.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 1643)
  AUTHORS   Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A.,
            Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M.
  TITLE     Genetic organization of the c-sis transcription unit
  JOURNAL   Nucleic Acids Res. 15, 959-970 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pre-msg <     1     1623     c-sis proto-oncogene, mRNA and introns
    IVS     <     1       20     c-sis proto-oncogene, intron F
BASE COUNT      310 a    513 c    437 g    383 t
ORIGIN      About 300 bp after segment 6.
        1 tccctctggt tggcctacag ggttatttaa tacggtattt gctgtatcgc ccccatgggg
       61 tccttggagt gataatattg ttcccctcgt ccgtctgtct cgatgcctga ttcggacggc
      121 caatggtgct tccccccctc ccacgcgcct gtccacccac cagcgggtcc cctcattggc
      181 ctccagcgcc ttgcccggga gctcgagaag gacacgaagg acctgaactc cactgctgtc
      241 ttcctccgct gaccccgaga acgtggcata gaagtgtaag agagacccat gggatcgccg
      301 ttcggggacc ggaggagctt cccttcccca gctcctggcc cgggtcacac ctgagcgccg
      361 tggactggcc tgaaaccctg cacgtggccc gcccggtccc aggacccctg gcccgctctg
      421 aagggagcac ccccaggcag gccagggtgc ctcgtcctcc tgtggctggg accacgcgtt
      481 ggaccgtaga ctggagagaa atcctcccac ggcgcccagg cccagtcact tctttctccc
      541 cggtcacctc tgcttatagt ggcttccttt cgttttctat atatgaaatc ttcgaagatg
      601 tggactcctc tgggtgggcg tggcccgagt acccggcagc caagtgcccc gtcagatggg
      661 ttagagatga agtttgctct ggagccacca tggatggtga tctgggcgct cactgcctcc
      721 ttccactccc ctgccttcac ctcctctgtt tcatctctct acctccaccc tgcgtcttcc
      781 tcttgtcccg gccttcagtc cgctccaccg agggactcct tggaccccgt gctgagaccg
      841 taaataaccc cactcactgc tccattaggc cgaagaccag gggtcagggc agcagggggc
      901 ctgcgcttcg tatcccagcc cggccgagac tgccgtgtag gttcgtgcag ggtgtgcact
      961 gtgcaaggca tgcgtgcagg ggtcgttgtt catgccgtag aacatggccc gctggtgtat
     1021 ttgttaggac agttttctgg cagatggagc taaaatgtct tgagcaaagg gatcctgttc
     1081 ctaatcgaca tgaaggttcc atgtggactg gctgtgtccc gacccagcca atggcttggg
     1141 gtagttggac aggagagtga gcccacgttg ggtggggacg ccacgcgttg tccagctggc
     1201 ctctattttc ctgagtcctc agctcaagca gttccccttc caggcgggcg tgaaaaaccc
     1261 aagagagaat ctccaaggga gggggcagcc ctcgctctcc ccgggcacgc taccccccag
     1321 agtagattct tttgagatcc gcccgtggtg gctcctccca ggaaaccagc tcacgggctg
     1381 ggagtggggg agaaaaggga aaagatcccc aggggtccct ggggtggggt ctgagctccc
     1441 acctccctcc cctccacccc actgcacttt cccccttacc ccccccctcc aaaatctgct
     1501 ctcttcagtt tgtaaagttg gtgattatat ttttgggggc tttcctttta ttttttaaat
     1561 gtaaatttat ttatattccg tatttaaagt tgtaaaaaaa aaaaaataac cacaaacaaa
     1621 accaaatgaa tgaatccacc ggc
//
LOCUS       PIGMDHA      1080 bp ss-mRNA            MAM       15-MAR-1990
DEFINITION  Pig malate dehydrogenase mRNA, 3' end.
ACCESSION   M29463
KEYWORDS    cytosolic malate dehydrogenase; malate dehydrogenase.
SOURCE      Pig liver, cDNA to mRNA, clone ppcMDH-1.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1080)
  AUTHORS   Joh,T., Takeshima,H., Tsuzuki,T., Setoyama,C., Shimada,K.,
            Tanase,S., Kuramitsu,S., Kagamiyama,H. and Morino,Y.
  TITLE     Cloning and sequence analysis of cDNAs encoding mammalian cytosolic
            malate dehydrogenase
  JOURNAL   J. Biol. Chem. 262, 15127-15131 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      870     malate dehydrogenase (AA at 1; EC 1.1.1.37)
BASE COUNT      320 a    232 c    240 g    288 t
ORIGIN      
        1 atgatgggtg tcctggatgg tgtcctgatg gagctgcaag actgtgccct tcccctcctg
       61 aaagatgtca ttgcaacaga taaagaagag attgccttca aagacctgga cgttgccatt
      121 ctcgtgggtt ccatgccaag aagggatggc atggagagga aagatttact caaagcaaat
      181 gtgaaaatct tcaaatgcca gggtgctgcc ttggacaaat atgctaagaa gtcagttaag
      241 gttatcgtgg tgggaaaccc agccaatacc aactgcctga ctgcctccaa gtcggctcca
      301 tccattccca aggagaactt cagctgcttg actcgtttgg atcacaaccg agctaaagca
      361 cagattgctc tcaaacttgg tgtgacttct gatgatgtca agaatgtcat catctgggga
      421 aaccattcct caactcagta tccagatgtc aaccatgcca aggtgaaact gcaggcaaag
      481 gaagttggtg tttatgaagc tgtgaaggat gacagctggc tcaagggaga gttcatcacg
      541 actgtgcagc agcgcggtgc tgctgtcatc aaggctcaga aactgtccag tgcaatgtct
      601 gcggcaaaag ccatttgtga ccacgtcaga gacatctggt ttggaacccc agagggagag
      661 tttgtgtcca tgggcattat ctctgatggc aactcctatg gtgttcctga tgatctgctc
      721 tactcattcc ctgttacaat caaggataag acctggaaaa ttgttgaagg tctccctatt
      781 aatgatttct ctcgtgagaa gatggacctc actgcaaagg aactagcaga agaaaaagaa
      841 actgcatttg aatttctttc ctctgcctga ttaaacaatc atttcaatgt actaaatgcc
      901 ccaaagctga aaaatctaaa tgtccttttt gaccctagta ccaaataata ataatgctaa
      961 ccttaaatta ctagtgaaaa ataacacact ttaaagattg tgtgcttctt ggtacaaatt
     1021 tgtgacagtt tatcatcatg ctgttagtgc cacattctaa aataaatata tatccaaatg
//
LOCUS       PIGUFG        996 bp ds-DNA             MAM       15-MAR-1990
DEFINITION  Pig uteroferrin gene, 5' end.
ACCESSION   M30283
KEYWORDS    iron-transport protein; transport protein; uteroferrin.
SOURCE      Pig adult liver DNA.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 996)
  AUTHORS   Simmen,R.C.M., Srinivas,V. and Roberts,R.M.
  TITLE     cDNA sequence, gene organization, and progesterone induction of
            mRNA for uteroferrin, a porcine uterine iron transport protein
  JOURNAL   DNA 8, 543-554 (1989)
  STANDARD  simple staff_review
COMMENT     See GenBank entry M30284 for sequence of uteroferrin mRNA.
FEATURES       from  to/span     description
    pept        887  >   996     uteroferrin precursor, exon 2 (first expressed
                                 exon)
    sigp        887      946     uteroferrin signal peptide
    matp        947  >   996     uteroferrin
    pre-msg     651  >   996     UF mRNA and introns
    IVS         745      886     UF intron A
BASE COUNT      239 a    333 c    265 g    159 t
ORIGIN      
        1 aatgggggtc aggcaggact tcctggaaaa agggagcccc agctgtttgg gggaggggag
       61 gtgaccctgg gacactggct gggatgttgt caaacacagc tcacacacag gacacacagc
      121 tcagctgaga aacacaacag agacacacaa cttgggcaca aaacaggcac acacaagtcg
      181 ccttcccccc tacacacagt tccacttgag gactcaggtg cacgtcacag acatgccatg
      241 caggccttgc cagctgagta tacacagcca ctggaacact ggctcccagg ctcacaacca
      301 acatgtgact gaggcaggca cgcttggcta cacgcagaaa acagctcaga cacacccaga
      361 cacaggcaga caaacaccac ctctgagaac accgacacac aggaggtgcc tggcccgggc
      421 attcggacac agctgcctac acagcacaca cgggccccga gacacgaact gctcctgtgg
      481 gcagacgcag gggctgtggg gcaggcgcag ctgcactggt gcaccagaca gccacctcaa
      541 atcccgtgtc cagcgctggg gaactaactc caaggctcac atgacccgag gggagggctt
      601 ttggaacagt tggcggagaa actgcatcat cctcactatg atcatttccg acccagggaa
      661 taaaggctca gtgaccgcca gttctgcact ggagctcaac ctgcctctct gagcccccag
      721 cgaccggctc aagcctcccc tagggtgagt gctgggccct gcatgaccac cctttcccct
      781 aagccccatg gctggaccct atcctcccac taaccctggg gataaggggg ggtgggtctc
      841 gggtgcctgc tgcccacatc ccagacactt gtctctcctc tcccagatgg acacgtggac
      901 ggtactgctc atcctgcaag cctccctggt gctccccggg gccgtgggca ccagaaccaa
      961 caccaggact gcccccaccc ccatcctgcg ttttgt
//
LOCUS       PIGUFMR      1424 bp ss-mRNA            MAM       15-MAR-1990
DEFINITION  Pig uteroferrin mRNA, complete cds.
ACCESSION   M30284
KEYWORDS    iron-transport protein; transport protein; uteroferrin.
SOURCE      Pig (60-day pregnant) endometrial, cDNA to mRNA.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1424)
  AUTHORS   Simmen,R.C.M., Srinivas,V. and Roberts,R.M.
  TITLE     cDNA sequence, gene organization, and progesterone induction of
            mRNA for uteroferrin, a porcine uterine iron transport protein
  JOURNAL   DNA 8, 543-554 (1989)
  STANDARD  simple staff_review
COMMENT     See GenBank entry M30283 for sequence of the 5' end of the
            uteroferrin gene.
FEATURES       from  to/span     description
    pept         95     1111     uteroferrin precursor
    sigp         95      154     uteroferrin signal peptide
    matp        155     1108     uteroferrin
    mRNA          1     1424     UF mRNA
BASE COUNT      299 a    446 c    411 g    268 t
ORIGIN      
        1 acccagggaa taaaggctca gtgaccgcca gttctgcact ggagctcaac ctgcctctct
       61 gagcccccag cgaccggctc aagcctcccc taggatggac acgtggacgg tactgctcat
      121 cctgcaagcc tccctggtgc tccccggggc cgtgggcacc agaaccaaca ccaggactgc
      181 ccccaccccc atcctgcgtt ttgtggccgt gggggactgg ggaggggtcc ccaatgcccc
      241 gttccacaca gcccgggaaa tggccaatgc caaggcgatt gccacaactg tgaagaccct
      301 gggcgcagac ttcatcctgt ccctggggga caatttctac ttcactgggg tgcatgatgc
      361 caaagacaag aggttccagg agacctttga ggatgtgttt tctgacccct ccctccgcaa
      421 tgtgccctgg cacgtgctgg ctggcaacca cgaccatctg gggaacgtct cggcacagat
      481 agcctattct aagatctcca agcgctggaa cttccccagc ccttactacc gcctgcgctt
      541 caagatccca cggtccaatg tgtctgtggc catcttcatg ctggacacgg tgacgctgtg
      601 tggcaactcg gacgactttg tcagccagca accggagcgc aacctggcgc tggcccgcac
      661 acagctggcc tggatcaaga agcagctggc ggcagcaaag gaggactatg tgctggtggc
      721 cggccactat cctgtgtggt ccatcgccga gcatggaccc acccactgcc tagtcaagca
      781 gttgctgcca ctgctgacca cacacaaggt cactgcctac ctgtgcggcc atgaccacaa
      841 cctgcagtac cttcaggatg agaatggctt gggctttgtg ctgagcgggg ccgggaactt
      901 catggacccc tccaagaagc acctgcgcaa ggtccccaac ggctacctgc gcttccactt
      961 tggggctgag aactcactag gtggcttcgc ctacgtggag atcaccccca aagagatgag
     1021 cgtcacttac attgaagcct cgggcaagtc tctcttcaag accaagttgc caaggcgagc
     1081 caggtctgag caccagcacc gacgggcctg agaagtgcct ccctgctggc gggtgggtgg
     1141 gtggcccact gggccatccc ccccagccct taagcaggct ttcctgggta cctgtggtgc
     1201 tacggcagag cgggaaggag atggacagat gaggaaatgt gatgccaagt ggcccttgtg
     1261 acaaggatgc ccacagaggt gaaagaagca tggacacatg taccggccag agtgccaagg
     1321 ccccatggcc gggctcgccc agcctgagct tctgggcaat ggggggtgag gaagaaagct
     1381 ctctcctggg tcaggcatcc ttctgtcact gccaaatagt taat
//
LOCUS       RABBGLOB    44594 bp ds-DNA             MAM       15-MAR-1990
DEFINITION  Rabbit beta-like globin gene cluster encoding the epsilon,
            gamma, delta (pseudogene) and beta globin polypeptides, complete
            cds.
ACCESSION   M18818 X07786 X07787 X07788
KEYWORDS    beta-globin; delta globin; epsilon globin; gamma globin;
            globin gene cluster; hemoglobin alpha-2 beta-2; pseudogene.
SOURCE      Rabbit (strain New Zealand White, haplotype short) liver DNA,
            clones lambda R beta G[3,5,8].
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 44594)
  AUTHORS   Margot,J., Demers,G.W. and Hardison,R.
  TITLE     Complete nucleotide sequence of the rabbit beta-like globin gene
            cluster: analysis of intergenic sequences and comparison with the
            human beta-like globin gene cluster
  JOURNAL   J. Mol. Biol. (1988) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by R.Hardison, 02-MAY-1988.
            
            The epsilon and gamma genes are only transcribed in embryonic red
            cells derived from the yolk-sac.  Both genes encode abundant
            amounts of embryonic globin polypeptides.  The delta globin is a
            pseudogene, and is not transcribed in any erythroid tissue.  It is
            the product of a gene conversion between delta and beta, so that
            the 5' end of the delta globin is derived from the beta gene.
            Finally, the beta gene is transcribed in red cells derived from
            both fetal liver and adult bone marrow.
            
FEATURES       from  to/span     description
    pept       5999     6090     epsilon globin, exon 1
               6212     6434     epsilon globin, exon 2
               7237     7365     epsilon globin, exon 3
    pre-msg    5942  >  7489     E-G mRNA + introns
    IVS        6091     6211     E-G, intron A
    IVS        6435     7236     E-G, intron B
    pept      15525    15616     gamma globin, exon 1
              15741    15963     gamma globin, exon 2
              16781    16909     gamma globin, exon 3
    pre-msg   15463  > 16997     G-G mRNA + introns
    IVS       15617    15740     G-G, intron A
    IVS       15964    16780     G-G, intron B
    pept.ps   22252    22342     delta pseudo-globin, exon 1
              22445    22665     delta pseudo-globin, exon 2
              23421    23548     delta pseudo-globin, exon 3
    pre-msg   22204  > 23676     D-pG mRNA + introns
    IVS       22343    22444     D-pG, intron A
    IVS       22666    23420     D-pG, intron B (no splice consensus)
    pept      30970    31061     beta globin, exon 1
              31188    31410     beta globin, exon 2
              31984    32112     beta globin, exon 3
    pre-msg   30917  > 32204     B-G mRNA + introns
    rpt        1201     2586     direct repeat L10c-1
    rpt        2817     3137     direct repeat C1
    rpt        3553     3827     direct repeat C2
    rpt        8895     9206     direct repeat C3
    rpt       10039    10390     direct repeat C4
    rpt       10444    10808     direct repeat C5
    rpt       10866    11186     direct repeat C6
    rpt       11815    12184     direct repeat C7
    rpt       18604    18936     direct repeat C8
    rpt       18937    19059     direct repeat C9 (approx. 5' and 3' ends)
    rpt       19162    19470     direct repeat C10
    rpt       20027    20379     direct repeat C11
    rpt       20382    20743     direct repeat C12
    rpt       24175    24573     direct repeat C13
    rpt       25864    26076     direct repeat L10c-2
    rpt       26206    26587     direct repeat C14
    rpt       28578    29020     direct repeat L10c-3
    rpt       32898    33190     direct repeat C15
    rpt       33724    38097     direct repeat L10c-4
    rpt       38098    44594     direct repeat L10c-5
BASE COUNT    12729 a   8346 c   9035 g  14484 t
ORIGIN      
        1 agctacggcg ctggcttgct agacatttta atgatagtca catagctgta acacacatac
       61 atccatatag atgcagacaa agacatactc tgacacacac agatagatga tatagatggt
      121 ccagaatgta ttttcagagg atttctgagc aattaaccac acaggaccct taaatcctaa
      181 cccacaccac tttgccttca gtatgctacc tgtgtgggaa ccagggatgg ggacacagaa
      241 gtcagaagca ctccaggttt ctgttgttag ccagggccct cccttaccat agcttcgtgt
      301 cacactgagc cttatctata cccacagttt ctgaatccac ctggctatca ccagctcagt
      361 ccttaggaga agggaaaaca aacactgagt ttgtctctgc ttataaaaac aaagcaaaat
      421 aaaattaaat ttcaatccca tcccaaacaa atgaataact aagcaaaaac aaaaaggaga
      481 agtggaaggg aagtggagga ggaagaagaa gagggtaaga aaacagttgc agaaagaggc
      541 tggagatgtg tgcgtctgaa ggagggaaga gccccagtag cctcacaccg gcctagggac
      601 ctgcggaagg agcagcttct ttcctctaca aaatgctcca gatttgataa gctaggaggt
      661 ggggtaattt actaagagaa gggacacatt tggggctttt gttattcaag acttaaaaat
      721 tttgggtagt gaatggtctt gctaatggat ggggatctat cttcttccac ttaagccctg
      781 cacacttgcc attccttaat tttgtttcag ggagatactt tttttgtttt gttttgtttt
      841 gttttagatt caggaaactt caatgtcact gaaaacagca tttctaggaa agaaagagtt
      901 atttactttt gtaagtcaaa aatgatgaaa tcatggatga ctttctagca gcagaggtag
      961 aatattatct tataattgga tattttccta caaattatag tacaaggttt tatgctctat
     1021 atatctaaga gtctcatttg aagcaactct gtaaaaattt tccagtattg agtttatgta
     1081 tattttagaa gtgctgcatt tgaaggaaac ttgtaaacta ttacttaagt cttgggcaag
     1141 agcaatgaga tttgtctcaa tgacctaagg tagtaaactg gagggaagga agattaaaac
     1201 atactcattc tgaacaccaa aaaggaaatc tgttgaagtg aaatggacac tatgagaaac
     1261 ggtgacttga tcagcagagc cctgactgtt aatgaacaac ttaatacatt atccctctta
     1321 ttagtttttt gtctgttcta cttaatatga ctggtttaat tctgtaattt atacacagtt
     1381 attcttaagt gttgaaaatc aactaaaatg tgatccctgt taaacataag agtgggaata
     1441 agagagggaa gagatgtaca atttgggaca tgctcaagct gacttgcccc aaatggtaga
     1501 gttagaaaca taccagggga ttccaattca atcccatcaa ggtggcatgt accaatgcca
     1561 tctcctagtc caagtgatca atttcacttc acaattgatc ataatgaaag gactaagagt
     1621 caaagggagc acataaacaa gtctagtacc tgctaacact aaccgataga ataaataaag
     1681 gggagagtga tccaacatgg gaagtgaggt actcagcaga ctcatagaat ggcagatgtc
     1741 ctaaatagca ctctggcctc agaatcagcc ctaaaggcat tccgatctgg ctgaaaagcc
     1801 catgagagta tttcaggcat ggaaagccaa gacactctgg caaaagatct ctgtgagtga
     1861 gatcccagtg gaaagaacag gtcttcaaag aaggaggtac ctttctctga agggaggaga
     1921 gaacctccac tttgactatg accttgtcta aataagataa gagtcggaga actcagaggg
     1981 cttccatagc cttggaaact catgactgga gcatagggag attactgatg ccatagacag
     2041 gagtgtcaat tggtaaagtc aacaacagga gtcactgtgc acttactcct catgtaggat
     2101 ctctgtcctt aatgtgctgt acattgagac ttaatgctat aacgagtact caaacaatat
     2161 atttcacttt gtgtttctat gggggtgcaa actgttgaaa tctttactta atgcatacta
     2221 aactgatcct ctgtaaaaaa aaaaaaagaa attatcaact cccaacttga ctctcactgg
     2281 gattaaacat gacaataggt ctgatctgat ttcatcatca tttaaaaaaa atcatctatt
     2341 atttttcact ttatgtttct gtgtgggagc aaactgttga aatccttact taatgtatac
     2401 taagctgatc ttctgtatat taagataatc gaaaatgaat cttgatgtga atggaagggg
     2461 agagggagtg ggaaagggga gggttgtggg tgggagggac ggtatggggg ggaaaccatt
     2521 gtaatccata aatcgtattt ggaaatttat attcattaaa taaaagttaa aaaaaatact
     2581 cattctaata aaaatggcat cccttctatt tcctaacatt ttatttctat gttgctttta
     2641 attcatggac tatgttagaa aaattattta acacaaacac agaaaaataa gaatttacca
     2701 gtattttaat catgaaataa tctaacactt atgtatagga tcagaaaatc ccaaagaaac
     2761 caaaaaaagt aaggccatat ttcagataaa aggccatatt tcagataaat ggtaaccaaa
     2821 tgcaagaatg ttttaaacat ttaattaacc acttagaaag ggacacacag gcacacacac
     2881 atacacagaa tcagagagag ctacctgctt gttcactccc aaatgccccc caaaaaccca
     2941 ggataggtta ggcagaaacc aggagcccag actcaatctg gatctcccac aggtgtggca
     3001 gggacccccc atacctgagt tctcacttgc tgcctcttag gaagctagaa tggggagcag
     3061 agccaggacc caagctcaag ccttctgata tagaatacgg gcatctcagg cagcatcaca
     3121 actggtgact tcaaaagttg tccctaattt caacttctaa tttccaagct ttatccttca
     3181 gaaaagtgtg acgtgaaatg aatgattctg cctttccact ccttcttccc agtaaattta
     3241 atttgcattc acagatctat ggttatgctt tagggccatt gtccacaagg tccaaagtct
     3301 gtgaactaag ctactaagag ataaatcagt ctgagaatga gacctaattt tgactcaaat
     3361 tcatctacaa cctttgtaag aaatgaggca gacgcctgat gtgttcacag cagcagcatt
     3421 gtatgtccac tagtagtggt ttagtctcac tttttgcagc tacaccagaa aatattctta
     3481 agacttatga aagcagccac cagaaattct tgttttttgt taccaaagtc attgccagct
     3541 aaggcaaaac ataagagaaa cttctgctgt ggccagggag tgcagtggag gatggcctag
     3601 gtgcttgggc cctgcacccc atgggagacc aggaaaagca cctggctcct ggctcctgcc
     3661 atcggatcac gcggtgcgcc ggccgcagcg cgccggccgc ggcggccatt ggagggtgaa
     3721 ccaacggcaa aggaagacct ttctctctgt ctctctctct cactgttcac tctgcctgtc
     3781 aaaaaataaa aaaaaataaa aaaaataaaa aaaaaaggaa agagaaaaag agagagagag
     3841 gaagagagga tatcggggaa ttcaatctgg gtctcctata ttggtagcag agaaccaagt
     3901 actagagcca tcaccagatg tgcccctcca ccccaggttt gcattaacag gaagctggaa
     3961 ttgacaacaa agctaagact tgaacctagg aaatctgata ttggatgtgg gtatcccaat
     4021 caatcatcaa cttgtcatca tttgacatca actatgtcaa atgcttggtg ttcagatatc
     4081 ttgaagtaat atttgtctca atacttggat attaccctca ttgtgaaatc accactagat
     4141 ctaatgtatt cataatgaag tatatgtaac actacactac atacttgata tttcatgttt
     4201 ggttatttta tctcaataca ttttatcctt agcaatatta cgtatattaa ttcatgcatt
     4261 aacacattat ttagtgtttc cattctacct aattagtcca tgatatagaa gaagagtagg
     4321 aatttctgac tttatctatc ttcttcacta aaagcaagaa agggaggtag agattcaata
     4381 atctggaata atactagttt gaggggagtc atttaatctg atgaagcttc tactattaga
     4441 gtataaagga agataaagta gtctcatgtg ctagaaatgt ttgcattaca cctttaggaa
     4501 gtaatcataa gtaccagcgt agggttataa acaagagaag agttctgggt aaacttagaa
     4561 cattgagatg caaggcactg gactagggag acaaaagcca tagtgcaaaa caaaaaggac
     4621 ccagaagcac ccatgtcacc agaatgcaag aaactatagg aaaagaaatt tccatgtata
     4681 gcaagagcaa aggaagaact agaggaaaga aaggatagaa acatacgggg aattagccag
     4741 atataagaac tagttaagga atactagata atgattatgt agtggaatgt ttctggaatt
     4801 aaaaaggctg gaatttctgc agttctcttc cattatttct ctgatcatca actttttcct
     4861 tttgacaatt atataacttc tgtgacctct ctttctttat tccattccgt ggataatgcc
     4921 aattgtatga ctcagtacag agcagtgaaa taacaaaggg aatatctttc atgaacatca
     4981 ataattcact aatacatata aatgtggatt tacagttttg tctccaacat gcaaagaatt
     5041 agccagtcat cacatcaggg ggcttcaaaa acaagggcac ttagaagaat ttgaacctct
     5101 accctggtga ctacagcaca cactagacaa atcaattaac ttattcctca gtttgattat
     5161 aaatcctggg tttaaataat tttagtgtca tctgcctcat ggattaaagt ataagtagag
     5221 cagacaagcc aaaaaagaga gtcagggaaa tatccctgga gtgctctctg taagcacagg
     5281 agataacctt gctgtctgtc ctgtctagga aatcactcct agactgtact ttggaattga
     5341 gtcttagtcc tactgcagat cccttgtcac tctcttcagc ttccaggagt aaccatgcat
     5401 ggcccctgtc tcatggaaaa actaagatct caaaagtgtt tgcagttcac tcagggaagc
     5461 tcaagtggat gatgttctgc ctcaaccact ttcatttttt gtgtgtgtgt gtgctgtgtt
     5521 gctttgtttt ccttgacctt ctttttgtca ccaaccttga cccaaatatg aaactacttt
     5581 agccctaaca ctttccagtc ttatgcttag ctgctctcta tgaatctttc ttggaaaagg
     5641 ggaatgggaa tgatggatgc catcttggaa agattacatg gaaaaagaag agagtgaaaa
     5701 catcgataat gtatgacaaa ttgtgttctg cttggaggag gaaccaacag agtagggatg
     5761 tctcacgatc cgaaagccat tatcacaaac tcagtgaacc accactgacc atctggtggc
     5821 ctggctccac ccatgaagga cacattccag tcttgaccaa tgacttcaaa gtatgaaaga
     5881 gaacaagggg ccagaccatc agcagtgggg aataaaaggc caagccttga agcagcagca
     5941 caaagctgct tctgacacat ttgtgatcga tcaccagcaa gctcccagac gtgacaccat
     6001 ggtgcacttt actcctgagg aaaagtgcat tatcagtaag cagtggggcc aggtgaacat
     6061 cgatgagact ggaggtgaag ctttgggcag gtaagtattg gttctcaata cttggtagag
     6121 aaaggcaaat atgaccctgg tagactgacc aggaaattgc tcaaagtttt tgtatctctg
     6181 attttctatt tgctgttgtc ccatcatata ggctccttgt tgtctacccc tggacccaaa
     6241 gattctttga caactttggc aacttgtcct cttcctctgc catcatggga aaccccaagg
     6301 tcaaggcaca tggcaagaag gtgctgacct cctttggaga tgccatcaag aacatggaca
     6361 accttaaggg tgcctttgct aagctgagtg agctgcactg tgacaaactg catgtggatc
     6421 ctgagaactt caaagtgagt ttaggagatg ctcatgagct ctttctggtt ttataactta
     6481 gcacaataat gggatataag actatgattg gagagatcaa caaagatttc agaagtcata
     6541 gatatgtttt gttgttagag atctccaatg gtcatcaaca agaccaattt aattcagaaa
     6601 tagtaaggta aaaggctatt aacagtctaa tttttcagaa aatgatgtgt ttatcctgag
     6661 aataatgtta ttgaattgta gctttgtaca aggaaaaata gaatttgtct gttagtagat
     6721 gaaggcctat ctcaatgaag gagaaatgca atgcttacat tatatttttt ctgatgatga
     6781 actaggactt gattagggac gatttatgcg agatctcttg aaaaaaaata caacaaattt
     6841 tctgaaaaat attaaattct ctcagtattg tgaccaaata taaacttgtt cctgctttgg
     6901 ttggagactt taaactccaa gacccttgca acactggcat tctatgttta ctgaaactaa
     6961 acaattatta aatgagaaag aaaaaaagaa agaagataaa tatggtgaaa tgtggaagga
     7021 ggataagata taggtagtca gaatgttgaa cggaagtctc ataaaaatga aattgaatcc
     7081 attcttgtct tggtttaagg gtaggcacaa tccttggagt aatttgaggt cagagttttc
     7141 agggaatgta ggtacatagt catttctttc tttgtaaatt cttttgggaa cacctactca
     7201 atatgcctat gtatcatttt gtcttttacc taacagctcc tgggaaacgt gctgctgatc
     7261 gttctggcca ctcattttgg caaagaattc actccggagg tgcaggctgc ttggcagaag
     7321 ctcgtgtctg gtgttgccat tgctctggcc cacaagtatc actgaactct cttcccagtt
     7381 tctttagtgc ccctactcat ccccagcgtc ctccttctgt accttgggga ctaggctcag
     7441 ccttgatggc acagtttctg tttaataaaa tacattctat tcagtaatca aaaaatgatg
     7501 tctcatcttc ttcgtatttt actcatgtgt ttaaaaaaaa aaaaccacac atgttcatga
     7561 ttgagacaga gagagacata ggaagaaata agaatctttg agtagtatat taggatttat
     7621 caaataaagg gaggtgttat tggagggtac aggagaatcc agagaggaca ccaagatggg
     7681 gaagaagttc tcttgggtgg ggcaaaggct tcattaagtg gactgagtgt ggcaaaaaca
     7741 tacggcagag ggaagggaat gagggaccaa agtggtatct gcgaaatcag atttatggtg
     7801 aatgttgttt cactcaaaag aaagatttgt agtgatcaaa actctagaca tggtaatact
     7861 attttaatgt acttgaacat acacattgaa gaaattagct tgatttatat cttgagaaat
     7921 tggtttgaat ataaagaaat gagttaaagt atgagttgga aatttccaga aaaaaatgtt
     7981 ttcatgaaca aactgcagtt tgttctttcc tgaagggttc tgttgtctca atgggatgcc
     8041 aagactgaga catattacat cattgtgcat tatcttctga ttgaatgctg taaaataaag
     8101 ctagtttcaa cacaaatgaa aactttcatg ggaaagataa agatttcatg tagaaaataa
     8161 aatactagca aatgtgcctg gtcctaaatc agcaaggtaa aacttggtgt catagatgga
     8221 actggaattc tggaagtaga ggggagtatt aggaaagatt agatagtgct gagctttgac
     8281 tataaaaaat aaaatctact actaggctat tagcgacaat gagttataaa tattcttagg
     8341 atgctaagtg actccatgga taagattttg ggggaaataa tcttcttatg catgctggat
     8401 gaattcaaat tttcagagac tagacatgac tgattattca aaaggacaga tataatgatt
     8461 taaacctgag gggggcagag agatctacaa aagtgaatga tgtacaatgt cgagggagga
     8521 agagatggta tctgattgac ggaggtgagt gagaaggaag gaggagctgg acaatgactg
     8581 acaggtccca gactggctga atgtaacagg aggaggccag gcactggcct ggagaccctc
     8641 ctataggagt gtgattaaat gattcactct tgagggagtg ctggtttgct cagggaatct
     8701 tgacattcag gtcctgagtc cacctctcaa atcgggagaa tttacaagtt agaatgctgt
     8761 atcttcacaa atgaaacttt ctcagatatt cattgccccc acacattgat cagaatttgt
     8821 cctatgaatt atttttctct tttctgaata tattaattgt aaaattgtaa taaaacaagt
     8881 aacaaaatgc atgtcagtat acttcttttt atttatttga cagatagagt tagacagtga
     8941 gagagacaga gagaaaggtc ttccttccat tggttcaccc cctaaatggc tgccacggcc
     9001 ggaactaccc cgatctgaag ccaggagcca ggtgcttttt cctacgcgag tgcaggcaca
     9061 caatcacttg ggccatcctc cactgccttc ctaggccaca gcagagagct ggattggaag
     9121 aggagcaacc gggactagaa cctggcgccc atatgggatg ccggcaccgc aggtggagga
     9181 ttaaccaaga gagccacagt atacttaaca acataataaa tttgaaggca ttctatcttc
     9241 catcctcatg aattccttaa aatttagtta attgaataat agtgtctaac agtattagaa
     9301 aaagattact taattttgat agaaaccaga tattttgtga atattgttgc tactcaagga
     9361 gaaaaaaatg caaaaaacac caaatgaaag ttttcttctt tttcttcaac attttgggaa
     9421 tcattgtttt cagagtgagt taaatcaatt attccatgtg actttttgaa ttattgatgt
     9481 ttgttgaagt ggaaacaaga atgtcgaatc taagaaattt gaaaggtatt ggctaggttt
     9541 tccacctgta tattaaagga tattttcact agatataaaa tgatatgttg gcaatatatt
     9601 tttatgaaag ataaaatatc attccattgt tctatgtttt cttctttgag ttttagcagt
     9661 ttgactatgg tgtgcattgg atcgtttttc tcttctgttt aaaccttgct taaagtttcc
     9721 agtcttggat atgtggaaca tttatgagat tttaaaagat tttgctcaat atttattggg
     9781 tttccttctg tcctcctttg ctccaggctc ctctatcagc tgctcagttt tttttttttt
     9841 cattctatct gtgcttcaat ttgagtattt caattgagta tttcctattg ctctttgagt
     9901 ctgctaatct tttttatgtc atttctgtct gctattagct aatgtactga attcttcatc
     9961 ttaaatatta caattttcag cacaggagtt ttatttgatt ttcttctata ctccaattct
    10021 atgaaaatat gctgtttgct cattctctac cttttaaatt tatttttaat gattatctac
    10081 tttctatgtt taggttttat ttttatgttt gtttttcatt tgaaaggcac agggacagaa
    10141 aaagagatag aggtagagat agacccagag agagagaaag agatctttct gctggttcac
    10201 tactcaaatt tctgcagtag ccataatagt atcagcctaa aaccaggaac tagggctcaa
    10261 tctagatctc ttatgccgca ttccagggta catgttaact ggaaggtgta atagaaaaca
    10321 cagttaggac ttgaaccaag cactctgata ggagatgtgg gcatcccaaa tggtgtcata
    10381 accactgcat caaacactca cctcattaaa aaagtttttg attaacatat taaacgtaca
    10441 tatttatgtt caaatattta tttgtgtatt tgaaagtcag agcaacagag agagagagac
    10501 agagacagag acagaaacct tctataggct ggctcactcc ccaaatggca gatatggcca
    10561 ggacagtgtc aggccaaagc caggagccaa gaactctatc caggtcattc atgtgggtgg
    10621 cagaagttca agtacttggg aaatcttcag atgcttctcc aaatacgtta gcatgcagct
    10681 ggatctgaag tggaacagcc aggactcgaa ccagtgctcc aatatgggat gccagtgttg
    10741 caagcagtgt cttaaattgc tgcaccacaa cactggcccc aataattgta tattctaatt
    10801 atgttaaatg cagtatccaa cacaagtatt tagcataaac tcatcaaata ggtaattagc
    10861 ctttccatta tctttccttt tttgaagatt tattttattt atttgaaaga caaagtttca
    10921 tagaggacaa gacaaagaga ggtctttcat ctgctggttc acttcccaaa tgactacaac
    10981 aactgggtct gccagcccaa gccagaaaca ggagcttctt ctgggtcttc catgagggta
    11041 gaggggtgga agcacttgga tctctgctgc tttcccagcc acattagcag agaatagata
    11101 gaagtagaca tgaaccggct cccatatggg atgctgatga tgcagatagt ggttttacct
    11161 gctacaccac aacaccaact ttatctcttc tttgtgtttg gagcctgtga gatcctctct
    11221 ttcagttctt tgcagaatag ataacggtat tatgaactat aatagctcta ttctcctatg
    11281 aaacactaga atttatttat ttggtctaac tgttttggta tctgttattc tacctccctc
    11341 tatcctgctc cccatacctt ctctatcttc tagtaatcat tatgctaaag ccagtttttt
    11401 gagacttcca tatatgagac agaacacata gtagttgtct ttctgtgtct ggtttatttc
    11461 acttaacata aagagttcca gttccattca ttttgctcca aatgtcaaaa tttcattctt
    11521 tttatggctg aataatattt cattgtatgc acatcacatt ttaattttta ttaatataat
    11581 gaaaacagac ttaatgtagc tcatagatac aactctaata atataatgat acttcccttc
    11641 ctccctccct ctccttttct ttcttctttt aattattgat ataacatatt tgtaacttac
    11701 attaactttg acattatagt caaagactta atgctccatt aaataaagag ttagataagg
    11761 ccatagttca tcagacatat aatcaagaac aataaacaat aatcaaaaga aattatgtta
    11821 attttactca tatacattaa ttttttaaaa agatttattt atttattcga aggtcagagt
    11881 aacacagaga gaaaaggagg ggcagagaga gagagtggtt ttccatccaa tggttcactc
    11941 ccaaattgct gcaacagccg gaactgtgtt gatccaaagc caggagccag gagcttcttc
    12001 caggtctccc acgtaagtac agggcccaag gacttgggac atcttctact gctttcccag
    12061 gccatagcag agagctggat tgtaagtgga gcagccgggt cttgaactgg cacccatatg
    12121 ggatggcggc actttgggcc agggtgttaa tccgctgtgc cacagcacca gccccatgta
    12181 aattacattt taaaataatc ataggtcatt taaactacag tagtaaatca ttcttaaaca
    12241 tttttgtaca aagattcttt tttttacatg acattttctt tatctatcaa ttgggcacct
    12301 agatcaattc catattttgg ctgttgtgaa cagccaaatg aatatgacag ggaaggtatt
    12361 tttatgatat actgttttta ttccattgga tatatatcca gaagagagat gctggataat
    12421 atatggttaa tatattttta gttttttgag aaaactaaat accccatgat gactgcccta
    12481 atttccattg tgaatgactt cttaatttta tgttaaatct ctatactatg gataaatgaa
    12541 caacgattat tgtagttttg ggttatcagt ggcctctttg agttcagtat gaccctaaac
    12601 agagtaaaca ggggttattc atgccactaa tgtttttaag caatcataaa tgagatgaag
    12661 gtgaattttt aatcaatcaa ttttttcctc tcttgttccc aactattatt ctggctctgc
    12721 tagaaggcat caaggatggt gactatggag ctatagacag gtaacacact cttcatccct
    12781 aagagggggg tagatctaaa tctagatagt actttcccaa cataacataa ttggaataaa
    12841 tttttaattc acgcatacac aactcagaga caggtgtaaa tgatacacaa aagtgtaagt
    12901 aattcatgtc aatgtatagg caataagttt tctgaacaaa atagtgttgc aacaaccaga
    12961 atattctgac aacagtaatg taaccaggaa tacagagagc ttagagatta tgccattacg
    13021 aagattaaaa tcctgtaacg acctctcact cccaccagcg gaagacaggt tccatctctt
    13081 tagacctgtg cctcttctgc aacaatgagt caatgttctc cacttcccag gcccaccatt
    13141 tatctctttg gcattccaag agttgctttt ctattgcttc agagctttga atttgctatt
    13201 tcatcttcta agttctcttt cctaatatct tcccatgaca gattcctgtt tgtcagtctg
    13261 aagcagctca agtacttcct ctttggaaaa agctctttct gacacaatca tgcttcattc
    13321 ttaccaagtc cagataataa ttttcttttt agaaagcagt gatatctgaa attattttgt
    13381 taatcattta cttattcatt ttctgtatct tctgccccta aattttaaac cctcagagaa
    13441 aaatttattt ccttgatcag ttgtcttaaa gccagagttt ctagaacaat tataacacaa
    13501 ttattcatgt tattatgtaa atgtttaata gtacatggga tggcatttat gaaaacacaa
    13561 ctttataggt gaggtcttag gtatatcata agtaatgacg tggggaaaat ggaaggataa
    13621 aacatctatg gacaaggaga aaattggaaa tattacctca aaaagatagt gggggggggg
    13681 attttcttgt tagcgatgac tcttgaatca ctgggtttct agaaagatag ggaagttgct
    13741 tgcagatgtg agggatcata gatatccctg agtctgagag ggagtgatta aaacaccctc
    13801 atttgttaag acttttggag tcttagatat tagcctctga gtcatcctcc aaggtcgagt
    13861 tctataaact gaccacaata ggactattaa tgtggtcaaa gtgctttctt tataaatcta
    13921 cttcacaggc ttcatttctt aaggagactg taacctgaat tcctttatat cagtagaaat
    13981 gtcctctact cacccttagc ttgagaaaag tcataaatta ttttggttat actggaaaat
    14041 tactgacatc ttgaacaaat ctgatcttcc tagtttttta atcattatca acagtgaaca
    14101 aatacaggtt gtatgtggca tcaagaaaag tagaaggatt ttagctgtat tattttaatt
    14161 gctgttgatc tgaaatgtgt aatttcataa cctgtatttt tatatgatca ttgagacttg
    14221 ccacagggac aaataataat taagccagac ttccaaaact tcgaatatta tgaactagca
    14281 aggtatgaaa ccatggactg ttttcagaga caataaaaca aaatagcata aacttaaaaa
    14341 aaggactctt cctgccttat tcaaagtgtg acatgtcaga gacagaagtc aattaaagat
    14401 aaaagtgtct aaaagtttag ttatgtccca gctgtgacag tacatgaatg atgagtcaat
    14461 ggaaatagct cctgcctttt gagctatttc aaacacacat aagaagagaa ggagatgctc
    14521 cctgatgttt tgtcttataa taaaaattgc tgaaaatttt ccctggcatt tatcagagat
    14581 aatttactca gagccacact gagtacagaa gagcaacacc aatactctgg gtcaccagtc
    14641 agcatacaca cttatccagt gggaagcaca cattatgtgg tgcctacaga gcaagacatt
    14701 catgtggatg aaaatatatt ttgctcacgc agtacagaca tacttatttc aaacagatac
    14761 aatacatgta agtagaaaat aaagtagata catgtaagta gaaaataaag taagagttca
    14821 attctgtatt atgctactta atactaacat atttatgcat ggagagttat gctttatggg
    14881 agttgttttc cctacacagt tgtgaattag gaatatatct cctaaagcca caaaggatat
    14941 attcacaatc agtataggac ttctcccctg gccattccta actgaatgag atttatactt
    15001 gtctcatcag aaatcttgga tctgtgccca aaattatctg aatacttttc atatgtttgc
    15061 ttatgcttta gctataattt gcaggcatct atggtgttag tgactaaaat ataggaaatg
    15121 ctggcaaaaa aagtcctaga ctatttaatg gtgaggaaac aagaaaaata gctaaataga
    15181 aaaataaatg ggagaactct ttaaaaaaat gaattgaaaa gggacaggac taacaaaact
    15241 tgaggagcag agcccccttg agggggcctc tctgccacta tctcaatata tagtctgtgt
    15301 gaagcttctc tgcctaaacc tcacccctgc gctgaccagc cttgccttga ccaatagtcg
    15361 ttacacaaaa acactgacca atagcctcag agaacacggc gaaacaaggg gccagatgtc
    15421 cagcgaggaa gaataaaagg acgagcctta gagcagtttc acatacttgc ttctgagaca
    15481 tctgagacta tcagcaagct cagcgagctc ctagaccaga catcatggtt catttcacgg
    15541 ctgaggaaaa ggctgctatc acaagcacat ggaagctggt ggacgtagaa gatgccggag
    15601 cagaagccct cggaaggtag gccctggggt ccaggacaag gcagaggagg aatgaaactg
    15661 agcctggcag gactccaggc cctttctcag gacttgtgaa gctctctgat actcccacca
    15721 ctattgttct gtctctatag gctcctggtc gtctatcctt ggacccagag attcttcgat
    15781 agctttggaa acctgtcctc ttcctctgcc atcatgggaa accccaaagt gaaggcccat
    15841 ggcaagaagg tgttgactgc ctttggagat gctgtaaaga atgtggatga tctcaaaaac
    15901 acattcgccc acctgagtga gctgcactgt gacaggctgc atgtggatcc cgagaacttc
    15961 aaagtaagtc caagacaagc tcaagccctc ttgcatttag tcatgggtga aggtaggaca
    16021 ttaagtattg gccttctgca gcataggaga tatttggttg ggagataaga aagctctaga
    16081 ggaactgaac tgagactaag tggtaatgca ttagggtcta aacaatactt tcaggagtgt
    16141 agagggattc gtctatactc aagtgtattt tagaagaggc ttctgtagag atgaggaaac
    16201 agacttttgt taatttgact tttgaagaaa taactccttc atttctcccc caactcactt
    16261 tccttaaaaa agtttaaaac cctaatgaag gcagaaaagg agtgacaatg ggaaagataa
    16321 atgagggcag ccataaggcg agacttacgt gattttgtgt agagtgccat gcatattcct
    16381 ctaatggcta atacttgttt aaaatgctgc aatgcccatt tctaacctaa attatacact
    16441 tccagattga ggattatgtg tctctgggag agaatattgg aacatttggt ccgaagtggt
    16501 tttgaaaaga taccgtgagg tgttaccttt tcccctcagc ccacaaatgg ggagttgtgt
    16561 tagatgatga cagaaatatt taaattatga ccaatgatta gtttggacag gggaccaaac
    16621 tacttacatg taacaactgg gaggggggga agatcagcct tggaggtaaa ttaacaaagg
    16681 cttggttttg aggggtattc agtgtccaga tagcgggaga ctagcctgaa ctctctatta
    16741 actatgtgtt tgtgtttgtc ttctcttttc acctccacag ctcttaggaa atgtgctagt
    16801 aattgtcttg gcgaagtatt ttggcaagga attcactccc caggtccaga gtgcctggca
    16861 gaaattggtg gctggagtgg ccactgccct ggcccacaag tatcactgaa ccccttcact
    16921 gtaggacaga gcttctagga agaagcttta tccctcaaat aataatgaaa ataataaaac
    16981 tactctaaga aattatttgt gatggtattg agtttatttt ccttgtactt ttaaatatat
    17041 gatcctcaag ggatttattt tttttattgt tgagggatgt gtgtgtgtgt gtgtgtgtgc
    17101 ctgtgtacta tgtgtgtata ctcttcattt acttttgctt gagcggtgga tttctgatga
    17161 gaaaacagaa gaacaaggca acaaaagggc atgcatgtga gaattctgca cgggaaacaa
    17221 ggagtgggga tttcgggtga ggaggaagtg ctttatggga caagaattca gaacaggaca
    17281 gtgtagagga ggggatgaag tcttcaggga agtcctcagg ggctgacttg aggcagagta
    17341 gtgatggggc tcaatggaaa gaagaaatta cagaccttag taggtctaag tctgggcaag
    17401 agttctaatt tcagtctatt tggtgcaaca caattcccac atgcaccata ctttccacaa
    17461 catttgtggg gctagatagg aggttggaga tcttgaagaa cacagcaggg ggtagttctt
    17521 ttccaagtta accacaaata gctgatcctg aagagattta tgctttaagt gttgaggctt
    17581 gctaggatag ttaatttgat cccatgcttc ctgggcagac aatgtccaag ggtacacata
    17641 gaatacagta cttcaaggaa aaagtatcat tgtgagtagc tagaagggat gagtatcttt
    17701 gtgggggaaa taaatcatca gaaggttaga aataagaatg catgtgatat gtgatcaatg
    17761 gagcagcact gggcggtcag agggaaacag gagggatttg tattgattag tttgcagtga
    17821 aggtgttgta tgactaaaag ttaatccatg tttataggca ggtgtacaat acaatgacca
    17881 tggaattagg aagattagcc tggctgtggg tgctaatcat gctaatgtta ggtgactatc
    17941 attgcttttc ttgccttgga attcaatctc atctcaattt tcatctataa tattccaata
    18001 atacacctga gcccatttct ctactttcaa tccttttgtt cctcccttaa tacattttac
    18061 tctgatgttc taatttattt tttctattat gaaatgccaa aaattcttgg ttcagcttgt
    18121 ttttttcttc tgcagtttca gagttttgca ctgaaaattc ctgctgcttc catggactta
    18181 tttctctaac ttcttatatc tggctctatc ttgtcattag gttcctaatt caactctgac
    18241 ttcttttact aaagctattc tacgttattt ttctatagcc agttgctgct gtatcatttt
    18301 atgcattttc ttctgtcata caatacctca taaaattatc ttattcagtt atttgtttag
    18361 ttgtgtgcat tcattgcctt ttctattagc tgtgaccttc atgagacttc ataagttttc
    18421 ttgtctgttc attcaaaatt tccaacattt agaggtgagc atttagccta ggagttaaga
    18481 tgctggttaa aatgtctgtg tctcacacta gagtcacagg gtttaatacc cagctatgcc
    18541 cacaactcta gtttattggt aatgaaagcc atgggaagca atgatgattg ttcaaataat
    18601 tgcttttctg tggccagcac acggctcact tggttaatcc tccgctgcgg tgccggcatc
    18661 ccatatgggc ccagggttct agttccggtt gctcctcttc cagtccagct ctctgctgtg
    18721 gcctggaggg caatggagga tggcccaagt gcttggcccc tgcacctgca tgacagacca
    18781 ggaagaagca cctggctcct ggctttagat cggcatacct ccagctgtag cagtcatttg
    18841 gggagtgaac caatggaagg aaaacctttc tctctgcctc tctctcactg tctattactc
    18901 tacctgtcca ataaataaaa aaaaaatgtt ttctgtcacc tctgtgggag accaggattg
    18961 agttcctacc ttggctttgg atcctatctc cctcttggcc attgcaggac ttagaggagt
    19021 aaaccagtgg atggaaactg tttctctcta cctctcaaat ctatttttta agtttccagc
    19081 attttaaaca atgtatagat tatttggcta agtgaatgtt ttgctttaaa aaatgatggg
    19141 agcatgaatt tggaagattg gttatgtagc tgcttagaat actcacatcc catattggag
    19201 tgcttccttc aaggcccagc tcctctgctt ctgatccaac ttcctgctaa cacataccct
    19261 gggaggccac aggtgatagc ttaagtgctt ggttccctgt caccaatgtg ggagacctgg
    19321 attgagcttc agactcctgg cttcaatctg gccctgctct ggctgctgta gccatttgaa
    19381 gagttaacaa gataaaatat ctctctatct tctctgggtc tctacctttc agataaaaag
    19441 taaaataaaa taataaaata aaacttatgt gatgagtcag atgattgtga tgatgttcta
    19501 gcactgatgt aaagagggat caaatgtcag tggaggccat gcatgtgaaa taaattggaa
    19561 gtgtggaaca ggtatgaaca agaaagaatt aggaagcctt aaaaagtgat ttacaaagag
    19621 ggtcaaaaca gcatttggaa gaaactgcag taataaatgc ctgtccagct ggtaaggtgg
    19681 ggagaacatt cgctgacctg cagagtgatg aatacccctg aatcagcagg gatggcagtg
    19741 tgttcacagt gtttgaggga taagcattac tgccattaga agctagtgcc ctaaaccatt
    19801 tcatcccagc ccacaaagga gtttaagtga gagccccttg gcttctatct ttcactaact
    19861 tctactctca aaagggctcc tttgggtcga ctcaaactct tccttcaggg ctatataaca
    19921 aagcacagaa tctttccttt gcagctctgc accagtccaa aaggcactga gaggtcagag
    19981 agaattcagt ttgttgggat gaattagctt cttgaagctt agcaattgtt attcatgttt
    20041 tatatatttt aagatttatt tacttgaaaa gcagagttac agagagagat ggagacacac
    20101 acacacacac acacacacac ggatagaatc ttccattcac tggttcacta cccaaatgcc
    20161 catgacagcc aggtctaggc caaagccagg agccaggaac tccatctggg tcatcccctt
    20221 gacggcaagg ccccaaacac ttgggtgatt atctgctgct ttcccaggct tattatcagg
    20281 aagctggatt gggagtggag cactagggac tcaaaacggc attcctacat gggacatctg
    20341 ttttgcaagc agcaactgaa cctgctgtac catggtattg gcccaatgga tgcattattt
    20401 tttttttact tgacagagtt tgacagtgag aaagagagag agagagacat agagaaagga
    20461 gaaagttctt ccttctgttg gttcaccccc aaaatggccg ccacggcagg cacgctgcac
    20521 tgatctgaag ccaggagcca ggtgcttcct cctgatttcg catgcaggtg caggagccca
    20581 agcaattggg ccattctcca ctgcccttct gggccacagc agagagctgg attggaagag
    20641 gagcaactgg gactagaacc caacccccat aagggatgcc tgagctgcag ggggatgatt
    20701 aaccaagcga gccatggtgc tggctcccaa tggatgcatt attaatagta atcactcaca
    20761 atggaacaat gctcagttct tatcaaacat aatcaattct tagttgtgtt tccaaacaga
    20821 aggtagatag aaatcctttt aattttaaaa agcaaatcaa aaaccaagaa aaaattttct
    20881 gtacccttat cttaaagtgt gaattacaat taggaaggtc aagaattata aatctattga
    20941 gcattgggtc ggatgctgct atttttaagc taactttaaa caaaatactc tgataatttt
    21001 tgaatgtatt tctgaagttt gttgagctta gactcaagac taggaaaaga gaaataaccc
    21061 tcttcaatgg aggttttcac acccagcaat gctgaaatgc ttgcatatac atggtgtgag
    21121 gttctttttc ataggagcat atctacacaa tataaaagtt tctagtatct gtggcctaga
    21181 ctagatattt tagtctcctt ggattaccaa cccttcttta taaaagaggg cagtctctag
    21241 aggagcgaga aaatcaacca cagtgttctg tcatagtgaa ttagaaggtg gtggttaaaa
    21301 tgctggttgg gaggcccaga ctccacatta cactgcttgc ctttgagtct tggctctgat
    21361 cgtgattcca gcctccacac acagctaatg tgtgtcctgg gaggatgctg gtgatggcac
    21421 aaataataga gttcctgtca agtcatattc tttctctgct tccctagaca cttccccagt
    21481 tcagttcaga cagaaataat atcaacatgc aatgttgata tcccatttga aagcattgct
    21541 tctcccagtt tttcaacttt ggctggtaat tgggttttgg tgtgtttgta aattttacat
    21601 ctgaagtctc catgtttcca tttctgcatt gtaaaaatct aataagtatc acacagaaca
    21661 aacttgttcc agatttattc tatttttcaa acattattgt ttgaatgaat atgtattttc
    21721 ttaccagaag gattaatgca aagaagagag aagacatact tagaaatatg gcagagatat
    21781 catccattct ttctaatttt ttttgcatgt tcttgagaca caggaagaga tccatatgta
    21841 tacttttaaa ttatccaaat atgcaaacct ctactttgtg tgctttcttt cttcatttgt
    21901 gtaataaaaa aagttgtgga agtaatttct cataggcttg tgatatcaga aaatttacat
    21961 aaaactgcat aaataaaact gctaaggagg gtgattttgg taacaatgaa tattttctgg
    22021 gaggggtcct ggagatacat agagggaaaa gtctgaattc agactcataa gccagtgcca
    22081 gagctgacca tgaccagtgt tctgtcactc ttctggcctc accctggcat tggtgactca
    22141 ccagggtagg cagtgcaggg gccactactg ggcataaaag gcagagtggg acagctgctg
    22201 cttatgcttg tatttgacac aactgtgttt actagcaata ttccaaatag aatggtgcat
    22261 ctctctggtg aagagaagga tacattcact gccctgtggg gcaaggttga tgtgaagaag
    22321 ttggtgatga ggccctgaga gggttgcttt tcaagttata gggcagtcta ttgaggcaga
    22381 tggaagctgg gcttgtgtag acagagctgg ccccagggtt tctgccaggc actgaatccc
    22441 ccagtgctga ttgtctaccc aagaacccag aggttcttta taccgtttgg ggacctgtcc
    22501 tctgaacctg ctgttatgag taatgtcaag gtgacggccc attgcaagaa ggtgctggct
    22561 accttcagtg agggtctgaa tcacctggac aacctcgggg aaaattttgt ccagctgagt
    22621 gagctgcact gtcacaagct ccatgtggat cctgagaact tcagtgttag tctaggcaat
    22681 gttccacttt tttcttttca cattctcatc ttcttttttt gttgtttgtt tgttaagcaa
    22741 cctgtctttg tcaacatttc tatctaattc cctatatttc gtcatttatt gcttttcaaa
    22801 aattttcatt tttctttagt atttcgtttt ttcattcctc acagcattgt ttttttaatg
    22861 tttctttatt aatatcctat attttccttg ctctccactc cctgcccaat atcctttcct
    22921 ctaattgcta tctttgttat gcattctaga ttccatatgc taattctgca cctggaaaaa
    22981 accttctaac cctccatgga gataggagag acttctcaaa aaggagaagc acaggatatg
    23041 tctataggct ataaatcatt tggaacatga agaataattt gaattttata aattcaagaa
    23101 taaatgaaaa taaaaggaag gtaactatct atctgaggac aatagatcaa aaggtcacat
    23161 tgtaggtggg gttgcagatg ttgctaagaa gcaaccacta ttacatcaat taagggatta
    23221 attagtaata ttattatata tgcatataca tgtacatacc tatgtgatta tacacttata
    23281 tatatattgg ttgttttaaa ttgagactct atttagatta agatgtagga agaaaataat
    23341 tagtgttggc ccaatttcag aatacaagct tgatttctgt gctaaccata cgtaaaatat
    23401 ctccatagct ctttctcaca ctccagggca atgggctggt gattgtgctg gccttctact
    23461 tcggcaaaga atttacccca tagctgaagg tgcctatcac agagtggcag ctggtgtggc
    23521 caatgcctgg gctcacaagt acccctaaga tcttgacttg ttctctactt accctgttcc
    23581 atggaaaatc ttttctatat ttcatctatt gaacatgaag acataatacc caacctaagg
    23641 gtgtgagcct aataaaagat catttatctt aacttcatgg ttcatatcat ttattttatt
    23701 tttctcacca tggatatgtg gtgggggaga atccctgatg gcttacagat aggaacctct
    23761 tgtttcttat aaataaatca agaaaatgag aaaaagaaga gggccatata ttcctactaa
    23821 catttgcact tctatctcca aagcatagag attgtcacaa gtgtttcata aagggaggat
    23881 tctgtattac agagattcca caagagacct tcaactaaga tatggcctca taaccacttt
    23941 caaaagttac tgagaaaata atgtgctcaa gggcataaat ttttttgctc cagactgtgg
    24001 tgtttccttt ctttctcttt ccaatattta tacactgtaa ctcgtattac taattgccaa
    24061 acatgtattc ctgctctaag aaccttcctg ccttcctgct cccaaaacct cttcagaata
    24121 atgcttcttt atccagtcct caatttccag tacagtttag tactctctca tcactgctga
    24181 ggtctgtgga gctcagcctt ggggaatttt tatttttaat ttttaagatt tatttattta
    24241 atttaaaatt ggtattatag agaaagggag agagagagag ggagagagag agagagagag
    24301 agatcttcca ttcactggtt tactctacaa atggccatat tggctaggct gaagctagga
    24361 gctttatcca gatcccctat gtgggtggca aggacccaca cattaggcca tcttctgctg
    24421 cttttcctgt gccattagca gggagttggg tcagaagcgg agcagctggg acacaaacta
    24481 gtgctcatat gagatgtcag catcagaggt ggcagcttta cctcgtacgc cacaatactg
    24541 gctccctcag tgaacttttt aatgttgctg aggaaattct ttccaaaccc ttcctgtata
    24601 ttcatttttc acaactttac agctcttctt atgtcacaca cacacacaca cacacacacg
    24661 agcaaataca tacatacacc ccacatatat tatgcttgat aaatatttct gtctattctg
    24721 cactaatttg caggcctact gtgccctgta tcaaattctt ctatagataa taagtggaaa
    24781 ttttttcaat ggaataataa gtgaatgaaa gtaattagga ctcctagctc ttgagggaaa
    24841 taagaatatg ttagatcatt gcgtggagtt cttttgatgt aaatactgca atatacaatg
    24901 aggaagcaag caggaaagta aggtactaaa gataaatctt ttgctttata atacaggctg
    24961 ccaaatttat tcttgttgaa agtaattttg ttatgtgtaa gattagaaga atttttagaa
    25021 ccctctaaca tatgaatttc ttgaaacaca acacattctg gttaattaac ttggacttac
    25081 ttgaatacct aaatagaacc tggtgctttc ttctatatca acatgattct gtaagaactc
    25141 tgcatatact gccaggattt accaacatgc cactgtgtga gggtaaaggg ttttttatgt
    25201 acattttttt cttctcagat gattgcacat agtgtgacca aggaagtctt tatatacaaa
    25261 taaaattttt taaggactac agacttggga tatagatttg acataattgc tatgatgctt
    25321 tattgctatg attgccaact gctagatacc tatatcctat agtggagtgc ccgaatttga
    25381 ttcccagctc tgtttcctgt ttccagcttt tggtaatgca gaccagagag gcagcagtga
    25441 tagttcaaga gattgtgatc agtacatcca tgcgggagaa ttgcattaag ttcccagctc
    25501 atagctcctt ttctgttcat ttgtgttgtg aaccaattga tgggaacaaa taacaatgat
    25561 tattaaagaa acacagactc acatctcaga aaccaaatta ctccaggaaa aatcacagga
    25621 agatcacttt tatcctcctg aacataggaa acccctccta agaatcagat aaaaggataa
    25681 gcgatttctg aaggatctga aagagggtaa gagaatgtgg actttatgct cttgaaatag
    25741 ggaaacaact gacctttttt tcctcaacat ggtcaagaaa agaacaaaaa tattttatta
    25801 tattgtgttc catttttctc catctgaaag tggcaaatca atttaggaag aaatttggcc
    25861 tacaacttac tatgaactga tcttctgtaa ataaagagaa ttgaaaatga attttgatgt
    25921 gaatagaaga ggagagggag cgggaaaagg ggagggttgt gggtgggagg gaagttatgg
    25981 gttgggaagt cactgtaatc cataagctgc actttggaaa tttatattca ttaaataaaa
    26041 gttaaaaaaa aaaaagcaaa aaaaaaaaag aaacttgtga caagcataag taattactgt
    26101 gggcaagcat tactgccttt ctattgtggt tgaggtggtc aagagccagt agagacttgg
    26161 cagcttgtta ttatagtctc atttattggt agacaataat tacttgaaaa tcacagcaga
    26221 ggctggtgct gcggcttact aggctaatcc tctgctctgc ggcgccggca ctccaggttc
    26281 tagttccggt tggggcaccg gattctgtcc cggttgcccc tcttccaggc cagctctctg
    26341 ctgtggcctg ggagtgcagt ggaggatggc ccaagtgctt gggccctgca cccccatggg
    26401 agaccaggag gaagctcctg actcctggct tcggatcggc gcagtgcgcc ggccgtagcg
    26461 accatctagg gggtgaacca acagaaaagg aagacctttt tctctgtctc tctctgttta
    26521 actctgcctg tcaaaaataa gtaaataaat aaatgtgcac ttccctattt acaaaaaaag
    26581 aaaatcatag aagaggaatt aattccttgt gatttgactt cacatttatt tgataacttc
    26641 ccatacagtt atttctattt tcccccatga tgttgcttta taggtgattt cattacatga
    26701 agatgggaaa gatagagatg ggaggaggga gggaagaagg gagggagaga aagagagaga
    26761 ttgattgtag gtttgatttc aaaaacattt aaaaataata ttatttaatt aaaccaagtt
    26821 gaatctgaca gaaaattatc ctaagtatcc atcttttgag ctcagagacc tactctgtag
    26881 attttaactt gtgaatcaat atctacttgt atttataggg tgcccagact taagccaaga
    26941 ttctgtctct aaaaaggtga aatctccaat ttaaccttgg aggagtttgc accattgctt
    27001 tcaaggaatg gcccactcag tgaggtgaat aaaatctaat tctgagtgaa aatctgagaa
    27061 aacatcaaag taaaagtcat acactgaaat accaattgct ctataataat tagaaatata
    27121 taagttgtat tcaaaaaagt tcatggaaat gcattttatg aaaagcatac atgaaaatat
    27181 gtttttctac caagatcaat ttatctatta atttcatttt tcaatggagt ttttaaagaa
    27241 tccctgcact ctagcaaagt ggtatctggg ttaactttac actttttgat gagataaata
    27301 aattcttcct aataaataaa gactacaata ttccctaatt cttcaactat tttgtgtatg
    27361 cccctctttc tatgagttct gattttctta gcgctatttt attttactta tccttagaag
    27421 atgctaagtt gaagtagctg aaaaattgtg cttctgggaa acaaaactac agcaaacagc
    27481 tgaactatgt ttagtcaaga aagcagacag tagattgctg aaatgtcaca catgaggaac
    27541 ttaatataga acaccaaata tattcagaaa cattcatgct gaaggccatt tttgaactgt
    27601 acagcttcac attcacctag atgggtgtca acaggtggat tctgttgacg gagcttcatg
    27661 ttcatccact taattatact ctgattagga ccactttcgt atagaacatt aggaaacagg
    27721 aaagcctgcc agaatggagt caaggaagga aacttgtgcc tctgcctctt tctgagaaca
    27781 catattttgt gtatgtgtat gtgtgtgtgt gtgagtgtgt gtatgtgagt gtatgtgtgt
    27841 gtgtatgtgt gttgcagtgc ttccgggagc tggaggcaaa gtagagcaga ctatctgatc
    27901 cctacaggaa agaagcaggc tcatgctgtg aggcagcagt gaatctcagt tactttctgg
    27961 gaactttgga tggtgtagat ttgagactga ggctctggac ataaccacat cgatcgctat
    28021 ttatcctgtc ttgaaactta ctatgtccaa tacatcagaa cttttgatag tcctatgact
    28081 tgtaggttat ctccacaggg ttatgggtaa cagacggaaa agtctgtttc acattctaat
    28141 cttgctgggg cagcagcaaa ccctcccctg aacccaagat atagcactgg tgctggtgcc
    28201 agtacaagac agtgacaaaa tgtaacttag gggatctagt ttgaacttga gaaatggcgg
    28261 aggaggtatt tagaaaggtg ccaatttctg ctaaccaaga ctgcaaagtt gtagtcttga
    28321 caattctgct ctttacataa aattgaagca ttttctgtgt tgttctttag taagttattt
    28381 aatttttcta tgaaaaagaa ttcttgaaaa ggatttaccc aaacttccta gaacaccttc
    28441 tttcatatat atacacctga catcagataa aataggtatg aaagaggata tttaaaagtc
    28501 tccttgtaaa gtgatgacta aactaatgac tttaaattta ataaaatgaa aaccaaaggc
    28561 aaaattgcta aagccaagtt aaaaaaagaa acttaaaact caaaaggctt ccatagtctt
    28621 ggcaactcat gacaagagcc tagggtgatt actgatgcca taaacaagag tgtcaatttg
    28681 ttaagtcaac aacaggagtc accgtgcact tactcctcat gtaggatctc tgtccttaat
    28741 gtgctgtaca ttatgattta atgatataac tagtactcag tgtttttcac tttgtgtttc
    28801 tatgtgggtg caaactgttg aaatctttac ttaatatatg ctaaactgat cttctgtata
    28861 taaagagaat tgaaaatgaa tcttgatgtg aatggaaggg gagagggagt gggaaagggg
    28921 agggttgcgg gtgggaggga agttatggtg ggggggaagc cattgtaatc cataagctgt
    28981 actttggaaa tttatattca ttaaataaat gttaaaaaaa aaacatttgg aagacctctt
    29041 tccccagtat tcagcatttg aaatgccttc ctaggggtaa attatatgct cattgctcca
    29101 gtctttgaaa gaggaataag tataaagtat ttaaattctg tcccttttac cccatgatca
    29161 tatcctgaac acataattgt tacatatttc cgtataccca gagaagaatc aggccttccc
    29221 ataagtgaaa gaatgagcag aatgcagttt gttcttgagt acctgatggg tgtctgagca
    29281 gtttaaaatg gtgagaaata actgatggat ttagcagtga tggggacagt aatttggaga
    29341 atagcatgat agaaagggaa ttttgtgaat ttttctggaa aaactggaaa tagagaattt
    29401 gttttttttt ttttctggct cactgaaaac acattaatgt aattggagtt tcagagtgaa
    29461 atgacaagag aattctattt gacatttgtt gccagtaatg gcaaaggaga gagaaagatg
    29521 tttctagaca ttatggtacc tgcattagaa aatggttcta tggtaaggaa aagtatggct
    29581 ctttccatac tggaaggaat ggcagtttag gggagggata tactgatgag tcagggatga
    29641 tgcaaagagg ggagaatggg cagcagggtc tgtgggaatg ctgagatcct ccactgagct
    29701 cagaagcact tcctcctttg cttttttcat tgcactttta acaatgacaa gcagaactcc
    29761 acactccaca cttcagaagt ctttcctagc cggagaagtt agcaatttat tatgtgggag
    29821 aaaagatgag aaagagaaag acattgggaa aagctataaa gatagaaggg taaactttag
    29881 taaaagcaat caagtaaaca aaaaatgtat agagaaaaca gggctgttgg tggagagttt
    29941 ggatcaaagg aatgtttcta aaagatagca ttcttagtgg tttagaaggg atgaaacagc
    30001 tcaggctgta aatgtcttcc catgctccta ccaccacctt caagatccag aaacatcccc
    30061 gtttttccct agacactttc ccagatcacc aaggcagagc taattaggca tccagtgtgg
    30121 gtatggcatt gtaagccatt gcttttccta gcgttacagc actggcatgt agctggtttt
    30181 gacacatgtg tagttttcac ctcattgaga ggtccctgtt tccattgctt caccccccaa
    30241 accataacag agtaaaaatt ccacaaaata aatttgattc aaatttattc tattttaaaa
    30301 gtttatttat ttgattatat gataaaatta aatgagtaac aacaacaata acatataatg
    30361 aataaattaa taagtgtttt tcttaccagg aagatttact ccaaagcaga aggaagtcgt
    30421 acttataact atggcacagg tttcatccat tctgtttttt gtaattattt tggatattct
    30481 gaagacataa agagatctct ctctctctct ctctctctct ctacctatct atttatctat
    30541 ttaagtggat ttcaacacac aaatcttctc ccttttctgt gccttaaatc ctcatttgta
    30601 tgataaataa ttgcagagaa aatttttcat aggcttacca ggctctaata acaaaaatta
    30661 tataaataaa tttggcaaga aaggtgtttt cagtagcaat tagtactgct ggtatgggtc
    30721 tgggagatac atagaaggaa ggctgagtct gtcagactcc taagccattg ccataactgc
    30781 caaggacagg ggtgctgtca tcacccagac ctcaccctgc agagccacac cctggtgttg
    30841 gccaatctac acacggggta gggattacat agttcaggac ttgggcataa aaggcagagc
    30901 agggcagctg ctgcttacac ttgcttttga cacaactgtg tttacttgca atcccccaaa
    30961 acagacagaa tggtgcatct gtccagtgag gagaagtctg cggtcactgc cctgtggggc
    31021 aaggtgaatg tggaagaagt tggtggtgag gccctgggca ggttggtatc ctttttacag
    31081 cacaacttaa tgagacagat agaaactggt cttgtagaaa cagagtagtc gcctgctttt
    31141 ctgccaggtg ctgacttctc tcccctgggc tgttttcatt ttctcaggct gctggttgtc
    31201 tacccatgga cccagaggtt cttcgagtcc tttggggacc tgtcctctgc acatgctgtt
    31261 atgagcaatc ctaaggtgaa ggctcatggc aagaaggtgc tggctgcctt cagtgagggt
    31321 ctgaatcacc tggacaacct caaaggcacc tttgctaagc tgagtgaact gcactgtgac
    31381 aagctgcacg tggatcctga gaacttcagg gtgagtttgg ggacccttga ttgttctttc
    31441 tttttcgcta ttgtaaaatt catgttatat ggagggggca aagttttcag ggtgttgttt
    31501 agaatgggaa gatgtccctt gtatcaccat ggaccctcat gataattttg tttctttcac
    31561 tttctactct gttgacaacc attgtctcct cttattttct tttcattttc tgtaactttt
    31621 tcgttaaact ttagcttgca tttgtaacga atttttaaat tcacttttgt ttatttgtca
    31681 gattgtaagt actttctcta atcacttttt tttcaaggca atcagggtat attatattgt
    31741 acttcagcac agttttagag aacaattgtt ataattaaat gataaggtag aatatttctg
    31801 catataaatt ctggctggcg tggaaatatt cttattggta gaaacaacta catcctggtc
    31861 atcatcctgc ctttctcttt atggttacaa tgatatacac tgtttgagat gaggataaaa
    31921 tactctgagt ccaaaccggg cccctctgct aaccatgttc atgccttctt ctttttccta
    31981 cagctcctgg gcaacgtgct ggttgttgtg ctgtctcatc attttggcaa agaattcact
    32041 cctcaggtgc aggctgccta tcagaaggtg gtggctggtg tggccaatgc cctggctcac
    32101 aaataccact gagatctttt tccctctgcc aaaaattatg gggacatcat gaagcccctt
    32161 gagcatctga cttctggcta ataaaggaaa tttattttca ttgcaatagt gtgttggaat
    32221 tttttgtgtc tctcactcgg aaggacatat gggagggcaa atcatttaaa acatcagaat
    32281 gagtatttgg tttagagttt ggcaacatat gccatatgct ggctgccatg aacaaaggtg
    32341 gctataaaga ggtcatcagt atatgaaaca gccccctgct gtccattcct tattccatag
    32401 aaaagccttg acttgaggtt agattttttt tatattttgt tttgtgttat ttttttcttt
    32461 aacatcccta aaattttcct tacatgtttt actagccaga tttttcctcc tctcctgact
    32521 actcccagtc atagctgtcc ctcttctctt atgaagatct tattaaagca gctgggacag
    32581 ggacagaaaa agggctttga ctgcctttct cttgagccct tttcctgatc tccacaactc
    32641 actgatacca ctggtctcat tggaaggggt gggctgttaa cagtgtgaca aatgtaggaa
    32701 taaactggat gcaaaagggg gctttgtgca gctttatatt cactgttgtc ttaaaccctt
    32761 tttatggact caaatcaaat gacagtccct caggatgtta gcttctgaat tcagaaagtg
    32821 attgcagagt tgcccactcc tttatcctgt gtctgatggt tttgctgtct ctgtagtgat
    32881 tagcttatgt caccatttcc tcattcaata ggcactaggt ggatgaaagg ttctggttca
    32941 ctccccaaat acctgcaaca gtcaggagtg tgtcaggcca aaaccagaaa acaggaattg
    33001 ccatggggtc tccatgatgg gtggcaggga ctcaagtaca tgagccatat tcggctgctt
    33061 ccaggtacat tagcagaaaa ctagatcaga agtggagctg tggggaccag aataaacact
    33121 ttgatatggg atgttggtgt ctcaagtagc aacttaaccc cctgctcact aaaacactct
    33181 aatcctcatt acctaggagc aactgagcct gagggctatc taatatagct ggtgacacag
    33241 agatcatata ccctggctaa aagcatggct gaatccatga aagaaaatat atgctcaaaa
    33301 taggaataga atacacagat ttatgcacag atgcttacaa attttagcca atcctgatga
    33361 catggttaac ttggagatct agatcagttc ttgccagcat gcccagagaa tagtacatgg
    33421 gaaaatttat agagatgatg agttagagac aaagtgagtg ataatgacat tgcctgggat
    33481 tgctgctagg tacactgaaa aatcagggag gaagatccaa taaatgaccc attcaaaatc
    33541 tagaaaacct gtcaacagga actttggaaa cttatttcta atgtatctga acatcaaggc
    33601 agcaataagt ctttctgtaa aatcattaaa tatgcccaaa tgtcaagttc tatgtgagtc
    33661 atgaaggtaa cttgataatg ctctacactt catattttgt tcattgttta atacaaaacg
    33721 caatttttat tttatttatt taatttttaa ctgtttattt aataaatata aatttccaaa
    33781 ttacagctta tagattacaa tggcttcatc ctcataactt gccttgccaa cctgcaaccc
    33841 tcccatctcc tgctccctct cccattccat tcacatcaag attcattttc aattatcttt
    33901 atatacagaa gatcaattta gtatatatta agtaaagatt ttaacagttt gcacccacac
    33961 agaacataaa gtataaatac tgtttgagta ctagttatag cattaattca cattgaacaa
    34021 cacattaagg acagagatcc tacatgagga gtaagtgcac agcgactcct gtcgttgact
    34081 taacaaattg acattcttgt ttagggggtc agttatctcc ccaggctcct gtcatgagtt
    34141 accaaggcta tggaggcctt ttgagttcac tgacttcgat cttatttaga caaggtcata
    34201 gtgaaagtgg aagtccactc ctccctttag agaacggtac ctccttcctc aatggcccat
    34261 tctttcaact gggatctcgc tcacagagat ctttcattta gctcatttaa ctcctttttt
    34321 tttttttttt tctagagcat cttacctttc cattgcctga aatactttca tgggctcttc
    34381 agccagatgt gaatgcctta agggctgatt ctgaggccag agtgctgttt aggacatgtg
    34441 ccattctatg agtctgatgt gtatcccatt tcccatgttg gaatgttctc tccattttta
    34501 attctgtcag ttagtattag cagacactag tcttgtttat gtgatccctc tgactcttat
    34561 gcctatcatt acgatcaatt gtgaacagaa attgatcact gggactagtg agatggcatt
    34621 ggaacatggc cacctcaatg ggattgaatt cgaatcccct ggtctgtttc taactctacc
    34681 atttgaggta agtcagtttg agcatgtccc gaattgcaca tctcttccct ctcttattcc
    34741 cactcttata tttaacaggg attacttttc agttaaattt aaacacctaa gaataattgt
    34801 gtgttaatta cagagttcaa ccaatagtat taagtagaac aaccaaaaaa tactaaaagg
    34861 gataaagtat tacattgtac atcaacagtc aggacaaggg ctgttcaagt cactgtttct
    34921 catagtgttc atttcacttt gacaggtttc ctttttggtg ctgggtcagt tgtcactgat
    34981 cagggagaac atatgatatt tgtccctttg ggactggctt atttcactca gcatgatgtg
    35041 ttccagattc ctccattttg ttgcaaatga ccggatttca ttgttttttt ttgcttctat
    35101 atagtattct atagagtaca tgtcccataa tttcttcctc cagtctactg ttgatgggca
    35161 tttgggttgg ttccaggtct tagctattgt gaagtgagcc gcaataaaca ttgaggtgca
    35221 gacagcttgt ttgtttgcca atttaatttc ctttgggtaa attccaggag cgggatggct
    35281 gggttgtatg gtagggttat attcaggttt ctgaggatct ccagactgac ttccataggg
    35341 gcttaaccag tttgcattcc caccaacagt gggttagtgt ccctttctcc ccacatcctt
    35401 tccagcatct attgttggta gatttctgta tgtgagccat tctaagcggg gtgaggtgaa
    35461 acctcattgt ggttttgatt tgcatttccc tgattgctag cgatcttgaa catttcttca
    35521 tgtggatgtt ggccatttgg atttcctctt ttcaaaaatg gcaagtgagg tccttggccc
    35581 atctcttaag tgggttgttt gttttgatgc tgtggagttt ctttatgtct ttgtggattc
    35641 tagctattaa tgctttatct gttgcttagt ttgcaaatat tttttcccat tctgtcagtt
    35701 gcctcttcac ttcctgactg cttcttttgc agtacagaac ttctcaattt gatgtaatct
    35761 caatagttaa ttttggcttt gactgcctgt gcctccaggg tcttttccaa gaagtctttg
    35821 cggtgccaat atcttgcagg gtttctccaa tgttctctaa taacttcatg gtgtcgggtc
    35881 atagatttag gtctttaatc catgttgagt ggatttttgt gtaaggtgta aggtaggggt
    35941 ctttcttcat gcttcagcac gtggaaatcc cagcaccatt tattgaatag actgtccttg
    36001 ctccaggaat tggttttaga ttcctgatca aatataagta ggctcttgta tcccttcaat
    36061 ttctttttct tgcctaacag ctctggctaa agcctccaga aatatactga atagcagtgg
    36121 tgagaatgga tatccctgta tggtaccaga tctcagtgga aatgcttcca actttttccc
    36181 attcaatagg atgctggtcg tgggtttttc ataaattgct ttgattgtat tgaggaacat
    36241 tccttctata cccagtttac ttagagtttt caccatgaaa gggtcttgtg ttttattgaa
    36301 tgctttctct gcatctattg agataatcat atggtttttc ttctgcagtc tgttaatgtg
    36361 gtgtatcaca tttgcaaaca cttgaaccat ccctgcatac cagggttata tcccacttgg
    36421 tctgggtgaa tgatctttct gaaatgttgt tgcactccgt tggccagaat tttattgaga
    36481 atttttgagt ctatgttcat taggtatatt gttctgtaat tttctttcaa tgctgcatct
    36541 ttttccggct taggaattaa ggtgatgctg gattcataga aagattttgg gaggattccc
    36601 tctttttcaa ttgttctgaa tagtttgaga agaattgagt tagttcttct ttaaatttct
    36661 ggtagaattc agtagtgaat ccatctggtc ctgggctttt ctttgttggg agggccttta
    36721 ttactgtttc aatttctgcc tcagttatgg gtttgtttag gctttcgatg tcttcctggt
    36781 tcaatgtagg taggttgcag gtgtccagga atctatgcat ttctgataga tttccctgtt
    36841 tgctggcata cagtccttgt agtaatttct gatgattctt ttcatttctg tggtgtctgc
    36901 tgttacattt cctatttcat ctctgatttt attgatttgg tctcttcttc ttttagttag
    36961 ttgagctaat gcggtatcaa ttttgtttat tttttcaaaa aaccagctcc ccatttggct
    37021 gatttttggt aatttttttg gattcaatcc tgttgatttc ttctctgatt ttaattattt
    37081 ctcttctcct actagatttg ggtctgcttt gctgcagttt ttctagatcc ttgaggtgat
    37141 ttgaaagctc atctatttgg tgcctttcca atttcttgat gtaggcacct attgatataa
    37201 acttttctct taacactgct ttcgctgcat ctcatacatt ttggtatgtt gtgctgttat
    37261 cctcatttac ttccagaaag tttttgattt ctcttttgat ttctttgatg acctagtgtt
    37321 cattcaggag catgttgttc actctccatg tgtttgcata tgctgtaggg attcctgagt
    37381 tgctaatttc cgacttcatt ctattatggt ctgagaagct gcatcatatg attctaattc
    37441 ttttgaatgt gctgagactt gctttatggc ctagtatgtg gttaatctta gagtaggttc
    37501 catgtactgc tgagaagaat gtaaattctt taagtgcagg atgaaaagtt ctgtagatat
    37561 gtgtcagatc catctgggct atagtatcct ttgaatgtac tgtttcctgt agtcttctgt
    37621 cctgtgatct gtctatttct gagagtggag tattgaagtc ccccagtact attgtattgg
    37681 agtctaagtc tccctttaac tctcttaaca aatcttttaa ataaaccggt gccctgtaat
    37741 taggtgcata tacattgata atcgttatat tttttctgtt gaattcatcc cttaatcatt
    37801 atgtagtgcc cctctttgtc tctcttaaca gtttttgtgc taaagtttat tttgtctgat
    37861 attaagatgg ttatgcctgc tcttttttca tttctgttgg catggaatat ctttctccag
    37921 cctttcacat tcagtctgga tgcatctttg ttggaaagat gtgtttctgt aagcagcaaa
    37981 tagatgggtt ttgttcctta acccaatcag ccaatctatg tcttttaact ggacagttca
    38041 ggccattaac gttcagtgtg actattgata agtgtaattt tgccctgcca ttttcccaaa
    38101 gatatatttt ttttctccaa aattgtattt attttacatt tattcttttt ttttatcttt
    38161 tatttaatga atataaattt ccaaagtacg actcatgggt tacaatggct tcccccccat
    38221 accgtccctc ccacccacaa ccctcccctt tcccactccc tctccccttc cattcacatc
    38281 aagattcatt tttattatct taatatgcag aagatcagct tagtatacct taagtaagga
    38341 tttcaacagt ttgctcccac acagaaacat aaagtgaaaa ataatagatg attttttttt
    38401 aaatgatgat gaaatcagat gagacctatt gtcatgttta atcccagtga gagtcaagtt
    38461 gggaattgat aatttctttt ttttttcttt tctttttttt tttacagaag atcagtttag
    38521 tgtacattaa gtaaagattt caatcgtttg cacccccata gaaacacaaa gtgaaatata
    38581 ctgtttgagt actcgttata gcattaagct tcagtgtaca gcacattaag gacagagatc
    38641 ctacatgagg agtaagtgca cagtgactcc tgttgttgac tttacaaatt gacactcctg
    38701 tttatggcat cagtaatctc cctatgcacc agtcatgagt ttccaaggct atggaagccc
    38761 cttgagttct ccgactctta tcttgtttag acacggtcat agtcaaagtg gaggttctct
    38821 cctcccttca gagaaaggca cctccctctt tgaagacctg ttctttccac tgggatctca
    38881 ttcacagaga tctttttgcc agagtgtctt ggctttccat gcctgaaata ctctcatggg
    38941 cttttcagcc agatccgagt gcctttaggg ctgattctga ggccagagtg ctatttagga
    39001 cactccgcca ttctatgagt ctgctgagta tctcacttcc catgttggat cactctcccc
    39061 tttatttatt ctatcggtta gtgttagcag atactagact tgtttatgtg ctccctttga
    39121 ctcttagtcc tttcattatg atcaattgtg aactgaaatt gatcacttgg aatagtgaga
    39181 tggcattggc acatgccacc ttgatgggat tgaattggaa tcccctggta tgtttccaac
    39241 tctaccaatt ggggcaagtc agcctgagca tgtcccaaat tatacatctc ttccctctct
    39301 tattgccact cttatgttta acagggatca catttcagtt aattttcaac acttaagaat
    39361 aactgtgtga taattacaga attaaaccag tcatattaag tagaacagac aaaaaaaata
    39421 ctatgaggga taatgtatta agttgtccat tagcagtcag ggctatgctg atcaagtcac
    39481 catttctcat agtgtccatt tcacttcagg aggtttcctt tttggtgttc agtcagttgt
    39541 caccgatcag ggagaacata tggtatttgt ccctttggga ctggcttact tcactcagca
    39601 tgatgtgttc cagattcctc cattttgttg caaatgactg gatttcgttg tttcttactg
    39661 cggtatagta ttctaaagaa tacatatccc ataatttctt tatccagtct accattgatg
    39721 ggcatttagg ttggttccag gtgttggcta ttgtgaattg tgctgcaaca aacattaggg
    39781 tgcagaccgc ttttttcttt atcaatttaa actcctttgg gtaaattcca aggagtggga
    39841 tggctgggtc gaacggtagg gttatattca ggtttctgag gaatctccag actgatttcc
    39901 ataggggctt gaccagtttg cattcccacc aacagtgggt tagtgtccct ttttccccac
    39961 atcctcacca gcatctgttg ttggtagatt tctgcatgtg aaccattcta accggggtga
    40021 ggtgaaacct cattgtggtt ttgatttgca tttccctgat tgctagtgac cttgaacatt
    40081 ttttcatgtg cctgttggcc atttggattt cctcttttga aaaatgtcta ttgaggtcct
    40141 tggtccatct cttaagtggg ttgtttgttt tgatgttgtg gagtttcttg atttctttgt
    40201 agattctggt tattaaccct ttatctgttg catagtttgc aaatattttt tcccattctg
    40261 ttggttgtct cttcactctc ctgactgttt cttttgcagt acagaaactt ctcaatttga
    40321 tgcaatccca atagttgatt ttggctttga ctgcctgtgc ctcccgggtc ttttccagaa
    40381 attctttgcc tgtgccaata tcttgaaggg tttctccaat gttctctaat aacttaatgg
    40441 tgtcaggtcg tggatttagg tctttaatcc atgttgagtg gatttttgtg tgagatgtaa
    40501 ggtaggggtc ttgcttcatg cttctgcacg tggaaatcca gttttcccaa caccatttat
    40561 tgaatagact gtccttgctc caggaattag ttttagatcc ttgatcaaat aaaagttggc
    40621 tgtagatgtt tggattgatt tctggtgttt caattctgtt ccattggtct atccatctgt
    40681 ttctgtacca gtaccatgct gttttgataa ctactgccct gtagtatgtc ctgaagtctg
    40741 gtattgtgat gcctccggct ttgtttttgt tgtacaagat tgctttagct attcgaggtc
    40801 tcttgtgcct ccatatgaat ttcagcatca ttttttctag atcatagaag aatgtctttg
    40861 gtatcttgat tggtattgca ttgaatctat aaattgcttt tgggagaatg gacattttga
    40921 tgatgttgat cttccaatcc atgagcatgg aagatttttc cattttttgg tatcctcttc
    40981 tatttctttc tttaaggttt tgtaattttc atcgtagaga tctttaacgt ccttggttaa
    41041 gtttattcca aggtatttga ttgtttttgt agctattgtg aatgggattg atcttagcag
    41101 ttctttctca gccatggcat tgcttgtgta tacaaaggct gttgattttt gtgcattgat
    41161 tttatatcct gccactttgc caaactcctc tatgagttcc aatagtctct tagtagagtt
    41221 ctttggatcc tctaagtaca gaatcatatc gtctgcaaag agggatagtt tgacttcttc
    41281 cttcttgatt tgtattcctt tgatttcttt ttcttgtctg atggctctgg ctaaaacatc
    41341 cagaactatg ttaaatagca gtggtgagag tgggcatccc tgcctggtgc cagattttag
    41401 tggaaatgct tccaactttt ccccattcaa taggatgctg gctgtgggtt ttttataaat
    41461 tgctttgatt atattgagga atgttccttc tatacccaat ttgcttagag ttttcatcat
    41521 gaaagggtgt tgaattttat caaatgcttt ctctgcatca attgagataa ccatatggtt
    41581 tttcttctgc agtctgttaa tgtggtgaat cacattgatt gatttgcgaa cattgaacca
    41641 tccctgcata ccagggatga atcccacttg gtctgggtgg atgattttcc tgatgtgttg
    41701 ttgtattcta ttggccagaa ttttattgag gatttttgca tctatgttca tcagggatat
    41761 tggtctgtaa ttttctttca gtcctgcatc tttctctggc ttagggatta aggtgatgct
    41821 ggcttcatag aaagaatttg ggaggattcc ctctttttcg attgttctga atagtttgag
    41881 aataatggga ttagttcttc tttaaatgtc tggtagaatt cagcagtgaa tccatctggt
    41941 cctgggcttt tctttgttgg gagggccttt attactgttt caatttctgt ttcagttatg
    42001 ggtctattta ggttttcgat gtcttcatgg ttcaattttg gtagattgca tgtgtccagg
    42061 aatctatcca tttctgatag gttttcctgt ttgctggcat acaggtcctt gtagtaattt
    42121 ctgatgattc tttttatttc tgtggtgtct gttgttacgt ttactttttc atctctgatt
    42181 ttattgattt gggtcttttc tcttcttttt ttagttagtt gggccaatgg ggtgtcaatt
    42241 ttgtttattt tttcaaaaaa ccagcttctc gcttggctga ttttttgtaa tgtttttttt
    42301 ttttttgatt caatcctgtt aatttcttct ctggttttaa ttatttctct tctcctacta
    42361 gatttgggtt tggtttgctg cagtttttct aggtccttga ggtgcactga aagctcattt
    42421 atttggtgcc tttccaattt cttgatatag gcacctattg ctataaattt gcctctcaat
    42481 actgcttttg ctgtatccca taagttttga tatgttgtgt tgttgtcttc atttacttcc
    42541 agaaagtttt tgatttctct tttgatttct tgaatgaccc agtgttcatt caggagcatg
    42601 ttgttcagtc tccatgtgtt tgcatacttt cttgggtttc ctgagttgct aatttccagc
    42661 ttcatcccac tgtggtctga gaagctgcat ggtatgattc taattctttt aaatttgctg
    42721 agacttgctt tatggcctag tatgtgatca atcctagaga aggtcccatg cgctgctgag
    42781 aagaatgtga agtctgtaga tgtagggttg aaagttctgt agatatctgt tagatccatt
    42841 tgggcaatag tgtcaattaa atctgctgtt tccttgttga tcttctgtcc ggatgatctg
    42901 tctatttctg agagtggagt attgaagtcc cccagtacta ttgtattgga atctaagtct
    42961 ccctttaagt cccttaacat atcttttaaa tagaccggtg ccctgtaatt aggtgcatat
    43021 acatttataa tagttacatc ttcctgttga attgaaccct taatcattat atagtgtccc
    43081 tctttgtctc tcttaacagt ttttgtatta aagtttattt tgtctgatat taatatggct
    43141 acacctgctt tttttggttt ctgttggcat ggaatatctt tttccaacct ttcactttca
    43201 gtctgcatgc ctctttgtta gagagatgtg tttcttgtag gcaacaaata gttgggttgt
    43261 gttctgtgag ccagtcatcc aaacggtgtc ttttaactga agaattcaga ccattaatgt
    43321 tcaatgtgac aattgatacg tagtgacttt gccctgccat tttcccggaa atattttcta
    43381 gtatatgctt tgagcttccc atgctctttt actggtaggt gttcttcctt tcccttcttt
    43441 catattgatg gccgtgtttc tgtgtttctg agtgtagcac atctttaagt atcttttgca
    43501 gggccggaca agtggccaca aagtctttca atttctgttt gctatgaaag gtctttattt
    43561 caccttcatt cacaaatgag agcttggcag gatataatat tctgggctgg caatttttct
    43621 ctcttagcac ctgtgctatg tctcgccatt ccctcctagc ttgtagggtt tctgatgaga
    43681 agtctgctgt gagtctgatt ggagatcctc tgagagtaat ctgatgtttc tctcttgcac
    43741 attttaggat cttttcttta tgtttcactg tggtaagttt aattaccacg tgtcgtgtga
    43801 ggatctcttt tggtcatgtt tattgggggt tctatgagct tcctgtacta ggatatctct
    43861 gtccttctcc aaacctggaa agttctctgc tagtatctca ctaaaaaggc cttccaatcc
    43921 tttctctctc tccatgcctt caggaactcc tagaactcga atgttggttt ttttaatagt
    43981 atcctgtaga ttcccaacaa tattttttag atttctaatt tcctcttctt ttctttggtt
    44041 tgactgtata ctttcctgtg ctctgtcttc taagtccaat attctgtctt ctgcttcacc
    44101 cattctgttt ttaaggcttt ctaatgtgtt tgccatttga tctattgagc tcttcatttc
    44161 attttgattt ctcttcacta ttacactttc ctgttctact agtttctgag tttcattttg
    44221 actcttcctt aaaatttcat tttcacgaga gagattttca atcttgtcca ttaaggattt
    44281 ctgtagttca aggatttgct tttgaaaact tctaaatgtt cttatcataa attttttgaa
    44341 atccgtatct tgcatttctt ctatctcatc atcttcatac tcttggcttg gggtgttttg
    44401 cttatttgga ggcatcatag tgtcatcgtt gatcttgctc cctctatttc tgtgtttgtt
    44461 actcggcata gttaattctt cttgtgtcac tgtgcgtttt ttttttttct tttttttttt
    44521 tttttatact gtgtccatgt taagtggact gcctgctgtt ggaggagcct tggaggcttg
    44581 agatgggtgc ggcc
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       YSCPROMA      162 bp ds-DNA             PLN       13-FEB-1990
DEFINITION  S.cerevisiae promoter region, clone pUT340.
ACCESSION   M32023
KEYWORDS    promoter.
SOURCE      S.cerevisiae (strain FL200) DNA, clone pUT340.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 162)
  AUTHORS   Gatignol,A., Dassain,M. and Tiraby,G.
  TITLE     Cloning of Saccharomyces cerevisiae promoters using a probe vector
            based on phleomycin resistance
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.L.Stassi, 08-FEB-1990.
BASE COUNT       46 a     25 c     29 g     62 t
ORIGIN      
        1 gatctttttt tttttttttt ttttttttgt gctgcaaagt ttcttaaagc cttcgggctt
       61 acgaaatcct ttatcaccga aaggggaccg cttcgaaaag tggatataaa acaaggtatt
      121 tattttttat agacaatgac caaatgacag gatagatcca tg
//
LOCUS       YSCPROMB      270 bp ds-DNA             PLN       13-FEB-1990
DEFINITION  S.cerevisiae promoter region, clone pUT341.
ACCESSION   M32024
KEYWORDS    promoter.
SOURCE      S.cerevisiae (strain FL200) DNA, clone pUT341.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 270)
  AUTHORS   Gatignol,A., Dassain,M. and Tiraby,G.
  TITLE     Cloning of Saccharomyces cerevisiae promoters using a probe vector
            based on phleomycin resistance
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.L.Stassi, 08-FEB-1990.
BASE COUNT       74 a     47 c     55 g     94 t
ORIGIN      
        1 gatatatttt cgcggctcgc gttttgtttg cttcttattt tacactgagt tttcgtgccg
       61 caaacgtgga gatgggaaaa agaaaagtcg ggaaaataat gagaaatttc tacttttggt
      121 attcctcata cagcctgcgc ggtttattag taaaataccc gataatcctc gaggtttgaa
      181 aaacttttcc ctctactact gttgacacgg atttttttat ttaagaggaa aagtcgtggt
      241 tgttttcctc gaacaaatta gatatccatg
//
LOCUS       YSCPROMC      343 bp ds-DNA             PLN       13-FEB-1990
DEFINITION  S.cerevisiae promoter region, clone pUT342.
ACCESSION   M32025
KEYWORDS    promoter.
SOURCE      S.cerevisiae (strain FL200) DNA, clone pUT342.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 343)
  AUTHORS   Gatignol,A., Dassain,M. and Tiraby,G.
  TITLE     Cloning of Saccharomyces cerevisiae promoters using a probe vector
            based on phleomycin resistance
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.L.Stassi, 08-FEB-1990
BASE COUNT       87 a     81 c     64 g    111 t
ORIGIN      
        1 ctaaaataca gcaaacttgc attttggtca tttatggtta gcaagtttaa gtgctactac
       61 attacatatt acctgaccct gaaaacgggc ttaaaagccc atctactata gtgccgctgc
      121 gggtaaccgt agcttcttgc tttttttccc accgtcattt attttctagt cttctttccg
      181 agaaatgccg gcggcgtttg ccgttccgtt cgcaatctct cgagaagttc gagagatgaa
      241 aaattttctt gggaaagcca agaaggtacg atcgtaggca tgattcatat ataaactctt
      301 tcccaatctt tttggttacg tacatactcc cccctgatcc atg
//
LOCUS       YSCPROMD      328 bp ds-DNA             PLN       13-FEB-1990
DEFINITION  S.cerevisiae promoter region, clone pUT343.
ACCESSION   M32026
KEYWORDS    promoter.
SOURCE      S.cerevisiae (strain FL200) DNA, clone pUT343.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 328)
  AUTHORS   Gatignol,A., Dassain,M. and Tiraby,G.
  TITLE     Cloning of Saccharomyces cerevisiae promoters using a probe vector
            based on phleomycin resistance
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.L.Stassi, 08-FEB-1990.
BASE COUNT       95 a     80 c     63 g     90 t
ORIGIN      
        1 gtcatatata accataacca agtaatacat attcaaaatg ggtgttgaac aaatcttaaa
       61 gagaaagacc ggtgtcatcg ttggtgaaga tgtccacaac ttattcactt acgctaagga
      121 acacaagttc gctattccag ctattaacgt cacctcttct tctactgccg tcgctgcttt
      181 agaagctgct agagacagca agtccccaat cattttgcaa acctctaacg gtggtgctgc
      241 ttacttcgct ggtaagggta tctctaacga aggtcaaaat gcttccatca agggtgctat
      301 tgccgctgcc cactacatca gatccatg
//
LOCUS       YSCPROME      123 bp ds-DNA             PLN       13-FEB-1990
DEFINITION  S.cerevisiae promoter region, clone pUT344.
ACCESSION   M32027
KEYWORDS    promoter.
SOURCE      S.cerevisiae (strain FL200) DNA, clone pUT344.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 123)
  AUTHORS   Gatignol,A., Dassain,M. and Tiraby,G.
  TITLE     Cloning of Saccharomyces cerevisiae promoters using a probe vector
            based on phleomycin resistance
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.L.Stassi, 08-FEB-1990.
BASE COUNT       41 a     15 c     21 g     46 t
ORIGIN      
        1 gatcattatt gatagcttac ttctttgagc ggaatatata gagattttgg actcgaagaa
       61 tgctaaatat ttcttagagt taatttttgc aaaaatgaga tatcaatcat ttcatgatcc
      121 atg
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       MUSACTMEL    1857 bp ss-mRNA            ROD       15-MAR-1989
DEFINITION  Mouse A-X actin mRNA, complete cds.
ACCESSION   J04181
KEYWORDS    actin.
SOURCE      Mouse (strain C57BL/6) B16 melanoma F-1, cDNA to mRNA, clone pMA30.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1857)
  AUTHORS   Sadano,H., Taniguchi,S., Kakunaga,T. and Baba,T.
  TITLE     cDNA cloning and sequence of a new type of actin in mouse B16
            melanoma
  JOURNAL   J. Biol. Chem. 263, 15868-15871 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by H.Sadano, 08-SEP-1988.
FEATURES       from  to/span     description
    pept         63     1190     A-X actin
    mRNA    <     1     1857     actin mRNA
BASE COUNT      410 a    498 c    453 g    496 t
ORIGIN      794 bp upstream of BglII site.
        1 cccgcgagca cagcttcttt gcagctcctt cgttgccggt ccactcccgc caccagttcg
       61 ccatggatga cgatatcgct gcgctggtcg tcgacaacgg ctccggcatg tgcaaagccg
      121 gcttcgcggg cgacgatgct cccctggctg tattcccctc catcgtgggc cgccctaggc
      181 accagggtgt gatggtggga atgggtcaga aggactccta tgtgggtgac gaggcccaga
      241 gcaagagagg tatcctgacc ctgaagtacc ccattgaaca tggcattgtt accaactggg
      301 acgacatgga gaagatctgg caccacacct tctacaatga gctgcgtgtg gcccctgagg
      361 agcaccctgt gctgctcacc gaggcccccc tgaaccctaa ggccaaccgt gaaaagatga
      421 cccagatcat gtttgagacc ttcaacaccc cagccatgta cgtagccatc caggctgtgc
      481 tgtccctgta tgcctctggt cgtaccacag gcattgtgat ggactccgga gacggggtca
      541 cccacactgt gcccatctac gagggctatg ctctccctca cgccatcctg cgtctggacc
      601 tggctggccg ggacctgaca gactacctca tgaagatcct gaccgagcgt ggctacagct
      661 tcaccaccac agctgagagg gaaatcgtgc gtgacatcaa agagaagctg tgctatgttg
      721 ctctagactt cgagcaggag atggccactg ccgcatcctc ttcctccctg gagaagagct
      781 atgagctgcc tgacggccag gtcatcacta ttggcaacga gcggttccga tgccctgagg
      841 ctcttttcca gccttccttc ttgggtatgg aatcctgtgg catccatgaa actacattca
      901 attccatcat gaagtgtgac gttgacatcc gtaaagacct ctatgccaac acagtgctgt
      961 ctggtggtac caccatgtac ccaggcattg ctgacaggat gcagaaggag attactgctc
     1021 tggctcctag caccatgaag atcaagatca ttgctcctcc tgagcgcaag tactctgtgt
     1081 ggatcggtgg ctccatcctg gcctcactgt ccaccttcca gcagatgtgg atcagcaagc
     1141 aggagtacga tgagtccggc ccctccatcg tgcaccgcaa gtgcttctag gcggactgtt
     1201 actgagctgc gttttacacc ctttctttga caaaacctaa cttgcgcagg aaaaaaaaaa
     1261 taagagacaa cattggcatg gctttgtttt tttaaatttt ttttaaagtt tttttttttt
     1321 ttttttttta agtttttttg ttttgttttg gcgcttttga ctcaggattt aaaaactgga
     1381 acggtgaagg cgacagcagt tggttggagc aaacatcccc caaagttcta caaatgtggc
     1441 tgaggacttt gtacattgtt ttgttttttt ttttttggtt ttgtcttttt ttaatagtca
     1501 ttccaagtat ccatgaaata agtggttaca ggaagtccct caccctccca aaagccaccc
     1561 ccactcctaa gaggaggatg gtcgcgtcca tgccctgagt ccaccccggg gaaggtgaca
     1621 gcattgcttc tgtgtaaatt atgtactgca aaaatttttt taaatcttcc gccttaatac
     1681 ttcatttttg tttttaattt ctgaatggcc caggtctgag gcctcccttt tttttgtccc
     1741 cccaacttga tgtatgaagg ctttggtctc cctgggaggg ggttgaggtg ttgaggcagc
     1801 cagggctggc ctgtacactg acttgagacc aataaaagtg cacaccttac cttacac
//
LOCUS       MUSIGHMM      369 bp ds-DNA             ROD       15-DEC-1989
DEFINITION  Mouse active Ig mu-chain VJ3-region mRNA, partial cds, from
            anti-alpha(1->6)dextran hybridoma 27.7.2.
ACCESSION   M24320
KEYWORDS    immunoglobulin heavy chain; joining exon; mu immunoglobulin;
            variable region.
SOURCE      Mouse (BALB/c) anti-alpha(1->6)dextran hybridoma 27.7.2, cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 369)
  AUTHORS   Sikder,S.K., Borden,P., Gruezo,F., Akolkar,P.N., Bhattacharya,S.B.,
            Morrison,S.L. and Kabat,E.A.
  TITLE     Amino acid substitutions in V-H CDR2 change the idiotype but noT
            the antigen-binding of monoclonal antibodies to alpha(1->6)dextrans
  JOURNAL   J. Immunol. 142, 888-893 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.A.Kabat, 15-MAR-1989.
FEATURES       from  to/span     description
    pept    <     1  >   369     Ig mu chain VJ3-region precursor (AA at 1)
    matp    <     1  >   369     Ig mu chain VJ3-region
    recomb      321      322     V-region end/J3-region start
BASE COUNT       92 a     85 c     96 g     84 t     12 others
ORIGIN      
        1 caggttcagc tgcagcagtc tggagctgag ctgatgaagc ctggggcctc agtgaagata
       61 tcctgcaagg ctactggcta cacattcagt agctactgga tagagnnnnn ntgggtaaag
      121 cagaggcctg gacatggcct tgagtggatt ggagagattt tacctggaag tggtagtact
      181 aactacaatg agaagttcaa gggcaaggcc acattcactg cagatacatc ctccaacaca
      241 gcctacatgc aactcagcag cctgacatct gaggactctg ccgtctatta ctgtgcaaga
      301 cattactacg gtagtagctc ctttnnnnnn gcttactggg gccaagggac tctggtcact
      361 gtctctgca
//
LOCUS       MUSIGHMN      369 bp ds-DNA             ROD       15-DEC-1989
DEFINITION  Mouse active Ig mu-chain VJ3-region mRNA, partial cds, from
            anti-alpha(1->6)dextran hybridoma 27.4b.2.
ACCESSION   M24321
KEYWORDS    immunoglobulin heavy chain; joining exon; mu immunoglobulin;
            variable region.
SOURCE      Mouse (BALB/c) anti-alpha(1->6)dextran hybridoma 27.4b.2, cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 369)
  AUTHORS   Sikder,S.K., Borden,P., Gruezo,F., Akolkar,P.N., Bhattacharya,S.B.,
            Morrison,S.L. and Kabat,E.A.
  TITLE     Amino acid substitutions in V-H CDR2 change the idiotype but noT
            the antigen-binding of monoclonal antibodies to alpha(1->6)dextrans
  JOURNAL   J. Immunol. 142, 888-893 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.A.Kabat, 15-MAR-1989.
FEATURES       from  to/span     description
    pept    <     1  >   369     Ig mu chain VJ3-region precursor (AA at 1)
    matp    <     1  >   369     Ig mu chain VJ3-region
    recomb      321      322     V-region end/J3-region start
BASE COUNT       90 a     82 c     97 g     88 t     12 others
ORIGIN      
        1 caggttcagc tgcagcagtc tggagctgag ctgatgaagc ctggggcctc agtgaagata
       61 tcctgcaagg ctactggcta cacattcagt agttactgga tagagnnnnn ntgggtaaaa
      121 cagaggcctg gacatggcct tgagtggatt ggagagattt ttcctggaag tggtagttct
      181 aactacagtg agaagttcaa gggtaaggcc acattcactg cagatacatc ctccaacaca
      241 gcctacatgc aactcagcag cctgacatct gaggactctg ccgtctatta ctgtgcaaga
      301 cattactacg gtagtagctc gtttnnnnnn gcttactggg gccaagggac tctggtcact
      361 gtctctgca
//
LOCUS       MUSIGHMO      369 bp ds-DNA             ROD       15-DEC-1989
DEFINITION  Mouse active Ig mu-chain VJ3-region mRNA, partial cds, from
            anti-alpha(1->6)dextran hybridoma 27.10.2.
ACCESSION   M28529 M24322
KEYWORDS    immunoglobulin heavy chain; joining exon; mu immunoglobulin;
            variable region.
SOURCE      Mouse (BALB/c) anti-alpha(1->6)dextran hybridoma 27.10.2, cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 369)
  AUTHORS   Sikder,S.K., Borden,P., Gruezo,F., Akolkar,P.N., Bhattacharya,S.B.,
            Morrison,S.L. and Kabat,E.A.
  TITLE     Amino acid substitutions in V-H CDR2 change the idiotype but noT
            the antigen-binding of monoclonal antibodies to alpha(1->6)dextrans
  JOURNAL   J. Immunol. 142, 888-893 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.A.Kabat, 15-MAR-1989.
FEATURES       from  to/span     description
    pept    <     1  >   369     Ig mu chain VJ3-region precursor (AA at 1)
    matp    <     1  >   369     Ig mu chain VJ3-region
    recomb      321      322     V-region end/J3-region start
BASE COUNT       90 a     84 c     98 g     85 t     12 others
ORIGIN      
        1 caggttcagc tgcagcagtc tggagctgag ctgatgaagc ctggggcctc agtgaagata
       61 tcctgcaagg ctactggcta cacattcagt agctactgga tagagnnnnn ntgggtaaag
      121 cagaggcctg gacatggcct tgagtggatt ggagagattt ttcctggaag tggtagtact
      181 aagtacaatg agaagttcaa gggcaaggcc acattcactg cagatacatc ctccaacaca
      241 gcctacatgc agctcagcag cctgacatct gaggactctg ccgtctatta ctgtgcaaga
      301 cattactacg gtagtagctc ctttnnnnnn gcttactggg gccaagggac tctggtcact
      361 gtctctgca
//
LOCUS       MUSIL71      1926 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse interleukin 7 (IL-7) gene, exon 1.
ACCESSION   M29054
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     1 of 4
SOURCE      Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1926)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept       1897  +  1906     interleukin 7, exon 1
    pre-msg    1350  >  1926     IL7 mRNA and introns
    IVS        1907  >  1926     IL7 intron A
BASE COUNT      498 a    478 c    430 g    520 t
ORIGIN      156 bp upstream of XbaI site.
        1 aatccctggt ttctttctct ttcctctctt tccttttttt tttttttttt tttttttttg
       61 gtggacattt ttccatgttc catcgttgaa atgctttctc accacagtag ctctcttcca
      121 tcctgtcata ctgcaacgct ggagcctaat cctattctag atagcccttc tctgtggatg
      181 ggagctgtct ggagaccagg gctgcagtga agataacaag tactgtcaaa tctcctgatg
      241 ctttgagcag gatgacacca agagagacac cagagaggcc tttaaacacc cctgcaggca
      301 ccgggtagga tttattttgg aagtgcacta agacaacagg ttttacattg aatgtggaaa
      361 gtgagagtcc ctgcagagtc cacaggccag tgcccagtta gaactcctat ggtccacact
      421 ctaaatgcac agacagtacc tctctgcctc ttttcactgt taacagacag aatctgaagc
      481 tcgtgtcctg atgtccgatt aggaaccgcc agctccagac ccctttctga ggttccccag
      541 agcgttaaga gcacacccac caccctaggg cgtcttctta gtctactgta tcctccatta
      601 tttacaaggt tctcttccac aagagattcc aagcagactc tttgaagcca gcctgcaggt
      661 gttttttatc caaattcagc cagtgaactt agctctgtta cttaaggaag cctctgttcc
      721 catgtgtatc tctcaactgc agataagaaa acagattgcc tcggcagaaa actccagatg
      781 ccaaaataca tgtaaattcc tgaccaagag tcctggcata gagcaaatac tagatatatg
      841 agagatatgg tcctgatctc agatctctct tgttactggt ttgcacaacc tgccttcacc
      901 taagcagtgt ttctcctgat tataagaaac aacactaaat caggaatccc ctcggtaatt
      961 caggcctatt attaaaacaa aacaaaacac tttaatggaa acaatccaca atgggtggat
     1021 tagctatgtc aggagataac tgtgggagat agttactatg cagaacttct atgtgccccc
     1081 aactaatacg agttttggtt ttctgaagct accacttaca tactagactg gatggactca
     1141 caagaattat ttgttcttaa agtcctacct gagcaggtgc attgcacaga tgacagggag
     1201 gaaaaaaaaa aaaaaacaaa ctagagaacc tagaggggca gagggcgtga ccctcttaat
     1261 catccttcac ttcctttttt tttttttttt ttttaagatg acttacatcg gcggcagcag
     1321 aagcagcgac gcctcctcag tggaactgac ttgcaatagc gagctttctc tgctgcacat
     1381 ttgtggcttc tgtggacata ttagtaacca gcggttttag ctcccagtct cacagagttg
     1441 ccagagaggt tagaagtcat ttgaaaagcc tattagccga atctttctga tccagaaggc
     1501 cagctggctt ctcctgagct actttcaatt cgcagcaacc actgatcctg gtccaggtga
     1561 ctggggaaga cgctgagggt ataaacccaa acattgaacc tgaagaccca gcgcaaagta
     1621 gaaactgaaa gtaccctgct tactctgccg gcagatccta cggaagttat ggcaaagcca
     1681 gagcgcctgg gtggccggtg atgcatgcgg cccctcttgg gatggatgga ccaggcgtgg
     1741 cgtgggtgag aggagtcagc tgcctgaact gccctgccca gcaccggttt gcggccaccc
     1801 ggtggatgac cggggtcctg ggagtgatta tgggtggtga gagccggctc ctgctgcagt
     1861 cccagtcatc atgactacac ccacctcccg cagaccatgt tccatggtaa gcgctgctct
     1921 ctggtg
//
LOCUS       MUSIL72       174 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse interleukin 7 (IL-7) gene, exon 2.
ACCESSION   M29055
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     2 of 4
SOURCE      Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 174)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept    +    21  +   154     interleukin 7, exon 2
    pre-msg <     1  >   174     IL7 mRNA and introns
    IVS     <     1       20     IL7 intron A
    IVS         155  >   174     IL7 intron B
BASE COUNT       44 a     31 c     35 g     64 t
ORIGIN      Undetermined number of bp after segment 1.
        1 tcattaactg ctgggttcag tttcttttag atatatcttt ggaattcctc cactgatcct
       61 tgttctgctg cctgtcacat catctgagtg ccacattaaa gacaaagaag gtaaagcata
      121 tgagagtgta ctgatgatca gcatcgatga attggtacgt gttgttttat tttt
//
LOCUS       MUSIL73       121 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse interleukin 7 (IL-7) gene, exon 3.
ACCESSION   M29056
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     3 of 4
SOURCE      Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept    +    21  +   101     interleukin 7, exon 3
    pre-msg <     1  >   121     IL7 mRNA and introns
    IVS     <     1       20     IL7 intron B
    IVS         102  >   121     IL7 intron C
BASE COUNT       46 a     13 c     20 g     42 t
ORIGIN      Undetermined number of bp after segment 2.
        1 attgttttat attattacag gacaaaatga caggaactga tagtaattgc ccgaataatg
       61 aaccaaactt ttttagaaaa catgtatgtg atgatacaaa ggtaatggta atttttactt
      121 t
//
LOCUS       MUSIL74       172 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse interleukin 7 (IL-7) gene, exon 4.
ACCESSION   M29057
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     4 of 4
SOURCE      Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 172)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept    +    21  /   152     interleukin 7, exon 4
    pre-msg <     1  >   172     IL7 mRNA and introns
    IVS     <     1       20     IL7 intron C
    IVS         153  >   172     IL7 intron D
BASE COUNT       56 a     34 c     32 g     50 t
ORIGIN      About 1.5 kb after segment 3.
        1 tgactttttt cttctaatag gaagctgctt ttctaaatcg tgctgctcgc aagttgaagc
       61 aatttcttaa aatgaatatc agtgaagaat tcaatgtcca cttactaaca gtatcacaag
      121 gcacacaaac actggtgaac tgcacaagta aggtgagagt ggactactca ct
//
LOCUS       MUSMFG1A      508 bp ss-mRNA            ROD       15-MAR-1990
DEFINITION  Mouse zinc finger protein (mfg1) mRNA, partial cds.
ACCESSION   M28513
KEYWORDS    zinc finger protein.
SOURCE      Mouse (strain CD-1) adult male skeletal muscle, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 508)
  AUTHORS   Passananti,C., Felsani,A., Caruso,M. and Amati,P.
  TITLE     Mouse genes coding for "zinc-finger"-containing proteins:
            Characterization and expression in differentiated cells
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9417-9421 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Felsani, 29-SEP-1989.
FEATURES       from  to/span     description
    pept    <     1  >   508     zinc finger protein mfg1 mRNA (AA at 1; put.)
BASE COUNT      191 a     99 c     97 g    121 t
ORIGIN      
        1 attcgtacag gggagaaagc cttcaagtgt gaagaatgtg gtaaggcctt taataatcgt
       61 tcaaaccttt ctgaacacaa gagaatacat actggagaaa aaccctacaa atgtgaagaa
      121 tgtggtaagg cctttcgtat tcgctcaaaa ctttctacac accagagagt tcatacggga
      181 gaaaaaccct acaaatgtga agaatgtggt aaggctttta atagtcattc aaacctttct
      241 gaacacaaga gaatccatac tggagagaaa ccttacaaat gtgaagaatg tggtaaggct
      301 ttcagtactt catcaaacct ttctgaacac aagagaatcc atactggaga aaaaccgatc
      361 aaatgtgaag aatgtggcaa ggccttcagt actcgctcaa catattatag acaccagaaa
      421 aaccatactg gaaaaaaacc atacaagtgt gaagaatgcg ccaaagaatt ttcctaccct
      481 tcattgctta aggtacatca aagaattc
//
LOCUS       MUSMFG2A     1218 bp ss-mRNA            ROD       15-MAR-1990
DEFINITION  Mouse zinc finger protein (mfg2) mRNA, partial cds.
ACCESSION   M28514
KEYWORDS    zinc finger protein.
SOURCE      Mouse (strain CD-1) adult male skeletal muscle, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1218)
  AUTHORS   Passananti,C., Felsani,A., Caruso,M. and Amati,P.
  TITLE     Mouse genes coding for "zinc-finger"-containing proteins:
            Characterization and expression in differentiated cells
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9417-9421 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Felsani, 29-SEP-1989.
FEATURES       from  to/span     description
    pept    <     1  >  1218     zinc finger protein mfg2 mRNA (AA at 1; put.)
BASE COUNT      359 a    259 c    282 g    318 t
ORIGIN      
        1 tatgaatgta aggagtctgg gaaggccttc agactccccc aacagctcac aaggcatcag
       61 aagtgtcaca gtggtgagaa gcctttcagt cataatgaag gcagacaggc ttttcaacat
      121 cccaacctac tcaagtaccc taaagccatt catacaggtg ccaaagcatt tgcatgcagg
      181 gaatgtggga agtccttcaa ccgtgtctcc agccttgttg aacatgggct tattcatgct
      241 gatgtgaaac catatgaatg taatgagtgt gggaaagcct ttaaacgaca cagaagtttt
      301 gtgcgccatc agaaaatcca ttctggtgag agaccctttc agtgtaagga ttgtgggaag
      361 ggcttcattg ttctggctca cctcactcgg caccagagca gtcatagtga agagaagcca
      421 tttgaatgtg aggaatgtgg caagaaattt agaactgcca gacaccttgt taagcaccag
      481 cgcattcata gcggcgagaa accctttgaa tgcaatgtat gtgggagtgc tttccgcctt
      541 caactatacc tttctgagca tcagaaaact cacatggaag agaaatactt ggaatgcaat
      601 gtgtgtggga aggcttttcg gcttcaggat atcctttctg agcacttgaa aactcacact
      661 gaagagaatc ctttcaagtg taagctctgt gggtcatcct tcccacataa gtaccaactt
      721 aataagcatc tcacagttca cactgatggg aaaccctatc agtgcaagga atgtggcaaa
      781 tgctttcgtc aaaggtcaaa actaacggaa catgagagta ttcacactgg caagaaaccc
      841 ttccagtgtg aagcatgtgg gaagtcttta gctaacacac ttctcattca tcatcagaag
      901 tctcacagtg gtgagagacc ttttgaatgt aaagaatgcg gaaaggcttt ccttcttccc
      961 agtcaactta atagccataa aattgttcat acaagtaaaa gaccctttga atgcaaggta
     1021 tgtgggaagt ccttcaagcg tgaatccaat ctcattcaac atggcgctgt tcatgctggt
     1081 gtgaaatcat atgaatgcag tgagtgtggg aaaggtttta ttgaccgctc aagcctgttc
     1141 caccatcgga aaattcattc tgatgagaaa ccatttaagt gtcaggagtg tggaaaggcc
     1201 ttcgttgttc tggcgtat
//
LOCUS       MUSMFG3A      916 bp ss-mRNA            ROD       15-MAR-1990
DEFINITION  Mouse zinc finger protein (mfg3) mRNA, partial cds.
ACCESSION   M28515
KEYWORDS    zinc finger protein.
SOURCE      Mouse (strain CD-1) adult male skeletal muscle, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 916)
  AUTHORS   Passananti,C., Felsani,A., Caruso,M. and Amati,P.
  TITLE     Mouse genes coding for "zinc-finger"-containing proteins:
            Characterization and expression in differentiated cells
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9417-9421 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Felsani, 29-SEP-1989.
FEATURES       from  to/span     description
    pept    <     1  >   916     zinc finger protein mfg3 mRNA (AA at 1; put.)
BASE COUNT      281 a    186 c    200 g    249 t
ORIGIN      
        1 cttactgtgc tggctcagct tactcggcac gagaacattc acactgaaga caaatcgttt
       61 gaatgtaagc agtgtggtaa gatatttagt aatggctctt acctcttacg acactacgat
      121 actcacacca atgagaaacc ctttgagtgt aatatatgtg gcaaggcttt taggcttcat
      181 ctataccttt ctgagcatca gaaaactcac accgatgaga agcctttcaa gtgtaagctg
      241 tgtgagtcag ccttcagacg taagtaccag ctcagtgaac atcagagaat tcacactgat
      301 ggtaaaccct atcagtgcaa ggattgttgg gaattctttc gtcgacgatc aaattttatc
      361 gaacatcaga gtattcacac tggaaagaaa ccctttgagt gtaaggactg tgggaaggtc
      421 tttagactaa atatacatct cattcgacat cagagatttc atagtgatga gaagcccttt
      481 gaatgtaaag aatgtggaaa ggcttttcat ttttccagcc agcttaataa ccataaaacc
      541 agtcatacag gtcagacacc ttttgaatgt aaagaatgtg ggaagtcgtt taagcgtgtc
      601 tccagccttg tggaacatag gattattcac agtggtgtga aaccatataa atgtaatgcg
      661 tgtggaagag ccttcaatcg tcgctcaaac ctcatgcaac atgagaaaat tcattctgat
      721 gagagaccct ttgagtgtaa ggactgtgga aaggccttca ctgttctggc tcagctcaca
      781 cggcaccaga cgattcataa tggaaagaaa tcatatgaat gtgagcagtg tgggtcagcc
      841 ttcagactcc cgtaccaact cactcaacat cagagaattc attatgatgt gaagccattt
      901 cagtgcaagg atgtgg
//
LOCUS       MUSMHSLPC    5362 bp ss-mRNA            ROD       15-SEP-1989
DEFINITION  Mouse MHC sex-limited protein (Slp) mRNA, complete cds.
ACCESSION   M21576 M12385
KEYWORDS    antigen; major histocompatibility complex; sex-limited protein.
SOURCE      Mouse (strain B10.WR) liver, cDNA to mRNA, clone pSlp20B.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5362)
  AUTHORS   Ogata,R.T. and Sepich,D.S.
  TITLE     Murine sex-limited protein: Complete cDNA sequence and comparison
            with murine fourth complement component
  JOURNAL   J. Immunol. 135, 4239-4244 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         52     5259     sex-limited protein precursor
    sigp         52      108     sex-limited protein signal peptide
    matp        109     5256     sex-limited protein
    mRNA    <     1     5362     Slp mRNA
BASE COUNT     1219 a   1494 c   1462 g   1187 t
ORIGIN      39 bp upstream of BamHI site.
        1 cgagcagaca gtcagaccag acaggtctga cctttcctgg atcctccagc catgcggctc
       61 ctctgggggc tggcctgggt gttcagcttc tgtgcctcat ccctgcagaa gcccaggttg
      121 ctcctgtttt ccccttctgt ggttaatttg gggacccccc tgtcggtggg ggtacagctc
      181 ctggatgccc ctccaggaca ggaggtaaaa ggatcagtgt tcctcagaaa cccaaagggt
      241 ggttcctgct ccccaaagaa ggactttaag ctgagctcgg gagatgactt tgtgctgctc
      301 agccttgagg tcccactgga agatgtgagg agctgtggcc tctttgacct gcgcagagcc
      361 ccccacatcc agctggtagc tcagtctccg tggctaagga acacagcttt caaagccaca
      421 gagactcagg gtgtcaactt gctcttctct tcccgacgag gccacatctt tgtgcagacc
      481 gatcagccta tctataatcc agggcagcgg gttcgttatc gggtctttgc actggatcaa
      541 aagatgcgcc catccactga tttcctcacc atcacagtgg agaactccca tggcctcagt
      601 gtactcaaga aggagatatt tacttccaca cccatcttac aagatacctt catcattcca
      661 gacatctcag agcctgggac ctggaagatc tcagctaggt tctcagatgg actggagtcc
      721 aataggagca cccactttga agtgaagagg tatgtccttc ccaacttcga ggtgaagatt
      781 actccttgga agccatatat cctgatggtg cccagcaaca gtgatgaaat ccaattagac
      841 atccaggcca ggtacatcta tgggaagccc gtgcagggcg tggcatacac acggtttgcg
      901 ctcatggatg agcaagggaa gaggactttc cttcagggcc tagagaccca ggccaagttg
      961 gtggaaggcc ggacccacat ttccatctca aaggaccagt tccaggctgc cctggataaa
     1021 atcaatattg gggtcagaga cctggagggg ctgcgtctct atgctgctac agctgtcatt
     1081 gagtctccag gaggagagat ggaggaggca gaactcacgt cctggcgctt tgtatcatct
     1141 gccttttcct tggatctcag ccgcactaag cggcatctcg tgcctggagc ccacttcctg
     1201 ctgcaggcct tggtcagaga aatatcaggc tctgaagcct ctaacgttat tgtcaaagtt
     1261 tctgccacat tggtgtcagg ctctgattcc caagtcctca acgttcaaca gagcaccaat
     1321 agaattggcc aagtcagcat ttccttcccc atcccaccaa ccgtcacaga acttcggctc
     1381 ttggtgtctg cgggctccct ctacccaacg atagccaggc tcaccgtgca atccccacct
     1441 tcaagaggca ctggctttct ttctattgag ccactagacc ctcggtcccc tcgtgtgggg
     1501 gacaccttta tcctaaactt acaagctgtg ggcatccctg cacctacctt ctctcattac
     1561 tactacatga tcatctccag aggccagatc atggctatga gtcgggaggc ccggaggacc
     1621 gtgacctctg tctccgtgtt ggtggaccat cagctggctc cttcattcta cttcatggct
     1681 tacttctatc accaaggaca cccggtggcc aactctctgc tcatcaacat ccagcccagg
     1741 gactgtgagg gcaagctgca attgaaggta gttggtgcca aggagtatca taatggggac
     1801 atgatgaagc tccaaattca aactgactcc aaagccctgg tggcactggg agctgtggac
     1861 accgctctgt atgctgtggg cggttggtct cacaaacccc tcgacatgag caaggtcttt
     1921 gaagtaatca acagctacaa ccttggctgt ggtcctggag gtggggatga tgcccttcag
     1981 gtgttccagg atgctggtct ggccttttct gatggtgatc gactaactca aaccagagag
     2041 gacctgagct gtcccaagga gaagaaaagt cggcaaaaga gaaatgttaa cttcctgaag
     2101 gctcttagtg agaagttggg ccagtattct tctccagatg ccaagcgctg ctgccaagac
     2161 gggatgacga agctgcccat gaagcgtacc tgtgagcagc gggctgcccg tgtgcctcag
     2221 caggcctgcc gtgagccctt cttgtcctgt tgcaagtttg ctgaggacct tcgcaggaac
     2281 cagaccagga gccaggcacc ccttgcccga aaggtccggg acatggtgaa tttgatagag
     2341 gaagatgaca ttcttgtgcg cacctccttc ccagagaact ggctctggag agtggaacct
     2401 gtagacagct ccaaactgtt gacagtgcgg cttcctgatt ctatgaccac atgggagatt
     2461 catggtgtga gcctgtccaa aagcaaaggt ctgtgtgtag ccaagccaac tcgtgttcga
     2521 gtgttcagaa aattccacct tcacctgcgc ctgcccatct ccgtccgccg ctttgagcag
     2581 cttgaattac ggcctgttct ctacaactat ctgaatgatg acaagaacgt gagtgtccat
     2641 gtgaccccag tggagggcct gtgcatggct ggtggtggaa tgatggccca gtgggtgata
     2701 gtgcctgcag gctctgccct gcctgtggcc ttctctgtgg tacccacagc ttctaccaac
     2761 gtgcccctga aattggtggc taaagggact ttggattcaa gcgattctgt gtctaagatt
     2821 ctccaaattg agaaggaagg agccatccac agagaagaga tagtctacaa cctcgacccc
     2881 ctaaataacc taggtcagat gttggagata cctggcagct cggatcccaa catagtccct
     2941 gaaggagact tcagcacctt tgtcaaggtt acagcctctg aacccttgga gactctgggc
     3001 tctgaaggag ctttgtcccc aggaggcgtg gcctcccttc tgaggcttcc ccggagctgc
     3061 gcagagcaaa ccatgatcta tttggctcct accctgactg cttccaacta cctggacagg
     3121 acaaaacagt ggagcaaact gtcccctgag accaaggacc atgctgtgca tctgatccag
     3181 aaaggccacg ttcggatcca gcagtttcgg aagaaagatg gctcctttgg ggcttggtta
     3241 caccgggaca gcagcacctg gctgactgcc tttgtgctga agattctgag tttggcccag
     3301 gaacaggtgg gcaactcccc ggagaagctg caggagacgg ctagctggct gctggcccag
     3361 cagctgggtg atggctcctt ccacgaccca tgtccagtca tccacagagc aatgcagggg
     3421 ggcttggtgg ggtccaatga gacggtggca ctgaccgcct ttgtggtcat tgcccttcac
     3481 cacgggctga acgtcttccg ggaggggcat gcgaagcagc tgaagaacag agtggaagcc
     3541 tccatcacca aggcaaactc tttcttgggg cagaaggcaa gtgctgggct cctgggtgcc
     3601 catgccgccg ccatcacagc ctatgccctt acgctgacca aggcctcgga ggacctgcgg
     3661 aatgttgccc acaacagcct gatggccatg gctgaggaaa cgggggaaaa cctctactgg
     3721 ggcttagtcc ttggctctca ggacaaagtt gtgttgcgcc ccgcagcccc ccgtagccca
     3781 acagaacctg tgccccaggc cccagccttg tggatcgaaa ccacagccta tgccctgctc
     3841 cacctgcttc tgcgggaggg aaagggaaaa atggctgaca aggctgcatc ctggctcacc
     3901 caccagggaa acttccacgg ggcattccgc agtacccagg acactgtggt caccctggat
     3961 gccctgtctg cctactggat cgcttcgcac accactgagg agaaagcact gaacgtgacg
     4021 ctcagctcca tgggccgcaa tgggctcaaa acccacgtgc tacacttgaa caaccaccaa
     4081 gtcaagggcc tggaggagga gctgaagttc tccctgggca gcacaatcag tgtcaaggtg
     4141 gaaggaaaca gcaaaggcac cttgaagatc cttcgtacct acaacgtcct ggacatgaag
     4201 aacaccacat gccaggacct tcagatagaa gtgaaggtca cagacgctgt ggaatatgca
     4261 tggagcgcct acgaagacta tgaagacgac tataacatgc cagccacaga tgatcccagc
     4321 gttcccttgc agcctgtcac gcccctgcag ctatttgagg gtcgttggag ccgccgcagg
     4381 agggaggccc ccaaggtggc tgaagagcgg gagtccagag ttcactacac tgtgtgtatc
     4441 tggcacaatg gcaagctggg gctgtctggc atggccatcg cagacatcac cctcctgagt
     4501 ggattccacg ccctgagggc tgacctggag aagctgacct ctctctctga ccgttatgtg
     4561 agtcactttg agactgacgg gccccatgtc ctgttgtact ttgactcggt ccctaccacc
     4621 cgggagtgtg tgggctttgg agcctcacag gaggtggttg tgggactggt gcagccagcc
     4681 agtgctgtcc tgtatgacta ctacagccct gatcacaagt gctctgtgtt ttatgctgca
     4741 cccaccaaga gccagctcct ggccacactg tgctctggag atgtatgcca gtgtgccgag
     4801 gggaagtgcc ctcgactgct aaggtcactg gagcgaaggg tggaggacaa ggacggctac
     4861 cggatgaggt tcgcctgcta ttatcaccaa gtggagtatg gcttcacggt taaggttctt
     4921 cgagaagatg gcagagctgc cttccgtctc tttgagtcca agatcaccca agtcctgcat
     4981 ttcagaacgg acaccatggc ctccataggt cagacccgca acttcctgag ccgggcctct
     5041 tgccgccttc gtttggagcc taacaaagag tacttgatca tggggatgga cggggaaacc
     5101 agtgacaaca agggagaccc ccagtacttg ctggactcaa atacctggat tgaggagatg
     5161 ccttcagaac aaatgtgcaa gagcacccgc catcgggcag cctgtttcca gctcaaagat
     5221 ttcctgatgg agttcagcag ccgggggtgc caggtgtgag gccttaggac tctggctctc
     5281 tgagctcagc tcagggtcag ggcctcgctg gatgaggggc tctgctctac agggtaaata
     5341 aaagaaaagc tttttgacag cc
//
LOCUS       MUSNCAML1    3783 bp ss-mRNA            ROD       15-MAR-1990
DEFINITION  Mouse mRNA for neural cell adhesion molecule L1 (N-CAM L1).
ACCESSION   X12875
KEYWORDS    glycoprotein; neural cell adhesion molecule; transmembrane protein.
SOURCE      house mouse (Mus musculus).
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3783)
  AUTHORS   Moos,M., Tacke,R., Scherer,H., Teplow,D., Frueh,K. and Schachner,M.
  TITLE     Neural adhesion molecule L1 as a member of the immunoglobulin
            superfamily with binding domains similar to fibronectin
  JOURNAL   Nature 334, 701-703 (1988)
  STANDARD  simple automatic
COMMENT     *source: developmental stage=8-day old; tissue=brain; *source:
            library=lambda gt10 and lambda gt11.
FEATURES       from  to/span     description
    pept          1     3783     N-CAM L1 (AA 1 - 1260)
BASE COUNT      982 a   1038 c    997 g    766 t
ORIGIN      
        1 atggtcgtga tgctgcggta cgtgtggcct ctcctcctct gcagcccctg cctgctcata
       61 cagattccag acgaatataa aggacaccat gtgctagagc cacctgtcat cacggaacag
      121 tctccacggc gcctggttgt cttcccaaca gatgacataa gcctgaaatg tgaagccaga
      181 ggcagacccc aagtggagtt ccgctggacg aaagatggca tccacttcaa acccaaggaa
      241 gaattgggtg tagtggtgca tgaggcaccc tattctggct ccttcaccat cgaaggcaac
      301 aacagctttg cccagaggtt tcagggcatc tatcgctgct atgccagcaa taagctagga
      361 actgccatgt cgcatgagat ccagctcgtg gccgagggtg cccccaagtg gccgaaggag
      421 actgtaaaac ctgtggaagt ggaggaagga gaatcagtag ttctgccttg caaccctcca
      481 cccagtgcag ccccacctag gatctactgg atgaacagca agattttcga catcaaacaa
      541 gatgagcggg tgtccatggg ccagaatgga gacctatatt ttgccaatgt gcttacctca
      601 gacaatcatt cagactacat ctgcaatgcc cacttccctg gtacccggac catcattcaa
      661 aaggaaccta ttgacctccg ggtcaagccc accaacagca tgattgaccg gaagccacgt
      721 ctgctctttc ccacaaactc cagcagccgc ctggtagcct tgcagggcca gtcattgatc
      781 ctggagtgca ttgctgaggg attccctaca cccaccatca agtggctgca ccccagtgac
      841 ccaatgccaa cagaccgtgt tatctaccaa aaccacaaca agaccctgca actactcaat
      901 gtgggcgaag aggacgatgg cgagtatacc tgccttgctg agaactcgct gggcagtgcc
      961 cggcatgcct actatgttac tgtggaagct gccccatatt ggctgcagaa gccccagagc
     1021 catttgtatg gtccaggaga gactgcccgc ctagactgcc aagtccaggg caggccccaa
     1081 ccagagatca cttggagaat caacggaatg tctatggaga cggtgaacaa ggaccagaag
     1141 taccggattg agcaggggtc tctgatcttg agtaacgtgc agccaactga cacaatggtg
     1201 acccagtgtg aagcccgcaa ccagcatggg ctcctgctag ccaatgccta catttatgtt
     1261 gtccagctgc cagccaggat cctaacaaaa gacaatcaga catacatggc agttgagggc
     1321 agtactgctt acttgctgtg caaagccttt ggagctcctg ttcccagtgt ccagtggctg
     1381 gatgaagaag gaaccacagt gcttcaggat gaacgatttt tcccctatgc caatggaacg
     1441 ctgagcatca gagacctcca ggccaatgac actggacgct atttctgcca ggctgccaat
     1501 gaccagaaca atgtgaccat tttggctaac ctacaggtta aagaagcaac ccagatcaca
     1561 caggggcccc ggagcgcaat tgagaagaaa ggtgcaaggg tgacattcac gtgccaggcc
     1621 tcctttgacc cctctttgca ggccagcatc acttggcgtg gagatgggag agacctacag
     1681 gaacgtgggg acagtgacaa gtatttcata gaagatggga aactagtcat ccagagcctg
     1741 gactacagtg accagggcaa ctacagttgt gtggccagca ctgaactgga tgaggtggag
     1801 agcagggcac agctcttagt ggtggggagc cctgggccag tgcctcacct ggagctgtcc
     1861 gaccgccacc tgctgaagca gagccaggtg cacttgtctt ggagccctgc tgaagaccac
     1921 aactctccca ttgagaagta tgacattgaa tttgaggaca aggaaatggc tcctgagaaa
     1981 tggttcagtc tgggcaaggt gccaggaaat cagacctcta ctaccctcaa gctgtccccc
     2041 tatgtccact acacctttcg ggtcactgcc attaacaaat atggtcctgg agaacccagc
     2101 cctgtctctg agagtgtggt cacacctgag gcagccccag agaagaaccc tgtggatgtg
     2161 agaggggaag ggaatgagac caacaatatg gtcatcacat ggaagcccct tcggtggatg
     2221 gattggaatg ccccccagat tcagtaccgt gtacagtggc gtccacaggg caagcaggag
     2281 acctggagga aacagaccgt gagcgaccct ttcctggtgg tgtctaacac ttccacattt
     2341 gtgccttatg agatcaaagt ccaggcagtg aacaaccagg gcaagggccc tgagccccag
     2401 gtcaccattg gctattcagg ggaagactac ccccaggtga gccctgaact tgaagacatc
     2461 acaatcttca actcaagtac tgtgcttgtc aggtggaggc ctgtggactt ggcccaggtt
     2521 aagggccacc tcaagggata caatgtaaca tactggtgga agggcagcca gagaaagcac
     2581 agcaagaggc atatccacaa aagccacata gtggtacctg caaataccac cagtgccatc
     2641 ctcagtggtt tgcgccctta cagctcttac catgtggagg tacaggcctt taatgggcgg
     2701 ggcttggggc ctgcgagtga atggaccttc agcaccccag agggagtgcc tggccaccct
     2761 gaggcattac acctggagtg tcagtcggac actagtctgc tactgcactg gcagccacca
     2821 ctcagccaca atggagtgct cactggctac ctgctctctt accatcccgt ggaaggggaa
     2881 agcaaagagc agttgttctt caacctttcg gacccagaac tccggactca taatctgacc
     2941 aacctcaacc ctgatctaca gtaccgcttc cagcttcagg ccaccaccca acaggggggt
     3001 cctggtgagg ccatcgtgcg tgaaggaggc accatggccc tgtttggcaa gccagatttt
     3061 ggcaacatct cagccacagc aggtgaaaac tacagcgtgg tctcctgggt ccctcggaag
     3121 ggccagtgca atttcaggtt ccatatcttg ttcaaagcct taccagaagg gaaagtgagc
     3181 cctgatcacc agcctcagcc tcagtatgtc agctacaatc agagctccta cacacaatgg
     3241 aacctacagc ctgacaccaa atatgagatc cacctgataa aggagaaggt cctcctgcac
     3301 catctggatg tgaagactaa tggaactggc cctgtgcgag tttctactac agggagcttt
     3361 gcctccgagg gctggttcat cgcctttgtc agcgctatca ttctcttgct cctcatcctg
     3421 ctcatcctct gcttcatcaa acgcagcaag ggtggcaaat actcagtgaa ggacaaggag
     3481 gacactcagg tagattccga ggcccggccc atgaaagacg agaccttcgg cgagtacagg
     3541 tccctggaga gtgacaatga agagaaggcc tttggcagca gccagccatc tctcaacgga
     3601 gacatcaaac ccctaggcag tgatgacagc ctggctgatt atgggggcag tgtggacgtc
     3661 cagttcaatg aggatggctc tttcatcggc cagtacagtg gcaagaaaga gaaggaggca
     3721 gcaggaggca atgacagttc aggggctacc tctcctatca atcctgcagt agccctagaa
     3781 tag
//
LOCUS       MUSPBGD1     2663 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 1.
ACCESSION   M29949 J04981
KEYWORDS    hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT     1 of 4
SOURCE      Mouse (C3H) DNA, clone PBGD.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2663)
  AUTHORS   Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
  TITLE     The mouse porphobilinogen deaminase gene
  JOURNAL   J. Biol. Chem. (1989) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Grandchamp, 06-OCT-1989.
FEATURES       from  to/span     description
    pept        505      537     porphobilinogen deaminase (housekeeping) exon
                                 1 (EC 4.3.1.8)
               2191     2307     porphobilinogen deaminase (housekeeping) exon
                                 2 (EC 4.3.1.8)
               2457  +  2510     porphobilinogen deaminase (housekeeping) exon
                                 3 (EC 4.3.1.8)
    pep$       2475  +  2510     porphobilinogen deaminase (erythroid sp.) exon
                                 1
    pre-msg     341  >  2663     PGB deaminase (hk) mRNA and introns
    IVS         538     2190     PGB deaminase (hk) intron A (no splice
                                 consensus)
    IVS        2308     2456     PGB deaminase (hk) intron B
    IVS        2511  >  2663     PGB deaminase (hk) intron C
    pre-msg    2192  >  2663     PGB deaminase (ery.sp.) mRNA and introns
    IVS        2511  >  2663     PGB deaminase (ery.sp.) intron A
    binding    1126     1133     NPE binding site
    binding     312      317     Sp1 binding site
    binding     329      334     Sp1 binding site
    site       2101     2107     CACCC box
    site       2118     2124     CACCC box
BASE COUNT      622 a    661 c    707 g    673 t
ORIGIN      Chromosome 9
        1 ccacccccac cccacacaca cacacaaagt aaatagggct ggagagctta gtggttaaga
       61 gcactgactg ctctttcaga ggtcctgagt tcaattccca gaaaccacat ggtgctcaca
      121 accatctgca atagggtctg atgccctttt ctggtgtgtc taaagaagag agcaatggtg
      181 tactcatata cataaaataa ttttttttaa aaagtaaaag ataataaaaa ttgaaaagga
      241 aaaaaatctt tttgagttgt tctgtgcagt ggacttgagc gaaaaggctg gctatgtcgc
      301 aatcctaatt cccgcccaga ggaaggcacc gccccgttga gggagggcag cggacgtgac
      361 gcagagctca gcaggtcctg cagccggagt gaagtgcggg ctcgggcccc atgtgccttc
      421 agtcccggcc ggcccaggtc gtcggcttct gcagacacca ggggaccgca gcggcactgc
      481 cgcgcctgcg ccctgggcgg agtcatgtcc ggtaacggcg gcgcggccac aaccgcggtg
      541 agttctgagc cggtgaccga tgacccgcac ttctcggggc tttctgggtg caacgattgg
      601 ccccgggttg ccatgttctc gtcgtctatt ggtcggaata gttagctgtc atttttcccc
      661 ccccacacct caaggttttt tttaaagggc cagtaactag gttgccctaa ggcagggaag
      721 gagtgatctc gagcagtggg ggcggggttg tgagtggaaa ggtggtccgc cctgggattc
      781 catccctgta ggctctggct ggatctctgt tgttcccgac cagtaaagga ttatgcacag
      841 acaagatcct tttcacgaag aaggggctga ggcaaatcca gctatctcgg aatacgatcc
      901 acttcattca ggggagagca caccccactt cttaaaactg tatacaaaca tcttggaggt
      961 tacacgcctt ctcccgttct ccgttatgaa gtcacccagc cttagccacc cacaaaagtc
     1021 ctagtagaga cacacctgaa ttgctattgt gagcggggga acccacccct gggccttgtc
     1081 atttctggcc tgcctggaaa gttctgaact tgtgggcagg ctgcctgaga taaggctgag
     1141 ctgggaagct tgcttatctc ctgcccaggc agtaagcagt agtcttggct atgaaaacat
     1201 ttttagagca ctgggttagg gtaggaaggc ctggatttca gcacccactt tctgtctgtt
     1261 catagctgtg agatgtttag acagtaattt gaccactctg catctttgct tctgtgacac
     1321 gggtggaagt acctaccctg tctaacctag tagggttgtt gcaaggacaa tatgcagaca
     1381 ctgctcaaat gctgttctgg gtcaatcaat taaaaaacaa attgtttgaa cttagcaatt
     1441 cctttctatg ggctccctgt tgtccgaaat ttctgtgtta tttcaagccc agctaaattg
     1501 caaaggctat ctcagagtcg tttgttggag gaatcttcgc agtggagtag actggagtcc
     1561 aagagcaagt tttcaccttc agtgaccaag aacttgagtg tctggttata gaagaacctg
     1621 tgagatgagg aacctggtgc agggaagggg gacaatctgt acagtgactc ctgtcccctt
     1681 tgtatcagac tgcagaaccc agttctacct gcttggccct agacaccttt atccaaggcg
     1741 ccttaacaaa agaaagaggt gtgtcctttt gagctcttgg ctctggctta agacaccaga
     1801 ggaaacccgt aggcaatgac tgttaggcag tttattcttg tagtcttctg ggacttcttg
     1861 aggcatgagg tggcctttaa tttaacaagc ccttgatggg atgatgttcc caaagtcacc
     1921 caccaagggc atgaaagggc tgtacattag cttggttgat ttcagtcctt gttaggagta
     1981 catcctggtg tctcacccag ggcttagtga ggccttctca agtgcctgag ttgttgtgga
     2041 cagtgagctt gttctctagc aatgggaggc ttcagctgtc ctgccccagc ttctgtaggc
     2101 cccaccctcc agcagggccc accctcactg tgccgaggct gatgggcctt atcattttgc
     2161 ccacctggct gtgtgcagcc ctcccactca gaacctcctt ggccaggctg ggctttgggg
     2221 ctcagtgtcc tgttgctgct gccacaacag atcctattac agcttttctt ctggtcttgc
     2281 ttctctggat cccgtagagg gcagaaggta ccaaggaaga ttcaaggacc agtcctggga
     2341 gtctctcctt cctagcagcc tcacctgcct aggacccggg agtcctctct cctaagcctg
     2401 tgatcctagt tctttgaatg aggaaaagat cgtaacctag ggactttctt ctgcaggaag
     2461 aaaacggctc aaagatgagg gtgattcgag tgggcacccg taagagccag gtgagtacag
     2521 acatagcgcg ttgcctcaag aattgtaatg ctcacgggtc actagtggga accaaaggct
     2581 agcatcgagc aaataagagt gtgtgagagt cgatttcatg ggggatggca gctcacttcc
     2641 tctgaaaaga gagtctctgg agc
//
LOCUS       MUSPBGD2     2761 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 2.
ACCESSION   M29950 J04981
KEYWORDS    hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT     2 of 4
SOURCE      Mouse (C3H) DNA, clone PBGD.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2761)
  AUTHORS   Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
  TITLE     The mouse porphobilinogen deaminase gene
  JOURNAL   J. Biol. Chem. (1989) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Grandchamp, 06-OCT-1989.
FEATURES       from  to/span     description
    pept    +    50      122     porphobilinogen deaminase (housekeeping), exon
                                 4 (EC 4.3.1.8)
                485      534     porphobilinogen deaminase (housekeeping), exon
                                 5 (EC 4.3.1.8)
                617      672     porphobilinogen deaminase (housekeeping), exon
                                 6 (EC 4.3.1.8)
                981     1058     porphobilinogen deaminase (housekeeping), exon
                                 7 (EC 4.3.1.8)
               1969     2046     porphobilinogen deaminase (housekeeping), exon
                                 8 (EC 4.3.1.8)
               2141  +  2216     porphobilinogen deaminase (housekeeping), exon
                                 9 (EC 4.3.1.8)
    pep$    +    50      122     porphobilinogen deaminase (erythroid sp.) exon
                                 2
                485      534     porphobilinogen deaminase (erythroid sp.) exon
                                 3
                617      672     porphobilinogen deaminase (erythroid sp.) exon
                                 4
                981     1058     porphobilinogen deaminase (erythroid sp.) exon
                                 5
               1969     2046     porphobilinogen deaminase (erythroid sp.) exon
                                 6
               2141  +  2216     porphobilinogen deaminase (erythroid sp.) exon
                                 7
    pre-msg <     1  >  2761     PGB deaminase (hk and ery.sp.) mRNA and
                                 introns
    IVS     <     1       49     PGB deaminase (hk) intron C; ery.sp. intron A
    IVS         123      484     PGB deaminase (hk) intron D; ery.sp. intron B
    IVS         535      616     PGB deaminase (hk) intron E; ery.sp. intron C
    IVS         673      980     PGB deaminase (hk) intron F; ery.sp. intron D
    IVS        1059     1968     PGB deaminase (hk) intron G; ery.sp. intron E
    IVS        2047     2140     PGB deaminase (hk) intron H; ery.sp. intron F
    IVS        2217  >  2761     PGB deaminase (hk) intron I; ery.sp. intron G
BASE COUNT      703 a    605 c    725 g    728 t
ORIGIN      Chromosome 9; approximately 147 bp upstream of segment 1.
        1 aaggcagtgg ccaggggagg tgagaaacca tctgactctc tttccccagc tggctcgcat
       61 acagaccgag actgtggtgg cgatgctgaa agccttgtac cctggcatac agtttgaaat
      121 cagtaagttt tcttgagagg agtgattggt agtgaacggg aagccagtga accggaggac
      181 agggcatctc tcgtttgcct gtggtcaaag cctgccttgt aagactattc tggctgcttg
      241 tgaagggaaa gaaagattgt ctcctgtgca catctcctcc agctgcccgg gctagcctga
      301 catttccata ctttctgctt tgggttcttt tatgagtatg tctgcttttt ctgtcggtgt
      361 gtgtatctga gagagttagg ggctgggtct tctatgcctc agactccact gtgaatccag
      421 tcaaggcctg aacgaggggt gactcagtag gtgttaatgg gtatctgatt gactctctcc
      481 tcagttgcta tgtccaccac gggagacaag attgttgata ctgcactctc taaggtaacg
      541 ccagtccttg tcccattctt cttgtccctc tcccacgtgt aaggggttca ctctgaggct
      601 ctctcttgcc tggcagattg gagagaagag cctgtttacc aaggagctag aaaacgccct
      661 ggaaaaaaac gagtgagtga ggatggagga atgtggtacc ccgagcctag aaccccaaag
      721 tggctctcca atattggcag gattgtcggg ttagactgtg gagctcacag gctttcacag
      781 agaagagagc cttgccttgg agtagcctaa ctacctgggg aatcagactg ccgggggaaa
      841 ggggtagagt agttgagaag agaccaggtc ttagatctta agatgctatc ttcctgaacg
      901 gtcaaggatg ctggggtggg tggtggagat aaggtcacct actcaaagcc tctctctgtg
      961 cctccccctg ccgtctccag agtggacctg gttgttcact ccctgaagga tgtgcctacc
     1021 atactacctc ctggctttac tattggagcc atctgcaagt aagcggggag gacatgcatg
     1081 ggacggaggg ccctgggcag gattaatcct actgtgggaa tctttgagtt tttttttttt
     1141 ttttttccat ttggaactta accgcttagc cgtctgtttt gaaggttctc agacatagtg
     1201 tggcaggaaa gccaattggt tgacttggtt gactatttag agtttgtgga gttgggctca
     1261 gtggcacgga cctgaaatcc cagctactgg gaggctaaga caggatcaga gattctgggc
     1321 cagcctgggc tacagaggga tttgaaccag cctgaggaac ttagattgtg ccttaggggc
     1381 acagaaggct ggcttacagt ggcttaggtg gtaaaggcat tttttgctgt caagccaatg
     1441 acctgagttc agtccgtggg gtgcacttgg tgaaagaaga gggttgaatc ccacaagttg
     1501 tcatctgact catgcataca tgctgtagaa tgtttatgct cctcatccct caatgaaaat
     1561 ggaaacaatc aaggaaatga aatataaaac ctgctgggtg gtggtgcgca cgcctataat
     1621 cccagcactt gggaggcaga ggcaggtgaa ttcaacctgg tctacaaagt gagttccagg
     1681 actatacaga gaaacccagt cttaaaaaca aaacaaaact aaacaacaac aacaacaaca
     1741 acaaaaaaga aaaaacaaag aaagaaatat aaaacctttc caaagaaaat aaaatgaatt
     1801 tggcctggtg gctcatgcta taatctcagc attcagagag ctgaggcagg agggttattg
     1861 tgagttaaag gctagctggg gtacagagaa aattttaggt cacctgggct agagttaacc
     1921 ctatctccaa atgctaatac ctttatttca tcatcatttg ctttgcagac ggcaaaaccc
     1981 ttgtgatgct gttgtctttc acccaaagtt tattggaaag accctggaaa ccttgccaga
     2041 gaaaaggtga gtgggcctag tgtgcggggg agagaggcct ggacagtgga gaacagttgg
     2101 cagcctgggt taagtttaat tctaaactct ctctgagcag tgccgtggga accagctctc
     2161 tgaggagagt ggctcagcta cagagaaagt tccccaacct ggaattcaag agtattgtat
     2221 cctttcagaa gaaggagggg aaaaagaggg aaagaaggac cttccgaagc aagtggtcca
     2281 tgcggtcagg gggtcgtctt tccatctgtc cgtccaccca cccacccacc catccatcca
     2341 tccatccaca catccacagt cctttaatgt tttgcttttt tttttttttc ctgagacagg
     2401 gtttctctgt gtagcctggc tgtcctggaa ctcactttgt agaccaagct ggcctgcaaa
     2461 gtgagaaatc cgcctgcctc tgcctcctga gtgctgggat taaaggtgcg ccaccactgc
     2521 ctggcaccct aatgtttttt aaactcaggc ctggcaatga ggacaatttg caaaacaaac
     2581 atggttcctt gttctataca gctgacatgt tagacagaca ggcaggcact gcagatactg
     2641 accggtgacc actcctggtg cagggacaga ggcgcttctg cttttacttt ctgtgctgct
     2701 aagtggtttt ggtttttaca gtgaatatgt gatatgtttc ataaaagtaa tttttttttc
     2761 t
//
LOCUS       MUSPBGD3      800 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 5.
ACCESSION   M29951 J04981
KEYWORDS    hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT     3 of 4
SOURCE      Mouse (C3H) DNA, clone PBGD.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 800)
  AUTHORS   Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
  TITLE     The mouse porphobilinogen deaminase gene
  JOURNAL   J. Biol. Chem. (1989) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Grandchamp, 06-OCT-1989.
FEATURES       from  to/span     description
    pept    +   402      515     porphobilinogen deaminase (housekeeping), exon
                                 10 (EC 4.3.1.8)
                739  +   777     porphobilinogen deaminase (housekeeping), exon
                                 11 (EC 4.3.1.8)
    pep$    +   402      515     porphobilinogen deaminase (erythroid sp.) exon
                                 8
                739  +   777     porphobilinogen deaminase (erythroid sp.) exon
                                 9
    pre-msg <     1  >   790     PGB deaminase (hk and ery.sp.) mRNA and
                                 introns
    IVS     <     1      401     PGB deaminase (hk) intron I; ery.sp. intron G
    IVS         516      738     PGB deaminase (hk) intron J; ery.sp. intron H
    IVS         778  >   790     PGB deaminase (hk) intron K; ery.sp. intron I
BASE COUNT      181 a    201 c    182 g    236 t
ORIGIN      Chromosome 9; approximately 477 bp upstream of segment 2.
        1 ctgtacccca gctagccttt aactcacaat aaccctcctg cctcagctct ctgaatgctg
       61 agattatagc catgagccac caggccaaat tcattttata tttctttctt tcttttttct
      121 tttttgttgt tgttgttgtt gttgtttagt tttgttttgt ttttaagact gggaaactct
      181 gtatagtcct ggaactcact ttgtagacca gatttagcct tgaattcatg gagatctgta
      241 tctgcctcca gtgctgggat ttaaaggtgt atacaccacc actcaacaaa aacacaacaa
      301 aaacaaaagt tttttaaaag ttagctagag gggggaaaag agactgtggg gcagagggtg
      361 cactgggtag gtcttgactt ctccttagca acgctccaca gcggggaaac ctcaacaccc
      421 gccttcggaa gctggatgag ctgcaggaat tcagtgccat tgtcctggct gtggctggcc
      481 tacagcgcat gggctggcag aaccgggtgg gccaggtagg agctgccctg ttctgcttcc
      541 cattgaatct gcctctctcc tgccttgatt tcttggtgac cattctgcca acaacactac
      601 aaccagaagc ccaggctagg gatattggga ctcattgctg gatttcctac ctgtgccttc
      661 cccaggcttc ctagattgca aaccctagct cactgccttt gaacatcccc tatcccacca
      721 tcttgtctct ctccacagat tttgcaccca gaggaatgca tgtatgctgt gggtcaggta
      781 ggtaggtttg cctggagaga
//
LOCUS       MUSPBGD4     1386 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 4.
ACCESSION   M29952 J04981
KEYWORDS    hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT     4 of 4
SOURCE      Mouse (C3H) DNA, clone PBGD.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1386)
  AUTHORS   Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
  TITLE     The mouse porphobilinogen deaminase gene
  JOURNAL   J. Biol. Chem. (1989) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Grandchamp, 06-OCT-1989.
FEATURES       from  to/span     description
    pept    +   154      273     porphobilinogen deaminase (housekeeping), exon
                                 12 (EC 4.3.1.8)
                459      512     porphobilinogen deaminase (housekeeping), exon
                                 13 (EC 4.3.1.8)
                618      704     porphobilinogen deaminase (housekeeping), exon
                                 14 (EC 4.3.1.8)
                783      956     porphobilinogen deaminase (housekeeping), exon
                                 15 (EC 4.3.1.8)
    pep$    +   154      273     porphobilinogen deaminase (erythroid sp.) exon
                                 10
                459      512     porphobilinogen deaminase (erythroid sp.) exon
                                 11
                618      704     porphobilinogen deaminase (erythroid sp.) exon
                                 12
                783      956     porphobilinogen deaminase (erythroid sp.) exon
                                 13
    pre-msg <     1     1279     PGB deaminase (hk and ery.sp.) mRNA and
                                 introns
    IVS     <     1      153     PGB deaminase (hk) intron K; ery.sp. intron I
    IVS         274      458     PGB deaminase (hk) intron L; ery.sp. intron J
    IVS         513      617     PGB deaminase (hk) intron M; ery.sp. intron K
    IVS         705      782     PGB deaminase (hk) intron N; ery.sp. intron L
BASE COUNT      372 a    305 c    343 g    366 t
ORIGIN      Chromosome 9; approximately 77 bp upstream of segment 3.
        1 atcagtagtt cctgaaacct gttcatacct tgcacctcta tccatcaata atgttaaaga
       61 caggtttgtt gttatgcata acccaggaag cagtagaggt gtgtttctca tcttagctct
      121 attactagag aagaacagcc tgttgttctt tagggggccc tagccgtgga agtccgagcc
      181 aaggaccagg atatcttgga cctagtgagt gtgttgcacg atcctgaaac tctgcttcgc
      241 tgcattgctg aaagggcttt tctgaggcac ctggtaagat gggctcctcc catggtgttg
      301 tggggaaacc aggaagggca gtagggaggg agatttgtca agtactcagt atgtaatgtt
      361 ttgtatgtat ggagaggacc ttgatctggc ctcttgaggt ctgtggtcaa aagtggtgtt
      421 aaaggccctt agagctcaaa ggaacaatat cattgcagga aggaggctgc agcgtgcccg
      481 tagcagtgca tacagtgata aaggatgggc aagtaagcca gggaaatgga tgaggggagg
      541 gactgtcatt tccatgtgca cccaaacatc taagtaactt tctttaaaca tcctggtaca
      601 aacattttat ttcctagctg tacctgactg gtggagtatg gagtctagat ggctcagata
      661 gcatgcaaga gactatgcag gccaccatcc aggtccctgt tcaggtattg actgggagat
      721 gaggaggaat aaatagaact cttgtaatct tcctcttacc aaaattgtaa cctgtcatcc
      781 agcaagaaga tggtccagaa gatgacccac aactggttgg aatcactgcc cggaacattc
      841 caagaggagc ccagctagct gctgagaacc tgggcatcag cctggccagc ttgctgctca
      901 acaaaggagc caagaacatc ctggatgttg cacggcagct taatgatgtg cgctaactgg
      961 tctgtagggc acaggaaccc tggctgccac tccagtgcct acttctggct tccaagtgcc
     1021 ctgtgctcca tccctagggg tgtgattatc ccaggaaatt gaaccacagg gttgttgaga
     1081 cttccacttt ggaagatatg cctcaccttg gggcctccat atctgccttt ccctcagtag
     1141 ttgggggctt catctcttta gagaaagtcc atgccaatct ttgaatgtaa ccaataccac
     1201 taataaacca gtttagaatg tggttcttct gatagagttg gggaagatat gaataaaccc
     1261 aaagcccttt taaacttgaa tgagtctgag acctttctgt tgtaaaacac gctgtgattt
     1321 gcctcatgtt ctcaaaaaaa aaaaaaaaaa tcagccttta attcctacag cctgtcttca
     1381 gtcgac
//
LOCUS       MUSPHKGB     1012 bp ss-mRNA            ROD       15-MAR-1990
DEFINITION  Mouse skeletal muscle phosphorylase kinase alpha subunit mRNA,
            partial cds.
ACCESSION   M28867
KEYWORDS    phosphorylase kinase; protein kinase.
SOURCE      Mouse (strain ICR Swiss White, adult) mixed fiber skeletal muscle
            from hind limb, cDNA to mRNA, clone alpha-1.0Kb.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1012)
  AUTHORS   Bender,P.K. and Lalley,P.A.
  TITLE     The I/Lyn mouse phosphorylase kinase deficiency: Mutation disrupts
            expression of the alpha/alpha' subunit mRNAs
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9996-10000 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Bender, 05-OCT-1989.
FEATURES       from  to/span     description
    pept    <     1  >  1012     phosphorylase kinase alpha subunit (AA at 1)
                                 (EC 2.7.1.3.8)
BASE COUNT      280 a    236 c    229 g    267 t
ORIGIN      Chromosome X.
        1 aattcactac tgcccagggc ttcaacatcc aaagaagttg atgccagtct gctctcagtg
       61 gtctctttcc cagcctttgc tgtagaggac agccatttgg tggagctcac caaacaggag
      121 atcatcacca agcttcaggg tcgttatggt tgctgtcgtt ttctgcgaga tggatataaa
      181 actcctaaag aggatcccca acgcctatac tataacccag ctgagctgaa gctatttgaa
      241 aacattgagt gcgaatggcc attgttctgg acatacttta tccttgatgg gatcttcagt
      301 ggcaacgtag aacaggttca agaatataga gaggctcttg atgcagtcct catcaagggc
      361 aaaaatggag tccctcttct tccagagctg tacagtgtcc ctcctgacag ggttgatgaa
      421 gagtatcaaa atccccacac tgtggatcga gtccctatgg gaaaattgcc tcacatgtgg
      481 ggtcagtctc tatacatttt aggaagcttg atggcagagg gatttttagc tcctggagaa
      541 attgatcccc tgaatcgtag gttttctact gtgccaaagc cagatgtggt ggttcaagtc
      601 tccattctgg ctgaaacaga agaaatcaag gccattttga aggacaaagg aattgatgtg
      661 gagaccattg ctgaagtgta ccccataaga gtacagccag ctcgtattct cagccatatt
      721 tattctagtc taggatgcaa cagtagaatg aaactcagtg gacgacccta caggctcatg
      781 ggtgtgcttg gaacatcaaa actttatgac attcgcaaaa ctatctttac tttcactcca
      841 cagtttatag accagcaaca gttctacctg gctctggaca accagatgat agtagaaatg
      901 ctcagaacag acctttccta cctctgtagc cgctggagga tgacaggcca gcccacgatc
      961 actttcccta tctcgcacac catgcttgat gaagatggaa ccagcttgaa tt
//
LOCUS       MUSTCBVE      389 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse germline T cell receptor V-beta-13g gene V-region, 2 partial
            exons.
ACCESSION   M25914
SOURCE      Mouse T lymphocyte DNA, clone V-beta-13g.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 389)
  AUTHORS   Morahan,G., Allison,J., Peterson,M.G. and Malcolm,L.
  TITLE     Sequence of the V-beta-13 gene used by an influenza-specific T cell
  JOURNAL   Immunogenetics 30, 311-313 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            provided by Grant Morahan, 05-JUL-1989.
FEATURES       from  to/span     description
    pept    <     1       23     T cell receptor V-region, exon 1 (AA at 2)
                350  >   389     T cell receptor V-region, exon 2
    IVS          24      349     TCR intron
BASE COUNT       87 a     91 c     89 g    122 t
ORIGIN      
        1 cagtgttctg ttccttgaca caggtaagtt tttagaaaaa cagtctccgt tttgcatttc
       61 tctggggtgt gtgtgtgtga ttatttcaat tgttcccatt ttgaaacagt ctgatctttt
      121 cccacagtac tgtctgaagc tggagtcacc cagtctccca gatatgcagt cctacaggaa
      181 gggcaagctg tttccttttg gtgtgaccct atttctggac atgataccct ttactggtat
      241 cagcagccca gagaccaggg gccccagctt ctagtttact ttcgggatga ggctgttata
      301 gataattcac agttgccctc ggatcgattt tctgctgtga ggcctaaagg aactaactcc
      361 actctcaaga tccagtctgc aaagcaggg
//
LOCUS       MUSTYR       1838 bp ss-mRNA            ROD       15-MAR-1988
DEFINITION  Mouse tyrosinase mRNA, complete cds.
ACCESSION   D00131
KEYWORDS    melanin; melanocyte; tyrosinase.
SOURCE      Mouse melanocytes cell line TM10 mRNA, cDNA to mRNA, clones
            Tyrs-33, Tyrs-J2, and Tyrs-J19.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1838)
  AUTHORS   Yamamoto,H., Takeuchi,S., Kudo,T., Makino,K., Nakata,A.,
            Shinoda,T. and Takeuchi,T.
  TITLE     Cloning and sequencing of mouse tyrosinase cDNA
  JOURNAL   Jpn. J. Genet. 62, 271-274 (1987)
  STANDARD  simple automatic
COMMENT     Tyrosinase is the key enzyme for melanin,synthesis in mammalian
            melanocytes and has been considered to be a unique marker for the
            study of melanocyte differentiation. A cDNA library was constructed
            from poly(A)+ mRNA from mouse melanocytes and screened using anti-
            tyrosinase antiserum and oligonucleotide probes corresponding to
            amino acid sequence of tyrosinase. sequencing of some cDNA clones
            positive in these screenings gave a nucleotide sequence of 1838
            nucleotides including a open reading frame of 1344 nucleotides that
            was found to correspond exactly to the amino acid sequence of the
            cyanogen bromide fragments of tyrosinase.
FEATURES       from  to/span     description
    pept         65     1411     mouse tyrosinase
    signal     1735     1740     polyadenylation signal
BASE COUNT      505 a    397 c    399 g    537 t
ORIGIN      64 bp upstream of the tyrosinase initiation codon.
        1 ggaaaagaag tctgtgacac tcattaacct attggtgcag attttgtatg atctaaagga
       61 gaaaatgttc ttggctgttt tgtattgcct tctgtggagt ttccagatct ctgatggcca
      121 ttttcctcga gcctgtgcct cctctaagaa cttgttggca aaagaatgct gcccaccatg
      181 gatgggtgat gggagtccct gcggccagct ttcaggcaga ggttcctgcc aggatatcct
      241 tctgtccagt gcaccatctg gacctcagtt ccccttcaaa ggggtggatg accgtgagtc
      301 ctggccctct gtgttttata ataggacctg ccagtgctca ggcaacttca tgggtttcaa
      361 ctgcggaaac tgtaagtttg gatttggggg cccaaattgt acagagaagc gagtcttgat
      421 tagaagaaac atttttgatt tgagtgtctc cgaaaagaat aagttctttt cttacctcac
      481 tttagcaaaa catactatca gctcagtcta tgtcatcccc acaggcacct atggccaaat
      541 gaacaatggg tcaacaccca tgtttaatga tatcaacatc tacgacctct ttgtatggat
      601 gcattactat gtgtcaaggg acacactgct tgggggctct gaaatatgga gggacattga
      661 ttttgcccat gaagcaccag ggtttctgcc ttggcacaga cttttcttgt tattgtggga
      721 acaagaaatt cgagaactaa ctggggatga gaacttcact gttccatact gggattggag
      781 agatgcagaa aactgtgaca tttgcacaga tgagtacttg ggaggtcgtc accctgaaaa
      841 tcctaactta ctcagcccag catccttctt ctcctcctgg cagatcattt gtagcagatc
      901 agaagagtat aatagccatc aggttttatg cgatggaaca cctgagggac cactattacg
      961 taatcctgga aaccatgaca aagccaaaac ccccaggctc ccatcttcag cagatgtgga
     1021 attttgtctg agtttgaccc agtatgaatc tggatcaatg gatagaactg ccaatttcag
     1081 ctttagaaac acactggaag tatttttgtt tgaacacaat ggctgcgaag ggcaccgccc
     1141 tcttttggaa gtttacccag aagccaatgc acctatcggc cataacagag actcttacat
     1201 ggttcctttc ataccgctct atagaaatgg tgatttcttc ataacatcca aggatctggg
     1261 atatgactac agctacctcc aagagtcaga tccaggcttt tacagaaatt atattgagcc
     1321 ttacttggaa caaggccagt cgtatctggc catggcttct tggggcagca ctggtgggag
     1381 ctgttattgc tgcagctctc tctgggctta gcagtaggct atgccttcag aagaagaaga
     1441 agaagaagca accccaggag gaaaggcagc cactcctcat ggacaaagac gactaccaca
     1501 gcttgctgta tcagagccat ctgtgaacat cctaggaaac agagtggact gaaaggtttt
     1561 acctcactcg acctatttgt tggtgtttct acaaatttaa actagcataa aacatagacc
     1621 atagctgttt ggcttttttt cagacccatg ttttttccta agtcctagtt tctaagaaat
     1681 gactgggatt tgctaaaata tatatatata taaataataa cttactaata gctaaataaa
     1741 atttcctctt acaactaatt gagtggtttt tatgaatgtg tcttaattat ttaaacttga
     1801 ggcacatttt tgttttcctt acttcattgt gaatttcc
//
LOCUS       RATALDCPR    1294 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Rat aldolase gene 5'flank promoter region and exon 1
            (EC 4.1.2.13).
ACCESSION   X07845
KEYWORDS    aldolase C; fructose 1,6-bisphosphate aldolase;
            fructose bisphosphate aldolase; promoter.
SOURCE      rat (Rattus rattus).
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1294)
  AUTHORS   Skala,H.
  TITLE     ;
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 1294)
  AUTHORS   Vibert,M., Henry,A., Kahn,A. and Skala,H.
  TITLE     ;
  JOURNAL   Eur. J. Biochem. 181, 33-39 (1989)
  STANDARD  simple automatic
COMMENT     *source: library=Charon 4A; strain=Buffalo; see X06984 for rat
            aldolase C cDNA
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE          1    640       promoter region
               SITE        641    855       multiple transcription start sites
               MSG         641    901       exon 1
               IVS         902  >1294       intron I
            
            [1]  Author address Skala H., INSERM U. 129, 24 rue du Faubourg
            Saint Jacques, 75014 Paris, France.
            
            Submitted (06-JUN-1988) on tape to the EMBL data library
BASE COUNT      261 a    401 c    284 g    348 t
ORIGIN      
        1 ggatccctta tcccaataat aaaccaattg gcatggagct gactatgttt tatgctattt
       61 aaataaagtg tatttaatgt gtccttccag tccactgaat ctaattttgg gagtaggggt
      121 gggcaggtga aatcatggag gcagatcagc cctccagcct aggcagcctt cacttcctgc
      181 ggcagctaaa gccagggctg actgcttttc ttcgcttgat tctcttgctt ccactctccc
      241 ctccccccca aaaaaagggc aatggactgc ctgttccctg gatcactgcc tgtggggaac
      301 agaaaagact tcattcactt actttcccca ttccgcccac ccgcccccgc cccccggggc
      361 aggcactggt aaatatgggg cagagcatag aaactcaaat ccattattcc atgccttgaa
      421 gtgggcaggc agcctgttag gtggcctctg tcctacctcc catctactga tgtttagatg
      481 tctctcctct aaaccaggct taatccccaa cattccgcat aatgtgaaag catttgcctt
      541 tcacagcaaa agactagtta tggatagagc aaacacaagt ccatctcttt cctcatgggt
      601 tctagtggca gggttccatg atgagggagg ggcatcgtct tttacaaacc ttggggaccc
      661 tggcagagaa gtcactccac ctcactactc cacccctgaa gcgtggcctt accctagacc
      721 agtcctgggg agagggcggg accaggatgg gaggtgtctg tcacgccccc agggagtcac
      781 gtagctctgc ggcatctgct gctgccttat ttactccagc ttggactgag ctactgtctg
      841 ttgctaacca gaccaaagtg agctgtgcct gtggctccgg ccgctgactg actgagtata
      901 ggtaaggtcc cgggtgaggc ggagatgccc cgtccccgtt tgtcttcccc tgccgtcctt
      961 gccctgctcc ctaggaccct agctcctctg ctttctgtgg ctcaggaaaa ccctgcttct
     1021 ctccttcccc ctcgctctcc tttccatttc caagtcttcc tctgcctcca gaacttcact
     1081 ggggtcggtg tatcccacct agggtatttt aatctcagcc ccttcctggt ccttctaggc
     1141 ttgtccccag gtctaggaag agcatcagga ctggttaacc tgtatcacac cactatctag
     1201 ctcatctctc ccaggtctcc ttcctgcttt tcctgcatct ctccctcttt cccataatcc
     1261 actcactcag ccttaattcc tcacctctgg atcc
//
LOCUS       RATCNRAB     2267 bp ss-RNA             ROD       15-MAR-1990
DEFINITION  Rat calcineurin A beta mRNA.
ACCESSION   D90036
KEYWORDS    calcineurin; calcineurin A beta; calmodulin-binding protein;
            calmodulin-dependent protein phosphatase; isoform.
SOURCE      Rat brain, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2267)
  AUTHORS   Kuno,T., Takeda,T., Hirai,M., Ito,A., Mukai,H. and Tanaka,C.
  TITLE     Evidence for a Second Isoform of the Catalytic Subunit of
            Calmodulin-Dependent Protein Phosphatase (Calcineurin A)
  JOURNAL   Biochem. Biophys. Res. Commun. (19xx) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Takayoshi
            Kuno Department of Pharmacology Kobe University School of Medicine
            7-5-1 Kusunoki-cho,Chuo-ku Kobe 650 Japan Phone:  078-341-7451
            x3273 Fax:    078-351-6531 Peptides, 87-338 and 401-424, seem to be
            putative catalytic domain and calmodulin binding domain,
            respectively.
FEATURES       from  to/span     description
    ORF          40     1617     ORF (525 AA)
BASE COUNT      639 a    481 c    525 g    622 t
ORIGIN      
        1 cccgcccggc ccggcaccta gccgagcccc gggcccagca tggccgcccc ggagccggcc
       61 cgggccgcgc cgccccctcc cccgcccccg ccgccccccc tcggggccga ccgcgtcgtc
      121 aaagccgttc cttttccccc aactcaccgg ttgacatctg aagaagtgtt tgatatggat
      181 gggataccca gggttgatgt tctgaagaac catctggtaa aagaaggtcg ggtagatgaa
      241 gaaattgcac taagaattat caatgagggt gctgccatac ttaggcggga gaaaaccatg
      301 atagaagtag aagctccaat tacagtgtgt ggtgacatcc atggccaatt ttttgatctg
      361 atgaaacttt ttgaagtagg aggatcacct gctaatacac gatacctttt tcttggtgat
      421 tatgtggaca gaggttattt tagtatagag tgtgtcttgt atttatgggt cttgaagatt
      481 ctatacccaa gcacattatt ccttctgaga ggcaaccatg aatgcagaca cctcactgag
      541 tattttacct ttaagcagga atgtaaaatt aagtattcag aaagggtcta tgaagcttgt
      601 atggaggctt ttgacagctt gccccttgct gcacttctaa accaacaatt tctttgtgtt
      661 catggtggac tttcaccaga gatacacaca ctggatgata ttaggagatt agatagattt
      721 aaagagccac ctgcatttgg accaatgtgt gacttgctgt ggtctgatcc ttctgaagac
      781 tttggaaatg aaaaatcaca agaacatttt agtcataata cggttcgagg atgttcttat
      841 ttttataact atccagcagt gtgtgaattt ttgcaaaaca ataatttgtt atcgattatt
      901 agagctcatg aagctcaaga tgcaggctat agaatgtaca gaaaaagtca aactacaggg
      961 tttccttcat taataacaat tttttcggca cctaattact tagatgtcta caataataaa
     1021 gctgctgtac taaagtatga gaacaatgtg atgaacatcc gccagtttaa ctgttctcca
     1081 catccttact ggttgcccaa ttttatggat gtctttacat ggtctttacc atttgttgga
     1141 gaaaaagtca cagagatgtt ggtaaatgtt ctcagtattt gctctgatga tgaactaatg
     1201 acagaagggg aagaccagtt tgatgtaggt tcagctgcag cccggaaaga aatcataaga
     1261 aacaagatcc gagcaattgg caagatggca agagtcttct ctgttctcag ggaggagagt
     1321 gaaagcgtgc tgacactcaa gggcctgact cccacaggga tgttgcctag tggagtgttg
     1381 gctggaggac ggcagacctt gcaaagtgcc acagttgagg ctattgaggc tgaaaaagca
     1441 atacgaggat cctctccacc acatagaatc tgcagttttg aagaggcaaa gggtttggat
     1501 aggatcaatg agagaatgcc accccggaaa gatgctgtgc agcaagatgg tttcaattcc
     1561 ctgaacaccg cacataccac tgagaaccac gggactggca accatagtgc ccagtgacca
     1621 gcggcttccc agggatggac agatcactaa ggagccggag gggtcggccg agctgatgat
     1681 aaatgtcata atctctctga agaaacaaac cattgtgctt tttgagaccc ttgccccctt
     1741 cccggatgga ggcttgaggg ccttgggact tgtgctgtcc ataagattgg ggaatcgctg
     1801 ccacagcgga gagcagtgag caaggggctt ggggcaaatt ccagtggagg aagcccaaac
     1861 ctccatttat gcttgtggtt cacacattta cgtttacaaa tgagatttcc tttgttttct
     1921 ccctcagtag aattagattt ttttcaacca tgactttaaa tgcaatcttt agagttaatg
     1981 tggaacctcc ctcaccccca ccccatgaaa tgtctttaag aggatggatt agcatggtct
     2041 taaaatatat ttctgaggtt actagatgta ttttgaattg tagacaaaat ctgagaaacc
     2101 cagttggtgt ttatataaaa acgctgacct caggtcatag ttcttaaatg tggctaattc
     2161 tgtaacatag tcttggtatt tttcaattat gaatgcataa actatttcta agaagactct
     2221 tacttgaaca agatccaaaa aaccaattta gatccttttt gcccccg
//
LOCUS       RATGSTB      1038 bp ss-mRNA            ROD       25-SEP-1987
DEFINITION  Rat liver glutathione S-transferase Yb-1 subunit mRNA, complete
            cds.
ACCESSION   M11719
KEYWORDS    glutathione transferase.
SOURCE      Rat, cDNA to mRNA, clone pGTA/C44.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1038)
  AUTHORS   Ding,G.J.-F., Lu,A.Y.H. and Pickett,C.B.
  TITLE     Rat liver glutathione S-transferases: Nucleotide sequence analysis
            of a Yb-1 cDNA clone and prediction of the complete amino acid
            sequence of the Yb-1 subunit
  JOURNAL   J. Biol. Chem. 260, 13268-13271 (1985)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept         38      694     glutathione S-transferase Yb-1 subunit (EC
                                 2.5.1.18)
BASE COUNT      243 a    313 c    233 g    249 t
ORIGIN      Unreported.
        1 gctgaagcca aattgagaag accacagcgc cagaaccatg cctatgatac tgggatactg
       61 gaacgtccgc gggctgacac acccgatccg cctgctcctg gaatacacag actcaagcta
      121 tgaggagaag agatacgcca tgggcgacgc tcccgactat gacagaagcc agtggctgaa
      181 tgagaagttc aaactgggcc tggacttccc caatctgccc tacttaattg atggatcgcg
      241 caagattacc cagagcaatg ccataatgcg ctaccttgcc cgcaagcacc acctgtgtgg
      301 agagacagag gaggagcgga ttcgtgccga cattgtggag aaccaggtca tggacaaccg
      361 catgcagctc atcatgcttt gttacaaccc cgactttgag aagcagaagc cagagttctt
      421 gaagaccatc cctgagaaga tgaagctcta ctctgagttc ctgggcaagc gaccatggtt
      481 tgcaggggac aaggtcacct atgtggattt ccttgcttat gacattcttg accagtacca
      541 catttttgag cccaagtgcc tggacgcctt cccaaacctg aaggacttcc tggcccgctt
      601 tgagggcctg aagaagatct ctgcctacat gaattgcagc cgctacctct caacacctat
      661 attttcgaag ttggcccaat ggagtaacaa gtaggccctt gctacactgg cactcacaga
      721 gaggacctgt ccacattgga tcctgcaggc accctggcct tctgcactgt ggttctctct
      781 ccttcctgct cccttctcca gctttgtcag ccccatctcc tcaacctcac cccagtcatg
      841 cccacatagt cttcattctc cccactttct ttcatagtgg tccccttctt tattgacacc
      901 ttaacacaac ctcacagtcc ttttctgtga tttgaggtct gccctgaact cagtctccct
      961 agacttaccc caaatgtaac actgtctcag tgccagcctg ttcctggtgg gggagctgcc
     1021 ccaggcctgt ctcatctt
//
LOCUS       RATRHLA      1116 bp ss-mRNA            ROD       15-MAR-1989
DEFINITION  Rat asialoglycoprotein receptor (RHL2) mRNA, complete cds.
ACCESSION   M16347
KEYWORDS    alternative splicing; asialoglycoprotein receptor.
SOURCE      Rat (Sprague Dawley) liver RNA, clone 13D.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1116)
  AUTHORS   McPhaul,M. and Berg,P.
  TITLE     Identification and characterization of cDNA clones encoding two
            homologous proteins that are part of the asialoglycoprotein
            receptor
  JOURNAL   Mol. Cell. Biol. 7, 1841-1847 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        154     1059     asialoglycoprotein receptor (RHL2)
BASE COUNT      266 a    305 c    288 g    257 t
ORIGIN      
        1 ggtgcctaga ttagccccct cctccttctc gcctgctgtc ctgctgtccc aggtttaacc
       61 ccctttttct ccttggactc aggctgcctc cggaagcaga gtagctctct atacatttaa
      121 cagtcccaga tctgtctcca gcctagggcc atcatggaga aggactttca agatatccag
      181 cagctggact ctgaggaaaa cgaccatcag ctcattggcg atgaggaaca aggctctcat
      241 gtgcagaatc ttaggaccga aaatccacgt tggggaggac agcctccttc caggcccttt
      301 ccacagcgcc tctgctccaa gttccgcctc agtctgctcg ccctggcctt caacattctc
      361 ctgctggtgg tcatctgtgt ggtttcatcc caaagcatgc agctgcaaaa ggagttctgg
      421 accctgaaag aaaccttgag caacttctcc accaccaccc tgatggagtt caaggctctg
      481 gactcccacg gaggtagcag gaatgacaac ttgacttctt gggaaacaat actggagaaa
      541 aagcagaagg acataaaagc agatcactcc acgctgctct tccacctgaa gcacttcccc
      601 ctggatctgg caaccctgac ctgtcagctg gcgttcttcc tgagcaacgg cacagaatgc
      661 tgccccgtta actgggtgga gtttggtgga agctgctact ggttttctcg ggatgggctc
      721 acctgggctg aggctgacca gtactgccaa atggagattg cccatctgct ggtcatcaac
      781 tcaagggagg agcaggaatt cgttgtaaag cacaggggcg cgtttcacat ttggataggt
      841 ctcaccgaca aggatggctc ctggaaatgg gtggatggga cggaatatag aagtaacttc
      901 aagaattggg ctttcactca gccagataac tgccaggggc atgaagaggg gggaagtgaa
      961 gactgtgctg aaatcctgtc agatggcctc tggaatgaca acttctgcca gcaggtgaac
     1021 cgctgggctt gtgaaaggaa acgggacatc acctactagg agtctgctct actatgtctt
     1081 tgtcaccctc cgggaacccc gcatcactca ttagga
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       ECOPHNAQ    15611 bp ds-DNA             BCT       02-APR-1990
DEFINITION  E.coli psiD locus containing alkylphosphonate uptake (phn) genes A
            through Q, complete cds.
ACCESSION   J05260
KEYWORDS    C-P lyase; HisM-like integral membrane protein;
            HisP-like nucleotide binding protein; inverted repeats.
SOURCE      E.coli (strains B) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 15611)
  AUTHORS   Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T.
  TITLE     Molecular biology of carbon-phosphorus bond cleavage: Cloning and
            sequencing of the phn (psiD) genes involved in alkylphosphonate
            uptake and C-P lyase activity in Escherichia coli B
  JOURNAL   J. Biol. Chem. 265, 4461-4471 (1990)
  STANDARD  full staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by C.-M.Chen,
            01-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept       2874     3209     phnA protein
    pept       3767     4210     phnB protein
    pept       4343     5131     HisP-like nucleotide binding protein (phnC)
    pept       5156     6172     phnD protein
    pept       6227     7057     phnE protein
    pept       7078     7803     phnF protein
    pept       7804     8256     phnG protein
    pept       8253     8837     phnH protein
    pept       8837     9901     phnI protein
    pept       9894    10739     phnJ protein
    pept      10736    11494     HisP-like nucleotide binding protein (phnK)
    pept      11605    12285     HisP-like nucleotide binding protein (phnL)
    pept      12282    13418     HisM-like integral membrane protein (phnM)
    pept      13418    13975     HisP-like nucleotide binding protein (phnN)
    pept      13962    14396     phnO protein
    pept      14398    15156     phnP protein
    pept      15131    15496     phnQ protein
    pept       2473      245 (c) ORF742
    pept       7470     7030 (c) ORF146
    pept       9298     8489 (c) ORF269
    pept      12264    11920 (c) ORF114
    pept      14760    14380 (c) ORF126
    rpt        3224     3257     inverted repeat a
    rpt        3271     3306     inverted repeat b
    rpt        3324     3357     inverted repeat c
    rpt        3371     3406     inverted repeat d
    rpt        3424     3457     inverted repeat e
    rpt        3471     3506     inverted repeat f
    rpt        3524     3557     inverted repeat g
    rpt        3571     3606     inverted repeat h
    rpt        3624     3657     inverted repeat i
    rpt        3671     3706     inverted repeat j
    rpt        6176     6211     inverted repeat k
    rpt        6229     6262     inverted repeat l
    rpt       11501    11534     inverted repeat m
    rpt       11548    11583     inverted repeat n
    signal     2622     2627     -35 region
    signal     2645     2650     -10 region
    signal     2738     2743     -35 region
    signal     2771     2776     -10 region
    signal     4074     4079     -35 region
    signal     4096     4101     -10 region
    signal     2084     2079 (c) -10 region
    signal     2108     2103 (c) -35 region
    signal     2720     2715 (c) -10 region
    signal     2742     2737 (c) -35 region
    signal    12056    12051 (c) -10 region
    signal    12080    12075 (c) -35 region
    signal     2737     2754     phosphate (pho) box
BASE COUNT     3466 a   4625 c   4306 g   3214 t
ORIGIN      
        1 ggatccagca tcgacgccag tttttccacc attgtcagtc gcaggctaag cggcgcattt
       61 aacatgccgc cgttcgtcca tgtctgaagc tgcacacgcg aaagaagttc ctgcatcagt
      121 cgttcacgaa actgctgctg atgggcttgt ggaaggcggg catcatcgcc ctgcgccaga
      181 tccactaaaa agcggggata aaccgactcc agcacgcgac cggggccgtc cagtaacgtc
      241 ttggtcaata tcgttctgcc gtgaaaagtg tttgaatatc atcgcgtaac agctgggcgt
      301 cggtgtaaat ccagccgtga gtcatcacag tctgctgcaa ttgctgctgc atcagcctga
      361 ccaccgattc attttgttga cgcagagcca ggctttcgcg taaacgcgtc tgtaattccg
      421 tcaaacatga agcgaactca gcgaaaaaag tattcatgcc tgccgtaaca gattcatcga
      481 cctgctctgc cagaacttta gccatttgtt ggcaataaag atcgacttct gcgcttaatg
      541 ctcgttgcaa cacactgtaa tcaaccgttt ctgtcgggga tttctcattt ccccgtcccc
      601 agtcgggctg attcaaccag cgcgaaaaag tctcacgcac aacgcctaaa cgcgtgctct
      661 gctcgtccgt tgcatcctgg cgcgaaatga ctgcactgaa cagctggcga gtgttgaagt
      721 ggggaactac gccgtgaaaa acaggaaaat gaaacccagg acgaaaccct gactcgctca
      781 attccatttt gacttgttgc tcaatggggc gaataacatc ggttaacact cggcaaaggg
      841 tggattccag ctcggcaaaa cgcagcgtaa agtcgcgact gatggtgttc tgcgccgtct
      901 gtaacagtgt ctcacagcgg gtacgcatct cgcttaacgg ctccgaatca tcctgaaaca
      961 aggcggctaa ctgcgcattc agcgcatctt gttgttgacg cagaaagtgg ttggcggagg
     1021 tcagggccag ctcgatttca tgtttaatct cgccgctcac ctgcgcctga ttgagttgca
     1081 atagctgcaa actttcttcg acctgatgga tattttgccg caattgttca caagcgacgt
     1141 ttaacccgtg cgcacgaaaa tccaggtatt cccgcgcctg ctgcgcgtaa ttcaacagtt
     1201 tatgcgcagc agatcgcaaa gcatacaacg aggcgttagc gtaagcggca tgaagcaacg
     1261 cctgaattgg ctgggcgaac agcgaatctt cccacaactg atcggcagca tgacgaatat
     1321 gttcgaggtc cgccagatcg gcatgacgcc agcgcctgcc gagcgcggca tgggcaaaat
     1381 cttccaccca gcgttgttgc tctggcgctg gtaacttacc gttgttggct aactcatggc
     1441 gcgcccgatt cgccaggtag ccccacatcg acgacaccgg aaatatctgc tgtggcgtaa
     1501 tacagccttt catcagcgtc ccggaaatca gtgcccgcac ctggtcggcg tcgtcactgt
     1561 tacgatcctg ttgatcgaac ttattgacca gcacatacag cggcaccgat tgccccaccg
     1621 ccaaaatcgc ctcacggacc tcttcatcgg agatcgattt cagttgcgta taatccagca
     1681 ccgccagtac cgccgaggcg cgtgccagct gctggttaag cattttttgc agatgcggtt
     1741 gcccggcttc atttggcccg ggggtatcca gtaacgtcaa ctgaccggga taactctcca
     1801 gccccgccag atggacaaac tccacttcaa tcacgggaat atgctcaatg gcggcgtaag
     1861 cagaaaaagg aaaatcgacg tccagcgcct tcgccagtcg cactaaatca ttcaaacttt
     1921 tcagacaatg aaaaataggc tgggcaccca gataatattt ttcgaaagcg acgccatttt
     1981 cgatccgctg cataagcgca cgcatatctt tatctatttc cagcacatcg gtcagatgct
     2041 taatatcgca atcacgcagg cgctgttgta attgttgaat taaacaatcg attggcgcga
     2101 catgtgaaaa atgcagtacc ggttcctttt gcccgggcgt atggcgaata agcgtcggca
     2161 gcgcagtcat tgggcgatta cgattaggca gaacctccgt accaacaatg gcattaatgg
     2221 tggttgattt ccctgctttc atggtaccga caattgcaag caccatttcc agtcgggaaa
     2281 ttttacgcaa ctcattattc agcatcgcgt gacgttcggc gatattaggc tgactccagg
     2341 gtaaagccag ttgtggcgcg tcgtctccgg gtacagagag aggcattttt tccagtaact
     2401 gcaactgttg gcgagaaagc tgtaacaggc gttcagcctc ctgacttaac tcatacaggg
     2461 tctgtgtgta catagaaaat tcttccttaa agcaaatttt gttattttat ttagccagat
     2521 tgtttttgag ttctgttttc ggcttttata attactgcaa gaaataattt tatatttagt
     2581 gtgttgtttt ttatcagaat aaataacgtc ttctgatacg tttaaaacgt cagaaagata
     2641 aaaatatcat gtgaattaaa aaaagaacaa gtagagcatt aacattatct taaataataa
     2701 atagaggcaa aaagattatt ttctttttgc gtttcctttc aaatgaaaac gatcgtcgtc
     2761 taaaatcagc agtacccccg acaaactcag ggattttgtg tataattgcg gcctttttcg
     2821 gcaatctgcc gttttttggc gcttttgccc tgctgacttt tgaggaaatc cacatgtcat
     2881 taccacactg cccaaaatgc aactccgaat acacttacga agataacggc atgtacatct
     2941 gcccggaatg tgcctacgaa tggaacgacg cagaacctgc acaggaaagc gacgagctga
     3001 tcgttaaaga tgctaacggc aatctgctgg ctgacggcga cagcgttacc atcattaaag
     3061 atctgaaggt gaaaggtagc tcttcgatgc tgaaaattgg caccaaagtg aaaaacatcc
     3121 gcctggttga aggcgaccat aacatcgatt gcaaaatcga cggttttggt ccgatgaaac
     3181 tgaaatctga gtttgtgaaa aagaactgat tgtattgtga tcggtaagcc ggataaggcg
     3241 ctcgcgccgc atccggcaac ggtgccagat gcctgatgcg acgcttgcgc gtcttatcag
     3301 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac
     3361 ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc
     3421 tccgtaggcc ggataaggcg tttacgccgc atccggcaac agtgccaact gcctgatgcg
     3481 acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg
     3541 tttacgccgc atccggcaat ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag
     3601 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac
     3661 agtgccgact gcctgatgcg acgctcgcgc gtcttatcag gccgcctctc atctgtataa
     3721 atttcgaact acacttaact ggcttctctt aactgaggtc accatcatgc cgttaagtcc
     3781 ctacctctct tttgccggta actgttccga cgcgattgcc tattatcaac gtacgttggg
     3841 cgcggaactg ctctataaaa tcagcttcgg cgaaatgcca aaatcagcgc aggacagcgc
     3901 cgagaactgc ccttccggaa tgcaatttcc cgataccgcc atcgctcatg ccaacgtgcg
     3961 cattgccgga agcgacatca tgatgagcga tgccatgccg tcaggaaaag ccagctactc
     4021 cggctttacg ctggtgctcg attcgcaaca ggtcgaagaa ggaaaacgct ggtttgacaa
     4081 tcttgccgct aacggaaaaa tcgaaatggc ctggcaggaa actttctggg cgcatggctt
     4141 tggcaaagtc accgataaat ttggcgtacc gtggatgatt aatgtcgtca aacaacaacc
     4201 aacgcaataa cccgccggga ggcccgccct cccgcactgt catcgaattc ccgttaactc
     4261 ttcatctgtt agtcactttt aattaaccaa atcgtcacaa taatccgcca cgatggagcc
     4321 acttttttag ggaggctgca tcatgcaaac gattatccgt gtcgagaagc tcgccaaaac
     4381 cttcaatcag catcaggcgc tgcatgcggt tgatctgaac attcatcacg gtgaaatggt
     4441 ggctctgctt gggccgtcgg gttccggcaa atccaccctt ttacgtcact taagcggttt
     4501 gattaccggc gataaatccg ccggcagcca tatcgagctg ctgggccgca cagtccagcg
     4561 cgaaggccgt ctggcgcgcg atatccgcaa aagccgcgcc aacaccggct acatcttcca
     4621 acaattcaac ctggtgaacc gcctgagcgt actggagaac gtgctgattg gcgcgctcgg
     4681 cagcacgccg ttctggcgca cctgttttag ctggtttacc cgcgagcaga aacaacgcgc
     4741 gttacaggcg ctgacccgcg ttggcatggt gcattttgcc catcaacgcg tttccaccct
     4801 ctccggcgga cagcagcagc gtgtggcgat tgcccgcgcg ctgatgcagc aggcgaaggt
     4861 gattctggcc gatgaaccca tcgcctcgct ggacccggaa tccgcccgca tcgtgatgga
     4921 caccctgcgc gacatcaatc agaacgacgg catcaccgtg gtcgtcacgc tgcatcaggt
     4981 ggattacgcc ctgcgctact gcgaacgcat cgtcgccctg cgccaggggc acgttttcta
     5041 cgacggcagc agccaacagt ttgataacga acgttttgac catctctacc gcagcattaa
     5101 tcgcatcgaa gagaacgcga aagctgcctg acatccccat cattgaggaa aacgaatgaa
     5161 cgctaagata attgcctcgc tggccttcac cagcatgttc agcctcagca ccctgttaag
     5221 cccggcacac gccgaagagc aggaaaaggc gctgaatttc ggcattattt caacggaatc
     5281 acagcaaaac ctgaaaccgc aatggacgcc attcttacag gatatggaga agaagctggg
     5341 cgtgaaggtg aacgccttct ttgccccaga ctacgcaggc attatccagg gaatgcgctt
     5401 caataaagtg gatatcgcct ggtacggcaa cctgtcggca atggaagcgg tggatcgcgc
     5461 caacggccag gtcttcgccc agacggtcgc ggcggatgga tcgccaggtt actggagcgt
     5521 gttgatcgtc aacaaagata gtccgatcaa caacctgaac gatctgctgg cgaagcggaa
     5581 agatctcacc ttcggcaatg gcgatcctaa ctccacctct ggcttcctcg tccccggtta
     5641 ctacgtcttc gccaaaaaca atatctccgc cagcgacttc aagcgcaccg tcaacgccgg
     5701 gcatgaaacc aacgcgctgg ccgtcgccaa caagcaggtg gatgtggcga ccaacaacac
     5761 cgaaaacctc gacaagctga aaacctccgc gccggagaag ctgaaagaac tgaaagtgat
     5821 ctggaaatcg ccgctgatcc caggcgatcc gatcgtctgg cgtaaaaatc tttccgaaac
     5881 caccaaagac aagatctacg acttctttat gaattacggc aaaacgccgg aagagaaagc
     5941 ggtgctggaa cgcctgggct gggcgccgtt ccgcgcctcc agcgacctgc aactggtgcc
     6001 gattcgccag ctcgcactgt ttaaagagat gcagggcgtg aaaagcaata aaggactgaa
     6061 tgagcaggac aagctggcaa aaaccaccgc gattcaggcg caactggatg acctggaccg
     6121 cctgaacaac gcgctaagcg cgatgagttc ggtgagtaaa gcggtgcagt aaatcgtagg
     6181 tcggataaga cgccccggcg tcgcatccga caatgtgcag gcgttgatgc cggatgcggt
     6241 gcaagcacct tatccggcct acagaccgga gccaaacatg caaaccatca ccatcgcccc
     6301 acccaagcgc agctggttct cgcttctgag ctgggccgtt gttctcgccg tgctggtcgt
     6361 ctcgtggcag ggcgcggaaa tggccccgct cacgctgatt aaagacggcg gcaacatggc
     6421 aaccttcgct gccgacttct tcccgcccga tttcagccag tggcaggatt acctcaccga
     6481 aatggccgtc acgctgcaaa tcgccgtctg gggcaccgcg ctggcggtgg ttctctccat
     6541 cccctttggc ctgatgagcg ccgaaaacct ggtgccgtgg tgggtttacc agcccgttcg
     6601 ccgcctgatg gacgcctgcc gcgccattaa cgaaatggtc ttcgccatgc tgttcgtggt
     6661 cgccgtcggt ctcggaccgt tcgctggcgt gctggcgcta tttatccaca ccaccggcgt
     6721 gctctccaag ctgctttccg aagcggtaga agcaattgaa cctggcccgg tggaaggcat
     6781 tcgcgccacc ggtgccaaca agctcgaaga gatcctctac ggcgtgctgc cgcaggtgat
     6841 gccgctgctg atctcctact ccctctatcg cttcgaatcc aacgtccgct cggcgaccgt
     6901 cgtcggcatg gtcggcgcgg gcgggatcgg cgtcaccctg tgggaagcga ttcgcggttt
     6961 ccagttccaa caaacctgcg ccctgatggt gcttatcatc gtcacggtca gcctgctgga
     7021 tttcctctct caacggttgc gtaagcactt tatctgataa gcgaggcatt gatatctatg
     7081 cacttgtcta cacatccgac cagctaccca acacgctatc aagagatagc cgcaaaactt
     7141 gagcaggagc ttcgtcaaca ctaccgctgc ggcgactatc ttcccgccga gcagcaactg
     7201 gcagcgcgct ttgaggtgaa tcgccacacc ctgcgccgcg ccatcgacca actggtggaa
     7261 aaaggctggg tacagcgccg tcagggcgtc ggcgtgctgg tgctgatgcg cccgttcgat
     7321 tacccgctca acgcccaggc gcgttttagc cagaatctgc tggatcaggg cagccatccc
     7381 accagcgaaa aactgctttc ggtattgcgc cccgcgtccg gccacgtcgc tgacgcactg
     7441 gggattaccg agggggagaa cgtcatccac ctgcgcaccc tgcgtcgggt caacggcgtc
     7501 gcgctctgtt taatcgacca ctacttcgcg gacctcaccc tctggccgac gctgcaacgc
     7561 ttcgacagcg gctcgctgca cgattttctg cgcgagcaaa ccggaattgc gctgcgccgc
     7621 agccagacgc ggatcagcgc ccgccgcgcc caggccaaag agtgccagcg tcttgaaatc
     7681 ccgaatatgt cgccgctgct gtgcgtgcgc acccttaacc accgtgacgg tgaaagcagc
     7741 ccggcggagt actccgtcag cctgacgcgc gccgacatga ttgaattcac tatggagcac
     7801 tgaatgcacg cagataccgc gacccgccag cactggatgt ccgtgctggc gcacagccaa
     7861 ccggctgaac tggcagcacg cctgaacgcg ctaaacatca ccgccgacta tgaggtgatc
     7921 cgcgccgctg aaactggcct ggtacagatt caggcgcgga tgggcggcac cggcgaacgt
     7981 ttttttgccg gcgacgccac gctgacccgc gccgccgtgc gcctgactga cggcacgctc
     8041 ggctacagct gggtgctggg gcgtgataaa cagcacgccg aacgctgcgc gctgattgac
     8101 gcgctgatgc agcaatctcg ccactttcaa aacttatcag aaacccttat tgccccgctg
     8161 gacgctgacc gtatggcacg cattgccgca cgccaggccg aagtgaacgc cagccgggtc
     8221 gacttcttta cgatggttcg cggagacaac gcatgaccct ggaaaccgct tttatgcttc
     8281 ccgtgcagga tgcccagcac agttttcgtc gcctgttaaa ggccatgagc gagccgggcg
     8341 tgattgtcgc cctgcatcag ctcaaacgcg gctggcaacc gctgaatatc gccaccacca
     8401 gcgtgctgct gacgctggcc gataacgaca cgccggtgtg gctttctacc ccattaaata
     8461 acgatatcgt caaccagagc ctgcgttttc ataccaacgc gccgctggtc agccagccgg
     8521 aacaggcgac cttcgcggtg acggatgagg cgatttccag cgaacagctc aacgcccttt
     8581 ccaccggcac cgccgttgcg ccggaagcgg gcgcgacgct gattttacag gtcgccagcc
     8641 tgagcggcgg gcgcatgttg cgtctcaccg gcgcgggtat tgccgaagaa cgaatgatcg
     8701 ctccgcagct gccggagtgc attctgcacg aactcaccga gcgcccgcac ccgttcccgc
     8761 tcggcatcga cctgatcctg acctgcggcg aacgcctgct ggctattccg cgaaccacgc
     8821 atgtggaggt gtgctgatgt acgttgccgt aaaagggggc gaaaaggcga tcgacgccgc
     8881 ccacgccctg caagagagcc gacgccgggg cgataccgat ttgcctgaac tgagcgtcgc
     8941 ccagattgaa cagcagctta acctcgcggt agatcgcgtg atgaccgaag gcggcattgc
     9001 cgaccgcgaa ctggcggcgc tggcgctgaa acaggccagc ggcgataacg ttgaagcgat
     9061 tttcctgctg cgcgcctacc gcaccacgtt ggcgaagctg gcggtaagcg agccgctcga
     9121 caccaccggg atgcgtctcg aacgccgtat ctccgccgtt tataaagaca ttcccggcgg
     9181 ccagctgctt ggcccaacct acgactacac ccatcgcctg ctcgatttta ccctgctggc
     9241 aaacggcgaa gcgccgacgc tgaccaccgc cgacagcgaa caacagccgt cgccgcacgt
     9301 tttcagcctg ctggcgcgtc aggggctggc gaagtttgaa gaggatagcg gcgcacagcc
     9361 ggatgacatc acccgcacgc cgccggttta cccctgctca cgttcttccc gtttgcagca
     9421 gttgatgcgc ggcgacgaag gctatttgct ggcgctggcc tactccaccc agcgtggtta
     9481 cggacgcaat cacccgttcg cgggcgagat ccgcagtggt tacatcgacg tgtcgattgt
     9541 gccggaagag ctgggatttg cggtaaacgt cggcgaacta ctgatgaccg agtgtgaaat
     9601 ggtcaacggt tttatcgacc cgccggatga gccgccgcac ttcacgcgcg gctacgggct
     9661 ggtattcggc atgagcgagc gcaaagcgat ggcaatggcg ctggtcgatc gtgcgttgca
     9721 ggctccggaa tacggcgagc acgcgacagg cccggcgcag gatgaagagt ttgtgctggc
     9781 acatgccgac aacgtcgaag ccgcaggctt tgtctcgcac ctcaaactcc cccactacgt
     9841 cgatttccag gccgaactgg agctactcaa acgtctgcaa caggagaaga accatggcta
     9901 atctgagcgg ctacaacttt gcctacctcg acgagcagac caaacgcatg atccgccgcg
     9961 ccatcttaaa agcggtggcg atccccggtt atcaggtgcc gtttggcggg cgcgagatgc
    10021 cgatgccata cggctgggga accggcggca tacagctcac cgccagcgtg attggcgaaa
    10081 gcgacgtgct aaaggtgatt gaccagggtg cggatgacac caccaacgcc gtgtcgattc
    10141 gcaacttctt taagcgcgtg accggggtaa acaccactga acgtacggac gatgcgacgc
    10201 ttatccagac gcgtcaccgc atccccgaaa cgccgctgac cgaagatcag atcattatct
    10261 tccaggtgcc aatcccggaa ccgctgcgct ttatcgagcc gcgcgaaacg gaaacccgca
    10321 ccatgcacgc gctggaagag tacggcgtga tgcaggtgaa actgtatgaa gatatcgccc
    10381 gcttcggtca tatcgccact acctacgcct atccggtgaa ggtgaacggg cgctacgtaa
    10441 tggacccgtc gccgatcccg aaattcgata acccaaaaat ggacatgatg cccgccctgc
    10501 aactgttcgg cgcggggcgc gagaagcgca tctatgcggt gccgccgttt acccgcgtgg
    10561 aaagtctcga tttcgacgat cacccgttca ccgttcagca gtgggatgag ccatgcgcca
    10621 tctgcggatc gacccacagc tatcttgatg aagtggtgct ggatgacgcc ggaaaccgca
    10681 tgtttgtctg ctccgatacc gattattgcc gccaacagag cgaggcaaaa aaccaatgaa
    10741 tcaaccgtta ctttcggtca ataacctgac ccacctttac gcgccgggca aaggctttag
    10801 cgatgtctct tttgatttat ggccggggga agtgctgggc attgtcgggg aatccggctc
    10861 cgggaagacc acgctgctga agtcgatctc cgcgcgcctg acgccgcagc agggggaaat
    10921 tcactacgag aaccgttcgc tgtatgcaat gagcgaggcc gaccgccgtc gcctgctgcg
    10981 taccgaatgg ggcgtggtgc atcagcatcc actcgacggc ctgcgccgcc aggtgtcggc
    11041 aggcggcaat atcggcgagc ggctgatggc gaccggggca cgtcattacg gcgatattcg
    11101 tgccaccgcg cagaagtggc tggaagaggt ggagattccc gccaaccgga tcgacgacct
    11161 gccgaccacc ttttccggcg gtatgcagca gcgtttgcag attgcccgca acctggtgac
    11221 gcatccgaag ctggtgttta tggatgaacc gaccggcggg ctggatgtgt cggtgcaggc
    11281 ccgcctgctc gacctgctgc gcggcctggt ggtggagctg aacctcgcgg tggtgattgt
    11341 cacccatgat ttaggcgtcg cccgcctgct ggcggaccgt ttgctggtga tgaagcaggg
    11401 gcaagtggtg gagagtgggt taaccgaccg cgtgctcgac gacccgcatc atccgtatac
    11461 acagctgctg gtgtcatcgg ttttgcagaa ttgagccggt gccggatgcg gcgtaaacgc
    11521 cttatccggc ctacaaatgc gctccccgta ggtcggataa gacgcgtcag cgtcgcatcc
    11581 gacacccgaa ccacgaggcg aaaaatgatt aacgtacaaa acgtcagtaa aaccttcatc
    11641 ctgcaccagc aaaacggcgt gcgcctgccc gtcctcaatc gcgcctcgct caccgtcaac
    11701 gcgggcgaat gcgtggtgct ccacggccat tccggcagcg gcaaatcaac tctgctacgc
    11761 tcgctgtacg ccaactatct acccgacgaa ggtcaaatcc agatcaaaca cggtgacgag
    11821 tgggtagacc tggtcaccgc gccagcgcgc aaagtggtgg aaatccgcaa aaccaccgtc
    11881 ggctgggtga gccagtttct gcgcgtcatc ccgcgtatct cagcactgga agtggtgatg
    11941 cagccgctgc tcgataccgg cgttccgcgt gaagcctgcg ccgctaaagc cgcgcgtctt
    12001 ctcacccgcc tgaacgtgcc ggaacgcctg tggcacctgg caccatcgac attttccggt
    12061 ggcgaacagc agcgcgtcaa catcgcccgc ggctttatcg tcgactaccc cattctgctg
    12121 cttgacgaac ctaccgcctc gctggacgcc aaaaacagcg ccgcggtggt ggaactgatt
    12181 cgcgaagcca aaacccgtgg cgcagccatc gtaggcatct tccatgacga agctgtacgt
    12241 aatgacgtcg ccgaccgcct gcacccaatg ggagcctctt catgattatc aataacgtta
    12301 agctggtgct ggaaaacgag gtggtaagcg gttcgctgga ggtgcagaac ggcgaaatcc
    12361 gcgcctttgc cgaaagccag agccgcctgc cggaggcgat ggacggcgaa ggcggctggc
    12421 tgctgccggg gctgattgag ctgcataccg ataatctgga taaattcttc accccgcgcc
    12481 cgaaagttga ctggcctgcc cactcggcga tgagcagcca cgacgcgctg atggtggcga
    12541 gcggcatcac caccgtactg gatgccgtgg caattggcga cgtgcgcgac ggcggcgatc
    12601 ggctggagaa tctggagaag atgatcaacg ccatcgaaga gacgcagaaa cgcggcgtca
    12661 accgcgccga gcaccgtctg catctgcgct gcgaactgcc gcatcacacc acgctgccgc
    12721 tgtttgaaaa actggtgcag cgcgagccgg tgacgctggt gtcgctgatg gaccactcgc
    12781 cgggccagcg ccagttcgcc aaccgcgaga agtatcgcga atattatcag ggcaaatact
    12841 ccctcactga tgcgcagatg cagcagtacg aagaagagca actggcgctc gccgcacgct
    12901 ggtcgcagcc gaatcgcgaa tccatcgccg ccctgtgccg cgcgcgaaaa attgcgcttg
    12961 ccagccacga tgacgccacc cacgcccacg ttgctgaatc tcaccagctt ggcagcgtga
    13021 tcgccgaatt tcccaccacg ttcgaagcgg cggaagcctc gcgcaagcat ggcatgaacg
    13081 tgctgatggg cgcgccgaat attgtgcgcg gcggctcgca ctccggcaac gtggcggcca
    13141 gtgaactggc gcagcttggc ctgctggata tcctctcttc cgactactac cccgccagcc
    13201 tgctcgatgc ggcatttcgc gtcgccgatg acgagagcaa ccgctttacg ctgccgcagg
    13261 cggtgaagct ggtgactaaa aatccagcgc aggcgcttaa tctccaggat cgcggggtga
    13321 ttggcgaggg caaacgcgcc gacctggtgc tggcgcatcg caaggacaat catattcata
    13381 tcgaccacgt ctggcgtcag ggtaaaaggg tgttctgatg atgggaaaac tgatttggtt
    13441 aatggggccg tccggctccg ggaaagacag cctgctggcg gaactccgcc tgcgggaaca
    13501 aactcagtta ctggtggcgc atcgctacat cacgcgcgat gccagcgccg gaagtgaaaa
    13561 ccatatcgcc ctgagcgagc aggagttttt tacccgcgcg gggcaaaatc tgttggcctt
    13621 aagctggcac gctaacggtc tgtattatgg cgtcggcgtc gagattgatc tctggctgca
    13681 cgccggattc gacgtgctgg tcaacggctc acgcgcccat ctgccgcagg cgcgggcgcg
    13741 ctatcaatcg gcgctgctgc ccgtctgttt acaggtttcg ccggagatcc tccgccagcg
    13801 cctggaaaac cgtggccgtg aaaacgccag tgaaattaac gcccgcctgg cgcgcgccgc
    13861 ccgctatact ccacaggatt gccatacgct caacaatgac ggcagcctgc gccagtcggt
    13921 cgacacgctg ctgacgctga tccatcagaa ggagaaacac catgcctgct tgtgagcttc
    13981 gcccggccac gcagtacgac accgacgcgg tttacgcgct gatttgtgag ctaaaacagg
    14041 cggagtttga ccaccacgcg tttcgcgtgg gttttaacgc caatctgcgc gacccaaaca
    14101 tgcgctacca tctggcgctg cttgatggcg aagttgtcgg catgatcggc ctgcatttgc
    14161 agtttcatct gcatcatgtc aactggatcg gcgaaattca ggagttggtg gtaatgccgc
    14221 aggcgcgcgg tctgaacgtc ggcagtaagt tactggcgtg ggcagaagaa gaagcccgcc
    14281 aggccggggc cgaaatgacc gaactttcga ccaacgtgaa gcgccacgac gcgcaccgtt
    14341 tctatctgcg cgaaggctac gagcagagcc acttccgctt caccaaggcg ctgtaacatg
    14401 agcctgaccc tcacgctcac cggcaccggc ggcgcacagg gcgttccggc atggggctgc
    14461 gagtgtgcgg cctgcgccag agcgcggcgc tcgccgcagt atcgccgcca accgtgcagc
    14521 ggcgtagtga agtttaacga cgcaatcacc ctgatcgacg ccgggctgca cgatctcgcc
    14581 gatcgctggt cgcccggatc gttccagcag tttttgctga cgcattatca tatggatcac
    14641 gtccaggggc tgtttccgct gcgctggggc gttggcgatc cgatcccggt ttacggcccg
    14701 ccggatgaac agggctgcga cgatctgttt aaacatccgg gcctgcttga tttcagccac
    14761 acggtggaac cgtttgtggt gtttgatttg caggggttac aggtcacgcc cctgccgctc
    14821 aaccactcaa aactgacctt cggttatctg ctggaaacgg cacacagccg ggtggcgtgg
    14881 ctgtctgaca ccgcaggctt gccggaaaaa acgctgaaat ttttacgcaa taatcagccg
    14941 caggtaatgg tgatggattg cagtcacccg ccgcgcgcgg atgcaccgcg taatcactgt
    15001 gatttaaata ccgtgcttgc gctgaatcag gttatccgct cgccacgggt gattctgacc
    15061 catatcagcc accagtttga tgcgtggctg atggaaaacg cactaccgtc agggtttgag
    15121 gtggggtttg atgggatgga gattggggtg gcgtgatgag agggaatgtg cgcgctggcc
    15181 ccctcaccct aaccctctcc ccagaggggc gaggggaccg attgtgctcg atattgaata
    15241 ttgcgctcgt tttctccctc tccccattgg ggtgaggggc gatgcctgct ccatacccaa
    15301 cctcatcgcc catactcatc ttccattctc cgctcttcat cctccagttg ccgacgctcc
    15361 tgatcaagct ggcgctggcg atcgtccagc tgcctgcggc gatcttcaaa ctggcggcgg
    15421 cggtcgtcat attgtctgcg ccgatcgtcg ctcacttcac gctgccagcc gtggtcgcgc
    15481 gaatcttcat agttgaagcg gcgcacgaaa aacgcgaaag cgtttcacga taaatgcgaa
    15541 aactttagct ttcgcgcttc aaatgaaaca gatgtattaa ttactgcttt ttattcatta
    15601 catggggatc c
//

GenBank-Updates@genbank.bio.net (04/06/90)

LOCUS       ECOPHNAQ    15611 bp ds-DNA             BCT       02-APR-1990
DEFINITION  E.coli psiD locus containing alkylphosphonate uptake (phn) genes A
            through Q, complete cds.
ACCESSION   J05260
KEYWORDS    C-P lyase; HisM-like integral membrane protein;
            HisP-like nucleotide binding protein; inverted repeats.
SOURCE      E.coli (strain B) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 15611)
  AUTHORS   Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T.
  TITLE     Molecular biology of carbon-phosphorus bond cleavage: Cloning and
            sequencing of the phn (psiD) genes involved in alkylphosphonate
            uptake and C-P lyase activity in Escherichia coli B
  JOURNAL   J. Biol. Chem. 265, 4461-4471 (1990)
  STANDARD  full staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by C.-M.Chen,
            01-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept       2874     3209     phnA protein
    pept       3767     4210     phnB protein
    pept       4343     5131     HisP-like nucleotide binding protein (phnC)
    pept       5156     6172     phnD protein
    pept       6227     7057     phnE protein
    pept       7078     7803     phnF protein
    pept       7804     8256     phnG protein
    pept       8253     8837     phnH protein
    pept       8837     9901     phnI protein
    pept       9894    10739     phnJ protein
    pept      10736    11494     HisP-like nucleotide binding protein (phnK)
    pept      11605    12285     HisP-like nucleotide binding protein (phnL)
    pept      12282    13418     HisM-like integral membrane protein (phnM)
    pept      13418    13975     HisP-like nucleotide binding protein (phnN)
    pept      13962    14396     phnO protein
    pept      14398    15156     phnP protein
    pept      15131    15496     phnQ protein
    pept       2473      245 (c) ORF742
    pept       7470     7030 (c) ORF146
    pept       9298     8489 (c) ORF269
    pept      12264    11920 (c) ORF114
    pept      14760    14380 (c) ORF126
    rpt        3224     3257     inverted repeat a
    rpt        3271     3306     inverted repeat b
    rpt        3324     3357     inverted repeat c
    rpt        3371     3406     inverted repeat d
    rpt        3424     3457     inverted repeat e
    rpt        3471     3506     inverted repeat f
    rpt        3524     3557     inverted repeat g
    rpt        3571     3606     inverted repeat h
    rpt        3624     3657     inverted repeat i
    rpt        3671     3706     inverted repeat j
    rpt        6176     6211     inverted repeat k
    rpt        6229     6262     inverted repeat l
    rpt       11501    11534     inverted repeat m
    rpt       11548    11583     inverted repeat n
    signal     2622     2627     -35 region
    signal     2645     2650     -10 region
    signal     2738     2743     -35 region
    signal     2771     2776     -10 region
    signal     4074     4079     -35 region
    signal     4096     4101     -10 region
    signal     2084     2079 (c) -10 region
    signal     2108     2103 (c) -35 region
    signal     2720     2715 (c) -10 region
    signal     2742     2737 (c) -35 region
    signal    12056    12051 (c) -10 region
    signal    12080    12075 (c) -35 region
    signal     2737     2754     phosphate (pho) box
BASE COUNT     3466 a   4625 c   4306 g   3214 t
ORIGIN      
        1 ggatccagca tcgacgccag tttttccacc attgtcagtc gcaggctaag cggcgcattt
       61 aacatgccgc cgttcgtcca tgtctgaagc tgcacacgcg aaagaagttc ctgcatcagt
      121 cgttcacgaa actgctgctg atgggcttgt ggaaggcggg catcatcgcc ctgcgccaga
      181 tccactaaaa agcggggata aaccgactcc agcacgcgac cggggccgtc cagtaacgtc
      241 ttggtcaata tcgttctgcc gtgaaaagtg tttgaatatc atcgcgtaac agctgggcgt
      301 cggtgtaaat ccagccgtga gtcatcacag tctgctgcaa ttgctgctgc atcagcctga
      361 ccaccgattc attttgttga cgcagagcca ggctttcgcg taaacgcgtc tgtaattccg
      421 tcaaacatga agcgaactca gcgaaaaaag tattcatgcc tgccgtaaca gattcatcga
      481 cctgctctgc cagaacttta gccatttgtt ggcaataaag atcgacttct gcgcttaatg
      541 ctcgttgcaa cacactgtaa tcaaccgttt ctgtcgggga tttctcattt ccccgtcccc
      601 agtcgggctg attcaaccag cgcgaaaaag tctcacgcac aacgcctaaa cgcgtgctct
      661 gctcgtccgt tgcatcctgg cgcgaaatga ctgcactgaa cagctggcga gtgttgaagt
      721 ggggaactac gccgtgaaaa acaggaaaat gaaacccagg acgaaaccct gactcgctca
      781 attccatttt gacttgttgc tcaatggggc gaataacatc ggttaacact cggcaaaggg
      841 tggattccag ctcggcaaaa cgcagcgtaa agtcgcgact gatggtgttc tgcgccgtct
      901 gtaacagtgt ctcacagcgg gtacgcatct cgcttaacgg ctccgaatca tcctgaaaca
      961 aggcggctaa ctgcgcattc agcgcatctt gttgttgacg cagaaagtgg ttggcggagg
     1021 tcagggccag ctcgatttca tgtttaatct cgccgctcac ctgcgcctga ttgagttgca
     1081 atagctgcaa actttcttcg acctgatgga tattttgccg caattgttca caagcgacgt
     1141 ttaacccgtg cgcacgaaaa tccaggtatt cccgcgcctg ctgcgcgtaa ttcaacagtt
     1201 tatgcgcagc agatcgcaaa gcatacaacg aggcgttagc gtaagcggca tgaagcaacg
     1261 cctgaattgg ctgggcgaac agcgaatctt cccacaactg atcggcagca tgacgaatat
     1321 gttcgaggtc cgccagatcg gcatgacgcc agcgcctgcc gagcgcggca tgggcaaaat
     1381 cttccaccca gcgttgttgc tctggcgctg gtaacttacc gttgttggct aactcatggc
     1441 gcgcccgatt cgccaggtag ccccacatcg acgacaccgg aaatatctgc tgtggcgtaa
     1501 tacagccttt catcagcgtc ccggaaatca gtgcccgcac ctggtcggcg tcgtcactgt
     1561 tacgatcctg ttgatcgaac ttattgacca gcacatacag cggcaccgat tgccccaccg
     1621 ccaaaatcgc ctcacggacc tcttcatcgg agatcgattt cagttgcgta taatccagca
     1681 ccgccagtac cgccgaggcg cgtgccagct gctggttaag cattttttgc agatgcggtt
     1741 gcccggcttc atttggcccg ggggtatcca gtaacgtcaa ctgaccggga taactctcca
     1801 gccccgccag atggacaaac tccacttcaa tcacgggaat atgctcaatg gcggcgtaag
     1861 cagaaaaagg aaaatcgacg tccagcgcct tcgccagtcg cactaaatca ttcaaacttt
     1921 tcagacaatg aaaaataggc tgggcaccca gataatattt ttcgaaagcg acgccatttt
     1981 cgatccgctg cataagcgca cgcatatctt tatctatttc cagcacatcg gtcagatgct
     2041 taatatcgca atcacgcagg cgctgttgta attgttgaat taaacaatcg attggcgcga
     2101 catgtgaaaa atgcagtacc ggttcctttt gcccgggcgt atggcgaata agcgtcggca
     2161 gcgcagtcat tgggcgatta cgattaggca gaacctccgt accaacaatg gcattaatgg
     2221 tggttgattt ccctgctttc atggtaccga caattgcaag caccatttcc agtcgggaaa
     2281 ttttacgcaa ctcattattc agcatcgcgt gacgttcggc gatattaggc tgactccagg
     2341 gtaaagccag ttgtggcgcg tcgtctccgg gtacagagag aggcattttt tccagtaact
     2401 gcaactgttg gcgagaaagc tgtaacaggc gttcagcctc ctgacttaac tcatacaggg
     2461 tctgtgtgta catagaaaat tcttccttaa agcaaatttt gttattttat ttagccagat
     2521 tgtttttgag ttctgttttc ggcttttata attactgcaa gaaataattt tatatttagt
     2581 gtgttgtttt ttatcagaat aaataacgtc ttctgatacg tttaaaacgt cagaaagata
     2641 aaaatatcat gtgaattaaa aaaagaacaa gtagagcatt aacattatct taaataataa
     2701 atagaggcaa aaagattatt ttctttttgc gtttcctttc aaatgaaaac gatcgtcgtc
     2761 taaaatcagc agtacccccg acaaactcag ggattttgtg tataattgcg gcctttttcg
     2821 gcaatctgcc gttttttggc gcttttgccc tgctgacttt tgaggaaatc cacatgtcat
     2881 taccacactg cccaaaatgc aactccgaat acacttacga agataacggc atgtacatct
     2941 gcccggaatg tgcctacgaa tggaacgacg cagaacctgc acaggaaagc gacgagctga
     3001 tcgttaaaga tgctaacggc aatctgctgg ctgacggcga cagcgttacc atcattaaag
     3061 atctgaaggt gaaaggtagc tcttcgatgc tgaaaattgg caccaaagtg aaaaacatcc
     3121 gcctggttga aggcgaccat aacatcgatt gcaaaatcga cggttttggt ccgatgaaac
     3181 tgaaatctga gtttgtgaaa aagaactgat tgtattgtga tcggtaagcc ggataaggcg
     3241 ctcgcgccgc atccggcaac ggtgccagat gcctgatgcg acgcttgcgc gtcttatcag
     3301 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac
     3361 ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc
     3421 tccgtaggcc ggataaggcg tttacgccgc atccggcaac agtgccaact gcctgatgcg
     3481 acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg
     3541 tttacgccgc atccggcaat ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag
     3601 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac
     3661 agtgccgact gcctgatgcg acgctcgcgc gtcttatcag gccgcctctc atctgtataa
     3721 atttcgaact acacttaact ggcttctctt aactgaggtc accatcatgc cgttaagtcc
     3781 ctacctctct tttgccggta actgttccga cgcgattgcc tattatcaac gtacgttggg
     3841 cgcggaactg ctctataaaa tcagcttcgg cgaaatgcca aaatcagcgc aggacagcgc
     3901 cgagaactgc ccttccggaa tgcaatttcc cgataccgcc atcgctcatg ccaacgtgcg
     3961 cattgccgga agcgacatca tgatgagcga tgccatgccg tcaggaaaag ccagctactc
     4021 cggctttacg ctggtgctcg attcgcaaca ggtcgaagaa ggaaaacgct ggtttgacaa
     4081 tcttgccgct aacggaaaaa tcgaaatggc ctggcaggaa actttctggg cgcatggctt
     4141 tggcaaagtc accgataaat ttggcgtacc gtggatgatt aatgtcgtca aacaacaacc
     4201 aacgcaataa cccgccggga ggcccgccct cccgcactgt catcgaattc ccgttaactc
     4261 ttcatctgtt agtcactttt aattaaccaa atcgtcacaa taatccgcca cgatggagcc
     4321 acttttttag ggaggctgca tcatgcaaac gattatccgt gtcgagaagc tcgccaaaac
     4381 cttcaatcag catcaggcgc tgcatgcggt tgatctgaac attcatcacg gtgaaatggt
     4441 ggctctgctt gggccgtcgg gttccggcaa atccaccctt ttacgtcact taagcggttt
     4501 gattaccggc gataaatccg ccggcagcca tatcgagctg ctgggccgca cagtccagcg
     4561 cgaaggccgt ctggcgcgcg atatccgcaa aagccgcgcc aacaccggct acatcttcca
     4621 acaattcaac ctggtgaacc gcctgagcgt actggagaac gtgctgattg gcgcgctcgg
     4681 cagcacgccg ttctggcgca cctgttttag ctggtttacc cgcgagcaga aacaacgcgc
     4741 gttacaggcg ctgacccgcg ttggcatggt gcattttgcc catcaacgcg tttccaccct
     4801 ctccggcgga cagcagcagc gtgtggcgat tgcccgcgcg ctgatgcagc aggcgaaggt
     4861 gattctggcc gatgaaccca tcgcctcgct ggacccggaa tccgcccgca tcgtgatgga
     4921 caccctgcgc gacatcaatc agaacgacgg catcaccgtg gtcgtcacgc tgcatcaggt
     4981 ggattacgcc ctgcgctact gcgaacgcat cgtcgccctg cgccaggggc acgttttcta
     5041 cgacggcagc agccaacagt ttgataacga acgttttgac catctctacc gcagcattaa
     5101 tcgcatcgaa gagaacgcga aagctgcctg acatccccat cattgaggaa aacgaatgaa
     5161 cgctaagata attgcctcgc tggccttcac cagcatgttc agcctcagca ccctgttaag
     5221 cccggcacac gccgaagagc aggaaaaggc gctgaatttc ggcattattt caacggaatc
     5281 acagcaaaac ctgaaaccgc aatggacgcc attcttacag gatatggaga agaagctggg
     5341 cgtgaaggtg aacgccttct ttgccccaga ctacgcaggc attatccagg gaatgcgctt
     5401 caataaagtg gatatcgcct ggtacggcaa cctgtcggca atggaagcgg tggatcgcgc
     5461 caacggccag gtcttcgccc agacggtcgc ggcggatgga tcgccaggtt actggagcgt
     5521 gttgatcgtc aacaaagata gtccgatcaa caacctgaac gatctgctgg cgaagcggaa
     5581 agatctcacc ttcggcaatg gcgatcctaa ctccacctct ggcttcctcg tccccggtta
     5641 ctacgtcttc gccaaaaaca atatctccgc cagcgacttc aagcgcaccg tcaacgccgg
     5701 gcatgaaacc aacgcgctgg ccgtcgccaa caagcaggtg gatgtggcga ccaacaacac
     5761 cgaaaacctc gacaagctga aaacctccgc gccggagaag ctgaaagaac tgaaagtgat
     5821 ctggaaatcg ccgctgatcc caggcgatcc gatcgtctgg cgtaaaaatc tttccgaaac
     5881 caccaaagac aagatctacg acttctttat gaattacggc aaaacgccgg aagagaaagc
     5941 ggtgctggaa cgcctgggct gggcgccgtt ccgcgcctcc agcgacctgc aactggtgcc
     6001 gattcgccag ctcgcactgt ttaaagagat gcagggcgtg aaaagcaata aaggactgaa
     6061 tgagcaggac aagctggcaa aaaccaccgc gattcaggcg caactggatg acctggaccg
     6121 cctgaacaac gcgctaagcg cgatgagttc ggtgagtaaa gcggtgcagt aaatcgtagg
     6181 tcggataaga cgccccggcg tcgcatccga caatgtgcag gcgttgatgc cggatgcggt
     6241 gcaagcacct tatccggcct acagaccgga gccaaacatg caaaccatca ccatcgcccc
     6301 acccaagcgc agctggttct cgcttctgag ctgggccgtt gttctcgccg tgctggtcgt
     6361 ctcgtggcag ggcgcggaaa tggccccgct cacgctgatt aaagacggcg gcaacatggc
     6421 aaccttcgct gccgacttct tcccgcccga tttcagccag tggcaggatt acctcaccga
     6481 aatggccgtc acgctgcaaa tcgccgtctg gggcaccgcg ctggcggtgg ttctctccat
     6541 cccctttggc ctgatgagcg ccgaaaacct ggtgccgtgg tgggtttacc agcccgttcg
     6601 ccgcctgatg gacgcctgcc gcgccattaa cgaaatggtc ttcgccatgc tgttcgtggt
     6661 cgccgtcggt ctcggaccgt tcgctggcgt gctggcgcta tttatccaca ccaccggcgt
     6721 gctctccaag ctgctttccg aagcggtaga agcaattgaa cctggcccgg tggaaggcat
     6781 tcgcgccacc ggtgccaaca agctcgaaga gatcctctac ggcgtgctgc cgcaggtgat
     6841 gccgctgctg atctcctact ccctctatcg cttcgaatcc aacgtccgct cggcgaccgt
     6901 cgtcggcatg gtcggcgcgg gcgggatcgg cgtcaccctg tgggaagcga ttcgcggttt
     6961 ccagttccaa caaacctgcg ccctgatggt gcttatcatc gtcacggtca gcctgctgga
     7021 tttcctctct caacggttgc gtaagcactt tatctgataa gcgaggcatt gatatctatg
     7081 cacttgtcta cacatccgac cagctaccca acacgctatc aagagatagc cgcaaaactt
     7141 gagcaggagc ttcgtcaaca ctaccgctgc ggcgactatc ttcccgccga gcagcaactg
     7201 gcagcgcgct ttgaggtgaa tcgccacacc ctgcgccgcg ccatcgacca actggtggaa
     7261 aaaggctggg tacagcgccg tcagggcgtc ggcgtgctgg tgctgatgcg cccgttcgat
     7321 tacccgctca acgcccaggc gcgttttagc cagaatctgc tggatcaggg cagccatccc
     7381 accagcgaaa aactgctttc ggtattgcgc cccgcgtccg gccacgtcgc tgacgcactg
     7441 gggattaccg agggggagaa cgtcatccac ctgcgcaccc tgcgtcgggt caacggcgtc
     7501 gcgctctgtt taatcgacca ctacttcgcg gacctcaccc tctggccgac gctgcaacgc
     7561 ttcgacagcg gctcgctgca cgattttctg cgcgagcaaa ccggaattgc gctgcgccgc
     7621 agccagacgc ggatcagcgc ccgccgcgcc caggccaaag agtgccagcg tcttgaaatc
     7681 ccgaatatgt cgccgctgct gtgcgtgcgc acccttaacc accgtgacgg tgaaagcagc
     7741 ccggcggagt actccgtcag cctgacgcgc gccgacatga ttgaattcac tatggagcac
     7801 tgaatgcacg cagataccgc gacccgccag cactggatgt ccgtgctggc gcacagccaa
     7861 ccggctgaac tggcagcacg cctgaacgcg ctaaacatca ccgccgacta tgaggtgatc
     7921 cgcgccgctg aaactggcct ggtacagatt caggcgcgga tgggcggcac cggcgaacgt
     7981 ttttttgccg gcgacgccac gctgacccgc gccgccgtgc gcctgactga cggcacgctc
     8041 ggctacagct gggtgctggg gcgtgataaa cagcacgccg aacgctgcgc gctgattgac
     8101 gcgctgatgc agcaatctcg ccactttcaa aacttatcag aaacccttat tgccccgctg
     8161 gacgctgacc gtatggcacg cattgccgca cgccaggccg aagtgaacgc cagccgggtc
     8221 gacttcttta cgatggttcg cggagacaac gcatgaccct ggaaaccgct tttatgcttc
     8281 ccgtgcagga tgcccagcac agttttcgtc gcctgttaaa ggccatgagc gagccgggcg
     8341 tgattgtcgc cctgcatcag ctcaaacgcg gctggcaacc gctgaatatc gccaccacca
     8401 gcgtgctgct gacgctggcc gataacgaca cgccggtgtg gctttctacc ccattaaata
     8461 acgatatcgt caaccagagc ctgcgttttc ataccaacgc gccgctggtc agccagccgg
     8521 aacaggcgac cttcgcggtg acggatgagg cgatttccag cgaacagctc aacgcccttt
     8581 ccaccggcac cgccgttgcg ccggaagcgg gcgcgacgct gattttacag gtcgccagcc
     8641 tgagcggcgg gcgcatgttg cgtctcaccg gcgcgggtat tgccgaagaa cgaatgatcg
     8701 ctccgcagct gccggagtgc attctgcacg aactcaccga gcgcccgcac ccgttcccgc
     8761 tcggcatcga cctgatcctg acctgcggcg aacgcctgct ggctattccg cgaaccacgc
     8821 atgtggaggt gtgctgatgt acgttgccgt aaaagggggc gaaaaggcga tcgacgccgc
     8881 ccacgccctg caagagagcc gacgccgggg cgataccgat ttgcctgaac tgagcgtcgc
     8941 ccagattgaa cagcagctta acctcgcggt agatcgcgtg atgaccgaag gcggcattgc
     9001 cgaccgcgaa ctggcggcgc tggcgctgaa acaggccagc ggcgataacg ttgaagcgat
     9061 tttcctgctg cgcgcctacc gcaccacgtt ggcgaagctg gcggtaagcg agccgctcga
     9121 caccaccggg atgcgtctcg aacgccgtat ctccgccgtt tataaagaca ttcccggcgg
     9181 ccagctgctt ggcccaacct acgactacac ccatcgcctg ctcgatttta ccctgctggc
     9241 aaacggcgaa gcgccgacgc tgaccaccgc cgacagcgaa caacagccgt cgccgcacgt
     9301 tttcagcctg ctggcgcgtc aggggctggc gaagtttgaa gaggatagcg gcgcacagcc
     9361 ggatgacatc acccgcacgc cgccggttta cccctgctca cgttcttccc gtttgcagca
     9421 gttgatgcgc ggcgacgaag gctatttgct ggcgctggcc tactccaccc agcgtggtta
     9481 cggacgcaat cacccgttcg cgggcgagat ccgcagtggt tacatcgacg tgtcgattgt
     9541 gccggaagag ctgggatttg cggtaaacgt cggcgaacta ctgatgaccg agtgtgaaat
     9601 ggtcaacggt tttatcgacc cgccggatga gccgccgcac ttcacgcgcg gctacgggct
     9661 ggtattcggc atgagcgagc gcaaagcgat ggcaatggcg ctggtcgatc gtgcgttgca
     9721 ggctccggaa tacggcgagc acgcgacagg cccggcgcag gatgaagagt ttgtgctggc
     9781 acatgccgac aacgtcgaag ccgcaggctt tgtctcgcac ctcaaactcc cccactacgt
     9841 cgatttccag gccgaactgg agctactcaa acgtctgcaa caggagaaga accatggcta
     9901 atctgagcgg ctacaacttt gcctacctcg acgagcagac caaacgcatg atccgccgcg
     9961 ccatcttaaa agcggtggcg atccccggtt atcaggtgcc gtttggcggg cgcgagatgc
    10021 cgatgccata cggctgggga accggcggca tacagctcac cgccagcgtg attggcgaaa
    10081 gcgacgtgct aaaggtgatt gaccagggtg cggatgacac caccaacgcc gtgtcgattc
    10141 gcaacttctt taagcgcgtg accggggtaa acaccactga acgtacggac gatgcgacgc
    10201 ttatccagac gcgtcaccgc atccccgaaa cgccgctgac cgaagatcag atcattatct
    10261 tccaggtgcc aatcccggaa ccgctgcgct ttatcgagcc gcgcgaaacg gaaacccgca
    10321 ccatgcacgc gctggaagag tacggcgtga tgcaggtgaa actgtatgaa gatatcgccc
    10381 gcttcggtca tatcgccact acctacgcct atccggtgaa ggtgaacggg cgctacgtaa
    10441 tggacccgtc gccgatcccg aaattcgata acccaaaaat ggacatgatg cccgccctgc
    10501 aactgttcgg cgcggggcgc gagaagcgca tctatgcggt gccgccgttt acccgcgtgg
    10561 aaagtctcga tttcgacgat cacccgttca ccgttcagca gtgggatgag ccatgcgcca
    10621 tctgcggatc gacccacagc tatcttgatg aagtggtgct ggatgacgcc ggaaaccgca
    10681 tgtttgtctg ctccgatacc gattattgcc gccaacagag cgaggcaaaa aaccaatgaa
    10741 tcaaccgtta ctttcggtca ataacctgac ccacctttac gcgccgggca aaggctttag
    10801 cgatgtctct tttgatttat ggccggggga agtgctgggc attgtcgggg aatccggctc
    10861 cgggaagacc acgctgctga agtcgatctc cgcgcgcctg acgccgcagc agggggaaat
    10921 tcactacgag aaccgttcgc tgtatgcaat gagcgaggcc gaccgccgtc gcctgctgcg
    10981 taccgaatgg ggcgtggtgc atcagcatcc actcgacggc ctgcgccgcc aggtgtcggc
    11041 aggcggcaat atcggcgagc ggctgatggc gaccggggca cgtcattacg gcgatattcg
    11101 tgccaccgcg cagaagtggc tggaagaggt ggagattccc gccaaccgga tcgacgacct
    11161 gccgaccacc ttttccggcg gtatgcagca gcgtttgcag attgcccgca acctggtgac
    11221 gcatccgaag ctggtgttta tggatgaacc gaccggcggg ctggatgtgt cggtgcaggc
    11281 ccgcctgctc gacctgctgc gcggcctggt ggtggagctg aacctcgcgg tggtgattgt
    11341 cacccatgat ttaggcgtcg cccgcctgct ggcggaccgt ttgctggtga tgaagcaggg
    11401 gcaagtggtg gagagtgggt taaccgaccg cgtgctcgac gacccgcatc atccgtatac
    11461 acagctgctg gtgtcatcgg ttttgcagaa ttgagccggt gccggatgcg gcgtaaacgc
    11521 cttatccggc ctacaaatgc gctccccgta ggtcggataa gacgcgtcag cgtcgcatcc
    11581 gacacccgaa ccacgaggcg aaaaatgatt aacgtacaaa acgtcagtaa aaccttcatc
    11641 ctgcaccagc aaaacggcgt gcgcctgccc gtcctcaatc gcgcctcgct caccgtcaac
    11701 gcgggcgaat gcgtggtgct ccacggccat tccggcagcg gcaaatcaac tctgctacgc
    11761 tcgctgtacg ccaactatct acccgacgaa ggtcaaatcc agatcaaaca cggtgacgag
    11821 tgggtagacc tggtcaccgc gccagcgcgc aaagtggtgg aaatccgcaa aaccaccgtc
    11881 ggctgggtga gccagtttct gcgcgtcatc ccgcgtatct cagcactgga agtggtgatg
    11941 cagccgctgc tcgataccgg cgttccgcgt gaagcctgcg ccgctaaagc cgcgcgtctt
    12001 ctcacccgcc tgaacgtgcc ggaacgcctg tggcacctgg caccatcgac attttccggt
    12061 ggcgaacagc agcgcgtcaa catcgcccgc ggctttatcg tcgactaccc cattctgctg
    12121 cttgacgaac ctaccgcctc gctggacgcc aaaaacagcg ccgcggtggt ggaactgatt
    12181 cgcgaagcca aaacccgtgg cgcagccatc gtaggcatct tccatgacga agctgtacgt
    12241 aatgacgtcg ccgaccgcct gcacccaatg ggagcctctt catgattatc aataacgtta
    12301 agctggtgct ggaaaacgag gtggtaagcg gttcgctgga ggtgcagaac ggcgaaatcc
    12361 gcgcctttgc cgaaagccag agccgcctgc cggaggcgat ggacggcgaa ggcggctggc
    12421 tgctgccggg gctgattgag ctgcataccg ataatctgga taaattcttc accccgcgcc
    12481 cgaaagttga ctggcctgcc cactcggcga tgagcagcca cgacgcgctg atggtggcga
    12541 gcggcatcac caccgtactg gatgccgtgg caattggcga cgtgcgcgac ggcggcgatc
    12601 ggctggagaa tctggagaag atgatcaacg ccatcgaaga gacgcagaaa cgcggcgtca
    12661 accgcgccga gcaccgtctg catctgcgct gcgaactgcc gcatcacacc acgctgccgc
    12721 tgtttgaaaa actggtgcag cgcgagccgg tgacgctggt gtcgctgatg gaccactcgc
    12781 cgggccagcg ccagttcgcc aaccgcgaga agtatcgcga atattatcag ggcaaatact
    12841 ccctcactga tgcgcagatg cagcagtacg aagaagagca actggcgctc gccgcacgct
    12901 ggtcgcagcc gaatcgcgaa tccatcgccg ccctgtgccg cgcgcgaaaa attgcgcttg
    12961 ccagccacga tgacgccacc cacgcccacg ttgctgaatc tcaccagctt ggcagcgtga
    13021 tcgccgaatt tcccaccacg ttcgaagcgg cggaagcctc gcgcaagcat ggcatgaacg
    13081 tgctgatggg cgcgccgaat attgtgcgcg gcggctcgca ctccggcaac gtggcggcca
    13141 gtgaactggc gcagcttggc ctgctggata tcctctcttc cgactactac cccgccagcc
    13201 tgctcgatgc ggcatttcgc gtcgccgatg acgagagcaa ccgctttacg ctgccgcagg
    13261 cggtgaagct ggtgactaaa aatccagcgc aggcgcttaa tctccaggat cgcggggtga
    13321 ttggcgaggg caaacgcgcc gacctggtgc tggcgcatcg caaggacaat catattcata
    13381 tcgaccacgt ctggcgtcag ggtaaaaggg tgttctgatg atgggaaaac tgatttggtt
    13441 aatggggccg tccggctccg ggaaagacag cctgctggcg gaactccgcc tgcgggaaca
    13501 aactcagtta ctggtggcgc atcgctacat cacgcgcgat gccagcgccg gaagtgaaaa
    13561 ccatatcgcc ctgagcgagc aggagttttt tacccgcgcg gggcaaaatc tgttggcctt
    13621 aagctggcac gctaacggtc tgtattatgg cgtcggcgtc gagattgatc tctggctgca
    13681 cgccggattc gacgtgctgg tcaacggctc acgcgcccat ctgccgcagg cgcgggcgcg
    13741 ctatcaatcg gcgctgctgc ccgtctgttt acaggtttcg ccggagatcc tccgccagcg
    13801 cctggaaaac cgtggccgtg aaaacgccag tgaaattaac gcccgcctgg cgcgcgccgc
    13861 ccgctatact ccacaggatt gccatacgct caacaatgac ggcagcctgc gccagtcggt
    13921 cgacacgctg ctgacgctga tccatcagaa ggagaaacac catgcctgct tgtgagcttc
    13981 gcccggccac gcagtacgac accgacgcgg tttacgcgct gatttgtgag ctaaaacagg
    14041 cggagtttga ccaccacgcg tttcgcgtgg gttttaacgc caatctgcgc gacccaaaca
    14101 tgcgctacca tctggcgctg cttgatggcg aagttgtcgg catgatcggc ctgcatttgc
    14161 agtttcatct gcatcatgtc aactggatcg gcgaaattca ggagttggtg gtaatgccgc
    14221 aggcgcgcgg tctgaacgtc ggcagtaagt tactggcgtg ggcagaagaa gaagcccgcc
    14281 aggccggggc cgaaatgacc gaactttcga ccaacgtgaa gcgccacgac gcgcaccgtt
    14341 tctatctgcg cgaaggctac gagcagagcc acttccgctt caccaaggcg ctgtaacatg
    14401 agcctgaccc tcacgctcac cggcaccggc ggcgcacagg gcgttccggc atggggctgc
    14461 gagtgtgcgg cctgcgccag agcgcggcgc tcgccgcagt atcgccgcca accgtgcagc
    14521 ggcgtagtga agtttaacga cgcaatcacc ctgatcgacg ccgggctgca cgatctcgcc
    14581 gatcgctggt cgcccggatc gttccagcag tttttgctga cgcattatca tatggatcac
    14641 gtccaggggc tgtttccgct gcgctggggc gttggcgatc cgatcccggt ttacggcccg
    14701 ccggatgaac agggctgcga cgatctgttt aaacatccgg gcctgcttga tttcagccac
    14761 acggtggaac cgtttgtggt gtttgatttg caggggttac aggtcacgcc cctgccgctc
    14821 aaccactcaa aactgacctt cggttatctg ctggaaacgg cacacagccg ggtggcgtgg
    14881 ctgtctgaca ccgcaggctt gccggaaaaa acgctgaaat ttttacgcaa taatcagccg
    14941 caggtaatgg tgatggattg cagtcacccg ccgcgcgcgg atgcaccgcg taatcactgt
    15001 gatttaaata ccgtgcttgc gctgaatcag gttatccgct cgccacgggt gattctgacc
    15061 catatcagcc accagtttga tgcgtggctg atggaaaacg cactaccgtc agggtttgag
    15121 gtggggtttg atgggatgga gattggggtg gcgtgatgag agggaatgtg cgcgctggcc
    15181 ccctcaccct aaccctctcc ccagaggggc gaggggaccg attgtgctcg atattgaata
    15241 ttgcgctcgt tttctccctc tccccattgg ggtgaggggc gatgcctgct ccatacccaa
    15301 cctcatcgcc catactcatc ttccattctc cgctcttcat cctccagttg ccgacgctcc
    15361 tgatcaagct ggcgctggcg atcgtccagc tgcctgcggc gatcttcaaa ctggcggcgg
    15421 cggtcgtcat attgtctgcg ccgatcgtcg ctcacttcac gctgccagcc gtggtcgcgc
    15481 gaatcttcat agttgaagcg gcgcacgaaa aacgcgaaag cgtttcacga taaatgcgaa
    15541 aactttagct ttcgcgcttc aaatgaaaca gatgtattaa ttactgcttt ttattcatta
    15601 catggggatc c
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       HUMJNLTRA     765 bp ss-mRNA            PRI       22-FEB-1990
DEFINITION  Human DNA/endogenous retroviral long terminal repeat (LTR) junction
            mRNA, clone lambda-LTR8.
ACCESSION   M32219
KEYWORDS    Alu repetitive sequence; long terminal repeat.
SOURCE      Human, cDNA to mRNA, clone lambda-LTR8.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 765)
  AUTHORS   Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and
            Matsubara,K.
  TITLE     Transcription of human endogenous retroviral long terminal repeat
            (LTR) sequence in a lung cancer cell line
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1-10 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb      223      224     human DNA end; endogenous retrovirus DNA start
    rpt         224      342     Alu repeat
    LTR         343      765     long terminal repeat
BASE COUNT      252 a    158 c    148 g    207 t
ORIGIN      
        1 ggaaccccaa ccttacatgt aatacaaact taactcaaaa tggatcatat atctaaatgt
       61 aaaatggaaa gctataaaac tgaaaacaga ctatctttac aacctaggcg taggtatagt
      121 ttttagacat tacaccaaaa gcacatgccg taaaagaaaa aatagataaa ttggtggatt
      181 tcattaaaat taaaaaactt tttctctctg aaaaatcctg ttaagctggg cgctgtggtt
      241 catgcctgta atcccagcac tttgggaggc tgagttggga agaaattaat agcttgaggc
      301 caggagttca agatcatcct gggcagcaaa gtcatacact cttgagggaa gagagagacc
      361 ttctcatatt gttttatatt gttttatact cagtacctgt tttaagaaaa aaacaaggaa
      421 gtgaaatcaa agacaggcag cccggcacca ggcctgaaac cagccctggg cctgcctggc
      481 ctaaacctag tagttaaaaa tcaacttacg acttagaacc tgatgttatc cgtagattcc
      541 aagcattgta taaaaaaatt gtgaaactcc ctgttgtgtt ctgtaccagt gcatgaaacc
      601 cctgtcacat atcccctaga ttgctcaatc aatcacgacc ctttcatgtg aaatctttag
      661 tgttgtgagc ccttaaaagg gacagaaatt gtgcacttga ggagctcaga ttttaaggct
      721 gtagcttgcc gatgctccca gctgaataaa gcccttcctt ctaca
//
LOCUS       HUMJNLTRB     551 bp ss-mRNA            PRI       22-FEB-1990
DEFINITION  Human DNA/endogenous retroviral long terminal repeat (LTR) junction
            mRNA, clone lambda-LTR22.
ACCESSION   M32220
KEYWORDS    long terminal repeat.
SOURCE      Human, cDNA to mRNA, clone lambda-LTR22.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 551)
  AUTHORS   Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and
            Matsubara,K.
  TITLE     Transcription of human endogenous retroviral long terminal repeat
            (LTR) sequence in a lung cancer cell line
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1-10 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb      120      121     human DNA end; endogenous retrovirus DNA start
    LTR         121      551     long terminal repeat
BASE COUNT      146 a    146 c    129 g    130 t
ORIGIN      
        1 ggcttgagct gcagtcacag gccttggctg gaccagggat ggcccccagc tcccaggagg
       61 gcccactgac cctgcagctc cagccttctc catacttcaa caaagaatga gttgtggcaa
      121 tgagggaaga gagaccctct catagtgttt tatactcagt acctgtttta agaaaaaaca
      181 acaaggaagt aaaaccaaag acaggcaggc agcctggcgc taggcccgaa accaggcctg
      241 cgcctgcctg gcctaaaccc agtagttgaa aatcaattca taacttagaa accgatgtta
      301 ttcatagatt ccagacattg tatagaagaa catttgtgaa actccctgcc gtgttctgtt
      361 tctctctgac cgccggtgca tgcagcgcct gtcacgtacc gcctgcttgc tcaaatcaat
      421 gacgaccctt tcatgtgaaa tcttcgtgtt gtgagccctt aaaagggaca gaaattgtgc
      481 acttggggag ctcggatttt aaggcagtag cttgccgatg ctcccagctg aataaagccc
      541 ttccttctac a
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       HUMJNLTRA     765 bp ss-mRNA            PRI       22-FEB-1990
DEFINITION  Human DNA/endogenous retroviral long terminal repeat (LTR) junction
            mRNA, clone lambda-LTR8.
ACCESSION   M32219
KEYWORDS    Alu repetitive sequence; long terminal repeat.
SOURCE      Human lung cancer cell line, cDNA to mRNA, clone lambda-LTR8.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 765)
  AUTHORS   Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and
            Matsubara,K.
  TITLE     Transcription of human endogenous retroviral long terminal repeat
            (LTR) sequence in a lung cancer cell line
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1-10 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb      223      224     human DNA end; endogenous retrovirus DNA start
    rpt         224      342     Alu repeat
    LTR         343      765     long terminal repeat
BASE COUNT      252 a    158 c    148 g    207 t
ORIGIN      
        1 ggaaccccaa ccttacatgt aatacaaact taactcaaaa tggatcatat atctaaatgt
       61 aaaatggaaa gctataaaac tgaaaacaga ctatctttac aacctaggcg taggtatagt
      121 ttttagacat tacaccaaaa gcacatgccg taaaagaaaa aatagataaa ttggtggatt
      181 tcattaaaat taaaaaactt tttctctctg aaaaatcctg ttaagctggg cgctgtggtt
      241 catgcctgta atcccagcac tttgggaggc tgagttggga agaaattaat agcttgaggc
      301 caggagttca agatcatcct gggcagcaaa gtcatacact cttgagggaa gagagagacc
      361 ttctcatatt gttttatatt gttttatact cagtacctgt tttaagaaaa aaacaaggaa
      421 gtgaaatcaa agacaggcag cccggcacca ggcctgaaac cagccctggg cctgcctggc
      481 ctaaacctag tagttaaaaa tcaacttacg acttagaacc tgatgttatc cgtagattcc
      541 aagcattgta taaaaaaatt gtgaaactcc ctgttgtgtt ctgtaccagt gcatgaaacc
      601 cctgtcacat atcccctaga ttgctcaatc aatcacgacc ctttcatgtg aaatctttag
      661 tgttgtgagc ccttaaaagg gacagaaatt gtgcacttga ggagctcaga ttttaaggct
      721 gtagcttgcc gatgctccca gctgaataaa gcccttcctt ctaca
//
LOCUS       HUMJNLTRB     551 bp ss-mRNA            PRI       22-FEB-1990
DEFINITION  Human DNA/endogenous retroviral long terminal repeat (LTR) junction
            mRNA, clone lambda-LTR22.
ACCESSION   M32220
KEYWORDS    long terminal repeat.
SOURCE      Human lung cancer cell line, cDNA to mRNA, clone lambda-LTR22.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 551)
  AUTHORS   Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and
            Matsubara,K.
  TITLE     Transcription of human endogenous retroviral long terminal repeat
            (LTR) sequence in a lung cancer cell line
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1-10 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb      120      121     human DNA end; endogenous retrovirus DNA start
    LTR         121      551     long terminal repeat
BASE COUNT      146 a    146 c    129 g    130 t
ORIGIN      
        1 ggcttgagct gcagtcacag gccttggctg gaccagggat ggcccccagc tcccaggagg
       61 gcccactgac cctgcagctc cagccttctc catacttcaa caaagaatga gttgtggcaa
      121 tgagggaaga gagaccctct catagtgttt tatactcagt acctgtttta agaaaaaaca
      181 acaaggaagt aaaaccaaag acaggcaggc agcctggcgc taggcccgaa accaggcctg
      241 cgcctgcctg gcctaaaccc agtagttgaa aatcaattca taacttagaa accgatgtta
      301 ttcatagatt ccagacattg tatagaagaa catttgtgaa actccctgcc gtgttctgtt
      361 tctctctgac cgccggtgca tgcagcgcct gtcacgtacc gcctgcttgc tcaaatcaat
      421 gacgaccctt tcatgtgaaa tcttcgtgtt gtgagccctt aaaagggaca gaaattgtgc
      481 acttggggag ctcggatttt aaggcagtag cttgccgatg ctcccagctg aataaagccc
      541 ttccttctac a
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       SHPPACAP     1763 bp ss-mRNA            MAM       23-FEB-1990
DEFINITION  Ovine pituitary adenylate cyclase activating polypeptide (PACAP38)
            mRNA, complete cds.
ACCESSION   M32216
KEYWORDS    pituitary adenylate cyclase activating polypeptide.
SOURCE      Ovine hypothalamus, cDNA to mRNA, clone pOH38P7.
  ORGANISM  Ovis sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1763)
  AUTHORS   Kimura,C., Ohkubo,S., Ogi,K., Hosoya,M., Itoh,Y., Onda,H.,
            Miyata,A., Jiang,L., Dahl,R.R., Stibbs,H.H., Arimura,A. and
            Fujino,M.
  TITLE     A novel peptide which stimulates adenylate cyclase: Molecular
            cloning and characterization of the ovine and human cDNAs
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 81-89 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        577     1107     pituitary adenylate cyclase activating
                                 polypeptide precursor
    sigp        577      648     pituitary adenylate cyclase activating signal
                                 peptide
    matp        970     1083     pituitary adenylate cyclase activating peptide
BASE COUNT      411 a    490 c    418 g    444 t
ORIGIN      
        1 ctgctaactg cccagataaa taggagcaga gggctggtca cctctgtaat aaccaccggc
       61 agcagtagaa gaaaccgcag cttcagaagc agccagagag acttctgagc agcgaaggcg
      121 ctgcctgctc gagctgcctg gccgggcggc tgccccagac gccgacttcg ccgaggccct
      181 ctctctctct ctctctctct ctctctctct ctctctctct ctctgcttct ttccttatca
      241 ctcctttctt ctcagtggac ttcaggccac tttgtctccc acccccactc agctcgtcgc
      301 ctcctccgtc ttccttctcc atctctcctc tcgcccccct tctctcagtg tcacgctccg
      361 tcctagttcc gagcgtcgtc aaacttttga acagaataac aggactcagc aaacaagtcc
      421 tccagctcct cccgcggctc cggctcgttc ctgcggctcc tgctcagaca ctaacgccag
      481 acggcgatgc ctcttgggtt gtgactacag cgcacaaact tggagaagct ctttgcccgc
      541 cgtcctactt ggcagcaaat cctctcctgg cagcgaatga ccatgtgtag cggagcgagg
      601 ctggccctgc tcgtttacgg gatactgatg cacagcagcg tctacggctc acctgccgcc
      661 tccggactcc ggttcccggg gatcaggccg gagaacgagg cgtacgacga ggacggaaac
      721 ccgcagcagg acttctacga ctcggagccg ccaggcgtgg ggagccccgc ctccgcgctg
      781 cgcgatgcct acgcgctcta ctacccggcg gaggaaagag atgtcgccca cgggatcctt
      841 gataaggcct accgcaaagt gctggaccag ctgtccgcca ggagatacct gcagacgctc
      901 atggccaagg gcttgggtgg gaccccgggc ggcggcgcgg acgacgactc ggagccgctc
      961 tccaagcgcc actcggacgg catcttcact gacagctaca gccgctaccg gaagcaaatg
     1021 gctgttaaga aatacttggc ggctgtccta gggaaaaggt ataaacaaag ggttaagaac
     1081 aaaggacggc gaataccgta cttgtagcga cgagttacca gctatcctgt gtatacagcc
     1141 ctgacacaat gagaagtcgt ttttcccaac tgactgaact gtcatcgctg ctgtgttctg
     1201 tcccacatgt atttatgtat gaagtcaagc cattaaatga atattttgat aataatattg
     1261 tttttctttt tacgaagcac tggagaatgc acagatatac tttgtggacc aattattgat
     1321 attgacatat atattacgaa tatataaaga gtatatatat atatatataa gtataataga
     1381 gagccgttca tacagtgtgc acaaggactg aagattcgcc tgagctgttt gtttttatat
     1441 aaaataaata gaaaaataga caatcattgt tttgaatatt actcctattt ttgtaaactg
     1501 gaattaaaag gatagtattt ttatccacaa taggcctgaa gatattaatc ctgaccattt
     1561 gctactgtac ataaacagtg atgccctgct ccagggagac tttgaggtaa tgatttggga
     1621 ggattgctga aggtctctct ttcccaggga gtctctgggg caggctgctt caatcccagc
     1681 tgaactcgac tgaggctctg tctacccctt gctgggtggc aatgccaata cttccgcttt
     1741 ctttgattct atttttatgt gta
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       HUMZFX       3000 bp ss-mRNA            PRI       04-DEC-1989
DEFINITION  Human zinc finger protein X-linked (ZFX) mRNA, complete cds.
ACCESSION   M30608
KEYWORDS    zinc finger protein.
SOURCE      Human HeLa cell line, cDNA to mRNA, clone pCD5.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3000)
  AUTHORS   Palmer,M.S., Berta,P., Sinclair,A.H., Pym,B. and Goodfellow,P.N.
  TITLE     Comparison of human ZFY and ZFX transcripts
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 1681-1685 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.S.Palmer, 08-DEC-1989.
FEATURES       from  to/span     description
    pept        259     2676     zinc finger protein /nomgen="ZFX"
                                 /hgml_locus_uid="LS0126V" /map="Xp22.1-p21.3"
BASE COUNT      918 a    588 c    693 g    801 t
ORIGIN      
        1 gtccgtccgg ttcttgctat attgccccag gctggtctcg aactcctggg ctcaagccgt
       61 tctcccgcct cccactgccc gcctgtcact gccgtctgtt ccctgagctg tgctttacgc
      121 tgggaaagac atagaagaac tattaagaag atagaattgt tttgctgcgc agtacagcaa
      181 cagtggatgt tcaagattaa gattagagtc aagttgtgtg attaagacag gagctgtgac
      241 tgatgagaat taaaggccat ggatgaagat gggcttgaat tacaacaaga gccaaactca
      301 ttttttgatg caacaggagc tgatggtaca cacatggatg gtgatcaaat tgttgtggaa
      361 gtacaagaaa ctgtttttgt ttcagatgtt gtggattcag acataactgt gcataacttt
      421 gttcctgatg acccagattc agttgtaatc caagatgtta ttgaggacgt tgttatagaa
      481 gatgttcagt gcccagatat catggaagaa gcagatgtgt ctgaaacggt catcattcct
      541 gagcaagtgc tggactcaga tgtaactgaa gaagtttctt tagcacattg cacagtccca
      601 gatgatgttt tagcttctga cattacttca gcctcaatgt ctatgccaga acacgtcttg
      661 acgggtgatt ctatacatgt gtctgacgtt ggacatgttg gacatgttgg acatgttgaa
      721 catgtggttc atgatagtgt agtggaagca gaaattgtca ctgatcctct gactaccgac
      781 gtagtttcag aagaagtatt ggtagcagac tgtgcctctg aagcagtcat agatgccaat
      841 gggatccctg tggaccagca ggatgatgac aaaggcaact gtgaggacta ccttatgatt
      901 tccttggatg atgctggcaa aatagaacac gatggttctt ctggaatgac catggacaca
      961 gagtcggaaa ttgatccttg taaagtggat ggcacttgcc ctgaggtcat caaggtgtac
     1021 atttttaaag ctgaccctgg agaagatgac ttaggtggaa ctgtagacat tgtggagagt
     1081 gagcctgaga atgatcatgg agttgaactg cttgatcaga acagcagtat tcgtgttccc
     1141 agggaaaaga tggtttatat gactgtcaat gactctcagc cagaagatga agatttaaat
     1201 gttgctgaaa tcgctgacga agtttatatg gaagtgatcg taggagagga ggatgctgca
     1261 gcagcacggg cagccgccgc cgtgcacgag cagcaaatgg atgacaatga aatcaaaacc
     1321 ttcatgccga ttgcatgggc agcagcttat ggtaataatt ctgatggaat tgaaaaccgg
     1381 aatggcactg caagtgccct cttgcacata gatgagtctg ctggcctcgg cagactggct
     1441 aaacaaaaac caaagaaaag gagaagacct gattccaggc agtaccaaac agcaataatt
     1501 attggccctg atggacatcc tttgactgtc tatccttgca tgatttgtgg gaagaagttt
     1561 aagtcgagag gttttttgaa aaggcacatg aaaaaccatc ccgaacacct tgccaagaag
     1621 aaataccgct gtactgactg tgattacact accaacaaga agataagttt acacaaccac
     1681 ctggagagcc acaagctgac cagcaaggca gagaaggcca ttgaatgcga tgagtgtggg
     1741 aagcatttct ctcatgcagg ggctttgttt actcacaaaa tggtgcataa ggaaaaagga
     1801 gccaacaaaa tgcacaagtg taaattctgt gaatacgaga cagctgaaca agggttattg
     1861 aatcgccacc tcttggcagt ccacagcaag aactttcctc atatttgtgt ggagtgtggt
     1921 aagggttttc gtcacccgtc agagctcaaa aagcacatga gaatccatac tggggagaag
     1981 ccgtaccaat gccagtactg cgaatatagg tctgcagact cttctaactt gaaaacgcat
     2041 gtcaaaacta agcatagtaa agagatgcca ttcaagtgtg acatttgtct tctgactttc
     2101 tcggatacca aagaggtgca gcaacatgct cttatccacc aagaaagcaa aacacaccag
     2161 tgtttgcatt gcgaccacaa gagttcgaac tcaagtgatt tgaaacgaca cataatttca
     2221 gttcacacga aagactaccc ccataagtgt gacatgtgtg ataaaggctt tcacaggcct
     2281 tcagaactca agaaacacgt ggctgcccac aagggcaaaa aaatgcacca gtgtagacat
     2341 tgtgacttta agattgcaga tccatttgtt ctaagtcgcc atattctctc agttcacaca
     2401 aaggatcttc catttaggtg caagagatgt agaaagggat ttaggcaaca gagtgagctt
     2461 aaaaagcata tgaagacaca cagtggcagg aaagtgtatc agtgtgagta ctgtgagtat
     2521 agcactacag atgcctcagg ctttaaacgg cacgttattt ccattcacac gaaagactat
     2581 cctcaccggt gtgagtactg caagaaaggc ttccgaagac cttcagaaaa gaaccagcac
     2641 ataatgcgac atcataaaga agttggcctg ccctaacaat acttctacag aacgtttgta
     2701 gagatattgg ccttgaagca gaaaattcat tttaaagcca atcagtctca ttcacataca
     2761 atactgtata ttgatttatg ctgtgtacaa atagaattat tacttctagt tgactttttt
     2821 ttaaatatac attttgctca gtagtgtgtt ctgaattcta ttcagtttgt ttaataaata
     2881 gggaaaactg gcaacatgct agttactttt aataaagtaa tccctgattc tataccgaag
     2941 ttttatatct tagaatttta tatttattta aatatttacc ttgcttacct tgatggtact
//
LOCUS       HUMZFY       2659 bp ss-mRNA            PRI       04-DEC-1989
DEFINITION  Human zinc finger protein Y-linked (ZFY) mRNA, complete cds.
ACCESSION   M30607
KEYWORDS    zinc finger protein.
SOURCE      Human foreskin fibroblast, cDNA to mRNA, (pCD2Bassing library of
            H.Okayama), clones pMF-1 and p5.8.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2659)
  AUTHORS   Palmer,M.S., Berta,P., Sinclair,A.H., Pym,B. and Goodfellow,P.N.
  TITLE     Comparison of human ZFY and ZFX transcripts
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 1681-1685 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.S.Palmer, 08-DEC-1989.
FEATURES       from  to/span     description
    pept          1     2406     zinc finger protein /nomgen="ZFY"
                                 /hgml_locus_uid="LU0028V" /map="Yp11.3"
    mRNA    <   259     2630     ZFY mRNA (pot.)
BASE COUNT      861 a    488 c    597 g    713 t
ORIGIN      
        1 atggatgaag atgaatttga attgcagcca caagagccaa actcattttt tgatggaata
       61 ggagctgatg ctacacacat ggatggtgat cagattgttg tggaaataca agaagcagtt
      121 tttgtttcta atattgtgga ttctgacata actgtgcata actttgttcc tgatgaccca
      181 gactcagttg taatccaaga tgttgttgaa gatgttgtca tagaggagga tgttcagtgc
      241 tcagatatct tagaagaggc agatgtatct gaaaatgtca tcattcctga gcaagtgctg
      301 gactcagatg taactgaaga agtttcttta ccacactgca cagtcccaga tgatgtttta
      361 gcttctgaca ttacttcaac ctcaatgtct atgccagaac atgttttaac gagtgaatcc
      421 atgcatgtgt gtgacattgg acatgttgaa catatggtgc atgatagtgt agtggaagca
      481 gaaatcatta ctgatcctct gacgagtgac atagtttcag aagaagtatt ggtagcagac
      541 tgtgcccctg aagcagtcat agatgccagc gggatctcag tggaccagca agataatgac
      601 aaagccagct gtgaggacta cctaatgatt tcgttggatg atgctggcaa aatagaacat
      661 gatggttcca ctggagtgac catcgatgca gaatcagaaa tggatccttg taaagtggat
      721 agcacttgtc ctgaagtcat caaggtgtac atttttaaag ctgaccctgg agaagatgac
      781 ttaggtggaa ctgtagacat tgtggagagt gaacctgaaa atgatcatgg agttgaacta
      841 cttgatcaga acagcagtat tcgtgttccc agggaaaaga tggtttatat gactgtcaat
      901 gactctcaac aagaagatga agatttaaat gttgctgaaa ttgctgatga agtttatatg
      961 gaagtgatcg taggagagga ggatgctgct gttgcagcag cagcagctgc tgtgcatgag
     1021 cagcaaattg atgaggatga aatgaaaacc ttcgtaccaa ttgcatgggc agcagcttat
     1081 ggtaataatt ctgatggaat tgaaaaccgg aatggcactg caagtgccct cttgcacata
     1141 gatgagtctg ctggccttgg cagactggct aaacagaaac caaagaaaaa gagaagacct
     1201 gattccaggc agtaccaaac agcaataatt attggccctg atggtcatcc tttgactgtc
     1261 tatccttgca tgatttgtgg gaagaagttt aagtcgaggg gttttttgaa aagacacatg
     1321 aaaaaccatc ctgaacacct tgccaagaag aagtaccact gtactgactg tgattacact
     1381 accaataaga agataagttt acataaccac ctggagagcc acaagctgac cagcaaggca
     1441 gagaaggcca ttgaatgtga tgagtgtggg aagcattttt ctcatgcagg ggctttgttt
     1501 actcacaaaa tggtgcataa ggaaaaaggg gccaacaaaa tgcacaagtg taaattctgt
     1561 gaatatgaga cagctgaaca ggggttattg aatcgccacc tcttggcagt ccacagcaag
     1621 aactttcctc atatttgtgt ggagtgtggt aaaggtttcc gatacccgtc ggaactgaga
     1681 aagcacatgc gaatccatac cggcgagaag ccataccaat gccagtactg tgaatatagg
     1741 tctgcagact cttctaactt gaaaacacat ataaaaacaa agcatagtaa agagatgcca
     1801 ttcaagtgtg acatttgtct tctgactttc tcagatacca aagaagtgca gcaacatact
     1861 cttgtccacc aagaaagcaa aacacatcag tgtttgcatt gcgaccacaa gagttcaaac
     1921 tcaagtgatt tgaaacgaca tgtaatttca gttcatacga aagactatcc tcataagtgt
     1981 gagatgtgcg agaaaggctt tcacaggcct tcagaactta agaaacatgt ggctgtccac
     2041 aaaggtaaaa aaatgcacca atgtagacat tgtgacttta agattgcaga cccatttgtt
     2101 ctaagtcgcc atattctctc agttcacaca aaggatcttc catttaggtg taagagatgt
     2161 agaaagggat ttaggcaaca aaatgagctt aaaaagcata tgaagacaca cagtggcagg
     2221 aaagtatatc agtgtgagta ctgtgagtat agcactacag atgcctcagg ctttaaacgg
     2281 cacgttattt ccattcatac aaaagactat cctcatcggt gtgagtactg caagaaaggc
     2341 ttccgaagac cttcagaaaa gaaccagcac ataatgagac accataaaga agttggtctg
     2401 ccctaacagt gtgtctacaa gcttgtaaag atgttggcct tgaagcagaa aattcatttt
     2461 ttaaagccag tcttgttcac atccattact atacattgaa ttatgctgtg taaaaataga
     2521 attattgctt ctagtccact tttctttaca ttttattcaa tacgctgtcc tgaatcctat
     2581 tcagtttctt taatagatga gtaaaaatag caacaagcaa gttgcttata ataaaataat
     2641 ttgtgattct cccgaattc
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       HUMANTCD36   1870 bp ss-mRNA            PRI       15-DEC-1989
DEFINITION  Human CD36 antigen mRNA, complete cds.
ACCESSION   M24795
KEYWORDS    cell surface antigen; cell surface receptor; erythrocyte antigen;
            monocyte antigen; platelet antigen.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1870)
  AUTHORS   Oquendo,P., Hundt,E., Lawler,J. and Seed,B.
  TITLE     CD36 directly mediates cytoadherence of Plasmodium falciparum
            parasitized erythrocytes
  JOURNAL   Cell 58, 95-101 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Seed, 12-MAY-1989.
FEATURES       from  to/span     description
    pept        211     1629     CD36 antigen
BASE COUNT      599 a    343 c    356 g    572 t
ORIGIN      Unreported.
        1 gaaaaatcct tcttagccat tttaaagata gctttccaat gattagacga attgattctt
       61 tctgtgactc atcagttcct ttcctgtaaa attcatgtct tgctgttgat ttgtgaataa
      121 gaaccagagc ttgtagaaac cactttaatc atatccagga gtttgcaaga aacaggtgct
      181 taacactaat tcacctcctg aacaagaaaa atgggctgtg accggaactg tgggctcatc
      241 gctggggctg tcattggtgc tgtcctggct gtgtttggag gtattctaat gccagttgga
      301 gacctgctta tccagaagac aattaaaaag caagttgtcc tcgaagaagg tacaattgct
      361 tttaaaaatt gggttaaaac aggcacagaa gtttacagac agttttggat ctttgatgtg
      421 caaaatccac aggaagtgat gatgaacagc agcaacattc aagttaagca aagaggtcct
      481 tatacgtaca gagttcgttt tctagccaag gaaaatgtaa cccaggacgc tgaggacaac
      541 acagtctctt tcctgcagcc caatggtgcc atcttcgaac cttcactatc agttggaaca
      601 gaggctgaca acttcacagt tctcaatctg gctgtggcag ctgcatccca tatctatcaa
      661 aatcaatttg ttcaaatgat cctcaattca cttattaaca agtcaaaatc ttctatgttc
      721 caagtcagaa ctttgagaga actgttatgg ggctataggg atccattttt gagtttggtt
      781 ccgtaccctg ttactaccac agttggtctg ttttatcctt acaacaatac tgcagatgga
      841 gtttataaag ttttcaatgg aaaagataac ataagtaaag ttgccataat cgacacatat
      901 aaaggtaaaa ggaatctgtc ctattgggaa agtcactgcg acatgattaa tggtacagat
      961 gcagcctcat ttccaccttt tgttgagaaa agccaggtat tgcagttctt ttcttctgat
     1021 atttgcaggt caatctatgc tgtatttgaa tccgacgtta atctgaaagg aatccctgtg
     1081 tatagatttg ttcttccatc caaggccttt gcctctccag ttgaaaaccc agacaactat
     1141 tgtttctgca cagaaaaaat tatctcaaaa aattgtacat catatggtgt gctagacatc
     1201 agcaaatgca aagaagggag acctgtgtac atttcacttc ctcattttct gtatgcaagt
     1261 cctgatgttt cagaacctat tgatggatta aacccaaatg aagaagaaca taggacatac
     1321 ttggatattg aacctataac tggattcact ttacaatttg caaaacggct gcaggtcaac
     1381 ctattggtca agccatcaga aaaaattcaa gtattaaaga atctgaagag gaactatatt
     1441 gtgcctattc tttggcttaa tgagactggg accattggtg atgagaaggc aaacatgttc
     1501 agaagtcaag taactggaaa aataaacctc cttggcctga tagaaatgat cttactcagt
     1561 gttggtgtgg tgatgtttgt tgcttttatg atttcatatt gtgcatgcag atcgaaaaca
     1621 ataaaataag tatgtaccaa aaaatattgc ttcaataata ttagcttata tattacttgt
     1681 tttcacttta tcaaagagaa gttacatatt aggccatata tatttctaga catgtctagc
     1741 cactgatcat ttttaaatat aggtaaataa acctataaat attatcacgc agatcactaa
     1801 agtatatctt taattctggg agaaatgaga taaaagatgt acttgtgacc attgtaacaa
     1861 tagcacaaat
//
LOCUS       HUMARB3A     1270 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human beta-3-adrenergic receptor gene.
ACCESSION   M29932
KEYWORDS    adrenergic receptor; beta-3-adrenergic receptor.
SOURCE      Human placenta DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1270)
  AUTHORS   Emorine,L.J., Marullo,S., Briend-Sutren,M.-M., Patey,G., Tate,K.,
            Delavier-Klutchko,C. and Strosberg,A.D.
  TITLE     Molecular characterization of the human beta-3-adrenergic receptor
  JOURNAL   Science 245, 1118-1121 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.J.Emorine 17-NOV-1989.
FEATURES       from  to/span     description
    pept         38     1246     beta-3-adrenergic receptor
BASE COUNT      132 a    484 c    405 g    249 t
ORIGIN      33 bp upstream of SmaI site.
        1 ttccttcttt ccctaccgcc ccacgcgcga cccggggatg gctccgtggc ctcacgagaa
       61 cagctctctt gccccatggc cggacctccc caccctggcg cccaataccg ccaacaccag
      121 tgggctgcca ggggttccgt gggaggcggc cctagccggg gccctgctgg cgctggcggt
      181 gctggccacc gtgggaggca acctgctggt catcgtggcc atcgcctgga ctccgagact
      241 ccagaccatg accaacgtgt tcgtgacttc gctggccgca gccgacctgg tgatgggact
      301 cctggtggtg ccgccggcgg ccaccttggc gctgactggc cactggccgt tgggcgccac
      361 tggctgcgag ctgtggacct cggtggacgt gctgtgtgtg accgccagca tcgaaaccct
      421 gtgcgccctg gccgtggacc gctacctggc tgtgaccaac ccgctgcgtt acggcgcact
      481 ggtcaccaag cgctgcgccc ggacagctgt ggtcctggtg tgggtcgtgt cggccgcggt
      541 gtcgtttgcg cccatcatga gccagtggtg gcgcgtaggg gccgacgccg aggcgcagcg
      601 ctgccactcc aacccgcgct gctgtgcctt cgcctccaac atgccctacg tgctgctgtc
      661 ctcctccgtc tccttctacc ttcctcttct cgtgatgctc ttcgtctacg cgcgggtttt
      721 cgtggtggct acgcgccagc tgcgcttgct gcgcggggag ctgggccgct ttccgcccga
      781 ggagtctccg ccggcgccgt cgcgctctct ggccccggcc ccggtgggga cgtgcgctcc
      841 gcccgaaggg gtgcccgcct gcggccggcg gcccgcgcgc ctcctgcctc tccgggaaca
      901 ccgggccctg tgcaccttgg gtctcatcat gggcaccttc actctctgct ggttgccctt
      961 ctttctggcc aacgtgctgc gcgccctggg gggcccctct ctagtcccgg gcccggcttt
     1021 ccttgccctg aactggctag gttatgccaa ttctgccttc aacccgctca tctactgccg
     1081 cagcccggac tttcgcagcg ccttccgccg tcttctgtgc cgctgcggcc gtcgcctgcc
     1141 tccggagccc tgcgccgccg cccgcccggc cctcttcccc tcgggcgttc ctgcggcccg
     1201 gagcagccca gcgcagccca ggctttgcca acggctcgac gggtaggtaa ccggggcaga
     1261 gggaccggcg
//
LOCUS       HUMCY4ARO    2736 bp ss-mRNA            PRI       15-SEP-1989
DEFINITION  Human aromatase system cytochrome P-450 (P450XIX) mRNA, complete
            cds.
ACCESSION   J04127
KEYWORDS    aromatase cytochrome P-450; microsomal protein.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2736)
  AUTHORS   Corbin,C.J., Graham-Lorence,S., McPhaul,M., Mason,J.I.,
            Mendelson,C.R. and Simpson,E.R.
  TITLE     Isolation of a full-length cDNA insert encoding human aromatase
            system cytochrome P-450 and its expression in nonsteroidogenic
            cells
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 8948-8952 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by E.R.Simpson, 03-MAR-1989.
FEATURES       from  to/span     description
    pept         28     1539     cytochrome P-450-Arom /nomgen="CYP19"
                                 /map="15q21" /hgml_locus_uid="LL0035S"
    mRNA    <     1     2385     CYP19 mRNA (alt.)
    mRNA    <     1     2736     CYP19 mRNA (alt.)
BASE COUNT      801 a    599 c    591 g    745 t
ORIGIN      Chromosome 15q21.
        1 gccccctctg aggtcaagga acacaagatg gttttggaaa tgctgaaccc gatacattat
       61 aacatcacca gcatcgtgcc tgaagccatg cctgctgcca ccatgccagt cctgctcctc
      121 actggccttt ttctcttggt gtggaattat gagggcacat cctcaatacc aggtcctggc
      181 tactgcatgg gaattggacc cctcatctcc cacggcagat tcctgtggat ggggatcggc
      241 agtgcctgca actactacaa ccgggtgtat ggagaattca tgcgagtctg gatctctgga
      301 gaggaaacac tcattatcag caagtcctca agtatgttcc acataatgaa gcacaatcat
      361 tacagctctc gattcggcag caaacttggg ctgcagtgca tcggtatgca tgagaaaggc
      421 atcatattta acaacaatcc agagctctgg aaaacaactc gacccttctt tatgaaagct
      481 ctgtcaggcc ccggccttgt tcgtatggtc acagtctgtg ctgaatccct caaaacacat
      541 ctggacaggt tggaggaggt gaccaatgaa tcgggctatg tggacgtgtt gacccttctg
      601 cgtcgtgtca tgctggacac ctctaacacg ctcttcttga ggatcccttt ggacgaaagt
      661 gctatcgtgg ttaaaatcca aggttatttt gatgcatggc aagctctcct catcaaacca
      721 gacatcttct ttaagatttc ttggctatac aaaaagtatg agaagtctgt caaggatttg
      781 aaagatgcca tagaagttct gatagcagaa aaaagatgca ggatttccac agaagagaaa
      841 ctggaagaat gtatggactt tgccactgag ttgattttag cagagaaacg tggtgacctg
      901 acaagagaga atgtgaacca gtgcatattg gaaatgctga tcgcagctcc tgacaccatg
      961 tctgtctctt tgttcttcat gctatttctc attgcaaagc accctaatgt tgaagaggca
     1021 ataataaagg aaatccagac tgttattggt gagagagaca taaagattga tgatatacaa
     1081 aaattaaaag tgatggaaaa cttcatttat gagagcatgc ggtaccagcc tgtcgtggac
     1141 ttggtcatgc gcaaagcctt agaagatgat gtaatcgatg gctacccagt gaaaaagggg
     1201 acaaacatta tcctgaatat tggaaggatg cacagactcg agtttttccc caaacccaat
     1261 gaatttactc ttgaaaattt tgcaaagaat gttccttata ggtactttca gccatttggc
     1321 tttgggcccc gtggctgtgc aggaaagtac atcgccatgg tgatgatgaa agccatcctc
     1381 gttacacttc tgagacgatt ccacgtgaag acattgcaag gacagtgtgt tgagagcata
     1441 cagaagatac acgacttgtc cttgcaccca gatgagacta aaaacatgct ggaaatgatc
     1501 tttaccccaa gaaactcaga caggtgtctg gaacactaga gaaggctggt cagtacccac
     1561 tctggagcat tttctcatca gtagttcaca tacaaatcat ccatccttgc caatagtgtc
     1621 atcctcacag tgaacactca gtggcccatg gaattttata ggcatacctc ctatggttgt
     1681 caccaagcta ggtgctattg gtcatctgct cctgttcaca ccagagaacc aggctacaag
     1741 agaaaaagca gaggccaaga gtttgagggg agaaatagtc ggtgaagaaa ccgtatccat
     1801 aaagacccga ttccaccaaa tgtgctttga gaaggatagg ccttcattaa caaaatgtat
     1861 gtctggttcc ccagtagagc tctactgcct caaccaaggg gatttttatg tctggggcag
     1921 aaacactcaa gttgattaga aagaccaggc caatgtcagg gtacctgggg ccaaacccac
     1981 ctgctagtgt gaattaaagt actttaattt tgttttctgt ggaggtggaa aagcaacatt
     2041 catagtcttt ggagaaatgc ttagaaattc agcatttgac ccttgctgtg aattaagccc
     2101 aattaattcc tgtttgtcta catatgatct gtctgtggca aaagtttaat cagaggaaat
     2161 tctttcccag tctgtcgatt tatgcctcag ccacttgcct gtgctacaat tcattgtgtt
     2221 acctgtagat tcaggtaata caaaccatat ataatcatca agtaatacaa actaatttag
     2281 taatagcctg ggttaagtat tattagggcc ctgtgtctgc atgtagaaaa aaaaattcac
     2341 atgatgcact tcaaattcaa ataaaaatcc ttttggcatg ttcccatttt tgcttagctc
     2401 aattagtgtg gctaaccaag agataactgt aaatgtgaca ttgatttgct cttactacag
     2461 ctacagtgat tgggggagga aaagtcccaa cccaatgggc tcaaacttct aaggggtact
     2521 cctctcatcc ccttatcctt ctccctcgac attttctccc tctttcttcc catgacccca
     2581 aagccaaggg caacagatca gtaaagaacg tggtcagagt agaacccctg aagtattttt
     2641 taatcctacc tcaaaattta acagttacct gagagattta acattatcta gttcattgaa
     2701 tcattgtatg tggtcatgga taaattgcac acttgg
//
LOCUS       HUMFCRII     1403 bp ss-mRNA            PRI       30-SEP-1988
DEFINITION  Human FcRII mRNA for immunoglobulin G receptor.
ACCESSION   Y00644
KEYWORDS    IgG receptor; cell surface glycoprotein; immunoglobulin receptor.
SOURCE      human (Homo sapiens).
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1403)
  AUTHORS   Moore,K.W. and Martens,C.L.
  JOURNAL   Unpublished (1987) DNAX, Palo Alto, CA, 94304 USA.
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 1403)
  AUTHORS   Stuart,S.G.
  TITLE     Isolation and Expression of cDNA clones encoding a
  JOURNAL   J. Exp. Med. 166, 1668-1684 (1987)
  STANDARD  simple automatic
COMMENT     *source: cell line=K937; library=pcD vector; clone=16.2;
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE        308    316       N-glycosylation site
               SITE        551    559       N-glycosylation site
               SITE        656    742       transmembrane domain
FEATURES       from  to/span     description
    pept         20      973     precursor polypeptide (AA -34 to 287)
    sigp         20      121     signal peptide (AA -34 to -1)
    matp        122      970     mature IgG receptor (AA 1-287)
BASE COUNT      410 a    372 c    305 g    316 t
ORIGIN      
        1 ggggggggac agtgctggga tgactatgga gacccaaatg tctcagaatg tatgtcccag
       61 aaacctgtgg ctgcttcaac cattgacagt tttgctgctg ctggcttctg cagacagtca
      121 agctgcagct cccccaaagg ctgtgctgaa acttgagccc ccgtggatca acgtgctcca
      181 ggaggactct gtgactctga catgccaggg ggctcgcagc cctgagagcg actccattca
      241 gtggttccac aatgggaatc tcattcccac ccacacgcag cccagctaca ggttcaaggc
      301 caacaacaat gacagcgggg agtacacgtg ccagactggc cagaccagcc tcagcgaccc
      361 tgtgcatctg actgtgcttt ccgaatggct ggtgctccag acccctcacc tggagttcca
      421 ggagggagaa accatcatgc tgaggtgcca cagctggaag gacaagcctc tggtcaaggt
      481 cacattcttc cagaatggaa aatcccagaa attctcccgt ttggatccca ccttctccat
      541 cccacaagca aaccacagtc acagtggtga ttaccactgc acaggaaaca taggctacac
      601 gctgttctca tccaagcctg tgaccatcac tgtccaagtg cccagcatgg gcagctcttc
      661 accaatgggg atcattgtgg ctgtggtcat tgcgactgct gtagcagcca ttgttgctgc
      721 tgtagtggcc ttgatctact gcaggaaaaa gcggatttca gccaattcca ctgatcctgt
      781 gaaggctgcc caatttgagc cacctggacg tcaaatgatt gccatcagaa agagacaact
      841 tgaagaaacc aacaatgact atgaaacagc tgacggcggc tacatgactc tgaaccccag
      901 ggcacctact gacgatgata aaaacatcta cctgactctt cctcccaacg accatgtcaa
      961 cagtaataac taaagagtaa cgttatgcca tgtggtcata ctctcagctt gcgtatggat
     1021 gcaaaaaaga ggggaattgt taaaggaaaa tttaaatgga gactggaaaa atcctgagca
     1081 aacaaaacca cctggccctt agaaatagct ttaactttgc ttaaactaca aacacaagca
     1141 aaacttcacg gggtcatact acatacaagc ataagcaaaa cttaacttgg atcatttctg
     1201 gtaaatgctt atgttagaaa taagacaacc ccagccaatc acaagcagcc tactaacata
     1261 taattaggtg actagggact ttctaagaag atacctaccc ccaaaaaaca acttatgtaa
     1321 ttgaaaacca accgattgcc tttattttgc ttccacattt tcccaataaa tacttgcctg
     1381 tgacattttg ccactggaac act
//
LOCUS       HUMGNAT1     1292 bp ss-mRNA            PRI       15-MAR-1990
DEFINITION  Human GNAT1 mRNA for transducin alpha-chain.
ACCESSION   X15088
KEYWORDS    guanine nucleotide-binding protein; transducin;
            transducin alpha-chain.
SOURCE      human (Homo sapiens).
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1292; enum. 1 to 1292)
  AUTHORS   Van Dop,C.
  TITLE     ;
  JOURNAL   Unpublished (1989) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 1292)
  AUTHORS   Van Dop,C., Medynski,D.C. and Apone,L.M.
  TITLE     Nucleotide sequence for a cDNA encoding the alpha subunit of
            retinal transducin (GNAT1) isolated from the human eye
  JOURNAL   Nucleic Acids Res. 17, 4887-4887 (1989)
  STANDARD  simple automatic
COMMENT     *source: tissue=eye; clone=UO50;
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE       1253   1258       pot. polyA site
            
            [1]  Author address Van Dop C., Howard Hughes Medical Institute,
            Children's Hospital   Enders 1107, 300 Longwood Avenue, Boston MA
            02115, U S A.
            
            Submitted (21-APR-1989) on tape to the EMBL data library
FEATURES       from  to/span     description
    pept    <     1     1156     transducin alpha-chain (AA at 2)
                                 /hgml_locus_uid="LE0116J" /map="3"
                                 /nomgen="GNAT1"
BASE COUNT      311 a    387 c    362 g    232 t
ORIGIN      
        1 aggtcctcct gggccagaag ggttcctggg agccaggttc tgggatcccc tccatccaga
       61 agaaccacct gctcactctg tcccttcgcc tgctgctggg accatggggg ctggggccag
      121 tgctgaggag aagcactcca gggagctgga aaagaagctg aaagaggacg ctgagaagga
      181 tgctcgaacc gtgaagctgc tgcttctggg tgccggtgag tccgggaaga gcaccatcgt
      241 caagcagatg aagattatcc accaggacgg gtactcgctg gaagagtgcc tcgagtttat
      301 cgccatcatc tacggcaaca cgttgcagtc catcctggcc atcgtacgcg ccatgaccac
      361 actcaacatc cagtacggag actctgcacg ccaggacgac gcccggaagc tgatgcacat
      421 ggcagacact atcgaggagg gcacgatgcc caaggagatg tcggacatca tccagcggct
      481 gtggaaggac tccggtatcc aggcctgttt tgagcgcgcc tcggagtacc agctcaacga
      541 ctcggcgggc tactacctct ccgacctgga gcgcctggta accccgggct acgtgcccac
      601 cgagcaggac gtgctgcgct cgcgagtcaa gaccactggc atcatcgaga cgcagttctc
      661 cttcaaggat ctcaacttcc ggatgttcga tgtgggcggg cagcgctcgg agccgaagaa
      721 gtggatccac tgcttcgagg gcgtgacctg catcatcttc atcgcggcgc tgaccgcgta
      781 cgacatggtg ctagtggagg acgacgaagt gaaccgcatg cacgagagcc tgcacctgtt
      841 caacagcatc tgcaaccacc gctacttcgc cacgacgtcc atcgtgctct tccttaacaa
      901 gaaggacgtc ttcttcgaga aggtcaagaa ggcgcacctc agcatctgtt tcccggacta
      961 cgatggaccc aacacctacg aggacgccgg caactacatc aaggtgcagt tcctcgagct
     1021 caacatgcgg cgcgacgtga aggagatcta ttcccacatg acgtgcgcca ccgacacgca
     1081 gaacgtcaaa ttctgcttcg acgctgtcac cgacatcatc atcaaggaga acctcaaaga
     1141 ctgtggcctc ttctgagcca gggcctgtgc tgcagtcggg gacaaggagc ttccgtctgg
     1201 caaggccggg gcacaatttg cactcccctc agctagacgc agcagactca gcaataaacc
     1261 tttgcatcag gcaaaaaaaa aaaaacaaaa aa
//
LOCUS       HUMGSHPXG    1733 bp ds-DNA             PRI       15-MAR-1989
DEFINITION  Human glutathione peroxidase gene, complete cds.
ACCESSION   Y00483
KEYWORDS    glutathione peroxidase.
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1733)
  AUTHORS   Ishida,K.
  JOURNAL   Unpublished (1987) Nippon Kayaku Co.,Ltd, Kita-ku, Tokyo 115, Japan
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 1733)
  AUTHORS   Ishida,K., Morino,T., Takagi,K. and Sukenaga,Y.
  TITLE     Nucleotide sequence of a human gene for glutathione peroxidase
  JOURNAL   Nucleic Acids Res. 15, 10051-10051 (1987)
  STANDARD  simple automatic
COMMENT     An in-phase "tga" codon is located at positions 612-614 and encodes
            selenocysteine.
FEATURES       from  to/span     description
    pept        474      611     glutathione peroxidase, exon 1 (EC 1.11.1.9).
                                 /nomgen="GPX1" /map="3q11-q12"
                                 /hgml-locus_uid="LE0062E"
                615      719     glutathione peroxidase, exon 1
                998     1357     glutathione peroxidase, exon 2
    pre-msg     155     1566     GTP mRNA and intron
    IVS         720      997     GTP intron
BASE COUNT      277 a    529 c    526 g    401 t
ORIGIN      
        1 aacctagatc cctctgctgt cccctgcact gccggtaaca tggcacagca gagcagggtt
       61 gtttgtgcac gggcagctcc tgcagctgct gccgtcgccc accagcctcc tatgccaaac
      121 cccacatcct aactcaggaa cctctgagaa aaaacggagc cctcgagggg cccagccttg
      181 gaagggtaac tggaccgctg ccgcctggtt gcctgggcca gaccagacat gcctgctgct
      241 ccttccggct taggaggagc acgcgtcccg ctcgcgcgca ctctccagcc ttttcctggc
      301 tgaggagggg ccgagcctcc ggtagggcgg gggccggatg aggcgggacc tcaggcccgg
      361 aaaactgcct gtgccacgtg acccgccgcc ggccagttaa aaggaggcgc ctgctggcct
      421 ccccttacag tgcttgttcg gggcgctccg ctggcttctt ggacaattgc gccatgtgtg
      481 ctgctcggct agcggcggcg gcggcccagt cggtgtatgc cttctcggcg cgcccgttgg
      541 ccggcgggga gcctgtgagc ctgggctccc tgcggggcaa ggtactactt atcgagaatg
      601 tggcgtccct ctgaggcacc acggtccggg actacaccca gatgaacgag ctgcagcggc
      661 gcctcggacc ccggggcctg gtggtgctcg gcttcccgtg caaccagttt gggcatcagg
      721 tgcgccgggc ggagcgggac gggacggggg cggacgtgca gtagtggctg ggggcgccgg
      781 cggtgtggtg gtgggtgcgt cggctccatg cgcggagagt ctggctactc tctcgtttcc
      841 tttctgttgc tcgtagctgc tgaaattcct ctccgccctt gggattgcgc atggagggaa
      901 aaatcccggt gactcataga aaatctcccc tgtttgtggt tagaacgttt ctctcctcct
      961 cttgaccccg ggttctagct gcccttctct cctgtaggag aacgccaaga acgaagagat
     1021 tctgaattcc ctcaagtacg tccggcctgg tggtgggttc gagcccaact tcatgctctt
     1081 cgagaagtgc gaggtgaacg gtgcgggggc gcaccctctc ttcgccttcc tgcgggaggc
     1141 cctgccagct cccagcgacg acgccaccgc gcttatgacc gaccccaagc tcatcacctg
     1201 gtctccggtg tgtcgcaacg atgttgcctg gaactttgag aagttcctgg tgggccctga
     1261 cggtgtgccc ctacgcaggt acagccgccg cttccagacc attgacatcg agcctgacat
     1321 cgaagccctg ctgtctcaag ggcccagctg tgcctagggc gcccctccta ccccggctgc
     1381 ttggcagttg cagtgctgct gtctcggggg ggttttcatc tatgagggtg tttcctctaa
     1441 acctacgagg gaggaacacc ttgatcttac agaaaatacc acctcgagat gggtgctggt
     1501 cctgttgatc ccagtctctg ccagaccaag gctagtttcc ccactaataa agtgccgggt
     1561 gtcagcagac tgtgtgtatg tcctgtgtca ttgtcatttg ggaattcttt ttcttttctt
     1621 tttttttttt tttttttgag acggagtttt ttgctctatt gcccaggctt gagtgcagtg
     1681 gcgcaatcta ggctcactgc aagctccgcc tcccgggttc agcatttctg cta
//
LOCUS       HUMHBB3E      213 bp ss-mRNA            PRI       15-MAR-1990
DEFINITION  Human beta-globin mRNA, 3' end.
ACCESSION   M11428
KEYWORDS    beta-globin; globin.
SOURCE      Human cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 213)
  AUTHORS   Lang,K.M. and Spritz,R.A.
  TITLE     Cloning specific complete polyadenylylated 3'-terminal cDNA
            segments
  JOURNAL   Gene 33, 191-196 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       81     beta globin /nomgen="HBB" /map="11p11.5"
                                 /hgml_locus_uid="LQ0007V"
BASE COUNT       53 a     50 c     45 g     65 t
ORIGIN      
        1 gaattcaccc caccagtgca ggctgcctat cagaaagtgg tggctggtgt ggctaatgcc
       61 ctggcccaca agtatcacta agctcgcttt cttgctgtcc aatttctatt aaaggttcct
      121 ttgttccgta agtccaacta ctaaactggg ggatattatg aagggccttg agcatctgga
      181 ttctgcctaa taaaaaacat ttattttcat tgc
//
LOCUS       HUMHK1A      4134 bp ss-mRNA            PRI       15-DEC-1989
DEFINITION  Human calcium-ATPase (HK1) mRNA, complete cds.
ACCESSION   M23114 J04025
KEYWORDS    ATPase; Ca2+ ATPase; alternative splicing; calcium-ATPase.
SOURCE      Human kidney cortex, cDNA to mRNA, clones lambda HK[1a,1b].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4134)
  AUTHORS   Lytton,J. and MacLennan,D.H.
  TITLE     Molecular cloning of cDNAs from human kidney coding for two
            alternatively spliced products of the cardiac Ca2+-ATPase gene
  JOURNAL   J. Biol. Chem. 263, 15024-15031 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Lytton, 14-MAR-1989.  Two alternative splicing products, HK1
            and HK2, are realized in human kidney cDNAs.  HK2 codes for a
            protein identical to rabbit cardiac muscle Ca2+ ATPase, with the
            exception of 6 scattered amino acid replacements, whereas HK1 codes
            for a protein identical to that encoded by HK2, but with the
            carboxyl-terminal 4 amino acids replaced by an extended sequence of
            49 amino acids.  See accession M23115 and J04703.
FEATURES       from  to/span     description
    pept        164     3292     calcium-ATPase (EC 3.6.1.3) /nomgen="ATP2B"
                                 /map="12" /hgml_locus_uid="LP0123P"
    mRNA          1     4134     calcium-ATPase mRNA
    site       3143     3144     alternative splice site
BASE COUNT     1056 a    938 c   1012 g   1128 t
ORIGIN      254 bp upstream of HindIII site.
        1 gggtgattca gcgcccggcg aggcggaacg ggccgcaaga ggaggagggg agagcccgtc
       61 cgcgcctggg ctcccggggt ggcacgagcc cgcggccgga gtgcgaggcg gaggcgagga
      121 ggccgcgggg acgggaggcg aggccggccg ggcccccgaa gccatggaga acgcgcacac
      181 caagacggtg gaggaggtgc tgggccactt cggcgtcaac gagagtacgg ggctgagcct
      241 ggaacaggtc aagaagctta aggagagatg gggctccaac gagttaccgg ctgaagaagg
      301 aaaaaccttg ctggaacttg tgattgagca gtttgaagac ttgctagtta ggattttatt
      361 actggcagca tgtatatctt ttgttttggc ttggtttgaa gaaggtgaag aaacaattac
      421 agcctttgta gaaccttttg taattttact catattagta gccaatgcaa ttgtgggtgt
      481 atggcaggaa agaaatgctg aaaatgccat cgaagccctt aaggaatatg agcctgaaat
      541 gggcaaagtg tatcgacagg acagaaagag tgtgcagcgg attaaagcta aagacatagt
      601 tcctggtgat attgtagaaa ttgctgttgg tgacaaagtt cctgctgata taaggttaac
      661 ttccatcaaa tctaccacac taagagttga ccagtcaatt ctcacaggtg aatctgtctc
      721 tgtcatcaag cacactgatc ccgtccctga cccacgagct gtcaaccaag ataaaaagaa
      781 catgctgttt tctggtacaa acattgctgc tgggaaagct atgggagtgg tggtagcaac
      841 tggagttaac accgaaattg gcaagatccg ggatgaaatg gtggcaacag aacaggagag
      901 aacacccctt cagcaaaaac tagatgaatt tggggaacag ctttccaaag tcatctccct
      961 tatttgcatt gcagtctgga tcataaatat tgggcacttc aatgacccgg ttcatggagg
     1021 gtcctggatc agaggtgcta tttactactt taaaattgca gtggccctgg ctgtagcagc
     1081 cattcctgaa ggtctgcctg cagtcatcac cacctgcctg gctcttggaa ctcgcagaat
     1141 ggcaaagaaa aatgccattg ttcgaagcct cccgtctgtg gaaacccttg gttgtacttc
     1201 tgttatctgc tcagacaaga ctggtacact tacaacaaac cagatgtcag tctgcaggat
     1261 gttcattctg gacagagtgg aaggtgatac ttgttccctt aatgagttta ccataactgg
     1321 atcaacttat gcacctattg gagaagtgca taaagatgat aaaccagtga attgtcacca
     1381 gtatgatggt ctggtagaat tagcaacaat ttgtgctctt tgtaatgact ctgctttgga
     1441 ttacaatgag gcaaagggtg tgtatgaaaa agttggagaa gctacagaga ctgctctcac
     1501 ttgcctagta gagaagatga atgtatttga taccgaattg aagggtcttt ctaaaataga
     1561 acgtgcaaat gcctgcaact cagtcattaa acagctgatg aaaaaggaat tcactctaga
     1621 gttttcacgt gacagaaagt caatgtcggt ttactgtaca ccaaataaac caagcaggac
     1681 atcaatgagc aagatgtttg tgaagggtgc tcctgaaggt gtcattgaca ggtgcaccca
     1741 cattcgagtt ggaagtacta aggttcctat gacctctgga gtcaaacaga agatcatgtc
     1801 tgtcattcga gagtggggta gtggcagcga cacactgcga tgcctggccc tggccactca
     1861 tgacaaccca ctgagaagag aagaaatgca ccttgaggac tctgccaact ttattaaata
     1921 tgagaccaat ctgaccttcg ttggctgcgt gggcatgctg gatcctccga gaatcgaggt
     1981 ggcctcctcc gtgaagctgt gccggcaagc aggcatccgg gtcatcatga tcactgggga
     2041 caacaagggc actgctgtgg ccatctgtcg ccgcatcggc atcttcgggc aggatgagga
     2101 cgtgacgtca aaagctttca caggccggga gtttgatgaa ctcaacccct ccgcccagcg
     2161 agacgcctgc ctgaacgccc gctgttttgc tcgagttgaa ccctcccaca agtctaaaat
     2221 cgtagaattt cttcagtctt ttgatgagat tacagctatg actggcgatg gcgtgaacga
     2281 tgctcctgct ctgaagaaag ccgagattgg cattgctatg ggctctggca ctgcggtggc
     2341 taaaaccgcc tctgagatgg tcctggcgga tgacaacttc tccaccattg tggctgccgt
     2401 tgaggagggg cgggcaatct acaacaacat gaaacagttc atccgctacc tcatctcgtc
     2461 caacgtcggg gaagttgtct gtattttcct gacagcagcc cttggatttc ccgaggcttt
     2521 gattcctgtt cagctgctct gggtcaatct ggtgacagat ggcctgcctg ccactgcact
     2581 ggggttcaac cctcctgatc tggacatcat gaataaacct ccccggaacc caaaggaacc
     2641 attgatcagc gggtggctct ttttccgtta cttggctatt ggctgttacg tcggcgctgc
     2701 taccgtgggt gctgctgcat ggtggttcat tgctgctgac ggtggtccaa gagtgtcctt
     2761 ctaccagctg agtcatttcc tacagtgtaa agaggacaac ccggactttg aaggcgtgga
     2821 ttgtgcaatc tttgaatccc catacccgat gacaatggcg ctctctgttc tagtaactat
     2881 agaaatgtgt aacgccctca acagcttgtc cgaaaaccag tccttgctga ggatgccccc
     2941 ctgggagaac atctggctcg tgggctccat ctgcctgtcc atgtcactcc acttcctgat
     3001 cctctatgtc gaacccttgc cactcatctt ccagatcaca ccgctgaacg tgacccagtg
     3061 gctgatggtg ctgaaaatct ccttgcccgt gattctcatg gatgagacgc tcaagtttgt
     3121 ggcccgcaac tacctggaac ctggtaaaga gtgtgtgcag cctgccacca aatcctgctc
     3181 gttctcggca tgcaccgatg ggatttcctg gccgtttgtg ctgctcataa tgcccctggt
     3241 gatctgggtc tatagcacag acactaactt tagcgatatg ttctggtctt gactgacagt
     3301 tttccataaa gaagatgttt aacttaatca attaattttt ttattgttta aagcaactgt
     3361 ctatttctgc tgaattttca catgaacata ctggctggtg atggaggttt catactctag
     3421 attttgtttt gctttttctg actccagtgg ggcaagattt tcctttttta tacacataat
     3481 taaagtgtcc attgacatgt acagagaact aacactattt tatgcaaata tttttttgta
     3541 gatgaaaaag catgtacagt gttctgttta atactcatcc ttgtataaaa aaaatagttg
     3601 agccagcaga cattgtcagc aaattaattg gcagcagatt ttaggaaatg aatgtgtgtg
     3661 gttttttttc taaaactaaa tagcatgtat tgtgtctttt gcatgatgat ccggatttaa
     3721 tttgatatca cagtctaatt tttattcata agccaatttt tctgcactga gcagagtctt
     3781 gctacctcag tcagtattgt tttggtttgc tacttccctc acccactttg gcctccgttc
     3841 accccacccc accccacctc tccccacctt acccccgccc cgcttggctt cttctttagg
     3901 attgtgatgg ttcgttctgt ttacatcagt tttaacgaga ggtatgcctg tactcgcttg
     3961 tgcagaaaac attgttccag attcaatcga ctgggtttat gtcccttcac atagttttta
     4021 aggttattta tttaaatgtc taatgtattt tattgtaaca gacattgttt tgccaacatt
     4081 gcctatttca gtggcacgtc atctagtttt aaaaaaataa aacattttaa aaag
//
LOCUS       HUMHK2A      3781 bp ss-mRNA            PRI       15-DEC-1989
DEFINITION  Human calcium-ATPase (HK2) mRNA, complete cds.
ACCESSION   M23115 J04025
KEYWORDS    ATPase; Ca2+ ATPase; alternative splicing; calcium-ATPase.
SOURCE      Human kidney cortex, cDNA to mRNA, clones lambda HK2[a,b].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3781)
  AUTHORS   Lytton,J. and MacLennan,D.H.
  TITLE     Molecular cloning of cDNAs from human kidney coding for two
            alternatively spliced products of the cardiac Ca2+-ATPase gene
  JOURNAL   J. Biol. Chem. 263, 15024-15031 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Lytton, 14-MAR-1989.  Two alternative splicing products, HK1
            and HK2, are realized in human kidney cDNAs.  HK2 codes for a
            protein identical to rabbit cardiac muscle Ca2+ ATPase, with the
            exception of 6 scattered amino acid replacements, whereas HK1 codes
            for a protein identical to that encoded by HK2, but with the
            carboxyl-terminal 4 amino acids replaced by an extended sequence of
            49 amino acids.  See accession M23114 and J04703.
FEATURES       from  to/span     description
    pept          4     2997     calcium-ATPase (EC 3.6.1.3) /nomgen="ATP2B"
                                 /map="12" /hgml_locus_uid="LP0123P"
    mRNA          1     3781     calcium-ATPase mRNA
    site       2983     2984     alternative splice site
BASE COUNT     1018 a    829 c    913 g   1021 t
ORIGIN      94 bp upstream of HindIII site.
        1 gccatggaga acgcgcacac caagacggtg gaggaggtgc tgggccactt cggcgtcaac
       61 gagagtacgg ggctgagcct ggaacaggtc aagaagctta aggagagatg gggctccaac
      121 gagttaccgg ctgaagaagg aaaaaccttg ctggaacttg tgattgagca gtttgaagac
      181 ttgctagtta ggattttatt actggcagca tgtatatctt ttgttttggc ttggtttgaa
      241 gaaggtgaag aaacaattac agcctttgta gaaccttttg taattttact catattagta
      301 gccaatgcaa ttgtgggtgt atggcaggaa agaaatgctg aaaatgccat cgaagccctt
      361 aaggaatatg agcctgaaat gggcaaagtg tatcgacagg acagaaagag tgtgcagcgg
      421 attaaagcta aagacatagt tcctggtgat attgtagaaa ttgctgttgg tgacaaagtt
      481 cctgctgata taaggttaac ttccatcaaa tctaccacac taagagttga ccagtcaatt
      541 ctcacaggtg aatctgtctc tgtcatcaag cacactgatc ccgtccctga cccacgagct
      601 gtcaaccaag ataaaaagaa catgctgttt tctggtacaa acattgctgc tgggaaagct
      661 atgggagtgg tggtagcaac tggagttaac accgaaattg gcaagatccg ggatgaaatg
      721 gtggcaacag aacaggagag aacacccctt cagcaaaaac tagatgaatt tggggaacag
      781 ctttccaaag tcatctccct tatttgcatt gcagtctgga tcataaatat tgggcacttc
      841 aatgacccgg ttcatggagg gtcctggatc agaggtgcta tttactactt taaaattgca
      901 gtggccctgg ctgtagcagc cattcctgaa ggtctgcctg cagtcatcac cacctgcctg
      961 gctcttggaa ctcgcagaat ggcaaagaaa aatgccattg ttcgaagcct cccgtctgtg
     1021 gaaacccttg gttgtacttc tgttatctgc tcagacaaga ctggtacact tacaacaaac
     1081 cagatgtcag tctgcaggat gttcattctg gacagagtgg aaggtgatac ttgttccctt
     1141 aatgagttta ccataactgg atcaacttat gcacctattg gagaagtgca taaagatgat
     1201 aaaccagtga attgtcacca gtatgatggt ctggtagaat tagcaacaat ttgtgctctt
     1261 tgtaatgact ctgctttgga ttacaatgag gcaaagggtg tgtatgaaaa agttggagaa
     1321 gctacagaga ctgctctcac ttgcctagta gagaagatga atgtatttga taccgaattg
     1381 aagggtcttt ctaaaataga acgtgcaaat gcctgcaact cagtcattaa acagctgatg
     1441 aaaaaggaat tcactctaga gttttcacgt gacagaaagt caatgtcggt ttactgtaca
     1501 ccaaataaac caagcaggac atcaatgagc aagatgtttg tgaagggtgc tcctgaaggt
     1561 gtcattgaca ggtgcaccca cattcgagtt ggaagtacta aggttcctat gacctctgga
     1621 gtcaaacaga agatcatgtc tgtcattcga gagtggggta gtggcagcga cacactgcga
     1681 tgcctggccc tggccactca tgacaaccca ctgagaagag aagaaatgca ccttgaggac
     1741 tctgccaact ttattaaata tgagaccaat ctgaccttcg ttggctgcgt gggcatgctg
     1801 gatcctccga gaatcgaggt ggcctcctcc gtgaagctgt gccggcaagc aggcatccgg
     1861 gtcatcatga tcactgggga caacaagggc actgctgtgg ccatctgtcg ccgcatcggc
     1921 atcttcgggc aggatgagga cgtgacgtca aaagctttca caggccggga gtttgatgaa
     1981 ctcaacccct ccgcccagcg agacgcctgc ctgaacgccc gctgttttgc tcgagttgaa
     2041 ccctcccaca agtctaaaat cgtagaattt cttcagtctt ttgatgagat tacagctatg
     2101 actggcgatg gcgtgaacga tgctcctgct ctgaagaaag ccgagattgg cattgctatg
     2161 ggctctggca ctgcggtggc taaaaccgcc tctgagatgg tcctggcgga tgacaacttc
     2221 tccaccattg tggctgccgt tgaggagggg cgggcaatct acaacaacat gaaacagttc
     2281 atccgctacc tcatctcgtc caacgtcggg gaagttgtct gtattttcct gacagcagcc
     2341 cttggatttc ccgaggcttt gattcctgtt cagctgctct gggtcaatct ggtgacagat
     2401 ggcctgcctg ccactgcact ggggttcaac cctcctgatc tggacatcat gaataaacct
     2461 ccccggaacc caaaggaacc attgatcagc gggtggctct ttttccgtta cttggctatt
     2521 ggctgttacg tcggcgctgc taccgtgggt gctgctgcat ggtggttcat tgctgctgac
     2581 ggtggtccaa gagtgtcctt ctaccagctg agtcatttcc tacagtgtaa agaggacaac
     2641 ccggactttg aaggcgtgga ttgtgcaatc tttgaatccc catacccgat gacaatggcg
     2701 ctctctgttc tagtaactat agaaatgtgt aacgccctca acagcttgtc cgaaaaccag
     2761 tccttgctga ggatgccccc ctgggagaac atctggctcg tgggctccat ctgcctgtcc
     2821 atgtcactcc acttcctgat cctctatgtc gaacccttgc cactcatctt ccagatcaca
     2881 ccgctgaacg tgacccagtg gctgatggtg ctgaaaatct ccttgcccgt gattctcatg
     2941 gatgagacgc tcaagtttgt ggcccgcaac tacctggaac ctgcaatact ggagtaaccg
     3001 cttcctaaac cattttgcag aaatgtaagg gtgttcggtt gcgtgcatgt gcgtttttag
     3061 caacacatct accaaccctg tgcatgactg atgttgggga aaaagaaaag taaaaaactt
     3121 cccaactcac tttgtgttat gtggaggaaa tgtgtattac caatggggtt gttagctttt
     3181 aaatcaaaat actgattaca gatgtacaat ttagcttaat cagaaagcct ctccagagaa
     3241 gtttggtttc tttgctgcaa gaggaatgag gctctgtaac cttatctaag aacttggaag
     3301 ccgtcagcca agtcgccaca tttctctgca aaatgtcata gcttatataa atgtacagta
     3361 ttcaattgta atgcatgctt cggttgtaag tagccagatc cctctccagt gacattggaa
     3421 catgctactt tttaattggc cctgtacagt ttgcttattt ataaattcat taaaaacact
     3481 acaggtgttg aatggttaaa atgtaggcct ccagttcatt ttcagttatt ttctgagtgt
     3541 gcagacagct atttcgcact gtattaaatg taacttattt aatgaaatca gaagcagtag
     3601 acagatgttg gtgcaataca aatattgtga tgcatttatc ttaataaaat gctaaatgtc
     3661 aatttatcac tgcgcatgtt tgactttaga ctgtaaatag agatcagttt gtttctttct
     3721 gtgctggtaa caatgagcgt cgcacagaca tggtttcagg taaataaatc tattctatga
     3781 t
//
LOCUS       HUMHMG17G    7195 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human HMG-17 gene for non histone chromosomal protein HMG-17.
ACCESSION   X13546
KEYWORDS    HMG-17 chromosomal protein; HMG-17 gene; non histone protein.
SOURCE      human (Homo sapiens).
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 7195; enum. 1 to 7195)
  AUTHORS   Landsman,D.
  TITLE     ;
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (sites; for [1])
  AUTHORS   Landsman,D., McBride,O.W. and Bustin,M.
  TITLE     Human non-histone chromosomal protein HMG-17: identification,
            characterization, chromosome localization and RFLP's of a
            functional gene from the large multigene family
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
COMMENT     *source: cell line=MCF-7; library=EMBL3; clone=pH17g; **map:
            chromosomal location=1p12-34.
            
            See also HSHMG17 (M12623) for the cDNA sequence and HSHMG17P
            (X06444) for a retropseudogene HMG-17, published by Landsman D.
            and others.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               PRM        2315   2319       CAAT-box
               PRM        2372   2377       TATA-box
               TRANSCR    2402   5851       primary transcript
               MSG        2402   2523       exon 1
            
               IVS        2524   3369       intron I
               MSG        3370   3414       exon 2
            
               IVS        3415   3589       intron II
               MSG        3590   3619       exon 3
            
               IVS        3620   3969       intron III
               MSG        3970   4020       exon 4
            
               IVS        4021   4456       intron IV
               MSG        4457   4552       exon 5
            
               IVS        4553   4997       intron V
               MSG        4998   5851       exon 6
            
            [1]  Author address Landsman D., Laboratory of Molecular
            Carcinogenesis, National Institutes of Health, Building 37, Room
            3D20, Bethesda, MD 20892, U.S.A__
            
            Submitted (12-NOV-1988) on tape to the EMBL data library by:
FEATURES       from  to/span     description
    pept       2509     2523     put. HMG-17 protein (AA 1-5)
                                 /hgml_locus_uid="LY0033Z" /map="1p36.1-p35"
                                 /nomgen="HMG17"
               3370     3414     put. HMG-17 protein (AA 6-20)
               3590     3619     put. HMG-17 protein (AA 21-30)
               3970     4020     put. HMG-17 protein (AA 31-47)
               4457     4552     put. HMG-17 protein (AA 48-79)
               4998     5033     put. HMG-17 protein (AA 80-90)
BASE COUNT     1664 a   1754 c   1809 g   1968 t
ORIGIN      
        1 aagcttgttc taggtcctct gcttgtgagg gtcaaagctg tgtcctttcc cttacctccc
       61 tctgccagga cttgctgcag agctgctgag aggattagtg cctttgaaga gctgtctgcc
      121 tgagcaactc tatttcaggt gccccacacc ggcaagtacc agccagcaac accaaccaaa
      181 tgctactctc tttaaagtcc attttccttc ttcttttttt tttttttttt ttggagacag
      241 catctcaccc tgtcctggct ggtctcgaac tcctgacctc aggtgatccg ccggcctcac
      301 cctcccaaag tgctgggatt acaggcatga gccaccatgc cagacttccc attttacttt
      361 ctgcaagctg tttccctagc agctccctct aggggagagg tgaaatcttg caagttgtag
      421 caagagcaca caggaaaccc ctaactttcc tataccccac ccgcctcttc ccctttctgt
      481 cccgggatac tggcggcaag agacttcttg gctattgtcc atgctcccag aatcaagcat
      541 aaatgccaga cacggcgatt gagaagccaa tcagtgaacc ctttgcaaag ccccatccac
      601 acctggcact cccctctacc aatccctggc acagggttcc tggagagcag gtgctgtaca
      661 ttttacagct ttacaatggg gctgttgaca gccataatta gggaggcatg aattatgcgg
      721 ctataatgca gagccctaca attaaggcgg gaatgagggg ctggaggcag caaacggaat
      781 ctgccctatg agcgtggctg ttgagtcctg tctcctgggt ctgactttcc gtaatatgat
      841 tggggtacag tagaggtgat taatggggct ggcatctctc tttggcctga ggttctgtat
      901 tctgggaaag gtatacaggg tggagtaggg agaagctgcc ccaggaggcg atgtagtggt
      961 ggaaagaaga ggcagagagg tcgtcgtcgt cgcccagcag caagggctgc aaaatagtag
     1021 aactcgtggt tgctttggac aggtgtgatt tgtgcaagcc aggttcaacc cttgcctcaa
     1081 gaaatcagat gggaccaatt tagtgtcctt ccacctgtga gccaagcccc catttgagga
     1141 catctatcgt attcttgtgt gctgggtctc aaatagaatt tttaaagatt cttagatgta
     1201 aaacttgttt gctaactgca acggaagaac accttctttt gctgctgcaa ttgcctaccc
     1261 tccctcaggt gcacttggct agctccaaga ccctggcttt ggactagggg gcaccggaag
     1321 agcccttttg ccatcacacc acctgttcct ctacttgttt cctaatcccc gggcgctttc
     1381 ctccaccctc ctttcctcgc gcaagtgaaa ggcagccagc ttgacagtgt ggaaggaggg
     1441 atttcgccag ctgcggctgc agtgcccagg cttggtggcc gcgtcgggtt gggcagctgg
     1501 gcaaaatgtc tccgagaggg gtagagtgta gggggaagtg agtgctggta gacctcgacg
     1561 gccgctctcc accatactgt ggcggggtct ggaggtttgg gaatacaggg accttaaggg
     1621 agccacccga gaggagttgg gggtgggggc ttcgatagca tttggggagg cggctgcgca
     1681 acgggtgctg aaaggacagc tcctacctga cccacggcac ccaaaaggca cctctgcatg
     1741 gaaagtctcc ccctgaaacc cctagcccac ctcgagtggg accgtctttc aggtgaggtg
     1801 cacgaaaggc ttctccgaag tggcagctcg gaaggatgca cgtccggctg cgccaagctg
     1861 gtttccccgc ctttttccca cttcggcgtc agcctgggct ctgggaactg cagcctgtgc
     1921 ctccagccgc gcgcttcctc ctgcttccag aggagcaggg cagtgcagcc aggcctcgca
     1981 aacccctctt ctggggcccc cagcgggagc aggaaggaaa gccgctgaga tgcgccgtga
     2041 gtgtcgcacg gcttggagaa gtgggttgct gggttgcata aggtggaagg aacttgctag
     2101 tttgcaaaag acagctcttg gcgggagccg gaacccggag cccgccgact cgcgcacccc
     2161 tcctcattta ctgccttcga cgcgcgccac cccgactgcg cccccacccc ctcggcgcgc
     2221 cgccgtctcc gcgcactacc ccccccacac acccccctcc tcactccctc cagaggaggt
     2281 gagtttaaac cccgcccacg tgaccccagc tgggccaatg aacggcggcg ggaggtgaaa
     2341 tcggttctaa ccggtccggg gctcccagcg ctataaaaac tttataaacc ccccggagcc
     2401 cgagcagtgt gaagaagagg cgagaacgac ccccggaccg accaaagccc gcgcgccgct
     2461 gcatcccgcg tccagcacct acgtcccgct gccgtcgccg ccgccaccat gcccaagaga
     2521 aaggtacgtg gcgcgagggc cccaggcgcc gggccaccac tgccgccacc gccgccgccg
     2581 cctccctggt gcagggagcg agaatcggcg ccgagcagga gccagcgcac cgtccccgcg
     2641 cgggggctgg agacggtgtc gggcagctcg gggctaaccc tgagcggctc ggctgcccgc
     2701 gggcgccaga ggccatattg gaggagcggc ggccgcggcg ggaggagcca tgttggcggc
     2761 tgtttatccc gctctcctcg ctcgccgccc ccgccccgtg ccccctcccc catcgcgagg
     2821 gccccttccc tccctctgcg ggcgggcaat tcaaacccga aagggcggga aggcggcgct
     2881 cggggttggc gggcggggga aagcgtgccg ccaaaaaacc gccgccgtga ggcggggctt
     2941 gtgcggtatg gccccgcccc ctcgcccacg ttccccgcgc acgagacgcg cgctgtcgcc
     3001 gcccacgagt tccccgggct gcgcgcgcct ccctctcccg ccctcgacgg ctgccatagc
     3061 aacggcgctg ggctccgctc cggagggggt ttgtttgcgc catctgcagc tgttgctcct
     3121 gcctgtcgcg gtggtgcggg ctccgctgcc ctccccggct gcgctccggt ccagccctcg
     3181 cttctcgggg tcggcgagcc ggagctcctg cgcgcgcttc gttcttatac gaacgtcggg
     3241 ctcactcatt tatgtcctag aaaagttgtg tggacgactg ctttaatttt catttttagc
     3301 actctaaagt ttgggaagta attaagaacc acctcaaaat ctgcagtttt ttgttcttgt
     3361 ttcttatagg ctgaagggga tgctaaggga gataaagcaa aggtgaagga cgaagtaagt
     3421 cattctctct tcaagggtca aagccttgga ctagcagagg ccactggact cggtgattaa
     3481 ccgtaacctg tgtcctgaat ttacactcct ataatctaga gcaaattgat accaaacttt
     3541 caaagcgact tacctgtcct atttctaact ttctcgttgt ctttaatagc cacagagaag
     3601 atccgcgagg ttgtctgctg taagtgtatg cttttgaatt ttcgtgcttg tccctgaaac
     3661 taaaaaacat caaaaaacaa ttccctttgc ttccatgaat tatggttagt gcctggtttt
     3721 gaatcattgc ctctacttgg gactcttgcc ctttgggttt tgctggttct gaaattctga
     3781 tgcctgtagc caaagtggga catttgagtg ggcttctgga aatccaacat tctagaagaa
     3841 agccaaccac aaaaactttg aggaggagga gaaacttctc tacccttggt acttttggtt
     3901 ggttgtgggt ggttttcttc agtccattgt actgatgttc actttttcct ctcttcctgc
     3961 caaaaaaaga aacctgctcc tccaaagcca gagcccaagc ctaaaaaggc ccctgcaaag
     4021 gtaagtgcta acattggaac tgatcatttt cacagaatga ggactgtcct tagtgcctta
     4081 acttaattag cataatggtg cctccattaa tggaggttat aaactgtgtg gatagcttac
     4141 ctcgtccgtg tcattcataa cgttggtttc ctgatcaaga attctgttgt tagtttatag
     4201 gaaaatctgc atttaaatat taaatacccc agggtttaaa ctgatttatt cttctgttac
     4261 ttaggaaatt ctaagtacta aagattatag caaattacag ataatttagc ctagttttga
     4321 tcagttgttc ttgtcaaatg ggtgagggtt tgtgttttat cttgagtaaa caatcctacc
     4381 ttgtgcagaa ctttgcagac cataccttgg taatacgaaa tggggagaaa cagttctatt
     4441 ttttcccctt tttcagaagg gagagaaggt acccaaaggg aaaaagggaa aagctgatgc
     4501 tggcaaggag gggaataacc ctgcagaaaa tggagatgcc aaaacagacc aggtataact
     4561 gctgtttcac cctttgttag atttgttcat tcagttagtt gctgatatca aaaatttaat
     4621 tgcccttttc ttgtatcact ccaaatgtac catttggtcc agtgtgcttg tggctttcct
     4681 gttaacttaa atcctggatt cttgaaatct ctactgtcag ctgaagggca ttgtgtttta
     4741 tactacatga agttttcaag ctaggagtaa cctcaggtct cttattctgg tgttcttcct
     4801 acctcaattt tacagtgagg cccagaggag aggtgactta gcaaagttac ctagtaagtc
     4861 attctcagaa gaaatactga gtctcagtac ctgaaacctg aactttggcc tgggagcaaa
     4921 gtgatgctgt aggtggaatg tgaacacaga tagttttgaa atctacgcat tgcattaatt
     4981 tgtctgtttt cttttaggca cagaaagctg aaggtgctgg agatgccaag tgaagtgtgt
     5041 gcatttttga taactgtgta cttctggtga ctgtacagtt tgaaatacta ttttttatca
     5101 agttttataa aaatgcagaa ttttgtttta cttttttttt ttttttaaaa gctatgttgt
     5161 tagcacacag aacacttcat tgttgttttt gggggaaggg gcatatgtca ctaatagaat
     5221 gtctccaaag ctggattgat gtggagaaaa cacctttccc ttctagtttt gagagacttc
     5281 ctcttggctc ccaggaggag ggattccctg actttgacac acatggccac cttggcacaa
     5341 aagccttgtg gtatagaaaa acaaatttgt ttttatgtcc tcttctccct ttccatcttt
     5401 cagcatagac ttaactccct taagcccaga catctgttga gacctgaccc ctagtcattg
     5461 gttaccagtg tgtcaggcaa tctggacttt ccagtgatgc cactgagatg gcacctgtca
     5521 aaagagcagt ggttccattt ctagattgtg gatcttcaga taaattctgc cattttcatt
     5581 tcacttcctg aaagtcaggg tcggcttgtg aaaagttgtt aaacaacatg ctaaatgtga
     5641 aatgtcaacc ctcactctaa actttccctg ttcagagcat cagatgaaga cttcattggg
     5701 ttttatagtg gctttctgat ttttggtagt ccattgaaga agggagtttg aaagttgttg
     5761 tatactgtta acgattgtct gcccatgtcc tgcctgaaat accatgattg tttatggaaa
     5821 gtatctttaa taaagctgga tacagtttgg cttggaatgc tgcctctgat cttttccaca
     5881 agtggggtaa cctggtttat ccaagtctct tggaacaggg tacgttctgc tttgaggtac
     5941 tccatgtaca gtctaatgat gatctttcac tgatttatag tcagcttcca aaacacacag
     6001 tgctaggtgc agttaggaag gattccagga gtagacttag ctgatggtga aaggctggga
     6061 gtatggagtg atttctgtac ttggttatgc agcattccct gcagtgtttg ttttttgcat
     6121 gactccgaat acatgaagtg tattaaatgc agatttttgg tagatctgcg gttttggctg
     6181 ggtagctgac aggagcctct ctacttccat ctgcatttgc aacttctata aaattagcaa
     6241 attaatagac tcatgtcaga gagtgtctct gtgaatgtgt acacagttgt ccatttgtaa
     6301 ggtagatgac actttgctcc taagtagacc tgcaaacaaa gacaatgggg tctttcctga
     6361 ttcctttagt ttggagatgc agagaacccc acgtttctct aagtgaaatg gcaagaagtt
     6421 atttgttaga atatggaact ggtaacaaat tcatgtttgc accttgaatt tgctttcaca
     6481 gacctggctg tattgttgga caggaataaa gtaaattgga gactaattgc tttgtccatt
     6541 cccttagtca aaaaagtatc gctgttaagt tcttgaggtt ttttgtgtat tatctctcaa
     6601 gtgttgagat gtttgtgtaa tatatgtatc acactcatgg atgaggaatc aggcacagat
     6661 gaaacttgtt gaaggttatg tgtacattta atgagcagca gagtcaggat tcagaccctc
     6721 aactagacaa aagtcatctc tgacagactt ttattcagga taaaatacca tttgtccatt
     6781 tagccagtta ccagccattt actttcatta ttgctgaagt cattcattct aggttgtaga
     6841 cgggtaattt ttattctaaa agcggtgtcc tcttccagag gaaacttaag agacatctta
     6901 gaatggtcag gggtagtggg ctggtatttg agcagaggtt gcagaatttg agaaagttac
     6961 agctgaggtt ctttgttagt ggcaagtctg ctaagccctg ggctatgcta acagctctgc
     7021 ctcatgctct ttaatctttg agtcagacct atgagataca ggtgtagccc cattttgtag
     7081 tcgagagtga ggctcagatg tgaagcgact ggctgaggtt aaacttgata gtctgtttac
     7141 ttattttctg gccctcttct tgttgccata taattaggat gagtagggta agctt
//
LOCUS       HUMIGLVB      464 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human Ig germline lambda-chain (autoantibody Humv117) gene,
            V-region, partial cds.
ACCESSION   M30446
KEYWORDS    V-region; anti-DNA antibody; autoantibody; germline;
            immunoglobulin light chain; lambda-immunoglobulin.
SOURCE      Human retinoblastoma cell line Y79 DNA, (library of W.-H.Lee).
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 464)
  AUTHORS   Siminovitch,K.A., Misener,V., Kwong,P.C., Song,Q.-L. and Chen,P.P.
  TITLE     A natural autoantibody is encoded by germline heavy and lambda
            light chain variable region genes without somatic mutation
  JOURNAL   J. Clin. Invest. 84, 1675-1678 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /    44  /   349     Ig lambda chain V-region precursor (AA at 46)
    sigp    /    44       54     Ig lambda chain signal peptide (AA at 46)
    matp         55  /   349     Ig lambda chain
    IVS     <     1       43     Ig lambda-chain intron
    iDNA        350  >   464     intervening DNA (5' end approx.)
    signal      351      357     7 mer recombination signal
    signal      381      389     9 mer recombination signal
BASE COUNT       99 a    150 c    118 g     97 t
ORIGIN      
        1 tctagaccaa gaatcaccgt gtctgtgtct ctcctgcttc cagggtcctg ggcccagtct
       61 gtgttgacgc agccgccctc agtgtctgcg gccccaggac agaaggtcac catctcctgc
      121 tctggaagca gctccaacat tgggaataat tatgtatcct ggtaccagca gctcccagga
      181 acagccccca aactcctcat ctatgaaaat aataagcgac cctcagggat tcctgaccga
      241 ttctctggct ccaagtctgg cacgtcagcc accctgggca tcaccggact ccagactggg
      301 gacgaggccg attattactg cggaacatgg gatagcagcc tgagtgctgg cacagtgctc
      361 cagcccaatg gggaactgag acaagaaccc ccttcttcct cccccaggag ggtgagtgcc
      421 gccagctgct gctcacgcct gacctgtagc ttctgctgct gcag
//
LOCUS       HUMIL71      2195 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human interleukin 7 (IL7) gene, exon 1.
ACCESSION   M29048
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     1 of 6
SOURCE      Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
            to mRNA, clone 3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2195)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept       2167  +  2176     interleukin 7, exon 1
    pre-msg <  1783  >  2195     IL7 mRNA and introns
    IVS        2177  >  2195     IL7 intron A
BASE COUNT      570 a    543 c    498 g    584 t
ORIGIN      1783 bp upstream of EcoRI site.
        1 ctaggcgtgt ttcattatag aaatcatttt taaaactcac tccagtgtca tgtctgtcct
       61 cctccccacc tcccactcta ggttccagtc aaatggaaat atgtggcttt tctcactcca
      121 ttctgtctct cttttccatt tgcttacccc cttcgcttcc tctttcattc tatttccttc
      181 tgctttcatg aactttctct ttcacacata ttcactgatg cattcccact cacctgttct
      241 ctcttactct gtcacccccc agtacttact gaatgccagc catgggctaa agaccatagt
      301 aagtggacgc tgggcaggaa gagaaaggcc attgtggctg gaggatagtg atgtggtacc
      361 acgaaaacgt cagagagggc agtggggccc agatcaccag ggcactgagg gcctggtacc
      421 cagtgggaag tgaccagaag gtggggcaca aaatagaatc agatatcctg ctgctctgag
      481 aagaatgaag gggttgtggg caaaagtgaa agctggaaga tcccttagac agtctctata
      541 gtcaccttag ggttaatttt ggaggtgggg cagacaggac ttgtctatgg atttgatgtg
      601 ggaaatgaga ccaaaaaagg aatcagagac ccaagtttca ggcttgagca actctgttct
      661 ccctagttca ttgccatcct tttttttcct ttctccttcc ctcctcttcc ttgttcacgc
      721 tatagcagaa tgtgacatac tccttaccag ccccaagttc attcctcttc aagtgtcagt
      781 ttagaagtca cagactccag gaaactttcc tgaagtcccc acactggggc aaagccacac
      841 acgtactgtc atctccctcc cggcatcagt gtgcacatta ttactttatt cacaggagtt
      901 ccaagctgga gtttctacag cccaaatgca gcaagtctgt gtctaagctc agttagctaa
      961 gtgaaatttg gaaagttact taaggaagtc tcagtcccca catctgtaaa gtgcagataa
     1021 gaaaatggtc tacctcacac aaagttttgc atgagatgat atacataaaa tgcttagcac
     1081 aaaactggca gagagcaaat gctagataaa tgatagtcgt tattattatc gctgttgtta
     1141 ctggtttaca ttatccacct tcatctaagc accctttctg cagaatagca gaaaccaaac
     1201 taatgtagca aataagctac ataattcaag cccaggaaaa agttaacatt tcagtggcat
     1261 gcattcaaga cgaatagttt gatttattag ccaattcaga taaatgtgca cgtggaagtc
     1321 atagttaaat attatcgtca gtttccacgt cctgcgttta atttggggtt tgattttcca
     1381 aatacaacac ttaccagatt aggtggaccc acaggattat ttttccttga ggtctcacct
     1441 gagcaggtgc atgtacagca gacggagcag aaagagactg attagagagg ttggagtggt
     1501 agagggcgtg accctcttaa tcattcttca cttccttttt taaaagacga cttggcatcg
     1561 tccaccacat ccgcggcaac gcctccttgg tgtcgtccgc ttccaataac ccagcttgcg
     1621 tcctgcacac ttgtggcttc cgtgcacaca ttaacaactc atgggtctag ctcccagtcg
     1681 ccaagcgttg ccaaggcgtt gagagatcat ctgggaagtc ttttacccag aattgctttg
     1741 attcaggcca gctggttttt ctgcggtgat tcggaaattc gcgaattcct ctggtcctca
     1801 tccaggtgcg cgggaagcag gtgcccagga gagaggggat aatgaagatt ccatgctgat
     1861 gatcccaaag attgaacctg cagaccaagc gcaaagtaga aactgaaagt acactgctgg
     1921 cggatcctac ggaagttatg gaaaaggcaa agcgcagagc cacgccgtag tgtgtgccgc
     1981 cccccttggg atggatgaaa ctgcagtcgc ggcgtgggta agaggaacca gctgcagaga
     2041 tcaccctgcc caacacagac tcggcaactc cgcggaagac cagggtcctg ggagtgacta
     2101 tgggcggtga gagcttgctc ctgctccagt tgcggtcatc atgactacgc ccgcctcccg
     2161 cagaccatgt tccatggtaa gcgctcttct ccctt
//
LOCUS       HUMIL72       177 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human interleukin 7 (IL7) gene, exon 2.
ACCESSION   M29049
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     2 of 6
SOURCE      Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
            to mRNA, clone 3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 177)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept    +    21  +   157     interleukin 7, exon 2
    pre-msg <     1  >   177     IL7 mRNA and introns
    IVS     <     1       20     IL7 intron A
    IVS         158  >   177     IL7 intron B
BASE COUNT       43 a     26 c     34 g     74 t
ORIGIN      About 3 kb after segment 1.
        1 ttatttactg tcaaatttag tttcttttag gtatatcttt ggacttcctc ccctgatcct
       61 tgttctgttg ccagtagcat catctgattg tgatattgaa ggtaaagatg gcaaacaata
      121 tgagagtgtt ctaatggtca gcatcgatca attattggta tgtgattatt ttgtttt
//
LOCUS       HUMIL73       121 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human interleukin 7 (IL7) gene, exon 3.
ACCESSION   M29050
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     3 of 6
SOURCE      Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
            to mRNA, clone 3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept    +    21  +   101     interleukin 7, exon 3
    pre-msg <     1  >   121     IL7 mRNA and introns
    IVS     <     1       20     IL7 intron B
    IVS         102  >   121     IL7 intron C
BASE COUNT       44 a     10 c     22 g     45 t
ORIGIN      Undetermined number of bp after segment 2.
        1 ttttatgtta tttattacag gacagcatga aagaaattgg tagcaattgc ctgaataatg
       61 aatttaactt ttttaaaaga catatctgtg atgctaataa ggtaatgata attatttgga
      121 g
//
LOCUS       HUMIL74       157 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human interleukin 7 (IL7) gene, exon 4.
ACCESSION   M29051
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     4 of 6
SOURCE      Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
            to mRNA, clone 3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 157)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept    +    21  +   152     interleukin 7, exon 4
    pre-msg <     1  >   157     IL7 mRNA and introns
    IVS     <     1       20     IL7 intron C
    IVS         153  >   157     IL7 intron D
BASE COUNT       45 a     28 c     32 g     52 t
ORIGIN      About 1.5 kb after segment 3.
        1 ctgacttttt cctataatag gaaggtatgt ttttattccg tgctgctcgc aagttgaggc
       61 aatttcttaa aatgaatagc actggtgatt ttgatctcca cttattaaaa gtttcagaag
      121 gcacaacaat actgttgaac tgcactggcc aggtaag
//
LOCUS       HUMIL75        94 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human interleukin 7 (IL7) gene, exon 5.
ACCESSION   M29052
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     5 of 6
SOURCE      Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
            to mRNA, clone 3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 94)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept    +    21  +    74     interleukin 7, exon 5
    pre-msg <     1  >    94     IL7 mRNA and introns
    IVS     <     1       20     IL7 intron D
    IVS          75  >    94     IL7 intron E
BASE COUNT       33 a     13 c     22 g     26 t
ORIGIN      About 1 kb after segment 4.
        1 aatgtgactt tgtttttaag gttaaaggaa gaaaaccagc tgccctgggt gaagcccaac
       61 caacaaagag tttggtgaga ataattgtat aatt
//
LOCUS       HUMIL76      1672 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human interleukin 7 (IL7) gene, exon 6.
ACCESSION   M29053
KEYWORDS    hemopoietic growth factor; interleukin; lymphopoietin;
            pre-B cell growth factor.
SEGMENT     6 of 6
SOURCE      Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA
            to mRNA, clone 3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1672)
  AUTHORS   Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A.,
            Cosman,D. and Goodwin,R.G.
  TITLE     Characterization of the human and murine interleukin-7 genes
  JOURNAL   Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Lupton, 26-SEP-1989.
FEATURES       from  to/span     description
    pept    +    21      140     interleukin 7, exon 6
    pre-msg <     1      811     IL7 mRNA and introns
    IVS     <     1       20     IL7 intron E
    signal      753      758     polyA signal
    signal     1060     1065     polyA signal
BASE COUNT      586 a    266 c    258 g    562 t
ORIGIN      About 0.2 kb after segment 5.
        1 tttaaaactc tattctctag gaagaaaata aatctttaaa ggaacagaaa aaactgaatg
       61 acttgtgttt cctaaagaga ctattacaag agataaaaac ttgttggaat aaaattttga
      121 tgggcactaa agaacactga aaaatatgga gtggcaatat agaaacacga actttagctg
      181 catcctccaa gaatctatct gcttatgcag tttttcagag tggaatgctt cctagaagtt
      241 actgaatgca ccatggtcaa aacggattag ggcatttgag aaatgcatat tgtattacta
      301 gaagatgaat acaaacaatg gaaactgaat gctccagtca acaaactatt tcttatatat
      361 gtgaacattt atcaatcagt ataattctgt actgattttt gtaagacaat ccatgtaagg
      421 tatcagttgc aataatactt ctcaaacctg tttaaatatt tcaagacatt aaatctatga
      481 agtatataat ggtttcaaag attcaaaatt gacattgctt tactgtcaaa ataattttat
      541 ggctcactat gaatctatta tactgtatta agagtgaaaa ttgtcttctt ctgtgctgga
      601 gatgttttag agttaacaat gatatatgga taatgccggt gagaataaga gagtcataaa
      661 ccttaagtaa gcaacagcat aacaaggtcc aagataccta aaagagattt caagagattt
      721 aattaatcat gaatgtgtaa cacagtgcct tcaataaatg gtatagcaaa tgttttgaca
      781 tgaaaaaagg acaatttcaa aaaaataaaa taaaataaaa ataaattcac ctagtctaag
      841 gatgctaaac cttagtactg agttacattg tcatttatat agattataac tgtctaaata
      901 agtttgcaat ttgggagata tatttttaag ataataatat atgtttacct tttaattaat
      961 gaaatatctg tatttaattt tgacactata tctgtatata aaatattttc atacagcatt
     1021 acaaattgct tactttggaa tacatttctc ctttgataaa ataaatgagc tatgtattaa
     1081 cactgccaga ttcagttaat aaatctcaac agaattttta aggtgagatt tttaatactt
     1141 cactgctctt taattttcta ctttcattga atatatgacc tttaatagcc tattaccaac
     1201 aatgaatatg agacattacc ttctatgtta aattttagtc ttttaaaaga aacttgtctg
     1261 gaaatgtctg cagtcttcat tcactgataa agggaaatgg aaacactatt ctaagcaggc
     1321 cagaatctac aaatagctgt cagtttgcat tgcactgaac tcacatcctt catgctctct
     1381 cagcctcatc accatctagt tatgtgctga tgggctttat ttgatctaca gaagatgctc
     1441 tggtgacacc ctcagtgtgt gttggtaaca ccttcctgcc ttctctcact ttccttgacc
     1501 catatgacat ttcaccttca ctggtgcgat agctgtatag aattgtatac acttccatac
     1561 aatttactga actgccccat taacaggact ttgagtcaag tatttttttc tgtttggtta
     1621 tcactgttaa ctccagaaag atgagttgac taacattttt tagaagcaaa tg
//
LOCUS       HUMINSRMUT    349 bp ds-DNA             PRI       15-DEC-1989
DEFINITION  Human insulin receptor gene allele C, exon 17, with an Alu
            insertion.
ACCESSION   M27196
KEYWORDS    Alu repetitive element; insulin receptor.
SOURCE      Human (patient 1, haplotype C) white bloodcell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Taira,M., Taira,M., Hashimoto,N., Shimada,F., Suzuki,Y.,
            Kanatsuka,A., Nakamura,F., Ebina,Y., Tatibana,M., Makino,H. and
            Yoshida,S.
  TITLE     Human diabetes associated with a deletion of the tyrosine kinase
            domain of the insulin receptor
  JOURNAL   Science 245, 63-66 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by M.Taira, 02-SEP-1989.
FEATURES       from  to/span     description
    pept.ps /    40      341     pseudo-insulin receptor, exon 17 (AA at 42)
                                 /nomgen="INSR" /map="19p13.1-p13"
                                 /hgml_locus_uid="LG0007M"
    IVS     <     1       39     INSR intron P
    rpt         145      231     Alu repeat (partial)
    rpt         233      349     Alu repeat (partial)
BASE COUNT       99 a     81 c     89 g     80 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcctcca aggatgctgt gtagataagt aagaagtagt gtttccatgc tctgtgtacg
       61 tgccggacca gtgggaggtg tctcgagaga agatcaccct ccttcgagag ctggggcagg
      121 gctccttcgg catggtgtat gaggctggtc tcgaactccc cacctcaggt gatccgcgtg
      181 ccttggcctc ccaaagtggt gggattacag gcgtgagcca ccacgcccag cccctgcact
      241 gtggttttta tcaccacagt gaagggaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaagct
      301 cttttaactc tatgctttgt gtccactttc caaaagaatg aaaatagat
//
LOCUS       HUMPDGFRB     400 bp ss-mRNA            PRI       15-MAR-1990
DEFINITION  Human platelet-derived growth factor beta mRNA fragment.
ACCESSION   M30493
KEYWORDS    platelet-derived growth factor.
SOURCE      Human embryonic lung fibroblast, cDNA to mRNA, clone HPR5.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 400)
  AUTHORS   Miki,T., Matsui,T., Heidaran,M.A. and Aaronson,S.A.
  TITLE     An efficient directional cloning system to construct cDNA libraries
            containing full-length inserts at high frequency
  JOURNAL   Gene 83, 137-146 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    mRNA    <     1  >   400     platelet-derived growth factor beta mRNA
                                 /nomgen="PDGFRB" /map="5q33-q35"
                                 /hgml_locus_uid="LW0046Z"
BASE COUNT       70 a    144 c    112 g     74 t
ORIGIN      Chromosome 5q33-q35.
        1 gccggcccct cagccctgct gcccagcacg agcctgtgct cgccctgccc aacgcagaca
       61 gccagaccca gggcggcccc tctggcggct ctgctcctcc cgaaggatgc ttggggagtg
      121 aggcgaagct gggcgctcct ctcccctaca gcagccccct tcctccatcc ctctgttctc
      181 ctgagccttc aggagcctgc accagtcctg cctgtccttc tactcagctg ttaccaactc
      241 tgggaccagc agtctttctg ataactggga gagggcagta aggaggactt cctggagggg
      301 gtgactgtcc agagcctgga actgtgccca caccagaagc catcagcagc aaggacacca
      361 tgcggcttcc gggtgcgatg ccagctctgg ccctcaaagg
//
LOCUS       HUMPDGFRC     150 bp ss-mRNA            PRI       15-MAR-1990
DEFINITION  Human platelet-derived growth factor alpha mRNA fragment.
ACCESSION   M30494
KEYWORDS    platelet-derived growth factor.
SOURCE      Human embryonic lung fibroblast, cDNA to mRNA, clone TR4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 150)
  AUTHORS   Miki,T., Matsui,T., Heidaran,M.A. and Aaronson,S.A.
  TITLE     An efficient directional cloning system to construct cDNA libraries
            containing full-length inserts at high frequency
  JOURNAL   Gene 83, 137-146 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    mRNA    <     1  >   150     platelet-derived growth factor alpha mRNA
                                 /nomgen="PDGFRA" /map="4q11-q13"
                                 /hgml_locus_uid="LR0223Z"
BASE COUNT       40 a     30 c     47 g     33 t
ORIGIN      Chromosome 4q11-q13.
        1 ccattactgt tggagctaca gggagagaaa caggaggaga ctgcaagaga tcatttggga
       61 aggccgtggg cacgctcttt actccatgtg tgggacattc attgcggaat aacatcggag
      121 gagaagtttc ccagagctat ggggacttcc
//
LOCUS       HUMPSAA      7130 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human prostate specific antigen gene, complete cds.
ACCESSION   M27274
KEYWORDS    Alu repetitive element; kallikrein; prostate specific antigen.
SOURCE      Human peripheral leukocyte DNA, (library of R.Wydro), clones
            lambda-PSA.[24A,25A,25B].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 7130)
  AUTHORS   Lundwall,A.
  TITLE     Characterization of the gene for prostate-specific antigen, a human
            glandular kallikrein
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 1151-1159 (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Lundwall, 03-NOV-1989.
FEATURES       from  to/span     description
    pept        675      720     prostate specific antigen, exon 1
                                 /nomgen="APS" /map="19q13.3-qter"
                                 /hgml_locus_uid="LN0098S"
               1959     2118     prostate specific antigen, exon 2
               3756     4042     prostate specific antigen, exon 3
               4186     4322     prostate specific antigen, exon 4
               5699     5854     prostate specific antigen, exon 5
    pre-msg     634     6491     PSA mRNA and introns
    IVS         721     1958     PSA intron A
    IVS        2119     3755     PSA intron B
    IVS        4043     4185     PSA intron C
    IVS        4323     5698     PSA intron D
    rpt        2583     2935     Alu repeat
    signal      503      512     CACCC box
    signal      605      611     TATA box
    binding     581      586     SpI binding site
BASE COUNT     1530 a   2024 c   1867 g   1709 t
ORIGIN      1 bp upstream of EcoRI site; chromosome 19q13.3-qter.
        1 gaattccaca ttgtttgctg cacgttggat tttgaaatgc tagggaactt tgggagactc
       61 atatttctgg gctagaggat ctgtggacca caagatcttt ttatgatgac agtagcaatg
      121 tatctgtgga gctggattct gggttgggag tgcaaggaaa agaatgtact aaatgccaag
      181 acatctattt caggagcatg aggaataaaa gttctagttt ctggtctcag agtggtgcag
      241 ggatcaggga gtctcacaat ctcctgagtg ctggtgtctt agggcacact gggtcttgga
      301 gtgcaaagga tctaggcacg tgaggctttg tatgaagaat cggggatcgt acccaccccc
      361 tgtttctgtt tcatcctggg catgtctcct ctgcctttgt cccctagatg aagtctccat
      421 gagctacaag ggcctggtgc atccagggtg atctagtaat tgcagaacag caagtgctag
      481 ctctccctcc ccttccacag ctctgggtgt gggagggggt tgtccagcct ccagcagcat
      541 ggggagggcc ttggtcagcc tctgggtgcc agcagggcag gggcggagtc ctggggaatg
      601 aaggttttat agggctcctg ggggaggctc cccagcccca agcttaccac ctgcacccgg
      661 agagctgtgt caccatgtgg gtcccggttg tcttcctcac cctgtccgtg acgtggattg
      721 gtgagagggg ccatggttgg ggggatgcag gagagggagc cagccctgac tgtcaagctg
      781 aggctctttc ccccccaacc cagcacccca gcccagacag ggagctgggc tcttttctgt
      841 ctctcccagc cccacttcaa gcccataccc ccagcccctc catattgcaa cagtcctcac
      901 tcccacacca ggtccccgct ccctcccact taccccagaa ctttctcccc attgcccagc
      961 cagctccctg ctcccagctg ctttactaaa ggggaagttc ctgggcatct ccgtgtttct
     1021 ctttgtgggg ctcaaaacct ccaaggacct ctctcaatgc cattggttcc ttggaccgta
     1081 tcactggtcc atctcctgag cccctcaatc ctatcacagt ctactgactt ttcccattca
     1141 gctgtgagtg tccaacccta tcccagagac cttgatgctt ggcctcccaa tcttgcccta
     1201 ggatacccag atgccaacca gacacctcct tcttcctagc caggctatct ggcctgagac
     1261 aacaaatggg tccctcagtc tggcaatggg actctgagaa ctcctcattc cctgactctt
     1321 agccccagac tcttcattca gtggcccaca ttttccttag gaaaaacatg agcatcccca
     1381 gccacaactg ccagctctct gattccccaa atctgcatcc ttttcaaaac ctaaaaacaa
     1441 aaagaaaaac aaataaaaca aaaccaactc agaccagaac tgttttctca acctgggact
     1501 tcctaaactt tccaaaacct tcctcttcca gcaactgaac ctggccataa ggcacttatc
     1561 cctggttcct agcacccctt atcccctcag aatccacaac ttgtaccaag tttcccttct
     1621 cccagtccaa gaccccaaat caccacaaag gacccaatcc ccagactcaa gatatggtct
     1681 gggcgctgtc ttgtgtctcc taccctgatc cctgggttca actctgctcc cagagcatga
     1741 agcctctcca ccagcaccag ccaccaacct gcaaacctag ggaagattga cagaattccc
     1801 agcctttccc agctccccct gcccatgtcc caggactccc agccttggtt ctctgccccc
     1861 gtgtcttttc aaacccacat cctaaatcca tctcctatcc gagtccccca gttccccctg
     1921 tcaaccctga ttcccctgat ctagcacccc ctctgcaggc gctgcgcccc tcatcctgtc
     1981 tcggattgtg ggaggctggg agtgcgagaa gcattcccaa ccctggcagg tgcttgtggc
     2041 ctctcgtggc agggcagtct gcggcggtgt tctggtgcac ccccagtggg tcctcacagc
     2101 tgcccactgc atcaggaagt gagtaggggc ctggggtctg gggagcaggt gtctgtgtcc
     2161 cagaggaata acagctgggc attttcccca ggataacctc taaggccagc cttgggactg
     2221 ggggagagag ggaaagttct ggttcaggtc acatggggag gcagggttgg ggctggacca
     2281 ccctccccat ggctgcctgg gtctccatct gtgtccctct atgtctcttt gtgtcgcttt
     2341 cattatgtct cttggtaact ggcttcggtt gtgtctctcc gtgtgactat tttgttctct
     2401 ctctccctct cttctctgtc ttcagtctcc atatctcccc ctctctctgt ccttctctgg
     2461 tccctctcta gccagtgtgt ctcaccctgt atctctctgc caggctctgt ctctcggtct
     2521 ctgtctcacc tgtgccttct ccctactgaa cacacgcacg ggatgggcct ggggggaccc
     2581 tgagaaaagg aagggctttg gctgggcgcg gtggctcaca cctgtaatcc cagcactttg
     2641 ggaggccaag gcaggtagat cacctgaggt caggagttcg agaccagcct ggccaactgg
     2701 tgaaacccca tctctactaa aaatacaaaa aattagccag gcgtggtggc gcatgcctgt
     2761 agtcccagct actcaggagg ctgagggagg agaattgctt gaacctggga ggttgaggtt
     2821 gcagtgagcc gagaccgtgc cactgcactc cagcctgggt gacagagtga gactccgcct
     2881 caaaaaaaaa aaaaaaaaaa aaaaaaaaaa agaaaagaaa agaaaagaaa aggaatcttt
     2941 tatccctgat gtgtgtgggt atgagggtat gagagggccc ctctcactcc attccttctc
     3001 caggacatcc ctccactctt gggagacaca gagaagggct ggttccagct ggagctggga
     3061 ggggcaattg agggaggagg aaggagaagg gggaaggaaa acagggtatg ggggaaagga
     3121 ccctggggag cgaagtggag gatacaacct tgggcctgca ggccaggcta cctacccact
     3181 tggaaaccca cgccaaagcc gcatctacag ctgagccact ctgaggcctc ccctccccgg
     3241 cggtccccac tcagctccaa agtctctctc ccttttctct cccacacttt atcatccccc
     3301 ggattcctct ctacttggtt ctcattcttc ctttgacttc ctgcttccct ttctcattca
     3361 tctgtttctc actttctgcc tggttttgtt cttctctctc tctttctctg gcccatgtct
     3421 gtttctctat gtttctgtct tttctttctc atcctgtgta ttttcggctc accttgtttg
     3481 tcactgttct cccctctgcc ctttcattct ctctgtcctt ttaccctctt cctttttccc
     3541 ttggtttctc tcagtttctg tatctgccct tcaccctctc acactgctgt ttcccaactc
     3601 gttgtctgta tttttggcct gaactgtgtc ttccccaacc ctgtgttttt ctcactgttt
     3661 ctttttctct tttggagcct cctccttgct cctctgtccc ttctctcttt ccttatcatc
     3721 ctcgctcctc attcctgcgt ctgcttcctc cccagcaaaa gcgtgatctt gctgggtcgg
     3781 cacagcctgt ttcatcctga agacacaggc caggtatttc aggtcagcca cagcttccca
     3841 cacccgctct acgatatgag cctcctgaag aatcgattcc tcaggccagg tgatgactcc
     3901 agccacgacc tcatgctgct ccgcctgtca gagcctgccg agctcacgga tgctgtgaag
     3961 gtcatggacc tgcccaccca ggagccagca ctggggacca cctgctacgc ctcaggctgg
     4021 ggcagcattg aaccagagga gtgtacgcct gggccagatg gtgcagccgg gagcccagat
     4081 gcctgggtct gagggaggag gggacaggac tcctgggtct gagggaggag ggccaaggaa
     4141 ccaggtgggg tccagcccac aacagtgttt ttgcctggcc cgtagtcttg accccaaaga
     4201 aacttcagtg tgtggacctc catgttattt ccaatgacgt gtgtgcgcaa gttcaccctc
     4261 agaaggtgac caagttcatg ctgtgtgctg gacgctggac agggggcaaa agcacctgct
     4321 cggtgagtca tccctactcc caagatcttg aggggaaagg tgagtgggga ccttaattct
     4381 gggctggggt ctagaagcca acaaggcgtc tgcctcccct gctccccagc tgtagccatg
     4441 ccacctcccc gtgtctcatc tcattccctc cttccctctt ctttgactcc ctcaaggcaa
     4501 taggttattc ttacagcaca actcatctgt tcctgcgttc agcacacggt tactaggcac
     4561 ctgctatgca cccagcactg ccctagagcc tgggacatag cagtgaacag acagagagca
     4621 gcccctccct tctgtagccc ccaagccagt gaggggcaca ggcaggaaca gggaccacaa
     4681 cacagaaaag ctggagggtg tcaggaggtg atcaggctct cggggaggga gaaggggtgg
     4741 ggagtgtgac tgggaggaga catcctgcag aaggtgggag tgagcaaaca cctgccgcag
     4801 gggaggggag ggccctgcgg cacctggggg agcagaggga acagcatctg gccaggcctg
     4861 ggaggagggg cctagagggc gtcaggagca gagaggaggt tgcctggctg gagtgaagga
     4921 tcggggcagg gtgcgagagg gaagaaagga cccctcctgc agggcctcac ctgggccaca
     4981 ggaggacact gcttttcctc tgaggagtca ggaactgtgg atggtgctgg acagaagcag
     5041 gacagggcct ggctcaggtg tccagaggct gccgctggcc tccctatggg atcagactgc
     5101 agggagggag ggcagcaggg atgtggaggg agtgatgatg gggctgacct gggggtggct
     5161 ccaggcattg tccccacctg ggcccttacc cagcctccct cacaggctcc tggccctcag
     5221 tctctcccct ccactccatt ctccacctac ccacagtggg tcattctgat caccgaactg
     5281 accatgccag ccctgccgat ggtcctccat ggctccctag tgccctggag aggaggtgtc
     5341 tagtcagaga gtagtcctgg aaggtggcct ctgtgaggag ccacggggac agcatcctgc
     5401 agatggtcct ggcccttgtc ccaccgacct gtctacaagg actgtcctcg tggaccctcc
     5461 cctctgcaca ggagctggac cctgaagtcc cttccctacc ggccaggact ggagccccta
     5521 cccctctgtt ggaatccctg cccaccttct tctggaagtc ggctctggag acatttctct
     5581 cttcttccaa agctgggaac tgctatctgt tatctgcctg tccaggtctg aaagatagga
     5641 ttgcccaggc agaaactggg actgacctat ctcactctct ccctgctttt acccttaggg
     5701 tgattctggg ggcccacttg tctgtaatgg tgtgcttcaa ggtatcacgt catggggcag
     5761 tgaaccatgt gccctgcccg aaaggccttc cctgtacacc aaggtggtgc attaccggaa
     5821 gtggatcaag gacaccatcg tggccaaccc ctgagcaccc ctatcaactc cctattgtag
     5881 taaacttgga accttggaaa tgaccaggcc aagactcaag cctccccagt tctactgacc
     5941 tttgtcctta ggtgtgaggt ccagggttgc taggaaaaga aatcagcaga cacaggtgta
     6001 gaccagagtg tttcttaaat ggtgtaattt tgtcctctct gtgtcctggg gaatactggc
     6061 catgcctgga gacatatcac tcaatttctc tgaggacaca gataggatgg ggtgtctgtg
     6121 ttatttgtgg gatacagaga tgaaagaggg gtgggatcca cactgagaga gtggagagtg
     6181 acatgtgctg gacactgtcc atgaagcact gagcagaagc tggaggcaca acgcaccaga
     6241 cactcacagc aaggatggag ctgaaaacat aacccactct gtcctggagg cactgggaag
     6301 cctagagaag gctgtgagcc aaggagggag ggtcttcctt tggcatggga tggggatgaa
     6361 gtaaggagag ggactggacc ccctggaagc tgattcacta tggggggagg tgtattgaag
     6421 tcctccagac aaccctcaga tttgatgatt tcctagtaga actcacagaa ataaagagct
     6481 cttatactgt ggtttattct ggtttgttac attgacagga gacacactga aatcagcaaa
     6541 ggaaacaggc atctaagtgg ggatgtgaag aaaacaggga aaatctttca gttgttttct
     6601 cccagtgggg tgttgtggac agcacttaaa tcacacagaa gtgatgtgtg accttgtgta
     6661 tgaagtattt ccaactaagg aagctcacct gagccttagt gtccagagtt cttattgggg
     6721 gtctgtagga taggcatggg gtactggaat agctgacctt aacttctcag acctgaggtt
     6781 cccaagagtt caagcagata cagcatggcc tagagcctca gatgtacaaa aacaggcatt
     6841 catcatgaat cgcactgtta gcatgaatca tctggcacgg cccaaggccc caggtatacc
     6901 aaggcacttg ggccgaatgt tccaagggat taaatgtcat ctcccaggag ttattcaagg
     6961 gtgagccctg tacttggaac gttcaggctt tgagcagtgc agggctgctg agtcaacctt
     7021 ttactgtaca ggggggtgag ggaaagggag aagatgagga aaccgcctag ggatctggtt
     7081 ctgtcttgtg gccgagtgga ccatggggct atcccaagaa ggaggaattc
//
LOCUS       HUMRPS24A     659 bp ss-mRNA            PRI       15-MAR-1990
DEFINITION  Human ribosomal protein S24 mRNA.
ACCESSION   M31520
KEYWORDS    ribosomal protein S24.
SOURCE      Human male lymphoblast from lymphoid tumor cell line HT1080 (ATCC
            121) cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 133 to 641)
  AUTHORS   Brown,S.J., Jewell,A., Maki,C.G. and Roufa,D.J.
  TITLE     A cDNA encoding human ribosomal protein S24
  JOURNAL   Unpublished (1990) Kansas St. Univ. Manhattan, KS 66506
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 659)
  AUTHORS   Brown,S.J. and Roufa,D.J.
  JOURNAL   Unpublished (1990) Kansas St. Univ. Manhattan, KS 66506
  STANDARD  full staff_entry
COMMENT     Authorin Submission [2] kindly submitted by Roufa,D.J.,
            22-JAN-1990.
FEATURES       from  to/span     description
    pept    /   132       51 (c) unknown protein (AA at 131)
                                 /evidence=EXPERIMENTAL
    pept        143      544     ribosomal protein S24 /function="component of
                                 human 40S ribosomal subunit"
                                 /evidence=EXPERIMENTAL /gene="HUMRPS24"
                                 /standard_name="rpS24"
    mRNA        132        1 (c) unknown mRNA /evidence=EXPERIMENTAL /note="800
                                 nt. anonymous human transcript detected on
                                 Northern blots of HT1080 and HeLa cell
                                 cytoplasmic mRNAs" [2]
    signal       25       20 (c) poly-A signal /note="detected by pattern only"
                                 [2]
    signal      598      603     poly-A signal /evidence=EXPERIMENTAL [1]
    recomb      133      142     misc_recomb site /note="site of artificial
                                 recombination between 2 cDNAs which occurred
                                 during reverse transcription or recombinant
                                 DNA cloning operations" /evidence=EXPERIMENTAL
                                 [2]
BASE COUNT      218 a    144 c    144 g    153 t
ORIGIN      
        1 gggtttatcg gaaaatgtgt ttattgagat ggtttcccac tcatcttgac tcagagtgct
       61 tttagtgctg cttcctcctg aaggaacatc cttctgtaag ccttgctttt cctccttggc
      121 tgtctgaaga tagatcgcca tcatgaacga caccgtaact atccgcacta gaaagttcat
      181 gaccaaccga ctacttcaga ggaaacaaat ggtcattgat gtccttcacc ccgggaaggc
      241 gacagtgcct aagacagaaa ttcgggaaaa actagccaaa atgtacaaga ccacaccgga
      301 tgtcatcttt gtatttggat tcagaactca ttttggtggt ggcaagacaa ctggctttgg
      361 catgatttat gattccctgg attatgcaaa gaaaaatgaa cccaaacata gacttgcaag
      421 acatggcctg tatgagaaga aaaagacctc aagaaagcaa cgaaaggaac gcaagaacag
      481 aatgaagaaa gtcaggggga ctgcaaaggc caatgttggt gctggcaaaa agccgaagga
      541 gtaaaggtgc tgcaatgatg ttagctgtgg ccactgtgga tttttcgcaa gaacattaat
      601 aaactaaaaa cttcatgtgt aaaaaaaaaa aaaaaaaaaa accccccccc ccccccccc
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       MUSMK1A      2222 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse intronless potassium channel gene MK1.
ACCESSION   M30439
KEYWORDS    potassium channel protein.
SOURCE      Mouse (strain AKR) cell line L47.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
            Ghanshani,S., Tempel,B.L. and Gutman,G.A.
  TITLE     A family of three mouse potassium channel genes with intronless
            coding regions
  JOURNAL   Science 247, 943-975 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2222)
  AUTHORS   Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
            Ghanshani,S., Tempel,B.L. and Gutman,G.A.
  JOURNAL   Unpublished (1990) Univ. of Calif. Irvine, CA 92717
  STANDARD  full staff_entry
COMMENT     Authorin Submission [2] kindly submitted by Gutman,G.A.,
            12-DEC-1989
            
            MK1, MK2 and MK3 represent three members of a family of mouse genes
            encoding potassium channel proteins, related to the Drosophila
            shaker locus.  Each mouse protein is encoded by a single,
            uninterrupted exon, although one (or more) introns may be present
            in the 5' untranslated region (MK1, MK2).  The sequence of MK1 is
            included within, and identical to, the published sequence of MBK1
            cDNA (Tempel, Jan & Jan, Nature 232:837-9, 1988) with which it
            overlaps, except for two additional G's, one each at MK1 positions
            162 and 442.  MK1 closely resemble the published rat cDNA sequence
            RCK1 (Baumann et al., EMBO J. 7:2457, 1988).
FEATURES       from  to/span     description
    pept        584     2071     potassium channel protein
    IVS     <     1       52     intron
BASE COUNT      517 a    613 c    598 g    494 t
ORIGIN      
        1 atcagaatgc ttctgatttt ctaccctcgt atcacttttt atttctctgc agcgtccatc
       61 gatcgccccg gtgggagctt agaaggcggc aggcgaagag gggtaggagg gggggagagc
      121 caaggagcag caaagtgggt gacaggcctg gggagctgct ggagttggca ctgcaccgtg
      181 tcctagaagg gctgcaggag gggaggggag ccggggcgag ccccggaagc gatggcaggt
      241 gtgcattaag gtgatcagag caggaagaaa aacccattca ccaacacaaa aacagagcac
      301 ccccccccaa aaaatacccc aagagtagca aaaaagcttc actgggggcc ctcaggcgag
      361 aggggtttcc aaattgcgta aaaggcagag tggaggaggg gaggtgataa ctagcaaagt
      421 tgcagacttc tgaaccttct ggctctgaag cccctccctg tgagcgtggg gaagactcac
      481 tttccggtgg gggggccgct tgggtccccc ccacccctac tccctcgctc cttcacaacc
      541 cgggctctct cctggcctcc tacccctgca ccctgctttc atcatgacgg tgatgtcggg
      601 ggagaatgcg gacgaggctt cgaccgctcc aggtcacccc caggatggca gctacccgag
      661 gcaggcggac cacgacgacc acgaatgctg cgagcgcgta gtaatcaaca tctccgggct
      721 gcgcttcgaa acgcagctca agactctggc acagttcccc aacacgctgc tgggcaaccc
      781 gaagaaacgc atgcgctact ttgaccccct gaggaacgag tacttctttg accgcaaccg
      841 gcccagcttc gatgccatcc tttattacta ccagtccggg ggccgcctgc gcaggccggt
      901 caacgtgccc ctggacatgt tctccgagga gattaaattt tacgagttgg gcgaggaagc
      961 catggagaag ttccgggaag atgagggctt catcaaggaa gaggagcgcc ccctacccga
     1021 gaaggagtac cagcgccagg tgtggctgct ctttgagtat ccggagagct caggacctgc
     1081 ccgggttatt gccattgtgt cggtcatggt catcctcatc tccatagtca tcttttgcct
     1141 ggagactctc cctgagctga aggacgacaa ggacttcacg ggcaccatcc accgcatcga
     1201 caacaccaca gtcatctata cttccaacat cttcacagac cctttcttca ttgtggaaac
     1261 cttgtgtatc atctggttct cttttgagct ggtggtgcgc ttcttcgcct gccccagcaa
     1321 gacagacttc tttaagaaca tcatgaactt catcgacatt gtggccatca tcccttattt
     1381 cattaccctg ggcacggaga tagctgagca ggagggaaat cagaagggcg agcaggccac
     1441 ttccctggcc atcctcaggg tcatccgctt ggtaagggtg ttcagaatct tcaaactctc
     1501 ccgccactcc aagggccttc agatcctggg ccagaccctc aaagctagta tgagggagtt
     1561 agggctgctc atctttttcc tcttcattgg ggtcatactg ttttctagcg cagtgtactt
     1621 tgcggaggcg gaagaagctg agtcgcactt ctccagtatc cccgatgctt tctggtgggc
     1681 ggtggtgtcc atgaccactg tgggatacgg tgacatgtac cctgtgacaa ttggaggcaa
     1741 gatcgtgggc tccttgtgtg ccatcgctgg tgtgctgaca attgccctgc ccgtacctgt
     1801 cattgtgtcc aatttcaact atttctacca ccgagaaact gagggggaag agcaggctca
     1861 gttgctccat gttagttctc ctaacttagc ctctgacagt gacctcagcc gccgcagctc
     1921 ctctactatc agcaagtctg agtacatgga gatcgaagag gatatgaaca atagcatagc
     1981 ccattacaga caggctaata tcagaactgg taactgcacc acagctgatc aaaactgcgt
     2041 taataagagc aagctcctga ccgatgttta aaaagcaaca ggcaagccaa caaaagcccc
     2101 aaacaaaccc cttggcgact cctgtcccgc tctgtagata ctttactaag cagtagtctt
     2161 tgaatgcttt attttgctgg caatgcattg ttgcattgtg aatttgggga gtggcgaacc
     2221 tg
//
LOCUS       MUSMK2A      3383 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse intronless potassium channel gene MK2.
ACCESSION   M30440
KEYWORDS    potassium channel protein.
SOURCE      Mouse (strain AKR) cell line L47.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
            Ghanshani,S., Tempel,B.L. and Gutman,G.A.
  TITLE     A family of three mouse potassium channel genes with intronless
            coding regions
  JOURNAL   Science 247, 943-975 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3383)
  AUTHORS   Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
            Ghanshani,S., Tempel,B.L. and Gutman,G.A.
  JOURNAL   Unpublished (1990) Univ. of Calif. Irvine, CA 92717
  STANDARD  full staff_entry
COMMENT     Authorin Submission [2] kindly submitted by Gutman,G.A.,
            05-DEC-1989
            
            MK1, MK2 and MK3 represent three members of a family of mouse genes
            encoding potassium channel proteins, related to the Drosophila
            Shaker locus.  Each mouse protein is encoded by a single,
            uninterrupted exon, although one (or more) introns may be present
            in the 5' untranslated region (MK1, MK2).  MK2 closely resembles
            the published rat cDNA sequence RBK2 (McKinnon, J. Biol. Chem.
            264:8230, 1989), and the deduced amino acid sequence of the rat
            protein RCK5 (Stuhmer et al., EMBO J. 8:3235, 1989).
FEATURES       from  to/span     description
    pept       1118     2617     potassium channel protein
    IVS         527      959     intron /note="putative intron by comparison
                                 with RBK2 cDNAs"
BASE COUNT      818 a    881 c    791 g    881 t     12 others
ORIGIN      
        1 cctgcaggca tcgaagctta tggttatcca ggagaggtta ggaaaaggct cctgtggtgg
       61 atctcattca caagtgtcat ggaaagtaga tgtgggccnn nntnnacttc ttgccctgag
      121 gggatctccc agagccctta aaagtcttng tctcagctgc ctggaancct tggtcccacc
      181 ttgctgctgt gttttccagg caacgtcaca cctccggagg acagccagga ggactcaggc
      241 ttttgctgag ctgtgcatct tgcctccttc cttccaaagg cttcagaccc aagctccact
      301 ctcctcctca agtcgtgcgc tgtgcccctg gctaagcatc tcactgtccc tcgagcccca
      361 tccagtccta gagcacacct ccaggacctg ccctcctgct cagaatgact caccattatt
      421 tccagctcaa gtgagaagat gtgacgggga ctgagctgcc tatttgtgta tctgtgccat
      481 cagatctgct cttcgaagaa gagaacctca gctcctgcct ggccaggtaa atcaaacaat
      541 cacttagtag cagtcatata gaggcaatgg gggtggcatc agtgaccaag gnncaggtcc
      601 caaacaccct accatgaata cntcctttgg tgcaggtatt gccaaaacat tagcacagtg
      661 cctggcagag ggctcccttc tccactcctc ctgtgcagct atgtccccag catgtgtgtg
      721 gtacccaaga tcctgtctca gccncctgcc cctagcgcag gccattctat agcttccttg
      781 aaagattgtt tgttccctga accctactga cccctagaca agtgacaagg tgataatagt
      841 tttagtggca gaagtgatgg tacagtcatg tttctgaact cttggggaca tcaagtgcac
      901 ctcctgtcag gttgccaatg actggtgatc tcatgtcctt acctgcctct gtctttcagc
      961 cgagtggggc tgtgtctcca actcacaaag ggctctgggg ctttcctgca gaagctcaga
     1021 cttatgcttc tgtgtttcct gggccccagt tatcagcacc gagttccctg ccgcagctgg
     1081 aaggcttaaa gtaaagcaca cttctcaagc cccaattatg acagtggcta ccggagaccc
     1141 agtggacgag gctgctgccc tccctgggca cccacaagac acctatgacc cagaggcaga
     1201 ccatgaatgc tgtgagagag tggtcatcaa catctcaggc ctgcggttcg aaactcagct
     1261 aaagacctta gcccagttcc cagagaccct cttaggggac cccaagaaac ggatgaggta
     1321 cttcgatccc ctccgaaatg agtacttttt tgatcgcaac cgccctagct ttgatgccat
     1381 tttgtactac tatcagtctg ggggcaggtt gaggcgacct gtgaacgtgc ccttagatat
     1441 cttctcggaa gaaatccggt tttatgagct aggagaagaa gcaatggaga tgtttcggga
     1501 ggatgaaggc tacatcaagg aagaagagcg tcctctgcct gaaaatgagt ttcagagaca
     1561 ggtgtggctt ctctttgaat accctgagag ctcagggcct gccaggatta tagccattgt
     1621 atctgtgatg gtcattctga tctccatcgt cagcttctgt ctggaaacct tgcccatctt
     1681 ccgggatgag aatgaggaca tgcatggtgg cggggtgacc ttccacacct attccaacag
     1741 caccatcggg taccagcagt ccacctcctt caccgaccct ttcttcattg tagagactct
     1801 ctgcatcatc tggttctcct ttgagtttct ggttagattc tttgcctgtc ccagcaaagc
     1861 tggcttcttc accaacatca tgaacatcat tgacattgtg gctatcatcc cttactttat
     1921 caccctgggg acagagttag ctgagaagcc agaggatgcc cagcaaggcc agcaggccat
     1981 gtcactggcc attctccgtg tcatccggtt ggtaagagtc tttaggattt tcaagttgtc
     2041 cagacactcc aaaggtctac agattctagg tcagaccctc aaagctagca tgagggaatt
     2101 gggcctcctg atattcttcc tcttcattgg ggtcatcctc ttctctagtg ctgtctattt
     2161 tgcagaagct gatgagagag attcccagtt ccccagcatc ccggatgctt tctggtgggc
     2221 agtcgtctcc atgacaactg taggctatgg agacatggtt ccaactacca ttggggggaa
     2281 gatagtgggt tctctgtgtg caattgcagg tgtgttaacc attgccttac cagtccctgt
     2341 catagtgtct aatttcaact acttctacca ccgggagaca gagggagagg agcaggccca
     2401 gtacttgcaa gtgacaagct gtccaaagat cccgtcctcc cctgacctaa agaaaagtag
     2461 aagtgcctct accataagta agtctgatta catggagata caggagggag ttaacaacag
     2521 caatgaggac tttagagagg agaacttaaa aacagccaac tgtaccttgg ctaacacaaa
     2581 ctatgtgaat attaccaaaa tgttaactga tgtctgattg aagcctacta atgtactcac
     2641 agctcaacag gactgatgca gatgttgcat aatagcctgc attgtagtca gtgttctaca
     2701 gtgttctaca gcatctggtt ctgcatggaa agcaatagtt cgtgcaagtg acttttgatc
     2761 ttttgatttt tgatttagaa cacagaatat ttatcatggc tttcatgaaa tcttcatcac
     2821 cgacttacag gtttccaaag aggagagtca cccatggagc cagcatctca gaaaggcaca
     2881 atgcagcccc ctcacaccca cacacaacct accatagcaa cactatccta tcttcctacc
     2941 taaacatgag cacacctggg aggtgcattc cctccacccc acccctacca ccctatttca
     3001 gcccatctgc cccttcacag aggaacatac cattgtggct tagttctaaa gtctggtgac
     3061 tactcaaagg tcattctcgg ttttgtgttg agaagaacac acagtcctgt gtgttggaac
     3121 tactttctgt gtcacaggct ggggtttgtg aattgcagtt gccaactaga tgctccggag
     3181 gcttatgttt cgtaacggaa aaatgctgca ttctgctttt tctctgcagt gtcgatgtga
     3241 gggaagccca ggggggtggg ggagggacag ttaatatgac caaaatatga attgccaagt
     3301 ttcacatttg ttcccttagg cctatgggga gaagctacct aattcaggga cttctcagct
     3361 tcagatacca ttttgcaagc ttg
//
LOCUS       MUSMK3A      1994 bp ds-DNA             ROD       15-MAR-1990
DEFINITION  Mouse intronless potassium channel gene MK3.
ACCESSION   M30441
KEYWORDS    potassium channel protein.
SOURCE      Mouse (strain AKR) cell line L47.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
            Ghanshani,S., Tempel,B. and Gutman,G.A.
  TITLE     A family of three mouse potassium channel genes with intronless
            coding regions
  JOURNAL   Science 247, 943-975 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1994)
  AUTHORS   Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A.,
            Ghanshani,S., Tempel,B.L. and Gutman,G.A.
  JOURNAL   Unpublished (1990) Univ. of Calif. Irvine, CA 92717
  STANDARD  full staff_entry
COMMENT     Authorin Submission [2] kindly submitted by Gutman,G.A.,
            05-DEC-1989
            
            MK1, MK2 and MK3 represent three members of a family of mouse genes
            encoding potassium channel proteins, related to the Drosophila
            shaker locus.  Each mouse protein is encoded by a single,
            uninterrupted exon, although one (or more) introns may be present
            in the 5' untranslated region (MK1, MK2).  MK3 closely resembles
            the published rat cDNA sequence RCK3 (Stuhmer et al., EMBO J.
            8:3235, 1989).
FEATURES       from  to/span     description
    pept        150     1736     potassium channel protein
BASE COUNT      394 a    612 c    534 g    454 t
ORIGIN      
        1 agccgccgct agggaaggaa agcaccgccg cctcccgcgc tcgaccgccg cagccctcca
       61 cccatcaccg cgcccaccct gcaccggacc ccgcaggagg cggcgcgcgc atcctgcaga
      121 gccccggcca cgccgagctg ccgccagaca tgaccgtggt gcccggggac cacctgctgg
      181 agccagaggc ggcgggaggc ggtggcgggg acccgcctca gggaggctgt ggcagtggcg
      241 gcggcggtgg cggctgcgac cgctacgagc cactgccacc cgcgctgccc gccgcgggcg
      301 agcaagattg ctgcggcgag cgtgtggtca tcaacatctc cgggctgcgc ttcgagacgc
      361 agctcaagac cctctgccag ttccccgaga cactgctggg cgaccccaag cggcgcatgc
      421 ggtactttga cccactccgc aatgagtact tcttcgaccg caaccgaccc agcttcgacg
      481 ccatcctcta ctactaccag tccgggggcc gcattcgccg gccggtcaac gtgcccatcg
      541 acatcttctc cgaggagatc cgcttttacc agctgggtga ggaggccatg gaaaagttcc
      601 gtgaggatga gggcttcctg cgggaggagg agcgacccct gccccgccgt gacttccagc
      661 gccaggtgtg gctgctcttc gaatatccgg agagctccgg gccggcccgg ggcattgcca
      721 ttgtgtcagt gctggtcatt ctcatctcca ttgtcatctt ctgcttggag acgcttcccg
      781 agtttcgcga tgagaaagac tatcccgcct ccccgtcgca ggacgtgttt gaggctgcca
      841 acaacagcac gtcgggggcc ccttctggag cctccagctt ctcggacccc ttcttcgtgg
      901 tggagacctt gtgcatcatc tggttctcct ttgagcttct ggtgcggttc tttgcttgcc
      961 ccagtaaagc caccttctcc agaaatatca tgaacttgat agacattgtg gccatcattc
     1021 cttattttat cactctgggc actgagctgg ctgaacgaca aggtaatggg cagcaggcca
     1081 tgtcgctggc catcctaaga gtcatccgcc tagtaagggt tttccgcatc ttcaagctct
     1141 cccgccattc taaggggctg cagatcctag gacagacgct gaaggcttcc atgcgggagc
     1201 tggggctgct catattcttc ctcttcattg gggtcatcct tttctccagt gcagcttact
     1261 ttgctgaggc agacgaccct tcttcgggtt ttaacagtat cccggatgcc ttctggtggg
     1321 cagtagtaac catgacaact gttggttatg gtgatatgca cccagtgacc ataggaggca
     1381 agattgtggg ctctctttgt gccatcgcag gtgtcttgac cattgcattg ccagttcctg
     1441 tgattgtttc caacttcaac tacttctacc accgggagac agaaggggaa gagcaagccc
     1501 agtacatgca cgtgggcagt tgccagcacc tctcctcttc agccgaggag ctccgaaaag
     1561 cccggagtaa ctccactctg agtaagtcgg agtatatggt gatcgaagag gggggtatga
     1621 accagagcgc cttcccgcag acccccttca aaacgggcaa ctccacagcc acttgcacca
     1681 cgaacaataa ccccaactcc tgtgtcaaca tcaagaagat attcactgat gtctaatata
     1741 tgatacggtt gccaattctg tgcccagtat tgtgtggaac atgccccctt ggtctgtgta
     1801 tgcccttgat ttatacattt ccagaccact catcaaggaa agtacaagaa gtgaggaagc
     1861 acacttcatt ctccctattg cttcatactg aaacaggtgc ctgtttttgc aagtgggctg
     1921 cattctctca gctctttttt tctctctctc cctgtctctt aattttgtga ccaacaaact
     1981 tacattaagc gtgg
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       YSCMTSPR1A     81 bp ds-DNA             ORG       21-FEB-1990
DEFINITION  S.cerevisiae mitochondrial DNA involved in the formation of a novel
            junction.
ACCESSION   M31839
KEYWORDS    .
SOURCE      S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 81)
  AUTHORS   Clark-Walker,G.D.
  TITLE     In vivo rearrangement of mitochondrial DNA in Saccharomyces
            cerevisiae
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    rpt          21       60     40 bp direct repeat
BASE COUNT       30 a     13 c     25 g     13 t
ORIGIN      
        1 gaaaccggga cctcggagac gtaatagggg gagggggtgg gtgataataa ccagaatatt
       61 caataaatac agagcacaca t
//
LOCUS       YSCMTSPR1B     81 bp ds-DNA             ORG       21-FEB-1990
DEFINITION  S.cerevisiae mitochondrial DNA involved in the formation of a novel
            junction.
ACCESSION   M31840
KEYWORDS    .
SOURCE      S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 81)
  AUTHORS   Clark-Walker,G.D.
  TITLE     In vivo rearrangement of mitochondrial DNA in Saccharomyces
            cerevisiae
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    rpt          21       60     40 bp direct repeat
BASE COUNT       35 a      7 c     18 g     21 t
ORIGIN      
        1 tatatatttc aaatatataa gtaatagggg gagggggtgg gtgataataa ccagaatatt
       61 aaataaatac agagcacaca t
//
LOCUS       YSCMTSPR1C     80 bp ds-DNA             ORG       21-FEB-1990
DEFINITION  S.cerevisiae mitochondrial DNA involved in the formation of a novel
            junction.
ACCESSION   M31841
KEYWORDS    .
SOURCE      S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 80)
  AUTHORS   Clark-Walker,G.D.
  TITLE     In vivo rearrangement of mitochondrial DNA in Saccharomyces
            cerevisiae
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    rpt          16       66     51 bp direct repeat
BASE COUNT       25 a     15 c     17 g     23 t
ORIGIN      
        1 atttattata atatttatat taatagttcc ggggcccggc cacgggagcc ggaaccccga
       61 aaggagtatt attaaacatt
//
LOCUS       YSCMTSPR1D     80 bp ds-DNA             ORG       21-FEB-1990
DEFINITION  S.cerevisiae mitochondrial DNA involved in the formation of a novel
            junction.
ACCESSION   M31842
KEYWORDS    .
SOURCE      S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 80)
  AUTHORS   Clark-Walker,G.D.
  TITLE     In vivo rearrangement of mitochondrial DNA in Saccharomyces
            cerevisiae
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    rpt          16       66     51 bp direct repeat
BASE COUNT       28 a     15 c     19 g     18 t
ORIGIN      
        1 ttataattca atttatatat taatagttcc ggggcccggc cacgggagcc ggaaccccga
       61 aaggaggaat aagataaata
//
LOCUS       YSCMTSPR1E     80 bp ds-DNA             ORG       21-FEB-1990
DEFINITION  S.cerevisiae mitochondrial DNA involved in the formation of a novel
            junction.
ACCESSION   M31843
KEYWORDS    .
SOURCE      S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 80)
  AUTHORS   Clark-Walker,G.D.
  TITLE     In vivo rearrangement of mitochondrial DNA in Saccharomyces
            cerevisiae
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    rpt          17       65     49 bp direct repeat
BASE COUNT       29 a     14 c     17 g     20 t
ORIGIN      
        1 tattattaaa tattatatag ttccggggcc cggccacggg agccggaacc ccgaaaggag
       61 aaatattaat atattataaa
//
LOCUS       YSCMTSPR1F     80 bp ds-DNA             ORG       21-FEB-1990
DEFINITION  S.cerevisiae mitochondrial DNA involved in the formation of a novel
            junction.
ACCESSION   M31844
KEYWORDS    .
SOURCE      S.cerevisiae (strain hfp 15.19S) mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 80)
  AUTHORS   Clark-Walker,G.D.
  TITLE     In vivo rearrangement of mitochondrial DNA in Saccharomyces
            cerevisiae
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    rpt          17       65     49 bp direct repeat
BASE COUNT       32 a     14 c     17 g     17 t
ORIGIN      
        1 atattattta ttattaatag ttccggggcc cggccacggg agccggaacc ccgaaaggag
       61 aaataaataa aataaaaaat
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       HUMBAT2A     6704 bp ss-mRNA            PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 2 (BAT2) mRNA, complete cds.
ACCESSION   M33509 M31293
KEYWORDS    class III gene; major histocompatibility complex;
            proline-rich protein.
SOURCE      Human T-cell line HPB-All, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 6704)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex encodes
            large proline-rich proteins with multiple repeated motifs and a
            single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept        102     6530     HLA-B-associated transcript 2 (BAT2)
    mRNA          1     6704     BAT2 mRNA
    signal     6692     6697     poly-A signal
BASE COUNT     1435 a   2224 c   1897 g   1148 t
ORIGIN      Chromosome 6p21.3.
        1 cctaggcccg ggtcccggat ccccgcgcac ccggccaggc tctggcacgt tttgggggag
       61 gtgcctgcag gacccaacat actcaatgag cttccagcgc aatgtccgat cgctcggggc
      121 cgactgccaa gggaaaggat ggaaagaagt attcctcgct caacctgttt gatacgtata
      181 agggcaagtc cttagagatc cagaaacccg cctgttgccc ctcgccatgg cctgcagagt
      241 ctcgggaaag ttgccattgc ccggcgtatc gacctccagc caaccttcca agcctgaaag
      301 ccgagaacaa aggcaatgac cccaatgtct cactagtgcc aaaagacgga acaggatggg
      361 caagcaaaca ggagcagtcc gaccccaaga gttccgatgc ctcaaccgct cagccgccgg
      421 aatcgcagcc actgccggct tcacagacgc ctgcctccaa ccagccgaaa cgacccccag
      481 cagcccccga gaacactcct ttggttccaa gcggggtaaa gtcctgggca caagccagcg
      541 tcacccatgg agcacatgga gatggtggaa gggcatcaag cctactgtca cgattctctc
      601 gagaggaatt tccgaccctg caggcggctg gcgaccagga caaggctgcc aaggaaaggg
      661 agtctgccga acagtcgtct gggcccggac caagcctccg cccccaaaat tctacaactt
      721 ggagggacgg aggtgggcgt ggccctgatg agctggaggg cccggactcc aaacttcatc
      781 atggtcatga tccccggggt gggctacagc cttcaggccc accccagttc cctccctacc
      841 gcggaatgat gccgcctttc atgtatcccc catatctccc gttccctccg ccctatggac
      901 cccaggggcc ttaccgatac cccactcctg atgggcccag ccgttttccc cgtgtggcgg
      961 gcccccgagg ctcagggcca ccaatgcgct tagtagagcc tgtgggtcgt ccctctattc
     1021 tcaaagagga taatctcaaa gagtttgatc agttggatca ggagaatgat gatggttggg
     1081 caggggccca tgaagaggtt gactacactg aaaagctcaa gttcagcgat gaggaagatg
     1141 ggcgagactc tgatgaggag ggagctgagg gccacaggga ttcccaatca gcttctggtg
     1201 aggaacggcc ccctgaagca gatggcaaaa agggcaactc ccccaacagc gaaccgccca
     1261 ctcctaagac ggcctgggca gaaacctctc ggcctccaga gacagagccg ggacctcctg
     1321 ccccaaagcc tcccctaccc cctggggact acccagatcg tgggggtcct ccctgcaagc
     1381 ccccagcacc tgaagatgag gatgaggcat ggcggcagcg acgaaagcag tcgtcatctg
     1441 agatttccct ggcagtggag cgggcccggc gacggcgaga agaagaggag cggcgcatgc
     1501 aagaagagcg ccgggcagcc tgtgctgaga agctcaagcg actcgatgaa aagtttgggg
     1561 cacctgacaa gcggctcaaa gcagagcctg ctgccccacc tgctgcccct tctaccccag
     1621 ccccaccacc tgcagtccct aaagaactcc ctgcacctcc agctccacct ccagcatcag
     1681 ccccaacacc agagacagaa cctgaagagc cagcacaggc ccctcctgcc caatctactc
     1741 ctactccagg tgtggctgcg gctcccactc tggtgagtgg tggtggcagt accagtagca
     1801 ccagcagtgg cagcttcgaa gccagcccag tggaaccaca actgccctca aaagagggtc
     1861 ctgaaccacc agaagaggtt cctcctccta ccacaccccc agttccaaag gtggaaccca
     1921 agggtgatgg gattggtccc acccgccagc cccctagtca gggcttgggc taccccaaat
     1981 atcagaagtc gttgcctcct cgtttccagc ggcagcagca ggagcagctc ctgaagcagc
     2041 agcagcagca ccagtggcag cagcatcaac agggctctgc ccctcctacc ccagtgcccc
     2101 catcaccacc acagcctgtg accctggggg ctgtgccagc tccacaggct ccacccccgc
     2161 cccccaaggc cctgtaccca ggtgctctgg gccggccccc acccatgccc ccaatgaact
     2221 ttgatccccg atggatgatg attcctcctt atgtggaccc ccggctcctc cagggtcgtc
     2281 cccctctaga gttctaccct cctggtgtgc atccctctgg cctagttccc cgagagcgtt
     2341 cagacagtct ggggctcagc tcagagccat ttgaccgtca tgcacctgct atgttacggg
     2401 aacggggcac tccaccggtg gatccaaagt tggcctgggt aggagatgtc ttcaccgcca
     2461 cacccgctga accccgccca cttacctcac ctctgcgcca ggctgcggat gaggatgaca
     2521 aggggatgag gagcgagact cctccagtac ctcccccacc accctatctg gccagttatc
     2581 caggctttcc tgagaatgga gcccctgggc ccccaatctc tcgctttcct ctggaggaac
     2641 cagggccccg tccactcccc tggcccccag gcagtgatga agtggccaag atacaaactc
     2701 caccacccaa gaaggagccc cctaaggagg agactgcaca gctgacgggg ccagaagcag
     2761 gccgaaagct gcccgcgagt cggagtggag caggcccccc accaccacgc agagagagtc
     2821 gcacagagac ccgctggggc cctcgtccag ggagcagtcg tcgtggaatc cctccagagg
     2881 agccaggggc cccaccccgc cgggctgggc ctataaagaa acctccacca cctacaaaag
     2941 tagaagagct gcctcccaag cccctcgaac agggggatga aacccccaaa cccccaaagc
     3001 cagacccact caagataacc aaggggaagc tagggggccc caaggagacc ccacccaatg
     3061 gaaatctttc ccctgcccca aggcttcgga gggactattc gtatgaaaga gtgggtccta
     3121 cctcttgccg gggtcggggc cgaggcgagt attttgccag agggaggggt tttcggggga
     3181 cctatggggg acgagggcgg ggaggccaag cgaattccgc agttaccgag agtttcgagg
     3241 agatgatggg cgtggaggtg ggacaggggg accaaaccac cctcctgctc cccgaggccg
     3301 ccatgccagc gagacacgga gcgagggttc agagtatgag gaaatcccca agcggtgccg
     3361 gcagcggggc tcagaaacag gcagcgagac ccatgagagt gatctggctc cttcagacaa
     3421 ggaggctccc acacccaagg agggaacact cacccaggtc ctctcgctcc cccaccacca
     3481 ggagccccac ccttcaccga gcgccagccc gcttcacgtg cccgggggtc ggcgagtctt
     3541 cactcccaga gggtgccatc tcgccggggc cgaggaggag ggaggcccct cctcaagttt
     3601 gcccaggctg gagccctcca gccaagtctc tggctcccaa gaaacctccc acaggccctt
     3661 tgccaccaag taaggagcct ttgaaagaga agttgatccc agggcctctg tcccctgtgg
     3721 cgcgcggagg cagcaatgga ggtagcaatg tgggcatgga agatggggag cgaccccgaa
     3781 ggaggcgaca tgggagggct cagcagcagg ataaaccgcc tcgtttccgg aggctgaagc
     3841 aggaacggga gaatgccgca agggggtctg agggcaagcc ctccctaacc cttccagcct
     3901 ccgctcctgg acctgaggag gccctcacaa cagtcacagt ggccccagca cctccgcggg
     3961 cagctgccaa gtctcctgat ctgtcaaacc agaactcaga ccaagccaat gaggaatggg
     4021 agactgcatc agagagcagt gacttcacca gtgagcgccg aggggacaaa gaggcacccc
     4081 caccagtact gctgacaccc aaggctgtgg gaactcctgg gggaggtgga ggtggagccg
     4141 taccaggtat ttcagccatg tcccgcggag atctgagcca gagagccaag gatttgagta
     4201 aacggagctt ctcaagtcag cggccaggca tggaacggca gaatcggcgc cctggcccag
     4261 ggggcaaggc tggcagcagt ggcagcagca gtggaggagg cggtgggggt cctggaggaa
     4321 ggaccgggcc aggacgaggc gacaagagga gctggccctc tcccaagaac cgaagtcgtc
     4381 ctccagagga gcgtcccccg gggcttcccc tgcctccccc acctcccagc agttctgctg
     4441 tcttccgcct ggaccaagtt atccacagca accctgctgg catccaacag gctctggccc
     4501 agcttagtag ccgtcaaggg agtgtaactg caccaggggg tcatccaagg cacaagcctg
     4561 ggcctcccca agcccctcag ggcccctctc ctaggccccc aacccgatac gagccccaga
     4621 gggtcaacag cggcctcagt tctgaccccc actttgagga gccggggcca atggtgagag
     4681 gggtgggtgg gactcctcgg gactctgccg gggttagtcc ctttccccct aaacgtcggg
     4741 agcggcctcc cagaaaacca gagctgctac aggaggaatc tttgccacct cctcatagct
     4801 ctggattctt gggctctaag cctgagggcc caggccctca ggcagagtcc agagatacag
     4861 gcacagaggc cctgacccct cacatctgga accgtttaca tactgccact agccgaaaga
     4921 gttaccggcc cacgtccatg gagccttgga tggagcccct gagtcctttt gaggatgtgg
     4981 ctggcacaga aatgagtcag tctgacagtg gggtggacct gagtggggat tctcaggtgt
     5041 catcaggtcc ctgcagccag cgaagttccc ctgatggagg actcaagggg gcagcagagg
     5101 gaccccccaa gaggcctgga ggctcctcac ccctgaatgc tgttccttgt gagggtccac
     5161 ctggctctga acctcctagg agaccaccac ctgcccccca cgatggggac agaaaggagc
     5221 tgccccggga gcagcctctg ccccctggcc ccattggcac agaacgatca cagcgtacag
     5281 accgaggcac agagcctggc cccattcggc catcccatcg acctggtccc ccagtccagt
     5341 ttggcactag tgacaaggac tcagacttac gcctagtggt aggagacagc ttgaaagcag
     5401 agaaggagct aacagcatca gtcactgagg ccattcctgt atcacgagac tgggagctgc
     5461 ttcccagtgc tgctgcctct gctgagccac aatccaagaa cctggattct gggcactgtg
     5521 tcccggagcc cagctcctca ggccagcgcc tgtatcctga ggttttctat ggcagtgctg
     5581 ggccttccag ttctcagatc tctgggggga gccatggact ctcaattaca tccaaacagt
     5641 ggaggcttcg ccctgggaca ccctcactgc acccttacag atcacagccc ctatacctac
     5701 ccccgggccc agcccctccc tcagcactgc tctctggggt agctctcaag ggccagtttc
     5761 tggatttctc cacaatgcaa gctacagagc tggggaagtt gccggctgga ggagttctct
     5821 accctccacc ttccttcctc tactctccgg ctttctgccc cagtcctttg cctgacacat
     5881 cgttgcttca ggtacgccag gatctgccat ccccttcgga tttttattct actcctctgc
     5941 agcctggtgg ccaaagtggc tttctccctt caggggctcc tgcccagcag atgcttctac
     6001 ccatggtaga ctcacagctg cctgtggtga actttggctc cctgccgcca gcaccacctc
     6061 ctgccccacc tcccctttct ctgttacctg tgggccctgc tctgcagccc cccagcctgg
     6121 ctgtgcggcc cccacctgct cctgctactc gggtgctgcc ttcacctgcc aggcccttcc
     6181 ccgctagctt ggggcgagca gagctgcatc cagtggaact aaagccgttc caggattatc
     6241 aaaaactgag cagcaacctt gggggacctg gatcatcacg gactccccca actggaaggt
     6301 ccttctctgg cctcaattcc cgtctcaagg ccacgccttc cacctacagt ggagtcttcc
     6361 gcacccagcg cgtcgacctt taccagcagg cctccccacc agatgccctg cgctggatac
     6421 ctaagccttg ggagcggaca gggccgccac ctcgagaagg gccctcccga cgggcagagg
     6481 agcctgggtc ccgaggggac aaggagcctg ggttgccccc accccgctga gggagttcct
     6541 cttgccccct acccccgggg cttgtatata gattataaat atataagggg gaaaggggtg
     6601 ggcggggagg ggttgtgggg ctggggcctc acttcccctc ctcccccttc ccctggtccc
     6661 ctgtccctgg ggctgtttgt taaaaaagag taataaaagg attt
//
LOCUS       HUMBAT2B1     336 bp ds-DNA             PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 2 (BAT2) gene, 5' flank.
ACCESSION   M33510 M31293
KEYWORDS    class III gene; major histocompatibility complex;
            proline-rich protein.
SEGMENT     1 of 4
SOURCE      Human T-cell line MANN DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex encodes
            large proline-rich proteins with multiple repeated motifs and a
            single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 336)
  AUTHORS   Banerji,J.
  JOURNAL   Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Banerji, 11-JAN-1990, for release after publication.
BASE COUNT      108 a     60 c     65 g    101 t      2 others
ORIGIN      Chromosome 6p21.3.
        1 tctagaatcg ggtagtaaga gacaaaggag ggtaacagta ctgcatttca caaaatgaaa
       61 cccattgtta agaaattaca aattcccaat aatttcaaat ataaaaattt attcatgaaa
      121 attataggtt ataaaattaa atgtccgtct tagtcgatgg ttgcccatat tttgatgaac
      181 gagtcattcc tagcctatct ttgttcaaat gatttgcata cttatgcaaa taggtagaac
      241 tgcccgaaga atgcctacnt gcgtggtgcg gacgaaacgc ttgccgggsc ctttggattg
      301 gtctgtctag ccacctcatt tgcatgacgt aatata
//
LOCUS       HUMBAT2B2     188 bp ds-DNA             PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 2 (BAT2) gene, 5' end.
ACCESSION   M33511 M31293
KEYWORDS    class III gene; major histocompatibility complex;
            proline-rich protein.
SEGMENT     2 of 4
SOURCE      Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 154 to 188)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex encodes
            large proline-rich proteins with multiple repeated motifs and a
            single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 188)
  AUTHORS   Banerji,J.
  JOURNAL   Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pre-msg     154      188     BAT2 mRNA
BASE COUNT       20 a     64 c     59 g     18 t     27 others
ORIGIN      About 500 bp after segment 1; chromosome 6p21.3.
        1 gtgcthhhng gggcggcggt tccgcggatg ggccgttagt cgggstcagc cgcggagtga
       61 gngagggaga cgnnaggasg aacccggcca tccgccgcca tcctcccccg ccccaccgcc
      121 atccgtcccg gggacnnnnn nnnnnnnnnn nnncctaggc ccgggtcccg gatccccgcg
      181 cacccggc
//
LOCUS       HUMBAT2B3    3090 bp ds-DNA             PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 2 (BAT2) gene, exons 2 through 4.
ACCESSION   M33512 M31293
KEYWORDS    class III gene; major histocompatibility complex;
            proline-rich protein.
SEGMENT     3 of 4
SOURCE      Human T-cell line MANN DNA (introns), and T-cell line HPB-All, cDNA
            to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1460 to 1572; 2396 to 2572; and 2919 to 3090)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex encodes
            large proline-rich proteins with multiple repeated motifs and a
            single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3090)
  AUTHORS   Banerji,J.
  JOURNAL   Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept       1460     1572     HLA-B-associated transcript 2 (BAT2), exon 2
                                 (first expressed exon)
               2396     2572     HLA-B-associated transcript 2, exon 3
               2919  +  3090     HLA-B-associated transcript 2, exon 4
    pre-msg <     1  >  3090     BAT2 mRNA and introns
    IVS     <     1     1459     BAT2 intron A (no splice consensus)
    IVS        1573     2395     BAT2 intron B
    IVS        2573     2918     BAT2 intron C
BASE COUNT      747 a    709 c    730 g    901 t      3 others
ORIGIN      About 500 bp after segment 2; chromosome 6p21.3.
        1 tctagaatcc tgcttttatc ccagcatctt tgctttctat gttgctcagt cgccctatgt
       61 ctgctttttc atttttcctg ttcctcgtct cctttctccc ccaaccccgt ttttcttctt
      121 gggcctctgc cccttacttc gttgtctaca tccttttttt ttttgccatt cctgtttcca
      181 tatattttcc acctgctttc gtattcatta ttttctgtta gttttggact attcgctaca
      241 tgactcttgt attcgttttc ccttcatata tttatcttca cagattggcc tcctcaaaca
      301 cctacgaagc aacatccatc ttatgtgtag cttgtcataa agttctttct ccccaatttt
      361 agctttcatt ctgggcctgt ctggatttcc ctgctttctt ccccactatt tctcatctct
      421 ttacactgtt cccgaccata aacgaatgcc tggtcactct ggaatggact gagagacctg
      481 tcgtccggct tgcttaggga gctggaggta tcgagtaaag aaacactggt gatggacatt
      541 tttaatcagg ataggaaaac gaagatggct ctgccttggc cctctgtttt ctggcccatg
      601 gttacagggt gctaaggtgg ctccataatg ctttttctca gttcttcata tggtaaaaca
      661 gtatttcatc tggaggcgat tttttccagg agccaataca ggagcaagtt taccaaaaga
      721 tgggatattt caaatacttg aggttcctat agcctgggag tatgtacagc cctagttgtt
      781 ctatgaggat ttctctggta ccaaccccca ttccngctga gcaagctcat aaaatcctta
      841 aactcccagc ataccttnct gcaaaccttc ccagatggac acgaggctgc tgggctggga
      901 gctggggtac agggccctgg gggcatgatt agggagcttg tgtccaataa acagggaatc
      961 taaagtgttg tttcttcttc tctgatggaa ttgtatgctt cttttttagt tttctcttag
     1021 cttgaatttg tcctgttgta agtctctgaa acgattttgg tggagagaga agagattatt
     1081 acttgtaggg aattactctt tngtagacag gcacaaaggg cagagtgttt atactaggag
     1141 gatgctggat ttttacttag atttccttgt aacaaaggtc gtctggggcc aaggagggaa
     1201 catggcattt gagctatgag ggagctaagt agatcatggt tggactttaa gaagagtggg
     1261 cagtttacat agactggagg aaaagacacc agagggactc atatctgagt ccctaatgat
     1321 aatgcaatgg agtttttaag tttctgttat ggtctgtaca gggacagaga ctgagacact
     1381 tgcgtctggc ccacaggctc tggcacgttt tgggggaggt gcctgcagga cccaacatac
     1441 tcaatgagct tccagcgcaa tgtccgatcg ctcggggccg actgccaagg gaaaggatgg
     1501 aaagaagtat tcctcgctca acctgtttga tacgtataag ggcaagtcct tagagatcca
     1561 gaaacccgcc tggtgagagt cctgcaaaga tgcttctgat ggttgaaaag ctaggcatgc
     1621 atggggcata cgttttagag ctctaaagga agtggctgta gtagaaatac caaaagacta
     1681 gaggagattt cccaacttac actgggtcct ttaaaggggg tgtgggctct gggtgaacac
     1741 cagttatcct cctacaaagg cgtgtctgtg gttccctgtc tttggacacg taagaattgg
     1801 aggaaataaa tgtggatttg ggaaactttg aggccagctt gcttcttgca ggctcatgat
     1861 caaccaatct cacataaaag tattgaatgt tacatatctc agccttcttg atagggattt
     1921 actagatttt tttttttttt tttttttttt ttttttgaga ccaagtttag ctcctgttgc
     1981 ccaggctgga gtgcaatggt gtgatcttga cttaccacaa cctccaccgc ctgggtttaa
     2041 gcgattatcc tgcctcagcc tcctgagtag ctgggattac aggcatgcac cccggctaat
     2101 tttgtgtttt tagtagagac agggtttctc cattttggtc aagctggtct tgaactcctg
     2161 acctcaggtg atccgcctcc ctcggcctgc caaagtgctg ggattgcaaa gtgtgagcca
     2221 ccacaatcag cgcgatttca gagattatta aggcagggga aggaatccct tctaagagaa
     2281 gtttggagga agtaggtaat aaaatattca acatgtataa atgtgtccca ggataggagg
     2341 ccatcagatc tcccacatga ggcattttcg accctctctc cgtcttgttc tccagttgcc
     2401 cctcgccatg gcctgcagag tctcgggaaa gttgccattg cccggcgtat cgacctccag
     2461 ccaaccttcc aagcctgaaa gccgagaaca aaggcaatga ccccaatgtc tcactagtgc
     2521 caaaagacgg aacaggatgg gcaagcaaac aggagcagtc cgaccccaag aggtagacag
     2581 aggcttgggg gacctagagt gatgggtatt ttaacttgaa cttcagggag cattggggct
     2641 tggtttagtc cagccacgtc tgaagagacg aagaggtccc tttcttacct attgcaggtt
     2701 ccttgttaaa tgactaagga atggtactaa actttagctt tttgtcttgg agagagagca
     2761 tgaaaaaata gacaacaggt acaaggatga caaaattaat ttgtccttat atttgtaaat
     2821 ggtagcaatg ggcatgattt cagtcctgag tctccaccag ttggagaagt cagggaggca
     2881 tctcaggtgt gaataacctt cccattctgt cccctcagtt ccgatgcctc aaccgctcag
     2941 ccgccggaat cgcagccact gccggcttca cagacgcctg cctccaacca gccgaaacga
     3001 cccccagcag cccccgagaa cactcctttg gttccaagcg gggtaaagtc ctgggcacaa
     3061 gccagcgtca cccatggagc acatggagat
//
LOCUS       HUMBAT2B4    6349 bp ds-DNA             PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 2 (BAT2) gene, 3' end.
ACCESSION   M33518 M31293
KEYWORDS    class III gene; major histocompatibility complex;
            proline-rich protein.
SEGMENT     4 of 4
SOURCE      Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 209 to 6349)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex encodes
            large proline-rich proteins with multiple repeated motifs and a
            single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 6349)
  AUTHORS   Banerji,J.
  JOURNAL   Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +   209     6175     HLA-B-associated transcript 2 (BAT2), exon 5
    pre-msg <     1     6349     BAT2 mRNA and introns
    IVS     <     1      208     BAT2 intron D (no splice consensus)
    signal     6337     6342     poly-A signal
BASE COUNT     1334 a   2095 c   1798 g   1121 t      1 others
ORIGIN      About 370 bp after segment 3; chromosome 6p21.3.
        1 agctaatttg tgtgtgttta gtagagatgg gttcacatgt tggcagatgg tctcgatctc
       61 ttgacctctg tgatccgccc gcctcagccg gtcccagagt gctgggatta caggcgtgag
      121 ccaccgcgcc cagccagagt cttccacttt tatnagcatg tcctcaggaa atgtcttctg
      181 tctcctgttc tgcatcccca tcctaatagg tggaagggca tcaagcctac tgtcacgatt
      241 ctctcgagag gaatttccga ccctgcaggc ggctggcgac caggacaagg ctgccaagga
      301 aagggagtct gccgaacagt cgtctgggcc cggaccaagc ctccgccccc aaaattctac
      361 aacttggagg gacggaggtg ggcgtggccc tgatgagctg gagggcccgg actccaaact
      421 tcatcatggt catgatcccc ggggtgggct acagccttca ggcccacccc agttccctcc
      481 ctaccgcgga atgatgccgc ctttcatgta tcccccatat ctcccgttcc ctccgcccta
      541 tggaccccag gggccttacc gataccccac tcctgatggg cccagccgtt ttccccgtgt
      601 ggcgggcccc cgaggctcag ggccaccaat gcgcttagta gagcctgtgg gtcgtccctc
      661 tattctcaaa gaggataatc tcaaagagtt tgatcagttg gatcaggaga atgatgatgg
      721 ttgggcaggg gcccatgaag aggttgacta cactgaaaag ctcaagttca gcgatgagga
      781 agatgggcga gactctgatg aggagggagc tgagggccac agggattccc aatcagcttc
      841 tggtgaggaa cggccccctg aagcagatgg caaaaagggc aactccccca acagcgaacc
      901 gcccactcct aagacggcct gggcagaaac ctctcggcct ccagagacag agccgggacc
      961 tcctgcccca aagcctcccc taccccctgg ggactaccca gatcgtgggg gtcctccctg
     1021 caagccccca gcacctgaag atgaggatga ggcatggcgg cagcgacgaa agcagtcgtc
     1081 atctgagatt tccctggcag tggagcgggc ccggcgacgg cgagaagaag aggagcggcg
     1141 catgcaagaa gagcgccggg cagcctgtgc tgagaagctc aagcgactcg atgaaaagtt
     1201 tggggcacct gacaagcggc tcaaagcaga gcctgctgcc ccacctgctg ccccttctac
     1261 cccagcccca ccacctgcag tccctaaaga actccctgca cctccagctc cacctccagc
     1321 atcagcccca acaccagaga cagaacctga agagccagca caggcccctc ctgcccaatc
     1381 tactcctact ccaggtgtgg ctgcggctcc cactctggtg agtggtggtg gcagtaccag
     1441 tagcaccagc agtggcagct tcgaagccag cccagtggaa ccacaactgc cctcaaaaga
     1501 gggtcctgaa ccaccagaag aggttcctcc tcctaccaca cccccagttc caaaggtgga
     1561 acccaagggt gatgggattg gtcccacccg ccagccccct agtcagggct tgggctaccc
     1621 caaatatcag aagtcgttgc ctcctcgttt ccagcggcag cagcaggagc agctcctgaa
     1681 gcagcagcag cagcaccagt ggcagcagca tcaacagggc tctgcccctc ctaccccagt
     1741 gcccccatca ccaccacagc ctgtgaccct gggggctgtg ccagctccac aggctccacc
     1801 cccgcccccc aaggccctgt acccaggtgc tctgggccgg cccccaccca tgcccccaat
     1861 gaactttgat ccccgatgga tgatgattcc tccttatgtg gacccccggc tcctccaggg
     1921 tcgtccccct ctagagttct accctcctgg tgtgcatccc tctggcctag ttccccgaga
     1981 gcgttcagac agtctggggc tcagctcaga gccatttgac cgtcatgcac ctgctatgtt
     2041 acgggaacgg ggcactccac cggtggatcc aaagttggcc tgggtaggag atgtcttcac
     2101 cgccacaccc gctgaacccc gcccacttac ctcacctctg cgccaggctg cggatgagga
     2161 tgacaagggg atgaggagcg agactcctcc agtacctccc ccaccaccct atctggccag
     2221 ttatccaggc tttcctgaga atggagcccc tgggccccca atctctcgct ttcctctgga
     2281 ggaaccaggg ccccgtccac tcccctggcc cccaggcagt gatgaagtgg ccaagataca
     2341 aactccacca cccaagaagg agccccctaa ggaggagact gcacagctga cggggccaga
     2401 agcaggccga aagctgcccg cgagtcggag tggagcaggc cccccaccac cacgcagaga
     2461 gagtcgcaca gagacccgct ggggccctcg tccagggagc agtcgtcgtg gaatccctcc
     2521 agaggagcca ggggccccac cccgccgggc tgggcctata aagaaacctc caccacctac
     2581 aaaagtagaa gagctgcctc ccaagcccct cgaacagggg gatgaaaccc ccaaaccccc
     2641 aaagccagac ccactcaaga taaccaaggg gaagctaggg ggccccaagg agaccccacc
     2701 caatggaaat ctttcccctg ccccaaggct tcggagggac tattcgtatg aaagagtggg
     2761 tcctacctct tgccggggtc ggggccgagg cgagtatttt gccagaggga ggggttttcg
     2821 ggggacctat gggggacgag ggcggggagg ccaagcgaat tccgcagtta ccgagagttt
     2881 cgaggagatg atgggcgtgg aggtgggaca gggggaccaa accaccctcc tgctccccga
     2941 ggccgccatg ccagcgagac acggagcgag ggttcagagt atgaggaaat ccccaagcgg
     3001 tgccggcagc ggggctcaga aacaggcagc gagacccatg agagtgatct ggctccttca
     3061 gacaaggagg ctcccacacc caaggaggga acactcaccc aggtcctctc gctcccccac
     3121 caccaggagc cccacccttc accgagcgcc agcccgcttc acgtgcccgg gggtcggcga
     3181 gtcttcactc ccagagggtg ccatctcgcc ggggccgagg aggagggagg cccctcctca
     3241 agtttgccca ggctggagcc ctccagccaa gtctctggct cccaagaaac ctcccacagg
     3301 ccctttgcca ccaagtaagg agcctttgaa agagaagttg atcccagggc ctctgtcccc
     3361 tgtggcgcgc ggaggcagca atggaggtag caatgtgggc atggaagatg gggagcgacc
     3421 ccgaaggagg cgacatggga gggctcagca gcaggataaa ccgcctcgtt tccggaggct
     3481 gaagcaggaa cgggagaatg ccgcaagggg gtctgagggc aagccctccc taacccttcc
     3541 agcctccgct cctggacctg aggaggccct cacaacagtc acagtggccc cagcacctcc
     3601 gcgggcagct gccaagtctc ctgatctgtc aaaccagaac tcagaccaag ccaatgagga
     3661 atgggagact gcatcagaga gcagtgactt caccagtgag cgccgagggg acaaagaggc
     3721 acccccacca gtactgctga cacccaaggc tgtgggaact cctgggggag gtggaggtgg
     3781 agccgtacca ggtatttcag ccatgtcccg cggagatctg agccagagag ccaaggattt
     3841 gagtaaacgg agcttctcaa gtcagcggcc aggcatggaa cggcagaatc ggcgccctgg
     3901 cccagggggc aaggctggca gcagtggcag cagcagtgga ggaggcggtg ggggtcctgg
     3961 aggaaggacc gggccaggac gaggcgacaa gaggagctgg ccctctccca agaaccgaag
     4021 tcgtcctcca gaggagcgtc ccccggggct tcccctgcct cccccacctc ccagcagttc
     4081 tgctgtcttc cgcctggacc aagttatcca cagcaaccct gctggcatcc aacaggctct
     4141 ggcccagctt agtagccgtc aagggagtgt aactgcacca gggggtcatc caaggcacaa
     4201 gcctgggcct ccccaagccc ctcagggccc ctctcctagg cccccaaccc gatacgagcc
     4261 ccagagggtc aacagcggcc tcagttctga cccccacttt gaggagccgg ggccaatggt
     4321 gagaggggtg ggtgggactc ctcgggactc tgccggggtt agtccctttc cccctaaacg
     4381 tcgggagcgg cctcccagaa aaccagagct gctacaggag gaatctttgc cacctcctca
     4441 tagctctgga ttcttgggct ctaagcctga gggcccaggc cctcaggcag agtccagaga
     4501 tacaggcaca gaggccctga cccctcacat ctggaaccgt ttacatactg ccactagccg
     4561 aaagagttac cggcccacgt ccatggagcc ttggatggag cccctgagtc cttttgagga
     4621 tgtggctggc acagaaatga gtcagtctga cagtggggtg gacctgagtg gggattctca
     4681 ggtgtcatca ggtccctgca gccagcgaag ttcccctgat ggaggactca agggggcagc
     4741 agagggaccc cccaagaggc ctggaggctc ctcacccctg aatgctgttc cttgtgaggg
     4801 tccacctggc tctgaacctc ctaggagacc accacctgcc ccccacgatg gggacagaaa
     4861 ggagctgccc cgggagcagc ctctgccccc tggccccatt ggcacagaac gatcacagcg
     4921 tacagaccga ggcacagagc ctggccccat tcggccatcc catcgacctg gtcccccagt
     4981 ccagtttggc actagtgaca aggactcaga cttacgccta gtggtaggag acagcttgaa
     5041 agcagagaag gagctaacag catcagtcac tgaggccatt cctgtatcac gagactggga
     5101 gctgcttccc agtgctgctg cctctgctga gccacaatcc aagaacctgg attctgggca
     5161 ctgtgtcccg gagcccagct cctcaggcca gcgcctgtat cctgaggttt tctatggcag
     5221 tgctgggcct tccagttctc agatctctgg ggggagccat ggactctcaa ttacatccaa
     5281 acagtggagg cttcgccctg ggacaccctc actgcaccct tacagatcac agcccctata
     5341 cctacccccg ggcccagccc ctccctcagc actgctctct ggggtagctc tcaagggcca
     5401 gtttctggat ttctccacaa tgcaagctac agagctgggg aagttgccgg ctggaggagt
     5461 tctctaccct ccaccttcct tcctctactc tccggctttc tgccccagtc ctttgcctga
     5521 cacatcgttg cttcaggtac gccaggatct gccatcccct tcggattttt attctactcc
     5581 tctgcagcct ggtggccaaa gtggctttct cccttcaggg gctcctgccc agcagatgct
     5641 tctacccatg gtagactcac agctgcctgt ggtgaacttt ggctccctgc cgccagcacc
     5701 acctcctgcc ccacctcccc tttctctgtt acctgtgggc cctgctctgc agccccccag
     5761 cctggctgtg cggcccccac ctgctcctgc tactcgggtg ctgccttcac ctgccaggcc
     5821 cttccccgct agcttggggc gagcagagct gcatccagtg gaactaaagc cgttccagga
     5881 ttatcaaaaa ctgagcagca accttggggg acctggatca tcacggactc ccccaactgg
     5941 aaggtccttc tctggcctca attcccgtct caaggccacg ccttccacct acagtggagt
     6001 cttccgcacc cagcgcgtcg acctttacca gcaggcctcc ccaccagatg ccctgcgctg
     6061 gatacctaag ccttgggagc ggacagggcc gccacctcga gaagggccct cccgacgggc
     6121 agaggagcct gggtcccgag gggacaagga gcctgggttg cccccacccc gctgagggag
     6181 ttcctcttgc cccctacccc cggggcttgt atatagatta taaatatata agggggaaag
     6241 gggtgggcgg ggaggggttg tggggctggg gcctcacttc ccctcctccc ccttcccctg
     6301 gtcccctgtc cctggggctg tttgttaaaa aagagtaata aaaggattt
//
LOCUS       HUMBAT3A     3740 bp ss-mRNA            PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 3 (BAT3) mRNA, complete cds.
ACCESSION   M33519 M31294
KEYWORDS    class III gene; major histocompatibility complex;
            proline-rich protein.
SOURCE      Human T-cell line HPB-All, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3740)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex encodes
            large proline-rich proteins with multiple repeated motifs and a
            single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept        250     3648     HLA-B-associated transcript 3 (BAT3)
    mRNA          1     3740     BAT3 mRNA
BASE COUNT      744 a   1182 c   1057 g    757 t
ORIGIN      Chromosome 6p21.3.
        1 ggcgacagcg gtggcggctc ctcggggtgc tcggctccct cccacctagg ccggccccgg
       61 cccgactcgc cctcagaaac tcactgtttg gggctgcgga ctttctcgtc gtgccccaca
      121 aaagtaaagc ttggggacct ggggggagcc ggaagtatcg cttcgagatc cccaaatact
      181 atcggggaaa cggaagtggc cgtcggtggc aggtttgggg gagaccggaa gtgacgagac
      241 ctgtcggcca tggagcctaa tgatagtacc agtaccgctg tggaggagcc tgacagcttg
      301 gaggtgttgg tgaagacctt ggactctcaa actcgtacct ttattgtggg ggcccagatg
      361 aatgtaaaag agtttaagga gcacattcgt gcctctgtca gcatcccatc tgaaaaacaa
      421 cggctcattt accagggacg agttctgcaa gatgataaga agcttcagga atacaatgtt
      481 gggggaaagg ttatccacct ggtggaacgg gctcctcctc agactcacct cccttctggg
      541 gcatcttctg ggacggggtc tgcctcagcc actcatggtg ggggatcccc ccctggtact
      601 cgggggcctg gggcctctgt tcatgaccgg aatgccaaca gctatgtcat ggttggaacc
      661 ttcaatcttc ctagtgacgg ctctgctgtg gatgttcaca tcaacatgga acaggccccg
      721 attcagagtg agccccgggt acggctggtg atggctcagc acatgatcag ggatatacag
      781 accttactat cccggatgga gactctcccc taccttcagt gtcgaggagg gccccaaccg
      841 cagcacagtc agccgccccc gcagccaccg gctgtgaccc cggagccagt agccttgagc
      901 tctcaaacat cagaaccagt tgaaagtgaa gcacctcccc gggagcccat ggaggcagaa
      961 gaagtggagg agcgtgcccc agcccagaac ccggagctca ctcctggccc agccccagcg
     1021 ggcccaacac ctgccccgga aacaaatgca cccaaccatc cttcccctgc ggagtatgtc
     1081 gaggtgctcc aggagctaca gcggctggag agtcgcctcc agcccttctt gcagcgctac
     1141 tacgaggttc tgggtgctgc tgccaccacg gactacaata acaatcacga gggccgggag
     1201 gaggatcagc ggttgatcaa cttggtaggg gagagcctgc gactgctggg caacaccttt
     1261 gttgcactgt ctgacctgcg ctgcaatctg gcctgcacgc ccccacgaca cctgcatgtg
     1321 gtccggccta tgtctcacta caccaccccc atggtgctcc agcaggcagc cattcccata
     1381 cagatcaatg tgggaaccac tgtgaccatg acaggaaatg ggactcggcc ccccccaact
     1441 cccaatgcag aggcacctcc ccctggtcct gggcaggcct catccgtggc tccgtcttct
     1501 accaatgtcg agtcctcagc tgagggggct cccccgccag gtccagctcc cccgccagcc
     1561 accagccacc cgagggtcat ccggatttcc caccagagtg tggaacccgt ggtcatgatg
     1621 cacatgaaca ttcaagattc tggcacacag cctggtggtg ttccgagtgc tcccactggc
     1681 cccctgggac cccctggtca tggccaaacc ctgggacagc aggtgccagg cttcccaaca
     1741 gctccaaccc gggtggtgat tgcccggccc actcctccac aggctcggcc ttcccatcct
     1801 ggagggcccc cagtctctgg gacactgcag ggcgccggtc tgggtaccaa tgcctcgttg
     1861 gcccagatgg tgagcggcct tgtggggcag cttcttatgc agccagtcct tgtggctcag
     1921 gggaccccag gtatggctcc accgccagcc cctgccactg cttctgccag tgctggcacc
     1981 accaacacag ctaccacagc tggccccgct cctggggggc ctgcccagcc tccacccacc
     2041 cctcaaccct ccatggctga tcttcagttc tctcagcttc tggggaacct gctagggcct
     2101 gcagggccag gggctggagg gcctggtgtg gcttctccca ccatcactgt ggcgatgcct
     2161 ggtgtccctg cctttctcca aggcatgact gacttcttgc aggcaacaca gacagcccct
     2221 ccaccacccc cacctcctcc acccccacca cctgccccag agcagcagac catgccccca
     2281 ccaggctccc cttctggtgg cgcagggagt cctggaggcc tgggtcttga gagcctgtca
     2341 ccggagtttt ttacctcagt ggtgcagggt gtgctcagct ccctgctggg ctccctgggg
     2401 gctcgggctg gcagcagtga aagtattgct gccttcatac aacgcctcag tggatccagc
     2461 aacatctttg agcctggagc tgatggggcc cttggattct ttggggcctt gctttctctt
     2521 ctgtgccaga acttctctat ggtggacgta gtgatgcttc tccatgggca tttccagcca
     2581 ctacaacggc tccagcccca gctgcgatcc ttcttccacc agcactacct gggtggtcag
     2641 gagcccacac ccagtaacat ccggatggca acccacacat tgatcacggg gctagaagag
     2701 tatgtgcggg agagtttttc cttggtgcag gttcagccag gtgtggacat catccggaca
     2761 aacctggaat ttctccaaga gcagtttaat agcattgctg cgcatgtgct gcattgcaca
     2821 gatagtggat ttggggcccg gttgctggag ttgtgtaacc aaggcctgtt tgaatgcctg
     2881 gccctaaacc tgcactgctt ggggggacag cagatggagc ttgctgctgt tatcaatggc
     2941 cgaattcgtc gtatgtctcg tggggtgaat ccctccttgg tgagctggct gaccactatg
     3001 atgggactga ggcttcaggt ggtactggag cacatgcctg taggccctga tgccattctc
     3061 agatacgttc gcagggttgg tgatcccccc cagccacttc ctgaggagcc aatggaagtt
     3121 cagggagcag aaagagcttc ccctgagcct cagcgggaga atgcttcccc agcccctgga
     3181 acaacagcag aagaggccat gtcccgaggt ccacctcctg ctcctgaggg gggctcccgg
     3241 gatgaacagg atggagcttc agctgagaca gaaccttggg cagctgcagt ccccccagaa
     3301 tgggtcccta ttatccagca ggacattcag agccagcgga aggtgaaacc gcagccccct
     3361 ctgagtgatg cctacctcag tggtatgcct gccaagagac gcaagacgat gcagggtgag
     3421 ggcccccagc tgcttctctc agaggctgtg agccgggcag ctaaggcagc cggagctcgg
     3481 cccctgacga gccccgagag cctgagccgg gacctggagg caccagaggt tcaggagagc
     3541 tacaggcagc agctccggtc tgatatacaa aaacgactgc aggaagaccc caactacagt
     3601 ccccagcgct tccccaatgc ccagcgggcc tttgctgatg atccttagct ctttgctcta
     3661 tggcccttcc tcatcagggg accgtttccc ccctcttcct tcacagtatt taagaaataa
     3721 aagtcggatt ttttctggcc
//
LOCUS       HUMBAT3B1     785 bp ds-DNA             PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 3 (BAT3) gene, 5' end.
ACCESSION   M33520 M31294
KEYWORDS    class III gene; major histocompatibility complex;
            proline-rich protein.
SEGMENT     1 of 2
SOURCE      Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 333 to 689)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex encodes
            large proline-rich proteins with multiple repeated motifs and a
            single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 785)
  AUTHORS   Banerji,J.
  JOURNAL   Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept        582  +   689     HLA-B-associated transcript 3 (BAT3), exon 1
    pre-msg     333  >   785     BAT3 mRNA and introns
    IVS         690  >   785     BAT3 intron A
BASE COUNT      170 a    210 c    239 g    165 t      1 others
ORIGIN      Chromosome 6p21.3.
        1 aaggcgcagc gagggcaata gggtggagaa gagttttagc tgctagacag tgccgcctga
       61 aattatcagc ctgccaagat ttaaacatag atgaatgtgg cataatcccc catctccaaa
      121 gtccaagtcc atacgaccgt ccatagcctc tcgaggcagt ggtagagtcc cagctggtga
      181 ctgtttttca ggcatttacg gtagccacct caatcttcta gcgctcaacg cgcgcacaga
      241 cgtgaacgcc gccagagggg ggagggggtg gggcgatgct taagtgtcca cgcatcccgt
      301 agtgcgacgg cacagcgtag taggtncccc cgggcgacag cggtggcggc tcctcggggt
      361 gctcggctcc ctcccaccta ggccggcccc ggcccgactc gccctcagaa actcactgtt
      421 tggggctgcg gactttctcg tcgtgcccca caaaagtaaa gcttggggac ctggggggag
      481 ccggaagtat cgcttcgaga tccccaaata ctatcgggga aacggaagtg gccgtcggtg
      541 gcaggtttgg gggagaccgg aagtgacgag acctgtcggc catggagcct aatgatagta
      601 ccagtaccgc tgtggaggag cctgacagct tggaggtgtt ggtgaagacc ttggactctc
      661 aaactcgtac ctttattgtg ggggcccagg tgagacacct cactagttct ggaagacacc
      721 tttagctttt ccacgtttag gccccttagc ctgagagatg agcttgattt ttctaggtca
      781 ccaga
//
LOCUS       HUMBAT3B2    4227 bp ds-DNA             PRI       18-JAN-1990
DEFINITION  Human HLA-B-associated transcript 3 (BAT3) gene, 3' end.
ACCESSION   M33521 M31294
KEYWORDS    class III gene; major histocompatibility complex;
            proline-rich protein.
SEGMENT     2 of 2
SOURCE      Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 731 to 848; 963 to 4227)
  AUTHORS   Banerji,J., Sands,J., Strominger,J.L. and Spies,T.
  TITLE     A gene pair from the human major histocompatibility complex encodes
            large proline-rich proteins with multiple repeated motifs and a
            single ubiquitin-like domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 4227)
  AUTHORS   Banerji,J.
  JOURNAL   Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Banerji, 11-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +   731      848     HLA-B-associated transcript 3 (BAT3), exon 2
                963     4135     HLA-B-associated transcript 3, exon 3
    pre-msg <     1     4227     BAT3 mRNA and introns
    IVS     <     1      730     BAT3 intron A
    IVS         849      962     BAT3 intron B
BASE COUNT      842 a   1276 c   1116 g    987 t      6 others
ORIGIN      About 1.1 kb after segment 1; chromosome 6p21.3.
        1 ttatcttntt agatcatttc cttccacctt aacctatacc agacccactc cttctttgcc
       61 attttttaat cttggaaatc acaggagngt ctgtaaatna ctggatcatc ttgtgtttgg
      121 aaggggtact gatgtctcta gacacatacn cccttggatg ccagacagat aatataattt
      181 ccatgtgttt tttttttgtt tttcatccgt gttatttttc ctggatctat aacctgagct
      241 tcattaagtt tatttattta attttttcga gatggagtcc cacnctttca cccaggctag
      301 agtgtagtga tgcgatctcg gctcactgca acctccgcct cccgaattca agtgattctc
      361 ttgcttcagc ctccctagta gctgggatta caggcgacca ccatgcctgg cttatttttt
      421 gtatttttgg taaaaagggg ttttacatgt tggccaggct ggtctcgaac tctgacctaa
      481 gtgatctgcc tgccttggcc tcccaagtgc tggattacag tgtgagacca ccgctccagc
      541 caatatgtct gtatttttga cacgtgttac tttagttaag ggtttgcaca gtaatgatct
      601 cacggtcaag acaaacgggt agtgattdct gtggtggttt ttacccctca cctccacaac
      661 tcggttgtct gtctttgttc ttcctctttc ctccattctt tccattcctg tgcatgcctc
      721 ttcttttcag atgaatgtaa aagagtttaa ggagcacatt cgtgcctctg tcagcatccc
      781 atctgaaaaa caacggctca tttaccaggg acgagttctg caagatgata agaagcttca
      841 ggaatacagt aagggggctg gggaggcagt tcagaggttg gggctactgt ctggagggat
      901 gaactgaggc catgggttta cctgttcata ctatgttttg gtgtgtgtct atttttctgc
      961 agatgttggg ggaaaggtta tccacctggt ggaacgggct cctcctcaga ctcacctccc
     1021 ttctggggca tcttctggga cggggtctgc ctcagccact catggtgggg gatccccccc
     1081 tggtactcgg gggcctgggg cctctgttca tgaccggaat gccaacagct atgtcatggt
     1141 tggaaccttc aatcttccta gtgacggctc tgctgtggat gttcacatca acatggaaca
     1201 ggccccgatt cagagtgagc cccgggtacg gctggtgatg gctcagcaca tgatcaggga
     1261 tatacagacc ttactatccc ggatggagac tctcccctac cttcagtgtc gaggagggcc
     1321 ccaaccgcag cacagtcagc cgcccccgca gccaccggct gtgaccccgg agccagtagc
     1381 cttgagctct caaacatcag aaccagttga aagtgaagca cctccccggg agcccatgga
     1441 ggcagaagaa gtggaggagc gtgccccagc ccagaacccg gagctcactc ctggcccagc
     1501 cccagcgggc ccaacacctg ccccggaaac aaatgcaccc aaccatcctt cccctgcgga
     1561 gtatgtcgag gtgctccagg agctacagcg gctggagagt cgcctccagc ccttcttgca
     1621 gcgctactac gaggttctgg gtgctgctgc caccacggac tacaataaca atcacgaggg
     1681 ccgggaggag gatcagcggt tgatcaactt ggtaggggag agcctgcgac tgctgggcaa
     1741 cacctttgtt gcactgtctg acctgcgctg caatctggcc tgcacgcccc cacgacacct
     1801 gcatgtggtc cggcctatgt ctcactacac cacccccatg gtgctccagc aggcagccat
     1861 tcccatacag atcaatgtgg gaaccactgt gaccatgaca ggaaatggga ctcggccccc
     1921 cccaactccc aatgcagagg cacctccccc tggtcctggg caggcctcat ccgtggctcc
     1981 gtcttctacc aatgtcgagt cctcagctga gggggctccc ccgccaggtc cagctccccc
     2041 gccagccacc agccacccga gggtcatccg gatttcccac cagagtgtgg aacccgtggt
     2101 catgatgcac atgaacattc aagattctgg cacacagcct ggtggtgttc cgagtgctcc
     2161 cactggcccc ctgggacccc ctggtcatgg ccaaaccctg ggacagcagg tgccaggctt
     2221 cccaacagct ccaacccggg tggtgattgc ccggcccact cctccacagg ctcggccttc
     2281 ccatcctgga gggcccccag tctctgggac actgcagggc gccggtctgg gtaccaatgc
     2341 ctcgttggcc cagatggtga gcggccttgt ggggcagctt cttatgcagc cagtccttgt
     2401 ggctcagggg accccaggta tggctccacc gccagcccct gccactgctt ctgccagtgc
     2461 tggcaccacc aacacagcta ccacagctgg ccccgctcct ggggggcctg cccagcctcc
     2521 acccacccct caaccctcca tggctgatct tcagttctct cagcttctgg ggaacctgct
     2581 agggcctgca gggccagggg ctggagggcc tggtgtggct tctcccacca tcactgtggc
     2641 gatgcctggt gtccctgcct ttctccaagg catgactgac ttcttgcagg caacacagac
     2701 agcccctcca ccacccccac ctcctccacc cccaccacct gccccagagc agcagaccat
     2761 gcccccacca ggctcccctt ctggtggcgc agggagtcct ggaggcctgg gtcttgagag
     2821 cctgtcaccg gagtttttta cctcagtggt gcagggtgtg ctcagctccc tgctgggctc
     2881 cctgggggct cgggctggca gcagtgaaag tattgctgcc ttcatacaac gcctcagtgg
     2941 atccagcaac atctttgagc ctggagctga tggggccctt ggattctttg gggccttgct
     3001 ttctcttctg tgccagaact tctctatggt ggacgtagtg atgcttctcc atgggcattt
     3061 ccagccacta caacggctcc agccccagct gcgatccttc ttccaccagc actacctggg
     3121 tggtcaggag cccacaccca gtaacatccg gatggcaacc cacacattga tcacggggct
     3181 agaagagtat gtgcgggaga gtttttcctt ggtgcaggtt cagccaggtg tggacatcat
     3241 ccggacaaac ctggaatttc tccaagagca gtttaatagc attgctgcgc atgtgctgca
     3301 ttgcacagat agtggatttg gggcccggtt gctggagttg tgtaaccaag gcctgtttga
     3361 atgcctggcc ctaaacctgc actgcttggg gggacagcag atggagcttg ctgctgttat
     3421 caatggccga attcgtcgta tgtctcgtgg ggtgaatccc tccttggtga gctggctgac
     3481 cactatgatg ggactgaggc ttcaggtggt actggagcac atgcctgtag gccctgatgc
     3541 cattctcaga tacgttcgca gggttggtga tcccccccag ccacttcctg aggagccaat
     3601 ggaagttcag ggagcagaaa gagcttcccc tgagcctcag cgggagaatg cttccccagc
     3661 ccctggaaca acagcagaag aggccatgtc ccgaggtcca cctcctgctc ctgagggggg
     3721 ctcccgggat gaacaggatg gagcttcagc tgagacagaa ccttgggcag ctgcagtccc
     3781 cccagaatgg gtccctatta tccagcagga cattcagagc cagcggaagg tgaaaccgca
     3841 gccccctctg agtgatgcct acctcagtgg tatgcctgcc aagagacgca agacgatgca
     3901 gggtgagggc ccccagctgc ttctctcaga ggctgtgagc cgggcagcta aggcagccgg
     3961 agctcggccc ctgacgagcc ccgagagcct gagccgggac ctggaggcac cagaggttca
     4021 ggagagctac aggcagcagc tccggtctga tatacaaaaa cgactgcagg aagaccccaa
     4081 ctacagtccc cagcgcttcc ccaatgccca gcgggccttt gctgatgatc cttagctctt
     4141 tgctctatgg cccttcctca tcaggggacc gtttcccccc tcttccttca cagtatttaa
     4201 gaaataaaag tcggattttt tctggcc
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       CHBSSRNA     1504 bp ss-RNA             RNA       01-FEB-1990
DEFINITION  C.limicola ribosomal RNA small subunit.
ACCESSION   M31769
KEYWORDS    ribosomal RNA; ribosomal RNA small subunit.
SOURCE      C.limicola (strain ATCC 8327) RNA.
  ORGANISM  Chlorobium limicola
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Green bacteria; Green sulfur bacteria.
REFERENCE   1  (bases 1 to 1504)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990) Microbiol. Dept. U. of IL., Urbana, IL. 61801
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 01-FEB-1990.
FEATURES       from  to/span     description
    rRNA          1     1504     ribosomal RNA small subunit
BASE COUNT      396 a    334 c    437 g    311 t     26 others
ORIGIN      
        1 nnncaacgga gagtttgatc ctggctcagg acgaacgctg gcggcgtgcc taacacatgc
       61 aagtcaaagg aaagtttctt cggaaataag tacttggcgc aagggtgagt aaggcatagg
      121 taatctgccc tttggactgg cataaccccg agaaatcggg gacaatacca gatgatgcag
      181 cggaaccgca tggttatgtt gttaaatgat ttatcgccaa aggatgagcc tatgttccat
      241 caggtagttg gtagggtaac ggcctaccaa gccaacgacg gatagctggt ctgagaggat
      301 gatcagccac attggaactg agacacggtc cagactccta cgggaggcag cagtgaggaa
      361 tattgcgcaa tgggcgaaag cctgacgcag caacgccgcg tggatgatga agttcttcgg
      421 aatgtaaaat ccttttgctg gggacgaata ggtcggcttg ccgactctga cggtacccag
      481 cgaataagcc acggctaact ctgtgccagc agccgcggtg atacaggggt ggcaagcgtt
      541 gtccggattt actgggtgta aagggtgcgc aggcggactg ataagtcggg ggttaaatcc
      601 atgtgctcaa cacatgcacg gcttccgata ctgtcagtct tgagtctcga agaggaagat
      661 ggaatttccg gtgtaacggt ggaatgtgta gatatcggaa agaacaccag tggcgaaggc
      721 agtcttctgg tcgagtactg acgctcaggc acgaaagcgt ggggagcaaa caggattaga
      781 taccctggta gtccacgccg taaacgatga atactagatg ttggtcatat tgatcagtgt
      841 cgcagctaac gcattaagta ttccacctgg gaagtacgcc cgcaagggtg aaactcaaag
      901 gaattgacgn nnnccngcac aagcggtgga tcatgtggtt taattcgatg caacgcgaag
      961 aaccttacct aggcttgaaa tgttagctaa agctcctgaa agggagcgtc cttcggggag
     1021 ctagcacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc
     1081 gcaacgagcg caacccctac aattagttac taacaggtta agctgaggac tctaattgaa
     1141 ctgcctacgc aagtagagag gaagganggg atgacgtcaa gtcctcatgg cccttacgcc
     1201 tagggccaca cacgtgatac aatggcgact acagagggca aagccgcgag gcagaggaaa
     1261 tcccttaaaa gtcgtctcag tccggatcgg agtctgcaac tcgactccgt gaagttggaa
     1321 tcgctagtaa tcgcggatca gcatgccgcg gtgaatgtgt tcccgggcct tgtacacacc
     1381 gcccgtcaag tcatggaagt caggagtacc caaagacgtc cgcgcgttta aggtaagact
     1441 ggtaactggg actaagnngt aacaagnnnn nngtaccgga agnnnnnnnn ngatcacctc
     1501 cttt
//
LOCUS       ECOSPEA      3236 bp ds-DNA             BCT       01-FEB-1990
DEFINITION  E.coli arginine decarboxylase (speA) gene, complete cds, agmatinase
            (speB) and methionine adenosyltransferase (metK) genes, 5' end.
ACCESSION   M31770
KEYWORDS    agmatinase; arginine decarboxylase; metK gene;
            methionine adenosyltransferase; speA gene; speB gene.
SOURCE      E.coli (strain K12) DNA, clones pLC2-5 and lambda-[1H10,23G45].
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3236)
  AUTHORS   Moore,R.C. and Boyle,S.M.
  TITLE     Nucleotide sequence and analysis of the speA gene encoding arginine
            decarboxylase of Escherichia coli
  JOURNAL   Unpublished (1990) Virginia S. Uni. Vet. Med., Blacksburg, VA 24061
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            R.C.Moore, 02-FEB-1990.
FEATURES       from  to/span     description
    pept        192  <     1 (c) methionine adenosyltransferase (metK)
    pept        987     2963     arginine decarboxylase (speA) (EC 4.1.1.19)
    pept       3101  >  3236     agmatinase (speB)
    signal     3009     3022     rho-independent transcription terminator
    signal      811      816     -35 signal
    signal      839      844     -10 signal
    binding     977      980     ribosome binding site
    site       1878     1886     pyridoxal phosphate binding site (put.)
BASE COUNT      743 a    784 c    860 g    849 t
ORIGIN      62.9 min on K12 map.
        1 tacccaaggt cgctggtggt gatttcgccg ccaactaaaa ccaatgccgg tttttacgta
       61 ggtttcgcaa gcaacgcgtg ctttcggatc ctgttcgagg atcgcgtcta aaacggcatc
      121 agaaatttgg tcagcaattt tgtcaggatg cccttcagag acggactcgg acgtaaaaag
      181 gtgttttgcc atatttaata tcacctaaag agaatttggt tagctcaaac tgttgtgtgg
      241 attttctgtg gtagcggatc ctaccacgac tctgcaggtt aaaaacactg gcagtctgag
      301 tgttaatcgg tatggatgga ttaacatctg gatggctatt ttaggtcaat tcttcaccct
      361 atttccactt ttttttgaat cgtgtctcat tctgttaaaa acgtggctgg aaatttttcc
      421 tgacaatgcc ggcattctgc gtatttatct tttgcaattt tctgccattg tggggtataa
      481 aacgcggcgc gcggcttaaa taaaaagcac acgacgtttc tttcgtgttg ccacttccag
      541 ccgggttcaa atcagagttt tggcttgtgg gttcgtctta acaggcggcc gtggaggtga
      601 tacgaaataa tgaaccgttg tctgctgctt aacctgtctc accgttctgg tgaagattcg
      661 ttccccgcac tctgcatctc tgctttgcat acctgccgat gttataccca tctcggcgct
      721 tctcaggatt caagagctgg ttacagttac tgaggactga acaagggcgc tcttgtaaaa
      781 acaagagttt tctcgtggtt tcgccgaact ttcacactta cgttcggtta tgtgcttaat
      841 aatgttatga aaaagaaacc ggttgcgcag ttggagcgtc agcattcact gctggaaaat
      901 ccatgtgctt atgggttgtt atcgcagttc caggctgcga tagtcgttaa ctgttttaca
      961 cttaataaaa taatttgagg ttcgctatgt ctgacgacat gtctatgggt ttgccttcgt
     1021 cagcgggcga acacggtgta ctacgctcca tgcaggaggt tgcaatgagc tcccaggaag
     1081 ccagcaagat gctgcgtact tacaatattg cctggtgggg caataactac tatgacgtta
     1141 acgagctggg ccacattagc gtgtgcccgg acccggacgt cccggaagct cgcgtcgatc
     1201 tcgcgcagtt agtgaaaact cgtgaagcac agggccagcg tctgcctgca ctgttctgtt
     1261 tcccacagat cctgcagcac cgtttgcgtt ccattaacgc cgcgttcaaa cgtgcgaggg
     1321 aatcctacgg ctataacggc gattacttcc ttgtttatcc gatcaaagtt aaccagcacc
     1381 gccgcgtgat tgagtccctg attcattcgg gcgaaccgct gggtctggaa gccggttcca
     1441 aagccgagtt gatggcagta ctggcacatg ctggcatgac ccgtagcgtc atcgtctgca
     1501 acggttataa agaccgcgaa tatatccgcc tggcattaat tggcgagaag atggggcaca
     1561 aggtctatct ggtcattgag aagatgtcag aaatcgccat tgtgctggat gaagcagaac
     1621 gtctgaatgt cgttcctcgt ctgggcgtgc gtgcacgtct gcgttcgcag ggttcgggta
     1681 aatggcagtc ctccggcggg gaaaaatcga agttcggcct ggctgcgact caggtactgc
     1741 aactggttga aaccctgcgt gaagccgggc gtctcgacag cctgcaacta ctgcacttcc
     1801 acctcggttc gcagatggcg aatattcgcg atatcgcgac aggcgttcgt gaatccgcgc
     1861 gtttctatgt ggaactgcac aagctgggcg tcaatattca gtgcttcgac gtcggcggcg
     1921 gtctgggcgt ggattatgaa ggtactcgtt cgcagtccga ctgttcggtg aactacggcc
     1981 tcaatgaata cgccaacaac attatctggg cgattggcga tgcgtgtgaa gaaaacggtc
     2041 tgccgcatcc gacggtaatc accgaatcgg gtcgtgcggt gactgcgcat cacaccgtgc
     2101 tggtgtctaa tatcatcggc gtggaacgta acgaatacac ggtgccgacc gcgcctgcag
     2161 aagatgcgcc gcgcgcgctg caaagcatgt gggaaacctg gcaggagatg cacgaaccgg
     2221 gaactcgccg ttctctgcgt gaatggttac acgacagtca gatggatctg cacgacattc
     2281 atatcggcta ctcttccggc atctttagcc tgcaagaacg tgcatgggct gagcagcttt
     2341 atttgagcat gtgccatgaa gtgcaaaagc agctggatcc gcaaaaccgt gctcatcgtc
     2401 cgattatcga cgagctgcag gaacgtatgg cggacaaaat gtacgtcaac ttctcgctgt
     2461 tccagtcgat gccggacgca tgggggatcg accagttgtt cccggttctg ccgctggaag
     2521 ggctggatca agtgccggaa cgtcgcgctg tgctgctgga tattacctgt gactctgacg
     2581 gtgctatcga ccactatatt gatggtgacg gtattgccac gacaatgcca atgccggagt
     2641 acgatccaga gaatccgccg atgctcggtt tctttatggt cggcgcatat caggagatcc
     2701 tcggcaacat gcacaacctg ttcggtgata ccgaagcggt tgacgtgttc gtcttccctg
     2761 acggtagcgt agaagtagaa ctgtctgacg aaggcgatac cgtggcggac atgctgcaat
     2821 atgtacagct cgatccgaaa acgctgttaa cccagttccg cgatcaagtg aagaaaaccg
     2881 atcttgatgc tgaactgcaa caacagttcc ttgaagagtt cgaggcaggt ttgtacggtt
     2941 atacttatct tgaagatgag taagtcctgt gttacttgaa tccgcttaat ttagcggtga
     3001 taatccgcca caatttattg tgacaaatcc aacccttcct cgtcgggcct aacgacgcgg
     3061 aagggttttt ttatatcgac tttgtaatag gagtccatcc atgagcacct taggtcatca
     3121 atacgataac tcactggttt ccaatgcctt tggtttttta cgcctgccga tgaacttcca
     3181 gccgtatgac agcgatgcag actgggtgat tactggcgtg ccgttcgata tggcca
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       MUSIGHRT      314 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
            68.2D8, partial cds.
ACCESSION   M32044
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 68.2D8.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
  TITLE     Mouse antibody response to group a streptococcal carbohydrate
  JOURNAL   J. Immunol. 143, 4213-4220 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    12  >   314     Ig H-chain V-region, exon X (AA at 13)
    IVS     <     1       11     Ig H-chain V-region intron
BASE COUNT       88 a     58 c     86 g     82 t
ORIGIN      Chromosome 12.
        1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
       61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg
      121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
      181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
      241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
      301 tactgcacag atct
//
LOCUS       MUSIGHRU      314 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
            68.3D3, partial cds.
ACCESSION   M32045
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 68.3D3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
  TITLE     Mouse antibody response to group a streptococcal carbohydrate
  JOURNAL   J. Immunol. 143, 4213-4220 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    12  >   314     Ig H-chain V-region, exon X (AA at 13)
    IVS     <     1       11     Ig H-chain V-region intron
BASE COUNT       88 a     58 c     85 g     82 t      1 others
ORIGIN      Chromosome 12.
        1 agggtccaga gtnaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
       61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg
      121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
      181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
      241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
      301 tactgcacag atct
//
LOCUS       MUSIGHRV      306 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
            80.1B5, partial cds.
ACCESSION   M32046
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain 80.1B5), cDNA to mRNA, anti-GAC hybridoma 80.1B5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 306)
  AUTHORS   Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
  TITLE     Mouse antibody response to group a streptococcal carbohydrate
  JOURNAL   J. Immunol. 143, 4213-4220 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    12  >   306     Ig H-chain V-region, exon X (AA at 13)
    IVS     <     1       11     Ig H-chain V-region intron
BASE COUNT       85 a     56 c     85 g     80 t
ORIGIN      Chromosome 12.
        1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
       61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg
      121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
      181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
      241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
      301 tactgc
//
LOCUS       MUSIGHRW      314 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
            68.2C4, partial cds.
ACCESSION   M32047
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 68.2C4.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
  TITLE     Mouse antibody response to group a streptococcal carbohydrate
  JOURNAL   J. Immunol. 143, 4213-4220 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    12  >   314     Ig H-chain V-region, exon X (AA at 13)
    IVS     <     1       11     Ig H-chain V-region intron
BASE COUNT       89 a     57 c     84 g     84 t
ORIGIN      Chromosome 12.
        1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
       61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg
      121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
      181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
      241 tccaaaagta atgtctacct tcaaatgaac agcttaaggg ctgaagacac tggaatttat
      301 tactgcacag attt
//
LOCUS       MUSIGHRX      314 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
            65.1B3, partial cds.
ACCESSION   M32048
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain A/J),cDNA to mRNA, anti-GAC hybridoma 65.1B3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
  TITLE     Mouse antibody response to group a streptococcal carbohydrate
  JOURNAL   J. Immunol. 143, 4213-4220 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    12  >   314     Ig H-chain V-region, exon X (AA at 13)
    IVS     <     1       11     Ig H-chain V-region intron
BASE COUNT       91 a     58 c     84 g     81 t
ORIGIN      Chromosome 12.
        1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
       61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgaactgg
      121 gtccgccagt ctccagagaa gggacttgaa tgggttactg aaattagatt gaaatctgat
      181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
      241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
      301 tactgcacag atct
//
LOCUS       MUSIGHRY      313 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
            66.4A10, partial cds.
ACCESSION   M32049
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 66.4A10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 313)
  AUTHORS   Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
  TITLE     Mouse antibody response to group a streptococcal carbohydrate
  JOURNAL   J. Immunol. 143, 4213-4220 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    12  >   313     Ig H-chain V-region, exon X (AA at 13)
    IVS     <     1       11     Ig H-chain V-region intron
BASE COUNT       86 a     59 c     84 g     83 t      1 others
ORIGIN      Chromosome 12.
        1 agggtncaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
       61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtcctgg
      121 gtccgccagt ctccagagaa gggactttat tgggttgctg aaattagatt gaaatctgat
      181 aattctgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
      241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat
      301 tactgcacag atc
//
LOCUS       MUSIGHRZ      314 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
            66.5A3, partial cds.
ACCESSION   M32050
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 66.5A3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
  TITLE     Mouse antibody response to group a streptococcal carbohydrate
  JOURNAL   J. Immunol. 143, 4213-4220 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    12  >   314     Ig H-chain V-region, exon X (AA at 13)
    IVS     <     1       11     Ig H-chain V-region intron
BASE COUNT       91 a     58 c     84 g     81 t
ORIGIN      Chromosome 12.
        1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
       61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaaccactg gatgtactgg
      121 gtccgccaat ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat
      181 aattatgtaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
      241 tccaaaagta gtgtctacct gcaaatgaac aacttaaggg ctgaagacac tggaatttat
      301 tacagcacag atct
//
LOCUS       MUSIGHSA      305 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
            66.3E3, partial cds.
ACCESSION   M32051
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 66.3E3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 305)
  AUTHORS   Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
  TITLE     Mouse antibody response to group a streptococcal carbohydrate
  JOURNAL   J. Immunol. 143, 4213-4220 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    12  >   305     Ig H-chain V-region, exon X (AA at 13)
    IVS     <     1       11     Ig H-chain V-region intron
BASE COUNT       85 a     55 c     85 g     80 t
ORIGIN      Chromosome 12.
        1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga
       61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtgctgg
      121 gtccgccagt ctccagagaa ggggcttgag tgggttgctg aaattagatt gaaatctaat
      181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
      241 tccaaaagta gtgtctacct gcaaatgaac aacttagagg ctgaagacac tggaatttat
      301 tactg
//
LOCUS       MUSIGHSB      305 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma
            65.1D2, partial cds.
ACCESSION   M32052
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 65.1D2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 305)
  AUTHORS   Jarvis,C.D.., Cannon,L.E. and Stavnezer,J.
  TITLE     Mouse antibody response to group a streptococcal carbohydrate
  JOURNAL   J. Immunol. 143, 4213-4220 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    12  >   305     Ig H-chain V-region, exon X (AA at 13)
    IVS     <     1       11     Ig H-chain V-region intron
BASE COUNT       85 a     55 c     86 g     79 t
ORIGIN      Chromosome 12.
        1 agggtccaga gtgaagtgaa acttgaggag tctggaggag gcttggtgca acctggagga
       61 tccatgaaac tctcctgtgc tgcctctgga ttcactttca ggaactcctg gatatactgg
      121 gtccgccagt ctccagagaa ggggcttgag tgggttggtg aaattagatt gaagtcagat
      181 aattttgcaa cacaatatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat
      241 tccaaaagta gtgtctactt gcaaatgaac agcttaaggg ttgaagacac tgcaatttat
      301 tactg
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       BSPCGTD      2648 bp ds-DNA             BCT       15-DEC-1989
DEFINITION  Bacillus sp. cyclomaltodextrin glucanotransferase gene, complete
            cds.
ACCESSION   M28053
KEYWORDS    cyclomaltodextrin glucanotransferase.
SOURCE      Bacillus sp. (strain 17-1; ATCC 31007) DNA.
  ORGANISM  Bacillus sp.
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2648)
  AUTHORS   Kaneko,T., Song,K.-B., Hamamoto,T., Kudo,T. and Horikoshi,K.
  TITLE     Construction of a chimeric series of Bacillus cyclomaltodextrin
            glucanotransferases and analysis of the thermal stabilities and pH
            optima of the enzymes
  JOURNAL   J. Gen. Microbiol. 135, 3447-3457 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by T.Kaneko, 18-SEP-1989.
FEATURES       from  to/span     description
    pept        224     2365     cyclomaltodextrin glucanotransferase precursor
                                 (EC 2.4.1.19)
    sigp        224      304     cyclomaltodextrin glucanotransferase signal
                                 peptide
    matp        305     2362     cyclomaltodextrin glucanotransferase
BASE COUNT      692 a    777 c    652 g    527 t
ORIGIN      
        1 gaattcgctt tcatttacaa tattgttgaa atcaaacact cgcataattt aagggccatg
       61 cattccgtca ccgcacaccc ggtatggaac aaccccggtt tctcttcagg agatgccggg
      121 gttttttgtt gccctatatt tcgtacagga ggtgattccc cgcagcttcg gtttgatcat
      181 tgaaccaacc cccattctat aactgctaca taggaggaat acgatgaaaa aaatctccaa
      241 acttacaacc gctcttgccc ttagcttgtc cctggcgcta agcttgctcg gcccagccca
      301 tgcagccccg gatacctcgg tatccaacaa gcaaaatttc agcacggatg tcatttatca
      361 aatctttacg gaccgtttct ccgacggcaa tcctgccaac aacccgaccg gcccggcctt
      421 tgacggtacc tgcaccaacc tgaggcttta ctgcggcggc gactggcaag ggatcatcaa
      481 caaaatcaac gacggctacc tgaccggcat gggcgttacc gccatctgga tctcccagcc
      541 ggtcgagaac atctacagcg tcatcaatta ttccggcgtg aataacaccg cataccacgg
      601 ctattgggct agagacttca agaaaacgaa cccggcttac ggcacgattg ccgatttcca
      661 gaatctcatc gctgccgccc atgccaagaa catcaaagtc atcatcgact tcgcaccgaa
      721 ccacacgtct cccgcttcct tggaccagcc gtcgtttgcg gagaacggca agctgtacaa
      781 caacggcagg gacgaaggcg gatacacgaa cgatacgcac aatttattcc atcataacgg
      841 gggcaccgat ttctcgacga ccgagaacgg gatttacaaa aacctgtacg atcttgccga
      901 tctgaaccac aacaacagca cagtggatac ctacctgaag gatgccatta aaatgtggct
      961 cgatctgggc attgacggca ttcgaatgga cgcggtaaaa catatgccgt tcggctggca
     1021 gaagagcttc atggccaccg tcaacaatta caagccggtc ttcaccttcg gcgaatggtt
     1081 cctcggggta aacgaagtga gtgccgagaa tcataagttc gccaacgttt ccggcatgag
     1141 cctgctggac ttccgcttcg cgcagaaggt tcgccaggta ttcaaggaca acaccgacaa
     1201 tatgtacgga ctgaaatcca tgctagaggg ctctgcgacc gattacgcgc agatggagga
     1261 tcaggtcacg ttcatcgata accacgacat ggaacgtttc cacaataaca gcgccaaccg
     1321 ccggaagctg gagcaagcgc tcgccttcac gttaacttcc cgcggcgtgc cagccatcta
     1381 ttatggaacc gagcaataca tgtccggcgg gaacgatccc gacaaccggg ctcgaattcc
     1441 ttctttctct accacgacta cggcttatca ggtcagtaaa aagctggcgc ctctgcgtaa
     1501 atccaatccg gccatcgctt atgggacgac gcaggagcgc tggataaaca acgatgtgtt
     1561 gatctatgag cgcaagttcg gcaacaacgt agccgtcatc gccgtcaacc gcaacgtgaa
     1621 cacgtcggcc tccattacgg gactggtaac ctcactgccg gccggcagct acaccgacgt
     1681 gctgggcggc ctcttgaacg gtaacaattt gaccgtcggc tccggcggat ccgcttccat
     1741 cttcacgctg gcagcgggcg gtacggccgt gtggcaatac accacggccg tgactgcgcc
     1801 aaccatcggg cacgtaggac cgatgatggc caaaccaggc gcagccgtca ccattgacgg
     1861 ccgcggtttc ggtgcaacca aaggcacggt atacttcggc acgacggcag ttaccggcgc
     1921 taatattacg gcttgggaag acacgcagat caaagtgaaa atccctgccg ttgccggagg
     1981 cgtatacaat atcaaaatcg ccaacagtgc cggaacctca agcaatgtgc atgacaactt
     2041 cgaagtgctg agcggggatc aggtcagcgt gcgctttgtg gtgaacaacg ccaccaccgc
     2101 gctcggccag aacgtatatc tggcaggcag cgtaagcgag ctcggcaact gggacccggc
     2161 caaagccatc ggtccgttgt acaaccaggt catctaccaa tacccaacct ggtactatga
     2221 cgtcaccgtt cccgccggca aaacgatcga atttaaattc ctgaaaaaac agggctcgac
     2281 ggtaacgtgg gaaggcggct cgaaccatac cttcaccgca ccaacgagcg gcacggccac
     2341 cataaacgtg aactggcaac cgtaagttca agaacgacta gcttctgccg aagaacggcc
     2401 cctaaacaaa accagctccg gatatgatcc cggagctggt tttgttcatg caggtcataa
     2461 ttccagtagc aatagcccgt tacgcgaagc atctcacagt tcctctgaac tgcatgagcg
     2521 attcccaaag agagagcgtg ctcagcaggc agtaactcag cattcctctg cattgcatga
     2581 gcgattccca aagggagagc gtctcaacaa gtagtgactc agcaagcaga cgggacaact
     2641 caacgatc
//
LOCUS       BSUDIV       1403 bp ds-DNA             BCT       14-FEB-1990
DEFINITION  B.subtilis division initiation protein (divIB) gene, 5' end.
ACCESSION   M31800
KEYWORDS    division initiation protein.
SOURCE      B.subtilis (strain 168; prototroph SB19) DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1403)
  AUTHORS   Harry,E.J. and Wake,R.G.
  TITLE     Cloning and expression of a Bacillus subtilis division initiation
            gene for which a homolog has not been identified in another
            organism
  JOURNAL   J. Bacteriol. 171, 6835-6839 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      165     ORF1 (AA at 1)
    pept        295     1086     division initiation protein
    pept       1083  >  1403     ORF3 (ttg start codon)
    signal      213      240     transcription terminator
BASE COUNT      521 a    229 c    313 g    340 t
ORIGIN      
        1 ggcggtgcaa agatatcgga gatgcacgga aacttcatcg tcaatgcggg gggagcatca
       61 gcaaaagatg tgcttgatct cattgaccat gtgaaaaaga caatccgtga aaaatacgag
      121 attgatatgc acacagaggt tgaaatcatc ggcggaaatc gctgattcaa gttctgactg
      181 aagctgttca tatgatatac tgtaagcaaa cgacaaacgg catcatagta tgccgtttgt
      241 tttggaatag acagactttt aacagctgtt tcatttgaat gaggtgaaca ggcaatgaac
      301 ccgggtcaag accgagaaaa aatcgttaac attgaagagc ggatccctaa aattaaagag
      361 cagaggaaac aaaaggcaaa ccgccgtctc atttcattta ttatgctctt ttttattatg
      421 gtgctgatca ttgtgtacct gcaaacacca atcagtaaag tatcaacaat ctctgttaca
      481 gggaatgaaa atgtatcaaa aaaggaaatt atcgaccttt ccgatattaa cagcggagat
      541 actgagtttt ggagtttgga caagcaaaaa acagaaaaga agattcagca aaataaatta
      601 gtgaaaaaag cggagatcag caaatcgctt cctaataaaa tcaacattgc aatcgaagaa
      661 tataaagcga ttgcttattt agaaaaagac gatgtgtact atgaagtgct tgaaaacggc
      721 tcagttctgc caaacgaagt aacgccggat gatgccggac cgattttagt gaattggaca
      781 aacgcgaaga agcggtctca aatggccaag cagctcgatg cgttgtcaaa ttcgttaaaa
      841 cagtcgattt cagaaattta ttatacgcct gtaaaaatgg atgaaaaccg tattaagttg
      901 tacatgaacg acggttatgt tgttacggca tctattaaaa cgtttgcaga ccgaatgaag
      961 acgtatcctt ctatcatctc acagctaagc agcaataaaa aaggaatcat tcatttagaa
     1021 gttgccacct attttgaaga gtttgggaaa aacgataaag ctgctaaaaa ggaagatgaa
     1081 aattgagggg caaatcagca gtccttcttt ccttgattat gcttatcgca ggttttttga
     1141 tttcattttc ttttcaaatg acaaaagaaa acaacaaaag cgcggctgag acggaagagt
     1201 ggaaaaaaga atatgcgctg agagatgaac tgctgaaaca agagaaagaa aataagaagt
     1261 ttgaaaaaga attatatcaa aagcaaaaca aagtccgcca ggcggaaaat aagctgaaga
     1321 aagagaagtc ggaatactat aatgtcctcg aagatacaga aaaatacagg atgtatattg
     1381 gggaagtcgg tgtgcaggga gaa
//
LOCUS       HUMDF3AA      572 bp ss-mRNA            PRI       16-FEB-1990
DEFINITION  Human breast carcinoma-associated antigen DF3 mRNA, 5' end.
ACCESSION   M31823
KEYWORDS    antigen DF3; glycoprotein.
SOURCE      Human breast carcinoma cell line MCF-7, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 572)
  AUTHORS   Abe,M., Siddiqui,J. and Kufe,D.
  TITLE     Sequence analysis of the 5' region of the human DF3 breast
            carcinoma-associated antigen gene
  JOURNAL   Biochem. Biophys. Res. Commun. 165, 644-649 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         67  >   572     DF3 antigen
    mRNA          1  >   572     DF3 mRNA
BASE COUNT      114 a    236 c    129 g     93 t
ORIGIN      
        1 acctctcaag cagccagcgc ctgcctgaat ctgttctgcc ccctccccac ccatttcacc
       61 accaccatga caccgggcac ccagtctcct ttcttcctgc tgctgctcct cacagtgctt
      121 acagctacca cagcccctaa acccgcaaca gttgttacgg gttctggtca tgcaagctct
      181 accccaggtg gagaaaagga gacttcggct acccagagaa gttcagtgcc cagctctact
      241 gagaagaatg ctgtgagtat gaccagcagc gtactctcca gccacagccc cggttcaggc
      301 tcctccacca ctcagggaca ggatgtcact ctggccccgg ccacggaacc agcttcaggt
      361 tcagctgcca cctggggaca ggatgtcacc tcggtcccag tcaccaggcc agccctgggc
      421 tccaccaccc cgccagccca cgatgtcacc tcagccccgg acaacaagcc agccccgggc
      481 tccaccgccc cccaagccca cggtgtcacc tcggccccgg acaccaggcc ggccccgggc
      541 tccaccgccc cccaagccca cggtgtcacc tc
//
LOCUS       MUSH2LINK    2252 bp ss-mRNA            ROD       14-FEB-1990
DEFINITION  Mouse MHC H-2K/t-w5-linked open reading frame mRNA, complete cds.
ACCESSION   M32010
KEYWORDS    .
SOURCE      Mouse embryo carcinoma cell line F9, cDNA to mRNA, clone 10.2a.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2252)
  AUTHORS   Han,St.-Jacques.B., MacMurray,T.-H. and Shin,A.H.-S.
  TITLE     A putative transmembrane protein with histidine-rich charge
            clusters encoded in the H-2K/t-w5 region of mice
  JOURNAL   Mol. Cell. Biol. 10, 138-145 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        301     1611     MHC H-2K/t-w5-linked ORF precursor
    sigp        301      396     MHC H-2K/t-w5-linked ORF signal peptide (put.)
    matp        397     1608     MHC H-2K/t-w5-linked ORF
    mRNA    <     1     2252     MHC H-2K/t-w5-linked ORF mRNA
BASE COUNT      491 a    586 c    702 g    472 t      1 others
ORIGIN      
        1 tcccggagcc ggtgagaggt ccctgctgct cccttacggc gctttctagg cctttacccc
       61 aacgagtggg ccatagagac gcgggcccag agagaccgta aagttgctga tcaaaggcta
      121 gagcggtgtc gggggtgggg ggctgcatcc aggaagggtg ttggggatga ggtggaccgg
      181 ccttggggac aatgtaagag cggagcaagt agtatagagg aagggcttca agggacgcgg
      241 atcccgaata ggtagattga gagtcaagtc gagtcgtctc ttgttcctcc ggtcagcgtg
      301 atgaccatgg gcctgcgggc cccccactgg gtggctgtgg gactgctgac ctgggcggct
      361 ttggggctgc tggtggccgg acacgagggt catggtgacc tgcacaaaga tgtggaagag
      421 gacttccatg gccacagtca cgggcactca catgaagatt tccaccatgg ccacagccac
      481 gggcactcgc atgaagattt ccaccatggc cacggccaca ctcatgaaag catctggcat
      541 gggcatgccc acagccacga ccatggacac tcacgtgagg aattacacca tggccatagc
      601 catggccact cccacgatag cctccaccac ggaggacatg gacatgccca tcgtgaacat
      661 agccatggga cttctaggga ggctggggct ccaggcatca aacaccacct ggacactgtc
      721 accctctggg cctacgcact gggggccaca gtgctgatct ccgcagctcc gttcttcgtg
      781 ctgttcctca tcccagtaga atctaactct cccaggcacc gctctctgct ccagatcctg
      841 ctcagttttg cttccggggg gctcctgggt gatgcgttcc tccacctcat cccgcatgcc
      901 ttggaacctc attctcacca cgctccagag cagcctganc atggacactc ccacagtggc
      961 cagggcccca ttctctctgt ggggctgtgg gttctcagtg ggattgtcgc cttcctcgtg
     1021 gtggagaaat ttgtgagaca cgtgaaagga ggacatggac acagtcacgg acacggagac
     1081 aggcatgcgc atggagacag tcacacccat ggagatagac atgagtgttc ttcaaaggaa
     1141 aagcccagca cagaggaaga gaaggaagtg ggcgggttgc ggaaaaggag aggaggaaac
     1201 actgggccca gagatggccc ggtgaaacct cagagccctg aagaagaaaa agcaggctca
     1261 gacctgcgtg tgtctgggta cctgaatctg gctgctgact tggcacacaa cttcacagac
     1321 ggtctggcca ttggtgcttc ctttcgtggg ggccgagggc tagggatcct gaccacaatg
     1381 acagtcctgc tgcacgaact gcctcatgag gttggggatt ttgccatcct ggtccagtct
     1441 ggctgcagca agaagcaggc gatgcgtctg caactcgtga ctgcaattgg agcattggca
     1501 ggcactcgtg tgcccttctc accgagggag gggcagtgga cagtgacgtg gcaggtggtg
     1561 caggtcctgg ctgggtcctg ccattcactg caggcggatt tatctacgta gcaacagtgt
     1621 ctgtgctgcc tgagctattg agagaccgat ctccactgca gtcactgttg gaggtgctgg
     1681 ggctgctggg gggtgttgcc atgatggtac tgattgccca tcttgagtga ggggtgaggt
     1741 gacctgtcct tcccctaact cctaataaag gcagttggta gtcctggcca gtgctaatgc
     1801 cagaaggagt gtcagccttg gagattagca gagcctctgt ttgaggcctt agaggtatga
     1861 ggatcatacg gagcatgaga ggccagaagg accacggagt gggagactgc ccagcagcgt
     1921 tgttgctttt ggaaaaatca atgggaccat gaagaagact gggaagacgg tagcctaccc
     1981 gatggtccct aatctacgat ttctcagcgc cacgtgccgc aggggtccct ccaaggccca
     2041 tctccctctg aactagttag tgatggcttc agggaagacc tggcagaagg actggggtag
     2101 acatcaatcg tgtgtcctga tttgaagggg gaggggggct ccttgggaag atgtctcagc
     2161 ctgatttttt gtctctactc attttatacc actgtttgaa tgggtgcgga ggaacggtga
     2221 ccaggaataa aagaccttgg atcttcggcc cc
//
LOCUS       PSEMVAAB     1117 bp ds-DNA             BCT       16-FEB-1990
DEFINITION  P.mevalonii HMG-CoA-reductase (mvaA) gene, 3' end, and
            HMG-CoA-lyase (mvbB) gene, complete cds.
ACCESSION   M31807
KEYWORDS    3-hydroxy-3-methyl glutaryl CoA-lyase;
            3-hydroxy-3-methyl glutaryl CoA-reductase; HMG-CoA-lyase;
            HMG-CoA-reductase.
SOURCE      P.mevalonii DNA.
  ORGANISM  Pseudomonas mevalonii
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Acidiphilium cryptum; Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1117)
  AUTHORS   Anderson,D.H. and Rodwell,V.W.
  TITLE     Nucleotide sequence and expression in Esherichia coli of the
            3-hydroxy-3-methylglutaryl coenzyme A lyase gene of Pseudomonas
            mevalonii
  JOURNAL   J. Bacteriol. 171, 6468-6472 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      135     HMG-CoA-reductase (mvaA; AA at 1)
    pept        147     1052     HMG-CoA-lyase (mvaB)
BASE COUNT      188 a    362 c    380 g    187 t
ORIGIN      
        1 catgcgcgca atattgccgt ggtggcgggc gcccgaggcg atgaggtgga ctgggttgcc
       61 cggcagttgg tggaatacca cgacgtgcgc gccgaccgcg ccgtagcagt gctgaaacaa
      121 aagcgcggcc aatgagcgtg gtcccgatgc aagcggtaaa ggtctttgaa gtcggccccc
      181 gtgacggcct gcagaacgaa cgccagccgc tgtcggtggc cgcccgtgtg ggcttgatcg
      241 gcgaactggc tggcaccggc ctgcggcata tcgaagccgg cgccttcgtg tcgccgcgct
      301 gggtgccgca gatggccggc agcgacgagg tgttgcgcca gttgcccagc aacgacgggg
      361 tcagttacac ggccctggtg cccaaccggc aaggcttcga ggccgcgcaa cgggctggct
      421 gccgcgaggt agcggtgttc gccgccgcct ccgaggcgtt ttcgcgcaac aacatcaatt
      481 gctccatcga tgaaagcttc gagcgcttca ccccggtgtt gcgcgccgcc aacgaagcct
      541 ctatccgggt gcgcggttat gtatcctgcg tgctcggttg cccgttcagt ggggccgttg
      601 cgccggaggc tgtggccaag gtcgcacgcc gcctgtacga actgggctgc tacgaaatca
      661 gcctgggtga caccattggc gccggccgcc cggatgaaac ggctcaattg ttcgagctct
      721 gcgcacggca actgccggtc gcggcactgg ccggccactt ccacgatacc tggggcatgg
      781 ccatcgccaa tgtgcatgcc gcactcgcgc agggtgtacg caccttcgac agctcggtcg
      841 cgggcctcgg cggctgcccc tactcgccgg gtgccagcgg taacgtggcc acggaagatc
      901 tgttgtacct gctgcacggc ctgggctaca gcaccggtgt cgacctggag gcggtggcac
      961 aggttggtgt gcgcatcagc gcgcagctgg gcaccgccaa ccgctcccgt gccggccttg
     1021 ccctggcagc aaggagcgcc cgcgaacact gatagcgcct gtggctcccg aacaaaaaca
     1081 acaagcccgc aagggcagga gaatgccaat gaccttt
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       HUMFAPS      1148 bp ss-mRNA            PRI       12-JAN-1990
DEFINITION  Human farnesyl pyrophosphate synthetase mRNA, complete cds.
ACCESSION   J05262
KEYWORDS    farnesyl pyrophosphate synthetase.
SOURCE      Human hepatoma cell line HepG2, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1138)
  AUTHORS   Wilkin,D.J., Kutsunai,S.Y. and Edwards,P.A.
  TITLE     Isolation and sequence of the human farnesyl pyrophosphate
            synthetase cDNA: Coordinate regulation of the mRNAs for farnesyl
            pyrophosphate synthetase, 3-hydroxy-3-methylglutaryl coenzyme A
            reductase, and 3-hydroxy-3-methylglutaryl coenzyme A synthase by
            phorbol ester
  JOURNAL   J. Biol. Chem. 265, 4607-4614 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1139 to 1148)
  AUTHORS   Wilkin,D.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.J.Wilkin, 12-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept          7     1068     farnesyl pyrophosphate synthetase (EC 2.5.1.1)
BASE COUNT      302 a    272 c    324 g    250 t
ORIGIN      
        1 cacagaatga acggagacca gaattcagat gtttatgccc aagaaaagca ggatttcgtt
       61 cagcacttct cccagatcgt tagggtgctg actgaggatg agatggggca cccagagata
      121 ggagatgcta ttgcccggct caaggaggtc ctggagtaca atgccattgg aggcaagtat
      181 aaccggggtt tgacggtggt agtagcattc cgggagctgg tggagccaag gaaacaggat
      241 gctgatagtc tccagcgggc ctggactgtg ggctggtgtg tggaactgct gcaagctttc
      301 ttcctggtgg cagatgacat catggattca tcccttaccc gccggggaca gacctgctgg
      361 tatcagaagc cgggcgtggg tttggatgcc atcaatgatg ctaacctcct ggaagcatgt
      421 atctaccgcc tgctgaagct ctattgccgg gagcagccct attacctgaa cctgatcgag
      481 ctcttcctgc agagttccta tcagactgag attgggcaga ccctggacct cctcacagcc
      541 ccccagggca atgtggatct tgtcagattc actgaaaaga ggtacaaatc tattgtcaag
      601 tacaagacag ctttctactc cttctacctt cctatagctg cagccatgta catggcagga
      661 attgatggcg agaaggagca cgccaatgcc aagaagatcc tgctggagat gggggagttc
      721 tttcagattc aggatgatta ccttgacctc tttggggacc ccagtgtgac cggcaaaatt
      781 ggcactgaca tccaggacaa caaatgcagc tggctggtgg ttcagtgtct gcaacgggcc
      841 actccagaac agtaccagat cctgaaggaa aattacgggc agaaggaggc tgagaaagtg
      901 gcccgggtga aggcgctata tgaggagctg gatctgccag cagtgttctt gcaatatgag
      961 gaagacagtt acagccacat tatggctctc attgaacagt acgcagcacc cctgccccca
     1021 gccgtctttc tggggcttgc gcgcaaaatc tacaagcgga gaaagtgacc tagagattgc
     1081 aagggcgggg agaggaggct ctcaataaat aatcgtgtaa ccttaaaaaa aaaaaaaacc
     1141 tcgacgat
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       MUSPROA      1987 bp ss-mRNA            ROD       23-JAN-1990
DEFINITION  Mouse protective protein (Mo54) mRNA, complete cds.
ACCESSION   J05261
KEYWORDS    glycoprotein; protective protein; serine carboxypeptidase.
SOURCE      Mouse (strain BCBA) testis and brain, cDNA to mRNA, clones
            lambda-B1 and lambda-T2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1987)
  AUTHORS   Galjart,N.J., Gillemans,N., Meijer,D. and d'Azzo,A.
  TITLE     Mouse "protective protein": cDNA cloning, sequence comparison, and
            expression
  JOURNAL   J. Biol. Chem. 265, 4678-4684 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.d'Azzo, 22-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept        214     1638     protective protein precursor
    sigp        214      282     protective protein signal peptide
    matp        283     1635     protective protein
    signal     1966     1971     poly-A signal
BASE COUNT      453 a    580 c    522 g    432 t
ORIGIN      
        1 ggaactgctc gatgcgcaga tagggttcaa ccagtagggc ttggggacca ctggggaccg
       61 ctgcctccgc ttctggttga ccttcccggg gcacacggtt gtggtggcgt gtgatagcga
      121 agacccaact gtggccaagg ctgaccaagt caggcagtga aaactcgggt acagcggcca
      181 gactggcggg atccagagcg gtcaggccaa gacatgcccg gaaccgcgct gtctccactg
      241 ctcttgttgc tgctcctgtc ctgggcgtcc cggaacgaag cagctccgga ccaggatgaa
      301 atcgattgtc tccccggcct ggccaagcag ccctctttcc ggcaatactc cggctacctc
      361 agagcatcgg actccaagca cttccactac tggtttgtgg agtcgcagaa cgacccaaag
      421 aacagccccg tggtgctttg gcttaacggg ggtcccggct gcagctcgct cgatgggctg
      481 cttacagagc acggcccctt tctgatccag ccagatggtg tcaccctgga gtacaacccc
      541 tatgcttgga acctgattgc caacgtgctg tatatcgagt ccccagctgg ggtgggcttc
      601 tcctactcgg atgacaagat gtacgtgacc aatgacacag aggtggcgga gaacaattat
      661 gaagccctta aagacttctt ccgcctcttt ccggaataca aggacaacaa acttttcctg
      721 acaggagaga gctatgctgg catctacatc cccaccttgg ctgtactggt catgcaggat
      781 cctagcatga atcttcaggg gctggctgtg ggcaatggac ttgcctccta tgagcagaac
      841 gacaactccc tggtctactt tgcctactac catggccttc tggggaacag actttggact
      901 tcactgcaga cccactgctg cgctcagaac aagtgtaact tctatgacaa caaagaccca
      961 gagtgtgtaa acaatctcct ggaagtgtct cgaattgtgg gcaaatctgg cctcaacatc
     1021 tacaatctct atgctccgtg tgctggtggg gtgcccggca gacatagata tgaggacaca
     1081 cttgtagtcc aggattttgg caacatcttc actcgcctgc cacttaagcg gagatttcct
     1141 gaggcactga tgcgttctgg ggacaaggta cgcttggatc ctccctgcac caacaccaca
     1201 gccccttcca actacctcaa caacccctat gttcggaagg ctctccacat ccccgagtcg
     1261 ctgccccgct gggacatgtg caacttcttg gtgaatttac agtaccgccg cctctaccaa
     1321 agcatgaact cccagtacct gaagctgctc agttcacaga aataccagat cctgctctac
     1381 aacggagatg tggacatggc ctgcaacttc atgggcgatg agtggtttgt ggattcgctc
     1441 aaccagaaga tggaggtgca gcgccggccc tggctagtgg actacgggga gagcggagaa
     1501 caggtagctg gtttcgtgaa ggagtgttca cacatcacct tcctcaccat caagggtgcc
     1561 ggacacatgg tccccacgga caagcctcga gctgctttta ccatgttctc gaggttcctg
     1621 aacaaagagc cttactgata gcttccgctg caacccatgg aaatcctcag tacctgatca
     1681 gcccctctcg gcctctctca gcagagagga gtcctccgct aatgctcccg tggggcaagt
     1741 cctgtactaa aactgcccag acttctcaca gcctggtgca tcccagcctg ggcctcaggg
     1801 cgtcacagac agcctaggac caagtgagca ctttgttccg gccatcgtgg ggtggcctga
     1861 cctccccgac tgcataaaaa ctgaccaccc caggacccca acagagctag agaaacccaa
     1921 ggaggggggt ggatggattc taattgatgg ttgactaatt ctggaattaa atcgggtata
     1981 gcttcca
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       RATCAA       1150 bp ds-DNA             ROD       07-FEB-1990
DEFINITION  Rat cytosolic aspartate aminotransferase gene, exon 1.
ACCESSION   J05263
KEYWORDS    cytosolic aspartate aminotransferase.
SOURCE      Rat (strain Wistar) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1150)
  AUTHORS   Pave-Preux,M., Aggerbeck,M., Veyssier,C., Bousquet-Lemercier,B.,
            Hanoune,J. and Barouki,R.
  TITLE     Hormonal discrimination among transcription start sites of
            aspartate aminotransferase
  JOURNAL   J. Biol. Chem. 265, 4444-4448 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Barouki, 05-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        684  /   801     cytosolic aspartate aminotransferase, exon 1
                                 (EC 2.6.1.1)
    pre-msg     592  >  1150     cytosolic aspartate aminotransferase mRNA and
                                 intron (alt.)
    pre-msg     614  >  1150     cytosolic aspartate aminotransferase mRNA and
                                 intron (alt.)
    pre-msg     618  >  1150     cytosolic aspartate aminotransferase mRNA and
                                 intron (alt.)
    pre-msg     620  >  1150     cytosolic aspartate aminotransferase mRNA and
                                 intron (alt.)
    pre-msg     628  >  1150     cytosolic aspartate aminotransferase mRNA and
                                 intron (alt.)
    pre-msg     630  >  1150     cytosolic aspartate aminotransferase mRNA and
                                 intron (alt.)
    pre-msg     635  >  1150     cytosolic aspartate aminotransferase mRNA and
                                 intron (alt.)
    pre-msg     649  >  1150     cytosolic aspartate aminotransferase mRNA and
                                 intron (alt.)
    IVS         802  >  1150     cytosolic aspartate aminotransferase intron A
BASE COUNT      232 a    334 c    338 g    246 t
ORIGIN      
        1 ctgcagggtc tctcctgcca gcgatcaagg tagacaacac ccccaggttt ctttctgcgg
       61 aagcactgag tggagatgga gaaggtcacc cgcgatcggg tacagcgtgt ccaggcggac
      121 cactagagtc cagtgtcggg gacttggctg tcagcccggc ctggctaacg caggaacgtg
      181 cgcaggaacg gagcctggcg gggacagcca cctgctcagc tggtacagaa agacctgttc
      241 tctactattc ccgagggcgg gccaccgccc caggtctcca ggacccagaa ggtcaggagg
      301 ctgtcctcta gggactcccg ccacccgcat catcctcatc accctaagga ataaaagcct
      361 cagcgattgg aacacgctat gccaatcatc ctagtctttg cccacgcctc ctccggtgtc
      421 cactgcattg gttgcatcat acaagcctcc gataagatcc tcctcttgaa ttggctaata
      481 gagccttgtc ccgcccaacc cttctttgtc ggaaggttgt gattggttaa gcgcgttgcc
      541 aagctccggg cgcggctcgg tcattggaag ccgcgggctg ccccgcccgc cggtgaaggt
      601 gagagtctcc tccagtcaca ggtccccgca ccgcacggag cagctcccag ctcgtgctcg
      661 aggtcccctc agattccatc gcgatggccc ctccatcatt ctttgcccag gttccacagg
      721 ccccgccggt tctggtcttt aagctcattg cggacttccg ggatgatccc gatccccgca
      781 aggttaacct cggcgtggga ggtaaggatg cagtgccctg gaatgcgagc agctttgtgg
      841 ggggaggtgg gcttaggagg aactgaggct tagtttggtg cctccctgag aggggactgg
      901 ggagagaagg ctgggaccac gagggcatca ccagcctacc catttcacag atggcgatac
      961 agagtccttg gaaagaggcg gtgttcaagg tcacacagga caggtgtaac atctgggaca
     1021 catgccgttt ttgtttcttc caccctgtct tctatatagt acttgttcta aagcaaccat
     1081 ggggtggtaa ggagagagaa gtctggtagc tttggcgtca tggtcagctg ttagcctaat
     1141 ggagctgcag
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       RATSKR       2979 bp ss-mRNA            ROD       16-FEB-1990
DEFINITION  Rat substance K receptor mRNA, complete cds.
ACCESSION   M31838
KEYWORDS    substance K receptor.
SOURCE      Rat stomach, cDNA to mRNA, clone prTKR1-1.
  ORGANISM  Rattus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2979)
  AUTHORS   Sasai,Y. and Nakanishi,S.
  TITLE     Molecular characterization of rat substance K receptor and its
            mRNAs
  JOURNAL   Biochem. Biophys. Res. Commun. 165, 695-702 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        449     1621     substance K receptor
    signal     2957     2962     poly-A signal
BASE COUNT      721 a    859 c    729 g    670 t
ORIGIN      
        1 ggcccacaga cttggcacac tgtgcacctc cgtcatctgg gatgtaccct gaatagcatt
       61 aagagagatc tgaattcaaa ttcgggtttg tcaccaactt ccggaatgcc tgttgctgga
      121 cgcgtctttt ctctgactct cagcgttctg actctggtgt gaaagagagg agccgatcca
      181 acgcgctatc aggatgcctc ctgctctgag actggagttt gagttctcca tctccaagag
      241 atgcttgaac agtctgattg ggaccttgct tcctgctcag cctgactcat ttctgcccga
      301 gctgtaggcg tggccctctg gctacacgtg tctgcttttg aatcgtcctt ccccccttaa
      361 atctacacac cgaagaggat ttctcttctg tggcaattcg agacggaact gcagggccag
      421 accctgtgtt ccaggcccag aatcagtcat ggggacccgt gccattgttt ctgacgccaa
      481 catcttgtct ggcctcgaga gcaatgccac gggtgttaca gccttctcca tgcctggctg
      541 gcagctggcg ttatgggcca cagcctacct ggccctggtg ctggtggctg tgacaggcaa
      601 tgccacagtc atctggatca ttctggccca cgagaggatg cgcacagtca ccaactattt
      661 catcatcaac ctggccttgg cggacctctg catggcagcc ttcaacgcga ccttcaactt
      721 catctacgcc agtcacaaca tctggtactt cggccgtgcc ttctgctatt tccagaacct
      781 ctttcccatc acagccatgt tcgtcagcat ctactccatg accgccattg ctgctgacag
      841 gtacatggcc atcgttcacc ccttccagcc acggctctcg gcccccagca ccaaggcgat
      901 tatcgctggc atctggctgg tagccctggc tctcgcctcc ccgcaatgct tctactccac
      961 catcactgtg gacgaggggg ccaccaagtg tgtggtggcc tggcccaatg acaacggagg
     1021 caagatgctc ctactgtatc atctggtcgt gtttgtcctc atctacttcc tgcctctcct
     1081 ggtgatgttc ggggcttaca gtgtcatcgg cctcacactg tggaagcgcg cggtacccag
     1141 acaccaggct cacggagcca acttacgcca tctacaggcc aagaagaagt ttgtgaaggc
     1201 catggtactg gtggtgctga catttgccat ctgctggctg ccctaccacc tctacttcat
     1261 cctggggacc ttccaagagg acatctacta ccacaagttt atccagcagg tctacctggc
     1321 gctcttctgg ctggccatga gctccacaat gtacaaccct atcatttatt gctgccttaa
     1381 ccacaggttt cgctctggat tccggcttgc tttccggtgc tgcccctggg tgacaccaac
     1441 tgaggaagac aggctggagc tgactcacac gccatccctc tccaggagag tcaaccggtg
     1501 tcataccaag gagactttgt tcatgacggg ggacatgacc cactctgagg ctaccaatgg
     1561 acaggttggg agtccccagg atggggagcc tgctggaccc atctgcaagg cccaggctta
     1621 gcccctttga gaagcagctg acggaaaagc ccctacataa tagcttgtcc cacaccccat
     1681 tcttcaacac accaacagaa aaaacagaat ggtgctcgga atgccacagg caatacaggt
     1741 gtcaacaaag aactctgagc acagacacct gtcacaggca atccagtgat ggaggaaggc
     1801 gctgggagga ctgggaagcc acacttggcc aacctagaag ctgcaaagtg agatgtctcc
     1861 caggagtata gagagagaag gaaaactgga tcccagcgaa gaaaactgca gttgtaatga
     1921 gaagggcagc tgaaggacag agagatggct tggcgaggtg gaaaagcctc tgctaagtaa
     1981 gcagtggaaa agcctctgct aagccttgtg acctaagtgc agtacctgca acccacaggg
     2041 aaggtaagaa ccaaagccta caagttgtcc tctgacttcc tccaatgtat gcatgcatca
     2101 tgtatgcatc tgtgtgtgca catacatctg cacatgcacg cacatacaaa atatgcgcat
     2161 gcacgtgcac acttacaatg ttaaagagga agaaggcgac agctcccaca ggactgtaac
     2221 agattgcaga ggacaagcgc ccaaagggaa acccagctgc ttcgctcgct ctggaacaca
     2281 attgccaggt tttccgaata aagaaagcct aagatccaaa tatatatgta tgggagctcc
     2341 gggcttctaa atttggttca tgaaaacgct tagagacact ggctaaccca gagcggtgca
     2401 catccacacc accacctcct cgccaaagac aagccacaaa ggaagtcacc tatttggctg
     2461 cgaatggatc catggttttt ctaccttagg atctaaaagc tcccatccag cccagagggg
     2521 agggcagtta acagtccaga ggagcttcag ctgagatgta agatactctg accccctgct
     2581 tgatggcagg attgccgtga ggcttgttgg aaacgatgcc ttcaaaccag caggccacat
     2641 ccagcgaaat gctcaataaa tagtaaccac gattcttcgc acggcgtcag cttttaagac
     2701 acaaaagcca cctacaagtt aggagaatgg ctccccctct cacttctgca gtgaggagct
     2761 ggcaggcaga gcacaccgat ggtcccagtt ctaagaacgc ctggcctagg ccgctccagg
     2821 ccagtccgtc tccctgcttg gtggtcactc atttgctgtc taccccgtgc cctctgaaag
     2881 ccaaagacgc tgtaaacacg tgttcaaagt ttaatgtcac ctgagctttc ccagagttca
     2941 ctgcgccagt acccccaata aagggttctg atgtcactt
//

GenBank-Updates@genbank.bio.net (04/07/90)

LOCUS       RATIGFBP3    2352 bp ss-mRNA            ROD       16-FEB-1990
DEFINITION  Rat insulin-like growth factor-binding protein (IGF-BP3) mRNA,
            complete cds.
ACCESSION   M31837
KEYWORDS    insulin-like growth factor-binding protein.
SOURCE      Rat ovary, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2352)
  AUTHORS   Shimasaki,S., Koba,A., Mercado,M., Shimonaka,M. and Ling,N.
  TITLE     Complementary DNA structure of the high molecular weight rat
            insulin-like growth factor binding protein (IGF-BP3) and tissue
            distribution of its mRNA
  JOURNAL   Biochem. Biophys. Res. Commun. 165, 907-912 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         57      935     insulin-like growth factor binding protein
                                 precursor
    sigp         57      137     insulin-like growth factor binding protein
                                 signal peptide
    matp        138      932     insulin-like growth factor binding protein
    mRNA    <     1     2352     IGF-BP3 mRNA
BASE COUNT      573 a    623 c    606 g    550 t
ORIGIN      
        1 gggagcagcg agcaagcagg tcctcagcgt ccagtcaccg ctctaagcca ggcgccatgc
       61 atcccgcgcg ccccgcgctc tgggcggctg cgctcaccgc cctcactctg ctccgcggac
      121 cgccagtggc gcgggccggc gcgggcgcgg tgggcgcggg ccccgtggtg cgctgcgaac
      181 cgtgcgacgc gcgtgcgctg gcccagtgcg cgcctccgcc caccgcgccc gcgtgcacgg
      241 agctggtgcg agaacccggc tgcggctgct gcctgacttg cgcgctgcgc gaaggcgacg
      301 cgtgcggcgt ctacacggag cgctgtggca ccggcctccg ctgccagccg cgaccggccg
      361 agcagtatcc cctgaaggcg ctgctgaatg gccgcgggtt ctgcgccaac gccagcgccg
      421 ccagcaacct gagtgcctac ctcccctccc agccgtctcc tggaaacacc actgagtctg
      481 aggaggacca caatgctggg agtgtggaaa gccaggttgt ccccagcaca catcgcgtga
      541 ctgattccaa gttccatcca ctccattcaa agatggaggt catcataaaa ggccaggcta
      601 gggacagcca gcgctacaaa gttgactatg agtcccagag cacagacacc cagaacttct
      661 cctccgagtc taagcgggag acagaatatg gtccctgccg cagagaaatg gaggacacac
      721 tgaatcatct gaagttcctc aatgtgctga gtcccagggg cgtccacatc ccaaactgtg
      781 acaagaaggg gttctataag aagaaacagt gtcgcccttc caaaggcaga aagcggggct
      841 tctgctggtg cgtggacaag tacgggcagc cattgccagg ctatgacacc aaggggaaag
      901 acgacgtgca ttgcctcagc gtgcagagcc agtagatacc gctgtgccac ttaacgtgga
      961 gctcaaatac gccttatttt gcacaaaaga ctgccaacaa cgtgatcagc agctggctat
     1021 accttgattt atatttctct ctctctctct ctctctctct ctctctctct ctctctcttt
     1081 tgtggtgaac tgaataaaaa caaacaaaac acatacaaaa acaaaaacaa aaaaaaaagc
     1141 caagtttaga cagatttctg aaatgcctct ggttgtttaa atagtgaact tggtcatctt
     1201 tgtatctcgc agtagtcaac caaaagcagt ttgaattttc ttgttgcttc ctatgaaaac
     1261 cacacgtgta ctccaggcca cggatgccgt cgccccctaa ctcacccacc cactgtgggc
     1321 ttcagtgctg ctggccctct gccttcttga tttcagaggc tctgttgctg atagagaaaa
     1381 accctctttc catcccctgt aagtaagtgc aggcactgtg gagaatgggg aagcctggaa
     1441 cccagtgacc cggacgtctg gaagcatcct cctgaggcct ctggtcctta ttgtgccatc
     1501 tctgaatcaa gggcctggcc ctgtatctgc aagtggcctg acctacttgg gaactgtggg
     1561 agagaaaaat gtgttgtctc tcttactaaa aatgactaag aatgttctag ggcgctccga
     1621 gagcccataa agacaaggac aaggaccttc ctttgtcagg cagcttcctg atgacttggc
     1681 ccagcagaaa tatcaaactc catgtgcaga gatgtcgcaa ataacggtgc gcttagttct
     1741 ccggatgact tcaagaaaac agtgttttct ggcccagcct ctcaaaataa aatttgttgt
     1801 ggggtggggc tgaggggagg cagctttcaa aagagagaag gttttcatct tccttgttgg
     1861 agaccctggt aagaacatgg agagaatcac ctgtttgttg atcttggggt ccttctcaaa
     1921 ctttctttat aattcatgcg tatatgcaga caaaatatgt tcttaattgt taacattgta
     1981 tacaacatag cccaaatata ttagaatctg tactagataa tcctagataa aaggttagag
     2041 atgctaggtg atgtaaccac agacacgccc gaggaaagga gcctgtgtct ggaggctggg
     2101 ccgctttccc cgaggccaag gccatggtgg tctggcaata cagggtgtga ggagactgta
     2161 ctgcatccca cggggtggac atgcgctgta cagagctttc cttgagagca caaaggaatc
     2221 ttgagacatt ctgcctgcct gtcagctttt ctttattttt ttaattaagt ttttggggga
     2281 aaaatgtatt tttgaaaagt ttgtcttgca atgtatttat aaatagtaaa taaagttttt
     2341 ttactattta ag
//

GenBank-Updates@genbank.bio.net (04/09/90)

LOCUS       HUMTSHRX     2415 bp ss-mRNA            PRI       22-FEB-1990
DEFINITION  Human thyroid stimulatory hormone receptor (hTSHR) mRNA,
            complete cds.
ACCESSION   M32215
KEYWORDS    thyroid stimulatory hormone receptor.
SOURCE      Human thyroid, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2415)
  AUTHORS   Misrahi,M., Loosfelt,H., Atger,M., Sar,S., Guiochon-Mantel,A. and
            Milgrom,E.
  TITLE     Cloning, sequencing and expression of human TSH receptor
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 394-403 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         45     2339     thyroid stimulatory hormone receptor
                                 /hgml_locus_uid="TSHR" /map="22q11-q13"
                                 /nomgen="LY0058G"
BASE COUNT      638 a    624 c    527 g    626 t
ORIGIN      
        1 ggcgatttcg gaggatggag aaatagcccc gagtcccgtg gaaaatgagg ccggcggact
       61 tgctgcagct ggtgctgctg ctcgacctgc ccagggacct gggcggaatg gggtgttcgt
      121 ctccaccctg cgagtgccat caggaggagg acttcagagt cacctgcaag gatattcaac
      181 gcatccccag cttaccgccc agtacgcaga ctctgaagct tattgagact cacctgagaa
      241 ctattccaag tcatgcattt tctaatctgc ccaatatttc cagaatctac gtatctatag
      301 atgtgactct gcagcagctg gaatcacact ccttctacaa tttgagtaaa gtgactcaca
      361 tagaaattcg gaataccagg aacttaactt acatagaccc tgatgccctc aaagagctcc
      421 ccctcctaaa gttccttggc attttcaaca ctggacttaa aatgttccct gacctgacca
      481 aagtttattc cactgatata ttctttatac ttgaaattac agacaaccct tacatgacgt
      541 caatccctgt gaatgctttt cagggactat gcaatgaaac cttgacactg aagctgtaca
      601 acaacggctt tacttcagtc caaggatatg ctttcaatgg gacaaagctg gatgctgttt
      661 acctaaacaa gaataaatac ctgacagtta ttgacaaaga tgcatttgga ggagtataca
      721 gtggaccaag cttgctggac gtgtctcaaa ccagtgtcac tgcccttcca tccaaaggcc
      781 tggagcacct gaaggaactg atagcaagaa acacctggac tcttaagaaa cttccacttt
      841 ccttgagttt ccttcacctc acacgggctg acctttctta cccaagccac tgctgtgcct
      901 ttaagaatca gaagaaaatc agaggaatcc ttgagtcctt gatgtgtaat gagagcagta
      961 tgcagagctt gcgccagaga aaatctgtga atgccttgaa tagccccctc caccaggaat
     1021 atgaagagaa tctgggtgac agcattgttg ggtacaagga aaagtccaag ttccaggata
     1081 ctcataacaa cgctcattat tacgtcttct ttgaagaaca agaggatgag atcattggtt
     1141 ttggccagga gctcaaaaac ccccaggaag agactctaca agcttttgac agccattatg
     1201 actacaccat atgtggggac agtgaagaca tggtgtgtac ccccaagtcc gatgagttca
     1261 acccgtgtga agacataatg ggctacaagt tcctgagaat tgtggtgtgg ttcgttagtc
     1321 tgctggctct cctgggcaat gtctttgtcc tgcttattct cctcaccagc cactacaaac
     1381 tgaacgtccc ccgctttctc atgtgcaacc tggcctttgc ggatttctgc atggggatgt
     1441 acctgctcct catcgcctct gtagacctct acactcactc tgagtactac aaccatgcca
     1501 tcgactggca gacaggccct gggtgcaaca cggctggttt cttcactgtc tttgcaagcg
     1561 agttatcggt gtatacgctg acggtcatca ccctggagcg ctggtatgcc atcaccttcg
     1621 ccatgcgcct ggaccggaag atccgcctca ggcacgcatg tgccatcatg gttgggggct
     1681 gggtttgctg cttccttctc gccctgcttc ctttggtggg aataagtagc tatgccaaag
     1741 tcagtatctg cctgcccatg gacaccgaga cccctcttgc tctggcatat attgtttttg
     1801 ttctgacgct caacatagtt gccttcgtca tcgtctgctg ctgttatgtg aagatctaca
     1861 tcacagtccg aaatccgcag tacaacccag gggacaaaga taccaaaatt gccaagagga
     1921 tggctgtgtt gatcttcacc gacttcatat gcatggcccc aatctcattc tatgctctgt
     1981 cagcaattct gaacaagcct ctcatcactg ttagcaactc caaaatcttg ctggtactct
     2041 tctatccact taactcctgt gccaatccat tcctctatgc tattttcacc aaggccttcc
     2101 agagggatgt gttcatccta ctcagcaagt ttggcatctg taaacgccag gctcaggcat
     2161 accgggggca gagggttcct ccaaagaaca gcactgatat tcaggttcaa aaggttaccc
     2221 acgagatgag gcagggtctc cacaacatgg aagatgtcta tgaactgatt gaaaagtccc
     2281 atctaacccc aaagaagcaa ggccaaatct cagaagagta tatgcaaacg gttttgtaag
     2341 ttaacactac actactcaca atggtagggg aacttacaaa ataatagttt cttgaatatg
     2401 cattccaatc ccatg
//

GenBank-Updates@genbank.bio.net (04/09/90)

LOCUS       MUSHSP70     2249 bp ss-mRNA            ROD       23-FEB-1990
DEFINITION  Mouse heat shock-related protein (hcs70t) mRNA, complete cds.
ACCESSION   M32218
KEYWORDS    heat shock-related protein.
SOURCE      Mouse male testicular germ cell, cDNA to mRNA, clones GE6 and HS2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2249)
  AUTHORS   Matsumoto,M. and Fujimoto,H.
  TITLE     Cloning of a hsp70-related gene expressed in mouse spermatids
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 43-49 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1     1893     heat shock-related protein
BASE COUNT      612 a    569 c    583 g    484 t      1 others
ORIGIN      
        1 atggctgcta ataaaggaat ggcgatcggc atcgacctgg gcaccaccta ctcgtgcgtg
       61 ggcgtgttcc aacacggcaa ggtggagatc atcgccgact accaggtcaa ccgcaccacc
      121 cctgactacg tggccttcac cgacaccagc cgcctcatcg agcgctccaa gaaccaggtg
      181 gccatgaatc cccagaacac tgtttttgat gccaaacgtc taattggcag gaagtttaat
      241 gatcctgttg tgcagtcaaa tatggaagct ttggccattt caatgatcaa tgaagccggc
      301 aaacccaagg tgatggtgtc ctataaagga gagaagaaag ccttctaccc agaggagatc
      361 tcatccatgg tactgacgaa gatgaaggag actgcagagg ttttttgggc accaatgtca
      421 caacgtgtga tcacggtgcc agcctatttc aatgactctc agcggcaagc caccaaggat
      481 gcaggtgtca tcgcaggact caatgtgctg agaataatca atgagcccac ggcggtcgca
      541 tgggctaggt tggataaagg aagtcacgca gagcggcacg tgctcatctt cgacctgggg
      601 ggtggcacgt tcgacttgtc catcctgacg atcgacgacg catcttcagt gaagccacgc
      661 ggcgacacgc acctgggagg ggaggacttc gacaaccggc tggtgagcca cttcgtgaga
      721 gtcaagagga agcacaagaa ggacaatcaa gccaagaaca agcgcgcggt gcggcggctg
      781 cgcacggcgt gtgagagggc caagaggacg ctgtcgtcca gcacccaggv aaacctggag
      841 atcgactctt tatatgaggg catcgacttc tacacgtcca tcactagagc acggtttgaa
      901 gagctgtgtg cagacctatt tagaggcaca cttgagcccg tggaaaagtc tcttcgggat
      961 gccaagatgg ataaggctaa aatccatgac attgttctag tagggggctc cacccgcatc
     1021 ccaaaggtgc aaaaaatgct tcaggactac tttaatggac gggatctcaa caagagtatc
     1081 aatcccgatg aggcagtcgc ctacggagct gcagtccagg gagctatttt aatgggcgac
     1141 aaatctgaaa aagtacagga tttgcttttg ttggacgtag ctcccctgtc tctaggattg
     1201 gagacagctg ggggtgtgat gactgtattg atcaagcgga actccaccat ccccaccaag
     1261 cagacgaaga tcttcaccac ctactcggac aaccagcccg gggtattgat ccagtgtacg
     1321 aggcaaacca tacccgacaa caacctggtg gggccctttg acttgactgg aatacctcct
     1381 gcacctagcg tgccacagat cgaggtgacc ttcgacatcg cgcgcaacgg tatcctgaac
     1441 gttacggcca tggacaagag caccggcaag gccaacaaga tcaccatcac caacgacaag
     1501 ggtcgcctga gcaaggagga gattgactcg atcgagagcc ggactgctta caaacgtgag
     1561 gatgagggtc agagggagaa aatcgctgcc aaaaatgcct tagaatcgta cgcctttaat
     1621 atgaagagcg cttgtggtga tgagggtctg aaggacaaga tcacggagtc cgataaaaag
     1681 aaaatactgg ataaatgcaa tgaggttcct ttcctgctga gtcaaaccca gctggctgag
     1741 aaagtagagt ttgatcataa aagaaaagaa ctggaaaata tgtgtaatcc gatcatcaca
     1801 aaactgtacc agagcggatg caccgggccc actgtacgcc agggtatact ccggcagggc
     1861 tccacaggcc ctaccatcga ggaagtagat tagccttttc cagaattgca gggtgctagg
     1921 gtgcctctag gcgaatttta ttcatcttca aacatcaata tgattcttga actgactgga
     1981 ctcagcctac gtcaccagtc ctttggatcc cgatggagaa gctcgaacgc accttttcac
     2041 acccccaccc gtcgtcagtc tatgatcctg aactggacct ttaggaaaac caggcccctc
     2101 tttagagcca tttgagaatt tgatgtctgt atttattatc cacaccccac ctttctcctt
     2161 cctgtgtgtt ggttatttgt ctctcaataa atttgttccc aaaggaaaca actctgcact
     2221 tttattttag caggccttac tgatatgag
//

GenBank-Updates@genbank.bio.net (04/09/90)

LOCUS       MUSGAS3A     1799 bp ss-mRNA            ROD       26-FEB-1990
DEFINITION  Mouse growth arrest specific (gas3) mRNA, complete cds.
ACCESSION   M32240
KEYWORDS    growth arrest specific protein.
SOURCE      Mouse fibroblast cell line NIH-3T3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1799)
  AUTHORS   Manfioletti,G., Ruaro,M.E., Del Sal,G., Philipson,L. and
            Schneider,C.
  TITLE     A growth arrest specific (gas) gene codes for a membrane protein
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Philipson, 15-FEB-1990.
FEATURES       from  to/span     description
    pept        187      621     growth arrest specific protein (gas3)
    mRNA    <     1     1799     gas3 mRNA
BASE COUNT      444 a    487 c    388 g    480 t
ORIGIN      
        1 ccggagcctc ccactgcccc cttgctttgc gcgcgcgtga cccgcagcac agctgtcttt
       61 ggggacgcca gcaacccagt ggacgcaccg gagtttgtgc ctgaggctaa tctgctctga
      121 gatagctgtc cctttgaact gaaacaggca ccgctcctct gatcccgagc ccaactccca
      181 gccaccatgc tcctactctt gttggggatc ctgttcctgc acatcgcggt gctagtgttg
      241 ctcttcgtct ccaccatcgt cagccaatgg ctcgtgggta atggacacac gactgatctc
      301 tggcagaact gtaccacatc cgccttggga gccgtccaac actgctactc ctcatcagtg
      361 agcgaatggc tgcagtctgt ccaggccacc atgatcctgt ctgtcatctt cagcgtcctg
      421 gctctgttcc tgttcttctg ccagctcttc actctcacca aaggcggccg gttttacatc
      481 actggattct tccaaatcct tgctggtctg tgcgtgatga gtgcagcggc catctacaca
      541 gtgaggcaca gtgagtggca tgtcaacact gactactcct atggcttcgc tacatcctgg
      601 cctgggtggc ctttccccta gccctcctca gtggtatcat ctatgtgatc ctgcggaaac
      661 gcgaatgagg cgcccgacga cgcaccgtcc gtctaggctc tgagcgcgca tagggtccac
      721 agggagggag gaaggaaacc agagaacaaa accaaccaac caaaaaagag ctagccccaa
      781 acccaaacgc aagccaaacc aaacagaacg cagttgagtg gggattgctg ttgattgaag
      841 atgtatataa tatctatggt ttataaaacc tatttataac actttttaca tatatgtaca
      901 taggattgtt ttgcttttta tgttgaccgt cagcctcgtg ttgaatctta aacaacttta
      961 catcctaaca ctataaccaa gctcagtatc tttgttttgt ttcgtttttt tttttaatct
     1021 ttttgttttg ctcagacata aaaactccac gtggccccct ttcatctgaa agcagatacc
     1081 tccctcccac tcaacctcat aggataacca aagtgtgggg acaaacccca gacagttgaa
     1141 gacctttaca ctatgggtga cccagtgcat ttagcaggag tatccactgc ccgaatccat
     1201 gtgtgaagcc ctaagcactc acagacgaaa agccctgacc ggaaccctct gcaaaaacag
     1261 taatagctgg tggctcctga acacttgacc ctgtagacgg agtactgggg ccacacgttt
     1321 aaatgagaag tcagagacaa gcaatctgtg aaatggtgct atagatttac cattccttgt
     1381 tattactaat cgtttaaacc actcactgga aactcaatta acagttttat gcgatacagc
     1441 agaatggaga cccgatacaa acggttcata actgctttca tacctagcta ggctgttgtt
     1501 attactacaa taaataaatc tcaaagcctt cgtcagtccc acagttttct cacggtcgga
     1561 gcatcaggac gagcatctag acccttggga ctagcgagtt ccctggcttt ctgggtctag
     1621 agtgttctgt gcctccaagg actgtctggc gatgacttgt attggccacc aactgtagat
     1681 gtatatacgg tgtccttctg atgctaagac tccagacctt tcttgttttt gcttgctttc
     1741 tctgatttta taccaactgt gtggactaag atgcatcaaa ataaacatca gagtaactc
//

GenBank-Updates@genbank.bio.net (04/09/90)

LOCUS       RATGFGDVA     630 bp ss-mRNA            ROD       26-FEB-1990
DEFINITION  Rat glioma-derived vascular endothelial cell growth factor mRNA,
            complete cds.
ACCESSION   M32167
KEYWORDS    endothelial cell growth factor protein.
SOURCE      Rat cell line GS-9L, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 630)
  AUTHORS   Conn,G., Bayne,M., Soderman,D., Kwok,P., Sullivan,K., Palisi,T.,
            Hope,D. and Thomas,K.
  TITLE     Amino acid and cDNA sequences of a vascular endothelial cell
            mitogen that is homologous to platelet-derived growth factor
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.L.Bayne, 16-FEB-1900.
FEATURES       from  to/span     description
    pept          5      577     endothelial cell growth factor protein
                                 precursor
    sigp          5       82     endothelial cell growth factor protein signal
                                 peptide
    matp         83      574     endothelial cell growth factor protein
BASE COUNT      169 a    163 c    172 g    126 t
ORIGIN      
        1 aaccatgaac tttctgctct cttgggtgca ctggaccctg gctttactgc tgtacctcca
       61 ccatgccaag tggtcccagg ctgcacccac gacagaaggg gagcagaaag cccatgaagt
      121 ggtgaagttc atggacgtct accagcgcag ctattgccgt ccgattgaga ccctggtgga
      181 catcttccag gagtaccccg atgagataga gtatatcttc aagccgtcct gtgtgcccct
      241 aatgcggtgt gcgggctgct gcaatgatga agccctggag tgcgtgccca cgtcggagag
      301 caacgtcact atgcagatca tgcggatcaa acctcaccaa agccagcaca taggagagat
      361 gagcttcctg cagcatagca gatgtgaatg cagaccaaag aaagatagaa caaagccaga
      421 aaatcactgt gagccttgtt cagagcggag aaagcatttg tttgtccaag atccgcagac
      481 gtgtaaatgt tcctgcaaaa acacagactc gcgttgcaag gcgaggcagc ttgagttaaa
      541 cgaacgtact tgcagatgtg acaagccaag gcggtgagcc aggctgcagg aaggagcctc
      601 ctcagggttt cgggaactag acctctcacc
//

GenBank-Updates@genbank.bio.net (04/09/90)

LOCUS       HUMSHBGA     6087 bp ds-DNA             PRI       31-JAN-1990
DEFINITION  Human human sex hormone-binding globulin (SHBG) gene, complete cds.
ACCESSION   M31651
KEYWORDS    human sex hormone-binding globulin.
SOURCE      Human adult testis DNA, clone hgSH!BG-6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 6087)
  AUTHORS   Hammond,G.L., Underhill,D.A., Rykse,H.M. and Smith,C.L.
  TITLE     The human sex hormone-binding globulin gene contains exons for
            androgen-binding protein and two other testicular messenger RNAs
  JOURNAL   Mol. Endocrinol. 3, 1869-1876 (1989)
  STANDARD  simple staff_review
COMMENT     A region of SHBGr-3 mRNA contains an inverted exon.  However, the
            paper does not state the boundaries. This exon is related to a
            palindromic sequence "atcttggctcagtctccacctccaagat" located at
            positions 4455-4482.
FEATURES       from  to/span     description
    pept       2643     2753     sex hormone-binding globulin, exon 1
               2886     2977     sex hormone-binding globulin, exon 2
               3150     3339     sex hormone-binding globulin, exon 3
               3670     3831     sex hormone-binding globulin, exon 4
               4059     4218     sex hormone-binding globulin, exon 5
               4349     4485     sex hormone-binding globulin, exon 6
               5218     5425     sex hormone-binding globulin, exon 7
               5670     5818     sex hormone-binding globulin, exon 8
    sigp       2643     2732     sex hormone-binding globulin signal peptide
    matp       2733     2753     sex hormone-binding globulin
               2886     2977     sex hormone-binding globulin
               3150     3339     sex hormone-binding globulin
               3670     3831     sex hormone-binding globulin
               4059     4218     sex hormone-binding globulin
               4349     4485     sex hormone-binding globulin
               5218     5425     sex hormone-binding globulin
               5670     5815     sex hormone-binding globulin
    IVS        2754     2885     SHBG intron A
    IVS        2978     3149     SHBG intron B
    IVS        3340     3669     SHBG intron C
    IVS        3832     4058     SHBG intron D
    IVS        4219     4348     SHBG intron E
    IVS        4486     5217     SHBG intron F
    IVS        5426     5669     SHBG intron G
    mRNA        656      751     SHBG-2 mRNA, exon 1 (5' end unsure)
            <  2643     2753     SHBG-2 mRNA, exon 2
               2886     2977     SHBG-2 mRNA, exon 3
               3150     3339     SHBG-2 mRNA, exon 4
               3670     3831     SHBG-2 mRNA, exon 5
               4059     4218     SHBG-2 mRNA, exon 6
               4349     4485     SHBG-2 mRNA, exon 7
               5670  >  5818     SHBG-2 mRNA, exon 8
    mRNA        540      596     SHBG-3 mRNA, exon 1
    rpt         835     1223     Alu repeat
    rpt        1359     2002     Alu repeat
    rpt        4619     4928     Alu repeat
BASE COUNT     1400 a   1632 c   1640 g   1415 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcggct agctcctaag gcgtgggtac ggaagctaga ttagagcaga agggccccgc
       61 tgctccccga gcaggttccc aaggcgagcc cctccccctg cccccgcctc ctacgacccc
      121 gctctggccg cgccactctg acccccgggt taccggcctg cagtcttcac ccgaatcagc
      181 ctcaggatat ctccacagtc tccctccttg gcctctcgga tccgcacgga agccatccgg
      241 atccccgctg tctgggacca aagtcccagg gcctcgcaaa cggcaactag accccttaaa
      301 gggcctacgg acttggatcc tgaagagcct gagagagcgg ggtggcggga gtcggggggg
      361 acggcggggt agccgcggcc tggtaagtgg agctgggatt ccggcgccgt acgggaggag
      421 agagtaggcc agcgaggcga tcctctgtcc ggcatagccc caccccctcg aattctgtcg
      481 cagcaggggg cacaactgtc agccaatcag cttggagaac aggcacggcc gcgtcccccc
      541 caagccccac ccccgacagc tggatcttgt gactgggctc ctgggtagag ttcaaggttg
      601 gagtgaagcg gcttccttgc ggttgtgtgg gtgtcccaac ctgggtcgag ataccccgcg
      661 gttcaaaggc tcccccgcag tgctttttaa attgacatat gcagtgataa cctgctttag
      721 cctcaggctc actcacccgc ccagaccctg ggtaagcctt aagaccctca gctctgaaag
      781 ctgtttcctg cagctcttga gtagcatgaa gtgttacctc ttgggggcat ttgcattttt
      841 aaatgtttta ttttatattt atttatttat ttatttttgg agatggtgta ttgctttgtc
      901 gcccaggctg gggtgtagtg gcgcgatctc tgctcactgc agcctccacc tcccgagttc
      961 aagcgatcct cctgtctcag cctccggagt agctggaact acagtcgcgc accggcacgc
     1021 ccggctaatt tttttttctt ttttcttctc tttttttttg ttttaacgga ttctcactct
     1081 gtcactcagg ctggagtgca gtggcgcgat ctcggctcac tgcaacctct ccctcctggg
     1141 ttcaagcgat tctcctgcct caccctggag atagctggga ttacaggcat gggcaaccat
     1201 acctggctaa tttttgtatt tttagtagag acggggtttc actatgttgg ccaggctggt
     1261 ctggaactcc tgacctcaag tgatctgccc gcctcagcct tctaaagtgc ggggattaca
     1321 gatgtgaccc accaagcccg gtctgtcatt tgcattttaa aatgggtcat ggggtgggca
     1381 cagtggctca cacctgtaat cccagcattt tggggaggca gaggcaggcg gatcactgag
     1441 atcaggaatt tgagaccagc ctgaccaaca tggtgaaacc cgtctctact aaaatacaaa
     1501 attagacagg tgtggtggcg catgccttta ctcccagcta cacgggaggc tgagacagga
     1561 gaatcgcttg aacctgggag gtggaggtta cagtgagccg agatcgtgcc attgcactcc
     1621 agcctgggca acaaaagcga aactccatct caaaataaat aaataaaata aaatgcgtca
     1681 gggagggtcg ggccttgtgg ctaatgcctg taatccaggt actttgggag gctgaggtgg
     1741 gcggatcatt tcaggtcaga ggttcgagat cagcctgggc aacgtggtga aatccccgtc
     1801 tctactaaaa acacaaaaaa atttgctggg cgtggtggtg cgtgcacctg tagtcccaac
     1861 tactaaggag gctgagacag gaggatcgct tgaactcgag aggcagaggc agcagtgagc
     1921 cgagatcact ccactgcact ccagcctggg tgatagagca agactctgtc taaaataaaa
     1981 taaaataaaa taaaataaaa ttgggtcagg gagtgggtga tttctactgc tagactgttt
     2041 aggccctgta ataaatggat aagggaagat aactgagagg cggggggcag gtcccttctt
     2101 aatattcact gaatcataca cacagacaat accttcttgg gagacaggcc tcagaggctg
     2161 ggaaaagact gggggaggag ttcagaccag atgccaggca ctgtgcctgc attttctcaa
     2221 tgaaccctct ttcacagtca ccccgtaaag tattatttcc tcattttaca gcaaggacac
     2281 tgaagcacaa aggtgaagtg acttggccca aggtcactca gggacagaaa tcttggagga
     2341 cctagatcag gccctagagg aggagagggg agatggaata tcctctccca gttcagaaac
     2401 tttctcggca gtggaggatg atagtggagg gactctgtcc ttcaccccat tgatccccag
     2461 aggggtgata gctgagtctt gtgactgggc ccctgggcag gggtcaaggg tcagtgcccc
     2521 tgtttccttt accccctcct ccccgggcaa cctttaaccc tccaccgccc acacgcaagg
     2581 ctgcctgcct ctacacattc tcccaagagt tgtctgagcc gccgagtgga cagtggctga
     2641 ttatggagag cagaggccca ctggctacct cgcgcctgct gctgttgctg ctgttgctac
     2701 tactgcgtca cacccgccag ggatgggccc tgagacctgt tctccccacc caggtgcagg
     2761 agcgggacag ggcactcagc tcatgcagtc ttcccttctc tcctctggcc ctgtagcagg
     2821 gcctctccct ctgtctgtct ctgacatgtc cctactcagc tttgtttgtt ttctctttct
     2881 gatagagtgc ccacgaccct ccggctgtcc acctcagcaa tggcccagga caagagccta
     2941 tcgctgtcat gacctttgac ctcaccaaga tcacaaagta tggggttggc ctagcccttg
     3001 acccagtccc ctggttctgc cctctctcca tcagctcttc tcttttccct gtcttccttt
     3061 ccttatctgt gaacaccatc tcccccaaac ccacactggt tctcaaagga cacatgacat
     3121 acacaatctt tccttctgtg tccttccaga acctcctcct cctttgaggt tcgaacctgg
     3181 gacccagagg gagtgatttt ttatggggat accaacccta aggatgactg gtttatgctg
     3241 ggacttcgag acggcaggcc tgagatccaa ctgcacaatc actgggccca gcttacggtg
     3301 ggtgctggac cacggctgga tgatgggaga tggcaccagg taagctagct ctggtcctca
     3361 ggggagggat gtctggagct ggtctgagga aagggaacaa aaccaagtta ttgggcatcc
     3421 ctttaccact gtcatctcgt ttaatccaca cgaaccccca caaagtagct attcttggcc
     3481 ccatcttttc tgatgggaat tctaaggctc agtcagtata taagtgacaa gagctgagtg
     3541 acccaaggcc aaggatgcta gctgcttctt taaggcatgt tctttccact atagtactag
     3601 gctgcctcac aggaaggtgg cagaaacaga tcccaggggc ctctgatttt gcttcccacc
     3661 ttcctgcagg tggaagtcaa gatggagggg gactctgtgc tgctggaggt ggatggggag
     3721 gaggtgctgc gcctgagaca ggtctctggg cccctgacca gcaaacgcca tcccatcatg
     3781 aggattgcgc ttggggggct gctcttcccc gcttccaacc ttcggttgcc ggtaactaca
     3841 ccccaggggt ggaaccctag ccaagacttg gtaaagcact gctgggtggc tggccgtggg
     3901 aatctaagtc cacactttta gggagaaggg aagggttgag agctgcaagg gggaggccaa
     3961 atgctcagag gggagtcaac tgagggcagg gaggtcggga ctgcgcctcc gatgccctga
     4021 tttctacatc cccgtatctt atctctgtca cactccagct ggttcctgcc ctggatggct
     4081 gcctgcgccg ggattcctgg ctggacaaac aggccgagat ctcagcatct gcccccacta
     4141 gcctcagaag ctgtgatgta gaatcaaatc ccgggatatt tctccctcca gggactcagg
     4201 cagaattcaa tctccgaggt agatttcctc ggagtctatt tttcccaccc tggccagctc
     4261 agcctgcctc tgtccccctc taccactggc ccctttcctc cttgagaccc cagctttgag
     4321 gcctcaggat aatcatttct ccccacagac attccccagc ctcatgcaga gccctgggcc
     4381 ttctctttgg acctgggact caagcaggca gcaggctcag gccacctcct tgctcttggg
     4441 acaccagaga acccatcttg gctcagtctc cacctccaag atcaagtaag ggacagtggg
     4501 cattgcctgt attcagtgga gcctggagca atgaggaaga gggagtccaa catgtcaata
     4561 ttaggaaggt ttccagccca gggaacataa caagactggc tccacagaat tgtttttcat
     4621 taataattag ccaggcatgg tggtggtgct tgcctgtaat cccaggtgct ggaggccaag
     4681 accagaggat cacttgaggc caggagtttg acaccagcct gggcaacata gcagagacct
     4741 ctgtctaaaa aaaaaaaaaa attagccagg catggtagca catgtctgct gccctagcta
     4801 tttaggagcc tgaggcagga ggttcacttg agcccaggag tttgaagctg cagtgagcta
     4861 tgatgtgcca ctgcactctg acctgggcca cagtgagacc ctgtctcaaa aaataaaaat
     4921 aaaaataagg cttatggatg gcactcaggt gggtggtagg ggcgagggac atatcttgaa
     4981 gctccccaca gcaagcaaac agttttgact tagactgcat atttacttgg ggcaggtgtg
     5041 gtttcaaaaa gggtcaagcc aaaaaaaatt ggggcaggat ttaagtggtg agaatggcca
     5101 gtaggtggag gcatagcgaa gaggcagaat taaggcagct aggggtgagg ccacaggcag
     5161 taggcccggc tcattcttcc ctctctctct accgtccctt tcccacacac tctgcagaag
     5221 gtggtgttgt cttctgggtc ggggccaggg ctggatctgc ccctggtctt gggactccct
     5281 cttcagctga agctgagtat gtccagggtg gtcttgagcc aagggtcgaa gatgaaggcc
     5341 cttgccctgc ctcccttagg cctgctcccc tcccttaacc tctgggccaa gcctcaaggg
     5401 cgtctcttcc tgggggcttt accaggtaag agagaatgat gttcaagttc atgagcacaa
     5461 cattggaaac agctcaaggg aggcggcaca ttttgagggg aaggaaacct ctgggaggga
     5521 agaagaatag gccacaagaa gaagatatgg gggcagtgga aggtagtgct tttgcaaact
     5581 caggttggag gagtggaaaa gtggggagaa gattctggat ccgagccacc ttaatgctct
     5641 aatgccacct ttgcactacc tccctctagg agaagactct tccacctctt tttgcctgaa
     5701 tggcctttgg gcacaaggtc agaggctgga tgtggaccag gccctgaaca gaagccatga
     5761 gatctggact cacagctgcc cccagagccc aggcaatggc actgacgctt cccattaaag
     5821 ctccacctaa gaaccccctt tgaaagttac tgattattca tttattcaac aaatattcac
     5881 tgtgcactag caatgtacca ggcactgtgc caagtattga gttgtcttaa tgagcaaaaa
     5941 cactctggtt cctaccctct tggtgcccac agtcccatag ggaagcagac atccatcaaa
     6001 ggctaactaa taagtggata gttggaagca ctgataaaga agaattggag agttgtgaaa
     6061 acatggagac tggcgggcgt gtggctc
//

GenBank-Updates@genbank.bio.net (04/09/90)

LOCUS       OWEPH18      1548 bp ds-DNA             INV       23-FEB-1990
DEFINITION  O.fusiformis CCPurine repeat sequence.
ACCESSION   M32217
KEYWORDS    .
SOURCE      O.fusiformis DNA, clone PH18.
  ORGANISM  Owenia fusiformis
            Eukaryota; Animalia; Metazoa; Annelida; Polychaeta; Oweniida; 
            Oweniidae.
REFERENCE   1  (bases 1 to 1548)
  AUTHORS   Bakalara,N., Collet,J., Planells,R., Thouveny,Y. and Fontes,M.
  TITLE     Presence in invertebrate genomes of sequences characterized by the
            repetition of the triplet CCpurine
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 66-73 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt         515      663     CCPu repeat sequence
BASE COUNT      424 a    395 c    230 g    499 t
ORIGIN      
        1 aaagcatttt tccaatcttc acgctcgagt gattttggat gatttatgga gtttgagact
       61 ttttttcacc aaacggtcac tttattctta ctcctgagtc aatttagaga ctttttccaa
      121 tcttcacgct cgagtgattt tggatgattt ctggagtttg agacattctt tcaccaaacg
      181 atcacttgat tcttactccc aagtcattta agagtatttt ttcagtcgtc atggtttagt
      241 gatttcggtt acgcttgatc aatttctgct aatttttggg agtttaatgc gttttcacaa
      301 aatagacacc cacccactat acaaaataac atgttttaat aggcatctgt taatttaaca
      361 taccactgat aaaaatataa aaccacacaa gataaaaaaa atgctaacat taacagtttt
      421 acaagcgaaa cattttcgac caactaaagt gtatacttaa atgagcagct gaaccatttc
      481 aatcaatcag attagattat tgcactcttt gaccccgccg ccaccaccac cgccgccgcc
      541 accaccgcca ccaccaccgc cgccaccacc accaccgcca ccaccgccgc caccaccacc
      601 accgccacca ccgccgccac caccaccacc gccaccacca ccaccgccgc caccaccgcc
      661 gcctcggtgc cacaactatc gacaacaaca accccaacaa caataaaaat gtattttagt
      721 gctatagtag tttattaata cacatatttt catttataca aattatttca tagtcccttt
      781 tggatgaggg attcactgtc tgtgttgacg ttttccagga gggccagaat acaccataac
      841 atccctcttt ttcttaggtt ttttaaaaag tcgtattcat cacatttcca ttggaggtgt
      901 ggtcaaagga atcattgtca ctcatttgtc tgtaaacgtc tcttggtggc ctatcccgtt
      961 cgccattttc tttctgctgc ttgtcaattt ctgccttggc tctctataaa cagcttcagt
     1021 gggactgaga tgttgaacaa tcggtttcat ggactgatca cactgtgatc cacttgagtt
     1081 ttcctcgttt accatatcca gcatgttcga tgactctgcc tttcacttgg gctcgtataa
     1141 acttcttcca cacttctggg ttgtccacat aaggcactat cttaggcatt gttgcttctt
     1201 ccttttctat taattatctg atgctaatgt ccaggtttcc ctttatatac tcatcaaaca
     1261 ttcattcatg aaatctaact acgttcgtct tcttcttcca tctcacatag tgtatctgca
     1321 tcttctaaat cacttccgtc aatagactct tcatcgttaa acatttcatc agcgatataa
     1381 gaatatttgg ccaaagtttt tctgatggct ctcgacagag atatgtcttg atcaactaag
     1441 aattcaattt cttctacaat ttctcgatgt aaatcagcat ctttgacttc aatgaataac
     1501 ctcagaattt gaccataatc atttaaaaac tcagattgct gggataag
//

GenBank-Updates@genbank.bio.net (04/09/90)

LOCUS       MUSIGHRH      345 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65D6-5,
            partial cds.
ACCESSION   M32034
KEYWORDS    N-region; diversity exon; gamma-immunoglobulin;
            immunoglobulin heavy chain; joining exon; processed gene;
            variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 345)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   345     Ig H-chain V-D-J region (AA at 1)
    recomb      279      280     V-region DNA end/N-region DNA start
    recomb      282      283     N-region DNA end/D-region DNA start
    recomb      300      301     D-region DNA end/N-region DNA start
    recomb      303      304     N-region DNA end/J-region DNA start
BASE COUNT       93 a     78 c     90 g     84 t
ORIGIN      Chromosome 12.
        1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
       61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
      121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
      181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagccctg
      241 acatctgagg actctgcagt ctatttctgt gcaagatctc attactatgg tggtagctac
      301 gactttgact actggggcca aggcaccact ctcacagtct cctca
//
LOCUS       MUSIGHRI      363 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65D6-7,
            partial cds.
ACCESSION   M32035
KEYWORDS    N-region; diversity exon; gamma-immunoglobulin;
            immunoglobulin heavy chain; joining exon; processed gene;
            variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-7.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 363)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   363     Ig H-chain V-D-J region (AA at 1)
    recomb      297      298     V-region DNA end/N-region DNA start
    recomb      300      301     N-region DNA end/D-region DNA start
    recomb      318      319     D-region DNA end/N-region DNA start
    recomb      321      322     N-region DNA end/J-region DNA start
BASE COUNT       99 a     83 c     94 g     87 t
ORIGIN      Chromosome 12.
        1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
       61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
      121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
      181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac
      241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatctcat
      301 tactatggtg gtagctacga ctttgactac tggggccaag gcaccactct cacagtctcc
      361 tca
//
LOCUS       MUSIGHRJ      363 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65D6-3,
            partial cds.
ACCESSION   M32036
KEYWORDS    N-region; diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 363)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   363     Ig H-chain V-D-J region (AA at 1)
    recomb      297      298     V-region DNA end/N-region DNA start
    recomb      300      301     N-region DNA end/D-region DNA start
    recomb      318      319     D-region DNA end/N-region DNA start
    recomb      321      322     N-region DNA end/J-region DNA start
BASE COUNT       99 a     84 c     93 g     87 t
ORIGIN      Chromosome 12.
        1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
       61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
      121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
      181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac
      241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcccat
      301 tactatggtg gtagctacta ctttgactac tggggccaag gcaccactct cacagtctcc
      361 tca
//
LOCUS       MUSIGHRK      363 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65I4-2,
            partial cds.
ACCESSION   M32037
KEYWORDS    N-region; diversity exon; gamma-immunoglobulin;
            immunoglobulin heavy chain; joining exon; processed gene;
            variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65I4-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 363)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   363     Ig H-chain V-D-J region (AA at 1)
    recomb      297      298     V-region DNA end/N-region DNA start
    recomb      300      301     N-region DNA end/D-region DNA start
    recomb      318      319     D-region DNA end/N-region DNA start
    recomb      321      322     N-region DNA end/J-region DNA start
BASE COUNT      100 a     82 c     93 g     88 t
ORIGIN      Chromosome 12.
        1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
       61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
      121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
      181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac
      241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatctaat
      301 tactatggtg gtagctacta ctttgactac tggggccaag gcaccactct cacagtctcc
      361 tca
//
LOCUS       MUSIGHRL      363 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65J4-1,
            partial cds.
ACCESSION   M32038
KEYWORDS    N-region; diversity exon; gamma-immunoglobulin;
            immunoglobulin heavy chain; joining exon; processed gene;
            variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65J4-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 363)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   363     Ig H-chain V-D-J region (AA at 1)
    recomb      297      298     V-region DNA end/N-region DNA start
    recomb      300      301     N-region DNA end/D-region DNA start
    recomb      318      319     D-region DNA end/N-region DNA start
    recomb      321      322     N-region DNA end/J-region DNA start
BASE COUNT      100 a     82 c     93 g     88 t
ORIGIN      Chromosome 12.
        1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg
       61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg
      121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac
      181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac
      241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatctaat
      301 tactatggtg gtagctacta ctttgactac tggggccaag gcaccactct cacagtctcc
      361 tca
//
LOCUS       MUSIGKCRP     324 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65D6-5,
            partial cds.
ACCESSION   M32039
KEYWORDS    N-region; immunoglobulin light chain; joining exon;
            kappa-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig kappa-chain V-J region (AA at 1)
    recomb      285      286     V-region DNA end/N-region DNA start
    recomb      288      289     N-region DNA end/J-region DNA start
BASE COUNT       96 a     80 c     72 g     76 t
ORIGIN      Chromosome 6.
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
      241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
      301 ggcaccaagc tggaaatcaa acgg
//
LOCUS       MUSIGKCRQ     324 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65D6-7,
            partial cds.
ACCESSION   M32040
KEYWORDS    N-region; immunoglobulin light chain; joining exon;
            kappa-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-7.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig kappa-chain V-J region (AA at 1)
    recomb      285      286     V-region DNA end/N-region DNA start
    recomb      288      289     N-region DNA end/J-region DNA start
BASE COUNT       96 a     80 c     71 g     76 t      1 others
ORIGIN      Chromosome 6.
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
      241 gaagatattn ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
      301 ggcaccaagc tggaaatcaa acgg
//
LOCUS       MUSIGKCRR     324 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65D6-3,
            partial cds.
ACCESSION   M32041
KEYWORDS    N-region; immunoglobulin light chain; joining exon;
            kappa-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig kappa-chain V-J region (AA at 1)
    recomb      285      286     V-region DNA end/N-region DNA start
    recomb      288      289     N-region DNA end/J-region DNA start
BASE COUNT       96 a     80 c     72 g     76 t
ORIGIN      Chromosome 6.
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
      241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
      301 ggcaccaagc tggaaatcaa acgg
//
LOCUS       MUSIGKCRS     324 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65I4-2,
            partial cds.
ACCESSION   M32042
KEYWORDS    N-region; immunoglobulin light chain; joining exon;
            kappa-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65I4-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig kappa-chain V-J region (AA at 1)
    recomb      285      286     V-region DNA end/N-region DNA start
    recomb      288      289     N-region DNA end/J-region DNA start
BASE COUNT       96 a     80 c     72 g     76 t
ORIGIN      Chromosome 6.
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
      241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
      301 ggcaccaagc tggaaatcaa acgg
//
LOCUS       MUSIGKCRT     324 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65J4-1,
            partial cds.
ACCESSION   M32043
KEYWORDS    N-region; immunoglobulin light chain; joining exon;
            kappa-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain A/J), cDNA to mRNA, hybridoma P65J4-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Parhami-Seren,B., Wysocki,L. and Margolies,M.N.
  TITLE     The amino acid residues at the V-H-D-J-H junctions affect the
            affinity of anti-p-azophenylarsonate antibodies
  JOURNAL   J. Immunol. 143, 4090-4097 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig kappa-chain V-J region (AA at 1)
    recomb      285      286     V-region DNA end/N-region DNA start
    recomb      288      289     N-region DNA end/J-region DNA start
BASE COUNT       95 a     80 c     72 g     76 t      1 others
ORIGIN      Chromosome 6.
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
      241 gaagatattg ccacttactt ttgccnacag ggtaatacgc ttcctcggac gttcggtgga
      301 ggcaccaagc tggaaatcaa acgg
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       MUSIGHRM      374 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma BrM4,
            partial cds.
ACCESSION   M31953
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain B10.A) anti BrMRBC hybridoma BrM4, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 374)
  AUTHORS   Conger,J.D., Sage,H.J. and Corley,R.B.
  TITLE     Diversity in the available repertoire of murine antibodies reactive
            with bromelain-treated isologous erythrocytes
  JOURNAL   J. Immunol. 143, 4044-4052 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   374     Ig mu-chain precursor V-D-J1-region (AA at 3)
    sigp    <     1       20     Ig mu-chain signal peptide
    sigp         21  >   374     Ig mu-chain
    recomb      314      315     V-region DNA end/D-region DNA start
    recomb      323      324     D-region DNA end/J1-region DNA start
BASE COUNT       92 a     80 c    109 g     91 t      2 others
ORIGIN      Chromosome 12.
        1 ttttaaaaga tgtccagtgt gaagtgcagc tgttggagac tggaggaggc ttggtgcaac
       61 ctggggggtc acggggactc tcttgtgaag gctcagggtt cacttttagt ggcttctgga
      121 tgngctgggt tcgacagaca cctgggaaga ccctggagtg gattggagac attnattctg
      181 atggcagtgc aataaactac gcaccatcca taaaggatcg attcactatc ttcagagaca
      241 atgacaagag caccctgtac ctgcagatga gcaatgtgcg atcggaggac acagccacgt
      301 atttctgtat gagatatggt aactactggt acttcgatgt ctggggcaca gggaccacgg
      361 tcaccgtctc ctca
//
LOCUS       MUSIGHRN      287 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma S12M1,
            partial cds.
ACCESSION   M31954
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain B10.A) anti BrMRBC hybridoma S12M1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 287)
  AUTHORS   Conger,J.D., Sage,H.J. and Corley,R.B.
  TITLE     Diversity in the available repertoire of murine antibodies reactive
            with bromelain-treated isologous erythrocytes
  JOURNAL   J. Immunol. 143, 4044-4052 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   287     Ig mu-chain V-D-J1-region (AA at 3)
    recomb      227      228     V-region DNA end/D-region DNA start
    recomb      236      237     D-region DNA end/J1-region DNA start
BASE COUNT       77 a     64 c     75 g     64 t      7 others
ORIGIN      Chromosome 12.
        1 aaggcnnagg gnncactttt agtggcttct ggatgagctg ggntcgacag acacctggga
       61 agaccctgga gtggattgga gacattaatt ctgatggcag tgcaataaac tacgcaccat
      121 ccataaagga tcgattcact atcttcagag acaatgacaa gagcaccctg tacctgcaga
      181 tgagcaatgt gcgatcggag gacacagcca cgtatttctg tatgagatat ggtaactact
      241 ngtacttcga tgtctggggc acagggacca cggtcaccgt ctcctna
//
LOCUS       MUSIGHRO      366 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig active mu-chain mRNA V-D-J2 region, from hybridoma CH12,
            partial cds.
ACCESSION   M31956
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain B10.A) anti BrMRBC hybridoma CH12, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 366)
  AUTHORS   Conger,J.D., Sage,H.J. and Corley,R.B.
  TITLE     Diversity in the available repertoire of murine antibodies reactive
            with bromelain-treated isologous erythrocytes
  JOURNAL   J. Immunol. 143, 4044-4052 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   366     Ig mu-chain precursor V-D-J2-region (AA at 1)
    matp          1  >   366     Ig mu-chain
    recomb      294      295     V-region DNA end/D-region DNA start
    recomb      321      322     D-region DNA end/J2-region DNA start
BASE COUNT       95 a     93 c     97 g     81 t
ORIGIN      Chromosome 12.
        1 caggtccaac tgcagcagcc tgggactgaa ctggtgaagc ctggggcttc agtgaagctg
       61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg
      121 cctggacaag gccttgagtg gattggaaat attaatccta gcaatggtgg tactaactac
      181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aatcctccag cacagcctac
      241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagagattac
      301 tacggtagta gctgggggta ctactttgac tactggggcc aaggcaccac tctcacagtc
      361 tcctca
//
LOCUS       MUSIGHRP      366 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig active mu-chain mRNA V-D-J2 region, from hybridoma BrM1,
            partial cds.
ACCESSION   M31957
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain B10.A) anti BrMRBC hybridoma BrM1., cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 366)
  AUTHORS   Conger,J.D., Sage,H.J. and Corley,R.B.
  TITLE     Diversity in the available repertoire of murine antibodies reactive
            with bromelain-treated isologous erythrocytes
  JOURNAL   J. Immunol. 143, 4044-4052 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   366     Ig mu-chain precursor V-D-J2 region (AA at 1)
    matp          1  >   366     Ig mu-chain
    recomb      294      295     V-region DNA end/D-region DNA start
    recomb      321      322     D-region DNA end/J2-region DNA start
BASE COUNT       93 a     90 c     98 g     80 t      5 others
ORIGIN      Chromosome 12.
        1 caggtccaac tgcngcagcc tgggactgaa ctggtgaagc cttgggcttc agtgaagctg
       61 tcctgcaagg cttctggcca caccttcaca agctantgga taagctgggt gaagnagaat
      121 cctggacagg gccttgagtg gattggatnt attnatccga gaagtggtga tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca aatcctccag cacagcgtac
      241 atggacgtcc gcagcctgac atctgaggac tctgcggtct attattgtgc aagaagctat
      301 tactacggta gtagctacgg ggactttgac tactggggcc aaggcaccac tctcacagtc
      361 tcctca
//
LOCUS       MUSIGHRQ      384 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig active mu-chain mRNA V-D-J3 region, from hybridoma BrM8,
            partial cds.
ACCESSION   M31958
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain B10.A) anti BrMRBC hybridoma BrM8, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 384)
  AUTHORS   Conger,J.D., Sage,H.J. and Corley,R.B.
  TITLE     Diversity in the available repertoire of murine antibodies reactive
            with bromelain-treated isologous erythrocytes
  JOURNAL   J. Immunol. 143, 4044-4052 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   384     Ig mu-chain precursor V-D-J3-region (AA at 1)
    matp    <     1  >   384     Ig mu-chain
    recomb      297      298     V-region DNA end/D-region DNA start
    recomb      339      340     D-region DNA end/J3-region DNA start
BASE COUNT       96 a     92 c     99 g     95 t      2 others
ORIGIN      Chromosome 12.
        1 gaggtccagc tgcaacaatc tggacctgag ctggtgaagc ctggggcttc agtgaagata
       61 tcctgtaagg cttctggata cacgttcact gactactana tgaactgggt gaagcagagc
      121 catggaaaga gccttgantg gattggagat attaatccta acaatggtgg tactagctac
      181 aaccagaagt tcaagggcaa ggccacattg actgtagaca agtcctcctc cagcacagcc
      241 tacatggtac gccgcagcct gacatctgag gactttgcag tctattactg tgcaagatcc
      301 ctctactact tggggagctc tctctactat gattacggct ggtttgctta ctggggccaa
      361 gggactctgg tcactgtctc tgca
//
LOCUS       MUSIGHRR      331 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma BrM11,
            partial cds.
ACCESSION   M31959
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain B10.A) anti BrMRBC hybridoma BrM11, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 331)
  AUTHORS   Conger,J.D., Sage,H.J. and Corley,R.B.
  TITLE     Diversity in the available repertoire of murine antibodies reactive
            with bromelain-treated isologous erythrocytes
  JOURNAL   J. Immunol. 143, 4044-4052 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   331     Ig mu-chain V-D-J1-region (AA at 2)
    recomb      262      263     V-region DNA end/D-region DNA start
    recomb      283      284     D-region DNA end/J1-region DNA start
BASE COUNT       79 a     84 c     87 g     77 t      4 others
ORIGIN      Chromosome 12.
        1 aatggagcct ggagggtccc ggaaactctc ctgtgcagcc tctggattca ctttcagtag
       61 cattggaatg cactgggttc gtcaggctcc agagaagggg ctggagttgg tngcatacat
      121 tagtagtggc agtagtacca tctactatgc agacacagng aagggccgat tcaccatctc
      181 cagagacaat gccaagaaca ccctnttcct gcaaatgacc agtctaaggt ctgaggacac
      241 ggccatgtat tactgtgcaa ggtgccctta tgattacgac tcttngtact tcgatgtctg
      301 gggcacaggg accacggtca ccgtctcctc a
//
LOCUS       MUSIGHRS      195 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma BrMs19,
            partial cds.
ACCESSION   M31960
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Mouse (strain B10.A) anti BrMRBC hybridoma BrMs19, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 195)
  AUTHORS   Conger,J.D., Sage,H.J. and Corley,R.B.
  TITLE     Diversity in the available repertoire of murine antibodies reactive
            with bromelain-treated isologous erythrocytes
  JOURNAL   J. Immunol. 143, 4044-4052 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   195     Ig mu-chain V-D-J1-region (AA at 1)
    recomb      135      136     V-region DNA end/D-region DNA start
    recomb      144      145     D-region DNA end/J1-region DNA start
BASE COUNT       50 a     56 c     41 g     40 t      8 others
ORIGIN      Chromosome 12.
        1 agtgngnnaa cnntctacaa cccatctctc cagagcccca tctccattac tagagaaacg
       61 tcaaagaacc agtncttcct ccaattgaac tctgtgacca cagaggacac agccatgtat
      121 tactgtgcag gagacagtgc tggctactgg tacttcgatg tctggggcac agggaccacg
      181 gtcaccgtct ccnna
//
LOCUS       MUSIGKCRU     321 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J4 region, from hybridoma BrM4,
            partial cds.
ACCESSION   M31955
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.A) anti BrMRBC hybridoma BrM4, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Conger,J.D., Sage,H.J. and Corley,R.B.
  TITLE     Diversity in the available repertoire of murine antibodies reactive
            with bromelain-treated isologous erythrocytes
  JOURNAL   J. Immunol. 143, 4044-4052 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   321     Ig kappa-chain precursor VJ4-region (AA at 1)
    matp          1  >   321     Ig kappa-chain
    recomb      285      286     V-region end/J4-region start
BASE COUNT       94 a     74 c     76 g     72 t      5 others
ORIGIN      Chromosome 6.
        1 gacatcaaga tgaccnagtc nccatcctcc atgtatgcat cgctgggaga gagagtcact
       61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca
      121 tggaaatctc ctaagaccct gatctatnat gcaacaagct tggcagatgg ggtcccatca
      181 agattcagtg gcagtggatc tgggcaagat tnttctctaa ccatcagcag cctggagtct
      241 gacgatacag caacttntta ctgtctacag catggtgaga gcccattcac gttcggctcg
      301 gggacaaagt tggaaataaa a
//
LOCUS       MUSIGKCRV     261 bp ss-mRNA            ROD       20-FEB-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J2 region, from hybridoma S12M1,
            partial cds.
ACCESSION   M31961
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.A) anti BrMRBC hybridoma S12M1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 261)
  AUTHORS   Conger,J.D., Sage,H.J. and Corley,R.B.
  TITLE     Diversity in the available repertoire of murine antibodies reactive
            with bromelain-treated isologous erythrocytes
  JOURNAL   J. Immunol. 143, 4044-4052 (1989)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   261     Ig kappa-chain VJ2-region (AA at 1)
    recomb      225      226     V-region end/J2-region start
BASE COUNT       79 a     58 c     65 g     55 t      4 others
ORIGIN      Chromosome 6.
        1 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca
       61 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca
      121 aganncagtg gcagtggatc tgggcaagat tnttctctaa ccatcagcag cctggagtct
      181 gacgatacag caacttatna ctgtctacag catggtgaga gcccgtacac gttcggaggg
      241 gggaccaagc tggaaataaa a
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       ECOCMA       1007 bp ds-DNA             BCT       15-MAR-1989
DEFINITION  Plasmid pColBM-Cl139 from E.coli cma gene encoding colicin M
            activity peptide, complete cds.
ACCESSION   M16754
KEYWORDS    colicin M activity.
SOURCE      Plasmid pColBM-Cl139 DNA, clone pTO4.
  ORGANISM  Plasmid Colicin BM-Cl139
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1007)
  AUTHORS   Koeck,J., Oelschlaeger,T., Kamp,R.M. and Braun,V.
  TITLE     Primary structure of colicin M, an inhibitor of murein biosynthesis
  JOURNAL   J. Bacteriol. 169, 3358-3361 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         60      875     colicin M activity peptide
    mRNA    <     1      901     cma mRNA
BASE COUNT      283 a    227 c    198 g    299 t
ORIGIN      NaeI site.
        1 ggcggaggcg ttagccagcg gcgaactctg ttatcttgtt aacttataag gagttatgta
       61 tggaaacctt aactgttcat gcaccatcac catcaactaa cttaccaagt tatggcaatg
      121 gtgcattttc tctttcagca ccacatgtgc ctggtgctgg ccctctttta gtccaggttg
      181 tttatagttt tttccagagt ccaaacatgt gtcttcaggc tttaactcaa cttgaggatt
      241 acatcaaaaa acatggggcc agcaaccctc tcacattgca gatcatatcg acaaatattg
      301 gttacttctg taacgccgac cgaaatctgg ttcttcaccc tggaataagc gtttatgacg
      361 cttaccactt cgcaaaacca gcgccaagtc aatatgacta tcgctcaatg aatatgaaac
      421 aaatgagcgg taatgtcact acaccaattg tggcgcttgc tcactattta tggggtaatg
      481 gcgctgaaag gagcgttaat atcgccaaca ttggtcttaa aatttcccct atgaaaatta
      541 atcagataaa agacattata aaatctggtg tagtaggcac attccctgtt tctacaaagt
      601 tcacacatgc cactggtgat tataatgtta ttaccggtgc atatcttggt aatatcacac
      661 tgaaaacaga aggtacttta actatctctg ccaatggctc ctggacttac aatggcgttg
      721 ttcgttcata tgatgataaa tacgatttta acgccagcac tcaccgtggc attatcggag
      781 agtcgctcac aaggctcggg gcgatgtttt ctggtaaaga gtaccagata ctgcttcctg
      841 gtgaaattca cattaaagaa agtggtaagc gataatataa gggccggggt tccggctttt
      901 tatgcagcct ttataacagt gtattagtca ttcgcatatt caacatatcc ctcctcacaa
      961 tggtacatgc cataatacat tctcttaccc tccgggcctt cagcttt
//
LOCUS       ECOCOLM       810 bp ds-DNA             BCT       15-SEP-1989
DEFINITION  Plasmid pColBM-Cl139 colicin M resistance (cmi) locus, complete
            cds.
ACCESSION   M17810
KEYWORDS    cmi protein; colicin M resistance protein.
SOURCE      Plasmid pColBM-Cl139 DNA, isolated from E.coli strain JM101.
  ORGANISM  Plasmid Colicin BM-Cl139
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 810)
  AUTHORS   Oelschlaeger,T. and Braun,V.
  TITLE     Sequence, expression, and localization of the immunity protein for
            colicin M
  JOURNAL   J. Bacteriol. 169, 4765-4769 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        322      675     colicin M resistance protein
    pept        337      675     colicin M resistance protein (alt.)
BASE COUNT      252 a    146 c    177 g    235 t
ORIGIN      7 bp upstream of PstI site.
        1 ggctgcagag cctgatgggg cataagtcca tcagctcaaa cgaggtctac acgaaggtgt
       61 ttgcactgga tgtggctgta cggcaccggg tgcagttttc gatgcctgag tccgatgcgg
      121 tcacaatgct gaaaaacaga catgcataat aagtcacaat tatgaattgt gatttcttct
      181 ataaaaaaga gaccactgca atatgtgatc tcttgtatta tttcataatt gttaaagcca
      241 cttcacagta tgctcacatt gtactgatat attcgtaatg tttttttata tcgaatgaac
      301 gacagaagtt gtggagattt tatgaaagta attagcatga aatttatttt tattttaacg
      361 attattgctc ttgctgctgt ttttttctgg tctgaagata aaggtccggc atgctatcag
      421 gtcagcgatg aacaggccag aacgtttgta aaaaatgatt acctgcaaag aatgaaacgc
      481 tgggacaacg atgtacaact tcttggtaca gaaatcccga aaattacatg ggaaaagatt
      541 gagagaagtt taacagatgt tgaagatgaa aaaacacttc ttgtcccatt taaagctgaa
      601 ggcccggacg gtaagagaat gtattatggc atgtaccatt gtgaggaggg atatgttgaa
      661 tatgcgaatg actaatacac tgttataaag gctgcataaa aagccggaac cccggccctt
      721 atattatcgc ttaccacttt ctttaatgtg aatttcacca ggaagcagta tctggtactc
      781 tttaccagaa aacatcgccc cgacggatcc
//
LOCUS       ECOORID      1890 bp ds-DNA             BCT       15-DEC-1989
DEFINITION  E.coli plasmid pColBM-Cl139 D protein gene, complete cds, oriV
            region, and ORF.
ACCESSION   J03312
KEYWORDS    D protein; ORF; oriV region.
SOURCE      E.coli (strain K12 Cl139) plasmid pColBM-Cl139 DNA.
  ORGANISM  Plasmid Colicin BM-Cl139
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1890)
  AUTHORS   Thumm,G., Oelschlaeger,T. and Braun,V.
  TITLE     Plasmid pCoIBM-CI139 does not encode a colicin lysis protein but
            contains sequences highly homologous to the D protein (resolvase)
            and the oriV region of the miniF plasmid
  JOURNAL   Plasmid (1988) In press
  STANDARD  simple staff_review
COMMENT     Draft entry and sequence for [1] kindly submitted by G.Thumm,
            18-AUG-1988.  Sequence [1] complemented.
FEATURES       from  to/span     description
    pept       1108     1890     D protein
    ORF         654     1106     ORF8
    site        257      576     oriV region
BASE COUNT      434 a    476 c    534 g    446 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcatct tttggccgtt tacgtctgtt ccgttatcct gatgatacga tgttctgcac
       61 gttctgccgg gaagatgcag atgattcgct taaaagtatt atgacccatc tctgggagct
      121 ggatgcagag atgacagatc ctgtcatagc tatgtttaat cacgtctgag tgccgtgagt
      181 gatttctgtc ttttatgcaa cagtgccaag atattgtaat caaaaaaaag cattaatgca
      241 ttttggacag taatctattt taattgatga catagaggca ttaatctttc tttttcttca
      301 ggaagatccg aaaactcctg gtcacggatc ttcctctccc ccacacaacg ccacctcctg
      361 taagcacaac atgtggtgcc ggattcagct gctgatgaca ctatatgttg tgtcatctcc
      421 ctgacctgtg atgcgtcgcg caggggcgga aaacagcgat atgatgattt cctcggcgtg
      481 gtacacttcc ggaaagttgt gatattccgg aaagtcggat ctgacggaaa cggctctccg
      541 gtaatttaac ggcgtggtta tatggatgct tgttatcatg gtgatgatga taacggcatg
      601 atgttatcag acggcgtgac ggtaagggca gtgatgatgg atgacgttat cgcatgaccg
      661 tccctgcccg gaaaagaaaa aaggagtcac ccatgttttt tattgagaat gaaggtcagg
      721 ctgtcgccgg aacggattac tggcagtctg tacaggcgca ggccggatat gtctacctca
      781 gctggaatgc cggcgcagcc aggctgcttg tcccggatgc ggcaaaacat ttactcaggg
      841 agatgcgggg ggctgagtac gtcatcatca gtaagggagc actgcatggc cgcgatgcgc
      901 tggaactggt atttgaagac ggcagcgatg cgccgtttgt gatccacatg ctgagtgagc
      961 agtgcgatcg cctgctcccc gaaaacaacc agggaggggg ttttgttgtc accgtctgga
     1021 cgcgtggcgg taaccagctc cgttatccgg gaaagtaccg ggttgtggaa aacctgcccg
     1081 acgtttcccc gtggagtgaa cactgatatg cagcacctgc cggcaccgat ccaccatgcc
     1141 cgggatgctg ttcagcttcc tgttgccatc gattatccgg cagcgctggc actccgccag
     1201 atgtcgatgg ttcatgatga actgcccaaa tacctgctgg cccctgaagt gagcgccctg
     1261 ctccattacg tcccggatct gcgccgcaag atgctgctgg ccacactgtg gaacaccggt
     1321 gcgcgcatta atgaagcact ggcgctgacg cggggggatt tttcgctcac gcctccgtat
     1381 ccgtttgtgc agctggccac tctgaagcag cggacagaaa aagccgccag gacggcagga
     1441 agaatgcccg ccggtcagca gactcaccgg ctggttccgc tctccgactc ctggtacgtc
     1501 agccagctgc agacgatggt agccacactg aaaatcccca tggaacggcg taataaacga
     1561 acaggcagga cagagaaagc gcggatctgg gaagtgacgg acagaacggt caggacctgg
     1621 attggggagg cggttgccgc cgctgccgct gatggtgtga cgttctctgt cccggtcacg
     1681 ccacatacgt tccgccattc ctatgcgatg cacatgctgt atgccggtat accgcttaag
     1741 gttctgcaga gtctgatggg gcataagtcc atcagctcaa cagaggtcta cacgaaggtg
     1801 tttgcactgg atgtggctgc acggcaccgg gtgcagtttt cgatgcctga gtccgatgcg
     1861 gtcacaatgc tgaaaaacag acatgcataa
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       CEC38P       1455 bp ds-DNA             BCT       31-AUG-1987
DEFINITION  Plasmid ColE3-CA38 colicinogenic region containing colicin E3
            (colE3), immunity (immE3), and putative hic and immE8 genes.
ACCESSION   J01574 J01575 M14038
KEYWORDS    colicin; colicin release protein; immune response gene;
            lysis protein; unidentified reading frame.
SOURCE      Plasmid ColE3-CA38 (from E.coli) DNA.
  ORGANISM  Plasmid Colicin E3-CA38
            Prokaryota; Bacteria.
REFERENCE   1  (bases 52 to 651)
  AUTHORS   Masaki,H. and Ohta,T.
  TITLE     A plasmid region encoding the active fragment and the inhibitor
            protein of colicin E3-CA38
  JOURNAL   FEBS Lett. 149, 129-132 (1982)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 651)
  AUTHORS   Mock,M., Miyada,C.G. and Gunsalus,R.P.
  TITLE     Nucleotide sequence for the catalytic domain of colicin E3 and its
            immunity protein. Evidence for a third gene overlapping colicin
  JOURNAL   Nucleic Acids Res. 11, 3547-3557 (1983)
  STANDARD  full staff_review
REFERENCE   3  (bases 640 to 1455)
  AUTHORS   Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P.
  TITLE     Characterization and nucleotide sequence of a colicin-release gene
            in the hic region of plasmid ColE3-CA38
  JOURNAL   Gene 29, 175-184 (1984)
  STANDARD  full staff_review
REFERENCE   4  (bases 640 to 1455)
  AUTHORS   Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P.
  TITLE     Corrigenda: Characterization and nucleotide sequence of a
            colicin-release gene in the hic region of plasmid ColE3-CA38
  JOURNAL   Gene 42, 351-355 (1986)
  STANDARD  full staff_review
COMMENT     There are three ORFs distal to the immunity gene (immE3) in plasmid
            ColE3-CA38. ORF1 (bases 803 to 1066) is homologous to the
            E2-immunity gene in plasmid ColE2-P9. [3] has tentatively assigned
            ORF1 as the immE8 gene, but points out that a gene product has not
            yet been identified. Through deletion mutation studies this region
            was shown to be non-essential for colicin release.
            
            There are two overlapping reading frames further downstream of ORF1
            (bases 1070 to 1246 and 1128 to 1421), which are homologous to the
            H' and H genes in plasmid CloDF13. In CloDF13 the H gene has been
            shown to be the lysis gene. Because of gene homology and in vitro
            studies which show that the ORF3 region is necessary for cell
            lysis, [3] has identified ORF3 as the hic gene. However, the in
            vitro studies do not exclude the possibility that ORF2 functions in
            colicin release or that it contributes to the Hic phenotype.  There
            is also the possibility that the hic gene initiates at bp 1095
            rather than 1128. The hic gene product also has not been
            identified.
            
            The immE8 and hic genes have been experimentally localized to the
            regions annotated in the Features Table. Their coding regions were
            deduced by finding the open reading frames and comparing them with
            sequences of genes in plasmids with like phenotypes [3].
            
            There are six inverted repeats in the hic region. IR-2, IR-3, and
            IR-5 (positions 762-792, 964-984, 1131-1157) are "a" + "t" rich and
            show resemblance to SOS boxes. IR-1 an IR-4 (673-708 and 1072-1107)
            
            
            show attenuator-like structure. They may attenuate transcription of
            the hic gene after SOS induction to a level more optimal for
            colicin release. IR-6 (1287-1332) has a terminator-like structure.
            
            A Shine-Delgarno sequence is present at positions 376-379 between
            the colE3 and immE3 genes. The putative ribosome binding site for
            the hic gene can be found at 1117-1122.
            
            [1] refers to colicin E3 as protein A and the immunity protein as
            protein B. [2] refers to the colE3 gene as the ceaC gene and the
            immE3 gene as the ceaC gene.
            
            Draft entry and clean copy sequence kindly provide by R.J.Watson,
            May 1985 [3].
FEATURES       from  to/span     description
    pept    <     1      375     colicin E3 (AA at 1)
    pept        385      642     immunity protein-E3
    pept        803     1066     immE8 protein (putative; gtg start codon)
    pept       1128     1271     lysis protein (putative)
    revision    755      756     gc in [4]; cg in [3]
    revision   1265     1267     gca in [4]; ga in [3]
    revision   1348     1349     tt in [4]; ttt in [3]
BASE COUNT      473 a    214 c    352 g    416 t
ORIGIN      150 bp upstream of Sau3A site.
        1 gctatggaaa gcaggaagaa gaaagaagat aagaaaagga gtgctgaaaa taatttaaac
       61 gatgaaaaga ataagcccag aaaaggtttt aaagattacg ggcatgatta tcatccagct
      121 ccgaaaactg agaatattaa agggcttggt gatcttaagc ctgggatacc aaaaacacca
      181 aagcagaatg gtggtggaaa acgcaagcgc tggactggag ataaagggcg taagatttat
      241 gagtgggatt ctcagcatgg tgagcttgag gggtatcgtg ccagtgatgg tcagcatctt
      301 ggctcatttg accctaaaac aggcaatcag ttgaaaggtc cagatccgaa acgaaatatc
      361 aagaaatatc tttgagagga agttatggga cttaaattgg atttaacttg gtttgataaa
      421 agtacagaag attttaaggg tgaggagtat tcaaaagatt ttggagatga cggttcagtt
      481 atggaaagtc taggtgtgcc ttttaaggat aatgttaata acggttgctt tgatgttata
      541 gctgaatggg tacctttgct acaaccatac tttaatcatc aaattgatat ttccgataat
      601 gagtattttg tttcgtttga ttatcgtgat ggtgattggt gatcaaatat tatcagggat
      661 gagttgatat acgggcttct agtgttcatg gatgaacgct ggagcctcca aatgtagaaa
      721 tgttatattt tttattgagt tcttggttat aattgctccg caatgattta aataagcatt
      781 atttaaaaca ttctcaggag aggtgaaggt ggagctaaaa aaaagtattg gtgattacac
      841 tgaaaccgaa ttcaaaaaat ttattgaaga catcatcaat tgtgaaggtg atgaaaaaaa
      901 acaggatgat aacctcgagt attttataaa tgttactgag catcctagtg gttctgatct
      961 gatttattac ccagaaggta ataatgatgg tagccctgaa ggtgttatta aagagattaa
     1021 agaatggcga gccgctaacg gtaagtcagg atttaaacag ggctgaaata tgaatgccgg
     1081 ttgtttatgg atgaatggct ggcattcttt cacaacaagg agtcgttatg aaaaaaataa
     1141 cagggattat tttattgctt cttgcagtca ttattctgtc tgcatgtcag gcaaactata
     1201 tccgggatgt tcagggcggg accgtatctc cgtcatcaac agctgaagtg accggattag
     1261 caacgcagta acccgaaatc ctctttgaca aaaacaaagc gtgtcaggct gattctgatg
     1321 cgcttttttt ttgaaatgtc acaaaaattc catgtgggag atgggatcta aaatcctcgt
     1381 gcagaacttt ccatccaggg ggagaaaact tgtcgttttg agccgttcgg tgttcagaac
     1441 gcacgaaacc gatcg
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       BLCNNS        961 bp ss-RNA             VRL       15-MAR-1990
DEFINITION  Bunyamwera virus small RNA segment, N and NSs protein.
ACCESSION   D00353
KEYWORDS    N protein; NSs protein; SRNA; nonstructural protein;
            nucleocapsid protein; small RNA.
SOURCE      Bunyamwera virus, cDNA to viral RNA, clones pBU[NS14,N3/59,N308,
            N309,N93].
  ORGANISM  Bunyamwera virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Bunyaviridae; Bunyavirus.
REFERENCE   1  (bases 1 to 961)
  AUTHORS   Elliott,R.M.
  TITLE     Nucleotide sequence analysis of the small(S) RNA segment of
            Bunyamwera virus, the prototype of the family Bunyaviridae
  JOURNAL   J. Gen. Virol. 70, 1281-1285 (1989)
  STANDARD  full staff_entry
COMMENT     Submitted in computer readable form by R.M. Elliott on 18-Jan-1989.
            The virus contains the negative sense strand; the positive strand
            is shown below.
FEATURES       from  to/span     description
    pept         86      787     N protein
    pept        105      410     NSs protein
BASE COUNT      298 a    187 c    215 g    261 t
ORIGIN      
        1 agtagtgtac tccacactac aaacttgcta ttgttgaaaa tcgctgtgct attaaatcca
       61 acagaaggtc attaaaggct ctttaatgat tgagttggaa tttcatgatg tcgctgctaa
      121 caccagcagt acttttgacc cagaggtcgc atacgctaac tttaagcgtg tccacaccac
      181 tgggcttagt tatgaccaca tacgaatctt ctacattaaa ggacgcgaga ttaaaactag
      241 tctcgcaaaa agaagtgaat gggaagttac acttaacctt gggggctgga agattactgt
      301 atataatacg aattttcctg gcaaccggaa caacccagtt cctgacgatg gtcttaccct
      361 ccaccgcctc agtggattcc ttgccaggta cctacttgag aagatgctga aagtcagtga
      421 accagagaaa ttgattatta aatcaaaaat aatcaaccct ttggctgaaa agaatgggat
      481 cacttggaat gatggagagg aagtttatct ctctttcttc ccaggatcag agatgttctt
      541 aggaactttc agattctacc ccttagcaat cgggatctac aaagttcagc gcaaggaaat
      601 ggaaccaaaa taccttgaga aaacaatgcg gcagaggtac atgggactag aagcagcaac
      661 ttggactgtt agtaaattga cagaagttca gtctgcactg acagttgtct ctagcttagg
      721 ttggaagaaa accaatgtta gtgcagctgc cagggacttc cttgctaaat tcggaatcaa
      781 catgtaagca gggatgcatt tttaatcggg ctaaagtcat ctgttttaat ttggctaaaa
      841 gggttgtttc aacccacaaa ataacagctg cttgggtggg tggttgggga cagaaagaca
      901 gcgggctaaa tcaacattat attgttaatg gtattttaag ttttaggtgg agcacactac
      961 t
//
LOCUS       DEN2NGC      2357 bp ss-RNA             VRL       15-MAR-1990
DEFINITION  Dengue virus type 2 (New Guinea C strain), cDNA to genomic RNA.
ACCESSION   D00346
KEYWORDS    E protein; M protein; prM protein; structural protein.
SOURCE      Dengue virus type 2 (New Guinea C strain), cDNA to genomic RNA.
  ORGANISM  Dengue virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Flaviviridae; Flavivirus (arbovirus group B).
REFERENCE   1  (bases 1 to 2357)
  AUTHORS   Gruenberg,A., Woo,W.S., Biedrzycka,A. and Wright,P.J.
  TITLE     Partial Nucleotide Sequence and Deduced Amino Acid Sequence of the
            Structural Proteins of Dengue Virus Type 2, New Guinea C and
            PUO-218 Strains;
  JOURNAL   J. Gen. Virol. 69, 1391-1398 (1988)
  STANDARD  simple staff_review
COMMENT     Nucleotide 1 in the NGC sequence corresponds to nucleotide 77
            counting from the 5'end of the DEN-2(JAM) sequence.
FEATURES       from  to/span     description
    pept         21  >  2357     viral polyprotein
    matp         21      362     C protein
    matp        363      860     prM protein
    matp        636      860     M protein
    matp        861     2345     E protein
    matp       2346     2357     NS 1 protein (amino end)
BASE COUNT      782 a    471 c    595 g    509 t
ORIGIN      20bp upstream from the C protein amino terminal end
        1 aattagagag cagatctctg atgaataacc aacgaaaaaa ggcgagaaat acgcctttca
       61 atatgctgaa acgcgagaga aaccgcgtgt cgactgtaca acagctgaca aagagattct
      121 cacttggaat gctgcaggga cgaggaccat taaaactgtt catggccctg gtggcgttcc
      181 ttcgtttcct aacaatccca ccaacagcag ggatactgaa gagatgggga acaattaaaa
      241 aatcaaaagc cattaatgtt ttgagagggt tcaggaaaga gattggaagg atgctgaaca
      301 tcttgaacag gagacgcaga actgcaggca tgatcattat gctgattcca acagtgatgg
      361 cgttccattt aaccacacgt aacggagaac cacacatgat cgtcagtaga caagagaaag
      421 ggaaaagtct tctgtttaaa acagaggatg gtgtgaacat gtgtaccctc atggccatgg
      481 accttggtga attgtgtgaa gatacaatca cgtacaagtg tccttttctc aggcagaatg
      541 aaccagaaga catagattgt tggtgcaact ctacgtccac atgggtaact tatgggacgt
      601 gtaccaccac aggagaacac agaagagaaa aaagatcagt ggcactcgtt ccacatgtgg
      661 gaatgggact ggagacacga actgaaacat ggatgtcatc agaaggggcc tggaaacatg
      721 cccagagaat tgaaacttgg atcttgagac atccaggctt taccataatg gcagcaatcc
      781 tggcatacac cataggaacg acacatttcc aaagagccct gattttcatc ttactgacag
      841 ctgtcgctcc ttcaatgaca atgcgttgca taggaatatc aaatagagac tttgtagaag
      901 gggtttcagg aggaagctgg gttgacatag tcttagaaca tggaagctgt gtgacgacga
      961 tggcaaaaaa caaaccaaca ttggattttg aactgataaa aacagaagcc aaacaacctg
     1021 ccactctaag gaagtactgt atagaggcaa agctgaccaa cacaacaaca gattctcgct
     1081 gcccaacaca aggagaaccc agcctaaatg aagagcagga caaaaggttc gtctgcaaac
     1141 actccatggt ggacagagga tggggaaatg gatgtggatt atttggaaaa ggaggcattg
     1201 tgacctgtgc tatgttcaca tgcaaaaaga acatgaaagg aaaagtcgtg caaccagaaa
     1261 acttggaata caccattgtg ataacacctc actcagggga agagcatgca gtcggaaatg
     1321 acacaggaaa acatggcaag gaaatcaaaa taacaccaca gagttccatc acagaagcag
     1381 agttgacagg ctatggcact gtcacgatgg agtgctctcc gagaacgggc ctcgacttca
     1441 atgagatggt gttgctgcaa atggaaaata aagcttggct ggtgcacagg caatggttcc
     1501 tagacctgcc gttgccatgg ctgcccggag cggacacaca aggatcaaat tggatacaga
     1561 aagagacatt ggtgactttc aaaaatcccc atgcgaagaa acaggatgtt gttgttttgg
     1621 gatcccaaga aggggccatg cacacagcac tcacaggggc cacagaaatc cagatgtcat
     1681 caggaaactt actgttcaca ggacatctca agtgcaggct gaggatggac aaactacagc
     1741 tcaaaggaat gtcatactct atgtgcacag gaaagtttaa agttgtgaag gaaatagcag
     1801 aaacacaaca tggaacaata gttatcagag tacaatatga aggggacggt tctccatgta
     1861 agatcccttt tgagataatg gatttggaaa aaagacatgt tttaggtcgc ctgattacag
     1921 tcaacccaat cgtaacagaa aaagatagcc cagtcaacat agaagcagaa cctccattcg
     1981 gagacagcta catcatcata ggagtagagc cgggacaatt gaagctcaac tggtttaaga
     2041 aaggaagttc tatcggccaa atgattgaga caacaatgag gggagcgaag agaatggcca
     2101 ttttaggtga cacagcttgg gattttggat ccctgggagg agtgtttaca tctataggaa
     2161 aggctctcca ccaagttttc ggagcaatct atggggctgc cttcagtggg gtctcatgga
     2221 ctatgaaaat cctcatagga gtcattatca catggatagg aatgaattca cgcagcacct
     2281 cactttctgt gtcactagta ttggtgggag tcgtgacgct gtatttggga gttatggtgc
     2341 aggccgatag tggttgc
//
LOCUS       HS1IRLULR   13052 bp ds-DNA             VRL       15-MAR-1990
DEFINITION  Herpes simplex virus type 1 (HSV-1) genome, rightmost part of the
            long unique region (UL) and all of the internal long repeat region
            (IRL).
ACCESSION   D00374
KEYWORDS    IE110; IE63; UL54; UL55; UL56; immediate-early protein;
            internal long repeat region; long repeat region;
            long unique region; transcriptional activator;
            transcriptional modulating protein.
SOURCE      HSV-1 (strain 17) DNA, clones BamHI b, XhoI c, BamHI k, HpaI s plus
            v.
  ORGANISM  Herpes simplex virus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 13052)
  AUTHORS   Perry,L.J. and McGeoch,D.J.
  TITLE     The DNA sequences of the long repeat region and adjoining parts of
            the long unique region in the genome of herpes simplex virus type 1
  JOURNAL   J. Gen. Virol. 69, 2831-2846 (1988)
  STANDARD  full staff_entry
COMMENT     There were two small divergences within the two versions of the UL
            proximal part of RL (discussed in [1]).
FEATURES       from  to/span     description
    pept        413     1951     immediate-early transcriptional modulating
                                 protein IE63 (gene UL54)
    ORF        2175     2735     ORF of gene UL55
    ORF        3602     3009 (c) ORF of gene UL56
    pept      10787    10731 (c) IE110 exon 1
               9965     9299 (c) IE110 exon 2
               9162     7559 (c) IE110 exon 3
    mRNA        275     1974     IE63 mRNA
    pre-msg   10935     7350 (c) IE110 mRNA and introns
    IVS       10730     9967 (c) IE110 intron 1
    IVS        9298     9163 (c) IE110 intron 2
    rpt        3837     4017     reiteration set 1
    rpt        4224     4244     reiteration set 2
    rpt        4465     4496     reiteration set 3
    rpt        7170     7317     reiteration set 4
    rpt       10422    10583     reiteration set 5
    rpt       12007    12060     reiteration set 6
    rpt       12730    12952     reiteration set 7
    refnumbr      1        1     numbered 113322 in [1]
    signal      247      251     TATA box
    signal     3792     3786 (c) TATA box
    signal    10962    10958 (c) TATA box
    signal     1956     1961     polyadenylation signal
    signal     2777     2782     polyadenylation signal
    signal     2880     2875 (c) polyadenylation signal
    signal     2884     2879 (c) polyadenylation signal
    signal     7372     7367 (c) polyadenylation signal
    signal     7412     7407 (c) polyadenylation signal
    variant    1055     1062     eight c residues in HpaI s plus v clone; seven
                                 c residues in BamHI b clone
    site       3836     3836     end of UL
BASE COUNT     1933 a   4879 c   4243 g   1997 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcccaac gaccccgccc atgggtccca attggccgtc ccgttaccaa gaccaaccca
       61 gccagcgtat ccacccccgc ccgggtcccc gcggaagcgg aacggggtat gtgatatgct
      121 aattaaatac atgccacgta cttatggtgt ctgattggtc cttgtctgtg ccggaggtgg
      181 ggcgggggcc ccgcccgggg ggcggaacga ggaggggttt gggagagccg gccccggcac
      241 cacgggtata aggacatcca ccacccggcc ggtggtggtg tgcagccgtg ttccaaccac
      301 ggtcacgctt cggtgcctct ccccgattcg ggcccggtcg ctcgctaccg gtgcgccacc
      361 accagaggcc atatccgaca ccccagcccc gacggcagcc gacagcccgg tcatggcgac
      421 tgacattgat atgctaattg acctcggcct ggacctctcc gacagcgatc tggacgagga
      481 cccccccgag ccggcggaga gccgccgcga cgacctggaa tcggacagca gcggggagtg
      541 ttcctcgtcg gacgaggaca tggaagaccc ccacggagag gacggaccgg agccgatact
      601 cgacgccgct cgcccggcgg tccgcccgtc tcgtccagaa gaccccggcg tacccagcac
      661 ccagacgcct cgtccgacgg agcggcaggg ccccaacgat cctcaaccag cgccccacag
      721 tgtgtggtcg cgcctcgggg cccggcgacc gtcttgctcc cccgagcagc acgggggcaa
      781 ggtggcccgc ctccaacccc caccgaccaa agcccagcct gcccgcggcg gacgccgtgg
      841 gcgtcgcagg ggtcggggtc gcggtggtcc cggggctgcc gatggtttgt cggacccccg
      901 ccggcgtgcc cccagaacca atcgcaaccc tgggggaccc cgccccgggg cggggtggac
      961 ggacggcccc ggcgcccccc atggcgaggc gtggcgcggc agtgagcagc ccgacccacc
     1021 cggaggccag cggacacggg gcgtgcgcca agcacccccc ccgctaatga cgctggcgat
     1081 tgcccccccg cccgcggacc cccgcgcccc ggccccggag cgaaaggcgc ccgccgccga
     1141 caccatcgac gccaccacgc ggttggtcct gcgctccatc tccgagcgcg cggcggtcga
     1201 ccgcatcagc gagagctttg gccgcagcgc acaggtcatg cacgacccct ttggggggca
     1261 gccgtttccc gccgcgaata gcccctgggc cccggtgctg gcgggccaag gagggccctt
     1321 tgacgccgag accagacggg tctcctggga aaccttggtc gcccacggcc cgagcctcta
     1381 tcgcactttt gccggcaatc ctcgggccgc atcgaccgcc aaggccatgc gcgactgcgt
     1441 gctgcgccaa gaaaatttca tcgaggcgct ggcctccgcc gacgagacgc tggcgtggtg
     1501 caagatgtgc atccaccaca acctgccgct gcgcccccag gaccccatta tcgggacgac
     1561 cgcggctgtg ctggataacc tcgccacgcg cctgcggccc tttctccagt gctacctgaa
     1621 ggcgcgaggc ctgtgcggcc tggacgaact gtgttcgcgg cggcgtctgg cggacattaa
     1681 ggacattgca tccttcgtgt ttgtcattct ggccaggctc gccaaccgcg tcgagcgtgg
     1741 cgtcgcggag atcgactacg cgacccttgg tgtcggggtc ggagagaaga tgcatttcta
     1801 cctccccggg gcctgcatgg cgggcctgat cgaaatccta gacacgcacc gccaggagtg
     1861 ttcgagtcgt gtctgcgagt tgacggccag tcacatcgtc gcccccccgt acgtgcacgg
     1921 caaatatttt tattgcaact ccctgtttta ggtacaataa aaacaaaaca tttcaaacaa
     1981 atcgcccctc gtgttgtcct tctttgctca tggccggcgg ggcgtgggtc acggcagatg
     2041 gcgggggtgg gcccggcgta cggcctgggt gggcggaggg aactaaccca acgtataaat
     2101 ccgtccccgt tccaaggccg gtgtcatagt gcccttagga gcttcccgcc cgggcgcatc
     2161 cccccttttg cactatgaca gcgacccccc tcaccaacct gttcttacgg gccccggaca
     2221 taacccacgt ggccccccct tactgcctca acgccacctg gcaggccgaa acggccatgc
     2281 acaccagcaa aacggactcc gcttgcgtgg ccgtgcggag ttacctggtc cgcgcctcct
     2341 gtgagaccag cggcacaatc cactgctttt tctttgcggt atacaaggac acccaccaca
     2401 cccctccgct gattaccgag ctccgcaact ttgcggacct ggttaaccac ccgccggtcc
     2461 tacgcgaact ggaggataag cgcggggtgc ggctgcggtg tgcgcggccg tttagcgtcg
     2521 ggacgattaa ggacgtctct gggtccggcg cgtcctcggc gggagagtac acgataaacg
     2581 ggatcgtgta ccactgccac tgtcggtatc cgttctcaaa aacatgctgg atgggggcct
     2641 ccgcggccct acagcacctg cgctccatca gctccagcgg catggccgcc cgcgcggcag
     2701 agcatcgacg cgtcaagatt aaaattaagg cgtgatctcc aaccccccca tgaatgtgtg
     2761 taaccccccc caaaaaaata aagagccgta acccaaccaa accaggcgtg gtgtgagttt
     2821 gtggacccaa agccctcaga gacaacgcga caggccagta tggaccgtga tacttttatt
     2881 tattaactca caggggcgct taccgccaca ggaataccag aataatgacc accacaatcg
     2941 cgaccacccc aaatacagca tggcgccaca ccacgccaca acagccctgt cgccggtatg
     3001 gggcatgatc agacgagccg cgccgcgcgt tgggccctgt acagctcgcg cgaattgacc
     3061 ctaggaggcc gccacgcgcc cgagttttgc gttcgtcgct ggtcgtcggg cgccaaagcc
     3121 ccggacggct gttcggtcga acgaacggcc acgacagtgg cataggttgg ggggtggtcc
     3181 gacatagcct cggcgtacgt cgggaggccc gacaagaggt cccttgtgat gtcgggtggg
     3241 gccacaagcc tggtttccgg aagaaacagg ggggttgcca ataacccgcc agggccaaaa
     3301 ctccggcgct gcgcacgtcg ttcggcgcgg cgccgggcgc gccgagcggc tcgctgggcg
     3361 gcttggcgtg agcggccccg ctccgacgcc tcgccctctc cggaggaggt tggcggaatt
     3421 ggcacggaca acaggggccc agcagagtac ggtggaggtg ggtccgtggg ggtgtccaga
     3481 tcaataacga caaacggccc ctcgttccta ccagacaagc tatcgtaggg gggcggggga
     3541 tcagcaaacg cgttccccgc gctccataaa cccgcgtcgg gttgcgccgc ctccgaagcc
     3601 atggatgcgc cccaaagcca cgactcccgc gcgctaggtc cttggggtaa tggaaaaggc
     3661 cctactcccc atccaagcca gccaagttaa cgggctacgc cttcgggaat gggactggca
     3721 ccccggcgga ttttgttggg ctggcatgcg tcgcccaacc gagggccgcg tccacgggac
     3781 gcgcctttta taaccccggg ggtcattccc aacgatcaca tgcaatctaa ctggctcccc
     3841 tctccccccc tctcccctct ccccccctct cccctctccc cccctctccc ctctcccccc
     3901 ctctcccctc tccccccctc tcccctctcc ccccctctcc cctctccccc cctctcccct
     3961 ctccccccct ctcccctctc cccccctctc ccctctcccc ccctctcccc tctcccctct
     4021 gctctttccc cgtgacaccc gacgctgggg gcgtggctgc cgggaggggc cgcggatggg
     4081 cgggcctact tggtttcccg cccccccccc ccccccccga accgccccgc cggctttgcc
     4141 cccctttgat cccctgctac ccccaacccg tgctggtggt gcgggttggg gggggatgtg
     4201 ggcgggggtg cgcgggaggt gtcggtggtg gtggtggtgg tggtagtagg aatggtggtg
     4261 aggggggggg ggcgctggtt ggtcaaaaaa gggagggacg ggggccggca gaccgacggc
     4321 gacaacgctc cccggcggcc gggtcgcggc tcttacgagc ggcccggccc gcgctcccac
     4381 cccccgggcc gtgtccttgc tttccccccg tctccccccc ccccgccttc tcctcctcct
     4441 cctcgttttt ccaaaccccg cccacccggc ccggcccggc ccggcccggc ccggccaccg
     4501 ccgcccaccc acccacctcg ggatacccag ccccggtccc ccgttccccg ggggccgtta
     4561 tctccagcgc cccgtccggc gcgccgcccc ccgccgctaa accccatccc gcccccggga
     4621 ccccacatat aagcccccag ccacacgcaa gaacagacac gcagaacggc tgtgtttatt
     4681 taaataaacc aatgtcggaa taaacaaaca caaacacccg cgacgggggg acggagggga
     4741 cggagggagg gggtgacggg ggacgggaac agacacaaaa acaaccacaa aaaacaacca
     4801 cccaccgaca cccccacccc agtctcctcg ccttctccca cccaccccac gcccccactg
     4861 agcccggtcg atcgacgagc acccccgccc acgcccccgc ccctgccccg gcgacccccg
     4921 gcccgcacga tcccgacaac aataacaacc ccaacggaaa gcggcggggt gttgggggag
     4981 gcgaggaaca accgagggga acgggggatg gaaggacggg aagtggaagt cctgataccc
     5041 atcctacacc cccctgcctt ccaccctccg gccccccgcg agtccacccg ccggccggct
     5101 accgagaccg aacacggcgg ccgccgcagc cgccgcagcc gccgccgaca ccgcagagcc
     5161 ggcgcgcgca ctcacaagcg gcagaggcag aaaggcccag agtcattgtt tatgtggccg
     5221 cgggccagca gacggcccgc gacacccccc ccccgcccgt gtgggtatcc ggccccccgc
     5281 cccgcgccgg tccattaagg gcgcgcgtgc ccgcgagata tcaatccgtt aagtgctctg
     5341 cagacagggg caccgcgccc ggaaatccat taggccgcag acgaggaaaa taaaattaca
     5401 tcacctaccc acgtggtgct gtggcctgtt tttgctgcgt catctcagcc tttataaaag
     5461 cgggggcgcg gccgtgccga tcgcgggtgg tgcgaaagac tttccgggcg cgtccgggtg
     5521 ccgcggctct ccgggccccc ctgcagccgg ggcggccaag gggcgtcggc gacatcctcc
     5581 ccctaagcgc cggccggccg ctggtctgtt ttttcgtttt ccccgtttcg ggggtggtgg
     5641 gggttgcggt ttctgtttct ttaacccgtc tggggtgttt ttcgttccgt cgccggaatg
     5701 tttcgttcgt ctgtcccctc acggggcgaa ggccgcgtac ggcccgggac gaggggcccc
     5761 cgaccgcggc ggtccgggcc ccgtccggac ccgctcgccg gcacgcgacg cgaaaaaggc
     5821 cccccggagg cttttccggg ttcccggccc ggggcctgag atgaacactc ggggttaccg
     5881 ccaacggccg gcccccgtgg cggcccggcc cggggccccg gcggacccaa ggggccccgg
     5941 cccggggccc cacaacggcc cggcgcatgc gctgtggttt ttttttcctc ggtgttctgc
     6001 cgggctccat cgcctttcct gttctcgctt ctcccccccc ccttcttcac ccccagtacc
     6061 ctcctccctc ccttcctccc ccgttatccc actcgtcgag ggcgccccgg tgtcgttcaa
     6121 caaagacgcc gcgtttccag gtaggttaga cacctgcttc tccccaatag agggggggga
     6181 cccaaacgac agggggcgcc ccagaggcta aggtcggcca cgccactcgc gggtgggctc
     6241 gtgttacagc acaccagccc gttcttttcc ccccctccca cccttagtca gactctgtta
     6301 cttacccgtc cgaccaccaa ctgccccctt atctaagggc cggctggaag accgccaggg
     6361 ggtcggccgg tgtcgctgta accccccacg ccaatgaccc acgtactcca agaaggcatg
     6421 tgtcccaccc cgcctgtgtt tttgtgcctg gctctctatg cttgggtctt actgcctggg
     6481 gggggggagt gcgggggagg gggggtgtgg aaggaaatgc acggcgcgtg tgtacccccc
     6541 ctaaagttgt tcctaaagcg aggatacgga ggagtggcgg gtgccggggg accggggtga
     6601 tctctggcac gcgggggtgg gaagggtcgg gggagggggg gatggagtac cggcccacct
     6661 ggccgcgcgg gtgcgcgtgc ctttgcacac caaccccacg tcccccggcg gtctctaaga
     6721 agcaccgccc cccctccttc ataccaccga gcatgcctgg gtgtgggttg gtaaccaaca
     6781 cgcccatccc ctcgtctcct gtgattctct ggctgcaccg cattcttgtt ttctaactat
     6841 gttcctgttt ctgtctcccc cccccccacc cctccgcccc accccccaac acccacgtct
     6901 gtggtgtggc cgaccccctt ttgggcgccc cgtcccgccc cgccacccct cccatccttt
     6961 gttgccctat agtgtagtta accccccccg ccctttgtgg cggccagagg ccaggtcagt
     7021 ccgggcgggc aggcgctcgc ggaaacttaa cacccacacc caacccactg tggttctggc
     7081 tccatgccag tggcaggatg ctttcgggga tcggtggtca ggcagcccgg gccgcggctc
     7141 tgtggttaac accagagcct gcccaacatg gcacccccac tcccacgcac ccccactccc
     7201 acgcaccccc actcccacgc acccccactc ccacgcaccc ccactcccac gcacccccac
     7261 tcccacgcac ccccactccc acgcaccccc actcccacgc acccccactc ccacgcatcc
     7321 ccgcgataca tccaacacag acagggaaaa gatacaaaag taaaccttta tttcccaaca
     7381 gacagcaaaa atcccctgag ttttttttta ttagggccaa cacaaaagac ccgctggtgt
     7441 gtggtgcccg tgtctttcac ttttcccctc cccgacacgg attggctggt gtagtgggcg
     7501 cggccagaga ccacccagcg cccgaccccc ccctccccac aaacacgggg ggcgtccctt
     7561 attgttttcc ctcgtcccgg gtcgacgccc cctgctcccc ggaccacggg tgccgagacc
     7621 gcaggctgcg gaagtccagg gcgcccacta gggtgccctg gtcgaacagc atgttcccca
     7681 cgggggtcat ccagaggctg ttccactccg acgcgggggc cgtcgggtac tcggggggca
     7741 tcacgtggtt acccgcggtc tcggggagca gggtgcggcg gctccagccg gggaccgcgg
     7801 cccgcagccg ggtcgccatg tttcccgtct ggtccaccag gaccacgtac gccccgatgt
     7861 tccccgtctc catgtccagg atgggcaggc agtcccccgt gatagtcttg ttcacgtaag
     7921 gcgacagggc gaccacgcta gagacccccg agatgggcag gtagcgcgtg aggccgcccg
     7981 cggggacggc cccggaagtc tccgcgtggc gcgtcttccg ggcacacttc ctcggccccc
     8041 gcggcccaga agcagcgcgg gggccgaggg aggtttcctc ttgtctccct cccagggcac
     8101 cgacggcccc gcccgaggag gcggaagcgg aggaggacgc ggccccggcg gcggaagagg
     8161 cggcccccgc gggggtcggg gccgaggagg aagaggcaga ggaggaagag gcggaggccg
     8221 ccgaggacgt caggggggtc ccgggcccac cctggccgcg cccccccggc cctgagtcgg
     8281 agggggggtg cgtcgccgcc ctcttggccc ctgccggcgc gaggggggga cgcgtggact
     8341 ggggggaggg gttttcctgg cccgacccgc gcctcttcct cggacgcacc gccgcctcct
     8401 gctcgacaga ggcggcggag gggagcgggg cggcgccgga gggggcggcg ccgcgggagg
     8461 gcccgtgccc accctccacg cccggccccc ccgagccgcg cgccaccgtc gcacgcgccc
     8521 ggcacagact ctgttcttgg ttcgcggcct gagccaggga cgagtgcgac tggggcacac
     8581 ggcgcgcgtc cgcggggcgg gcggccggct ccgccccggg ggccggggcg cgggggccgg
     8641 gccccggagg cggcgctcgc acgcacgggg ccacggccgc gcgggggcgc gcgggtcccg
     8701 acgcggccgc ggacgcgggg ggcccggggc ggggggcgga gcctggcatg ggcgccgcgg
     8761 ggggcctgtg gggagaggcc gggggggagt cgctgatcac tatggggtct ctgttgtttg
     8821 caaggggggc gggtctgttg acaagggggc ccgtccggcc cctcggccgc cccgcctccg
     8881 cttcaacaac cccaacccca accccaaccc ccccggaggg gccagacgcc ccccgcggcg
     8941 ccgcggctcg cgactggcgg gagccgccgc cgccgctgct gttggtggtg gtgttggtgt
     9001 tactgctgcc gtgtggcccg atgggcgccg aggggggcgc tgtccgagcc gcggccggct
     9061 ggggggctgc gtgagacgcc ccgcccgtca cggggggcgc ggcggcgcct ctgcgtgggg
     9121 gggcgcgggg cgtccggcgg ggggcgggcg gtacgtagtc tgctgcaaga gacaacgggg
     9181 ggcgcgatca ggttacgccc cctccccggc ccgccctttc ctcgcccgcc cgcctattcc
     9241 tccctccccc cccctcctcc tcctcctccc ccagggtcct tgccgccccc cgcctcaccg
     9301 tcgtccaggt cgtcgtcatc ctcgtccgtg gtgggctccg ggtgggtggg cgacagggcc
     9361 ctcaccgtgt gcccccccag ggtcaggtac cgcggggcga accgctgatt gcccgtccag
     9421 ataaagtcca cggccgtgcc cgccctgacg gcctcctcgg cctccatgcg ggtctggggg
     9481 tcgttcacga tcgggatggt gctgaacgac ccgctgggcg tcacgcccac tatcaggtac
     9541 accagcttgg cgttgcacag cgggcaggtg ttgcgcaatt gcatccaggt tttcatgcac
     9601 gggatgcaga agcggtgcat gcacgggaag gtgtcgcagc gcaggtgggg cgcgatctca
     9661 tccgtgcaca cggcgcacac gtcgccctcg tcgctccccc cgtcctctcg agggggggcg
     9721 cccccgcaac tgccggggtc ttcctcgcgg ggggggctcc cccccgagac cgccccccca
     9781 tccacgccct gcggccccag cagccccgtc tcgaacagtt ccgtgtccgt gctgtccgcc
     9841 tcggaggcgg agtcgtcgtc atggtggtcg gcgtcccccc gcccccccac ttcggtctcc
     9901 gcctcagagt cgctgctgtc cggcaggtct cggtcgcagg gaaacaccca gacatccggg
     9961 gcgggctaag gggaaaaaag gggggcgggt aagaatgggg ggggatttcc cgcgtcaatc
    10021 agcacccacg agttccccct ctcccccccc cgcctcacaa agtcctgccc ccctgctggc
    10081 ctcggaagag gggggagaaa ggggtctgca accaaaggtg gtctgggtcc gtcctttgga
    10141 tcccgacccc tcttcttccc tcttctcccg ccctccagac gcaccggagt cgggggtccc
    10201 acggcgtccc ccaaatatgg cgggcggctc ctccccaccc ccctagatgc gtgtgagtaa
    10261 ggggggcctg cgtatgagtc agtggggacc acgcccccaa cacggcgacc ccggtccttg
    10321 tgtgtttgtt gtgggggcgt gtctctgtgt atgagtcagg gggtcccacg gcgaccccgg
    10381 gccctgcgtc tgagtcaaag gggccatgtg tatgtgttgg gggtctgtat atataaagtc
    10441 agggggtcac atggcgaccc ccaacagggc gaccccggtc cctgtatata tagggtcagg
    10501 gggttccgca ccccctaaca tggcgccccc ggtccctgta tatatagtgt cacggggttc
    10561 cacgccccct aacatggcgc cccaacatgg cgcccggctc ccgtgtatga gtgggggtcc
    10621 cccaacatgg cggccggttc cagtgtaagg gtcgggggtc ccccaacatg gcgcccccca
    10681 atatggcgcc ccccaatatg gcgccccaga catggcgccc ggcccctcac ctcgcgctgg
    10741 gggcggccct caggccggcg ggtactcgct ccggggcggg gctccatggg ggtcgtatgc
    10801 ggctggaggg tcgcggacgg agggtccctg ggggtcgcaa cgtaggcggg gcttctgtgg
    10861 tgatgcggag agggggcggc ccgagtctgc ctggctgctg cgtctcgctc cgagtgccga
    10921 ggtgcaaatg cgaccagact gtcgggccag ggctaactta taccccacgc ctttcccctc
    10981 cccaaagggg cggcagtgac gattccccca atggccgcgc gtcccagggg aggcaggccc
    11041 accgcggggc ggccccgtcc ccggggacca acccggcgcc cccaaagaat atcattagca
    11101 tgcacggccc ggcccccgat ttgggggccc aacccggtgt cccccaaaga accccattag
    11161 catgcccctc ccgccgacgc aacaggggct tggcctgcgt cggtgccccg gggcttcccg
    11221 ccttcccgaa gaaactcatt accatacccg gaaccccagg ggaccaatgc gggttcattg
    11281 agcgacccgc gggccaatgc gcgaggggcc gtgtgttccg ccaaaaaagc aattagcata
    11341 acccggaacc ccaggggagt ggttacgcgc ggcgcgggag gcggggaata ccggggttgc
    11401 ccattaaggg ccgcgggaat tgccggaagc gggaagggcg gccggggccg cccattaatg
    11461 agtttctaat taccataccg ggaagcggaa caaggcctct tgcaagtttt taattaccat
    11521 accgggaagt gggcggcccg gcccattggg cggtaactcc cgcccaatgg gccgggcccc
    11581 gaagactcgg cggacgctgg ttggccgggc cccgccgcgc tggcggccgc cgattggcca
    11641 gtcccgcccc cgaggcggcc cgccctgtga gggcgggctg gctccaagcg tatatatgcg
    11701 cggctcctgc catcgtctct ccggagagcg gcttggtgcg gagctcccgg gagctccgcg
    11761 gaagacccag gccgcctcgg gtgtaacgtt agaccgagtt cgccgggccg gctccgcggg
    11821 ccagggcccg ggcacgggcc tcgggcccca ggcacggccc gatgaccgcc tcggcctccg
    11881 ccacccggcg ccggaaccga gcccggtcgg cccgctcgcg ggcccacgag ccgcggcgcg
    11941 ccaggcgggc ggccgaggcc cagaccacca ggtggcgcac ccggacgtgg ggcgagaagc
    12001 gcacccgcgc gggggtcgcg ggggtcgcgg gggtcgcggg ggtcgcgggg gtcgcggggg
    12061 gctccggcgc cccctccccg cccgcgcgtc gcaggcgcag gcgcgccagg tgctccgcgg
    12121 tgacgcgcag gcggagggcg aggcgcggcg gaaggcggaa ggggcgcgag ggggggtggg
    12181 aggggtcagc cccgcccccc gggcccacgc cgggcggtgg gggcccgggg ggcggggggc
    12241 ggcggcggtg ggccgggcct ctggcgccga ctcgggcggg gggctgtccg gccagtcgtc
    12301 gtcatcgtcg tcgtcggacg cggactcggg aacgtggagc cactggcgca gcagcagcga
    12361 acaagaaggc gggggcccac cggcgggggg cggcggcggg gcggccgcgg gcgcgctcct
    12421 gaccgcgggt tccgagttgg gcgtggaggt tacctgggac tgtgcggttg ggacggcgcc
    12481 cgtgggcccg ggcggccggg ggcggcgggg gccgcgatgg cggcggcggc gggccatgga
    12541 gacagagagc gtgccggggt ggtagagttt gacaggcaag catgtgcgtg cagaggcgag
    12601 tagtgcttgc ctgtctaact cgctagtctc ggccgcgggg ggcccgggct gcccgccgcc
    12661 accgctttaa agggccgcgc gcgacccccg gggggtgtgt tttggggggg gcccgttttc
    12721 ggcgtctggc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg
    12781 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg
    12841 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg
    12901 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cccgcggccc
    12961 cgccccccac gcccgccgcg cgcgcgcacg ccgcccggac cgccgcccgc cttttttgcg
    13021 cgcgcgcgcg cccgcggggg gcccgggctg cc
//
LOCUS       HS5IE5KB1    2520 bp ds-DNA             VRL       15-DEC-1989
DEFINITION  Human cytomegalovirus genome, BamHI-HindIII fragment (5'-terminal
            part of the 5 kb transcript from the immediate-early region).
ACCESSION   D00328
KEYWORDS    immediate-early gene; transforming region.
SOURCE      Human cytomegalovirus (strain AD169) genomic DNA, clone pAT153
            provided by J. D. Oram and R. G. Downing.
  ORGANISM  Human cytomegalovirus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Betaherpesvirinae.
REFERENCE   1  (bases 1 to 2520)
  AUTHORS   Kouzarides,T., Bankier,A.T. and Barrell,B.G.
  TITLE     Nucleotide sequence of the transforming region of human
            cytomegalovirus
  JOURNAL   Mol. Biol. Med. 1, 47-58 (1983)
  STANDARD  full staff_entry
REFERENCE   2  (sites; 5 kb RNA start site)
  AUTHORS   Plachter,B., Traupe,B., Albrecht,J. and Jahn,G.
  TITLE     Abundant 5 kb RNA of human cytomegalovirus without a major
            translational reading frame
  JOURNAL   J. Gen. Virol. 69, 2251-2266 (1988)
  STANDARD  full staff_entry
COMMENT     In [2], the 5' end of the 5 kb RNA was determined by primer
            extension.  In [1], the BamHI-HindIII fragment was described as the
            sequence containing the region capable of transforming NIH3T3
            cells.
FEATURES       from  to/span     description
    site       1664     1664     5 kb RNA start site
BASE COUNT      647 a    713 c    589 g    571 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcccgca gcagtccgtt ggcggagtcc gaggagtgct gaccgccgct cccgccgccg
       61 ccaccgccac caccagcgcc gccgcctcca ccaccaccgg cagacgagga cgactttttg
      121 cgccgttctt cgtgacgctg ttcctgcctt cgccgttgct gttcctccgc agaagggccg
      181 tcgcgagtcc cgccgctgcc acccagcgga ggacacgcag acggcggaag cggtagacgc
      241 ggcgccgcaa ccaccgcctc cgctggagga ttcgccgtgg tttttcaggt aatgccgcac
      301 gtaagtcact tgcaaattac cgttctcgga aatcatggtg agcagcgcgc tctcattggg
      361 tccgctggag cccaccaccg aggagacgga tttgttgaag acgataccgc cgcgtacaaa
      421 gaggtgctcc tgcagctccc cgtcgcccgt aatgtcaata gacatgaagc cctgctgcgt
      481 cttggcgccg gccgaagcct cgccgtgctg cataatggta gcgcagagcc agcccttgtt
      541 gaggtgcagc accttgccat cgccgtccac gcagttgacc agacgcgcgg tatcgaagac
      601 gaactggcgc acgtcgaaag tctgattgac gctttgatgc aggatgcgat taggattcgc
      661 aaaagtccag tattttcgca cgacggtagt agggagatcc atgacgcggc ggcgcaaagc
      721 gcgagcgcaa cgctcgtcgg aggccgtgga gcgagtgccg ccgcagccgg cagagcgccg
      781 aaccccgtcg cagactctat ttatacatca tctttccagc ccgcctagca acacccacaa
      841 acaacgtcac gacgcaacgt ggttaaacag tacgtttatt aaagtaactg ggtgaacgac
      901 accggagcgg actgcaaatc gcaacgctac tttctcgagt gcagatactc ttcgagacgg
      961 ctaaacaacg tgtccaactc gctgagacct ttccgcgtgc cgttatccga tttcctttcc
     1021 gcctcctgag acagccgact aatcacggac ttatctccgc aacctaacag aggctgggag
     1081 cccgacaaaa gtaaaacagc gtgctatgaa cacgttgtta cctctgtgcg gacagcgccg
     1141 ccacagagac acttacacat tgccgcatgt ctttgtagat ggattctagc gtcgagcgca
     1201 tactatgcaa ttccgtcttg agtccgggat agacgtggtc gcctgcggga aacacgatct
     1261 ccagataccg cctcaacaac cagtccatga cgctgcatcc ccaacagcct ttgaccaccg
     1321 taccgtcgag ccacacggag tagtcgtcct cacgttgcta caagaggaaa actacgtcac
     1381 ccgacacgcg gaaaagaaag accgtcgcaa taaaccgtac ctacgtgacc taccaacgta
     1441 ggttttactc gatgaaaggt gacgcggaga tcttgcaatc tggtcgcgta atcctctgga
     1501 cgacactgcg gctttgtatt ctttatcgtc gtcgtcgccg gcttcgcctc ctcggaagcg
     1561 cctagaaaaa agacgatcag gaccagagag gaagagacca tcaccgacag catcgccgca
     1621 gcatgccgtc ccagtccgcc gcccaactgc gcgtcccagg taggtggtcc tttatgttat
     1681 gatgtttttg tcaatttttt ttttcaattt ctttcttccg cggttagaat agtttctgta
     1741 ggaaccaatt atcaatctga cgggttatcg tcaccacttg atggcaaaac gaaatttttt
     1801 ttttcattgc cttgaagtct ctcccgccac caccaccacc gccgttgtct ccggctggag
     1861 atcaagacga aattcctcct ctctaaaaaa aaaggtggtg ggcttaattg atcatggcaa
     1921 gaagaaaaac tatactgaat aaactgtgtg caaaactact agtaacaaca aaaatagcga
     1981 ctagatacac cacggacaat ctcagcagat actctctcaa aagaaaaaaa agacgccgta
     2041 acgtcggaga atctggtatc tactgcctga cgaatttttt tttcgtccat gtatgtgatt
     2101 acgagtagta gtggtatgta gaacaagaag aaaaatcgta gtccccaaaa ggataataaa
     2161 aataacactc atagagaatc acagattttc tctagacaac tctctatcca aataacgaat
     2221 gtgaagcgta caaagtaaga tattcaaaga atagcacctt catagattca tttcagcttt
     2281 ctactccttg taatttaaag ttgcactaaa caaagctctt aaagaaggtt cgagccgctc
     2341 tcgatcactc atcgatcacg cgagtcttat tattccacca caacgtaaca ttcttcactt
     2401 tgtagagaca ctttatcgta gagtaaccct cgatttccta gctgttgttt tttgattatt
     2461 ttgttcgctc taagagagat actcgaaatc ctacttacac caaggaccct acatcatcgc
//
LOCUS       HS5IE5KB2    1291 bp ds-DNA             VRL       15-DEC-1989
DEFINITION  Human cytomegalovirus genome, 3'-terminal part of the 5 kb
            transcript from the immediate-early region.
ACCESSION   D00327
KEYWORDS    immediate-early gene.
SOURCE      Human cytomegalovirus (strain AD169) genomic DNA, clones pGJ0.1,
            pGJ0.2, pGJ0.3, pGJ0.5, pGJ0.6, and pGJ0.7.
  ORGANISM  Human cytomegalovirus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Betaherpesvirinae.
REFERENCE   1  (bases 1 to 1291)
  AUTHORS   Plachter,B., Traupe,B., Albrecht,J. and Jahn,G.
  TITLE     Abundant 5 kb RNA of human cytomegalovirus without a major
            translational reading frame
  JOURNAL   J. Gen. Virol. 69, 2251-2266 (1988)
  STANDARD  full staff_entry
COMMENT     One of the predominant transcripts from the immediate early region
            is a 5 kb RNA.  This sequence analysis revealed multiple stop
            codons throughout the AT-rich potential coding region.
FEATURES       from  to/span     description
    RNA     <     1     1267     5 kb RNA (3'-terminal part) (alt.)
    RNA     <     1     1280     5 kb RNA (3'-terminal part) (alt.)
BASE COUNT      384 a    333 c    275 g    299 t
ORIGIN      15 bp upstream of EcoRI site.
        1 ctttttattt tttcgaattc atgttcgaaa acacaagctt ccataacaag aacccgtacc
       61 gaagaaaagt tccatcgact aaaaagaaaa aagaaaacga agcaagacct cgacgacaac
      121 aacacatcaa agaaagacga ccagctgatt atgttcttag aattccacac acccgcgagc
      181 cgatccgcaa acgtcgtgcg aggcgcgctt tctctggctc gacacaatga tcacaccgca
      241 cgctatagac acgtcgtcgt ggacgacgat gacctcaggc cacgaatgac aaccaacatg
      301 ggcaaagtcc aattagccaa aaagacgacg attctaagaa ttgatgaatc ctcgatatac
      361 gcctatcgat aggtttcaat tgtgtcatat acatcaaatg aaaaacagga cgcacgataa
      421 aagcttcctt acagcataac tgtaacatac gatcatggaa catctcctca catacctttc
      481 tcctctcaca taggaaaaca aaactctttt ttttctttcc tgtcaaggaa aaaatcaatg
      541 taccaccaca tcactttctc ctcggtcccg gcgacggatg ggcgcgcacg cggacaaaga
      601 cccaccggcc acttccactt attttttgtt gttaatcgtc ttctcccccg cacgcggacg
      661 accaccaacg ctagctgctc attccgtcaa ccagtcacac cgcgcacgga gaaggggccg
      721 gggtccgcgg gcacccgcgg cggaggcgcg gttccctctc tctaattccc tggaaaacaa
      781 gtaatgacaa acaaaaagac gacaaaaggt ctctattctg ctacatgaga gaaattatag
      841 ctgttggcaa tttttcaaaa tacatgttat aaggcatcct ctctgccaca cgcgcagtca
      901 cggataggat cagtgcgtat tcattataaa aaaaaacaca aacaacccat atatgtgaag
      961 cagaatgatg accgaccgca cggagcgacg ccgtcgactg tcagcctcgc gaggagacac
     1021 cgcggaccgg ggaaacggat aagtttacga acagaaatct caaaagacgc tgacccgata
     1081 agtaccgtca cggagacacg gtggtttttt attgaatttc cagtgtatcg agccaccgtg
     1141 atgcaggtac ggtggtttta tgtaaagtgc cgctatctat aggcgatgtg ttcctgacgg
     1201 tgtgtgtttt tttggggata gacaacgtgg ttcttgtacg tggtttttac cctgctcaat
     1261 aaagtcacgt tttccttaca ggtgttgtgt c
//
LOCUS       HSE1GB       4283 bp ds-DNA             VRL       15-DEC-1989
DEFINITION  Equine herpesvirus 1(EHV1) glycoprotein B (gB) gene and 3' end
            of an overlapping upstream gene with homology to the HSV1 ICP18.5
            gene.
ACCESSION   D00401
KEYWORDS    gB gene; glycoprotein; glycoprotein B.
SOURCE      Equine herpesvirus 1 (isolate HVS 25A) genomic DNA, clones
            pMAC[209, 221].
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 4283)
  AUTHORS   Whalley,J.M., Robertson,G.R., Scott,N.A., Hudson,G.C.,
            Bell,C.W. and Woodworth,L.M.
  TITLE     Identification and nucleotide sequence of a gene in equine
            herpesvirus 1 analogous to the herpes simplex virus gene encoding
            the major envelope glycoprotein gB
  JOURNAL   J. Gen. Virol. 70, 383-394 (1989)
  STANDARD  full staff_entry
COMMENT     Submitted in computer readable form by Whalley,J.M. on 19-Nov-1988.
            The EHV1 gB ORF appears to be overlapped at its5' end by 135 nt of
            the 3' end of an upstream ORF the potential translation product of
            which has approximately 50% identity with HSV gene ICP 18.5 and VZV
            gene 30 product.
FEATURES       from  to/span     description
    ORF         951     3893     equivalent to the gB glycoprotein gene of HSV.
    ORF     <     1     1089     analogous ORF to HSV1 ICP18.5.
    sigp        951     1205     signal peptide.
    signal      719      723     putative CAT box.
    signal      802      806     putative TATA box.
    signal     3902     3907     putative polyA signal.
    site        823      831     similar sequence to putative HSV1 mRNA start
                                 site.
BASE COUNT     1090 a   1168 c   1118 g    907 t
ORIGIN      map position aprox. 0.41-0.44 unit.
        1 ctgcagaggc tcacggaccc agacaccagc aacagagagg ccctcaagca gctgctgggt
       61 cgcatagggg tggataccga cgacggggcc ggcgagttgg gggacgcctt agacgtggat
      121 ttggataatc taggtggggc ccctcctgtc aacagcaccc cctgtggtga ggacgccctc
      181 tgtcgaaccg tttccgagga acgcccgtgg gacaaacttt tagagcgggc gactgcggat
      241 gcttcgcagc gcaggcgcat gtacgcggag cgtctgtcaa agcgttccat cgccagtttg
      301 gggcgctgcg tgcgcgaaca gcgaagagaa ctagaaaaaa ccctgagagt taacgtgtat
      361 ggcgaagtgc tgctacatac gtacgtatcg tcctacaacg ggttttgcgc caggcgcggg
      421 ttttgcgcgg cggtgagtcg agcgggtacc atcatagata accgctctag cacgtccgcg
      481 ttcgactcgc atcagttcat gaaggcggcg ctgcttcgcc accccattga ccagtcgctc
      541 atgccgtcca taacacacaa gtttttcgag ctgatcaacg ggcccgtgtt tgacaacgct
      601 ggccacaact ttgcgcagcc gccaaacacg gcattatatt acagcgttga aaacgttggg
      661 ttgttaccgc atctcaagga ggaactagct cggtttatga ttactgcggc taaaggtgat
      721 tggtcaatta gcgagtttca aaggttttat tgctttgagg gagtgacagg tgtgacggcc
      781 acgcagcggc tggcgtggaa atatatcggg gagctcatcc tagccgccgc agtattctcc
      841 tcggttttcc actgtggaga ggtgcgcctc ctgcgcgcag atcgtaccta cccggactcc
      901 agcggcgcac agcgctgcgt gagcggcatt tacataacct acgaggcgtc atgtcctctg
      961 gttgccgttc tgtcggcggc tccacatggg gcaattggcg cggagacggt ggtgatttac
     1021 gacagcgacg tgttctctct cctgtatgca gtgctccagc agctggctcc tggatcggga
     1081 gccaactagg caatgttgga aacttactcg ccacccccca cccgctggga aagccggcat
     1141 catcgagggt gggcacaata gttctagcct gtttgttgct ttttggaagc tgtgttgtta
     1201 gagccgtacc caccacgcca agccccccaa ctagtactcc cacttccatg tcaacgcact
     1261 cccatgggac agtagaccct acgctgctcc ccacagaaac gcccgaccca ctcagactgg
     1321 ctgtgcgcga gtccggtata ctcgctgagg atggagactt ttacacctgc ccaccgccta
     1381 ccggatccac cgtcgtacgc atcgaaccac ctagaacttg ccccaagttt gaccttggga
     1441 gaaacttcac ggaggggatt gctgttattt ttaaggaaaa catcgctccc tacaaattca
     1501 gggcaaacgt atactacaag gacatcgttg taacacgtgt gtggaaagga tacagccata
     1561 cgtccctgtc cgacagatac aatgacaggg ttccggtttc ggtggaggag atcttcggtc
     1621 tcatcgacag taagggaaaa tgttcgtcaa aggccgagta cctcagagat aacatcatgc
     1681 accacgcgta ccacgacgac gaggacgagg tggagcttga tttgtgccgt ccaagtttgc
     1741 aactccgggg ggccagagcc tggcagacca ccaacgatac tacgtcttac gtggggtgga
     1801 tgccatggag gcactacacg tcaacgtctg tcaactgcat cgtcgaggag gtggaggcgc
     1861 ggtccgtcta cccctacgac tccttcgccc tgtccaccgg tgatattgtg tacgcgtctc
     1921 cgttttacgg cctgagggct gccgctcgca tagagcacaa tagctacgcg caggagcgtt
     1981 tcaggcaagt tgaagggtac aggccccgcg acttagacag taaactacaa gccgaagagc
     2041 cggttaccaa aaattttatc actaccccgc atgtcaccgt cagctggaac tggaccgaga
     2101 agaaagtcga ggcgtgtacg ctgaccaaat ggaaagaggt cgacgaactc gtcagggacg
     2161 agttccgcgg gtcctacaga tttactattc gatccatctc gtctacgttt atcagtaaca
     2221 ctactcaatt taagttggaa agtgcccccc ttactgaatg tgtatccaaa gaagcaaagg
     2281 aagccataga ctcgatatac aaaaagcagt acgagtctac gcacgtcttt agcggtgatg
     2341 tggaatatta cctggcacgc ggggggttct taattgcatt cagacctatg ctctccaacg
     2401 aactcgccag gctgtacctg aacgagcttg tgagatctaa ccgcacctac gacctaaaaa
     2461 atctattgaa ccccaatgca aacaataaca ataacaccac gcgaagacgc aggtctctcc
     2521 tgtcagtacc agaacctcag ccaacccaag atggtgtgca tagagaacaa attctacatc
     2581 gcttgcacaa acgagcagtg gaggcaacgg caggtaccga ttcttccaac gtcaccgcca
     2641 aacagctgga gctcatcaaa accacgtcgt ctatcgagtt tgccatgcta cagtttgcat
     2701 acgatcacat ccaatcccac gtcaatgaaa tgctaagtag aatagcaact gcgtggtgta
     2761 ccctccaaaa caaagagcgg accctatgga acgaaatggt gaagattaac ccgagcgcca
     2821 tagtctccgc aacccttgac gagcgagttg cagcgagggt cctgggggac gtgatagcta
     2881 taacgcactg cgccaaaata gagggcaacg tgtacttgca aaactccatg cgctcgatgg
     2941 acagtaacac gtgctactcc cgcccccccg taacatttac aattactaag aatgcaaaca
     3001 acagagggtc gatagaaggc cagctgggag aggagaacga gattttcacg gagcgcaagc
     3061 tgatcgagcc gtgcgccctc aatcagaagc gctactttaa gtttggcaaa gagtacgttt
     3121 actacgagaa ctacacgttc gtccgcaaag tgccccccac ggaaatcgag gttatcagca
     3181 cgtacgttga actaaacttg acccttttgg aagaccgcga gtttctgccc ctggaggtgt
     3241 acacgcgggc tgagctggag gacaccggcc tgctagacta cagcgaaata cagcgccgca
     3301 accagctcca cgctctcagg ttttacgaca tcgacagcgt ggtcaacgtg gacaataccg
     3361 cagtgattat gcaggggatc gccagctttt tcaagggcct gggtaaagtg ggggaggccg
     3421 tgggaacgct cgttctcggc gccgccggcg ctgttgtttc aaccgtatct ggaatagctt
     3481 cgtttttaaa caacccattt ggggggctag ccatcggcct gctggtaatc gccggcctgg
     3541 tagctgcgtt ttttgcttac agatatgtaa tgcagatccg cagtaacccc atgaaagctc
     3601 tataccccat aacaacaaag gccttgaaaa acaaagccaa aacttcctac ggccagaacg
     3661 aggaggacga tgggagcgac tttgatgagg ccaagcttga agaggctcgc gaaatgatca
     3721 aatacatgtc tatggtttcg gccctggaaa agcaggaaaa gaaagctata aagaaaaaca
     3781 gtggggttgg cctgatcgcc agtaacgtct caaagctggc cctgcgaagg cgcggtccca
     3841 aatatacccg actccaacag aacgatacca tggaaaatga aaaaatggtt taaacatgtt
     3901 taataaatat tatgacacgt actcaaagtg tgacctcata tttgcataac cactttctag
     3961 ttccggcccc aaggatattt aagcctagta tctccgccga ggtttcatcc tcattcacca
     4021 actcacactt agagttgacg cttcctcttg cgcctttgct ctcgccgctc ctgtgttagc
     4081 gtatactgcc caagaaatgg attctccacg cggtatctcc acagctaccg gtgatgccca
     4141 cgccgaggcc gcggtttccc cagccgcgaa atccagataa aaacgaagcc cccgatgtag
     4201 acggaccaga agccactact gagtgtttag accacaccta cacccaacag acaagcgggg
     4261 gtgatggcct agatgctatc gat
//
LOCUS       HSEIEP       8174 bp ds-DNA             VRL       15-DEC-1989
DEFINITION  Equine herpesvirus type 1 immediate-early protein gene, complete
            cds.
ACCESSION   J04366
KEYWORDS    immediate-early protein; nonstructural protein; regulatory gene.
SOURCE      Equine herpesvirus type 1 (strain Kentucky A) DNA.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 8174)
  AUTHORS   Grundy,F.J., Baumann,R.P. and O'Callaghan,D.J.
  TITLE     DNA sequence and comparative analyses of the equine herpesvirus
            type 1 immediate early gene
  JOURNAL   Virology 172, 223-236 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by D.J.O'Callaghan, 13-JUN-1989.
FEATURES       from  to/span     description
    pept        988     5451     immediate-early protein
    signal      334      339     TATA box
    signal     5735     5740     polyA signal
BASE COUNT     1171 a   2937 c   2790 g   1276 t
ORIGIN      
        1 cccggggagg agacgcatgc agatgagatg tgcatcgagg tgtcatggcg tccaggggcg
       61 ttcaccttta tgcatatgag aggcgctatt cggcatcccg ttggcgcgac gcgcttccct
      121 gggaggagac atacgcaaat tagaaacgac acacgggttc taattggttg gagcgggggg
      181 gaggcgaaaa gcgcatgcaa atgcaaagcg cgggaccggg ccccataggc tagagccgct
      241 acacgcccac cgcccatcat caacggccaa tcacaatcga tagtgtgggc tggccactcc
      301 cactaggggg aaggcaaaac tccctcgtag tagtataaag cacctgttgc ttacccatcg
      361 tagcatcgcg gactagagag cctttcagct cactggacca gccagccttc gaggactatc
      421 gatcgcatct tggaaagctt acccgctctt ggcactcctt cttcggcttg cggaggtaag
      481 agctccccgg ggacacgacc ggcttcgatc tgcttcttct cccggggaga gcgttagaga
      541 acggggcgag tgccaaaaag gccatggaac ccctccaaca acgatgtccc gagggggtgg
      601 ctccgaggcc cgcttcgacc tagcggtcga agcgcggtgg ggatacttac ctcgaagccg
      661 gcgaaggcta taccttcccc gggcagaccc gggcggcttc tgcctcggcg gagctcggcg
      721 cggaagcctg gatatctgac ggggcgtggt taccacccaa gcgggggaga ggcccgggcc
      781 gcccgcgttc ccttttacca ttcggctccg ctccaactca acatcttttc cgcctctgct
      841 tttccagggt agagaagcgg cgcccgtcgt ccgagcgccc gccgcggaac cccgccaccg
      901 gccacccgcc aaccttccct tctcggtctt ccgagcgagc cttctcgtgc ggttggttct
      961 cgaccccgaa gccggagcta gcacgccatg gccagccagc gcagcgactt cgccccggac
     1021 ctctacgact tcatcgagag caacgacttc ggcgaggacc ccctcatccg cgcagccagc
     1081 gcggccgaag aggggttcac ccagcccgcc gcgcccgacc tgctgtacgg cagccagaac
     1141 atgttcgggg tggacgacgc tccgctctcc accccggtgg tggtcatccc tccgccgtct
     1201 ccggctcccg agccccgcgg agggaaggcg aagcggtcgc cctcggccgc cggcagcggc
     1261 ggtcctccta ccccggcggc tgccgcccag ccggcgtccc cggcacccag cccggctccg
     1321 gggctcgccg cgatgctgaa gatggtccac tcctccgtgg ccccggggaa cggtcgccgg
     1381 gccacgggct cctcatcacc cggcggtggg gacgcggccg acccggtcgc cctcgacagc
     1441 gataccgaga cctgcccggg gtccccgcag cccgagtttc catcctcggc ctccccgggc
     1501 ggagggtccc cggcaccccg ggtccggagc atctccatct catcgtcgtc ctcgtcctcg
     1561 tcctcgatgg acgaggacga ccaggcggat ggtgccgggg cgagtagctc ctcttcgtcg
     1621 tcctccgacg acagcgacag cgacgaaggc ggcgaggagg agacccctcg cccgcggcac
     1681 tcgcagaacg ccgcgaagac cccgtcggcc gccggctctc ccgggccgtc ctccggaggg
     1741 gatcgcccgg ccgctggggc cgccaccccg aagagctgcc gctccggcgc cgcttccccc
     1801 ggcgcacccg ctccggctcc agcttcggcg cccgctccca gccgcccggg aggaggcctc
     1861 ctccctccgg gggctcgcat tttagagtac ctggagggcg tccgcgaggc caatctggcc
     1921 aagacgctgg agaggcccga accgcccgcg gggatggctt ctccgccggg ccggagccct
     1981 caccggctcc ccaaggacca gcgtccgaaa tcggctctgg cgggagcgtc gaagcgcaag
     2041 cgggccaacc ccagacccag accccagacc cagacccagg caccggccga ggaggccccg
     2101 cagacggccg tgtgggactt gctggacatg aactcatccc aggctaccgg ggcggcggca
     2161 gcagcagcat cggccccggc ggcggcttcg tgcgccccgg gcgtctacca gcgcgagccg
     2221 cttctcaccc cgtccgggga cccctggccc gggtcggatc caccaccgat ggggagggtg
     2281 cgatacgggg ggaccgggga ctcgcgggac gggctgtggg acgaccccga gatagtcctg
     2341 gccgcctcgc gctacgccga ggcgcaggcc ccagtaccgg tcttcgtgcc ggagatgggg
     2401 gactccacca agcagtacaa cgctctggtc cgcatggtgt tcgagagccg cgaagccatg
     2461 tcctggctgc agaactctaa gctcagcggg caagaccaga acctggcgca gttctgccag
     2521 aagttcatcc acgctccgcg cggacacggg tccttcatca ccgggagcgt ggccaacccc
     2581 ctgccccaca tcggggacgc catggcggcc gggaacgcgc tctgggccct gccacacgcg
     2641 gccgcctcgg tggccatgag ccgccgctac gatcgcactc agaagagctt catcctccag
     2701 agcctccggc gcgcctacgc ggacatggcc tacccgagag acgaggcggg gaggccggac
     2761 tcactcgccg ccgtggccgg ctgcccggcc caggccgccg ctgccgcggc cagccagcaa
     2821 cagcccgagg ccccggcgcc ctcggtccgc gtccgcgaag cgtacacccg ggtctgcgcg
     2881 gccctcgggc cccgacgcaa ggctgccgcg gccgcggccg ctccggggac cagggcgccc
     2941 aggccgtccg ccttcagact cagggagctc ggggacgcct gcgtgctggc ctgccaggcc
     3001 gtcttcgagg ccctcctgcg cctccgcggc ggggcgtccg ccgtccccgg actggacccc
     3061 agcgagatcc cctctcccgc ctgccctccc gaggcgctgt gctccaaccc ggccgggctg
     3121 gagacggcgg ccctctccct ctacgaactc agggacctgg tcgagcgggc caggctcctc
     3181 ggggactctg accctaccca ccgcctgggc tccgacgagc tgcgcctcgc ggtgcgcgcc
     3241 gttctggtgg tggcccggac cgtggcgccg ctggtgcgct acaacgccga gggggcccgg
     3301 gcccgggcct cggcctggac cgtcacccag gccgtgttca gcatacccag cctggtcggg
     3361 gggatgttgg gggaggccgt gtccctgctg gccccaccga ctcggtccca gcagccctca
     3421 tcgtcctcgc ccggcggcga gcccttctcc ggctccgcgg ccgcggaggg gagccttcag
     3481 accctgccgc ccctgtggcc caccgtcccc gggaagcagt ccgcgacggt cccctcgtcc
     3541 cactcccagt ccccccagca ctcccagagc ggcggaggcg ccggggctac gaccgccacc
     3601 tgctgccggg ccacccagac aaacgcccgc tcccgggggc agcagcacca gccgcagaag
     3661 gcccgctccc ctcaggcggc cgcctccccg gcccacctca gccaggaggc gatgcccggc
     3721 tcctcctcgg acgaccgtgc catccacggg cgccccaggg gcaagagcgg caagcggcgc
     3781 tccgagcccc tggagccggc ggcccaggcc ggagcctcgg cctccttctc ctcgtccgcc
     3841 cgggggtacg atccctcggg gccggtcgac agccctccgg cccccaagcg cagggtggcc
     3901 accccgggcc accaggctcc ccgggccctg ggacccatgc cagccgaggg ccccgaccgt
     3961 cggggcggat tcaggcgcgt tccccgcgga gactgccaca ctccgcggcc cagcgacgcg
     4021 gcttgcgcgg cctactgtcc ccccgagctg gtggcggagc tcatcgacaa ccagctgttc
     4081 cccgaggcct ggcgcccggc gctcaccttc gatccccagg ccctggccac catcgcggcc
     4141 cgctgcagcg gccccccggc ccgggacggc gcgcgcttag gggagctggc ggccagcggc
     4201 ccgctgagac ggagggccgc ctggatgcac cagatccccg accccgagga cgtgaaggtg
     4261 gtggtcctct actccccgct ccaggacgag gacctgctgg gcggactccc ggcctcccgc
     4321 cccggcggct ctcggcgcga gcccctctgg tccgacctca aggggggact ctcggcgctg
     4381 ctggcggccc tggggaaccg catcctcacc aagcggtccc acgcctgggc cggcaactgg
     4441 accggggccc cggacgtctc ggccctcaac gcccaggggg tcctgctgct gtcgaccggg
     4501 gacctggcct tcaccggctg cgtcgagtac ctctgcctgc gcctgggctc cgccaggcgc
     4561 aagctcctgg tgctggacgc ggtctccacc gaggattggc cccaggacgg tcccgcgatc
     4621 agccagtacc acatctacat gcgggccgcc ctgactccgc gggtcgcctg cgccgtgcgc
     4681 tggcccgggg agcgccacct cagccgcgcg gtcctcacct ccagcaccct cttcgggccc
     4741 ggactgttcg cgagggccga ggccgcgttc gcgcgcctgt acccggactc tgcgcccctg
     4801 aggctgtgcc gctcctccaa cgtggcctac acggtggaca ctcgcgccgg cgagcgcacc
     4861 cgcgttcccc tggctccgag ggagtaccgc cagcgcgtcc tgcccgacta cgacggctgc
     4921 aaggacatgc gggcccaggc cgagggcctc gggttccacg acccggactt tgaggagggc
     4981 gccgcgcaga gccaccgcgc ggccaaccga tggggactcg gggcctggct gcgccccgtg
     5041 tacctcgcct gcggccggcg cggcgctggg gccgtggagc cctcggagct tctgatcccc
     5101 gagctgctga gcgagttctg ccgggtggcg ctgctggagc ccgacgccga ggccgagccc
     5161 ctggtgctgc ccatcaccga ggctccccgc cgccgagccc cgcgggtcga ctgggagccc
     5221 gggttcggct ctcgctccac ctcggtcctg cacatggggg ccacggagct gtgcctgccg
     5281 gagcccgacg acgagctcga gatcgacggg gccggcgatg tggagctggt ggttgagcac
     5341 cccggcccga gccccggcgt ggcccaggcc ctccgccgcg ctcccatcaa gatcgaggtg
     5401 gtgtcggacg acgaggacgg aggagactgg tgcaatccgt acctctcctg aacacgatgg
     5461 agcgcctccc tgcggccgaa aacaagaaaa atcagtacat ccacaactat gtgtccgccc
     5521 agcacaacgc agactccgcc tagactcccg cctccatccg ctgacgctga accccgcccc
     5581 gccctctgct gacgcgaaga caaggccctc cccggacgac atgtgaggaa cgaagggggc
     5641 gttgtatcta gcagcccacg ttccttattg ctcacatgtc tgcccaatcg gtgggcactt
     5701 ccaggctttc ccctatcgct gagtggttgt ttttaataaa gtttttttta aattttgatt
     5761 gaccgcgtgg tctttgttta ctgggcgggt tgatgggcgg gttgatgggc gggttgatgg
     5821 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg
     5881 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg
     5941 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg
     6001 gcgggttgat gggcgggttg atggttcctg ctcctcccct tcctgctcct ccccttcctg
     6061 ctcctcccct tcctgctcct ccccttcctg ctcctcccct tcctgctcct ccccttcctg
     6121 ctcctcccct tcctgctcct ccccttcctg ctcctcccct tcctgctcct ccccttcctg
     6181 ctcctcccct tcctgctcct ccccttccgc tacgtcacta ccgcctacgt cactaccgga
     6241 ctcctcccct tccgcttccg gccacgcccc ttccggtgag ccccagcata gcagtgagcc
     6301 ccagcatagc agtgacgtca ctttgacccc cccccttaga ccacgccccc ctattcaaat
     6361 gcggggggga gacgcgggct gggggggcca ggctctctct cgggcgcggg cccgtgaccc
     6421 ttgaccagat atggcccggg gccaggctct ctctcgggcg cgggcccgtg acccttgacc
     6481 agatatggcc cggggccagg ctctctctcg ggcgcgggcc cgtgaccctt gaccagatat
     6541 ggcccggggc caggctctct ctcgggcgcg ggcccgtgac ccttgaccag atatggcccg
     6601 gggccaggct ctctctcggg cgcgggcccg tgacccttga ccagatatgg cccggggcca
     6661 ggctctctct cgggcgcggg cccgtgaccc ttgaccagat atggcccggg gccaggctct
     6721 ctctcgggcg cgggcccgtg acccttgacc agatatggcc cggggccagg ctctctctcg
     6781 ggcgcgggcc cgtgaccctt gaccagatat ggcccggggc caggctctct ctcgggcgcg
     6841 ggcccgtgac ccttgaccag atatggcccg gggccaggct ctctctcggg cgcgggcccg
     6901 tgacccttga ccagatatgg cccggggcca ggctctctct cgggcgcggg cccgtgaccc
     6961 ttgaccagat atggcccggg gccaggctct ctctcgggcg cgggcccgtg acccttgacc
     7021 agatatggcc cggggccagg ctctctctcg ggcgcgggcc cgtgaccctt gaccagatat
     7081 ggcccggggc caggctctct ctcgggcgcg ggcccgtgac ccttgaccag atatggcccg
     7141 gggccaggct ctctctcggg cgcgggcccg tgacccttga ccagatatgg cccggggcca
     7201 ggctctctct cgggcgcggg cccgtgaccc ttgaccagat atggcccggg gccaggctct
     7261 ctctcgggcg cgggcccgtg acccttgacc agatatggcc cggggccagg ctctctctcg
     7321 ggcgcgggcc cgtgaccctt gaccagatat ggcccggggc caggctctct ctcgggcgcg
     7381 ggcccgtgac ccttgaccag atatggcccg gggccaggct ctctctcggg cgcgggcccg
     7441 tgacccttga ccagatatgg cccgggtaga gagagactgg gttcagaaga gccagagtgg
     7501 gtctgtaaag acaagggagt gggacgcggg tggtgggaag tggctcaaca ccgtggccgg
     7561 agatggttgg ggagggggaa aatgggggaa atatagtaaa ctagtttact actggtacta
     7621 ttccacggtt atagcatttc taagctggtc cgaggaggag agtagaaagg actcaatgtg
     7681 tccatttgtg tgatatatag tctgtgaccc ctagtaacac tactgccatt agtttctccc
     7741 cactatatgc tcagcttgtc tataccgcgc tcacactcag gaggttaggt gtgctaatag
     7801 gccaatcggg gggggggggg ggtgtggtgg taaatagcgg catcccccct agagcagata
     7861 aactggagtt taatagggct agggcagggg gctagggcag ggggctaggg cagggggcta
     7921 gggcaggggg ctagggcagg gggctagggc agggggctag ggcagggggc tagggcaggg
     7981 ggctagggca gggggctagg gcagggggct agggcagggg gctagggcag ggggctaggg
     8041 cagggggcta gggcaggggg ctagggcagg gggctagggc agggggctag ggcagggggc
     8101 tagggcaggg ggctagggca gggggctagg gcagggggct agggcagggg gctagggcag
     8161 ggggctaggg cagg
//
LOCUS       MLVCASBRE    3335 bp ds-DNA             VRL       15-MAR-1989
DEFINITION  Murine leukemia virus (Cas-Br-E MuLV), 3' end of proviral genome,
            encoding pol polyprotein, partial cds, and env polyprotein,
            complete cds.
ACCESSION   M14702
KEYWORDS    env gene; glycoprotein; pol gene; provirus.
SOURCE      Murine leukemia virus (isolate pBR-NE-8) proviral DNA, from mouse
            brain.
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 3335)
  AUTHORS   Rassart,E., Nelbach,L. and Jolicoeur,P.
  TITLE     Cas-Br-E murine leukemia virus: Sequencing of the paralytogenic
            region of its genome and derivation of specific probes to study its
            origin and the structure of its recombinant genomes in leukemic
            tissues
  JOURNAL   J. Virol. 60, 910-919 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      850     pol polyprotein
    pept        790     2775     env polyprotein
    matp       2185     2772     Prp15E glycoprotein
    LTR        2817     3335     3' long terminal repeat
    rpt        3192     3259     3' LTR R region
    rpt        2817     2828     inverted terminal repeat copy A
    rpt        3324     3335     inverted terminal repeat copy B
BASE COUNT      851 a    928 c    798 g    758 t
ORIGIN      84 bp upstream of HindIII site.
        1 gatcgatttc accgaggtaa aacctagatt gtatggctat aagtatcttt tagtttttgt
       61 agatactttc tctggctgga tagaagcttt cccaaccaag aaagaaaccg ccaaggtcgt
      121 gactaagaaa ctgctagaag agatcttccc taggttcggc atgccgcagg tattgggaac
      181 tgacaatggg cctgccttcg tctccaaggt gagtcagaca gtggccgatc tgttggggat
      241 tgattggaaa ttacattgtg catacagacc ccaaagctca ggtcaggtag aaagaatgaa
      301 taggaccatc aaggagactt taactaaatt aacgcttgca actggctcta gagactgggt
      361 cctcctactc cccttagccc tgtaccgagc ccgcaacacg ccgggccccc atggcctcac
      421 cccatatgag atcttatatg gggcaccccc gccccttgta aacttccctg accctgacat
      481 gaccagagtt actaacagcc cctctctcca agctcactta caggctctct acttagtcca
      541 gcacgaagtt tggagaccac tggcggcagc ttaccaagaa caactggacc ggccggtggt
      601 gcctcaccct taccgggtcg gcgacaccgt gtgggtccgc cgacatcaaa ccaagaacct
      661 agaacctcgc tggaaaggac cttacacagt cctgctgacc acccccaccg ctctcaaagt
      721 ggacggcatc tctgcgtggg tacacgccgc tcacgtaaag gcagcaacga cttctccggc
      781 cagaacagca tggaaggtcc agcgttctca aaatccccta aagataagac tatcgagaga
      841 gccttcctag gggttttggg gatcttattc gtgacaggag ggttagcgag cagagacaac
      901 ccccaccagg tatataatat aacttgggaa gtaacaaatg gagaacaaga cactgtgtgg
      961 gcagtaaccg gcaaccaccc cttgtggact tggtggccag acctcacacc agacctttgt
     1021 atgctggccc tacatggccc aactcattgg ggcctagaca accaccctcc atattcctct
     1081 cccccggggc ccccttgttg ttcaggagat gcaggggctg tgtcaggctg tgctagagac
     1141 tgtgatgagc ccttgacctc ttactccccc cggtgcaata cagcctggaa tagactgaaa
     1201 ctggcccggg taacacatgc acctaaagag ggattttata tctgccctgg gtcacatcgc
     1261 cccaggtggg ctcggtcgtg cgggggtcta gacgcctatt attgtgcctc ctgggggtgc
     1321 gaaactacag gccgagcagc ctggaaccca acttcatctt gggactatat cacagtaagc
     1381 aataatttaa cttcctcaca ggccaccaaa gcctgcaaaa ataatggctg gtgcaacccc
     1441 cttgtcatac gattcacggg tccaggaaaa agggccacct cctggactac aggtcatttc
     1501 tggggactgc gcctgtacat ctctggacat gacccagggc tcacttttgg gattcggcta
     1561 aaagtgacag atctgggacc tagagttcca atagggccaa atcctgtctt gtcagatcag
     1621 cgaccgccct cccggcctgt acctgccaga cctcccccac cttcagcctc accttccact
     1681 cccaccatac ctccacagca ggggaccggg gacaggttac ttaatctggt ccagggagcc
     1741 tacctcacac tcaatatgac tgatcccacc agaacccagg agtgttggtt atgcctagtc
     1801 tccgagcctc cgtattatga aggggtggcc gtgttgagag agtacactag tcatgagacg
     1861 gcacctgcta actgctcctc cggatcccaa cataagctga ccttatctga ggtaactgga
     1921 cagggaagat gtctaggaac ggttcccaaa actcaccagg ctctatgcaa ccgcaccgag
     1981 cccaccgtaa gtggttccaa ttacttggtg gctcccgaag gtaccctctg ggcatgcagc
     2041 accgggctca ctccctgtct gtctactact gtgctcaact taaccactga ttactgtgtc
     2101 ctagttgaac tctggccaaa ggtgacctac cactcccctg actatgtcta tactcagttt
     2161 gaaccagggg ccagattccg aagagagccg gtgtcgctga ccctcgccct gctaccagaa
     2221 ggtctcacca tgggtggaat tgccgcagga gtagggacag ggacaactgc cctggtcgcc
     2281 acccaacagt ttcaacaact tcaggctgct atgcacaacg acctcaagga agttgaaaaa
     2341 tcaattacta atctagaaaa gtctctgacc tcgctgtcag aagtggtttt gcagaaccgc
     2401 agaggcctag atctactatt tctaaaagag ggaggccttt gcgcggctct aaaagaagag
     2461 tgctgctttt atgcagacca cacaggatta gtgagagata gcatggccaa acttagagaa
     2521 agactaaacc agagacaaaa attgtttgaa tcaggacaag ggtggtttga aggactgttt
     2581 aataggtccc catggttcac aaccctgata tccactatta tgggccctct gatagtactt
     2641 ttattaatcc tacttttcgg accctgcatt ctcaatcgat tggtccaatt tgttaaagac
     2701 aggatctcag tggtccaggc tctggttttg actcagcaat atcaccagct aaaacccata
     2761 gagtacgagc cgtgaataaa ataaaagatt ttatttagtt tccagaaaaa ggggggaatg
     2821 aaagacccac catcaggttt agcaagctag cttaagtaac gccatttatt ttgcaaggcc
     2881 tggaaaaata ccgaactgag aatagggaag ttcggatcaa ggtcaggaac agaaaaacag
     2941 ctgaagttgg gccaaacagg atatctgtgg taagcagttt cggccccggc ccgaggccag
     3001 aacagatggt ccccagatat ggcccaatcc tcagcagttt ctagggaccc atcagatgtt
     3061 ttcaggctgc cccaaagacc tgaagtgacc ctgtgcctta tttgaactaa ccaatcagct
     3121 cgcttctcgc ttcggtttgc gcgcttctgc tccccgagct ctataaaaga gcacacaacc
     3181 cctcactcgg cgcgccagtc ctccgataga ctgagtcgcc cgggtacccg tgtatccaat
     3241 aaaccctctt gctgttgcat ccgactggtg gtctcgctgt tccttgggag ggtctcctca
     3301 gagtgattga ctacccgcct cgggggtctt tcatt
//
LOCUS       PPMCG        5089 bp ds-DNA             VRL       15-MAR-1990
DEFINITION  Monkey B-lymphotropic papovavirus complete genome.
ACCESSION   M30540
KEYWORDS    complete genome; large T-antigen; small T-antigen; viral protein.
SOURCE      Monkey C-lymphotropic papovavirus DNA, clone pL02, passed in human
            B-lymphoblastoid cell line BJA-B.
  ORGANISM  Monkey B-lymphotropic papovavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Papillomavirus.
REFERENCE   1  (bases 1 to 5089)
  AUTHORS   Furuno,A., Kanda,T. and Yoshiike,K.
  TITLE     Monkey B-lymphotropic papovavirus genome: The entire DNA sequence
            and variable regions
  JOURNAL   Jpn J Med Sci Biol 39, 151-161 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        152      721     small T-antigen
    pept        152      388     large T antigen, exon 1
                744     2600     large T antigen, exon 2
    pept       3770     2664 (c) VP-1
    pept       4362     3649 (c) VP-3
    pept       4719     3649 (c) VP-2
    rpt        4724     4783     repeat copy A
    rpt        4784     4843     repeat copy B
BASE COUNT     1516 a   1015 c   1054 g   1504 t
ORIGIN      695 bp upstream of HindIII site.
        1 cccctagcct cctcctcttc tttcaacaaa gagagaggct ttggaggctt ttccaaaaac
       61 tcattaggta agctgccctg agatattttc ccatataatt aagtattaag gccacctagg
      121 taattaaatt tattccattt tattcacagc catggaccaa acgctgtcta aggaggagag
      181 aaatgagctt atggatttat tgcaaataac tagagctgca tggggaaatc tttctatgat
      241 gaaaaaagcc tataaaaatg tctccaagct ctaccatcct gataaaggag gagattcagc
      301 taaaatgcag cggctcaatg aattatttca aagggtccag gttaccttga tggagataag
      361 gagtcaatgt ggatcctctt cttcccaggt agcttggttt ttttgggatg agaattttag
      421 aaccctagga gcttttctag gagaaaaatt taatgaaaaa attattggac tctaccctac
      481 ttgcactaaa tttgtaagag ctaattgtaa ttgtatagta tgtctgctaa aaaagcagca
      541 tgcaggtaca aaaaaaaatt taaaaaagcc atgtttagtc tggggagaat gttggtgcta
      601 caaatgttat ttagtatggt ttggctttcc tgaggatttc acctcttttc gctactggac
      661 ccttcttatg gcaaatatgg atttatctat gctcaagctt tggacggaac tgggattcta
      721 atgtaagtat ttttattttc tagggttact tcagtgagga cttctacttt gggcctacca
      781 cctttcaata tagccctatg gatcgagatg cagttcggga ggatcttcca aatccagggg
      841 aagggtcttg ggggaaatgg tggagagagt ttgttaatag gcaatgttgt gatgatttgt
      901 tttgctcaga aacaatgagt agttcaagtg atgaagacac ccccccagcg gcgcaacctc
      961 ctcctcctcc tgccccttcc ccagaagaag aggatgaaat agaatttgta gaagagaccc
     1021 caagttcctg tgatggatct tcttctcaaa gctcctacac ctgcaccccc cctaaaagga
     1081 agaaaactga agaaaagaag ccagatgatt ttcctgtatg tttatattcc tttttaagtc
     1141 atgcaattta tagtaataag actatgaata gttttttaat atatactact ttggagaaag
     1201 ccaggcaact gtataaaact gtggaaaaat ctaaaattgt agttgatttt aaggctagtt
     1261 tttcttatca ggatgaggaa ggggaggggt gtttgctgtt tttaattact ttaggaaaac
     1321 atagagtgtc tgctgttaag catttttgtg tatcccaatg tacttttagt tttattcatt
     1381 gtaaagctgt tgttaaacct ctagagttat ataagacctt aagtaaacca ccttttaagt
     1441 tgttggaaga gaacaaaccg ggtgtatcca tgtttgagtt ccaagaggag aaggaacagt
     1501 ctgttaattg gcaagaaata tgtaactttg caaatgaggc caacatttct gatgtcttat
     1561 tgttgcttgg catctacata gattttgcag tggaacctgg caaatgtggc aagtgtgaaa
     1621 aaaagcagca caaattccac tataattatc acaaagcaca tcatgccaat gcttgcctct
     1681 tcttggagag tagagcccaa aaaaacattt gccaacaagc agttgaccag gtcctagcag
     1741 ctaaaaggtt aaaattagta gaatgcagta gaattgaatt attagaagag agatttttgc
     1801 agctttttga tgaaatggat gacttcctgc atggtgagat agaaattcta agatggatgg
     1861 cgggtgtggc ctggtacacc attttactag ataattcttg ggatgttttt caaaatatcc
     1921 tacaattaat aactaccagc caacccaaaa aaaggaatgt cctgataaag ggaccaatta
     1981 acagtggtaa aactactttg gcttctgctt tcatgcattt ttttgatggc aaagctctaa
     2041 atataaattg tcctgcagat aaactgtcct ttgaacttgg ctgtgctatt gatcaattct
     2101 gtgttttgtt agatgatgtg aagggccaaa taaccttaaa taagcacttg caaccaggtc
     2161 aaggggtaaa taatcttgat aacctgagag atcatcttga tggaacaatt aaagttaatt
     2221 tagaaaagaa acatgtaaac aaaaggagtc aaatttttcc cccggttatt atgactatga
     2281 atgagtactt gttgcctcct accataggag ttagatttgc tcttcatctg catttaaaac
     2341 ctaaggctta tcttaaacaa agcctggaaa aaagtgacct ggtagccaaa agaatattaa
     2401 attcaggata tactattttg ctccttttgt tatggtacaa tcctgtggat tcttttactc
     2461 caaaagtgca agaaaaagtg gtgcaatgga aagaaaccct tgaaaaatat gtgtcaatta
     2521 ctcagtttgg taatattcag caaaatatca ttgatggaaa agaccccttg catggaattg
     2581 taattgaaga acaaatgtaa ataatgtaat catcattttc tgttttattt ctggtacaat
     2641 aaagtcttac aatgcattca gcctcacata tcatttgaga cagggagaac agtctggttc
     2701 tgacaaaatt tatcaacata tctattaagg tcagggtccc ctgggagtcc ttctgttccc
     2761 tcaaatattc tgacttcttc cacttgtcct gagacccctt ccattggttg tccctgaatt
     2821 tggggcataa gaccagagaa gaagctattt agaagagagc tgacaggata aggattttta
     2881 acaatccttt tcctgagggt cacattgaaa tatctgggaa gccccctcca actttgggtt
     2941 tcagaatagt tggtatgaac tccagcaata tcagcacaag acagaaacag tttgtcccct
     3001 ttacaaagag gcccaactcc attttcatcc agcagcacag ttgtgacaga attagtgaac
     3061 tgcataactg gtggggtggt ggctccccct gtaaaactcc cataatatct agtattttca
     3121 tttttagagg ggtcagggca ccacacctcc actgggtact ttccatcttt atccagcaag
     3181 gctttggcct ttggatctag gccttggttt cctggtttca tatttttaat agcaactaca
     3241 tcatcaggat aggtagctgt agagctagca actaggcctt ggagttccag gggctctcct
     3301 ccaacagcaa acatgtgata ggtagtgccc tgcacgggga cacaccctga ggatgaccca
     3361 tagatgtact ttcctccctg gtgcaaatta actagtgagg aaattccaac aacttcagtc
     3421 tttacagaca ctgcttccca catcaaaatg gtgtcacagg tcatgtcttc atttaggagg
     3481 gggagtttaa taacagctac tgaataacaa ggaagggtgc ctttgttggg ggtgtcagag
     3541 gccttactga aagcagtatt tatagaatta ctatatccat acaagtcctc agaaggaata
     3601 ttatttccca ttctaggatt aagataggcc tcaatttggg taatagcatc aggccctgtt
     3661 cttacttcta gcacctctac tcctcctttt actaggagcc tggggacggg agcgggaata
     3721 gggcatgttt ttttgcatgc tccgtcttgc ctttttcttt gaggggccat cttcttcttt
     3781 ctccaattta ttaagctcca cttcccaggt gggagttata tcaccatata aacctagaat
     3841 tagaggaagc atccagtctt gagttactct ttggtgtgcc cctcctggag cagtataatg
     3901 ttctacatac tgaccagacc taggttcatc ataacccagt tctcttcttg ccctttgacc
     3961 ttccctattt tcttgatatt caaaatcagc tctacttgga ggaggctctc ctctgtttct
     4021 atattcttgt cttagctgga tgggatttct agcaggtaga tacctgtaat aatcttggac
     4081 actggaataa atatggacag gcccactggt caaagcccac ctagcatttt cagcaatttg
     4141 agccaaggtg tgactcagtt cattggtact tctgacagcc acagcccttg tagcttgacc
     4201 aatttgcaaa gtggcctgtc tcatcaaatg cctccatact tctctgccca cagcatgaaa
     4261 caatgattca ccccagtcaa gtacagcatt caggtagtag ctaaaagagg taaatcccgg
     4321 gaacaaataa tcaacttgag gaaaccaagg cacaagagcc atattaacaa ctggtacttc
     4381 tttggagtat ccaaaagttg tcactcctgc agcaaccaca gcactggcac ctgaaacagt
     4441 ttgaaaaaaa actcctattc ctatggcatt gttgagagct gttgggatag cacttaggag
     4501 ggaaaactgc tctgttgtaa gtccagtaag agacaaggcc tctagagtac taagtccagc
     4561 aagatccact gcttctattt caatgagcca ggctgcctca gtacttacag cagcaaaagc
     4621 ctccccagta aggatagcat caactgtaaa tccagtactt aagcttaatt cagcagcaat
     4681 ttcagaaata ttaaacaaaa gagataatac accccccatt tcttaccaaa tggcgggcta
     4741 atttaaaaaa ggcgggcttc ttggcggcgc tgatgtaaat gagtaccaaa tggcgggcta
     4801 atttaaaaaa ggcgggcttc ttggcggcgc tgatgtaaat gagtaacttc ctctacttga
     4861 ggttgctaag taggttgcta agcgccacct agcaactaga ccgcagaaca gttgtttgtc
     4921 acttatcagg aaatgtcaca aaaagtcccc gggcggtgcg gtgagcgagt ctaaccacag
     4981 cttcctctat cagttgattc tgcaaaaaca acctgttatt gaagtctgca agtctgcaaa
     5041 atcactatgg caaccctagt tttttttacc tggtataaga ggccagggg
//
LOCUS       PVYAAA       9704 bp ss-RNA             VRL       15-MAR-1990
DEFINITION  Potato virus Y (N strain) genomic RNA, complete.
ACCESSION   D00441
KEYWORDS    38K protein; HC protein; NIa protein; NIb protein; Vpg protein;
            capsid protein; genome-linked protein; helper component protein;
            inclusion protein; polymerase; polyprotein; protease.
SOURCE      Potato virus Y (N strain), 5'end of genomic RNA and cDNA to genomic
            RNA.
  ORGANISM  Potato virus Y
            Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; 
            Potyvirus.
REFERENCE   1  (bases 1 to 9704)
  AUTHORS   Robaglia,C., Durand-Tardif,M., Tronchet,M., Boudazin,G.,
            Astier-Manifacier,S. and Casse-Delbart,F.
  TITLE     Nucleotide sequence of potato virus Y (N strain) Genomic RNA
  JOURNAL   J. Gen. Virol. 70, 935-947 (1989)
  STANDARD  full staff_entry
COMMENT     Most of the sequence was obtained from a shotgun cloning procedure.
            The 150 nucleotides at the 5'end were directly sequenced on the
            virul RNA.
FEATURES       from  to/span     description
    virion        1     9704     genomic RNA
    pept        185     9376     polyproteins
    matp        185     1009     putative extreme 5'protein
    matp       1010     2656     putative helper component protein, HC
    matp       2657     3655     putative 38K protein
    matp       3656     5557     cytoplasmic inclusion protein
    matp       5558     5713     putative genome linked protein, Vpg
    matp       5714     7009     putative nuclear inclusion protein NIa
    matp       7010     8572     putative nuclear inclusion protein NIb
    matp       8573     9376     putative capsid protein
BASE COUNT     3004 a   1818 c   2273 g   2609 t
ORIGIN      putative 5'end of RNA genome.
        1 aattaaaaca actcaataca acataagaaa aacaacgcaa aaacactcat aaacgctcat
       61 tctcactcaa gcaacttgct aagtttcagt ttaaatcatt tccttgcaat tctctagaac
      121 aatattggaa accatttcaa ctcaacaagc aatttcatca cttccaacca atttcagatc
      181 ctcaatggca acttacatgt caacaatctg ttttggttcg tttgaatgca agctaccata
      241 ctcaccagcc tcttgcgagc atattgtgaa ggaacgagaa gtgccggctt ccgttgatcc
      301 tttcgcagat ctggaaacac aacttagtgc acgattgctc aagcaaaaat atgctactgt
      361 tcgtgtgctc aaaaacggta cttttacgta ccgatacaag actgatgccc agataatgcg
      421 cattcagaag aaactggaga ggaaggatag ggaagaatat cacttccaaa tggccgctcc
      481 tagtattgtg tcaaaaatta ctatagctgg cggagatcct ccatcaaagt ctgagccaca
      541 agcaccaaga gggatcattc atacaactcc aaggatgcgt aaagtcaaga cacgccccat
      601 aataaagttg acagaaggcc agatgaatca cctcattaag cagataaaac agattatgtc
      661 ggagaaaaga gggtctgtcc acttaattag taagaaaacc actcatgttc aatataagaa
      721 gatacttggt gcatactccg cagcggttcg aactgcacat atgatgggtt tgcgacggag
      781 agtggactcc gatgtgatat gtggacagtt ggacttttgc aacgtctcgc tcggacggac
      841 aaatggttcc aatcaagtcc gcactatcaa catacgaagg ggtgatagtg gagtcatctt
      901 gaacacaaaa agcctcaaag gccactttgg tagaagttca ggaggcttgt tcatagtgcg
      961 tggatcacac gaagggaaat tgtatgatgc acgttctaga gttactcaga gtattttaaa
     1021 ctcaatgatc cagttttcga atgccgacaa tttttggaag ggtctggacg gtaattgggc
     1081 acgaatgaga tatccttcgg atcacacatg tgtagctggt ttacctgtcg aagattgtgg
     1141 tagggtagct gcattgatgg cacacagtat ccttccgtgc tataagataa cttgccccac
     1201 ctgtgctcaa cagtatgcca gcttgccagt tagcgatctg tttaagctat tgcataaaca
     1261 tgcaagagat ggtttgaatc gattgggagc ggataaagac cggtttatac atgttaataa
     1321 gttcttgata gcgttagagc atctaactga accggtggac ctgaatctcg agcttttcaa
     1381 tgagatattt aaatccatag gggagaaaca gcaagcaccg ttcaagaatt taaatgtctt
     1441 aaataatttc ttcctgaaag gaaaagaaaa tacagctcat gaatggcagg tagctcaatt
     1501 gagtttgctc gaattagcaa ggttccagaa gaacagaact gataacatca agaaaggtga
     1561 tatatctttc ttcagaaata aattatctgc caaggcaaac tggaatctgt atttgtcgtg
     1621 cgacaaccag ctggataaaa atgcaaactt cctctgggga caaagggagt atcatgctaa
     1681 gcggtttttc tcaaacttct ttgaggaaat tgatccagca aagggatact cagcatatga
     1741 aatccgcaag catccaagtg gaacaaggaa gctctcaatt ggtaacttag ttgtcccact
     1801 tgatttagct gagtttaggc agaagatgaa aggtgactat aggaaacaac caggggtcag
     1861 caaaaagtgc acgagttcga aagatggtaa ttatgtgtat ccctgttgtt gcacaacact
     1921 tgatgatggt tcagccattg aatcaacatt ctatccacca actaaaaagc accttgtaat
     1981 tggcaatagt ggtgaccaaa aatttgttga tttaccaaaa ggggattcgg agatgttata
     2041 cattgccaag cagggttatt gttatattaa cgtgtttctt gcaatgctga ttaacattag
     2101 cgaggaggat gcaaaggatt tcacaaagaa agttcgcgac atgtgtgtgc caaagcttgg
     2161 aacctggcca actatgatgg atttggcgac cacttgtgct caaatgagaa tattctatcc
     2221 tgacgtacat gatgcagaat tgcccagaat attggttgac catgacactc aaacgtgtca
     2281 tgtggttgac tcatttggct cgcagacaac tggatatcat attctaaaag catccagcgt
     2341 gtctcaactt atcttgtttg caaatgatga attagaatct gatataaaac attatagagt
     2401 tggtggtgtt cctaatgcta gccctgaact tgggtccaca atatcacctt tcagagaagg
     2461 aggagttata atgtctgagt cggcagcgct gaaactgctt ttgaagggaa tttttagacc
     2521 taaggtgatg agacagttgc tgttagatga gccttacctg ttgattctat caatactatc
     2581 ccctggcata ctgatggcta tgtataataa tgggattttt gaacttgcgg tgaggttgtg
     2641 gattaatgag aaacaatcca tagctatgat agcatcgcta ctatcagctt tagccctacg
     2701 agtgtcagcg gcagaaacac tcgtcgcaca gaggattata attgatgctg cagctacaga
     2761 cctccttgat gctacgtgtg atgggttcaa cctacatcta acgtacccca ctgcattgat
     2821 ggtgttgcaa gttgttaaga atagaaatga atgtgatgat accctattca aggcgggttt
     2881 tccaagttac aacacgagcg tcgtacagat tatggaaaaa aattatctaa atctcttgaa
     2941 cgatgcttgg aaagatttaa cttggcgaga aaattatccg caacatggta ctcatacaga
     3001 gcaaaacgct ctatccactc ggtacataaa acccacagaa aaggcagatt tgaaagggtt
     3061 atacaacata tcaccacaag cgttcttggg ccgaagcgcc caggtggtca aaggcactgc
     3121 ctcaggattg agcgagcgat ttaataatta tttcaatact aagtgtgtaa atatttcatc
     3181 ctttttcatt cgtagaatct ttaggcgttt gccaaccttt gtcacttttg ttaactcatt
     3241 attagttatt agtatgttaa ccagcgtagt ggcagtgtgt caggcaataa ttttagatca
     3301 gaggaagtat aggagagaaa tcgagttgat gcagatagag aagaatgaga ttgtctgcat
     3361 ggagctatat gcaagtttac agcgcaaact tgaacgcgat ttcacatggg atgagtacat
     3421 tgagtatttg aagtcagtaa accctcagat agttcagttt gctcaagcgc agatggaaga
     3481 atatgatgtg cgacaccagc gttccacacc agttgttaaa aatttggaac aagtggtagc
     3541 atttatggct ttagtcatca tggtgtttga tgctgaaagg agtgattgcg tgttcaaaac
     3601 tctcaataaa tttaagggtg tcctttcctc actggattat gaagttagac atcagtcctt
     3661 agacgatgtg atcaagaatt ttgatgagag gaatgagatt attgattttg aattgagtga
     3721 ggacacaatt cgaacttcat cagtgctaga tacaaagttt agtgattggt gggatcgaca
     3781 aatccagatg ggacatacac ttccacatta cagaactgag gggcacttca tggaatttac
     3841 aagagcaact gctgttcaag tggctaatga cattgcccat agcgaacacc tagacttttt
     3901 agtacgggga gctgttgggt ctggaaagtc aactgggttg cctgttcatc ttagtgtggc
     3961 cggatctgtg cttttaattg aaccaacgcg accactagcg gagaacgttt tcaaacagct
     4021 atctagtgaa ccattcttca agaagccaac actgcgtatg cgtggaaata gtatatttgg
     4081 ctcttctcca atctccgtca tgactagcgg atttgcgcta cactacttcg ccaataatcg
     4141 ctctcaatta gctcagttca actttgtaat atttgatgag tgtcatgttc tggatccttc
     4201 cgcgatggcg ttccgcagtc tgctgagtgt ttatcatcaa gcatgcaaag tattaaaagt
     4261 gtcagctact ccagtgggaa gagaggttga attcacaaca cagcagccag tcaagttaat
     4321 agtggaggac acagtgtctt tccaatcatt tgttgatgca caaggttcta aaactaatgc
     4381 tgatgttgtt cagtttggtt caaacgtact tgtgtacgtg tcgagctaca atgaagttga
     4441 caccttggcc aagctcctaa cagacaagaa tatgatggtc acaaaggttg atggcagaac
     4501 aatgaagcac ggttgcctag aaattgtcac aaaaggaacc agtgcgagac cacattttgt
     4561 tgtagcaacc aacataattg agaatggagt gactttggac atagacgtgg ttgtggactt
     4621 tgggttgaaa gtctcaccgt tcttggacat tgacaatagg agcattgctt acaataaggt
     4681 gagtgttagc tatggtgaga gaattcaaag gctgggtcgt gttggacgct tcaagaaagg
     4741 agtagcattg cgcattggac acactgaaaa gggaattatt gaaattccaa gcatggtcgc
     4801 tactgaggcg gctcttgctt gctttgcata taacttgcca gtgatgacag gaggcgtttc
     4861 aactagtctg attggcaatt gtactgtgcg ccaggttaaa acaatgcagc aatttgaatt
     4921 gagtcccttc tttatccaga atttcgttgc tcatgatgga tcaatgcatc ctgtcataca
     4981 tgacattctt aaaaagtata aacttcgaga ttgtatgacg cctttgtgcg atcagtctat
     5041 accatacagg gcatcgagta cttggttatc ggttagtgaa tatgagcgac ttggagtggc
     5101 cttagaaatt ccaaagcaag tcaaaattgc attccatatc aaagagatcc ctcctaagct
     5161 ccacgaaatg ctttgggaaa cggttgtcaa gtacaaagac gtttgcttat ttccaagcat
     5221 tcgagcatcg tccatcagca aaatcgcata cacattgcgt acagatctct tcgccatccc
     5281 aagaactcta atattggtgg agagattgct tgaagaggag cgagtgaagc agagccaatt
     5341 cagaagtctc atcgatgaag ggtgctcaag catgttttca attgttaact taaccaacac
     5401 tctcagagct agatatgcaa aagattacac cgcagagaac atacaaaaac ttgagaaggt
     5461 gagaagtcaa ctaaaagaat tctcaaattt ggatggttct gcatgtgagg agaatttaat
     5521 aaagaggtat gagtcgttgc agttcgttca tcaccaagct gcgacgtcac ttgcaaagga
     5581 tctcaagttg aaggggattt ggaacaagtc attagtggct aaagacttga tcatagcagg
     5641 cgctgttgca attggtggaa taggactcat atatagttgg ttcacacaat cagttgagac
     5701 tgtgtctcat caagggaaaa ataaatccaa aagaatccaa gccttgaagt ttcgccatgc
     5761 tcgtgacaaa agggctggct ttgaaattga caacaatgat gacacaatag aggaattctt
     5821 cggatctgca tacaggaaaa agggaaaagg taaaggtacc acagttggta tgggtaagtc
     5881 aagcaggagg ttcatcaaca tgtatgggtt tgatccaaca gagtactcat tcatccaatt
     5941 cgttgatcca ctcactgggc ggcaaataga agaaaatgtc tatgctgaca ttagagatat
     6001 tcaagagaga tttagtgaag tgcgaaagaa aatggttgag aatgatgaca ttgaaatgca
     6061 agccttgggt agtaacacga ccatacatgc atacttcagg aaagattggt gtgataaagc
     6121 tttgaagatt gatttaatgc cacataaccc actcaaagtt tgtgacaaaa caaatggcat
     6181 tgccaaattt cctgagagag agctcgaact aaggcagact gggccagctg tagaagtcga
     6241 tgtgaaggac ataccagcac aggaggtgga gcatgaagct aaatcgctca tgagaggctt
     6301 gagagacttc aacccaattg cccaaacagt ttgtaggctg aaagtatctg ttgaatatgg
     6361 ggcatcagag atgtacggtt ttggatttgg agcatacata gtagcgaacc accatttatt
     6421 taggagttac aatggttcca tggaggtgca atccatgcac ggtacattca gggtgaagaa
     6481 tctacacagt ttgagcgttc tgccaattaa aggtagggac atcatcctca tcaaaatgcc
     6541 gaaagatttc cctgtctttc cacagaaatt gcatttccga gctcctacac agaatgaaag
     6601 aatttgttta gttggaacca acttccaaga gaagtatgct tcgtcgatca tcacagaaac
     6661 aagcactact tacaatatac caggcagcac attctggaag cattggattg aaacagataa
     6721 tggacattgt ggactaccag tggtgagcac cgccgatgga tgtatagtcg gaattcacag
     6781 tctggcaaac aatgcacaca ccacgaacta ctactcagcc ttcgatgaag attttgaaag
     6841 caagtacctc cgaaccaatg agcacaatga atgggtcaag tcttgggttt ataatccaga
     6901 cacagtgttg tggggcccgt tgaaacttaa agacagcact cccaaagggt tattcaaaac
     6961 aacaaagctt gtgcaagatc taatcgatca tgatgtagtg gtggagcaag ctaagcattc
     7021 tgcatggatg tttgaagcct tgacaggaaa tttgcaagct gtcgcaacaa tgaagagcca
     7081 attagtaacc aagcatgtag ttaaaggaga gtgtcgacac ttcacagaat ttctgactgt
     7141 ggatgcagag gcagaggcag aggcattctt caggcctttg atggatgcgt atgggaaaag
     7201 cttgctaaat agagatgcgt acatcaagga cataatgaag tattcaaaac ctatagatgt
     7261 tggtgtcgtg gatcggatgc atttgaggaa gccatcaata gggttatcat ctacctgcaa
     7321 tgtgcacggc ttcaagaagt gtgcatatgt cactgatgag caagaaattt tcaaagcgct
     7381 caacatgaaa gctgcagtcg gagccagtta tgggtgcaaa aagaaagact attttgagca
     7441 tttcactgat gcagataagg aagaaatagt catgcaaagc tgtctgcgat tgtataaagg
     7501 tttgcttggc atttggaacg gatcattgaa ggcagagctc cggtgtaagg agaagatact
     7561 tgcaaataag acgaggacgt tcactgctgc acctctagac actttgctgg gtggtaaagt
     7621 gtgtgttgat gacttcaata atcaatttta ttcaaagaat attgaatgct gttggacagt
     7681 tgggatgact aagttttatg gtggttggga taaactgctt cggcgtttac ctgagaattg
     7741 ggtatactgt gatgctgatg gctcacagtt tgatagttca ctaactccat acctaatcaa
     7801 tgctgttctc accatcagaa gcacatacat ggaagactgg gatgtggggt tgcagatgct
     7861 gcgcaattta tacactgaga ttgtttacac accaatttca actccagatg gaacaattgt
     7921 caagaagttt agaggtaata atagtggtca accttctacc gttgtggata attctctcat
     7981 ggttgtcctt gctatgcatt acgctctcat taaggagtgc gttgagtttg aagaaatcga
     8041 cagcacgtgt gtattctttg ttaatggtga tgacttattg attgctgtga atccggagaa
     8101 agagagcatt ctcgatagaa tgtcacaaca tttctcagat cttggtttga actatgattt
     8161 ttcgtcgaga acaagaagga aggaggaatt gtggttcatg tcccatagag gcctgctaat
     8221 cgagggtatg tacgtgccaa agcttgaaga agagagaatt gtatccattc tgcaatggga
     8281 tagagctgat ctgccagagc acagattaga agcgatttgc gcagctatga tagagtcctg
     8341 gggttattct gaactaacac accaaatcag gagattctac tcatggttat tgcaacagca
     8401 accttttgca acaatagcgc aggaagggaa ggctccttat atagcaagca tggcactaag
     8461 gaaactgtat atggataggg ctgtggatga ggaagagcta agagccttca ctgaaatgat
     8521 ggtcgcatta gatgatgagt ttgagcttga ctcttatgaa gtacaccatc aagcaaatga
     8581 cacaattgat gcaggaggaa gcaacaagaa agatgcaaaa ccagagcagg gcagcatcca
     8641 gccaaacccg aacaaaggaa aggataagga tgttaatgca ggcacatctg ggacacatac
     8701 tgtgccgaga atcaaggcta tcacgtccaa aatgagaatg cccacaagca agggagcaac
     8761 cgtgccaaac ttagaacatt tgcttgagta tgctccacaa caaattgata tttcaaatac
     8821 tcgggcaact caatcacagt ttgatacgtg gtatgaggca gtgcggatgg catacgacat
     8881 aggagaaact gagatgccaa ctgtgatgaa tgggcttatg gtttggtgca ttgaaaatgg
     8941 aacctcgcca aatgtcaacg gagtttgggt tatgatggat gggaatgaac aagttgagta
     9001 cccgttgaaa ccaatcgttg agaatgcaaa accaaccctt aggcaaatca tggcacattt
     9061 ctcagatgtt gcagaagcgt atatagaaat gcgcaacaaa aaggaaccat atatgccacg
     9121 atatggttta attcgaaatc tgcgggatat gggtttagcg cgttatgcct ttgactttta
     9181 tgaggtcaca tcacgaacac cagtgagggc tagggaagcg cacattcaaa tgaaggccgc
     9241 agcattgaaa tcagcccaac ctcgactttt cgggttggac ggtggcatca gtacacaaga
     9301 ggagaacaca gagaggcaca ccaccgagga tgtctctcca agtatgcata ctctacttgg
     9361 agtcaagaac atgtgatgta gtgtctctcc ggacgatata taagtattta catatgcagt
     9421 aagtattttg gcttttcctg tactactttt atcataatta ataatcgttt gaatattact
     9481 ggcagatagg ggtggtatag cgattccgtc gttgttagtg accttagctg tcggttctgt
     9541 attattaagt cttagataaa aagtgccggg ttgttgttgt gtgactgatc tatcgattag
     9601 gtgatgctgt gattctgtca tagcagtgac tatgtctgga tttagttact tgggtgatgc
     9661 tgtgattctg tcatagcagt gactgtaaac ttcaatcagg agac
//
LOCUS       ROBTRFVP2    2687 bp ss-RNA             VRL       15-MAR-1990
DEFINITION  Bovine rotavirus mRNA for RNA binding protein VP2.
ACCESSION   X14057 X14507
KEYWORDS    RNA binding protein.
SOURCE      Bovine rotavirus.
  ORGANISM  Bovine rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 2687; enum. 1 to 2687)
  AUTHORS   Cohen,J.
  TITLE     ;
  JOURNAL   Unpublished (1989) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 2687; enum. 1 to 2687)
  AUTHORS   Kumar,A., Charpilienne,A. and Cohen,J.
  TITLE     Nucleotide sequence of the gene encoding for the RNA binding
            protein (VP2) of RF bovine rotavirus
  JOURNAL   Nucleic Acids Res. 17, 2126-2126 (1989)
  STANDARD  simple automatic
COMMENT     *source: strain=RF;
            
            Data kindly reviewed (21-APR-1989) by Cohen J.
            
            [1]  Author address Cohen J., INRA, Station de Virologie et
            d'Immunologie Moleculaires , INRA, CRJ, Domaine de Vilvert, 78350
            Jouy en Josas, France.
            
            Submitted (16-JAN-1989) on tape to the EMBL data library
FEATURES       from  to/span     description
    pept         17     2659     VP2 protein (AA 1-880)
    site       1622     1637     leucine zipper (AA 536-557)
    site       2009     2074     leucine zipper (AA 665-686)
BASE COUNT      997 a    425 c    520 g    745 t
ORIGIN      
        1 ggctattaaa ggttcaatgg cgtacaggaa acgtggagcg cgccgtgagg cgaatataaa
       61 taataatgac cgaatgcaag agaaagatga cgagaaacaa gatcaaaaca atagaatgca
      121 gttgtctgat aaagtacttt caaagaaaga ggaagtcgta accgacagtc aagaagaaat
      181 taaaattgct gatgaagtga agaaatcgac gaaagaagaa tctaaacaat tgcttgaagt
      241 tttgaaaaca aaagaagagc accaaaaaga gatacaatat gaaattttgc aaaaaacgat
      301 accaacattt gaaccaaaag agtcaatatt gaaaaaattg gaggatatca aaccggaaca
      361 agcgaagaag cagactaagc tatttagaat atttgaaccg agacagctac caatttatag
      421 agcgaatggt gaaaaagagt tgcgtaacag atggtattgg aagctgaaga aagatacttt
      481 accagatgga gattatgatg ttagagaata ctttctaaat ttgtatgatc aggttcttac
      541 tgaaatgcca gattatttac tattaaaaga tatggcagtt gaaaataaaa attcgagaga
      601 tgccggtaaa gttgttgatt ctgaaacagc aagtatctgt gatgctatat ttcaagatga
      661 ggaaacagaa ggtgcagtga gacgattcat tgcggagatg agacagcgcg tacaagctga
      721 cagaaacgtt gtcaattacc catcaatatt gcatccaata gattacgctt ttaatgagta
      781 ttttttgcaa caccaattag ttgaaccatt gaataatgat ataatattca attacattcc
      841 tgaaaggata aggaatgacg ttaactatat acttaatatg gacagaaatc tgccatcaac
      901 agctagatat ataagaccta atttactaca agacagactg aatttgcatg acaattttga
      961 atccttgtgg gatacaataa caacttcaaa ctatattctg gcaagatcgg tagtaccaga
     1021 tttaaaggaa ttagtttcaa ccgaagcgca aattcaaaaa atgtcacaag acttgcaact
     1081 agaagcatta acaatacagt cagaaacgca gtttttaaca ggtataaact cacaagcagc
     1141 aaatgactgt ttcaaaactc tgattgcagc aatgttaagt caacgaacca tgtcgcttga
     1201 tttcgtgact acaaattata tgtcattaat ttcaggcatg tggttactaa ctgtagtgcc
     1261 aaatgacatg ttcataaggg aatcattggt tgcatgtcaa ctggctatag tgaatacaat
     1321 aatatatcca gcgttcggaa tgcaacgaat gcattataga aacggagacc cacaaagacc
     1381 atttcagata gcagaacaac aaatacaaaa ttttcaagta gcgaattggc tgcattttgt
     1441 caataacaat caatttagac aagtagttat tgatggtgta ttgaatcagg tgctgaatga
     1501 caatattaga aatggacatg tcattaatca attgatggaa gctttaatgc aactatcacg
     1561 acaacagttt ccaacaatgc ctgttgatta taagaggtca atccagcgtg gaatattatt
     1621 gctatcaaat aggcttggtc aattagttga tttaactagg ttattagctt acaactacga
     1681 aacactaatg gcatgtgtta cgatgaatat gcaacatgtt cagactttga caacagaaaa
     1741 attacagtta acttcagtca catcgttgtg tatgcttatt ggaaatgcaa ccgttatacc
     1801 cagcccgcag acattgtttc actattataa tgttaatgtt aattttcatt caaattataa
     1861 tgaaagaatt aatgatgcag tggccataat aactggagct aatagactaa atttatatca
     1921 gaaaaagatg aaggcaatag ttgaagattt tttaaaaaga ttacatattt tcgatgtagc
     1981 tagagttcca gatgatcaaa tgtatagatt aagggataga ctacgactat tgccagtaga
     2041 agtaagacga ttggatattt ttaatttgat actgatgaac atggatcaga tagaacgcgc
     2101 atcagataaa attgcgcaag gtgttattat tgcgtaccgc gatatgcaat tggaaagaga
     2161 cgaaatgtat ggctacgtga atatagctag aaatttagat gggttccagc aaataaacct
     2221 agaagaattg atgagaacag gcgattatgc acaaataact aacatgctct tgaataatca
     2281 accagtagcg ctagttggag ctcttccatt tgttacagac tcgtcagtca tatcgttgat
     2341 agcgaacgtt gacgctacag tttttgccca aatagttaaa ttacggaaag ttgatacctt
     2401 gaaaccaata ttgtataaaa taaattcaga ttcgaatgac ttttacctag ttgccaacta
     2461 tgattgggtg cctacttcaa ccacaaaagt atataagcaa gttccacagc aatttgattt
     2521 cagaaattcg atgcatatgt taacatcaaa tcttactttc actgtttact ctgatctgct
     2581 tgcattcgta tcggccgata cagtagaacc tataaatgca gttgcatttg ataatatgcg
     2641 catcatgaac gagttgtaaa cgccaacccc actgtggaga tatgacc
//
LOCUS       VACH3K       4536 bp ds-DNA             VRL       15-DEC-1989
DEFINITION  Vaccinia virus, HindIII K fragment.
ACCESSION   D00382
KEYWORDS    nonessential gene; serine protease inhibitors.
SOURCE      Vaccinia virus HindIII K fragment originally from vaccinia virus
            strain WR, cloned in pBR322, was a gift from R. Wittek, transformed
            into Escherichia coli strain TG1.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 4536)
  AUTHORS   Boursnell,M.E.G., Foulds,I.J., Campbell,J.I. and Binns,M.M.
  TITLE     Non-essential genes in the vaccinia virus HindIII K fragment: a
            gene related to serine protease inhibitors and a gene related to
            the 37K vaccinia virus major envelope antigen
  JOURNAL   J. Gen. Virol. 69, 2995-3003 (1988)
  STANDARD  full staff_entry
COMMENT     One gene, predicted to encode a 42.2K protein, is highly related to
            the family of serine protease inhibitors.   It shows approximately
            25% identity to human antithrombin III and 19% identity to the
            cowpox virus 38K protein gene which is also related to serine
            protease inhibitors.  The product of another gene shows a similar
            high level of identity to the 37K vaccinia virus major envelope
            antigen. The existance of viable deletion mutants and recombinants
            containing foreign DNA inserted into both these genes indicates
            that they are non-essential.
FEATURES       from  to/span     description
    ORF          50  <     1 (c) ORF KO, amino end.
    ORF        1381      272 (c) ORF K1
    ORF        1697     1431 (c) ORF K2
    ORF        3023     1749 (c) ORF K3
    ORF        3185     3051 (c) ORF K4
    ORF        3604     3200 (c) ORF K5
    ORF        3818     3573 (c) ORF K6
    ORF        3957     4406     ORF K7
    ORF        4235     4041 (c) ORF8
BASE COUNT     1454 a    789 c    712 g   1581 t
ORIGIN      1 bp upstream of HindIII site
        1 aagcttttca gctgcttaga cttccaagta ttaattcgtg acagatccat gtctgaaacg
       61 agacgctaat tagtgtatat tttttcattt tttataattt tgtcatattg caccagaatt
      121 aataatatct ctaatagatc tgattagtag atacatggct atcgcaaaac aacatataca
      181 catttaataa aaataatatt tattaagaaa attcagattt cacgtaccca tcaatataaa
      241 taaaataatg attccttaca ccgtacccat attaaggaga ttccacctta cccataaaca
      301 atataaatcc agtaatatca tgtctgatga tgaacacaaa tggtgtatta aattccagtt
      361 tttcaggaga tgatctcgcc gtagctacca taatagtaga tgcctctgct acagttcctt
      421 gttcgtcgac atctatcttt gcattctgaa acattttata aatatataat gggtccctag
      481 tcatatgttt aaacgacgca ttatctggat taaacatact aggagccatc atttcggcta
      541 tcgacttaat atccctctta ttttcgatag aaaatttagg gagtttaaga ttgtacactt
      601 tattccctaa ttgaaacgac caatagtcta attttgcagc cgtaatagaa tctgtgaaat
      661 gggtcatatt atcacctatt gccaggtaca tactaatatt agcatcctta tacggaaggc
      721 gtaccatatc atattcttcg tcatcgattg tgattgtatt tccttgcaat ttagtaacta
      781 cgttcatcat gggaaccgtt ttcgtaccgt acttattagt aaaactagca ttgcgtgttt
      841 tagtgatatc aaacggatat tgccatatac ctttaaaata tatagtatta atgattgccc
      901 atagagtatt attgtcgagc atattagaat ctactacatt agacataccg gatctacgtt
      961 ctactataga attaatttta ttaaccgcat ctcgtctaaa gtttaatcta tataggccga
     1021 atctatgata ttgttgataa tacgacggtt taatgcacac agtattatct acgaaacttt
     1081 gataagttag atcagtgtac gtatatttag atgttttcag cttagctaat cctgatatta
     1141 attctgtaaa tgctggaccc agatctcttt ttctcaaatc catagtcttc aataattcta
     1201 ttctagtatt acctgatgca ggcaatagcg acataaacat agaaaacgaa taaccaaacg
     1261 gtgagaagac aatattatca tcttgaatat ttttatacgc tactataccg gcattggtaa
     1321 atccttgcag acgataggta gacactgaac acgttaacga tagtatcaat aacgcaatca
     1381 tgattttatg gtattaataa ttaaccttat ttttatgttc ggtataaaaa ttattgatgt
     1441 ctacacatcc ttttgtaatt gacatctata tatccttttg tataatcaac tctaatcact
     1501 ttaactttta cagttttccc taccagttta tccctatatt caacatatct atccatatgc
     1561 atcttaacac tctctgccaa gatagcttca aagtgaggat agtcaaaaag ataaatatat
     1621 agagcataat ccttctcgta tactctgccc tttattacat cacccgcatt gggcaacgaa
     1681 taacaaaatg caagcatctt gttaacgggc tcgtaaattg ggataaaaat tatgttttta
     1741 tatctatttt attcaagaga atattcagga atttcttttt ccggttgtat ctcatcgcag
     1801 tatatatcat ttgtacattg tttcatattt tttaatagtc tacacctttt agtaggacta
     1861 gtatcgtaca attcatagct gtattttgaa ttccaatcac gcataaaaat atcttccaat
     1921 tgttgacgaa gacctaatcc atcatccggt gtaatattaa tagatgctcc acatgtatcc
     1981 gtaaagtaat ttcctgtcca atttgaggta cctatatacg ccgttttatc ggttaccata
     2041 tatttggcat ggtttaccct agaatacgga atgggaggat cagcatctgg tacaataaat
     2101 agctttactt ctatatttat gtttttagat tttagcatag cgatagatct taaaaagttt
     2161 ctcatgataa acgaagatcg ttgccagcaa ctaatcaata gcttaactga cacttgtctg
     2221 tctatagcgg ctcttcttaa ttcatcttct atataaggcc aaaacaaaat attgcctgcc
     2281 ttcgaataaa taatagggat aaagttcata acagatacat aaacgaattt actcgcattt
     2341 ctgatacatg acaataaagc ggttaaatca ttggttcttt ccatagtaca tagttgttgc
     2401 ggtgcagaag caataaatac agagtgtgga acgccgctta cgttaatact aagaggatga
     2461 tctgtattat aatacgacgg ataaaagttt ttccaattat atggtagatt gttaactcca
     2521 agataccagt atacctcaaa aatttgagtg agatccgctg ccaagttcct attattgaag
     2581 atcgcaatac ccaattcttt gacctgagtt agtgatctcc aatccatgtt agcgcttcct
     2641 aaataaatat gtgtattatc agatatccaa aattttgtat gaagaactcc tcctaggata
     2701 tttgtaatat ctatgtatcg tacttcaact ccggccattt gtagtctttc aacatccttt
     2761 aatggtttgt tagatttatt gacggctact ctaactcgta ctcctctttt gggtaattgt
     2821 acaatcttgt ttaatattat cgtgccgaaa ttcgtaccca cttcatccga taaactccaa
     2881 taaaaagatg atatatctag tgtttttgtg gtattggata gaatttccct ccacatgtta
     2941 aatgtagaca aatatacttt atcaaattgc atacctatag gaatagtctc tgtaatcact
     3001 gcgattgtat tatccggatt cattttattt gttaaaagaa taatcctata tcacttcact
     3061 ctattaaaaa tccaagtttc tatttctttc atgactgatt ttttaacttc atccgtttcc
     3121 ttatgaagat gatgtttggc accttcataa atttttattt ctctattaca atttgcatgt
     3181 tgcatgaaat aatatgcacc taaaacatcg ctaatcttat tgtttgttcc ctggagtatg
     3241 agagtcgggg ggtgttaatc ttggaaatta tttttctaac cttgttggta gccttcaaga
     3301 cctgactagc aaatccagcc ttaatttttt catgattgat taatgggtcg tattggtatt
     3361 tataaacttt atccatatct ctagatactg attctggaca tagctttccg actggcgcat
     3421 ttagtgtgat ggttcccata agtttggcag ctagcagatt cagttttgaa acagcatctg
     3481 cattaactag aggagacatt agaatcattg ctgtaaacaa gtttggatta tcgtaagagg
     3541 ctagctccca tggaatgacc caataagtag atttaatagt taccacgtgc tgtaccaaag
     3601 tcatcaatca tcattttttc accattactt cttccatgtc caatatgatc atgtgagaat
     3661 actaaaattc ctaacgatga tatgttttca gctagttcgt cataacgtcc agaatgttta
     3721 ccagctccat gacttatgaa tactaatgcc ttaggatatg taataggttt ccaatatatg
     3781 taatcattgt ccagattgaa catacagttt gcactcatga ttcacgttat ataactatca
     3841 atattaacag ttcgtttgat gatcatatta tttttatgtt ttattgataa ttgtaaaaac
     3901 atacaattaa atcaatatag aggaaggaga cggctactgt cttttgtgag atagtcatgg
     3961 cgactaaatt agattatgag gatgctgttt tttactttgt ggatgatgat aaaatatgta
     4021 gtcgcgactc catcatcgat ctaatagatg aatatattac gtggagaaat catgttatag
     4081 tgtttaacaa agatattacc agttgtggaa gactgtacaa ggaattgatg aagttcgatg
     4141 atgtcgctat acggtactat ggtattgata aaattaatga gattgtcgaa gctatgagcg
     4201 aaggagacca ctacatcaat tttacaaaag tccatgatca ggaaagttta ttcgctacca
     4261 taggaatatg tgctaaaatc actgaacatt ggggatacaa aaagatttca gaatctagat
     4321 tccaatcatt gggaaacatt acagatctga tgaccgacga taatataaac atcttgatac
     4381 tttttctaga aaaaaaattg aattgatgat ataggggtct tcataacgca taattattac
     4441 gttagcattc tatatccgtg ttaaaaaaaa ttatcctatc atgtatttga gagttttata
     4501 tgtagcaaac atgatagctg tgatgccaat aagctt
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       FPLTRAC      3227 bp ds-DNA             BCT       05-FEB-1990
DEFINITION  Plasmid F TraC protein (traC) gene, complete cds and TrbI protein
            (traI) gene, partial cds.
ACCESSION   M30936
KEYWORDS    TraC protein; TrbI protein.
SOURCE      plasmid F DNA, clone PKAS2.
  ORGANISM  Plasmid F
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 3227)
  AUTHORS   Schandel,K.A., Maneewannakul,S., Vonder Haar,R.A.,
            Ippen-Ihler,K. and Webster,R.E.
  TITLE     Nucleotide sequence of the F plasmid gene traC and identification
            of its product
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept        268     2895     TraC protein (gtg start codon)
    pept       2892  >  3227     TrbI protein
    binding     256      261     Shine-Delgarno sequence
BASE COUNT      785 a    827 c    900 g    715 t
ORIGIN      
        1 gcatgcggaa atcctattcc ggaagcccgg cggaaaatat ttcccggtgt gacgttgtgc
       61 gttgaatgtc aggcgtatca ggaaagacag agaaaacatt atgcataagt cagtcgcaga
      121 acatagtgat ttaattccgg atgaacatga gtggatattc agaaaacaga aatctttgta
      181 tatgcgccgg gaaatggcgc gttaattaca ggtattccct tcatggctac cgcatttctc
      241 gctttatttt tcaactaagg aattcatgtg aataacccac ttgaggccgt cactcaggcg
      301 gttaactccc tcgtcacagc actgaaactg cctgacgaat ccgcaaaggc caatgaagtt
      361 ctgggcgaaa tgagcttccc gcagttcagc cgtctgctgc cgtaccgtga ttacaaccag
      421 gaatccggtc tgttcatgaa tgacaccacg atgggcttta tgctggaagc cattcccatc
      481 aatggggcga atgagtccat tgtggaggcc ctcgatcata tgctgcgcac caaactgccg
      541 cgcggtattc cgttgtgtat ccatctgatg tccagtcagc tggttggtga caggattgaa
      601 tacgggctgc gtgagttctc ctggtctggc gaacaggccg aacggtttaa cgcgattacc
      661 cgggcctatt atatgaaagc ggcagcgaca cagtttccgc tgccggaggg aatgaatctg
      721 cccctgaccc tgcgccatta ccgggtgttt atctcgtact gttctccctc gaagaaaaaa
      781 agccgggccg acattctgga aatggaaaac ctggtgaaaa tcatccgggc gtcgttacag
      841 ggggccagta tcaccacaca gacggtggat gcacaggcct ttatcgatat tgtcggggag
      901 atgattaacc ataacccgga ttccctgtac ccgaaaagac gtcagctgga cccgtattct
      961 gatctgaatt atcagtgtgt ggaggacagt tttgatctga aagttcgggc tgattacctg
     1021 acactgggcc tgcgtgagaa cggcaggaac agcacggccc gcatcctgaa tttccatctg
     1081 gcccgtaacc cggaaatcgc cttcctgtgg aacatggccg acaactacag caacctgctg
     1141 aacccggaac tgtccatctc ctgtccgttc atcctgacgc tgactctggt ggtggaagac
     1201 caggtgaaaa cccacagcga agccaacctg aagtacatgg acctggagaa aaagtcgaag
     1261 acctcctatg ccaaatggtt tccgtccgtg gagaaagagg cgaaggagtg gggggaactg
     1321 cgtcagcggc tgggctccgg tcagtcctct gtcgtgtcct acttcctcaa catcacagcc
     1381 ttctgcaagg acaataatga aacggcactg gaagtggagc aggacatcct gaacagcttc
     1441 cgtaaaaacg gttttgagct gatttcaccg cgctttaacc acatgcgcaa tttcctgacc
     1501 tgtctgccct ttatggccgg gaaagggttg tttaaacagc tgaaagaggc cggagtggta
     1561 cagcgcgcag agagctttaa tgtggccaac ctgatgccgt tagtggcgga taaccccctg
     1621 acaccggcag gtctgctggc acccacctac cgtaaccagc tggcgtttat cgatattttc
     1681 ttccggggga tgaataacac caactataac atggcggtct gtggcacctc cggggccggt
     1741 aaaaccgggc tgatacagcc acttatccgc agcgtgctgg actccggagg ctttgccgtg
     1801 gtgttcgaca tgggggatgg atacaagtcc ctgtgtgaga acatgggcgg ggtgtatctg
     1861 gacggtgaaa ccctgcgttt taacccgttt gcgaacatca ccgatattga ccagtcagcg
     1921 gagcgtgtcc gtgaccagtt gtcagtgatg gccagcccca acggtaacct ggatgaagtg
     1981 catgaaggtc tgctgttgca ggcggtcaga gcctcctggc tggccaaaga gaacagagca
     2041 cgtattgatg acgtggtgga tttcctgaaa aacgccagtg acagcgagca gtatgccgag
     2101 tcaccgacta tccgcagccg tctggacgaa atgattgtgc tgcttgacca gtacactgcc
     2161 aacggcactt acggccagta ttttaactct gatgagccgt ccctgcggga tgacgcaaaa
     2221 atggtggtgc tggagctggg cggactggaa gaccgtccgt cactgctggt tgcggtgatg
     2281 ttctccctga ttatctacat cgagaacagg atgtaccgca cgccgcgtaa cctcaagaaa
     2341 ctgaacgtta ttgatgaagg ctggcgtctg ctggacttca aaaatcacaa ggtcggtgaa
     2401 tttattgaga aaggctaccg tacggcccgc cgtcataccg gtgcctatat caccatcaca
     2461 cagaacatcg tcgactttga ctctgataag gcctccagtg ctgcccgcgc ggcatggggt
     2521 aactcctcct acaaaattat cctcaaacag agtgcgaagg agttcgcgaa atacaaccaa
     2581 ctgtatccgg accagttcct gccactgcag cgcgacatga ttggtaagtt tggtgcggcc
     2641 aaagaccagt ggttcagttc cttcctgctg caggtggaaa accattcctc ctggcaccgt
     2701 ctgtttgtgg acccgttaag ccgcgccatg tacagctctg acggcccgga ttttgagttt
     2761 gtgcagcaga agcgtaaaga ggggctgagt attcatgagg cagtgtggca gctggcgtgg
     2821 aagaagtcag ggccggaaat ggcttcgctg gaagcctggc tggaagaaca tgagaaatac
     2881 aggagtgtgg catgagttca acgcagaaac ccgccgacgt cacggcagaa cgacgcagcc
     2941 actggtggtg gacggtgccg ggatgcctgg caatggtgtt actgaacgca gccgtcagtt
     3001 acggcattgt cagactgaac gcaccggtga ctgtcgcctt caacatgaaa cagaccgtgg
     3061 atgcgttttt tgacagcgcc agtcagaaac aactgtcaga agcgcagtca aaagcccttt
     3121 cagcgcgttt taacacggca ctggaagcca gcctgcaggc gtggcagcag aaacaccacg
     3181 cggtcattct ggtgtcgcct gctgtggtac agggggcacc ggatatc
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       DROLARM      7606 bp ss-mRNA            INV       19-OCT-1989
DEFINITION  D.melanogaster protein tyrosine phosphatase (DLAR) mRNA, complete
            cds.
ACCESSION   M27700
KEYWORDS    protein tyrosine phosphatase.
SOURCE      Drosophila melanogaster (strain Oregon R) 4-12 hour embryo, cDNA to
            mRNA, clones DLAR[23,55].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (sites)
  AUTHORS   Streuli,M., Krueger,N.X., Tsai,A.Y.M. and Saito,H.
  TITLE     A family of receptor-linked protein tyrosine phosphatases in humans
            and Drosophila
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8698-8702 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 7606; for [1])
  AUTHORS   Saito,H.
  JOURNAL   Unpublished (1989) Dana-Farber Cancer Inst,44 Binney St.,Boston MA
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Saito, 07-SEP-1989.
FEATURES       from  to/span     description
    pept        118     6207     protein tyrosine phosphatase (DLAR) precursor
    sigp        118      213     protein tyrosine phosphatase (DLAR) signal
                                 peptide
    matp        214     6204     protein tyrosine phosphatase (DLAR)
BASE COUNT     1999 a   1973 c   1926 g   1708 t
ORIGIN      
        1 aaagagaata aaaagagcat ttaaagttat tagtaaccat cgttgttgtt gagttgttgt
       61 tgttgttacc aactgtcgac atacccttgc aactgccggc gaaaacatag cgaaataatg
      121 ggtctgcaga tgacagcagc ccgtccaatc gcagctctca gcctactagt gttgtccttg
      181 ctcacctgga ctcaccccac aatcgtggat gcagcccatc cgccggagat catcaggaag
      241 ccgcagaatc agggagtccg agtgggcggc gttgctagct tctattgtgc ggcccgcggt
      301 gatccgcctc catcgatagt gtggcgcaaa aatggcaaaa aagtttcggg aacccagtcg
      361 cgttacacgg tgctggagca gcccggcggg atttccatac tccggattga gcccgtgcgg
      421 gcgggacgcg atgatgcacc atacgagtgt gtggcggaga acggggtggg cgatgccgtt
      481 tccgcagatg caactttaac catatatgaa ggcgataaaa cacccgcagg ctttccggtt
      541 ataacccagg gtccgggaac tcgcgtcatt gaagtgggtc acacggtcct catgacatgc
      601 aaagccatcg gcaatccgac gccaaacatt tactggatta agaatcagac aaaggttgat
      661 atgagcaatc cgcgctactc tctcaaggat ggcttcctgc aaatcgaaaa cagtcgcgag
      721 gaggatcagg gcaaatacga gtgtgtggct gagaactcaa tgggcacgga gcactcgaag
      781 gccaccaact tgtatgtgaa agtccgtcgt gttccgccca ccttttcccg cccaccagag
      841 accatcagcg aggtgatgtt gggatctaat ctgaatctat cctgcattgc cgtcggctca
      901 cccatgccgc atgtcaagtg gatgaagggc tctgaagatc ttacacccga gaatgagatg
      961 ccaatcggac gaaatgtcct gcagctgatc aatatccagg agagcgccaa ctacacttgc
     1021 atagcggcct ccactttggg ccaaatcgat tccgtttcgg tggttaaagt gcaatctctg
     1081 cccaccgcac ccaccgatgt gcaaatctcc gaggtgaccg ccacttcggt gcgtctggag
     1141 tggtcgtaca agggtcccga ggacttgcaa tattacgtga tccagtacaa gccgaagaac
     1201 gccaaccagg ccttcagcga gataagcggc atcatcacca tgtactatgt ggtccgtgcc
     1261 ctgagtccct acacggagta cgagttctac gtgatagccg tgaacaatat tggacgcgga
     1321 ccgccctcgg caccagcgac atgtaccacc ggtgagacaa aaatggaaag tgcaccacgt
     1381 aatgtccaag tgcgcacgct gagctcgtcc acgatggtta ttacttggga accaccagag
     1441 acgcccaatg gacaagtgac cggctacaag gtgtactaca cgaccaattc gaatcagccg
     1501 gaggcgtcgt ggaactccca gatggtcgac aatagcgaac tgaccacagt ctcggacgtg
     1561 acgccccacg ccatctacac ggtccgggtt caggcctaca catcgatggg agccggtcca
     1621 atgtccacgc cggtccaggt gaaggcccag caaggtgtgc catcgcaacc gagcaatttc
     1681 cgggcaaccg atatcggcga gaccgcagtc acactgcaat ggaccaagcc gacgcattcc
     1741 agcgagaata tcgtgcacta cgagctctac tggaatgaca catacgccaa tcaggcccat
     1801 cacaagcgca tttccaactc ggaggcgtat accctggacg gactctaccc cgataccctc
     1861 tactacatct ggctggctgc aaggtcgcaa cgtggcgagg gggccaccac cccgcccatt
     1921 ccggtgcgca ccaagcaata tgtaccaggt gctccgcctc gaaatatcac cgccatagcc
     1981 accagctcga cgaccatatc cctcagctgg ctgcctccgc ccgtcgagcg atcgaacggc
     2041 cggatcatat actataaggt gttcttcgtg gaggtgggtc gcgaagacga cgaggccacc
     2101 accatgaccc tcaatatgac cagcattgtg ctggacgagc tgaagcgctg gacagagtac
     2161 aagatctggg tgctggccgg cacctccgtc ggggatgggc cgcggtcgca tcccatcatt
     2221 ttgcgcaccc aagaggatgt gcccggcgat ccgcaagatg tgaaggccac gcctttgaac
     2281 tccacttcga tccatgtcag ctggaagccg cctctcgaaa aggatcgcaa tggcatcatc
     2341 cgtgggtatc atatacacgc ccaggagctg cgagatgagg gcaagggctt tttgaacgaa
     2401 cccttcaagt ttgatgtggt agacacgctg gagttcaatg tgactggctt gcagccggat
     2461 acaaagtact ccattcaggt ggcggcacta actcgtaaag gagatggtga ccggagtgct
     2521 gcgattgtgg tgaaaactcc tggcggagta ccagtccgac caacggtgag tctgaagatc
     2581 atggagcggg aaccgatcgt gtccatcgaa ctcgaatggg agcgaccggc gcagacctat
     2641 ggcgaattgc gtggctatcg acttcgatgg ggcgtcaagg accaagcact gaaggaggag
     2701 atgctgtcag gaccgcagat gaccaagaaa cggtttgata acttggaacg cggagttgag
     2761 tacgaatttc gtgtggcggg cagcaatcat attggtatcg gacaagagac ggtgaaaata
     2821 tttcagacac ccgagggaac acccggtgga ccgccttcta acattaccat tcgcttccaa
     2881 actccggatg tactgtgcgt gacctgggat ccaccaacta gggagcaccg gaatggcata
     2941 atcacccgct atgatgttca gtttcacaag aaaatcgatc atggcctggg atccgagcga
     3001 aatatgactc tccggaaggc ggtgttcaca aatctggagg agaacaccga gtatatcttc
     3061 cgggtgaggg cttatacgaa gcagggagct ggtcccttca gcgacaagtt aatcgtggag
     3121 acagaacgtg acatgggtcg agcacctatg tccctgcagg cagaggcaac atcggagcaa
     3181 actgcggaga tctggtggga accggtaaca agtcgtggca agttgctggg ctacaagatc
     3241 ttttacacca tgacagctgt cgaggatctg gacgattggc aaacgaaaac cgttggactt
     3301 acggaatccg ctgatcttgt taatctcgag aagtttgccc aatatgccgt ggccattgcg
     3361 gcgaggttca agaacggatt gggacgtctt agtgaaaagg ttacagtacg catcaagccg
     3421 gaggatgtgc ccttaaatct tcgcgctcac gatgtcagca cccattcgat gaccttgagt
     3481 tggtcgccac ccattcgcct aaccccggtc aactacaaga tcagcttcga tgccatgaag
     3541 gtgtttgtgg actcgcaggg attctcccag acccagatcg ttcccaagcg agagattatc
     3601 cttaagcact atgtgaagac ccacactatc aacgaactca gtccgtttac cacgtacaat
     3661 gtgaatgtga gtgccattcc ctcggattat tcctaccggc cgcccacaaa gattacggtc
     3721 acaacgcaaa tggctgcacc tcagccaatg gtgaagccgg atttctacgg cgttgttaat
     3781 ggcgaggaaa ttctggtgat actgcctcag gcttcggagg aatatggacc catatcgcac
     3841 tattatttgg tggtggtccc ggaggacaag tccaatctgc acaagatacc cgatcagttc
     3901 cttaccgatg atctcttgcc gggcaggaac aagccagagc gtccgaatgc accgtacatt
     3961 gcagccaagt tcccgcagcg ttccattccg ttcacattcc acctgggatc tggcgatgat
     4021 tatcataact ttacaaatcg caaattggag cgagagaagc gctaccgcat ctttgtgcga
     4081 gcggtggtgg atacgccaca gaagcacctc tacacctcca gtcccttctc tgagttccta
     4141 tcgctggaca tgagggaagc tccgccaggt gagcggcccc accgacccga tcccaattgg
     4201 cccgcggagc cggaagtgtc ggtgaaccgc aacaaggacg aaccggagat tctgtgggtg
     4261 gtgctgcccc tgatggtgtc cacattcatt gtgtccaccg ccctgatcgt tctctgtgtg
     4321 gtaaagcgtc gtcgccagcc gtgcaagact ccggatcagg cagctgtcac aaggccactg
     4381 atggccgccg acctgggagc cggacctacg cccagcgatc cagtggacat gaggcgcttg
     4441 aacttccaga cacccggcat gatctcccat ccgcccatac cgatatccga gtttgccaac
     4501 cacatcgaac gactcaagtc caatgacaat cagaagtttt cgcaggaata cgaaagcatt
     4561 gagccgggcc aacagttcac ctgggacaac tccaatctgg agcataacaa gtctaagaat
     4621 cgctatgcaa atgttaccgc ctacgatcat tcacgcgtcc agttgccagc ggtggagggt
     4681 gtggttggat cagattacat caatgccaat tactgtgacg gctatcggaa gcacaatgcc
     4741 tacgtggcga cccaaggtcc gttgcaggag acctttgtgg acttctggcg catgtgttgg
     4801 gaactgaaga cggccaccat tgtgatgatg acgcgattgg aggaacgaac gcgcataaag
     4861 tgcgatcagt attggcccac tcgcggaacg gaaacctatg gtcagatctt tgtgaccatc
     4921 acggagacac aggaactggc cacctacagc atccgcacct tccagttgtg ccggcagggc
     4981 tttaacgatc ggcgtgagat caagcagctg cagttcacag cctggccaga tcatggagtg
     5041 cccgatcatc cggctccctt ccttcagttc ttgcgccggt gtcgcgccct cacgccaccg
     5101 gaatccggac ccgtgattgt tcactgctct gcgggagttg gtcgcactgg ctgttatatc
     5161 gtaatcgatt caatgttgga acgaatgaaa cacgagaaga ttatcgacat ctatgggcat
     5221 gttacttgtt tacgggcgca acggaactac atggtgcaga cggaggatca gtacatcttc
     5281 attcatgacg ccatcctgga ggccatcatc tgtggggtga cggaggtgcc ggctcgcaat
     5341 ctacacaccc acctacagaa actattgatc acggagcccg gcgagaccat ctcgggcatg
     5401 gaggtggagt tcaagaagct gtctaacgtc aagatggact cgtccaagtt cgtaacggcc
     5461 aatctgccgt gcaacaagca caagaatcgc ctggtccaca ttctgccgta cgagtcaagt
     5521 cgcgtctacc tgacccccat ccatggaatc gagggaagcg actatgtcaa cgccagcttc
     5581 atcgacggct atcgttaccg ttccgcatac atcgccgcac agggtcctgt ccaggatgcc
     5641 gctgaggact tttggcgcat gctctgggag cacaactcca ccattgtggt catgctgacc
     5701 aagctcaagg aaatgggaag ggaaaagtgc ttccagtact ggccccatga gcgatccgta
     5761 cgctatcagt attatgtcgt ggatcccatt gctgagtaca acatgccgca gtataagctg
     5821 cgtgaattta aggtcacgga tgcccgagat ggctcatcgc gcaccgtccg ccagttccag
     5881 ttcatcgatt ggccggagca gggtgtgccc aagtcgggcg agggcttcat cgacttcatc
     5941 ggacaggtgc acaagaccaa ggagcagttt ggccaggatg gacccattac cgtgcactgt
     6001 tcggcgggcg tgggacgttc gggtgtcttt atcactctga gcatcgttct ggaacgaatg
     6061 cagtacgagg gagtactgga cgtcttccag acagtgcgca tactgcgatc ccagcgtccg
     6121 gctatggtac aaaccgagga tcaataccac ttctgctatc gcgctgcact ggagtacttg
     6181 ggctcattcg acaattatac aaactgagtg catttctcaa tgggattgcc aggtccacag
     6241 agctaaaatc tcatcgaccc tgcgttcgtt gcatacttat ttcggacttt acgcataata
     6301 cgcatttacg atatacacgg atatcataac tctaaagtat tatagccagc tactgcccac
     6361 acttacttaa tacacctata cttatatacg tggatatgtt tagttgataa gcgcaccccc
     6421 gcccgcaggt tatgttacca ttacgattct ttgggggact cgatcttata tattgcgttc
     6481 tattttattg ataatgtaaa ctaaagaaga ctgttacacg tttaattatc aaaaccttat
     6541 atatcaaccc acatgcactt ctatgtacga gtatgtgcat caagatttgc tcgctaatgg
     6601 agaaggatgt caatcaactt gttgttgttc gtggtactgc cgccggagtg accatgaatg
     6661 aggggcgcca ctggaaaccg atacatattt gaactaagga cttggtagct attagtctta
     6721 ccgcctaaat cgaatctcga acggaagcaa agggtcgaaa gtagaagcca ttttaagtaa
     6781 atcgaaaact tctggcgatt tggcgcaaac tccagacgct attataatat ataatacaat
     6841 caatcaaaca ctgctatgtt aatcagtttt catgccttgg ttaatggaaa ccagtataaa
     6901 cggtccgtgc aattagtcaa aagtcttgca tagcgcatat tttataaggc agctagattc
     6961 taaaattagt tttcaaacac aacttacgtt ttgatctccc cgcaaaagat gacgaacctt
     7021 atggactgat cgatgtcttg gtagctaaca gacagttcaa atttcaaaga ccattatgat
     7081 agatcacata cgcaacttcc agccaagcag ctattttaac ttcaacctca ttccctcgac
     7141 ttctattcag ttccctagta cattcctgta tataccacag tcgatacccg agctagccac
     7201 ttatgtatgt gtaatttaga ttgaaaagct atgaaaagtt attgtaccta aggtgtatga
     7261 agacatagag aataaccgag cgaacagatt agaatgataa ctgtagatat gtactgtagc
     7321 atataatttt taccataaaa tagagcgaca cacgcggtta taaataaaca taaacataaa
     7381 atcgagacat tggcaatagt tcttaaggta cataataatg tatataactg ataagctgtt
     7441 ttgcaaaacg tttgaattcg gcctcgtaac tatacataaa tatataaaga gttagcaagc
     7501 aaatgcttaa aagcaaacac taaattatta gctgaaacac atgttaaaca accatagtca
     7561 tatgtaaact aaatatttat acgaaataaa ggtatttgtt ataact
//
LOCUS       DROPTPM      4477 bp ss-mRNA            INV       19-OCT-1989
DEFINITION  D.melanogaster protein tyrosine phosphatase (DPTP) mRNA, complete
            cds.
ACCESSION   M27699
KEYWORDS    protein tyrosine phosphatase.
SOURCE      Drosophila melanogaster (strain Oregon R) 4-12 hour embryo, cDNA to
            mRNA, clone DPTP12.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (sites)
  AUTHORS   Streuli,M., Krueger,N.X., Tsai,A.Y.M. and Saito,H.
  TITLE     A family of receptor-linked protein tyrosine phosphatases in humans
            and Drosophila
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8698-8702 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 4477; for [1])
  AUTHORS   Saito,H.
  JOURNAL   Unpublished (1989) Dana-Farber Cancer Inst,44 Binney St.,Boston MA
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Saito, 07-SEP-1989.
FEATURES       from  to/span     description
    pept         55     4443     protein tyrosine phosphatase (DPTP) precursor
    sigp         55      123     protein tyrosine phosphatase (DPTP) signal
                                 peptide
    matp        124     4440     protein tyrosine phosphatase (DPTP)
BASE COUNT     1456 a    996 c    970 g   1054 t      1 others
ORIGIN      
        1 atataatttt tgacatatgt atttatccgg cncggagtga gcaattagca agccatggcg
       61 ttactctacc gacgcatgtc gatgctatta aatataattc ttgcatatat tttcctttgt
      121 gcgatatgtg tgcaaggatc tgtgaaacaa gagtgggcag aaataggtaa aaacgtatct
      181 ctagagtgcg catctgaaaa cgaagcagta gcatggaaac tgggcaacca aactattaat
      241 aaaaaccata caagatataa aataagaact gagccattga aatccaacga cgatggcagt
      301 gaaaacaacg atagtcaaga cttcatgaag tataagaatg tcctgacgct tctcgatgtc
      361 aatataaatg actcggggaa ctacacctgc acagcccaaa cggggcaaaa ccattcgact
      421 gaattccaag ttaagccata ccttccatcc aaagtcctgc aaagtacccc cgacaggatc
      481 aagagaaaaa tcaagcagga tgtcatgttg tactgtttga tcgagatgta cccacaaaat
      541 gagacaacta atagaaacct caagtggctt aaggacggca gccagtttga attcctggac
      601 accttctcgt ccatttcgaa gctaaacgat acgcacttaa acttcaccct ggaattcaca
      661 gaagtttaca agaaagagaa tggcacctac aagtgcaccg tcttcgatga caccggactc
      721 gaaattacct ccaaagagat tactcttttc gtaatggaag tgccacaagt tagcattgat
      781 ttcgccaagg cagttggtgc taataaaata tacctaaatt ggaccgttaa cgacggcaac
      841 gatccaattc agaagttctt catcactctg caggaggctg gaacaccgac ttttacttat
      901 cataaggact ttattaacgg cagccataca tcgtatattc tggaccattt caagccgaac
      961 acaacctatt ttttaagaat cgtgggaaag aactcgattg gcaatggcca acccacccag
     1021 tatccccagg gaatcaccac gcttagttat gatcccatat ttataccaaa agtcgagacc
     1081 accggcagca cggcgtccac gataacgatt ggctggaatc caccgccgcc ggatctcatt
     1141 gattatatac aatactacga actgattgtc tccgaatcgg gcgaggtacc caaagtgatc
     1201 gaagaggcca tttaccagca gaattctcga aacttgccat acatgtttga taagcttaag
     1261 accgccacag actacgaatt tagggtaagg gcatgtagtg atctaaccaa gacttgtgga
     1321 ccatggtccg aaaacgtgaa cggcacaacg atggacggcg tggcaacaaa acccaccaac
     1381 ttgagcatac agtgtcatca tgacaacgtc acgagaggca actccatcgc cattaactgg
     1441 gacgttccca agacgccaaa cggcaaggtt gtgtcatatt taattcactt gctgggcaat
     1501 cccatgagca cagtggatag agagatgtgg ggaccaaaga ttcgaagaat cgatgaaccc
     1561 catcacaaga ccctctacga aagtgttagc ccgaacacaa actacacggt gacggtgtct
     1621 gccataacgc ggcacaagaa gaacggcgaa ccggccaccg gaagttgtct aatgcccgtc
     1681 tccacgccgg atgccattgg ccgtaccatg tggtcaaagg tgaacctgga ctccaagtac
     1741 gttctcaagt tgtatctacc gaagatcagc gaacgcaatg ggcccatatg ctgctataga
     1801 ttgtatctag ttagaattaa caatgacaac aaggaattgc cggacccgga gaagctaaac
     1861 atcgccacat atcaggaggt tcacagcgat aacgtgacta gaagtagtgc atatatagcg
     1921 gaaatgataa gtagcaagta ctttaggccg gaaatattct tgggcgctga aaagagattt
     1981 agtgagaaca acgatataat ccgcggcaac gacgaaattt gtcggaaatg cttggagggt
     2041 actccatttc tgaggaaacc cgagatcatt cacatacccc cacaaggttc actatcaaat
     2101 tccgattctg aactgcccat tttgtcggag aaggacaact tgatcaaagg agcaaactta
     2161 acagagcatg ctctgaaaat cttagaaagt aagttaaggg ataaaagaaa cgcggtgacc
     2221 agcgatgaga atccaattct aagcgccgtc aacccaaatg tgccactcca cgattctagt
     2281 cgagatgtct tcgatggtga gatagatatt aactccaact acaccggatt cctagagata
     2341 atagttcggg atcgaaacaa tgccctgatg gcttatagca aatactttga tataattact
     2401 ccggcgactg aagctgaacc catccaatcc ttgaacaata tggactacta cctaagcatt
     2461 ggggtcaagg ctggagccgt attacttggt gtcatacttg tatttattgt gctgtgggtt
     2521 ttccatcaca agaaaaccaa gaatgaactg cagggcgaag acactttaac actaagagat
     2581 tctttgagca gggcattgtt cggtcgacga aatcacaacc acagtcattt tataacttcc
     2641 ggaaatcaca aaggattcga cgctggcccc attcacagat tagatttaga aaacgcctac
     2701 aagaaccgac ataaggacac tgactacgga tttctgcggg aatatgagat gctgccaaat
     2761 cgctttagcg atcggacaac taaaaacagt gatttaaagg agaacgcctg caagaacagg
     2821 tatcccgata taaaggctta cgatcagacg cgcgtaaagt tggctgtcat caatggcctg
     2881 caaactacgg actacatcaa tgctaacttc gtaattggct acaaggagag aaagaagttt
     2941 atctgtgcac agggtccaat ggaaagtacc atcgatgatt tttggcgtat gatttgggaa
     3001 caacatctag aaataattgt gatacttaca aatttagagg aatataacaa ggccaagtgc
     3061 gcgaaatatt ggccagaaaa agtatttgat acgaaacagt tcggagatat tttagtgaaa
     3121 tttgcacaag aacgtaagac tggtgattat attgaacgta ccctgaacgt ttccaagaac
     3181 aaagccaatg tcggcgagga ggaggaccgt agacaaatca cccaatacca ctacctaacg
     3241 tggaaggact tcatggcacc agagcatcca catggcatca tcaaattcat acgtcaaatc
     3301 aattccgtct actccctgca aaggggtcca attttagtgc attgcagtgc tggtgtgggt
     3361 agaaccggaa ccctggtggc tttggattcc ctaatccaac agctggagga agaagactcg
     3421 gtgtccattt acaacacagt gtgtgatcta cgacaccaac gaaatttttt agtccaatct
     3481 ctgaaacaat acatctttct ttaccgggct ctattagata ctggaacctt tggaaacacg
     3541 gatatttgca ttgatacaat ggcttctgca attgaatctc ttaagcggaa acccaacgag
     3601 ggtaaatgca aattggaaat ggaattcgag aaactactgg ccacagcaga tgagataagt
     3661 aaatcatgta gtgtgggcga aaacgaggag aacaatatga agaacagaag tcaagagatt
     3721 ataccctacg accgcaacag agttatactg acaccacttc cgatgcggga aaactcaaca
     3781 tacattaacg catcattcat agagggctat gataatagcg aaaccttcat cattgcccag
     3841 gatccatttg aaaacactat aggagacttc tggcgaatga tctcggaaca gagtgttacc
     3901 accctcgtca tgatatccga aatcggagat ggtcccagga aatgcccgcg atactgggca
     3961 gatgatgagg ttcaatacga ccacatactc gtgaaatacg tgcacagcga aagttgtcca
     4021 tactatactc gccgcgaatt ttatgttacg aattgcaaaa tagacgatac gctgaaagtc
     4081 acacaatttc aatacaatgg ttggcccacc gtggacggag aagttcctga agtctgccgt
     4141 ggcattatag aacttgtaga tcaagcatac aaccattata agaacaacaa gaattctggt
     4201 tgtcgatctc cactcacagt tcattgcagt ctgggaaccg atcgaagttc cattttcgtt
     4261 gccatgtgta ttttggtcca gcatctcagg ctggagaagt gtgtcgacat ctgtgccaca
     4321 acaaggaaat tacgatctca gcgaacggga cttatcaact catacgcaca atacgagttc
     4381 ctacatcgcg caataattaa ttattcagac ttacatcaca tagccgagtc aacattggat
     4441 taagttagat atgtatatac ttatacgagt accgaat
//
LOCUS       MUSPPC1       643 bp ds-DNA             ROD       29-AUG-1989
DEFINITION  Mouse secretory granule proteoglycan peptide core gene, exon 1.
ACCESSION   M27391 J05044
KEYWORDS    proteoglycan peptide core.
SEGMENT     1 of 3
SOURCE      M.musculus (strain BALB/c) liver DNA, clone lambda-MG-PG1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 643)
  AUTHORS   Avraham,S., Austen,K.F., Nicodemus,C.F., Gartner,M.C. and
            Stevens,R.L.
  TITLE     Cloning and characterization of the mouse gene that encodes the
            peptide core of secretory granule proteoglycans, and expression of
            this gene in transfected rat-1 fibroblasts
  JOURNAL   J. Biol. Chem. 264, 16719-16726 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            provided by R.L. Stevens, 02-AUG-1989.
FEATURES       from  to/span     description
    pept        546  +   618     secretory granule proteoglycan peptide core
    pre-msg     505  >   643     peptide core mRNA and introns
    IVS         619  >   643     peptide core intron A
BASE COUNT      141 a    134 c    180 g    188 t
ORIGIN      
        1 aattctagca gactctggac gttaacggag accgctcatc ctgggggctg agaacccagc
       61 tcggctcgga atgttccctg cttgtgcctg actctgtgcg cgcccagctt ctctttgatg
      121 tgcgctgtgg atgagccgag ctcagttctg gaacagctga gtcctcctgt ctgtttagat
      181 tgttacctga aggaagggag ggggaagaaa gtgctgattc gactttttga tggggaaaac
      241 ttttttttta aacatgcaaa tgacagatgg cagagctttt tggaaaaaga aaaaataata
      301 accacacagc aaacgcctag ggggagtccg gtggagtttc atcatgggta tgaacagttg
      361 ttgttttttt caactttctt cttctttctg ggtgttgatg tggatctctt tctatttgtt
      421 caggaaactg tgacgtgtgt tcttgggcag ggtctgaggt tttggaacct ctttctaaaa
      481 gggacagaaa gagcaccctg ctacatttgc taatccagag gctgagtgga gccgagctgg
      541 tcaggatgca ggttcccgtc ggcagcaggc ttgtcctggc tctcgccttc gtcctggttt
      601 ggggatcttc agtgcaaggt aagagaccca ggatctttaa ttc
//
LOCUS       MUSPPC2       183 bp ds-DNA             ROD       29-AUG-1989
DEFINITION  Mouse secretory granule proteoglycan peptide core gene, exon 2.
ACCESSION   M27392 J05044
KEYWORDS    proteoglycan peptide core.
SEGMENT     2 of 3
SOURCE      M.musculus (strain BALB/c) liver DNA, clone lambda-MG-PG1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 183)
  AUTHORS   Avraham,S., Austen,K.F., Nicodemus,C.F., Gartner,M.C. and
            Stevens,R.L.
  TITLE     Cloning and characterization of the mouse gene that encodes the
            peptide core of secretory granule proteoglycans, and expression of
            this gene in transfected rat-1 fibroblasts
  JOURNAL   J. Biol. Chem. 264, 16719-16726 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            provided by R.L. Stevens, 02-AUG-1989.
FEATURES       from  to/span     description
    pept    +    19  +   163     secretory granule proteoglycan peptide core
    pre-msg <     1  >   183     peptide core mRNA and introns
    IVS     <     1       18     peptide core intron A
    IVS         164  >   183     peptide core intron B
BASE COUNT       46 a     44 c     44 g     49 t
ORIGIN      
        1 ggttccttgt tcgcacaggt tatcctgctc ggagagccag gtaccagtgg gtccgctgca
       61 aaccgaatgg cttttttgcg aactgcatcg aggagaaggg accacagttt gacctaatag
      121 atgaatccaa taacatcggc cctcccatga ataatcctgt tttgtaagta gactttcatc
      181 gat
//
LOCUS       MUSPPC3       681 bp ds-DNA             ROD       29-AUG-1989
DEFINITION  Mouse secretory granule proteoglycan peptide core gene, exon 3.
ACCESSION   M27393 J05044
KEYWORDS    proteoglycan peptide core.
SEGMENT     3 of 3
SOURCE      M.musculus (strain BALB/c) liver DNA, clone lambda-MG-PG1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 681)
  AUTHORS   Avraham,S., Austen,K.F., Nicodemus,C.F., Gartner,M.C. and
            Stevens,R.L.
  TITLE     Cloning and characterization of the mouse gene that encodes the
            peptide core of secretory granule proteoglycans, and expression of
            this gene in transfected rat-1 fibroblasts
  JOURNAL   J. Biol. Chem. 264, 16719-16726 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            provided by R.L. Stevens, 02-AUG-1989.
FEATURES       from  to/span     description
    pept    +    20      260     secretory granule proteoglycan peptide core
    pre-msg <     1  >   678     peptide core mRNA and introns
    IVS     <     1       19     peptide core intron B
BASE COUNT      208 a    135 c    123 g    215 t
ORIGIN      
        1 ttttttcttt gtattttagg atggaaggac cctcaaaaga tttcatctcc aattatgatg
       61 actatgggtc aggttcgggc tccggctctg gctccggctc tggctcgggt tccggctccg
      121 gaagtggctt cctaggtgac atggaatggg aataccagcc aacagatgaa agcaatattg
      181 tctatttcaa ctataagcct tttgacagga ttctcactga gcaaaaccaa gaccaaccag
      241 aagacgattt tattatatga atgtgacggt ctctgtctcc ccacctccat gtggaacaat
      301 gtattcagta tacttagtgt accacgttta aatgaccagt ctcaggataa agagttttac
      361 agaaaattta aaatgcctgg aaaagactct tgaatcctgt tacccctttc ctcattaact
      421 cgtaaggaat tatgctttaa tgctgttacc tatcttgttg ttctggaaaa tgcctgcatt
      481 tatgtgtatt gaatcaacat ttaagaaatt aacacacacc cccattatta tacaataact
      541 ttcaaagcca tactggtttt gaaaatttta atttgatagc aagttgatga acaatctttc
      601 atacctaaag tgttcaggaa cccaactcgc attgtgaatt acaaatatat tcctttatgt
      661 gattaaaaag aaaataaagt g
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       CE3LYS        392 bp ds-DNA             BCT       15-JUN-1989
DEFINITION  Plasmid ColE3 lysis protein gene, complete cds.
ACCESSION   X03631
KEYWORDS    inverted repeat; lysis protein.
SOURCE      Plasmid Colicin E3 (from E.coli) DNA.
  ORGANISM  Plasmid Colicin E3
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 392; enum. 1 to 392)
  AUTHORS   Toba,M., Masaki,H. and Ohta,T.
  TITLE     Primary structures of the CoIE2-P9 and CoIE3-CA38 lysis genes
  JOURNAL   J. Biochem. 99, 591-596 (1986)
  STANDARD  simple automatic
COMMENT     EMBL features not translated to GenBank features:
               key        from     to       description
            
               INVREP      233    247       imp. inverted repeat A
               INVREP      249    263       imp. inverted repeat A'
               SITE        264    269       T-stretch
               OPR         233    269       pot. transcription terminator
FEATURES       from  to/span     description
    pept         65      208     lysis protein (aa 1-47)
BASE COUNT      110 a     78 c     93 g    111 t
ORIGIN      
        1 tgaaatatga atgccggttg tttatggatg aatggctggc attctttcac aacaaggagt
       61 cgttatgaaa aaaataacag ggattatttt attgcttctt gcagtcatta ttctgtctgc
      121 atgtcaggca aactatatcc gggatgttca gggcgggacc gtatctccgt catcaacagc
      181 tgaagtgacc ggattagcaa cgcagtaacc cgaaatcctc tttgacaaaa acaaagcgtg
      241 tcaggctgat tctgatgcgc tttttttttg aaatgtcaca aaaattccat gtgggagatg
      301 ggatctaaaa tcctcgtgca gaactttcca tccaggggga gaaaacttgt cgttttgagc
      361 cgttcggtgt tcagaacgca cgaaaccgat cg
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       CECCOLE3IM    915 bp ds-DNA             BCT       16-JUN-1986
DEFINITION  Plasmid ColE3-CA38 immunity protein gene, complete cds, and colicin
            E3 gene, 3' end.
ACCESSION   X01162
KEYWORDS    colicin; immune response gene; nuclease; toxin.
SOURCE      Plasmid ColE3-CA38 DNA, from E.coli.
  ORGANISM  Plasmid Colicin E3-CA38
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 915)
  AUTHORS   Lau,P.C.K., Rowsome,R.W., Zuker,M. and Visentin,L.P.
  TITLE     Comparative nucleotide sequences encoding the immunity proteins and
            the carboxyl-terminal peptides of colicins E2 and E3
  JOURNAL   Nucleic Acids Res. 12, 8733-8745 (1984)
  STANDARD  simple staff_review
COMMENT     EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE         22    255       pot. receptor-binding domain
               SITE        256    546       pot. RNase domain
FEATURES       from  to/span     description
    pept    <     1      549     colicin E3 (AA at 1)
    pept        559      816     immunity protein (aa 1-85)
BASE COUNT      297 a    129 c    236 g    253 t
ORIGIN      21 bp upstream of TaqI site.
        1 gctgaaataa aacaatttaa tcgatttgcc catgacccaa tggctggcgg tcacagaatg
       61 tggcaaatgg ccgggcttaa agcccagcgg gcgcagacgg atgtaaataa taagcaggct
      121 gcatttgatg ctgctgcaaa agagaagtca gatgctgatg ctgcattgag ttctgctatg
      181 gaaagcagga agaagaaaga agataagaaa aggagtgctg aaaataattt aaacgatgaa
      241 aagaataagc ccagaaaagg ttttaaagat tacgggcatg attatcatcc agctccgaaa
      301 actgagaata ttaaagggct tggtgatctt aagcctggga taccaaaaac accaaagcag
      361 aatggtggtg gaaaacgcaa gcgctggact ggagataaag ggcgtaagat ttatgagtgg
      421 gattctcagc atggtgagct tgaggggtat cgtgccagtg atggtcagca tcttggctca
      481 tttgacccta aaacaggcaa tcagttgaaa ggtccagatc cgaaacgaaa tatcaagaaa
      541 tatctttgag aggaagttat gggacttaaa ttggatttaa cttggtttga taaaagtaca
      601 gaagatttta agggtgagga gtattcaaaa gattttggag atgacggttc agttatggaa
      661 agtctaggtg tgccttttaa ggataatgtt aataacggtt gctttgatgt tatagctgaa
      721 tgggtacctt tgctacaacc atactttaat catcaaattg atatttccga taatgagtat
      781 tttgtttcgt ttgattatcg tgatggtgat tggtgatcaa atattatcag ggatgagttg
      841 atatacgggc ttctagtgtt catggatgaa cgctggagcc tccaaatgta gaaatgttat
      901 attttttatt gagtt
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       FSOFARS1A     132 bp ds-DNA             PLN       15-MAR-1990
DEFINITION  F.oxysporum autonomous replicating sequence (ARS).
ACCESSION   M29451
KEYWORDS    autonomous replicating sequence.
SOURCE      F.oxysporum lycopersici (strain 73) ARS DNA, clone pFOLT4R4.
  ORGANISM  Fusarium oxysporum
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE   1  (bases 1 to 132)
  AUTHORS   Powell,W.A. and Kistler,H.C.
  TITLE     In vivo rearrangement of foreign DNA by Fusarium oxysporum produces
            linear self-replicating plasmids
  JOURNAL   Unpublished (1989) College Env. Sci., S.U.N.Y., Sayracuse, NY 13210
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.A.Powell 27-OCT-1989.
FEATURES       from  to/span     description
    site          1        9     DNA homologous to yeast core sequence
    rpt           7       12     telomere repeat unit
    site        111      117     ligation point of telomeres -> ApaI site
BASE COUNT       26 a      8 c     56 g     42 t
ORIGIN      116 bp upstream of ApaI site.
        1 tttggtttag ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg
       61 ttagggttag ggttagggtt agggttaggg ttagggttag ggttagggtt agggccctaa
      121 ccctaaacca aa
//
LOCUS       MZEGST3AA     913 bp ss-mRNA            PLN       15-SEP-1989
DEFINITION  Maize GSH glutathione S-transferase III mRNA, complete cds.
ACCESSION   X06755
KEYWORDS    glutathione S-transferase; tandem repeat; transferase.
SOURCE      Maize, cDNA to mRNA, clone pGTC27.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 913)
  AUTHORS   Grove,G., Zarlengo,R.P., Timmermann,K.P., Li,N., Tam,M.F. and
            Tu,C.-P.D.
  TITLE     Characterization and heterospecific expression of cDNA clones of
            genes in the maize GSH S-transferase multigene family
  JOURNAL   Nucleic Acids Res. 16, 425-438 (1988)
  STANDARD  simple staff_entry
COMMENT     Data kindly reviewed (16 JUNE 1988) by D.Tu.
FEATURES       from  to/span     description
    pept         75      743     glutathione S transferase III (EC 2.5.1.18)
    mRNA    <     1      873     GST mRNA (alt.)
    mRNA    <     1      884     GST mRNA (alt.)
    mRNA    <     1      898     GST mRNA (alt.)
    mRNA    <     1      913     GST mRNA (alt.)
    rpt          54       71     tandemly repeated trinucleotide
BASE COUNT      145 a    332 c    270 g    166 t
ORIGIN      
        1 cccactttac tcctatccac tgcggcctgg acgcgtgcga gaggcttgac caagcagcag
       61 cagcagcagc agcgatggcg cctctgaagc tgtacgggat gccgctgtcc cccaacgtgg
      121 tgcgcgtggc caccgtgctc aacgagaagg gcctcgactt cgagatcgtc cccgtcgacc
      181 tcaccaccgg cgcccacaag cagcccgact tcctcgccct caaccctttc ggccagatcc
      241 cggctctcgt cgacggagac gaagtcctct tcgagtcccg tgcgatcaac cggtacatcg
      301 ccagcaagta cgcgtcggag ggcacggacc tgctccccgc gacggcgtcg gcggcgaagc
      361 tggaggtgtg gctggaggtg gagtcgcacc acttccaccc gaacgcgtcg ccgctggtgt
      421 tccagctgct cgtgaggccg ctcctgggcg gcgcccccga cgcggcggtg gtggagaagc
      481 acgcggagca gctcgccaag gtgctcgacg tgtacgaggc gcacctggcc cgcaacaagt
      541 acctcgccgg ggacgagttc acgctcgccg acgccaacca cgcgctccta cctgctctca
      601 cctcagcaag acccccaagg ccgggctgcg tcgccgcccg cccccacgtc aaggcctggt
      661 gggaggccat cgccgcccgc cccgcgttcc agaagaccgt cgccgccatc cccttgcccc
      721 cgccgccctc ctcctcggct tgacctcgcc ttgcgttgcc tgggtcgcgg atgcgtcgga
      781 gccctgagtc gaataaaaga ggccgcatcc tgtcttgcat ttgctcgtgc gccatgtgct
      841 ataacagcct gtgtaataaa cactgttgcc tttcgtgtgt gttcattgcc tttttgttgg
      901 ttggtctttg caa
//
LOCUS       MZEGSTI       929 bp ss-mRNA            PLN       15-SEP-1989
DEFINITION  Maize GSH glutathione S-transferase I mRNA, complete cds.
ACCESSION   X06754
KEYWORDS    glutathione S-transferase.
SOURCE      Maize, cDNA to mRNA, clone pGTC2.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 929)
  AUTHORS   Grove,G., Zarlengo,R.P., Timmermann,K.P., Li,N., Tam,M.F. and
            Tu,C.-P.D.
  TITLE     Characterization and heterospecific expression of cDNA clones of
            genes in the maize GSH S-transferase multigene family
  JOURNAL   Nucleic Acids Res. 16, 425-438 (1988)
  STANDARD  simple staff_entry
COMMENT     Data kindly reviewed (16 June 1988) by D. Tu
FEATURES       from  to/span     description
    pept         82      726     GST glutathione S transferase I (EC 2.5.1.18)
    mRNA    <     1      915     GST mRNA (alt.)
    mRNA    <     1      918     GST mRNA (alt.)
    mRNA    <     1      924     GST mRNA (alt.)
    mRNA    <     1      929     GST mRNA (alt.)
    rpt          35       65     tandemly repeated pentanucleotide
BASE COUNT      211 a    230 c    277 g    211 t
ORIGIN      
        1 gggcgcagag aatccccaag caaacaaaca gggtagaggg agaggagagg agaggagagg
       61 agaggttggg tctgggacac catggctccg atgaagctgt acggggcggt gatgtcgtgg
      121 aacttgacga ggtgcgcaac ggcgctggag gaggctggct ccgactacga gatcgtgccc
      181 atcaacttcg ccaccgccga gcacaagagc cccgagcacc tcgtccgcaa cccgtttggt
      241 caggttccag ctctgcagga tggtgacttg tacctcttcg aatcacgagc aatctgcaag
      301 tacgctgctc gcaaaaacaa gccagagctg ttgagggaag gaaacctcga ggaggcagca
      361 atggtggatg tttggatcga ggtggaggct aaccagtaca ccgctgcact gaatcccatc
      421 ctcttccagg tcctcatcag tcctatgctt gggggaacca ccgaccagaa agttgtggat
      481 gagaaccttg agaagctgaa gaaggtgcta gaggtgtacg aggcacgcct gaccaagtgc
      541 aagtaccttg ctggagactt cctcagcctc gccgacctga accatgtgtc tgtcactctc
      601 tgcctgttcg ctacgcccta cgcatctgtg ctcgacgcct acccgcatgt gaaggcctgg
      661 tggtctggtc tgatggagag gccgtctgtc cagaaggtcg ctgccctgat gaagccatct
      721 gcttgaaacg gttgccctgt caatcggtga atggaataag ccaagtctgt tctgggtgtt
      781 tgcttgttca gtgatgtgtc ttctatgacg aaagttgagt gccttatgtg tgagcatgct
      841 tgcttgtaac gagaccttga attgttcgga aatgagcatt cctgtgctct tcttgaaaat
      901 aaactttgcg ttaggttcct caggatctt
//
LOCUS       MZEPCSSU      830 bp ss-mRNA            PLN       15-DEC-1988
DEFINITION  Maize ribulose 1,5-bisphosphate carboxylase small subunit mRNA,
            complete cds.
ACCESSION   D00170
KEYWORDS    RuBPC; ribulose 1,5-bisphosphate carboxylase; small subunit.
SOURCE      Maize leaf, cDNA to mRNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 830)
  AUTHORS   Matsuoka,M., Kano-Murakami,Y., Tanaka,Y., Ozeki,Y. and Yamamoto,N.
  TITLE     Nucleotide sequence of cDNA encoding the small subunit of ribulose-
            1,5-bisphosphate carboxylase from maize
  JOURNAL   J. Biochem. 102, 673-676 (1987)
  STANDARD  simple automatic
COMMENT     Comparison with the RuBPC small subunit genes from other plants
            revealed that the maize small subunit is similar to the wheat one,
            there being 73% homology between the transit peptides and 64%
            between the mature proteins. This indicates that there is no
            noteworthy difference between the C3 and C4 small subunit
            structures.  corresponds to DDBJ (rel3.0) entry <RUBPCSSU>(D00170)
            dated 29-MAR-1988
FEATURES       from  to/span     description
    pept         66      578     ribulose 1,5-bisphosphate carboxylase small
                                 subunit (SSu)
BASE COUNT      147 a    281 c    220 g    182 t
ORIGIN      65 bp upstream of the SSu initiation codon.
        1 caggacgacc caagcaagca agcaagcagc gagtacatac atacatacta ggcagccagg
       61 cagccatggc gcccaccgtg atgatggcct cgtcggccac cgccgtcgct ccgttccagg
      121 ggctcaagtc caccgccagc ctccccgtcg cccgccgctc ctccagaagc ctcggcaacg
      181 tcagcaacgg cggaaggatc cggtgcatgc aggtgtggcc ggcctacggc aacaagaagt
      241 tcgagacgct gtcgtacctg ccgccgctgt cgacggacga cctgctgaag caggtggact
      301 acctgctgcg caacggctgg ataccctgcc tcgagttcag caaggtcggc ttcgtgtacc
      361 gcgagaactc cacctccccg tgctactacg acggccgcta ctggaccatg tggaagctgc
      421 ccatgttcgg ctgcaacgac gccacccagg tgtacaagga gctgcaggag gccatcaaat
      481 cctacccgga cgccttccac cgcgtcatcg gcttcgacaa catcaagcag acgcagtgcg
      541 tcagcttcat cgcctacaag cccccgggca gcgactagac cgcgcccgcc ggccgccccc
      601 cgccggctag ctagctagct agctcctgcg tgagctagta gctagtgcca tgcgtcgtct
      661 ctgtcgttcg gttttgcttc gggtcaccgt accctttgct tgcttggttt cttctttcct
      721 tttttccttt tttttttctt cttttccccg gccatggttc ctttgctttc agcagttctc
      781 tgtgatgtga tgtatccatt gttgcaagca tgcatggcct tgcattggct
//
LOCUS       PETRBCA       318 bp ss-mRNA            PLN       15-MAR-1990
DEFINITION  Petunia ribulose 1,5-bisphosphate carboxylase small subunit mRNA
            (clone pSSU 117), partial cds.
ACCESSION   M29641
KEYWORDS    ribulose 1,5-bisphosphate carboxylase.
SOURCE      Petunia (strain Mitchell) leaf cDNA to mRNA, clone pSSU 117.
  ORGANISM  Petunia hybrida
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Dunsmuir,P., Smith,S. and Bedbrook,J.
  TITLE     A number of different nuclear genes for the small subunit of
            RuBPCase are transcribed in petunia
  JOURNAL   Nucleic Acids Res. 11, 4177-4183 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   318     ribulose 1,5-bisphosphate carboxylase (AA at
                                 1)
BASE COUNT       89 a     67 c     76 g     86 t
ORIGIN      
        1 tacgagacac tctcatactt tcccgatttg accgacgaac aattgttcaa agaagttgag
       61 taccttttga ataagggatg ggttccttgt ttggaattcg agctgaaaca caaatttatc
      121 taccgtgaat atcacgcatc accaggatac tatgatggaa ggtactggac aatgtggaag
      181 ttgcccatgt ttggttgcac tgatgctacc caagtcttgg gtgagctcca agaggccaag
      241 aaggcttacc caaatgcatg gatcaggatc atcggattcg agaacgttcg tcaagtgcag
      301 tgcatcagtt tcattgcc
//
LOCUS       PETRBCB       212 bp ss-mRNA            PLN       15-MAR-1990
DEFINITION  Petunia ribulose 1,5-bisphosphate carboxylase small subunit mRNA
            (clone pSSU 51), partial cds.
ACCESSION   M29642
KEYWORDS    ribulose 1,5-bisphosphate carboxylase.
SOURCE      Petunia (strain Mitchell) leaf cDNA to mRNA, clone pSSU 51.
  ORGANISM  Petunia hybrida
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Dunsmuir,P., Smith,S. and Bedbrook,J.
  TITLE     A number of different nuclear genes for the small subunit of
            RuBPCase are transcribed in petunia
  JOURNAL   Nucleic Acids Res. 11, 4177-4183 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      207     ribulose 1,5-bisphosphate carboxylase (AA at
                                 1)
BASE COUNT       53 a     57 c     53 g     49 t
ORIGIN      
        1 catgcatctc cagggtacta tgatggcagg tactggacca tgtggaagct gcccatgttc
       61 gggtgcactg atgccaccca agtcttgggt gagctccaag aggccaagaa ggcttacccc
      121 aatgcctgga tcagtatcat tggcttcgac aacgtacgtc aagtgcaatg catcagtttc
      181 attgcctaca agccccaagg ctactagatt ac
//
LOCUS       PETRBCC       251 bp ss-mRNA            PLN       15-MAR-1990
DEFINITION  Petunia ribulose 1,5-bisphosphate carboxylase small subunit mRNA
            (clone pSSU 41), partial cds.
ACCESSION   M29643
KEYWORDS    ribulose 1,5-bisphosphate carboxylase.
SOURCE      Petunia (strain Mitchell) leaf cDNA to mRNA, clone pSSU 41.
  ORGANISM  Petunia hybrida
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 251)
  AUTHORS   Dunsmuir,P., Smith,S. and Bedbrook,J.
  TITLE     A number of different nuclear genes for the small subunit of
            RuBPCase are transcribed in petunia
  JOURNAL   Nucleic Acids Res. 11, 4177-4183 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      141     ribulose 1,5-bisphosphate carboxylase (AA at
                                 1)
BASE COUNT       77 a     67 c     61 g     46 t
ORIGIN      
        1 accgatgcaa cccaggtgtt gggtgagctc caagaggcca agaaggctta ccctaatgca
       61 tggatcagaa tcatcggatt cgacaacgtc cgtcaagtgc aatgcatcag tttcattgcc
      121 tacaagccac caggctacta agttatatta ggacagcttc ccatgtgtat ttaggggcag
      181 tcccggcgga ttgggagcca aggaccccgg ccggccaaaa atagccagga acccaccaaa
      241 aaaccaaaaa a
//
LOCUS       SPIPSI2       863 bp ss-mRNA            PLN       15-MAR-1990
DEFINITION  Spinach psAD mRNA for subunit II precursor of photosystem I
            reaction center.
ACCESSION   X14017
KEYWORDS    photosynthesis; photosystem I; photosystem I reaction center.
SOURCE      spinach (Spinacia oleracea).
  ORGANISM  Spinacia oleracea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE   1  (bases 1 to 863)
  AUTHORS   Muench,S., Ljungberg,U., Steppuhn,J., Schneiderbauer,A.,
            Nechushtai,R., Beyreuther,K. and Herrmann,R.G.
  TITLE     Nucleotide sequences of cDNAs encoding the entire precursor
            polypeptides for subunits II and III of the photosystem I reaction
            center from spinach
  JOURNAL   Curr. Genet. 14, 511-518 (1988)
  STANDARD  simple automatic
COMMENT     *source: tissue=seedling; library=lambda gt11; clone=p6SocPS12-4.
            
            seq. differs from that reported by Lagoutte (1988), see Y00759; the
            psaD gene for subunit II probably exists only in a single copy per
            spinach genome
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               MSG          <1    863       psaD mRNA
            
               SITE        842    846       put. polyA signal
               POLYA       863    863       polyA site
FEATURES       from  to/span     description
    pept        155      793     subunit II preprotein (AA -50 to 162)
    sigp        155      307     transit peptide (AA -50 to -1)
    matp        305      790     mature subunit II (AA 1 - 162)
BASE COUNT      279 a    214 c    158 g    212 t
ORIGIN      
        1 ctaaggaaaa cagaaaaatt gccacaaatt gggctacagg aagctttaat tcaaactaca
       61 gtctaatggt acacattcaa caagtcctat aaagtacaat tagttgacaa aacaaaaaac
      121 caatccaaag ttccaaaaca acaacaatca aattatggcc atggcaactc aagccaccct
      181 cttctcccca tcttctctct cctccgcaaa acccattgac acccgcctca ccacctcttt
      241 caaacaaccc tccgccctct ctttcgcctc caagcccgcc tcccgccacc actctatccg
      301 ggcagccgcc gcagctgaag gcaaagctgc caccccaact gaaaccaagg aggcgcctaa
      361 aggttttact cctccggagt tggaccctaa cactccctcc cccatctttg ccggcagcac
      421 aggtggacta ttgaggaagg cacaagtaga ggagttttac gtgataacat gggaatcacc
      481 aaaagagcaa atattcgaga tgccaacagg aggagcagca ataatgaggg aaggaccaaa
      541 cttgctaaaa ttagcacgta aagagcaatg tttagcactt ggaacaagac ttaggtctaa
      601 atacaagatt aagtaccaat tttacagggt tttcccaagt ggtgaagttc aatatttaca
      661 ccctaaagat ggtgtttacc cagaaaaggt taaccctggt agacaaggtg ttgggttaaa
      721 catgaggtct attggtaaga atgttagccc tattgaggtt aaatttactg gcaagcaacc
      781 ttatgatttg taataacttt ttatcttatt ttcatcatgt tcatgctact atgttttgga
      841 caaatacaat gtgattattt tag
//
LOCUS       SPIPSI3       653 bp ss-mRNA            PLN       15-MAR-1990
DEFINITION  Spinach psaE mRNA for subunit III precursor of photosystem I
            reaction center.
ACCESSION   X14018
KEYWORDS    photosynthesis; photosystem I; photosystem I reaction center;
            psaE gene.
SOURCE      spinach (Spinacia oleracea).
  ORGANISM  Spinacia oleracea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE   1  (bases 1 to 653)
  AUTHORS   Muench,S., Ljungberg,U., Steppuhn,J., Schneiderbauer,A.,
            Nechushtai,R., Beyreuther,K. and Herrmann,R.G.
  TITLE     Nucleotide sequences of cDNAs encoding the entire precursor
            polypeptides for subunits II and III of the photosystem I reaction
            center from spinach
  JOURNAL   Curr. Genet. 14, 511-518 (1988)
  STANDARD  simple automatic
COMMENT     *source: tissue=seedling; library=lambda gt11; clone=p6SocPS12-4.
            
            at least two genes for subunit III are present in spinach genome
            designated psaE-1 and psaE-2  by [1]
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               MSG          <1    653       psaE mRNA
               MSG          <1    608       altern. psAE mRNA
            
               POLYA       608    608       altern. polyA site
               POLYA       653    653       polyA site
FEATURES       from  to/span     description
    pept        106      483     subunit III preprotein (AA -34 to 91)
    sigp        106      207     transit peptide (AA -34 to -1)
    matp        208      480     subunit III (AA 1 - 91)
BASE COUNT      198 a    140 c    139 g    176 t
ORIGIN      
        1 aaaacacaaa gctactagtt caaagggact ccctcctttt cactacttgc aaaatttatt
       61 tcttgtaaga agaaaaaaca gagagagaga gggagaaaat taataatggc tagcattgca
      121 tcatcagtag ctgttaggct tgggctaaca caagtgttgc ccaacaaaaa cttctcatct
      181 ccaaggagca ctaggcttgt tgtccgggct gctgaagagg ctgctgctgc acccgctgcc
      241 gcatcgcccg aaggtgaagc acccaaagct gctgctaagc caccaccaat tggccccaag
      301 aggggttcta aggtgagaat catgagaaag gaatcatact ggtacaaggg tgttggctca
      361 gtggttgctg ttgatcagga tccaaaaaca agatatccag ttgtggtgag attcaacaag
      421 gtgaattacg cgaatgtatc aaccaacaac tacgcactgg atgagattca agaagttgca
      481 tgatcaactc tgttataagt ttgcgaaatt tccaaacatt tagatgaatt aggagctatt
      541 tctgctgtgt gtatccaatg taaatctact tcttttgagt atcaacttct ttgcccatga
      601 aacttgtcag atatgctcct catcatccta agtattcacc atattctttc ctt
//
LOCUS       SPIPSI4       659 bp ss-mRNA            PLN       15-MAR-1990
DEFINITION  Spinach mRNA for photosystem I subunit V.
ACCESSION   X13134 Y00967
KEYWORDS    photosystem I; photosystem I reaction center.
SOURCE      spinach (Spinacia oleracea).
  ORGANISM  Spinacia oleracea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE   1  (bases 1 to 659)
  AUTHORS   Herrmann,R.G.
  TITLE     ;
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 659)
  AUTHORS   Steppuhn,J., Hermans,J., Nechushtai,R., Ljungberg,U., Thuemmler,F.,
            Lottspeich,F. and Herrmann,R.G.
  TITLE     Nucleotide sequence of cDNA clones encoding the entire precursor
            polypeptides for subunits IV and V of the photosystem I reaction
            center
  JOURNAL   FEBS Lett. 237, 218-224 (1988)
  STANDARD  simple automatic
COMMENT     *source: tissue=green tissue; clone=p6SocPI5-7.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               MSG          <1    659       PSI subunit V mRNA
            
               SITE        620    624       put. polyA signal
               POLYA       659    659       polyA site
            
            [1]  Author address Herrmann R.G., Botanisches Institut der Ludwig
            Maximilians Universiteit, Menzigerstr. 67, 8000 Muenchen 19, FRG.
            
            Submitted (06-OCT-1988) on tape to the EMBL data library
FEATURES       from  to/span     description
    pept         16      519     PSI subunit V preprotein (AA -69 to 98)
    sigp         16      222     transit peptide (AA -69 to -1)
    matp        223      516     mature PSI subunit V (AA 1 - 98)
BASE COUNT      185 a    128 c    131 g    215 t
ORIGIN      
        1 taaatccgta aaaaaatggc agcagcaaca gcatctttga gcagcacttt actagcaccc
       61 tgcagctcaa agcagccgca gccgcaacag caacatcagc atcagcagct caaatgcaag
      121 tccttttccg gcctccgccc actgaagcta aacatcagca gcaacaacag cagcagcagt
      181 ctatcgatgt cttctgcaag aagatcaatg acttgccgtg cagagttgag cccatcattg
      241 gtgataagtc tgagtacagg gctatccctt tttttaggaa ggtttgtgtt cttcaatttc
      301 cagagggaga atatggccaa acaggtgcct gaacaaaatg gtatgtctca ttttgaggcg
      361 ggcgatactc gtgctaagga gtatgtgagc cttcttaagt ctaatgatcc tgttgggttt
      421 aacattgttg atgttcttgc ttggggttct attggtcata ttgttgctta ttatatcttg
      481 gctactgcta gcaatggtta tgaccctagc ttcttttaaa ttatcatatt tcggatttta
      541 gtttcttttt taattgttat attattatta ttattattat tattatgttg agttgttgtg
      601 taattaacag attctgctca aatattattg cattcacttc tgtaaaaaaa aaaaaaaaa
//
LOCUS       SPIPSI5       923 bp ss-mRNA            PLN       15-MAR-1990
DEFINITION  Spinach mRNA for photosystem I subunit IV.
ACCESSION   X13133 Y00967
KEYWORDS    photosystem I; photosystem I reaction center.
SOURCE      spinach (Spinacia oleracea).
  ORGANISM  Spinacia oleracea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE   1  (bases 1 to 923)
  AUTHORS   Herrmann,R.G.
  TITLE     ;
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 923)
  AUTHORS   Steppuhn,J., Hermans,J., Nechushtai,R., Ljungberg,U., Thuemmler,F.,
            Lottspeich,F. and Herrmann,R.G.
  TITLE     Nucleotide sequence of cDNA clones encoding the entire precursor
            polypeptides for subunits IV and V of the photosystem I reaction
            center
  JOURNAL   FEBS Lett. 237, 218-224 (1988)
  STANDARD  simple automatic
COMMENT     *source: tissue=green tissue; clone=p6SocPI4-7;
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               MSG          <1    923       PSI subunit IV mRNA
            
               MSG         923    923       polyA site
            
            [1]  Author address Herrmann R.G., Botanisches Institut der Ludwig
            Maximilians Universiteit, Menzigerstr. 67, 8000 Muenchen 19, FRG.
            
            Submitted (06-OCT-1988) on tape to the EMBL data library
FEATURES       from  to/span     description
    pept         30      725     PSI subunit IV preprotein (AA -77 to 154)
    sigp         30      260     signal peptide (AA -77 to -1)
    matp        261      722     PSI subunit IV (AA 1 - 154)
BASE COUNT      249 a    183 c    202 g    289 t
ORIGIN      
        1 cctcacttcc ttccatccaa aacacaacca tgtctttcac aatccccact aacctctaca
       61 aacctcttgc taccaaacct aagcatctat cttcttcctc ctttgcacca agatcaaaga
      121 ttgtttgcca acaagaaaat gatcaacaac aacctaagaa gctagagcta gctaaggtag
      181 gtgccaacgc tgccgctgcg ttggctcttt cctcagtgct actttcctca tggagtgtag
      241 cacccgatgc tgccatggct gacattgcag ggctaacacc atgcaaggag tccaagcagt
      301 ttgctaagag ggaaaagcag gccttgaaga agttgcaagc ctcgttgaag ctgtacgcag
      361 atgatagcgc accagcttta gcgatcaagg ctacaatgga gaagaccaag aagaggttcg
      421 ataactatgg aaagtatggt ttgctttgtg ggtctgatgg acttcctcat ttgatagtga
      481 gtggtgatca gagacactgg ggtgagttta tcacaccagg gattttgttc ttgtatattg
      541 ctgggtggat tgggtgggtt ggaaggagtt atttgattgc tattagggat gagaagaaac
      601 ctactcagaa ggagattatt attgatgttc ctttggcttc aagtcttctt tttagaggct
      661 tcagttggcc tgttgctgct tatagagagt tgctcaatgg tgagcttgtt gacaacaact
      721 tttaatttta tcatcctttt atcaaatcaa ttaatcctgt gttatattgt ctactatgta
      781 tatcttaaat actccttttt ggaaacttga atgtgtgttt cgattgtttg attgattcat
      841 ccggatgtaa tcatcatccg attagaattg tgtaagtgtt tgtttaatga tgctaattgg
      901 aaccgacgat cttttacaaa aaa
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       ATPCPRBCLA   1790 bp ds-DNA             ORG       20-FEB-1990
DEFINITION  A.rosea chloroplast ribulose bisphosphate carboxylase/oxygenase
            large subunit (rbcL) gene, complete cds.
ACCESSION   X15924
KEYWORDS    ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE      A.rosea chloroplast DNA.
  ORGANISM  Chloroplast Atriplex rosea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae; Atriplex rosea.
REFERENCE   1  (bases 1 to 1790)
  AUTHORS   Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
            Andrews,T.J. and Whitfield,P.R.
  TITLE     Comparisons of rbcL genes for the large subunit of
            ribulose-bisphosphate carboxylase from closely related C3 and C4
            plant species
  JOURNAL   J. Biol. Chem. 265, 808-814 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            G.S. Fudson, 20-JUL-1989.
FEATURES       from  to/span     description
    pept        259     1686     ribulose bisphosphate carboxylase/oxygenase.
                                 large subunit (EC 4.1.1.39)
BASE COUNT      521 a    314 c    417 g    538 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccgaaa ttagaaactt gaaaaacaac ggttacgatt gggttgcgcc atatatatga
       61 aagagtatac aataatgatg tatttgacga atcaaataca tggtctatta gcgaaccgtt
      121 tgattagttg ataatattaa ttgagaattt attttatgaa agattcctat aaaaggtttc
      181 attaggggct aatttatgtc gagtagacct tgttgttttg ttgtaaaaat ttaaaattga
      241 agttgtaggg agggacttat gtcaccacaa acagagacta aagcaaatgt tggatttaaa
      301 gctggtgtta aagattacaa attgacttat tatactcctg agtatgaaac cctagatact
      361 gatatcttgg cagcattccg agtaagtcct caaccgggag ttccacccga agaagcaggg
      421 gctgcagtag ctgctgaatc ttctactggt acatggacaa ctgtatggac cgacggactt
      481 accagtcttg atcgttacaa aggacgatgc taccacatcg agcctgttgc tggagaagaa
      541 aatcaatata tttgttatgt agcatatccc ttagaccttt ttgaagaagg ttctgttact
      601 aatatgttta cttccattgt gggtaacgta tttgggttca aagccctgcg tgctctacgt
      661 ttggaggatt tgcgaatccc tgttgcttat gtaaaaactt tccaaggccc gcctcacggt
      721 atccaagttg agagagataa attgaataag tatggccgcc ccctattggg atgcactatt
      781 aaaccaaaat tggggttatc cgctaaaaac tatggtcgag cagtttatga agtccttcgt
      841 ggtggacttg attttaccaa agatgatgaa aacgtgaact cccagccgtt tatgcgttgg
      901 agagaccgtt tcctattttg tgccgaagct ctttataaag cacaagccga aacaggcgaa
      961 atcaagggtc attacttgaa tgctactgcg ggtacatgcg aagacatgat gaaaagggct
     1021 gtatttgcca gagaattggg agttcctatc gtaatgcatg actacttaac agggggattc
     1081 actgcaaata ctactttgtc tcattattgc cgagataatg gtctacttct gcacatccac
     1141 cgtgcaatgc acgcagttat tgatagacag aagaatcatg gtatacactt ccgtgtacta
     1201 gctaaagcgt tacgtctgtc tggtggagat catattcatt ctggtaccgt agtaggtaag
     1261 cttgaaggag aaagagatat tactttaggt tttgttgatt tactacgtga tgattatact
     1321 gaaaaagacc gaagtcgtgg tatttatttt agtcaatctt gggtttccac accaggtgtt
     1381 ctgcccgttg cttcgggagg tattcacgtt tggcatatgc ctgctctaac cgagatcttt
     1441 ggggatgatt ctgtactaca gtttggtgga ggaactttag gacacccttg ggggaatgca
     1501 ccgggtgctg tagcgaatcg agtagctcta gaagcatgtg tacaagctcg taatgaggga
     1561 cgtgatcttg ctcgtgaggg taatacaatt attcgcgagg ctagcaaatg gagtcctgaa
     1621 ctagctgctg cttgtgagat atggaaggaa atcaaatttg aattcccagc aatggataca
     1681 gtttaggcta agtaattaat gttcggtctc ttagttgaat tgtaattaaa caaactcggc
     1741 ccaatctttt actaaaagga ttgagccgaa tacaattatt gtagatatat
//
LOCUS       ATPCPRBCLB   1786 bp ds-DNA             ORG       20-FEB-1990
DEFINITION  A.patula chloroplast ribulose bisphosphate carboxylase/oxygenase
            large subunit (rbcL) gene, complete cds.
ACCESSION   X15925
KEYWORDS    ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE      A.patula chloroplast DNA.
  ORGANISM  Chloroplast Atriplex patula
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae; Atriplex patula.
REFERENCE   1  (bases 1 to 1786)
  AUTHORS   Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
            Andrews,T.J. and Whitfield,P.R.
  TITLE     Comparisons of rbcL genes for the large subunit of
            ribulose-bisphosphate carboxylase from closely related C3 and C4
            plant species
  JOURNAL   J. Biol. Chem. 265, 808-814 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            G.S. Fudson, 20-JUL-1989.
FEATURES       from  to/span     description
    pept        255     1682     ribulose bisphosphate carboxylase/oxygenase.
                                 large subunit (EC 4.1.1.39)
BASE COUNT      517 a    316 c    419 g    534 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccgaaa ttagaaactt gaaaaacaac ggttacgatt gggttgcgcc atatatatga
       61 aagagtatac aataatgatg tatttgacga atcaaataca tggtctatta gcgaaccgtt
      121 tgattagttg ataatattaa ttgagaattt tatgaaagat tcctataaaa ggtttcatta
      181 ggggctaatt tatgtcgagt agaccttgtt gttttgttgt aaaaatttaa aattgaagtt
      241 gtagggaggg acttatgtca ccacaaacag agactaaagc aagtgttgga tttaaagctg
      301 gtgttaaaga ttacaaattg acttattata ctcctgagta tgaaacccta gatactgata
      361 tcttggcagc attccgagta agtcctcaac cgggagttcc acccgaagaa gcaggggctg
      421 cagtagctgc tgaatcttct actggtacat ggacaactgt atggaccgac gggcttacca
      481 gtcttgatcg ttacaaagga cgatgctacc acatcgagcc tgttgctgga gaagaaaatc
      541 aatatatttg ttatgtagca tatcccttag acctttttga agaaggttct gttactaata
      601 tgtttacttc cattgtgggt aacgtatttg ggttcaaagc cctgcgtgct ctacgtttgg
      661 aggatttgcg aatccctgtt gcttatgtaa aaactttcca aggcccgcct cacggtatcc
      721 aagttgagag agataaattg aacaagtatg gccgccccct attgggatgc actattaaac
      781 caaaattggg gttatccgct aaaaactatg gtcgagcagt ttatgaagtc cttcgtggtg
      841 gacttgattt taccaaagat gatgaaaacg tgaactccca gccgtttatg cgttggagag
      901 accgtttcct attttgtgcc gaagctcttt ataaagcaca agccgaaaca ggcgaaatca
      961 agggtcatta cttgaatgct actgcgggta catgcgaaga catgatgaaa agggctgtat
     1021 ttgccagaga attgggagtt cctatcgtaa tgcatgacta cttaacaggg ggattcactg
     1081 caaatactac tttgtctcat tattgccgag ataatggtct acttctgcac atccaccgtg
     1141 caatgcacgc agttattgat agacagaaga atcatggtat gcacttccgt gtactagcta
     1201 aagcgttacg tctgtctggt ggagatcata ttcattctgg taccgtagta ggtaagcttg
     1261 aaggagaaag agatattact ttaggttttg ttgatttact acgtgatgat tatactgaaa
     1321 aagaccgaag tcgtggtatt tattttactc aatcttgggt ttccacacca ggtgttctgc
     1381 ccgttgcttc gggaggtatt cacgtttggc atatgcctgc tctaaccgag atctttgggg
     1441 atgattctgt actacagttt ggtggaggaa ctttaggaca cccttggggg aatgcaccgg
     1501 gtgctgtagc gaatcgagta gctctagaag catgtgtaca agctcgtaat gagggacgtg
     1561 atcttgctcg tgagggtaat acaattattc gcgaggctag caaatggagt cctgaactag
     1621 ctgctgcttg tgagatatgg aaggaaatca aatttgaatt cccagcaatg gatacagttt
     1681 aggctaagta attaatgttc ggtctcttag ttgaattgta attaaacaaa ctcggcccaa
     1741 tcttttacta aaaggattga gccgaataca attattgtag atatat
//
LOCUS       BSPMANN      1939 bp ds-DNA             BCT       06-FEB-1990
DEFINITION  Bacillus sp. beta-mannanase gene, complete cds.
ACCESSION   M31797
KEYWORDS    1,4-beta-D-mannan mannanohydrolase;
            mannan endo-1,4-beta-mannosidase; mannanase.
SOURCE      Bacillus sp. (strain AM-001) DNA, clone pMAH5.
  ORGANISM  Bacillus sp.
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1939)
  AUTHORS   Akino,T., Kato,C. and Horikoshi,K.
  TITLE     Two bacillus beta-mannanases having different COOH termini are
            produced in Escherichia coli carrying pMAH5
  JOURNAL   Appl. Environ. Microbiol. 55, 3178-3183 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        220     1761     beta-mannanase precursor (EC 3.2.1.78)
    sigp        220      297     beta-mannanase signal peptide
    matp        298     1758     beta-mannanase
    signal     1826     1840     pot. transcription termination signal
    signal     1861     1889     pot. transcription termination signal
    signal     1905     1916     pot. transcription termination signal
BASE COUNT      637 a    343 c    420 g    539 t
ORIGIN      1 bp upstream of XbaI site.
        1 tctagactcc aaaggttact atcaacctgt ctatttattt aactgtacag tagatggggt
       61 agaatcaaac catcatcatc cctgccatct aaattcatta tatgaactcc tcaatagaga
      121 acaacaaatc ataatccaac catatttttc taatcaatca ctatgttaag ataaaaaatg
      181 taatcgctta caattaaaag gatagaggag gattatgtaa tgaaggtgta caagaaggtg
      241 gcttttgtta tggcttttat tatgtttttt tcggtcctgc cgacgatctc aatgtcgtca
      301 gaagcaaacg gtgctgcatt atcgaatcct aatgcgaacc aaacgacaaa aaacgtgtat
      361 agttggttag ccaatctacc aaacaagagt aataaacgtg tggtgtcggg acacttcgga
      421 gggtacagtg attctacctt agcctggatc aaacaatgcg caagggagct gacaggaaaa
      481 atgccaggaa tattatcttg tgattataag aattggcaga cgcgattgta tgtagccgat
      541 caaattagct atggctgcaa tcaagaatta ataaactttt ggaaccaagg aggtttggtc
      601 acgatcagtg tacacatgcc aaatccaggg tttcattcgg gggaaaacta caaaacaatt
      661 ttgcctactt cacagttcca aaatctaacc aatcacagga caacagaggg tagaaggtgg
      721 aaggatatgc tggataagat ggcagatggg ttggacgagc tacagaacaa tggagtgacg
      781 gttcttttcc gtcctttaca tgaaatgaat ggagaatggt tctggtgggg agcagaaggt
      841 tacaatcaat ttgatcaaac acgtgccaat gcctatatca gcgcatggag agatatgtat
      901 caatatttta ctcatgagcg taagctgaat aaccttattt gggtttactc acctgatgtt
      961 tacagagatc atgtaacaag ttactaccca ggagcaaatt atgtagatat tgtggctctt
     1021 gattcctacc atcctgatcc acatagcctt actgaccaat ataatcgaat gatcgcttta
     1081 gataaacctt ttgcttttgc tgaaatcggt cctcctgaaa gcatggctgg ttcctttgat
     1141 tattcaaatt atattcaagc aattaaacaa aaatatccac gtactgtcta tttcctagct
     1201 tggaatgata aatggagtcc acataacaac agaggagcat gggatctatt taatgattca
     1261 tgggttgtaa ataggggaga gattgattat ggtcaatcaa atccagccac tgttctctat
     1321 gattttgaaa acaatacgct atcgtggtcc gggtgtgaat ttacggacgg aggaccatgg
     1381 acttcgaatg aatggtcggc aaatggtact caatcgttga aagcagatgt cgttctgggc
     1441 aataatagct accatttgca aaaaacagtg aatcgaaatc ttagttcatt caaaaaccta
     1501 gaaattaaag tgagccattc ttcgtgggga aatgtaggaa gtggcatgac agcaagagtt
     1561 ttcgtcaaaa cagggagtgc ttggagatgg aatgcaggtg aattttgtca gtttgcaggc
     1621 aaacgaacaa ccgcactatc tattgatttg acgaaagtaa gtaatctgca tgatgttcga
     1681 gagataggtg tagagtataa agcaccagca aatagcaacg ggaagacggc gatttactta
     1741 gatcatgtga ccgtaagata atacaaaaaa aagtggttga aagcggtaac atatctagca
     1801 tatgatgata gggactagat aataatagac tgtcagacta ggaggtaagt cataatgaaa
     1861 aaaagtctga tcctcttgct cggactttta ttagctttct ccatgctatt aatagcctat
     1921 ctatcattca cccctgcag
//
LOCUS       FTRCPRBCLA   1839 bp ds-DNA             ORG       20-FEB-1990
DEFINITION  F.bidentis chloroplast ribulose bisphosphate carboxylase/oxygenase
            large subunit (rbcL) gene, complete cds.
ACCESSION   X15923
KEYWORDS    ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE      F.bidentis chloroplast DNA.
  ORGANISM  Chloroplast Flaveria bidentis
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Asterales; Asteraceae; Flaveria bidentis.
REFERENCE   1  (bases 1 to 1839)
  AUTHORS   Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
            Andrews,T.J. and Whitfield,P.R.
  TITLE     Comparisons of rbcL genes for the large subunit of
            ribulose-bisphosphate carboxylase from closely related C3 and C4
            plant species
  JOURNAL   J. Biol. Chem. 265, 808-814 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            G.S. Fudson, 20-JUL-1989.
FEATURES       from  to/span     description
    pept        280     1737     ribulose bisphosphate carboxylase/oxygenase.
                                 large subunit (EC 4.1.1.39)
BASE COUNT      523 a    325 c    437 g    554 t
ORIGIN      
        1 aggggggaat ttcttagttt ttgggtgatt tttaggtatt tggataaaaa aaaaagcatt
       61 gggttgcgcc atatatatga aagagtatac aataatgatg tatttgccga atcaaatacc
      121 atggtctaat aatcaagcat tctgattagt tgataatttt actattagtt gggaattttg
      181 tgaaaggttc ctgtaaaaag tttcattaac gcctaattca tgtcgagtag accttgttgt
      241 tgtgagaatt cttaattcat gagttgtagg gagggattta tgtcaccaca aacagagact
      301 aaagcaagtg ttggattcaa agctggtgtt aaagattata aattgactta ttatactcct
      361 gaatatgaaa ccaaggatac tgatatcttg gcagcatttc gagtaactcc tcaacctgga
      421 gttccgcctg aagaagcagg ggccgcagta gctgccgaat cttctactgg tacatggaca
      481 actgtatgga ccgatggact tacgagcctt gatcgttaca aaggccgatg ctatggaatc
      541 gagcctgttc ctggagaaga caatcaatat attgcttatg tagcttaccc attagacctt
      601 tttgaagaag gttctgttac taacatgttt acttccattg taggtaatgt atttgggttc
      661 aaagccctgc gtgctctacg tctggaagat ttgcgaatcc ctactgcgta tgttaaaact
      721 ttcgcgggtc cgcctcacgg tatccaagtt gagagagata aattgaacaa gtatggtcgt
      781 cccctgttgg gctgtactat taaacctaaa ttggggttat ccgctaaaaa ctacggtaga
      841 gcttgttatg aatgtcttcg tggtggcctt gattttacta aagatgatga gaacgtgaac
      901 tcccaaccat ttatgcgttg gagagaccgt ttcttatttt gtgccgaagc tatttataaa
      961 gcacaagctg aaacaggtga aatcaaaggg cattacttga atgctactgc gggtacatgc
     1021 gaagaaatgc ttaaaagggc tgtatttgcc agagaattgg gagttcctat cataatgcat
     1081 gactacctaa cagggggatt cactgcaaat actagcttgt ctcattattg ccgagataac
     1141 ggcctacttc ttcacatcca ccgcgcaatg catgcagtta ttgatagaca gaagaatcat
     1201 ggtattcact tccgtgtact agctaaagcg ttacgtatgt ctggtggaga tcacattcat
     1261 tccggtaccg tagtaggtaa acttgaaggg gaaagagaaa tcactttggg ctttgttgat
     1321 ttattgcgtg atgattttat tgaaaaagat agaagtcgcg gtatttattt cacccaagat
     1381 tgggtctctc taccaggtgt tctgcctgta gcttcggggg gtattcacgt ttggcatatg
     1441 cctgctctga ccgagatctt tggggatgat tccgtactac agttcggtgg aggaacttta
     1501 gggcaccctt gggggaatgc acctggtgcc gtagctaatc gagtagctct agaagcatgt
     1561 gtacaagctc gtaatgaggg acgtgatctt gctactgagg gtaatgagat tatccgtgag
     1621 gctaccaaat ggagccctga actagctgct gcttgtgaag tatggaagga gatcaaattt
     1681 gagttccagg caatggatac tttggatacg gataaggata aggataagaa gagataacgt
     1741 tcgttctctt aattgaattt caatgaaact cggcccaatc ttttactaaa aggattgagc
     1801 cgaatccaac atgcatatat tttagataga tacatactt
//
LOCUS       FTRCPRBCLB   1842 bp ds-DNA             ORG       20-FEB-1990
DEFINITION  F.pringlei chloroplast ribulose bisphosphate carboxylase/oxygenase
            large subunit (rbcL) gene, complete cds.
ACCESSION   X15922
KEYWORDS    ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE      F.pringlei chloroplast DNA.
  ORGANISM  Chloroplast Flaveria pringlei
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Asterales; Asteraceae; Flaveria pringlei.
REFERENCE   1  (bases 1 to 1842)
  AUTHORS   Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
            Andrews,T.J. and Whitfield,P.R.
  TITLE     Comparisons of rbcL genes for the large subunit of
            ribulose-bisphosphate carboxylase from closely related C3 and C4
            plant species
  JOURNAL   J. Biol. Chem. 265, 808-814 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            G.S. Fudson, 20-JUL-1989.
FEATURES       from  to/span     description
    pept        283     1740     ribulose bisphosphate carboxylase/oxygenase.
                                 large subunit (EC 4.1.1.39)
BASE COUNT      529 a    324 c    437 g    552 t
ORIGIN      
        1 aggggggaat ttcttagttt ttgggtgatt tttaggtatt tcgataaaaa aaaaaaaaga
       61 attgggttgc gccatatata tgaaagagta tacaataatg atgtatttgc cgaatcaaat
      121 accatggtct aataatcaag cattctgatt agttgataat tttactatta gttgggaatt
      181 ttgtgaaagg ttcctgtaaa aagtttcatt aacgcctaat tcatgtcgag tagaccttgt
      241 tgttgtgaga attcttaatt catgagttgt agggagggat ttatgtcacc acaaacagag
      301 actaaagcaa gtgttggatt caaagctggt gttaaagatt ataaattgac ttattatact
      361 cctgaatatg aaaccaagga tactgatatc ttggcagcat ttcgagtaac tcctcaacct
      421 ggagttccgc ctgaagaagc aggggccgca gtagctgccg aatcttctac tggtacatgg
      481 acaactgtat ggaccgatgg acttacgagc cttgatcgtt acaaaggccg atgctatgga
      541 atcgagcctg ttcctggaga agacaatcaa tatattgctt atgtagctta cccattagac
      601 ctttttgaag aaggttctgt tactaacatg tttacttcca ttgtaggtaa tgtatttggg
      661 ttcaaagccc tgcgtgctct acgtctggaa gatttgcgaa tccctactgc gtatgttaaa
      721 actttcgagg gtccgcctca cggtatccaa gttgagagag ataaattgaa caagtatggt
      781 cgtcccctgt tgggatgtac tattaaacct aaattggggt tatccgctaa aaactacggt
      841 agagcttgtt atgaatgtct tcgtggtggc cttgatttta ctaaagatga tgagaacgtg
      901 aactcccaac catttatgcg ttggagagac cgtttcttat tttgtgccga agctatttat
      961 aaagcacaag ctgaaacagg tgaaatcaaa gggcattact tgaatgctac tgcgggtaca
     1021 tgcgaagaaa tgcttaaaag ggctgtattt gccagagaat tgggagttcc tatcgtaatg
     1081 catgactacc taacaggggg attcactgca aatactagct tgtctcatta ttgccgagat
     1141 aacggcctac ttcttcacat ccaccgcgca atgcatgcag ttattgatag acagaagaat
     1201 catggtatgc acttccgtgt actagctaaa gcgttacgta tgtctggtgg agatcacatt
     1261 cattccggta ccgtagtagg taaacttgaa ggggaaagag aaatcacttt gggctttgtt
     1321 gatttactgc gtgatgattt tattgaaaaa gatagaagtc gcggtattta tttcacccaa
     1381 gattgggtct ctctaccagg tgttctgcct gtagcttcgg ggggtattca cgtttggcat
     1441 atgcctgctc tgaccgagat ctttggggat gattccgtac tacagttcgg tggaggaact
     1501 ttagggcacc cttgggggaa tgcacctggt gccgtagcta atcgagtagc tctagaagca
     1561 tgtgtacaag ctcgtaatga gggacgcgat cttgctactg agggtaatga aattatccgt
     1621 gaggctacca aatggagtcc tgaactagct gctgcttgtg aagtatggaa ggagatcaaa
     1681 tttgagttcc aggcaatgga tactttggat acggataagg ataaggataa gaagagataa
     1741 cgttcgttct cttaattgaa tttcaatgaa actcggccca atcttttact aaaaggattg
     1801 agccgaatcc aacatgcata tattttagat agatacatac tt
//
LOCUS       HUMHPRT1     1331 bp ss-mRNA            PRI       15-DEC-1989
DEFINITION  Human hypoxanthine phosphoribosyltransferase (HPRT) mRNA, complete
            cds.
ACCESSION   M31642 J00205
KEYWORDS    hypoxanthine phosphoribosyltransferase; transferase.
SOURCE      Human, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1331)
  AUTHORS   Jolly,D.J., Okayama,H., Berg,P., Esty,A.C., Filpula,D., Bohlen,P.,
            Johnson,G.G., Shively,J.E., Hunkapillar,T. and Friedmann,T.
  TITLE     Isolation and characterization of a full-length expressible cDNA
            for human hypoxanthine phosphoribosyltransferase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 477-481 (1983)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept         86      742     hypoxanthine phosphoribosyltransferase
                                 /nomgen="HPRT" /map="Xq26"
                                 /hgml_locus_uid="LE0031K"
    mRNA    <     1     1331     hprt mRNA
BASE COUNT      385 a    237 c    281 g    428 t
ORIGIN      Chromosome Xq26.
        1 tcttgctgcg cctccgcctc ctcctctgct ccgccaccgg cttcctcctc ctgagcagtc
       61 agcccgcgcg ccggccggct ccgttatggc gacccgcagc cctggcgtcg tgattagtga
      121 tgatgaacca ggttatgacc ttgatttatt ttgcatacct aatcattatg ctgaggattt
      181 ggaaagggtg tttattcctc atggactaat tatggacagg actgaacgtc ttgctcgaga
      241 tgtgatgaag gagatgggag gccatcacat tgtagccctc tgtgtgctca aggggggcta
      301 taaattcttt gctgacctgc tggattacat caaagcactg aatagaaata gtgatagatc
      361 cattcctatg actgtagatt ttatcagact gaagagctat tgtaatgacc agtcaacagg
      421 ggacataaaa gtaattggtg gagatgatct ctcaacttta actggaaaga atgtcttgat
      481 tgtggaagat ataattgaca ctggcaaaac aatgcagact ttgctttcct tggtcaggca
      541 gtataatcca aagatggtca aggtcgcaag cttgctggtg aaaaggaccc cacgaagtgt
      601 tggatataag ccagactttg ttggatttga aattccagac aagtttgttg taggatatgc
      661 ccttgactat aatgaatact tcagggattt gaatcatgtt tgtgtcatta gtgaaactgg
      721 aaaagcaaaa tacaaagcct aagatgagag ttcaagttga gtttggaaac atctggagtc
      781 ctattgacat cgccagtaaa attatcaatg ttctagttct gtggccatct gcttagtaga
      841 gctttttgca tgtatcttct aagaatttta tctgttttgt actttagaaa tgtcagttgc
      901 tgcattccta aactgtttat ttgcactatg agcctataga ctatcagttc cctttgggcg
      961 gattgttgtt taacttgtaa atgaaaaaat tctcttaaac cacagcacta ttgagtgaaa
     1021 cattgaactc atatctgtaa gaaataaaga gaagatatat tagtttttta attggtattt
     1081 taatttttat atatgcagga aagaatagaa gtgattgaat attgttaatt ataccaccgt
     1141 gtgttagaaa agtaagaagc agtcaatttt cacatcaaag acagcatcta agaagttttg
     1201 ttctgtcctg gaattatttt agtagtgttt cagtaatgtt gactgtattt tccaacttgt
     1261 tcaaattatt accagtgaat ctttgtcagc agttcccttt taaatgcaaa tcaataaatt
     1321 cccaaaaatt t
//
LOCUS       NRACPRBCLA   2010 bp ds-DNA             ORG       20-FEB-1990
DEFINITION  N.tenuifolia chloroplast ribulose bisphosphate
            carboxylase/oxygenase (rbcL) gene, complete cds.
ACCESSION   X15920
KEYWORDS    ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE      N.tenuifolia chloroplast DNA.
  ORGANISM  Chloroplast Neurachne tenuifolia
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Graminneae; Neurachne tenuifolia.
REFERENCE   1  (bases 1 to 2010)
  AUTHORS   Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
            Andrews,T.J. and Whitfield,P.R.
  TITLE     Comparisons of rbcL genes for the large subunit of
            ribulose-bisphosphate carboxylase from closely related C3 and C4
            plant species
  JOURNAL   J. Biol. Chem. 265, 808-814 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            G.S. Fudson, 20-JUL-1989.
FEATURES       from  to/span     description
    pept        438     1874     ribulose bisphosphate carboxylase/oxygenase.
                                 large subunit (EC 4.1.1.39)
BASE COUNT      614 a    345 c    453 g    598 t
ORIGIN      
        1 agtagcccga tataaaataa ataaaaagaa aggggcctat taagaacttt caaaattgta
       61 aaataaggat tagggattgg tttgggttgc gctatatcta tcaaagagta tacaataatg
      121 atggatttgg tgaatcaaat ccatggttta ataacaaatc gtgttaactt accataacaa
      181 caactcaatt cctatcgaat tcctatagta gaattcctac aggatagaac gtacacaggg
      241 tgtacgcatt atatatgaat gaaacatatt cattaactta agcatactcc tttttttatt
      301 taatgagttg atattaattg aatatctttg tttttttttt tttagatttt tgcaaaggtt
      361 tcatttacgc ctaatccata tcgagtagac cctgtcgttg tgagaattat taattaatga
      421 gttgtaggga gggacttatg tcaccacaaa cagaaactaa agcaagtgtt ggatttaaag
      481 ctggtgttaa ggattataaa ttgacttact acaccccgga gtacgaaacc aaggatactg
      541 atatcttggc agcatttcga gtaactcctc agcccggggt tccgcctgaa gaagcagggg
      601 ctgcagtagc tgcggaatct tctactggta catggacaac tgtttggact gatggactta
      661 ccagtcttga tcgttacaaa ggacgatgct atcacatcga gcccgttcct ggggaggaag
      721 atcaatatat ctgttatgta gcttatccat tagacctatt tgaagagggt tctgttacta
      781 acatgtttac ttccattgtg ggtaacgtat ttggtttcaa agccctacgc gctctacgtt
      841 tggaggatct acgaattccc cctacttatt caaaaacttt ccaaggtccg cctcacggta
      901 tccaagttga aagggataag ttgaacaagt atgggcgtcc tttattggga tgtactatta
      961 aaccaaaatt gggattatct gcaaaaaatt acggtagagc gtgttatgag tgtctacgcg
     1021 gtggacttga ttttaccaaa gatgatgaaa acgtaaactc acaaccattt atgcgctgga
     1081 gagaccgttt tgtcttttgt gcggaagcaa tttataaagc acaagccgaa accggtgaaa
     1141 tcaagggaca ttacttgaat gcgactgcag gtacatgcga agaaatgatg aagagagcta
     1201 tatttgcaag agaattaggg gttcctattg taatgcatga ctacttaact ggaggattca
     1261 ccgcaaatac tagtttggct atgtattgcc gcgacaacgg cctacttctt cacattcacc
     1321 gagcaatgca tgcagttatt gatagacaga aaaatcatgg tatgcatttc cgtgtattag
     1381 ctaaagcatt gcgtatgtcg gggggagatc atgtccacgc cggtacagta gtaggtaagt
     1441 tagaagggga acgtgaaata actttaggtt ttgttgattt attgcgcgat gattttattg
     1501 aaaaagatcg atctcgcggt gtctttttca ctcaggactg ggtatccatg ccaggtgtta
     1561 taccggtggc ttcagggggt attcatgttt ggcatatgcc agctctgacc gaaatctttg
     1621 gagacgattc cgtattacaa tttggtggag gaactttagg acatccttgg ggaaatgcac
     1681 ctggtgcagc agctaatcgt gtggctttag aagcctgtgt acaagctcgt aacgaagggc
     1741 gtgatcttgc tcgtgaaggt aatgaaatta tcaaagcagc ttgcaaatgg agtcctgaac
     1801 tagccgcagc ttgtgaagta tggaaggcga tcaaatttga gttcgcgccg gtggataccg
     1861 tagataaggt ctaaataaaa taaaaaagaa gtaaaataga aagagaaaaa atcagttacg
     1921 aaatgcagta attcttcttt attcttctaa ttgattgcaa ttaaattcgg ctcaatcttt
     1981 taaaagattg agccgaattt aaatagatct
//
LOCUS       NRACPRBCLB   1990 bp ds-DNA             ORG       20-FEB-1990
DEFINITION  N.munroi chloroplast ribulose bisphosphate carboxylase/oxygenase
            large subunit (rbcL) gene, complete cds.
ACCESSION   X15921
KEYWORDS    ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase.
SOURCE      N.munroi chloroplast DNA.
  ORGANISM  Chloroplast Neurachne munroi
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Gramineae; Neurachne munroi.
REFERENCE   1  (bases 1 to 1990)
  AUTHORS   Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R.,
            Andrews,T.J. and Whitfield,P.R.
  TITLE     Comparisons of rbcL genes for the large subunit of
            ribulose-bisphosphate carboxylase from closely related C3 and C4
            plant species
  JOURNAL   J. Biol. Chem. 265, 808-814 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            G.S. Fudson, 20-JUL-1989.
FEATURES       from  to/span     description
    pept        418     1854     ribulose bisphosphate carboxylase/oxygenase.
                                 large subunit (EC 4.1.1.39)
BASE COUNT      613 a    340 c    445 g    592 t
ORIGIN      
        1 agtagcccga tataaaataa ataaaaagaa aggggcctat taagaacttt caaaatccta
       61 aaataaggat tagggattgg tttgggttgc gctatatcta tcaaagagta tacaataatg
      121 atggaatcaa atccatggtt taataacgaa tcgtgttaac ttaccataac aacaactcaa
      181 ttcctataga attcctatag gatagaacgt acacagggtg tacgcattat atatgaatga
      241 aacatattca ttaacttaag catactcctt tttttattta atgagttgat attaattgaa
      301 tatttttgaa tatttttttt tttagatttt tgcaaaggtt tcatttacgc ctaatccata
      361 tcgagtagac cttgtcgttg tgagaattat taattaatga gttgtaggga gggacttatg
      421 tcaccacaaa cagaaactaa agcaagtgtt ggatttaaag ctggtgttaa ggattataaa
      481 ttgacttact acaccccgga gtacgaaacc aaggatactg atatcttggc agcatttcga
      541 gtaactcctc agcccggggt tccgcctgaa gaagcagggg ctgcagtagc tgcggaatct
      601 tctactggta catggacaac tgtttggact gatggactta ccagtcttga tcgttacaaa
      661 ggacgatgct atcacatcga gcccgttcct ggggaagaag atcaatatat ctgttatgta
      721 gcttatccat tagacctatt tgaagagggt tctgttacta acatgtttac ttccattgtg
      781 ggtaacgtat ttggtttcaa agccctacgc gctctacgtt tggaggatct acgaattccc
      841 cctacttatt caaaaacttt ccaaggtccg cctcacggta tccaagttga aagggataag
      901 ttgaacaagt atgggcgtcc tttattggga tgtactatta aaccaaaatt gggattatct
      961 gcaaaaaatt acggtagagc gtgttatgag tgtctacgcg gtggacttga ttttaccaaa
     1021 gatgatgaaa acgtaaactc acaaccattt atgcgctgga gagaccgttt tgtcttttgt
     1081 gcggaagcaa tttataaagc acaagccgaa accggtgaaa tcaagggaca ttacttgaat
     1141 gcgactgcag gtacatgcga agaaatgatg aagagagctg catttgcaag agaattaggg
     1201 gttcctattg taatgcatga ctacttaact ggaggattca ccgcaaatac tagtttgtct
     1261 atgtattgcc gcgacaacgg cctacttctt cacattcacc gagcaatgca tgcagttatt
     1321 gatagacaga aaaatcatgg tatccatttc cgtgtattag ctaaagcatt gcgcatgtcg
     1381 gggggagatc atgtccactc cggtacagta gtaggtaagt tagaagggga acgtgaaata
     1441 actttaggtt ttgttgattt attgcgcgat gattttattg aaaaagatcg atctcgcggt
     1501 gtctttttca ctcaggactg ggtatccatg ccaggtgtta taccggtggc ttcagggggt
     1561 attcatgttt ggcatatgcc agctctgacc gaaatctttg gagacgattc tgtattacaa
     1621 tttggtggag gaactttagg acatccttgg ggaaatgcac ctggtgcagt agctaatcgt
     1681 gtggctttag aagcctgtgt acaagctcgt aacgaagggc gtgatcttgc tcgtgaaggt
     1741 aatgaaatta tcaaagcagc ttgcaaatgg agtcctgaac tagccgcagc ttgtgaagta
     1801 tggaaggcga tcaaatttga gttcgagccg gtggataccg tagataaggt ctaaataaaa
     1861 taaaaaagaa gtaaaataga aagagaaaaa ataagttacg aaatgcagta attcttcttt
     1921 attcttctaa ttgattgcaa ttaaattcgg ctcaatcttt taaaagattg agccgaattt
     1981 aaatagatct
//
LOCUS       RATPGKXL     1675 bp ss-mRNA            ROD       07-FEB-1990
DEFINITION  Rat X-chromosome linked phosphoglycerate kinase mRNA, complete cds.
ACCESSION   M31788
KEYWORDS    phosphoglycerate kinase.
SOURCE      Rat liver, cDNA to mRNA, clones 6, 15, and 16.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1675)
  AUTHORS   Ciccarese,S., Tommasi,S. and Vonghia,G.
  TITLE     Cloning and cDNA sequence of the rat X-chromosome linked
            phosphoglycerate kinase
  JOURNAL   Biochem. Biophys. Res. Commun. 165, 1337-1344 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         41     1294     phosphoglycerate kinase
BASE COUNT      426 a    359 c    459 g    431 t
ORIGIN      289 bp upstream of PstI site.
        1 ggcctctctc cgggcctctc gacctcacgg tgttgccaaa atgtcgcttt ctaacaagct
       61 gactttggac aagctggacg tgaaggggaa gcgggtcgtg atgagggtgg acttcaatgt
      121 tcctatgaag aacaaccaga taacgaataa ccaaaggatc aaggctgctg tcccaagcat
      181 caaattctgc ttggacaatg gagccaactc ggttgtgctt atgagccacc tgggccgtcc
      241 tgatggtgtg cccatgcccg acaagtactc cttagagcca gttgctgcag aactcaaatc
      301 tctgctgggc aaggatgttc tgttcttgaa ggattgtgtg ggctcagaag tagagaatgc
      361 ctgtgccaac ccagcggctg ggactgtcat cctcctggag aacctccgct ttcatgtaga
      421 ggaagaaggg aagggaaaag atgcttctgg gaacaaggtt aaagctgagc cagctaaaat
      481 tgatgctttc cgagcctccc tgtccaaact tggagatgtc tatgtcaatg atgcttttgg
      541 gactgcacac agagcccaca gttccatggt gggtgtgaat ctgccacaga aggctggtgg
      601 atttttgatg aagaaggagc tgaactactt tgccaaggct ttggagagtc cagagcgacc
      661 cttcctggct atcttgggag gagctaaagt tgcagacaag atccagctga tcaataatat
      721 gctagacaaa gtcaatgaga tgatcatcgg tgggggaatg gcttttacct tccttaaggt
      781 gctcaacaac atggagattg gcacatctct gtatgatgaa gagggagcca agattgtcaa
      841 agatctcatg accaaagctg agaaaaatgg tgtgaagatt accttgcctg ttgactttgt
      901 cactgctgac aaatttgatg agaatgcaaa gactggccaa gctactgtgg cctctggtat
      961 acctgctggc tggatgggct tggactgtgg tactgagagc agtaagaaat atgctgaggc
     1021 tgtggctcga gctaagcaga ttgtttggaa cggccctgtt ggggtatttg aatgggaagc
     1081 atttgccagg ggaaccaagt ccctcatgga tgaggtggtg aaagccacgt ctaggggctg
     1141 catcactatc ataggtggtg gagacaccgc cacttgctgt gccaaatgga acacagagga
     1201 taaagtcagc catgtgagca ctgggggcgg cgccagtttg gagctcctgg aaggtaaagt
     1261 ccttcctggg gtggatgctc tcagcaatgt ttagtatttt cctgcctttt ggttcctgtg
     1321 cacagcccct aagtcgactt agtgttttcc gcatctccat ttggtgttag tgcagctagt
     1381 ggccaagacg agcaccagga acctaagcag cagctgcaca gcatctcagc tcgtctttac
     1441 tgcatgggat tcattatagt taagtccatt taatttctta gtgactaaac cttgcattaa
     1501 ggcatatatt tatactctgc ctgtgaaagg aagtgagctg taaaagctta gctctcttcg
     1561 ctgtatgtag cctctggtta gcctgtcact gttcatgaca gcatggaaat aacgatgagg
     1621 attccagctg taggtttggg aggttgatga tattgaacca ataaaagatc ccact
//
LOCUS       RATQRE1       504 bp ds-DNA             ROD       20-FEB-1990
DEFINITION  Rat liver NAD(P)H:quinone reductase gene, exon 1.
ACCESSION   M31801
KEYWORDS    NAD(P)H:quinone reductase.
SEGMENT     1 of 5
SOURCE      Rat cell line H5-6 DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 504)
  AUTHORS   Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
  TITLE     Rat liver NAD(P)H:quinone reductase: Regulation of quinone
            reductase gene expression by planar aromatic compounds and
            determination of the exon structure of the quinone reductase
            structural gene
  JOURNAL   J. Biol. Chem. 264, 21793-21797 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        325  +   331     NAD(P)H:quinone reductase, exon 1
    pre-msg     206  >   504     NAD(P)H:quinone reductase mRNA and introns
    IVS         332  >   504     NAD(P)H:quinone reductase intron A
BASE COUNT      112 a    150 c    127 g    115 t
ORIGIN      
        1 actctcaagc tcccagaaaa cccttttgac ttccatatac agaagtccta ggccagccct
       61 aaactgcttc tccctgccaa taacttggta tcctcccccc agcgcctctg ggctcggcaa
      121 tccagccccg ccctcgctgg ctgccctgca cagtgggctg ggccggaaaa gcaagatata
      181 aagcctgaaa gtgctcagta cagctcgcac tagcctaggc tgtggcacgc aggatctttc
      241 cgaagcattt cagggtcgtc ctggcaacca gctgctcagc caatcagcgc ttgacactac
      301 gatccgcccc caacttctgg agccatggcg ggtgagtatg gctccaactc cagcctaatt
      361 catcctgagg aggatgtagg ggcttgctat ggggtttgtt ccttgcctcg aagttgaaaa
      421 gtgtagagat taggatcctg gatgagcctc ggtgagtccc cggaaggaga gcttcttctc
      481 agaaccatag gtgcagatta ttct
//
LOCUS       RATQRE2       634 bp ds-DNA             ROD       20-FEB-1990
DEFINITION  Rat liver NAD(P)H:quinone reductase gene, exons 2 and 3.
ACCESSION   M31802
KEYWORDS    NAD(P)H:quinone reductase.
SEGMENT     2 of 5
SOURCE      Rat liver cell line H5-6 DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 634)
  AUTHORS   Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
  TITLE     Rat liver NAD(P)H:quinone reductase: Regulation of quinone
            reductase gene expression by planar aromatic compounds and
            determination of the exon structure of the quinone reductase
            structural gene
  JOURNAL   J. Biol. Chem. 264, 21793-21797 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +   167      331     NAD(P)H:quinone reductase, exon 2
                402  +   532     NAD(P)H:quinone reductase, exon 3
    pre-msg <     1  >   634     NAD(P)H:quinone reductase mRNA and introns
    IVS     <     1      166     NAD(P)H:quinone reductase intron A
    IVS         332  >   401     NAD(P)H:quinone reductase intron B
    IVS         533  >   634     NAD(P)H:quinone reductase intron C
BASE COUNT      156 a    158 c    161 g    159 t
ORIGIN      Undtermined number of bp after segment 1.
        1 ctttagattt tcagcttaaa gacgtttagt tgataatgca gccactgttt cccgctctga
       61 gaaactaagg tggggaacgt gtctggtccc aagcactttt agattaggga ctcacccgtc
      121 ctgtttggat tttctttcct cacctcctca cgtacgcctt aaacagtgag aagagccctg
      181 attgtattgg cccacgcaga gaggacatca ttcaactatg ccatgaagga ggctgctgtg
      241 gaggctctga agaagaaagg atgggaggtg gtcgaatctg acctctatgc tatgaacttt
      301 aaccccctca tttccagaaa cgacatcaca ggtaagaatc gtctccctcc actgacagtg
      361 gaccacgtga cccagcctca gcccctcttg cctcccaaca ggggagccga aggactcgga
      421 gaactttcag taccctgttg agtcatctct ggcgtataag gaaggccgcc tgagcccgga
      481 tattgtagct gaacagaaaa agctggaagc tgcagacctg gtgatatttc aggtacctca
      541 taggacggaa gtcatgagag cagtgtctag ggtgtctgtc cttattgtcc ctagagccct
      601 ctgcattcta agtggtgtca gaaagtgtgc ggct
//
LOCUS       RATQRE3       248 bp ds-DNA             ROD       20-FEB-1990
DEFINITION  Rat liver NAD(P)H:quinone reductase gene, exon 4.
ACCESSION   M31803
KEYWORDS    NAD(P)H:quinone reductase.
SEGMENT     3 of 5
SOURCE      Rat liver cell line H5-6 DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 248)
  AUTHORS   Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
  TITLE     Rat liver NAD(P)H:quinone reductase: Regulation of quinone
            reductase gene expression by planar aromatic compounds and
            determination of the exon structure of the quinone reductase
            structural gene
  JOURNAL   J. Biol. Chem. 264, 21793-21797 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    27  +   140     NAD(P)H:quinone reductase, exon 4
    pre-msg <     1  >   248     NAD(P)H:quinone reductase mRNA and introns
    IVS     <     1       26     NAD(P)H:quinone reductase intron C
    IVS         141  >   248     NAD(P)H:quinone reductase intron D
BASE COUNT       48 a     52 c     74 g     74 t
ORIGIN      Undtermined number of bp after segment 2.
        1 tgacctcttg gtcttctgtc ccacagttcc cattgtattg gtttggggtg cccgccattc
       61 tgaaaggctg gtttgagaga gtgcttgtag caggattcgc ctacacgtat gccaccatgt
      121 atgacaaggg tcctttccag gtaggtgaac tgttgggtat ttcctgagta gattctgccc
      181 tgacatcaca gatggactct taagaccggg ttggggtggg gttggggggt gcactgtaag
      241 cttataaa
//
LOCUS       RATQRE4       199 bp ds-DNA             ROD       20-FEB-1990
DEFINITION  Rat liver NAD(P)H:quinone reductase gene, exon 5.
ACCESSION   M31804
KEYWORDS    NAD(P)H:quinone reductase.
SEGMENT     4 of 5
SOURCE      Rat liver cell line H5-6 DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 199)
  AUTHORS   Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
  TITLE     Rat liver NAD(P)H:quinone reductase: Regulation of quinone
            reductase gene expression by planar aromatic compounds and
            determination of the exon structure of the quinone reductase
            structural gene
  JOURNAL   J. Biol. Chem. 264, 21793-21797 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    63  +   164     NAD(P)H:quinone reductase, exon 5
    pre-msg <     1  >   199     NAD(P)H:quinone reductase mRNA and introns
    IVS     <     1       62     NAD(P)H:quinone reductase intron D
    IVS         165      199     NAD(P)H:quinone reductase intron E
BASE COUNT       41 a     55 c     53 g     50 t
ORIGIN      Undetermined number of bp after segment 3.
        1 cccccttcat taactcacgg ggaccctgga gaggagcggg tgatacttgt tttctctttc
       61 agaataagaa gaccttgctt tccatcacca ccgggggcag cggctccatg tactctctgc
      121 agggtgtcca cggggacatg aacgtcattc tctggccaat tcaggtagct cctcacgggg
      181 aacttccaga ggagtttgt
//
LOCUS       RATQRE5       964 bp ds-DNA             ROD       20-FEB-1990
DEFINITION  Rat liver NAD(P)H:quinone reductase gene, exon 6.
ACCESSION   M31805
KEYWORDS    NAD(P)H:quinone reductase.
SEGMENT     5 of 5
SOURCE      Rat liver cell line H5-6 DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 964)
  AUTHORS   Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B.
  TITLE     Rat liver NAD(P)H:quinone reductase: Regulation of quinone
            reductase gene expression by planar aromatic compounds and
            determination of the exon structure of the quinone reductase
            structural gene
  JOURNAL   J. Biol. Chem. 264, 21793-21797 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    57      362     NAD(P)H:quinone reductase, exon 6
    pre-msg <     1  >   964     NAD(P)H:quinone reductase mRNA and introns
    IVS     <     1       56     NAD(P)H:quinone reductase intron E
    site        727      757     region of dyad symmetry
BASE COUNT      222 a    229 c    240 g    273 t
ORIGIN      Undetermined number of bp after segment 4.
        1 tgcttacagt gcagactcac tcccgtgcct tacctctgtg ttctgtgtct ctccagagtg
       61 gcattctgcg cttctgtggc ttccaggtct tagaacctca actggtgtac agcattggcc
      121 acaccccacc cgatgcccgc gtgcaggtcc tggaagggtg gaagaagcgt ctggagactg
      181 tctgggagga gtcaccactc tactttgctc caagcagctt gtttgaccta aacttccagg
      241 caggattctt actgaaaaaa gaggttcaag aggagcagaa aaagaacaag tttggccttt
      301 ctgtgggcca tcatttgggc aagtccattc cagccgacaa ccagatcaaa gctagaaaat
      361 aaggttttcc ataccatgta gttagaccca ggtttctttt tccccagttt gcctggcttg
      421 ctttcagttt tcgcctttgt tccacaagga taggaaaagg aggaggctcg cctcatgcgt
      481 ttttggatag ttctgccacg gcgtgtgaca gcaaaatgaa cgaggtcaga ttaggggcct
      541 caggtggcct gggatatgaa tcagggagag gtgtagccgc gagcggggaa aatactcttc
      601 taggtctttt gtacactata agcttttttc ttcgggctag cctggctaaa tggcatccaa
      661 tcctccaccc acttgttgct attagttacc tctctgtggt ttagggcagg agggaattgc
      721 tcaaacaatg gctgagggac taacttgttt agcagttagc taaagcctgt ttatgatcca
      781 tcctggtttc aattactgtg cagtgactga caagcctcgg gggattgctc tccagctctt
      841 ctctgccttg tacatagcac acccaggtcc tgggaaatga atacaaaaac aggtctccgc
      901 ctcattcttg ctattctttc tttgtgtgtg tgtgtgtgga aataaatgga tatttcacac
      961 gtca
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       FPLTRAC      3227 bp ds-DNA             BCT       05-FEB-1990
DEFINITION  Plasmid F TraC protein (traC) gene, complete cds and TrbI protein
            (traI) gene, partial cds.
ACCESSION   M30936
KEYWORDS    TraC protein; TrbI protein.
SOURCE      plasmid F DNA, clone PKAS2.
  ORGANISM  Plasmid F
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 3227)
  AUTHORS   Schandel,K.A., Maneewannakul,S., Vonder Haar,R.A.,
            Ippen-Ihler,K. and Webster,R.E.
  TITLE     Nucleotide sequence of the F plasmid gene traC and identification
            of its product
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Printed copy of sequence [1] kindly submitted by R.E.Webster,
            20-DEC-1989
FEATURES       from  to/span     description
    pept        268     2895     TraC protein (gtg start codon)
    pept       2892  >  3227     TrbI protein
    binding     256      261     Shine-Delgarno sequence
BASE COUNT      785 a    827 c    900 g    715 t
ORIGIN      
        1 gcatgcggaa atcctattcc ggaagcccgg cggaaaatat ttcccggtgt gacgttgtgc
       61 gttgaatgtc aggcgtatca ggaaagacag agaaaacatt atgcataagt cagtcgcaga
      121 acatagtgat ttaattccgg atgaacatga gtggatattc agaaaacaga aatctttgta
      181 tatgcgccgg gaaatggcgc gttaattaca ggtattccct tcatggctac cgcatttctc
      241 gctttatttt tcaactaagg aattcatgtg aataacccac ttgaggccgt cactcaggcg
      301 gttaactccc tcgtcacagc actgaaactg cctgacgaat ccgcaaaggc caatgaagtt
      361 ctgggcgaaa tgagcttccc gcagttcagc cgtctgctgc cgtaccgtga ttacaaccag
      421 gaatccggtc tgttcatgaa tgacaccacg atgggcttta tgctggaagc cattcccatc
      481 aatggggcga atgagtccat tgtggaggcc ctcgatcata tgctgcgcac caaactgccg
      541 cgcggtattc cgttgtgtat ccatctgatg tccagtcagc tggttggtga caggattgaa
      601 tacgggctgc gtgagttctc ctggtctggc gaacaggccg aacggtttaa cgcgattacc
      661 cgggcctatt atatgaaagc ggcagcgaca cagtttccgc tgccggaggg aatgaatctg
      721 cccctgaccc tgcgccatta ccgggtgttt atctcgtact gttctccctc gaagaaaaaa
      781 agccgggccg acattctgga aatggaaaac ctggtgaaaa tcatccgggc gtcgttacag
      841 ggggccagta tcaccacaca gacggtggat gcacaggcct ttatcgatat tgtcggggag
      901 atgattaacc ataacccgga ttccctgtac ccgaaaagac gtcagctgga cccgtattct
      961 gatctgaatt atcagtgtgt ggaggacagt tttgatctga aagttcgggc tgattacctg
     1021 acactgggcc tgcgtgagaa cggcaggaac agcacggccc gcatcctgaa tttccatctg
     1081 gcccgtaacc cggaaatcgc cttcctgtgg aacatggccg acaactacag caacctgctg
     1141 aacccggaac tgtccatctc ctgtccgttc atcctgacgc tgactctggt ggtggaagac
     1201 caggtgaaaa cccacagcga agccaacctg aagtacatgg acctggagaa aaagtcgaag
     1261 acctcctatg ccaaatggtt tccgtccgtg gagaaagagg cgaaggagtg gggggaactg
     1321 cgtcagcggc tgggctccgg tcagtcctct gtcgtgtcct acttcctcaa catcacagcc
     1381 ttctgcaagg acaataatga aacggcactg gaagtggagc aggacatcct gaacagcttc
     1441 cgtaaaaacg gttttgagct gatttcaccg cgctttaacc acatgcgcaa tttcctgacc
     1501 tgtctgccct ttatggccgg gaaagggttg tttaaacagc tgaaagaggc cggagtggta
     1561 cagcgcgcag agagctttaa tgtggccaac ctgatgccgt tagtggcgga taaccccctg
     1621 acaccggcag gtctgctggc acccacctac cgtaaccagc tggcgtttat cgatattttc
     1681 ttccggggga tgaataacac caactataac atggcggtct gtggcacctc cggggccggt
     1741 aaaaccgggc tgatacagcc acttatccgc agcgtgctgg actccggagg ctttgccgtg
     1801 gtgttcgaca tgggggatgg atacaagtcc ctgtgtgaga acatgggcgg ggtgtatctg
     1861 gacggtgaaa ccctgcgttt taacccgttt gcgaacatca ccgatattga ccagtcagcg
     1921 gagcgtgtcc gtgaccagtt gtcagtgatg gccagcccca acggtaacct ggatgaagtg
     1981 catgaaggtc tgctgttgca ggcggtcaga gcctcctggc tggccaaaga gaacagagca
     2041 cgtattgatg acgtggtgga tttcctgaaa aacgccagtg acagcgagca gtatgccgag
     2101 tcaccgacta tccgcagccg tctggacgaa atgattgtgc tgcttgacca gtacactgcc
     2161 aacggcactt acggccagta ttttaactct gatgagccgt ccctgcggga tgacgcaaaa
     2221 atggtggtgc tggagctggg cggactggaa gaccgtccgt cactgctggt tgcggtgatg
     2281 ttctccctga ttatctacat cgagaacagg atgtaccgca cgccgcgtaa cctcaagaaa
     2341 ctgaacgtta ttgatgaagg ctggcgtctg ctggacttca aaaatcacaa ggtcggtgaa
     2401 tttattgaga aaggctaccg tacggcccgc cgtcataccg gtgcctatat caccatcaca
     2461 cagaacatcg tcgactttga ctctgataag gcctccagtg ctgcccgcgc ggcatggggt
     2521 aactcctcct acaaaattat cctcaaacag agtgcgaagg agttcgcgaa atacaaccaa
     2581 ctgtatccgg accagttcct gccactgcag cgcgacatga ttggtaagtt tggtgcggcc
     2641 aaagaccagt ggttcagttc cttcctgctg caggtggaaa accattcctc ctggcaccgt
     2701 ctgtttgtgg acccgttaag ccgcgccatg tacagctctg acggcccgga ttttgagttt
     2761 gtgcagcaga agcgtaaaga ggggctgagt attcatgagg cagtgtggca gctggcgtgg
     2821 aagaagtcag ggccggaaat ggcttcgctg gaagcctggc tggaagaaca tgagaaatac
     2881 aggagtgtgg catgagttca acgcagaaac ccgccgacgt cacggcagaa cgacgcagcc
     2941 actggtggtg gacggtgccg ggatgcctgg caatggtgtt actgaacgca gccgtcagtt
     3001 acggcattgt cagactgaac gcaccggtga ctgtcgcctt caacatgaaa cagaccgtgg
     3061 atgcgttttt tgacagcgcc agtcagaaac aactgtcaga agcgcagtca aaagcccttt
     3121 cagcgcgttt taacacggca ctggaagcca gcctgcaggc gtggcagcag aaacaccacg
     3181 cggtcattct ggtgtcgcct gctgtggtac agggggcacc ggatatc
//

GenBank-Updates@genbank.bio.net (04/10/90)

LOCUS       DDIGP24      3000 bp ds-DNA             INV       12-SEP-1989
DEFINITION  D.discoideum glycoprotein 24 A and B (GP24A and GP24B) genes,
            complete cds.
ACCESSION   M27588
KEYWORDS    adhesion protein; glycoprotein.
SOURCE      D.discoideum DNA, clones DG[2,3].
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 3000)
  AUTHORS   Loomis,W.F. and Fuller,D.L.
  TITLE     A pair of tandemly repeated genes code for gp24, a putative
            adhesion protein of Dictyostelium discoideum
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 886-890 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.F.Loomis, 08-SEP-1989, for release after publication.
FEATURES       from  to/span     description
    pept       1072     1132     glycoprotein 24A (GP24A), exon 1
               1234     1481     glycoprotein 24A (GP24A), exon 2
    pept       2585     2645     glycoprotein 24B (GP24B), exon 1
               2747     2997     glycoprotein 24B (GP24B), exon 2
    pre-msg    1023  >  2997     GP24A mRNA and intron
    pre-msg    2533  >  2997     GP24B mRNA and intron
    IVS        1133     1233     GP24A intron A
    IVS        2646     2746     GP24B intron A
    signal      812      817     GP24A G box
    signal      822      826     GP24A G box
    signal      994      999     GP24A TATA box
    signal     2334     2339     GP24B G box
    signal     2342     2346     GP24B G box
    signal     2504     2509     GP24B TATA box
BASE COUNT     1123 a    285 c    323 g   1269 t
ORIGIN      
        1 actccttttg atttggttgg cgttaatgaa tttgaatttt taattaattc tgattttaat
       61 tttaatgatt cctgagaaat aaaatattag tattataatt attataatta ttattattac
      121 tataatatat aaatataaat aatattaata aaaatatttt acaatattct ctatctcttc
      181 taaaatttgt ttaattggtt tctttaataa attcgttaaa ttatatattt gattttgata
      241 aagaggttga atattctcac tattttgaga ttttactact ctagtaattc tgtgacctga
      301 aatgatgtaa tatttatacg cttttgtgat cctaatggta ttgttgatgt tattgttgtt
      361 gttgttgttg ttgttgttgt tgttgttgtt gttgtcgata atgatggtgg attcgatttc
      421 ttttcaaatg agatcgaatt tgttttgaat cttttattgc tattatcatt actattatta
      481 tctacattct catcattaat tatatcattt gtatcactta agtttctttt tgtacctaac
      541 atgtttttat tatttgttgt atctattaca acatttgaaa aaaataaagt tgaataaaaa
      601 aaaaaaaaaa aaaaaaaaaa gactttattt tttaatttgg gcgccaattc aaaaaaaaaa
      661 aaaaaaaaaa aattaatttg ggaaatacag tttttttttt ttatgcccac catttgacat
      721 agataacctt ttataaataa taataaaacc tctaatttaa actgtttata aaaaaatggt
      781 ggaatgataa aaaaaaaaaa actgttttga ttgtggggga ggggtgaaga tgattgtcat
      841 tttattttgt tttatttatt attttttttt ttaatgggga tacaagtgag atataaatta
      901 caataattta ttattaaaat aaaacaatta attttttttt tctttttttt ttatttttta
      961 tttttttctt tttatttttt ttttttaatt atttaataaa attttttttc ctcattcaat
     1021 ttaaatataa ttactcttaa aaataaataa ataataataa taaattttaa aatggttgat
     1081 ttgaaaatta cattagttaa tgaagatggg gaatcaacaa taagcggtaa aggtaattgg
     1141 aataattaat ttctaaatat ttaattttat ttatatatat ttaaaaatct aacatatcat
     1201 ttcatttcat tctggctctt tttaaaaata taggtcatcc attaccagct cccttaatat
     1261 ttcctccaat atatattttc cgttttaccc aataccaaac tgaaggtaaa ctttgggaca
     1321 aaaatgaatt tcaaataaaa tcaggtaaaa ttgaattcga cggagaagaa tatgatattc
     1381 ctgaatctaa aggcacatgg agtaaagatg atgaggaaaa cgcaatcgat gttaatttac
     1441 atctatttag accaccagaa aaatttttcc caaaaaatta aagatcttaa aacagatctg
     1501 atacaatgaa aacaaagaaa taaaaaattt cattaatctt attttttcat catatataat
     1561 ttctatgttt ttaaaattta attatctgac atttaaataa ataagaaaca tatatgatgt
     1621 aaatgtatta ggtggttttg gttttcgttt gttcaattga ggaaacataa gataaattta
     1681 tctgacattt aaataaataa gaaacatata tgatgcaaat gtatttggag ttttgaaatt
     1741 aagaaattca acatcatatt aggtttaata atttacattt ctctgaaatg ggagatatta
     1801 ttcatcatat agttatacca aacattcagt tattgcaatt acattctttc cacaaaaagt
     1861 tttatttata atttgtttta tttgatttta attttgcaga tttgttgtag tgagttttca
     1921 cactaaattt tttttttttt tttttttaaa ttaaaaacaa ttaatttaac caaaacaata
     1981 aaaaaaacta tttataaaca gttatttgat tgttttaaga ttaaataaca atttacttta
     2041 aaatttgcgc caaatttttt ttttttaatt attttttttt ttttctatta tttttttttt
     2101 tttttttgct agtcagattt tagcaaattt tagaaatcaa attcaatgat tggtaaaaat
     2161 caagattttc aatcttttaa taatatatta cagcatcaac aacactcacc aaattaccat
     2221 atcacaaatt attgttatca gtttaataat tgacatagaa taaacttttt ttaaataata
     2281 ataaaacctc taatttaaac tgtttataaa aaaaaaaaaa aaactgtttt gattgtggga
     2341 ggggtgaaga tgattgtcat tttattttat tttatttatt attttttttt aatggggata
     2401 caagtgagat ataaattaca ataatttatt attaaaaaaa acaatttttt tatttttttt
     2461 tttttatttt tttttttctt ttttttattt ttttttaatt atttaataaa attttttttc
     2521 aacattcaat ttaaatataa ttactcttaa aaataaataa ataataataa taataaattt
     2581 taaaatgact gatttgaaaa ttacattagt taatgaagat ggggaatcaa caataagtgg
     2641 taaaggtaat tggaataatt aatttctaaa tatttaattt tatttatata tatttaaaaa
     2701 tctaacatat catttcattt tattctggat tttttataaa atataggtca tccattacca
     2761 gctcccctaa tatttcctcc aatatattgt ttctgtttta tccaatataa aactgaaggt
     2821 aaactttggg acaaaaatga ttttcaaata aaatcaggta aaattgaatt cggcggagaa
     2881 gaatatgata ttactgaatc taaaggcaca tggagtaaag atgatgaaga aaaccacatc
     2941 aaagttagtt tacatctaat tgtaccacca aaaaaaattt tccaaaaaaa tttttagatc
//

GenBank-Updates@genbank.bio.net (04/11/90)

LOCUS       BSUHEMAC     3795 bp ds-DNA             BCT       15-FEB-1990
DEFINITION  B.subtilis delta-aminolevulinate synthase (hemA) uroporphyrinogen I
            synthase (hemC) genes, complete cds.
ACCESSION   M32130
KEYWORDS    delta-aminolevulinate synthase; uroporphyrinogen I synthase.
SOURCE      B.subtilis DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 3795)
  AUTHORS   Petricek,M., Rutberg,L., Schroeder,I. and Hederstedt,
  TITLE     Cloning and characterization of the hemA region of the Bacillus
            subtilis chromosome
  JOURNAL   Unpublished (1990) Univ. of Lund, Sweden
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            I.C.Schroeder, 16-FEB-1990.
FEATURES       from  to/span     description
    pept        232     1599     delta-aminolevulinate synthase (hemA, EC
                                 2.3.1.37)
    pept       1607     2437     ORF2
    pept       2470     3414     uroporphyrinogen I synthase (hemC, EC 4.3.1.8)
BASE COUNT     1068 a    689 c    953 g   1085 t
ORIGIN      
        1 atgcatatca ccttcttgtt ttttagagct gatgtgtagt aaatttctgc tgtttttggt
       61 attgtcaata ggaatgcttc ttttccctga agctttttct aatatagcat aagaatttta
      121 aaatctgttc acattttgtg aaagaaacta tgttataatt attataaata atgagttcta
      181 tgttagaatg attataaatt aagattgggt gttgggggtg taattagagc gatgcatata
      241 cttgttgtgg gagtagatta taaatccgcc cctattgaga tacgtgaaaa agtaagtttt
      301 cagccgaatg agctggcaga agcaatggtg cagctgaaag aagagaaaag cattcttgaa
      361 aacatcattg tctcaacctg caaccgcact gaaatttatg cggtagtcga ccagcttcat
      421 accggccgtt attatataaa aaagttttta gctgattggt ttcaattaag caaagaagag
      481 ctgtcaccgt tcttaacgtt ttatgagagc gatgccgctg ttgagcattt attccgtgta
      541 gcctgcggac ttgattctat ggtgattggc gaaacgcaga ttctcggaca ggtacgcgac
      601 agctttaaaa cagctcagca agaaaaaacg atcgggacta tttttaatga gctgtttaag
      661 caggcagtta cagtgggcaa acggactcac gccgaaacag acattggctc aaatgcggtg
      721 tcagtaagct atgctgcagt tgaacttgcc aaaaaaatct tcggaaatct ttcaagcaag
      781 cacatattga ttctcggtgc gggaaaaatg ggcgagcttg ctgcggaaaa cctgcacgga
      841 cagggaatcg gcaaggtcac tgtcattaac cgaacatact tgaaagcgaa ggagcttgca
      901 gaccgttttt caggtgaagc gagaagcttg aatcagcttg aaagcgcgct tgcggaggct
      961 gatattttaa tcagttcaac cggtgcaagt gaatttgtcg tgtccaaaga gatgatggaa
     1021 aacgcgaata agcttcgcaa gggacgtccg ctgtttatgg tcgacattgc cgtgcctaga
     1081 gatcttgatc cggcgctgaa tgatcttgaa ggtgtttttc tttatgatat cgacgatctg
     1141 gaaggcattg tagaagcgaa catgaaagag cggagagaaa cagctgaaaa agttgaactg
     1201 ttaattgaag aaaccattgt ggaatttaaa caatggatga atacacttgg tgttgtgcct
     1261 gttatttctg cattgcgcga aaaggcgctt gccatccagt cagaaacgat ggacagcatt
     1321 gagcgtaagc tgcctcactt aagcacaaga gagaaaaaac tgttgaacaa acacaccaaa
     1381 agtattatta accaaatgct tcgtgatccg attttaaagg tgaaagagct tgcggcagat
     1441 gctgattctg aagaaaagct cgcgttgttt atgcagattt ttgatattga agaagctgcg
     1501 ggccgtcaaa tgatgaaaac cgttgaaagc agccagaagg tccactcttt taagaaggct
     1561 gaatcaaaag cgggctttag cccacttgta agtgagtgaa agctgaatga ttgatactgc
     1621 aatggcaaga cttaatgagg ggacaatcgt catttacgcg ttaagtgtac tcttttattt
     1681 tatagatttt cttcaacaca accggaaggc tggaaaaatg gccttctggt tgctttctat
     1741 tgtctggact ctgcaaaccg tgtatttggc ctattttatg tgggtgacgg ggcggtttcc
     1801 ggtattaaat gtgacagagg cactttattt ttatgcctgg gtgcttgtca cgctgtcact
     1861 tgtactgaca aagcttttac gtgttgactt tatcgtgttt tttacaaatg ttataggatt
     1921 ttctatgatc gccattcaca cattttcacc gacagagcag cagtcagctg ctttttccgg
     1981 gcagcttgta tccgagcttt tggtgattca tattacaatg gcgattcttt catacggcgc
     2041 tttttccctt tctttcgttt tttctgtgct atatatgttt caatatcatg tgctgaaaaa
     2101 gaaaaagtgg ggaaaatggc tgttgagaat agaagattta tctaagcttg attatatggc
     2161 gtatgtttta aatgtcattg gggttccgat gctgctgctg agtttgattc tcggcgtcat
     2221 ttgggcgtat gtctcactag aaacgctgta ttggtttgac gccaaagtgc ttggttcgtt
     2281 tgtcgtcctg ctgctgtaca gctattatct ttatatcagg ctgattaagg agctgcaagg
     2341 aaaggtcgct gcactgtgga atacggcttg ttttctggtg ctgatgatca attatttcct
     2401 gcttggaagc ctgtcgcaat tccattggtt cagttaaacg atgtcccaag cagattcggg
     2461 aggaaagaaa tgatgagaac gattaaagta ggttccagac ggagcaaact cgctatgact
     2521 caaacaaaat gggttattca aaaactgaag gaaatcaatc cttcgtttgc ttttgaaatt
     2581 aaagagatcg tgacaaaggg cgaccggatt gtcgatgtta cactctcaaa agtgggtgga
     2641 aaagggcttt ttgtcaaaga aattgaacag gcgcttttaa acgaagagat tgatatggca
     2701 gtgcacagca tgaaggacat gcctgctgtt ttgcctgaag gccttgtgat cggctgtatt
     2761 cctgaacggg aggacccgcg tgatgccctt atttcaaaga atcgcgtaaa gctttcagaa
     2821 atgaagaaag gtgctgtcat tggcacaagc agtttaagaa gaagcgcgca gcttttgatt
     2881 gagcgccctg accttacaat taaatggatt agaggtaata ttgatacaag acttcaaaag
     2941 ctggaaacag aggattatga cgcaattatt ttagcggctg ccggcctttc cagaatgggt
     3001 tggaagcaag atgtcgtaac cgaattcctt gagcctgagc gctgtttgcc tgctgtgggg
     3061 cagggagccc tggcgattga gtgccgagaa tcggatgaag agctgttggc gttgttttct
     3121 cagtttacag atgaatatac aaaacggact gtcttagcgg aacgtgcttt tttaaacgcg
     3181 atggagggcg gctgccaggt tccgatcgcg ggctactccg tgttaaatgg acaggatgaa
     3241 attgaaatga caggtcttgt cgcttcacct gacggcaaaa tcatttttaa agaaaccgtc
     3301 accggaaacg atccggagga agtaggaaag cgctgtgccg ctcttatggc tgacaaagga
     3361 gcaaaagatt taattgatcg tgtaaaacgg gagcttgacg aggatggaaa atgattttcc
     3421 gttgaaagga aaaacagtgc ttgtcacccg gaataaggca caggcagcat catttcagca
     3481 aaaagtggag gcgcttggcg gtaaagcggt tttaacctct ttgattacgt ttcgccgcgc
     3541 tttgccgaat gatgttgcgg aacaggtaag agaggatctt gccgcgccag gctggcttgt
     3601 ttttacaagt gtgaacgggg cagacttctt tttttcttat ctgaaggaaa atcagcttat
     3661 tctccctgcg cataaaaaaa ttgcagccgt cggtgaaaaa accgcgcgcc gtttaaaaat
     3721 gcataacgta tcggttgatg tgatgccaca ggagtatatt gctgaacaat tgcgtgacgc
     3781 tcttaagcag catgc
//
LOCUS       ECOAFR1      1476 bp ds-DNA             BCT       15-FEB-1990
DEFINITION  E.coli AF/R1 major pili subunit (afrA) gene, complete cds.
ACCESSION   M32083
KEYWORDS    afrA gene; major pili subunit; surface antigen.
SOURCE      E.coli (strain RDEC-1) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1127)
  AUTHORS   Wolf,M.K. and Boedeker,E.C.
  TITLE     Cloning of the genes for AF/R1 pili from rabbit enteroadherent
            Escherichia coli RDEC-1 and DNA sequence of the major structural
            subunit
  JOURNAL   Infect. Immun. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1476)
  AUTHORS   Wolf,M.K. and Boedeker,E.C.
  JOURNAL   Unpublished (1990) Walter reed Army Inst. of Res. Washington DC
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            M.K. Wolf, 13-FEB-1990.
FEATURES       from  to/span     description
    pept        627     1115     AF/R1 major pili subunit (afrA)
    site        349      390     region of dyad symmetry
    signal      493      498     pot. -35 region
    signal      515      520     pot. -10 region
    binding     612      616     pot. ribosome binding site
BASE COUNT      447 a    278 c    287 g    462 t      2 others
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcccta gtgaatgtct gctgggaatc ataaaacaat ctttctgata tatccacaat
       61 ttttaggttg gtaaatctta aaagaatagc cgctcgcgtt atcctgctta attgaatgta
      121 tttacctaaa gtaacaccta tgttttcttt aaacagtaat tgcagatacc gtctgctgta
      181 tccggagtaa tcaacgaggg catttatatc tatagatata ctctctaaat tatcatcaat
      241 gtactgtgtt atcgcgttta tcgtaagtgt tttcagcatg tacgtagctc ctatatgtat
      301 gtttacgtgt taccccacat catgttaata aaaccccttc tgttttttta gctgattgtg
      361 cattgtacac ataccgtgca caattagcta acaacgcaga ccaatatttt ttaaaatacc
      421 ccgcgttttc acatgacttg tatctattct cttagagaaa ttaatgcatc tctatcacat
      481 catgtgtagt actggacaaa tagtcatggg agcctattac cgaacagcga agatggcata
      541 tgttttctta ttaagaaaga ggaaagaata tggcgcactc gttttatctc aattttggta
      601 aaaaaaatat atggagaatg tcagaaatga aaaaaacatt tattgcgtct gtaattgtaa
      661 taaccataaa tacgggatca gcaattgctg ctcaaggcga tgttcagttc tttggtaccg
      721 ttactgcgaa gacctgtgat cttgtcgttg aacacgaggg ggctgtggtc aatatgattc
      781 agttgggttc tgtaactaat ggtggaacta atgctggcac cgatatcgga gcaaacaaat
      841 cgtttaccct gaagccagca tcaggggtga catgcaatac catcactact gctaaaatgg
      901 catggtcttc tcctgcaatg accgttaatg gtattggtaa tctatcaggt aaggctattg
      961 atgcccatgt gaagttagtg gcgattaaca gcacgggtaa agttcaaact gataccaacg
     1021 cagataagga aattaaagcg ggtcaaaata cagttgatta ctcaattact ggttctggcc
     1081 tactgatgaa ggctttaaat ttaaagctca gttaattggc ggtaccattc caggtgactt
     1141 cgatagtgct gctgcatatt ccgttgcata caactaatat ttgaatgtaa atccgggaag
     1201 cccctccctt cccggattta atatttagaa cagcatattt aactggtgcc cttaactttg
     1261 cttaggtgtg aagaggttag cttatgaaat taaaaacatt tcctaaaata tctctactgg
     1321 ccctgagtat atggtattct cactccagct tggctgatga acttaatctg gattttatac
     1381 agaacgtcag cgttattcca tcaattctga aaagtgacgc aatttacccg gaaggacaat
     1441 atatcgttga cgtaaccgta aataaagaac gtatdd
//
LOCUS       ECOCYS       5755 bp ds-DNA             BCT       14-FEB-1990
DEFINITION  E.coli thiosulfate binding protein (cysP), sulfate permease (cysT,
            cysW, cysA) and o-acetylserine (thiol)-lyase-B (cysM) genes,
            complete cds.
ACCESSION   M32101
KEYWORDS    cysA gene; cysM gene; cysP gene; cysT gene; cysW gene;
            o-acetylserine (thiol)-lyase-B; sulfate permease;
            thiosulfate binding protein.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1292 to 5755)
  AUTHORS   Sirko,A., Hryniewicz,M., Hulamicka,D. and Boeck,A.
  TITLE     Sulfate and thiosulfate transport in E.coli K12: Nucleotide
            sequence and expression of the cysTWAM gene cluster
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1291)
  AUTHORS   Hryniewicz,M., Sirko,A., Palucha,A., Boeck,A. and Hulamicka,D.
  TITLE     Sulfate and thiosulfate transport in E.coli K12: Identification of
            a gene encoding a novel protein involved in thiosulfate binding
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            D.Halanicka, 14-FEB-1990.
FEATURES       from  to/span     description
    pept        559     1575     thiosulfate binding protein (cysP)
    pept       1575     2408     sulfate permease (cysT)
    pept       2408     3283     sulfate permease (cysW)
    pept       3273     4370     sulfate permease (cysA)
    pept       4505     5416     o-acetylserine (thiol)-lyase-B (cysM; gtg
                                 start codon; EC 4.2.99.8)
    signal      516      521     -10 region
    site       5737     5755     right end of mu
BASE COUNT     1259 a   1477 c   1682 g   1337 t
ORIGIN      52 min on K12 map.
        1 gttaacgcca tttgcccggg atacgtgcgc acaccaatgg cggaaagcat tgcccgccag
       61 tcgaacccgg aagatccaga gtcggtgctg actgaaatgg cgaaagcaat cccgatgcgt
      121 cgcctcgccg atccgctgga agtcggcgaa ctggcggcct tcctcgcatc ggatgaatcc
      181 agctatttaa ccggtacaca gaatgtgatt gatggcggca gcacactgcc ggagacggtt
      241 agcgtcggta tctgattcac ctctgtttcc tccctgcatt tgtggggagg atttcgtctt
      301 gaactaagtt caccaggcta ttttatttgt cattttggcc ccgggcagtg ctcgaaatcc
      361 tcacgtacta tgtgtacgct ccggtttctc cgcgctgttc gtgtccaaac tgactgcaac
      421 aattacgcct gttgaaccaa gttcttattc ccttttcaac ttccaaatca ccaaacggta
      481 tataaaaccg ttactccttt cacgtccgtt ataaatatga tggctattag aaagtcatta
      541 aatttataag ggtgcgcaat ggccgttaac ttactgaaaa agaactcact cgcgctggtc
      601 gcttctctgc tgctggcggg ccatgtacag gcaacggaac tgctgaacag ttcttatgac
      661 gtctcccgcg agctgtttgc cgccctgaat ccgccgtttg agcaacaatg ggcaaaagat
      721 aacggcggcg acaaactgac gataaaacaa tctcatgccg ggtcatcaaa acaggcgctg
      781 gcgattttac agggcttaaa agccgacgtt gtcacttata accaggtgac cgacgtacaa
      841 atcctgcacg ataaaggcaa gctgatcccg gccgactggc agtcgcgcct gccgaataat
      901 agctcgccgt tctactccac catgggcttc ctggtgcgta agggtaaccc gaagaatatc
      961 cacgattgga acgacctggt gcgctccgac gtgaagctga ttttcccgaa cccgaaaacg
     1021 tcgggtaacg cgcgttatac ctatctggcg gcatggggcg cagcggataa agctgacggt
     1081 ggtgacaaag gcaaaaccga acagtttatg acccagttcc tgaaaaacgt tgaagtgttc
     1141 gatactggcg gtcgtggcgc gaccaccact tttgccgagc gcggcctggg cgatgtgctg
     1201 attagcttcg aatcggaagt gaacaacatc cgtaaacagt atgaagcgca gggctttgaa
     1261 gtggtgattc cgaaaaccaa cattctggcg gaattcccgg tggcgtgggt tgataaaaac
     1321 gtgcaggcca acggtacgga aaaagccgcc aaagcctatc tgaactggct ctatagcccg
     1381 caggcgcaaa ccatcatcac cgactattac taccgcgtga ataacccgga ggtgatggac
     1441 aaactgaaag acaaattccc gcagaccgag ctgttccgcg tggaagacaa atttggctcc
     1501 tggccggaag tgatgaaaac ccacttcacc agcggcggcg agttagacaa gctgttagcg
     1561 gcggggcgta actgatgttt gctgtctcct ccagacgcgt gctgccgggc tttaccttaa
     1621 gcctcggcac cagtctgctg tttgtgtgcc tgattttgct gctgccgctc tccgcgctgg
     1681 tgatgcaact ggcccagatg agctgggcgc agtactggga ggtgatcacc aacccgcagg
     1741 tggtcgcggc ctacaaagta acgctgctgt cggcgtttgt ggcatcgatt tttaacggcg
     1801 ttttcggtct gctgatggcg tggatcctaa cccgctatcg cttcccaggc cgcacgctgc
     1861 ttgatgcgct gatggattta ccctttgcgc tgccaacggc tgtcgccggt ttaacgctgg
     1921 cctcgctctt ttccgtaaac ggtttttacg gtgaatggct ggcgaagttt gatatcaaag
     1981 tcacctatac atggctgggg attgcggtgg ctatggcctt taccagcatt ccgtttgtgg
     2041 tgcgtaccgt gcagccggtg ctggaagagt taggcccgga atatgaagaa gcggcggaaa
     2101 cgcttggtgc aacgcgctgg cagagtttct gcaaagtggt gctgccggag ctttctccgg
     2161 cgctggtggc gggcgtggcg ctgtcgttta cccgtagtct tggtgaattt ggcgcggtga
     2221 tttttatcgc cggaaatatc gcgtggaaga cggaagtgac gtcgctgatg atttttgtgc
     2281 gcttacagga gtttgattac ccggcagcga gcgcgattgc ttcggtgatc ctcgcggcat
     2341 ctctgctgct gctgttctca attaacactc tgcaaagtcg ctttggtcgg cgtgtggtag
     2401 gtcattaatg gcggaagtta cccaattgaa gcgttatgac gcgcgcccga ttaactgggg
     2461 caaatggttt ctgattggca tcgggatgct ggtttcggcg ttcatcctgc tggtgccgat
     2521 gatttacatc ttcgtgcagg cattcagcaa ggggctgatg ccggttttac agaatctggc
     2581 cgatccggac atgctgcacg ccatctggct gacggtgatg atcgcgctga ttgccgtacc
     2641 ggtaaacctg gtgttcggca ttctgctggc ctggctggtg acgcgcttta acttccctgg
     2701 acgccagtta ctgctgacgc tactggacat tccgtttgcc gtatcgccgg tggttgccgg
     2761 tctggtgtat ttgctgttct acggctctaa cggcccgctc ggcggttggc tcgacgagca
     2821 taacctgcaa attatgttct cctggccggg aatggtgctg gtcaccatct tcgtgacgtg
     2881 tccgtttgtg gtgcgcgaac tggtgccggt gatgttaagc cagggcagcc aggaagacga
     2941 agcggcgatt ttgcttggcg cgtccggctg gcagatgttc cgtcgcgtca cattaccgaa
     3001 catccgctgg gcgctgcttt atggcgtggt gttgaccaac gcccgcgcaa ttggcgagtt
     3061 tggcgcggtg tcggtggttt ccggctcgat tcgcggcgaa accctgtcgc tgccgttaca
     3121 gattgaattg ctggagcagg actacaacac cgtcggctcc tttaccgctg cggcgctgtt
     3181 aacgctgatg gcgattatca ccctgttttt aaaaagtatg ttgcagtggc gcctggagaa
     3241 tcaggaaaaa cgcgcacagc aggaggaaca tcatgagcat tgagattgcc aatattaaga
     3301 agtcgtttgg tcgcacccag gtgctgaacg atatctcact ggatattcct tcaggtcaga
     3361 tggtcgcgtt gctggggccg tccggttccg ggaaaaccac gctgctgcgc attatcgccg
     3421 ggctggagca tcaaaccagc gggcatattc gcttccacgg caccgacgtg agccgcctgc
     3481 acgcacgtga tcgtaaagtc ggtttcgtgt tccagcatta cgcgctgttc cgccatatga
     3541 cggtgttcga caatatcgct tttggcctga cggtgctgcc gcgtcgcgag cgcccgaatg
     3601 ccgcagccat caaagcgaaa gtgacaaaat tgctggaaat ggtccagctt gcccatctgg
     3661 cggatcgtta tccggcgcac gtttccggcg gccagaaaca gcgcgtggcg ctggcgcgcg
     3721 cgctggctgt ggaaccgcaa attctgctgc ttgatgaacc gtttggcgcg ctggatgcgc
     3781 aggtgcgtaa agagctgcgt cgctggctgc gtcaactcca tgaagaacta aaattcacca
     3841 gcgtttttgt gacccacgat caggaagaag cgaccgaagt agctgatcgt gtagttgtga
     3901 tgagccaggg caatattgaa caggctgacg cgccggatca ggtatggcgc gaaccggcga
     3961 cccgttttgt gctcgaattt atgggcgaag tgaaccgcct gcagggaacc attcgcggcg
     4021 ggcagttcca tgttggcgcg catcgctggc cgctgggcta cacacctgcg tatcaggggc
     4081 cggtggatct cttcctgcgc ccttgggaag tggatatcag ccgccgtacc agcctcgatt
     4141 cgccgctgcc ggtacaggta ctggaagcca gcccgaaagg tcactacacc caattagtgg
     4201 tgcagccgct ggggtggtac aacgaaccgc tgacggtcgt gatgcatggc gacgatgccc
     4261 cgcagcgtgg cgagcgttta ttcgttggtc tgcaacatgc gcggctgtat aacggcgacg
     4321 agcgtatcga aacccgcgat gaggaacttg ctctcgcaca aagcgcctga taggttgagt
     4381 gaatgttaaa cgcccggagg cgcttcccgc gagtccgggc tttttaatgg caaggtttgt
     4441 aacctgtaga cctgataaga cgcgcaagcg tcgcatcagg caacaccacg tatggataga
     4501 gatcgtgagt acattagaac aaacaatagg caatacgcct ctggtgaagt tgcagcgaat
     4561 ggggccggat aacggcagtg aagtgtggtt aaaactggaa ggcaataacc cggcaggttc
     4621 ggtgaaagat cgtgcggcac tttcgatgat cgtcgaggcg gaaaagcgcg gggaaattaa
     4681 accgggtgat gtcttaatcg aagccaccag tggtaacacc ggcattgcgc tggcaatgat
     4741 tgccgcgctg aaaggctatc gcatgaaatt gctgatgccc gacaacatga gccaggaacg
     4801 ccgtgcggcg atgcgtgctt atggtgcgga actgattctt gtcaccaaag agcagggcat
     4861 ggaaggtgcg cgcgatctgg cgctggagat ggcgaatcgt ggcgaaggaa agctgctcga
     4921 tcagttcaat aatcccgata acccttatgc gcattacacc accactgggc cggaaatctg
     4981 gcagcaaacc ggcgggcgca tcactcattt tgtctccagc atggggacga ccggcactat
     5041 caccggcgtc tcacgcttta tgcgcgaaca atccaaaccg gtgaccattg tcggcctgca
     5101 accggaagag ggcagcagca ttcccggcat tcgccgctgg cctacggaat atctgccggg
     5161 gattttcaac gcttctctgg tggatgaggt gctggatatt catcagcgcg atgcggaaaa
     5221 caccatgcgc gaactggcgg tgcgggaagg aatattctgt ggcgtcagct ccggcggcgc
     5281 ggttgccgga gcactgcggg tggcaaaagc taaccctgac gcggtggtgg tggcgatcat
     5341 ctgcgatcgt ggcgatcgct acctttctac cggggtgttt ggggaagagc attttagcca
     5401 gggggcgggg atttaaggat taatagcatc ggagactgat gacaaacgca aaactgcctg
     5461 atgcgctacg cttatcaggc ctacaaggtt tctgcaatat attgaattag cacgattttg
     5521 taggccggat aaggcgttta cgccgcatcc ggcataaaca aagcgcactt ttttaacagt
     5581 tgttgctgcc gacaaatgca gtatttaatt ttcgtgagga aacgccgtaa ggtcattgaa
     5641 gcggcgcacg aaaaaccgaa agcgtttcac gataaatgcg aaaactttac gtttcgcgct
     5701 tcaaatgaaa cagatgtatt aattactact ttttattcat tacatgggga tccag
//
LOCUS       HUMCOLA2I    1994 bp ds-DNA             PRI       15-FEB-1990
DEFINITION  Human collagen type I alpha-2 (COL1A2) gene, exon 1 (partial).
ACCESSION   M31886
KEYWORDS    collagen.
SOURCE      Human DNA, clone pCOL-alpha-2-bGH.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1994)
  AUTHORS   Sherwood,A.L., Bottenus,R.E., Martzen,M.R. and Bornstein,P.
  TITLE     Structural and functional analysis of the first intron of the human
            alpha-2(I) collagen-encoding gene
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.Bornstein, 02-FEB-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >  1994     collagen alpha-2 type I mRNA and intron
                                 /nomgen="COL1A2" /map="7q21.3-q22.1"
                                 /hgml_locus_uid="LP0002V"
    IVS         156  >  1994     COL1A2 intron A
    binding     141      154     NF1 binding site
    binding    1034     1040     AP1 binding site
    binding    1061     1067     AP1 binding site
    site       1370     1409     gt-rich region
BASE COUNT      580 a    413 c    456 g    545 t
ORIGIN      
        1 gcatgcccgc gcccgccagg tgatacctcc gccggtgacc caggggctct gcgacacaag
       61 gagtctgcat gtctaagtgc tagacatgct cagctttgtg gatacgcgga ctttgttgct
      121 gcttgcagta accttatgcc tagcaacatg ccaatgtaag tgccttcagc ttgtttgggg
      181 gagactgggt agagaggtta gatgggaggg caccctgccc tgaaaaggaa aacctgtaac
      241 ctgaattcca ggtacacttg gagggcagac tctcaggcat gtgggaaaac gccggaattg
      301 ataagaaaca tggaaattac tttaaaaaat gaaaacataa aagccttgcc aaaagttagg
      361 gaacttttcc tctaagttca gagtgagaca gttaactcgg tctggctcct cagcttagta
      421 acccccaaag ggagcggaag gtctttttcc ctaaggatga gatattaacg accaatgtgg
      481 tggaggaagt caagggcctg caccccacag gccccataac cgcactgatg tccaccttgt
      541 aaaacttgag gcctgcgtta gaaagccctt caactgagta atgtaaaact cacctcctaa
      601 gagcttttat cttctgggca ttgtaaggct tgtccggagg aggaggatga cgatgctgat
      661 atgatgatgg ttataaggcg ccctctggag gaaggaaaat gaaagtacag gggacagggc
      721 cttaagcaga tggaatccca attaaagctt ctacggattt atacagatta atgatcagca
      781 tttctggttg gagcctttcc cagtggctag tcagtgaacc ctggaaagaa gaatggatgc
      841 tacttggagt gggtacattc tgaaaagtaa tataagtgtc tcaattcact ttctagtcat
      901 ggaaatggta acatttttta actcaaatct gctctaaatt ttgtttgagc ctgagaatta
      961 cccctttgac atgttcccag tgataagcaa acattatgaa cgcagcaagt tgagaaatat
     1021 caacattgag atgagactca agagaccggg gtttttccca tgagtctgac accaatttgc
     1081 tgcgtgactt tgggcaagtc aaacggcctt ttctaaaatg tgagacagag attaaaggga
     1141 ccccaaggcc actttccagc tctaggttcc atggccagac tttcatgtca acagagaatg
     1201 aagaagatca gtccgttttc atcttgaaaa tggctgccaa agtgctagac aaagatattg
     1261 actagatggg ggatggtatt gtctgaccac acccagtact ccaaaaagtt gttccaccca
     1321 cacagcacgg tgtctaccac tgcataattt ctaatgcatt tgtgtgcttg tgtgtgtgtg
     1381 tgtgtgtgtg tgtctgtgtg tctgtgtgtc tcttccccct tcattcactt ttagtataca
     1441 tactgtggat actaaggagt aattgcagtg aacaaattca cattaccgag ttcatatttt
     1501 taatgagatc ttgagagtgg gaggaaagag tcggctccta gagaataaaa tgaaggcaga
     1561 cttagggaaa tttgaaggta caaaggcaac ttaccttctg atcaacagcc aaccacagtc
     1621 tggaataaat gttatcaaac acacattctt caaaatggtc cgtgtctgag taattaaaag
     1681 gcaaatttcc aaaatcataa ggacttccgt taatcaagtc aggcataatt attcttccta
     1741 ctgatgacac aatgaagtaa acatatcatt cttgtaattt aacagtaatt ctcgtaaatt
     1801 gcccttaaat gtcagtgctg gatgtggtcc accctcctaa attgtgactg ttgcaacaga
     1861 tgttctcact tcaaataacg cacttcttgg ccacctaatt aaagcaattt ttggggtgat
     1921 tcatcctact gcaagcttgg ccacacttgt atcctgtatt aacctataat ttttgtaccg
     1981 taggagaaga attc
//
LOCUS       HUMP120PC    2612 bp ss-mRNA            PRI       14-FEB-1990
DEFINITION  Human proliferating-cell nucleolar protein P120 mRNA, complete cds.
ACCESSION   M32110
KEYWORDS    proliferating cell nuclear protein.
SOURCE      Human fetal liver cell line CML and testis, cDNA to mRNA, and lymph
            node DNA (bases 1 to 30).
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2612)
  AUTHORS   Fonagy,A., Henning,D., Jhiang,S., Haidar,M., Busch,R.K., Larson,R.,
            Valdez,B. and Busch,H.
  TITLE     Cloning of the cDNA and sequence of the human proliferating cell
            nuclear protein P120
  JOURNAL   Cancer Communications 1, 243-251 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            H.Busch, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1     2612     proliferating cell nuclear protein P120 (AA at
                                 3)
BASE COUNT      674 a    701 c    725 g    512 t
ORIGIN      346 bp upstream of AvaI site.
        1 cacgcgcgac gccaccttct cccatttctg cctgccacag taccatgggg cgcaagttgg
       61 accctacgaa ggagaagcgg gggccaggcc gaaaggcccg gaagcagaag ggtgccgaga
      121 cagaactcgt cagattcttg cctgcagtaa gtgacgaaaa ttccaagagg ctgtctagtc
      181 gtgctcgaaa gagggcagcc aagaggagat tgggctctgt tgaagcccct aagacaaata
      241 agtctcctga ggccaaacca tcgcctggaa agctaccaaa agggatctct gcaggagctg
      301 tccagacagc tggtaagaag ggaccccagt ccctatttaa tgctcctcga ggcaagaagc
      361 gcccagcacc tggcagtgat gaggaagagg aggaggaaga ctctgaagaa gatggtatgg
      421 tgaaccacgg ggacctctgg ggctccgagg acgatgctga tacggtagat gactatggag
      481 ctgactccaa ctctgaggat gaggaggaag gtgaagcgtt gctgcccatt gaaagagctg
      541 ctcggaagca gaaggcccgg gaagctgctg ctgggatcca gtggagtgaa gaggagaccg
      601 aggacgagga ggaagagaaa gaagtgaccc ctgagtcagg ccccccaaag gtggaagagg
      661 cagatggggg cctgcagatc aatgtggatg aggaaccatt tgtgctgccc cctgctgggg
      721 agatggagca ggatgcccag gctccagacc tgcaacgagt tcacaagcgg atccaggata
      781 ttgtgggaat tctgcgtgat tttggggctc agcgggagga agggcggtct cgttctgaat
      841 acctgaaccg gctcaagaag gatctggcca tttactactc ctatggagac ttcctgcttg
      901 gcaagctcat ggacctcttc cctctgtctg agctggtgga gttcttagaa gctaatgagg
      961 tgcctcggcc cgtcaccctc cggaccaata ccttgaaaac ccgacgccga gaccttgcac
     1021 aggctctaat caatcgtggg gttaacctgg atcccctggg caagtggtca aagactggac
     1081 tagtggtgta tgattcttct gtgcccattg gtgctacccc cgagtacctg gctgggcact
     1141 acatgctgca gggagcctcc agcatgttgc ccgtcatggc cttggcaccc caggaacatg
     1201 agcggatcct ggacatgtgt tgtgcccctg gaggaaagac cagctacatg gcccagctga
     1261 tgaagaacac gggtgtgatc cttgccaatg acgccaatgc tgagcggctc aagagtgttg
     1321 tgggcaactt gcatcggctg ggagtcacca acaccattat cagccactat gatgggcgcc
     1381 agttccccaa ggtggtgggg ggctttgacc gagtactgct ggatgctccc tgcagtggca
     1441 ctggggtcat ctccaaggat ccagccgtga agactaacaa ggatgagaag gacatcctgc
     1501 gctgtgctca cctccagaag gagttgctcc tgagtgctat tgactctgtc aatgcgacct
     1561 ccaagacagg aggctacctg gtttactgca cctgttctat cacagtagaa gagaatgagt
     1621 gggtggtaga ctatgctctg aaaaagagga atgtgcgact ggtgcccacg ggcctagact
     1681 ttggccagga aggttttacc cgctttcgag aaaggcgctt ccaccccagt ctgcgttcta
     1741 cccgacgctt ctaccctcat acccacaata tggatgggtt cttcattgcc aagttcaaga
     1801 aattttccaa ttctatccct cagtcccaga caggaaattc tgaaacagcc acacctacaa
     1861 atgtagactt gcctcaggtc atccccaagt ctgagaacag cagccagcca gccaagaaag
     1921 ccaagggggc tggaaagaca aagcagcagc tgcagaaaca gcaacatccc aagaaggcct
     1981 ccttccagaa gctgaatggc atctccaaag gggcagactc agaattgtcc actgtacctt
     2041 ctgtcacaaa gacccaagct tcctccagct tccaggatag cagtcagcca gctggaaaag
     2101 ccgaagggat cagggagcca aaggtgactg ggaagctaaa gcaacgatca cctaaattac
     2161 agtcctccaa gaaagttgct ttcctcaggc agaatgcccc tcccaagggc acagacacac
     2221 aaacaccggc tgtgttatcc ccatccaaga ctcaggccac cctgaaacct aaggaccatc
     2281 atcagcccct tggaagggcc aagggggttg agaagcagca gttcgcagag cagccttttg
     2341 agaaagctgc cttccagaaa cagaatgata cccccaaggg cctcagcctc ccactgtgtc
     2401 tcccatccgt tccagccgcc ccccaccagc aaagaggaag aaatctcagt ccaggggcaa
     2461 cagccagctg ctgctatctt agatggttga aaactagacg ggtggctcac tgccattgtc
     2521 accaggttgg aactcttgcc tctgtgagga tgccttctct actgtgcata cccatgaaat
     2581 ttaatacaca ttttaaaacc tctggccact ga
//
LOCUS       MUSH2RIIBP   2204 bp ss-mRNA            ROD       10-APR-1990
DEFINITION  Mouse MHC class I regulatory element binding protein (H-2RIIBP)
            mRNA, 3' end.
ACCESSION   M26804
KEYWORDS    MHC class I regulatory element binding protein.
SOURCE      Mouse liver, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2204)
  AUTHORS   Hamada,K., Gleason,S.L., Levi,B.-Z., Hirschfeld,S., Appella,E. and
            Ozato,K.
  TITLE     H-2RIIBP, a member of the nuclear hormone receptor superfamily that
            binds to both the regulatory element of major histocompatibility
            class I genes and the estrogen response element
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8289-8293 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by author, 10-AUG-1989.
FEATURES       from  to/span     description
    pept    <     1     1341     MHC class I regulatory element binding protein
                                 (H-2RIIBP)
    mRNA    <     1     2204     H-2RIIBP mRNA
BASE COUNT      436 a    645 c    648 g    475 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccccc gaagcccaga cagctcctcc ccaaatcccc tttctcaggg gatccgtccg
       61 tcttctcctc ctggcccacc tcttacccct tcagcacctc cacctccaat gccacccccg
      121 ccactgggct cccccttccc agtcatcagt tcttccatgg ggtcccctgg tctgccccct
      181 ccggctcccc caggattctc cgggcctgtc agcagccctc agatcaactc cacagtgtcg
      241 ctccctgggg gtgggtctgg cccccctgaa gatgtgaagc caccggtctt aggggtccgg
      301 ggcctgcact gtccaccccc tccaggtggt cctggggctg gcaaacggct ctgtgcaatc
      361 tgcggggacc gaagctcagg caagcactat ggggtttaca gctgcgaggg ctgcaagggt
      421 ttcttcaagc gcaccattcg gaaggacctg acctactcgt gtcgtgataa caaagactgt
      481 acagtggaca agcgccagcg gaatcgctgt cagtactgtc gctatcagaa gtgcctggcc
      541 actggcatga aaagggaggc ggttcaggag gagcgtcaac gggggaagga caaagacggg
      601 gatggagatg gggctggggg agcccctgag gagatgcctg tggacaggat cctggaggca
      661 gagcttgctg tggagcagaa gagtgaccaa ggcgttgagg gtcctggggc caccgggggt
      721 ggtggcagca gcccaaatga cccagtgact aacatctgcc aggcagctga caaacagctg
      781 ttcacactcg ttgagtgggc aaagaggatc ccgcacttct cctccctacc tctggacgat
      841 caggtcatac tgctgcgggc aggctggaac gagctcctca ttgcgtcctt ctcccatcgg
      901 tccattgatg tccgagatgg catcctcctg gccacgggtc ttcatgtgca cagaaactca
      961 gcccattccg caggcgtggg agccatcttt gatcgggtgc tgacagagct agtgtccaaa
     1021 atgcgtgaca tgaggatgga caagacagag cttggctgcc tgcgggcaat catactgttt
     1081 aatccagacg ccaagggcct ctccaaccct ggagaggtgg agatccttcg ggagaaggtg
     1141 tacgcctcac tggagaccta ttgcaagcag aagtaccctg agcagcaggg ccggtttgcc
     1201 aagctgctgt tacgtcttcc tgccctccgc tccatcggcc tcaagtgtct ggagcacctg
     1261 ttcttcttca agctcattgg cgacaccccc attgacacct tcctcatgga gatgcttgag
     1321 gctccccacc agctagcctg agcccagatg cacaccgagt gtcactgagg aggacttgag
     1381 cctgggcagg gggcagagcc atgggacagg tgcagagcag gaggggactt gcccagcctg
     1441 ccagggatct ggcaacactt agcagggttc gcttggtctc caagtcgaag gggaccccag
     1501 atccctgtga ggactttatg tctaccttca gtggccttga gtctctgaat ttgtcggggt
     1561 ctcccatggt gcaggtgatt cttcatcctg gctccccagc acaaagcact gccctgcttc
     1621 cttctcattt ggcctcactc ccttctgaag agtggaacag agctccccca gaaaggggtg
     1681 ttgtggggca ggccccccaa gctgatgatc atgggagcag ggctctgaca gcctttatcc
     1741 tctcagactt gacagatggg ggcagaggag ggacctgcct ctgtctcctg tcagccccat
     1801 ttccacagtc cctcctgcag tcagactgaa gaataaaggg gtagtgaagg ggctgctgga
     1861 ggtggaggaa cccattgctc ttttaatttc ctgtgaggag agactgggag ttagactcaa
     1921 agaagtactg tacatcccca ggttgactta aatgtcaggg ctggagatgg catgtgggca
     1981 aggaggcccc tcaggtgggc tgtcccaaag ctccctgggc tctgcctcgg gtggccctac
     2041 agctcttccc tagtcttaag cacagctagg ctgggagcaa gtggggacat tgatgggggt
     2101 ggccagcctg cagagttggg tgctgggctg catggttttt gccctggacc tcttttgggg
     2161 gttccctccc atctttcact tgcacataaa gttgctttcc agtt
//
LOCUS       MUSID         927 bp ss-mRNA            ROD       15-FEB-1990
DEFINITION  Mouse helix-loop-helix DNA binding protein regulator (Id) mRNA, 3'
            end.
ACCESSION   M31885
KEYWORDS    helix-loop-helix DNA binding protein regulator;
            helix-loop-helix protein; regulatory protein.
SOURCE      Mouse (strain DBA2) erythroleukemia cell line MEL, cDNA to mRNA,
            clone pMH18.
REFERENCE   1  (bases 1 to 927)
  AUTHORS   Benezra,R., Davis,R.L., Lockshon,D., Turner,D.L. and Weintraub,H.
  TITLE     The protein Id: A negative regulator of helix-loop DNA binding
            proteins
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            R.Benezra, 02-FEB-1990
FEATURES       from  to/span     description
    pept    <     1      533     helix-loop-helix protein (Id) (AA at 3)
    site        261      305     helix 1
    site        306      335     loop
    site        336      383     helix 2
BASE COUNT      192 a    256 c    259 g    220 t
ORIGIN      
        1 attgtacaac ctttctccaa cttcttgttc tcttcccaca ctctgttctc agcctcctcc
       61 gctcccctcc gcctgttctc aggatcatga aggtcgccag tggcagtgcc gcagccgctg
      121 caggccctag ctgttcgctg aaggcgggca ggacagcggg cgaggtggta cttggtctgt
      181 cggagcaaag cgtggccatc tcgcgctgcg ctgggacgcg cctgcccgcc ttgctggacg
      241 agcagcaggt gaacgtcctg ctctacgaca tgaacggctg ctactcacgc ctcaaggagc
      301 tggtgcccac cctgccccag aaccgcaaag tgagcaaggt ggagatcctg cagcatgtaa
      361 tcgactacat cagggacctg cagctggagc tgaactcgga gtctgaagtc gggaccaccg
      421 gaggccgggg actgcctgtc cgcgccccgc tcagcaccct gaacggcgag atcagtgcct
      481 tggcggccga ggcggcatgt gttccagccg acgatcgcat cttgtgtcgc tgaggcggcg
      541 cactgaggga ccagatggac tccagccctt caggaggcaa gaggaaaaaa gtgctctcgg
      601 ttccccaggg gatctctggg aaagacacta ccgcagccac cggactcttg gcggatcggt
      661 ccagtgggta gagggtttga tcaacagagc ctcaccctct ccacctttca gcctccagag
      721 actttgggga gggggttaat caaccccgcg tgtttctgtt ttattgaaaa agcagacatt
      781 ttttttaaat ggtcacattt cgtgcttctc ggatttctga ggaaatattt tatattgtat
      841 attacaatga tcactggctg aaaatattgt tttacaatag ttctatgggg gtgggttttt
      901 tgttgttatt aaacaaacac tttagat
//
LOCUS       MZEPPDK       644 bp ds-DNA             PLN       16-FEB-1990
DEFINITION  Z.mays pyruvate orthophosphate dikinase (PPDK) gene, 3' end.
ACCESSION   M32081
KEYWORDS    pyruvate orthophosphate dikinase.
SOURCE      Z.mays mays (strain B73) DNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 644)
  AUTHORS   Glackin,C.A. and Grula,J.W.
  TITLE     Organ-specific transcripts of different size and abundance
  JOURNAL   Unpublished (1990) In Press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            C.A.Glacken, 13-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1       23     pyruvate orthophosphate dikinase (PPDK) (AA at
                                 3)
    mRNA    <     1      322     PPDK mRNA
BASE COUNT      150 a    148 c    148 g    198 t
ORIGIN      
        1 ctgcagctca ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc ctgcagctca
       61 ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc tgttggtgca tgctggtgat
      121 taataatact actatgacag agccatatgc tgttggtgca tgctggtgat taataatact
      181 actatgacag agccatatgc tctgtgaaga gtattagtag cagcgctcat aaaagctaca
      241 gttccatcta tctgtgaaga gtattagtag cagcgctcat aaaagctaca gttccatcta
      301 ttttctcagc tatgtaaaac ttccaaactg ttcatgctta aaactgaggg ttttctcagc
      361 tatgtaaaac ttccaaactg ttcatgctta aaactgaggg ttttcgtggt gtgagatgtg
      421 catgtcgttg ttgaggccat tgctgcacat ttttcgtggt gtgagatgtg catgtcgttg
      481 ttgaggccat tgctgcacat tccacctatt gaggccctcc tcaaattaag cctcgaacaa
      541 gctgatcatc tccacctatt gaggccctcc tcaaattaag cctcgaacaa gctgatcatc
      601 ttttctgaga actctagact cgttttctga gaactctaga ctcg
//
LOCUS       RATCLATP     4350 bp ss-mRNA            ROD       15-FEB-1990
DEFINITION  Rat ATP citrate-lyase mRNA, complete cds.
ACCESSION   J05210
KEYWORDS    ATP citrate-lyase.
SOURCE      Rat liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4350)
  AUTHORS   Elshourbagy,N.A., Near,J.C., Metz,P.J., Sathe,G.M., Southan,C.,
            Stickler,J.E., Gross,M., Young,J.F., Wells,T.N.C. and Groot,P.H.E.
  TITLE     Rat ATP citrate-lyase: Molecular cloning and sequencing analysis of
            a full length cDNA and mRNA abundance as a function of diet, organ,
            and age
  JOURNAL   J. Biol. Chem. 265, 1430-1435 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            N.A.Elshourbagy, 13-FEB-1990.
FEATURES       from  to/span     description
    pept         73     3375     ATP citrate-lyase
    mRNA    <     1     4350     ATP cytrate-lyase mRNA
BASE COUNT     1084 a   1114 c   1136 g   1016 t
ORIGIN      
        1 taagctggtg cttacggaca gagagccaca ctcgggcttt ctcgaagagg taaaccaggt
       61 ccctctgcag ccatgtcagc caaggcaatt tcagagcaga ccggcaaaga actcctttac
      121 aagtacatct gtaccacctc agccatccag aaccggttca agtatgcccg ggttactccc
      181 gacacagact gggcccatct cctgcaggac cacccctggc tgcttagcca gagcttggta
      241 gtcaagccgg accagctgat caaacgtcga ggaaagcttg gtctagtcgg ggtcaacctc
      301 tctctggatg gagtcaaatc ctggctgaaa cctcgactgg gacatgaggc caccgtcggc
      361 aaggccaaag gcttcctcaa gaactttctg attgagccct tcgtccccca cagtcaggcg
      421 gaggagttct acgtgtgcat ctatgctacc cgggaaggag actacgtcct gttccaccat
      481 gaagggggtg tggatgtggg cgatgtggac accaaagccc agaagctgct tgtgggtgtg
      541 gacgagaaac tgaacgctga agacattaag agacacctgt tggtccacgc ccccgaagac
      601 aagaaagaaa tcctggccag cttcatctcc ggcctattca atttctacga agatctttac
      661 ttcacctacc ttgagatcaa cccccttgtg gtgaccaaag atggtgtcta catccttgac
      721 ctggcggcca aggtggacgc cactgctgac tacatctgca aagtcaagtg gggtgatata
      781 gagttccctc ccccctttgg gcgtgaggca tacccagagg aagcctacat tgcagacctg
      841 gatgccaaaa gtggggcgag cttgaagctg accttgctga accccaaggg gcggatctgg
      901 accatggttg ccgggggtgg cgcctctgtc gtgtacagtg ataccatctg tgatcttgga
      961 ggtgtcaacg aactggcgaa ttacggggag tactctggtg cccccagtga acaacagacc
     1021 tatgactacg ccaagaccat cctctcactt atgactcgag agaagcaccc ggatggcaag
     1081 atcctcatca ttggaggcag cattgcaaac ttcaccaacg tggccgccac cttcaagggc
     1141 attgtgagag caattcgaga ttaccagggt tccctgaagg agcacgaggt caccatcttt
     1201 gttcgaagag gtggcccgaa ctatcaagag ggattacgag tgatgggaga agttgggaag
     1261 accactggaa tccccatcca tgtctttggc acagaaactc acatgacggc cattgtgggc
     1321 atggcctggg caccggccat tcccaaccag ccacccacag cggctcacac tgccaacttc
     1381 ctccttaatg ccagtgggag cacatcgaca ccagcaccca gcaggacagc gtctttttcc
     1441 gagtccagag ctgacgaggt ggcccctgca aagaaagcca agccagccat gccccaagat
     1501 tcagtcccaa gtccaagatc cctgcaagga aagagtgcca ccctcttcag ccgacatacc
     1561 aaggctatcg tatggggcat gcagacccgg gctgtgcaag gcatgctgga ctttgactac
     1621 gtgtgctccc gagatgagcc ttcagtggct gctatggtct acccgttcac gggggatcat
     1681 aagcagaagt tttactgggg acacaaggaa atcctgatcc ctgtcttcaa gaacatggct
     1741 gacgccatga aaaagcatcc ggaggtagac gtgctgatca actttgcatc tctgcgatcg
     1801 gcttatgaca gcaccatgga gaccatgaac tatgcacaga tccggaccat agccatcata
     1861 gcagaaggca tccctgaggc tctcacacgg aagctcatca agaaggcaga ccagaagggc
     1921 gtgaccatca ttgggccagc cacggttggg ggcatcaagc ctggatgctt taagattggg
     1981 aatactggtg ggatgctgga caacatcctg gcctccaaac tgtatcgccc aggcagtgtg
     2041 gcctacgtct cgcgttcagg aggcatgtct aacgaactca ataatatcat ctctcggacc
     2101 acagatggtg tctacgaggg tgttgccatc ggcggggaca ggtaccctgg gtccacattc
     2161 atggatcacg tgctgcgtta ccaagacact ccaggagtca agatgattgt agttcttggg
     2221 gagatagggg gtacagaaga atataagatc tgccggggca tcaaggaggg ccgcctcacc
     2281 aagccagtgg tctgctggtg catcgggacc tgtgccacca tgttctcttc tgaggtccag
     2341 tttggccacg ctggggcttg tgccaaccag gcttctgaaa cggcagtagc caagaaccag
     2401 gccttgaagg aagcgggagt gtttgtgccc cgaagctttg atgagctcgg agaaatcatt
     2461 cagtccgtgt atgaagatct tgtggccaaa ggcgccattg tacctgctca ggaagtgcca
     2521 cctccaacag tacccatgga ctactcttgg gccagggagc tgggtttaat ccgaaaacct
     2581 gcctcattca tgaccagcat ctgtgacgag cgggggcagg aactcattta tgcgggcatg
     2641 cccatcaccg aggtcttcaa ggaagagatg ggcattggtg gtgtcctggg cctcctctgg
     2701 ttccagagaa ggttgcccaa gtattcctgc cagttcattg agatgtgtct catggtcacc
     2761 gctgatcacg ggccagctgt ctccggggcc cataacacta tcatctgtgc tcgggctggg
     2821 aaggacctgg tctccagcct cacctcaggg ctgctcacca ttggggaccg gtttgggggt
     2881 gccttggacg cagcagcgaa gatgttcagt aaagcctttg acagcggcat tattcccatg
     2941 gagtttgtga acaagatgaa gaaggagggg aaactgatca tgggcatcgg ccatcgagtc
     3001 aaatcgataa acaacccaga catgcgagtg cagatcctca aagactttgt caaacagcac
     3061 ttccccgcca ccccgctgct cgactatgca ctggaagtgg agaaaatcac cacctcaaag
     3121 aagccaaatc ttatcctgaa cgtggatggt ttcatcggcg ttgcgtttgt ggacatgctt
     3181 aggaactgtg gctccttcac ccgggaggaa gctgacgagt atgttgacat tggagccctc
     3241 aatggcgtct ttgtgctggg aaggagtatg ggcttcatcg ggcactatct tgaccagaag
     3301 aggctgaagc aagggctgta tcgtcacccc tgggacgaca tttcctatgt tctcccggaa
     3361 cacatgagca tgtaaccgag ccagcagccc taccgtagaa aaaggaagac aaaaactccc
     3421 tcctcgacaa tatagcggac agacagctgg aaacagagcc cgttatgggc tgggcctgga
     3481 atggaaatag ccattgatgt gcaggcatgg aaagccaaca ccacaggccc attcagtcca
     3541 cacagagaag cttagtattt ttttttatat atatatctat atatatataa gcatagaaat
     3601 ttaaaaccaa gccaatactt gtgacgtttg cgctgctacc tgctgtatct attacatgga
     3661 agactgtaag caagcgctgt cagaataatg ttcttctagg gccttatgat gttgctttct
     3721 ttttttaatt agttgaaaat ttatttttcc tctagaacta gtggatccga cttttaagac
     3781 ttcaggatac tatctgtttg taggaccact gtctggtatc ccacctccca ctcatcttca
     3841 caccacatga agaacactgt attaatctga ttttttagga tctttttttt tttttttgtg
     3901 ttatgtgtta agggtttatt tagtatccca ctgaaacgtt ctgtgtttcg gaccaatgtc
     3961 tacttatgtc aaggggagga gggttggggc cattgtaccc ttagccatcg tcacacatgt
     4021 ggagtagtaa cttaaatgta aagttgtaac atacaagtgt ttaaaatgga aaccgcaaag
     4081 caaaaagctg tgaaacgtct cgtgtcttgt gttctctgtg ttcatgcagc tgacttgtct
     4141 gttactgaag tgtgggtcca aagactcaca tctgttccgc atctgtaacc cacagagatt
     4201 ctggcagctg ccacctcagt ctcttctctg tattatcatg tttggtttaa ataaactaga
     4261 tagtaaaaag aattcctgca gcccggggga tccactagtt ctagagcggc gcaccgcggt
     4321 ggagctccag cttttgttcc ctttagtgag
//
LOCUS       RATLOX       5351 bp ss-mRNA            ROD       15-FEB-1990
DEFINITION  Rat aorta lysyl oxidase mRNA, complete cds.
ACCESSION   J02903
KEYWORDS    lysyl oxidase.
SOURCE      Rat neonatal aorta, cDNA to mRNA, (library of Clontech), clones 7,
            13, 8-1 and IIB.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5351)
  AUTHORS   Trackman,P.C., Pratt,A.M., Wolanski,A., Tang,S.-S., Offner,G.D.,
            Troxler,R.F. and Kagan,H.M.
  TITLE     Cloning of rat aorta lysyl oxidase cDNA: Complete codons and
            predicted amino acid sequence
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.C.Trackman, 07-FEB-1990.
FEATURES       from  to/span     description
    pept        287     1516     lysyl oxidase precursor
    sigp        287      349     pot. lysyl oxidase signal peptide
    matp        350     1513     lysyl oxidase
    site       2252     2274     region of dyad symmetry
BASE COUNT     1302 a   1528 c   1245 g   1276 t
ORIGIN      192 bp upstream of TaqI site.
        1 cttctacttc agacactgtg cgctctcccg gaccgtcgtg cgccgctccc cgtcgccttc
       61 caggactggg aaaggggaga ggaggacggt gccacgtccg acggcctcct gggctggggg
      121 cagggtctgc cgctcgccct tggcaccagt ccctgcgacc catccccgag cctcgccctc
      181 ctcctccctg ctcgaagagg tctccctcct tcgcgggatc tgagtccctg tcttcatttt
      241 tctcctagcc acgtccctcc ccgagaaggg acgagccggg agcatcatgc gtttcgcctg
      301 gaccgtgctc tttctgggac agctgcagtt ctgtcccctt ctccgctgcg ccccgcaggc
      361 cccgcgcgag cctcccgccg cccccggtgc ctggcgccag acaatccaat gggagaacaa
      421 cgggcaggtg ttcagtctgt tgagcctggg ggcgcagtac cagcctcagc gacgccgcga
      481 ctccagcgcc actgccccga gagccgacgc aacgctgcag cacagccacg cacgcccatt
      541 ctgctgctgc gtgacaaccg cactgcctct gcccgtgcga ggactccaag cccatctggg
      601 gtcgccgcgg gtcgtccccg gcccgcagcc cgccactggt tccaagttgg tttctcgccg
      661 tcgggggccg gcgatggagc ctcaaggcgc gcagaaccgg actgcgtcgc cacagcctcc
      721 gcagctcagt aatctgaggc cacccagcca cgtagatcgc atggtggcga cgacccctac
      781 aatccctaca agtactccga cgacaacccc tattataact actatgacac ttatgagaga
      841 ccggtccggg agcaggcacc gacctggata tggcaccggt tacttccagt acggtctccc
      901 ggacctggta cccgatccct actacatcca ggcatccacg tacgtacaaa agatgtctat
      961 gtacaacctg agatgcgctg cggaagaaaa ctgcctggcc agttcagcat atagggcgga
     1021 tgtcagagac tatgaccaca gggtactgct acgatttcct cagagagtga aaaaccaagg
     1081 gacgtctgac ttcttaccaa gccgcccccg ctactcctgg gagtggcaca gctgccacca
     1141 acattaccac agcatggatg aattcagcca ctacgacctg ctggatgcca gcacacagag
     1201 gagagtggcc gagggccaca aagcaagctt ctgtctggag gacacttcct gtgattatgg
     1261 gtaccacaga cgatttgcct gtactgcaca cacacagggg ttgagtcccg gatgttatga
     1321 tacttatgca gcagacatag actgccagtg gattgatatt acagatgtac aacccggaaa
     1381 ttacattcta aaggtcagtg taaaccccag ctacctggtg cctgaatcag actacagtaa
     1441 caatgtcgta cgctgtgaaa ttcgctacac aggacatcac gcctatgcct caggctgcac
     1501 catttcaccg tattagaaag aagctcacct tcccaaagga tgaagcagta cctggtgttt
     1561 ggacctatga aaaccgtaga ttagcttaag taggaagact tagatatttt aaaaggcaaa
     1621 cggaaaaaca acaaagaagg ttttgtttgg actctttcac aacaaatcac ataactggat
     1681 tttgagtgtt taaatcagca ttagattggc acattttaaa tacttattca tgttgcttta
     1741 tgaagtaatg gtgtttcaat tctgtgggtg catagtgggc tctttcaaag aattctgaat
     1801 ttcttacctt cttttgaaat tatagtgcaa aaagaagagg atattttaat gaatgagcca
     1861 caatttgaac tgattacttt ctaaattgcc agacccatga gacaatgatg atgggtttgt
     1921 atttgcctca acatagattc gctttttaaa aagggtgttc ctattgtata ggcaaaaatg
     1981 gatacacttg gtgctgagga agggtcaaat actaactatt gttgtcacga aatataggtc
     2041 tacagcagag agatggtgag tatatattca gatagttaca tccctatata aactatgttt
     2101 acattttaga tgcttttctt tctgttaatt gcttaatctc actctgactt gaggtacaac
     2161 ttctgttttg gaatgaatta gataattcca gattctggtt tgataattgt tgacattccc
     2221 ccatgctact ttttctgagg gcagaaacgt ctaatgtgac gactcttcac attaccatta
     2281 cgaggataca cagcacagcg aaatcattcc gatgacaggt gtgatagatg gagagctaac
     2341 atgcaactgc cgagtgtttc actgttagcc agaactaagt cacttgcccc acacagcaat
     2401 tacaccatga atctctaaca tcacaacctt ctttcaaata cccacggact catccatcct
     2461 tccatccgtc atccatccat ccgtccgtcc gtccgtcctg actgcctagt gccactgtct
     2521 ggctaggcac acccactatc aacctggttc acctgtcatg gcagcctgta cccacccccg
     2581 ccacacaccc cgacgctggc ctatagtgca aaggttgtgc gggctggtcc ttcccacaat
     2641 gcagtactgt aatccccgtc cctcctggag cccgaattcc ttctacttca gacactgtgc
     2701 gctctcccgg accgtcgtgc gccgctcccc gtcgccttcc aggactggga aaggggagag
     2761 gaggacggtg ccacgtccga cggcctcctg ggctgggggc agggtctgcc gctcgccctt
     2821 ggcaccagtc cctgcgaccc atccccgagc ctcgccctcc tcctccctgc tcgaagaggt
     2881 ctccctcctt cgcgggatct gagtccctgt cttcattttt ctcctagcca cgtccctccc
     2941 cgagaaggga cgagccggga gcatcatgcg tttcgcctgg accgtgctct ttctgggaca
     3001 gctgcagttc tgtccccttc tccgctgcgc cccgcaggcc ccgcgcgagc ctcccgccgc
     3061 ccccggtgcc tggcgccaga caatccaatg ggagaacaac gggcaggtgt tcagtctgtt
     3121 gagcctgggg gcgcagtacc agcctcagcg acgccgcgac tccagcgcca ctgccccgag
     3181 agccgacgca acgctgcagc acagccacgc acgcccattc tgctgctgcg tgacaaccgc
     3241 actgcctctg cccgtgcgag gactccaagc ccatctgggg tcgccgcggg tcgtccccgg
     3301 cccgcagccc gccactggtt ccaagttggt ttctcgccgt cgggggccgg cgatggagcc
     3361 tcaaggcgcg cagaaccgga ctgcgtcgcc acagcctccg cagctcagta atctgaggcc
     3421 acccagccac gtagatcgca tggtggcgac gacccctaca atccctacaa gtactccgac
     3481 gacaacccct attataacta ctatgacact tatgagagac cggtccggga gcaggcaccg
     3541 acctggatat ggcaccggtt acttccagta cggtctcccg gacctggtac ccgatcccta
     3601 ctacatccag gcatccacgt acgtacaaaa gatgtctatg tacaacctga gatgcgctgc
     3661 ggaagaaaac tgcctggcca gttcagcata tagggcggat gtcagagact atgaccacag
     3721 ggtactgcta cgatttcctc agagagtgaa aaaccaaggg acgtctgact tcttaccaag
     3781 ccgcccccgc tactcctggg agtggcacag ctgccaccaa cattaccaca gcatggatga
     3841 attcagccac tacgacctgc tggatgccag cacacagagg agagtggccg agggccacaa
     3901 agcaagcttc tgtctggagg acacttcctg tgattatggg taccacagac gatttgcctg
     3961 tactgcacac acacaggggt tgagtcccgg atgttatgat acttatgcag cagacataga
     4021 ctgccagtgg attgatatta cagatgtaca acccggaaat tacattctaa aggtcagtgt
     4081 aaaccccagc tacctggtgc ctgaatcaga ctacagtaac aatgtcgtac gctgtgaaat
     4141 tcgctacaca ggacatcacg cctatgcctc aggctgcacc atttcaccgt attagaaaga
     4201 agctcacctt cccaaaggat gaagcagtac ctggtgtttg gacctatgaa aaccgtagat
     4261 tagcttaagt aggaagactt agatatttta aaaggcaaac ggaaaaacaa caaagaaggt
     4321 tttgtttgga ctctttcaca acaaatcaca taactggatt ttgagtgttt aaatcagcat
     4381 tagattggca cattttaaat acttattcat gttgctttat gaagtaatgg tgtttcaatt
     4441 ctgtgggtgc atagtgggct ctttcaaaga attctgaatt tcttaccttc ttttgaaatt
     4501 atagtgcaaa aagaagagga tattttaatg aatgagccac aatttgaact gattactttc
     4561 taaattgcca gacccatgag acaatgatga tgggtttgta tttgcctcaa catagattcg
     4621 ctttttaaaa agggtgttcc tattgtatag gcaaaaatgg atacacttgg tgctgaggaa
     4681 gggtcaaata ctaactattg ttgtcacgaa atataggtct acagcagaga gatggtgagt
     4741 atatattcag atagttacat ccctatataa actatgttta cattttagat gcttttcttt
     4801 ctgttaattg cttaatctca ctctgacttg aggtacaact tctgttttgg aatgaattag
     4861 ataattccag attctggttt gataattgtt gacattcccc catgctactt tttctgaggg
     4921 cagaaacgtc taatgtgacg actcttcaca ttaccattac gaggatacac agcacagcga
     4981 aatcattccg atgacaggtg tgatagatgg agagctaaca tgcaactgcc gagtgtttca
     5041 ctgttagcca gaactaagtc acttgcccca cacagcaatt acaccatgaa tctctaacat
     5101 cacaaccttc tttcaaatac ccacggactc atccatcctt ccatccgtca tccatccatc
     5161 cgtccgtccg tccgtcctga ctgcctagtg ccactgtctg gctaggcaca cccactatca
     5221 acctggttca cctgtcatgg cagcctgtac ccacccccgc cacacacccc gacgctggcc
     5281 tatagtgcaa aggttgtgcg ggctggtcct tcccacaatg cagtactgta atccccgtcc
     5341 ctcctggagc c
//
LOCUS       SHFIPAH      2900 bp ds-DNA             BCT       16-FEB-1990
DEFINITION  S.flexner invasion plasmid antigen (ipaH) gene, complete cds.
ACCESSION   M32063
KEYWORDS    invasion plasmid antigen.
SOURCE      S.flexner (strain M90T-W), serotype S) DNA, clone pWR390.
  ORGANISM  Shigella flexneri
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2900)
  AUTHORS   Hartman,A.B., Venkatesan,M.M., Oaks,E.V. and Buysse,J.M.
  TITLE     Sequence and molecular characterization of a multicopy invasion
            plasmid antigen gene, ipaH, of Shigella flexner
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A.B.Hartman, 12-FEB-1990.
FEATURES       from  to/span     description
    pept        251     1849     invasion plasmid antigen
    pept       2277  >  2900     ORF3
    signal      139      144     -35 region
    signal      162      167     -10 region
    signal     2111     2116     -35 region
    signal     2132     2137     -10 region
BASE COUNT      868 a    657 c    578 g    797 t
ORIGIN      
        1 catagaaaac ctccataaat aaattacaac taacttctgt tatgtgtaaa atggaaacta
       61 ttaaaactta atatcggaaa tggtaagtga aatttgtata aatatacaat tttaaatatt
      121 tattctcaca aatataaggt tgacctagca ttatgttctc tgtaaataat acacactcat
      181 cagtttcttg ctccccctct attaactcaa actcaaccag taatgaacat tatctgagaa
      241 tcctgactga atgggaaaag aactcttctc ccgggaagag cgaggcattg cttttaacag
      301 actctcccag tgctttcaga atcaagaagc agtattaaat ttatcagacc taaatttgac
      361 gtctcttccc gaattaccaa agcatatttc tgctttgatt gtagaaaata ataaattaac
      421 atcattgcca aagctgcctg catttcttaa agaacttaat gctgataata acaggctttc
      481 tgtgatacca gaacttcctg agtcattaac aactttaagt gttcgttcta atcaactgga
      541 aaaccttcct gttttgccaa accatttaac atcattattt gttgaaaata acaggctata
      601 taacttaccg gctcttcccg aaaaattgaa atttttacat gtttattata acaggctgac
      661 aacattaccc gacttaccgg ataaactgga aattctctgt gctcagcgca ataatctggt
      721 tacttttcct caattttctg atagaaacaa tatcagacaa aaggaatatt attttcattt
      781 taatcagata accactcttc cggagagttt ttcacaatta gattcaagtt acaggattaa
      841 tatttcaggg aatccattgt cgactcgcgt tctgcaatcc ctgcaaagat taacctcttc
      901 gccggactac cacggcccgc agatttactt ctccatgagt gacggacaac agaatacact
      961 ccatcgcccc ctggctgatg ccgtgacagc atggttcccg gaaaacaaac aatctgatgt
     1021 atcacagata tggcatgctt ttgaacatga agagcatgcc aacacctttt ccgcgttcct
     1081 tgaccgcctt tccgataccg tctctgcacg caatacctcc ggattccgtg aacaggtcgc
     1141 tgcatggctg gaaaaactca gtgcctctgc ggagcttcga cagcagtctt tcgctgttgc
     1201 tgctgatgcc actgagagct gtgaggaccg tgtcgcgctc acatggaaca atctccggaa
     1261 aaccctcctg gtccatcagg catcagaagg ccttttcgat aatgataccg gcgctctgct
     1321 ctccctgggc agggaaatgt tccgcctcga aattctggag gacattgccc gggataaagt
     1381 cagaactctc cattttgtgg atgagataga agtctacctg gccttccaga ccatgctcgc
     1441 agagaaactt cagctctcca ctgccgtgaa ggaaatgcgt ttctatggcg tgtcgggagt
     1501 gacagcaaat gacctccgca ctgccgaagc tatggtcaga agccgtgaag agaatgaatt
     1561 tacggactgg ttctccctct ggggaccatg gcatgctgta ctgaagcgta cggaagctga
     1621 ccgctgggcg caggcagaag agcagaagta tgagatgctg gagaatgagt actctcagag
     1681 ggtggctgac cggctgaaag catcaggtct gagcggtgat gcggatgcgc agagggaagc
     1741 cggtgcacag gtgatgcgtg agactgaaca gcagatttac cgtcagctga ctgacgaggt
     1801 actggccctg cgattgtctg aaaacggctc acgactgcac cattcataat cacgtcgcat
     1861 aagcataaac cgcagaccgg attgactccg gaaaaactgt gacccgatta cggaccttaa
     1921 caacaacccg taaatcctcg ctcaataccg gcagggattt acggcgtgca actgactttt
     1981 ttgaggggat aaccaaccag atcgtttgct atgggaatat cgagacagta atgagttaaa
     2041 tgataaaaat tgtttgaaaa tataggggat aaagatcaat ccaaactgga tgaaagtaga
     2101 actggtcaca ttaacatggg tagactgata taacaatcga cggttactgg aaagacagga
     2161 acatattcct ccagccggaa tgaaaacgcc gataaagctc taggattgtt tttttaaaga
     2221 ctttctcgtt ttatttgcat taatagacca agatatgaat agtgaggggt taataaatga
     2281 aaccgatcaa caatcattct ttttttcgtt ccctttgtgg cttatcatgt atatctcgtt
     2341 tatcggtaga agaacagtgt accagagatt accaccgcat ctgggatgac tgggctaggg
     2401 aaggaacaac aacagaaaat cgcatccagg cggttcgatt attgaaaata tgtctggata
     2461 cccgggagcc tgttctcaat ttaagcttac tgaaactacg ttctttacca ccactccctt
     2521 tgcatatacg tgaacttaat atttccaaca atgagttaat ctccctacct gaaaattctc
     2581 cgcttttgac agaacttcat gtaaatggta acaacttgaa tatactcccg acacttccat
     2641 ctcaactgat taagcttaat atttcattca atcgaaattt gtcatgtctg ccatcattac
     2701 caccatattt acaatcactc tcggcacgtt ttaatagtct ggagacgtta ccagagcttc
     2761 catcaacgct aacaatatta cgtattgaag gtaatcgcct tactgtcttg cctgaattgc
     2821 ctcatagact acaagaactc tttgtttccg gcaacagact acaggaacta ccagaatttc
     2881 ctcagagctt aaaatatttg
//
LOCUS       VACCSBP      1020 bp ds-DNA             VRL       06-DEC-1989
DEFINITION  Vaccinia virus cell surface-binding protein gene, complete cds.
ACCESSION   J05190
KEYWORDS    antigen; carbonic anhydrase-related transmembrane protein;
            cell surface-binding protein; envelope protein.
SOURCE      Vaccinia virus (wild type WR) DNA.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 1020)
  AUTHORS   Maa,J.-S., Rodriguez,J.F. and Esteban,M.
  TITLE     Structural and functional characterization of a cell surface
            binding protein of vaccinia virus
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Esteban 30-NOV-1989.
FEATURES       from  to/span     description
    pept         61      975     cell surface-binding protein
    site        822      972     attachment site
    site          9       19     alpha helix
    site        103      114     alpha helix
    site        270      280     alpha helix
    site        286      292     alpha helix
BASE COUNT      354 a    182 c    167 g    317 t
ORIGIN      Map position HindIII-D.
        1 catccattgt aattcccata ctaagagcta tttttaaaca gttatcattt catttttact
       61 atgccgcaac aactatctcc tattaatata gaaactaaaa aagcaatttc taacgcgcga
      121 ttgaagccgt tagacataca ttataatgag tcgaaaccaa ccactatcca gaacactgga
      181 aaactagtaa ggattaattt taaaggagga tatataagtg gagggtttct ccccaatgaa
      241 tatgtgttat catcactaca tatatattgg ggaaaggaag acgattatgg atccaatcac
      301 ttgatagatg tgtacaaata ctctggagag attaatcttg ttcattggaa taagaaaaaa
      361 tatagttctt atgaagaggc aaaaaaacac gatgatggac ttatcattat ttctatattc
      421 ttacaagtat tggatcataa aaatgtatat tttcaaaaga tagttaatca attgcattcc
      481 attagatccg ccaatacgtc tgcaccgttt gattcagtat tttatctaga caatttgctg
      541 cctagtaagt tggattattt tacatatcta ggaacaacta tcaaccactc tgcagacgct
      601 gtatggataa tttttccaac gccaataaac attcattctg atcaactatc taaattcaga
      661 acactattgt cgtcgtctaa tcatgatgga aaaccgcatt atataacaga gaactataga
      721 aatccgtata aattgaacga cgacacgcaa gtatattatt ctggggagat tatacgagca
      781 gcaactacct ctccagcgcg cgagaactat tttatgagat ggttgtccga tttgagagag
      841 acatgttttt catattatca aaaatatatc gaagagaata aaacattcgc aattattgcc
      901 atagtattcg tgtttatact taccgctatt ctctttttta tgagtcgacg atattcgcga
      961 gaaaaacaaa actagattcg ataccttgtt gagcctccat tagaacggca gtgacttcgc
//

GenBank-Updates@genbank.bio.net (04/11/90)

LOCUS       CATL11AA      424 bp ds-DNA             MAM       28-FEB-1990
DEFINITION  Cat LINE-1 DNA sequence region 1.
ACCESSION   M32251
KEYWORDS    L1 repetitive sequence; repetitive sequence.
SOURCE      Cat DNA.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 424)
  AUTHORS   Fanning,T. and Singer,S.
  TITLE     The LINE-1 DNA sequences in four mammalian orders predict proteins
            that conserve homologies to retrovirus proteins
  JOURNAL   Nucleic Acids Res. 15, 2251-2260 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rpt           1      424     L1 repetitive element
BASE COUNT      182 a     71 c     61 g    110 t
ORIGIN      
        1 caaggatttt tcaatacaaa aagcaatatc agtgtaagat accatattaa cagaatgaag
       61 ggaaaaaacc cacatcatta tctcaactga tacagaaaaa gcatttgaca acattcaata
      121 ccctttcatg ataaaaacat tgtgaaaact agggatagaa gaaattacct tcattgaata
      181 aaagccatat atgaaaaacc cacaagtaat atcatattca atggtaaaag actaaaatct
      241 tttcctctaa atcaggaaaa ggcaaggatg cctactttca ccacttctat tcaatgtgta
      301 ctggaaattc tagctagagc aattaggcaa gaaaaagaaa ggcatccaaa ttgaaaagga
      361 tgaagtaaaa ttattcctat ttgcagttga tatgatctta tacatagaaa accctaaaga
      421 ttac
//
LOCUS       CATL12AA      377 bp ds-DNA             MAM       28-FEB-1990
DEFINITION  Cat LINE-1 DNA sequence region 2.
ACCESSION   M32252
KEYWORDS    L1 repetitive sequence; repetitive sequence.
SOURCE      Cat DNA.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 377)
  AUTHORS   Fanning,T. and Singer,S.
  TITLE     The LINE-1 DNA sequences in four mammalian orders predict proteins
            that conserve homologies to retrovirus proteins
  JOURNAL   Nucleic Acids Res. 15, 2251-2260 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rpt           1      377     L1 repetitive element
BASE COUNT      146 a     81 c     85 g     65 t
ORIGIN      
        1 gcaatgacac atcggacaaa gggctagtat ccaaaacctg taaagaactc accaaactcc
       61 acacccaaaa aacaaataat ccagtgaaga aatgggcaga agacatgaat agacactttc
      121 ctaaacaaga ccagatggtc aacaggcaca tgaaaagatg ctcaacgtca ctccccatga
      181 gggagataca aatcaaaacc acaatgagat atcacctcac accagtcaga gtggctaaaa
      241 tgagaaaatc aggagatgct gatgctggag aggatgtgga gaaacgggaa ccctcttgca
      301 ctgttggtgg gaatgcaagc aaactggtgc agctgctctg gaaaacagtg tggaggttcc
      361 tcagaaaatt aaaaata
//
LOCUS       HUMCOL4A6    1056 bp ss-mRNA            PRI       09-MAR-1990
DEFINITION  Human collagen type IV mRNA.
ACCESSION   M33653
KEYWORDS    alternative splicing; collagen type IV.
SOURCE      Human tumor cell line HT-1080, cDNA to mRNA, clones HT-[125,133].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1056)
  AUTHORS   Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and
            Prockop,D.J.
  TITLE     Partial characterization of a low molecular weight human collagen
            that undergoes alternative splicing
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      740     collagen type IV /hgml_locus_uid="LJ0118J"
                                 /map="13q34" /nomgen="COL4A2"
BASE COUNT      277 a    228 c    333 g    218 t
ORIGIN      
        1 agatccggac gctggccttg atggggcctc ttggtcttct tgggcaaatt ggcccacttg
       61 gagctctagg gatgctaggc cagaaggggg agattggact gctaggccct ctaggacacg
      121 atgggaaagg acctcgcggt aaactaggag acatgggccc tcttggtccc caaggccccc
      181 taggaaagga tggacctcca ggagtgaagg gagaaaacgg gcacccaggg agcccaggag
      241 agaaggggga aaaaggggag acaggacaag caggctcacc ggttcctggg ctgctagggc
      301 cagaggggcc tcccggacct ccggggctcc aaggtgttcc tggaccaaag ggggaagcag
      361 gactagacgg agcaaaagga gagaaaggct tccagggaga aaaaggagac cgtggtcccc
      421 tgggactacc cggagcttca ggtttggacg gcaggcctgg gccaccgggt actccaggac
      481 caattggagt tccaggccca gcgggaccaa agggcgagag gggcagcaaa ggagaccctg
      541 ggatgacagg accaacggga gcagctgggc ttcctggttt acatggacca cccggggaca
      601 agggaaaccg gggggagagg gggaagaaag gctctagagg gcctaaaggg gacaagggag
      661 accaaggagc gcctggatta gatgccccct gcccattggg gcaagatggc ttaccagtcc
      721 aaggctgctg gaacaagtga tgcctctaac cttggattgg cctgtgtgtg tgtttgtaca
      781 tagaatattt atttttatac agttttcact ttttgaaaat gccagaagta tgatgcatct
      841 tacagattat taaaaaagaa agaaaaaccg ttgcatattt tgtacagaaa atatcaacct
      901 cttccctttt gtttacaaga tgttttgtat aagcctatgt ctctaataca ttttttgttt
      961 ggtcgtaatg tctgcatgat atttgtgcac atttattaag tatcgaaggt taataaattg
     1021 ttgtgtcctg gtgccaaggg gggccagcca gaactg
//
LOCUS       HUMCOL4A7     629 bp ss-mRNA            PRI       09-MAR-1990
DEFINITION  Human collagen type IV mRNA.
ACCESSION   M15524
KEYWORDS    alternative splicing; collagen type IV.
SOURCE      Human tumor cell line HT-1080, cDNA to mRNA, clone HT-127.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 629)
  AUTHORS   Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and
            Prockop,D.J.
  TITLE     Partial characterization of a low molecular weight human collagen
            that undergoes alternative splicing
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      403     collagen type IV (AA at 2)
                                 /hgml_locus_uid="LJ0118J" /map="13q34"
                                 /nomgen="COL4A2"
BASE COUNT      175 a    131 c    185 g    138 t
ORIGIN      
        1 agaggggcct cccggacctc cggggctcca aggtgttcct ggaccaaagg gggaagcagg
       61 actagacgga gcaaaaggag agaaaggctt ccagggagaa aaaggagacc gtggtcccct
      121 gggactaccc ggtactccag gaccaattgg agttccaggc ccagcgggac caaagggcga
      181 gaggggcagc aaaggagacc ctgggatgac aggaccaacg ggagcagctg ggcttcctgg
      241 tttacatgga ccacccgggg acaagggaaa ccggggggag agggggaaga aaggctctag
      301 agggcctaaa ggggacaagg gagaccaagg agcgcctgga ttagatgccc cctgcccatt
      361 ggggcaagat ggcttaccag tccaaggctg ctggaacaag tgatgcctct aaccttggat
      421 tggcctgtgt gtgtgtttgt acatagaata tttattttta tacagttttc actttttgaa
      481 aatgccagaa gtatgatgca tcttacagat tattaaaaaa gaaagaaaaa ccgttgcata
      541 ttttgtacag aaaatatcaa cctcttccct tttgtttaca agatgttttg tataagccta
      601 tgtctctaat acattttttg tttggtcgt
//
LOCUS       HUMRB1RA     4600 bp ss-mRNA            PRI       09-APR-1990
DEFINITION  Human retinoblastoma associated (RB1) mRNA.
ACCESSION   M33647 J02994
KEYWORDS    retinoblastoma protein.
SOURCE      Human retinoblast cDNA to mRNA, clone p4.7R.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4600)
  AUTHORS   Friend,S.H., Horowitz,J.M., Gerber,M.R., Wang,X.-F., Bogenmann,E.,
            Li,F.P. and Weinberg,R.A.
  TITLE     Deletions of a DNA sequence in retinoblastomas and mesenchymal
            tumors: Organization of the sequence and its encoded protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 9059-9063 (1987)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence for [1] kindly
            provided by S.H.Friend, 10-FEB-1988.
FEATURES       from  to/span     description
    pept          4     2790     retinoblastoma-associated protein
                                 /hgml_locus_uid="LE0159J" /map="13q14.2"
                                 /nomgen="RB1"
BASE COUNT     1489 a    842 c    815 g   1454 t
ORIGIN      
        1 gtcatgccgc ccaaaacccc ccgaaaaacg gccgccaccg ccgccgctgc cgccgcggaa
       61 cccccggcac cgccgccgcc gccccctcct gaggaggacc cagagcagga cagcggcccg
      121 gaggacctgc ctctcgtcag gcttgagttt gaagaaacag aagaacctga ttttactgca
      181 ttatgtcaga aattaaagat accagatcat gtcagagaga gagcttggtt aacttgggag
      241 aaagtttcat ctgtggatgg agtattggga ggttatattc aaaagaaaaa ggaactgtgg
      301 ggaatctgta tctttattgc agcagttgac ctagatgaga tgtcgttcac ttttactgag
      361 ctacagaaaa acatagaaat cagtgtccat aaattcttta acttactaaa agaaattgat
      421 accagtacca aagttgataa tgctatgtca agactgttga agaagtatga tgtattgttt
      481 gcactcttca gcaaattgga aaggacatgt gaacttatat atttgacaca acccagcagt
      541 tcgatatcta ctgaaataaa ttctgcattg gtgctaaaag tttcttggat cacattttta
      601 ttagctaaag gggaagtatt acaaatggaa gatgatctgg tgatttcatt tcagttaatg
      661 ctatgtgtcc ttgactattt tattaaactc tcacctccca tgttgctcaa agaaccatat
      721 aaaacagctg ttatacccat taatggttca cctcgaacac ccaggcgagg tcagaacagg
      781 agtgcacgga tagcaaaaca actagaaaat gatacaagaa ttattgaagt tctctgtaaa
      841 gaacatgaat gtaatataga tgaggtgaaa aatgtttatt tcaaaaattt tatacctttt
      901 atgaattctc ttggacttgt aacatctaat ggacttccag aggttgaaaa tctttctaaa
      961 cgatacgaag aaatttatct taaaaataaa gatctagatg caagattatt tttggatcat
     1021 gataaaactc ttcagactga ttctatagac agttttgaaa cacagagaac accacgaaaa
     1081 agtaaccttg atgaagaggt gaatgtaatt cctccacaca ctccagttag gactgttatg
     1141 aacactatcc aacaattaat gatgatttta aattcagcaa gtgatcaacc ttcagaaaat
     1201 ctgatttcct attttaacaa ctgcacagtg aatccaaaag aaagtatact gaaaagagtg
     1261 aaggatatag gatacatctt taaagagaaa tttgctaaag ctgtgggaca gggttgtgtc
     1321 gaaattggat cacagcgata caaacttgga gttcgcttgt attaccgagt aatggaatcc
     1381 atgcttaaat cagaagaaga acgattatcc attcaaaatt ttagcaaact tctgaatgac
     1441 aacatttttc atatgtcttt attggcgtgc gctcttgagg ttgtaatggc cacatatagc
     1501 agaagtacat ctcagaatct tgattctgga acagatttgt ctttcccatg gattctgaat
     1561 gtgcttaatt taaaagcctt tgatttttac aaagtgatcg aaagttttat caaagcagaa
     1621 ggcaacttga caagagaaat gataaaacat ttagaacgat gtgaacatcg aatcatggaa
     1681 tcccttgcat ggctctcaga ttcaccttta tttgatctta ttaaacaatc aaaggaccga
     1741 gaaggaccaa ctgatcacct tgaatctgct tgtcctctta atcttcctct ccagaataat
     1801 cacactgcag cagatatgta tctttctcct gtaagatctc caaagaaaaa aggttcaact
     1861 acgcgtgtaa attctactgc aaatgcagag acacaagcaa cctcagcctt ccagacccag
     1921 aagccattga aatctacctc tctttcactg ttttataaaa aagtgtatcg gctagcctat
     1981 ctccggctaa atacactttg tgaacgcctt ctgtctgagc acccagaatt agaacatatc
     2041 atctggaccc ttttccagca caccctgcag aatgagtatg aactcatgag agacaggcat
     2101 ttggaccaaa ttatgatgtg ttccatgtat ggcatatgca aagtgaagaa tatagacctt
     2161 aaattcaaaa tcattgtaac agcatacaag gatcttcctc atgctgttca ggagacattc
     2221 aaacgtgttt tgatcaaaga agaggagtat gattctatta tagtattcta taactcggtc
     2281 ttcatgcaga gactgaaaac aaatattttg cagtatgctt ccaccaggcc ccctaccttg
     2341 tcaccaatac ctcacattcc tcgaagccct tacaagtttc ctagttcacc cttacggatt
     2401 cctggaggga acatctatat ttcacccctg aagagtccat ataaaatttc agaaggtctg
     2461 ccaacaccaa caaaaatgac tccaagatca agaatcttag tatcaattgg tgaatcattc
     2521 gggacttctg agaagttcca gaaaataaat cagatggtat gtaacagcga ccgtgtgctc
     2581 aaaagaagtg ctgaaggaag caaccctcct aaaccactga aaaaactacg ctttgatatt
     2641 gaaggatcag atgaagcaga tggaagtaaa catctcccag gagagtccaa atttcagcag
     2701 aaactggcag aaatgacttc tactcgaaca cgaatgcaaa agcagaaaat gaatgatagc
     2761 atggatacct caaacaagga agagaaatga ggatctcagg accttggtgg acactgtgta
     2821 cacctctgga ttcattgtct ctcacagatg tgactgtata actttcccag gttctgttta
     2881 tggccacatt taatatcttc agctcttttt gtggatataa aatgtgcaga tgcaattgtt
     2941 tgggtgattc ctaagccact tgaaatgtta gtcattgtta tttatacaag attgaaaatc
     3001 ttgtgtaaat cctgccattt aaaaagttgt agcagattgt ttcctcttcc aaagtaaaat
     3061 tgctgtgctt tatggatagt aagaatggcc ctagagtggg agtcctgata acccaggcct
     3121 gtctgactac tttgccttct tttgtagcat ataggtgatg tttgctcttg tttttattaa
     3181 tttatatgta tattttttta atttaacatg aacaccctta gaaaatgtgt cctatctatc
     3241 ttccaaatgc aatttgattg actgcccatt caccaaaatt atcctgaact cttctgcaaa
     3301 aatggatatt attagaaatt agaaaaaaat tactaatttt acacattaga ttttatttta
     3361 ctattggaat ctgatatact gtgtgcttgt tttataaaat tttgctttta attaaataaa
     3421 agctggaagc aaagtataac catatgatac tatcatacta ctgaaacaga tttcatacct
     3481 cagaatgtaa aagaacttac tgattatttt cttcatccaa cttatgtttt taaatgagga
     3541 ttattgatag tactcttggt ttttatacca ttcagatcac tgaatttata aagtacccat
     3601 ctagtacttg aaaaagtaaa gtgttctgcc agatcttagg tatagaggac cctaacacag
     3661 tatatcccaa gtgcactttc taatgtttct gggtcctgaa gaattaagat acaaattaat
     3721 tttactccat aaacagactg ttaattatag gagccttaat ttttttttca tagagatttg
     3781 tctaattgca tctcaaaatt attctgccct ccttaatttg ggaaggtttg tgttttctct
     3841 ggaatggtac atgtcttcca tgtatctttt gaactggcaa ttgtctattt atcttttatt
     3901 tttttaagtc agtatggtct aacactggca tgttcaaagc cacattattt ctagtccaaa
     3961 attacaagta atcaagggtc attatgggtt aggcattaat gtttctatct gattttgtgc
     4021 aaaagcttca aattaaaaca gctgcattag aaaaagaggc gcttctcccc tcccctacac
     4081 ctaaaggtgt atttaaacta tcttgtgtga ttaacttatt tagagatgct gtaacttaaa
     4141 ataggggata tttaaggtag cttcagctag cttttaggaa aatcactttg tctaactcag
     4201 aattattttt aaaaagaaat ctggtcttgt tagaaaacaa aattttattt tgtgctcatt
     4261 taagtttcaa acttactatt ttgacagtta ttttgataac aatgacacta gaaaacttga
     4321 ctccatttca tcattgtttc tgcatgaata tcatacaaat cagttagttt ttaggtcaag
     4381 ggcttactat ttctgggtct tttgctacta agttcacatt agaattagtg ccagaatttt
     4441 aggaacttca gagatcgtgt attgagattt cttaaataat gcttcagata ttattgcttt
     4501 attgcttttt tgtattggtt aaaactgtac atttaaaatt gctatgttac tattttctac
     4561 aattaatagt ttgtctattt taaaataaat tagttgttag
//
LOCUS       MV1MOBPA     2436 bp ds-DNA             BCT       15-DEC-1989
DEFINITION  Plasmid pMV158 mobilization protein gene.
ACCESSION   M28538
KEYWORDS    mobilization.
SOURCE      Plasmid pMV158 from Streptococcus agalactiae, DNA.
  ORGANISM  Plasmid pMV158
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 2436)
  AUTHORS   Priebe,S.D. and Lacks,S.A.
  TITLE     Region of the streptococcal plasmid pMV158 required for conjugative
            mobilization
  JOURNAL   J. Bacteriol. 171, 4778-4784 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence for [1] kindly
            provided by S.A.Lacks, 18-JAN-1990.
FEATURES       from  to/span     description
    pept        628     2112     mobilization peptide
BASE COUNT      824 a    393 c    565 g    654 t
ORIGIN      
        1 agggatttct aaatcgttaa gggatcaact ttgggagaga gttcaaaatt gatccttttt
       61 ttataacagg aattcaaatc tttttgttcc attaaagggc gcgattgctg aataaaagat
      121 acgagagacc tctcttgtat cttttttatt ttgagtggtt ttgtccgtta cactagaaaa
      181 ccgaaagaca ataaaaattt tattcttgct gagtctggct ttcggtaagc tagacaaaac
      241 ggacaaaata aaaatcagca agggacaggt agtatttttt gagtgatctt ctcaaaaaat
      301 actacctgtc ccttgctgat ttttaaacga gcacgagagc aaaacccccc tttgctgagg
      361 tggcagaggg caggtttttt tgtttctttt ttctcgtaaa aaaaagaaag gtcttaaagg
      421 ttttatggtt ttggtcggca ctgccgacag cctcgcagag cacacacttt atgaatataa
      481 agtatagtgt gttatacttt acatggaagt tataccgaaa ttgtgctaaa ctaccagtta
      541 agatgttgct gattaagacg agcaatagac tgtgtcggct cgtctgtctt tcggtaatcg
      601 aaagacaagt cttaaggagg gaaatctatg agttacatgg ttgcaagaat gcagaagatg
      661 aaagctggca atttgggagg agcttttaag cataatgaac gtgtttttga gacgcactca
      721 aataaagata taaacccgag caggtcacac ctcaactacg agttgacgga tcgtgaccgc
      781 tcggtgtcgt atgaaaagca gattaaagac tatgtgaacg aaaataaggt ttctaatcga
      841 gcaatccgaa aagatgcagt tctatgtgat gagtggatta tcacatctga taaagatttt
      901 tttgaaaagt tggatgagga acagacgaga acgttttttg aaacggctaa aaattatttt
      961 gcagagaact atggcgagtc aaatattgct tatgcgagtg ttcacttgga tgaaagcacc
     1021 cctcacatgc acatgggagt agtacctttt gagaacggta aattgtcatc aaaagcaatg
     1081 tttgaccgtg aggagctaaa acatatccaa gaggacttgc ctagatacat gagcgaccac
     1141 ggttttgagt tggaacgtgg caagctgaat agtgaagcta agcataagac tgtagctgaa
     1201 ttcaagaggg caatggctga tatggaactc aaagaggaac ttcttgaaaa atatcatgca
     1261 ccgctttttg ttgatgagag aacaggcgag ttgaacaatg acacggaagc tttttggcat
     1321 gaaaaagagt ttgctgatat gtttgaagtt caatctccga tacgtgaaac aactaaccaa
     1381 gaaaaaatgg actggttaag aaaacagtac caagaagagc tgaaaaaact agaatcgtct
     1441 aaaaagcccc tagaagacga tttaagccat ttagaagagt tgcttgataa aaagaccaag
     1501 gaatatatta aaatcgattc tgaggcctct gagagggcct cagagctatc taaagccgag
     1561 ggatatataa ataccctaga aaatcattcg aagagcttag aagcgaaaat agagtgttta
     1621 gagagtgata atctacaatt ggaaaaacaa aaggcgacaa aactcgaagc gaaagcgttg
     1681 aacgagagtg agttgcgaga actaaagcct aagaagaatt ttctaggaaa agagcattat
     1741 gagttaagtc ctgaacaatt tgaagggttg aaggcagaag tttatcgtag tagaactcta
     1801 ttgcaccaca aagatattga actggagcaa gcaaaacgtc aagtatctct gagagcctct
     1861 aaaaactatt ttacagctag tttagagcga gctaaggaaa aagctaaagg tgagagtata
     1921 gaccgtctta aaagcgaaat aaagcgacta aaaaacgaaa attcaatttt acgtcagcaa
     1981 aatgacaaga tgctagggaa attaagagag ttaatgcctg ataaagcctt taagaatttg
     2041 ttatcagaac ttaaggcgat taagccaatc gtgaatataa ttaaaaaggc tattgaaaag
     2101 agcttgttct gagcgattta tgccgtgaaa gctatttgac aataagcagt gacagagtac
     2161 gctaggacgt gccgagccga aaggctttag cgtttcggac ggacacggac aaaggacggc
     2221 agtcactggt tacttgttgt caaatagacc atggaataaa aagcgtcaaa agtcttgagt
     2281 ggatgatacc ctatggtact ctattcgcct tttgactttt ttgctataat ttaagtgtcg
     2341 ccagttcttc cgtcaggtaa tgcgaactta gactggaggt gagcgttgtg aagacattcc
     2401 tcgagcttgt ctttgtccct tttgtggttg gcgttg
//
LOCUS       RATIGFI1      958 bp ss-mRNA            ROD       15-JUN-1988
DEFINITION  Rat mRNA (clone IGF1AB1) for insulin-like growth factor I.
ACCESSION   X06107 Y00429 M32260
KEYWORDS    insulin-like growth factor I.
SOURCE      rat (Rattus norvegicus).
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 958; enum. 1 to 958)
  AUTHORS   Rotwein,P.
  JOURNAL   Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA.
  STANDARD  simple automatic
REFERENCE   2  (bases ??)
  AUTHORS   Shimatsu,A. and Rotwein,P.
  TITLE     Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing
            Within the 5' Untranslated Region
  JOURNAL   Nucleic Acids Res. 15, 7196-7196 (1987)
  STANDARD  simple automatic
COMMENT     Another IGF-I mRNA of rat liver differing in the 5' UT-region is
            described in $<X06118>$
            
            *source: tissue=liver; strain=Sprague-Dawley;
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE          1    258       5' UT-region
            
               SITE        736    958       3' UT-region
FEATURES       from  to/span     description
    pept        259      738     IGF-I (AA 1-159)
BASE COUNT      260 a    246 c    209 g    243 t
ORIGIN      
        1 cgaatgttcc cccagctgtt tcctgtctac agtgtctgtg ttttgtagat aaatgtgagg
       61 attttctcta aatccctctt ctgcttgcta aatctcactg tcgctgctaa attcagacca
      121 gatagagcct gcgcaatcga aataaagtcc tcaaaattga aatgtgactt tgctctaaca
      181 tctcccatct ctctggattc tttttgcctc attattcctg cccaccaatt catttccaga
      241 ctttgtactt cagaagcgat ggggaaaatc agcagtcttc caactcaatt atttaagatc
      301 tgcctctgtg acttcttgaa gataaagata cacatcatgt cgtcttcaca tctcttctac
      361 ctggcactct gcttgctcac ctttaccagc tcggccacag ccggaccaga gaccctttgc
      421 ggggctgagc tggtggacgc tcttcagttc gtgtgtggac caaggggctt ttacttcaac
      481 aagcccacag gctatggctc cagcattcgg agggcaccac agacgggcat tgtggatgag
      541 tgttgcttcc ggagctgtga tctgaggagg ctggagatgt actgtgctcc gctgaagcct
      601 acaaagtcag ctcgttccat ccgggcccag cgccacactg acatgcccaa gactcagaag
      661 tcccagcccc tatcgacaca caagaaaagg aagctgcaaa ggagaaggaa aggaagtaca
      721 cttgaagaac acaagtagag gaagtgcagg aaacaagacc tacagaatgt aggaggagcc
      781 tcccgaggaa cagaaaatgc cacgtcaccg caagatcctt tgctgcttga gcaacctgca
      841 aaacatcgga acacctgcca aatatcaata atgagttcaa taccatttca gagatgggca
      901 tttccctcaa tgaaatacac aagtaaacat tccgacattg tctttaggag tgtttgtt
//
LOCUS       RATIGFI2      798 bp ss-mRNA            ROD       15-JUN-1988
DEFINITION  Rat mRNA (clone IGF1AB2) for insulin-like growth factor I.
ACCESSION   X06108 Y00429 M32339
KEYWORDS    insulin-like growth factor I.
SOURCE      rat (Rattus norvegicus).
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 798)
  AUTHORS   Rotwein,P.
  JOURNAL   Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA.
  STANDARD  simple automatic
REFERENCE   2  (bases ??)
  AUTHORS   Shimatsu,A. and Rotwein,P.
  TITLE     Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing
            Within the 5' Untranslated Region
  JOURNAL   Nucleic Acids Res. 15, 7196-7196 (1987)
  STANDARD  simple automatic
COMMENT     Another IGF-I mRNA of rat liver differing in the 5' UT-region is
            described in $<XX6107>$..  *source: tissue=liver;
            strain=Sprague-Dawley;
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE          1     72       5' UT-region
            
               SITE        553    798       3' UT-region
FEATURES       from  to/span     description
    pept         73      552     IGF-I (AA 1-159)
BASE COUNT      238 a    196 c    183 g    181 t
ORIGIN      
        1 gaatgttccc ccagctgttt cctgtctaca gtgtctgtgt tttgtagata aatactttgt
       61 acttcagaag cgatggggaa aatcagcagt cttccaactc aattatttaa gatctgcctc
      121 tgtgacttct tgaagataaa gatacacatc atgtcgtctt cacatctctt ctacctggca
      181 ctctgcttgc tcacctttac cagctcggcc acagccggac cagagaccct ttgcggggct
      241 gagctggtgg acgctcttca gttcgtgtgt ggaccaaggg gcttttactt caacaagccc
      301 acaggctatg gctccagcat tcggagggca ccacagacgg gcattgtgga tgagtgttgc
      361 ttccggagct gtgatctgag gaggctggag atgtactgtg ctccgctgaa gcctacaaag
      421 tcagctcgtt ccatccgggc ccagcgccac actgacatgc ccaagactca gaagtcccag
      481 cccctatcga cacacaagaa aaggaagctg caaaggagaa ggaaaggaag tacacttgaa
      541 gaacacaagt agaggaagtg caggaaacaa gacctacaga atgtaggagg agcctcccga
      601 ggaacagaaa atgccacgtc accgcaagat cctttgctgc ttgagcaacc tgcaaaacat
      661 cggaacacct gccaaatatc aataatgagt tcaataccat ttcagagatg ggcatttccc
      721 tcaatgaaat acacaagtaa acattccgac attgtcttta ggagtgtttg ttaaaaaaaa
      781 aaaaaaaaaa acaaaaac
//
LOCUS       VSVNJLA       590 bp ss-RNA             VRL       14-MAR-1990
DEFINITION  Vesicular stomatitis virus L gene.
ACCESSION   M17436
KEYWORDS    L gene.
SOURCE      Vesicular stomatitis virus (New Jersey serotype; Ogden strain) cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 590)
  AUTHORS   Luk,D., Masters,P.S., Gill,D.S. and Banerjee,A.K.
  TITLE     Intergenic sequences of the vesicular stomatitis virus genome (New
            Jersey serotype): Evidence for two transcription initiation sites
            within the L gene
  JOURNAL   Virology 160, 88-94 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         66  >   590     L protein
BASE COUNT      189 a    110 c    117 g    174 t
ORIGIN      
        1 atggctcatt tccgttaata tatgaaaaaa actggaatag aagcataact ccaacaggaa
       61 tcaaaatgga tttcgatcta atagaagatt ctgctaactg ggaagatgat gagtcggact
      121 tttttttgcg agacattcta tctcaagaag atcagatgtc ttacttaaac accgcagact
      181 ataatttgaa ttctccactc atttccgatg acatggttta tataattaaa agaatgaatc
      241 acgaggaggt tccccccatt tggagatcta aggagtggga ttctcctctt gatatgctta
      301 gaggttgtca agctcaaccc atgtctcacc aagagatgca caattggttt ggaacttgga
      361 ttcaaaatgt tcagcatgat tccgcccaac ccttcacatt cttaaaagag gtggacaaag
      421 agtcagaaat gacctatgac ttggtctcca cttttctgaa ggggtgggtg gggaaagatt
      481 acccattcaa gtcaaagaac aaagaaattg atagtatggc attagtcggt cctctttgtc
      541 aaaaattcct tgatttgcat aaaatcactc tgatattaaa tgcagtttct
//

GenBank-Updates@genbank.bio.net (04/11/90)

LOCUS       CATL11AA      424 bp ds-DNA             MAM       28-FEB-1990
DEFINITION  Cat LINE-1 DNA sequence region 1.
ACCESSION   M32251
KEYWORDS    L1 repetitive sequence; repetitive sequence.
SOURCE      Cat DNA.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 424)
  AUTHORS   Fanning,T. and Singer,S.
  TITLE     The LINE-1 DNA sequences in four mammalian orders predict proteins
            that conserve homologies to retrovirus proteins
  JOURNAL   Nucleic Acids Res. 15, 2251-2260 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rpt           1      424     L1 repetitive element
BASE COUNT      182 a     71 c     61 g    110 t
ORIGIN      
        1 caaggatttt tcaatacaaa aagcaatatc agtgtaagat accatattaa cagaatgaag
       61 ggaaaaaacc cacatcatta tctcaactga tacagaaaaa gcatttgaca acattcaata
      121 ccctttcatg ataaaaacat tgtgaaaact agggatagaa gaaattacct tcattgaata
      181 aaagccatat atgaaaaacc cacaagtaat atcatattca atggtaaaag actaaaatct
      241 tttcctctaa atcaggaaaa ggcaaggatg cctactttca ccacttctat tcaatgtgta
      301 ctggaaattc tagctagagc aattaggcaa gaaaaagaaa ggcatccaaa ttgaaaagga
      361 tgaagtaaaa ttattcctat ttgcagttga tatgatctta tacatagaaa accctaaaga
      421 ttac
//
LOCUS       CATL12AA      377 bp ds-DNA             MAM       28-FEB-1990
DEFINITION  Cat LINE-1 DNA sequence region 2.
ACCESSION   M32252
KEYWORDS    L1 repetitive sequence; repetitive sequence.
SOURCE      Cat DNA.
  ORGANISM  Felis catus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Feliforma; Felidae.
REFERENCE   1  (bases 1 to 377)
  AUTHORS   Fanning,T. and Singer,S.
  TITLE     The LINE-1 DNA sequences in four mammalian orders predict proteins
            that conserve homologies to retrovirus proteins
  JOURNAL   Nucleic Acids Res. 15, 2251-2260 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rpt           1      377     L1 repetitive element
BASE COUNT      146 a     81 c     85 g     65 t
ORIGIN      
        1 gcaatgacac atcggacaaa gggctagtat ccaaaacctg taaagaactc accaaactcc
       61 acacccaaaa aacaaataat ccagtgaaga aatgggcaga agacatgaat agacactttc
      121 ctaaacaaga ccagatggtc aacaggcaca tgaaaagatg ctcaacgtca ctccccatga
      181 gggagataca aatcaaaacc acaatgagat atcacctcac accagtcaga gtggctaaaa
      241 tgagaaaatc aggagatgct gatgctggag aggatgtgga gaaacgggaa ccctcttgca
      301 ctgttggtgg gaatgcaagc aaactggtgc agctgctctg gaaaacagtg tggaggttcc
      361 tcagaaaatt aaaaata
//
LOCUS       HUMCOL4A6    1056 bp ss-mRNA            PRI       09-MAR-1990
DEFINITION  Human collagen type IV mRNA.
ACCESSION   M33653
KEYWORDS    alternative splicing; collagen type IV.
SOURCE      Human tumor cell line HT-1080, cDNA to mRNA, clones HT-[125,133].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1056)
  AUTHORS   Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and
            Prockop,D.J.
  TITLE     Partial characterization of a low molecular weight human collagen
            that undergoes alternative splicing
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      740     collagen type IV /hgml_locus_uid="LJ0118J"
                                 /map="13q34" /nomgen="COL4A2"
BASE COUNT      277 a    228 c    333 g    218 t
ORIGIN      
        1 agatccggac gctggccttg atggggcctc ttggtcttct tgggcaaatt ggcccacttg
       61 gagctctagg gatgctaggc cagaaggggg agattggact gctaggccct ctaggacacg
      121 atgggaaagg acctcgcggt aaactaggag acatgggccc tcttggtccc caaggccccc
      181 taggaaagga tggacctcca ggagtgaagg gagaaaacgg gcacccaggg agcccaggag
      241 agaaggggga aaaaggggag acaggacaag caggctcacc ggttcctggg ctgctagggc
      301 cagaggggcc tcccggacct ccggggctcc aaggtgttcc tggaccaaag ggggaagcag
      361 gactagacgg agcaaaagga gagaaaggct tccagggaga aaaaggagac cgtggtcccc
      421 tgggactacc cggagcttca ggtttggacg gcaggcctgg gccaccgggt actccaggac
      481 caattggagt tccaggccca gcgggaccaa agggcgagag gggcagcaaa ggagaccctg
      541 ggatgacagg accaacggga gcagctgggc ttcctggttt acatggacca cccggggaca
      601 agggaaaccg gggggagagg gggaagaaag gctctagagg gcctaaaggg gacaagggag
      661 accaaggagc gcctggatta gatgccccct gcccattggg gcaagatggc ttaccagtcc
      721 aaggctgctg gaacaagtga tgcctctaac cttggattgg cctgtgtgtg tgtttgtaca
      781 tagaatattt atttttatac agttttcact ttttgaaaat gccagaagta tgatgcatct
      841 tacagattat taaaaaagaa agaaaaaccg ttgcatattt tgtacagaaa atatcaacct
      901 cttccctttt gtttacaaga tgttttgtat aagcctatgt ctctaataca ttttttgttt
      961 ggtcgtaatg tctgcatgat atttgtgcac atttattaag tatcgaaggt taataaattg
     1021 ttgtgtcctg gtgccaaggg gggccagcca gaactg
//
LOCUS       HUMCOL4A7     629 bp ss-mRNA            PRI       09-MAR-1990
DEFINITION  Human collagen type IV mRNA.
ACCESSION   M15524
KEYWORDS    alternative splicing; collagen type IV.
SOURCE      Human tumor cell line HT-1080, cDNA to mRNA, clone HT-127.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 629)
  AUTHORS   Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and
            Prockop,D.J.
  TITLE     Partial characterization of a low molecular weight human collagen
            that undergoes alternative splicing
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      403     collagen type IV (AA at 2)
                                 /hgml_locus_uid="LJ0118J" /map="13q34"
                                 /nomgen="COL4A2"
BASE COUNT      175 a    131 c    185 g    138 t
ORIGIN      
        1 agaggggcct cccggacctc cggggctcca aggtgttcct ggaccaaagg gggaagcagg
       61 actagacgga gcaaaaggag agaaaggctt ccagggagaa aaaggagacc gtggtcccct
      121 gggactaccc ggtactccag gaccaattgg agttccaggc ccagcgggac caaagggcga
      181 gaggggcagc aaaggagacc ctgggatgac aggaccaacg ggagcagctg ggcttcctgg
      241 tttacatgga ccacccgggg acaagggaaa ccggggggag agggggaaga aaggctctag
      301 agggcctaaa ggggacaagg gagaccaagg agcgcctgga ttagatgccc cctgcccatt
      361 ggggcaagat ggcttaccag tccaaggctg ctggaacaag tgatgcctct aaccttggat
      421 tggcctgtgt gtgtgtttgt acatagaata tttattttta tacagttttc actttttgaa
      481 aatgccagaa gtatgatgca tcttacagat tattaaaaaa gaaagaaaaa ccgttgcata
      541 ttttgtacag aaaatatcaa cctcttccct tttgtttaca agatgttttg tataagccta
      601 tgtctctaat acattttttg tttggtcgt
//
LOCUS       HUMRB1RA     4600 bp ss-mRNA            PRI       09-APR-1990
DEFINITION  Human retinoblastoma associated (RB1) mRNA.
ACCESSION   M33647 J02994
KEYWORDS    retinoblastoma protein.
SOURCE      Human retinoblast cDNA to mRNA, clone p4.7R.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4600)
  AUTHORS   Friend,S.H., Horowitz,J.M., Gerber,M.R., Wang,X.-F., Bogenmann,E.,
            Li,F.P. and Weinberg,R.A.
  TITLE     Deletions of a DNA sequence in retinoblastomas and mesenchymal
            tumors: Organization of the sequence and its encoded protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 9059-9063 (1987)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence for [1] kindly
            provided by S.H.Friend, 10-FEB-1988.
FEATURES       from  to/span     description
    pept          4     2790     retinoblastoma-associated protein
                                 /hgml_locus_uid="LE0159J" /map="13q14.2"
                                 /nomgen="RB1"
BASE COUNT     1489 a    842 c    815 g   1454 t
ORIGIN      
        1 gtcatgccgc ccaaaacccc ccgaaaaacg gccgccaccg ccgccgctgc cgccgcggaa
       61 cccccggcac cgccgccgcc gccccctcct gaggaggacc cagagcagga cagcggcccg
      121 gaggacctgc ctctcgtcag gcttgagttt gaagaaacag aagaacctga ttttactgca
      181 ttatgtcaga aattaaagat accagatcat gtcagagaga gagcttggtt aacttgggag
      241 aaagtttcat ctgtggatgg agtattggga ggttatattc aaaagaaaaa ggaactgtgg
      301 ggaatctgta tctttattgc agcagttgac ctagatgaga tgtcgttcac ttttactgag
      361 ctacagaaaa acatagaaat cagtgtccat aaattcttta acttactaaa agaaattgat
      421 accagtacca aagttgataa tgctatgtca agactgttga agaagtatga tgtattgttt
      481 gcactcttca gcaaattgga aaggacatgt gaacttatat atttgacaca acccagcagt
      541 tcgatatcta ctgaaataaa ttctgcattg gtgctaaaag tttcttggat cacattttta
      601 ttagctaaag gggaagtatt acaaatggaa gatgatctgg tgatttcatt tcagttaatg
      661 ctatgtgtcc ttgactattt tattaaactc tcacctccca tgttgctcaa agaaccatat
      721 aaaacagctg ttatacccat taatggttca cctcgaacac ccaggcgagg tcagaacagg
      781 agtgcacgga tagcaaaaca actagaaaat gatacaagaa ttattgaagt tctctgtaaa
      841 gaacatgaat gtaatataga tgaggtgaaa aatgtttatt tcaaaaattt tatacctttt
      901 atgaattctc ttggacttgt aacatctaat ggacttccag aggttgaaaa tctttctaaa
      961 cgatacgaag aaatttatct taaaaataaa gatctagatg caagattatt tttggatcat
     1021 gataaaactc ttcagactga ttctatagac agttttgaaa cacagagaac accacgaaaa
     1081 agtaaccttg atgaagaggt gaatgtaatt cctccacaca ctccagttag gactgttatg
     1141 aacactatcc aacaattaat gatgatttta aattcagcaa gtgatcaacc ttcagaaaat
     1201 ctgatttcct attttaacaa ctgcacagtg aatccaaaag aaagtatact gaaaagagtg
     1261 aaggatatag gatacatctt taaagagaaa tttgctaaag ctgtgggaca gggttgtgtc
     1321 gaaattggat cacagcgata caaacttgga gttcgcttgt attaccgagt aatggaatcc
     1381 atgcttaaat cagaagaaga acgattatcc attcaaaatt ttagcaaact tctgaatgac
     1441 aacatttttc atatgtcttt attggcgtgc gctcttgagg ttgtaatggc cacatatagc
     1501 agaagtacat ctcagaatct tgattctgga acagatttgt ctttcccatg gattctgaat
     1561 gtgcttaatt taaaagcctt tgatttttac aaagtgatcg aaagttttat caaagcagaa
     1621 ggcaacttga caagagaaat gataaaacat ttagaacgat gtgaacatcg aatcatggaa
     1681 tcccttgcat ggctctcaga ttcaccttta tttgatctta ttaaacaatc aaaggaccga
     1741 gaaggaccaa ctgatcacct tgaatctgct tgtcctctta atcttcctct ccagaataat
     1801 cacactgcag cagatatgta tctttctcct gtaagatctc caaagaaaaa aggttcaact
     1861 acgcgtgtaa attctactgc aaatgcagag acacaagcaa cctcagcctt ccagacccag
     1921 aagccattga aatctacctc tctttcactg ttttataaaa aagtgtatcg gctagcctat
     1981 ctccggctaa atacactttg tgaacgcctt ctgtctgagc acccagaatt agaacatatc
     2041 atctggaccc ttttccagca caccctgcag aatgagtatg aactcatgag agacaggcat
     2101 ttggaccaaa ttatgatgtg ttccatgtat ggcatatgca aagtgaagaa tatagacctt
     2161 aaattcaaaa tcattgtaac agcatacaag gatcttcctc atgctgttca ggagacattc
     2221 aaacgtgttt tgatcaaaga agaggagtat gattctatta tagtattcta taactcggtc
     2281 ttcatgcaga gactgaaaac aaatattttg cagtatgctt ccaccaggcc ccctaccttg
     2341 tcaccaatac ctcacattcc tcgaagccct tacaagtttc ctagttcacc cttacggatt
     2401 cctggaggga acatctatat ttcacccctg aagagtccat ataaaatttc agaaggtctg
     2461 ccaacaccaa caaaaatgac tccaagatca agaatcttag tatcaattgg tgaatcattc
     2521 gggacttctg agaagttcca gaaaataaat cagatggtat gtaacagcga ccgtgtgctc
     2581 aaaagaagtg ctgaaggaag caaccctcct aaaccactga aaaaactacg ctttgatatt
     2641 gaaggatcag atgaagcaga tggaagtaaa catctcccag gagagtccaa atttcagcag
     2701 aaactggcag aaatgacttc tactcgaaca cgaatgcaaa agcagaaaat gaatgatagc
     2761 atggatacct caaacaagga agagaaatga ggatctcagg accttggtgg acactgtgta
     2821 cacctctgga ttcattgtct ctcacagatg tgactgtata actttcccag gttctgttta
     2881 tggccacatt taatatcttc agctcttttt gtggatataa aatgtgcaga tgcaattgtt
     2941 tgggtgattc ctaagccact tgaaatgtta gtcattgtta tttatacaag attgaaaatc
     3001 ttgtgtaaat cctgccattt aaaaagttgt agcagattgt ttcctcttcc aaagtaaaat
     3061 tgctgtgctt tatggatagt aagaatggcc ctagagtggg agtcctgata acccaggcct
     3121 gtctgactac tttgccttct tttgtagcat ataggtgatg tttgctcttg tttttattaa
     3181 tttatatgta tattttttta atttaacatg aacaccctta gaaaatgtgt cctatctatc
     3241 ttccaaatgc aatttgattg actgcccatt caccaaaatt atcctgaact cttctgcaaa
     3301 aatggatatt attagaaatt agaaaaaaat tactaatttt acacattaga ttttatttta
     3361 ctattggaat ctgatatact gtgtgcttgt tttataaaat tttgctttta attaaataaa
     3421 agctggaagc aaagtataac catatgatac tatcatacta ctgaaacaga tttcatacct
     3481 cagaatgtaa aagaacttac tgattatttt cttcatccaa cttatgtttt taaatgagga
     3541 ttattgatag tactcttggt ttttatacca ttcagatcac tgaatttata aagtacccat
     3601 ctagtacttg aaaaagtaaa gtgttctgcc agatcttagg tatagaggac cctaacacag
     3661 tatatcccaa gtgcactttc taatgtttct gggtcctgaa gaattaagat acaaattaat
     3721 tttactccat aaacagactg ttaattatag gagccttaat ttttttttca tagagatttg
     3781 tctaattgca tctcaaaatt attctgccct ccttaatttg ggaaggtttg tgttttctct
     3841 ggaatggtac atgtcttcca tgtatctttt gaactggcaa ttgtctattt atcttttatt
     3901 tttttaagtc agtatggtct aacactggca tgttcaaagc cacattattt ctagtccaaa
     3961 attacaagta atcaagggtc attatgggtt aggcattaat gtttctatct gattttgtgc
     4021 aaaagcttca aattaaaaca gctgcattag aaaaagaggc gcttctcccc tcccctacac
     4081 ctaaaggtgt atttaaacta tcttgtgtga ttaacttatt tagagatgct gtaacttaaa
     4141 ataggggata tttaaggtag cttcagctag cttttaggaa aatcactttg tctaactcag
     4201 aattattttt aaaaagaaat ctggtcttgt tagaaaacaa aattttattt tgtgctcatt
     4261 taagtttcaa acttactatt ttgacagtta ttttgataac aatgacacta gaaaacttga
     4321 ctccatttca tcattgtttc tgcatgaata tcatacaaat cagttagttt ttaggtcaag
     4381 ggcttactat ttctgggtct tttgctacta agttcacatt agaattagtg ccagaatttt
     4441 aggaacttca gagatcgtgt attgagattt cttaaataat gcttcagata ttattgcttt
     4501 attgcttttt tgtattggtt aaaactgtac atttaaaatt gctatgttac tattttctac
     4561 aattaatagt ttgtctattt taaaataaat tagttgttag
//
LOCUS       MV1MOBPA     2436 bp ds-DNA             BCT       15-DEC-1989
DEFINITION  Plasmid pMV158 mobilization protein gene.
ACCESSION   M28538
KEYWORDS    mobilization.
SOURCE      Plasmid pMV158 from Streptococcus agalactiae, DNA.
  ORGANISM  Plasmid pMV158
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 2436)
  AUTHORS   Priebe,S.D. and Lacks,S.A.
  TITLE     Region of the streptococcal plasmid pMV158 required for conjugative
            mobilization
  JOURNAL   J. Bacteriol. 171, 4778-4784 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence for [1] kindly
            provided by S.A.Lacks, 18-JAN-1990.
FEATURES       from  to/span     description
    pept        628     2112     mobilization peptide
BASE COUNT      824 a    393 c    565 g    654 t
ORIGIN      
        1 agggatttct aaatcgttaa gggatcaact ttgggagaga gttcaaaatt gatccttttt
       61 ttataacagg aattcaaatc tttttgttcc attaaagggc gcgattgctg aataaaagat
      121 acgagagacc tctcttgtat cttttttatt ttgagtggtt ttgtccgtta cactagaaaa
      181 ccgaaagaca ataaaaattt tattcttgct gagtctggct ttcggtaagc tagacaaaac
      241 ggacaaaata aaaatcagca agggacaggt agtatttttt gagtgatctt ctcaaaaaat
      301 actacctgtc ccttgctgat ttttaaacga gcacgagagc aaaacccccc tttgctgagg
      361 tggcagaggg caggtttttt tgtttctttt ttctcgtaaa aaaaagaaag gtcttaaagg
      421 ttttatggtt ttggtcggca ctgccgacag cctcgcagag cacacacttt atgaatataa
      481 agtatagtgt gttatacttt acatggaagt tataccgaaa ttgtgctaaa ctaccagtta
      541 agatgttgct gattaagacg agcaatagac tgtgtcggct cgtctgtctt tcggtaatcg
      601 aaagacaagt cttaaggagg gaaatctatg agttacatgg ttgcaagaat gcagaagatg
      661 aaagctggca atttgggagg agcttttaag cataatgaac gtgtttttga gacgcactca
      721 aataaagata taaacccgag caggtcacac ctcaactacg agttgacgga tcgtgaccgc
      781 tcggtgtcgt atgaaaagca gattaaagac tatgtgaacg aaaataaggt ttctaatcga
      841 gcaatccgaa aagatgcagt tctatgtgat gagtggatta tcacatctga taaagatttt
      901 tttgaaaagt tggatgagga acagacgaga acgttttttg aaacggctaa aaattatttt
      961 gcagagaact atggcgagtc aaatattgct tatgcgagtg ttcacttgga tgaaagcacc
     1021 cctcacatgc acatgggagt agtacctttt gagaacggta aattgtcatc aaaagcaatg
     1081 tttgaccgtg aggagctaaa acatatccaa gaggacttgc ctagatacat gagcgaccac
     1141 ggttttgagt tggaacgtgg caagctgaat agtgaagcta agcataagac tgtagctgaa
     1201 ttcaagaggg caatggctga tatggaactc aaagaggaac ttcttgaaaa atatcatgca
     1261 ccgctttttg ttgatgagag aacaggcgag ttgaacaatg acacggaagc tttttggcat
     1321 gaaaaagagt ttgctgatat gtttgaagtt caatctccga tacgtgaaac aactaaccaa
     1381 gaaaaaatgg actggttaag aaaacagtac caagaagagc tgaaaaaact agaatcgtct
     1441 aaaaagcccc tagaagacga tttaagccat ttagaagagt tgcttgataa aaagaccaag
     1501 gaatatatta aaatcgattc tgaggcctct gagagggcct cagagctatc taaagccgag
     1561 ggatatataa ataccctaga aaatcattcg aagagcttag aagcgaaaat agagtgttta
     1621 gagagtgata atctacaatt ggaaaaacaa aaggcgacaa aactcgaagc gaaagcgttg
     1681 aacgagagtg agttgcgaga actaaagcct aagaagaatt ttctaggaaa agagcattat
     1741 gagttaagtc ctgaacaatt tgaagggttg aaggcagaag tttatcgtag tagaactcta
     1801 ttgcaccaca aagatattga actggagcaa gcaaaacgtc aagtatctct gagagcctct
     1861 aaaaactatt ttacagctag tttagagcga gctaaggaaa aagctaaagg tgagagtata
     1921 gaccgtctta aaagcgaaat aaagcgacta aaaaacgaaa attcaatttt acgtcagcaa
     1981 aatgacaaga tgctagggaa attaagagag ttaatgcctg ataaagcctt taagaatttg
     2041 ttatcagaac ttaaggcgat taagccaatc gtgaatataa ttaaaaaggc tattgaaaag
     2101 agcttgttct gagcgattta tgccgtgaaa gctatttgac aataagcagt gacagagtac
     2161 gctaggacgt gccgagccga aaggctttag cgtttcggac ggacacggac aaaggacggc
     2221 agtcactggt tacttgttgt caaatagacc atggaataaa aagcgtcaaa agtcttgagt
     2281 ggatgatacc ctatggtact ctattcgcct tttgactttt ttgctataat ttaagtgtcg
     2341 ccagttcttc cgtcaggtaa tgcgaactta gactggaggt gagcgttgtg aagacattcc
     2401 tcgagcttgt ctttgtccct tttgtggttg gcgttg
//
LOCUS       RATIGFI1      958 bp ss-mRNA            ROD       15-JUN-1988
DEFINITION  Rat mRNA (clone IGF1AB1) for insulin-like growth factor I.
ACCESSION   X06107 Y00429 M32260
KEYWORDS    insulin-like growth factor I.
SOURCE      rat (Rattus norvegicus).
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 958; enum. 1 to 958)
  AUTHORS   Rotwein,P.
  JOURNAL   Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA.
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 958)
  AUTHORS   Shimatsu,A. and Rotwein,P.
  TITLE     Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing
            Within the 5' Untranslated Region
  JOURNAL   Nucleic Acids Res. 15, 7196-7196 (1987)
  STANDARD  simple automatic
COMMENT     Another IGF-I mRNA of rat liver differing in the 5' UT-region is
            described in $<X06118>$
            
            *source: tissue=liver; strain=Sprague-Dawley;
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE          1    258       5' UT-region
            
               SITE        736    958       3' UT-region
FEATURES       from  to/span     description
    pept        259      738     IGF-I (AA 1-159)
BASE COUNT      260 a    246 c    209 g    243 t
ORIGIN      
        1 cgaatgttcc cccagctgtt tcctgtctac agtgtctgtg ttttgtagat aaatgtgagg
       61 attttctcta aatccctctt ctgcttgcta aatctcactg tcgctgctaa attcagacca
      121 gatagagcct gcgcaatcga aataaagtcc tcaaaattga aatgtgactt tgctctaaca
      181 tctcccatct ctctggattc tttttgcctc attattcctg cccaccaatt catttccaga
      241 ctttgtactt cagaagcgat ggggaaaatc agcagtcttc caactcaatt atttaagatc
      301 tgcctctgtg acttcttgaa gataaagata cacatcatgt cgtcttcaca tctcttctac
      361 ctggcactct gcttgctcac ctttaccagc tcggccacag ccggaccaga gaccctttgc
      421 ggggctgagc tggtggacgc tcttcagttc gtgtgtggac caaggggctt ttacttcaac
      481 aagcccacag gctatggctc cagcattcgg agggcaccac agacgggcat tgtggatgag
      541 tgttgcttcc ggagctgtga tctgaggagg ctggagatgt actgtgctcc gctgaagcct
      601 acaaagtcag ctcgttccat ccgggcccag cgccacactg acatgcccaa gactcagaag
      661 tcccagcccc tatcgacaca caagaaaagg aagctgcaaa ggagaaggaa aggaagtaca
      721 cttgaagaac acaagtagag gaagtgcagg aaacaagacc tacagaatgt aggaggagcc
      781 tcccgaggaa cagaaaatgc cacgtcaccg caagatcctt tgctgcttga gcaacctgca
      841 aaacatcgga acacctgcca aatatcaata atgagttcaa taccatttca gagatgggca
      901 tttccctcaa tgaaatacac aagtaaacat tccgacattg tctttaggag tgtttgtt
//
LOCUS       RATIGFI2      798 bp ss-mRNA            ROD       15-JUN-1988
DEFINITION  Rat mRNA (clone IGF1AB2) for insulin-like growth factor I.
ACCESSION   X06108 Y00429 M32339
KEYWORDS    insulin-like growth factor I.
SOURCE      rat (Rattus norvegicus).
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 798)
  AUTHORS   Rotwein,P.
  JOURNAL   Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA.
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 798)
  AUTHORS   Shimatsu,A. and Rotwein,P.
  TITLE     Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing
            Within the 5' Untranslated Region
  JOURNAL   Nucleic Acids Res. 15, 7196-7196 (1987)
  STANDARD  simple automatic
COMMENT     Another IGF-I mRNA of rat liver differing in the 5' UT-region is
            described in $<XX6107>$..  *source: tissue=liver;
            strain=Sprague-Dawley;
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE          1     72       5' UT-region
            
               SITE        553    798       3' UT-region
FEATURES       from  to/span     description
    pept         73      552     IGF-I (AA 1-159)
BASE COUNT      238 a    196 c    183 g    181 t
ORIGIN      
        1 gaatgttccc ccagctgttt cctgtctaca gtgtctgtgt tttgtagata aatactttgt
       61 acttcagaag cgatggggaa aatcagcagt cttccaactc aattatttaa gatctgcctc
      121 tgtgacttct tgaagataaa gatacacatc atgtcgtctt cacatctctt ctacctggca
      181 ctctgcttgc tcacctttac cagctcggcc acagccggac cagagaccct ttgcggggct
      241 gagctggtgg acgctcttca gttcgtgtgt ggaccaaggg gcttttactt caacaagccc
      301 acaggctatg gctccagcat tcggagggca ccacagacgg gcattgtgga tgagtgttgc
      361 ttccggagct gtgatctgag gaggctggag atgtactgtg ctccgctgaa gcctacaaag
      421 tcagctcgtt ccatccgggc ccagcgccac actgacatgc ccaagactca gaagtcccag
      481 cccctatcga cacacaagaa aaggaagctg caaaggagaa ggaaaggaag tacacttgaa
      541 gaacacaagt agaggaagtg caggaaacaa gacctacaga atgtaggagg agcctcccga
      601 ggaacagaaa atgccacgtc accgcaagat cctttgctgc ttgagcaacc tgcaaaacat
      661 cggaacacct gccaaatatc aataatgagt tcaataccat ttcagagatg ggcatttccc
      721 tcaatgaaat acacaagtaa acattccgac attgtcttta ggagtgtttg ttaaaaaaaa
      781 aaaaaaaaaa acaaaaac
//
LOCUS       VSVNJLA       590 bp ss-RNA             VRL       14-MAR-1990
DEFINITION  Vesicular stomatitis virus L gene.
ACCESSION   M17436
KEYWORDS    L gene.
SOURCE      Vesicular stomatitis virus (New Jersey serotype; Ogden strain) cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 590)
  AUTHORS   Luk,D., Masters,P.S., Gill,D.S. and Banerjee,A.K.
  TITLE     Intergenic sequences of the vesicular stomatitis virus genome (New
            Jersey serotype): Evidence for two transcription initiation sites
            within the L gene
  JOURNAL   Virology 160, 88-94 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         66  >   590     L protein
BASE COUNT      189 a    110 c    117 g    174 t
ORIGIN      
        1 atggctcatt tccgttaata tatgaaaaaa actggaatag aagcataact ccaacaggaa
       61 tcaaaatgga tttcgatcta atagaagatt ctgctaactg ggaagatgat gagtcggact
      121 tttttttgcg agacattcta tctcaagaag atcagatgtc ttacttaaac accgcagact
      181 ataatttgaa ttctccactc atttccgatg acatggttta tataattaaa agaatgaatc
      241 acgaggaggt tccccccatt tggagatcta aggagtggga ttctcctctt gatatgctta
      301 gaggttgtca agctcaaccc atgtctcacc aagagatgca caattggttt ggaacttgga
      361 ttcaaaatgt tcagcatgat tccgcccaac ccttcacatt cttaaaagag gtggacaaag
      421 agtcagaaat gacctatgac ttggtctcca cttttctgaa ggggtgggtg gggaaagatt
      481 acccattcaa gtcaaagaac aaagaaattg atagtatggc attagtcggt cctctttgtc
      541 aaaaattcct tgatttgcat aaaatcactc tgatattaaa tgcagtttct
//

GenBank-Updates@genbank.bio.net (04/11/90)

LOCUS       LEIKPCAA      234 bp ss-mRNA            ORG       28-MAR-1990
DEFINITION  L.tarentolae kinetoplast ATPase 6 edited mRNA, partial cds.
ACCESSION   M33229
KEYWORDS    ATPase 6.
SOURCE      L.tarentolae kinetoplast (strain University of California) cDNA to
            mRNA.
  ORGANISM  Kinetoplast Leishmania tarentolae
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Leishmania tarentolae.
REFERENCE   1  (bases 1 to 234)
  AUTHORS   Bhat,J.G., Koslowsky,D.J., Feagin,J.E., Smiley,B.L. and Kenneth,S.
  TITLE     An extensively edited mitochondrial transcript in kinetoplastids
            encodes a protein homologous to ATPase subunit 6
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by J.G.Bhat
            23-MAR-1990.
            
            Each uridine of the mRNA within the ranges specified ("t" in
            GenBank conventions) are the result of post-transcriptional
            editing.  The "u addition"s and "u deletions" are alterations of
            the maxicircle DNA.
FEATURES       from  to/span     description
    pept         32  >   234     ATPase 6
    site         33       48     post-transcriptional editing (u addition)
    site         52       64     post-transcriptional editing (u addition)
    site         70       77     post-transcriptional editing (u addition)
    site         28       29     post-transcriptional editing (u deletion)
    site         80       98     post-transcriptional editing (u addition)
    site         31       32     post-transcriptional editing (u deletion)
    site        108      127     post-transcriptional editing (u addition)
    site        130      136     post-transcriptional editing (u addition)
    site        139      212     post-transcriptional editing (u addition)
    site         66       67     post-transcriptional editing (u deletion)
    site        129      130     post-transcriptional editing (u deletion)
BASE COUNT       58 a      7 c     35 g    134 t
ORIGIN      
        1 tatataaaaa attatatcag attaagaaaa aatgtttgtt ttttttgttt gtgatttagt
       61 aattatgcgt attttattat gtttttgtta tagtgtttga tccagaatta tttttgtgtt
      121 attttataat gttttttata tttgtaccga gttaatgttt tgtatttttg atgtttattt
      181 atttgttggt ttatgtatgt ttatttgttt atgattcgtg ttatttaatt ttta
//
LOCUS       TRBKPEATA     828 bp ss-mRNA            ORG       28-MAR-1990
DEFINITION  T.brucei kinetoplast ATPase 6 edited mRNA, complete cds.
ACCESSION   M33228
KEYWORDS    ATPase 6.
SOURCE      T.brucei kinetoplast (strain EATRO 164) procyclic Istar1 serodeme
            cell line [isolated from Bos Taurus blood], cDNA to mRNA.
  ORGANISM  Kinetoplast Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Trypanosoma brucei.
REFERENCE   1  (bases 1 to 828)
  AUTHORS   Bhat,J.G., Koslowsky,D.J., Feagin,J.E., Smiley,B.L. and Kenneth,S.
  TITLE     An extensively edited mitochondrial transcript in kinetoplastids
            encodes a protein homologous to ATPase subunit 6
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by J.G.Bhat
            23-MAR-1990.
            
            Each uridine of the mRNA within the ranges specified ("t" in
            GenBank conventions) are the result of post-transcriptional
            editing.  The "u addition"s and "u deletions" are alterations of
            the maxicircle DNA.
FEATURES       from  to/span     description
    pept         34      678     ATPase 6
    site         33       59     post-transcriptional editing (u addition)
    site         67      132     post-transcriptional editing (u addition)
    site        135      170     post-transcriptional editing (u addition)
    site        172      173     post-transcriptional editing (u deletion)
    site        177      199     post-transcriptional editing (u addition)
    site        201      202     post-transcriptional editing (u deletion)
    site        205      245     post-transcriptional editing (u addition)
    site        253      256     post-transcriptional editing (u addition)
    site        258      259     post-transcriptional editing (u deletion)
    site        261      267     post-transcriptional editing (u addition)
    site        274      275     post-transcriptional editing (u deletion)
    site        276      278     post-transcriptional editing (u addition)
    site        280      281     post-transcriptional editing (u deletion)
    site        282      283     post-transcriptional editing (u addition)
    site        286      370     post-transcriptional editing (u addition)
    site        374      380     post-transcriptional editing (u addition)
    site        383      384     post-transcriptional editing (u deletion)
    site        384      402     post-transcriptional editing (u addition)
    site        407      408     post-transcriptional editing (u deletion)
    site        408      473     post-transcriptional editing (u addition)
    site        480      498     post-transcriptional editing (u addition)
    site        505      528     post-transcriptional editing (u addition)
    site        531      617     post-transcriptional editing (u addition)
    site        621      631     post-transcriptional editing (u addition)
    site        635      642     post-transcriptional editing (u addition)
    site        646      647     post-transcriptional editing (u addition)
    site        650      651     post-transcriptional editing (u deletion)
    site        654      657     post-transcriptional editing (u addition)
    site        660      661     post-transcriptional editing (u deletion)
    site        662      699     post-transcriptional editing (u addition)
    site        702      703     post-transcriptional editing (u deletion)
    site        703      775     post-transcriptional editing (u addition)
    site        776      777     post-transcriptional editing (u deletion)
    site        778      779     post-transcriptional editing (u addition)
    site        782      783     post-transcriptional editing (u addition)
    site        785      786     post-transcriptional editing (u deletion)
BASE COUNT      156 a     21 c    137 g    514 t
ORIGIN      
        1 aaaaataagt attttgatat tattaaagta aatatgtttt tatttttttt ttgtgattta
       61 ttttggttgc gtttgttatt atgtatgtat tattgtgtat gatctaggtt atgttttatt
      121 gtgtatttta attgtttaat gttgattttt gattttttat tattttgttt gtttgatttg
      181 tatttgtttg ttggtttgtg tttgttttta ttgttgtggt ttatgttgtt taatttatat
      241 agtttaattt tgtattattg tattacttat ttgaatttgt atttgttgtt ttgtattgtt
      301 tttttattgt atattgcatt tttatttttg ttttgttttt tatgtgattt ttttttgttt
      361 aataatttgt tagttggtga tagttttatg gatgtttttt ttattcgttt tttgttgtgt
      421 tttttagagt gtttttcttt gttgtgtcgt tgtttgtcga cgtttttgcg tttgttttgt
      481 aatttattat catcccattt tttattgttg atgttttttg atttttttta ttttattttt
      541 gttttttttt ttttatggtg ttttttgtta ttgatttatt ttatttattt ttgtgttttg
      601 tttttgttta ttattttatg tgtttttata tttgttggat ttatttgccg ccatattaca
      661 gttatttatt ttttgtaata tgattttgca gttgataatg gattttttgt tgtttttgtt
      721 gtttgtttag ttttgtattt gatttttgat agttattata ttgttgttga aatttggttt
      781 gttattggag ttatagaata agatcaaata agttaataat aaaaaaaa
//

GenBank-Updates@genbank.bio.net (04/11/90)

LOCUS       RATCA2        446 bp ds-DNA             ROD       13-NOV-1989
DEFINITION  Rat calcium/calmodulin-dependent protein kinase II alpha-subunit
            gene, 5' end.
ACCESSION   M29699
KEYWORDS    calcium/calmodulin-dependent protein kinase.
SOURCE      Rat DNA, clone c4.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 446)
  AUTHORS   Sunyer,T. and Sahyoun,N.
  TITLE     Sequence analysis and DNA-protein interactions within the 5'
            flanking region of the Ca-2+/calmodulin-dependent protein kinase II
            alpha-subunit gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 278-282 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Sunyer, 03-NOV--1989, for release after publication.
FEATURES       from  to/span     description
    pept        429  >   446     calcium/calmodulin-dependent protein kinase II
    mRNA        280  >   446     calcium/calmodulin-dependent protein kinase II
                                 mRNA (alt.)
    mRNA        282  >   446     calcium/calmodulin-dependent protein kinase II
                                 mRNA (alt.)
    signal      114      120     tata box
    binding     201      205     Sp1 binding site
BASE COUNT       96 a    126 c    134 g     90 t
ORIGIN      
        1 gatcacttgt ggactaagtt tgttcacatc ccgttctcca accccctcag tacatcaccc
       61 tgggagaaca aggtccactt gcttctgggc ccacacagtc ctgcagtatt gtgtatataa
      121 ggccagggca acggaggagc aggttttgaa gtgaaaggca ggcaggtgtt ggggaggcag
      181 ttaccggggc aacgggaaca gggcgtttcg gaggtggttg ccatggggac ctggatgctg
      241 acgaaggctc gcgagctgtg agcagccaca gtgccctgct cagaagcccc aagctcgtca
      301 atcaagctgg ttctccattt gcactcagga gcacgggcag gcgagtggcc cctagttctg
      361 ggagcagagt atcagcatcc cagtcctagt cccgagccta aagcctcgcc tgcctgccca
      421 gtgccaggat ggctaccatc acctgc
//

GenBank-Updates@genbank.bio.net (04/12/90)

LOCUS       ECOTRWA        76 bp ss-tRNA            RNA       15-SEP-1989
DEFINITION  E.coli Trp-tRNA.
ACCESSION   M24301
KEYWORDS    transfer RNA-Trp.
SOURCE      E.coli tRNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Eisenberg,S.P., Yarus,M. and Soll,L.
  TITLE     The effect of an Escherichia coli regulatory mutation on transfer
            RNA structure
  JOURNAL   J. Mol. Biol. 135, 111-126 (1979)
  STANDARD  simple staff_entry
COMMENT     In the TrpX mutation the base at position 37 ("a") is not modified.
            In the "uga" suppressor mutation base 24 "g" is changed to "a".
FEATURES       from  to/span     description
    tRNA          1       76     Phe-tRNA
    modified     16       16     d
    modified     17       17     d
    modified     20       20     d
    modified     32       32     cm
    anticdn      34       36     Phe-tRNA anticodon cca
    modified     37       37     ms2i6a
    modified     46       46     gm7
    modified     55       55     psi
BASE COUNT       14 a     21 c     24 g     17 t
ORIGIN      5' end of mature tRNA.
        1 aggggcgtag ttcaattggt agagcaccgg tctccaaaac cgggtgttgg gagttcgagt
       61 ctctccgccc ctgcca
//
LOCUS       ECOTRWSUP      76 bp ss-tRNA            RNA       15-SEP-1989
DEFINITION  E.coli Trp-tRNA suppressor mutant.
ACCESSION   M25066
KEYWORDS    transfer RNA-Trp suppressor.
SOURCE      E.coli tRNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Eisenberg,S.P., Yarus,M. and Soll,L.
  TITLE     The effect of an Escherichia coli regulatory mutation on transfer
            RNA structure
  JOURNAL   J. Mol. Biol. 135, 111-126 (1979)
  STANDARD  simple staff_entry
COMMENT     In the TrpX mutation the base at position 37 ("a") is not modified.
            In Trp-tRNA base 24 "a" is changed to "g".
FEATURES       from  to/span     description
    tRNA          1       76     Phe-tRNA
    modified     16       16     d
    modified     17       17     d
    modified     20       20     d
    modified     32       32     cm
    anticdn      34       36     Phe-tRNA anticodon cca
    modified     37       37     ms2i6a
    modified     46       46     gm7
    modified     55       55     psi
BASE COUNT       15 a     21 c     23 g     17 t
ORIGIN      5' end of mature tRNA.
        1 aggggcgtag ttcaattggt agaacaccgg tctccaaaac cgggtgttgg gagttcgagt
       61 ctctccgccc ctgcca
//
LOCUS       ECOTRWX        76 bp ss-tRNA            RNA       15-SEP-1989
DEFINITION  E.coli Trp-tRNA TrpX mutation.
ACCESSION   M25067
KEYWORDS    transfer RNA-TrpX.
SOURCE      E.coli tRNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Eisenberg,S.P., Yarus,M. and Soll,L.
  TITLE     The effect of an Escherichia coli regulatory mutation on transfer
            RNA structure
  JOURNAL   J. Mol. Biol. 135, 111-126 (1979)
  STANDARD  simple staff_entry
COMMENT     In the Trp-tRNA the base at position 37 ("a") is modified, whereas
            it is not modified in the TrpX mutation.  In the suppressor
            Trp-tRNA base 24 "g" is changed to "a".
FEATURES       from  to/span     description
    tRNA          1       76     Phe-tRNA
    modified     16       16     D
    modified     17       17     D
    modified     20       20     D
    modified     32       32     Cm
    anticdn      34       36     Phe-tRNA anticodon cca
    modified     46       46     Gm7
    modified     55       55     psi
BASE COUNT       14 a     21 c     24 g     17 t
ORIGIN      5' end of mature tRNA.
        1 aggggcgtag ttcaattggt agagcaccgg tctccaaaac cgggtgttgg gagttcgagt
       61 ctctccgccc ctgcca
//
LOCUS       HCYSA         313 bp ds-DNA             SYN       15-MAR-1990
DEFINITION  Human cystatin A gene.
ACCESSION   D00404
KEYWORDS    cystatin A; cysteine-proteinase inhibitor.
SOURCE      Artificial gene.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 313)
  AUTHORS   Kaji,H., Kumagai,I., Takeda,A., Miura,K. and Samejima,T.
  TITLE     Studies on chemical synthesis of human cystatin A gene and its
            expression in Escherichia coli
  JOURNAL   J. Biochem. 105, 143-147 (1989)
  STANDARD  simple staff_entry
COMMENT     Submitted in computer readable form by H. Kaji on 14-Apr-1989.  The
            expression plasmid for cystatin A was constructed by introducing
            the synthetic gene downstream of the tac promoter of an E. coli
            plasmid which is a derivative of pKK223-3 and the gene was
            expressed in E. coli JM109 without IPTG-induction.
FEATURES       from  to/span     description
    pept          6      302     cystatin A
    refnumbr      6        8     numbered codon 1 in [1]
BASE COUNT       96 a     74 c     83 g     60 t
ORIGIN      
        1 aattcatgat cccgggaggc ttaagcgaag ccaagccagc tacaccggag atccaggaga
       61 tcgtcgacaa ggttaagccg cagctcgagg agaagaccaa cgagacatat ggtaagcttg
      121 aggccgttca gtataagacc caggttgttg ccggcaccaa ctactacatc aaagttcgtg
      181 ctggcgacaa caaatacatg catctgaagg tgttcaaaag cttacctggc cagaacgaag
      241 atctggtact gactgggtac caggtcgaca agaacaagga cgacgagtta actggcttct
      301 gatgaattct gca
//
LOCUS       HUMPRG       1182 bp ss-mRNA            PRI       15-MAR-1990
DEFINITION  Human PRG gene (alt. hpg) encoding hemopoietic proteoglycan core
            protein, complete cds.
ACCESSION   X17042 M25538
KEYWORDS    hemopoietic PG core protein.
SOURCE      Human adult hematopoietic myeloid cell, cDNA to mRNA, clone D-D2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1182)
  AUTHORS   Stellrecht,C.M. and Saunders,G.F.
  TITLE     Nucleotide sequence of a cDNA encoding a hemopoietic proteoglycan
            core protein
  JOURNAL   Nucleic Acids Res. 17, 7523-7523 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            provided by C.M.Stellrecht, 27-JUL-1989.
            
            [1] Author Address: C.M.Stellrecht, University of Texas,
            M.D.Anderson Cancer Center, Department of Biochemistry and
            Molecular Biology, Box 117, 1515 Holcomb Blvd., Houston, TX, 77054.
FEATURES       from  to/span     description
    pept         25      501     proteoglycan core protein precursor
                                 /nomgen="PRG" /map="10"
                                 /hgml_locus_uid="LU0037W"
    sigp         25      105     proteoglycan core protein signal peptide
    matp        106      498     proteoglycan core protein
    signal      279      315     CS substitution signal
    rpt         304      357     Ser-Gly repeat
BASE COUNT      359 a    223 c    224 g    376 t
ORIGIN      Chromosome 10q22.1.
        1 gaattccgct agactaagtt ggtcatgatg cagaagctac tcaaatgcag tcggcttgtc
       61 ctggctcttg ccctcatcct ggttctggaa tcctcagttc aaggttatcc tacgcagaga
      121 gccaggtacc aatgggtgcg ctgcaatcca gacagtaatt ctgcaaactg ccttgaagaa
      181 aaaggaccaa tgttcgaact acttccaggt gaatccaaca agatcccccg tctgaggact
      241 gacctttttc caaagacgag aatccaggac ttgaatcgta tcttcccact ttctgaggac
      301 tactctggat caggcttcgg ctccggctcc ggctctggat caggatctgg gagtggcttc
      361 ctaacggaaa tggaacagga ttaccaacta gtagacgaaa gtgatgcttt ccatgacaac
      421 cttaggtctc ttgacaggaa tctgccctca gacagccagg acttgggtca acatggatta
      481 gaagaggatt ttatgttata aaagaggatt ttcccacctt gacaccaggc aatgtagtta
      541 gcatatttta tgtaccatgg ttatatgatt aatcttggga caaagaattt tatagaaatt
      601 tttaaacatc tgaaaaagaa gcttaagttt tatcatcctt ttttttctca tgaattctta
      661 aaggattatg ctttaatgct gttatctatc ttattgttct tgaaaatacc tgcatttttt
      721 ggtatcatgt tcaaccaaca tcattatgaa attaattaga ttcccatggc cataaaatgg
      781 ctttaaagaa tatatatata tttttaaagt agcttgagaa gcaaattggc aggtaatatt
      841 tcatacctaa attaagactc tgacttggat tgtgaattat aatgatatgc cccttttctt
      901 ataaaaacaa aaaaaaaata atgaaacaca gtgaatttgt agagtggggg tatttgacat
      961 attttacagg gtggagtgta ctatatacta ttacctttga atgtgtttgc agagctagtg
     1021 gatgtgtttg tctacaagta tgattgctgt tacataacac cccaaattaa ctcccaaatt
     1081 aaaacacagt tgtgctgtca atacctcata ctgctttacc tttttttcct ggatatctgt
     1141 gtattttcaa atgttactat atattaaagc agaaatataa cc
//
LOCUS       SUSMTA       4191 bp ds-DNA             INV       15-MAR-1990
DEFINITION  S.purpuratus metallothionein-A (SpMTA) gene, complete cds.
ACCESSION   M30606
KEYWORDS    metal binding protein; metallothionein; metallothionein-A.
SOURCE      S.purpuratus sperm DNA, clone lambda-MT206.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; 
            Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE   1  (bases 1 to 4191)
  AUTHORS   Harlow,P., Watkins,E., Thornton,R.D. and Nemer,M.
  TITLE     Structure of an ectodermally expressed sea urchin metallothionein
            gene and characterization of its metal-responsive region
  JOURNAL   Mol. Cell. Biol. 9, 5445-5455 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Nemer, 11-DEC-1989.
FEATURES       from  to/span     description
    pept        757      787     metallothionein-A, exon 1
               1908     1994     metallothionein-A, exon 2
               3080     3156     metallothionein-A, exon 3
    pre-msg     679     4154     MTA mRNA and introns
    IVS         788     1907     MTA, intron A
    IVS        1995     3079     MTA, intron B
    IVS        3189     3738     MTA, intron C
    signal      652      656     TATA box
BASE COUNT     1194 a    758 c    821 g   1391 t     27 others
ORIGIN      
        1 gtctacgtaa tttagtatgg aaaaataacc attttatcat acaggttgat catcaatata
       61 tcattatgca gtgaacacca tgaatatttc gaaatggagg gatagttgca tttctggcgg
      121 agatatccgg gtcgggaatc atctactttt attccccctc cccatttttt tcttcttctc
      181 cttgttcttc ttctttttcg tttcttaatc atggtattaa tcatttgcaa atcaattaca
      241 tgcctttata tatatataat aatcatggtg ctacatgatt ttgaaatact ttatggaaac
      301 cttacgtttt accagacctt gacacaaagt tgacagtgtt acctttatca ttcacttccg
      361 tactgtttca gcacatgagc aacactttgt tctttgaggg caaacgtttt gggggagggg
      421 agggcatccc acggcgcttt aaaaacaaaa aggatttttg cttcccttcc atatcccgta
      481 taatttatcc gacacgagcc tagtcatcac cattcctgat agaagcgtgt gcaaaactag
      541 aaaggcccac ctagatcgag ctcctcaggc accgctaaca ggttgcctag ctcgtgccta
      601 aacgtggtgt gtttacctat ttgtttgcac acggatattg taatgaaaac atataaaacc
      661 ccgaatttca ccagtttttg tcacaagctg aagcgtttga gtgtgattta cattaccttc
      721 tgaagaaaca tacaactcga gttcaatttc atcaccatgc ctgatgtcaa gtgtgtctgc
      781 tgcaagggta agcaaaaaat cggaatcaat gttctcatta aaataccgag ttacagaata
      841 ccgtgtacgt acggtgtagc gtttctagaa acaaaccgct cgatgttacg tgcatacgat
      901 gatacgatac agccatggcc atgaccatga gaggcccccg atcacgacac acgcagctag
      961 ctagaccgaa ttttccgtct ctttcgttcg ttaagcctct tatttgggta gggtcggctc
     1021 tctctttaga gacggaagcc gactagacct agggcgcgtt catgcttgat tttgaaggca
     1081 gaatgagcga tttgtaacgc tcatcgtaaa cgctcatcgt aatcgcagtt gcacggtgca
     1141 ctgttcatgc tcacttctcc aggcaattct ccttgggtgc cttcagctgg attcgcaaac
     1201 ctgtgtgcgc atgctacgag agcccataga ggtcagtgcg ttgaccccat ttaaacgcgt
     1261 ttacaaattg taaacgcatc tttttgtgag tttacgatcg gcgatcatga atcggctttt
     1321 tcggtgattc tacaaaatca agcatgaacg cgcccctata ctcattccct tgctaacacc
     1381 aaggtgcatt tatttttann nntatnccna naannncccc aaatgtcatt ttacctcgtc
     1441 gcatgttata ttgtaaatta ggcctaggct ataataaatc ggttccactt accaacttga
     1501 ctgaaataat ttaatggtcc ctttttagga aattatgggt ctgaaatcga aatccatgtt
     1561 gaaaatcttg caaaacgtat tttaactctc gctgcgcacg acgtaatcta aatctcatcc
     1621 cttctagtct acaccgaagg caaaatgatt atgaaggaaa aactgtgcgc gagcttcacg
     1681 ttcgttatca cagtttttct tcataatcat tttcatgttt tcttgcccgt ttcttcatac
     1741 ttgttacatc aacagttatg tcaataacta agtttttacc aacaagcact gctggatttg
     1801 tggtttcaaa ttgcaagatt gttattactt ttttaaaatt ttatatcatt atatagacta
     1861 tatatatata tatatatata tatttttttt tttaattctg aacacagagg gcaaggagtg
     1921 tgcctgcttt ggacaagatt gctgtaaaac tggtgaatgc tgcaaggatg gaacctgctg
     1981 tggaatatgc acaagtaagc ctttcatcat tttcagtgaa ccctttaaaa ttaaagcagg
     2041 cgcgtacgag gattttttaa gggggggggg ggggggttta acatttataa atcgggccga
     2101 aaattctgca acgactcagc cgaccgctga ataagtgnng gggggggggg gggtagtaag
     2161 aggacacttt ttattttttt ttggtctcga aattagaaaa tttgacattt tgctccgttg
     2221 gggggggggg ggggaaggtc agctttttta ggtcagccat gggaattttt ttttttaata
     2281 aaaaaaaaaa acatccagan nnnggggggg ggtttttaag gaaatttagg gggggggttt
     2341 acacacaaaa aattaccaaa gggtgggggt ttaaccnnnn nnnncccctg tgtaagcncc
     2401 tgcatcccag taattaaggc ttttttatac taaactgttt acacaaggct tttaaaagta
     2461 tttatattca tagaaaaaaa ttgtcccaat tgctcatgca tttttttaat ttctttttac
     2521 tctcaccttt aaatgggtgc aggcttaaat gtcaggctaa ctggcaatga ttatttgtgc
     2581 tcattatcaa tattctagtg tatccatggt tttgttatgg tcgggggggg gggatctgac
     2641 aaaaaatatt gacagccaag aatataggtc tatatgattt gtcagggaga tatgaatttc
     2701 acatctccta ggttttattg ttgttgtcac caattaaatt attttaaaaa tcactgacaa
     2761 gcttggaaaa ggattttgaa aataatttaa aacatgttca tctcattttc ctggtgaatt
     2821 ttgggtctga aaagcgctat ataagaacta gctattatta tatatatata tatatatata
     2881 taattagatt ttttattttt ttggaggggg gggggggggn ctttttccat gtatgcctaa
     2941 atgaggggcg cagaccaatt gtgcccttta tatatcctgc tgtgtagtct gcatgaaaga
     3001 gaaaatgcag ttataccaac aatagaatag agacatttca atttgatgtt attacttggt
     3061 taattttatt tgtccacaga cgctgcgtgc aagtgtgcta atggctgcaa gtgtggaagc
     3121 ggctgctcat gcactgaggg gaactgtgca tgctagacga gcttggccct ttgctatagg
     3181 gatggcaggt gagattttat ttattttcat ggctacatag taaaaattgt ttccaggcaa
     3241 aaacgttagc tagttatggg atatgacagg gtctgaaatg taagactgta gatttgcact
     3301 gcataaagct gtgccatcat ttttttttaa atagaatttt ccgagagagc atcaaaatta
     3361 atgtccaaat gtcgaaagcc aactgaatat gttaaatatt ggctgagatg tgaccatctg
     3421 aataaatagg ttccctatag gaactcagtg ttattttgct gtggttcata acatcacgna
     3481 ccagagcaaa ataacactga gttcctacac aaaatctatt tattacatta ctgcattgcg
     3541 atatacaaaa aaggtaaaat tagaaaatct ttgcaagtta tggcaatatg tatagacctt
     3601 ccgtgtcttt catgactttt gtatattttt ttcaacaacc ggaacaaatt cattaaccat
     3661 ttctgtgttt tcatactttt ttgtaaaaat attttcaagc attttgtttt tctctccgtg
     3721 tctgtttctc ttctacagat tgagcataca gctgatgaag tcactcagcg catctgccaa
     3781 catgctccaa gcagacctga ctttgtatat actttcgacc cgatgtatta tattttttgt
     3841 tgtaagatag ttgtatatta agttaattta atctttattt ttttctgaaa atatgagact
     3901 aattggcatt aataggacta cttaaacaac aattatttat taaggctgac agtaatgcct
     3961 gatagaattt cacttcattt tttgctgccc tgttttgtaa attcttcagt ctgtttctaa
     4021 cttctatcaa gttgcaaatt tggaatgaaa ataataataa taatacatgt ggagagtact
     4081 tgtagtgcgc aaactccaca tgactaatgt gttaaagact cgctcatgat taaatttaaa
     4141 taatgctaca aagtatattc agtatgcgtt gcttagtttt ttctcatgat t
//
LOCUS       SYNCYSTA      318 bp ds-DNA             SYN       15-MAR-1990
DEFINITION  Artificial cystatin alpha gene.
ACCESSION   X13226
KEYWORDS    artificial gene; cystatin; cysteine-proteinase inhibitor.
SOURCE      Artificial gene.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Katunuma,N., Yamato,M., Kominami,E. and Ike,Y.
  TITLE     Total synthesis of the cystatin alpha gene and its expression in E.
            coli
  JOURNAL   FEBS Lett. 238, 116-118 (1988)
  STANDARD  simple automatic
COMMENT     *source: library=pBR322; clone=pTPI-010;
            
            Data kindly reviewed (23/5/89) by Katunuma N.
FEATURES       from  to/span     description
    pept          6      317     cystatin alpha (AA 1 - 103)
BASE COUNT       99 a     63 c     76 g     80 t
ORIGIN      
        1 aattcatgga cccgggtact actggtatcg ttggtggtgt ttctgaagct aaaccggcta
       61 ctccggaaat ccaggaagtt gctgacaaag ttaaacgtca gctggaagaa aaaactaacg
      121 aaaaatacga aaaattcaaa gttgttgaat acaaatctca ggttgttgct ggtcagatcc
      181 tgttcatgaa agttgacgtt ggtaacggtc gtttcctgca catgaaagtt ctgcgtggtc
      241 tgtctggtga cgacgacctg aaactgctgg actaccagac taacaaaact aaaaacgacg
      301 aactgactga cttctagg
//

GenBank-Updates@genbank.bio.net (04/12/90)

LOCUS       YSCPDC1UP    1193 bp ds-DNA             PLN       15-MAR-1990
DEFINITION  Yeast PDC1 gene for pyruvate decarboxylase (EC 4.1.1.1.).
ACCESSION   X14016
KEYWORDS    PDC1 gene; pyruvate decarboxylase.
SOURCE      yeast (Saccharomyces cerevisiae).
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1193; enum. 1 to 1193)
  AUTHORS   Butler,G. and McConnell,D.J.
  TITLE     Identification of an upstream activation site in the pyruvate
            decarboxylase structural gene (PDC1) of Saccharomyces cerevisiae
  JOURNAL   Curr. Genet. 14, 405-412 (1988)
  STANDARD  simple automatic
COMMENT     *source: clone=YIpDC1-2; The sequence overlaps with that reported
            by Kellermann et. al.  in Nucl. Acids Res. 14:8963-8977(1986)
            X04675.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE        555    569       RPG box (upstream activation site)
FEATURES       from  to/span     description
    pept       1191  >  1193     pyruvate decarboxylase
BASE COUNT      362 a    251 c    206 g    374 t
ORIGIN      
        1 gatcaagcga gtccatcggc gacctgccca ccgctaaact ctcttaacat acggccgtac
       61 agaaaaaacg gcactggagt gagcagtgtc tccgacgatt tggatatcga tttcaacgat
      121 agttttgcct cagaagaaag ttataaacgt gccgctgcag tgacatctac cctaaaggcc
      181 agaattgaaa aaatgaaggc caaatcaagg cgggaaggga caaccaggac gtaaagggta
      241 gcctccccat aacataaact caataaaata tatagtcttc aacttgaaaa aggaacaagc
      301 tcatgcaaag aggtggtacc cgcacgccga aatgcatgca agtaacctat tcaaagtaat
      361 atctcataca tgtttcatga gggtaacaac atgcgactgg gtgagcatat gttccgctga
      421 tgtgatgtgc aagataaaca agcaagacag aaactaactt cttcttcatg taataaacac
      481 accccgcgtt tatttaccta tctttaaact tcaacacctt atatcataac taatatttct
      541 tgagataagc acactgcacc cataccttcc ttaaaaacgt agcttccagt ttttggtggt
      601 tctggcttcc ttcccgattc cgcccgctaa acgcataatt ttgttgcctg gtggcatttg
      661 caaaatgcat aacctatgca tttaaaagat tatgtatgct cttctgactt ttcgtgtgat
      721 gaggctcgtg gaaaaaatga ataatttatg aatttgagaa caattttgtg ttgttacggt
      781 attttactat ggaataatca atcaattgag gattttatgc aaatatcgtt tgaatatttt
      841 tccgaccctt tgagtacttt tcttcataat tgcataatat tgtccgctgc ccgtttttct
      901 gttagacggt gtcttgatct acttgctatc gttcaacacc accttatttt ctaactattt
      961 tttttttagc tcatttgaat cagcttatgg tgatggcaca tttttgcata aacctagctg
     1021 tcctcgttga acataggaaa aaaaaatata taaacaaggc tctttcactc tccttggaat
     1081 cagatttggg tttgttccct ttattttcat atttcttgtc atattctttt ctcaattatt
     1141 atcttctact cataacctca cgcaaaataa cacagtgcaa atcaatcaaa atg
//

GenBank-Updates@genbank.bio.net (04/12/90)

LOCUS       APH5A10A      721 bp ss-RNA             VRL       21-FEB-1990
DEFINITION  Foot and mouth disease virus 5' non-coding region, typeA-12.
ACCESSION   M32257
KEYWORDS    .
SOURCE      Foot and mouth disease virus (serotype A-12) cDNA to viral RNA.
  ORGANISM  Foot and mouth disease virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Picornaviridae; Aphthovirus.
REFERENCE   1  (bases 1 to 721)
  AUTHORS   Clarke,B.E., Brown,A.L., Currey,K.M., Newton,S.E.,
            Rowlands,D.J. and Carroll,A.R.
  TITLE     potential secondary and tertiary structure in the genomic rna of
            foot and mouth disease virus
  JOURNAL   Nucleic Acids Res. 15, 7067-7079 (1987)
  STANDARD  simple staff_entry
BASE COUNT      178 a    202 c    181 g    160 t
ORIGIN      
        1 ccccctaagt tctaccgtcg ttcccgacgc aaaagggagg taaccacaag cttgacgccg
       61 tctttcccga cgttaaaggg tagtaaccgc aagcttctac cgcccatccc ggcgttaaaa
      121 gggtgtaacc acaagatgca ccttcgtccc ggaagtaaaa cggcaatttc acacagtttt
      181 gcccgttttc acgagaaacg ggacgtctgc gcacgaaacg cctgtcgctt gaggaggact
      241 tgtacaaaca cgatctaagc aggtttcccc aactgacaca aaccgtgcaa tttggaactc
      301 cgcctggtct ttccaggtct agaggggtga cactttgtac tgtgtttggc tccacgctcg
      361 gtccactggc gagtgttagt aacagcactg ttgcttcgta gcggagcatg atggccgtgg
      421 gaactcctcc ttggtaacaa ggacccacgg ggccgaaagc cacgtccaat cggacccatc
      481 atgtgtgcaa ccccagcaca gcaacttttc tgcgaaactc acttcaaggt gacactgata
      541 ctggtactca aacactggtg acaggctaag gatgcccttc aggtaccccg aggtaacacg
      601 cgtcactcgg gatctgagaa ggggactggg gcttctataa aagcgtccag gttaaaaagc
      661 ttctatgcct gaataggtga ccggaggccg gcaccttttc tttacagcca ctgactttat
      721 g
//
LOCUS       APH5A10B      685 bp ss-RNA             VRL       21-FEB-1990
DEFINITION  Foot and mouth disease virus 5' non-coding region, typeA-10.
ACCESSION   M32258
KEYWORDS    .
SOURCE      Foot and mouth disease virus (serotype A-10) cDNA to viral RNA.
  ORGANISM  Foot and mouth disease virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Picornaviridae; Aphthovirus.
REFERENCE   1  (bases 1 to 685)
  AUTHORS   Clarke,B.E., Brown,A.L., Currey,K.M., Newton,S.E.,
            Rowlands,D.J. and Carroll,A.R.
  TITLE     potential secondary and tertiary structure in the genomic rna of
            foot and mouth disease virus
  JOURNAL   Nucleic Acids Res. 15, 7067-7079 (1987)
  STANDARD  simple staff_entry
BASE COUNT      169 a    194 c    174 g    148 t
ORIGIN      
        1 ccccctaagt tttgccgtcg tttccgacgt aaaaggaagt aatcacaagc ttgacaccgc
       61 ctgtcccggc ggtaaaggga agtaaccgca agcttactac cgcctacccc ggcgttaacg
      121 gggaacaacc acaaacacac ttttgcccat ttttgcgaga aatgggacgt ctgcgcacga
      181 aacgcgccgt cgcttgagga ggacttgtac aaacacgatc tacgcaggtt tccccaactg
      241 acacttaccg tgcaacttga aatcccgcct ggtctttcca ggtctagagg ggtgacaatt
      301 tgtactgtgg ttgactccac gctcggtcca ctggcgagtg ctagtagcag cactgttgtc
      361 tcgtagcgga gcatgatggc cgtgggaatc cctccttggt aacaaggacc cacggggcca
      421 aaagccacgt cctaacggac ccatcatgtg tgcaacccca gcacggcgac tttcctgtga
      481 aacccacttt aaggtgacac tgatactggt actcaaacac tggtggcagg ctaaggatgc
      541 ccttcaggta ccccgaggta acacgcgaca cttgggatct gagaagggga ctgaggcttc
      601 tttaaaagcg ctcggtttaa aaagcttcta cgcctgaaca ggtgaccgga ggccggcacc
      661 tttctctaca accactgata ttatg
//
LOCUS       CHKHMG17     1327 bp ss-mRNA            VRT       15-MAR-1988
DEFINITION  Chicken mRNA for high mobility group protein HMG-17.
ACCESSION   Y00416 M32256
KEYWORDS    DNA binding protein; high mobility group protein.
SOURCE      chicken (Gallus gallus).
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 1327; enum. 1 to 1327)
  AUTHORS   Landsman,D.
  TITLE     ;
  JOURNAL   Unpublished (1987) NIH/NCI, Bethesda, MD 20892, USA.
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 1327; enum. 1 to 1327)
  AUTHORS   Landsman,D. and Bustin,M.
  TITLE     Chicken Non-Histone Chromosomal Protein HMG-17 cDNA Sequence
  JOURNAL   Nucleic Acids Res. 15, 6750-6750 (1987)
  STANDARD  simple automatic
COMMENT     *source: developmental stage=embryo.
FEATURES       from  to/span     description
    pept        175      447     high mobility group protein HMG-17 (AA 1-90)
BASE COUNT      355 a    320 c    309 g    343 t
ORIGIN      
        1 gaattccgcc agcgcagcga gccggccgcc agccccgccg cgccgccccg ctctccccct
       61 cggccctccc ccgcttctcg ccgccaccga gcgagcccgg ctgcccgccc ccgcccgccc
      121 cctccgctcg ctctctccct cctcgcacaa cacacgcacg cgccgcccgg agctatgccg
      181 aagagaaagg ctgaaggaga taccaagggc gataaggcca aagttaagga tgagccacaa
      241 cggagatcgg caaggttatc tgctaaacct gcccctccga agccagagcc taaacctaaa
      301 aaggcagctc caaagaagag tgagaaggtg cccaagggaa agaaggggaa agctgatgct
      361 ggcaaggagg gaaacaaccc tgcagaaaat ggagatgcca aaacagacca ggcacagaaa
      421 gccgaaggtg ctggtgatgc caagtaaaat gtgtgaattt ttgataactg tgtacttctg
      481 gtgactgtac agtttgaaat actatttttt atcaagtttt ataacaatgc agaattttgt
      541 tttacttttt tttaagctat gttgttagca cacagaccgc tttgttgttg tgttttgagg
      601 ggggggcagt gggacaaacg tcacttaatc tgtttcttgg aacctaaatt ttaaaagttt
      661 accccttccc agttttttag aaggactctt cctaaatgga gcaggaaggg attccttcgt
      721 gctgcacacc tcttccgttt tgtggaccgc atcagagtga acggaagctc ccgagatgcc
      781 tgttgccaac ttcagaactg cagtttgcag tgccctctgc gtttcctttc atgccctccc
      841 tttttgccta gagcctatca ctccgaaata cagcagacat ggcatgttgg gactcaccac
      901 tctaaatgca ttgtcaggtg atctggactt ctggtgtcta atttgggata taatagctct
      961 aaaaggagct gcatttcctc tttcatattg tagatctaca gattaaggaa tctgcagttt
     1021 ttaatttttc ctcgcaaagt cagggtagat ttgtgaagag ttgttaaaca acatgctaaa
     1081 tgtgaaagtg tccgccctca ctctaaacat ttccctctac aagtatacaa aaatgaagat
     1141 ttgtcggttt tatagcaacc tttatgtttg ggtagtccat gaagggaggg gagtttgaca
     1201 gttgttgtaa aatgttgcag attgtagccc atgtcctgcc taaattacca tgattgttta
     1261 tgaaaagtac ctttaataaa gctggatacg gtttggcttg gaaaaaaaaa aaaaaaaaac
     1321 ggaattc
//
LOCUS       HUMSAU3E       50 bp ds-DNA             PRI       28-FEB-1990
DEFINITION  Human Sau3A related sequence, clone pUCXba30.
ACCESSION   M32265
KEYWORDS    alphoid-like repetitive sequence; repetitive sequence.
SOURCE      Human DNA, clone pUCXba30.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Okumura,K., Kiyama,R. and Oishi,M.
  TITLE     Sequence analysis of extrachromosomal Sau3A and related family DNA:
            Analysis of recombination in the excision event
  JOURNAL   Nucleic Acids Res. 15, 7477-7489 (1987)
  STANDARD  simple staff_entry
BASE COUNT       16 a     11 c      7 g     16 t
ORIGIN      
        1 tgaaactctc tttctttgga ttctgcatct aaaaaccaaa cggaagcatt
//
LOCUS       HUMSAU3F       88 bp ds-DNA             PRI       28-FEB-1990
DEFINITION  Human Sau3A related sequence, clone pUCXba25.
ACCESSION   M32266
KEYWORDS    alphoid-like repetitive sequence; repetitive sequence.
SOURCE      Human DNA, clone pUCXba25.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Okumura,K., Kiyama,R. and Oishi,M.
  TITLE     Sequence analysis of extrachromosomal Sau3A and related family DNA:
            Analysis of recombination in the excision event
  JOURNAL   Nucleic Acids Res. 15, 7477-7489 (1987)
  STANDARD  simple staff_entry
BASE COUNT       22 a     18 c     21 g     27 t
ORIGIN      
        1 cagcagtttc caatcactct ttctgtggaa tctgcaagtg atatttgggc ctctctgagg
       61 atttcgttgg aaacgggata aaatgcac
//
LOCUS       HUMSAU3G     1192 bp ds-DNA             PRI       28-FEB-1990
DEFINITION  Human Sau3A related sequence, clone pUCXba34.
ACCESSION   M32267
KEYWORDS    alphoid-like repetitive sequence; repetitive sequence.
SOURCE      Human DNA, clone pUCXba34.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1192)
  AUTHORS   Okumura,K., Kiyama,R. and Oishi,M.
  TITLE     Sequence analysis of extrachromosomal Sau3A and related family DNA:
            Analysis of recombination in the excision event
  JOURNAL   Nucleic Acids Res. 15, 7477-7489 (1987)
  STANDARD  simple staff_entry
BASE COUNT      328 a    239 c    273 g    352 t
ORIGIN      
        1 cctgtggtgg aaaaggaact atctactcat aaaagctaga tggaagcatt ctcagaaact
       61 actttgtgat gattgctttc aggtcacaga gttgaacatt ccctttgata gagccgtttg
      121 gagacacaat tttggttcaa tcggcaaggg gagatttgga ccgctttgag gcctatgcga
      181 gtagaggaaa tcactgccca taaaaactag acagcagcat tctcaggaaa cactttgtga
      241 cgattgagtt gaactcacag agcggaacat tcctttggat ggagcagttt tgaaacacac
      301 tttttgtaga atctgcaagt ggacagttgg acctccctga ggatttcttt ggaaacggga
      361 taacgtcacc taactaaaca gaagctttcg cagaaacttc tttgtgacgt ttgcattcaa
      421 agtccagagt tgaaccttcc cttgatagtt cacgtttgaa acactctttt tgtaggatct
      481 gcaagtggat attgggagca ctttgaggcc ttcgttcgaa atgggtatat cttcacataa
      541 aatccagaca gaagccttct cagaaacttc tctgtgatga ttgcatgcaa ctcacagagt
      601 tgaacattcc tttgcataga gcagtttcga aactctcttt tttctgaatc tgcacatgga
      661 taggtgcaac tctgtgaaga tttccttgga aacgggaata tcttcactta aagagtaaac
      721 ggatgccttc tcagaaactt ctttgtgagg catgtgttca actcccagac tttaaccttg
      781 cttttcatag agcagttttg aaacattctt ttcgtagagt ctccaagtgg acgtttggag
      841 cgctttcagg cctgtggtgg aaaaggaaat atcttcacct caaaactaga gagaagcatt
      901 gtcagaaacg tctttgtgat gatggcattc aactcacgga gttgaaggtt ccttttgata
      961 cagcagtttg gaaacactct ttcagtggga cctgcaagcg gatatttggg gctctttgga
     1021 gatttcgatg gaaaaggcat aatcttccca taaaagctaa acggaagcgt gctcagagcc
     1081 ttcttggtga tgtttgcatt caactcacag agttgtactt tcctttcgac agagcagctt
     1141 tgaaaccctc tctttctaga atctgcaagt ggacatttgg agggcttcga gg
//
LOCUS       HUMSAU3H       91 bp ds-DNA             PRI       28-FEB-1990
DEFINITION  Human Sau3A related sequence, clone pUCXba34.
ACCESSION   M32268
KEYWORDS    alphoid-like repetitive sequence; repetitive sequence.
SOURCE      Human DNA, clone pUCXba34.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 91)
  AUTHORS   Okumura,K., Kiyama,R. and Oishi,M.
  TITLE     Sequence analysis of extrachromosomal Sau3A and related family DNA:
            Analysis of recombination in the excision event
  JOURNAL   Nucleic Acids Res. 15, 7477-7489 (1987)
  STANDARD  simple staff_entry
BASE COUNT       20 a     15 c     26 g     30 t
ORIGIN      
        1 gttgaaggtt ccttttgata caacagtttg gaaacactct ttcagtggga cctgcaagcg
       61 gatatttggg gctctttgga gatttcgatg g
//
LOCUS       NEUMTINT1    1377 bp ss-mRNA            ORG       27-FEB-1990
DEFINITION  N.crassa mitochondrial RNA precursor intergenic region, segment 1.
ACCESSION   M32263
KEYWORDS    .
SEGMENT     1 of 3
SOURCE      N.crassa mitochondrial cDNA to precursor RNA.
  ORGANISM  Mitochondrion Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa.
REFERENCE   1  (bases 1 to 1377)
  AUTHORS   Agsteribbe,E. and Hartog,M.
  TITLE     Processing of precursor RNAs from mitochondria of Neurospora crassa
  JOURNAL   Nucleic Acids Res. 15, 7249-7263 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        9     ATPase 6 (AA at 1)
    pept       1369  >  1377     MAL protein
BASE COUNT      456 a    213 c    277 g    431 t
ORIGIN      
        1 ctacattaga acagtcctgt gcagtccttt aacaaaatta cgctgcttta ggtgacacct
       61 gaaagcggtc caccctcctc ttacgtatcc gaaggatact aaaggtcccc aatatcaagt
      121 taaagtcaca tatgcaatca aatagttaaa cgtctcgttg taacttgata agattactct
      181 attcgaggtc tcttatagca aatagagaga ggaatgccca ctcgcaaacg ggttcggtac
      241 ttaaaatgaa cagagatacg caagttcgca tctgaaaaag tggacggaga aggaacttat
      301 ggccactcta ccgagtggtt gtagcaaatg cagaatcgac ccgctatagg tttaacttct
      361 cctaggatgt agtttgggag gcacgcccaa actataaaag tgctaagggg taggaaggtt
      421 cgaatcctga gttcccacca ccatgggtgg cgtgcctcga gaagggggcg tgcctctgga
      481 tagcttgcgc ttattttttt ttttagccgt ccccccgccc tgcagtactg cagggggggg
      541 aagctaaaaa aaattaaaga ggtggccaga ggctaaaaaa aaaaatgcta gctttagcta
      601 aaaaaaaaat aaccgataaa ccaaattttt aatagcatat atagacgact aatttaatta
      661 tattagtctg agcggagcta agctgtgcgg attaggatta ggattaatcc taatcgtact
      721 taatgattat ctttaataat atgaaataaa tatgaatatt tatttcacca tgtaacatta
      781 ataatgttaa tgtgtgattt aatttggtac gtatcacgta aggggaatga tggatgatac
      841 attatgctat gtaatgatgt aatgtaatgt aatgtaatgt tatatccact atgtactgca
      901 aatgtaatca tctgatagat aatataaaat gtaacttaat gtaacttaac ttaatgtaat
      961 gtaatgtaac ttaatgtaac ttaatgttat gtaactgtaa tgtaatattt tgtaatgtaa
     1021 ctgtaatgta acttaatata atgtaactgt aatgtaatat aatgtacagt aatgtaactg
     1081 taatgtaatg ttacattaca tgtaatgtaa tgtaatgtat tgtaatgtaa tctttatatg
     1141 taatgtacta aagattacga tagctagcca gttatatact ggctagctat cgtatgatta
     1201 ttgtagccat tagggatagc ctaatggcta cttgattatc attagatata ctgcaacagc
     1261 agtaaataca ctgttcggga gcctcagtgg tatttgctgc tttgcggtgg tggaattgaa
     1321 gaggttacta gattctagtt aatttttgat tccaccattg tatatataat gatacaa
//
LOCUS       NEUMTINT2     712 bp ss-mRNA            ORG       28-FEB-1990
DEFINITION  N.crassa mitochondrial RNA precursor intergenic region, segment 2.
ACCESSION   M32264
KEYWORDS    .
SEGMENT     2 of 3
SOURCE      N.crassa mitochondrial cDNA to precursor RNA.
  ORGANISM  Mitochondrion Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa.
REFERENCE   1  (bases 1 to 712)
  AUTHORS   Agsteribbe,E. and Hartog,M.
  TITLE     Processing of precursor RNAs from mitochondria of Neurospora crassa
  JOURNAL   Nucleic Acids Res. 15, 7249-7263 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        9     MAL protein (AA at 1)
    pept        704  >   712     COII protein
BASE COUNT      199 a    149 c    175 g    189 t
ORIGIN      225 bp after segment 1.
        1 gttgcatagt aatgatagag attaggaata aaatgatgta tggctgctta aataaaaaat
       61 atggctgctc ttcctgctta gctagctagc tagcttcgag ggaccccacg aatacggccc
      121 acacacgaag gtgtgtgggg tattcgtggg gggccccctc ctacttcctg cctacgaagt
      181 ctaggcaggc aggaagcagc cttaataatt tagctagcca taatatcgcg accaccctcc
      241 tccactcaac cccccccttg gcggccccca tctttccccc gccctgcaat actgcagggg
      301 ggcataatta aatgtgatgg ggaccttgag caaaggaggg ggtacgcgag tggaggaggg
      361 tggtactttt ttgggaggga agctgagcga agctgagcga agctgagcga agtgagcgaa
      421 gtgtgcgaag ctgacggaag gtaattcatt tttttgggcc cccgccccga agggcggtcc
      481 cccgaaaaaa caaaggacgc tagcgtagcc taaccttatg agcagcaggc tataaaaacc
      541 cattaggcac agggtatact tcgtttagtg gataagaatt agcctagcct taatatacaa
      601 tacagtttaa tgttatattt aaattctata ttttatgaaa agattataat atatttatta
      661 ttatggttta atattcaata tagaatattc tatttctttt tttatgggat ta
//
LOCUS       NEUMTINT3     691 bp ss-mRNA            ORG       28-FEB-1990
DEFINITION  N.crassa mitochondrial RNA precursor intergenic region, segment 3.
ACCESSION   M33670
KEYWORDS    .
SEGMENT     3 of 3
SOURCE      N.crassa mitochondrial cDNA to precursor RNA.
  ORGANISM  Mitochondrion Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa.
REFERENCE   1  (bases 1 to 691)
  AUTHORS   Agsteribbe,E. and Hartog,M.
  TITLE     Processing of precursor RNAs from mitochondria of Neurospora crassa
  JOURNAL   Nucleic Acids Res. 15, 7249-7263 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        9     COII protein (AA at 1)
BASE COUNT      181 a    154 c    198 g    158 t
ORIGIN      754 bp after of segment 2.
        1 gaacagtaag acgataaaat agtttttggc taaataaagg aactcggggg accccacgaa
       61 tacggcccac acacgaaggt gtgtggggta ttcgtggggc cctatgtatt tgagtatgaa
      121 gccatctagt ggttatctct cccccccctc cccgcaggtt tgcggggggg agcgaggggg
      181 tagcgataaa aagtttctaa tatgaagagg atgacttatg gaattaactc cacccccccc
      241 ccccttaagg gggggagtgg agtggaacgg aatgtggaga cgccgcgtgc cccccgcaaa
      301 tttgcggggg gggcgggtgg atggaacatc cgggaggtgg tgggttatat tataaacaat
      361 attcggggga accccgcccc ttcgggggcg gaggccatta actggaaagc acccccccct
      421 taaggggggt ggcttttaat tattattatt tttataggtt aattaggttt aaaaggctac
      481 ccgccccccc ccctgcagta ctgcagggcg gggaagggtt aaaataacaa gcaagttcct
      541 gaagtctggc gtagctagcg gaaggaacct tgccttaaat ccaaaaacct tttaaagagt
      601 taccaagact gaaccggggg ctagcccccg attaggtaat taataagttt atgtatatct
      661 agtaagggta taactctaat cccctgcagt a
//
LOCUS       PMUMUC1       965 bp ds-DNA             PHG       15-MAR-1990
DEFINITION  Bacteriophage mu C operon 5' region
            involved in late gene transcription.
ACCESSION   Y00419 M32261
KEYWORDS    C operon; activator protein; late gene transcription;
            regulatory protein; trans-activator.
SOURCE      Bacteriophage mu DNA, clone pWM6.
  ORGANISM  Bacteriophage mu
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 965)
  AUTHORS   Stoddard,S.F. and Howe,M.M.
  TITLE     DNA sequence within the mu C operon
  JOURNAL   Nucleic Acids Res. 15, 7198-7198 (1987)
  STANDARD  simple automatic
REFERENCE   2  (bases 161 to 360)
  AUTHORS   Stoddard,S.F. and Howe,M.M.
  TITLE     Characterization of the C operon transcript of bacteriophage mu
  JOURNAL   J. Bacteriol. 172, 361-371 (1990)
  STANDARD  simple staff_entry
COMMENT     The mu C gene product is a transactivator of mu late transcription.
            see also X03992 for Mu C gene
            
            Data kindly reviewed (27-OCT-1987) by Howe M.M.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE         53     43 (C)   pot. MuA binding site
               SITE        123    111 (C)   pot.IHF binding site
               SITE        185    173 (C)   pot. Mu repressor binding site
               SITE        193    180 (C)   pot. Mu repressor binding site
               SITE        183    193       pot. MuA binding site
               INVREP      190    197       inverted repeat A
               INVREP      203    210       inverted repeat A'
               TERM        190    210       transcription terminator of an
                                            early mRNA
               PRM         225    230       pot. -10 region
               SITE        237    240       region of 5' end of Mu C mRNA
                                            (by S1 mapping)
               RBS         251    253       pot. ribosome binding site
            
               SITE        414    426       pot. IHF binding site
               SITE        454    467       pot. Mu repressor binding site
               RBS         462    465       pot. ribosome binding site
            
               INVREP      782    800       inverted repeat B
               SITE        820    811 (C)   pot. MuA binding site
            
               SITE        846    858       pot. Mu repressor binding site
               SITE        903    890 (C)   pot. IHF binding site
               SITE        893    902       pot. MuA binding site
FEATURES       from  to/span     description
    ORF         262      480     open reading frame 1
    ORF         278      406     open reading frame 2
    ORF         390       97 (c) open reading frame 3
    ORF         473      835     open reading frame 4
    ORF         845      964     open reading frame 5
    ORF         933      739 (c) open reading frame 6
BASE COUNT      183 a    230 c    269 g    283 t
ORIGIN      
        1 ctacattcca cgcggtcagg ctctggactc gctgattcgg gatttgcgta tctggaatga
       61 ttttaacggt cgcaacgtca gcgaactgac cacccgttat ggtgtcactt tcaatacggt
      121 gtacaaagcc attcgccgga tgcgacggct gaaataccgc cagtaccagc cctcacttct
      181 gtaaacagta aagccggtta atccggcttt ttttacgtcc tcaatatcct gtgatgaata
      241 accgtaccgg ggaaaatcat catgggtaaa ggctggaatg catcctttca tcttggcaga
      301 cgtgagcggc tgcgtcagga ggttttgcac cgtgtcgccg gtgggccacg tcctgcgccc
      361 cgcgactata ccggtcatga tggcacccac ggcagttatt acatgaaagg ctggcagtca
      421 gttgatatgc cggaaattct tcatcattgc ctgctttaca gggagaaaca ttatgtctga
      481 gcgttctgct cgtcagtggc ctgatttttt atcggtggtg ttactggcgc tgttattgtg
      541 gataagtctg ttttgtggct ggcgcgcgct gatgttttgc tgtgcctcgg ttttcagtgt
      601 tgctttatgt gttgccgctg attgtctgga tgcgctgata atgagctgcc gggtgcctga
      661 acattttgcc cggttcgtct ggcctctgac ctggctgggg agtctgtccg ggctgggatt
      721 agccgtgatg gcaacgtctc aactgaaaac gggtcctgag catgtcatct gggcgctggc
      781 cggattgctg acgttctggc tctcatttcg ttttcgcgct cgtctgttcg ggtaaagggg
      841 caccatgtac agaaaattca gtgatgaatg tttcgggccg tccacgctga ttaatgcgat
      901 aaaagtgatt gcccttgtgg ttctgataac catcagtgcc gtggtgtatc tttctgtctg
      961 ttaac
//
LOCUS       PNSTETFA1      97 bp ds-DNA             BCT       28-FEB-1990
DEFINITION  Plasmid pNS1981 flanking region of tet resistance gene, 5' end.
ACCESSION   M32262
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Plasmid pNS1981 from B.subtilis.
  ORGANISM  Plasmid pNS1981
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 97)
  AUTHORS   Sakaguchi,R. and Shishido,K.
  TITLE     A unique DNA structure of the junction of homologous and
            nonhomologous regions between tetracycline-resistance plasmid
            pNS1981 and kanamycin-resistance plasmid pUB110
  JOURNAL   Nucleic Acids Res. 15, 7202-7202 (1987)
  STANDARD  simple staff_entry
BASE COUNT       38 a      8 c     18 g     33 t
ORIGIN      
        1 tgggaacgga aaaattattt tattaaagag tagttcaaca aacgggccat attgttgtat
       61 aagtgatgaa atactgaatt taaaacttag tttatat
//
LOCUS       PNSTETFA2      94 bp ds-DNA             BCT       28-FEB-1990
DEFINITION  Plasmid pNS1981 flanking region of tet resistance gene, 3' end.
ACCESSION   M33669
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Plasmid pNS1981 from B.subtilis.
  ORGANISM  Plasmid pNS1981
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 94)
  AUTHORS   Sakaguchi,R. and Shishido,K.
  TITLE     A unique DNA structure of the junction of homologous and
            nonhomologous regions between tetracycline-resistance plasmid
            pNS1981 and kanamycin-resistance plasmid pUB110
  JOURNAL   Nucleic Acids Res. 15, 7202-7202 (1987)
  STANDARD  simple staff_entry
BASE COUNT       27 a     14 c     16 g     37 t
ORIGIN      
        1 tttataacag gaattcaaat ctttttgttc cattaaaggg cgcgattgct gaataaaaga
       61 tacgagagac ctctcttgta tcttttttat tttg
//
LOCUS       STMTGFMA      147 bp ds-DNA             BCT       28-FEB-1990
DEFINITION  S.rimosus initiator tRNA gene.
ACCESSION   M32254
KEYWORDS    transfer RNA-fMet.
SOURCE      Streptomyces rimosus DNA.
  ORGANISM  Streptomyces rimosus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 147)
  AUTHORS   Gamulin,V. and Soell,D.
  TITLE     The initiator tRNA genes from Streptomyces rimosus
  JOURNAL   Nucleic Acids Res. 15, 6747-6747 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    tRNA         51      125     tRNA-fMet
BASE COUNT       28 a     39 c     51 g     29 t
ORIGIN      
        1 gtgatttgcg ctggtcgacg gcggtggtgc gtagactggt cacacaacgg cgcggggtgg
       61 agcagctcgg tagctcgctg ggctcataac tcagaggtcg caggttcaaa tcctgtcccc
      121 gctactgatg aagaaaggcc cggatcc
//
LOCUS       STMTGFMB      147 bp ds-DNA             BCT       28-FEB-1990
DEFINITION  S.rimosus initiator tRNA gene.
ACCESSION   M32255
KEYWORDS    transfer RNA-fMet.
SOURCE      Streptomyces rimosus DNA.
  ORGANISM  Streptomyces rimosus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 147)
  AUTHORS   Gamulin,V. and Soell,D.
  TITLE     The initiator tRNA genes from Streptomyces rimosus
  JOURNAL   Nucleic Acids Res. 15, 6747-6747 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    tRNA         51      125     tRNA-fMet
BASE COUNT       28 a     48 c     49 g     22 t
ORIGIN      
        1 gcgtgggccg ggcaccacct ccgtgccgtg gtaccgtgga cacacaacgg cgcggggtgg
       61 agcagctcgg tagctcgctg ggctcataac ccagaggtcg caggttcaaa tcctgtcccc
      121 gctactggaa acgaaaggcc cggatcc
//

GenBank-Updates@genbank.bio.net (04/12/90)

LOCUS       MUSIGR       2003 bp ss-mRNA            ROD       18-JUL-1989
DEFINITION  Mouse interferon-gamma receptor mRNA, complete cds.
ACCESSION   M25764
KEYWORDS    interferon-gamma receptor.
SOURCE      Mouse (strain BALB/c) plasmacytoid lymphosarcoma cell line ABPL4,
            cDNA to mRNA, clone 45.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2003)
  AUTHORS   Kumar,C.S., Muthukumaran,G., Frost,L.J., Noe,M., Ahn,Y.H.,
            Mariano,T.M. and Pestka,S.
  TITLE     Molecular characterization of the murine interferon gamma receptor
            cDNA
  JOURNAL   J. Biol. Chem. 264, 17939-17946 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence for [1] kindly
            submitted by S.Pestka, 30-JUN-1989, for release after publication.
FEATURES       from  to/span     description
    pept         31     1464     interferon-gamma receptor precursor
    sigp         31      111     interferon-gamma receptor signal peptide
    matp        112     1461     interferon-gamma receptor
BASE COUNT      534 a    462 c    489 g    518 t
ORIGIN      Chromosome 10.
        1 ccccctgtca gaggtgtccc tcgcgcagga atgggcccgc aggcggcagc tggcaggatg
       61 attctgctgg tggtcctgat gctgtctgcg aaggtcggga gtggagcttt gacgagcacc
      121 gaggatcctg agcctccctc ggtgcctgta ccgacgaatg ttctaattaa gtcttataac
      181 ttgaaccctg tcgtatgctg ggaataccag aacatgtcac agactcctat ttttactgta
      241 caggtaaagg tgtattcggg ttcctggact gattcctgca ccaacatttc tgatcattgt
      301 tgtaatatct atggacaaat tatgtatcct gatgtatctg cctgggccag agttaaagct
      361 aaggttggac aaaaagaatc tgactatgca cggtcaaaag agttccttat gtgcctaaag
      421 ggaaaggtcg ggccccctgg cctggagatc aggaggaaga aggaagaaca gctctccgtc
      481 ctcgtatttc accctgaagt cgttgtgaat ggagagagcc agggaaccat gtttggtgac
      541 gggagcacct gttacacatt cgactatact gtgtatgtgg agcataaccg gagtggggag
      601 atcctacata cgaaacatac ggtcgaaaaa gaagagtgta atgagactct gtgtgagtta
      661 aacatctcag tatccacact ggattccaga tattgtattt cagtagacgg aatctcatct
      721 ttctggcaag ttagaacaga aaaatcgaaa gacgtctgta tccctccttt ccatgatgac
      781 agaaaggatt caatttggat tctggtggtt gctcctctta ccgtctttac agtagttatc
      841 ctggtatttg cgtattggta tactaagaag aattcattca agagaaaaag cataatgtta
      901 cctaagtcct tgctctctgt ggtaaaaagt gccacgttag agacaaaacc tgaatcgaag
      961 tattcacttg tcacaccgca ccagccagct gtcctagaga gtgagacggt gatctgtgaa
     1021 gagcccctgt ccacagtgac agctccagac agccccgaag cagcagaaca ggaagaactt
     1081 tcaaaagaaa caaaggctct ggaggctgga ggaagcacgt ctgccatgac cccagacagc
     1141 cctccaactc cgacacaaag acgcagcttt tccctgttaa gtagtaacca gtcaggccct
     1201 tgtagcctca ccgcctatca ctcccgaaac ggctctgaca gtggcctcgt gggatcgggc
     1261 agctccatat cggacttgga atctctccca aacaacaact cagaaacaaa gatggcagag
     1321 cacgaccctc cacccgtgag aaaggccccc atggcctccg gttatgacaa accgcacatg
     1381 ttggtggacg tgcttgtgga tgttgggggg aaggagtctc tcatggggta tagactcaca
     1441 ggagaggccc aggagctgtc ctaaggtctc ccgaggcctg ctggtggtaa agaaactgac
     1501 cttttaggca gtttttctgc attgatttca tgaaagaagc tatacattag ctaatactaa
     1561 ccacatagaa tatcagactt agatacgtga ataaggatcc tgtgggcact gctgggtcca
     1621 ctctgcaaat gccaagacta tcaaaggaac gtattgtcgc ttctggctcc ttcccaggtg
     1681 ggctagcatc tgtgagtttg cctcggctag ccttgcttcc tacagccgcc actgctcctc
     1741 caccctgatc atctcacagg acagggtgga ccgggttttt tttttttttt tcacacacct
     1801 ttgtatatgt aagttcatgt atataatatg tttacatgtt tcactttgaa ctgaaagcta
     1861 ctcaaagcca gccgtaagtc tatggtagaa tgtgatggaa catgttggtg gaagcttgta
     1921 caatagaaca cattggtggg agcttgtaca tactttttta tggagcatta cttacgattt
     1981 tttaagtaaa atgttttgaa acc
//

GenBank-Updates@genbank.bio.net (04/12/90)

LOCUS       DROELAVK     2505 bp ss-mRNA            INV       10-OCT-1988
DEFINITION  D.melanogaster elav protein, complete cds.
ACCESSION   M21152
KEYWORDS    RNA-binding protein; elav protein; neural protein.
SOURCE      D.melanogaster (Canton-S) Kauvar 3-12 hour embryo library cDNA to
            mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2505)
  AUTHORS   Robinow,S., Campos,A.R., Yao,K.-M. and White,K.
  TITLE     The elav gene of Drosophila, required in neurons, has three RNP
            consensus motifs
  JOURNAL   Science 242, 1570-1572 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence kindly submitted
            by K.White, 26-OCT-1988.
FEATURES       from  to/span     description
    pept        492     1943     elav protein
BASE COUNT      772 a    669 c    629 g    435 t
ORIGIN      Chromosome X; 1B5-9 on the cytogenetic map.
        1 tttccacgcc aaccatcgtg cgccgcggtc tgaacgctcc tgccacagaa aaagaaaata
       61 aaagcaagga aaattctaat cgaataccaa atatcgtgct tgtgtgctct ttccgcaatt
      121 gatttttttt aagtagtgca tgacaataac cgttgagttg actccaaccg aagtaaccat
      181 aactggaagc tgacaacagc cattggccgg aatctcatct cacaccgagc gaaatacggc
      241 gacgccagcg tttatttatt aaaaacgatt acattacaac aaacaccaaa gtcctggaca
      301 accgaagaga gtgtgggaga aaatcgaagc agagagcaag gaggagcgcc gtgagagcgc
      361 gcgcgcaagg atttgagatt gcaaagagaa cttagcaatt agttcgcttt gtttgtccag
      421 ccaaacaaca acaacaacca accagctccc caaaaccaac tacacacata ctcgctccct
      481 aaactaaaac aatggacttt attatggcaa ataccggagc tggcggcgga gtagacacac
      541 aggcacagct aatgcagagt gccgctgcag ccgcagcagt ggcggcaaca aacgcggccg
      601 ccgctcccgt acagaatgca gccgccgtgg cggccgccgc ccagctgcag cagcaacagg
      661 tgcaacaggc aatcctgcag gtgcagcagc agcagacaca gcaagcggtg gccgcggccg
      721 ctgccgcagt gacccagcag ctccaacagc aacagcaggc tgtcgtggcc caacaggctg
      781 tagtgcagca gcaacaacag caggcggcgg cagtggtgca acaggcggcg gtccaacagg
      841 ctgtggtgcc ccagccgcag caggcgcagc ccaatacgaa tggcaatgca ggatcgggat
      901 cgcaaaatgg cagcaacggc agcacggaga cgcgcacaaa ccttattgtc aactacttgc
      961 cgcaaacaat gaccgaagac gagatccgtt cgctcttctc cagcgtcggc gagattgagt
     1021 cggtgaagct gatacgcgac aagtcgcagg tctacatcga tcctctcaat ccgcaggcgc
     1081 ccagcaaggg ccaaagtctg ggctacggct ttgttaacta tgtccggccg caagatgccg
     1141 agcaggctgt taatgttcta aacggcctgc gactgcagaa caaaaccata aaggtgtcgt
     1201 ttgcccgccc gtcgtccgat gccattaaag gcgccaacct ttatgtgtcg gggctgccaa
     1261 agacgatgac ccagcaggaa ctggaggcca tcttcgcacc attcggagca ataatcacat
     1321 cgcgcattct gcagaacgct ggcaacgata cgcagacgaa aggagtaggc tttattcgat
     1381 tcgataagcg ggaggaggcc acaagggcta tcattgcctt gaacggtact acaccatcca
     1441 gctgcacaga ccccatagtg gtaaaattct ccaatacgcc cggcagcacc agtaagatca
     1501 tccagccgca gctgcccgct ttcctcaatc cccaactggt gcgacgcatc gggggcgcca
     1561 tgcacacgcc agttaacaag ggtctggccc ggttttcacc aatggcgggc gacatgctgg
     1621 acgtaatgct gcccaatgga ctaggagcgg cggcggcggc ggccacaacg ctggccagtg
     1681 ggcctggcgg cgcgtatccc attttcatct acaacttggc gccggaaacg gaggaggctg
     1741 ccctgtggca gctgtttggt ccatttggag ctgtgcaatc ggtaaagatt gtcaaggatc
     1801 ccacaacgaa tcagtgcaag ggctacggct tcgtttcgat gaccaactac gatgaggcgg
     1861 ccatggccat tcgggcgctc aacggctaca ccatgggcaa tcgggtgctg caggtcagct
     1921 tcaagaccaa caaagccaag tagagcggcc caaatggaag tggacaagga cgcagcggga
     1981 gcaccagcaa ccacaaccca ttatcttgaa tctaaacctg accacacaac acacatatat
     2041 aaatatgcat ggtataacgg taactaagcg caacaaaaca gtatgtgtac aacacacaag
     2101 aaaaaaacaa gaaagtagca ggcaggagaa aatactcttt ccactaaacg acaacgaaac
     2161 ggcggaaatc cccctaaaaa cacaaatctc ctccccaaaa ccgccggcca aaagcgttgc
     2221 aatagaaaaa ttcttcttgt ttagcattta agtaaaaaac cttaactaaa cgaagcgagc
     2281 agaagaagtt ataagatcgg gagaagcgaa taggtagggg gttgagaaag aacaatgaga
     2341 ttttaagaaa taatattgca aaatgaaaac acactgtggc agtggcagct gtgaaaggtc
     2401 aaaggttggc acagtcgatc tagtcacaaa gcaaaacaaa gaaatcaacc aaacaaacac
     2461 aaaacacaca aaaacgaaaa caaatttaac tttattagca aaaaa
//
LOCUS       DROELAVM     9285 bp ds-DNA             INV       26-OCT-1988
DEFINITION  D.melanogaster elav protein, complete cds.
ACCESSION   M21153
KEYWORDS    RNA-binding protein; elav protein; neural protein.
SOURCE      D.melanogaster (Canton-S) Maniatis library DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 9285)
  AUTHORS   Robinow,S., Campos,A.R., Yao,K.-M. and White,K.
  TITLE     The elav gene of Drosophila, required in neurons, has three RNP
            consensus motifs
  JOURNAL   Science 242, 1570-1572 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence kindly submitted
            by K.White, 26-OCT-1988.
BASE COUNT     2786 a   2147 c   2058 g   2294 t
ORIGIN      Chromosome X; 1B5-9 on the cytogenetic map.
        1 ggatccagtc gagggatcac tgaactaaaa agtaaagaga ggagaaacag gagaagccct
       61 taaagtgcga gttcgctgga gtgcctatcg atccttgagg gcagcgctcg atccttatta
      121 ctattggcag gatcatttta taccccttac ataaaaatat cagtatcagt ttgttccctt
      181 atcatgcgtg ctctttgaca tatccgcggg aatttatttc tatttttgct tagtcaaact
      241 gaagaccatt ttgtcccaac ataaaatatg taaaatattt ttttttatca ttttaaattc
      301 tcagggatta agtcaatgtt tgaaaacttc tagctgactt gcgctccact accactcggc
      361 atgcgtgccc ctgattagta ggccaggtgg tgggcctttg gcacgatcgc aaccctaagg
      421 actgcaaccg aatagcattg agaaggagac caggatttct gcactcaaag gtgaaattgt
      481 ctgagcacgg aacacggaaa aacaacaaca aaaacatggc agacggaaaa caatgccgcc
      541 agcggcaagg agggagggtg gaaacacatg ccacgattca gtgtcgaatg tcctgtgatc
      601 ccacattcat atattttgtt attgtgtgcc tgcatccctc gcctgctttt aattcctgca
      661 tcgatccttc tttgctcctg gtctacgaac ccgatgtctt atggtaattt tatgattatt
      721 acaattagat tcgggggaaa ctgcatctgc cgccgtctgc agatccgatt catttggatt
      781 ccgtctgctc aatccagatc catgatggga tcatggccat ttgggcattg gtacatggaa
      841 agttaaaagc gccctaggcg ttctaatcca actcgtctca agaaaaacgg gtgaatgcat
      901 ggaacgctgg atctggttct ggaagatccg gctaagtcaa caagaaatat gtaacctcgc
      961 cattatagct ttactttcga agcagaaaat gcaatcgaga atatcgaatg cacgctccag
     1021 gttcctatgg ctcctagctt tcggtgcagc acaacaaata tttgggttgg ttcgtttttc
     1081 cgccacgatt ttaaaacgct aagggtggct ttttcggagt ccctctcatt tgagccggga
     1141 aaaagaccag accctccctt tccctcccca gccactttcc agctttgttg taacctcgtg
     1201 ttgtgcctgc tttttaatcc cgctttgttg taacgcaggc catttgctgt ttttgttttg
     1261 tttctggcgt ggctaaaacg gattatgtga cccaaccctt tcgatttcga gtctcattgt
     1321 attacatcct tttcccaaaa atgactaaaa aagcgtattg cctatttctg gggtggacag
     1381 aatgaaacgg ggtcagcacg tggccgcaaa cttgaaatac cattatggat ccaataggac
     1441 tgtgaagagt tattagtgaa acatttcgaa caaactattc ctttaattga aatttgccaa
     1501 ttccccactt aagcatcgca tttgcatatt tgtatgctaa ccaaactctc tgcgacgtcg
     1561 atctgtctga atctgaatcc gcatatcttt cgagggcagc aatagcgcca ccccacccca
     1621 tttccgttgc cactagcagc gcaccaccct ttgaaaggac atcaaaaacc accccctgcc
     1681 caaagttgcg ttacaattcc cgaagtgcag tgcagcgtac ataaaacata caatgaaatc
     1741 cgataaaatt ttctatgtcc tttatatcct cataacatgc ccaactataa accgagccaa
     1801 attcttctcg tgaaaaagtt gcacacaacg cgatttcact gtaccttaca ttgttctgta
     1861 aactggaaca gagcgtccga ctggaagagg cgaaggaaaa aatcagcaca gggaacagga
     1921 agttcacaag gattgggggt gtccgagggt ggctacatat atatggcgat atatggcata
     1981 cagtatgtgc atacatatac atacatatac acatatatgt atgtatgtac catactatgt
     2041 ggcgtgtcta gtggcgcgaa taacgaatac aggagcaatg aagcacacga acgaagcgaa
     2101 acgaaacgaa gggaaatatt gaaaaagcca gaattgcatt acttcaaaac tataccggct
     2161 cttggatatg ggtcctgggt ctcggttcct tgcttctggc tttcgcttcg tagggggtgt
     2221 acgagcaata ctgccaggag aaaccaccct agcctttggc gttggtggtt gatagtgcaa
     2281 aaataataat aacggaaacg gaaccataaa cccaaaacag cccttacaac gcgactgatt
     2341 tgtaaggata atggtcgcat gcatcataaa acggttgacg acctcgtcgg aaagagccaa
     2401 cattttgcaa tgatttccaa ttgctcctgc tatctaggcc caatgggccg ctaagtacat
     2461 atgaagcata catagctaaa tacatatgaa gcatacatac atcgggttaa ctataaaacc
     2521 aatgtgttct agatgagttt gttaatttta caaagtaagt tagggctgct ccagtggttt
     2581 taagttgctt aactggacaa gagaacttgg ctctctcccc catctatctc ttgtcttcgg
     2641 gaaaggtaac ctccattttg atcggttgaa tttgtatcaa aaagcatgaa aaactttgaa
     2701 tacagcataa accactgaga gagcagccga aagccagggg tgtatcaaaa ggggttctta
     2761 gaaatattcc actttgcaac gattatagac tcgtttctta tcagacaaaa ctttctccac
     2821 ttctacgtgg tatttacaga aacaaaactt aaggtaaaat gttaggtggc tatatattaa
     2881 agctattaaa atatattaaa ggtttccgga ataaccccct atctcggcgc gcagatacgc
     2941 ccatgtcgcc actctttcgc tctctacctt ccactctcgc tccgaacacc attgcaaccc
     3001 cgagagagct gctcgagagg caactatgag atatgagaat gagaggttgc tctatgtgaa
     3061 aatggaaacc gaaaccgtca atgcctcaat ttcgcatttc attatttcat ttccacgcca
     3121 accatcgtgc gccgcggtct gaacgctcct gccacagaaa aagaaaataa aagcaaggaa
     3181 aattctaatc gaataccaaa tatcgtgctt gtgtgctctt tccgcaattg atttttttta
     3241 agtagtgcat gacaataacc gttgagttga ctccaaccga agtaaccata actggaagct
     3301 gacaacagcc attggccgga atctcatctc acaccgagcg aaatgtaaga gtatccccag
     3361 gatatccaca atttgtgcaa ctgaatgaaa ggcaatgata acaatatcaa tcaacaactt
     3421 tgaaacagta cttttttttt gctaatgatc gcgtcgcgct gcataattgt gcaatttact
     3481 cacttataat ccgcgtgtct gcatatatcc atatgtacct atatctattt ttttgctagt
     3541 gggtgtgtga tgtgtgtaag ttgtatacat tcaaacgaat attaataggc aaaacaacaa
     3601 tatgcaattc gatggaggaa aaagcgaacc tcccaaagag aatggcggag agatgggcgg
     3661 cggttagtca gagtgagagg gggcggccag tgcggagata gtagatgtgt tcagatctac
     3721 gcctctgtga gctgtgtatc tgtaaatact tgctgctcaa atttaaatct gtgtgtgtat
     3781 gtgtgtgtgg gcttgcgtgt gcgggagaga agagcgctga gtgttagagc gagagccagt
     3841 tccgaagagg gcgcagctag catatcgatt ttatgatttc atgtgcaagg agcattgtac
     3901 gcaacaacaa acggaggaac acgaccaact aaccgttaga agtagccatt ttttgagcgc
     3961 agcggagagg gtgggcgagt gtgtgtgggt ggcggagtac ttgcagtcaa caaaacgccg
     4021 gttgccataa aagcaaatgc acgcaggcat gaatcagacc catgtcgtag tagcaatatt
     4081 aaacatagag gattcgcgat tcacagcaaa ggagtttttt ctcaaaaaac gacaacaatt
     4141 agttgttttt tttcccagtg gtcgtctgtg tttatgcttg tgtggaagag agctcgaaat
     4201 aggtagatat atacacgcgt atttatttat tatgtgtaca tatgtgcttg taaaaacaat
     4261 atcagccata caaagtttga gtcccctccg cccacctcct aacaacaaca tcgatatcta
     4321 aatcggtgca ggtaaagagc ggttgctaca ggcccaaata acctctcttc agtagaattt
     4381 taccaacatc tcaaagtatt ctcgtgcgcc acatgtatgc gtatctgcac gcagatatat
     4441 tttttgtatc gcctgtgaaa tatgaattca atgctttgga tggggtaaaa tcgataatat
     4501 cgattggcga taacaatcgc acgcacacac acaggcccgc aaacacgaac aactaaaact
     4561 acaacaacaa ccaccaaacc catcgaattt accgcttata gttttgtttt atttttgttt
     4621 ttgtccacat cagacggcga cgccagcgtt tatttattaa aaacgattac attacaacaa
     4681 acaccaaagt cctggacaac cgaagagagt gtgggagaaa atcgaagcag agagcaagga
     4741 ggagcgccgt gagagcgcgc gcgcaaggat ttgagattgc aaagagaact tagcaattag
     4801 ttcgctttgt ttgtccagcc aaacaacaac aacaaccaac cagctcccca aaaccaacta
     4861 cacacatact cgctccctaa actaaaacaa gtgagtattg aaagacgacg ggccgagctg
     4921 acgagatcgc aggacgaaag agagaaaact ttcgaagggt gcggtgtcgt gtcaaggcaa
     4981 cacaatcaaa ccactaatgc ggaatattaa aaaaaaaaat taatttaaat cataagaggt
     5041 tagctatatt tgatatatat atgtgtgtat ccatgtacat atgtctatgt tagacaagta
     5101 gctctttctt gccgtattgt agttcgggtt atcgttaccc gcctctaatt ggcttgccat
     5161 gacgaaagtg ggtggtacag gacgacgacg tgcgtgcgcg tgaccttttg actcttccat
     5221 aagcgcctca ataaaaggtg cgagtcggta catacatatg tatgagaata gaggatatta
     5281 aaaacgcgcc aaactctaag aaaagcatcc accaaagtaa aaacagtagc aactaaaaac
     5341 ttttcagtct cagctaacca gtccaaacca gctatgtagt ttttagcacg actttgggag
     5401 tgtcgaatgc gttaactaaa gctcaaatat atatcgagac ccgctgtagt ccctatatgc
     5461 aaatcaaatt gctaccattt ccgcagcgat cgcttttcgc gacagccccc tctctctgtc
     5521 gcacttagtt cagtgaacca acacattcgc tggtttcgtt ccgagcagca tcctctgtct
     5581 ctcttgctca gcctcttcgc cgccccctct gcctaagcct ctgtttctgc tgtccgcacc
     5641 ccttaccaag gtgtcatgta aatggcggag ccggcttagg gggtggcaat gccgtgcttt
     5701 taacccttat cggtggcaaa tggcgctcgt acatttggag aaaatgccgt tctacgtttc
     5761 gttttgcttc gtttcgccaa gcgaagtatt acaacttgtt tgactaaggg tgggtctacc
     5821 gtctattgct catttatggt gttttgatgc tttctcaagg ggtgcgggct ggtttctcgc
     5881 atgtcctgac cctctgggtc atccctctgt tcaccgcatt acataactgg gcggggtgca
     5941 cagcgcgaca cactcttatg gagagagcgc tgtaatttgc ggaaaggaga cgagggggag
     6001 aagggacgca agagcagcta cagcgcgcag cagtgccaga aagtcctgcg ctcgccccag
     6061 aagcatcctg ttcctgtgac gcagcggatc tggtctcgcg ttcgctccac tacctccact
     6121 tcctacacct acagtacaca tatcacatga acttaaaact agtgaacgga ttaattaaaa
     6181 aggaatcgca aaacaaaagc tcgctaatcg cgcagacaca gcagaagctt aaattcagaa
     6241 aagtttagtt cttcagtgcc gaaggtgtgt ttgcagagct ttgatttcta gtgcgcgggt
     6301 gtgggaagag aggactaaat agagacagag agagcggccg gatgagaaag tggggcgctg
     6361 ggcggcgcct gttagcaaag cttcgttgac caaccgttat ttcctgaaca aaggaataac
     6421 atacattcat cacttaatca ccaactgttt ttaaatccac agaattataa tttagcgtat
     6481 gattcaagat ttgggaattg tgcgcaccca ggataaccgc gtttgtgttt aatttacaaa
     6541 catgccccta atgtctatta aataaaagtt cagcgtcgga aaattaaaga gtctggtaag
     6601 gattacagta tatttatgct gtaatatcct ttaattgtcc cgggtcacat ttttccgtat
     6661 ccccacgccc taaagcattt tcgtatccat taattaaaca agtaaaggtg taggcttata
     6721 ttatgttata cagtccgctc tcacaagcat tataagaaat atattacttg tctgttcacc
     6781 aaataggtaa aaaatataca tatcatatca aacgatttaa ataattgcgc cctgggcaga
     6841 tgtaaataaa gaaatataaa ttaaaaaata tcgttagaat aaaccaacaa attttctatt
     6901 caaaccctag ctgattgata atcaattagc ccgttataac tactaaaacc cataataata
     6961 caaaaaacta ctaacacatc atgcaaatgt tttaattttg acatttaact attctttcca
     7021 tgcgcctacc cctgcgttgg tcgtgacaac caacaataaa cttctcgact ctgcccgatc
     7081 ctctcaccta tcactcagtg gactttatta tggcaaatac cggagctggc ggcggagtag
     7141 acacacaggc acagctaatg cagagtgccg ctgcagccgc agcagtggcg gcaacaaacg
     7201 cggccgccgc tcccgtacag aatgcagccg ccgtggcggc cgccgcccag ctgcagcagc
     7261 aacaggtgca acaggcaatc ctgcaggtgc agcagcagca gacacagcaa gcggtggccg
     7321 cggccgctgc cgcagtgacc cagcagctcc aacagcaaca gcaggctgtc gtggcccaac
     7381 aggctgtagt gcagcagcaa caacagcagg cggcggcagt ggtgcaacag gcggcggtcc
     7441 aacaggctgt ggtgccccag ccgcagcagg cgcagcccaa tacgaatggc aatgcaggat
     7501 cgggatcgca aaatggcagc aacggcagca cggagacgcg cacaaacctt attgtcaact
     7561 acttgccgca aacaatgacc gaagacgaga tccgttcgct cttctccagc gtcggcgaga
     7621 ttgagtcggt gaagctgata cgcgacaagt cgcaggtcta catcgatcct ctcaatccgc
     7681 aggcgcccag caagggccaa agtctgggct acggctttgt taactatgtc cggccgcaag
     7741 atgccgagca ggctgttaat gttctaaacg gcctgcgact gcagaacaaa accataaagg
     7801 tgtcgtttgc ccgcccgtcg tccgatgcca ttaaaggcgc caacctttat gtgtcggggc
     7861 tgccaaagac gatgacccag caggaactgg aggccatctt cgcaccattc ggagcaataa
     7921 tcacatcgcg cattctgcag aacgctggca acgatacgca gacgaaagga gtaggcttta
     7981 ttcgattcga taagcgggag gaggccacaa gggctatcat tgccttgaac ggtactacac
     8041 catccagctg cacagacccc atagtggtaa aattctccaa tacgcccggc agcaccagta
     8101 agatcatcca gccgcagctg cccgctttcc tcaatcccca actggtgcga cgcatcgggg
     8161 gcgccatgca cacgccagtt aacaagggtc tggcccggtt ttcaccaatg gcgggcgaca
     8221 tgctggacgt aatgctgccc aatggactag gagcggcggc ggcggcggcc acaacgctgg
     8281 ccagtgggcc tggcggcgcg tatcccattt tcatctacaa cttggcgccg gaaacggagg
     8341 aggctgccct gtggcagctg tttggtccat ttggagctgt gcaatcggta aagattgtca
     8401 aggatcccac aacgaatcag tgcaagggct acggcttcgt ttcgatgacc aactacgatg
     8461 aggcggccat ggccattcgg gcgctcaacg gctacaccat gggcaatcgg gtgctgcagg
     8521 tcagcttcaa gaccaacaaa gccaagtaga gcggcccaaa tggaagtgga caaggacgca
     8581 gcgggagcac cagcaaccac aacccattat cttgaatcta aacctgacca cacaacacac
     8641 atatataaat atgcatggta taacggtaac taagcgcaac aaaacagtat gtgtacaaca
     8701 cacaagaaaa aaacaagaaa gtagcaggca ggagaaaata ctctttccac taaacgacaa
     8761 cgaaacggcg gaaatccccc taaaaacaca aatctcctcc ccaaaaccgc cggccaaaag
     8821 cgttgcaata gaaaaattct tcttgtttag catttaagta aaaaacctta actaaacgaa
     8881 gcgagcagaa gaagttataa gatcgggaga agcgaatagg tagggggttg agaaagaaca
     8941 atgagatttt aagaaataat attgcaaaat gaaaacacac tgtggcagtg gcagctgtga
     9001 aaggtcaaag gttggcacag tcgatctagt cacaaagcaa aacaaagaaa tcaaccaaac
     9061 aaacacaaaa cacacaaaaa cgaaaacaaa tttaacttta ttagcaaaaa agaaaatgag
     9121 aaattttaaa aatttattaa ctttaagtaa aacgcagcaa tgaaatcttc caaacaaaag
     9181 aaagcaaaac agcaaatctg tttgttacac ttcaactttt tctaccataa aaactttgaa
     9241 tcttgatact aaacctaaag cagaattaag aaaatagagg aattc
//
LOCUS       HUMDBTEFH    2335 bp ds-DNA             PRI       15-SEP-1989
DEFINITION  Human beta-globin intergenic region spanning the deletion which
            causes delta-beta-thalassemia.
ACCESSION   M22800 X14550
KEYWORDS    .
SOURCE      Human patient s.h. leukocyte cell DNA, togi clone d.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 643)
  AUTHORS   Mishima,N. and Gilman,J.G.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Mishima,N. and Gilman,J.G.
  TITLE     DNA deletion in indian delta-beta-thalassemia with elevated fetal
            hemoglobin extends from 1 kb 3' of a-GAMMA globin gene into
            downstream KpnI (L1) repeated sequence
  JOURNAL   FASEB J. 2, 546-546 (1988)
  STANDARD  simple staff_review
REFERENCE   3  (bases 1 to 2335)
  AUTHORS   Mishima,N. and Gilman,J.G.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by J.G.Gilman 28-FEB-1989; [3], 11-APR-1990.
            
            This sequence is across the DNA deletion in the beta-globin gene
            cluster.
BASE COUNT      931 a    462 c    393 g    530 t     19 others
ORIGIN      Chromosome 11.
        1 aagcttgggt tttgagtttt cattatttac ccaaaattca ttcaggagca ggttgtttaa
       61 tttccatgta attgcatagt tttcatctat tcattagtct tgaaatctat ttttattcct
      121 ctgtggtctg agagtgattg atatgatttt ttttcatttt ctgaggattg ttttaggtcc
      181 tcccaaggct gaaccaggaa gaaatcaatt tcctgaagag atccatgatg agctctgaaa
      241 atgattcagt aataaatagc ctatcaaaca aaaaaagccc tggacaatat gattcacaac
      301 caaattctat cagatgcaca aagaagagtt ggaaaaattc ctactgaagc tactccaaaa
      361 aattgagggg gagggattca acctcagctc attttatgag gccagcatca tcctgatacc
      421 aaaacctgat agacacacaa caacaaaaaa agaaaacttc aggccaatat ccttgaacat
      481 tgatgcaaaa atactagcaa gccaaatcca gcaacacatc acaaagctaa tccacctcaa
      541 tcaagtaggc tttatccctg agaagcaaga ttggttcagc atatacaaat tgataaatgt
      601 gattcatcat atatacagca ctaaaaacag aaaccacatg atcatttcaa tatgtgcaaa
      661 aaaagatttt gataaaattc aacttcactt catattaaaa cctctcaaca aactagctat
      721 ttaaggacat gacctcaaaa taatatgagc catctatgac aaacccacag ccaacatcat
      781 acagaatggg caaaacctgg aagcattctt cttgaaaact ggcacaagac aaggatgccc
      841 tctctcacca ctgctattca gtatagtact ggcagtcctg gccagagcaa tcaggcaaga
      901 gaagggaata aagggcatcc aaataggaag agagaaagtc aaagtatccc tgttttctgt
      961 atctagaaaa caccatagtc aaggcccaaa agctccttca gctgataact tcagcaaagt
     1021 ttcaggatat naaatcaatg tacaaaaatc actagcattc ccatacatca acaatagcca
     1081 agcctagagc caaatcagga agtcaatccc atttaccgtt gccacaaaaa gaataaaata
     1141 cctaaggata tatctaacca aggagttgaa agatctctac aatgagattt acaaaacact
     1201 gctcaaagaa atcagagctg atagaaacaa atggaaaaac tttccatgct catggataaa
     1261 aagaatcaat attgttaaaa tgtccatact gcctagagca atttacagtt tcaaagctat
     1321 tcctatcaaa ctaccaatga cattcttcac agaactagaa aaatacactt taaagttcat
     1381 atggaacaac aacaaaaaaa gtccaaatag cctaggaaat cctaatactc acaggtggga
     1441 attgaaaaat gagaacactt ggacacaggg tggggaacat cacacaccgg ggcctgtcgt
     1501 ggggtggtag gagagnngag ggataanatt aggagatata tctaatgtaa atgacgagtt
     1561 aacgggtgca gcacaccaat atggcacacg tatcactatg taacaaactg cacaggtacc
     1621 gtagaactta gagtacaaaa aaaaattgag aaaaagaaaa aagctggaga taacacattg
     1681 cctaacttca aaccatacca cagggctaca gtaaccaaaa cagtttggta ctgttacaga
     1741 aaagacacac agacaaatgg aacagaatag agagcccaga aataagacca catgcctaca
     1801 accatctgat ctttgacaaa gctgataana acaagcaatg aggaaaagac accctatnca
     1861 acaagtcgtg ttgggataac tgactaancg tatntagaag attgaaacca gaccccgtnc
     1921 tatatcatat ncncaaataa ncncaagatg gattagagac ttaaatgcaa agcccaaact
     1981 atnanaatgc ttgaaagcaa cctagacaat accattccgg acatgagcag gggcaaagat
     2041 ttcatgacaa agatgccaag agtcattgca gcaaaagcaa aaattgacaa ataggatcta
     2101 attaaaataa agtttctgca cagcaaaaga aactatcaac tgagtaaaca acctatagaa
     2161 tnggagaaaa tnttnncaaa ctatgcatcc aacaaaggcc taatattcag aatctataag
     2221 gaacttaaat gaatctacaa gaacaaacaa acaatgccac aaaaaatggg taaaggacat
     2281 gaacagacag tattcaaaag aagacatcac atatggccac caagcatata aaaaa
//

GenBank-Updates@genbank.bio.net (04/12/90)

LOCUS       DROELAVK     2505 bp ss-mRNA            INV       10-OCT-1988
DEFINITION  D.melanogaster elav protein, complete cds.
ACCESSION   M21152
KEYWORDS    RNA binding protein; elav protein; neural protein.
SOURCE      D.melanogaster (Canton-S) Kauvar 3-12 hour embryo library cDNA to
            mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2505)
  AUTHORS   Robinow,S., Campos,A.R., Yao,K.-M. and White,K.
  TITLE     The elav gene of Drosophila, required in neurons, has three RNP
            consensus motifs
  JOURNAL   Science 242, 1570-1572 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence kindly submitted
            by K.White, 26-OCT-1988.
FEATURES       from  to/span     description
    pept        492     1943     elav protein
BASE COUNT      772 a    669 c    629 g    435 t
ORIGIN      Chromosome X; 1B5-9 on the cytogenetic map.
        1 tttccacgcc aaccatcgtg cgccgcggtc tgaacgctcc tgccacagaa aaagaaaata
       61 aaagcaagga aaattctaat cgaataccaa atatcgtgct tgtgtgctct ttccgcaatt
      121 gatttttttt aagtagtgca tgacaataac cgttgagttg actccaaccg aagtaaccat
      181 aactggaagc tgacaacagc cattggccgg aatctcatct cacaccgagc gaaatacggc
      241 gacgccagcg tttatttatt aaaaacgatt acattacaac aaacaccaaa gtcctggaca
      301 accgaagaga gtgtgggaga aaatcgaagc agagagcaag gaggagcgcc gtgagagcgc
      361 gcgcgcaagg atttgagatt gcaaagagaa cttagcaatt agttcgcttt gtttgtccag
      421 ccaaacaaca acaacaacca accagctccc caaaaccaac tacacacata ctcgctccct
      481 aaactaaaac aatggacttt attatggcaa ataccggagc tggcggcgga gtagacacac
      541 aggcacagct aatgcagagt gccgctgcag ccgcagcagt ggcggcaaca aacgcggccg
      601 ccgctcccgt acagaatgca gccgccgtgg cggccgccgc ccagctgcag cagcaacagg
      661 tgcaacaggc aatcctgcag gtgcagcagc agcagacaca gcaagcggtg gccgcggccg
      721 ctgccgcagt gacccagcag ctccaacagc aacagcaggc tgtcgtggcc caacaggctg
      781 tagtgcagca gcaacaacag caggcggcgg cagtggtgca acaggcggcg gtccaacagg
      841 ctgtggtgcc ccagccgcag caggcgcagc ccaatacgaa tggcaatgca ggatcgggat
      901 cgcaaaatgg cagcaacggc agcacggaga cgcgcacaaa ccttattgtc aactacttgc
      961 cgcaaacaat gaccgaagac gagatccgtt cgctcttctc cagcgtcggc gagattgagt
     1021 cggtgaagct gatacgcgac aagtcgcagg tctacatcga tcctctcaat ccgcaggcgc
     1081 ccagcaaggg ccaaagtctg ggctacggct ttgttaacta tgtccggccg caagatgccg
     1141 agcaggctgt taatgttcta aacggcctgc gactgcagaa caaaaccata aaggtgtcgt
     1201 ttgcccgccc gtcgtccgat gccattaaag gcgccaacct ttatgtgtcg gggctgccaa
     1261 agacgatgac ccagcaggaa ctggaggcca tcttcgcacc attcggagca ataatcacat
     1321 cgcgcattct gcagaacgct ggcaacgata cgcagacgaa aggagtaggc tttattcgat
     1381 tcgataagcg ggaggaggcc acaagggcta tcattgcctt gaacggtact acaccatcca
     1441 gctgcacaga ccccatagtg gtaaaattct ccaatacgcc cggcagcacc agtaagatca
     1501 tccagccgca gctgcccgct ttcctcaatc cccaactggt gcgacgcatc gggggcgcca
     1561 tgcacacgcc agttaacaag ggtctggccc ggttttcacc aatggcgggc gacatgctgg
     1621 acgtaatgct gcccaatgga ctaggagcgg cggcggcggc ggccacaacg ctggccagtg
     1681 ggcctggcgg cgcgtatccc attttcatct acaacttggc gccggaaacg gaggaggctg
     1741 ccctgtggca gctgtttggt ccatttggag ctgtgcaatc ggtaaagatt gtcaaggatc
     1801 ccacaacgaa tcagtgcaag ggctacggct tcgtttcgat gaccaactac gatgaggcgg
     1861 ccatggccat tcgggcgctc aacggctaca ccatgggcaa tcgggtgctg caggtcagct
     1921 tcaagaccaa caaagccaag tagagcggcc caaatggaag tggacaagga cgcagcggga
     1981 gcaccagcaa ccacaaccca ttatcttgaa tctaaacctg accacacaac acacatatat
     2041 aaatatgcat ggtataacgg taactaagcg caacaaaaca gtatgtgtac aacacacaag
     2101 aaaaaaacaa gaaagtagca ggcaggagaa aatactcttt ccactaaacg acaacgaaac
     2161 ggcggaaatc cccctaaaaa cacaaatctc ctccccaaaa ccgccggcca aaagcgttgc
     2221 aatagaaaaa ttcttcttgt ttagcattta agtaaaaaac cttaactaaa cgaagcgagc
     2281 agaagaagtt ataagatcgg gagaagcgaa taggtagggg gttgagaaag aacaatgaga
     2341 ttttaagaaa taatattgca aaatgaaaac acactgtggc agtggcagct gtgaaaggtc
     2401 aaaggttggc acagtcgatc tagtcacaaa gcaaaacaaa gaaatcaacc aaacaaacac
     2461 aaaacacaca aaaacgaaaa caaatttaac tttattagca aaaaa
//
LOCUS       DROELAVM     9285 bp ds-DNA             INV       26-OCT-1988
DEFINITION  D.melanogaster elav protein, complete cds.
ACCESSION   M21153
KEYWORDS    RNA binding protein; elav protein; neural protein.
SOURCE      D.melanogaster (Canton-S) Maniatis library DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 9285)
  AUTHORS   Robinow,S., Campos,A.R., Yao,K.-M. and White,K.
  TITLE     The elav gene of Drosophila, required in neurons, has three RNP
            consensus motifs
  JOURNAL   Science 242, 1570-1572 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence kindly submitted
            by K.White, 26-OCT-1988.
BASE COUNT     2786 a   2147 c   2058 g   2294 t
ORIGIN      Chromosome X; 1B5-9 on the cytogenetic map.
        1 ggatccagtc gagggatcac tgaactaaaa agtaaagaga ggagaaacag gagaagccct
       61 taaagtgcga gttcgctgga gtgcctatcg atccttgagg gcagcgctcg atccttatta
      121 ctattggcag gatcatttta taccccttac ataaaaatat cagtatcagt ttgttccctt
      181 atcatgcgtg ctctttgaca tatccgcggg aatttatttc tatttttgct tagtcaaact
      241 gaagaccatt ttgtcccaac ataaaatatg taaaatattt ttttttatca ttttaaattc
      301 tcagggatta agtcaatgtt tgaaaacttc tagctgactt gcgctccact accactcggc
      361 atgcgtgccc ctgattagta ggccaggtgg tgggcctttg gcacgatcgc aaccctaagg
      421 actgcaaccg aatagcattg agaaggagac caggatttct gcactcaaag gtgaaattgt
      481 ctgagcacgg aacacggaaa aacaacaaca aaaacatggc agacggaaaa caatgccgcc
      541 agcggcaagg agggagggtg gaaacacatg ccacgattca gtgtcgaatg tcctgtgatc
      601 ccacattcat atattttgtt attgtgtgcc tgcatccctc gcctgctttt aattcctgca
      661 tcgatccttc tttgctcctg gtctacgaac ccgatgtctt atggtaattt tatgattatt
      721 acaattagat tcgggggaaa ctgcatctgc cgccgtctgc agatccgatt catttggatt
      781 ccgtctgctc aatccagatc catgatggga tcatggccat ttgggcattg gtacatggaa
      841 agttaaaagc gccctaggcg ttctaatcca actcgtctca agaaaaacgg gtgaatgcat
      901 ggaacgctgg atctggttct ggaagatccg gctaagtcaa caagaaatat gtaacctcgc
      961 cattatagct ttactttcga agcagaaaat gcaatcgaga atatcgaatg cacgctccag
     1021 gttcctatgg ctcctagctt tcggtgcagc acaacaaata tttgggttgg ttcgtttttc
     1081 cgccacgatt ttaaaacgct aagggtggct ttttcggagt ccctctcatt tgagccggga
     1141 aaaagaccag accctccctt tccctcccca gccactttcc agctttgttg taacctcgtg
     1201 ttgtgcctgc tttttaatcc cgctttgttg taacgcaggc catttgctgt ttttgttttg
     1261 tttctggcgt ggctaaaacg gattatgtga cccaaccctt tcgatttcga gtctcattgt
     1321 attacatcct tttcccaaaa atgactaaaa aagcgtattg cctatttctg gggtggacag
     1381 aatgaaacgg ggtcagcacg tggccgcaaa cttgaaatac cattatggat ccaataggac
     1441 tgtgaagagt tattagtgaa acatttcgaa caaactattc ctttaattga aatttgccaa
     1501 ttccccactt aagcatcgca tttgcatatt tgtatgctaa ccaaactctc tgcgacgtcg
     1561 atctgtctga atctgaatcc gcatatcttt cgagggcagc aatagcgcca ccccacccca
     1621 tttccgttgc cactagcagc gcaccaccct ttgaaaggac atcaaaaacc accccctgcc
     1681 caaagttgcg ttacaattcc cgaagtgcag tgcagcgtac ataaaacata caatgaaatc
     1741 cgataaaatt ttctatgtcc tttatatcct cataacatgc ccaactataa accgagccaa
     1801 attcttctcg tgaaaaagtt gcacacaacg cgatttcact gtaccttaca ttgttctgta
     1861 aactggaaca gagcgtccga ctggaagagg cgaaggaaaa aatcagcaca gggaacagga
     1921 agttcacaag gattgggggt gtccgagggt ggctacatat atatggcgat atatggcata
     1981 cagtatgtgc atacatatac atacatatac acatatatgt atgtatgtac catactatgt
     2041 ggcgtgtcta gtggcgcgaa taacgaatac aggagcaatg aagcacacga acgaagcgaa
     2101 acgaaacgaa gggaaatatt gaaaaagcca gaattgcatt acttcaaaac tataccggct
     2161 cttggatatg ggtcctgggt ctcggttcct tgcttctggc tttcgcttcg tagggggtgt
     2221 acgagcaata ctgccaggag aaaccaccct agcctttggc gttggtggtt gatagtgcaa
     2281 aaataataat aacggaaacg gaaccataaa cccaaaacag cccttacaac gcgactgatt
     2341 tgtaaggata atggtcgcat gcatcataaa acggttgacg acctcgtcgg aaagagccaa
     2401 cattttgcaa tgatttccaa ttgctcctgc tatctaggcc caatgggccg ctaagtacat
     2461 atgaagcata catagctaaa tacatatgaa gcatacatac atcgggttaa ctataaaacc
     2521 aatgtgttct agatgagttt gttaatttta caaagtaagt tagggctgct ccagtggttt
     2581 taagttgctt aactggacaa gagaacttgg ctctctcccc catctatctc ttgtcttcgg
     2641 gaaaggtaac ctccattttg atcggttgaa tttgtatcaa aaagcatgaa aaactttgaa
     2701 tacagcataa accactgaga gagcagccga aagccagggg tgtatcaaaa ggggttctta
     2761 gaaatattcc actttgcaac gattatagac tcgtttctta tcagacaaaa ctttctccac
     2821 ttctacgtgg tatttacaga aacaaaactt aaggtaaaat gttaggtggc tatatattaa
     2881 agctattaaa atatattaaa ggtttccgga ataaccccct atctcggcgc gcagatacgc
     2941 ccatgtcgcc actctttcgc tctctacctt ccactctcgc tccgaacacc attgcaaccc
     3001 cgagagagct gctcgagagg caactatgag atatgagaat gagaggttgc tctatgtgaa
     3061 aatggaaacc gaaaccgtca atgcctcaat ttcgcatttc attatttcat ttccacgcca
     3121 accatcgtgc gccgcggtct gaacgctcct gccacagaaa aagaaaataa aagcaaggaa
     3181 aattctaatc gaataccaaa tatcgtgctt gtgtgctctt tccgcaattg atttttttta
     3241 agtagtgcat gacaataacc gttgagttga ctccaaccga agtaaccata actggaagct
     3301 gacaacagcc attggccgga atctcatctc acaccgagcg aaatgtaaga gtatccccag
     3361 gatatccaca atttgtgcaa ctgaatgaaa ggcaatgata acaatatcaa tcaacaactt
     3421 tgaaacagta cttttttttt gctaatgatc gcgtcgcgct gcataattgt gcaatttact
     3481 cacttataat ccgcgtgtct gcatatatcc atatgtacct atatctattt ttttgctagt
     3541 gggtgtgtga tgtgtgtaag ttgtatacat tcaaacgaat attaataggc aaaacaacaa
     3601 tatgcaattc gatggaggaa aaagcgaacc tcccaaagag aatggcggag agatgggcgg
     3661 cggttagtca gagtgagagg gggcggccag tgcggagata gtagatgtgt tcagatctac
     3721 gcctctgtga gctgtgtatc tgtaaatact tgctgctcaa atttaaatct gtgtgtgtat
     3781 gtgtgtgtgg gcttgcgtgt gcgggagaga agagcgctga gtgttagagc gagagccagt
     3841 tccgaagagg gcgcagctag catatcgatt ttatgatttc atgtgcaagg agcattgtac
     3901 gcaacaacaa acggaggaac acgaccaact aaccgttaga agtagccatt ttttgagcgc
     3961 agcggagagg gtgggcgagt gtgtgtgggt ggcggagtac ttgcagtcaa caaaacgccg
     4021 gttgccataa aagcaaatgc acgcaggcat gaatcagacc catgtcgtag tagcaatatt
     4081 aaacatagag gattcgcgat tcacagcaaa ggagtttttt ctcaaaaaac gacaacaatt
     4141 agttgttttt tttcccagtg gtcgtctgtg tttatgcttg tgtggaagag agctcgaaat
     4201 aggtagatat atacacgcgt atttatttat tatgtgtaca tatgtgcttg taaaaacaat
     4261 atcagccata caaagtttga gtcccctccg cccacctcct aacaacaaca tcgatatcta
     4321 aatcggtgca ggtaaagagc ggttgctaca ggcccaaata acctctcttc agtagaattt
     4381 taccaacatc tcaaagtatt ctcgtgcgcc acatgtatgc gtatctgcac gcagatatat
     4441 tttttgtatc gcctgtgaaa tatgaattca atgctttgga tggggtaaaa tcgataatat
     4501 cgattggcga taacaatcgc acgcacacac acaggcccgc aaacacgaac aactaaaact
     4561 acaacaacaa ccaccaaacc catcgaattt accgcttata gttttgtttt atttttgttt
     4621 ttgtccacat cagacggcga cgccagcgtt tatttattaa aaacgattac attacaacaa
     4681 acaccaaagt cctggacaac cgaagagagt gtgggagaaa atcgaagcag agagcaagga
     4741 ggagcgccgt gagagcgcgc gcgcaaggat ttgagattgc aaagagaact tagcaattag
     4801 ttcgctttgt ttgtccagcc aaacaacaac aacaaccaac cagctcccca aaaccaacta
     4861 cacacatact cgctccctaa actaaaacaa gtgagtattg aaagacgacg ggccgagctg
     4921 acgagatcgc aggacgaaag agagaaaact ttcgaagggt gcggtgtcgt gtcaaggcaa
     4981 cacaatcaaa ccactaatgc ggaatattaa aaaaaaaaat taatttaaat cataagaggt
     5041 tagctatatt tgatatatat atgtgtgtat ccatgtacat atgtctatgt tagacaagta
     5101 gctctttctt gccgtattgt agttcgggtt atcgttaccc gcctctaatt ggcttgccat
     5161 gacgaaagtg ggtggtacag gacgacgacg tgcgtgcgcg tgaccttttg actcttccat
     5221 aagcgcctca ataaaaggtg cgagtcggta catacatatg tatgagaata gaggatatta
     5281 aaaacgcgcc aaactctaag aaaagcatcc accaaagtaa aaacagtagc aactaaaaac
     5341 ttttcagtct cagctaacca gtccaaacca gctatgtagt ttttagcacg actttgggag
     5401 tgtcgaatgc gttaactaaa gctcaaatat atatcgagac ccgctgtagt ccctatatgc
     5461 aaatcaaatt gctaccattt ccgcagcgat cgcttttcgc gacagccccc tctctctgtc
     5521 gcacttagtt cagtgaacca acacattcgc tggtttcgtt ccgagcagca tcctctgtct
     5581 ctcttgctca gcctcttcgc cgccccctct gcctaagcct ctgtttctgc tgtccgcacc
     5641 ccttaccaag gtgtcatgta aatggcggag ccggcttagg gggtggcaat gccgtgcttt
     5701 taacccttat cggtggcaaa tggcgctcgt acatttggag aaaatgccgt tctacgtttc
     5761 gttttgcttc gtttcgccaa gcgaagtatt acaacttgtt tgactaaggg tgggtctacc
     5821 gtctattgct catttatggt gttttgatgc tttctcaagg ggtgcgggct ggtttctcgc
     5881 atgtcctgac cctctgggtc atccctctgt tcaccgcatt acataactgg gcggggtgca
     5941 cagcgcgaca cactcttatg gagagagcgc tgtaatttgc ggaaaggaga cgagggggag
     6001 aagggacgca agagcagcta cagcgcgcag cagtgccaga aagtcctgcg ctcgccccag
     6061 aagcatcctg ttcctgtgac gcagcggatc tggtctcgcg ttcgctccac tacctccact
     6121 tcctacacct acagtacaca tatcacatga acttaaaact agtgaacgga ttaattaaaa
     6181 aggaatcgca aaacaaaagc tcgctaatcg cgcagacaca gcagaagctt aaattcagaa
     6241 aagtttagtt cttcagtgcc gaaggtgtgt ttgcagagct ttgatttcta gtgcgcgggt
     6301 gtgggaagag aggactaaat agagacagag agagcggccg gatgagaaag tggggcgctg
     6361 ggcggcgcct gttagcaaag cttcgttgac caaccgttat ttcctgaaca aaggaataac
     6421 atacattcat cacttaatca ccaactgttt ttaaatccac agaattataa tttagcgtat
     6481 gattcaagat ttgggaattg tgcgcaccca ggataaccgc gtttgtgttt aatttacaaa
     6541 catgccccta atgtctatta aataaaagtt cagcgtcgga aaattaaaga gtctggtaag
     6601 gattacagta tatttatgct gtaatatcct ttaattgtcc cgggtcacat ttttccgtat
     6661 ccccacgccc taaagcattt tcgtatccat taattaaaca agtaaaggtg taggcttata
     6721 ttatgttata cagtccgctc tcacaagcat tataagaaat atattacttg tctgttcacc
     6781 aaataggtaa aaaatataca tatcatatca aacgatttaa ataattgcgc cctgggcaga
     6841 tgtaaataaa gaaatataaa ttaaaaaata tcgttagaat aaaccaacaa attttctatt
     6901 caaaccctag ctgattgata atcaattagc ccgttataac tactaaaacc cataataata
     6961 caaaaaacta ctaacacatc atgcaaatgt tttaattttg acatttaact attctttcca
     7021 tgcgcctacc cctgcgttgg tcgtgacaac caacaataaa cttctcgact ctgcccgatc
     7081 ctctcaccta tcactcagtg gactttatta tggcaaatac cggagctggc ggcggagtag
     7141 acacacaggc acagctaatg cagagtgccg ctgcagccgc agcagtggcg gcaacaaacg
     7201 cggccgccgc tcccgtacag aatgcagccg ccgtggcggc cgccgcccag ctgcagcagc
     7261 aacaggtgca acaggcaatc ctgcaggtgc agcagcagca gacacagcaa gcggtggccg
     7321 cggccgctgc cgcagtgacc cagcagctcc aacagcaaca gcaggctgtc gtggcccaac
     7381 aggctgtagt gcagcagcaa caacagcagg cggcggcagt ggtgcaacag gcggcggtcc
     7441 aacaggctgt ggtgccccag ccgcagcagg cgcagcccaa tacgaatggc aatgcaggat
     7501 cgggatcgca aaatggcagc aacggcagca cggagacgcg cacaaacctt attgtcaact
     7561 acttgccgca aacaatgacc gaagacgaga tccgttcgct cttctccagc gtcggcgaga
     7621 ttgagtcggt gaagctgata cgcgacaagt cgcaggtcta catcgatcct ctcaatccgc
     7681 aggcgcccag caagggccaa agtctgggct acggctttgt taactatgtc cggccgcaag
     7741 atgccgagca ggctgttaat gttctaaacg gcctgcgact gcagaacaaa accataaagg
     7801 tgtcgtttgc ccgcccgtcg tccgatgcca ttaaaggcgc caacctttat gtgtcggggc
     7861 tgccaaagac gatgacccag caggaactgg aggccatctt cgcaccattc ggagcaataa
     7921 tcacatcgcg cattctgcag aacgctggca acgatacgca gacgaaagga gtaggcttta
     7981 ttcgattcga taagcgggag gaggccacaa gggctatcat tgccttgaac ggtactacac
     8041 catccagctg cacagacccc atagtggtaa aattctccaa tacgcccggc agcaccagta
     8101 agatcatcca gccgcagctg cccgctttcc tcaatcccca actggtgcga cgcatcgggg
     8161 gcgccatgca cacgccagtt aacaagggtc tggcccggtt ttcaccaatg gcgggcgaca
     8221 tgctggacgt aatgctgccc aatggactag gagcggcggc ggcggcggcc acaacgctgg
     8281 ccagtgggcc tggcggcgcg tatcccattt tcatctacaa cttggcgccg gaaacggagg
     8341 aggctgccct gtggcagctg tttggtccat ttggagctgt gcaatcggta aagattgtca
     8401 aggatcccac aacgaatcag tgcaagggct acggcttcgt ttcgatgacc aactacgatg
     8461 aggcggccat ggccattcgg gcgctcaacg gctacaccat gggcaatcgg gtgctgcagg
     8521 tcagcttcaa gaccaacaaa gccaagtaga gcggcccaaa tggaagtgga caaggacgca
     8581 gcgggagcac cagcaaccac aacccattat cttgaatcta aacctgacca cacaacacac
     8641 atatataaat atgcatggta taacggtaac taagcgcaac aaaacagtat gtgtacaaca
     8701 cacaagaaaa aaacaagaaa gtagcaggca ggagaaaata ctctttccac taaacgacaa
     8761 cgaaacggcg gaaatccccc taaaaacaca aatctcctcc ccaaaaccgc cggccaaaag
     8821 cgttgcaata gaaaaattct tcttgtttag catttaagta aaaaacctta actaaacgaa
     8881 gcgagcagaa gaagttataa gatcgggaga agcgaatagg tagggggttg agaaagaaca
     8941 atgagatttt aagaaataat attgcaaaat gaaaacacac tgtggcagtg gcagctgtga
     9001 aaggtcaaag gttggcacag tcgatctagt cacaaagcaa aacaaagaaa tcaaccaaac
     9061 aaacacaaaa cacacaaaaa cgaaaacaaa tttaacttta ttagcaaaaa agaaaatgag
     9121 aaattttaaa aatttattaa ctttaagtaa aacgcagcaa tgaaatcttc caaacaaaag
     9181 aaagcaaaac agcaaatctg tttgttacac ttcaactttt tctaccataa aaactttgaa
     9241 tcttgatact aaacctaaag cagaattaag aaaatagagg aattc
//
LOCUS       HUMDBTEFH    2335 bp ds-DNA             PRI       15-SEP-1989
DEFINITION  Human beta-globin intergenic region spanning the deletion which
            causes delta-beta-thalassemia.
ACCESSION   M22800 X14550
KEYWORDS    .
SOURCE      Human patient s.h. leukocyte cell DNA, togi clone d.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 643)
  AUTHORS   Mishima,N. and Gilman,J.G.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Mishima,N. and Gilman,J.G.
  TITLE     DNA deletion in indian delta-beta-thalassemia with elevated fetal
            hemoglobin extends from 1 kb 3' of a-GAMMA globin gene into
            downstream KpnI (L1) repeated sequence
  JOURNAL   FASEB J. 2, 546-546 (1988)
  STANDARD  simple staff_review
REFERENCE   3  (bases 1 to 2335)
  AUTHORS   Mishima,N. and Gilman,J.G.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by J.G.Gilman 28-FEB-1989; [3], 11-APR-1990.
            
            This sequence is across the DNA deletion in the beta-globin gene
            cluster.
BASE COUNT      931 a    462 c    393 g    530 t     19 others
ORIGIN      Chromosome 11.
        1 aagcttgggt tttgagtttt cattatttac ccaaaattca ttcaggagca ggttgtttaa
       61 tttccatgta attgcatagt tttcatctat tcattagtct tgaaatctat ttttattcct
      121 ctgtggtctg agagtgattg atatgatttt ttttcatttt ctgaggattg ttttaggtcc
      181 tcccaaggct gaaccaggaa gaaatcaatt tcctgaagag atccatgatg agctctgaaa
      241 atgattcagt aataaatagc ctatcaaaca aaaaaagccc tggacaatat gattcacaac
      301 caaattctat cagatgcaca aagaagagtt ggaaaaattc ctactgaagc tactccaaaa
      361 aattgagggg gagggattca acctcagctc attttatgag gccagcatca tcctgatacc
      421 aaaacctgat agacacacaa caacaaaaaa agaaaacttc aggccaatat ccttgaacat
      481 tgatgcaaaa atactagcaa gccaaatcca gcaacacatc acaaagctaa tccacctcaa
      541 tcaagtaggc tttatccctg agaagcaaga ttggttcagc atatacaaat tgataaatgt
      601 gattcatcat atatacagca ctaaaaacag aaaccacatg atcatttcaa tatgtgcaaa
      661 aaaagatttt gataaaattc aacttcactt catattaaaa cctctcaaca aactagctat
      721 ttaaggacat gacctcaaaa taatatgagc catctatgac aaacccacag ccaacatcat
      781 acagaatggg caaaacctgg aagcattctt cttgaaaact ggcacaagac aaggatgccc
      841 tctctcacca ctgctattca gtatagtact ggcagtcctg gccagagcaa tcaggcaaga
      901 gaagggaata aagggcatcc aaataggaag agagaaagtc aaagtatccc tgttttctgt
      961 atctagaaaa caccatagtc aaggcccaaa agctccttca gctgataact tcagcaaagt
     1021 ttcaggatat naaatcaatg tacaaaaatc actagcattc ccatacatca acaatagcca
     1081 agcctagagc caaatcagga agtcaatccc atttaccgtt gccacaaaaa gaataaaata
     1141 cctaaggata tatctaacca aggagttgaa agatctctac aatgagattt acaaaacact
     1201 gctcaaagaa atcagagctg atagaaacaa atggaaaaac tttccatgct catggataaa
     1261 aagaatcaat attgttaaaa tgtccatact gcctagagca atttacagtt tcaaagctat
     1321 tcctatcaaa ctaccaatga cattcttcac agaactagaa aaatacactt taaagttcat
     1381 atggaacaac aacaaaaaaa gtccaaatag cctaggaaat cctaatactc acaggtggga
     1441 attgaaaaat gagaacactt ggacacaggg tggggaacat cacacaccgg ggcctgtcgt
     1501 ggggtggtag gagagnngag ggataanatt aggagatata tctaatgtaa atgacgagtt
     1561 aacgggtgca gcacaccaat atggcacacg tatcactatg taacaaactg cacaggtacc
     1621 gtagaactta gagtacaaaa aaaaattgag aaaaagaaaa aagctggaga taacacattg
     1681 cctaacttca aaccatacca cagggctaca gtaaccaaaa cagtttggta ctgttacaga
     1741 aaagacacac agacaaatgg aacagaatag agagcccaga aataagacca catgcctaca
     1801 accatctgat ctttgacaaa gctgataana acaagcaatg aggaaaagac accctatnca
     1861 acaagtcgtg ttgggataac tgactaancg tatntagaag attgaaacca gaccccgtnc
     1921 tatatcatat ncncaaataa ncncaagatg gattagagac ttaaatgcaa agcccaaact
     1981 atnanaatgc ttgaaagcaa cctagacaat accattccgg acatgagcag gggcaaagat
     2041 ttcatgacaa agatgccaag agtcattgca gcaaaagcaa aaattgacaa ataggatcta
     2101 attaaaataa agtttctgca cagcaaaaga aactatcaac tgagtaaaca acctatagaa
     2161 tnggagaaaa tnttnncaaa ctatgcatcc aacaaaggcc taatattcag aatctataag
     2221 gaacttaaat gaatctacaa gaacaaacaa acaatgccac aaaaaatggg taaaggacat
     2281 gaacagacag tattcaaaag aagacatcac atatggccac caagcatata aaaaa
//

GenBank-Updates@genbank.bio.net (04/13/90)

LOCUS       CULESTB1     3105 bp ds-DNA             INV       27-FEB-1990
DEFINITION  C.pipiens esterase B1 gene, complete cds.
ACCESSION   M32328
KEYWORDS    esterase.
SOURCE      Mosquito (C.pipiens quinquefasciatus, strain Tem-R) 4th instar
            larva DNA.
  ORGANISM  Culex pipiens
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae.
REFERENCE   1  (bases 1 to 3105)
  AUTHORS   Mouches,C., Pauplin,Y., Agarwal,M., Lemieux,L., Herzog,M.,
            Abadon,M., Beyssat-Arnaouty,V., Hyrien,O.,
            Robert de Saint Vincent,B., Georghiou,G.P. and Pasteur,N.
  TITLE     Characterization of amplification core and esterase B1 gene
            responsible for insecticide resistnace in Culex
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            N.Pasteur, 23-FEB-1990.
FEATURES       from  to/span     description
    pept        406      546     esterase B1, exon 1
               1322     1459     esterase B1, exon 2
               1512     2393     esterase B1, exon 3
               2455     2916     esterase B1, exon 4
    pre-msg     268     3016     esterase B1 mRNA and introns
    IVS         547     1321     esterase B1 intron A
    IVS        1460     1511     esterase B1 intron B
    IVS        2394     2454     esterase B1 intron c
    rpt         155      171     direct repeat copy 1
    rpt         176      192     direct repeat copy 2
    signal      246      250     pot. TATA box
    signal     2998     3003     poly-A signal
BASE COUNT      904 a    671 c    700 g    830 t
ORIGIN      Chromosome 2.
        1 tacataccgc aataatcaaa atataacaaa aataatgttt tttttcctct caaatatgtt
       61 gaacaatccc attttacccc gcatctgcat tcattttaag agagaattgt ctgtgtatag
      121 gtagagtagt ctcgcaggta gggtacaaac acgagcgatg aacacatttg catttgcgat
      181 gaacacattt gcttgatgcg tgttttaaaa cttagcagtg caaagcagaa taccaccaac
      241 atcgcaattt tcagctccac aaatcatcag tacagagtgg gcagccgcac cgagctgttg
      301 gtgcaagtca attcagctga gcaaaccgaa aaaaaaaaac ttcgaagagt cacacccagc
      361 tgatagcgaa aatttaagca acaaaaaaac tccaatctac gtaggatgag tttggaaagc
      421 ttaaccgttc agaccaaata cggcccggtc cggggcaaac ggaacgtatc gttgctggga
      481 caggagtacg tcagctttca gggaattccg tacgcccggg caccggaagg ggagctgcgg
      541 tttaaggtga gagtggtaaa ttgtttcaag tgctgttcaa attttatgga tgtgcaagtg
      601 catttttgtt caaataaaga gcaacgagtg ctgctgatta gcgctgtatc taagagtgtg
      661 acctccgcgg cttgatactt caccgcaatg acatgacata tttgttgata aaaataaaat
      721 gtaataaata ttttgtactt ggacacgtac aaaccagtca tggtctaggc tggaaataaa
      781 ttgaagagtg acaaagtcaa acataaattt ggcagtgact tccattttga taaaagataa
      841 gaccataaac ttaatccaaa attgtttgtt ggacaattga taatgaagtc tgtgcaagta
      901 gaagttattg aaacattaga ctgtttaaaa tttaatttaa tctatgattt aatctttatt
      961 ttgttcaata atctaaattt ttgtaaaaca aaaaaatctt cagaaaacct atcaactgca
     1021 tgctataaca ttttcaaatg acttcgaaat attaccgaaa taacattaaa tttgtcaatc
     1081 aaatttagta gttttgacat tttgaaaatg ctaaaaaaat gatcaaactt gcaaaaccat
     1141 tttttttttt cataaatcct cagcaatttt gcgaacaaat tattaataaa aagaaacgtt
     1201 caaatatcac acctcactgt gtaaaatatt ttaaccagca ctgtatttcc cggtgacttg
     1261 aacgcaaaac aagcaacgaa caaaactgga aatttaaaaa cataacctta aaacatttca
     1321 ggcaccagtt ccaccgcaaa agtggaccga aacgttggac tgcacgcagc aatgcgagcc
     1381 ctgctatcac ttcgaccggc gcctccagaa gatcgtcggc tgcgaggaca gtctgaagat
     1441 caacgtgttt gcgaaggagg tgagttggtt tcaaagaatc aatttcaact ctgaattcac
     1501 gatttctcca gatcaaccct tcaacccctc ttccggtgat gctgtacatc tacggcgggg
     1561 gcttcacgga aggaaccagc ggaaccgaac tgtacgggcc ggatttcctg gttcagaagg
     1621 atatcgtgtt ggtgtcgttc aattaccgta ttggggcgtt aggttttctg tgttgtcaat
     1681 cggagcagga tggcgtaccc ggtaatgccg gactcaaaga tcagaacttg gccattcgtt
     1741 gggttctgga gaacattgcc gcctttggag gagacccgaa gcgcgtgacc ctggccggcc
     1801 atagcgcagg tgccgcttcg gttcagtatc atctgatttc ggatgcgtcc aaggacttgt
     1861 ttcagcggcg tatcgtaatg tctgggagta cgtattccag ttggtctttg accaggcaac
     1921 gcaactgggt tgagaagttg gcgaaggcca tcggttggga tggacagggt ggtgagtccg
     1981 gagcgttgag attcttgaga cgtgccaaac cggaggacat tgttgctcac caggagaagc
     2041 ttctgacgga ccaggacatg caggatgata tctttactcc gtttggacct accgttgaac
     2101 cgtacctgac ggaacagtgc ataataccga aggcaccgtt cgagatggct cgaacagctt
     2161 ggggtgacaa gattgatatc atgatcggtg gtacttctga agaaggactg ctactgctgc
     2221 aaaagatcaa gttgcatccg gaactactgt cccatcctca tctattcctg ggaaatgttc
     2281 ctccaaattt gaagatcagc atggaaaaac gaatcgagtt tgctgccaag ctgaaacaac
     2341 gttactaccc cgacagcatt ccttcaatgg agaacaacct gggatacgtt catgtaagtc
     2401 caaacctaac ctcaatcacc acatcaactt atcaaccctc cacaaaaatt ccagatgatg
     2461 tccgaccggg tcttctggca cggcctgcac cgcaccatcc ttgcccgcgc cgctcgatcg
     2521 cgcgcccgca ccttcgtgta ccggatctgt ctggattcgg agttttacaa ccactaccgc
     2581 atcatgatga tcgacccgaa gctgcgcggc acggcccatg ccgacgagct gtcctatctg
     2641 ttttccaact ttacccagca ggtccccggc aaggaaacgt tcgagtaccg cggtctgcaa
     2701 acgctggtcg atgtgttcag cgcgttcgtc atcaacgggg atccaaactg tggcatgacg
     2761 gcgaagggtg gtgtggtctt tgagccgaac gcgcagacga agcccacgtt caagtgtctg
     2821 aacattgcca acgacggggt ggcgttcgtt gactatccgg atgcggaccg gttggacatg
     2881 tgggacgcaa tgtacgtgaa tgatgagctg ttttgaggag aaagttttta aaatatccta
     2941 ttgagatttt gcaatgcctt atttaatctg ttgttttatt ttaattatgt attgttgaat
     3001 aaatttatac tatataacaa aagttttatt cttggagtca ttccgctaac ttttttgttt
     3061 gtgttcaaca tattgaagat ctgacaaccc tatcaaaggt tatag
//
LOCUS       DROFASI     15288 bp ds-DNA             INV       26-FEB-1990
DEFINITION  D.melanogaster fasciclin I (FasI) gene, complete cds.
ACCESSION   M32311
SOURCE      D.melanogaster DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 15288)
  AUTHORS   McAllistar,L., Zinn,K., Rehm,J. and Goodman,?.
  TITLE     no title given
  JOURNAL   Unpublished (1990)Howard Hughes Med. Inst. Berkeley, CA 94720
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            L.B. McAllister, 22-FEB-1990
FEATURES       from  to/span     description
    pept       1593     1697     fasciclin I, exon 2 (first expressed exon)
               7090     7222     fasciclin I, exon 3
               7304     7470     fasciclin I, exon 4
               7545     7775     fasciclin I, exon 4
               7843     8011     fasciclin I, exon 5
               8089     8249     fasciclin I, exon 6
               9814     9822     fasciclin I, exon 7
              10989    11143     fasciclin I, exon 8
              11210    11300     fasciclin I, exon 9
              11364    11584     fasciclin I, exon 10
              11644    11806     fasciclin I, exon 11
              11871    12018     fasciclin I, exon 12
              12428    12633     fasciclin I, exon 13
    IVS         824     1449     FasI intron A
    IVS        1698     7089     FasI intron B
    IVS        7223     7303     FasI intron C
    IVS        7471     7544     FasI intron D
    IVS        7776     7842     FasI intron E
    IVS        8012     8088     FasI intron F
    IVS        8250     9813     FasI intron G
    IVS        9823    10988     FasI intron H
    IVS       11144    11209     FasI intron I
    IVS       11301    11363     FasI intron J
    IVS       11585    11643     FasI intron K
    IVS       11807    11870     FasI intron L
    IVS       12019    12427     FasI intron M
    ORF           1      164     ORF (AA at 3)
BASE COUNT     4691 a   3381 c   2990 g   4225 t      1 others
ORIGIN      Chromosome 3, map position 89e-f.
        1 ctcgaggaga ccccgaaggt ccagctggag accccgatgc tggcgttgca gcctctgcag
       61 ctcggcagtt gccaactgat ccagctcgct ttggtcagtc atcgttattg gctttctgaa
      121 ttcggtcttt gcaatggtat tgtcaaaact gataatgata ttaatttgat ttcaatagca
      181 aacaagcgtt gctggtggca agttgcttag taacggcgtg gtttatcgaa gacttttacc
      241 atttacttaa agtaagaaga accaaatagg tacactgctt gttaaaattc aatgttatct
      301 ccttagaaaa gtaaaaaata agtgcttgaa aattataacc ggttcaattt ggaataaaat
      361 ctttcaaaca gtagagactc tgcttagcgg agttatccac cgcattctgt cgttgccaca
      421 ttttgcgaaa cggttaaaat gattatttat gcttttaaac gaaaaatgag aaaatttaaa
      481 gcagtcttct tgtaataata ataagattgg ataaacagac attattataa atctgacaca
      541 gaagagcaac gcagcgctcg atttgtacta atattttacc tctaacaaga ttcggaaaag
      601 cataaaaagc acataaataa gagtcaccgc caaagacatg cacaaataaa tataaatgtt
      661 aagcgattta ttaacagtgc acagtgttgc gatttgaaac atcgacagct atcgataaca
      721 ctcgttatcg tcaccacacg ctcggcttaa acctgttctc cgcttttcgt tttaattcga
      781 aaaaaacgtt cgagcgtggc gcacgttgcg attcctccga ctggtacttg ctccccttat
      841 attgaaactc tgtggaaaat ctatactgtg catagggcat aactctaaaa tagctgcacg
      901 gcccatttcg ctgcccttgc tattgattta ccaccgcgct gcgttcgttt gtgtgtgcta
      961 cttaaagaat ttgtgtgttt tccaagtttg gtccgtggac attgcataat ttctgcggcg
     1021 gccgttaatg ttaattcctg cagcccgagt ttccgagaat tacgcagaat aaagaccaga
     1081 gagaaaacta taaaatcgaa aacagaaaaa agtgccgcag cagcgaaatg caaaggcgca
     1141 taataattaa acacacagcg acggaatgaa gaaaaaataa tacacaataa gcgcagcttt
     1201 gtttctagtt aaattgcgtt tgtgttgtgc ttgccgattc ctcggtggtc cgtttttcgc
     1261 ttgttgttct atgtgacata acggaactct gggcaaaagc gaacaggaag cagcgataac
     1321 cttgcaaaaa caaagaaaat accaaggagg acaaaaagca tgccaagcat atatctgtga
     1381 aataatattt tcttttccga ggaaatgctg tttgtcgtcg gctaactgtt gtttgccttt
     1441 gaattgcaga tcttaatcgt agagcagcac tcacaccagc acacgccccc cgcaaaacag
     1501 cacacacagc cacactcaca aacgacgtac actgccagcg cataaacaag tgctgttgtg
     1561 gttgttgttg ctgctgctgc tcgtccgcta aaatgctgaa cgctgcagcg ctgcttttgg
     1621 cgctgctctg cgccgcgaac gcagccgccg ccgccgattt ggcggacaaa ttgcgcgatg
     1681 attcggaact ctctcaggtg agtcagaaga agaagaagaa gcgaataacg agaacgtgga
     1741 ggatgaggag gtgaaaggag gtggtagagt aagagcacga agaaaagcag tagcagcaga
     1801 aacagttggt gcaaataacc gaaacacttg tccgtagtca attaagcccc cagcaccacc
     1861 caccaaacgt cgccccccct ccccccgctc acccttgggg tacagtaggg attgcctgcc
     1921 atgaacatta gtaatccttt tctatctttg caaccgatta aattaatgcg aagagttaca
     1981 cttcgaatat ttgtcaaaag tatgtttgtt taggtatccg tacgcgcaac atgtaaaatt
     2041 tgaacacatt caggggcttt tattagtcta atatatcatt tcgttggaaa agtatttctt
     2101 aactgcctac attttataca gtaatcagtc gcattttttc tgtagccagg gaaactgcca
     2161 aaattaaaat aacttttgtg tgagcaaaaa acaaactatt tttagcagac atttatgcgt
     2221 ttaaataatt ttttcagcct tttgaatatg tttcatatat ttctgtgatg ccttaatttc
     2281 ttctcatggt cggatacctt attgcaatga taaaatcaat tttttattat gcctgacttg
     2341 gctttggcga agtcaataaa tttaccttgc cacctaatag tggcaactta tcaaagttcg
     2401 gctgtaattg tgccaccacc cacaagacgc cgccccaatt cgttggcctt gaaaggaaaa
     2461 gccttggtgg agtgtgcgag gaggcggggc agatcgggcg ggaaaaagga gcctgcgcaa
     2521 gagggacgca cccgcatttg ggcacaggta gcgtagcctt cgttccggca gcatcgtcct
     2581 tgacctttag ccatctcact tgttgttgga ctgcggaaaa tcgatgatgc tgcctaaaat
     2641 ccctggcggc agttcaggtc cagccgaccg ctaagctcca cttggccaca tttcgtatgc
     2701 atggcggagt tcgttatgta aaaatgaaaa gccacataaa ttaagcccag gtctgctgaa
     2761 ttggctctcc ccagcaaaag gcaaatggta gaaaaaacgc cggccgttga cgtccactgc
     2821 gcataacagt tttatattta gttttgcctt tttacacctt cacaggccaa ctcacataca
     2881 aagtacatag aaaacaaaaa cacattaatg tgtaaaaaac tcagaagcca acaaaggtgc
     2941 gcggctgaaa taacaagaac aacatgtgta atgaattcaa aggcgggggt tcagtcacca
     3001 gaaggggtat tagcattaca ccaggagaaa tatatatata tttttttaat atgttaaggt
     3061 ttatggtagc ccattaagcc acataactta tactcaaaaa atagaattga tttgagtgga
     3121 aaatagtaac atagttcgtc tattgtaatt gcttttatct tatatagtag gtgaagtagt
     3181 agttgaatac gcgtttttaa gtaactgact tgagtccctg cttaattttc tagtctgcct
     3241 taacactgca tttaacaccc gaaatacacg ctgcttcaaa agtaaattag acttccaagt
     3301 aaaaagaaag cgtgtgacag actgtaattg ttgcagcaag tgtctcaggg aagtagttgc
     3361 atccacacag aaaaggcaac cttttcgagt tacaataggc gaaacagagc gtacaactaa
     3421 aaaaaattaa aaacaagaag ggggactcgg cttgctcact tgcacccttg ctttagggca
     3481 ttgtaacttt gcccagaagt ttgccacttt cggagcaaag catttccggc ctcatatctg
     3541 gagtactgct atacatgagg gtgggaccat ttggtcgcca taaaaaagta agggcggaaa
     3601 gtaatcttct ggcatttatg caaattaagt gccgtaaaac taatggatca aaaagtcagg
     3661 cttgggaaat cccttttaaa gttacttctt aagactatga aataatcatt ttattaataa
     3721 tatgtcgtat tgcaggtagc tgctttcaat ccaatcccct ttttaaattt tgcgatttat
     3781 atattttctc agttgtattt aaaatgtaga caagactggt tcaaaagggt atctccatct
     3841 tctaaagtcg aagtcgggct tctttactct ggtgcctgca cttctcatta catgctgatg
     3901 gcttagataa atgtaggttg cctgttttgg tttgcggctt acggcttacg gcttgtggtt
     3961 ttgcatggca aaccaccgag tttaggagag agtttcgaaa ttagcagata attgagccac
     4021 gccgatctga gttttatatt caacttcttg acagccgaaa tgaaagtaca ctccgaggaa
     4081 cgagctgacc tcctgcgttt gacccccact tcctgtcgtg cgtcagccaa cgtcttgcgc
     4141 gttttattaa ttttaatttt taacctttga ctgatgcaat tgcagttggc cgagtggggg
     4201 tgtggacagc ttgggttgac aactgcgggg ctggctgaac tctgtctagg aacactcact
     4261 cgtatggctc acggcacaga acagtccaca ctcgcagcat catcgaattc aattaaattt
     4321 aatttaagat ttcaatgcga tgccaaggcg cgttatttga tatgcgacga gctccattac
     4381 aggcgaggcg atgaatgagc cacgactgac cttaatggat ccagatgctc atagacggat
     4441 ctctgtccga cggagcactg aaggtgcaca gttttacgat cctaggcgtg ggggcggaat
     4501 ggcctcggag tgatgccagt gtagaacccc ccatttcgat ccctttccac tgctgaaccg
     4561 ccaataaaag gagaatgctg ttgccccaat ctctcgattc cgaaccggaa tccgaagtcg
     4621 ggggttgaga tcgctttcgc ttttgcttct gttttctgtt tctgctgccg ctgcaattat
     4681 cgaaaagaaa atatactgaa caaccgacat atgcatacat aatttgtacc aattaaattt
     4741 atttatacaa catgcactcg cccagaagga aacaacctaa ccctcctgct ccgatttcag
     4801 tccctagtcc aaaatacgaa gcttggctcg ctctcttggc taattagttt taatggctcc
     4861 attagatgat ggaaattgaa tttcctccaa acaggcttac cctctcatga ttgttgttgt
     4921 ttttggtata ggaattagta caatttttgg ctacaaatcg aaagaaaaca gaacaaattt
     4981 gaactgccag cggttctctc tctttctctt tctttctttc tctctctcta tctctttcag
     5041 agcactgcca aatccttgag gcgccttatg ccagttaggg taagcagcaa ttccaaaaaa
     5101 ccaaacagaa acacacgaac acaattagct attgtaaatt tggaacaaac tcaaaagaca
     5161 ttgtaaatgt acaactcaca tgtttttggc ttaatttagt taaccaaccc gtactcatcg
     5221 ggccattggc aaattattgt ttgctccgcc gtctgactct cacccctgat gttcgttgtt
     5281 cgtctgtggc cccagggtca ttggcttggc actcgggaaa atcaaaatgc gcttttgaat
     5341 aaacccaccc tcttgggagc acttttggcg gttgggcggc gagtggtgtt tgcatagtaa
     5401 atgagacgcc ccagaccgaa agggaaactg tttatcgatg acagcataat actctaacaa
     5461 ttgtatcaaa ttttcgggct tccaaaaaag gactccccgg ggaagagctg gccttatcgc
     5521 gatgatgtgc caaaagagcc ggaaactcat taagatctgt tcccttccgg cccggccttt
     5581 catcaaatgc aaaacattcg ataacggaaa tttatgaaag taattgcatc taattgtcaa
     5641 aggcttccag ccagaaccca agggggttga atttagtaag gatgctgaca gatggccatt
     5701 ggaaaatggc ggccaagaga aagtgtcgca aacaatatca ttgcctattc aataaaactt
     5761 ctctgtactc tattggccat tatacctgaa cgtatcccat atcaacctgt gacactgtga
     5821 catatgccac agccattccc caggacccca cttaacagca gggcactctc ttggccagat
     5881 aagcgaaaac atttcgagag cgatatagac accagcaagc agttctgacc aagctggcca
     5941 agaagacggt gtaagaacaa agcgcgctgg cgactcgaat gtccacgacc aggacaacaa
     6001 caacgctaac ggccccgaca actgacggtc tcaaattacc aaaaagaaaa tgaatgcaga
     6061 gcgaggataa atgtgtattt acactcagga aaaggacgaa aattcctcaa aaaaagagta
     6121 tttcaatttt aaggtgttaa gttcgttcgt tgatgtttga aaggaattca aaggcatatc
     6181 aatatcagta aaagaaatta caagtaagga tcccactggc aatttaaatt taagcatata
     6241 tttccaatta tgaattgaat ggattgaatg gacgtgaagc acctatttct aggcctagcc
     6301 actttttttt cggtgtacat gcgtgtgtat tggcgatgcc agagcataga tatgtacagc
     6361 aggcagggga aatgggaatg gaaaagggaa aggacggact tttaaaccca aattgccatg
     6421 actcatgccg ggcagcacac gaaaacacga gagtcacacg accgttttaa aggagggcac
     6481 ggaaaaaggt gttgtcctcg gcaggacacg cacccacaca ggcacaccaa gacacagata
     6541 tgtccgcagg acctcataaa gccgactggg aatgtgcagc ggggcgtggc ggcagtggtt
     6601 ctccacgccg ggaagccgca gagtgtgcgg ttgcatgtgg cctcatatgg caggggcagg
     6661 tggctggccc gcccgagcaa atgactctgg gcccggttta gggtcagtgg ctggcgtttt
     6721 ggggtggctg gaaaatgcat tttcacatcc cgctctcaat tgaatgacac acaatccact
     6781 tgggcggcgt aacccaccca gaggcttacg taattggttc aataaacctt ttttttctag
     6841 tacgccgccc tggcatcacc accaggacat gcataaagtt gttcctcatt ttattagttt
     6901 gttagacgtt gccatggtgc tggccggttg ccgttttgcc aaaaccgccc agcaaccgtc
     6961 tgcgaaccac cccttaaggc acccactcca tcctagatcc accacccact actccatccc
     7021 actcggttcg ataagatatt ggcaatagag cgacgcgaat gagtgtaacg aacccgttta
     7081 tctttgcagt tctacagcct gctggagagc aatcaaattg ccaactcaac gctttcgctg
     7141 cgcagctgca cgatctttgt gcccaccaat gaagccttcc agcgctacaa gagcaaaacc
     7201 gcccatgtgc tctatcacat tagtgagttc agtttcaatc aggctgatta aaaataataa
     7261 aaatcaaaat aacatgactc taccctctga ttactctttg cagccactga ggcgtacacc
     7321 cagaaacgac tgccgaatac cgtgtcatcg gacatggccg gcaatccacc gctgtacatc
     7381 acaaagaact cgaatggcga catctttgtg aacaatgccc ggatcatacc ctcgctcagt
     7441 gtggagacaa acagcgatgg caagcggcag gtcagtggcc ccacatatgc atatgttgca
     7501 tttcatagag tgattgaaca actctcttca ttacttaccg acagatcatg cacatcatcg
     7561 acgaggtact ggagccgctc accgtcaagg ctggccattc ggataccccc aacaatccga
     7621 atgctctcaa gttcctgaag aacgccgagg agttcaacgt ggacaacatc ggtgtgcgca
     7681 cgtaccgcag ccaggtgacg atggccaaga aagagtcggt ctatgatgcc gccggacagc
     7741 acacgttcct ggttcccgtc gatgaaggct tcaaggtgct tttctagttt aattgcatta
     7801 tttgttacgc attgtaatct cactgttgct ttatgactcc agctctcggc tcgcagcagc
     7861 ctcgtggacg gcaaggtcat cgatggccat gtgataccaa acactgtcat cttcactgcc
     7921 gctgcccagc atgacgatcc caaggcttcc gccgcttttg aggacttact caaggtcacc
     7981 gtcagtttct tcaagcagaa gaacggcaaa agtaagattg cactaaagac ataccatttt
     8041 actaaaggtt tacgaatgat tgtatggttg atataatttt acttgcagtg tacgtcaagt
     8101 caaacaccat tgtgggtgat gccaaacacc gcgtgggcgt ggttctggcc gagatcgtga
     8161 aggcgaacat cccagtgagc aacggagtag tccatctgat ccaccgcccg ctgatgatca
     8221 tcgatacgac ggtcacccaa ttcctgcagg tatgtagggt gttgaagact agcatccagt
     8281 gcaattgtag acacacagac acgacacaga cagtccaaga aacaaacaca tagaaacaat
     8341 gatagtgcct gtggcttgtg ataaggcttg gcattttgat tggctcggca cgcactgaaa
     8401 gatagcgctg gcggcagacc gatggactgg gaacataaat cacttgcctg ctgccccagc
     8461 gaatgctaca aacacatgag ctaaacacat tgaaacatac acactgataa gcggcttttc
     8521 tttgccctaa gtgagcagct cctccaagtg gcgccaaagg gataatattt ccaaggttga
     8581 agtttcgtcg gggctcaact gagggcaaag gccaaaacca aataatcttg cacccagaaa
     8641 ccactgccta aacacataca gtataaataa ataagatcaa tgcagacact agttttcagt
     8701 atcccctaga tatatctaac taatacaaat gaaaacccaa aagactgatt ttatttggaa
     8761 ctgaattcgc tcaaaagtct gtattccgat tcggttctat gtaaaatcaa aagcaaaatt
     8821 ttattttaaa ataattttta aaatataaat tttattttaa aaaaattttg ggtgtaatgt
     8881 aagtattgcc aaaaagctga aaaacaaaaa aaaaaaaaac aaatcttaaa gcaatgtatt
     8941 tcatgtaaat ttttctatga atcagatttt cgtatcaata tgcatttcac aagcagttgt
     9001 ttgcattgca attgcaaaaa actgactcag tagggcaacg cgctcagaat tggccctcga
     9061 aattcaatac gtgatttagt caccaattgg caaacactaa aaaaanttgc ttagcggctg
     9121 cgctttgctt gatctgaccg aaatcgacca aaacttcatt cacaaaaact tccattgttc
     9181 tatattgaaa tcgaccaaaa cttcattcac aaaaacttcc atttttctat attgaatgta
     9241 ttggtacttt aatggcgtca atgaactaag caagtgtata aattattccc atcattgatt
     9301 ggctaaagtt cttcgatccc tcgaccgccg tgcatatcca aagatcagtc atctgtacgc
     9361 tcgcctacag ccggctagct ttgatcatca ccgtaaccgt agatgtatat gtatacacca
     9421 agcataaact gataacaaac cacccacccg ctcgataaat agttaccctc ctcataggca
     9481 tatacattag aaacagtgcg tttcgaaact gtgaggcaaa gtgtcgccga acagttttga
     9541 gtccgcactg tagaacggct atatcattcg accgctcgaa tcgtaagcgc cttcatatcg
     9601 tgctatcgtt ttcgaaattt tcaaactagt tttgttgtaa ttacgtgcga tactctcctt
     9661 ttgctaattt actgtatcca tacctctatc ctttgtttcc gtcacatgcc gaataattgt
     9721 acttaaaact acaaaacaca atccaaccca accacatatc gctcgaaatt gaatcggtgc
     9781 acccaaatat tgaaacaaaa ccaaaacgtt cagtcgttca aggtaaacaa cagactcaaa
     9841 atccgatttt aattttaagt tgttcggttt cttttgactt gcattttgtt ttgcatcccc
     9901 tgctatttta gctatgcatg agtttattac aaaaaatgtg tagacctttt ttgtaaaatt
     9961 agtccattgg ttgttaaagt cgcgatgcct tagggtctaa gaatatttct tctgattttt
    10021 attcacaaat agtctttata ggatgtacag cactagttta aatttgcaaa tgaagcaagt
    10081 gcacatgata ctttaaagtt aaagagtatt tatcgtatga tgagtttaca gaaattttac
    10141 ctaattactt aagataaact tctatttttt gtcggagctt ttatatcctt gttttatttt
    10201 tttaaatcat agtaatggaa attattatga ttcgctgtaa attgttgaaa aatgcacgta
    10261 actcagtaac attgtacata atggagccaa accgtttgcc tgttctttga aaaatgtcta
    10321 gaacaatgaa agattaaata taattccctt gtgcttcatt tcaaacgttc gtttgtggcg
    10381 gaacaactcg ggcaatctgc gtggaaccca aatcccatat taaacaatat ttaaattcaa
    10441 ttttattcga aacttgtctt caccaaatca cttcgaatga atattgaact atttctgcta
    10501 acactccgtt ttaaaactga tttagtttat gaacgtaagt aaacatcaca ccagcagaca
    10561 aagaacaaaa accacaacag tcatcagtgc aacgctttgc tatcaaattg ccatccaacc
    10621 caaccaccgt gcaaacaatt caatgattcg gtagattttc cgtcattaaa tcgcataaat
    10681 tcagctgtgt taagtgagtc attaagcgag cattcccatt gattcacaac ttaattggcc
    10741 agccatccat tcccccaccc atatgtgtat aactggctta cattataatc ggatatattt
    10801 ccaattgttg tatttttttc ctaattcttt gtactactcc caaaaaaaaa aaaacaattt
    10861 ataagcttat gcatacccct cttccattca ccgtttattt atttatatgc attttgtttt
    10921 aaggtacttc aaatagcatt atgatatgtt cattggtatt aattttgcaa tctatcttga
    10981 aggagaagga gaatgctgag aacggagctc tgcgcaagtt ctacgaagtt ataatggaca
    11041 atggtggagc agttctggac gacatcaata gcctgacaga agtgaccatt ttggctccca
    11101 gcaatgaggc ttggaactcc tcgaacatca acaatgtttt gcggtaatgt tgctcaaatc
    11161 gggaagataa tatttaatta tcataataat aaataaatac atcttacaga gatcggaata
    11221 agatgaggca gatcctgaac atgcatatca tcaaggaccg cttaaatgtg gacaagatca
    11281 ggcagaaaaa tgcaaatttg gtgagcttta catgagctta taatcaaatg gaaatttttt
    11341 acaaacacca ttcttttcaa aagattgccc aggtgcccac tgtcaacaac aacactttcc
    11401 tgtacttcaa cgttcgcggt gagggatcgg ataccgtgat aacagttgag ggaggcggcg
    11461 tgaatgccac cgttatccag gctgatgtgg cccagactaa tggttatgtt cacatcatcg
    11521 accatgtgct gggcgtgcct tacactacag ttcttggcaa acttgaatcc gatcccatga
    11581 tgaggtatgt tcaattttaa agctttatta gtacgctgtt taaattttat cgtattcctt
    11641 aagtgacacc tataagatgg gaaaattctc gcactttaat gaccagctga acaacacaca
    11701 acgccgcttc acctactttg tgcccaggga caagggctgg cagaagaccg agctggatta
    11761 cccatcggct cacaagaagc tttttatggc cgacttttcc tatcatgtag gtttttcacc
    11821 ctaagcctat taggcttttc tattatactc aaaatgtttt gtaaccgcag tccaagtcca
    11881 ttctggagcg tcatttggct atttcggata aggagtacac catgaaggat ctggttaagt
    11941 tttcgcaaga atcgggcagc gtaatcctac ccacgttccg cgactctttg agtatccgcg
    12001 tggaggagga agctggacgt aagtataaca cattggatac caagggctca gttgcttgtc
    12061 ctttcaattc aagtcgattc gtttcaccta atttgattga ttctattggt cttcccatac
    12121 cgtgctgcat tactaaccga acctgttttt ttacaactct taaactaaat ttttcgcttt
    12181 ctgcttaact aaaactaatc gttgtccaat caaaagatct ccatgatgag tatgctagtc
    12241 acgaatggac tggtgagtgc tgccgttttt caaacccaca tcgcaccgcc cacaacgaaa
    12301 tccttgacct ggccatgtcc acccatagct cgctcattgg gcaatccact agcattaacc
    12361 acccatcgat tgcataccaa ttttctattg tctagtggct attaacaacg attacttcca
    12421 tttgcaggct atgtgatcat ttggaactac aagaagatca acgtataccg gcccgatgtt
    12481 gagtgcacca acggaattat ccacgtcatc gactacccac tcctggagga aaaggatgtg
    12541 gtcgtggccg gaggtagcta tttgccagaa tcaagcattt gcatcatctt ggccaacctc
    12601 ataatgataa cagtagcaaa gttcttgaac taaatgcatc cgatatgtaa aaacaaatcc
    12661 aatccaaagc aaatgcaaat caaacacaac aacaacagtc gtctacagaa caagaaccaa
    12721 caacactcag tatcagacta acttaacatc cacatggatc taaataatca gcaccagttt
    12781 gttgataccg atcgaaaacc acaagcaacc caaacagtat ctgtaatata tacgtcacaa
    12841 ggaacgatca tcattccaac ccaacatgcc cacgtccacg ccatctacat acccacatac
    12901 acacactcag aaaaaaacga tgcagaaaac tgcctgctaa atttgctttt gcatcgacag
    12961 cttctcccaa aatagctggc aagagagact aagataaagt cgaagaaacc cctatgttta
    13021 agtattaaac tcgattttcc ctcaaacggc cttatgtata atattgaata tgaaactttc
    13081 tgtaaaacat tttaatttta tgttcgccct atactggtag cttacgttca tcagctcgct
    13141 atgttttaag tttagaccca cactgacatt ggtatcgttc catgtaacaa ctgattgtgt
    13201 ttatgatttt aagtgtacat tttttgtgaa ttgcttgttt tagttaatgg tctaatttat
    13261 gatttatgaa atgagttatc ttaagtagaa tgcgaatctc attgtacatc tcgaaaacga
    13321 agaagctaag cataagcaat aagcgcatca cacactctca tactcgagcg cccacacatt
    13381 ccccctattc ctatccataa atacatacaa catgtaacga aacaaagttc attgtaagat
    13441 ttgaaagtgt gaaatttgat aataaactga aagtttttca tttgaacctt actattaaga
    13501 gcggcagcaa tcttgtctcg aaccaatcgc gtgacccctg taggcctaga actatatcag
    13561 tgtcggacaa gaaatgtgca accaacaaac ccaattatat gatatgccat cctgcactta
    13621 gttactagcc ccgctatgca tgtgaacccc cagaaaaacc cattcgccca cacacaccag
    13681 ccacttggag cgagagtagc cactgcaact aggttagtcc aatgtataag ttccgaaact
    13741 gccgacaact acgaaaacta tacagcatac ataaactccc tgtaatctaa atcactcacg
    13801 gcacacaact acaactacaa ctacatgtca tcattgaatg gattttgata ccgattttaa
    13861 cttgcatata aacaaaaaca agaactagac aacgtgaagg agattttaaa caaaattctc
    13921 cctcggtcga gcagttgcat ttcaaacttt gtacgtagtt taaaactagt tttttagtcc
    13981 gacgtagaac aacccaattg ctaactatat accaactttc tttctatttc tctctgtctc
    14041 tccccctaat gctatgtact tataggttag acattgtaac tattgtaatc aacccagtgc
    14101 gttaaacccg agtgttaagt cgaacagtaa cacagaattg tactatcccc aaatgaataa
    14161 ctatcagcct accagtacac tgtctcaact ctcaccacca ccaccaccca cttaggaact
    14221 cagtcgaact tgaactcgaa atcaaagatc cagttgtggc agtcgcttca cgtagttgct
    14281 aattcccaat tcgaaccgat cctttccgaa agtcttatct ttagtatagg tggtttagtt
    14341 tcatttggag ccgtgcagtg ccgtagcagc taagtaaaaa tgtatgaaat gaagatgaac
    14401 acgagatcga aatcgtacgg aatgatcaga aatcagaaat aaataatgaa tacgctaatg
    14461 aattgtacaa gtaagcttta aagaattgct ggaggagcgc ggatcggaga acttagagga
    14521 gggagaaccg cattgcaatc gcattgcaat ttgtgtcgta gtcagtagtt acacgttaag
    14581 cggcgtctta acgtgtaact agtgccttac taaagataaa cgcattacct taacctttat
    14641 acaaatttac tcaaaacata cttgtacccc aagcatacgt tccgcttcga atgataccca
    14701 gatatatata cggagttaca ccccaagaat acaagtataa ctacaaatga tattgcgcca
    14761 cacgctattt acaccaaata caccaaacaa atcgagaaat gcatattttt catatattta
    14821 attgtcagaa taatataacg tatatgtaat atgtagttta tttactgtaa aacgcaagaa
    14881 cctaacaagt ggaatttgaa tcacatacaa ttgatgtata ttagctattg agtttctaag
    14941 caagcgttag acactgaaat atatgttcaa atatatgatt caaatatata caatatgcga
    15001 aaccaagcaa actatggaaa actggagtgc caaaagaata ttattccatt ttattttacg
    15061 acaagcgctt tttacaaata aaccgaatcc atttaaatta ctcgtaaaga acagacagat
    15121 tatatttagc attagttaaa ctaattatta catgtactag aaaaccgaat gtcaaccgag
    15181 aatcttcagc aagcttgagc gaataataaa actttaaaac taactataaa taaatcgacc
    15241 gtcctttttt gcacttaatc atgggttatg gttgtggaac ctgaattc
//
LOCUS       DROIAB       2131 bp ds-DNA             INV       14-FEB-1990
DEFINITION  D.melanogaster abdominal-A (iab) 1.7 and 2.0 kb RNAs, complete
            cds.
ACCESSION   M32109
KEYWORDS    .
SOURCE      D.melanogaster (strain Canton S) embryo DNA, and cDNA to mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2131)
  AUTHORS   Cumberledge,S., Zaratzian,A. and Sakonju,S.
  TITLE     Characterization of two RNAs transcribed from the cis-regulatory
            region of the abd-A domain within the Drosophila bithorax complex
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1900) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.Sakonju, 14-FEB-1990.
FEATURES       from  to/span     description
    RNA         141     1802     1.7 kb RNA
    RNA         141     2101     2.0 kb RNA
BASE COUNT      592 a    468 c    478 g    593 t
ORIGIN      Chromosome III, bithorax complex.
        1 ttagttcgaa gcacgtgcgt gtgagttccg catgaacggt acaccaacac tttcgaacag
       61 aaacccctca cagatactcc tcacacagat actctactgt actgggctac gatatctgtg
      121 agatacgcac agataccggc aggccgccgg cgtcaaagtc gacgtgctgc cggcgtttga
      181 gagaccgagc ggcgtgcaag aaatcgtcgg gcaacggctt caaatgaacg caaaacgttg
      241 gaaagcaaac aaccggcaaa agccatcagc agctaaaaat aaaactgcaa ctatcaaagt
      301 gaatctgtag agaaagagtg atagctcagt gtgcccagtg tttatttaat tacttattta
      361 aatttgattt tcaaaataag ttgtgatttt ggttcgtttg tttgtttccc cattaaatcg
      421 catcgcggaa ctggtttgca ttttgtatct tttatcgctg cagttcgttt ctgtgtggtt
      481 tcatgtattc cccatctatt ccatctcttt ttgcagatac aagtgaatga aaccctaacg
      541 tttaaacgcc accataaaca ccaaataaca aattttctgg caaacatggc atgtcacggt
      601 cccatttaac gtggaaaatc ggcccagaaa ttcaacacga cgacaacttt ttcgactgtc
      661 ccagaaagca aatatgtaaa gtgtatttgt gtgcgtgcca ttgacattga cttatggtca
      721 tgaaggggtt agggggacgg gtctaagggt gggggttgat ggctaaaggg gggtctcgcc
      781 gctacgacaa aaacttggat tctctcgttt cttttttgtc tttcgtcttc aacgcacatt
      841 tatggtttgt tttgctcaca cacacactca aaaagcctac gcagcggcag agagggcaca
      901 agcagcagag aaagagacga aacatgtctc ggatgatgat aaacgcctct cgtctatttt
      961 ggcgggtcca tgaatgaaac aattgcctcg aatgactgac tgccagacag atagctagaa
     1021 actgccctta cggcgggcaa attggggtat tttgggtttc ggtgactttg agggaccggg
     1081 acctgagggc ccggaaagcc gacagcaata aacactgtgc tcagacaaaa gtctgatgag
     1141 acagaccatc atgatgctca tcatcatcgg cgataataat gattttcatt tgcacattcc
     1201 acccgtttcg ctgttgccgg cttccagcat tcagattcag tttgcattca gtggggcagg
     1261 ttatttatgg cacttttcct atcgccactc actggggtaa tcgattttta ttggattaat
     1321 attgctttcc gcatgatttg gccttctttt cgttatcact tagcaggttt cttgacgctg
     1381 aggagtggac aatgactaaa actcagtcat ctggaaaagc tggaccaagg acatccatat
     1441 acaagtgctg gctaaacgtc tgtggttatc gcaatcattt atggcggaat agaaggacac
     1501 gtcttatgtg tcgattttcg ggccccatcc ctgttgaaaa atcagccgga cagcagcaac
     1561 caccgaacac aagacacagg acgaccggaa gtgaggaaaa cgcttgtaaa atcggtgcac
     1621 acgagtgcga gcgagacacg agcgaacggg ccgaaaggaa acgacaatta cgaggacacg
     1681 ccggcttgct tttctgcttt tcctgtctgc tcctttcagt tgcggccgtg caatcgtcgg
     1741 gtcaacaacg ccgcgtgtcc gatttcaatt tacaaattag aatatacatc tttacatctg
     1801 gatttggttt catctttatc tgcatttgga tccgcgacga cgcataattt tcagcacttt
     1861 tgaataatgg cgattctgtg cgacctcgta aacgtatact gaatgtatcc tgagtgtatc
     1921 ctatccggta taccttcagt atacgtaaca cgagagagca ctagcaacgt gggatgtgaa
     1981 acacttttta agacacctta atacacgtag cactttgtta tatggtctta cactttttaa
     2041 atatctcgac ttataaatat ataatttaaa tataaatact ttaataatgc gatataagtt
     2101 aaatacttcg atttactggc tggaagttca t
//
LOCUS       FVBOPD       1693 bp ds-DNA             BCT       05-MAR-1990
DEFINITION  Flavobacterium sp. parathion hydrolase gene, complete cds.
ACCESSION   M29593
KEYWORDS    parathion hydrolase.
SOURCE      Flavobacterium sp. (strain ATCC 27551) DNA, clone pPDL2.
  ORGANISM  Flavobacterium sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Acidiphilium cryptum; Neisseriaceae.
REFERENCE   1  (bases 1 to 1693)
  AUTHORS   Mulbry,W.W. and Karns,J.S.
  TITLE     Parathion hydrolase specified by the Flavobacterium opd gene:
            Relationship between the gene and protein
  JOURNAL   J. Bacteriol. 171, 6740-6746 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        419     1516     parathion hydrolase
    signal      312      317     -35 region
    signal      334      339     -10 region
    binding     408      411     ribosme binding site
BASE COUNT      372 a    497 c    477 g    347 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccgagc ggcttaccgg cgccctgctc gaccggttga cccaccatgt ccatatcctc
       61 gagatgaacg ggggcagcta tcggcttgcc agcagccgga aacggcaaaa gcacagccac
      121 cacgcacagc aagaggaggt gcccccccca tgacctgatc agaaaacccc tcatctgctg
      181 tgctgaacgg ccttcgctac gctccagacc gttcagcaca gcagatgaaa gcaccgcctc
      241 gacaagaggc tttttgttca atccaactgg tacactctta caccggaatc ttgcacaatt
      301 ttaccccggc attgacatct gacgcgtcaa cagtaaaaga aacaaccggt tcagatctgc
      361 agcctgactc ggcaccagtc gctgcaagca gagtcgtaag caatcgcaag ggggcagcat
      421 gcaaacgaga agggttgtgc tcaagtctgc ggccgccgca ggaactctgc tcggcggcct
      481 ggctgggtgc gcgagcgtgg ctggatcgat cggcacaggc gatcggatca ataccgtgcg
      541 cggtcctatc acaatctctg aagcgggttt cacactgact cacgagcaca tctgcggcag
      601 ctcggcagga ttcttgcgtg cttggccaga gttcttcggt agccgcaaag ctctagcgga
      661 aaaggctgtg agaggattgc gccgcgccag agcggctggc gtgcgaacga ttgtcgatgt
      721 gtcgactttc gatatcggtc gcgacgtcag tttattggcc gaggtttcgc gggctgccga
      781 cgttcatatc gtggcggcga ccggcttgtg gttcgacccg ccactttcga tgcgattgag
      841 gagtgtagag gaactcacac agttcttcct gcgtgagatt caatatggca tcgaagacac
      901 cggaattagg gcgggcatta tcaaggtcgc gaccacaggc aaggcgaccc cctttcagga
      961 gttagtgtta aaggcggccg cccgggccag cttggccacc ggtgttccgg taaccactca
     1021 cacggcagca agtcagcgcg atggtgagca gcaggccgcc atttttgagt ccgaaggctt
     1081 gagcccctca cgggtttgta ttggtcacag cgatgatact gacgatttga gctatctcac
     1141 cgccctcgct gcgcgcggat acctcatcgg tctagaccac atcccgcaca gtgcgattgg
     1201 tctagaagat aatgcgagtg catcagccct cctgggcatc cgttcgtggc aaacacgggc
     1261 tctcttgatc aaggcgctca tcgaccaagg ctacatgaaa caaatcctcg tttcgaatga
     1321 ctggctgttc gggttttcga gctatgtcac caacatcatg gacgtgatgg atcgcgtgaa
     1381 ccccgacggg atggccttca ttccactgag agtgatccca ttcctacgag agaagggcgt
     1441 cccacaggaa acgctggcag gcatcactgt gactaacccg gcgcggttct tgtcaccgac
     1501 cttgcgggcg tcatgacgcc atctggatcc ttccagccag cggccactat tccccgtcaa
     1561 gataccgaac gatgaagtcg cgcatcgatc gataggcatc ttcaatttga tcagggctgc
     1621 cacctccaaa gccgtggcca cccctgtcga tagtcttgag gacgtagggc acaccgtgct
     1681 tttcgaactg cag
//
LOCUS       HUMCSAE      1372 bp ss-mRNA            PRI       26-FEB-1990
DEFINITION  Human epithelial glycoprotein (EGP) mRNA, complete cds.
ACCESSION   M32306
KEYWORDS    cell surface antigen; cell surface glycoprotein;
            epithelial glycoprotein; transmembrane protein;
            tumor-associated antigen.
SOURCE      Human colon carcinoma cell line HT-29, cDNA to mRNA, (library
            HT-29/cdm8).
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1372)
  AUTHORS   Simon,B., Podolsky,D.K., Moldenhauer,G., Isselbacher,K.J.,
            Gattoni-Celli,S. and Brand,S.J.
  TITLE     Epithelial glycoprotein is a member of a family of epithelial cell
            surface antigens homologous to nidogen, a matrix adhesion protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1900) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            B.Simon, 23-FEB-1990.
FEATURES       from  to/span     description
    pept         22      966     epithelial glycoprotein (EGP) precursor
    sigp         22       84     epithelial glycoprotein signal peptide
    matp         85      963     epithelial glycoprotein
    mRNA    <     1     1372     EGP mRNA
BASE COUNT      436 a    233 c    317 g    386 t
ORIGIN      
        1 gctctaaagg cgcgcgcagg catggcgccc ccgcaggtcc tcgcgttcgg gcttctgctt
       61 gccgcggcga cggcgacttt tgccgcagct caggaagaat gtgtctgtga aaactacaag
      121 ctggccgtaa actgctttgt gaataataat cgtcaatgcc agtgtacttc agttggtgca
      181 caaaatactg tcatttgctc aaagctggct gccaaatgtt tggtgatgaa ggcagaaatg
      241 aatggctcaa aacttgggag aagagcaaaa cctgaagggg ccctccagaa caatgatggg
      301 ctttatgatc ctgactgcga tgagagcggg ctctttaagg ccaagcagtg caacggcacc
      361 tccatgtgct ggtgtgtgaa cactgctggg gtcagaagaa cagacaagga cactgaaata
      421 acctgctctg agcgagtgag aacctactgg atcatcattg aactaaaaca caaagcaaga
      481 gaaaaacctt atgatagtaa aagtttgcgg actgcacttc agaaggagat cacaacgcgt
      541 tatcaactgg atccaaaatt tatcacgagt attttgtatg agaataatgt tatcactatt
      601 gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc tgatgtggct
      661 tattattttg aaaaagatgt taaaggtgaa tccttgtttc attctaagaa aatggacctg
      721 acagtaaatg gggaacaact ggatctggat cctggtcaaa ctttaattta ttatgttgat
      781 gaaaaagcac ctgaattctc aatgcagggt ctaaaagctg gtgttattgc tgttattgtg
      841 gttgtggtga tagcagttgt tgctggaatt gttgtgctgg ttatttccag aaagaagaga
      901 atggcaaagt atgagaaggc tgagataaag gagatgggtg agatgcatag ggaactcaat
      961 gcataactat ataatttgaa gattatagaa gaagggaaat agcaaatgga cacaaattac
     1021 aaatgtgtgt gcgtgggacg aagacatctt tgaaggtcat gagtttgtta gtttaacatc
     1081 atatatttgt aatagtaaag cctgtactca aaatataagc agcttgaaac tggctttacc
     1141 aatcttgaaa tttgaccaca agtgtcttat atatgcagat ctaatgtaaa atccagaact
     1201 tggactccat cgttaaaatt atttatgtgt aacattcaaa tgtgtgcatt aaatatgctt
     1261 ccacagtaaa atctgaaaaa ctgatttgtg attgaaagct gcctttctat ttacttgagt
     1321 cttgtacata catacttttt tatgagctat gaaataaaac cattttaaac tg
//
LOCUS       HUMNOXF      2206 bp ss-mRNA            PRI       15-FEB-1990
DEFINITION  Human neutrophil oxidase factor (p67-phox) mRNA, complete cds.
ACCESSION   M32011
KEYWORDS    neutrophil oxidase factor.
SOURCE      Human promyelocytic leukemia myeloid cell line HL60, cDNA to mRNA,
            clone 10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2206)
  AUTHORS   Leto,T.L., Lomax,K.J., Volpp,B.D., Nunoi,H., Sechler,J.M.G.,
            Nauseef,W.M., Clark,R.A., Gallin,J.I. and Malech,H.L.
  TITLE     Cloning of a 67K neutrophil oxidase factor and its similarity to a
            noncatalytic region of P60c-src
  JOURNAL   Science (1900) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            H.L.Malech, 08-FEB-1990.
FEATURES       from  to/span     description
    pept         68     1648     neutrophil oxidase factor (p67-phox)
    mRNA    <     1     2206     p67-phox
    signal     2030     2035     poly-A signal
    signal     2181     2186     poly-A signal
BASE COUNT      624 a    499 c    573 g    510 t
ORIGIN      
        1 ctagtctttc agccttcagg ctgtttttgg cttgaagctc tcttggcctc ctagtttcta
       61 cctaatcatg tccctggtgg aggccatcag cctctggaat gaaggggtgc tggcagcgga
      121 caagaaggac tggaagggag ccctggatgc cttcagtgcc gtccaggacc cccactcccg
      181 gatttgcttc aacattggct gcatgtacac tatcctgaag aacatgactg aagcagagaa
      241 ggcctttacc agaagcatta accgagacaa gcacttggca gtggcttact tccaacgagg
      301 gatgctctac taccagacag agaaatatga tttggctatc aaagacctta aagaagcctt
      361 gattcagctt cgagggaacc agctgataga ctataagatc ctggggctcc agttcaagct
      421 gtttgcctgt gaggtgttat ataacattgc tttcatgtat gccaagaagg aggaatggaa
      481 aaaagctgaa gaacagttag cattggccac gagcatgaag tctgagccca gacattccaa
      541 aatcgacaag gcgatggagt gtgtctggaa gcagaagcta tatgagccag tggtgatccc
      601 tgtgggcaag ctgtttcgac caaatgagag acaagtggct cagctggcca agaaggatta
      661 cctaggcaag gcgacggtcg tggcatctgt ggtggatcaa gacagtttct ctgggtttgc
      721 ccctctgcaa ccacaggcag ctgagcctcc acccagaccg aaaaccccag agatcttcag
      781 ggctctggaa ggggaggctc accgtgtgct atttgggttt gtgcctgaga caaaagaaga
      841 gctccaggtc atgccaggga acattgtctt tgtcttgaag aagggcaatg ataactgggc
      901 cacggtcatg ttcaacgggc agaaggggct tgttccctgc aactaccttg aaccagttga
      961 gttgcggatc caccctcagc agcagcccca ggaggaaagc tctccgcagt ccgacatccc
     1021 agctcctcct agttccaaag cccctggaaa accccagctg tcaccaggcc agaaacaaaa
     1081 agaagagcct aaggaagtga agctcagtgt tcccatgccc tacacactca aggtgcacta
     1141 caagtacacg gtagtcatga agactcagcc cgggctcccc tacagccagg tccgggacat
     1201 ggtgtctaag aaactggagc tccggctgga acacactaag ctgagctatc ggcctcggga
     1261 cagcaatgag ctggtgcccc tttcagaaga cagcatgaag gatgcctggg gccaggtgaa
     1321 aaactactgc ctgactctgt ggtgtgagaa cacagtgggt gaccaaggct ttccagatga
     1381 acccaaggaa agtgaaaaag ctgatgctaa taaccagaca acagaacctc agcttaagaa
     1441 aggcagccaa gtggaggcac tcttcagtta tgaggctacc caaccagagg acctggagtt
     1501 tcaggaaggg gatataatcc tggtgttatc aaaggtgaat gaagaatggc tggaagggga
     1561 gtgcaaaggg aaggtgggca ttttccccaa agtttttgtt gaagactgcg caactacaga
     1621 tttggaaagc actcggagag aagtctagga tgtttcacaa actacaaagc tgaagaaaat
     1681 gaagccctat tacttgtttg taagatttag cacccttctg ctgtatactg tactgagaca
     1741 ttacagtttg gaagtgttaa ctatttattc cctgttaaaa tttaacctac tagacaatga
     1801 tgtgagtacc caggatgatt tcctggggca cagtgggtga ggagatgggg acaggtgaat
     1861 ggaggagtta ggggagagga aaagtggatg gaagtgtctg gaaagggcac gagagagtct
     1921 tccaggtact gatcctgttt cttgctctga gtgctagcta gccagctgtg ttcacactgt
     1981 aaacattcat caagctgtac atttggtgca cttttctgtg tcataccaca ataaaaaaaa
     2041 acctatcatc atcttacaaa aacaagacac ccaagtccag gcccaaggag taagtacaaa
     2101 tattcctgtt tctgaaccat tactgtaatt ggctcttaag gcttgaagta accttatagg
     2161 ttactcataa ggcatataca aataaacttg tttgttttct tttttc
//
LOCUS       HUMUDPGTA    1855 bp ss-mRNA            PRI       26-FEB-1990
DEFINITION  Human 3,4-catechol estrogen UDP-glucuronosyltransferase mRNA,
            complete cds.
ACCESSION   J05428
KEYWORDS    3,4-catechol estrogen UDP-glucuronosyltransferase.
SOURCE      Human liver, cDNA to mRNA, clone 63-11.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1855)
  AUTHORS   Ritter,J.K., Sheen,Y.Y. and Owens,I.S.
  TITLE     Cloning and expression of human liver UDP-glucuronosyltransferase
            in COS-1 cells: 3,4-catechol estrogens and estriol as primary
            substrates
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.S.Owens, 22-FEB-1990.
FEATURES       from  to/span     description
    pept         15     1604     UDP-glucuronosyltransferase (EC 2.4.1.17)
BASE COUNT      571 a    337 c    394 g    553 t
ORIGIN      
        1 tgcattgcac caggatgtct gtgaaatgga cttcagtaat tttgctaata caactgagct
       61 tttgctttag ctctgggaat tgtggaaagg tgctggtgtg ggcagcagaa tacagccatt
      121 ggatgaatat aaagacaatc ctggatgagc ttattcagag aggtcatgag gtgactgtac
      181 tggcatcttc agcttccatt ctttttgatc ccaacaactc atccgctctt aaaattgaaa
      241 tttatcccac atctttaact aaaactgagt tggagaattt catcatgcaa cagattaaga
      301 gatggtcaga ccttccaaaa gatacatttt ggttatattt ttcacaagta caggaaatca
      361 tgtcaatatt tggtgacata actagaaagt tctgtaaaga tgtagtttca aataagaaat
      421 ttatgaaaaa agtacaagag tcaagatttg acgtcatttt tgcagatgct atttttccct
      481 gtagtgagct gctggctgag ctatttaaca taccctttgt gtacagtctc agcttctctc
      541 ctggctacac ttttgaaaag catagtggag gatttatttt ccctccttcc tacgtacctg
      601 ttgttatgtc agaattaact gatcaaatga ctttcatgga gagggtaaaa aatatgatct
      661 atgtgcttta ctttgacttt tggttcgaaa tatttgacat gaagaagtgg gatcagtttt
      721 atagtgaagt tctaggaaga cccactacgt tatctgagac aatggggaaa gctgacgtat
      781 ggcttattcg aaactcctgg aattttcagt ttcctcatcc actcttacca aatgttgatt
      841 ttgttggagg actccactgc aaacctgcca aacccctgcc taaggaaatg gaagactttg
      901 tacagagctc tggagaaaat ggtgttgtgg tgttttctct ggggtcaatg gtcagtaaca
      961 tgacagaaga aagggccaac gtaattgcat cagccctggc ccagatccca caaaaggttc
     1021 tgtggagatt tgatgggaat aaaccagata ccttaggtct caatactcgg ctgtataagt
     1081 ggatacccca gaatgacctt ctaggtcatc caaagaccag agcttttata actcatggtg
     1141 gagccaatgg catctacgag gcaatctacc atgggatccc tatggtgggg attccattgt
     1201 ttgccgatca acctgataac attgctcaca tgaaggccag gggagcagct gttagagtgg
     1261 acttcaacac aatgtcgagt acagacttgc tgaatgcatt gaagagagta attaatgatc
     1321 cttcatataa agagaatgtt atgaaattat caagaattca acatgatcaa ccagtgaagc
     1381 ccctggatcg agcagtcttc tggattgaat ttgtcatgcg ccacaaagga gctaaacacc
     1441 ttcgggttgc agcccacgac ctcacctggt tccagtacca ctctttggat gtgattgggt
     1501 tcctgctggt ctgtgtggca actgtgatat ttatcgtcac aaaatgttgt ctgttttgtt
     1561 tctggaagtt tgctagaaaa gcaaagaagg gaaaaaatga ttagttatat ctgagatttg
     1621 aagctggaaa acctgatagg tgagactact tcagtttatt ccagcaagaa agattgtgat
     1681 gcaagatttc tttcttcctg agacaaaaaa aaaaaaaaga aaaaaaaatc ttttcaaaat
     1741 ttactttgtc aaataaaaat ttgtttttca gagatttacc acccagttca tggttagaaa
     1801 tattttgtgg caatgaagaa aacactacgg aaaataaaaa ataagataaa gcctt
//
LOCUS       PSERCBC      1060 bp ds-DNA             BCT       08-MAR-1990
DEFINITION  P.aeruginosa carbenicillinase gene, complete cds.
ACCESSION   J05162
KEYWORDS    carbenicillinase.
SOURCE      P.aeruginosa (strain Dalgleish) DNA, clone pMON709.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Acidiphilium cryptum; Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1060)
  AUTHORS   Boissinot,M. and Levesque,R.C.
  TITLE     Nucleotide sequence of the PSE-4 carbenicillinase gene and
            correlations with the Staphylococcus aureus PC1 beta-lactamase
            crystal structure
  JOURNAL   J. Biol. Chem. 265, 1225-1230 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed copy of sequence kindly provided by
            authors, 25-JAN-1990.
FEATURES       from  to/span     description
    pept        122     1036     carbenicillinase
BASE COUNT      316 a    195 c    246 g    303 t
ORIGIN      357 bp upstream of HincII site.
        1 tcgggcatcc aagcagcaag cgcgttacgc cgtgggtcga tgtttgatgt tatggagcag
       61 caacgatgtt acgcagcagg gcagtcgccc taaaacaaag ttagccatat tatggagcct
      121 catgctttta tataaaatgt gtgacaatca aaattatggg gttacttaca tgaagttttt
      181 attggcattt tcgcttttaa taccatccgt ggtttttgca agtagttcaa agtttcagca
      241 agttgaacaa gacgttaagg caattgaagt ttctctttct gctcgtatag gtgtttccgt
      301 tcttgatact caaaatggag aatattggga ttacaatggc aatcagcgct tcccgttaac
      361 aagtactttt aaaacaatag cttgcgctaa attactatat gatgctgagc aaggaaaagt
      421 taatcccaat agtacagtcg agattaagaa agcagatctt gtgacctatt cccctgtaat
      481 agaaaagcaa gtagggcagg caatcacact cgatgatgcg tgcttcgcaa ctatgactac
      541 aagtgataat actgcggcaa atatcatcct aagtgctgta ggtggcccca aaggcgttac
      601 tgatttttta agacaaattg gggacaaaga gactcgtcta gaccgtattg agcctgattt
      661 aaatgaaggt aagctcggtg atttgaggga tacgacaact cctaaggcaa tagccagtac
      721 tttgaataaa tttttatttg gttccgcgct atctgaaatg aaccagaaaa aattagagtc
      781 ttggatggtg aacaatcaag tcactggtaa tttactacgt tcagtattgc cggcgggatg
      841 gaacattgcg gatcgctcag gtgctggcgg atttggtgct cggagtatta cagcagttgt
      901 gtggagtgag catcaagccc caattattgt gagcatctat ctagctcaaa cacaggcttc
      961 aatggaagag cgaaatgatg cgattgttaa aattggtcat tcaatttttg acgtttatac
     1021 atcacagtcg cgctgataag gctaacaagg ccatcaagtt
//
LOCUS       RAT5MDAA     2030 bp ss-mRNA            ROD       15-DEC-1988
DEFINITION  Rat iodothyronine 5' monodeiodinase (5'MD) mRNA, 3' end.
ACCESSION   M21018
KEYWORDS    iodothyronine 5' monodeiodinase.
SOURCE      Rat liver, cDNA to mRNA, clone 2302.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2030)
  AUTHORS   Boado,R.J., Campbell,D.A. and Chopra,I.J.
  TITLE     Nucleotide sequence of rat liver iodothyronine 5'-monodeiodinase
            (5' MD): Its identity with the protein disulfide isomerase
  JOURNAL   Biochem. Biophys. Res. Commun. 155, 1297-1304 (1988)
  STANDARD  simple staff_entry
COMMENT     Draft entry and  computer-readable sequence for [1] kindly
            submitted by D.Campbell, 14-OCT-1988.
FEATURES       from  to/span     description
    pept    <     1     1449     iodothyronine 5' monodeiodinase (AA at 1)
BASE COUNT      527 a    493 c    532 g    478 t
ORIGIN      
        1 gtcctggtgc tgaagaagag caacttcgca gaggcgctgg cggcgcacaa ctacctgctg
       61 gtggagttct atgccccatg gtgtggccac tgcaaagcac tggccccaga gtatgccaaa
      121 gctgctgcaa aactgaaggc agaaggctct gagatccgac tagcaaaggt ggacgccaca
      181 gaagagtctg acctggccca gcagtatggt gtccgtggct accccacaat caagttcttc
      241 aagaatggag acacagcctc cccaaaggaa tatacagctg gcagggaagc tgacgacatt
      301 gtgaactggc tgaagaaacg cacaggccca gcagccacaa ccctgtctga cactgcagct
      361 gcagagtcct tggtggactc aagcgaagtg acggtcatcg gcttcttcaa ggacgcaggg
      421 tcagactccg ccaagcagtt cttgctggca gcagaggctg ttgatgacat accttttgga
      481 atcacttcca atagcgatgt gttttccaag taccagctgg acaaggatgg ggtggtcctc
      541 tttaagaagt ttgatgaagg ccgcaacaat tttgaaggtg agatcaccaa ggagaagctg
      601 ttagacttca tcaagcacaa ccagctgcct ttggtcatcg agttcactga acagacagct
      661 ccaaagattt tcggaggtga aatcaaaaca catattctgc tgttcctgcc caagagtgtg
      721 tctgactacg atggcaaatt gagcaacttt aagaaagcgg ccgagggctt taagggcaag
      781 atcctgttca tcttcatcga tagtgaccac actgacaacc agcgcatact tgagttcttt
      841 ggcctgaaga aggaggaatg tccagctgtg cggcttatta ccctggagga agagatgacc
      901 aagtacaaac cggagtcaga cgagctgaca gctgagaaga tcacacaatt ttgccaccac
      961 ttcctggagg gcaagatcaa gccccacctg atgagccagg aactgcctga agactgggac
     1021 aagcagccag tgaaagtgct agttgggaaa aactttgagg aggttgcttt tgatgagaaa
     1081 aagaacgtgt ttgttgaatt ctatgctccc tggtgtggtc actgcaagca gctagccccg
     1141 atttgggata aactgggaga gacatacaaa gaccatgaga atatcgtcat cgctaagatg
     1201 gactcaacag ccaatgaggt ggaagctgtg aaggtgcaca gctttcccac actcaagttc
     1261 ttcccagcaa gtgcagacag aacggtcatt gattacaacg gtgagcggac actagatggt
     1321 tttaagaaat tcttggagag cggtggccag gatggagcgg gggacaatga cgacctcgac
     1381 ctagaagaag ctttagagcc agatatggaa gaagacgacg atcagaaagc cgtgaaggat
     1441 gaactgtagt gcagaagcca gatctgggcg cctgaaccca aaacctcggt ggccatgtcc
     1501 cagcagccca catctccgga gcctgagcct caccccagga ggagcgccat cagaacccag
     1561 ggaatctttc tgaagccaca ctcatctgac acacgtacac ttaaacctgt ccttcttttt
     1621 ttgcttttca attttggaaa gggatctctg accaggccag cccatcttga agggctacgt
     1681 tttgttttaa ttggtggtgt actttttgta cgtggatttt gtcccaagtg cttgctacca
     1741 tatttgggat ttcacactgg taatgtcttt cctgttagag aggtttatgc tatcacttca
     1801 gatttcgtct gtgagatgtt tcatcttcct gacatgtctt catgtcgagg tacttgttcc
     1861 accacgcaga cctccctgag accccttctg ccctgcgcag gaggcgatgg ttctgggtcg
     1921 tatgctctct ctctctccac cttgtactag tgttcgcatg acagcatggc ttttgtagtt
     1981 tgcatttaac ctggggattt ctgcatcctg tcagagggtg ggtccccacc
//
LOCUS       RIRANT17KA    539 bp ds-DNA             BCT       23-FEB-1990
DEFINITION  R.conorii 17K genus-common antigen gene, complete cds.
ACCESSION   M28480
KEYWORDS    antigen.
SOURCE      R.conorii DNA.
  ORGANISM  Rickettsia conorii
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE   1  (bases 1 to 539)
  AUTHORS   Anderson,B.E. and Tzianabos,T.
  TITLE     Comparative sequence analysis of a genus-common rickettsial antigen
            gene
  JOURNAL   J. Bacteriol. 171, 5199-5201 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed copy of sequence for [1] kindly submitted
            by B.Anderson, 27-OCT-1989.
FEATURES       from  to/span     description
    pept         60      539     17 kD antigen precursor
    sigp         60      119     17 kD antigen signal peptide
    matp        120      536     17 kD antigen
    mRNA         38  >   539     17 kD mRNA
    signal        1        6     -35 region
    signal       21       27     -10 region
BASE COUNT      179 a     93 c    125 g    142 t
ORIGIN      
        1 tttacaaaat tctaaaaacc atatacttat taattatata ttaatttaga gagaattata
       61 tgaaactatt atctaaaatt atgattatag ctcttgcaac ttctatgtta caagcctgta
      121 acggtccggg cggcatgaat aaacaaggta caggaacact tcttggcggt gctggcggcg
      181 cattacttgg ttctcaattc ggtaagggca aaggacagct tgttggagta ggtgtaggtg
      241 cattacttgg agcagttctt ggtggacaaa tcggtgcagg tatggatgaa caggatagaa
      301 gacttgcaga gcttacctca cagagagctt tagaaacagc tcctagtggt agtaacgtag
      361 aatggcgtaa tccggataac ggcaattacg gttacgtaac acctaataaa acttatagaa
      421 atagcactgg tcaatattgc cgtgagtaca ctcaaacagt tgtaataggc ggaaaacaac
      481 aaaaagcata cggtaatgca tgccgccaac ctgacggaca atggcaagtt gtgaattga
//
LOCUS       RIRANT17KB    535 bp ds-DNA             BCT       23-FEB-1990
DEFINITION  R.typhi 17K genus-common antigen gene, complete cds.
ACCESSION   M28481
KEYWORDS    antigen.
SOURCE      R.typhi DNA.
  ORGANISM  Rickettsia typhi
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE   1  (bases 1 to 535)
  AUTHORS   Anderson,B.E. and Tzianabos,T.
  TITLE     Comparative sequence analysis of a genus-common rickettsial antigen
            gene
  JOURNAL   J. Bacteriol. 171, 5199-5201 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed copy of sequence for [1] kindly submitted
            by B.Anderson, 27-OCT-1989.
FEATURES       from  to/span     description
    pept         56      535     17 kD antigen precursor
    sigp         56      115     17 kD antigen signal peptide
    matp        116      535     17 kD antigen
    mRNA         37  >   535     17 kD mRNA
    signal        1        6     -35 region
    signal       21       27     -10 region
BASE COUNT      182 a     93 c    122 g    138 t
ORIGIN      
        1 tttacaaaat tctaaaaacc atatacttat taaataatag attaaagagg attatatgaa
       61 actattatct aaagtaatga ttttagctct tgcagcttct atgttacaag catgtaatgg
      121 tccgggtggt atgaacaaac aagggactgg aacacttctt ggtggcgccg gcggtgcatt
      181 acttggttct caattcggtc acggtaaagg acaacttgtc ggagtaggtg taggcgcatt
      241 acttggggca gttcttggtg gacaaatcgg tgcaagtctg gatgagcagg atagaaaact
      301 tctagaacta acatcacaaa gagctttaga atctgctcct agcggtagta acatagaatg
      361 gcgcaatcca gataatggca atcatggtta cgtaacacct aataaaactt ataggaacag
      421 tacaggtcaa tattgccgtg aatacactca aacagttgta atagggggaa aacaacaaac
      481 aacatatgga aatgcatgcc gccaacctga cggacaatgg caagttgtga attga
//
LOCUS       RIRANT17KC    532 bp ds-DNA             BCT       23-FEB-1990
DEFINITION  R.typhi 17K genus-common antigen gene, complete cds.
ACCESSION   M28482
KEYWORDS    antigen.
SOURCE      R.typhi DNA.
  ORGANISM  Rickettsia typhi
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE   1  (bases 1 to 532)
  AUTHORS   Anderson,B.E. and Tzianabos,T.
  TITLE     Comparative sequence analysis of a genus-common rickettsial antigen
            gene
  JOURNAL   J. Bacteriol. 171, 5199-5201 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed copy of sequence for [1] kindly submitted
            by B.Anderson, 27-OCT-1989.
FEATURES       from  to/span     description
    pept         56      532     17 kD antigen precursor
    sigp         56      115     17 kD antigen signal peptide
    matp        116      529     17 kD antigen
    mRNA         37  >   532     17 kD mRNA
    signal        1        6     -35 region
    signal       21       27     -10 region
BASE COUNT      183 a     88 c    121 g    140 t
ORIGIN      
        1 tttacaaaat tctaaaaacc atatacttat ttaaataata attagagagg attatatgaa
       61 actattatct aaaataatga ttatagctct tgcagcttct atgttacaag cttgtaatgg
      121 tcagagtggt atgaacaaac aaggtaccgg tacacttctt ggtggcgcag gaggtgcatt
      181 acttggttct caattcggtc aaggtaaagg acaacttgtc ggagtaggtg taggcgcatt
      241 acttggggca gttcttggtg gacaaatcgg cgcaagtatg gatgagcagg atagaagact
      301 tctagaacta acatcacaaa gagctttaga atctgcacct agcggtagta acatagaatg
      361 gcggaatcca gataacggca atcatggtta cgtcacacct aataaaactt ataggaacag
      421 tgcaggtcaa tattgccgtg aatacactca aacagttata ataggcggaa aacaaaaaac
      481 atatggtaat gcatgccgtc aacctgatgg gcaatggcaa gttgtgaatt ga
//
LOCUS       RIRANT17KD    539 bp ds-DNA             BCT       23-FEB-1990
DEFINITION  R.rickettsia 17K genus-common antigen gene, complete cds.
ACCESSION   M28479
KEYWORDS    antigen.
SOURCE      R.rickettsia DNA.
  ORGANISM  Rickettsia rickettsia
            Unclassified.
REFERENCE   1  (bases 1 to 539)
  AUTHORS   Anderson,B.E. and Tzianabos,T.
  TITLE     Comparative sequence analysis of a genus-common rickettsial antigen
            gene
  JOURNAL   J. Bacteriol. 171, 5199-5201 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed copy of sequence for [1] kindly submitted
            by B.Anderson, 27-OCT-1989.
FEATURES       from  to/span     description
    pept         60      539     17 kD antigen precursor
    sigp         60      119     17 kD antigen signal peptide
    matp        120      536     17 kD antigen
    mRNA         37  >   539     17 kD mRNA
    signal        1        6     -35 region
    signal       21       27     -10 region
BASE COUNT      179 a     92 c    125 g    143 t
ORIGIN      
        1 tttacaaaat tctaaaaacc atatacttat taattatata ttaatttaga gagaattata
       61 tgaaactatt atctaaaatt atgattatag ctcttgcaac ttctatgtta caagcctgta
      121 acggtccggg cggtatgaat aaacaaggta caggaacact tcttggcggt gctggcggcg
      181 cattacttgg ttctcaattc ggtaagggca aaggacagct tgttggagta ggtgtaggtg
      241 cattacttgg agcagttctt ggtggacaaa tcggtgcagg tatggatgaa caggatagaa
      301 gacttgcaga gcttacctca cagagagctt tagaaacagc tcctagtggt agtaacgtag
      361 aatggcgtaa tccggataac ggcaattacg gttacgtaac acctaataaa acttatagaa
      421 atagcactgg tcaatattgc cgtgagtaca ctcaaacagt tgtaataggc ggaaaacaac
      481 aaaaagcata cggtaatgca tgccgccaac ctgacggaca atggcaagtt gtgaattga
//
LOCUS       ROTNS53      1611 bp ds-RNA             VRL       02-MAR-1990
DEFINITION  Simian rotavirus SA11 segment 5 nonstructural protein (NS53) gene,
            complete cds.
ACCESSION   X14914
KEYWORDS    nonstructural protein.
SOURCE      Simian rotavirus, cDNA to viral RNAi, clone 35.
  ORGANISM  Simian (SA11) rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 1611)
  AUTHORS   Mitchell,D.B. and Both,G.W.
  TITLE     Conservation of a potential metal binding motif despite extensive
            sequence diversity in the rotavirus nonstructural protein NS53
  JOURNAL   Virology 174, 618-621 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            D.B.Mitchell, 03-APR-1989.
FEATURES       from  to/span     description
    pept         31     1518     nonstructural protein
BASE COUNT      571 a    214 c    288 g    538 t
ORIGIN      
        1 ggcttttttt tgaaaagtct tgtgttagcc atggctactt ttaaagatgc atgctttcat
       61 tatcgtagat taactgcttt aaatcggaga ttatgcaaca ttggtgcaaa ttctatttgc
      121 atgccagttc ctgatgagaa aattaagggg tggtgtttag aatgttgtca aatagctgat
      181 ttaacccatt gttatggttg ctcattgccg catgtttgca aatggtgtgt tcagaacaga
      241 agatgcttcc ttgacaatga acctcatttg cttaagctta gaactgtgaa acatccaatt
      301 accaaagaca aattacagtg tatcatagac ttgtacaata taatatttcc aattaatgat
      361 aaagtaatta gaaaatttga aagaatgata aagcaaagaa aatgtaggaa tcaatataaa
      421 attgaatggt ataatcattt gctgctccca attacattaa atgctgctgc atttaagttt
      481 gatgaaaata atctttatta tgtttttggg ttatatgaga aatcagtcag tgatatatat
      541 gctccatata gaattgttaa ctttataaat gaatttgata aattattgct tgatgatatt
      601 aactttacaa gaatgtccaa tctaccaata gagttgagaa ccattatgca agaatacttc
      661 caattatcaa gactgccatc atcaaaacta aagcaaattt acttttcaga ttttactaaa
      721 gaaactgtga tttttaatac ttatacaaaa acgccaggaa gatcaatata cagaaatgta
      781 actgaattta attggagaga tgaattggag ctttattctg atttaaaaaa tgataagaat
      841 aaattaattg ctgcaatgat gacgagtaag tatactcggt tctatgctca tgataataat
      901 tttggaaggt tgaaaatgac aatatttgag ttgggacatc attgtcagcc taactacgtg
      961 gcatctaatc acccaggcaa tgcttccgat atccagtact gtaaatggtg taatataaaa
     1021 tattttctta gtaaaattga ttggcggatt cgtgatatgt ataatttatt gatggaattt
     1081 attaaggatt gttataaaag taatgttaac gttggacatt gtagttctgt tgaaaacata
     1141 tatcctttaa ttaaaagatt aatttggagt ttgtttacta atcacatgga tcaaacaatt
     1201 gaagaagtgt ttaatcacat gtcgccagtg tcagttgaag gtacgaatgt catcatgttg
     1261 attcttggat tgaatattag tttgtataat gaaattaagc gcactttgaa tgtagatagc
     1321 ataccaatgg tacttaattt aaatgaattc agtagtatag ttaaatcaat tagcagtaaa
     1381 tggtataatg ttgatgaatt ggataaattg ccaatgtcaa taaaatcaac ggaggaactg
     1441 attgaaatga agaattctgg aactttaact gaagaatttg agctactgat ctccaactca
     1501 gaagatgaca atgagtgaaa ttactaatgt cactatctaa ttatacagta tttagccatc
     1561 acaagaccgt ccagactaga gtagcgccta gctggcaaaa tactgtgaac c
//
LOCUS       SPINR        3284 bp ss-mRNA            PLN       06-MAR-1990
DEFINITION  Spinach NADH nitrate reductase mRNA, complete cds.
ACCESSION   M32600
KEYWORDS    NADH nitrate reductase; nitrate reductase.
SOURCE      Spinach, cDNA to mRNA, clone pIMP4.
  ORGANISM  Spinacia oleracea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE   1  (bases 1 to 3284)
  AUTHORS   Prosser,I.M. and Lazarus,C.M.
  TITLE     Complete nucleotide sequence of spinach nitrate reductase cDNA
  JOURNAL   Unpublished (1990) Long Ashton Res. Stn, Long Ashton, Bristol, UK
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.M.Prosser, 05-MAR-1990.
FEATURES       from  to/span     description
    pept        116     2896     NADH nitrate reductase
BASE COUNT      894 a    642 c    849 g    899 t
ORIGIN      
        1 aaaaaccata acacctctgg aaaaatattg aggaagaacc gtagttcttc tttccttttt
       61 ctttttttct ttttaattac atttcaaaaa accctacatt tgatcgattt gagctatggc
      121 ggcgtcagtc gatcgtcaat atcatccggc gccgatgagc ggtgtcgttc gtaccccatt
      181 tagcaaccac caccgttccg actctccggt tcggaacggt tatactttta gtaacccgcc
      241 ttcttcaaat ggcgttgtta aacccggtga aaaaatcaag ctcgtagata ataatagtaa
      301 tagtaataat ggtagcaata ataataataa tcgctatgat agcgatagtg aagaagatga
      361 tgacgaaaat gagatgaatg tctggaatga gatgatcaaa aaaggtaaca gcgaattgga
      421 gccgtcttcc gtcgatagcc gtgatgaagg aacggctgat caatggattg agaggaatcc
      481 ttcaatgatc cgtctcacag ggaagcaccc gttcaattcg gagcctccat tgacccgatt
      541 gatgcaccac gggtttctca ccccggttcc gcttcattac gttcggaacc acggtccagt
      601 ccccaacgct aagtgggagg attggaccgt tgaggtgacc gggttagtca aaagaccaat
      661 ccggttcact atggaccaat tggtcaatga cttccaaagc cgtgagtttc cggtgacgct
      721 tgtctgcgcc ggtaaccgga ggaaggaaca gaatatgacg aaacaatcga tagggtttaa
      781 ctggggttcc gccgccgtgt caacttccgt gtggcgcggt gttccacttc gcgacgtgct
      841 taagcggtgt ggggtcatga gttcgttaaa aggggctctc aacgtttgtt ttgagggggc
      901 tgaggattta ccaggaggtg gcgggtcgaa atatggtaca agtgttaaaa gggagtttgc
      961 aatggatccg gcccgtgaca ttattctagc gtacatgcaa aatggggaga agttgagccc
     1021 ggatcatggg tatccggttc ggatgatcat acccgggttt attggtgggc ggatggttaa
     1081 gtggttgaag aggattattg tgactacaac agagtctgat aattattacc attacaagga
     1141 taatagggtc cttccctctc atgttgatgc tgaacttgcc aattctgaag cttggtggta
     1201 caaacaagag tacatcataa atgagctgaa cgttaactcg gtgataacat cgccgtgcca
     1261 cgaagaaatc ttgcctatca acgcctggac tacccaacgt ccttacacaa tgaggggtta
     1321 cgcttattct gggggaggaa gaaaagtgac ccgagtggaa gtgacaatgg atggtggaga
     1381 cacatgggac atatgtgaat tggaccacca agagagggga agcaaatatg gtaaattctg
     1441 gtgttggtgt ttctggtctt tagaggtcga agtactagac ttactcggag ccaaagagat
     1501 tggggtccgt gcttgggatg aaagcctcaa cacccaacct gaaaagctca tttggaatgt
     1561 catgggtatg atgaacaact gctggtttcg tgttaagacc aacgtgtgca agcctcacaa
     1621 gggagagatc ggtatcgtgt tcgaacaccc gacccaaccc ggtaacaagt cgggcgggtg
     1681 gatggcccgt gaacgccacc tcgagatctc cgattctggc ccaacattga agaggacagc
     1741 atccacccca ttcatgaaca ccacctcaaa gatgtactca atgtcagagg tgaagaaaca
     1801 caacactgct gactcagctt ggattgtagt ccatggaaat gtctacaacg cgacccggtt
     1861 cctaaaggat cacccgggtg ggtcagacag tatcctcatc aatgcaggaa ccgactgcac
     1921 cgaagagttc gatgcaattc actcagacaa ggccaagagg cttctagaag atttcaggat
     1981 aggtgagttg atctcaacgg gctacacctc tgactcgtcc tcgcccggta actcagtcca
     2041 tggcgggtcg gtttactccg ggttagccgg gttagcccca attacagaag cggttccatt
     2101 acgtaatgtt gctttaaacc cacgggttaa gatcccatgc aaacttattg agaaagtttc
     2161 actttcccat gacgtaagaa ggttccgatt cgggttaccc tctgaagacc aggtgttggg
     2221 tttacctgta ggaaaacaca tcttcctttg tgctaatgtt gatgataagc tttgtatgag
     2281 agcttatact ccatcaagta ccattgatgt ggttgggtac tttgatcttg tggttaaggt
     2341 ttacttcaag gatgttcacc ctaggttccc taacggtggg gtgatgtcac aacatctaga
     2401 ctcactttct ttgggctcga ttgttgacgt gaaaggccca ttgggccata ttgagtattt
     2461 aggaaagggg aatttcacag tccatggaaa gcccaagttt gctaaaaagc ttgctatgat
     2521 atcaggtggg accgggatta ccccgattta tcaggtgatg caggcaatac tgaaagatcc
     2581 tgaggataag actgagatgc atgttgtgta tgcaaatcgg acggaggagg atatcttgct
     2641 tagagaagag cttgataaat gggctgatga gtttagagat cgggttaaag tatggtacgt
     2701 tgttgaaaag gcggaggaag ggtggaaata tgataccggg tttatatctg agaagatact
     2761 tagagatcac gtgccggcgg ttggggatga tgtgttggca ctcacgtgcg gtcctcctcc
     2821 tatgattcaa tttgcggttc aacctaactt ggataagatg ggttttgata ttaaggaaca
     2881 gttgttgatt ttttgaaagt actgttacgt tgggaagtga atttactaca ctacccctga
     2941 agatatttgt gttacgtgca cattgcaata tcgacgtgtg taacacatgc gttgtgcttt
     3001 cgaatgtata gtagaagttg ggtaaaatgg agtatatgaa cactactata tgtattctat
     3061 gtatgtaatg taatttgtgg agctttttct tctttcttta cccggtgaaa ttatgtgaat
     3121 ataagtgctc cactagattg tttgttgata agctttttgt tgtccttatg aaaattggtt
     3181 gatggaggaa gaagtatgta gaaggaagaa attcaagtag ctaaggttta tgtattgtaa
     3241 gagggagatt actatttggt agtctccatt taccatatta ttat
//
LOCUS       SRGMDH       4638 bp ds-DNA             PLN       06-MAR-1990
DEFINITION  S.vulgare NADP malate dehydrogenase gene, compete cds.
ACCESSION   M31965
KEYWORDS    NADP malate dehydrogenase.
SOURCE      S.vulgare (haplotype 2n) 5 day old leaf mesophyll cell DNA.
  ORGANISM  Sorghum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 4638)
  AUTHORS   Luchetta,P., Cretin,C. and Gadal,P.
  TITLE     Structure and characterization of the sorghum vulgare gene encoding
            NADP-malate dehydrogenase
  JOURNAL   Gene (1900) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.Luchetta, 02-MAR-1990.
FEATURES       from  to/span     description
    mRNA        763      783     NADP-malate dehydrogenase, exon 1
                873      963     NADP-malate dehydrogenase, exon 2
               1461     1547     NADP-malate dehydrogenase, exon 3
               1643     1727     NADP-malate dehydrogenase, exon 4
               1984     2165     NADP-malate dehydrogenase, exon 5
               2420     2490     NADP-malate dehydrogenase, exon 6
               2570     2660     NADP-malate dehydrogenase, exon 7
               2749     2823     NADP-malate dehydrogenase, exon 8
               2912     3013     NADP-malate dehydrogenase, exon 9
               3100     3222     NADP-malate dehydrogenase, exon 10
               3303     3371     NADP-malate dehydrogenase, exon 11
               3456     3524     NADP-malate dehydrogenase, exon 12
               3619     3672     NADP-malate dehydrogenase, exon 13
               3765     4638     NADP-malate dehydrogenase, exon 14
    pre-msg     603     4638     NADP-malate dehydrogenase mRNA and introns
    IVS         784      872     NADP-malate dehydrogenase intron A
    IVS         964     1460     NADP-malate dehydrogenase intron A
    IVS        1548     1642     NADP-malate dehydrogenase intron A
    IVS        1728     1983     NADP-malate dehydrogenase intron A
    IVS        2166     2419     NADP-malate dehydrogenase intron A
    IVS        2491     2569     NADP-malate dehydrogenase intron A
    IVS        2661     2748     NADP-malate dehydrogenase intron A
    IVS        2824     2911     NADP-malate dehydrogenase intron A
    IVS        3014     3099     NADP-malate dehydrogenase intron A
    IVS        3223     3302     NADP-malate dehydrogenase intron A
    IVS        3372     3455     NADP-malate dehydrogenase intron A
    IVS        3525     3618     NADP-malate dehydrogenase intron A
    IVS        3673     3764     NADP-malate dehydrogenase intron A
BASE COUNT     1255 a   1044 c    993 g   1346 t
ORIGIN      
        1 aggcctagtc aaaaagtaaa tgtgattatt tacaaacatt gtctctacat tttgcatggc
       61 ttcaaaacca aagatcgcac acacttttgc caaaattttg ccaccgttgc taaccaatcg
      121 atgtgtaacg atgagaagtg ccatcacact ctctgaaaag actcaggccc tggcactggc
      181 aggaccgcag gaaaggcact gagcaaagat gatggtcgtg ggagaataaa caaccttttt
      241 ttttttttgc ctttgtaata aatgtccccg ccgtctctca cgtagtacgt agtttcggtg
      301 tggctttgtt ttaaaaggga accaaatcta ccaatgagtc tttttaaaaa gaagaaaaat
      361 actggtcagc aaaactgaga ttctccagca acgaaaatga aaaagacttg gttgtcagtg
      421 ttaaccacca ccaggatagc aggcgtgtgg acgtacaaga tgccgaacag tcgctgtcgc
      481 gcgacgtgtc ccgcgccggg cggtccgctc gccttgcctg ccgctaatcc aattcctacg
      541 cgcgctcctg ctcctgcgta caaccagaga ggctataatc tcctccactc cctcccctgt
      601 agctgcagcc cgttccactc ctatccatcc gtgctccagc tccaccatgg gcctctcaac
      661 ggcttactcc ccggtcggat cgcacctcgc gccggcgcct ctcgggcacc gtcgctccgc
      721 ccagctccac cgcccgcgcc gggccctgct cgccaccgtc cgatgctccg tcgacgccgc
      781 caagtgagta gtagtatact acccggacgt ggacatgaat gcgtgccatt tatcaactac
      841 taggctgact agccccctat acgttgatgc aggcaggtgc aggatggcgt ggcgacggcg
      901 gaggcgccgg cgacgcggaa ggactgcttc ggggtcttct gcaccaccta cgacctcaag
      961 gcggtgagtc cctcccttcg cctcaggcgg tggtggtcct ccccggttcc cggtccggcc
     1021 tccccgtgtg tgttcgctga aatgctccat cgacttttgc cttttgcttc aaactcaaag
     1081 tagaatccct tttccaccta aagctagcaa cgttgctttt ctccaaaaga gaaaaaaaaa
     1141 gaaaaagcta gcacgtcgtt ggtgtcatgg tgtggaccgt tggaggtgtg gtgtggatat
     1201 atgctacgat actggatagc ttgcactcat atcaacagct ggcgttatgc ctctataaat
     1261 aaaaataaat atcatacaga ccaccaaaac caaagcgcag ggtggagtgc cacgcatgag
     1321 acgctgcact gcacgccact gctgcttgac ttaatttggt ttctaattaa cttgctgctg
     1381 tatatatcgt catatagtag tacagtacta acaatctgtc atcctatgcc attgaatgga
     1441 ttgggagttg ggacacacag gaagacaaga ccaagtcgtg gaagaagcta gtgaacattg
     1501 ctgtgtcagg cgcggctggg atgatatcga accacctgct gttcaaagta actaaatcgt
     1561 ttactgactc tacttgtttc ctatcttgag ttgcagtatt gtaatgctga ccattcattt
     1621 tattaacaca aaaaaaattc agcttgcctc tggtgaggtt ttcggacaag accaaccaat
     1681 agcactgaag ttacttggtt cagagagatc gtttcaagct ctcgaaggtg aatgactgac
     1741 tgcctcttct tctttctgcc ttgctatctg agaagaagaa tcaattcata ttttccactc
     1801 actctgttcc aaaagctaaa atgccttata tataatttgg aacaggggga gtacatacgt
     1861 atagctttag tttcaagctg cccttacagc ttccagtgtt gtactctctt atatcaatgg
     1921 attactgtga cagtgaatat ttatcacttc ttcctcatgt gctctcaaaa actgcaactg
     1981 taggtgttgc tatggaactg gaggactcgc tatatccatt gctgagggaa gtcagcattg
     2041 gtatagatcc ttatgaggtc tttgaagatg tagattgggc ccttcttatt ggtgctaagc
     2101 cccgaggtcc tggcatggag cgagctgcgt tactggatat caatggtcaa atctttgctg
     2161 atcaggtttc ttttgcactc tgttcctcct ttcttctcaa gtactatgtc accaaattgt
     2221 aagaagtcct ctcaggaaag gtgccaagaa gcattttctg aaacttcaac ctcaaatttt
     2281 aaggattcct aagcattaga acctgcaaat tgccaaaacc agaaaccaag tacaaatttg
     2341 agtcaaagcc cacctaaata aacgaccaga gtaacctgtt attcttgtta ttatctgaat
     2401 tttttttaaa ttgtgtcagg ggaaagcact taatgccgtg gcctcgaaga acgtgaaagt
     2461 cttagttgtt ggaaatccct gtaacactaa gtacggacat tgattgttca ccaagattaa
     2521 cgattttatg tcatcaccat gtcttagttg tggcttgccc ttgatgcagt gcgttaattt
     2581 gcttgaaaaa tgctccagac ataccagcaa aaaattttca tgcactgacg aggttggatg
     2641 aaaatagagc aaagtgccag gtgactgacg ttgaacctta ttatgtctcg tccttccatt
     2701 ccaaacagga aaacagatta agaaatcaca cacaatctcc ttttgtagct agcactcaaa
     2761 gcaggtgtat tttatgacaa agtatcaaac gtgactattt gggggaacca ttcgacaact
     2821 caggtgaagt tttgcattat aaattaaatg agcagactat tttgcagaca catttcttcg
     2881 ctgaatggtt ggcactaact tctctgtgta ggttcctgat ttcttgaatg ccaaaattga
     2941 tgggagacca gtgaaagaag tcattaagga taccaagtgg ttagaagaag agttcaccat
     3001 tacggttcaa aaggtatgac aatccatttg gagtgtctgt tactcggata aataaacaag
     3061 gttgtgacca taattcgttt cattcttata tctaaacagc gtggaggtgc gctcatccaa
     3121 aaatggggca gatcttcagc tgcatcaacc gctgtatcaa tagcggatgc tattaaatcc
     3181 ctggtaactc ctaccccaga aggcgactgg ttctccacag gggtaagttt tgatgccttt
     3241 gaattgatca cagctttatt tcggcattcc tctaatggaa aaaagtaata aaaaaaacac
     3301 aggtttatac gactggaaat ccttatggca tagcagagga tatcgtgttc agcatgccat
     3361 gcagatcgaa ggtatgatct actaatcaaa ttgtccttat tatttttgtc ctaagataag
     3421 gtaccactcc aatgatcgta aaattcttcc aacagggtga tggtgattac gaactagcta
     3481 ctgatgtgtc aatggacgat tttctctggg aacggattaa aaaggtaagg gctataggaa
     3541 ctgccatcac tgatatgctg agaatataag cagcactaga gattttaccc taacagttgg
     3601 ccctgtacat attttcagag tgaagctgaa ttgcttgctg agaagaaatg cgttgcccat
     3661 cttactggag aggtatgtca cagtgacgta ttttatttag cttaaaaaga gagaaaccgt
     3721 gccggaaagg attggcttag ctttttctgt ttgtgcaaat gcaggggaat gcatattgtg
     3781 atgttccgga ggataccatg ctaccgggag aagtgtagtt tagttacaga gaagcaatag
     3841 caggggaatg catttgtcta tactatagct gagactcgag agaaacagct gcttgcaaga
     3901 tgcagcaact acttttactt atatatatat attgtttata ttctgtaccc gctcagagcg
     3961 ttgtgtgcac acgctgtgtg attccgggac cgtctttacg atcgttttgc cttttttggg
     4021 ccataccgct tcgctcccgt gttgcgcttc cattattgag ttattaccac tctgttttat
     4081 aatatttatt actgccgaga ttttgtgtag atattacaat cttgtgtttg gttgtgtctc
     4141 tgctcatacg caagtgctgc tgcaactctt gtttttattc acatttagtt acaactgtgt
     4201 tattttgtag taacgacgtg ttatatgtgc tgttacaatc gtattgtaac tctattcttt
     4261 tcctcatttc tccttttact gtacttattt aacagaatta gtcgattttc agtcacagaa
     4321 taatattttc tctcacaaca aatcaacatc aatcagtgtc gtcgaatctc tacctcgtgg
     4381 agccatggac cgttgtctcg tgggcaatcc caagtgagcc ctatatgagg cagttctttc
     4441 gctaatcatt tttaaaacat ataccacaag acccatacac cgattatgtt tgaaactttc
     4501 aatgctagca acttcatcga catacagggc tcacacgtta gtaaaaaaat gtgaattaag
     4561 cactgtctct gcattttgca tggcttcaaa caaaagatct cgcactttga ctacatttcg
     4621 ccactgctgc taaggctt
//
LOCUS       TRBHSRP      2378 bp ds-DNA             INV       15-FEB-1990
DEFINITION  T.brucei heat shock-related protein gene, complete cds.
ACCESSION   M32139
KEYWORDS    heat shock-related protein.
SOURCE      T.brucei (strain 427-60) DNA, clone H8.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 2378)
  AUTHORS   Lee,M.G.-S., Polvere,R.I. and Van der Ploeg,L.H.T.
  TITLE     Evidence for segmental gene conversion between a cognate hsp 70
            gene and the temperature sensitively transcribed hsp 70 genes of
            Trypanosoma brucei
  JOURNAL   Mol. Biochem. Parasitol. (1900) In press
  STANDARD  full staff_entry
COMMENT     The conserved domains are 100 % homologous to the corresponding
            regions of Hsp70 genes 2-6 of T.brucei.
            
            Draft entry and computer-readable sequence [1] kindly submitted by
            L.T.H.Van der Ploeg, 16-FEB-1990.
FEATURES       from  to/span     description
    pept        331     2361     heat shock-related protein
    site        301      463     conserved domain
    site        532      613     conserved domain
    site        732      920     conserved domain
    site        985     1028     conserved domain
BASE COUNT      575 a    535 c    717 g    551 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttcctt tattttcttt ttccttttga gctaactcta cctccgtagc cgccagcgtt
       61 ctcaatgcag ttttgcgtgt aagtcacttg cctcttggcc aagtagactc ggtatgcaac
      121 accaggagcg ccgcacgcat aacaatttgc gtttacgcgt ctacgttagt gtgcagatgt
      181 tatgatactt ttgttttaac tttgctccgc gtccttggcg ttggtattgt tgcaatcttt
      241 catcggccag tgcctgtata tttcaatatc ctcttgcttt tcatccttgc tactgcctct
      301 gaaaacaagg aatatctgcc tctttgaagg atgacatacg aaggcgccat tggtatcgac
      361 ctcggtacga cgtactcgtg cgttggtgtg tggcagaacg aacgtgtgga aatcattgca
      421 aatgaccagg gtaaccgtac gacgccgtcc tacgttgctt tcgtcaataa tgaggttctt
      481 gttggcgatg cggcgaagag ccacgccgct cgtggctcga atggtgttat attcgacgcg
      541 aaacggctga tcggacggaa gttctccgac tctgtcgtgc agtccgacat gaagcactgg
      601 cccttcaagg tcgaggaggg cgagaaaggt ggtgctgtga tgagggtaga gcatcttgga
      661 gaggggatgt tactgcagcc cgagcaaata tctgcccgtg tccttgcgta cctgaagtct
      721 tgtgcggagt cgtatcttgg gaagcaggtg gctaaggctg tcgtaacagt gcctgcatac
      781 ttcaacgact cgcagcgcca ggcaacgaag gacgcgggga cgattgctgg tctggaggtg
      841 ctgcgcatca tcaacgagcc aacggctgct gcaattgcct atggcctgga caaggctgat
      901 gaggggaagg aacgcaacgt tttggtgttt gactttggag gtggaacgtt tgacgtctca
      961 attatttcag taagcggcgg tgttttcgag gtgaaagcca caaatggtga cacccacctt
     1021 ggtggagagg atgtggacgc cgcactgctg gagcacgcac ttgctgatat aaggaaccgt
     1081 tatggaatag aacagggctc tctgtcccaa aagatgctat ccaagctgcg ttcacggtgt
     1141 gaggaagtga agcgagtact gtcacactcg acagtggggg aaattgcact cgacgggctg
     1201 cttcccgacg gcgaagaata cgtgctcaag ctaacgcgag cgaggttgga agagctatgc
     1261 acgaaaattt ttgctcgctg tttgagtgtg gtacagaggg cgttgaagga cgcttcaatg
     1321 aaggttgagg atattgagga tgttgtgctt gtcgggggaa gttctcggat tcctgctgtg
     1381 caggctcagt taagggagct attcagaggt aagcaactct gcagcagtgt gcaccctgac
     1441 gaggcagtgg cgtatggggc tgcatggcag gcgcacgtac tttccggtgg atatggggag
     1501 agctctagga cagcaggtat agtgcttctg gatgttgtgc ccctttcaat cggcgttgaa
     1561 gtggacgacg gtaagtttga tgtgataatt cgtcggaaca caaccattcc ataccttgcg
     1621 accaaggagt acagcactgt ggatgacaat cagagtgaag tggagataca ggttttcgaa
     1681 ggtgagcgcc ccctcacgcg tcacaaccac aggctaggct cctttgttct tgacggtatt
     1741 accccggcaa agcacggtga accgactatc accgttactt ttagtgttga cgccgacggg
     1801 atcctgaccg ttacagccgc agaagaactc gggagcgtaa caaaaacact tgttgtggag
     1861 aactcagagc gacttaccag tgaggaagta cagaagatga ttgaagttgc acaaaagttt
     1921 gccttaacgg acgccaccgc tctggcgagg atggaggcca ccgagaggct cacgcagtgg
     1981 tttgaccgac tcgaagctgt aatggagacc gtcccgcaac catactcaga gaagcttcaa
     2041 aagcgtattg cctttcttcc gcacgggaaa gagtgggtcg gcacacagct tcatacgtac
     2101 actgacgcgg cgtcgataga ggcgaaagta gcgaagattg aacggcttgc aaagagggcc
     2161 ctgaaatcgg cgcgtcgtga aggaaaggat ggatgggcac cgggcaatga ggacaatgga
     2221 agcggtgatg acaacgacgg tgacgataac agtgatgagg aggacgagct ccaaaggggg
     2281 aggggcgtga cagagggatc ggggaggtcc ccgataagga aacgcgaccg catagaggct
     2341 atcaacgcaa acacagagta agtcgtagag gtgtctaa
//
LOCUS       YSCALG1      2011 bp ds-DNA             PLN       06-MAR-1990
DEFINITION  S.cerevisiae beta-mannosyltransferase (alg1) gene, complete cds.
ACCESSION   J05416
KEYWORDS    beta-mannosyltransferase.
SOURCE      S.cerevisiae DNA.
REFERENCE   1  (bases 1 to 2011)
  AUTHORS   Albright,C.F. and Robbins,P.W.
  TITLE     The sequence and transcript heterogeneity of the yeast gene ALG1,
            an essential mannosyltransferase involved in N-glycosylation
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.F.Albright 06-MAR-1990.
FEATURES       from  to/span     description
    pept        115      339     upstream ORF
    pept        379     1728     beta-mannosyltransferase (alg1)
    mRNA        102     1826     alg1 mRNA (alt., both ends approx.)
    mRNA        203     1826     alg1 mRNA (alt., both ends approx.)
BASE COUNT      639 a    341 c    390 g    641 t
ORIGIN      Chromosome II; 2.1 map units from Lys2.
        1 gggcaataaa aaattctaag aatagaatca tatcacagta cacgaaaata taaatacatc
       61 ttgaaaatgt ctcaaattgg gtggaaaaga gggtgcttga attcattagg acatatggtt
      121 gatgttctca ggttctactt atgtttgctt tgtcgatttt tgcatgcttt gactgtgact
      181 ttcctgtctg atatctttgt atggcttgtt gctaaaacaa ggtcaatcca agcagtgatc
      241 atcttgcacg ttgcttcaat agaacgcgca tatagcaatc atcaagttaa ttggtcatat
      301 atctttcaat cagcaattag caaagccatt agagggtaat tgtaacggca ttttaggggc
      361 tatagtcaat tggtgaaaat gtttttggaa attcctcggt ggttacttgc cttaataata
      421 ttataccttt ccataccgtt agtggtttat tatgttatac cctacttgtt ttatggcaac
      481 aagtcgacca aaaaaaggat catcatattt gtgctgggtg atgtaggaca ctctccaagg
      541 atatgctatc acgctataag tttcagtaag ttaggttggc aagtcgagct atgcggttat
      601 gtggaggaca ctctacccaa aattatttcc agtgatccaa atatcaccgt ccatcatatg
      661 tcaaacttga aaagaaaggg aggcggaaca tcagttatat ttatggtaaa gaaggtgctt
      721 tttcaagttt taagtatttt caaattactt tgggaattga gaggaagcga ttacatacta
      781 gttcaaaatc caccgagcat acccattctt ccgattgctg tgctatacaa gttgaccggt
      841 tgtaaactaa ttattgattg gcacaatcta gcatattcga tattgcaact aaaatttaaa
      901 ggaaactttt accatccttt agtgttgata tcttacatgg tagagatgat attcagcaaa
      961 tttgctgatt ataacttgac tgttactgaa gcaatgagga aatatttaat tcaaagcttt
     1021 cacttgaatc caaagagatg tgctgttctc tacgaccgcc cggcttccca atttcaacct
     1081 ttggcaggtg acatttctcg tcaaaaagcc ctaactacca aagcctttat aaagaattat
     1141 attcgcgatg attttgatac agaaaaaggc gataaaatta ttgtgacttc aacatcattc
     1201 acccctgatg aagatattgg tattttatta ggtgccctaa agatttacga aaactcttat
     1261 gtcaaatttg attcaagttt gcctaagatc ttgtgtttta taacgggtaa aggaccacta
     1321 aaggagaaat atatgaagca agtagaagaa tatgactgga agcgctgtca aatcgaattt
     1381 gtgtggttgt cagcagagga ttacccaaag ttattacaat tatgcgatta cggagtttcc
     1441 ctgcatactt caagttcagg gttggacctg ccaatgaaaa ttttagatat gtttggctca
     1501 ggtcttcctg ttattgcaat gaactatcca gtgcttgacg aattagtaca acacaatgta
     1561 aatgggttaa aatttgttga tagaagggag cttcatgaat ctctgatttt tgctatgaaa
     1621 gatgctgatt tataccaaaa attgaagaaa aatgtaacgc aggaagctga gaacagatgg
     1681 caatcaaatt gggaacgaac aatgagagat ttgaagctaa ttcattgagt caatggtaac
     1741 tcagcctttc ttttttgaaa attactattt tcgactcttt tttatacagt tacatagtac
     1801 tacctctaat acacattcat gattaacaat gtttcaaaca atataaagtc ccgataacga
     1861 ccttttgaag tggtgacgtt accgctcttc gttgacaaga ttcaagaggg ctgtcagaat
     1921 aacagctatc atggtggaaa gtaggtgacc ctttgaacaa ggcagcatat atccatcgac
     1981 tatcatgttc ctaaaatgtt atcatagccg g
//
LOCUS       YSCHEM15     2916 bp ds-DNA             PLN       07-MAR-1990
DEFINITION  S.cerevisiae ferrochelatase (HEM15) gene, complete cds,
            and Val-tRNA gene.
ACCESSION   J05395
KEYWORDS    delta repetitive element; ferrochelatase; protoheme ferro-lyase;
            transfer RNA-Val.
SOURCE      S.cerevisiae (strain FL200) DNA, clones pHEM15-[1,2].
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2916)
  AUTHORS   Labbe-Bois,R.
  TITLE     The ferrochelatase from Saccharomyces cerevisiae: Sequence,
            disruption and expression of its structural gene HEM15
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Lobbe-Bois, 06-MAR-1990.
FEATURES       from  to/span     description
    pept        841     2022     ferrochelatase precursor (EC 4.99.1.1)
    sigp        841      933     ferrochelatase signal peptide
    matp        934     2019     ferrochelatase
    tRNA       2253     2325     Val-tRNA
    rpt        2435     2765     variant solo delta element
    rpt        2776     2888     truncated delta-element
BASE COUNT      907 a    563 c    527 g    918 t      1 others
ORIGIN      
        1 aaaatagcat tcagtgggaa cgatcccaat aagcagatag catatcgcag tgtaaaacta
       61 tcaatcccat agttggtaat tatctttgtt aaaacagcgt ccacaggatt gtacatgttg
      121 gaagcgatca gcagcagaat taaatcacca cggtttgtct tggcgtatng gtttgtcttc
      181 ttgcttgacc tgtgcctgta ttttgtcctt ggctttaaat ctgtttaata agaacccttt
      241 atttagatga ccattgtttt cgacggtttg tggtttcgac gcgcaatgtg aaagatcttc
      301 aatgttttgt tgaacagtaa agcaaaacag ctcaacagca ggacaaaaaa agctaaaaaa
      361 gaactcttcg cacttttaac aagcagtttt actgaacaat tttagaagta aatttcagga
      421 aggttcaaca gctttacatg aaagaggatg ctggccattt ttttttttta aaaaagtttt
      481 ttatatacac aaactcagta aaataattac aaatatgtag catgtgtagg atgccttgaa
      541 acatctccgc tggcaatagc gttagacatg cccaatcttc aatggtcacc gtaagcctta
      601 tatgacgaag gagttgctgc ccttcgtttt ctcatgctgc tttttttcgt aggggatatg
      661 ggattgaaga aggaggacgc gaaggaatac ttttggtcgg tcaaggtcac tcttggcttg
      721 acaccaaaca agcgtaaatt ctcgtgttcc acttgagtcg gtttacaaga agtacagaaa
      781 tatttaacac atacctgcta tttggacaaa tttagcagta aaaaaaagaa cgtttaaaga
      841 atgctttcca gaacaatccg tacacaaggt tccttcctaa gaagatcaca actgaccatt
      901 acaagatcat tttcggttac attcaacatg cagaatgcac aaaagagatc acccacagga
      961 attgttttga tgaacatggg tggcccctct aaagttgagg aaacatatga ttttttgtat
     1021 caattatttg ccgataatga cctaattccc attagtgcta agtatcagaa gacaattgct
     1081 aaatatattg ctaagtttcg tacccccaag atagagaagc aatataggga aattggtggg
     1141 ggctccccaa tccggaaatg gtctgagtat caagccactg aggtctgtaa aatcttagat
     1201 aaaacctgtc cagaaacggc gcctcataag ccttacgtgg cgtttcgtta cgcaaagccg
     1261 ctaaccgcag aaacttataa acaaatgcta aaagatggcg tgaagaaggc agtggccttt
     1321 tctcaatatc ctcatttctc ttattccact accgggtcat ccatcaacga attgtggaga
     1381 cagattaagg cattggactc cgagagatct atatcttggt cggttattga tcgttggcct
     1441 acaaatgaag gtctaatcaa ggccttctcc gaaaatatca ccaaaaaact acaagagttt
     1501 ccgcaacctg tcagagacaa ggttgtttta ttgttttccg cacattctct acccatggat
     1561 gttgttaaca ccggtgatgc ctacccagct gaggtagctg cgacggttta caacatcatg
     1621 caaaaattaa agtttaaaaa cccttatagg ttggtttggc aatcccaagt tggaccaaaa
     1681 ccatggttgg gagcgcagac agctgaaatt gcggaatttt taggccccaa agttgatggc
     1741 ctaatgttta ttcctatcgc ctttacctct gatcatattg aaacattgca tgaaattgac
     1801 ttaggcgtca ttggggaatc ggaatataag gataaattta agagatgcga atctttaaat
     1861 ggcaaccaga cctttattga aggcatggca gatctcgtca aaagccactt acagagtaac
     1921 caactctatt ctaatcaact acctcttgat tttgcacttg gcaagtccaa tgatcctgta
     1981 aaggaccttt cattggtatt tggcaatcac gaatctactt gataagggcc attcatccca
     2041 caatctcaat atcagtatat tctcttattt attaattgag gttgcgtatt tatgttttct
     2101 ttgtggtata taatatatca ttatttattg cattttaaaa cttttttttt cattattttt
     2161 ctctgttgaa tgcagtcaac atctccctta atattcgata ctcatcttta atactatcag
     2221 caaaaagttt aagataaaat attcttaaaa acgatttcgc ccaggatcga actggggacg
     2281 ttctgcgtgt taagcagatg ccataaccga ctagaccacg aaaccactat ttgttgaaat
     2341 ctttaacaag cgaggaaact ttagtgaaat attaaaagga ctaaaccgga ccggttcttc
     2401 tatttctaca tatctaatca cccaatgtag tttttgaata tttgtgaata gttggataat
     2461 tgttgggatc ccattgtgca caaaggccat aatattatgt ctacagaata tactagatgt
     2521 cctccctata ggatatagta atcctctaaa tggaaccgat atttctacat aataatatta
     2581 cgattattcc tccttccgtt ttatatgttt cattatccta gcacactatc aatctttgca
     2641 tttcagcttc cattagattt gatgactatt tctcaatctt tatgttatct ccttacgccg
     2701 catgtgataa tatactgcta gtatgactac tagttgatag aagatagttg atttttactc
     2761 caacaaaagt aacaatatta tttagaacta tagattccat tttgtgcatt cccatattct
     2821 cgaggaaaac ttttagtata ttctgtagac ataatattat cgcctttgtg aacaatagaa
     2881 tcccaacaat tgtcgcaaat ttaccaattt tctaga
//
LOCUS       YSCOSCP      1451 bp ds-DNA             PLN       05-MAR-1990
DEFINITION  S.cerevisiae oligomycin sensitivity conferring protein (OSCP) gene,
            complete cds.
ACCESSION   M32487
KEYWORDS    oligomycin sensitivity conferring protein.
SOURCE      S.cerevisiae (strain D-273-10B, ATCC 25657) DNA, clone 62-17.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1451)
  AUTHORS   Misook,U., Jones,D. and Mueller,D.M.
  JOURNAL   Unpublished (1990)Univ. of Health Sci., North Chicago, IL 60064
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            U.Misook, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        498     1136     oligomycin sensitivity conferring protein
                                 (OSCP)
    mRNA        429  >  1136     OSCP mRNA (alt.)
    mRNA        430  >  1136     OSCP mRNA (alt.)
BASE COUNT      484 a    245 c    273 g    449 t
ORIGIN      
        1 agatctggct gcaaacctgg tctacataaa tgtagtacat tataagggac taccaagaaa
       61 tacgaagcgc tcagatatct cttaaagttc atttttttgc cggaataatc ttggtttagt
      121 aacttataaa actacttacg tccaataaat cgttatattt tatgccgtcg tcataaagtg
      181 gacactttcg agagacggtg ctggccaata agatgctgat atttgaaata tcatttaccc
      241 gggcccgggt gggaaaaaca aggaattaaa gataatgaag tgatcttttg gcgagatagt
      301 aaaactttca attatctagg aacaatttca aacgccaatt gttaactttg tgtacagtat
      361 ttctagtttt tatctcattg aaaacaaaat tcagtcaact tggaaattta agcaggctac
      421 tgtggcattg agatattttt ccgcagagta agaaccgatt agcaagttac cgcattattt
      481 ttaaaccgtg gacaatcatg tttaatagag tctttaccag gtcatttgca tcaagcttaa
      541 gagctgctgc ttccaaagct gctgctcccc caccggtgag attgttcggt gttgagggca
      601 cttacgcaac agctttatat caggccgctg caaagaactc ctctattgat gctgcctttc
      661 aatctttgca aaaagttgaa agcacagtca aaaagaatcc aaaattaggt cacttattac
      721 tgaatcctgc attgtcattg aaggacagaa attctgtcat tgatgccatc gtggagactc
      781 acaagaattt ggatggttat gtcgttaacc tattgaaggt tctttccgaa aataacagac
      841 tgggatgttt tgaaaaaatt gcgtctgact ttggcgtttt aaacgatgct cacaacgggt
      901 tacttaaagg tacagtgact agtgctgaac cattggatcc taaatctttc aagaggattg
      961 aaaaagcttt gagcgcctcc aaattggtgg ggcaaggtaa atctttgaag ctggaaaacg
     1021 tggttaagcc agagattaag ggtggtttga ttgtagaact tggtgacaag actgttgact
     1081 taagcatttc tacaaagatt caaaaactga ataaggtctt agaggacagc atttaacgtt
     1141 aataatatag ttggatatgc tcaactcgtt tgatgttact ttctttgcct ccctcctaat
     1201 ttttgattct gttcaaatat atattctttt ccgtgaattt ttcctaaaca tcaaaattac
     1261 gtacatatac gaataaataa gaaacccctc taaaaagtga atttttttgt tcgagacaac
     1321 cattttaatc atatatatgt tcgtgtatcc aggcaaactt atattaatcg caaaaaaaaa
     1381 aattataagt gtatatatgg aataaacata aaaaataata aaagaagaaa aaaaaaaata
     1441 atgaagaatt c
//

GenBank-Updates@genbank.bio.net (04/13/90)

LOCUS       CHKMIM1      5818 bp ds-DNA             VRT       25-OCT-1989
DEFINITION  Chicken myeloid protein-1 (mim-1) gene, complete cds.
ACCESSION   M29448
KEYWORDS    myeloid protein-1.
SOURCE      Chicken (strain SPAFAS) hatchling myeloblast DNA, clone 35B-J612.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 445 to 704)
  AUTHORS   Ness,S.A., Marknell,A. and Graf,T.
  TITLE     The v-myb oncogene product binds to and activates the promyelocyte-
            specific mim-1 gene
  JOURNAL   Cell 59, 1115-1125 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 444; 705 to 5818)
  AUTHORS   Ness,S.A.
  JOURNAL   Unpublished (1989) EMBL pf 10.2209, 6900 Heidelberg 1, Germany
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.A.Ness, 25-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept        713      758     myeloid protein-1, exon 1
               1313     1445     myeloid protein-1, exon 2
               2411     2562     myeloid protein-1, exon 3
               3069     3245     myeloid protein-1, exon 4
               3992     4109     myeloid protein-1, exon 5
               4628     4782     myeloid protein-1, exon 6
               5316     5515     myeloid protein-1, exon 7
    pre-msg     687     5546     mim-1 mRNA and introns
    IVS         759     1312     mim-1 intron A
    IVS        1446     2410     mim-1 intron B
    IVS        2563     3068     mim-1 intron C
    IVS        3246     3991     mim-1 intron D
    IVS        4110     4627     mim-1 intron E
    IVS        4783     5315     mim-1 intron F
    signal      445      704     promoter sequence
    signal      625      621 (c) CAAT box
    signal      659      664     TATA box
    signal     5526     5531     poly-A signal
BASE COUNT     1785 a   1235 c   1132 g   1666 t
ORIGIN      
        1 aactgtggct tggcaccatc catttcagaa tctgagaggt tttaacaaga tttctctgtg
       61 aaaaaaacag ggttccagag aaaagactcc atggattgat gtccagacat ggtgaaaata
      121 tcagacgaac agaaacagtc tatgtcaaga gcacctgcac tcttagccta agtaatatgt
      181 gtactaaaca aacattacac tttcagtgtc ttctgtcgtg tcgtattgaa gccataatgg
      241 gcacatacag ctggggagga ggatctggaa ccagcatgag aaagacaact ggtgatgccc
      301 acaacagaca acacatggag ttttgcccat cttttcccac cagctgaatc ctctggatcc
      361 aattcttgtt tcctatcatt tttggcctaa gcgtgtcctt gcttggggaa ggagaagaaa
      421 aagcgttaag aaatggttca accaaagctt cctgaaaagt gggattgttc atttccatct
      481 gttaatgtga gctaagacac ccgttacttt acctgtcttt cccaaccagc tctaaaaaac
      541 cgttataatg tacagatatc tttagtgcct aatatggcaa ggctgtctgc tgcggcgctg
      601 taacacttgg ctcaggactg attggccaac acaacagtcc cacactttgg aaaacacata
      661 aatactacag agagctactg taggacgtct ttgactcgct caaaggcacg agatgccagc
      721 cctcagcctg atcgccctgc tcagcctggt gtccactggt gagttgggac tgctttggtg
      781 tggtttgtct cgagtctgta cagacagaca ctaattaata actgaaatta atttcagcat
      841 aaagctaagc tctttaaatg tgtttttatg atgaacatat gtaatgtgcg ctagtatata
      901 actattaaca aaaatacaaa gggtgtgttt tacagataga gctatcctga agtgctttgg
      961 tttcatggaa aaagtagttg gcattatagc agaaactaat attagaaatc tgtgtctaca
     1021 gctttacttc cttttactgt atgtaccttt atcaaggcac aaggcttcca ggaaagcagt
     1081 caaatatgca taacaggttg ttcagtgttc ttacagctgc agtttggggc attatgaagg
     1141 cccacaaacc atcatttacc tgtaattaag gaggaaacct ctcctgtgag gcagaagtca
     1201 cgacacgtgt cttccatctt aaacattgca ttttgcagct taccactttc agttgcacaa
     1261 cactaacgca ttcctatttc tattttcctg ttttcctctt ctgtatgggc agcttttgcc
     1321 aggcagtggg aggtgcaccc tccgcagcag cagggcaggc actgggcgca gatatgcagc
     1381 gggaatcctt tcaatagaat ccggggctgt gacagatacg gctgcggcaa ttacggagcc
     1441 agcaggtaac agggcaggca gcaggggcct ggagcaggaa tgcatcatgg ctttcctaca
     1501 ccaaatgttc acaaagcaag aatgagatat aaccaacttt taaaaacgaa aggcttgcta
     1561 atattttgcc ttctttctcc aagatttaag agcaatacac aaaatcacca gtcgtgcctt
     1621 tcatataaca ttcattgata tttctgctgt actttaaatg aaagagagct tctctatgtt
     1681 ttctcttact cactttgaga cattactaag caatctcaca gaaaaaaata tcagtgtaat
     1741 ttagactaac agatgtatgg gtaaagagta aagaagagag accactacat aaactagcag
     1801 gctgagtagg aaccgcaatg gaaagcattc caaggactag tgtaattcat ccttggttct
     1861 tatagacaag acacttaact agcaatcatc aatagagctt ataggaagct ctacaaatta
     1921 ctttacacaa gctaaagatc tgcagacacc actgcctctg cataggcacc ttataacaga
     1981 gaaggttatg ctgtcctaac aaatgctccc aggcccttcc agctcaggat attctctgat
     2041 tctatgatct ctactacaga acttaagaat tgctaagtga ggaaaatgcc aaatatacat
     2101 ttctatttga catggcacag atgcattctg gggtgggaac aggggtactg tgatggtctg
     2161 gcgtcctatg aatttgggag aattaaaaac attcattaaa gtttgaaatt ctgcccccat
     2221 ttaaagtggg tcatactaat acaaacacgt tgcaacaaaa atacgactga aactagcttc
     2281 tccccactag cttctcccaa tttcactgca gatgtaaaac cttacgtttt aagagaagct
     2341 tcatatgtag tgctacgtga tctatctatt aggcctgaca gattgcataa ttacttaatt
     2401 atgtcttcag acagggtaaa ggagaaaagc acaagggcgt ggatgtcatc tgcactgatg
     2461 gatcaatagt gtacgctccc ttcagcggcc agctctccgg acccattcga ttctttcata
     2521 atggaaatgc cattgatgat ggagtccaaa tctctgggtc aggtaagtaa taacagcaca
     2581 caggtcagaa ggcaggcgct acagatctta tttacctgag taagatattt ctgtgcctct
     2641 tgaagtcaag cagattgctg atttagtctc tctagttctt atactgaagc acaactacac
     2701 tgctgctcag aaataccaaa agctccagca ggatacagcc tttttcttag agagaacaga
     2761 agaagggagt cgaaaaacac atagcagaaa aacgactgca agtcaaacaa aggaaacagt
     2821 aaaaacaaag catccgaatc ctcagttttg atgagtggat attttcctgc ctctgttttc
     2881 caacttcctt tttttccttt ttcctatttt ctatagggat gtgatactcg caagattgtt
     2941 ggtaccttga ctccatttca gcttcaccac tgtagccccc agtaacagcc cagtgaggaa
     3001 gatttcttgc actctgcctc aacagatgca gctggtgttt cctgaccttc ctctcccatc
     3061 ttttccaggt tactgtgtaa aactggtctg cattcatccc atcagatacc acggccaaat
     3121 ccagaaaggg caacaacttg gaagaatgct gccaatgcaa aaagtgtttc ctggcattgt
     3181 gtctcacatt cacgtcgaga actgcgacca gtctgatcct actcatctcc tcagacctat
     3241 acctggtaaa aaaaacatgc aacaaacatt tgagttttat catgaatgaa aatatgttta
     3301 attgcaaata acaataatat taaacaatag gagtctcctc tgtcataggg tctcctaaag
     3361 agcaacctcc aaaagagaga agacatgcat tatagcataa caggaccaca gaaaggaagg
     3421 aacatgtatt agcttactga gctttatttc tcacccctca ctgccctccc atggcacagc
     3481 agcatcaagt tccagcaaca gagtcagatg tctcttttga tccttctaac taccagcaat
     3541 gtttccaata ttacatctta actagatcaa actttaaatt tgcacttgaa aacaatcatc
     3601 ctaaaggttc tggtgatggt ttaggactta aggactttga ttccattagt gtaggcacag
     3661 aataagaaaa atccaaatcc taccccctca gtttacactc tggtttataa ctgtgataat
     3721 cctgtttttg tgtttttttt cctctttgaa gcgtgttctt tatatcagct caccgtaatc
     3781 acaaagcatt aagatgtagt ttgaggcact tgttccactt atgtttccaa aggggtgaca
     3841 aattttatgg agaatcaaac accaactgaa catgcaagat gtaattccag catgtttctt
     3901 cctaactaaa atgaaaatgt aatagttgca aatatgaaat acattaaaaa aaaccaacac
     3961 taccttcatt tcctgtttct gtatttcata gatatttcac caccattccc acaacaagat
     4021 gctcactggg ctgtagtatg tgctgggaat cctacaaatg agataagagg ctgtgataaa
     4081 tatggctgtg gatactttgg agctccaagg tactatactt gaacacatgg gggttttatt
     4141 attgctctgt tggaatagac ttgaactctg tttaagctct gaaagtttag gttgaacata
     4201 agggaacact tactacagag gtaatgcagc ataaagaaca ggctgccaat gttgttcatg
     4261 aaaccgtcat acttggaggg tgttgggtta gacaaagtca tgcctgacct ggtgttggga
     4321 acaatacaag ctgaaggtag ggctaaacgc cctccacagg taccttccaa ccagtacttc
     4381 tccagctcta gagttctatt gcaaagctag ggagaaatac atatccaccc acacaattcc
     4441 caaaagctgt aaatctagca gactatataa cattacctta cctaccaaaa cagaatttct
     4501 acacattggt caatcatggc agcaccatct tagtactact aacctgtttt tagaacttgc
     4561 taagcgacct ctctctgcat aacgtaaggt ctggaagaga cttctctcat tacatatttg
     4621 tttccaggcg caatggtaaa ggagagaagc acaagggcgt ggatgtcatc tgtgctgatg
     4681 gtgcaacagt gtatgctccc ttttctggtg agctgtctgg accagttaaa ttctttcata
     4741 atggaaacgc cattgatgat ggagtccaaa tcaggggatc aggtataaag ctattttcat
     4801 ttatttcatt taatagttca ttttgactca agttcatgta ggatacgttc cctcacaggt
     4861 agttgagtat atcagcaaag cacacatttc cagctgctta gctccacatt cagaaatttc
     4921 tgctcgtttc ttaggatgaa aatataaaca ctcaatctta aagacaacga gggaagatct
     4981 gccttacatg aactattatt tagttttctc tctacaagga aaaaaaaaat tggttgggct
     5041 gcaaattaca tcagaaaaaa ttaaaaacgg ggagcttgag ttctgtacaa tgtcttcaag
     5101 ctcttctgct aaagcaagaa ttctcacctg tcccttcatt ttcctttccc taacaagcca
     5161 gcaactccaa ccgctctctc tactttttac agtatttttg atagcattgc atttccaaaa
     5221 ctgctatctc agtggaaagg acaagagttg ttcaaataga atgcaaaaga acaaatgtac
     5281 atcatctaat tttttttctt cctcttctta accaggcttc tgtgtcaaac tgctgtgcat
     5341 ccatcccatc agatacaatg gtaggatttc taagggacaa gtccttggca gaatgttgcc
     5401 gatgcaaaga gtatttcccg ggtacatatc tcatattcat gttgagaact gcgatcgctc
     5461 agatcctact agcaatcttg aaagggggaa aggagaaagt gaaatggaag tgtaataagt
     5521 tctgaataaa tacatctcag cattcaaatg ctgtttttgt tctcacatgg atggtctatc
     5581 tcaatgaaag aaggaaaatg tactttacaa atagaataaa gtgtacacag caggcacaac
     5641 aatagattac tgatgcaata tttttgggcc ctacttaata ctgtaaggtc ctaacatcaa
     5701 catctctcca ggtaccagtg aggtactgat ttttaagtat tggtagttgt gaatagctac
     5761 tgttgcagct tcgccagttt acaaactttc tgcattttgg aatacaaagt tatcgaaa
//
LOCUS       CHKMMIM1X    1054 bp ss-mRNA            VRT       25-OCT-1989
DEFINITION  Chicken myeloid protein (mim-1) mRNA, complete cds.
ACCESSION   M29449
KEYWORDS    myeloid protein-1.
SOURCE      Chicken (strain SPAFAS) hatchling myeloblast, cDNA to mRNA, clone
            J612.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Ness,S.A., Marknell,A. and Graf,T.
  TITLE     The v-myb oncogene binds to and activates the promyelocyte-
            specific mim-1 gene
  JOURNAL   Cell 59, 1115-1125 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1054)
  AUTHORS   Ness,S.A.
  JOURNAL   Unpublished (1989) EMBL pf 10.2209, 6900 Heidelberg 1, Germany
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.A.Ness, 25-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept         27     1007     myeloid protein-1
BASE COUNT      297 a    241 c    265 g    251 t
ORIGIN      
        1 gtctttgact cgctcaaagg cacgagatgc cagccctcag cctgatcgcc ctgctcagcc
       61 tggtgtccac tgcttttgcc aggcagtggg aggtgcaccc tccgcagcag cagggcaggc
      121 actgggcgca gatatgcagc gggaatcctt tcaatagaat ccggggctgt gacagatacg
      181 gctgcggcaa ttacggagcc agcagacagg gtaaaggaga aaagcacaag ggcgtggatg
      241 tcatctgcac tgatggatca atagtgtacg ctcccttcag cggccagctc tccggaccca
      301 ttcgattctt tcataatgga aatgccattg atgatggagt ccaaatctct gggtcaggtt
      361 actgtgtaaa actggtctgc attcatccca tcagatacca cggccaaatc cagaaagggc
      421 aacaacttgg aagaatgctg ccaatgcaaa aagtgtttcc tggcattgtg tctcacattc
      481 acgtcgagaa ctgcgaccag tctgatccta ctcatctcct cagacctata cctgatattt
      541 caccaccatt cccacaacaa gatgctcact gggctgtagt atgtgctggg aatcctacaa
      601 atgagataag aggctgtgat aaatatggct gtggatactt tggagctcca aggcgcaatg
      661 gtaaaggaga gaagcacaag ggcgtggatg tcatctgtgc tgatggtgca acagtgtatg
      721 ctcccttttc tggtgagctg tctggaccag ttaaattctt tcataatgga aacgccattg
      781 atgatggagt ccaaatcagg ggatcaggct tctgtgtcaa actgctgtgc atccatccca
      841 tcagatacaa tggtaggatt tctaagggac aagtccttgg cagaatgttg ccgatgcaaa
      901 gagtatttcc cgggtacata tctcatattc atgttgagaa ctgcgatcgc tcagatccta
      961 ctagcaatct tgaaaggggg aaaggagaaa gtgaaatgga agtgtaataa gttctgaata
     1021 aatacatctc agcattcaaa aaaaaaaaaa aaaa
//

GenBank-Updates@genbank.bio.net (04/13/90)

LOCUS       SYCTREA        76 bp ss-tRNA            RNA       15-JUN-1989
DEFINITION  Synechocystis sp. Glu-tRNA.
ACCESSION   M19535
KEYWORDS    transfer RNA; transfer RNA-Glu.
SOURCE      Synechocystis PCC 6803 tRNA.
  ORGANISM  Synechocystis sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Chroococcales.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   O'Neill,G., Peterson,D., Schon,A., Chen,M.-W. and Soll,D.
  TITLE     Formation of the chlorophyll-precursor delta-aminolevulinic acid in
            cyanobacteria requires aminoacylation of a tRNA-Glu
  JOURNAL   J. Bacteriol. (1988) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            D.Soll, 30-JUN-1988
FEATURES       from  to/span     description
    tRNA          1       76     Glu-tRNA
    modified     20       20     d
    modified     54       54     t
    modified     55       55     p
    site         34       34     modified bsae (presumably uracil)
BASE COUNT       15 a     25 c     21 g     14 t      1 others
ORIGIN      
        1 gcccccatcg tctagaggcc taggacacct ccctntcacg gaggcgacag ggattcgaat
       61 tcccttgggg gtacca
//
LOCUS       SYCTREAB     1000 bp ds-DNA             BCT       13-FEB-1990
DEFINITION  Synechocystis sp. Glu-tRNA gene.
ACCESSION   M32099
KEYWORDS    transfer RNA; transfer RNA-Glu.
SOURCE      Synechocystis PCC 6803 DNA, clone pS6TE.
  ORGANISM  Synechocystis sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Chroococcales.
REFERENCE   1  (bases 1 to 1000)
  AUTHORS   O'Neill,G.P. and Soll,D.
  TITLE     Expression of the Synechocystis strain 6803 tRNA-Glu provides a
            functional excess of tRNA for protein and chlorophyll biosynthesis
  JOURNAL   Unpublished (1990) Yale Univ. New Haven, CT 06511
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            D.Soll, 14-FEB-1990.
FEATURES       from  to/span     description
    tRNA        201      273     Glu-tRNA
    signal       97      136     inverted repeat/stem loop
    signal      130      135     -35 transcriptional promoter element
    signal      154      159     -10 transcriptional promoter element
    signal      293      320     inverted repeat/rho-dependent terminator
BASE COUNT      261 a    223 c    254 g    262 t
ORIGIN      
        1 cccgatggca ctaaactggt gacggtccac gaacccattc gctagtaatg agctggccgg
       61 tcagggcgga ctgtttttta ccgttaaacc ccaggcctgt tgaccgtaga aaaaattcat
      121 tgtctagggt tgacagtgat caatctttat cgctatattt aaaaacgtct gaaataacga
      181 actgttatcg agactgccta gcccccatcg tctagaggcc taggacacct ccctttcacg
      241 gaggcgacag ggattcgaat tcccttgggg gtattaaaaa aaaagaaaaa acgaaaagaa
      301 gttcttcggg gcttcttttt tgttgctcac ccaaacaatt ccccaagaag tttaaaaccc
      361 tctttcggtt tatggaacag aagaagaagc ggggagcttg tcgagggaaa cagcgcagaa
      421 atggcaagac ccaaactcca tctatggcag tataggggcg tagttctttg tccttgaggc
      481 tctgtctggc aaatgttaat cctgcgtcgc ctaatactag ttgtttccct tagttttgcc
      541 acgatcgcca ttgacattac tacgagacca gggctggaac cagtgcaagg ccagggggtg
      601 gagacagtag actcgcaaac actaaaagtg ggggtggtgg gcaatcctcc ctttgtcttt
      661 tatggagaag gaaaaaacgc agcatttacc ggtattagcc tagatgtgtg gagagctgtg
      721 gccgagtccc agaagtgaat tcagaatatg taaggcaaaa ttccatctcc gcaggcatca
      781 cagcggtggc ggaaggggaa ttggatattt taattggtcc cattagtgtt accccagagc
      841 gggccgccat agagggcatt accttcaccc agccctattt cagtagtggc attggcctgt
      901 attccaggca aaccggtgag tttatgggaa agattttcgc ctttttttgg catagcgctt
      961 tgtcttcagc gggggtactt actctacttt tatttttagg
//

GenBank-Updates@genbank.bio.net (04/13/90)

LOCUS       RATADHX1      546 bp ds-DNA             ROD       27-OCT-1989
DEFINITION  Rat alcohol dehydrogenase (ADH) gene, exon 1.
ACCESSION   M29516 J04747
KEYWORDS    alcohol dehydrogenase.
SEGMENT     1 of 8
SOURCE      Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH6.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 124; 125 to 239)
  AUTHORS   Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
            Qulali,M., Zhang,K. and Edenberg,H.J.
  TITLE     Structure and expression of the rat class I alcohol dehydrogenase
            gene
  JOURNAL   Genomics 5, 906-914 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 239 to 546)
  AUTHORS   Crabb,D.W.
  JOURNAL   Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept        161  +   178     alcohol dehydrogenase, exon 1 (EC 1.1.1.1)
    pre-msg <     1  >   422     ADH mRNA and introns
    IVS         179  >   546     ADH intron A
BASE COUNT      162 a    101 c    127 g    156 t
ORIGIN      
        1 gaataaactg atgggtgtgt cgtcgtgaag ggctagatca catgtgggat cagctgacac
       61 cccagcaaat aaaatgctgt ggcaactttg gcccagttca ctgcaggagg gaagaagtct
      121 acgaggatcc atccatttcc tgcccacagc gaaggacagc atgagcacag ctggaaaagt
      181 aaggaaaaat acctcaccct tggaaatata ttctcattga tatgtgaaga gatgcagaat
      241 cctaacttag aaatgacagg catcgctggg gtctggtgca gcttgactgt actaagtatc
      301 tttaatgaga gggtatagcg tctaggtttg tgaatgcctt catttcctca ttaactctgt
      361 aggacttttt ccaagccaat ggtgactgct ttgcaaatag aaagttttta ggtcttttat
      421 cgaaagtctg tatttatctt aagataccaa caaatgtgct gtccttcgta atcatatttt
      481 attatgaaca caggaacgtt aaatcggaca gagtttcata tgaggaattt gcttcaggga
      541 atttag
//
LOCUS       RATADHX2      237 bp ds-DNA             ROD       27-OCT-1989
DEFINITION  Rat alcohol dehydrogenase (ADH) gene, exon 2.
ACCESSION   M29517 J04747
KEYWORDS    alcohol dehydrogenase.
SEGMENT     2 of 8
SOURCE      Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH6.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 233)
  AUTHORS   Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
            Qulali,M., Zhang,K. and Edenberg,H.J.
  TITLE     Structure and expression of the rat class I alcohol dehydrogenase
            gene
  JOURNAL   Genomics 5, 906-914 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 234 to 237)
  AUTHORS   Crabb,D.W.
  JOURNAL   Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +   112  +   213     alcohol dehydrogenase, exon 2 (EC 1.1.1.1)
    pre-msg <     1  >   237     ADH mRNA and introns
    IVS     <     1      111     ADH intron A
    IVS         214  >   237     ADH intron B
BASE COUNT       54 a     49 c     67 g     67 t
ORIGIN      About 2.0 Kb after segment 1.
        1 aattatgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gagtgataga
       61 aggatacaat tgcgtgatct cgttggagct gaagtaaccc tttgcttcca ggtaatcaaa
      121 tgcaaagcgg ccgtgctatg ggagcctcac aagcccttca ccatcgagga catagaagtc
      181 gcacccccca aggcccatga agttcgcatt aaggtgaaca ccccttttca agtctgt
//
LOCUS       RATADHX3      186 bp ds-DNA             ROD       27-OCT-1989
DEFINITION  Rat alcohol dehydrogenase (ADH) gene, exon 3.
ACCESSION   M29518 J04747
KEYWORDS    alcohol dehydrogenase.
SEGMENT     3 of 8
SOURCE      Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH6.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 11 to 186)
  AUTHORS   Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
            Qulali,M., Zhang,K. and Edenberg,H.J.
  TITLE     Structure and expression of the rat class I alcohol dehydrogenase
            gene
  JOURNAL   Genomics 5, 906-914 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 10)
  AUTHORS   Crabb,D.W.
  JOURNAL   Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +    30  +   169     alcohol dehydrogenase, exon 3 (EC 1.1.1.1)
    pre-msg <     1  >   186     ADH mRNA and introns
    IVS     <     1       29     ADH intron B
    IVS         170  >   185     ADH intron C
BASE COUNT       44 a     44 c     56 g     42 t
ORIGIN      
        1 gacaactgaa cttgatcttt aacccacaga tggtggccac cggagtctgc cgctcagacg
       61 atcacgcggt tagtggatcc ctgttcacgc ctcttcctgc agttctaggc cacgagggag
      121 ctggcattgt tgagagcatt ggagaagggg tgacttgtgt gaaaccaggg tacagaatag
      181 agctga
//
LOCUS       RATADHX4      586 bp ds-DNA             ROD       27-OCT-1989
DEFINITION  Rat alcohol dehydrogenase (ADH) gene, exons 4 and 5.
ACCESSION   M29519 J04747
KEYWORDS    alcohol dehydrogenase.
SEGMENT     4 of 8
SOURCE      Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 119 to 582)
  AUTHORS   Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
            Qulali,M., Zhang,K. and Edenberg,H.J.
  TITLE     Structure and expression of the rat class I alcohol dehydrogenase
            gene
  JOURNAL   Genomics 5, 906-914 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 118; 583 to 586)
  AUTHORS   Crabb,D.W.
  JOURNAL   Unpublished 1900 (1989) Med Res. build. rm 424 Indiana U. Ind., IN
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +   154      240     alcohol dehydrogenase, exon 4 (EC 1.1.1.1)
                332  +   554     alcohol dehydrogenase, exon 5
    pre-msg <     1  >   586     ADH mRNA and introns
    IVS     <     1      153     ADH intron C
    IVS         241      331     ADH intron D
    IVS         555  >   586     ADH intron E
BASE COUNT      140 a    157 c    136 g    153 t
ORIGIN      About .9 Kb after segment 3.
        1 ttttagagca tattcataat agagacacac atgcacctga aagccaaaac tgactatgag
       61 gaaatgatta tggcttggga acatgcatgc tcttctggag gccctaactg ggtcattcac
      121 aaccacttgc cattctgttt tctcttcatc cagtgataaa gtcatcccgc tgttctctcc
      181 ccagtgtgga aaatgcagga tctgcaagca cccggaaagc aacctctgtt gccaaactaa
      241 gtgatgtgtt tcttattgtc ttcttgcaca attgggcggc actctggttt cattctgtct
      301 cacatcgttt gtattcctgt gtttcaccaa ggaatctgac acagcctaag ggagctttgc
      361 tggacggcac cagcaggttc tcctgcaggg gaaagcccat tcaccacttc atcagcacca
      421 gcaccttctc ccagtacact gtggtagatg acatagcggt ggccaaaatc gatgcggctg
      481 caccgctgga caaagtctgc ctcatcggct gtggcttctc gactggctat ggctctgccg
      541 tccaagtcgc caaggtagga tggacaatgg cccctggaac aaggct
//
LOCUS       RATADHX5      409 bp ds-DNA             ROD       27-OCT-1989
DEFINITION  Rat alcohol dehydrogenase (ADH) gene, exon 6.
ACCESSION   M29520 J04747
KEYWORDS    alcohol dehydrogenase.
SEGMENT     5 of 8
SOURCE      Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 110 to 404)
  AUTHORS   Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
            Qulali,M., Zhang,K. and Edenberg,H.J.
  TITLE     Structure and expression of the rat class I alcohol dehydrogenase
            gene
  JOURNAL   Genomics 5, 906-914 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 109; 405 to 409)
  AUTHORS   Crabb,D.W.
  JOURNAL   Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +   122  +   382     alcohol dehydrogenase, exon 6 (EC 1.1.1.1)
    pre-msg <     1  >   409     ADH mRNA and introns
    IVS     <     1      121     ADH intron E
    IVS         383  >   409     ADH intron F
BASE COUNT      108 a     94 c    106 g    101 t
ORIGIN      About 2.3 Kb after segment 4.
        1 gggtttccaa ctgatggaat gaagtagcca cgataaagga catagcaagg cagtttgtgc
       61 agtgtggaac acttccgaac agtcaccatt cagtccactc ttgcgttttc tgaaaataca
      121 ggtgacccca ggctccacct gtgccgtgtt tggcctggga ggtgttggtc tgtctgtcgt
      181 cattggctgt aaaacagcag gagcagccaa gatcattgcc gtggacatca acaaagacaa
      241 gtttgcgaag gccaaagagt taggtgccac tgactgtatc aaccctcaag actacaccaa
      301 acccatccag gaagttctcc aggagatgac tgatggaggg gtggactttt catttgaagt
      361 cattggccgt cttgatacca tggtatgtac tgtggcatac cttaagatc
//
LOCUS       RATADHX6      262 bp ds-DNA             ROD       27-OCT-1989
DEFINITION  Rat alcohol dehydrogenase (ADH) gene, exon 7.
ACCESSION   M29521 J04747
KEYWORDS    alcohol dehydrogenase.
SEGMENT     6 of 8
SOURCE      Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 49 to 216)
  AUTHORS   Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
            Qulali,M., Zhang,K. and Edenberg,H.J.
  TITLE     Structure and expression of the rat class I alcohol dehydrogenase
            gene
  JOURNAL   Genomics 5, 906-914 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 48; 217 to 262)
  AUTHORS   Crabb,D.W.
  JOURNAL   Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +    63  +   198     alcohol dehydrogenase, exon 7 (EC 1.1.1.1)
    pre-msg <     1  >   262     ADH mRNA and introns
    IVS     <     1       62     ADH intron F
    IVS         199  >   262     ADH intron G
BASE COUNT       62 a     68 c     65 g     67 t
ORIGIN      About 1.8 Kb after segment 5.
        1 attgcctaaa gctgcacagg accacaaaaa gaatctactt taactattaa tgtttattcc
       61 agacttctgc cctgttaagc tgccattcag catgcggtgt aagcgtcatt gtcggggtgc
      121 ctcccagtgc ccaaagcctc tccgttaacc ccatgtcgct gctgctggga cgcacctgga
      181 aaggagcaat attcggcggt atgtatttac agcgaggaga tcaacgtccg catctgttca
      241 tgcatgtcaa ggtgggcgtg cg
//
LOCUS       RATADHX7      186 bp ds-DNA             ROD       27-OCT-1989
DEFINITION  Rat alcohol dehydrogenase (ADH) gene, exon 8.
ACCESSION   M29522 J04747
KEYWORDS    alcohol dehydrogenase.
SEGMENT     7 of 8
SOURCE      Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 21 to 186)
  AUTHORS   Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
            Qulali,M., Zhang,K. and Edenberg,H.J.
  TITLE     Structure and expression of the rat class I alcohol dehydrogenase
            gene
  JOURNAL   Genomics 5, 906-914 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 20)
  AUTHORS   Crabb,D.W.
  JOURNAL   Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +    31  +   169     alcohol dehydrogenase, exon 8 (EC 1.1.1.1)
    pre-msg <     1  >   186     ADH mRNA and introns
    IVS     <     1       30     ADH intron G
    IVS         170  >   186     ADH intron H
BASE COUNT       47 a     37 c     45 g     57 t
ORIGIN      About .9 Kb after segment 6.
        1 gatctcatgg agatgctgct ttcccttcag ggtttaagag taaagatgcc gtccccaaac
       61 ttgtcgctga cttcatggct aagaagtttc cgttggagcc gctgattact catgttttac
      121 cttttgaaaa gataaatgaa gcatttgacc tgctccgtgc tggaaagagg taagctttga
      181 gagtat
//
LOCUS       RATADHX8      545 bp ds-DNA             ROD       27-OCT-1989
DEFINITION  Rat alcohol dehydrogenase (ADH) gene, exon 9.
ACCESSION   M29523 J04747
KEYWORDS    alcohol dehydrogenase.
SEGMENT     8 of 8
SOURCE      Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 65 to 360)
  AUTHORS   Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R.,
            Qulali,M., Zhang,K. and Edenberg,H.J.
  TITLE     Structure and expression of the rat class I alcohol dehydrogenase
            gene
  JOURNAL   Genomics 5, 906-914 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 64; 361 to 545)
  AUTHORS   Crabb,D.W.
  JOURNAL   Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W.Crabb, 13-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +    94      118     alcohol dehydrogenase, exon 9 (EC 1.1.1.1)
    pre-msg <     1  >   360     ADH mRNA and introns
    IVS     <     1       93     ADH intron H
BASE COUNT      130 a    150 c    109 g    156 t
ORIGIN      About .7 Kb after segment 7.
        1 gttgtctcag agatcgtaac acatgtttta ctcgtgcata tggggagtgt aggtgtggag
       61 tgtgaaaacg gtatcttcta ttttgacttt cagtatccgt accgtcctga cgttctgaga
      121 acacatgaat gccttcccgg tttctgactc ctccgcctac cgaatcaagc gccagccaca
      181 tcacaacctt aaccttgctc ttcagagaca cacagccaat aaagcacctg tgcaagcttt
      241 ccgtgagaaa ctgagttttc gatggaacct gttccaaccg catgtttaca atcctatctt
      301 attgtttcgc acctcagctg cctgttaaac accgtgatca aaacacccag cccattgctc
      361 ttcgcccctc cccggaacac tgtactattt ctggaggtgt tgggtcagtc attatttcag
      421 gtttttattg aaggaaacaa ctccatattc attgtccacc aaagggcact tagaacgcac
      481 gagcgctcat gtgtggtgct gccttttagt tcttacaaca gagcttctcc acgcttccag
      541 ccagc
//

GenBank-Updates@genbank.bio.net (04/13/90)

LOCUS       CHTHYPAB     2659 bp ds-DNA             BCT       12-FEB-1990
DEFINITION  C.trachomatis HypA and HypB protein genes, complete cds.
ACCESSION   M31739
KEYWORDS    .
SOURCE      C.trachomatis (serovar A, isolate HAR-13) elementary body DNA,
            clone pTA571.
  ORGANISM  Chlamydia trachomatis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE   1  (bases 1 to 2659)
  AUTHORS   Morrison,R.P., Lyng,K., Su,H., Yuan,Y. and Zhang,Y.-X.
  JOURNAL   Unpublished (1990) Rocky Mountain Laboratory, Hamilton, MT 59840
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.P.Morrison, 31-JAN-1990.
FEATURES       from  to/span     description
    pept        441      779     HypA protein
    pept        817     2451     HypB protein
BASE COUNT      830 a    527 c    613 g    689 t
ORIGIN      
        1 gtcgattttt gtgatgctcg tcagggggcg gagcctatgg aaaaacgcca gcaacgcggc
       61 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc
      121 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag
      181 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa
      241 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga
      301 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc
      361 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca
      421 atttcacaca ggaaacagct atgaccatga ttacgaattc gagctcggta cccggggatc
      481 aagcaacgac cctcaagatt aaacctttgg gagatagaat tttagttaaa agagaagaag
      541 aagcttccac tgcaagaggc ggaatcattc ttcctgacac tgccaagaaa aagcaagata
      601 gagctgaagt tttagctcta ggaacaggca aaaaagatga taaagggcag caacttcctt
      661 ttgaagttca ggttggtgac atcgttttaa ttgataaata ttctggccaa gaacttactg
      721 tcgaaggtga agagtacgtc atcgttcaaa tgagcgaagt tatcgcagtt ctgcaataaa
      781 aactaagaga gtgaagaaga tttaaggagc gcatcaatgg tcgctaaaaa cattaaatac
      841 aacgaagaag ccagaaagaa aattcaaaaa ggagttaaga ctttagctga agctgtaaaa
      901 gtcactctag ggcctaaagg acgacatgtt gtcatagata aaagcttcgg atcccctcaa
      961 gtaactaaag atggtgttac cgttgcgaaa gaagttgagc ttgccgacaa acatgaaaat
     1021 atgggcgctc aaatggtcaa agaagtcgcc agcaaaactg ctgacaaagc tggagacgga
     1081 actacaacag ctactgttct tgctgaagct atctatacag aaggattacg caatgtaaca
     1141 gctggagcaa atccaatgga cctcaaacga ggtattgata aagctgttaa ggttgttgtt
     1201 gatcaaatca aaaaaatcag caagcctgtt cagcatcata aagaaattgc tcaagttgca
     1261 acaatttctg ctaataatga tgcagaaatc gggaatctga ttgctgaagc aatggagaaa
     1321 gttggtaaaa acggctctat cactgttgaa gaagcaaaag gatttgaaac cgttttggat
     1381 gttgttgaag gaatgaattt caatagaggt tacctctcta gctacttcgc aacaaatcca
     1441 gaaactcaag aatgtgtatt agaagacgct ttggttctaa tctacgataa gaaaatttct
     1501 gggatcaaag atttccttcc tattttacaa caagttgctg aatccggccg tcctcttctt
     1561 attatagcag aagacattga aggcgaagct ttagctactt tggtcgtgaa cagaattcgt
     1621 ggaggattcc gggtttgcgc agttaaagct ccaggctttg gagatagaag aaaagctatg
     1681 ttggaagaca tcgctatctt aactggcggt caactcatta gcgaagagtt gggcatgaaa
     1741 ttagaaaacg ctaacttagc tatgttaggt aaagctaaaa aagttatcgt ttctaaagaa
     1801 gacacgacca tcgtcgaagg aatgggtgaa aaagaagctt tagaagctcg ttgcgaaagc
     1861 atcaaaaaac aaattgaaga cagctcttct gattacgata aagaaaaact ccaagagcgt
     1921 cttgctaagc tctctggtgg agtagcagtc attcgcgttg gagctgcaac agagattgag
     1981 atgaaagaga aaaaagatcg tgtagacgat gctcaacatg ctacaatcgc tgctgttgaa
     2041 gaaggaattc ttcctggtgg aggaacagca ttaatccgtt gtatccctac tcttgaagcc
     2101 ttcttgccaa tgttgactaa tgaagatgag caaattggag ctcgcattgt tttgaaagct
     2161 ctttccgctc ctttgaaaca aattgctgca aacgcaggaa aagaaggtgc tatcatcttc
     2221 caacaagtta tgtcccgttc tgcgaacgaa ggatatgatg cattgcgtga tgcatacaca
     2281 gatatgcttg aagctggtat tttagatcct gctaaagtaa cccgttctgc tttagaaagc
     2341 gcggcttccg tagctggatt acttttgaca acagaagctc tcattgcaga gattccagaa
     2401 gaaaaacctg ctgcagctcc agcaatgcct ggcgcaggaa tggactatta attcctctaa
     2461 tgggaacaaa tagattcttc gagcctcgtt tccaaaagga acgaggcttt tttttagatt
     2521 cctaatattt ctctattcct ctatcgtaaa catctagtgc ttacgaccat ccttttctat
     2581 gtttaaacta atcaagagcg catttctcat agcctgttgt attgtagggt acttctggat
     2641 aaaaaaagaa agtattgtt
//
LOCUS       HUMCA1XIA    6158 bp ss-mRNA            PRI       15-MAR-1989
DEFINITION  Human alpha-1 type XI collagen (COL11A1) mRNA, complete cds.
ACCESSION   J04177 J05407
KEYWORDS    alpha-1 (XI) collagen; collagen; type XI collagen.
SOURCE      Human placenta fibroblast, cDNA to mRNA, clone OK4 [1].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1773 to 6158)
  AUTHORS   Bernard,M., Yoshioka,H., Rodriguez,E., van der Rest,M., Kimura,T.,
            Ninomiya,Y., Olsen,B.R. and Ramirez,F.
  TITLE     Cloning and sequencing of pro-alpha-1(XI) collagen cDNA
            demonstrates that type XI belongs to the fibrillar class of
            collagens and reveals that the expression of the gene is not
            restricted to cartilagenous tissue
  JOURNAL   J. Biol. Chem. 263, 17159-17166 (1988)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1835)
  AUTHORS   Yoshioka,H. and Ramirez,F.
  TITLE     Pro alpha-1(XI) collagen, structure of the amino-terminal
            propeptide and expression of the gene in tumor cell lines
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Ramirez, 02-FEB-1990; for [2] by M.Bernard, 21-SEP-1988.
FEATURES       from  to/span     description
    pept        162     5582     alpha-1 (type XI) collagen precursor
                                 /nomgen="COL11A1" /map="1p21"
                                 /hgml_locus_uid="LH0170J"
    sigp        162      269     alpha-1 (type XI) collagen signal peptide
    matp        270     1745     alpha-1 (type XI) collagen
    matp       5013     5579     alpha-1 (type XI) collagen
BASE COUNT     1693 a   1355 c   1746 g   1361 t      3 others
ORIGIN      
        1 aaccatcaaa tttagaagaa aaagcccttt gactttttcc ccctctccct ccccaatggc
       61 tgtgtagcaa acatccctgg cgataccttg gaaaggacga agttggtctg cagtcgcaat
      121 ttcgtgggtt gagttcacag ttgtgagtgc ggggctcgga gatggagccg tggtcctcta
      181 ggtggaaaac gaaacggtgg ctctgggatt tcaccgtaac aaccctcgca ttgaccttcc
      241 tcttccaagc tagagaggtc agaggagctg ctccagttga tgtactaaaa gcactagatt
      301 ttcacaattc tccagaggga atatcaaaaa caacgggatt ttgcacaaac agaaagaatt
      361 ctaaaggctc agatactgct tacagagttt caaagcaagc acaactcagt gccccaacaa
      421 aacagttatt tccaggtgga actttcccag aagacttttc aatactattt acagtaaaac
      481 caaaaaaagg aattcagtct ttccttttat ctatatataa tgagcatggt attcagcaaa
      541 ttggtgttga ggttgggaga tcacctgttt ttctgtttga agaccacact ggaaaacctg
      601 ccccagaaga ctatcccctc ttcagaactg ttaacatcgc tgacgggaag tggcatcggg
      661 tagcaatcag cgtggagaag aaaactgtga caatgattgt tgattgtaag aagaaaacca
      721 cgaaaccact tgatagaagt gagagagcaa ttgttgatac caatggaatc acggtttttg
      781 gaacaaggat tttggatgaa gaagtttttg agggggacat tcagcagttt ttgatcacag
      841 gtgatcccaa ggcagcatat gactactgtg agcattatag tccagactgt gactcttcag
      901 cacccaaggc tgctcaagct caggaacctc agatagatga gtatgcacca gaggatataa
      961 tcgaatatga ctatgagtat ggggaagcag agtataaaga ggctgaaagt gtaacagagg
     1021 gacccactgt aactgaggag acaatagcac agacggaggc aaacatcgtt gatgattttc
     1081 aagaatacaa ctatggaaca atggaaagtt accagacaga agctcctagg catgtttctg
     1141 ggacaaatga gccaaatcca gttgaagaaa tatttactga agaatatcta acgggagagg
     1201 attatgattc ccagaggaaa aattctgagg atacactata tgaaaacaaa gaaatagacg
     1261 gcagggattc tgatcttctg gtagatggag atttaggcga atatgatttt tatgaatata
     1321 aagaatatga agataaacca acaagccccc ctaatgaaga atttggtcca ggtgtaccag
     1381 cagaaactga tattacagaa acaagcataa atggccatgg tgcatatgga gagaaaggac
     1441 agaaaggaga accagcagtg gttgagcctg gtatgcttgt cgaaggacca ccaggaccag
     1501 caggacctgc aggtattatg ggtcctccag gtctacaagg ccccactgga ccccctggtg
     1561 accctggcga taggggcccc ccaggacgtc ctggcttacc aggggctgat ggtctacctg
     1621 gtcctcctgg tactatgttg atgttaccgt tccgttatgg tggtgatggt tccaaaggac
     1681 caaccatctc tgctcaggaa gctcaggctc aagctattct tcagcaggct cggattgctc
     1741 tgagaggccc acctggccca atgggtctaa ctggaagacc aggtcctgtg ggggggcctg
     1801 gttcatctgg ggccaaaggt gagagtggtg atccaggtcc tcagggccct cgaggcgtcc
     1861 agggtccccc tggtccaacg ggaaaacctg gaaaaagggg tcgtccaggt gcagatggag
     1921 gaagaggaat gccaggagaa cctggggcaa agggagatcg agggtttgat ggacttccgg
     1981 gtctgccagg tgacaaaggt cacaggggtg aacgaggtcc tcaaggtcct ccaggtcctc
     2041 ctggtgatga tggaatgagg ggagaagatg gagaaattgg accaagaggt cttccaggtg
     2101 aagctggccc acgaggtttg ctgggtccaa ggggaactcc aggagctcca gggcagcctg
     2161 gtatggcagg tgtagatggc cccccaggac caaaagggaa catgggtccc caaggggagc
     2221 ctgggcctcc aggtcaacaa gggaatccag gacctcaggg tcttcctggt ccacaaggtc
     2281 caattggtcc tcctggtgaa aaaggaccac aaggaaaacc aggacttgct ggacttcctg
     2341 gtgctgatgg gcctcctggt catcctggga aagaaggcca gtctggagaa aagggggctc
     2401 tgggtccccc tggtccacaa ggtcctattg gatnnccggg cccccgggga gtaaagggag
     2461 cagatggtgt cagaggtctc aagggatcta aaggtgaaaa gggtgaagat ggttttccag
     2521 gattcaaagg tgacatgggt ctaaaaggtg acagaggaga agttggtcaa attggcccaa
     2581 gagggnaaga tggccctgaa ggacccaaag gtcgagcagg cccaactgga gacccaggtc
     2641 cttcaggtca agcaggagaa aagggaaaac ttggagttcc aggattacca ggatatccag
     2701 gaagacaagg tccaaagggt tccactggat tccctgggtt tccaggtgcc aatggagaga
     2761 aaggtgcacg gggagtagct ggcaaaccag gccctcgggg tcagcgtggt ccaacgggtc
     2821 ctcgaggttc aagaggtgca agaggtccca ctgggaaacc tgggccaaag ggcacttcag
     2881 gtggcgatgg ccctcctggc cctccaggtg aaagaggtcc tcaaggacct cagggtccag
     2941 ttggattccc tggaccaaaa ggccctcctg gaccaccagg aaggatgggc tgcccaggac
     3001 accctgggca acgtggggag actggatttc aaggcaagac cggccctcct gggccagggg
     3061 gagtggttgg accacaggga ccaaccggtg agactggtcc aataggggaa cgtgggtatc
     3121 ctggtcctcc tggccctcct ggtgagcaag gtcttcctgg tgctgcagga aaagaaggtg
     3181 caaagggtga tccaggtcct caaggtatct cagggaaaga tggaccagca ggattacgtg
     3241 gtttcccagg ggaaagaggt cttcctggag ctcagggtgc acctggactg aaaggagggg
     3301 aaggtcccca gggcccacca ggtccagttg gctcaccagg agaacgtggg tcagcaggta
     3361 cagctggccc aattggttta cgagggcgcc cgggacctca gggtcctcct ggtccagctg
     3421 gagagaaagg tgctcctgga gaaaaaggtc cccaagggcc tgcagggaga gatggagttc
     3481 aaggtcctgt tggtctccca gggccagctg gtcctgccgg ctcccctggg gaagacggag
     3541 acaagggtga aattggtgag ccgggacaaa aaggcagcaa gggtggcaag ggagaaaatg
     3601 gccctcccgg tcccccaggt cttcaaggac cagttggtgc ccctggaatt gctggaggtg
     3661 atggtgaacc aggtcctaga ggacagcagg ggatgtttgg gcaaaaaggt gatgagggtg
     3721 ccagaggctt ccctggacct cctggtccaa taggtcttca gggtctgcca ggcccacctg
     3781 gtgaaaaagg tgaaaatggg gatgttggtc catgggggcc acctggtcct ccaggcccaa
     3841 gaggccctca aggtcccaat ggagctgatg gaccacaagg acccccaggt tctgttggtt
     3901 cagttggtgg tgttggagaa aagggtgaac ctggagaagc aggaaaccca gggcctcctg
     3961 gggaagcagg tgtaggcggt cccaaaggag aaagaggaga gaaaggggaa gctggtccac
     4021 ctggagctgc tggacctcca ggtgccaagg ggccgccagg tgatgatggc cctaagggta
     4081 acccgggtcc tgttggtttt cctggagatc ctggtcctcc tggggaactt ggccctgcag
     4141 gtcaagatgg tgttggtggt gacaagggtg aagatggaga tcctggtcaa ccgggtcctc
     4201 ctggcccatc tggtgaggct ggcccaccag gtcctcctgg aaaacgaggt cctcctggag
     4261 ctgcaggtgc agagggaaga caaggtgaaa aaggtgctaa gggggaagca ggtgcagaag
     4321 gtcctcctgg aaaaaccggc ccagtcggtc ctcagggacc tgcaggaaag cctggtccag
     4381 aaggtcttcg gggcatccct ggtcctgtgg gagaacaagg tctccctgga gctgcaggcc
     4441 aagatggacc acctggtcct atgggacctc ctggcttacc tggtctcaaa ggtgaccctg
     4501 gctccaaggg tgaaaaggga catcctggtt taattggcct gattggtcct ccaggagaac
     4561 aaggggaaaa aggtgaccga gggctccctg gaactcaagg atctccagga gcaaaagggg
     4621 atgggggaat tcctggtcct gctggtccct taggtccacc tggtcctcca ggcttaccag
     4681 gtcctcaagg cccaaagggt aacaaaggct ctactggacc cgctggccag aaaggtgaca
     4741 gtggtcttcc agggcctcct gggcctccag gtccacctgg tgaagtcatt cagcctttac
     4801 caatcttgtc ctccaaaaaa acgagaagac atactgaagg catgcaagca gatgcagatg
     4861 ataatattct tgattactcg gatggaatgg aagaaatatt tggttccctc aattccctga
     4921 aacaagacat cgagcatatg aaatttccaa tgggtactca gaccaatcca gcccgaactt
     4981 gtaaagacct gcaactcagc catcctgact tcccagatgg tgaatattgg attgatccta
     5041 accaaggttg ctcaggagat tccttcaaag tttactgtaa tttcacatct ggtggtgaga
     5101 cttgcattta tccagacaaa aaatctgagg gagtaagaat ttcatcatgg ccaaaggaga
     5161 aaccaggaag ttggtttagt gaatttaaga ggggaaaact gctttcatac ttagatgttg
     5221 aaggaaattc catcaatatg gtgcaaatga cattcctgaa acttctgact gcctctgctc
     5281 ggcaaaattt cacctaccac tgtcatcagt cagcagcctg gtatgatgtg tcatcaggaa
     5341 gttatgacaa agcacttcgc ttcctgggat caaatgatga ggagatgtcc tatgacaata
     5401 atccttttat caaaacactg tatgatggtt gtacgtccag aaaaggctat gaaaaaactg
     5461 tcattgaaat caatacacca aaaattgatc aagtacctat tgttgatgtc atgatcagtg
     5521 actttggtga tcagaatcag aagttcggat ttgaagttgg tcctgtttgt tttcttggct
     5581 aagattaaga caaagaacat atcaaatcaa cagaaaatgt accttggtgc caccaaccca
     5641 ttttgtgcca catgcaagtt ttgaataagg atgtatggaa aacaacgctg catatacagg
     5701 taccatttag gaaataccga tgcctttgtg ggggcagaat cacagacaaa agctttgaaa
     5761 atcataaaga tataagttgg tgtggctaag atggaaacag ggctgattct tgattcccaa
     5821 ttctcaactc tccttttcct atttgaattt ctttggtgct gtagaaaaca aaaaaagaaa
     5881 aatatatatt cataaaaaat atggtgctca ttctcatcca tccaggatgt actaaaacag
     5941 tgtgtttaat aaattgtaat tattttgtgt acagttctat actgttatct gtgtccattt
     6001 ccaaaacttg cacgtgtccc tgaattccgc tgactctaat ttatgaggat gccgaactct
     6061 gatggcaata atatatgtat tatgaaaatg aagttatgat ttccgatgac cctaagtccc
     6121 tttctttggt taatgatgaa attcctttgt gtgtgttt
//
LOCUS       STYENVM      1656 bp ds-DNA             BCT       14-FEB-1990
DEFINITION  S.typhimurium envM protein gene, complete cds.
ACCESSION   M31806
KEYWORDS    .
SOURCE      S.typhimurium (strain AG701i50) DNA, clone KF501.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1656)
  AUTHORS   Turnowsky,F., Fuchs,K., Jeschek,C. and Hoegenauer,G.
  TITLE     envM genes of Salmonella typhimurium and Escherichia coli
  JOURNAL   J. Bacteriol. 171, 6555-6565 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        259      558     ORF-1
    pept        676     1464     envM protein
BASE COUNT      413 a    395 c    394 g    454 t
ORIGIN      
        1 cgatgatatc tcttaaatac ctttctgata tctggcaact atgatttccg atctgaaatg
       61 taaacggcat atgtctcatc ctctgaaaat ttattataat acacatgcaa acatataaaa
      121 acctgatttg taactactgt aaccacctga tttccaaaat ttttcatatt cagtattact
      181 ctgcatcaat aaatagagaa aaccttcagc acctaaaaaa gaaattattt ttgttaagca
      241 tttgctggtt aaagttacat gtccataaca tcgctaatag gtgaggtaaa ccatttggat
      301 gaatattcag gcttcctaca gaacctttat ttgtctggtg gaaagcgcat cgctatcgta
      361 cttgctatcc gtcccggtgt ctctggttgg atgaggcgaa ccttacacca gctggtatac
      421 tgcaccgtag ggcctgtccc attagactct ttgatttgcc agtttggccc tggggagtgc
      481 tcaaaatcct cagggacttc gttaccgtgt ggttttagaa cgtcacctgc cggagatagc
      541 ggcggaaaaa tagcgtgaca cacggggcca ggagtgggat ataatttcag actaatttcg
      601 aataatgttg aaaagttgct gttaaccatt gttcgtaaaa gcaaaaaata agcaatgact
      661 ataaggatta aagctatggg ttttctttcc ggtaagcgca ttctggtcac tggcgtggcc
      721 agtaaactgt ccatcgccta cggtatcgca caggcgatgc accgcgaagg agctgaactg
      781 gcgttcacct accagaacga caaactgaaa ggccgcgtag aagaatttgc tgcccagctg
      841 ggttccagca tcgttttacc gtgtgatgtc gctgaagatg ccagcattga cgctatgttt
      901 gcggagctgg gtaacgtttg gccgaaattt gacggttttg tacactcgat cggtttcgcg
      961 cctggcgatc agctggacgg cgattatgtt aatgcggtta cccgcgaagg ctttaaagtc
     1021 gctcacgata ttagctctta cagctttgtt gccatggcga aagcctgccg caccatgctg
     1081 aatccgggtt ctgcgctgtt gaccttgtct tatctgggag cggagcgcgc catcccgaac
     1141 tacaatgtca tgggtctggc gaaagcctct ctggaagcca acgtgcgtta tatggcaaac
     1201 gctatgggac cggaaggggt tcgcgtcaac gccatctctg cgggtccaat ccgcacgctg
     1261 gcggcgtccg gtattaaaga cttccgtaaa atgctggcgc attgtgaagc cgttacgcca
     1321 attcgtcgca ccgtgaccat tgaagatgtg ggtaactcag cggcgtttct gtgttccgat
     1381 ctctcggcag gcatttctgg cgaagtcgtc catgttgacg gcgggttcag cattgcggca
     1441 atgaatgaac tggaactgaa gtaagcgcat taagtttatt agcgggcggc acggctgtgc
     1501 tgccccattc ccctcactca ccactcatct cgttatatag cgttctgata tttgttatca
     1561 cctaacaatc tttcatccct ttcccgcata cgccaggata atccagcata aacgatccga
     1621 cgccgaatca ttatcgatcg ccatttcatc ctcggg
//

GenBank-Updates@genbank.bio.net (04/13/90)

LOCUS       DROEMCA1      934 bp ds-DNA             INV       07-MAR-1990
DEFINITION  D.melanogaster extramacrochaetae protein gene, exon 1.
ACCESSION   M32636
KEYWORDS    extramacrochaetae protein; helix-loop-helix protein;
            regulatory protein.
SEGMENT     1 of 2
SOURCE      D.melanogaster 8-12 hour embryo, cDNA to mRNA, clones pNB5B, and
            DNA, clones pP7, lambda-B[11,16].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 934)
  AUTHORS   Ellis,H.M., Spann,D.R. and Posakony,J.W.
  TITLE     Extramacrochaetae, a negative regulator of sensory organ
            development in Drosophila, defines a new class of helix-loop-helix
            proteins
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.W.Posakony, 06-MAR-1990.
FEATURES       from  to/span     description
    pept        347  +   910     extramacrochaetae protein, exon 1
    pre-msg      85  >   934     emc mRNA and introns (minor alt.)
    IVS         911  >   934     emc intron
BASE COUNT      270 a    273 c    244 g    147 t
ORIGIN      Chromosome 3, left arm 61D1,2.
        1 cgaacccgag agcgaaagac tccgctatcg gattcggaaa tgccgttcgg ggggtataaa
       61 agcgggcgct ctgagcgaga caccactcat tcgttgtctg accgctcgca gcaaaagata
      121 tagaaatcga gaacgtaaac gcagaaaaca cgctggatta aagtgtttcg tttcgaacat
      181 cgacagagtg agaaaacaaa gtgaagtgag cgcacaaagc cgaaatcaaa tcaaattgaa
      241 aagccagtca agagcaatct ttaactgaac ctcgcaagta atccggtacg accgtgtaac
      301 aaccgccaag attctactca aggagcgtaa agcaatcatc tccagaatga agtccctgac
      361 ggccgtctgc cagacaggtg cctccggaat gccggccttg aatgccagcg ggcgcatcca
      421 gcgccatccc acgcaccgcg gcgacgggga gaacgccgag atgaagatgt atctgtccaa
      481 actgaaggac ctcgttccgt tcatgcccaa gaacaggaag ctcaccaagc tggagatcat
      541 ccagcacgtc atcgactaca tctgcgacct gcagaccgag ctggagacgc accccgagat
      601 gggcaacttc gatgcggcag ccgctctgac ggcggtgaac ggactccacg aggacgagga
      661 cagcgacatg gaggatgcgg atgccgaggc agaagcggaa gtcgatccag atatcctcgc
      721 ccagcgcctg aatgccgagc agccggcgaa agtctctagt cccgccgccc gtctcccgct
      781 taccgatcgc caaacgccca acactcttgt ggcgcccgcc catccgcagc agcatcagca
      841 gcagcagcaa ctgcaactgc agcagcaaca actgcaatca cagcagcaac tgtccaacag
      901 tttagcaacg gtgagttaac aaacaaattg ctag
//
LOCUS       DROEMCA2     1307 bp ds-DNA             INV       07-MAR-1990
DEFINITION  D.melanogaster extramacrochaetae protein gene, exon 2.
ACCESSION   M32637
KEYWORDS    extramacrochaetae protein; helix-loop-helix protein;
            regulatory protein.
SEGMENT     2 of 2
SOURCE      D.melanogaster 8-12 hour embryo, cDNA to mRNA, clones pNB5B, and
            DNA, clones pP7, lambda-B[11,16].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1307)
  AUTHORS   Ellis,H.M., Spann,D.R. and Posakony,J.W.
  TITLE     Extramacrochaetae, a negative regulator of sensory organ
            development in Drosophila, defines a new class of helix-loop-helix
            proteins
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.W.Posakony, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +    31       66     extramacrochaetae protein (emc), exon 1
    pre-msg <     1     1140     emc mRNA and introns
    IVS     <     1       30     emc intron
BASE COUNT      499 a    244 c    193 g    371 t
ORIGIN      Chromosome 3, left arm 61D1,2.
        1 taatttgaat cttccgcttc tctcttttag ccacagaatg cggagaaaga cagcagacag
       61 tcgtaagcgt ggaaacaccc agccgggaaa acccacggaa cctataagcc tattaactaa
      121 atgcataatg aaagtacagt caaaactcca ctcgcaacct tcgtgtgaag atctccgcat
      181 aagataacgt gtgaaactct caagaaaacc acagcccagg cgcatctctt caacgctcct
      241 tttcccggga aagggaaaac ccgggaaaac gagacctcgc ttttggctcc aacagagttt
      301 agtttaaaca tttaataaca agaatacaac aacacaaaac ctagcattag agcagcgtaa
      361 gcaagtacac attatgatta ttaattaaac attacaacta ggatgatggc gcttaagaat
      421 gaacggaaga aagaaaactt ggcaaagctt taaacaattt aattttcctt taacgcttga
      481 aattacactg aagcatatat ttttgttttc ctaattttag ttttgttttg gttaaaacaa
      541 cgcaagaaaa aatgttacta atgcaaagca aacagagatt agcaataaaa aaaaattata
      601 aaaatgcaca ggcaccaaac tggaattacg ttgaatcacc gtttcctgca ttacgattat
      661 atgatttcaa tccaaatgat tttcctatta attttaatca atgaagagtt ttcctattat
      721 tgattagcca cagccaataa acaacaagaa caaccaactc agcagcacac acacattcaa
      781 attcattaat ttatatcgag agagtacgtt tagagtttaa gttcgagaaa gttcatttcc
      841 aggcactaat tttcactcac accactcaca tagaaaaaca caaacacctc attttgtagt
      901 tttccgagtc ccttggatcg ttttcctctt ttttaatcga aataacgcat aatgcataat
      961 aattattacc agatatacac taaaacctat aaatatatat taatatatgt tctgtgatct
     1021 gtagcatact ttgtacaaag tgtttttttt agcataatta tgcgatttta attatgatcc
     1081 gcaatcgaaa ttagagattt ttgttctcat ttcgatggta aaaacactct atcatgatgt
     1141 aaaaaaaaaa aacaaaagaa aacatttgca tgatgttgcg aaaaaatata ttatctaaaa
     1201 cataaagaaa aacaaacact agcctcttaa taaattgcaa aaatttaaaa aacattaaaa
     1261 atcgaatatg tcaatggatt tccgttggga gcaggggatt aagaaac
//
LOCUS       DROFMRFA1    1328 bp ds-DNA             INV       08-MAR-1990
DEFINITION  D.melanogaster FMRFamide neuropeptide gene, 5' flank, exon 1.
ACCESSION   M32640 J05415
KEYWORDS    FMRFamide neuropeptide.
SEGMENT     1 of 2
SOURCE      D.melanogaster (strain Canton S) DNA, (library of T.Maniatis).
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1328)
  AUTHORS   Schneider,L.E. and Taghert,P.H.
  TITLE     Organization and expression of the Drosophila FMRFamide
            neuropeptide gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.E.Schneider, 07-MAR-1990.
FEATURES       from  to/span     description
    pre-msg    1139  >  1328     FMRFamide neuropeptide mRNA and intron
    IVS        1248  >  1328     FMRFamide neuropeptide intron
    signal     1066     1071     CAAT box
    signal     1109     1114     TATA box
    site         40       51     conserved sequence 1
    site        130      140     conserved sequence 2
    site        309      326     conserved sequence 3
    site        374      388     conserved sequence 4
    site        394      414     conserved sequence 5
    site        507      545     conserved sequence 6
    site        574      602     conserved sequence 7
    site        637      649     conserved sequence 8
    site        671      721     conserved sequence 9
    site        746      766     conserved sequence 10
    site        860      872     conserved sequence 11
    site        894      909     conserved sequence 12
    site       1015     1027     conserved sequence 13
    site       1028     1041     conserved sequence 14
    site       1053     1061     conserved sequence 15
    site       1063     1076     conserved sequence 16
    site       1086     1095     conserved sequence 17
    site       1108     1175     conserved sequence 18
BASE COUNT      340 a    331 c    319 g    338 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttctgg ggtgagtcca tagattacag agcagcgcag ttttcttgac cagatgatgg
       61 tctagctgaa ctatttttaa ctgggcgatt ggcacacgac ttggcctggt catcaaactc
      121 aagccaagat ttgaacttga actagtctcg ccttatcaca ttgccttcta attcgttttc
      181 attttcgatt ttcgcatcta cttatgctaa tgaagccatc tgcagacgtg gttttcgaac
      241 gtatttatat tgattatggg tgatcgtcaa caagagcagt ggacacccaa taaacctgtc
      301 caaaacccga cacatttctg cccagtcatg cgtggtggac aatagccaaa tgccattgat
      361 gagactcgtc tccaaaactt tggccttttg ccgggccgta attacagact tccgtctttt
      421 gaacagtttt ttcagcccca cccaagagtc gagtcttgaa aagctggctg ggatggggtg
      481 gtttcgggtg ctggacgaga tgccagaggc gccacaatgt atcctgttac aggttacagg
      541 gccataaagc gccataaacg ccgcgacggc aatggcaaat tataacgcat acggacacgt
      601 agtcgatcca ctggctagaa ggctaattgg acgtgcccgg ccaggatgtc cctgctcatc
      661 ccacagtcga cttaaaagtg ggcgtaacat aacaattgtg ggcgatttgc atacaatttg
      721 gtgctcgagt ggccacctaa ctggcgtcga aatgttgcta cgtcattgtt gctgcccctc
      781 gtgcttcttg gctaataagt cagcatgaat gcggtctgtg ttttggcctt ttgtcccgct
      841 gacaaccgct gacaagcaaa acaaacatat cagcggctga ccagcgacca tttgccgaca
      901 ttgtcaaggt ttttttttca gccccgaata cacctgctaa tctgtggtgg tgactagcgc
      961 gctcggaagg cagaaaaaac tgcagcgaaa atcagttcca ttgaatgtcc tctggcaaca
     1021 tgttgccgca gcacttgttg cgcgcgtcca ctcgcagagg cacctcaaca tcagcctcct
     1081 tgggcgccag cacaaaggac gcaacgcata taaaacagga tgtttcgcca aacatcctca
     1141 cagaactgtt ccaccttcga gcgggcaaca agtgtgtgtg cggcccaaaa ggatccccag
     1201 accttcgaat tcactctagt ttcctagtaa ggggacaggt ttcagaggtg agtgccatgc
     1261 gaaaagtgtg aaaacagtgc tgagccttaa gcagtcgagt tattctttgg aaaagccaat
     1321 agccaatt
//
LOCUS       DROFMRFA2    1603 bp ds-DNA             INV       08-MAR-1990
DEFINITION  D.melanogaster FMRFamide neuropeptide gene, exon 2.
ACCESSION   M32641 J05415
KEYWORDS    FMRFamide neuropeptide.
SEGMENT     2 of 2
SOURCE      D.melanogaster (strain Canton S) DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1603)
  AUTHORS   Schneider,L.E. and Taghert,P.H.
  TITLE     Organization and expression of the Drosophila FMRFamide
            neuropeptide gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.E.Schneider, 07-MAR-1990.
FEATURES       from  to/span     description
    pept        220     1263     FMRFamide neuropeptide precursor
    sigp        220      291     FMRFamide neuropeptide signal peptide
    matp        527      570     FMRFamide neuropeptide A
    matp        571      663     FMRFamide neuropeptide B
    matp        664      696     FMRFamide neuropeptide C
    matp        697      729     FMRFamide neuropeptide D
    matp        730      762     FMRFamide neuropeptide E
    matp        763      795     FMRFamide neuropeptide F
    matp        796      828     FMRFamide neuropeptide G
    matp        829      861     FMRFamide neuropeptide H
    matp        862      894     FMRFamide neuropeptide I
    matp        895      921     FMRFamide neuropeptide J
    matp        922      954     FMRFamide neuropeptide K
    matp        976     1002     FMRFamide neuropeptide L
    matp       1003     1035     FMRFamide neuropeptide M
    matp       1036     1065     FMRFamide neuropeptide N
    pre-msg <     1     1538     FMRFamide neuropeptide mRNA and introns
    IVS     <     1      219     FMRFamide neuropeptide intron
    signal     1404     1408     poly-A signal
    signal     1482     1486     poly-A signal
    signal     1507     1511     poly-A signal
BASE COUNT      409 a    398 c    431 g    365 t
ORIGIN      Undetermined number of bp after segment 1.
        1 gcggctggtt aacttttaac gcgccggccc aacagatctg gccataaata tcatgatggt
       61 taattagcag gacactttgt ctgcgcgtcc tggctgccgg tggctactgt taccctaagt
      121 aaaacgaacg gtcttaatgt gattaaaagt tgcaatagat aacgcgagat gccgccgcct
      181 cataattatc ctgccgtcac tccattctta tccttgcaga tgggcattgc cttgatgttc
      241 ctgctggccc tgtaccagat gcagtcggcc atccacagcg agatcatcga tacgcccaac
      301 tatgcgggca actcgttgca ggacgctgac tccgaggtga gtccatcgca ggacaatgac
      361 ctggtagatg cactgctcgg caacgatcag accgagaggg cggagctgga gttccggcac
      421 cccatctctg tgattggcat cgactactcg aagaacgccg tggtgctgca cttccagaaa
      481 cacggccgga aaccgcgcta caagtacgat cccgagctgg aggccaagcg aaggtccgtg
      541 caggacaact tcatgcactt cggcaagagg caggcggagc agctgccacc ggagggcagc
      601 tatgctgaat ccgatgaact ggagggcatg gccaagcgag cagctatgga tcggtatggc
      661 agagatccca agcaggactt catgcggttt ggtcgggatc cgaaacagga cttcatgagg
      721 tttggcaggg atccaaagca ggacttcatg agattcggtc gggatcccaa gcaggatttc
      781 atgagattcg gtcgagatcc caagcaggat ttcatgaggt ttggacgcac tccggctgag
      841 gatttcatga ggttcggacg cactccggcg gaggacttca tgaggttcgg acgctccgac
      901 aatttcatgc gcttcggacg cagtccccac gaggagcttc gcagtcccaa acaggatttc
      961 atgcgattcg gtcgcccgga caacttcatg cgcttcgggc gttccgctcc gcaggatttt
     1021 gtgcgctccg ggaagatgga ctcaaacttc attcgattcg gtaagagctt gaagccggcg
     1081 gctcccgagt ccaagccagt caagtccaat caaggcaacc caggcgaacg cagtccagtg
     1141 gacaaggcca tgacggagct gttcaagaaa caggagctgc aggatcagca ggtgaagaac
     1201 ggcgcacagg cgaccaccac gcaggatggg agtgtggaac aggaccagtt cttcggccag
     1261 tgaggtagtc ctgcgggacg cctccttgta aatagatatg gacaaatgta cgcaaggatc
     1321 taaattgata tacgtatata acccactcct cacacgaact cctgacttat gcctgaacta
     1381 tgaattttta atgaatgggc tggattaaaa attcaccgtg ctttgaagtt cttatctata
     1441 aatatatcta gtgtaatatt gaagaaattg aaattggcgt gaataaaatc ctgtggcaac
     1501 attttaaata aagattgctt tactgtaaat tatggcgcac tcgaatagcc ttatttcatt
     1561 cggtcgaggt acatattgta tagccattat tagtttctcc act
//
LOCUS       DRVFMRFRN1   3189 bp ds-DNA             INV       08-MAR-1990
DEFINITION  D.virilis FMRFamide-related neuropeptide gene, 5' flank, exon 1.
ACCESSION   M32642
KEYWORDS    FMRFamide-related neuropeptide; neuropeptide;
            secretory granule protein.
SEGMENT     1 of 2
SOURCE      D.virilis DNA.
  ORGANISM  Drosophila virilis
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 3189)
  AUTHORS   Taghert,P.H. and Schneider,L.E.
  TITLE     Inter-specific comparison of a Drosophila gene encoding FMRF
            amide-related neuropeptides
  JOURNAL   J. Neurosci. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.H.Taghert 07-MAR-1990.
FEATURES       from  to/span     description
    pre-msg    2890  >  3189     FMRFamide-related neuropeptide mRNA and
                                 introns
    IVS        3059  >  3189     FMRFamide-related neuropeptide intron A
    site       1211     1222     conserved sequence 1
    site       1225     1236     conserved sequence 2
    site       1300     1416     conserved sequence 3
    site       1491     1504     conserved sequence 4
    site       1513     1533     conserved sequence 5
    site       1636     1668     conserved sequence 6
    site       1695     1724     conserved sequence 7
    site       1426     1438     conserved sequence 8
    site       1814     1864     conserved sequence 9
    site       1888     1908     conserved sequence 10
    site       2116     2128     conserved sequence 11
    site       2144     2159     conserved sequence 12
    site       2635     2646     conserved sequence 13
    site       2674     2688     conserved sequence 14
    site       2754     2853     conserved sequence 15
    site       2691     2704     conserved sequence 16
    site       2041     2050     conserved sequence 17
    site       2856     2933     conserved sequence 18
BASE COUNT     1034 a    630 c    656 g    869 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttctat cagtaaacaa caacaactac cgttattgct caatacctgt agttgttgtc
       61 gctgctgctg atttcccatg cagcttagaa catgacgcaa ctgaacagca gcagcagcag
      121 cagccgccat cgcacaaata tgttcgttta ccgttttgaa tttatgttta tagcagcagc
      181 agcacatgtt gttacgtctt gcaattgttt atggcataaa tctttatagc atacacacac
      241 acatacatgc atgtgtacat gcataaattc cgtgtgtgcc cattaattag agagaattgg
      301 ccatggataa gctggtcggt tacttattag gttaccacct ggtggactcg aaaacaaata
      361 aaactcggca cataaacaaa aaaactaaaa attaagacga aaattcaaca atggcattga
      421 catcaaacac attgcctgcg ctttcaaacg gcccatgaca acaagagcta actaactaaa
      481 tcgtaaaaaa agagctagta aaatccaatt tttttaacca tctaaaatcg taaagcctat
      541 caaaacacgg cttttggcat ttggagcaaa cggttgtcaa gcggagcccc attgagctaa
      601 tcaggttctg ctgttgctca ttgacggaca ccaattttat ttatctattt ttattgactc
      661 ttctagatgt attccatatc cattgttgag ctgtgatatg cttaaaaaag taacgattaa
      721 ctggaaaata ctctttagtg aaatcaacgt aataaaatac ttattggtat ttataataga
      781 acaaatgcaa atacaattat atgatcggaa ataaaatttt tttaagtaat gcatttgaaa
      841 agaaaagaaa aataaatata tttacaatta ggtttctttt attttggtaa cggatgcata
      901 gcctagaaga ggtatatatc gaaaaatatg tctatattta attgtctaat ctgtcaaaaa
      961 tgcaaactga aagattatct aaatattagt tgagcagtaa tggattctag tttatttaac
     1021 tgatttccaa ttacctaaaa gtaaagtaag aaagaagcag ttgatatatt taaaagattg
     1081 cagggcatcg aaaattcgtt gaagtgaact taaagcacgc ccaatcttag tttccgatat
     1141 atataattta agttatttat ttcgtatgtg ttactcactg atggaacagc gcgacggagt
     1201 ccacgctctc ggtcaagaaa acgctttgaa cttgagcgtt gttagcgtta ggtatttatg
     1261 caaatggatt cagcatcatg catatttatc gatgcaaatg tctggctatt gattttcgac
     1321 gatcgttaga gaacaaaaaa aaaaaaaaga agctaaaatg aaataagaat tgaagcgaac
     1381 tgccagacac acaatggttg acacatttct tgcaagtcat ggtccacatc ttggccggac
     1441 gctgggcggg ttcttcaaat tcagacacgc aaaaaacggg ttgaaaacaa aaaacttggc
     1501 ctttcaaagt gtggccgtaa ttacagactt ccgcttacga aaccgaaagt gtccgcccag
     1561 ccaaatggcc aagaagtatt ggccgaagat ggggttgggg ctgggcgcag tcgggcagac
     1621 gttttgggct gctaaaggcg ccacaatgta tcccgttaca ggggccataa aactccataa
     1681 gcgactggcg acaaggccaa attataccgc atacgcacac gtagccatgg tctaagttgt
     1741 ggacccggct aattggacgc gcaccagggg tatggtcagg gggtggtggg gttgtttttg
     1801 gattctctat aaacttaaaa gtgggcggca cataacaatt gtgggcgatt tgcatacaat
     1861 ttggcgcttg gattaccagc aaagcgcgtc aaaatgttgc tgcgtcatcg cgggccaaaa
     1921 accagatgac tttgccatgt gctaaatata aacacatgca catatatata tatatatatg
     1981 tgtatatata atatatatag tatatatgca catgcaagcg gctaataaag atattgtatg
     2041 gccagcacaa tgcagctgtc tgcgggccat tgtttgtctg acagtccgtc tgacagtttt
     2101 tgtctgtcca tcacaaacaa acatatcaca gccaaaatgg caggccgaca ttgtcaaggc
     2161 tggcttctgt tcaaagcttg cagtttaacg ataaatgaat tgcacaaaga caccaaaaaa
     2221 aaaccaacta tatatgctcc agttgcaagt gcttgactag cagataccct gtagccagcg
     2281 acaagctgcc ttgatttgag ttctttgcaa agaaatagtt gatggtcctg ataaataatg
     2341 gggctccaat tggattactt taagcctcct tggagcctgg acctgtccaa agccttctcc
     2401 gttgtgagag tcgaaaaacc caagagcaag caagtttcta gctcacgaaa cttgcgataa
     2461 agaccctctg gtcgggcctc gtagcagcag gagtatttac cagacaggac tcggttgttg
     2521 atgctgatca ggaatatata tatatatata tacttaatgg gcagagcttg ctgacacttt
     2581 gtatcctctt tcacccgttt acaatgggct tagggtataa agagagaagt cgctgcaact
     2641 gttgccagct aacatgttgc ataaacatgc cgcagcaaca cttgttgctg cctcaacaac
     2701 acccacacac acacacacac atacacacac acacacacag ctgcacagca acacgcagag
     2761 gctttggact gccgcctgcc tattatcctg aacaaaggac accgtcggct gtcggccgtc
     2821 ggccggttgt acgttgtcgg ccgccgtcta acggtatata aaaccggatg tttcgccaaa
     2881 catttgcaca gaagtgtttc acattgccac ggtcaacaag tgtcctgacc aaaatcaggt
     2941 agaagaagaa ccacacgagt ggccagtgaa aagtgtttta aaatttccca tttcccaacg
     3001 gaatgtcgat ttgattgcat tgcaatattc gtataatttt aaatatttga aaaaaaaggt
     3061 gagtttttca aaattttgaa gccctgagca aagtgatttg agtgaaatac taaagcttag
     3121 ttaatgggtg gaagaaagct taaatagtct taaaggcttt aaggcttaaa gaaaaagttt
     3181 gaatctaga
//
LOCUS       DRVFMRFRN2   1803 bp ds-DNA             INV       08-MAR-1990
DEFINITION  D.virilis FMRFamide-related neuropeptide gene, exon 2.
ACCESSION   M32643
KEYWORDS    FMRFamide-related neuropeptide; neuropeptide;
            secretory granule protein.
SOURCE      D.virilis DNA.
  ORGANISM  Drosophila virilis
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1803)
  AUTHORS   Taghert,P.H. and Schneider,L.E.
  TITLE     Inter-specific comparison of a Drosophila gene encoding
            FMRFamide-related neuropeptides
  JOURNAL   J. Neurosci. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.H.Taghert 07-MAR-1990.
FEATURES       from  to/span     description
    pept        551     1570     FMRFamide-related neuropeptide precursor
    pre-msg <     1  >  1570     FMRFamide-related neuropeptide mRNA and intron
    IVS     <     1      539     FMRFamide-related neuropeptide intron A
BASE COUNT      471 a    438 c    459 g    435 t
ORIGIN      
        1 tgacggagtg cgaacgttca tatacatcag ctccggcgcc cagcccctgc tcctggcgta
       61 gaccatcagc ttagcgcctt taacaagttg tattaaggtt gttgatggcc gtgtgtgtgt
      121 gtgtgtgtgt gtgacgggga ggggggcaac tcatgttgct gttgttgctg caggtcgatt
      181 gttgttgctg ttgctgttgt tgctgccttt aatgcgcttg gttaactttt tacgcgcaca
      241 ttcatatttc gctcttggcc aggcggatta tcgcataaat attgttatgg ttaattagcg
      301 ggacactttg tctggccggg caaacaaaag gcgttaagac gctgccaccg gagtagcctg
      361 ctgctgcctg gtttttgctg ctgttgccag ttaagtaaaa gtgaatgcgt tcattgcttt
      421 atccatgggg gcgttaatgt gattaaaact tgttatagat aatgcggcaa gtgatttagt
      481 caacacctca taattatgtt caatttctcc ctctctccct ccctttcact ccatctcagc
      541 accttcaatc atgggcatag cactgatgtt cctgctggcc ctctaccaga tgcagtcggc
      601 catccatagc gagattattg agacgccatc cagctataat gataactcgc tgctggaggc
      661 agctgcggag gagcccaatt cgagagccac cgcgtctgag tcggatctgc tggatggact
      721 gatgagcacc gataatccca atccggagca gcaaaccgaa ctggagtttc gctatccgat
      781 ctcggccatt ggcattggtt atgccaagaa ttcggtggtg ttgcgtttcc agaagcacgc
      841 gcgcaaacag aacttcaagt acgatccgga ctacgagatg aagcgcaagt cgctacagga
      901 caatttcatg cactttggca agcggcaggc ggagcagctg cctcaggcaa ctggacccgg
      961 ctactatgag tgcatcaagc gctccgccat ggatcgctac ggacgcgatc ccaagcagga
     1021 ctttatgcgt ttcgggcgtg ctccgccttc ggactttatg cgcttcggac gtgcaccctc
     1081 ggactttatg cgtttcggac gcgatcccag ccaggatttc atgcgcttcg ggcgctccga
     1141 taactttatg cgctttggca ggaacctgaa ctttcacgag gagctgcgca gtcccaaaca
     1201 ggatttcatg cgtttcggtc gcccggacaa tttcatgcgc ttcggccgat ccgcgcccac
     1261 agaattcgag cgtaatggca aaatggactc gaacttcatg cgctttggca agagatcagg
     1321 tgtaatggcc aaattgacca aatcccagct gcagcaaaac aaattgacaa ctgccgatgg
     1381 caagcagcag ccagccgagg agggcaatcc cacggacaag gccatctcga tgctattcaa
     1441 taagcaccag cagcagcagc agcagcagca ggggcagagg ctgcagcagg aagaccgaca
     1501 gcagatgaag agcagcgcag agcagaacaa cttggaggaa gccagcgtgg agcagttcta
     1561 tgagccatag attctaccgc gcacagcaca ctgtaaatat aaattggcca aaaaaaccga
     1621 acactatgat aattgtactt aacactatgt atgtatgtac ttaaacgcat tatagacatt
     1681 gcagaagaaa tttaaaaaag gaaaaaaaaa aacgtaattt gaattttaat ggggctggat
     1741 taaaaattca ccacgctttg atctgatcat aagaaaacaa aagaaaaaaa agtagaaaac
     1801 tct
//
LOCUS       ECACELS      1224 bp ds-DNA             BCT       06-MAR-1990
DEFINITION  E.carotovora beta(1,4)-glucan glucanohydrolase (celS) gene,
            complete cds.
ACCESSION   M32399
KEYWORDS    beta(1,4)-glucan glucanohydrolase; endoglucanase.
SOURCE      E.carotovora carotovora (strain SCC3193) DNA, clone pHSK25.
  ORGANISM  Erwinia carotovora
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1224)
  AUTHORS   Saarilahti,H.T., Henrissat,B. and Palva,E.T.
  TITLE     CelS: A novel endoglucanase identified from Erwinia carotovora
            subsp carotovora
  JOURNAL   Gene (1900) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            H.T.Saarilahti, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        208     1002     endoglucanase (celS) precursor
    sigp        208      303     endoglucanase signal peptide
    matp        304      999     endoglucanase
    binding     196      200     ribosome binding site
BASE COUNT      353 a    278 c    253 g    340 t
ORIGIN      
        1 taaaattaac accctgattt tatttcacat tattcatatt tattaagact ttaatcattt
       61 ccccatgaat tggattattc cgacagagtg gaaataacat aaccagttaa actaaccaca
      121 tctcagactt gatcatagtc acgtgataac gatgtgttaa ctttcagatt aaaacacctc
      181 aagaatatat ttaaaaagag atttattatg cagactgtga atacccagcc acaccgtatt
      241 ttccgcgtat tattgcccgc agtattttct tccttgctac tctcatcact aacagtgtct
      301 gctgccagtt catcaaacga tgcggataaa ctctatttcg gtaataataa atattaccta
      361 ttcaataacg tatggggaaa agatgaaata aaaggatggc agcaaacgat tttctataat
      421 agcccaatta gcatggggtg gaactggcac tggccaagca gtacccacag tgttaaagcc
      481 tatccttcac tggtgagcgg ttggcactgg acggcgggct atacggaaaa tagcggattg
      541 ccgatacaat tatccagtaa taagagtatc accagtaacg ttacttattc tatcaaggct
      601 acgggtactt acaacgccgc ttacgacatt tggttccaca ctaccgataa agccaactgg
      661 gactcgtcgc ctaccgacga attgatgatt tggctaaatg ataccaatgc aggccctgcg
      721 ggtgattata ttgaaaccgt tttcctcggc gacagcagtt ggaacgtatt caaaggctgg
      781 atcaacgcgg ataatggtgg aggttggaat gtcttttcct ttgtccatac ttccggcacc
      841 aacagtgcat cactcaatat tcgtcacttc accgactatt tggtgcagac aaagcaatgg
      901 atgagcgatg aaaaatatat cagcagcgtt gagtttggca ccgagatctt tggcggcgat
      961 gggcagatcg acatcaccga atggcgcgta gacgtaaaat agcgcgtctg gcgttgctat
     1021 tcctccccgt tcagccgcaa taaacacgtt aacgtagagc tgaaacggat tggtctcatc
     1081 gacgttctct tttcattttc agttcagaca gacgttgccc gttctttagg cgggttcgct
     1141 tccccccaca acaaaattcg tgagctgtgg cacgtaaatt tcattagcgt tatgataatc
     1201 ccctcggcta atcaggaatg tccc
//
LOCUS       ECODCM       2472 bp ds-DNA             BCT       26-FEB-1990
DEFINITION  E.coli DNA cytosine methylase (dcm) and patch repair protein genes,
            complete cds.
ACCESSION   M32307
KEYWORDS    DNA cytosine methylase; patch repair protein.
SOURCE      E.coli (K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2472)
  AUTHORS   Sohail,A., Lieb,M., Dar,M. and Bhagwat,A.S.
  TITLE     Gene required for very short patch repair in Escherichia coli
            partly overlaps the DNA cytosine methylase gene
  JOURNAL   Unpublished (1990)Wayne St. Univ. Detroit, MI 48202
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.S.Bhagwat, 23-FEB-1990.
FEATURES       from  to/span     description
    pept        250     1668     DNA cytosine methylase (dcm)
    pept       1649     2119     pot. patch repair protein
BASE COUNT      606 a    620 c    695 g    550 t      1 others
ORIGIN      43 min on K12 map.
        1 gatcatttcc agactaagtt gctgaacgta ccgcaaacca tgcaaaccgc aagaggcaag
       61 cacgtggcgc agcacaacgc gcatttttta gtcgagttta tggcgaagct cagtgccgaa
      121 ctggcggggg agaatgaagg tgtcgatcac aaggtgatag atgcgttttc atccgctggc
      181 tgagcgcgtg gccctaaatg gctgtaatta tgttaacctg tcggccatct cagatggccg
      241 gtgaaatcta tgcaggaaaa tatatcagta accgattcat acagcaccgg gaatgccgca
      301 caggcaatgc tggagaaact gctgcaaatt tatgatgtta aaacgttggt ggcgcagctt
      361 aatggtgtag gtgagaatca ctggagcgcg gcaattttaa aacgtgcgct ggcgaatgac
      421 tcggcatggc accgtttaag tgagaaagag ttcgcccatc tgcaaacgtt attacccaaa
      481 ccaccggcac atcatccgca ttatgcgttt cgctttatcg atctattcgc cggaattggc
      541 ggcatccgtc gcggttttga atcgattggc ggacagtgcg tgtttaccag cgaatggaac
      601 aaacatgcgg tacgcactta taaagccaac cattattgcg atccggcgac gcatcatttt
      661 aatgaagata tccgcgacat caccctcagc cataaagaag gcgtgagtga tgaggcggcg
      721 gcggaacata ttcgtcaaca cattcctgaa cacgatgttt tactggccgg tttcccttgt
      781 cagccatttt cgctggctgg cgtatcgaaa aagaactcgc tcgggcgggc gcacggtttt
      841 gcctgcgata cccagggcac gctgtttttt gatgtggtac gcattatcga cgcgcgtcgt
      901 ccggcgatgt ttgtgctcga aaacgtcaaa aacctgaaaa gtcacgacca gggtaaaacg
      961 ttccgcatca tcatgcagac gctggacgaa ctgggctatg acgtggctga tgcagaagat
     1021 aatgggccag acgatccgaa aatcatcgac ggcaaacatt ttctgccgca gcaccgtgaa
     1081 cgcatcgtgc tggtgggttt tcgtcgcgat ctgaatctga aagccgattt taccctgcgt
     1141 gatatcagcg aatgtttccc tgcgcagcga gtgacgctgg cgcagctgtt ggacccgatg
     1201 gtcgaggcga aatatatcct gacgccggtg ctgtggaagt acctctatcg atatgcgaaa
     1261 aaacatcagg cgcgcggtaa cggcttcggt tatggaatgg tttatccgaa caatccgcaa
     1321 agcgtcacgc gtacgctgtc tgcgcgttat tacaaagatg gcgcggaaat tttaatcgat
     1381 cgcggctggg atatggccac gggtgagaaa gactttgacg atccgctgaa tcagcaacat
     1441 cgtccacgtc ggttaacgcc tcgggaatgc gcgcgcttaa tgggttttga agcgccggga
     1501 gaagcgaaat tccgtattcc ggtttcggac actcaggcct atcgccagtt cggtaactcg
     1561 gtggtcgtgc cggtctttgc cgcggtggca aaactgcttg agccaaaaat caaacaggcg
     1621 gtggcgttgc gtcagcaaga ggcacaacat ggccgacgtt cacgataagg ccactcgcag
     1681 caaaaatatg cgcgcgattg ccacgcgtga tacggcgata gagaagcgcc tcgccagtct
     1741 gttaaccggg cagggcctgg catttcgcgt tcaggacgcc agtctgcccg gacgtccgga
     1801 ttttgtcgtt gatgaatatc gctgcgtgat atttacccat ggctgcttct ggcatcatca
     1861 tcactgctat ctgtttaaag tgcctgcgac tcgaaccgag ttctggctgg agaagatagg
     1921 taaaaatgtt gagcgcgatc gccgcgatat cagtcgcttg caggaactcg gctggcgcgt
     1981 attgattgtc tgggagtgcg cgttacgtgg gcgcgagaag ctgacggatg aagcgcttac
     2041 cgagcgtctg gaagagtgga tctgcggcga aggtgccagc gcgcagatcg acacgcaggg
     2101 gattcattta ctcgcttgat gcgtcctgaa taactggcgc aactacgggt tttgccggga
     2161 agagatattt tcccaacgtg accagtacca ccgcgaagac aattacgccg agcgccagcc
     2221 attcaatctt cgacagtgtt tctccaccca gtcccgtacc cagcaagacc gcgaccaccg
     2281 ggttaacgta agcgtagctg gtggcgagag ccggactgac attacggatt aaatacatat
     2341 aagcgttgat ggcgataatc gaaccaaaca gcgccagata gccgaccgca aggaagcctg
     2401 aaagggaagg gagcgccgtc agtttttnac ccgcaatcat cgacgcgatc attaacacca
     2461 cgcctgccgc cg
//
LOCUS       FCLPOLCYS    3865 bp ss-RNA             VRL       06-MAR-1990
DEFINITION  Feline calicivirus RNA-dependent RNA polymerase, cystein protease,
            and 2C protein,.
ACCESSION   M32296
KEYWORDS    RNA polymerase; cysteine protease; nonstructural protein.
SOURCE      Feline calicivirus (isolate CFI/68 FIV), cDNA to viral RNA, clones
            pCV[2,8].
  ORGANISM  Feline calicivirus
            Viridae; ss-RNA nonenveloped viruses; Caliciviridae.
REFERENCE   1  (bases 1 to 3865)
  AUTHORS   Neill,J.D.
  TITLE     Nucleotide sequence of a region of the feline calicivirus genome
            which encodes picornavirus-like RNA-dependent RNA polymerase,
            cysteine protease and 2C polypeptides
  JOURNAL   Unpublished (1990)Pioneer Hi-Bred International, Johnston, IA 50131
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            J.D.Neill, 23-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1     3865     polyprotein (AA at 2)
    matp    <     1      451     2C-like protein
    matp       1802     2242     cysteine protease-like protein
    matp       2521     3762     RNA-dependent RNA polymerase-like protein
BASE COUNT     1057 a    865 c    894 g   1049 t
ORIGIN      
        1 ggttcctgta tgttacattc ttactggacc acctggatgc gggaaaacaa ctgcagcgca
       61 ggcattagcc aagaagttat ctgaccaaga accatcggtc atcaaccttg atgtagacca
      121 ccatgatacg tacactggca atgaggtgtg cataattgat gaatttgatt cctctgataa
      181 ggttgattat gcaaattttg ttattggaat ggttaactcg gcaccaatgg tactaaattg
      241 tgatatgctg gaaaataagg gcaagttgtt cacctcaaag tacataataa tgacttccaa
      301 ttctgaaaca cctgtcaaac catcctccaa acgcgctggc gcattttatc ggagagttac
      361 ctatcatgat gttgctacct tggttgaatc gcacaagcgt gcacggcctg gtactgctgt
      421 cccccgtagt tgctacaaga aaaacttttc acatctatcg cttgctaaac gaggcgctga
      481 gtgctggtgc aaagagtatg tccttgatcc taagggattg caacaccaga gcacaaaggc
      541 tcctccccca accttcctta atattgattc tttggcccaa acaatgaagc aagactttgc
      601 tttaaagaac atggcatttg aagctgaggt aggatgtagt gaacatcggt atggatttgt
      661 atgtcagcag agtgaagttg aaacagttcg ccgactcctt aatgccatta gaatgaggct
      721 taacgctacg tttaccgttt gcgttggcct tgaagcttct aattccgttg ggtgtactgc
      781 gcacgtttta acacctgacg aaccattcaa tggaaaaagg ttcgttgtgt cacgctgcaa
      841 tgaggcatcg ctatccgcac tagaaggaaa ctgcgttcaa actgcattag gagtgtgcat
      901 gtccaataag gacctcactc acctgtgtca cttcattaag gggaaaattg tcaatgatag
      961 tgtcaggtta gatgaactac ccgccaatca acatgtggta accgttaatt cggtgtttga
     1021 tttggcctgg gctcttcgtc gtcactctac actaactggt cagttccaag caatcagagc
     1081 cgcatatgat gtgcttcatg tccctgacaa ggtccccgcc atgttgcggc actggatgga
     1141 tgaaacctcc ttctctgatg aacacgtggt aacgcagttc attacacctg gtggagttgt
     1201 aattcttgag tcgtgtggcg gtgcgcgcat ctgggcactt ggtaacaatg ttatcagggc
     1261 tggaggcgtc actgccatac ccactggggg atgtgttagg ttaatgggac tctcagctca
     1321 aacaatgcca tggtctgaaa ttctttcgga actcttctct ctgctaggga aaatctggtc
     1381 tagtgttaaa gtctctaccc ttattctcac cgctctcagc atgtatgcat ctaggttcag
     1441 gcctaaaaca gaagcaaagg gaaaaacaaa atccaaaatt ggtccatata gaggccgagg
     1501 tgtggcactt accgatgatg agtacgacga atggaaagaa cacaatgccg ctaggaagtt
     1561 ggacctctca gttgaagatt ttctaatgtt gcgtcaccgt gccgcactag gggcggatga
     1621 cactgatgcc gtcaaattta ggtcatggtg gaattcaagg tctagattgg ctgatgactt
     1681 tgaagacgtc accgtgattg gtaagggtgg cgtaaagcat gagaaaatta gaacaaatac
     1741 cctcagagcc gttgaccgtg gctacgatgt cagctttgcc gaggagtcag ggcccggggc
     1801 aaaattccac aaaaacgcaa ttgggtcagt cactgacgtc tgtggagaac acaagggcta
     1861 ttgtgttcac atgggtcacg gggtctacgc ctcagtagca cacgtggtta agggtgactc
     1921 tttctttctg ggtgaaagaa tctttgactt aaaaactaat ggtgagttct gttgcttccg
     1981 tagcactaag atcttgccta gtgctgctcc tttcttttct ggaagaccaa cgcgtgaccc
     2041 ttggggatct cctgtagcaa ctgactggaa accaaaacct tactcaacca catcggggaa
     2101 aattgtaggg tgtttcgcaa caacatcaac tgaaacccac cctggcgact gtggcctccc
     2161 gtacattgac gacaatggta gagtcactgg gttgcacaca ggctctgggg gacccaagac
     2221 cccaagtgca aaattggttg ttccctacgt acatattgac atgaaaacta aatctgtcac
     2281 tgctcaaaag tatgatgtca ctaaaccaga tattagttat aaaggtttag tttgtaaaca
     2341 attggatgaa attagaatta tcccaaaagg taccaggcta cacgtatctc ctgcacattt
     2401 agaagatttt gaggaatgct ctcaccagcc tgcctctcta ggtagtggtg atcctcggtg
     2461 tcccaaatcg ctcactgcaa ttgttgttga ctctcttaag ccctactgtg tggtagttaa
     2521 tgggcccccc catgatatct tacatagggt acaaaagatg ctaattgatc atctttctgg
     2581 ttttgtcccc atgaacatat catctgatac ctccatgctt tctgctttcc acaaactcaa
     2641 ccatgataca tcttgtgggc catatctagg tggtagaaag aaagatcaca tggtcaatgg
     2701 agaacctgac aaggcgctat tagatttgtt gtcttcaaaa tggaagttgg caacacaggg
     2761 aattgcactc ccacatgagt acacaattgg tttgaaagat gaactaaggc ccattgaaaa
     2821 agtgcaagaa gggaaaagaa ggatgatctg gggctgtgat gtcggcgttg ccactgtatg
     2881 tgcggctgca tttaaaggtg ttagtgacgc cattacggca aaccatcaat atggccctat
     2941 acaggttggc attaacatgg acagtcccag tgttgaggca cttttccaaa ggatcaaaag
     3001 cgcacgaaag gtgtttgctg ttgattactc aaagtgggat tcaacccaat ccccccgcgt
     3061 tagtgccgct tcaattgaca tactgcggta cttttctgat cggactccaa ttgttgattc
     3121 ggccacaaac acgcttaaaa gcccccccat tgctgtattt aatggagtag ctgtcaaggt
     3181 gtcctctggt ctcccatctg gcatgcccct cacatctgta atcaactcgc tcaatcactg
     3241 tttatatgtt ggctgtgcaa tcttgcaatc tttggaagct cgtaacgtgc ctgtcacttg
     3301 gaatcttttc tcaactttcg acatgatgac ttatggtgat gatggtgttt acatgttccc
     3361 cacaatgtat gcaagtatta gtgaccaaat ttttgccaat ctttctgcat atggtcttaa
     3421 gcccacaaga gttgacaaat cagttggttc aattgaacca attgatccta attcagttgt
     3481 cttcctaaaa agaacaatta cacgaacccc ccaaggaata cgaggcttgc ttgatcgcag
     3541 ctcaattctc aggcaattct actacatcaa gggtgagaac acagataatt ggaaagagcc
     3601 cccaaaaact atagatccaa tgtcaagagg gcaacaattg tggaatgcat gtctgtatgc
     3661 aagtcagcat ggtattgact tttataataa ggtatacaaa ttggccgaaa aagcagtcga
     3721 gtacgaagga ttgcatcttg agcccccttc ttacagcaca gccttggagc attacaacag
     3781 ccaatttaat ggtgtggagg cgcggactga ccagatcgat acgagtggca tggccgccct
     3841 tcactgtgat gtgttcgaag tttga
//
LOCUS       HUMGALOPS     794 bp ds-DNA             PRI       06-MAR-1990
DEFINITION  Human 1,3-galactosyltransferase pseudogene, 3' end.
ACCESSION   J05421
KEYWORDS    1,3-galactosyltransferase; pseudogene.
SOURCE      Human peripheral blood leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 794)
  AUTHORS   Larsen,R.D., Rivera-Marrero,C.A., Ernst,L.K., Cummings,R.D. and
            Lowe,J.B.
  TITLE     Frameshift and nonsense mutation in a human genomic sequence
            homologous to a murine UDP-gal-beta-D-gal alpha-1,3
            galactosyltransferase cDNA
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            J.B. Lowe. 02-MAR-1990.
FEATURES       from  to/span     description
    pept.ps /    63      755     pseudo-1,3-galactosyltransferase
    site        394      394     frameshift mutation
    site        475      475     frameshift mutation
    site        427      429     in-frame stop codon
    site        708      710     in-frame stop codon
BASE COUNT      227 a    165 c    181 g    221 t
ORIGIN      
        1 cagcttgtgg tttctttcag gaatcccaga ggataaatgt tttgcttttc ttctttgttt
       61 cagatataat gatcattact tggaggagtt cataacatct gctaataggt acttcatggt
      121 tggccacaaa gtcatatttt acatcatggt ggatgatgtc tccaagctgc cgtttataga
      181 gctgggtcct ctgcattcct tcaaaatgtt tgaggtcaag ccagagaaga ggtggcaaga
      241 catcagcatg atgcgtatga agatcactgg ggagcacatc ttggcccaca tccaacacga
      301 ggtcgacttc ctcttctgca tggatgtgga ccaggtcttc caagaccatt ttggggtgga
      361 gaccctaggc cagtcagtgg ctcagctaca ggctggcggt acaaggcaga tccctatgac
      421 tttacctagg agaggtggaa agagtcagca ggatacattc catttggcca ggggattttt
      481 attaccatgc agccatttct ggaggaacac ccattcaggt tctcaacatc acccaggagt
      541 gctttaaggg aatcctcctg gacaagaaaa atgacataga agccaagtgg catgatgaaa
      601 gccacctaaa caagtatttc cttctcaata aaccctctaa aatcttatcc ctaaaatact
      661 gctgggatta tcatataggc ctgccttcag atattaaaac tgtcaagtga tcgtggcaga
      721 caaaagagta taatttggtt agaaataatg tctgacttca aattgtgcca gtagatttct
      781 gaatttaaga gaga
//
LOCUS       MEFTGSRNA    3367 bp ds-DNA             BCT       06-MAR-1990
DEFINITION  M.fervidus 7S RNA, Ser-tRNA, 16S rRNA and Ala-tRNA genes.
ACCESSION   M32222
KEYWORDS    16SrRNA; 7S RNA; transfer RNA-Ala; transfer RNA-Ser.
SOURCE      Methanothermus fervidus.
  ORGANISM  Methanothermus fervidus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanothermaceae.
REFERENCE   1  (bases 1 to 3367)
  AUTHORS   Haas,E.S., Brown,J.W., Daniels,C.J. and Reeve,J.N.
  TITLE     Genes encoding the 7S RNA and a Ser-tRNA are linked to one of the
            two rRNA operons in the genome of the extremely thermophilic
            archaebacterium Methanothermus fervidus
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.J.Daniels, 02-FEB-1990.
FEATURES       from  to/span     description
    pept        132      887     ORF 260
    RNA         944     1258     7S RNA
    tRNA       1273     1359     Ser-tRNA
    rRNA       1566     3046     16S rRNA
    tRNA       3105     3178     Ala-tRNA
BASE COUNT      898 a    751 c    998 g    720 t
ORIGIN      
        1 ggagacagtg gttcaccagt catacaagat ggtaaggttg tgggtaatat tgtcttcaat
       61 atattacaca aatgctacag ctatagcatc ttccctagaa aaatttttag aaaggaagta
      121 atgtggtggc catgaggatt ggatttattg gatttggaga agtaagttct actttatcac
      181 aattttttaa ggataaggtg gaagtacaaa catgtgttaa aggtaggagt gaaaaaacta
      241 aaaaaattgc aaaaaaactt ggagtaaaga tatacaaaga ttataaagac cttgtaaaaa
      301 attctgacat tgttatttca gctgttactc cattttcagc tttagatgtg gctaaaaaat
      361 atggaaaata tgtaaaaggt atatatgtag atgtgaataa tgtatcacct ctaacaaaac
      421 ataaaatttt gaaatatatt gacgaggaaa aatttgttga ttgtgcaata attggaagaa
      481 taaaacgtaa atttaaaatg atttgttcag gtaaaaatgc aaataaactt aaaatattag
      541 aaaaatttgg agtacctatt gaagttatag gttcaaaggt tggggaagcc tcaacattaa
      601 aaatgctcag aagtttatat actaaaagtc ttgctgcaat attattagaa gttttttcgg
      661 ttgctaataa attaggactt atcgatgaat tacttgaaat attagaggaa acagaaggga
      721 agaaatttgt agatttatgt aagtcaagag tggtaggatc atttattcac tcaaggagaa
      781 gatatgaaga aatatgtgaa atagaaaaat ttatattatc tcataatttg aaaccaataa
      841 tgatcaaatg tacaaaaaat atgtttaaac atatagagga atgttgacaa agattatatc
      901 aaaaaattta aataagattg aaaaataaaa tataaaatgg cagcgaggct aggccggggg
      961 gttaggggtc ccctgtaagc gcaaatcccc tatatggcgc ggccgaagcc caggaggcgg
     1021 caagaccgcc agacatcggc ctgagggtta aacaatgaag cctcgtccca cagggccacc
     1081 ggtggcgagg gtccagctgg agggctggac ctaatcgcct ttgctgcggg aacgggtcag
     1141 gcccggaagg gagcagccct accgcagacg gatggtgctt gtgggtcaac ggggtggagt
     1201 ctataaccct cagatcaccg gtgtctggtg gtcttgtcca ctcctgggcg tgcctcgctg
     1261 ccaaatgtct tcagtagccg gggtggccca gcctggtagg gcgtcggcct gctaagccga
     1321 tgatccgtta aggatcgcgc gggttcaaat cccgtccccg gcgtgcaata ttttctaatt
     1381 tttttgtatt gatagtgttg ctaggtatgt aatgccatgg taccggcgtg gattcggagt
     1441 gcctatttag gctcgatgaa gaatcgcact ccagccatgg caacaaaatt ggtgagagcc
     1501 tgagtgaata actcaggctt gatacatagg tcccacgggt gctggtaaag tcccaaagga
     1561 atcccccgcc aactccgttt gatcctggcg gaggccactg ctatgggggt ccgactaagc
     1621 catgcaagtc gaacgggcct tgtgcccgtg gcgaacggct cagtaacacg tggacaacct
     1681 accctggggt ccgggataac cccgggaaac tggggctaat cccggatagg cgaggtctcc
     1741 tggaatgggg cctcgccgaa aggtcttttg accgccccag gatgggtctg cggccgatta
     1801 ggtagttggt agggtaacgg cctaccaagc ctacgatcgg tacgggttgt gagagcaaga
     1861 gcccggagac ggggcctgag acaaggcccc gggccctacg gggcgcagca ggcgcgaaaa
     1921 ctccgcaatg cgcgaaagcg cgacgggggg acccccagtg ccactccgta aggagtggct
     1981 tttccggagt gtaaaaagct ccgggaataa gggctgggca agaccggtgc cagccgccgc
     2041 ggtaacaccg gcagcccgag tggtggccgc gtttattggg cctaaagcgt ccgtagccgg
     2101 tccggtaagt ctccggtgaa agcccgcagc tcaactgcgg gagtagccgg agatactgcc
     2161 ggacttgggg ccgggagagg ccggaggtac ccccggggta ggggtgaaat cctgtaatcc
     2221 cggggggacc acctgtggcg aaggcgtccg gctggaacgg gcccgacggt gagggacgaa
     2281 agccagggga gcgaaccgga ttagataccc gggtagtcct ggccgtaaac gatgcggact
     2341 tggtgttggg gcaacctcga gttgccccag tgccgaaggg aagccgttaa gtccgccgcc
     2401 tggggagtac ggccgcaagg ctgaaactta aaggaattgg cgggggagca ccacaacgcg
     2461 tggagcctgc ggtttaattg gattcaacgc cggacacctc accgggggcg acggcaggat
     2521 gatggccagg ttgatgacct tgcctgacga gccgagagga ggtgcatggc cgccgtcagc
     2581 tcgtaccgtg aggcgtcctg ttaagtcagg caacgagcga gacccgcgcc cctagttgcc
     2641 agcgggtccc gtaagggacg ccgggcacac tagggggacc gccagcgata agctggagga
     2701 aggtgcgggc gacggtaggt ccgtatgccc cgaaaccccc gggctacacg cgggctacaa
     2761 tggccgggac aatgggtacc gaccccgaaa gggggaggta atcccataaa cccggccgta
     2821 gttcggatcg agggctgcaa ctcgccctcg tgaagctgga atgcgtagta atcgcgggtc
     2881 actatcccgc ggtgaatacg tccctgctcc ttgcacacac cgcccgtcac gccacccaaa
     2941 cggggttcgg atgaggccat ggcctctggc catggtcgaa tccgggcccc gtgaggaggg
     3001 cgaagtcgta acaaggtagc cgtaggggaa cctgcggctg gatctaacct ccttaaaata
     3061 ataatattcc tttgggacac caaaagcacc cgtgggacct tttttatttt tgggcccata
     3121 gctcagcctg ggagagcgcc gcccttgcaa ggcggaggcc ccgggttcaa atcccggtgg
     3181 gtccatgcaa ctagccatct tatgataggc tagtgaagct agatgatcaa agatgattct
     3241 aggtgcatag gtagttcgtg gacaccaact gtagggaact ggtccaaggt ttaagccgcc
     3301 cggtggatgg ctcggcttgg gcgccgacga aggccgtggc aagctgcgat aagcccgggt
     3361 gaggcgc
//
LOCUS       STASAU3AIM   3360 bp ds-DNA             BCT       06-MAR-1990
DEFINITION  S.aureus Sau3AI-restriction-enzyme and Sau3AI-modification-enzyme
            genes, complete cds.
ACCESSION   M32470
KEYWORDS    Sau3AI-modification-enzyme; Sau3AI-restriction-enzyme.
SOURCE      S.aureus (strain 3AI) DNA.
  ORGANISM  Staphylococcus aureus
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Micrococcaceae.
REFERENCE   1  (bases 1 to 3360)
  AUTHORS   Seeber,S., Kessler,C. and Goetz,F.
  TITLE     Cloning, expression and characterization of the Sau3AI restriction
            and modification genes in Staphylococcus carnosus TM300
  JOURNAL   Unpublished (1990) Univ. of Tuebingen, D-7400 Tuebingen 1
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.t Seeber, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        448     1917     Sau3AIR protein (ttg start codon)
    pept       2017     3255     Sau3AIM protein
    binding     437      441     ribosome binding site
    binding    2004     2010     ribosome binding site
BASE COUNT     1295 a    393 c    638 g   1034 t
ORIGIN      
        1 aagtcgacct tcaccaagac cgaattttcc gcgtataccc gcagtaccaa acgttaattt
       61 actttcaaaa ccttctcgct gttcaatgtc agattgctgc tcataaaaat gtttaactaa
      121 actatcatta gctctttcta tccataattc tttatccatt gttgctaaac atcctttcaa
      181 aatctcagtt agacttaata aaacatgaaa actaaagccc ttacatttat gtaatgaatt
      241 ataaagaaat acgccccaaa agtaaaaaaa cacagcccca agacaatact tttcacaagt
      301 attatataat agatgtgtat gaaaatgcat ggagtagatg taagagtgat attcaaaatg
      361 tgtaaaaaat atggataatt ctatataatt atattattga aattttaaat agcggaaata
      421 attatttaat gttaagaggg gataattttg gaaagttatt tgacaaaaca agccgtacat
      481 aatcgcgcaa aagaagctgt tggtaaaagt gtattagaat taaatggtgg tgaatcgatt
      541 aaacaaagta agagttcagt tggtgatgca tttgaaaatt ggtttggtaa gaaaaaagac
      601 agtgatagta aaccagatat ggcagaagct ggggtggaac ttaaggcaac gccatttaaa
      661 aagttgaaaa acggaaagta tagctccaaa gaaagattag tattaaatat tataaactat
      721 gagaaagtgg caaatgaaaa ttttgaaact agtagttttt tatctaagaa taatactata
      781 gaattagctt tctatgaata tatcaaggga acacctagtg ataattggat tattaaagaa
      841 gcggtgcttt atgaaatgca taaaaacccg attgattatg aaataattaa acaagattgg
      901 gaaataataa atcaatatat taatgaagga aaggcacatg aattgagtga aggtttgaca
      961 agttatttag cgccatgtac aaagggtgcg aatgctagtt ctttaagaaa tcagccttat
     1021 tcagacataa aagcaaagca aagagcattt tctttgaagt ctgggtatat gacatctatt
     1081 ttacgcaaat atgttctagg tgatgaaaaa atagattcaa ttgtcaaaga cccatttgaa
     1141 ataaaagaaa aatcaataga ggacatagtc tttgaaaaat ttcagccata tataaactgg
     1201 tcaatcgata aattatgcga acatttttct atcaataaag gtgagaaagg tttaaattat
     1261 agaatagcct ctgccatttt aaatctaaaa ggtaaaacta ctaaaagtaa accattcccg
     1321 gaagttgaag agtttgaaaa atcatctata gtagtcaaaa cagttcattt taataaaaag
     1381 aatgtgaata aagaaagtat gtcatttgga gcttttaaat ttgaagaact agctaatgag
     1441 gaatgggaag atagtgaagg atatcctagt gcacaatggc gaaacttttt gttagaaaca
     1501 aggtttttat tttttgttgt taaagaagat gaagatggtg tagacatatt caaaggaata
     1561 aaatttttta gtatgcctga agaagacata aacggacctg tcaaaagaat gtgggatgat
     1621 acagtgaaga aattaaaaga gggtgtcaca ttagaagctg taccggacaa aagtacaaag
     1681 gatggttgga gaataaaaaa taattttgta gataaaagtg atgatttaat ttgccatgtt
     1741 agaccacaca ctaataacag agactatcgt ggaggaagta atgcagataa gcttcctaaa
     1801 aagattaact ggattaatag acctgactca gatgattatt cggatgagtg gatgactaaa
     1861 caaagttttt ggataaataa tgactacata aaaaagcaag ttgaagattt attgtagtta
     1921 aagtatgtta aaatataaga tattctttta aaatatcgaa cgatcgttcg tattttgtgt
     1981 tataataagg ttgaattaag tataggaggt cgcctaatga ataaaattaa agtagtagaa
     2041 ttgtttgcgg gtgtaggcgg gtttcgttta ggtttagaaa atacgaaaaa tggtatattt
     2101 gacataactt gggcaaatca atgggagccc tcacgaaaaa tccaacatgc atttgattgt
     2161 tatagtaaaa gatttaagaa cggcatccat agtaataagg atattgccca ggtatctgat
     2221 gaagaaatgg caaatactga agctgatatg attgtaggag gatttccttg ccaagattat
     2281 tcagttgcaa ggagtttaaa tggagaatta ggaatacaag gaaaaaaggg cgttttattc
     2341 tggcaaatta ttagatatat tcaaaataca tttcctaaat acttgttgct tgaaaatgtt
     2401 gatagattat tgaaatcacc ttcgagtcag agagggagag actttgctgt aatgttatca
     2461 accttaaatg agttaggcta taatgttgaa tggcgcgtga ttaatgctgc tgattatggc
     2521 aatgctcaaa gacgtagaag ggtatttata tttggatata agcaagattt aaactatagc
     2581 aaagctatgg aagaaagtcc gttggataaa attatatatc acaatggttt gtttgctgaa
     2641 gcttttccga ttgaagatta tgccaataaa aatagagtaa ataggactca tattactcat
     2701 gatatagtcg atatttcaga taatttcagt tttcaatttt ataatagtgg aatcatgaaa
     2761 aatggagaaa ttttaactat tgacacaata ccaaaatatg aaaaatcagt aaccttagga
     2821 gaaattattg aaagtaatgt agatgatggt ttttcattaa atcaagatca aattgataaa
     2881 tttaaatatt taagaggacc caaaaagatt aaacgaacta ctaaagatgg tcatgaatat
     2941 tatttttcag aaggtggtat gtctgaaaca gattcattag agttacctgc aagaacaatg
     3001 cttacaagtg aatcatctat taatagaagt actcattttt taaacgtaga tggtgtttat
     3061 agaactttga cacctattga agcagaaagg ttgaatgggt ttccagataa ttggacagaa
     3121 ggtatgccaa ttaagatgag atacttttgt atgggcaatg ctcttgttgt gcctttgatt
     3181 actagaatag gtaatcaaat tgaaaaaatt gatagtatta caaatgatga attcagtcag
     3241 ctacgtttat tttaaataaa tcaaatgtag agtgcgtttg atttaacaag tttgtatact
     3301 aaagattcat agattattgt atattgacca gtatttatca gcgtattatt ttaatatata
//
LOCUS       TRPTYF1      1321 bp ds-DNA             BCT       06-MAR-1990
DEFINITION  T.pallidum pallidum antigen TyF1 gene, complete cds.
ACCESSION   M32401
KEYWORDS    antigen.
SOURCE      T.pallidum pallidum (strain Nichols) DNA.
  ORGANISM  Treponema pallidum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; 
            Spirochaetales; Spirochaetaceae.
REFERENCE   1  (bases 1 to 1321)
  AUTHORS   Noordhoek,G.T., Hermans,P.W.M., Paul,A.N., Schouls,L.M.,
            van der Sluis,J.J. and van Embdon,J.D.A.
  TITLE     Treponema pallidum subspecies pallidum (Nichols) and Treponema
            pallidum subspecies pertenue (CDC 2575) differ in at least one
            nucleotide: Comparison of two homologous antigens
  JOURNAL   Microb. Pathog. 6, 29-42 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            G.T. Noordhoek, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        615     1148     antigen TyF1
    variant     736      736     g in T.pallidum pallidum; a in T.pallidum
                                 pertenue
BASE COUNT      349 a    297 c    353 g    322 t
ORIGIN      
        1 aacagcacgc gggcccccgg actacatcgc tcttgtgaac ctcaacggtc gcctttccat
       61 actctaagga ataggagcac gagtacgtat cccccggtag tttccccgcc ttccgaacta
      121 aaataagagg tattcccatg cgatctgcaa aaggcgcggc aaaaataaag ccacgtgatt
      181 cgattgctgc gaccgcggta acgtgctcat cgcggtagaa ttccaccatt tgatcaagac
      241 agtaacgaaa tacagccgcg ttcatcagca cgccagtaat gtcgtagtag agaattcctt
      301 ttttagggaa atcaatccgc ttacgaattg cgcggtccag cgccgcgtgt ccgtccatag
      361 gggcatggta acgtccaata ccacgcacgt caatgatctt accggtttgt tgggaggctt
      421 ggtggattga gaattacgtc tcctggaaaa aagatttcgc tgaaacttca cgaaatctcg
      481 gtgaaaataa atgattattt taccaatcgg tgaaaaaaag ccgggaaaag tccaaaaaga
      541 cagtggttat gctccatttc tttcgatttt ttgttggcat ggtttttgct ttaaagtttg
      601 gaggagaaag aacgatgaac atgtgtacag atggaaaaaa ataccacagc accgccacga
      661 gcgctgcagt tggagccagc gcccccggtg taccggacgc tcgtgccatt gctgctatct
      721 gcgagcaatt gcgccggcat gtagcggatc tgggagtact gtatatcaag ctacataact
      781 atcactggca catctacggc attgagttta aacaggtgca tgagctcctt gaagagtatt
      841 atgtatcagt tactgaagcc tttgatacga ttgccgagcg gttgttacag ctgggcgcgc
      901 aggctcctgc gtctatggct gaataccttg cgttgagtgg aattgcagaa gagacggaga
      961 aagagatcac tatcgtctct gcgcttgcgc gcgtaaagcg ggattttgaa tacctaagta
     1021 cgcgattcag ccaaacgcaa gtacttgcag ctgaaagtgg ggatgcagtg actgacggca
     1081 ttatcacaga catactgagg acgttgggaa aggccatttg gatgcttggt gctaccctga
     1141 aagcctaggt agagcaggct gtacgtacaa cacacgtacg gccatgcgct ggaagtcctg
     1201 tattttgcac ataaggcctc tctcccgtta cagcatgagg ggagggaggt gttggttgaa
     1261 gtgcttgggg aagtgtgcat aatcgtccta cggaaggggg cgttttgtgg aaaaaattgt
     1321 t
//
LOCUS       YS5MTII      1257 bp ds-DNA             PLN       07-MAR-1990
DEFINITION  C.glabrata metallothionein (MTII) gene, complete cds.
ACCESSION   J05398
KEYWORDS    metallothionein.
SOURCE      C.glabrata (ATCC 2001) DNA.
  ORGANISM  Candida glabrata
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina.
REFERENCE   1  (bases 1 to 1257)
  AUTHORS   Mehra,R.K., Garey,J.R. and Winge,D.R.
  TITLE     Selective and tandem amplification of a member of the
            metallothionein gene family in Candida glabrata
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.R.Garey 07-MAR-1990.
FEATURES       from  to/span     description
    pept        631      789     metallothionein (MTII)
    rpt        1246  >  1257     amplifiable repeat
BASE COUNT      420 a    255 c    199 g    383 t
ORIGIN      
        1 ccccgggatc gtttagaccc caagaatcca agatatgttt tagctgacag ggtgcatggc
       61 tactccgtga tttaccatat gtccatcgca ttcgtcatgg cctgatgcca atgaccatac
      121 tctttagact cggatccgag aagtcatccc tatccatacg atatcttctt ccgagtccat
      181 tagataaggc aacgctagat ttagctgatt gattgccctg agaattaaag ggaatcagcg
      241 aaaatagaaa aaaaggtttt ggatggatat ttttgttctt tttttgagac attggtgtca
      301 ctgccacagc ttaataacat ggcaatcagc agtgtctaga ttcagcggaa tttagtggaa
      361 gcgaaatgaa ttcggctgac ttagtgccta tcagtaatat acatatttgc tttttgcttt
      421 ggcattgaat ttatcaataa ctgcataccc aaaattaaaa ataaaataga aaaaatagta
      481 atatataaag cacccgtaac ttcccatttc tgggaaactt ggaattcatt tctcccattt
      541 atcctttctt ctatatatcg aatcaacaca tcaacaatac caacaaactt caacttatac
      601 acaacatcta atatttatta tagcttcgaa atgcctgaac aagtcaactg ccaatacgat
      661 tgccactgct ccaactgtgc ttgtgaaaat acttgcaact gctgtgccaa gccagcatgt
      721 gcttgcacaa actctgcttc caatgaatgc tcctgccaaa cttgcaagtg tcaaacatgc
      781 aagtgctaaa cagcattcaa agaagaataa tttctagtat ttttgctatt gttgctttat
      841 ttttcatgat tattgataag gtactgggta tcttctgcta atgacgctga actagctcta
      901 tatgctagct atatacatta cgtataaaaa aatataaaaa tataaaaata taaaaaaaaa
      961 ataaaaaaat cacttcaaaa actataacaa ctaccaagtt tctattttct ctttgactaa
     1021 atcaaatcga tctagtaaac attgcagaac aatttgtgta attatcgtct gatctaaggg
     1081 ccggcgtaag caattctaac cgaccacact acgttactta acaaccacac ttgctcttga
     1141 atacgaaaca gcaagctcta cctatatgaa atggtctaca aaagaagttt tctctagtac
     1201 tctgagaagg ttatattata catcattgaa atgcttaaag ctgatttgca tcccggg
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATA2UGLB     506 bp ds-DNA             ROD       13-FEB-1990
DEFINITION  Rat alpha-2u globulin gene, exon 1.
ACCESSION   M33690 M32014
KEYWORDS    alpha-2u globulin.
SOURCE      Rat (strain Wistar-Lewis) adult liver DNA, clone 1H.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 138 to 506)
  AUTHORS   Shaw,P.H., Walter-Sierra,R., Tamone,F. and Schibler,U.
  TITLE     Rapid identification of DNA fragments containing promoters for RNA
            polymerase II
  JOURNAL   Gene 84, 371-381 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 137)
  AUTHORS   Tamone,F.
  JOURNAL   Unpublished (1990) Dept. Mol. Biol. U. of Geneva, Geneva, SWITZ.
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Tamone, 07-FEB-1990.
FEATURES       from  to/span     description
    pept        436  >   506     alpha-2u globulin, exon 1
    mRNA        377  >   506     alpha-2u globulin mRNA
    signal        1      376     promoter
BASE COUNT      146 a    120 c    117 g    123 t
ORIGIN      Chromosome 5.
        1 cttcttccac actccactca agtcttgaag caaagattct ctccatgctt gaccttccag
       61 ttcagtaccc acccactcca taactggttt tctgattcta agtcagatcc aatgttgcat
      121 gtggttccag atggccctct agaacaccca ctgtttttct tgggaatatg ttttgcgaaa
      181 tgtatgagtg atagaatcaa tccataggag atgacatcgc caagtttcaa aagggcagga
      241 acaatccttg gcttcacatc agtacatgag gaaaacattc cacaaagcct gaaggatgga
      301 ggcccatatg agaaggaaaa aaaaacactc aaacccagag agagtataaa gacgagcaaa
      361 gtgctggagg tggagtgtgg gcaccatcag cagagggatt gtcccgacag agaggcaatt
      421 ctattcccta ccaacatgaa gctgttgctg ctgctgctgt gtctgggcct gaccctggtc
      481 tgcggccatg cagaagaagc tagttt
//
LOCUS       RATA2UGLBA    371 bp ds-DNA             ROD       13-FEB-1990
DEFINITION  Rat alpha 2u globulin pseudogene, exon 1.
ACCESSION   M33652 M32014
KEYWORDS    alpha-2u globulin.
SOURCE      Rat (strain Wistar-Lewis) adult liver DNA, clone 1F.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 371)
  AUTHORS   Shaw,P.H., Walter-Sierra,R., Tamone,F. and Schibler,U.
  TITLE     Rapid identification of DNA fragments containing promoters for RNA
            polymerase II
  JOURNAL   Gene 84, 371-381 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F. Tamone, 07-FEB-1990.
FEATURES       from  to/span     description
    pept.ps     301  >   371     pseudo-alpha-2u globulin, exon 1
    mRNA        243  >   371     alpha-2u globulin mRNA
BASE COUNT      119 a     71 c     96 g     85 t
ORIGIN      Chromosome 5.
        1 tctagaacat ccattgtttt tcttgggaat atgttttgtg aaatgtatga gtgataaaat
       61 catccatagg agatgagatt gcccaagttt gaaaagggca ggaacaatcc ttggcttcac
      121 atcagtacat gagaaaacat tccacaaagc ctgagggaag gaggcccata tgagaaggaa
      181 aaattaaaaa acactcgacc cagagagagt ataaagagga gcaaagtgct tgaggtggag
      241 tgtgggcacc atcaacagag agagtgtccc gacagagagg caattctatt ccctaccaac
      301 atgaagctgt agctgctgct gctgtgtctg ggcctgaccc tggtctgtgg ccatgcagaa
      361 gaagctaatt c
//
LOCUS       RIRSTPSTA    2906 bp ds-DNA             BCT       12-FEB-1990
DEFINITION  R.tsutsugamushi heat shock protein 11 (stp11) and major antigen 58
            (sta58) genes, complete cds.
ACCESSION   M31887
KEYWORDS    antigen; heat shock protein; heat shock protein 11;
            major antigen 58; outer membrane protein.
SOURCE      R.tsutsugamushi (strain KARP) DNA, clone pRTS58H2.9.
  ORGANISM  Rickettsia tsutsugamushi
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE   1  (bases 1 to 2906)
  AUTHORS   Stover,C.K., Marana,D.P., Dasch,G.A. and Oaks,E.V.
  TITLE     Molecular cloning and sequence analysis of the sta58 major antigen
            of Rickettsia tsutsugamushi: Sequence homology and antigenic
            comparison to the 60 kilodalton family of stress proteins
  JOURNAL   Unpublished (1990) Walter Reed Hospital, Washington,D.C. 20307
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.V.Oaks, 02-FEB-1990.
FEATURES       from  to/span     description
    pept        780     1064     heat shock protein 11
    pept       1137     2804     major antigen 58
    binding     770      775     ribosome binding site
    signal      675      680     -35 promoter
    signal      700      705     -10 promoter
BASE COUNT     1083 a    354 c    599 g    870 t
ORIGIN      
        1 aagctttgca acattaattg ccaattcttt tgacagtaat gtaattgcgg ttagaaaaag
       61 tgactgaaac ctcagaaaaa agaattgtag gagttaattg cttaatacaa tatttacatc
      121 taatagtaaa acatttgact ttcttagttt taattttagg ctagtgaaaa ctgatcaatc
      181 taaaattaaa attcagaaaa atgtctagaa cgagagcaca aacttctctg ctaaaaatac
      241 taaaagaaat attctgcaag tttatattgt aaccaattga tagagtaata tatcttatca
      301 accagatatt acatggcttg ataaattact ttaggattgg taactctagc agttgtttta
      361 gatatgtaat ggacttgttt gataagatag taagacatca tctaatgcaa tcaagtggat
      421 tgaaaggctt ttgctggaaa aggttgtgta gaagaaagtt atatgaaaag gtggaattat
      481 ataatgatta ccaaattcga tagatctgtg gtaagcaata acttctgata ggctataagc
      541 aatgatatag ctaaaccagt ataaaacagt tatgttatat actttttcaa attaattaag
      601 aaaaacaatt agtgcgatat taatggttta aactcagcgt ataaaagaaa aatataaaaa
      661 ttttttattc acacttgaaa taagtagtaa cagaagatat atgattaggg catagtaaaa
      721 gtatagatta actaattatg ctaatttaat tatatagagt taaatttttg gaggattata
      781 tgaaatacca accactgtat gatcgtgtgc tagttgagcc aatacaaaat gatgaagcac
      841 atggaaaaat tcttattcca gataccgcaa aggagaaacc aacagaagga atagtagtta
      901 tggttggcgg tggctataga aatgataaag gtgatattac accgttaaaa gtaaagaaag
      961 gagatactat agtctataca aaatgggctg gtactgaaat aaaattagaa agtaaagact
     1021 atgtagttat taaagaatca gatatattat tagttaaaag ctaaaatgct taatgtatta
     1081 ataattaaag gatgctagac taatcaaatt ataaaatatt ttaaggagtt ttcagtatga
     1141 gtaaacaaat tgtacatggc gatcaatgtc gtaaaaaaat aatagaaggc ataaatgttg
     1201 tagctaatgc agtaggtata acattaggac ccaaaggtag atgtgtagca atagaacaat
     1261 cttatggtcc tcctaaaatt actaaagatg gagttagtgt tgctaaagca attcaactta
     1321 aggataaatc tcttaatgtt ggagcccaat ttgttatatc agttgctagt aaaacagctg
     1381 atgtggctgg tgatggtaca actactgcta cagttatagc tgatgctgca gttagggagc
     1441 taaataaagc tgaagttgca ggcattgata tacaggaagt aagaaaaggc gctgagaaag
     1501 ctgttgaagc agttattgct gatgtacgta aaaacagcag ccctgttaaa aatgaggaag
     1561 aaattgctca agtagctact gtttcatcta atggagaccg cgaaattggt gaaaagattg
     1621 ctaatgcaat gaaacaagtt ggccaagaag gagtgataac tgttgaagat tctaaaaatt
     1681 ttaactttga agttgaagtt gttaaaggaa tgagatttga ccgcggatat atatcacagt
     1741 actttgcaac gaatcgtgaa aagatgatta ccgaatttga aaatccttat attttgctat
     1801 tggatcagaa ggtatctaca gtgcagccac tggttcctgt gcttgaagct gttgctcaca
     1861 ctggcaagcc attagtattg attgctgatg atgtagacgg agaagctctt actgcattga
     1921 tattaaacaa tttaaaaggt tctattaaag tagtagctgt taaggcacct ggttttggtg
     1981 atcgaaaaaa agaaatgtta gaagatattg ctattttaac aaatggagaa gtaattactg
     2041 aacaactagg aatcaaacta gaaaaggtta atgatacatc taagttagga actgcaaata
     2101 gagttattgt tacaaaagat catactacta tagttcatga taagaataat agtgatatag
     2161 agaagaaagt taattctcgt tgtgaacaga tccgtgaagc aattaaagat acaacttcag
     2221 attatgaaaa agaaaaatta caagagcgct tagcaaaact tagaaacggt gtagctgtac
     2281 taaaagttgg tggtgctact gaagtagaac aaaaagaacg taaggataga gtagaagatg
     2341 ctttacatgc tactagagca gctgttgaag aaggtatagt gcctggtggt ggtgttgcgc
     2401 tattttatgc ttctcgtgtt ttagatagtt tgaagtttga taatgaagat caaagggtag
     2461 gtattaacat tatcaaaaaa gtattagaag ctccggtaag acaaatagtt aaaaatgctg
     2521 gaggcaaaga ggatgttgtt gttaatgaac taagcaagag tacagataaa aaccgtggat
     2581 tcgatgctag aacaatgcaa tacgttgata tgataaaagc aggtattgtg gatccaacta
     2641 aagttgtacg tactgcgtta caagatgctt tttctgtggc ttcattagta attgctacct
     2701 cagctatgat tactgatcat gaggaagata ataatactgg taaccgcagt ggtggtggag
     2761 taggtggtgg acatcatggt ggtatgggcg gaatggattt ctaatccgcc cccatgtcta
     2821 ccttcaaaaa aaatagagta taacttttgt ataaggttgc tagttatgta tttttttaca
     2881 aaattgcagt ataatatttc aagctt
//
LOCUS       YSCTUP1A     2891 bp ds-DNA             PLN       12-FEB-1990
DEFINITION  S.cerevisiae TUP1 protein gene, complete cds.
ACCESSION   M31733
KEYWORDS    regulatory protein.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2891)
  AUTHORS   Williams,F.E. and Trumbly,R.J.
  TITLE     Characterization of TUP1: A mediator of glucose repression in
            Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990) Dept. Biochem. Ohio Med. Coll., Toledo, OH 43699
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            R.J.Trumbly, 30-JAN-1990.
FEATURES       from  to/span     description
    pept        509     2650     TUP1 protein
    mRNA        234     2780     TUP1 mRNA
    signal      108      112     TATA box
BASE COUNT      899 a    700 c    530 g    762 t
ORIGIN      Chromosome III.
        1 tgtacatccc aaagacataa acatatgtat ataagaacaa cataaacaaa taaataaacc
       61 aggaaagccc taatttaccc ttctttttgg ctgagcaatc taggagttaa taattttttg
      121 gaactttccc cctaacattt ttagagacag aaccctctta cattatcgct accgacggac
      181 ctttaatttt tcgttacttc tcaaaaaaca ctgaaaaact gcgaaaaaat taaaaagctc
      241 tcccgtcaaa gcaacaaaca acaaaacaaa ataacaagta cataagtaaa caaatagaga
      301 gaaattcccc catctacccc caattgataa acatcagcga agcaagaaca actggctgaa
      361 cacgtgcccc tctatctgtc cttctgcaat tgtttaattt tctttcacgc caagttacct
      421 ttcgcacacc actactcttt ttctattgtt tttttgtctt ttttgataag caggggaaga
      481 aagaaatcag ctttccatcc aaaccaatat gactgccagc gtttcgaata cgcagaataa
      541 gctgaatgag cttctcgatg ccatcagaca ggagtttctc caagtctcac aagaggcaaa
      601 tacctaccgt cttcaaaacc aaaaggatta cgatttcaaa atgaaccagc agctggctga
      661 gatgcagcag ataagaaaca ccgtctacga actggagcta actcacagga aaatgaagga
      721 cgcgtacgaa gcagagatca agcacttgaa actagggctg gagcaaagag accatcaaat
      781 tgcatctttg accgtccagc aacagcagca acagcaacag cagcaacagg tccagcagca
      841 tttacaacag caacaacagc agctagccgc tgcatctgca tctgttccag ttgcgcaaca
      901 accaccggct actacttcgg ccaccgccac tccagcagca aacacaacta ctggttcgcc
      961 atcggccttc ccagtacaag ctagccgtcc taatctggtt ggctcacagt tgcctaccac
     1021 cactttgcct gtggtgtcct caaacgccca acaacaacta ccacaacagc aactgcaaca
     1081 gcagcaactt caacaacagc aaccacctcc ccaggtttcc gtggcaccat tgagtaacac
     1141 agccatcaac ggatctccta cttctaaaga gaccactact ttaccctctg tcaaggcacc
     1201 tgaatctacg ttgaaagaaa ctgaaccgga aaataataat acctcgaaga taaatgacac
     1261 cggatccgcc accacggcca ccactaccac cgcaactgaa actgaaatca aacctaagga
     1321 ggaagacgcc accccggcta gtttgcacca ggatcactac ttagtccctt ataatcaaag
     1381 agcaaaccac tctaaaccta tcccaccttt ccttttggat ctagattccc agtctgttcc
     1441 cgatgctctg aagaagcaaa caaatgatta ttatatttta tacaacccgg cactaccaag
     1501 agaaattgac gttgagttac acaaatcttt ggatcatact tcagttgttt gttgcgtgaa
     1561 gttcagtaac gatggtgaat acttagccac aggctgcaac aaaactactc aagtgtatcg
     1621 cgtttcagac ggttctctgg tggcccgttt atctgacgat tctgctgcca ataaccatcg
     1681 aaattcgatc actgaaaata acaccaccac gtccacggat aacaatacaa tgacaaccac
     1741 tactaccacc acaattacta ccacagcgat gacttcggca gcagaattgg caaaagatgt
     1801 ggaaaacctg aacacttcgt cttccccatc atccgacttg tatatccgtt cagtgtgttt
     1861 ttctccagat gggaaatttt tggcaacagg tgctgaagac agactgatta gaatttggga
     1921 tattgaaaat agaaagattg ttatgattct tcaaggccac gaacaagata tttattcatt
     1981 ggactacttt ccctcaggtg acaaattagt ctccggttct ggtgaccgta ccgttcgtat
     2041 ttgggactta cgtacaggcc agtgttcatt gactttatcc attgaagatg gtgttaccac
     2101 cgtcgctgta tcaccaggtg atggtaaata catcgctgct ggttctctag atcgtgctgt
     2161 gagagtttgg gattccgaga ccggattctt ggtggaaaga ctagattcgg aaaacgaatc
     2221 cggtacaggc cacaaggact ctgtttatag cgttgtcttc acgagagatg gacaaagcgt
     2281 tgtatccggc tcattagata gatctgttaa gctctggaat ttgcagaatg caaacaacaa
     2341 gagcgattcg aaaactccaa attccggcac ttgtgaagtt acgtatatcg ggcataaaga
     2401 ctttgtattg tccgtggcca ccacacaaaa tgatgagtac atcttgtccg gttccaaaga
     2461 tcgtggtgtc ctgttttggg ataagaaatc cggcaatccg ttattgatgt tgcaaggtca
     2521 taggaattca gttatatctg tggctgtggc aaacgggtct tcgctgggtc cagaatataa
     2581 cgtttttgct actggtagcg gtgattgtaa agcaaggatt tggaagtata aaaaaatagc
     2641 gccaaattaa gaacagaaca caaaaggaac actttacaaa tgtaactaac taaactattc
     2701 ttcaattcat tgtcattata tctttctctc tattcttttt tatctactta tttgttatgt
     2761 tatgatccag gtacgcttta aaacaaacag ttactttcct caccattcat cccttcccct
     2821 ccttttcctg tgtctgaata ttgcggttca ttattctttg atatgaattt atttgtgtat
     2881 tgtataagct t
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       PFAMSAA       433 bp ds-DNA             INV       13-FEB-1990
DEFINITION  P.falciparum (strain B358) major surface antigen p190 gene,
            partial cds.
ACCESSION   M32111
KEYWORDS    major surface antigen p190.
SOURCE      P.falciparum (strain B358) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 433)
  AUTHORS   Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
  TITLE     Genetic diversity in the major merozoite surface antigen of
            Plasmodium falciparum: High prevalence of a third polymorphic form
            detected in strains derived from malaria patients
  JOURNAL   Unpublished (1990) Inst. Pasteur, Paris Cedex 15
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A. Scherf, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  >   433     major surface antigen p190 (AA at 1)
BASE COUNT      148 a     75 c     83 g    127 t
ORIGIN      
        1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atggtttatt tcataaggaa
       61 aaaatgatct taaatgaaga agaaattact acaaaaggtg caagtgctca aggttcaagt
      121 ggtccaagtg gtacaccaag tggtacaagt ggtccaagtg gtacaagtgg tccaagtggt
      181 acaagtggtc caagtggtac aagtccatca tctcgttcaa acactttacc tcgttcaaat
      241 acttcatctg gtgcaagccc tccagctgat gcaagcgatt cagatgctaa atcttacgct
      301 gatttaaaac acagagtacg aaattacttg ttcactatta aagaactcaa atatcccgaa
      361 ctctttgatt taaccaatca tatgttaact ttgtgtgata atattcatgg tttcaaatat
      421 ttaattgatg gat
//
LOCUS       PFAMSAB       388 bp ds-DNA             INV       13-FEB-1990
DEFINITION  P.falciparum (strain B342) major surface antigen p190 gene,
            partial cds.
ACCESSION   M32112
KEYWORDS    major surface antigen p190.
SOURCE      P.falciparum (strain B342) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 388)
  AUTHORS   Kimura,E., Mattei,D., Mana di Santi,S.A. and Scherf,
  TITLE     Genetic diversity in the major merozoite surface antigen of
            Plasmodium falciparum: High prevalence of a third polymorphic form
            detected in strains derived from malaria patients
  JOURNAL   Unpublished (1990) Inst. pasteur, Paris Cedex 15
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A. Scherf, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  >   338     major surface antigen p190 (AA at 1)
BASE COUNT      138 a     63 c     70 g    117 t
ORIGIN      
        1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa
       61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagcctgc aggtgctgta
      121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact
      181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat
      241 gctaaatctt acgctgattt aaaacataga gctcaaaatt acttgttcac tattaaagaa
      301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttacc taaaaatgtt
      361 gatggtttca aatatttaat tgatggat
//
LOCUS       PFAMSAC       379 bp ds-DNA             INV       14-FEB-1990
DEFINITION  P.falciparum (strain B608) major surface antigen p190 gene,
            partial cds.
ACCESSION   M32113
KEYWORDS    major surface antigen p190.
SOURCE      P.falciparum (strain B608) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 379)
  AUTHORS   Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
  TITLE     Genetic diversity in the major merozoite surface antigen of
            Plasmodium falciparum: High prevalence of a third polymorphic form
            detected in strains derived from malaria patients
  JOURNAL   Unpublished (1990) Inst. Pasteur, Paris Cedex 15
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A. Scherf, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  >   379     major surface antigen p190 (AA at 1)
BASE COUNT      135 a     64 c     62 g    118 t
ORIGIN      
        1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atggtttatt tcataaggaa
       61 aaaatgctct taaatgaaga agaaattact acaaaaggtg caagtgctca aagtggtaca
      121 agttgtacaa gtggtccaag tggtacaagt ccatcatctc gttcaaacac tttacctcgt
      181 tcaaatactt catctggtgc aagccctcca gctgatgcaa gcgattcaga tgctaaatct
      241 tactctgatt taaaacacag agtacgaaat tacttgttaa ctatcaaaga actcaaatat
      301 cctcaactct tttatttaac taatcatatg ttgactttat ctaaaaatgt tgatggtttc
      361 aaatatttaa ttgatggat
//
LOCUS       PFAMSAD       388 bp ds-DNA             INV       13-FEB-1990
DEFINITION  P.falciparum (strain B440) major surface antigen p190 gene,
            partial cds.
ACCESSION   M32114
KEYWORDS    major surface antigen p190.
SOURCE      P.falciparum (strain B440) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 388)
  AUTHORS   Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
  TITLE     Genetic diversity in the major merozoite surface antigen of
            Plasmodium falciparum: High prevalence of a third polymorphic form
            detected in strains derived from malaria patients
  JOURNAL   Unpublished (1990) Inst. Pasteur, Paris Cedex 15
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A. Scherf, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  >   388     major surface antigen p190 (AA at 1)
BASE COUNT      138 a     62 c     69 g    119 t
ORIGIN      
        1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa
       61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagccagt acctgctgta
      121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact
      181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat
      241 gctaaatctt acgctgattt aaaacataga gttcaaaatt acttgttcgc tattaaagaa
      301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttatc taaaaatgtt
      361 gatggtttca aatatttaat tgatggat
//
LOCUS       PFAMSAE       388 bp ds-DNA             INV       13-FEB-1990
DEFINITION  P.falciparum (strain B411) major surface antigen p190 gene,
            partial cds.
ACCESSION   M32115
KEYWORDS    major surface antigen p190.
SOURCE      P.falciparum (strain B411) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 388)
  AUTHORS   Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
  TITLE     Genetic diversity in the major merozoite surface antigen of
            Plasmodium falciparum: High prevalence of a third polymorphic form
            detected in strains derived from malaria patients
  JOURNAL   Unpublished (1990) Inst. Pasteur, Paris Cedex 15
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A. Scherf, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  >   388     major surface antigen p190 (AA at 1)
BASE COUNT      138 a     61 c     70 g    119 t
ORIGIN      
        1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa
       61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagcctgc aggtgctgta
      121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact
      181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat
      241 gctaaatctt acgctgattt aaaacataga gttcaaaatt acttgttcac tattaaagaa
      301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttatc taaaaatgtt
      361 gatggtttca aatatttaat tgatggat
//
LOCUS       PFAMSAF       388 bp ds-DNA             INV       14-FEB-1990
DEFINITION  P.falciparum (strain B439) major surface antigen p190 gene,
            partial cds.
ACCESSION   M32116
KEYWORDS    major surface antigen p190.
SOURCE      P.falciparum (strain B439) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 388)
  AUTHORS   Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A.
  TITLE     Genetic diversity in the major merozoite surface antigen of
            Plasmodium falciparum: High prevalence of a third polymorphic form
            detected in strains derived from malaria patients
  JOURNAL   Unpublished (1990) Inst. Pasteur, Paris Cedex 15
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A. Scherf, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  >   388     major surface antigen p190 (AA at 1)
BASE COUNT      138 a     61 c     70 g    119 t
ORIGIN      
        1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa
       61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagcctgc aggtgctgta
      121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact
      181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat
      241 gctaaatctt acgctgattt aaaacataga gttcaaaatt acttgttcac tattaaagaa
      301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttatc taaaaatgtt
      361 gatggtttca aatatttaat tgatggat
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       HPEPP        2570 bp ss-RNA             VRL       13-APR-1990
DEFINITION  Hepatitis E virus polyprotein (encoding NTP-binding protein and
            RNA-dependent RNA polymerase) mRNA, 3' end.
ACCESSION   M32400
KEYWORDS    NTP-binding protein; NTP-binding protein; polyprotein.
SOURCE      Hepatitis E virus (strain Burma; enterically transmitted non-A,
            non-B), cDNA to viral RNA, (host human liver and bile).
  ORGANISM  Hepatitis E virus
            Viridae; ss-RNA nonenveloped viruses.
REFERENCE   1  (bases 1 to 2570)
  AUTHORS   Reyes,G.R., Purdy,M.A., Kim,J.P., Luk K,-C., Young,L.M.,
            Fry,K.E. and Bradley,D.
  TITLE     Isolation of a cDNA from the virus responsible for enterically
            transmitted non-A, non-B hepatitis
  JOURNAL   Science 247, 1335-1339 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2570)
  AUTHORS   Tam,A.W., Smith,M.M., Kim J,P., Young,L.M., Piatak,M.,
            Feldman,R.A., Purdy,M.A., Bradley,D.W., Reyes,G.R. and Fry,K.E.
  TITLE     Hepatitis E virus: A positive strand RNA virus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Authorin entry and computer-readable sequence for [1] kindly
            submitted by K.E.Fry, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    <     1     2184     polyprotein (NTP-binding protein/RNA-dependent
                                 polymerase region) (AA at 1)
BASE COUNT      429 a    800 c    711 g    630 t
ORIGIN      
        1 gttgttcagt accagtttac tgcaggtgtg cctggatccg gcaagtcccg ctctatcacc
       61 caagccgatg tggacgttgt cgtggtcccg acgcgtgagt tgcgtaatgc ctggcgccgt
      121 cgcggctttg ctgcttttac cccgcatact gccgccagag tcacccaggg gcgccgggtt
      181 gtcattgatg aggctccatc cctcccccct cacctgctgc tgctccacat gcagcgggcc
      241 gccaccgtcc accttcttgg cgacccgaac cagatcccag ccatcgactt tgagcacgct
      301 gggctcgtcc ccgccatcag gcccgactta ggccccacct cctggtggca tgttacccat
      361 cgctggcctg cggatgtatg cgagctcatc cgtggtgcat accccatgat ccagaccact
      421 agccgggttc tccgttcgtt gttctggggt gagcctgccg tcgggcagaa actagtgttc
      481 acccaggcgg ccaagcccgc caaccccggc tcagtgacgg tccacgaggc gcagggcgct
      541 acctacacgg agaccactat tattgccaca gcagatgccc ggggccttat tcagtcgtct
      601 cgggctcatg ccattgttgc tctgacgcgc cacactgaga agtgcgtcat cattgacgca
      661 ccaggcctgc ttcgcgaggt gggcatctcc gatgcaatcg ttaataactt tttcctcgct
      721 ggtggcgaaa ttggtcacca gcgcccatca gttattcccc gtggcaaccc tgacgccaat
      781 gttgacaccc tggctgcctt cccgccgtct tgccagatta gtgccttcca tcagttggct
      841 gaggagcttg gccacagacc tgtccctgtt gcagctgttc taccaccctg ccccgagctc
      901 gaacagggcc ttctctacct gccccaggag ctcaccacct gtgatagtgt cgtaacattt
      961 gaattaacag acattgtgca ctgccgcatg gccgccccga gccagcgcaa ggccgtgctg
     1021 tccacactcg tgggccgcta cggcggtcgc acaaagctct acaatgcttc ccactctgat
     1081 gttcgcgact ctctcgcccg ttttatcccg gccattggcc ccgtacaggt tacaacttgt
     1141 gaattgtacg agctagtgga ggccatggtc gagaagggcc aggatggctc cgccgtcctt
     1201 gagcttgatc tttgcaaccg tgacgtgtcc aggatcacct tcttccagaa agattgtaac
     1261 aagttcacca caggtgagac cattgcccat ggtaaagtgg gccagggcat ctcggcctgg
     1321 agcaagacct tctgcgccct ctttggccct tggttccgcg ctattgagaa ggctattctg
     1381 gccctgctcc ctcagggtgt gttttacggt gatgcctttg atgacaccgt cttctcggcg
     1441 gctgtggccg cagcaaaggc atccatggtg tttgagaatg acttttctga gtttgactcc
     1501 acccagaata acttttctct gggtctagag tgtgctatta tggaggagtg tgggatgccg
     1561 cagtggctca tccgcctgta tcaccttata aggtctgcgt ggatcttgca ggccccgaag
     1621 gagtctctgc gagggttttg gaagaaacac tccggtgagc ccggcactct tctatggaat
     1681 actgtctgga atatggccgt tattacccac tgttatgact tccgcgattt tcaggtggct
     1741 gcctttaaag gtgatgattc gatagtgctt tgcagtgagt atcgtcagag tccaggagct
     1801 gctgtcctga tcgccggctg tggcttgaag ttgaaggtag atttccgccc gatcggtttg
     1861 tatgcaggtg ttgtggtggc ccccggcctt ggcgcgctcc ctgatgttgt gcgcttcgcc
     1921 ggccggctta ccgagaagaa ttggggccct ggccctgagc gggcggagca gctccgcctc
     1981 gctgttagtg atttcctccg caagctcacg aatgtagctc agatgtgtgt ggatgttgtt
     2041 tcccgtgttt atggggtttc ccctggactc gttcataacc tgattggcat gctacaggct
     2101 gttgctgatg gcaaggcaca tttcactgag tcagtaaaac cagtgctcga cttgacaaat
     2161 tcaatcttgt gtcgggtgga atgaataaca tgtcttttgc tgcgcccatg ggttcgcgac
     2221 catgcgccct cggcctattt tgttgctgct cctcatgttt ttgcctatgc tgcccgcgcc
     2281 accgcccggt cagccgtctg gccgccgtcg tgggcggcgc agcggcggtt ccggcggtgg
     2341 tttctggggt gaccgggttg attctcagcc cttcgcaatc ccctatattc atccaaccaa
     2401 ccccttcgcc cccgatgtca ccgctgcggc cggggctgga cctcgtgttc gccaacccgc
     2461 ccgaccactc ggctccgctt ggcgtgacca ggcccagcgc cccgccgttg cctcacgtcg
     2521 tagacctacc acagctgggg ccgcgccgct aaccgcggtc gctccggccc
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATTNT      17249 bp ds-DNA             ROD       20-MAY-1987
DEFINITION  Rat fast skeletal TnT gene encoding troponin T isoforms, complete
            cds.
ACCESSION   M15202 M14514 M14515 M14516 M14517 M14518 M14519 M14520 M14521
            M14522 M14523 M14524 M14525 M14527 M14528 M14529 M14530 M14531
            M10059 M14532 M14533 M14534 K02429 J00794
KEYWORDS    alternate splicing; troponin.
SOURCE      Rat adult skeletal muscle DNA [2],[4], clone pTnT-1 [2]; myogenic
            cell line L6E9, cDNA to mRNA, clone pTnT-15 [1],[2],[3].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 12798 to 16893; exons 12, 13, 14, 15, 17, 18)
  AUTHORS   Garfinkel,L.I., Periasamy,M. and Nadal-Ginard,B.
  TITLE     Cloning and characterization of cDNA sequences corresponding to
            myosin light chains 1, 2, and 3, troponin-C, troponin-T,
            alpha-tropomyosin, and alpha-actin
  JOURNAL   J. Biol. Chem. 257, 11078-11086 (1982)
  STANDARD  full staff_review
REFERENCE   2  (bases 13478 to 16906)
  AUTHORS   Medford,R.M., Nguyen,H.T., Destree,A.T., Summers,E. and
            Nadal-Ginard,B.
  TITLE     A novel mechanism of alternative RNA splicing for the
            developmentally regulated generation of troponin T isoforms from a
            single gene
  JOURNAL   Cell 38, 409-421 (1984)
  STANDARD  full staff_review
REFERENCE   3  (bases 313 to 16893; exons and intron/exon boundaries)
  AUTHORS   Breitbart,R.E., Nguyen,H.T., Medford,R.M., Destree,A.T.,
            Mahdavi,V. and Nadal-Ginard,B.
  TITLE     Intricate combinatorial patterns of exon splicing generate multiple
            regulated troponin T isoforms from a single gene
  JOURNAL   Cell 41, 67-82 (1985)
  STANDARD  full staff_review
REFERENCE   4  (bases 1 to 17249)
  AUTHORS   Breitbart,R.E. and Nadal-Ginard,B.
  TITLE     Complete nucleotide sequence of the fast skeletal troponin T gene:
            Alternatively spliced exons exhibit unusual interspecies divergence
  JOURNAL   J. Mol. Biol. 188, 313-324 (1986)
  STANDARD  full staff_review
REFERENCE   5  (bases 14620 to 14621; sequence correction)
  AUTHORS   Breitbart,R.E.
  JOURNAL   Unpublished (1990) Children's Hosp., Boston MA 02115
  STANDARD  full staff_review
COMMENT     Other exon combinations, besides the ones given in the Features
            table, are possible in theory, but no mRNAs corresponding to the
            right lengths were found [3].  There still is a question if there
            might be an alternate exon 9, which would again double the possible
            exon combinations and proteins.  Even though no proteins have been
            identified for each and every possible protein described below,
            mRNAs of the appropriate lengths have been identified.  The
            variation in mRNAs encoding tropomyosin T may be due to different
            half-lives at particular developmental stages [3].
            
            Distinct similarities are found among exons 3, 4, 5, 6, and 8 [4].
            These exons may have originated by gene duplication [4].  The
            variable region of troponin T is represented by exons 4-8 and the
            constant regions by exons 2 and 3, 9-15 and 18 [3],[4].  Exons 16
            and 17 are mutually exclusive [3],[4].  When exon 16 is translated
            the alpha forms of the proteins are found, and when exon 17 is
            expressed the beta forms are produced [3],[[4].
            
            Class Ia mRNAs are expressed only after the first postnatal week;
            class IVb mRNAs are found in L6E9 myotubes, but not in the fetus
            and are then increasingly expressed postnatally.  Class IIa mRNAs
            increase continually through 14 days.
FEATURES       from  to/span     description
    pept       3146     3162     tropomyosin T class Ia alpha-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ia alpha-1, exon 3
               4313     4330     tropomyosin T class Ia alpha-1, exon 4
               5538     5555     tropomyosin T class Ia alpha-1, exon 5
               5737     5751     tropomyosin T class Ia alpha-1, exon 6
               6722     6733     tropomyosin T class Ia alpha-1, exon 7
               6935     6949     tropomyosin T class Ia alpha-1, exon 8
               9703     9721     tropomyosin T class Ia alpha-1, exon 9
              11980    12025     tropomyosin T class Ia alpha-1, exon 10
              12591    12707     tropomyosin T class Ia alpha-1, exon 11
              12795    12872     tropomyosin T class Ia alpha-1, exon 12
              13082    13195     tropomyosin T class Ia alpha-1, exon 13
              13295    13404     tropomyosin T class Ia alpha-1, exon 14
              13515    13605     tropomyosin T class Ia alpha-1, exon 15
              14613    14653     tropomyosin T class Ia alpha-1, exon 16
              16666    16720     tropomyosin T class Ia alpha-1, exon 17
    pept       3146     3162     tropomyosin T class Ia alpha-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ia alpha-2, exon 3
               5538     5555     tropomyosin T class Ia alpha-2, exon 4
               5737     5751     tropomyosin T class Ia alpha-2, exon 5
               6722     6733     tropomyosin T class Ia alpha-2, exon 6
               6935     6949     tropomyosin T class Ia alpha-2, exon 7
               9703     9721     tropomyosin T class Ia alpha-2, exon 8
              11980    12025     tropomyosin T class Ia alpha-2, exon 9
              12591    12707     tropomyosin T class Ia alpha-2, exon 10
              12795    12872     tropomyosin T class Ia alpha-2, exon 11
              13082    13195     tropomyosin T class Ia alpha-2, exon 12
              13295    13404     tropomyosin T class Ia alpha-2, exon 13
              13515    13605     tropomyosin T class Ia alpha-2, exon 14
              14613    14653     tropomyosin T class Ia alpha-2, exon 15
              16666    16720     tropomyosin T class Ia alpha-2, exon 16
    pept       3146     3162     tropomyosin T class Ib alpha-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ib alpha-1, exon 3
               4313     4330     tropomyosin T class Ib alpha-1, exon 4
               5737     5751     tropomyosin T class Ib alpha-1, exon 5
               6722     6733     tropomyosin T class Ib alpha-1, exon 6
               6935     6949     tropomyosin T class Ib alpha-1, exon 7
               9703     9721     tropomyosin T class Ib alpha-1, exon 8
              11980    12025     tropomyosin T class Ib alpha-1, exon 9
              12591    12707     tropomyosin T class Ib alpha-1, exon 10
              12795    12872     tropomyosin T class Ib alpha-1, exon 11
              13082    13195     tropomyosin T class Ib alpha-1, exon 12
              13295    13404     tropomyosin T class Ib alpha-1, exon 13
              13515    13605     tropomyosin T class Ib alpha-1, exon 14
              14613    14653     tropomyosin T class Ib alpha-1, exon 15
              16666    16720     tropomyosin T class Ib alpha-1, exon 16
    pept       3146     3162     tropomyosin T class Ib alpha-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ib alpha-2, exon 3
               5737     5751     tropomyosin T class Ib alpha-2, exon 4
               6722     6733     tropomyosin T class Ib alpha-2, exon 5
               6935     6949     tropomyosin T class Ib alpha-2, exon 6
               9703     9721     tropomyosin T class Ib alpha-2, exon 7
              11980    12025     tropomyosin T class Ib alpha-2, exon 8
              12591    12707     tropomyosin T class Ib alpha-2, exon 9
              12795    12872     tropomyosin T class Ib alpha-2, exon 10
              13082    13195     tropomyosin T class Ib alpha-2, exon 11
              13295    13404     tropomyosin T class Ib alpha-2, exon 12
              13515    13605     tropomyosin T class Ib alpha-2, exon 13
              14613    14653     tropomyosin T class Ib alpha-2, exon 14
              16666    16720     tropomyosin T class Ib alpha-2, exon 15
    pept       3146     3162     tropomyosin T class Ic alpha-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ic alpha-1, exon 3
               4313     4330     tropomyosin T class Ic alpha-1, exon 4
               5538     5555     tropomyosin T class Ic alpha-1, exon 5
               6722     6733     tropomyosin T class Ic alpha-1, exon 6
               6935     6949     tropomyosin T class Ic alpha-1, exon 7
               9703     9721     tropomyosin T class Ic alpha-1, exon 8
              11980    12025     tropomyosin T class Ic alpha-1, exon 9
              12591    12707     tropomyosin T class Ic alpha-1, exon 10
              12795    12872     tropomyosin T class Ic alpha-1, exon 11
              13082    13195     tropomyosin T class Ic alpha-1, exon 12
              13295    13404     tropomyosin T class Ic alpha-1, exon 13
              13515    13605     tropomyosin T class Ic alpha-1, exon 14
              14613    14653     tropomyosin T class Ic alpha-1, exon 15
              16666    16720     tropomyosin T class Ic alpha-1, exon 16
    pept       3146     3162     tropomyosin T class Ic alpha-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ic alpha-2, exon 3
               4313     4330     tropomyosin T class Ic alpha-2, exon 4
               6722     6733     tropomyosin T class Ic alpha-2, exon 5
               6935     6949     tropomyosin T class Ic alpha-2, exon 6
               9703     9721     tropomyosin T class Ic alpha-2, exon 7
              11980    12025     tropomyosin T class Ic alpha-2, exon 8
              12591    12707     tropomyosin T class Ic alpha-2, exon 9
              12795    12872     tropomyosin T class Ic alpha-2, exon 10
              13082    13195     tropomyosin T class Ic alpha-2, exon 11
              13295    13404     tropomyosin T class Ic alpha-2, exon 12
              13515    13605     tropomyosin T class Ic alpha-2, exon 13
              14613    14653     tropomyosin T class Ic alpha-2, exon 14
              16666    16720     tropomyosin T class Ic alpha-2, exon 15
    pept       3146     3162     tropomyosin T class Ic alpha-3, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ic alpha-3, exon 3
               5538     5555     tropomyosin T class Ic alpha-3, exon 4
               6722     6733     tropomyosin T class Ic alpha-3, exon 5
               6935     6949     tropomyosin T class Ic alpha-3, exon 6
               9703     9721     tropomyosin T class Ic alpha-3, exon 7
              11980    12025     tropomyosin T class Ic alpha-3, exon 8
              12591    12707     tropomyosin T class Ic alpha-3, exon 9
              12795    12872     tropomyosin T class Ic alpha-3, exon 10
              13082    13195     tropomyosin T class Ic alpha-3, exon 11
              13295    13404     tropomyosin T class Ic alpha-3, exon 12
              13515    13605     tropomyosin T class Ic alpha-3, exon 13
              14613    14653     tropomyosin T class Ic alpha-3, exon 14
              16666    16720     tropomyosin T class Ic alpha-3, exon 15
    pept       3146     3162     tropomyosin T class Ic alpha-4, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ic alpha-4, exon 3
               6722     6733     tropomyosin T class Ic alpha-4, exon 4
               6935     6949     tropomyosin T class Ic alpha-4, exon 5
               9703     9721     tropomyosin T class Ic alpha-4, exon 6
              11980    12025     tropomyosin T class Ic alpha-4, exon 7
              12591    12707     tropomyosin T class Ic alpha-4, exon 8
              12795    12872     tropomyosin T class Ic alpha-4, exon 9
              13082    13195     tropomyosin T class Ic alpha-4, exon 10
              13295    13404     tropomyosin T class Ic alpha-4, exon 11
              13515    13605     tropomyosin T class Ic alpha-4, exon 12
              14613    14653     tropomyosin T class Ic alpha-4, exon 13
              16666    16720     tropomyosin T class Ic alpha-4, exon 14
    pept       3146     3162     tropomyosin T class Ia beta-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ia beta-1, exon 3
               4313     4330     tropomyosin T class Ia beta-1, exon 4
               5538     5555     tropomyosin T class Ia beta-1, exon 5
               5737     5751     tropomyosin T class Ia beta-1, exon 6
               6722     6733     tropomyosin T class Ia beta-1, exon 7
               6935     6949     tropomyosin T class Ia beta-1, exon 8
               9703     9721     tropomyosin T class Ia beta-1, exon 9
              11980    12025     tropomyosin T class Ia beta-1, exon 10
              12591    12707     tropomyosin T class Ia beta-1, exon 11
              12795    12872     tropomyosin T class Ia beta-1, exon 12
              13082    13195     tropomyosin T class Ia beta-1, exon 13
              13295    13404     tropomyosin T class Ia beta-1, exon 14
              13515    13605     tropomyosin T class Ia beta-1, exon 15
              15251    15291     tropomyosin T class Ia beta-1, exon 16
              16666    16720     tropomyosin T class Ia beta-1, exon 17
    pept       3146     3162     tropomyosin T class Ia beta-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ia beta-2, exon 3
               5538     5555     tropomyosin T class Ia beta-2, exon 4
               5737     5751     tropomyosin T class Ia beta-2, exon 5
               6722     6733     tropomyosin T class Ia beta-2, exon 6
               6935     6949     tropomyosin T class Ia beta-2, exon 7
               9703     9721     tropomyosin T class Ia beta-2, exon 8
              11980    12025     tropomyosin T class Ia beta-2, exon 9
              12591    12707     tropomyosin T class Ia beta-2, exon 10
              12795    12872     tropomyosin T class Ia beta-2, exon 11
              13082    13195     tropomyosin T class Ia beta-2, exon 12
              13295    13404     tropomyosin T class Ia beta-2, exon 13
              13515    13605     tropomyosin T class Ia beta-2, exon 14
              15251    15291     tropomyosin T class Ia beta-2, exon 15
              16666    16720     tropomyosin T class Ia beta-2, exon 16
    pept       3146     3162     tropomyosin T class Ib beta-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ib beta-1, exon 3
               4313     4330     tropomyosin T class Ib beta-1, exon 4
               5737     5751     tropomyosin T class Ib beta-1, exon 5
               6722     6733     tropomyosin T class Ib beta-1, exon 6
               6935     6949     tropomyosin T class Ib beta-1, exon 7
               9703     9721     tropomyosin T class Ib beta-1, exon 8
              11980    12025     tropomyosin T class Ib beta-1, exon 9
              12591    12707     tropomyosin T class Ib beta-1, exon 10
              12795    12872     tropomyosin T class Ib beta-1, exon 11
              13082    13195     tropomyosin T class Ib beta-1, exon 12
              13295    13404     tropomyosin T class Ib beta-1, exon 13
              13515    13605     tropomyosin T class Ib beta-1, exon 14
              15251    15291     tropomyosin T class Ib beta-1, exon 15
              16666    16720     tropomyosin T class Ib beta-1, exon 16
    pept       3146     3162     tropomyosin T class Ib beta-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ib beta-2, exon 3
               5737     5751     tropomyosin T class Ib beta-2, exon 4
               6722     6733     tropomyosin T class Ib beta-2, exon 5
               6935     6949     tropomyosin T class Ib beta-2, exon 6
               9703     9721     tropomyosin T class Ib beta-2, exon 7
              11980    12025     tropomyosin T class Ib beta-2, exon 8
              12591    12707     tropomyosin T class Ib beta-2, exon 9
              12795    12872     tropomyosin T class Ib beta-2, exon 10
              13082    13195     tropomyosin T class Ib beta-2, exon 11
              13295    13404     tropomyosin T class Ib beta-2, exon 12
              13515    13605     tropomyosin T class Ib beta-2, exon 13
              15251    15291     tropomyosin T class Ib beta-2, exon 14
              16666    16720     tropomyosin T class Ib beta-2, exon 15
    pept       3146     3162     tropomyosin T class Ic beta-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ic beta-1, exon 3
               4313     4330     tropomyosin T class Ic beta-1, exon 4
               5538     5555     tropomyosin T class Ic beta-1, exon 5
               6722     6733     tropomyosin T class Ic beta-1, exon 6
               6935     6949     tropomyosin T class Ic beta-1, exon 7
               9703     9721     tropomyosin T class Ic beta-1, exon 8
              11980    12025     tropomyosin T class Ic beta-1, exon 9
              12591    12707     tropomyosin T class Ic beta-1, exon 10
              12795    12872     tropomyosin T class Ic beta-1, exon 11
              13082    13195     tropomyosin T class Ic beta-1, exon 12
              13295    13404     tropomyosin T class Ic beta-1, exon 13
              13515    13605     tropomyosin T class Ic beta-1, exon 14
              15251    15291     tropomyosin T class Ic beta-1, exon 15
              16666    16720     tropomyosin T class Ic beta-1, exon 16
    pept       3146     3162     tropomyosin T class Ic beta-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ic beta-2, exon 3
               4313     4330     tropomyosin T class Ic beta-2, exon 4
               6722     6733     tropomyosin T class Ic beta-2, exon 5
               6935     6949     tropomyosin T class Ic beta-2, exon 6
               9703     9721     tropomyosin T class Ic beta-2, exon 7
              11980    12025     tropomyosin T class Ic beta-2, exon 8
              12591    12707     tropomyosin T class Ic beta-2, exon 9
              12795    12872     tropomyosin T class Ic beta-2, exon 10
              13082    13195     tropomyosin T class Ic beta-2, exon 11
              13295    13404     tropomyosin T class Ic beta-2, exon 12
              13515    13605     tropomyosin T class Ic beta-2, exon 13
              15251    15291     tropomyosin T class Ic beta-2, exon 14
              16666    16720     tropomyosin T class Ic beta-2, exon 15
    pept       3146     3162     tropomyosin T class Ic beta-3, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ic beta-3, exon 3
               5538     5555     tropomyosin T class Ic beta-3, exon 4
               6722     6733     tropomyosin T class Ic beta-3, exon 5
               6935     6949     tropomyosin T class Ic beta-3, exon 6
               9703     9721     tropomyosin T class Ic beta-3, exon 7
              11980    12025     tropomyosin T class Ic beta-3, exon 8
              12591    12707     tropomyosin T class Ic beta-3, exon 9
              12795    12872     tropomyosin T class Ic beta-3, exon 10
              13082    13195     tropomyosin T class Ic beta-3, exon 11
              13295    13404     tropomyosin T class Ic beta-3, exon 12
              13515    13605     tropomyosin T class Ic beta-3, exon 13
              15251    15291     tropomyosin T class Ic beta-3, exon 14
              16666    16720     tropomyosin T class Ic beta-3, exon 15
    pept       3146     3162     tropomyosin T class Ic beta-4, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class Ic beta-4, exon 3
               6722     6733     tropomyosin T class Ic beta-4, exon 4
               6935     6949     tropomyosin T class Ic beta-4, exon 5
               9703     9721     tropomyosin T class Ic beta-4, exon 6
              11980    12025     tropomyosin T class Ic beta-4, exon 7
              12591    12707     tropomyosin T class Ic beta-4, exon 8
              12795    12872     tropomyosin T class Ic beta-4, exon 9
              13082    13195     tropomyosin T class Ic beta-4, exon 10
              13295    13404     tropomyosin T class Ic beta-4, exon 11
              13515    13605     tropomyosin T class Ic beta-4, exon 12
              15251    15291     tropomyosin T class Ic beta-4, exon 13
              16666    16720     tropomyosin T class Ic beta-4, exon 14
    pept       3146     3162     tropomyosin T class IIa alpha-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIa alpha-1, exon 3
               4313     4330     tropomyosin T class IIa alpha-1, exon 4
               5538     5555     tropomyosin T class IIa alpha-1, exon 5
               5737     5751     tropomyosin T class IIa alpha-1, exon 6
               6935     6949     tropomyosin T class IIa alpha-1, exon 7
               9703     9721     tropomyosin T class IIa alpha-1, exon 8
              11980    12025     tropomyosin T class IIa alpha-1, exon 9
              12591    12707     tropomyosin T class IIa alpha-1, exon 10
              12795    12872     tropomyosin T class IIa alpha-1, exon 11
              13082    13195     tropomyosin T class IIa alpha-1, exon 12
              13295    13404     tropomyosin T class IIa alpha-1, exon 13
              13515    13605     tropomyosin T class IIa alpha-1, exon 14
              14613    14653     tropomyosin T class IIa alpha-1, exon 15
              16666    16720     tropomyosin T class IIa alpha-1, exon 16
    pept       3146     3162     tropomyosin T class IIa alpha-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIa alpha-2, exon 3
               4313     4330     tropomyosin T class IIa alpha-2, exon 4
               5737     5751     tropomyosin T class IIa alpha-2, exon 5
               6935     6949     tropomyosin T class IIa alpha-2, exon 6
               9703     9721     tropomyosin T class IIa alpha-2, exon 7
              11980    12025     tropomyosin T class IIa alpha-2, exon 8
              12591    12707     tropomyosin T class IIa alpha-2, exon 9
              12795    12872     tropomyosin T class IIa alpha-2, exon 10
              13082    13195     tropomyosin T class IIa alpha-2, exon 11
              13295    13404     tropomyosin T class IIa alpha-2, exon 12
              13515    13605     tropomyosin T class IIa alpha-2, exon 13
              14613    14653     tropomyosin T class IIa alpha-2, exon 14
              16666    16720     tropomyosin T class IIa alpha-2, exon 15
    pept       3146     3162     tropomyosin T class IIa alpha-3, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIa alpha-3, exon 3
               5538     5555     tropomyosin T class IIa alpha-3, exon 4
               5737     5751     tropomyosin T class IIa alpha-3, exon 5
               6935     6949     tropomyosin T class IIa alpha-3, exon 6
               9703     9721     tropomyosin T class IIa alpha-3, exon 7
              11980    12025     tropomyosin T class IIa alpha-3, exon 8
              12591    12707     tropomyosin T class IIa alpha-3, exon 9
              12795    12872     tropomyosin T class IIa alpha-3, exon 10
              13082    13195     tropomyosin T class IIa alpha-3, exon 11
              13295    13404     tropomyosin T class IIa alpha-3, exon 12
              13515    13605     tropomyosin T class IIa alpha-3, exon 13
              14613    14653     tropomyosin T class IIa alpha-3, exon 14
              16666    16720     tropomyosin T class IIa alpha-3, exon 15
    pept       3146     3162     tropomyosin T class IIa alpha-4, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIa alpha-4, exon 3
               5737     5751     tropomyosin T class IIa alpha-4, exon 4
               6935     6949     tropomyosin T class IIa alpha-4, exon 5
               9703     9721     tropomyosin T class IIa alpha-4, exon 6
              11980    12025     tropomyosin T class IIa alpha-4, exon 7
              12591    12707     tropomyosin T class IIa alpha-4, exon 8
              12795    12872     tropomyosin T class IIa alpha-4, exon 9
              13082    13195     tropomyosin T class IIa alpha-4, exon 10
              13295    13404     tropomyosin T class IIa alpha-4, exon 11
              13515    13605     tropomyosin T class IIa alpha-4, exon 12
              14613    14653     tropomyosin T class IIa alpha-4, exon 13
              16666    16720     tropomyosin T class IIa alpha-4, exon 14
    pept       3146     3162     tropomyosin T class IIa beta-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIa beta-1, exon 3
               4313     4330     tropomyosin T class IIa beta-1, exon 4
               5538     5555     tropomyosin T class IIa beta-1, exon 5
               5737     5751     tropomyosin T class IIa beta-1, exon 6
               6935     6949     tropomyosin T class IIa beta-1, exon 7
               9703     9721     tropomyosin T class IIa beta-1, exon 8
              11980    12025     tropomyosin T class IIa beta-1, exon 9
              12591    12707     tropomyosin T class IIa beta-1, exon 10
              12795    12872     tropomyosin T class IIa beta-1, exon 11
              13082    13195     tropomyosin T class IIa beta-1, exon 12
              13295    13404     tropomyosin T class IIa beta-1, exon 13
              13515    13605     tropomyosin T class IIa beta-1, exon 14
              15251    15291     tropomyosin T class IIa beta-1, exon 15
              16666    16720     tropomyosin T class IIa beta-1, exon 16
    pept       3146     3162     tropomyosin T class IIa beta-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIa beta-2, exon 3
               4313     4330     tropomyosin T class IIa beta-2, exon 4
               5737     5751     tropomyosin T class IIa beta-2, exon 5
               6935     6949     tropomyosin T class IIa beta-2, exon 6
               9703     9721     tropomyosin T class IIa beta-2, exon 7
              11980    12025     tropomyosin T class IIa beta-2, exon 8
              12591    12707     tropomyosin T class IIa beta-2, exon 9
              12795    12872     tropomyosin T class IIa beta-2, exon 10
              13082    13195     tropomyosin T class IIa beta-2, exon 11
              13295    13404     tropomyosin T class IIa beta-2, exon 12
              13515    13605     tropomyosin T class IIa beta-2, exon 13
              15251    15291     tropomyosin T class IIa beta-2, exon 14
              16666    16720     tropomyosin T class IIa beta-2, exon 15
    pept       3146     3162     tropomyosin T class IIa beta-3, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIa beta-3, exon 3
               5538     5555     tropomyosin T class IIa beta-3, exon 4
               5737     5751     tropomyosin T class IIa beta-3, exon 5
               6935     6949     tropomyosin T class IIa beta-3, exon 6
               9703     9721     tropomyosin T class IIa beta-3, exon 7
              11980    12025     tropomyosin T class IIa beta-3, exon 8
              12591    12707     tropomyosin T class IIa beta-3, exon 9
              12795    12872     tropomyosin T class IIa beta-3, exon 10
              13082    13195     tropomyosin T class IIa beta-3, exon 11
              13295    13404     tropomyosin T class IIa beta-3, exon 12
              13515    13605     tropomyosin T class IIa beta-3, exon 13
              15251    15291     tropomyosin T class IIa beta-3, exon 14
              16666    16720     tropomyosin T class IIa beta-3, exon 15
    pept       3146     3162     tropomyosin T class IIa beta-4, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIa beta-4, exon 3
               5737     5751     tropomyosin T class IIa beta-4, exon 4
               6935     6949     tropomyosin T class IIa beta-4, exon 5
               9703     9721     tropomyosin T class IIa beta-4, exon 6
              11980    12025     tropomyosin T class IIa beta-4, exon 7
              12591    12707     tropomyosin T class IIa beta-4, exon 8
              12795    12872     tropomyosin T class IIa beta-4, exon 9
              13082    13195     tropomyosin T class IIa beta-4, exon 10
              13295    13404     tropomyosin T class IIa beta-4, exon 11
              13515    13605     tropomyosin T class IIa beta-4, exon 12
              15251    15291     tropomyosin T class IIa beta-4, exon 13
              16666    16720     tropomyosin T class IIa beta-4, exon 14
    pept       3146     3162     tropomyosin T class IIIa alpha, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIIa alpha, exon 3
               4313     4330     tropomyosin T class IIIa alpha, exon 4
               5538     5555     tropomyosin T class IIIa alpha, exon 5
               9703     9721     tropomyosin T class IIIa alpha, exon 6
              11980    12025     tropomyosin T class IIIa alpha, exon 7
              12591    12707     tropomyosin T class IIIa alpha, exon 8
              12795    12872     tropomyosin T class IIIa alpha, exon 9
              13082    13195     tropomyosin T class IIIa alpha, exon 10
              13295    13404     tropomyosin T class IIIa alpha, exon 11
              13515    13605     tropomyosin T class IIIa alpha, exon 12
              14613    14653     tropomyosin T class IIIa alpha, exon 13
              16666    16720     tropomyosin T class IIIa alpha, exon 14
    pept       3146     3162     tropomyosin T class IIIb alpha, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIIb alpha, exon 3
               5538     5555     tropomyosin T class IIIb alpha, exon 4
               9703     9721     tropomyosin T class IIIb alpha, exon 5
              11980    12025     tropomyosin T class IIIb alpha, exon 6
              12591    12707     tropomyosin T class IIIb alpha, exon 7
              12795    12872     tropomyosin T class IIIb alpha, exon 8
              13082    13195     tropomyosin T class IIIb alpha, exon 9
              13295    13404     tropomyosin T class IIIb alpha, exon 10
              13515    13605     tropomyosin T class IIIb alpha, exon 11
              14613    14653     tropomyosin T class IIIb alpha, exon 12
              16666    16720     tropomyosin T class IIIb alpha, exon 13
    pept       3146     3162     tropomyosin T class IIIa beta, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIIa beta, exon 3
               4313     4330     tropomyosin T class IIIa beta, exon 4
               5538     5555     tropomyosin T class IIIa beta, exon 5
               9703     9721     tropomyosin T class IIIa beta, exon 6
              11980    12025     tropomyosin T class IIIa beta, exon 7
              12591    12707     tropomyosin T class IIIa beta, exon 8
              12795    12872     tropomyosin T class IIIa beta, exon 9
              13082    13195     tropomyosin T class IIIa beta, exon 10
              13295    13404     tropomyosin T class IIIa beta, exon 11
              13515    13605     tropomyosin T class IIIa beta, exon 12
              15251    15291     tropomyosin T class IIIa beta, exon 13
              16666    16720     tropomyosin T class IIIa beta, exon 14
    pept       3146     3162     tropomyosin T class IIIb beta, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IIIb beta, exon 3
               5538     5555     tropomyosin T class IIIb beta, exon 4
               9703     9721     tropomyosin T class IIIb beta, exon 5
              11980    12025     tropomyosin T class IIIb beta, exon 6
              12591    12707     tropomyosin T class IIIb beta, exon 7
              12795    12872     tropomyosin T class IIIb beta, exon 8
              13082    13195     tropomyosin T class IIIb beta, exon 9
              13295    13404     tropomyosin T class IIIb beta, exon 10
              13515    13605     tropomyosin T class IIIb beta, exon 11
              15251    15291     tropomyosin T class IIIb beta, exon 12
              16666    16720     tropomyosin T class IIIb beta, exon 13
    pept       3146     3162     tropomyosin T class IVa alpha, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IVa alpha, exon 3
               5538     5555     tropomyosin T class IVa alpha, exon 4
               5737     5751     tropomyosin T class IVa alpha, exon 5
               6722     6733     tropomyosin T class IVa alpha, exon 6
               9703     9721     tropomyosin T class IVa alpha, exon 7
              11980    12025     tropomyosin T class IVa alpha, exon 8
              12591    12707     tropomyosin T class IVa alpha, exon 9
              12795    12872     tropomyosin T class IVa alpha, exon 10
              13082    13195     tropomyosin T class IVa alpha, exon 11
              13295    13404     tropomyosin T class IVa alpha, exon 12
              13515    13605     tropomyosin T class IVa alpha, exon 13
              14613    14653     tropomyosin T class IVa alpha, exon 14
              16666    16720     tropomyosin T class IVa alpha, exon 15
    pept       3146     3162     tropomyosin T class IVb alpha, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IVb alpha, exon 3
               4313     4330     tropomyosin T class IVb alpha, exon 4
               5538     5555     tropomyosin T class IVb alpha, exon 5
               5737     5751     tropomyosin T class IVb alpha, exon 6
               6722     6733     tropomyosin T class IVb alpha, exon 7
               9703     9721     tropomyosin T class IVb alpha, exon 8
              11980    12025     tropomyosin T class IVb alpha, exon 9
              12591    12707     tropomyosin T class IVb alpha, exon 10
              12795    12872     tropomyosin T class IVb alpha, exon 11
              13082    13195     tropomyosin T class IVb alpha, exon 12
              13295    13404     tropomyosin T class IVb alpha, exon 13
              13515    13605     tropomyosin T class IVb alpha, exon 14
              14613    14653     tropomyosin T class IVb alpha, exon 15
              16666    16720     tropomyosin T class IVb alpha, exon 16
    pept       3146     3162     tropomyosin T class IVc alpha-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IVc alpha-1, exon 3
               4313     4330     tropomyosin T class IVc alpha-1, exon 4
               5737     5751     tropomyosin T class IVc alpha-1, exon 5
               6722     6733     tropomyosin T class IVc alpha-1, exon 6
               9703     9721     tropomyosin T class IVc alpha-1, exon 7
              11980    12025     tropomyosin T class IVc alpha-1, exon 8
              12591    12707     tropomyosin T class IVc alpha-1, exon 9
              12795    12872     tropomyosin T class IVc alpha-1, exon 10
              13082    13195     tropomyosin T class IVc alpha-1, exon 11
              13295    13404     tropomyosin T class IVc alpha-1, exon 12
              13515    13605     tropomyosin T class IVc alpha-1, exon 13
              14613    14653     tropomyosin T class IVc alpha-1, exon 14
              16666    16720     tropomyosin T class IVc alpha-1, exon 15
    pept       3146     3162     tropomyosin T class IVc alpha-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IVc alpha-2, exon 3
               5737     5751     tropomyosin T class IVc alpha-2, exon 4
               6722     6733     tropomyosin T class IVc alpha-2, exon 5
               9703     9721     tropomyosin T class IVc alpha-2, exon 6
              11980    12025     tropomyosin T class IVc alpha-2, exon 7
              12591    12707     tropomyosin T class IVc alpha-2, exon 8
              12795    12872     tropomyosin T class IVc alpha-2, exon 9
              13082    13195     tropomyosin T class IVc alpha-2, exon 10
              13295    13404     tropomyosin T class IVc alpha-2, exon 11
              13515    13605     tropomyosin T class IVc alpha-2, exon 12
              14613    14653     tropomyosin T class IVc alpha-2, exon 13
              16666    16720     tropomyosin T class IVc alpha-2, exon 14
    pept       3146     3162     tropomyosin T class IVd alpha-1, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IVd alpha-1, exon 3
               4313     4330     tropomyosin T class IVd alpha-1, exon 4
               5538     5555     tropomyosin T class IVd alpha-1, exon 5
               6722     6733     tropomyosin T class IVd alpha-1, exon 6
               9703     9721     tropomyosin T class IVd alpha-1, exon 7
              11980    12025     tropomyosin T class IVd alpha-1, exon 8
              12591    12707     tropomyosin T class IVd alpha-1, exon 9
              12795    12872     tropomyosin T class IVd alpha-1, exon 10
              13082    13195     tropomyosin T class IVd alpha-1, exon 11
              13295    13404     tropomyosin T class IVd alpha-1, exon 12
              13515    13605     tropomyosin T class IVd alpha-1, exon 13
              14613    14653     tropomyosin T class IVd alpha-1, exon 14
              16666    16720     tropomyosin T class IVd alpha-1, exon 15
    pept       3146     3162     tropomyosin T class IVd alpha-2, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IVd alpha-2, exon 3
               4313     4330     tropomyosin T class IVd alpha-2, exon 4
               6722     6733     tropomyosin T class IVd alpha-2, exon 5
               9703     9721     tropomyosin T class IVd alpha-2, exon 6
              11980    12025     tropomyosin T class IVd alpha-2, exon 7
              12591    12707     tropomyosin T class IVd alpha-2, exon 8
              12795    12872     tropomyosin T class IVd alpha-2, exon 9
              13082    13195     tropomyosin T class IVd alpha-2, exon 10
              13295    13404     tropomyosin T class IVd alpha-2, exon 11
              13515    13605     tropomyosin T class IVd alpha-2, exon 12
              14613    14653     tropomyosin T class IVd alpha-2, exon 13
              16666    16720     tropomyosin T class IVd alpha-2, exon 14
    pept       3146     3162     tropomyosin T class IVd alpha-3, exon 2 (first
                                 expressed exon)
               3344     3357     tropomyosin T class IVd alpha-3, exon 3
               5538     5555     tropomyosin T class IVd alpha-3, exon 4
             

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       TRSSTOBAA     360 bp ss-RNA             VRL       14-MAR-1990
DEFINITION  Tobacco ringspot virus (TobRV) mRNA.
ACCESSION   M17439
KEYWORDS    .
SOURCE      Tobacco ringspot virus (isolate 62L), cDNA to viral RNA.
  ORGANISM  Tobacco ringspot virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Nepovirus.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Buzayan,J.M., McNinch,J.S., Schneider,I.R. and Bruening,G.
  TITLE     A nucleotide sequence rearrangement distinguishes two isolates of
            satellite tobacco ringspot virus RNA
  JOURNAL   Virology 160, 95-99 (1987)
  STANDARD  simple staff_review
BASE COUNT       70 a     90 c    110 g     90 t
ORIGIN      
        1 accggatgtg ctttccggtc tgatgagtcc gtgaggacga aacaggactg tcaggtggcc
       61 gaaagccacc acgtaaacta atgaaccgtg ctgcgtagcg ttgtgcctcg tggaggtggg
      121 atgccacctc gtggagcagc cttcgtgtgg gcgcggcggt gtagttagtc aaggcgtacc
      181 aggtaatata ccacaacgtg tgtttctctg gttgacttct ctgtttgttg tgtcattagg
      241 tcccgggtct cgcattagcg gcgacggggt attctcttac gacatggaag tttgagagac
      301 cgcgcctcta cactatgcgc ggccggggcg aatccaaata ttctagcccg ataccctgtc
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATMYHOA      260 bp ss-mRNA            ROD       14-MAR-1990
DEFINITION  Rat cardiac beta-myosin heavy chain mRNA, 3' end, clone pCMHC-21.
ACCESSION   M32697
KEYWORDS    beta-myosin; myosin; myosin heavy chain.
SOURCE      Rat (strain Wistar) three months-old ventricle, cDNA to mRNA, clone
            pCMHC-21.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 260)
  AUTHORS   Mahdavi,V., Lompre,A.M., Chambers,A.P. and Nadal-Ginard,B.
  TITLE     Cardiac myosin heavy chain isozymic transitions during development
            and under pathological conditions are regulated at the level of
            mRNA availability
  JOURNAL   Eur. Heart J. 5, 181-191 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      204     beta-myosin heavy chain (AA at 1)
BASE COUNT       77 a     69 c     79 g     35 t
ORIGIN      
        1 ctgcagttga aggtgaaggc ctacaagcgc caggctgaag aggcggagga acaggccaac
       61 accaacctgt ccaagttccg caaggtgcag cacgagctgg atgaggcaga ggagagggcg
      121 gacattgccg agtcccaggt caacaagctg cgggccaaga gccgtgacat tggcgccaag
      181 cagaaaatcc acgatgagga ataacctgtc cagcagaaag agcctcgccg ttgccatccc
      241 acaataaata cgaatgttcg
//
LOCUS       RATMYHOB      303 bp ss-mRNA            ROD       14-MAR-1990
DEFINITION  Rat cardiac beta-myosin heavy chain mRNA, 3' end, clone pCMHC-5.
ACCESSION   M32698
KEYWORDS    beta-myosin; myosin; myosin heavy chain.
SOURCE      Rat (strain Wistar) three months-old ventricle, cDNA to mRNA, clone
            pCMHC-5.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 303)
  AUTHORS   Mahdavi,V., Lompre,A.M., Chambers,A.P. and Nadal-Ginard,B.
  TITLE     Cardiac myosin heavy chain isozymic transitions during development
            and under pathological conditions are regulated at the level of
            mRNA availability
  JOURNAL   Eur. Heart J. 5, 181-191 (1984)
  STANDARD  simle staff_review
FEATURES       from  to/span     description
    pept    <     1      198     beta-myosin heavy chain (AA at 1)
BASE COUNT       87 a     80 c     89 g     47 t
ORIGIN      
        1 ctgcagttga aggtgaaggc ctacaagcgc caggctgaag aggcggagga acaggccaac
       61 accaacctgt ccaagttccg caaggtgcag cacgagctgg atgaggcaga ggagagggcg
      121 gacattgccg agtcccaggt caacaagctg cgggccaaga gccgtgacat tggcgccaag
      181 ggcctgaatg aagagtagat cttgctctac ccaaccctaa ggatgcctgt gaagccctga
      241 gacctggagc ctttgaaaag caccttcagg cagaaacaca ataaagcaat tttccttcaa
      301 gcc
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       HUMP42LA     2465 bp ss-mRNA            PRI       11-JAN-1990
DEFINITION  Human erythrocyte membrane protein 4.2 (HUMP4.2L) mRNA,
            complete cds.
ACCESSION   M30647
KEYWORDS    .
SOURCE      Human sickle cell patient reticulocyte, cDNA to mRNA, clones 7 and
            12.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2465)
  AUTHORS   Sung,L.A., Chien,S., Chang,L.-S., Lambert,K., Bliss,S.A.,
            Bouhassira,E.E., Nagel,R.L., Schwartz,R.S. and Rybicki,A.C.
  TITLE     Molecular cloning of human protein 4.2: A major component of the
            red erythrocyte membrane
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 955-959 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.Sung, 12-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept        228     2393     protein 4.2 (HUMP4.2L) (put.)
    mRNA    <     1     2465     protein 4.2 (HUMP4.2L) mRNA
BASE COUNT      603 a    661 c    697 g    504 t
ORIGIN      
        1 gaactggtcc aggagatacc agcagagagg gagtaggaga gaagaaacat gtcagggtgc
       61 tcacaggagt agtgggggga ggttttgcta tttccagatt cttaagccaa caaaagtgcc
      121 ttcatatttt ctgtctggaa gacagaaagc ccagaaggag cccagaagca acagtttgag
      181 agaggcgctt tctgcggcca agtggataag aggagcggcc tgcaaccatg ggacagggtg
      241 agccaagcca gcgctcgaca gggcttgctg gactgtatgc agcccccgca gcatcacctg
      301 ttttcattaa aggaagtggg atggatgccc tgggtatcaa gagctgtgac tttcaggcag
      361 caagaaacaa tgaggagcac cacaccaagg ccctcagctc ccggcgcctc tttgtgagga
      421 gggggcagcc cttcaccatc atcctgtact tccgcgctcc agtccgtgca tttctgcctg
      481 ccctgaagaa ggtggccctc actgcacaaa ctggagagca gccttccaag atcaacagga
      541 cccaagccac attcccaatt tccagtctgg gggaccgaaa gtggtggagt gcagtggtgg
      601 aggagagaga tgcccagtcc tggaccatct ctgtgaccac acctgcagac gctgtcattg
      661 gccactactc gcttctgctg caggtctcag gcaggaagca actcctcttg ggtcagttca
      721 cactgctttt taacccctgg aatagagagg atgctgtgtt cctgaagaat gaggctcagc
      781 gcatggagta cttgttgaac cagaatggtc tcatctacct gggtacagct gactgcatcc
      841 aggcagagtc ctgggacttt ggccagttcg agggggatgt cattgacctc agcctgcgct
      901 tgctgagcaa ggacaagcag gtagagaagt ggagccagcc ggtgcacgtg gcccgtgtgt
      961 tgggtgcctt gctgcatttt ctcaaggagc agagggtcct gcccaccccg cagacccagg
     1021 ccacccagga aggggccttg ctgaacaagc gccggggcag cgtgcccatc ctgcggcagt
     1081 ggctcaccgg ccgaggccga cctgtgtatg atggccaggc ctgggtgttg gctgctgttg
     1141 cttgcacagt gctgcgatgc ctgggaatcc ctgcccgcgt ggtgaccacg tttgcctcag
     1201 cacagggcac cggtgggcgt cttctcatag atgaatacta taatgaggag ggacttcaga
     1261 acggagaagg ccagagaggc agaatctgga tcttccagac ttccacagag tgctggatga
     1321 agcgcggcct gccttgccag ggttatgatg gatggcagat tctgcaccca agtgctccta
     1381 atggaggtgg agtcctgggg tcctgtgatc tggtgccggt cagagcagtc aaggagggga
     1441 cgctggggct gaccccagca gtgtcagacc tttttgctgc cataaatgcc tcatgtgtgg
     1501 tctggaagtg ctgtgaggat gggacactgg agttgactga ctccaacaca aagtatgttg
     1561 gcaacaacat cagcaccaag ggtgtgggca gtgaccgctg cgaggacatc actcagaact
     1621 acaagtatcc tgaagggtct cttcaggaaa aagaggtgct ggagagagtc gagaaagaga
     1681 aaatggaacg tgagaaagac aacggcatcc gtcctcccag tctcgagact gccagtcctc
     1741 tgtacctgct cttgaaagca cccagctccc tacccctgag aggggatgcc cagatctcag
     1801 tgacgctggt taatcacagt gagcaggaga aggcagtgca gctggcaatt ggggtccagg
     1861 ctgtacacta caacggtgtc cttgctgcca agctctggag gaagaagctg cacctcacgc
     1921 tcagtgccaa cctggaaaag ataataacca tcggcctgtt cttctccaat tttgagcgaa
     1981 acccacccga gaacaccttc cttagactca ccgccatggc aacacactct gaatccaacc
     2041 ttagctgctt tgctcaggaa gacattgcca tttgtagacc acaccttgcc atcaagatgc
     2101 cagagaaagc agagcagtat caacccctca cagcctcagt cagcctccag aactccctag
     2161 atgcccccat ggaggactgt gtgatctcca tcctgggaag ggggctcatt cacagagaga
     2221 ggagctacag attccgttca gtgtggcctg aaaacaccat gtgtgccaag ttccagttca
     2281 cgccaacaca tgtggggctc cagagactca ctgtggaagt ggactgcaac atgttccaga
     2341 acctaaccaa ctataaaagc gtcaccgtgg tagcccctga actatcagct taaacttcca
     2401 gctctatcac cactctcctg ccaacccttg ttctacaatc taaaccaaac atgtgctagg
     2461 aagag
//
LOCUS       HUMP42SA     2375 bp ss-mRNA            PRI       11-JAN-1990
DEFINITION  Human erythrocyte membrane protein 4.2 (HUMP4.2S) mRNA,
            complete cds.
ACCESSION   M30646
KEYWORDS    .
SOURCE      Human sickle cell patient reticulocyte, cDNA to mRNA, clones 7 and
            16.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2375)
  AUTHORS   Sung,L.A., Chien,S., Chang,L.-S., Lambert,K., Bliss,S.A.,
            Bouhassira,E.E., Nagel,R.L., Schwartz,R.S. and Rybicki,A.C.
  TITLE     Molecular cloning of human protein 4.2: A major component of the
            red erythrocyte membrane
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 955-959 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.Sung, 12-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept        228     2303     protein 4.2 (HUMP4.2S) (put.)
    mRNA    <     1     2375     protein 4.2 mRNA
BASE COUNT      583 a    638 c    669 g    485 t
ORIGIN      
        1 gaactggtcc aggagatacc agcagagagg gagtaggaga gaagaaacat gtcagggtgc
       61 tcacaggagt agtgggggga ggttttgcta tttccagatt cttaagccaa caaaagtgcc
      121 ttcatatttt ctgtctggaa gacagaaagc ccagaaggag cccagaagca acagtttgag
      181 agaggcgctt tctgcggcca agtggataag aggagcggcc tgcaaccatg ggacaggccc
      241 tgggtatcaa gagctgtgac tttcaggcag caagaaacaa tgaggagcac cacaccaagg
      301 ccctcagctc ccggcgcctc tttgtgagga gggggcagcc cttcaccatc atcctgtact
      361 tccgcgctcc agtccgtgca tttctgcctg ccctgaagaa ggtggccctc actgcacaaa
      421 ctggagagca gccttccaag atcaacagga cccaagccac attcccaatt tccagtctgg
      481 gggaccgaaa gtggtggagt gcagtggtgg aggagagaga tgcccagtcc tggaccatct
      541 ctgtgaccac acctgcagac gctgtcattg gccactactc gcttctgctg caggtctcag
      601 gcaggaagca actcctcttg ggtcagttca cactgctttt taacccctgg aatagagagg
      661 atgctgtgtt cctgaagaat gaggctcagc gcatggagta cttgttgaac cagaatggtc
      721 tcatctacct gggtacagct gactgcatcc aggcagagtc ctgggacttt ggccagttcg
      781 agggggatgt cattgacctc agcctgcgct tgctgagcaa ggacaagcag gtagagaagt
      841 ggagccagcc ggtgcacgtg gcccgtgtgt tgggtgcctt gctgcatttt ctcaaggagc
      901 agagggtcct gcccaccccg cagacccagg ccacccagga aggggccttg ctgaacaagc
      961 gccggggcag cgtgcccatc ctgcggcagt ggctcaccgg ccgaggccga cctgtgtatg
     1021 atggccaggc ctgggtgttg gctgctgttg cttgcacagt gctgcgatgc ctgggaatcc
     1081 ctgcccgcgt ggtgaccacg tttgcctcag cacagggcac cggtgggcgt cttctcatag
     1141 atgaatacta taatgaggag ggacttcaga acggagaagg ccagagaggc agaatctgga
     1201 tcttccagac ttccacagag tgctggatga agcgcggcct gccttgccag ggttatgatg
     1261 gatggcagat tctgcaccca agtgctccta atggaggtgg agtcctgggg tcctgtgatc
     1321 tggtgccggt cagagcagtc aaggagggga cgctggggct gaccccagca gtgtcagacc
     1381 tttttgctgc cataaatgcc tcatgtgtgg tctggaagtg ctgtgaggat gggacactgg
     1441 agttgactga ctccaacaca aagtatgttg gcaacaacat cagcaccaag ggtgtgggca
     1501 gtgaccgctg cgaggacatc actcagaact acaagtatcc tgaagggtct cttcaggaaa
     1561 aagaggtgct ggagagagtc gagaaagaga aaatggaacg tgagaaagac aacggcatcc
     1621 gtcctcccag tctcgagact gccagtcctc tgtacctgct cttgaaagca cccagctccc
     1681 tacccctgag aggggatgcc cagatctcag tgacgctggt taatcacagt gagcaggaga
     1741 aggcagtgca gctggcaatt ggggtccagg ctgtacacta caacggtgtc cttgctgcca
     1801 agctctggag gaagaagctg cacctcacgc tcagtgccaa cctggaaaag ataataacca
     1861 tcggcctgtt cttctccaat tttgagcgaa acccacccga gaacaccttc cttagactca
     1921 ccgccatggc aacacactct gaatccaacc ttagctgctt tgctcaggaa gacattgcca
     1981 tttgtagacc acaccttgcc atcaagatgc cagagaaagc agagcagtat caacccctca
     2041 cagcctcagt cagcctccag aactccctag atgcccccat ggaggactgt gtgatctcca
     2101 tcctgggaag ggggctcatt cacagagaga ggagctacag attccgttca gtgtggcctg
     2161 aaaacaccat gtgtgccaag ttccagttca cgccaacaca tgtggggctc cagagactca
     2221 ctgtggaagt ggactgcaac atgttccaga acctaaccaa ctataaaagc gtcaccgtgg
     2281 tagcccctga actatcagct taaacttcca gctctatcac cactctcctg ccaacccttg
     2341 ttctacaatc taaaccaaac atgtgctagg aagag
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATFERA1      563 bp ds-DNA             ROD       14-MAR-1990
DEFINITION  Rat ferritin heavy subunit gene, partial exon X.
ACCESSION   M18051
KEYWORDS    ferritin.
SEGMENT     1 of 3
SOURCE      Rat (strain Fischer) liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 563)
  AUTHORS   Murray,M.T., White,K. and Munro,H.N.
  TITLE     Conservation of ferritin heavy subunit gene structure: Implications
            for the regulation of ferritin gene expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 7438-7442 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /   349  +   459     ferritin heavy subunit, exon X
    pre-msg <     1  >   563     ferritin H mRNA and intron
    IVS         460  >   563     ferritin H intron A
BASE COUNT       87 a    204 c    173 g     99 t
ORIGIN      
        1 agctcagaga cccaagagcc gcctcacaat cacacaggct cctccccgcc cacgcactgc
       61 tggcttgggc aacacgccta caggaagagg cggggctggg cggcccaccg cgctgattgg
      121 ccggagcgcg cctgacgcag gatcccgcta taaagtgcgg cccgctggtc cctacgccag
      181 acgttctcgc ccagagtcgc cgcggtttcc tgcttcaaca gtgcttgaac ggaacccggt
      241 gctcgacccc tccgaccccc gtccggccgc tttgagcctg agccctttgc aacttcgtcg
      301 ctccgccgct ccagcgtcgc ctccgcgcct cgtccagccg ccatcatgac caccgcgtct
      361 ccctcgcaag tgcgccagaa ctaccaccag gactcggagg ctgccatcaa ccgccagatc
      421 aacctggagt tgtatgcctc ctacgtctat ctgtccatgg tgagtgcggc ctggcctttg
      481 cgggggcgga aagagggtgc ggcctggcct cccttgggcc acttggtgag ctggcggagg
      541 gtgggttggg gcgtggctgc ggg
//
LOCUS       RATFERA2      232 bp ds-DNA             ROD       14-MAR-1990
DEFINITION  Rat ferritin heavy subunit gene, exon X+1.
ACCESSION   M18052
KEYWORDS    ferritin.
SEGMENT     2 of 3
SOURCE      Rat (strain Fischer) three months-old liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 232)
  AUTHORS   Murray,M.T., White,K. and Munro,H.N.
  TITLE     Conservation of ferritin heavy subunit gene structure: Implications
            for the regulation of ferritin gene expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 7438-7442 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    51  +   197     ferritin heavy subunit, exon X+1
    pre-msg <     1  >   232     ferritin H mRNA and introns
    IVS     <     1       50     ferritin H intron A
    IVS         198  >   232     ferritin H intron B
BASE COUNT       66 a     48 c     57 g     61 t
ORIGIN      About 1 kb after segment 1.
        1 gcatctgcct tgctgtgggg atcaataaca aatacccttt ccactttcag tcttgttatt
       61 ttgaccggga tgatgtggcc ctgaagaact ttgccaaata ctttctccat caatctcatg
      121 aagagaggga acatgctgag aaactgatga agctgcagaa ccagcgaggt ggacgaatct
      181 tcctgcagga tatcaaggta agtagactat gggactgcgt taaatgagca gt
//
LOCUS       RATFERA3      764 bp ds-DNA             ROD       14-MAR-1990
DEFINITION  Rat ferritin heavy subunit gene, last two exons.
ACCESSION   M18053
KEYWORDS    ferritin.
SEGMENT     3 of 3
SOURCE      Rat (strain Fischer) three months-old liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 764)
  AUTHORS   Murray,M.T., White,K. and Munro,H.N.
  TITLE     Conservation of ferritin heavy subunit gene structure: Implications
            for the regulation of ferritin gene expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 7438-7442 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    34      159     ferritin heavy subunit, exon X+2
                252      413     ferritin heavy subunit, exon X+3
    pre-msg <     1      566     ferritin H mRNA and introns
    IVS     <     1       33     ferritin H intron B
    IVS         160      251     ferritin H intron C
BASE COUNT      209 a    166 c    188 g    201 t
ORIGIN      About 150 bp after segment 2.
        1 ctgcagatga attgacatgt ttctttgatt cagaaacctg accgtgatga ctgggagagc
       61 gggctgaatg caatgaggtg tgcactgcac ttggaaaaga gtgtgaatca gtcactactg
      121 gaacttcaca aactggctac tgacaagaat gatccccacg tgagtatcag aaacacgggg
      181 tgagtggaga tgatttgcca cagggcttgg gagagctgac cagtaaccct gtcccatgtt
      241 ctctttccta gttatgtgac ttcattgaga cgcattacct gaatgagcag gtgaaatcca
      301 ttaaagaact gggtgaccac gtgaccaact tacgcaagat gggagcccct gaatctggca
      361 tggcagaata tctctttgac aagcacaccc tgggacacgg tgatgagagc taagctgacg
      421 tccccaaggc catgtgactt tactggctca ctgaggcagt gcatgcatgt caggctgcct
      481 ttatcttttc tataagttgc accaaaacat ctgcttaaaa gttctttaat ttgtaccatt
      541 tcttcaaata aagaattttg gtacccagct cttgttgtga ttgaggatga gcgcaccagc
      601 ttcccttgcg tcggctatat aaccacactg caacgcctga aagaatattt attaaactcg
      661 tagttgggga aagatagtga aagacaggtg tgttcagaca ggactaagca gtcctggttc
      721 tgagttacct gccagactgc catgggaaca tattcttgag tgtc
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       HUMHPRTB    56736 bp ds-DNA             PRI       15-MAR-1990
DEFINITION  Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete
            cds.
ACCESSION   M26434
KEYWORDS    Alu repetitive element; LINE repetitive element;
            hypoxanthine phosphoribosyltransferase.
SOURCE      Human cell line 4X,Y (GM1202), fetal liver, and beta thalasemia
            patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 56736)
  AUTHORS   Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J.,
            Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W.
  TITLE     Automated DNA sequencing of the human HPRT locus
  JOURNAL   Genomics 6, 593-608 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Edwards, 26-JUL-1989.
FEATURES       from  to/span     description
    pept       1677     1703     hypoxanthine phosphoribosyltransferase, exon 1
                                 /nomgen="HPRT" /map="Xq26"
                                 /hgml_locus_uid="LE0031K"
              14780    14886     hypoxanthine phosphoribosyltransferase, exon 2
              16603    16786     hypoxanthine phosphoribosyltransferase, exon 3
              27891    27956     hypoxanthine phosphoribosyltransferase, exon 4
              31617    31634     hypoxanthine phosphoribosyltransferase, exon 5
              34937    35019     hypoxanthine phosphoribosyltransferase, exon 6
              39815    39861     hypoxanthine phosphoribosyltransferase, exon 7
              40033    40109     hypoxanthine phosphoribosyltransferase, exon 8
              41454    41501     hypoxanthine phosphoribosyltransferase, exon 9
    IVS        1704    14779     HPRT intron A
    IVS       14887    16602     HPRT intron B
    IVS       16787    27890     HPRT intron C
    IVS       27957    31616     HPRT intron D
    IVS       31635    34936     HPRT intron E
    IVS       35020    39814     HPRT intron F
    IVS       39862    40032     HPRT intron G
    IVS       40110    41453     HPRT intron H
    rpt         258       41 (c) Alu repeat copy A
    rpt        1012      694 (c) Alu repeat copy B
    rpt        3532     3342 (c) Alu repeat copy C
    rpt        3982     3343 (c) Alu repeat copy D
    rpt        6232     5937 (c) Alu repeat copy E
    rpt        6524     6288 (c) Alu repeat copy F
    rpt        7504     7807     Alu repeat copy G
    rpt        8713     8408 (c) Alu repeat copy H
    rpt        9477     9169 (c) Alu repeat copy I
    rpt        9699    10142     Alu repeat copy J
    rpt       10407    10293 (c) Alu repeat copy K
    rpt       12154    12310     Alu repeat copy L
    rpt       13302    12998 (c) Alu repeat copy M
    rpt       13638    13314 (c) Alu repeat copy N
    rpt       14238    13923 (c) Alu repeat copy O
    rpt       14610    14239 (c) Alu repeat copy P
    rpt       16090    15843 (c) Alu repeat copy Q
    rpt       16518    16293 (c) Alu repeat copy R
    rpt       18783    18439 (c) Alu repeat copy S
    rpt       18962    19084     Alu repeat copy T
    rpt       19397    19104 (c) Alu repeat copy U
    rpt       20064    19778 (c) Alu repeat copy V
    rpt       20486    20636     Alu repeat copy W
    rpt       20990    20696 (c) Alu repeat copy X
    rpt       21293    20996 (c) Alu repeat copy Y
    rpt       22171    21878 (c) Alu repeat copy Z
    rpt       23772    23460 (c) Alu repeat copy AA
    rpt       25310    25426     Alu repeat copy AB
    rpt       26388    26279 (c) Alu repeat copy AC
    rpt       29863    29560 (c) Alu repeat copy AD
    rpt       33618    33148 (c) Alu repeat copy AE
    rpt       36575    36645     Alu repeat copy AF
    rpt       38764    39083     Alu repeat copy AG
    rpt       39091    39388     Alu repeat copy AH
    rpt       40622    40925     Alu repeat copy AI
    rpt       40952    41262     Alu repeat copy AJ
    rpt       42640    42936     Alu repeat copy AK
    rpt       44776    44486 (c) Alu repeat copy AL
    rpt       46627    46305 (c) Alu repeat copy AM
    rpt       47520    47763     Alu repeat copy AN
    rpt       49257    49551     Alu repeat copy AO
    rpt       49705    49993     Alu repeat copy AP
    rpt       50002    50312     Alu repeat copy AQ
    rpt       51066    50774 (c) Alu repeat copy AR
    rpt       52276    51978 (c) Alu repeat copy AS
    rpt       52407    52674     Alu repeat copy AT
    rpt       54653    54551 (c) Alu repeat copy AU
    rpt       55465    55164 (c) Alu repeat copy AV
    rpt       56523    56689     Alu repeat copy AW
    rpt       11312    11254 (c) LINE repeat copy A (partial)
    rpt       18284    18242 (c) LINE repeat copy B (partial)
    rpt       27091    26843 (c) LINE repeat copy C (partial)
    rpt       37558    37300 (c) LINE repeat copy D (partial)
    rpt       39451    39427 (c) LINE repeat copy E (partial)
    rpt       51317    51100 (c) LINE repeat copy F (partial)
    rpt         261      321     short interspersed repeat copy A
    rpt        1013     1054     short interspersed repeat copy B
    rpt        2902     2869 (c) short interspersed repeat copy C
    rpt        3078     3107     short interspersed repeat copy D
    rpt        4228     4254     short interspersed repeat copy E
    rpt        4463     4494     short interspersed repeat copy F
    rpt       21582    21635     short interspersed repeat copy G
    rpt       28430    28394 (c) short interspersed repeat copy H
    rpt       28923    28981     short interspersed repeat copy I
    rpt       35562    35604     short interspersed repeat copy J
    rpt       44042    44108     short interspersed repeat copy K
    rpt       46628    46688     short interspersed repeat copy L
    rpt       51715    51687 (c) short interspersed repeat copy M
    unsure    17479    17479     polymorphism or cloning error
    unsure    24903    24903     polymorphism or cloning error
BASE COUNT    15689 a  11281 c  11599 g  18167 t
ORIGIN      1 bp upstream of EcoRI site; chromosome Xq26.
        1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt
       61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc
      121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc
      181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt
      241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga
      301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc
      361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag
      421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc
      481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt
      541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct
      601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga
      661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt
      721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat
      781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga
      841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag
      901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct
      961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc
     1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag
     1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac
     1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc
     1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg
     1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt
     1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg
     1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg
     1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg
     1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc
     1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc
     1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg
     1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg
     1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg
     1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc
     1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga
     1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca
     1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc
     2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat
     2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg
     2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag
     2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg
     2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct
     2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt
     2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg
     2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca
     2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg
     2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt
     2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg
     2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa
     2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt
     2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg
     2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc
     2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc
     3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt
     3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt
     3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct
     3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat
     3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt
     3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag
     3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc
     3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc
     3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta
     3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac
     3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc
     3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt
     3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc
     3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc
     3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc
     3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag
     3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc
     4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca
     4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc
     4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc
     4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa
     4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca
     4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga
     4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac
     4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg
     4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt
     4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac
     4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg
     4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag
     4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg
     4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca
     4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg
     4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat
     4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg
     5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc
     5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta
     5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc
     5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact
     5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct
     5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg
     5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg
     5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc
     5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa
     5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat
     5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa
     5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg
     5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc
     5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa
     5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt
     5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt
     6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg
     6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga
     6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg
     6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc
     6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt
     6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc
     6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg
     6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt
     6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga
     6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac
     6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct
     6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag
     6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc
     6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt
     6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg
     6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc
     6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca
     7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc
     7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga
     7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt
     7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc
     7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg
     7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag
     7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag
     7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca
     7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt
     7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac
     7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga
     7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt
     7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa
     7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct
     7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact
     7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt
     7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc
     8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac
     8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct
     8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg
     8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat
     8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag
     8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat
     8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg
     8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg
     8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc
     8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac
     8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg
     8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc
     8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat
     8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga
     8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta
     8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg
     9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt
     9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag
     9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt
     9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc
     9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag
     9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt
     9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca
     9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc
     9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc
     9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat
     9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg
     9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc
     9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga
     9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca
     9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat
     9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta
     9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc
    10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc
    10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa
    10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa
    10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat
    10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt
    10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct
    10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt
    10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga
    10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca
    10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct
    10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg
    10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg
    10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa
    10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta
    10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa
    10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca
    10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct
    11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa
    11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc
    11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat
    11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc
    11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag
    11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag
    11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga
    11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg
    11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg
    11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt
    11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac
    11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa
    11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa
    11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa
    11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt
    11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt
    12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag
    12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat
    12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat
    12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc
    12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt
    12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt
    12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct
    12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat
    12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa
    12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag
    12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt
    12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac
    12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca
    12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct
    12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt
    12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc
    12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta
    13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc
    13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg
    13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc
    13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct
    13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt
    13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg
    13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat
    13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca
    13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga
    13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt
    13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac
    13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta
    13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg
    13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc
    13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat
    13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg
    13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc
    14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt
    14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga
    14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt
    14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc
    14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg
    14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc
    14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc
    14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca
    14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg
    14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata
    14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg
    14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac
    14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg
    14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat
    14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac
    14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa
    15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt
    15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg
    15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg
    15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg
    15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt
    15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat
    15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt
    15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata
    15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat
    15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag
    15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg
    15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt
    15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga
    15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc
    15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca
    15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag
    15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt
    16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag
    16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt
    16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa
    16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa
    16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt
    16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact
    16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac
    16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac
    16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa
    16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg
    16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg
    16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag
    16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa
    16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa
    16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta
    16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta
    16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg
    17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt
    17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt
    17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt
    17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc
    17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg
    17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc
    17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt
    17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa
    17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc
    17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat
    17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc
    17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat
    17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact
    17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa
    17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt
    17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa
    18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat
    18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat
    18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt
    18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat
    18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc
    18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg
    18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt
    18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt
    18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca
    18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc
    18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt
    18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg
    18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg
    18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat
    18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa
    18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat
    18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg
    19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa
    19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag
    19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga
    19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc
    19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa
    19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg
    19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc
    19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa
    19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg
    19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa
    19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa
    19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg
    19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag
    19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga
    19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag
    19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt
    19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct
    20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt
    20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac
    20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct
    20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga
    20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat
    20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg
    20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat
    20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt
    20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc
    20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag
    20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt
    20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca
    20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta
    20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg
    20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc
    20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt
    21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc
    21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg
    21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc
    21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc
    21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa
    21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat
    21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa
    21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat
    21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat
    21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt
    21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt
    21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta
    21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt
    21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt
    21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct
    21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct
    21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat
    22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc
    22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg
    22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca
    22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag
    22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc
    22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat
    22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg
    22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct
    22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca
    22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt
    22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc
    22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta
    22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct
    22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta
    22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct
    22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa
    22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa
    23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag
    23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc
    23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa
    23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc
    23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac
    23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga
    23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt
    23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc
    23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc
    23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat
    23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc
    23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc
    23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc
    23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga
    23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga
    23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa
    24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg
    24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat
    24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat
    24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct
    24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt
    24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga
    24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg
    24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt
    24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc
    24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt
    24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc
    24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg
    24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat
    24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa
    24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt
    24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg
    24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc
    25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag
    25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc
    25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact
    25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta
    25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg
    25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc
    25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag
    25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc
    25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca
    25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag
    25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag
    25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt
    25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag
    25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat
    25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa
    25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata
    25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg
    26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt
    26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat
    26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt
    26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag
    26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc
    26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact
    26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg
    26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat
    26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag
    26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt
    26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt
    26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata
    26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta
    26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat
    26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg
    26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata
    27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata
    27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg
    27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc
    27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat
    27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag
    27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga
    27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt
    27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac
    27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa
    27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg
    27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt
    27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat
    27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct
    27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata
    27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt
    27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat
    27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct
    28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg
    28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca
    28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat
    28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa
    28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca
    28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc
    28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg
    28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa
    28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt
    28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt
    28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt
    28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat
    28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa
    28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac
    28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac
    28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca
    28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct
    29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac
    29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa
    29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc
    29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga
    29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt
    29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca
    29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca
    29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt
    29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt
    29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca
    29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg
    29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt
    29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct
    29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg
    29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag
    29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct
    30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga
    30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact
    30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt
    30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa
    30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt
    30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg
    30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc
    30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc
    30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag
    30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga
    30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta
    30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata
    30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct
    30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata
    30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa
    30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca
    30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga
    31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg
    31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac
    31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat
    31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga
    31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg
    31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt
    31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt
    31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa
    31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat
    31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg
    31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac
    31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat
    31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg
    31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt
    31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt
    31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac
    31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta
    32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa
    32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag
    32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta
    32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt
    32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt
    32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact
    32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc
    32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat
    32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa
    32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta
    32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca
    32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg
    32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc
    32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt
    32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt
    32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt
    33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc
    33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac
    33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta
    33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac
    33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta
    33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta
    33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata
    33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata
    33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc
    33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat
    33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa
    33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct
    33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc
    33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt
    33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga
    33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct
    33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag
    34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc
    34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat
    34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac
    34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt
    34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga
    34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt
    34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc
    34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt
    34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca
    34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct
    34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac
    34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa
    34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc
    34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc
    34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg
    34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt
    34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac
    35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt
    35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat
    35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta
    35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag
    35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga
    35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc
    35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc
    35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt
    35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag
    35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat
    35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta
    35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat
    35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta
    35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg
    35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt
    35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa
    36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt
    36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag
    36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat
    36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga
    36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa
    36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt
    36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga
    36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct
    36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg
    36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg
    36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga
    36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag
    36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga
    36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta
    36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt
    36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag
    36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta
    37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa
    37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc
    37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc
    37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg
    37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc
    37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg
    37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac
    37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag
    37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg
    37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc
    37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt
    37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc
    37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt
    37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc
    37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc
    37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag
    37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa
    38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat
    38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc
    38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat
    38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga
    38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata
    38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa
    38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag
    38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt
    38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg
    38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc
    38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg
    38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa
    38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg
    38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa
    38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt
    38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg
    39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa
    39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt
    39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg
    39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc
    39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga
    39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga
    39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc
    39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca
    39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac
    39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa
    39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc
    39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg
    39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct
    39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa
    39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct
    39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga
    39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat
    40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg
    40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt
    40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa
    40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct
    40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag
    40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag
    40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc
    40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact
    40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg
    40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg
    40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag
    40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac
    40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag
    40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg
    40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca
    40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca
    40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct
    41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg
    41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc
    41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac
    41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg
    41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct
    41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc
    41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt
    41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag
    41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg
    41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta
    41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac
    41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc
    41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt
    41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata
    41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga
    41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg
    42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt
    42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc
    42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt
    42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt
    42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta
    42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg
    42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct
    42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt
    42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta
    42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag
    42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta
    42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca
    42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa
    42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg
    42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag
    42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac
    42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa
    43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt
    43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc
    43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct
    43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat
    43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat
    43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc
    43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt
    43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc
    43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt
    43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt
    43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc
    43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca
    43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag
    43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg
    43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa
    43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg
    43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt
    44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca
    44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc
    44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta
    44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact
    44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc
    44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca
    44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg
    44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg
    44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc
    44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac
    44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg
    44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta
    44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca
    44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc
    44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact
    44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt
    45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga
    45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt
    45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg
    45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc
    45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta
    45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg
    45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc
    45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac
    45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct
    45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca
    45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa
    45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa
    45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa
    45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat
    45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg
    45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt
    45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt
    46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca
    46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc
    46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca
    46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg
    46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt
    46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag
    46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct
    46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt
    46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc
    46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc
    46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc
    46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa
    46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc
    46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat
    46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg
    46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc
    46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac
    47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac
    47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca
    47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg
    47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca
    47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc
    47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga
    47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc
    47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa
    47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact
    47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag
    47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg
    47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc
    47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa
    47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga
    47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg
    47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga
    48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc
    48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct
    48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc
    48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg
    48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg
    48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt
    48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg
    48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa
    48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat
    48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc
    48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg
    48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga
    48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat
    48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg
    48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc
    48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt
    48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt
    49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg
    49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag
    49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc
    49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct
    49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat
    49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa
    49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc
    49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc
    49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc
    49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc
    49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta
    49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg
    49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa
    49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc
    49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag
    49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct
    49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt
    50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat
    50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc
    50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt
    50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct
    50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga
    50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca
    50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt
    50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca
    50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt
    50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg
    50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca
    50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa
    50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga
    50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct
    50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt
    50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac
    51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca
    51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca
    51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct
    51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca
    51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac
    51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc
    51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac
    51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa
    51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc
    51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct
    51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc
    51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc
    51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga
    51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt
    51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac
    51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg
    51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa
    52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga
    52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag
    52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa
    52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg
    52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat
    52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg
    52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca
    52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa
    52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag
    52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca
    52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt
    52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta
    52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa
    52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa
    52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac
    52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc
    52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca
    53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata
    53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt
    53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag
    53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc
    53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact
    53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc
    53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat
    53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag
    53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca
    53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag
    53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc
    53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc
    53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc
    53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt
    53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta
    53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct
    54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg
    54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac
    54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt
    54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg
    54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc
    54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc
    54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg
    54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga
    54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag
    54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc
    54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta
    54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta
    54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg
    54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta
    54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga
    54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa
    54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca
    55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc
    55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca
    55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc
    55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc
    55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc
    55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta
    55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg
    55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg
    55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt
    55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt
    55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg
    55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat
    55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac
    55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt
    55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa
    55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa
    55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta
    56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg
    56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag
    56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca
    56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt
    56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca
    56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt
    56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct
    56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg
    56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg
    56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa
    56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc
    56701 gccgcccctg gtgatggacc agcggggctt cgacga
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       THARGA        635 bp ds-DNA             BCT       06-MAR-1990
DEFINITION  T.acidophilum 5S rRNA gene.
ACCESSION   M32297
KEYWORDS    5S ribosomal RNA.
SOURCE      T.acidophilum (strain 122-1B2) DNA, clone pT5PH-23.
  ORGANISM  Thermoplasma acidophilum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Thermoplasmales.
REFERENCE   1  (bases 1 to 635)
  AUTHORS   Ree,H.K. and Zimmerman,R.A.
  TITLE     The organization and expression of the three rRNA genes from
            Thermoplasma acidophilum
  JOURNAL   Unpublished (1990) Univ. of Massachusetts, Dept. of Biochemistry,
            Amherst, MA 01003
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.K.Ree 23-FEB-1990.
FEATURES       from  to/span     description
    rRNA        213      335     5S rRNA
    RNA         211      346     5S RNA ( 3' end +/- 4 bp)
    site        184      191     put. promoter
    site        333      354     put. terminator
BASE COUNT      175 a    140 c    140 g    180 t
ORIGIN      
        1 ctgcagcctt atcatgggaa aatgggcatc tgctggagac gactatgtac ttcatgcatc
       61 catattatct tatgatatta aaatgctaaa gcgcatgatc gcttcggaag gaatatcgac
      121 agggcgagat tcgctgtctc ataatcgccc ggatctgcgc tgaagccgta aaatcacgaa
      181 aatcttatat agatgtgttc tatatagtgt tcggcaacgg tcatagcagc agggaaacac
      241 cagatcccat tccgaactcg acggttaagc ctgctgcgta ttgcgttgta ctgtatgccg
      301 cgagggtacg ggaagcgcaa tatgctgtta ccacttttga aatgaaagtt ttttacatct
      361 atctttaatt cagtggcaat gattaatcct tggtcatcat ctgatttctt cgactatgag
      421 aggctaaaga gaggaattcg gcataggcga tatgaacgtt cccttcgatc attttctctt
      481 caggacttat acttggccag cgatccatag actatatcga ctacgcttta aagcatcaca
      541 tgaaattcaa cgttatgact ggcctcatgc cttcaggaga gatgcatctt ggcaacaaga
      601 gcgccataga ccaggtcatc ttcttccaga agctt
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       MUSFILA       765 bp ds-DNA             ROD       06-MAR-1990
DEFINITION  Mouse filaggrin gene, partial cds (type A repeat subunit).
ACCESSION   M32300 J05198
KEYWORDS    epidermal matrix protein; filaggrin.
SOURCE      Mouse cell line 3T3 DNA, clone 6.1A2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 765)
  AUTHORS   Rothnagel,J.A. and Steinert,P.M.
  TITLE     The structure of the gene for mouse filaggrin and a comparison of
            the repeating units
  JOURNAL   J. Biol. Chem. 265, 1862-1865 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Rothnagel, 23-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  >   765     filaggrin (AA at 1)
    rpt           1      765     type A repeat
BASE COUNT      178 a    257 c    250 g     80 t
ORIGIN      
        1 ggtggactgg cagctgacgc ctccaggcgc tctggggcgc gtcaaggcca ggcatctgcc
       61 cagggtcgcg cagggtcaca aggccaagca cagggccgcg tcggctcgtc agctgacagg
      121 caagggcgtc gaggggtcag cgagagtcag gccagcgaca gcgagggcca ctccgacttc
      181 tcagaaggcc aggcagtagg agctcacagg cagtctggag ccggccagag gcacgagcag
      241 cggtcctccc gaggccagca cggctccgga ttctacccag tatactacta ctatgaacaa
      301 gaacacagcg aggaggagtc cgattcccag catcagcacg gccaccagca cgaacagcaa
      361 cgcgggcacc agcaccagca ccagcaccag catgagcacg aacagcccga gtccggccac
      421 aggcagcagc agtccagtgg gagaggacac cagggagccc accaggaaca gggaagggac
      481 agcgccaggt cccggggttc taaccaaggg cacagctcct ctcgccacca ggccgactct
      541 cccagggtca gcgcaagatc aggctcagga ggaagaggac agtcccccga cgcctctggc
      601 cgcagctcaa acaggaggga cagaccccgg cagcccagcc ccagccagtc cagtgacagc
      661 caagtccatt ctggagtcca ggtcgaaggc cggcgcgggc agtcctcatc tgccaacagg
      721 agggccggat ccagctccgg ctcaggggtc cagggggcct ctgca
//
LOCUS       MUSFILB       750 bp ds-DNA             ROD       06-MAR-1990
DEFINITION  Mouse filaggrin gene, partial cds (type B repeat subunit).
ACCESSION   M32301 J05198
KEYWORDS    epidermal matrix protein; filaggrin.
SOURCE      Mouse cell line 3T3 DNA, clone 6.1A2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 750)
  AUTHORS   Rothnagel,J.A. and Steinert,P.M.
  TITLE     The structure of the gene for mouse filaggrin and a comparison of
            the repeating units
  JOURNAL   J. Biol. Chem. 265, 1862-1865 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Rothnagel 23-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  >   750     filaggrin (AA at 1)
    rpt           1      750     type B repeat
BASE COUNT      174 a    252 c    248 g     76 t
ORIGIN      
        1 ggtggactgg cagctgacgc ctccaggcgc tctggggcgc ttcaaggcca ggcatctgcc
       61 cagggtcgcg cagggtcaca aggccaagca cagggccgcg tcggctcgtc agctgacagg
      121 caagggcgtc gaggggtcag cgagagtcag gccagcgaca gcgagggcca ctctgacttc
      181 tcagaaggcc aggcagtagg agctcacagg cagtctggag ccggccagag gcacgagcag
      241 cggtcctccc gaggccagca cggctccgga tactactatg agcaagaaca cagcgaggag
      301 gagtccgatt cccagcacca gcacggccac cagcacgaac agcaacgcgg gcaccagcac
      361 cagcaccagc accagcatga gcacgaacag cccgagtccg gccacaggca gcagcagtcc
      421 agtgggagag gacaccaggg agcccaccag gaacagggaa gggacagcgc caggtcccgg
      481 ggttctaacc aagggcacag ctcctctcgc caccaggccg actctcccag ggtcagcgca
      541 agatcaggct caggaggaag aggacagtcc cccgacgcct ctggccgcag ctcaaacagg
      601 agggacagac cccggcagcc cagccccagc cagtccagtg acagccaagt ccattctgga
      661 gtccaggtcg aaggccggcg cgggcagtcc tcatctgcca acaggagggc cggatccagc
      721 tccagctcag gggtccaggg ggcctctgca
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       MUSJUNDR     1596 bp ss-mRNA            ROD       17-JAN-1990
DEFINITION  Mouse junD proto-oncogene mRNA, complete cds.
ACCESSION   J05205
KEYWORDS    JUN-D protein, junD proto-oncogene.
SOURCE      Mouse myocyte cell line BC-3H1, cDNA to genomic RNA, clone jun16.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1596)
  AUTHORS   Li,L., Hu,J.-S. and Olson,E.N.
  TITLE     Different members of the jun proto-oncogene family exhibit distinct
            patterns of expression in response to type beta transforming growth
            factor
  JOURNAL   J. Biol. Chem. 265, 1556-1562 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Li, 12-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept         43     1068     JUN-D protein
BASE COUNT      264 a    559 c    492 g    280 t      1 others
ORIGIN      
        1 gagaggagat gacagagctt catgagtgat tcagtcccac ggatggaaac gcccttgtat
       61 ggcgaggagg cgctgagcgg cctggctgcg ggtgcgtcga gcgtcgctgg tgctactggg
      121 gcccccggcg gtggtggctt cgcgcccccg ggccgcgctt tccccggggc gcccccgacg
      181 agcagcatgc tgaagaaaga cgcgctgacg ctcagcctgg cggacgaggg agcggcggga
      241 ttgaaaccag ggtcggccac tgcaccttct gcgctgcgcc ccgacggcgc ccccgacggg
      301 ctgctggctt cgccggatct tgggctgctc aaactcgcgt cgccggagct ggagaggctg
      361 atcatccagt ccaacgggct ggtgaccact accccgacca gtacgcagtt cctctacccg
      421 aaggtggcag ccagcgagga gcaggagttc gccgaaggct tcgtcaaggc gctggaggac
      481 ctgcacaagc aaagccagct gggtgcggcc accgcggcca cctcaggggc tcccgcgcct
      541 cccgcgcccg ccgacctggc cgccaccccc ggggccacgg agaccccggt ctacgccaac
      601 ctgagcagtt tcgcgggtgg cgccgggccc cctgggggcg cggccaccgt ggctttcgcc
      661 gcggagccag tgcccttccc gccgcccccg ggcgcgctgg ggccgccgcc acctccgcat
      721 ccaccgcgcc tggccgcgct caaggacgag ccgcagaccg tgccggacgt gccgagcttc
      781 ggcgacagcc ctccgctgtc gcccatcgac atggacacgc aagaacgcat caaggcggag
      841 cgcaagaggc tgcgcaaccg catcgccgcc tccaaatgcc gcaagcgcaa gctggagcgt
      901 atctcgcgcc tggaggagaa agtcaagacc ctcaaaagcc agaacaccga gctggcgtcc
      961 accgccagcc tgctgcgcga gcaggtggcg cacgtcaaac agaaagtcct cagccacgtc
     1021 aacagcggct gccagctgct gccccagcac caggtcccgg cgtactgagc ccgagcncgg
     1081 ggcgcatgcg cggactagct gcggtggggg ggcgccccgg actctttcga gactcggtgc
     1141 ccccggactc gacaagccgg acccccctta actctggatg gggaacccga gcgcacgacc
     1201 cccgccctcg cgccgcctct ctacccccag tcctgcgcgt ggctgcccct ttgatcctcc
     1261 gagtaggggc tctaaggggc gcgcagcctc acgctctgcc tttccttttt tttttgcctt
     1321 ttggaagaga gaacgggagt gtccgattcc gccctattta tgtttctact cgggaacaaa
     1381 cgttggttgc gtgttgagtg tgtgtttcct tgtgttggtt ttttaaagaa acgggaagaa
     1441 gaaaaaaaaa accctccccc ctcccccccc cgctctctct cctcccgaca ccatcccctc
     1501 tctttttgtt tggttttgtt ttgctacgag tccacattcc tgttccgtaa tcctggttcg
     1561 ccggttttgt gttttcagta aagtcttcgt tacgcc
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       CECCOLE3A    3066 bp ds-DNA             BCT       31-AUG-1987
DEFINITION  Plasmid ColE3-CA38 colicin E3, immunity proteins E3 and E8,
            and lysis genes, complete cds.
ACCESSION   J01574 J01575 M14038 X02397
KEYWORDS    colicin; colicin release protein; immune response gene;
            immunity protein; lysis protein.
SOURCE      Plasmid ColE3-CA38 (from E.coli) DNA.
  ORGANISM  Plasmid Colicin E3-CA38
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1663 to 2662)
  AUTHORS   Masaki,H. and Ohta,T.
  TITLE     A plasmid region encoding the active fragment and the inhibitor
            protein of colicin E3-CA38
  JOURNAL   FEBS Lett. 149, 129-132 (1982)
  STANDARD  full staff_review
REFERENCE   2  (bases 1612 to 2662)
  AUTHORS   Mock,M., Miyada,C.G. and Gunsalus,R.P.
  TITLE     Nucleotide sequence for the catalytic domain of colicin E3 and its
            immunity protein. Evidence for a third gene overlapping colicin
  JOURNAL   Nucleic Acids Res. 11, 3547-3557 (1983)
  STANDARD  full staff_review
REFERENCE   3  (bases 2251 to 3066)
  AUTHORS   Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P.
  TITLE     Characterization and nucleotide sequence of a colicin-release gene
            in the hic region of plasmid ColE3-CA38
  JOURNAL   Gene 29, 175-184 (1984)
  STANDARD  full staff_review
REFERENCE   4  (bases 2251 to 3066)
  AUTHORS   Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P.
  TITLE     Corrigenda: Characterization and nucleotide sequence of a
            colicin-release gene in the hic region of plasmid ColE3-CA38
  JOURNAL   Gene 42, 351-355 (1986)
  STANDARD  full staff_review
REFERENCE   5  (bases 1 to 2854)
  AUTHORS   Masaki,H. and Ohta,T.
  TITLE     Colicin E3 and its immunity genes
  JOURNAL   J. Mol. Biol. 182, 217-227 (1985)
  STANDARD  simple automatic
REFERENCE   6  (bases 1 to 3066; revises [5])
  AUTHORS   Akutsu,A., Masaki,H. and Ohta,T.
  TITLE     Molecular structure and immunity specificity of colicin E6, an
            evolutionary intermediate between E-group colicins and cloacin DF13
  JOURNAL   J. Bacteriol. 171, 6430-6436 (1989)
  STANDARD  simple staff_review
COMMENT     [1] refers to colicin E3 as protein A and the immunity protein E3
            as protein B. [2] refers to the colE3 gene as the ceaC gene and the
            immE3 gene as the ceaC gene.
            
            Draft entry and printed copy of sequence for [3] kindly provide by
            R.J.Watson, May 1985.
FEATURES       from  to/span     description
    pept        331     1986     colicin E3
    pept       1996     2253     immunity protein E3
    pept       2420     2677     immunity protein E8 (gtg start codon)
    pept       2739     2882     lysis protein
    binding     317      322     ribosome binding site (pot.)
    binding    1988     1993     ribosome binding site (pot.)
    revision    132      134     atc in [6]; attc in [5]
    revision    755      756     gc in [4]; cg in [3]
    revision   1265     1267     gca in [4]; ga in [3]
    revision   1348     1349     tt in [4]; ttt in [3]
BASE COUNT      897 a    546 c    796 g    827 t
ORIGIN      
        1 aactcggttt taatcagacc tggcatgagt ggaagcggga cgaacagcac aggcaacaac
       61 aacgccgccc cgggcacttc cggggcatga gtatgtgata tccggggctg caccccggac
      121 cccgccaaca catcacgggc cacaaaattt tttgtggccc gctctgcgtt ttctaagtgt
      181 tatccctcct gatttctaaa aaattttcca cctgaacttg acagaaaaaa cgatgacgag
      241 tactttttga tctgtacata aacccagtgg ttttatgtac agtattaatc gtgtaatcaa
      301 ttgttttaac gcttaaaaga gggaattttt atgagcggtg gcgatggacg cggccataac
      361 acgggcgcgc atagcacaag tggtaacatt aatggtggcc cgaccgggct tggtgtaggt
      421 ggtggtgctt ctgatggctc cggatggagt tcggaaaata acccgtgggg tggtggttcc
      481 ggtagcggca ttcactgggg tggtggttcc ggtcatggta atggcggggg gaatggtaat
      541 tccggtggtg gttcgggaac aggcggtaat ctgtcagcag tagctgcgcc agtggcattt
      601 ggttttccgg cactttccac tccaggagct ggcggtctgg cggtcagtat ttcagcggga
      661 gcattatcgg cagctattgc tgatattatg gctgccctga aaggaccgtt taaatttggt
      721 ctttgggggg tggctttata tggtgtattg ccatcacaaa tagcgaaaga tgaccccaat
      781 atgatgtcaa agattgtgac gtcattaccc gcagatgata ttactgaatc acctgtcagt
      841 tcattacctc tcgataaggc aacagtaaac gtaaatgttc gtgttgttga tgatgtaaaa
      901 gacgagcgac agaatatttc ggttgtttca ggtgttccga tgagtgttcc ggtggttgat
      961 gcaaaaccta ccgaacgtcc gggtgttttt acggcatcaa ttccaggtgc acctgttctg
     1021 aatatttcag ttaataacag tacgccagca gtacagacat taagcccagg tgttacaaat
     1081 aatactgata aggatgttcg cccggcagga tttactcagg gtggtaatac cagggatgca
     1141 gttattcgat tcccgaagga cagcggtcat aatgccgtat atgtttcagt gagtgatgtt
     1201 cttagccctg accaggtaaa acaacgtcaa gatgaagaaa atcgccgtca gcaggaatgg
     1261 gatgctacgc atccggttga agcggctgag cgaaattatg aacgcgcgcg tgcagagctg
     1321 aatcaggcaa atgaagatgt tgccagaaat caggagcgac aggctaaagc tgttcaggtt
     1381 tataattcgc gtaaaagcga acttgatgca gcgaataaaa ctcttgctga tgcaatagct
     1441 gaaataaaac aatttaatcg atttgcccat gacccaatgg ctggcggtca cagaatgtgg
     1501 caaatggccg ggcttaaagc ccagcgggcg cagacggatg taaataataa gcaggctgca
     1561 tttgatgctg ctgcaaaaga gaagtcagat gctgatgctg cattgagttc tgctatggaa
     1621 agcaggaaga agaaagaaga taagaaaagg agtgctgaaa ataatttaaa cgatgaaaag
     1681 aataagccca gaaaaggttt taaagattac gggcatgatt atcatccagc tccgaaaact
     1741 gagaatatta aagggcttgg tgatcttaag cctgggatac caaaaacacc aaagcagaat
     1801 ggtggtggaa aacgcaagcg ctggactgga gataaagggc gtaagattta tgagtgggat
     1861 tctcagcatg gtgagcttga ggggtatcgt gccagtgatg gtcagcatct tggctcattt
     1921 gaccctaaaa caggcaatca gttgaaaggt ccagatccga aacgaaatat caagaaatat
     1981 ctttgagagg aagttatggg acttaaattg gatttaactt ggtttgataa aagtacagaa
     2041 gattttaagg gtgaggagta ttcaaaagat tttggagatg acggttcagt tatggaaagt
     2101 ctaggtgtgc cttttaagga taatgttaat aacggttgct ttgatgttat agctgaatgg
     2161 gtacctttgc tacaaccata ctttaatcat caaattgata tttccgataa tgagtatttt
     2221 gtttcgtttg attatcgtga tggtgattgg tgatcaaata ttatcaggga tgagttgata
     2281 tacgggcttc tagtgttcat ggatgaacgc tggagcctcc aaatgtagaa atgttatatt
     2341 ttttattgag ttcttggtta taattgctcc gcaatgattt aaataagcat tatttaaaac
     2401 attctcagga gaggtgaagg tggagctaaa aaaaagtatt ggtgattaca ctgaaaccga
     2461 attcaaaaaa tttattgaag acatcatcaa ttgtgaaggt gatgaaaaaa aacaggatga
     2521 taacctcgag tattttataa atgttactga gcatcctagt ggttctgatc tgatttatta
     2581 cccagaaggt aataatgatg gtagccctga aggtgttatt aaagagatta aagaatggcg
     2641 agccgctaac ggtaagtcag gatttaaaca gggctgaaat atgaatgccg gttgtttatg
     2701 gatgaatggc tggcattctt tcacaacaag gagtcgttat gaaaaaaata acagggatta
     2761 ttttattgct tcttgcagtc attattctgt ctgcatgtca ggcaaactat atccgggatg
     2821 ttcagggcgg gaccgtatct ccgtcatcaa cagctgaagt gaccggatta gcaacgcagt
     2881 aacccgaaat cctctttgac aaaaacaaag cgtgtcaggc tgattctgat gcgctttttt
     2941 tttgaaatgt cacaaaaatt ccatgtggga gatgggatct aaaatcctcg tgcagaactt
     3001 tccatccagg gggagaaaac ttgtcgtttt gagccgttcg gtgttcagaa cgcacgaaac
     3061 cgatcg
//
LOCUS       CECCOLE6A    3065 bp ds-DNA             BCT       16-FEB-1990
DEFINITION  Plasmid ColE6-CT14 colicin E6, immunity proteins E6 and E8, and
            lysis protein genes, complete cds.
ACCESSION   M31808
KEYWORDS    colicin E6; immunity protein; lysis protein.
SOURCE      Plasmid ColE6-CT14 (from E.coli) DNA.
  ORGANISM  Plasmid Colicin E6-CT14
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 3065)
  AUTHORS   Akutsu,A., Masaki,H. and Ohta,T.
  TITLE     Molecular structure and immunity specificity of colicin E6, an
            evolutionary intermediate between E-group colicins and cloacin DF13
  JOURNAL   J. Bacteriol. 171, 6430-6436 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        331     1986     colicin E6
    pept       1996     2253     immunity protein E6
    pept       2420     2677     immunity protein E8 (gtg start codon)
    pept       2739     2882     lysis protein
BASE COUNT      903 a    545 c    799 g    818 t
ORIGIN      
        1 aactcggttt taatcagacc tggcatgagt ggaagcggga cgaacagcac aggcaacaac
       61 aacgccgccc cgggcacttc cggggcatga gtatgtgata tccggggctg caccccggac
      121 cccgccaaca catcacgggc cacaaaattt tttgtggccc gctctgcgtt ttctaagtgt
      181 tatccctcct gatttctaaa aaattttcca cctgaacttg acagaaaaaa agatgacgag
      241 tactttttga tctgtacata aaaccagtgg ttttatgtac agtattaatc atgtaattaa
      301 ttgttttaac gcttaaaaga gggaattttt atgagcggtg gcgatggacg cggccataac
      361 acgggcgcgc atagcacaag tggtaacatt aatggtggcc cgaccgggct tggtgtaggt
      421 ggtggtgctt ctgatggctc cggatggagt tcggaaaata acccgtgggg tggtggttcc
      481 ggtagcggca ttcactgggg tggtggttcc ggtcatggta atggcggggg gaatggtaat
      541 tccggtggtg gctcgggaac aggcggtaat ctgtcagcag tagctgcgcc agtggcattt
      601 ggttttccgg cactttccac tccaggagct ggcggtctgg cggtcagtat ttcagcggga
      661 gcattatcgg cagctattgc tgatattatg gctgccctga aaggaccgtt taaatttggt
      721 ctttgggggg tggctttata tggtgtattg ccatcacaaa tagcgaaaga tgaccccaat
      781 atgatgtcaa agattgtgac gtcattaccc gcagatgata ttactgaatc acctgtcagt
      841 tcattacctc tcgataaggc aacagtaaac gtaaatgttc gtgttgttga tgatgtaaaa
      901 gacgaacgac agaatatttc ggttgtttca ggtgttccga tgagtgttcc ggtggttgat
      961 gcaaaaccta ccgaacgtcc aggtgttttt acggcatcaa ttccaggtgc acctgttctg
     1021 aatatttcag ttaataacag tacgccagca gtacagacat taagcccagg tgttacaaat
     1081 aatactgata aggatgttcg cccggcagga tttactcagg ggggtaatac cagggatgca
     1141 gttattcgat tcccgaagga cagcggtcat aatgccgtat atgtttcagt gagtgatgtt
     1201 cttagccctg accaggtaaa acaacgtcag gatgaagaaa atcgccgtca gcaggaatgg
     1261 gatgctacgc atccggttga agcggctgag cgaaattatg aacgcgcgcg tgcagagctg
     1321 aatcaggcaa atgaagatgt tgccagaaat caggagcgac aggctaaagc tgttcaggtt
     1381 tataattcgc gtaaaagcga acttgatgca gcgaataaaa ctcttgctga tgcaatagct
     1441 gaaataaaac aatttaatcg atttgcccat gacccaatgg ctggcggtca cagaatgtgg
     1501 caaatggccg ggcttaaagc ccagcgggcg cagacggatg taaataataa gcaggctgca
     1561 tttgatgctg ctgcaaaaga gaagtcagat gctgatgctg cattgagttc tgctatggaa
     1621 agcaggaaga agaaagaaga taagaaaagg agcgctgaaa ataaattaaa cgaggaaaaa
     1681 aacaagcctc gcaagggagt taaagattac ggtcatgatt atcatccaga tcctaaaact
     1741 gaagatataa aagggctggg tgagttaaaa gagggtaaac caaaaactcc aaagcaaggt
     1801 ggtggcggta aacgtgctag atggtatgga gataaagggc gtaagattta tgagtgggac
     1861 tctcagcatg gtgagcttga ggggtatcgt gccagtgatg gtcagcatct tggctcattc
     1921 gagcctaaga ctggtaatca gttgaaagga cctgatccaa aacgaaatat caaaaagtat
     1981 ctttgagagg atgttatggg gcttaaatta catattaatt ggtttgataa gacgaccgag
     2041 gaatttaaag gtggtgagta ttcaaaagat tttggagatg atggctcggt cattgaacgt
     2101 cttggaatgc ctttaaaaga taatatcaat aatggttggt ttgatgttat agctgaatgg
     2161 gtacctttgc tacaaccata ctttaatcat caaattgata tttccgataa tgagtatttt
     2221 gtttcgtttg attatcgtga tggtgattgg tgatcaaata ttatcaggga tgagttgatg
     2281 tacgggcttc tagtgttcat ggatgaacgc tggagcctcc aaatgtagaa gtgttatatt
     2341 ttttattgag ttcttggtta taattgctcc gcaataattt aaataggcat tatttaaaac
     2401 attctcagga gaggtgaagg tggagctaaa gaaaagtatt ggtgattaca ctgaaaccga
     2461 attcaaaaaa attattgaaa acatcatcaa ttgtgaaggt gatgaaaaaa aacaggatga
     2521 taacctcgag cattttataa gtgttactga gcatcctagt ggttctgatc tgatttatta
     2581 cccagaaggt aataatgatg gtagccctga agctgttatt aaagagatta aagaatggcg
     2641 agctgctaac ggtaagtcag gatttaaaca gggctgaaat atgaatgccg gttgtttaag
     2701 gatgaatgac tggcattctt tcacaacaag gagtcgttat gaaaaaaata acagggatta
     2761 ttttattgct tcttgcagtc attattctgg ctgcatgtca ggcaaactat atccgtgatg
     2821 ttcagggcgg gactgtatca ccgtcgtcaa ctgctgaact gaccggagtg gaaacgcagt
     2881 aacccgaaat cctctttgac aaaaacaaag cgtgtcaggc tgattctgat gcgctttttt
     2941 ttgaaatgtc acaaaaattc catgtgggag atgggatcta aaatccccgt gcagaacttt
     3001 ccatccagag ggagaaaact tgtcgttttg agccgttcgg tgttcagaac gcacgaaacc
     3061 gatcg
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATCGM1AC1   2238 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
            1 and 2.
ACCESSION   M32476 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     1 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2238)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene and analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        704      767     carcinoembryonic antigen-related protein
                                 (CGM1), exon 1
               1770  +  2129     carcinoembryonic antigen-related protein, exon
                                 2
    pre-msg     512  >  2238     CGM1 mRNA and introns
    IVS         768     1769     CGM1 intron A
    IVS        2130  >  2238     CGM1 intron B
BASE COUNT      615 a    532 c    539 g    552 t
ORIGIN      
        1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
       61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
      121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
      181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
      241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
      301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
      361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
      421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
      481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
      541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
      601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
      661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
      721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
      781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
      841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
      901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
      961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
     1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
     1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
     1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
     1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
     1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
     1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
     1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
     1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
     1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
     1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
     1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
     1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
     1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
     1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
     1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
     1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
     1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
     2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
     2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
     2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
     2221 tgcattgatc tacatgtt
//
LOCUS       RATCGM1AC2    539 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32477 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     2 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 539)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   539     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      126 a    127 c    138 g    148 t
ORIGIN      
        1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
       61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
      121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
      181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
      241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
      301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
      361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
      421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
      481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS       RATCGM1AC3    828 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32478 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     3 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 828)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   828     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      193 a    255 c    190 g    190 t
ORIGIN      
        1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
       61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
      121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
      181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
      241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
      301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
      361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
      421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
      481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
      541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
      601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
      661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
      721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
      781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS       RATCGM1AC4    642 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 3.
ACCESSION   M32479 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     4 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 642)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    61  +   420     carcinoembryonic antigen-related protein
                                 (CGM1), exon 3
    pre-msg <     1  >   642     CGM1 mRNA and introns
    IVS     <     1       60     CGM1 intron B
    IVS         421  >   642     CGM1 intron C
BASE COUNT      151 a    160 c    154 g    177 t
ORIGIN      
        1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
       61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
      121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
      181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
      241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
      301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
      361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
      421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
      481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
      541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
      601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS       RATCGM1AC5    616 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 4.
ACCESSION   M32480 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     5 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 616)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    90  +   449     carcinoembryonic antigen-related protein
                                 (CGM1), exon 4
    pre-msg <     1  >   616     CGM1 mRNA and introns
    IVS     <     1       89     CGM1 intron C
    IVS         450      616     CGM1 intron D
BASE COUNT      152 a    153 c    147 g    164 t
ORIGIN      
        1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
       61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
      121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
      181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
      241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
      301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
      361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
      421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
      481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
      541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
      601 ttgtctgatt cccttc
//
LOCUS       RATCGM1AC6    654 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron 4.
ACCESSION   M32481 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     6 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 654)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   654     carcinoembryonic antigen-related protein
                                 intron D
BASE COUNT      185 a    146 c    155 g    168 t
ORIGIN      
        1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
       61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
      121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
      181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
      241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
      301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
      361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
      421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
      481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
      541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
      601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS       RATCGM1AC7    492 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 5.
ACCESSION   M32482 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     7 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 492)
  AUTHORS   Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   118  +   471     carcinoembryonic antigen-related protein
                                 (CGM1), exon 5
    pre-msg <     1  >   492     CGM1 mRNA and introns
    IVS     <     1      117     CGM1 intron D
    IVS         472      492     CGM1 intron E
BASE COUNT      134 a    134 c    105 g    119 t
ORIGIN      
        1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
       61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
      121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
      181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
      241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
      301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
      361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
      421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
      481 tctcaaggtc tc
//
LOCUS       RATCGM1AC8   1341 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 6.
ACCESSION   M32483 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     8 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1341)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   640  /   987     carcinoembryonic antigen-related protein
                                 (CGM1), exon 6
    pre-msg <     1  >  1341     CGM1 mRNA and introns
    IVS     <     1      639     CGM1 intron E
    IVS         988     1341     CGM1 intron F
BASE COUNT      357 a    329 c    321 g    334 t
ORIGIN      
        1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
       61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
      121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
      181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
      241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
      301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
      361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
      421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
      481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
      541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
      601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
      661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
      721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
      781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
      841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
      901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
      961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
     1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
     1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
     1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
     1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
     1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
     1321 atccgttgcc tgacggtcga c
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATCGM1AC1   2238 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
            1 and 2.
ACCESSION   M32476 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     1 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2238)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene and analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        704      767     carcinoembryonic antigen-related protein
                                 (CGM1), exon 1
               1770  +  2129     carcinoembryonic antigen-related protein, exon
                                 2
    pre-msg     512  >  2238     CGM1 mRNA and introns
    IVS         768     1769     CGM1 intron A
    IVS        2130  >  2238     CGM1 intron B
BASE COUNT      615 a    532 c    539 g    552 t
ORIGIN      
        1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
       61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
      121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
      181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
      241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
      301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
      361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
      421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
      481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
      541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
      601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
      661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
      721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
      781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
      841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
      901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
      961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
     1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
     1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
     1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
     1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
     1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
     1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
     1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
     1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
     1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
     1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
     1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
     1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
     1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
     1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
     1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
     1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
     1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
     2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
     2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
     2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
     2221 tgcattgatc tacatgtt
//
LOCUS       RATCGM1AC2    539 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32477 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     2 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 539)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   539     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      126 a    127 c    138 g    148 t
ORIGIN      
        1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
       61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
      121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
      181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
      241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
      301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
      361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
      421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
      481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS       RATCGM1AC3    828 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32478 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     3 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 828)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   828     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      193 a    255 c    190 g    190 t
ORIGIN      
        1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
       61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
      121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
      181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
      241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
      301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
      361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
      421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
      481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
      541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
      601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
      661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
      721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
      781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS       RATCGM1AC4    642 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 3.
ACCESSION   M32479 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     4 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 642)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    61  +   420     carcinoembryonic antigen-related protein
                                 (CGM1), exon 3
    pre-msg <     1  >   642     CGM1 mRNA and introns
    IVS     <     1       60     CGM1 intron B
    IVS         421  >   642     CGM1 intron C
BASE COUNT      151 a    160 c    154 g    177 t
ORIGIN      
        1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
       61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
      121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
      181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
      241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
      301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
      361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
      421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
      481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
      541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
      601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS       RATCGM1AC5    616 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 4.
ACCESSION   M32480 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     5 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 616)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    90  +   449     carcinoembryonic antigen-related protein
                                 (CGM1), exon 4
    pre-msg <     1  >   616     CGM1 mRNA and introns
    IVS     <     1       89     CGM1 intron C
    IVS         450      616     CGM1 intron D
BASE COUNT      152 a    153 c    147 g    164 t
ORIGIN      
        1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
       61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
      121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
      181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
      241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
      301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
      361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
      421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
      481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
      541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
      601 ttgtctgatt cccttc
//
LOCUS       RATCGM1AC6    654 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron 4.
ACCESSION   M32481 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     6 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 654)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   654     carcinoembryonic antigen-related protein
                                 intron D
BASE COUNT      185 a    146 c    155 g    168 t
ORIGIN      
        1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
       61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
      121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
      181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
      241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
      301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
      361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
      421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
      481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
      541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
      601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS       RATCGM1AC7    492 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 5.
ACCESSION   M32482 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     7 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 492)
  AUTHORS   Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   118  +   471     carcinoembryonic antigen-related protein
                                 (CGM1), exon 5
    pre-msg <     1  >   492     CGM1 mRNA and introns
    IVS     <     1      117     CGM1 intron D
    IVS         472      492     CGM1 intron E
BASE COUNT      134 a    134 c    105 g    119 t
ORIGIN      
        1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
       61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
      121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
      181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
      241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
      301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
      361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
      421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
      481 tctcaaggtc tc
//
LOCUS       RATCGM1AC8   1341 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 6.
ACCESSION   M32483 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     8 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1341)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   640  /   987     carcinoembryonic antigen-related protein
                                 (CGM1), exon 6
    pre-msg <     1  >  1341     CGM1 mRNA and introns
    IVS     <     1      639     CGM1 intron E
    IVS         988     1341     CGM1 intron F
BASE COUNT      357 a    329 c    321 g    334 t
ORIGIN      
        1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
       61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
      121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
      181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
      241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
      301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
      361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
      421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
      481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
      541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
      601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
      661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
      721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
      781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
      841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
      901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
      961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
     1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
     1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
     1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
     1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
     1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
     1321 atccgttgcc tgacggtcga c
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATCGM1AA    3190 bp ss-mRNA            ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (rnCGM1) mRNA,
            complete cds.
ACCESSION   M32474 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SOURCE      R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation,
            cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3190)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,F.A. and Zimmermann,W.
  TITLE     cDNA and gene analysis imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1900) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        122     2251     carcinoembryonic antigen-related protein
                                 precursor (rnCGM1)
    sigp        122      221     carcinoembryonic antigen-related protein
                                 signal peptide
    matp        222     2248     rnCGM1 protein
BASE COUNT      871 a    798 c    693 g    828 t
ORIGIN      
        1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta
       61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca
      121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct
      181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc
      241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga
      301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact
      361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt
      421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac
      481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt
      541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc
      601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga
      661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc
      721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac
      781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
      841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt
      901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc
      961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga
     1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc
     1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac
     1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
     1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt
     1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc
     1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct
     1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta
     1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta
     1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact
     1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac
     1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt
     1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc
     1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag
     1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc
     1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa
     1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca
     1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg
     2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct
     2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag
     2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct
     2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga
     2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg
     2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca
     2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac
     2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc
     2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct
     2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt
     2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag
     2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa
     2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat
     2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct
     2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag
     2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct
     3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt
     3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag
     3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt
     3181 ttctccaaat
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATCGM1AC1   2238 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
            1 and 2.
ACCESSION   M32476 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     1 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2238)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene and analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        704      767     carcinoembryonic antigen-related protein
                                 (CGM1), exon 1
               1770  +  2129     carcinoembryonic antigen-related protein, exon
                                 2
    pre-msg     512  >  2238     CGM1 mRNA and introns
    IVS         768     1769     CGM1 intron A
    IVS        2130  >  2238     CGM1 intron B
BASE COUNT      615 a    532 c    539 g    552 t
ORIGIN      
        1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
       61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
      121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
      181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
      241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
      301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
      361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
      421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
      481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
      541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
      601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
      661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
      721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
      781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
      841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
      901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
      961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
     1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
     1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
     1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
     1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
     1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
     1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
     1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
     1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
     1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
     1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
     1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
     1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
     1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
     1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
     1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
     1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
     1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
     2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
     2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
     2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
     2221 tgcattgatc tacatgtt
//
LOCUS       RATCGM1AC2    539 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32477 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     2 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 539)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   539     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      126 a    127 c    138 g    148 t
ORIGIN      
        1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
       61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
      121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
      181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
      241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
      301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
      361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
      421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
      481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS       RATCGM1AC3    828 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32478 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     3 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 828)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   828     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      193 a    255 c    190 g    190 t
ORIGIN      
        1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
       61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
      121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
      181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
      241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
      301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
      361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
      421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
      481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
      541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
      601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
      661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
      721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
      781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS       RATCGM1AC4    642 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 3.
ACCESSION   M32479 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     4 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 642)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    61  +   420     carcinoembryonic antigen-related protein
                                 (CGM1), exon 3
    pre-msg <     1  >   642     CGM1 mRNA and introns
    IVS     <     1       60     CGM1 intron B
    IVS         421  >   642     CGM1 intron C
BASE COUNT      151 a    160 c    154 g    177 t
ORIGIN      
        1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
       61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
      121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
      181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
      241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
      301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
      361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
      421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
      481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
      541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
      601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS       RATCGM1AC5    616 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 4.
ACCESSION   M32480 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     5 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 616)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    90  +   449     carcinoembryonic antigen-related protein
                                 (CGM1), exon 4
    pre-msg <     1  >   616     CGM1 mRNA and introns
    IVS     <     1       89     CGM1 intron C
    IVS         450      616     CGM1 intron D
BASE COUNT      152 a    153 c    147 g    164 t
ORIGIN      
        1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
       61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
      121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
      181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
      241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
      301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
      361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
      421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
      481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
      541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
      601 ttgtctgatt cccttc
//
LOCUS       RATCGM1AC6    654 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron 4.
ACCESSION   M32481 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     6 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 654)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   654     carcinoembryonic antigen-related protein
                                 intron D
BASE COUNT      185 a    146 c    155 g    168 t
ORIGIN      
        1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
       61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
      121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
      181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
      241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
      301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
      361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
      421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
      481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
      541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
      601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS       RATCGM1AC7    492 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 5.
ACCESSION   M32482 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     7 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 492)
  AUTHORS   Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   118  +   471     carcinoembryonic antigen-related protein
                                 (CGM1), exon 5
    pre-msg <     1  >   492     CGM1 mRNA and introns
    IVS     <     1      117     CGM1 intron D
    IVS         472      492     CGM1 intron E
BASE COUNT      134 a    134 c    105 g    119 t
ORIGIN      
        1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
       61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
      121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
      181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
      241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
      301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
      361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
      421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
      481 tctcaaggtc tc
//
LOCUS       RATCGM1AC8   1341 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 6.
ACCESSION   M32483 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     8 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1341)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   640  /   987     carcinoembryonic antigen-related protein
                                 (CGM1), exon 6
    pre-msg <     1  >  1341     CGM1 mRNA and introns
    IVS     <     1      639     CGM1 intron E
    IVS         988     1341     CGM1 intron F
BASE COUNT      357 a    329 c    321 g    334 t
ORIGIN      
        1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
       61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
      121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
      181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
      241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
      301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
      361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
      421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
      481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
      541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
      601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
      661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
      721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
      781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
      841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
      901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
      961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
     1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
     1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
     1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
     1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
     1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
     1321 atccgttgcc tgacggtcga c
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATCGM1AC1   2238 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
            1 and 2.
ACCESSION   M32476 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     1 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2238)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene and analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        704      767     carcinoembryonic antigen-related protein
                                 (CGM1) precursor, exon 1
               1770  +  2129     carcinoembryonic antigen-related protein, exon
                                 2
    sigp        704      767     carcinoembryonic antigen-related protein
                                 (CGM1) signal peptide
               1770     1804     carcinoembryonic antigen-related protein
                                 signal peptide
    matp       1805  +  2129     carcinoembryonic antigen-related protein
    pre-msg     512  >  2238     CGM1 mRNA and introns
    IVS         768     1769     CGM1 intron A
    IVS        2130  >  2238     CGM1 intron B
BASE COUNT      615 a    532 c    539 g    552 t
ORIGIN      
        1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
       61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
      121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
      181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
      241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
      301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
      361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
      421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
      481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
      541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
      601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
      661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
      721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
      781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
      841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
      901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
      961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
     1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
     1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
     1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
     1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
     1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
     1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
     1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
     1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
     1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
     1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
     1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
     1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
     1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
     1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
     1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
     1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
     1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
     2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
     2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
     2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
     2221 tgcattgatc tacatgtt
//
LOCUS       RATCGM1AC2    539 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32477 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     2 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 539)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   539     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      126 a    127 c    138 g    148 t
ORIGIN      
        1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
       61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
      121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
      181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
      241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
      301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
      361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
      421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
      481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS       RATCGM1AC3    828 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32478 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     3 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 828)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   828     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      193 a    255 c    190 g    190 t
ORIGIN      
        1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
       61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
      121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
      181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
      241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
      301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
      361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
      421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
      481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
      541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
      601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
      661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
      721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
      781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS       RATCGM1AC4    642 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 3.
ACCESSION   M32479 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     4 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 642)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    61  +   420     carcinoembryonic antigen-related protein
                                 (CGM1), exon 3
    matp    +    61  +   420     carcinoembryonic antigen-related protein
    pre-msg <     1  >   642     CGM1 mRNA and introns
    IVS     <     1       60     CGM1 intron B
    IVS         421  >   642     CGM1 intron C
BASE COUNT      151 a    160 c    154 g    177 t
ORIGIN      
        1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
       61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
      121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
      181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
      241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
      301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
      361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
      421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
      481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
      541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
      601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS       RATCGM1AC5    616 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 4.
ACCESSION   M32480 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     5 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 616)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    90  +   449     carcinoembryonic antigen-related protein
                                 (CGM1), exon 4
    matp    +    90  +   449     carcinoembryonic antigen-related protein
    pre-msg <     1  >   616     CGM1 mRNA and introns
    IVS     <     1       89     CGM1 intron C
    IVS         450      616     CGM1 intron D
BASE COUNT      152 a    153 c    147 g    164 t
ORIGIN      
        1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
       61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
      121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
      181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
      241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
      301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
      361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
      421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
      481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
      541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
      601 ttgtctgatt cccttc
//
LOCUS       RATCGM1AC6    654 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron 4.
ACCESSION   M32481 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     6 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 654)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   654     carcinoembryonic antigen-related protein
                                 intron D
BASE COUNT      185 a    146 c    155 g    168 t
ORIGIN      
        1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
       61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
      121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
      181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
      241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
      301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
      361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
      421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
      481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
      541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
      601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS       RATCGM1AC7    492 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 5.
ACCESSION   M32482 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     7 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 492)
  AUTHORS   Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   118  +   471     carcinoembryonic antigen-related protein
                                 (CGM1), exon 5
    matp    +   118  +   471     carcinoembryonic antigen-related protein
    pre-msg <     1  >   492     CGM1 mRNA and introns
    IVS     <     1      117     CGM1 intron D
    IVS         472      492     CGM1 intron E
BASE COUNT      134 a    134 c    105 g    119 t
ORIGIN      
        1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
       61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
      121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
      181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
      241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
      301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
      361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
      421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
      481 tctcaaggtc tc
//
LOCUS       RATCGM1AC8   1341 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 6.
ACCESSION   M32483 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     8 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1341)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   640  /   987     carcinoembryonic antigen-related protein
                                 (CGM1), exon 6
    matp    +   640  /   987     carcinoembryonic antigen-related protein
    pre-msg <     1  >  1341     CGM1 mRNA and introns
    IVS     <     1      639     CGM1 intron E
    IVS         988     1341     CGM1 intron F
BASE COUNT      357 a    329 c    321 g    334 t
ORIGIN      
        1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
       61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
      121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
      181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
      241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
      301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
      361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
      421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
      481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
      541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
      601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
      661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
      721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
      781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
      841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
      901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
      961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
     1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
     1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
     1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
     1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
     1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
     1321 atccgttgcc tgacggtcga c
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATCGM1AA    3190 bp ss-mRNA            ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) mRNA, complete
            cds.
ACCESSION   M32474 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SOURCE      R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation,
            cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3190)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analysis imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        122     2251     carcinoembryonic antigen-related protein
                                 precursor (CGM1)
    sigp        122      220     carcinoembryonic antigen-related protein
                                 signal peptide
    matp        221     2248     CGM1 protein
BASE COUNT      871 a    798 c    693 g    828 t
ORIGIN      
        1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta
       61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca
      121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct
      181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc
      241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga
      301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact
      361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt
      421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac
      481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt
      541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc
      601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga
      661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc
      721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac
      781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
      841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt
      901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc
      961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga
     1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc
     1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac
     1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
     1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt
     1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc
     1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct
     1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta
     1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta
     1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact
     1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac
     1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt
     1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc
     1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag
     1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc
     1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa
     1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca
     1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg
     2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct
     2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag
     2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct
     2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga
     2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg
     2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca
     2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac
     2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc
     2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct
     2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt
     2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag
     2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa
     2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat
     2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct
     2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag
     2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct
     3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt
     3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag
     3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt
     3181 ttctccaaat
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       RATCGM4AA    4627 bp ds-DNA             ROD       05-MAR-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM4) gene, exons 2
            and 3.
ACCESSION   M32475 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SOURCE      Rat (SPrarue-Dawley) liver DNA, clone lambda-rnCGM415-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4627)
  AUTHORS   Zimmermann,W.
  JOURNAL   Unpublished (1990) Univ. of Freiburg, D7800 Freiburg FRG
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    /   145      489     carcinoembryonic antigen-related protein
                                 (CGM4), exon 2 (AA at 147)
               3379  /  3738     carcinoembryonic antigen-related protein exon
                                 3
    pre-msg <     1  >  4627     CGM4 mRNA and introns
    IVS     <     1      144     CGM4 intron B
    IVS         490     3378     CGM4 intron C
    IVS        3739  >  4627     CGM4 intron D
BASE COUNT     1213 a   1165 c   1162 g   1087 t
ORIGIN      
        1 agatctgggt cgacctgcag gtcaacggat ctgggcctta gcaggagtgt gggcagagct
       61 ctgggaaggc agaagtgtga ttttttaaaa aaccaacaga tttcacctgc tcaatatcga
      121 tggttgctct gtcttccctt ttagcctccc ttctaacctg ttggctcctg actactgccc
      181 aggtcaacat tgaatcggtg ccattcaatg tggttgaagg ggaaaacgtc cttcttcttg
      241 tccacaatct gccagagaat ctcatagcct ttgcctggta tagagggctg aggaaaattg
      301 gagtatacat actgaacact gaagtaagtg tgacggggcc aatgtacagc ggtagagaga
      361 cagtgtacag caatggttcc ctgtgtatcc gcaatgtcac ccagaaggac acaggattct
      421 acactctacg aacagtcaac acacgtggag aaactgtatc aacaacatcc ttgtacctct
      481 atgtgtacag taagtgatac tttgtgaact ctgggtgttg tgtggggttc attccgtaga
      541 cacacacaga agaggcaggc ctacctaccc tttgcattgt gtctccttat tgaggtgtga
      601 acatttaact caggctaagg agagtaatgc caattgaata gaatccttct tttgacttta
      661 ccttgtagtc agctggatgt gtggttaact cagtgaagga catcagccct tgtctagact
      721 tctggggttc ttagcagtaa tgtgtccttg ggaaagacct tgagggaagg agattgggtt
      781 tgaatgagat agccatagga tcctcatgga agtgagaacc agaaagccct ggctccagac
      841 ctctgtcctg actcatctcc tgatggcccc gagaagcatt ttacaaaggc tggattctga
      901 catctgttgg cagggaacag tgcttttgag gagcaaatcc ttgtgccaca tacaatcacc
      961 tggtgcacgg ccatgagagc cacagttagg cgaggtctcc tggatctctc cagtgactca
     1021 tcagggagag aatagaaaga cagatgtccc ggccactaag ttaactgtta tgatggcctt
     1081 atgagacttc caggaaggtc atggttgcca ggaagaggga caaaggacac agatccccct
     1141 gacagttgct tgtcctttgg ggtccagctc atagaagtct gtccgcaggc aaatgacacc
     1201 aggctctgct gatgtggata gctccccaga tctgagctgc agttctccca gcgatcacga
     1261 gggccgcctc agggaaacac aattaacacc cagaagagta tttgtctaaa ccaggaactt
     1321 acctcctcct ctggctagct cccctgttcc tacagacatg ggggtcacac agccttctca
     1381 gacctaccag ctgcctcctt ttctgctgcc ttgctaggga attatgtgta gtggctgctt
     1441 tgtgtatttt ctttggaaaa gatagagtat cctaagggaa tcacccagac agaggttcaa
     1501 ggcatctctg aaaggccagg cagcacatgg cagagccacc tcacagctca ggacccagag
     1561 gaagtgtgcc caccatcttg aatccatgca tgggacgatg gagcccagag ctacgttcca
     1621 ggactcaggt cacctcccac acactcaaga agtgaggctc ctgacacagc tgctcctggg
     1681 ccccttttct ccctgagaat cctgactggt ggctgcagtg agaacacatc tgtcccctcc
     1741 cccactcgtc acacagctgg cccttgggat cctcacacac atctctgtct ccttcctcct
     1801 gagagcaaac tacctctttg acgggcactg agaacacagg gcagactggg tgcccagctg
     1861 gttctgggtc acccagggag tgcagaggct cactcactgg tgctgactga gccaggaaga
     1921 ggccagaaca gagggatgcc ccccgggtga gctgctgtct tcttagggca cagagatgct
     1981 cagaggtttg tttgtcactg tgagctctgt ggcatgagac agaaagagcc cagaggagag
     2041 gttaggtgtg taggactgag tgtgcacagg gcagagaaca gagttaccca cagcccacgg
     2101 gactctggga tatgatcctg tctggcggag gctgagctca gaggatcaga gaacttggga
     2161 gctgtattgg agcagatgtg ctacagactg aggacagatc tggccacaga gaccagggcg
     2221 gtgctctgta ccatctgcaa acaatgcccc acctgttggt gctcctgctc acagatgagg
     2281 agaccacatt ttacagtgtg tgagaggaga ggactcacct actgtctaaa gtctcttcaa
     2341 ggggacaggg actggagaag agtttcaggt ttgtagggct gaaaacacta aagtataggg
     2401 gctcatcatc atcatcatca ccaccgccat caccaccacc accaccacca ccaccaccac
     2461 caccaccacc accaccacca ccatcatcat catcatcatg aggctcttgg taaataagaa
     2521 gaagcagggg gaggaggaga ttattgtcaa cccacagttc accatcaatg agcccagtgt
     2581 tctgaagact gaggttctca gctgtgatgc cccaaataag aaaccaagct ggtgttgatc
     2641 agtgacatgg ctcagtggat ctgggtgttt gcttcatgtc tgacaacctg agaaccagtg
     2701 aacacaagtt gtccctgacc tccacctagg gacggcgttt tgcacccaac acagacacac
     2761 tgaggcatgc ccttgcacat gaactcatac accaatataa taagcaaatg cataaaaatt
     2821 atagcaaatg gaagcagtca acactgtatt cccaaacata ctaatttgtt aaataaatcc
     2881 atggccatgt attcattcat tcattcattc actcattcat ttactctcca agatatttga
     2941 gttttctttt gcagtctttt ttttttaaaa gataatataa gacaaatccc agttctcatt
     3001 attccctagc cctagactgg aagacgacca gtgaagaaag ctagaaggcg aatcagtcac
     3061 taaaggacaa gaaacaaaag agtcagagtg tgacggtcgg gaggcttcac cccaacaccc
     3121 atcgactgac actgagggtg agcagggatc tgaggacggt gaggcagggc catgttgaca
     3181 cctgaggaga gagcagcata gagaggaaat gacaagtgag gggcgcggag tgcatggagg
     3241 taatgcactg acctccacta gctagggcag ggagactccc acacctcagc tgaccactgg
     3301 acacagctgc tcggactcag gcaccatctt agccaaatac taaagtcctg atgttgacgg
     3361 atctctcttc ccttctagcc tctcttttca tctgtgggcg tccttttaac cctgccaagc
     3421 tcactattga atcagtgccg cccagtgttg ctgaaggggg aagcgttctt ctcctcgttc
     3481 acaatctcca ggacgagctt cgagggtttt tctggtacaa aggggcgtct atgtctagca
     3541 accatgagat agcccgatac agaacagcaa agaattcaag tgtgccaggc cctgcccaca
     3601 gtggtagaga gacggtgtac agcaatggat ccctcctgct ccagaatgtc acccggaatg
     3661 acactgggtt ctacacccta cgcactctga aaagacatca gaaaatggaa ttggcacacg
     3721 tgcaacttca ggtggacagt aagtgatttt ccgtgatcgt tcagtgctgg gtgggtcttt
     3781 gacacacagg actgtcaccc ctggcatgtg gctacctcct ctctgccttt ttatccccat
     3841 gttgtggtta accactatgt gcaggacaca tgtgatggaa agaaatgccc atgggtcaga
     3901 cttatcatct gactctcccc tgtatcaagg acagtaactc aaccctaggt gctagactct
     3961 gcccagtcat ctggggcatc ttgccatgca acgtgaggaa accatggatc ctcacagcgt
     4021 ggtgagcacc aggaagctct gatctcagtc gtttgtccca gacttgactg caaatgtctc
     4081 taggagcatt ttgtcaggag tgctgcttac tgcctctctc ctcacagcct gccatcctga
     4141 tcttatagta acccaggaca ctgagcccag gggtgaaaat gctcccagtt gggctgggct
     4201 ctcccacatc aatcaccaat taaaaatgta ctacaggtta gcccacaggt tattttggtg
     4261 gtggcatttt aaattgaggc ccttgtttca aaaaattcta gcttgtgtta agttgacata
     4321 aagccagcag cacgattcct gagccctccc caatacctat atctgccaag aagaccagac
     4381 tgttcccacc catcatccgg ccttagtcct gggtgctata ggctgggacg tgagaacatg
     4441 tggaatgtga agtctgagga tgaccgcagg tacaaaggag atgagaaagt cagagagtgt
     4501 gtatccaggg tgtgtagaga ccaaaggtca ggggaggcat catcccaaag cacagtgtgc
     4561 atgagtatgt gcaatgtctg aatgagggca gtgagggaca gccacggaga caccaaggac
     4621 agagctc
//

GenBank-Updates@genbank.bio.net (04/14/90)

LOCUS       YSCSEN2X     2813 bp ds-DNA             PLN       26-FEB-1990
DEFINITION  S.cerevisiae tRNA splicing endonuclease beta-subunit (SEN2) gene,
            complete cds.
ACCESSION   M32336
KEYWORDS    tRNA splicing endonuclease beta-subunit.
SOURCE      S.cerevisiae (strain S288C) YCp50 library of Rose DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2813)
  AUTHORS   Ho,C.K., Rauhut,R. and Abelson,J.
  TITLE     Saccharomyces cerevisiae SEN2 gene encodes the middle subunit of
            tRNA splicing endonuclease
  JOURNAL   Unpublished (1990) Cal. Tech., Div. of Bio., Pasadena, CA 91125
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.K.Ho, 23-FEB-1990.  Dr. Ho relayed updated title information
            by phone on 09-APR-1990.
FEATURES       from  to/span     description
    pept        520     1653     tRNA splicing endonuclease beta-subunit
BASE COUNT      927 a    522 c    539 g    825 t
ORIGIN      
        1 aaaattttct tcaagatgca agtttcccat tcagttctat aggaagttaa aacaaaagag
       61 ctgtttgaga gctcgtaggc tttggttgtt aagagaacgg actaaataac accaggaggt
      121 gtgtctagac ttagttgaac gagttttttc tcgtgctatt atatctgttc ttaatacttc
      181 cactttattt taatgatgat agcatgattt atgttgctcg aagttaatcg atgtgagcac
      241 aatgatttct caagactaca acggtatctg aattttattt cttgtcaaaa atgttttata
      301 gaagccactc ttgctataga tttatattgc cacacataac tttttcacgt aaatttgaaa
      361 gagctatatt agctggtgct ttcactaaca tctctttgct ctagtatttc atttcggtgc
      421 ataaaacatt tgaacttcca tcgaaattca atatgctaaa acgtttcatt gtaacacgac
      481 gctgaaaaga caataagtac gaaggactca agaaggagca tgtctaaagg gagggtcaat
      541 cagaagcgtt acaagtatcc tcttccaatt caccccgtag atgaccttcc agagttaatc
      601 cttcataatc cgttatcttg gctatattgg gcataccgct attacaagag tacgaacgca
      661 ctgaacgata aagtacatgt agattttata ggagatacca ctcttcacat cacggttcaa
      721 gatgacaagc aaatgctata tctttggaat aatggatttt ttggcactgg tcaatttagc
      781 aggagtgagc ctacatggaa agctagaaca gaggccagac tgggtctcaa tgatactccc
      841 ctccacaatc ggggaggaac aaagagtaat actgaaacgg agatgacttt agagaaggtc
      901 acacaacaga ggagattaca aagattagag ttcaagaaag aacgtgcgaa gttagaaaga
      961 gaattactag aattaagaaa aaaaggtggt catatcgatg aggaaaacat cctccttgag
     1021 aaacaacgag aatcattaag aaaatttaaa ctaaaacaaa cagaagacgt gggcatcgtt
     1081 gcacaacagc aagacatttc cgaatcaaat ttaagagatg aagacaacaa tcttcttgat
     1141 gagaacggtg atcttttgcc actagagtca ctagaactaa tgccagtgga agctatgttt
     1201 ttaacttttg cacttcctgt tcttgacata tctcccgcgt gcctggcagg gaaactgttt
     1261 caattcgatg ccaaatataa agatattcac tcctttgtca gatcatacgt tatataccat
     1321 cactacagat cacacggttg gtgcgtaaga tctggtataa aattcggctg cgattattta
     1381 ttatataaga gagggccacc atttcaacac gctgaatttt gtgttatggg tcttgaccac
     1441 gacgtctcta aagattatac gtggtattct agcatagccc gtgttgtggg cggcgcaaag
     1501 aagacgtttg tgttatgcta cgtggagaga ctaatctcgg aacaggaggc gatagcacta
     1561 tggaaatcaa acaactttac taaattgttc aatagttttc aagttggtga agtattgtat
     1621 aagagatggg ttcccggaag aaatagagac tagaatatca atgagaagat aaatccttcg
     1681 tatgtatata tgtatgtgta tgtatttttt atacagattt caatggtttt tgtaagcatc
     1741 tgatacccat ccacaatccc taaacccctt gttctttgct ttggggccct tgacagggtc
     1801 atcatatatt gctctaaact tttcattgtg tgtggctggt ttggatatgc aaacaaattt
     1861 attatcagga aggacacatt tcaactgaga tttagggaag ggacaaggac agtctttctg
     1921 ctgtgcaacg cacgttaggg tgtccggaca caaataccca tcacaagggt tatttaaaac
     1981 ttgatcttca tatgattgtt gctgttgctg ttgctgttgc tgatgatgac cgaaattgaa
     2041 gaaggcattc actccagtgc tgccaaaaat aaacaatgca aagatgagaa cgatactcca
     2101 gcgactttga ctcatactac agtaaaattt ggtcgttttt tgcctggcaa accatgccta
     2161 tttttcgtca ttaaaggtct tctgagcttt catcaaaatt atctctataa ttataaagga
     2221 cagctcagag ccacacccaa acatctgcaa aggtaaaggc catctttcat ctcacattgc
     2281 ctaacaagaa agtcacattg caattgtttc gagaagaaaa cattaattca attacgtggt
     2341 tcccatgcgt ctgtgttaag cgtcaattta tcatttgacg aactcgttgc aaccacaaac
     2401 aaagcaaggg ccgcatttat caaaggaaaa ccaaccaaaa gcataaataa acgagatttt
     2461 tattatgaaa aagccatgcg aatcctacta gaataaattc aaagtgaaga catttagtca
     2521 attcgatctt ctctctcttg ctttaactag tcgtaaaacc taataagttt cattcacgac
     2581 gcgtattttt ttttcaacat atcatctaac gcgttttaag aaatttcagt aaataatgaa
     2641 gctgtagtaa tgccaacaat ccattgtggc tactttatac aacgcactat agagcaataa
     2701 aagtgtagca ataaacacta gagagaaggc acataataac aagaatatac tcacgcacat
     2761 gtattatgga atcagccagt ttagcgaacc gtacaacaaa atcttaagga att
//

GenBank-Updates@genbank.bio.net (05/25/90)

LOCUS       FCLCAPS      2493 bp ss-RNA             VRL       25-MAY-1990
DEFINITION  Feline calicivirus capsid protein gene (put.), complete cds.
ACCESSION   M32819
KEYWORDS    capsid protein.
SOURCE      Feline calicivirus (CFI/68 FIV), cDNA to viral RNA, clone pCV[2,7].
  ORGANISM  Feline calicivirus
            Viridae; ss-RNA nonenveloped viruses; Caliciviridae.
REFERENCE   1  (bases 1 to 2493)
  AUTHORS   Neill,J.D.
  TITLE     Nucleotide sequence and expression of the capsid protein gene of
            Feline calicivirus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.D.Neill, 14-MAR-1990. Pioneer Hi-Bred International, 7300 NW
            62nd Ave., Johnston, IA 50131
FEATURES       from  to/span     description
    pept        127     2133     capsid protein ORF (put.)
    pept       2130     2450     capsid protein ORF (put.)
BASE COUNT      666 a    593 c    531 g    703 t
ORIGIN      
        1 gcccccttct tacagcacag ccttggagca ttacaacagc caatttaatg gtgtggaggc
       61 gcggactgac cagatcgata cgagtggcat ggccgccctt cactgtgatg tgttcgaagt
      121 ttgagcatgt gctcaacctg cgctaacgtg cttaaatact atgattggga tcctcacatc
      181 aaattggtaa tcaaccccaa caaatttcta catgttggct tctgcgataa ccctttaatg
      241 tgttgttatc ctgaattact acctgaattt ggcaccatgt gggattgtga tcaatcgcca
      301 ctccaagtct accttgagtc aatcctgggt gatgatgaat ggtcctccac tcatgaagca
      361 attgacccag ttgtgccacc aatgcattgg gatgaagccg gaaaaatctt ccaaccacac
      421 cctggcgtcc ttatgcatca cctcatctgt aaggttgcag aaggatggga cccaaacctg
      481 ccacttttcc gcttggaagc ggacgatggt tccatcacga cacctgaaca gggaacaatg
      541 gttggtggag tcattgctga gcccaacgcc caaatgtcaa ccgcagctga catggccact
      601 gggaaaagtg tggactctga gtgggaagcc ttcttctcct ttcacactag tgtgaactgg
      661 agcacatctg aaactcaggg gaagatactc tttaaacaat ccttaggacc attgctcaac
      721 ccctacctta cccatcttgc aaagctgtat gttgcttggt ctggttctgt tgatgttagg
      781 ttttctattt ctggatctgg tgtctttgga gggaaattag ctgctattgt tgtgccgcca
      841 ggaattgatc ctgttcaaag tacttcaatg ctgcaatatc ctcatgtcct ctttgatgct
      901 cgtcaagttg aacctgttat cttttccatt cccgatctaa gaagcacctt atatcacctt
      961 atgtctgaca ctgataccac atcgttggta atcatggtgt acaatgatct tattaacccc
     1021 tatgctaatg actcaaactc ttcgggctgc attgtcactg tggaaactaa accggggcca
     1081 gatttcaagt ttcacctctt aaaacctcct gggtctatgt taactcacgg atctatccca
     1141 tctgatctaa tcccaaaatc atcttcgctt tggattggaa atcggttttg gtctgacata
     1201 accgattttg taattcggcc ttttgtgttc caggcaaatc gacactttga tttcaaccaa
     1261 gagacagcag gttggagcac cccaaggttt cgcccaatta ctatcactat cagtgttaag
     1321 gagtcagcaa agcttggtat tggagtggcc accgactaca ttgttcccgg cataccagat
     1381 ggatggcccg acacaacaat cccaggtgag ttggtacctg ttggtgacta tgccatcact
     1441 aatggcacca acaatgatat caccacagct gcgcagtacg atgcagccac tgagattaga
     1501 aacaacacca atttcagagg catgtacatt tgtggttctc ttcaaagagc ttggggggat
     1561 aagaagattt caaatactgc ttttatcaca accggcacgg ttgatggagc caaattgata
     1621 cccagtaata ccattgacca aacaaaaatt gccgtattcc aagacacaca tgcgaataag
     1681 catgtccaga cctcggacga cacattggcc ctgcttggtt atactggtat tggtgaggaa
     1741 gcaattggtg ctgaccgcga tagagttgtg cgaattagcg tcctcccgga acgtggcgca
     1801 cgtggtggca atcacccaat cttccacaaa aactctatca agcttggtta tgtaattagg
     1861 tccattgatg tgttcaattc tcaaattctg catacctcta ggcaactttc cctcaatcat
     1921 tacttattgt cgcctgactc ctttgctgtc tataggatta ttgactctaa tggatcctgg
     1981 tttgacatag gcattgataa tgatggattt tcttttgttg gtgtatcaag tattggtaaa
     2041 ttagagtttc ctttaactgc ctcctacatg ggaattcaat tggcaaaaat tcgacttgcc
     2101 tctaacatta ggagtgtgat gacaaaatta tgaattcaat tttgggctta attgacactg
     2161 tcacgaacac aattggcaaa gctcaacaaa tcgaattgga taaggctgca cttggtcagc
     2221 aacgcgagct ggcactccaa cgtatgaact tggatcgcca ggctctaaat aatcaagtgg
     2281 agcaatttaa caaactgctt gagcagaggg tacaaggccc aatccaatct gtgcgcctgg
     2341 cacgcgcagc tggtttcagg gtcgaccctt actcatacac aaatcaaaac ttttatgacg
     2401 atcaattaaa tgcaatcaga ctatcatata gaaatttgtt caagaattga tcacttaacc
     2461 ctttgggtgc cgcacttgcg cctaacccca ggg
//
LOCUS       RIRSTA56     2280 bp ds-DNA             BCT       25-MAY-1990
DEFINITION  R.tsutsugamushi scrub typhus antigen 56 (sta 56) gene, complete
            cds.
ACCESSION   M33004
KEYWORDS    antigen; outer membrane protein; scrub typhus antigen 56.
SOURCE      R.tsutsugamushi (strain Karp) DNA, clone pRTS56H2.3.
  ORGANISM  Rickettsia tsutsugamushi
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae.
REFERENCE   1  (bases 1 to 2280)
  AUTHORS   Stover,C.K., Marana,D.P., Carter,J.M., Roe,B.A., Mardis,E. and
            Oaks,E.V.
  TITLE     The 56 kilodalton major protein antigen of Rickettsia
            tsutsugamushi: Molecular cloning and sequence analysis of sta 56
            gene, and precise identification of a strain specific epitope
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.V.Oaks, 16-MAR-1990. Walter Reed Institute, Dept. of Enteric
            Infections, Walter Reed Army Institute of Research, Washington,
            D.C.  20307
FEATURES       from  to/span     description
    pept        556     2154     scrub typhus antigen 56 precursor (sta 56)
    sigp        556      621     scrub typhus antigen 56 signal peptide
    matp        622     2151     scrub typhus antigen 56
    signal       77       82     -35 region
    signal       99      103     -10 region
    binding     545      550     ribosome binding site
BASE COUNT      764 a    319 c    489 g    708 t
ORIGIN      
        1 aagcttgttc attttttatg tgggctaatt ttagataatg caatgttagt ataattatgt
       61 ggttaattaa tgtatcttga tttaagattt tatataaata taataagatt tatgtagggc
      121 ttaattatta gcttaaaaaa ctgttgctat tttagctaaa aataaaagtt tgggcaagaa
      181 aaattattaa taattgaagg tagttgttgc gtaaaaagct gtgttatgct atctaaggtt
      241 aaatgtagca aggtgctaat agataattaa tgtattttcg aacgtgtctt taagctatat
      301 ataagagcag tattctattg aatattgttt ctaagtatat aaaaaataaa aatacatttt
      361 acaattgata aaacgctttg agcacatttt taacacagtg ttttatagat tgtttaaatt
      421 attttacaag tactattaaa tattagtata ctaaataata gttttttgat ataaaactaa
      481 agttagtgtg gctaaataat tagtttagaa tggttaccac taaaaaataa atttaattct
      541 tttaaggaga ttagaatgaa aaaaattatg ttaattgcta gtgcaatgtc tgcgttgtcg
      601 ttgccatttt cagctagtgc aatagaattg ggggaagaag gattagagtg tggtccttat
      661 gctaaagttg gagttgttgg aggaatgatt actggcgtag aatctgctcg cttggatcca
      721 gctgatgctg aaggcaaaaa acacttgtca ttaacaaatg ggctgccatt tggtggaacg
      781 ttggctgcag gtatgacaat cgctccagga tttagagcag agataggtgt tatgtacctt
      841 acaaatataa ctgctcaggt tgaagaaggt aaagttaagg cagattctgt aggtgagaca
      901 aaggcagatt ctgtaggtgg gaaagatgct cctatacgta agcggtttaa acttacacct
      961 cctcagccta ctataatgcc tataagtata gctgtacgtg actttgggat tgatattcct
     1021 aaccagacct cagcagcaag cacaagccgc agcctcaggc ttaatgatga gcaacgtgct
     1081 gcagctagga tcgcttggtt aaagaattgt gctggtattg actatagggt aaaaaaccct
     1141 aatgatccta atgggcctat ggttataaat ccgatattgt taaatattcc acagggtaac
     1201 cctaatcctg ttggaaatcc accgcagcga gcaaatccgc ctgcaggttt tgcgatacat
     1261 aaccatgagc aatggaggca tttggtagtt gggcttgctg cattatcaaa tgctaataaa
     1321 cctagcgctt ctcctgtcaa agtattaagt gataaaatta ctcagatata tagtgatata
     1381 aagcatttgg ctgatatagc tggtattgat gttcctgata ctagtttgcc taatagtgca
     1441 tctgtcgaac agatacagaa taaaatgcaa gaattaaacg atctattgga agagctcaga
     1501 gaatcttttg atgggtatct tggtggtaat gcttttgcta atcagataca gttgaatttt
     1561 gtcatgccgc agcaagcaca gcagcagggg caagggcagc aacagcaagc tcaagctaca
     1621 gcgcaagaag cagtagcagc agcagctgtt aggcttttaa atggcaatga tcagattgcg
     1681 cagttatata aagatcttgt taaattgcag cgtcatgcag gaattaagaa agcgatggaa
     1741 aaattagctg cccaacaaga agaagatgca aagaatcaag gtgaaggtga ctgcaagcag
     1801 caacaaggaa catctgaaaa atctaaaaaa ggaaaagaca aagaggcaga gtttgatctg
     1861 agtatgattg tcggccaagt taaactctat gctgacgtaa tgataactga atcagtctca
     1921 atatatgctg gtgttggtgc agggttagct tatacttctg gaaaaataga taataaggat
     1981 attaaagggc atacaggcat ggttgcatca ggagcacttg gtgtagcaat taatgctgct
     2041 gaaggtgtgt atgtggacat agaaggtagt tatatgtact cattcagtaa aatagaagag
     2101 aagtattcaa taaatcctct tatggcaagt gtaagtgtac gctataactt ctagtttttt
     2161 ctttaaaatt ataaaaaaag cagctaaaag ttctttacag ggtttttagc tgctttttca
     2221 gagttttttt ataataataa aaataacttt attctttgct atttaattag cttgaagctt
//
LOCUS       HUMTROPI      816 bp ss-mRNA            PRI       25-MAY-1990
DEFINITION  Human slow-twitch skeletal troponin I (TNN1) mRNA, complete cds.
ACCESSION   J04760
KEYWORDS    slow-twitch skeletal troponin I; troponin I.
SOURCE      Human adult slow-twitch skeletal muscle, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 816)
  AUTHORS   Wade,R., Eddy,R., Shows,T.B. and Kedes,L.
  TITLE     cDNA sequence, tissue-specific expression and chromosomal mapping
            of the human slow-twitch skeletal muscle isoform of troponin I
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Wade, 15-MAR-1990.
FEATURES       from  to/span     description
    pept         77      640     slow-twitch skeletal troponin I (TNN1)
BASE COUNT      180 a    247 c    244 g    145 t
ORIGIN      Chromosome 1q12-qter..
        1 tagtctgcag tctacggcga ggcacaggcc agcccagctc cacgaggact gaacaaggtg
       61 ctgtctcact gccaccatgc cggaagtcga gagaaaaccc aagatcactg cctcccgcaa
      121 actcttgctg aagagcctga tgctggccaa ggccaaggaa tgctgggagc aggagcacga
      181 ggagcgcgag gctgagaagg tgcgctacct ggcagagcgc atccccacgc tgcagacccg
      241 tggcctgtcc ctcagtgccc tgcaggacct gtgccgggag ctgcacgcca aggtggaggt
      301 ggtggatgag gagcgatacg acattgaggc caaatgcctc cacaacacca gggagattaa
      361 ggacctgaag ctgaaggtga tggacctccg tgggaagttc aagcgcccgc ccctgcgtcg
      421 agtccgtgtc tcggctgacg ccatgctccg ggccctgctg ggctccaagc acaaggtgtc
      481 catggatctg cgggccaacc tcaagtctgt gaagaaggaa gacacagaga aggagcggcc
      541 tgtggaggtg ggtgactgga ggaagaacgt ggaggccatg tctggcatgg aaggccggaa
      601 gaagatgttt gatgccgcca atgctccgac ctcacaatag aggccagctt gctgtgctgc
      661 gctctgagct cctgcttcat gcttcttctc caacccagct cactcacctc tctgcctgtg
      721 tctggagcat cccttcccac ctctccccca cttcttccct ccagcctgca atgccctcct
      781 ctggaactgg gattaaacag atacccaaga ggcagg
//
LOCUS       ASOTAAG1     2935 bp ds-DNA             PLN       25-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G1) gene, complete cds.
ACCESSION   M33218
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 2935)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        607      772     Taka-amylase A (Taa-G1) precursor, exon 1
                828      868     Taka-amylase A precursor, exon 2
                955     1070     Taka-amylase A precursor, exon 3
               1140     1248     Taka-amylase A precursor, exon 4
               1317     1545     Taka-amylase A precursor, exon 5
               1603     1765     Taka-amylase A precursor, exon 6
               1830     1976     Taka-amylase A precursor, exon 7
               2041     2281     Taka-amylase A precursor, exon 8
               2360     2647     Taka-amylase A precursor, exon 9
    sigp        607      669     Taka-amylase A signal peptide
    matp        670      772     Taka-amylase A
                828      868     Taka-amylase A
                955     1070     Taka-amylase A
               1140     1248     Taka-amylase A
               1317     1545     Taka-amylase A
               1603     1765     Taka-amylase A
               1830     1976     Taka-amylase A
               2041     2281     Taka-amylase A
               2360     2644     Taka-amylase A
    pre-msg     543  >  2789     Taa-G1 mRNA and introns
    IVS         773      827     Taa-G1 intron A (no splice consensus)
    IVS         869      954     Taa-G1 intron B
    IVS        1071     1139     Taa-G1 intron C
    IVS        1249     1316     Taa-G1 intron D
    IVS        1546     1602     Taa-G1 intron E
    IVS        1766     1829     Taa-G1 intron F
    IVS        1977     2040     Taa-G1 intron G
    IVS        2282     2359     Taa-G1 intron H
    signal     2784     2789     poly-A signal
BASE COUNT      818 a    752 c    657 g    708 t
ORIGIN      
        1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact
       61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca
      121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca
      181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag
      241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat
      301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat
      361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt
      421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc
      481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg
      541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag
      601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct
      661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat
      721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt
      781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg
      841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg
      901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac
      961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc
     1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat
     1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga
     1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt
     1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct
     1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct
     1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact
     1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct
     1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga
     1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc
     1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg
     1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt
     1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca
     1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt
     1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca
     1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca
     1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa
     1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag
     2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg
     2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa
     2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc
     2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa
     2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga
     2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg
     2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga
     2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt
     2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg
     2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag
     2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag
     2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga
     2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca
     2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc
     2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg
//
LOCUS       ASOTAAG2A1    197 bp ds-DNA             PLN       25-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G2) gene, 5' end.
ACCESSION   M33220
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 197)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        195  >   197     Taka-amylase A (Taa-G2) precursor
    pre-msg     131  >   197     Taa-G2 mRNA and introns
    signal        2       11     CAAT box
    signal       95      100     TATA box
BASE COUNT       59 a     52 c     36 g     50 t
ORIGIN      
        1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa
       61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg
      121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc
      181 cacagaaggc atttatg
//
LOCUS       ASOTAAG2A2    198 bp ds-DNA             PLN       25-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G2) gene, 3' end.
ACCESSION   M33222
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 198)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        9     Taka-amylase A (Taa-G2) (AA at 1)
    pre-msg <     1      151     Taa-G2 mRNA and introns (alt.)
    pre-msg <     1      156     Taa-G2 mRNA and introns (alt.)
BASE COUNT       45 a     48 c     53 g     52 t
ORIGIN      About 2.1 kb after segment 1.
        1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
       61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
      121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc
      181 gtcctccaga ggcgagga
//
LOCUS       ASOTTAM1      191 bp ss-mRNA            PLN       25-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa) mRNA, 3' end.
ACCESSION   M33219
KEYWORDS    Taka-amylase A.
SEGMENT     1 of 2
SOURCE      A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 191)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        189  >   191     Taka-amylase A (Taa)
    mRNA        125  >   191     Taa mRNA
BASE COUNT       56 a     52 c     32 g     51 t
ORIGIN      
        1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca
       61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt
      121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga
      181 aggcatttat g
//
LOCUS       ASOTTAM2      156 bp ss-mRNA            PLN       25-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa) mRNA, 5' end.
ACCESSION   M33221
KEYWORDS    Taka-amylase A.
SEGMENT     2 of 2
SOURCE      A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        9     Taka-amylase A (Taa) (AA at 1)
    mRNA    <     1      151     Taa mRNA (alt.)
    mRNA    <     1      156     Taa mRNA (alt.)
BASE COUNT       40 a     35 c     42 g     39 t
ORIGIN      About 1.5 kb after segment 1.
        1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
       61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
      121 ggcaccgggc cccgactgag agcccgtctc gaatcc
//
LOCUS       CIPNADGAPD   1354 bp ss-mRNA            PLN       25-MAY-1990
DEFINITION  M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH)
            mRNA, complete cds.
ACCESSION   J05223
KEYWORDS    D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase;
            glyceraldehyde-3-phosphate dehydrogenase.
SOURCE      M.crystallinum, cDNA to mRNA.
  ORGANISM  Mesembryanthemum crystallinum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Aizoaceae.
REFERENCE   1  (bases 1 to 1354)
  AUTHORS   Ostrem,J.A., Vernon,D.M. and Bohnert,H.J.
  TITLE     Increased expression of a gene coding for NAD:glyceraldehyde-3-
            phosphate dehydrogenase during the transition from C-3
            photosynthesis to crassulacean acid metabolism in Mesembryanthemum
            crystallinum
  JOURNAL   J. Biol. Chem. 265, 3497-3502 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         55     1068     glyceraldehyde-3-phosphate dehydrogenase (EC
                                 1.2.1.12)
BASE COUNT      315 a    308 c    345 g    386 t
ORIGIN      
        1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct
       61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc
      121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac
      181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag
      241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg
      301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga
      361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt
      421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac
      481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg
      541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc
      601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg
      661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt
      721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg
      781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct
      841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat
      901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga
      961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac
     1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga
     1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc
     1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt
     1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta
     1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc
     1321 agcatttatg aaacattgct gtgagctttt gagt
//
LOCUS       ECOPRIA      2658 bp ds-DNA             BCT       25-MAY-1990
DEFINITION  E.coli primosomal protein n' (priA) gene, complete cds, and cytR
            gene, 5' end.
ACCESSION   M33293
KEYWORDS    cytR gene; priA gene; primosomal protein.
SOURCE      E.coli (strain W3110) DNA, clone pEL042.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2658)
  AUTHORS   Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A.
  TITLE     The priA gene encoding the primosomal, replicative n' protein of
            E.coli
  JOURNAL   Unpublished (1990) Stanford U., Beckman Cntr., Stanford, CA 94305
  STANDARD  full automatic
COMMENT     Authorin sequence for [1] kindly submitted by G.C.Allen Jr.,
            26-MAR-1990.
FEATURES       from  to/span     description
    pept         64     2262     primosomal protein n'
    pept       2418  >  2658     cytR gene product
    signal       27       32     -10 region
    signal        5       10     -35 region
    binding      53       59     ribosome binding site
BASE COUNT      578 a    738 c    756 g    586 t
ORIGIN      
        1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat
       61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg
      121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag
      181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag
      241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga
      301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc
      361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt
      421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa
      481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa
      541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa
      601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg
      661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc
      721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa
      781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg
      841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc
      901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt
      961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt
     1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac
     1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc
     1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt
     1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg
     1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac
     1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt
     1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac
     1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc
     1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg
     1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg
     1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca
     1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg
     1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc
     1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac
     1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa
     1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg
     1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg
     2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg
     2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg
     2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt
     2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg
     2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg
     2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg
     2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg
     2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata
     2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac
     2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc
     2641 cggatatctg cgatcccc
//
LOCUS       EWCTELRNA     657 bp ds-DNA             INV       25-MAY-1990
DEFINITION  E.crassus telomerase RNA component gene, complete cds.
ACCESSION   M33461
KEYWORDS    telomerase RNA.
SOURCE      E.crassus DNA.
  ORGANISM  Euplotes crassus
            Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; 
            Spirotricha; Heterotrichida; Clevelandellina.
REFERENCE   1  (bases 1 to 657)
  AUTHORS   Shippen-Lentz,D. and Blackburn,E.H.
  TITLE     Functional evidence for an RNA template in telomerase
  JOURNAL   Science 247, 546-552 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    RNA         152      342     telomerase RNA component
    site        186      197     functional telomeric template
BASE COUNT      202 a    122 c    111 g    222 t
ORIGIN      
        1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga
       61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt
      121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg
      181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg
      241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt
      301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg
      361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt
      421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat
      481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga
      541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac
      601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt
//
LOCUS       HS4DWXJ       160 bp ds-DNA             VRL       25-MAY-1990
DEFINITION  Epstein-Barr virus defective WZhet junction.
ACCESSION   M33474
KEYWORDS    .
SOURCE      Epstein-Barr virus (strain HR-1, clinical sample 9) DNA.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 160)
  AUTHORS   Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
  TITLE     Defective viral DNA in Epstein-Barr virus-associated oral hairy
            leukoplakia
  JOURNAL   J. Virol. 64, 397-400 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb       87       90     WZhet junction
BASE COUNT       41 a     45 c     41 g     33 t
ORIGIN      
        1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc
       61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact
      121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac
//
LOCUS       HS4WXJ        181 bp ds-DNA             VRL       25-MAY-1990
DEFINITION  Epstein-Barr virus WZhet junction, HR-1 clone 5.
ACCESSION   M33473
KEYWORDS    .
SOURCE      Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 181)
  AUTHORS   Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
  TITLE     Defective viral DNA in Epstein-Barr virus-associated oral hairy
            leukoplakia
  JOURNAL   J. Virol. 64, 397-400 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb      108      111     WZhet junction
BASE COUNT       50 a     42 c     46 g     43 t
ORIGIN      
        1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt
       61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca
      121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga
      181 c
//
LOCUS       HUMREGA      4251 bp ds-DNA             PRI       25-MAY-1990
DEFINITION  Human regenerating protein (reg) gene, complete cds.
ACCESSION   J05412
KEYWORDS    pancreatic stone protein; pancreatic thread protein;
            regenerating protein.
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4251)
  AUTHORS   Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
  TITLE     Complete nucleotide sequence of the human reg gene and its
            expression in normal and tumoral tissues
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            H.Okamoto, 23-FEB-1990.
FEATURES       from  to/span     description
    pept       1571     1634     regenerating protein (reg), exon 2 (first
                                 expressed exon)
               2270     2388     regenerating protein, exon 3
               2696     2833     regenerating protein, exon 4
               3549     3660     regenerating protein, exon 5
               3856     3923     regenerating protein, exon 6
    pre-msg    1196     4116     reg mRNA and introns
    IVS        1224     1524     reg intron A
    IVS        1635     2269     reg intron B
    IVS        2389     2695     reg intron C
    IVS        2834     3548     reg intron D
    IVS        3661     3855     reg intron E
    signal     1169     1174     TATA box
BASE COUNT     1161 a    927 c    869 g   1294 t
ORIGIN      
        1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc
       61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat
      121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt
      181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc
      241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt
      301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc
      361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc
      421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat
      481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac
      541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg
      601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg
      661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata
      721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag
      781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc
      841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga
      901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt
      961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag
     1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc
     1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa
     1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa
     1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat
     1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta
     1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc
     1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag
     1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac
     1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt
     1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct
     1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact
     1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt
     1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat
     1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg
     1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt
     1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa
     1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac
     2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga
     2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat
     2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa
     2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc
     2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc
     2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag
     2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca
     2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc
     2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac
     2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca
     2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta
     2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt
     2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca
     2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta
     2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag
     2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata
     3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt
     3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca
     3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct
     3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa
     3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga
     3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt
     3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa
     3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc
     3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc
     3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg
     3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag
     3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg
     3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt
     3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt
     3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct
     3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc
     3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca
     4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt
     4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc
     4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct
     4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c
//
LOCUS       HUMREGRELA   1524 bp ds-DNA             PRI       25-MAY-1990
DEFINITION  Human reg-related sequence, complete cds.
ACCESSION   J05413
KEYWORDS    pancreatic stone protein; pancreatic thread protein;
            regenerating protein.
SOURCE      Human esophageal mucosa DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1524)
  AUTHORS   Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
  TITLE     Complete nucleotide sequence of the human reg gene and its
            expression in normal and tumoral tissues
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            H.Okamoto, 23-FEB-1990.
BASE COUNT      382 a    368 c    345 g    429 t
ORIGIN      
        1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt
       61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt
      121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg
      181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat
      241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg
      301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga
      361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata
      421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat
      481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag
      541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa
      601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc
      661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac
      721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta
      781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca
      841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt
      901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat
      961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct
     1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc
     1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga
     1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc
     1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct
     1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct
     1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc
     1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg
     1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc
     1501 tataaatcat ataatgtaaa ggat
//
LOCUS       MDPCGA       4801 bp ss-RNA             VRL       25-MAY-1990
DEFINITION  Aleutian mink disease parvovirus complete genome.
ACCESSION   M20036
KEYWORDS    complete genome.
SOURCE      Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4.
  ORGANISM  Aleutian mink disease parvovirus
            Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE   1  (bases 1 to 4592)
  AUTHORS   Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
            Wolfinbarger,J.B.
  TITLE     Nucleotide sequence and genomic organization of Aleutian mink
            disease parvovirus (ADV): Sequence comparisons between a
            nonpathogenic and pathogenic strain of ADV
  JOURNAL   J. Virol. 62, 2903-2915 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 4593 to 4801)
  AUTHORS   Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W.,
            Perryman,S. and Wolfinbarger,J.B.
  TITLE     Nucleotide sequence of the 5' terminal palindrome of Aleutian mink
            disease parvovirus (ADV) and construction of an infectious
            molecular clone
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by M.E.Bloom, 18-MAR-1990.
            
            Sequence reported below is (+) strand.
FEATURES       from  to/span     description
    ORF         116     1978     Left ORF
    ORF        1983     2207     Middle ORF2
    ORF        1993     2211     Middle ORF1
    ORF        2241     4399     Right ORF
BASE COUNT     1740 a    912 c    943 g   1206 t
ORIGIN      
        1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt
       61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt
      121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag
      181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact
      241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt
      301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact
      361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc
      421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac
      481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt
      541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta
      601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag
      661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa
      721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc
      781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga
      841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta
      901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt
      961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga
     1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg
     1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa
     1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt
     1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga
     1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt
     1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga
     1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa
     1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg
     1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta
     1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat
     1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac
     1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg
     1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa
     1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa
     1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca
     1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt
     1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca
     2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa
     2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg
     2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata
     2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta
     2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc
     2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag
     2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg
     2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg
     2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt
     2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact
     2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag
     2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata
     2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta
     2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa
     2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt
     2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg
     3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc
     3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct
     3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta
     3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt
     3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg
     3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa
     3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg
     3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt
     3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac
     3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga
     3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat
     3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac
     3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt
     3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc
     3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg
     3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg
     3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac
     4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg
     4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa
     4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag
     4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag
     4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg
     4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact
     4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc
     4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta
     4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac
     4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt
     4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt
     4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct
     4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg
     4801 a
//
LOCUS       MDPUPS       3454 bp ss-RNA             VRL       25-MAY-1990
DEFINITION  Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial
            sequence.
ACCESSION   M32981
KEYWORDS    .
SOURCE      Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA.
  ORGANISM  Aleutian mink disease parvovirus
            Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE   1  (sites)
  AUTHORS   Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
            Wolfinbarger,J.B.
  TITLE     Nucleotide sequence and genomic organization of Aleutian mink
            disease parvovirus (ADV): Sequence comparisons between a
            nonpathogenic and pathogenic strain of ADV
  JOURNAL   J. Virol. 62, 2903-2915 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3454, for [1])
  AUTHORS   Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
            Wolfinbarger,J.B.
  JOURNAL   Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by M.E.Bloom 18-MAR-1990.
            
            Sequence reported below is (+) strand.
BASE COUNT     1268 a    673 c    698 g    815 t
ORIGIN      
        1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc
       61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac
      121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac
      181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta
      241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa
      301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga
      361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat
      421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt
      481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa
      541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta
      601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat
      661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag
      721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga
      781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat
      841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc
      901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact
      961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt
     1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac
     1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac
     1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa
     1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta
     1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag
     1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac
     1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg
     1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa
     1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac
     1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca
     1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga
     1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc
     1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg
     1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta
     1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt
     1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga
     1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag
     2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag
     2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac
     2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt
     2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga
     2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc
     2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg
     2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat
     2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta
     2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg
     2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa
     2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg
     2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta
     2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc
     2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa
     2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt
     2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca
     3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg
     3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct
     3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga
     3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt
     3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc
     3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt
     3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa
     3421 aacctatggt tactttactt ggagaggcaa gctt
//
LOCUS       MUSAA2DEL     300 bp ds-DNA             ROD       25-MAY-1990
DEFINITION  Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion
            fragment.
ACCESSION   M33468
KEYWORDS    deletion mutant.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
  TITLE     Molecular analysis of two mouse dilute locus deletion mutations:
            Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
            lethal Aa2 alleles
  JOURNAL   Mol. Cell. Biol. 10, 501-509 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb      130      133     deletion breakpoint
BASE COUNT       82 a     45 c     76 g     97 t
ORIGIN      
        1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact
       61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt
      121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt
      181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata
      241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa
//
LOCUS       MUSSL20JA     300 bp ds-DNA             ROD       25-MAY-1990
DEFINITION  Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion
            fragment.
ACCESSION   M33467
KEYWORDS    deletion mutant.
SOURCE      Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
  TITLE     Molecular analysis of two mouse dilute locus deletion mutations:
            Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
            lethal Aa2 alleles
  JOURNAL   Mol. Cell. Biol. 10, 501-509 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept.ps /    42      122     dilute gene, exon 3
                179  >   300     dilute gene, exon 5
    recomb      129      130     deletion breakpoint
BASE COUNT       86 a     51 c     61 g    102 t
ORIGIN      
        1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac
       61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac
      121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta
      181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta
      241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc
//
LOCUS       PRVVP4       2359 bp ds-RNA             VRL       25-MAY-1990
DEFINITION  Porcine rotavirus capsid protein VP4 gene, complete cds.
ACCESSION   M33516
KEYWORDS    capsid protein VP4.
SOURCE      Porcine rotavirus (Gottfried strain; serotype 4) DNA.
  ORGANISM  Porcine rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 2359)
  AUTHORS   Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K.
  TITLE     Similarity of the outer capsid protein VP4 of the Gottfried strain
            of porcine rotavirus to that of asymptomatic human rotavirus
            strains
  JOURNAL   J. Virol. 64, 414-418 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         10     2337     capsid protein VP4
BASE COUNT      812 a    406 c    442 g    699 t
ORIGIN      
        1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa
       61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg
      121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat
      181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca
      241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac
      301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga
      361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg
      421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta
      481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat
      541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa
      601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag
      661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta
      721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag
      781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt
      841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca
      901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt
      961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc
     1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac
     1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg
     1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt
     1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc
     1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca
     1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga
     1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac
     1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg
     1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata
     1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt
     1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct
     1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc
     1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata
     1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga
     1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct
     1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat
     1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat
     2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc
     2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata
     2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc
     2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa
     2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag
     2341 ctctatagag gatgtgacc
//
LOCUS       RATSTAA      1000 bp ss-mRNA            ROD       25-MAY-1990
DEFINITION  Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds.
ACCESSION   M33329
KEYWORDS    hydroxysteroid sulfotransferase a.
SOURCE      Rat (strain Sprague-Dawley) female liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1000)
  AUTHORS   Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K.,
            Yamazoe,Y. and Kato,R.
  TITLE     cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a
            strong homology in amino acid sequence with the senescence marker
            protein SMP-2 in rat livers
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         38      892     hydroxysteroid sulfotransferase a (STa)
    mRNA    <     1     1000     STa mRNA
    signal      977      982     poly-A signal
BASE COUNT      299 a    187 c    211 g    303 t
ORIGIN      
        1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga
       61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa
      121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg
      181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt
      241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa
      301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct
      361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc
      421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta
      481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc
      541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa
      601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga
      661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc
      721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg
      781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt
      841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa
      901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa
      961 gtgaatgtgg tcattgaata aattctattc tggcattgtg
//
LOCUS       SMFPOLENV    3534 bp ss-RNA             VRL       25-MAY-1990
DEFINITION  Simian foamy virus type 1 polymerase (pol) gene, 3' end; and
            envelope (env) gene, complete cds.
ACCESSION   M33561
KEYWORDS    envelope protein; polymerase.
SOURCE      Simian foamy virus type 1, cDNA to viral RNA.
  ORGANISM  Simian foamy virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Spumavirinae.
REFERENCE   1  (bases 1 to 3534)
  AUTHORS   Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A.
  TITLE     Relationship of the env genes and the endonuclease domain of the
            pol genes of simian foamy virus type 1 and human foamy virus
  JOURNAL   J. Virol. 64, 406-410 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      582     polymerase (AA at 1)
    pept        524     3481     envelope protein
BASE COUNT     1162 a    645 c    670 g   1046 t     11 others
ORIGIN      
        1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac
       61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta
      121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat
      181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta
      241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc
      301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg
      361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc
      421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt
      481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg
      541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac
      601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag
      661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg
      721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat
      781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt
      841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt
      901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt
      961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt
     1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt
     1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt
     1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca
     1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac
     1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca
     1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa
     1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa
     1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga
     1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg
     1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc
     1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt
     1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca
     1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa
     1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg
     1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt
     1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa
     1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc
     2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac
     2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa
     2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag
     2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg
     2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt
     2341 atatttactn nnnnnnnnnn tggtaaccct gatggaagct gcccttcatg atgtttcgat
     2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat
     2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt
     2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta
     2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga
     2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt
     2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa
     2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt
     2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg
     2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg
     2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt
     3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc
     3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt
     3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa
     3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg
     3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc
     3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg
     3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact
     3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg
     3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat
//
LOCUS       ENHCYPA       495 bp ds-DNA             INV       25-MAY-1990
DEFINITION  Entamoeba histolytica cysteine protease gene, partial cds.
ACCESSION   M27307
KEYWORDS    cysteine protease.
SOURCE      Entamoeba histolytica (strain HM-1) trophozoites DNA, clone E452.
  ORGANISM  Entamoeba histolytica
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Lobosa; Gymnamoeba; Schizopyrenida; Tubulina; 
            Entamoebidae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H.
  TITLE     Amplification and sequencing of genomic DNA fragments encoding
            cysteine proteases from protozoan parasites
  JOURNAL   Mol. Biochem. Parasitol. 39, 1-8 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 495)
  AUTHORS   Eakin,A.E.
  JOURNAL   Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by A.E.Eakin, 24-AUG-1989.
FEATURES       from  to/span     description
    pept    <     1  >   495     cysteine protease (AA at 1)
BASE COUNT      161 a     75 c    107 g    152 t
ORIGIN      
        1 caagggcaat gcgggtcgtg ctggactttc tgtacaactg cagttcttga aggaagagtt
       61 aacaaagatc ttggaaaact ttactcattc tctgaacaac aattagttga ttgtgatgct
      121 tctgataatg gatgtgaacg aggacatcca tctaactcac ttaaattcat ccaagaaaat
      181 aatggattag gattagaaag cgattatcca tataaagctg ttgctggtac ttgcaagaaa
      241 gttaaaaacg ttgctactgt tactggttct agaagagtta ctgatggaag tgaaactgga
      301 cttcaaacta ttattgctga aaacggacct gttgctgttg gtatggatgc tagcagacca
      361 tcattccaat tatataagaa aggaactatc tattctgata ctaaatgtag atcaagaatg
      421 atgaatcact gtgttactgc tgttggttat ggttcaaata gtaatggtaa atattggatt
      481 gtcaagaatt cctgg
//
LOCUS       TRBCYPA       495 bp ds-DNA             INV       25-MAY-1990
DEFINITION  Trypanosoma brucei cysteine protease gene, partial cds.
ACCESSION   M27306
KEYWORDS    cysteine protease.
SOURCE      Trypanosoma brucei gambiense trypomastigote DNA, clone B502.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H.
  TITLE     Amplification and sequencing of genomic DNA fragments encoding
            cysteine proteases from protozoan parasites
  JOURNAL   Mol. Biochem. Parasitol. 39, 1-8 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 495)
  AUTHORS   Eakin,A.E.
  JOURNAL   Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by A.E.Eakin, 24-AUG-1989.
FEATURES       from  to/span     description
    pept    <     1  >   495     cysteine protease (AA at 1)
BASE COUNT      116 a    115 c    140 g    124 t
ORIGIN      
        1 caagggcagt gtgggtcgtg ctgggccttt tcaactatcg gcaacatcga agggcagtgg
       61 caggtggcag gaaatcctct cgtatccctc tcggagcaga ttctagtgta ctgtgatcca
      121 ttgattggct gtggtggtgg gctgatggac aatgccttca actggatagt aaattcaaac
      181 ggtggaaacg tattcacgga ggcgagctat ccctatgttt ctgggaatgg tgagcagcca
      241 cagtgccaga tgaatggtca cgagatcggt gctgcgataa cagaccatgt tgacttaccg
      301 caggatgagg acgccatcgc cgcgtatttg gcagaaaacc gtccccttgc tattgccgtt
      361 gaagcaccgc agttttatgg acataacggt gggattctga cttcatgcac ctccgagcaa
      421 ctggatcatg gtgtgctcct cgttggttac aatgataata gcaatccacc ctattggatc
      481 gtcaagaact cctgg
//
LOCUS       TRCCYPA       495 bp ds-DNA             INV       25-MAY-1990
DEFINITION  Trypanosoma cruzi cysteine protease gene, partial cds.
ACCESSION   M27305
KEYWORDS    cysteine protease.
SOURCE      Trypanosoma cruzi (strain RA; Argentine) epimastigote DNA, clone
            R501.
  ORGANISM  Trypanosoma cruzi
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H.
  TITLE     Amplification and sequencing of genomic DNA fragments encoding
            cysteine proteases from protozoan parasites
  JOURNAL   Mol. Biochem. Parasitol. 39, 1-8 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 495)
  AUTHORS   Eakin,A.E.
  JOURNAL   Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by A.E.Eakin, 24-AUG-1989.
FEATURES       from  to/span     description
    pept    <     1  >   495     cysteine protease (AA at 1)
BASE COUNT       94 a    143 c    158 g    100 t
ORIGIN      
        1 cagggccaat gcggctcgtg ctgggccttc tccgccattg gcaacgttag tggccagtgg
       61 tttcttgccg gtcacccgct gactaacctg tcggagcaga tgctcgtgtc gtgcgacaaa
      121 acggactctg gctgcagtgg tggcctgatg aacaacgcct ttgagtggat tgtgcaggag
      181 aataacggcg gcgtgtacac ggaggacagc tacccttatg cgtcgggcga ggggatatcg
      241 ccgccgtgca cgacgtcagg ccacacggtg ggtgccacga ttaccggtca cgttgaatta
      301 cctcaggacg aggcccaaat agccgcatgg cttgccgtca atggcccggt tgccgttgcg
      361 cacgccagca gctggatgac ctacacgggc ggcgttatga cgagctgcgt ctccgagcag
      421 ctggatcacg gccttcttct cgtcggctac aatgacagcg ccgcagtgcc gtactggatc
      481 atcaagaact cgtgg
//
LOCUS       STRDEXB      1800 bp ds-DNA             BCT       25-MAY-1990
DEFINITION  S.mutans dextran glucosidase (dexB) gene, complete cds.
ACCESSION   M30944
KEYWORDS    dextran glucosidase.
SOURCE      S.mutans (strain INGBRITT) DNA.
  ORGANISM  Streptococcus mutans
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Streptococcaceae.
REFERENCE   1  (bases 1 to 1800)
  AUTHORS   Russell,R.R.B. and Ferretti,J.J.
  TITLE     Nucleotide sequence of the dextran glucosidase (dexB) gene from
            Streptococcus mutans
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.J.Ferretti, 20-DEC-1989.  U of Oklahoma, Box 26901, Oklahoma
            City, OK 73190
FEATURES       from  to/span     description
    pept         91     1701     dextran glucosidase (dexB) (EC 3.2.1.11)
    binding      78       83     ribosome binding site
BASE COUNT      585 a    290 c    376 g    549 t
ORIGIN      
        1 aaaacggctt aaattgtttt agattttctc agtctcttta ttagaactaa agaaactaac
       61 atgtcataat aaattaaagg tggtgataag atgcaaaaac attggtggca caaggcaact
      121 gtttatcaaa tttatccaaa atcttttatg gatacaaatg gtgatggaat tggtgatctc
      181 aaaggtatta cgagtaaatt ggattatttg caaaagttag gggttatggc tatttggcta
      241 tctccagttt atgatagccc catggatgac aatggctatg acattgcgaa ctatgaagca
      301 attgcggata tttttggcaa tatggctgat atggataatt tgctgacgca ggcaaaaatg
      361 cgcgacataa aaatcattat ggatctagtg gttaatcata cctcagatga acatacttgg
      421 tttattgaag cacgtgagca tccagacagt tctgaacgcg attattatat ttggtgtgac
      481 cagccaaatg atttggaatc tattttcggt ggttctgctt ggcagtatga tgataagtcc
      541 gatcaatatt atttgcattt ttttagtaag aagcagccag atctaaactg ggaaaacgca
      601 aacttacgtc agaagattta tgatatgatg aatttctgga ttgataaagg tattggcggc
      661 tttcggatgg acgtcattga tatgattggg aaaattcctg ctcagcatat tgtcagtaac
      721 ggaccaaaat tgcatgctta tcttaaggag atgaatgccg ctagttttgg tcaacatgat
      781 ctgctgactg tgggggaaac ttggggagca acgcctgaga ttgcgaagca atattcaaat
      841 ccagtcaatc acgaactctc tatgattttt caatttgaac atattggtct tcagcataaa
      901 ccagaagctc ctaaatggga ttatgtgaag gaacttaatg ttcctgcttt aaaaacaatc
      961 tttaataaat ggcagactga gttggaatta ggacaggggt ggaattcgtt attctggaat
     1021 aaccatgacc tgcctcgtgt tttatcaatc tggggaaata cgggcaaata tcgtgagaag
     1081 tctgctaaag cactggctat tcttcttcac cttatgcgtg ggacacctta tatttatcaa
     1141 ggtgaagaga ttgggatgac caattatcct tttaaagatt taaatgaact tgatgatatt
     1201 gaatcactta attatgctaa ggaagctttt acaaatggta agtctatgga aactatcatg
     1261 gacagtattc gtatgattgg ccgtgataat gccagaacac ctatgcaatg ggatgcttct
     1321 caaaatgccg gattttcaac agcggataaa acatggctgc cagttaatcc aaactataaa
     1381 gacatcaatg ttcaagcagc tctgaaaaat tccaattcta tcttttacac ctatcaacaa
     1441 ctcattcagc ttcgaaaaga aaatgattgg ctagtagatg ccgattttga attgctccct
     1501 acagcggaca aagtatttgc ctatttacga aaggtaagag aagaaaggta tcttatagtg
     1561 gtcaatgttt cagatcagga agaagttcta gagattgatg ttgacaaaca agaaactctc
     1621 attagcaata caaatgaaag cgctgctctt gccaatcaca aactccagcc ttgggatgct
     1681 ttttgtatta agataaacta ggatagaata agaaaacaac agacgatacc acttaaaaag
     1741 tgacggcctg ttgttttttt aatgtttgta ttaagtaata aagcgattct gattgactat
//
LOCUS       STRGBP       1968 bp ds-DNA             BCT       25-MAY-1990
DEFINITION  S.mutans glucan-binding protein (gbp) gene, complete cds.
ACCESSION   M30945
KEYWORDS    glucan binding protein.
SOURCE      S.mutans (strain INGBRITT) DNA.
  ORGANISM  Streptococcus mutans
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Streptococcaceae.
REFERENCE   1  (bases 1 to 1968)
  AUTHORS   Banas,J.A., Russell,R.R.B. and Ferretti,J.J.
  TITLE     Sequence analysis of the gene for the glucan-binding protein of
            Streptococcus mutans INGBRITT
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma
            City, OK 73190.
FEATURES       from  to/span     description
    pept        124     1815     glucan-binding protein (gbp)
    signal       12       17     -35 region
    signal       37       42     -10 region
    binding     112      116     ribosome binding site
BASE COUNT      648 a    341 c    445 g    534 t
ORIGIN      
        1 ccggctataa gttgaaatat tgtaggtatt aaaaactatc tttagtttag tatttacatt
       61 aattttaaaa atgttatagt ggaagtgtca tgttgattac tattttttta aggaggtaaa
      121 atgatgaaag aaaagacacg ttttaaactg cacaaggtta aaaagcagtg ggtggcgatt
      181 gccgtgacta gtctagctct agctgcgata ttgtcaggag ctcacttgac tcaggctgag
      241 gaacaatccg gcggtactga cagtaagcca agactgacag cgactgtaca ggaaagctca
      301 gaacaaccaa ttacaaaagc tccagcagct gattcatctg tagaaaataa cagtgctaac
      361 gctgttaaaa gttctgaaac agcagaggca gctgaagtat ccgatggagg cagagccagc
      421 caaactgaag cagtaacaaa ccaaacaaac tctgaagagc accatccagc agaaaaagcg
      481 acagccgttt ctggagaagc tcagtcagtg caaaatgctc catcagaaaa tgctgcccag
      541 caggaaacgg ctaaaaccga gccagcgact gctgcagaaa ataatgacgc tgctccaacc
      601 aatagcttct ttaaaaaaga tggtaaatgg tactacaaaa aggccgatgg acagctggca
      661 accggttggc agataattga tggaaagcag ctctatttca accaagatgg tagtcaggtc
      721 aaaggagaaa ttcatgtgga gacaggggat caaatcattt atcatcctgt tttcataagt
      781 gattcacctt cagttttgga agtcaataag atttattact ttgatcctga tagtggtgaa
      841 ctctggaagg atcgttttgt ctattctagt tatgcagatc ccctccatta tgaaaatatt
      901 aaacatgaag gctggttcta tcttggagaa gatggaaagg ctgctatcgg ctggagaact
      961 attggcggta aaaaatacta ttttgacact aatggtgttc aagtcaaagg aaagctaatt
     1021 agtacagatg gcaattataa tctaattagc cagaagtatg gcaagaaatc tttcctagat
     1081 cctgacaccg gtgaagcttg gactaatcgt tttgtcaatg caaagtatta tttctacaac
     1141 tttgcaggat acgtctctac gacagactgg ttctatatgg gagccgatgg tatcggcgtg
     1201 accgattggc aaaagatcga tggtatggat tactatttcg aaccttccag tggtattcag
     1261 gttaaaggcg acattgctga gcgtgatggc aaggtctatt atttagatga agacagtgga
     1321 caagttgtta agaatcgttt tggcacaaca cctgccgagc gtatcagtac agttgaggct
     1381 cgtttcccta aaacttatta ttttggagcg gacggtagcc gcaaagatct aactggttgg
     1441 cagattattg atggtaaaac ttattacttt aaggatgatc acagcataaa agcaaagtca
     1501 gagtatagtc aaattggtgg ttctgtgcct gatgacggtt ttgcagagat tgatggtgat
     1561 ggttactttt ttgatactca aggtcaattc gtaacgaata gatttgtcag aaaatacgac
     1621 tacagtaata tttggtatta ttatggaagc gatggcaaac gtgtatcagg ctggcaaact
     1681 atcgacggta agcgctacta ctttagccaa gatgaaaaga caaagggccg tcaaattaaa
     1741 ggacaaacca tcactatcga tggtaaagaa tatacttttg acaaagacag cggtgaagtt
     1801 atcaatagta actagttggt aaatcccatg gcacacaaaa acgagcagat ttcatactct
     1861 gttcgttttt tcgccttaaa acttatatat ttataaatcg tcaataaagt gtttacttga
     1921 aaacggtaaa tatgccaaga gtttgactgt tatcaattaa tgggaaag
//
LOCUS       STRGTFS      4376 bp ds-DNA             BCT       25-MAY-1990
DEFINITION  S.downei glucosyltransferase S (gtfs) gene, complete cds.
ACCESSION   M30943
KEYWORDS    glucosyltransferase S.
SOURCE      S.downei (strain MFe28) DNA.
  ORGANISM  Streptococcus downei
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Streptococcaceae.
REFERENCE   1  (bases 1 to 4376)
  AUTHORS   Gilmore,K.S., Russell,R.R.B. and Ferretti,J.J.
  TITLE     Sequence analysis of a Streptococcus downei gtfs gene which
            specifies a glucosyltransferase synthesizing soluble glucans
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma
            City, OK 73190
FEATURES       from  to/span     description
    pept        247     4344     glucosyltransferase S (gtfs)
    signal      127      132     -35 region
    signal      150      154     -10 region
    binding     233      237     ribosome binding site
BASE COUNT     1282 a    944 c   1000 g   1150 t
ORIGIN      
        1 tggcctatgg agtagcggtc attctcacca tcttgaacat caagttaatt attgatttgt
       61 tccaataaat atgaatctct ttaaagaata aatttaaaga gatttttatg ttgttttaca
      121 atctaatgac aaatttagga tttctgagat taaacttaaa taacatgtta aaataaattg
      181 ataattaaaa ttatttggtt attaaagttt taattgtcga atttttttat aaggagaatg
      241 atattcatgg aaaaaaattt gcgctacaaa ttgcataagg ttaaaaagca atgggtagcc
      301 attggtgtga cgaccgtaac gcttagcttt ctagctggag gccaagtcgt tgcagccgac
      361 acaaacaata atgacgggac aagtgttcag gtcaacaaaa tggtgccaag tgatcccaag
      421 tttgacgctc aagcccaaaa cggtcaactt gctcaagcta tgtttaaggc tgctaatcaa
      481 gcagatcaaa ctgctacaag tcaagttagc ccagcaacag atggcagggt tgataaccag
      541 gtgacaccag cagctaatca acctgcggct aatgtggcca atcaagatgt ggccaatcca
      601 gcgactgatg ccggagcact taatcgccaa tcggccgctg atacgtcaac agatggtaag
      661 gctgtacctc agactagtga ccaaccaggt catctggaaa cagttgatgg caaaacttac
      721 tatgtggatg ccaatggtca acggttgaaa aactattcaa tggtcattga tggcaagacc
      781 tattactttg acggtcagac aggcgaagcg caaaccgact tgcctaagac cggtcaagct
      841 aatcaagaca atgtgcccga tagctatcag gccaataatc aagcctatag taatgaggct
      901 tctagctttg agacagttga taattacctg acagcggatt cctggtatcg ccctcgcaag
      961 attttgaaaa atggccagag ctggcaggct agttcagaag gtgacctgcg tcctattctc
     1021 atgacctggt ggccagatgc ggccaccaag gcagcctatg ccaacttctg ggctaaagaa
     1081 ggcctgattt caggttctta caggcaaaat tctgccaatc ttgatgcggc tacccagaat
     1141 attcaatctg ccattgaaaa gaagattgcc agtgaaggca ataccaactg gttgcgcgac
     1201 aagatgtccc aattcgtcaa gtcacaaaac cagtggagta ttgcttctga aaatgagact
     1261 gtctatccta atcaggatca tatgcaaggg ggggccctgc tcttttcaaa cagtaaggat
     1321 accgagcatg ctaactcaga ctggcgcctg ctcaaccgca atccaacctt ccaaactggt
     1381 aaacaaaagt attttacaac caactatgcg ggttatgaat tgctcttagc caatgatgtt
     1441 gataattcta acccggtcgt ccaagccgaa cagctcaatc acctccacta ccttatgaac
     1501 tggggtgaca ttgttatggg ggataaggat gctaactttg atggtgtccg agttgatgcg
     1561 gttgataatg tcaatgccga cctgctacaa attcagaggg attattataa ggccaaatat
     1621 ggcaccgatc aaaatgaaaa aaatgccatt gaccacctct ctatcctaga agcctggtct
     1681 ggcaacgaca atgactacgt caaggaccaa aacaactttt ctctctctat tgacaatgat
     1741 caacgcagtg gtatgttgaa agctttcggt tatgcttctg cctatcgtgg taatttgagt
     1801 aatctcgcaa ctgctggctt gaaaaaccgt agtgcaaatc ctgatagtga cccagttcca
     1861 aactatgtct ttattcgtgc ccatgactct gaggtacaga cacggattgc taagattatt
     1921 cgtgagaaat tgggtaagac caatgctgat ggtttgacca atctgacctt agatgattta
     1981 aacaaggcct ttgacatcta caaccaggat atgaatgcca ccgataaggt ttattatcct
     2041 aataatttac cgatggccta tgcttggatg ctgcaaaata aggatacggt cactcgggtc
     2101 tactatggtg atatgtacac cgataatggc cagtatatgg cgaccaagac cccattttat
     2161 aatgctattg agaccctgct taagggccgg attaagtatg ttgctggtgg tcaagcggtc
     2221 tcctacaagc aagattggtc tagtggaatt ttgacctcgg ttcgctatgg taagggagct
     2281 aattcggcta gtgatgctgg gaacacggaa acccgcaatt ctggtatggc cctcctgatt
     2341 aacaatcgac ctaacttcag ggcttatcgg aacttgacct tgaacatggg ggcagcccat
     2401 aagagccaag cctaccgtcc tctcttgctg tctaccaagg atggcattgc cacctacctt
     2461 aatgatagcg atgtggattc tcgccaatac aagtacaccg atagtcaggg gaacttaagc
     2521 tttagtgcct ctgaactgca aagtgttgct aacgcccaag tctcaggtat gattcaggtt
     2581 tgggtacccg tcggtgcggc agataaccaa gatgtccgta cttctccaag tacacaagcg
     2641 accaaggatg gcaatatcta tcatcaaagt gacgccctag attcccaagt catctatgaa
     2701 ggtttctcta atttccaagc cttcgcccaa agtcctgacc aatataccaa tgctgtcatt
     2761 gctaaaaatg gcgacctctt taagtcttgg ggcattaccc aatttgaaat ggcaccgcag
     2821 tacgtgtcat cagaagatgg aactttcctt gattccgtta ttttaaatgg ttatgccttc
     2881 tcagaccgct atgacctggc tatgagtaag aataataaat atggctccaa gcaggatttg
     2941 gccaatgcca ttaaaggact tcagtctgca ggtatcaagg tcttgtcaga cttggtaccg
     3001 aaccaacttt ataatctgcc tggaaaagaa gtggtaacgg caacccgggt caatcaatat
     3061 ggtcaagcca aatcaggggc gaccatcaat aagactcctt atgtggccaa tacccgttcc
     3121 tatggtgact accaagaaca gtacggtgga aaattcttgg atgacttgca aaagctctac
     3181 ccaagactct ttagcaccaa gcaaatttca acgggcaaac caattgatcc atctgttaag
     3241 attaccaatt ggtcggctaa gtatttcaat ggctccaata ttttggggcg tggtgccaag
     3301 tatgttctga gtgagggcaa taagtatctc aacttggctg atggcaagct cttcttgcca
     3361 acggtcctca ataatactta tggtcagccg caagtatcgg ctaatggttt tatttctaag
     3421 aatggcggta ttcattatct tgataaaaat ggtcaggaag tcaagaatcg gttcaaggaa
     3481 atttctggca gctggtatta cttcgattct gacggtaaga tggctactgg aaaaacgaag
     3541 attggcaatg atacctacct ctttatgcct aatggtaaac aacttaagga aggtgtctgg
     3601 tatgatggta agaaggccta ctattatgat gataatggta ggacgtggac caataagggc
     3661 ttcgttgaat ttagagttga cggtcaagac aagtggcgtt acttcaatgg tgatggcact
     3721 atcgccattg gactagttag tctagataat cgcaccctct actttgatgc ctatggctat
     3781 caagtcaagg gacaaaccgt gacgattaat ggtaagtcct atacctttga tgccgatcag
     3841 ggtgacttgg tccaaacaga caatgccaat ccagctcctc agggtcaagc aggctggaaa
     3901 ctcctaggag ataaccagtg gggctaccgc aaggacggtc aactcttgac gggtgagcaa
     3961 actattgatg gtcaaaaggt cttcttccaa gataatggcg tccaagtcaa aggtggaact
     4021 gcgacagatg cttcaggtgt cttgcgtttc tacgaccgtg accagggcca ccaagttggc
     4081 aagggctggt actcaacctc cgacgataat tgggtctatg tcaatgaatc cggtcaagtt
     4141 ctgacaggct tacaaaccat tgatggtcag acggtctatt tcgatgacaa gggtatccag
     4201 gccaagggca aggctgtctg ggatgaaaat ggcaacctgc gttactttga tgccgattca
     4261 ggtaatatgc ttcgggaccg ttggaagaac gttgacggaa actggtatta cttcaaccgc
     4321 aatggcctag ccacgaggtg gtaagcctaa tactcgctaa atattaaaag agctcg
//
LOCUS       VCHTCPA      1048 bp ds-DNA             BCT       25-MAY-1990
DEFINITION  V.cholerae major pilin subunit (tcpA) gene, complete cds.
ACCESSION   M33514
KEYWORDS    antigen; pilin.
SOURCE      V.cholerae (strain Z17561 (Classical, Inaba)) DNA, clone pPM2114.
  ORGANISM  Vibrio cholerae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 1048)
  AUTHORS   Faast,R., Ogierman,M.A., Stroeher,U.H. and Manning,P.A.
  TITLE     Nucleotide sequence of the structural gene, tcpA, for a major pilin
            subunit of Vibrio cholerae
  JOURNAL   Gene 85, 227-231 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        144      818     major pilin subunit (tcpA) precursor
    sigp        144      218     major pilin subunit (tcpA) signal peptide
    matp        219      815     major pilin subunit (tcpA)
    ORF         865  >  1048     ORF
BASE COUNT      327 a    182 c    240 g    299 t
ORIGIN      
        1 aaaaaataaa aaacacagca aaaatgacat ctgtcaattg taggtgactt tgtgtggtta
       61 aatgtgcgtg ttgcttacgt tatctaaaaa agaccaagcg acgcatttct ttaaagacag
      121 taaaatggtg gagttacata aatatgcaat tattaaaaca gctttttaag aagaaatttg
      181 taaaagaaga acacgataag aaaaccggtc aagagggtat gacattactc gaagtgatca
      241 tcgttctagg cattatgggg gtggtttcgg cgggggttgt tactctggcg cagcgtgcga
      301 ttgattcgca gattatgacc aaggccgcgc aaagtctcaa tagtatccaa gttgcactga
      361 cacagacata ccgtggtcta ggtaattatc cagcaacagc tgatgcgaca gctgctagta
      421 agctaacttc aggcttggtt agtttaggta aaatatcatc cgatgaggca aaaaacccat
      481 tcaatggtac aaatatgaat attttttcat ttccgcgtaa tgcagcagct aataaagcat
      541 ttgcaatttc agtggatggt ctgacacagg ctcaatgcaa gacacttatt accagtgtcg
      601 gtgatatgtt cccatatatt gcaatcaaag ctggtggcgc agtagcactt gcagatctag
      661 gtgattttga gaattctgca gcagcggctg agacaggcgt tggtgtgatc aaatctatcg
      721 ctcccgctag taagaattta gatctaacga acatcactca cgttgagaaa ttatgtaaag
      781 gtactgctcc attcggcgtt gcatttggta acagctaatt caaataagtt tgtttaactt
      841 aatcttaacg ttgcccatta aataatgggc aacttattaa attcaatgtg gtatcaatat
      901 gagaaaatac caacaaggtg tcggattatt ggaggcgatt ctggcttctg cgtattaggg
      961 atggcattgg tcgctgctgg gagctattac aagcgggaag ctgaactcat gattaaatcc
     1021 agtaacgcat ttgatgttat tgagttgt
//
LOCUS       HPCPOLYP     7310 bp ss-RNA             VRL       25-MAY-1990
DEFINITION  Hepatitis C virus polyprotein gene, partial cds.
ACCESSION   M32084
KEYWORDS    polyprotein.
SOURCE      Hepatitis C virus, cDNA to viral RNA, clones K9-1 through 15e,
            isolated from chimpanzee (individual 910) blood plasma.
  ORGANISM  Hepatitis C virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Togaviridae incertae sedis.
REFERENCE   1  (bases 1 to 7310)
  AUTHORS   Choo,Q.-L., Richman,K. and Han,J.
  TITLE     The nucleotide sequence of the Hepatitis C viral genome
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.Houghton, 22-FEB-1990. Chiron Corporation, 4560 Horton Street,
            Emeryville CA 94608.
FEATURES       from  to/span     description
    pept    <     1  >  7310     polyprotein (AA at 3)
BASE COUNT     1495 a   2218 c   2058 g   1539 t
ORIGIN      
        1 caggctgtcc tgagaggcta gccagctgcc gaccccttac cgattttgac cagggctggg
       61 gccctatcag ttatgccaac ggaagcggcc ccgaccagcg cccctactgc tggcactacc
      121 ccccaaaacc ttgcggtatt gtgcccgcga agagtgtgtg tggtccggta tattgcttca
      181 ctcccagccc cgtggtggtg ggaacgaccg acaggtcggg cgcgcccacc tacagctggg
      241 gtgaaaatga tacggacgtc ttcgtcctta acaataccag gccaccgctg ggcaattggt
      301 tcggttgtac ctggatgaac tcaactggat tcaccaaagt gtgcggagcg cctccttgtg
      361 tcatcggagg ggcgggcaac aacaccctgc actgccccac tgattgcttc cgcaagcatc
      421 cggacgccac atactctcgg tgcggctccg gtccctggat cacacccagg tgcctggtcg
      481 actacccgta taggctttgg cattatcctt gtaccatcaa ctacaccata tttaaaatca
      541 ggatgtacgt gggaggggtc gaacacaggc tggaagctgc ctgcaactgg acgcggggcg
      601 aacgttgcga tctggaagac agggacaggt ccgagctcag cccgttactg ctgaccacta
      661 cacagtggca ggtcctcccg tgttccttca caaccctacc agccttgtcc accggcctca
      721 tccacctcca ccagaacatt gtggacgtgc agtacttgta cggggtgggg tcaagcatcg
      781 cgtcctgggc cattaagtgg gagtacgtcg ttctcctgtt ccttctgctt gcagacgcgc
      841 gcgtctgctc ctgcttgtgg atgatgctac tcatatccca agcggaggcg gctttggaga
      901 acctcgtaat acttaatgca gcatccctgg ccgggacgca cggtcttgta tccttcctcg
      961 tgttcttctg ctttgcatgg tatttgaagg gtaagtgggt gcccggagcg gtctacacct
     1021 tctacgggat gtggcctctc ctcctgctcc tgttggcgtt gccccagcgg gcgtacgcgc
     1081 tggacacgga ggtggccgcg tcgtgtggcg gtgttgttct cgtcgggttg atggcgctga
     1141 ctctgtcacc atattacaag cgctatatca gctggtgctt gtggtggctt cagtattttc
     1201 tgaccagagt ggaagcgcaa ctgcacgtgt ggattccccc cctcaacgtc cgaggggggc
     1261 gcgacgccgt catcttactc atgtgtgctg tacacccgac tctggtattt gacatcacca
     1321 aattgctgct ggccgtcttc ggaccccttt ggattcttca agccagtttg cttaaagtac
     1381 cctactttgt gcgcgtccaa ggccttctcc ggttctgcgc gttagcgcgg aagatgatcg
     1441 gaggccatta cgtgcaaatg gtcatcatta agttaggggc gcttactggc acctatgttt
     1501 ataaccatct cactcctctt cgggactggg cgcacaacgg cttgcgagat ctggccgtgg
     1561 ctgtagagcc agtcgtcttc tcccaaatgg agaccaagct catcacgtgg ggggcagata
     1621 ccgccgcgtg cggtgacatc atcaacggct tgcctgtttc cgcccgcagg ggccgggaga
     1681 tactgctcgg gccagccgat ggaatggtct ccaaggggtg gaggttgctg gcgcccatca
     1741 cggcgtacgc ccagcagaca aggggcctcc tagggtgcat aatcaccagc ctaactggcc
     1801 gggacaaaaa ccaagtggag ggtgaggtcc agattgtgtc aactgctgcc caaaccttcc
     1861 tggcaacgtg catcaatggg gtgtgctgga ctgtctacca cggggccgga acgaggacca
     1921 tcgcgtcacc caagggtcct gtcatccaga tgtataccaa tgtagaccaa gaccttgtgg
     1981 gctggcccgc tccgcaaggt agccgctcat tgacaccctg cacttgcggc tcctcggacc
     2041 tttacctggt cacgaggcac gccgatgtca ttcccgtgcg ccggcggggt gatagcaggg
     2101 gcagcctgct gtcgccccgg cccatttcct acttgaaagg ctcctcgggg ggtccgctgt
     2161 tgtgccccgc ggggcacgcc gtgggcatat ttagggccgc ggtgtgcacc cgtggagtgg
     2221 ctaaggcggt ggactttatc cctgtggaga acctagagac aaccatgagg tccccggtgt
     2281 tcacggataa ctcctctcca ccagtagtgc cccagagctt ccaggtggct cacctccatg
     2341 ctcccacagg cagcggcaaa agcaccaagg tcccggctgc atatgcagct cagggctata
     2401 aggtgctagt actcaacccc tctgttgctg caacactggg ctttggtgct tacatgtcca
     2461 aggctcatgg gatcgatcct aacatcagga ccggggtgag aacaattacc actggcagcc
     2521 ccatcacgta ctccacctac ggcaagttcc ttgccgacgg cgggtgctcg gggggcgctt
     2581 atgacataat aatttgtgac gagtgccact ccacggatgc cacatccatc ttgggcatcg
     2641 gcactgtcct tgaccaagca gagactgcgg gggcgagact ggttgtgctc gccaccgcca
     2701 cccctccggg ctccgtcact gtgccccatc ccaacatcga ggaggttgct ctgtccacca
     2761 ccggagagat ccctttttac ggcaaggcta tccccctcga agtaatcaag ggggggagac
     2821 atctcatctt ctgtcattca aagaagaagt gcgacgaact cgccgcaaag ctggtcgcat
     2881 tgggcatcaa tgccgtggcc tactaccgcg gtcttgacgt gtccgtcatc ccgaccagcg
     2941 gcgatgttgt cgtcgtggca accgatgccc tcatgaccgg ctataccggc gacttcgact
     3001 cggtgataga ctgcaatacg tgtgtcaccc agacagtcga tttcagcctt gaccctacct
     3061 tcaccattga gacaatcacg ctcccccagg atgctgtctc ccgcactcaa cgtcggggca
     3121 ggactggcag ggggaagcca ggcatctaca gatttgtggc accgggggag cgcccctccg
     3181 gcatgttcga ctcgtccgtc ctctgtgagt gctatgacgc aggctgtgct tggtatgagc
     3241 tcacgcccgc cgagactaca gttaggctac gagcgtacat gaacaccccg gggcttcccg
     3301 tgtgccagga ccatcttgaa ttttgggagg gcgtctttac aggcctcact catatagatg
     3361 cccactttct atcccagaca aagcagagtg gggagaacct tccttacctg gtagcgtacc
     3421 aagccaccgt gtgcgctagg gctcaagccc ctcccccatc gtgggaccag atgtggaagt
     3481 gtttgattcg cctcaagccc accctccatg ggccaacacc cctgctatac agactgggcg
     3541 ctgttcagaa tgaaatcacc ctgacgcacc cagtcaccaa atacatcatg acatgcatgt
     3601 cggccgacct ggaggtcgtc acgagcacct gggtgctcgt tggcggcgtc ctggctgctt
     3661 tggccgcgta ttgcctgtca acaggctgcg tggtcatagt gggcagggtc gtcttgtccg
     3721 ggaagccggc aatcatacct gacagggaag tcctctaccg agagttcgat gagatggaag
     3781 agtgctctca gcacttaccg tacatcgagc aagggatgat gctcgccgag cagttcaagc
     3841 agaaggccct cggcctcctg cagaccgcgt cccgtcaggc agaggttatc gcccctgctg
     3901 tccagaccaa ctggcaaaaa ctcgagacct tctgggcgaa gcatatgtgg aacttcatca
     3961 gtgggataca atacttggcg ggcttgtcaa cgctgcctgg taaccccgcc attgcttcat
     4021 tgatggcttt tacagctgct gtcaccagcc cactaaccac tagccaaacc ctcctcttca
     4081 acatattggg ggggtgggtg gctgcccagc tcgccgcccc cggtgccgct actgcctttg
     4141 tgggcgctgg cttagctggc gccgccatcg gcagtgttgg actggggaag gtcctcatag
     4201 acatccttgc agggtatggc gcgggcgtgg cgggagctct tgtggcattc aagatcatga
     4261 gcggtgaggt cccctccacg gaggacctgg tcaatctact gcccgccatc ctctcgcccg
     4321 gagccctcgt agtcggcgtg gtctgtgcag caatactgcg ccggcacgtt ggcccgggcg
     4381 agggggcagt gcagtggatg aaccggctga tagccttcgc ctcccggggg aaccatgttt
     4441 cccccacgca ctacgtgccg gagagcgatg cagctgcccg cgtcactgcc atactcagca
     4501 gcctcactgt aacccagctc ctgaggcgac tgcaccagtg gataagctcg gagtgtacca
     4561 ctccatgctc cggttcctgg ctaagggaca tctgggactg gatatgcgag gtgttgagcg
     4621 actttaagac ctggctaaaa gctaagctca tgccacagct gcctgggatc ccctttgtgt
     4681 cctgccagcg cgggtataag ggggtctggc gagtggacgg catcatgcac actcgctgcc
     4741 actgtggagc tgagatcact ggacatgtca aaaacgggac gatgaggatc gtcggtccta
     4801 ggacctgcag gaacatgtgg agtgggacct tccccattaa tgcctacacc acgggcccct
     4861 gtacccccct tcctgcgccg aactacacgt tcgcgctatg gagggtgtct gcagaggaat
     4921 atgtggagat aaggcaggtg ggggacttcc actacgtgac gggtatgact actgacaatc
     4981 tcaaatgccc gtgccaggtc ccatcgcccg aatttttcac agaattggac ggggtgcgcc
     5041 tacataggtt tgcgcccccc tgcaagccct tgctgcggga ggaggtatca ttcagagtag
     5101 gactccacga atacccggta gggtcgcaat taccttgcga gcccgaaccg gacgtggccg
     5161 tgttgacgtc catgctcact gatccctccc atataacagc agaggcggcc gggcgaaggt
     5221 tggcgagggg atcacccccc tctgtggcca gctcctcggc tagccagcta tccgctccat
     5281 ctctcaaggc aacttgcacc gctaaccatg actcccctga tgctgagctc atagaggcca
     5341 acctcctatg gaggcaggag atgggcggca acatcaccag ggttgagtca gaaaacaaag
     5401 tggtgattct ggactccttc gatccgcttg tggcggagga ggacgagcgg gagatctccg
     5461 tacccgcaga aatcctgcgg aagtctcgga gattcgccca ggccctgccc gtttgggcgc
     5521 ggccggacta taaccccccg ctagtggaga cgtggaaaaa gcccgactac gaaccacctg
     5581 tggtccatgg ctgtccgctt ccacctccaa agtcccctcc tgtgcctccg cctcggaaga
     5641 agcggacggt ggtcctcact gaatcaaccc tatctactgc cttggccgag ctcgccacca
     5701 gaagctttgg cagctcctca acttccggca ttacgggcga caatacgaca acatcctctg
     5761 agcccgcccc ttctggctgc ccccccgact ccgacgctga gtcctattcc tccatgcccc
     5821 ccctggaggg ggagcctggg gatccggatc ttagcgacgg gtcatggtca acggtcagta
     5881 gtgaggccaa cgcggaggat gtcgtgtgct gctcaatgtc ttactcttgg acaggcgcac
     5941 tcgtcacccc gtgcgccgcg gaagaacaga aactgcccat caatgcacta agcaactcgt
     6001 tgctacgtca ccacaatttg gtgtattcca ccacctcacg cagtgcttgc caaaggcaga
     6061 agaaagtcac atttgacaga ctgcaagttc tggacagcca ttaccaggac gtactcaagg
     6121 aggttaaagc agcggcgtca aaagtgaagg ctaacttgct atccgtagag gaagcttgca
     6181 gcctgacgcc cccacactca gccaaatcca agtttggtta tggggcaaaa gacgtccgtt
     6241 gccatgccag aaaggccgta acccacatca actccgtgtg gaaagacctt ctggaagaca
     6301 atgtaacacc aatagacact accatcatgg ctaagaacga ggttttctgc gttcagcctg
     6361 agaagggggg tcgtaagcca gctcgtctca tcgtgttccc cgatctgggc gtgcgcgtgt
     6421 gcgaaaagat ggctttgtac gacgtggtta caaagctccc cttggccgtg atgggaagct
     6481 cctacggatt ccaatactca ccaggacagc gggttgaatt cctcgtgcaa gcgtggaagt
     6541 ccaagaaaac cccaatgggg ttctcgtatg atacccgctg ctttgactcc acagtcactg
     6601 agagcgacat ccgtacggag gaggcaatct accaatgttg tgacctcgac ccccaagccc
     6661 gcgtggccat caagtccctc accgagaggc tttatgttgg gggccctctt accaattcaa
     6721 ggggggagaa ctgcggctat cgcaggtgcc gcgcgagcgg cgtactgaca actagctgtg
     6781 gtaacaccct cacttgctac atcaaggccc gggcagcctg tcgagccgca gggctccagg
     6841 actgcaccat gctcgtgtgt ggcgacgact tagtcgttat ctgtgaaagc gcgggggtcc
     6901 aggaggacgc ggcgagcctg agagccttca cggaggctat gaccaggtac tccgcccccc
     6961 ctggggaccc cccacaacca gaatacgact tggagctcat aacatcatgc tcctccaacg
     7021 tgtcagtcgc ccacgacggc gctggaaaga gggtctacta cctcacccgt gaccctacaa
     7081 cccccctcgc gagagctgcg tgggagacag caagacacac tccagtcaat tcctggctag
     7141 gcaacataat catgtttgcc cccacactgt gggcgaggat gatactgatg acccatttct
     7201 ttagcgtcct tatagccagg gaccagcttg aacaggccct cgattgcgag atctacgggg
     7261 cctgctactc catagaacca cttgatctac ctccaatcat tcaaagactc
//
LOCUS       YSCREP1A     1498 bp ds-DNA             PLN       25-MAY-1990
DEFINITION  S.cerevisiae rep1 protein gene, complete cds.
ACCESSION   M31942
KEYWORDS    rep1 protein.
SOURCE      S.cerevisiae (strain SKQ-2n) plasmid Scp3 DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1498)
  AUTHORS   Neuville,P., Bonneu,M. and Aigle,M.
  TITLE     2 micron plasmid heterogeneity in the yeast Saccharomyces
            cerevisiae: A new sequence for the rep1 gene
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Neuville, 13-FEB-1990.
FEATURES       from  to/span     description
    pept        295     1437     rep1 protein
BASE COUNT      513 a    245 c    315 g    425 t
ORIGIN      
        1 aagctttgac aaaaagcatt gagtagctaa ctcttctatg caatctatag ctgttttata
       61 aggcatttaa tgacaaattg attgtttaga aacatactag tgaaagtagc cttaatcgcc
      121 tctcgaaatt aataatgcag tatggtgtaa aaaatgcaac tcgcgttgct ctactttttt
      181 ccgaatttcc gaatacgcag ctggggtgat tgatcgattt cgtatcaaaa gttttgttta
      241 taaaaaccgc gaaatccttc ttcaacagat ggatttttac agtgctgaca tacaatgaca
      301 acaactataa tagaaaataa ccgaaatatg aatggagaga gactacttgt ttgtattaag
      361 caatgtatta tgcagcactt ccaacctatg atatacgatg aaagtagatg tgtaattgaa
      421 accacaaggg gaactttttc agttcctgac aattataaga aatataaaac cttagcattt
      481 gcttttgttg gacacgtact gaatacagat gacacaccgg taattgaaaa agaactggat
      541 tggcctgatc ctgcactagt gtacaataca attgtcgatc gaatcagaaa tcacccagaa
      601 ttatcacagt ttatatcggt tgcattcatt agtcaattaa aggcggccat aggggaggtt
      661 agatatcaat gtaagaggca cgttaaaccg cagggaaagg gtataagaag gccgaacgta
      721 ttttttagat acatggaatc cccgtttgtc aataccaagg ttactgactt cttttcttat
      781 cttcgagact ataataaaat tgcctcagag tatcacaata atactaaatt cattctcacg
      841 ttttcatgta aagcatattg ggcatctggc caaaactttt ccgccttgaa gaatgttatt
      901 aggtgctcca tagttcatga atacatttct aggtttgtgg aaagagaaca gggtcttata
      961 ggagatcagg agcaaccacg tgaagaaaac tcttcccctg aactaagtaa tgtgcaacat
     1021 gaagtcaata atttaatcga acaagatgcc gcggcggatg aagggttgtg gagtgagata
     1081 gattcattat gtgaaaaatg gcagtctgaa gcggaagacc aagctgaagc ggagataata
     1141 gccgacagaa taattggaaa tagtcaaagg atggcgaacc ttaaaattcg tcgtacaaag
     1201 ttcaaaagtg tcttgtatca tatactaaag gaactaattc aatctcaggg aaccgtaaag
     1261 gtttatcgcg gtagtagctt ttcacacgat tctataaaga taagcttaca ttatgaagag
     1321 cagcatatta cagccgtatg ggtctaccta acagtgaaat ttgaagagca ttggaaacca
     1381 gttgatgtag aagtcgagtt taggtgcaag ttcaaggagc gaaaggtcaa tgagtaggtt
     1441 atatagggat atagaagata tagaacatag caaagagctg cttttgagca atgtttgt
//
LOCUS       MUSREN1C      758 bp ds-DNA             ROD       25-MAY-1990
DEFINITION  Mouse renin (Ren-1-c) gene, 5' end.
ACCESSION   M34190
KEYWORDS    renin.
SOURCE      M.musculus (strain C57BL/10) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 758)
  AUTHORS   Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J.,
            Pioli,D. and Brammar,W.J.
  TITLE     The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d,
            and its upstream region
  JOURNAL   Gene 84, 91-104 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        729  >   758     renin (Ren-1-c), exon 1
    pre-msg     701  >   758     Ren-1-c mRNA and introns
BASE COUNT      208 a    156 c    245 g    149 t
ORIGIN      
        1 tctagagtca ttgggctcag ccacccttcc cacaccccca tgcctgccac cactctgctc
       61 tgtgaccagg tcctgcttat cctataccta cctagcttgg tctcacaggc taagatttat
      121 cagagctgcc ctgccatggg ccttgttggg cacctggcgt ggggaaataa aggaggtggg
      181 cgagactcac cagagctcag agagatgggc gggcagacag caggggaagg cactgggtgg
      241 tctggcagct ggaaatgctg ggaggccttc ttgggggaga ttagataaag ctcttcaggg
      301 gaaggcctat tccatgactc cagcatggtg agtctagatg aaaggaggta gtctatggtt
      361 ttagagcttt attgtagaag agagagagag aaggtagaga agtagaagcc agtcattgcc
      421 atgaagaagg aagggggaga aggagagcga aaggtaagag taagaaaaca agagctttag
      481 gagagagaca agagagagag gaggaggcaa gcagccaggt aactctgggg gtggagtctg
      541 gacagcctac atgactgatg gccacagaat tatggagctg ggtccttggc cagaaaacag
      601 gctgcctttc atggtcccac aggccctggg gtaataaatc aaagcagagc ctgtgataca
      661 tggtgtgtat aaaagaaggc tcagggggtc tgggctacac agctcttaga aagccttggc
      721 tgaaccagat ggacagaagg aggatgcctc tctgggca
//
LOCUS       MUSREN1D    11231 bp ds-DNA             ROD       25-MAY-1990
DEFINITION  Mouse renin (Ren-1-d) gene, complete cds.
ACCESSION   M32352
KEYWORDS    renin.
SOURCE      M.musculus (strain DBA/2J) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 11231)
  AUTHORS   Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J.,
            Pioli,D. and Brammar,W.J.
  TITLE     The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d,
            and its upstream region
  JOURNAL   Gene 84, 91-104 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1286     1377     renin (Ren-1-d), exon 1
               4697     4847     renin, exon 2
               5364     5487     renin, exon 3
               6056     6174     renin, exon 4
               6955     7151     renin, exon 5
               8940     9059     renin, exon 6
               9487     9631     renin, exon 7
               9884     9982     renin, exon 8
              10547    10708     renin, exon 9
    pre-msg    1258    10887     Ren-1-d mRNA and introns
    IVS        1378     4696     Ren-1-d intron A
    IVS        4848     5363     Ren-1-d intron B
    IVS        5488     6055     Ren-1-d intron C
    IVS        6175     6954     Ren-1-d intron D
    IVS        7152     8939     Ren-1-d intron E
    IVS        9060     9486     Ren-1-d intron F
    IVS        9632     9883     Ren-1-d intron G
    IVS        9983    10546     Ren-1-d intron H
BASE COUNT     2809 a   2725 c   3051 g   2646 t
ORIGIN      
        1 ggtacttagg gagagtaata ctgccagcca gggtcaggtc acttcgacag ctctgccagg
       61 cttctgagag gtacagggaa gaaggacagg gggcaaaagg ataggtctaa gggagtgaag
      121 ggccagtagg agggggtcag ctggaagcca gctgcactta agacctcctt tcttttagag
      181 ctacccttct gtccaccatc actaacccag gagggacacc caccctctcc ctgatgccag
      241 aggttagatg acaggttact tttccaacta ggcagcagcc aaaaggaatc actcatgatc
      301 aagatggaag actagggcag agcctaggag gagccagggg tgatttgaag gttaatatag
      361 attaatagat aggttaatat ataatattag gttaatatgc aggtctcggg ggcatgagtt
      421 ggtagggtct gagttagagg tttgaagcca gggtctatgc tgtctgtacc ctgacctcat
      481 gctgaacagc acagctcctc attagcctct gtctaagcta gaggttgcgg ggccaggcca
      541 aacagggact ctagagtcat tgggctcagc cacccttccc acacccccat gcctgccacc
      601 actctgctct gcgaccaggt tctgcttatc ctatacctac ctaacttggt ctcacaggct
      661 agaatttatc aggactgccc tgccatgggc cttgttgggc acctggcatg gggaaataaa
      721 ggaggtgggc gagacctgtg tctcaccaga gctcagagtg actggatggg cagacagcag
      781 gggaaggcac tgggtggtct ggcagctgga aacgctggga ggccttcttg ggggaaatta
      841 gatgaagctc ttcaggggaa ggcctattcc atgactccag catggtgagt ctagatgaaa
      901 ggaggtagtc tatggtttta gagctttatt gtagaagaga gagagagaag gtagagaagt
      961 agatgccagc cattgccacg aagaaggaag ggggagaagg agagcaaaag gtaagagtaa
     1021 gaaaacaaga gctttaggag agagacaaga gagagaggag gaggcaagca gccaggtaac
     1081 tctgggggtg gagtctggac agcctacatg actgatggcc acagaattat ggagctgggt
     1141 ccttggccag aaaacaggct gcctttcatg gtcccacagg ccctggggta ataaatcaaa
     1201 gcagatcctg tgatacatgg tgtgtataaa agaaggctca gggggtctgg gctacacagc
     1261 tcttagaaag ccttggctga accagatgga caggaggagg atgcctctct gggcactctt
     1321 gttgctctgg agtccttgca ccttcagtct cccaacacgc accgctacct ttgaacggta
     1381 acttgggcag agaaggggtg gcagggtaca ggaactggca tcttactaac gccctcagct
     1441 gtctatgcgt tgggtcatcc agtccttttt ggccagccag tcagggatgt atgatcctgc
     1501 tctgacaatc ctagggaata gcccaaagcc ctagtggacg gccataatta aggggtacca
     1561 caacttcccc cttctactac ttttttaaaa ggcgaggggt cactggagtg actgtgtcat
     1621 ttgtggtgaa aattttgggt ggagtggaag tatcagaaga aacatgtagg ccaaaggagt
     1681 aacatgacta tcagttaatt aatctagata gacattaagt tttccttgtg atcatgtaac
     1741 tccccccaag ttcagcagcc gtttggtttg gcgagagagt tcatctgtag ttggaagacc
     1801 tcaagtctgt cctctgtggg gttcagctcg tgtctctgga tctcatgtgt gtcttgggag
     1861 tcttctgtca gatgaggtgt ctggaggaag gcagctttgt ggtctgtagg tctcaggaga
     1921 cctgtctatg ctggatctgc agctgcaggc actgcccttt acttttgtgg ggataaacct
     1981 gtggggtaat cacagcagag gagcagaacc ctagcagcag gatgtctgca tagggtggat
     2041 ggggaacgtt aagcctgcaa tggtttgctt ggaagcacat cctggcctgt tgttaggatc
     2101 ctagaaaagc aagggtgttt gaggaatgta gtcctggaag gcatccttta agtctgcttt
     2161 gggtggctaa ggaggaaaat atcatgttaa ttgtgtatct gggataggca gtaacagatt
     2221 gtctgcaaaa ggaagctggt tgtcagttaa ggattggatg acggtaatca gcatggttaa
     2281 ttcaacccag ggaagcacat tagatggctt ttggccctta aaggaaatca tgaatgctat
     2341 ggagaaaaat aaacattaaa ataaagaagc aagaaaattt tggattgaaa agcttactag
     2401 cagcagatga gggtaagcag caagctaatc tcaccaatcc tcagtccgcc attcaaaaac
     2461 aggatgaaag cagagcctgc ttcaccggct gctttgaaga tttgatgaga ggataagcat
     2521 agcacttccg atgccaggcg gggaccggtg taagaagcca tggttacagc atggattacc
     2581 acagagtgta ggaaagggaa ggctcctggc agccccacga gggaggatga aacagccatt
     2641 tagtgcagtt ccagggctgt aggaacaagg cttcttgagg caggagcagg gcagaggcaa
     2701 gagaaggctc acccagcaat ggtctgggga ctatgatgag gaagtcctga aagagtcaag
     2761 gcaatgagaa ctgtgaatct gcttggtgct taggagctgg tgggttgctt tcatcctaaa
     2821 ttgagtctcc agacagacat cagcacccat gttgcagatg aggatgtcct ccagaaaggt
     2881 ctgaggactc actggtcttt gtttccaaag ccctgccatc ttgctcccag catgcagcag
     2941 agagcgtggg attaactaga gggcgtgggg ttctgtgctc cctgctctgt gtaaagtatt
     3001 gagcaagtca ccttgagggg ctgagcaatg ggctttcttc tcaaggccat tacatgcctt
     3061 gcatacacac aatgtccagc ctccacgctt agcctgagct gtgcctgggc gaaaaaacca
     3121 tcaaggtgcc agaggacgaa gaggttagtg aatgctcaga accgtgcagc atgccctcag
     3181 catcaggtgg ctctatcccc gttgtccctg taggctaggc ttgtggggag cactcagggg
     3241 aaatggaccc attggctggc atccttggtg agagcaccta ggtatgggct caggtcttga
     3301 aacattaagg atctaggaga ggaatagaag gggttttatg tttattttct tttgggacag
     3361 agcctcatta tggagcctgg aactcactat atagaccaga ctggcctcaa acccacagag
     3421 atcagccttc ctctgcctcc tgaggactga aattaatgtc atgccctacc acacaacaga
     3481 gaggttttct aaaagaagga agggtcataa aagaaacagc aagcattggg tctttttcag
     3541 gaaattggtg gagaactaga cctgtcttcc tcccttgaag agtggggaaa taaaacacac
     3601 ccatctggtg agatggctgg ggtcaactaa cacaagagtg gaaatacttg caagccgtgc
     3661 taggactaag tcatttctgc tggagattac tcggataggt gtgaaatatg ggcaaagctg
     3721 ggaggtgagg cacggagtgc tgggaggaag gttagggctc acactggaga agaaagaggg
     3781 ctctctggat gtctgggaaa gccctggctc aggactggtg gcatcttcct ccaactgcaa
     3841 aacagtctgt gtggtcccgg cagtcctggg taggaaagac agacagacag agcttggact
     3901 gtctctgttt ggcatttctg agtcatcttc atccttgtga accttcaatg tctccgccac
     3961 caccgccgga aaccagggag ctagatgtca tgtggtttgt atgagggtgc gtgccaggca
     4021 gctggcctgc ggccaggagt agctcaggga gctgctgttc taaatgaaat ctcttgaatt
     4081 ttgccagtca ataatgattg gaggtgagga ggggcccagg attcaggccc aagacacatc
     4141 agtccccagg tgaggcacac cggcaagcta tagacactgg cctatccaat ccaagtcaag
     4201 ctaggttcat tgggcacata tcacctggct ctgtgatacc ccagggagca acaagtcctt
     4261 gcagcctgct gtgggcatct gctgccactt acaggccatc cgtggtactg ccttgcccgt
     4321 caatccaagg agatggcaag aggctgtata gaccttttat gttcctctcc agaacagtga
     4381 ggaagagtag gtgttcaaca cacgatgatg ataatggtag atagatgaac agacggactg
     4441 actgatgaac agacgggtgg atggacagac ggatggacgg atgaacaaac tgctttatct
     4501 aactaaaata agatctggga attacttagt accagaataa gtgactccaa gaaaccccag
     4561 ggaactaaaa gtgattcgta attgcaatgg gagagggtta gtgttagacg taccaaaaag
     4621 caaggcaaca tctgggctct gtcaacgttg gagcactgca cgactcttct gtttttcttt
     4681 ctctaccctg gagcagaatc ccactcaaga aaatgccctc tgtccgggaa atcctggagg
     4741 agcggggagt ggacatgacc aggctcagtg ctgaaagggg cgtattcaca aagaggcctt
     4801 ccttgatcaa tcttacctcc cccgtggtcc tcaccaacta cctgaatgtg agtcctggct
     4861 catctgctat ctccttttct ctctgggact agcaggctcc cactccttcc acaccatgtt
     4921 tgactgagag ccaaagagca ccccatagaa ctcctattta ccagccgaga agtctgatgg
     4981 ggaggggctg ttgtgattgt tttgttgttt ggtttggttt gtttttgaga cagggtctca
     5041 ctgtgtagcc tggcaagcct ggaactcact ttgtagacca gactgacctc gaactcacag
     5101 agttctacct acctctgcct cccaggtgct gggattaaag catgtgacat catgcccagc
     5161 aattttaatt gctttgatta agcttgcccc agcaagcctc ctgtatccca ggccagcaaa
     5221 agacaatgct gagtcctcac atctcagcaa aaagaaagcc aagagccgag aaggggtcca
     5281 gacaaccccc caaaccagag gccccaagcc ctgaagtcac ccccatctct ctgggcccca
     5341 ttttcctttt gcccccgcca cagacccagt actacggcga gattggcatc ggtaccccac
     5401 cccagacctt caaagtcatc tttgacacgg gttcggccaa cctctgggtg ccctccacca
     5461 agtgcagccg cctctacctt gcttgtggta agagtcaaga gtcagcctcg tgcctctccc
     5521 cagctctgcc tgctgcacct atgacctgct tctcctcagc tccacaccac tgactgagga
     5581 agagagagga agtgtctctc ttctttttga aatggcagcc aagttaataa gccagctttg
     5641 ggaatgcctg ggcacctgga aggccttttg ccatgtccca aggattcaga caggaacaaa
     5701 gtgtaaaccc actccaggtc ttgaggcttt ttcgatgctc ccatgctgag tacttactct
     5761 gtgcccagaa tcctctttag cactatatca gtgttagcaa tggctttaga actctgtgaa
     5821 aaaaatgtct cagaaagcac gagcacataa gacgaaaaca aaaaaaacca ctatccttgc
     5881 ctggtttaca tttttccaag ctggagactc aggaaactaa acacacacac acacacacac
     5941 acacacacac acacacacct aagtcttcaa ccatgtttat catttgtcct gcctctgagc
     6001 cttctatgta aatcaatctc tcttgtgctg actggggcct gtaaccctct gctagggatt
     6061 cacagcctct atgagtcctc tgactcctcc agctacatgg agaacgggtc cgacttcacc
     6121 atccactacg gatcaggaag agtcaaaggc ttcctcagcc aggacgtggt gactgtgagt
     6181 aggatcgcct ctatttacca cagggcccat gtggttccag acaggtatac tccagccctc
     6241 ctccctaccc ctcccactcc tctgcaggag cctgggaagt tgaagtgcag cctccttggc
     6301 agcttcttct agctgaagct ccgaagccaa ttgcaggaga gccaaacctg ctgacactga
     6361 tgcacaaggc tttgcatcta acttgcatgt ggatgtgtgc ctaaacccca cagtttcctt
     6421 cctagtaacc agtctgactt tgctaattga gaccaggttt ggagtactga gattaggaat
     6481 caggggtttg tgtccgggca gttttgattt ctttccttca gtgtctaagc ctgttctggc
     6541 atatctcagc cccgctgtgt cagaggaaga gaaagacaca tacagaggtc agaggaaatc
     6601 agtgcgagcc cttccctcgc actgcccccg cggaatctgg ggagggcaga aggagccctg
     6661 gtaatgacga ggcatcctca gcctcggctg gtgggctagc acgaaagcgg gcctctagga
     6721 gagttccggt cacctgtctg gcaggacctc tttctcagcc tttatagcat gagtcacccc
     6781 catctctctg ggccccaagc atagaggtgt tgtcagtgtg tgtgcagact tgacggaata
     6841 gtcttaactg ctgtacagag cgtgtgtatc tctgctgccc tggtatgaaa gccagagcag
     6901 atcccagcca aggttaggca gtaacctatg tcccttctgg ttcttctctc acaggtgggt
     6961 ggaatcactg tgacacagac ctttggagag gtcaccgagc tgcccctgat ccctttcatg
     7021 ctggccaagt ttgacggtgt cctaggcatg ggctttcctg ctcaggccgt tggcggggtt
     7081 acccctgtct ttgaccacat tctctcccag ggggtgctga aggaggaagt gttctctgtc
     7141 tactacaaca ggtgggcctt tgacaccaag gccagggcag ggaggggtgg agagcaggga
     7201 gagcagccac cgtccagttc aggatttata tgcttctaat taggtggttc tcagtcccca
     7261 tctagaacca cagtagccaa ttagaacaaa tggaagtcaa gcattcggtc cccagggact
     7321 ggccacacag acacagccag cttcatgcct ttgtttatct gttctcattg atccggttag
     7381 tgctctgtta ccatgtgcta tggtaggctc caggggtgga cagtgaacag acgacccctg
     7441 ccctccatcc tcacagagct cactgcccag caggggagat gagaggtaca acacaatgag
     7501 tgccattcta aagctgcaga tggttttcag ttctaggcca ggattctggg cagttcagcc
     7561 aggcctgggg aaactgtggt cagtgcccca gcaggggtgg gatctgctgc cagtggctgg
     7621 gtttagaaat atggtctaag ctggaggtaa gatctttacg ggtaagaata gaggccaggg
     7681 gtggtgagag tggaagggag tggtgtcctg gaacaatctt agaggaagca ggagtctctg
     7741 gatcatgact ctggcagact tctgtgcaaa gaggggcatt gagccaatga caccactcgg
     7801 aggctcctgc agtcaatggt cttgggactg tgtttcaaag agaccagaaa aacaaagtgt
     7861 ctaagagtgg aagtagatgt cagaaagggg gaggggggac tccgcaaaat gtgggcagtt
     7921 ctggaaattc aacagggaag gagatgtggg gcagagaagc gggagagggg aggtgggtgc
     7981 tctaaacgct agaggatgct tgagcaatgc tgtgggtgga agaaggagag ctgtggaaaa
     8041 cacatgcatg aggctgagaa gcaggagccc tggaagggat gaggacccag caacaggtag
     8101 caagcagtcc tgggaagcag agacctgctt cctttccctt gggctgggaa gaaaggaaga
     8161 tctagataag tcactgactc ttgcacagaa cctgtgtaca cctaacccat gtacatatgt
     8221 gcacatacaa atgtgtgttg tgaattctaa gaaactccat catatacaca tgagtatctt
     8281 aaaaccaccc aaggcaatcc tgagcttagg tagcatcatg cccgtggaga gagaagtgag
     8341 ggtagtcagc tctggctggg aaggagagcc tggctccagc catttggtga ctcagttgcc
     8401 tggggggggg gggtgtgaaa attaacctct accaaacaag ggttgctgat tctttcctac
     8461 gaagtctccc ctctgtgtcc ctctttgcct cagcagaaat ctcaaggagg gaggagcccc
     8521 atatggcccc aaagctccaa tacttcccag acccttaccc tggaatgata cccaccaccc
     8581 acacactcca ttcatccagg cccaactcca ctaaatgtct gccttctccc tagtgtccac
     8641 atgcccagga aagacgtgat ttgccacttg tctgcatcta cccaatgtcc ttgagctaga
     8701 cagttttctg ggtgcaggtg aacagtccta gaaatacttc tgttacattt agggaggatc
     8761 cagcaaggga agatgcaatg gggtctgtgt gcctcctgat gacagcttac agccttgcca
     8821 tctacccact aaacccttcc tctatcccaa agtcctctgc cctaccccac accagactct
     8881 actctccttc atccctgtat ctcgggtatg tctctaactt ccaatgttct ttcctttagg
     8941 ggttcccacc tgctgggggg cgaggtggtg ctaggaggca gtgacccgca gcattaccaa
     9001 ggcaattttc actatgtgag catcagcaag actgactcct ggcagatcac gatgaagggg
     9061 tgggtcagca agcgtaactc tcccaaccat gcaaactgat gtcactgtgt gattggggtg
     9121 ggctgggaat gggatgaggg ggacacttct tagggcacag ccaccatctg cctctctgct
     9181 tgctctgtcc taggatgggg gcatgaaaat ttgaggatgg gaaggaaggg actcagggaa
     9241 aggaggggga gtccggaggc tacatgctga tgcaaggaac agttagtacc tttccatagt
     9301 gtctcggccc tcttccagtt catcatcaga ccatgtatgg cctcctatca cactacccta
     9361 aattgggaaa ggcatgcagg cggcgggaaa agggtgcccg acagcttcta cctgcctgcc
     9421 ttgtctgtct gtctgactgt ctgcctgcct gcctgcctgc ctgccagtct aactatagct
     9481 ctgcagggtg tctgtggggt cttccaccct gctatgtgaa gaaggctgtg cggtagtggt
     9541 ggacactggt tcatccttta tctcggctcc tacgagctcc ctgaagttga tcatgcaagc
     9601 cctgggagcc aaggagaaga gaatagatga agtaagagat ccgtggggag ggctgggtgg
     9661 tggtgagggg tggcagcagc actgtgcggc tccagcctgc aaatcgtcca gacagcactg
     9721 gatcggaaac aaagctttcc cctcctcccc tctcccaccc cctgaatgta tggcatgtct
     9781 tccatagaag gtgaccagag actggcacag gctctctggc acagggtgct ctagtctgct
     9841 tcgaagcttt ccaccctgac aacccctctc tttattcccc cagtatgttg tgaactgtag
     9901 ccaggtgccc accctccccg acatttcctt tgacctggga ggcagggcct acacactcag
     9961 cagtacggac tacgtgctac aggtgaggct gggacaaggg tgggcagggg cactgaaggg
    10021 ggacaagtgt aatgtacaga tcccccagta gactcgtggt tgatgaatac ggctcatgtc
    10081 atcatctgga gaatttttag cagtcccagt acccaggcca catacaagtc tcgcctagca
    10141 gatcggagct tccacaacag gcatctggga atcagagttt taaaacgact tccccagtag
    10201 gttccattat gaagctctgc tgtgactcag tgagccccct ctctccatta ttgagtagcc
    10261 ctctttctag tcctccatcc ttttcatgct cccaggctta gagaagttca atgacttgtt
    10321 ttgggtcaga agctaagtga gcccgtgagc aaggggtaga cacaccattt gcctgatacc
    10381 tactacagtg cctgcctacc ccaggaaggc tgggatttag gataggtaaa acagttctgg
    10441 gaggatgtag gagagaggag gcatatactg aagagctgtt gcttcactct ggaggtggga
    10501 atgcgttgtg tggcgggcag tagagcaagc ctctgcttct tgccagtatc cctacaggag
    10561 agacaagctg tgcacactgg ctctccatgc catggacatc ccaccaccca ctgggcctgt
    10621 ctgggtcctg ggtgccacct tcatccgcaa gttctataca gagtttgatc ggcataacaa
    10681 tcgcattgga ttcgccttgg cccgctaagg ccctctgcca cccagtaacc ctaggccaag
    10741 ccaagctggc actcctgggg gccattttgt ctggctttgt ccccaacata gggacactgg
    10801 acacagagac cctaacgagt gtttgcccct tcacctgcac tcacccttcc ctgctttaag
    10861 gaaaaaccga ataaagattt catgtttaaa gcctgtttcg gatgggttct ttggagtttg
    10921 gaggaggtac acctgcatgg tcacacgtgg agaagactat gacacgtgag gccagagatg
    10981 gaggtttgtt agtaaagtgc ttgcttttca agtgtgagaa cctgagtttg attccaggaa
    11041 cccacactaa aaccaacaaa caaacaaaca gggctggaga gatggctcag tagttaagtg
    11101 cactgagtgt tcttcctgat gtccttgcgt tcaattccca gcaacacacg tggtggctca
    11161 caaccatgtg tatggggtct gatgccctct tctggcatgt ctaagagagt gataatgtac
    11221 tcctaattaa t
//
LOCUS       MUSREN2D     1116 bp ds-DNA             ROD       25-MAY-1990
DEFINITION  Mouse renin (Ren-2-d) gene, 5' end.
ACCESSION   M34191
KEYWORDS    renin.
SOURCE      M.musculus (strain DBA/2J) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1116)
  AUTHORS   Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J.,
            Pioli,D. and Brammar,W.J.
  TITLE     The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d,
            and its upstream region
  JOURNAL   Gene 84, 91-104 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1087  >  1116     renin (Ren-2-d), exon 1
    pre-msg    1059  >  1116     Ren-2-d mRNA and introns
BASE COUNT      309 a    248 c    303 g    256 t
ORIGIN      
        1 tctagagtca ttgggctcag cacccttccc acacccccat gcctgccacc actctgctct
       61 gcgaccagtg tcttgcttat cctatacctc acctaagcta catgccataa atcaatgctt
      121 gttattcaag gcatcatctc ctttgggggg cacttctcaa aaagtcctct cctgtcccta
      181 gatatctgag actcctcagt ggccttccat ttagaagggt gttgtctctg aagtaaaaac
      241 ttcagttcct ctcacaggct aagatttatc agagctgcct gccatgggcc ttgttggcac
      301 tggcgtggga ataagggggt ggcgagactg tgtcccgcag agctcagaga ctgggcaggc
      361 agacagcagg agaagacact gggtggtctg gcagctggaa acgctgggag gccttcttgc
      421 gggagattag atgaggctat tcaggggaag gcctattcca tttctccagc atggtgagtc
      481 tagatgaaaa gaggtagtga aatagaggcc agccattgcc acaaagaagg taagagtaag
      541 agaacaagag cttaagaaag agccaagatg ggctggagag atggctcagt gggtaagagc
      601 acccgactgc tcttccaaag gtccgaagtt caaatcccag caaccacatg gtggctcaca
      661 accatctcat aatgagatct gactccctct tcttggagtg tctgaagaca gctacagtat
      721 atttacatat aataaaaata aatttaaaaa ataaaaaaat aaaaataaat ttaagagaga
      781 gagagaggag gagccaagca gccccattta tagtgggctg ggctaccttg ctgttgtagg
      841 gtaactgtgg ggaggagcaa acctggctat tgccaggtaa ctctgggggg tggagttacc
      901 ttgactactt gactgatggc cacagaatga tggaggtgga tccttggcca gaaaacaggc
      961 tgcgtttcat ggtcccacag gccctggggt aataaatcag agcagagcct gtgatacatg
     1021 gtgtgtataa aagaaggctc agggggtctg ggctacacag ctcttagaaa gccttggctg
     1081 aaccagatgg acaggaggag gatgcctctc tgggca
//
LOCUS       HUMASPX      7787 bp ss-mRNA            PRI       25-MAY-1990
DEFINITION  Human nonerythroid alpha-spectrin (SPTAN1) mRNA, complete cds.
ACCESSION   J05243
KEYWORDS    alpha-fodrin; nonerythroid alpha-spectrin.
SOURCE      Human lung fibroblast cell line WI38, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 7787)
  AUTHORS   Moon,R.T. and McMahon,A.P.
  TITLE     Generation of diversity of nonerythroid spectrins: Multiple
            polypeptides are predicted by sequence analysis of cDNAs
            encompassing the coding region of human nonerythroid alpha-
            spectrin
  JOURNAL   J. Biol. Chem. 265, 4427-4433 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.T.Moon, 12-DEC-1989.
FEATURES       from  to/span     description
    pept        103     7521     nonerythroid alpha-spectrin
                                 /hgml_locus_uid="LK0022P" /nomgen="SPTAN1"
                                 /map="9q33-q34"
BASE COUNT     2147 a   1926 c   2239 g   1475 t
ORIGIN      
        1 gaattcgggg aacggtgtgg agcggaggcc gcggaggctc ctcggtcctt cagcacccct
       61 cggcccgacg cacccacgcc cctcaccccc cgagagccga aaatggaccc aagtggggtc
      121 aaagtgctgg aaacagcaga ggacatccag gagaggcggc agcaggtcct agaccgatac
      181 caccgcttca aggaactctc aacccttagg cgtcagaagc tggaagattc ctatcgattc
      241 cagttctttc aaagagatgc tgaagagctg gagaaatgga tacaggaaaa acttcagatt
      301 gcatctgatg agaattataa agacccaacc aacttgcagg gaaagcttca gaagcatcaa
      361 gcatttgaag ctgaagtgca ggccaactca ggagccattg ttaagctgga tgaaactgga
      421 aacctgatga tctcagaagg gcattttgca tctgaaacca tacggacccg tttgatggag
      481 ctgcaccgcc agtgggaatt acttttggag aagatgcgag aaaaaggaat caaattgctg
      541 caggcccaga acttggtgca gtacttacga gaatgtgagg acgtgatgga ctggatcaat
      601 gacaaggaag caattgttac ttctgaagag ctgggccagg atctggagca tgtagaggtt
      661 ttacagaaga aatttgaaga gtttcaaaca gatatggctg ctcatgaaga aagagttaat
      721 gaagtgaacc agtttgctgc caaactcata caggagcagc accctgagga ggaactgatc
      781 aagactaagc aggatgaagt caatgcagcc tggcagcggc tgaagggcct ggctctgcag
      841 aggcagggga agctctttgg ggcagcagaa gttcagcgct ttaacaggga tgtggatgag
      901 actatcagtt ggattaagga aaaggagcag ttaatggcct ctgatgattt tggccgagac
      961 ctggcaagtg ttcaggctct gcttcggaag cacgagggtc tggagagaga tcttgctgct
     1021 ctagaagaca aggtcaaagc cctgtgtgct gaggctgacc gcctgcaaca gtcccaccct
     1081 ctgagtgcaa cacagattca agtgaagcga gaggaactga ttacaaactg ggagcagatc
     1141 cgcaccttgg cggcagagag acatgcacgg ctcaatgatt catacaggct tcaacgcttc
     1201 cttgctgact tccgtgacct caccagctgg gtgactgaga tgaaagccct catcaatgca
     1261 gatgagcttg ccagtgatgt ggctggggct gaagccctgc tagatagaca ccaagagcac
     1321 aagggtgaaa ttgatgccca tgaagacagc ttcaaatctg cagatgaatc tggacaggca
     1381 ctgcttgctg ctggtcacta tgcctcagat gaagtgaggg agaagctgac cgtcctttcc
     1441 gaggagagag cggcgctgct ggagctgtgg gagctgcgca ggcagcagta cgagcagtgc
     1501 atggacctgc agctcttcta ccgggacact gagcaggtgg acaactggat gagcaagcag
     1561 gaggcgttcc tgttgaatga agacttggga gatttcttgg atagtgtgga agcgcttctt
     1621 aagaagcacg aagactttga gaaatccctt agtgcccagg aggaaaagat tacagcatta
     1681 gatgaatttg caaccaagct aattcagaac aaccactatg caatggaaga tgtggccact
     1741 cgccgagacg ctctgttgag ccgccgcaat gcccttcacg agagagccat gcgtcgccgg
     1801 gcccagctag ccgattcttt ccatctgcag cagtttttcc gtgattctga tgagctcaag
     1861 agttgggtga atgagaagat gaaaactgcc acagatgaag cttataaaga tccatccaac
     1921 ctacaaggaa aagtacagaa gcatcaggct tttgaggctg agctctcagc aaaccagagc
     1981 cgaattgatg ccttggagaa agctggccaa aagctgattg atgtcaacca ctatgccaag
     2041 gatgaagtgg cagctcgtat gaatgaggtg atcagtttgt ggaagaaact gctagaggcc
     2101 actgaactga aaggaataaa gcttcgtgaa gccaaccagc aacagcaatt taatcgcaat
     2161 gttgaggata ttgaattgtg gctatatgaa gtagaaggtc acttggcttc ggatgattac
     2221 ggcaaagatc ttaccaatgt gcagaacctc cagaagaaac atgccctgct agaggcagat
     2281 gtggctgctc accaggaccg aattgatggc gtcaccattc aggcccgcca gttccaagat
     2341 gctggccatt ttgatgcaga aaacatcaag aagaaacagg aagccctcgt ggctcgctat
     2401 gaggcactca aggagcccat ggttgcccgg aagcagaagc tggccgattc tctgcggttg
     2461 cagcagctct tccgggatgt tgaggatgag gagacgtgga ttcgagagaa agagcccatt
     2521 gccgcatcta ccaacagagg taaggattta attggggtcc agaatctgct aaagaaacat
     2581 caagccttac aagcagaaat tgctggacat gaaccacgca tcaaagcagt tacacagaag
     2641 gggaatgcca tggtggagga aggccatttt gctgcagagg atgtgaaggc caagcttcac
     2701 gagctgaacc aaaagtggga ggcactgaaa gcaaaagctt cccagcgtcg gcaggacctg
     2761 gaggactctc tgcaggccca gcagtacttt gctgatgcta acgaggctga atcctggatg
     2821 cgggagaagg aacccattgt gggcagcact gactatggca aggacgaaga ctctgctgag
     2881 gctctactga agaaacacga agctttgatg tcagatctca gtgcctacgg cagcagcatc
     2941 caggctttgc gagaacaagc acagtcctgc cggcaacaag tggcccccac ggatgatgag
     3001 actgggaagg agctggtctt ggctctctac gactatcagg agaagagtcc ccgagaggtc
     3061 accatgaaga agggagatat ccttacctta ctcaacagca ccaacaagga ttggtggaaa
     3121 gtggaagtga acgatcgtca gggttttgtg ccggctgcgt acgtgaagaa attggacccc
     3181 gcccagtcag cctcccggga gaatctcctg gaggagcaag gcagcatagc actgcggcag
     3241 gagcagattg acaatcagac acgcataact aaggaggccg gcagtgtatc tctgcgtatg
     3301 aagcaggtgg aagaactata tcattctctg ctggaactgg gtgagaagcg taaaggcatg
     3361 ttggagaaga gttgcaagaa gtttatgttg ttccgtgaag cgaatgaact acagcaatgg
     3421 atcaatgaga aggaagccgc tctgacaagt gaggaggtcg gagcagactt ggagcaggtt
     3481 gaggtgctcc agaagaagtt tgatgacttc cagaaggacc tgaaggccaa tgagtcacgg
     3541 ttgaaggaca ttaacaaggt agctgaagac ctggagtctg aaggtcttat ggcagaggag
     3601 gtgcaggctg tgcaacaaca ggaagtgtat ggcatgatgc ccagggatga aactgattcc
     3661 aagacagcct ccccgtggaa gtctgctcgt ctgatggttc acaccgtggc cacctttaat
     3721 tccatcaagg agctgaatga gcgctggcgg tccctacagc agctggccga ggaacggagc
     3781 cagctcttgg gcagcgccca tgaagtacag aggttccaca gagatgctga tgaaaccaaa
     3841 gaatggattg aagagaagaa tcaagctcta aacacagaca attatggaca tgatctcgcc
     3901 agtgtccagg ccctgcaacg caagcatgag ggcttcgaga gggaccttgc ggctctcggt
     3961 gacaaggtaa actcccttgg tgaaacagca gagcgcctga cccagtccca tcccgagtca
     4021 gcagaagacc tgcaggaaaa gtgcacagag ttaaaccagg cctggagcag cctggggaaa
     4081 cgtgcagatc agcgcaaggc aaagttgggt gactcccacg acctgcagcg cttccttagc
     4141 gatttccggg acctcatgtc ttggatcaat ggaatacggg ggttggtgtc ctcagatgag
     4201 ctagccaagg atgtcaccgg agctgaggca ttgctggagc gacaccagga acaccggaca
     4261 gaaatcgatg ccagggctgg cactttccag gcatttgagc agtttggaca gcagctgttg
     4321 gctcacggac actatgccag ccctgagatc aagcagaaac ttgatattct tgaccaggag
     4381 cgtgcagacc tggagaaggc ctgggttcag cgcaggatga tgctggatca gtgccttgaa
     4441 ctgcagctgt tccatcggga ctgtgagcaa gctgagaact ggatggctgc ccgggaggcc
     4501 ttcttgaata ccgaagacaa aggagactca ctggacagcg tagaggctct gatcaaaaaa
     4561 catgaagact ttgacaaagc gattaacgtc caggaagaga agattgctgc tctgcaggcc
     4621 tttgccgacc agctcatcgc tgccggccat tatgccaagg gagacatttc tagccggcgc
     4681 aatgaggtct tggacaggtg gcgacgtctg aaagcccaga tgattgagaa aaggtcaaag
     4741 ctaggagaat ctcaaaccct ccaacagttc agccgggatg tggatgagat tgaggcttgg
     4801 atcagtgaaa aattgcaaac agcgagtgat gagtcgtaca aggatcccac caacatccag
     4861 agcaagcacc agaagcacca ggcttttgaa gcagagctgc atgccaacgc tgaccggatc
     4921 cgtggggtta tcgacatggg caactccctc attgaacgtg gagcctgtgc cggcaatgag
     4981 gatgctgtca aggcccgcct ggctgcctta gctgaccagt ggcaattctt ggtgcaaaag
     5041 tcagcggaaa agagccagaa actgaaagaa gccaacaagc agcagaactt caacacaggg
     5101 atcaaggaca ttgcattctg gctgtctgag gtggaggccc tgctggcatc cgaagattat
     5161 ggcaaagacc tggcttctgt gaacaacctg ctgaaaaagc atcaactgct ggaagcagat
     5221 atatctgccc atgaggatcg cctgaaggac ctgaacagcc aggcagacag cctgatgacc
     5281 agcagtgcct tcgacacctc ccaagtaaag gacaagaggg acaccatcaa cgggcgcttc
     5341 cagaagatca agagcatggc ggcctcccgg cgagccaagc tgaatgaatc ccatcgcctg
     5401 caccagttct tccgggacat ggatgacgag gagtcctgga tcaaggagaa gaagctgctg
     5461 gtgggctcag aggactacgg ccgggaccta actggcgtgc agaacctgag gaagaagcac
     5521 aagcggctgg aagcagaact ggctgcgcat gagccggcta ttcagggtgt cctggacact
     5581 ggcaagaagc tgtccgatga caacaccatc gggaaagagg agatccagca gcggctggcg
     5641 cagtttgtgg agcactggaa agagctgaag cagctggcag ctgcccgggg tcagcggctg
     5701 gaagagtcct tggaatatca gcagtttgta gccaatgtgg aagaggaaga agcctggatc
     5761 aatgagaaaa tgaccctggt ggccagcgaa gattatggcg acactcttgc cgccatccag
     5821 ggcttactga agaaacatga agcttttgag acagccttca ccgtccacaa ggatcgcgtg
     5881 aatgatgtct gcaccaatgg acaagacctc attaagaaga acaatcacca tgaggagaac
     5941 atctcttcaa agatgaaggg cctgaacggg aaagtgtcag acctggagaa agctgcagcc
     6001 cagagaaagg cgaacgtgga tgagaactcg gccttccttc agttcaactg gaaggcggac
     6061 gtggtggagt cctggatcgg tgaaaaggag aacagcttga agacagatga ttatggccga
     6121 gacctgtctt ctgtgcagac gctcctcacc aaacaggaaa cttttgacgc tgggctgcag
     6181 gccttccagc aggaaggcat tgccaacatc actgccctca aagatcagct tctcgccgcc
     6241 aaacacgttc agtccaaggc catcgaggcc cggcacgcct ccctcatgaa gaggtggagc
     6301 cagcttctgg ccaactcagc cgcccgcaag aagaagcttc tggaggctca gagtcacttc
     6361 cgcaaggtgg aggacctctt cctgaccttc gccaaaaagg cttctgcctt caacagctgg
     6421 tttgaaaatg cagaggagga cttaacagac cccgtgcgct gcaactcctt ggaagaaatc
     6481 aaagctttgc gcgaggccca cgacgccttc cgctcctccc tcagctctgc ccaggctgac
     6541 ttcaaccagc tggccgagct ggaccgccag atcaagagct tccgcgtagc ctccaacccc
     6601 tacacctggt ttaccatgga ggccctggag gagacctgga ggaacctaca gaaaatcatc
     6661 aaggagaggg agctggagct gcagaaggaa cagcggcggc aggaggagaa cgacaagctg
     6721 cgccaggagt ttgcccagca cgccaacgcc ttccaccagt ggatccaaga gaccaggaca
     6781 tacctcctcg atgggtcctg tatggtggaa gagtcgggga ccctcgaatc ccagcttgaa
     6841 gctaccaaac gcaagcacca ggaaatccga gccatgagaa gtcagctcaa aaagatcgag
     6901 gacctggggg ccgccatgga ggaggccctc atcctggaca acaagtacac ggagcacagc
     6961 accgtgggcc tcgcccagca gtgggaccag ctggaccagc tgggcatgcg catgcagcac
     7021 aacctggagc agcagatcca ggccaggaac acaacaggtg tgactgagga ggccctcaaa
     7081 gaattcagca tgatgtttaa acactttgac aaggacaagt ctggcaggct gaaccatcag
     7141 gatggcaaat cttgcctgcg ctccctgggc tatgacctgc ccatggtgga ggaaggggaa
     7201 cctgaccctg agttcgaggc aatcctggac acggtggatc cgaacagaga tggccatgtc
     7261 tccttgcaag aatacatggc tttcatgatc agccgcgaaa ctgagaacgt caagtccagc
     7321 gaggagattg agagcgcctt ccgggccctc agctcagagg gaaagcctta cgtgaccaag
     7381 gaggagctct accagaacct gacccgggaa caagccgact actgcgtctc ccacatgaag
     7441 cccatcgtgg acggcaaggg ccgcgagctc cccaccgcgt tcgactacgt ggagttcacc
     7501 cgctcgcttt tcgtgaactg agccactccc tgggtcaccc acccctcgct gcttgccctg
     7561 cgtcgccttg ctgcatgtcc gctcctctgt gtgctctcac tttccactgt aaccttaagc
     7621 ctgcttagct tggaataaga cttaggagaa aatggtgctt cactaacccg cttccggtcc
     7681 agtcacaatc atcatgtcac tgtgggaccc agatctgtgt cttgaagcag ctgccctcat
     7741 tccgacttca gaaaatcgaa gcagctggcg cctccccttc ggaattc
//
LOCUS       CIPFNRA      1419 bp ss-mRNA            PLN       25-MAY-1990
DEFINITION  M.crystallinum ferredoxin-NADP+ reductase (fnrA) mRNA, complete
            cds.
ACCESSION   M25528
KEYWORDS    ferredoxin-NADP+ reductase.
SOURCE      M.crystallinum, cDNA to mRNA, clone lambdaMC-fnr1.
  ORGANISM  Mesembryanthemum crystallinum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Aizoaceae.
REFERENCE   1  (bases 1 to 1419)
  AUTHORS   Michalowski,C.B., Schmitt,J.M. and Bohnert,H.J.
  TITLE     Expression during salt stress and nucleotide sequence of cDNA for
            ferredoxin-NADP+ reductase from Mesembryanthemum crystallinum
  JOURNAL   Plant Physiol. 89, 817-822 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.J.Bohnert, 13-JUN-1989.
FEATURES       from  to/span     description
    pept         75     1172     ferredoxin-NADP+ reductase precursor (fnrA; EC
                                 1.6.7.1)
    sigp         75      230     ferredoxin-NADP+ reductase signal peptide
    matp        231     1169     ferredoxin-NADP+ reductase
    mRNA         45     1395     fnrA mRNA (alt.)
    mRNA    <     1     1419     fnrA mRNA (alt.)
BASE COUNT      416 a    283 c    322 g    398 t
ORIGIN      
        1 tttttttttt ttctggttcc tcttcttctt cttcttcttc ttctgcaaag atatcttctt
       61 caatcagctc agcaatggct gccgcagtaa ctgccgcagt ctctttccct tctactaaat
      121 ccactcccct ttctaccaga acatcttccg tcatcaccca tgagaaaatt aacttcaaca
      181 aggttcctct atactacagg aatgtatcag ttggtgggaa agtgggaacc attagggcag
      241 tagcctcaga tgttgaggca cctgttgcaa aggtggagaa gcattcaaag aaaatggagg
      301 aaggagtgat agtcaacaag tacaagccca agaaccctta cactggcaga tgcctcctca
      361 acactaagat tactggagat gatgctcctg gagagacctg gcacatggtt ttctcccatg
      421 aaggagagat tccctacaga gaggggcagt ctgttggagt gattcccgaa ggtattgaca
      481 agaatggaaa gccccacaag ttgagattgt actcaattgc cagcaggccc ttgggcgact
      541 ttggtgactc taaaactgtt tccttgtgtg taaaacgact tatctacacc aatgacaacg
      601 gagaaatagt caagggtgtt tgctcaaact ttttgtgtga tttgaaacct ggtagtgaag
      661 tagtgctaac tggaccagtt gggaaggaaa tgctgatgcc aaaagaccct aatgccacaa
      721 ttatcatgct tgcaactgga actggtattg ctcctttccg ttcgttcttg tggaaaatgt
      781 tctttgagaa gcacgatgac tacaagttca atggtttggc atggctcttc ttgggtgtac
      841 ccactagcag ttctttgtta tacaaggagg aatttgagaa gatgaaggag aaagctccag
      901 aaaacttcag gttggatttc gctgtgagca gagaacaaac taatgagaaa ggggagaaaa
      961 tgtacatcca aacccgtatg gcacaatatg atagagagct atgggaattg ctcaagaaag
     1021 acaacactta cgtctacatg tgtgggctga aaggcatgga gaaaggaatc gatgacatta
     1081 tggtttcatt ggccgcagaa gatggtattg actggttcga ctacaagaag cagttgaaga
     1141 aggcagaaca gtggaacgtg gaagtctact gataaacttg tacaaaactc tcttctttct
     1201 ttttgtgaag cagaatatat ctatttgtca tctcccactg caccttgtag ataggtcagt
     1261 ttcttttcga tattgttatg aatatcagct aattgcattg tagaagctat agccaaaaaa
     1321 gctctgtttt ggagtactca tgtatcgcgt aattatactg gaaaccagtg tcaacacttc
     1381 tacactgaga aattcatgag gatgaacaca ttatgctca
//
LOCUS       HUMCAMPPK    3036 bp ss-mRNA            PRI       25-MAY-1990
DEFINITION  Human cAMP-dependent protein kinase type I-alpha subunit, complete
            cds.
ACCESSION   M33336
KEYWORDS    cAMP-dependent protein kinase;
            cAMP-dependent protein kinase type I; protein kinase.
SOURCE      Human testis, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3036)
  AUTHORS   Sandberg,M., Skalhegg,B. and Jahnsen,T.
  TITLE     The two mRNA forms for the type I-alpha regulatory subunit of
            cAMP-dependent protein kinase from human testis are due to the use
            of different polyadenylation site signals
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 323-330 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         88     1233     protein kinase type I-alpha subunit
                                 /nomgen="PRKAR1" /map="7p13-qter"
                                 /hgml_locus_uid="LW0133F"
BASE COUNT      821 a    552 c    682 g    981 t
ORIGIN      
        1 gctgggagca aagcgctgag ggagctcggt acgccgccgc ctcgcacccg cagcctcgcg
       61 cccgccgccg cccgtcccca gagaaccatg gagtctggca gtaccgccgc cagtgaggag
      121 gcacgcagcc ttcgagaatg tgagctctac gtccagaagc ataacattca agcactgctc
      181 aaagattcta ttgtgcagtt gtgcactgct cgacctgaga gacccatggc attcctcagg
      241 gaatactttg agaggttgga gaaggaggag gcaaaacaga ttcagaatct gcagaaagca
      301 ggcactcgta cagactcaag ggaggatgag atttctcctc ctccacccaa cccagtggtt
      361 aaaggtagga ggcgacgagg tgctatcagc gctgaggtct acacggagga agatgcggca
      421 tcctatgtta gaaaggttat accaaaagat tacaagacaa tggccgcttt agccaaagcc
      481 attgaaaaga atgtgctgtt ttcacatctt gatgataatg agagaagtga tatttttgat
      541 gccatgtttt cggtctcctt tatcgcagga gagactgtga ttcagcaagg tgatgaaggg
      601 gataacttct atgtgattga tcaaggagag acggatgtct atgttaacaa tgaatgggca
      661 accagtgttg gggaaggagg gagctttgga gaacttgctt tgatttatgg aacaccgaga
      721 gcagccactg tcaaagcaaa gacaaatgtg aaattgtggg gcatcgaccg agacagctat
      781 agaagaatcc tcatgggaag cacactgaga aagcggaaga tgtatgagga attccttagt
      841 aaagtctcta ttttagagtc tctggacaag tgggaacgtc ttacggtagc tgatgcattg
      901 gaaccagtgc agtttgaaga tgggcagaag attgtggtgc agggagaacc aggggatgag
      961 ttcttcatta ttttagaggg gtcagctgct gtgctacaac gtcggtcaga aaatgaagag
     1021 tttgttgaag tgggaagatt ggggccttct gattattttg gtgaaattgc actactgatg
     1081 aatcgtcctc gtgctgccac agttgttgct cgtggcccct tgaagtgcgt taagctggac
     1141 cgacctagat ttgaacgtgt tcttggccca tgctcagaca tcctcaaacg aaacatccag
     1201 cagtacaaca gttttgtgtc actgtctgtc tgaaatctgc ctcctgtgcc tcccttttct
     1261 cctctcccca atccatgctt cactcatgca aactgcttta ttttccctac ttgcagcgcc
     1321 aagtggccac tggcatcgca gcttcctgtc tgtttatata ttgaaagttg cttttattgc
     1381 accattttca atttggagca ttaactaaat gctcatacac agttaaataa atagaaagag
     1441 ttctatggag actttgctgt tactgcttct ctttgtgcag tgttagtatt caccctgggc
     1501 agtgagtgcc atgctttttg gtgagggcag atccagcacc tattgaatta ccatagagta
     1561 atgatgtaac agtgcaagat tttttttttt aagtgacata attgtccagt tataagcgta
     1621 tttagactgt ggccatatat gctgtatttc tttgtagaat aaatggtttc tcattaaact
     1681 ctaaagatta gggaaatgga tatagaaaat cttagtatag tagaaagaca tctgcctgta
     1741 attaaactag tttaagggtg gaaaaatgaa aatttttgct aattatcaat gggatatgat
     1801 tggttcagtt ttttttttcc agagttgttg tttgccaagc taatctgcct ggtttattta
     1861 tatcttgtta ttaatgtttc ttctccaatt ctgaaatact tttgagtatg gctatctata
     1921 cctgcctttt aagtttgaaa ctaactcata gatgcaaata ttggttagta tttaactaca
     1981 tctgcctcgg ctcacaaatt ccgattagac ctttatccag ctagtgccaa ataattgatc
     2041 agatgctgaa ttgagaataa gaatttgagg tctacattct tggttgttaa tttagagcgt
     2101 ttggttaaag tatgtccttc agctgactcc agtataatct cctctgctca ttaaactgat
     2161 tccaggagat tggatttgct gtgactagat acagatggag caaatgtcct aacagagaaa
     2221 tagaggtgat gctgctaaag ggagaaatgc caggcggaca aagttcagtg tcgggaattt
     2281 tccccgtgac attcactggg gcatgagatt ttggaagaag ttttttactt tggtttagtc
     2341 tttttttcct cctttttatt cagctagaat ttctggtggg ttgatggtag ggtataatgt
     2401 gtctgtgttg cttcaaattg gtctgaaagg ctatcctgct gaaagtcctg ctttcctatc
     2461 tagcatttat tcctctggca aacttttctt tcttttcttt tttaaagtaa acttgtgtat
     2521 tgagtcttaa ctgtatttca gtattttcca gccttatgtg ttacattatt ccaatgatac
     2581 ccaacagttt atttttatta tttttttaaa caaaatttca cagttctgta atgtaggcac
     2641 ttttattttc attgtgattt atatataagg taatgtaggg ttatatttgg gagtgactgc
     2701 aagcattttt ccatctgtgt gcaactaact gactctgtta ttgatccctt ctcctgccct
     2761 ttcccaggta atttaaattg gtcatggtag atttttttca tagatttgaa aaacttttag
     2821 gttgttacca agtatgaagt ataaatctgg ggaagaggtt ttatttacat tttagggtgg
     2881 gtaagaaagc caccttgtta caaatttttt aatttccaaa ataatctata ttaaatgagg
     2941 gtttctgatc tgtactttgt gtttagctac ctttttatat ttaaaaaatt aaaaatgaaa
     3001 attatgttct tacaagctta aagcttgatt tgatct
//
LOCUS       HUMIGHCU      426 bp ss-mRNA            PRI       25-MAY-1990
DEFINITION  Human Ig rearranged H-chain mRNA V-J4-region, hybridoma AE6-5,
            5' end.
ACCESSION   M29468
KEYWORDS    immunoglobulin heavy chain; joining exon; processed gene;
            variable region; variable region subgroup VH-I.
SOURCE      Human (patient POP) hybridoma AE6-5 DNA, clone pHuUCVH, derived
            from B-CLL cell line UC729-6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 426)
  AUTHORS   Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
            Berman,J.E., Alt,F.W. and Latov,N.
  TITLE     Cloning and sequence analysis of the variable heavy (VH) and light
            (VL) chain regions of an anti-myelin/DNA antibody from a patient
            with peripheral neuropathy and chronic lymphocytic leukemia
  JOURNAL   J. Immunol. 144, 2821-2828 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            L.A.Spatz, 26-OCT-1989, for release after publication.
            
            Columbia University, Department of Neurology BB-322, 630 W. 168th
            street, New York, NY 10032.
FEATURES       from  to/span     description
    pept          1  >   426     Ig H-chain V-J4-region precursor
                                 /nomgen="IGHV" /map="14q32.33"
                                 /hgml_locus_uid="LK0080M"
    sigp          1       57     Ig H-chain signal peptide
    matp         58  >   426     Ig H-chain
    recomb      384      385     V-region DNA end/J4-region DNA start
BASE COUNT       93 a    114 c    129 g     90 t
ORIGIN      
        1 atggactgga cctggaggat cctcttcttg gtggcagcag ccacaggagc ccactcccag
       61 gtgcagctgg tgcaatctgg ggctgaggtg aagaagcctg gggcctcagt gaaggtctcc
      121 tgcgaggctt ctggatacac cttcaccggc cactatatgc actgggtcgg acaggcccct
      181 ggacaagggc ttgagtggat ggggtggatc aaccctaaca gtggtggcac aaactatgca
      241 cagaagtttc agggcagggt caccatcacc agggacacgt ccatcaacac agcctacatg
      301 gagctgagca ggctgagatc tgacgacacg gccgtgtatt actgtgcgag agcctcatat
      361 tgtggttatg actgctatta cttctttgac tactggggcc agggaaccct ggtcaccgtc
      421 tcctcg
//
LOCUS       HUMIGHCV      489 bp ss-mRNA            PRI       25-MAY-1990
DEFINITION  Human Ig rearranged anti-myelin H-chain mRNA V-J4-region, hybridoma
            AE6-5, 5' end.
ACCESSION   M29470
KEYWORDS    autoantibody; immunoglobulin heavy chain; joining exon;
            processed gene; variable region; variable region subgroup VH-III.
SOURCE      Human (patient POP) hybridoma AE6-5 DNA, clone pHuPopVH, derived
            from B-CLL cell line UC729-6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 489)
  AUTHORS   Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
            Berman,J.E., Alt,F.W. and Latov,N.
  TITLE     Cloning and sequence analysis of the variable heavy (VH) and light
            (VL) chain regions of an anti-myelin/DNA antibody from a patient
            with peripheral neuropathy and chronic lymphocytic leukemia
  JOURNAL   J. Immunol. 144, 2821-2828 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            L.A.Spatz, 26-OCT-1989, for release after publication.
            
            Columbia University, Department of Neurology BB-322, 630 W. 168th
            street, New York, NY 10032.
FEATURES       from  to/span     description
    pept         64  >   489     Ig H-chain V-J4-region precursor
                                 /nomgen="IGHV" /map="14q32.33"
                                 /hgml_locus_uid="LK0080M"
    sigp         64      120     Ig H-chain signal peptide
    matp        121  >   489     Ig H-chain
    recomb      450      451     V-region DNA end/J4-region DNA start
BASE COUNT      101 a    121 c    144 g    123 t
ORIGIN      
        1 cccagccctg ggattttcag gtgttttcat ttggtgatca ggactgaaca gagagaactc
       61 accatggagt ttgggctgag ctggcttttt cttgtggcta ttttaaaagg tgtccagtgt
      121 gaggtgcagc tgttggagtc tgggggaggc ttggtacagc ctgaggggtc cctgagactc
      181 tcctgtgcag tctccggatt cacttttagc agctttgcca tgacctgggt ccgccaggct
      241 ccagggaagg ggctggagtg ggtctcagct attagtacta gtggtggtag cacatactac
      301 gcagagtccg tgaagggccg cttcaccatc tccagagaca attccatgca cacgctgtat
      361 ctgcaaatga acagcctgag agccgaggac acggccgtct attactgtgc gaaaggtcct
      421 acatattgta gtagaatcag ctgccctccg gactactggg gccagggaac cctggtcacc
      481 gtctcctca
//
LOCUS       HUMIGHVW      778 bp ds-DNA             PRI       25-MAY-1990
DEFINITION  Human Ig germline H-chain pseudogene V-J6-region, hybridoma AE6-5,
            5' end.
ACCESSION   M29466
KEYWORDS    germline; immunoglobulin heavy chain; joining exon;
            variable region; variable region subgroup VH-I.
SOURCE      Human (patient POP) hybridoma AE6-5 DNA, clone II4a, derived from
            B-CLL cell line UC729-6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 257 to 778)
  AUTHORS   Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
            Berman,J.E., Alt,F.W. and Latov,N.
  TITLE     Cloning and sequence analysis of the variable heavy (VH) and light
            (VL) chain regions of an anti-myelin/DNA antibody from a patient
            with peripheral neuropathy and chronic lymphocytic leukemia
  JOURNAL   J. Immunol. 144, 2821-2828 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 256)
  AUTHORS   Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
            Berman,J.E., Alt,F.W. and Latov,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            L.A.Spatz, 26-OCT-1989, for release after publication.
            
            Columbia University, Department of Neurology BB-322, 630 W. 168th
            street, New York, NY 10032.
FEATURES       from  to/span     description
    pept.ps     257      302     pseudo-Ig H-chain V-J6-region precursor, exon
                                 1 /nomgen="IGHV" /map="14q32.33"
                                 /hgml_locus_uid="LK0080M"
                384  >   778     pseudo-Ig H-chain V-J6-region precursor, exon
                                 2
    sigp.ps     257      302     pseudo-Ig H-chain signal peptide
                384      394     pseudo-Ig H-chain signal peptide
    matp.ps     395  >   778     pseudo-Ig H-chain
    recomb      733      734     V-region DNA end/J6-region DNA start
    signal      157      164     octamer recognition sequence
BASE COUNT      193 a    197 c    216 g    172 t
ORIGIN      
        1 aacacagagt tcagggatgg gtaggggatg cgtggcctct aacaggatta cggcttgaac
       61 cctcagcttc tacaattgtg tcgtccatgt gtcatgtatt tgctctttct catcctgggt
      121 caggaattgg gctattaaat agcatccttc atgaatatgc aaataactga ggtgaatata
      181 gatatctgtg tgccctgaga gcatcaccca aaaaccacac ccctccttgg gagaatccct
      241 agatcacagc tctaccatgg actggacctg gagcatcctt ttcttggtgg cggcagcaac
      301 actaacggac tccccagtcc cggggctgag agagaaacca ggccagtcat gtgagacttc
      361 acccatcctg tgtctctcca caggtgccca ctcccaggtt cgactggtgc agtctggagc
      421 tgacgtgaag aagcctgggg cctcagtgaa ggtctcctgc aaggcttctg gttacacctt
      481 taccagctat gatatcagct gggtgcgaca ggcccctgga caagggcttg agtggatggg
      541 atggatcagc gcttttaatg gtaacacaga ctatgcacag aagttccagg gcagggtcac
      601 catgaccaca gacacatcca cgagcacagc ctacacggag ctgaggagcc tgagatctga
      661 cgacacggcc gtgtattact gtgcgagaga cgggggcttg ggagtggtgg tgatagttga
      721 aacctaaagc aagtactaca acgatatgga cgtctggggc caagggacca cggtcacg
//
LOCUS       HUMIGKAW      405 bp ss-mRNA            PRI       25-MAY-1990
DEFINITION  Human Ig rearranged kappa-chain mRNA V-J1-region, hybridoma AE6-5,
            5' end.
ACCESSION   M29467
KEYWORDS    joining exon; kappa-immunoglobulin;
            processed gene. variable region subgroup VK-IV; variable region.
SOURCE      Human (patient POP) hybridoma AE6-5 DNA, clone pHuUCVK, derived
            from B-CLL cell line UC729-6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 405)
  AUTHORS   Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
            Berman,J.E., Alt,F.W. and Latov,N.
  TITLE     Cloning and sequence analysis of the variable heavy (VH) and light
            (VL) chain regions of an anti-myelin/DNA antibody from a patient
            with peripheral neuropathy and chronic lymphocytic leukemia
  JOURNAL   J. Immunol. 144, 2821-2828 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            L.A.Spatz, 26-OCT-1989, for release after publication.
            
            Columbia University, Department of Neurology BB-322, 630 W. 168th
            street, New York, NY 10032.
FEATURES       from  to/span     description
    pept          1  >   405     Ig kappa-chain V-J1-region precursor
                                 /nomgen="IGKV" /map="2p12"
                                 /hgml_locus_uid="LF0081F"
    sigp          1       60     Ig kappa-chain signal peptide
    matp         61  >   405     Ig kappa-chain
    recomb      369      370     V-region DNA end/J1-region DNA start
BASE COUNT       95 a    110 c    101 g     99 t
ORIGIN      
        1 atggtgttgc agacccaggt cttcatttct ctgttgctct ggatctctgg tgcctacggg
       61 gacatcgtga tgacccagtc tccagactcc ctggctgtgt ctctgggcga gagggccacc
      121 atcaactgca agtccagcca gagtctttta tacagctcca acaataagaa ctacttagct
      181 tggtaccagc agaaaccagg acagcctcct aagttgctca tttactgggc atctacccgg
      241 gaatccgggg tccctgaccg attcagtggc agcgggtctg ggacagattt cactctcacc
      301 atcagcagcc tgcaggctga agatgtggca gtttattact gtcagcaata ttatagtact
      361 cctccgatgt tcggccaagg gaccaaggtg gaaatcaaac gaact
//
LOCUS       HUMIGKAX      407 bp ss-mRNA            PRI       25-MAY-1990
DEFINITION  Human Ig rearranged anti-myelin kappa-chain mRNA V-J4-region,
            hybridoma AE6-5, 5' end.
ACCESSION   M29469
KEYWORDS    autoantibody; joining exon; kappa-immunoglobulin; processed gene;
            variable region; variable region subgroup VK-IIIa.
SOURCE      Human (patient POP) hybridoma AE6-5 DNA, clone pHuPopVK, derived
            from B-CLL cell line UC729-6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 407)
  AUTHORS   Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J.,
            Berman,J.E., Alt,F.W. and Latov,N.
  TITLE     Cloning and sequence analysis of the variable heavy (VH) and light
            (VL) chain regions of an anti-myelin/DNA antibody from a patient
            with peripheral neuropathy and chronic lymphocytic leukemia
  JOURNAL   J. Immunol. 144, 2821-2828 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            L.A.Spatz, 26-OCT-1989, for release after publication.
            
            Columbia University, Department of Neurology BB-322, 630 W. 168th
            street, New York, NY 10032.
FEATURES       from  to/span     description
    pept         24  >   407     Ig kappa-chain V-J4-region precursor
                                 /nomgen="IGKV" /map="2p12"
                                 /hgml_locus_uid="LF0081F"
    sigp         24       83     Ig kappa-chain signal peptide
    matp         84  >   407     Ig kappa-chain
    recomb      368      369     V-region DNA end/J-region DNA start
BASE COUNT       95 a    123 c    102 g     87 t
ORIGIN      
        1 ctcagttagg acccagagga accatggaag ccccagctca gcttctcttc ctcctgctac
       61 tctggctccc agataccacc ggagaaattg tgttgacaca gtctccagcc accctgtctt
      121 tgtctccagg ggaaagagcc accctctcct gcagggccag tcagagtgtt agcagctact
      181 tagcctggta ccaacagaaa cctggccagg ctcccaggct cctcatctat gatgcatcca
      241 acaaggccac tggcgtccca gccaggttca gtggcagtgg gtctgggaca gacttcactc
      301 tcaccatcag cagcctagag cctgaagatt ttgcagttta ttactgtcag cagagtagca
      361 agtggcctct cactttcggc ggagggacca aggtggagat caaagga
//
LOCUS       HUMLSP1A     1631 bp ss-mRNA            PRI       25-MAY-1990
DEFINITION  Human lymphocyte-specific protein 1 (LSP1) mRNA, complete cds.
ACCESSION   M33552
KEYWORDS    lymphocyte-specific protein; phosphoprotein.
SOURCE      Human CTL AH2 cell line, cDNA to mRNA, clones hlsp1.[3,5-1].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1631)
  AUTHORS   Jongstra-Bilen,J., Young,A.J., Chong,R. and Jongstra,J.
  TITLE     Human and mouse LSP1 genes code for highly conserved
            phosphoproteins
  JOURNAL   J. Immunol. 144, 1104-1110 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        109     1128     lymphocyte-specific protein 1 (LSP1)
    mRNA    <     1     1631     LSP1 mRNA
BASE COUNT      373 a    496 c    500 g    262 t
ORIGIN      
        1 tttaacaccc atagtaggcc taaaagcagc caccaattaa gaaagccaca gcacgtacac
       61 ccactccagg gatctgccag caccctgtgg ggcccagact acaggctgat ggcggaggct
      121 tcgagtgacc cgggtgctga ggagcgggaa gagttgctgg ggcccactgc tcagtggagc
      181 gtggaggacg aggaggaggc cgtccacgag caatgccagc atgagagaga caggcagctt
      241 caggcccagg acgaggaggg aggcggccat gtccccgagc ggccgaagca ggagatgctc
      301 ctcagcctga agccctcgga ggcccctgaa ctggatgagg acgagggctt tggcgactgg
      361 tcccagaggc cagagcagcg gcagcagcac gagggggcgc agggcgcctt ggacagcgga
      421 gagccccccc agtgcaggag tcctgagggg gagcaagagg acaggcccgg cctgcatgcc
      481 tacgaaaagg aggacagtga tgaagtccac ctggaggagt tgagtctgag caaggagggg
      541 ccaggcccag aggacactgt ccaggacaac ctgggggccg caggggctga ggaggaacag
      601 gaggagcacc agaaatgtca gcagcccagg acacccagcc ccttggtctt ggaggggacc
      661 atcgaacaga gctcgcctcc cctgagccct accaccaaac tcatcgacag gaccgagtcc
      721 ctaaaccgct ccatagagaa gagtaacagt gtgaagaaat cccagccaga cttgcccatc
      781 tccaagattg atcagtggct ggaacaatac acccaggcca tcgagaccgc tggccggacc
      841 cccaagctag cccgccaggc ctccatagag ctgcccagca tggctgtggc cagtaccaag
      901 agtcggtggg agacgggtga ggtacaggct cagtctgcgg ccaagactcc gtcctgcaag
      961 gatattgtgg ctggagacat gagcaagaaa agcctctggg agcagaaggg aggctccaag
     1021 acctcatcaa caattaagag caccccatct gggaagaggt ataagtttgt ggccaccggg
     1081 catgggaagt atgagaaggt gcttgtggaa gggggcccgg ctccctaggc gtcccatctc
     1141 gcttcctggg tctgcaggtc cagccggctg gcaccctcca tgtacccagg ggagattcca
     1201 gccagacacc cgccccccgg ccctggctaa gaagttgctt cctgttgcca gcatgaccta
     1261 ccctcgcctc tttgatgcca tccgctgcca cctccttttg ctcctggacc ctttagcctc
     1321 tctgcccttc cactctctga ccaccgcccc cgccctcccc acccagctcc gcttcttgtt
     1381 acttggggga ggaaagaaac tcctgatcat tggccaaagg gacttacccc tggagaggcc
     1441 aagtgccttc taggaagtta ggaggttgag gcacagcctg tgcagagagg gtgggtcacc
     1501 cccccagatc caaggggaaa ctgcaggtca agggctgata acggccatgc aggatgcttg
     1561 atgctgcgtc ccccgctgct tgccgccccc caccccgcca ttttgtataa taaagctccc
     1621 tgtgtattct c
//
LOCUS       MUSVG1A      1985 bp ss-mRNA            ROD       25-MAY-1990
DEFINITION  Mouse Vgr-1 protein mRNA, complete cds.
ACCESSION   J04566
KEYWORDS    Vgr-1 protein.
SOURCE      Mouse (strain ICR) 8.5 day embryo (George), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1985)
  AUTHORS   Lyons,K., Graycar,J.L., Lee,A., Hashmi,S., Lindquist,P.B.,
            Chen,E.Y., Hogan,B.L.M. and Derynck,R.
  TITLE     Vgr-1, a mammalian gene related to Xenopus Vg-1, is a member of the
            transforming growth factor beta gene superfamily
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 4554-4558 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by A.Lee.
            22-MAY-1989, for release after publication.
FEATURES       from  to/span     description
    pept         16     1332     Vgr-1 protein
BASE COUNT      448 a    566 c    551 g    420 t
ORIGIN      
        1 ccttgaccgt aagacatgag gaagatgcaa aaggagatcc tgtcggtgct ggggcccccg
       61 cacaggccgc ggcccctgca cggtctccag cagcctcagc ccccggtgct cccgccacag
      121 cagcagcagc agcagcagca gcagcagacg gcccgcgagg agccccctcc agggcggctg
      181 aagtccgctc cactcttcat gctggatctc tacaacgccc tgtccaatga cgacgaagag
      241 gatggggcat cggagggtgt ggggcaagag cctgggtccc acggaggggc cagctcgtcc
      301 cagctcaggc agccgtctcc cggcgctgca cactccttga accgcaagag tctcctggcc
      361 ccgggacccg gtggcggtgc gtccccactg actagcgcgc aggacagcgc tttcctcaac
      421 gacgcggaca tggtcatgag ctttgtgaac ctggtggagt acgacaagga gttctcccca
      481 catcaacgac accacaaaga gttcaagttc aacctatccc agattcctga gggtgaggcg
      541 gtgacggctg ctgagttccg cgtctacaag gactgtgtgg tggggagttt taaaaaccaa
      601 acctttctta tcagcattta ccaagtcttg caggagcatc agcacagaga ctctgaccta
      661 tttttgttgg acacccgggt ggtgtgggcc tcagaagaag gttggctgga atttgacatc
      721 acagcaacta gcaatctgtg ggtggtgaca ccgcagcaca acatggggct ccagctgagt
      781 gtggtgactc gggatggact ccacgtcaac ccccgtgcgg cgggcctggt gggcagagac
      841 ggcccttacg acaagcagcc cttcatggtg gccttcttca aggtgagcga ggtccacgtg
      901 cgcaccacca ggtcagcctc cagtcggcgg cggcagcaga gtcgcaaccg gtccacccag
      961 tcgcaggacg tgtcccgggg ctccggttct tcagactaca acggcagtga gttaaaaaca
     1021 gcttgcaaga agcatgagct ctatgtgagc ttccaggacc tgggatggca ggactggatc
     1081 attgcaccca aaggctacgc tgccaactac tgtgatggag agtgttcctt cccactcaac
     1141 gcacacatga atgccaccaa ccacgccatt gtacagacct tggtccacct tatgaatccc
     1201 gagtacgtcc ccaaaccatg ctgcgcacca accaaactga atgccatctc ggttctttac
     1261 ttcgatgata actccaatgt catcttgaaa aagtacagga atatggtcgt gagagcttgt
     1321 ggttgccatt aagttgaagc tggtgtgtgt gtgtgggtgg gggcatggtt ctgccttgga
     1381 ttcctaacaa caacatctgc cttaaaccac gaacaacagc acagcgaagc gggatggtga
     1441 cacacagagg gatcgtgaca cgcagacaca tctcccgctg gtgccttacc cacggaggct
     1501 tttatgagga ccttgtcaag ggctttccca gttcctaact gagcagttgc tggtctgcag
     1561 gaagctggaa ggcttgtagt acaggcctgg aaactgcagt tacctaatgt tcgcctcccc
     1621 caaccccgcc cggagtagtt ttagctttta gatctagctg cttgtggtgt aagtaaactt
     1681 gaaggaatat taaatatccc tgggttgaaa gacccggtgg tggctctaca gcacccatcc
     1741 cagggagatt tttgcagaca tccgaatgga ggggagaagg gcactctttc aggttccatt
     1801 cccagcaagg gcagctcaca caggacctgc agcctggcca tcagcaggct ctgtggaggt
     1861 gccttctgtc tactgttgta gttacgtgtt ttgtgttgac tctcggtggt gtgagaatgt
     1921 actaatctct gtcaagacaa actgtagcat ttccacccca tcctcctccc tccctcacag
     1981 aattc
//
LOCUS       NPAIFP1A     1730 bp ds-DNA             VRL       25-MAY-1990
DEFINITION  AcNPV mutant with an S.frugiperda insertion element IFP1.6.
ACCESSION   M32775
KEYWORDS    insertion element; insertion element IFP1.6.
SOURCE      AcNPV (mutant strain AcFP875.2) DNA.
  ORGANISM  Autographa californica nuclear polyhedrosis virus
            Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; 
            Nuclear polyhedrosis viruses.
REFERENCE   1  (bases 1 to 1730)
  AUTHORS   Beames,B. and Summers,M.D.
  TITLE     Sequence comparison of cellular and viral copies of host cell DNA
            insertions found in Autographa californica nuclear polyhedrosis
            virus
  JOURNAL   Virology 174, 354-363 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb      118      119     AcNPV DNA end/S.frugiperda DNA start
    recomb     1683     1684     S.frugiperda DNA end/AcNPV DNA start
    site        115      118     viral target sequence
    site       1684     1687     viral target sequence
BASE COUNT      610 a    279 c    252 g    589 t
ORIGIN      
        1 tcgtcgattg cgttttgatc aacgacttga gcagagacac gttaatcaac tgttcaaatt
       61 gatccatatt aactatatca acccgatgcg tatatggtgc gtaaaatata ttttttaacc
      121 ttttgaccgc cagagcataa acgcttcgct gtgccctcca cgccaagcca caaattcaac
      181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat
      241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta
      301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc
      361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat
      421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta
      481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg
      541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat
      601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa
      661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa
      721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc
      781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc
      841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa
      901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa
      961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta
     1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc
     1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg
     1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat
     1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa
     1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt
     1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca
     1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat
     1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt
     1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa
     1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga
     1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa
     1681 aggttaaccc tcttatactt tgcactctgc gttaatacgc gttcgtgtac
//
LOCUS       NPAIFP2A     2225 bp ds-DNA             VRL       25-MAY-1990
DEFINITION  AcNPV with an S.frugiperda insertion element IFP2.2.
ACCESSION   M32779
KEYWORDS    insertion element; insertion element IFP2.2.
SOURCE      AcNPV DNA.
  ORGANISM  Autographa californica nuclear polyhedrosis virus
            Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; 
            Nuclear polyhedrosis viruses.
REFERENCE   1  (bases 1 to 2225)
  AUTHORS   Beames,B. and Summers,M.D.
  TITLE     Sequence comparison of cellular and viral copies of host cell DNA
            insertions found in Autographa californica nuclear polyhedrosis
            virus
  JOURNAL   Virology 174, 354-363 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb       26       27     AcNPV DNA end/S.frugiperda DNA start
    recomb     2191     2192     S.frugiperda DNA end/AcNPV DNA start
BASE COUNT      760 a    407 c    349 g    709 t
ORIGIN      
        1 ccgtctcacc ataacaccgt ttttaccaca gaacatataa agagagtaga aatgaaatag
       61 ttcctgattt gtatcagaac gcgtgtcgcc gtttctccca ccctttaatt tcactacaag
      121 caagagacca atgtctgagc ggattacccc cacaaataaa taacacacat ttaaatctcg
      181 aatttgctat tggttgtcac acaaacgcaa tgactgcatt tgacgtacac tctaacacgc
      241 ataaaaactc atgtatgcta gggcgagaca aaggtcgatt gttgatatcg ataaaattta
      301 agtaatcttg tttttgtttt acaagtttgt tgaatattgt gaaattgtat tcatgcgaaa
      361 tagaaaccaa gtaataaaag taaaataaat aaaaaataca taactgaatt ttaaaaccaa
      421 ttttattact caataagtta aaaatcagta aaaaataaat aaatcatagg cacttttgaa
      481 cgtcaaagca aatataataa tattaataac gtctgtctgt cggtcagccc tctagtgaag
      541 ctatttgctc gttccaaagt gtagattcct atggagaaga tcgaagatct gttcgttcga
      601 tctttttcaa ttagattcac aatacaattc ttgataacat tgcttttctt tgcatcgatt
      661 gcttgaacta tgtgagaaca atgtaaaact aatattcagt caaagcgata gaaaaaaaaa
      721 ccttaagaac aacaaaatta atacagtctg gagctgacca gtcccagttg acagcgctcg
      781 ttcaccaaca tgacacgtac accagcaccg tccaactcaa ccatgttgca gggaatcgaa
      841 cgatccaatg cccgtgccac cgccagtgag acctttgagt gagcatgaca actctaagct
      901 gacacaaatg cattctacta gtctaattta gctaattaca gggctctcac atttacaaaa
      961 atctgtataa agtacagaac acattgaaat aacatggtta tattatttca ttttttaaat
     1021 ttacgaaaga tgatatttta aatatcgcca tttttagaca aaaaaactga tttaatgaga
     1081 ctaatttaaa attttctaaa ataaaatcag tttacttact gaattacaca ttgcgatatt
     1141 gtgcggataa acgcctaatt ttaagtttct tttccctgac tttgttatta tgtaattatc
     1201 aacatcgaaa tgttgcgaac gaaatatgct acatttagta ggaatccagg agcccatagt
     1261 acaggttttc ctagtttggg ttcctgaatc ccaccacttg tcatttcaat tgtcttaaac
     1321 ctggtcgtgt aacttgctac ataatttatt atactattaa ttattatata agcttagttt
     1381 ttaagaccaa tgtctgtata aatatggtgg agataaaaat atacgcagtt tctaatctgt
     1441 gtggtggaac ttgaaaaaaa aaaaataatt ctaccagttg tgtcaatcca caattttgtt
     1501 acacctggat ctgttggaaa cctaaaacat aatattaaaa taaatcatat gtcacattct
     1561 ataaaaaaat aaaaacataa aactcactca aaatctatat tactaattag tcgtaataaa
     1621 taatacttat tatttaaatt ataaagtgtg ctattgatta taatagactg tatcatagta
     1681 acaatcgaca ttgattacaa atttgcccaa cactatagat gtcatagatt ttgtggaaat
     1741 gagcttttgt ttcggtactc gggactctcg caatggacgt catacattgc gcgattgctc
     1801 acgtagtgca agattgtact catctatcgc aagcaatgta ttactattag tctgcttgcg
     1861 attatatcgt ttttgtctat aaatttcata atactaatgt tataattaca aatatcacac
     1921 gtatttttat gtttttggac ttaccggtga aaagaaattc cttctcgtgc tatgctacac
     1981 acctggtttc ttttccgaca cgttataata gcgcaagacg gcataatata atatttttgt
     2041 ggagatacgt acgctgtcaa aacatgacag cgattgctag ccggtaagta tgaacatttg
     2101 agcgttatga cagcggtatg cacgttcgtt cgcaccctcg gcacgcgtgg cccgcccaca
     2161 aggccaatcc agtctctttc ttttatccat ggtttttaca aatggaaatg tatttgtaaa
     2221 acggc
//
LOCUS       RATPFKFBP1   2917 bp ds-DNA             ROD       25-MAY-1990
DEFINITION  Rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase gene, 5'
            flank and exon 1.
ACCESSION   M26215
KEYWORDS    6-phosphofructo-2-kinase; alternative splicing;
            fructose-2,6-bisphosphatase.
SEGMENT     1 of 2
SOURCE      Rat liver DNA, clone lambda-20.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2917)
  AUTHORS   Darville,M.I., Crepin,K.M., Hue,L. and Rousseau,G.G.
  TITLE     5' flanking sequence and structure of a gene encoding rat
            6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 6543-6547 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.I.Darville, 08-AUG-1989.
FEATURES       from  to/span     description
    pept       1374  /  1401     M-type 6-phosphofructo-2-kinase (EC
                                 2.7.1.105)/fructose-2,6-bisphosphatase (EC
                                 3.1.3.46; PFK-2/FBPase-2), exon 1
    pre-msg    1255  >  2917     PFK-2/FBPase-2 mRNA and introns (alt.)
    IVS        1402  >  2917     PFK-2/FBPase-2 intron A
BASE COUNT      779 a    627 c    605 g    906 t
ORIGIN      
        1 ggatccagac agaactccgg cttaagtgag cggcccctgg cggtggctgt gaggggcttc
       61 tgcctggagt cctttgggga actttctcag ttgaaagatg gagcagttaa gaatttactt
      121 agctagcctc cgttttaatg gagtctggag gaaggagact acctcaaatc ctccctgtag
      181 agccatattt gtcttgactt tcaaagcgtg agtcagactt gcatttggct ccatttctca
      241 gatgagcaaa atcgaggcct agaacattta agctccggag cagtatcttt tccctatcct
      301 gtctacgctt tgtcctttca ttcttaggat aaaccaggaa gaggcactga agctaagtag
      361 aagttgaata aattcgtctc agcctcacag aagttataga gccaagattt aagcccttct
      421 taatcaggta aacgattgat tttctgtcca cacaactctg tgtcttttcc cgtgggcacc
      481 actgcacttt acgatccatt ctagtgcaac atttaccaca gtgaacagct cccattgatt
      541 tccattgtca ctagaccgct aacagcctga atgcagtggc catcttggac ccagcttata
      601 tgataacgga ataaatattt agcagatgga tgaagcagca ggccagtagc gatgagttct
      661 tgacttattt tttttaacag gaaatgccca atttccagga aaactataac tttgcatcgt
      721 aaggtttttt cactgaaaat tgctacagag ttattcccac tcgaatacac acaccctcag
      781 aatctcacag cctttaagcc atgatcttca gtgggttgct cctctaccct cagaggactt
      841 aaaggtcact agctattgac attagaaaaa catctatgta attaaaagtc tgtgccttga
      901 gcttttccat gcattgggag ttactgacta aagatcgagt cccaggagtg ggttaattag
      961 gacaagggaa atgaacgttt ttcttcatgt ttccagattg ctttcaagca agggattttg
     1021 ccaatgctgt actaaaggca gacagaaacc tttgtcccaa ggaaatccag cttttgctca
     1081 cctcaatagc tggttttaat ttctccattc tccagtggct tgttaatatt agtactgacc
     1141 tttggggcaa ggtgaacaca tggttggact gaagagaaaa cgctctgggt ggctcaggaa
     1201 cttctttggc aagtacagca gctaatattt caacactgca cacacccctt aacaagggcg
     1261 cttcctcagc cttcccagga accaacggat gtctccgagt tttctatttg atgccaccca
     1321 ccggacctgc tttgggggtc tgtaaatgca agagagccat tggaaaatta gcgatggaag
     1381 aaaaagcctc taagagaaca ggtaggtgag cctaaaggaa aaacagacag gtctttttac
     1441 aggtcaaaaa tccttctttc tggcttcttg gaagattttt tctgcatagg attctgattt
     1501 agctttatta aaggggatgg gagggttggg gaatgataat atgaatacct aattcaactg
     1561 ccaatatttt atttttttaa ttattttatt aatattccaa atgttgcccc ctccctattg
     1621 tcagtatttt aaaattcatt tgtacttgac tttttaggac cctctctaaa ggtcagaccc
     1681 ctactatctt tgaaattcca atttttgttt tatttagttt tggaaaatga atttaactgt
     1741 gcttcccatg ggagcctgaa actttctatg tagcctggag tggccatgat tgtgtaaccc
     1801 tcccagccta gcttccacct atgtagtgta tgtaattttt taaattaaat gcacaataga
     1861 gagtgcatct ctgacctctc ctctagtgta taaagttgag ttgatttgct gtgaaacttt
     1921 gacacattct ttagtttctc tgggtgtcaa tggatacctt agtaaaacag aaattaaagc
     1981 ccctagcctc cttcccacag aggactggcc tgatgatcca gaatctctaa taggcagaat
     2041 ccagagttct tttgcttcag ggctttcatg ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt
     2101 gtgtgtgtgt gtgtgtgtgt gtgtgtgaca ttcatgttct tggcttttat gtggcgtcca
     2161 caggtagtac cacccacagc cctctctcta tgtgctctcc ttcctaagac tcgtcctaga
     2221 taagaatgca gtgagctttg tggagcagac aggagtctga aaatagtggc ataatggggc
     2281 cagagcagtc actgagaaca aagggattct aggctgctgt cccagaagtc ctgatatata
     2341 ttcatacatg aatctagcct gtaatcatgt gattatccgt cagccttcta gcctataaca
     2401 aatatattca caaccacaca tttcaagact taccaataag gagaccccat ttatgacttt
     2461 attgcccatc ctatcttgat aaaatgagct tcctcaccac attgtacatg gaggtgggag
     2521 caggctctat ggccgaacat catactgcat agtttcttcc ctagagatag ttctacttta
     2581 tggttcagcc tttagccctg agtatgaagc tgagtaaacc ctgtgtctat aaccaataca
     2641 ttaaaacgtg ttagaatttt atcattttat tctgtgccat cctaaagttt taaggttttt
     2701 tttttttttt ggatgttgct ctttacctca gaaaaaccag ctctcccctg tccatttccc
     2761 tgaaagctgt cataatatct accatggtga actgggattt cttgtcacca tgtttatctc
     2821 ttctgcagca gagagatgta gattgtgaac atgatcactg aacaagtgtc ctccatatca
     2881 gagcccaaga tgaacagtca gcaggtcaca gggatcc
//
LOCUS       RATPFKFBP2   3720 bp ds-DNA             ROD       25-MAY-1990
DEFINITION  Rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase gene, exon
            1'.
ACCESSION   M26216
KEYWORDS    6-phosphofructo-2-kinase; alternative splicing;
            fructose-2,6-bisphosphatase.
SEGMENT     2 of 2
SOURCE      Rat liver DNA, clone lambda-20.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3720)
  AUTHORS   Darville,M.I., Crepin,K.M., Hue,L. and Rousseau,G.G.
  TITLE     5' flanking sequence and structure of a gene encoding rat 6-
            phosphofructo-2-kinase/fructose-2,6-bisphosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 6543-6547 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.I.Darville, 08-AUG-1989.
FEATURES       from  to/span     description
    pept       2913  /  3009     L-type 6-phosphofructo-2-kinase (EC
                                 2.7.1.105)/fructose-2,6-bisphosphatase (EC
                                 3.1.3.46; PFK-2/FBPase-2), exon 1'
    pre-msg    2664  >  3720     PFK-2/FBPase-2 mRNA and introns (alt.)
    IVS        3010  >  3720     PFK-2/FBPase-2 intron A'
BASE COUNT     1020 a    752 c    859 g   1089 t
ORIGIN      About o.1 kb after segment 1.
        1 aagcttctct caaggctata atactgactt tcagagaaga tgaacactaa aactggtctt
       61 aactaagaca gagttcagtg agaataaaag atacttcagg ctgaggaaga gcctaaaagc
      121 aggaaaacgc tgtgttcgag ggtcctgtaa tcagccagtg tggttaaaca tgggggagtc
      181 ctggggggta gaagactgat aggtggctta ggacttgatt tagaagaagt ttgaatgcta
      241 gtctgagtca tttctctaag gcttgttttt tggtgtttct gagttcagaa ttgctaacgg
      301 tagtgaacaa tttcatagtt agttctaggc agggttaaat tagggtgtca aacctggacc
      361 ctgagtttac ccattcagac cttgtcagag gccttctagc tacctagctc agatttcttg
      421 ctactaaaat tatcttcttg attcttggca gtgttgtaag gacaaaattg taaacataaa
      481 agacatgttc atgttgttgg ctagattacc ttagacaagc catcccctcc tcaggcccat
      541 ttccctgtgc tattactggg gaagtttcaa agaaacaaat aagataaaga aaatgttact
      601 ttgtgcgcac tggaaaacat acaaagttac ggtaagatca ttaaaacaga cttccactaa
      661 ttatgtgact tactaggatt aagagactca tgtcttgcaa agtctctttc tgagactcta
      721 ggagttatct gctaatctgg aaaaaactca ttatattgta ttttttactt taaattgata
      781 gataattgtt tatacttgtg tggtacagtg catgtatata atgtacaatg gttaatagct
      841 ttatgtatca cctcaaatat ctatcatttc tcctttccac cagctatatt gaagtatcca
      901 attatctgtt gtgaaccata gttatccaga tgtgctcttg aacatcagaa gttattcctc
      961 ctgtttccca gaatccatcc tgtttgttct ctcttttctc cctctccttt gcccactagt
     1021 ctatactcag tttctcacca atctctttta cctttgtcca gtaggtgaga gcatgtagta
     1081 tttgatgttt tttgccagac ttaagttaac agaaacctag gaaggagttt acctttctaa
     1141 ttgacactca agaagtcttt acagtataaa tgttaatccc gtgtcacaat aacgtgctag
     1201 accttgcaga ttacttggag ggaaaaagat gaatcaatta ctgcccaaag gaaacaacag
     1261 tttgaatcat atgtattcag cttccccagg tttcttcatg tgtatcaata tattgactct
     1321 tcgctaacat tgagggaatg gaccctacta tccttatatc tagcaatggg cagaagtgag
     1381 agtccaccca gacagtatgg ttcctaatcc cattgttatt gttcttaact gctctcatcc
     1441 attgcctcag taggctcctt gagataggta ggccttcacg ttgaaagtct tatatgctca
     1501 gcaactgcag tgaatctaac ccttctatag actaaggttc tgagaagata ttttaggaag
     1561 tgtcttaaaa ggcaaagtaa agctgggagc atttgagggc agggactgga gatttgtttt
     1621 cccgagagat tgcccttgca tactccagct cagtggaaga aagttgagcc tcctaatacc
     1681 caatcactgg agctttgctg ggggtgggaa ggttacaaag aatgaagtaa aagggcactt
     1741 ggacatcttc caggcaagcc atgcagttgc atgaattcta cagataaaag tctgaaaata
     1801 actgggggaa tggcaataag agtggggaca agtgaaaggg agacatctgg atgagatatc
     1861 tctgcctgtg gatttgtgct atttaaatag atgccattac ctcttattcc ccatagtctc
     1921 ctcagtgtgg tttccacctc tcctaagatc ccatcccaac actttggctt gaagagtttt
     1981 agcacgggtt tgaattagat tctgatgcct ttagtgctgg gtcatttcag cattggcagc
     2041 catgtgagcg acccttactt atcggctctc tgcccagcaa tggctgccct cactctcatt
     2101 cctaaggtgt cagatttact ctgatttatt taaccttggc tggagagagt ataaaggtga
     2161 tcagatagtg ccaatctctt tgaagtgccc aggacagctc attttaagag ggagaggtcc
     2221 ggtagagagg cagggaacaa actctaggga aagacagttt tacagtgcct tgttgaatag
     2281 ggtgagtggt gaggtagtga aagtctactt ccagtttggg cctttaattg ctagctcaaa
     2341 acattgcatt ttgattagga acattatgtt tgtggttttg ttgtgggggg gggggcttgt
     2401 aatgaacctc tgtttctgag ttggagctta atgttctaat taggcctgct ttgaaatcaa
     2461 tttcaaagct tagacctatc tccatcacct ttgtcatgca cacaacacag gatgtaagca
     2521 catgatcttt ggtcttttat ttgcatactc tactagtcct gccttcattg gcgagaagta
     2581 actggccaga cctgggctgt cttgctggct gccttccttg ctgggctcca acttatcttc
     2641 tatgtacata gcccttggag ttcagaggcc tctgctgact tctgctcagc ctctggctct
     2701 cagcacccac atccccactt gtacatctgg gagcagaaga aaaagaggga gctggagata
     2761 atctgtgaga ggctccgaag cccagtacat ccccctgact cagccctgcc cacctgctgc
     2821 tgtgactaca gcaacagaca agacagctag gaagatagga agtgaggtcc aataccttgt
     2881 gggcagtggt gtcattagct gcaacgccta agatgtctcg agagatggga gaactcactc
     2941 aaaccaggtt acagaagatc tggattccac acagcagcag tagcagcgtg ctgcaacggc
     3001 gaaggggctg taagtgaggg ttgtcatttg tgggtggtgg tgatgggtgg gtctagaaga
     3061 acaagtggtg ttccccatcc actcctaccc tcacaaagaa gttggaatag aagatcattt
     3121 ggatgaaatt gctagagtgt aacaagaggc taatatgaga agggtaagga agtaagagaa
     3181 aggcaggtag aaaagacaac gatgaggctt agataaaaag aacccaaact acagggaccc
     3241 ctgacctaag ggctgtctcc ctttatttga gaaacggtcg gaatgcttct tgtctggaat
     3301 gcgagcgttc tagtgtagtg ttacccgaga gtattttatt ttagcttctc ttatgaaagc
     3361 tctggaaaca ggtcttgttt aatttatgcc agggtgaaag aaggtgatag ttcctggcgt
     3421 ggtttctcgg aggctaggtg cttagtccac accagtaccg tgtcactgct cagcattgcc
     3481 cttcctcttc caccatcctc ttaaaaataa ctgtacctgc ccagctgggg acaactgcgt
     3541 acttggcctt gacattcaga gacagtcttg ctgttctgag gagtcgaggg gacagacagt
     3601 atcacaaaga ctgatctggt gttcagttgg agcttaacat cctaattaga cctgatgtct
     3661 ttattttaaa actcaggcct ttgaaatttt aaggaggagc agaggcaaag taagcaaagc
//
LOCUS       SPFIFP1A     1801 bp ds-DNA             INV       25-MAY-1990
DEFINITION  S.frugiperda insertion element IFP1.6 DNA, clone lambda 879.
ACCESSION   M32776
KEYWORDS    insertion element; insertion element IFP1.6.
SOURCE      S.frugiperda DNA, clone lambda 879.
  ORGANISM  Spodoptera frugiperda
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Noctuoidea; Noctuidae.
REFERENCE   1  (bases 1 to 1801)
  AUTHORS   Beames,B. and Summers,M.D.
  TITLE     Sequence comparison of cellular and viral copies of host cell DNA
            insertions found in Autographa californica nuclear polyhedrosis
            virus
  JOURNAL   Virology 174, 354-363 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site        115      118     5' insertion target site
    site       1755     1758     3' insertion target site
BASE COUNT      629 a    274 c    273 g    625 t
ORIGIN      
        1 gagaatcaat gttcactgga aatatatccg ttttagtttg taacgaagaa aaagatgcga
       61 tgaagtgttt tgtgacgtgg attcttagaa ggaagaattc aatgttctat tttattaacc
      121 ttttgaccgc cagagcataa acgcttcgcc gtgccctcca cgccgagaca caaattcaac
      181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat
      241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta
      301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc
      361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat
      421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta
      481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg
      541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat
      601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa
      661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa
      721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc
      781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc
      841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa
      901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa
      961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta
     1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc
     1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg
     1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat
     1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa
     1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt
     1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca
     1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat
     1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt
     1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa
     1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga
     1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa
     1681 aggtaatgtg tttattttgt ttttctctgt gatctgtttc ttggcatcta tagatgtcgg
     1741 tggcgcacag ggcattaata tctatcacct taccttttat tattacttta atgtataatg
     1801 g
//
LOCUS       SPFIFP1B     1820 bp ds-DNA             INV       25-MAY-1990
DEFINITION  S.frugiperda insertion element IFP1.6 DNA, clone lambda 883.
ACCESSION   M32777
KEYWORDS    insertion element; insertion element IFP1.6.
SOURCE      S.frugiperda DNA, clone lambda 883.
  ORGANISM  Spodoptera frugiperda
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Noctuoidea; Noctuidae.
REFERENCE   1  (bases 1 to 1820)
  AUTHORS   Beames,B. and Summers,M.D.
  TITLE     Sequence comparison of cellular and viral copies of host cell DNA
            insertions found in Autographa californica nuclear polyhedrosis
            virus
  JOURNAL   Virology 174, 354-363 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site        115      118     5' insertion target site
    site       1774     1777     3' insertion target site
BASE COUNT      636 a    281 c    270 g    633 t
ORIGIN      
        1 agagttaagt ggctttgatg atttgtgcat atagtgccat tgtgcataca aaaccaaaat
       61 gttttatatt caaacttaat tttatcgttg tcatctaaaa cgaaagccat ttctttaacc
      121 ttttgaccgc cagagcataa acgcttcgct gtgccctcca cgccaagcca caaattcaac
      181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat
      241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta
      301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc
      361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat
      421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta
      481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg
      541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat
      601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa
      661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa
      721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc
      781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc
      841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa
      901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa
      961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta
     1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc
     1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg
     1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat
     1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa
     1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt
     1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca
     1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat
     1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt
     1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa
     1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga
     1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa
     1681 aggacacgtg aaagtaatga tttaatgtgt ttattttgtt tttctctgtg atctgtttct
     1741 tggcatctat agatgtcggt ggcgcacagg gcattaatcc attttctatc ataagttatt
     1801 tgatacttag atctgtaagt
//
LOCUS       SPFIFP1C     1741 bp ds-DNA             INV       25-MAY-1990
DEFINITION  S.frugiperda insertion element IFP1.6 DNA, clone lambda 889.
ACCESSION   M32778
KEYWORDS    insertion element; insertion element IFP1.6.
SOURCE      S.frugiperda DNA, clone lambda 889.
  ORGANISM  Spodoptera frugiperda
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Noctuoidea; Noctuidae.
REFERENCE   1  (bases 1 to 1741)
  AUTHORS   Beames,B. and Summers,M.D.
  TITLE     Sequence comparison of cellular and viral copies of host cell DNA
            insertions found in Autographa californica nuclear polyhedrosis
            virus
  JOURNAL   Virology 174, 354-363 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site         60       63     5' insertion target site
    site       1696     1699     3' insertion target site
BASE COUNT      609 a    275 c    263 g    594 t
ORIGIN      
        1 ggtaggcagt gacaatggaa cgccaattgc tacgaatctt acatacttca ttcgcttcat
       61 taaccttttg accgccagag cataaacgct tcgctgtgcc ctccacgcca agccacaaat
      121 tcaacgaaat aaccttgaaa atattaggga ttccaaaatg ttatcgataa accaaagtga
      181 gtaataattt tcgtctgatt tttttttgaa taaactgctt ataacataaa aagaatattt
      241 atttaaccta atctcccctg taataaactt aagaccacaa taatagcaac tatcaataaa
      301 atatcagcct tttgagacac aaaaatacat ataaaattgc agcaagttac agcactatta
      361 tctatcagtt acgcgccatt ggcatgtata catgccaatg gcgcggcaac tatcaatttt
      421 ttttaaatct ctcgtttatg attagggatg tagctgaata atgtaactta tcgagttcat
      481 ttatgtatta ggtaatcaaa tttgaaaact gctaaatagg tacagatttt agtgatatta
      541 tttatacact ttcatattct taaaatagtt gccttgtatt tgtaaaatgt cacatgaaac
      601 ttaaaaatat aaaaaatcaa caatcgtatt tatttattat gctcattaac agccttgtaa
      661 ttgaaccgtc acgtgaaact taaaaatata taaaaaatca acaattgtgt ataattatta
      721 tgctcaatac agccttgtat ttgtaaaatg tcacatgaaa cttaaaaata taaaaaatca
      781 caatcgtatt tatttattat gctcattaac agccttgtaa ttgaaaaagg ttacgtgaaa
      841 cttaaaaata tataaaaatc aacaattgta cctataaaac attatgctca ctaacagtct
      901 tggaattgta aaatttaaaa ctaaattgta tattatctat aattattatc aataacaaaa
      961 atttaataaa taaatagcaa ttataatgta ggtactcgat aaaatatttt ttgcaacttc
     1021 cccgcattta aaaacacggc acgcgcttac gacgtcacga gtccgacatc tatacatgtc
     1081 aatggcgcag aagggattgc atatcggtag tatgctagga tgcgcatgat tggcgaaaag
     1141 ggaatgtcga taatattgaa aatatagtaa atatgctatt atacccaaga atttttcagt
     1201 ttaaaatata gtatatttac cttttttctc agagactaag gttatatata tactttttaa
     1261 tagttattac aaaaaataag cgagtcattt agctgaagtt ttaatattac ttgaattaca
     1321 ctacatcact agtaggcatg tatacatgcc attggcgcga gagatacagt tcttttttag
     1381 ccgatattta agttgttttt atcaatcaaa tattagttat tctatattgc aattttgttc
     1441 actgtgtaca attatgtaat aactgggtat tcagatctta agatacatca ataaaacacg
     1501 tgaaagtaat gatttaatgt gtttattttg tttttctctg tgatctgttt cttggcatct
     1561 atagatgtcg gtggcgcaca gggcacgctt gcgcatgaca tctatacatg ccattggcgt
     1621 tcaaaaggtg tttattttgt ttttctctgt gatctgtttc ttggcatcta tagatgtcgg
     1681 tggcgcacag ggcattaaca taacttggta taaattgttg cataattgca ttgagagcag
     1741 a
//
LOCUS       SPFIFP6A     1792 bp ds-DNA             INV       25-MAY-1990
DEFINITION  S.frugiperda insertion element IFP1.6 DNA.
ACCESSION   M32780
KEYWORDS    insertion element; insertion element IFP2.2.
SOURCE      S.frugiperda DNA, clone lambda 858.
  ORGANISM  Spodoptera frugiperda
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Noctuoidea; Noctuidae.
REFERENCE   1  (bases 1 to 1792)
  AUTHORS   Beames,B. and Summers,M.D.
  TITLE     Sequence comparison of cellular and viral copies of host cell DNA
            insertions found in Autographa californica nuclear polyhedrosis
            virus
  JOURNAL   Virology 174, 354-363 (1990)
  STANDARD  simple staff_entry
BASE COUNT      586 a    326 c    287 g    593 t
ORIGIN      
        1 atgatgtttg tacataatgt tggtacgttt ctcccaccct ttaatttcac tacaagcaag
       61 agaccaatgt ctgagcggat tacccccaca aataaataac acacatttaa atctcgaatt
      121 tgctattggt tgtcacacaa acgcaatgac tgcatttgac gtacactcta acacgcataa
      181 aaactcatgt atgctagggc gagacaaagg tcgattgttg atatcgataa aatttaagta
      241 atcttgtttt tgttttacaa gtttgttgaa tattgtgaaa ttgtattcat gcgattcaaa
      301 ttgtatcacc aacatgacac gtacaccagc accgtccaac tcaaccatgt tgcagggaat
      361 cgaacgatcc aatgcccgtg ccaccgccag tgagaccttt gagtgagcat gacaactcta
      421 agctgacaca aatgctgaaa tgcattctac tagtctaatt tagctaatta cagggctctc
      481 acatttacaa taatctgtat aaagtacaga acacattgaa ataacatggt tatattattt
      541 cattttttaa atttacgaaa gatgatattt taaatatcgc catttttaga caaaaaaact
      601 gatttaatga gactaattta aaattttcta aaataaaatc agtttactta ctgaattaca
      661 cattgcgata ttgtgcggat aaacgcctaa ttttaagttt cttttccctg actttgttat
      721 tatgtaatta tcaacatcga aatgttgcga acgaaatatg ctacatttag taggaatcca
      781 ggagcccata gtacaggttt tcctagtttg ggttcctgaa tcccaccact tgtcatttca
      841 attgtcttaa acctggtcgt gtaacttgct acataattta ttatactatt aattattata
      901 taagcttagt ttttaagacc aatgtctgta taaatatggt ggagataaaa atatacgcag
      961 tttctaatct gtgtggtgga acttgaaaaa aaaaaaatta attctaccag ttgtgtcaat
     1021 ccacaatttt gttacacctg gatctgttgg aaacctaaaa cataatatta aaataaatca
     1081 tatgtcacat tctataaaaa aataaaaaca taaaactcac tcaaaatcta tattactaat
     1141 tagtcgtaat aaataatact tattatttaa attataaagt gtgctattga ttataataga
     1201 ctgtatcata gtaacaatcg acattgatta caaatttgcc caacactata gatgtcatag
     1261 attttgtgga aatgagcttt tgtttcggta ctcgggactc tcgcaatgga cgtcatacat
     1321 tgcgcgattg ctcacgtagt gcaagattgt actcatctat cgcaagcaat gtattactat
     1381 tagtctgctt gcgattatat cgtttttgtc tataaatttc ataatactaa tgttataatt
     1441 acaaatatca cacgtatttt tatgtttttg gacttaccgg tgaaaagaaa ttccttctcg
     1501 tgctatgcta cacacctggt ttcttttccg acacgttata atagcgcaag acggcataat
     1561 ataatatttt tgcggagata cgtacgctgt caaaacatga cagcgattgc tagccggtaa
     1621 gtatgaacat ttgagcgtta tgacagcggt atgcacgttc gttcgcaccc tcggcacgcg
     1681 tggcccgccc acaaggccaa tccagtctct ttcttttatc catggttttt acaaatggaa
     1741 atgtatttgt aaaacggcgt tggtacatct gtacctaaaa atgatatagt at
//
LOCUS       TACLSEG      7102 bp ss-RNA             VRL       25-MAY-1990
DEFINITION  Tacaribe virus L protein and P-11 protein genes, complete cds.
ACCESSION   J04340 M33513
KEYWORDS    L protein; P-11 protein.
SOURCE      Tacaribe virus (strain T.RVL.II 573), cDNA to viral RNA.
  ORGANISM  Tacaribe virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Arenaviridae.
REFERENCE   1  (bases 1 to 449)
  AUTHORS   Iapalucci,S., Lopez,N., Rey,O., Zakin,M.M., Cohen,G.N. and
            Franze-Fernandez,M.T.
  TITLE     The 5' region of Tacaribe virus L RNA encodes a protein with a
            potential metal binding domain
  JOURNAL   Virology 173, 357-361 (1989)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 420 to 7102)
  AUTHORS   Iapalucci,S., Lopez,R., Rey,O., Lopez,N., Franze-Fernandez,M.T.,
            Cohen,G.N., Lucero,M., Ochoa,A. and Zakin,M.M.
  TITLE     Tacaribe virus L gene encodes a protein of 2210 amino acid residues
  JOURNAL   Virology 170, 40-47 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by S.Iapalucci, 08-AUG-1989.
FEATURES       from  to/span     description
    pept         70      357     P-11 protein
    pept       7072      440 (c) L protein
BASE COUNT     2071 a   1653 c   1240 g   2138 t
ORIGIN      L segment.
        1 cgcaccgggg atcctaggcg ttacgtgcac tcttctttat tgggctggat tacacaaaac
       61 ttttcaagca tgggcaactg caatcgaact cagaaaccca gttcgtcttc taacaacctc
      121 gagaagccac cacaagctgc agaatttaga agaacggcag agcccagtct atatgggagg
      181 tacaattgca aatgttgttg gtttgctgac aaaaacttaa tcacctgtag tgaccattat
      241 ctttgtctaa gatgtcacca aatcatgctg agaaactcag agctgtgcaa catctgctgg
      301 aagcccctgc caacatccat aagagtcccg ctagaggcaa gtgctccgga tctgtaacca
      361 acgacaaaca aggtcgctga ccccccgggg ggcccccaag ggggcccccg tgggggtggc
      421 ccccggggga gtttgggttc tactcaatgt cctccactgt gtttctccta gctagaggtc
      481 tgcacataat gcctctcaac ctcaaggacc cctggttgtc agatacaaca aggtctttca
      541 gggacttact gtatgcaacc ttgaagttga caaattcaac ccagtccccg atcaactcca
      601 gggacctgat caagccagat ctatgtaaat catagacaag gacaaagtcc tcttgttgga
      661 atagcatcac ctgtgcagtt gattttaaat agggccagag ttggttcaac aaagcattca
      721 catcttcagg gtcgtccagt gcgccaattg ccttcactat tatatcacca gtcaactcca
      781 ctctgtaagt agaaactttc tcagttccca taaaaaagtg gccgtctctt aacaccaatg
      841 gctccccttg atcgactggc ccattgagtg tataggcata cttgtcactc aattcagata
      901 aaaatccacc aaagctgaat cttacatccg tcagaattag gttgtcattt gtctccacaa
      961 gatctgcatc catcatcagg gaagccatat taaaagtgcc ttgacctgtg aatatcggaa
     1021 gtcctgcaat catggaaact ccatccaaca agaggtgact tccccccgac cctgacactg
     1081 actgcgtcac atccaccacc tgtaactcta cgcttgcttc tttaagatca atgactttag
     1141 aaacacatgt aacactaaat gagtaagtca ttttcctctt aaacctgtac ttagttgatg
     1201 ttctagaatg ctccacctgg atcaaaaacc tcaagtcaca atcctcatgg tcatcaaagt
     1261 tcccaagaag atccagcaag tcattgatgt gtctccctgg agccagtggt tgctctccaa
     1321 ttgtgatttt gtcaatgtac aatctaacat cttcaagatt tgaaatcaca taacccattg
     1381 ctatgtcttc aataaacatc gctctactga tcttctgatc cctaactaag aaggtgaaat
     1441 ctgttaactc acccaattca atctgaccat catcttccag ctccaatact tcattaaacc
     1501 aaaaaaacga ttttaggcaa gaagttgaca tcactaaagg gttcaccatc gatgatagga
     1561 tgatttgaac catgaagttg taacaaacct gttgaggact cagtccaact tccttatcta
     1621 cgtaatcaga ctgcctaacg tgtgccatgg cacagtcgct ctgtttgact atgtagacac
     1681 tattgtctct ggctttcagt aggtggctga cgattcctag attctcattc atcaaatcca
     1741 atgctatgca aacatctaag aacttcagct tcctcccatc cagtgttctg ttcacttgca
     1801 aatcgctcat gaagggagac aagtgctcat caaataattt cgggtaattt ttccgaatgg
     1861 aatacaatac atgattcata cccagttgat cttctaattg acgaacaatt ctaggcttaa
     1921 cagcccagaa caaattcctg ttacttaaca ccactggaat ctgtggctct ttcacagatg
     1981 agaataccca ctcccctatt tcgcaggcat tggtcaaaac taaagaaaag taatcccaaa
     2041 gaataggttt ggaaaatttt tcaatttttt ccagtgaggg tctgcaataa atcacacctc
     2101 ttttcatgtt gcacacctca cacagacagg gcttcacctt tgtataaata tctttaatgt
     2161 aaaggaaacc gccctcacca tctctgacac atttgctgcc tatactttta caaaaaccta
     2221 taaaaccaga tgccactgaa gactggaatg cagatttgtt tatggactcc gctaagatct
     2281 ttttaacacc ctttgtgaag ttccttgaca acctggactg caatgtctta atcagtgttg
     2341 gcacctcttc tctctcaaga acccttctgc tcgtcattaa cttggtcctc aatactagtc
     2401 ttaggtcacc aaaacttgac aaattcaccc atctaaagcc taaggcacgc ataatatctt
     2461 gctctgagca acttgaataa cgcaagaacc ccacaagggc gtcatcacca tctctaccaa
     2521 taagactgat caggttttcc tcaaaaactt tgcctctttt tatatcatta aaaacccttt
     2581 tagcacagct tctaatgaag cctgtcatct catcatcaga aaatattgat tcaatcttcc
     2641 tctgcaatcg ataacccctc gtcccatcca cccaatcttt tacatcttgt ttctcaacag
     2701 caccaaaagg ggtttccttg aagccggagt atttgatcag ctggttcacc ctttgagaga
     2761 tctttgagac aatttggaca ctaacaccat ttgcgacaca ctggtcacag attgtgtcta
     2821 ttgtttctga caactgtgtt ggtgttttgc atttcacatt gtgcaacgct gcagcaacaa
     2881 attgagttaa tagcggggtt tcttccccca tcacaaaaaa tcttgatttg aattcagcta
     2941 caaaggtgcc cgctacactc ttggggctaa caaatttgtt caattttgag gagaggtact
     3001 catgaaaaca caacatttcc aaccactcat cattaacttg attatcatct gaagcacaag
     3061 gcaatttgat aagtgtgacc tggtcatcgc tagaagtgta ggaagttgga atgacatcat
     3121 atagtaaatc tagcacataa tttaagaact gttctgttat caatccatac aaatcagaca
     3181 tgttatgcag gatgccttgg cccatatcta gaacagacat tatgtgactt ggtatctctc
     3241 cttggactgg catgaattgg tggaagaatt cctctcccac gcttgagcta ccacaaccca
     3301 tcaaccctaa gcttctcttt agattgccag tacagtaagc ctcagcaaca tttactggaa
     3361 cttctataac tttatggagg tgccatctca gcacatttaa gactggttca agattaataa
     3421 tgttcctttc ttttggggtc ctaagttcca aaaattggag gaatgtcaag aaaagcgcag
     3481 gactcatggt tggtccccat ttagagtgat caagagaaca tgataggtca ccattgttca
     3541 ctgctaactt catgtcacag atggctctat caaattcttt ttctgaactg agacatgtat
     3601 atctcatgga actgccaact gcctctgcaa agtcttcaac taaccttgtt atcagcttag
     3661 tgttaagatc cccaacgtaa agttccctat tggaacccac ttgctccttg taacttaagc
     3721 caaactttaa gttaccatta tttgaactga tactggtaaa ctcagtagga gaatcatcag
     3781 agtaaaagca caggttcctt agtgcagcat tggtaaaata actcctatcc aacttatctg
     3841 ctatagcttc agaattgctc tctctcatgc taagtcttac ttcatctttc actctcagtg
     3901 cttcattagt gaaaccaagc cttgatctac tcctatgttc ataacttcct agtttttgat
     3961 cataatgacc ttgtatcaat aagtatttga agcactcaaa atactcttca tcaaaatatg
     4021 cagaagtagt gagatttttc aaaagcaatc ctagaggaca aacttctaaa ggtctctcca
     4081 agaaatactc tggtttgctt gaggagtaat agaccacatc acatagttct ttatacaagt
     4141 ctgaagcgag tacatcagga tcaaagtctt tcacctcatg aagtgagacc tctgtcctaa
     4201 tctctctcat aatcccaaaa ggaccccaaa gttcctcaag tgtacctgtg ctgcttttaa
     4261 ccctgccttc cttttgattc ggacttagat tgacatctcc agatttggag tccttcattt
     4321 tgaccattgc taattgcacg gaatttttta tatcattgaa agcttctgct tgctcttctg
     4381 tcaacccttc aaacatcatt gacagctctt cacaatctct ctgggtttta gaaccaatac
     4441 taacaagccc ggttaagttt tttagtatca agtaatctaa tgtacttgga tcaatgttgt
     4501 atcttccctt tgttttgaac ctttccgcca tggaagaaac acatgaagaa atcatcttct
     4561 ctctatcata taccgaaaga acattcccaa gttcatctag ttttggtact actacgctct
     4621 tattgctaga gaggtctaaa gcagtgctag taatgtttgg cttgaatggg tcgttcttaa
     4681 gctcacctgt gacttttagc tttcctttat taaataatga gatgcaataa ctcaacagtt
     4741 cattgctgac ccctggtaat ttggtgtcag aacagtcttt gagggtcttt gaataaaatt
     4801 tatccagccc gtccaacatc attccttctt gagcttctgt cagttgacca tttaaagatg
     4861 ggttgactat aacacagtta aaatcaattt tgggttcaat gaatttctca aaacatttga
     4921 tttggtctgt taatctgtca ggtgtttcct ttgttattaa atgacaaagg taacttacat
     4981 ttaagagata cttgaacctc ctagcaaaat aggggtccga atcaacacct tcaccaagaa
     5041 ggcattggaa aaggtccact gccagcctct gaactatgac ttcagagcca gacatacact
     5101 ccacttttaa tttggacagt agatccacgt ggtggaattg atttgagtat gccatcaaaa
     5161 aataccttag cccttgcaga aatacctgat ttcttttgga cggggcgcat aaaatgctta
     5221 acaataaaag tcttagtttc tctttggata tcaccataag atcactgttg aagtccagcc
     5281 aagttgtcat ctcatcacac atctcaacca acacctcttg tgaaaagatt ggcagaaaaa
     5341 atctcttcgg gtcacagtaa aaagagccct tctcgacaag aacaccatct tcacatgtgt
     5401 aaattgagta gcatcctgaa cgctcacctg tcttttgata aaaaagaata acaatcagtc
     5461 catcagttaa aaagaatctc tgttggtagc actcccttag tctcacattg ccaaaatacc
     5521 ttgagtattc tttttcatta ataagcaacc ttgaagaaaa cgaggttttc atagagttta
     5581 ttaaggaaag agacactgag gagagtattt tgaaagcctc aacctgttca gcatctgaca
     5641 tcatgtgaaa tgtcttatgt ttacattcag tgttaaagtt ccccccagca gagtatctca
     5701 taatgggtct atcagaagat attgtctgag ctatagatat tagcttggat aaatcaggat
     5761 atcttgccca caccgatatt tgtgaccaga tatcttgttg agacctcctc aagattttag
     5821 agcaattctc cacccactta acattcgtct ctattttctt tttaattaat ttcaccgcct
     5881 ccaaaaaatc caaggccctg tcattgacta gtgtgaccat ttgacccagg atgagaccag
     5941 accttctcaa agagtccagt gcttcgggtt tggctgtaaa taggtcatct aaaaagttta
     6001 ccattatgag atctagaaag agcaacgtat ttcttcgggt gttaaaaact tttaatccct
     6061 taattttatt gcaaatcgac aacacgccta aataggcctt ctcatggggg cagacaacat
     6121 actgatctaa ttcagagttt gatttctccc tgtcctgcct ttttagtgct gtgaaagtca
     6181 aactatagat tatgtccatg aatttgttga cacaaaaggg acaatctgag gtggtgttat
     6241 cctcatgttt ccctaaccta attaaattgt tcagaagctc tgatctatta gttatcctaa
     6301 acttccccaa gtttttgtca aaaacttcct ttttgtacca gagtcttaat ttcattaatt
     6361 cttccctaat cacagattct cgaacatcgc ctttgactcc aacaaataat gatttggcca
     6421 gtctatcatc atactccaaa agatttcttt tgagacactc tatattctca aatcctaaac
     6481 tttgattacg aacgtttgaa agactgtgta tcagacgttg atactcttga tcctcaatat
     6541 caatctctgc ggacgatcgg gatttccgca aaatttcaat taacaaccat ctgatcctct
     6601 ccaccaccca ttcaggagtg aacctattag aataattagt tctaccgtca acaacaggaa
     6661 caagtgtgat gcctgacctc agaaggtctt gtttgagatt tagtagtttt gtcatgtcac
     6721 tcttccattt tctctcaaat tcttctggat tgaccctcac aaatgtttcc aatagtatta
     6781 atgtttttcc agtgagtttg aatccatcag gtatcacttt agggagagtg ggacatggga
     6841 ttccatgatc actgaggatc acatcaacat atttttggca ggtgttcaag tggcatgcat
     6901 gagattcaca ggactctaac tccacaaggt tggagagaag tttaaaacct tcttgaagca
     6961 gtgaccctga atgacagtgt gacaggaagg catctttctg gtgggcaaac tcatgcctat
     7021 ttggaatgtg ttttctaacc aagtctttga gttcagacac agtttcatcc atggtcaagt
     7081 gccgcctagg atccactgtg cg
//
LOCUS       DRHBCDA       922 bp ss-mRNA            INV       25-MAY-1990
DEFINITION  D.heteroneura bicoid (bcd) mRNA, 3' end.
ACCESSION   M32125
KEYWORDS    bicoid protein.
SOURCE      D.heteroneura, cDNA to mRNA.
  ORGANISM  Drosophila heteroneura
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 922)
  AUTHORS   Macdonald,P.M.
  TITLE     Bicoid mRNA localization signal: Phylogenetic conservation of
            functional and RNA secondary structure
  JOURNAL   Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            P.M.Macdonald, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1       36     bicoid protein (bcd) (AA at 1)
BASE COUNT      309 a    157 c    140 g    316 t
ORIGIN      
        1 ggtggcaaca cgcagttcgc atactgtttc aattaggctg ctacattaat aataattgta
       61 aagaaaaaaa aaaaaaaaaa acaatttcaa tgtttgtgga gtttatagct ttaaccgaca
      121 cgttgtttcc tgattgtaca aagatctaat aattgtagat atccatatgc atagtattta
      181 agccaattta acattgttta aaccaagtat gtaacgcata gttgtagctg ccgagttttt
      241 taatgcattt tctactgaca taaactgaag tgccattatt tggaaacttt atttgtaaag
      301 cgttctttga tctcaacgct gtctggctgg acatttgcca aagcccatga atgcccaacc
      361 agacactgtt gagacgaata tgggctttaa ttgaacgctt tacagaagaa gtttataaat
      421 acacataaaa tgcatttctt tcttataact caggccaact ggcatagcaa gtaaaactgt
      481 actgatctaa atgaaaaatg tttctcttcc tgataatcgc acgtccttct atggacatta
      541 tatgccatta tcagtcgaag atcgaacgtt taaccaataa tgttcttcag aatattttta
      601 agctttgtta tgccacagtt tgcatgttgc aattaaacat tcttcacgtg agaattgttt
      661 acaaattgtc tctaaacaca tgtgaaccac aggcctcact tatctaaacg aaatctgtac
      721 ttactttcag aatgtaagat agttttatca attgaaaatg cattccaaat atgtatctca
      781 gttacccttg aatggtgcac catttaaata ttattctaaa cgtaacttat ctatatatct
      841 gtatgaaata gtatatttgg ggatatattt aaaatattca gagtcctgtt ttgtatacta
      901 ttttaataaa tcttacaaac ta
//
LOCUS       DRQBCDA       987 bp ss-mRNA            INV       25-MAY-1990
DEFINITION  D.sechellia bicoid (bcd) mRNA, 3' end.
ACCESSION   M32124
KEYWORDS    bicoid protein.
SOURCE      D.sechellia, cDNA to mRNA.
  ORGANISM  Drosophila sechellia
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 987)
  AUTHORS   Macdonald,P.M.
  TITLE     Bicoid mRNA localization signal: Phylogenetic conservation of
            functional and RNA secondary structure
  JOURNAL   Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            P.M.Macdonald, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1       90     bicoid protein (bcd) (AA at 1)
BASE COUNT      301 a    204 c    169 g    313 t
ORIGIN      
        1 tatcaatgca cgatggatac gataatgcaa gcgtataatc cccatcggaa cgccgcgggc
       61 aactcgcagt ttgcctactg cttcaattag cctggacgag aggcgtgtta gagagtttca
      121 ttagctttag gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat
      181 ctacgcgtag aaagttaggt ctagtcctaa gatccgtgta aatggttccc agggaagttt
      241 tatgtactag cctagtcagc aggcgacacg gattccagtg catatcttag tgatactcca
      301 gttaactcca tactttccct gcaatacgct attcgcctca gatgtatctg ggtggttgct
      361 ccattaaagc ccaggaatat ccagccagtt acatttgagg ccatttgggc ttaagcgtat
      421 tccatggaaa gttttcgccc cacatttcgg aaattaaatt ccgagccatc aagaaaatct
      481 tctctgttac aatttgacat agctaaaaac tgtactaatc aaaatgaaaa atgtttctct
      541 tgggcgtaat ctcatacaat gattaccctt aaagatcgaa catttaaaca ataatatttg
      601 atatgatatt ttcaatttct atgctatgcc aaagtgtctg acataatcaa acatttgcac
      661 attctttgac caagaatagt cagcaaattg tattttcaat caatgcagac catatgttcc
      721 agtttcggag attttttgct gccaaacgga atacttataa aaacccaaat tctatttaca
      781 tcactaagaa gagcattgca atctgtttag cctcaagttt aattttaaaa tggtgcacct
      841 ttgacgctgt ctctttaagc tttgtatttt taattacgaa aatatataag aactactcca
      901 ctcgggtaaa ttgtgactaa ctacacaact taacaaattt cggaattata cttagatact
      961 tatcccatat ttccgtccct ttctaga
//
LOCUS       DRSBCDA       978 bp ss-mRNA            INV       25-MAY-1990
DEFINITION  D.simulans bicoid (bcd) mRNA, 3' end.
ACCESSION   M32123
KEYWORDS    bicoid protein.
SOURCE      D.simulans, cDNA to mRNA.
  ORGANISM  Drosophila simulans
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 978)
  AUTHORS   Macdonald,P.M.
  TITLE     Bicoid mRNA localization signal: Phylogenetic conservation of
            functional and RNA secondary structure
  JOURNAL   Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            P.M.Macdonald, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1       90     bicoid protein (bcd) (AA at 1)
BASE COUNT      296 a    203 c    171 g    308 t
ORIGIN      
        1 tatcaatgca cgatggatac gataatgcaa gcgtataatc cccatcggaa ccgcgcgggc
       61 aactcgcagt ttgcctactg cttcaattag cctggacgag aggcgtgtta gagagtttca
      121 ttagctttag gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat
      181 ctacgcgtag aaagttaggt ctagtcctaa gatctgtgta aatggttccc agggaagttt
      241 tatgtactag cctagtcagc aggcgacacg gattccagtg catatcttag tgatactcca
      301 gttaactcca tactttccct gcaatacgct attcgcctca gatgtatttg ggtggttgct
      361 ccactaaagc ccaggaatat ccagccagtt acatttgagg ccatttgggc ttaagcgtat
      421 tccatggaaa gttttctccc cacatttcgg aaattaaatt ccgagccagc aagaaaatct
      481 tctctgttac aatttgacat ggctaaaaac tgtactaatc aaaatgaaaa atgtttctct
      541 tgggcgtaat ctcatacaat gattaccctt aaagatcgaa catttaaaca ataatatttg
      601 atatgatatt ttcaatttct atgctatgcc aaagtgtctg acataatcaa acatttgcac
      661 attctttgac caagaatagt cagcaaattg tattttcaat caatgcagac catatgttcc
      721 agtttcggag attttttgct gccaaacgga atacttataa aaacccacat tctatttaca
      781 tcactaagaa gagcattgca atctgtttag cctcaagttt aattttaaaa tggtgcacct
      841 ttgacgttgt ctctttaagc tttgtatttt taattacgaa aatatataag aactactcca
      901 ctcgggtaaa ttgtgactaa ctacacaact taacaaattc cgaaggatac ttagcccata
      961 tttccgtccc tttctaga
//
LOCUS       DRTBCDA       963 bp ss-mRNA            INV       25-MAY-1990
DEFINITION  D.teissieri bicoid (bcd) mRNA, 3' end.
ACCESSION   M32121
KEYWORDS    bicoid protein.
SOURCE      D.teissieri, cDNA to mRNA.
  ORGANISM  Drosophila teissieri
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 963)
  AUTHORS   Macdonald,P.M.
  TITLE     Bicoid mRNA localization signal: Phylogenetic conservation of
            functional and RNA secondary structure
  JOURNAL   Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            P.M.Macdonald, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1       90     bicoid protein (bcd) (AA at 1)
BASE COUNT      281 a    195 c    168 g    319 t
ORIGIN      
        1 tatcaatgta cgatggatac gataatgcaa gcgtataatc cccatcgtaa cgccgcgggc
       61 aactcgcagt ttgcctactg cttcaattag cctggacgaa cggcgtatta gagagtttta
      121 ttagccttat gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat
      181 ctacgcgtag aaagttaggt ctagtcctaa tggttcccag ggaagtttta tgtagtagac
      241 tagtcagcag gctacacgga ttccagtgca cgtcataggg ataccctagt ttactcgata
      301 ctttccctgt aatacgcttt tcgcctcaat tgtatctggg tggctgctcc ataaagcccg
      361 ggaatatcca accagttaca tttgaagcca tttgggcttt agcgtattcc acagaaagtt
      421 atcgtcccac aacttatatt ccgagccagc aagaaaatct tctctgtaac tatttgacat
      481 agctaaaaac tgtactaatc aaaatgaaaa atgtttctct tgggcgtaat cttatacaat
      541 gattaccctc aaagatcgaa catttaagca ataatatttg atatgatatt ttcaatttct
      601 atgctatgcc aaagcgtctg acataatcaa acatttgcac attcttcgac caagaatagt
      661 cagcaaattg tattttcaat caatgcagat catgtgttcc agtttcggag attttttgct
      721 gtcaacggaa taactatcct acccacactg tctatttcta tcacttagaa gagcattgaa
      781 atctgttttg tttagcctca actttaattt taaaatggtg cacctttgat gttgtctttt
      841 taagatttgt atttttaatt atgaaaatat ataagaacta ctccactcgg gtaatttgtg
      901 actaacttag tattttaaca aatttcaagg gacgcgtagg cattctttcc ctctctttct
      961 aga
//
LOCUS       DRVBCDA      1023 bp ss-mRNA            INV       25-MAY-1990
DEFINITION  D.virilis bicoid (bcd) mRNA, 3' end.
ACCESSION   M32122
KEYWORDS    bicoid protein.
SOURCE      D.virilis, cDNA to mRNA.
  ORGANISM  Drosophila virilis
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1023)
  AUTHORS   Macdonald,P.M.
  TITLE     Bicoid mRNA localization signal: Phylogenetic conservation of
            functional and RNA secondary structure
  JOURNAL   Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            P.M.Macdonald, 14-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1       90     bicoid protein (bcd) (AA at 1)
BASE COUNT      326 a    176 c    160 g    361 t
ORIGIN      
        1 tatcaatgta cgatggatac gataatgcaa gcgtataatc cgcatcgtaa tgccggtggc
       61 aacacgcagt tcgcctactg ttttaattag atcgggacac caggcgcccc atataatata
      121 ataattattt taagtacaca aattttaatg tttgtggtgt ttctttagct ttaagtgaca
      181 cgttagttcc tgattgtaca aagatttagt gattgtagat atctatgcgt agtattttaa
      241 gcaaattatc aatattttaa atgaattttg taaagcttaa ttaagtcata gctgtaattt
      301 tccaagcatt ttcaatgtat ttccataaga aacttccttg gtaaaacctt cattggtctc
      361 aactctgtct ggctgggcat ttgccaaagc ccatgagtgc ccatccagac atacttgaga
      421 caataatggg cttaagctga aggttttacc gaagaagttt tcgtatgaaa atctcatgta
      481 agattcattt ctataatttt ggcaccttgg catagcttac aaaaaaactg cactgatcta
      541 aatggaaaat gtttctcttc ctgataatcg cacgcctttc tttttggcct gccattatca
      601 gccgaagatc gaacatttaa acaataatac tcgtcagaat atttttaatt cttgctatgc
      661 cccggttttc atgttacaat tgaacattct tcacgtgaga attgttcaaa tattgtacca
      721 caatacatga ataccacatg cctaagttat cgaaacgaat actatatatt actggaggat
      781 tttcgttttt ttcttataat tgaaaatggg ttgaaaattt accagttacc ttacgaatcg
      841 tgcacctttc aactgatgtt ttaaagctaa gttatcttta atatagtgca tttgggaata
      901 tatattaaat gttcaataca atatttccat aaatttatat attaataaat cttacaaact
      961 aaccaatttt aagaagtgtt ttgcttttaa cgttttacga taaacaaagt aaacgctttc
     1021 gaa
//
LOCUS       M33420         71 bp ds-DNA             BAD       25-MAY-1990
DEFINITION  Figure 3. Nucleotide sequence of junctions in the multicloning
            site and promoter probe plasmids.
ACCESSION   M33420
KEYWORDS    .
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 71)
  AUTHORS   Joseph-Liauzun,E., Fellay,R. and Chandler,M.
  TITLE     Transposable elements for efficient manipulation of a wide range of
            gram-negative bacteria: Promoter probes and vectors for foreign
            genes
  JOURNAL   Gene 85, 83-89 (1989)
  STANDARD  simple staff_entry
BASE COUNT       20 a     15 c     13 g     23 t
ORIGIN      
        1 aattcggtaa tgactccaac ttattgatag agtactcata tgtctagact cgaggatccc
       61 gtcgttttac a
//
LOCUS       MUSTCM1AA     805 bp ds-DNA             ROD       25-MAY-1990
DEFINITION  Mouse T-cell receptor germline alpha-chain V-region, 5' end.
ACCESSION   M33586
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; germline;
            variable region.
SOURCE      Mouse (strain C.B20) lymphocyte DNA, clone p2111.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 805)
  AUTHORS   Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
  TITLE     On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
            receptor secondary rearrangements in a murine T cell lymphoma
  JOURNAL   J. Immunol. 144, 1094-1103 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        188  >   526     T-cell receptor alpha-chain V-region
    sigp        188      244     T-cell receptor alpha-chain V-region signal
                                 peptide
    matp        245      526     T-cell receptor alpha-chain V-region
BASE COUNT      169 a    218 c    203 g    215 t
ORIGIN      
        1 caagaatcag atattcaaaa ccttccaggc tcgtgtttgc agcgtcagtt tttatgtgaa
       61 cgggagactt ctccagcaga ctcttcccta cacctgcttc aagctcttgc tttcctcagc
      121 ctctttcctg cctgtactgt tccagagttc ctccacgaca gagctgcagc cttcccaagg
      181 ctcagccatg ctcctggcgc tcctctcagt cctggggata cactttctcc tgagagatgc
      241 ccaagctcag tcagtgacac agcccgatgc tcgcgtcact gtctctcaag gagcctctct
      301 gcagctgaga tgcaagtatt cctactctgg gacaccttat ctgttctggt atgtccagta
      361 cccgcggcag gggctgctgc tgctcctcaa gtactattcc ggagacccag tggttcaagg
      421 agtgaacggc tttgaggctg agttcagcaa gagtaactct tccttccacc cgcggaaagc
      481 ctccgtgcac tggagcgact cggctgtgta cttctgtgct gtgagcgcac agtgtgtggg
      541 gctgcagggg gagctgaaca caaacaccga tggctgtggt gctcagactc aagatctctc
      601 ctgtggtctc tgtggcatct ctttggactt ggagagcaaa gaaggaaagt ttcgcagtct
      661 cagctgtctt tctgcaactc atttccaact aaaacatcca ctatatgcaa tgctcaattg
      721 tagttctgtg gaatgaggag actgtgatgt tatcctgagt gtatggctga gtgaacattg
      781 cttgggcagt gactagacta gggta
//
LOCUS       MUSTCM1AB     265 bp ss-mRNA            ROD       25-MAY-1990
DEFINITION  Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
            partial cds.
ACCESSION   M33587
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; constant region;
            joining exon; variable region.
SOURCE      Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 265)
  AUTHORS   Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
  TITLE     On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
            receptor secondary rearrangements in a murine T cell lymphoma
  JOURNAL   J. Immunol. 144, 1094-1103 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   265     T-cell receptor alpha-chain V-region (AA at 2)
BASE COUNT       54 a     78 c     67 g     64 t      2 others
ORIGIN      
        1 cctctcctgg tacatccagc accatggccg tggcctccag tttctcctca agtactattc
       61 gggaaaccca gtggttcaag gagtgaacgg cttcgaggct gagttcagca agagcgactc
      121 ttccttccac cttcggaaag cctccgtgca ctggagcgac tcggctgtgt acttctgtgc
      181 tgcgaccngn ttatctggta gcttcaataa gttgaccttt ggagcaggga ccagactggc
      241 tgtgtgccca tacatccaga accca
//
LOCUS       MUSTCM1AC     163 bp ss-mRNA            ROD       25-MAY-1990
DEFINITION  Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
            partial cds.
ACCESSION   M33588
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; constant region;
            joining exon; variable region.
SOURCE      Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-6.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 163)
  AUTHORS   Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
  TITLE     On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
            receptor secondary rearrangements in a murine T cell lymphoma
  JOURNAL   J. Immunol. 144, 1094-1103 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   163     T-cell receptor alpha-chain V-region (AA at 3)
BASE COUNT       42 a     42 c     44 g     35 t
ORIGIN      
        1 tccacttgca gaaagcctca gtgcaagagt cagactcggc tgtgtactac tgtgctctga
       61 ggccggggac acaggttgtg gggcagctca ctttcgggag cggtacaaga ctccaagttt
      121 atgcaaacat ccagaaccca gaacctgctg tgtaccagtt aaa
//
LOCUS       MUSTCM1AD     287 bp ss-mRNA            ROD       25-MAY-1990
DEFINITION  Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
            partial cds.
ACCESSION   M33589
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; constant region;
            joining exon; variable region.
SOURCE      Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 287)
  AUTHORS   Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
  TITLE     On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
            receptor secondary rearrangements in a murine T cell lymphoma
  JOURNAL   J. Immunol. 144, 1094-1103 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   287     T-cell receptor alpha-chain V-region (AA at 3)
BASE COUNT       69 a     88 c     67 g     63 t
ORIGIN      
        1 ctattgcctt tttctggtat gtgcaatatc tcaacgaagc ccctcaggta ctcctgcgga
       61 gctccacaga caacaagagg accgagcacc aagggttcca cgccactctc cataagagca
      121 gcagctcctt ccatctgcag aagtcctcag cgcagctgtc agactctgcc ctgtactact
      181 gtgctctgat ccggggacac aggttgtggg gcagctcact ttcgggagcg gtacaagact
      241 ccaagtttat gcaaacatcc agaacccaga acctgctgtg taccagt
//
LOCUS       MUSTCM1AE     283 bp ss-mRNA            ROD       25-MAY-1990
DEFINITION  Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
            partial cds.
ACCESSION   M33590
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; constant region;
            joining exon; variable region.
SOURCE      Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-6.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 283)
  AUTHORS   Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
  TITLE     On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
            receptor secondary rearrangements in a murine T cell lymphoma
  JOURNAL   J. Immunol. 144, 1094-1103 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   283     T-cell receptor alpha-chain V-region (AA at 1)
BASE COUNT       73 a     88 c     62 g     60 t
ORIGIN      
        1 attgcctttt tctggtatgt gcaatatctc aacgaagccc ctcaggtact cctgcggagc
       61 tccacagaca acaagaggac cgagcaccaa gggttccacg ccactctcca taagagcagc
      121 agctccttcc atctgcagaa gtcctcagcg cagctgtcag actctgccct gtactactgt
      181 gctctgagaa cctcaggagg aaactacaaa cctacgtttg ggaaagggac cagcctcgtg
      241 gttcatccat acatccagaa cccagaacct gctgtgtacc agt
//
LOCUS       MUSTCM1AF     246 bp ss-mRNA            ROD       25-MAY-1990
DEFINITION  Mouse T-cell receptor rearranged mRNA alpha-chain V-region,
            partial cds.
ACCESSION   M33591
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; constant region;
            joining exon; variable region.
SOURCE      Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-7.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 246)
  AUTHORS   Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B.
  TITLE     On the mechanism of non-allelically excluded V-alpha-J-alpha T cell
            receptor secondary rearrangements in a murine T cell lymphoma
  JOURNAL   J. Immunol. 144, 1094-1103 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   246     T-cell receptor alpha-chain V-region (AA at 3)
BASE COUNT       69 a     80 c     53 g     44 t
ORIGIN      
        1 aatatctcaa cgaagcccct caggtactcc tgcggagctc cacagacaac aagaggaccg
       61 agcaccaagg gttccacgcc actctccata agagcagcag ctccttccat ctgcagaagt
      121 cctcagcgca gctgtcagac tctgccctgt actactgtgc tctgagaacc tcaggaggaa
      181 actacaaacc tacgtttggg aaagggacca gcctcgtggt tcatccatac atccagaacc
      241 cagaac
//
LOCUS       SYNCATAA       66 bp ds-DNA             SYN       25-MAY-1990
DEFINITION  Synthetic chloramphenicol acetyl-transferase (cat) gene, partial
            cds.
ACCESSION   M33419
KEYWORDS    chloramphenicol acetyl-transferase.
SOURCE      Synthetic DNA, clones pEJL4[30,49].
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Joseph-Liauzun,E., Fellay,R. and Chandler,M.
  TITLE     Transposable elements for efficient manipulation of a wide range of
            gram-negative bacteria: Promoter probes and vectors for foreign
            genes
  JOURNAL   Gene 85, 83-89 (1989)
  STANDARD  simple staff_entry
BASE COUNT       16 a     12 c     18 g     20 t
ORIGIN      
        1 aattcggtga tgctgccaac ttactgattt agtggatccg gtgattgatt gagcaagctt
       61 cgacga
//
LOCUS       SYNIS1AA       65 bp ss-mRNA            SYN       25-MAY-1990
DEFINITION  Synthetic insertion element IS1.
ACCESSION   M33418
KEYWORDS    .
SOURCE      Synthetic DNA.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 65)
  AUTHORS   Joseph-Liauzun,E., Fellay,R. and Chandler,M.
  TITLE     Transposable elements for efficient manipulation of a wide range of
            gram-negative bacteria: Promoter probes and vectors for foreign
            genes
  JOURNAL   Gene 85, 83-89 (1989)
  STANDARD  simple staff_entry
BASE COUNT       21 a     12 c     11 g     21 t
ORIGIN      
        1 aattcggtaa tgactccaac ttattgatag agtactcata tgtctagact cgaggatcca
       61 aattt
//

GenBank-Updates@genbank.bio.net (05/26/90)

LOCUS       ECOSPEA      3236 bp ds-DNA             BCT       26-MAY-1990
DEFINITION  E.coli arginine decarboxylase (speA) gene, complete cds, agmatinase
            (speB) and methionine adenosyltransferase (metK) genes, 5' end.
ACCESSION   M31770
KEYWORDS    agmatinase; arginine decarboxylase; metK gene;
            methionine adenosyltransferase; speA gene; speB gene.
SOURCE      E.coli (strain K12) DNA, clones pLC2-5 and lambda-[1H10,23G45].
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3236)
  AUTHORS   Moore,R.C. and Boyle,S.M.
  TITLE     Nucleotide sequence and analysis of the speA gene encoding arginine
            decarboxylase of Escherichia coli
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            R.C.Moore, 02-FEB-1990.
FEATURES       from  to/span     description
    pept        192  <     1 (c) methionine adenosyltransferase (metK)
    pept        987     2963     arginine decarboxylase (speA) (EC 4.1.1.19)
    pept       3101  >  3236     agmatinase (speB)
    signal     3030     3067     rho-independent transcription terminator
    signal      811      816     -35 region
    signal      839      844     -10 region
    binding     977      980     ribosome binding site
    site       1878     1886     pyridoxal phosphate binding site (put.)
BASE COUNT      743 a    784 c    860 g    849 t
ORIGIN      62.9 min on K12 map.
        1 tacccaaggt cgctggtggt gatttcgccg ccaactaaaa ccaatgccgg tttttacgta
       61 ggtttcgcaa gcaacgcgtg ctttcggatc ctgttcgagg atcgcgtcta aaacggcatc
      121 agaaatttgg tcagcaattt tgtcaggatg cccttcagag acggactcgg acgtaaaaag
      181 gtgttttgcc atatttaata tcacctaaag agaatttggt tagctcaaac tgttgtgtgg
      241 attttctgtg gtagcggatc ctaccacgac tctgcaggtt aaaaacactg gcagtctgag
      301 tgttaatcgg tatggatgga ttaacatctg gatggctatt ttaggtcaat tcttcaccct
      361 atttccactt ttttttgaat cgtgtctcat tctgttaaaa acgtggctgg aaatttttcc
      421 tgacaatgcc ggcattctgc gtatttatct tttgcaattt tctgccattg tggggtataa
      481 aacgcggcgc gcggcttaaa taaaaagcac acgacgtttc tttcgtgttg ccacttccag
      541 ccgggttcaa atcagagttt tggcttgtgg gttcgtctta acaggcggcc gtggaggtga
      601 tacgaaataa tgaaccgttg tctgctgctt aacctgtctc accgttctgg tgaagattcg
      661 ttccccgcac tctgcatctc tgctttgcat acctgccgat gttataccca tctcggcgct
      721 tctcaggatt caagagctgg ttacagttac tgaggactga acaagggcgc tcttgtaaaa
      781 acaagagttt tctcgtggtt tcgccgaact ttcacactta cgttcggtta tgtgcttaat
      841 aatgttatga aaaagaaacc ggttgcgcag ttggagcgtc agcattcact gctggaaaat
      901 ccatgtgctt atgggttgtt atcgcagttc caggctgcga tagtcgttaa ctgttttaca
      961 cttaataaaa taatttgagg ttcgctatgt ctgacgacat gtctatgggt ttgccttcgt
     1021 cagcgggcga acacggtgta ctacgctcca tgcaggaggt tgcaatgagc tcccaggaag
     1081 ccagcaagat gctgcgtact tacaatattg cctggtgggg caataactac tatgacgtta
     1141 acgagctggg ccacattagc gtgtgcccgg acccggacgt cccggaagct cgcgtcgatc
     1201 tcgcgcagtt agtgaaaact cgtgaagcac agggccagcg tctgcctgca ctgttctgtt
     1261 tcccacagat cctgcagcac cgtttgcgtt ccattaacgc cgcgttcaaa cgtgcgaggg
     1321 aatcctacgg ctataacggc gattacttcc ttgtttatcc gatcaaagtt aaccagcacc
     1381 gccgcgtgat tgagtccctg attcattcgg gcgaaccgct gggtctggaa gccggttcca
     1441 aagccgagtt gatggcagta ctggcacatg ctggcatgac ccgtagcgtc atcgtctgca
     1501 acggttataa agaccgcgaa tatatccgcc tggcattaat tggcgagaag atggggcaca
     1561 aggtctatct ggtcattgag aagatgtcag aaatcgccat tgtgctggat gaagcagaac
     1621 gtctgaatgt cgttcctcgt ctgggcgtgc gtgcacgtct gcgttcgcag ggttcgggta
     1681 aatggcagtc ctccggcggg gaaaaatcga agttcggcct ggctgcgact caggtactgc
     1741 aactggttga aaccctgcgt gaagccgggc gtctcgacag cctgcaacta ctgcacttcc
     1801 acctcggttc gcagatggcg aatattcgcg atatcgcgac aggcgttcgt gaatccgcgc
     1861 gtttctatgt ggaactgcac aagctgggcg tcaatattca gtgcttcgac gtcggcggcg
     1921 gtctgggcgt ggattatgaa ggtactcgtt cgcagtccga ctgttcggtg aactacggcc
     1981 tcaatgaata cgccaacaac attatctggg cgattggcga tgcgtgtgaa gaaaacggtc
     2041 tgccgcatcc gacggtaatc accgaatcgg gtcgtgcggt gactgcgcat cacaccgtgc
     2101 tggtgtctaa tatcatcggc gtggaacgta acgaatacac ggtgccgacc gcgcctgcag
     2161 aagatgcgcc gcgcgcgctg caaagcatgt gggaaacctg gcaggagatg cacgaaccgg
     2221 gaactcgccg ttctctgcgt gaatggttac acgacagtca gatggatctg cacgacattc
     2281 atatcggcta ctcttccggc atctttagcc tgcaagaacg tgcatgggct gagcagcttt
     2341 atttgagcat gtgccatgaa gtgcaaaagc agctggatcc gcaaaaccgt gctcatcgtc
     2401 cgattatcga cgagctgcag gaacgtatgg cggacaaaat gtacgtcaac ttctcgctgt
     2461 tccagtcgat gccggacgca tgggggatcg accagttgtt cccggttctg ccgctggaag
     2521 ggctggatca agtgccggaa cgtcgcgctg tgctgctgga tattacctgt gactctgacg
     2581 gtgctatcga ccactatatt gatggtgacg gtattgccac gacaatgcca atgccggagt
     2641 acgatccaga gaatccgccg atgctcggtt tctttatggt cggcgcatat caggagatcc
     2701 tcggcaacat gcacaacctg ttcggtgata ccgaagcggt tgacgtgttc gtcttccctg
     2761 acggtagcgt agaagtagaa ctgtctgacg aaggcgatac cgtggcggac atgctgcaat
     2821 atgtacagct cgatccgaaa acgctgttaa cccagttccg cgatcaagtg aagaaaaccg
     2881 atcttgatgc tgaactgcaa caacagttcc ttgaagagtt cgaggcaggt ttgtacggtt
     2941 atacttatct tgaagatgag taagtcctgt gttacttgaa tccgcttaat ttagcggtga
     3001 taatccgcca caatttattg tgacaaatcc aacccttcct cgtcgggcct aacgacgcgg
     3061 aagggttttt ttatatcgac tttgtaatag gagtccatcc atgagcacct taggtcatca
     3121 atacgataac tcactggttt ccaatgcctt tggtttttta cgcctgccga tgaacttcca
     3181 gccgtatgac agcgatgcag actgggtgat tactggcgtg ccgttcgata tggcca
//
LOCUS       FIBEGASE     2310 bp ds-DNA             BCT       26-MAY-1990
DEFINITION  F.succinogenes endoglucanase 3 (cel3) gene, complete cds.
ACCESSION   M29047 M29681
KEYWORDS    cellobiosidase; endoglucanase.
SOURCE      F.succinogenes (strain S85, ATCC 19169) DNA.
  ORGANISM  Fibrobacter succinogenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 2310)
  AUTHORS   McGavin,M.J., Forsberg,C.W., Crosby,B., Bell,A.W., Dignard,D. and
            Thomas,D.Y.
  TITLE     Structure of the cel-3 gene from Fibrobacter succinogenes S85 and
            characteristics of the encoded gene product, endoglucanase 3
  JOURNAL   J. Bacteriol. 171, 5587-5595 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Dignard, 14-OCT-1989.
FEATURES       from  to/span     description
    pept        177     2153     endoglucanase 3 precursor
    sigp        177      245     endoglucanase 3 signal peptide A (alt.)
    sigp        177      251     endoglucanase 3 signal peptide A' (alt.)
    matp        246     2150     endoglucanase 3 A (alt.)
    matp        252     2150     endoglucanase 3 A' (alt.)
    site        167      172     ribosome binding site
    site       2172     2213     region of dyad symmetry
BASE COUNT      649 a    653 c    529 g    479 t
ORIGIN      
        1 ggatccgggt gcgtcagtta aataaaatat tttttaacgt ttttcgtaca gaaagtggac
       61 ttttagacca aaacacttat tacacttttt attccgatat atcattttac atagcataaa
      121 accgaccccc aaatatatct ttggtaaaaa agaaaaaatc accttaagag ggttttatgc
      181 aactcaagaa tttctatccc aaaatgagcg ttctcggtat cgcaaccgtg atggcactta
      241 ccgcctgtgg cgatgaaaat acccaggcac tgttcgccaa caatccggtt ccgggtgccg
      301 aaaatcaggt tccggtttct agcagcgaca tgagcccgac ctctagcgac gctgtcattg
      361 acccgacctc cagctctgcc gcagtggtcg acccgtctac gctccctgca gaaggtccta
      421 ttaccatgcc ggaaggtctc ggcactttgg tcgatgactt tgaagatggc gataacttga
      481 gcaaaatcgg tgattactgg tacacctaca acgataacga caacggtggt gcatccatca
      541 tcacgactcc gctaaacgaa gaagaaaaca tcatcccggg ccgcgtcaac aacggttcca
      601 actacgcctt gcaagtcaac tacacgcttg atagaggcga ttacgaatac gatccgtacg
      661 taggctgggg cgtgcaggtc gcaccggacg aagccaacgg acatttcggc ggccttacct
      721 actggtacaa gggcggcgca cacgaagtac atatcgaaat caccgacgtc gaagactacg
      781 acgtgcatct cgccaagttc ccggcatccc gcacatggaa gcaggctgtc gtccgcttca
      841 aggacctcgt tcaaggtggc tggggcaagg aaattccgtt cgacgccaag cacatcatgg
      901 caatcagctt ccaggccaag ggaaacaaga gcaagctcgt gaccgactcc ctcttcatcg
      961 acaacatcta cctgcaggat tcttccgaag ttgaaaagga ccagccggat atggaaatca
     1021 aggacccggt cattccggtc gttgaattta ccgaagctga aatcactgtg acgaacccgt
     1081 tgcaggaaaa ggccatgaag tacctcaaca agggtgtcaa ctttaccaac tggctcgaaa
     1141 acgcagatgg caagttcaag tcctttgaat tgggcgaaag cgacgtcaag attcttgccg
     1201 acaacggatt caagagcctc cgcttgccga ttgaccttga cctctatgcc acaaaccgtg
     1261 acgcattcat cgcaggcacc gacacagaac tcaagttcga tgacgacacc ttgttcctgg
     1321 ttctcgactc cttcgtagaa tggaccgcca agtacaacat gtctttcgtg attgactacc
     1381 atgaatatga caacagctac aacaccacca gcgctaagga ccccaactac atcaagatga
     1441 tggcagaaac gtggaagcat gttgcagccc actacgccga aagcccccgc gaagacttgt
     1501 tcttcgaact cttgaacgaa ccggacatga gcgatggtaa ggtcactgca gcaacatgga
     1561 ccaccgcagc ccaggccatg attgacgcca tccgcacggt tgataccaag cacaccatcc
     1621 tcttcggtga tgcccagtgg tactccatca cgctcctcgc caagcgcact ccgttcaccg
     1681 atgacaacat catctacgtg atccacacct acgaaccgtt cgccttcacg catcagggcg
     1741 gttcctggac ggactacgcc accatccacg atattccgtt cccctacgat ccggcaaagt
     1801 ggtctacggt ttctggcgac ttcggtgtca acaagagcac aaagtcctac gtgaaaacca
     1861 acatcaagaa ctactacaag accggcagca aggaagccat cttggaacag attctcaagg
     1921 ccaagaagtg ggccgccacc aacaacgtac cggtgatcat caacgaattc ggcgcattga
     1981 acctccgctc taccgctgaa tcccgcctca actacctcac ggccatgcgc gaaatctgcg
     2041 ataccctcca gattccttgg acgcactggg gctacaccgg caacttctcc gtgatcgaaa
     2101 acggcaagtt gattgaaggc ctcgacaagg cactcggcgt cggtagcaaa taagtctctc
     2161 cttaaaaccc cctcaaaaaa aggtcacgca gaaatgcgtg gcttttttag taggaagtag
     2221 acggtaggaa gttggaagtt agaagtagga agtaacagga atggcgcaat ggatacagtt
     2281 gacacagata cattacaaaa ccccggatcc
//
LOCUS       SFSSA        1747 bp ss-RNA             VRL       26-MAY-1990
DEFINITION  Sandfly fever sicilian virus S RNA encoding N protein, complete
            cds, and NS-s protein, complete cds.
ACCESSION   J04418
KEYWORDS    N protein; NS-s protein.
SOURCE      Sandfly fever Sicilian virus, cDNA to viral RNA.
  ORGANISM  Sandfly fever Sicilian virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Bunyaviridae; Uukuvirus.
REFERENCE   1  (bases 1 to 1747)
  AUTHORS   Marriott,A.C., Ward,V.K. and Nuttall,P.A.
  TITLE     The S RNA segment of Sandfly fever sicilian virus: Evidence for an
            ambisense genome
  JOURNAL   Virology 169, 341-345 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 693 to 695)
  AUTHORS   Marriott,A.C., Ward,V.K. and Nuttall,P.A.
  JOURNAL   Unpublished (1990) Oxford, UK
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A.C.Marriott 12-JAN-1989.
FEATURES       from  to/span     description
    pept         42      782     N protein
    pept       1727      924 (c) NS-s protien
    revision    693      695     ttc in [2]; tc in [1]
BASE COUNT      483 a    358 c    459 g    447 t
ORIGIN      Unreported.
        1 acacaaaggt ccctagttaa tctgagtgag ctaagtttga aatggacgag taccagaaaa
       61 ttgctgttga gtttggagag caggctattg atgagactgt gatccaggat tggctacaag
      121 catttgcgta tcaaggattt gatgccagaa caattataca caaccttgtg cagcttggag
      181 ggaagagttg ggaagaggat gccaagaaga tgatcatcct atccctaact cgtggcaaca
      241 agcccaagaa gatggttgag agaatgtctc cagagggagc aagagaagtt aagagcctgg
      301 ttgcaaagta taagatagta gagggcagac caggcaggaa tggaattacc ctgtcaaggg
      361 tgctgcagcc ctggctgggt ggacagtcca agctgtggaa gtggttgaaa acttcttacc
      421 agtcccaggg agcacaatgg accgcattgt gtggacaaac ataccccagg cagatgatgc
      481 atccaagctt tgccggtctc attgacccaa gcctcgacca ggaggatttt aatgcagtat
      541 tggatgctca caaacttttc ttgttcatgt tttccaaaac aatcaatgtc agcctccgcg
      601 gtgcgcagaa gagagacatt gaggaatcat tctctcaacc aatgcttgct gctatcaata
      661 gctcattcat tgacaacact cagaggaggg cattcttgac taagtttggg atcctaactt
      721 ctggagcaag agctacagca gttgtaaaga agattgcaga agtttacagg aaactagagt
      781 aagctgctgc tagtgtgggg tgggatgggg attctgggtt gggggttctg gggtggaggg
      841 tggctaggtg gggggtggca agggtggatt cggtttgggt tggggtcatg gggaggggtg
      901 ggtctggggc tgggcagcgg agatcaaaag tcagagtcag acgagctctc atcattttca
      961 tccacatgac tgtgtattgg ggtccaaaga gaattgccat actcggtgag gccagtagaa
     1021 gggtcacttg ctctatagga tctaatcact gttcttacat caagtgcctc cccagaggag
     1081 gcagtgtcaa aaggctctgc attgataagt ctgagacaaa ccagagatcc tatctctcta
     1141 aatagatcgt atccattgta atgctcatca ctaagaccca acctcctagc ttcttgtagt
     1201 atctttttgt gtgcctgaac tatgcactca tccaagctat gtgaatcccc cattctcaga
     1261 atgtaagaca ttagctgatc ccttgtttgt agccctctca caaatctatc actgcatatg
     1321 ctaaagatct cacaatcagg gatacctagt ggccagctaa gagccttcag gacatttggc
     1381 agcccctttc tagagaaact tgtgaggtca aacctggaga ggtcacttgc cataccttgg
     1441 aaggtataca tcataggctt gacagaacta aaatagcatg ctgggcccca agaagctggc
     1501 aactctccaa gggaataaaa gtcagccagt gagtttctgc gtccaaaccc aagtcttaac
     1561 ttctctagtg gtatttcaca atgctcataa gttgaaacgt catgagtgtg aaatttattg
     1621 taggcaacat aagacacact ggagaggagt ctatgacacc tcacatcaat gttaattgcc
     1681 gggtagtcaa acatgtactg gctgttcatc atgttgttgt tgatcattga ctagggggtc
     1741 tttgtgt
//
LOCUS       RSSB800AB     437 bp ds-DNA             BCT       26-MAY-1990
DEFINITION  R.sphaeroides B800-850 alpha and beta subunits of major
            light-harvesting complex.
ACCESSION   X05200
KEYWORDS    light-harvesting complex.
SOURCE      Rhodobacter sphaeroides.
  ORGANISM  Rhodobacter sphaeroides
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 437; no enum.)
  AUTHORS   Ashby,M.K., Coomber,S.A. and Hunter,C.N.
  TITLE     Cloning,nucleotide sequence and transfer of genes for the B800-850
            light harvesting complex of Rhodobacter sphaeroides
  JOURNAL   FEBS Lett. 213, 245-248 (1987)
  STANDARD  simple automatic
FEATURES       from  to/span     description
    pept         40      195     B800-850 beta subunit (AA 1-51)
    pept        210      374     B800-850 alpha subunit (AA 1-54)
BASE COUNT       78 a    156 c    124 g     79 t
ORIGIN      
        1 gccctagcgc acaccgtcga tttaccattg gagacgcaca tgactgacga tctcaacaaa
       61 gtctggccga gcggcctcac cgttgccgaa gccgaagaag ttcataagca actcatcctc
      121 ggcacccgcg tcttcggtgg catggctctg ctcgcgcact tcctcgccgc cgctgcgacc
      181 ccctggctcg gctgatatga gagactgaca tgaccaacgg caaaatctgg ctcgtggtga
      241 aaccgaccgt cggcgttccg ctgttcctca gcgctgccgt catcgcctcc gtcgttatcc
      301 acgctgctgt gctgacgacc accacctggc tgcccgccta ctaccaaggc tcggctgcgg
      361 tcgcggccga gtaatgctgc gcaagcgcgg gcctgcgggc ccacgccagc cagtccgtga
      421 gtccgagcag gccggga
//
LOCUS       RSSPETA       316 bp ds-DNA             BCT       26-MAY-1990
DEFINITION  R.sphaeroides Rieske Fe-S protein cytochrome b (petA) gene, 5' end.
ACCESSION   M18577
KEYWORDS    cytochrome b.
SOURCE      R.sphaeroides (strain GA) DNA.
  ORGANISM  Rhodobacter sphaeroides
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 316)
  AUTHORS   Davidson,E. and Daldal,F.
  TITLE     fbc operon, encoding the Rieske Fe-S protein cytochrome b, and
            cytochrome c1 apoproteins previously described from
            Rhodopseudomonas sphaeroides, is from Rhodopseudomonas capsulata
  JOURNAL   J. Mol. Biol. 195, 25-29 (1987)
  STANDARD  full staff_entry
FEATURES       from  to/span     description
    pept         32  >   316     Rieske Fe-S protein cytochrome b (gtg start
                                 codon)
BASE COUNT       54 a    111 c     99 g     52 t
ORIGIN      Unreported.
        1 ctgcagcggc ccgaggaagg gagaagttct cgtgtccaac gcagaagatc acgcaggcac
       61 tcgcagggat ttcctgtatt acgccacggc cggagccggg gcggtggcca ccggggccgc
      121 cgtctggccg ctgatcaacc aaatgaatcc gtcggccgac gtgcaggccc tcgcctccat
      181 cttcgtcgat gtgagctcgg tcgagccggg tgtccagctg accgtcaagt tcctcggcaa
      241 accgatcttc atccgccgcc gcaccgaggc cgacatcgag ctcggccgct ccgtccagct
      301 cggccagctg gtcgac
//
LOCUS       HUMERCC3A    2751 bp ss-mRNA            PRI       26-MAY-1990
DEFINITION  Human DNA repair helicase (ERCC3) mRNA, complete cds.
ACCESSION   M31899
KEYWORDS    Cockayne's syndrome; DNA repair; excision repair; helicase.
SOURCE      Human lymphoid cell line K562 cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2751)
  AUTHORS   Weeda,G., Van Ham,R.C.A., Vermeulen,W., Bootsma,D.,
            Van der Eb,A.J. and Hoeijmakers,J.H.J.
  TITLE     Identification of the molecular defect involving the human repair
            disorders xeroderma pigmentosum and Cockayne's syndrome in the
            ERCC-3-encoding, a presumed DNA repair helicase
  JOURNAL   Mol. Cell. Biol. 10, 2570-2581 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G. Weeda, 07-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept         96     2444     DNA repair helicase /hgml_locus_uid="LF0034Q"
                                 /map="2q21" /nomgen="ERCC3"
BASE COUNT      727 a    668 c    726 g    630 t
ORIGIN      
        1 gggagcttcc ggattgagcc ggaagtcccc ccagagcgga tgccgcggcg ggcctgtggg
       61 agcggggtca tcttctctct gctgctgtag ctgccatggg caaaagagac cgagcggacc
      121 gcgacaagaa gaaatccagg aagcggcact atgaggatga agaggatgat gaagaggacg
      181 ccccggggaa cgaccctcag gaagcggttc cctcggcggc ggggaagcag gtggatgagt
      241 caggcaccaa agtggatgaa tatggagcca aggactacag gctgcaaatg ccgctgaagg
      301 acgaccacac ctccaggccc ctctgggtgg ctcccgatgg ccatatcttc ttggaagcct
      361 tctctccagt ttacaaatat gcccaagact tcttggtggc tattgcagag ccagtgtgcc
      421 gaccaaccca tgtgcatgag tacaaactaa ctgcctactc cttgtatgca gctgtcagcg
      481 ttgggctgca aaccagtgac atcaccgagt acctcaggaa gctcagcaag actggagtcc
      541 ctgatggaat tatgcagttt attaagttgt gtactgtcag ctatggaaaa gtcaagctgg
      601 tcttgaagca caacagatac ttcgttgaaa gttgccaccc tgatgtaatc cagcatcttc
      661 tccaggaccc cgtgatccga gaatgccgct taagaaactc tgaaggggag gccactgagc
      721 tcatcacaga gactttcaca agcaaatctg ccatttctaa gactgctgaa agcagtggtg
      781 ggccctccac ttcccgagtg acagatccac agggtaaatc tgacatcccc atggacctgt
      841 ttgacttcta tgagcaaatg gacaaggatg aagaagaaga agaagagaca cagacagtgt
      901 cttttgaagt caagcaggaa atgattgagg aactccagaa acgttgcatc cacctggagt
      961 accctctgtt ggcagaatat gacttccgga atgattctgt caaccctgat atcaacattg
     1021 acctaaagcc cacagctgtc ctcagaccct atcaggagaa gagcttgcga aagatgtttg
     1081 gaaacgggcg tgcacgttcg ggggtcattg ttcttccctg cggtgctgga aagtccctgg
     1141 ttggtgtgac tgctgcatgc actgtcagaa aacgctgtct ggtgctgggc aactcagctg
     1201 tttctgtgga gcagtggaaa gcccagttca agatgtggtc caccattgac gacagccaga
     1261 tctgccggtt cacctccgat gccaaggaca agcccatcgg ctgctccgtt gccattagca
     1321 cctactccat gctgggccac accaccaaaa ggtcctggga ggccgagcga gtcatggagt
     1381 ggctcaagac ccaggagtgg ggcctcatga tcctggatga agtgcacacc ataccagcca
     1441 agatgttccg aagggtgctc accatcgtgc aggcccactg taagctgggt ttgactgcga
     1501 ccctcgtccg cgaagatgac aaaattgtgg atttaaattt tctgattggg cctaagctct
     1561 acgaagccaa ctggatggag ctgcagaata atggctacat cgccaaagtc cagtgtgctg
     1621 aggtctggtg ccctatgtct cctgaatttt accgggaata tgtggcaatc aaaaccaaga
     1681 aacgaatctt gctgtacacc atgaacccca acaaatttag agcttgccag tttctgatca
     1741 agtttcatga aaggaggaat gacaagatta ttgtctttgc tgacaatgtg tttgccctaa
     1801 aggaatatgc cattcgactg aacaaaccct atatctacgg acctacgtct cagggggaaa
     1861 ggatgcaaat tctccagaat ttcaagcaca accccaaaat taacaccatc ttcatatcca
     1921 aggtaggtga cacttcgttt gatctgccgg aagcaaatgt cctcattcag atctcatccc
     1981 atggtggctc caggcgtcag gaagcccaaa ggctagggcg ggtgcttcga gctaaaaaag
     2041 ggatggttgc agaagagtac aatgcctttt tctactcact ggtatcccag gacacacagg
     2101 aaatggctta ctcaaccaag cggcagagat tcttggtaga tcaaggttat agcttcaagg
     2161 tgatcacgaa actcgctggc atggaggagg aagacttggc gttttcgaca aaagaagagc
     2221 aacagcagct cttacagaaa gtcctggcag ccactgacct ggatgccgag gaggaggtgg
     2281 tggctgggga atttggctcc agatccagcc aggcatctcg gcgctttggc accatgagtt
     2341 ctatgtctgg ggccgacgac actgtgtaca tggagtacca ctcatcgcgg agcaaggcgc
     2401 ccagcaaaca tgtacacccg ctcttcaagc gctttaggaa atgatgctta ggcagggtac
     2461 ttcgttcaag accggcgctt ggcacccttg ttggaaaggg attttcagca taacattttc
     2521 cttccacctc tttgaccttc cctccagcgt tggccaaatt gtgctgagga agatgcatca
     2581 agggcttggc tgtgccttca taggtcatct agggttttat aaaggaggag gagacaatat
     2641 tttttcaaac tttttgggga gtggggtcat ttctgtatat aaaaaatgtt aatatttaag
     2701 gtgtatttat gttaccgttc tgaataaaca gaatggacca ttgaaccagt a
//
LOCUS       BOLREPA       182 bp ds-DNA             PLN       26-MAY-1990
DEFINITION  B.campestris tandemly repeated DNA.
ACCESSION   M30962
KEYWORDS    repetitive DNA.
SOURCE      B.campestris (strain Var B-85) seedling DNA.
  ORGANISM  Brassica campestris
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 182)
  AUTHORS   Das Gupta,J. and Mandal,R.K.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.K.Mandal, 21-DEC-1989. Bose Institute, Dept. Biochemistry,
            Centenary Building, P 1/12 CIT Scheme VIIM, Calcutta 700 054 INDIA.
FEATURES       from  to/span     description
BASE COUNT       54 a     39 c     29 g     60 t
ORIGIN      
        1 aagcttctta catcgtgatt catcctggtt tgattagaat gacaaagaag ctgtccaatt
       61 cccaaacagg aaaactggga tcacctgatt tgaaagtggg ttagcttctt catcctaact
      121 cctatgagat ttcttcaact tcctagtgat tctccattac tttaagtatc aaaatcaagc
      181 tt
//
LOCUS       BOLREPB       182 bp ds-DNA             PLN       26-MAY-1990
DEFINITION  B.juncea tandemly repeated DNA.
ACCESSION   M30963
KEYWORDS    repetitive DNA.
SOURCE      B.juncea (strain Var B-9) DNA.
  ORGANISM  Brassica juncea
            Unclassified.
REFERENCE   1  (bases 1 to 182)
  AUTHORS   Das Gupta,J. and Mandal,R.K.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.K.Mandal, 21-DEC-1989. Bose Institute, Dept. Biochemistry,
            Centenary Building, P 1/12 CIT Scheme VIIM, Calcutta 700 054 INDIA.
FEATURES       from  to/span     description
BASE COUNT       53 a     42 c     30 g     57 t
ORIGIN      
        1 aagcttctta cagagtcatt tatcctggtt tgattggaac accgaagaag ctgtcctatt
       61 cccaaactgg gaaactggaa tcacctgatt agaaagtggg ataacttctt catcccaact
      121 cctatgagat ttattcaact tcctggtgat tctccaacac tttatgtatc caaatcaagc
      181 tt
//
LOCUS       HUMHPV16A1    336 bp ds-DNA             PRI       26-MAY-1990
DEFINITION  Human DNA/HPV-16 insertion site, 5' flank, clone H022.
ACCESSION   M33610
KEYWORDS    insertion site.
SEGMENT     1 of 2
SOURCE      Human cervical cancer DNA, clone H022.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 336)
  AUTHORS   Wagatsuma,M., Hashimoto,K. and Matsukura,T.
  TITLE     Analysis of integrated human papillomavirus type 16 DNA in cervical
            cancers: Amplification of viral sequences together with cellular
            flanking sequences
  JOURNAL   J. Virol. 64, 813-821 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site        325      326     Human DNA end/HPV-16 DNA start
BASE COUNT      114 a     61 c     56 g    105 t
ORIGIN      
        1 aggtatataa atggccaagg tagaagatat caaaatgagg tggatttgat ttctcatgtg
       61 agactcatag ctaatttaaa tgaaaattta aataagattt atttgacatg attgggaaca
      121 attcaattca actttacaaa cactgattaa atgtctacca tctggatggc accgtgctaa
      181 gtgagtctcc aaacctgaac tgtgattata aagggcattt ataaactttc cctcaaagat
      241 aggacatttg cccatgtaat catgccatct ttaaaagcat cactctaaat tatttaggtg
      301 acttctaact ttgcccagta ctctgtccca cagcta
//
LOCUS       HUMHPV16A2   1002 bp ds-DNA             PRI       26-MAY-1990
DEFINITION  Human DNA/HPV-16 insertion site, 3' flank, clone H022.
ACCESSION   M33611
KEYWORDS    insertion site.
SEGMENT     2 of 2
SOURCE      Human cervical cancer DNA, clone H022.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1002)
  AUTHORS   Wagatsuma,M., Hashimoto,K. and Matsukura,T.
  TITLE     Analysis of integrated human papillomavirus type 16 DNA in cervical
            cancers: Amplification of viral sequences together with cellular
            flanking sequences
  JOURNAL   J. Virol. 64, 813-821 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site          9       10     HPV-16 DNA end/Human DNA start
BASE COUNT      284 a    190 c    206 g    322 t
ORIGIN      About 3187 bp after segment 1.
        1 gaagtggaat aaagtgaaag cctcactctt ctctagccta agttttagag tccagtgaag
       61 cattgcaagc ataggctttg tagtcagaaa accctgagat caaatcctgg ttctaccact
      121 tgctatagcg atcttgggca aggggtcaga tctctctaag cctgtttcct catctgtaag
      181 gaagggtatt atatcacata aggttactgt gaggactaaa ttagactaag tatgcaatag
      241 gaatacaggg tccagttttc tttggatgta atgggcctgg aaaattcctt aaaatccttt
      301 tcacctacaa aatcttatga agttctgcct attttctgct taaaaacttt aaaaaattaa
      361 tagaaataaa agagaattct actagagaga taggttgacg ttacttcttc cttgcttttt
      421 ccttaaagtg gaatgttaaa aactaggata tgcctggaaa gtgttctatc tacaaaaaag
      481 gaagttagca gccgctgaaa agtaactaca gatggctatt cactttactc tgaaagcatt
      541 tgctgttgat ataatcacac cacaggaaaa catcataatg ttggctgaaa gaaatctgaa
      601 atgacacagc aataatgctt catcatgtag aagttggttt caagtttttt tttttttttc
      661 ggtctggata gtgtgattgc aagaagggag gctatgctag cttggttata agcagggaag
      721 ttggctgtga ggagataaac agagatctca caggaattct ggggtagaaa tcactggacc
      781 ggaactgaag ggctatctcc cagcttctgt ttctgccttt tcattcagtc attccctcgt
      841 ttactcaaca gttccctctg ctttggtggc agtttctgct ccttctcaag gctgacttgc
      901 acatggctct gacttgctgt ggcctcctct ccatcattct ctgcatcagg tgctttcaac
      961 cttgatttta ttgtttatat atacttatga acttttctgc ag
//
LOCUS       HUMHPV16B     871 bp ds-DNA             PRI       26-MAY-1990
DEFINITION  Human DNA/HPV-16 insertion site, 5' flank, clone H404.
ACCESSION   M33612
KEYWORDS    insertion site.
SOURCE      Human cervical cancer DNA, clone H404.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 871)
  AUTHORS   Wagatsuma,M., Hashimoto,K. and Matsukura,T.
  TITLE     Analysis of integrated human papillomavirus type 16 DNA in cervical
            cancers: Amplification of viral sequences together with cellular
            flanking sequences
  JOURNAL   J. Virol. 64, 813-821 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site        861      862     Human DNA end/HPV-16 DNA start
BASE COUNT      291 a    147 c    164 g    269 t
ORIGIN      
        1 atactctgag taaacaagta aaacatttgg taaaataact ggaaggatat ataccatagt
       61 aaatgattct ttttcaaatt ttctattata tagctatata aggtatgaat ctagtagtta
      121 ccctcaaatt agggtaaaca atttcctcag cagtttgagc agctcatctc ataatacttt
      181 gcaaagatag ccacacaagg gaatgggctg cttgatttga acacaggtgg ggatggatta
      241 atagaactgg ggatcaggga acattgggca ggactaataa gaattaggca gtcagaaaaa
      301 gatttacaaa aaagactgta taacgagtct aaagataaat tctacctatt taacatttct
      361 gcctgagttt ggagaaggca agaaaacatt cttctcttcc tcttacgtac acagacaatt
      421 agggaagcca caatgagata atttatgcta tgttagtgag taacacataa ttttccttca
      481 cagctgatat aacttgatta ctggagtggc agtggaaggg catggagacc caggccatgg
      541 tcacttttct aggtgctcct acgactcaat ttctctcttc tgtcttgatt cctttgggag
      601 attcctggat tttagaaaat cagatgagta agttgttatc atctgaaaaa tgccctctta
      661 ccacacaatt atctattaga ggaaagttta ggaacagttg gtttaactga gagaaataaa
      721 gataatctct atctcccttg cctgctctta ggataagggt tctgagatcc tatataatct
      781 tatatcattt aacataaaca caatttctta ctttgcttga aaagttgtat taaagattcc
      841 agggtgcagt taaatacact tcacaatata c
//
LOCUS       HUMHPV16C1   1130 bp ds-DNA             PRI       26-MAY-1990
DEFINITION  Human DNA/HPV-16 insertion site, 5' flank, clone H705.
ACCESSION   M33613
KEYWORDS    insertion site.
SEGMENT     1 of 2
SOURCE      Human cervical cancer DNA, clone H705.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1130)
  AUTHORS   Wagatsuma,M., Hashimoto,K. and Matsukura,T.
  TITLE     Analysis of integrated human papillomavirus type 16 DNA in cervical
            cancers: Amplification of viral sequences together with cellular
            flanking sequences
  JOURNAL   J. Virol. 64, 813-821 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site       1120     1121     Human DNA end/HPV-16 DNA start
BASE COUNT      321 a    222 c    244 g    343 t
ORIGIN      
        1 tgccatcatt aatgcagctg gcacgacagg ttcccgactg aaacggcagt gagcgcaacg
       61 caattaatgt gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg
      121 ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgaca
      181 tgattacaat tcgagctcgg taccagcaca atgaggaatg catgctagca caagtgaaac
      241 tcatagatgt ccattgtgct atgcattttt tccttgggcc tgatccattt atccatttac
      301 tggtttcctg tctgtaaatt tagaaaagat acaggctctc tgaaaagtaa tttctgtctc
      361 ttacaagtga agggttaatc aaccaatcca cataattttc tccagtactg agagatcatt
      421 tgttttaata aatgcaaata aggtttctta tagttaaagg taattggctt ttcattgtaa
      481 ttcttgatgc tggtcatttt gtgtctgagt tgttcctaat tgctttggtt cagagtctga
      541 gaaatgaaat agccccttga ctataactgt aactacaatt ataacaattt atttatttaa
      601 atcagcaatc cctgcaaagt catttacagt ttgtttattt cagtatgttt tacaaggtgc
      661 aacaaaagca gcctcatcac atagcaaatc tttcttacag gattaaaagt taatgggtaa
      721 ggtaagtctg gcataggcat taaagtggaa gcattgtttc ttcttgactg gtcaacttta
      781 gagacaactt ttcccattcg aagttatcta tcctctaaaa tatacagaga ttgaggccag
      841 gtgggatggc tcacccctgt aatcccagaa ctttgggagg ccaagatgag tggattgctt
      901 gagctcagaa gtttgagacc ttggtaacat ggcgggatgc cggtggtgcc atgcctgtgg
      961 tcccagcttc ttgggggctg aggtgggagg accttctgag cctggtggca aagttgcagt
     1021 aagctgtgtt ggtatcactg cactccagcc tgcactcctg cacaaagcaa gaccctgtct
     1081 caaaacaata aataaattaa aatatagaga gactttgcat tgcaaaggca
//
LOCUS       HUMHPV16C2    148 bp ds-DNA             PRI       26-MAY-1990
DEFINITION  Human DNA/HPV-16 insertion site, 3' flank, clone H705.
ACCESSION   M33614
KEYWORDS    insertion site.
SEGMENT     2 of 2
SOURCE      Human cervical cancer DNA, clone H705.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 148)
  AUTHORS   Wagatsuma,M., Hashimoto,K. and Matsukura,T.
  TITLE     Analysis of integrated human papillomavirus type 16 DNA in cervical
            cancers: Amplification of viral sequences together with cellular
            flanking sequences
  JOURNAL   J. Virol. 64, 813-821 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site         28       29     HPV-16 DNA end/Human DNA start
BASE COUNT       45 a     24 c     28 g     51 t
ORIGIN      About 489 bases after segment 1.
        1 attatcacag atggtacaat gggcctactg atgcagtgat aatagtactg agatgtacta
       61 ttatcccaca tttagttaag ttaggattga tcctagattc acatgttgtc agtgtgatgc
      121 cttaaatatc aagtttccaa ttaagctt
//
LOCUS       HUMHPV16D2    510 bp ds-DNA             PRI       26-MAY-1990
DEFINITION  Human DNA/HPV-16 insertion site, 3 ' flank, clone H901.
ACCESSION   M33616
KEYWORDS    insertion site.
SEGMENT     2 of 2
SOURCE      Human cervical cancer DNA, clone H022.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 510)
  AUTHORS   Wagatsuma,M., Hashimoto,K. and Matsukura,T.
  TITLE     Analysis of integrated human papillomavirus type 16 DNA in cervical
            cancers: Amplification of viral sequences together with cellular
            flanking sequences
  JOURNAL   J. Virol. 64, 813-821 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site         10       11     HPV-16 DNA end/Human DNA start
BASE COUNT      140 a    133 c     94 g    143 t
ORIGIN      About 3994 bp after segment 1.
        1 acattattat ggaaacagat ctgtgagtac caagaaaaga ggataaagat tcatcccatc
       61 caccagtcat tcccatgcac ctctacccgc catcccctgt atccaggaca acccccttct
      121 gacaccaaaa tgcatttcac cattggctgc tgtcggtaga taatacctgc tcagcatttg
      181 ggacaagttc cagacataac ttcctcttag tgaatgatcc tgacaggaga aagaattgag
      241 cttaatttat gccatctaat aacctcagtg cagctacttg ggaagttagc cctccagagt
      301 ttcccccaaa gttttctcca gtgaattaca gtgccatata ttctcattgc taccagcgct
      361 gctcccaaaa tctatctgct gtttaatagt ttttaccttt caaaaatgca agctggctgg
      421 gcgtggattt ttgaaagcat tcctcctgcc ttggcctctc aaagtgctgg attagagggt
      481 gccttctaat cccagcaatc agcattggaa
//
LOCUS       HUMHPV1D1     510 bp ds-DNA             PRI       26-MAY-1990
DEFINITION  Human DNA/HPV-16 insertion site, 5' flank, clone H901.
ACCESSION   M33615
KEYWORDS    insertion site.
SEGMENT     1 of 2
SOURCE      Human cervical cancer DNA, clone H022.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 510)
  AUTHORS   Wagatsuma,M., Hashimoto,K. and Matsukura,T.
  TITLE     Analysis of integrated human papillomavirus type 16 DNA in cervical
            cancers: Amplification of viral sequences together with cellular
            flanking sequences
  JOURNAL   J. Virol. 64, 813-821 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site        500      501     Human DNA end/HPV-16 DNA start
BASE COUNT      159 a     82 c    101 g    168 t
ORIGIN      
        1 attcgagctc ggtacccaac atctcaaaat tttgttcttc agtctgtaaa atgggatgat
       61 aaatctctca ggtttggtgt aagaaaaaaa taatatgctc acctaataga ccttcaatta
      121 ctggtagttt ccatcatctt aatgaggatt atatctttat agtgagcacc cattagatgg
      181 tgttgataaa tacatcaatg agtattttag gcagaaagca gagtaaagca gaagtactgg
      241 cattctttgc tgtactcagt tttattaact gattttatat tgatcacgtt ctttgttaca
      301 tgtcagtatt atagtggcag ttgaaggtgg taatattttt agtctccgtt agtgaaatga
      361 caggcattga gctctcagtc atacctttgt aggccttcgt tgaggtgaat acctacctct
      421 taactagaaa aagatggaga atttcttgct tggaaggaaa ttaatgcaat gtccaggtca
      481 tctcctaaaa agcctgaagg aaacaaagta
//
LOCUS       HUMMHDQBH    1104 bp ss-mRNA            PRI       26-MAY-1990
DEFINITION  Human MHC HLA-DQ beta mRNA, complete cds.
ACCESSION   M32577
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human (DR4-Dw14), cDNA to mRNA, LS40 homozygous cell line.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1104)
  AUTHORS   Hilden,J.M., Curtsinger,J.M., Cairns,J.S. and Bach,F.H.
  TITLE     DQ beta sequences in HLA-DR4 haplotypes
  JOURNAL   Hum. Immunol. 18, 261-264 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      754     MHC HLA-DQ beta precursor (AA at 2)
                                 /nomgen="LS0098W" /map="6p21.3"
                                 /hgml_locus_uid="HLA-DQB1"
    sigp    <     1       64     MHC HLA-DQ beta signal peptide (AA at 2)
    matp         65      751     MHC HLA-DQ beta
BASE COUNT      231 a    324 c    303 g    246 t
ORIGIN      
        1 aggccttcgg gtagcaactg tgaccttgat gctggcgatg ctgagcaccc cggtggctga
       61 gggcagagac tctcccgagg atttcgtgta ccagtttaag ggcatgtgct acttcaccaa
      121 cgggacggag cgcgtgcgtc ttgtgaccag atacatctat aaccgagagg agtacgcacg
      181 cttcgacagc gacgtggggg tgtatcgggc ggtgacgccg ctggggccgc ctgccgccga
      241 gtactggaac agccagaagg aagtcctgga gaggacccgg gcggagttgg acacggtgtg
      301 cagacacaac taccagttgg agctccgcac gaccttgcag cggcgagtgg agcccacagt
      361 gaccatctcc ccatccagga cagaggccct caaccaccac aacctgctgg tctgctcagt
      421 gacagatttc tatccagccc agatcaaagt ccggtggttt cggaatgacc aggaggagac
      481 aactggcgtt gtgtccaccc cccttattag gaacggtgac tggaccttcc agatcctggt
      541 gatgctggaa atgactcccc agcgtggaga cgtctacacc tgccacgtgg agcaccccag
      601 cctccagaac cccatcatcg tggagtggcg ggctcagtct gaatctgccc agagcaagat
      661 gctgagtggc attggaggct tcgtgctggg gctgatcttc ctcgggctgg gccttattat
      721 ccatcacagg agtcagaaag ggctcctgca ctgactcctg agactatttt aactgggatt
      781 ggttatcact tttctgtaac gcctgcttgt ccctgcccag aattcccagc tgcctgtgtc
      841 agcctgtccc cctgagatca gagtcctaca gtggctgtca cgcagccacc aggtcatctc
      901 ctttcatccc cacctcgagg ctgatggctg tgaccctgct tcctgcactt acccagagcc
      961 tctgcctgtg cacggccagc tgcgtctact gaggccccaa ggggtttctg tttctattct
     1021 ctcctcagac tgctcaagag aagcacatga aaaccattac ctgactttag agctttttta
     1081 cataattaaa catgatcctg agtt
//
LOCUS       HUMMHDR1C    1191 bp ss-mRNA            PRI       26-MAY-1990
DEFINITION  Human class II HLA-DRB1-BON mRNA, complete cds.
ACCESSION   M33600
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human (haplotype DRB1-BON) DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1191)
  AUTHORS   Coppin,H.L., Avoustin,P., Fabron,J., Huchenq,A., Garnier,J.M.,
            Thomsen,M. and De Preval,C.
  TITLE     Evolution of the HLA-DR1 gene family: Structural and functional
            analysis of the new allele "DR-BON"
  JOURNAL   J. Immunol. 144, 984-989 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         71      871     MHC HLA-DR1-BON precursor
    sigp         71      157     MHC HLA-DR1-BON signal peptide
    matp        158      868     MHC HLA-DR1-BON
BASE COUNT      258 a    312 c    344 g    277 t
ORIGIN      
        1 gcccaagtat caagagggag agtgagactt gcctgcttct ctggcccctg gtcctgtcct
       61 gttctccagc atggtgtgtc tgaagctccc tggaggctcc tgcatgacag cgctgacagt
      121 gacactgatg gtgctgagct ccccactggc tttggctggg gacacccgac cacgtttctt
      181 gtggcagctt aagtttgaat gtcatttctt caatgggacg gagcgggtgc ggttgctgga
      241 aagatgcatc tataaccaag aggagtccgt gcgcttcgac agcgacgtgg gggagtaccg
      301 ggcggtgacg gagctggggc ggcctgatgc cgagtactgg aacagccaga aggacatcct
      361 ggaagacgag cgggccgcgg tggacaccta ctgcagacac aactacgggg ttggtgagag
      421 cttcacagtg cagcggcgag ttgagcctaa ggtgactgtg tatccttcaa agacccagcc
      481 cctgcagcac cacaacctcc tggtctgctc tgtgagtggt ttctatccag gcagcattga
      541 agtcaggtgg ttccggaacg gccaggaaga gaaggctggg gtggtgtcca caggcctgat
      601 ccagaatgga gattggacct tccagaccct ggtgatgctg gaaacagttc ctcggagtgg
      661 agaggtttac acctgccaag tggagcaccc aagtgtgacg agccctctca cagtggaatg
      721 gagagcacgg tctgaatctg cacagagcaa gatgctgagt ggagtcgggg gcttcgtgct
      781 gggcctgctc ttccttgggg ccgggctgtt catctacttc aggaatcaga aaggacactc
      841 tggacttcag ccaacaggat tcctgagctg aaatgcagat gaccacattc aaggaagaac
      901 cttctgtccc agctttgcag aatgaaaagc tttcctgctt ggcagttatt cttccacaag
      961 agagggcttt ctcaggacct ggttgctact ggttcggcaa ctgcagaaaa tgtcctccct
     1021 tgtggcttcc tcagctcctg cccttggcct gaagtcccag cattgatgac agcgcctcat
     1081 cttcaacttt tgtgctcccc tttgcctaaa ccgtatggcc tcccgtgcat ctgtacctca
     1141 ccctgtacga caaacacatt acattattaa atgtttctca aagatggagt t
//
LOCUS       HUMMHDRBBB   1216 bp ss-mRNA            PRI       26-MAY-1990
DEFINITION  Human MHC class II HLA-DR beta-1 mRNA (DR2.3), 5'end.
ACCESSION   M32578
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human type I diabetic (Dw4/LD MN2), cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1216)
  AUTHORS   Freeman,S.M., Saunders,T.L., Madden,M., Segall,M., Bach,F.H. and
            Wu,S.
  TITLE     Comparison of DR beta-1 alleles from diabetic and normal
            individuals
  JOURNAL   Hum. Immunol. 19, 1-6 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         62      862     MHC HLA-DR beta-1 precursor /nomgen="LV0063D"
                                 /map="6p21.3" /hgml_locus_uid="HLA-DRB1"
    sigp         62      148     MHC HLA-DR beta-1 signal peptide
    matp        149      859     MHC HLA-DR beta-1
BASE COUNT      265 a    331 c    341 g    279 t
ORIGIN      
        1 agttctccct gagtgagact tgcctgctcc tctggcccct ggtcctgtcc tgttctccag
       61 catggtgtgt ctgaagctcc ctggaggttc ctacatggca gtgctgacag tgacactgat
      121 ggtgctgagc tccccactgg ctttggctgg ggacacccga ccatgtttct tgcagcagga
      181 taagtatgag tgtcatttct tcaacgggac ggagcgggtg cggttcctgc acagaggcat
      241 ctataaccaa caggagaacg tgcgcttcga cagcgacgtg ggggagtacc gggcggtgac
      301 ggagctgggg cggcctgacg ctgagtactg gaacagccag aaggacatcc tggagcaggc
      361 gcgggccgcg gtggacacct actgcagaca caactacggg gctgtggaga gcttcacagt
      421 gcagcggcga gttgagccta aggtgactgt gtatcctgca aggacccaga ccctgcagca
      481 ccacaacctc ctggtctgct ctgtgaatgg tttctatcca ggcagcattg aagtcaggtg
      541 gttccggaac ggccaggaag agaaggctgg ggtggtgtcc acaggcctga ttcagaatgg
      601 agactggacc ttccagattc tggtgatgct ggaaacagtt cctcggagtg gagaggttta
      661 cacctgccaa gtggagcacc caagcgtgac gagccctctc acagtggaat ggagagcaca
      721 gtctgaatct gcacagagca agatgctgag tggaatcggg ggctttgtgc tgggcctgct
      781 cttccttggg gccgggctat tcatctactt caagaatcag aaagggcact ctggacttca
      841 cccaacagga ctcgtgagct gaagtgcaga tgaccacatt caagggggaa ccttctgccc
      901 cagctttgca tgatgaaaag ctttcctgct tggctcttat tcttccacaa gagaggactt
      961 tctcaggccc tggttgctac cggttcagca actctgcaga aaatgtccat ccttgtggct
     1021 tcctcagctc ctgcccttgg cctgaagtcc cagcattgat ggcagtgcct catcttcaac
     1081 tttagtgctc ccctttacct aaccctacgg cctcccatgc atctgtactc cccctgtgcc
     1141 acaaatggac tacgttatta aatttttctg aagcccagag ttaaaaatca tctgtccacc
     1201 tggcaccaaa gacaaa
//
LOCUS       J05239        162 bp ds-DNA             BAD       26-MAY-1990
DEFINITION  Figure 1. Sequence of the 166-bp restriction fragment.
ACCESSION   J05239
REFERENCE   1  (bases 1 to 162)
  AUTHORS   Jones,B.K. and Yeung,A.T.
  TITLE     dna base composition determines the specificity of uvrabc
            endonuclease incision of a psoralen cross-link
  JOURNAL   J. Biol. Chem. 265, 3489-3496 (1990)
  STANDARD  unannotated staff_entry
COMMENT     Bad entry: secondary reference to PNASU 75, 5314-5318 (1978): lac
            promoter sequence.
FEATURES       from  to/span     description
BASE COUNT       40 a     43 c     40 g     39 t
ORIGIN      
        1 cctccgttga gccatctgga tcggcagcgt tgtcttcatc aaccggaacg agcatgccgg
       61 agagcagctc actcattagg caccccaggc tttacacttt atgcttccgg ctcgtataat
      121 gtgtggaatt gtgagcggat aacaatttca cacaggaaac ag
//
LOCUS       MLVENVB      2002 bp ss-RNA             VRL       26-MAY-1990
DEFINITION  Murine leukemia virus 10A1 derivative env gene, complete cds.
ACCESSION   M33470
KEYWORDS    envelope protein.
SOURCE      Murine leukemia virus 10A1 derivative viral RNA, clone 10A1.
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 2002)
  AUTHORS   Ott,D., Friedrich,R. and Rein,A.
  TITLE     Sequence analysis of amphotropic and 10A1 murine leukemia viruses:
            Close relationship to mink cell focus-inducing viruses
  JOURNAL   J. Virol. 64, 757-766 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         65     2002     env protein
BASE COUNT      528 a    553 c    482 g    439 t
ORIGIN      
        1 ggatccacgc cgctcacgta aaggcggcga caacccctcc ggccggaaca gcatcaggac
       61 cgacatggaa ggtccagcgt tctcaaaacc ccttaaagat aagattaacc cgtggaagtc
      121 cttaatggtc atgggggtct atttaagagt agggatggca gagagccccc atcaggtctt
      181 taatgtaacc tggagagtca ccaacctgat gactgggcgt accgccaatg ccacctccct
      241 tttaggaact gtacaagatg ccttcccaag attatatttt gatctatgtg atctggtcgg
      301 agaagagtgg gacccttcag accaggaacc atatgtcggg tatggctgca aataccccgg
      361 agggagaaag cggacccgga cttttgactt ttacgtgtgc cctgggcata ccgtaaaatc
      421 ggggtgtggg gggccaagag agggctactg tggtgaatgg ggttgtgaaa ccaccggaca
      481 ggcttactgg aagcccacat catcatggga cctaatctcc cttaagcgcg gtaacacccc
      541 ctgggacacg ggatgctcca aaatggcttg tggcccctgc tacgacctct ccaaagtatc
      601 caattccttc caaggggcta ctcgaggggg cagatgcaac cctctagtcc tagaattcac
      661 tgatgcagga aaaaaggcta attgggacgg gcccaaatcg tggggactga gactgtaccg
      721 gacaggaaca gatcctatta ccatgttctc cctgacccgc caggtcctca atatagggcc
      781 ccgcatcccc attgggccta atcccgtgat cactggtcaa ctacccccct cccgacccgt
      841 gcagatcagg ctccccaggc ctcctcagcc tcctcctaca ggcgcagcct ctatagtccc
      901 tgagactgcc ccaccttctc aacaacctgg gacgggagac aggctgctaa acctggtaga
      961 aggagcctat caggcgctta acctcaccaa tcccgacaag acccaagaat gttggctgtg
     1021 cttagtgtcg ggacctcctt attacgaagg agtagcggtc gtgggcactt ataccaatca
     1081 ttctaccgcc ccggccagct gtacggccac ttcccaacat aagcttaccc tatctgaagt
     1141 gacaggacag ggcctatgca tgggagcact acctaaaact caccaggcct tatgtaacac
     1201 cacccaaagt gccggctcag gatcctacta ccttgcagca cccgctggaa caatgtgggc
     1261 ttgtagcact ggattgactc cctgcttgtc caccacgatg ctcaatctaa ccacagacta
     1321 ttgtgtatta gttgagctct ggcccagaat aatttaccac tcccccgatt atatgtatgg
     1381 tcagcttgaa cagcgtacca aatataagag ggagccagta tcgttgaccc tggcccttct
     1441 gctaggagga ttaaccatgg gagggattgc agctggaata gggacgggga ccactgccct
     1501 aatcaaaacc cagcagtttg agcagcttca cgccgctatc cagacagacc tcaacgaagt
     1561 cgaaaaatca attaccaacc tagaaaagtc actgacctcg ttgtctgaag tagtcctaca
     1621 gaaccgaaga ggcctagatt tgctcttcct aaaagaggga ggtctctgcg cagccctaaa
     1681 agaagaatgt tgtttttatg cagaccacac gggactagtg agagacagca tggccaaact
     1741 aagggaaagg cttaatcaga gacaaaaact atttgagtca ggccaaggtt ggttcgaagg
     1801 gcagtttaat agatccccct ggtttaccac cttaatctcc accatcatgg gacctctaat
     1861 agtactctta ctgatcttac tctttggacc ctgcattctc aatcgattgg tccaatttgt
     1921 taaagacagg atctcagtgg tccaggctct ggttttgact caacaatatc accagctaaa
     1981 acctatagag tacgagccat ga
//
LOCUS       MLVENVC      2001 bp ss-RNA             VRL       26-MAY-1990
DEFINITION  Murine leukemia virus env gene, complete cds.
            genes.
ACCESSION   M33469
KEYWORDS    envelope protein.
SOURCE      Murine leukemia virus viral RNA, clone 4070A.
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 2001)
  AUTHORS   Ott,D., Friedrich,R. and Rein,A.
  TITLE     Sequence analysis of amphotropic and 10A1 murine leukemia viruses:
            Close relationship to mink cell focus-inducing viruses
  JOURNAL   J. Virol. 64, 757-766 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         37     2001     env protein
BASE COUNT      532 a    560 c    472 g    437 t
ORIGIN      
        1 ggccgacacc cagagtggac catcctctgg acggacatgg cgcgttcaac gctctcaaaa
       61 ccccctcaag ataagattaa cccgtggaag cccttaatag tcatgggagt cctgttagga
      121 gtagggatgg cagagagccc ccatcaggtc tttaatgtaa cctggagagt caccaacctg
      181 atgactgggc gtaccgccaa tgccacctcc ctcctgggaa ctgtacaaga tgccttccca
      241 aaattatatt ttgatctatg tgatctggtc ggagaggagt gggacccttc agaccaggaa
      301 ccgtatgtcg ggtatggctg caagtacccc gcagggagac agcggacccg gacttttgac
      361 ttttacgtgt gccctgggca taccgtaaag tcggggtgtg ggggaccagg agagggctac
      421 tgtggtaaat gggggtgtga aaccaccgga caggcttact ggaagcccac atcatcgtgg
      481 gacctaatct cccttaagcg cggtaacacc ccctgggaca cgggatgctc taaagttgcc
      541 tgtggcccct gctacgacct ctccaaagta tccaattcct tccaaggggc tactcgaggg
      601 ggcagatgca accctctagt cctagaattc actgatgcag gaaaaaaggc taactgggac
      661 gggcccaaat cgtggggact gagactgtac cggacaggaa cagatcctat taccatgttc
      721 tccctgaccc ggcaggtcct taatgtggga ccccgagtcc ccatagggcc caacccagta
      781 ttacccgacc aaagactccc ttcctcacca atagagattg taccggctcc acagccacct
      841 agccccctca ataccagtta ccccccttcc actaccagta caccctcaac ctcccctaca
      901 agtccaagtg tcccacagcc acccccagga actggagata gactactagc tctagtcaaa
      961 ggagcctatc aggcgcttaa cctcaccaat cccgacaaga cccaagaatg ttggctgtgc
     1021 ttagtgtcgg gacctcctta ttacgaagga gtagcggtcg tgggcactta taccaatcat
     1081 tccaccgctc cggccaactg tacggccact tcccaacata agcttaccct atctgaagtg
     1141 acaggacagg gcctatgcat gggggcagta cctaaaactc accaggcctt atgtaacacc
     1201 acccaaagcg ccggctcagg atcctactac cttgcagcac ccgccggaac aatgtgggct
     1261 tgcagcactg gattgactcc ctgcttgtcc accacggtgc tcaatctaac cacagattat
     1321 tgtgtattag ttgaactctg gcccagagta atttaccact cccccgatta tatgtatggt
     1381 cagcttgaac agcgtaccaa atataaaaga gagccagtat cattgaccct ggcccttcta
     1441 ctaggaggat taaccatggg agggattgca gctggaatag ggacggggac cactgcctta
     1501 attaaaaccc agcagtttga gcagcttcat gccgctatcc agacagacct caacgaagtc
     1561 gaaaagtcaa ttaccaacct agaaaagtca ctgacctcgt tgtctgaagt agtcctacag
     1621 aaccgcagag gcctagattt gctattccta aaggagggag gtctctgcgc agccctaaaa
     1681 gaagaatgtt gtttttatgc agaccacacg gggctagtga gagacagcat ggccaaatta
     1741 agagaaaggc ttaatcagag acaaaaacta tttgagacag gccaaggatg gttcgaaggg
     1801 ctgtttaata gatccccctg gtttaccacc ttaatctcca ccatcatggg acctctaata
     1861 gtactcttac tgatcttact ctttggacct tgcattctca atcgattggt ccaatttgtt
     1921 aaagacagga tctcagtggt ccaggctctg gttttgactc agcaatatca ccagctaaaa
     1981 cccatagagt acgagccatg a
//
LOCUS       MTYRPVP      6331 bp ss-RNA             VRL       26-MAY-1990
DEFINITION  Eggplant mosaic virus genome.
ACCESSION   J04374
KEYWORDS    replicase protein; virion protein.
SOURCE      Eggplant mosaic tymovirus viral RNA.
  ORGANISM  Eggplant mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Tymovirus.
REFERENCE   1  (bases 1 to 6331)
  AUTHORS   Osorio-Keese,M.E., Keese,P. and Gibbs,A.
  TITLE     Nucleotide sequence of the genome of eggplant mosaic tymovirus
  JOURNAL   Virology 172, 547-554 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        102     2051     overlapping out-of-phase protein
    pept        109     5628     replicase protein (putative)
    pept       5633     6199     virion protein
BASE COUNT     1337 a   2441 c    987 g   1566 t
ORIGIN      
        1 gtaatcagaa ccagaactaa ccctgttatc agccttagtt cttttacttt cctgtccaaa
       61 tttctgaacc gactagtgcc ttcctagaac ccactacgtc aatgcctcat ggcctttcag
      121 tctgctctcg aagctctcaa ctcaactact cacagagatg cttctacaaa tccaattctg
      181 aactccgtcg tggaacctct ccgcgactct ctatccctat atccctggct ccttcccaaa
      241 gaagccgttc cccaccttct atcctggggc atcccgaact ccggcctcgg agtcactccc
      301 cacccccacc caatccacaa aacagtcgag acttttctcc tgttcaatca ctggcatgct
      361 ctcgctcgcc tgccttcaac tgtgatgttc atgaaaccgt ccaagtttca aaaacttgcg
      421 gctctaaacc caaaattcca agagttgatc aactttcgac tcactgccgc cgacaccact
      481 cgctacccct ccacctcact cacttttcca agcaattcaa tttgcttcat gcacgatgct
      541 ctgatgtact tttctccagc tcagatcgtc gatctcttca ctcagtctcc cgcactcgag
      601 accctgtact gcagtctcat agtgcctcca gagtctcatt tcacagatct ctctctcttc
      661 cccgagatct acacttacaa gatctcaggt cagactctcc actacatccc ggagaatcac
      721 cactccggct cgtacaatca gcccctccaa gccccatctt ggctgaagat ttcctccatc
      781 ctctcgcctt ccctcgcttt gtctgtgacc aagctggaat cttggggccc agtccactcc
      841 atattgatcc agcgaggcct accaccaaag ccctctctct ctgcacgccc ccccgtcctg
      901 ccaaatcaac ctccccgtgc aacaactccc aactcccaaa accaactgct gcatcagaca
      961 agccagctat tcttccaact gcagcagcct caactcagcc tggtctcctt ccgaattcca
     1021 gactgcgtag aactgccaca agccaccttt ctgcgccaac ctctccgcca ccggctagtg
     1081 ccaacaagcg tttacaacgc tctcttcacc tacactcgcg cagtccgcac tcttcgcact
     1141 tccgacccag ccggatttgt gcgaactcaa agcaacaaac ccgagcacgc ttgggtcact
     1201 ccaaacgcgt gggacaatct gcagaccttg tctgtcaatg ccccccaccg cccccaagta
     1261 tgctaccact tcttctcctc ccccgtggca aggttaaagc tccacttcgc ccaacactgg
     1321 cgagcctatc ttttggctct caccccattc cttaccacgt cacctcttct cctcccctta
     1381 ttcaatttca acaccccttt ccccctccct cggctacttt ctctgtttcg ccgctcggtg
     1441 tcctcaccac ggcttttgca ctcaatccta cccagtcagc tgagaggagc tgcgatcccg
     1501 aatcgcccac tcccactctg ggtcacaaaa ctacatcact ttctcgactc ccactccctc
     1561 ctccccactc cccccattcg gcccaggata gagcttcagc gcttgccact gatgtctcta
     1621 attccgaaac caaaaattgt ccttccccta ctgtccctcc tcctttcctc cccaaccatc
     1681 tacatccact tcttccaggc acagaccccc caacaactcc acgacaatta tcaccttcac
     1741 cttcatccct ctcgcttcga actttcctgg actctgcagt catatcatgt gactcaagcc
     1801 cagtccttcc tccctctcct tctcccagct cccactcaag ctcaagcttc caatcctgca
     1861 cctcgccccc ccgctttcca tgctatcccc ctcccccctc agccctcgac ctcctcttct
     1921 cctccactcc aggaaccgac cctttccccc cacctgatac acccccccct cacaagagaa
     1981 ccatcgccct tgaacggctg cgcctgcgac agtgcgctac tcccttccac agctgcgatg
     2041 acgtctgctg aacatcccac tccactcaac ccccccacac ctagcccaac accagacgtc
     2101 cctcctcccg actcacccgg taacccatca cttttgaagc aagtccctcc cgaagcgaac
     2161 ttgcatccta tccacaaccc agacctcccc tcttccacca ctcttccttc tggggccctg
     2221 acactggtcc cagccaaaac tccttccatc tacgccaatc ccaccccccc cagttcccat
     2281 ccgttcaccc cactggctga tgaccccact gctgtgggtc cttgcctacc gttccacgtt
     2341 ctccacccgg ctgactactt tcctctttca gccgagtttc tcacacggac ccggcatgtc
     2401 cccccctctt ctctctcaca tccaaaactc aattgcctac tcacctgctt ttctgaactt
     2461 tcaggacact ctgagtcaga tctttggttg tccctgcaat caatacttcc tgactcccaa
     2521 ctccaaaatc ctgaagtctc gacacttggc ctgtccactg acattctcac agctctctgc
     2581 ttcatctacc attcatctgt gactctccat gccccctcag gagtttatca ctacggcata
     2641 gcctcctctt ctaccgtcta tgtcatccac tatcaaccag gccctcctcc tcatttttct
     2701 ctctccccta gacttgccgc ttctgctcct cgctgcaacc ccaccaacag cagattggtc
     2761 agacaagctc tgcggtttaa attgaacggc gagtttctcc ccttcaccca ggcttacgcg
     2821 catgaatctt ccatcaccca tgccaaaaac ctcatctcca acatgaagaa tggttttgat
     2881 ggaatcatgt cttctctcac tgactcctct aagggtccct ccccccgtga aaaactgacc
     2941 actctcgact ctctcataga tgtcgctgcc cctcgcgaag tttctctcat ccacatcgcc
     3001 ggcttcgcag gctgcggcaa gacccacccc atccaaaaac tcctccaaac ttcccctttt
     3061 cacgacttcc gaatctcatg ccccactaat gaactccgat ccgaatggaa gcgtgatatg
     3121 caaccaacag ctgaaaatgt ttggaggttc tccacatggg aatccagcct gctcaaacat
     3181 tccgagatcc tcgtaatcga cgagatttac aagctccctc gtggctacct agatctctcc
     3241 atccttgctg atccaactct ctccttggtc atcatccttg gtgaccctct ccaaggagag
     3301 tatcactcga cctctcctca cagctccaat cactttcttc caagtgaggt ccaccgcttc
     3361 aagtcttaca tcgactgcta ctgtttttgg tcccaccgca ttccaaagca gatagcatcc
     3421 ttgttcggcg tagtatgcca caacacgaac gaaggtttcg tgagagccct cacatctcat
     3481 ccccccaatt ccaaaaacct caccaatgcg accaacactg ctctcagtct ccaacagatg
     3541 ggccaccacg ctatcaccat cagcgccaga agggtcacct tcaccgaggc ccatacaatt
     3601 ctgcttgatc gtcataccaa ccttctctcc cccaacaact gtcttgttgc cctcacccgc
     3661 agccgcactg gcgtctactt cgtcggcaat ctgcacctgg catcaaacag ctttggcaca
     3721 aactacatgt tctctcaagc tctctgccaa ggcacaatcg acctaaacaa cgtgttcccc
     3781 cacatcatgc ctcacctccc gaaaatgtat gaacccatcc gctcccggtc caaccgtttt
     3841 gtgtctgggt ccctcaattt tcgaccaacc accaattccc gcctcctttc cagtctcact
     3901 aagccaaccc acctcccccc tcacatccct accaaccact ccctggatgt cctagtttcc
     3961 aaccctgtgc tccttggtga gaccctcgac cctcgattgg aggtcctcca cctcccccca
     4021 actcgcctcc cattgcatct ggacctcctg cccacagtac cttcctcttc cagcttctcc
     4081 tcagtcgacc atcttttccc aacccccatc tcccccgcta tctgcggcta caccttcgaa
     4141 aatttggccg cattcttcct cccagctcat gacccggacc taaaggaggt gctcatcaat
     4201 gaccaaaaga gcaaccagtt cccatacttg gacgcccctt ttgagctttc gtgccaaccc
     4261 tcctcactgt tggcaccaat tcacaagccg gcctcggatc caacccttct ccctggctcc
     4321 atcaagaaac gcctcagatt ccgcgcttct tcctccccat attccatcac tccatctgat
     4381 caacttcttg gtcaacacct cttctcttct ttgtgcctgg cttatgggcg caaccccaat
     4441 tctgtcctcc ccttccaacc tgagctcttc agtgagtgca tatgcattaa tgattacgct
     4501 caactctcct ccaagactca agccaccatc gtggccaatc atcaaaggtc tgatcctgac
     4561 tggcgcctaa ctgctgtccg catctttgcc aaggctcaac acaaagtaaa cgacgcttcc
     4621 atcttttccg ggtggaaggc ttgccaaact ctagccctga tgcacggtta catcattctc
     4681 gtactcggcc cagtcaagaa ataccaacgc atttttgatt ccaaggacag acctccccac
     4741 atctactacc actgcggtaa aactccctcc cagctctccc aatggtgcca aactcacctt
     4801 tctggctctt cctacatcgc caacgactac actgcctttg atcagtccca acacggcgag
     4861 gctgtggtcc tggaatgttt gaagatgcgc cgcctctcca tcccggactc tctcattcag
     4921 ctccactccc acctcaagtg ttccgtcgac acccagttcg gccccctcac ctgcatgcgc
     4981 ctcactggcg agccgggcac ttatgatgac aactctgact acaacctagc tgtcatctac
     5041 tcccaatact ccctcaatgg ccaccccatt ctgatctcag gcgatgactc cgtcctttgc
     5101 ggcacaccgc ccccttctcc actttggccc actctcaaga aaatgcttca tctccgtttc
     5161 aagatcgaac ggacctccca ccccctcttc tgcgggtatt acgtctcccc tcatggcgct
     5221 gcccgcaacc cgtatgctct cttcgccaag ctcatgatct gcgttgatga caagagcctc
     5281 catgacaaga agttgtccta tctctctgaa ttctccactg gccatctggc tggcgacctg
     5341 gtcacctcca ttctcccttc ccacctactt ccctatcagt ccgccgtgca cgacttcttc
     5401 tgccggaatt gcacgcccgc ggaaaaaatt ctcctgtctc tggacccaat ccctgagtcc
     5461 aaaatcctcc agctcattct caaagttcgc tgggcttctc aagctttctt ttcctacctg
     5521 cctcaaaaag ctcgcgaact ccttgtggca cgctcttctc tcccgtccct ctattccaat
     5581 cccaaagtct ctcaactgga gtctgaattg cttcccttct ctcaatagat caatggaaga
     5641 cacagcaatc atcagaagcc ctcagccctc cataaacgca ccaggcttcc atctgccacc
     5701 caccgactca caacaatcct ctgctattga actccccttc cagtttcagg ccaccacttt
     5761 tggcgcgact gaaacagctg ctcaaatcag tctggcctcc gccaacgcta ttaccaagct
     5821 cgcgtctctc taccgccatg tgcggctcac gcagtgcgct gccaccatca ctccgacagc
     5881 ggccgccatt gccaatcctc tcactgtcaa catcgtctgg gtgtctgaca attccactgc
     5941 caagcccacc gagattctca atgtctttgg tggatcttcc tacacgtttg gcggcgccct
     6001 caatgccacc aagcccctta ccatccctct ccccatgaac tcggtcaact gtatgctcaa
     6061 ggactctgtt ctttacacag attgcccaaa gctcctggcc tactcagctg ctcccagctc
     6121 tccctccaaa accccaaccg ccactatcca aatccatggc aagctccgct tgtcctcccc
     6181 cctcctccaa gccaattaac tctctctccc tcagccacca cctcgctcct cccccatctc
     6241 ctatggtaat tgcggacagt tccgctccct ctagcacaca gaggtccatt tgggtgcgac
     6301 tcccccccct cccgtgggtc aacgggaacc a
//
LOCUS       RATRGHA       542 bp ds-DNA             ROD       26-MAY-1990
DEFINITION  Rat growth hormone (rGH) gene, intron B repetitive DNA.
ACCESSION   M32696
KEYWORDS    repetitive DNA.
SOURCE      Rat (strain Sprague-Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 542)
  AUTHORS   Guitierrez-Hartmann,A., Lieberburg,I., Gardner,D., Baxter,J.D. and
            Cathala,G.G.
  TITLE     Transcription of two classes of rat growth hormone gene-associated
            repetitive DNA: Differences in activity and effects of tandem
            repeat structure
  JOURNAL   Nucleic Acids Res. 12, 7153-7173 (1984)
  STANDARD  simple staff_entry
BASE COUNT      199 a    114 c    118 g    111 t
ORIGIN      
        1 aacagtaatg acagagaggg ctggagagat ggctcagtgg ttaagagcac ccgactgctc
       61 ttccaaaggt cctgagttca attccagcaa ccacatggtg gctcacaacc atctgtaaag
      121 agatccgatg ccctcttctg gtgtgtctga agacagctac agtgtactta tataataaac
      181 aaataaatct ttaaaaaaaa aaacaaaaac ggggctggag agatggctca gcggttaaga
      241 gcgcccgact gctcttccag aggtcatgag ttcaattcca gcaaccacat ggtggctcac
      301 aaccatctgt aaagagatct gatgccctct tctggtgtat ctgaagacag ctacagtgta
      361 cttatatata ataaataaat aaatctttaa aaaaaaaaca aaacaggggc tggggattta
      421 gctcagtggt agagcgctta cctaggaagc gcaaggccct gggttcggtc cccagctccg
      481 aaaaaaagaa ccaaaaaaaa aaaaaaaaac caaaacaaaa acaaaacagt aatgacagag
      541 ag
//
LOCUS       ALRVSRC      1801 bp ss-RNA             VRL       26-MAY-1990
DEFINITION  Rous sarcoma virus (Schmidt-Ruppin D strain) v-src gene, complete
            cds.
ACCESSION   M33292
KEYWORDS    oncogene; pp60v-src; src gene; tyrosine kinase.
SOURCE      Rous sarcoma virus (strain Schmidt-Ruppin D) RNA, clone psrc1.
  ORGANISM  Rous sarcoma virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Avian sarcoma viruses.
REFERENCE   1  (bases 1 to 1801)
  AUTHORS   Reddy,S., Mazzu,D., Mahan,D. and Shalloway,D.
  TITLE     Sequence and functional differences between Schmidt-Ruppin D and
            Schmidt-Ruppin A strains of pp60v-src
  JOURNAL   Unpublished (1990) 406 S. Frear Bldg, University Park, PA 16802
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.I.Shalloway, 26-MAR-1990.
FEATURES       from  to/span     description
    pept         65     1645     pp60v-src protein
BASE COUNT      392 a    537 c    551 g    321 t
ORIGIN      
        1 actctgctgg tggcctcgcg taccactgtg gccaagcggt agctggaacg tgcagccgac
       61 caccatgggg agtagcaaga gcaagcctaa ggaccccagc cagcgccggc gcagcctgga
      121 gccacccgac agcacccacc acgggggatt cccagcctcg cagaccccca acaagacagc
      181 agcccccgac acgcaccgca cccccagccg ctccttcggg accgtggcca ccgagcccaa
      241 gctcttcgag gacttcaaca cttctgacac cgttacgtcg ccgcagcgtg ccggggcact
      301 ggctggcggc gtcaccactt tcgtggctct ctacgactac gagtcctgga ttgaaacgga
      361 cttgtccttc aagaaaggag aacgcctgca gattgtcaac aacacggaag gtaactggtg
      421 gctggctcat tccgtgacta caggacagac gggctacatc cccagtaact atgtcgcgcc
      481 ctcagactcc atccaggctg aagagtggta ctttgggaag atcactcgtc gggagtccga
      541 gcggctgctg ctcaaccccg aaaacccccg gggaaccttc ttggtccggg agagcgagac
      601 gacaaaaggt gcctattgcc tctccgtttc tgactttgac aacgccaagg ggctcaatgt
      661 gaagcactac aagatccgca agctggacag cggcggcttc tacatcacct cacgcacaca
      721 gttcagcagc ctgcagcagc tggtggccta ctactccaaa catgctgatg gcttgtgcca
      781 ccgcctgacc aacgtctgcc ccacgtccaa gccccagacc cagggactcg ccaaggacgc
      841 gtgggaaatc ccccgggagt cgctgcggct ggaggtgaag ctggggcagg gctgctttgg
      901 agaggtctgg atggggacct ggaacggcac caccagagtg gccataaaga ctctgaagcc
      961 cggcaccatg tccccggagg ccttcctgca ggaagcccaa gtgatgaaga agctccagca
     1021 tgagaagctg gttcaactgt acgcagtcgt gtcggaagag cccatctaca tcgtcattga
     1081 gtacatgagc aaggggagcc tcctggattt cctgaaggga gagatgggca agtacctgcg
     1141 gctgccacag ctcgttgata tggctgatca gattgcatcc ggcatggcct atgtggagag
     1201 gatgaactac gtgcaccgag acctgcgggc ggccaacatc ctggtggggg agaacctggt
     1261 gtgcaaggtg gctgactttg ggctggcacg cctcatcgag gacaacgagt acacagcacg
     1321 gcaaggtgcc aagttcccca tcaagtggac agcccccgag gcagccctct atggccggtt
     1381 caccatcaag tcggatgtct ggtccttcgg catcctgctg actgagctga ccaccaaggg
     1441 ccggatgcca tacccaggga tgggcaacgg ggaggtgctg gaccgggtgg agaggggcta
     1501 ccgcatgccc tgcccgcccg agtgccccga gtcgctgcat gaccttatgt gccagtgctg
     1561 gcggagggac cctgaggagc ggcccacttt tgagtacctg caggcccagc tgctccctgc
     1621 ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt taagctacaa caaggcaagg
     1681 cttggccgac aattgcatga agaatctgct tagggttagg cgttttgcgc tgcttcgcga
     1741 tgtacgggcc agatatacgc gtatctgagg ggactagggt gtgtttaggc gaaaagcggg
     1801 g
//
LOCUS       AVIH2AA      3800 bp ds-DNA             BCT       26-MAY-1990
DEFINITION  A.vinelandii H2 uptake hydrogenase (hoxK), complete cds,
            and H2 uptake hydrogenase (hoxG), complete cds.
ACCESSION   M33152
KEYWORDS    H2 uptake hydrogenase.
SOURCE      A.vinelandii (strain OP) DNA, clone pALM21.
  ORGANISM  Azotobacter vinelandii
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Azotobacteraceae.
REFERENCE   1  (bases 1 to 3800)
  AUTHORS   Menon,A.L., Stultz,L.W., Robson,R.L. and Mortenson,L.E.
  TITLE     Cloning, nucleotide sequence and characterization of the (NiFe)
            hydrogenase structural genes and hoxG from Azotobacter vinelandii
  JOURNAL   Unpublished (1990) U of Georgia, Dep Biochemistry, Athens, GA 30602
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.L.Robson, 22-MAR-1990.
FEATURES       from  to/span     description
    pept        149     1225     H2 uptake hydrogenase (hoxK) precursor
    sigp        149      283     H2 uptake hydrogenase signal peptide (put.)
    matp        284     1222     H2 uptake hydrogenase
    pept       1222     3030     H2 uptake hydrogenase (hoxG)
    ORF        3047     3769     ORF3
BASE COUNT      686 a   1318 c   1213 g    583 t
ORIGIN      
        1 tgtatcaagc catgacaaaa acatggcatt ggcgcattat tcgtgcggtt ttcattcagc
       61 aaccgtgggc catacaaccg gcgcgccgtc atagccgaag gacggtgcgc aggggcgccg
      121 ataacgacct ggccacaagg gtaacggcat gtctcgactc gaaactttct atgacgtgat
      181 gcggcgtcag ggcatcacgc gccgcagctt tctcaaatat tgcagcctga ccgccgcggc
      241 cctgggcctc ggcccggcct tcgccccgcg gatcgcccac gcgatggaaa ccaagccgcg
      301 cactccggtg ctctggctgc acggcctgga gtgcacctgc tgctccgagt cgttcatccg
      361 ttcggcccac ccgctggtca aggacgtggt gctgtcgatg atctcgctgg actacgacga
      421 caccctgatg gccgccgccg gccaccaggc cgaggccgcc ctcgaagaga ccatgcgcaa
      481 gtacaagggc gagtacatcc tcgccgtgga gggcaacccg ccgctcaacg aggacggcat
      541 gttctgcatc gtcggcggca agccgttcat cgagcagctc aggcatgtgg cgaaggacgc
      601 caaggcggtg atcgcctggg gcagttgcgc cagttggggc tgcgtgcagg cggcccggcc
      661 caacccgacc caggcggtgc cgatccacaa ggtcatcacc gacaagccga tcgtcaaggt
      721 gcccggctgc ccgccgatcg ccgaggtgat gaccggggtg atcacctaca tgctgacctt
      781 cggcaagctg cccgagctgg accgccaggg gcggccgaag atgttctacg gccagcgcat
      841 ccacgacaag tgctaccgcc gcccgcactt cgacgccggc cagttcgtcg agcactggga
      901 cgacgagggc gcgcgcaagg gctactgcct gtacaaggtc ggctgcaagg gcccgaccag
      961 ctacaacgcc tgctcgacgg tgcgctggaa cgagggcact tccttcccga tccaggccgg
     1021 ccacggctgc atcggctgct cggaggacgg tttctgggac aagggctcgt tctatgaacg
     1081 cctgaccacc attccgcagt tcggcatcga gaagaacgcc gacgaaatcg gcgccgccgt
     1141 cgccggcggg gtcggcgcgg ccatcgccgc gcatgccgcg gtcaccgcca tcaagcgcct
     1201 gcagaacaag ggggatcgcc catgagcagc ctgccgaacg ccagccaact ggacaagtcc
     1261 ggcaggcgca tcgtcgtcga cccggtgacc cgcatcgagg gccacatgcg ctgcgaggtc
     1321 aacgtcgacg ccagcaacgt gatcaccaac gccgtctcca ccggcaccat gtggcgcggc
     1381 ctggaggtca tcctcaaggg ccgcgacccg cgcgacgcct gggccttcgt cgagcgcatc
     1441 tgcggcgtct gcaccggcac ccatgcgctg acctcggtgc gcgcggtgga ggatgccctg
     1501 gacatccgca tcccctacaa cgcccacctg atccgcaacc tgatggacaa gacgctgcag
     1561 gtgcacgacc acatcgtgca cttctaccac ctgcacgcgc tggactgggt caacccggtc
     1621 aacgccctga aggccgatcc caaggctacc tccgccctgc agcaggcggt ttcgccggcc
     1681 catgccaagt ccagccccgg ctacttccgc gacgtgcaga cgcgcctgaa gaagttcgtc
     1741 gagagcggcc agctcggcct gttctccaac ggctactggg acaatccggc ctacaagctg
     1801 ccgcccgagg cggacctgat ggccgtggcc cactacctgg aggcgctgga cctgcagaag
     1861 gacatcgtca agatccatac catcttcggc ggcaagaacc cgcatccgaa ctacatggtc
     1921 ggcggcgtgg cctgcgccat caacctggac gacgtcggcg ccgccggcgc gccggtcaac
     1981 atgaccagcc tgaacttcgt cctcgaacgc atccacgagg cccgcgagtt caccaggaac
     2041 gtctacctgc cggacgtgct ggcggtcgcc gggatctaca aggactggct gtacggcggc
     2101 ggtctggccg cgcacaacct gctgtcctac ggcaccttca ccaaggtgcc ctacgacaag
     2161 tccagcgacc tgttgccggc cggcgccatc gtcggcggca attgggacga ggtgctgccg
     2221 gtcgacgtgc gcgatcccga ggagatccag gagttcgtca gccactcctg gtacagctac
     2281 gccgacgaaa ccaaggggct gcatccctgg gacggcgtca ccgagccgaa attcgagctc
     2341 ggcccgaaca ccaagggcag ccgcacccac atccaggaaa tcgacgaggc gcacaagtac
     2401 agctggatca aggcgccgcg ctggcgcggc cacgctatgg aggtcggccc gctggcacgt
     2461 tacatcatcg cctacgcttc gggccgcgaa tacgtgaagg aacaggtcga ccgctcgctg
     2521 gccgccttca accagagcac cggcctgaac ctcggcctca agcagttcct gccctcgacc
     2581 ctcggccgca ccctggcgcg cgccctggag tgcgagctgg cggtggacag catgctcgac
     2641 gactggcagg ccctggtcgg caacatcaag gccggcgacc gcgccaccgc caacgtcgag
     2701 aagtgggacc cgagcacctg gccgaaggag gccaagggcg tgggcatcaa cgaggcgccg
     2761 cgcggcgccc tgggccactg gatcaggatc aaggacggca agatcgagaa ctaccaggcg
     2821 atcgtgccga ccacctggaa cggcaccccg cgcgaccatc tgggcaacat cggcgcctac
     2881 gaggccgcgc tgctcaacac caggatggag cgcccggacg agccggtgga gatcctgcgc
     2941 accctgcaca gcttcgaccc ctgcctggcc tgttcgaccc acgtgatgtc gccggacggc
     3001 caggagctga cccgggtgaa ggtccgctga accggaggat tgcgcgatgg cactggaaaa
     3061 atccctggaa accggcgacg gccaggagaa ggtccgcaag cagaccgcgg tgtacgtcta
     3121 cgaggcgccg ctgcgcctct ggcactgggt cacggcgctg tccatcgtcg tgctcggcgt
     3181 gaccggctac ttcatcggcg cgccgctgcc gacgatgccc ggcgaggcga tggacaacta
     3241 cctgatgggc tacatccgct tcgcccactt cgccgccggc tacgtgctgg cgatcggctt
     3301 cctcggccgg gtctactggg ccttcgtcgg caaccaccac gcccgcgagc tgttcctcgt
     3361 gccggtgcac cgcaaggcct ggtggaagga gctgtggcac gaggtgcgct ggtacctgtt
     3421 cctggaaaag accccgaaga agtacatcgg ccacaacccc ctgggccagt tggcgatgtt
     3481 ctgcttcttc gtggtcggcg cggtgttcat gagcgtcacc ggcttcgccc tctacgccga
     3541 ggggctgggg cgggacagct gggccgaccg gctgttcggc tgggtgatcc cgctgttcgg
     3601 ccagagccag gacgtgcaca cctggcacca cctgggcatg tggtacctcg tcgtcttcgt
     3661 catggtgcat gtctacctgg ccgtgcgcga agacatcgtt tcccggcagt cgctgatctc
     3721 caccatggtc ggcggctggc ggatgttcaa ggacgaccgg ccggattgag ccccgtgtcg
     3781 tcccttccgt ccgggccggt
//
LOCUS       RABIGHAS      402 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-1.
ACCESSION   M29412
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-1.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 402)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept          1  >   402     Ig mu-chain V-D-J precursor
    sigp          1       57     Ig mu-chain signal peptide
    matp         58  >   402     Ig mu-chain
    recomb      339      340     V-region end/D-region start
    recomb      355      356     D-region end/J-region start
BASE COUNT       86 a    108 c    118 g     90 t
ORIGIN      
        1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
       61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
      121 acagcctctg gattctccct cagtagttac tacatgcaat gggtccgcca ggctccaggg
      181 aaggggctgg aatggatcgg aatcattggt agtagtggta gcacatacta cgcgagctgg
      241 gtgaagggcc gattcaccat ctccaaaacc tcgaccacgg tggatctgaa aatgaccagt
      301 ctgacaaccg aggacacggc cacctatttc tgtgccagag catatattag taatactgat
      361 ggttctggct ttaacttgtg gggccaaggc accctggtca cc
//
LOCUS       RABIGHAT      399 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-3-1.
ACCESSION   M29413
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-3-1.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 399)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept          1  >   399     Ig mu-chain V-D-J precursor
    sigp          1       57     Ig mu-chain signal peptide
    matp         58  >   399     Ig mu-chain
    recomb      342      343     V-region end/D-region start
    recomb      361      362     D-region end/J-region start
BASE COUNT       88 a    114 c    111 g     86 t
ORIGIN      
        1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
       61 tcggtggagg agtccggcgg tcgcctggta aagcctgacg aatccctgac actcacctgc
      121 acagcctctg gattctccct cagtacctac aacatgatct gggtccgcca ggctccagga
      181 aaggggctgg aatacatcgg ccacattagt tttggtggta gcacatacta cgcgagctgg
      241 gcgaaaggtc gatgcaccat atccaaaacc tcgaccacgg tggatctgaa aatgaccagt
      301 ctgacaaccg aggacacggc cacctatttc tgtgccaggg gatggactcc taaaagtctt
      361 tcagccttta acttgtgggg cccaggcacc ctggtcacc
//
LOCUS       RABIGHAU      390 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-5.
ACCESSION   M29414
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-5.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 390)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept          1  >   390     Ig mu-chain V-D-J precursor
    sigp          1       57     Ig mu-chain signal peptide
    matp         58  >   390     Ig mu-chain
    recomb      339      340     V-region end/D-region start
    recomb      368      369     D-region end/J-region start
BASE COUNT       86 a    100 c    119 g     85 t
ORIGIN      
        1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
       61 tcggtggagg agtccgggga tcgcctggtc acgcctggga cacccctgac actcacatgc
      121 acagtctctg gattctccct caatagttat gtagtgggct gggtccgcca ggctccagag
      181 aagggactgg aatacatcgg aaccatttgg gtcgatggta agacatacta cgcgagctgg
      241 acgaagggcc gattcaccat ctctaaaacc tcgaccacgg tggatctgaa aatgaccagt
      301 ctgacaaccg aggacacggc cacatatttc tgtgccagat atggtagtag tggtgattta
      361 ggcgtgtggg gccaagggac cctggtcacc
//
LOCUS       RABIGHAV      351 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 2-1.
ACCESSION   M29415
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 2-1.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 351)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept    <     1  >   351     Ig mu-chain V-D-J precursor (AA at 1)
    sigp    <     1       21     Ig mu-chain signal peptide
    matp         22  >   351     Ig mu-chain
    recomb      306      307     V-region end/D-region start
    recomb      339      340     D-region end/J-region start
BASE COUNT       77 a    101 c    100 g     73 t
ORIGIN      
        1 gtgctcaaag gtgtccagtg tcagtcgctg gaggagtccg ggggtcgcct ggtcacgcct
       61 gggacacccc tgacactcac ctgcacagcc tctggattct ccctcagtag ctactggatg
      121 acctgggtcc gccaggctcc agggaagggg ctggaatgga tcggaatcat tgttcatggt
      181 gatagcgcat actacgcgag ctgggcgaaa ggccgattca ccatctccag aacctcgacc
      241 acggtggatc tgaaaatcac cagtccgaca accgaggaca cggccaccta tttctgtgcc
      301 agagaatatt atggtactat taacttgtgg ggcccaggca ccctggttac c
//
LOCUS       RABIGHAW      408 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-2.
ACCESSION   M29416
KEYWORDS    diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain;
            joining exon; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-2.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 408)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept          1  >   408     Ig gamma-chain V-D-J precursor
    sigp          1       57     Ig gamma-chain signal peptide
    matp         58  >   408     Ig gamma-chain
    recomb      342      343     V-region end/D-region start
    recomb      369      370     D-region end/J-region start
BASE COUNT       77 a    112 c    127 g     92 t
ORIGIN      
        1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
       61 tcgctggagg agtccggggg tcgcctggtc acgcctggga catccctgac actcacctgc
      121 acagtctctg gattctccct cagtactagt gcaatggcct gggtccgcca ggctccaggg
      181 aaggggctgg aatatgtcgg agtcattagt ggaagtggtg gcacatacta cgcgagctgg
      241 gcgagcggcc ggttcaccat ttccaaagcc tcgtcgacca cggtggatct gaaaatgacc
      301 agtctgacaa ccgaggacac ggccacctat ttctgtgcca gagtcaggga tagtcatggt
      361 tatattggtg atgcttttga tccctggggc ccaggcaccc tggtcacc
//
LOCUS       RABIGHAX      390 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-3-1.
ACCESSION   M29417
KEYWORDS    diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain;
            joining exon; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-3-1.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 390)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept          1  >   390     Ig gamma-chain V-D-J precursor
    sigp          1       57     Ig gamma-chain signal peptide
    matp         58  >   390     Ig gamma-chain
    recomb      339      340     V-region end/D-region start
    recomb      356      357     D-region end/J-region start
BASE COUNT       77 a    110 c    120 g     83 t
ORIGIN      
        1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
       61 tcgctggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
      121 acagtctctg gattctccct cagtagtcgc tggatgagct gggtccgcca ggctccaggg
      181 gaggggctgg aatccatcgg agccattgat actggtggta gcgcatacta cgcgaactgg
      241 gtgaaaggcc gactcaccat ctccaaaacc tcgtcgacca cggtggattt gaaaatgacc
      301 agtctgacaa ccgaggacac ggccacctat ttctgtgcca gagattatag tggtggactt
      361 gacttgtggg gcacaggcac cctggtcacc
//
LOCUS       RABIGHAY      399 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-4.
ACCESSION   M29418
KEYWORDS    diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain;
            joining exon; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-4.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 399)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept          1  >   399     Ig gamma-chain V-D-J precursor
    sigp          1       57     Ig gamma-chain signal peptide
    matp         58  >   399     Ig gamma-chain
    recomb      339      340     V-region end/D-region start
    recomb      364      365     D-region end/J-region start
BASE COUNT       83 a    116 c    116 g     84 t
ORIGIN      
        1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
       61 tcgctggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
      121 acagcctctg gattcaccat cagtagctac cacatgatct gggtccgcca ggctccaggg
      181 gaggggctgg aatacatcgg atggattagt actggtggta gcgcatacta cgcgaactgg
      241 gcaaaaggcc gattcaccat ctccagaacc tcgaccacgg tggatctgaa aatgaccagt
      301 ctgacaaccg aggacacggc cacctatttc tgttgcagaa ctcctgctgt tagtaaatgg
      361 gacttgtggg gcccgggcac cctagtcacc gtctcctca
//
LOCUS       RABIGHAZ      384 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 4-1.
ACCESSION   M29419
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 4-1.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 384)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept          1  >   384     Ig mu-chain V-D-J precursor
    sigp          1       57     Ig mu-chain signal peptide
    matp         58  >   384     Ig mu-chain
    recomb      339      340     V-region end/D-region start
    recomb      365      366     D-region end/J-region start
BASE COUNT       82 a    114 c    113 g     75 t
ORIGIN      
        1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
       61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
      121 acagtctctg gaatcgacct cagtggctac cacatgagct gggtccgcca ggctccaggg
      181 gaggggctgg aatggatcgg aaccatgagt actactgata acacatatta cgcgagctgg
      241 gcaaaaggcc gattcaccat ctccaaaacc tcgaccacgg tggatctgaa aatgaccagt
      301 ctgacagccg cggacacggc cacctatttc tgtgccagag gacaggcaac ttttattccc
      361 tggggcccag gcaccctggt cacc
//
LOCUS       RABIGHBA      393 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 5-2.
ACCESSION   M29420
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 5-2.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 393)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept          1  >   393     Ig mu-chain V-D-J precursor
    sigp          1       57     Ig mu-chain signal peptide
    matp         58  >   393     Ig mu-chain
    recomb      339      340     V-region end/D-region start
    recomb      364      365     D-region end/J-region start
BASE COUNT       80 a    103 c    118 g     92 t
ORIGIN      
        1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag
       61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc
      121 acagtctctg gaatcgacct cagtagcttt gcaatggcct gggttcgcca ggctccaggg
      181 aaggggctgg agtggatcgg aatcattaat ggttatggta ctacatacta cgcgagctgg
      241 gtgaatggcc gattcaccat ctccaaaacc tcgacctcgg tggatctgaa aatgaccagt
      301 ctgacaaccg aggacacggc cacctatttc tgtgtcagat atcttagtga tggttggtat
      361 ctagacttgt ggggccaagg caccctggtc acc
//
LOCUS       RABIGHBB      375 bp ss-mRNA            MAM       26-MAY-1990
DEFINITION  Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 7-2.
ACCESSION   M29421
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region.
SOURCE      Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 7-2.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 375)
  AUTHORS   DiPietro,L.A. and Knight,K.L.
  TITLE     Restricted utilization of germ-line VH gene and diversity of D
            regions in rabbit splenic Ig mRNA
  JOURNAL   J. Immunol. 144, 1969-1973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.DiPietro, 25-OCT-1989.
FEATURES       from  to/span     description
    pept    <     1  >   375     Ig mu-chain V-D-J precursor (AA at 1)
    sigp          1       57     Ig mu-chain signal peptide
    matp         58  >   375     Ig mu-chain
    recomb      339      340     V-region end/D-region start
    recomb      361      362     D-region end/J-region start
BASE COUNT       82 a     98 c    112 g     83 t
ORIGIN      
        1 gtgctcaaag gtgtccagtg tcagtcggtg gaggagtccg ggggtcgcct ggtcacgcct
       61 gggacacccc tgacactcac ctgcacagtc tctggattct ccctcaataa ttatgcaatg
      121 ggctgggtcc gccaggctcc agggaagggg ctagaatgga tcggaaccat tggtactggt
      181 ggtagcgtat actacgcgaa ctgggcaaaa ggccgattca ccatctccag aacctcgacc
      241 acggtggatc tgaaaatgac cagtctgaca accgaagaag gacacgccac ctatttctgt
      301 gccagagtgg ctggtggtac tgtttttggc tatgtggggt actttaactt gtggggccaa
      361 ggcaccctgg tcacc
//
LOCUS       PHVARCA       902 bp ss-mRNA            PLN       26-MAY-1990
DEFINITION  P.vulgaris arcelin 2 mRNA, complete cds.
ACCESSION   M28470
KEYWORDS    arcelin.
SOURCE      P.vulgaris, cDNA to mRNA, clone pARC2-11 and pARC2-191.
  ORGANISM  Phaseolus vulgaris
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 902)
  AUTHORS   John,M.E. and Long,C.M.
  TITLE     Sequence analysis of arcelin 2: A lectin-like plant protein
  JOURNAL   Gene 86, 171-176 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by M.E.John, 29-SEP-1989.
FEATURES       from  to/span     description
    pept          1      798     arcelin 2
BASE COUNT      249 a    282 c    164 g    207 t
ORIGIN      
        1 atggcttcct ccaacttact caccctagcc ctcttccttg tgcttctcac ccacgcaaac
       61 tcaagcaacg acgcctcctt caacgtcgag acgttcaaca aaaccaacct catcctccaa
      121 ggcgatgcca ccgtctcatc cgaaggccac ttactactaa ccaatgttaa aggcaacgaa
      181 gaggactcta tgggccgcgc cttctactcc gcccccatcc aaatcaatga cagaaccatc
      241 gacaacctcg ccagcttctc caccaacttc acattccgta tcaacgctaa gaacaatgaa
      301 aattccgcct atggccttgc ctttgctctc gtccccgtcg gctctcggcc caaacttaaa
      361 ggccgttatc taggtctttt caacacagcc aactacgacc gcgacgccca tactgtggct
      421 gtggtgttcg acaccgtcag caaccgtatt gaaatcgacg tgaactccat ccggcctatc
      481 gcaacggagt cttgcaattt cggccacaac aacggagaaa aggccgaggt tcggatcacc
      541 tattactccc ccaagaacga cttgagggtt tctctgcttt acccttcttc ggaagaaaag
      601 tgccacgtct ctgccacagt gccgctggag aaagaagttg aggactgggt gagcgttggg
      661 ttctctgcca cctcagggtc gaaaaaagag accactgaaa cgcacaacgt cctctcttgg
      721 tctttttctt ccaacttcat caattttgag ggcaaaaaat ctgaacgttc caacatcctc
      781 ctcaacaaga tcctctagac tcccaaagcc agcttcactg tgacagtaaa accttcctta
      841 tacgctaata atgttcatct gtcacacaaa ctacaataaa taaaatggga gcaataaata
      901 aa
//
LOCUS       DROGOALA     2204 bp ss-mRNA            INV       26-MAY-1990
DEFINITION  Drosophila melanogaster G-o-alpha-like protein, clone lambda-DGo59.
ACCESSION   M29731 J05089
KEYWORDS    G protein; guanine nucleotide-binding protein.
SOURCE      D.melanogaster adult head cDNA to mRNA, clone lambda-DGo59.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2204)
  AUTHORS   Thambi,N.C., Quan,F., Wolfgang,W.J., Spiegel,A. and Forte,M.
  TITLE     Immunological and molecular characterization of G-o-alpha-like
            proteins in the Drosophila central nervous system
  JOURNAL   J. Biol. Chem. 264, 18552-18560 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Thambi 04-OCT-1989.
FEATURES       from  to/span     description
    pept        166     1230     G-o-alpha-like protein
BASE COUNT      757 a    493 c    459 g    495 t
ORIGIN      
        1 gaattccgtg ctcggcaagt gcaacgttga aatcgttaaa ctgtacataa gcaaataaga
       61 cataaagaaa aaagtccagg aaaattggaa aacaaaagcc cgaaaaccga aaagccccgt
      121 gtaaatccga atccgaatcc aaatcagtat ccaaacccaa ccacaatggg ctgcaccaca
      181 tccgccgaag aacgcgccgc catccagcga tccaaacaga tcgagaagaa tctaaaggag
      241 gatggaatcc aggcggccaa ggacatcaag ctcctgctgc tgggtgccgg tgagtcgggc
      301 aagagcacaa tagtcaaaca gatgaaaatc attcacgaga gcggcttcac tgcggaggac
      361 tttaaacaat atcgaccggt tgtctacagc aacacaatac aatcattagt tgcaatattg
      421 cgcgcgatgc caaccctaag tattcagtac agcaataacg agcgggagag cgatgccaag
      481 atggtgttcg acgtatgcca acgcatgcac gacaccgagc ccttctcgga ggagctgctg
      541 gccgccatga aacgcctctg gcaggacgcc ggtgtccagg agtgcttctc gcgcagcaac
      601 gaataccaac taaacgattc cgcaaaatat ttcctggacg atttggatcg gttaggcgcc
      661 aaggattacc agccaactga acaagatatc ttgcgcactc gcgtcaagac cactggcatc
      721 gttgaggtac acttctcctt caaaaacctc aactttaaat tgtttgacgt gggcggtcag
      781 cgctcggaac gtaagaaatg gatacactgc ttcgaagatg tcacggcgat cattttctgc
      841 gtggccatgt ccgagtacga tcaagtcttg catgaggatg aaaccacgaa ccgcatgcaa
      901 gagtcgctga aactgtttga ctcgatctgt aacaacaaat ggttcacgga cacctcgatt
      961 attctatttc tgaacaagaa ggatttgttc gaggagaaga ttcgcaagag tcccctgacg
     1021 atttgcttcc ccgaatacac aggtggacag gagtacggcg aggcggctgc ttacattcag
     1081 gctcaatttg aagcgaaaaa caaatcaacc tcaaaagaaa tctactgcca catgacgtgt
     1141 gccacagata ccaataacat tcagtttgta ttcgatgctg tcaccgatgt catcatagca
     1201 aacaacctgc gcggctgtgg actgtactaa gatggattcc aggccggatc ccgacgatgt
     1261 cgacgtccga gtcgatattg atgacgatga cgattatgtg gagcagaatg ggggcgttac
     1321 gagggaacac cgtaacggta ttaaagagca gcgcggagca caacaaccca ccagcattga
     1381 tcaaaaaacc aaacaattta ggagcagatg atagaaccaa ccaacaaacc aaccgcaaac
     1441 cacacagaaa acataggaca ctgaacaagc aaagcccaaa gaacttttat ttgtttaaca
     1501 aaaaaacggc ggacggacgg aaatcccgaa tggatgttat agggaaaatg agcgacaagt
     1561 acattacata atatcgataa tattgaagca gatgcagatg caaatacaca caatgctaat
     1621 gatgatcagg gcgactatga ctaaatgagg cagcaggcaa ctgacactgg gacacgcgat
     1681 taaagtcaca tctgaaaaaa ggcagttgat tgaaaggcat ttctatatac aaacatatac
     1741 aaacacatac atatgcatta tgcaaagcca catgtacgac atgacactaa cacactcaca
     1801 cgacaaacac aagcgccaac attgcataca gttgttgttt ggtctgaata atttttatag
     1861 aatttcataa tttatgtgta gtttagtttc ctcatgtatt tattaaaaca aaaaccaaac
     1921 gagcgtatat ctacatatac cgcatatata tatatacata cacttctata catatatata
     1981 tatatatata catatatata aatattatat attaaatgtt tcctgttgca atctctcttt
     2041 aaaattattc atgccatcaa cgctctgcat ttgtcatgct tgtttagact taagttcgaa
     2101 agtttcaaca aaatccagcg tcaaaggaaa tatcaatatt catttgattg agtgtcagcg
     2161 tgtggtctaa agtaaatata taaaataaca aaccaaaaaa aaaa
//
LOCUS       DROGOALB     2558 bp ss-mRNA            INV       26-MAY-1990
DEFINITION  Drosophila melanogaster G-o-alpha-like protein, clone lambda-DGo21.
ACCESSION   M29732 J05089
KEYWORDS    G protein; guanine nucleotide-binding protein.
SOURCE      D.melanogaster adult head cDNA to mRNA, clone lambda-DGo21.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2558)
  AUTHORS   Thambi,N.C., Quan,F., Wolfgang,W.J., Spiegel,A. and Forte,M.
  TITLE     Immunological and molecular characterization of G-o-alpha-like
            proteins in the Drosophila central nervous system
  JOURNAL   J. Biol. Chem. 264, 18552-18560 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Thambi 04-OCT-1989.
FEATURES       from  to/span     description
    pept        520     1584     G-o-alpha-like protein
BASE COUNT      867 a    557 c    530 g    604 t
ORIGIN      
        1 gaattccggt tgcctatttc tctcgcttac ctatttattt agcatacatt ttccaagcat
       61 cctgtgaaaa aaccatcaca agttttcctt cgaacggaat gccaagtgca ttctggaagg
      121 aaatcgttgt acatctacat aatgccaata aagaaaatgt aactaaagta aaaaaaaaaa
      181 aaaagagcta aaccgttaaa ttaaagtttt aaagttaaaa aaacgctgaa taagtgttaa
      241 atatatataa caaaaatatt gttgaattga agaaaaccaa agttcaaaaa cctgaaaaaa
      301 ccataaagaa gtgattgaaa aatcagttga agtgccgtac tgaaaattaa agtccagtga
      361 cacgatcgaa tccctcggat agcggagtta gtttagcccc ccgaattcga gtccccgcac
      421 gttgtacacc tggtttttct cgctggcaac gtagtcggcc attgagttgg ccgataccaa
      481 acgaccttca aaacgttttg cgtcgaggca atacgcacca tgggctgcgc acagtctgcc
      541 gaggagcgag ccgcagccgc caggagtcgc ctcatcgagc gcaatctaaa ggaggatgga
      601 atccaggcgg ccaaggacat caagctcctg ctgctgggtg ccggtgagtc gggcaagagc
      661 acaatagtca aacagatgaa aatcattcac gagagcggct tcactgcgga ggactttaaa
      721 caatatcgac cggttgtcta cagcaacaca atacaatcat tagttgcaat attgcgcgcg
      781 atgccaaccc taagtattca gtacagcaat aacgagcggg agagcgatgc caagatggtg
      841 ttcgacgtat gccaacgcat gcacgacacc gagcccttct cggaggagct gctggccgcc
      901 atgaaacgcc tctggcagga cgccggtgtc caggagtgct tctcgcgcag caacgaatac
      961 caactaaacg attccgcaaa atatttcctg gacgatttgg atcggttagg cgccaaggat
     1021 taccagccaa ctgaacaaga tatcttgcgc actcgcgtca agaccactgg catcgttgag
     1081 gtacacttct ccttcaaaaa cctcaacttt aaattgtttg acgtgggcgg tcagcgctcg
     1141 gaacgtaaga aatggataca ctgcttcgaa gatgtcacgg cgatcatttt ctgcgtggcc
     1201 atgtccgagt acgatcaagt cttgcatgag gatgaaacca cgaaccgcat gcaagagtcg
     1261 ctgaaactgt ttgactcgat ctgtaacaac aaatggttca cggacacctc gattattcta
     1321 tttctgaaca agaaggattt gttcgaggag aagattcgca agagtcccct gacgatttgc
     1381 ttccccgaat acacaggtgg acaggagtac ggcgaggcgg ctgcttacat tcaggctcaa
     1441 tttgaagcga aaaacaaatc aacctcaaaa gaaatctact gccacatgac gtgtgccaca
     1501 gataccaata acattcagtt tgtattcgat gctgtcaccg atgtcatcat agcaaacaac
     1561 ctgcgcggct gtggactgta ctaagatgga ttccaggccg gatcccgacg atgtcgacgt
     1621 ccgagtcgat attgatgacg atgacgatta tgtggagcag aatgggggcg ttacgaggga
     1681 acaccgtaac ggtattaaag agcagcgcgg agcacaacaa cccaccagca ttgatcaaaa
     1741 aaccaaacaa tttaggagca gatgatagaa ccaaccaaca aaccaaccgc aaaccacaca
     1801 gaaaacatag gacactgaac aagcaaagcc caaagaactt ttatttgttt aacaaaaaaa
     1861 cggcggacgg acggaaatcc cgaatggatg ttatagggaa aatgagcgac aagtacatta
     1921 cataatatcg ataatattga agcagatgca gatgcaaata cacacaatgc taatgatgat
     1981 cagggcgact atgactaaat gaggcagcag gcaactgaca ctgggacacg cgattaaagt
     2041 cacatctgaa aaaaggcagt tgattgaaag gcatttctat atacaaacat atacaaacac
     2101 atacatatgc attatgcaaa gccacatgta cgacatgaca ctaacacact cacacgacaa
     2161 acacaagcgc caacattgca tacagttgtt gtttggtctg aataattttt atagaatttc
     2221 ataatttatg tgtagtttag tttcctcatg tatttattaa aacaaaaacc aaacgagcgt
     2281 atatctacat ataccgcata tatatatata catacacttc tatacatata tatatatata
     2341 tatacatata tataaatatt atatattaaa tgtttcctgt tgcaatctct ctttaaaatt
     2401 attcatgcca tcaacgctct gcatttgtca tgcttgttta gacttaagtt cgaaagtttc
     2461 aacaaaatcc agcgtcaaag gaaatatcaa tattcatttg attgagtgtc agcgtgtggt
     2521 ctaaagtaaa tatataaaat aacaaaccaa aaaaaaaa
//
LOCUS       MTYCLCGA     6319 bp ss-RNA             VRL       26-MAY-1990
DEFINITION  Turnip yellow mosaic virus Club Lake isolate, complete genome.
ACCESSION   J04373
KEYWORDS    complete genome; nucleotide binding protein; replicase;
            virion protein.
SOURCE      Turnip yellow mosaic virus Club Lake isolate cDNA to viral RNA.
  ORGANISM  Turnip yellow mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Tymovirus.
REFERENCE   1  (bases 1 to 6319)
  AUTHORS   Keese,P., Mackenzie,A. and Gibbs,A.
  TITLE     Nucleotide sequence of the genome of an Australian isolate of
            turnip yellow mosaic tymovirus
  JOURNAL   Virology 172, 536-546 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Gibbs, 04-AUG-1989.
FEATURES       from  to/span     description
    pept         96     5630     replicase polyprotein
    pept         89     1975     Unknown protein
    pept       5645     6214     virion protein
BASE COUNT     1461 a   2426 c   1061 g   1371 t
ORIGIN      
        1 gtaatcaact accaattcca gctctctttt gacaactggt cttataccaa ctttccgtac
       61 acttgcaacc ctcgtaagac aattgcaaat gagtaatggc cttccaatta gcattggacg
      121 cccttgcacc cacgactcac agagatccct ctctgcatcc gattctcgaa tccacagtag
      181 attcgattcg ctcctcgata cagacctacc catggtccat tccgaaggaa cttctgcccc
      241 tactcaactc ctacggcatc ccaacatctg gtttgggaac atcccaccac ccccacgccg
      301 cccacaagac aatcgagact tttctccttt gcacccactg gtctttccag gccaccactc
      361 ccagctccgt catgttcatg aaacccagca agttcaacaa acttgcccag gtgaactcaa
      421 actttcggga attgaagaac taccgcctgc accccaacga cagcactcgt taccccttca
      481 catcaccaga ccttcccgtt ttccccacca ttttcatgca cgacgccctg atgtactacc
      541 atccctccca gatcatggac ctgttcttgc agaaaccaaa cctcgaacgt ctgtacgcca
      601 gcctcgtagt accacccgag gcccatcttt ccgaccaatc cttcttcccg aagttgtaca
      661 cgtacacgac gacccgccac actcttcact acgtcccgga aggtcacgaa gccggcagct
      721 acaaccaacc atccgacgcc cactcttggc tccgaatcaa ttcaattcgc ctcggcaacc
      781 accacctctc agtgacgatc ctggaatcct ggggccctgt ccactcgctc ctaattcaac
      841 gagggacccc cccccccgac ccatcactcc aggccccttc aacacccatg gcgtccgacc
      901 tctttcggtc ttaccaagag ccccgcctcg acgtggtctc cttccgaatc ccagacgcca
      961 tcgaacttcc acaggccaca ttccttcaac aaccgcttcg agaccgactg gtcccccgag
     1021 ccgtctacaa cgccctgttc acctacacca gagcggtccg cacactccgg acttcagacc
     1081 cagcggcatt cgtaaggatg cattcctcca aaccggacca cgattgggtc acctcgaacg
     1141 cctgggacaa tctgcagacc ttcgcacttc tgaacgtacc ccttcgacca aacgtcgtct
     1201 accacgtcct tcagagccca attgcctccc tagctcttta cctgaggcaa cattggcgcc
     1261 gtcttaccgc caccgccgtt cccatcctct ccttcctaac cctcctgcag cgcttcctcc
     1321 cattgcctat acctctggca gaggtaaaat ccatcacagc cttccgaagg gagctctacc
     1381 gaaagaaggc cccccaccac cccctcgacg tcttccatct ccagcaacac ctccgcaatc
     1441 accactccgc gatctcggcc gtacgcccag cttccccacc ccaccaaaga cttccacacg
     1501 cgctccagaa agctgcattg ctgctcctcc gaccgatatc gcccctcttg acagcgaccc
     1561 cgttctttcg gtccgaacag aagtccatgc tcccgaacgc cgaactttca tggaccctga
     1621 agcgcttcgc gctgccttgg caggcctccc tagtcctcct ctctctgtcg gaatcatccg
     1681 tactgcttca caaactgttc tccccaccaa ctctccaagc ccaacacgac acctaccacc
     1741 gacatcttca ccctggatcc tacagtctcc agtgggagag gacgccattg tcgattccga
     1801 ggacgacagc atttcttcct ttcactccca cgacttcaac agcccctccg gaccactccg
     1861 aagccagtct ccctcccgct ttcgcctcca cctccgttcc ccgtccacct ccagtggcat
     1921 cgagccttgg agcccagcct cctacgacta cggcagcgcc cccgacaccg attgaaccca
     1981 cccagcgcgc tcatcaaaat tctgacctca cgcttgaaag ttcaacccca attgaacccc
     2041 ccccaccccc catccaatcc tccgacatcc cgccttccgc ccccgttctt ttcccagaaa
     2101 tcaactcacc gcatcgtttt tcccccaaac ttcccaccac acccgatttc gaacccaccc
     2161 gcacttcacc ccctccttcc acttcgcatc aagattcgac tgaccccgcg gaccccctga
     2221 tgggctccca ccttctgcac cattcactac ctgcacctcc cacccacccg cttcaatctt
     2281 cacagctctt gcccgcacct ttgacaaacg accccaccgc gatcggcccg gtactcccct
     2341 ttgaagaact ccacccacgc aggtaccccg aaaacaccgc cactttcctc acgaggctcc
     2401 gttcacttcc ttcaaaccat ctaccacaac ccaccctgaa ttgtctcctc tctgctgtct
     2461 ccgaccaaac caaggtttcc gaggatcacc tctgggagtc cctacagaca attctcccag
     2521 acagccaact caggaacgaa gagatcaact ctctcgggct ttcaactgaa cacctcactg
     2581 cgttggccca tctttacaac ttccaggcaa ccatctactc cgatcgtggt cccatcctct
     2641 tcggcccatc cgacaccatt aagagaatcg acatcaccca caccaccgga ccgccatccc
     2701 acttttcacc cggcaaaaga cttttaggca gccaaccctc agctaagggc catccctccg
     2761 actcactcat cagagccatg aagtctttca aagtatccgg caactacctt cccttctctg
     2821 aggcccacaa ccatcccacc tccatctcac atgccaagaa cttggtttca aacatgaaga
     2881 atggattcga cggcatcctc tcccttctcg acgtctccac aggccaacga accggaccca
     2941 cccccaaaga cgcgatcatt cagatagacc actacctcga caccaacccc ggcaaaacca
     3001 cccctgtggt gcattttgct ggtttcgctg gctgtggaaa gacatatccg atccaacagc
     3061 tccttaaaac taaactgttc aaagactttc gggtctcctg ccccaccaca gaactcagaa
     3121 ccgaatggaa gactgcgatg gaacttcatg gctcccagtc atggcgcttt aacacttggg
     3181 agtcttccat tctcaagtca tccagaattc tggtcatcga tgaaatctac aaaatgccaa
     3241 gagggtacct cgacctttcc attctcgctg accccgccct cgaactcgtc ataattctcg
     3301 gtgatcctct ccagggcgag taccactctc aatccaaaga ctcatccaat caccgccttc
     3361 cctccgaaac tctcaggctg ctaccataca ttgacatgta ctgctggtgg agttatcgca
     3421 ttccccaatg tatcgcccga ctcttccaaa ttcacagctt caatgcctgg cagggaatca
     3481 tcggctccgt ttcaactccc caggatcaat cccccgttct caccaacagt catgcctcat
     3541 ctctcacctt caacagcctg ggatatcgct cctgcacgat cagctctagc caaggcctca
     3601 cattctgcga ccctgccatc atcgtcctgg acaactacac caagtggctc tcctcggcca
     3661 acggcctcgt cgccctcacc cgatccagat caggtgtcca attcatgggc ccctcttcct
     3721 atgtcggggg aaccaacggc tcttctgcca tgttttctga cgccttcaac aacagcctca
     3781 tcatcatgga tcgctacttc ccatccctgt tcccacaact caagctcatc acctcccccc
     3841 tcacaactcg cagccccaaa ctcaacgggg ccacccccag cgcatctccc acccatcgct
     3901 cgccaaactt ccacctcccc ccacacattc ccctctctta tgatcgtgat ttcgtcacgg
     3961 tcaacccaac tctccctgat cagggacccg aaacaagact cgacacccac ttcctcccac
     4021 cttctcggct cccgcttcat ttcgatctcc caccagctat cacccccccc ccgatttcca
     4081 caagcgtcga cccgccacaa gctaaagcta gccccgtcta tccaggcgag ttcttcgatt
     4141 ctctggcggc gttcttctta ccagcacacg acccatcaac aagggaagta ctccacaaag
     4201 atcaatctag caaccagttc ccttggttcg accgaccctt cagcttgtcc tgccagccct
     4261 caagtttaat ttctgccaag catgcaccca accacgatcc gacccttctg cctgcctcca
     4321 tcaataaacg cttgcgattc agacccagtg aagcaccgca ccaaatcacc gcagacgacg
     4381 tggtcctagg cctgcaactc ttccactctc tctgccgcgc ctactcacgt caacccaaca
     4441 tcaccgttcc attcaaccct gaacttttcg cagaatgtat ctctctgaat gaatacgcgc
     4501 agctcagttc caaaacccaa tccaccatag tggccaacgc ttcacgctcc gacccagact
     4561 ggcgacacac caccgtcaag atttttgcga aagctcaaca caaagtcaac gacggctcca
     4621 tcttcggttc atggaaggcc tgccaaactc tcgcactcat gcatgattac gtaattctgg
     4681 ttcttggacc cgtcaagaaa tatcaaagaa tcttcgacaa cgttgatcgg ccatctcaca
     4741 tctactcaca ctgcggcaag acacccaacc aacttcgaga ttggtgccag gaacatctca
     4801 ctcattccac cccaaaaatc gcaaacgact acaccgcctt cgaccaatcc cagcatggag
     4861 aatccgtggt tcttgaagcc ctcaaaatga agagactgaa cattccgagc catttgattc
     4921 agctccatgt ccacctcaag accaacgtct ccacccagtt cggccccctc acatgcatgc
     4981 gcctgaccgg ggaacccgga acctacgacg acaacactga ctacaacctc gcagtcatct
     5041 actctcagta tgacgttggt tcctgcccca tcatggtctc tggcgacgac tcactcatag
     5101 accaccctct tcccactcgc cacgactggc cctctgttct caaacgcctc cacctccgct
     5161 ttaaacttga actcacttct catcccctct tttgtggcta ctacgtcggt ccagcaggct
     5221 gcatccgcaa ccccttggcc cttttctgca agctcatgat cgcagtggac gatgacgccc
     5281 tcgacgaccg acgactcagc tacctcaccg agttcaccac cggacacctc cttggcgaat
     5341 cactatggca cctcctccct gaaacccacg tccagtatca gtcagcttgc tttgacttct
     5401 tctgcagacg ttgcccaaaa cacgagaaga tgctcctcga tgattccaca cccacactca
     5461 gcctcctcga acgaatcact tcttcaccga ggtggctcac caagaacgcc atgtacctcc
     5521 tccccgccaa gctcagactg gctatcacct ctctgtctca aacgcaatct ttcccagaat
     5581 ccattgaggt ttcccacgct gagtctgaat tgcttcacta tgtccaatag caatcagccc
     5641 cgacatggaa atcgacaaag aactcgcccc ccaagaccgc accgtcaccg tcgccaccgt
     5701 tttaccgact gtccccggcc cctcaccttt caccatcaaa caaccgttcc agtctgaagt
     5761 tctgtttgct gggaccaaag atgccgaggc ctctctcacc atcgccaaca tcgacagcgt
     5821 ttccaccctc accaccttct atcgtcatgc ctctctggaa tcactctggg tcaccatcca
     5881 tcctaccttg caagccccag ctttcccgac cacggttggc gtttgctggg tacccgccaa
     5941 ctccccagtc actcccaccc aaatcaccaa gacctacggc ggccagatct tctgcattgg
     6001 aggcgccatc aacactctct cacccctcat tgtcaagtgc ccacttgaaa tgatgaaccc
     6061 ccgggtcaaa gattcaattc aataccttga ctcgcccaaa ctcctcatct ccatcaccgc
     6121 tcaacccacc gctccccccg catcgacctg cataataact gtatcaggaa ctctctcgat
     6181 gcattctccg ctcatcacgg acacttccac ctaagttctc gatctttaaa atcgttagct
     6241 cgccagttag cgaggtctgt ccccacacga cagataatcg ggtgcaactc ccgccccttt
     6301 tccgagggtc atcggaacc
//
LOCUS       RATTH2BAA     181 bp ds-DNA             ROD       26-MAY-1990
DEFINITION  Rat TH2B gene promoter region.
ACCESSION   M33578
KEYWORDS    H2B histone; histone; transcription regulatory element.
SOURCE      Rat DNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 181)
  AUTHORS   Hwang,I., Lim,K. and Chae,C.-B.
  TITLE     Characterization of the S-phase-specific transcription regulatory
            elements in a DNA replication-independent testis-specific H2B
            (TH2B) histone gene
  JOURNAL   Mol. Cell. Biol. 10, 585-592 (1990)
  STANDARD  simpl staff_entry
FEATURES       from  to/span     description
    mRNA        161  >   181     H2B histone mRNA
    signal       61       68     octamer signal
    signal      110      115     hexamer signal
BASE COUNT       45 a     47 c     32 g     57 t
ORIGIN      
        1 acctgattgg ctgattggtg atgaattaac caatcagaaa gcaccacttg aattcccctt
       61 atttgcatac aaggaacatt tattgtccaa tcatctttcg cgtgctcata cgtcatccaa
      121 ggcccacgcc tataaatacc tctcttcttg gccttcaagc ggtgtgtttt ctcagcagtt
      181 g
//
LOCUS       TCVDIGAA      347 bp ss-RNA             VRL       26-MAY-1990
DEFINITION  Turnip crinkle virus defective interfering RNA.
ACCESSION   M29290
KEYWORDS    defective interfering RNA.
SOURCE      Turnip crinkle virus cDNA to RNA.
  ORGANISM  Turnip crinkle virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Tombusvirus.
REFERENCE   1  (bases 1 to 347)
  AUTHORS   Li,X.H., Heaton,L.A., Morris,T.J. and Simon,A.E.
  TITLE     turnip crinkle virus defective interfering rnas intensify viral
            symptoms and are generated de novo
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9173-9177 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            A.E.Simon, 20-OCT-1989.
FEATURES       from  to/span     description
    RNA           1      347     defective interfering RNA
BASE COUNT       90 a     98 c     86 g     73 t
ORIGIN      
        1 gggataaaaa aggaggctta ccaaccttct ctctattcac gatgcctctt ctacacacac
       61 tcaaaacagc gctcgcagtg ggactccttg gagccaggta ctaccccgaa ggttcaaaac
      121 caagaccccc aagtcgcttt actttgagat gtgttagaaa gccccaaggt cattttactt
      181 tgacctgtgt tagagaccca aaacggtggc agcactgtct agctgcgggc attagactgg
      241 aaaactagtg ctctctgggt aaccactaaa atcccgaaag ggtgggctag tggcgaccct
      301 ccgaactaaa agacagcctc cctcctcgcg gggggggggg cctgccc
//

GenBank-Updates@genbank.bio.net (05/30/90)

LOCUS       FSBISO        676 bp ss-mRNA            VRT       30-MAY-1990
DEFINITION  C.commersoni isotocin/neurophysin mRNA, complete cds.
ACCESSION   X16621 M25145
KEYWORDS    isotocin; neurophysin.
SOURCE      C.commersoni hypothalamic region, cDNA to mRNA, clones pCIT-1,
            M132.1 and M135.1.
  ORGANISM  Catostomus commersoni
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; 
            Actinopterygii; Cypriniformes; Cyprinoidei; Catostomidae.
REFERENCE   1  (bases 1 to 676)
  AUTHORS   Figueroa,J., Morley,S.D., Heierhorst,J., Krentler,C.,
            Lederis,K. and Richter,D.
  TITLE     Two isotocin genes are present in the white sucker Catostomus
            commersoni both lacking introns in their protein coding regions
  JOURNAL   EMBO J. 8, 2873-2877 (1989)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 676)
  AUTHORS   Heierhorst,J., Morley,S.D., Figueroa,J., Krentler,C.,
            Lederis,K. and Richter,D.
  TITLE     Vasotocin and isotocin precursors from the white sucker, Catostomus
            commersoni: Cloning and sequence analysis of the cDNAs
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 5242-5246 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.D.Morely, 31-MAY-1989.
FEATURES       from  to/span     description
    pept         43      507     isotocin-1 precursor
    sigp         43      102     isotocin-1 signal peptide
    matp        103      129     isotocin-1
    matp        139      504     neurophysin (put.)
    mRNA    <     1      676     isotocin mRNA
    signal      573      578     poly-A signal
    signal      657      662     poly-A signal
BASE COUNT      147 a    169 c    176 g    184 t
ORIGIN      
        1 ctgaatcgtt tcactacctg tcaatttcag cctttgcaaa atatgtctgg aagcatgttc
       61 tcagtgtttt ctctgctgta tctgctgtct gtctgctcag cctgttacat ctccaactgc
      121 cccatcggtg gtaaacgagc catacaggat tcaccctctc gacagtgtat gtcttgcggt
      181 ccgggggatc gcggccggtg ttttggtccc agtatctgct gtggtgaggg tctcggctgt
      241 ctgctgggct ctccggagac tcagcgatgt ctggaggagg actttctgcc ctccccgtgt
      301 gaggccggtg ggaaagtgtg cggatatgag ggccgctgtg ctgctccagg tgtctgctgt
      361 gactcagagg gctgcagcgt tgatcagtcg tgtgttgatg gtgacggtga cgctacagct
      421 gtcagtcaac ctgccagcag ccaagattta ctgctgaaac tactgcacct gtcaaacccc
      481 gcccacccat acagactcca ccaatgacat cagagtgaca tccgctcagg agaccaccag
      541 ccaatcagaa ttattcttga agaccttttg taaataaagt ttgtatagcc gatagacatg
      601 tacatggctc atacagatat tagtagaact gtattgatag tgtttactgt gcagctatta
      661 aagtcagaga gaacgt
//
LOCUS       HUMTCBYAL     416 bp ss-mRNA            PRI       30-MAY-1990
DEFINITION  Human T-cell receptor beta-chain mRNA V9-D-J2.3-C2 region, partial
            cds.
ACCESSION   M33240
KEYWORDS    T-cell receptor beta-chain; constant region; diversity exon;
            joining exon; pseudogene; variable region.
SOURCE      Human (individual VW) T-cell line VW114, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 416)
  AUTHORS   Hansen,T. and Lundin,K.
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Hansen, 22-MAR-1990.  Institute of Transplantation Immunology,
            The National Hospital, N-0027 Oslo 1 Norway
FEATURES       from  to/span     description
    pept.ps <     1  >   416     T-cell receptor beta-chain V-D-J-C region
                                 precursor /nomgen="TCRB" /map="7q35"
                                 /hgml_locus_uid="LP0151Q" (AA at 1)
    sigp.ps <     1       35     T-cell receptor beta-chain signal peptide
    matp.ps      36  >   416     T-cell receptor beta-chain
    recomb      317      318     V-region DNA end/D-region DNA start
    recomb      330      331     D-region DNA end/J-region DNA start
    recomb      375      376     J-region DNA end/C-region DNA start
BASE COUNT      121 a    105 c     87 g    103 t
ORIGIN      
        1 atgtggccct ctgcctcctg caagcagatc cactggacac agccgtttcc cagactccaa
       61 aatacctggt cacacagacg ggaaaaaagg agtctcttaa atgagaacaa aatctgggcc
      121 ataatgctat gtattggtat aaacaggact ctaagaaatt tctgaagaca atgtttatct
      181 acagtaacaa ggagccaatt ttaaatgaaa cagttccaaa tcgcttctca cctgactctc
      241 cagacaaagt tcatttaaat cttcacatca attccctgga gcttggtgac tctgctgtgt
      301 atttctgtgc cagcagccaa gggcaattgc acagatacgc agtattttgg cccaggcacc
      361 cggctgacag tgctcgagga cctgaaaaac gtgttcccac ccgaggtcgc tgtgtt
//
LOCUS       RATODCAB     2153 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Rat ornithine decarboxylase (ODC) mRNA, complete cds.
ACCESSION   J04791
KEYWORDS    ornithine decarboxylase.
SOURCE      Rat (strain Sprague-Dawley) testis, cDNA to mRNA, clone pODC2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2153)
  AUTHORS   Wen,L., Huang,J.-K. and Blackshear,P.J.
  TITLE     Rat ornithine decarboxylase gene: Nucleotide sequence, potential
            regulatory elements, and comparison to the mouse gene
  JOURNAL   J. Biol. Chem. 264, 9016-9021 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry for [1] kindly submitted by L.Wen, 28-MAY-1989, for
            release after publication.
FEATURES       from  to/span     description
    pept        160     1545     ornithine decarboxylase (EC 4.1.1.17)
    ORF          39       71     ORF
    signal     1855     1860     poly-A signal
BASE COUNT      541 a    460 c    565 g    587 t
ORIGIN      
        1 cggccgggtg cgccttgggg tttagtggcg gcttctccat gggtccagcc agccgcttcc
       61 ctgtgctgtg aggagacagc attcagagtt gaccttgtga gagctggcca taatttaatt
      121 ccatctctag gttttctgta ggcacatcga gaaccaacca tgggcagctt tactaaggaa
      181 gagtttgact gccatatcct cgatgaaggt ttcactgcta aggacattct ggaccaaaaa
      241 atcaatgaag tttcttcctc tgatgataag gatgctttct atgttgcgga cctcggagac
      301 gttctaaaga agcatctgag gtggctgaaa gctcttcccc gtgttactcc cttctatgct
      361 gtcaagtgta atgacagcag agccatagtg agcaccctgg ctgccattgg gacaggattt
      421 gattgtgcaa gcaagactga aatacagttg gtgcaggggc ttggggtgcc tccagagagg
      481 attatctatg caaatccttg taagcaagtg tctcagatca agtatgctgc cagtaatgga
      541 gtccagatga tgacttttga cagtgaaatt gagttgatga aagttgccag agcacatcca
      601 aaggcaaagt tggttttgcg gattgccact gatgattcca aagcagtttg tcggctcagt
      661 gttaagtttg gtgccacact gaaaaccagc aggcttctct tggaacgggc aaaagagcta
      721 aatattgatg tcattggtgt cagcttccat gtgggcagtg ggtgtactga ccctgagacc
      781 ttcgtgcagg cagtgtcaga tgcccgctgt gtctttgaca tgggaacaga agttggtttc
      841 agcatgtatc tgcttgacat tggtggtggc tttcctgggt ctgaagacac gaagcttaaa
      901 tttgaggaga tcaccagtgt aatcaaccca gctctggaca agtacttccc atcggactct
      961 ggagtgagaa tcatagctga gccaggcaga tactacgtcg catcagcttt cacacttgca
     1021 gtgaatatca ttgccaaaaa aaccgtgtgg aaggagcaga ccggctcgga cgatgaagat
     1081 gagtcaaacg agcaaacttt gatgtattac gtgaatgatg gagtgtatgg gtcatttaac
     1141 tgcattcttt atgaccatgc acatgtgaag gccctgctgc agaagagacc caagccagat
     1201 gagaagtatt actcatccag catctgggga ccaacatgtg atggccttga tcggatcgtc
     1261 gagcgctgta gcctgcctga aatgcatgtg ggtgattgga tgctgtttga gaacatgggt
     1321 gcatacactg ttgctgctgc ttctactttc aatgggttcc agaggccaaa catctactac
     1381 gtaatgtcac ggtcaatgtg gcaactcatg aagcaaatcc agagccatgg cttcccgcca
     1441 gaagtggagg agcaggatgt tggcactctg cccatgtctt gtgcccagga gagcgggatg
     1501 gaccgtcacc ctgcagcctg tgcttctgct agtatcaatg tatagatgcc attcttgtag
     1561 ctcttacctg caagtttagc ttgagttcac ggcatttggg gggaccattt aacttaatta
     1621 ctgctagttt ggaatgtctt tgtaagagta gggttggcac caatgcagta tggaaagact
     1681 aggagatggg ggtcacactt actgtgttcc tatggaaact ttgaatattt tatatggatt
     1741 tttattcact tttcagacct gatactaatg agtgcccctc ggctgctgag caagcatttg
     1801 tagcttgtac attggcagaa tgggctaaaa gcttatgttg tgacccattt tgaaaataaa
     1861 gtatcttgaa atgattggac attggagaat gtgtgcaagt atcccttaca gaaggcacaa
     1921 acttctgcac aggctgtgtg ttacagcagt gagtctagcc cagcagagat gtggatgata
     1981 caaagctgtg ccccctctgt acagcatcaa tgtgcttagc ccatctcaag tgtttactgt
     2041 gaacttggtg cccaaagtct cttaagagtg tcatctgcct agtggcctct tgacttggcc
     2101 acttcctaag gagagggcat ctgaggctct ttgaaccttg cctgcagaaa ccc
//
LOCUS       BPOORF2      3135 bp ds-DNA             BCT       30-MAY-1990
DEFINITION  B.polymyxa beta-1,4-endoglucanase gene, complete cds, and ORF2,
            complete cds.
ACCESSION   M33791 M33840
KEYWORDS    beta-1,4-endoglucanase.
SOURCE      B.polymyxa DNA.
  ORGANISM  Bacillus polymyxa
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 3135)
  AUTHORS   Baird,S.D., Johnson,D.A. and Seligy,V.L.
  TITLE     Molecular cloning, expression, and characterization of endo-beta-
            1,4-glucanase genes from Bacillus polymyxa and Bacillus circulans
  JOURNAL   J. Bacteriol. 172, 1576-1586 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1447     2640     beta-1,4-endoglucanase
    ORF        1072      509 (c) ORF2
BASE COUNT      848 a    689 c    676 g    922 t
ORIGIN      
        1 gaattcataa aaaacgtttt caaaaagatg aaaaggactt attcaaacat tcagttcagc
       61 tttaggaccg attctctgtg gttgatgacc ggattcatac ggatgatttc tgacagctcc
      121 tcatgatcct gttctttttg atcaattttc tccagcagta gcctggcact tcttttcccc
      181 atttcaaaac tcggctggga aacggttgtg atcggcggat tataaaaaga cgcaaaggat
      241 acatcatcta tgccaatcaa ggaaaagtca ttaggtatgg tcagatgatt attttttgca
      301 taaatcagca cctgctccaa aatcatgtca ttcgtagcaa taattgcagt gggcggattc
      361 tccatgtgaa atagttgtcc aatcagatta ggcatttcat ccaacctgcc gcttttgata
      421 tagttttcat tcacgggtat gtcatgttta cccagcgtat ttcggtaacc tgacagtcgc
      481 tcacttcgtg tggtgatcgc ttttttgcct aaggggaacg tgataatccc aatgttgtta
      541 tgaccatgct caatgagtgc ttcaacgcaa gctcgctggc cttttcatta ttcagtagca
      601 ccgtatctac gctgagttca tcgatttttc ggtccacaaa cacgaaggga tagccatttt
      661 tgacaaggga aacatataat ttcttgttct cttccgtcgg gaaaatgata agtccatcca
      721 cttgccgagc aatcatcgac tggacatact ttttctcttt gagcgaatca tcatcggtat
      781 tgcagacaat gacctggaca ttttccagtt ggcattcatc ttcaatggca cgcacgactt
      841 ctgttgtaaa tctcgataaa atggttgaag aaataacccc cacaatgaag gtcttcttct
      901 gcttgagact acgtgccacc tcattgggta tataactgag atcctcaatc gcttgctcga
      961 tcttcttctt ggtgttttca ctcatatagt tgtatctctt ttgcagaaac tgcgaaactg
     1021 tactcttcga caccccggca cgttctgcca cttccttcat cgttgttacc ataattaccc
     1081 tcctttcttt tttgcatgtc atgcgtgcta attttcaaat atatattctg caacctgtat
     1141 gtaaagtgaa tttcacttta ttatatacct tcttttacag caggaaaata taataaataa
     1201 agagttagag cgtgcctcta ctcatataca agcgcggcca aaaatatggg gtggaaactc
     1261 cctcacgaat cttttacaag gggaactttc ctacttagct tgataatccg ttgttataaa
     1321 tctattgatt tcgaacgaaa tcttagaaat atacgaattt tatgaacttg tttgatggaa
     1381 gatttgaaca taaaatgaaa atgtaatcgg ttacaaattc ttttaaataa aggaggaagc
     1441 agaattatga agaaaaaagg attaaaaaaa acatttttcg tcattgcctc cctcgtaatg
     1501 ggcttcacac tgtatggcta tacacccgtt tctgcagatg cagccagtgt gaaaggatat
     1561 tatcacaccc aaggaaacaa gattgtagac gaatccggga aagaagcggc atttaacggc
     1621 ctgaactggt tcggtctgga aactcctaat tacaccttgc atggactgtg gagccgctca
     1681 atggacgaca tgctggatca ggtgaagaaa gaaggctaca atctgattcg tctgccttac
     1741 agcaatcagt tgttcgattc cagttcccgt ccagacagta ttgattatca caaaaaccct
     1801 gatctggtcg gattaaaccc gattcaaatt atggacaagc tgatcgaaaa agctggacaa
     1861 cgcggtattc agattatcct tgaccgtcac cgtccaggct caggtgggca atccgagctg
     1921 tggtacacat cccagtaccc tgagtctcgc tggattagtg actggaaaat gttggctgat
     1981 cgttataaaa ataaccccac cgtcattggt gcggatttgc acaacgagcc acacggtcaa
     2041 gcaagctggg gtacaggcaa tgcctccaca gactggcgtc tggcggcaca acgtgcaggg
     2101 aatgcgattc tgtccgtgaa tccgaattgg ctgattctcg tagaaggtgt agaccacaat
     2161 gtacaaggca acaatagcca atactggtgg ggtggcaacc tgacaggtgt agccaactat
     2221 cctgtcgttc tggacgtacc gaaccgtgtc gtatattctc cacacgatta cggccccggt
     2281 gtgtcttcgc agccatggtt caacgacccg gccttcccgt ccaacctgcc agcgatctgg
     2341 gatcaaacct ggggctacat cagcaaacaa aacatagctc cggtgctggt tggtgaattc
     2401 ggcggccgta atgttgattt gtcctgccct gaggggaaat ggcaaaatgc gcttgttcac
     2461 tatattggtg ccaacaacct gtactttacg tactggtccc tgaatccgaa tagcggcgac
     2521 acaggcggtc tgctgctgga tgactggact acctggaatc gtccgaagca agatatgctg
     2581 ggtcgaatta tgaagcctgt tgtttccgta gcccagcaag cggaagcagc agccgaatag
     2641 gcacaggcct tcaacttcac ttaattaaaa aagccgattt tcctccttca ggccagatta
     2701 ctgctgggct atgaagatgt ggagaaacgg ctttttttgt tcttccagtt ttacacattg
     2761 taaattcccc attgaatggg tcattttgta aactaaatct ctctttcaac cgataaatac
     2821 atatgttgga ttttaattta gaaaggaagt attcatgtga ttatttggaa gggttttggt
     2881 attttaaata ttattattcc agggatttta tttgttattg tcggtagttt agtatccgct
     2941 ctagggcttg actccataga ttcccggctg ccaatggctt ttgtgttcat tgtgtcggga
     3001 gttattatct ggtatttggg gaaagcactc aactccgact caaagtgcta gttgatatgg
     3061 aaacgggaca acgctatcga atgggaaccc agcacagtct gttttttatc cctatgcatt
     3121 attggggccc tgcag
//
LOCUS       SHPMHDQB     3123 bp ds-DNA             MAM       30-MAY-1990
DEFINITION  Ovine MHC class II OLA-DQB beta-chain gene, partial cds.
ACCESSION   M33307
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Ovine (strain Merino) DNA, clone B10-1, library Ch.28.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 3123)
  AUTHORS   Scott,P.C. and Brandon,M.R.
  TITLE     Nucleotide sequence of ovine MHC class II beta genes: DQB and DRB
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.C.Scott 28-MAR-1990. Univ of Melbourne, Veterinary Preclinical
            Centre, Parkville, Victoria, AUSTRALIA 3052
FEATURES       from  to/span     description
    pept    /   204      473     MHC OLA-DQB beta-chain, exon 2 (AA at 3)
                876     1157     MHC OLA-DQB beta-chain, exon 3
               1656     1766     MHC OLA-DQB beta-chain, exon 4
               2232     2255     MHC OLA-DQB beta-chain, exon 5
               2697     2710     MHC OLA-DQB beta-chain, exon 6
    IVS     <     1      203     MHC OLA-DQB intron A
    IVS         474      875     MHC OLA-DQB intron B
    IVS        1158     1655     MHC OLA-DQB intron C
    IVS        1767     2231     MHC OLA-DQB intron D
    IVS        2256     2696     MHC OLA-DQB intron E
    signal     3014     3020     poly-A signal
BASE COUNT      635 a    773 c    851 g    707 t    157 others
ORIGIN      
        1 cgggaggccg gggacgagcc gctggaggac ctgggttcgc atgcacgaag gcgcagggcc
       61 gggctggaac cgaacagggg ccgagcggcg gcgggttgcc ggtttcggga cccgctggcg
      121 cgcggccggc acctccccct ctgccgagcg cgccccgtgg ggctgtgtgg gctgagcctg
      181 accgagcggc tgtctccccg cagaggattt cgtgtaccag tttataggcc actgttactt
      241 caccaacggg acggagcggg tgcggcatgt gaccagatac atctacaacc aggaggagtt
      301 cgtgcgcttc gacagcggac tgggacgagc accggcggtg accctgctgg ggcggccgga
      361 cgccgagtac tggaacagcc agaaggacat catggagcgg acgcgggccg aggtggacac
      421 ggtgtgcaga cacaactacc aggtggaagc cgccttcacc tggcagcggc gaggtgagtg
      481 ccggcgcccg ctccgcgggg ccgccctcca cccgccagga ctccgcaccg aaggactgag
      541 tctccggggt cggtccccag accacggatg ggacagagag gcgctgaggg acaggggccc
      601 aggggcacag cgtgagggcg gggcatcgaa tttggcaccc tggagccttc ctgcagnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tcccacctgg aatgtgctgt gtggtctcac
      841 gtctcactgt cttttcctgc ctgttcttcc ctcagtggaa cctacagtga ccgtctcccc
      901 atccaggact gaggctctaa accaccacaa cctgctggtc tgctcggtga cagatttcta
      961 tccaggccag atcaaggttc ggtggttccg gaatgaccgg gaggagacag ccggcgttgt
     1021 gtccacccct cttattagga acggggactg gaccttccag atcctcgtga tgctggaaat
     1081 gaccccccag cgaggagatg tgtacacctg ccgcgtggag caccccagcc tccagagccc
     1141 catctcggtg gaatggcgta agggcacttg gtttcctttc actgtgggcc caacaggtca
     1201 gggcagacag agcttcccca gtccatgccc tctcatccct tgtccccacc atcactactg
     1261 aactggaaat cacaggacac aagagtgctc atgcctccta gcacaggcat cagaagagcc
     1321 aaatcacatt gtcttttcac atacagggaa ctcactctac acatcatggc cccagagccc
     1381 agcctggtag ctctgtagaa ttgactggtg accacagtct taaggtctaa agttatagaa
     1441 gtgtccctga gagcagggat ccactttcac cttctctcac ctgcccactg tgtccaaagc
     1501 tctgttggca ggtcccttcc atgggggtgg tcagaatgga gagccaggtt cccctgacac
     1561 ctccacctcc tgtaccttag actagacctc aagcttccta aagggacacc atgagatgtg
     1621 gggacaaacg ctgatactca ggctctgctc cccaggggcg cagtctgaat ctgcccagag
     1681 taagatgctg agtggtgttg ggggcttcgt gctggggctg atcttcctca gtctgggcct
     1741 catcatccat cacaggagcc agaagggtaa ggaactctgg ggaaatggga aaatgggctg
     1801 tgattgagac cctctgttca gggcagtgtc tgctctgggt gtagttcttt ccacctgatc
     1861 ctgaaacagg gaggcggggc tggggggtgg gaggaaatgg acaacttagg gagacattgg
     1921 agtctgactt tactggtttg agagggtagc cctgccatgg agatggcagt ggaatttatt
     1981 ctaagactac cttaaagttc atcattgtct catgggctcc tttctgaaaa cttcctgcat
     2041 taagaggatc agagcactgg tttcctttcc ttctagtgac agcttcattc atttggggga
     2101 attttagctt agggcagtta aggcctgtgg tcaatggata aaaaagaaat acattttccc
     2161 ttgagtagct tatctcattt ccctttggat gagtaaggca gtggctgttt gtgtcacgag
     2221 agcttttgta ggacattgtg ggcccctacc agcaggtgct ctgtttctgt tctgtgtcag
     2281 tggggttgtg ggataaagga gggaagggtg tgaggtgagt gtacctgggt gcagtgtctc
     2341 gttcatgacc tgttccctac tatggaatca ggagttaggg aagaagtttc tgtaggaggt
     2401 tctgtaggaa gccataactt tgtggcatct ttctgtgaaa cttggagcca gagctctggt
     2461 ttgaaagata gataccagga tgttgcctac tttgtgccac atgttggtgg ctactgcctg
     2521 tgggcattta taagtgactg aatgtggtag aaagaaggtg agctatcatt aaatttacta
     2581 aaaaattgaa atcttcacat cccccagaag gacaatagcc gctcccctgc ttcccatgcc
     2641 tccttgccag gttgaagtgt gggtcctgtg tgctgatcac tctctctctt ctacagggct
     2701 cacgcgctga ctcctgagga tattttggga ttggtgtttg ctcttcttta atcatgtgcc
     2761 tgttccgccc ggaattcnnn cttctgagat cagagttggt caccaggtca tttcccgtgg
     2821 ccatccacca accaaggctc tggctgtgat gctgcttcct ccactgaccc tggagcctct
     2881 gactgtgcat tttcagctgc atctactcag accccaagag cttcctctgc tcccattttt
     2941 acccaacaga gtgtgcaaaa gaagcacatt gaaataattt accaggctgt agagtttttt
     3001 tccctttaat cataattaaa catgattatg aggttatctg tactctgaac ttccttaaat
     3061 gagtaggagg gtagggaacc actgcagaat gaagggaaac tattttgggg tgaaccagcc
     3121 aag
//
LOCUS       SHPMHDRB     2277 bp ds-DNA             MAM       30-MAY-1990
DEFINITION  Ovine MHC class II OLA-DRB beta-chain pseudogene, partial cds.
ACCESSION   M33306
KEYWORDS    cell surface glycoprotein; class II gene;
            major histocompatibility complex;
            pseudogene. pseudogene. integral membrane protein.
SOURCE      Ovine (strain Merino) DNA, clone C5-2, library CH.28.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 2277)
  AUTHORS   Scott,P.C. and Brandon,M.R.
  TITLE     Nucleotide sequence of ovine MHC class II beta genes: DQB and DRB
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.C.Scott, 28-MAR-1990. Univ of Melbourne, Veterinary
            Preclinical Centre, Parkville, Victoria, AUSTRALIA 3052
FEATURES       from  to/span     description
    pept.ps /   891     1169     MHC OLA-DRB beta-chain, exon 3
               1555     1665     MHC OLA-DRB beta-chain, exon 4
               1914     1937     MHC OLA-DRB beta-chain, exon 5
               2144     2155     MHC OLA-DRB beta-chain, exon 6 (alt.)
               2232  /  2243     MHC OLA-DRB beta-chain, exon 6' (alt.)
    IVS     <     1      890     MHC OLA-DRB intron B (no splice consensus)
    IVS        1170     1554     MHC OLA-DRB intron C (no splice consensus)
    IVS        1666     1913     MHC OLA-DRB intron D (no splice consensus)
    IVS        1938     2143     MHC OLA-DRB intron E (alt.; no splice
                                 consensus)
    IVS        1938     2231     MHC OLA-DRB intron E' (alt.; no splice
                                 consensus)
    IVS        2156  >  2277     MHC OLA-DRB intron F (alt.; no splice
                                 consensus)
    IVS        2244  >  2277     MHC OLA-DRB intron F' (alt.; no splice
                                 consensus)
BASE COUNT      485 a    437 c    447 g    513 t    395 others
ORIGIN      
        1 ccaacgtcgc taaaaaagag agactgtata aaatgatcag ttcagtaaaa agactgctta
       61 ttttgcaaag ggggcctcct tggtggctga tacagcaaag aatgtgcctg caatgtgggg
      121 gacccaggtt caaactctag gttgggaagt tcccctggag aagggaatgg ctaccctctc
      181 cagtattctt gtctggagaa ttccatggac agagctgctg tccagtgggt agcaaagttg
      241 gacacaactg agtgactaac actttcactt tcacttcatt ttgcaaagga gagaaattga
      301 tcagctgagt gtaattgcac gtgttctttt tttaaaaaat aaatgactat ttgctccttt
      361 ccttctcctt cagagctcnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      481 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngtcaaccct
      601 aaataccttg caaagattgg taaacacatc aggtcataat gaataattct tttgggctat
      661 gaaaaaacaa aacaaaatta atttcctcca catctctgaa atctaacagt cccgtattgt
      721 aagaattaaa gtctaagctc atggttctaa ctcctgttgg gtcacatggc cagttttcag
      781 agaggtctgc caagactgga gcatgggact agagactagg ggcagaaagg cagccaggga
      841 gatttactct gccttcctca ctgatttcct ccactttttc tctcccagtg gagcctacag
      901 tgactgtgta tcctgaaaag acccagcccc tgcagcacca cgacctcctg gtctgctctg
      961 tggatggttt ctatctgggc catgctgaag tcaggtggtt ctggaaaggc caggaagagg
     1021 aggctggggt gatctccaca ggcctgatcc agaacggaga ctagaacctc cagaccatgg
     1081 tgatgcttga aatggttcct cagagtggag aggtctacac ctgccaagtg gagcacccca
     1141 gccggacgag ccctctcaca gtggaacgag tgagaagctt ctgatcttgt gagttcctac
     1201 caccgtggag ggggcttgtt ttccctgagt gtcaggtgcc tctctccaca ccatgtgttc
     1261 atttgctcat gtttctcnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1321 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1381 nnnnnnnnnn nntgatcgac ttagcgtgat ttttcacact tctttctatg cagtggtcag
     1441 ttattttagt tcttccggct gccctccctc attcctgaag ctctcagtct cagagtctca
     1501 atcagcaaag ttgtatttca gataaaaatc acaaaaccat ggtttctttt ctcaggggca
     1561 cagtctgact ctgcttagag taagatgatg agtggaatag gggactttgt tctgggtttg
     1621 ctctttcttg tggtggggct gttcacccac ttcaggaatc agaaacgtga gagcctgttt
     1681 acagggggat cctcatannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1741 nnnnnnnnnn nnnnnnnnnn nnnntcgggt cgctctcttt ctgccctgga tcatctggtg
     1801 tggttgcagt gtcgagaaat ttcagatggg gctttgggtc tgaggacatt gaatctggta
     1861 acagaattgc cttcatatct ggtaaatctg tatatcttcc cttcagtttc ccaggacact
     1921 ctggacttca gccaacaggt catgcccttt aatcctcttt tagagataga tttggtctct
     1981 ggctcagtgg taaagaacta tcctgccaat gcagagacac aagagacatg ggtttcatcc
     2041 ctgagtcggg aatatctctt ggaataggaa atagcaacac actccagtat tcttgcctgg
     2101 aaaatttcat ggaccagagg agcctgggac ctacagtccc ctagggttga aaagagtcag
     2161 acatgattgt gtgagtgcat gggcacaagc gcaccctgta cacacacaca cacacacaca
     2221 cacacacaca cggtctccta gaattgatcg tagagatgac aagacaggag acagaca
//
LOCUS       ASGALDAA     3497 bp ds-DNA             PLN       30-MAY-1990
DEFINITION  A.niger aldehyde dehydrogenase (aldA) gene, complete cds.
ACCESSION   M32351
KEYWORDS    aldehyde dehydrogenase.
SOURCE      A.niger DNA.
  ORGANISM  Aspergillus niger
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 3497)
  AUTHORS   O'Connell,M.J. and Kelly,J.M.
  TITLE     Physical characterization of the aldehyde-dehydrogenase-encoding
            gene of Aspergillus niger
  JOURNAL   Gene 84, 173-180 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1187     1251     aldehyde dehydrogenase (aldA), exon 1
               1321     2522     aldehyde dehydrogenase, exon 2
               2580     2649     aldehyde dehydrogenase, exon 3
               2703     2859     aldehyde dehydrogenase, exon 4
    pre-msg    1158     2969     aldA mRNA and introns (alt., + 1 bp)
    pre-msg    1158     3016     aldA mRNA and introns (alt., +/- 1 bp)
    IVS        1252     1320     aldehyde dehydrogenase intron A
    IVS        2523     2579     aldehyde dehydrogenase intron B
    IVS        2650     2702     aldehyde dehydrogenase intron C
BASE COUNT      833 a    995 c    806 g    863 t
ORIGIN      
        1 agatctcact agtcagtggc taattggact atacagggat gtttgcaaag tgcgtgaact
       61 agcatctact aagtactaat ccccatcatg aaaggaacaa ggggactggc tgactaaaat
      121 gttacagctt gcaggtcaac aaactagctc aagagactag attccacttg gtccccgcca
      181 cttacaccga ggtggagggg ttccggaaat acgaaaagag cttgctccaa tcgctgtgag
      241 taatattccc tagtgagtta caaaggagat gactgataac tcactactaa gaggttgcct
      301 agctactagc ctcgtcttct cccggccggc gaaggtcctc cattcttctt cggatgatgt
      361 aatgaagaaa ccagatgtaa ctgactagta accatatcaa gtcaagacac gataaccatc
      421 cccttccact accactacca gtaccagtac cagtaccaca gggactcttc gatggatgaa
      481 tactagttag ttacacggta aaatggctga cacaggggga atctgggtgg tggtggtggt
      541 ggtggtggtg gccaggaaaa agagccacgg atagcgaaac cccgcacgat cgctgccgat
      601 tgttggatag agagtgggga acgggctggc atgtgggaat tcacctactt ttcaggttcc
      661 cttcatgccc tggtcactta cgctttaccg cgatacaata taatatgatg gttacttata
      721 ggggtctcgg gagtacgtag gttaactcgc ggtgggtttc gggctcaagg agcttggtct
      781 ctccaatcgg agcatctaga tgagggggtg gttatccgcg gagggattag aatgaggtaa
      841 atggcaatgg atgagacccg gaatttctcc catggatccg actagaccag tccaattgaa
      901 ctaaactaaa ctaaacccag tggacgagga tgatctccgc tggcgtttcc ccctcgcgcc
      961 tcgcaagctt cccctccctt caaccatcaa ccatcaaacc aacttctctt tcctctttct
     1021 ctctcttccc ccccccactt ccaccccacc actagtacaa agtaactaaa cttcctgtct
     1081 gctttattaa ttgatagctc tccccctctc cgctcttctt tcttcttctc ttccctctct
     1141 ctctttcttt ctcttctaac caacacctcc tctcccatat ctcatcatgt ctgacctctt
     1201 cgccaccatc acgaccccca acggcgtcaa atacgaacag cctctgggct tgtatgttat
     1261 ccccgcgtcc cgtcctctcg atatcgcgct tttggaaact aaccactgtt tactttacag
     1321 gttcattgac ggtgaattcg tcaagggcgc ggaaggcaag accttcgaga ctatcaaccc
     1381 cagcaacgag aagcccattg tcgccgttca cgaagccacg gagaaggatg tggatactgc
     1441 cgttgccgct gctcgcaagg ccttcgaggg ctcctggcgc caggtcactc cttccacccg
     1501 tggccgcatg ctgaccaagc tggccgacct tttcgagcgc gatgccgaga tcctggcttc
     1561 catcgaagct ctggacaatg gcaagtctat caccatggcc cacggtgata tcgctggcgc
     1621 cgccggttgc ctgcgctact atggtggctg ggccgacaag atccatggtc agaccatcga
     1681 caccaactcc gagaccttga actacactcg ccacgaaccc atcggtgtct gcggacagat
     1741 cattccctgg aacttcccct tgctgatgtg ggcctggaag attggtcctg ccatcgctac
     1801 cggtaacact gttgttatca agactgctga gcagactcct ctgtccggtc tctatgccgc
     1861 gaacgtgatc aaggaggctg gcatccccgc tggtgtggtt aacgtcatct ccggtttcgg
     1921 tcgtgtggcc ggttccgcca tctctcacca catggacatc gacaaggtcg ccttcactgg
     1981 ctccaccctt gttggccgta ctatcctcca ggccgccgcc aagagcaacc tcaagaaggt
     2041 gaccctcgag ctgggtggca agtcccccaa cattgtcttc aacgatgccg acattgacaa
     2101 cgctatctcc tgggccaact tcggtatctt ctacaaccac ggtcagtgct gctgtgctgg
     2161 ttctcgtatc ctggttcagg aaggcatcta cgacaagttc attgcccgtc tcaaggagcg
     2221 tgctctccag aacaaggtcg gtgacccctt cgccaaggac accttccaag gtccccaggt
     2281 ttcgcagctc cagttcgacc gcatcatgga atacatccag cacggtaagg acgccggtgc
     2341 taccgtggct gttggtggtg agcgtcacgg cacagagggt tacttcatcc agcctaccgt
     2401 cttcaccgac gtcacctccg acatgaagat caaccaggag gagatcttcg gccccgtcgt
     2461 cactgtccag aagttcaagg acgtcgagga cgctatcaag atcggcaaca gcacttctta
     2521 tggtgagttc tttttcgcca ctaaattgca gtgtgtcaag tgctaatctt tcgtcctagg
     2581 tctcgctgcc ggtatccaca ccaaggatgt caccaccgcc atccgtgtct ccaacgccct
     2641 ccgtgctggg taagctatac ttaacactac attcctgtat tcattcacta acatgcacat
     2701 agaaccgtct gggtcaacag ctacaacctg atccaatacc aggttccctt cggaggattc
     2761 aaggagtccg gtatcggtcg cgagctcggc tcgtacgccc tcgagaacta cacccagatc
     2821 aaggccgtgc actaccgtct gggcgacgct ctcttctaat ttaatatttg aaaccatgag
     2881 aaaatgggaa gagttcgctt cagcgacgag ctcgtggata ttgggttacc tttttcgcag
     2941 atagccatga ctggttcatg caaaacaatc aggatacgaa tttacgatga ttacaaattg
     3001 ttcttacatt gcagtaactg tagacctttg taaatataat ctgtctcaag cttcaacctc
     3061 cacataatct tggtactggc tataatcact ccctagtggt taactaagga cgaatcacca
     3121 ccccttccac tcgttcaatt gcaatcgacc tccatcaggc cattgcatat cgtacccatt
     3181 gttacacacc caaattcccg ggacaaccac cggcgagttt gagtagcctg ggtctagtct
     3241 cgggagaatt actacgccga gaagagagac cccctcgggt cgaagatggc ccgaagcgag
     3301 ggcattcatc agcgaataca caaggaccga atgattggtt atgtcaccga cgttgatgat
     3361 gatcagccat tcaaaaagac agctaatttc aatcttcagt acttcatagg ttgattgatt
     3421 acttgctggt gccgaaaata ggtattcggg aactataatc tcctatcttg tcagttctgg
     3481 tactagaatg catcgat
//
LOCUS       POABETA1     2293 bp ds-DNA             PLN       30-MAY-1990
DEFINITION  P.agilis beta-1 tubulin (beta-1-tub) gene, complete cds.
ACCESSION   M33372
KEYWORDS    beta-1 tubulin.
SOURCE      P.agilis (strain Aragoa) DNA.
  ORGANISM  Polytomella agilis
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Volvocales; Dunaliellaceae.
REFERENCE   1  (bases 1 to 2293)
  AUTHORS   Conner,T.W., Thompson,M.D. and Silflow,C.D.
  TITLE     Structure of the three beta-tubulin-encoding genes of the
            unicellular alga, Polytomella agilis
  JOURNAL   Gene 84, 345-358 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        233      250     beta-1 tubulin (beta-1-tub), exon 1
                327      353     beta-1 tubulin, exon 2
                432      555     beta-1 tubulin, exon 3
                624      848     beta-1 tubulin, exon 4
                974     1911     beta-1 tubulin, exon 5
    IVS         251      326     beta-1 tubulin intron A
    IVS         354      431     beta-1 tubulin intron B
    IVS         556      623     beta-1 tubulin intron C
    IVS         849      973     beta-1 tubulin intron D
BASE COUNT      524 a    578 c    464 g    727 t
ORIGIN      
        1 tttttttgta aggtttaaga ggtcatttgc ttcctaaatg gagtttaaaa aaattaaatg
       61 tgtaggtgct ggttgattat atgcttccgg ctcgtatgtt gtgtggaatg tgagcggata
      121 acaatttcac acaggaaaca gctatgacat gattacgaat tatatatatg tgccgtcctc
      181 ccaaaaaatt taaattgaac tacgcctgtt ttgtcaattt aaccggtagc aaatgcgtga
      241 aatcgttcat gtgagtaata tttatttgat atctgccact tactctcctt attctaatag
      301 catcttctcc ttttcctcac ctacagattc aaggtggcca gtgtggtaac caggtgtgtt
      361 tttcgccctc ttgcctgcta aatgttcgtt ttgactctat caacacctct aacatgcttc
      421 tctccctgta gattggtgcc aagttctggg aagtcgtttc tgacgagcat ggcattgacc
      481 ccaccggtac tgctcttggt gacagcgacc tccagcttga gcgcatcaac gtctacttca
      541 acgaggccac tggaagtaag taatttgagc ccctcctcaa tatttttttc agctccaatt
      601 aaaatctttt tttctcactc caggccgcta tgtcccccgt gccatcctca tggatcttga
      661 gcccggtacc atggattccg tccgctctgg tccctatggc cagattttcc gccctgacaa
      721 ctttgtcttc ggtcagactg gtgccggtaa caactgggct aagggccact atactgaggg
      781 tgctgagctc attgactccg tccttgatgt tgtccgcaag gaggctgagt cctgcgattg
      841 ccttcagggt aagaatttga tatcttttat ttctatgatt cttagattat tttttcaacc
      901 taacatagtt aatatttcaa attgtatctc actgttcatt tttataaaat taatttttat
      961 tatgtcttta aaggcttcca ggtctgccac tccctcggtg gtggtaccgg ctccggtatg
     1021 ggaaccctcc ttatttccaa gatccgtgag gagtaccctg accgcatgat gctcaccttc
     1081 tccgtcgtcc cctcccccaa ggtgtctgat accgtcgttg agccctacaa cgccaccctc
     1141 tccgttcatc agctcgttga gaacgccgat gagtgcatgg tcctcgataa cgaggctctc
     1201 tacgatatct gcttccgcac ccttaagctc accaccccca ccttcggtga ccttaaccat
     1261 cttatctccg ccgtcatgtc cggcatcacc tgctgcctcc gcttccccgg tcagctcaac
     1321 gctgatctcc gtaagcttgc cgtcaacctt atccccttcc cccgtctcca tttcttcatg
     1381 gttggtttca cccccctcac ctcccgtggc agccagcagt accgtgccct caccgtcccc
     1441 gagcttaccc agcagatgtg ggatgccaag aacatgatgt gcgccgctga tccccgccac
     1501 ggtcgttacc tcaccgcctc cgcccttttc cgtggtcgta tgtccaccaa ggaggtcgat
     1561 gagcagatgc ttaacgtcca gaacaagaac tcttcttact tcgttgagtg gatccccaac
     1621 aatgtcaagt cctccgtttg cgatatccct cctaagggtc ttaagatgtc tgctaccttc
     1681 attggtaaca gcactgctat ccaggagatg ttcaagcgtg tctctgagca gttcactgcc
     1741 atgttcaggc gcaaggcttt cttgcattgg tatactggtg agggtatgga tgagatggag
     1801 ttcactgagg ccgagtccaa catgaacgat ctcgtctccg agtaccagca gtaccaggat
     1861 gcctctgctg aagaggaggg cgagttcggt gaggaagagg aggagaacta aactatcaca
     1921 aactatttgt gaaactagcc gctgatttta aggtttatgt agaaggggga tacccacaat
     1981 ttttgcttta atgattcagt ttatgtttaa ctgtatgtat gtttttttat aatgaatagc
     2041 agtcactgtc cactctataa tgtaatttta ttttgttttc ttcactaata gcgtaatgcc
     2101 gagagcttgt aatgataaag gacgttaatt acgaagagta atatagatat aaatattttt
     2161 tacatgatgt ttataattat aataacaata aaatactttt tgactcttct aacgttaatg
     2221 ttattattct ctatcctcgg tcacgtttac gtttttttta ccctttgttt tatttcctgc
     2281 cgcatcttta taa
//
LOCUS       POABETA2     2121 bp ds-DNA             PLN       30-MAY-1990
DEFINITION  P.agilis beta-2 tubulin (beta-2-tub) gene, complete cds.
ACCESSION   M33373
KEYWORDS    beta-2 tubulin.
SOURCE      P.agilis (strain Aragoa) DNA.
  ORGANISM  Polytomella agilis
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Volvocales; Dunaliellaceae.
REFERENCE   1  (bases 1 to 2121)
  AUTHORS   Conner,T.W., Thompson,M.D. and Silflow,C.D.
  TITLE     Structure of the three beta-tubulin-encoding genes of the
            unicellular alga, Polytomella agilis
  JOURNAL   Gene 84, 345-358 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        233      277     beta-2 tubulin (beta-2-tub), exon 1
                377      500     beta-2 tubulin, exon 2
                565      789     beta-2 tubulin, exon 3
                910     1847     beta-2 tubulin, exon 4
    IVS         278      376     beta-2 tubulin intron A
    IVS         501      564     beta-1 tubulin intron B
    IVS         790      909     beta-1 tubulin intron C
BASE COUNT      479 a    530 c    439 g    673 t
ORIGIN      
        1 tttttttgta aggtttaaga ggtcatttgc aagatcagag ctagcgtatc cttatgaatc
       61 gctcctacac atcttatagt gtggttacga aaatcaaatt tggagtagtt tggatcccat
      121 ggatatttac tataaactag gtgtccaaac caaatatttc aaaagctatt ttttgaagct
      181 ccgattgcag agacttcatt ttccttcata atatattagt agaaaaacaa ccatgcgtga
      241 aattgttcat attcagggtg gccagtgcgg taaccaggta aattattaat cgttttttat
      301 tctttgcact tattgcacag tcctcttaag ccctacgtct aattcttttc cttttttact
      361 attcaacctc ctgtagattg gtgccaagtt ttgggaggtc gtctctgacg agcacggcat
      421 tgaccccacc ggtactgcaa ttggtgacag cgatcttcag ctcgagcgta tcaacgttta
      481 ctttaacgaa gccaccggca gtatgtatac ccgtattctt aagaatccaa tattttttcc
      541 tgatgtttta cttgatatat acaggccgtt atgtcccccg tgccatcctc atggatcttg
      601 agcctggtac tatggactcc gttcgttctg gtccctatgg tcagatcttc cgtcctgaca
      661 acttcgtctt tggccagact ggtgccggta acaactgggc taagggtcat tataccgagg
      721 gtgccgagct catcgactcc gtccttgatg ttgtccgcaa ggaggccgag tcttgcgact
      781 gtctccaagg tagatttctt tttgcatttt tctttttcaa atttgcgtat taattcttca
      841 acctattaaa ctttgacttt tccgatttgc tgaaatattt tttctcaatg ttcctttttt
      901 ttgtttcagg tttccaggtt tgtcactctc tcggtggtgg tactggctct ggtatgggaa
      961 ctctccttat ttccaagatc cgtgaggagt accctgaccg tatgatgctc accttctccg
     1021 tcgtcccctc ccccaaggtg tctgataccg tcgttgagcc ctacaacgcc accctctccg
     1081 ttcatcagct cgtcgagaac gccgatgagt gcatggtcct cgataacgag gctctctacg
     1141 atatctgctt ccgcaccctt aagctcacca cccccacctt cggtgacctt aaccacctta
     1201 tctccgccgt catgtccggc atcacctgct gcctccgctt ccccggtcag ctcaacgctg
     1261 atctccgtaa gcttgctgtc aaccttatcc ccttcccccg tctccacttc ttcatggttg
     1321 gtttcacccc cctcacctcc cgtggcagcc agcagtaccg tgctctcacc gtccccgagc
     1381 ttacccagca gatgtgggat gccaagaaca tgatgtgcgc cgctgatccc cgccacggtc
     1441 gttacctcac tgcctccgcc cttttccgtg gtcgtatgtc caccaaggag gtcgatgagc
     1501 agatgcttaa cgtccagaac aagaactctt cttacttcgt tgagtggatc cccaacaacg
     1561 tcaagtcctc cgtttgcgat atccctccta agggtcttaa gatgtctgct accttcattg
     1621 gtaacagcac tgccatccag gagatgttca agcgtgtctc tgagcagttc actgccatgt
     1681 tcaggcgcaa ggctttcttg cactggtaca ctggtgaggg tatggatgaa atggagttca
     1741 ctgaggccga gtccaacatg aacgatctcg tctccgagta ccagcagtac caggatgcct
     1801 ctgctgagga agagggcgag ttcggtgagg aagaagagga gaactaagcg ctcttttttt
     1861 gtgagaatta attgaaacgc tcgatatatt tttacaaaga attagaggag tgataggtta
     1921 aagcctgatc ttaatttttc tttgctgaat atgtattgat gtgttgtagt attagtcatg
     1981 tgagtatgtt tgtaatatag taaaccttta tgcatctttt cgtttttata tgtagcagga
     2041 gttaaaaact aaacttttga attttccaaa taacttttca tccttaataa attgtaatat
     2101 tggtatatgc tttatagcca c
//
LOCUS       POABETA3     2202 bp ds-DNA             PLN       30-MAY-1990
DEFINITION  P.agilis beta-3 tubulin (beta-3-tub) gene, complete cds.
ACCESSION   M33371
KEYWORDS    beta-3 tubulin protein.
SOURCE      P.agilis (strain Aragoa) DNA.
  ORGANISM  Polytomella agilis
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Volvocales; Dunaliellaceae.
REFERENCE   1  (bases 1 to 2202)
  AUTHORS   Conner,T.W., Thompson,M.D. and Silflow,C.D.
  TITLE     Structure of the three beta-tubulin-encoding genes of the
            unicellular alga, Polytomella agilis
  JOURNAL   Gene 84, 345-358 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        233      250     beta-3 tubulin (beta-3-tub), exon 1
                325      351     beta-3 tubulin, exon 2
                429      552     beta-3 tubulin, exon 3
                620      844     beta-3 tubulin, exon 4
                984     1921     beta-3 tubulin, exon 5
    IVS         251      324     beta-3 tubulin intron A
    IVS         352      428     beta-3 tubulin intron B
    IVS         553      619     beta-3 tubulin intron C
    IVS         845      983     beta-3 tubulin intron D
BASE COUNT      494 a    552 c    464 g    692 t
ORIGIN      
        1 tttttttgta aggtttaaga ggtcatttgc ttcctaaatg gagtttaaaa aaattaaatg
       61 tgtaggtgct ggttgattat tacttttatg gatcaatact caaaactcga aagtgtatca
      121 aattcataac gacactaaaa attcgagaat tgtcgtacaa cctatggcct tatataaatc
      181 atgctcccct tttaaaaatc aacttgttta gtttttagcc aaataaataa ccatgcgtga
      241 aatagttcat gtaagttgca attgttgttg attttgccgt tttgttcata ttcttattct
      301 ttttgctgtt ttctgcctcc acagattcaa ggtggccagt gcggtaacca ggtggcttct
      361 gccagttttg gctccctttc ctcagctttt gaatacatga gcctttctaa cttgtcttct
      421 tactgcagat cggtgccaag ttctgggagg tcgtctctga tgagcacggc atcgacccca
      481 ccggtaccgc ccttggtgac agcgacctcc agcttgagcg catcaacgtc tacttcaatg
      541 aggccactgg aagtaagtaa tcagcgtgtt gccaatattt tttggggctt ggatttacta
      601 tttttttact tctcttcagg ccgttatgtc ccccgtgcta tcctcatgga tcttgagccc
      661 ggtaccatgg attccgtccg ctctggtccc tatggtcaga ttttccgccc tgacaacttt
      721 gtcttcggcc agactggtgc cggtaacaac tgggctaagg gtcactacac tgagggtgct
      781 gagctcattg actctgtcct tgatgttgtc cgcaaggagg ctgagtcctg cgattgcctt
      841 cagggtaaga ttatttttat tttccacttt tatgttcctt ataatagggt ggggcattaa
      901 ttttgtatta ttcagcacat ctctcgtata tttaaatata taaatatatt tcttatttaa
      961 aatttttaaa taatatgtta caggcttcca ggtctgccac tctctcggtg gtggtaccgg
     1021 ttccggtatg ggaaccctcc ttatttccaa gatccgtgag gagtaccctg accgcatgat
     1081 gctcaccttc tccgtcgtcc cctcccccaa ggtgtctgat accgtcgttg agccctacaa
     1141 cgccaccctc tccgttcatc agctcgttga gaacgccgat gagtgcatgg tcctcgataa
     1201 cgaggctctc tacgatatct gcttccgcac ccttaagctc accaccccca ccttcggtga
     1261 ccttaaccat cttatctccg ccgtcatgtc cggcatcacc tgctgcctcc gcttccccgg
     1321 tcagctcaac gctgatctcc gtaagcttgc cgtcaacctt atccccttcc cccgtctcca
     1381 tttcttcatg gttggtttca cccccctcac ctcccgtggc agccagcagt accgtgccct
     1441 caccgtcccc gagcttaccc agcagatgtg ggatgccaag aacatgatgt gcgctgctga
     1501 tccccgtcac ggtcgttacc tcactgcctc cgcccttttc cgtggtcgta tgtccaccaa
     1561 ggaggtcgat gagcagatgc ttaacgtcca gaacaagaac tcttcttact tcgttgagtg
     1621 gatccccaac aatgtcaagt cctccgtttg cgatatccct cctaagggtc ttaagatgtc
     1681 tgctaccttc attggtaaca gcactgccat ccaggagatg ttcaagcgtg tctctgagca
     1741 gttcactgcc atgttcaggc gcaaggcttt cttgcattgg tacactggtg agggtatgga
     1801 tgagatggag ttcactgagg ccgagtccaa catgaacgat ctcgtctccg agtaccagca
     1861 gtaccaggat gcctctgctg aagaggaagg cgagttcggt gaggaggagg aggagaacta
     1921 aagaactcaa ttgcatttca tgttttaaat tcaggaatga tctagagata ctttgctttt
     1981 ttttttcttg cttgtagcac ctctcgtcgc gattttgaaa gcattatcaa tacatcctag
     2041 tcagctatgt tcggtggatg atataatgaa gcttttgcac gacaatattt gtgtgctgaa
     2101 ccgcttatga ctgaattaag gttatagata atttcagtgt aaattattag tgctccgcta
     2161 ataccaaatc cattttttat tttatgtcca tgaagttata ta
//
LOCUS       CELDAF1      5899 bp ds-DNA             INV       30-MAY-1990
DEFINITION  C.elegans serine/threonine protein kinase (daf-1) gene, complete
            cds.
ACCESSION   M32877
KEYWORDS    serine protein kinase; threonine protein kinase;
            transmembrane protein.
SOURCE      C.elegans (strain N2) DNA, clones DR#[168,169], and cDNA to mRNA,
            clones DR#[113,114,118].
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 5899)
  AUTHORS   Georgi,L.L., Albert,P.S. and Riddle,D.L.
  TITLE     daf-1, a Caenorhabditis elegans gene controlling dauer larva
            development, encodes a novel receptor protein kinase
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_review
COMMENT     Authorin submission for [1] kindly submitted by L.L.Georgi,
            16-MAR-1990.
FEATURES       from  to/span     description
    pept        268      313     serine/threonine protein kinase precursor 1
               1693     1779     serine/threonine protein kinase precursor 2
               2112     2239     serine/threonine protein kinase precursor 3
               2972     3190     serine/threonine protein kinase precursor 4
               3243     3999     serine/threonine protein kinase precursor 5
               4182     4345     serine/threonine protein kinase precursor 6
               4473     4739     serine/threonine protein kinase precursor 7
               4785     5058     serine/threonine protein kinase precursor 8
               5371     5438     serine/threonine protein kinase precursor 9
    sigp        268      312     serine/threonine protein kinase signal peptide
    matp        313      313     serine/threonine protein kinase
               1693     1779     serine/threonine protein kinase
               2112     2239     serine/threonine protein kinase
               2972     3190     serine/threonine protein kinase
               3443     3999     serine/threonine protein kinase
               4182     4345     serine/threonine protein kinase
               4473     4739     serine/threonine protein kinase
               4785     5058     serine/threonine protein kinase
               5371     5435     serine/threonine protein kinase
    pre-msg     233     5899     daf-1 mRNA and introns
    IVS         314     1692     daf-1 intron A
    IVS        1780     2111     daf-1 intron B
    IVS        2240     2971     daf-1 intron C
    IVS        3191     3242     daf-1 intron D
    IVS        4000     4181     daf-1 intron E
    IVS        4346     4472     daf-1 intron F
    IVS        4740     4784     daf-1 intron G
    IVS        5059     5370     daf-1 intron H
    rpt        2302     2832     direct repeat
    rpt        2520     2634     direct repeat
BASE COUNT     1759 a   1194 c   1121 g   1825 t
ORIGIN      Map position 29.2-29.4.
        1 cttcttctct attcaattcc attcatagtg gactaggtga gccgctcgtc agtatctgct
       61 cactgtctgc gtctctgctt ggctcttctc tttctctcca tgtgctccgg ctccccctcc
      121 tcacactcaa ccacacacac acacacacgt atacatatct tcttatagtt ttgttgcatg
      181 cttcatctct tttctcatct tttgccttct atgtactttg ctttttttcc agatctcctg
      241 atgaactaaa ctcttgggaa ataaattatg aggatacggc acgtggtgtt ctgcctgctc
      301 gcactggttt atggtgagta ctttgaggtt ttggtgtttg atagcaaaag tcatcaatct
      361 atttttcggg ttaattgggt taatataggt attattatgc agctagatga tgatttggtt
      421 ctatctaaac aaaaaagtgc aattaaaatc gaattctcac tggtttacat aaatttttag
      481 ttataatcac aaatttaata gtttacgctc tatttttttt tttgacaaac tagatatagt
      541 taaacatatt tgaaatttga tccaatatcc aatataagtg aaatcaagtt aaggttgtta
      601 agtttgatat atgtagtagt aaatatatag aaatgttttt catatcaggt tcagccaatc
      661 atattcgagt tgcgtagtgc ttatgcagtt gataggtttt ttggtataat ctagttatgt
      721 tgtaactcta aaaaaatttt ctaattatca tcacgaatat gtatatcgtt taagaagtta
      781 gaaaaaaaca ctgattattt ttttattagc tcaaaaagta tgtattagtc ccatttagct
      841 ggcttttaga aattcgtgtt taattttgat tagtttgact atggaaattg tttgagccca
      901 aagttaagca atttatgtta ttgcagatta gctgcagact ttttctatta gaatagaatc
      961 ttattacata gttattattg cctaatttca gctatacaaa atgcgactta tcaacttatc
     1021 attttagttg tagttgatcg cagttgaaga aaattgaaat gttattatga tcttaaaaaa
     1081 atacatatca ctttaattat gagtgacttg aaattgctat tttttattgc aattcatagt
     1141 tgatatatgt tcaaaaaagt caacaagtta actgatttgc tgatagtgaa tattaaattt
     1201 agcgcaatga gcaattgatg caattaaatg ttagttgcag ttttattctg ttagtgggta
     1261 tttagaaaat agctaggtgc aatttaaact aaattttcaa caatttagga atttaaactt
     1321 tactttttag tcgcatcact tatctaggca ccccactttt taactttaaa agcgcctaac
     1381 aataagtatc taagttgtcc tgtattttat tttaaaattg ccaatttgtt aattttttct
     1441 ctatataatt aattatctat tgatatttac aaaatgtgcc gagtcctgcc cttccaaagg
     1501 aaactagtgg ggtgcaatac caatagaaga gtacaaaatt tacagaagtc caaaataata
     1561 tgcatcacgg ttgagagctt tttaaatttg ttatcagtat ttaggccttg gctcaaacac
     1621 atcataaatt ggtacctgtg caagaaaatt aaccaaatac agctttctac gacccttttc
     1681 aattttttcc aggcgccgaa acttccgacg atgatctcga cgaacggaca aatatcttca
     1741 tccgtgataa actgattccg gcgttgaagc ttgcagaagg tagtttgttt ttttggactc
     1801 aacagaaaat aatcgccggc actcaaatta gatggcattt atcgtcagaa gaagctttgg
     1861 gcaaaatccc ccggtagcaa agcaaaataa tttcgttcga tcaaacatta aaagagattt
     1921 tcaactatat atataatcag tgataagcct ataatcttta gtgtttcccc cgaaagcact
     1981 aatctatggg aaatttcctt ttcgggggac gtgagtttat gcagaaaaaa aaattgaagt
     2041 ttagtttatt tttggaagga tttttaatct tgtgataagg gatttaacaa gattttctga
     2101 ttttcttgca gtgaccaagg tgaatttcac acgattacac ttatgccatt gctcaaggga
     2161 agttggatgc aacgcgagga ctaccggatg ggtgcctgga atcgagttcc ttaatgagac
     2221 ggatcgcagt ttttatgagg ttaggcttta accatgtggt cgagtttttc cttgcccgca
     2281 atagatcctt attgaccaaa actacagtaa tcctacagta cctcaaccat cttcctctat
     2341 tagacctgtc accaaagtat ttgacaaagc aaaaaaacaa gaataattta attactcgat
     2401 aactccaaaa aacaacaaaa aattgtattt tggtttttta aatttcttta tctgattgtg
     2461 aatattttcg ttttccccgc gatttcttta aaagatcaaa agactaagac taagataatc
     2521 ttacagtacc ccgaccctat ccctccacta accttaatct cttcaaaaaa caatgagcat
     2581 gttttctcct aagctacagt aatcctacag tacccctaca gtaccccagt actactacag
     2641 taccccgacc ctatccctct actaatctta atctcttcaa aaaacaatga acatgttttc
     2701 ttctaagcta cagtactcct acagtaccca tacagtaccc ttacaacact actacagtac
     2761 ccgaccctat ccctctacta accttaatct cttcaaaaaa caaaaaaaac atgtttctcc
     2821 tatactacag tactcttaca gtaccccaga ggtgtcctta taagtttgac tagtttttgt
     2881 catttggcag taaaactacg gttcttctaa aatttcggtt acttggttac taggtctcaa
     2941 ttctcacact agaccaatcc tctctttcca gaacacgtgc tacacggacg gaagctgcta
     3001 ccagagcgct cggccgtcac ccgaaatctc acatttcgga tgcatggacg agaaatctgt
     3061 gaccgacgag accgagtttc acgatacggc agcgaaagtg tgcacgaaca acacgaaaga
     3121 cccgcacgct accgtctgga tttgctgtga taagggaaat ttttgtgcga acgagacgat
     3181 catccacttg gtaagtgtga tctctttttt gacatccgga ttttgaattt tcaaattttc
     3241 aggcgccagg cccgcagcag tcatccacgt ggctgattct tactatcctc gccctgctca
     3301 cgttcatcgt gctcctgggc attgcaattt tcctcaccag aaaatcatgg gaggcgaaat
     3361 tcgactggta tataagattc aagccaaaac ctggagatcc tctacgcgag acggagaaca
     3421 atgtaccgat ggttacaatg ggcgacggtg ctggaagttc tgtgccagaa gttgctccaa
     3481 tcgagcagca gggaagcaca atgagcacat ctgcagggaa ctcgttccct cctggaatca
     3541 tgcctaataa catgaaggat atgcttgatg tactcgagga gacaagtgga tctggtatgg
     3601 gtccaactac acttcacaaa ctcacgattg gtggacagat caggttaact ggtagagttg
     3661 gatccggaag atttggtaac gtgtcacgtg gagactaccg aggtgaagcg gtcgctgtga
     3721 aggttttcaa tgcgctcgac gagccggcgt tccacaagga gacggagatc tttgagacgc
     3781 ggatgttgag gcatccgaat gtgctgagat atattggaag tgatcgggtt gatactggat
     3841 ttgtaacaga actttggctg gtgacagagt atcatcccag cgggtccttg cacgattttc
     3901 tgctcgagaa tacggttaac attgagacgt attataattt gatgaggtcg acggcaagtg
     3961 gacttgcgtt cctgcacaat cagattggag ggagcaaagg taagtgcaat atacatgtgt
     4021 ctcagaaatc taggagccta gactaggccc ctgagaagaa aaatttggac caacggtgtc
     4081 tgaaactttt ctattcagag agtagtctag gacgccaact gagtccgctg gccagacgta
     4141 ctctgtcgtt gaacatatat taatcaaaat acaaattcca gagtccaaca agccagcgat
     4201 ggctcaccgg gatatcaaga gcaagaacat catggtcaag aacgatctga cgtgtgccat
     4261 cggagatctc ggtctctcgt tatccaagcc agaagacgcg gcatcagata ttattgccaa
     4321 tgagaactac aagtgcggta ctgttgtgag ttttgggatc taggccattt tagtcagaga
     4381 gctcaaggct cgagaccctg ccaagacgga ggttttggcg taaagtctag ctataatctt
     4441 ttttcaaaat tttcacaaat taaatttttt agagatacct tgcaccggaa atcctgaaca
     4501 gtacgatgca gttcaccgtc ttcgagagct accaatgtgc cgatgtctac tcattttctc
     4561 tagtcatgtg ggaaacccta tgcagatgtg aagatggaga tgttctgccg agagaagcgg
     4621 ccacagtgat cccatatatt gagtggaccg atcgggatcc acaagacgct cagatgtttg
     4681 atgtggtttg cacgcggagg ctcagaccta cagagaatcc tctgtggaag gatcatccgg
     4741 taagataata aactttatgt aaagtacttc taactacttt tcaggaaatg aaacacatta
     4801 tggagattat caaaacgtgc tggaatggca atccgagcgc ccgtttcacc tcctatattt
     4861 gccgcaaacg aatggatgaa cggcagcaac tgttgctgga caagaaggca aaagctgtgg
     4921 cccagacggc aggggtgaca gttcaggatc ggaagatttt gggtccccag aaaccgaaag
     4981 acgagtcacc ggcaaatggt gctccgcgga tagttcagaa ggagattgat cgagaagatg
     5041 agcaggaaaa ttggcggggt aagaattttg atcaggctcc tattgtggac cgagctacta
     5101 agcccggacc aaaaaaaaac caacattttg ggtatcatgt ctatctggtg ccttaaatac
     5161 cctacgaatt tagaaacttt tctcaaaagc ttccaactaa acaaattatc tttaaccttg
     5221 aaggtcagat ctacctacta cctaatacta tagtagcgga cacttaataa gtccaggtcc
     5281 ctaacctagc atagtttata tgcccgccac cgtagaattt tgcgccttta aagtgaagaa
     5341 cacttttcaa cataaattaa tttttttcag aaactgcaaa aactcccaat ggacacattt
     5401 ccagtaatga cgactcgtca cgcccacttc ttggttagag cccattttat cttcgttctt
     5461 aggcgccaat atatcaaaat tattataatt attatgttct atatactaca cttatcacct
     5521 ctcgtgactt cctcccaccc ccatatcgat gatctcattt ctcttcctca accactaact
     5581 gtctgctacc tatttcttgt ttattagcgg gtgcattatg cagctaattg tccggcgccc
     5641 catcattcta aaagagactc ctcaccgtgt acctcagaaa aacataaaaa tcttgtgata
     5701 ggtatgtact gcataagctt tattaccctc ccccgaaact ttgtagtttg tagtttaccc
     5761 tgtttttttt ctgtaccatg atttcaagtg tttttgaaat tttattttgt ttttattgtt
     5821 attattatta ctattttcaa aaaaggagaa aacacatttc aatgcaaaaa ctgaaaatga
     5881 aaataaaaaa tttgatttt
//
LOCUS       MUSIGHAAI     318 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 3-14-9.
ACCESSION   M32760
KEYWORDS    diversity exon; immunoglobulin heavy chain; processed gene;
            variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 3-14-9, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   318     Ig H-chain V-D-region (AA at 1)
    recomb      294      295     V-region DNA end/D-region DNA start
BASE COUNT       92 a     69 c     86 g     71 t
ORIGIN      
        1 gaggtgaagc ttctcgagtc tggaggtggc ctggtgcagc ctggaggatc cctgaatctc
       61 tcctgtgcag cctcaggatt cgattttagt agatactgga tgagttgggc tcggcaggct
      121 ctagggaaag ggcaggaatg gatcggagaa atcaatctag gaagcagtac gataaactat
      181 acgccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac
      241 ctgcaaatga gcaaagtgag atctgaggac acagcccttt attactgtgc aagactactg
      301 gccaaggcta tggactac
//
LOCUS       MUSIGHAAJ     318 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 1-5-1.
ACCESSION   M32761
KEYWORDS    diversity exon; immunoglobulin heavy chain; processed gene;
            variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 1-5-1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   318     Ig H-chain V-D-region (AA at 1)
    recomb      294      295     V-region DNA end/D-region DNA start
BASE COUNT       93 a     67 c     82 g     76 t
ORIGIN      
        1 gaggtgaagc ttctccagtc tggaggtggc ctggtgcagc ctggaggatc cctgaaactc
       61 tcctgtgcag cctcaggaat cgattttagt agatactgga tgagttgggt tcggcgggct
      121 ccagggaaag gactagaatg gattggagaa attaatccag atagcagtac atttaactat
      181 gcaccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac
      241 ctgcaaatga gcaaagtgag atctgaggac acagcccttt attactgtgc aaaaaactgg
      301 gactggtact tcgatgtc
//
LOCUS       MUSIGHAAK     258 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 2-28-9.
ACCESSION   M32762
KEYWORDS    diversity exon; immunoglobulin heavy chain; processed gene;
            variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 2-28-9, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 258)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   258     Ig H-chain V-D-region (AA at 1)
    recomb      234      235     V-region DNA end/D-region DNA start
BASE COUNT       84 a     52 c     61 g     61 t
ORIGIN      
        1 tcctgtgcag cctcaggaat cgattttagt agatactgga tgagttgggt tcggcgggct
       61 ccagggaaag gactagaatg gattggagaa attaatccag atagcagtac aataaactat
      121 gcaccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac
      181 ctgcaaatga gcaaagtgag atctgaggac acagcccttt attactgtgc aaaaaactgg
      241 gactggtact tcgatgtc
//
LOCUS       MUSIGHAAO     243 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 3-27-6.
ACCESSION   M32759
KEYWORDS    diversity exon; immunoglobulin heavy chain; processed gene;
            variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 3-27-6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 243)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   243     Ig H-chain V-D-region (AA at 1)
    recomb      219      220     V-region DNA end/D-region DNA start
BASE COUNT       82 a     46 c     58 g     57 t
ORIGIN      
        1 ggaatcgatt ttagtagata ctggatgagt tgggttcggc gggctccagg gaaaggacta
       61 gaatggattg gagaaattaa tccagatagc agtacaataa actatgcacc atctctaaag
      121 gataaattca tcatctccag agacaacgcc aaaaatacgc tgtacctgca aatgagcaaa
      181 gtgagatctg aggacacagc cctttattac tgtgcaaaaa actgggactg gtacttcgat
      241 gtc
//
LOCUS       MUSIGHAAP     246 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 2-1-3.
ACCESSION   M32763
KEYWORDS    diversity exon; immunoglobulin heavy chain; processed gene;
            variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 2-1-3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 246)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   246     Ig H-chain V-D-region (AA at 1)
    recomb      222      223     V-region DNA end/D-region DNA start
BASE COUNT       78 a     50 c     57 g     61 t
ORIGIN      
        1 tcaggaatcg attttagcag atactggatg agctgggtcc ggcgggctcc agggaaaggg
       61 ctagaatgga ttggagaaat taatccaaat agcagtacaa taaactatgc accatctcta
      121 aaggataaat tcatcatctc cagagacaac gccaaaaatg cgctgtacct gcaagtgagc
      181 aaagtgagat ctgttaacac atccctttat tactgtgcaa gatcttacgg tagctggttt
      241 gcttac
//
LOCUS       MUSIGHAAQ     261 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 10L16-5.
ACCESSION   M32764
KEYWORDS    diversity exon; immunoglobulin heavy chain; processed gene;
            variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 10L16-5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 261)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   261     Ig H-chain V-D-region (AA at 1)
    recomb      219      220     V-region DNA end/D-region DNA start
BASE COUNT       83 a     49 c     63 g     66 t
ORIGIN      
        1 ggattcgatt ttagtagata ctggatgatt tgggtccggc aggctccagg gaaagggcta
       61 gaatggattg gagaaattaa tccagatagc agtacgataa actatacgcc atctctaaag
      121 gataaattca tcatctccag agacaacgcc aaaaatacgc tgtacctgca aatgagcaaa
      181 gtgagatctg aggacacagc cctttattac tgtgcaagaa aggggtattt ttactacggt
      241 agtagctcgt acttcgatgt c
//
LOCUS       MUSIGHAAY     291 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 3-27-6.
ACCESSION   M32766
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 3-27-6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   291     Ig kappa-chain V-J-region (AA at 1)
    recomb      285      286     V-region DNA end/J-region DNA start
BASE COUNT       85 a     73 c     61 g     72 t
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
      241 gaagatattg ccacttactt ttgccaacag ggttatacgc ttcctcggac g
//
LOCUS       MUSIGHABA     291 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 1-5-1.
ACCESSION   M32768
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 1-5-1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   291     Ig kappa-chain V-J-region (AA at 1)
    recomb      285      286     V-region DNA end/J-region DNA start
BASE COUNT       84 a     72 c     60 g     75 t
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccatcagcaa cctggaacct
      241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttcctcggac g
//
LOCUS       MUSIGKAAX     288 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-region.
ACCESSION   M32765
KEYWORDS    immunoglobulin light chain; kappa-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 288)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   288     Ig kappa-chain V-region (AA at 1)
BASE COUNT       83 a     70 c     60 g     75 t
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatgtattac acatcaagtt tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
      241 gaagatattg ccagttacta ttgtcagcag tatagtaagc ttcctcca
//
LOCUS       MUSIGKAAZ     291 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 3-14-9.
ACCESSION   M32767
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 3-14-9, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   291     Ig kappa-chain V-J-region (AA at 1)
    recomb      285      286     V-region DNA end/J-region DNA start
BASE COUNT       86 a     73 c     61 g     71 t
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca ggacattagc aatttattaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaagat tacactacgg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
      241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac g
//
LOCUS       MUSIGKABB     291 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 2-28-9.
ACCESSION   M32769
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 2-28-9, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   291     Ig kappa-chain V-J-region (AA at 1)
    recomb      285      286     V-region DNA end/J-region DNA start
BASE COUNT       84 a     73 c     61 g     73 t
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca gggcattagc aattatttaa actggtcaca gcagaaacca
      121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccatcagcaa cctggaacct
      241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttcctcggac g
//
LOCUS       MUSIGKABC     291 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 2-1-3.
ACCESSION   M32770
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 2-1-3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   291     Ig kappa-chain V-J-region (AA at 1)
    recomb      285      286     V-region DNA end/J-region DNA start
BASE COUNT       83 a     72 c     61 g     75 t
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
      241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttcctcggac g
//
LOCUS       MUSIGLKABD    246 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 10L16-5.
ACCESSION   M32771
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) hybridoma 10L16-5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 246)
  AUTHORS   Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I.,
            French,D., Rothe,J. and Bona,C.
  TITLE     A molecular and structural analysis of the V-H and V-K regions of
            monoclonal antibodies bearing the A48 regulatory idiotype
  JOURNAL   J. Immunol. 144, 614-624 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   246     Ig kappa-chain V-J-region (AA at 1)
    recomb      240      241     V-region DNA end/J-region DNA start
BASE COUNT       74 a     54 c     52 g     66 t
ORIGIN      
        1 ggagacagag tcaccatcag ttgcagtgta agtcagggca ttagcaatta tttaaactgg
       61 tatcagcaga aaccagatgg aactgttaaa ctcctgatct ataacacatc aactttatac
      121 tcaggagtcc catcaaggtt cagtggcagt gggtctggga cagattattc tctcaccatc
      181 agtaacctgg aacctgaaga tattgccact tattattgtc agcagtatag taagcttcct
      241 ccgacg
//
LOCUS       HUMACTAF     1123 bp ds-DNA             PRI       30-MAY-1990
DEFINITION  Human alpha-actin (ACTA) gene, non-coding exon 1.
ACCESSION   J05193
KEYWORDS    alpha-actin.
SOURCE      Human placental DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1123)
  AUTHORS   Reddy,S., Ozgur,K., Lu,M., Chang,W., Mohan,S.R., Kumar,C.C. and
            Ruley,H.E.
  TITLE     Structure of the human smooth muscle alpha-actin gene: Analysis of
            a cDNA and 5' upstream region
  JOURNAL   J. Biol. Chem. 265, 1683-1687 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Reddy, 20-DEC-1989.
FEATURES       from  to/span     description
    pre-msg     892  >  1123     alpha-actin mRNA and introns
                                 /hgml_locus_uid="LW0057W" /nomgen="ACTA"
                                 /map="1p21-qter"
    IVS         935  >  1123     ACTA intron A
    signal      863      869     TATA box
BASE COUNT      286 a    258 c    258 g    321 t
ORIGIN      
        1 gaattcgaga cgagatttgg gtggggacgt agaaccaaac catatcacct ggtctctcta
       61 cttcctgtca aggaggttag tgggcagaga ggagggctag aggcttcctt tgaacaatct
      121 cctttctttt ccaaactact tctttgacag gctgctgggt agactctctg gtcaaaggat
      181 ggtccctact tatgctgcta aattgctcgg tgacaaatta gtagacaaag ctaatgcacc
      241 aaaaaaatga atgtagttat agtaatgcta acatccaaat tcctctttgt aagacatagg
      301 cctgtcaacc ttgtctccat acttcaattc ctatttccac tcacctccct caagaacttg
      361 atttataaac agtgtgccta ccataaaatc atcactccct ctatgtattt atagacgact
      421 gaaggaatat ctttcttctt tgacatgcta ccgtggtaga agggttttaa aagtccgtgc
      481 taggcagagg cagccctttc tgcccctttc tgttctcagt ttattaggaa atggcctgaa
      541 attccagcat gatagcaagc tggcatcctc tgtggaatgt gcaaaccatg cctgcatctg
      601 cccattaccc tagctcagtg tctctgggca tttctgcagt tgttctgaag gcttggcgtg
      661 tttatctccc acaggcggct gaaccgcctc ccgtttcatg agcagaccag tggaatgcag
      721 tggaagagac ccaggcctcc ggccacccag attagagagt tttgtgctga ggtccctata
      781 tggttgtgtt agactgaaca gcaggctcaa gtctgtcttt gctccttgtt tgggaagcaa
      841 gtgggaggag agcagaaggg gctatataac ccttcagctt tcagcttccc tgaacaccac
      901 ccagtgtgga gcagcccagc caagcactgt cagggtaagt ggcgccaggc caaggatgtg
      961 acttatagat tccagtggct cttttaatta cccggtataa taagacatca tctgcaggga
     1021 tttggctggg ttcatgcact gatatttctg aatgaagatt gtactactaa aatgattgta
     1081 gcttttggct ttaatgatct aacgttaaag acagggctaa tat
//
LOCUS       HUMACTAM     1329 bp ss-mRNA            PRI       30-MAY-1990
DEFINITION  Human alpha-actin (ACTA) mRNA, complete cds.
ACCESSION   J05192
KEYWORDS    alpha-actin.
SOURCE      Human umbilical artery, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Reddy,S., Ozgur,K., Lu,M., Chang,W., Mohan,S.R., Kumar,C.C. and
            Ruley,H.E.
  TITLE     Structure of the human smooth muscle alpha-actin gene: Analysis of
            a cDNA and 5' upstream region
  JOURNAL   J. Biol. Chem. 265, 1683-1687 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Reddy, 20-DEC-1989.
FEATURES       from  to/span     description
    pept         49     1182     alpha-actin /hgml_locus_uid="LW0057W"
                                 /nomgen="ACTA" /map="1p21-qter"
    mRNA    <     1     1329     ACTA mRNA
    signal     1305     1310     poly-A signal
BASE COUNT      326 a    361 c    327 g    315 t
ORIGIN      
        1 gcagcccagc caagcactgt acaggaatcc tgtgaagcag ctccagctat gtgtgaagaa
       61 gaggacagca ctgccttggt gtgtgacaat ggctctgggc tctgtaaggc cggctttgct
      121 ggggacgatg ctcccagggc tgttttccca tccattgtgg gacgtcccag acatcagggg
      181 gtgatggtgg gaatgggaca aaaagacagc tacgtgggtg acgaagcaca gagcaaaaga
      241 ggaatcctga ccctgaagta cccgatagaa catggcatca tcaccaactg ggacgacatg
      301 gaaaagatct ggcaccactc tttctacaat gagcttcgtg ttgcccctga agagcatccc
      361 accctgctca cggaggcacc cctgaacccc aaggccaacc gggagaaaat gactcaaatt
      421 atgtttgaga ctttcaatgt cccagccatg tatgtggcta tccaggcggt gctgtctctc
      481 tatgcctctg gacgcacaac tggcatcgtg ctggactctg gagatggtgt cacccacaat
      541 gtccccatct atgagggcta tgccttgccc catgccatca tgcgtctgga tctggctggc
      601 cgagatctca ctgactacct catgaagatc ctgactgagc gtggctattc cttcgttact
      661 actgctgagc gtgagattgt ccgggacatc aaggagaaac tgtgttatgt agctctggac
      721 tttgaaaatg agatggccac tgccgcatgg tcatcctccc ttgagaagag ttacgagttg
      781 cctgatgggc aagtgatcac catcggaaat gaacgtttcc gctgcccaga gaccctgttc
      841 cagccatcct tcatcgggat ggagtctgct ggcatccatg aaaccaccta caacagcatc
      901 atgaagtgtg atattgacat caggaaggac ctctatgcta acaatgtcct atcagggggc
      961 accactatgt accctggcat tgccgaccga atgcagaagg agatcacggc cctagcaccc
     1021 agcaccatga agatcaagat cattgcccct ccggagcgca aatactctgt ctggatcggt
     1081 ggctccatcc tggcctctct gtccaccttc cagcagatgt ggatcagcaa acaggaatac
     1141 gatgaagccg ggccttccat tgtccaccgc aaatgcttct aaaacacttt cctgctcctc
     1201 tctgtctcat gcacacaact gtgaatgtcc tgtggaatta tgccttcagt tcttttccaa
     1261 atcattccta gccaaagctc tgactcgtta cctatgtgtt ttttaataaa tctgaaatag
     1321 gctactggt
//
LOCUS       ASOTAAG1     2935 bp ds-DNA             PLN       30-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G1) gene, complete cds.
ACCESSION   M33218
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 2935)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        607      772     Taka-amylase A (Taa-G1) precursor, exon 1
                828      868     Taka-amylase A precursor, exon 2
                955     1070     Taka-amylase A precursor, exon 3
               1140     1248     Taka-amylase A precursor, exon 4
               1317     1545     Taka-amylase A precursor, exon 5
               1603     1765     Taka-amylase A precursor, exon 6
               1830     1976     Taka-amylase A precursor, exon 7
               2041     2281     Taka-amylase A precursor, exon 8
               2360     2647     Taka-amylase A precursor, exon 9
    sigp        607      669     Taka-amylase A signal peptide
    matp        670      772     Taka-amylase A
                828      868     Taka-amylase A
                955     1070     Taka-amylase A
               1140     1248     Taka-amylase A
               1317     1545     Taka-amylase A
               1603     1765     Taka-amylase A
               1830     1976     Taka-amylase A
               2041     2281     Taka-amylase A
               2360     2644     Taka-amylase A
    pre-msg     543  >  2789     Taa-G1 mRNA and introns
    IVS         773      827     Taa-G1 intron A (no splice consensus)
    IVS         869      954     Taa-G1 intron B
    IVS        1071     1139     Taa-G1 intron C
    IVS        1249     1316     Taa-G1 intron D
    IVS        1546     1602     Taa-G1 intron E
    IVS        1766     1829     Taa-G1 intron F
    IVS        1977     2040     Taa-G1 intron G
    IVS        2282     2359     Taa-G1 intron H
    signal     2784     2789     poly-A signal
BASE COUNT      818 a    752 c    657 g    708 t
ORIGIN      
        1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact
       61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca
      121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca
      181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag
      241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat
      301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat
      361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt
      421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc
      481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg
      541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag
      601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct
      661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat
      721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt
      781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg
      841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg
      901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac
      961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc
     1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat
     1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga
     1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt
     1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct
     1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct
     1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact
     1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct
     1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga
     1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc
     1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg
     1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt
     1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca
     1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt
     1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca
     1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca
     1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa
     1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag
     2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg
     2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa
     2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc
     2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa
     2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga
     2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg
     2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga
     2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt
     2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg
     2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag
     2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag
     2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga
     2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca
     2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc
     2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg
//
LOCUS       ASOTAAG2A1    197 bp ds-DNA             PLN       30-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G2) gene, 5' end.
ACCESSION   M33220
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 197)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        195  >   197     Taka-amylase A (Taa-G2) precursor
    pre-msg     131  >   197     Taa-G2 mRNA and introns
    signal        2       11     CAAT box
    signal       95      100     TATA box
BASE COUNT       59 a     52 c     36 g     50 t
ORIGIN      
        1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa
       61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg
      121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc
      181 cacagaaggc atttatg
//
LOCUS       ASOTAAG2A2    198 bp ds-DNA             PLN       30-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G2) gene, 3' end.
ACCESSION   M33222
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 198)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        9     Taka-amylase A (Taa-G2) (AA at 1)
    pre-msg <     1      151     Taa-G2 mRNA and introns (alt.)
    pre-msg <     1      156     Taa-G2 mRNA and introns (alt.)
BASE COUNT       45 a     48 c     53 g     52 t
ORIGIN      About 2.1 kb after segment 1.
        1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
       61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
      121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc
      181 gtcctccaga ggcgagga
//
LOCUS       ASOTTAM1      191 bp ss-mRNA            PLN       30-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa) mRNA, 3' end.
ACCESSION   M33219
KEYWORDS    Taka-amylase A.
SEGMENT     1 of 2
SOURCE      A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 191)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        189  >   191     Taka-amylase A (Taa)
    mRNA        125  >   191     Taa mRNA
BASE COUNT       56 a     52 c     32 g     51 t
ORIGIN      
        1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca
       61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt
      121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga
      181 aggcatttat g
//
LOCUS       ASOTTAM2      156 bp ss-mRNA            PLN       30-MAY-1990
DEFINITION  A.oryzae Taka-amylase A (Taa) mRNA, 5' end.
ACCESSION   M33221
KEYWORDS    Taka-amylase A.
SEGMENT     2 of 2
SOURCE      A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        9     Taka-amylase A (Taa) (AA at 1)
    mRNA    <     1      151     Taa mRNA (alt.)
    mRNA    <     1      156     Taa mRNA (alt.)
BASE COUNT       40 a     35 c     42 g     39 t
ORIGIN      About 1.5 kb after segment 1.
        1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
       61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
      121 ggcaccgggc cccgactgag agcccgtctc gaatcc
//
LOCUS       CIPNADGAPD   1354 bp ss-mRNA            PLN       30-MAY-1990
DEFINITION  M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH)
            mRNA, complete cds.
ACCESSION   J05223
KEYWORDS    D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase;
            glyceraldehyde-3-phosphate dehydrogenase.
SOURCE      M.crystallinum, cDNA to mRNA.
  ORGANISM  Mesembryanthemum crystallinum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Aizoaceae.
REFERENCE   1  (bases 1 to 1354)
  AUTHORS   Ostrem,J.A., Vernon,D.M. and Bohnert,H.J.
  TITLE     Increased expression of a gene coding for NAD:glyceraldehyde-3-
            phosphate dehydrogenase during the transition from C-3
            photosynthesis to crassulacean acid metabolism in Mesembryanthemum
            crystallinum
  JOURNAL   J. Biol. Chem. 265, 3497-3502 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         55     1068     glyceraldehyde-3-phosphate dehydrogenase (EC
                                 1.2.1.12)
BASE COUNT      315 a    308 c    345 g    386 t
ORIGIN      
        1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct
       61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc
      121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac
      181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag
      241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg
      301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga
      361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt
      421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac
      481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg
      541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc
      601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg
      661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt
      721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg
      781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct
      841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat
      901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga
      961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac
     1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga
     1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc
     1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt
     1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta
     1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc
     1321 agcatttatg aaacattgct gtgagctttt gagt
//
LOCUS       ECOPRIA      2658 bp ds-DNA             BCT       30-MAY-1990
DEFINITION  E.coli primosomal protein n' (priA) gene, complete cds, and cytR
            gene, 5' end.
ACCESSION   M33293
KEYWORDS    cytR gene; priA gene; primosomal protein.
SOURCE      E.coli (strain W3110) DNA, clone pEL042.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2658)
  AUTHORS   Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A.
  TITLE     The priA gene encoding the primosomal, replicative n' protein of
            E.coli
  JOURNAL   Unpublished (1990) Stanford U., Beckman Cntr., Stanford, CA 94305
  STANDARD  full automatic
COMMENT     Authorin sequence for [1] kindly submitted by G.C.Allen Jr.,
            26-MAR-1990.
FEATURES       from  to/span     description
    pept         64     2262     primosomal protein n'
    pept       2418  >  2658     cytR gene product
    signal       27       32     -10 region
    signal        5       10     -35 region
    binding      53       59     ribosome binding site
BASE COUNT      578 a    738 c    756 g    586 t
ORIGIN      
        1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat
       61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg
      121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag
      181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag
      241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga
      301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc
      361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt
      421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa
      481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa
      541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa
      601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg
      661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc
      721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa
      781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg
      841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc
      901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt
      961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt
     1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac
     1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc
     1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt
     1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg
     1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac
     1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt
     1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac
     1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc
     1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg
     1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg
     1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca
     1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg
     1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc
     1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac
     1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa
     1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg
     1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg
     2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg
     2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg
     2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt
     2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg
     2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg
     2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg
     2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg
     2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata
     2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac
     2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc
     2641 cggatatctg cgatcccc
//
LOCUS       EWCTELRNA     657 bp ds-DNA             INV       30-MAY-1990
DEFINITION  E.crassus telomerase RNA component gene, complete cds.
ACCESSION   M33461
KEYWORDS    telomerase RNA.
SOURCE      E.crassus DNA.
  ORGANISM  Euplotes crassus
            Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; 
            Spirotricha; Heterotrichida; Clevelandellina.
REFERENCE   1  (bases 1 to 657)
  AUTHORS   Shippen-Lentz,D. and Blackburn,E.H.
  TITLE     Functional evidence for an RNA template in telomerase
  JOURNAL   Science 247, 546-552 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    RNA         152      342     telomerase RNA component
    site        186      197     functional telomeric template
BASE COUNT      202 a    122 c    111 g    222 t
ORIGIN      
        1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga
       61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt
      121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg
      181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg
      241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt
      301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg
      361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt
      421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat
      481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga
      541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac
      601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt
//
LOCUS       HS4DWXJ       160 bp ds-DNA             VRL       30-MAY-1990
DEFINITION  Epstein-Barr virus defective WZhet junction.
ACCESSION   M33474
KEYWORDS    .
SOURCE      Epstein-Barr virus (strain HR-1, clinical sample 9) DNA.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 160)
  AUTHORS   Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
  TITLE     Defective viral DNA in Epstein-Barr virus-associated oral hairy
            leukoplakia
  JOURNAL   J. Virol. 64, 397-400 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb       87       90     WZhet junction
BASE COUNT       41 a     45 c     41 g     33 t
ORIGIN      
        1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc
       61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact
      121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac
//
LOCUS       HS4WXJ        181 bp ds-DNA             VRL       30-MAY-1990
DEFINITION  Epstein-Barr virus WZhet junction, HR-1 clone 5.
ACCESSION   M33473
KEYWORDS    .
SOURCE      Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 181)
  AUTHORS   Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
  TITLE     Defective viral DNA in Epstein-Barr virus-associated oral hairy
            leukoplakia
  JOURNAL   J. Virol. 64, 397-400 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb      108      111     WZhet junction
BASE COUNT       50 a     42 c     46 g     43 t
ORIGIN      
        1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt
       61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca
      121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga
      181 c
//
LOCUS       HUMREGA      4251 bp ds-DNA             PRI       30-MAY-1990
DEFINITION  Human regenerating protein (reg) gene, complete cds.
ACCESSION   J05412
KEYWORDS    pancreatic stone protein; pancreatic thread protein;
            regenerating protein.
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4251)
  AUTHORS   Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
  TITLE     Complete nucleotide sequence of the human reg gene and its
            expression in normal and tumoral tissues: The reg protein,
            pancreatic stone protein, and pancreatic thread protein are one and
            the same product of the gene
  JOURNAL   J. Biol. Chem. 265, 7432-7439 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            H.Okamoto, 23-FEB-1990.
FEATURES       from  to/span     description
    pept       1571     1634     regenerating protein (reg), exon 2 (first
                                 expressed exon)
               2270     2388     regenerating protein, exon 3
               2696     2833     regenerating protein, exon 4
               3549     3660     regenerating protein, exon 5
               3856     3923     regenerating protein, exon 6
    pre-msg    1196     4116     reg mRNA and introns
    IVS        1224     1524     reg intron A
    IVS        1635     2269     reg intron B
    IVS        2389     2695     reg intron C
    IVS        2834     3548     reg intron D
    IVS        3661     3855     reg intron E
    signal     1169     1174     TATA box
BASE COUNT     1161 a    927 c    869 g   1294 t
ORIGIN      
        1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc
       61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat
      121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt
      181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc
      241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt
      301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc
      361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc
      421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat
      481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac
      541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg
      601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg
      661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata
      721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag
      781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc
      841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga
      901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt
      961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag
     1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc
     1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa
     1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa
     1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat
     1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta
     1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc
     1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag
     1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac
     1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt
     1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct
     1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact
     1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt
     1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat
     1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg
     1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt
     1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa
     1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac
     2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga
     2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat
     2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa
     2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc
     2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc
     2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag
     2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca
     2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc
     2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac
     2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca
     2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta
     2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt
     2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca
     2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta
     2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag
     2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata
     3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt
     3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca
     3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct
     3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa
     3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga
     3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt
     3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa
     3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc
     3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc
     3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg
     3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag
     3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg
     3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt
     3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt
     3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct
     3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc
     3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca
     4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt
     4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc
     4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct
     4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c
//
LOCUS       HUMREGRELA   1524 bp ds-DNA             PRI       30-MAY-1990
DEFINITION  Human reg-related sequence, complete cds.
ACCESSION   J05413
KEYWORDS    pancreatic stone protein; pancreatic thread protein;
            regenerating protein.
SOURCE      Human esophageal mucosa DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1524)
  AUTHORS   Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
  TITLE     Complete nucleotide sequence of the human reg gene and its
            expression in normal and tumoral tissues: The reg protein,
            pancreatic stone protein, and pancreatic thread protein are one and
            the same product of the gene
  JOURNAL   J. Biol. Chem. 265, 7432-7439 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            H.Okamoto, 23-FEB-1990.
BASE COUNT      382 a    368 c    345 g    429 t
ORIGIN      
        1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt
       61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt
      121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg
      181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat
      241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg
      301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga
      361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata
      421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat
      481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag
      541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa
      601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc
      661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac
      721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta
      781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca
      841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt
      901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat
      961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct
     1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc
     1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga
     1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc
     1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct
     1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct
     1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc
     1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg
     1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc
     1501 tataaatcat ataatgtaaa ggat
//
LOCUS       MDPCGA       4801 bp ss-RNA             VRL       30-MAY-1990
DEFINITION  Aleutian mink disease parvovirus complete genome.
ACCESSION   M20036
KEYWORDS    complete genome.
SOURCE      Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4.
  ORGANISM  Aleutian mink disease parvovirus
            Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE   1  (bases 1 to 4592)
  AUTHORS   Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
            Wolfinbarger,J.B.
  TITLE     Nucleotide sequence and genomic organization of Aleutian mink
            disease parvovirus (ADV): Sequence comparisons between a
            nonpathogenic and pathogenic strain of ADV
  JOURNAL   J. Virol. 62, 2903-2915 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 4593 to 4801)
  AUTHORS   Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W.,
            Perryman,S. and Wolfinbarger,J.B.
  TITLE     Nucleotide sequence of the 5' terminal palindrome of Aleutian mink
            disease parvovirus (ADV) and construction of an infectious
            molecular clone
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by M.E.Bloom, 18-MAR-1990.
            
            Sequence reported below is (+) strand.
FEATURES       from  to/span     description
    ORF         116     1978     Left ORF
    ORF        1983     2207     Middle ORF2
    ORF        1993     2211     Middle ORF1
    ORF        2241     4399     Right ORF
BASE COUNT     1740 a    912 c    943 g   1206 t
ORIGIN      
        1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt
       61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt
      121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag
      181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact
      241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt
      301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact
      361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc
      421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac
      481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt
      541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta
      601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag
      661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa
      721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc
      781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga
      841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta
      901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt
      961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga
     1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg
     1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa
     1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt
     1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga
     1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt
     1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga
     1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa
     1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg
     1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta
     1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat
     1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac
     1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg
     1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa
     1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa
     1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca
     1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt
     1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca
     2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa
     2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg
     2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata
     2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta
     2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc
     2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag
     2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg
     2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg
     2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt
     2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact
     2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag
     2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata
     2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta
     2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa
     2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt
     2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg
     3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc
     3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct
     3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta
     3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt
     3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg
     3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa
     3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg
     3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt
     3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac
     3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga
     3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat
     3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac
     3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt
     3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc
     3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg
     3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg
     3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac
     4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg
     4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa
     4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag
     4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag
     4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg
     4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact
     4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc
     4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta
     4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac
     4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt
     4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt
     4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct
     4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg
     4801 a
//
LOCUS       MDPUPS       3454 bp ss-RNA             VRL       30-MAY-1990
DEFINITION  Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial
            sequence.
ACCESSION   M32981
KEYWORDS    .
SOURCE      Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA.
  ORGANISM  Aleutian mink disease parvovirus
            Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE   1  (sites)
  AUTHORS   Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
            Wolfinbarger,J.B.
  TITLE     Nucleotide sequence and genomic organization of Aleutian mink
            disease parvovirus (ADV): Sequence comparisons between a
            nonpathogenic and pathogenic strain of ADV
  JOURNAL   J. Virol. 62, 2903-2915 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3454, for [1])
  AUTHORS   Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
            Wolfinbarger,J.B.
  JOURNAL   Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by M.E.Bloom 18-MAR-1990.
            
            Sequence reported below is (+) strand.
BASE COUNT     1268 a    673 c    698 g    815 t
ORIGIN      
        1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc
       61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac
      121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac
      181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta
      241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa
      301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga
      361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat
      421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt
      481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa
      541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta
      601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat
      661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag
      721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga
      781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat
      841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc
      901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact
      961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt
     1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac
     1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac
     1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa
     1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta
     1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag
     1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac
     1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg
     1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa
     1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac
     1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca
     1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga
     1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc
     1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg
     1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta
     1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt
     1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga
     1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag
     2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag
     2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac
     2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt
     2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga
     2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc
     2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg
     2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat
     2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta
     2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg
     2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa
     2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg
     2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta
     2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc
     2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa
     2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt
     2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca
     3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg
     3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct
     3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga
     3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt
     3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc
     3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt
     3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa
     3421 aacctatggt tactttactt ggagaggcaa gctt
//
LOCUS       MUSAA2DEL     300 bp ds-DNA             ROD       30-MAY-1990
DEFINITION  Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion
            fragment.
ACCESSION   M33468
KEYWORDS    deletion mutant.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
  TITLE     Molecular analysis of two mouse dilute locus deletion mutations:
            Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
            lethal Aa2 alleles
  JOURNAL   Mol. Cell. Biol. 10, 501-509 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb      130      133     deletion breakpoint
BASE COUNT       82 a     45 c     76 g     97 t
ORIGIN      
        1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact
       61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt
      121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt
      181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata
      241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa
//
LOCUS       MUSSL20JA     300 bp ds-DNA             ROD       30-MAY-1990
DEFINITION  Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion
            fragment.
ACCESSION   M33467
KEYWORDS    deletion mutant.
SOURCE      Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
  TITLE     Molecular analysis of two mouse dilute locus deletion mutations:
            Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
            lethal Aa2 alleles
  JOURNAL   Mol. Cell. Biol. 10, 501-509 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept.ps /    42      122     dilute gene, exon 3
                179  >   300     dilute gene, exon 5
    recomb      129      130     deletion breakpoint
BASE COUNT       86 a     51 c     61 g    102 t
ORIGIN      
        1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac
       61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac
      121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta
      181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta
      241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc
//
LOCUS       PRVVP4       2359 bp ds-RNA             VRL       30-MAY-1990
DEFINITION  Porcine rotavirus capsid protein VP4 gene, complete cds.
ACCESSION   M33516
KEYWORDS    capsid protein VP4.
SOURCE      Porcine rotavirus (Gottfried strain; serotype 4) DNA.
  ORGANISM  Porcine rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 2359)
  AUTHORS   Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K.
  TITLE     Similarity of the outer capsid protein VP4 of the Gottfried strain
            of porcine rotavirus to that of asymptomatic human rotavirus
            strains
  JOURNAL   J. Virol. 64, 414-418 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         10     2337     capsid protein VP4
BASE COUNT      812 a    406 c    442 g    699 t
ORIGIN      
        1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa
       61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg
      121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat
      181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca
      241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac
      301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga
      361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg
      421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta
      481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat
      541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa
      601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag
      661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta
      721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag
      781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt
      841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca
      901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt
      961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc
     1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac
     1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg
     1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt
     1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc
     1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca
     1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga
     1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac
     1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg
     1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata
     1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt
     1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct
     1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc
     1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata
     1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga
     1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct
     1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat
     1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat
     2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc
     2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata
     2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc
     2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa
     2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag
     2341 ctctatagag gatgtgacc
//
LOCUS       RATSTAA      1000 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds.
ACCESSION   M33329
KEYWORDS    hydroxysteroid sulfotransferase a.
SOURCE      Rat (strain Sprague-Dawley) female liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1000)
  AUTHORS   Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K.,
            Yamazoe,Y. and Kato,R.
  TITLE     cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a
            strong homology in amino acid sequence with the senescence marker
            protein SMP-2 in rat livers
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         38      892     hydroxysteroid sulfotransferase a (STa)
    mRNA    <     1     1000     STa mRNA
    signal      977      982     poly-A signal
BASE COUNT      299 a    187 c    211 g    303 t
ORIGIN      
        1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga
       61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa
      121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg
      181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt
      241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa
      301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct
      361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc
      421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta
      481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc
      541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa
      601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga
      661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc
      721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg
      781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt
      841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa
      901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa
      961 gtgaatgtgg tcattgaata aattctattc tggcattgtg
//
LOCUS       SMFPOLENV    3534 bp ss-RNA             VRL       30-MAY-1990
DEFINITION  Simian foamy virus type 1 polymerase (pol) gene, 3' end; and
            envelope (env) gene, complete cds.
ACCESSION   M33561
KEYWORDS    envelope protein; polymerase.
SOURCE      Simian foamy virus type 1, cDNA to viral RNA.
  ORGANISM  Simian foamy virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Spumavirinae.
REFERENCE   1  (bases 1 to 3534)
  AUTHORS   Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A.
  TITLE     Relationship of the env genes and the endonuclease domain of the
            pol genes of simian foamy virus type 1 and human foamy virus
  JOURNAL   J. Virol. 64, 406-410 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      582     polymerase (AA at 1)
    pept        524     3481     envelope protein
BASE COUNT     1162 a    645 c    670 g   1046 t     11 others
ORIGIN      
        1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac
       61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta
      121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat
      181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta
      241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc
      301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg
      361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc
      421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt
      481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg
      541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac
      601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag
      661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg
      721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat
      781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt
      841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt
      901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt
      961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt
     1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt
     1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt
     1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca
     1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac
     1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca
     1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa
     1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa
     1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga
     1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg
     1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc
     1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt
     1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca
     1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa
     1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg
     1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt
     1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa
     1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc
     2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac
     2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa
     2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag
     2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg
     2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt
     2341 atatttactn nnnnnnnnnn tggtaaccct gatggaagct gcccttcatg atgtttcgat
     2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat
     2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt
     2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta
     2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga
     2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt
     2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa
     2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt
     2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg
     2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg
     2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt
     3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc
     3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt
     3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa
     3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg
     3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc
     3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg
     3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact
     3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg
     3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat
//
LOCUS       IPBFATPD     1035 bp ss-mRNA            PLN       30-MAY-1990
DEFINITION  Sweet potato F-1-ATPase (mitochondrial type) delta subunit mRNA,
            complete cds.
ACCESSION   J05397
KEYWORDS    F-1-ATPase delta subunit.
SOURCE      I.batatas (strain Kokei No.4) tuberous root, cDNA to mRNA.
  ORGANISM  Ipomoea batatas
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Convolvulaceae.
REFERENCE   1  (bases 1 to 1035)
  AUTHORS   Kimura,T., Takeda,S., Asahi,T. and Nakamura,K.
  TITLE     Primary structure of a precursor for the delta-subunit of sweet
            potato mitochondrial F-1-ATPase deduced from full-length cDNA
  JOURNAL   J. Biol. Chem. 265, 6079-6085 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Nakamura, 23-FEB-1990.
FEATURES       from  to/span     description
    pept        115      849     F-1-ATPase delta subunit precursor (EC
                                 3.6.1.3)
    sigp        115      249     F-1-ATPase delta subunit signal peptide
    matp        250      846     F-1-ATPase delta subunit
    signal      964      969     poly-A signal
BASE COUNT      283 a    225 c    238 g    289 t
ORIGIN      
        1 catagggtaa actgcagcgt tctttgggct tcgttctcaa acccttccct tctgtctccg
       61 aattctctcg cgccggcgaa cgattcagct tagagcaaag aagagctctc tgatatggca
      121 atgacaggac gtgctagatc gatgggcttc tctattctcc agaaggccct ctcttcagcc
      181 cagagatcta acgctcatcg atccattctc tgccctaccc tctcaaactc tgagctgctg
      241 aggaactatg caactgcctc tgcttcaaaa gaacaaaaaa ttaaggtgcc tctgacaatg
      301 tatggggttt ctggaaacta tgcatctgct ttgtaccttg cagcagtaaa gtctaacaca
      361 ttagagaagg ttgagtcaga gctctatgac cttgttgagg cttcaaagaa gagtcctaca
      421 ttttctcagt tcatgaggga tccatctgta cctgttgata ctagagtgaa tgccataaag
      481 gaaatttgtg ctcaagctaa atttggggat accacccaga acttcttgct tattttggct
      541 gaaaatggga ggctgaaaca cattgatcgc attgtgaagc gattcaaaga gctgaccatg
      601 gcacacaggg gagaagtcaa agccactgtg acaacagtaa ttccccttcc cgctgatgag
      661 gagaaagaat tgaaagccac attgcaggaa atggttggac agggaaagag tgttcaaatt
      721 gaacagaaga ttgatccaac cattcttggt ggccttgttg tggaattcgg gcagaaggtc
      781 tttgacatgt ctataaggac acgggcacgc cagatggaga gattcttgcg tgaaccactc
      841 aacttttgat tgcctccaat cccgttagaa ggcgtttgtt tccatcttga ggaaagcacg
      901 atttcggtct gggctccttg agctgcctgt tttccgataa acattgttct gctctttttt
      961 gcaaataaag aaaacttgct ttttgtaaga atttgatggg aaaaaaagaa aacctaatgt
     1021 tgtttttact tgctg
//
LOCUS       YSCSAC7P     1500 bp ds-DNA             PLN       30-MAY-1990
DEFINITION  S.cerevisiae SAC7 protein gene, complete cds.
ACCESSION   M32335
KEYWORDS    SAC7 protein.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 301 to 1500)
  AUTHORS   Dunn,T.M. and Shortle,D.
  TITLE     Null alleles of SAC7 suppress temperature-sensitive actin mutations
            in Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 2308-2314 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 300)
  AUTHORS   Dunn,T.M.
  JOURNAL   Unpublished (1990) Univ. of Health Sciences, Bethesda MD 20814-4799
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1], [2] kindly
            submitted by T.M.Dunn, 23-FEB-1990.
FEATURES       from  to/span     description
    pept        606     1430     SAC7 protein
BASE COUNT      526 a    313 c    288 g    373 t
ORIGIN      Chromosome 4R, between ADE8 and TRP4.
        1 ggatcaccaa cttctctttc aaggaatgct caaccaacta ctttaaaacg gcctccatta
       61 tcttcaagac catattcata taatacccca accaaagata ggaaatcctt ttccaaatcc
      121 gcgaaacaga acaataacaa taacaacgct aattccggaa catcgcctca cgcagagttc
      181 aaaaattaca gagatatgtt tttatctaat agaaatggtt tcacaggtag ggttttcggt
      241 gttactttag cagaatcgtt gagcgtcgcc agtgcagagg tcattgttca aagtgagttg
      301 gtgagttttg gtcggatacc catcgtggta gccaagtgcg gcgcatactt aaaagcaaat
      361 ggattggaga cctcgggtat atttcgtata gcgggcaatg gtaaaagagt aaaagccctt
      421 caatacatat tctcgtcgcc acctgattat ggtaccaaat tcaacgattg gaaacatata
      481 cagtgcacga tgttgcatcg ctcctgagga gataccttaa taatttggcc gaaccactaa
      541 tacctttatc cctatatgaa caattcagaa acccgctacg atctagacca agaatcctaa
      601 ggcatatgtt gacccacgaa gtttctcatc cgaatgcgaa taaaacaaat aatgtaacag
      661 taaaatcaag tagacagaac tataatgatg atggtgctaa tgatggtgac atcgaaaagg
      721 aggacgccaa agatgatgaa gaaaaaagac gaagaaaaat acgtcataag agaaggctga
      781 cccgcgatat aagggcagct atcaaggaat atgaagagct ctttgttacc ttatcaaatg
      841 acacgaaaca gctaactata tacctactcg atttactgag tctttttgca aggcaatcac
      901 agtttaacct aatgtctggt agaaacttgg cggccatctt ccaaccttca atattatcgc
      961 atcctcaaca tgatatggac cccaaagaat acgagttatc ccgactagta gtggaatttt
     1021 tgattgaata ctcgtacaag ctattacccc atcttttgaa gttggctaaa agggaacaac
     1081 aggaacggtt gtcaaccgaa aataagaaaa ataatggaga taaacagaaa actgatccta
     1141 tagaaatacc aaagatcacc tcatcggatt cgccaccaat agtttcttcc aataaaaacc
     1201 caccggcgat tgacaataac aataaactag accatacgac gttatcgcca atatctactt
     1261 cgataccgga gaactcatca gacctacaga cttcaaaaat gttaaagcct ccgaagcaaa
     1321 gaagaccaca ctcaaaatcg tttggttcta ctccggttct ccggatgtta ttgctagtaa
     1381 taaaagaaga acgagtttat tcccgtggtt acataaaccg ggaattttga gtgacacagg
     1441 agacaatggc gacttaactg ctactgaagc tgaaggtgat gattatgaag aagaaaatgt
//
LOCUS       BOVINSMN      960 bp ss-mRNA            MAM       30-MAY-1990
DEFINITION  Bovine inositol monophosphatase mRNA, complete cds.
ACCESSION   J05394
KEYWORDS    inositol 1-phosphatase; inositol monophosphatase.
SOURCE      Bovine calf brain, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 960)
  AUTHORS   Diehl,R.E., Whiting,P., Potter,J., Gee,N., Ragan,C.I.,
            Linemeyer,D., Schoepfer,R., Bennett,C. and Dixon,R.A.F.
  TITLE     Cloning and expression of bovine brain inositol monophosphatase
  JOURNAL   J. Biol. Chem. 265, 5946-5949 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            P.Whiting, 07-FEB-1990.
FEATURES       from  to/span     description
    pept         61      894     inositol monophosphatase
BASE COUNT      266 a    197 c    248 g    249 t
ORIGIN      
        1 cggaacggcc gcgcgtctga caggctggtg cgcccgcggc cgcgcgcccg ccgccctgag
       61 atggctgatc cttggcagga atgcatggat tatgcagtaa ccctggcagg acaagctgga
      121 gaggtggttc gtgaagcgct caagaatgaa atgaatatta tggttaaaag ttctccggct
      181 gatttggtaa ctgctactga ccaaaaagtt gaaaaaatgc ttatcacatc cataaaggaa
      241 aagtatccat ctcacagttt cattggtgag gaatctgtgg cagctgggga aaaaagtatc
      301 ttaaccgaca accctacatg gatcattgac cctattgatg gaacaactaa ctttgtacat
      361 ggatttcctt ttgtagctgt ttcaattggc tttgtggtaa ataaaaagat ggagtttgga
      421 attgtgtaca gttgcctgga ggataagatg tacactggcc ggaaaggaaa aggtgccttt
      481 tgtaatggtc aaaaactaca ggtttcacac caagaagata ttaccaaatc tctcttggtg
      541 acagagttgg gctcttccag aacaccagag actgtgagaa ttattctttc taatatagaa
      601 aggcttcttt gccttcccat ccatgggatc cggggtgttg gaacagcagc tctgaacatg
      661 tgcctggtgg ccgctggggc tgcggacgcg tattatgaaa tggggatcca ctgctgggat
      721 gttgcaggcg ccggcatcat tgtgactgaa gctggcggag tgctcctgga tgtcacaggt
      781 ggaccatttg atttgatgtc acgaagagta attgcttcaa gcaataaaac attagcagaa
      841 aggatagcca aagaaattca gataatacct cttcaaagag atgatgaaga ttaactccga
      901 cggcttcaca cattcaatca cagttgcgtc tctccaggtt ttctaatgca ctcatggtgt
//
LOCUS       FVBPNGF      1350 bp ds-DNA             BCT       30-MAY-1990
DEFINITION  F.meningosepticum endoglycosidase F (PNGase F) gene, complete cds.
ACCESSION   J05411
KEYWORDS    N-glycanase; endoglycosidase F;
            peptide-N-4-(N-acetyl-beta-D-glucosaminyl) asparagine amidase F.
SOURCE      F.meningosepticum (ATCC 33958) DNA, clone pGB29, subclone J.
  ORGANISM  Flavobacterium meningosepticum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1350)
  AUTHORS   Barsomian,G.D., Johnson,T.L., Borowski,M., Denman,J.,
            Ollington,J.F., Hirani,S., McNeilly,D.S. and Rasmussen,J.R.
  TITLE     Cloning and expression of peptide-N-4-(N-acetyl-beta-D-
            glucosaminyl) asparagine amidase F in Escherichia coli
  JOURNAL   J. Biol. Chem. 265, 6967-6972 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.Barsomian, 29-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        139     1203     endoglycosidase F precursor (EC 3.5.1.52)
    matp        259     1200     endoglycosidase F
BASE COUNT      470 a    219 c    276 g    385 t
ORIGIN      
        1 ggatctgtac tgagtattgt aaaggtttta taatttgatt aaaaatgaat ctttgttaat
       61 aaatcagggt ttaatttgac ttttaaaacg atttcttaaa taaatacaat atattcgtat
      121 aacaaatcgc aataaattat gagaaaacta ctaattttta gtatatctgc ttacttgatg
      181 gcaggtatcg tttcgtgtaa aggtgtagac agtgcaacac ctgtaacaga agatcgctta
      241 gctctgaatg cggtaaatgc tccggcagat aataccgtaa atattaaaac attcgacaaa
      301 gtaaaaaatg cctttggtga cggattgtcc caaagtgcag aaggaacctt tacatttccg
      361 gccgatgtaa caactgtaaa aacgattaag atgttcatca aaaatgaatg tcctaataaa
      421 acctgtgatg aatgggatcg ttatgccaat gtttatgtaa aaaataaaac aacaggagaa
      481 tggtatgaaa taggacgctt tattactcca tattgggtgg gtacggaaaa attacctcgt
      541 ggactggaaa ttgatgttac cgatttcaaa tctttactgt ccggaaatac agaacttaaa
      601 atttatacgg agacttgctt ggccaaagga agagaataca gtgtagactt tgatattgta
      661 tatggtacac cggattataa atattcggca gtagtacctg taatccaata taacaaatca
      721 tccattgatg gtgttcctta tggtaaagca catacactgg gattaaaaaa gaatattcag
      781 ttaccaacaa acacggaaaa agcttatctt agaactacta tttccggatg gggacatgcc
      841 aagccatatg atgcgggaag caggggctgt gcagaatggt gcttcagaac acatactata
      901 gcaataaata atgcgaatac tttccaacac cagctgggtg ctttaggatg ttcagcaaac
      961 cctattaata atcagagtcc gggaaattgg gctcctgaca gagcagggtg gtgtccggga
     1021 atggcagtgc caacacgtat agatgtgttg aataactctt taacgggtag tacttttagt
     1081 tatgaatata agttccagag ttggacaaac aacggaacca atggagatgc tttttatgca
     1141 atttccagtt ttgtgattgc aaaaagtaat acacctatta gtgctccggt agttacaaac
     1201 taaaaaacag catatattca tttatagata aaaccgcctt ggcaaggaat caattacttt
     1261 tcgggtattt gcaaaaacac cccgcagatg aaaatctgcg gggtgtatga aaaaggtaag
     1321 aagttagaga tatctttcct cagccgaaat
//
LOCUS       POTPPIIIK     945 bp ds-DNA             PLN       30-MAY-1990
DEFINITION  Potato proteinase inhibitor IIK gene , 5' flank.
ACCESSION   M29965
KEYWORDS    proteinase inhibitor IIK.
SOURCE      Potato (strain Russet Burbank) DNA.
  ORGANISM  Solanum tuberosum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 945)
  AUTHORS   Palm,C.J., Costa,M.A., An,g. and Ryan,C.A.
  TITLE     Wound-inducible nuclear protein binds DNA fragments that regulate a
            proteinase inhibitor II gene from potato
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 603-607 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.An, 18-NOV-1989.
FEATURES       from  to/span     description
    mRNA        893  >   945     proteinase inhibitor IIK
BASE COUNT      374 a    129 c    138 g    304 t
ORIGIN      
        1 tcgacccaat tcaaagaact tgttcattaa ttattatttt tccagcaaaa tgaaaatttt
       61 tagttaggat gataaaattt atccaaattt ctaccaccac ccaaaactaa atcttccaat
      121 tcagaaaaaa tcattcatta caactcttgg atttgtttgt atgatgtgtt agaagcagaa
      181 atgggtttaa aaggtgaatt aaatggtatt tttaactaaa gaaatgaatg cgggatatat
      241 ttaagttttt tcagataatt catgaatatt tttatcctct ttccgtgtgc aataatagag
      301 ttccaactta attatcacgt ggacttataa gaaaccgatg ctggatgata attatttaaa
      361 aaaacaagca agtcggggtc agtacagctt gctgaatagg agaaattaaa gatagcaacc
      421 aggaaaagtt aagagctttt ggctcctccg tccaattata accgtccact atatatattt
      481 gagactcgta ttgagaaaca ataaataata agaatgatat tactatatta ccttttgaat
      541 atattaaatt taatgctttg aaaaatatat ttgataatga ataatatcta atagcagcaa
      601 gaataaaata gacacaaaca agtaaaatta ctcattgatc tttcaaatta gacaagtatt
      661 atcggacatc tacttttagt atagtaaaca agtaaagatc ggataaagag agtaataaag
      721 aagaagcaag cgtaagtacc ttgccaaaat aattaactaa caagcacatc tttttttttt
      781 tatcaaatat taataaaaat aatttatatt aatatgaaga aaaaaaaggt tttagtttgc
      841 tatctttttg atcactcgtt tgctataaat aggtggagga ggacagacac tcttcacccc
      901 aaaattaaaa gaaaaagagg cagtactaat taattatcca tcatg
//
LOCUS       CADPOLYP     1631 bp ss-RNA             VRL       30-MAY-1990
DEFINITION  Canine distemper virus polymerase-associated (P) protein and C
            protein mRNA, complete cds.
ACCESSION   M32418
KEYWORDS    C protein; P protein; polymerase-associated protein.
SOURCE      Canine distemper virus, cDNA to viral RNA, clone pCDV 30.
  ORGANISM  Canine distemper virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Paramyxoviridae; Morbillivirus.
REFERENCE   1  (bases 1 to 1631)
  AUTHORS   Barrett,T., Shrimpton,S.B. and Russell,S.E.H.
  TITLE     Nucleotide sequence of the entire protein coding region of Canine
            distemper virus polymerase-associated (P) protein mRNA
  JOURNAL   Virus Res. 3, 367-372 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         41     1564     polymerase-associated (P) protein
    pept         63      587     C protein
BASE COUNT      526 a    367 c    396 g    342 t
ORIGIN      
        1 tacccgatca tcattcaccg accacccgtt ctatccctaa atggcagagg aacaggccta
       61 ccatgtcagc aaagggctgg aatgcctcaa agccctcaga gagaatcctc ctgacattga
      121 ggagattcaa gaggtcagca gcctcagaga ccaaacctgc aacccaggcc aagagaatgg
      181 aaccacaggc atgcaggaag aggaggactc tcagaatctc gatgaatcac acgagccaac
      241 aaaaggatca aactatgtcg gccatgtacc tcaaaataat ccgggatgtg gagaacgcaa
      301 tactgcgctt gtggaggcgg agcggccccc tagagaggac atccaaccag gacctggaat
      361 acgatgtgat catgtttatg atcacagcgg tgaagaggtt aagggaatcg aagatgctga
      421 cagtctcgtg gtacctgcag gcactgtcgg taatcgagga ttcgagagag gagaaggaag
      481 ccttgatgat agcactgagg attctggcga agattattcc gaaggaaatg cttcatctaa
      541 ctggggatat tctttcggcc ttaaaccgga cagagcagct gatgtgagca tgctgatgga
      601 agaggaatta agtgctctac tcaggacaag cagaaatgta gggattcaga aaagggatgg
      661 gaagactctg cagttcccac ataatcccga agtaagacaa gggatccgga gtgtggatcc
      721 attaaaaagg ggcacagaag agaggtcagt ctcacatgga atggggatag ttgctggatc
      781 gacaagtggt gcaacccaat ctgcactcaa gtcaactggg ggatcatcag agccaagtgt
      841 ttctgcgggg aatgtccgcc aacctgcaat gaatgcaaag atgacccaga aatgcaaact
      901 cgagtctggc acgcaactcc ctcccaggac ctcaaatgag gctgagtctg acagtgagta
      961 cgatgatgag cttttctctg agatacaaga aattcgatct gccattacta aactaactga
     1021 agataatcaa gcaatactta ctaaactgga taccttatta ctgcttaaag gagagactga
     1081 ttcaattaag aaacaaatca gcaaacaaaa tattgctatt tccacgattg aggggcatct
     1141 atcaagcatt atgatagcta tacctggttt tggaaaggac acgggagatc ctacggcaaa
     1201 tgtcgacatt aacccagagc tccgccctat catagggaga gattcaggaa gaccactagc
     1261 agaagttctc aagcagcccg catcatcccg cggtaatcgg aaggacagtg gtattactct
     1321 gggctcaaaa ggtcaactat tgagagacct ccagctgaaa cccattgaca aagagtctag
     1381 ctcggcaatc ggatacaaac cgaaggatac cgcaccttcc aaagctgtac ttgcatcatt
     1441 gattagatca agcagagttg atcaaagtca caaacataac atgctggctc tgctcaaaaa
     1501 tatcaagggg gatgacaacc taaacgagtt ctaccaaatg gtcaaaagta ttactcatgc
     1561 ttaatctgta gcgttgacta atctactaac cggcgcaaaa ctgctttcac tatcgcttaa
     1621 aagcaattat a
//
LOCUS       ROHSEG4A     2359 bp ds-RNA             VRL       30-MAY-1990
DEFINITION  Human rotavirus segment 4 gene, complete cds.
ACCESSION   M32559
KEYWORDS    .
SOURCE      Human rotavirus (strain RV-5, serotype 2) RNA.
  ORGANISM  Human rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 2359)
  AUTHORS   Kantharidis,P., Dyall-Smith,M.L. and Holmes,I.H.
  TITLE     Marked sequence variation between segment 4 genes of human RV-5 and
            simian SA 11 rotaviruses
  JOURNAL   Arch. Virol. 93, 111-121 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         10     2337     segment 4 protein
BASE COUNT      871 a    343 c    413 g    732 t
ORIGIN      
        1 ggctataaaa tggcttcgct catttataga caacttctca ctaattcata ttcggtagac
       61 ttgcatgatg aaatagaaca gattggatcg gagaaaactc aaagtgtaac agtaaatcca
      121 ggtccatttg cacaaaccag atacgctcca gttaattggg gacatgggga gattaatgat
      181 tcaactacag tggaaccagt tttagatggt ccttatcaac ccactacatt caaaccaccc
      241 aatgattatt ggttccttat tagttcaaat acagatggag tagtctacga aagtacaaat
      301 aataatgatt tttggacagc agttatcgca gttgaaccac atgttagtca aacaaatagg
      361 caatatattt tatttggtga aaataaacag tttaacgtag aaaacaattc agataaatgg
      421 aaatttttcg aaatgtttaa aggtagtagt cagggtgatt tttctaatag acggactcta
      481 acctctaaca atagacttgt agggatgcta aaatatggtg gaagagtatg gacatttcat
      541 ggtgaaacac caagagctac tactgatagt tcaaatactg cggatttaaa taatatatca
      601 attatgattc attcagagtt ttatattatt ccaagatccc aagaatctaa atgtaacgag
      661 tatatcaaga atggtttacc accaattcag aatactagaa acgtagttcc attatcttta
      721 tcatccagat ctattcagta taggagagca caagttaatg aagatattac aatttcaaaa
      781 acttcattat ggaaggaaat gcaatataat agagatatta taataagatt taaatttggt
      841 aatagtgtta taaaactagg aggattggga tataaatggt ctgaaatatc atataaagca
      901 gcgaattatc agtatagtta ttcacgtgat ggtgaacaag ttactgcaca taccacttgt
      961 tcagtaaatg gagtaaataa ttttagctat aatggaggtt cactacctac tgatttcagt
     1021 atttcgagat atgaagttag taaagaaaat tcttatgtgt atatagacta ctgggatgat
     1081 tcaaaagcat ttagaaatat ggtgtatgtt agatcgctag cagcaaattt aaattcagtg
     1141 aaatgtacag gtgggagtta tgattttaga ttgcccgtag gtggatggcc tattatgaat
     1201 ggtggtgctg tatcattaca ttttgctgga gttacattat ctacacagtt cactgatttt
     1261 gtatcattaa attcactgcg atttagattc agtttaacag tagatgaacc atctttctca
     1321 ataatacgaa cacgtacaat aaacttgtac ggattaccag cagctaatcc gaacaatgga
     1381 aatgagtact atgaaatgtc aggaaggttt tcacttattt ctttagttct aaccaatgat
     1441 gattatcaaa ctccaattat gaattcagta acagtaaggc aggatttaga acgccagctt
     1501 aatgatttgc gagaagaatt taattcattg tcacaagaaa tagctatgtc acaattaatt
     1561 gatttagcat tactaccttt agatatgttc tctatgtttt cgggaataaa aagtacaatt
     1621 gatctaacca aatcaatggc aactagtgta atgaaaaaat ttagaaaatc gaaattagct
     1681 acatcaattt cagaaatgac taattcattg tcagatgcgg cttcatcagc atcaagaagt
     1741 gcttctatta gatcaaattt atctacgatt tcaaattgga ctaatacttc aaaaagtgtg
     1801 tcaaatgtaa ctgactcagt aaatgatgtt tcaacacaaa catctacaat tagtaagaaa
     1861 cttagattaa gagagatgat tactcaaact gaaggaatga gttttgatga tatttcagca
     1921 gctgtactga aaacgaaaat agatatgtcc acacaaattg gaaaaaacac tttacctgat
     1981 atagttactg aagcgtctga aaaatttatt ccaaaacgat catatcgagt attaaaagat
     2041 aatgaagtaa tggaaattaa cactgaagga aagttttttg catataaagt ggatacactc
     2101 aatgagatac catttgatat aaataaattc gccgaacttg tcacggattc tccagttata
     2161 tcagcgataa tagactttaa gacgttaaaa aatttaaacg acaattatgg aattactcgt
     2221 attgaagcgt ttaatttaat taaatcgaat ccaaatgtgc tacgtaattt cattaatcaa
     2281 aataatccaa ttataagaaa tagaattgag cagttaattc tacaatgtaa gttgtgagaa
     2341 tgtcatccag gatgtgacc
//
LOCUS       MUSCAIA      1224 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse carbonic anhydrase I (CAI) mRNA, complete cds.
ACCESSION   M32452
KEYWORDS    carbonic anhydrase; carbonic anhydrase I; zinc metalloenzyme.
SOURCE      Mouse anemic spleen, cDNA to mRNA, clone pMCAI.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1224)
  AUTHORS   Fraser,P.J. and Curtis,P.J.
  TITLE     Molecular evolution of the carbonic anhydrase genes: Calculation of
            divergence time for mouse carbonic anhydrase I and II
  JOURNAL   J. Mol. Evol. 23, 294-299 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         46      831     carbonic anhydrase I
BASE COUNT      358 a    299 c    234 g    333 t
ORIGIN      
        1 gccagcatct tctctgaagc tgaatcacaa cctaaataag agaaaatggc aagtgcagac
       61 tggggatatg gaagcgaaaa tggtcctgac caatggagca agctgtatcc cattgccaat
      121 ggtaacaacc agtctcctat tgatattaaa accagtgaag ccaatcatga ctcctctctg
      181 aaaccactca gcatctccta taatcctgca actgccaaag aaattgttaa cgtgggacat
      241 tctttccatg taatttttga tgacagtagc aaccaatctg ttctgaaagg tggccctctt
      301 gctgatagct atcggctcac tcagttccat tttcactggg gcaactcaaa cgaccatggc
      361 tctgagcaca ccgtggatgg aactagatat tctggagagc ttcacttagt tcactggaat
      421 tctgcaaagt actccagtgc ttctgaagcc atctccaagg ctgatggcct ggcaatcctt
      481 ggcgttttga tgaaggttgg tccagccaac ccaagcctgc agaaagtact tgatgctcta
      541 aactcagtta aaactaaggg aaaacgagcc ccattcacaa attttgaccc atccagtctg
      601 cttccttcat ctctggatta ctggacctac tttggctctc tgactcaccc tcctcttcat
      661 gaaagtgtga cctgggtgat ctgcaaggat agcatcagtc taagcccaga gcagctggcc
      721 cagctccgtg gtcttctgtc aagtgcagag ggagagtctg cagttccagt tctgagcaac
      781 caccgtccac cccaacccct gaagggcaga acagtcagag cctcattttg agtcccagca
      841 aggaatgagt cctcaattat gacctggccc cctctacagc agcatccaat aaagggatca
      901 cgattaagaa acagacttat gtcagtgcta gacagtatac ctgcaaattt aatccacaga
      961 actaaaatgc tttcatttta attcctgata cttaatgcaa atactctata agtttgccta
     1021 ctgtaatttt tatgcattcg tttttgctta aatacacctt tttgtggaag tcgttcataa
     1081 ttacaactaa gtgattcatt ctctttctgc tctcttcact ttaaataaac caataaataa
     1141 atcttttaaa aaaaaaaaaa aaaaaagctg agaaataagc gctgatagca gccttccagt
     1201 tctgttcaag tatatggctg cctg
//
LOCUS       RATGLOBBET    287 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Rat hemoglobin beta-chain mRNA, 3' end.
ACCESSION   M32509
KEYWORDS    hemoglobin; hemoglobin beta-chain.
SOURCE      Rat reticulocyte, cDNA to mRNA, clone pBRrg X.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 287)
  AUTHORS   Crkvenjakov,R., Bucan,M., Konstantinovic,M., Fogel,M., Savic,A. and
            Glisin,V.
  TITLE     Characterization of two rat globin cDNA clones
  JOURNAL   Hemoglobin 8, 597-611 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      158     hemoglobin beta-chain (AA at 3)
BASE COUNT       67 a     72 c     68 g     80 t
ORIGIN      
        1 agctgcatgt ggatcctgag aacttcaggc tcctgggcaa catgattgtg attatgatgg
       61 gccaccacct gggcaaggaa ttcaccccga gtgcacaggc tgccttccag aaggtggtgg
      121 ctggagtggc cactgccttg gctcacaagt accactaaac ctcttttcct gctctggtct
      181 ttgtgcaatg gtcaattgtt ccctagagag catctgtcag ttgttgtcaa aatgacatcg
      241 acctttgaaa atctgtccta ctaataaaag gcatttactt tcactgc
//
LOCUS       RATHEMALPH    194 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Rat hemoglobin alpha chain mRNA, partial cds.
ACCESSION   M32510
KEYWORDS    hemoglobin; hemoglobin alpha-chain.
SOURCE      Rat reticulocyte, cDNA to mRNA, clone pBRrg 5.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 194)
  AUTHORS   Crkvenjakov,R., Bucan,M., Konstantinovic,M., Fogel,M., Savic,A. and
            Glisin,V.
  TITLE     Characterization of two rat globin cDNA clones
  JOURNAL   Hemoglobin 8, 597-611 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   194     hemoglobin alpha-chain (AA at 3)
BASE COUNT       35 a     73 c     39 g     47 t
ORIGIN      
        1 tcgacctgcc tggtgccctg tccactctga gcgacctgca tgcccacaaa cttgatcctg
       61 tcaacttcaa gttcctgagc cactgcctgc tggtgacctt ggcttgccac caccctggag
      121 atttcacacc cgccatgcac gcctctctgg acaaattcct tgcctccgtg agcaccgtgc
      181 ttacctccaa gtac
//
LOCUS       HUMIGKVAE    1241 bp ds-DNA             PRI       30-MAY-1990
DEFINITION  Human germline kappa-chain gene (A2) V-region, exons 1 and 2.
ACCESSION   M31952
KEYWORDS    germline; immunoglobulin light chain; kappa-immunoglobulin;
            variable region; variable region subgroup VK-II.
SOURCE      Human DNA, isolated from immune serum of Hib-PS vaccine recipients.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1241)
  AUTHORS   Scott,M.G., Crimmins,D.L., McCourt,D.W., Zocher,I., Thiebe,R.,
            Zachau,H.G. and Nahm,M.H.
  TITLE     Clonal characterizaton of the human IgG antibody repertoire to
            Haemophilus influenzae type B polysaccharide: III. A single
            V-kappa-II gene and one of several JK genes are joined by an
            invariant arginine to form the most common L chain V region
  JOURNAL   J. Immunol. 143, 4110-4116 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        325      373     Ig kappa-chain V-region, exon 1
                742  /  1052     Ig kappa-chain V-region, exon 2
    IVS         374      741     IgK intron A
    iDNA       1053  >  1241     intervening DNA (5' end put.)
BASE COUNT      318 a    306 c    283 g    334 t
ORIGIN      
        1 ccgacaagaa tttggaagcc ctgacatcct ataaaacgtt acttgcccaa gattgaaact
       61 ttcaaattca ggtccttctt cctcctctat gatgaattag gttttattag tttcctccaa
      121 gggacacttt atatcacatt gctcacagag aagacatatc tacccccttc accccccacc
      181 caatggctct ttccacacca ctgcacccac caggtgattt gcatattgtc ccctagggag
      241 gaccttccct tgtgaggcct gagataaaag ctcagctcta accttgcctt gactgatcag
      301 gacttctcag ttcatcttct caccatgagg ctccctgctc agctcctggg gctgctaatg
      361 ctctggatac ctggtaagga tggaaggaga tgagggagga ggagggggtg ggaagctgag
      421 ctctggcggc cccactgatt cccgtgttta ttctaaccat gtgttaaagg aatatggcct
      481 atgctccagg gagaggaatt catattttgc cctgatgatg atttgaaaac tcctaaaagc
      541 agtgctctga ataatatctt gagaaatgaa agaactcttg tgcctattta ataaagggtt
      601 catttaaaga gtttgttttt atgatatgaa tacaaatttg taaaaataaa agattagcca
      661 taaatcaata ccataaggca aatctcaaaa gttgttcatt atgctttcac ataaccttgc
      721 acttctctct cataatttca ggatccagtg cagatattgt gatgacccag actccactct
      781 ctctgtccgt cacccctgga cagccggcct ccatctcctg caagtctagt cagagcctcc
      841 tgcatagtga tggaaagacc tatttgtatt ggtacctgca gaagccaggc cagcctccac
      901 agctcctgat ctatgaagtt tccaaccggt tctctggagt gccagatagg ttcagtggca
      961 gcgggtcagg gacagatttc acactgaaaa tcagccgggt ggaggctgag gatgttgggg
     1021 tttattactg catgcaaagt atacagcttc ctcccacagt ggtacagacc aatacagaaa
     1081 cctccctgct ggggtgtccc agctgctcac ttgcactgct tgtctgggga gtagctcagc
     1141 agggactctg agtctgcaga agaggaggct gttggagaac tcagtggcag ggcttgctgt
     1201 tgaggactct ggcccatgag gtctcagcag cacctcagtc c
//
LOCUS       RATS14        350 bp ds-DNA             ROD       30-MAY-1990
DEFINITION  Rat S14 protein gene, 5' flank.
ACCESSION   M33553
KEYWORDS    S14 protein.
SOURCE      Rat (strain Sprague-Dawley) male liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 350)
  AUTHORS   Jump,D.B., Bell,A. and Santiago,V.
  TITLE     Thyroid hormone and dietary carbohydrate interact to regulate rat
            liver S14 gene transcription and chromatin structure
  JOURNAL   J. Biol. Chem. 265, 3474-3478 (1990)
  STANDARD  simple staff_review
BASE COUNT       71 a     77 c     96 g    106 t
ORIGIN      
        1 ctagacacta agtgacatat gttcatatag atagtgtttt taaactggaa agactgtccc
       61 taccttggtt tgtagccacc tgggctgagg gccagctgag gttacttggg gcctggcagc
      121 ctgggtacca cttgctcatc ttacccttgg atcacattca catgatgttc tggccaggcc
      181 cttgacccca gttcctctgg ggccaattca atttgtccag cctagggcat tctgggtgag
      241 gttattccaa aagatgtgtc agccaaccct ccgtttctcc acagggttgt agtcatggaa
      301 agactagaga aagtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt
//
LOCUS       MUSPTKA       211 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD15.
ACCESSION   M33421
KEYWORDS    protein tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 211)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   211     protein tyrosine kinase (AA at 3)
BASE COUNT       47 a     55 c     64 g     45 t
ORIGIN      
        1 ggatccacag ggacctggct gctcggaact gcctggtgac agagaagaat gtcctgaaga
       61 tcagcgactt tgggatgtcc cgcgaagaag ctgatgggat ctatgccgcc tgcagcggcc
      121 tcagacaagt ccctgttaag tggactgccc ctgaggccct taactacgga cgctactcct
      181 cagagagtga tgtgtggagc tttggaattc c
//
LOCUS       MUSPTKB       211 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD16.
ACCESSION   M33422
KEYWORDS    protein tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD16.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 211)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   211     protein tyrosine kinase (AA at 3)
BASE COUNT       50 a     50 c     59 g     52 t
ORIGIN      
        1 ggatccacag agaccttgct gctaggaact gcatggatgc cgaagatttc acagtaaaaa
       61 ttggagattt cggtatgaca cgagacatct acgagacgga ctactaccgg aaaggcggga
      121 aggggttgct gcctgtgcgc tggatgtctc tcgagtccct caaggatggt gtcttcacta
      181 ctcattctga cgtctggtcc ttcggaattc c
//
LOCUS       MUSPTKC       214 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD17.
ACCESSION   M33423
KEYWORDS    protein tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD17.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 214)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   214     protein tyrosine kinase (AA at 3)
BASE COUNT       69 a     42 c     58 g     45 t
ORIGIN      
        1 ggatccacag ggacctggca acaaggaaca tattggtgga aaatgagaac agggttaaaa
       61 taggagactt cggattaacc aaagtcttgc cgcaggacaa agaatactac aaagtaaagg
      121 agccagggga aagaccgata ttctggtacg cacctgaatc cttgacggag agcaagtttt
      181 ctgtggcctc agatgtctgg tcctttggaa ttcc
//
LOCUS       MUSPTKD       217 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD19.
ACCESSION   M33424
KEYWORDS    protein tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD19.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 217)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   217     protein tyrosine kinase (AA at 3)
BASE COUNT       64 a     44 c     57 g     52 t
ORIGIN      
        1 ggatccacag agacttagct gcaagaaact gcatgttgga tgaaaaattc actgtcaagg
       61 ttgctgattt cggtcttgcc agagacatgt acgataaaga gtactatagt gtccacaaca
      121 agacgggtgc caagctacca gtgaagtgga tggctttaga gagtctgcaa aggcagaagt
      181 tcaccaccac gtcagatgtg tggtcctttg gaattcc
//
LOCUS       MUSPTKE       214 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD22.
ACCESSION   M33425
KEYWORDS    protein tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD22.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 214)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   214     protein tyrosine kinase (AA at 3)
BASE COUNT       59 a     46 c     55 g     54 t
ORIGIN      
        1 ggatccaccg ggacttagca gcaagaaatg tccttgttga gagtgagcat caagtgaaga
       61 tcggagactt tggtttaacc aaagcaattg aaaccgataa ggagtactac acagtcaagg
      121 acgaccggga cagcccagtg ttctggtacg ctccggagtg tttaatccag tgtaaatttt
      181 atatcgcctc tgacgtctgg tcctttggaa ttcc
//
LOCUS       MUSPTKF       208 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD175.
ACCESSION   M33426
KEYWORDS    protein tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD175.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 208)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   208     protein tyrosine kinase (AA at 3)
BASE COUNT       49 a     53 c     55 g     51 t
ORIGIN      
        1 ggatccaccg tgatctgcga gctgctaacg tcctggtctc tgagtcactc atgtgcaaga
       61 ttgcagactt tggcctcgcg agagtcatcg aagataacga gtacacagca agggaaggtg
      121 cgaagttccc tatcaagtgg acagctccag aggcgttcaa cttcggctgc ttcactatca
      181 aatctgacgt gtggtccttt ggaattcc
//
LOCUS       MUSPTKG       208 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse protein tyrosine kinase (PTK) mRNA, partial cds.
ACCESSION   M33427
KEYWORDS    protein tyrosine kinase.
SOURCE      Mouse haemopoietic cell line WEH1-3B D+, cDNA to mRNA, clone W3.13.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 208)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   208     protein tyrosine kinase (AA at 3)
BASE COUNT       53 a     44 c     56 g     55 t
ORIGIN      
        1 ggatccacag agacctggct gccagaaatt gtctagtgaa tgaagcagga gttgtcaaag
       61 tatctgattt tggaatggcc aggtacgttc tggatgatca gtacacaagt tcttctggcg
      121 ccaagttccc tgtgaagtgg tgtcccccag aagagtttaa ttacagccgc tttagcagca
      181 agtcagacgt gtggtcctat ggaattcc
//
LOCUS       DIRPARAM     2545 bp ss-mRNA            INV       30-MAY-1990
DEFINITION  D.immitis paramyosin mRNA, partial cds.
ACCESSION   M29733
KEYWORDS    paramyosin.
SOURCE      D.immitis, cDNA to mRNA.
  ORGANISM  Dirofilaria immitis
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 2545)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >  2545     paramyosin (AA at 1)
BASE COUNT      949 a    420 c    544 g    632 t
ORIGIN      
        1 gcattcggta gtatgtcggt tgccgatttg ggaagtttaa ctcgcttgga ggacaagatt
       61 cgtctactgc aagaagatct cgaatccgaa cgtgaacttc gaaatagaat cgaacgagaa
      121 agagctgatc ttagtgtaca actgattgca ttaactgata gacttgaaga tgctgagggt
      181 actactgata gtcagattga atcaaatcgt aaacgtgaag cagaattgca aaaattacgt
      241 aaattattgg aagaatcaca attagaaaat gaagatgcaa tgaatgtttt acgtaaaaag
      301 catcaagatg catgtctcga ttacgctgaa caaattgaac aattacaaaa gaaaaattca
      361 aagattgatc gtgaacgtca acgtctgcaa catgaagtaa ttgagcttac tgcgacaatt
      421 gatcaacttc aaaaggataa gcatttggcg gaaaaagcag cggaacgttt tgaagcgcaa
      481 actatcgaat tgagtaataa agttgaagat ttaaatcgac atgttaatga tttagctcaa
      541 caacgtcaac gtttacaagc tgaaaataac gatcttctca aagagattca tgatcaaaaa
      601 gtacaattgg ataatttgca acacgtgaaa tatcaacttg cgcaacaact tgaagaagca
      661 cgtcgacgtt tagaagatgc tgaacgcgaa cgttctcaac ttcaagcaca acttcatcaa
      721 gtacaattag aattggattc tgtacgtacc gcattagatg aagaatcagc agctcgagca
      781 gaagctgaac ataaacttgc tttagcaaat accgagatta cccaatggaa gagtaaattt
      841 gatgctgaag ttgctcttca tcatgaagag gttgaagatt tacgaaagaa aatgctccaa
      901 aaacaggctg aatatgagga acaaattgaa ataatgttgc aaaaaatttc acaacttgaa
      961 aaagctaaat cacgcttaca aagtgaagtt gaagtattaa ttgttgattt ggaaaaagca
     1021 caaaatacaa tagcaattct tgaacgagca aaagaacaac tcgaaaaaac tgttaatgaa
     1081 ttaaaggtac gtattgatga acttactgtt gaacttgaag cagcacaacg tgaagctcgt
     1141 gctgcattag cagaattaca aaagatgaaa aatttatacg aaaaagcaat tgagcaaaaa
     1201 gaagccttag ctcgagaaaa taaaaaatta caagatgatt tgcatgaagc aaaagaagcg
     1261 cttgctgatg caaatcgtaa attgcatgag cttgatcttg aaaatgcacg attagctggt
     1321 gaaatacgag aattgcaaac agctcttaag gaatctgaag ctgcacgacg tgatgctgag
     1381 aatcgtgcac aacgtgcact tgctgaatta caacaacttc gtattgaaat ggaacgtcgt
     1441 ttgcaagaga aagaggaaga aatggaagca cttcgtaaga atatgcaatt tgaaattgat
     1501 cgacttacag ctgcattagc cgatgctgaa gcacgtatga aggcggaaat ttcacgtcta
     1561 aagaagaagt atcaagcgga aattgccgaa cttgaaatga ccgttgataa tttgaatcgg
     1621 gcaaatattg aagcgcaaaa gacgattaaa aaacaaagtg aacaacttaa aatattgcaa
     1681 gcatcacttg aggatactca acgtcaacta cagcaaactc ttgatcagta tgcattggcg
     1741 caacgtaaag tttccgcact tagcgctgaa ttagaagaat gcaaagttgc acttgataat
     1801 gcaattcgtg ctcgtaaaca agctgaaatt gatcttgaag aagcgaatgc tcgtattact
     1861 gatttggtat caattaacaa taatttaaca gcaattaaaa ataaattgga aactgaatta
     1921 tcgaccgcac aagctgattt ggatgaagca acgaaagaat tgcatgccgc ggatgagcgg
     1981 gctaatcgtg ctttagcaga tgcagctcga gctgttgaac aacttcatga ggagcaggaa
     2041 cattccatga aaattgatgc tctcaggaaa tcattagaag agcaggtgaa gcaattacaa
     2101 gtacaaattc aagaagctga agctgcagca ttattgggtg gtaaacgtgt aattgctaaa
     2161 cttgaaacac gtatacgtga tttggaaact gctttggatg aagaaacacg tcgacataag
     2221 gaaacgcaag gtgcgcttag aaagaaagat cgacgcatca aagaagttca aatgcaagtt
     2281 gatgaagagc ataagatgtt tgtgatggcg caagatactg ctgataggct tcttgaaaaa
     2341 cttaacattc aaaagtggca acttggagaa gctgaatctt taacgatggc taatcttcaa
     2401 cgagtacgtc gatatcaacg tgaattggaa gatgctgaag gtcgtgctga tcaagctgaa
     2461 agctcattac atcttattcg tgctaaacat cgttcgtcag ttgttaccgg caaaaatgct
     2521 tcagcatcta aaatttacgt tctcg
//
LOCUS       ONGPARAM1     134 bp ss-mRNA            INV       30-MAY-1990
DEFINITION  O.volvulus paramyosin mRNA partial cds, clone OV-1.
ACCESSION   M29734
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 134)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   134     paramyosin (AA at 1)
BASE COUNT       53 a     23 c     28 g     30 t
ORIGIN      
        1 ttacaaaaga aaaattcaaa aattgaccgt gaacgtcaac gtctgcaaca tgaagtaatt
       61 gagcttacag cggcaattga tcaacttcaa aaggataagc atttggctga gaaagcagca
      121 gaacgttttg aagc
//
LOCUS       ONGPARAM2     135 bp ss-mRNA            INV       30-MAY-1990
DEFINITION  O.volvulus paramyosin mRNA partial cds, clone OV-2.
ACCESSION   M29735
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 135)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   135     paramyosin (AA at 2)
BASE COUNT       43 a     31 c     26 g     35 t
ORIGIN      
        1 tttgcaacat gtcaaatatc aacttgcgca acaacttgaa gaagcacgtc gtcgtttgga
       61 agatgctgaa cgtgaacgtt cacaacttca agcacaactt catcaagttc agctagaatt
      121 ggattccgta cgtac
//
LOCUS       ONGPARAM3     176 bp ss-mRNA            INV       30-MAY-1990
DEFINITION  O.volvulus paramyosin mRNA, partial cds, clone OV-3.
ACCESSION   M29736
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 176)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   176     paramyosin (AA at 1)
BASE COUNT       74 a     20 c     36 g     46 t
ORIGIN      
        1 aaagttgctc ttcatcatga agaggttgaa gatttacgaa agaaaatgtt gcaaaagcaa
       61 gctgaatatg aagagcaaat tgaaataatg ctgcaaaaaa tttcacaact tgaaaaggct
      121 aaatcacgtt tgcaaagtga agttgaagta ttgattgttg atttagaaaa agcaca
//
LOCUS       ONGPARAM4     414 bp ss-mRNA            INV       30-MAY-1990
DEFINITION  O.volvulus paramyosin mRNA, partial cds, clone OV-4.
ACCESSION   M29737
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 414)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   414     paramyosin (AA at 3)
BASE COUNT      143 a     70 c    101 g    100 t
ORIGIN      
        1 ccttagctcg agaaaataaa aagttgcaag atgatttgca tgaagctaaa gaagcgcttg
       61 ctgatgcaaa tcgtaaattg catgaacttg atcttgagaa tgcacgatta gctggtgaaa
      121 ttcgagaatt gcaaacagca ctcaaagaat ctgaagcagc tcgacgtgat gctgagaatc
      181 gtgcacaacg tgcacttgct gaattgcaac aacttcgcat cgaaatggaa cgtcgtttgc
      241 aagagaagga agaagaaatg gaagcacttc gtaagaatat gcaatttgaa attgatcgtc
      301 ttacagctgc attagctgat gctgaagcac gtatgaaggc ggagattgca cgtttgaaga
      361 aaaagtatca ggcggaaatt gccgaacttg aaatgaccgt tgataatttg aatc
//
LOCUS       ONGPARAM5     145 bp ss-mRNA            INV       30-MAY-1990
DEFINITION  O.volvulus paramyosin mRNA, partial cds, clone OV-5.
ACCESSION   M29738
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 145)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   145     paramyosin (AA at 3)
BASE COUNT       48 a     28 c     28 g     41 t
ORIGIN      
        1 tattgcaagc atcacttgaa gatactcaac gtcaattaca acaaactcta gatcagtatg
       61 ctttggcaca acgtaaagta tctgcgctta gtgctgaatt ggaagaatgc aaagttgcac
      121 ttgataatgc aattcgtgct cgtaa
//
LOCUS       ONGPARAM6     129 bp ss-mRNA            INV       30-MAY-1990
DEFINITION  O.volvulus paramyosin mRNA, partial cds, clone OV-6.
ACCESSION   M29739
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 129)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   129     paramyosin (AA at 3)
BASE COUNT       48 a     20 c     31 g     30 t
ORIGIN      
        1 aagtacaaat tcaagaagct gaagcagcag cattattggg tggtaaacgt gtaattgcta
       61 aacttgaaac acgtatacgt gatttggaaa ctgctttgga tgaagaaaca cgtcgacata
      121 aggaaacgc
//
LOCUS       MUSA1AT      1350 bp ss-mRNA            ROD       30-MAY-1990
DEFINITION  Mouse alpha-1-antitrypsin (AT) mRNA, complete cds.
ACCESSION   M33567
KEYWORDS    alpha-1-antitrypsin.
SOURCE      Mouse liver, cDNA to mRNA.
  ORGANISM  Mus caroli
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1350)
  AUTHORS   Latimer,J.J., Berger,F.G. and Baumann,H.
  TITLE     Highly conserved upstream regions of the alpha-1-antitrypsin gene
            in two mouse species govern liver-specific expression by different
            mechanisms
  JOURNAL   Mol. Cell. Biol. 10, 760-769 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         49     1287     alpha-1-antitrypsin (AT)
    mRNA          1  >  1350     alpha-1-antitrypsin mRNA
BASE COUNT      355 a    391 c    306 g    298 t
ORIGIN      
        1 gtctcctcag cctccgtcac cacccagctc tgggacagca agctgaaaat gactccctcc
       61 atttcatggg gtctactgct tctggcaggc ctgttttgcc tggtccccag ctttctagct
      121 gaggatgttc aggagacaga cacctccaga agggattcag tcccagcctc ccatgatact
      181 ccctacaacc tcgagctttc catcagccta taccgggagc ttggacacaa gtccaccact
      241 tccaacatct tcttctccca ggtgagcatt gccacagcct ttgctatgct ctccctaggg
      301 gagaagggtg acactcacac gcagatccta gagggcctgc agttcaacct cacacaaaca
      361 tcggaggctg acatccacaa ggccttccaa cacctcctcc aaaccctcaa caggccagac
      421 agtgagctgc agctgagcac aggcaatggc tctttgctca acaatgacct gaagctggtg
      481 gagaagtttc tggaagaggc caagaacaat tatcactcag aagtcttctc cgtcaacttt
      541 gcagagtcag aggaggccaa gaaagtgatc aatgattttg tggagaaggg aacccaagga
      601 aagatagctg aggcggtgaa agaccctgac gaagacacag ttttcgccct ggcgaattac
      661 attctcttta aaggcaaatg gaagaagcca ttcgatccta agcacactga ggaagctgaa
      721 ttccacgtgg acacagtcac cacggtgaag gtgcccatga tgaccctcac gggcatgctt
      781 gacgtgcacc attgcagcac gctgtccagc tgggtgctgc tgatggatta cctgggcaat
      841 cgcactgctg tcttcctcct gcccgatgat ggcaagatgc agcatctgga gcaaactctc
      901 aacaaggagc tcatctctaa gttcctgcta aacaggcaca gaaggttagc ccaggtccac
      961 ttacccagat tgtccctctc tggaaactat actttgaaca cactcatgag tcacctgggc
     1021 atcacccgga tcttcaacaa tggggctgac ctctccggaa tcacagagga gaatgctccc
     1081 ctgaagctca gcaaggctgc agataaggcc gtgctgacca tggatgagac aggaacagaa
     1141 gctgcagcag ctacagtctt acaagccgtt cctatgtcta tgccccctat cctgaacttc
     1201 aacaaacctt tcattttcat aatagttgaa gaacacactc agagccccct ctttgtggga
     1261 aaagtggtag atcccacacg taaatgacca ccctaagaaa tcatccttcc tttctgaatt
     1321 gggtcccttc cttaaactct gacggctgct
//
LOCUS       XEBALPA       586 bp ss-mRNA            VRT       30-MAY-1990
DEFINITION  X.borealis alpha-I globin mRNA, complete cds.
ACCESSION   M32453
KEYWORDS    alpha globin.
SOURCE      X.borealis adult immature red blood cell, cDNA to mRNA.
  ORGANISM  Xenopus borealis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 586)
  AUTHORS   Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
  TITLE     Globin evolution in the genus Xenopus: Comparative analysis of
            cDNAs coding for adult globin polypeptides of Xenopus borealis and
            Xenopus tropicalis
  JOURNAL   J. Mol. Evol. 23, 211-223 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         42      470     alpha-I globin protein
BASE COUNT      175 a    140 c    111 g    160 t
ORIGIN      
        1 atctggtgta gactgcttat tgcacaacac aaacaacaac catgcttctt tcagccgatg
       61 acaagaaaca catcaaggca attatgcctt ctattgctgc tcatggcgac aaatttggtg
      121 gagaagcttt gtacaggatg ttcttggtca accccaagac caaaacctac ttccctactt
      181 ttgacttcca ccacaattca aaacagatca gtgctcatgg caagaaagtt gtggatgctc
      241 tgaatgaagc ctccaaccat ttggataaca ttgctggaag cctgagcaag ctgagtgacc
      301 tccatgccta tgacctgaga gtggatccgg gcaacttccc attgctggcc cataatattt
      361 tggtggttgt tgctatgaac ttccctaagc agtttgatcc tgcaacccat aaggccctgg
      421 ataagttcct ggctaccgtt tcttctgttc tgacatccaa atatcgttaa ggctcaacag
      481 caacagtagc agaagtctca acatcagaca tcagttaatt atatgcaatt aaactgacaa
      541 agcatgttaa agaatgttct gaaataaaca tttgaaccat tatcct
//
LOCUS       XEBALPB       525 bp ss-mRNA            VRT       30-MAY-1990
DEFINITION  X.borealis alpha-II globin mRNA, complete cds.
ACCESSION   M32455
KEYWORDS    alpha globin.
SOURCE      X.borealis adult immature red blood cell, cDNA to mRNA.
  ORGANISM  Xenopus borealis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 525)
  AUTHORS   Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
  TITLE     Globin evolution in the genus Xenopus: Comparative analysis of
            cDNAs coding for adult globin polypeptides of Xenopus borealis and
            Xenopus tropicalis
  JOURNAL   J. Mol. Evol. 23, 211-223 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         43      471     alpha-II globin protein
BASE COUNT      159 a    123 c     98 g    145 t
ORIGIN      
        1 atctggtgtg tactgctaat tgcaacaaaa caaacagcaa caatgcttct tacagctgat
       61 gacaagaaac acatcaaggc aattttgcct tccatagctg ctcatggcga caaatttggt
      121 ggagaagctt tgtacagaat gttcttgatt aatcctaaaa ccaaaaccta ctttcctaat
      181 tttgacttcc accacaattc aaaacaaatc agtgctcatg gcaagaaagt tgtggatgct
      241 ctgaatgaag ctgccaacca cttggataac attgctggaa gcatgagcaa gctgagtgac
      301 ctccatgcct atgacttgag agtggatccg ggcaacttcc cattgctggc ccataatatt
      361 ctggtgactg ttgctatgta cttccctcag cagtttgatc ctcataccca taaggctctg
      421 gataagttcc tggcttccgt atcttctgtt ctgacatcca aatatcgtta aggctcagca
      481 gcaaaagcaa cagcacaagt ttcaatatta atcatcagta aatca
//
LOCUS       XEBBETA       605 bp ss-mRNA            VRT       30-MAY-1990
DEFINITION  X.borealis beta-I globin mRNA, complete cds.
ACCESSION   M32456
KEYWORDS    beta globin.
SOURCE      X.borealis adult immature red blood cell, cDNA to mRNA.
  ORGANISM  Xenopus borealis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 605)
  AUTHORS   Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
  TITLE     Globin evolution in the genus Xenopus: Comparative analysis of
            cDNAs coding for adult globin polypeptides of Xenopus borealis and
            Xenopus tropicalis
  JOURNAL   J. Mol. Evol. 23, 211-223 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         47      487     beta-I globin protein
BASE COUNT      177 a    146 c    126 g    156 t
ORIGIN      
        1 acgggttatt tttgcagaag ctcagaataa actctaaact ttgaccatgg gtttgacagc
       61 acatgatcgt cagctgatca acagcacctg gggcaaagta tgtgccaaga ctataggaaa
      121 agaggccctt ggacgtctgc tgtggactta tccctggacc caaagatact ttagttcttt
      181 tgggaacctc aacagtgctg acgccgtctt ccacaatgaa gcagtggctg ctcatggtga
      241 aaaggtggtg acatctattg gagaggccat caagcacatg gatgacatta agggatatta
      301 tgctcagctg agcaaatacc actcagaaac cctacatgtg gatccatgca acttcaagcg
      361 cttcggtggc tgcttgtcca tttccctggc tcgccaattc catgaagaat atacacctga
      421 gctacatgct gcctatgaac atctgtttga tgcaattgct gatgcccttg ggaagggtta
      481 ccactaaacc atcctcaaga acacccaaat ggagtctcta agctacataa taccaaccta
      541 caaaatgttt tcctctaaaa cgtagccatt catatctgct cctaataaaa agaaagtttc
      601 ttcac
//
LOCUS       XEBBETB       619 bp ss-mRNA            VRT       30-MAY-1990
DEFINITION  X.borealis beta-II globin mRNA, complete cds.
ACCESSION   M32458
KEYWORDS    beta globin.
SOURCE      X.borealis adult immature red blood cell, cDNA to mRNA.
  ORGANISM  Xenopus borealis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 619)
  AUTHORS   Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
  TITLE     Globin evolution in the genus Xenopus: Comparative analysis of
            cDNAs coding for adult globin polypeptides of Xenopus borealis and
            Xenopus tropicalis
  JOURNAL   J. Mol. Evol. 23, 211-223 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         48      488     beta-II globin protein
BASE COUNT      185 a    152 c    130 g    152 t
ORIGIN      
        1 cacgagttct ttttgcagaa actcagaata ctctctcaat tttaaccatg ggtttgacag
       61 cacatgaaaa acagctcatc accggcagct ggggcaaaat aaatgccaaa gccataggaa
      121 aagaggccct cggcaggctg ctgaatactt ttccttggac ccaaagatac tttagttctt
      181 ttgggaacct cggcagtgct gaagccatct tccacaatga agctgtagct gctcatgggg
      241 aaaaggtagt gacatctgtt ggagaggcca tcaaacacat ggatgacata aagggctatt
      301 atgctgagct gagcaagtac cactctgaaa ccctgcatgt ggatccaaac aacttcaagc
      361 gcttcggtgg ctgcttgtcc attaccctgg gtcaccactt tggtgaagaa tatacacccg
      421 agttacatgc tgcctatgaa catctgtttg atgcaattgc cgacgccctt ggcaagggtt
      481 accactgaac cagcctcaag aacacctgaa cggagtcgca aagctacata gaaacaacct
      541 acagtacaca ttgctaaaat gttttttccc cccaaaatgt agccattcat atgagctcct
      601 aataaaataa agtttcttc
//
LOCUS       XETALPA       581 bp ss-mRNA            VRT       30-MAY-1990
DEFINITION  X.tropicalis alpha globin mRNA, complete cds.
ACCESSION   M32454
KEYWORDS    alpha globin.
SOURCE      X.tropicalis adult immature red blood cell, cDNA to mRNA.
  ORGANISM  Xenopus tropicalis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 581)
  AUTHORS   Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
  TITLE     Globin evolution in the genus Xenopus: Comparative analysis of
            cDNAs coding for adult globin polypeptides of Xenopus borealis and
            Xenopus tropicalis
  JOURNAL   J. Mol. Evol. 23, 211-223 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         42      470     alpha globin protein
BASE COUNT      170 a    138 c    119 g    154 t
ORIGIN      
        1 agcatctggt gtggactgct tattgcacaa cacaaacaac aatgcatctt acagctgatg
       61 acaagaaaca catcaaggcc atttggcctt ctgtagctgc tcatggtgac aaatatggcg
      121 gagaagcttt gcacaggatg ttcatgtgtg ctcccaagac caaaacctac tttcctgatt
      181 ttgacttcag cgaacattca aaacacatct tggctcatgg caagaaagtt tcggatgctc
      241 tgaatgaggc ttgcaaccat ctggacaaca ttgccggatg cctgtccaag ctgagtgacc
      301 tccatgccta tgacctgaga gtggatccag gcaacttccc attgctggcc catcaaattc
      361 tggtggttgt tgctatccat ttccctaagc agtttgaccc tgcaacccat aaggccctgg
      421 acaagttcct ggtttccgta tctaatgttc tgacatccaa atatcgttaa ggctcagcag
      481 taacagtagc agaagtttgg acatcagaca tcagttaatg acaaacaatc aaactgacac
      541 agcttgtgaa agaatgttct gaaataaaca tttttaaaat t
//
LOCUS       XETBETA       563 bp ss-mRNA            VRT       30-MAY-1990
DEFINITION  X.tropicalis beta globin mRNA, complete cds.
ACCESSION   M32457
KEYWORDS    beta globin.
SOURCE      X.tropicalis adult immature red blood cell, cDNA to mRNA.
  ORGANISM  Xenopus tropicalis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 563)
  AUTHORS   Knoechel,W., Korge,E., Basner,A. and Meyerhof,W.
  TITLE     Globin evolution in the genus Xenopus: Comparative analysis of
            cDNAs coding for adult globin polypeptides of Xenopus borealis and
            Xenopus tropicalis
  JOURNAL   J. Mol. Evol. 23, 211-223 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         52      495     beta globin protein
BASE COUNT      170 a    133 c    115 g    145 t
ORIGIN      
        1 aacacaggct ctttttttgc agaagcttaa aataaacgct cagctttgac catggttaat
       61 ctgacagcaa aggaacgcca gctcatcact ggcacctgga gcaaaatatg tgcaaagacc
      121 ttgggaaaac aggcccttgg cagtatgctg tatacttatc cttggaccca aagatacttt
      181 agtagttttg ggaacctcag cagtatcgaa gccatcttcc acaatgcagc tgtggctact
      241 catggtgaaa aggtgttgac atctattgga gaggccatca aacacatgga tgacataaag
      301 ggatattatg ctcagctgag caagtaccac tcagaaaccc tacatgtgga tccatacaat
      361 ttcaagcgct tctgtagctg cacgattatt tccatggctc aaaccctcca agaagacttt
      421 acacctgagc tacaggctgc tttcgaaaag ctgtttgctg caattgctga tgcccttggc
      481 aagggttacc actaaaccaa cctcaagaac atgtgactgg agtctcttag ctacacagaa
      541 acaaaatctc gttttttttc aaa
//
LOCUS       ROB11AA       667 bp ds-RNA             VRL       30-MAY-1990
DEFINITION  Bovine rotavirus gene 11, complete cds.
ACCESSION   M33606
KEYWORDS    .
SOURCE      Bovine rotavirus (strain VMRI) viral RNA.
  ORGANISM  Bovine rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 667)
  AUTHORS   Matsui,S.M., Mackow,E.R., Matsuno,S., Paul,P.S. and Greenberg,H.B.
  TITLE     Sequence analysis of gene 11 equivalents from "short" and "super
            short" strains of rotavirus
  JOURNAL   J. Virol. 64, 120-124 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         22      618     gene 11 protein
BASE COUNT      219 a    123 c    139 g    186 t
ORIGIN      
        1 ggcttttaaa gcgctacagt gatgtctctc agtattgacg tgacgagtct tccttctttt
       61 tcttctagca tttataagaa tgaatcatct gcaacagcgt caactctttc tggaaaatct
      121 attggtagga gtgtacaata tgtttcacca gatgcagaag cattcagtaa atatatgctg
      181 tcaaagtctc cagaagatat tggaccatct gattctgctt caaacgatcc actcaccagc
      241 ttttcgatta gatcgaatgc agttaagaca aatgcagacg ctggggtgtc tatggattca
      301 tcagtacaat cacgaccatc aataaatgtt gggtgcgatc aagtggattt ctcctttaat
      361 aaaggaatca aagtaaatgc taacctggat tcatcaatat cagtgtcaac aaattcaaga
      421 aaggagaaat ccaaaggtga tcgtaaaagt aggaaacact accctaaaat tgaagcagaa
      481 tctgattcgg atgagtacgt acttgacgat tcagacagtg atgatggcaa atgtaggaat
      541 tgtaaatata aaaggaagta tttcgcacta agaatgagaa tgaaacaagt agctatgcaa
      601 ttaatcgaag atttgtaggt ctgacctgag aggtcactag ggagctcccc actcccgttt
      661 tgtgacc
//
LOCUS       ROH11AA       658 bp ds-RNA             VRL       30-MAY-1990
DEFINITION  Human rotavirus gene 11, complete cds.
ACCESSION   M33607
KEYWORDS    .
SOURCE      Human rotavirus (strain 69M) viral RNA.
  ORGANISM  Human rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 658)
  AUTHORS   Matsui,S.M., Mackow,E.R., Matsuno,S., Paul,P.S. and Greenberg,H.B.
  TITLE     Sequence analysis of gene 11 equivalents from "short" and "super
            short" strains of rotavirus
  JOURNAL   J. Virol. 64, 120-124 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         22      609     gene 11 protein
BASE COUNT      221 a    118 c    127 g    192 t
ORIGIN      
        1 ggcttttaaa gcgctacagt gatgtctctc agtattgacg tgacaagtct tccctcaatt
       61 tcttctagta tttataaaca tgaatcgtct tcaacaacgt caactatttc tggaaaatct
      121 attggtagga gtgaacagta catttcacca gatgcagaag ctttcaataa atacatgttg
      181 tcgaagtctc cagaggatat tggaccatct gattctgcat caaacgatcc actcaccagc
      241 ttttcgatta gatcgaatgc agttaagaca aatgcagatg ctggcgtgtc tatggattca
      301 tcaacacaat cacgaccatc aagtgatatt ggatgcgatc aagtggattt ctccttaaac
      361 agaggtatta agattgatgc tatggtagat tcttcaatat caatatctac taaatctaag
      421 aaagagaaat ccaaacatga aactaagaac agaaaatgtt atccaaagat tgaagcagag
      481 tcagattctg atgactacgt tttagatagt gatgatggta aatgtaagaa ctgtaaatat
      541 aaaaagaagt attttgcact tcggttaaga atgaaacaag ttgcaatgca attaattgaa
      601 aatttgtgag tctaacctgg gaggtcacta gggagctccc cactcccgtt ttgtgacc
//
LOCUS       ROH11AB       667 bp ds-RNA             VRL       30-MAY-1990
DEFINITION  Human rotavirus gene 11, complete cds.
ACCESSION   M33608
KEYWORDS    .
SOURCE      Human rotavirus (strain DS-1) viral RNA.
  ORGANISM  Human rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 667)
  AUTHORS   Matsui,S.M., Mackow,E.R., Matsuno,S., Paul,P.S. and Greenberg,H.B.
  TITLE     Sequence analysis of gene 11 equivalents from "short" and "super
            short" strains of rotavirus
  JOURNAL   J. Virol. 64, 120-124 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         22      618     gene 11 protein
BASE COUNT      231 a    121 c    121 g    194 t
ORIGIN      
        1 ggcttttgaa gcgctacagt tatgtctctc agtattgatg tgactggtct tccttcaatt
       61 tcttcaagtg tttataaaaa tgaatcgtct tcaacaacgt caactatttc tggaaaatct
      121 attggtagga gtgaacagta catttcacca gatgcagaag ctttcagaaa atacatgttg
      181 tcaaaatctc cagaagatat tggaccttct gattctgcat cgaacgatcc actcaccagc
      241 ttttcgatta gatcgaatgc agttaagaca aatgcagatg ctggcgtgtc tatggattca
      301 tcagcgcaat cacgaccatc tagcgacatt ggatacgatc aaatggattt ctccttgaat
      361 aaaggtatta aatttgatgc tacagtggac tcttcaatat caatatctac cacatcaaag
      421 aaggagaaat ctaaaaacaa aaataagtat aaaaaatgtt acccaaaaat cgaagcagag
      481 tctgattccg atgattacat attagatgat tcagatagcg atgatggaaa atgtaaaaat
      541 tgcaaatata aaaagaaata ttttgcactt cgtttaagaa tgaagcaagt tgcaatgcaa
      601 ttaattaaag atttatgagt ctaacctgag agatcactag ggagctcccc actcccgttt
      661 tgtgacc
//

GenBank-Updates@genbank.bio.net (05/31/90)

LOCUS       SHPMHDQAA    2387 bp ds-DNA             MAM       31-MAY-1990
DEFINITION  Sheep MHC class II OLA-DQA alpha-chain gene, 3' end.
ACCESSION   M33304
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Sheep DNA, clone OLAIIC8-1, library CH.28.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 2387)
  AUTHORS   Scott,P.C. and Brandon,M.R.
  TITLE     Nucleotide sequence, polymorphism and expression of ovine MHC class
            II alpha genes DQA
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.C.Scott 28-MAR-1990.  University of Melbourne, Veterinary
            Preclinical Centre, Parkville, Victoria, Australia 3052
FEATURES       from  to/span     description
    pept    /   359      607     MHC OLA-DQA alpha-chain, exon 2 (AA at 361)
               1085     1366     MHC OLA-DQA alpha-chain, exon 3
               1604     1758     MHC OLA-DQA alpha-chain, exon 4
    pre-msg     358  >  2228     MHC OLA-DQA alpha-chain mRNA
    IVS     <     1      358     MHC OLA-DQA intron A
    IVS         608     1084     MHC OLA-DQA intron B
    IVS        1367     1603     MHC OLA-DQA intron C
    IVS         608     1084     MHC OLA-DQA intron A
    IVS        1367     1603     MHC OLA-DQA intron B
    signal     2223     2228     poly-A signal
BASE COUNT      608 a    610 c    489 g    680 t
ORIGIN      Chromosome 20.
        1 ggatccagct gaggctaaaa ctgtaaaagt gaattagttc caattgacac agtagagtga
       61 tactcttcag catcagttgg tagcccagga caagtaccag atgaaaagga tattacatgt
      121 atttctgcct ggaaaaggag tcaagatgca ttgcaaagat aaggaggctt tgttgcaagg
      181 ctttttaact aactggacaa ctaccaacac agagcgaaga aggagcaggt actgggaatt
      241 ttgtctagaa actggccaca aatgaagccc acaatgtttg atagtcaatt tcttctttca
      301 ctgcttaatg aagatctttt ctctattttt ccctttcttg ctcctcaccc tgactcagct
      361 gaccacattg gcacctatgg cgtaaacatc taccaaacat atggtccctc tggctactat
      421 acccatgaat ttgatggaga tgaagagttc tacgtggacc tggaaaagag ggagactgtc
      481 tggcgtctgc ctgagtttag taaatttaca agttttgacc ctcagggtgc actgagaaac
      541 atagctacgg tgaaacataa tttggagatc ttgattcaaa ggtccaactc tactgctgct
      601 accaacagta tgtgttcacc attctgcctc tctttgttgt tcttcccctt cataccagct
      661 tcactccctt tttccctagg gatagatacc cttcaccact ctataaaact ttctcctttc
      721 caaggagtca ccacattttc tcatggtaat atctgaaccc tcatcttctg ccttctttcc
      781 actcatatat ttccatatga tagaaagatc cttacttcca taatgcaagc tttaatattt
      841 cataggagag ccccacagac atcccacttg acaggtcgtc aaaaacaggg ggacaaggat
      901 aaagcagaag caacatatag tgcctcccaa cagaagggaa gcaagagctt ctcctctgtc
      961 agagtgcaac tgttggtgga agggctcccc caggaggcag tgcagtaaag gctcccccag
     1021 gaggcagtgc agaatcaggg cagagctact tccgtttcac atctgtgctg tttcctcacc
     1081 acagaggttc ctgaggtgac tgtgttttcc aagtctcccg tgatgctggg tcaacccaac
     1141 accctcatct gtcacgtgga caacatcttt ccccctgtga tcaacatcac atggctgagg
     1201 aacgggcact cggtcacaga gggagtttct gagaccagtt ttctcatcaa gagtgattat
     1261 tcttttctca agatcaatta tctcaccttc ctcccttctg atgatgatgt ttatgactgc
     1321 aaagtggagc actggggtct ggatgagcca cttctgaaac actggggtat gtacgagttc
     1381 cagccttttg gtaccttctt ttctctgtca ggtccacaat gtcatgtttg ttatccccta
     1441 atcctcatgg ccttgaactt tgagggattc taatgataga cctcactctc ttctctcagc
     1501 tccatgcccc aagtctgtta gaaccaaccc ctcttcaccc atcccacata cacacacaga
     1561 aacactctgt attgtgactt ccacaaaact tcagctttca cagagcctga gattccagtc
     1621 cctatgtcag agctgacaga gactgtggtc tgtgccctgg ggttgaccgt gggcctcgtg
     1681 ggcatcgtgg tgggcaccgt cctcatcatc cgaggtctgc gctcaggtgg cccctccaga
     1741 caccaggggc cgttgtgagt catactccag aaggaaggta agaattcaga tctgttagat
     1801 ctgggaacat agtacagacg agggaaagtg ggaggaggct gtggacacaa atgtgctgaa
     1861 agtttctaaa ttgggaaaca gcatggcagt ggcacaggag ccctgagccc actcatctca
     1921 tgccttttct gttgcaggtg cgctgtccat ctttaagaac agaagaacgg acttgctaga
     1981 tgacctagag ctattttctg cccaagtcca tcatgtacct tctctcctcc tcctaccctc
     2041 ctcctcatac ttcttctctg gacttaaggt gctgtatcat ctcgagctca catacatttg
     2101 ggattctccc ccgaccttct aatttttttt tttctcaatt gttacctatt atgggatcac
     2161 tgagatatcc cacccagcta cctaatccac caatgaccct aaaataatat atccatggaa
     2221 gaaataaatg cctttatgag gtctttactg actttttccc atctttcatc tcagggctga
     2281 ttagttccat gtcctcttct gtttcaggcc cttttaacct catttcccag attatgtttc
     2341 acggcccagt aacaccggag cagcttgtag agactagtaa tattttt
//
LOCUS       SHPOLADQAB   3355 bp ds-DNA             MAM       31-MAY-1990
DEFINITION  Sheep MHC class II OLA-DQA alpha-chain gene, 3' end.
ACCESSION   M33305
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Sheep (strain Merino) DNA, clone OLAIIC17-2, library CH.28.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 3355)
  AUTHORS   Scott,P.C. and Brandon,M.R.
  TITLE     Nucleotide sequence, polymorphism and expression of ovine MHC class
            II alpha genes DQA
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.C.Scott 28-MAR-1990.  University of Melbourne, Veterinary
            Preclinical Centre, Parkville, Victoria, Australia 3052
FEATURES       from  to/span     description
    pept    /  1057     1305     MHC OLA-DQA alpha-chain, exon 2 (AA at 1059)
               1751     2032     MHC OLA-DQA alpha-chain, exon 3
               2281     2435     MHC OLA-DQA alpha-chain, exon 4
    IVS     <     1     1056     MHC OLA-DQA intron A
    IVS        1306     1750     MHC OLA-DQA intron B
    IVS        2033     2280     MHC OLA-DQA intron C
    signal     2913     2918     poly-A signal
BASE COUNT      888 a    842 c    665 g    960 t
ORIGIN      
        1 ctgcagtgca gagacctggg tttgatccct gggtcgggaa gatcccctgg agaaggaaat
       61 ggctacccac catgagtatt cttgcctgga taatccatgg acagaggaac ccagcaggct
      121 aaagtccatg ggatcgcaaa gcgttggaca cgactgagtg actttcacat taaagaaaga
      181 aataagggga aggaagtgtc agaaatgcaa aataaagctt catttttttt tttattttca
      241 gaaggaatca cggcattcac tttaagaact cactacggtt aacaaaatgg acaccttaga
      301 agcattgttt attctttata tgctttgtta tgttgtcact gtgcttgcta tactgtttct
      361 tataccttat aaaagaggaa tctctgttaa aatagtagga tcatgtgtgc ttttccttgg
      421 ttaaaaaaaa aaaataaaga aggaatctct atgcagtaaa tgcttctgct atgcctaccc
      481 ccgagggaaa agaactcgaa tgaatgtgtt attatctgat gaaaaacttt tgaagaagag
      541 gaaacaatgc taataccagg aaaattcatt cattcattcc cttttttctt ctctctttcc
      601 attataatct ttattacaga ctacaatgtg ccaggcactg agctgggcag gcaggaaata
      661 gcagtaaacg aagcctgtga aatcctgccc ttgtgagcat acaagctcag tcgtgatgga
      721 actctcaaac ccatggactg tagcctttcc cttctccagg ggacttccca acccagggat
      781 cgaacccagg tctcccactg caggcggatt ctttaccagc caagccgcaa gggactctca
      841 agaatcacat atgttacagt gcaaaagcag cctggattct aacaggacag ctaccaacac
      901 ttagggaaaa ggaagcaggt gctggcactt tgcttagaga cattgtgcca aaggtgaagc
      961 ccaccgtgtt tgaaagttag tttcttcagc tactttgttt aatatggcct tttctctctg
     1021 ttttccacct tcctgctcct caccctcact tatcagctga ccactttggc tcctatggca
     1081 cagagatcta ccaatctcat ggtccctctg gccagtacac ccaggaattt gatggagacg
     1141 agctgtttta tgtggacctg gggaagaagg agactgtctg gaggctgcct atgtttagcc
     1201 agtttgcagg ttttgatcca cagggtgcac tgagtgaaat agctacagca aaacacaact
     1261 tggatatcct gactaaacgc tccaacttta cccctgctat caatggtaag tgtccaccat
     1321 tctacttctc tttactgaat ctattctttc atatcaggct tcactccctt cttttctaag
     1381 gagagatatc cttcaccatg ctatgaaact ttcccaagtg tccccagatt tcatagtaat
     1441 tattgaacga ctcatcctct cccacctcaa agatcacata tttccatgta atataaggac
     1501 ccttactccc ataacatatt ccttgaatcc ctcaaggagg agtcccacag acctcctcct
     1561 taacaagcat gcccacagac agcacgggga taaagcatgg gcaacatata gcatctccca
     1621 gcagaaggcg aacaagagct cctcctctgt cagactggga aacgttgtga gagggctccc
     1681 ccaggaggca gtgcagaatc agggccaagc tttttcccca tttcacatct gtgctgtttt
     1741 ctcaccatag aagttcctga ggtgactgtg ttttccaagt ctcccgtgat gctgggtcaa
     1801 cccaacaccc tcatctgtca cgtggacaac atttttcccc ctgtgatcaa catcacatgg
     1861 ctgaagaacg ggcatgcagt cacagagggt gtttctgaga ccagcttcct tcccaaggat
     1921 gatcattcct tcctcaagat tggttatctc accttcctcc cttctgatga tgacgtttat
     1981 gactgcaaag tggagcactg gggcctggat gagccacttc tgaaacactg gggtatgtag
     2041 agttcaccct tttggtactc tctctttttc taccaagtac aaaacactga gttttggtcc
     2101 tcatctcaca gtcaagcttg ttttccacac ttcaaggttt ctaaaattat agaacttcat
     2161 tctcttccct tagcctggtg ccctgagttt ttgtagcatg aacacacact ctgcccaatc
     2221 tcatacacag gcacatgaac caaccctctg tattctgact ttcataactt cattttccag
     2281 agcctgagat tccagcccct atgtcagagc tgacagagac tgtggtctgt gccctggggt
     2341 tgaccgtggg ccttgtgggc atcgtggtgg gcaccatctt catcatccaa ggcctgcgct
     2401 caggtggggc ctccagacac cagggtcccc tgtgagtcgc accctagaaa ggaaggtaag
     2461 gattcatatt tgtcagtgcc acagacacac ttcaggagaa agcaaagcgg gaagtaaatt
     2521 gtagacacaa atgtggttga aagttgtggg aaacgtggga accagcatga ctacagcaca
     2581 gaagctccct aggactcatc agtttcatgc ctttcctgtt gcaggtctct gcccatctag
     2641 gattgcagaa gagaggacgt tctagacgac ctagaactag tttctggcca agttcatcat
     2701 atactctctc ttcccctaca cactctgccc ctcctctctt ctctgggact taagatgctg
     2761 ttacatctca gagctcacat ataccccaga gttctcccct gactttctga tatttttttc
     2821 tgttctcagt agttgcctac catgagatca ctggggtatt ccacccagtt acctactccc
     2881 tagtgacccc aaccccatat caccatggaa gcaataaact ccccttcaat agaaattttt
     2941 cctgtctttc atcacggagc tgaccaagat catgttctct tatgctttag gccccttaat
     3001 ttcatttctc agatcatgtt tcatgctgaa gaacataaga gcaactttta caaattcata
     3061 atatttcagt atgttatgtt cactatggtt aatatttctc tcttcctact atttccaatc
     3121 ttggctcttg gcacttccat ccacccctta attcaggcat cagtgaaagt aatatatacc
     3181 ttttttcctt ggtttatatt tgttatagca gaaatacaaa aaccaagaga ggagtctgta
     3241 ctcatctctt tttttttttt ggctgcactg tccagcctgt gagatcttag ttctccaacc
     3301 tggcatcgaa ctcagaacca cagcaaagaa agctggtgtt cactactgaa tcagg
//
LOCUS       BSUPOLCA      240 bp ds-DNA             BCT       31-MAY-1990
DEFINITION  B.subtilis DNA polymerase III (polC) gene, partial cds.
ACCESSION   M33543
KEYWORDS    DNA polymerase III.
SOURCE      B.subtilis DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 240)
  AUTHORS   Barnes,M.H., Hammond,R.A., Foster,K.A., Mitchener,J.A. and
            Brown,N.C.
  TITLE     The cloned polC gene of Bacillus subtilis: Characterization of the
            azp12 mutation and controlled in vitro synthesis of active DNA
            polymerase III
  JOURNAL   Gene 85, 177-186 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   240     DNA polymerase III (polC) (AA at 1)
    mut          76       76     t in wt; g in azp12 mutant
BASE COUNT       59 a     52 c     56 g     73 t
ORIGIN      
        1 gaattcggaa cccgatttgt ccggcagatg cttgaagata caaagccgac cactttttct
       61 gagctcgttc agatttcagg cttgtctcac ggaactgatg tatggcttgg caatgcacag
      121 gagctcatcc acaataatat ttgtgagctg agtgaggtta tcggctgccg tgatgacatt
      181 atggtttatt taatctatca aggccttgag ccgtcccttg cctttaaaat catggaattc
//
LOCUS       FIBGLUC      1426 bp ds-DNA             BCT       31-MAY-1990
DEFINITION  F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene,
            complete cds.
ACCESSION   M33676
KEYWORDS    1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase.
SOURCE      F.succinogenes (strain S85) DNA, clone PJI5.
  ORGANISM  Fibrobacter succinogenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1426)
  AUTHORS   Teather,R.M. and Erfle,J.D.
  TITLE     DNA sequence of a Fibrobacter succinogenes mixed linkage
            beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.M.Teather, 11-APR-1990.
FEATURES       from  to/span     description
    pept        145     1194     1,3-1,4-beta-D-glucan 4-glucanohydrolase
                                 precursor (EC 3.2.1.73)
    sigp        145      225     1,3-1,4-beta-D-glucan 4-glucanohydrolase
                                 signal peptide
    matp        226     1191     1,3-1,4-beta-D-glucan 4-glucanohydrolase
    binding     132      137     ribosome binding site
    signal       62       66     -35 region
    signal       85       90     -10 region
BASE COUNT      371 a    346 c    335 g    374 t
ORIGIN      
        1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt
       61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt
      121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc
      181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt
      241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg
      301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc
      361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc
      421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct
      481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac
      541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac
      601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt
      661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat
      721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat
      781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt
      841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc
      901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa
      961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct
     1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac
     1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt
     1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg
     1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt
     1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa
     1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt
     1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa
//
LOCUS       CEATAT        264 bp ss-RNA             VRL       31-MAY-1990
DEFINITION  Caprine arthritis encephalitis virus tat gene, complete cds.
ACCESSION   M33674
KEYWORDS    tat protein.
SOURCE      Caprine arthritis encephalitis virus (strain Cork) proviral DNA,
            clone pCo1.9, isolated from goat synovial membrane.
  ORGANISM  Caprine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
            Robinson,M.M. and Cheevers,W.P.
  TITLE     Genetic structure of the pol-env region of the Caprine arthritis
            encephalitis lentivirus genome: Possible role in trans-activation
            of the viral long terminal repeat
  JOURNAL   Unpublished (1990) see COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.P.Cheevers, 11-APR-1990.  Washington State University, Dept.
            of Veterinary Microbiology and Pathology, Pullman, WA 99164-7040
FEATURES       from  to/span     description
    pept          1      264     tat protein
BASE COUNT       97 a     40 c     80 g     47 t
ORIGIN      Map position 5856.
        1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta
       61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg
      121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg
      181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta
      241 tgtaacccgg ggtggggaac ataa
//
LOCUS       CEATATX       264 bp ss-RNA             VRL       31-MAY-1990
DEFINITION  Caprine arthritis encephalitis virus tat protein mRNA, complete
            cds.
ACCESSION   M33675
KEYWORDS    tat protein.
SOURCE      Caprine arthritis encephalitis virus (strain G63) cDNA to viral
            RNA, clone pC63-49, isolated from goat synovial membrane.
  ORGANISM  Caprine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
            Robinson,M.M. and Cheevers,W.P.
  TITLE     Genetic structure of the pol-env region of the Caprine arthritis
            encephalitis lentivirus genome: Possible role in trans-activation
            of the viral long terminal repeat
  JOURNAL   Unpublished (1990) see COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.P.Cheevers, 11-APR-1990.  Washington State University, Dept.
            of Veterinary Microbiology and Pathology, Pullman, WA 99164-7040
FEATURES       from  to/span     description
    pept          1      264     tat protein
BASE COUNT       99 a     43 c     75 g     47 t
ORIGIN      Map position 5856.
        1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata
       61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg
      121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg
      181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta
      241 tgtaacccgg ggtgggaaac ataa
//
LOCUS       MUSLRPA      3070 bp ss-mRNA            ROD       31-MAY-1990
DEFINITION  Mouse LCA-related phosphatase (LRP) mRNA, complete cds.
ACCESSION   M33671
KEYWORDS    leukocyte common antigen-related phosphatase; tyrosine phosphatase.
SOURCE      Mouse (strain (C57BL/6 X DBA/2)F1) pre-B cell line 70Z/3, cDNA to
            mRNA, clones mLRP-[B20,C26].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3070)
  AUTHORS   Matthews,R.J., Cahir,E.D. and Thomas,M.L.
  TITLE     Identification of a novel member of the protein tyrosine
            phosphatase family: Evidence for alternative splicing in the
            tyrosine phosphatase domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.L.Thomas, 11-APR-1990.
FEATURES       from  to/span     description
    pept         26     2515     LCA-related phosphatase precursor (EC 3.13.48)
    sigp         26       82     LCA-related phosphatase signal peptide
    matp         83     2512     LCA-related phosphatase
    mRNA    <     1     3070     LCA-related phosphatase mRNA
    site        828      936     variably spliced exon
BASE COUNT      874 a    754 c    732 g    710 t
ORIGIN      
        1 ccgcccagcg ccgggctcgg tcagcatgga ttcctggttc attcttgtcc tgtttggcag
       61 tggtctaata catgttagtg ccaacaatgc tactacagtt tcaccttctt taggaacgac
      121 aagattaatt aaaacatcaa caacagaatt ggctaaggaa gagaataaaa cctcaaattc
      181 aacctcttca gtaatttctc tttctgtggc accaacattc agcccaaacc tgactctgga
      241 gcccacctat gtgactactg ttaattcttc acactctgac aatgggacca ggagggcagc
      301 cagcacggaa tctggaggca ctaccatttc cccgaacgga agctggctta ttgagaacca
      361 gttcacggat gccataacag aaccctggga ggggaactcc agcactgcag caaccactcc
      421 agaaaccttc cccccggcag atgagacacc aattattgcg gtgatggtgg ccctgtcctc
      481 tctgctagta atcgtgttta ttatcatagt tctgtacatg ttaaggttta agaaatacaa
      541 gcaagctggg agtcattcca actctttccg cctgtcaaat ggccgcacgg aggatgtgga
      601 gccccaaagt gtaccacttc tggccaggtc cccaagcacc aacaggaagt acccaccact
      661 gcctgtggac aagctggaag aggagattaa ccggagaatg gctgatgaca ataagatctt
      721 cagagaagaa ttcaacgctc tccctgcttg tcctatccag gccacctgtg aggctgcctc
      781 caaggaagaa aacaaggaaa aaaaccgcta tgtaaacatc ctgccctttc tctctttagc
      841 tgtgagcaag gatgcagtga aagcactgaa caaaaccact ccattgttag aaagaaggtt
      901 tattgggaaa tcaaactcca gaggctgtct ctcagatgac cactctagag tgcacctgac
      961 acctgttgaa ggggtcccag attctgatta catcaacgct tcattcatta atggctacca
     1021 ggaaaagaac aaattcatcg ctgcacaagg accaaaagaa gaaacagtga atgacttctg
     1081 gagaatgata tgggaacaaa acacagctac tattgtcatg gtgaccaacc tgaaggagag
     1141 aaaggagtgt aaatgtgccc aatactggcc agaccaaggc tgctggacct atgggaatgt
     1201 ccgtgtgtct gtcgaggatg tgactgttct ggtggactac acagtacgga aattctgcat
     1261 ccagcaggtg ggcgacgtga ccaacaggaa accacagcgc ctcatcactc agttccactt
     1321 caccagctgg ccagactttg gggtgccttt caccccaatt ggcatgctca agttcctcaa
     1381 gaaggtgaag gcctgtaacc ctcagtacgc aggggctatc gtggtccact gcagtgcagg
     1441 tgtagggcgc actggcacct ttgttgtcat cgatgccatg ctggacatga tgcattcgga
     1501 acgcaaagtg gatgtatacg ggtttgtgag ccggatccgg gcccagcgct gccagatggt
     1561 acagacagac atgcagtacg tcttcatata ccaggccctt ctggagcatt atctgtatgg
     1621 ggacacagaa ctggaagtga cttctctaga aacccaccta caaaaaattt ataacaagat
     1681 cccagggacg agcaacaacg ggttagagga ggagtttaag aaattaactt caatcaaaat
     1741 ccagaatgac aagatgcgca cgggaaacct tccagccaac atgaagaaga accgggtttt
     1801 acagatcatt ccatatgaat ttaacagagt gatcattcca gtcaaacgag gcgaagagaa
     1861 cacagactat gtgaacgcat ccttcattga tggataccgg cagaaagact cctacattgc
     1921 cagccagggc cctcttctcc acacgattga ggacttctgg cgaatgatct gggagtggaa
     1981 gtcctgttct atcgtaatgc tgacagaact ggaagagaga ggccaggaga agtgtgccca
     2041 gtactggcca tctgatggcc tggtgtccta tggagacatc acagttgagc tgaagaagga
     2101 ggaggaatgt gaaagctaca ctgtccgaga cctcctggtc accaacacca gggagaacaa
     2161 gagtcggcaa atccggcagt tccacttcca cggctggcct gaggtgggca tccccagcga
     2221 cggcaagggc atgatcaaca tcattgcagc agtgcagaag cagcagcagc agtcggggaa
     2281 ccatcccatc actgtgcact gcagtgccgg ggcaggacgg acaggaacct tctgtgcctt
     2341 gagcacagtc ctggaacgtg tgaaagcaga aggaatttta gatgtcttcc aaactgtcaa
     2401 gagcctgcgg ctgcagaggc cacacatggt ccagacactg gaacagtatg aattctgcta
     2461 caaggtggta caggagtaca ttgacgcctt ttcagattat gccaacttca agtgacaggt
     2521 gacaaggccc acagacagga gaattgcctt taatattttg taatattctg tttttgttaa
     2581 tatacccaaa attgtatata tcttataact gttttagaaa tggcacatag gcttctatta
     2641 cctgttaggt ggagattttg tatgtaaatg tgttagcact gatagtcctt ttccagtgtt
     2701 ttattgggaa attaaatagt gtgatatttg ggttgatata atgaaatcct cagcctggaa
     2761 actgggccag attgttcctt gcttcaaata tcttttccta aagaagataa acctaagact
     2821 cattccaggt agctcagtgc caactaaaac aaagcacaaa gttctcagag ctcttgagga
     2881 aatggttgtc tccctgtccc caggcaggcc tcttcccctc cctgtcctgt aaatatccct
     2941 cccctctcca gtccaccctc atctcccacc aagatcagcc acctcaggca tggggagtaa
     3001 tgagaccaga gcgcctctct ggcaccacag cagggatcgt caggtaataa acactcttga
     3061 ttccctgagg
//
LOCUS       BOV33DPTP     982 bp ss-mRNA            MAM       31-MAY-1990
DEFINITION  Cow 33-kDa phototransducing protein mRNA, 3' end.
ACCESSION   M33529
KEYWORDS    phototransducing protein.
SOURCE      Cow adult photoreceptor rod cell, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 982)
  AUTHORS   Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S.,
            Yamaki,K. and Shinohara,T.
  TITLE     Analysis of the human, bovine and rat 33-kDa proteins and cDNA in
            retina and pineal gland
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Shinohara, 03-APR-1990.
FEATURES       from  to/span     description
    pept    <     1      719     33-kDa phototransducing protein (AA at 3)
BASE COUNT      334 a    164 c    215 g    269 t
ORIGIN      
        1 aaagtttgga ggaagacttt gaaggacagg cctcacatac aggacccaaa ggagtaataa
       61 atgactggag aaagtttaaa ttggagagtg aagatagtga ttcagttgca cccagcaaga
      121 aggagattct cagacaaatg tcttctcctc agagtagaga tgacaaagac tcaaaagaaa
      181 gattcagcag aaagatgagc gttcaagaat atgaactaat ccacaaagac aaagaagatg
      241 aaaattgcct tcgtaaatac cgcagacagt gtatgcagga tatgcaccag aagctgagtt
      301 ttgggcctag atatgggttt gtgtatgagc tggaatctgg ggagcaattc ctggaaacca
      361 ttgaaaagga acagaaaatc accactatcg ttgttcatat ttatgaagat ggtattaagg
      421 gctgtgatgc tctaaacagt agcttgatat gccttgcagc cgaataccct atggtcaagt
      481 tttgtaaaat aaaggcttct aatacaggtg ccggagaccg cttttcctca gatgtactcc
      541 ccacgctgct tgtctacaaa ggtggggaac tcctaagcaa tttcattagt gttactgaac
      601 agctggctga agaatttttt actggggatg tggagtcttt cctaaatgaa tatgggttat
      661 tacctgaaaa agagatgcat gtcctagagc agagcaaaat ggaagaggat atggaataaa
      721 gattcactat gtcaatgttt catatttcgt atttctcctt taagcattga acactgattt
      781 tggtagtatt cacattcttt tagggaatac caaacatagc cctggctttt ctaatttggg
      841 gaagaaaaac tccagactga cactaaaatt atatgattag catgtcttaa tattagttac
      901 tcaagctgat ataacacttt acctcaaaac attgtagtct tcagcaatat gttagtagac
      961 aaagagaata tgaaaaatac tg
//
LOCUS       HUM33DPTP    1182 bp ss-mRNA            PRI       31-MAY-1990
DEFINITION  Human 33-kDa phototransducing protein mRNA, complete cds.
ACCESSION   M33478
KEYWORDS    phototransducing protein.
SOURCE      Human adult photoreceptor rod cell, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1182)
  AUTHORS   Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S.,
            Yamaki,K. and Shinohara,T.
  TITLE     Analysis of the human, bovine and rat 33-kDa proteins and cDNA in
            retina and pineal gland
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Shinohara, 30-MAR-1990.
FEATURES       from  to/span     description
    pept         52      792     33-kDa phototransducing protein
    mRNA    <     1     1181     33-kDa phototransducing protein mRNA
BASE COUNT      405 a    187 c    242 g    348 t
ORIGIN      
        1 aggacaccag gcacagagat ccaaactatt atatcaaatc caatccctaa aatggaagaa
       61 gccaaaagcc aaagtttgga ggaagacttt gaaggacagg ccacacatac aggacccaaa
      121 ggagtaataa atgattggag aaagtttaaa ttagagagtc aagacagtga ttcaattcca
      181 cctagcaaga aggagattct caggcaaatg tcttctcctc agagtaggaa tggcaaagat
      241 tcaaaggaac gagtcagcag aaagatgagc attcaagaat atgaactaat ccataaagag
      301 aaagaggatg aaaactgcct tcgtaaatac cgtagacagt gtatgcagga tatgcaccag
      361 aagctgagtt ttgggcctag atatgggttt gtgtatgagc tggaaactgg aaagcaattc
      421 ctagaaacaa ttgaaaagga actgaagatc accacaattg ttgttcacat ttatgaagat
      481 ggtattaagg gttgtgatgc tctaaacagt agtttaacat gccttgcagc agaataccct
      541 atagttaagt tttgtaaaat aaaagcttcg aatacaggtg ctggggaccg cttttcctta
      601 gatgtacttc ctacactgct catctataaa ggtggggaac tcataagcaa ttttattagt
      661 gttgctgaac agtttgctga agaatttttt gctggggatg tggagtcttt cctaaatgaa
      721 tatgggttac tacctgaaag agaggtacat gtcctagagc ataccaaaat agaagaagaa
      781 gatgttgaat gaagattcac tatgtcaata tctcatgttt atcctttagg tattggatga
      841 tggttttggt agtatctata ttgcttttgt gaacacagag tatgggcacg gctatgctaa
      901 cttgacaaaa atgactgatg caacaatcga gttattagca tttcatagta ttagttactc
      961 aaattgatac aatgcttgac tacaaaacaa agctgtcttc agcaacatta ttagtagaca
     1021 aagaggatgt ggataatatt atgacatttt tcaaaaatcc ctttcaagtt atgttttgtc
     1081 ttttttactc cattttccct catcactgtt attatttgga cttttcaaat tacattattc
     1141 attataattt tctttgtgta ataaaaatga aatctcatga ag
//
LOCUS       RAT33DPPT    1221 bp ss-mRNA            ROD       31-MAY-1990
DEFINITION  Rat 33-kDa phototransducing protein mRNA, complete cds.
ACCESSION   M33530
KEYWORDS    phototransducing protein.
SOURCE      Rat (strain Sprague-Dawley) adult photoreceptor rod cell, cDNA to
            mRNA, pineal gland library.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1221)
  AUTHORS   Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S.,
            Yamaki,K. and Shinohara,T.
  TITLE     Analysis of the human, bovine and rat 33-kDa proteins and cDNA in
            retina and pineal gland
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Shinohara, 03-APR-1990.
FEATURES       from  to/span     description
    pept         92      832     33-kDa phototransducing protein
BASE COUNT      399 a    234 c    282 g    306 t
ORIGIN      
        1 gttcagtgga cagagattct cacccactgg acataatcta ggactccagg aacagagacc
       61 ccaaactact acaccgaaca caacccataa aatggaagaa gccgcaagcc aaagcttaga
      121 ggaagatttt gaaggacagg ccacacacac aggacccaaa ggagtaataa atgactggag
      181 aaagtttaaa ttagaaagtg aagatggtga ttcaattcca cccagcaaga aggagatcct
      241 cagacaaatg tcctctcctc agagcagaga tgacaaagac tcgaaagaaa gaatgagcag
      301 aaagatgagc attcaagaat atgaactaat tcatcaggac aaagaagatg aaggttgcct
      361 tcgcaaatac cgcagacagt gcatgcagga tatgcatcag aagctgagct ttgggcctag
      421 gtatgggttt gtgtatgagc tggaaacagg ggagcaattc ctggaaacca tcgaaaagga
      481 gcagaaggtc accaccatcg tggttaacat ttacgaggat ggtgtcaggg gctgtgacgc
      541 actcaacagc agtttagaat gccttgcagc agaataccca atggtcaagt tctgtaaaat
      601 aagagcttcg aatactggag ctggggaccg cttttcctca gacgtactcc cgacattgct
      661 cgtatacaaa ggtggggagc tcataagcaa ttttattagt gtcgctgaac aatttgccga
      721 agattttttc gctgcggatg tggagtcttt cctaaatgaa tatggcttac taccagaaag
      781 agagatacat gacctagggc agaccaacac ggaagatgaa gatatcgagt aagcacgcac
      841 ggtgcagtat ctcgtgacta tcctttgcac agtgaacatc gatggttttt ttggtagtat
      901 ctatattcct ttagcaaaca ctaaatacag tcaggccaac tcaaattggg gaggggacac
      961 taaaattatg taaatagcat ttttagtact aattattcaa attgagataa tattttactg
     1021 caaaaaatat tgtggttctg agcaaataac tactgaacag aggaggtggg tgatgtcaac
     1081 attaggtcat tgtaaaaatc cctttccagt tccgtgttag ctttttactc ctccttttcc
     1141 ccttggcatt tatttttggg cttttgaggc tgaatgctac ttataaattg gtttgtatga
     1201 taagaataaa gtcctatgaa g
//
LOCUS       RAT33DPTP    1272 bp ss-mRNA            ROD       31-MAY-1990
DEFINITION  Rat 33-kDa phototransducing protein mRNA, complete cds.
ACCESSION   M33528
KEYWORDS    phototransducing protein.
SOURCE      Rat (strain Sprague-Dawley) adult photoreceptor rod cell, cDNA to
            mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1272)
  AUTHORS   Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S.,
            Yamaki,K. and Shinohara,T.
  TITLE     Analysis of the human, bovine and rat 33-kDa proteins and cDNA in
            retina and pineal gland
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Shinohara, 03-APR-1990.
FEATURES       from  to/span     description
    pept         82      822     33-kDa phototransducing protein
    mRNA    <     1     1272     33-kDa phototransducing protein mRNA
BASE COUNT      420 a    244 c    289 g    319 t
ORIGIN      
        1 cagagattct cacccactgg acataatcta ggactccagg aacagagacc ccaaactact
       61 acaccgaaca caacccataa aatggaagaa gccgcaagcc aaagcttaga ggaagatttt
      121 gaaggacagg ccacacacac aggacccaaa ggagtaataa atgactggag aaagtttaaa
      181 ttagaaagtg aagatggtga ttcaattcca cccagcaaga aggagatcct cagacaaatg
      241 tcctctcctc agagcagaga tgacaaagac tcgaaagaaa gaatgagcag aaagatgagc
      301 attcaagaat atgaactaat tcatcaggac aaagaagatg aaggttgcct tcgcaaatac
      361 cgcagacagt gcatgcagga tatgcatcag aagctgagct ttgggcctag gtatgggttt
      421 gtgtatgagc tggaaacagg ggagcaattc ctggaaacca tcgaaaagga gcagaaggtc
      481 accaccatcg tggttaacat ttacgaggat ggtgtcaggg gctgtgacgc actcaacagc
      541 agtttagaat gccttgcagc agagtaccca atggtcaagt tctgtaaaat aagagcttcg
      601 aatactggag ctggggaccg cttttcctca gacgtactcc cgacattgct catatacaaa
      661 ggtggggagc tcataagcaa ttttattagt gtcgctgaac aatttgccga agattttttc
      721 gctgcggatg tggagtcttt cctaaatgaa tatggcttac taccagaaag agagatacat
      781 gacctagggc agaccaacac ggaagatgaa gatatcgagt aagcacgcac ggtgcagtat
      841 ctcgtgacta tcctttgcac agtgaacatc gatggttttt ttggtagtat ctatattcct
      901 ttagcaaaca ctaaatacag tcaggccaac tcaaattggg gaggggacac taaaattatg
      961 taaatagcat ttttagtact aattattcaa attgagataa tattttactg caaaaaatat
     1021 tgtggttctg agcaaataac tactgaacag aggaggtggg tgatgtcaac gttaggtcat
     1081 tgtaaaaatc cctttccagt tccgtgttag cttattactc ctccttttcc ccttggcatt
     1141 tatttttggg cttttgaggc tgaatgctac ttataaattg gtttgtatga taagaataaa
     1201 gtcctatgaa gaaacaaata gtctgtctaa agtctgccgt gagtcaagga cttaaaaact
     1261 tcataaagct tt
//
LOCUS       HUMCPIIA3A   1748 bp ss-mRNA            PRI       31-MAY-1990
DEFINITION  Human cytochrome P450IIA3 (CYP2A3) mRNA, complete cds.
ACCESSION   M33318 M33316
KEYWORDS    coumarin 7-hydroxylase; cytochrome P450; cytochrome P450IIA3.
SOURCE      Human liver DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1748)
  AUTHORS   Yamano,S., Tatsuno,J. and Gonzalez,F.J.
  TITLE     The CYP2A3 gene product catalyzes coumarin 7-hydroxylation in human
            liver microsomes
  JOURNAL   Biochemistry 29, 1322-1329 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         10     1494     cytochrome P450IIA3 /nomgen="CYP2A"
                                 /map="19q13.1-q13.2" /hgml_locus_uid="LN0074T"
    mRNA    <     1     1748     CYPIIA3 mRNA
    allele       60       60     g in allele CYP2A3; a in allele CYP2A3v
    allele      488      488     t in allele CYP2A3; a in allele CYP2A3v
    allele     1645     1645     g in allele CYP2A3; c in allele CYP2A3v
BASE COUNT      395 a    478 c    516 g    359 t
ORIGIN      
        1 gctaccacca tgctggcctc agggatgctt ctggtggcct tgctggtctg cctgactgtg
       61 atggtcttga tgtctgtttg gcagcagagg aagagcaagg ggaagctgcc tccgggaccc
      121 accccattgc ccttcattgg aaactacctg cagctgaaca cagagcagat gtacaactcc
      181 ctcatgaaga tcagtgagcg ctatggcccc gtgttcacca ttcacttggg gccccggcgg
      241 gtcgtggtgc tgtgtggaca tgatgccgtc agggaggctc tggtggacca ggctgaggag
      301 ttcagcgggc gaggcgagca agccaccttc gactgggtct tcaaaggcta tggcgtggta
      361 ttcagcaacg gggagcgcgc caagcagctc cggcgcttct ccatcgccac cctgcgggac
      421 ttcggggtgg gcaagcgagg catcgaggag cgcatccagg aggaggcggg cttcctcatc
      481 gacgccctcc ggggcactgg cggcgccaat atcgatccca ccttcttcct gagccgcaca
      541 gtctccaatg tcatcagctc cattgtcttt ggggaccgct ttgactataa ggacaaagag
      601 ttcctgtcac tgttgcgcat gatgctagga atcttccagt tcacgtcaac ctccacgggg
      661 cagctctatg agatgttctc ttcggtgatg aaacacctgc caggaccaca gcaacaggcc
      721 tttcagttgc tgcaagggct ggaggacttc atagccaaga aggtggagca caaccagcgc
      781 acgctggatc ccaattcccc acgggacttc attgactcct ttctcatccg catgcaggag
      841 gaggagaaga accccaacac ggagttctac ttgaaaaacc tggtgatgac cacgttgaac
      901 ctcttcattg ggggcaccga gaccgtcagc accaccctgc gctatggctt cttgctgctc
      961 atgaagcacc cagaggtgga ggccaaggtc catgaggaga ttgacagagt gatcggcaag
     1021 aaccggcagc ccaagtttga ggaccgggcc aagatgccct acatggaggc agtgatccac
     1081 gagatccaaa gatttggaga cgtgatcccc atgagtttgg cccgcagagt caaaaaggac
     1141 accaagtttc gggatttctt cctccctaag ggcaccgaag tgtaccctat gctgggctct
     1201 gtgctgagag accccagttt cttctccaac ccccaggact tcaatcccca gcacttcctg
     1261 aatgagaagg ggcagtttaa gaagagtgat gcttttgtgc ccttttccat cggaaagcgg
     1321 aactgtttcg gagaaggcct ggccagaatg gagctctttc tcttcttcac caccgtcatg
     1381 cagaacttcc gcctcaagtc ctcccagtca cctaaggaca ttgacgtgtc ccccaaacac
     1441 gtgggctttg ccacgatccc acgaaactac accatgagct tcctgccccg ctgagcgagg
     1501 gctgtgccgg tgcaggtctg gtgggcgggg ccagggaaag ggcagggcca agaccgggct
     1561 tgggagaggg gcgcagctaa gactgggggc aggatggcgg aaaggaaggg gcgtggtggc
     1621 tagagggaag agaagaaaca gaaggggctc agttcacctt gataaggtgc ttccgagctg
     1681 ggatgagagg aaggaaaccc ttacattatg ctatgaagag tagtaataat agcagctctt
     1741 atttcctg
//
LOCUS       HUMCYIIA4A   1747 bp ss-mRNA            PRI       31-MAY-1990
DEFINITION  Human cytochrome P450IIA4 (CYP2A4) mRNA, complete cds.
ACCESSION   M33317
KEYWORDS    cytochrome P450; cytochrome P450IIA4.
SOURCE      Human liver DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1747)
  AUTHORS   Yamano,S., Tatsuno,J. and Gonzalez,F.J.
  TITLE     The CYP2A3 gene product catalyzes coumarin 7-hydroxylation in human
            liver microsomes
  JOURNAL   Biochemistry 29, 1322-1329 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         10     1494     cytochrome P450IIA4 /hgml_locus_uid="LN0074T"
                                 /nomgen="CYP2A" /map="19q13.1-q13.2"
    mRNA    <     1     1747     CYP2A4 mRNA
BASE COUNT      396 a    471 c    517 g    363 t
ORIGIN      
        1 gctaccacca tgctggcctc agggctgctt ctggtggcct tgctggcctg cctgactgtg
       61 atggtcttga tgtctgtctg gcagcagagg aagagcaggg ggaagctgcc tccgggaccc
      121 accccactgc ccttcattgg aaactacctc cagctgaaca cagagcacat atgtgactcc
      181 atcatgaagt tcagtgagtg ctatggcccc gtgttcacca ttcacttggg gccccggcgg
      241 gtcgtggtgc tgtgtggaca tgatgccgtc agggaggctc tggtggacca ggctgaggag
      301 ttcagcgggc gaggcgagca agccaccttc gactgggtct tcaaaggcta tggcgtggcg
      361 ttcagcaacg gggagcgcgc caagcagctc ctgcgctttg ccatcgccac cctgagggac
      421 ttcggggtgg gcaagcgagg catcgaggag cgcatccagg aggagtcggg cttcctcatc
      481 gaggccatcc ggagcagcca cggcgccaat atcgatccca ccttcttcct gagccgcaca
      541 gtctccaatg tcatcagctc cattgtcttt ggggaccgct ttgactatga ggacaaagag
      601 ttcctgtcac tgctgagcat gatgctagga atcttccagt tcacgtcaac ctccacgggg
      661 cagctctatg agatgttctc ttcggtgatg aaacacctgc caggaccaca gcaacaggcc
      721 tttaagttgc tgcaagggct ggaggacttc atagccaaga aggtggagca caaccagcgc
      781 acgctggatc ccaattcccc acaggacttc atcgactcct ttctcatcca catgcaggag
      841 gaggagaaga accccaacac ggagttctac ttgaagaacc tgatgatgag cacgttgaac
      901 ctcttcattg caggcacgga gaccgtcagc accaccctgc gctatggctt cttgctgctc
      961 atgaagcacc cagaggtgga ggccaaggtc catgaggaga ttgacagagt gatcggcaag
     1021 aaccggcagc ccaagtttga ggaccggacc aagatgccct acatggaggc agtgatccac
     1081 gagatccaaa gatttggaga cgtgatcccc atgagtttgg cccgcagagt caaaaaggac
     1141 accaagtttc gggatttttt cctccctaag ggcaccgaag tgttccctat gctgggctcc
     1201 gtgctgagag accccagttt cttctccaac ccccaggact tcaatcccca gcacttcctg
     1261 gatgacaagg ggcagtttaa gaagagtgat gcttttgtgc ccttttccat cggaaagcgg
     1321 tactgtttcg gagaaggcct ggccagaatg gagctctttc tcttcttcac caccgtcatg
     1381 cagaacttcc gcctcaagtc ctcccagtca cctaaggaca ttgacgtgtc ctccaaacac
     1441 gtgggctttg ccacgatccc acgaaactac accatgagct tcctgccccg ctgagcgagg
     1501 gctgtgccgg tgcaggtctg gtgggcgggg ccagggaaag gcggggtcag ggcggggttc
     1561 gcggaagagg cgggtataag aatgggggga agatgcggga aaggaagggg cgtggtggct
     1621 agagggaaga gaagaaacag aaggggctca gttcaccttg ataaggtgct tccgtggtgg
     1681 gatgagagga agggaaacct tacattatgc tatgaagagt agtaataata gcagctctta
     1741 tttcctg
//
LOCUS       TRLKPRGXY    2568 bp ds-DNA             ORG       31-MAY-1990
DEFINITION  Leptomonas sp. 9S and 12S ribosomal RNA genes.
ACCESSION   J03814
KEYWORDS    12S ribosomal RNA; 9S ribosomal RNA; ribosomal RNA gene.
SOURCE      Leptomonas sp. kinetoplast DNA.
  ORGANISM  Kinetoplast Leptomonas sp.
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Leptomonas sp..
REFERENCE   1  (bases 1 to 2568)
  AUTHORS   Lake,J.A., de la Cruz,V.F., Ferreira,P.C.G., Morel,C. and
            Simpson,L.
  TITLE     Evolution of parasitism: Kinetoplastid protozoan history
            reconstructed from mitochondrial rRNA gene sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 4779-4783 (1988)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer_readable sequence for [1] kindly provided
            by L.Simpson, 06-MAY-1988.
FEATURES       from  to/span     description
    rRNA        351     1500     12S rRNA (put.)
    rRNA       1651     2250     9S rRNA (put.)
BASE COUNT      968 a    207 c    322 g   1071 t
ORIGIN      BamHI site.
        1 gatcctaggt aatctcgtca gctgctgatc acaccaaaaa acatatatta caataaatta
       61 ctaattatac taatattaaa taagaaagat agaaatgtgg ttaatatgca ttattaaggg
      121 atgtaatgat gtgatgacat gtaattaaaa gacctcggtt atatgaatta tttataaaaa
      181 cttgtagggt gtggaattaa aagagtatct aagcctaagc cctaagtccc ctctctctca
      241 tcgtacttta catcaattat tactgtaatt aattttcgtg agcggagcac atggcgttta
      301 atttgagaca taaattagta agaaaagggt aaaaataaat tgtagatttt attttgtttt
      361 tatttatcaa tatagttaat aaaataatct agaattttat gttaaatata taattatatt
      421 tttgattatt atattttgtt attttattta agttaattaa attgtattat atttaatttt
      481 taaatttgtt gttttatatt tagttttatg tttataattt aatgcaatac tgcacatatt
      541 ttaaaatttt aatgaactgt tatttatagt tgattatttt tagtttaatg tttaaatatt
      601 taactaatgg aggcacagtt gttctatatg taccaataaa aaatagtaaa attaatttaa
      661 ttaatttaat aaataattat ttgatcaaaa ttagtacaaa tattatgtaa aatttaaaaa
      721 ttaaatattt tatctaatat taacttatat taaatatttg gtttacattg aatattaaat
      781 ttacagattt aatttgtaat taatttaata ttgtgtgttt attaaaatag aggattaatg
      841 taataaattt ttaattttaa tatttaatat ttgttaaatt aatagtaaca taacttaatt
      901 aaaaagaatt atttatacgt ataaattttt aaaattataa ttatattgtt aataaaatta
      961 tcaagtttca aaagcgttta ttaaatgcgt ctgtctaaga tttatattta agagtattct
     1021 tgtatatagt tttaagtttt aataatccta tttaattata aagttaacct caaaaagcat
     1081 ttataagtag catagtaatt tgttaactaa atattaaagt gttccataga aaattttaaa
     1141 attataacaa tcatcgtaac taataaatta aaataaaaat tttaaaaaaa attaaaaaat
     1201 taaaataggg caagtcctac tctcctttac aaagagaaca tttaatatgt agttgtatgt
     1261 ttgattgggg caatactata tcttgttata tagaaaaaga actataatta ttgaaataat
     1321 aaaaggttcg agcaggttaa caagcattaa tactaaatgt gtttcatcgt ctacttattg
     1381 ctataaaaaa ttgattgttc atcaaaaatg caattcgtta gttgggttaa aatcgttgta
     1441 aagcagattt gtttatatat ttaatatttt tattatttta aaaattaata ttagtacgca
     1501 aggattcatt atttgttgct ttatttatat ttatatttat atttatattt atatttataa
     1561 tatttattat attttaattg cttttttgcg tttgtatcgt ataattacat atttattata
     1621 aatatataat tctatattta aatgtagtta ttttatatat gtatatgtat atgtagtcaa
     1681 ttgttattat tcatattaat ttttttaatt gtttttaaat tgttttatat tagtttattg
     1741 aaatacttat ttattttcaa tgtttaggaa tagttaataa taatttataa ttttgattag
     1801 attactttgt tgatgctata agaaggggtg tggaaatagt gttaaatgta agatatataa
     1861 ctataataaa taacatataa attattagtc agaaatggat gccagccgtt gcggtaatat
     1921 ctatgctttt aaatattata catttattat attaatttgt tacttattaa ttttagtcaa
     1981 taaaaagttt tatatatttt aatttgtttt tatacaccat caggtatatg caaatataaa
     2041 atgacattaa ttattaatta tattatatta tatttattca tatctttata cacataaaat
     2101 ttttagattt tgacaccatg aaaaggctat cgaatggaat tgtatatttt ataatcaaaa
     2161 ttaattaatt atattaaatt attaaatttt agataaaaaa ataaattcaa aaggtattgt
     2221 tgcccaccaa tttttataat aaaaataacg tgcagtaatt aatgaactta taaaagtaca
     2281 ttttttcata ttaaactttc attatctttt ttgttattta aatatttatg cattaaatct
     2341 taaaaattta aatatttgca cacatgtgta taatatatta aattgggaac cccgttgaag
     2401 gagggacaga ccaagaggac agagaggtcg ggaatttcag cgatttgatt tttttttttt
     2461 tgggggagcg gagcagtcga ggaaagccca gagttttcag agcgttggcg aagaagggtc
     2521 gttttattcg gaaaataaag accgttctgg aaggggagtt ttttcagg
//
LOCUS       MUSCALP1       83 bp ds-DNA             ROD       31-MAY-1990
DEFINITION  Mouse calpactin I heavy chain (p36) gene, exon 1.
ACCESSION   M33321
KEYWORDS    calpactin I; calpactin I heavy chain; p36 protein.
SEGMENT     1 of 2
SOURCE      Mouse DNA, clones RW22 and B34.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 83)
  AUTHORS   Amiguet,P., D'Eustachio,P., Kristensen,T., Wetsel,R.A.,
            Saris,C.J.M., Hunter,T., Chaplin,D.D. and Tack,B.F.
  TITLE     Structure and chromosome assignment of the murine p36 (calpactin I
            heavy chain) gene
  JOURNAL   Biochemistry 29, 1226-1232 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         30  /    77     calpactin I heavy chain (p36), exon 1
    pre-msg      23  >    83     calpactin I heavy chain mRNA and introns
    IVS          78  >    83     calpactin I heavy chain intron A
BASE COUNT       17 a     16 c     16 g     34 t
ORIGIN      
        1 tttttttttc agcttttttt ttcttcaaaa tgtctactgt ccacgaaatc ctgtgcaagc
       61 tcagcctgga gggtgatgta agt
//
LOCUS       MUSCALP2      573 bp ds-DNA             ROD       31-MAY-1990
DEFINITION  Mouse calpactin I heavy chain (p36) gene, exon 12.
ACCESSION   M33322
KEYWORDS    calpactin I; calpactin I heavy chain; p36 protein.
SEGMENT     2 of 2
SOURCE      Mouse DNA, clones RW22 and B34.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 573)
  AUTHORS   Amiguet,P., D'Eustachio,P., Kristensen,T., Wetsel,R.A.,
            Saris,C.J.M., Hunter,T., Chaplin,D.D. and Tack,B.F.
  TITLE     Structure and chromosome assignment of the murine p36 (calpactin I
            heavy chain) gene
  JOURNAL   Biochemistry 29, 1226-1232 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     7       63     calpactin I heavy chain (p36), exon 12
    pre-msg <     1      343     calpactin I heavy chain mRNA and introns
    IVS     <     1        6     calpactin I heavy chain intron K
BASE COUNT      137 a    145 c    132 g    159 t
ORIGIN      About 22 kb after segment 1.
        1 ctgcaggaca ccaagggtga ctaccagaag gcactgctgt acctgtgtgg tggggatgac
       61 tgaagggctc agcacagtgg atcacccaga agtggctcta cctgtgcccc aacctggcgt
      121 tctagagact tcgctctcca ctaatggacc cctgagctcc tccctgtgag gatgatgaca
      181 gggctgccga ccctttcccc atcttagctg cccttgcctg gctttctcct cattctctcc
      241 tttatgccaa agaagtgaac attccaggga gtggggcgtc agtctgtgac atgagacact
      301 tcctcttatg tactgtgtcg tgaataaacc gtttttactt tagaaacaag aatgccgttg
      361 cctttgcttt caagccactg ttcagaaagc agacatactt gggatttgat tgttcgccca
      421 gggagttagg cccctagagg atgaaagttc tagaacttat ctcttccctc tggtagccac
      481 tatacagatg gcactttttt aaaaggatga gtgaatttaa aattatatac agttcaaatt
      541 ttacccctta ggcaaagggc tacagctctt atc
//
LOCUS       HUMLSERP1    1235 bp ds-DNA             PRI       31-MAY-1990
DEFINITION  Human leuserpin 2 (hLS2) gene, 5' flank.
ACCESSION   M33659 J03921
KEYWORDS    leuserpin 2; serine protease inhibitor.
SEGMENT     1 of 2
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1235)
  AUTHORS   Ragg,H. and Preibisch,G.
  TITLE     Structure and expression of the gene coding for the human serpin
            hLS2
  JOURNAL   J. Biol. Chem. 263, 12129-12134 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Ragg, 17-JAN-1988
FEATURES       from  to/span     description
    IVS        1134  >  1235     leuserpin 2 (hLS2) intron A
BASE COUNT      345 a    313 c    302 g    275 t
ORIGIN      
        1 ctcgggaggt tgaggctgca gtgagccaag atcacgccac tgcacttcag cctgggtaac
       61 agagtgagac cctgtctcaa aaaacacata gggcaggcgt ggtggctcac gcatgtaatc
      121 ccagcacttt gggaggccga gacgggagga tcacttcact ccaggagttc aacaccagcc
      181 tggccaacat agtgaaaccc cgtctctact aaaaatacaa aaaattagtt ggacatggtg
      241 gtgtgcgcct gtaatctcag ccactcagga ggctgaggca ggagaatcgc ttgaacttgg
      301 gagacagagg ttgcagtgag ctgagatcgc accactgcac tccagcatgg gcagcagcgc
      361 gaaactctgt ctcaaaacaa acaaacaaac aaacaaacac ccataaacac aaaatgtatc
      421 acagcctcag agatccccac gaatgcctaa gtggccctga atttgggagg cactgctcag
      481 taatagtcct atctgtccca caacagacag gagtgctggg ctgcacctac tggcaacaaa
      541 cacagcaacc cttgactgaa gaaaggtcca tgccacaatc cccttattct gtaagccact
      601 aattttgtcc tctctcctcc acctttcact gaggaacgag ctcttggaag gacagggaca
      661 cccgcctagt agctgagcca gccacatcag tcctggagag caggtggagg gcagatgctg
      721 tgatcatccc agaagagagg acacagttgg aggcagatgc atggtctcta ctttcagcta
      781 ccctcaatgc agcctggtcc ccagaggcct gaagagcgcc ttgtttatgt ggtgacctca
      841 agaggggctg ctcctgcacc aaggctatgt gtgcatgcta acacagtaac cgtcatatac
      901 tcaaaatgtc agctctaaga actggagatg aggagctgca agccactcta cagttatcaa
      961 aggcacagct gagggggttt gtgctgacca agctggttgc ctggtgtttg gattgggact
     1021 tatttacttt ggaaaatatg cagcaacagc ccagcaccaa agttcacatc aaaatcccac
     1081 tgatgacctt ggctgctttc atctctgaag cgccacttct cagaaacaca gaggtaagtt
     1141 gggtttctaa tgtttctgct gattataaat tatttttggt gtttacggat aggcaactgg
     1201 ttcatttttc tagcaaacta agaattcaga agctt
//
LOCUS       HUMLSERP2     408 bp ds-DNA             PRI       31-MAY-1990
DEFINITION  Human leuserpin 2 (hLS2) gene, exon 2.
ACCESSION   M33660 J03921
KEYWORDS    leuserpin 2; serine protease inhibitor.
SEGMENT     2 of 2
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 408)
  AUTHORS   Ragg,H. and Preibisch,G.
  TITLE     Structure and expression of the gene coding for the human serpin
            hLS2
  JOURNAL   J. Biol. Chem. 263, 12129-12134 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Ragg, 17-JAN-1988
FEATURES       from  to/span     description
    pept         52  >   408     leuserpin 2 (hLS2) precursor, exon 2 (first
                                 expressed exon)
    sigp         52      108     leuserpin 2 (hLS2) signal peptide
    matp        109  >   408     leuserpin 2 (hLS2)
    IVS     <     1       36     hLS2 intron A
BASE COUNT      107 a    111 c     98 g     92 t
ORIGIN      About 5.2 kb after segment 1.
        1 aggccgcctt tcactgtgtt ctgttttccc tcccagcttt agctccgcaa aatgaaacac
       61 tcattaaacg cacttctcat tttcctcatc ataacatctg cgtggggtgg gagcaaaggc
      121 ccgctggatc agctagagaa aggaggggaa actgctcagt ctgcagatcc ccagtgggag
      181 cagttaaata acaaaaacct gagcatgcct cttctccctg ccgacttcca caaggaaaac
      241 accgtcacca acgactggat tccagagggg gaggaggacg acgactatct ggacctggag
      301 aagatattca gtgaagacga cgactacatc gacatcgtcg acagtctgtc agtttccccg
      361 acagactctg atgtgagtgc tgggaacatc ctccagcttt ttcatggc
//
LOCUS       MUSIGHAAN     375 bp ss-mRNA            ROD       31-MAY-1990
DEFINITION  Mouse Ig rearranged gamma-chain mRNA V-D-J4-region, hybridoma A6.1,
            partial cds.
ACCESSION   M33558
KEYWORDS    N-region; diversity exon; gamma-immunoglobulin;
            immunoglobulin heavy chain; joining exon; processed gene;
            variable region.
SOURCE      Mouse (strain BW) hybridoma A6.1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 375)
  AUTHORS   Tsao,B.P., Ebling,F.M., Roman,C., Panosian-Sahakian,N.,
            Calame,K. and Hahn,B.H.
  TITLE     Structural characteristics of the variable regions of
            immunoglobulin genes encoding a pathogenic autoantibody in murine
            lupus
  JOURNAL   J. Clin. Invest. 85, 530-540 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   375     Ig gamma-chain V-D-J4-region (AA at 1)
    recomb      300      301     V-region DNA end (3' end put.)/D-region DNA
                                 start (5' end put.)
    recomb      324      325     D-region DNA end/N-region DNA start
    recomb      327      328     N-region DNA end/J-region DNA start
BASE COUNT       93 a     90 c     97 g     95 t
ORIGIN      
        1 gaggtccagc tgcagcagtc tggacctgag ctggtgaagc ctggggcttc agtgaagatg
       61 tcctgcaagg cttctggtta ctcctttact ggctacttta tgaactgggt gaagcagagc
      121 catggaaaga gccttgagtg gattggacgt attaatcctt acaatggtga tactttctac
      181 aaccagaagt tcaagggcaa ggccacattg actgtagaca aatcctctag cacagcccac
      241 atggagctcc ggagcctgac atctgaggac tctgcactct attattgtgc aagagattcc
      301 ccttattact acggtagtag ctacgggtat gctatggact actggggtca aggaacctca
      361 gtcaccgtct cctca
//
LOCUS       MUSIGKABE     321 bp ss-mRNA            ROD       31-MAY-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J2-region, hybridoma A6.1,
            partial cds.
ACCESSION   M33559
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BW) hybridoma A6.1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Tsao,B.P., Ebling,F.M., Roman,C., Panosian-Sahakian,N.,
            Calame,K. and Hahn,B.H.
  TITLE     Structural characteristics of the variable regions of
            immunoglobulin genes encoding a pathogenic autoantibody in murine
            lupus
  JOURNAL   J. Clin. Invest. 85, 530-540 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   321     Ig kappa-chain V-J2-region (AA at 1)
    recomb      291      292     V-region DNA end/J-region DNA start
BASE COUNT       82 a     74 c     76 g     89 t
ORIGIN      
        1 gacatccaga tgattcagtc tccatcgtcc atgtttgcct ctctgggaga cagagtcagt
       61 ctctcttgtc gggctagtca gggcattaga ggtaatttag actggtatca gcagaaacca
      121 ggtggaacta ttaaactcct gatctactcc acatccaatt taaattctgg tgtcccatca
      181 aggttcagtg gcagtgggtc tgggtcagat tattctctca ccatcagcag cctagagtct
      241 gaagattttg cagactatta ctgtctacag cgtaatgcgt atcctctcac gttcggaggg
      301 gggaccaagc tggaaataaa a
//
LOCUS       RATA2UGLBB    852 bp ss-mRNA            ROD       31-MAY-1990
DEFINITION  Rat alpha-2u-globulin (L type) mRNA, complete cds., clone pA110.
ACCESSION   M26836
KEYWORDS    alpha-2u-globulin; globulin.
SOURCE      Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA110.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 852)
  AUTHORS   Ichiyoshi,Y., Endo,H. and Yamamoto,M.
  TITLE     Length polymorphism in the 3' noncoding region of rat hepatic
            alpha-2u-globulin mRNAs
  JOURNAL   Biochim. Biophys. Acta 910, 43-51 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          4      549     alpha-2u-globulin
    mRNA    <     1      852     alpha-2u-globulin mRNA
BASE COUNT      246 a    186 c    198 g    222 t
ORIGIN      
        1 aacatgaagc tgttgctgct gctgctgtgt ctgggcctga cactggtctg tggccatgca
       61 gaagaagcta gttccacaag agggaacctc gatgtggaca agctcaatgg ggattggttt
      121 tctattgtcg tggcctctga caaaagagaa aagatagaag agaatgccag catgagagtt
      181 tttatgcagc acatcgatgt cttggagaat tccttaggct tcaagttccg tattaaggaa
      241 aatggagagt gcagggaact atatttggtt gcctacaaaa cgccagagga tggcgaatat
      301 tttgttgagt atgacggagg gaatacattt actatactta agacagacta tgacagatat
      361 gtcatgtttc atctcattaa tttcaagaac ggggaaacct tccaggcgat ggtgctctac
      421 ggcagaacaa aggatctgag ttcagacatc aaggaaaagt ttgcaaaact atgtgaggcg
      481 catggaatca ctagggacaa tatcattgat ctaaccaaga ctgatcactg tctccaggcc
      541 cgaggatgaa gaaaggcctg agactccagg tgggcaatct ccagtgagag caagtgctga
      601 gtggagactt ctcaccagga ctctagcatc accatttcct gtccatggag catcctgaga
      661 caaattctgc gatctgattt ccatcctctg tcacagaaaa gtgcaatcct ggtctctcca
      721 gcatcttccc tagttaccca ggacaacaca tcgagaatta aaagctttct taaatttctc
      781 ttggccccac ccatgatcat tccgcacaaa tttcttgctc ttgcagttca ataaatgatt
      841 acccttgcac tt
//
LOCUS       RATA2UGLBC    838 bp ss-mRNA            ROD       31-MAY-1990
DEFINITION  Rat alpha-2u-globulin (S type) mRNA, complete cds., clone pA120.
ACCESSION   M26837
KEYWORDS    alpha-2u-globulin; globulin.
SOURCE      Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA120.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 838)
  AUTHORS   Ichiyoshi,Y., Endo,H. and Yamamoto,M.
  TITLE     Length polymorphism in the 3' noncoding region of rat hepatic
            alpha-2u-globulin mRNAs
  JOURNAL   Biochim. Biophys. Acta 910, 43-51 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         15      560     alpha-2u-globulin
    mRNA    <     1      838     alpha-2u-globulin mRNA
BASE COUNT      240 a    186 c    189 g    223 t
ORIGIN      
        1 tattccctac caacatgaag ctgttgctgc tgctgctgtg tctgggcctg acactggtct
       61 gtggccatgc agaagaagct agttccacaa gagggaacct cgatgtggct aagctcaatg
      121 gggattggtt ttctattgtc gtggcctcta acaaaagaga aaagatagaa gagaatggca
      181 gcatgagagt ttttatgcag cacatcgatg tcttggagaa ttccttaggc ttcaagttcc
      241 gtattaagga aaatggagag tgcagggaac tatatttggt tgcctacaaa acgccagagg
      301 atggcgaata ttttgttgag tatgacggag ggaatacatt tactatactt aagacagact
      361 atgacagata tgtcatgttt catctcatta atttcaagaa cggggaaacc ttccagctga
      421 tggtgctcta cggcagaaca aaggatctga gttcagacat caaggaaaag tttgcaaaac
      481 tatgtgaggc gcatggaatc actagggaca atatcattga tctaaccaag actgatcgct
      541 gtctccaggc ccgaggatga agaaaggcct gagcctccag tgctgagtgg agacttctca
      601 ccaggactct agcatcacca tttcctgtcc atggagcatc ctgagacaaa ttctgcgatc
      661 tgatttccat cctctgtcac agaaaagtgc aatcctggtc tctccagcat cttccctagt
      721 tacccaggac aacacatcga gaattaaaag ctttcttaaa tttctcttgg ccccacccat
      781 gatcattccg cacaaatatc ttgctcttgc agttcaataa atgattaccc ttgcactt
//
LOCUS       RATA2UGLBD    835 bp ss-mRNA            ROD       31-MAY-1990
DEFINITION  Rat alpha-2u-globulin (S type) mRNA, complete cds., clone pA108.
ACCESSION   M26838
KEYWORDS    alpha-2u-globulin; globulin.
SOURCE      Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA108.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 835)
  AUTHORS   Ichiyoshi,Y., Endo,H. and Yamamoto,M.
  TITLE     Length polymorphism in the 3' noncoding region of rat hepatic
            alpha-2u-globulin mRNAs
  JOURNAL   Biochim. Biophys. Acta 910, 43-51 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         12      557     alpha-2u-globulin
    mRNA    <     1      835     alpha-2u-globulin mRNA
BASE COUNT      240 a    185 c    190 g    220 t
ORIGIN      
        1 tccctaccaa catgaagctg ttgctgctgc tgctgtgtct gggcctgaca ctggtctgtg
       61 gccatgcaga agaagctagt tccacaagag ggaacctcga tgtggacaag ctcaatgggg
      121 attggttttc tattgtcgtg gcctctgaca aaagagaaaa gatagaagag aatggcagca
      181 tgagagtttt tatgcagcac atcgatgtct tggagaattc cttaggcttc aagttccgta
      241 ttaaggaaaa tggagagtgc agggaactat atttggttgc ctacaaaacg ccagaggatg
      301 gcgaatattt tgttgagtat gacggaggga atacatttac tatacttaag acagactatg
      361 acagatatgt catgtttcat ctcattaatt tcaagaacgg ggaaaccttc caggcgatgg
      421 tgctctacgg cagaacaaag gatctgagtt cagacatcaa ggaaaagttt gcaaaactat
      481 gtgaggcgca tggaatcact agggacaata tcattgatct aaccaagact gatcactgtc
      541 tccaggcccg aggatgaaga aaggcctgag actccagtgc tgagtggaga cttctcacca
      601 ggactctagc atcaccattt cctgtccatg gagcatcctg agacaaattc tgcgatctga
      661 tttccatcct ctgtcacaga aaagtgcaat cctggtctct ccagcatctt ccctagttac
      721 ccaggacaac acatcgagaa ttaaaagctt tcttaaattt ctcttggccc cacccatgat
      781 cattccgcac aaatttcttg ctcttgcagt tcaataaatg attacccttg cactt
//
LOCUS       RATA2UGLBE    867 bp ss-mRNA            ROD       31-MAY-1990
DEFINITION  Rat alpha-2u-globulin (L type) mRNA, complete cds., clone pA8.
ACCESSION   M26835
KEYWORDS    alpha-2u-globulin; globulin.
SOURCE      Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA8.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 867)
  AUTHORS   Ichiyoshi,Y., Endo,H. and Yamamoto,M.
  TITLE     Length polymorphism in the 3' noncoding region of rat hepatic
            alpha-2u-globulin mRNAs
  JOURNAL   Biochim. Biophys. Acta 910, 43-51 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         19      564     alpha-2u-globulin
    mRNA    <     1      867     alpha-2u-globulin mRNA
BASE COUNT      248 a    192 c    198 g    229 t
ORIGIN      
        1 attctattcc ctaccaacat gaagctgttg ctgctgctgc tgtgtctggg cctgacactg
       61 gtctgtggcc atgcagaaga agctagttcc acaagaggga acctcgatgt ggctaagctc
      121 aatggggatt ggttttctat tgtcgtggcc tctaacaaaa gagaaaagat agaagagaat
      181 ggcagcatga gagtttttat gcagcacatc gatgtcttgg agaattcctt aggcttcaag
      241 ttccgtatta aggaaaatgg agagtgcagg gaactatatt tggttgccta caaaacgcca
      301 gaggatggcg aatattttgt tgagtatgac ggagggaata catttactat acttaagaca
      361 gactatgaca gatatgtcat gtttcatctc attaatttca agaacgggga aaccttccag
      421 ctgatggtgc tctacggcag aacaaaggat ctgagttcag acatcaagga aaagtttgca
      481 aaactatgtg aggcgcatgg aatcactagg gacaatatca ttgatctaac caagactgat
      541 cgctgtctcc aggcccgagg atgaagaaag gcctgagcct ccaggtgggc aatctccagt
      601 gagagcaagt gctgagtgga gacttctcac caggactcta gcatcaccat ttcctgtcca
      661 tggagcatcc tgagacaaat tctgcgatct gatttccatc ctctgtcaca gaaaagtgca
      721 atcctggtct ctccagcatc ttccctagtt acccaggaca acacatcgag aattaaaagc
      781 tttcttaaat ttctcttggc cccacccatg atcattccgc acaaatatct tgctcttgca
      841 gttcaataaa tgattaccct tgcactt
//
LOCUS       MZENAR1       386 bp ss-mRNA            PLN       31-MAY-1990
DEFINITION  Maize glyceraldehyde-3-phosphate dehydrogenase mRNA, 5' end.
ACCESSION   M31481 M31482
KEYWORDS    glyceraldehyde-3-phosphate dehydrogenase.
SEGMENT     1 of 2
SOURCE      Maize leaf (strain L. W64xW128E), cDNA to mRNA, clone Zm6.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 386)
  AUTHORS   Gowri,G. and Campbell,W.H.
  TITLE     cDNA clones for corn leaf NADH:nitrate reductase and chloroplast
            NAD(P)+: glyceraldehyde-3-phosphate dehydrogenase
  JOURNAL   Plant Physiol. 90, 792-798 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         12  >   386     glyceraldehyde-3-phosphate dehydrogenase
                                 precursor
    sigp         12      209     glyceraldehyde-3-phosphate dehydrogenase
                                 signal peptide
    matp        210  >   386     glyceraldehyde-3-phosphate dehydrogenase
    mRNA    <     1  >   386     glyceraldehyde-3-phosphate dehydrogenase mRNA
BASE COUNT       57 a    143 c    125 g     61 t
ORIGIN      
        1 ggtccgtggc catggcgtcg tccatgctct ccgctaccac cgtgccactc cagcaggggg
       61 gcggcctgtc cgagttctcc gggctcagga gctccgcgtc gctgcccatg cgccggaatg
      121 ccacctccga cgacttcatg tccgccgtct ccttcaggac ccacgcggtc ggtacgagcg
      181 gcgggccgcg gcgggcgccg acggaggcaa agctgaaggt ggccatcaac gggttcggcc
      241 gcatcgggcg caacttcctg cggtgctggc acgggcgcgg cgacgcgtcg cccctcgacg
      301 tcatcgccat caacgacacc ggaggcgtca agcaggcgtc ccacctgctc aagtacgact
      361 ccacgctcgg catcttcgat gccgat
//
LOCUS       MZENAR2       456 bp ss-mRNA            PLN       31-MAY-1990
DEFINITION  Maize glyceraldehyde-3-phosphate dehydrogenase, 3' end.
ACCESSION   M31483
KEYWORDS    glyceraldehyde-3-phosphate dehydrogenase.
SEGMENT     2 of 2
SOURCE      Maize leaf (strain L. W64xW1282E), cDNA to mRNA, clone Zm6.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 456)
  AUTHORS   Gowri,G. and Campbell,W.H.
  TITLE     cDNA clones for corn leaf NADH:nitrate reductase and chloroplast
            NAD(P)+: glyceraldehyde-3-phosphate dehydrogenase
  JOURNAL   Plant Physiol. 90, 792-798 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      312     glyceraldehyde-3-phosphate dehydrogenase
                                 precursor (AA at 1)
    matp    <     1      309     glyceraldehyde-3-phosphate dehydrogenase
    mRNA    <     1      456     glyceraldehyde-3-phosphate dehydrogenase mRNA
BASE COUNT       87 a    150 c    122 g     97 t
ORIGIN      About 525 bp after segment 1.
        1 cgggtgccca ccccgaacgt ctccgtcgtc gacctcgtcg tgcaggtctc caagaagacc
       61 ctcgccgagg aggtgaacca ggcgttccgc gacgccgcgg ccaacgagct cacgggcatc
      121 ctcgaggtct gcgacgtgcc gctcgtgtcc gtcgacttca ggtgctccga cgtctcctcc
      181 accatcgacg cctccctcac catggtcatg ggcgacgaca tggtcaaggt catctcctgg
      241 tacgacaacg agtggggcta ctcgcagagg gtcgtcgacc tcgctgacat ctgcgccaac
      301 cagtggaagt gaagcgttgc attgtaatag cactttgtaa ctcctcaccc tccacagggc
      361 tttgggattc accccgaaat gactcacttt cggtgaaatt cacccgtttc ggcgaggtcg
      421 tccgaatata ccggccaaat gtttccattg gatttc
//

GenBank-Updates@genbank.bio.net (06/01/90)

LOCUS       ECONANA      1243 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  Escherichia coli nanA gene encoding N-acetylneuraminate lyase,
            complete cds.
ACCESSION   M20207 D00067 N00067
KEYWORDS    N-acetylneuraminate lyase; NALase; nanA gene; npl gene.
SOURCE      E.coli JE1011 DNA, 1.2 kb HindIII-EcoRI fragment DNA containing
            nanA gene, which is cloned in pNL4, for [1]. E.coli C600 DNA, clone
            pMK6, for [2].
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1243)
  AUTHORS   Kawakami,B., Kudo,T., Narahashi,Y. and Horikoshi,K.
  TITLE     Nucleotide sequence of the N-acetylneuraminate lyase gene of
            Escherichia coli
  JOURNAL   Agric. Biol. Chem. 50, 2155-2158 (1986)
  STANDARD  simple automatic
FEATURES       from  to/span     description
    pept         93      986     N-acetylneuraminate lyase
BASE COUNT      300 a    309 c    322 g    312 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagctttctg tatggggtgt tgcttaattg atctggtata acaggtataa aggtatatcg
       61 tttatcagac aagcatcact tcagaggtat ttatggcaac gaatttacgt ggcgtaatgg
      121 ctgcactcct gactcctttt gaccaacaac aagcactgga taaagcgagt ctgcgtcgcc
      181 tggttcagtt caatattcag cagggcatcg acggtttata cgtgggtggt tcgaccggcg
      241 aggcctttgt acaaagcctt tccgagcgtg aacaggtact ggaaatcgtc gccgaagagg
      301 cgaaaggtaa gattaaactc atcgcccacg tcggttgcgt cagcaccgcc gaaagccaac
      361 aacttgcggc atcggctaaa cgttatggct tcgatgccgt ctccgccgtc acgccgttct
      421 actatccttt cagctttgaa gaacactgcg atcactatcg ggcaattatt gattcggcgg
      481 atggtttgcc gatggtggtg tacaacattc cagccctgag tggggtaaaa ctgaccctgg
      541 atcagatcaa cacacttgtt acattgcctg gcgtaggtgc gctgaaacag acctctggcg
      601 atctctatca gatggagcag atccgtcgtg aacatcctga tcttgtgctc tataacggtt
      661 acgacgaaat cttcgcctct ggtctgctgg cgggcgctga tggtggtatc ggcagtacct
      721 acaacatcat gggctggcgc tatcagggga tcgttaaggc gctgaaagaa ggcgatatcc
      781 agaccgcgca gaaactgcaa actgaatgca ataaagtcat tgatttactg atcaaaacgg
      841 gcgtattccg cggcctgaaa actgtcctcc attatatgga tgtcgtttct gtgccgctgt
      901 gccgcaaacc gtttggaccg gtagatgaaa aatatctgcc agaactgaag gcgctggccc
      961 agcagttgat gcaagagcgc gggtgagttg tttcccctcg ctcgccccta ccggtgaggg
     1021 gaaataaacg catctgtacc ctacaatttt cataccaaag cgtgtgggca tcgcccaccg
     1081 cgggagactc acaatgagta ctacaaccca gaatatcccg tggtatcgcc atctcaaccg
     1141 tgcacaatgg cgcgcatttt ccgctgcctg gttgggatat ctgcttgacg gttttgattt
     1201 cgttttaatc gccctggtac tcaccgaagt acaaggtgaa ttc
//
LOCUS       MACEPO       1462 bp ss-mRNA            PRI       01-JUN-1990
DEFINITION  Monkey (Cynomolgus) erythropoietin mRNA, complete cds.
ACCESSION   M18189 M15818 M15819 M18188
KEYWORDS    erythropoietin.
SOURCE      Monkey cDNA to mRNA.
  ORGANISM  Macaca fascicularis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; 
            Cercopithecidae; Cercopithecinae.
REFERENCE   1  (bases 1 to 1462)
  AUTHORS   Lin,F.-K., Lin,C.-H., Lai,P.-H., Browne,J.K., Egrie,J.C.,
            Smalling,R., Fox,G.M., Chen,K.K., Castro,M. and Suggs,S.
  TITLE     Monkey erythropoietin gene: Cloning, expression and comparison with
            the human erythropoietin gene
  JOURNAL   Gene 44, 201-209 (1986)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by F.-K.Lin, 04-AUG-1987.
FEATURES       from  to/span     description
    pept        217      795     erythropoietin precursor
    sigp        217      288     erythropoietin signal peptide
    matp        289      792     erythropoietin
BASE COUNT      297 a    455 c    433 g    277 t
ORIGIN      Unreported.
        1 agagtccctg ggccaccccg gtcgctcgct gcgtgcgtcc cggagccgga ccggggccac
       61 cgcgccggct ctgcaccgat cccgcgcccc ctggacagcc gccctctcct ccaggcccgt
      121 ggggctggcc ctgccccgct gaacttcccg ggatgaggac tcccggtgtg gtcaccgcgc
      181 gcctaggtcg ctgagggacc ccggccaggc gcggagatgg gggtgcacga atgtcctgcc
      241 tggctgtggc ttctcctgtc tctcgtgtcg ctccctctgg gcctcccagt cccgggcgcc
      301 ccaccacgcc tcatctgtga cagccgagtc ctggagaggt acctcttgga ggccaaggag
      361 gccgagaatg tcacgatggg ctgttccgaa agctgcagct tgaatgagaa tatcaccgtc
      421 ccagacacca aagttaactt ctatgcctgg aagaggatgg aggtcgggca gcaggctgta
      481 gaagtctggc agggcctggc cctgctctca gaagctgtcc tgcggggcca ggccgtgttg
      541 gccaactctt cccagccttt cgagcccctg cagctgcaca tggataaagc catcagtggc
      601 cttcgcagca tcaccactct gcttcgggcg ctgggagccc aggaagccat ctccctccca
      661 gatgcggcct cggctgctcc actccgaacc atcactgctg acactttctg caaactcttc
      721 cgagtctact ccaatttcct ccggggaaag ctgaagctgt acacggggga ggcctgcagg
      781 agaggggaca gatgaccagg tgcgtccagc tgggcacatc caccacctcc ctcaccaaca
      841 ctgcctgtgc cacaccctcc ctcaccactc ccgaacccca tcgaggggct ctcagctaag
      901 cgccagcctg tcccatggac actccagtgc cagcaatgac atctcagggg ccagaggaac
      961 tgtccagagc acaactctga gatctaagga tgtcgcaggg ccaacttgag ggcccagagc
     1021 aggaagcatt cagagagcag ctttaaactc aggagcagag acaatgctgg gaaaacacct
     1081 gagctcactc ggccacctgc aaaatttgat gccaggacac gctttggagg caatttacct
     1141 gtttttgcac ctaccatcag ggacaggatg acctggagaa cttaggtggc aagctgtgac
     1201 ttctccaggc ctcacgggca ctcccttggt ggcaagagcc cccttgacac tgagagaata
     1261 ttttgcaatc tgcagcagga aaaattacgg acaggttttg gaggttggag ggtacttgac
     1321 aggtgtgtgg ggaagcaggg cggtaggggt ggagctggga tgcgagtgag aaccgtgaag
     1381 acaggatggg ggctggcctc tggttctcgt ggggtccaag ctttgtgtat tcttcaatct
     1441 cactggcaag aactgaaacc ac
//
LOCUS       AFACFXP1     1751 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  A.eutrophus phosphoribulokinase P (cfxP) gene, complete cds.
ACCESSION   M33562
KEYWORDS    phosphoribulokinase.
SOURCE      A.eutrophus (strain H16) DNA.
  ORGANISM  Alcaligenes eutrophus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Alcaligenaceae.
REFERENCE   1  (bases 1 to 1751)
  AUTHORS   Kossmann,J., Klintworth,R. and Bowien,B.
  TITLE     Sequence analysis of the chromosomal and plasmid genes encoding
            phosphoribulokinase from Alcaligenes eutrophus
  JOURNAL   Gene 85, 247-252 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      621     phosphoribulokinase F(AA at 1)
    pept        633     1511     phosphoribulokinase P
BASE COUNT      310 a    614 c    542 g    285 t
ORIGIN      
        1 ctgcagcccg gcagcgccca ggtggcggcc ggctacgcgc tctacggtcc caccaccatg
       61 ctggtgctga ccgtgggcaa tggcgtcaac ggcttcacgc tcgatcccaa cctgggcgag
      121 ttcttcctga cgcaccccca cctgcgggtg ccgcccgata cccaggaatt tgccatcaac
      181 gcgtcgaaca gccgcttctg ggaagcgccg gtgcagcgct acatcggcga gtgcatggct
      241 ggcaagagcg ggccgcgcgg caaggatttc aatatgcgct ggatcgcatc gatggtggcc
      301 gaggcgcacc gcatcctgat gcgcggcggt gtcttcatgt acccgcgcga caccaaggat
      361 cccgccaagc cgggccgcct tcgcctgctg tacgaggcca acccgatcgc cttcctgatg
      421 gagcaggccg gcgggcgcgc cagcacgggc cggcagacgc tgatgtcggt ggcgccgggc
      481 gcgctgcacc agcgtatcgg cgtgatcttc ggctcgcgca atgaagtgga acggatcgag
      541 ggctaccaca ccaaccagac cgaccccgac cttccgagtc ccctgttcaa cgagcgcagc
      601 ctgttccgcg cgtccgcctg aggtgcctgg ccatgtcaga acgttatccc attatcgcta
      661 tcaccgggtc ctcgggcgcc ggtaccacgt cggtgacccg taccttcgag aacatcttct
      721 gccgtgaagg cgtgaagtcg gtcgtgatcg aaggcgacag cttccaccgc tacgaccgtg
      781 ccgagatgaa ggtcaagatg gccgaggccg aacgcaccgg caatatgaac ttcagccact
      841 tcggcgcgga gaacaacctg ttcggcgacc tggagagcct gttccgctcc tacgcggaat
      901 ccggcacggg catgcgccgg cgctatctgc acagcacgga ggaggccgcg ccgttcgggc
      961 agcagcccgg caccttcacc gcgtgggagc cattgccggc cgataccgac ctgctgttct
     1021 acgaaggcct gcacggcggc gtggtcaccg acgaggtcaa tgtcgcccag tatcccaacc
     1081 tgctgatcgg cgtggtgccc gtcatcaacc tggagtggat tcagaagctc tggcgcgaca
     1141 agaaacagcg cggctactcg accgaggccg tgaccgacac catcctgcgc cgcatgccgg
     1201 actacgtgaa ctacatctgc ccgcagttct cgcgcacgca tgtgaacttc cagcgggtgc
     1261 cgtgcgtgga cacgtccaac ccctttatct cgcgcgaaat ccccgcgccc gatgaaagca
     1321 tggtggtgat ccgctttgcc aacccgaagg ggatcgactt ccagtacctg ctgagcatga
     1381 tccacgactc cttcatgtcg cgcgccaaca ccatcgtggt gccgggcggc aagatggaac
     1441 tggccatgca gctgatcttc acgcccttcg tgctgcgcat gatggagcgc cgcaagcgcg
     1501 ccgcgctgta aggagccgag atgaacgcac ccgaacgcat cgaccccgca gcgcgttgcg
     1561 ccaacgcgct gcgcttcctg gccgccgacg cggtggagtt ggcccgctcc ggccaccccg
     1621 gcgcccccat gggcatggcc gagatggccg aggtcgtgtg gcggcgccac ctgcgccaca
     1681 acccggccaa cccggcctgg cccgaccgtg accgcttcgt gctgtccaac ggccatgcct
     1741 ccatgctgca g
//
LOCUS       AFACFXP2     1641 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  A.eutrophus phosphoribulokinase P (cfxF) gene, complete cds.
ACCESSION   M33563
KEYWORDS    phosphoribulokinase.
SOURCE      A.eutrophus (strain H16) DNA.
  ORGANISM  Alcaligenes eutrophus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Alcaligenaceae.
REFERENCE   1  (bases 1 to 1641)
  AUTHORS   Kossmann,J., Klintworth,R. and Bowien,B.
  TITLE     Sequence analysis of the chromosomal and plasmid genes encoding
            phosphoribulokinase from Alcaligenes eutrophus
  JOURNAL   Gene 85, 247-252 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      480     phosphoribulokinase F
    pept        492     1370     phosphoribulokinase P
BASE COUNT      306 a    555 c    505 g    275 t
ORIGIN      
        1 ctgcaggtgc cggccgatac ccaggaattt gccatcaatg cgtcgaacag ccgcttctgg
       61 gaagcgccgg tgcagcgcta catcgccgag tgcatggccg gcaagagcgg gccgcgcggc
      121 aaggatttca atatgcgctg gatcgcgtcg atggtggccg aggcgcaccg catcctgatg
      181 cgtggcggcg tcttcatgta cccgcgcgac tccaaggatc ccgccaagcc gggccgcctg
      241 cgcctgctgt acgaggccaa tccgatcgcc ttcctgatgg agcaggctgg cgggcgcgcc
      301 agcacgggcc ggcagacgct gatgtcggtg gcgccgggtg cgctgcacca gcgcattggc
      361 gtgatcttcg gctcgcgcaa tgaagtggaa cggatcgagg gctaccacac cgaccagacc
      421 gatcccgacc ttccgagtcc cctgttcaac gagcgcagcc tgttccgcgc gtctgcctga
      481 ggtgcctggc catgtcagaa cgttatccca ttatcgccat caccggctcc tcgggcgccg
      541 gtaccacgtc ggtgacccgt accttcgaga acatcttccg ccgcgagggc gtgaagtcag
      601 tggtgatcga aggcgacagc ttccaccgct atgaccgcgc cgagatgaag gtcaagatgg
      661 ccgaggccga gcgcaccggc aatatgaact tcagccactt cggcgaggag aacaacctgt
      721 tcggcgagct ggagaacctg ttccgctcct atgccgaaac cggcacgggc atgcaccggc
      781 actacctgca tagcccggag gaagccgcgc cgttcgggca ggaacccggc accttcacgc
      841 agtgggagcc gctgccggcc gataccgatc tgctgttcta cgagggcctg cacggcggcg
      901 tggtcaccga ttcggtcaat gtcgcccagt atcccaacct gctgatcggc gtggtccctg
      961 tcatcaacct ggaatggatc cagaagctct ggcgcgacaa gaagcagcgc ggctattcga
     1021 ccgaggccgt gaccgacacc atcctgcgcc gcatgccgga ctacgtgaat tacatctgtc
     1081 cgcagttctc gcgcacgcat gtgaacttcc agcgcgtgcc gtgcgtggat acgtccaacc
     1141 cctttatctc gcgcgaaatc cccgcgcccg atgaaagcat ggtggtgatc cgctttgcca
     1201 acccgaaggg gatcgacttc cagtacctgc tgagcatgat ccacgactcc ttcatgtcgc
     1261 gcgccaacac catcgtggtg cccggcggca agatggagct ggccatgcag ctgatcttca
     1321 cgcccttcgt gctgcgcatg atggagcgcc gcaagcgcgc cgcgcagtaa ggactcgacc
     1381 gtgaaacgcc gactcgcgag gatctcctga atgaacgcac cagaacgcat cgattccgca
     1441 gcgcgctgtg ccaacgcgct gcgcttcctg gccgccgacg cggtggagca ggccaaatcg
     1501 ggccaccccg gcgcacccat gggcatggcc gaaatggccg aagtgctgtg gcggcgccac
     1561 ctgcgacaca acccggccaa cccggcctgg cccgaccgtg accgcttcgt gctgtccaac
     1621 ggccatgcgt ccatgctgca g
//
LOCUS       BIFLLD       1767 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  B.longum L-lactate dehydrogenase (ldh) gene, complete cds.
ACCESSION   M33585
KEYWORDS    L-lactate dehydrogenase.
SOURCE      B.longum (strain aM101-2) DNA.
  ORGANISM  Bifidobacterium longum
            Prokaryota; Bacteria; Irregular, nonsporing, gram-positive rods.
REFERENCE   1  (bases 1 to 1767)
  AUTHORS   Minowa,T., Iwata,S., Sakai,H., Masaki,H. and Ohta,T.
  TITLE     Sequence and characteristics of the Bifidobacterium longum gene
            encoding L-lactate dehydrogenase and the primary structure of the
            enzyme: A new feature of the allosteric site
  JOURNAL   Gene 85, 161-168 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        555     1517     L-lactate dehydrogenase (EC 1.1.1.27)
    mRNA        493  >  1517     L-lactate dehydrogenase mRNA
BASE COUNT      373 a    555 c    488 g    351 t
ORIGIN      
        1 gtcgacgcgg tcaatgacgt gttggcggac atcgaaggca cggcctcgat tccgcgtatt
       61 ctcgtattca acaaggccga tcaggcggac gaggcgactc gtgaacgact cgccgcgctg
      121 cagccagatg cgttcatcgt ctccgcctat accggtgagg gattggacga gctgcgtacc
      181 gcggtcgaaa gtctgctgcc ggtcccgcat gtgcatgtca acgctctgct gccgtatacc
      241 gctggctccc tgatctctcg tgtacgcgaa tacggcaagg tagacaaggt ggagtaccgc
      301 gatgatggca tacagcttga agcggacgtt gatgcccatc ttgcgctcag gtggtcgaac
      361 agtccattga ctaacgtgat aaacatcaca gtatattcgt gagcgctaac aaccgttgaa
      421 aacattacca tacggttgtc aaacagggtg gtgtgccggt agcaaaacgt cttagcgggt
      481 ttatagagtg aagacgttag ttacaaggcc tgccattcat cagcagaccg cctttgaaga
      541 gaggttcatc catcatggcg gaaactaccg ttaagcccac gaagcttgct gttattggtg
      601 ccggtgccgt tggctccacc ctcgccttcg ccgctgccca gcgtggcatc gctcgcgaga
      661 tcgtgcttga agacatcgcc aaggagcgcg tggaagccga agtgctcgac atgcagcatg
      721 gctccagctt ctacccgacc gtgtccatcg acggttccga cgatcctgag atctgccgcg
      781 acgccgacat ggtcgtcatc accgctggtc cgcgtcagaa gccgggtcag tctcgtcttg
      841 agctcgttgg cgctaccgtc aacatcctca aggccatcat gccgaacctg gtcaaggtgg
      901 ctccgaacgc catctacatg ctcatcacca acccggtcga catcgctacc cacgtggctc
      961 agaagctcac cggtctgccc gagaaccaga tcttcggttc cggcaccaac ctggactccg
     1021 ctcgtctgcg cttcctgatt gcccagcaga ccggcgtcaa cgtcaagaac gtgcacgcct
     1081 acatcgccgg cgagcacggc gactccgaag tcccgctgtg ggagtccgcc accatcggtg
     1141 gcgtccccat gtgcgactgg accccgctgc ccggccacga tccgctcgac gccgacaagc
     1201 gcgaggagat ccaccaggaa gtcaagaacg ccgcttacaa gatcatcaac ggtaagggtg
     1261 ccaccaacta cgccatcggc atgtccggcg tcgacatcat cgaagccgtc ctgcacgaca
     1321 ccaaccgcat tctgcccgtg agctccatgc tcaaggactt ccacggcatc tccgacatct
     1381 gcatgtccgt gccgaccctc ctcaaccgtc agggcgtcaa caacaccatc aacaccccgg
     1441 tctccgacaa ggagctcgcc gctctgaagc gctctgccga gacgctgaag gaaaccgccg
     1501 cccagttcgg cttctgataa aaaatcgctg tacggagcgc ctttcgcacc gcagcaagac
     1561 tcgacctacc tttgtaggcc ttcaccttgt gcggcacgaa aatcgcacac gtacagcgat
     1621 tttttatacc ctgctgaatg ctcccgttgg gagcattttt tattcgtgga gttcgttgga
     1681 ggagggggag ctatagccct cgggctcgag ctggaaggtg gtgtggggga ccgagaccgg
     1741 gaagtgctcg cgcaggcaat cctgcag
//
LOCUS       BOVP450SCC   1471 bp ds-DNA             MAM       01-JUN-1990
DEFINITION  Bovine P-450scc gene, exon 1.
ACCESSION   J05245
KEYWORDS    P-450scc.
SOURCE      Bovine calf thymus DNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1471)
  AUTHORS   Ahlgren,R., Simpson,E.R., Waterman,M.R. and Lund,J.
  TITLE     Characterization of the promoter/regulatory region of the bovine
            CYP11A (P-450-scc) gene: Basal and cAMP-dependent expression
  JOURNAL   J. Biol. Chem. 265, 3313-3319 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        948  /  1216     P-450scc, exon 1
    pre-msg     897  >  1471     P-450scc mRNA and introns
BASE COUNT      324 a    367 c    396 g    384 t
ORIGIN      
        1 gcagatgtcc ctggtgattc ctgaaacagg ccctctgttt aaattcttca gcagttagag
       61 ggaaggtcaa tttttcccaa ggcttttggg ctttgattgt tttcattttt aaattatctg
      121 cattctaaag agatattttg ggtggcagat tttgctctcc tacaggactt tgtctaggag
      181 acggctctca ggccagctcc gacgactgtt ccaaagaagt aagggaaagc tagggtttat
      241 atcaatcttt ttttttgctg ggagaagggg gatgaacatg tagtcaaaca taaaaagatc
      301 actgctaatc ccaaacaaca gacacctcaa gtgaatggtt ttagtgtttt tctatatatg
      361 ttgtttagtc actaagtcct gtccgactct tttgcgactc catagactgt agcccaccaa
      421 gctcctctgt ccatgggatt tttctaggca agaatactgg agtgggttgc catttccttc
      481 tccctgggat cttcctaacc caaggactga acccttgtct cctgcattgc aggtggattt
      541 tttaccgact gagccaccag ggaagttatg tgtgcaagaa tccggggtca tggaaatttt
      601 cccttagata tacatcgtat ctagggacca gtacaatgca aatgcttcct gtttttcttc
      661 atcctgaagt ctcctcaggg tgcattgagg gagggagtcc cctcaggtgg gtgaccacag
      721 tggctgacgc ttgatgttgt agaactggaa tgatgggtta cattctttcg tttacagtac
      781 tgagtctggg aggagctgtg tgggctggag tcagccggag gaggctgacc gccctgtcag
      841 cttctcactt agccttgagc tggtgattat aagctgggtc ccagggtccc agggccagag
      901 tcacctgctg cagtacgagc agagacagca gcagctgtgg gggcagcatg ctagcaaggg
      961 ggcttcccct ccgttcagcc ctggtcaaag cctgcccacc catcctgagc tcagtggggg
     1021 agggctgggg ccaccacagg gtgggcactg gagagggagc tggcatctcc acaaagaccc
     1081 ctcgccccta cagtgagatc ccctcccctg gtgacaatgg ctggcttaac ctctaccatt
     1141 tctggaggga gaagggctca cagagaatcc actttcgcca catcgagaac ttccagaagt
     1201 atggccccat ttacaggtaa gcctggcagg aggattgggg ctggcgggat agggaagcct
     1261 gtggtggccc cctccctgaa aggtctgccc tccccttcca ggctctggtt cacctctgac
     1321 tttatttctt cctgcctggc ggtggcagga gtagagttaa tgcttcccag acagtgggtt
     1381 cacttcccag ccctgaggcc tcaacagtcc ccgggctcta cacccttaga aactttgggg
     1441 aggtggggag gcccaagaaa ataagccccg g
//
LOCUS       BOVTGN        725 bp ds-DNA             MAM       01-JUN-1990
DEFINITION  Bovine minisatellite repeat DNA.
ACCESSION   J04756
KEYWORDS    minisatellite sequence.
SOURCE      Bovine (strain Holstein-Friesian) blood DNA, clone lambda-gBJ32.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 725)
  AUTHORS   Kashi,Y., Iraqi,F., Tikochinski,Y., Ruzinzki,B., Nave,A.,
            Beckmann,J.S., Friedmann,A., Soller,M. and Gruenbaum,Y.
  TITLE     (TG)n Uncovers a sex-specific hybridization pattern in cattle
  JOURNAL   Genomics 7, 31-36 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Gruenbaum, 22-JAN-1990.
FEATURES       from  to/span     description
    rpt         311      703     minisatellite repeat
BASE COUNT      134 a    131 c    226 g    234 t
ORIGIN      
        1 aagcttcaca tcccgagaat tccctcccag cgctcgtggt cccacagagg gctctgctgg
       61 acctgcctcg ggtcacatgg caggtctggg gaggacacac ctctccccgg cagagaaatg
      121 gccagaagcc aggtctgctc cacacgtgcc ttctcccaat actctctaac tttaaaaaaa
      181 ctgccaaaga aaaagcggta cgtaataaca agcgcacaga tacgtaattt ataatggctg
      241 acacggttgg cagggaaatg tgttacgcag gaattatgtt tttatttatg tgtgtcctgt
      301 tttggagaca gcataagtaa tcatgggtgt gtgtgtgtgt gtgtgtgtgt gttgcctgtc
      361 tccagcgtaa gtaatcatgt gtgtgtgtgt gtgtgtgtgt tgcctgtctc cagcgtaagt
      421 aatcgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gttgcctgtc tccagagtaa
      481 gtaatcatgg gtgtgtgtgt gtgtgtgttg cctgtctcca gcataagtaa tcatgggtgt
      541 gtgtgtgtgt gtgtgttgcc tgtctccagc ataagtaatc atgggtgtgt gtgtgtgtgt
      601 gtgtgtgttg cctgtctcca gcataagtaa tcatgggggg gtgtgtgtgt gtgtgtgtgt
      661 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tgcctgtctc cagggacttt tgtacagaga
      721 agctt
//
LOCUS       CELCDPK      2305 bp ss-mRNA            INV       01-JUN-1990
DEFINITION  C.elegans cAMP-dependent protein kinase subunit R mRNA,
            complete cds.
ACCESSION   J05220
KEYWORDS    cAMP-dependent protein kinase.
SOURCE      C.elegans mRNA.
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 2305)
  AUTHORS   Lu,X., Gross,R.E., Bagchi,S. and Rubin,C.S.
  TITLE     Cloning, structure, and expression of the gene for a novel
            regulatory subunit of cAMP-dependent protein kinase in
            Caenorhabditis elegans
  JOURNAL   J. Biol. Chem. 265, 3293-3303 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        132     1262     cAMP-dependent protein kinase subunit R
    mRNA    <     1     2305     cAMP-dependent protein kinase subunit R mRNA
BASE COUNT      651 a    543 c    417 g    694 t
ORIGIN      
        1 gcggcatccc ccgtcaaaag tctgttacct tccccgccaa gttttcatgt catttcaaac
       61 gtgtgttttc tgctattttt ctcgcatttt tgtctctttt gtttgtttaa aacaagaaat
      121 gactgaaaat tatgaacaat tattcaggtg acattgtttt tatgtcgggt ggaaacgaag
      181 aggatcagtt ggcacaatgc caggcctatg tgcaacgaca caacattcag cagcttgtca
      241 aagaagcaat tgttgttctg tgcatacaca aacccgacaa cccggttctc ttcttgaagg
      301 atcattttga aaaactcaac gaacaacgag cacaggaagg aggaaatccg gacgctgctg
      361 atgatgacga catcattgtg gagccaccaa agagatcagg tggacgcaga accggaatct
      421 ctgccgaacc aatcaaggag gacgataccg agtacaagaa ggttgtgatt ccaaaggatg
      481 atgctacaag aaggtcgctt gagtcagcca tgaggaagaa cttgctcttt gctcatttgg
      541 aggaggatga gcaaaaaacc atgtacgatg ccatgttccc agttgagaag tctgctggag
      601 agaccatcat cgaacaggga gaagaaggag ataacttcta tgttattgat aagggaaccg
      661 ttgatgtcta cgtcaatcac gagtacgttc tcaccatcaa cgagggtgga tcatttggag
      721 aacttgcttt gatctatgga actccacgtg ctgcaaccgt cattgccaag attgacgtga
      781 aattgtgggc catcgaccgt ctcacctacc gtcgtatctt gatgggatca gtaacaaaga
      841 agagaaagat gtatgacgag ttcctctcca aggttcaaat cttggccgat ctcgaccagt
      901 gggagcgcgt taacgtggct gatgctcttg agagatgcga cttcgagcca ggaactcatg
      961 tcgtcgagca aggacaaccc ggagacgaat tcttcatcat tctcgaggga gaagccaatg
     1021 tgttgcagaa gagaagcgac gacgcaccat tcgacgttgt cggacatctt ggaatgtcgg
     1081 actatttcgg agaaatcgct cttcttctcg accgcccacg tgccgccacc gttgttgcca
     1141 aaacccatct caagtgcatc aagctggatc gtaaccgctt cgaacgtgtc atgggaccag
     1201 ttcgtgagat cctgaagaga gacgtttcaa actacaactc atacgtcaaa ctgatgacct
     1261 aatcggatta caatccgttt gccccgcatc ttcctcatca ccgtatcttc ttttccacct
     1321 tttcgccaaa gtactcgtaa ctattttttg aaagtatccc aaaaattctt ccttagtttt
     1381 tttttagtat tgtatttttt cctgtatttt tttggatttt ctattattct gcatactgtc
     1441 tagacctgta accttcttct ctcaccaacc gtgacatctt caagggcccc ccccccaatc
     1501 aattcgttcc ctatcttcat ttctagtcag tcgatgtcaa attaaattct aaatgtctaa
     1561 atttctaact catagccaaa acaacattca ttgttttaac cagcccaaaa tcctagtcag
     1621 aagtttgcag aaaactattt gaaactcctc gtgaaaaagc taaaaaaatg ttcaactaga
     1681 atagttttag tttttattcc ctgtgtaaca aaaaaaacag ttttgcatgc ttttctatcc
     1741 accctaaaat ttcaacattt tcagtagctt tacagctcaa tctttaccaa ccatttcctt
     1801 caattcctat aggttaactc aaagaagaaa cgaacaccat tcttcattaa ttaacattaa
     1861 gcagtgaatg ccaaatcttg tctttataat atgtaacata taattttatg agaaataaga
     1921 aatgcgcatc cacaacaatc gtcccccccc cccctaccca gcacacattt ttttcttatt
     1981 tgttccactt ctgaattctg aactccgtca cccttatcca gcattgcgaa gtctcttatc
     2041 tacgccccct tccctcaaca tccaacttct tttttttttc atttttatat atttttcaca
     2101 ttcattggaa atccgctgtc atgtcgccat ctcaatgttt tcttgtctct tttttttcat
     2161 ttctccacca cattttccac tagatttctg ttatccaaaa tttttgtcgg tttcccccgc
     2221 gcacacacac acaaaaaaca gtatgttata aactatctat aatgtaatct tagaggttta
     2281 ttggactaat aaaatttgga agttg
//
LOCUS       ECHOGL       1803 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  E.chrysanthemi oligogalacturonate lysase (ogl) gene, complete cds.
ACCESSION   M33583
KEYWORDS    oligogalacturonate lysase.
SOURCE      E.chrysanthemi DNA.
  ORGANISM  Erwinia chrysanthemi
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1803)
  AUTHORS   Reverchon,S., Huang,Y., Bourson,C. and Robert-Baudouy,J.
  TITLE     Nucleotide sequences of the Erwinia chrysanthemi ogl and pelE genes
            negatively regulated by the kdgR gene product
  JOURNAL   Gene 85, 125-134 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        271     1437     oligogalacturonate lysase (ogl)
BASE COUNT      480 a    393 c    441 g    489 t
ORIGIN      
        1 agtgaatcaa tacagccgga ggatatttat tctgacgttt ggtctggctg cttattgccg
       61 aatctggatc aatatcaggc aaaataatct tgcgtcagcg caaacagtca gtgaaaaatg
      121 aaagaatgtt ttataaaata aaaccacgat cacggaaaaa tgaaacgttg tttctacaat
      181 accggtataa caaggcgatt ggccgggatg acgggtaatc tttcacctgg cggcgtctga
      241 tttttcacga taagcgctaa ggatttacgg atggccaaag gtaaaaagct ttctttttcg
      301 ttccatactt accaggattc agtcaccggc accgaagtgg tgcgtctcac tcctcccgat
      361 gttatctgcc accgcaacta cttctatcag aagtgttttt ccaatgatgg cagcaagctg
      421 ctttttggtg gcgcctttga cgggccgtgg aactactatt tgctggatct gaaaactcag
      481 caggcgacgc aactgaccga aggtaccggc gacaatactt ttggtggttt tctgtcacca
      541 gatgatgacg cgctttatta tgtaaagaac gttcgtaatt tgatgcgtgt tgacctgaat
      601 acactggaag aaaccaatat ttatcaggtg ccggacgact gggtcgggta cggtacctgg
      661 gttgccaact ccgactgcac caaaatggtc ggtatcgaga tcaagaaaga ggattggaaa
      721 ccactgaccg actggaaaaa attccaggaa ttctacttta ccaatccatg ctgccgtttg
      781 attcgtatcg atctgaaaac cggcgaagcc accaccattc tgaaggaaaa ccaatggctg
      841 ggtcatccta tttaccgtcc gggtgacgat aatacggtgg ccttctgcca tgaaggtccg
      901 catgacctgg ttgatgcgcg tatgtggttc atcaatgaag atggctccaa tatgcgtaag
      961 gtaaaagagc atgcgccggg cgaaagctgc actcacgaat tctgggtgcc gaatggttct
     1021 gcgctggcct acgtttccta tctgaaaggc agtactaacc gtttcatttg cagcgttgat
     1081 ccggtaacgc tggaaaaccg tcagttgact gaaatgccgc cgtgttctca cctgatgagt
     1141 aactacgatg gtacgttgat ggtgggagat gggtgtaatg cgccggtgga tgtgaaagat
     1201 gacggtggct acaagactga aaacgatccg ttcctgtatg tgttcaatat gaagaccggg
     1261 aaacatttcc aggtcgctca acacaacacc tcctgggaag tgctggaggg cgatcgtcag
     1321 gtaacgcatc cacatccgtc ctttacgccg gatgacaagc acattctgtt tacgtctgat
     1381 gtcgatggta agccggcgtt atatctggca aaagtgcctg attccgtctg gcaataactg
     1441 tacctgcaac cgtgcccgcc agggcacgtt ttttcgcttt tacaataaga ttattaattc
     1501 ggcagtgcta ttgtgatgga tgatattaat cggtctgtca cttatagtaa tgcccgattg
     1561 ccatattcaa agcgttccct ttatactaaa acacgtgttc tatttttttt aaaacaaaaa
     1621 atactgagta gggtaaccac aaaaatggct attgcagatt tagataaaca acccgattcc
     1681 gtgtcatctg tcttaaaggt ctttggtatt ttgcaggcgt taggcgaaga gcgtgagatt
     1741 ggcattactg aactttcaca gcgcgtgatg atgtctaaaa gtactgttta tcgtttcctg
     1801 cag
//
LOCUS       ECHPLE       1758 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  E.chrysanthemi pectate lysase e isoenzyme (pelE) gene, complete
            cds.
ACCESSION   M33584
KEYWORDS    pectate lysase e isoenzyme.
SOURCE      E.chrysanthemi DNA.
  ORGANISM  Erwinia chrysanthemi
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1758)
  AUTHORS   Reverchon,S., Huang,Y., Bourson,C. and Robert-Baudouy,J.
  TITLE     Nucleotide sequences of the Erwinia chrysanthemi ogl and pelE genes
            negatively regulated by the kdgR gene product
  JOURNAL   Gene 85, 125-134 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        251     1465     pectate lysase e (pelE)
BASE COUNT      534 a    480 c    379 g    365 t
ORIGIN      
        1 tttaaaactt catttttaat ttaatcataa aaaataaaaa accaagtaac acaaagttac
       61 aaatacagtc aatagtttat tttttattaa taaaaaccat tgtcatcatc gtgacaaagt
      121 tcacaaaata gacactcaac cgcataaaca ttcgccaaag caaaagatga aatggtattt
      181 cgtttttaga cacacatgta acaaatggac accatttgga tcgctcactg agcacacaag
      241 gaaaattgcc atgaacaact cacgtatgtc ttccgtttca acacagaaaa caacaggacg
      301 ttctgccttg ggaaccaaaa gcgcgctggc cgcgattatc gccaccacca tgatggtctc
      361 agtcgcttcc gccgccagtc tgcaaaccac caaagcgaca gaagcggcgt caaccggctg
      421 ggcaacgcag agcggcggca ccaccggcgg cgccaaagcg tcatcatcca aaatctacgc
      481 ggtgaaaagc atcagcgaat tcaaagcagc actgaacgga accgattcgt cgcccaagat
      541 catccaggtc accggagcaa ttgatatcag cggcggtaaa gcctacaccg actttgacga
      601 tcagaaagcc cgcagccaga tcagcattcc gtccaacacc accatcatcg gtattggcaa
      661 caaaggcaaa ttcaccaacg gttcgctggt ggtgaaaggc gtcagcaacg ttatcctgcg
      721 taacctgtac atcgaaacgc cggtggacgt ggcgccgcat tacgaagagg gagatggttg
      781 gaacgccgag tgggacgcag tagtgattga cagcacagac cacgtctggg ttgaccatgt
      841 caccatcagc gacggcagcc tcaccgacga caaatacacc accaaaaacg gcgaaaaata
      901 tgtgcagcat gacggctcgc tggatatcaa gcgcggctcc gactacgtta ccgtttctaa
      961 tagccgcttc gagctgcacg acaaaaccat cctgatcggc cacagcgaca acaacggctc
     1021 gcaggacgcc ggcaaactgc gcgtcacctt ccacaacaac ctgtttgacc gggtcggcga
     1081 acgtaccccg cgcgtacgct tcggcagcgt tcacgcttac aacaacgttt acgttggcga
     1141 cgtcaaccac aaagcctatc gctatcagta cagcttcggt atcggcacca gcggtagcct
     1201 gctgtctgag tccaacgcgt ttaccattga taacatgaaa aaaatcagcg gccgcgacaa
     1261 agaatgcagc gtggtcaagg cgtttaacgg taagatcttc tctgataaag gctcgatcat
     1321 caacggcgct tcgtacaatc tgaatggttg cggctttggc ttcagcgcat acagcgccaa
     1381 gatcccgtac aaatactcag cccaaaccat caccaccagc ctggcaaaca gcatcagcag
     1441 caacgcgggc tacggtaaac tgtaatttcc cgcccagcgt aaaggccact tgcaaaagtg
     1501 gcttttattt tcccatattc ccaccactcg tattaccccg atgttctccg aagcgccact
     1561 caatacggtt caggttaatc cggccttgcc acaggcgttc atcaagaaat cgaccggttt
     1621 cccccatcaa atgcagggac tagcaggaat tagtaagaga cacagattaa taaaattcta
     1681 tttaataaaa atcgaatcaa ttgttaactg tttgggttat ttcagataaa aacgcttata
     1741 catatagctg aatttaaa
//
LOCUS       ECOFIMA1       66 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  E.coli type 1 fimbriae (fimA) gene with viral insertion site,
            partial cds.
ACCESSION   M33412
KEYWORDS    type 1 fimbriae.
SOURCE      E.coli (strain HB101) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Hedegaard,L. and Klemm,P.
  TITLE     Type 1 fimbriae of Escherichia coli as carriers of heterologous
            antigenic sequences
  JOURNAL   Gene 85, 115-124 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >    66     type 1 fimbriae (AA at 1)
    recomb        3        4     E.coli DNA end/ viral DNA start
    recomb       63       64     viral DNA end/ E.coli DNA start
BASE COUNT       12 a     19 c     17 g     18 t
ORIGIN      
        1 gttgatcccc ggacctgcat gatcatcgct caaggaacct ctatgtatcc ctcctgttgg
       61 ggggat
//
LOCUS       ECOFIMA2       66 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  E.coli type 1 fimbriae (fimA) gene and viral insertion site,
            partial cds.
ACCESSION   M33413
KEYWORDS    type 1 fimbriae.
SOURCE      E.coli (strain HB101) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Hedegaard,L. and Klemm,P.
  TITLE     Type 1 fimbriae of Escherichia coli as carriers of heterologous
            antigenic sequences
  JOURNAL   Gene 85, 115-124 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >    66     type 1 fimbriae (AA at 1)
    recomb        3        4     E.coli DNA end/ viral DNA start
    recomb       62       63     viral DNA end/ E.coli DNA start
BASE COUNT       14 a     17 c     17 g     18 t
ORIGIN      
        1 gttgatccaa acctgcgtgg agatctccag gttctggctc agaaagttgc tcgtacctta
       61 ccggat
//
LOCUS       ECOSPDE      2792 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  E.coli speED operon speE and speD genes encoding S-adenosyl-
            methionine decarboxylase and spermidine synthase, complete cds.
ACCESSION   J02804 J04247
KEYWORDS    S-adenosylmethionine decarboxylase; speD gene; speE gene;
            spermidine synthase.
SOURCE      E.coli K12 DNA, clones pT7-5-1 and LC37-29.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 643 to 2524)
  AUTHORS   Tabor,C.W. and Tabor,H.
  TITLE     The speED operon of Escherichia coli: Formation and processing of a
            proenzyme form of S-adenosylmethionine decarboxylase
  JOURNAL   J. Biol. Chem. 262, 16037-16040 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 642; 2525 to 2792)
  AUTHORS   Xie,Q.-W., Tabor,C.W. and Tabor,H.
  TITLE     Spermidine biosynthesis in Escherichia coli the promoter and the
            termination regions of the speED operon
  JOURNAL   J. Bacteriol. 171, 4457-4465 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and printed copy of sequence for [1] kindly submitted
            by H.Tabor, 02-DEC-1987 and for [2] 13-MAR-1990.
FEATURES       from  to/span     description
    pept        314      661     unidentified protein
    pept        767     1633     spermidine synthase /nomgen="speE"
    pept       1649     2443     S-adenosylmethionine decarboxylase (ttg start
                                 codon) /nomgen="speD"
    signal     2448     2482     hairpin terminator
BASE COUNT      714 a    738 c    637 g    703 t
ORIGIN      
        1 aactgaatgc gattacgggc atcggtcgtg agcaaatcag ggatcggtaa cgttgggcgt
       61 tctgccgcaa atactgcgcg gctccacagc ggcaaagcga agccacaccc agcgcgacgg
      121 aatattttaa gaaatcacga cgttgcatag ttatttcctt attcttaagc aggcgaaaca
      181 aaatcaaacg ttgagcatag tccttcccct tacgggaagg tcaagccgca gacacaattt
      241 aatcgacaat aatggtcgtc gcctcgctcg cagtgtgcta acgtttatct tctttaagcc
      301 cctggtagaa gcaatgaaga cgtttttcag aacagtgtta ttcggcagcc tgatggccgt
      361 ctgcgcaaac agttacgcgc tcagcgagtc tgaagccgaa gatatggccg atttaacggc
      421 agtttttgtc tttctgaaga acgattgtgg ttaccagaac ttacctaacg ggcaaattcg
      481 tcgcgcactg gtctttttcg ctcagcaaaa ccagtgggac ctcagtaatt acgacacctt
      541 cgacatgaaa gccctcggtg aagacagcta ccgcgatctc agcggcattg gcattcccgt
      601 cgctaaaaaa tgcaaagccc tggcccgcga ttccttaagc ctgcttgcct acgtcaaata
      661 atccctgata cctttttgta gaaatattga ccgtgcatcc gcggtcaatg ttagctatta
      721 tgttgcgccc tttttttacg ggtgttaaca aaggaggtat caacccatgg ccgaaaaaaa
      781 acagtggcat gaaacgctac acgaccagtt tgggcagtac tttgcggtag ataacgttct
      841 gtatcatgaa aagaccgatc accaggatct gatcattttt gagaacgctg catttggtcg
      901 cgtaatggcg ctggatggcg tagtacaaac caccgagcgc gacgagttta tctatcatga
      961 gatgatgacc catgttccgc tactggccca tggtcacgcg aaacatgtgc tgattatcgg
     1021 cggcggcgac ggtgccatgc tgcgtgaagt aacccgacat aaaaacgttg agtcaatcac
     1081 gatggtggaa atcgatgcgg gtgtcgtatc gttctgccgt cagtatctac ccaaccataa
     1141 cgccggtagc tacgacgatc cgcgctttaa gctggtgatc gacgatggcg tcaatttcgt
     1201 taatcaaacc agccagacct ttgatgtcat tatctccgac tgcaccgatc ctatcggtcc
     1261 cggcgaaagc cttttcactt cggcatttta tgaaggctgc aaacgttgcc tgaatcctgg
     1321 cggtatcttc gtcgcacaaa acggcgtctg ctttttacag caggaagaag ccatcgacag
     1381 ccatcgcaaa ctcagccatt acttcagcga cgttggcttt tatcaggcgg cgatcccgac
     1441 ctattacggc ggtatcatga cttttgcatg ggcgacagat aacgacgcct tacgccatct
     1501 ctcaaccgaa attattcagg cgcgttttct cgcctctggc ctgaaatgcc gttattacaa
     1561 tccggcaatc catacggcag cttttgcctt acctcagtat ctgcaagacg cactggcttc
     1621 acagccgtcc taaggagaag ataagaaatt gaaaaaactg aaactgcatg gctttaataa
     1681 tctgaccaaa agtctgagtt tttgtattta cgatatctgc tacgccaaaa ctgccgaaga
     1741 gcgcgacggt tatattgctt atatcgatga actctataat gccaaccgtc tgaccgaaat
     1801 cctgtcagaa acctgttcca ttatcggggc taatattctt aacatcgccc gccaggatta
     1861 cgaaccacag ggtgccagcg tcactattct ggtgagtgaa gaaccggttg acccgaaact
     1921 catcgacaaa acagaacacc ccggcccact gccagaaacg gtcgttgccc atcttgataa
     1981 aagtcatatt tgcgtacata cctacccgga aagtcatcct gaaggcggtt tatgtacctt
     2041 ccgcgccgat attgaagtct ctacctgcgg cgtgatttct ccgctgaagg cgctgaatta
     2101 cctgatccac cagcttgagt ccgatatcgt aaccattgat tatcgcgtgc gcggttttac
     2161 ccgcgacatt aacggtatga agcactttat cgaccatgag attaattcga ttcagaactt
     2221 tatgtctgac gatatgaagg cgctgtatga catggtggat gtgaacgtct atcaggaaaa
     2281 tatcttccat accaagatgt tgcttaaaga gttcgacctt aagcactaca tgttccacac
     2341 caaaccggaa gacttaaccg acagcgagcg ccaggaaatt accgctgcgc tgtggaaaga
     2401 aatgcgcgag atttattacg ggcgcaatat gccagctgtt taacggctct ggcggagctc
     2461 ccaggctccg ccagatttat ttacttctgc tgcacgaaat tgcggtaagc cgccacgacc
     2521 tgcagaaaat cctcaacgcc gcatagcgac aggctttctt cgtcgtagta gttcatcccc
     2581 tcttccattt catcgccagc gaattccagt tgattggcgc gaaccatcac ctcttcaccg
     2641 tccatccaca gggtgtattc atgccctgcc cgttgccagg accgttcgct acctttcagt
     2701 gcgtgcgctg cttgttccac ttcatcaagc aaggccaggt tttctttcac ctcttcatta
     2761 aaccagtgcc cgaccacttc atgccccatg ga
//
LOCUS       HUMBTRO       608 bp ds-DNA             PRI       01-JUN-1990
DEFINITION  Human beta-tropomyosin gene, exon 1.
ACCESSION   J05247
KEYWORDS    beta-tropomyosin.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 608)
  AUTHORS   Libri,D., Mouly,V., Lemonnier,M. and Fiszman,M.Y.
  TITLE     A nonmuscle tropomyosin is encoded by the smooth/skeletal beta-
            tropomyosin gene and its RNA is transcribed from an internal
            promoter
  JOURNAL   J. Biol. Chem. 265, 3471-3473 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        441  /   572     beta-tropomyosin
    IVS         573  >   608     beta-tropomyosin intron A
BASE COUNT      102 a    198 c    223 g     83 t      2 others
ORIGIN      
        1 gtctctgcag cagatgtgtg gggactctgg agagcacccc tggatacact ccatgcagag
       61 ctgctcacag cctctgcggc cccttttcct ctcaggaaaa taaacacccc gaccgccttc
      121 ccctctgcct gctgggctcg ctgcggcatg cagccctcct ggnntctgtg ccctgctttg
      181 gggctgtgcg cccatgctgg ggtcgacagc gtgcctccac ggggcagcca tgagccttat
      241 cgggggacga cgtcgctcag cgcagggagc gcggcgggag cgggtcggtg tgcggcagga
      301 ggaggaggag gaggaggagc aggaagggat ggaggcaggg cgggagcggc cgccgcctga
      361 gcgcttcctg cccgagccag gcggatccca caaagggctc agcggagcgg ccttgctgcg
      421 ctccccgccg gccgcaggcc atggccggca tcagctccat cgatgccgtc aagaagaaga
      481 tccagagcct gcagcaggtg gctgacgagg cggaggaacg cgccgagcac ctgcagcggg
      541 aggccgatgc cgagcggcag gcccgggagc gggtaaggct cctgggggca ccccgcgacc
      601 cccagacc
//
LOCUS       HUMGPIIB1    2238 bp ds-DNA             PRI       01-JUN-1990
DEFINITION  Human platelet glycoprotein IIb (GPIIb) gene, exon 1.
ACCESSION   M33319
KEYWORDS    platelet glycoprotein IIb.
SEGMENT     1 of 3
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2238)
  AUTHORS   Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S.,
            Schwartz,E. and Poncz,M.
  TITLE     Organization of the gene for platelet glycoprotein IIb
  JOURNAL   Biochemistry 29, 1232-1244 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept       1286  +  1473     platelet glycoprotein IIb (GPIIb), exon 1
    pre-msg    1254  >  2238     GPIIb mRNA and introns
    IVS        1474  >  2238     GPIIb intron A
BASE COUNT      596 a    554 c    595 g    493 t
ORIGIN      
        1 gtcaacggat cagaaaatag aaatcaaaag gaaaatgtgg ctatggttac ccctagcgga
       61 cctcttaaat cttcctgaga acctgctttt ttgggaaggc atgagtgcca gtaagacttg
      121 gcactcctcc tcttccgctt accgagagaa aatgactttg cctttctgct caaaactcat
      181 cccttcactt tgtcacccta tgtttgcatc ttccatcctt agtgtgtgtt tccatccatc
      241 cagtctttca gcaatacacg tactacacat tggactcttg ggtagtctct agggctgtag
      301 caaggagcct tgctcccaag ggactcattt acacaatcct gtgaacggac caagagtaaa
      361 cagtgtgctc aatgctgtgc ctacgtgtgt tagcccacgc ggccagcctg aggagtcagg
      421 gaaggctccc ctaggcaaag cccccaacca gaatcaagtc ttaatggtta aagagctcca
      481 tcacccaaaa aggattgagg gcctaccttc aactgaacag ctaatgcata atctcagaaa
      541 ctgtgagtca aaattccctg gaataactcc actttatccc caatctcctt gccacctaga
      601 ccaaggtcca ttcaccaccc tgtccccagc actgactgca ctgctgtggc cacactaaag
      661 cttggctcaa gacggaggag gagtgaggaa gctgctgcac caatatggct ggttgaggcc
      721 gcccaaggtc ctagaaggag gaagtgggta aatgccatat ccaaaaagat acagaagcct
      781 caggttttat cgggggcagc agcttccttc tccttccccg acctgtggcc aagtcacaaa
      841 gcaccacagc tgtacagcca gatgggggaa gggaggagat tagaactgta ggctagagta
      901 gacaagtatg gaccagttca caatcacgct atcccaagca gaaagtgatg gtggcttgga
      961 ctagcacggt ggtagtagag atggggtaaa gattcaagag acatcattga taggcagaac
     1021 caataggaca tggtaataaa ctattctcag gaaaggggag gagtcatggc tttcagccat
     1081 gagcatccac cctctgggtg gcctcaccca cttcctggca attctagcca ccatgagtcc
     1141 aggggctata gccctttgct ctgcccgttg ctcagcaagt tacttggggt tccagtttga
     1201 taagaaaaga cttcctgtgg aggaatctga agggaaggag gaggagctgg cccattcctg
     1261 cctgggaggt tgtggaagaa ggaagatggc cagagctttg tgtccactgc aagccctctg
     1321 gcttctggag tgggtgctgc tgctcttggg accttgtgct gcccctccag cctgggcctt
     1381 gaacctggac ccagtgcagc tcaccttcta tgcaggcccc aatggcagcc agtttggatt
     1441 ttcactggac ttccacaagg acagccatgg gaggtgagcc gtaagggaag ttggggtatt
     1501 gggagagagc aggacccctc cccatcactg cttctggggg cttcgagttt cccatttgcg
     1561 atagcagttg agcaaggtga cttgtggggc ctattcaggt tgatttcttg tcaagaatgt
     1621 tggggtccag gggactggct caggtgaagg tataagggca gggcacatgt gggctgatgg
     1681 gcactgaaaa ctacagcaag aacaaaggga agacaagagt tgatgcttta ttttttcccc
     1741 aagggtcagt tgtatgaacc actccaccct caacaccttg aaatgcagag aggaggccgg
     1801 gcgcggtggc tcatgcctgt aatcccagca ctttgggagg ccgaggcggg cagatcacct
     1861 gaggtcgaga attcgagacc agcctgacca acatggagaa accccgtctc tactaaaaat
     1921 acaaaaaaaa aaaaaaaaaa aagaggccag gcacagtggc tcacacctgc aatcccagca
     1981 ctttgggagg cagaggtggg cagatcatga ggtcaggagt tcaagaccag cctggccaat
     2041 atggtgaaac cctgtctcta ttaaaaatac aaaaattagc tgagcatggt ggcacactcc
     2101 tgtagtccca gctactcggg aggctgaggt aggagaatca cttgaacccg ggaggtggag
     2161 gttgcagtga gctgagacta tgccactgca ctcccagccc tggggttgac agagtgacac
     2221 tccgtctcaa aaaaaaaa
//
LOCUS       HUMGPIIB2   13204 bp ds-DNA             PRI       01-JUN-1990
DEFINITION  Human platelet Glycoprotein IIb (GPIIb) gene, exons 2-29.
ACCESSION   M33320
KEYWORDS    platelet Glycoprotein IIb.
SEGMENT     2 of 3
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 13204)
  AUTHORS   Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S.,
            Schwartz,E. and Poncz,M.
  TITLE     Organization of the gene for platelet glycoprotein IIb
  JOURNAL   Biochemistry 29, 1232-1244 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +   498      619     platelet Glycoprotein IIb (GPIIb), exon 2
                709      806     platelet Glycoprotein IIb, exon 3
                912     1077     platelet Glycoprotein IIb, exon 4
               1293     1342     platelet Glycoprotein IIb, exon 5
               1419     1464     platelet Glycoprotein IIb, exon 6
               1552     1680     platelet Glycoprotein IIb, exon 7
               2042     2089     platelet Glycoprotein IIb, exon 8
               2245     2288     platelet Glycoprotein IIb, exon 9
               2461     2514     platelet Glycoprotein IIb, exon 10
               2653     2705     platelet Glycoprotein IIb, exon 11
               2897     3108     platelet Glycoprotein IIb, exon 12
               5536     5718     platelet Glycoprotein IIb, exon 13
               5952     5997     platelet Glycoprotein IIb, exon 14
               6106     6210     platelet Glycoprotein IIb, exon 15
               6295     6350     platelet Glycoprotein IIb, exon 16
               6443     6594     platelet Glycoprotein IIb, exon 17
               6783     6908     platelet Glycoprotein IIb, exon 18
               7886     7953     platelet Glycoprotein IIb, exon 19
               8087     8234     platelet Glycoprotein IIb, exon 20
               8803     8895     platelet Glycoprotein IIb, exon 21
               9506     9585     platelet Glycoprotein IIb, exon 22
              10202    10282     platelet Glycoprotein IIb, exon 23
              10406    10505     platelet Glycoprotein IIb, exon 24
              10605    10757     platelet Glycoprotein IIb, exon 25
              10874    10999     platelet Glycoprotein IIb, exon 26
              11478    11591     platelet Glycoprotein IIb, exon 27
              11828    11929     platelet Glycoprotein IIb, exon 28
              12117  + 12233     platelet Glycoprotein IIb, exon 29
    pre-msg <     1  > 13204     GPIIb mRNA and introns
    IVS     <     1      497     GPIIb intron A
    IVS         620      708     GPIIb intron B
    IVS         807      911     GPIIb intron C
    IVS        1078     1292     GPIIb intron D
    IVS        1343     1418     GPIIb intron E (no splice consensus)
    IVS        1465     1551     GPIIb intron F
    IVS        1681     2041     GPIIb intron G
    IVS        2090     2244     GPIIb intron H (no splice consensus)
    IVS        2289     2460     GPIIb intron I
    IVS        2515     2652     GPIIb intron J
    IVS        2706     2896     GPIIb intron K
    IVS        3109     5535     GPIIb intron L
    IVS        5719     5951     GPIIb intron M
    IVS        5998     6105     GPIIb intron N
    IVS        6211     6294     GPIIb intron O
    IVS        6351     6442     GPIIb intron P
    IVS        6595     6782     GPIIb intron Q
    IVS        6909     7885     GPIIb intron R
    IVS        7954     8086     GPIIb intron S
    IVS        8235     8802     GPIIb intron T
    IVS        8896     9505     GPIIb intron U
    IVS        9586    10201     GPIIb intron V
    IVS       10283    10405     GPIIb intron W
    IVS       10506    10604     GPIIb intron X
    IVS       10758    10873     GPIIb intron Y
    IVS       11000    11477     GPIIb intron Z
    IVS       11592    11827     GPIIb intron AA
    IVS       11930    12116     GPIIb intron BB
    IVS       12235  > 13204     GPIIb intron CC
BASE COUNT     3046 a   3579 c   3857 g   2722 t
ORIGIN      About 2000 bp after segment 1.
        1 ctgcaggtca acggatctgc tagggtcctc ctatcagcac acacactcca gccccacttt
       61 agaggtaccc gctaccttcc ctcattaaaa ccagctctca agaggggatc tggtaacagt
      121 ctaggcaggc attccaggga gcatgtgaac cgctggttct tgttgcgggt ggaggatgga
      181 ggtgttgtac agagtttagg tctttttcag caaagatctc caaaccccgg gtgttcaaaa
      241 tcaaaccaaa ggggattata gtcccagctc tactcacaac tcactggtta ctttagccac
      301 gagattgccc tcgctgagag tcggtttcac tgtccataag atgaagaagt acatcacggt
      361 ggtctgtgag gtgtcattga ggaaagatgg tccagtgccc ccatgccaca tggccttcgg
      421 gcagtgctcc cagcgccggc gccagggcct gggatacgct ggaatctgcg cggcgctcac
      481 ccagctttcc tatgcagagt ggccatcgtg gtgggcgccc cgcggaccct gggccccagc
      541 caggaggaga cgggcggcgt gttcctgtgc ccctggaggg ccgagggcgg ccagtgcccc
      601 tcgctgctct ttgacctccg tgagtcccag gcaaggagag caaggttggg gtcagaggga
      661 cgtggactgc ccgggcttca gcgccccacc ccttcttgtg ccttccaggt gatgagaccc
      721 gaaatgtagg ctcccaaact ttacaaacct tcaaggcccg ccaaggactg ggggcgtcgg
      781 tcgtcagctg gagcgacgtc attgtggtgg gccccgcggt acagggcaca gggaacaatc
      841 gggggcaggg acactggggc caggaggagc ccaagtctcg cgccccgtcc ccatctgtgg
      901 ccctttctca ggcctgcgcc ccctggcagc actggaacgt cctagaaaag actgaggagg
      961 ctgagaagac gcccgtaggt agctgctttt tggctcagcc agagagcggc cgccgcgccg
     1021 agtactcccc ctgtcgcggg aacaccctga gccgcattta cgtggaaaat gattttagta
     1081 agcgccagct acgacctggc cccgcccact cgcgacggct tggccccgcc ccccatcgga
     1141 tcccgccccc agcgccgcag cccttgcttt ggatctggcc tcgccccagg gccccgccga
     1201 ctcaaggccc cgcccctgtc ccccagccct cctccgggct cgcgcgcgcc tcccttcacc
     1261 cctgggctga cccctcctcc ttgtctcctc aggctgggac aagcgttact gtgaagcggg
     1321 cttcagctcc gtggtcactc aggcgagtag ggagcaaaag cgcagtgggg gcggctccca
     1381 aacagggccc cctctcaccc tcaggacttc ccttccaggc cggagagctg gtgcttgggg
     1441 ctcctggcgg ctattatttc ttaggtacgt gcccatccgt acacctccct cccttctcgc
     1501 ggccgaagga gaccgctttg ggcttcacac ccgctgtccc tcccgcccta ggtctcctgg
     1561 cccaggctcc agttgcggat attttctcga gttaccgccc aggcatcctt ttgtggcacg
     1621 tgtcctccca gagcctctcc tttgactcca gcaacccaga gtacttcgac ggctactggg
     1681 gtaacaccgc cattccagac ttccagcacc ccgagggtca ccgcccaccg cagacggtca
     1741 ggtcctgccc ctgtgggagc ctccatggcc acccctgccg gccaacccac cgcctaagcc
     1801 gctcccgccc tccgctcctg cgcttccccg cagaccgccc acctcccatg cgcccaccgc
     1861 tcccttccac tgcggactcg tagcgcagcc tggggcaggg cttggcccct cgaaggcctc
     1921 cgtttttcca tctgcacaat gcagggctgg ggctgagtgg ccttaatctc ctccttcttt
     1981 gccctccgtc ccctctgtgc ttcctcccct ggaaaagact aatttgcgcc cttgtcctca
     2041 gggtactcgg tggccgtggg cgagttcgac ggggatctca acactacagg caagaaatcc
     2101 acttagggcg ggagttgggt agcccagccc ggggaggagc gccttcctga aatctcccct
     2161 atgtagctgg gtgcagaacg gggagcggga agtgggtagg ttctaaggct ctcattccct
     2221 gagcctggct ctccctatcg ccagaatatg tcgtcgtgcc ccccacttgg agctggaccc
     2281 tgggagcggt aagtgccccc accactgggc ctcccgaagc cccttatccc agttctcagg
     2341 ctgacaactc ctgagcgccc cccacccccg ccccgcctcc accaaaccac cctttctcac
     2401 ctggagtggg aggttgcttt gggtacaaga atgatgctct cgcctgcgct gtccgtgcag
     2461 gtggaaattt tggattccta ctaccagagg ctgcatcggc tgcgcggaga gcaggtgggg
     2521 gccaggtccc agtgggcgtg gctgggtgga gggggaactg agacttcaga atatttcatg
     2581 ggaggtgagg gcccatttct taaagaggat gcttgtccag cggcgtgaat gatggtgctc
     2641 ctcatcttgc agatggcgtc gtattttggg cattcagtgg ctgtcactga cgtcaacggg
     2701 gatgggtgag gagggacatg cccccacccc tacccagttg ggtcccaaat taccagagct
     2761 gcccctctgt ctccctttcc tagccctagt ctcacgtatc cactggagga acaggagagc
     2821 aagggtcgag gagatttggc cctagcccca atatacccct ggtccagtcc catgtaacca
     2881 ctcatctggc ccacaggagg catgatctgc tggtgggcgc tccactgtat atggagagcc
     2941 gggcagaccg aaaactggcc gaagtggggc gtgtgtattt gttcctgcag ccgcgaggcc
     3001 cccacgcgct gggtgccccc agcctcctgc tgactggcac acagctctat gggcgattcg
     3061 gctctgccat cgcacccctg ggcgacctcg accgggatgg ctacaatggt gagggaagag
     3121 aggagcccta cttgctgcag aggggttaac agccactcaa aaagcatgga gttggcctga
     3181 gggcagccag aaccaggatg ggttttaagc atataagtat gtggcttaga cacatggggt
     3241 gctgagtgga gagcagatgg gagagttgaa gactaattag gaagtgtttg ccttaatcca
     3301 agcaagagac aatgaccacc tggatgtgga ttttggcagt ggagttagag atgggagtga
     3361 cttcacagat atttaggact cggattatta ggacttggtg ggagactgga tgtggggcca
     3421 ggggagaggt tggagttggg tgcctgtgat ggcctccact gcctggaact caggccgtgc
     3481 agcaggtgct ggggagaggc gggagatcag cagttcagct ctggacctgt tgagcttgaa
     3541 gggcttgggt gctttaggcg gaaatatcca aagaacagtt gggagtggct ctccccgctt
     3601 ccacaagaga gatctgaatg ggagacaggg gtttggggaa agtggatgag gtcccgggac
     3661 ctgtgaaata agaggcccag gatagagccc tagggagcaa aagcatttag gtgactccta
     3721 caggaggtaa gtctgagaag gagacagagg agtgtccaga gagggaggag ggaacccagg
     3781 gggtctgatg gcccgggact caaggaagag catgcgttaa agagcatgca caggaggaag
     3841 tgggcgctgc agctcctgct gctgctgcaa gatacaatta ggtggggctg gagaaatatt
     3901 catgggcttt agcaagaaga gggtgccagg catggtggct catacctgta atcccagcta
     3961 cttgggaaat tgaagcagga gaatctcttg aacccgggaa gtggaggttg cactgagctg
     4021 agcttgcgcc actactgcac tccagcctgg gtgacagagc aagactccat ctcaacaaaa
     4081 taaaaaaaaa aatagagaaa gaaaggaaga aagaaaaaag aaggggaggt tattggtgac
     4141 agtgacataa attgattcag gccaagatag ggtcagaagc cagaatgcaa tggggtaagg
     4201 tatgaatgga gatgaaaaat tggatgcagc taatgtagac agctctttca acaggtttgt
     4261 ggtaaaaagg aatttgagga atagaaagga aaaaaaaaaa catgtttgac tataagagga
     4321 aaaagagaaa aggtgatcac agaaaagaga tgagggtcaa gggaagatta tttcaatgtg
     4381 gaagaacatg tagtaggttg aaaatgatgt tgtggggaaa tggggggatg agccagcaga
     4441 gagtccctgt gatgcctcag ggggtgggag ggtgactggc ccagtgtcag ggtgaaggaa
     4501 ggaaacctct tccagggtca aatggggaaa gggaaaaaga aagttggtgt gggattatag
     4561 cataacagtg ggctgcctct cttcctgaag taagagatta cgtcacctgc tgaaggaagt
     4621 gtggggggtc tgggagtttg atggaatgga gaaggctaga aatagatgct agatggccag
     4681 gcacggtggc tcacacctgg aatcccagca ctttgggagg ccgaggcagg aggatcactg
     4741 gagcctagga gtttgacacc agcctggcca acatagggag atctcgtctc cataaaaatt
     4801 tttaaaaatt agctgggcat ggtggctata gtctcaactg cttgggaagc tgaggtggga
     4861 ggattgcttt agtccagaag gttgaggctg cagtaagcca tggttgcacc actgcacttc
     4921 agcctgaatg acaagtgcaa gactgtctta aaataaaaaa tttaaagggc ttgggcacgg
     4981 tggctcacac ctgtaatcca gcactttggg agcccaaggt gggcagatca cttgaggtca
     5041 ggagttcgag atcagcctgg ccaatgtggt gaaaccccgt ctctactgaa aatacaaaaa
     5101 ttagccgggc atggtggtag gcgcctgtaa tcccagctac tgaagaggct gaggcacaag
     5161 aatcacttta acgggggagg cagaggttgc agtgagccga gatcgcacca ctgcactcca
     5221 gccaggacaa cagagcgaga ctccatctca aaaaaaaaaa aatttagaaa agggaataat
     5281 gatgcttaat tttcaggata tattttcctc aatagacagt gagagttgtc actgttttta
     5341 taacaatcct acttggcagg tccctctccc acctgattgt taactcctgg agggtagggc
     5401 agtgcctcct tcacccacac tttgcacccc tttcctagtc tcctgggatg ttcccagaga
     5461 agctcaggaa agttttacag tcatctaggg aggctgaata acaatcagcc acttcctttc
     5521 tgttactcct tccagacatt gcagtggctg ccccctacgg gggtcccagt ggccggggcc
     5581 aagtgctggt gttcctgggt cagagtgagg ggctgaggtc acgtccctcc caggtcctgg
     5641 acagcccctt ccccacaggc tctgcctttg gcttctccct tcgaggtgcc gtagacatcg
     5701 atgacaacgg atacccaggt gccctggact gcctccagct agaaatgccc aagaaaggcc
     5761 cttggacatt cgctggaagt gccaagagac acggccaggg ctcatgcctg gcctggtgtc
     5821 ccactatgga ctgccagagg ggctgggtga aacctccagt gggggaggtg gtgtggggaa
     5881 cccctgggaa gatgagatga ggatccccat accctaatcg ccaattctga cccattcctc
     5941 gatgtctata gacctgatcg tgggagctta cggggccaac caggtggctg tgtacaggtg
     6001 agcactggct ccaggggcgg gatggggaag gtcctgtgcc atcaagagga ggccaggcca
     6061 ggaggagcca caatggcaag cctccccatc accctatccc atcagagctc agccagtggt
     6121 gaaggcctct gtccagctac tggtgcaaga ttcactgaat cctgctgtga agagctgtgt
     6181 cctacctcag accaagacac ccgtgagctg gtgaggaggc agagggcatg ggccttaaag
     6241 gatctgggac ctcagaaagg ctccaacccc tgagccccac ttacgtcttt gcagcttcaa
     6301 catccagatg tgtgttggag ccactgggca caacattcct cagaagctat gtgagtggca
     6361 tgaagggggc aggagggagg tgggcttgga ctcccccgga ggctggccag ggaggtcctg
     6421 actcttctgc ttgccctgcc agccctaaat gccgagctgc agctggaccg gcagaagccc
     6481 cgccagggcc ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctgaacctg
     6541 gatctgggcg gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagtacgc
     6601 ccaggcaggg gattggcagg gctgggagag tagaacttac ccactggact tgttcatcta
     6661 gccctggggc actgagctgg gtgctgtgag tccgggggtg gtcaggacac aggtgcctac
     6721 tggccaggag aaggtgggat gtgtatggta gcaagatggc ctgactcttg cccctgtcct
     6781 aggatgaggc agacttccgg gacaagctga gccccattgt gctcagcctc aatgtgtccc
     6841 taccgcccac ggaggctgga atggcccctg ctgtcgtgct gcatggagac acccatgtgc
     6901 aggagcaggt agggacaggc agggacaggc cagggaggtg caggacccct gatagcaaat
     6961 caggattagg gttagtgcca agtcacaatg taaccccaaa accttgatgt cattccaaac
     7021 cctaatgaaa acctcaaaat ccagccagtc atggtggctc acacctgtaa tcccagcact
     7081 ttgggagacc gaggcaggca gattgcctga ggtcaggagt tagagaccaa cctggccaac
     7141 atggtgaaaa cccatctcta ctaaaaatac aaaaaaaatt agccgggtgt ggtgacgcat
     7201 gcctgtaatt ccagctactc gggaggctga agcaggagaa tcacttgaac ccaggaggca
     7261 gaggttgcag tgagccaaga gtgtgccaca gcactccagc ctgggtgaca gagcaagact
     7321 ctgtctcaaa aaaaaaaaaa aaagccaggc gcagtggcct cacgcctgta atcccagcac
     7381 tttgggaggc caaggcgggt ggatcacgag gtcaggagat caagaccatc ctggctaaca
     7441 cagtgaaacc ccgtctacta aaaatacaaa aaaaaaaaaa aaattagctg ggcgtggtgg
     7501 cgggtacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaaccccgg
     7561 gggcggacgt tgcagtgagc cgagatagtg ccactgcact ccagcctgga cgacagagcg
     7621 agactccgtc tccaaaaata aaaaaacacc tgaaaatccc agtatcccct aagctctgat
     7681 gtaaattgac aaaccctgac attgtcccaa acctccaaat ataacccgag ccccgatacc
     7741 atctacaaac tccttttcgt cctcagatct tcttactccc taagccccta tgtgaacccc
     7801 aagcccactg ttttcctaac cctgatgtaa tccctaaacc tcacacatcc ccaacttacc
     7861 cgcacacccc aatgtgcccc tctagacacg aatcgtcctg gactgtgggg aagatgacgt
     7921 atgtgtgccc cagcttcagc tcactgccag cgtgtgagga ggcctcccat tctgcccgac
     7981 cctggccctt tctgcctatc atacctgctc cacaccttag tcccctcttt tcccacatcc
     8041 tgggcccaga cccaggctcc ctggcttcac tcctctttcc ccacaggacg ggctccccgc
     8101 tcctagttgg ggcagataat gtcctggagc tgcagatgga cgcagccaac gagggcgagg
     8161 gggcctatga agcagagctg gccgtgcacc tgccccaggg cgcccactac atgcgggccc
     8221 taagcaatgt cgaggtatgg cccccaccct gggaacagta cccgggacct gggaggcact
     8281 ggagccttgg ctctctcatc tccctccctg agagtccctc ttctcttctg ctttgctgtc
     8341 aaagatgtaa tttttttttt aatttggagg aggatacttg ctaatggtca gtcagaattc
     8401 caaaactcta ttacaaaaac cagaaaaaca aaaaaggttt aggaaccaaa tgttaacagg
     8461 aacctctgtt aacatttggt ggatttcctt ccagtctttt tttcaatatt gactcacact
     8521 cacataagta tatatttatt ttttatgttg ttaatatagt ttataataat gggggtcata
     8581 ctctaatgtt ttgtgttttt tatttccaaa atgaaaatgc ctaaaaagta gtagtgctac
     8641 agcaatacac acactagcat gtgacagtcc cttgagcgac cccaccccaa gaaacccccc
     8701 cctccctacc ttggcacaca aatctttcca gaccttccaa gggagcttaa atatatatat
     8761 atgatgctct gtaatttctt tcttggaact gccttcctga agggctttga gagactcatc
     8821 tgtaatcaga agaaggagaa tgagaccagg gtggtgctgt gtgagctggg caaccccatg
     8881 aagaagaacg cccaggtgag gctgctgggt cgtggtaccg ggtctccacc aggggctcat
     8941 gaataaccag attttagggg tgaggtttta gagccacata gttctgggcc agaatcttgg
     9001 tcctcacact ccctttgcca acattgtcct tgggtgagtg actttccctc tctgagcccc
     9061 tttaccagtg ggcttccagg taaaatagaa ataataatgg tggcctggtg cggtcgtcac
     9121 gcctgtaatc ccagcactct gggaggccag agcgggtgga tcacgaggtc aggagttcaa
     9181 gaccagcctg gccaacatag caaaaccccg tctctactaa aaatacaaaa attacccggg
     9241 catggtggcg cacgcctata gtcagagcta ctcgggaggt tgaggcagaa aaatcacttg
     9301 aacctgggag gtggaggttg cagtgagccg agatcatgcc actgcactcc agcctgggtg
     9361 acagagtgag actccgtctc ggaaaaaaaa aaaaagaaaa agaatagtgg tgatcttgga
     9421 gggtgaagac tggaggccac attcagggca gggctgtcct aagtggggca cttgggcagt
     9481 gaccttggcc ctcctcatct cccagatagg aatcgcgatg ttggtgagcg tggggaatct
     9541 ggaagaggct ggggagtctg tgtccttcca gctgcagata cggaggtact gacctggcga
     9601 gcgtgcctac ccaccaccct tcccccgtct gacccccgtg cagagcccct caggtccctt
     9661 ccatacagaa gggtctttcg aggccaggcg cagtggctca cacctgtaat cccagcacgt
     9721 tgcgaggcca aggcagaagg atcactggag gtcaggagtt ggagaccagc ctggccaaca
     9781 tggtgaaacc ccatctctac taaaatataa aattagctgg gcatggtggt gcgcacctac
     9841 aatcccagct actcgggagg ctgaggcagg agaatagctt gaaccgaacc tgggaggtgg
     9901 aggttgcagt gagctgagat tgggccactg cactccagcc ttccagcctg ggcgacagtg
     9961 cgagattcta tctcaaaaga aaaaaaaaaa aaggtcttga agaagcctgg ttccctttct
    10021 tcctcagaga tttagcgagt cttggagccc tagaggaagt tctttcccag gtctaacttc
    10081 agtgtggcat gctctttgta taattagctc tctctgaact ctctaaaatt ctggcctcac
    10141 ccccagaaag tcactgggct ggtgtccctg gccctgtttc tcctcatccc ctcccctcta
    10201 gcaagaacag ccagaatcca aacagcaaga ttgtgctgct ggacgtgccg gtccgggcag
    10261 aggcccaagt ggagctgcga gggtgagagg ccaggggtgg agaagggaga tggcattcag
    10321 ggctctaaac tccagggggc gctggggaaa cctcacaggc caatcagggc atcacactct
    10381 ctctgggggt cttgggcacc tgcaggaact cctttccagc ctccctggtg gtggcagcag
    10441 aagaaggtga gagggagcag aacagcttgg acagctgggg acccaaagtg gagcacacct
    10501 atgaggtatt ggggagcctc gcgtccctgg ctggggtgag cgggtcctca gaactccggg
    10561 tgaggcgcta agctccccac accctgccac caccacccct tcagctccac aacaatggcc
    10621 ctgggactgt gaatggtctt cacctcagca tccaccttcc gggacagtcc cagccctccg
    10681 acctgctcta catcctggat atacagcccc aggggggcct tcagtgcttc ccacagcctc
    10741 ctgtcaaccc tctcaaggta agagctgggt ggaagaaaga cctgggaagg cggccccaga
    10801 ccaaccaccg gggcacctct gtgggctggg gttcggggga gacctgggcc tgaccactcc
    10861 tttgcccccc caggtggact gggggctgcc catccccagc ccctccccca ttcacccggc
    10921 ccatcacaag cgggatcgca gacagatctt cctgccagag cccgagcagc cctcgaggct
    10981 tcaggatcca gttctcgtag tgagcaggct ctctggtctc gggcccggcc tccccgggac
    11041 ccacggggca gaggggatgg gaggagggag aggggtccgg gtgtgctgtg ggcctctgtg
    11101 ggccacgctt ggtccctggg agcacttcaa gtgaacatgg aggagcatgc tggcttgtgt
    11161 ctggggtgag ctgaaagaca cttgcacttt ttaaaagctt cccagtacgt taaggagcat
    11221 aaaacaatgc caaagcaagg ttatcataga tctgagcatt gtgcgctggg ggatgaccct
    11281 ccctgcatct ctgggactat gtgagcaagc ccgtggaaag acagcatccg aagcttggat
    11341 ccaaggccct tcctgatggg aaggccaccg cttcctgaac ccccggcccc ttctgcgttg
    11401 ggtcctgggg gtaagggggt gggggatgat ggggtgatgg gccgggacgg ctggggactg
    11461 acgatgcttc ccctcagagc tgcgactcgg cgccctgtac tgtggtgcag tgtgacctgc
    11521 aggagatggc gcgcgggcag cgggccatgg tcacggtgct ggccttcctg tggctgccca
    11581 gcctctacca ggtggggtgg gccgtggtgg ggcggggccg ggccttctgg gccgggacca
    11641 ctttgctctg ggaggggcgg ggtttggtgt gggagggcag gaagagaggg aaggcaaggt
    11701 ttactttggg ggattgcagt gggattaggt cagaggcagg gcttccccgc cgggtgtggg
    11761 acctggactc cgtgcaacca ataggcctct tgtgggtgta aacggctttc aaccccaacc
    11821 tgtccagagg cctctggatc agtttgtgct gcagtcgcac gcatggttca acgtgtcctc
    11881 cctcccctat gcggtggccc cgctcagcct gccccgaggg gaagctcagg tgagtgtggg
    11941 gggatggagc agagaccagt cctgcaggac ccattgtccc ccagtcagtg cccagccaga
    12001 aaagtctgag gggtggtacg ggtgggtggc atggctggag gtcaccagcc tgaggtttga
    12061 gtctttgtga aaggcaggtg tcaaggtgac tgaggagaca cgtgggtttg ccccaggtgt
    12121 ggacacagct gctccgggcc ttggaggaga gggccattcc aatctggtgg gtgctggtgg
    12181 gtgtgctggg tggcctgctg ctgctcacca tcctggtcct ggccatgtgg aaggtgaggt
    12241 gtgaaggacg gtggagtccc cagcggggca caggcttggc tctgccctgc ctcacaggga
    12301 gtcaaggaga gatggtggcc cacccaagtg ggtaatccag ggaccagggg tctatgtctc
    12361 cactattaga atgtcattct cgtccagggg ggtggctcac acctgtaatc ccagcacttt
    12421 ggcaggcaaa gcgtttagat cacctgaggt caagagttcg agaccagcct ggccaacatg
    12481 gtgaaacccc atctctacta aaaataccaa attagccggg cgtgttgaca catgcctgta
    12541 atctcagcta ctcgggaggc tgaggcagta gaattgcatg aacccaggag gcggaggttg
    12601 cagtgagccg agatcacacc actgcactcc agcttgggca acagagcgag cctccatctc
    12661 aaaaaaaaaa caaaaaaata gaatgtcttt ctctagtaga gcaaaaggca aaacaaacac
    12721 aaaaatgtca ttctcctggg aacccttcca gacacatacc actggaaagg atagcacctg
    12781 aaattctgag gcctttagac acccctgcca ccaaaaagat tcagaggata tagagggtat
    12841 agagggtgta agtcctgcct tcaggaattc ctggctggtc tcaaggacaa gatgcacttc
    12901 ttcctagccc tgcccttccc cttgagtgag gaagaggcca aggattggtc tagaccctat
    12961 tccatacctt cctatgtggc cctggagggt cactcgctcc tctgcacctg gaggagtctc
    13021 aagcacactg aagggaagac atggtgcttt tagggaaaac cacgcactag acccacaata
    13081 atcaaataca tatcatcata tgctcgagtc atgcagacac aaacttcagt ataagaaaaa
    13141 ttccaggctg ggcgttggtg gctcacaccg gtaaaatccc agcactttgg gaggccgagg
    13201 tggg
//
LOCUS       HUMGPIIB3     364 bp ds-DNA             PRI       01-JUN-1990
DEFINITION  Human platelet Glycoprotein IIb (GPIIb) gene, exon 30.
ACCESSION   M34344
KEYWORDS    platelet Glycoprotein IIb.
SEGMENT     3 of 3
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 364)
  AUTHORS   Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S.,
            Schwartz,E. and Poncz,M.
  TITLE     Organization of the gene for platelet glycoprotein IIb
  JOURNAL   Biochemistry 29, 1232-1244 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +   110      169     platelet Glycoprotein IIb (GPIIb), exon 30
    pre-msg <     1  >   364     GPIIb mRNA and introns
    IVS     <     1      109     GPIIb intron CC
BASE COUNT       68 a    111 c     98 g     87 t
ORIGIN      About 600 bp after segment 2.
        1 ttctgcgctg gtccagggag gtgctcatat gctagcatac ttcctcacat gtgctctggg
       61 gccagcaaat catctgtata ccctgacctt ggcccccgtg tacccccagg tcggcttctt
      121 caagcggaac cggccacccc tggaagaaga tgatgaagag ggggagtgat ggtgcagcct
      181 acactattct agcaggaggg ttgggcgtgc tacctgcacc gccccttctc caacaagttg
      241 cctccaagct ttgggttgga gctgttccat tgggtcctct tggtgtcgtt tccctcccaa
      301 cagagctggg ctaccccccc tcctgctgcc taataaagag actgagccct gatgctgagc
      361 atgc
//
LOCUS       HUMIMPH      2858 bp ss-mRNA            PRI       01-JUN-1990
DEFINITION  Human IMP dehydrogenase type 1 mRNA complete cds.
ACCESSION   J05272
KEYWORDS    IMP dehydrogenase.
SOURCE      Human spleen, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2858)
  AUTHORS   Natsumeda,Y., Ohno,S., Kawasaki,H., Konno,Y., Weber,G. and
            Suzuki,K.
  TITLE     Two distinct cDNAs for human IMP dehydrogenase
  JOURNAL   J. Biol. Chem. 265, 5292-5295 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            Y.Natsumeda, 15-FEB-1990.
FEATURES       from  to/span     description
    pept        601     2145     IMP dehydrogenase type 1 (EC 1.1.1.205)
    signal     2843     2848     poly-A signal
BASE COUNT      545 a    923 c    860 g    530 t
ORIGIN      
        1 tcggaagggg ccaggagaca ctggaaggtc cggacggcag ggaaggggac ggggttcttt
       61 ccagtcccac ccgtgtaggg acacctctcc ccctcatccc ccgatgtacc ctcgctgaat
      121 ctgggatggg agagacgaac cgagtctagg catctgcgta gcagcgccgg ggagagcggg
      181 gagcccaggc ggagcccagt cgactcccgg attcccctgc cccgcccccg gcacgaggcc
      241 ccgccccggc gccccgcccc tcctcgggac tcgaccgggc tgcgctcact gcccagccgg
      301 ggccccggga gcctccaggc tcgcccgccc tgagctgcgg cctccgcatg gagggccact
      361 cactccacca ccgctgcagg gaggcggacg gcgctgttcc ggagcccgga gcccggcaac
      421 acccgggaca cgagacggcg gcgcagggct acagcgcccg actgctgcag gccggctacg
      481 agcccgagag ccctagattg gacctcgcta cacacccgac gacaccccgt tcagaactat
      541 cttcagtggt cttactggca ggtgttggtg tccagatgga tcgccttcgc agggctagcc
      601 atggcggact acctgatcag cggcggcacc ggctacgtgc ccgaggatgg gctcaccgcg
      661 cagcagctct tcgccagcgc cgacgacctc acctacaacg acttcctgat tctcccagga
      721 ttcatagact tcatagctga tgaggtggac ctgacctcag ccctgacccg gaagatcacg
      781 ctgaagacgc cactcatctc ctcccccatg gacactgtga cagaggctga catggccatt
      841 gccatggctc tgatgggagg tattgggttc attcaccaca actgcacccc agagttccag
      901 gccaatgaag tacgcaaggt caagaacttt gaacagggct tcatcacgga ccctgtggtg
      961 ctgagcccct cgcacactgt gggcgatgtg ctggaggcca agatgcggca tggcttctct
     1021 ggcatcccca tcactgagac gggcaccatg ggcagcaagc tggtgggcat cgtcacctcc
     1081 cgagacatcg actttcttgc tgagaaggac cacaccaccc tcctcagtga ggtgatgacg
     1141 ccaaggattg aactggtggt ggctccagca ggtgtgacgt tgaaagaggc aaatgagatc
     1201 ctgcagcgta gcaagaaagg gaagctgcct atcgtcaatg attgcgatga gctggtggcc
     1261 atcatcgccc gcaccgacct gaagaagaat cgagactacc ctctggcctc caaggattcc
     1321 cagaagcagc tgctctgtgg ggcagctgtg ggcacccgtg aggatgacaa ataccgtctg
     1381 gacctgctga cccaggcggg ggtcgacgtc atagtcttcc actcgtccca agggaattcg
     1441 gtgtatcaga tcgccatggt gcattacatc aaacagaagt acccccacct ccaggtgatt
     1501 ggggggaacg tggtgacagc agcccaggcc aagaacctga ttgatgctgg tgtggacggg
     1561 ctgcgcgtgg gcatgggctg cggctccatc tgcatcaccc aggaagtgat ggcctgtggt
     1621 cggccccagg gcactgctgt gtacaaggtg gctgagtatg cccggcgctt tggtgtgccc
     1681 atcatagccg atggcggcat ccagaccgtg ggacacgtgg tcaaggccct ggcccttgga
     1741 gcctccacag tgatgatggg ctccctgctg gccgccacta cggaggcccc tggcgagtac
     1801 ttcttctcag acggggtgcg gctcaagaag taccggggca tgggctcact ggatcccatg
     1861 gagaagagca gcagcagcca gaaacgatac ttcagcgagg gggataaagt gaagatcgca
     1921 cagggtgtct cgggctccat ccaggacaaa ggatccattc agaagttcgt gccctacctc
     1981 atagcaggca tccaacacgg ctgccaggat atcggggccc gcagcctgtc tgtccttcgg
     2041 tccatgatgt actcaggaga gctcaagttt gagaagcgga ccatgtcgcc ccagattgag
     2101 ggtggtgtcc atggcctgca ctcttacgaa aagcggctgt actgaggaca gcggtggagg
     2161 ccgaggtggt ggaggggatg caccccagtg tccacttttg ggcacaggct ccctccataa
     2221 ctgagtggtc cacagatttg cactacgggt tctccagctc ctttccaggc agagaggagg
     2281 ggaggtcctg aggggactgc tgcccctcac tcggcatccc ctgcagagtc aggactgctc
     2341 ccgggggcca ggctgccctg ggaggccccc tccgagacca gccagccagg ctctcaggac
     2401 ctgcgctgcc ttaggatctt tcttgctgca gcctgctcca gcctggcccc caccccaggg
     2461 gcaggcggcc cctcctggct tctcctgtag ggcacctccc tgcccctagc ctcccagcaa
     2521 atggtgctct cctggccctg ctctggccct tcccgggccg tgcccctcag ccatgtggca
     2581 cttctgagct cctgacctag gccaagggga ggtctctgcc cccttccccg gccctgggct
     2641 acccttgggt cctgctcctc aggccgctcc cctgtccctg gccatgggta ggagactgcc
     2701 ctggtcatgg ccgcctgcct gtcattcctg actcaccacc gtccccaggt gaaccattcc
     2761 tcccttctcc tcagctgcag tcgaaggctt taactttgca cacttgggat cacagttgcg
     2821 tcattgtgta ttaaatactt ggaataaatc aagcaggt
//
LOCUS       MBIFTRX      1260 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  M.thermoautotrophicum tetrahydromethanopterin formyltransferase
            (FTR) gene, complete cds.
ACCESSION   J05173
KEYWORDS    tetrahydromethanopterin formyltransferase.
SOURCE      M.thermoautotrophicum.
  ORGANISM  Methanobacterium thermoautotrophicum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanobacteriaceae.
REFERENCE   1  (bases 1 to 1260)
  AUTHORS   DiMarco,A.A., Sment,K.A., Konisky,J. and Wolfe,R.S.
  TITLE     The formylmethanofuran: Tetrahydromethanopterin formyltransferase
            from Methanobacterium thermoautotrophicum delta-H: Nucleotide
            sequence and functional expression of the cloned gene
  JOURNAL   J. Biol. Chem. 265, 472-476 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      153     unknown protein (AA at 1)
    pept        211     1101     tetrahydromethanopterin formyltransferase
BASE COUNT      371 a    260 c    305 g    324 t
ORIGIN      
        1 ggatccttgg agggagatga gggagatgag ggtatagtaa atatttttgt tctctggtgc
       61 tgtaataacc atatgtttat cacccggcgc ataaccctct tattattcct aggggccctc
      121 agattcatgg ttttacagtt atttctaagc tgaattgaaa gtagaaaaat atattaaaca
      181 ttctcccagt aaatcaacta ggtgatagtt atggagataa atggtgttga aatagaagac
      241 acatttgcag aggcctttgg cataaaggtt tcaagggttc ttgtaaccgc aacaaaaaaa
      301 cttgccaaaa tagcagcaac cgaggcaaca ggttacggta cctcagttat agggtgccct
      361 gcagaggctg gcatcgactg ctatgttcca cctgaggaga cacctgacgg aagaccaggg
      421 tacattataa tgatatgcaa cccatccaag aagagccttg accatgagct ccttgaaagg
      481 atagggatgg gtatactgac agcaccaacc accgctgtct tcgatgcact ggatgatgag
      541 gatgagaagc tcaacatagg attcaaacta aaattcttcg gtgacggcta cgagaaggaa
      601 cttgaaattg atggaagaaa gatccactca atcccaataa tgtcagggga cttcctgatt
      661 gaaagccagt tcggaataaa ggatggagtt gccggcggaa acttcttcat aatgggtgac
      721 agtcaggcat cagcgctact ggctgcacag gcagctgttg atgccatagc tgcagttgaa
      781 ggtactgtaa caccattccc tggtggagtg gttgcttcag gatcaaaggt cggctcaaac
      841 aagtacaagt tcctcaacgc ctcaaccaat gagaagatgt gtgtgaccct gaaggacgaa
      901 gttgaggaca cccagatacc tgagaacgtc aacggggtct acgagatagt catagatggt
      961 gttgatgaag aggccgtaag ggaggccatg aaggaaggta taaaggctgc ctgcacggtt
     1021 ccaggcataa taaagataag tgcaggaaac tatggcggta acctgggagc ctacaagata
     1081 aaactccatg acctcttctg atttttttat ccatgccctg atcctatcag ggttgacctt
     1141 tttttattct atgcagaatt tttccatcat ttcgctttca actatttccc atatctcgtc
     1201 tcgcggaggt taacaccgag agtatacccc ttttcctgag gtcatcttca attttcatta
//
LOCUS       PFAROP1      3061 bp ds-DNA             INV       01-JUN-1990
DEFINITION  P.falciparum rhoptry associated protein 1 (rop1) gene, complete
            cds.
ACCESSION   M32853
KEYWORDS    rhoptry associated protein 1.
SOURCE      P.falciparum (strain K1) blood stage DNA, clones lambda rop[1-3].
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 3061)
  AUTHORS   Ridley,R.G., Takacs,B., Lahm,H.-W., Delves,C.J., Goman,M.,
            Certa,V., Matile,H., Woollett,G.R. and Scaife,J.G.
  TITLE     Characterisation and sequence of a protective antigen from
            Plasmodium falciparum
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by R.G.Ridley, 15-MAR-1990.
FEATURES       from  to/span     description
    pept        628     2976     rhoptry associated protein 1 (rop1) precursor
    sigp        628      695     rhoptry associated protein 1 (rop1) signal
                                 peptide
    matp        696     2973     rhoptry associated protein 1 (rop1)
BASE COUNT     1266 a    335 c    357 g   1103 t
ORIGIN      
        1 gtatacacac atacaaatat gtaaaaatag acatttcaaa atttttaata taaaataaca
       61 tatatatata taaataaata gtaaaattaa ttttttttat ttcatcattt tatatataca
      121 tatatatata tatatatata tatatattta tttatttatt tatatacata tataagttat
      181 ttatttattt atttcctttt tggttgtatt ttcgtttttt tgattctatt tatatattta
      241 tagtaatatt tataattata attattttat ttaatttttt atctttttca ttttatatat
      301 taataacttt cttttaaata ttatgatatt aatttttatt ttttagtatt tatataagat
      361 tttaaaatag acaaaataaa tataaataaa ttatatatta tatatatata tatatatttt
      421 ttttttttta tagttatgca catatatatg tattaatata tattatcttt aaactattag
      481 aatatatact ttattattct aaatattatt aatttgtaat actcacatat atatatatat
      541 atatatatat atttatttat ttatttattt atttatttat ttatatagta aaaaatcaat
      601 taattttttt tttttttttt atatataatg agtttctatt tgggtagctt agtaataata
      661 ttccatgtac tcttccgtaa tgtcgctgat ggtataaatg taaacggaga taataattat
      721 gggaaaacaa taatcaataa tgatttcaat tttgatgatt acaattattg gacaccaata
      781 aataaaaagg aatttttaaa ttcctatgaa gatgaatttt caagtgaatc ctttttagaa
      841 aataaatcta gtgttgatga tggaaatata aatttaacag atacaagtac atcaaataaa
      901 agttctaaaa aaggacatgg tagaagtaga gtaagatcag catcagctgc tgcaattctt
      961 gaagaagatg attcaaaaga tgatatggaa tttaaagctt ctccttcagt tgttaaaaca
     1021 tctactccat caggtacaca gacatctggt ttaaaatcat ctagtccatc tagtacaaag
     1081 tcatcaagtc catcaaatgt aaaatcagct agtccacatg gtgaatctaa ttcttctgaa
     1141 gaaagtacta ctaaatcctc aaagagaagt gcttcggttg caggtattgt aggtgccgac
     1201 gaagaagcac ctcctgcacc aaaaaacacc ctcactccat tagaagaatt atatcctact
     1261 aatgttaatt tatttaacta taaatattca ttaaacaata tggaagaaaa tatcaatata
     1321 cttaaaaacg aaggagattt agttgcacaa aaagaagaat ttgaatatga tgaaaatatg
     1381 gaaaaagcta aacaagacaa aaaaaaagca cttgagaaaa taggaaaaca atcagacgaa
     1441 gaacctttta tgttttcaga aaataaattt cttgaaaatc aagtaaaaga aagaaatgtt
     1501 gctggatcct tttctcgatt tttcagtaaa ttaaatcctt ttaagaaaga tgaagtaata
     1561 gaaaaaactg aagtatcaaa gaaaacattt tcaggtatag gttttaatct tactgacaaa
     1621 gaagctaaag tattaggtgt aggtgcaacc tatcaagaat atccagaaac catgttatat
     1681 aactgtccaa acaattctaa tttgtttgat actatagaat cattacaagg aagaataatt
     1741 gatattaaaa aaagagaaag catgatatca acaactttcg aacaacaaaa agaatgttta
     1801 aaaaatatgg gtgtacttga tcttgaatta aacgatacac aatgtaaatt tggtacatgt
     1861 ataggtagct ttggagaaca tcatcttaga ttatacgaat ttgagaatga cttatttaaa
     1921 tttcatccaa atattgatta tttaacttta gctgatggat ataaattaca aaaaaatcat
     1981 atatatgaat tatcccatgt aaacttttgc ttattaaatc ctaaaacatt agaagaattt
     2041 ttaaaaaaaa aagaaatcaa ggatcttatg ggtggtgatg atcttataaa atataaagaa
     2101 aattttgata actttatgag tatatctata acatgccata ttgaatcttt aatatatgat
     2161 gatattgaag catctcaaga tattgctgct gtattaaaaa ttgctaaaag taaattacat
     2221 gtaataacat caggtttatc atataaagca agaaaattag tatataaaat ttatagtgaa
     2281 attcaaaaaa atccagatga actctatgaa aaattaacat ggatttatga taatatctat
     2341 atgattaaaa gatattatac tgcatatgct ttagaaggtg tctgttcata tcttgaacat
     2401 gataaaagtc aaatgtatac agaattacat atttataaca aaatagtcga ctctgttcgt
     2461 tattatagtt catgctttaa aaacgttatt gtttataatg ctatcatttc tggtatacat
     2521 gaaaaaataa aacatttctt aaaattagta ccaagacaca actttctttt ggattatcac
     2581 tttaattcaa tttttgaaaa agaaattaaa ccagccaaaa aatatagtac ttcacatatt
     2641 tattttgatc caactgttgc atcatatgct tattataatt tagatagaag aaccatggtt
     2701 actattatta atgattattt cgaagcaaaa aaaaaagaat taaccgttat agtatctcgt
     2761 atgaaaacag atatgctcag tcttcaaaat gaagaatcaa aaataccaaa tgacaaaagt
     2821 gcaaattcaa aactagctac aagattaatg aaaaaattta aagctgaaat cagagatttc
     2881 ttcaaagaaa tgcgtataca atatgctaaa ttaataaaca tacgttacag atctcactta
     2941 aagaaaaact actttgcctt caagagatta gattaagaat ataaacttga aaaatatata
     3001 taatgtaaaa atatactaaa tatattaata taatatatat atatatatat atatatatat
     3061 a
//
LOCUS       STYFLICD      350 bp ds-DNA             BCT       01-JUN-1990
DEFINITION  S.typhimurium phase-1 flagellin C and D (fliC, fliD) genes,
            partial cds.
ACCESSION   M33541
KEYWORDS    phase-1 flagellin.
SOURCE      S.typhimurium DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 350)
  AUTHORS   Inoue,Y.H., Kutsukake,K., Iino,T. and Yamaguchi,S.
  TITLE     Sequence analysis of operator mutants of the phase-1 flagellin-
            encoding gene, fliC, in Salmonella typhimurium
  JOURNAL   Gene 85, 221-226 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        313  >   350     phase-1 flagellin C
    pept         56  <     1 (c) phase-1 flagellin D
BASE COUNT      120 a     63 c     78 g     89 t
ORIGIN      
        1 aacaactggt ctaacggtaa gtttgatccc acacctaatg atgaaattga agccatgcct
       61 tcttcctttt tgattgcaaa cagtagttaa gcgcgttatc ggcaatctgg aggcaaagtt
      121 taatgataat tttgcaaaaa taatgcgcgg aataatgatg cataaagcgg ctatttcgcc
      181 gcctaagaaa aagatcgggg gaagtgaaaa attttctaaa gttcgaaatt caggtgccga
      241 tacaagggtt acggtgagaa accgtgggca acagcccaat aacatcaagt tgtaattgat
      301 aaggaaaaga tcatggcaca agtcattaat acaaacagcc tgtcgctgtt
//
LOCUS       SYNCRPBS1      88 bp ds-DNA             SYN       01-JUN-1990
DEFINITION  Synthetic E.coli cyclic AMP activated promoter DNA.
ACCESSION   M33414
KEYWORDS    Artificial promoter; cyclic AMP activated promoter.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Aiba,H., Hanamura,A. and Tobe,T.
  TITLE     Semisynthetic promoters activated by cyclic AMP receptor protein of
            Escherichia coli
  JOURNAL   Gene 85, 91-97 (1989)
  STANDARD  simple staff_entry
BASE COUNT       24 a     22 c     18 g     24 t
ORIGIN      
        1 aaatgtgatc tagatcacat ttcaagcttg actgactgag atcagcttgc aacctttccg
       61 gtcgcggcga taaagacatc ttcaccgt
//
LOCUS       SYNCRPBS2      88 bp ds-DNA             SYN       01-JUN-1990
DEFINITION  Synthetic E.coli cyclic AMP activated promoter DNA.
ACCESSION   M33415
KEYWORDS    Artificial promoter; cyclic AMP activated promoter.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Aiba,H., Hanamura,A. and Tobe,T.
  TITLE     Semisynthetic promoters activated by cyclic AMP receptor protein of
            Escherichia coli
  JOURNAL   Gene 85, 91-97 (1989)
  STANDARD  simple staff_entry
BASE COUNT       25 a     24 c     18 g     21 t
ORIGIN      
        1 aaatgtgatc tagatcacat ttggaattcc cggggatccg ggccctcttc aaacaggccc
       61 agttcgccaa taaaatcacc tgattcag
//
LOCUS       SYNCRPBS3      56 bp ds-DNA             SYN       01-JUN-1990
DEFINITION  Synthetic E.coli cyclic AMP activated promoter DNA.
ACCESSION   M33416
KEYWORDS    Artificial promoter; cyclic AMP activated promoter.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 56)
  AUTHORS   Aiba,H., Hanamura,A. and Tobe,T.
  TITLE     Semisynthetic promoters activated by cyclic AMP receptor protein of
            Escherichia coli
  JOURNAL   Gene 85, 91-97 (1989)
  STANDARD  simple staff_entry
BASE COUNT       17 a     10 c     14 g     15 t
ORIGIN      
        1 aaatgtgatc tagatcacat ttggagtgcc tgaatgggcg tacagttatg aaaccc
//
LOCUS       SYNCRPBS4      56 bp ds-DNA             SYN       01-JUN-1990
DEFINITION  Synthetic E.coli cyclic AMP activated promoter DNA.
ACCESSION   M33417
KEYWORDS    Artificial promoter; cyclic AMP activated promoter.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 56)
  AUTHORS   Aiba,H., Hanamura,A. and Tobe,T.
  TITLE     Semisynthetic promoters activated by cyclic AMP receptor protein of
            Escherichia coli
  JOURNAL   Gene 85, 91-97 (1989)
  STANDARD  simple staff_entry
BASE COUNT       20 a     10 c      9 g     17 t
ORIGIN      
        1 aaatgtgatc tagatcacat ttggaattcc aaatgtgatc tacagttatg aaaccc
//
LOCUS       TRFCRE1      3940 bp ds-DNA             INV       01-JUN-1990
DEFINITION  C.fasciculata retrotransposable element I (CRE 1) gene
            complete cds.
ACCESSION   M33009
KEYWORDS    integrase; retrotransposable element I; reverse transcriptase.
SOURCE      C.fasciculata.
  ORGANISM  Crithidia fasciculata
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 3940)
  AUTHORS   Gabriel,A., Yen,T.J., Schwartz,D.C., Smith,C.L., Boeke,J.D.,
            Sollner-Webb,B. and Cleveland,D.W.
  TITLE     Rapidly rearranging retrotransposon within the miniexon gene locus
            of Crithidia fasciculata
  JOURNAL   Mol. Cell. Biol. 10, 615-624 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Gabriel, 21-MAR-1990.
FEATURES       from  to/span     description
    ORF     <   416     3837     ORF (AA at 2)
    rpt           1      415     mini-exon gene repeat
    site        386      415     target site duplication
    site        415      415     mini-exon splice junction
    rpt        3899     3940     mini-exon gene repeat
    site       3899     3927     target site duplication
BASE COUNT      890 a   1032 c   1330 g    688 t
ORIGIN      
        1 aagcttccgg aaacaaccgg cacaaatttt gaggcggaag cgctgctttt ttttgtgtcc
       61 gggggggtgc tccttggggt ccccctgtcc agccccagcc ggtcgcccac cacataggaa
      121 tttgcgaagg acccccaaaa atcccggtcc ccggggcgag ttgtcccaac tttttcaaac
      181 ctcatgaaga gctagttgcg tcattgaaaa gttcgtgtgc agaaaccccc tcccccacgt
      241 ttgtacaatg gaagagttta cgatacaggt tttctcacgg ttttgaggtg ttttttcgaa
      301 aaacaaaaaa tatagaggtg tatagcgctt atttttgaca cccccctcaa aacatgctgg
      361 gggtataggt ccttccaact aacgctatat aagtatcagt ttctgtactt tattgaacgg
      421 cattcggtct agtgggtcca ctcccaacct tctcctcctt ggtctcgggt tcgattccgg
      481 tcgggcacga aactctcttc ctctctaaac tccacatata catccacaat aaccactctc
      541 ataactgttc tggtgcgggg ccatttcgaa ccatattgac atttggcgca ccaaactcac
      601 acatactaac caaaaatggc agagccaccc ggggcgacgg tccgagcgtg gcggagcacc
      661 tcaagcgccg gcgggtcgaa ggatctgagc cggtcgtggt ggggtctcgc caggagggtc
      721 tctccggtga gtcagcagtc gaagcgattg ttgtggaaag tggcagcgag gctgacgaag
      781 aatccacggc gagggggctg ggagcctctc gacaagcgcc aggtgatgac tcgggcgggc
      841 cggtgcccag ggcggaggag gacctcgccc cggttgggta cctgtatccc gagaacctgg
      901 ccgtcccaca gggggcggcg tgtccggtgg tgggttgtgg gtaccgaccg aacacccggg
      961 tgggaccgag actggtggaa cacctgaaca cggtgcaccg ggatatcctg ggcaacattc
     1021 ctgtcgacgc ctggcgacgt cagggactcg tccgctgcct acggtgtggc tcggccttga
     1081 ccgcgtccgg tcatggtcgg ggtgcacacg gtgggaagtg tggtccgtac agaagcagaa
     1141 acgcggccat tagggctcgg acgcagagtt tctttggtgg aatttcgcag aattctgaca
     1201 cacttacggc gggaaacggg gtctcagaag ctggtggtct agtggaagtg ggtagggtag
     1261 aagacccaac cacagacccg tggtacaggg ttcgaacccc cttgaagcga caaatttacc
     1321 gtacggactt ccgggtgtgg cagggcttgg cccggccagt actcctgggg tactctctgg
     1381 cggatacagc agggaaggag ggacgccttc tcgccctgct caatctcccc agggaccatc
     1441 tggaggtcca ggtaaatgct aagaggggaa tccaaccaca gccggctgaa atcgaggccc
     1501 aggttcgaag gaaggtagtg gagttggcag ggataggagc ggtaggacga gcgatggcag
     1561 tgatgactcg ggggcggctg gtcgaggtgc ccctggagcg ggtgatggag cagctcgagg
     1621 agctccaccc gcaggaggac ccgcggggtt atccggcagc accagatacg agcgaggtgc
     1681 tgcgggccaa ggagcagaag gtgcggcggg cgatcgcggc acggatgggg aggggtacgg
     1741 cgcccggcct cgatggctgg acgcgggagc tcctcctccc cctcgcagaa gacccggccc
     1801 tgctacacga gatcacgtcg gtggtctcgg atatcatgca ggggaaggtg gccgaggtgg
     1861 tggcgcggag gctgcggagc agcgccgtca ccccgattcc gaaggacgag gcggggacga
     1921 agatacgccc gatcgtgccg gagtcggcct ggctgaagct ggcctcgctg gtggcgatgg
     1981 cggagatacc atccagcttc aaggagacct tcaaggggtg gcagtacggg gtctgggggg
     2041 acgtcgccaa agcagtggcg aagatccgcc gggacagcga ggagcacgag tacctggtgg
     2101 cactcgacgg ggtcaatgca tacaatacga tgagcagggc ccacatcctc caagccgtgt
     2161 acgccgagca gcgcctgaag ccgatctggg gggtggtgaa ggtggcgctt ggggggccgg
     2221 ggttcctggg agtatacagg gacggctgcc tcaagggcaa cctgtggtcc accaagggaa
     2281 tccggcaggg catggtgctg ggccccctcc tgtacgcgac cgggatggca gcggccatcg
     2341 ggccggtacg gcagcgtatc cccggggtcc ccgtgacggc ctacatcgac gacatcaccc
     2401 tcgcggccag cggggcggag ggagccaggg cagccgaggc atacgcagac gccctcgaga
     2461 cggtcggggt ggtcaccaac gccaggaagt cgatggtggt ggggccagaa ggcacccggg
     2521 tggggatcgg gggtgtagac ctgccggtgg tggccgaggc ccggatcctg ggggcccact
     2581 tccgagcaag ggggacacca gaggcccgta ccatcgagtg gctgcaggcg gccgtcgaga
     2641 agtggcgacc catccaccag aagctgcggc aggacatcat cccgaagaac attgcgatga
     2701 tgatgacccg catcagcctg gggtccaaga tgaccttcct cctccagacc cactcgccac
     2761 aggaactgga gaccgcagcg aagacggcag acgacgaggt cgagcagacc ctccagcacc
     2821 ttatggggca ggtagagatc acgccccgag caaggctgct ggcacaactc ccgatcagag
     2881 agggggggct gggtctccgg cgaagcagcg agattgcgaa gttcgcacag gcagacgtgg
     2941 ggcaaggcga ggcccaccag gcacacacaa aggcactaga tgaagggatc aagcaccagt
     3001 tacaaccact cctctcggag tccgaggtgc agatcctgaa gtcgaacgct ggaatggggg
     3061 ccgggcgggt gctgacagat agtagcctga ggatcccaga cgtggcagca acaatcgcgc
     3121 tgagggagag actcctgctc agggtgctcc cggagggatg cagtgtgtgt gtggggggga
     3181 cgcgacgaac taccatgtac acacgtgctc caacataccc accaagcccc ggacccgacg
     3241 acacgacggg gtggtggatg agctggtggc cctggccagg aagatggggt acgagcccag
     3301 caaggagccg agggcggacg ttgacgagta gggcgaggcc ggacctgtac atcacaggaa
     3361 gcctgaagcc ggcggcgacg gatgtaacaa taacctaccc gggcaggcag gcgaggggag
     3421 cacactcccg ttgcagcagg cctaccggaa taagatgggg gcctgggagg catgggggaa
     3481 cctgcgaggg gtggacatgc agccggtggt cctcgggacg aacgcggaga tacacccgga
     3541 gagtgcgaat ggatacgaag gttgacctcg gtcgaagaca aagacaaaat acataccagt
     3601 tacaacgagg tgacgggacg aatcgtggag acggtgttgg ttgggaacgt ggagctgttc
     3661 aacgcagtga cgaacctggc gctggtcagg gagttgatgt aggaaggtac gggtgggtag
     3721 tagtagaata tcgaacggcg actaggaagt tggaccattg ttacttgtta cttgtgattc
     3781 tgacgacgac gaaagattgt tccttgttac ttgttaccct gaaatttgat attacttgaa
     3841 ttgatatgat atgattgata caaaaaaatt taaaaaaaaa aaaaaaaaaa aaaaaaaaat
     3901 ataagtatca gtttctgtac tttattggta taagaagctt
//
LOCUS       SQDKINEA     3693 bp ss-mRNA            INV       01-JUN-1990
DEFINITION  Squid kinesin heavy chain mRNA, complete cds.
ACCESSION   J05258
KEYWORDS    kinesin; kinesin heavy chain.
SOURCE      Squid, cDNA to mRNA.
  ORGANISM  Loligo pealii
            Eukaryota; Animalia; Eumetazoa; Mollusca; Cephalopoda; Coleoidea; 
            Teuthoidea; Myopsida; Loliginidae.
REFERENCE   1  (bases 1 to 3693)
  AUTHORS   Kosik,K.S., Orecchio,L.D., Schnapp,B., Inouye,H. and Neve,R.L.
  TITLE     The primary structure and analysis of the squid kinesin heavy chain
  JOURNAL   J. Biol. Chem. 265, 3278-3283 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        421     3324     kinesin heavy chain
BASE COUNT     1220 a    684 c    822 g    967 t
ORIGIN      
        1 aaaagaaatc tttctaggcg gccataagtc gctgttctac aaataatacc gagatcgtca
       61 taacaaaccg cctatttaac acttccgaac aacttttata gaatatttac gactgttttg
      121 aatagtatat ttagctgtca ggctgcgatt tgcttccttt tatttattta tttatttatt
      181 tatatttttt gtagtagaaa tttacatttt ttttttttgc gggggagacg tcaggatatt
      241 attacctact tttgtttgtg cgcgtgtgtt gaagaaacga aagacgttca agctttattg
      301 gattagagaa ttgataaaga tcggattaac tgtataataa cgctaaactt taatttattt
      361 gggacctagg aaagtcccat ataaacccaa gaatttatat cgtcctcaaa caacgccgcc
      421 atggacgtgg cttccgagtg taacattaaa gtcatctgcc gagttagacc ccttaacgaa
      481 gccgaggaac gggccggcag caagttcatc cttaaattcc caactgatga ttccatctct
      541 atcgctggga aagtgttcgt atttgataag gttttaaaac ctaatgtctc acaagaatat
      601 gtgtacaatg ttggtgctaa gcccattgtt gctgatgtac ttagtggttg caatggaaca
      661 atttttgctt atggacaaac atccagtggg aaaactcata ctatggaggg tgtgctggac
      721 aaaccctcca tgcacggcat cattccacgc attgtacaag acatcttcaa ctacatttat
      781 ggaatggacg aaaacttgga gtttcatatc aagatttcat attatgaaat ttacttggac
      841 aaaatcaggg atcttttgga tgttaccaag accaacttgg ctgtccatga agacaaaaac
      901 agagtcccgt ttgtcaaggg tgccactgaa cgatttgtat ccagtcctga agaagtaatg
      961 gaagttattg atgagggtaa aaacaacaga catgtagctg tgacaaacat gaatgaacac
     1021 agttcgagaa gtcatagtgt atttttaatc aatgtcaagc aagaaaatgt tgaaacccag
     1081 aaaaaattaa gtggtaaact ttacctagta gatcttgctg gtagtgaaaa agtaagcaaa
     1141 actggtgcag aaggtgcagt ccttgatgaa gctaaaaaca tcaataaatc tttgtcagct
     1201 ctgggaaatg tcatttcagc tcttgctgat ggaaataaat ctcacgtccc atacagagac
     1261 agtaaattga cgagaatctt gcaggaaagt ttgggtggta atgcccgaac cactatggtc
     1321 atatgttgtt cacctgcctc ttacaatgaa tccgagacca agtctacttt gctttttggc
     1381 caaagagcta agactattaa gaatgtggta tcggttaatg aggagcttac agctgatgaa
     1441 tggaaacgaa gatatgagaa agaaaaagaa agggttacta agcttaaagc cactatggct
     1501 aagcttgagg cagaactaca gagatggaga actgggcaag ctgtatctgt agaagaacaa
     1561 gttgatctta aggaagatgt ccctgctgaa tctcctgcca ctagtacaac tagcctggct
     1621 ggaggactca tcgcttcaat gaatgagggt gaccgcaccc aactggaaga agagcgtctc
     1681 aagctttatc aacagttgga tgacaaggat gatgaaatta acaatcagtc tcagttaatt
     1741 gaaaaattga aggagcagat gatggaacaa gaagatttaa ttgcccaatc tcgtcgtgat
     1801 tatgaaaatc tacaacaaga tatgtctcga atccaagctg acaatgagtc tgccaaggat
     1861 gaagtcaaag aggtcttgca agctcttgag gaactggcca tgaactatga tcagaagtct
     1921 caggaggtgg aagacaaaaa taaggagaat gagaatctaa gtgaggaact taaccagaaa
     1981 ttgagcacct tgaatagctt acagaatgaa ttggatcagc tgaaggattc atctatgcac
     2041 cataggaaac gagttactga catgatgatt aacttgttga aagatcttgg tgatattggt
     2101 actattgtag gtggaaatgc tgcagaaact aagccaactg ctggttctgg tgagaaaatt
     2161 gaggaggaat tcactgtagc tcgtctgtat attagcaaaa tgaaatctga ggtgaagacg
     2221 ttggtgagcc gaaacaacca actggaaaat actcagcagg acaattttaa gaaaattgaa
     2281 acacatgaaa aggatttgtc caactgcaaa ttactcattc aacaacatga agccaaaatg
     2341 gctagtttac aggaagccat taaagattct gagaacaaga agaggatgtt ggaagacaat
     2401 gtggattctc taaatgagga atatgccaaa cttaaagctc aagaacaaat gcatttagct
     2461 gctctgtccg aacgggagaa ggagaccagc caagcaagtg aaactcgaga agttcttgag
     2521 aaacagatgg aaatgcatcg ggaacaacac caaaaacaac ttcaatcttt acgggatgaa
     2581 atttcagaaa agcaggctac tgtggataat cttaaagatg ataaccagag actttctttg
     2641 gccctggaga aacttcaggc tgattatgac aaactcaaac aagaggaagt tgaaaaggct
     2701 gccaagttgg ctgacttatc attgcagatt gaccgtcggg agcaagccaa acaggatctc
     2761 aaaggacttg aagagactgt ggcaaaggag ctgcaaacac tacacaatct tagaaaactt
     2821 tttgtacaag atctacaaaa taaggtcaaa aagtcatgca gcaagaccga agaagaagat
     2881 gaagatactg gtggtaatgc tgcccagaaa caaaagattt ctttcttaga aaataatctt
     2941 gagcagctga ctaaggttca caagcagctt gttcgagaca atgctgactt gcgctgtgag
     3001 ttgcccaagc tggagaaaag gttacgtgct actatggagc gagtcaagtc cttggaatct
     3061 gctctcaagg atgccaagga aggcgctatg agagacagaa aaaggtacca gcatgaggtg
     3121 gatagaatca aggaagctgt ccgccagaag aatctcgctc gccgaggaca tgctgcacaa
     3181 atagccaaac caatccggcc tggacaacac cagtctgttt ctcctgcgca ggcagcagct
     3241 attcgcggtg gtggtggttt gtcccagaat ggaccaatga ttacatctac gccaattcga
     3301 atggcacctg aaagcaaggc ataaatatct agttggaaga ggaattttta ctcccattca
     3361 tcaaagaaca acagaactac aaagctttct agtgcattca ttgataattg aaatggagga
     3421 tatgtatctc ctcctcctct actattatat ctcctcctcc tacatgaagc tgcattccgt
     3481 tttgttagcg tgtaccctgt gtctcgctgc tacagtggag tgtatgtggg ctcttgcata
     3541 ccactgcata gtggcagtta ggatggctgg ttacttagct atttaactgc ctttctacct
     3601 gcctgcttcc cacctgcctt agtgatggca agtgagtgac tgctctggtg gcaatccaac
     3661 ttttgtacaa ttattaaaaa aaaaaaaaaa aaa
//

GenBank-Updates@genbank.bio.net (06/02/90)

LOCUS       RATMRF4AA    1271 bp ss-mRNA            ROD       02-JUN-1990
DEFINITION  Rat muscle regulatory factor mRNA MRF4, complete cds.
ACCESSION   M27151
KEYWORDS    differentiation protein; muscle regulatory factor.
SOURCE      Rat cDNA to mRNA, clone lambda-cRS4-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1271)
  AUTHORS   Rhodes,S.J. and Konieczny,S.F.
  TITLE     Identification of MRF4: A new member of the muscle regulatory
            factor gene family
  JOURNAL   Genes Dev. 3, 2050-2061 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            S.F.Konieczny 16-AUG-1989, for release after publication.
FEATURES       from  to/span     description
    pept         45      773     muscle regulatory factor MRF4
BASE COUNT      347 a    292 c    303 g    329 t
ORIGIN      
        1 gtccatcacc cagttcagat cagtcagagg ccaaggagga gaacatgatg atggaccttt
       61 ttgaaactgg ctcctatttc ttctacttag atggagaaaa tgtgactctt cagccattag
      121 aagtggcaga gggctctcct ttgtacccag ggagtgatgg taccctatcc ccttgccagg
      181 accaaatgcc ccaggaagcc gggagcgaca gcagtggaga ggaacacgtt ctggctcccc
      241 caggccttca gccaccccac tgcccaggtc agtgtctgat ctgggcttgc aagacttgca
      301 agagaaaatc tgcccccaca gatcgtcgga aagcagctac cctgcgcgaa aggaggaggc
      361 ttaagaaaat caacgaagcc tttgaggcct tgaagcgtag aactgtggcc aaccccaacc
      421 agaggctgcc caaggtggag attctgagaa gtgccatcaa ctacattgag cgtctgcagg
      481 acctgctgca ccggctggat cagcaagaga aaatgcagga gctgggggtg gacccttaca
      541 gctacaaacc caagcaagaa attcttgagg gtgcggattt cctgcgcacc tgcagcccgc
      601 agtggccaag tgtttcggat cattccaggg gcctggtgat aactgctaag gaaggaggag
      661 caagcgtcga tgcttcagcc tccagcagtc ttcagcgcct ttcttccatc gtggacagta
      721 tttcctcaga ggaacgcaaa ctccccagcg tggaggaggt ggtggagaag taactcagtc
      781 agcatttgga acattcttcg ctcagcagga agagcccctt tccgcctaat catttagatt
      841 agggctcaca gaccccagaa tttatgaaag gcaagagact tagtgttaaa aaagaaacct
      901 ctccccacct caagtgaaaa tccttcggct tggggctttt attataacta ttattgtatc
      961 tgaaccgcta gtggcttagc tctagaaccc taattttgtt tttagtttgg ttggtttttt
     1021 ataacatatt aacttttgct atgatcacgt gaccctttcc tgtcagttgc aaacgaagtt
     1081 cattcctgtc taaatcaaag tgggaacgtt taatcctaag agtatttaat gtacttttgt
     1141 aaatagtctt agtactttcg tttttatgta aacctaaagg acatatttta aatgtggaat
     1201 taagcattgt atataaaatg tgtgagagtc tgatattgta atattaaaat attaatacgt
     1261 ttctacacgt a
//

GenBank-Updates@genbank.bio.net (06/05/90)

LOCUS       HUMBREAKZ     680 bp ds-DNA             PRI       05-JUN-1990
DEFINITION  Human chromosome 10q24 breakpoint region.
ACCESSION   M33603 M31943
KEYWORDS    Chromosomal translocation.
SOURCE      Human lung fibroblast DNA, clones G10-1 and G10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 680)
  AUTHORS   Zutter,M., Hockett,R.D., Roberts,C.W.M., McGuire,E.A.,
            Bloomstone,J., Morton,C.C., Deaven,L.L., Crist,W.M.,
            Carroll,A.J. and Korsmeyer,S.J.
  TITLE     The t(10;14)(q24;q11) of T-cell acute lymphoblastic leukemia
            juxtaposes the delta T-cell receptor with TCL3, a conserved and
            activated locus at 10q24
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3161-3165 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.M.Zutter, 07-FEB-1990.
FEATURES       from  to/span     description
    site        256      257     breakpoint
BASE COUNT      119 a    215 c    196 g    150 t
ORIGIN      Chromosome 10q24.
        1 ggatccacta gccgggtcag gcagagagaa ggaggccata gattcaggac caccttctgc
       61 agctactctt ggcctgggaa tcttagagaa aggggtgcaa cgtgaacagt ttcttccgtc
      121 ctggctgctg ttctaggacc caggaagcga gccccaggat cagacacaag aaaagaacag
      181 gttcccccac ctccagtccc ctacacacat gcacttcgcg cccctagctc cggtgcccgt
      241 ggtgctgcgg ggccagtggg ggctctgagc cccggtaaat cagcagaacc agtggccttt
      301 tttttttttt ccggagagca ggcttcatct gggaggatgg cagcggggct tagccgccca
      361 ggacagctag cttttcattt tcgttctctc ccggctgggt gtcctacccc gggcccagcc
      421 ctgttatctt gggcacgaac aatgcaccgg taggctggtg atcggtggcg ggaagactac
      481 tcccgggaca ttgggtacta agtggtggcc cgtgccgcct tctctgctgc cccgtcccgc
      541 cttcctctgc tggccctggc tccctctgct ccgtttccac tctgggcact caactctccc
      601 tctggcgtgg atcccccggg ctgcaggaat tcgatatcaa gcttatcgat accgtcgacc
      661 tcgagggggg gcccggatcc
//
LOCUS       HUMTCDXB       60 bp ds-DNA             PRI       05-JUN-1990
DEFINITION  Human T-cell receptor rearranged delta-chain gene V-region
            (V2-N-J1).
ACCESSION   M34439 M31943
KEYWORDS    T-cell receptor; delta chain; rearranged gene; variable region.
SOURCE      Human (T-cell lymphoblastic leukemia patient 1143) DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 60)
  AUTHORS   Zutter,M., Hockett,R.D., Roberts,C.W.M., McGuire,E.A.,
            Bloomstone,J., Morton,C.C., Deaven,L.L., Crist,W.M.,
            Carroll,A.J. and Korsmeyer,S.J.
  TITLE     The t(10;14)(q24;q11) of T-cell acute lymphoblastic leukemia
            juxtaposes the delta T-cell receptor with TCL3, a conserved and
            activated locus at 10q24
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3161-3165 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.M.Zutter, 07-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  >    60     T-cell receptor delta-chain V2-N-J1-region (AA
                                 at 3) /hgml_locus_uid="LP0165Q" /nomgen="TCRD"
                                 /map="14q11.2"
    recomb       29       30     V2-region end/N-region start
    recomb       35       36     N-region end/J1-region start
BASE COUNT       19 a     15 c     11 g     15 t
ORIGIN      
        1 aagggtctta ctactgtgcc tgtgacacca aacgtacacc gataaactca tctttggaaa
//
LOCUS       HUMTRANSX      50 bp ds-DNA             PRI       05-JUN-1990
DEFINITION  Human T-cell leukemia t(10:14)(q24:q11) chromosomal translocation.
ACCESSION   M33602 M31943
KEYWORDS    T-cell receptor; chromosomal translocation; oncogene.
SOURCE      Human (T-cell lymphoblastic leukemia patient 1143) DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Zutter,M., Hockett,R.D., Roberts,C.W.M., McGuire,E.A.,
            Bloomstone,J., Morton,C.C., Deaven,L.L., Crist,W.M.,
            Carroll,A.J. and Korsmeyer,S.J.
  TITLE     The t(10;14)(q24;q11) of T-cell acute lymphoblastic leukemia
            juxtaposes the delta T-cell receptor with TCL3, a conserved and
            activated locus at 10q24
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3161-3165 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.M.Zutter, 07-FEB-1990.
FEATURES       from  to/span     description
    ORF     <     1  >    50     TCL3 oncogene (put.)
    recomb       20       21     10q24 DNA end/N DNA start
    recomb       27       28     N DNA end/14q11 DNA start
BASE COUNT       12 a     10 c     19 g      9 t
ORIGIN      
        1 cggtggtgct gcggggccag ggagggaacc gataaactca tctttggaaa
//
LOCUS       MUSTCPB      2369 bp ds-DNA             ROD       05-JUN-1990
DEFINITION  Mouse t complex protein-10 (Tcp-10b) gene, complete cds.
ACCESSION   M22597 J03141
KEYWORDS    t complex responder.
SOURCE      Mouse (haplotype tw5) testes germ cell DNA, clone T66B-a.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 519)
  AUTHORS   Schimenti,J., Cebra-Thomas,J.A., Decker,C.L., Islam,S.D.,
            Pilder,S.H. and Silver,L.M.
  TITLE     A candidate gene family for the mouse t complex responder (Tcr)
            locus responsible for haploid effects on sperm function
  JOURNAL   Cell 55, 71-78 (1988)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2369)
  AUTHORS   Schimenti,J., Cebra-Thomas,J.A., Decker,C.L., Islam,S.D.,
            Pilder,S.H. and Silver,L.M.
  JOURNAL   Unpublished (1989) Lewis Thomas Lab, Princeton, NJ 08544-1014
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by L.M.Silver, 07-JUN-1989.
            
            The sequence is interrupted by introns that are not indicated in
            [1] or [2].
FEATURES       from  to/span     description
    pept        421     1737     t complex protein-10
BASE COUNT      707 a    565 c    594 g    493 t     10 others
ORIGIN      Chromosome 17.
        1 tgcagaaaac catggggccc gggccatatg aaggtgcagt gggttatgaa taagctctgc
       61 tataggtgcc tttgactctc agggtgaaat accttgtctc cagtcagatc ccctaccgct
      121 gtggctaaac ctctcattct tgtttctata cagcctaggt tatggagttg taggttgggg
      181 acaaggggtt tgtcctgatg agaggcagcg aaaccttcca cagcaaaggt ctgacgaaga
      241 actaagacct cccaggcgcc ctctcaaggc cctgttcacc tcaaggtaat gctggcagta
      301 ctcatgtgat caggcaccat gctgactctc tgaaccaaag tgttttccac cctcctctgg
      361 ggacacagcc cttaatgctt actggtcacc tcgaggctaa atagtcagca gagacaagcc
      421 atgttggagg atctgagtca aggaaaaggg tccaaccatg agaagaggaa gatggagagc
      481 acagcccaga tcactgagga agacagcaag cttgatgagg ttgtggggct gcagaagcag
      541 atctgtgacc ttgggacaga gctcacaaga caatcatctt ggtggtgcgt agctcacaaa
      601 gacctccaaa gccagatcga tgctctgata aaggagaacc aggagatccg tgcggagctg
      661 aagaccttga agaagcagga tgcggaggcc accaaagcct gtataggctc gcccaccccg
      721 gcaagagcaa gcaacactct gccagtgtac ataaagatag agggaattga ttccgagaag
      781 acaacctcat gggatgaaag agatgagctt tctggaagtc ctccaaacag aagcacaatg
      841 gccaccggag gaacagactc ccaggatgaa aggctgtctt ttacatctgt ggatgaaaag
      901 gttatacaca tgtcttccaa atttctgcaa agaagcttcg gcagaatgtc accagaacca
      961 ctgtctgaca gcacattcct ggacacagag tcactggctg acatctggtc ctcaaatcca
     1021 gagacttcgg acagtgaact tctcctgcat gctcaagcaa gcagggtcat tccttgtttt
     1081 tccccaaatg cactgtgggt gcagaatatt ccaacaaagt caagagctcc taaagaaata
     1141 cagcaaacct cagacactac aaagactgat gagacaaagg aaaagcgaca cccaaacggc
     1201 aaggtggagc ggatgctcag tgacgggcga accatcatca ccttccccaa tggaaccagg
     1261 aaggagatca gtgctgacaa gaagaccacc ctcatcaggt tttttaacgg tgacatgaag
     1321 aagatcaagt ccgatcagaa agtgatttat tattatgcgg acacacaaac aatgcacaca
     1381 acctacccag atggtgttga agtggtgcag tttcctaaca agtggactga aaaattctac
     1441 ccggatggct ccaaggaaac cgtgtttcct gatgggacag tgaaacagct taaggatgga
     1501 tgtgaagaga cggtgttccc cgatgggaca tttgtgacag tgaagaggaa cggagacaaa
     1561 accatcatgt tcagcaacgg agagaaagaa atccacacgg ccaggttcaa gnggatggaa
     1621 ttcccagatg gnaccaccaa gactgtgtat tgcaatggct gccaggagac caagtatgcc
     1681 tcagggaggg tcagggtcaa agatgagaag ggaactgtca tcctggactg gaagtagtgc
     1741 atccaacaga ctcaaggcca ggcagtaggc tccaatctcc aaatatagtt ttgaccaaaa
     1801 cagacaaagc gactcacgat tcccttgaca atcctggaga tcccctgcac atgaggagta
     1861 aagagacatc caaaaattgt aaagaaagag accatgtagc caccaccatc ttgggagaga
     1921 ggtcagagaa gcagtagcca gaaatgaggt cctggacatg gcttccgtgg ccttgtncnn
     1981 accatgcctt ctgtcggagt ctggccctag agcagagggt acaatgggag caacgtgacc
     2041 actgccagct agttcctggg agcagccgtt aacaacagct ggcncngtgt ttttgcttgg
     2101 ccaacacctg cctgcatcta cttctatccc cttctagagt cctcatcaat gggatccagg
     2161 gggatgtatc ttcactggtg cctgatgtat gaatagactt gatttaacca tcctgaagtg
     2221 tctagacaga acctcatgtg gcattctatc ccataaattc acacaatttt gatntgtcaa
     2281 tcaacaagga gttgagnaaa agcacagata aaaacaaaga cagaaacata gtgngccaca
     2341 cctcctacct gttcctgtga gtctgccct
//

GenBank-Updates@genbank.bio.net (06/06/90)

LOCUS       HUMARC1      1753 bp ds-DNA             PRI       06-JUN-1990
DEFINITION  Human androgen receptor gene, exon 1.
ACCESSION   M27423
KEYWORDS    androgen receptor; dihydrotestosterone receptor;
            testosterone receptor.
SEGMENT     1 of 8
SOURCE      Human (ATCC library XLAOXNLO1) DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1753)
  AUTHORS   Lubahn,D.B., Brown,T.R., Simental,J.A., Higgs,H.N., Migeon,C.J.,
            Wilson,E.M. and French,F.S.
  TITLE     Sequence of the intron/exon junctions of the coding region of the
            human androgen receptor gene and identification of a point mutation
            in a family with complete androgen insensitivity
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9534-9538 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1)
  AUTHORS   Lubahn,D.B.
  JOURNAL   Unpublished (1990) lubahnd%uncvx1.bitnet
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            provided by D.B. Lubahn, 29-AUG-1989, [2] 04-JUN-1990.
FEATURES       from  to/span     description
    pept         79  +  1691     androgen receptor, exon 1 /nomgen="AR"
                                 /map="Xq12" /hgml_locus_uid="LL0058U"
    IVS        1692  >  1753     AR intron A
    revision      1        1     a in [2]; c in [1]
BASE COUNT      344 a    546 c    573 g    290 t
ORIGIN      
        1 agcctgttga actcttctga gcaagagaag gggaggcggg gtaagggaag taggtggaag
       61 attcagccaa gctcaaggat ggaagtgcag ttagggctgg gaagggtcta ccctcggccg
      121 ccgtccaaga cctaccgagg agctttccag aatctgttcc agagcgtgcg cgaagtgatc
      181 cagaacccgg gccccaggca cccagaggcc gcgagcgcag cacctcccgg cgccagtttg
      241 ctgctgctgc agcagcagca gcagcagcag cagcagcagc agcagcagca gcagcagcag
      301 cagcagcagc aagagactag ccccaggcag cagcagcagc agcagggtga ggatggttct
      361 ccccaagccc atcgtagagg ccccacaggc tacctggtcc tggatgagga acagcaacct
      421 tcacagccgc agtcggccct ggagtgccac cccgagagag gttgcgtccc agagcctgga
      481 gccgccgtgg ccgccagcaa ggggctgccg cagcagctgc cagcacctcc ggacgaggat
      541 gactcagctg ccccatccac gttgtccctg ctgggcccca ctttccccgg cttaagcagc
      601 tgctccgctg accttaaaga catcctgagc gaggccagca ccatgcaact ccttcagcaa
      661 cagcagcagg aagcagtatc cgaaggcagc agcagcggga gagcgaggga ggcctcgggg
      721 gctcccactt cctccaagga caattactta gggggcactt cgaccatttc tgacaacgcc
      781 aaggagttgt gtaaggcagt gtcggtgtcc atgggcctgg gtgtggaggc gttggagcat
      841 ctgagtccag gggaacagct tcggggggat tgcatgtacg ccccactttt gggagttcca
      901 cccgctgtgc gtcccactcc ttgtgcccca ttggccgaat gcaaaggttc tctgctagac
      961 gacagcgcag gcaagagcac tgaagatact gctgagtatt cccctttcaa gggaggttac
     1021 accaaagggc tagaaggcga gagcctaggc tgctctggca gcgctgcagc agggagctcc
     1081 gggacacttg aactgccgtc taccctgtct ctctacaagt ccggagcact ggacgaggca
     1141 gctgcgtacc agagtcgcga ctactacaac tttccactgg ctctggccgg accgccgccc
     1201 cctccgccgc ctccccatcc ccacgctcgc atcaagctgg agaacccgct ggactacggc
     1261 agcgcctggg cggctgcggc ggcgcagtgc cgctatgggg acctggcgag cctgcatggc
     1321 gcgggtgcag cgggacccgg ttctgggtca ccctcagccg ccgcttcctc atcctggcac
     1381 actctcttca cagccgaaga aggccagttg tatggaccgt gtggtggtgg tgggggtggt
     1441 ggcggcggcg gcggcggcgg cggcggcggc ggcggcggcg gcggcggcgg cggcgaggcg
     1501 ggagctgtag ccccctacgg ctacactcgg ccccctcagg ggctggcggg ccaggaaagc
     1561 gacttcaccg cacctgatgt gtggtaccct ggcggcatgg tgagcagagt gccctatccc
     1621 agtcccactt gtgtcaaaag cgaaatgggc ccctggatgg atagctactc cggaccttac
     1681 ggggacatgc ggtaagtttt tccttccaga aatgtcgcct ttcggcccag ggcagagtca
     1741 ctctgtgttc tgg
//
LOCUS       ONOMICACTA   2374 bp ds-DNA             INV       06-JUN-1990
DEFINITION  O.nova micronuclear MIC-ACT-1 gene, complete cds.
ACCESSION   M25531
KEYWORDS    actin.
SOURCE      O.nova micronuclear DNA.
  ORGANISM  Oxytricha nova
            Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; 
            Spirotricha; Hypotrichida; Sporadotrichina; Oxytrichidae.
REFERENCE   1  (bases 1 to 2374)
  AUTHORS   Greslin,A.F., Prescott,D.M., Oka,Y., Loukin,S.H. and Chappell,J.C.
  TITLE     Reordering of nine exons is necessary to form a functional actin
            gene in Oxytricha nova
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 6264-6268 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by A.F.Greslin, 13-JUN-1989.
FEATURES       from  to/span     description
    mRNA       2028     2220     actin mRNA
               1956     2016     actin mRNA
                427      918     actin mRNA
                947     1393     actin mRNA
               1666     1703     actin mRNA
               1405     1555     actin mRNA
               1723     1836     actin mRNA
               2245     2270     actin mRNA
               1848     1939     actin mRNA
    site        427      918     actin, exon 1
    site        947     1393     actin, exon 2
    site       1405     1555     actin, exon 3
    site       1666     1703     actin, exon 4
    site       1723     1836     actin, exon 5
    site       1848     1939     actin, exon 6
    site       1956     2016     actin, exon 7
    site       2028     2220     actin, exon 8
    site       2245     2270     actin, exon 9
BASE COUNT      745 a    531 c    450 g    648 t
ORIGIN      
        1 ttaagcagta aaaccataat agccagtact tactagtcac tgatgaaaat gatagtatgc
       61 acaagaagcc agctaaaaag aagatatatg gattgaatat ccactatgca attggaagaa
      121 tctcaccaca aggccagagg aatgcaaacc tttttggaag aacaatatcc gaagagtaag
      181 agtccaagtc ctatttcaca gtaaaaattt aaatacctat gaaaagagtc ctcgaatttc
      241 aaggatataa tctaatatga accacttata taccttaaaa attatataca aattatattt
      301 ataaataata taaaaaaaaa ttacaatagt aagatgatgt taaaatttaa tgataaattg
      361 ttctatcaag catatatttt ccatttcaaa aatgagcatt caatcttcct cttagctatt
      421 aagacatcgg agtcgtcaag gctggtttct ccggcgaaga tgctcccaga gctgtgttcc
      481 cttccatcat cggaagaccc aagaacgtca gcgctttgat cggagttgac tccgcttctg
      541 agtacatcgg agacgaggcc caacaaaaga gaggagtcct caagatcttc taccccatcg
      601 agcacggtat catcaaggat tgggaagaca tggagaagat ctggaaccac accttctacg
      661 ttgaactcag agtccagccc gatgagcacc ccgtcctcct cactgaggct cccctcaacc
      721 ccaagaccaa cagagaaaag atgacccaga tcatgttcga gaccttcaat gttcccgctc
      781 tctacgtcgc catccaggcc gtcctctccc tctactccgc tggtagaacc accggtatcg
      841 tttgcgatgc tggtgatggt gtcacccaca ccgtccccat ctatgagggt ttctccatcc
      901 cccacgccgt ctcaagaatc tcattttatt actcatttaa tatggagtaa tccagcttgc
      961 cggtagagac ttgaccactt tcatggccaa gctcctgact gagaagggct acgtgttcac
     1021 ttcctccgct gagatggaaa tcgtcagaga catcaaggaa aagctttgct tcgtcgcttt
     1081 ggactacgaa gctgccatga agcagtccta cgagagcacc accttcgaga agaactacga
     1141 gcttcccgac ggtagagtca ttaccattgg taacgctaga ttcagatgcc ccgagtatct
     1201 cttcaagccc ctcgagatga acggaaagga actcgactcc atccaatcac tcacctacaa
     1261 ctccatccag gagtgcgatg tcgatgtcag aagagacctc taccaaaaca tcatcctctc
     1321 cggaggtacc accatgtacg agggtattgg tgagagactc ctcaaggaaa ttgaggccag
     1381 agctcccaag tccataatag aatatagcca agccccgaca gaagattcgc cgtctggaga
     1441 ggaggttcca ccctcacttc tctgtccact ttcgccagca tgtggatcac caaggaagac
     1501 tacgacgaga acggagcaag catcgtccac agaaagtgcc tctgagccca aaactctatc
     1561 cctatattta tacagctgtt cagattacaa tataaagttt catcatttac tagctcattt
     1621 tattgaaaca attaaatatc tctctcctat tggaaaataa atggttttct cccaagtcca
     1681 tcaacgtgaa ggttattgcc agccccataa tagtatttgg gaagacaaaa ctctagaaag
     1741 atattgttta atttatgaaa cgctaataca aggggtgcag gatgaggatg tccgtagcct
     1801 ctgcttcctt ttaactctga tgactcgcct ttgggttgaa tattatagtt aggttgaatg
     1861 agaacttagg atatgtgttc cctttgactg catctcacta atttattaag catttcccct
     1921 attttatttt atttgactcc ttttaataaa attgttagct tgacgactcc ggaaccatta
     1981 tcaacaacca cagtttgttt gtctgccatg tgtagtaagc ttataaaaca tattgtgcta
     2041 attgattctg aggaaatata ataaatgaaa gatagaggat ttgcgggggc gaagtagtga
     2101 gattgtgata ttaattatat tagagtgggt gggaaagaga agcaagttta atgctacttt
     2161 aatagatata gcgccaaaga tacagccttt aataatatta aatctttcta ctacttacta
     2221 cacattgaag ccataattcc cttttggttg aggccaggtt gaatgaatat ttttatcatc
     2281 acaagcaaat tttaaattaa acttacacca aaatatataa attgataact taaagtctat
     2341 aaatcagaat tgagcagtca tctgacttgt gatc
//
LOCUS       ONOMICACTB   2301 bp ds-DNA             INV       06-JUN-1990
DEFINITION  O.nova micronuclear MIC-ACT-2 gene, complete cds.
ACCESSION   M25530
KEYWORDS    actin.
SOURCE      O.nova micronuclear DNA.
  ORGANISM  Oxytricha nova
            Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; 
            Spirotricha; Hypotrichida; Sporadotrichina; Oxytrichidae.
REFERENCE   1  (bases 1 to 2301)
  AUTHORS   Greslin,A.F., Prescott,D.M., Oka,Y., Loukin,S.H. and Chappell,J.C.
  TITLE     Reordering of nine exons is necessary to form a functional actin
            gene in Oxytricha nova
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 6264-6268 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by A.F.Greslin, 13-JUN-1989.
FEATURES       from  to/span     description
    mRNA       2030     2225     actin mRNA
               1959     2019     actin mRNA
                429      920     actin mRNA
                949     1395     actin mRNA
               1669     1706     actin mRNA
               1407     1558     actin mRNA
               1726     1839     actin mRNA
               2241     2266     actin mRNA
               1851     1942     actin mRNA
    site        429      920     actin, exon 1
    site        949     1395     actin, exon 2
    site       1407     1558     actin, exon 3
    site       1669     1706     actin, exon 4
    site       1726     1839     actin, exon 5
    site       1851     1942     actin, exon 6
    site       1959     2019     actin, exon 7
    site       2030     2225     actin, exon 8
    site       2241     2266     actin, exon 9
BASE COUNT      704 a    525 c    445 g    627 t
ORIGIN      
        1 ttaagcagta aaaccataat agccagtact tactagtcac tgatgaaaat gatagtatgc
       61 acaagaagcc agctaaaaag aagatatctg gattgaatat ccactatgca attggaagaa
      121 tctcaccaca aggccagagg aatgcaaacc tttttggaag aacaatatcc gaagagtaag
      181 agtcccaagt cctatttcag agtaaaaatt taaataccta tgaaaagagt cctcgaattt
      241 caaggatata atctaatatg aaccacttat ataccttaaa aattatatac aaattatatt
      301 tataaataat atataataac tacattagta agatgatgtt aaaatttaat gataattgtt
      361 ctatcaagca tatattttcc atttcaaaaa tgagcattca atcttcctct tagatatgaa
      421 ggcataggag tcgtcaaggc tggtttctcc ggcgaagatg ctcccagagc tgtgttccct
      481 tccatcatcg gaagacccaa gaacgtcagc gctttgatcg gagttgactc cgcttctgag
      541 tacatcggag acgaggccca acaaaagaga ggagtcctca agatcttcta ccccatcgag
      601 cacggtatca tcaaggattg ggaagacatg gagaagatct ggaaccacac cttctacgtt
      661 gaactcagag tccagcccga tgagcacccc gtcctcctca ctgaggctcc cctcaacccc
      721 aagaccaaca gagaaaagat gacccagatc atgttcgaga ccttcaatgt tcccgctctc
      781 tacgtcgcca tccaggccgt cctctccctc tactccgctg gtagaaccac cggtatcgtt
      841 tgcgatgctg gtgatggtgt cacccacacc gtccccatct atgagggttt ctccatcccc
      901 cacgccgtct caagaatctc attttattac tcacttaata tggagtaatc cagcttgccg
      961 gtagagactt gaccactttc atggccaagc tcctgactga gaagggctac gtgttcactt
     1021 cctccgctga gatggaaatc gtcagagaca tcaaggaaaa gctttgcttc gtcgctttgg
     1081 actacgaagc tgccatgaag cagtcctacg agagcaccac cttcgagaag aactacgagc
     1141 ttcccgacgg tagagtcatc accattggta acgctagatt cagatgcccc gagtatctct
     1201 tcaagcccct cgagatgaac ggaaaggaac tcgactccat ccaatcactc acctacaact
     1261 ccatccagga gtgcgatgtc gatgtcagaa gagacctcta ccaaaacatc atcctctccg
     1321 gaggtaccac catgtacgag ggtatcggtg agagactcct caaggaaatt gaggccagag
     1381 ctcccaagtc cataatagaa tatagccagc cccgacagaa gattcgccgt ctggagagga
     1441 ggttccaccc tcacttctct gtccactttc gccagcatgt ggatcaccaa ggaagactac
     1501 gacgagaacg gagcaagcat cgtccacaga aagtgcctct gagcccaaaa ctctatccct
     1561 atatttatac agctgttctt attacaatat aaagtttcat catttactag ctcattttat
     1621 tgaaacaatt aaatatctct ctcctattgg aaaataaatg gttttctccc aagtccatca
     1681 acgtgaaggt tattgccagc cccataatag tatttgggaa gacaaaactc tagaaagata
     1741 ttgtttagtt tatgaaacgc taatacaagg ggtgcaggat gaggatgtcc gtagcctctg
     1801 cttcctttta actctgatga ctcgcctttg ggttgaatat tatagttagg ttgaatgaga
     1861 acttaggata tgtgttccct ttgactgcat ctcactaatt tattaagcat ttcccctatt
     1921 ttattttatt tgactccttt taataaaatt gttagcttga cgactccgga accattatca
     1981 acaaccacag tttgtttgtc tgccatgtgt agtaagctta taaacattat gtgcaaattg
     2041 attctgagga aatagaataa tgaaagatag aggatttgcg ggggcgaagt agtgagattg
     2101 tgatattaat tatattagag tgggtgggaa agagaagcaa gtttaatgct actttaatag
     2161 atatagcgcc aaagatacaa cctttaataa tattaaatct ttctactact tactacacat
     2221 tgaagccaat ttcatattca ttccctttgg gttgaggcca ggttgaatga ctattttttt
     2281 catcactaag caaattttaa a
//
LOCUS       BSUTRPS      1450 bp ds-DNA             BCT       06-JUN-1990
DEFINITION  B.subtilis trpS gene encoding tryptophanyl-tRNA synthetase,
            complete cds.
ACCESSION   M24068
KEYWORDS    tryptophanyl-tRNA synthetase.
SOURCE      B.subtilis (strain QB928) DNA, clone pTSQ2.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1450)
  AUTHORS   Chow,K.-C. and Wong,J.T.-F.
  TITLE     Cloning and nucleotide sequence of the structural gene coding for
            Bacillus subtilis tryptophanyl-tRNA synthetase
  JOURNAL   Gene 73, 537-543 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by K.-C.Chow 05-MAY-1989.
FEATURES       from  to/span     description
    pept        171     1163     tryptophanyl-tRNA synthetase (EC 6.1.1.2)
BASE COUNT      433 a    306 c    331 g    380 t
ORIGIN      
        1 ctgaccgaac tggaaaaaag taggctcagc gagcagtctc cgttacaaac gtcagagtga
       61 ttccatttta atggaataat cagggtggta ccacggttca ttcgtccctt ttttacaggg
      121 gaagaatgag ccttttttat tatgttttaa gaaatgaggt tgatgttttc atgaaacaaa
      181 cgattttttc aggcattcag ccaagcggct cagtgacgct cggcaactat atcggtgcaa
      241 tgaagcagtt tgtcgaactg cagcatgatt ataacagcta tttttgcatc gtcgatcagc
      301 atgcgataac tgttcctcaa gaccggcttg agcttagaaa gaatatccgc aatctcgcgg
      361 cgctttactt agctgtcgga cttgatccag aaaaagcaac attgtttatt cagtcagagg
      421 tccccgcaca tgcgcaggcc ggatggatga tgcagtgtgt cgcctatatc ggcgagcttg
      481 agcggatgac tcaatttaag gacaaatcca aaggcaatga agctgtcgtc tccggcctgt
      541 taacatatcc gccgctgatg gccgctgata ttctgctgta cggaacggat cttgtacctg
      601 tcggcgagga tcaaaagcag caccttgagc tgacgcggaa tcttgcagaa cgcttcaaca
      661 aaaaatacaa cgacatcttt acgattccgg aagtgaaaat tccaaaagtc ggtgcacgta
      721 tcatgtctct gaatgatccg ctgaagaaaa tgagcaaatc tgatccgaat cagaaagctt
      781 atattacatt gctggatgag ccgaagcagc ttgaaaagaa aatcaaaagc gcagtaacgg
      841 attctgaagg cattgtcaaa tttgataagg aaaacaaacc gggcgtttcc aaccttctta
      901 caatttattc aatcctcggc aatacgacaa ttgaagagct tgaagcaaag tacgaaggaa
      961 aaggctacgg cgagtttaaa ggtgatttgg cagaagtcgt agtgaacgca ttaaaaccga
     1021 tccaggaccg ctattacgag ctgatagaat ctgaagaatt agaccggatt cttgatgaag
     1081 gcgcggaacg agcgaatcgg acagcaaaca aaatgctgaa aaaaatggag aatgccatgg
     1141 gtcttggaag aaaaagacgc taatcaaaaa accgctcttt gcaaagagcg gtttttttca
     1201 gttgaccttt gattcgtttt ccatttccca aagcttttcg aaaaaaggct ggcctgttat
     1261 taggttttcg aaaaagctag gtgtttgtcc gaccagcctt caagtgtgtc gttatacagc
     1321 atgtcccaaa tctcttcaaa gctcatcctt ttgatttgtt catacatatg cggattccac
     1381 tctgtgtacc agtagccaat ttccgctcgt ttttcagccc ttttaattgg tcaagcgaca
     1441 taaacatcag
//
LOCUS       ERGTUBB      2475 bp ds-DNA             PLN       06-JUN-1990
DEFINITION  E.graminis beta-tubulin gene, complete cds.
ACCESSION   X51326 M30927
KEYWORDS    beta-tubulin.
SOURCE      E.graminis hordei (race CR3) DNA.
  ORGANISM  Erysiphe graminis
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Erysiphales; Erysiphaceae.
REFERENCE   1  (bases 1 to 2475)
  AUTHORS   Sherwood,J.E. and Somerville,S.C.
  TITLE     Sequence of the Erysiphe graminis f. sp. hordei gene encoding
            beta-tubulin
  JOURNAL   Nucleic Acids Res. 18, 1052-1052 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.C.Somerville, 20-DEC-1989.
FEATURES       from  to/span     description
    pept        518      529     beta-tubulin, exon 1
                581      604     beta-tubulin, exon 2
                653      677     beta-tubulin, exon 3
                729      771     beta-tubulin, exon 4
                833      887     beta-tubulin, exon 5
                934     1724     beta-tubulin, exon 6
               1772     2162     beta-tubulin, exon 7
    IVS         530      580     beta-tubulin intron A
    IVS         605      652     beta-tubulin intron B
    IVS         678      728     beta-tubulin intron C
    IVS         772      832     beta-tubulin intron D
    IVS         888      933     beta-tubulin intron E
    IVS        1725     1771     beta-tubulin intron F
BASE COUNT      664 a    569 c    537 g    703 t      2 others
ORIGIN      3 bp upstream of SmaI site.
        1 cccggggcaa atcacactct gcctctctag cctcctnccc gaaggtcgtg ctgaaatttc
       61 tggaaacagc gtaattgctg tatggtagct tagcccaact ttttttgtgc cgtccagggc
      121 tctagggagt gctgttnagc tagtgagaat agaagtcttc cgagatattt gaaagcctac
      181 catagctctg aaggcattgt ggcaggacct agaggatcgt aagagatatg actgacaagt
      241 gttgagtcct tgcgtcctaa ttttgtgtat tattaccgtt gtgagacatc acggcgacgc
      301 agaccgatct gcacttttca gtgccaagat ggtatgtaag cagtcccgtg atcggagcta
      361 gcgcagcaac ggtttagtgt gagccaggtc caccgtcaac ccgcaattac tttctcgctg
      421 cgccaaatct ccaatttcta catcccaact aacctccgga aacgcaccta cactataccc
      481 cctatcatcc tctaccgcct atctttcatc agccaatatg cgtgaaattg ttagttaaat
      541 tccatcacgg caactcttga aatcgctaat gataatttag gtccatttac agaccggtca
      601 atgcgtacgt taatatttag tttgccttgt aactctacta atagagtttt agggaaacca
      661 aatcggagcc gcattctgtt cgtagaatct caagcttcta gctcaactcg ctcacaccca
      721 ctctacaggg caaacaattt ctggtgagca tggacttgat ggttcaggag tgtaagttcc
      781 ctcgcataat taggtatccg cattcatcaa cgaattataa tgactccatt agctacaatg
      841 gtacatcaga tttacaactt gagaggatga acgtatactt taacgaggtc tggtgaagct
      901 ctcaaaagag aagttatcgc taaccctctt aaggcttctg gtaacaaata tgttcctcgt
      961 gctgttctcg tcgacttgga gccaggtact atggatgctg tacgagctgg tccttttggc
     1021 cagctattca gaccagacaa cttcgtcttc ggacaatctg gagccggaaa taattgggca
     1081 aaaggtcact acactgaagg cgcggagctt gtcgaccaag tactagatgt agtgcgaaga
     1141 gaagcagagg gatgtgactg tcttcagggg tttcaaataa cacattctct tgggggtggt
     1201 acaggtgccg gtatgggtac gttattaatt tcaaaaatcc gggaagaatt ccctgatcga
     1261 atgatggcaa ccttctcagt tgtgccgtcc cctaaggtgt ctgacactgt tgttgagcca
     1321 tacaacgcaa ctctttcagt ccatcagttg gtcgaaaact ccgacgagac tttctgtatc
     1381 gacaatgagg cactttacga catctgtatg aggacgttaa agctttctaa cccctcatat
     1441 ggtgatctaa atcacttggt atccgctgta atgtcaggcg taactacttg ccttcgattc
     1501 cctggccagc taaactctga tttgcgcaaa ctggcagtta acatggttcc tttcccacgt
     1561 cttcactttt ttatggttgg atttgcaccg cttacaagcc gtggagcgca ctctttccgt
     1621 gccgtaactg ttcctgagtt aactcagcag atgtttgatc cgaaaaatat gatggcagca
     1681 tccgacttcc gaaatggtcg ttacttgaca tgctctgcca ttttgtaagt cagctcgtta
     1741 tatacgcata ttctatacta acatattaca gccgcggtaa ggtatctatg aaggaagtag
     1801 aagatcagat gcgaaatgtc caacaaaaga acgtatccta ctttgttgag tggattccaa
     1861 ataatgttca aactgcccta tgttctatcc cgcctcgtgg cctaaaaatg tcttcgacat
     1921 tcgtcggaaa ctcgacttcc atccaagagc tcttcaaacg tgtcggagat caattcacgg
     1981 ccatgttccg gagaaaggca tttctgcatt ggtacactgg cgaaggaatg gacgaaatgg
     2041 agtttactga ggccgagtct aatatgaacg atcttgtctc agagtatcaa caataccaag
     2101 aagcttcaat ctcggagggt gaggaagagt atccagagga agtaagcaac gaagaagaat
     2161 agatcaaaat tgtcctttta acacattgtt ctgtactgta tctgaagtag cggatacgat
     2221 gaactgtacc ccattaatcg taccgttcct cttgtttgga aaattaaaag tcaccaacca
     2281 aaatgcgata gaaagtgccg gtgtactggc ctagcacatc cttgtctggg tggatgtcga
     2341 acactttatt tgctccatta gctagtctct ttccgcgcga caggatgaaa actttctaat
     2401 catcaatcac atttcttttc tacgaaagtt tcaattctgt tactgtagag ttttttttgt
     2461 aaaaccatct ctaga
//

GenBank-Updates@genbank.bio.net (06/07/90)

LOCUS       DROIAB       2131 bp ds-DNA             INV       07-JUN-1990
DEFINITION  D.melanogaster infra-abdominal (iab-4) 1.7 and 2.0 kb RNAs,
            complete cds.
ACCESSION   M32109
KEYWORDS    .
SOURCE      D.melanogaster (strain Canton S) embryo DNA, and cDNA to mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2131)
  AUTHORS   Cumberledge,S., Zaratzian,A. and Sakonju,S.
  TITLE     Characterization of two RNAs transcribed from the cis-regulatory
            region of the abd-A domain within the Drosophila bithorax complex
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3259-3263 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.Sakonju, 14-FEB-1990.
FEATURES       from  to/span     description
    RNA         141     1802     1.7 kb RNA
    RNA         141     2101     2.0 kb RNA
BASE COUNT      592 a    468 c    478 g    593 t
ORIGIN      Chromosome III, bithorax complex.
        1 ttagttcgaa gcacgtgcgt gtgagttccg catgaacggt acaccaacac tttcgaacag
       61 aaacccctca cagatactcc tcacacagat actctactgt actgggctac gatatctgtg
      121 agatacgcac agataccggc aggccgccgg cgtcaaagtc gacgtgctgc cggcgtttga
      181 gagaccgagc ggcgtgcaag aaatcgtcgg gcaacggctt caaatgaacg caaaacgttg
      241 gaaagcaaac aaccggcaaa agccatcagc agctaaaaat aaaactgcaa ctatcaaagt
      301 gaatctgtag agaaagagtg atagctcagt gtgcccagtg tttatttaat tacttattta
      361 aatttgattt tcaaaataag ttgtgatttt ggttcgtttg tttgtttccc cattaaatcg
      421 catcgcggaa ctggtttgca ttttgtatct tttatcgctg cagttcgttt ctgtgtggtt
      481 tcatgtattc cccatctatt ccatctcttt ttgcagatac aagtgaatga aaccctaacg
      541 tttaaacgcc accataaaca ccaaataaca aattttctgg caaacatggc atgtcacggt
      601 cccatttaac gtggaaaatc ggcccagaaa ttcaacacga cgacaacttt ttcgactgtc
      661 ccagaaagca aatatgtaaa gtgtatttgt gtgcgtgcca ttgacattga cttatggtca
      721 tgaaggggtt agggggacgg gtctaagggt gggggttgat ggctaaaggg gggtctcgcc
      781 gctacgacaa aaacttggat tctctcgttt cttttttgtc tttcgtcttc aacgcacatt
      841 tatggtttgt tttgctcaca cacacactca aaaagcctac gcagcggcag agagggcaca
      901 agcagcagag aaagagacga aacatgtctc ggatgatgat aaacgcctct cgtctatttt
      961 ggcgggtcca tgaatgaaac aattgcctcg aatgactgac tgccagacag atagctagaa
     1021 actgccctta cggcgggcaa attggggtat tttgggtttc ggtgactttg agggaccggg
     1081 acctgagggc ccggaaagcc gacagcaata aacactgtgc tcagacaaaa gtctgatgag
     1141 acagaccatc atgatgctca tcatcatcgg cgataataat gattttcatt tgcacattcc
     1201 acccgtttcg ctgttgccgg cttccagcat tcagattcag tttgcattca gtggggcagg
     1261 ttatttatgg cacttttcct atcgccactc actggggtaa tcgattttta ttggattaat
     1321 attgctttcc gcatgatttg gccttctttt cgttatcact tagcaggttt cttgacgctg
     1381 aggagtggac aatgactaaa actcagtcat ctggaaaagc tggaccaagg acatccatat
     1441 acaagtgctg gctaaacgtc tgtggttatc gcaatcattt atggcggaat agaaggacac
     1501 gtcttatgtg tcgattttcg ggccccatcc ctgttgaaaa atcagccgga cagcagcaac
     1561 caccgaacac aagacacagg acgaccggaa gtgaggaaaa cgcttgtaaa atcggtgcac
     1621 acgagtgcga gcgagacacg agcgaacggg ccgaaaggaa acgacaatta cgaggacacg
     1681 ccggcttgct tttctgcttt tcctgtctgc tcctttcagt tgcggccgtg caatcgtcgg
     1741 gtcaacaacg ccgcgtgtcc gatttcaatt tacaaattag aatatacatc tttacatctg
     1801 gatttggttt catctttatc tgcatttgga tccgcgacga cgcataattt tcagcacttt
     1861 tgaataatgg cgattctgtg cgacctcgta aacgtatact gaatgtatcc tgagtgtatc
     1921 ctatccggta taccttcagt atacgtaaca cgagagagca ctagcaacgt gggatgtgaa
     1981 acacttttta agacacctta atacacgtag cactttgtta tatggtctta cactttttaa
     2041 atatctcgac ttataaatat ataatttaaa tataaatact ttaataatgc gatataagtt
     2101 aaatacttcg atttactggc tggaagttca t
//
LOCUS       ECOMUT1X     1545 bp ds-DNA             BCT       07-JUN-1990
DEFINITION  E.coli mutT1 mutant DNA with an insertion sequence IS1.
ACCESSION   M29970
KEYWORDS    insertion sequence IS1.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 300 to 315; 1084 to 1105)
  AUTHORS   Bhatnagar,S.K., Bullions,L.C., Lew,G. and Bessman,M.J.
  TITLE     Characterization of the defect in the Escherichia coli mutT1
            mutator gene
  JOURNAL   J. Bacteriol. 172, 2802-2803 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1545)
  AUTHORS   Bhatnagar,S.K., Bullions,L.C., Lew,G. and Bessman,M.J.
  JOURNAL   Unpublished (1989) Johns Hopkins, Baltimore, MD 21218
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.J.Bessman, 19-NOV-1989.
FEATURES       from  to/span     description
    trns        316     1083     IS1
    site        308      315     5' insertion target sequence
    site       1084     1091     3' insertion target sequence
BASE COUNT      379 a    366 c    423 g    377 t
ORIGIN      
        1 gatccttgcc cgtgcggttc tggtaaaaaa tacaagcagt gccatggccg cctgcaataa
       61 aagctaactg ttgaagtaaa aggcgcagga ttctgcgcct tttttatagg tttaagacaa
      121 tgaaaaagct gcaaattgcg gtaggtatta ttcgcaacga gaacaatgaa atctttataa
      181 cgcgtcgcgc agcagatgcg cacatggcga ataaactgga gtttcccggc ggtaaaattg
      241 aaatgggtga aacgccggaa caggcggtgg tgcgtgaact tcaggaagaa gtcgggatta
      301 ccccccaaca tttttggtga tgctgccaac ttactgattt agtgtatgat ggtgtttttg
      361 aggtgctcca gtggcttctg tttctatcag ctgtccctcc tgttcagcta ctgacggggt
      421 ggtgcgtaac ggcaaaagca ccgccggaca tcagcgctat ctctgctctc actgccgtaa
      481 aacatggcaa ctgcagttca cttacaccgc ttctcaaccc ggtacgcacc agaaaatcat
      541 tgatatggcc atgaatggcg ttggatgccg ggcaacagcc cgcattatgg gcgttggcct
      601 caacacgatt ttacgtcact taaaaaactc aggccgcagt cggtaacctc gcgcatacag
      661 ccgggcagtg acgtcatcgt ctgcgcggaa atggacgaac agtggggcta tgtcggggct
      721 aaatcgcgcc agcgctggct gttttacgcg tatgacagtc tccggaagac ggttgttgcg
      781 cacgtattcg gtgaacgcac tatggcgacg ctggggcgtc ttatgagcct gctgtcaccc
      841 tttgacgtgg tgatatggat gacggatggc tggccgctgt atgaatcccg cctgaaggga
      901 aagctgcacg taatcagcaa gcgatatacg cagcgaattg agcggcataa cctgaatctg
      961 aggcagcacc tggcacggct gggacggaag tcgctgtcgt tctcaaaatc ggtggagctg
     1021 catgacaaag tcatcgggca ttatctgaac ataaaacact atcaataagt tggagtcatt
     1081 accacatttt tcgctatttg aaaaactgga atatgaattc ccggacaggc atataacact
     1141 gtggttttgg ctggtcgaac gctgggaagg ggagccgtgg ggtaaagaag ggcaacccgg
     1201 tgagtggatg tcgctggtcg gtcttaatgc cgatgatttt ccgccagcca atgaaccggt
     1261 aattgcgaag cttaaacgtc tgtaggtcag ataaggcgtt ttcgccgcat ccgacattcg
     1321 cacacgatgc ctgatgcgac gctggcgcgt cttatcaggc ctaaagggat ttctaactca
     1381 ttgataaatt tgtttttgta ggtcggataa ggcgttcacg ccgcatccga catttgcaca
     1441 agatgcctga tgcgacgctg tccgcgtctt atcaggccta cgtgcggcat cagacaaatg
     1501 tcactgcttt ggttcttcgc tccagtcatc gctttcggaa agatc
//
LOCUS       YSAEF1B      2411 bp ds-DNA             PLN       07-JUN-1990
DEFINITION  C.albicans elongation factor 1-alpha (TEF-1) gene, complete cds.
ACCESSION   M29934
KEYWORDS    elongation factor.
SOURCE      C.albicans (strain SC5314) DNA.
  ORGANISM  Candida albicans
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina.
REFERENCE   1  (bases 1 to 2411)
  AUTHORS   Sundstrom,P., Smith,D. and Sypherd,P.S.
  TITLE     Sequence analysis and expression of the two genes for elongation
            factor 1-alpha from the dimorphic yeast Candida albicans
  JOURNAL   J. Bacteriol. 172, 2036-2045 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Sundstrom, 17-NOV-1989.
FEATURES       from  to/span     description
    pept        624     2000     elongation factor 1-alpha
    variant    1487     1487     g in EcoRI 2.3 kb clone; a in HindIII 4.9 kb
                                 clone
BASE COUNT      760 a    402 c    477 g    772 t
ORIGIN      
        1 acggtaaatt agtgggaggc aaatctccca aatcaattaa acggaaatta ctcgaagctg
       61 caaatctgtt tgctgatgga cagaatgatc atagtattaa tccagatgtt acaattgaag
      121 aggatctgat gtctcaaata aggtatcaag acaacgaatc aatgatcaac caagaaagat
      181 tattagagaa attggatgga tcagctgtgc ttgtggaaac actgatacga catgccagta
      241 aacgaagtag aaaaaactaa ctgtatattc gctaacaaaa aaattgtatt atttttagac
      301 atcgtaaatg gggctgtagc cctaattatt ttttatttct cgtgatctgc acgtgtactg
      361 ttgatttttt tgtgcgattg aaattatatt gtgctgttat cttaataaaa tcagtcatac
      421 cttttttttt ttgggttttt gttgtttaat ttgtgataaa catcccatga ggaacagcga
      481 gaaagttttt tgctgttcac tttctcgttc aacttttgca aagtaaagta aagaaaaaaa
      541 aattttccct cgtcatcttg atttttactt cttcttcttc ttcttctttt tcctttcttc
      601 acacattata gtcataatca atcatgggta aagaaaaaac tcacgttaac gttgttgtta
      661 ttggtcacgt cgattccggt aaatctacta ccaccggtca cttaatttac aagtgtggtg
      721 gtatcgataa aagaaccatt gaaaaattcg aaaaagaagc tgctgaattg ggtaaaggtt
      781 ctttcaaata cgcttgggtc ttggacaaat tgaaggctga aagagaaaga ggtatcacca
      841 ttgatattgc tttgtggaaa ttcgaaactc caaaatacca cgttaccgtc attgatgctc
      901 caggtcacag agatttcatc aagaatatga tcactggtac ttctcaagct gattgtgcta
      961 ttttgattat tgctggtggt actggtgaat tcgaagccgg tatttctaag gatggtcaaa
     1021 ccagagaaca cgctttgttg gcttacactt tgggtgtcaa acaattgatt gttgctgtca
     1081 acaagatgga ctctgtcaaa tgggacaaaa acagatttga agaaatcatc aaggaaacct
     1141 ccaacttcgt caagaaggtt ggttacaacc caaagactgt tccattcgtt ccaatctctg
     1201 gttggaatgg tgacaacatg attgaaccat ccaccaactg tccatggtac aagggttggg
     1261 aaaaggaaac caaatccggt aaagttactg gtaagacctt gttagaagct attgacgcta
     1321 ttgaaccacc aaccagacca accgacaaac cattgagatt gccattgcaa gatgtttaca
     1381 agattggtgg tattggtact gtgccagtcg gtagagttga aactggtatc atcaaagccg
     1441 gtatggttgt tactttcgcc ccagctggtg ttaccactga agtcaagtcc gttgaaatgc
     1501 atcacgaaca attggctgaa ggtgttccag gtgacaatgt tggtttcaac gttaagaacg
     1561 tttccgttaa agaaattaga agaggtaacg tttgtggtga ctccaagaac gatccaccaa
     1621 agggttgtga ctctttcaat gcccaagtca ttgttttgaa ccatccaggt caaatctctg
     1681 ctggttactc tccagtcttg gattgtcaca ctgcccacat tgcttgtaaa ttcgacactt
     1741 tggttgaaaa gattgacaga agaactggta agaaattgga agaaaatcca aaattcgtca
     1801 aatccggtga tgctgctatc gtcaagatgg tcccaaccaa accaatgtgt gttgaagctt
     1861 tcactgacta cccaccatta ggtagattcg ctgtcagaga tatgagacaa accgttgctg
     1921 ttggtgtcat caaatctgtt gaaaaatccg acaaagctgg taaagttacc aaggctgctc
     1981 aaaaagctgc taagaaataa gctagttgaa tattatgtaa gatctgttag agtttttatt
     2041 ttgtattcat ttatttagtt tattttctta ttatatttga aatataatat atattttaaa
     2101 aaaaaaattt acagtgtaga atttttggta gtccggtttg ttttgaaatc agtggtggta
     2161 ttcaatattt gattaaattt tggtatgaat ttgtgttgaa aaataaaaaa taagcgagaa
     2221 atttgcgtgg catattattt gtaatgttcg aatattctct cgtacaccaa aaggtttacg
     2281 attattcatt gacatataat aaaatctaat tactaaagag ttttgctgga aatgcatgta
     2341 tagtttatat catttctatt atctggacca tcatggaaga atagcaatta tttcttgtta
     2401 tcggtagtaa a
//
LOCUS       YSAEF2A      2369 bp ds-DNA             PLN       07-JUN-1990
DEFINITION  C.albicans elongation factor 1-alpha (TEF-2) gene, complete cds.
ACCESSION   M29935
KEYWORDS    elongation factor.
SOURCE      C.albicans (strain SC5314) DNA.
  ORGANISM  Candida albicans
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina.
REFERENCE   1  (bases 1 to 2369)
  AUTHORS   Sundstrom,P., Smith,D. and Sypherd,P.S.
  TITLE     Sequence analysis and expression of the two genes for elongation
            factor 1-alpha from the dimorphic yeast Candida albicans
  JOURNAL   J. Bacteriol. 172, 2036-2045 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Sundstrom, 17-NOV-1989.
FEATURES       from  to/span     description
    pept        656     2032     elongation factor 1-alpha
    variant    2188     2190     gta in one strand of clone 2.6 EcoRI; gtata in
                                 opposite strand.
BASE COUNT      750 a    428 c    430 g    761 t
ORIGIN      
        1 gtatcaagaa aaaattgtat agtgcttgct gttcgatatt gctagaggca aagaaaaagg
       61 attacctcac gcctaaaatg gaatttgaaa attgaggaga tgagttgagc ttttgtcttt
      121 tacactttcc ctattcaatt gtgccgcaaa aacttcaata tctaacacag tgtgaaatat
      181 ttgttatgta taccgaaagc caaataccat ttaaactggt atttcaagat ttcataaatc
      241 aagtttctgg ttaaattaat tgctgttaaa cagaacaaga gcttcacatt gatgttggaa
      301 taactactat agaattataa aatacacaaa accaggtcgt gtacgcaata acttttcaat
      361 gtatgaaaaa attttttttt tttttgcttc ttctcttcgc ttcttttttc tagttatcaa
      421 tttggcaatg aattgatttg attgcccttt gtttagttta ctatgaaaaa aaattttttt
      481 tttgtctctg tcagattttc tctcattttt ttttttgcct cataatgaac aactcaaaaa
      541 agaatataaa tttctcatca aagacctttc caattgaaaa ttttttcttt ccttcttttc
      601 cttttttttt ttctttcttt cttcttttct aagtatacat tttatataat caatcatggg
      661 taaagaaaaa actcacgtta acgttgttgt tattggtcac gtcgattccg gtaaatccac
      721 taccaccggt cacttaattt acaagtgtgg tggtattgat aaaagaacca ttgaaaagtt
      781 cgaaaaagaa gctgctgaat tgggtaaagg ttctttcaaa tacgcttggg tcttggacaa
      841 attgaaggct gaaagagaaa gaggtatcac cattgatatc gctttgtgga aattcgaaac
      901 tccaaaatac cacgttaccg tcattgatgc tccaggtcac agagatttca tcaagaatat
      961 gatcactggt acttctcaag ctgattgtgc tattttgatt attgctggtg gtactggtga
     1021 attcgaagcc ggtatttcta aggatggtca aaccagagaa cacgctttgt tggcttacac
     1081 tttgggtgtc aaacaattga ttgttgctgt caacaagatg gactctgtca aatgggacaa
     1141 aaacagattt gaagaaatca tcaaggaaac ctccaacttc gtcaagaagg ttggttacaa
     1201 cccaaagact gttccattcg ttccaatctc tggttggaat ggtgacaaca tgattgaacc
     1261 atccaccaac tgtccatggt acaagggttg ggaaaaggaa accaaatccg gtaaagttac
     1321 tggtaagacc ttgttagaag ctattgacgc tattgaacca ccaaccagac caaccgacaa
     1381 accattgaga ttgccattgc aagatgttta caagatcggt ggtattggta ctgtgccagt
     1441 cggtagagtt gaaactggta tcatcaaagc cggtatggtt gttactttcg ccccagctgg
     1501 tgttaccact gaagtcaaat ccgttgaaat gcatcacgaa caattggctg aaggtgttcc
     1561 aggtgacaat gttggtttca acgttaagaa cgtttccgtt aaagaaatta gaagaggtaa
     1621 cgtttgtggt gactccaaga acgatccacc aaagggttgt gactctttca atgcccaagt
     1681 cattgttttg aaccatccag gtcaaatctc tgctggttac tctccagtct tggattgtca
     1741 cactgcccac attgcttgta aattcgacac tttggttgaa aagattgaca gaagaactgg
     1801 taagaaattg gaagaaaatc caaaattcgt caaatccggt gatgctgcta tcgtcaagat
     1861 ggtcccaacc aaaccaatgt gtgttgaagc tttcactgac tacccaccat taggtagatt
     1921 cgctgtcaga gatatgagac aaaccgttgc tgttggtgtc atcaaatctg ttgaaaaatc
     1981 cgacaaagct ggtaaagtta ccaaggctgc tcaaaaagct gctaagaaat aaactaatta
     2041 aatacctttg tttaaatagt tgtgattata ttattttttt aaaatatata taaataagag
     2101 aataatcata caacatgtag taaggagcga agagtcccag ctctgtaaat aaaaaaaaga
     2161 agaggagtta tgtgcttctt tgtttatgta tatatatata tatatatata tatattctta
     2221 ccagttagtt aatcttacac atatgttgat ggtgctatat tgtacataca aactctaaat
     2281 catatcagaa ctgcttggaa acctttccca ctaacaacac ccatccaatc cccttggttc
     2341 tgtctccaac acacgttatt gatttccaa
//

GenBank-Updates@genbank.bio.net (06/08/90)

LOCUS       BSUHEMAC     3795 bp ds-DNA             BCT       08-JUN-1990
DEFINITION  B.subtilis 5-aminolevulinate synthase (hemA) and porphobilinogen
            deaminase (hemC) genes, complete cds.
ACCESSION   M32130
KEYWORDS    5-aminolevulinate synthase; porphobilinogen deaminase.
SOURCE      B.subtilis (strain 168) DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 3795)
  AUTHORS   Petricek,M., Rutberg,L., Schroeder,I. and Hederstedt,L.
  TITLE     Cloning and characterization of the hemA region of the Bacillus
            subtilis chromosome
  JOURNAL   J. Bacteriol. 172, 2250-2258 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            I.C.Schroeder, 16-FEB-1990.
FEATURES       from  to/span     description
    pept        232     1599     5-aminolevulinate synthase (hemA, EC 2.3.1.37)
    pept       1607     2437     unidentified protein (ORF2)
    pept       2470     3414     porphobilinogen deaminase (hemC, EC 4.3.1.8)
    pept       3404  >  3795     unidentified protein (ORF4)
BASE COUNT     1068 a    689 c    953 g   1085 t
ORIGIN      Map position 245 degrees.
        1 atgcatatca ccttcttgtt ttttagagct gatgtgtagt aaatttctgc tgtttttggt
       61 attgtcaata ggaatgcttc ttttccctga agctttttct aatatagcat aagaatttta
      121 aaatctgttc acattttgtg aaagaaacta tgttataatt attataaata atgagttcta
      181 tgttagaatg attataaatt aagattgggt gttgggggtg taattagagc gatgcatata
      241 cttgttgtgg gagtagatta taaatccgcc cctattgaga tacgtgaaaa agtaagtttt
      301 cagccgaatg agctggcaga agcaatggtg cagctgaaag aagagaaaag cattcttgaa
      361 aacatcattg tctcaacctg caaccgcact gaaatttatg cggtagtcga ccagcttcat
      421 accggccgtt attatataaa aaagttttta gctgattggt ttcaattaag caaagaagag
      481 ctgtcaccgt tcttaacgtt ttatgagagc gatgccgctg ttgagcattt attccgtgta
      541 gcctgcggac ttgattctat ggtgattggc gaaacgcaga ttctcggaca ggtacgcgac
      601 agctttaaaa cagctcagca agaaaaaacg atcgggacta tttttaatga gctgtttaag
      661 caggcagtta cagtgggcaa acggactcac gccgaaacag acattggctc aaatgcggtg
      721 tcagtaagct atgctgcagt tgaacttgcc aaaaaaatct tcggaaatct ttcaagcaag
      781 cacatattga ttctcggtgc gggaaaaatg ggcgagcttg ctgcggaaaa cctgcacgga
      841 cagggaatcg gcaaggtcac tgtcattaac cgaacatact tgaaagcgaa ggagcttgca
      901 gaccgttttt caggtgaagc gagaagcttg aatcagcttg aaagcgcgct tgcggaggct
      961 gatattttaa tcagttcaac cggtgcaagt gaatttgtcg tgtccaaaga gatgatggaa
     1021 aacgcgaata agcttcgcaa gggacgtccg ctgtttatgg tcgacattgc cgtgcctaga
     1081 gatcttgatc cggcgctgaa tgatcttgaa ggtgtttttc tttatgatat cgacgatctg
     1141 gaaggcattg tagaagcgaa catgaaagag cggagagaaa cagctgaaaa agttgaactg
     1201 ttaattgaag aaaccattgt ggaatttaaa caatggatga atacacttgg tgttgtgcct
     1261 gttatttctg cattgcgcga aaaggcgctt gccatccagt cagaaacgat ggacagcatt
     1321 gagcgtaagc tgcctcactt aagcacaaga gagaaaaaac tgttgaacaa acacaccaaa
     1381 agtattatta accaaatgct tcgtgatccg attttaaagg tgaaagagct tgcggcagat
     1441 gctgattctg aagaaaagct cgcgttgttt atgcagattt ttgatattga agaagctgcg
     1501 ggccgtcaaa tgatgaaaac cgttgaaagc agccagaagg tccactcttt taagaaggct
     1561 gaatcaaaag cgggctttag cccacttgta agtgagtgaa agctgaatga ttgatactgc
     1621 aatggcaaga cttaatgagg ggacaatcgt catttacgcg ttaagtgtac tcttttattt
     1681 tatagatttt cttcaacaca accggaaggc tggaaaaatg gccttctggt tgctttctat
     1741 tgtctggact ctgcaaaccg tgtatttggc ctattttatg tgggtgacgg ggcggtttcc
     1801 ggtattaaat gtgacagagg cactttattt ttatgcctgg gtgcttgtca cgctgtcact
     1861 tgtactgaca aagcttttac gtgttgactt tatcgtgttt tttacaaatg ttataggatt
     1921 ttctatgatc gccattcaca cattttcacc gacagagcag cagtcagctg ctttttccgg
     1981 gcagcttgta tccgagcttt tggtgattca tattacaatg gcgattcttt catacggcgc
     2041 tttttccctt tctttcgttt tttctgtgct atatatgttt caatatcatg tgctgaaaaa
     2101 gaaaaagtgg ggaaaatggc tgttgagaat agaagattta tctaagcttg attatatggc
     2161 gtatgtttta aatgtcattg gggttccgat gctgctgctg agtttgattc tcggcgtcat
     2221 ttgggcgtat gtctcactag aaacgctgta ttggtttgac gccaaagtgc ttggttcgtt
     2281 tgtcgtcctg ctgctgtaca gctattatct ttatatcagg ctgattaagg agctgcaagg
     2341 aaaggtcgct gcactgtgga atacggcttg ttttctggtg ctgatgatca attatttcct
     2401 gcttggaagc ctgtcgcaat tccattggtt cagttaaacg atgtcccaag cagattcggg
     2461 aggaaagaaa tgatgagaac gattaaagta ggttccagac ggagcaaact cgctatgact
     2521 caaacaaaat gggttattca aaaactgaag gaaatcaatc cttcgtttgc ttttgaaatt
     2581 aaagagatcg tgacaaaggg cgaccggatt gtcgatgtta cactctcaaa agtgggtgga
     2641 aaagggcttt ttgtcaaaga aattgaacag gcgcttttaa acgaagagat tgatatggca
     2701 gtgcacagca tgaaggacat gcctgctgtt ttgcctgaag gccttgtgat cggctgtatt
     2761 cctgaacggg aggacccgcg tgatgccctt atttcaaaga atcgcgtaaa gctttcagaa
     2821 atgaagaaag gtgctgtcat tggcacaagc agtttaagaa gaagcgcgca gcttttgatt
     2881 gagcgccctg accttacaat taaatggatt agaggtaata ttgatacaag acttcaaaag
     2941 ctggaaacag aggattatga cgcaattatt ttagcggctg ccggcctttc cagaatgggt
     3001 tggaagcaag atgtcgtaac cgaattcctt gagcctgagc gctgtttgcc tgctgtgggg
     3061 cagggagccc tggcgattga gtgccgagaa tcggatgaag agctgttggc gttgttttct
     3121 cagtttacag atgaatatac aaaacggact gtcttagcgg aacgtgcttt tttaaacgcg
     3181 atggagggcg gctgccaggt tccgatcgcg ggctactccg tgttaaatgg acaggatgaa
     3241 attgaaatga caggtcttgt cgcttcacct gacggcaaaa tcatttttaa agaaaccgtc
     3301 accggaaacg atccggagga agtaggaaag cgctgtgccg ctcttatggc tgacaaagga
     3361 gcaaaagatt taattgatcg tgtaaaacgg gagcttgacg aggatggaaa atgattttcc
     3421 gttgaaagga aaaacagtgc ttgtcacccg gaataaggca caggcagcat catttcagca
     3481 aaaagtggag gcgcttggcg gtaaagcggt tttaacctct ttgattacgt ttcgccgcgc
     3541 tttgccgaat gatgttgcgg aacaggtaag agaggatctt gccgcgccag gctggcttgt
     3601 ttttacaagt gtgaacgggg cagacttctt tttttcttat ctgaaggaaa atcagcttat
     3661 tctccctgcg cataaaaaaa ttgcagccgt cggtgaaaaa accgcgcgcc gtttaaaaat
     3721 gcataacgta tcggttgatg tgatgccaca ggagtatatt gctgaacaat tgcgtgacgc
     3781 tcttaagcag catgc
//
LOCUS       SHFIPAH      2900 bp ds-DNA             BCT       08-JUN-1990
DEFINITION  S.flexner invasion plasmid antigen H (ipaH) gene, complete cds.
ACCESSION   M32063
KEYWORDS    invasion plasmid antigen.
SOURCE      S.flexner (strain M90T-W; serotype 5) DNA, clone pWR390.
  ORGANISM  Shigella flexneri
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2900)
  AUTHORS   Hartman,A.B., Venkatesan,M., Oaks,E.V. and Buysse,J.M.
  TITLE     Sequence and molecular characterization of a multicopy invasion
            plasmid antigen gene, ipaH, of Shigella flexneri
  JOURNAL   J. Bacteriol. 172, 1905-1915 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A.B.Hartman, 12-FEB-1990.
FEATURES       from  to/span     description
    pept        251     1849     invasion plasmid antigen H
    pept       2277  >  2900     ORF3
    signal      139      144     -35 region
    signal      162      167     -10 region
    signal     2111     2116     -35 region
    signal     2132     2137     -10 region
BASE COUNT      868 a    657 c    578 g    797 t
ORIGIN      
        1 catagaaaac ctccataaat aaattacaac taacttctgt tatgtgtaaa atggaaacta
       61 ttaaaactta atatcggaaa tggtaagtga aatttgtata aatatacaat tttaaatatt
      121 tattctcaca aatataaggt tgacctagca ttatgttctc tgtaaataat acacactcat
      181 cagtttcttg ctccccctct attaactcaa actcaaccag taatgaacat tatctgagaa
      241 tcctgactga atgggaaaag aactcttctc ccgggaagag cgaggcattg cttttaacag
      301 actctcccag tgctttcaga atcaagaagc agtattaaat ttatcagacc taaatttgac
      361 gtctcttccc gaattaccaa agcatatttc tgctttgatt gtagaaaata ataaattaac
      421 atcattgcca aagctgcctg catttcttaa agaacttaat gctgataata acaggctttc
      481 tgtgatacca gaacttcctg agtcattaac aactttaagt gttcgttcta atcaactgga
      541 aaaccttcct gttttgccaa accatttaac atcattattt gttgaaaata acaggctata
      601 taacttaccg gctcttcccg aaaaattgaa atttttacat gtttattata acaggctgac
      661 aacattaccc gacttaccgg ataaactgga aattctctgt gctcagcgca ataatctggt
      721 tacttttcct caattttctg atagaaacaa tatcagacaa aaggaatatt attttcattt
      781 taatcagata accactcttc cggagagttt ttcacaatta gattcaagtt acaggattaa
      841 tatttcaggg aatccattgt cgactcgcgt tctgcaatcc ctgcaaagat taacctcttc
      901 gccggactac cacggcccgc agatttactt ctccatgagt gacggacaac agaatacact
      961 ccatcgcccc ctggctgatg ccgtgacagc atggttcccg gaaaacaaac aatctgatgt
     1021 atcacagata tggcatgctt ttgaacatga agagcatgcc aacacctttt ccgcgttcct
     1081 tgaccgcctt tccgataccg tctctgcacg caatacctcc ggattccgtg aacaggtcgc
     1141 tgcatggctg gaaaaactca gtgcctctgc ggagcttcga cagcagtctt tcgctgttgc
     1201 tgctgatgcc actgagagct gtgaggaccg tgtcgcgctc acatggaaca atctccggaa
     1261 aaccctcctg gtccatcagg catcagaagg ccttttcgat aatgataccg gcgctctgct
     1321 ctccctgggc agggaaatgt tccgcctcga aattctggag gacattgccc gggataaagt
     1381 cagaactctc cattttgtgg atgagataga agtctacctg gccttccaga ccatgctcgc
     1441 agagaaactt cagctctcca ctgccgtgaa ggaaatgcgt ttctatggcg tgtcgggagt
     1501 gacagcaaat gacctccgca ctgccgaagc tatggtcaga agccgtgaag agaatgaatt
     1561 tacggactgg ttctccctct ggggaccatg gcatgctgta ctgaagcgta cggaagctga
     1621 ccgctgggcg caggcagaag agcagaagta tgagatgctg gagaatgagt actctcagag
     1681 ggtggctgac cggctgaaag catcaggtct gagcggtgat gcggatgcgc agagggaagc
     1741 cggtgcacag gtgatgcgtg agactgaaca gcagatttac cgtcagctga ctgacgaggt
     1801 actggccctg cgattgtctg aaaacggctc acgactgcac cattcataat cacgtcgcat
     1861 aagcataaac cgcagaccgg attgactccg gaaaaactgt gacccgatta cggaccttaa
     1921 caacaacccg taaatcctcg ctcaataccg gcagggattt acggcgtgca actgactttt
     1981 ttgaggggat aaccaaccag atcgtttgct atgggaatat cgagacagta atgagttaaa
     2041 tgataaaaat tgtttgaaaa tataggggat aaagatcaat ccaaactgga tgaaagtaga
     2101 actggtcaca ttaacatggg tagactgata taacaatcga cggttactgg aaagacagga
     2161 acatattcct ccagccggaa tgaaaacgcc gataaagctc taggattgtt tttttaaaga
     2221 ctttctcgtt ttatttgcat taatagacca agatatgaat agtgaggggt taataaatga
     2281 aaccgatcaa caatcattct ttttttcgtt ccctttgtgg cttatcatgt atatctcgtt
     2341 tatcggtaga agaacagtgt accagagatt accaccgcat ctgggatgac tgggctaggg
     2401 aaggaacaac aacagaaaat cgcatccagg cggttcgatt attgaaaata tgtctggata
     2461 cccgggagcc tgttctcaat ttaagcttac tgaaactacg ttctttacca ccactccctt
     2521 tgcatatacg tgaacttaat atttccaaca atgagttaat ctccctacct gaaaattctc
     2581 cgcttttgac agaacttcat gtaaatggta acaacttgaa tatactcccg acacttccat
     2641 ctcaactgat taagcttaat atttcattca atcgaaattt gtcatgtctg ccatcattac
     2701 caccatattt acaatcactc tcggcacgtt ttaatagtct ggagacgtta ccagagcttc
     2761 catcaacgct aacaatatta cgtattgaag gtaatcgcct tactgtcttg cctgaattgc
     2821 ctcatagact acaagaactc tttgtttccg gcaacagact acaggaacta ccagaatttc
     2881 ctcagagctt aaaatatttg
//
LOCUS       BOVMTAK5     3852 bp ds-DNA             MAM       08-JUN-1990
DEFINITION  Bovine mitochondrial adenylate kinase gene, exon 6 and 7.
ACCESSION   D90069
KEYWORDS    ATP:AMP phosphotransferase; adenylate kinase.
SEGMENT     5 of 5
SOURCE      Bovine mitochondria DNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 3852)
  AUTHORS   Tanaka,H., Yamada,M., Kishi,F. and Nakazawa,A.
  TITLE     Isolation and characterization of bovine mitochondrial adenylate
            kinase gene
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hiroshi
            Tanaka Department of Biochemistry Yamaguchi University School of
            medicine 1144 Kogushi Ube, Yamaguchi 755 Japan Phone:  0836(22)2215
            Fax:    0836(22)2315
FEATURES       from  to/span     description
    pept    /   126      347     adenylate kinase 2A exon 6(a+b) (EC 2.7.4.3)
    pept    /   126  +   321     adenylate kinase 2B exon 6a
    pept    +  2572     2576     adenylate kinase 2B exon 7
    pre-msg <     1      491     adenylate kinase 2A mRNA and introns
    pre-msg <     1     3448     adenylate kinase 2B mRNA and introns
    IVS         322     2571     adenylate kinase 2B intron
BASE COUNT      942 a    915 c    933 g   1062 t
ORIGIN      
        1 tctagattgt ctgtctctcc ctgtgatgga ctttgggatc agacagcatc tcagatctgc
       61 atctggcctt ggattaggag accttgctga gtcccagcct ttcattgccc tcttcttctt
      121 ctcagatcac tggggaaccg ttgatccgcc gatcagatga taataagaag gccttgaaaa
      181 tccgcctgga agcctaccac actcaaacca ccccgctggt ggagtactac agtaaacggg
      241 ggatccactc cgccatcgat gcgtcccaga cccctgacgt cgtgtttgca agcatcctag
      301 cagcattctc caaagccaca tgtaaagact tggttatgtt tatttaatgt tgggtccaaa
      361 aaggaacttc tttcttcttc ctttccctgt tgaacgagtg ggtgggaatg gcagagtagg
      421 cagagggaag cttcctcagg ccagcaagaa tatcatttga tgtattgatt aaaaaagcac
      481 ttgcttcatg tacctttggt gtgtgcgcaa gtctcatctc atctgtgtgt gtgtgtggtg
      541 tgtatgtgcg tgcacgtgca cacacgtgtg taagtataag tatgtgtaca ctcttatact
      601 tcttaaattg taggcgagac tgtttacttc tttagccgta ctctttattt tcgacttccc
      661 tggtggcgca gacggtaaag catctgtcta caatgcagga gacccgggtt cgatccctgg
      721 gttgggaaga tcccctggag aaggaaatgg caatccactc taggactgtt gcctggaaaa
      781 tcccatggac agaggagcct ggtaggctac agtccatggg gttgcaaaga gtcagacaca
      841 actgagcgac ttcacttcac ttcactttat tttcaaccaa aataaactaa ctcaagtatc
      901 cgccaggcag acaagttcct ggtatttttc catctccagc agctaattag aagtcctggg
      961 tcacatgcag tcaggcaggg ctacagttcc tgatggcagc ctgttggcca gaaattccac
     1021 ttgttttctc acccataatg aaaagtcttt gagtcattgt ggaaagggat cattaatttt
     1081 ttccccctaa acaggaatga aaaggcactt actttataga ttccagaaat tactgggaga
     1141 gggtatcacc atagaaagag cctggccaag ttgaattatt tttgtgatct gccccatggt
     1201 tctataaagt aggaatcatt cggcacctgt gttgaatact aaattgatgc caaagaatta
     1261 aaccaggtga aaggtcctct tgaattcaga ctgtcttctg aacatccagg ctaggcatct
     1321 gagagcagcc agtccacttc cccaaagaca gcaggataga tttgtcttta ttttcaatgt
     1381 ttgctctgtt tgcaagtgtg aacaaaacgg tgatttgagg atcatttggg gatcatgttg
     1441 attagcattt attcaagccc agtctctaca ggatatgaag tcacatatat cccttttgaa
     1501 cgcagccctc tcccttccat atctagtcct tcacccctta tgtcatgacg gtggtggcca
     1561 cagtctcccc aagaaaggac cacccccaaa ataagatcac ctatggccgt aaccagcttt
     1621 tcttcctaac ctcggctccc ggctcttagg catctcttga gatccacact gtgccttttg
     1681 gtctctggtt tgatctgtgg caaacggagg aattaataga ctgctatgag ggaccacaaa
     1741 aacagcaggc tctggaaaaa ccattaaaaa gtcagtgaca ggtccagtaa ataatgccac
     1801 ctgcccaggg tactctgctg actcagtgca gtggcctcgg aagtgcccag cccccacctg
     1861 acctgagctc tcctgctgtc tgtgagacca ccagagggct tgtcctctgt ccgcctggaa
     1921 actggctgcg gtacgtggcc acctgtgtcc ctctctgtct gcttcaagtg agtgtctgct
     1981 ggctctgctc tgccttgttt ccctggttgg agactaactc cacccattct taacggaaac
     2041 ccagcctggc atgggatgtt tctgggaagc acatgatttc tgagaatgga caaggaaagg
     2101 gagtgaaacc aaaactgtct gctgtgtctg cagggatctg ggctgcttct ctgggtgagg
     2161 tggcctcatg aatcttagag ttgactcccc tttgacgggg gacagttgca ggcacttggc
     2221 ctcaaagttg aggtccttga atgattgctg atagcgtgtg catgatctgt gctcaaatat
     2281 ttgtctgtgt gcattcacac gtgcacacat atacatgtga tatttgggct accaatggga
     2341 aagggaaggt cgtcgggggt gcatatgtcc tcttgggggt gggtagaatt caatttttgc
     2401 tttgcttaat tccagagcct taacctcagc aggcatgaga gctttcacct ggcagagaaa
     2461 tgtccagtgt ttaatgcatt tggttactgg ggtgtgcatg ccaaccttct ctgatggtta
     2521 cttggttttt aaaaatgcat tttgttttct cttgtctcct ctttatttca gcctagtaac
     2581 agaaggccag gcgggactgc accactactc atcaccccac ggcatgatcc ctgctcttag
     2641 gtgctgggca gagggagggg cggtcagggt aaggatgggg acagaggagc ggtgagggcc
     2701 caagcggggg agaggagctg tgttgccatg aagcagtttc atttacgtgg agtagaactc
     2761 ttaaaagcat aagcaaaggg gaaaattaat tttttaaaac acttgattgg agggtataaa
     2821 tagaaacaag gagatgcaat attatttcta aggaatcatg ctttcattta ctttggactg
     2881 gtgacaactg gacaggtctc agtgccctga gacctcagct ttgatcagaa cctcatgcca
     2941 gcccaggaat gcgggaaatc acactgttgt cctgtctgac ccatagcttg gaacagggca
     3001 ggagttgatt actaatcctg gttacccata ccatgaaatc agaacgtcat ctccccacgt
     3061 gcaatagaca tggaatgtgt tcctgggggc ggcgtctcaa caaatccggc ttactggagt
     3121 catggggcag gctggtgtcc ctcccttcca tactcaccac tgatttacca gcccacctgc
     3181 cctcacgggt gagccctcgg cagccaccca gcatacgcca cagtcctgtg ctcttgcctt
     3241 cctccatcca catcatgtga aaggactctt tttaatcaat gagcaagtgt cctaagcaac
     3301 attatccaaa gactgtcctt tccatcctca aatcctatga ctgggatcag tctacaacac
     3361 tgtgatgtat tattttcaat gaggtgcctt tcttaactgt ccaaatgctg ccttgtttgg
     3421 cccttaaata aagtgtgtta aaagtttgta tcccctgttg tggcattttt taaggagctg
     3481 tgggctagta aaatgacgct gaattctgga tctgacttag aagccctttt ctgggtcatt
     3541 agaagtcata agaaggcaga aaggtccaag agtgtgttca taagaagagt ctgctccctc
     3601 ttggccttca gctcatctgc aggattttct tgagcccaag gacaaggcag cttgggccaa
     3661 tggagacagt gctgtgcttt gggttcaggg atgtacaaca aatcccagcc aatcatttgc
     3721 tcctgcttcc ttgtgtgaga aagagactag tggtggtcca gtggttaaga ctcttacctg
     3781 ccactgcaga ggacactggt tcgatcccgg gtctgggaag atcccacatg ccgtgggcac
     3841 tggacccatg tc
//
LOCUS       BBRHWP       4330 bp ds-DNA             BCT       08-JUN-1990
DEFINITION  B. brevis cell wall protein (HWP) gene.
ACCESSION   D90050
KEYWORDS    HWP; cell wall protein.
SOURCE      Bacillus brevis (strains HPD31, HPD52 and HPO33) genomic DNA.
  ORGANISM  Bacillus brevis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 4330)
  AUTHORS   Ebisu,S., Tsuboi,A., Takagi,H., Naruse,Y., Yamagata,H.,
            Tsukagoshi,N. and Udaka,S.
  TITLE     Conserved structures of cell wall protein genes among
            protein-producing Bacillus brevis strains
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Norihiro
            Tsukagoshi Department of Food Science and Technology Faculty of
            Agriculture, Nagaya University Furo-cho, Chikusa-ku Nagoya 464
            Japan Phone:  052-781-5111 x6869 Fax:    052-781-4447
FEATURES       from  to/span     description
    pept        548     3898     HWP precursor (ttg start codon)
    sigp        548      706     HWP signal peptide
    sigp        638      706     HWP signal peptide
    matp        707     3895     HWP mature protein
    site        132      132     transcription initiation site
    site        255      255     transcription initiation site
    site        325      325     transcription initiation site
    site        373      373     transcription initiation site
    site        386      386     transcription initiation site
BASE COUNT     1401 a    804 c   1000 g   1125 t
ORIGIN      
        1 atcgattacc aaaaaaccgc gaaagattcc gcggctgaga gctatcgttt gaccaatttg
       61 cgctttgaaa cggtctggca acgaccctcg aagtgattca gccgaggagg aattatccaa
      121 ccgtgagaat cagtaccaaa aagcgatcct gcagtacaat ctggctgtgg taaactttga
      181 aactgccctc ggaagctaag aagaatttgt aaggttttga atggaactgg aaaaaggttc
      241 aatggtgaca gtccgtccgg cgtaccctat aatacgagtt gtggcggatg tcactgctta
      301 catattacaa gtgaatacga cccatgaaaa tttctttaaa ctttttttct gaggcgccgc
      361 aacttttgct cgactgaggc gtttaatagg gtgtcacacg aaaaacgggg aattgtgtaa
      421 aaaagattca cggattctag catttgtgtt acactagtga ttgttacatt ttacacaata
      481 gctgaatata ctagagattt ttaacacaaa aaagcgaagc tgtcctgcga aaggaggtga
      541 cacgcgcttg caggattcgg gctttaaaaa gaaagataga tcaacaaata ttccccaaga
      601 acaatttgtt tatactagag gaggagaaca caaggttatg aaaaaggtcg ttaacagtgt
      661 attggctagt gcgctcgcca tcacagttgc tccaatggct ttcgcagcag aggacacaac
      721 tacagctcct aaaatggatg ctgcaatgga gaaaaccgta aaacgtctgg aagctcttgg
      781 cctggtagca ggttatggca acggcgattt cggtgcagac aaaaccatca ctcgtgcaga
      841 gttcgctact ctaatcgttc gcgctcgcgg tctggagcaa ggtgcgaaac tggcacaatt
      901 caacactact tacacagatg ttagatccac tgactggttc gctggttttg taaacgtagc
      961 ttccggtgaa gaaatcgtaa aaggtttccc ggacaaatct tttaaaccac aaaaccaagt
     1021 tacttatgct gaagcagtaa ccatgatcgt tcgtgctttg ggttatgagc catccgttcg
     1081 aggtgtatgg ccgaacagca tgatctccaa aggttccgaa ctgaacattg caaaaggtat
     1141 caacaaccct aacatgcagc agttcgcggc gacaatcttc aaaatgctgg acaacgctct
     1201 tcgcgttaag ctgatggagc aaatcgaata cggtactgac atccgtttaa acgtaactga
     1261 cgaaactctc ttgactaaat atttgaaagt taccgtacgt gatatggact gggctcacga
     1321 aaagggtaac aattctgatg aattgccact tgtaacaaac gtacctgcta ttggtctggg
     1381 tagtttgaaa gcaaatgaag ttactttgaa tggaaaagat gctgatctgg gtagcaacac
     1441 tacttataaa gtagctgaag gcatcaatcc taacgcattt gatggtcaaa aagtacaagt
     1501 gtggatcaaa gatgaccgag aaaatgtcat cgtttggatg gaaggttccg aagacgaaga
     1561 tgtcgttatg gaccgtgtga gtgctctgta cctgaaaggt aaagccttca cagatgatat
     1621 tgtaaaagat cttagcaagt ctgatttgga tgatgtaaaa atcgaaatgg atggtagcga
     1681 gaaatcgtat cgcttgacgg aagatacgaa aatcacttat aacttcactc gtttcaacga
     1741 tccagtagat gctctgagca aaatttacaa agacaacgac acgtttggtg ttaaagttgt
     1801 tttgaatgat aacaatgaag ttgcatacct ccacatcatt gacgatcaaa caattgataa
     1861 aagcgtaaaa ggcgttaaat acggttccaa agttattagc aaaatcgatg ctgataagaa
     1921 gaaaatcacc aacttggata actccaaatt tagcgacttg gaagatcagg atgaaggaaa
     1981 agacttcttg gtattccttg atggccaacc agctaaactg ggtgatctga aagaatccga
     2041 tgtttactcg gtatactatg ctgatggtga taaagataag tacttggtgt ttgctaaccg
     2101 taacgttgca gaaggaaaag tagaaaaagt ggtaagccgt aacaagacag acattcgtct
     2161 gactgttggt ggtaaaactt acaaagtata tccagatgct tcttactccg aaaacgcaaa
     2221 taaagatgtt aagaaggtta attccgactt ggatctgatt agcaaccttg atggagaaga
     2281 agtaaaactt ctcttggatc catccggtcg agttcgtcac atcgaaacaa aagacgcgat
     2341 cgatgatcgt aaaccattgg caatcatcac taagggtgct acctataact ccagcaaaga
     2401 tacttatgac tttactgtta tgacccaaaa aggtaaaaca caaatcgtat ctttggatca
     2461 aaaggacatc tacgatagat atggggttaa ctacgataaa tccaatgata agcgtcaagc
     2521 tttcgaaaag gatttagttg agcttctgca acctaaggta gttaaagagg actcagctac
     2581 tgatgccaat caaaccgttc ttttggaagt taattttgat tccaaaggcg aagttgataa
     2641 agtcaaagtt cttgatagca agctgaaata ctccgagaaa agcacttggg ataaacttgc
     2701 ggatgaagat gacgacgtcg ttggtgatta tgaagtaacc gataagactg ctgtcttcaa
     2761 aatgactggt gaccttactc cagctactgg aactaaacgt ggtgaactga aaaacgctgg
     2821 tactgctaaa tttaaggatg ttgctaagaa gagcgatctg aaggtttggt actcagtgga
     2881 ggaagacaaa ggtgaagttc aagcgatctt cgtagtagat ggcagtggtt tgggtggcga
     2941 ccatcagttt ggtatggtca aacaatatgg tactgctagt aaacaggaca ctatcactat
     3001 agttacaaaa gatggtgatt ctgtcactga aaaagagtac aaattggatg gcgacgctga
     3061 tgatttgaaa gttgatcaag acatcagacg tggagatgtc atttctttca cactgaattc
     3121 tgatggagaa gttattgttg acgatgtagt cgaggttgta aataacaacc acattgataa
     3181 cactgcttct aaatcagcta cgctcatgcc tgaagacgaa cgtcaaaaag caggaatcga
     3241 caaattggtt gttgctcgcg ttgacgaagt tgatggtaac actatttcct tgaactatgc
     3301 tgacggaaag acacaaaaat attacacaaa agcatccact gcgttcattg atgtgtatga
     3361 cggtcttgaa ggaattgatg gagtagatga aggcgactac atcgtaatga tcgatagcgc
     3421 cgatattgac ggaactcgct ttgactatgt actggtagtt tctagcgatg atgagatccg
     3481 cacgcaacac atctccacta aagcagttac ggacttcctg aacaagccaa ccagactatg
     3541 taccaaatcc tggcgatggg gaagaagtag tcacggcacc aaagttaata cagttaacga
     3601 tgaagcagtt gtagatggta ttgtaactct tccagctgat gcatctgtta gaaacttcaa
     3661 cattgcattt gatcaagaaa ttaacagcaa agatgcaacg gtaactgtta ctaatgaaga
     3721 tacgcttggt aacgtaacgg tatctgaggt tgcgacagat gcaaaagtat tgagcttcaa
     3781 gactgctaaa ctggacacta caaaaactta catcatcaca gttaaaggct tgaaagataa
     3841 aaacggtaaa gcagtgaaag atgtaactct ttacgttgaa tttgttgctg gagtgtgact
     3901 tgttgctcta taaacactaa tagtgtgaaa tgagtgaaga agggagaagg attaaaacct
     3961 tctcccttct ttaactgttt tggcgaattg aagggtgtaa cccccttccg tggaaacaac
     4021 cattaatcca cgtgactaat cctcctgcat gcgttgtaag ctaactctta cagggtgtga
     4081 agccaggtga agacgtaacc cttagtggtg cgggtctccg taaaagctaa ctatggttag
     4141 gaaacgaatc tacgtctgag gcgtgtcagg tttggtaatt agcccttcag ctgctaatta
     4201 ccccacaggc tgagatgcct ggagggttct aacgaattaa ggttgttaac gttagttcat
     4261 cttgatatgt atagtaggaa cctaaggttt agcaaaaaaa gggatagcca aaatggaaca
     4321 agtgagatct
//
LOCUS       BSPRSDA      2996 bp ds-DNA             BCT       08-JUN-1990
DEFINITION  Bacillus sp. raw-starch-digesting amylase gene.
ACCESSION   D90112
KEYWORDS    alpha amylase; raw-starch-digesting amylase.
SOURCE      Bacillus sp.(strain B1018) DNA.
  ORGANISM  Bacillus sp.
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2996)
  AUTHORS   Itokor,P., Tsukagoshi,N. and Udaka,S.
  TITLE     Nucleotide sequence of the raw-starch-digesting amylase gene from
            Bacillus sp. B1018 and its strong homology to the cyclodextrin
            glucanotransferase genes
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 630-636 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Shigezo
            Udaka Department of Food Science and Technology, Faculty of
            Agriculture Nagoya University Furo-cho, Chikusa-ku Nagoya 464 Japan
            Phone:  052-782-5111 x6356 Fax:    052-781-4447
FEATURES       from  to/span     description
    pept        313     2454     raw-starch-digesting amylase precursor (EC
                                 3.2.1.1)
    sigp        313      393     raw-starch-digesting amylase signal peptide
    matp        394     2454     raw-starch-digesting amylase mature peptide
    binding     302      306     ribosome binding site
    signal      101      106     -35 region
    signal      125      130     -10 region
    rpt        2567     2580     inverted repeat
    rpt        2585     2598     inverted repeat
BASE COUNT      764 a    852 c    774 g    606 t
ORIGIN      10 bp upstream of RsaI site.
        1 ttatttgagt acattttatg tattcccaca ttgcgcccga tatctacgct tagaaaaaaa
       61 tcgtcggaaa agcgccccaa aaaattttta ttgttattta ttgacagttg tattcgcttt
      121 catctacaat gatggaggaa cgcaatactc gatataattt aagggccatg cattccgtga
      181 ccgcacaccc ggtatggaac aaccccggta tctcgatgga gaagccgggg ttttttgtcg
      241 ccctttttta ggaggtgatc cggcgacagc ggatcaagcc tggaattcaa ataattacat
      301 aggaggtata acatgaagaa atttctgaaa atgacagccg cgttttccct gggattatcc
      361 ctggcgttcg ggcttttcag ccccgcccag gccgcgccgg atacctcggt atccaacaag
      421 caaaatttca gcaccgacgt catctatcaa attttcaccg acaggttttc ggacggcaat
      481 cccgccaaca atccgaccgg cgcggcgttt gacggaacct gcacgaacct ccggctgtat
      541 tgcggcggcg actggcaggg catcatcaac aaaatcaacg acggttacct gaccgggatg
      601 ggcgttaccg ccatctggat ctcccagccg gtcgaaaaca tctacagcat catcaattat
      661 tccggcgtca acaacacggc ctatcacggc tactgggccc gggacttcaa gaagacgaat
      721 ccggcctacg gcacgattgc ggacttccag aacctgatcg ccgccgcgca tgccaaaaac
      781 atcaaagtca ttatcgactt cgccccgaac catacgtcgc ccgcctcgtc cgaccagcct
      841 tcctttgcgg aaaacggccg gctgtacgat aacggcacgc tgctcggggg atacacgaac
      901 gatacgcaga acctgttcca ccataacggc ggcacggact tttccacgac cgaaaacggc
      961 atctacaaaa acctgtacga tctcgccgac ctgaaccata acaacagcac gtcggacgtc
     1021 tacttgaagg acgcgatcaa aatgtggctg gatctcggca tcgacggcat ccgcatggat
     1081 gcggtgaagc atatgccgtt cggctggcag aagagcttta tggctgccgt caacaactat
     1141 aagccggtct ttaccttcgg cgaatggttc ctgggcgtaa acgaagtagg cccggaaaac
     1201 cataagtttg ccaacgaatc cggcatgagc ctgcttgatt tccgttttgc ccaaaaggtg
     1261 cggcaggtgt tccgggacaa caccgacaat atgtacggcc tgaaggcgat gctggagggc
     1321 tccgcagccg attacgccca ggtggatgac caggtgacgt tcatcgacaa ccatgacatg
     1381 gagcgtttcc acgcaagcaa tgcaaaccgc cggaagctgg agcaagcgct ggcgttcacg
     1441 ctgatcctcg cgcgcgtccc cgccatttat tacggcaccg agcagtacat gtcgggtggg
     1501 accgatccgg acaaccgggc gcggatccct tccttctcca cgtcgacgac cgcctatcaa
     1561 gtcattcaaa agctggcgcc gctgcgcaag tccaacccgg ccatcgccta cggatcgacg
     1621 caggagcgct ggatcaacaa cgacgtgctc atttatgagc gcaaattcgg cagcaacgtt
     1681 gccgtcgttg ccgtcaaccg caatttgaac gcgccggctt ccatttcggg acttgtcact
     1741 tccctgccgc aaggcagcta caatgacgtc cttggcggcc ttctgaacgg caacacgtta
     1801 acggtaggct ccggcggagc cgcctccaat ttcacgcttg cggccggcgg cacggcggtg
     1861 tggcagtaca ccgcggcaac ggcgacgccg accatcgggc atgtcgggcc gatgatggcc
     1921 aagccgggcg tgacgatcac gatcgacggc cgcggcttcg gctctagcaa aggcaccgtc
     1981 tacttcggca cgacggcggt gagcggcgcc aacatcacgt cttgggaaga cacgcagatc
     2041 aaagtgaaaa ttccggccgt cgcaggcggc atctacaaca ttaaagtcgc aaacgccgcc
     2101 ggaacggcaa gcaacgtgta cgacaacttc gaggtattgt ccggagacca ggtcagcgtc
     2161 cgcttcgtgg tcaacaacgc gacaacggcc cttgggcaaa atctctacct gacgggcaat
     2221 gtcagcgagc tggggaactg ggacccggca aaagcgatcg ggccgatgta caaccaggtc
     2281 gtttaccaat atccgaactg gtattatgac gtcagcgttc cggccggcaa aacgatcgag
     2341 ttcaagtttt tgaaaaaaca aggctccacc gtcacgtggg aaggcggcag caaccacacc
     2401 ttcaccgcgc cgtccagcgg caccgcgacc attaacgtga attggcagcc ataaggcgtg
     2461 agggataggc ggctggcatt cattggaaaa ggcggactat atgacgtccg ttccgtgagc
     2521 aacgctcatc gctccgttca aaccgccaca aggctgatct tcagccaaaa aaagagggga
     2581 cctttcccct ctttttttat ttccgttgac taacggtatt cccaaaaatt acattggggg
     2641 ataagctccc tcccctctaa tagcaataac aagagcgtaa acccaaccag gtgatccata
     2701 gcgtgcggtc gcctttaatc ccggtatcaa aatgtatcct accttacaaa aatgatcgga
     2761 tcatacaaaa tagtgcgtac tactcaacga aatagaacct acatacagaa cgatcgatcc
     2821 agatttcaac gaacggcacg gtcgtttaaa aaaatggtgt gcggggtgcg agaatatgca
     2881 agaatatcaa ctgactttga aagataagcg gatcgtatgg gggaaggcga tcgaccttga
     2941 gcctctcatt ggcaaatatc ctggcgactc gattagacag ggcatgaacg aagctt
//
LOCUS       HUMALPL      3101 bp ds-DNA             PRI       08-JUN-1990
DEFINITION  Human alkaline phosphatase (EC 3.1.3.1) gene.
ACCESSION   D90054
KEYWORDS    alkaline phosphatase.
SOURCE      Human liver DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3101)
  AUTHORS   Matsuura,S., Kishi,F. and Kajii,T.
  TITLE     Characterization of a 5'-flanking region of the human liver/bone/
            kidney alkaline phosphatase gene: Two kinds of mRNA from a single
            gene
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Fumio
            Kishi Department of Pediatrics Yamaguchi University School of
            Medicine Ube, Yamaguchi 755 Japan Phone:  0836-22-2258 Fax:
            0836-22-2696
FEATURES       from  to/span     description
    pre-msg    2130     3101     alkaline phosphatase mRNA and intron
    IVS        2341     3101     alkaline phosphatase intron
    rpt         375      664     Alu sequence
    rpt        2631     2926     Alu sequence
BASE COUNT      775 a    752 c    692 g    882 t
ORIGIN      chromosome 1; map position p34-36.1.
        1 aagctttctc cagcgagtat gatggtttct gcaggttctt ggcataaagc ctttatcaga
       61 ttaaggaaat tcttttcaat acctggtttg ctgagggctt ctgtcacatc gttttctgtg
      121 accccattcc ctctccctag gtgagcacgt caagtttgat cagggtgtta aactgccacc
      181 cctgtgccta tgattcccaa atttatactc taacccagac ttctttttca aatgccagag
      241 ccaaatattc agctgcctcc ttagtgtctc cacttctaaa agacatctcc aactcaacat
      301 atccaaaaac aagttcctga ttgtctccac ctcatgcctc aaaagaccac cccaaacgcc
      361 gaaaggctga atgctttttt ctttttcttt tttttttttt tctgagatgg agtctcactc
      421 tgttgcccag gctggactgc agtgatgcga tctcagctca ctgcaaactc tgcttcctgg
      481 gttcaagtga ttctcctacc tcagcctctc aggtagctgg gactacaggt gcacaccacc
      541 atgcccagct aatttttgta gagagagttt caccatgttg gccaggctgg tctcaaacac
      601 ctgaccttaa gggatccacc cgcctcagcc tctcaaagtg ctgggattac aggtgtgagc
      661 catcgcactt ggctcggtag tatatggctc agaaacattg ccatttacaa tagttcccca
      721 aaaagcaaaa ttcttaggta taaatctgga ttcagagtcc agaatgctaa ccattacacg
      781 atggaacccg taggtataaa tctaagaaaa catatccaag atctacaggc tgaagactac
      841 agagtgctga taaaaccgaa gaactctgac tgaatgagtg gagagacgtg gtgtcttcat
      901 gactgggcaa ctccatgtgg tatagacgta aaccctccca cattgatctg tggatttaat
      961 accataccta tcaaaaacac agtggtggag gacagatcag ggatcgccag gtttagggat
     1021 ggggggattg tgtaactata aagaacgcaa gagagatttt tggggtggca gagctgttct
     1081 gggtcctgac ggtggcggtg gtggttacat aaatctatcc atgtgtcaaa cgtcagaaca
     1141 ctcattttac acttgggggc aacagaaatc cctccctctg gagggggtga ctgatggtaa
     1201 cctgattgct aattctggaa tcaggagccc tgtggtcagg tttctgctct gcaacttcct
     1261 gttggtaacc ttgggcaagt ctccgtccag agccttggtt ttctcatctg taaaaggaga
     1321 tgataggtcc ttttctgtcc actgcatagc tgattagtga aacatcatgg tgaaattctt
     1381 tatgaactat ggagtgcagc acatagactt gctttcattt tgtcagtatc ctttatagat
     1441 tgttcatgta agctcccaaa gagtagtatt tattttattg aaataaaatg cacgtagaga
     1501 aaaatgtgtg tatcatacat tgacagctga acccaccgtg taaccagcac ccacccaccc
     1561 agatcaatca taaaccgaac cgcaccagca ccccagcagc ccgttcccgt ttccgtaccc
     1621 tccacgtgga gcctccgttc tgtctcccaa cgccctgggt tagtttttat actttctgtc
     1681 atcggaatca cactgtaagt gctcttgggt ttagcttcct ttgctcaagc ttaccttgtg
     1741 cgattcattc atgttgttgt gaggagctgt ggatcatcca ttctccttgc tgtctgtggt
     1801 ggtttctgtg ttgtgaacac acacaatgta ttatccagcc tgccgtagat ggaggcagtt
     1861 ttgaagccat tataaacagg gctgatgtgc acattctgct ggagagaaac gggtcccagg
     1921 gtacaggtag gatgatcagc ttcggtagat cctgccggtt ttcccatgcg ctgtgcctgt
     1981 ctgcactcca ccaacggcga gcggaccttc cggtagttaa acatcttcac gaactcttgg
     2041 actttcctgc acacacagag aagataattt tggatggctc ttcccttccc cccacaacct
     2101 tccttagggc actggctttc aactgatgta aatatttact atgccaagca ctaggagggc
     2161 agagacaaac aagacaaagt cctcacactt agaaactccc ggtgtggcag ctgagatggc
     2221 ccaggaaaga actatattac cttcaaaaag agaggtacat gcgatgtttg aggtggcatg
     2281 aagctcagtg gtgttatatt ggaatgagtg agtgaccatc ctggagcctt cctgaaagag
     2341 gtgacttcat ttttaagtga ttttaaataa tagtttaatg aattagtatt tcgtattcag
     2401 ttaataacat ttttctgatt ttaggatttg ctatagaaat atttggaaac cgtaaagtag
     2461 aacaaaaaaa aaatgtagga atcatctgaa attccaaatt ctaccactca cagttaagtg
     2521 ttgttagatg ttagatgtgg gatattgcct tttaatttcc actctgcgcc gctaccccca
     2581 gcccctaccc cagagccgtc acttctggca ctggagcgca gcttgcgtgg tttttttttt
     2641 tttttttttt tttttgagac agagtcctgc ctgtcgccca ggctggagtg cagtggcgcg
     2701 atctcggctc actgcaactc cccctcccgg gttcacgcca ttctcctgcc tcagcctcct
     2761 gagtagctgg gactacaggc gcccgctacc tctcccggct aattttttgt atttttagta
     2821 gagacggggt ttcactgtgt tagccaggat ggtctcgatc tcctgacttc gtgatccgcc
     2881 cgcctcggcc tcccaaagtg ggcagatcac ctgaggtaga gagttcgaga ccagacctga
     2941 ccaacatgga ccccatctct actaaaaata caaaattggc cagggcatgg tggcgcaagc
     3001 tgctaatccc agccactcag ggaggctgag gctggaaaat tgcttgaacc cgacctgcag
     3061 gcatgcaagc ttggcgtaat catggtcata gctgttttcc t
//
LOCUS       RATCNRAA     2337 bp ss-mRNA            ROD       08-JUN-1990
DEFINITION  Rat calcineurin A alpha mRNA, complete cds.
ACCESSION   D90035
KEYWORDS    calcineurin; calcineurin A alpha; calmodulin binding protein;
            calmodulin-dependent protein phosphatase; isoform.
SOURCE      Rat brain, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2337)
  AUTHORS   Ito,A., Hashimoto,T., Hirai,M., Takeda,T., Shuntoh,H., Kuno,T. and
            Tanaka,C.
  TITLE     The Complete Primary Structure of Calcineurin A, a Calmodulin
            Binding Protein Homologous with Protein Phosphatases 1 and 2A
  JOURNAL   Biochem. Biophys. Res. Commun. 163, 1492-1497 (1989)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Takayoshi
            Kuno Department of Pharmacology Kobe University School of Medicine
            7-5-1 Kusunoki-cho, Chuo-ku Kobe 650 Japan Phone:  078-341-7451
            x3273 Fax:    078-351-6531 Peptides, 78-329 and 391-414, seem to be
            putative catalytic domain and calmodulin binding domain,
            respectively.
FEATURES       from  to/span     description
    ORF         208     1773     calcineurin A alpha
    signal     1944     1950     polyadenylation signal
BASE COUNT      649 a    523 c    596 g    569 t
ORIGIN      
        1 cgggaggagg agtgaaggcg gcggcggcgg aggagggacg cgcggagccg gcagtaactt
       61 tcgagccagc ccagagcccg gagctccagc cgagcggttt gcagcgcggc ggcgcggcgc
      121 tgagtgtctg gcccgccggt gcggtcgggg tgtgcagtcg gacgggacca gcagcgcgtc
      181 gctgtccccc cctcccggtg actggagatg tccgagccca aggcgattga tcccaagttg
      241 tcgactacgg acagggtggt gaaagccgtt ccatttccgc caagtcaccg gctgacagca
      301 aaggaagtgt ttgataacga tgggaagcct cgtgtggata tcttaaaagc acatctcatg
      361 aaggaaggca ggctggaaga aagtgtcgcg ttgagaataa taacagaggg tgcttcgatt
      421 ctccgacagg aaaaaaactt gctggatatt gatgccccag tcacagtttg cggggacatc
      481 catggacaat tctttgactt gatgaagctc tttgaagtgg gaggatctcc tgccaacact
      541 cgctacctct tcttagggga ctatgttgac agagggtact tcagtatcga atgtgtgctg
      601 tatttgtggg ccttgaaaat tctttacccc aaaacactgt ttttacttcg tggaaaccat
      661 gaatgtaggc acctaacaga gtatttcacg tttaaacaag aatgtaaaat aaagtattca
      721 gaacgcgttt atgacgcctg tatggatgcc ttcgactgcc ttcccctggc tgcgctgatg
      781 aaccaacaat tcctgtgtgt acacggtggt ttgtctccag agattaacac tctagatgac
      841 atcagaaaat tagaccgatt caaagaacca cctgcttatg ggcctatgtg tgacatcttg
      901 tggtcagacc ccctggagga ctttggaaat gagaagactc aggaacattt cactcacaac
      961 acagtcaggg gttgttcgta cttctacagt tacccggctg tatgtgactt cctgcagcac
     1021 aataatttgt tgtccatact ccgagcccac gaagcccagg acgcagggta ccgcatgtac
     1081 aggaaaagcc aaacaactgg cttcccgtct ctaattacga tcttctcggc accaaattac
     1141 ttagatgtgt acaataataa agctgcagtg ttgaagtacg agaacaacgt gatgaacatc
     1201 aggcagttca actgctcccc ccatccgtac tggctcccaa atttcatgga tgttttcacc
     1261 tggtcgctgc catttgttgg ggagaaagtg actgagatgc tggtaaacgt cctgaacatc
     1321 tgctcagatg atgaactggg gtcagaagaa gatggatttg acggagccac ggctgcagcc
     1381 cggaaggagg tcatcaggaa caagatccga gcaataggca aaatggccag agtattctca
     1441 gttctcagag aagagagtga gagcgttcta actctgaagg gcctgacccc gactggcatg
     1501 ctccccagcg gagtgctctc tggcgggaaa caaactctgc aaagcgctac tgttgaggcc
     1561 attgaggctg atgaagccat caaaggattc tcaccacaac ataagattac cagcttcgag
     1621 gaggccaagg gcttagaccg aattaacgag aggatgccgc ctcgcagaga cgccatgcct
     1681 tccgacgcca accttaactc catcaacaag gctctcgcct cagagactaa cggcacagac
     1741 agcaacggca gtaatagcag caatattcag tgaccacttc ctgttcactt tttttttttg
     1801 agctgcaggg catgatgggt ttgctgcatc tcagcagttg gatgttcttg cctctgacgg
     1861 tagcttgttt gctctggggg ggccaggaat tggattcagt ttacactatc atgaaaaaaa
     1921 aaaagaggga gagagagaga gataataaaa ctatattttg gtgagggtgg tgattaaaca
     1981 cctcttttgg gtatgccttt aaaaatgctt ctaggaaaaa aaaagtttta aaaagaaagc
     2041 taatgctagt ctatacttca atgttagggg aatgaacacg ttttcctagc gcactgggga
     2101 cttttagata ggttaatgaa aggcctttta ttctgttact ggacacgaaa actttgtcta
     2161 atttcttata ctctattgta cgtttacagt cgcagcacta aaaatggatg acatcaaaca
     2221 tttttaaaca gaaaaaaaag atgtacaaac taaataagga ctatttattg ataatgtttt
     2281 gctactcttg tcagacaatg gctataaact gaattaggca gtcttaaaaa aaaaccg
//
LOCUS       BOVPAS       3589 bp ss-mRNA            MAM       08-JUN-1990
DEFINITION  Bovine poly(ADP-ribose) synthetase mRNA.
ACCESSION   D90073
KEYWORDS    DNA binding protein; helix-turn-helix; nuclear location signal;
            poly(ADP-ribose) synthetase.
SOURCE      Bovine thymus, cDNA to mRNA, clone pSO-7.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 3589)
  AUTHORS   Saito,I., Hatakeyama,K., Kido,T., Ohkubo,H., Nakanishi,S. and
            Ueda,K.
  TITLE     Cloning of a full-length cDNA encoding bovine thymus
            poly(ADP-ribose) synthetase: evolutionally conserved segments and
            their potential functions
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Isao
            Saito Department of Clinical Science and Laboratory Medicine
            Faculty of Medicine, Kyoto University Shogoinkawahara-cho 54,
            Sakyo-ku Kyoto 606 Japan Phone:  075-751-3467 Fax:    075-771-4792
            Zinc-binding finger motifs are observed in seq. 21-51 and 128-165.
            The sequences, 200-220 and 250-270, showed helix-turn-helix
            structure.  Nucleotide-binding fold was found in seq. 890-903.
FEATURES       from  to/span     description
    pept         77     3127     poly(ADP-ribose) synthetase (EC 2.4.2.30)
    signal      224      231     nuclear location signal
BASE COUNT      975 a    879 c   1010 g    725 t
ORIGIN      
        1 tggatgagac ggcactagca gcagcgggtc tccaggtcgc ggcgtcgggc ttccggagct
       61 gtgagcggtt agaaggatgg cggagtcttc agacaagctc taccgggtcg agtacgccaa
      121 gagcgggcgc gcgtcttgca agaaatgcaa agagagcatc cccaaggact cgatccggat
      181 ggccttcatg gtggagtcgc ccatgttcga tgggaaaatc ccgcactggt accacctctc
      241 ctgcttctgg aaggtcggct tctccatctg gcaccctgat gtcgaggtgg aggggttctc
      301 tgagctccgc tgggatgacc agcagacgat caagaagatg gccgagactg gcggacggac
      361 agatgtttca ggcaaaggcc aagatggagt tggcagcaag accgagaaga cgttgattga
      421 cttcggggca gggtacgcca agtccaacag aagcacgtgc aagagctgca tggagaagat
      481 agacaagggc caggtgcgcc tgtctaagaa ggtggtgtac cccgataagc cccagctggg
      541 catggttgac tgctggtacc acccaaagtg ttttgttcag aaacgggagg agctgggctt
      601 ccgtcccgag ttcagcgcaa cgcacctcat gggcttcagc gtcctcaccg cagaggacca
      661 agaaaccctc aagaagcaac tcccggccat caagggtgaa agaaagagaa aaggtgatga
      721 ggtggatgga atagatgaag tgaccaagaa gaagtctaaa aaagaaaaag acaaggagat
      781 taaacttgaa aaggccctta aggcccagaa cgacctgatc tggaatgtca aggacgagct
      841 aaagaaagcg tgctctacga acgacctgaa agagttgctc atcttcaaca agcaggaagt
      901 gccttccggg gagtcggcga tcttggaccg cgtggccgac ggtatggtgt ttggcgccct
      961 ccttccctgc gaggaatgct cgggccagct ggtcttcaag ggcgacgcct attactgtac
     1021 cggggatgtg actgcctgga ccaagtgtat ggtcaagaca cagacgccca accggaagga
     1081 gtgggtgacc ccaaaggaat tccgagaaat ctcttacttc aagaaactga agatcaaaaa
     1141 gcaggaccgt atattccccc cagagagcag caccccagtg ggggcagcag ccccaccctc
     1201 cgcagcttca gcgcctgccg ctgtgcactc tggcccccca gacaagccat tatccaacat
     1261 gaagatcctg actctcggga aactctccca gaacaaggat gaagtgaagg ccacgattga
     1321 gaaactcggg gggaaattga cagggacggc caacaaggcc tccctgtgta tcagcaccaa
     1381 aaaggaggtg gacaagttga ataaaaagat ggaggaagta aaagaagcca acatccgtgt
     1441 cgtgtctgag gacttcctcc aagacatctc cgcctccacc aagagccttc aggagttgct
     1501 ctccacccac ctcttgtccc cctggggagc cgaggtgaag gtggagcctg ttgaagcagt
     1561 gggcccaaag gggaagtcgg gggccgcgcc ctccaagaag agcaagggtc ccgtcaagga
     1621 ggaaggtacc aacaaatctg aaaagaggat gaaattaact cttaaaggag gagcagctgt
     1681 cgaccctgat tcaggtctgg aacacaatgc acacgtcctc gagaaaggcg ggaaggtctt
     1741 cagcgccacc ctcgggctcg tggacatcgt caaagggacc aactcctatt acaagctgca
     1801 gctcctggag gatgacaaag agagcaggta ctggatattc aggtcctggg gccgtgtggg
     1861 cacggtgatt ggtagtaaca aactggagca gatgccatcc aaggaggatg ccattgagca
     1921 ttttatgaaa ttatatgaag agaaaaccgg aaacgcctgg cactctaaaa actttacgaa
     1981 gcatcccaaa aagttctacc ctctggagat tgactacggc caggatgaag aggcggtgaa
     2041 gaagttgaca gtaaaccctg gcaccaagtc caagctcccc aagccagtgc agaacctcat
     2101 taagatgatc tttgatgtag agagtatgaa gaaagccatg gtggagtatg agattgacct
     2161 tcagaagatg cccttgggga agctgagcaa aaggcagatc caggctgcat actccatcct
     2221 cagtgaggtc cagcaggcac tgtcccaggg cagcagtgac tctcacatcc tggatctctc
     2281 caaccgcttc tacaccctga tcccccacga cttcgggatg aagaagcccc cgctgctgaa
     2341 caacgcaaac agcgtgcagg ccaaggtgga aatgctagac aacctgctgg atattgaggt
     2401 ggcctacagt ctacttaggg gtggttctga tgacagcagc aaggacccca ttgatgtcaa
     2461 ctatgagaag ctcaaaactg acattaaggt tgtggacaaa gattccgaag aagccgagat
     2521 cattaggaag tatgtgaaga acactcacgc gaccacacac aacgcgtacg acttagaagt
     2581 cgtcgacatc ttcaagatag agcgcgaagg ggaaagccag cgttacaagc cgtttaagca
     2641 gctgcataac cggaggctgc tgtggcacgg gtccaggacc accaacttcg cgggcatcct
     2701 gtcccagggt ctccggatag ccccacctga agcacctgtg acgggctaca tgtttggtaa
     2761 gggaatctat ttcgcggaca tggtctccaa gagtgccaac tactgccaca cgtcccaggg
     2821 agacccaata ggcttgatcc tgttgggaga agctgccctt ggaaacatgt atgaattgaa
     2881 acatgctcga catatcagca agttacccaa gggcaagcac agtgtcaaag gtttaggcaa
     2941 aactacccct gacccatcgg ctagtattac tgtggatggt gtggaggtgc ctctcgggac
     3001 agggatttca tctggtgtta atgacacctg tctgttgtat aacgagtaca tcgtctacga
     3061 cattgctcag gtccacctga agtacctgct gaagctgaag ttcaacttta agacatccct
     3121 gtggtgagct gtgaggtgcg caggctggtg actggcgtgc tttgcatgac tggccaggca
     3181 gaaacctcag ctgagttgtc gaggcatagc tagcttcact cagtctcctt agaagggatt
     3241 ttatatgaac acttggtcac atgtttcaga gcgattttcc cagcttttca aatcccttga
     3301 ttgtgttggg cgggggagcg ggcggggggt ggtttcgggg tttttctcct gccagccata
     3361 accctggtag gtgcaactga caatagagga gaagctgggg agggtcttac ctgtgtctag
     3421 attagtccaa tgaaagagcc tcgcaggcca cattagaaca tctgccttac tggtttcccc
     3481 agggaaggaa aaaaataatc ttctaccctt gtttttaagt attcatcttt agttttggaa
     3541 aaatgttaag tgtttatttt gaggtcaaaa taaaaactaa tttcatact
//
LOCUS       CHKNMYC      6360 bp ds-DNA             VRT       08-JUN-1990
DEFINITION  Chicken N-myc protein gene.
ACCESSION   D90071
KEYWORDS    N-myc gene; nuclear protein; oncogene.
SOURCE      Chicken (strain white leghorn) embryo DNA, clone CNG1.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 6360)
  AUTHORS   Sawai,S., Kato,K., Wakamatsu,Y. and Kondoh,H.
  TITLE     Organization and expression of the chicken N-myc gene
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hisato
            Kondoh Department of Molecular Biology School of Science, Nagoya
            University Furocho, Chikusaku Nagoya 464-01 Japan Phone:
            052-781-5111 x6685 Fax:    052-782-8575
FEATURES       from  to/span     description
    pept       2872  +  3598     N-myc protein exon 1
    pept    +  4682     5280     N-myc protein exon 2
    pre-msg    2354     6208     N-myc mRNA and introns
    IVS        2496     2769     intron A
    IVS        3599     4681     intron B
BASE COUNT     1318 a   1648 c   1735 g   1659 t
ORIGIN      
        1 tgagagtaga cctaaaaagg agaacttgac gtagtattgg tggatgaaaa gctgttagtg
       61 tgtgcctgaa acccagaagc caaccatatc ctggaccaca tgaaaagaag catggccagc
      121 acggcaaggg tggtgattgt ccctctctac tccattctca aattcctgta cccaaagaga
      181 aaacatagct atatacatat atgtatatgt gggggttttt ttgtttgttt ttttgctgtt
      241 cccaggacta tgtttagcca atgcatcaaa acgcatacaa gcatttgcgg tcacttgagt
      301 tggcactgca ctgtgcctcg tgctctggtt gccacccaga tgctgatcac acaccgtggg
      361 tttggtggtt gctgtaccat ggatgcaccg ggtgtgaggc agggccttca cactgagatg
      421 ggagccccac agctgccatc ttcctcctcc ggactctgtg ctgcagccag gtccaaccag
      481 ctccatctgg ccagcccctc tgtatgcttt cagctctgca gcggacatcc aaacaacagt
      541 gtgaacgctt aaggcgacct caccttcctg gaagggcctt gatacctcag tcttggccac
      601 ccaagcacaa catggaggca agcgggcatc accccaggga acagcgtagc ccaggggctg
      661 gagcgcaatg gggcaacgcg gtgaaggtgc acggtgagga actattaccg tctggctctg
      721 ttctcagaaa caaactcctc ctcagcgact ctcacacaga ccctccctca gagctgccac
      781 ttttgaagcc gccgggctca ccccgggcgg ctcgggagcg gcgccccgcc cggccgcaca
      841 caaacggcgc cggcagcctt cctgcgagca acgcccacgt ccccgcctca acctgcagcc
      901 tatgagcggg taggcgcctc cccaagcggg cgccggaccg ggatcgcaca gccgaggacg
      961 gcgccgcgct gccgtaccgc ggtctccccg ccgctgtccg cgcccgcggg gaaggaggca
     1021 gagcggtggg tgtccgccgc cggggggacg cccggccgcc gcctccggcc ctccgcaagg
     1081 tgtcgccgcg cgcagcggaa cgccgggtcc ggccgggagc tgccgtgcgg tgcggcgggg
     1141 ccggaggtgt gcgcggcgag cggggaggcg atcaccgcag gacgagcgct agttctttat
     1201 aaagcccaga gcgagggcga ccccgcgttc cagacacttc cattattcct ctcctttatt
     1261 agtatttttt tcacctctcc ctcggtcttc gcagataggg ccaccgctct cacgcagcgc
     1321 cgcgtccccc cggggagggg cggccccgca cccccgggcg tgccgggcgc ctcgcaccgc
     1381 gttggagaat aaagagcgaa tggggctttt ggctcgcaag cggaggcacc tccctcgtct
     1441 atggaaatca gggggtgggg ttacgccgcg ctcccccccc cccttttttt ttttttttta
     1501 atgctccccg ggtgtgtcag ctatctttat ttttttattt tttttttcct cccctccctt
     1561 ccccccaccc cgccactcct ccccctcgca acacccaccg ccggcagcgg ggatttaaga
     1621 ggaggaggaa gcggcagggt ctgggctgcc agcagcggga ggacaaaacc cccatgaaat
     1681 aaatacaatc ccacatccag ctgggaggat gggacgggag cggcagcctc cgccggccct
     1741 caatgggggt ttcattgagc ggatatttgg gctgtgctgc ggagcggtcg caccgagctc
     1801 gccgcactct gtaggtgcct ctttgctctt cgcattttta acggtttcgt gtatttcggg
     1861 agcagggaag cgcacaatgc tggaaaggag cagggggcgg cgcgggggcg agcagctgag
     1921 ccgcgtttcg gtgcatttca tcagagcgct ttgcacgact gtgtgtgtgt gtgtgtgttt
     1981 tttttttttt tttcctcccc cttttgctct ccgtcagctg ttcgcaaccc agcacacgtt
     2041 ttcccttcgc tgggtcggga ctaagagctc atttcggttt caatgcccga tgcagaggag
     2101 catggggctg cgggtgagcg gccgcgggta gcgcgggggc accgaggtaa gggcgctgct
     2161 cccgggggcg aaccgtgtgg ctcttgtttt aacctgcaaa gagggagaaa aacttcctgt
     2221 agcgagtcag gctttgttgg ggtgtgttca tcgtttggtg tgcgtcggtg tttgctgctg
     2281 ctggattccc ctgatggagt tgcacggctc ggggagcccc ttctttgcat ccttctctcc
     2341 ccactacact ggaaatgaca aaaggagtaa acttcctttg cttgcggtcg cgtataggag
     2401 ggtttctgca gggaggcgtg catctgcccg cggctgtgct gctctgagct ctggaggcgg
     2461 tcgggcaggg cgcggagctc cgtgcaggac cggaggtgtg tgtgtgtgtg tgtgtgtgtg
     2521 tgtgtgtgtg caccctggag cggctcacgg ccgctgcctg cggcgtccga gaaggtgcgg
     2581 tgcgttcctc cttaacttgt aaaggctcct ggtttgtttg ggtttttttt tccctatcca
     2641 taatggaatt gtgttttttt atttttcctt tttttttttt tttccccact acataatagg
     2701 gaattacagg cgtctttttt tttcgtgttt taattgtgtt cgtttcttta tttccatttt
     2761 ctgctttagt ttcaggtcct ggacacacct ctcctctcct tggtggaact accgtaaaac
     2821 ccgcctctaa cacgagcccc cagcccggga atcgacacgg agagccccgc catgccggga
     2881 atgatcagca agaacccgga cctcgagttc gattccttac agccttgttt ctacccggac
     2941 gaagatgatt tctatttatg cgggccggac tcggcccccc ccggggaaga catctggaag
     3001 aagttcgagc tgctgcccac ccctccgctg tcccccagcc gggccgggct gcaggagcca
     3061 ccccccgggg ggggctccat tgccgtgggg aggggcggac ctggggagtg ccgccccgtt
     3121 gatcccctgg actgggcgtc cgagttgctc ctcctgcccc ccgaggccga gctgtggggc
     3181 agcacggacg gcgcggattt cttcgagacg ggcctcgggg cgagcaacaa cctcaactcc
     3241 atcatcatcc aggactgcat gtggagtgcg ttctcggccc gcgagaagct ggagcgggcg
     3301 gtcagcgaga agctgcagag caagcctccc gccgccgccc cgccgcctcc gcccccggtt
     3361 gtacccaccg ccgcttgtcg ccgccgcgaa cagccccagc gcggtcccgg ccgcgcggag
     3421 ctgggcggct ccgtgcccga gtgcgtggac ccggccgtgg tcttcccctt ccccgtcaac
     3481 aagcgggagg cggcggtgcc aagcggcggc gagaccccgc ggggcgggcg ccggccgggt
     3541 cccgcggggg agagccgggc cagcagcagc tccggggacg acacgctcag cgactcgggt
     3601 caagcgcgcg gctgacgggt gggcggtgcg gggtcgggcc gccgcagcgc acactcaaca
     3661 gccgcggggc ccgtgcgcgc cccacggggc tgggttctcc ggggctgtgg gccaacagca
     3721 aggacggtgt gcccgggctc gggctgtgcc gtgcggctcc tcgggcttcc catgtcctca
     3781 tgtccatcgg ggtgtgtgtg cgccattctg ggggccgttt tggggagcgt gtggtcagaa
     3841 gcgaggggca gtgccatgct gaaccacgtt cggtggggga tgtttacggc gttgggaggc
     3901 tggaacagag cggcagcccg cagagggtcc ttgtctgtac ggtgacagag cctaaaatgc
     3961 cctcagatag ttcccatcgg agaaatccgg ttcggcagtg tagttctggc tgtgtaagtg
     4021 agcggtccgt atgatcgtca ccataaactt ggcgatccgt ctcggctcgc ggagcctact
     4081 gtccctttaa tgtctggttt gacagctttg ggtgaggaag cacttccaac agctgtcttc
     4141 ttggcactgc accaagcgcc ggttaaaggg tccccggcgg gagcagcttc accttctgca
     4201 ctagaacaaa cccagcgatt gtttcgtttc ccgtctgctt ttccaccaac ccagggctgt
     4261 gttgtggctc tgtacgtgca gtttgtgccc caggtctgca gctgtagtaa aaatgaatgc
     4321 attttgtatg aaaacttttc gccactggga ttagttgggt ggtagcccca ctttcattta
     4381 aaatgagaag ttgttaaaga aaaggagttg gagacaatcc ctatgggtcc cttcaaactc
     4441 ggggtattct atgaaaaaca aacctcataa agttccccca aacttgcaga gatctatttt
     4501 gtttagtttt gtgtatgtta tatacgtctc tagaactata tttgattatg gacagtgagc
     4561 acaccctatt ttttttacca tcagttattt tattggtatt tctaatgtgc atatcttgtc
     4621 aataaatatt attgcttttg agagaatatc aatttctttt tccattttct ttaaccgaaa
     4681 gatgatgaag acgaggagga agaggatgat gaagaagaaa tagatgttgt gacagtggag
     4741 aaaagacgct cctcctccaa caaggctgtt accaccctca ctattacagt gcgtcctaaa
     4801 aataccactt ttccatcggt caggacacag cagaatgaac tgattttaaa gcgttgtgcg
     4861 ccaattcacc agcagcataa ttatgccgct ccttctccgt atatggagag tgaagatgtg
     4921 ccaccacaga aaaagttaaa agccgaggtg ccccgtccag taaaacccat gatccaacca
     4981 aagtctaaga gttcaagtcc tcgaaactct gattcggagg acagcgaacg tcgacgtaac
     5041 cataatatct tggagcgtca gaggcgtaat gatctgcggt ccagtttcct cacgttaagg
     5101 gaccacgttc cagaacttgt taaaaatgag aaagctgcaa aagttgtgat tttgaaaaaa
     5161 gccactgaat atgtccattc ccttcaggca gaggagcaga agttattgct agaaaaggag
     5221 aaattgcaag ccaggcaaca acagttgcta aagaaaatag agtacaagcg gacttgctaa
     5281 acttcttttt ctcgtttttt ttcttttctt tttggttttt tttttttttt ttggctgacc
     5341 aggacagtca ttgccacttt gcacattttt gattctttaa aaaaaaaaaa aaaattgtgt
     5401 tttttgacgt taagaatgtt ggttttactt tcaattcagt ccctgaagta attgacaaac
     5461 tttattatcc gggtacgagc aaatgggtgt tcttgcaaga agtttattgc gagactacca
     5521 aacacaatgg actgcctttg tttttcattt taagaactgt agatggtgga tttttttaat
     5581 tttttttaaa ttgttgtgag catttggagc tgctgatgac atctagttga gttgaaaaca
     5641 tccattccta atttttatgg tgcttatgtt ctaacagatg ttactttagg gggttggcat
     5701 ttgtacccct ctggaatttt ctgtaaatac catctacaca cttgcctttt gtacatgtct
     5761 tgggttatga gaggtggctt ttgctgccag tattagactg gaagttcata cctaagtact
     5821 gtaatacctc aatgtttgag gagcatgttt ttgtatacaa atatattgtt aatctctgtt
     5881 atgtactgta ctaattctta cattgcctgt atactttagt acgtcgctga tacataacta
     5941 aatttgatac ttatattttc gtatgaaaat gagttgtgaa agttttgagt agatattact
     6001 ttatcacttt tttgaactaa gaaacttttg taaagaaatt tactatatat gccttttcct
     6061 agcctgtttc ttccagttaa tgtatttgtt aatgtttggt gcatagaact gggtaactgc
     6121 aaagttctgt gtttaatttc ttccaatggt gtacatttag tgctgcgtct tatagcactt
     6181 tgaaatacct catgtttatg aaaataaata gcaattacat gatgtgccat ttactatttt
     6241 ttcttttaaa tggagttcct tgaaataaag tttttaactt tgcaatgttt atgaaataaa
     6301 aacctagtcc catatgtaag gatttgaact ctcaagtata ggtgatgttt acggacaagc
//
LOCUS       BMEGDH1      2834 bp ds-DNA             BCT       08-JUN-1990
DEFINITION  B. megaterium glucose dehydrogenase gene and ORFs.
ACCESSION   D90043
KEYWORDS    glucose dehydrogenase.
SOURCE      Bacillus megaterium (strain IAM1030) DNA.
  ORGANISM  Bacillus megaterium
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2834)
  AUTHORS   Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
            Urabe,I. and Okada,H.
  TITLE     Active and silent isozyme genes of glucose dehydrogenase from
            Bacillus megaterium IAM1030
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Toshihide
            Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan.
FEATURES       from  to/span     description
    pept       1964     2749     glucose dehydrogenase (EC 1.1.1.47)
    ORF         181      867     ORF1
    ORF        1086     1946     ORF2
    signal       89       94     put. -35 region for ORF1
    signal      116      121     put. -10 region for ORF1
    signal      167      172     ORF2 ribosome binding site
    signal      893      928     termination signal
    signal      972      977     put. -35 region for ORF2
    signal      997     1003     put. -10 region for ORF2
    signal     1070     1076     ORF2 ribosome binding site
    signal     1949     1954     glucose dehydrogenase ribosome binding site
    signal     2751     2783     termination signal for glucose dehydrogenase
BASE COUNT      896 a    442 c    644 g    852 t
ORIGIN      
        1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga
       61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag
      121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa
      181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg
      241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt
      301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat
      361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca
      421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat
      481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag
      541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta
      601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat
      661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct
      721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt
      781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt
      841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg
      901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt
      961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt
     1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt
     1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt
     1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt
     1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg
     1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta
     1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca
     1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg
     1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag
     1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa
     1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat
     1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca
     1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg
     1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc
     1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga
     1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag
     1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga
     1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg
     2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa
     2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt
     2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt
     2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt
     2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga
     2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt
     2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa
     2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat
     2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg
     2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc
     2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc
     2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca
     2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga
     2821 ctggtgttga attc
//
LOCUS       BMEGDH2      1202 bp ds-DNA             BCT       08-JUN-1990
DEFINITION  B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene.
ACCESSION   D90044
KEYWORDS    glucose dehydrogenase; isozyme.
SOURCE      Bacillus megaterium (strain IAM1030) DNA.
  ORGANISM  Bacillus megaterium
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1202)
  AUTHORS   Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
            Urabe,I. and Okada,H.
  TITLE     Active and silent isozyme genes of glucose dehydrogenase from
            Bacillus megaterium IAM1030
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Toshihide
            Mitamura Department of Fermentation Technology Osaka University 2-1
            Yamada-oka Suita, Osaka 565 Japan Phone:  06-877-5111 x4373 Fax:
            06-876-9036
FEATURES       from  to/span     description
    pept        125      910     glucose dehydrogenase
    signal       27       32     put. -35 region
    signal       49       55     put. -10 region
    signal      111      116     SD sequence
    signal      928      959     termination signal
BASE COUNT      427 a    190 c    268 g    317 t
ORIGIN      
        1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa
       61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt
      121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt
      181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg
      241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc
      301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc
      361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt
      421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg
      481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa
      541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc
      601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa
      661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa
      721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg
      781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt
      841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg
      901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt
      961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg
     1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa
     1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg
     1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg
     1201 ag
//
LOCUS       BMOPTTHP1    1023 bp ss-mRNA            INV       08-JUN-1990
DEFINITION  B.mori PTTH mRNA.
ACCESSION   D90082
KEYWORDS    PTTH; preproPTTH.
SOURCE      B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to
            mRNA, clones P1, P2, C2, C9 and C19.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 1023)
  AUTHORS   Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
            Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
  TITLE     Molecular cloning of the Bombyx mori prothoracicotropic hormone
  JOURNAL   Science 247, 1333-1335 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hironori
            Ishizaki Department of Biology, School of Science, Nagoya
            University Chikusa-ku Nagoya 464-01 Japan Phone:  052-781-5111
            x2472 Fax:    052-783-0719 Telex:  SCUNAG J: 447-7323
FEATURES       from  to/span     description
    pept         34      708     preproPTTH
    matp        379      708     PTTH subunit
    signal      768      773     polyadenylation signal
    signal      826      831     polyadenylation signal
    signal      997     1002     polyadenylation signal
    signal     1004     1009     polyadenylation signal
BASE COUNT      358 a    179 c    186 g    300 t
ORIGIN      
        1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc
       61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg
      121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac
      181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg
      241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta
      301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa
      361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct
      421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc
      481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc
      541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct
      601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg
      661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt
      721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata
      781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc
      841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt
      901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca
      961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt
     1021 aaa
//
LOCUS       BMOPTTHP4     944 bp ss-mRNA            INV       08-JUN-1990
DEFINITION  B.mori preproPTTH mRNA.
ACCESSION   D90083
KEYWORDS    PTTH; preproPTTH.
SOURCE      B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to
            mRNA, clone P4 and C21.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 944)
  AUTHORS   Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
            Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
  TITLE     Molecular cloning of the Bombyx mori prothoracicotropic hormone
  JOURNAL   Science 247, 1333-1335 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hironori
            Ishizaki Department of Biology, School of Science, Nagoya
            University Chikusa-ku Nagoya 464-01 Japan Phone:  052-781-5111
            x2472 Fax:    052-783-0719 Telex:  SCUNAG J: 447-7323
FEATURES       from  to/span     description
    pept    <     1      631     preproPTTH
    matp        302      631     PTTH subunit
    signal      691      696     polyadenylation signal
    signal      749      754     polyadenylation signal
    signal      918      923     polyadenylation signal
    signal      925      930     polyadenylation signal
BASE COUNT      337 a    163 c    177 g    267 t
ORIGIN      
        1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt
       61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag
      121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga
      181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa
      241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag
      301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa
      361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg
      421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag
      481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga
      541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag
      601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt
      661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa
      721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag
      781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt
      841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata
      901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa
//
LOCUS       HUMMTSDHB     958 bp ss-mRNA            ORG       08-JUN-1990
DEFINITION  Human mitochondrial succinate-ubiquinone oxidoreductase
            (EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA.
ACCESSION   D90047
KEYWORDS    Ip; complex II; iron sulfur subunit; sdh B;
            succinate-ubiquinone oxidoreductase.
SOURCE      Human liver mitochondrion, cDNA to mRNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 958)
  AUTHORS   Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M.
  TITLE     Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning
            of iron sulfur(Ip) subunit of liver mitochondria
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Kiyoshi
            Kita Department of Parasitology Juntendo University 2-1-1,Hongo
            Bunkyo-ku, Tokyo 113 Japan Phone:  03-813-3111 x3542 Fax:
            03-814-9300
FEATURES       from  to/span     description
    pept    <     1      789     succinate-ubiquinone oxidoreductase (sdh B)
                                 (AA at 1)
    site         74       95     iron-sulfur binding site I
    site        167      179     iron-sulfur binding site II
    site        224      236     iron-sulfur binding site III
BASE COUNT      319 a    212 c    204 g    223 t
ORIGIN      
        1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt
       61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg
      121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag
      181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt
      241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac
      301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat
      361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag
      421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag
      481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc
      541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg
      601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc
      661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat
      721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct
      781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt
      841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata
      901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS       HUMNCAW      2287 bp ss-mRNA            PRI       08-JUN-1990
DEFINITION  Human nonspecific cross-reacting antigen (NCA-W272) mRNA.
ACCESSION   D90064
KEYWORDS    CEA; CEA gene family; PI-anchored membrane protein.
SOURCE      Human white blood cells, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2287)
  AUTHORS   Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and
            Matsuoka,Y.
  TITLE     Characterization of a cDNA clone encoding a new species of the
            nonspecific cross-reacting antigen (NCA), a member of the CEA gene
            family
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Fumiko
            Arakawa First Department of Biochemistry School of Medicine Fukuoka
            University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone:
            092-801-1011  x2892 Fax:   092-801-3600
FEATURES       from  to/span     description
    ORF          87     1136     nonspecific cross-reacting antigen ORF
BASE COUNT      618 a    593 c    453 g    623 t
ORIGIN      
        1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag
       61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc
      121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca
      181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc
      241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa
      301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag
      361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg
      421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg
      481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca
      541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc
      601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc
      661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac
      721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga
      781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag
      841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt
      901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa
      961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca
     1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag
     1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc
     1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa
     1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact
     1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc
     1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa
     1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt
     1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc
     1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga
     1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga
     1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc
     1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt
     1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt
     1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa
     1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca
     1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat
     1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga
     2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt
     2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac
     2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc
     2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg
     2281 gctgatc
//
LOCUS       PIGDESTN     1681 bp ss-mRNA            MAM       08-JUN-1990
DEFINITION  Porcine destrin mRNA.
ACCESSION   D90053 J05290
KEYWORDS    actin-binding protein; cofilin; destrin.
SOURCE      Pig adult brain, cDNA to mRNA, clone PD2.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1681)
  AUTHORS   Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S.,
            Iida,K. and Yahara,I.
  TITLE     Destrin, a mammalian Actin-Depolymerizing Protein, Is Closely
            Related to Cofilin --Cloning and Expression of Porcine Brain
            Destrin cDNA
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Kenji
            Moriyama Department of Biophysics and Biochemistry Faculty of
            Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan
            Phone: 03-821-2111  x4408
FEATURES       from  to/span     description
    pept         54      551     destrin
    signal     1645     1660     polyadenylation signal
BASE COUNT      507 a    322 c    372 g    480 t
ORIGIN      
        1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt
       61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt
      121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag
      181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa
      241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg
      301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc
      361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg
      421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc
      481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat
      541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc
      601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg
      661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct
      721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga
      781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt
      841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt
      901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt
      961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac
     1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca
     1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca
     1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt
     1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta
     1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag
     1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc
     1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg
     1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga
     1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt
     1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta
     1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taacttaaaa aaaaaaaaaa
     1681 a
//
LOCUS       RATPMP70X    3324 bp ss-mRNA            ROD       08-JUN-1990
DEFINITION  Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA.
ACCESSION   D90038 J05256
KEYWORDS    PMP70; peroxisomal membrane protein.
SOURCE      Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181,
            189,201].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3324)
  AUTHORS   Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T.
  TITLE     The 70-kDa Peroxisomal Membrane Protein Is a Member of the
            Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Keiju
            Kamijo Department of Biochemistry Shinshu University School of
            Medicine Matsumoto 390 Japan Phone:  263-35-4600 x5182 Fax:
            263-33-6458
FEATURES       from  to/span     description
    pept         36     2015     peroxisomal membrane protein (PMP70)
    signal     3271     3276     polyadenylation signal (put.)
BASE COUNT      921 a    658 c    773 g    972 t
ORIGIN      
        1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga
       61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc
      121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg
      181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac
      241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta
      301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac
      361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact
      421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg
      481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag
      541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc
      601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta
      661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg
      721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa
      781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta
      841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa
      901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct
      961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg
     1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca
     1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc
     1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg
     1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta
     1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg
     1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa
     1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg
     1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat
     1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc
     1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa
     1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg
     1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg
     1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac
     1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca
     1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc
     1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa
     1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt
     2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta
     2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca
     2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat
     2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg
     2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga
     2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct
     2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt
     2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat
     2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt
     2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt
     2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg
     2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac
     2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg
     2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg
     2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca
     2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc
     3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg
     3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa
     3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt
     3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat
     3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag
     3301 attaaaaaaa aaaaaaagga attc
//
LOCUS       VACSANT      1525 bp ds-DNA             VRL       08-JUN-1990
DEFINITION  Vaccinia virus surface (S) antigen gene.
ACCESSION   D90076
KEYWORDS    S gene; surface antigen.
SOURCE      Vaccinia virus DNA.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 1525)
  AUTHORS   Ueda,Y., Morikawa,S. and Matsuura,Y.
  TITLE     Identification and nucleotide sequence of the gene encoding a
            surface antigen induced by vaccinia virus
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Yoshiaki
            Ueda National Institute of Health Gakuen, Musashimurayama Tokyo
            190-12 Japan Phone:  0425-61-0771 Fax:    0425-65-3315
FEATURES       from  to/span     description
    pept        382     1437     surface antigen S
BASE COUNT      568 a    229 c    258 g    470 t
ORIGIN      
        1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata
       61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat
      121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa
      181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat
      241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac
      301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata
      361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca
      421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc
      481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt
      541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt
      601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa
      661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta
      721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca
      781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc
      841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga
      901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat
      961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta
     1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat
     1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa
     1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact
     1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga
     1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc
     1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt
     1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata
     1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt
     1501 gctatagaat gaatgaggat acgcg
//
LOCUS       YSCA1         881 bp ds-DNA             PLN       08-JUN-1990
DEFINITION  S. cerevisiae acidic ribosomal protein A1 (YSCA1).
ACCESSION   D90072 X13682
KEYWORDS    acidic ribosomal protein; ribosomal protein.
SOURCE      S. cerevisiae (strain IFO-40028) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 881)
  AUTHORS   Mitsui,K. and Tsurugi,K.
  TITLE     Identification of A1 protein as the fourth member of 13 kDa-type
            acidic ribosomal protein family in yeast Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 315; 631 to 881)
  AUTHORS   Mitsui,K. and Tsurugi,K.
  TITLE     Identification of A1 protein as the fourth member of 13 kDa-type
            acidic ribosomal protein family in yeast Saccharomyces cerevisiae
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989)
  STANDARD  full staff_entry
REFERENCE   3  (bases 277 to 742)
  AUTHORS   Tsurugi,K. and Mitsui,K.
  TITLE     cDNA and deduced amino acid sequence of acidic ribosomal protein A1
            from Saccharomyces cerevisiae
  JOURNAL   Nucleic Acids Res. 16, 3574-3574 (1988)
  STANDARD  simple automatic
COMMENT     These data kindly submitted in computer readable form by: Kazuhiro
            Mitsui Department of Biochemistry Yamanashi Medical college Tamaho,
            Nakakoma-gun Yamanashi 409-38 Japan Phone:  0552-73-1111 x2257
FEATURES       from  to/span     description
    pept        313      633     acidic ribosomal protein A1
    signal      125      135     UASrpg box1
    signal      182      192     UASrpg box2
    signal      716      721     poly(A) signal
    variant     303      303     a in [1]; g in [3]
    variant     684      685     tt in [1]; t in [3]
    variant     719      719     a in [1]; t in [3]
BASE COUNT      270 a    165 c    157 g    289 t
ORIGIN      
        1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt
       61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca
      121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa
      181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca
      241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa
      301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac
      361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca
      421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac
      481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct
      541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc
      601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg
      661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa
      721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg
      781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc
      841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c
//
LOCUS       YSCCDC23X    3107 bp ds-DNA             PLN       08-JUN-1990
DEFINITION  S. cerevisiae CDC23 protein gene.
ACCESSION   D90081
KEYWORDS    CDC23 protein.
SOURCE      Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line
            D22) DNA, clone YX34.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3107)
  AUTHORS   Doi,A. and Doi,K.
  TITLE     Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces
            cerevisiae
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Kenji Doi
            The Institute of Scientific and Industrial Research Osaka
            University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111
            Fax:    06-877-4977
FEATURES       from  to/span     description
    ORF         765     2645     ORF for CDC23
    site        153      161     calcium-binding site
BASE COUNT     1010 a    595 c    629 g    873 t
ORIGIN      
        1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact
       61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt
      121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc
      181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt
      241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg
      301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct
      361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt
      421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga
      481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag
      541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat
      601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact
      661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc
      721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc
      781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac
      841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag
      901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag
      961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt
     1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt
     1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc
     1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta
     1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc
     1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga
     1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag
     1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa
     1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact
     1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat
     1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta
     1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg
     1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc
     1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg
     1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt
     1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg
     1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact
     1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt
     2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca
     2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca
     2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt
     2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg
     2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca
     2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat
     2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg
     2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa
     2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta
     2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata
     2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat
     2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc
     2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact
     2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc
     2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc
     2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga
     3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga
     3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag
//
LOCUS       RATLIPAA     3225 bp ss-mRNA            ROD       08-JUN-1990
DEFINITION  Rat mRNA for hormone sensitive lipase.
ACCESSION   X51415 J03087
KEYWORDS    hormone sensitive lipase; lipase.
SOURCE      Rat adipose tissue cDNA to mRNA.
  ORGANISM  Rattus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3225)
  AUTHORS   Holm,C., Kirchgessner,T.G., Svenson,K.L., Lusis,A.J.,
            Belfrage,P. and Schotz,M.C.
  TITLE     Nucleotide sequence of rat adipose hormone sensitive lipase cDNA
  JOURNAL   Nucleic Acids Res. 16, 9879-9879 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (sites; for [1])
  AUTHORS   Holm,C., Kirchgessner,T.G., Svenson,K.L., Fredrikson,G.,
            Nilsson,S., Miller,C.G., Shively,J.E., Heinzmann,C., Sparkes,R.S.,
            Mohandas,T., Lusis,A.J., Belfrage,P. and Schotz,M.C.
  TITLE     Hormone-sensitive libase: Sequence, expression, and chromosomal
            localization to 19 cent-q13.3
  JOURNAL   Science 241, 1503-1506 (1988)
  STANDARD  full staff_entry
FEATURES       from  to/span     description
    pept        616     2889     hormone sensitive lipase
    signal     3209     3214     polyA signal (put.)
    signal     3225     3225     polyA signal
BASE COUNT      701 a    952 c    916 g    656 t
ORIGIN      
        1 gaagaaactc gtaatggggc ttgagtgtgt gggtaaggaa gggaactgct gagcgtccag
       61 atttcccaga acaagaagga caagtccaaa gacaataaac aaagatagga gttcatcgtt
      121 gaatacctgg aaggaagaag gaagagggtg ggcagcttct gggatagaga agcagattct
      181 ttggagttca ggagattgga ctcctagatc caggaaagag ggcaaaagcc atggacagtg
      241 ggtttctgat tctgaagagg agagataggc aggggctgct tagcctgggg cttcgactcc
      301 cagccgatag gcagaaagag gatgaggccc cagacttcta gaagacaagg agataaattc
      361 cgaggtgtga gagagaagat aatggaaagc tcctgcatct ccaggaggct tggaacagac
      421 cggtgaacct gaggaaggag agggctacgg ctttggactc ctgaatctcc tgtatcttgg
      481 cagggatggt cctcggtttg aatactgggt tgggcctagg gagagagagg taggagctgt
      541 acccctgggc ctgacagagc aaagagtaat cccagacctt cccatcttcc cgcagcctcg
      601 gcattctcac acagcatgga tttacgcaca atgacacagt cgctggtggc gctcgcagaa
      661 gacaacatgg ccttcttctc aagccagggc ccaggagaga cagcacggcg gctgtccaac
      721 gtctttgcag gtgttcggga acaggcactg gggctagaac caactctagg tcagctcttg
      781 ggtgtggcac accatttcga cctggacact gagacaccag ccaacggata ccgtagcttg
      841 gtgcacacag ctcgttgctg cctggcacac ctactacaca aatcccgcta tgtggcctct
      901 aaccgcagaa gtatcttctt ccgtgccagc cacaacctag cagaactgga ggcctacctg
      961 gctgccctca cccagctccg tgctctagcc tactacgccc agcgcctgct gaccatcaac
     1021 cgaccaggag tgctcttctt tgagggcgat gagggactca gcgctgactt cctacaggac
     1081 tatgtcacgc tacataaagg ctgcttttac ggtcgctgcc tgggcttcca gttcacacct
     1141 gccatccggc cgttcctgca gactctctcc atcgggctgg tgtccttcgg ggaacactac
     1201 aaacgcaacg agacgggcct cagtgtgact gcaagttccc tctttacggg tggccgattc
     1261 gccatagacc cagagttgcg tggggctgaa tttgagcgca tcatacagaa cctggacgtg
     1321 cacttctgga aagccttctg gaatatcacg gagatcgagg tgctatcgtc tctggccaac
     1381 atggcatcaa ccaccgtgag ggtaagccgc ctgcttagct tgccgcctga ggcctttgag
     1441 atgccactaa cctctgaccc caagctcaca gttaccatct cacctccctt ggcacacacg
     1501 ggaccaggcc ctgtgctagc caggctcatc tcctatgacc tgcgggaagg gcaggacagc
     1561 aagatgctca acagcctggc aaaatctgag ggcccacgcc tggagctgcg cccacggcct
     1621 caacaagccc cccgctcacg agccctggtt gtccacattc acggtggtgg ctttgtggca
     1681 cagacctcca aatcccatga gccctacctc aagaactggg cccaggagct aggcgtcccc
     1741 atcatctcca tcgactactc cctggcccct gaggccccct tcccccgagc actggaggag
     1801 tgtttttttg cctactgctg ggctgtcaag cactgtgaac tgcttggttc aaccggagag
     1861 cggatatgcc ttgcggggga cagcgcaggt gggaacctct gcatcaccgt gtcccttcgg
     1921 gcagcagcct atggggtgag ggtgccagat ggcatcatgg cagcctaccc agttaccacc
     1981 ctgcagtcct ctgcttctcc ctctcgtctg ctgagcctca tggatcctct tctaccactg
     2041 agcgtactct ccaagtgtgt gagcgcctat tcagggacag agacggagga ccattttgac
     2101 tcagaccaga aggcattggg cgtgatgggg ctggtgcaga gagacacgtc cctgttcctc
     2161 agagacctcc gcctgggcgc ctcctcatgg ctcaactcct tcctggagtt aagtgggcgc
     2221 aagccccata agaccccagt tgcctgcaac agagacactg cgccccacgg attctgggcg
     2281 cttacggagt ctatgcgcag gagtgtgtct gaggcagccc tggcccagcc tgagggcttg
     2341 ctgggcacag attccttgaa gaagctaaca ataaaggact tgagctttaa gggcaactca
     2401 gagccatcag acagccccga gatgtcacag tcaatggaga cacttggccc ctccacaccc
     2461 tcggatgtca acttttttct gcgatccggg aattcccagg aagaggctga aaccagagat
     2521 gatataagcc ccatggacgg aatcccccgc gtgcgcgctg ccttccctga tggtttccac
     2581 ccacggcgct caagccaagg tgtcctccac atgcccctct actcgtcacc catagtcaag
     2641 aaccccttca tgtctcctct gctggcccct gacgtcatgc tgaagaccct gccgcccgtg
     2701 caccttgtgg cgtgcgctct ggaccccatg ctggatgact cggtcatgtt cgcgcggcga
     2761 ctgaaggacc tgggccagcc cgtgacgctg aaagtggtag aggacctgcc gcatggtttc
     2821 ctgagcttag cagccctgtg tcgtgagacc ggcaggccgc ggagttgtgc gtgcagcgca
     2881 tccggctgat cctcaccccc cctgctgcac cgctgacctg agctgggaac ggcggggggc
     2941 ggcactaaaa gacctcttgc acccttctgc gcgggcttcc gtgatgagtg cgctccgagt
     3001 cgggctctag gccccctcgg tcgggctggg cgggaggggg cgggccgtgc cttaaccata
     3061 agtggggcgg ggcaggggcc aaaagctgaa cctgggggag ggggacgcac acacacacac
     3121 ctgtcaccga gacagctgga cctgcactct accactgcct tctactgctg ctgtgaccga
     3181 ccaccgctag tcggttttgc tttttgtaaa taaaagttat ttaat
//

GenBank-Updates@genbank.bio.net (06/27/90)

LOCUS       KAEASTAB     3324 bp ds-DNA             BCT       27-JUN-1990
DEFINITION  K.aerogenes arylsulfatase (atsA) and atsB genes, complete cds.
ACCESSION   M31938
KEYWORDS    arylsulfatase.
SOURCE      K.aerogenes (strain W70) DNA, clone pAS123.
  ORGANISM  Klebsiella aerogenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3324)
  AUTHORS   Murooka,Y., Ishibashi,K., Yasumoto,M., Sasaki,M., Sugino,H.,
            Azakami,H. and Yamashita,M.
  TITLE     A sulfur- and tyramine-regulated Klebsiella aerogenes operon
            containing the arylsulfatase (atsA) gene and the atsB gene
  JOURNAL   J. Bacteriol. 172, 2131-2140 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            Y.Murooka, 02-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        452     1669     atsB protein
    pept       1685     3079     arylsulfatase (atsA) (EC 3.1.6.1)
    mRNA        420     3172     atsB and atsA mRNA (alt.)
    mRNA        421     3172     atsB and atsA mRNA (alt.)
    signal      383      388     atsB -35 region
    signal      408      413     atsB -10 region
    binding     441      444     atsB ribosome binding site
    binding    1672     1675     atsA ribosome binding site
    rpt        3118     3142     inverted terminal repeat, copy A
    rpt        3148     3172     inverted terminal repeat, copy B
BASE COUNT      758 a    994 c    928 g    644 t
ORIGIN      
        1 cggctggctt ttaacaaagc agacatgata gggcctcccc gtcgagatga tggtacatga
       61 cacatggatg actactgata attgaagcat acatttcaca tttaagtaac acaatcatta
      121 acatattaac aatgtgatag ccgttacaaa attttcagca atacattgtt atataaagca
      181 tttaaaaaac accccagcca tgctttacga tagcggcgtt aaaacataaa caaataagat
      241 taacacccgg cgtacctgcc cctggtgata aaaaaaccct attcccgacg cgtgattctc
      301 atcctaaaga acgatggatt tttgtttttt ttcacttata cccttaacag gatgcattta
      361 attttcgccg cggcttaatc atgtgattac gcctgacccg ctgttgccaa aataatatca
      421 acagtaccgg tcattaaccg aggataagcc gatgctgaat atcgccgccc tgcgccagca
      481 gcaaattccg ctggccgctg agccgcgctc gccggtgccg tttcatattc tgatgaagcc
      541 gattggcccc gcctgcaatc tcgcctgccg ctattgctat tacccgcagg acgaaacgcc
      601 ggtcaacaag atggatgacg cgcggctgga gcagtttatc cgccgctata ttgccgccca
      661 gcccgccggc gcgcgggaaa tcaactttgt ctggcagggg ggcgagccgc tgctggccgg
      721 cctgagcttc tacaaaaaag cgctcgccct gcaggcacgc tatgcccccg acggcgtgac
      781 tatcagcaac agcctgcaga ccaacgggac gctgatcaac gacgcatggt gccgactgtt
      841 ccgcgaacat ggctttatta tcgggttgag cctcgaaggc aacgaagcgc tgcaggacta
      901 ccatcgtccg gataaacgcg gccggtcgac ctggtcggcg gcgctgcgcg gcattgacct
      961 gctccatcag catcaggtgg actttaatct gctggtggtg gtgcataacg agatggcggc
     1021 ccacgcggcg gcgatttatg accggctggt cagcctcggc gcgcgctatc tgcagtttca
     1081 gccgctgatg agcgaaggcg cggccctgcg cgaaggatac cagctcagcg ccgataactg
     1141 gggacgtttt atggtcggca tctggcgaca gtggcggaag cgctgcgata gagggcgggt
     1201 gttcgttatc aatatcgaac aggcgtgggc gcagtatttc actcatacca gcggcagctg
     1261 cgtgcacagc gcccgctgcg gcagcaacct ggtgatggag cccgacggac agctctacgc
     1321 ctgcgatcat ctgatcaacg ccgaacatcg gctgggtcgc cttgacgagc agacgcttgc
     1381 cgccgcagtt gacgcctcgg tgcaactgcc tttcggtcag cagaaaagtc tgcgccgcga
     1441 atgccagact tgctcggtaa aaatggtctg ccagggcggc tgcccggcgc atctcaacgc
     1501 cgcaggcaac aaccgcctct gcggaggcta ctaccgcttc tttagcgaca tcctcgcgcc
     1561 attacgcccc ttttcccgcg accttaatgg cctgaaggcc tggcgggccg cgtttgttgg
     1621 gactgccgca tactgcgtag caccttaccc tgatgatatc cccctctgaa caggagagtc
     1681 agtcgtgaac aaaaaagcca tggccgcggc ggtcagcatg atcctcgccg gtggcgcgca
     1741 cgccgcgcag caggagcgtc caaacgtcat cgtgattatc gccgatgata tgggctactc
     1801 ggacatcagc ccctttggcg gcgagatccc cacccccaac ctgcaggcga tggccgagca
     1861 gggaatgcgc atgagccagt attacacctc gccgatgtcg gccccggcgc gctcaatgct
     1921 gctcaccggc aacagtaacc agcaggctgg gatgggcggc atgtggtggt acgacagcac
     1981 cattggcaag gagggctacg agctgcggtt gaccgaccgc gtcaccacca tggccgagcg
     2041 ctttaaagac gcggggtata acaccctgat ggccggtaaa tggcacctcg gttttgttcc
     2101 cggcgccacg ccgaaagatc gcggctttaa tcacgccttc gcctttatgg gcggcggcac
     2161 cagccacttt aacgacgcga taccgctggg gaccgttgaa gccttccaca cctactacac
     2221 ccgcgacggc gagcgcgtct ccctaccgga tgatttttac tccagcgaag cctacgcccg
     2281 ccagatgaac agctggatta aagcgacgcc gaaggaacag ccggtcttcg cctggctggc
     2341 cttcaccgcc cctcatgatc ccctgcaggc gcctgacgag tggattaagc gcttcaaagg
     2401 ccagtatgag cagggttacg ccgaggtcta tcgccagcgc atcgcccgcc tgaaagcgct
     2461 gggaattatc catgacgata ccccgctgcc gcatctggag ctggataaag agtgggaagc
     2521 gctaacgcca gagcagcaga aatatacggc gaaagtgatg caggtgtatg cggcgatgat
     2581 cgccaatatg gacgcgcaaa tcggcaccct gatggagacg ctgaaacaaa ccgggcgcga
     2641 taaaaatacc ctgctggtct ttttaaccga taacggcgcc aacccggcgc agggtttcta
     2701 ctacgaatct accccggaat tctggaagca gttcgataac agctatgaca acgtcggccg
     2761 caaaggatca tttgtctcct acggcccgca ctgggccaac gtcagcaacg ccccctacgc
     2821 caattatcac aaaaccacca gcgcccaggg cggcatcaat accgacttta tgatctccgg
     2881 tcccgggatc acccgccacg gtaaaatcga cgcctcgacg atggcggtgt atgacgtggc
     2941 gccgacgcta tatgaattcg ccggcatcga tccgaacaag tcgctggcga aaaagccggt
     3001 gttgccgatg atcggcgtca gtttaagcgc tatctcaccg gcgaagtaca ggagccgccg
     3061 cgccgaacta cggggttgaa ctgcatcatc aggcggcctg ggtcgatggc gaatggaagc
     3121 tgcgacggct ggtgccgcgc ggcctcaccg ccggcgacgc gccgtggcag ctatttaatc
     3181 tgcacgacga cccgctggag acgcatgatg tcgcggccga acatccgatc gggtcaaagc
     3241 catgagcgag gcctacgagg catttgctaa gcgcaccatg gttaccaaag cgcagggcaa
     3301 aatgaatgct acgtcggtat cgat
//
LOCUS       RATGAH       1003 bp ss-mRNA            ROD       27-JUN-1990
DEFINITION  Rat L-glutamine amidohydrolase mRNA, 3' end.
ACCESSION   J05499
KEYWORDS    L-glutamine amidohydrolase.
SOURCE      Rat (strain Sprague-Dawley) liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1003)
  AUTHORS   Smith,E.M. and Watford,M.
  TITLE     Molecular cloning of a cDNA for rat hepatic glutaminase: Sequence
            similarity to kidney-type glutaminase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Waterford, 08-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      660     L-glutamine amidohydrolase (EC 3.5.1.2)
BASE COUNT      270 a    243 c    258 g    232 t
ORIGIN      
        1 ttccggatgt gtctgtcacc cccgttagac aagctgggga acagccacag gggcatcagc
       61 ttctgccaga agttggtgtc tctgtttaac ttccacaact acgacaacct gcggcactgt
      121 gctcggaagt tagacccacg gagggaaggg ggggaagttc ggaacaagac cgtggtgaac
      181 ctgttatttg ctgcatatag tggagatgtc tcagctcttc gaaggtttgc cttgtctgcc
      241 gtggatatgg agcagaagga ctatgattcc cgcacagccc tacatgtggc ggcagcggaa
      301 ggacacattg acgttgtcaa gtttctgatc gaggcttgca aagtgaatcc ttttgtcaag
      361 gacaggtggg gcaacattcc cctggatgat gccctgcagt tcaatcacct ggaggtggtc
      421 aaactgcttc aggattacca tgactcctac atgctgtctg agactcaagc tgaggtacag
      481 ctgagactct gtcaaaagag aactgagaga gcatgtgtga gcacaggcca gggcagcccg
      541 tgctcaagaa aaagcatgag cgggccacaa tttaacccaa ggccaccaaa aatactattg
      601 caagctgctt cagtgggatc aacacagcca tctggtgaca caggccagtg ttttctgtga
      661 gaatcaaaat gccccattcc ctcatcggac agcacagaga aaagcttcag tggacacctg
      721 agcagagcta gccacggaga cctcaaggta tagcttaagt gacatcctcc accagaaagt
      781 agcccaggct tttacccagg tccccatttc aacttccttg gagagcgtct agctacatgc
      841 atatgtatct gtcacagagc aagagaggtg ggtgagagcc caatcacctg gctttagaaa
      901 tctgcagaga tctgtccatc ttagccaaga catgctgcta ctgctgacag gagttttata
      961 gacaaagtat tttgtgttca aataaacttt aattaccgga att
//
LOCUS       PSEPROC      1167 bp ds-DNA             BCT       27-JUN-1990
DEFINITION  P.aeruginosa delta-1-pyrroline-5-carboxylate reductase gene,
            complete cds.
ACCESSION   M33557
KEYWORDS    delta-1-pyrroline-5-carboxylate reductase.
SOURCE      P.aeruginosa (strain PAO1) DNA.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1167)
  AUTHORS   Savoiz,A., Jeenes,D.J., Kocher,H.P. and Haas,D.
  TITLE     Comparison of proC and other housekeeping genes of Pseudomonas
            aeruginosa with their counterparts in Escherichia coli
  JOURNAL   Gene 86, 107-111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Savioz, 04-APR-1990.
FEATURES       from  to/span     description
    pept        102      923     delta-1-pyrroline-5-carboxylate reductase (EC
                                 1.5.1.2)
    binding      90       94     ribosome binding site
    binding     921      927     ribosome binding site
BASE COUNT      182 a    431 c    380 g    174 t
ORIGIN      Chromosome PAO1.
        1 ctcgaggcag ccatcggcga aggtgcgacc tgggtccgca tcggtaccgc cctgttcggc
       61 gcccgcgact acggcgcgcc ggcttcttga aggaatcccc tatgagcaca ccccgcatag
      121 cgttcatcgg cgccggcaac atggccgcca gcctgatcgg cggcctgcgc gcccaaggcg
      181 tgccggcggc gcagatccgc gccagcgacc cgggcgccga gcaacgcgcg aagatcgccg
      241 gcgagttcgc catcgacgtg gtcgagtcca acgccgaggc cgtggccgac gccgacgtcg
      301 tggtcctgtc ggtcaagccg caggccatga aagccgtgtg ccaggccttg gcgccggcgc
      361 tgaagccgga gcaactgatc gtctccatcg ccgccggcat cccctgcgcc agcctcgaag
      421 cctggctcgg ccagccgcgc ccggtggtcc gctgcatgcc caacaccccg gcgctgctgc
      481 gccagggcgc cagcgggctg tacgccaacg cgcaggtcag cgccgcgcaa tgcgagcagg
      541 ccgggcagtt gctgtccgcg gtgggcatcg ccctgtggct ggacgacgaa gcgcagatcg
      601 acgcggtgac cgcagtgtcg ggcagcggcc cggcgtattt cttcctgctg atgcaggcca
      661 tgaccgacgc cggcgagaag ctcggcctgt cccgcgagac agcctcgcgc ctgaccctgc
      721 agaccgccct cggtgcggcg cagatggcgc tgtccagcga ggtcgagccg gccgaactgc
      781 ggcgccgggt gacttcgccc aacggcacca ccgaggcggc gatcaaatcc ttccaggcca
      841 acggcttcga agcactggtg gagcaggccc tgaacgccgc cagccagcgc tccgccgagc
      901 tggccgaaca gcttggccaa taaggagtag tcgatgatcg gcctcaacac cgctgctatc
      961 tatatcctcc agaccctcgg cagcctctac ctgctgatcg tgctgctgcg cttcatcctg
     1021 cagttggtcc gcgcggactt ctacaacccc ctcagccagt tcatcgtgcg cgccaccaag
     1081 ccgctgctga acccgctgcg gcggatcatc cccggcttcg gcgggatcga cctggcctcg
     1141 ctggtgctgg cgatcctgat ccagctg
//
LOCUS       ECOPEPD      2311 bp ds-DNA             BCT       27-JUN-1990
DEFINITION  E.coli, peptidase D (pepD) gene, complete cds.
ACCESSION   M34034
KEYWORDS    peptidase D.
SOURCE      E.coli (strain K12; substrain CM17) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2311)
  AUTHORS   Henrich,B., Monnerjahn,U. and Plapp,R.
  TITLE     Peptidase D gene (pepD) of Escherichia coli K12: Nucleotide
            sequence, transcript mapping, and comparison with other peptidase
            genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Henrich, 04-MAY-1990.
            
                  Author address: B.Henrich
                  Universitaet Kaiserlautern, Fac Biologie,
                  Postach 3049, D-6750 Kaiserslautern, FRG
FEATURES       from  to/span     description
    pept        187     1644     peptidase D
    mRNA         94     1693     peptidase D mRNA
    mRNA        127     1693     peptidase D mRNA (alt.)
    binding     173      179     ribosome binding site
    signal       83       88     TATA box
    signal      113      118     TATA box
BASE COUNT      558 a    608 c    590 g    555 t
ORIGIN      
        1 gatctggcgc actaaaaacc agtatttcac atgagtccgc gtctttttac gcactgcctc
       61 tccctgacgc gggataaagt ggtattctca aacatatctc gcaagcctgt cttgtgttga
      121 caacattttc tgctaaccct gtgacctgca atactgtttt gcgggtgatc gacaaggaga
      181 cttaacgtgt ctgaactgtc tcaattatct ccacagccgc tgtgggatat ttttgccaaa
      241 atctgttcta ttcctcaccc gtcctatcat gaagagcaac tcgctgaata cattgttggt
      301 tgggcaaaag agaaaggttt ccatgtcgaa cgcgatcagg taggtaatat cctgattcgt
      361 aaacctgcta ccgcaggtat ggaaaatcgt aaaccggtcg tcttacaggc ccacctcgat
      421 atggtgccgc agaaaaataa cgacaccgtg catgacttca cgaaagatcc tatccagcct
      481 tatattgatg gcgaatgggt taaagcgcgc ggcaccacgc tgggtgcgga taacggcatt
      541 ggtatggcct ctgcgctggc ggttctggct gacgaaaacg tggttcacgg cccgctggaa
      601 gtgctgctga ccatgaccga agaagccggt atggacggtg cgttcggctt acagggcaac
      661 tggttgcagg ctgatattct gattaacacc gactccgaag aagaaggtga aatctacatg
      721 ggttgtgcgg ggggtatcga cttcacctcc aacctgcatt tagatcgtga agcggttcca
      781 gctggttttg aaaccttcaa gttaacctta aaaggtctga aaggcggtca ctccggcggg
      841 gaaatccacg ttgggctggg taatgccaac aaactgctgg tgcgcttcct ggcgggtcat
      901 gcggaagaac tggatctgcg ccttatcgat ttcaacggcg gcacactgcg taacgccatc
      961 ccgcgtgaag cctttgcgac cattgctgtc gcagctgata aagtcgacgt cctgaaatct
     1021 ctggtgaata cctatcagga gatcctgaaa aacgagctgg cagaaaaaga gaaaaatctg
     1081 gccttgttgc tggactctgt agcgaacgat aaagctgccc tgattgcgaa atctcgcgat
     1141 acctttattc gtctgctgaa cgccaccccg aacggtgtga ttcgtaactc cgatgtagcc
     1201 aaaggtgtgg ttgaaacctc cctgaacgtc ggtgtggtga ccatgactga caataacgta
     1261 gaaattcact gcctgatccg ttcactgatc gacagcggta aagactacgt ggtgagcatg
     1321 ctggattcgc tgggtaaact ggctggcgcg aaaaccgaag cgaaaggcgc atatcctggc
     1381 tggcagccgg acgctaattc tccggtgatg catctggtac gtgaaaccta tcagcgcctg
     1441 ttcaacaaga cgccgaacat ccagattatc cacgcgggcc tggaatgtgg tctgttcaaa
     1501 aaaccgtatc cggaaatgga catggtttct atcgggccaa ctatcaccgg tccacactct
     1561 ccggatgagc aagttcacat cgaaagcgta ggtcattact ggacactgct gactgaactg
     1621 ctgaaagaaa ttccggcgaa gtaattattt gatttgctgc cggatggcgt ttaatcgcct
     1681 tccggcagtt tcatccttca ttatccttcg ataaaagcca tccctgtaaa tgtccgtcgc
     1741 gggttgccac gttcaatctg gtgatggaac attcgccgct gcgatttcag cgccgcgcta
     1801 ttttcctgtt gctgttgctc cagcttccag gcaatcagca atcgtgccag ccgcttgtta
     1861 gcatgctgac tacgctctga ctgaaccttc acgctaatac cggatgccaa atgcgtggcg
     1921 cgtaccgccg agtcggtttt attgacatgt tgaccgcccg gccccgacga acgcagcgtc
     1981 tcataacgga ttgcatccga ttgttcctgc tcatcagcgg taaaacgccc aatgcccaga
     2041 aaccagtttt tgcgcccatg atgaggccga tacggactcg gacaaatcca ctgaatagtg
     2101 ccacaccact ttcgcttaat gcccatgcgt tatcgccatc gagagaaacc agcgccgaac
     2161 gcagtgtgtc agagtagcgg cccgtttctg tttccagcac cgttaccgcg acgtcttgcc
     2221 gggtaaaaac gaccaacaca gaagaagctg cgccaatcgg gtagccgcgt ccggggtgta
     2281 aatctggcat ccccactacg cgctgcatgt t
//
LOCUS       HUMTGFB1B    5089 bp ss-mRNA            PRI       27-JUN-1990
DEFINITION  Human transforming growth factor beta-1-binding protein mRNA,
            complete cds.
ACCESSION   M34057
KEYWORDS    transforming growth factor beta-1-binding protein.
SOURCE      Human fibroblast, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 5089)
  AUTHORS   Kanzaki,T., Olofsson,A., Moren,A., Wernstedt,C., Hellman,U.,
            Miyazono,K., Claesson-Welsh,L. and Heldin,C.-H.
  TITLE     TGF-beta-1 binding protein: A component of the large latent complex
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.-H.Heldin, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         91     4275     transforming growth factor beta-1-binding
                                 protein precursor
    sigp         91      151     transforming growth factor beta-1-binding
                                 protein signal peptide
    matp        152     4272     transforming growth factor beta-1-binding
                                 protein
BASE COUNT     1355 a   1127 c   1255 g   1352 t
ORIGIN      
        1 tgaattcggc acgaggtggg ctctctcttt tcccctcttg ctcctttctt ttcttttttt
       61 ctgttttttt aaaccttcca aggcaagttc atggatacta agctgatgtg tttgttgttc
      121 tttttctccc tgcctccgct cctagtgagt aaccacactg gccgcatcaa ggtggtcttt
      181 actccgagca tctgtaaagt gacctgcacc aagggcagct gtcagaacag ctgtgagaag
      241 gggaacacca ccactctcat tagtgagaat ggtcatgctg ccgacaccct gacggccacg
      301 aacttccgag tggtaatttg ccatcttcca tgtatgaatg gtggccagtg cagttcaagg
      361 gacaaatgtc agtgccctcc aaatttcaca ggaaaacttt gtcagatccc agtccatggt
      421 gccagcgtgc ctaaacttta tcagcattcc cagcagccag gcaaggcgtt ggggacgcat
      481 gtcatccatt caacacatac cttgcctctg accgtgacta gccagcaagg agtcaaagtg
      541 aaatttcctc ctaacatagt caatatccat gtgaaacatc ctcctgaagc ttccgtccag
      601 atacatcagg tttcaagaat tgatggccca acaggccaga agacaaaaga agctcaacca
      661 ggccaatccc aagtctcgta ccaagggctt cctgtccaga agacccagac catacattcc
      721 acatactccc accagcaggt cattcctcac gtctaccccg tggctgctaa gacacagctt
      781 ggccggtgct tccaggaaac cattgggtca cagtgtggca aagcgctccc tggcctttca
      841 aagcaagagg actgctgtgg aactgtgggt acctcctggg gctttaacaa atgccagaaa
      901 tgccccaaga aaccatctta tcatggatac aaccaaatga tggaatgcct accgggttat
      961 aagcgggtta acaacacctt ttgccaagat attaatgaat gtcagctaca aggtgtatgc
     1021 cctaatggtg agtgtttgaa taccatgggc agctatcgat gtacctgcaa aataggattt
     1081 gggccggatc ctaccttttc aagttgtgtt cctgatcccc ctgtgatctc ggaagagaaa
     1141 gggccctgtt accgacttgt cagttctgga agacagtgta tgtaccctct gtctgttcac
     1201 ctcaccaagc agctctgctg ttgtagtgtg ggcaaggctg ggccacactg tgagaaatgt
     1261 ccccttccag gcacagctgc ttttaaggaa atctgtcctg gtggaatggg ttatacggtt
     1321 tctggcgttc atagacgcag gccaatccat caccatgtag gtaaaggacc tgtatttgtc
     1381 aagccaaaga acactcaacc tgttgctaaa agtactcatc ctccacctct cccagccaag
     1441 gaagagccag tggaggccct gaccttctcc cgggaacacg gggccaggag tgcggagcca
     1501 gaagtggcaa ctgcaccccc tgaaaaggaa ataccttcat tggatcaaga gaaaaccaaa
     1561 cttgagcctg gtcaacccca gctgtctcca ggcatttccg ctattcatct gcatccacag
     1621 tttccagtag tgattgaaaa aacatcacct cctgtgcctg ttgaagtagc tcctgaagct
     1681 tctacgtcta gtgccagcca agtgattgct cctactcaag tgacagaaat caatgaatgt
     1741 actgtgaacc ctgatatctg tggagcagga cactgcatta acctaccagt gagatatacc
     1801 tgtatatgct acgagggcta caggttcagt gaacaacaga ggaaatgtgt ggatattgat
     1861 gagtgtactc aggtccaaca cctctgctcc cagggccgct gtgaaaacac cgagggaagt
     1921 ttcttgtgca tttgcccagc aggatttatg gccagtgagg agggtactaa ctgcatagat
     1981 gttgacgaat gcctgaggcc ggacgtctgt ggggaggggc actgtgtcaa tactgtgggg
     2041 gccttccggt gtgaatactg tgacagcggg taccgcatga ctcagagagg ccgttgtgag
     2101 gatattgatg aatgtttgaa tccaagcact tgtccagatg agcagtgtgt gaattctcct
     2161 ggatcttacc agtgcgttcc ctgcacagaa ggattccgag gctggaatgg acagtgcctt
     2221 gatgtggacg agtgcctgga accaaacgtc tgcgcaaatg gtgattgttc caaccttgaa
     2281 ggctcctaca tgtgttcatg ccacaaaggc tatacccgga ctccggacca caagcactgt
     2341 agagatattg atgaatgtca gcaagggaat ctatgtgtaa acgggcagtg caaaaatacc
     2401 gagggctcct tcaggtgcac ctgtggacag gggtaccagc tgtcggcagc taaagaccag
     2461 tgtgaagaca ttgatgaatg ccagcaccgt catctctgtg ctcatgggca gtgcaggaac
     2521 actgagggct cttttcaatg tgtgtgtgac cagggttaca gagcatctgg gcttggagac
     2581 cactgtgaag atatcaatga atgcttggag gacaagagtg tttgccagag aggagactgc
     2641 attaatactg cagggtccta tgattgtact tgtccggatg gatttcagct agatgacaat
     2701 aaaacatgtc aagatattaa tgaatgtgaa catccagggc tctgtggtcc gcaaggggag
     2761 tgcctaaaca cagagggttc tttccattgt gtctgccagc agggtttctc aatctctgca
     2821 gatggccgta cgtgtgaaga tattgatgaa tgtgtaaaca acactgtttg tgacagtcac
     2881 gggttttgtg acaatacagc tggctccttc cgctgcctct gttatcaggg ctttcaagcc
     2941 ccacaggatg ggcaagggtg tgtggatgtg aatgaatgtg aactgctcag tggggtgtgt
     3001 ggtgaagcct tctgtgaaaa cgtggaaggg tccttcctgt gcgtgtgtgc tgatgaaaac
     3061 caagagtaca gccccatgac tgggcagtgc cgctcccgga cctccacaga tttagatgta
     3121 gatgtagatc aacccaaaga agaaaagaaa gaatgctact ataatctcaa tgacgccagt
     3181 ctctgtgata atgtgttggc ccccaatgtc acgaaacaag aatgctgctg tacatcaggc
     3241 gcgggatggg gagataactg cgaaatcttc ccctgcccgg tcttgggaac tgctgagttc
     3301 actgaaatgt gtcccaaagg gaaaggtttt gtgcctgctg gagaatcatc ttctgaagct
     3361 ggtggtgaga actataaaga tgcagatgaa tgcctacttt ttggacaaga aatctgcaaa
     3421 aatggtttct gtttgaacac tcggcctggg tatgaatgct actgtaagca agggacgtac
     3481 tatgatcctg tgaaactgca gtgctttgat atggatgaat gtcaagaccc cagtagttgt
     3541 attgatggcc agtgtgttaa tacagagggc tcttacaact gcttctgtac tcaccccatg
     3601 gtcctggatg cgtcagaaaa aagatgtata cgaccggctg agtcaaacga acaaatagaa
     3661 gaaactgatg tctaccaaga tttgtgctgg gaacatctga gtgatgaata cgtgtgtagc
     3721 cggcctcttg tgggcaagca gacaacgtac actgagtgct gctgtctgta tggagaggcc
     3781 tggggcatgc agtgtgccct ctgccccctg aaggattcag atgactatgc tcagctgtgt
     3841 aacatccccg tgacgggacg ccggcagcca tatggacggg acgccttggt tgacttcagt
     3901 gaacagtata ctccagaagc cgatccctac ttcatccaag accgttttct aaatagcttt
     3961 gaggagttac aggctgagga atgcggcatc ctcaatggat gtgaaaatgg tcgctgtgtg
     4021 agggtccagg aaggttacac ctgcgattgc ttggatgggt atcacttgga tactgccaag
     4081 atgacctgtt tcgatgtaaa tgaatgcgat gagttgaaca accggatgtc tctctgcaag
     4141 aatgccaagt gcattaacac cgatggttcc tacaagtgtt tgtgtctgcc aggctacgtg
     4201 ccttctgaca agccaaacta ctgcactccg ttgaataccg ccttgaattt agagaaagac
     4261 agtgacctgg agtgaaacag aatctacata acctaagccc atatactctg cactgtgtaa
     4321 aggaaaaggg agaaatgtat tatacttgag acattgcacc taccccggaa ggctggaaat
     4381 acagaaacag catggagttg caagtcctct gaagacaatg agaggattta ggatgagccc
     4441 gataggtgtg gcagaccaaa tggacatttc tctaaaaaac cagtatatat agtctgttca
     4501 tatgtaaaat tcaatggaag agaggtggaa cagtgctgtt attttaaaca gaaggttgta
     4561 ttattatgtt gttttgtttt ttttactatt gcttgattaa atttggcatt taaatagtgg
     4621 tggaaatatt tttatataat tttcattttt tggttgtgca gttccttggc tactgttttt
     4681 cttttacttc agttttttaa aaatctcaaa tgaaaaagtc ttcgatacaa tattgttaag
     4741 ctgtattata agtattgtta cacagggtta tgcaattccc ggcctggagc atttttgaaa
     4801 ttcaaattgt ctgtcctgtg gagcaggcag tgattttgtt ccaaaacttt gtatacacat
     4861 ttggagaaaa gtactttata ttttcagtgt tttgtctgat tttaatgtcc gttcttagcc
     4921 aagctgctag caggtgttaa ttggatccct ttccttcact gaaatggaag agtttataag
     4981 cttacgttag tattgtaata tgtaaagtaa gcccaacaaa aatttttaaa aatttgatga
     5041 tccccaatat atctaccatt gtatgttaaa taaatctcgt gccgaattc
//
LOCUS       ALRVSRC      1801 bp ss-RNA             VRL       27-JUN-1990
DEFINITION  Rous sarcoma virus (Schmidt-Ruppin D strain) v-src gene, complete
            cds.
ACCESSION   M33292
KEYWORDS    oncogene; pp60v-src; src gene; tyrosine kinase.
SOURCE      Rous sarcoma virus (strain Schmidt-Ruppin D) RNA, clone psrc1.
  ORGANISM  Rous sarcoma virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Avian sarcoma viruses.
REFERENCE   1  (bases 1 to 1801)
  AUTHORS   Reddy,S., Mazzu,D., Mahan,D. and Shalloway,D.
  TITLE     Sequence and functional differences between Schmidt-Ruppin D and
            Schmidt-Ruppin A strains of pp60v-src
  JOURNAL   J. Virol. 64, 3545-3550 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.I.Shalloway, 26-MAR-1990.
FEATURES       from  to/span     description
    pept         65     1645     pp60v-src protein
BASE COUNT      392 a    537 c    551 g    321 t
ORIGIN      
        1 actctgctgg tggcctcgcg taccactgtg gccaagcggt agctggaacg tgcagccgac
       61 caccatgggg agtagcaaga gcaagcctaa ggaccccagc cagcgccggc gcagcctgga
      121 gccacccgac agcacccacc acgggggatt cccagcctcg cagaccccca acaagacagc
      181 agcccccgac acgcaccgca cccccagccg ctccttcggg accgtggcca ccgagcccaa
      241 gctcttcgag gacttcaaca cttctgacac cgttacgtcg ccgcagcgtg ccggggcact
      301 ggctggcggc gtcaccactt tcgtggctct ctacgactac gagtcctgga ttgaaacgga
      361 cttgtccttc aagaaaggag aacgcctgca gattgtcaac aacacggaag gtaactggtg
      421 gctggctcat tccgtgacta caggacagac gggctacatc cccagtaact atgtcgcgcc
      481 ctcagactcc atccaggctg aagagtggta ctttgggaag atcactcgtc gggagtccga
      541 gcggctgctg ctcaaccccg aaaacccccg gggaaccttc ttggtccggg agagcgagac
      601 gacaaaaggt gcctattgcc tctccgtttc tgactttgac aacgccaagg ggctcaatgt
      661 gaagcactac aagatccgca agctggacag cggcggcttc tacatcacct cacgcacaca
      721 gttcagcagc ctgcagcagc tggtggccta ctactccaaa catgctgatg gcttgtgcca
      781 ccgcctgacc aacgtctgcc ccacgtccaa gccccagacc cagggactcg ccaaggacgc
      841 gtgggaaatc ccccgggagt cgctgcggct ggaggtgaag ctggggcagg gctgctttgg
      901 agaggtctgg atggggacct ggaacggcac caccagagtg gccataaaga ctctgaagcc
      961 cggcaccatg tccccggagg ccttcctgca ggaagcccaa gtgatgaaga agctccagca
     1021 tgagaagctg gttcaactgt acgcagtcgt gtcggaagag cccatctaca tcgtcattga
     1081 gtacatgagc aaggggagcc tcctggattt cctgaaggga gagatgggca agtacctgcg
     1141 gctgccacag ctcgttgata tggctgatca gattgcatcc ggcatggcct atgtggagag
     1201 gatgaactac gtgcaccgag acctgcgggc ggccaacatc ctggtggggg agaacctggt
     1261 gtgcaaggtg gctgactttg ggctggcacg cctcatcgag gacaacgagt acacagcacg
     1321 gcaaggtgcc aagttcccca tcaagtggac agcccccgag gcagccctct atggccggtt
     1381 caccatcaag tcggatgtct ggtccttcgg catcctgctg actgagctga ccaccaaggg
     1441 ccggatgcca tacccaggga tgggcaacgg ggaggtgctg gaccgggtgg agaggggcta
     1501 ccgcatgccc tgcccgcccg agtgccccga gtcgctgcat gaccttatgt gccagtgctg
     1561 gcggagggac cctgaggagc ggcccacttt tgagtacctg caggcccagc tgctccctgc
     1621 ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt taagctacaa caaggcaagg
     1681 cttggccgac aattgcatga agaatctgct tagggttagg cgttttgcgc tgcttcgcga
     1741 tgtacgggcc agatatacgc gtatctgagg ggactagggt gtgtttaggc gaaaagcggg
     1801 g
//
LOCUS       RATLACREC    2041 bp ss-mRNA            ROD       27-JUN-1990
DEFINITION  Rat lactogen receptor mRNA, complete cds.
ACCESSION   M34083
KEYWORDS    lactogen receptor.
SOURCE      Rat (strain Sprague-Dawley) luteinized ovary, cDNA to mRNA, clone
            R1.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2041)
  AUTHORS   Zhang,R., Buczko,E., Tsai-Morris,C.H., Hu,Z.-Z. and Dufau,M.L.
  TITLE     Isolation and characterization of two novel rat ovarian lactogen
            receptor cDNA species
  JOURNAL   Biochem. Biophys. Res. Commun. 168, 415-422 (1990)
  STANDARD  full staff_review
COMMENT     Authorin submission for [1] kindly submitted by M.L.Dufau,
            08-MAY-1990.
FEATURES       from  to/span     description
    pept         93     1925     lactogen receptor precursor
    sigp         93      149     lactogen receptor signal peptide
    matp        150     1922     lactogen receptor
    signal     2031     2036     poly-A signal
BASE COUNT      615 a    497 c    460 g    469 t
ORIGIN      
        1 gtgagcactg cagatgtttt gcacatgagc cctgaaggga gcctctgata cattgcctat
       61 agcaagaaga aggggccaac ctgaagaaaa acatgccatc tgcacttgct ttcgtcctac
      121 ttgttctcaa catcagcctc ctgaagggac agtcaccacc agggaaacct gagatccaca
      181 aatgtcgctc tcctgacaag gaaacattca cctgctggtg gaatcctggg acagatggag
      241 gacttcctac caattattca ctgacttaca gcaaagaagg agagaaaacc acctacgaat
      301 gtccagacta caaaaccagt ggccccaact cctgcttctt tagcaagcag tacacttcca
      361 tctggaaaat atatatcatc acagtaaatg ccacgaacca aatgggaagc agttcctcgg
      421 atccacttta tgtggatgtg acttacatcg ttgagccaga gcctcctcgg aacctgacat
      481 tagaagtaaa acagctaaaa gacaaaaaaa catatctgtg ggtaaaatgg tccccaccca
      541 ccataactga tgtgaaaact ggttggttta caatggaata tgaaattcga ttaaagcctg
      601 aagaagcaga agagtgggag atccatttta caggtcatca aacacagttt aaagtttttg
      661 acctatatcc agggcaaaag tatcttgtcc agactcgctg caagccagac catggatact
      721 ggagtagatg gagccaggag agttccgttg aaatgccaaa tgacttcacc ttgaaggaca
      781 caaccgtgtg gatcattgcg gccattctct ctgctgtcat ctgtttgatt atggtctggg
      841 cagtggcttt gaagggctat agcatgatga cctgcatctt tccaccagtt cctgggccaa
      901 aaataaaagg atttgatacc catctgctgg agaagggcaa gtctgaagag ctgctgagtg
      961 ccttggggtg ccaagacttt ccccctactt ctgactgtga ggacttgctg gtggagttct
     1021 tagaagttga tgacaatgag gacgagcggc taatgccatc ccattccaaa gagtatccag
     1081 gtcaaggtgt taagcccaca cacctagatc ccgacagtga ctctgttcac ggaagctatg
     1141 acagccattc tcttttatct gaaaagtgtg aggaacccca ggcctacccc cctactttgc
     1201 acatccctga gatcactgag aagccagaga atcctgaagc aaatattcct cccaccgtgg
     1261 acccccaaag caccaacccc aattttcatg tagatgcacc caaatcttca acatggccat
     1321 tactgcctgg ccaacacatg cccagatctc cttaccacag tgttgctgat gtgtgcaagc
     1381 tagccggaag tcctgtgaat acactggact ctttcttgga caaagcagag gaaaatgttc
     1441 taaagttgtc taaagccctt gagactggag aggaagaagt ggctgaggaa aaagggggaa
     1501 aaagcttccc ttctgacaaa caaaacacac cttggccgct gctccaggag aaaagcccca
     1561 ctgtctatgt taaaccccca gattatgtgg agattcacaa agtcaacaaa gatggagtgc
     1621 tatcattatt ccccaagcag agagaaaaca accagacaga gaagcctggg gttcctgaaa
     1681 ccagtaagga gtatgccaag gtgtctggca ttacggataa caatatcctc gtattagtgc
     1741 cagactcacg agccaagaac acagcgttgc tcgaggaatc agccaagaag gctccaccat
     1801 cgtttgaagc tgaccaatct gagaaagatc tggccagctt cactgcaacc tcaagcaacc
     1861 gcagactcca actgggtagg ctggattacc tggatcctac gtgcttcatg cactcctttc
     1921 actgatagct agacttatgg aacgattggc taaactgtga tttctcttca ggtaacacta
     1981 cagagtcatg aagtaatgtg gtctgctagc aaatgttaca ggatgtgggt ataaaaaaaa
     2041 a
//
LOCUS       YSCCP1A      1403 bp ds-DNA             PLN       27-JUN-1990
DEFINITION  S.cerevisiae centromere-binding protein CP1 gene, complete cds.
ACCESSION   M34070
KEYWORDS    CP1 protein; DNA-binding protein; centromere-binding protein.
SOURCE      S.cerevisiae (strain X2180) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1403)
  AUTHORS   Baker,R.E. and Masison,D.C.
  TITLE     Isolation of the gene encoding the Saccharomyces cerevisiae
            centromere-binding protein CP1
  JOURNAL   Mol. Cell. Biol. 10, 2458-2467 (1990)
  STANDARD  full staff_review
COMMENT     Authorin submission for [1] kindly submitted by R.E.Baker,
            25-APR-1990.
FEATURES       from  to/span     description
    pept        256     1311     centromere-binding protein CP1
BASE COUNT      508 a    284 c    310 g    301 t
ORIGIN      
        1 ggatccttcg ggaccaccat gtctcaccca ccaaggtcaa aaaaaataaa atcggacgag
       61 aaaagtattg ggcaaacaac aattatccac gtgcttatca cgttttttta tttaatgtag
      121 aatgtaaacc attatacctt aaagataaac actttgttaa ttattcgttt tagttggcgg
      181 tcattttcat ttctgctaca agtaccaaca tcaagtgctt aaaatataat acggttttct
      241 acacttttat taacgatgaa ctctctggca aataataata agctttctac tgaggatgag
      301 gaaatccatt cggcaagaaa aagaggctat aacgaggagc agaactacag cgaggccagg
      361 aaaaagcaaa gagatcaagg cttgttgtca caagagagca atgacggaaa tattgactct
      421 gctctgttga gcgaaggggc tacgctaaaa gggactcaaa gtcagtatga aagcggactg
      481 acgtctaaca aagatgagaa aggaagtgat gacgaagatg cgtcagtggc tgaggctgct
      541 gttgccgcta ccgtcaatta tacagactta atccagggcc aggaagatag ttctgatgcc
      601 cacacatcta atcaaacgaa cgcgaatggt gagcacaagg attcccttaa tggggaaaga
      661 gccataaccc catcaaatga aggagtcaag ccaaacacct ccctggaggg aatgacgtct
      721 tcaccaatgg aatctacaca gcaatctaaa aacgacatgc tcataccatt ggcagaacac
      781 gatcgtgggc cagaacacca acaggatgat gaagacaatg atgacgctga cattgatctc
      841 aaaaaggata taagcatgca gccgggtcgt cgtggaagaa aacctactac tttggccaca
      901 acagacgagt ggaaaaagca aagaaaagac tcccataaag aagtcgaaag gcgtcgccgc
      961 gaaaacatca acactgcaat caacgtttta agcgacctcc tgcccgtgag agaatcaagt
     1021 aaggcagcaa tattggcgcg tgccgctgaa tacattcaaa aattgaaaga aaccgatgaa
     1081 gcaaacatcg aaaagtggac gttacaaaaa ttgcttagcg agcaaaacgc atcgcaatta
     1141 gccagtgcaa atgagaaact gcaggaagaa ctgggaaatg cttacaaaga aatcgagtac
     1201 atgaaacgcg ttttaaggaa ggagggaata gaatacgagg atatgcacac tcacaagaaa
     1261 caagagaacg aaagaaaaag cactaggagc gataatccac atgaggcttg agagttaaaa
     1321 atagatagct aaatgtattt cgagtctccc tatgtatctg cgtctctgcc ccatagcacc
     1381 tctgttaagt ttttatcaat att
//
LOCUS       MUSCD3A      1347 bp ss-mRNA            ROD       27-JUN-1990
DEFINITION  Mouse CD3-eta protein mRNA, complete cds.
ACCESSION   M33158
KEYWORDS    CD3-eta protein.
SOURCE      Mouse (strain B6/CBA F1J) female thymus, cDNA to mRNA, clone pBS17.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1347)
  AUTHORS   Jin,Y.-J., Clayton,L.K., Howard,F.D., Koyasu,S., Sieh,M.,
            Steinbrich,R., Tarr,G.E. and Reinherz,E.L.
  TITLE     Molecular cloning of the CD3-eta subunit identifies a
            CD3-zeta-related product in thymus-derived cells
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3319-3323 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.L.Reinherz, 22-MAR-1990.
FEATURES       from  to/span     description
    pept        139      759     CD3-eta protein precursor
    sigp        139      201     CD3-eta signal peptide
    matp        202      756     CD3-eta protein
    mRNA    <     1     1347     CD3-eta mRNA
    signal     1329     1334     poly-A signal
BASE COUNT      374 a    342 c    336 g    295 t
ORIGIN      
        1 cgaggagagg cacccggtac atgttctcta ggagaacaaa gccagcagag actccatcag
       61 cgcctccttt tctcctcatc ctcccaggca tagctgcctc tgcctctgcc tctgggtacc
      121 atcccaggga agcagaagat gaagtggaaa gtgtctgttc tcgcctgcat cctccacgtg
      181 cggttcccag gagcagaggc acagagcttt ggtctgctgg atcccaaact ctgctacttg
      241 ctagatggaa tcctcttcat ctacggagtc atcatcacag ccctgtacct gagagcaaaa
      301 ttcagcagga gtgcagagac tgctgccaac ctgcaggacc ccaaccagct ctacaatgag
      361 ctcaatctag ggcgaagaga ggaatatgac gtcttggaga agaagcgggc tcgggatcca
      421 gagatgggag gcaaacagca gaggaggagg aacccccagg aaggcgtata caatgcactg
      481 cagaaagaca agatggcaga agcctacagt gagatcggca caaaaggcga gaggcggaga
      541 ggcaaggggc acgatggcct ttaccaggac agccacttcc aagcagtgca gttcgggaac
      601 agaagagaga gagaaggttc agaactcaca aggacccttg ggttaagagc ccgccccaaa
      661 ggtgaaagca cccagcagag tagccaatcc tgtgccagcg tcttcagcat ccccactctg
      721 tggagtccat ggccacccag tagcagctcc cagctctaag gccctgtgct cagctctggt
      781 gatgaccctg gctgctgtca catgagttgt ggtgaggatg ggacttttga aaaatctgat
      841 gttccaattt ctttcatgca tgctctactc agaagtgagc aagggccaaa actcctgggc
      901 atgcatctga atgaatctct caacttagaa aagacttgcc tgcccctctg agctggccag
      961 gtgtccccac ctaccctttg gcatgcctcc aagtgccagg acgccacaga ctgcattggg
     1021 gccgaagact tcccttttct ttggtttttt tgtttgtttt gagtttatac aatcattaag
     1081 aaatctttgg ttttggctgg aaatggaaaa acaaaacaaa tcaaagaaac caccctcccc
     1141 tggcttatag cagcagtatt atgacctgac ctggctgagc tttccccccc tccaactttg
     1201 ggggtggaaa ttgcaagtta agaactacat tcaaagaaaa cgttgaaagg gccggagaag
     1261 cagcttccag aaagcccgtc ggatataaga ttgtcaaata ataataacta ttattataat
     1321 atattaataa taaacttaaa tatttgg
//
LOCUS       YSCPRP5      3017 bp ds-DNA             PLN       27-JUN-1990
DEFINITION  S.cerevisiae pre-RNA processing protein (PRP5) gene, complete cds.
ACCESSION   M33191
KEYWORDS    PRP5 protein; helicase; pre-RNA processing protein.
SOURCE      S.cerevisiae (strain SPJ 5.41) vegetative DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3017)
  AUTHORS   Dalbadie-McFarland,G. and Abelson,J.
  TITLE     PRP5: A helicase-like protein required for mRNA splicing in yeast
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4236-4240 (1990)
  STANDARD  full staff_review
COMMENT     Authorin submission for [1] kindly submitted by
            G.Dalbadie-McFarland, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        284     2833     PRP5
BASE COUNT     1101 a    469 c    636 g    811 t
ORIGIN      Chromosome 2R.
        1 gttgacattc tacttttcta agacggataa tggacgactc cacgttgtag tagttgtcac
       61 attaatcgga aaagtgtctc acatgtaaag gattaaaata agaaatcttc taatttatgt
      121 tgaaatatct tgatttacca ttacccggat tagatgaaac taattaaagt catttagagc
      181 aatggaccaa actaaaggta ttagagttct ttctaatata atacagcgcc aaataattcg
      241 agaagggaaa tcataacgct gttgccttgg gtttatttga ggtatggaaa ctattgattc
      301 gaagcaaaat attaataggg agtctttatt ggaggaaagg aggaaaaaac tagcaaaatg
      361 gaaacaaaaa aaagcacaat ttgatgctca gaaagagcat caaacttcac gcaatgacat
      421 tgttactaac agtttagaag gtaagcagac tactgaaaag tttacggaaa gacaggaacg
      481 tgtaaaagaa gagcttcgga agagaaaaaa tgaatttagg aaatctgacg aacctgtatc
      541 ggttaagcct tcgaagaaaa agtcgaagag aagtaaggta aagaagaaaa tatccttcga
      601 cttcagtgat gacgatgatt ctgaaatagg agtttctttt agatcgaagg agcacattca
      661 aaaagctcca gagcatgata atgaaaaaga tccactggat gagtttatga cgtcattgaa
      721 ggaggagaaa atgagtaata gtaaaggcat gtacgacaga ggcgacattc ttgatgtaga
      781 ggatcaattg tttgaacttg gaggaaccga cgatgaggat gttgaagata atacggataa
      841 ttctaatata gcgaaaattg ctaaactcaa agcaaaaaaa cgtgtaaagc aaatttatta
      901 ctctccagaa gagcttgagc cgttccaaaa aaatttttat atagaatctg aaacagtttc
      961 ctcgatgtca gagatggaag ttgaggagct tagacttagt ctggataata tcaaaataaa
     1021 gggaacaggt tgccccaaac cagtcacaaa gtggtctcaa ctgggactat caacggacac
     1081 catggtttta attacagaaa agttgcactt tggctcctta acgcctattc aatcccaggc
     1141 tcttcccgct attatgtcag gtcgtgatgt tataggaata tcaaaaactg gctccggcaa
     1201 gaccatctcc tatcttttac cattgctaag acaagtaaaa gctcaacggc cattatcaaa
     1261 acatgaaaca gggcccatgg gtttaattct ggctccaact agagagctag ctttacaaat
     1321 acatgaagaa gttactaaat tcacagaagc agatacatcc attagatcag tatgctgtac
     1381 aggaggttct gaaatgaaaa agcagattac tgatcttaaa agaggcactg agattgttgt
     1441 tgccacaccg ggacgattta ttgatatatt aacactaaat gatgggaaat tacttagtac
     1501 taaaagaata acgttcgtag taatggatga ggcagacagg ctgttcgatt taggttttga
     1561 acctcaaata acgcaaatca tgaaaactgt tcgaccggat aaacagtgtg ttctatttag
     1621 tgcaactttt ccgaacaaac tacgcagttt tgccgtaaga gttttgcatt ccccaatatc
     1681 tattacgatt aattcaaagg gaatggttaa tgaaaacgta aaacaaaagt ttagaatatg
     1741 ccattcagaa gatgaaaaat ttgacaatct ggtgcagctt atccatgagc gcagtgaatt
     1801 ttttgacgag gttcaaagtg aaaatgacgg acagtcaagc gatgttgaag aagttgatgc
     1861 taaagccatt atatttgtat caagtcagaa tatttgcgac ttcatttcaa aaaagctgtt
     1921 aaatgctgga attgtgacct gtgctattca tgcaggtaag ccatatcaag aaaggcttat
     1981 gaacttagaa aagttcaaac gagaaaagaa cagtattctt ctctgtacag aggttctctc
     2041 aagaggttta aatgttcccg aagtgtcgtt ggttattatc tataacgccg tcaaaacttt
     2101 cgcgcaatat gttcatacta ctggaagaac tgccagaggt agtcgctccg gtactgctat
     2161 tactcttcta ttgcatgatg aactgtcagg tgcttatatc ttgagcaagg caatgcgcga
     2221 tgaagaaata aaagctttag accctctgca agcaaaagaa cttcaggaaa tgagtgccaa
     2281 gtttgaatcc ggtatgaaaa agggtaagtt taggttgtcg aaagggttcg gcggcaaagg
     2341 actggaaaat atcaaaagta aaagagagga agctcaaaat aaagatttgg agctcaaaaa
     2401 gaatgataag aggagcgatg acttggaaaa gaaaatcagc aatccacgcg aagggcatga
     2461 cagtgtgtct gagtcttctg cgcttattcc gagactcaat tatgaactat tcaaagagtc
     2521 tacagacggg tcgatcatct tttacgccaa ggtttatatt aatgatttac cccagattgt
     2581 aagatgggag gctacaaaga atacaacctt attatttatc aagcatgaaa ccggatgtag
     2641 tattactaac aagggtaaat tttaccctga agggaaggaa ccaaagaatg aaaacgatga
     2701 gcctaaacta tacttattga tcgaaggcca agatgagaaa gacatacaat taagtataga
     2761 attgctggag caaaaagtta aagagggggt cgtaaaggct gcaagcttgt ctttgaagag
     2821 tactaaatac tagtttttag aatttaactc actcgtggtg ctatatactt tcgtagttag
     2881 ttctgaaatt tttagggtgg taatttaaag catattcaat cataataaaa aaaaaaaaag
     2941 agaattatta ggaaaaataa aaactaaaat aatatataat atatgatgta gaaaaaaatc
     3001 ttgaaatttt tatcgat
//
LOCUS       STMACM       1155 bp ds-DNA             BCT       27-JUN-1990
DEFINITION  S.globisporus N-acetylmuramidase M1 (acm) gene, complete cds.
ACCESSION   M30645
KEYWORDS    N-acetylmuramidase M1.
SOURCE      S.globisporus (strain ATCC 21553) DNA.
  ORGANISM  Streptomyces globisporus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 1155)
  AUTHORS   Lichenstein,H.S., Hastings,A.E., Langley,K.E., Mendiaz,E.A.,
            Rohde,M.F., Elmore,R. and Zukowski,M.M.
  TITLE     Cloning and nucleotide sequence of the N-acetylmuramidase
            M1-encoding gene from Streptomyces globisporus
  JOURNAL   Gene 88, 81-86 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.S.Lichenstein, 12-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept        175     1059     N-acetylmuramidase M1 precursor
    matp        406     1056     N-acetylmuramidase M1
BASE COUNT      175 a    454 c    372 g    154 t
ORIGIN      
        1 cgccattgct ctagaccggc ggagccgaca gatgggtctt tacgggggtc tacgcgcgtg
       61 cattacttgt gtcgtgctca tggcgaccgg cctttccggg ccccttcgcc ggggcaagca
      121 ccggtcattc gcgcgttcca cggctacatc ccccactcgt gcctggaggc agtcatgccc
      181 gcgtacagct ctctcgcacg ccgcggccgc agacccgcgg tcgtcctcct cggcggtctc
      241 gtcagcgcct ccctggcgct caccctggcg cccaccgccg ccgccgcgcc cctcgcgccc
      301 ccgcccggca aggacgtcgg gcccggcgag gcgtacatgg gtgtcggcac ccgcatcgag
      361 caggggctcg gcgccggccc cgacgagcgc accatcggcc cggccgacac cagcggtgtc
      421 caggggatcg acgtgtcgca ctggcagggc tccatcaact ggagctcggt gaagtcggcc
      481 gggatgtcct tcgcctacat caaggcgacc gagggcacca actacaagga cgaccggttc
      541 agcgcgaact acaccaacgc gtacaacgcg gggatcatcc ggggcgccta ccacttcgcc
      601 cgcccgaacg cctccagcgg cacggcgcag gccgactact tcgccagcaa cggcggcggc
      661 tggtcccgcg acaaccggac cctgccgggc gtcctggaca tcgagcacaa cccctccggc
      721 gccatgtgct acgggctctc caccacgcag atgcgcacct ggatcaacga cttccacgcc
      781 cggtacaagg cgcgcaccac ccgcgacgtc gtcatctaca ccacggcgag ctggtggaac
      841 acctgcaccg gcagctggaa cggcatggcg gccaagtccc cgttctgggt ggcccactgg
      901 ggcgtgagcg ccccgacggt gccgagcggc ttcccgacct ggacgttctg gcagtactcg
      961 gcgaccggcc gggtcggcgg cgtcagcggg gacgtcgacc gcaacaagtt caacggctcc
     1021 gccgcccgtc tgctggccct ggccaacaac acggcgtgag acggccggag ggccgggggc
     1081 acggcacgca cgccctgccc ccggccctcc cccgcgcccg gcgcggctac cgcatccgcc
     1141 cgaggtgctc gcgca
//
LOCUS       YSCKRE5      4383 bp ds-DNA             PLN       27-JUN-1990
DEFINITION  S.cerevisiae Kre5 protein (KRE5) gene, complete cds.
ACCESSION   M33556
KEYWORDS    Kre5 protein.
SOURCE      S.cerevisiae (strain GRF88) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 4383)
  AUTHORS   Meaden,P., Hill,K., Wagner,J., Slipetz,D., Sommer,S.S. and
            Bussey,H.
  TITLE     The yeast KRE5 gene encodes a probable endoplasmic reticulum
            protein required for (1->6)-beta-D-glucan synthesis and normal cell
            growth
  JOURNAL   Mol. Cell. Biol. 10, 3013-3019 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            H.Bussey, 04-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         77     4174     KRE5 protein precursor
    sigp         77      128     KRE5 protein signal peptide
    matp        129     4171     KRE5 protein
BASE COUNT     1448 a    758 c    826 g   1351 t
ORIGIN      Chromosome 15R.
        1 tatataacgt ggcatattaa agattaattg tcctggtaga atatagacgt atcagtgtga
       61 gtgcctctgt tgattaatga gactacttgc gttggtattg ttattgttgt gtgcgccgct
      121 tcgtgcatgg acttatagct tacgatatgg catacccgaa tctgctcagg tctggtctat
      181 tttagttcat ttactgggcg atgttgataa tcagctgtta actaatttat atcctttggt
      241 taccggtttg gatgacgaga ttgatattca agaaaatctt gtggcgctaa cttccaatgt
      301 attaagggag cgatacgata aagaggatgt ggctgattta ttggaactgt atgctagtct
      361 ttaccctatg ggtatgatac agcacgatat cagttccaat gcagaacaag acgatgcaaa
      421 tagtagctat ttcgttttga atggtaatag gtacgaaaag cccgacgacg tgttctactt
      481 gaaatctaag gatttaacaa ttcaacagaa agtcccagat gttgatgtta tacaacctta
      541 cgatgttgtc attggtacta actcagaagc gccgatattg atcttgtacg gttgtcctac
      601 cgttattgac tccgacttcg aagaattcaa taggaattta tttatggaag caatgaatgg
      661 agagggaaaa tttagattta tttggagatc cacatgttcc cttgatggga aaagcgtgga
      721 gtatccctta actcatccgc ttgaaattac tttacaaaat ggttctagaa tgagctccat
      781 acctcaatta aaaaaaatac tatatactgt acccaaagaa atattggttg gagcagacaa
      841 cgatgatcag ctccatgatc tagaaccaga agaattacgt gaacttgatt tgagagtaac
      901 atcgttaatc tcagaatttt accaatataa aaaggatatc acagccactc taaatttcac
      961 caaaagtatt gttaacaact ttccactaat ctctaaacaa ctgattaagg tttcatctgt
     1021 taacaaggat ataataacaa gtaatgaaga actcaatagt aaaggcttcg attacaacat
     1081 gctaggtctc tatattaatg gacagaattg gaaaattacc tcactgactc cgtacaattt
     1141 gcttactgct ttaaaaactg aataccaaag tttactgaaa attacgaacc ttttgcaaga
     1201 actcgagcca tcgaaatgca tactagattc caagttttta ctcaataagt tttctcaatt
     1261 ttcattgggg aagttgcaaa acttacaacc aatcaaaatg gatctccaca caattccagg
     1321 gttctcagaa tcagtaatat acttcaatga tatcgaaagc gacccgcaat atgacgaatt
     1381 agtaaatagt gttcaagcat tttttgataa atcgaaattc ggagagttgc ctgaaataaa
     1441 gcaaaactgg tcagagatca tattcgttat agatttcgcc cgtttagaag atagtgaggt
     1501 gaaggaggca ttgggtgggt tggttcgtgc cgttaatgtt gtctcccagg gatatccgca
     1561 aagagtcgga ctattgccat ttagttcaga tagtgacaag tccgttgtta ataaaattta
     1621 cgagctgaag aactcaactg acaatttaac agaattaaaa agttttttgg agacaatgct
     1681 gcttgcagat ggcctttccg cgaatgcaaa acattcaaaa cacataccag ttccagatgt
     1741 tttccatcta cttgatgaac ttcaaattga cgaaacatca attataatca atggagagat
     1801 ttacccattt agaaaaaatt ggaattattt aattgcaaaa gttatcaaaa aggacactga
     1861 atttattcgt aaagaattga gcaattcttc tccgaaaaac aaacaaatta gcgtaaggga
     1921 cttattgcat tacaaatctg caaatctgag acataataaa tatacaccaa attattttgc
     1981 tgattcggta tattcttcgg tcaacaatac tgcattggaa agcgtatgct cagaaagaat
     2041 aggctactat actaaaaatg aagaatacaa tttattacac acaatcacat tagtggatga
     2101 ttttggctct attcatgctt tgaaaagatt gagaaacttg ttgcatactt cctttgttgg
     2161 tgttaggatc agaatcattc acgtaggtga tatttctgat atttggtatc aattgcgtgg
     2221 aagtctttcc caaaaagatc caataggctc aataaataca tttattgatg ctttgaaact
     2281 taaaaaggta aaaagtcaca cgtacaaaaa aagcggctta aaccagttag gccttcataa
     2341 atggcttcct gacattccat tatttgaatt gcaaaagggt tcatttattg ctttgaacgg
     2401 tagatttatc atcttgatca aaatgaagtg ccagaaacag aacatttcga aggccaaaat
     2461 cataaagaga gaagctctaa gaacgatcga ttcagttttc gccctagatt tactttttcc
     2521 aggtttctca caggaaataa tcaatcctga tttgatagaa atgatctcct ccattttaac
     2581 taggttgttt taccaaggta cacatatata caataatggt attgattata ccactgaaag
     2641 tagcttacca agaatggatt tgagcgagtt ttttagacct aataatttaa cgatgtttga
     2701 agatggaaaa tcagcttcta ttgatttact actaatttta gatccacttg aagagagaac
     2761 gcaaatgatt ctttctcttg ttgagcaatt caggcctttg aaatttgtta atattcaggt
     2821 aattttaatg ccgacactgg aattaaacat tgtccctatt agaagaatat acgttgatga
     2881 cgcagatatt gtcaaatcaa taacttctga ggatagcaga tcagatccag aagtagatat
     2941 tgaaatggat gttcctaatt ctttcattgt agataataat tatcggataa aaaaattgct
     3001 catagaatta cattccttct ctagcaaaac agtcctttca actggcaata ttgatggtat
     3061 ggggggtgta tgcctagcac ttgtcgattc tgcagggaac attattgaca aaactacaac
     3121 aatgaaaacc tttggctatg gacaatttca taccgacaaa tttttaaagg gttgctatat
     3181 aaaaagttgt gattcaagat ataccgttca gtcattttct actgacgggc atcccgactt
     3241 tataccatca gattccttgg atatactgtc gtacaatcca caaaaaatcg ctgtaaaaat
     3301 ttcagaagag cctacacacg aggaagaata cgaggaaggt cgcaacaatg atacaataat
     3361 caatattttt actattttag agtccgggcc agatgaggaa gagaggtaca tgcaaatgat
     3421 tttatccatt ttgtcaaagt gtcccgaaac gcaaaaggtg aattttttca ttttagatca
     3481 gccgtttatc tccgacactt taaggaaatc atgtgagtat ataaattcct ctgatgaaat
     3541 gagaggcaat gtcatttttt tgaattatga atggcctcaa tggttaagac cgcaaagatt
     3601 ttcttcaagg agaagggatg tctctagatt tctgttcttg gatgtccttt tacctcaaaa
     3661 catctccaaa gtgttatata tgagtccaac tgaagtaccg ctggatcctt ttgacatttt
     3721 tcaatttcaa ggcctcaaac gtgcacctct aggactattc cgaatgagtg gtgatggtta
     3781 ttggaaagaa ggatactggg aaaaaatgtt aagggagaat aatttagaat tttattctac
     3841 cgaaccggcc tttttagtaa acttagagag gtttcgggag ttagatgctg gtgataaata
     3901 caggattcac tatcaacgta tttctacaga cgccatgtct cttgtcaata tcggccaaga
     3961 tctagttaac aacctacaac tcgaggttcc gattaggttt ctcaagggat cgtataagaa
     4021 gaaattagtt attaatgatg aatgtgtttc tgaatggaag aaaaaaataa ataagttcgc
     4081 atcctctcct ggcgatgaag acgtacctgg agaaagtgtt agcagcaaat accaagattc
     4141 cgacaatgcc gctcctctgc atgacgaatt ataactactc cagaagagtt cattacgcga
     4201 ctgtccaaga gcgtgaagaa ttgcttctgc gcataagtct tcttccataa tagattttta
     4261 tatcattttt agaacataaa atttctcgcc aagctagttt tttagctaaa agcagatatc
     4321 cagtaacatg ggttccgctt tttgcagcga atactatgaa gagttttgcc cgactggctc
     4381 ccc
//
LOCUS       STMCURA      2192 bp ds-DNA             BCT       27-JUN-1990
DEFINITION  S.curacoi curC, CurA genes, complete cds, curD gene, 3'
            end and curB gene 5' end.
ACCESSION   M33704
KEYWORDS    .
SOURCE      S.curacoi (strain ATCC 13385) DNA.
  ORGANISM  Streptomyces curacoi
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 2192)
  AUTHORS   Bergh,S. and Uhlen,M.
  TITLE     Cloning, analysis and heterologous expression of the polyketides
            synthesis genes of Streptomyces curacoi
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Authorin submission for [1] kindly submitted by S.Bergh,
            13-APR-1990.
            
            Author address: S.Bergh, Dept. of Biochemistry,  Teknikringen 30,
            Stockholm, SWEDEN  S-100 44.  E-mail: staffan@physchem.kth.se
FEATURES       from  to/span     description
    pept    <     1       32     curD protein (AA at 3)
    pept         77      541     curC protein
    pept        538     1806     curA protein
    pept       1803  >  2192     curB protein
BASE COUNT      349 a    823 c    731 g    289 t
ORIGIN      
        1 tgatcaccga ccgcagggcc gcgcagccct gacagcgcag ggacccccac cccaccgacc
       61 gacctggaga cagaccatgg acaagatgcg cccgcgcgtc gtggacgtca aagagatcga
      121 gcccaaccgc aagcgcggcg gtgacctgcg cacgctgctc acccctgtca cggtgggtgc
      181 caccagcggg ttcatgggcc tggccatcat gcggccgggc gaacgcatca gcgagcacta
      241 ccacccgtac tccgaggagt tcgtgtacgt cgtcgagggc aggctggagg tcgacctcga
      301 tggcgaaacg tttcctctcc gtgccgacca gggcctcatg atccccatcg acatgcggca
      361 ccgcttccgc aacgtcggtg acgaggaagc ccggatggtc ttccatctga gcccgctggc
      421 accgaagccg agcctcggcc acgtcgacac ggaggccccg gcgatcagcg acgacgtcaa
      481 ggcgtacccg ctcgtccagg aggagagcgg gcggccggaa cgacccgggg tcctgtcgtg
      541 acccggcggg tggcggtcac cggcataggc gtggtcgctc cgggaggcat cggggtgccc
      601 gcgttctggg acctcctctc gagcggccgt accgcgacac gtggcatcac cctgttcgac
      661 cccgaggggc tgcggtcccg catcgccgcc gagtgcgact tcgacccgct cgcccacggc
      721 ctcgaccccg agctggtcga acgcgccgac cggtacatac agttcgccct ggtcgccgcc
      781 gacgaggccg tgaccgacag cggcatcgac ttcggcaccg agaacccctg gcgcgtggcc
      841 gtgtccctgg gcagcgcggt gggcgggacg accaggctgg agcacgacta cgtcctcgtc
      901 agcgaacggg gacagcgctg ggacgtcgac caccgcgccg ccgagccgga actgcaccgc
      961 gccttctcgc ccagcacgct ggccgcggac gtcgcggagc ggttcggcgc ccagggaccg
     1021 gtgcagaccg tgtccaccgg ctgcacctcc gggctcgacg cggtgggtta cgccttccac
     1081 accatcgagg agggccgcgc cgacgtctgc atcgccgggg cgtcggactc gccgatctcc
     1141 ccaatcacca tggcctgctt cgacgccatc aaggccacgt cgcccaacaa cgacgacccg
     1201 gagcacgcct cccgcccctt cgacgcccac cgcgacggct tcgtcatggg cgagggcgcc
     1261 gccgtcctcg tcctggagga actggaacac gcccgtgcgc gcggcgcaca tgtctactgc
     1321 gagatcggcg gctacgccac cttcggcaac gcctaccaca tgaccggact gaccagtgag
     1381 ggcctggaga tggcccgggc catcgacgtc gcgctcgacc acgcgcgcgt cgacccgacc
     1441 gacatcgact acgtcaacgc ccacggctcg ggcacccggc agaacgaccg ccacgagacc
     1501 gccgcggtca agaagtccct gggcgcccac gcctacgaca cccccatgag ctcgatcaag
     1561 tccatggtgg gccactcgct cggcgcgatc ggcgcgatcg aggtcgtcgc ctgcgtactc
     1621 gccctggccc ggcaggtggt cccgccgacg gcgaactacg agaccccgga ccccgagtgc
     1681 gacctggact acgtaccgcg caccgcacgg ccccgccggc tcgaccacgt gctctccgtg
     1741 ggcagcggct tcggcgggtt ccagtccgcg gtgctcctga cggggccggc tgggaggaaa
     1801 cgatgaacgg cagtggtagt ggcatacgca cccggcggac cgccgtcacg gggatcggtg
     1861 tggtcgcgcc caacggcctg cacgccgaca cctactggaa gtccgtcaag gagggcgcga
     1921 gcgtactgga ccgcatcacc cgggagggct gcgagcacct tccgctccgc gtcgccggtg
     1981 aggtccgcgg attcgacccg tcggcgctca tcgaggagac cttcctcgtc cagaccgacc
     2041 tgttcaccca cttcgccctg gccgccgcgg acgccgccct ccaggacgcg gggctgacca
     2101 aggccgccgc cgtggccgac tcgccgtact cggtcggtgt cgtcaccgcc gccgggtccg
     2161 gtggcggcga gttcgggcag cgggaactgc ag
//
LOCUS       MUSTRKB      2484 bp ss-mRNA            ROD       27-JUN-1990
DEFINITION  Mouse tyrosine protein kinase B (trkB) mRNA, complete cds.
ACCESSION   M33385
KEYWORDS    tyrosine protein kinase.
SOURCE      Mouse adult brain, cDNA to mRNA, TK-negative clone pFRK42.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2484)
  AUTHORS   Klein,R., Conway,D., Parada,L.F. and Barbacid,M.
  TITLE     The trkB tyrosine protein kinase gene codes for a second neurogenic
            receptor that lacks the catalytic kinase domain
  JOURNAL   Cell 61, 647-656 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Barbacid, 29-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept          1     1431     tyrosine protein kinase B (trkB) precursor
    sigp          1       93     tyrosine protein kinase B (trkB) signal
                                 peptide
    matp         94     1428     tyrosine protein kinase B (trkB)
BASE COUNT      638 a    614 c    549 g    683 t
ORIGIN      
        1 atgtcgccct ggctgaagtg gcatggaccc gccatggcgc ggctctgggg cttatgcctg
       61 ctggtcttgg gcttctggag ggcctctctc gcctgcccga cgtcctgcaa atgcagttcc
      121 gctaggattt ggtgtactga gccttctcca ggcatcgtgg cattcccgag gttggaacct
      181 aacagcgttg acccggagaa catcacggaa attctcattg caaaccagaa aaggctagaa
      241 atcatcaatg aagatgacgt tgaagcttac gtggggctga gaaaccttac aattgtggat
      301 tccggcttaa agtttgtggc ttacaaagcg tttctgaaaa acagcaacct gcggcacata
      361 aatttcacac gaaacaagct gacgagtttg tccaggagac atttccgcca ccttgacttg
      421 tctgacctga tcctgacggg taatccgttc acgtgctcct gcgacatcat gtggctcaag
      481 actctccagg agactaaatc cagccccgac actcaggatt tgtactgcct caatgagagc
      541 agcaagaaca tgcccctggc gaacctgcag atacccaatt gtggtctgcc atctgcacgt
      601 ctggctgctc ctaacctcac cgtggaggaa ggaaagtctg tgaccctttc ctgcagtgtg
      661 gggggtgacc cactccccac cttgtactgg gacgttggga atttggtttc caagcacatg
      721 aatgaaacaa gccacacaca gggctcctta aggataacga acatttcatc tgatgacagt
      781 ggaaagcaaa tctcttgtgt ggcagaaaac cttgtaggag aagatcaaga ttctgtgaac
      841 ctcactgtgc attttgcgcc aactatcacg tttctcgagt ctccaacctc agatcaccac
      901 tggtgcattc cattcactgt gagaggcaac cccaagcctg cgcttcagtg gttctacaat
      961 ggggccatac tgaatgagtc caagtacatc tgtactaaga tccacgtcac caatcacacg
     1021 gagtaccatg gctgcctcca gctggataac cccactcata tgaataacgg agactacacc
     1081 ctgatggcca agaacgagta tgggaaggat gagagacaga tctccgctca cttcatgggc
     1141 cggcctggag tcgactacga gacaaaccca aattaccctg aagtcctcta tgaagactgg
     1201 accacgccaa ctgacattgg ggatactacg aacaaaagta atgaaatccc ctccacggat
     1261 gttgctgacc aaagcaatcg ggagcatctc tcggtctatg ccgtggtggt gattgcatct
     1321 gtggtgggat tctgcctgct ggtgatgttg ctcctgctca agttggcgag acattccaag
     1381 tttggcatga aaggttttgt tttgtttcat aagatcccac tggatgggta gctgagataa
     1441 aggaaagaca aaggctgggg ctgtggtgct tgttgcctga cgccctgtga gctgaactct
     1501 gggactgctg ttgcctatcc caggaagtgc tgcttatttg agggtgtctg gtggaaatgg
     1561 gtaatctccg aggatgtctg cagcctgctt gttgtgagct gtgactgggg aaccccaagg
     1621 cagaggcagg ggtcaggcag ctgagaagca gcagaagaac acacttagat tcaccttctg
     1681 ttcttacaat agttcaaata tagaatcgaa gtgaaatctc attggattat gcctctctaa
     1741 tgaaaagcga gctgtttgac tatacggaaa atgtgctgac attaattgct tctgtttatt
     1801 aaaggtgatt tgcaaattaa aaactctgca tctatcatct atccatctat ctgtttgtct
     1861 atcatatcta tctgtctgtc tatctgtcta tcatctatct acctacctct ctatcatatc
     1921 tatctgtctg tctatctatc tatctatcta tctatctatc tatctatcta tctatctatc
     1981 tatctatcat ctatctacct atcatcgatc tacttatcta tcatctatct atctacctat
     2041 catcgattta cttatctatc atctatctat ctatctatct atctatctat ctatctatct
     2101 atctgtcatc tatctaaagt catagctagg tctaagtgca cactaaaagt ctaatccaca
     2161 cataacacct atttcagcaa catcttctgt tctctaacct ttgctaactt ctgtgatttc
     2221 cacctacaac cctgcgactg atagacttaa aggcacattg gtggtgtcat tagtaggttc
     2281 tttgttttgc tggcagcaaa gacccaaact cttcgctaac gattgctttc aaagtccacc
     2341 cggcaggtag aacggagcag caccagggac tgtgtggcca ggagtatgga cctgaattaa
     2401 ccacagcctg agaataaata atggtagggt atatgcatat agggaattaa aatcttgtcc
     2461 ctttccattg ccctctgcta accg
//
LOCUS       STRPRH       1784 bp ds-DNA             BCT       27-JUN-1990
DEFINITION  Streptococcus protein H gene, complete cds.
ACCESSION   M29398
KEYWORDS    IgG binding protein; protein H.
SOURCE      Streptococcus (group A, strain AP1) DNA.
  ORGANISM  Streptococcus sp.
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Streptococcaceae.
REFERENCE   1  (bases 1 to 1784)
  AUTHORS   Gomi,H., Hozumi,T., Hattori,S., Tagawa,C., Kishimoto,F. and
            Bjoerck,L.
  TITLE     The gene sequence and some properties of protein H: A novel IgG
            binding protein
  JOURNAL   J. Immunol. 144, 4046-4052 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            H.Gomi, 25-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept        328     1458     protein H precursor
    sigp        328      471     protein H signal peptide
    matp        472     1455     protein H
    binding     316      321     ribosome binding site
    signal      246      251     -35 region
    signal      270      274     -10 region
    site       1489     1514     terminator
BASE COUNT      734 a    343 c    338 g    369 t
ORIGIN      
        1 gaattccgtt accatcaaca ggtgaaacag ctaacccatt cttcacagcg gcagccctta
       61 ctgttatggc aacagctgga gtagcagcag ttgtaaaacg caaagaagaa aactaagcta
      121 tcactttgta atactgagtg aacatcaaga gagaaccagt cggttctctc ttttatgtat
      181 agaagaatga ggttaaggag aggtcacaaa ctaaacaact cttaaaaagc tgacctttac
      241 tccttttgat taactatata taataaaaat attaggaaaa taatagcact attaattttc
      301 ttttttaata aaatcaagga gtagataatg actagacaac aaaccaagaa aaattattca
      361 ctacggaaac taaaaaccgg tacggcttca gtagccgttg ctttgaccgt tttgggcgca
      421 ggttttgcaa accaaacaac agttaaggcg gaaggggcta aaattgattg gcaagaagag
      481 tataaaaagt tagacgaaga taatgctaaa cttgttgagg ttgttgaaac cacaagtttg
      541 gaaaacgaaa aactcaagag tgagaatgag gagaataaga aaaatttaga caaacttagc
      601 aaagaaaatc aaggaaagct cgaaaaattg gagcttgact atctcaaaaa attagatcac
      661 gagcacaaag agcaccaaaa agaacaacaa gaacaagaag agcgacaaaa aaatcaagaa
      721 caattagaac gtaaatacca acgagaagta gaaaaacgtt atcaagaaca actccaaaaa
      781 caacaacaat tagaaacaga aaagcaaatc tcagaagcta gtcgtaagag cctaagccgt
      841 gaccttgaag cgtctcgtgc agctaaaaaa gaccttgaag ctgagcacca aaaacttgaa
      901 gctgagcacc aaaaacttaa agaagacaaa caaatctcag acgcaagtcg tcaaggccta
      961 agccgtgacc ttgaagcgtc tcgtgcagct aaaaaagagc ttgaagcaaa tcaccaaaaa
     1021 cttgaagctg agcaccaaaa acttaaagaa gacaaacaaa tctcagacgc aagtcgtcaa
     1081 ggcctaagcc gtgaccttga agcgtctcgt gcagctaaaa aagagcttga agcaaatcac
     1141 caaaaacttg aagcagaagc aaaagcactc aaagaacaat tagcgaaaca agctgaagaa
     1201 cttgcaaaac taagagctgg aaaagcatca gactcacaaa cccctgatac aaaaccagga
     1261 aacaaagctg ttccaggtaa aggtcaagca ccacaagcag gtacaaaacc taaccaaaac
     1321 aaagcaccaa tgaaggaaac taagagacag ttaccatcaa caggtgaaac agctaaccca
     1381 ttcttcacag cggcagccct tactgttatg gcaacagctg gagtagcagc agttgtaaaa
     1441 cgcaaagaag aaaactaagc tatcactttg taatactgag tgaacatcaa gagagaacca
     1501 gtcggttctc tcttttatgt atagaagaat gaggttaagg agaggtcaca aactaaacaa
     1561 ctcttaaaaa gctgaccttt actaataatc gtctttgttt tataatgaaa acattaacga
     1621 aataatttat taaggagaga atactaatga atattagaaa taagattgaa aatagtaaaa
     1681 cactactatt tacatccctt gtagccgtgg ctctactagg agctacacaa ccagtttcag
     1741 ccgaaacgta tacatcacgc aattttgact ggtctgggga attc
//
LOCUS       BSUSACP      3518 bp ds-DNA             BCT       27-JUN-1990
DEFINITION  B.subtilis sacT gene, complete cds, enzyme II (sacP) gene (of the
            sucrose phosphotransferase system), complete cds, and sucrase gene,
            5' end.
ACCESSION   J03006 M33761
KEYWORDS    enzyme II; sucrase; sucrose transferase system.
SOURCE      B.subtilis (strain Marburg 168) DNA, clone pBSG8-10.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1987 to 3518)
  AUTHORS   Fouet,A., Arnaud,M., Klier,A. and Rapoport,G.
  TITLE     Bacillus subtilis sucrose-specific enzyme II of the
            phosphotransferase system: Expression in Escherichia coli and
            homology to enzymes II from enteric bacteria enzymes II from
            enteric bacteria
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 8773-8777 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2070)
  AUTHORS   Debarbouille,M., Arnaud,M., Fouet,A., Klier,A. and Rapoport,G.
  TITLE     The sacT gene regulating the sacPA operon in B.subtilis shares
            strong homology with transcriptional antiterminators
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed copy of sequence for [1] kindly provided by
            A.Fouet, 16-SEP-1987.  Draft entry and computer-readable sequence
            for [2] kindly submitted by M.Debarbouille 18-APR-1990.
            
              Author address: M.Debarbouille
                Institut Pasteur
                25-Rue du Dr. Roux
                75724
                Paris Cedex 15
                FRANCE
                Emr:DEDONDER%PASTEUR.BITNET@CUNYVM.CUNY.EDU
            
FEATURES       from  to/span     description
    pept        136      897     sacT protein
    pept       2008     3390     enzyme II
    pept       3387  >  3518     sucrase
BASE COUNT     1013 a    712 c    867 g    926 t
ORIGIN      
        1 ctttacatgg cggtcaaaag gaggtaaaat cccagtcaaa agcaaattgg cgggagagat
       61 aacctcttga aaatctataa agtattaaac aacaatgcgg ctttaataaa agaggatgat
      121 caggaaaaaa tcgtgatggg accgggaatc gcttttcaaa aaaagaaaaa tgatctcatc
      181 cctatgaata aagtggaaaa gatctttgtc gtgcgcgacg agaatgaaaa gttcaaacaa
      241 atcctgcaaa cactgccgga ggagcatata gaaatagctg aggatatcat cagctatgcg
      301 gaaggagagc tcgcggcgcc gctgagcgat cacattcata tcgcgctttc cgaccatttg
      361 tcctttgcga tcgaaaggat tcaaaatggg ctgcttgtgc agaataaatt gctgcatgag
      421 ataaaggcgc tctataaaaa agaatatgag atcggcctgt gggcgatcgg acatgtaaaa
      481 gagacattgg gcgtgtctct gcctgaagat gaagcgggtt atattgccct tcacatccat
      541 acggcgaaga tggatgcgga gagcatgtat tcagcgctga agcatacgac catgatcaaa
      601 gaaatgatag agaaaataaa acaatacttt aatcgaaagg tggatgaaaa cagcatttcc
      661 tatcaacgcc ttgtcacgca tttgcgatac gcggtcagcc ggttggaatc aaatgaagcg
      721 ctccatcgta tggacgagga gatgctttat ttcatccaaa aaaagtattc attcgcctat
      781 caatgtgcgc tggagctggc ggagtttttg aaaaatgaat atcaattaca tttgccggaa
      841 tccgaggccg gctatatcac gctgcatgtc cagcgtcttc aagatctctc ggaataaccg
      901 ctttgacttg cagggagtga tctctggaag tttttttatt gatcaggatt cagtgtgtag
      961 tacatccagc ccatcataac tattcccgcg gtcagattgc cgagcgtaac ggggattaag
     1021 tttctgactg ctcccatcag tgtcactgta tcagggtgct cgatcaaaag cgagatggcg
     1081 aatgtgcaca tattggcaat gctgtgttca aagccggaaa taaagaagca gaaaacgaaa
     1141 agcatcatgg taaaaagctt tgctccttcc cctttgagag acattggaat gaaaaaggcg
     1201 aggcacacaa gccaattgca cagcattcct ctgaaaaaca attcggaagc cggcggctcc
     1261 attttgtgct ctgccaaatg aatcaaaaag gaatgaacag aaggctcctc aaaaagtccc
     1321 gtcgcgctga tcaggatggc aaacagaatg gcgccgatta aattgccggc atagcttgac
     1381 atccacaagt atagggtgtc gcgccagctg atttttttcc ggagcgcggt ataggtgaaa
     1441 taaaaggtgt tgccggtaaa taaatctccg cctccgtaag caatcatcag aatagcggcc
     1501 ccgaaagtga cagcggctgc cggaaaggca aacggagaat cagccataaa gaaatagctg
     1561 cctgttttgc ttgcggctgt aatgccaaaa ccaataaata tactggccaa aatggaccga
     1621 agcacataac ggattttgct tgaagcgaat atgttttgtt ttttcaaagc atactgttca
     1681 accttttgta atgcttgagt ttccattgtc ttttcctgct ttctttcatg attgagcata
     1741 gtatggcacg atattgtgaa gtatttcaca attaattttt acgaattttt gtctacatgt
     1801 tatatcaaac tatttaacta aaaaaatagt tgacgaaaac gctatcatga tttatgatga
     1861 aagcgtattc ttaatctgaa taagcgggat tgtgactggt aaagcaggca agacctaaaa
     1921 tttgcgtaaa tgaaaaagga tcgctgtgtc ctttattcgt tggcgaattt taggtctttt
     1981 ttgtttaaat aaaaggggga gaaatagatg gattacaaag agactgcaaa acgcctcatt
     2041 gagcttctcg gagggaaaga aaatattatc agcgcggctc attgtgcaac aagactgcgt
     2101 ttagtgatga aagatgaatc aaagatagat caagcacaag tagaagagct tgatggcgtt
     2161 aaaggggctt tcagcagctc tggccagtac cagatcattt tcggaacagg ccttgtgaat
     2221 aaagtattcg atgccttttc taaagaagct gatatcgagc gtgaagaaca cgtcaatcat
     2281 caggatgcgg caaaagaaaa gctgaatccc gctgcgagat ttgcgaaaac gctttctaat
     2341 atttttgttc caatcatccc agctattgta gccagcggcc tattaatggg attgctgggt
     2401 atgatcaatg cgtttcattg gatgagcaag gattctgcgc ttttgcagct gcttgatatg
     2461 ttttcaagtg cagcattcat tttcttgccg attctaatcg gggtcagcgc ttcaaaagag
     2521 tttggcagca atccatactt gggagcggtc atcgggggaa tcatgattca tccgaatctt
     2581 ttaaatccat ggggattggc ggaagaacaa ctgattacat gcatcttttc ggatttgata
     2641 tcgctcttct cggctacagg gaactgttat ccctgtcctg cttgcggtgt atgtgatgag
     2701 caaggtagag aaatggacga gaaaagtggt tccacatgcg gtgatttgct tgttacacca
     2761 tttgttactg tgattgtcac cgggtttgtt gcttttatcg cgattggccc tctaggcaga
     2821 gcgcttggct ccggcattac ggttgcttta acttatgtat atgaccatgc cggttttgtc
     2881 gcaggtctga ttttcggggg cacgtattca ctcatcgtgc tgacaggcgt tcatcacagc
     2941 ttccatgcga ttgaggcagg gctaattgca gatataggca aaaactactt gctgccgatc
     3001 tggtcaatgg cgaatgtcgc acagggcggg gcaggtcttg ccgtcttctt tatggcgaaa
     3061 aaggcaaaga caaaagaaat cgcgcttccg gcagcatttt ccgcttttct cggcattact
     3121 gagcccgtca tattcggagt caatctgcgc taccgaaaac cgtttatcgc cgcgatgatc
     3181 ggaggcgcat tgggtggggc atatgtcgtc tttacacatg tagctgcaaa cgcttacggg
     3241 ttaacgggaa ttccgatgat cgccattgcg gctccgtttg gcttcagcaa cttgattcac
     3301 tatctgattg gcatggcgat tgcggcagtg tcagcgttca ttgcagcatt tgtcatgaag
     3361 attaatgagg atgaggagag gaaaaaatga cagcacatga ccaggagctt cgtcgccggg
     3421 cttatgaaga agtggagaaa aaagagccca tcgctaacag cgatccgcac cgccagcatt
     3481 ttcatatcat gccgccggtt gggctgctga atgacccg
//
LOCUS       MUSFGFR      2983 bp ss-mRNA            ROD       27-JUN-1990
DEFINITION  Mouse FGF receptor mRNA, complete cds.
ACCESSION   M33760
KEYWORDS    FGF receptor.
SOURCE      Mouse cell line NIH3T3, cDNA to mRNA, clone A-15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2983)
  AUTHORS   Mansukhani,A., Moscatelli,D., Talarico,D., Levytska,V. and
            Basilico,C.
  TITLE     A murine FGF receptor expressed in CHO cells is activated by both
            bFGF and K-FGF
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Basilico, 22-MAR-1990.
FEATURES       from  to/span     description
    pept         44     2245     FGF receptor precursor
    sigp         44      113     FGF receptor signal peptide
    matp        114     2242     FGF receptor
BASE COUNT      699 a    834 c    799 g    651 t
ORIGIN      
        1 gaattccgga gccttgttac caacctctaa ccgcagaact gggatgtggg gctggaagtg
       61 cctcctcttc tgggctgtgc tggtcacagc cactctctgc actgccaggc cagccccaac
      121 cttgcccgaa caagatgcac tcccatcctc ggaagatgat gacgacgacg atgactcctc
      181 ctcggaggag aaagagacgg acaacaccaa accaaaccgt aggcctgtag ctccctactg
      241 gacatcccca gagaaaatgg agaagaaact gcatgcggtg cccgctgcca agacggtgaa
      301 gttcaagtgc ccgtcgagtg ggacacccaa ccccactctg cgctggttga aaaatggcaa
      361 agagtttaag cctgaccacc gaattggagg ctacaaggtt cgctatgcca cctggagcat
      421 cataatggat tctgtggtgc cttctgacaa gggcaactac tcctgcatcg tggagaatga
      481 gtatgggagc atcaaccaca cctaccagct tgacgtcgtg gaacgatctc cgcaccgacc
      541 catccttcag gcagggctgc ctgcaaacaa gacagtggcc ctggcgagca atgtggagtt
      601 catgtgtaag gtgtacagcg atccgcagcc tcacattcag tggctgaagc acatcgaggt
      661 gaacgggagt aagatcgggc cagacaactt gccgtatgtc cagatcctga agactgctgg
      721 agttaatacc accgacaagg aaatggaggt gcttcaccta cggaatgtct cgtttgagga
      781 tgcgggggag tatacgtgct tggcgggtaa ctctatcgga ctctcccatc actctgcatg
      841 gttgaccgtt ctggaagccc tggaagagag accagctgtg atgacctcac cgctctacct
      901 ggagatcatt atctactgca ccggggcctt cctgatctcc tgcatgttgg gctctgtcat
      961 catctataag atgaagagcg gcaccaagaa gagcgacttc catagccaga tggctgtgca
     1021 caagctggcc aagagcatcc ctctgcgcag acaggtaaca gtgtcagctg actccagtgc
     1081 atccatgaac tctggggttc tcctggttcg gccctcacgg ctctcctcca gcgggacccc
     1141 catgctggct ggagtctccg aatatgagct ccctgaggat ccccgctggg agctgccacg
     1201 agacagactg gtcttaggca aaccacttgg cgagggctgc ttcgggcagg tggtgttggc
     1261 tgaggccatc gggctggata aggacaaacc caaccgtgtg accaaagtgg ccgtgaagat
     1321 gttgaagtcc gacgcaacgg agaaagacct gtcggatctg atctcggaga tggagatgat
     1381 gaaaatgatt gggaagcaca agaatatgat caaccttctg ggagcgtgca cacaggatgg
     1441 tcctctttat gtcattgtgg agtacgcctc caaaggcaat ctccgggagt atctacaggc
     1501 ccggaggcct cctgggctgg agtactgcta taaccccagc cacaaccccg aggaacagct
     1561 gtcttccaaa gatctggtat cctgtgccta tcaggtggct cggggcatgg agtatcttgc
     1621 ctctaagaag tgtatacacc gagacctggc tgctaggaac gtcctggtga ccgaggataa
     1681 cgtaatgaag atcgcagact ttggcttagc tcgagacatt catcatatcg actactacaa
     1741 gaaaaccacc aacggccggc tgcctgtgaa gtggatggcc cctgaggcgt tgtttgaccg
     1801 gatctacaca caccagagcg atgtgtggtc ttttggagtg ctcttgtggg agatcttcac
     1861 tctgggtggc tccccatacc ccggtgtgcc tgtggaggaa cttttcaagc tgctgaagga
     1921 gggtcatcga atggacaagc ccagtaactg taccaatgag ctgtacatga tgatgcggga
     1981 ctgctggcat gcagtgccct ctcagagacc tacgttcaag cagttggtgg aagacctgga
     2041 ccgcattgtg gccttgacct ccaaccagga ttatctggac ctgtccatac cgctggacca
     2101 gtactcaccc agctttcccg acacacggag ctccacctgc tcctcagggg aggactctgt
     2161 cttctctcat gagccgttac ctgaggagcc ctgtctgcct cgacacccca cccagcttgc
     2221 caacagtgga ctcaaacggc gctgactacc aaccctgtcc cagttttctc ccattccgtc
     2281 gtcacccgtg ccctcaccca caaatcccct gttggacaca ctgccttctc ctcctccttg
     2341 ccgctggcaa gagccagtgc ctgactgaag gccttcctgt gttgtggcct tccccctcca
     2401 tcacccccaa gacccctctt ctccctcttc ttagcctgct gtgtgagaga ggagccaaga
     2461 ggcaggtgct tgccgacggc cgcatcctcc ttcccaggtg ttggaccaag acccgccccg
     2521 ctgctggcac tgcttggagg tgtgcagagc ggaagcaagt ggagacgtat ccggggcatt
     2581 cctgttgacc catcagcccc ttctgttctg gcggcagggc cttggggctc ctggaagccg
     2641 tgaggtttct gtttaggcct taaccgaagg caacctctgc tccagatgga tggtaccagt
     2701 agcttcttaa ttccaatact aatttgcttt gctgaccaaa tacctgcctg gtaccagaag
     2761 acagggaggc agagactggg agccgtgatg tgcccttggc tgagcctaga cttggggctc
     2821 tgtacatagc tatgaagaaa aacacaaagt gtataaatct tgagatatat ttacatgtct
     2881 ttttaaaaag ggtcgttact agagatttac catgggggag acgcccaggg tagcatccgt
     2941 tgctatatat taaaaacaca aacgaacaaa aaaaaaagaa ttc
//
LOCUS       BOVTCRA01     402 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, leader sequence,
            variable region, and joining region.
ACCESSION   D90010
KEYWORDS    T-cell receptor; alpha chain; joining region; leader sequence;
            variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA52.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 402)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept          1  >   402     T-cell receptor alpha chain, leader sequence,
                                 variable region and joining region
BASE COUNT      114 a    111 c     91 g     86 t
ORIGIN      
        1 atgatgaaat cctcaagagt tttaccagtg atcctgtggc ttcagttgat ctcggtgagc
       61 agccagcaga acacagtgga gcagagtcct gcgtctctgc ccgtcccaga gggagaggcc
      121 gcctctctcg gctgcactta ttctgacagt gcttctcagt acttcatatg gtacagacag
      181 taccctggga aaggccctga gtttctgctg caagtgtatg ccaacaaaga caaagaggaa
      241 ggaaaattta cagcacagtc caataaaacc agcaagcatg tctccctgcg tatcagagac
      301 tccgagccca gcgactcagc cacctacctc tgtgcagctt ctaactacca aggcagtcaa
      361 ttaaactttg gaacaggaac tagactcacc atcacagcca aa
//
LOCUS       BOVTCRA02     408 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, leader sequence,
            variable region, and joining region.
ACCESSION   D90011
KEYWORDS    T cell receptor; alpha chain; joining region; leader sequence;
            variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA29.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 408)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinary Public Health Obihiro Univ. of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept          1  >   408     T-cell receptor alpha chain, leader sequence,
                                 variable region and joining region
BASE COUNT      115 a    109 c     95 g     89 t
ORIGIN      
        1 atgatgaaat cctcaagagt tttattagtg atcctgttgg cttcagttga tcttgagcag
       61 ccagcagaac acagtggagc agagcctgcg tctctgcccg tcccagaggg agccgctgca
      121 tctctcggct gcacttattc tgacagcaat tctctgtact tcacatggta cagacagtac
      181 cctgggaaag gccctgagtt tctgctgcaa gtgtatgcca acaacaacaa agaggaagga
      241 aaatttacag cgcagtccaa taaaaccaac aagcatgtct ccctgcgtat cagagactcc
      301 gagcccagcg actcagccac ctacctctgt gcagttgata cgatatccac cactgcaggg
      361 accaagctaa cctttggaga aggaaccagg ctgatagtca agctgaaa
//
LOCUS       BOVTCRA03     411 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, leader sequence,
            variable region, and joining region.
ACCESSION   D90012
KEYWORDS    T cell receptor; alpha chain; joining region; leader sequence;
            variable region.
SOURCE      Bovine T-cell, cDNA to mRNA, clone BTA44.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 411)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept          1  >   411     T-cell receptor alpha chain, leader sequence,
                                 variable region and joining region
BASE COUNT      104 a    114 c    101 g     92 t
ORIGIN      
        1 atgatgaaat cctcaagagt tttattggtg atcctgtggc ttcagttgat cttggcgagc
       61 acccagcaga acacagtgga gcagagtcct gcgtctctgc ccgtcccaga gggagccgcc
      121 gcatctctcc gctgcactta ttctgacagc acttctcggt acttcacatg gtacagacag
      181 taccctggga aaaggcctga gtttctcgtg caagtgtatg ccaacaacaa caaagaggaa
      241 ggaaaattta cagcgcagtc caataaaacc agcaagcacg tctccctgcg catcagagac
      301 tctgagccca gcgactcagc cacctacctc tgtgcagtga gcaatgatta tgggggtgct
      361 gccaaccagg tcatctttgg gtctggcacg ctgctttctg tcaagccaaa a
//
LOCUS       BOVTCRA04     393 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, leader sequence, variable
            region, joining region.
ACCESSION   D90013
KEYWORDS    T cell receptor; alpha chain; joining region; leader sequence;
            variable region.
SOURCE      Bovine T-cell, cDNA to mRNA, clone BTA43.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 393)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinary Public Health Obihiro Univ. of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
            Japan 080 Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept          1  >   393     T-cell receptor alpha chain, leader sequence,
                                 variable and joining region
BASE COUNT      111 a     88 c     95 g     99 t
ORIGIN      
        1 atgaagacac aaagaagagt cttgctgagt ctcttgtgga tacagatttg ctgcctcaga
       61 gtgcagatga aggtggagca gagtccgggg gttctgactc tccaagaggg gagaaattcc
      121 tctctgatat gcaattattc tatttccatg accagtgtgc agtggttcca acaaaatcct
      181 gatggacgcc tcatctcctt gttttatata gcttcaggaa tgcagcagaa aggaagactg
      241 aaatccacca ttaatagtaa ggagcgttac agtcaactct acatcagaga ctcccagcct
      301 ggggactcag ccacttactt ctgtgctgcg atcaggactg acaattccag aatcttcttt
      361 ggaactggaa cccaggtggt ggtaaagccc aaa
//
LOCUS       BOVTCRA05     393 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, leader sequence, variable
            region, and joining region.
ACCESSION   D90014
KEYWORDS    T cell receptor; alpha chain; joining region; leader sequence;
            variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA20.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 393)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept          1  >   393     T cell receptor alpha chain, leader sequence,
                                 variable and joining region
BASE COUNT       96 a     97 c     93 g    107 t
ORIGIN      
        1 atgtccttgg tgttcacctt gatgcttgag atgctcctgt ttctgagagc tggagcccag
       61 tcagtgaccc agcctgatga ccacattgct gtctctgaag agccgtctgg agctgaagtg
      121 caactactca tcttctgttt caccgatctc ttctggtaca tacagtaccc caaccaagga
      181 ctccagcttc tcctgaagta cgtgtctgga gacaatcttg tttcaggcat caaaggtttt
      241 gaggctgaat ttaggaacag tgagacgtct ttccacctga ggaaaatacc agctcattgg
      301 aaagactcgg ccaagtactt ctgtgctctg agtaggactg acaattccag aatcttcttt
      361 ggaactggaa cccaggtggt ggtaaagccc aaa
//
LOCUS       BOVTCRA06     393 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, leader sequence, variable
            region, and joining rgion.
ACCESSION   D90015
KEYWORDS    T cell receptor; alpha chain; joining region; leader sequence;
            variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA25.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 393)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept          1  >   393     T cell receptor alpha chain, leader sequence,
                                 variable and joining region
BASE COUNT      102 a     91 c    113 g     87 t
ORIGIN      
        1 atgaagagaa aatggggagc cctgctgggg tttctgtggg tccagatttg cttggtgagg
       61 ggagtaaagg tggagcagag tccttcagtc ttgagcctgc aggagggagc caactctact
      121 ctgaggtgca atttttctga cacagtggac agtgtgcagt ggttccagca gaatcccgga
      181 ggagccctca ccaccctgtt tttcatagct tcagggacga agaagaatga aaggatgagt
      241 tccacagtga attctaagga gcggtatagc accctacaca tcacagcctc ccagctggaa
      301 gacgcagcca cctacctctg tgcggtcgat ctcggttcag gaaacaggca acttgtcttt
      361 ggaaaaggca caagacttgc tgtgactcca aaa
//
LOCUS       BOVTCRA07     399 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, leader sequence, variable
            region, and joining region.
ACCESSION   D90016
KEYWORDS    T cell receptor; alpha chain; joining region; leader sequence;
            variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA15.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 399)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept          1  >   399     T-cell receptor alpha chain, leader sequence,
                                 variable and joining region.
BASE COUNT       91 a     90 c    103 g    115 t
ORIGIN      
        1 atggctttgc agagcactct ggaagcagtg tggctgggtg tcctcctcag ctctctctgg
       61 aaggttgcag agagcaagga gcaagtattt cagtctccca ctgtggtctc tttggaggga
      121 gctgtggcag aaatttcctg taatcactct atatccaatg tttatgactt cctctggtac
      181 tttcacttcc caggatttgc accaagactc ctcattaagg gcttagggcc ttctcaacag
      241 ggacgctaca acatgacata tgagaggttc tcttcatccc tgctcatcct ccaggtgcag
      301 acggcagatg caggggttta ctattgtgct ctgaaggcga tttcaagtgg ccagaagttg
      361 gtttttggaa gtgggaccat gttaaaggtg aatcttaaa
//
LOCUS       BOVTCRA08     348 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90017
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA49.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >   348     T-cell receptor alpha chain, variable and
                                 joining region. (AA at 1)
BASE COUNT       99 a     90 c     75 g     84 t
ORIGIN      
        1 ttgagtcttg ctaagacctc ccagcccatc ttcattgact catacgaagg acaagaagtg
       61 aacatatctt gtaaccacac cacaatacgt acaatgagca atatcttgcg gtatgtgaag
      121 gttccccatc cagggaccaa atttatcatt caaggatata agacaaacgt ggaaaatgaa
      181 gtggcatctc tgttaattcc tcctgacagg aagttcagca ccctgagcct gccccaggct
      241 tccctgagag acaccgctgt gtactactgc atcgtgagag atcaatatgg gggtgctgcc
      301 aaccagctca tctttgggac tggcacgctg ctttctgtca agccaaaa
//
LOCUS       BOVTCRA09     306 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90018
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA28.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 306)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >   306     T-cell receptor alpha chain, variable and
                                 joining region. (AA at 1)
BASE COUNT       85 a     76 c     78 g     67 t
ORIGIN      
        1 ttgcttctgc aagaaggaga gaacttcacc acatactgca attcctcaag cattttaagc
       61 agcttacagt ggtacaagca gagccctggg ggcagtcctg tcgtcttgat gatattagct
      121 aagggtggag aagtgaagac agagcagaga ctgacaggtc agtttggaga gaccaaacag
      181 cacagctccc tgcacctcac ggctgcccag ctctcagatg caggaaccta cttctgcgtc
      241 agggatcgga caaacctctg gcaaaattgt ctttggaaga gggactcagc ttcatgttct
      301 cccaaa
//
LOCUS       BOVTCRA10     282 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90019
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA42.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 282)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >   282     T-cell receptor alpha chain, variable and
                                 joining region (AA at 1)
BASE COUNT       80 a     70 c     70 g     62 t
ORIGIN      
        1 accagctact gcaattcctc aagcatttta agcagcttac agtggtataa gcagaggtct
       61 gggagcagtc ctgtcctctt gatgatatta gctaagggtg gagaagtgaa gacagagcag
      121 agactgacag gtcggcttgg agagaccaaa cagcacagct ccctgcacct cacggctgcc
      181 cagctctcag atgcaggaac ctacttctgc acggggattc tgagcacagg ttatcagaaa
      241 ttcgtatttg gaactggcac ccaacttttg atcaacccaa aa
//
LOCUS       BOVTCRA11     258 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90020
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA13.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 258)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >   258     T-cell receptor alpha chain, variable and
                                 joining region (AA at 1)
BASE COUNT       81 a     70 c     54 g     53 t
ORIGIN      
        1 agacacagct tcagactact tcccttggat aaacaaggac ctgggaaagg tccccaactc
       61 cttatagcta ttcgttcaaa tatgggtgaa aaaaaagacc agagactgac tgttttattg
      121 aataagacca ccaaacatct ctccctgcac atcgcaacca tccaacctgg agactcagct
      181 gtctacttct gtgcagcaag aggtgcaggc cagaagctgg tattcgggcg aggaaccagg
      241 ctgaccatca accccaaa
//
LOCUS       BOVTCRA12     189 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90021
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA18.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 189)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >   189     T-cell receptor alpha chain, variable and
                                 joining region. (AA at 1)
BASE COUNT       59 a     52 c     33 g     45 t
ORIGIN      
        1 tttaacattc cttcaggaac aaaacaagat ggaagattaa acgccacgac aatccctaaa
       61 gaacgtcgca gctcactgca catttcctct tcacagacca cagactcagg cacttacttc
      121 tgtgctggtg aacaaacctc tggcaaaatt gtctttggaa gagggactca gcttcatgtt
      181 ctccccaaa
//
LOCUS       BOVTCRA13     171 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90022
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA40.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 171)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >   171     T-cell receptor alpha chain, variable and
                                 joining region. (AA at 1)
BASE COUNT       39 a     36 c     50 g     46 t
ORIGIN      
        1 gatctcgatg cggcaagggg atccagtgac tgtgaagtga ccctatacag tttcggggag
       61 cccattatct tttctgggag tgactctacc ttgtacttct gtgctgtgag cggggattat
      121 gggaacagac tcacttttgg aaatgggact cgagtgctgg tcacaccaaa a
//
LOCUS       BOVTCRA14     156 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90023
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA15.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >   156     T-cell receptor alpha chain, variable and
                                 joining region. (AA at 1)
BASE COUNT       39 a     36 c     31 g     50 t
ORIGIN      
        1 tttttttttt cattccttag acgctctgat gcacacagtt acctccttct gaaggaactc
       61 cacatgaaag actttgcctc ttacctctgt gttgtgatgg atggcaacta tcagtggatc
      121 tggggctctg ggaccaaact aattataaag ccagaa
//
LOCUS       BOVTCRA15     156 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90024
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA21.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >   156     T-cell receptor alpha chain, variable and
                                 joining region. (AA at 1)
BASE COUNT       49 a     34 c     27 g     46 t
ORIGIN      
        1 tttttttttc aaaagggaaa aaaaatcatc agcctcacca tctcttcctt gtacctggcc
       61 gactcagcaa agtatttctg tgctcttcgg gatagtggaa ctagcagcta taaacagaca
      121 tttggaaaag gcactctctt aattgtgaat ccaaaa
//
LOCUS       BOVTCRA16     147 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90025
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA26.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 147)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Dep. of Veterinart Public Health Obihiro University of
            Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080
            Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >   147     T-cell receptor alpha chain, variable and
                                 joining region (AA at 1)
BASE COUNT       45 a     31 c     38 g     33 t
ORIGIN      
        1 aagaaggatg aaaactactt ccacctgatg aaagcttcag cccattggag tgattctgcc
       61 aagtacttct gtgctctgtg tgatccatgg ggaaacacag gaagactcac ctttgggcag
      121 gggacagtgt tgcaagtaaa accagaa
//
LOCUS       BOVTCRA17     135 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90026
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA27.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 135)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Department of Veterinary Public Health Obihiro University
            of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
            080
FEATURES       from  to/span     description
    pept    <     1  >   135     T-cell receptor alpha chain, variable and
                                 joining region (AA at 1)
BASE COUNT       36 a     40 c     34 g     25 t
ORIGIN      
        1 aggagcggaa tgtccctaca catcacagcc tcccagctgg aagacgcagc cacctacctc
       61 tgtgtggtgg acaggactgg tgtctacaat aagctgacct ttggaaccgg aaccaggctg
      121 actgtacact cacaa
//
LOCUS       BOVTCRA18     123 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90027
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA36.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 123)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Department of Veterinary Public Health Obihiro University
            of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
            080
FEATURES       from  to/span     description
    pept    <     1  >   123     T-cell receptor alpha chain, variable and
                                 joining region (AA at 1)
BASE COUNT       34 a     39 c     28 g     22 t
ORIGIN      
        1 gagcggtata gcaccctaca catcacagcc tcccagctgg aagacgcagc cacctacctc
       61 tgtgcggtgg aggaaaacac actgatcttt ggctcaggaa ccactttgac agtcaagcct
      121 caa
//
LOCUS       BOVTCRA19     123 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90028
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA41.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 123)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Department of Veterinary Public Health Obihiro University
            of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
            080
FEATURES       from  to/span     description
    pept    <     1  >   123     T-cell receptor alpha chain, variable joining
                                 region. (AA at 1)
BASE COUNT       23 a     34 c     32 g     34 t
ORIGIN      
        1 tttttccttt ttgcggacca cagactcagg cacttacttc tgtgctgtgc agctggaagg
       61 ggggtgctgc caaccagctc atctttggga ctggcacgct gctttctgtc aagccaagca
      121 caa
//
LOCUS       BOVTCRA20      51 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, variable and joining region.
ACCESSION   D90029
KEYWORDS    T cell receptor; alpha chain; joining region; variable region.
SOURCE      Bovine T cell, cDNA to mRNA, clone BTA46.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 51)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Department of Veterinary Public Health Obihiro University
            of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
            080 Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1  >    51     T-cell receptor alpha chain, variable and
                                 joining region (AA at 1)
BASE COUNT       12 a     16 c     11 g     12 t
ORIGIN      
        1 gccaaccagc tcatctttgg gactggcacg ctgctttctg tcaagccaaa a
//
LOCUS       BOVTCRA21     955 bp ss-mRNA            MAM       27-JUN-1990
DEFINITION  Bovine T-cell receptor alpha chain, constant and 3' untranslated
            region.
ACCESSION   D90030
KEYWORDS    T cell receptor; alpha chain; constant region.
SOURCE      Bovine T cell, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 955)
  AUTHORS   Ishiguro,N., Tanaka,A. and Shinagawa,M.
  TITLE     Sequence analysis of bovine T-cell receptor alpha chain
  JOURNAL   Immunogenetics 31, 57-60 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Naotaka
            Ishiguro Department of Veterinary Public Health Obihiro University
            of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido
            080 Japan Phone:  0155-48-5111 x281
FEATURES       from  to/span     description
    pept    <     1      423     T-cell receptor alpha chain, constant region
    signal      935      940     polyadenylation signal
BASE COUNT      223 a    288 c    211 g    233 t
ORIGIN      
        1 gtcaaggacc ccaaccccac tgtgtaccag ctgagaagcc cccaatccag tgacacttct
       61 gtctgcctat tcaccgattt tgattcaaat caagtcaaca tggaaaaaat catggggtcc
      121 gaggggagca cggtgcacaa aacaaacagc accgtgctca acatggagat cctgggatcc
      181 aagagcaacg ggatagtgac ttggggaaac accagcgatg ctggatgcgc atacaccttc
      241 aacgagacca ttcccttcgc ctccagcttg gaaatctcct gtaatgccaa gctggtagag
      301 aaaagctttg aaacagatat aaacctaaac tcccaaaacc tgtcagtgat agtgttccgc
      361 atcctcctcc tgaaggtggt cgggtttaat ctgctcatga cgctgcggct ctggtccagt
      421 tgaggtcacc gacaacctga gagccccgtg ctccctcgcc ccttgctcct cgtcgccctt
      481 cctctccctc ttcaagcaga gaggcgcact ctctgccccc atggatgaga aggctccctc
      541 ctcctctctg gcctggctgg ccataacacc aactggatcc tccacatact ggtgatcaag
      601 atgctgacaa gctgcccagc acggctgcca cccactctct tcctcactgc tgcttgtcac
      661 tgcctggcat tcttggcaaa gtcggggggc tccccacccc caccccccag agactgcctc
      721 tgatgtccca ctggatggtg gatccccagt gggttctcct gggctctagc tcctggagaa
      781 tgttgtgtgt tgtttatatt tttttacata gtgttcataa aaaaatatat atcacccttt
      841 tccccaacat gtggggaaaa attatctttt cacgatctag gccctgctct tctgtgtatc
      901 cgagccacat tgtatattct gctgccacgg cttcaataaa agcgatttgg aagag
//
LOCUS       SUSFCK       3985 bp ss-mRNA            INV       27-JUN-1990
DEFINITION  S.purpuratus flagellar creatine kinase mRNA, complete cds.
ACCESSION   M33763
KEYWORDS    creatine kinase; flagellar creatine kinase.
SOURCE      S.purpuratus sperm, cDNA to mRNA, clone WD2.2.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; 
            Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE   1  (bases 1 to 3985)
  AUTHORS   Wothe,D.D., Charbonneau,H. and Shapiro,B.M.
  TITLE     The phosphocreatine shuttle of sperm: Flagellar creatine kinase
            results from a gene triplication
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.D.Wothe, 18-APR-1990.
            
              Author address: D.D.Wothe
              University of Washington
              Dept. of Biochemistry
              SJ-70
              Seattle, WA 98195
              E-mail:WOTHE@toby.acs.washington.edu
            
FEATURES       from  to/span     description
    pept         95     3619     flagellar creatine kinase (E.C. 2.7.3.2)
    signal     3961     3966     poly-A signal
BASE COUNT     1184 a    923 c    910 g    968 t
ORIGIN      
        1 ccctaaactt caatcacact tctaatcaag aaaggaacaa atttgtatta attatttcat
       61 tttctacata tgcattgtaa tcagtgaccc agcaatggga tgcgcagcat cgtcacaaca
      121 aacgacggcc acaggtggtc aaccagctgc tggtgaaaag gcaaacccgg cccccgctaa
      181 taacaatcca aacgcagcaa acaaagcaga aacgacagga gcagctgaag aactcactaa
      241 agaatctgaa ccatttgtgg aaccggatcc caattaccct gacctaagca agcacaacaa
      301 ttacctggca gaatctttaa cgccttccat ctacaacaaa atctgtaacc ttcgcacact
      361 ttctggttac tctgtggatg gttgcatgca gaccggcgtg gacaatccag gacatccttt
      421 catcaagact gtcggattgg tggctgggga tgaagagtgt tacgatctct ttgctgacct
      481 ctttgatcca actattgaca aaagacacaa cggatatccc cgcaatgcaa agcacacaac
      541 tgatctcaac cctgatcatc tcaagggtgg tgatgacttt gatccaaagt atgtcctttc
      601 atgcagagtt cgtacaggcc gctgcattag agggtacggc ctcccaccac attgcacgag
      661 ggctgaacgt cgagacgttg agaaagtttg caaagacgca ctggcaactc tagatgggcc
      721 actcaaaggc acgtattacc cactaacggg gatgacagag gaaatgcagg acaaactaat
      781 tgcagatcat ttcctattcg ataagcccgt ttctcctttg cttatgtcag ctcgcatggc
      841 ccgtgattgg cctgatggac gtgggatctg gcacaatgcc gataaaaact tcctggtttg
      901 gataaatgag gaagatcaca ctcgggttat ttccatggaa acaagtggca acatgaagaa
      961 tgtctttaaa cgattctgta atggtctcaa caaagtcgag aatgctctca aagcaaaagg
     1021 gtatgaattc tcatggaatg agcatcttgg gtatgtgctc acttgtccat ccaaccttgg
     1081 tacaggtgta cgtgcaggag tccacatcaa gattcctctt ttcagcaagc acgctggatt
     1141 tgaatccatt ctgaagcatt atcgtctcca gaaacgtggc acaggaggcg tggatacggc
     1201 atccactgac ggcacgttcg acatttcaaa cctagatcgt cttggcactt ctgaagtaca
     1261 acaggttcaa tccgttgttg atggtgtcaa gaagctgatt gagttggaaa aagccttaga
     1321 aaagggatca gatatcagcg gccagattcc ccgggaccca gcaatagttc gtgcagaaca
     1381 agtgaaagaa ggatatccag acctatccaa acacaacaat catctggccc actgtctgac
     1441 atacgacatt tggaaaagtc tcaaggacaa aaagacgcca agtggcttta ctctggatgg
     1501 ctgcatccaa acgggagtca tgaaccctgg tcatcctcac atcatgaccg ttggaatggt
     1561 tgcaggagat gaggaatcat atgacgtctt cgctgatatc tttgacccgg tcattgatgc
     1621 acgacacgga ggttatccca aagatgccgt ccatgtcacc aacataaacc atgcagatct
     1681 taaaggagga gataacctgg atcccaaata tgttttgtct tgccgtgttc gcacaggacg
     1741 cagcatcatc gggtattctc tgcccccgca ttgcaccgtg gaagagagag ctgccgtaga
     1801 gacgatcacg atcggagctc ttgataaatt tgatggagac ttgcaaggaa aatactaccc
     1861 actggaagga atgtctgatg aaacacaaac acagctgata gatgaccatt tcctcttcga
     1921 taagcctgtt tctcctcttc ttaccgccgc aagaatgcat cgtgattggc cacagggaag
     1981 aggaatttgg cataacgaaa acaagaactt cttggtgtgg gtcaatgaag aagatcatat
     2041 ccgtgtcatc tctatggaga aggatgggaa tatgcgagcg gtcttcaagc gattctgtga
     2101 aggtttgcag aagttcgagc agatgatcaa aaaggatggc aaggagttca tgtggaacaa
     2161 acatctgggt tacgtactta cttgtccgtc taacctggga accggcctac gcgctggcgt
     2221 acatgtgaaa ctgcctctgc tgagcaagta tccgagattc gaccagatcc taagagccct
     2281 ccgacttcag aagagaggta caggtggagt agatactgcc tcaactgatg gaaccttcga
     2341 catctcgaac ctcgacaggc ttggatcatc agaagtacag caagttcagt ttgttgtcga
     2401 tggcgttgaa cttctcgtcc aaatggagaa gaaacttgaa aagggcgagg acatttttga
     2461 tattctacca cagcaatgcc gtcccaaacc accaataaag cctttcagtt atgactaccc
     2521 cgatttctct ctgcacaaca actggatgtc taaatgtatg acagaagaaa tctacaacaa
     2581 actttgcaac ttgaagacaa aaggtggagt caccctgaat gactgcattc aaactggcat
     2641 tgacaatcct ggacatccat acatcatgac tgttggtttg gtagctggag atgaagagtg
     2701 ctacgaggtt ttcgctcctc tcttcgaccc ggtgataagt gccaggcacg gcggctacgc
     2761 tcttgatgcc aaacacccta ccaacctcaa tgcagcagaa ctcaaaggcg gagatgattt
     2821 ggaccctgag ttcgtcctat cttgtcgtgt acgtactggt cgttgcatcc gcggcttggc
     2881 tctcccgcca tgttgcacca gagcagaacg tgcggaagta gagaagatta caacagaagc
     2941 attatctacc ctaagtggac ctctgaaggg aaagtactac ccactaaccg gtatgacaga
     3001 cgaagagcaa gagaagctca tcgaagacca cttcctcttt gacaagcccg tatccccgct
     3061 gttgctttgt gccaatatgg cacgtgactg gccccagggt cgtggcatat ggcacaatga
     3121 tgagaagaat ttccttgtct gggtcaacga ggaagatcac actcgagtca tctccatgga
     3181 gaagagcggg aacatgaagc gagtgtttga gaggttctgc gacggtctca aaaaggtcga
     3241 agactccatc aagtccaagg gttaccagtt catgtggaat gagcatcttg gttatgtact
     3301 gacatgtcca tccaacctcg ggacgggact gcgtgcaggc gtccatgtca aggttccttt
     3361 gttaagtcaa cagaaaatct ttgactctat ccttgatcac atgcgtctgc aaaagcgagg
     3421 aacaggaggt gtagacacag cctctactga cggcaccttt gatatctcca actcggatcg
     3481 catcggtttc tccgaagtcc atcttgtcca acaactcgta gacggtgtca agctcctcgt
     3541 caacctcgaa aaggctctca tgaagggaga agacatcaac agtctcctgc cagagaagct
     3601 aagggaagat tcctcataaa gccaggatcc acagcttgca gaatagtggc cgttgtcgcg
     3661 caagtgttat aaatgccacc tttagaagat gtagcctaca cattaaaaaa aaatacaatt
     3721 agaattgaaa tctataagga cgatttgaat ataccataaa gatttgccca tgattttgaa
     3781 tcaaaataac ttcatgctga acttgtaaat gaaaaagtaa tgttagaaaa ctctgtaaat
     3841 tgggaagata taaaatttca tggccgtttt tctttctgat gaagtacact aattatatta
     3901 atttctctta ttaatatgta tggaaacttt taattatgta aaaacaagga aacgaactga
     3961 aatacaattc aaactacttt tcatt
//
LOCUS       HAMPPTPCA    2126 bp ss-mRNA            ROD       27-JUN-1990
DEFINITION  Hamster dihydroorotate synthetase mRNA, partial cds.
ACCESSION   M33702
KEYWORDS    CAD; aspartate transcarbamoylase; carbamoyl phosphate synthetase;
            dihydroorotase; dihydroorotate synthetase;
            pyrimidine pathway trifunctional protein complex.
SOURCE      Hamster SV40 transformed cell line BHK21, cDNA to mRNA, clone
            pCAD142.
  ORGANISM  Cricetulus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 2126)
  AUTHORS   Williams,N., Simpson,R.J., Moritz,R.L., Peide,Y., Crofts,L.,
            Minasian,E., Leach,S.J., Wake,R.G. and Christopherson,R.I.
  TITLE     Location of dihydroorotase domain within trifunctional hamster
            dihydroorotate synthetase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.I.Christopherson, 13-APR-1990.  Univ. of Sydney, Dept. of
            Biochem., NSW, 2006 AUSTRALIA
FEATURES       from  to/span     description
    pept    <     1  >  2126     dihydroorotase synthetase complex (AA at 3)
                                 (EC 3.5.2.3)
BASE COUNT      415 a    676 c    609 g    426 t
ORIGIN      Short arm of chromosome B9.
        1 gtcgacggct ttcctccttc gtcaccaagg gctaccgcac gcggcgcctg gctgctgact
       61 tctctgtgcc tctcatcatc gacatcaagt gcaccaaact cttcgtggag gccctgggtc
      121 agattggccc cgccccgcct ttgaaggttc atgtagactg catgacctcc cagaaactgg
      181 tgcggctccc tgggttgatc gacgtccatg tgcaccttcg ggagccaggt gggacacaca
      241 aagaagactt tgcctcgggc acagctgctg ccttggctgg gggtgtcacc atggtctgtg
      301 ccatgcctaa tacccggccc cccatcattg atgcccctgc tctggccctg gcccagaagc
      361 tggcagaggc cggtgcccgc tgtgactttg ccctgttcct tggggcctca tctgaaaacg
      421 cggggactct gggtgctgta gccgggtctg cagcaggact aaagctctac ctcaacgaga
      481 ccttctccga gctgcggctg gacagtgtgg cccaatggat ggagcatttt gaaacgtggc
      541 cttcccacct ccccattgtg gcccatgcag agcggcagag tgttgccgca gtcctcatgg
      601 tggctcagct gacccagcgc ccagtgcaca tatgtcacgt ggctcggaag gaagagatcc
      661 tgctgattaa aactgcgaag gcacaggggc tgcctgtgac ctgtgaggtc gcaccccacc
      721 acctcttcct gaatcgggaa gacttggagc gcctgggacc tgggagggga gaggtccggc
      781 ctgagcttgg ttcccgagag gatatggagg ctctgtggga gaatatggct gtcatcgact
      841 gctttgcctc agaccacgct ccccataccc tggaggagaa gtgtgggccc aagcctccac
      901 ccggcttccc agggctggag accatgctgc cgctcctgct gacagctgtg agtgagggcc
      961 ggctcagtct ggatgacctg ctacagcgcc tgcaccacaa cccgcgccgg atcttccacc
     1021 tgccccttca ggaggacacc tatgtagagg tggatctgga gcatgagtgg acaatcccta
     1081 gccacatgcc cttctccaag gcccgctgga ccccttttga ggggcagaag gtgaagggta
     1141 ccatccgccg tgtggtcctg cgaggggagg ttgcttatat cgatggacag gtgttggtac
     1201 ccccaggcta tggacaagat gtacggaagt ggcctcaggg ggctgttccc cagccccctc
     1261 cttcagctcc tgccaccaca gaaataacca cgacacctga gagaccacgc cgagtcatcc
     1321 caggccttcc tgatggccgc ttccacttgc caccccgaat ccacagagcc tctgacccag
     1381 gtttgccagc tgaggaaccc aaagagaagc catccaggaa agtagtggag ccagagctca
     1441 tggggacccc cgacggtccc tgctaccctg caccgccggt gcctagacag gcatcacctc
     1501 agaacctggg ctcttctggc ctactgcacc cacagacttc acccctgctg cactccttag
     1561 tgggccaaca catcctgtct gtcaagcagt tcactaagga tcagatgtct catctgttca
     1621 acgtcgcgca cacactacgg atgatggtgc agaaagagcg gagccttgac atcctaaagg
     1681 gcaaggtcat ggcctccatg ttctacgagg tgagcacccg caccagtagc tcctttgcag
     1741 cagccatggc ccggctcggg ggcgctgtcc tcagcttttc agaagccacg tcctccgtcc
     1801 agaaggggga atcccttgcc gactctgtgc agaccatgag ttgctacgct gatgtcgttg
     1861 tgctccggca ccctcagcct ggagctgtgg agctggcagc caaacactgt cgcagaccag
     1921 tgatcaatgc tggggatgga gtcggagagc accctactca ggccctgctg gacatcttca
     1981 ctatccggga agagctgggg actgtcaatg gcatgacgat caccatggta ggtgacctga
     2041 agcatgggcg cacagtgcac tccctggcct gcctgctcac ccagtaccgt gtgagcctac
     2101 gctacgtggc acctcccagc ctgcgc
//
LOCUS       TRBALDA      1119 bp ds-DNA             INV       27-JUN-1990
DEFINITION  T.brucei fructose-bisphosphate aldolase (ald) gene, complete cds.
ACCESSION   M19994
KEYWORDS    fructose-bisphosphate aldolase.
SOURCE      T.brucei brucei (strain 427) DNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1119)
  AUTHORS   Marchand,M., Poliszczak,A., Gibson,W.C., Wierenga,R.K.,
            Opperdoes,F.R. and Michels,P.A.M.
  TITLE     Characterization of the genes for fructose-bisphosphate aldolase in
            Trypanosoma brucei
  JOURNAL   Mol. Biochem. Parasitol. 29, 65-76 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.A.M.Michels, 03-AUG-1988.
FEATURES       from  to/span     description
    pept          1     1119     fructose-bisphosphate aldolase (ald) (EC
                                 4.1.2.13)
BASE COUNT      240 a    334 c    305 g    240 t
ORIGIN      
        1 atgtccaagc gtgttgaagt tctgcttacc caactccctg cgtacaaccg cctgaagacg
       61 ccatatgaag cggagctgat tgaaaccgcc aaaaagatga ccgcccccgg taagggtctc
      121 ctcgccgccg atgagtccac tggttcttgc tcgaagcgct ttgcgggcat cggtctcagc
      181 aacactgcag agcaccgccg tcagtaccgc gctctcatgc tggaatgcga aggtttcgag
      241 cagtacatca gcggtgtcat cctgcacgat gagaccgtgt atcagaaagc taagaccggc
      301 gaaacattcc ctcagtacct ccgtcgtcgt ggtgtggtgc ctggcatcaa aaccgattgc
      361 ggtctcgagc ccctcgtgga gggtgccaag ggcgagcaga tgactgctgg tctcgacggt
      421 tacatcaaac gggccaagaa atactacgct atgggctgcc gcttctgcaa gtggcgcaac
      481 gtgtacaaga tccagaacgg caccgtgtct gaagctgttg ttcgtttcaa cgctgaaaca
      541 ctcgctcgct acgctatcct ctcccaactc tgcggtcttg tcccgattgt ggagcctgag
      601 gtgatgatcg atggcactca cgacattgag acttgccaac gcgtttcgca gcatgtgtgg
      661 tcggaggtgg tttctgcact ccaccgccac ggcgttgtat gggagggatg cctgctgaag
      721 cccaacatgg tcgtccctgg cgctgaatcc ggcctcaagg ccactgcgga gcaggttgcc
      781 gagtacaccg tcaaaaccct cgcccgtgtt attccccctg cgctccccgg tgtgacgttc
      841 ctatcaggtg gcctcagtga ggttatggcc tccgagtacc tcaacgctat gaacaactgc
      901 ccgttaccac gcccatggaa actgaccttt tcatacgccc gtgcacttca gtccagcgcc
      961 ataaagcggt ggggtggaaa ggaatctggt gtcgaagctg gccgccgtgc gttcatgcac
     1021 cgcgcaaaga tgaactcact tgcccaactc ggcaagtaca accgtgctga tgacgacaag
     1081 gactcgcagt ctctgtacgt tgctggcaac acatactaa
//

GenBank-Updates@genbank.bio.net (06/28/90)

LOCUS       HUMSODB      8841 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human ornithine decarboxylase gene, complete cds.
ACCESSION   M33764
KEYWORDS    ornithine decarboxylase.
SOURCE      Human sultan myeloma cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 8841)
  AUTHORS   Hickok,N.J., Wahlfors,J., Crozat,A., Halmekyto,M., Alhonen,A.,
            Janne,J. and Janne,O.A.
  TITLE     Human ornithine decarboxylase-encoding loci: Nucleotide sequence of
            the expressed gene and characterization of a pseudogene
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.J.Hickok, 18-APR-1990.
FEATURES       from  to/span     description
    pept       3687     3788     ornithine decarboxylase, exon 3 (first
                                 expressed exon; E.C. 4.1.17) /nomgen="ODC1"
                                 /map="2p25" /hgml_locus_uid="LP0064V"
               4072     4245     ornithine decarboxylase, exon 4
               4452     4624     ornithine decarboxylase, exon 5
               4883     5017     ornithine decarboxylase, exon 6
               5148     5229     ornithine decarboxylase, exon 7
               5406     5489     ornithine decarboxylase, exon 8
               6546     6708     ornithine decarboxylase, exon 9
               6791     6903     ornithine decarboxylase, exon 10
               6997     7211     ornithine decarboxylase, exon 11
               7851     7995     ornithine decarboxylase, exon 12
    pre-msg     388     8338     ODC1 mRNA and introns
    IVS         602     3454     ODC1 intron A
    IVS        3565     3669     ODC1 intron B
    IVS        3789     4071     ODC1 intron C
    IVS        4246     4451     ODC1 intron D
    IVS        4625     4882     ODC1 intron E
    IVS        5018     5147     ODC1 intron F
    IVS        5230     5405     ODC1 intron G
    IVS        5490     6545     ODC1 intron H
    IVS        6709     6790     ODC1 intron I
    IVS        6904     6996     ODC1 intron J
    IVS        7212     7850     ODC1 intron K
    signal      359      363     TATA box
    rpt        2277     2586     Alu-repeat
BASE COUNT     2261 a   1884 c   2279 g   2417 t
ORIGIN      
        1 aggctctgcc gttcagctgc cgcgggcggg gccggggcct gcggcgtcgt gcgccgtgcg
       61 ggaccagttc caggcgggcg agaccgcgca gggcggggcg gggcgaggcg gccgcagggc
      121 ggggagggcg gggagaggcg gccgcagggc ggggagggcg gggcgcgaag ccgggggcgg
      181 gggccacgcg tggggcaggc ggtgctcggc tcggctgacg tcggcccgcc ggcgccccac
      241 cacgtccgcg cgggcccggg ttgccaccgc cggcccccgc ccctcccccg gcggtgtccc
      301 ggccggaacc gatcgtggct ggtttgagct ggtgcgtctc catggcgacc cgccggtgct
      361 ataagtaggg agcggcgtgc cgtggggctt tgtcagtccc tcctgtagcc gccgccgccg
      421 ccgccgcccg ccgcccctct gccagcagct ccggcgccac ctcgggccgg cgtctccggc
      481 gggcgggagc caggcgctga cgggcgcggc gggggcggcc gagcgctcct gcggctgcga
      541 ctcaggctcc ggcgtctgcg cttccccatg gggctggcct gcggcgcctg ggcgctctga
      601 ggtgagggac tccccggccg cggaggaagg gagggagcga gggcgggagc ggggcgggct
      661 gcgggcccgg gcccgggcac gtgtgcggcg cgcctcgccg gcctgcggag acacgtggtc
      721 gccgagcggg ccacgacctt gaggcgccgc ttcctcccgg cccggggttc tcccgcggct
      781 ggataagggt gatccgggcg cctcgttctg cccccgtctt cacagctcgg ggctggaggg
      841 gcctagggga gacccacccg gagaccctgc ggccccgcgc cggcctcttt cccaaccctt
      901 cggcggccgc gcgctggccg gggagccgtt ggggaggccc tggcggccgc gcagcaggtg
      961 caggggcgca gagcctgggc tcgccttggt acagacgagc gggccccggc cttggcgcct
     1021 tcagtttcct tccagttttt attttcgctg tgtctacaga gcagatgaca ccaatttgga
     1081 aacccgcgag agtgggtaga gctaagatag tcttgctgta gtagctgtga tattagatgc
     1141 tcggccatga cttagaggtg tttatttaag gactgtgaat gactcggtga tttcggaaaa
     1201 gcttggctta gatgaacgga catacacagg ggagacagcc ctaaggtttg cagaaaaggc
     1261 tgattgtgct gtttgcgaag tcgaaataat tggtgaaagt gtagaaggca gaacctctca
     1321 ggaatgtctg gggaggacaa agaatgtgtt ggctgacttt gtttaaacat aaaattgggc
     1381 agactttaat tgatttgtga aatttttttc aaagtttgtt tgaattagcc cctatctctt
     1441 ctaacattat cctcttgtgc taattgattg accattttaa ataacttagc tgttacagaa
     1501 agaccgaaag gtgttcttca gtaaaatata ttcaagtaag ttacttaagt aacgccttaa
     1561 aagatacaga aaagcaaaaa agtattggcg tattaaaaag aaatcaaaac tttccaagtt
     1621 taggcctgaa cattgcctta aaaatattta ataaggcctc aaatgaccca gtccgagact
     1681 gcatgagcct atttattatt aaattgtaaa tattcttcat ataaacaaaa atatataacc
     1741 atgtctgtaa caaaaatggt tttgctagcg ttgttactct cttcccttct ccgaggggtg
     1801 atttaggcaa cttcggaggt tgacaatgcc aagcagtcac aatagataga gctttaaagc
     1861 aaattctatg catgggtttg gatttatgac aggcccgtca ccctgggcct gtcatagtac
     1921 cccatgccag agcaaactgt gtccccgaac cattgcctgg cctctgtgcc cgtaggctgc
     1981 tggcactgaa gtgggttgca cagtggaaaa gaagaaagct ctacctggca gaaattttta
     2041 aaggttaaaa taaataattt taagaaagct ggttcacaag gtgccacatt tgatgaaagc
     2101 aaaatacagt ggcttttatt gttactagag tgatgttctt gcttgttttt cttttttggt
     2161 gaagttagcc ccaaattatt ctcatagcta agcaaatacg agagtgactg taaggacagt
     2221 tggcattccc ggaattgcta aacttggtag gcaacgctgg tttaagaata ctgagttcta
     2281 gccgggcgtg gtggctcacg cctgtaatcc caacactttg ggaggctgag gcaggcggat
     2341 cacctgaggt cgggagttgg agaccagcct gactaacatg gagaaacgcc atctccacta
     2401 aaaatataaa attagccagg ccccgggtgt ggtggcacat gccggtaatc ccagctactc
     2461 gggagactga ggcaggagaa tcgcttgaac ccaggaggcg gaggttgagg tgagccgaga
     2521 tcatgccatt gcactccagc ctgggcaaca agagtaaaac tctgtctcaa aaaaaaaaaa
     2581 aaaaaatact gaattctgat caggtaacag caactgtaat acaatgtgat aagttgactt
     2641 gaagattaca gtttttaaga agtatatacc cagctaatac atgaaaatta actcgtaaaa
     2701 tctcaaatgc tccagacatt tccatgatgc ctgttggtca gtaaaaatca ttctaagact
     2761 tagtggaagt aggaaatgtt tgtatggcaa cgtggtgaaa tcctgtctct actaaaaatg
     2821 tgtataaagg ctataatgta atcccagcac tttggaagac cgaggcgggt ggatcacctg
     2881 gggtcaggag tttgagaccc acctggacca caaaaattag ccgggcatgg tggcaggcgc
     2941 ctgtaatccc agctgctggg gaggctgagg caggagaatc gcttgaaccc gggaggcaga
     3001 ggttgcagtg agccaagatt gcaccgctgc actccagcct gggtgacagc gtgagactct
     3061 gtctcaaaaa aaataaaaaa gtctataatg ctattttaag tttctaagga actgaaactg
     3121 ctctgaaata aatcagacca ttataagact tttttccata tcagtgagct aagtgcagat
     3181 aagcttctga aacttgcatg ctagattttt ttggtacaaa tatttgaaat gcttagtgtg
     3241 ctgccttgga aaaacctggt attttttgtt gtgtccttat actgccaagg tttatggaat
     3301 catgtacctt atgcctagta ataattagga tgaccaggcc agtgagtggt tcatatccgg
     3361 ggcatgatta gctctgcgtg tgctcagcca gtgccccatc ttcaactcga tgtgttccta
     3421 aggtagacag caaattccct attttatttc tcagattgtc actgctgttc caagggcaca
     3481 cgcagaggga tttggaattc ctggagagtt gcctttgtga gaagctggaa atatttcttt
     3541 caattccatc tcttagtttt ccatgtaagt attcagttta catttatgtt gcaggttaat
     3601 cttaagaatt gtattgctaa ggcttctaag tgaatttctc cactctattt gcattttgtt
     3661 gcatttcaga ggaacatcaa gaaatcatga acaactttgg taatgaagag tttgactgcc
     3721 acttcctcga tgaaggtttt actgccaagg acattctgga ccagaaaatt aatgaagttt
     3781 cttcttctgt aagtatatga ggcccatgct ggcagtgcag ctgagagtgc caggcaagtg
     3841 gaaaactttg gcaaggtcta aggaagagca atgaggctta catgtcttgt tatggaatgt
     3901 agaaattaat tcactggtgg taaattaata gtgataatgg tgatactcat atcagtggct
     3961 agactcaaaa gagcaggatt cattgtgact gatgggaatg aaggtcgctg gctattggtg
     4021 tggtgtgtgg tgaggctgct agtgagtcac ctgtgaccac tcttgtttca ggatgataag
     4081 gatgccttct atgtggcaga cctgggagac attctaaaga aacatctgag gtggttaaaa
     4141 gctctccctc gtgtcacccc cttttatgca gtcaaatgta atgatagcaa agccatcgtg
     4201 aagacccttg ctgctaccgg gacaggattt gactgtgcta gcaaggtaag cgatagcagc
     4261 aggcctcaaa agcgttgtat aaaatgggcc tggtattccc cacgaggcag atacaagttg
     4321 tgttttttgg gcaataaatg ctcactaaag gcaaatgggg cgggggggta catgacaact
     4381 tcccatgctt ttctgtttat tccacgtgtt aagccacata tggatagcat gacaccactc
     4441 ttctttttca gactgaaata cagttggtgc agagtctggg ggtgcctcca gagaggatta
     4501 tctatgcaaa tccttgtaaa caagtatctc aaattaagta tgctgctaat aatggagtcc
     4561 agatgatgac ttttgatagt gaagttgagt tgatgaaagt tgccagagca catcccaaag
     4621 caaagtgagt tattccccca tctgagggca agatcgggag cataagatat gtggattctt
     4681 atcaaacaaa cttaaatttc tgattattat atttctatac tttagtagaa agtagttgaa
     4741 acccccattg agtcatgaag cctgggactc aaactacaga atatatcagc gacagtattt
     4801 agaacaggat tgtttttatt ttaattgtgg ctataagtga acatctatca tgagacattt
     4861 gctgcacttt ccttgcttgt aggttggttt tgcggattgc cactgatgat tccaaagcag
     4921 tctgtcgtct cagtgtgaaa ttcggtgcca cgctcagaac cagcaggctc cttttggaac
     4981 gggcgaaaga gctaaatatc gatgttgttg gtgtcaggtg agattttggt gggatagcta
     5041 gaggtcaaga cattgaacag tttgagtttt acaggctttc tcctagtgtt tgctattatt
     5101 ttaagaaata ctaagacaca gtgtctcgtc tctttatttt accccagctt ccatgtagga
     5161 agcggctgta ccgatcctga gaccttcgtg caggcaatct ctgatgcccg ctgtgttttt
     5221 gacatggggg tgagtatacg tgaccctgtt agggaagggc gggacacaac tgacaataac
     5281 tagtcttaat tctagagtta actttttatg gcagttggtt ctgtattaca tgggtttcag
     5341 cctatctgct gcatacattt ttgttattag ctgtggatct ggctgactta ttttcttgat
     5401 tctaggctga ggttggtttc agcatgtatc tgcttgatat tggcggtggc tttcctggat
     5461 ctgaggatgt gaaacttaaa tttgaagagg taatttagaa caaaactgta atactcagta
     5521 gccgttctaa taaattcctt tttggaatat ttcaaaattt aagtgtctta actaatacca
     5581 caatgggctg aagtgtcttg gtgtgatatt tttgagtgat ttctttgtgc tgtctgacat
     5641 tacacttgat accatttggt tttctaaagt gtgaatcagc tttcccagaa gtcttggata
     5701 attggttaca ttggaaatca tggctcacac ctgtaatcca gcacttgggg aggccaaggt
     5761 ggtaggatca cttgagccca ggagtttgag accagcctgg gcaacacagt gagaccccat
     5821 ctctacaaaa aaaattttaa aattagcctg gtgtggtggc gggcacctgt aatcccagct
     5881 acttggaagg ctgaggtggg aggatcactt gagcccagga ggttgaggct gcagtgagcc
     5941 atgatcatgc cactgcactc agcctgggct acagagtgag accctgtctc aaaaaaaaaa
     6001 aagaaaaagc atgttgctgt gggcttccta gagaatatgc tgactgtagc acatcatcac
     6061 cccaaatgtg ctttgctaga cctatgcttc ctctccttaa aatacttgaa atgtttagtc
     6121 acttaggaag ttaagccatt atattggtgc ttgaatttat aaaatatatc cacatggttt
     6181 gttaaaatca tgacgtaggc agaataggat ttttatcctg ttggcatgta tttgttaaaa
     6241 tgttttgaca tcttgatgcc ttcctaggta gtagttagtt gcgtactgtt ctttgataaa
     6301 aatcataccc ataacatcct aaaggagata gggtgcctgg aggggaatga aaacgagcca
     6361 cctgggatat gtagcctggt tttcagggag atgttgatgt ttttttgctt ttgttacttt
     6421 aatgataaac ctgtctgttg atgcctggtc tcatgatgtc atgtcacaag gccctgtgat
     6481 gttactcccc catgtgaatt tcccacaatg aaggctgctc tttcttttct gtttcactct
     6541 cttagatcac cggcgtaatc aacccagcgt tggacaaata ctttccgtca gactctggag
     6601 tgagaatcat agctgagccc ggcagatact atgttgcatc agctttcacg cttgcagtta
     6661 atatcattgc caagaaaatt gtattaaagg aacagacggg ctctgatggt atgtataaag
     6721 gacgaatcac ttcatgtata actgaaagct gatgcaaaaa gtcattaaga ttgttgatct
     6781 gcctttctag acgaagatga gtcgagtgag cagaccttta tgtattatgt gaatgatggc
     6841 gtctatggat catttaattg catactctat gaccacgcac atgtaaagcc ccttctgcaa
     6901 aaggtaattt ctgagcatac tgtataaaac aattaagagg actggtcaca acacgtgtaa
     6961 ttaagtagta cttcctctct ccgtctcttt atatagagac ctaaaccaga tgagaagtat
     7021 tattcatcca gcatatgggg accaacatgt gatggcctcg atcggattgt tgagcgctgt
     7081 gacctgcctg aaatgcatgt gggtgattgg atgctctttg aaaacatggg cgcttacact
     7141 gttgctgctg cctctacgtt caatggcttc cagaggccga cgatctacta tgtgatgtca
     7201 gggcctgcgt ggtaagtaag ccatgcatgt tgatggtgct gccaagaata ggcaccttct
     7261 tggatgtgtg cttcttgtct agacgaataa gaaattgtct tgcctaagat taaatatata
     7321 tggatatttt tcctaagaaa agttttagaa aagactgatg agtgtatttc tatgtaattg
     7381 gaatatattt aagttcatgc catgtgtctt gtggtttcct tattaccaaa acggtgactg
     7441 aagaaacgct tgctttagaa atacattgaa ttggccaggt gtgctggctc acacctgaaa
     7501 tcacaacaca ttgggaggcc aaggcagaag gatcacttga gcccaggagt tcgagcctgg
     7561 gcaacatagt gagaccctgt ctctacaaaa aattaaaaaa ttagttggcc atggtagtgg
     7621 gcgcctgtag tcccagctgc ttggctaagg tgagaggttt gcttgagcct gggaggttga
     7681 ggctgcggtg agctatgata gcaccattgt attccagcct gagtaacaga gaaagaccct
     7741 gtctcagaaa aaaaaaaaat acattgaatt gtttcctgat ggaagtaaat actctcatgc
     7801 ccagttagga gtgagtcagg gtttttaata tgccactttt tctttctcag gcaactcatg
     7861 cagcaattcc agaaccccga cttcccaccc gaagtagagg aacaggatgc cagcaccctg
     7921 cctgtgtctt gtgcctggga gagtgggatg aaacgccaca gagcagcctg tgcttcggct
     7981 agtattaatg tgtagatagc actctggtag ctgttaactg caagtttagc ttgaattaag
     8041 ggatttgggg ggaccatgta acttaattac tgctagtttt gaaatgtctt tgtaagagta
     8101 gggtcgccat gatgcagcca tatggaagac taggatatgg gtcacactta tctgtgttcc
     8161 tatggaaact atttgaatat ttgttttata tggattttta ttcactcttc agacacgcta
     8221 ctcaagagtg cccctcagct gctgaacaag catttgtagc ttgtacaatg gcagaatggg
     8281 ccaaaagctt agtgttgtga cctgttttta aaataaagta tcttgaaata attaggcatt
     8341 gggacgtttt tatggtgtgt tcattccaga cagttcacga atcccgtata gctcgctctg
     8401 attctcagag aacaatgagt gggtccaccc acacacaggt aggaggacag gtgagacgga
     8461 agccccatcc tcccatgtgg acggtgcaca tctgctcagc ccaccccaca tgtccagagt
     8521 tggctgcaaa ctccttgtcc agagcctctg gtggtgggac ctacttaagt ctgacggacc
     8581 tgtcctgtcc aggccagtgc ccagggaagg tgtgggaggc cctttgagcc tggcctgcag
     8641 agaccatccg tgtcccctcc caccttcatg cctgtgagaa gttaggaatg tatacggtac
     8701 cacatttggc agtcagctta ttttaataaa ttcagcaaca gcaagtccct accatgttgt
     8761 gtatcttcac catcttgtct gaccatgacc actggccttg tgtgttcttt tactcaacgt
     8821 gtacccccgc tctcccccaa a
//
LOCUS       XELFIMB1     1341 bp ss-mRNA            VRT       28-JUN-1990
DEFINITION  X.laevis integumentary mucin B.1 (FIM-B.1) mRNA, complete cds.
ACCESSION   J02910
KEYWORDS    integumentary mucin; mucin.
SOURCE      X.laevis skin, cDNA to mRNA, clones pREP1218 and pXGL-3'-1.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1341)
  AUTHORS   Probst,J.C., Gertzen,E.-M. and Hoffmann,W.
  TITLE     An integumentary mucin (FIM-B.1) from Xenopus laevis homologous
            with Von Willebrand factor
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Hoffmann, 26-APR-1990.
FEATURES       from  to/span     description
    pept    <     1     1197     integumentary mucin (FIM-B.1; AA at 1)
    mRNA    <     1     1341     FIM-B.1 mRNA
    signal     1319     1324     poly-A signal
    rpt           1      279     repeat sequence
BASE COUNT      472 a    290 c    268 g    311 t
ORIGIN      
        1 cctgcacctt cagaaacaac aggagagtca acacctgcac cttcagaaac aacaggagag
       61 tcaacacctg caccttcaga aacaacagga gagtcaacac ctgcaccttc agaaacaaca
      121 ggagagtcaa cacctgcacc ttcagaaaca acaggagagt caacacctgc accttcagaa
      181 acaacagtac catcaggaga gtcaacacct gcaccttcag aaacaacagt accatcagta
      241 ccatcaggag agtcaacacc tgcaccttca gaaacaacag agctgagaat catcccacca
      301 gaagtatcta cagttgcagt gccagtaaca actggacaga taactcctgc agtaacaact
      361 gaacacagca cagaagaaat actgacactt cctccgccag tggtaggacc agttcttcca
      421 gcaaagccaa cagtagatat ctccaaatat acaaacacta caactactaa atcgactgtc
      481 cctacaacta ccattcctcc taaagctact tgctgtggat cgtctggaga gtcagtacaa
      541 gcaggtcata tgtggcaaac tggttgtgat gtgtgcacat gcaatggaac ctctggaaaa
      601 acacaatgtg caccacgcca gtgtgaaaag gagataatct gtaagagcga cgaaaggagg
      661 gtactgagaa aacctgggaa atcatgctgt ggatattgcg agccgctgac atgtaaacat
      721 aatggaacag aatataagct tggggctacc tttatagaca aatcgaatcc ttgtatcact
      781 tatagatgtg acgcatcagg cttgacagtg aatgttaaat catgcccaaa tgaacaagtt
      841 tgcagtaaga gtgaaagaac ctatgattca gatggctgtt gtttctcatg tgacacatca
      901 tgtaaaccag tccctgcgac agtgggtatc caaggggagt acgattatca gaatgaaaaa
      961 acaaactgct ctgctaatat cataatggca aaatgttctg gtcaatgcca acataaattg
     1021 acctatgata caattgacaa taaagttgtt accaaatgtc ggtgttgtaa ggcagaccgt
     1081 gttgaaccaa gaaaagctca tctggtgtgt gataatggaa agaaaaaaat atataaatac
     1141 aaacatatca catcttgcaa atgcacatct tgtaccgctt acaatataag actgtgaaga
     1201 ctttcatgca tttgctaact gtcagctttt gcatgtacaa taaaataaac atgggggata
     1261 tattggaggc agagatatct gttatggtac ataaagtatt ttttgtctgt tattgaaaaa
     1321 taaatggttt ttttcagctg t
//
LOCUS       LACISTRAN     808 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  L.lactis insertion sequence IS946 transposase gene, complete cds.
ACCESSION   M33868
KEYWORDS    transposase.
SOURCE      L.lactis DNA.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 808)
  AUTHORS   Romero,D.A. and Klaenhammer,T.R.
  TITLE     Characterization of gram-positive insertion sequence IS946, an
            Iso-ISS1 element, isolated from the conjugative Lactococcal plasmid
            pTR2030
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.A.Romero, 26-APR-1990.
            
                        Author address: D.A.Romero
                        North Carolina State University
                        Dept. of Food Science
                        Box 7624, Raleigh, NC 27695
FEATURES       from  to/span     description
    pept         74      754     transposase (put.)
    rpt           1       18     inverted repeat
    rpt         791      808     inverted repeat
    signal        6       35     promoter region
    binding      62       65     ribosomal binding site
BASE COUNT      281 a    137 c    158 g    232 t
ORIGIN      
        1 ggttctgttg caaagttttc tgataagtct attttagtgt aaaatgaata aaaatgacag
       61 cgaggatata tcaatgaact attttaaagg taaacaattt caaaaagatg tgattattgt
      121 cgctgttggt tactacctgc gttacaatct aagctatcgt gaagttcagg aattgttata
      181 tgatcgtgga ataaatgttt gtcatactac gatttatcgt tgggtgcaag aatacagtaa
      241 agtcctctat catctctgga aaaagaaaaa tagacagtcc ttctattcgt ggaaaatgga
      301 cgaaacctat atcaaaatca aaggtcgttg gcattatctc tatcgtgcaa ttgatgcgga
      361 tggcttaacc ttagatatct ggttacgaaa gaaacgggat acgcaagcag cctatgcttt
      421 cttaaaacga ctccataaac agtttggtca accaagagta attgtcacgg ataaagcgcc
      481 ctctattggt tctgcattta gaaagttaca gagtaacggt ttatatacta agacagagca
      541 tcgaaccgtg aagtatctca ataacctcat tgagcaagac catcgaccaa tcaaacgacg
      601 caataaattt tatcgaagtc tacgaactgc ctcaaccacg attaagggca tggaaacaat
      661 tcgaggaata tacaaaaaga accgaagaaa tggaacgctc ttcggatttt cggtatctac
      721 tgagattaag gtcttaatgg gaatattagc ttaagaacaa gaaggattat aaaccttgta
      781 tttgattttt aaactttgca acagaacc
//
LOCUS       RSSTRXA       464 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  R.sphaeroides Y thioredoxin (trxA) gene, complete cds.
ACCESSION   M33806
KEYWORDS    thioredoxin Y.
SOURCE      R.spaeroides DNA, clone pUTC3.
  ORGANISM  Rhodobacter sphaeroides
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 464)
  AUTHORS   Pille,S., Chuat,J.-C., Breton,A.M., Clement-Metral,J.D. and
            Galibert,F.
  TITLE     Cloning, nucleotide sequence, and expression of the Rhodobacter
            sphaeroides Y thioredoxin gene
  JOURNAL   J. Bacteriol. 172, 1556-1561 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         57      377     thioredoxin (trxA)
BASE COUNT       84 a    147 c    153 g     80 t
ORIGIN      
        1 ccggcagggg gccatcctta ctttccaagc aagcctacgg atcaggagaa ttgaatatgt
       61 ccaccgttcc cgtgacggac gccaccttcg acaccgaggt gcgcaagtcc gacgtgcccg
      121 tcgtcgtcga tttctgggcc gaatggtgcg gcccctgccg gcagatcggc ccggcgctcg
      181 aggagctctc gaaggaatat gccggcaagg tgaagatcgt gaaggtcaat gtcgacgaga
      241 accccgagag cccggcgatg ctgggcgttc gcggcatccc ggcgctgttc ctgttcaaga
      301 acggtcaggt cgtgtcgaac aaggtcggcg ctgcgccgaa ggccgcgctg gccacctgga
      361 tcgcttcggc gctctgagat cctctgccat agcagagacc ggacggggcg ccttcgggcg
      421 cccttttcct tgagcaagcg accgaccgcg acgggcgagg atcc
//
LOCUS       PSECATRA     1070 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  P.putida catR regulatory protein gene, complete cds, and catBC
            gene, 5' end..
ACCESSION   M33817
KEYWORDS    regulatory protein.
SOURCE      P.putida DNA.
  ORGANISM  Pseudomonas putida
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1070)
  AUTHORS   Rothmel,R.K., Aldrich,T.L., Houghton,J.E., Coco,W.M.,
            Ornston,L.N. and Chakrabarty,A.M.
  TITLE     Nucleotide sequencing and characterization of Pseudomonas putida
            catR: a positive regulator of the catBC operon is a member of the
            lysR family
  JOURNAL   J. Bacteriol. 172, 922-931 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        135     1004     catR regulatory protein
    pept          3  <     1 (c) cis,cis-muconate-lactonizing enzyme I (catB)
                                 (EC 5.3.3.4)
    mRNA         61  <     1 (c) catBC mRNA
BASE COUNT      199 a    355 c    329 g    187 t
ORIGIN      
        1 cattgttgcc aggtccgttg tttcgggctg ccagccgcgg gcccagtggt gatttacttg
       61 ttgttcaagg attgcgcgag accctgatag ccgtccaata tcgaatgaat ctcccaccat
      121 accctggagg tctgatggag ctgcgccact tgcgttactt caaggtcctg gccgaaaccc
      181 tgaacttcac tcgcgccgcc gagctgctgc acattgccca gccgccgctg agccggcaga
      241 tcagccaact cgaggaccag ctcggtacct tgctggtagt gcgcgagcgc ccgctgcggc
      301 tgacagaggc gggtcgcttc ttctacgaac agagctgcac cgtgctgcag ctgcagaaca
      361 tcagcgacaa cacccgtcgc attggccagg gccagcgcca gtggctgggg atcggcttcg
      421 ccccgtcgac cctgtacaag gtgctgccgg agctgatccg cgagctgcgc caggacagcg
      481 agctggaact gggcctcaac gagatgacca cgctgcagca ggtggaggcg ctgaaaagcg
      541 ggcgcatcga catcgccttc gggcgcattc gcatcgatga cccggcgatt caccaacaag
      601 tgctgtgcga ggacccgctg gtggcggtat tgcccaagga tcacccgctg gcgagcagcc
      661 cgcttacact tgcccagctg gctggcgagg cgttcatcct ctacccggcc aacccgcggc
      721 ccagctatgc cgaccatgtg ctggcactgt tcgcccacca cggcatgagc atccacgtca
      781 gccaatgggc caacgaactg cagaccgcca tcggcctggt ggccgtcggc gtgggcgtga
      841 ccctggtgcc ggcgtcggtg caacagcagc accgcaccga tatcgaatat gtaagcctgc
      901 tcgacagcgg cgccgtcagc ccgatcatcc tcagccggcg caagggcgat gtgagcccga
      961 tcgtgcagcg ctgcctgaca ttgattgcac aacaggccga atgagggcgc gctcctggaa
     1021 tcaacggtct ggctaaacac ctgccccctg tgggagcggg ttcacccgcg
//
LOCUS       HUMMHDQ1A    1515 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human MHC class II HLA-DQ-alpha gene, regulatory region.
ACCESSION   M33765
KEYWORDS    cell surface antigen; cell surface glycoprotein; class II gene;
            major histocompatibility complex.
SOURCE      Human (DR4, DR4) DNA, clone H11A.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1515)
  AUTHORS   Ombra,M.N., Del Pozzo,G., Perfetto,C. and Maffe,A.
  TITLE     Effect of the AIR-I locus on the activation of an enhancerless
            HLA-DQA1
  JOURNAL   Immunogenetics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Guardiola, 18-APR-1990.
BASE COUNT      441 a    322 c    340 g    412 t
ORIGIN      Chromosome 6p21.3.
        1 atataactga tgacctcatc tatccatgag gctgctcacg aaatgccctc ccctggccag
       61 gcacggtggc tcatgcctgt aatccaagca ctttgggagg ctgaggcagg cagatcatga
      121 ggtcaggagt tcaagaccag cgtggccaac atagtgaaac tctgtctcta ctaaaaatac
      181 agaattagcc gggcatggtg gcagcgctgc agtcaccagc tactcgggag ggtgaggcag
      241 gagaatcgct tgaacctggg aggcaaaggt tgcagtgagc cgagaccatg ccattgtacc
      301 tcagcctggg tgacagagtg agactacgtc tcgaaaagaa aagaaaaaga aaagaaaaag
      361 aaaaaaaaag aaaaatgccc atccctcttg cgaatggcag acatgcacac accagagaag
      421 attccaattt agtgtcctcc ctctgttcat agaacaattc ctcaagtcca ctctgagtag
      481 aggctgcatc acaacaaggg gattgccctg tctccttcca gggctcttaa tacaaactct
      541 tcagctagta actgagatgt caccatgggg gatttttcta attggccaaa acctgacctg
      601 gcagggtttg gtttgggtgt cttcagattt ccttgtcttg aggccctcac aattgctcta
      661 cagctcagaa cagcaactgc tgaggctgcc ttgggaagag gatgatccta aacaaagctc
      721 tggatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg tgaagacatt
      781 gtgggtgagt gcatgagtga ggaaatgttc tctggagctg aaaaacagta aattgaagga
      841 aaagagagaa agcgatttgc agagaaattg tagagatttc ctaagaccct ttcactatta
      901 agagaaatta aaaattatag ctgttcctcc ttcaggaaac cagagaccca atccactctt
      961 tttgttatgt atgcttttgt gttcactaag gatgctattc tgtttatatt atattcagtg
     1021 acaacagcct ggaggtctct atgtcgttcc gtcatgattg cctcaaaaat tagtgaagtt
     1081 tccatcagtg gaataatttt ttattattaa aaatgtatga agtgtcattc tcaaatttcc
     1141 ctgaacaact tttgaagctt ttcgtatgtc tcctgtagta gatcttgggg tcgttccatc
     1201 aattatatac tctatagata ttaaaaaagt tgcccgtttc tttctctcag acttactcac
     1261 atttccacat gggaactggc acaggtgggg agtgggtaaa ggagtccagc aggctgaatg
     1321 ccttcaacaa tcattttacc acatggtcct cacttactct cagctgcctc atatgtgtca
     1381 cctcacaaaa taatcaaaat aaaaatgggc atgtagctaa gctttgtaaa tagtgaaaac
     1441 atggaatgtc aattgttttt acatatttct attacaggta tagcttcaca tttttcttta
     1501 gcaaaataag ggatc
//
LOCUS       FSOSTI35X    1856 bp ds-DNA             PLN       28-JUN-1990
DEFINITION  F.solani STI35 protein gene, complete cds.
ACCESSION   M33642
KEYWORDS    STI31; heat-shock gene.
SOURCE      F.solani phaseoli mycelial ATCC 60860 isolate DNA, clone cKAES018.
  ORGANISM  Fusarium solani
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE   1  (bases 1 to 1856)
  AUTHORS   Choi,G.H., Marek,E.T., Schardl,C.L., Richey,M.G., Chang,S. and
            Smith,D.A.
  TITLE     STI35, a stress-responsive gene in Fusarium spp
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.L.Schardl, 08-APR-1990.
FEATURES       from  to/span     description
    pept        564     1443     STI35 protein, exon 1
               1492     1586     STI35 protein, exon 2
    IVS        1444     1491     STI35 intron A
BASE COUNT      401 a    585 c    445 g    425 t
ORIGIN      
        1 acacactctc tgttgacgca tcctcagact cggaaataca tttcgttggc cgatatcgcg
       61 ggttactcca tcatgatatc agccatccat tacatcgtta cccctcatct gcggtataaa
      121 agccagccat tgtctcctgc tgcttgatcc attctctctt cttcttcatc aactcatcaa
      181 caacttgaac caaaaccctt atcaagactc aaggtttgta ctcgaggtga gaggtgatga
      241 actggtctga cacctgtgac atgctagata tcatcacttc cggtacggat cctccgtcga
      301 ggaatctgca gccatggagt acctttttgc gggtagttgt ccgaacctga ggttggagcc
      361 attcaggtaa agcatcatgc atgagccggt gcccgttctt cccttcttgg ccattgtctg
      421 tactcctgtg tacatcaatg tcaagaagag gagagcgtgg ctgagattat acggcgaaac
      481 ttgatctgga taataccagc gaaaggatca tgctctcctc cccccattga ccccaatccc
      541 ctaacacaac cccaggccca acaatgtctc ctcccgctgc cgtctccccc cctgcccgct
      601 ctgctgagct cgcctccgcc cccgccgtca agctccccgt cggcctgagc aagaacagcg
      661 ctgctgccac caccgtcgag gagatggagg gcaagtggga cgacttcaag tttgccccca
      721 tccgcgagag ccaggtctcc cgggccatga cccgccgcta cttccaggac ctcgacaact
      781 acgccgagtc cgacattgtc attgtcggcg cgggatcctg cggtcttagc acccgctaca
      841 tcctcggcaa gaagaggccc gacctcaaga tcgccatcat cgaggcttct gtgtctcctg
      901 gtggtggtgc ctggctcggt ggccagctct tctcggccat ggtgatgcgc aagcccgccg
      961 atgccttcct ccgcgaagtc ggcgtccctt acgaggacga gggcaactat gtcgtcgtca
     1021 agcacgctgc cctcttcacc tcgaccatca tgtccaaggt gctccagctc cccaactgca
     1081 agctcttcaa cgccacctgc gtcgaggacc tcatcacccg accttccaag gagggagtcc
     1141 gcatctctgg tgtcgtcacc aactggaccc tcgtgtcgat gcaccacgac gaccagtcct
     1201 gcatggaccc caacaccatc aacgcgcccc tcgtcatctc caccaccggc cacgacgccc
     1261 ccatgggtgc cttttgcgtc aagcgtctcg tcagcatggg acgcatcgag aagctcggcg
     1321 gcatgcgtgg tctcgacatg aacgtggccg aggatgccat cgtcaaggga acccgtgaga
     1381 ttgtccctgg tctgattgtc ggaggaatgg agctttccga ggttgacgga gccaaccgaa
     1441 tgggtgagtt tctcgacacg agtatcaata actggttgct aactcgacta ggccctactt
     1501 tcggtgccat ggttctgagc ggtctcaagg ctgccgagga ggccctcaag gtcattgaca
     1561 tccgccagaa gcagaactcg ttctaagcga gttgacgtac aagaagaaca tggactggga
     1621 aacacgatga ttcaggaaac atgaattcag gagccaggac ctcgtctctc tttcgaggca
     1681 tggatccttg tcgctcatta ggccctttca gagaatgaaa ttggatatca aaactgctga
     1741 attgctcata ctttctcgat ataaaacttg acgtacaatg tgcaactcca agaactttac
     1801 tcatcctcca tctccttccc ctttggcttc tcgggcggaa acttcatccc cccgtt
//
LOCUS       FSOSTI35Y    1177 bp ss-mRNA            PLN       28-JUN-1990
DEFINITION  F.oxysporum STI35 protein mRNA, complete cds.
ACCESSION   M33643
KEYWORDS    STI35 protein.
SOURCE      F.oxysporum cucumerinum (isolate B1-GK) mycelial culture, cDNA to
            mRNA, clone pKAE5030.
  ORGANISM  Fusarium oxysporum
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE   1  (bases 1 to 1177)
  AUTHORS   Choi,G.H., Marek,E.T., Schardl,C.L., Richey,M.G., Chang,S. and
            Smith,D.A.
  TITLE     sti35, a stress-responsive gene in Fusarium spp
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.L.Schardl, 08-APR-1990.
FEATURES       from  to/span     description
    pept         29      991     STI35 protein
    mRNA    <     1     1145     STI35 mRNA (alt.)
    mRNA    <     1     1177     STI35 mRNA (alt.)
BASE COUNT      264 a    328 c    291 g    294 t
ORIGIN      
        1 aagatatcat aacctgccca agatcaaaat ggcccctccc gctgctgtct cccctccctc
       61 tcgctctgca gagcttgcta catctaccaa gctcccagtc atgagcaaga acatcaacac
      121 caagaccgtt gaggagatgc tcggtcagtg ggatgatttc aagtttgctc ccatccgcga
      181 gagccaggtc tcccgcgcca tgactcgccg ctacttccag gaccttgata actatgccga
      241 gtctgacatt gttatcatcg gtgctggttc gtgcggtctg agcgccgcgt acattcttgg
      301 caagaagcgt cctgatctca agatcgccat cattgaggct tctgtgtctc ctggtggtgg
      361 tgcttggctt ggtggacagc tcttctccgc catgatcatg cgcaagcctg ctgatgcttt
      421 cctccgcgag gtcggtgttc cttatgaaga tgagggtaac tacgtcgttg tcaagcacgc
      481 cgctctcttc acctcgacca tcatgtccaa ggttcttcag atgcccaaca tcaagctctt
      541 caatgccact tgtgttgagg atctcatcac ccgaccttcc gaggagggag tccgcattgc
      601 cggtgttgtc accaactgga ctcttgtttc catgcaccac gatgaccagt cttgcatgga
      661 ccccaacact attaacgctc ctttgatcat ctccaccacc ggccacgatg gcccgatggg
      721 agctttctgt gtcaagcgcc ttgtcagcat gcagcgcatt gagaagctcg gtggtatgcg
      781 tggtcttgac atgaacctcg ccgaggatgc cattgtcaag ggaacccgtg agattgttcc
      841 tggtcttatt gttggtggaa tggaactttc cgaggttgac ggtgccaacc gcatgggtcc
      901 taccttcggt gccatggctc tcagtggtct caaggctgcc gaggaagctc tcaagatctt
      961 cgacactcgc aagaagcaga acgatctgta agcgagatca tcaaagaagg tcatagttgg
     1021 gaaacggaac atgtgaattc aggatcctgg acctcgtctc tcattcgagg catggaacct
     1081 tgtcattcat ttagtctgag cgactcaagt ttggaaaaca tgaacccaaa gtcggatgcc
     1141 actcgaaaga gacgccaaat aaagcaatac taaattg
//
LOCUS       STMGLUSYN    1208 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  S.hygroscopicus glutamine synthetase (GLNB) gene, complete cds.
ACCESSION   M33783
KEYWORDS    glutamine synthetase.
SOURCE      S.hygroscopicus (strain ATCC 21705) DNA.
  ORGANISM  Streptomyces hygroscopicus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 1208)
  AUTHORS   Kumada,Y., Takano,E., Nagaoka,K. and Thompson,C.J.
  TITLE     Streptomyces hygroscopicus has two glutamine synthetase genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.J.Thompson, 20-APR-1990.
            
              Author address: C.J.Thompson
              Institut Pasteur
              28 Rue du Dr. Roux
              75015 Paris FRANCE
              Tel: (33) 145688842
FEATURES       from  to/span     description
    pept         22     1158     glutamine synthetase (GLNB) precursor
    sigp         22       84     glutamine synthetase signal peptide
    matp         85     1155     glutamine synthetase
BASE COUNT      198 a    432 c    418 g    160 t
ORIGIN      
        1 gaattccttg gaaacggctg gatgtccggc tcggtaacct gtggttcaca aacgggcaac
       61 ggacgagaaa tggcgcgttg cgagagtgcg gaggtacgcc cgcaccgtac ccgcaccgac
      121 cgtgaagagg accccgtgag catcaaggcc gagtacatct ggatcgacgg cacgcagccg
      181 accgccaagc tccgctccaa gaccaagatc ctgtccgacg gcagccggct tccgcggtgg
      241 ggcttcgacg gttccagcac caaccaggcc gaaggccacg cctcggacct cgtactggag
      301 ccggtgttca gctgcccgga cccgatccgc ggcggcgacc acctgctggt gctgtgcgag
      361 gtgctgcaca ccgacctcac cccgcacccc tccaacaccc gggcgctgct gcgcccggtc
      421 gcggagcggt tcgccggcca ggagccgatc ttcggcatcg agcaggagta caccttcctc
      481 aagggcgacc gcccgctcgg cttccccgag ggcggcggct acccggcccc gcaggccgac
      541 tactactgcg gcgtgggcgc cgacgcgatc ttcggccggg agatcgtcga gaagcacctc
      601 gacctgtgcc tggcggccgg tctgggcctg tccggcatca acgccgaggt catgcccggc
      661 cagtgggagt tccaggtcgg cgcgctgccg ccgctggagg tctcggacca catgtgggtg
      721 gcgcgctggc tgctgcaccg ggtggcggag gagttcggcg tcaccgcgtc gctggacgcc
      781 aagccggcca agggcgactg gaacggcgcg ggcgcgcaca ccaacttctc cacccgcgcg
      841 atgcgcgagg gctacgaccc gatcatcacc gcctgcgagg cgctgggcca ggacgacaag
      901 ccgctggagc acgtccgcca gtacggcacc ggcatcgagg accggctgac cggcgcgcac
      961 gagaccgccc cctgggacgc gtactcctac ggcgcctccg accgcggcgc ctcggtgcgc
     1021 atcccctggc aggtcgaggt cgagaagaag ggctacatcg aggaccggcg cccgaacgcc
     1081 aacgtcgacc cgtacgtggt cacccggctg atggtggaca cctgctgcac ggagctggcg
     1141 cggcgcgagc agatctgacg cctgcggtca cgtggccgtg cggtcacgtg accgcacggt
     1201 cgcgcgag
//
LOCUS       BLLEBG       2323 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  B.lautus endo-beta-1,4-glucanase (celB) gene, complete cds.
ACCESSION   M33762
KEYWORDS    endo-beta-1,4-glucanase.
SOURCE      B.lautus (strain PL236) DNA.
  ORGANISM  Bacillus lautus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2323)
  AUTHORS   Joergensen,P.L. and Hansen,C.K.
  TITLE     Multiple endo-beta-1,4-glucanases from Bacillus lautus PL236 and
            characterization of the celB gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.K.Hansen, 18-APR-1990.
            
               Author address: C.K.Hansen
               Institut Pasteur
               24 rue du Dr. Roux
               75724
               Paris Cedex 15
               FRANCE
               Email: DEDONDER%PASTEUR.BITNET@CUNYVM.CUNY.EDU
            
FEATURES       from  to/span     description
    pept        172     1872     endo-beta-1,4-glucanase precursor (celB) (E.C.
                                 3.2.1.4)
    sigp        172      262     endo-beta-1,4-glucanase signal peptide
    matp        263     1869     endo-beta-1,4-glucanase
    signal       46       51     -35 region
    signal       70       75     -10 region
    binding     158      164     ribosome binding site
    signal     1956     1981     transcription terminator
BASE COUNT      644 a    508 c    563 g    608 t
ORIGIN      
        1 ggggttttgt tcatttcaag aaagagcacg aataaacatc gctaatttac acatgattta
       61 ttctctaatt attatggtgc atgccaattg gaaaatgtag tagattagtc atcgtaacct
      121 gctttcatgc caagatatgt aatttttaaa aaagagtaaa ggagaattca gatgaaaaaa
      181 cgtagaagca gtaaagttat tctttcgttg gccatcgttg ttgcattatt ggcagccgtc
      241 gaacctaatg ccgctttagc agcggctcca ccaagtgcca tgcagtccta tgttgaagcg
      301 atgcagcctg gctggaacct tggcaattct ctggatgctg tcggtgcgga tgagacgctg
      361 gcacggggca atccgcggat cacgaaagag ctcattcaga acatcgctgc gcaaggctat
      421 aagagcatac ggattcctgt tacctgggat tcccatatcg gcgcggcccc aaattatcaa
      481 attgaagctg cgtacctcaa tcgagtgcag gaggtcgtac agtgggcttt ggacgcgaac
      541 ctctatgtga tgattaatgt ccatcatgat tcctggctat ggatcagcaa aatggagtcg
      601 cagcacgatc aagtactggc ccgttataat gcgatttgga cgcaaattgc caacaagttc
      661 aagaacagcc cgagcaagct gatgttcgag agcgtgaatg agcctcgctt tacggatggc
      721 ggaactacgg atgaagccaa gcagcaaaaa atgctggacg agctgaacgt atcctttttc
      781 aacatcgtca gaaattccgg cggccagaac gcgactcgcc cgctagttct ttctacgttg
      841 gaggcctctc ccacccaaga gagaatgacg gcgctttata atacgatgac caaactgaac
      901 gacaagaatc tgatcgcaac cgttcatttt tatggattct ggccgtttag cgtaaatatc
      961 gcaggatata cgaaatttga tgcggagacg caaaatgata ttataacgac cttcgataac
     1021 gtgtataaca catttgtagc aaagggaatc ccggtggtag tcggcgaata tggccttctt
     1081 ggattcgata agaataccgg cgtcattgaa cagggtgaga aattgaaatt tttcgagttt
     1141 tttgcccagt atgtgaagca aaaaagcatt tccactatgc tatgggataa cggacagcac
     1201 ttcaaccgca cgagcttcaa gtggtctgac ccggatttat tcaatatgat caaggccagt
     1261 tggaccggac gttcatccac ggcttccagc gacctgatcc atgtcaagca gggcacggcg
     1321 gtaaaagata cttcggttca gctcaatctt aacgggaata cgctaacttc cctttccgta
     1381 aatggaacga cactgaaatc aggcacagat tacactttaa acagcagcag attaactttt
     1441 aaagcgagcc agttgaccaa gctgacctcc ttgggcaaat tgggggtcaa cgcgacgatc
     1501 gtgactaaat tcaatagagg cgccgactgg aagttcaacg tagtcctgta caatacgcct
     1561 aagcttagca gtacgacggg gactacttct tcctttgcga ttccaacggc tttcaacggg
     1621 gatcagcttg ctacgatgga agcggtctat gtaaacggcg gcaatgccgg tccgcataac
     1681 tggacttcct ttaaggaatt cgaaacgacg ttcagccccg cttatagcga ggggaaaatc
     1741 aaactgcagc aggcgttctt taatgaagtg aatgatacca cagtcacgct caagttccaa
     1801 ttctggagcg gggagatcgt caactacacg attaaaaaga gcggttcgac ggtgacgggt
     1861 acggcttcat aagcgagttt ggcaaaaaag gaccgatata ctgcctaatt tggtattgcc
     1921 ttagttgaaa gcaattgctc cgaataaaca gaatgaagcc ccggccagct ggccgggact
     1981 tatgcgttta ggaagtataa acgaatcatc agcaatttat ttagctcgtc tcagttcagc
     2041 aatatcggct tcatgtgaaa cggagcggat gaacaatctt tcgagcaatt tctcatgctc
     2101 ctgctgggtt tggagaacgg tttgctgatt agttttaagt acagatatat cctcacggac
     2161 ttgattgatt catgtggtcc gttagttctt ctacctttgt atttgtggca gcaacgatat
     2221 gaattaattg ttgaatgtgc ccgccatgac tgtttagctg ctcattgtgg ctttgtaact
     2281 gttctcggat ttctttgaat tcttggtcgt gctcattaag ctt
//
LOCUS       HLFRRD1       394 bp ss-rRNA            BCT       28-JUN-1990
DEFINITION  Haloferax 16S ribosomal RNA, partial sequence.
ACCESSION   M33803
KEYWORDS    16S ribosomal RNA.
SEGMENT     1 of 3
SOURCE      Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA.
  ORGANISM  Haloferax sp.
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 394)
  AUTHORS   Holmes,M.L. and Dyall-Smith,M.L.
  TITLE     A plasmid vector with a selectable marker for halophilic
            archaebacteria
  JOURNAL   J. Bacteriol. 172, 756-761 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 394)
  AUTHORS   Holmes,M.L. and Dyall-Smith,M.L.
  TITLE     A plasmid vector with a selectable marker for halophilic
            archaebacteria
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by M.L.Dyall-Smith, 19-JUN-1990.
FEATURES       from  to/span     description
    rRNA    <     1  >   394     16S rRNA
    revision    249      249     c in [1]; g in [2]
BASE COUNT      102 a    101 c    126 g     65 t
ORIGIN      
        1 gaaaagctca gtaacacgtg gccaaactac cctacagaga acgataacct cgggaaactg
       61 aggctaatag ttcatacggg agtcatgctg gaatgccgac tccccgaaac gctcaggcgc
      121 tgtaggatgt ggctgcggcc gattaggtag acggtggggt aacggcccac cgtgccgata
      181 atcggtacgg gttgtgagag caagagcccg gagacggaat ctgagacaag attccgggcc
      241 ctacggggcg cagcaggcgc gaaaccttta cactgcacgc aagtgcgata aggggacccc
      301 aagtgcgagg gcatatagtc ctcgcttttc acgactgtaa ggcggtcgtg gaataagagc
      361 tgggcaagac cggtgccagc cgccgcggta atac
//
LOCUS       HLFRRD2       254 bp ss-rRNA            BCT       28-JUN-1990
DEFINITION  Haloferax 16S ribosomal RNA, partial sequence.
ACCESSION   M33804
KEYWORDS    16S ribosomal RNA.
SEGMENT     2 of 3
SOURCE      Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA.
  ORGANISM  Haloferax sp.
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 254)
  AUTHORS   Holmes,M.L. and Dyall-Smith,M.L.
  TITLE     A plasmid vector with a selectable marker for halophilic
            archaebacteria
  JOURNAL   J. Bacteriol. 172, 756-761 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >   254     16S rRNA
BASE COUNT       67 a     59 c     85 g     43 t
ORIGIN      About 140 bp after segment 1.
        1 tacgtccggg gtaggagtga aatcccgtaa tcctggacgg accaccgatg gcgaaagcac
       61 ctcgagaaga cggatccgac ggtgagggac gaaagctagg gtctcgaacc ggattagata
      121 cccgggtagt cctagctgta aacgatgctc gctaggtgtg acacaggcta cgagcctgtg
      181 ttgtgccgta gggaagccga gaagcgagcc gcctgggaag tacgtccgca aggatgaaac
      241 ttaaaggaat tggc
//
LOCUS       HLFRRD3       539 bp ss-rRNA            BCT       28-JUN-1990
DEFINITION  Haloferax 16S ribosomal RNA, partial sequence.
ACCESSION   M33805
KEYWORDS    16S ribosomal RNA.
SEGMENT     3 of 3
SOURCE      Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA.
  ORGANISM  Haloferax sp.
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 539)
  AUTHORS   Holmes,M.L. and Dyall-Smith,M.L.
  TITLE     A plasmid vector with a selectable marker for halophilic
            archaebacteria
  JOURNAL   J. Bacteriol. 172, 756-761 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >   539     16S rRNA
BASE COUNT      132 a    140 c    159 g    108 t
ORIGIN      About 45 bp after segment 2.
        1 ctcaacgccg gacatctcac cagctccgac tacagtgatg acgatcaggt tgatgacctt
       61 atcacgacgc tgtagagagg aggtgcatgg ccgccgtcag ctcgtaccgt gaggcgtcct
      121 gttaagtcag gcaacgagcg agacccgcac ttctaattgc cagcagcagt ttcgactggc
      181 tgggtacatt agaaggactg ccgctgctaa agcggaggaa ggaacgggca acggtaggtc
      241 agtatgcccc gaatgagctg ggctacacgc gggctacaat ggtcgagaca atgggttgct
      301 atctcgaaag agaacgctaa tctcctaaac tcgatcgtag ttcggattga gggctgaaac
      361 tcgccctcat gaagctggat tcggtagtaa tcgcatttca atagagtgcg gtgaatacgt
      421 ccctgctcct tgcacacacc gcccgtcaaa gcacccgagt gaggtccgga tgaggccacc
      481 acacggtggt cgaatctggc ttcgcaaggg ggcttaagtc gtaacaaggt agccgtagg
//
LOCUS       ZMOADHA      1365 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  Z.mobilis alcohol dehydrogenase I (adhA) gene, complete cds.
ACCESSION   M32100
KEYWORDS    alcohol dehydrogenase I.
SOURCE      Z.mobilis (strain CP4) DNA.
  ORGANISM  Zymomonas mobilis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods.
REFERENCE   1  (bases 1 to 1365)
  AUTHORS   Keshav,K.F., Yomano,L.P., An,H. and Ingram,L.O.
  TITLE     Cloning of the Zymomonas mobilis structural gene encoding alcohol
            dehydrogenase I (adhA): Sequence comparison and expression in
            Escherichia coli
  JOURNAL   J. Bacteriol. 172, 2491-2497 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            K. Keshav, 14-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        218     1231     alcohol dehydrogenase I (adhA) (EC 1.1.1.1)
    mRNA        108  >  1365     adhA mRNA (alt.)
    mRNA        137  >  1365     adhA mRNA (alt.)
    site       1261     1289     adhA transcription termination signal
BASE COUNT      370 a    288 c    340 g    367 t
ORIGIN      
        1 tttttgggcg tatttttatc aagaaaattt aaataatcac atttttatta ttttagattt
       61 aagtattgat acaagtgata tctataaatg tttttataac tttctggatc gtaatcggct
      121 ggcaatcgtt ttccctatat tcgcaagatg tatgtcagcc gcagattttt cgactgacct
      181 ctatctctcc gagatatatc aacaaaaggt agtcaccatg aaagcagccg tcataactaa
      241 agatcatacg atcgaagtga aagacaccaa attacgccct ctgaaatacg gggaagcgct
      301 tttggaaatg gaatattgcg gggtatgtca taccgatctc catgtgaaaa acggggattt
      361 cggcgatgaa accggcagaa ttaccggcca tgaaggtatc ggtatcgtca agcaggtcgg
      421 ggaaggggtt acttctctga aagccggtga ccgcgccagt gttgcatggt tcttcaaagg
      481 ctgcggccat tgcgaatatt gtgtcagcgg gaatgaaacg ctttgccgca acgttgaaaa
      541 tgccggttat acggttgacg gcgctatggc agaagaatgc atcgtcgttg ccgattactc
      601 ggtcaaggtg ccagatggtc ttgatcctgc ggttgccagc agcatcactt gcgcgggtgt
      661 aaccacctat aaagcagtca aagtttctca gatacagccg ggacaatggc tggccatcta
      721 tggcttgggc ggtttaggca atctagccct tcaatatgcc aagaatgttt tcaacgccaa
      781 agtgatcgcg atcgatgtca atgatgaaca gctcgctttt gccaaagagt tgggcgcaga
      841 tatggtcatc aatccgaaaa acgaagatgc tgccaaaatc attcaggaaa aagtcggcgg
      901 cgcacatgcg acggtggtga cggctgttgc caaatccgcc tttaactcgg ctgttgaggc
      961 tatccgcgcg ggtggccgtg ttgtcgccgt tggtctgcct cctgaaaaaa tggatttgag
     1021 cattcctcgt ttggtgcttg acggtatcga agtcctaggt tccttggtcg gaacgcggga
     1081 agatttgaaa gaggccttcc agtttgcagc cgaaggtaag gtcaagccga aagttaccaa
     1141 gcgtaaagtc gaagaaatca accaaatctt tgacgaaatg gaacatggta aattcacagg
     1201 ccgtatggtt gttgatttta cccatcacta ggttttcgtg aaggcggaag cataaacgga
     1261 aaaagccttt ctcttaccag aaaggctttt tctttgtcgt ctgataaaaa ttttcataca
     1321 gaattttaac acggcactcg ttgctataag ccgctatcca agctt
//
LOCUS       MUSIGKABG     349 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V32 region, 5' end.
ACCESSION   M33992
KEYWORDS    immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain BXSB) adult spleen, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   D'Hoostelaere,L.A. and Klinman,D.
  TITLE     Characterization of new mouse Vk groups
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.D'Hoostelaere, 04-MAY-1990.
            
            The frame shift between 178 and 184 could occur anywhere within
            those boundaries.
            
               Author address: L.A.D'Hoostelaere
               Grenzacherstrasse 487
               Postfach 4005
               Basel, Switzerland
               E-mail: IN%BASEL@UTOROCI.bitnet
FEATURES       from  to/span     description
    pept          1      178     Ig kappa-chain V32 region precursor
                180  >   349     Ig kappa-chain V32 region precursor
    sigp          1       60     Ig kappa-chain V-region signal peptide
    matp         61  >   349     Ig kappa-chain V-region
    site        178      184     frame shift
BASE COUNT       99 a     84 c     76 g     90 t
ORIGIN      
        1 atgagggtcc ttgctgagct cctggggctg ctgctgttct gctttttagg tgtgagatgt
       61 gacatccaga tgaaccagtc tccatccagt ctgtctgcat cccttggaga cacaataacc
      121 atcacttgcc atgccagtca gaaaattaat gtttggttaa gctggtacca gcagaaaaaa
      181 aaaaggaaat attcctaaac tattgatcta taggacttcc aacttgcaca caggcgtccc
      241 atcaaggttc agtggcagtg gatcaggaac aggtttcaca ttaaccatca gcagcctgca
      301 gcctgaagac attgccactt actactgtca acagggtcaa aattatccg
//
LOCUS       MUSIGKABH     349 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V33 region, 5' end.
ACCESSION   M33993
KEYWORDS    immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain BXSB) adult spleen, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   D'Hoostelaere,L.A. and Klinman,D.
  TITLE     Characterization of new mouse Vk groups
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.D'Hoostelaere 04-MAY-1990.
            
            The frame shift between 178 and 184 could occur anywhere within
            those boundaries.
            
               Author address: L.A.D'Hoostelaere
               Grenzacherstrasse 487
               Postfach 4005
               Basel, Switzerland
               E-mail: IN%BASEL@UTOROCI.bitnet
FEATURES       from  to/span     description
    pept          1      178     Ig kappa-chain V33-region precursor
                180  >   348     Ig kappa-chain V33-region precursor
    sigp          1       60     Ig kappa-chain V-region signal peptide
    matp         61  >   348     Ig kappa-chain V-region
BASE COUNT       99 a     84 c     76 g     90 t
ORIGIN      
        1 atgagggtcc ttgctgagct cctggggctg ctgctgttct gctttttagg tgtgagatgt
       61 gacatccaga tgaaccagtc tccatccagt ctgtctgcat cccttggaga cacaataacc
      121 atcacttgcc atgccagtca gaaaattaat gtttggttaa gctggtacca gcagaaaaaa
      181 aaaaggaaat attcctaaac tattgatcta taggacttcc aacttgcaca caggcgtccc
      241 atcaaggttc agtggcagtg gatcaggaac aggtttcaca ttaaccatca gcagcctgca
      301 gcctgaagac attgccactt actactgtca acagggtcaa aattatccg
//
LOCUS       HUMCAIX      2785 bp ss-mRNA            PRI       28-JUN-1990
DEFINITION  Human carbonic anhydrase I (CAI) mRNA, complete cds.
ACCESSION   M33987
KEYWORDS    carbonic anhydrase I.
SOURCE      Human EBV transformed SH B cell line DNA, and cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2785)
  AUTHORS   Lowe,N., Brady,H.J.M., Barlow,J.H., Sowden,J.C., Edwards,M. and
            Butterworth,P.H.W.
  TITLE     Structure and methylation patterns of the gene encoding human
            carbonic anhydrase I
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Since no intron sequences were provided this entry is treated as if
            originating from an mRNA.
            
            Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Lowe, 09-MAY-1990.
            
               Author address: N.Lowe
               Department of Biochemistry
               University College London
               Gower Street, London
               WC1E 6BT, U.K.
               E-mail:UCBCMAR%EUCLID.UCL.AC.UK@CUNYVM.CUNY.EDU
FEATURES       from  to/span     description
    pept       1048     1833     carbonic anhydrase I (EC 4.2.1.1)
                                 /hgml_locus_uid="LS0141U" /nomgen="CA1"
                                 /map="8q13-q22"
    mRNA        902     1939     carbonic anhydrase I mRNA (alt.)
    mRNA        902     2165     carbonic anhydrase I mRNA (alt.)
    signal      874      879     TATA box
BASE COUNT      840 a    603 c    492 g    850 t
ORIGIN      
        1 ctttagccca acagtcaaaa ataattgatg ctaccctaca aatgtccaaa actctagtat
       61 atcatatttc taagttacag caaatattag tcctgctaaa ccagggagct ttggcaaaaa
      121 tgttttttga cagtaaattt gtccttgatt atatattaac tagtcaaaga ggtgtttgta
      181 acattattag agcttcttgt tgtaggtggg ttaacaccac caatcaagag gtcattctaa
      241 cagaaagcct ggatcagaaa accatcaccc taaaaaaaca tgccttacat atttaacaca
      301 ctctgaaatc cagtcaaaat atgactaaag gcccttgcca tgactgatgt attctcctgg
      361 ccaacgccaa acaaatggga gcctggttac gagtcagcct tcagggactt gtcacatttc
      421 tacttggttt cttccttgtt attgtcataa taaaatgttt tctatgctgt ttagtgcaac
      481 ttaggcccta ttctgtagaa gtctcctcta ctattcaggc cactcaaaca ccccaaataa
      541 ttgagttcaa aatcgacatc aagatataaa ggaatcagtg actaaatata tttcatatat
      601 ggtattttta ttgattattg tgctgtcttg acctagtatg gaggccttgg ctagaggctg
      661 gtcagtttcc tctcttgagc agctgattaa atccacaccc caaccacttc ccttatcagg
      721 ttctcacact ctggggccac tatgtaccca ctctaatcac cacagggcca gacatcagac
      781 aattaaggac agcgcccatg ccccaaagcc cgccaaaatt atgcaaatta ttcaaaatta
      841 ttcaacctag ctaaccccac cctttttgct gtacataagc tgcccattcc ccctccagcc
      901 tgtggtaccc agtcctcagg tgcaaccccc tgcgtggtcc tctgtggcag ccttctctca
      961 ttcagagctg ttttccacag aggtagtgaa aagaactgga ttttcaagtt cactttgcaa
     1021 gagaaaaaga aaactcagta gaagataatg gcaagtccag actggggata tgatgacaaa
     1081 aatggtcctg aacaatggag caagctgtat cccattgcca atggaaataa ccaatcccct
     1141 gttgatatta aaaccagtga aaccaaacat gacacctctc tgaaacctat tagtgtctcc
     1201 tacaacccag ccacagccaa agaaattatc aatgtggggc attctttcca tgtaaatttt
     1261 gaggacaacg ataaccgatc agtgctgaaa ggtggtcctt tctctgacag ctacaggctc
     1321 tttcagtttc attttcactg gggcagtaca aatgagcatg gttcagaaca tacagtggat
     1381 ggagtcaaat attctgccga gcttcacgta gctcactgga attctgcaaa gtactccagc
     1441 cttgctgaag ctgcctcaaa ggctgatggt ttggcagtta ttggtgtttt gatgaaggtt
     1501 ggtgaggcca acccaaagct gcagaaagta cttgatgccc tccaagcaat taaaaccaag
     1561 ggcaaacgag ccccattcac aaattttgac ccctctactc tccttccttc atccctggat
     1621 ttctggacct accctggctc tctgactcat cctcctcttt atgagagtgt aacttggatc
     1681 atctgtaagg agagcatcag tgtcagctca gagcagctgg cacaattccg cagccttcta
     1741 tcaaatgttg aaggtgataa cgctgtcccc atgcagcaca acaaccgccc aacccaacct
     1801 ctgaagggca gaacagtgag agcttcattt tgatgattct gagaagaaac ttgtccttcc
     1861 tcaagaacac agccctgctt ctgacataat ccagttaaaa taataatttt taagaaataa
     1921 atttatttca atattagcaa gacagcatgc cttcaaatca atctgtaaaa ctaagaaact
     1981 taaattttag ttcttactgc ttaattcaaa taataattag taagctagca aatagtaatc
     2041 tgtaagcata agcttatctt aaattcaagt ttagtttgag gaattcttta aaattacaac
     2101 taagtgattt gtatgtctat ttttttcagt ttatttgaac caataaaata attttatctc
     2161 tttctttctg ttgtgcattc agtttctaaa accattaagt ttctactcca tttacattca
     2221 aaaatcttaa atactttact tgcaagagta ttttgcttca aatacaacaa cctaagagca
     2281 gctggagatg aaatattggg aaattcattt gcttactcct gaagacaaaa atatagctga
     2341 gatgaccact ggatttaata tcgttatgct ggcccaacat tgctaccatt tgtgttgtct
     2401 gtgatcaaaa tgattatctt ttatatagga agatgacgct tctggatatt gctttcactt
     2461 cttctcccca cgttagcaag gacaatgctt ctctgccatt attacaacta gttagtttgc
     2521 atggagaatc tttactttaa aattggaaga aaagtcacaa gtgaatggtt tataaaaatg
     2581 ctaaagaagt cattcttgct tagaatcata tagaaacatc atgcaatctt ttagtcagat
     2641 gtgcgcttca ccttatgcta tttttatctt taattgacac acaataattg tacatgttta
     2701 tggagtatag tgtggtgttt tctgtttgtt tgtttgtttt ttgagacaag gtctcactct
     2761 gccagtcagg gtggagtgcg atggt
//
LOCUS       HUMPDHBZ     1408 bp ss-mRNA            PRI       28-JUN-1990
DEFINITION  Human pyruvate dehydrogenase E1-beta subunit mRNA, 3' end.
ACCESSION   M34056 J05549
KEYWORDS    pyruvate dehydrogenase E1-beta subunit.
SOURCE      Human liver, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1408)
  AUTHORS   Huh,T.L., Casazza,J.P., Huh,J.W., Chi,Y.T. and Song,B.J.
  TITLE     Characterization of two cDNA clones for pyruvate dehydrogenase
            E1~subunit and its differential regulation in cultured fibroblasts
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.J.Song, 07-MAY-1990.
            
               Author address: B.J.Song
               Laboratory of Metabolism and Molecular Biology
               National Institute on Alcohol Abuse and Alcoholism
               12501 Washington Ave.
               Rockville, MD 20852
               E-mail: landsman@thebard.nlm.nih.gov
FEATURES       from  to/span     description
    pept    <     1     1008     pyruvate dehydrogenase E1-beta subunit
                                 precursor /hgml_locus_uid="LU0223C"
                                 /nomgen="PDHB" /map="unassigned" (AA at 1)
    sigp          1       18     pyruvate dehydrogenase E1-beta subunit signal
                                 peptide
    matp         19     1005     pyruvate dehydrogenase E1-beta subunit
    signal     1397     1402     poly-A signal
BASE COUNT      404 a    268 c    339 g    397 t
ORIGIN      Chromosome 3p13-3q23.
        1 tggaccgcgc cggctgccgt gcaggtgaca gttcgtgatg ctataaatca gggtatggat
       61 gaggagctgg aaagagatga gaaggtattt ctgcttggag aagaagttgc ccagtatgat
      121 ggggcataca aggttagtcg agggctgtgg aagaaatatg gagacaagag gattattgac
      181 actcccatat cagagatggg ctttgctgga attgctgtag gtgcagctat ggctgggttg
      241 cggcccattt gtgaatttat gaccttcaat ttctccatgc aagccattga ccaggttata
      301 aactcagctg ccaagaccta ctacatgtct ggtggccttc agcctgtgcc tatagtcttc
      361 aggggaccca atggtgcctc agcaggtgta gctgcccagc actcacagtg ctttgctgcc
      421 tggtatgggc actgcccagg cttaaaggtg gtcagtccct ggaattcaga ggatgctaaa
      481 ggacttatta aatcagccat tcgggataac aatccagtgg tggtgctaga gaatgaattg
      541 atgtatgggg ttccttttga atttcctccg gaagctcagt caaaagattt tctgattcct
      601 attggaaaag ccaaaataga aaggcaagga acacatataa ctgtggtttc ccattcaaga
      661 cctgtgggcc actgcttaga agctgcagca gtgctatcta aagaaggagt tgaatgtgag
      721 gtgataaata tgcgtaccat tagaccaatg gacatggaaa ccatagaagc cagtgtcatg
      781 aagacaaatc atcttgtaac tgtggaagga ggctggccac agtttggagt aggagctgaa
      841 atctgtgcca ggatcatgga aggtcctgcg ttcaatttcc tggatgctcc tgctgttcgt
      901 gtcactggtg ctgatgtccc tatgccttat gcaaagattc tagaggacaa ctctatacct
      961 caggtcaaag acatcatatt tgcaataaag aaaacattaa atatttagtt tggacttgaa
     1021 tatcaagtcg ttgaaattta tttgaaatac ttgctggcac tgcacctgga tttgtactgc
     1081 aagacctgac tattcataaa ggaaaacgat ttctaaagca acagcaggta tttttgtaca
     1141 gggaagttta aatgtgtttg tgtatggaaa actctccact ctcctcccct agatgccatg
     1201 cttccttttg tctgttacgg ttgccatgtt ctttgaataa caaattatat cacattttat
     1261 cctctctcac cacaaggaca aagtatggat gtggcagagt cctgatgaaa gatgtatcca
     1321 aacaagataa cttatatgta taaaattaaa gcatataata cacatttact gttagtttgt
     1381 tttgataagg aataaaggaa tttctaac
//
LOCUS       HUMPYRUV     1095 bp ss-mRNA            PRI       28-JUN-1990
DEFINITION  Human pyruvate dehydrogenase E1-beta subunit mRNA, complete cds.
ACCESSION   M34055 J05549
KEYWORDS    pyruvate dehydrogenase E1-beta subunit.
SOURCE      Human liver, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1095)
  AUTHORS   Huh,T.L., Casazza,J.P., Huh,J.W., Chi,Y.T. and Song,B.J.
  TITLE     Characterization of two cDNA clones for pyruvate dehydrogenase
            E1~subunit and its differential regulation in cultured fibroblasts
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.J.Song, 07-MAY-1990.
            
               Author address: B.J.Song
               Laboratory of Metabolism and Molecular Biology
               National Institute on Alcohol Abuse and Alcoholism
               12501 Washington Ave.
               Rockville, MD 20852
               E-mail: landsman@thebard.nlm.nih.gov
FEATURES       from  to/span     description
    pept          6     1085     pyruvate dehydrogenase E1-beta subunit
                                 precursor /hgml_locus_uid="LU0223C"
                                 /nomgen="PDHB" /map="unassigned"
    sigp          6       95     pyruvate dehydrogenase E1-beta subunit signal
                                 peptide
    matp         96     1082     pyruvate dehydrogenase E1-beta subunit
    mRNA    <     1     1095     pyruvate dehydrogenase E1-beta subunit mRNA
    signal     1061     1066     poly-A signal
BASE COUNT      288 a    221 c    302 g    284 t
ORIGIN      Chromosome 3p13-3q23.
        1 ccaagatggc ggcggtgtct ggcttggtgc ggagacccct tcgggaggtc tccgggctgc
       61 tgaagaggcg ctttcactgg accgcgccgg ctgccgtgca ggtgacagtt cgtgatgcta
      121 taaatcaggg tatggatgag gagctggaaa gagatgagaa ggtatttctg cttggagaag
      181 aagttgccca gtatgatggg gcatacaagg ttagtcgagg gctgtggaag aaatatggag
      241 acaagaggat tattgacact cccatatcag agatgggctt tgctggaatt gctgtaggtg
      301 cagctatggc tgggttgcgg cccatttgtg aatttatgac cttcaatttc tccatgcaag
      361 ccattgacca ggttataaac tcagctgcca agacctacta catgtctggt ggccttcagc
      421 ctgtgcctat agtcttcagg ggacccaatg gtgcctcagc aggtgtagct gcccagcact
      481 cacagtgctt tgctgcctgg tatgggcact gcccaggctt aaaggtggtc agtccctgga
      541 attcagagga tgctaaagga cttattaaat cagccattcg ggataacaat ccagtggtgg
      601 tgctagagaa tgaattgatg tatggggttc cttttgaatt tcctccggaa gctcagtcaa
      661 aagattttct gattcctatt ggaaaagcca aaatagaaag gcaaggaaca catataactg
      721 tggtttccca ttcaagacct gtgggccact gcttagaagc tgcagcagtg ctatctaaag
      781 aaggagttga atgtgaggtg ataaatatgc gtaccattag accaatggac atggaaacca
      841 tagaagccag tgtcatgaag acaaatcatc ttgtaactgt ggaaggaggc tggccacagt
      901 ttggagtagg agctgaaatc tgtgccagga tcatggaagg tcctgcgttc aatttcctgg
      961 atgctcctgc tgttcgtgtc actggtgctg atgtccctat gccttatgca aagattctag
     1021 aggacaactc tatacctcag gtcaaagaca tcatatttgc aataaagaaa acattaaata
     1081 tttagtttgg acttg
//
LOCUS       ENTOMPX       664 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  E.cloacae outer membrane protein X (OmpX) gene, complete cds.
ACCESSION   M33878
KEYWORDS    outer membrane protein X.
SOURCE      E.cloacae (strain 2249-1) DNA.
  ORGANISM  Enterobacter cloacae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 664)
  AUTHORS   Stoorvogel,J., van Bussel,M.J.A.W.M., Tommassen,J. and
            van de Klundert,J.A.M.
  TITLE     Molecular characterization of an Enterobacter cloacae outer
            membrane protein OmpX
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by J.Stoorvogel, 26-APR-1990.
            
                     Author address: J.Stoorvogel
                     Dept of Medical Microbiol, Univ Hospital,
                     P.O.Box 9600, 2300 RC Leiden,
                     THE NETHERLANDS
FEATURES       from  to/span     description
    pept         83      601     outer membrane protein X precursor
    sigp         83      151     outer membrane protein X signal peptide
    matp        152      598     outer membrane protein X
    signal       18       25     -35 region
    signal       43       48     -10 region
    binding      74       77     ribosome binding site
    signal      620      649     transcription termination signal
BASE COUNT      171 a    156 c    162 g    175 t
ORIGIN      
        1 atcgtgaatg acaaagagtt tcacagattt ttttgatata tttaaaactt acggacttac
       61 ttgaagcaca tttgaggtgg ttatgaaaaa aattgcatgt ctttcagcac tggcagctgt
      121 tctggctgtt tccgcaggta ccgctgtagc ggcaacttct actgtaactg gtggttacgc
      181 tcagagcgat atgcagggcg tgatgaacaa aaccaacggt ttcaacctga agtaccgtta
      241 cgagcaagac aacaacccgc tgggtgtgat cggttctttc acttacaccg agaaagatcg
      301 tactgaaaat ggctcttaca ataaaggtca gtactacggc atcaccgcgg gtcctgctta
      361 ccgcctgaat gactgggcaa gcatctacgg tgttgtaggt gttggctacg gtaaattcca
      421 gcagaccgaa aaccagggtc tgaaccgtac tgcaagcaac agcgactacg gtttctccta
      481 tggcgcaggt atgcagttca acccaatcga aaacgttgct ctggacttct cctatgagca
      541 gagccgtatc cgcaacgttg acgttggcac ctggatcgcg ggcgtaggtt accgcttcta
      601 atcacttcgg tgagccagta aaaaatccgc ccaatgtggc ggattttttt tgtttgtacc
      661 ctct
//
LOCUS       ACARGA       1212 bp ds-DNA             INV       28-JUN-1990
DEFINITION  A.castellanii 5S RNA.
ACCESSION   M34003
KEYWORDS    5S RNA.
SOURCE      A.castellanii (strain Neff) trophozoite DNA, clone pAc5S.1.
  ORGANISM  Acanthamoeba castellanii
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina; 
            Acanthamoebidae.
REFERENCE   1  (bases 1 to 1212)
  AUTHORS   Zwick,M.G., Wiggs,M. and Paule,M.R.
  TITLE     Cloning, sequencing and genomic organization of a 5S RNA gene from
            the eukaryotic protist Acanthamoeba castellanii
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by M.R.Paule, 04-MAY-1990.
            
                  Author address: M.R.Paule
                  Colorado State University, Dept. Biochemistry,
                  Fort Collins, CO 80523
FEATURES       from  to/span     description
    RNA         489      610     5S RNA precursor
    RNA         489      607     5S RNA
BASE COUNT      195 a    374 c    329 g    314 t
ORIGIN      
        1 ctgcaggaga ttcgtcgggc tcgtgccgcg atcgtcttga tgatcgagac cgacggcccg
       61 ttcctcactc cgcgcacctt gcgccttgcc cgcgccgcaa cgagccgtct tacctcaccc
      121 acgtcctaca gagtacttac atcctccaat gtgttgtgtg cgtgtcatgt gcccaatgta
      181 tgtggtgtgt gcggtgtgtg cggtgtgtga ggtgtgtggt gtgcgcgtgt catgtctgtt
      241 acacgttctg ttgaccaaca ttttttatgg tgtgtacagc ggtcgccgac tgcaccggac
      301 gatcggtaga ggaggtggcc ggtgaaacca cgcagaccgc ggtgcgcttc ttcggtctgg
      361 accaaatccg taaggagtga atgtagcaca cccctcccct ccatgtcatt gaagcccaag
      421 ttagtgtgca atttaaatga tggtgtgacc aagagggatc acatgggcga agtccctggt
      481 aggctgatgg atacggccat actgcgcaga aagcaccgct tcccatccga acagcgaagt
      541 taagctgcgc caggcggtgt tagtactggg gtgggcgacc acccgggaat ccaccgtgcc
      601 gtatcctcct ttttgcttct tcactctcgc cactattgct tcaggtcgcc tcatcgcctg
      661 ccacagctga ctacaaactg ggtgagccaa acctgccttt tgcactggtc cagtgtacaa
      721 ggctggaaca actttgtttc acacttgctg aggtcaatca atccagcctt gttggttgag
      781 gccacaggag ggatctgcca cctttcacac catctcgtcg agaaggctgc ttgacgcttc
      841 ttcgctgtcc ttcggggctg atgtctcgac tgtgtgtccg tcttcatcgc gattgtcact
      901 tgcgttgttg ttgttgcggc tttcgtccct gtcctcatcg ccgtcgtcgt cttcatcgct
      961 gttgtcgcgc gactcactcc gcaaaccgtc tggccgtgtg ctgctcctcc tcttcatcgt
     1021 tgctgtcatc tcggtcgctg cggccacgcc tcgaagaaga cagcccgccc tcttcggccg
     1081 cctcggtccc gcccctcgca aagcggatct cgcctcgtcg gctgccgctg gccgtcgctc
     1141 cgcctggcgt cgttgtcttc cctcgcatcg tcctcaaact tgtcgtgcgt ccacttgtcg
     1201 gcaaatgtcg ac
//
LOCUS       HUMCDR34     2412 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human cerebellar-degeneration-related antigen (CDR34) gene,
            complete cds.
ACCESSION   M31423 M34243
KEYWORDS    cerebellar-degeneration-related antigen.
SOURCE      Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2412)
  AUTHORS   Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A.,
            Chaganti,R.S.K., Posner,J.B. and Old,L.J.
  TITLE     Cerebellar degeneration-related antigen: A highly conserved
            neuroectodermal marker mapped to chromosomes X in human and mouse
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            Y.-T.Chen, 17-JAN-1990
FEATURES       from  to/span     description
    pept        503     1174     cerebellar-degeneration-related antigen
                                 (CDR34)
BASE COUNT      743 a    334 c    669 g    666 t
ORIGIN      
        1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct
       61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat
      121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg
      181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga
      241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg
      301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag
      361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg
      421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga
      481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt
      541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt
      601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga
      661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat
      721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt
      781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga
      841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct
      901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt
      961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga
     1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt
     1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat
     1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg
     1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa
     1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac
     1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa
     1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt
     1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact
     1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat
     1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct
     1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga
     1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct
     1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac
     1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg
     1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata
     1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa
     1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc
     2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt
     2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta
     2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga
     2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta
     2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat
     2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt
     2401 gagatgctgc ag
//
LOCUS       RATHGF       2485 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Rat hepatocyte growth factor mRNA, complete cds.
ACCESSION   M32987
KEYWORDS    hepatocyte growth factor.
SOURCE      Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2485)
  AUTHORS   Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M.,
            Shimizu,S. and Nakamura,T.
  TITLE     Deduced primary structure of rat hepatocyte growth factor and
            expression of the mRNA in rat tissues
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by T.Nakamura, 26-MAR-1990.
FEATURES       from  to/span     description
    pept        143     2329     hepatocyte growth factor
BASE COUNT      750 a    533 c    577 g    625 t
ORIGIN      
        1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc
       61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga
      121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct
      181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca
      241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa
      301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa
      361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc
      421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt
      481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa
      541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg
      601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct
      661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag
      721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat
      781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg
      841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga
      901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta
      961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc
     1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg
     1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca
     1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa
     1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat
     1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta
     1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg
     1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc
     1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg
     1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga
     1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca
     1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa
     1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc
     1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat
     1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct
     1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct
     1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac
     1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg
     2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt
     2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga
     2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt
     2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc
     2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa
     2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa
     2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca
     2461 ttaatatatg tggcgttttc tgttg
//
LOCUS       DDIDPYK1A    1090 bp ss-mRNA            INV       28-JUN-1990
DEFINITION  D.discoideum protein-tyrosine kinase-1 (DPYK1) mRNA, complete cds.
ACCESSION   M33785
KEYWORDS    protein-tyrosine kinase-1.
SOURCE      D.discoideum (strain AX-3) 4-hour, cDNA to mRNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 1090)
  AUTHORS   Tan,J.L. and Spudich,J.A.
  TITLE     Developmentally regulated protein-tyrosine kinase genes in
            Dictyostelium
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.L.Tan, 20-APR-1990.
            
              Author address: J.L.Tan
              Stanford University School of Medicine
              Dept. of Cell Biology
              Stanford, CA 94303
              (415) 723-7408
            
FEATURES       from  to/span     description
    pept    <     1     1014     protein-tyrosine kinase-1 (DPYK1; AA at 1)
BASE COUNT      371 a    198 c    186 g    335 t
ORIGIN      
        1 cgcccatttg gtggttggga aactcaatca tcattatcac atccaccatc acgtccacca
       61 ccacctccac caccaccacc acaactacca gttagatcag aatacgagat tgatttcaat
      121 gaattagaat ttggtcaaac cattggtaaa ggtttctttg gtgaagtaaa gagaggttat
      181 tggagagaga ctgatgttgc cataaaaatc atctatcgtg atcaattcaa aaccaaatca
      241 tcattggtta tgtttcaaaa tgaagttgga atactaagta aattaagaca tccaaatgta
      301 gttcaatttt tgggtgcatg tactgcagga ggtgaagatc atcattgtat agtaacagaa
      361 tggatgggtg gaggtagttt aagacagttc ttgactgatc atttcaattt actcgaacaa
      421 aatccacata ttcgtttgaa gttggctttg gatattgcaa aaggaatgaa ttatctacat
      481 ggttggactc cacccattct tcatcgtgac ttatcctcaa gaaacatttt attggatcac
      541 aacatcgatc caaagaatcc gttagtttcc tcaagacaag atattaaatg taagatctct
      601 gattttggtc taagtagatt aaagaaggaa caagcctctc aaatgactca atcggttggt
      661 tgtattccct acatggcacc agaggttttc aaaggcgata gtaatagtga aaagagtgat
      721 gtttactcct atggcatggt tttgtttgaa ctattaacct ctgatgaacc tcaacaagat
      781 atgaaaccaa tgaaaatggc tcacttggct gcttatgaat cttatcgtcc tccaattcca
      841 ttaactacct cttccaagtg gaaagaaatt ctaactcaat gttgggattc taatcctgat
      901 agtcgtccaa cctttaaaca aatcattgtt catctcaaag aaatggaaga tcaaggtgta
      961 tcttcttttg catctgtacc tgttcaaact attgatactg gtgtttatgc ttaatttttt
     1021 ttttataatt aaaaaaaaaa aaaacaaaac aaaaaaaaaa aataataata aatataatca
     1081 cttcaactcg
//
LOCUS       DDIDPYK2A    1292 bp ss-mRNA            INV       28-JUN-1990
DEFINITION  D.discoideum protein-tyrosine kinase-2 (DPYK2) mRNA, complete cds.
ACCESSION   M33784
KEYWORDS    protein-tyrosine kinase-2.
SOURCE      D.discoideum (strain AX-3) 4-hour, cDNA to mRNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 1292)
  AUTHORS   Tan,J.L. and Spudich,J.A.
  TITLE     Developmentally regulated protein-tyrosine kinase genes in
            Dictyostelium
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.L.Tan, 20-APR-1990.
            
              Author address: J.L.Tan
              Stanford University School of Medicine
              Dept. of Cell Biology
              Stanford, CA 94303
              (415) 723-7408
            
FEATURES       from  to/span     description
    pept    <     1     1234     protein-tyrosine kinase-2 (DPYK2; AA at 2)
BASE COUNT      491 a    207 c    204 g    390 t
ORIGIN      
        1 cgattctaca agtacaacaa actctactaa agatatcaca tttttagttt gtgataatcc
       61 tgattcaact aaagaaaaga gtaacgtttc aaatacttca tcaataattt ccgcttcaaa
      121 tttaaataga catataacac caaattctca tatgagacct agaggtagat caatttctga
      181 atctttaatt atgtcaccaa ttaataaaga atctttaaat gatattcaaa gagcaattga
      241 aagtgaaaaa ataaagaaaa ctaaatttga agaattaaaa tcaatattgg gcgaaagaga
      301 atatataatt gatataaatg atattcaatt tatacaaaaa gttggagaag gtgcattcag
      361 tgaagtttgg gaaggttggt ggaaaggtat tcatgttgcc ataaaaaagt taaagattat
      421 aggagatgaa gaacaattca aagagagatt cattagagag gttcaaaatt tgaaaaaagg
      481 aaatcatcaa aacattgtca tgtttattgg tgcatgttat aaaccagcat gtatcataac
      541 agagtatatg gcaggtggta gtctttacaa tatacttcat aatccaaata gttccactcc
      601 aaaagttaaa tattctttcc cattggtttt gaaaatggca accgacatgg cattgggctt
      661 attacatctt cattccatca ccattgtgca tcgtgattta accagtcaaa acattctatt
      721 ggatgaattg ggtaatataa agatctctga ttttggttta tctgctgaaa agagtagaga
      781 aggttcaatg acaatgacaa atggtggcat ttgcaatcca agatggagac cacccgaatt
      841 gacaaagaat ttaggtcact actcggaaaa ggttgatgtc tattgtttct ctctagtagt
      901 ttgggaaatt ttaactggcg aaattccttt ctctgattta gatggatctc aacgatccgc
      961 tcaagtagct tatgctggtt taagaccacc aataccagag tattgcgatc ctgaattaaa
     1021 attactctta actcaatgtt gggaggctga tccaaatgat agacctccct ttacctatat
     1081 agtaaacaaa ttaaaagaaa tctcttggaa taatccaatt ggtttcgtct ctgatcaatt
     1141 ctatcaatat agcgaacctt caactccaag attagcatta tcaaatcaat cttcaaattc
     1201 aagtagtatt tctttatcac caactaaatt ataaaaaaaa aaaaaaaaaa aaacaaattt
     1261 caaacaccaa acaccaccac tcatcaaaat cg
//
LOCUS       MUSH2A1X      929 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Mouse histone H2A.1 gene, complete cds.
ACCESSION   M33988
KEYWORDS    histone H2A.1.
SOURCE      Mouse (BALB/c) DNA, clone 53.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 929)
  AUTHORS   Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and
            Sch|mperli,D.
  TITLE     Structure of a mouse histone gene cluster
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.Sch|mperli, 02-MAY-1990.
            
                      Author address: Dr.Schmperli
                      Zoologisches Institut, Baltzerstrasse 4
                      CH-3012 Bern
                      SWITZERLAND
FEATURES       from  to/span     description
    pept        164      556     histone H2A.1
    mRNA        119      624     histone H2A.1 mRNA
BASE COUNT      238 a    247 c    222 g    222 t
ORIGIN      
        1 gagctcaaat tctggctttc tattgggtac gatatattaa ccaatgggag aaacacaaac
       61 agaatacctc cagttagtat aaatgcttgc tgttcagttg cagaatttac tatatattct
      121 tttcctttct ctgctttgcc tttactgata cttaaacgca tacatgtctg gacgcggaaa
      181 gcaagggtgc aaggcccgcg ctaaggccaa gacccgctcc tcccgggccg gcctgcagtt
      241 ccccgtgggc cgcgtgcacc ggctgctccg caagggcaac tactcggagc gcgtgggcgc
      301 cggcgccccg gtgtacctgg cggctgtgct ggagtacctg acggccgaga tcctggagct
      361 ggcgggcaat gcggcccgcg acaacaagaa gacgcgcatc atcccgcgcc acctgcagct
      421 ggccatccgc aacgacgagg agctcaacaa gctgctgggc cgcgtgacca tcgcgcaggg
      481 cggcgtcctg cccaacatcc aggccgtgct gctgcccaag aagaccgaga gccaccacaa
      541 ggccaagggg aagtaatctg gcgattgtct gtactgccca gttgaaagtt aaccaaaaca
      601 aaggctcttt tcagagccac ccacatcttt ccataaaatg agctgccacc tcgtgaaacg
      661 ttcttccact acagttttta tactacatat gaaaaagtta cgaagtagct ttcaatctta
      721 gtaaattgat tttaatactg ttagtccctg cgataaatct tacgaccttc cttagtttga
      781 gtcaaaagtg tgtaagagat gaaaccttta gaacatacta taaattttta gtagaaattt
      841 ggcacccagg tttgtcattc acgtcacgat tgtctagagc ataatggtag taagggctaa
      901 gggccattaa atcccacttc catagtttc
//
LOCUS       MUSH2A1XX     693 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Mouse histone H3.2 gene, complete cds.
ACCESSION   M33989
KEYWORDS    histone H3.2.
SOURCE      Mouse (strain BALB/c) DNA, clone 53.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 693)
  AUTHORS   Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and
            Sch|mperli,D.
  TITLE     Structure of a mouse histone gene cluster
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.Sch|mperil, 02-MAY-1990.
            
                       Author address: D.Schlmperil
                       Zoologisches Institut, Baltzerstrasse 4
                       CH-3012 Bern
                       SWITZERLAND
FEATURES       from  to/span     description
    pept        151      561     histone H3.2
    mRNA        116      611     histone H3.2 mRNA
BASE COUNT      159 a    206 c    184 g    144 t
ORIGIN      
        1 caggatttag aagcagaggc tgaccaatcc caacaaagcg cgggcccttt gaatgttctt
       61 cggtccaata gcggatagtc tgattgtata aaaggtggac agcgccttgc agctcactat
      121 agtgtcagtc tattttccct tgttaaagtc atggctcgta ctaagcagac cgctcgtaag
      181 tccaccggtg gcaaggcccc gcgtaagcag ctggccacca aggccgcccg caagagcgcc
      241 ccggccaccg gcggcgtgaa gaagcctcac cgctaccgtc ccggcaccgt ggcgctgcgc
      301 gagatccggc gctaccagaa gtcgaccgag ctgctgatcc gcaagctgcc gttccagcgc
      361 ctggtgcgcg agatcgcgca ggacttcaag accgacctgc gcttccagag ctcggctgtc
      421 atggctctgc aggaggcgag cgaggcctac cttgtgggtc tgtttgagga caccaacctg
      481 tgcgccatcc acgccaagcg tgtcaccatc atgcccaagg acatccagct cgcccgccgc
      541 attcgtgggg agagggctta aacgaaacaa agtgcaaact ggaacccaaa ggctcttttc
      601 agagccaccc acagtttctg taaaagcagc tgttactttt ctactgtctt ggaaactttg
      661 tttttcaaag atttatctag ttacaaagta tac
//
LOCUS       NEUPSAS      3681 bp ds-DNA             PLN       28-JUN-1990
DEFINITION  N.crassa arginine-specific carbamoyl phosphate synthetase (ARG-2)
            gene, complete cds.
ACCESSION   J05512
KEYWORDS    carbamoyl phosphate synthetase.
SOURCE      N.crassa (strain 74-OR23-IVA)asexual phase DNA, clone pAR231, and
            cDNA to mRNA, clone pAB1.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 3681)
  AUTHORS   Orbach,M.J., Sachs,M.S. and Yanofsky,C.
  TITLE     The Neurospora crassa arg-2 locus: Structure and expression of the
            gene encoding the small subunit of arginine-specific carbamoyl
            phosphate synthetase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by M.S.Sachs, 07-MAY-1990.
FEATURES       from  to/span     description
    pept       1451     1630     arginine-specific carbamoyl phosphate
                                 synthetase, exon 1 (ARG-2) (EC 6.4.5.5)
               1687     2868     arginine-specific carbamoyl phosphate
                                 synthetase, exon 2
    pept        870      902     ORF, exon 1
               1270     1282     ORF, exon 2
               1359     1387     ORF, exon 3
    pre-msg     752     3082     ARG-2 mRNA and introns
    pre-msg     809     3082     ARG-2 mRNA and introns (alt.)
    IVS        1631     1686     ARG-2 intron A
    IVS         903     1269     ORF intron A
    IVS        1283     1358     ORF intron B
    signal      161      166     TGACTC box
    signal      568      573     TGACTC box
    signal     1008     1013     TGACTC box
    signal     1307     1312     TGACTC box
BASE COUNT      760 a   1093 c    863 g    965 t
ORIGIN      
        1 ggatccagac gctatatttg attgttccct cttctgtcag cgtgcggcct ccgttgtccg
       61 tgcagccgat tacgacggcc ggctctggcg caacctgggt ttgggagctt cagaagcttg
      121 aaagcgaggc gaatcgcgac tgtacaagtc cccgcaactt gagtcaccga aagcgcgggg
      181 tatccaccat ctttatctgc ggcggtaggt aggcggggac tttgcggggg ctttgcggga
      241 cccgaccact ccaactccga gattcgttgg gggtgccccg ggcaggagat gccgagctgg
      301 agtcgcgttg tttgtgaatg tcgtggctgg ggttgtgggc ataggttgcg atgtgtagtg
      361 tacgtagtac atagtaggta tgttgaacac cggatgggtg attccagttg tgaggctccg
      421 ggactagtag gggtggtaac cgtgatggtc aaagggcatg tggaacttgc agcgtcccaa
      481 cgaagtccag gtccccagtc atcccaacgc cgcgatcaat cacccctcgg tccccgtaaa
      541 tcggcctttg ggactgccta gcagcgggag tcaccgagtc ctggaccaat caggtcggcg
      601 ctgaccctgt cactgcggca gaaaaaagac aaagagctca aaaaattcta ccagatccaa
      661 tcaaatgtcg ccctttttct atttgcctca cccactttcg caccctcttt tatctccaag
      721 tttctcccct cgacactaca actcactccg tcatcaactc ttgcccatct acaaaacaca
      781 caataacccc aatcgcttcc cccgactttc ggactctttc gtccaacctt ctttcttcag
      841 ctgaacttgt cttgtcgcaa tctgccctta tgaacgggcg cccgtcagtc ttcacctctc
      901 aggtacgata acgcgatagc ccctccatcg tgctggtatc cagccaccat ccacccagtc
      961 catcaaagaa gttcccccgg atgagcaatg aagcccccat accagcctga ctcgcgatgc
     1021 tgtatcggag tagttgggac gacccgccag taacccacca acctgggtgc ttggcatcat
     1081 gccgctctga gttggttttg ggagatgtgg ccagcggtga tccccaaagt tcagctcttc
     1141 acggattccg attgcatatc tctcaagatc gcgcagacag ccatcctggg ggagacttct
     1201 tctcctcgct tctgccgctt gccgtctgtt tgtagtgtct gatcaaaact aattctgttg
     1261 cctttgtagg attacctctc aggtccgcat ccccgccgac tcccgttgac tcaccctttt
     1321 ccggctggaa tactgacgat gaattttgtg tgttacagac catctgtgga gagcccttaa
     1381 cgcataagag cctctcatca cccagcagcc gtaccaatca ccaccgcacc ccatcaccat
     1441 tcaagtcaag atgttctctc gcttggccgc tcgtctcccc aaggcttctg ccctcaatgg
     1501 cgtcgccgcc cgtcaggttc gcaacctgag ccagcccgcc attaccggca gcaagggcag
     1561 gaacatgccc gcccgtgagc cgcgcactac tgccgctgcc accggcgccg aggccacctt
     1621 cactatcagg gtgcgtacct acaactagtc ttctctcgtg gctaaaccta acattttctc
     1681 gctcaggatg gccccgtttt ccagggtacc gccttcggtg ccaacaccaa catttctggt
     1741 gaagccgttt tcaccacctc ccttgttggt taccccgagt cgatgaccga tccctcgtac
     1801 cgcggccaga ttttggtctt cacccagccc ttgattggca actacggtgt cccttcgaat
     1861 gagcgtgacg agttcaacct cctaagtact tcgagtctcc ccacatccag tgcgccggta
     1921 tcgttgtctc cgatgtcgct acccagtaca gccactggac tgctgttcag agcttgggcg
     1981 agtggtgcgc cagcgagggc attcccgcca ctctccggtg ttgacactcg tgcgattgtc
     2041 acctacctcc gtgagcaggg ttcctctctc gccaggatct ccattggcga cgagtacgac
     2101 gccgatgagg atgagggctt cattgaccct ggccagatca accttgtcaa gcgcgtgtcc
     2161 accaaggctc cctttgtggt taccaacccc aacgccaagt tccacgtcgc tctgatcgac
     2221 tgcggtgtca aggagaacat ccttcgtagc ttggtcagcc gtggcgcctc cgtcaccgtc
     2281 ttcccctaca attaccccat ccacaaggtt gccgagaact tcgatggtgt cttcatttcc
     2341 aacggccctg gtgatcccac ccactgccag gagactgttt acaacctcgc caagcttatg
     2401 gagacttccc ctattcccat catgggtatc tgccttggtc accagcttct tgctctcgct
     2461 gttggtgcca agaccatcaa gctaagtatg gtaatccgtg cccacaacat cccggctctt
     2521 gacttgacca ctggtcagtg ccacatcacc tcgcagaacc acggttacgc cgtcgatatc
     2581 agcaccctcc ctagcgactt caaggagtat ttcgtcaacc ttaacgacgg ctccaacgag
     2641 ggtatgatgc acaagacccg ccccatcttc tctacccagt tccaccccga ggctaagggt
     2701 ggtcccatgg actcttccta cctctttgac aagtacatgg agaatgtcga gcttttcaag
     2761 agcaactccc aggtctaccg tgacaacagg cctactcagt tcatgattga cattctcagc
     2821 aaggagcgtg tcggcgttga gcccactcct ctttctaacg ctgcttaacg gacgagaatg
     2881 tctttgcatt ttcttagtga acggtgtttg tcattggtct tgggttattg ttttccagca
     2941 caaaggatat gggttttgta aaagggactt tcttgggcga cgtcctctcg ttattaatgg
     3001 tttagggtgt atatgatatc ttttgttact acttttgaat gcagaatcag ataagtttag
     3061 tcggcattag gctgatatgg cgcagagagg acaatatcta gttgatacat acccatgtgc
     3121 tatcacgtct aaacgtcttc cagtcgttgt attttaccca ttcctggtga ttgccgtatg
     3181 tcttccagtc cagtacctcc cctaattcgg tgcccttcaa cattcagggt tatgttgaca
     3241 gcgtgctgta taagcatggt tgccggggaa catggtgtca ttgactggta agcagggcag
     3301 cctctggttc tgtgacacct ggttttccct ggtcttgatt tcaaactatt atcaaacaaa
     3361 tcagcgtcgt cttagtagat ggcaggtatc ttagaacaga aagtagttat tcttcgtcgg
     3421 atacgactcc aaaatcatat acgaacaagt ggatgcttta tactatgctt ctttttactg
     3481 taatcttggt ctcgctcttc gcttagcctt cttcacctgg taaccggtat aggtaccttc
     3541 gcaccctcaa ttggaaggat gcatcaaccc ctccatgcgc tgcttatcct gctatctatc
     3601 catacatttc tatgttttta cagtgtagaa tatttcttgt gtatcagggt ccgtcccgcg
     3661 atgctacccc ctgtcctgca g
//
LOCUS       MUSIMPD      1707 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Mouse IMP dehydrogenase mRNA, complete cds.
ACCESSION   M33934
KEYWORDS    IMP dehydrogenase.
SOURCE      Mouse, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1707)
  AUTHORS   Tiedemann,A.A. and Smith,J.M.
  TITLE     Isolation and DNA sequence of a cDNA encoding mouse IMP
            dehydrogenase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by J.M.Smith 27-APR-1990.
            
               Author address:
               Dr. J.M. Smith
               Seattle Biomedical Research Institute
               4 Nickerson St.
               Seattle, WA 98109
FEATURES       from  to/span     description
    pept         67     1611     IMP dehydrogenase (EC 1.2.1.14)
BASE COUNT      457 a    408 c    451 g    391 t
ORIGIN      
        1 cctctgcggc gcggtcctcg gagcggcgcg gttctcggag ccacgcgtct gtcttcctcc
       61 gtggtcatgg cggactacct gattagcgga ggcacctctt acgtgccgga cgacgggctc
      121 acagcgcagc agctcttcaa ctgcggggac ggcctcacct acaatgattt tctcattctt
      181 cctgggtata tcgacttcac tgcagatcag gtggacttga cgtctgctct aactaagaag
      241 attacactaa agaccccatt ggtttcctca cccatggaca ctgtcacaga ggctggaatg
      301 gccatcgcga tggcgcttac aggaggtatt ggtttcatcc accacaactg cacacctgaa
      361 ttccaggcca atgaagttcg gaaagtgaag aaatacgaac agggattcat cactgacccc
      421 gtggtcctta gccccaagga tcgtgtacgc gatgtttttg aggccaaagc caggcatggc
      481 ttctgtggta tccccatcac agatacaggc cggatgggga gtcgattggt gggcatcatc
      541 tcctcaaggg acattgattt cctcaaggag gaagagcatg accggttctt ggaagagatc
      601 atgactaaga gggaagattt ggtggtcgcc cctgccggcg tcactctgaa agaggcaaat
      661 gagattctgc agcgaagtaa aaagggaaag ttgcccattg tgaatgaaaa tgatgagctg
      721 gtagccatca ttgcccggac agacctaaag aagaatcgtg attaccccct ggcctccaaa
      781 gatgccaaga agcaactgct gtgtggggca gccattggca ctcatgagga tgacaagtat
      841 aggctggact tactggccct tgctggtgtg gatgtagtgg ttttggactc ttcccaggga
      901 aactccatct tccaaatcaa tatgatcaaa tacatcaagg agaagtatcc cagtctacag
      961 gtcattggag gcaatgtagt cactgctgcg caagccaaga acctcataga tgcaggtgta
     1021 gatgctttgc gagtcggcat gggaagtggt tccatctgca tcacccagga agtgttggcc
     1081 tgtgggcggc cccaagccac agcagtgtac aaggtctctg agtatgcccg tcgctttggt
     1141 gttcctgtta ttgctgatgg aggaatccaa aatgtgggtc atattgccaa agctttggct
     1201 cttggggctt ccacagtcat gatgggctcc ctcctggctg ccaccactga ggcccctggc
     1261 gagtacttct tctcagatgg gatccggctg aagaaatacc gaggtatggg ttctcttgat
     1321 gccatggaca aacatctcag cagccagaac cgatacttca gtgaagctga caaaatcaaa
     1381 gtggcccaag gagtttcagg ggcagtgcag gacaaggggt ctatccacaa gttcgttcct
     1441 tacctgattg ctggcatcca gcattcctgt caagacattg gtgccaagag tttaacccaa
     1501 gtcagagcca tgacgtactc gggggagctt aaatttgaga agaggacatc ctctgctcag
     1561 gtggaaggtg gcgtccacag cctccattcg tacgagaaac ggcttttctg aaaacagatc
     1621 cagtatatgc cttgaatttt tcaataaaag tttgggaaaa aaaaagtgaa aaaaaaaaaa
     1681 aaaaaaaaaa aaaaaaaaaa aaaaaaa
//
LOCUS       HUMOSTRO     1425 bp ss-mRNA            PRI       28-JUN-1990
DEFINITION  Human osteopontin mRNA, complete cds.
ACCESSION   J04765
KEYWORDS    osteopontin; phosphoprotein.
SOURCE      Human adult osteoblast, cDNA to mRNA, clone HOP-10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1425)
  AUTHORS   Young,M.F., Fisher,L.W., McBride,O.W. and Termine,J.D.
  TITLE     cDNA cloning, chromosomal location and RFLP analysis of human
            osteopontin
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by M.F.Young, 04-MAY-1990.
FEATURES       from  to/span     description
    pept         68     1012     osteopontin precursor
                                 /hgml_locus_uid="LK0223T" /nomgen="OPN"
                                 /map="unassigned"
    sigp         68      115     osteopontin signal peptide
    matp        176     1012     osteopontin
    mRNA    <     1     1425     osteopontin mRNA
BASE COUNT      449 a    293 c    296 g    387 t
ORIGIN      Chromosome 4q13.
        1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact
       61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc
      121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga
      181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca
      241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc
      301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga
      361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca
      421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac
      481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg
      541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga
      601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt
      661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga
      721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca
      781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc
      841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt
      901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct
      961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa
     1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa
     1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg
     1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa
     1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg
     1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact
     1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag
     1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag
//
LOCUS       RATALADPS    1612 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Rat delta-aminolevulinate dehydratase processed pseudogene,
            complete cds.
ACCESSION   J04764
KEYWORDS    delta-aminolevulinate dehydratase; pseudogene.
SOURCE      Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1612)
  AUTHORS   Bishop,T.R., Frelin,L.P. and Boyer,S.H.
  TITLE     Isolation and characterization of a rat delta-aminolevulinate
            dehydratase processed pseudogene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by T.R.Bishop, 20-APR-1990.
FEATURES       from  to/span     description
    pept.ps     271     1264     pseudo-delta-aminolevulinate dehydratase
    rpt         101      108     8 bp flanking direct repeat
    rpt        1385     1392     8 bp flanking direct repeat
BASE COUNT      404 a    411 c    418 g    379 t
ORIGIN      
        1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt
       61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt
      121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca
      181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta
      241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc
      301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa
      361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct
      421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga
      481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca
      541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg
      601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat
      661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg
      721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat
      781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac
      841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat
      901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga
      961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca
     1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct
     1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa
     1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc
     1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga
     1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa
     1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga
     1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta
     1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt
     1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg
     1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at
//
LOCUS       MZEZEISP      215 bp ds-DNA             PLN       28-JUN-1990
DEFINITION  Z.mays zein storage protein gene, 5' flank.
ACCESSION   M33830
KEYWORDS    storage protein; zein.
SOURCE      Z.mays endosperm DNA, clone pMS1.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 215)
  AUTHORS   Grasser,K.D., Maier,U.-G., Haass,M.M. and Feix,G.
  TITLE     Maize high mobility group proteins bind to CCAAT and TATA boxes of
            a zein gene promoter
  JOURNAL   J. Biol. Chem. 265, 4185-4188 (1990)
  STANDARD  simple staff_review
BASE COUNT       84 a     42 c     26 g     63 t
ORIGIN      
        1 gtataaatta taacccaaca aagtaatcac tcaatgtcaa aaccaactag ataccatgtc
       61 atctctacct tatcttacta atatcttttt gcaaattcga aattaatctt gcacaagcac
      121 aaggactgag atgtgtataa atatctctta gattagtagc taatatatcg cacatattat
      181 tgagaccaac tagcaacata gaaagcacaa tagtg
//
LOCUS       CHKARCASR    2392 bp ss-mRNA            VRT       28-JUN-1990
DEFINITION  Chicken 5-aminoimidazole ribonucleotide (AIR) carboxylase-5-
            aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR)
            synthetase, complete cds.
ACCESSION   M31764
KEYWORDS    5-aminoimidazole ribonucleotide carboxylase;
            5-aminoimidazole-4-N-succinocarboxamide ribonucleotide synthetase.
SOURCE      Chicken liver, cDNA to mRNA, clone pZD1.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 2392)
  AUTHORS   Chen,Z., Dixon,J.E. and Zalkin,H.
  TITLE     Cloning of a chicken liver cDNA encoding 5-aminoimidazole
            ribonucleotide carboxylase and
            5-aminoimidazole-4-n-succinocarboxamide ribonucleotide synthetase
            by functional complementation of Escherichia coli pur mutants
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3097-3101 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Zalkin, 01-FEB-1990.
FEATURES       from  to/span     description
    pept         59     1339     5-aminoimidazole ribonucleotide (AIR)
                                 carboxylase-5-
                                 aminoimidazole-4-N-succinocarboxamide
                                 ribonucleotide (SAICAR) synthetase
    matp         59      835     SAICAR synthetase domain
    matp        836     1336     AIR carboxylase domain
    site       2341     2347     poly-A signal
BASE COUNT      696 a    466 c    527 g    702 t      1 others
ORIGIN      
        1 ggcgctcagt ttctttcctg tgccttaggt gcctgtcgcc gccgccgctg tcgccgccat
       61 ggcccccgcc gcctcagaac tgaaacttgg taaaaaagtt aatgagggta aaacgaaaga
      121 agtgtatgag ctgccggata tcccgggatg cgttctgatg cagtccaagg accaaataac
      181 agcggggaat gccgctagga aggaccggat ggaggggaag gctgccatct ccaacaccac
      241 caccagctgc gtgttccagc tgctgcagga agcaggaatc aaaacggctt ttgtcaggaa
      301 acagagtgac acagctttca tagcagctca ctgtgaaatg atcccaattg aatgggtctg
      361 cagaagaatt gctactggct ctttcctcaa aagaaaccct ggtgtcaaag aaggctataa
      421 gttttaccca cctaaaattg agatgtttta caaggatgat gccaataatg atccacagtg
      481 gtctgaggag cagctaattg aagcaaaatt ctcttttgct ggacttacta ttggcaagac
      541 tgaagtggat attatggctc gttctactca agctattttt gagatcctgg aaaaatcatg
      601 gcagccccaa aactgcactc tggtggacct gaagattgaa tttggtgtta atattttgac
      661 caaagaaatt gttcttgctg atgttattga taatgattca tggagactgt ggccatcggg
      721 agacagaagc cagcagaagg acaaacagtc ctatcgagat ctgaaggaag tgactcctga
      781 agcattgcag atggttaaga gaaactttga atgggttgca gaaagagtag agttgcttct
      841 gaaaacaaag agccaaggta gagttgtggt attgatggga tctacttctg accttggcca
      901 ctgtgagaaa ataaagaagg catgtgcaac ctttggaatt ccttgtgagt taagagtaac
      961 ctccgctcac aaagggccag atgaaactct gaggatcaaa gcagaatatg aaggagatgg
     1021 aatcccgact gtgtttgttg cagtagctgg cagaagcaat ggtttagggc cagtaatgtc
     1081 tggtaacact gcttaccctg ttgtcaactg tcctcccctc tcatctgact ggggtgctca
     1141 ggatgtgtgg tcctctctca gactgcccag tggtcttggc tgtcctacta ctctgtcacc
     1201 tgaaggagct gctcagtttg ctgcccagat ttttggttta aacaaccact tggtatgggc
     1261 caaactgcga tcaaacatgt taaatacatg gatctctttg aagcaggctg acaaaaaatt
     1321 gcgggagtgc accttgtaag tccaaccaac aagtaactcc tcctcgctac acaaagatag
     1381 tggcgtgcat atgcatttgt atcaggattt gcagttggat gatgtgagac atcttttctt
     1441 acagaaggaa gcatcttcag agcagcagct gagtctgcta tgtcctcatt ccttttcttg
     1501 tgtatgtttt tggttttttt agtatagcca cttagtaaaa gatgagaagg taatatttta
     1561 gttatccttc tgttgatcca aaatagtgct gttctgctct ttagtgtata ctgttaacta
     1621 ctgatgatgg ttatgtcaac acaatgattg atttggtgtg atttaaatag aaaccccctt
     1681 ttttttcccc ctgtaacagt aacattcttc catagtaata atctatgtcc tgaagcttat
     1741 taactcaatg ggactgctaa ctaaaactag tgctgatgta gtgctgtaat aatggtatgc
     1801 tttcttttag caataaaatg gcaacatatg ctgtattaat aagaaaatag gctaagaatg
     1861 agtaaaagcc tgaaagaaat aactgcctta tttaaaagaa aaacatatat tttttcatgt
     1921 gtatgtgtgt atgtatatac accccatcag taactttacc acatcatttg agtgagttaa
     1981 cagttctctt gtttgtgaga agctctaatt ttgaatggaa ccataactag gcttgttgcc
     2041 tacttgctgt ataactcttg actggtccag tcttttgtcc ttcaaatagt agacagatct
     2101 actgtgtatg tatgtgatgc tgtcaaatta tccttataca catcaatgta tctatattcc
     2161 ctcactctgt ttctatggga attaaactga actgaaggtg ggaataaaca gtaaagcaac
     2221 agaatacacg ggcaagccta agcatgactg tcttgntctt tgcacacaac attctttaaa
     2281 gtaggtactg attccaagaa atatgcaatt actcaatgta tgtacaattt atgcttttat
     2341 aattaaataa actatttaat gcaaggaatt cgatatcaag cttatcgata cc
//
LOCUS       ECORAFRO     1289 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  E.coli raffinose repressor (rafR) gene, complete cds, and
            alpha-galactosidase (rafA) gene, 5' end.
ACCESSION   M29849
KEYWORDS    alpha-galactosidase; raffinose repressor.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1289)
  AUTHORS   Aslanidis,C. and Schmitt,R.
  TITLE     Regulatory elements of the raffinose operon: Nucleotide sequences
            of operator and repressor genes
  JOURNAL   J. Bacteriol. 172, 2178-2180 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Schmitt, 13-NOV-1989.
FEATURES       from  to/span     description
    pept        166     1176     raffinose repressor
    pept       1270  >  1289     alpha-galactosidase
    binding     156      161     ribosome binding site
    binding    1258     1263     ribosome binding site
    signal      120      126     -35 region
    signal      144      149     -10 region
    signal     1212     1216     -35 region
    signal     1235     1240     -10 region
    site       1196     1213     raf operator
    site       1217     1234     raf operator
BASE COUNT      317 a    283 c    326 g    363 t
ORIGIN      
        1 aagcttatta tgcttccatc ggaaacaatg atctggcagc ctgagttcac agataaaata
       61 tctctccagg aaacccgggg ctgttcatca tgcaagtctg tcgattactg gctttggtga
      121 cggaattttc tggatttccg gcttagaacc acagcaggag ataatatgtc acttaaagcg
      181 attgccacga cactcggtat ttctgtcacc actgtcagtc gggctcttgg aggcttttca
      241 gatgtggctg cttctacccg tgagcgcgtg gaagcggaag cacgtcgacg aggttaccgc
      301 cctaatacac aggcaagaag actcaaaacc ggtaaaaccg atgctatcgg tctggtttat
      361 cctgaaaatg atgtgccgtt taacagcggt gtttttatgg atatggtcag ttgcatcagc
      421 agggaacttg cttatcatga tattgactta ctgctgatcg ctgatgatga gcatgcagac
      481 tgccacagct atatgcggct tgttgaaagt cgcagaattg atgctcttat cattgcacat
      541 actctggatg acgatccccg tatcacacat cttcataaag caggtattcc gtttctggct
      601 cttggacggg taccgcaggg cttgccctgt gcgtggtttg actttgataa tcatgccgga
      661 acctggcagg caacccagaa gctgattgct ttgggacata agagtattgc gctgttgagc
      721 gagaacactt cacattctta tgttattgca agacgtcagg gatggcttga tgcactgcat
      781 gagcatggac tgaaagatcc attgttgcgg ctggtttctc ccacgcgacg agcgggctat
      841 ctggctgtga tggagttaat gtcattaccg gcgccaccaa cagctattat tactgacaat
      901 gacctgagtg gagatggtgc ggctatggcg ctgcagttga gagggcgtct ttcagggaaa
      961 gaagctgtat ctctggttgt atatgatggt ttgcctcagg acagcattat tgagctggat
     1021 gtggctgctg ttattcagtc aacacgaagt ctcgttggtc gtcagatttc tgacatggtg
     1081 tatcagataa tcaatggtgc atcaccagaa tcactgcaga taacctggac accgatattt
     1141 taccctggta gcacggttca ttctccttcc ttctgatttt ttatccagat cacacaaccg
     1201 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg
     1261 agatgtctaa tgatttcaaa gtactgcag
//
LOCUS       HSEDILSTD     351 bp ds-DNA             VRL       28-JUN-1990
DEFINITION  Equine herpesvirus type 1 subfragments from near the left terminus
            of STD DNA.
ACCESSION   M34145
KEYWORDS    defective interfering particle.
SOURCE      Equine herpesvirus type 1 (strain Kentucky A) DNA, clone Sma2.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 351)
  AUTHORS   Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
            O'Callaghan,D.J.
  TITLE     Identification of the site of recombination in the generation of
            the genome of DI particles of Equine herpesvirus type 1
  JOURNAL   Virology 175, 448-455 (1990)
  STANDARD  simple staff_review
BASE COUNT       55 a    117 c    127 g     52 t
ORIGIN      
        1 cgccgctcac gtcacactcc tccccaaaga gggtattcgg ggggtcgctc ggcggctcca
       61 ggaaccgctg ctcgtaccgg tcctcggagt tcagtgcccg cagcgtggcg cgcagcggga
      121 agctctgctc gggcagcagc tgcaggaggg actgccagca ctcgagtgtg gggcgcgtgt
      181 gcgtgtaccc catggcgtaa aaatccagca gcgccctgcg cagggtgcgc gagtcctcgg
      241 tcacgtggat cgaggaggtg gatatccccc tgaacaagcg gttgacgtcc cccgcgaggc
      301 ggttcacatc ggggcgccac ggcggggcgc agaacgcccc tggcccccgg g
//
LOCUS       HSEDISA       633 bp ds-DNA             VRL       28-JUN-1990
DEFINITION  Equine herpesvirus type 1 defective interfering particle DNA.
ACCESSION   M34143
KEYWORDS    defective interfering particle.
SOURCE      Equine herpesvirus type 1 (strain Kentucky A) DNA, clone pHSII.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 633)
  AUTHORS   Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
            O'Callaghan,D.J.
  TITLE     Identification of the site of recombination in the generation of
            the genome of DI particles of Equine herpesvirus type 1
  JOURNAL   Virology 175, 448-455 (1990)
  STANDARD  simple staff_entry
BASE COUNT      124 a    184 c    202 g    123 t
ORIGIN      
        1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac
       61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa
      121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt
      181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa
      241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc acgtcacact
      301 cctccccaaa gagggtattc ggggggtcgc tcggcggctc caggaaccgc tgctcgtacc
      361 ggtcctcgga gttcagtgcc cgcagcgtgg cgcgcagcgg gaagctctgc tcgggcagca
      421 gctgcaggag ggactgccag cactcgagtg tggggcgcgt gtgcgtgtac cccatggcgt
      481 aaaaatccag cagcgccctg cgcagggtgc gcgagtcctc ggtcacgtgg atcgaggagg
      541 tggatatccc cctgaacaag cggttgacgt cccccgcgag gcggttcaca tcggggcgcc
      601 acggcggggc gcagaacgcc cctggccccc ggg
//
LOCUS       HSEDISTD      290 bp ds-DNA             VRL       28-JUN-1990
DEFINITION  Equine herpesvirus type 1 subfragments from the inverted repeat
            of STD DNA.
ACCESSION   M34144
KEYWORDS    defective interfering particle.
SOURCE      Equine herpesvirus type 1 (strain Kentucky A) DNA, clone xbaG..
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 290)
  AUTHORS   Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
            O'Callaghan,D.J.
  TITLE     Identification of the site of recombination in the generation of
            the genome of DI particles of Equine herpesvirus type 1
  JOURNAL   Virology 175, 448-455 (1990)
  STANDARD  simple staff_review
BASE COUNT       69 a     72 c     77 g     72 t
ORIGIN      
        1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac
       61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa
      121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt
      181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa
      241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc
//
LOCUS       HUMAR        1230 bp ss-mRNA            PRI       28-JUN-1990
DEFINITION  Human amphiregulin (AR) mRNA, complete cds.
ACCESSION   M30704
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SOURCE      Human cell line MCF-7 breast carcinoma, cDNA to mRNA, clones
            lambda-AR1 and lambda-AR2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1230)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept        210      968     amphiregulin precursor
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    sigp        210      266     amphiregulin signal peptide
    matp        510      962     amphiregulin (alt.)
    matp        528      962     amphiregulin (alt.)
    mRNA          1     1230     amphiregulin mRNA
BASE COUNT      375 a    273 c    285 g    297 t
ORIGIN      
        1 agacgttcgc acacctgggt gccagcgccc cagaggtccc gggacagccc gaggcgccgc
       61 gcccgccgcc ccgagctccc caagccttcg agagcggcgc acactcccgg tctccactcg
      121 ctcttccaac acccgctcgt tttgcggcag ctcgtgtccc agagaccgag ttgccccaga
      181 gaccgagacg ccgccgctgc gaaggaccaa tgagagcccc gctgctaccg ccggcgccgg
      241 tggtgctgtc gctcttgata ctcggctcag gccattatgc tgctggattg gacctcaatg
      301 acacctactc tgggaagcgt gaaccatttt ctggggacca cagtgctgat ggatttgagg
      361 ttacctcaag aagtgagatg tcttcaggga gtgagatttc ccctgtgagt gaaatgcctt
      421 ctagtagtga accgtcctcg ggagccgact atgactactc agaagagtat gataacgaac
      481 cacaaatacc tggctatatt gtcgatgatt cagtcagagt tgaacaggta gttaagcccc
      541 cccaaaacaa gacggaaagt gaaaatactt cagataaacc caaaagaaag aaaaagggag
      601 gcaaaaatgg aaaaaataga agaaacagaa agaagaaaaa tccatgtaat gcagaatttc
      661 aaaatttctg cattcacgga gaatgcaaat atatagagca cctggaagca gtaacatgca
      721 aatgtcagca agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca
      781 tgattgacag tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg
      841 tgatcctcac agctgttgct gttattacag tccagcttag aagacaatac gtcaggaaat
      901 atgaaggaga agctgaggaa cgaaagaaac ttcgacaaga gaatggaaat gtacatgcta
      961 tagcataact gaagataaaa ttacaggata tcacattgga gtcactgcca agtcatagcc
     1021 ataaatgatg agtcggtcct ctttccagtg gatcataaga caatggaccc tttttgttat
     1081 gatggtttta aactttcaat tgtcactttt tatgctattt ctgtatataa aggtgcacga
     1141 aggtaaaaag tattttttca agttgtaaat aatttattta atatttaatg gaagtgtatt
     1201 tattttacag ctcattaaac ttttttaacc
//
LOCUS       HUMAR1        968 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human amphiregulin (AR) gene, exon 1.
ACCESSION   M30698
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     1 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 968)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept        859  +   919     amphiregulin precursor, exon 1
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    sigp        859      915     amphiregulin signal peptide
    pre-msg     649  >   968     amphiregulin mRNA and introns
    IVS         920  >   968     amphiregulin intron A
BASE COUNT      206 a    280 c    257 g    225 t
ORIGIN      
        1 gaattcatat ccacctggct ttgaacatta tcggctgtga gatggtgtag gtaaaatttt
       61 aagtgcataa tttggcaata ataaatcatc aataaatatt aatgttgatg aggcccctgg
      121 gccacataaa gaaataggga gtgaggggat ttgaaattct ggccacttca cagaaatggg
      181 tgggaagggg ctcttgattg agatagaagc ccatcctaca tgaagcaatt cctcattgag
      241 ttctctcgtc ctttatcctt gttggaaaca tcaggcaaag tcactcttgg tcttaaagta
      301 cttttacatc taaatacgga actcttctat ttaatccctg tctgttgtag atgttaagta
      361 tacaaagagg ttgtcagagt ttgaaacatc tggacttctg tcaggtacta gctccggaac
      421 tccagtcctg ctcgccctca aaaacggctt gcagctagag gtttaagttc cacttcctct
      481 cagcgaatcc ttacgcacga gggaggcggg gcgtgtgtcc tccgcgcgtg gttttcgggt
      541 agcaccttct ggggcgccgc ctgcctccac ccacggccgg gccttgacgt catgggctgc
      601 ggccccctcc cggctgagcc tataaagcgg caggtgcgcg ccgccctaca gacgttcgca
      661 cacctgggtg ccagcgcccc agaggtcccg ggacagcccg aggcgccgcg cccgccgccc
      721 cgagctcccc aagccttcga gagcggcgca cactcccggt ctccactcgc tcttccaaca
      781 cccgctcgtt ttgcggcagc tcgtgtccca gagaccgagt tgccccagag accgagacgc
      841 cgccgctgcg aaggaccaat gagagccccg ctgctaccgc cggcgccggt ggtgctgtcg
      901 ctcttgatac tcggctcagg tgaggattca acggcgctga actgctgggc tctcctccca
      961 tggcaggt
//
LOCUS       HUMAR2        348 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human amphiregulin (AR) gene, exon 2.
ACCESSION   M30699
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     2 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept    +    51  +   299     amphiregulin precursor, exon 2
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    matp        290  +   299     amphiregulin
    pre-msg <     1  >   348     amphiregulin mRNA and introns
    IVS     <     1       50     amphiregulin intron A
    IVS         300  >   348     amphiregulin intron B
BASE COUNT       88 a     75 c     86 g     99 t
ORIGIN      
        1 agcaccctac tttacctttt cgttttcttc ctttattccc tcccctgcag gccattatgc
       61 tgctggattg gacctcaatg acacctactc tgggaagcgt gaaccatttt ctggggacca
      121 cagtgctgat ggatttgagg ttacctcaag aagtgagatg tcttcaggga gtgagatttc
      181 ccctgtgagt gaaatgcctt ctagtagtga accgtcctcg ggagccgact atgactactc
      241 agaagagtat gataacgaac cacaaatacc tggctatatt gtcgatgatt cagtcagagg
      301 tgagtagggg ataaagcaaa aatatggcct gtgagatgtg ggtttata
//
LOCUS       HUMAR3        302 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human amphiregulin (AR) gene, exon 3.
ACCESSION   M30700
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     3 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 302)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept    +    52  +   252     amphiregulin precursor, exon 3
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    matp    +    52  +   252     amphiregulin
    matp         60  +   252     amphiregulin (alt.)
    pre-msg <     1  >   302     amphiregulin mRNA and introns
    IVS     <     1       51     amphiregulin intron B
    IVS         253  >   302     amphiregulin intron C
BASE COUNT      125 a     46 c     54 g     77 t
ORIGIN      
        1 aattatattc aagtttgaga gactcttgtc aataaatctt ttctttttta gttgaacagg
       61 tagttaagcc cccccaaaac aagacggaaa gtgaaaatac ttcagataaa cccaaaagaa
      121 agaaaaaggg aggcaaaaat ggaaaaaata gaagaaacag aaagaagaaa aatccatgta
      181 atgcagaatt tcaaaatttc tgcattcacg gagaatgcaa atatatagag cacctggaag
      241 cagtaacatg cagtaagttt tcctaaagca tatagatttt tgtatttcta gcaccatgtc
      301 tg
//
LOCUS       HUMAR4        254 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human amphiregulin (AR) gene, exon 4.
ACCESSION   M30701
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     4 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 254)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept    +    51  +   204     amphiregulin precursor, exon 4
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    matp    +    51       91     amphiregulin (both alt.)
    pre-msg <     1  >   254     amphiregulin mRNA and introns
    IVS     <     1       50     amphiregulin intron C
    IVS         205  >   254     amphiregulin intron D
BASE COUNT       81 a     42 c     53 g     78 t
ORIGIN      
        1 cacaccgcac gtgagtgtga ttataatttt taaatgtgaa ttgcttgcag aatgtcagca
       61 agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca tgattgacag
      121 tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg tgatcctcac
      181 agctgttgct gttattacag tccagtaagt atgacataac ttacaaattc ttaataaaat
      241 aatgggaggt taat
//
LOCUS       HUMAR5        217 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human amphiregulin (AR) gene, exon 5.
ACCESSION   M30702
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     5 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 217)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept    +    50      143     amphiregulin precursor, exon 5
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    pre-msg <     1  >   217     amphiregulin mRNA and introns
    IVS     <     1       49     amphiregulin intron D
BASE COUNT       89 a     26 c     41 g     61 t
ORIGIN      
        1 tatagatgaa tagaaccttg ataacattag aatgccttgt tctctgaagg cttagaagac
       61 aatacgtcag gaaatatgaa ggagaagctg aggaacgaaa gaaacttcga caagagaatg
      121 gaaatgtaca tgctatagca taactgaaga taaaattaca ggtttgagtt ttaaaatata
      181 tctttagatc atatcctata attttgaaaa atttaac
//
LOCUS       HUMAR6        354 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human amphiregulin (AR) gene, 3' flanking region.
ACCESSION   M30703
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     6 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pre-msg <     1      294     amphiregulon mRNA and intron
BASE COUNT      104 a     48 c     52 g    150 t
ORIGIN      
        1 gtaacatttt gttttatttt attattttat tttattttat tttctcacag gatatcacat
       61 tggagtcact gccaagtcat agccataaat gatgagtcgg tcctctttcc agtggatcat
      121 aagacaatgg accctttttg ttatgatggt tttaaacttt caattgtcac tttttatgct
      181 atttctgtat ataaaggtgc acgaaggtaa aaagtatttt ttcaagttgt aaataattta
      241 tttaatattt aatggaagtg tatttatttt acagctcatt aaactttttt aaccaaacaa
      301 attgagagtt tgaatattag ttctgatatt gcaagactcc agtgtacttt tctc
//
LOCUS       HUMROSSAA    1890 bp ss-mRNA            PRI       28-JUN-1990
DEFINITION  Human Ro ribonucleoprotein autoantigen (Ro/SS-A), complete cds.
ACCESSION   M32294
KEYWORDS    Ro ribonucleoprotein autoantigen.
SOURCE      Human T-cell line Jurkat, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1890)
  AUTHORS   McCauliffe,D.P., Lux,F.A., Lieu,T.-S., Sanz,I., Hanke,J.,
            Newkirk,M.M., Bachinski,L.L., Itoh,Y., Siciliano,M.J., Reichlin,M.,
            Sontheimer,R.D. and Capra,J.D.
  TITLE     Molecular cloning, expression and Chromosome 19 localization of a
            human Ro/SS-A auto antigen
  JOURNAL   J. Clin. Invest. 85, 1379-1391 (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            D.P.McCauliffe, 23-FEB-1990.
FEATURES       from  to/span     description
    pept         67     1320     Ro ribonucleoprotein autoantigen (Ro/SS-A)
                                 precursor
    sigp         67      117     Ro ribonucleoprotein autoantigen signal
                                 peptide
    matp        118     1317     Ro ribonucleoprotein autoantigen
    site       1881     1886     poly-A signal (put.)
BASE COUNT      464 a    497 c    520 g    409 t
ORIGIN      Chromosome 19.
        1 ccgtactgca gagccgctgc cggagggtcg ttttaaaggg ccgcgttgcc gccccctcgg
       61 cccgccatgc tgctatccgt gccgctgctg ctcggcctcc tcggcctggc cgtcgccgag
      121 cccgccgtct acttcaagga gcagtttctg gacggagacg ggtggacttc ccgctggatc
      181 gaatccaaac acaagtcaga ttttggcaaa ttcgttctca gttccggcaa gttctacggt
      241 gacgaggaga aagataaagg tttgcagaca agccaggatg cacgctttta tgctctgtcg
      301 gccagtttcg agcctttcag caacaaaggc cagacgctgg tggtgcagtt cacggtgaaa
      361 catgagcaga acatcgactg tgggggcggc tatgtgaagc tgtttcctaa tagtttggac
      421 cagacagaca tgcacggaga ctcagaatac aacatcatgt ttggtcccga catctgtggc
      481 cctggcacca agaaggttca tgtcatcttc aactacaagg gcaagaacgt gctgatcaac
      541 aaggacatcc gttgcaagga tgatgagttt acacacctgt acacactgat tgtgcggcca
      601 gacaacacct atgaggtgaa gattgacaac agccaggtgg agtccggctc cttggaagac
      661 gattgggact tcctgccacc caagaagata aaggatcctg atgcttcaaa accggaagac
      721 tgggatgagc gggccaagat cgatgatccc acagactcca agcctgagga ctgggacaag
      781 cccgagcata tccctgaccc tgatgctaag aagcccgagg actgggatga agagatggac
      841 ggagagtggg aacccccagt gattcagaac cctgagtaca agggtgagtg gaagccccgg
      901 cagatcgaca acccagatta caagggcact tggatccacc cagaaattga caaccccgag
      961 tattctcccg atcccagtat ctatgcctat gataactttg gcgtgctggg cctggacctc
     1021 tggcaggtca agtctggcac catctttgac aacttcctca tcaccaacga tgaggcatac
     1081 gctgaggagt ttggcaacga gacgtggggc gtaacaaagg cagcagagaa acaaatgaag
     1141 gacaaacagg acgaggagca gaggcttaag gaggaggaag aagacaagaa acgcaaagag
     1201 gaggaggagg cagaggacaa ggaggatgat gaggacaaag atgaggatga ggaggatgag
     1261 gaggacaagg aggaagatga ggaggaagat gtccccggcc aggccaagga cgagctgtag
     1321 agaggcctgc ctccagggct ggactgaggc ctgagcgctc ctgccgcaga gcttgccgcg
     1381 ccaaataatg tctctgtgag actcgagaac tttcattttt ttccaggctg gttcggattt
     1441 ggggtggatt ttggttttgt tcccctcctc cactctcccc caccccctcc ccgccctttt
     1501 tttttttttt tttaaactgg tattttatcc tttgattctc cttcagccct cacccctggt
     1561 tctcatcttt cttgatcaac atcttttctt gcctctgtgc cccttctctc atctcttagc
     1621 tcccctccaa cctggggggc agtggtgtgg agaagccaca ggcctgagat ttcatctgct
     1681 ctccttcctg gagcccagag gagggcagca gaagggggtg gtgtctccaa ccccccagca
     1741 ctgaggaaga acggggctct tctcatttca cccctccctt tctcccctgc ccccaggact
     1801 gggccacttc tgggtggggc agtgggtccc agattggctc acactgagaa tgtaagaact
     1861 acaaacaaaa tttctattaa attaaatttt
//
LOCUS       PRD1XII      1701 bp ds-DNA             PHG       28-JUN-1990
DEFINITION  Bacteriophage PRD1 early protein genes XII and XIX, complete cds.
ACCESSION   M30146
KEYWORDS    early protein XII; early protein XIX.
SOURCE      Bacteriophage PRD1 (Viradea strain) DNA.
  ORGANISM  Bacteriophage PRD1
            Viridae; ds-DNA nonenveloped viruses; Tectiviridae.
REFERENCE   1  (bases 1 to 1701)
  AUTHORS   Gerendasy,D. and Ito,J.
  TITLE     Nucleotide sequence and transcription of the right early region of
            bacteriophage PRD1
  JOURNAL   J. Bacteriol. 172, 1889-1898 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Gerendasy, 27-NOV-1989.
FEATURES       from  to/span     description
    pept        242      724     early protein XII
    pept        797      955     early protein XIX (put.)
    mRNA        179      732     early protein XII mRNA
    mRNA        181     1100     early protein XIX mRNA
    binding     230      233     ribosome binding site
    binding     785      788     ribosome binding site
    rpt           1      111     inverted terminal repeat
    site        145      172     right early promoter
BASE COUNT      419 a    447 c    390 g    445 t
ORIGIN      
        1 gggggatacg tgcccctccc cacctacccg cgcccctaac atttttattt ccgtctgtca
       61 ataccccctg catccgatag gcccgaacta tcacaaacgg aaaagcgata ggaataaaat
      121 tttgatttat tttcaccata actattgcac gtcctaaaaa tgtgcgtata atttcaatca
      181 tcgaaagggc aaacacaaaa cccccgccga taacttccac ttaactttaa aggtaactat
      241 catggaaatc gtaagcaagc tgactctgaa aaccattggc gctcaaccca agccacatag
      301 cgtaaaagaa aataccgcgc tggcttccat ctatggccgc gttcgcggta agaaagttgg
      361 tcaatccacc tttggcgact tcatcaagtt tgaaggtgaa tttgaaggcg tgaatatcgc
      421 cactggtgaa gtgttccgtt ccggtgcgct gattctgccg aaggtactgg aaagcctgct
      481 tgccggtgcc gtggatggtg aaaacacggt tgattttgcg gttgaaattt gggccaagcc
      541 ttccgaaaag ggcaacactg gttatgaata cggtgtcaag ccgctgattg aacccgccgc
      601 atcggatgaa ctggccgcgc ttcgcaatca ggttaaggcc gcgctgcctg cccctgccgc
      661 tgccggtgaa gccgctgccg aagccaagcc cgccgccaag gccaaggcca aggccgaagc
      721 ctaaacagcg caccacggcc ccggctgata ccagcgcggg gccattcttt acccttcaaa
      781 ttcaaaggta acaaacatgg aaaagcaaac cgaaaatacc cgccctgaat gcccgaaggc
      841 attttatttt gtatccattc ccggcgactt tgggcaaacc ccttttgctt catcgcttat
      901 gtatggtagc accgcgcttg ccgctgttat caagtcaagg gcgctatccg cgtagtaagc
      961 gaacattttg atttgcggtt tgctgataac ggttttacgc ctgctggcgt tacgcaagct
     1021 gaatggcttg gcaagctgat tactgaaact ttcggctttc gcttggaact gtttctttaa
     1081 accagttttt tatttcttga atttgaaggc cgcatgattc ccttgcggcc ttttcttttg
     1141 cgtatgcttc caccaaatcc cgccacgtta taaacggata aacaggcgcg gcgcattcag
     1201 ttagcaggct tgacggcgga accaaataaa ccgtttcaat tttcggggct gttgtcgaac
     1261 aagccgcgaa catcggcagg cataggacta tcaagaattt gtttagtttc atcgcttacc
     1321 cttcccaatc cgctaattgt tcggctaagt ttgccgctga taattgtctg gccttcctga
     1381 attgccgcga ttgctttcgc atcgcttgcg cgtactgatt gaagggtttc caccgccttg
     1441 gcgtttgcgg tcgccttttc ttcaatggct ttaatttcat tttgaagccc cgttattttt
     1501 ttagccgcta aaattgaata tccaaagtta gcaagcatta aaagcgcaac cgcaatagcg
     1561 gccacgcttt tcaggttaga caaaacccat ttaatcattg aaggtacgcc ccggcttgtc
     1621 attgtcggaa tctatcggcc ccggctgatt gataggggca atcttgcttt gcacaaattg
     1681 cgagaaaacc gccttgacaa a
//
LOCUS       PSEALALPQ    3141 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  P.aeruginosa exopolysaccharide alginate regulatory protein (algP
            and algQ) genes, complete cds.
ACCESSION   M32077
KEYWORDS    alginate regulatory protein P; alginate regulatory protein Q.
SOURCE      P.aeruginosa (strain PAO, isolate PA02003) DNA, from patient
            PA02003.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 3141)
  AUTHORS   Konyecsni,W.M. and Deretic,V.
  TITLE     DNA sequence and expression analysis of algP and algQ, components
            of the multigene system transcriptionally regulating mucoidy in
            Pseudomonas aeruginosa: algP contains multiple direct repeats
  JOURNAL   J. Bacteriol. 172, 2511-2520 (1990)
  STANDARD  full staff_review
COMMENT     draft entry and computer-readable sequence [1] kindly submitted by
            V.Deretic, 16-FEB-1990.
FEATURES       from  to/span     description
    pept        781     1263     alginate regulatory protein Q
    pept       2079     3137     alginate regulatory protein P (put.)
    mRNA        661     1310     algQ mRNA
    mRNA       2054     3141     algP mRNA
BASE COUNT      607 a   1026 c   1009 g    499 t
ORIGIN      1 bp upstream of BamHI site; map position 9 min.
        1 ggatccgcca tgcccctggc cagcccccgt cagcttttcc ttctcgcgtt cctggcctgc
       61 gtcgccatca tgggcggggc gctgtacctg gaacatgtgg ttggcctgga ggcctgcccg
      121 ctgtgcgtcg tgcagcggat cttcttcatc ctgatcggcc tgacctgcct tgctggcgcg
      181 atccaggggc ccggcctgcg tgggcggcgt atctactccg tgctggtgtt cctgctcgct
      241 ctcggcggcg gggccacggc cgcccgccag gtatggttgc agaccgttcc gctggaccaa
      301 ctgccggcct gcctgcccag cctcgactac atgatgcagg cgcttccctt ccaggaagtg
      361 atccgcctgg tcctgcatgg caccgcggat tgtgcccagg tgagctggac gctattcacc
      421 ctgagcattc ccgaatggag cctgctggcg ttcgttgcct atctcggctt ctccatcgtg
      481 cagttcctcc gacgtgcctg atgcgggctg tcgcgtggtc tttcgtcagc tctcgagcgt
      541 cggccgtatc gatttggcga aaaccagagc ggatgcggga taccgatcaa acatccgtat
      601 gaatttttct gtacgggcgt cttgatgggg tcatcccggg ggcataatct ggctttccat
      661 agcgccggag tccgcccctt gcacaggcgt tctcccacag agcgcagaga gtgcgtcagc
      721 aatgcgtctc ggagcggacg atgattctcc gggacgtccg aagagggaag cgaggacaac
      781 atgctcgaaa gctgccgtaa tgcccaagag cggtgggggg gtgtgcacca actgatcgat
      841 cgctggctgc atgagcgcca gcaactggtg caggccttcg atgccctgag cggtattcag
      901 gcaccggcgc ccaacgccga agaactccag catttctgcc aattgctgct ggactatgta
      961 tctgccggac acttcgaggt ctacgagcaa ctgacggcgg aaggcaaggc cttcggcgat
     1021 cagcgcggcc tggagctggc caagcagatc ttcccccggc tggaagccat caccgaatcc
     1081 gcgctgaact tcaacgaccg ctgcgacaac ggcgattgcc gtgaaggagc ctgcctcatc
     1141 gcggagctga aggtcctgcg gcaacagttg cacgaacgct tcgagctgga ggactgcctc
     1201 atcgaagtcc tgcacaacgc ccacagccag agcggcgcag agggttcggc agtaccggtc
     1261 tgattcaccg gcgagtcgtt acgaaacggc gcgtctccat gcgggagagc gccgtttttg
     1321 catttcagcg gaagcccagc aggtcgatct cgaagaccag cggagcgtcg ggcgggatca
     1381 ggtcgccagc gccttcgtga ccataggctt gcgccgaagg aatcaccact cgccagcgcg
     1441 cgccgacagg catcgctcgc aacgctgtcc gccatccctc tatcacactg tccagggcga
     1501 accactcggc gctttcgctc tggtcgaaca cttgaccgtc ggcaagcagg ccgcggtaac
     1561 ggacgtgtac ctgggttgcc gcgccgatgc cgttaccttg gccacgacgt aattcgctga
     1621 cgaggacgcc gccggtcaat tcgcggacgc cgaagcgggc tttttcgttg gccaggaagc
     1681 gggcctcggc gggtgtggtc cgcggtgttt ccgtcgtggc gttctcgtgt tgcagcagaa
     1741 gttgttcgat gcgttcgggg gggatttcaa gtgcctcgcc gcgataggcc tggcgcagtc
     1801 cgagaagtag ctcggatagc tctaggccgg gcatttcctg ctgtaggcgc atacccaggc
     1861 gggcgcctac ggcataggcc agttcgtcct tgggaggcgc ggcgaaaagc ggcagcgcag
     1921 tagaaaaaga ccgatgaaaa ggcggcgcgg catgcggttt ctccaaacga atcggcgggc
     1981 aatcctggcg cacgtcgctt gtagcacatc gaacccgttg gcgagagggg gtgtgcgggt
     2041 ctagtatggg cgcaaccacg tccgcctgga ggcacgtcat gtcggccaac aagaagcccg
     2101 tcaccacccc cttgcacctg ttgcagcaac tttcccacag ccttgtcgag cacctggaag
     2161 atgcgtgcaa acaagcgctg gtcgattcgg aaaagctcct ggccaaactt gaaaagcaac
     2221 gtggcaaagc ccaggaaaag ctgcacaagg ctcgcaccaa gctgcaggat gctgccaagg
     2281 ccggcaagac caaggcacag gccaaggcgc gcgagaccat cagcgacctg gaagaggcgt
     2341 tggataccct gaaggcccgg caggcggaca cccgtaccta catcgtcggc ctcaagcgtg
     2401 acgtacagga aagcctcaag ctggcgcagg gtgtcggcaa ggtgaaggaa gctgctggca
     2461 aggctctgga gagccgcaag gcgaaacccg cgaccaaacc tgctgcgaag gcggcagcca
     2521 agcctgcggt gaaaaccgta gcggcgaagc ctgcggccaa gccggctgcg aagcctgctg
     2581 cgaaaccggc ggccaagcct gcgacgaaaa ccgcggcagg caagcccgca gccaagccga
     2641 cggcgaagcc tgctgcgaaa ccggcggcca agcccgcggc gaaaaccgca gccgcgaagc
     2701 ccgcagccaa gccggcggcg aagcctgtgg cgaaaccggc ggccaagcct gcggcgaaaa
     2761 ccgcagccgc gaagcccgcc gccaagccgg cagcgaagcc tgtcgcgaaa ccgacggcca
     2821 agcccgcggc gaaaaccgca gccgcgaagc ccgcagccaa gccagctgcg aagcctgcgg
     2881 cgaaaccggc ggccaagcct gtggcgaaat ccgcggccgc gaagcctgca gccaagccgg
     2941 ctgcgaagcc tgcggcgaaa ccggcggcca agcctgcggc gaaacccgta gccgcgaaac
     3001 ctgccgcaac caagcccgcc accgctcctg ctgcgaagcc tgcggcgact cccagcgccc
     3061 cggcagccgc ctccagcgct gcttcggcaa cgcctgccgc gggcagcaac ggcgccgccc
     3121 cgaccagcgc ctcctaaggc g
//
LOCUS       STYAROF      1235 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  S.typhimurium tyrosine sensitive 3-deoxy-d-arabino-heptulosonate
            7-phosphate synthase (aroF) gene, complete cds.
ACCESSION   M31302
KEYWORDS    3-deoxy-d-arabino-heptulosonate 7-phosphate synthase.
SOURCE      S.typhimurium (strain LT2) DNA, clone pGM59.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1235)
  AUTHORS   Muday,G.K. and Herrmann,K.M.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 165 to 1235)
  AUTHORS   Muday,G.K. and Herrmann,K.M.
  TITLE     Regulation of the Salmonella typhimurium aroF gene in Escherichia
            coli
  JOURNAL   J. Bacteriol. 172, 2259-2266 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.M.Herrmann, 13-JAN-1990.
            
              Author address: K.M.Herrmann
                Purdue University
                Biochemistry department
                West Lafayette IN, 47907
                (317) 494-1653
            
FEATURES       from  to/span     description
    pept        165     1235     3-deoxy-d-arabino-heptulosonate 7-phosphate
                                 synthase (E.C. 4.1.2.15)
    site          1       20     operator aroFo2
    site         52       71     operator aroFo1
    site         69      113     promoter
    site         75       94     TyrR box-1
BASE COUNT      309 a    286 c    343 g    297 t
ORIGIN      
        1 gtgtaaagtt tttgatacga aatgagggag attaatcgac gattgcgcat ggtgtaaagt
       61 ttattttacg aaatatggat tgaaatcttt actttttgtg gtatggttta aacatcctcg
      121 tggaggatca actatcgcaa acaagaaaag acaggatcgc catcatgcaa aaagacgcgc
      181 tgaataacgt acgtatcacc gatgaacagg tattaatgac gccggagcag cttaaagcgg
      241 cctttccctt gagcctggcg caggaagccc agatagcgca gtcccgggga atcatttctg
      301 acattattgc cgggcgcgat ccgcgtctct tggtggtatg cggtccttgt tctattcacg
      361 atcctgaaac cgctctggaa tatgcccgtc gatttaaagc ccttgccgca gaggtcagcg
      421 atagcctcta tctggtaatg cgcgtctatt ttgaaaagcc gcgaactacc gtcggctgga
      481 aagggctgat taacgatcct cacatggatg gctcatttga tgtggaagcc gggttgaaaa
      541 tagcgcgtca gctactggtg gaactggtga atatggggtt gccattggcg accgaagcgt
      601 tggatccgaa cagcccgcaa tacctgggcg atctgtttag ctggtcggcg ataggggcgc
      661 gcacaaccga atcgcaaacc caccgcgaaa tggcgtctgg tctttctatg ccggtcgcgt
      721 ttaaaaacgg cacggatggc agcctggcga cagcgattaa cgccatgcgc gccgctgcgc
      781 aacctcatcg ttttgttggc attaaccagg ccggtcaggt tgcgttattg caaacccagg
      841 gaaatccgca tggccatgtg attctgcgtg gcggcaaagc gccgaactat agcccggcag
      901 atgtcgctca gtgtgaaaaa gagatggaac aggcgggact acgtccttcg ctgatggtag
      961 attgcagtca tggtaactcc aataaagatt atcgccgcca gccagccgtt gccgaatctg
     1021 tggttgcgca gattaaagat ggcaatcgtt caatcattgg cttaatgatt gaaagtaata
     1081 ttcatgaggg taaccagtct tccgaacagc cgcgcagcga aatgaagtat ggcgtttccg
     1141 tcaccgatgc ttgtattagc tgggagatga ccgatgccct gttacgtgaa attcataaag
     1201 atttgagcgg ccagctggcg gtgcgcgtcg cataa
//
LOCUS       TRBANTAT    18598 bp ds-DNA             INV       28-JUN-1990
DEFINITION  T.brucei expression site variant specific antigen 4 - 7 genes,
            complete cds.
ACCESSION   M33720
KEYWORDS    expression site variant specific antigen.
SOURCE      T.brucei (strain EATRO 1125), cDNA to mRNA, clone AnTat 1.3A.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 18598)
  AUTHORS   Pays,E., Tebabi,P., Pays,A., Coquelet,H., Revelard,P.,
            Salmon,D. and Steinert,M.
  TITLE     The genes and transcripts of an antigen gene expression site from
            T.brucei
  JOURNAL   Cell 57, 835-845 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Pays, 20-JAN-1989.
FEATURES       from  to/span     description
    pept       1174     2196     expression site variant specific antigen 7
    pept       3528     4733     expression site variant specific antigen 6
    pept       8519     9772     expression site variant specific antigen 5
    pept      14450    18157     expression site variant specific antigen 4
    mRNA        986     2676     ESAG-7 mRNA
    mRNA       3483     5056     ESAG-6 mRNA
    mRNA       8260    11304     ESAG-5 mRNA
    trns       5305     5808     rime retroposon
BASE COUNT     5298 a   3468 c   4707 g   5125 t
ORIGIN      
        1 gcatgcgaag agcaggggtg caacggaaga gtgaaaggat tagaggaaat agtggaatta
       61 gcatgattgg agattgtgag ggttaggaaa aacgtgacga gggaaatggt aactggacat
      121 cgtgcgcgag gatggcgaag gcttgaagcg tgtagagagc gtgtgtaaat ggcccaaacg
      181 gtgtgggtgg ggtaataacg cagagtgctg aggtgtttga caagttcccg cgaggtgctg
      241 gggaggagga agcaagacag taaaatcgtc gtgagacgtg gctgtgctcg ggaatccgta
      301 caaatatact gaaaattatt gatggttgat atgcgtggga acgacgcagg atgatgctgc
      361 atcgccaaga tgcttcggga gaacaggaaa ataacgacga atcaacactg gtgagttgat
      421 actataaaaa tcgcgtgggc tttgctgctt cgctttcata aatgccacct gcaatgaatt
      481 tggagggtgg aggtgagctg ccctgtgtat gggtgggtat tgcagatgca gcggcgcttt
      541 tgggtgattt ttggtttaca gtttgctgtt gttgggaagg ttggtggtgg caacaaaatt
      601 ggagaaaatg atttcatctg gaaacctact tcaaaacccc ataaacaatc ggacctcgaa
      661 gtgcaagtgt ctcccagaac gtaaacaagg ccttgaagac gaccagcaga tgaagcacag
      721 caacagcgac agaggtggac tgagacgaaa gtataccata tgactatgcc acaagaattg
      781 atgggtgatt ctgccaacca cggacgcagt caaaaacata gaaggagctg tagtgaatat
      841 aaaaccactc atattttcta caaatattag agctttatta cttaagaacc tgttttagca
      901 ccaaggttgc gtgtgtattg ccgttgctgt tacttattgt tgagttggta tttgaggaaa
      961 atgttttttc aggtgcgtcg ttgttatata cggctgtgag ttggtgatgt ttatttgaat
     1021 gatagcttga taggataaga tgaaataata ctaagcatgt attcttctgg cgtgtcgccg
     1081 aaatggtaat ctatgttacc catagcacga aaaaattttt ttactcagta acgaagcgga
     1141 aagttttaaa aagcataatt taaaaaagtg ataatgatga agttttggtt tgtgctgttg
     1201 gctcttttgg gaaaagaaac acatgcatat tatgaaaata aaaggaatgc attaaatgca
     1261 accgccgcta ataaagtgtg tgggctatcg acctatctta aaggaatagc gcacagagta
     1321 aacagcgaga gtgccgtggt tacggaaaaa ctatcagatt tgaaaatgag aagcatccag
     1381 ttgcagttat cagtaatgcg gaacagagtg ccttctggcg agcaggattg taaagacatc
     1441 aggacactct tgaaaacagt attgaggaat gagtttacat tccagcagga gttggaggaa
     1501 atgaggaacg catccgcgtt agcagcagct gcagctggga tagcagctgg aagactggaa
     1561 gagtggattt ttgtatttgc acaggcagca ggcgggtcat cacagttttg cataagtgtg
     1621 gggaagcata tcccggctga gcacggcaac ttgcaagaat gttttgatgg aataattgga
     1681 cctgaaaccc tttacaaaat tgaggattca cgtgtaaaag agtcggcgca gaaaagcttg
     1741 caactccatg aagtgttatc atccatttcc ttcagtagct tgggtgcgga aagtattgtt
     1801 gagcaaggag agaacagagg atgtaaccta atgcggacgg ctgacggagg tttgctgaaa
     1861 gatatttgtt tgaattgcaa cttcacatgg ggtggtggag tgatgaattt cggatcctgt
     1921 gtggcgggga acctaaaaat aaaaggggga gaatacggtg atgtcagttc ccacgatgtg
     1981 gtacggtgga ccgaagatcc cagtaaagtg agcatattta aggatgtcat ccgcctgttt
     2041 gcgcgcttcc aagaagcaaa aaatgcagtg atgaataaaa taaaaactac tgtggatgaa
     2101 ttggcgaaat gcatagggca gaaagaggtt gagctcacta atgatcagct ttacgaggaa
     2161 ttcgaggcaa tacaaaagta cctagggtct ctgtgaaatg gagtaaaagc gaattcaact
     2221 atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg gcagcagtac
     2281 atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga gtgctgtgaa
     2341 ggaaggatgc gacagaagct gcgctgctta gcgtgaaaga ttatggtaat ggagggttgt
     2401 gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg ggcaaaagta
     2461 acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa ggaaagatgt
     2521 gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg agatgtaagg
     2581 ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa tttagttaca
     2641 acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt ctactgttta
     2701 atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc gaatttcact
     2761 tctccagaat ctttatttta gtgcgctata gatgatagtc caaagagaag gaattacaat
     2821 tattataacc catccaaagc tatgtcagag actcctcttc tccgagtgat tcttttctat
     2881 gtcacttgtg gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg aaatggggcg
     2941 tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt atgatgcata
     3001 gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta ggtggaagcg
     3061 gctaatgatt tgtcggcacc accaacagag gaagttaatt gttgacgtag ccgtttgtga
     3121 atgagataga aggttattgg gtggaacatt atcttggaaa gagtttttct tccactaaaa
     3181 atgtgtagag tggtggtttg gagaaaatgg tttagatatt gaggaggaaa caaggatttg
     3241 aggtgcaagt tgatgcgctg taagatttgt attgccgttg ctgttactta ttgttgagtt
     3301 ggtatttgag gaaaatgttt tttcaggtgt gtcgttgtta tatacggctg tgagttggtg
     3361 atgtttattt gaatgatagc ttgataggat aagatgaaat aatactaagc atgtattctt
     3421 ctggcgtgtc gccgaaatgg taatctatgt tacccatagc acgaaaaaat ttttttactc
     3481 agtaacgaag cggaaagttt taaaaagcat aatttaaaaa agtgataatg atgaagtttt
     3541 ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa aataaaagga
     3601 atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgacctat cttaaaggaa
     3661 tagcgcacag agtaaacagc gagagtgccg tggttacgga aaaactatca gatttgaaaa
     3721 tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct ggcgagcagg
     3781 attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt acattccagc
     3841 aggagttgga ggaaatgagg aacgcatccg cgttagcagc agctgcagct gggatagcag
     3901 ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg tcatcacagt
     3961 tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa gaatgttttg
     4021 atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta aaagagtcgg
     4081 cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat agcttgggtg
     4141 cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga acagatactg
     4201 acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt ggtggagtga
     4261 tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa tacggtgatg
     4321 ctacagagta cggtgcagta cggtggaccg aagatcccag taaagtgagc atatttaagg
     4381 atgtcatccg cctgtttgcg cgcttccaag aagcaaaaaa tgaagtgatg aataaaataa
     4441 aaactactgt ggatgaattg gcgaaatgca tagggcagaa agaggttgag ctcactgatg
     4501 atcagcttta cgaggaattt atctgggaga ccatacacag attggagctg tcaaagagag
     4561 tgagtgaaca actctcttta ggtgaagaag aggagacgat cctgaaatct aactatactg
     4621 cagaaccggt ccgtggacct ttcacggtag cggggtccaa cgcggcagca gtacatttga
     4681 gtgtttctac cgctgcactt tgtttttcag ttttattgtt gggagtgctg tgaaggaagg
     4741 atgcgacaga agctgcgctg cttagcgtga aagattatgg taatggaggg ttgtgaaaga
     4801 ttgggggaac aaaaacctat ttcttttatt tgggggaaca aatgggcaaa agtaacgtaa
     4861 gtttccagtg ggagtggtat gtgtgtgtgt atggggctgg ctaaggaaag atgtgagttc
     4921 ggcatgtggt atgtacaagc tacgaaaacg tgtgaaacaa aacgagatgt aaggggaaaa
     4981 tgtaacaacc aactatgtta aatttcagga gactattttt caaatttagt tacaacaaag
     5041 taaatgtcaa ataatgccaa ctataaagga aaactagact aattctactg tttaatgtga
     5101 cgagaattgt ggcgtgaaac ctagtaactg cttatcatgg cgtcgaattt cacttctcca
     5161 gaatctttat tttagtgcgc tatagatgat agtccaaaga gaaggaatta caattattat
     5221 aacccatcca aagctatgtc agagactcct cttctccgag tgattctttt ctatgtctct
     5281 tgtggaaggt tgatattttt gctaccccct ggcgatgccg gccacctcaa cgtggtgcca
     5341 gggtccagtg ccccgtatca ttgggggaag ccaagagcca gcagcgttcc tttcatgggg
     5401 aacactgctg tgctccggct acggcatcat acagcacagg gatcagcagc gtcttgctgg
     5461 gacaccgttt ttcatttgtc ggtccctggg cacgtgccag cgtgccatca gcagtatcat
     5521 ccgcgctaag atgctgctgt ccggtgatgt ggacctccaa aaaaggattg ccatttggca
     5581 tcttttggag agtccagggt gggaggcttc tcgccccatc tgctgtattc cgttcaactg
     5641 cggagctaca acaaaaaatt atagagggtg tgttaggatg aatgaaaaag ggagactctg
     5701 ccacagtcgc cagaccgata gcatctcagg gctctacggt gttggctgat ggccgcgcca
     5761 gaggggggga aactctcacg aaggcacgaa gaaaattcaa aaaaaaattt ttgctaccac
     5821 aggctgtaag gataacgctg gagatggagc cgactgtggt ttttcttgtg aatgcgtatc
     5881 ttcgtaaccg gttgttaaga agggatcgga cgaaggaatg tgtctgagta ggaggtaatg
     5941 tcgatgtttt ctttccattg gatgtagtaa taatctccaa aatttgttgg aaacaatata
     6001 atattattgc aataaatact tgaaaagtta aattattgat ggattgactt gagtatcatg
     6061 attccaatga acactctcat gatgcttaga atagcaaacg tatgaacaat caacgtagcg
     6121 tcatgtagac atggacattt tttgtaatga tacgactgct attgcacatc tgaaaatttc
     6181 tatgacggac cagtgccgtt aatcaccgta caattgaatt caggatgaaa aatgataaat
     6241 cagtgctcta atgtatgatg cagctgtcaa ggcaatgttt ctgcacaaaa agagtttaac
     6301 acactgccat ttgcatgcga ataaaacaat gagtatatga agcggaggat cgcatggcaa
     6361 tgataaaatc gaacactgtg gctgcgagac gcagggccac ccggaagcac cgggtaacaa
     6421 agcgagttca ggagtcgtcg aaagtggcaa aagtcagcac caaaccaagt tgctttttta
     6481 aaaagatcag caaacgtatt tctcactgac ttatacagca ttcaaaggaa atgaacttag
     6541 cagcaggaca agggtgcaaa ctaaggcatc gattctgtat ggctgcaaaa gtgaagacat
     6601 caagcaaaca aaggtaactt atttaccctt ataggagcag tggaaaaagc agcggactcg
     6661 tatggagata acctgacgaa aatactacca gttttagtca caggaagcga tatcctgtca
     6721 aaatgccggc agccactcaa tggcggtcaa aacttgatta aacagttgaa aaaaacagcg
     6781 gaatcaaaag gaaccttgcc cgtgctcacg gtggtagata aagcaacgaa aaatgtaaag
     6841 gtagaattta acggtaacaa ttcgaacaaa accgtactat cgtgcaaaaa tatttcagca
     6901 caaaatcccg acaaactttg cggcgacaac acagcagaca acatggccaa gtgcacgaaa
     6961 gcttccaaag cagcttttgg cgctgatggt gtatatttca aaagaaaaaa gggcaaacct
     7021 tgactcagca gctataataa agtcagctag aaccggcgaa acagcggaag cgctgagcaa
     7081 acttaaaaca ctggacatta tcggtgattt tatattgcgt ttatttgtcg aaaagctcat
     7141 ggacatgacc gtggatgagt acacgactca aagagagact aacgacaaaa aacgctgcag
     7201 tggcaaaagt agcgtgcctg tacaggggac ggagaacttg ataataatgg taatattaat
     7261 cttagaagaa tgttgtgagt atgtgtcact tttacagtga tagttgtgaa atattcggaa
     7321 tgacagttgt aatggaggtg aaagaagctg ttaggttgaa tttggggaac catctatgta
     7381 tacaaagaag cacaagggtt gtgatagctt atttatatga tggagaagcc atgcatggta
     7441 aaaaaaaaca acaagtgata tgtcggtgtt tgtggctaat gtagcataaa gactaacagc
     7501 tacagcagtg catcaaccga tggagagacg ctttcagaat tttcaaatgt aagaaagtca
     7561 gtaaattaca aagaagtggt tgagttgata ctcagaaaag tgggactaac aaactgttga
     7621 tgaatgaagg tgatgccata cgtaccaggt acacaggtca gcaatatgat gctcactgaa
     7681 aaagagagat ggtgtgctac tggcgggatg tcgctggaaa agtcttagca agtgtagttg
     7741 cagcaccgag cacacacact catgggagtt atcaaagtat gaaaacgcaa aggaagctga
     7801 tgtgtgtgtg tgtgtttttg tgtgtgtttg caggaaacaa atgtatcagc ttcgacatct
     7861 ttccgcaaga agtagaagtt tcagttgtgg tcactgacgc attacgtgga ttgttgggtt
     7921 gaaacaacac aaatgtggca accatttagg atgttttggc gctttggcga cggttgtact
     7981 gcagtttgta tgccccttca agaaacaagg aatggtgtga aagcgttgtt tttttttaag
     8041 tcaagtatgt gcattcaaga aatcatatca cagtgttacg cagtgctttg tcgtcatggg
     8101 aggaccttcc tgcattgaga gtgcatgtaa ggaaacggca acaggatgct atggctccga
     8161 atttttttta agtatctgct tttcagacca cttgcgttga tcgacttcag ttatgttaca
     8221 ctccgcacat cttttttgct atgttttttt ctactttagt tgcacttagt gtagccaaca
     8281 ttaacttaaa gtacaaactc tatcctaagc aggattcggt gttatttcta ttcgacgatg
     8341 tcattggtat cgagcgcacc cgttttggtg ttcatttccg ttgtaacact tctgtgttgt
     8401 atcatcggct ctgggcaacg gtctagtagt ccctttacgg aaccggcact caaaccaggg
     8461 ggcattaagg ttgctattca agaggcggct gttacgccat tgttaccagt acttgcgtat
     8521 gagttcgaaa ggtttatgga gaatatcacc attcccgagc aaaagggtta accgtgtgag
     8581 tgtagaagag acttattttc gtaacgtgac ggtcggaagc gcgacagtta aattcgaaga
     8641 accaaataaa atcattttga agttttggaa cgtgagcgct actgtgccgt ttacgcggtt
     8701 tgtataccac tcgttttggt gttacctgta tccttgcagt gggtcggcac aggcggaaat
     8761 acgcaacggc agtgtggcac tgtggctcga cgtgtccgca ggaagggggg gactactaga
     8821 catacatgta gggagttcgg aaattggcat gcgtgaccca ctgataactc ttatcggtga
     8881 agggaaatca aagatgccta agtggcttgg cgggagagtt aaagatatgt atgataagga
     8941 tgtgcttccc aagcttgggc atcatataat tactgcggtg aatcaaattt tagccaacaa
     9001 agcaaaggag atatttcata tgtttccaat tgtctttgtc aacagttcta aaattgaata
     9061 tggacaaatg agacttgaac tggtcgtgct tccagatgcc gcagacaaat taatgcttac
     9121 ggagaatgtt tttttcccgc ggcacgcttt tccaaacttt cctgttgctg ttgtttcttc
     9181 gtttacggct ttgaataata tgcttcgttt gatgattaaa agtggccact tggtggtgcg
     9241 tgtgcccttt ccgctgaggt atgttatctc cagcaacgcg ctcagcgtca gttggaccgt
     9301 cttttgctct ggatgtgcct ctgaggctac attcgaacta aaaacagccc cgtggttgaa
     9361 gtccctaaat aagaaactgt ttacatttaa ctatcgggat gtcgaagttg ccgtggacct
     9421 gctgccaagg ggtggtgccc ccatttcctt gtttagtatg ttaatgaatg tgtctgccca
     9481 ggcggcgcat atcgctcttg ttgacggtgc cacccatgcg aatttggatt ctgttgatac
     9541 taatgtgagt gtcacatcct cacgtatcga tggcctggac tcaagtacaa tgaacacaaa
     9601 aattcgtgat ttgatcaact tgtcttggat caacctgaat gttacatata catttcctgc
     9661 tcccttcgac ttgtgtacga aacacgtgaa tattacaagt gaatgctatg tagcgggatt
     9721 taaccttgtg cgagctcttg gttcactcag tatactccct cacctccgat gaaggattgc
     9781 gacattcctg gtcactttgt aaaacacact tccatgcgct tatgtggaag ggggtaacac
     9841 tcaaacctat cttgctttcc cctgtttacg cacaacttca catacgcact ttctctttta
     9901 cgagatgttt gcatgccgca acaaccaccc tgcgcaatcg acgagaagct atccgtgttt
     9961 acgggtgatc cttcgaatta tggcggagga gtaaaaacct catgatatgt gaagtgctgt
    10021 ggctgaagaa ggaaataaga taaaaagggt atttttcatt gacccgacca ttactggttt
    10081 tgactttttg cccgaggcga gaaggcaaat ccattgttga ttggctcatg aatgtttccg
    10141 tgactatacg cgccaagcct tggatatgtc gtgggctttc ggcatatgtt gataatataa
    10201 cgactgccag ttggctgagt gacggctcaa agtgttagag gcgagtgtca atggaagata
    10261 tatctattga tttggtaaat aatattggat tcgagattaa ctgttcaagt gtgctacaac
    10321 accagatagc atgttcgtag tgtaggggtg gcaatagaag tgatggattt atcacgacta
    10381 attgttctat tggcaacgct tgtgtctcgt gactgctatt ggttgattgc tgatgtttcg
    10441 atttgagttg tttcgtggcg gcacgaaaag tatcatcatg tgaagtaggt taaggggaaa
    10501 agaaacagat atataaagag cctttcgtgt tgctcccaag gaattgtata aatctctgag
    10561 cgagatttgt gctcggtaat gggttcaaga gtgtgtatat acaaatacta tattaagagt
    10621 ggtgattgag accaagtgga gggtcgaaag agatccactt aaagccatgg aaagtgtgcg
    10681 gttgtagagg gtagaaaact aagaaaaggc aaatgataaa tagtattata ggaaatgcag
    10741 agtgatagca agagagaggg agatggagta accaatggtt taaaaacaag catgacgatg
    10801 ggcagtatta tacccataag gatcaaagag ggtcaaaaaa atacaaaatg ggttcagtgc
    10861 acaagagatg tgaagatgct acaactgata cggagaagcc tagaatgtga aaattcagaa
    10921 aataatacta tatgaaaata gtgagagtga caactcacaa tacaaaacaa actgatacgg
    10981 gagaagatta tgctaaacat aaaagtgaaa aaagctagaa tataagctga taaaacacca
    11041 taaactacat tgatttgagt ttcatgatat aaaacgtgga tgctctattc agcagaggat
    11101 tctagtaaat tttttaatat atcttaaagt aatagtgaac gtagaataaa tttctgttcc
    11161 aatacacaga cgagaaaaat aaagaattga acgggtagca aatttaagta acttgcgaca
    11221 atatttttgc tgaagctaac ttctaaaaca aaaatgcaaa tacgtgttca aaaatatagt
    11281 gaggaggata atcttttaat tacaaacgca tacatcatga taatttcaag gcattgtttg
    11341 catatgtaaa cattgcttta ttaactgagt gcagttttct ggtaacctat ttcttttcat
    11401 agttttcttt cataaccctc tctaatattt taaggtatat taggaaatga aatttgttac
    11461 cttgtgaact ttactattac caattatatc acatttagtg tgtttaagtg caatatacct
    11521 tcaagatgaa gaagtaagtg acatgacaaa tatgacacat aatggtacta agaaacccag
    11581 agactttgtg gttgtcgttc aaactcatgc aacacaagga tggtgtaggc tcatattgag
    11641 tgcgaaagta agccatgtgg aaattattac aattgaaatg ggaggtttgt acagtcatgt
    11701 caaaaggcct aaatgggtta tggagttttt agagaaacag aatgcaagtg atgaggatat
    11761 tttaatagct aaggatggat ctgatgttat agttggtgat ggaaataagt atgagagttc
    11821 tgtggaatat tttgtgtaaa aaagcaccct acagtaagga caaattttat gaagaggata
    11881 ttcagaaaaa ggatataggg taatgttatt gatttttgag atggccacaa aagtggagga
    11941 aaagaaaagt ctaccaactc tactaaaatg gcctttaact caaaaagcat acttaaatgg
    12001 aggtggctta attgccagag tttgggcatt tagaaggttt gaatatgcat ttggagagtt
    12061 actaaaggga agtgaggagt ggtggtcaga gcggagtatt tacaaacctt tgttaatatg
    12121 gagtgcaatt caggaagaag ctgttggaca gagatttgtt ttgaagagag gaatgattgg
    12181 attggattac gaagagagat ttttttctat agaaagtagt ggggtaattg gagaagcccc
    12241 attcattcat tttcctggac agcctattgc ttgggaggag aaagcgagac tgattgtgaa
    12301 aaatcttagt tgatataaaa ggttgaaagg aagtgaattt aattcgaagg atatggggaa
    12361 aagtagaaac ttatgtagta gatggagaga attgggaatt tcgatatgaa agaatttgtg
    12421 gggacgcggt caaggagaat gacttgttta aggcaaagaa actgtaaggg atacaaaact
    12481 aaagattttc ttgcggctga gtcagtagtc taagcaaatg tgggaccaac acgcatggcc
    12541 aagatggccg cacaagtcta tgttgacatg aaaatcctgg ggcagcaaca aagagaagct
    12601 ggaagacaaa cctcaggggt accctaacta ttcggctgaa atggtatgtc gtgacggcaa
    12661 tatcgatatt ggtttcgggg gtggtaacac gatggacgca aaatttagaa gcttctatta
    12721 cctctccata aaccatgatc tcaaagctag agaagcacaa aacaataaca caactagtga
    12781 cactatgcgt tttttatgca cctgcacaga agaagtaacc gacgcttgga ccgaagatgc
    12841 gaatacatat tcggcatctg cagtataaaa ggtaattcag ctgctagaag gaaaaaattc
    12901 ctgaagaccc cgctgtgcca gagatatgta aacaactgtt tgaaatacat gcactcaata
    12961 tgtcactatc agacaagacc tggcaggaca agttcataaa ggctcaacga agagccactg
    13021 cagtggggtg acgaagccaa caaacataag accactaata cggagtggca cgaagtgtgg
    13081 cagcgctggg tctaagccag gcaaagacta aaaaataacg aatactgagg gcgataaagt
    13141 aaggaaaaat gccgcctcta aatggcaaag ttcaaaagcg gcacaggccc gagtcaagct
    13201 aaactctttg ctgatccatg cggaagcctt gaacgctgag ctggaaaaaa agggcaaata
    13261 atggctaacg gcaaaaaaca atagaagtgg acgttcttgc agccctctac ggaacgccca
    13321 cggcaaaagc agcgtttggg agcagtggct tattgtcatc atcatttttt tgttgttttc
    13381 gccactagtg gttttctttg tttgcgtatt tgatatctcc ttcgtcacta ttcttatctt
    13441 tatcgtctgg aatatgggtg agatatcatt cccggtgtgt tttgcagtca tatggcgtaa
    13501 catccacctg tgttgatatt ctttacagca tatcttacat gcttctcttt ccaacagtgc
    13561 ttcctgtagt ggtgtgcttg gctccccttt actcttctcc ggtatgcact tatatctggt
    13621 gagccaggtt ttgcttttta tagacctctt ggaagcagag acatttgtat gcttctttta
    13681 ctttctcatg ttcaacttcg ctgcttattt tcctgcattt ccgtttgttg tgtcacttca
    13741 gtttgctttt ctacgatgtt taaggcgtat atgttcctag tcaactagcc atacgtggca
    13801 tactgctttg ggcaacattg ccacttgtag acttttttac cgaattatct tcttctcgct
    13861 tccttcctct tttgacgctc tgtccttact actggagttt taggaggctg ttgtggcatg
    13921 tgccttgtgc ttcttttatg accaacaagt gttattttta cagtcgtttt tgcactggtc
    13981 gtgtgtggcg aaaaatacgc ccccgctcct ccaaccaatt agtggcaaag gggcggctcc
    14041 gttgaggcag cactgcttgt cctcttaatg gtacattgtt gttttaccta tcgctggttc
    14101 actccaccgc acgcggtgca taccttggga gttctgtact gttttggggg tggttttgtg
    14161 cattgtaaca gccgcctctg acacaacaag cacagtttca cgcctgcgtt actggcattt
    14221 tatgcctgtg gtggaacaac gtgtgtgttg acgtccgtaa atcttaatat ggcttgttgc
    14281 gtgctaccgt ttatattttt cccccttcga tatgggcatc ccgttcttgt cgttacgtgg
    14341 ttcccagttt cacctaacaa cccttgccct ctcttaccca gcagccgctc cctttcgttc
    14401 gttccagtgt aacaaccact ttcacgcgta gactatcaaa atcactgcga tgaatatgct
    14461 tcacttgagc gaccgcaatg cctcactcgc accgagtggc ggggaacatt cccttccgac
    14521 gggaggagct gtgtgccgtg acgccatgga catacttccc gtgatactga gagctccggt
    14581 cgcgttgtta cttctgctcg tcgtcctgcc gcaattgtca gtgggcgccg aggctaatgt
    14641 gacagtcaag gttctctccg ccacgtggaa ctggtacatg ccgcggaagt atgtaaccgc
    14701 gattaatgct ggcttcaacg cctcactgaa gtcacgaaac tggacagtcg ctggaagcgt
    14761 caacgttcag gtagtttacc ccagcaacct tgacctgatg cccgaagatt tcattaaaaa
    14821 gcagttggaa ttggaaactg accagaataa gattgttata gtgtatggac ccctcggtga
    14881 caaaagtgta atgcactcga ttccacattt gatgaatcac agagtagtgg cctttgggtt
    14941 gataactggg tccactttta tccgacaatg gaatccgtac ctttactttc tgcgtgccga
    15001 ccccgctgct gaaacattgg ttctcattcg gtattcgcta tgccagttgc gcgtgctgcg
    15061 cttgggcttc atgtaccttc agggtgttca ttatggcgac gaggagtatg cattgacagt
    15121 gaacgtgatg tctcggatgg gttatgaact gcatggtgtg ttcaccgtga tgagtccaga
    15181 cggtaaaccc gctcctgatg ctgagtttaa agaggtgttt gagcgcttcg ccactgccct
    15241 tcctcaggcc attattgtct ttggtgcgcc agtggatgat accgcgaagt tcttgatgat
    15301 gatggcagtg gatgagcgaa tcgctcggtc gtatattctt agtccttcat ctgtccagtt
    15361 gtcactcata gaaatgtggc aactcgctct ggaagctgca ggggcctcat tcgcgcccgg
    15421 acagctactt ttcacaggca ccaatccgct tgcgaaggac agccagtaca cagcaatcaa
    15481 gcgttttcaa gaagttatga gtgaatacct aaaggctcac gtcggcgaaa ccaacatcac
    15541 tgaagcagat tatttcctga cccacgatct tgagggagag ttgatggtgt atgggtggat
    15601 ttctggggag gtgttatctc aggcacttag caaccttgaa tggctcaagg accgcgctac
    15661 atttgtcagg tcgctatacg accaacggcg ctacgtcatc aatgacattg tgatcggtga
    15721 ctatggcggt acgtgtgagg gggacgcggc caagcatggc gccacctgtg agtgtaacca
    15781 aggcagcaag gcggtgtatg tgaaagagat gttggaaaat gggcagaaga cgtcggtgcg
    15841 aagtggcttc acagttctga aagcttcgct gtgctatacg gattcttcgg agcttcacgg
    15901 gccactggat ggactcgtcg ttttcatgaa agacgacgac atagcgtcga aagcagctgc
    15961 gctgtggcag aaggggacct cccatcttgt cggcaagggt gatttgggat actcggacag
    16021 gttctttctg cacgcattta atactaccat agctgaggca gcgaacgatc ttcggcgtga
    16081 ccaaggtgaa agaatcgtaa ccgctgtgtt tggccccgtg accgaggcga tgttggacac
    16141 gccgaatata acctttatcg atcctctgga acttaaacca cggctgaata agttcagaag
    16201 gaatgtgatt catctctcgc ccaccctgga acaacaactt tacgtccttt catcgtacct
    16261 cgcgggcgcc ggggtcggta atgttgatgc cgtgatctgc agcaacgaag cggatggaat
    16321 tgcagatttt ctgcggagtt cactaactga attcgctgtc tcgctacgtt ccgctgtcat
    16381 acgcgaagat ggggaagatg ttggtaaata ccttccgatg agtgggaccg tgtttgtcat
    16441 tggcctttcc gttccagatg tgaaggaaat cgcgaggaaa cttgaagaac gcaacgattt
    16501 gcgtgtgatt gttctcttcg gggaattttc attcctgtat gatctgtttg ccacggccct
    16561 caataacacc gcaggcgccg cacgtcttgt gtttgccacc agcttgccac attggggtga
    16621 tactgaaacc tcatcgaaga ctgctcaact gttccatgat gttgaaaaag actcacggtt
    16681 atggacaccc ctatcagttc tggcattcgc aaccggtcga ttgatgcggg taatacttct
    16741 tcatgtagaa gaaatgagtc cagaaacgtt ggtaaatttc ttctataccg attcttccat
    16801 cgtctccgat gacatgcgct acggtgtatt cgacgacaca aaatgcgtcg acaccgcaaa
    16861 taaactttcg aagaatggtt gtgcctcaaa ctatggtgca acgcagatat cggtgtggtc
    16921 tatggcccgt gctctgaacg cttctattcc tccgcttaca aatcccatga caccgtcaat
    16981 gactttcaga aactctaacg cgggaaggat ctccggggcg tcgctggtgg gcatcatcat
    17041 cggtggtgcc cttgctttgt ttcttgtggt ggctctgggc gtggttccct actttttcct
    17101 gcgcaacacc gtgataacaa tctgcaccaa agatgacaga cccgtaacgc taatatttac
    17161 tgacattgag agcagtacgg cgttgtgggc tgcacaccct gaggtgatgc ctgatgccgt
    17221 tgcgacacat caccgcttaa ttcgcacatt gatttccaag tatgaatgct acgaagtcaa
    17281 gactgttgga gattctttta tgatagcgag taagagtcct ttcgctgccg tccaactcgc
    17341 acaggaacta cagctgtgtt tcttgcacca tgactgggga acaaatgcga ttgatgaatc
    17401 ctaccagcag tttgagcagc agcgtgcgga agatgacagt gattacacac cgccaactgc
    17461 tcggttggac cctaaagtgt acagtcgttt gtggaatggc cttcgagtgc gtgttggaat
    17521 ccacaccggg ctgtgcgaca tccgacgcga tgaagtaacg aagggatatg actattatgg
    17581 acgtacatcc aatatggcag caaggacgga gagtgtagca aatggtggtc aggtgctgat
    17641 gacgcatgcg gcgtacatgt cgctgtcagc tgaggagcgt cagcaaattg atgtcactgc
    17701 acttggtgat gtgccgctac gtggtgtgcc aaagcctgtg gaaatgtacc ggttgaatgc
    17761 cgtgcctggt cgtacctttt ccgttctccg tcttgagctt gagcttctta acgatgatga
    17821 ggatcagact acaacttcct gtagtgacca cagttcttca cggacagatc tgagtgtagc
    17881 tgcacaaact attgctgctt cccttcagtc gctcttgggt accttcaccc ctgctcagcg
    17941 ccagaaggca ttaataccct tttgcgaacg gtggcgtgtt ccgcttcctc aaaaggtcgg
    18001 gaacgtgtgg gacgatgacg gttgtcagga agttgtacgg cgtgttgctg caaaggtggg
    18061 ccgtgttatg gattttggaa cgaggaagcc ctccagcagt gtgacatcgt ggaaaggggt
    18121 ggaagtgtct tctcaggtgg aggagcggct gctataatgc ttgcttcatc ctcaagttcc
    18181 tcctgcgttg atgggcattg tgttaccgta caattgattg acttggaaaa tgacagcgct
    18241 accacctaat attaccctca tatcattgct gtatttgcgc tctgctattg aagtgaatga
    18301 atgagttatt ttatatgatg tgtgcaatct tttctttata ttcaattcgt ataaagtgga
    18361 cattataact ttccctcccc attcttgtgc ctgaaatgat cttctcaaac ggtaaatgct
    18421 ttaaatacat ttgtacatga agatggtttt tgtttctgac atggctactc tccttttgag
    18481 aactgtgttt gaagattgta cattttcttg tgtttgatgt gcctttgaat gtgagtatcc
    18541 tcgccacttt tgataatctc cctcgtttaa ataatatatt tattgactga ataagctt
//
LOCUS       BABUGU2A      680 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Baboon U2 small nuclear RNA gene, complete cds.
ACCESSION   M33777
KEYWORDS    U2 small nuclear RNA.
SOURCE      Baboon (P.hamadryas) DNA, clone lambda-Pu2.
  ORGANISM  Papio hamadryas
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; 
            Cercopithecidae; Cercopithecinae.
REFERENCE   1  (bases 1 to 680)
  AUTHORS   Matera,A.G., Weiner,A.M. and Schmid,C.W.
  TITLE     Structure and evolution of the U2 snRNA multigene family in
            primates: Gene amplification under natural selection
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.G.Matera, 20-APR-1990.
            
               Author address: A.G.Matera
               Dept. of Chemistry
               University of California
               Davis, CA 95616
FEATURES       from  to/span     description
    uRNA        191      378     U2 small nuclear RNA
    signal       99      114     upstream control element I
    signal      136      157     upstream control element II
    signal      379      420     3' processing box
BASE COUNT      148 a    159 c    224 g    149 t
ORIGIN      
        1 ggatccaagc acgggtgtaa gatttccctt ttcaaaggtg gagaataaga aaggagcccg
       61 agtgtgtaac ggcgtccata gtggggtgga cgagacagag gggatggggc aaggagcgag
      121 ggtggggctc tcaccgcgac ttgaatgtgg atgagagtgg gacggtgacg gcgggcgcga
      181 aggcgagcgg atcgcttctc ggccttttgg ctaagatcaa gtgtagtatc tgttcttatc
      241 agtttaatat ctgatacgtc ctctatccga ggacaatata ttaaatggat ttttggagca
      301 gggagatgga ataggagctt gctccgtcca ctccacgcat cgacctggta ttgcagtacc
      361 tccaggaacg gtgcaccccc tccggggata caaccgtttg ctaaaagtag agggaggtga
      421 gagacgacaa caccggcggg gcggctcgca cgccaagcgt ctgtgactac gtcttgccgg
      481 ctcaactgct tcgttggagt accacggggt tcctgagctg cgggcggcag acggtaggct
      541 tcgggcacgc ttccatttac gctgtgactg cagcgctttg ggaaggctac gaccccccgc
      601 agccgaccga ctcctttgga ggtcgttagg atctcagctt ggcagtcgag tggtggtgac
      661 cttttaaagg aatgggatcc
//
LOCUS       BOVLMCB       326 bp ds-DNA             SYN       28-JUN-1990
DEFINITION  Synthetic bovine liver microsomal lipase-solubilized cytochrome b-5
            gene, complete cds.
ACCESSION   J02905
KEYWORDS    cytochrome b-5.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 326)
  AUTHORS   Funk,W.D., Lo,T.P., Mauk,M.R., Brayer,G.D., MacGillivray,R.T.A. and
            Mauk,A.G.
  TITLE     Mutagenic, electrochemical, and crystallographic investigation of
            the cytochrome b-5 oxidation-reduction equilibrium: Involvement of
            asparagine-57, serine-64, and heme propionate-7
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            G.Mauk, 16-MAY-1990.
FEATURES       from  to/span     description
    pept         27      311     microsomal cytochrome b-5
    binding      16       19     ribosome binding site
BASE COUNT      106 a     82 c     77 g     61 t
ORIGIN      
        1 gtacggtacc gtaaaggaga acaacaatga gcaaagctgt caaatactac actctggaac
       61 aaatccaaaa acacaacaac tcgaaatcga cgtggctgat cctcgactac aaagtatacg
      121 acctgactaa attcctggaa gagcacccgg gaggcgaaga agtcctgcgc gaacaggccg
      181 gcggtgacgc gactgaaaac tttgaggatg ttggccatag taccgacgct cgtgaactgt
      241 cgaaaacgtt catcatcggt gagctgcacc cggacgatcg ttctaaaatc accaaaccgt
      301 ccgaatctta atgagaattc gtctga
//
LOCUS       DROEST6P     5424 bp ds-DNA             INV       28-JUN-1990
DEFINITION  D.melanogaster carboxylesterase 6 and P (Est-6 and Est-P) genes,
            complete cds.
ACCESSION   M33780 M33781
KEYWORDS    carboxylesterase.
SOURCE      D.melanogaster (strain Dm145) adult DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 929 to 4027)
  AUTHORS   Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and
            Richmond,R.C.
  TITLE     Molecular analysis of duplicated esterase genes in Drosophila
            melanogaster
  JOURNAL   Mol. Biol. Evol. 7, 9-28 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 5424)
  AUTHORS   Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and
            Richmond,R.C.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.C.Richmond, 19-APR-1990.
FEATURES       from  to/span     description
    pept       1173     2559     carboxylesterase 6 (Est-6) precursor (EC
                                 3.1.1.1), exon 1
               2611     2858     carboxylesterase 6 precursor, exon 2
    sigp       1173     1235     carboxylesterase 6 signal peptide
    matp       1236     2559     carboxylesterase 6
               2611     2855     carboxylesterase 6
    pept       3052     4438     carboxylesterase P (Est-P) precursor (EC
                                 3.1.1.1), exon 1
               4495     4742     carboxylesterase P precursor, exon 2
    sigp       3052     3111     carboxylesterase P signal peptide
    matp       3112     4438     carboxylesterase P
               4495     4739     carboxylesterase P
    pre-msg    1132     2867     Est-6 mRNA and intron (alt.)
    pre-msg    1132     2998     Est-6 mRNA and intron (alt.)
    IVS        2560     2610     Est-6 intron A
    IVS        4439     4494     Est-P intron A
    signal     1104     1110     TATA box
BASE COUNT     1536 a   1117 c   1235 g   1536 t
ORIGIN      Chromosome 3L 69A1-5.
        1 aagcttgcta tatatctatc tgtatatacc tatttgctta gaccgaactt aatgattttc
       61 cctttttcca aaacttacac aaatatatac ctaatacctt acactgaaaa aactgttcaa
      121 ttcactatgt acttattagt agttcttata aaccttgtta ttactaatta cgtacgaaaa
      181 cagaacggga aataaatatc taaaaattta aatagtaaac tgagctgatg tttctacaca
      241 aggatgaaca caagtatcaa agtgctcata ggtacagcat ttctagaaca caattacacc
      301 agtcttttgc tagcaataaa ttttcactca tctcccactt ttctctagaa ctatcctgaa
      361 tttatttatt tgattctatt ttgtcgcaag cacactcaat tcttgatctt tttaataata
      421 atcgtagcgg aaagctgtat ctgtacaggc aagaccaccg cccctcaaaa tgccaacgaa
      481 ctacagtcgg attcaaaagc gtgcacctgc aagtgcaagt tgtatgatat cttgcaccat
      541 tttatttcga tacacgtttt ggtacttcga tttaatatgg ctaattgtta ttcagaaaat
      601 aaacacaaaa acttaggaca catgttcaaa aatagaaata tgtacatatt tgttgcaaac
      661 cttaaatctt attaattatc tttatctata atcaaaaagc ctactcattt gttaaatggt
      721 ttcgttttaa agtgcattat caaattcatc tggataaaca ttcaaacaaa aaattgcatg
      781 catgtaaagt aattaatcaa gttgactcga agcgattgtg caatataagt ctcacctgaa
      841 gtgggattgg caactgcttg ggatgcatgt cgaaaatcta tattaagccc agtcaaatat
      901 tttaagcgta aagtaacaaa cctaatttgt atacggctat cgttttaatt cgcacacgcc
      961 tatcaactgg atgatgttca cactagagtt actcccattt gaaagccggg cattggaaaa
     1021 ctaatctcat gcgtgcccag atctcaattg agactggttg actggatgtt caggtggccg
     1081 ggtggcgata agccgatcga tggaataaaa ggggccgcaa ttgccgcatc tcaagatagt
     1141 tgcggtctga attcgccgga gtgaggagca acatgaacta cgtgggactg ggacttatca
     1201 ttgtgctgag ctgcctttgg ctcggttcga acgcgagtga tacagatgac cctctgttgg
     1261 tgcagctgcc ccagggcaag ctacgtggtc gcgataatgg aagctactac agctacgaat
     1321 cgattcccta cgccgaaccg cccactggcg atctacgatt cgaggctcca gagccgtaca
     1381 aacaaaagtg gtcggatata ttcgatgcca ccaaaacccc ggtggcgtgc ctgcagtggg
     1441 atcagttcac gcctggggcc aacaaattgg taggagagga ggattgccta accgtcagcg
     1501 tctacaagcc gaagaatagc aagaggaata gctttccggt ggtggcccac attcacggag
     1561 gtgcctttat gttcggtgca gcatggcaaa atggacacga gaacgtgatg cgtgagggca
     1621 aattcattct ggtgaagata agctatcgcc tggggccatt gggtttcgtg agcaccggcg
     1681 atagggatct tcccggaaac tatggactga aagatcaacg gctggctctc aaatggatta
     1741 agcagaatat agccagtttt ggtggagaac cgcagaacgt actgttggtt ggtcactccg
     1801 ctggaggagc ttcggtccat ctgcagatgc ttcgtgaaga tttcggccag ctggccaggg
     1861 cggcattctc gtttagtgga aatgctctag atccatgggt tatacagaag ggagcaagag
     1921 gacgagcctt tgaactggga cgcaacgtgg gatgtgaatc ggctgaagac tcgaccagcc
     1981 tgaagaaatg cctaaagtca aagccagcca gtgaattagt caccgccgtc cgtaaattcc
     2041 ttatattttc ctatgtgccc tttgctccat ttagtcctgt attggagcca tcggatgctc
     2101 cagacgccat tatcacccag gatcccaggg atgtcattaa gagcggaaag ttcggacagg
     2161 ttccgtgggc tgtttcctat gtcacagagg atggtggcta caatgccgcc ttgcttttga
     2221 aggaacggaa atctggaata gttatcgatg atctaaacga gcgttggctt gagttggcac
     2281 catatttact attctaccgg gacacgaaga ccaaaaagga tatggacgac tactcgcgga
     2341 aaattaagca ggagtatata ggcaatcaga gatttgacat cgaaagctat tcagaattgc
     2401 agcggctatt cacggatatt ctcttcaaga atagcacgca ggagtcattg gatcttcatc
     2461 gcaaatatgg aaagagtcct gcctacgctt atgtctatga caatccagcc gaaaaaggaa
     2521 tcgcacaggt cctggccaat cgaaccgatt atgattttgg taaggaaatc gtacttttaa
     2581 atggacttag ttaaatcatt acttttatag gaactgtaca cggtgacgac tactttttga
     2641 tattcgaaaa tttcgtacga gatgtggaaa tgcgtccgga tgagcagata atttcgagaa
     2701 attttatcaa tatgctggca gattttgctt cgagtgataa tggctctcta aaatatggtg
     2761 aatgcgattt caaagataat gtaggtagtg agaaattcca attattagct atttatattg
     2821 atggctgcca gaataggcag catgtggaat ttccgtaagt tacatgaata aaatcaaaaa
     2881 tttttcgttc tgtgtaattt ttaattattt aattattctc aactggcttt aaatatcatt
     2941 tgtacaaaac gtgtttgtcg ctttatattt tggtttttgt tgtttcttta taaagaataa
     3001 taaacgtttg tttacccgca gtttgattag tactttgtgg agttccagat catgagtata
     3061 ttcaaacggc tgttgtgcct gactttgctg tggatagcag ctttagaatc tgaagctgat
     3121 cccttgattg ttgagataac aaatggaaaa atccgtggca aagataatgg gttgtactac
     3181 agctacgaat cgattcccta tgccgagcat ccaactggtg ccctccgttt tgaagcacct
     3241 cagccgtata gtcatcattg gactgatgtt ttcaatgcca cgcagtctcc agttgagtgc
     3301 atgcagtgga atcagtttat aaacgaaaac aataagctga tgggtgatga ggattgctta
     3361 acggtaagca tctataagcc aaagaaaccc aatcggagca gctttcctgt cgtagtactc
     3421 ctgcatggag gtgctttcat gttcggtagt ggatccatat atggacacga ctccattatg
     3481 cgtgagggaa ctttgcttgt ggtaaaaata agctatcgtc ttggaccatt gggttttgca
     3541 agtaccggcg atagacactt gccgggaaac tatggtctaa aggatcaacg tctggcccta
     3601 caatggatca agaagaacat tgctcacttt ggtggaatgc cagataatat tgtgctcatt
     3661 ggtcactctg caggcggtgc ttcggctcat ttgcagctgt tgcacgagga tttcaaacat
     3721 ttggccaaag gagcgatttc ggtgagcggc aatgcattgg atccttgggt catacagcag
     3781 ggtggacgac gacgtgcatt tgaactgggt cgtattgtcg gttgtggaca cacaaatgtc
     3841 tccgcagaac tcaaggactg cttgaagtct aagccggcta gcgatatagt ctctgctgtc
     3901 cgaagcttcc ttgtgttttc ctatgtaccc ttcagtgctt ttggacctgt tgtggagccg
     3961 tcagatgcac cagacgcctt tctaaccgag gacccaagag cagtgattaa gagcgggaag
     4021 tttgcccaag tcccttgggc tgtgacgtac accactgagg acgggggata caacgctgct
     4081 cagctgttgg aaagaaacaa attaactggc gagagttgga ttgacctact caatgatcga
     4141 tggtttgatt gggcaccata cttgctcttc tatcgggacg ccaagaaaac catcaaagat
     4201 atggatgatc tttcatttga tctcaggcag cagtatctag cagatcggcg attcagtgtg
     4261 gaaagttatt ggaacgtgca gcgaatgttt actgatgttc ttttcaagaa tagcgtgcca
     4321 agtgcaatag atcttcaccg aaagtatggc aaaagtccgg tttattcttt tgtctacgat
     4381 aatcctaccg attccggagt gggtcaattg ctttccaatc gaacagatgt acattttggt
     4441 gggtacatac ttgcgtttca aatacttaaa gctctagctt tcttattatt aaaggtactg
     4501 tccacggaga tgactttttc ttgattttca atacagctgc ataccgtatc ggcattcgtc
     4561 cggatgaaga agttatttca aaaaagttta taggtatgct ggaggatttc gcactcaacg
     4621 ataagggaac attaacattt ggagaatgta atttccaaaa taatgtgaac agcaaggaat
     4681 atcaagtgct gcgtatttca cgaaacgctt gtaaaaacga ggaatatgct cggtttccct
     4741 aaggaataaa tgcatttaaa aaatgtaaac agtataataa tgtaattaaa gaacaaattt
     4801 attacacctt tatgccgcct caactgtcag aggtttttca ataagttcgt cttcttcttc
     4861 tgggtcgttt tcctcgccct ccacattccc atacatctca atcttgctgt cgaggaagcg
     4921 aaaatccttg aggatatgca acttgttgtg aaaccagcaa tggaaatgaa tttaatggcg
     4981 atgtttgcca aacagcagag tagcaggcag tgtgctccac gggaatattg gcttgcagta
     5041 ctccatttag agtgtaggat tgcacatgat tgtgctcacc accgattaag acacaatcgt
     5101 ccacaaagtc acatagatgc acgcgtccgg ggaagctaaa gacacatgtg ctttccatgg
     5161 aacgcaggtg gaaaattgaa gctttgggtc caccaccgca gagcagccaa tcctcgttga
     5221 cagccactgc tcctatccat ttgccccaat ccggacgtaa cagatttggg ttcttgtaag
     5281 gctccagcat tgaagtgtgc tgctgctgct tggtgctcca cactcgcaca gtgccatcct
     5341 cggcgccaga gaagatctgt ccgttggcat tgcccaccac actgtgcacg tagtctgtgt
     5401 gtccgcggta ctcgcgctga attc
//
LOCUS       ECOLASNII    1530 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  E.coli L-asparaginase II (ansB) gene, complete cds.
ACCESSION   M34277
KEYWORDS    L-asparaginase II.
SOURCE      E.coli (strain K-12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1530)
  AUTHORS   Jennings,M.P. and Beacham,I.R.
  TITLE     Analysis of the Escherichia coli gene encoding L-asparaginase II,
            ansB, and its regulation by cyclic AMP receptor and FNR proteins
  JOURNAL   J. Bacteriol. 172, 1491-1498 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        385     1431     L-asparaginase II (ansB) precursor
    sigp        385      450     L-asparaginase II signal peptide
    matp        451     1428     L-asparaginase II
    mRNA        323  >  1431     ansB mRNA
BASE COUNT      409 a    381 c    382 g    358 t
ORIGIN      
        1 aaatgggcgc gaaagcggtg ctgaaaagcg gcggtaaccc attacagaat gtgctgggaa
       61 gcctgggaag cctggggggg ctgcaatcct caatccaaac cgagtggaaa aagcaggaaa
      121 aagatttcca gcagtttggc aaagatgttt gtagccgcgt tgtgactctg gaagatagcc
      181 gcaaagccct ggtcgggaat ttaaaataat cctctatttt aagacggcat aatacttttt
      241 tatgccgttt aattcttcgt tttgttacct gcctctaact ttgtagatct ccaaaatata
      301 ttcacgttgt aaattgttta acgtcaaatt tcccatacag agctaaggga taatgcgtag
      361 cgttcacgta actggaggaa tgaaatggag tttttcaaaa agacggcact tgccgcactg
      421 gttatgggtt ttagtggtgc agcattggca ttacccaata tcaccatttt agcaaccggc
      481 gggaccattg ccggtggtgg tgactccgca accaaatcta actacacagt gggtaaagtt
      541 ggcgtagaaa atctggttaa tgcggtgccg caactaaaag acattgcgaa cgttaaaggc
      601 gagcaggtag tgaatatcgg ctcccaggac atgaacgata atgtctggct gacactggcg
      661 aaaaaaatta acaccgactg cgataagacc gacggcttcg tcattaccca cggtaccgac
      721 acgatggaag aaactgctta cttcctcgac ctgacggtga aatgcgacaa accggtggtg
      781 atggtcggcg caatgcgtcc gtccacgtct atgagcgcag acggtccatt caacctgtat
      841 aacgcggtag tgaccgcagc tgataaagcc tccgccaacc gtggcgtgct ggtagtgatg
      901 aatgacaccg tgcttgatgg ccgtgacgtc accaaaacca acaccaccga cgtagcgacc
      961 ttcaagtctg ttaactacgg tcctctgggt tacattcaca acggtaagat tgactaccag
     1021 cgtaccccgg cacgtaagca taccagcgac acgccattcg atgtctctaa gctgaatgaa
     1081 ctgccgaaag tcggcattgt ttataactac gctaacgcat ccgatcttcc ggctaaagca
     1141 ctggtagatg cgggctatga tggcatcgtt agcgctggtg tgggtaacgg caacctgtat
     1201 aaatctgtgt tcgacacgct ggcgaccgcc gcgaaaaccg gtactgcagt cgtgcgttct
     1261 tcccgcgtac cgacgggcgc taccactcag gatgccgaag tggatgatgc gaaatacggc
     1321 ttcgtcgcct ctggcacgct gaacccgcaa aaagcgcgcg ttctgctgca actggctctg
     1381 acgcaaacca aagatccgca gcagatccag cagatcttca atcagtacta atcgcctcgc
     1441 cccggtatcg tgccggggct ttttcacttc agactcacgt ccattgccaa ttttaattac
     1501 cctaatgata atcaccggaa taaattattt
//
LOCUS       HUMPADPRP    2085 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human poly(ADP-ribose) polymerase (PADPRP) gene, exon 1.
ACCESSION   M34337
KEYWORDS    poly(ADP-ribose) polymerase.
SOURCE      Human peripheral blood leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2085)
  AUTHORS   Ogura,T., Nyunoya,H., Takahashi-Masutani,M., Miwa,M.,
            Sugimura,T. and Esumi,H.
  TITLE     Characterization of a putative promoter region of the human poly
            (ADP-ribose) polymerase gene: Structural similarity to that of the
            DNA polymerase beta gene
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 701-710 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1943  /  2062     poly(ADP-ribose) polymerase
                                 /hgml_locus_uid="LZ0154E" /nomgen="PPOL"
                                 /map="1q41-q42"
    pre-msg    1777  >  2085     PPOL mRNA and introns
    IVS        2063  >  2085     PPOL intron A
BASE COUNT      495 a    535 c    550 g    505 t
ORIGIN      
        1 tttagggatg atatagttgt caacccagag atggcatgat catgcctttt gacttggtca
       61 ttctctaagt aaaactttta tttgttccat catattttcc acttattctg tttaccttca
      121 aaatatcttt tttttttttt tttgagacag ggtcacactg tcacccaggc tagagtccag
      181 tggcactatc atggctcacc acagcctcaa ccttcagggc tcaggtgatc ctcccacttc
      241 agcctcccga gtagatggga ctacaggcac ctgccaccac ccccagctaa tttttgtaga
      301 gacaaggttt tgccatgttg tccaggctgg tcttgaactc ctgggctcaa gggatccggc
      361 cacctcagcc tcccaaagtg ctaggattat aggcatgagc cactgtgccc agcctacctt
      421 caacgtatct aactggttac taacttttag gattcggcct atgtctcaca accttcttgc
      481 ttactcaaca tccttgtctc ttaagccact agcttcttct ctatggttaa cactttttat
      541 gagttttatt catctgctta tttttcttat cctctatacc agaattgaat attttcaaat
      601 aaagcacact catgttacaa tctttgaaat ggaaaaaaaa aatgcatagg attagaaaag
      661 aaaccaattt taataaacta tattttgaag tatagttcta tattaaacaa caagatctag
      721 gccaggtgca gtggctcatg cctgtaatcc cagcaatttg ggaagtcgag gtgggaggat
      781 tgcttgaggc caggggttca agaccagcct gggcaacatg gagagattcc ccatctcttt
      841 ctttacacac acacacacac acacacacaa aatatctgat agcaacaggt gcagtcatta
      901 ccacaatttc gagtagtgat gagcttaata atatttcgag ttatcaccaa caactgtaaa
      961 ctaacatgaa aacgtctgtg atgactattg cccacaaagt cacaggtact gctaatactc
     1021 ctggtatttg tagtaaattc ataataaagg aaatgctagg tttcagttgg tattttgtcc
     1081 cgacggtctg tggacggcag gttagaacgc ccgtccaagc caggagggtg gacctagcac
     1141 tgcagggtcc acctcgggcc aatcaactat attcccgagg cgggggcctg cgcttcccgg
     1201 acccagctgc cctcagggga gagaggacac acttaagagt ttggggccgg cgtggtagct
     1261 catgcccctg atcccagcac ttcgggaggc tgaggcgtga agatcacttg tagcaggagt
     1321 ttgagaccag tctagccaac ttggcgagac cctgtcccta aaaaaaattt ttttttaatt
     1381 agccagttgt ggtgagcgcc tgtagtccca gctactcggg aggctgaggt gggaggatcg
     1441 ctgggctcag gagttccaga ctgcagtgag ccatgatggc ggcactgcac tccagcgcgg
     1501 tgagactcag tctcaaaaat aaaaggggga ggggttgggg gtaaaattag ttgtgaaatc
     1561 aagtaagact tcctgggaca gaacaatcaa aggggtggcg ccgggtcctc caaagagcta
     1621 ctagctcagc ccaagccccg cctcggcccc cagggcagcg gccgcagagc tccacccggc
     1681 aggcgcccgg gaaactccgc cccccggccg gcagggcgcg cgcccgccgg ccccgccccg
     1741 tggacgcggg ttccgtggcg ttcccgcggc caggcatcag caatctatca gggaacggcg
     1801 gtggccggtg cggcgtgttc ggtgcgctct ggccgctcag ccgtggcggc tgggtgagcg
     1861 cacgcgaggc ggcgaggcgg caagcgtgtg tttctaggtc gtggcgtcgg gcttccggag
     1921 ctttggcggc agctagggga ggatggcgga gtcttcggat aagctctatc gagtcgagta
     1981 cgccaagagc gggcgcgcct cttgcaagaa atgcagcgag agcatcccca aggactcgct
     2041 ccggatggcc atcatggtgc aggtgcgggc cgctgtgcgg cgggg
//
LOCUS       HUMREVERB1    295 bp ss-mRNA            PRI       28-JUN-1990
DEFINITION  Human Rev-ErbA-alpha mRNA, 5' end.
ACCESSION   M34339
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Human fetal skeletal muscle, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 295)
  AUTHORS   Lazar,M.A., Jones,K.E. and Chin,W.W.
  TITLE     Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription
            from the noncoding DNA strand of a thyroid hormone receptor gene
            results in a related protein that does not bind thyroid hormone
  JOURNAL   DNA 9, 77-83 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        204  >   295     Rev-ErbA-alpha protein (put.)
BASE COUNT       58 a    107 c     74 g     56 t
ORIGIN      
        1 ccgttgcctc aacgtccaac ccttcctgca gggctgcagt ccggccaccc caagcccttg
       61 ctgcagggtg cttcggatcc tgatcgtgag tcgcggggtc cactccccgc ccttagccag
      121 tgcccagggg gcaacagcgg cgatcgcaac ctctagtttg agtcaaggtc cagtttgaat
      181 gaccgctctc agctggtgaa gacatgacca ccctggactc caacaacaac acaggtggcg
      241 tcatcaccta cattggctcc agtggctcct ccccaagccg caccagccct gaatc
//
LOCUS       HUMREVERB2    285 bp ss-mRNA            PRI       28-JUN-1990
DEFINITION  Human Rev-ErbA-alpha mRNA, partial cds.
ACCESSION   M34340
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Human fetal skeletal muscle, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 285)
  AUTHORS   Lazar,M.A., Jones,K.E. and Chin,W.W.
  TITLE     Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription
            from the noncoding DNA strand of a thyroid hormone receptor gene
            results in a related protein that does not bind thyroid hormone
  JOURNAL   DNA 9, 77-83 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   285     Rev-ErbA-alpha protein (put.) (AA at 1)
BASE COUNT       62 a     79 c     99 g     45 t
ORIGIN      About 1.7 kb after segment 1.
        1 ctcgtaaagg agagagaagt gcagagttcg attctgtaca agggggcagc ggcagaaggc
       61 cggccgggcg ggtcactggg cgtccacccg gaaggacagc agcttctcgg aatgcatgtt
      121 gttcagggtc cgcaggtccg gcagcttgag cagcagcttg gtgaagcggg aagtctccaa
      181 gggccggttc ttcagcacca gagcccgaag agcccgcagc agcgtctcct ggagctgctc
      241 caccgaagcg gaattctcca tgcccgagcg gtctgcagag acaag
//
LOCUS       HUMTFEB      1526 bp ss-mRNA            PRI       28-JUN-1990
DEFINITION  Human TFEB protein mRNA, partial cds.
ACCESSION   M33782
KEYWORDS    .
SOURCE      Human BJAB B cell line, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1526)
  AUTHORS   Carr,C.S. and Sharp,P.A.
  TITLE     A helix-loop-helix protein related to the immunoglobulin E box
            binding proteins
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.S.Carr, 19-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >  1526     TFEB protein (AA at 1)
BASE COUNT      324 a    495 c    493 g    214 t
ORIGIN      
        1 ggtggagtga cagccggagc ccgggcgccg gcgcggggct cggtgacagc ggaggcggcg
       61 gcccgggcgg gacgagggag cggccgcggc gccgacggcg ggcgggaagg gcggacgggg
      121 cgggcctccc tggtggcacg gggccggagc aggccgggag caggtggcgg gcacagcttg
      181 cgggccaggc acccgaactt gcgacaagtt gccggagccg gggcgcgggc ggcggacaga
      241 ttgaccttca gagcgaggga gccagcgccg gcagccacca tggcgtcacg catagggttg
      301 cgcatgcagc tcatgcggga gcaggcgcag caggaggagc agcgggagcg catgcagcaa
      361 caggctgtca tgcattacat gcagcagcag cagcagcagc aacagcagca gctcggaggg
      421 ccgcccaccc cggccatcaa tacccccgtc cacttccagt cgccaccacc tgtgcctggg
      481 gaggtgttga aggtgcagtc ctacctggag aatcccacat cctaccatct gcagcagtcg
      541 cagcatcaga aggtgcggga gtacctgtcc gagacctatg ggaacaagtt tgctgctgcc
      601 cacatcagcc cagcccaggc tctccgaaac ccccaccagc cgcctcccca ggggtgcgag
      661 ctggacacgt gctgtcctcc tccgctggca acagtgctcc caataagccc catggccatg
      721 ctgcacattg gctccaaccc tgagagggag ttggatgatg tcattgacaa cattatgcgt
      781 ctgacgatgt ctcttggcta catcaatcct gaaatgcaga tgcccaacac gctacccctg
      841 tccagcagcc acctgaatgt gtacagcagc gacccccagg tcacagcctc cctggtgggc
      901 gtcaccagca gctcctgccc tgcggacctg acccagaagc gagagctcac agatgctgag
      961 agcagggccc tggccaagga gcggcagaag aaagacaatc acaacttaat tgaaaggaga
     1021 cgaaggttca acatcaatga ccgcatcaag gagttgggaa tgctgatccc caaggccaat
     1081 gacctggacg tgcgctggaa caagggcacc atcctcaagg cctctgtgga ttacatccgg
     1141 aggatgcaga aggacctgca aaagtccagg gagctggaga accactctcg ccgcctggag
     1201 atgaccaaca agcagctctg gctccgtatc caggagctgg agatgcaggc tcgagtgcac
     1261 ggcctcccta ccacctcccc gtccggcatg aacatggctg agctggccca gcaggtggtg
     1321 aagcaggagc tgcctagcga agagggccca ggggaggccc tgatgctggg ggctgaggtc
     1381 cctgaccctg agccactgcc agctctgccc ccgcaagccc cgctgcccct gcccacccag
     1441 ccaccgtccc cattccatca cctggacttc agccacagcc tgagctttgg gggcagggag
     1501 gacgagggtc ccccgggcta ccccga
//
LOCUS       MUSFCERII    2237 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Mouse low affinity IgE receptor (Fc-epsilon-RII) mRNA, complete
            cds.
ACCESSION   M34163
KEYWORDS    IgE receptor.
SOURCE      Mouse (strain Balb/c) splenic B cell, cDNA to mRNA, clone p23.18.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2237)
  AUTHORS   Gollnick,S.O., Trounstine,M.L., Yamashita,L.C., Kehry,M.R. and
            Moore,K.W.
  TITLE     Isolation, characterization, and expression of cDNA clones encoding
            the mouse Fc receptor for IgE (Fc-epsilon-RII)
  JOURNAL   J. Immunol. 144, 1974-1982 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        175     1170     low affinity IgE receptor (Fc-epsilon-RII)
    mRNA    <     1     2237     Fc-epsilon-RII mRNA
BASE COUNT      592 a    622 c    533 g    490 t
ORIGIN      
        1 ggagaagact actgtcttca acacactagc ctgagctacc ttatccaagt gctccacata
       61 ttccagaagg agaaggacag acttcaagtt caaatcactt ccagagctgt gagtgacaag
      121 tgccttggca ggtagtgcac gcctcatcac tgaaaggatc caaacaagac tgccatggaa
      181 gaaaatgaat actcaggata ctgggaacct cctagaaagc gttgctgctg tgcaagacgt
      241 gggacacagc tcatgttggt ggggctgctg agcacagcaa tgtgggctgg cctgctggcc
      301 ctgcttcttc tgtggcactg ggaaacggag aagaatctaa aacagctggg agacactgca
      361 attcagaatg tctctcatgt taccaaggac ttacaaaaat tccagagtaa tcaattggcc
      421 cagaagtccc aggttgttca gatgtcacaa aacttgcaag aactccaagc tgaacagaag
      481 caaatgaaag ctcaggactc tcggctctcc cagaacctga ccggactcca ggaggatcta
      541 aggaacgccc aatcccagaa ctcaaaactc tcccagaacc tgaacagact ccaagacgat
      601 ctagtcaaca tcaaatccct gggcttgaat gagaagcgca cagcctccga ttctctagag
      661 aaactccagg aagaggtggc aaagctgtgg atagagatac tgatttcaaa gggaactgca
      721 tgcaacatat gtcccaagaa ctggctccat ttccaacaga agtgctacta ttttggcaag
      781 ggctccaagc agtggatcca ggccaggttc gcctgcagtg acctgcaagg gcgactagtc
      841 agcatccaca gccaaaagga acaggacttc ctgatgcaac acatcaacaa gaaggattcc
      901 tggattggcc tccaggatct caatatggag ggagagtttg tatggtcgga cgggagccct
      961 gtgggttata gcaactggaa tccaggggag cccaataacg ggggccaggg tgaggactgt
     1021 gtgatgatgc ggggatccgg ccagtggaac gacgccttct gccgcagcta cttggatgca
     1081 tgggtgtgtg agcagctggc aacatgtgag atatctgccc ccttagcctc tgtgactcca
     1141 acaaggccca ccccaaaaag tgaaccctga caaacttctg ctcacactct tctggatttc
     1201 tcctctacct ttatcgtgga aacagctggg ccctgaggat acccctatca gggcccaggg
     1261 ctctctgtga ccgaaggctt tgattatgtt cccacccata ctgaagcagc tggtggatgc
     1321 cagctcctgc cagctaccca gaaaccctct ccagctctcc agctaagctg gccatcccat
     1381 tccatctgcc ttcctcaaac ctgggcccca gccttgctag ctccctgact acgggcatgc
     1441 atgtgggcag ctgagccaac cagggagctg ctgagaacaa agatttcgaa ggcttctttt
     1501 gcagtcccca cctcctatca agttccccac tttctccccc tcggcatcag agaacagggg
     1561 ttccctttcc ccaggatctg ggatgagtcc tcccatcaag tttgcatcag tggtcccagg
     1621 actccgaccc tccttggagg ctaccaggtg tgctcctggt gcgggaggta ttgaaggaac
     1681 tctaaacagc tccagcaagg cgagcctggc tctgtctggt aggcctggcc cttctctccc
     1741 attccttcta ccttactaaa agctgttaga gaacagtcct aaagctagcc cccaaggtct
     1801 attcccttat ttggccactt cctcctcctg aggctgacta caaggtccag ctatccaagt
     1861 actgaagtct aacatcaaaa gccccctttg tctcacctaa gtagcaatgc ccaatcaaaa
     1921 tacaccatca catcatagcc cagtctaaca gaccgccctt tttctcttca taaaattaca
     1981 cctgcaacca ggcgtagtgg tgcaggcctt tagtcccagc acttgggagg cagagacaag
     2041 cgaatttctg agttcgaggc cagcctggtc tacaaagtga gttccaggac agccagggct
     2101 acacagagaa accctgtctc gaagaaagaa aaaaaaaaaa aattacacct gcgaggtcac
     2161 ttgggctgct gtttttctgc ctgagtcaga gggcagccac ttaacttttc ttccctgctt
     2221 aataaaggat ctctgtg
//
LOCUS       RATATM1      1585 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Rat alpha-tropomyosin (alpha-TM) gene, exon 1b.
ACCESSION   M34137
KEYWORDS    alpha-tropomyosin.
SEGMENT     1 of 2
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1585)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
COMMENT     [1] reports isolation of exon 1b which is the first exon coding for
            the alternatively expressed low-M-r alpha-tropomyosins.
FEATURES       from  to/span     description
    pept       1275  /  1406     low-M-r alpha-tropomyosin
BASE COUNT      360 a    408 c    449 g    368 t
ORIGIN      
        1 agaggtgttc cagctttgga agataagtga ggcaggctgt tttcttttct acctttgttt
       61 ccattctcca cctcccacat ttaagcaagt tcaggactga ctctttgaca gaggtgtagg
      121 attaaaaagc tcggagactt gacaggcttt aaaaaaaaca tagcggaaga aggaatgaca
      181 gaggcgggtt cttacagact gtgtccatgt ggctttccag ctggtaacca gaggttctgt
      241 cacaggtgtt tgtggtcttg cagacccggg ttcctcagat tccagatatc aggctgaatt
      301 gcctgaagtt gcacagttcg ttatggggta cagacaaaat cttatggtag cgtttaatta
      361 catagtttga gttcttaaat aagtgaggca tatgtaaatg ttttatatca tctgataact
      421 tcaacagcct tgtgaagtgt cctgtactat tatactcgtt tttaagagag gaaatccctg
      481 agaagtcatt tgtccaaagg cacacagcca gcagagactt aagagccaga ttgaaacctt
      541 aaaaagatgt atttagaaca ggagatttaa ccactacgct ttgccagcta gtagtttaca
      601 cttggtgaga tagtttgggt gttgggattc ttgtagtgag gatcttgtgt tctgtgttct
      661 tcacaaggat ctttaggata ttcctatttc aggctccgag ttccttgttc agactctggg
      721 aaatcgggag ttgccaatgc ctttagtatt cggaatgatg gcgttcttta caactcaaag
      781 agtttaggga acgttcaagg tctaggttcc tacccttact cagcttcagg ggaatctcga
      841 tttcagagcg ctccaaatgg ccagcaggag tcgccatagg ctttagcatc cagttcctct
      901 acttggagtc acccaggctc cgcggaatgg atgcaaacgc cctcctgcgg tagcctcgct
      961 ctaggcaaag cagcccagag ccagccattc ccgcgcgcac gcgcagtggt tccaagggta
     1021 gccagttcct cgggtccacc ggccccgccc gcctgcccgc cgcgcccatc ccctgcgttc
     1081 gcgcacccgc ccgcggctct ctagtgcggc cctggggccg tgacttccgg gctgcttctt
     1141 agccgcaggg ggcgccggcg ttgcaaagcg ggcctgggcg ggacgaaccg gacctgggct
     1201 ccgtaactgc aggagcccag cagtagcagc tgcagagcgc ggaggcagag cacagcctca
     1261 gccgccgcct cgccatggcg ggtagcagct cgctggaggc ggtgcggaga aagatccgga
     1321 gcctgcagga acaggcagat gccgcggagg agcgtgcggg cagcctgcag cgagagctgg
     1381 accaggagcg gaagctgcgg gaaaccgtaa gggatccacc catcacccga ggcccccacg
     1441 ccgcacctgg cgcacctagg ccaactggca cccggctact cagcctgccg gtctatctcc
     1501 tcggtctaaa gtaaaacctc caagggggag caagagtgct tgctgttgag aaggttctgg
     1561 gggcgagcat gaaatgggtc ccggg
//
LOCUS       RATATM2       462 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Rat alpha-tropomyosin (alpha-TM) gene, exon 9c.
ACCESSION   M34138
KEYWORDS    alpha-tropomyosin.
SEGMENT     2 of 2
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 462)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
COMMENT     [1] reports isolation of exon 9c which is the final exon coding for
            the alternatively expressed brain alpha-tropomyosins TMBr-1 and
            TMBr-3.
FEATURES       from  to/span     description
    pept    /   375      448     alpha-tropomyosin
BASE COUNT       97 a    141 c    100 g    124 t
ORIGIN      Unknown number of bp after segment 1.
        1 ctcgcctaga atctctgctg agctcccaca aagcatgagt gtctggggct gccctccagc
       61 ctggccttag tccccaaatc tgttgtccta aactacacat aagtgagtca ctgggtggct
      121 ttaatgaggt gcagaaggag cattggttct gcacctcttc gcctctgctc tacaccaagg
      181 aaagattctg gcccctcctt tattctgcca gacctccatg cgcctcaagg gcacagggca
      241 gagggcctgg ggtcagggta ccttgccgct gtcctttgta ttgtgttctc agtctgttaa
      301 ccgtttctca ctgactcttg ctaacctctt ccttctgcct cttcttttct gctaaccctt
      361 gctgacctgc ccagatcaac tctaccatca actcgagcaa aaccgccgtc taactaatga
      421 actaaagctg gccctgaatg aggattaaaa ccctgggcca ag
//
LOCUS       RATPLPA1     1658 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, exons 1 and 2.
ACCESSION   M34108
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     1 of 5
SOURCE      Rat liver DNA, clone lambda-rPLPg10-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1658)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1477  +  1577     parathyroid hormone-like peptide (PLP)
                                 precursor, exon 2 (first expressed exon)
    sigp       1477  +  1577     parathyroid hormone-like signal peptide
    pre-msg     951  >  1658     PLP mRNA and introns
    IVS        1259     1452     PLP intron A
    IVS        1578  >  1658     PLP intron B
BASE COUNT      325 a    507 c    466 g    360 t
ORIGIN      
        1 gggaaccagt tacacacaca cacacacaca cacacacaca cacacacaca cacacacact
       61 ggtaagaacc agtagcccac aacattctcc ccctctcccc acccacccac ccagcagcgc
      121 tgagaagctg aaacgactgg cataaggtta ctcagtgaat ctatagcagg acatagagta
      181 ggttatcaga ggaagagtgg ctaagttgga ttctgtgtgc tccaggttct ctgagatcac
      241 aaagtgggtc ttcgatcctt gttcacacac cctttctagg atctgctcca gatgtgatgt
      301 ccccagctgt tctcaaacag ttaagaccaa aaatcgcctt tgctctcctg ggtgctaccc
      361 ttagtcccct ccccagggga aggccatgcc aggcaggaag gctggcttgc aacagtgcga
      421 gcttgcaaac gagcttggga ctcttgggat tcgcaggcca cccggagttc ctccttgccg
      481 ccggtgggga ccagggctcc tggacctggg ctcctggact caggcggcag caggctggtt
      541 caggaggagg agagcgtcag atcggcccac gcggcgactg gggtggggct ccgtggctcc
      601 cagctcccct gtgctccccg cccccgccgg cccgctcacc ccagtctgct ccgccggcgc
      661 gcgttcctcg ggcgccacct ctttgcgact cgctcacttc tcagcaggtt ggccacagag
      721 cgtgtgaacc ttccaaggct gggctcacag ctcctctcca acctgcgccg cccagctggg
      781 cgtccgcccc gccgccggaa ccgcctcgcg ccacctggcc ctcgcatcca cgacacgcgc
      841 gcctgcaact tgttcaaggg cgttgtggaa tcaactttcc ggaagcaacc agcccaccgg
      901 aggaggtaga ccgacagcta tgtatatata cgctctctgc cgcaagtggc tttgcaccca
      961 gaggcctggt gcgcaggaca gctgactcct gcgggcacac tcgcgtttga agaggggttc
     1021 gccctgcccc gcgacccaga gcgctgccgc caagactaat tagacattgc tatgggagcc
     1081 acagcaacgc gtcacgcatc cccgacgcct atgtaaaacg gttcggtctt cgctctttca
     1141 gaggaaactc tctgattgct ttttcccttc tcgggtccct tttttttttt tttttttttt
     1201 ttttttttgc ctgtgcggtt tgagagaggc acagttggag gcgctgattc ctacaaaagt
     1261 aagtgctcag gggctcggaa acactttttt gctttctttt tcttgttttc ttttttcctt
     1321 ataacttgtg agatgccctc gacattggag tggcttgttg ggaagccacg gggttccaaa
     1381 agagggagag gggcagtgag aggaagagat agatggagag ccaggactga ccctcgggtt
     1441 ccactcttgc aggtcccaga gccagcgagc ggcacgatgc tgcggaggct ggttcagcag
     1501 tggagcgtcc tggtgttcct gctcagctac tccgtgccct cccgcgggcg ctcggtggag
     1561 gggcttggtc gcaggctgta agtaccggat cttcctcccg ggtgagggat ggaggagacc
     1621 agcaccagag gtgccgaact gaggcgggca atgcggaa
//
LOCUS       RATPLPA2      144 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, intron B
            (partial).
ACCESSION   M34109
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     2 of 5
SOURCE      Rat liver DNA, clones lambda-rPLPg[10-2,9-0].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 144)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    IVS     <     1  >   144     parathyroid hormone-like peptide intron B
BASE COUNT       42 a     26 c     21 g     55 t
ORIGIN      3 kb after segment 1.
        1 gcagaacact atgttttgac tgaagtccaa gttgcctctg ataaactttt gcccaatttt
       61 tcttttaatt tttttacagt aaccaaagac aaggataaag tataaatcca cttatttttc
      121 catgctggca ctttggtttg taat
//
LOCUS       RATPLPA3      145 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, intron B
            (partial).
ACCESSION   M34110
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     3 of 5
SOURCE      Rat liver DNA, clones lambda-rPLPg[10-2,9-0].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 145)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    IVS     <     1  >   145     parathyroid hormone-like peptide intron B
BASE COUNT       22 a     34 c     34 g     55 t
ORIGIN      640 bp after segment 2.
        1 agtctccata gggtcaatcc actgtcatgt gcttcttatg ctgtgctgtt tcttgtcccc
       61 ttgctgtgac actcacttct gcctttcttg gacttgagaa cacgttgggg acttttgaga
      121 atcggagtcc tgttttgtgg acttt
//
LOCUS       RATPLPA4      920 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, exon 3.
ACCESSION   M34111
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     4 of 5
SOURCE      Rat liver DNA, clone lambda-rPLPg9-0.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 920)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    94  +   516     parathyroid hormone-like peptide (PLP)
                                 precursor, exon 3
    sigp    +    94      100     parathyroid hormone-like signal peptide
    matp        101  +   516     parathyroid hormone-like peptide
    IVS     <     1       93     PLP intron B
    IVS         517  >   920     PLP intron C
BASE COUNT      198 a    353 c    166 g    203 t
ORIGIN      1.3 kb after segment 3.
        1 tccctactct agaggtcaac tgttcaccca tctcattcct tcaagtccaa cacgggcagg
       61 tcctcagctg acaactgtct tctcccctga cagaaaacgc gcggtgtctg agcaccagct
      121 actgcatgac aagggcaagt ccatccaaga cttgcgccgc cgtttcttcc tccaccatct
      181 gattgcggag atccacacag ctgaaatcag agctacctcg gaggtgtccc ctaactccaa
      241 acctgctccc aacaccaaaa accaccctgt gcggtttggg tcagacgacg agggcagata
      301 cctaactcag gaaaccaaca aggtggagac gtacaaagag cagccactca agacgcccgg
      361 gaagaagaag aaaggcaagc cggggaaacg cagagaacag gagaaaaaaa agcgaaggac
      421 tcggtctgcc tggccaggca caactgggag tggcctgctt gaggaccccc agccccacac
      481 ctccccgacc tcgacctccc tggagcccag ctcaaggtat tgtctgtccg ctgtgctcag
      541 tagggctccc ctccctccac cggcctgggt cccagagatg cagaatcctc tgtatcttga
      601 tctgagacta catcagttag gccacaatct cagtccattc ctggttagcc tgctagacaa
      661 tttaatttcc ctacttccct cccaccagcc caatcaacca tccttcctcc ctccctccct
      721 ccttccctcc ctcactccct ccctccctcc atccctccct ccctaccatc ctccctccct
      781 tccttctccc tttcctttcc atcccctccc ctcttctccc tttcctttcc atcccctccc
      841 ctcttctcct cccctctctc tgattcctgt cttcctcact cctccccaca catagggtct
      901 tcctcaagcc aaggctgact
//
LOCUS       RATPLPA5      855 bp ds-DNA             ROD       28-JUN-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, exon 4.
ACCESSION   M34112
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     5 of 5
SOURCE      Rat liver DNA, clone lambda-rPLPg9-0.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 855)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    21       30     parathyroid hormone-like peptide (PLP)
                                 precursor, exon 4
    matp    +    21       27     parathyroid hormone-like peptide
    pre-msg <     1      449     PLP mRNA and introns
    IVS     <     1       20     PLP intron C
BASE COUNT      280 a    139 c    157 g    279 t
ORIGIN      3.6 kb after segment 4.
        1 ccccttcctt gtcttcacag gacgcattga aactttcacc gaagatcttc caagacacgt
       61 tacaggattc tgtaatagta aacatatgga aagtattaga catatttatt gcctgtacat
      121 actgtaaatg cattgggatc aaactgtctc cccaggaaac tgcacatggg tcatgttacc
      181 ataatttatt ttgtcaactg atgtatttat ttgtaaatgt atcttggtgc tgctgactct
      241 gtttttttgt aacataatgc actttaggta tacatatcaa gtatgtcgat gaatttaaca
      301 cgaaaaagtg tctctatttt gtggttcatt ttgatgagtt ctgaaatata attatctaga
      361 ctgatctccc tctgcgcatg taaaaatggc actattttaa atttgtaaag aatgtctaat
      421 aaaatataat ctaattacac cgtgacttac aatgttggtt ttaattcttt aaggtttcta
      481 atcagagaaa cacagtacaa ttttttttta aattctactt gaataaaatc ctaagcttga
      541 tgtgtttgtg tggggatcta ctccaatgtg tccgcattag cttttagtta aaaaccactt
      601 aaaatgtatt ttgccaatat ttcagctcaa gtcaatagaa tggctgaaca gcctgaactt
      661 caagttgaaa agaaagacaa gaaagggcaa aaaataagca gagaagtcag gagaaagggt
      721 caaagagggt tgtccagagt taaaaccaga gaagtgggtc cagacttgaa gactctgaat
      781 aaattttgcc aatgtaaagt ttagtttgca agggcttgct ctcgattcat aacaataata
      841 aaagatctgc tcggg
//
LOCUS       RATTMBR1A    1050 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Rat brain alpha-tropomyosin (TMBr-1) mRNA, complete cds.
ACCESSION   M34135
KEYWORDS    alpha-tropomyosin.
SOURCE      Rat brain, cDNA to mRNA, clone pOk4.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1050)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         41      886     brain alpha-tropomyosin (TMBr-1)
    mRNA    <     1     1050     TMBr-1
BASE COUNT      336 a    220 c    306 g    188 t
ORIGIN      
        1 ctctggctcc tctgcctaag ggcccctcgc caccgccacc atggacgcca tcaagaagaa
       61 gatgcagatg ctgaagctcg acaaagagaa cgccttggat cgagcagagc aggcggaggc
      121 tgacaagaag gctgcggaag accggagcaa gcagctggaa gatgagctgg tgtcactgca
      181 aaagaaactc aaggccactg aagatgaact ggacaaatac tccgaggctc tcaaagatgc
      241 ccaggagaaa ctggagctgg cggagaaaaa ggccacagat gctgaagctg acgtagcatc
      301 tctgaacaga cgcatccagc tggttgagga ggagttggat cgcgctcagg agcgtctggc
      361 cacagctcta cagaagctgg aggaggctga gaaggctgca gatgagagtg agagaggcat
      421 gaaagtcatt gaaagccgag cccaaaaaga tgaagaaaag atggagattc aggagatcca
      481 gctgaaagag gccaagcaca ttgctgaaga tgctgaccga aagtatgaag aggtggcccg
      541 taagctggtc atcatcgaga gcgatctgga gcgtgcggag gagagggctg agctctcgga
      601 aggcaaatgt gccgagcttg aagaagagtt gaaaacggtg acgaacaact tgaagtcact
      661 ggaggctcag gctgagaagt actctcagaa agaagacaag tatgaagagg agatcaaggt
      721 tctctctgac aagctgaagg aggctgagac ccgggctgag tttgcagaga gatcagtaac
      781 caaattggag aaaagcattg atgacttaga agatcaactc taccatcaac tcgagcaaaa
      841 ccgccgtcta actaatgaac taaagctggc cctgaatgag gattaaaacc ctgggccaag
      901 ttctaggaac ggagtccatg ttcagaaaat ctatgactct cggggttaaa aaatgataac
      961 ttctagagaa atgggaagga aatgcattgg ctggaaggcc agccttgcct atgttttcct
     1021 ctgtatatct taattaaatc ttcaataacc
//
LOCUS       RATTMBR2A    1004 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Rat brain alpha-tropomyosin (TMBr-2) mRNA, complete cds.
ACCESSION   M34134
KEYWORDS    alpha-tropomyosin.
SOURCE      Rat brain, cDNA to mRNA, clone pOk15.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1004)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        137      892     brain alpha-tropomyosin (TMBr-2)
    mRNA    <     1     1004     TMBr-2
BASE COUNT      265 a    242 c    321 g    176 t
ORIGIN      
        1 cttagccgca gggggcgccg gcgttgcaaa gcgggcctgg gcgggacgaa ccggacctgg
       61 ctccgtaact gcaggagccc agcagtagca gctgcagagc gcggaggcag agcacagcct
      121 cagccgccgc ctcgccatgg cgggtagcag ctcgctggag gcggtgcgga gaaagatccg
      181 gagcctgcag gaacaggcag atgccgcgga ggagcgtgcg ggcagcctgc agcgagagct
      241 ggaccaggag cggaagctgc gggaaaccgc tgaagctgac gtagcatctc tgaacagacg
      301 catccagctg gttgaggagg agttggatcg cgctcaggag cgtctggcca cagctctaca
      361 gaagctggag gaggctgaga aggctgcaga tgagagtgag agaggcatga aagtcattga
      421 aagccgagcc caaaaagatg aagaaaagat ggagattcag gagatccagc tgaaagaggc
      481 caagcacatt gctgaagatg ctgaccgaaa gtatgaagag gtggcccgta agctggtcat
      541 catcgagagc gatctggagc gtgcggagga gagggctgag ctctcggaag gcaaatgtgc
      601 cgagcttgaa gaagagttga aaacggtgac gaacaacttg aagtcactgg aggctcaggc
      661 tgagaagtac tctcagaaag aagacaagta tgaagaggag atcaaggttc tctctgacaa
      721 gctgaaggag gctgagaccc gggctgagtt tgcagagaga tcagtaacca aattggagaa
      781 aagcattgat gacttagaag ataagtttct ttgcttctct cctcccaaga ctccttcatc
      841 aagccggatg tcccacctct ctgagctctg catctgtctg ctctccagct gacccaggtt
      901 tctttctagt gcccacccac cctagggcca ggcacagacc gtgctttcta ttgtacagag
      961 gtgatcctcc cagtgtaaaa taaacactgt acgctagtcc tgtt
//
LOCUS       RATTMBR3A    2488 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Rat brain alpha-tropomyosin (TMBr-3) mRNA, 3' end.
ACCESSION   M34136
KEYWORDS    alpha-tropomyosin.
SOURCE      Rat brain, cDNA to mRNA, clone pOk10.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2488)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      604     brain alpha-tropomyosin (TMBr-3)
    mRNA    <     1     2488     TMBr-3
BASE COUNT      713 a    531 c    579 g    665 t
ORIGIN      
        1 tgaagctgac gtagcatctc tgaacagacg catccagctg gttgaggagg agttggatcg
       61 cgctcaggag cgtctggcca cagctctaca gaagctggag gaggctgaga aggctgcaga
      121 tgagagtgag agaggcatga aagtcattga aagccgagcc caaaaagatg aagaaaagat
      181 ggagattcag gagatccagc tgaaagaggc caagcacatt gctgaagatg ctgaccgaaa
      241 gtatgaagag gtggcccgta agctggtcat catcgagagc gatctggagc gtgcggagga
      301 gagggctgag ctctcggaag gcaaatgtgc cgagcttgaa gaagagttga aaacggtgac
      361 gaacaacttg aagtcactgg aggctcaggc tgagaagtac tctcagaaag aagacaagta
      421 tgaagaggag atcaaggttc tctctgacaa gctgaaggag gctgagaccc gggctgagtt
      481 tgcagagaga tcagtaacca aattggagaa aagcattgat gacttagaag atcaactcta
      541 ccatcaactc gagcaaaacc gccgtctaac taatgaacta aagctggccc tgaatgagga
      601 ttaaaaccct gggccaagtt ctaggaacgg agtccatgtt cagaaaatct atgactctcg
      661 gggttaaaaa atgataactt ctagagaaat gggaaggaaa tgcattggct ggaaggccag
      721 ccttgcctat gttttcctct gtatatctta attaaatctt caataaccaa gaagttttta
      781 tgatacatta tgtatatggg aaaatattta taccaatcaa ttcagctctt cctgcatcag
      841 caggccctct ttgaagtgat gataatgagg actcccccac caccaccatc cccccaaaaa
      901 aaggaaaaaa ttttctattt cctctaaaca gctgtgagtc ctgctttaaa tatctcttct
      961 acatttgtat tcctaacatc ctcctaacat gtatagtgtt tgaaatgtga tccaacgtgt
     1021 acctatgctt gggcaaaata gcttttgaaa acaggagctc gtaccacaag tccctggttg
     1081 aaaggtatgc tttgttaagt ttaacggtgt tggaatctga ttatcgtgtc atgctgttaa
     1141 gcgggatact ataaaaagac taagagaatg tccctaatga agtgtgcatg agatgtgttc
     1201 atataaccat ttgcgagccg tagaaataaa tctatttaaa aagtggtcag aaccttcctt
     1261 acgtcatctt cgttttctgt gaggtcgtgt tctgacccac ctacccgatc tttaactgcc
     1321 tgtaactcag aatcctgtgc tggtattaac gtcttcttcg gtctgtgtaa caatgctgta
     1381 cctgtatttg gtttttgacc ctcccttgtt ccccgtagca cgcagtgggt gagccttcac
     1441 caaacacacc ctgcacgtta cgtcagggtg ggttgcaggt gtaggtgcac ccggggcctt
     1501 gtgtagatgt gagtgacttc caaagcatgt gtagtgccac tcatctgtgt gtgactggtg
     1561 tttgcttact aaatgtacaa accactaccg agtgtgaagc tcagccattc atgactcaat
     1621 tttctgatcc catcacagag aaagtggcac atgccaaaga agaaaacctt agtatgcacc
     1681 agatgctgga ccagacttta ctggagctaa acaacatgtg aaaccctcct tagctgcggc
     1741 cacattctct cgtttccccc tgttgtttta ttttgtcttt aaacacatgc ttaccatgaa
     1801 accccttcaa tgcttttttt ttttatatat atatatacct ttaccactgt cactgcaaca
     1861 tctgcagaga gccagctagg gcagggggtg gggaaagaca cacagaaagc aggtgtactt
     1921 gttatttaaa tgtgccattt cccaggttga cattgccaca cttcacagag tttaggcatg
     1981 cagtttgctt agccagtgta ggaagcctca cgaaaacaaa aggattaaca gcctaagtgc
     2041 caatggtggg gtcatcagga agactgatgt tggagacaat caggtgcaga ttggtgctaa
     2101 tttaaacaaa agaacccctg gtgtcttttg ttcagtctaa tacaatttag acttctgtcc
     2161 aacactaatt tatttttgtc ttgagtagca ctgcgagagg atatgggttc catgtgcccg
     2221 tgttcctcac agccaagggt ttgaaagcca cgctggtctt gtaagattcc cattccttcc
     2281 cagctgacat gcttgtggtc ccttccaact cagtggggag agggctcagt gctgttcgcc
     2341 cccccccccc ccaattctcc atgtcagggt aagagatcta ggcactacat atgtaattgg
     2401 taaagaaaag gcattttctc gagagttata actatgtaaa cattgtataa tgatatgaaa
     2461 taaaatgcac attgtaggac attttcta
//
LOCUS       RSPNIFHD     1375 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  R.rubrum dinitrogenase reductase (nifH) gene, complete cds; and
            dinitrogenase alpha subunit (nifD') gene, 5' end.
ACCESSION   M33774
KEYWORDS    dinitrogenase alpha subunit; dinitrogenase reductase.
SOURCE      R.rubrum DNA.
  ORGANISM  Rhodospirillum rubrum
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 1375)
  AUTHORS   Lehman,L.J., Fitzmaurice,W.P. and Roberts,G.P.
  TITLE     The cloning and functional characterization of the nifH gene of
            Rhodospirillum rubrum
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.J.Lehman, 18-APR-1990.
FEATURES       from  to/span     description
    pept        367     1254     dinitrogenase reductase (nifH)
    pept       1370  >  1375     dinitrogenase alpha subunit (nifD)
    binding     355      358     ribosome binding site
    binding    1357     1361     ribosome binding site
    signal      145      160     upstream activating sequence
    signal      175      190     upstream activating sequence
    signal      280      296     ntr promoter
BASE COUNT      287 a    404 c    431 g    253 t
ORIGIN      
        1 ggggatccca aggagggacc ctccgtcctg accaagaact gtgcgttcgt gttttcgccc
       61 attcctttat caccgatcgg gcggcgacgg gaagggcgtt ctgggcagta gcgcaccatc
      121 gcgcatagcg tcgcgccgcc ccgctgttgg gtccgcgaca gggttttggg gcgttgtcgg
      181 gtttgcgaca cggttgaaaa taaccctaca ttccttgtta gatagattta ttcctttgat
      241 taaaaacgat aattttcaaa aaaatcgggc aaatcaggtt tggcacgctg ggtgcattaa
      301 cggggtgagc gtggcggtgg agcttccccg ccttaggacc ggatcacgaa gaaaaggatt
      361 gattccatga gcgcactgcg tcagatcgcg ttctacggca agggcggtat cggcaagtcg
      421 acgacgtccc agaacacgct ggccgctctg gtcgaaatgg gccagcggat cctgattgtc
      481 ggctgcgatc ccaaggccga ttccacccgc ctgatcctga acaccaagct gcaggacacc
      541 gttcttcatc tggccgccga ggccggctcg gtcgaagacc tcgacgtcgc cgatgtggtg
      601 aagatcggct acaagggcat caagtgcacc gaatccggtg gtccggagcc cggcgtcggc
      661 tgtgccggcc gtggcgtgat caccgccatc aacttccttg aggaaaacgg cgcctacgac
      721 gacctcgatt acgtctctta cgacgtgctg ggcgacgtgg tttgcggtgg cttcgccatg
      781 ccgatccgcg agaacaaggc tcaggaaatc tacatcgtca tgtcgggcga gatgatggcg
      841 ctgtatgccg ccaacaacat cgccaagggc attttgaaat acgcccacac cggtggggtt
      901 cgtctgggcg gtctgatctg caacgagcgt cagactgata aggaagtcga actggccgag
      961 gccctggccg gccgtttggg ctgccgcctc atccacttcg tgccgcgcga caacggcgtt
     1021 cagcacgccg agctgcgccg ccaaacggtg atccaatacg cccccgacag caagcaggcc
     1081 ggtgagtatc gcacgctggc gaccaagatc cacaacaact ccggccaggg cgtggtgccg
     1141 accccgatca ccatggaaga cctggaagag atgctgatgg aattcggcat catgaagtcc
     1201 gacgaagagg cgctggccga gctcgaagcc aaggaatccg ccgccgccaa ttaaggccgt
     1261 cgccgtgcga cgttcccccc ggccggccgg tcaggcgcag gggcgacgga agcgggcgac
     1321 ctccaagaag gcctcttcgc gaggcgatag tcgatcagag gagagggata tgagc
//
LOCUS       YSCMTFDEL      70 bp ds-DNA             ORG       28-JUN-1990
DEFINITION  S.cerevisiae mitochondrial novel junction sequence.
ACCESSION   M34150
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site         33       44     novel junction site
BASE COUNT       11 a     25 c     19 g     15 t
ORIGIN      
        1 cttcggagtt cggccccccc ataagggggg gacctcactc cttccccact gcactggatg
       61 cgggacttat
//
LOCUS       YSCMTGAP12    457 bp ds-DNA             ORG       28-JUN-1990
DEFINITION  S.cerevisiae mitochondrial gap 12 sequence.
ACCESSION   M34149 X15185
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 457)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
COMMENT     The gap 12 sequence is a segment of yeast mitochondrial DNA that
            had not been determined previously.
BASE COUNT      161 a     57 c     57 g    182 t
ORIGIN      
        1 tacaatttat aatttaataa agaaggaaat aaataataat aactcctttt ggggttccgg
       61 tggggttcac acctttataa ataataaata aagatgttta ctcctcttcg gggttcggtc
      121 ccctttttgg gttccggaac taattaatat tttatataat aataataata tattaatata
      181 atttcattat taataaatat ctcctgcggg gttcggtccc ccccgtaagg ggggggtcct
      241 cactccttcg gagcgtacta ttattataaa taattatata ttataatata attaaaaagt
      301 attataattg aaacgaaaat tgtaatttta aatggaataa taattattat atatttaata
      361 tatttaataa agttataata tctctttcta ccggactatt ttattttatt ttattttatt
      421 tttataaaga aaaatagtat aatattatct tctcctc
//
LOCUS       YSCMTORI1A     70 bp ds-DNA             ORG       28-JUN-1990
DEFINITION  S.cerevisiae mitochondrial ori1 gene, 5' flank.
ACCESSION   M34153
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
BASE COUNT       19 a     11 c     24 g     16 t
ORIGIN      
        1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag
       61 ggggtgggtg
//
LOCUS       YSCMTORI2A     70 bp ds-DNA             ORG       28-JUN-1990
DEFINITION  S.cerevisiae mitochondrial ori2 gene, 5' flank.
ACCESSION   M34152
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA         32  >    70     ori2 mRNA (+/- 4 bp)
BASE COUNT       23 a      0 c     18 g     29 t
ORIGIN      
        1 tttttttata agataatttt tgtaaatata taagtaataa attaagtttt atagggggag
       61 ggggtgggtg
//
LOCUS       YSCMTORI7A     70 bp ds-DNA             ORG       28-JUN-1990
DEFINITION  S.cerevisiae mitochondrial ori7 gene, 5' flank.
ACCESSION   M34154
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
BASE COUNT       19 a     11 c     24 g     16 t
ORIGIN      
        1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag
       61 ggggtgggtg
//
LOCUS       CFXCSMA       859 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  C.aurantiacus 5.7 kd chlorosomal protein (cmsA) gene, complete cds.
ACCESSION   M33964
KEYWORDS    chlorosomal protein.
SOURCE      C.aurantiacus (strain J-10-fl) DNA.
  ORGANISM  Chloroflexus aurantiacus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Multicellular filamentous green bacteria; Chloroflexaceae.
REFERENCE   1  (bases 107 to 554)
  AUTHORS   Theroux,S.J., Redlinger,T.E., Fuller,R.C. and Robinson,S.J.
  TITLE     Gene encoding a 5.7 kd chlorosomal protein in Chloroflexus
            aurantiacus: Regulated message levels and a predicted carboxy
            terminal protein extension
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 859)
  AUTHORS   Theroux,S.J., Redlinger,T.E., Fuller,R.C. and Robinson,S.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.J.Robinson, 01-MAY-1990.
            
               Author address: S.J.Robinson
               University of Massachusetts
               108 Morrill Science Center
               Amherst, MA 01003
FEATURES       from  to/span     description
    pept        210      452     5.7 kd chlorosomal protein (cmsA)
    mRNA        135  >   452     cmsA mRNA
    binding     196      204     ribosome binding site
    rpt         464      495     inverted repeat
BASE COUNT      158 a    232 c    283 g    186 t
ORIGIN      
        1 gatcggcgac ttccggcggg tcattacgct gccgaccagt ctggccggcc tggaagccgg
       61 tcaggccgag atggagggtg aatggttgat tgtccccttc atggcgccgc aaatggcgca
      121 acgctgaggc agcctgaact gtggggtttc tcggaggctt gcctccggtg acctgaataa
      181 tggagtccgt ttgataagga ggtgtgtgca tggcgacgag aggctggttc tcggagtcgt
      241 cggcgcaggt ggcgcaaatc ggcgacatca tgttccaggg ccactggcaa tgggtctcga
      301 atgcgctaca ggccaccgcg gcagcggttg acaacatcaa ccgcaatgct tacccgggcg
      361 tgtcccggag cggctcgggc gagggagcgt tcagcagcag cccgagcaac ggcttccgtc
      421 cgaagcgcat tcgctcgcgc ttcaaccgct aggctggcgg gcaggcgtga ggcgtgcgtt
      481 ctgtcgcctc acgccgatcc ctcgggagcg ttatgattaa cctgctctac tacagtcata
      541 ccaccgtacc ggctcatctg gaagagatta cgcgactgga tgtgctcgat cctgacgagc
      601 aagtgctggt cgcgatagat ggcgttttgg tcgatcagca gggacagcgg ctgagcggcc
      661 caacgctgca cgattattgt ctgatcaccg acttgcgggt gatcctgtgg gcacgagatt
      721 atggccggca tttgtgttat gcctttccat tgcatgagct gacgttgatt gatgggcgcg
      781 ggattgatcc gatccacggc gcagtgacga tgggctttag cgcgccagat accgatgatc
      841 agatttttac gctgactct
//
LOCUS       VVCACTA      3385 bp ds-DNA             PLN       28-JUN-1990
DEFINITION  Volvox carteri actin gene, complete cds.
ACCESSION   M33963
KEYWORDS    actin.
SOURCE      Volvox carteri (female strain HK10) DNA, clone lambda-VAc21.
  ORGANISM  Volvox carteri
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Volvocales; Volvocaceae.
REFERENCE   1  (bases 1 to 3385)
  AUTHORS   Cresnar,B., Mages,W., Mueller,K., Salbaum,J.M. and Schmitt,R.
  TITLE     Structure and expression of the single Volvox carteri actin gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Mages, 01-MAY-1990.
            
               Author address: W.Mages
               Lehrstuhl fuer Genetik
               Universitaet Regensburg
               Universitaetsstr.31, D-8400 Regensburg, FRG
FEATURES       from  to/span     description
    pept        450      509     actin, exon 2 (first expressed exon)
                613      708     actin, exon 3
                780      926     actin, exon 4
               1009     1074     actin, exon 5
               1150     1197     actin, exon 6
               1396     1567     actin, exon 7
               1667     1821     actin, exon 8
               1959     2142     actin, exon 9
               2425     2630     actin, exon 10
    pre-msg     195  >  2630     actin mRNA and introns
    IVS         258      439     actin intron A
    IVS         510      612     actin intron B
    IVS         709      779     actin intron C
    IVS         927     1008     actin intron D
    IVS        1075     1149     actin intron E
    IVS        1198     1395     actin intron F
    IVS        1568     1666     actin intron G
    IVS        1822     1958     actin intron H
    IVS        2143     2424     actin intron I
    signal      120      124     CAAT box
    signal      164      169     TATA box
BASE COUNT      775 a    830 c    928 g    852 t
ORIGIN      
        1 aagcttgaag agaggtctta tggaccctag tcggtcacga tccagacacc aaacgtaacg
       61 aaaaaaagga ataattagcc gggcttgttg atgcatattc gcggctttac attatcccgc
      121 caatacgttc acgtacggca caggacatcc ttttcactcg agctataaac tatcatcatc
      181 cagattgctt gttttacctt gcaccagtta ctatcagcac caaccggccc ttgacttcgc
      241 acagccttct aaaaaaggta cgcgcgacgt gtttagcgtt cgggcagccg cagctaggag
      301 gtttggtggc tgcgaggccc ttctgcccca ggataagaag gggattaact tccgttgcca
      361 taacatgaag tccggggcct tgtccatgtt cgatcgccaa cgatgcaacc tatgctgtct
      421 taccgctggg cttatgcagg gcagcaacaa tggctgagga gggcgaggtt tctgctctgg
      481 tgtgcgacaa cggctcgggc atggtcaagg tacgttttct ggctgctgct cggttgggcc
      541 cggccaggcc caccccgcag ccttgctgag cttttgatgg atcgcatcta acacgtggcc
      601 cctgatttgt aggctggctt tgctggtgat gatgccccac gagctgtgtt tccgagcatt
      661 gttggtcggc cccgccatac gggtgtgatg gtcggcatgg ggcagaaggt gaatagaatg
      721 acggcgatgc aggcacggac tcctcgacat ttttataatg atctttttgt tttcgccagg
      781 attcctacgt gggcgacgag gcacagtcta agcgtggtat cttgacgttg cgctacccta
      841 ttgagcacgg tatcgttacc aactgggatg acatggagaa gatctggcac cacactttct
      901 ttaatgagct tcgtgtggca ccggaggtgc gtctgcgtac tagcttcctt ggctttcagc
      961 tatggaccag ttcgacattg acaactgctt tccttcccaa atctgtagga gcacccagtc
     1021 cttctcacgg aggcgcctct taatcccaag gcgaaccgtg agaagatgac gcaggtacgg
     1081 atattgtttt atgcacatac cgttgcgttc gttcccaagc ttgcctcccg ataatctccg
     1141 tttgcgcaga tcatgttcga gaccttcaac gtccctgcca tgtatgtggc gatccaggtg
     1201 acgattgggg gaggggactt agtaatacct cgattgctgt tgttcgattc acccctctca
     1261 tttcgatttg cgcacttggg gtggcatcat gtttcgtgcg tcggggtagt tgttaaccct
     1321 gcaaccctag ctactgccac taatgacagt aggaaagtaa tatttaattg ctgtccttca
     1381 tgctgacaac tgcaggccgt gctgtcgctg tatgctagcg gtcgtacgac gggtatcgtt
     1441 ctagattccg gtgacggtgt aacccacacg gtgcccatct atgaaggcta cgcgctgccg
     1501 cacgccatcc tgcgtctgga ccttgccggc cgtgacctga cggactacct gatgaagatt
     1561 ctaatgggtg aggcattcgg atagggggac caaaataccg taccgcgaga aatgggttat
     1621 gagtcgtgtg ttttgacttg tgttgaacgg cttcccttcg ttacagagcg tggttactca
     1681 ttcacgacca ctgctgagcg tgaaatcgtg cgcgacatca aggagaagct ctgctatgtg
     1741 gcgctggatt tcgagcagga aatggcaaca gcagcctcca gctcggcgct tgagaagacg
     1801 tacgagctgc ctgatggcca ggtgcggccg tagctagacg ccgacgaata gcctaaaact
     1861 ttcggtgatg atcagcttag atacatttga ttgtgtcggc attgcacgaa aaaagtcgtt
     1921 gtccataatg ttaaaattgc cctcttttct ccctgcagcc aatcacaatt ggcaacgagc
     1981 gcttccgttg ccctgaggtg ctgtacaacc ctagcctaat cggcatggag gcggtcggta
     2041 tccacgacac tacctttaac agcatcatga agtgcgatgt cgatatccgc aaggatctgt
     2101 acaacaacat cgtgctctct ggtggaacga ccatgttccc tggtaagcaa tgcatactac
     2161 tgttagactt caggacatgt tgtgatcgca ttagcgatta gaaccgggtt agcgcgggac
     2221 agacaggagg tacggtgttt ggggctcggg gaactcttat gaggatgaat aagcggttct
     2281 tagcgtctat gtcacggacg gatctatgga attccgtcag tatctggagg cttgcatgtt
     2341 gcatcgctcc ccctgaactc catcgtcctc cgtgaaggtc acagtgcctt gacaaatgcg
     2401 atctgtgctg ttgaacaatt gcaggcatcg ctgatcgcat gaccaaggag atcacggccc
     2461 ttgcgccgtc tgccatgaag atcaaggtcg tcgccccgcc ggagcgcaag tactctgtgt
     2521 ggattggtgg ttcaatcctg gcgtccttgt cgaccttcca gcaaatgtgg atcgcgaaga
     2581 gcgagtacga cgagagtggt ccgtcgattg tgcaccggaa gtgcttctaa gcataatagc
     2641 ttgcggactg ttattcggcc tctttcggtt ataactgacc ggaagcgggt tactcttcag
     2701 tcaaggcgga tgcctatgat tttgcaaaaa ttcgagcgga gatgtcatct tggtttcaag
     2761 ctgcgtgggt gagaaatcaa ctgtgggatc catgtcgaca gcttaacaac agtctctacc
     2821 atttcatggt atgcggctgt agccgtagcg agtacgtcgt catgagcagc ggtagtttga
     2881 tcgtgcgaag tagtctgata ggggcagggc gcgtggggat ggctaataga acgaagagga
     2941 tgtgtgcatt caagcgagtg gatgtcggcg attggtctag gccaagcaaa gtgcgtgtaa
     3001 gggaaagctg gatgacacga ccattgctcg tgtgctttcg gaacttccac tcttgaactc
     3061 aggcctatgc gctgccttga gtatcctgcc ggataaaagg cgggaggcgg taacgttagt
     3121 agacccttcc aggggtataa atggtcaatt acaacaaaac tgtaacataa cgaagcgctg
     3181 cacgtagtac gaaatccgtt gcaaacgtga aacccagcta cttgccaata cttaaacgcg
     3241 catcagcatt gtgtacaaga acgtcaatga ggcattgcgt aaattacaca tgtgagcact
     3301 aagaaaacag ctgtatgttg tcacgtttat tcactagcgc acagcccgca agtaagcggt
     3361 ggcaaagccc agctttggta caaag
//
LOCUS       HAMCD44      1089 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Hamster hyaluronate receptor (CD44), complete cds.
ACCESSION   M33827
KEYWORDS    CD44 hyaluronate receptor.
SOURCE      Hamster, cDNA to mRNA.
  ORGANISM  Cricetulus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 1089)
  AUTHORS   Aruffo,A., Stamenkovic,I., Melnick,M., Underhill,C.B. and Seed,B.
  TITLE     CD44 is the principal cell surface receptor for hyaluronate
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Seed, 23-APR-1990.
FEATURES       from  to/span     description
    pept          1     1089     hyaluronate receptor (CD44)
BASE COUNT      299 a    282 c    277 g    231 t
ORIGIN      
        1 atggacaagt tttggtggca cgcagcttgg ggactctgcc tcttgccgct gagcctggcg
       61 cacgagcaga tcgatttgaa cataacctgc cgctatgcag gtgtattcca cgtggagaaa
      121 aatggccgct acagcatctc acggactgag gcagctgacc tctgccaagc tttcaacagc
      181 actctgccca ccatggacca gatggtgatg gccctgagca agggctttga aacatgcagg
      241 tatgggttca tagaaggcca cgtggtgatc ccgaggatcc agcccaatgc catctgtgca
      301 gccaaccaca ctggggtgta tatcctcaca tccaacacat ctcactacga tacatattgc
      361 ttcaatgcct cagcacccct tgaagaagac tgtacatctg tcacagacct gcccaattcc
      421 ttcgaaggac cagttaccat aactattgtc aaccgtgatg gtacccgcta cagcaagaag
      481 ggcgagtata gaacacacca agaagacatt gatgcctcaa ataccacaga tgatgatgtc
      541 agcagcggat cctccagtga gaagagcacc tcagggggct atgttttcca cacctacctt
      601 cccactatac actcaactgc agaccaggat gatccctact tcatcgggag caccatggcc
      661 accagagacc aagactcatc catggatccc agggggaatt ccctcactgt gactgatgga
      721 tccaaattaa ctgaacactc aagtgggaat caagacagtg ggcttaactc aacttctcgt
      781 cctggaggaa aacctcgagt tccagaatgg ctcatcgtct tggcatctct cctggcgctg
      841 gctctgattc ttgctgtttg cattgctgtc aacagtagga gaaggtgtgg acagaagaaa
      901 aagctggtga tcaacagtgg caatggaaag gtggaggaca ggaagccaag tgagctcaac
      961 ggggaggcca gcaagtctca ggaaatggtg catttggtga acaaggaacc atcagagact
     1021 cctgaccagt ttatgacagc tgatgagacc cggaatctgc agaatgtgga catgaagatt
     1081 ggggtgtag
//
LOCUS       DROGNBPSA1    441 bp ds-DNA             INV       28-JUN-1990
DEFINITION  D.melanogaster guanine nucleotide-binding protein (Gs) alpha
            subunit gene, exon 1.
ACCESSION   M33996
KEYWORDS    guanine nucleotide-binding protein.
SEGMENT     1 of 3
SOURCE      D.melanogaster (strain Canton S) embryo DNA, clones
            lambda-DGs-[4,5,7,10,11,12].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 441)
  AUTHORS   Quan,F. and Forte,M.A.
  TITLE     Two forms of Drosophila melanogaster Gs-alpha are produced by
            alternate splicing involving an unusual splice site
  JOURNAL   Mol. Cell. Biol. 10, 910-917 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pre-msg     362  >   441     guanine nucleotide-binding protein (Gs-alpha)
                                 mRNA and introns (alt.)
    pre-msg     174  >   441     Gs-alpha mRNA and introns (alt.)
    IVS         426  >   441     intron A
    signal      152      157     promoter region (put.)
    variant     365      365     c in DNA, g in cDNA
BASE COUNT      140 a     89 c     85 g    127 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcgcta ttctacatag ctcgggatac tttattttat tattaatatt cgctaaaata
       61 gcacatcgtc atttggcata gatgtttttg gcccagtaac atatctatct acatcaccgc
      121 ccgcactatc cgataggcag actgactatc gataaaagta tctacttcgt tgcatcccta
      181 ggttgataca taaaataaaa aatatataaa aagtgagtta gtactggtat taaaagtaat
      241 ttttaagcta caaattgatg aattgcgagc acacagagac taagaacatt gggaatccgg
      301 aaaacagtgg ctggcgcgtc gggtgcgata gaattgcctg tttttcagac cccaaaagtc
      361 gctcccaacc gagtcgtgca cctatagtct tggcccgaag ttgctcccat gaataaaaat
      421 gttcggtaat ttcacggtaa a
//
LOCUS       DROGNBPSA2    897 bp ds-DNA             INV       28-JUN-1990
DEFINITION  D.melanogaster guanine nucleotide-binding protein (Gs) alpha
            subunit gene, exons 2,3 and 4.
ACCESSION   M33997
KEYWORDS    guanine nucleotide-binding protein.
SEGMENT     2 of 3
SOURCE      D.melanogaster (strain Canton S) embryo DNA, clones
            lambda-DGs-[4,5,7,10,11,12].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 897)
  AUTHORS   Quan,F. and Forte,M.A.
  TITLE     Two forms of Drosophila melanogaster Gs-alpha are produced by
            alternate splicing involving an unusual splice site
  JOURNAL   Mol. Cell. Biol. 10, 910-917 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        265      540     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 2
                607      703     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 3
                760  +   880     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 4
    pep$        265      540     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 2
                607      703     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 3
                760  +   880     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 4
    pre-msg <     1  >   897     Gs-alpha mRNA and introns
    IVS     <     1       29     intron A
    IVS         541      606     intron B
    IVS         704      759     intron C
    IVS         881  >   897     intron D
BASE COUNT      266 a    225 c    226 g    180 t
ORIGIN      About 1.4 kbp downstream of segment 1.
        1 ccatctacta taccattgta cgaatgcagc ctgctgcaga gatccactga agcgctgagc
       61 caacgccaac ggtggtgcca ctgagatcgg agatcggaga tcgcagaccg gagaccgacg
      121 agcacacaag ggattgtggt ttgatccgag gaacccaagc actcagctat actggagcag
      181 aacggagaaa gcacaagggg tgccgagtcg cccgacgcga accaggaaaa ccgagcgacg
      241 tgttggtgga ccccgtgggc tgcgatgggt tgctttgggt cgcccacctc caagcagtcg
      301 gacgtgaact cggaggactc gaagagccag aagcgccgga gcgatgcaat atctagacag
      361 ttgcagaagg acaaacagct ctacagggcc acacacaggc tgctcctcct gggggcgggc
      421 gagtccggca aatcaaccat agtcaagcaa atgcgaatat tgcatgtcga cggattttct
      481 gactcggaaa agaaacagaa aattgatgat attaaaaaga atattcgaga cgctatcttg
      541 gtgagttcca tcccgccggc catctaagcc caaacacttg cttaactcct tgccaatccc
      601 ttacagacta ttacaggagc catgagcaca cttaatccac ctgtagcttt agaaaagaag
      661 gaaaatgaac ccagagtgga gtacattcag gattatgcat ctagtaagag cctctgccgc
      721 cccaacttaa gttgttatca accattatct gccccgcagg tccggacttt aattatcctc
      781 ctgaatttta tgaacataca gaagaactat ggaaagacaa gggcgttctt caaacctatg
      841 agaggtcgaa tgagtatcaa ttaatcgatt gtgcgaaata gtaagtaatg cccaccc
//
LOCUS       DROGNBPSA3   1380 bp ds-DNA             INV       28-JUN-1990
DEFINITION  D.melanogaster guanine nucleotide-binding protein (Gs) alpha
            subunit gene, exons 5,6,7,8 and 9.
ACCESSION   M33998
KEYWORDS    guanine nucleotide-binding protein.
SEGMENT     3 of 3
SOURCE      D.melanogaster (strain Canton S) embryo DNA, clones
            lambda-DGs-[4,5,7,10,11,12].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1380)
  AUTHORS   Quan,F. and Forte,M.A.
  TITLE     Two forms of Drosophila melanogaster Gs-alpha are produced by
            alternate splicing involving an unusual splice site
  JOURNAL   Mol. Cell. Biol. 10, 910-917 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +     4      132     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 5
                206      385     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 6
                445      569     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 7
                746      828     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 8
                909     1055     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 9
    pep$    +     4      132     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 5
                206      385     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 6
                445      569     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 7
                755      828     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 8
                909     1055     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 9
    pre-msg <     1     1115     Gs-alpha mRNA and introns (alt.)
    pre-msg <     1     1176     Gs-alpha mRNA and introns (alt.)
    pre-msg <     1     1243     Gs-alpha mRNA and introns (alt.)
    IVS     <     1        3     intron D
    IVS         133      205     intron E
    IVS         386      444     intron F
    IVS         570      745     intron G (for Gs-alpha-S) (no splice
                                 consensus)
    IVS         570      754     intron G' (for Gs-alpha-L)
    IVS         829      908     intron H
    variant     780      780     t in DNA, c in cDNA
    signal     1083     1088     polyA signal
    signal     1154     1159     polyA signal
    signal     1221     1226     polyA signal
BASE COUNT      441 a    265 c    254 g    420 t
ORIGIN      About 500 bp downstream of segment 2.
        1 aagcttcctg gaccgagtga gcacaatcaa gaatccaaac tacaccccta atgagcagga
       61 tattcttcgg tgccgtgttt tgacttctgg aatatttgaa acaagatttc aagtggacaa
      121 agtaaacttt cagtaagtgt aatcccaagt gatatcgcaa tattttagtt tctattccat
      181 tctaatgata tgcattcgtg tgcagcatgt tcgatgtcgg tggccagcgg gacgagcgta
      241 ggaaatggat tcagtgtttc aatgatgtaa ctgctatcat attcgtaact gcgtgctcaa
      301 gttataacat ggttttgcgg gaagatccca cccagaaccg acttcgagaa tctttggatt
      361 tgttcaagag tatttggaac aacaggtgaa ccgagcaatt tggcgccatc tcgacaaatg
      421 gaaactaacg agttgttctt tcagatggct tcgcacgatt tctattatac tatttttaaa
      481 taagcaagat ttgttagcag agaaaattaa ggctggaaaa agtaaattgt cggaatattt
      541 ctccgagttt aacaaatacc aaacgccaag taagtaaaat agtataaccc atagtagagg
      601 ccaattgagg gcgagggagt catcagatca accgactcag atctggccaa gaaattcgca
      661 attccttttc aaatttgcaa accgtgccca agttaactta agaaatttaa tggataatgt
      721 attttgtcta atttcaaatg tgctgtcgac acaggtgacg caataatgga atccaatgat
      781 gacccagaag taatacgagc aaaatatttc atacgagacg agtttctggt gggttatatc
      841 acatctccgt ttcttcctcc acctatctaa ctagccttaa tccttaatgg gatctctttc
      901 ttatatagcg tatatctacc gctagcggag acggaaaaca ctactgctat ccacatttca
      961 catgcgccgt tgacacagaa aacattaaac gtgtgtttaa tgattgcaga gacattattc
     1021 aaaggatgca ccttcgtcaa tatgaattgt tataggttat ccccatcgcc gtaatgcaag
     1081 taaataaaaa tattaatgac ttactgtttt aaatataatt atcaaatgta atttgtattt
     1141 aaatagtccc ttgattaaaa caaaaaattc gctgcattta gatctggtat gacctttcac
     1201 aaaactcaac attaaagaat aataaataaa catgttaata acagacacgc ttgtttcgat
     1261 tattttggct attctaccag gatgcacctg ctttattttc tatagtccct tgacatgaag
     1321 cattttagag tatttccgct atcagagtcc gtatccatat ccatgggcat ttacattcac
//
LOCUS       SYCSBEFA      850 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  Synechocystis sp. cytochrome B559 alpha subunit (psbE) and beta
            subunit (psbF) genes, complete cds.
ACCESSION   M33897 X06988
KEYWORDS    cytochrome B559 alpha subunit; cytochrome B559 beta subunit;
            photosystem II protein.
SOURCE      Synechocystis sp. (strain PCC6803) DNA, clone pKW1261.
  ORGANISM  Synechocystis sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Chroococcales.
REFERENCE   1  (bases 1 to 553)
  AUTHORS   Pakrasi,B.P., Williams,J. and Arntzen,C.
  TITLE     Targeted mutagenesis of the psbE and psbF genes blocks
            photosynthetic electron transport: Evidence for a functional role
            of cytochrome B559 in photosystem II
  JOURNAL   EMBO J. 7, 325-332 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 850)
  AUTHORS   Pakrasi,H.B., Nyhus,K.J. and Granok,H.
  TITLE     Targeted deletion mutagenesis of the beta subunit of Cytochrome
            B559 protein destabilizes the reaction center of photosystem II
  JOURNAL   Z. Naturforsch., C, Biosci. 45, 423-429 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Himadri, 01-MAY-1990.
               Washington University Dept. of Biology
               Campus Box 1137
               St. Louis, MO 63130
FEATURES       from  to/span     description
    pept        121      366     cytochrome B559 alpha-subunit (psbE)
    pept        403      537     cytochrome B559 beta subunit (psbF)
    binding     109      112     ribosomal binding site (put.)
    binding     392      396     ribosomal binding site (put.)
BASE COUNT      197 a    196 c    197 g    260 t
ORIGIN      
        1 gctaccccag caccgaggtg gcaatggttc cctaggcggc tcacaaaata gtagactaga
       61 ctctacttgc tttgcatttg tcagtcaatg ttgttttgaa aaattgaagg agaacacaaa
      121 atgtcaggga ctaccggcga gcgtccattt tccgatattg tcaccagcat tcgctactgg
      181 gtgatccaca gcatcaccat cccgatgttg tttattgctg gttggttgtt tgtcagcacg
      241 ggcttagcct acgatgcttt tggcactccc cgccccgatg aatatttcac ccagacccgt
      301 caagagttgc ccattctcca ggaacgctac gacattaatc aggaaattca agagtttaat
      361 caataaaaca tttaattgtt cttttttagt tggtaattaa caatggcaac ccaaaatcct
      421 aatcaaccgg ttacttatcc catttttacg gtgcgctggc tggcggttca caccctggcg
      481 gtgccctctg tcttctttgt cggggcgatc gccgcgatgc aatttattca acgctaggag
      541 tttttcatgg acagaaattc aaacccaaac cgccaaccgg tggaattgaa ccgcacttct
      601 ttatacctgg gtctattgtt ggtggctgtg ttggggattt tgttctccag ctatttcttt
      661 aactaaactt ttttaatacg caatttagga ggcatggtat gttcgcagaa ggcagaatcc
      721 ctttgtgggt ggtgggtgta gtggccggta ttggcgccat tggtgttcta ggattatttt
      781 tctacggagc ctatgctggt ttaggttctt ccatgtaatc gagggctagc cgccacacaa
      841 tatcatggtt
//
LOCUS       ECOPURCA     2060 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  E.coli 5-phosphoribosyl 5-aminoimidazole 4-N-succinocarboxamide
            synthetase (SAICAR) and dapX gene, complete cds.
ACCESSION   M33928
KEYWORDS    5'-P-ribosyl 5-aminoimidazole 4-N-succinocarboxamide synthetase;
            SAICAR synthetase.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2060)
  AUTHORS   Tiedemann,A.A., DeMarini,D.J., Parker,J. and Smith,J.M.
  TITLE     DNA sequence of the purC gene encoding SAICAR synthetase and
            organization of the dapA-purC region of Escherichia coli K12
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (sites only)
  AUTHORS   He,B., Shiau,A., Choi,K.Y., Zalkin,H. and Smith,J.M.
  TITLE     Genes of the Escherichia coli pur regulon are negatively controlled
            by a repressor-operator interaction
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Authorin sequence for [1] kindly submitted by J.M.Smith,
            27-APR-1990.
            
               Seattle Biomedical Research Institute
               4 Nickerson Street
               Seattle, WA 98109-1651
               e-mail: jms0234@max.acs.washington.edu
FEATURES       from  to/span     description
    pept       1281     1994     5'-phosphoribosyl-5-aminoimidazole-4-
                                 N-succinocarboxamide synthetase (EC 6.3.2.6)
    pept        637     1068     dapX protein
    signal     1106     1111     -10 region
    binding    1112     1127     purine repressor protein binding site (PUR
                                 box)
    site       1180     1216     repetitive extragenic palindrome
BASE COUNT      501 a    504 c    587 g    468 t
ORIGIN      
        1 gcatgccggt ttgctgtaaa gtttagggag atttgatggc ttactctgtt caaaagtcgc
       61 gcctggcaaa ggttgcgggt gtttcgcttg ttttattact cgctgcctgt agttctgact
      121 cacgctataa gcgtcaggtc agtggtgatg aagcctacct ggaagcggca ccgcttgcgg
      181 agccttcatg ccccggctgg aatgattttg ccggtgacct ccggtgttat gcaatcccgg
      241 tgaccaacgg tagtggtgct gtcggtaagg cgctgacatt cgtccaccag cccagccgct
      301 ggcactggtt tctggcgcgc gtacccagtt cacgggcgat accgcttcat tgctggtgga
      361 aaatggtcgt ggcaatactc tgtggccgca ggtggttagc gtgctgcagg cgaaaaacta
      421 caccatcacc caagtgatga tgctggtcag acactgacca ccgattgggt acaatggaac
      481 cgtctggacg aagacgagca gtatcgtggt cgttatcaaa tctctgttaa gccgcagggt
      541 tatcagcagg cggttacggt taaactgctg aacctggaac aggcgggcaa accggttgca
      601 gacgcggctt ccatgcagcg ttacagcacg gagatgatga acgttatttc cgccggtctg
      661 gataaatctg ccactgacgc cgcgaaccgt gcgcaaaatc gtgcctccac cactatggac
      721 gtacaaagtg cagctgatga caccggttta ccaatgctgg tcgtacgcgg gccgttcaat
      781 gtggtttggc aacgtctgcc agcggcgctg gaaaaagtgg gcatgaaagt gaccgacagc
      841 acccgttcgc agggcaacat ggccgtaact tataagccgc tgtctgacag cgactggcag
      901 gaactgggcg cgagcgatcc aggcctggca tccggtgact ataaactgca ggttggcgat
      961 ttagataacc gcagcagcct acagttcatc gatccgaaag gtcatactct gactcagagt
     1021 cagaacgacg cgctggtagc tgtcttccag gctgcgttta gcaagtaaaa atacagggct
     1081 ggaatcatcc ggcccttttt tctgatatga tacgcaaacg tgtgcgtctg caggaaaacg
     1141 cgattttagc ggtaattcgc acgaaatttg tttgtcggac gtagttcgga taaggcgttc
     1201 acgccgcatc cgacaaaaca tccggcacac cagacagcaa aagattttaa aacgttaatt
     1261 cacacccagg agtgataaag atgcaaaagc aagctgagtt gtatcgtggt aaagcgaaaa
     1321 ccgtatacag cacggaaaac ccggacctgt tggtgctcga attccgcaat gatacgtcag
     1381 caggggatgg cgcgcgcatt gagcagtttg atcgcaaagg tatggtgaac aacaagttca
     1441 actacttcat tatgagcaaa ctggctgaag cgggtatccc gactcaaatg gagcgtctgc
     1501 tctccgatac cgaatgtctg gtgaaaaagc tggatatggt gccggttgag tgtgtcgtgc
     1561 gtaaccgtgc tgctggctct ctggtgaaac gtcttggaat cgaagaaggt attgagctga
     1621 acccgccgct gttcgatctg ttcctgaaaa acgacgccat gcacgatccg atggtcaacg
     1681 aatcttactg cgaaaccttt ggctgggtga gcaaagagaa cctggcgcgt atgaaagagc
     1741 tgacctacaa agcgaacgac gtgctgaaaa aactgttcga tgatgctggt ctgattctgg
     1801 tcgacttcaa gctggaattt ggtctgtaca aaggcgaagt ggtactgggt gatgagttct
     1861 ccccggacgg tagccgcctg tgggacaaag aaacgctgga gaaaatggac aaagaccgtt
     1921 tccgccagag cctcggtggc ctgatcgaag cctatgaagc cgtcgcccgc cgcctgggtg
     1981 tacagctgga ctgatttttc tgttcatcat cttgccgtgc agctggcacg gcaagacaac
     2041 cgctctcgga atatatgcat
//
LOCUS       PIGSGLT1A    3649 bp ss-mRNA            MAM       28-JUN-1990
DEFINITION  Pig Na+/glucose cotransporter protein (SGLT1) mRNA, 3' end.
ACCESSION   M34044
KEYWORDS    Na+/glucose cotransporter.
SOURCE      Pig kidney cell line LLC-PK1, cDNA to mRNA, clone pPSGT-[B1,C2].
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 3649)
  AUTHORS   Ohta,T., Isselbacher,K.J. and Rhoads,D.B.
  TITLE     Regulation of glucose transporters in LLC-PK1 cells: Effects of
            glucose and monosaccharides
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Ohta 03-MAY-1990.
               Mass General Hospital
               Building 49
               13th Street
               Charlestown, MA 02129
FEATURES       from  to/span     description
    pept    <     1     1818     Na+/glucose cotransporter (SGLT1)
    signal     1935     1940     polyA signal (put.)
    signal     2086     2091     polyA signal (put.)
    signal     2238     2243     polyA signal (put.)
    signal     3205     3210     polyA signal (put.)
    signal     3628     3633     polyA signal (put.)
BASE COUNT      905 a    790 c    871 g   1083 t
ORIGIN      
        1 ttcttcctgg ctggacgaag tatggtgtgg tggccggttg gagcttctct gtttgccagt
       61 tatatcggaa gcgggcattt tgttgggctg gcaggaactg gagcagctgc aggcattgcc
      121 actggtggct ttgaatggaa tgctctgatt tgggtggttg ttctgggctg gctgtttgtc
      181 cccatttaca ttaaggcagg ggtggtgacg atgccagagt atctgcggaa gcgatttgga
      241 ggcaagcgga tccaggtcta cctctctatc ctgtccctga tgctctacat tttcaccaag
      301 atctcggcag acatcttctc tggggccata ttcatcacct tggccttggg cctggatctg
      361 tacctggcca tctttctctt actggcaatc actggccttt acacaatcac agggggcctg
      421 gctgctgtga tttatacgga taccttgcag acagcaatca tgctggtggg gtcttttatc
      481 ctaactgggt ttgcttttca tgaagtggga ggttatgatg ccttcatcga aaaatacatg
      541 aatgccattc cgactgtgat ttctgatgga aatatcacca tcaagaaaga atgttatgcc
      601 ccgagggctg actccttcca tattttccga gatcctctca agggagacct gccatggcct
      661 gggctcacct ttgggctgtc catcctcgcc ctgtggtact ggtgcacaga tcaggtcatt
      721 gtgcagcgct gtctctcggc caaaaacatg tcacatgtga aggctggctg cgtcatgtgt
      781 gggtacttta agctgctgcc catgtttgtc atagtgatgc cagggatgat cagccgcgtc
      841 ctgtacacag aaaaaattgc ctgcaccgtc ccctcggaat gtgagaaata ttgcggcacc
      901 aaggttggct gttccaacat tgcctacccg accttggtgg tggaactcat gcccaatgga
      961 ttgcgaggcc tgatgctgtc ggtcattttg gcctctctca tgagctccct gacctccatc
     1021 ttcaacagcg ccacgactct cttcaccatg gatgtctacg ccaagatccg gaagagagca
     1081 tctgagaaag agctcatgat tgcgggaagg ctgttcatcc tggtgctgat tggcatcagc
     1141 attgcctggg tgcccattgt gcagtcagca caaagtgggc agctcttcga ttacatccag
     1201 tctgtcacca gttacttggg accacccatc gcagctgtct tcctacttgc tattttctgc
     1261 aagagagtca atgaggaggg agccttttgg ggactggtca taggatgtat gattgggctg
     1321 gcccgtatga ttaccgagtt tgcctatgga accgggagct gcgtggagcc cagcaactgt
     1381 cccacaatta tctgtggtgt gcactacttg tactttgcca tcatcctctt tgtcatttcc
     1441 atcatcatcg tcctggtcgt ctccctcttc accaagccca ttccagatgt gcatctctac
     1501 cgcctgtgtt ggagcctgcg caacagcaaa gaggagcgta ttgacctgga tgcagaggag
     1561 gaggacattc aagaagcccc agaagagacc attgaaatag aagttcctga ggagaagaaa
     1621 ggatgcttca ggaggaccta tgacctgttt tgtggcctgg accagcagaa gggccccaaa
     1681 atgaccaagg aagaggaggc agccatgaag ctgaagatga cagacacctc agagaagcct
     1741 ttgtggcgga cagtagtgaa catcaatggc attatcctgc tgaccgtggc cgtattctgc
     1801 catgcatatt ttgcctgaaa tcctgccttc tgccacaggc tgtgccctga aggctggact
     1861 ctttactctg tttcctttta gtcccattct gtgctgtcaa agggacacca gccagttgta
     1921 aattttgcct agtcaataaa tgtgtacatt tgtaattagc cagaaaaaaa aatcagtgat
     1981 ctgttattaa cttatgtatt tgaggccagt gtcatacagt catctgtaca tatcagacct
     2041 gcagaaggaa gtccactcag tcggatgtct aggaaaaggc agactaagaa acagaagtcc
     2101 cgtgatgtct gatgcaagtt tgtctcaggt agattcagca tgtcagtgtt gtttgtgatc
     2161 cttgaatgtt attttaagag ttgtggtcgc cctggttcgg ccacttccta taaacctcac
     2221 ctctcataat ttttcaaaag aaaaaatccc ccccctcccg atatcatgtc aagtttttcc
     2281 ggcacaagag taacaatttg ataagttgat gaacacattg aagctgagga tggaagtttg
     2341 atggagtccc atcctgggtc tttgctctcc aaaggagagg tctgggatgg taatacttcc
     2401 cccaaagaga gggacccagg tgccaatctc aggccactag ggaaatcagg taacttgagt
     2461 caaggcgatt ttctgcattt cttagaagtg tccattgtga gttccttggg ttgagatttc
     2521 attcattttg gcatatcttt tgcctgtact gtcttagtca ctcagcatgt ctgtggacca
     2581 aatgaatgat ataacatttt gttttataat gtttgcttta tttttggtta actttttctt
     2641 atggctgcac aagtacaacc tctcgtgact ataatgaatt gggtgcattt ccataggcct
     2701 gtcttcctca tggtgccaat gctggcatac aggtctagaa gctgactctc tcacataata
     2761 gagcaaatgc acctcttgga ggcagccgtc atctcaaagt aaccacactc tcctgaaaca
     2821 ctgaaacttt ttggcctgag accctccttc tatgcgctca catgaccata gctcatattt
     2881 ttttgaggtg tcttgattga tgatcatcca ctgaccaagg ttagagtggg tggtctctgt
     2941 tccaggtggt agcacttgac actcttattt cttgtttcct agggagaatt tttaaaaatt
     3001 aaatttaatt aatttaattc ttttaattcc accagtgaaa tattaattat aatataatcc
     3061 tgtggaagtc atggtgttgg tattgactgg tttcaagcta gcttggaaaa gagaagtgat
     3121 ctgagattct gtggagctaa gagcggttcc ttgggtataa aagtggctta acaggtatac
     3181 aatggcttgt tcttgtataa ggaaaagaaa attttgatat gccctatatt ccccttaaag
     3241 ctcaggcaga gaagatgcca aaagtggaga atttcacatt attcagagta cttgtgggag
     3301 tttgtgatgt ttttgagctt tgagaggata aagagggatt atagaaagtt ggggaatgga
     3361 acaggcataa gtatctagat tgtcctagat tctgtttccg actttgccaa tgattcactg
     3421 tgtatattgc agtaaattat tttgtctcaa ctgtgtattt tcctctttga aaaatggaga
     3481 ccttggtgtt atagatctac ctacctcctc tgggtgtgac aaggtcattt tgaaaaggtt
     3541 agatataata catgaaagat gccaagcaaa tttatattaa gcttcattaa tgtagtccca
     3601 tacattaaat tttactactt tggtaaaaat aaaagacatt aaaaatgat
//
LOCUS       RATSHIIA     3064 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Rat K+ channel protein (KShIIA) mRNA, complete cds.
ACCESSION   M34052
KEYWORDS    K+ channel protein.
SOURCE      Rat brain, cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3064)
  AUTHORS   McCormack,T., Vega-Saenz de Miera,E.V. and Bernardo,R.
  TITLE     Molecular cloning of a member of a novel class of K+ channel genes
            in mammals
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by R.Bernardo, 07-MAY-1990.
FEATURES       from  to/span     description
    pept         80     1771     K+ channel protein (KShIIa)
BASE COUNT      777 a    718 c    729 g    840 t
ORIGIN      
        1 gccacggtga acatctggcc cacgagagct ttaacttggt gctgtgttcg ccttccctag
       61 tcatgtctga gccacagaga tgggcaagat cgagaacaac gagagggtga tcctcaatgt
      121 cggaggcacc aggcacgaaa cctaccgcag cactctcaag acccttcctg gaactcgcct
      181 ggcccttctc gcctcctctg aacctcaggg cgactgcctg actgctgcgg gtgacaagct
      241 gcagccgctg ccccctccgc tgtctccacc gccgcgaccg cctcccttgt cccctgtccc
      301 cagcggctgc ttcgagggcg gcgcaggcaa ctgcagttcg cacggtggca atggcagcga
      361 ccaccctggg ggaggccgcg aattcttctt cgatcgccac ccaggagtct tcgcctatgt
      421 gctcaactac taccgcacgg gcaagctgca ctgccccgcc gacgtgtgtg gaccgctctt
      481 cgaggaagag ctggcattct ggggcatcga tgagaccgac gtggagccct gctgctggat
      541 gacctacagg cagcaccggg acgcggagga ggccctggat atcttcgaga cacccgacct
      601 catcggaggc gaccctggtg atgatgagga cctagggggc aagagactgg gcattgagga
      661 tgctgcgggg ctgggaggac ccgatggcaa gtctggccgc tggaggaagc tgcagcctcg
      721 catgtgggct ctctttgagg acccctattc atccagagcc gctaggttta ttgcttttgc
      781 ttctctgttc ttcattttgg tttccatcac aaccttttgc ctggagacac acgaagcttt
      841 caatattgtt aaaaacaaga cagagccagt catcaacggc accagcgctg ttctccagta
      901 tgaaatcgaa acggatcctg ccttgacata tgtggaagga gtgtgtgtgg tgtggtttac
      961 ttttgaattt ttagtccgta ttgttttctc gcccaataaa cttgagttca tcaaaaatct
     1021 attgaacatc attgactttg tggccatcct ccccttctac ttagaggtgg gactcagcgg
     1081 gctgtcttcc aaagcggcta aagatgtgct cggctttctc agggtggtta ggtttgtgag
     1141 gatcctgaga atcttcaagc ttacccgcca tttcgtaggt ctgagagtgc tcggacacac
     1201 tcttcgtgcg agcaccaatg aatttttgtt gctgatcatc tttctggctc tgggagtttt
     1261 gatattcgct acgatgatct actacgctga gcgagtaggg gctcaaccta atgatccctc
     1321 agcgagtgag cacacacagt tcaaaaacat ccccattggt ttctggtggg ctgtggtgac
     1381 catgactacc ttaggctatg gggatatgta cccccaaaca tggtcaggga tgttggtggg
     1441 ggccttgtgt gctctggctg gagtgctgac catagctatg cctgtgcccg tcattgtcaa
     1501 caattttggg atgtactact ccttggcaat ggcgaagcag aaacttccaa gaaaaagaaa
     1561 gaagcacatt cctcctgccc ctctggcaag ctcacctaca ttttgcaaga cagaattaaa
     1621 catggcttgt aacagtaccc agagtgacac atgtctgggc aaagaaaacc ggcttctgga
     1681 acataacaga tcagtgttat caggtgacga cagtacagga agtgagccgc cattatcacc
     1741 ttccggaaag gctccccatc agacgctcta gtaccagaga caaaaacaga agaggggaaa
     1801 catgtttcct gttgacgaca ggtgattaca cgtgcgcttc tgatggagga atcaggaaag
     1861 ataactgcaa agatgttgtc attactggtt acacgcaagc cgaggccaga tctcttactt
     1921 aatgacttgg gaaaggcaca aaacatgaaa gaaagtgttg tacagaattt atcatggatt
     1981 tttgcctgct gaaaatggga cattgagatt tagccattca aggattgtac tggaaacttc
     2041 tgctactgaa tgtgaccgga tgtgaccggt ttgtgttcag aagagttccc gcattctctg
     2101 aggcatttaa agcttgtaaa gaaatggtgg ctggaactga aatgggcttt gccaaggaaa
     2161 tattctgctt gtcgacagac cagagtccac aaaacaactg taaataccaa cgtgtgcatg
     2221 ggtccaccct tctcacagtc tcatagcaca gggagccaca ttcattgtca gcatctgtga
     2281 agcttcaaga aatgcccaca aggctctgaa tggctcctct cgggctcatg gtggttttgc
     2341 tgcaaattac agagggcagt gaatccaatg tagtataggt ctgttttcat ttcttttatt
     2401 tgtacattac gtgttgtatc tacaaacacc gattgcctca ttttcttgca aaccacttat
     2461 atatccgctt gtgaactatg ttgtaaactt ttagtgtcta catagaaatc taaccatttc
     2521 ttagtcatgt attcatttag ttctggaacg ctgttactat ctagaagaca gtcatcacca
     2581 gtattcatcg tataagtctt ttgtcattgt tacctcaatt ataaatatta aaaatataaa
     2641 ttctggcaat gagaatattt ttttattaaa tgatcaagga aaatgtcagt atatagtaga
     2701 atattatcaa attatatcct aaaatgtcta ttttgcataa aagagatatt cttcaatcga
     2761 ttcctttttt tcgtgagctt tgtggctaat gaagcgtgtg tttgtcttca tcactgctgt
     2821 agttgaaact gtgtaagagt ttttcatctt gcttaatcaa tatttccaga atctcttagt
     2881 tcccctggga ttctgaatat actctatagc ctaattataa acccctgtat cgtgtacctt
     2941 ttgtgaccat ctcaaggtgc atgcctaatc gttgtgataa gccaatggac atgttactaa
     3001 ctgaaatgaa gaataaaagt caaacgatct ggggataaac ttgaatccta tctgattaaa
     3061 tgat
//
LOCUS       ECOHIP        726 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  E-coli hip gene for integration host factor beta-subunit (IHF
            beta).
ACCESSION   X04864
KEYWORDS    30S ribosomal subunit protein S1; integration host factor;
            integration host factor beta-subunit; rpsA gene.
SOURCE      Escherichia coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 597)
  AUTHORS   Weisberg,R.A.
  JOURNAL   Unpublished (1987) NICHD/NIH, Bldg.6 Rm.306, Bethesda MD 20892.
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 597)
  AUTHORS   Flamm,E. and Weisberg,R.A.
  TITLE     Primary structure of the hip gene of Escherichia coli and of its
            product,the beta-subunit of integration host factor
  JOURNAL   J. Mol. Biol. 183, 117-128 (1985)
  STANDARD  simple automatic
REFERENCE   3  (bases 1 to 726; revises [1],[2])
  AUTHORS   Mengeritsky,G., Giladi,H., Teff,D., Goitein,D., Weisberg,R.A. and
            Oppenheim,A.B.
  TITLE     Revised DNA sequence of the hip gene of E.coli
  JOURNAL   Unpublished (1990)
  STANDARD  simple automatic
COMMENT     **map:20';
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               INVREP      495    508       inverted repeat B
               INVREP      510    523       inverted repeat B'
               TERM        495    523       put. transcription terminator
            
               [3] Author address G.Mengeritsky
               Dept of Mol Genetics,
               Hebrew University-Hadassah Medical School
               Jerusalem, Israel
               GALINA%HUJIMD.BITNET@CUNYVM.CUNY.EDU
FEATURES       from  to/span     description
    pept    <     1       29     rpsA
    pept        190      474     IHF
    revision    455      459     gcgcc in [3]; gc in [1],[2]
    rpt          40       48     inverted repeat A
    rpt          53       61     inverted repeat A'
    signal       40       61     put. transcription terminator
    signal       76       81     -35 region
    signal       99      104     -10 region
    binding     103      116     IHF binding site
    site        475      483     Lambda-E.coli erossover point
    mut         335      335     g in wt; a in hip157 (gly is asp)
BASE COUNT      204 a    167 c    162 g    193 t
ORIGIN      
        1 aagctttcaa agcagctaaa ggcgagtaat tctctgactc ttcgggattt ttattccgaa
       61 gtttgttgag tttacttgac agattgcagg tttcgtcctg taatcaagca ctaagggcgg
      121 ctacgggccg cccttaatca atgcagcaac agcagccgct taatttgcct ttaaggaacc
      181 ggaggaatca tgaccaagtc agaattgata gaaagacttg ccacccagca atcgcacatt
      241 cccgccaaga cggttgaaga tgcagtaaaa gagatgctgg agcatatggc ctcgactctt
      301 gcgcagggcg agcgtattga aatccgcggt ttcggcagtt tctctttgca ctaccgcgca
      361 ccacgtaccg gacgtaatcc gaagactggc gataaagtag aactggaagg aaaatacgtt
      421 cctcacttta aacctggtaa agaactgcgc gatcgcgcca atatttacgg ttaagttttt
      481 atactaagtt ggcattataa aaaagcattg cttatcaatt tgttgcaacg aacaggtcac
      541 tatcagtcaa aataaaatca ttatttgatt tcaattttgt cccactccct gcctctgtca
      601 tcacgatact gtgatgccat ggtgtccgac ttatgcccga gaagatgttg agcaaactta
      661 tcgcttatct gcttctcata gagtcttgca gacaaactgc gcaactcgtg aaaggtaggc
      721 ggatcc
//
LOCUS       GPIGPC3A     5077 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Guinea pig complement C3 protein (GPC3) mRNA, complete cds.
ACCESSION   M34054
KEYWORDS    complement C3.
SOURCE      Guinea pig (strain Hartley) liver, cDNA to mRNA.
  ORGANISM  Cavia porcellus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Hystricomorpha; Caviidae; Caviinae.
REFERENCE   1  (bases 1 to 5077)
  AUTHORS   Auerbach,H.S., Burger,R., Dodds,A. and Colten,H.R.
  TITLE     Molecular basis of complement C3 deficiency in guinea pigs
  JOURNAL   J. Clin. Invest. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by H.S.Auerbach 07-MAY-1990.
FEATURES       from  to/span     description
    pept         28     5028     complement C3 protein (GPC3) precursor
    sigp         28       93     complement C3 protein signal peptide
    matp         94     2040     complement C3 beta chain
    matp       2053     5025     complement C3 alpha chain
    signal     5055     5060     polyA signal (put.)
BASE COUNT     1174 a   1537 c   1504 g    862 t
ORIGIN      
        1 cctgggtcgc tcggctctcc ctgcaccatg ggacccgccg caggtcccag cctgctgctg
       61 ctgctgctgg ccagcgtctc cctggccctg ggggacccca tgtactcgat catcaccccc
      121 aacatcctgc ggctggagaa cgaggagacc gtggtgctgg aggcccacga agtgcaaggg
      181 gacatccctg tcacagtcac tgtccacgac tttccggcca agaagaatgt gctgtccagt
      241 gagaagaccg tgctgaccag tgccacggga tatctaggca ctgtcaccat caagatcccc
      301 gccagcaagg agttcaagtc cgataagggg cgcaagttgg tggtcgtgca ggccgccttt
      361 gggggcacgc agctggagaa ggtggtgctg gtgagcctgc agagcgggta cctcttcatc
      421 cagacggaca agaccatcta cacgccgggc tccaccgtgc tctaccggat cttcaccgtg
      481 gacagcgacc tgctgcccgt gggccggacc atcattgtca ccatcgagac acccgacggc
      541 atccccatca agcgggacac cctgtcttcc aacaaccagc atggcatctt gcccttgtcc
      601 tggaacatcc ccgagttggt caacatgggg cagtggaaaa tccaggcctt ctacgagaac
      661 tcgcccaagc aagtcttctc ggctgagttc gaggtgaaag aatacgtgct gcccagcttc
      721 gaggtcctgg tggagcccac ggagaagttc tactacatcg acgatccgaa aggcctggag
      781 gtcaacatca tagccaggtt cctctacggg aagaacgtgg acggaacggc ctttgtcatc
      841 ttcggggtcc aggacggtga ccagaggatc tccctggccc agtccctcac ccgagtcgtg
      901 atcgaggacg gcagcggaga ggtcgtgctg agcaggcagg tgctgctgga cggggtgcag
      961 ccctccaggc ccgaggccct ggtggggaag tccctgtacg tgtctgtcac ggtcatcctg
     1021 cactcaggca gcgacatggt ggaggcagag cgcagtggga tccccatcgt cacctccccc
     1081 taccagatcc acttcaccaa gacgcccaag tacttcaagc cggctatgcc cttcgaaatc
     1141 atggtgctgg tgacgaaccc cgacggctct ccggcccccc acgtccccgt ggtgacccag
     1201 ggctccaacg tgcagtctct cacccaagct gacggcgtgg ccaggctgag catcaacacc
     1261 cccaacaccc ggcagcctct gagcgtcacg gttcaaacaa agaaaggggg catcccggac
     1321 gcccggcagg ccatcaacac catgcaggcc ttgccctaca ccaccatgta caactccaac
     1381 aactatttac acctctccat gccgcgcacc gagctcaagc ctggcgagac catcaacgtc
     1441 aacttccacc tgcgaagcga cccgaaccag gaagctaaga tccgctatta cacctacctg
     1501 atcatgaaca aggggaagct gctgaaggtg ggtcggcagc ctcgagagcc gggccaggcc
     1561 ctggtggtgc tgcccatgcc catcaccaag gagctgattc cctccttccg cctggtggcc
     1621 tactacactc tcatcggtgc cagcgcgcag agggaggtgg tggccgactc tgtgtgggca
     1681 gacgtcagag actcctgtgt gggcacgctg gtcgtcaaag gcggctccgg gaaagatggc
     1741 caggacaagc ggcagcagca tctaccccgg cagcagatga ccctcaggat agagggcaac
     1801 cagggggccc gcgtgggcct ggtggccgtg gacaagggtg tgtttgtgct gaacaagaaa
     1861 cacaaactga ctcagagtaa gatctgggac gtggtggaga aagcggacat cggctgcacc
     1921 ccgggcagcg ggaaggacta cgcgggcgtc ttcacggacg cggggctgtc cttcaagtcc
     1981 agcaaggcgg ggctgcagac ggcccagcgg gaaggtctgg actgccccaa gcccgccgcg
     2041 cgccgccgcc gctcggtgca gctcatggag aggaggatgg acaaagccgg gaagtacaag
     2101 agcaaggagc tgcgcaggtg ctgcgaggac ggcatgcgcg aaaaccccat gcagttttcc
     2161 tgccagcgcc gcgcgcgcta cgtgagcctg ggcgaggcgt gcgtcaaggc cttcctggac
     2221 tgctgcacct acatggccca gctgcgccag cagcaccggc gcgagcagaa cctgggcctg
     2281 gcccggagtg acatggatga agacattata ccagaagagg acatcatttc cagaagccag
     2341 tttccagaga gctggctgtg gaccatagag gaactgaagg agccagagag aaacggcatc
     2401 tccaccaaga ccatgaacat cttcctcaaa gacagcatca ctacttggga gattctggcc
     2461 gtgagcttgt cagataagaa aggcatctgc gtggcggacc ccttcgaggt cacggtgatg
     2521 caggatttct tcatcgacct gcggctaccc tactccgtgg tacgcaacga gcaggtggag
     2581 atccgcgcgg tcctctacaa ctaccgcgag gcccagagtc tcaaggtgag ggtggagctg
     2641 ctgcacaacc ccgccttctg cagcctggcc accgccaaga agcgccacac ccagacggtg
     2701 accatcggac ccaagtcctc tgtggcggtg ccctacgtcc tggtgcccct gaagatcggc
     2761 ctgcaggagg tagaggtcaa agctgccgtc tacaactact tcatcagcga cggcgtgaag
     2821 aagaccctca aggtcgtgcc cgaagggatg agagtcaaca aaacggtggc cattcgcact
     2881 ctgaacccag agcagctggg ccaaggcggc gttcagaggg aggaaatccc agctgcagac
     2941 ttaagcgacc aagtcccaga caccgattct gagaccaaga ttctcctcca agggaccccg
     3001 gtggcccaga tggccgagga cgctgtggac gcggagcggc tgaagcacct gatcatcacg
     3061 ccctcgggct gcggagagca gaacatgatc ggcatgacgc ccaccgtgat cgcggtgcac
     3121 tacctggacc agacggagca gtgggagaag ttcggcctgg agaagcggca ggaggcgctg
     3181 aacctcatca acagggggta cacacaacag ctggctttca aacagcccaa ctgggcctac
     3241 gcggctttca agaaccgggc gtccagcact tggctgaccg cctacgtggt gaaggtcttc
     3301 tcgctggcgg ccaatctcat tggcatcgac tcggaggtcc tctgcggggc ggtgaaatgg
     3361 ctgatcctgg agaagcagaa gcccgacgga gtcttccagg aggacgggcc cgtgatacac
     3421 caagagatga ttggtggtgt ccggaccgcc caggaggctg atgtgtccct cacggccttc
     3481 gtgctcattg cattgcaaga ggctaaagat atctgcagag cacaggtcaa caacctggaa
     3541 gcgaacatca acaaagctgg cgattacatc gagtcaagat acgcagacgt gaggagaccg
     3601 tacacgctgg ccatcgctgg ctacgccttg gccctgctgg agaggctgaa cggcgcaacc
     3661 ctccaaaaat ttctaaacgc agctacagag aagaaccgct gggaggaggc ccgtcagaag
     3721 ctgtacagcg tggaagccac gtcctacgcc ctcctggccc tgctcctgct gaaggacttc
     3781 gacgcggtgc ctcccgtcgt gcgctggctc aacgaacaga gatactacgg caggggctat
     3841 ggctccaccc aggccacctt catggtgttc caagccttgg cccagtacca gacagatgtc
     3901 cccgatcaca aggacctgaa catggaggtg gccctccagc tgcccagccg cagctccccg
     3961 tccaaatttc gcctcgtctg ggaagccggt agcctcctgc ggtcagaagc gaccaagcaa
     4021 aacgagggct tcaaactgac agctaaaggg aagggccagg gcacgttgtc ggtggtggct
     4081 gtgtactatg ccaagacgaa acgtaaagtt gtctgcaaga acttcgacct cagggtcact
     4141 ctgaagccgg cccccgatac agtcaagaag cctcaggagg ccaagagcac catgatcctt
     4201 ggcatctgta ccaggtacct gggagaccag gatgccacta tgtccatcct ggacatatcc
     4261 atgatgaccg gtttcattcc tgacacagat gacctcaagc tgctggccac cggcgtggac
     4321 agatacatct ctaagtacga gatgaacaaa gacttcagca agaacaccct catcatctac
     4381 ctggacaagg tctcacactc cgaggaagaa tgtctgtcct tcaaaatcca ccagttcttc
     4441 aatgtggggc tcatccagcc tggctccgtc aaggtctact cctactataa cctagatgaa
     4501 acatgtaccc agttctacca cccggagaag gaggacggga tgctgaacaa actctgccac
     4561 aaggacttgt gccgttgcgc agaggaaaac tgcttcatac agctgccgga gaagatcacc
     4621 ctggatgagc ggctggagaa ggcctgcgag cccggggtgg actacgtgta caagaccaag
     4681 ctgctcaaga tggagctgtc ggatgacttc gatgagtaca tcatgacgat cgagcaggtc
     4741 atcaagtcag gctccgatga ggtgcaggcc gggaaggagc gcagattcat cagccacatc
     4801 aagtgcagag acgccctgca cctgaaggag gggaagcact acctcatgtg gggcctctcc
     4861 tcggacctgt ggggggagag acccaacatg agttacatca tcgggaagga cacctgggtg
     4921 gaagcgtggc ccgaggcgga agaatgccaa gacgaggaga accagcagca atgccaggac
     4981 ctgggcactt tcacagagaa catggtggtt tttggctgcc ccaactgacc acgcccgtca
     5041 gtcatcctcc ctccaataaa gcttcgggtc tatggca
//
LOCUS       HUMPP14B     8076 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human placental protein 14 (PP14) gene, complete cds.
ACCESSION   M34046
KEYWORDS    placental protein 14.
SOURCE      Human (cell line GM1416) DNA, clone PP14G.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 8076)
  AUTHORS   Vaisse,C., Atger,M., Potier,B. and Milgrom,E.
  TITLE     Human placental protein 14 gene: Sequence and characterization of a
            short duplication
  JOURNAL   DNA (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Vaisse, 04-MAY-1990.
FEATURES       from  to/span     description
    pept       3019     3114     placental protein 14 precursor (PP14), exon 1
               3510     3649     placental protein 14 precursor (PP14), exon 2
               4036     4109     placental protein 14 precursor (PP14), exon 3
               5482     5592     placental protein 14 precursor (PP14), exon 4
               6644     6748     placental protein 14 precursor (PP14), exon 5
               7019     7035     placental protein 14 precursor (PP14), exon 6
    sigp       3019     3072     placental protein 14 signal peptide
    matp       3073     3114     placental protein 14, exon 1
               3510     3649     placental protein 14, exon 2
               4036     4109     placental protein 14, exon 3
               5482     5592     placental protein 14, exon 4
               6644     6748     placental protein 14, exon 5
               7019     7032     placental protein 14, exon 6
    pre-msg    2971     8010     PP14 mRNA and introns
    IVS        3115     3509     PP14 intron A
    IVS        3650     4035     PP14 intron B
    IVS        4110     5481     PP14 intron C
    IVS        5593     6643     PP14 intron D
    IVS        6749     7018     PP14 intron E
    IVS        7036     7786     PP14 intron F
    signal     2891     2899     CAAT box
    signal     2942     2948     TATA box
    signal     7992     7997     polyA signal
BASE COUNT     1636 a   2226 c   2203 g   2011 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcttta cctagcccta cgtcctgaag attttctctc acgctttctt ctaaaagttg
       61 tatagtttta tgttttactt ttaaactatg agttaatgca tatgtcaggt gtgagtttta
      121 gatggaggtt cttcgtttgc atgggatcga ttgcacgtga tgaattgctc cagcaccatt
      181 tgttgcaaag actgtccttc ttttagaggg actcccgctt gccaggcctc tggtttaatg
      241 aaacatgacc agagtgactc catcttaacg tgaataacta gacactcaca aggcacctat
      301 aaggttatat aacgaggcta tgctgctcga tactgactac gacaatttcc tgtttctctg
      361 cctacaggac accaccaccc ccatccagag catgatgtgc cagtacctgg gtgggtctca
      421 cagcacatga gctcaacgtg ggtgagaggc agcagctact tccatggctg ggaaccctgg
      481 ggagctgaca actggcttcc tgtccacctc agtgcctgtg ggctggtttt ttctttcttg
      541 gttttttttt tatttgtttg tttgctttgt tttttttgag acagggtctc attctgtcac
      601 tcaggctgga gtgcagtggc gtgatctcgg ctcactacaa cctccacctc ccagattcaa
      661 gcgattctcg tgcctcagct tcccgagtag ctgggattac aggcgcgagc ttatggtctg
      721 aaaatacccg catactaagc tgaccaccaa ttataactgc agaacattta tgcccatacg
      781 aggcatctcc caccaagcct ggagaatgta ccgatgacct gggagtgcag ggggttatct
      841 ttgctcacaa ataacgtcaa cgagtaggct gaggctgaag ggcaaatggt cattgatcac
      901 actaggagcc cctatcttta gcgagtgcat ctgcatgatc caggtttcac tgtagctcat
      961 tacagcttct tacaaacaga ggcactcaca gaggacgggc gttcctcctg ctcgctgagg
     1021 ttgcccggct ctggcacaga gtcatttccc ataaacttgc tttcactgtg ctctgtgagt
     1081 caccttgaat tctttcccgt gtgagatcta agaacccatt cttggggtct ggactgggac
     1141 cctcttttcc gacaacactt cccccacgca ctgttcttgc agctttgtta aaggtctgtt
     1201 gttgttgtgt gtgagtccat gtttggactc tctatttccc attgatctat gtgtccattt
     1261 ctctgccaat accacaatct cttggtgact attgataata gtgcttgtaa acttgggtag
     1321 actaactctt tatactttat tcttcttcaa aatggcttta gcaattgtag ctcctttgcc
     1381 ttgacatgta aattttagaa aaaaaaattt ctctatatat ctacatatgg tggggtgtgg
     1441 tggctcacac ctataattcc agcactttgg gaggccgagg tgagcagatc acttgaggtc
     1501 agcagttcga gaccagcttg gccaaaatag tgaaaacccc tctctactaa aaagacaaac
     1561 attagccagg tgtggtggtg ggcgcctata atcccagcta ctctggaggc tgaagcagca
     1621 ggattacttg aacctgggat gcagaggttg cagtgagtca agatcgtacc actgcactcc
     1681 accctgggca acagagcaag acttggcctc aaaaaaatat atatatgtat atatgtgtgt
     1741 gtgtgtgtgt atacacacac acacacacac atgtctatag ctacctacaa aatatcttgc
     1801 tgagaatttg ataggaattg cattaaatct ctaggttgag ttgaggagag ctaacacctt
     1861 tgctgtgttg aagcttccag cccatgaaca cattgtgcct gtcctttacc ttaggtcttt
     1921 gatttccctt ccttgggttc agagtttaag tcctgtccat atttggtgag atttttgcct
     1981 acatgctgtg tgtgtgatta caaatcctga atttccagtt tcgggttctg tctactcctt
     2041 gacactgtgt gggaacacgg tggatttttg tgcgattgac cttgctgacc tcactcggtt
     2101 ctaggagggg ttttgtagag ttcttgcaat tttgtacctg ggcaatgatg ttatatgcga
     2161 atagagacag gtatacctcc ccctctccaa tctgcctgcc tttagttacc atttcttgcc
     2221 tagctgctct ggccggaatg acctgctctg atggatgggg gagtgggagt ggacacccgt
     2281 gtcttgcctc aggactcaag tgctctccag ggagtgagat gcgggctcga ttgtaagtta
     2341 taaagtgatg cttccctgtg ggaaaaagta catccaatat agactgtagg acaaagtctg
     2401 aaagtccact gcctcccatt tccacccagt cttgcctgtg agtcagagag aaacagtgct
     2461 aacaggcagt gtgcacctgt ccagaggctg gctgtttgag ggacacaggt gtgcacacct
     2521 ggggatgtct gagtggaagg tacaggtcag gattatgacc gtgcagtcag tcacccactg
     2581 gcatgatgtc tgtggcatcc tggggcagcc atggggctct caggccctct gcctgcccca
     2641 caggccatac ccctgccctg gacacagctg tcctcagtgc tggcctctga cccaacattg
     2701 tccaggagcc ccaacccaga aggtgctccc gccgctgcca gcctggaccc gacccaggcc
     2761 cctcccgcct gaggccctgc caagaactgc ccagcccgga cacagaggag gttccgcgtg
     2821 gacgcaggga agagcctccc attgccccag tggaggaagc tgcccagggg ccaaggatga
     2881 gtcacaggtt cgaggaatca catggcgagg ctgtgggcgg ggatcttgtc tgccctcctc
     2941 ctacataagg ccccctgagc ccacactgcc tcagcatccc tctggctcca gagctcagag
     3001 ccacccacag ccgcagccat gctgtgcctc ctgctcaccc tgggcgtggc cctggtctgt
     3061 ggtgtcccgg ccatggacat cccccagacc aagcaggacc tggagctccc aaaggtttga
     3121 ggctggggga gcgggcactt tactgtggga ggcctggggc gggtgggagc tgcgggcagg
     3181 cgggaagcca ggatctcaga aacctacagg aagcacagaa tggacgccat gacgtcagga
     3241 agccctcagc cctgctctcc atctttaggg tggcctctct ggtttcccag catcctaggt
     3301 gactcattat ttggactttg gaacactcct gagttagcac acactggtca ttttaagtac
     3361 aggaaatttc atagcccagg atctggtaga tagcagacaa ccatccaatg ctcactgtac
     3421 ccatcccagt tagactcagc cccgtctgca ccgggtgcaa cgagagccat ggtggggtgg
     3481 gaccgccgtg cagcccaagg ccccctcagt tggcagggac ctggcactcc atggccatgg
     3541 cgaccaacaa catctccctc atggcgacac tgaaggcccc tctgagggtc cacatcacct
     3601 cactgttgcc cacccccgag gacaacctgg agatcgttct gcacagatgg tgggtttctc
     3661 atcattgaga cgggctgggc gggggctcag tctcccccct caggggtcca ggactgggtg
     3721 ggttgggcgg agctggactt agccccaggc attttctgac agccaggggc ttcactgtgg
     3781 cccttccatg agggtggggt ggaaaaccag ggctccagac gttccctgtc cccttggatc
     3841 ccctgcccca ggctctgggc caacagccaa ccacacagtg cagccccagg tcagactgag
     3901 gagaaggtct gggcggctgc gggctgcggt gctccttgga cccggggaag ttcccgtggt
     3961 gacctgattt taggagtgac agtgaaggca actccaattc aagtggccac tcatcctatt
     4021 gtcaccacct ttcagggaga acaacagctg tgttgagaag aaggtccttg gagagaagac
     4081 tgagaatcca aagaagttca agatcaactg tgagtgtccc caggccccaa gggctggctc
     4141 agtgctggca tgctagccac gctctcccag aggcggctct gctggggcat gagggagtgg
     4201 ggcctggcct gtccccactc tctctgcttc agggagtcag agtgtttact ccggtcaacc
     4261 tgatgctgac cccagaggca tcttttacct ggagggcagg ggaagcacta attcttggca
     4321 tgacatgact ggatgtgggt ctgcactgtg cccaggccaa ggggacaggt gctttgttgc
     4381 actgttcact ctggcctcac aaaaggccag ggaggctgca ggcgagcagg tgggcaggtg
     4441 ggcaggtggg taggtgggta ggtggatatg tatacaggtg gcaggagggt aggtgaacag
     4501 gtgggtaggt gggcaggtgg ctaggtgagt aagtggttag gtgaacaggt gggcaggtga
     4561 gcaggtggtt aggtgaacag gtgggcaggt ggataggtga acaggtgggc aggtgggtag
     4621 gtgggtaggt atacaggtgg acaggtgggt aggtggacag gtgggcaggt gagtaggcga
     4681 acaggtgggt gggtgaacag gtggccaggt gaacaagttg gtaggtgggc aggtgggtag
     4741 gtgggtaggt gaacaggtgg gcaggtgggc aggtgggcag gtgggcaagt ggctgctgtt
     4801 cccgtgggcc tggctgcctc ctgcgcactc tggggctgca gctctggtct taggctgagc
     4861 tcccaggcct ctctggggga agagagaggg gcttacagca tgtccttggt ccactgaatt
     4921 cttcctaaca atttgcaaca ttttgttcta ttttgttaat tattattttt ttaaaaagac
     4981 agaggtggtc agggtctggg gcctcttatc ccctcatggg cacattttcc cagcaaatac
     5041 agtttgcttc tcatgcttgg gacttgcctc aggcctttct gaccctgctt gccctcccca
     5101 gaatcgagcc actctccaag gtccatttct tctccctcct cccgcccctg tgccctgttc
     5161 ctgtgccatc tcccgccatc ctcacccgta cgtgacttct cagttggagt ctctccaggt
     5221 cacagcctcc ctgcctgccg tgtctgcctc tccacggcac acctggcctc tcgccctcag
     5281 ccggggctcc atggccctcc acattgcctc tcctcccctt tcctccctgg cttccctgat
     5341 catggtccac agcaggggcc acgtcccatg gtgtcagtga tgaggaagcc acttagtgtg
     5401 gtgggatgtc cacacacctg cacaggactc tgctgagacg gaggcttcat cttccttttg
     5461 gttcttctct tctttcccca gatacggtgg cgaacgaggc cacgctgctc gatactgact
     5521 acgacaattt cctgtttctc tgcctacagg acaccaccac ccccatccag agcatgatgt
     5581 gccagtacct gggtgggtct cacagcacat gagctcaacg tgggtgagag gcagcagcta
     5641 cgtccatggc tgggaaccct ggggagctga caactggctt cctgtccacc tcggtgcctg
     5701 tgggctgact ttttctttct tggttttttt tatttgtttg ttgtttgctt tgtttttttg
     5761 agaaagggtc tcattctgtc actcaggctg gagtgtagtg acgtgatctc ggctcactgc
     5821 aacttctgcc tcccagattc aagcgattct cgtgcctcag cctccagagt agctgggatt
     5881 acaggcgcac gccaccatgc ccagctaatt tttgtatttt ttggtagaga cggggtttca
     5941 ccatgttggc caggctggtc tccaactcct ggatcaagtg atccacccgc cttggcctcc
     6001 caaaggctgg gattacaggt catccaccac gcccggccag gctgagtttt tctccagcgg
     6061 ttcatcgagt cctctgacaa agcaaggagc tgatataggg ccagtgggac ggtcgccagt
     6121 caaggggctg ggcttggtgg atagattaat actcactggg cgtccagtca aaacgccctg
     6181 aaacctatga tgctgtcaac caaacgaagg ccaggaatac caaaatagcc acataggcac
     6241 agcccttccc catgtttctg agcacagtgt ttcctctggg gtcacacagg tgtcttcttg
     6301 atcagcctca gccatgcttg gtgagagccg ggcactggga gagccaggca ctgtgctctc
     6361 ctgtgacgct gtagacacca tcctaagctg tgcagacccc agcgctgccc agagcggagc
     6421 agagggggcc gggcaaggag tgggagctgg ggtcagggaa cctggaggtg cagtggacag
     6481 agccccggag accgccctag ggacctactc cagaccaaac tctgccagac ctcggagcac
     6541 tggggcctcc ttctctgccc tccctcctca ggcaaggcct ctggagctcc ccagctctca
     6601 tggaagcccc aggggcccag gactgaccca gcctcttcca cagccagagt cctggtggag
     6661 gacgatgaga tcatgcaggg attcatcagg gctttcaggc ccctgcccag gcacctatgg
     6721 tacttgctgg acttgaaaca gatggaaggt gagctctgcc taggacacgc ccagcctcag
     6781 ctggaggaga agctgcctct ttcttagccc gagccccctg ctggctctgc aggactcagg
     6841 tcactccttt ttggcccctc ccctgttctc ccctggcctt ctggggtgca gagccaccct
     6901 gaggtggggt cctgccctct cccaccatcc tttcatccct tctctagccc tggggctgct
     6961 gtgtccccag ctgtctcttc tctcgctgac acctccactg tcccatctcc tcccacagag
     7021 ccgtgccgtt tctaggtgag ctcctgcctg gtcctgcctc ctgggtaatg tatcagcctc
     7081 gcccactgtc tgcggctgcc tctctgggcc cctgggacag accctactgt gtccagttca
     7141 gggctgaccc tacaggaatg aactggggtc tggtcttgtg attccagaaa gccaggctgc
     7201 tgacgtcccc attcacgagc ccagcctgtg tcttgcagcc attgtattag tcacgggctt
     7261 gtgccctata gtcagacctc atgctttctt ttggggttag gggtgttggt tggaaatggt
     7321 gggggctata ggaggaggaa ggaggatggt tacatggaag ggcatgagaa gctggggacc
     7381 tgcaggtctc ggtcccacgt tctttttttt ttttcttttt ttaagatgga gtctcgctct
     7441 gtcaccaggc tggagtgcag tggcacaatc tcagctcact gcaacctcga cctcctgggt
     7501 tcaagcgatt ctcctgcctc aaccccccga gttgctggaa ccacaggcgt gtgccaccat
     7561 gcccagctaa tttttgtatt tttaatagaa acggggtttc accatgttgg ccacgatggt
     7621 ctcaatctct tgacctcatg atccccccgc tttagcctcc caaagtgctg ggatttcagt
     7681 gccacattct taagggggtg tgctcaagcc caccacatcc ttccagggct cccccgaaac
     7741 accctgctct tcctccctct acttaagtga cctgtaaacc caacagctca cctccgcctc
     7801 caggaagacc agactcccac ccttccacac ctccagagca gtgggacttc ctcctgccct
     7861 ttcaaagaat aaccacagct cagaagacga tgacgtggtc atctgtgtcg ccatcccctt
     7921 cctgctgcac acctgcacca cggccatggg gaggctgctc cctgggggca gagtctctgg
     7981 cagaggttat taataaaccc ttggagcatg tcctgtctgg atgcgcagcc actgctgggt
     8041 gtgggattca gggacgaggg cctggggtcg gggcag
//
LOCUS       PHAMP1A      1313 bp ss-mRNA            PLN       28-JUN-1990
DEFINITION  P.chrysosporium Mn-dependent peroxidase (MP-1) mRNA, complete
            cds.
ACCESSION   J04980
KEYWORDS    Mn-dependent peroxidase.
SOURCE      P.chrysosporium (strain BKM-F-1767), cDNA to mRNA, clone lambda-
            MP-1.
  ORGANISM  Phanerochaete chrysosporium
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes; 
            Agaricales; Corticiaceae.
REFERENCE   1  (bases 1 to 1313)
  AUTHORS   Pease,E.A., Andrawis,A. and Tien,M.
  TITLE     Manganese-dependent peroxidase form Phanerochaete chrysosporium:
            Primary structue deduced from cDNA sequence
  JOURNAL   J. Biol. Chem. 264, 13531-13535 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            E.A.Pease, 21-JUN-1989, for release after publication.
FEATURES       from  to/span     description
    pept         36     1184     Mn-dependent peroxidase precursor (MP-1)
    sigp         36       72     Mn-dependent perocidase signal peptide
    matp         73     1181     Mn-dependent perocidase
BASE COUNT      251 a    462 c    334 g    266 t
ORIGIN      
        1 tcagctctca aggacatccg cactcgaata tcgcaatggc cttcggttct ctcctcgcct
       61 tcgtggctct cgccgccata actcgcgccg ccccgactgc ggagtctgca gtctgtccag
      121 acggtacccg cgtcaccaac gcggcgtgct gcgctttcat tccgctcgca caggatttgc
      181 aagagactct gttccagggt gactgtggcg aagatgccca cgaagtcatc cgtctgacct
      241 tccacgacgc tattgcaatc tcccagagcc taggtcctca ggctggcggc ggtgctgacg
      301 gctccatgct gcacttcccg acaatcgagc ccaacttctc cgccaacagc ggcatcgatg
      361 actccgtcaa caacttgctt cccttcatgc agaaacacga caccatcagt gccgccgatc
      421 ttgtacagtt cgccggtgcg gtcgcgctga gcaactgccc aggtgctcct cgcctcgagt
      481 tcatggctgg acgtccgaac actaccatcc ccgcagttga gggcctcatt cctgagcctc
      541 aagacagcgt caccaaaatc ctgcagcgct tcgaggacgc cggcaacttc tcgccgttcg
      601 aggtcgtctc gctcctggct tcacacaccg ttgctcgtgc ggacaaggtc gacgagacca
      661 tcgatgctgc gcccttcgac tcgacaccct tcaccttcga cacccaggtg ttcctcgagg
      721 tcctgctcaa gggcacaggc ttcccgggct cgaacaacaa caccggcgag gtgatgtcgc
      781 cgctcccact cggcagcggc agcgacacgg gcgagatgcg cctgcagtcc gactttgcgc
      841 tcgcgcgcga cgagcgcacg gcgtgcttct ggcagtcgtt cgtcaacgag caggagttca
      901 tggcggcgag cttcaaggcc gcgatggcga agcttgcgat cctcggccac agccgcagca
      961 gcctcattga ctgcagcgac gtcgtccccg tcccgaagcc cgccgtcaac aagcccgcga
     1021 cgttccccgc gacgaagggc cccaaggacc tcgacacgct cacgtgcaag gccctcaagt
     1081 tcccgacgct gacctctgac cccggtgcta ccgagaccct catcccccac tgctccaacg
     1141 gcggcatgtc ctgccctggt gttcagttcg atggccctgc ctaaaccact cacctccggc
     1201 aatgcacctt tagtagatgt cgattctatt tggttgatta gaagtactgg gttctcgtgt
     1261 catataccac caagttttca gaatgcaagt ctcctttctt tccaaaaaaa aaa
//
LOCUS       PEAHSP177A    772 bp ss-mRNA            PLN       28-JUN-1990
DEFINITION  Pisum sativum 17.7 kDa heat shock protein (hsp17.7) mRNA,
            complete cds.
ACCESSION   M33901
KEYWORDS    heat shock protein.
SOURCE      P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 772)
  AUTHORS   Vierling,E. and Helm,K.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Vierling, 01-MAY-1990.
               University of Arizona
               Department of Biochemistry Biological Sciences West Building
               Tucson, AZ 85721
FEATURES       from  to/span     description
    pept         65      523     17.7 kDa heat shock protein (hsp17.7)
BASE COUNT      254 a    127 c    170 g    221 t
ORIGIN      
        1 caaaaatcaa aacgtgcgac aaacacaaaa tcatcccaca aagaaagcaa tggatttcag
       61 gctaatggat ttggattctc cactcttcaa cactctccat catataatgg acctcaccga
      121 cgacacaacc gagaagaact taaacgctcc aactcgaaca tatgtccgtg acgcaaaggc
      181 aatggctgca actccagcgg acgtgaaaga gcatccaaat tcatacgtgt ttatggtgga
      241 catgcctggg gtgaaatctg gtgacataaa ggttcaggtg gaagatgaga atgtgctatt
      301 gataagtggc gagaggaaga gagaagaaga gaaagaaggt gttaaatatt tgaagatgga
      361 aagaaggatt ggtaagttga tgaggaaatt tgtgttacct gagaatgcga atattgaagc
      421 tatctctgct atttctcaag atggtgttct tacggttaca gttaataaat tgcctccacc
      481 tgaacctaag aaaccaaaaa ctattcaagt taaggttgct tgatcggtgt acgatttcat
      541 gtcaacaaat cagaaggaat gtttgtcttt ttagttggtt tgtgtagcaa tggttttgtg
      601 tgttttcgcc tagttggccc tatatatgat gatcatcatg cgatgtaatt tgtaacaata
      661 tgacatgaat gaattttaat tacttggttt ttctgcttgt aacattgttg cgttgccccc
      721 atgataaaat tgagaaactg aagtattaaa gaaaagaaaa tgtttcattt ac
//
LOCUS       PEAHSP179A    700 bp ss-mRNA            PLN       28-JUN-1990
DEFINITION  Pisum sativum 17.9 kDa heat shock protein (hsp17.9) mRNA,
            complete cds.
ACCESSION   M33900
KEYWORDS    heat shock protein.
SOURCE      P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 700)
  AUTHORS   Vierling,E. and Helm,K.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Vierling, 01-MAY-1990.
               University of Arizona
               Department of Biochemistry Biological Sciences West Building
               Tucson, AZ 85721
FEATURES       from  to/span     description
    pept    <     1      469     17.9 kDa heat shock protein (hsp17.9)
BASE COUNT      209 a    123 c    170 g    198 t
ORIGIN      
        1 gataattcca agagtcttcg gtactggacg aagaaccaat gcattcgatc cattctcatt
       61 agatttatgg gacccattcc agaacttcca actcgcaaga tccgccaccg gaaccaccaa
      121 cgagacggca gcttttgcca acgctcacat tgactggaag gaaacaccgg aggctcacgt
      181 gttcaaggct gatcttcccg gagtgaagaa ggaagaagtg aaagttgaaa tagaagaaga
      241 tcgtgtgctc aagataagcg gagagaggaa aactgaaaag gaagacaaga acgacacctg
      301 gcaccgtgtt gagcgtagtc aggggagttt cctccgccgt ttcaggttgc cggaaaatgc
      361 taaagttgat caggtgaagg ctgctatgga aaacggtgtt cttaccgtta ctgttcctaa
      421 agaggaggtt aagaagcctg aagctaagcc cattcagatt acaggatgag ctcttattct
      481 tcctatattt tgatgtttgt gtctcttaat aaaatgttaa aataaaacaa ataataattg
      541 tgtgtagtcg agttccagct ttaagagatt gagacatgta tggacttggc tattacttaa
      601 gtgtagtagt ttgtgagtat tttgttgggt tatgttagtg tgtatgcaaa taactttttt
      661 gagtatgtga aagtttcttt tgattaagct gtatttatcc
//
LOCUS       PEAHSP181A    862 bp ss-mRNA            PLN       28-JUN-1990
DEFINITION  Pisum sativum 18.1 kDa heat shock protein (hsp18.1) mRNA,
            complete cds.
ACCESSION   M33899
KEYWORDS    heat shock protein.
SOURCE      P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 862)
  AUTHORS   Vierling,E. and DeRocher,A.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Vierling, 01-MAY-1990.
               University of Arizona
               Department of Biochemistry Biological Sciences West Building
               Tucson, AZ 85721
FEATURES       from  to/span     description
    pept         12      488     18.1 kDa heat shock protein (hsp18.1)
BASE COUNT      278 a    128 c    207 g    249 t
ORIGIN      
        1 ctatatcaaa catgtctctg attccaagtt tctttagtgg ccgaaggagc aatgttttcg
       61 atcctttctc cctggacgtc tgggatcctt tgaaggactt tccattttca aattcttcac
      121 cttccgcttc attccctcgt gagaatcctg cttttgtgag cacacgagtt gactggaagg
      181 aaacaccgga agcgcatgtt ttcaaggctg atcttcctgg gctgaaaaag gaggaagtga
      241 aagttgaagt tgaagatgat agggttctac agataagcgg agagagaagc gttgagaaag
      301 aagataagaa tgatgaatgg catcgcgtgg aacgtagcag tggaaagttc ttaagaaggt
      361 tcagattgcc tgagaatgct aaaatggata aagtgaaagc ttccatggag aacggcgttc
      421 tgacagtgac cgttccaaaa gaagagataa agaaggctga ggttaagtct attgagattt
      481 ctggttaaac ttagaatgag ctatgttact ctgttgcttt tcttggttat aatgttttcc
      541 tttttgtggc gtgtgcaaga aataaatggt catgtaattc tgaaatgtta atgtataaat
      601 aaataagtaa acagttgttg ttggttattc agaggtgtta tagtattcat attgtaatgt
      661 atcagaatga atcttgagaa aagagctgct ataaatagag cttgaagttt taaataaaaa
      721 aaaaggttcc agaaaggaat aaaaaactgg taacagctag cagagagaaa aagctcaaac
      781 cactgtgtta aggtgaacag cggaagaaaa tgaagagatg ttcatagccc ttcttcttga
      841 gtctctccaa gatggagaat tc
//
LOCUS       PEAHSP227A    795 bp ss-mRNA            PLN       28-JUN-1990
DEFINITION  Pisum sativum 22.7 kDa heat shock protein (hsp22.7) mRNA,
            complete cds.
ACCESSION   M33898
KEYWORDS    heat shock protein.
SOURCE      P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 795)
  AUTHORS   Vierling,E. and Helm,K.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Vierling, 01-MAY-1990.
               University of Arizona
               Department of Biochemistry Biological Sciences West Building
               Tucson, AZ 85721
FEATURES       from  to/span     description
    pept         44      637     22.7 kDa heat shock protein (hsp22.7)
BASE COUNT      261 a    122 c    170 g    242 t
ORIGIN      
        1 ccaagttcca aacctcaaga acaaaaaaca cacatttcta agtatgagtc tgaaacctct
       61 aaacatgtta ctcgttccat ttcttctgct tattctcgcg gctgattttc ctttgaaagc
      121 aaaagcatca ctactaccat tcatagattc tcccaacact ctcttatcgg atctctggtc
      181 tgatcgtttc ccagatccgt ttcgcgtctt agaacaaatt ccctatggag ttgagaaaca
      241 cgaaccatcc ataacattgt cacatgctag agtagactgg aaggaaactc cagagggaca
      301 tgtgataatg gtggacgtgc ctgggttgaa aaaagatgat ataaagatag aagtggaaga
      361 gaatagggtg ctaagagtga gtggtgagag gaagaaagaa gaagataaaa aaggagatca
      421 ttggcacaga gttgaaagat cttatggaaa gttctggagg cagtttaaat tacctcaaaa
      481 tgttgatttg gattctgtca aagctaaaat ggaaaacggt gttcttactt taactcttca
      541 taagttgtcg catgataaga ttaaaggtcc tagaatggtt agtattgtgg aagaggatga
      601 caaaccatct aagatcgtca atgatgagtt gaaataatta tgtgatttgt actcataaaa
      661 atgaaaaatg ttttttcatt gtgttatttg tgaataaagg aatgttacct atgatattgg
      721 ttgtttgttg tatgtcaact aaagagtgct gtaaaggctt gttaatttca tagtgaataa
      781 cttgttggct tttgt
//
LOCUS       LACNISC      1222 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  L.lactis insertion sequence IS981.
ACCESSION   M33933
KEYWORDS    insertion sequence.
SOURCE      L.lactis (strain LM0230) DNA, clone pKM40.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 1222)
  AUTHORS   Polzin,K.M. and McKay,L.L.
  TITLE     Identification, DNA sequence and distribution of IS981, a new, high
            copy number insertion sequence in the lactococci related to the
            IS2/IS3 family of IS-elements
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by K.M.Polzin 27-APR-1990.
               University of Minnesota
               Dept. of Food Science
               1334 Eckles Avenue, St. Paul, MN 55108
               e-mail: kayla@molbio.cbs.umn.edu
FEATURES       from  to/span     description
    pept         58      318     ORF1
    pept        330     1169     ORF2
    binding      44       50     ribosomal binding site (put.)
    rpt           1       40     5' inverted repeat
    rpt        1183     1222     3' inverted repeat
    signal       33       38     -10 region
    signal      269      274     -35 region
    signal      294      299     -10 region
BASE COUNT      442 a    233 c    206 g    341 t
ORIGIN      132 bp upstream of PvuII site.
        1 gatgttatcc ttaaatctta gagtcactat tgtataattt agacaaagga caaaaacatg
       61 caaaaacgct actcaaaaga atttaaagaa acccttatcg ccttctatca ttctggtcaa
      121 tccgtcaccc agctgtctaa agaatatgac gtggcccctg caacaattta taaatggata
      181 gacctctact ctaaatctaa tgaaagctcc gtctctaaag ctgattttct agaattaaaa
      241 agacaactgg ctaaagttaa ggaagaacga gacatcttaa aaaaagtatt gaccatattc
      301 gccgagaaaa agaagtgagt gctgcggata tggctcaaac catacaaact ttagcactca
      361 atgtcagact aagctgtcaa ctccttgatg ttcctgaatc aagttattat gaacggatta
      421 accgacaccc atctaaaact caattaagga gacaatacct gtcactcaaa atttctcaac
      481 tcttcaatgc taaccgagga atctatggtg ctcctaaaat tcatcatctt ctacttaaac
      541 aaggggaaaa agtcgggtta aaactggtac agaagctaat gaagcaactt caactcaagt
      601 ctgtagtcat taagaaattt aagcctggat actcactaag tgatcacatc aatcgaaaaa
      661 atctcataca gactgaacct acaaagaaaa ataaggtttg gtcaaccgac attacttata
      721 ttcctactca acaaggatgg gcttatctct caaccattat ggatcgttat actaaaaaag
      781 tcattgcttg ggatttgggc aagcgaatga ctgtagaatt agtgcaaaga actttaaata
      841 aggccattaa atcacaagac tatccagaag ctgttattct tcattctgac caaggaagcc
      901 agtatacgag tctagagtat gaagagttgc ttaagtatta tgggatgact cactctttca
      961 gtcgaagggg atacccttat cataatgcca gtcttgaatc ttggcatgga catttaaaaa
     1021 gagagtgggt gtaccaattt aaatataaga actttgaaga agcctatcag agtattttct
     1081 ggtacatcga agccttttat aattcaaaac gaatccatca aagtttaggg tatcttacac
     1141 ctaatcaatt tgaaaaggta agtgcttaaa ataaatagat taaaattcta cgtttgttac
     1201 tctaaaaact tgacttaacg tc
//
LOCUS       DRONCDA      2294 bp ss-mRNA            INV       28-JUN-1990
DEFINITION  D.melanogaster non-claret disjunctional protein (ncd) mRNA,
            complete cds.
ACCESSION   M33932
KEYWORDS    non-claret disjunctional protein.
SOURCE      D.melanogaster (strain dp cl cn bw) 0-4 hr embryo, cDNA to mRNA,
            clone pNB40.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2294)
  AUTHORS   Mcdonald,H.B. and Goldstein,L.S.B.
  TITLE     Identification and characterization of a gene encoding a
            kinesin-like protein in Drosophila
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.B.Mcdonald, 30-APR-1990.
FEATURES       from  to/span     description
    pept        111     2168     non-claret disjunctional protein
    site        180      319     alpha helical domain
BASE COUNT      622 a    661 c    603 g    408 t
ORIGIN      1 bp upstream of EcoRI site; chromosome 3 map position 99BC.
        1 gaattgataa aatcggttgc aaggaggcag acgtatcttc taagttaggc acaacacagt
       61 tggcgatgga atcccggcta ccgaaaccgt cgggcctgaa gaaaccccaa atgccgatta
      121 aaaccgtgct gcccacagat cgaattcgcg caggattggg aggtggagcc gctggagcag
      181 gcgccttcaa tgtcaatgcc aaccagacat actgcggcaa cttattgccg cccctctcaa
      241 gggacctcaa caatctgccc caggtgctgg agcgtcgcgg aggaggagca cgtgccgcct
      301 ccccagagcc catgaagttg ggccaccggg ccaagctgag acgtagccgt agcgcttgcg
      361 acatcaacga actgcgtggt aacaagcgca ctgcggctgc tccttcattg cccagcattc
      421 ccagcaaagt atcccgcctg ggcggtgcac tcactgtttc cagccagcga ctagtgcgtc
      481 ctgcggcgcc ttcgtcaata acagcaacag ctgtcaaaag accaccagta acgcgtcctg
      541 ctccacgggc tgcaggagga gcagccgcca agaaaccagc aggaacagga gcagcagctt
      601 cgtcaggagc cgcggctgct gctcccaagc gcatcgctcc ctacgacttc aaggcccgct
      661 tccacgatct gctagagaag cacaaggtgc ttaagacaaa gtacgaaaag caaacagagg
      721 acatgggcga gctggagtcc atgcctcagc aactggagga gacgcagaac aagcttatcg
      781 agacggagtc ctcgctgaag aacacccaga gcgacaacga gtgtcttcag aggcaggtga
      841 agcagcatac cgccaaaatt gaaacaatca catcgacgct gggcaggacc aaagaggagc
      901 tatccgagct gcaagcaata catgagaaag taaaaacgga gcatgctgct ctaagcacag
      961 aagtggtgca tctgcgccag cgcaccgagg aactcctgcg ctgcaatgag cagcaggccg
     1021 ccgagctgga gacctgcaaa gagcagctct tccagtcgaa catggagcgc aaagagctgc
     1081 acaacacggt catggacctg cgcggcaaca tccgggtctt ctgtcgaata cgaccgccgc
     1141 tggagtccga ggagaaccgt atgtgttgca cctggaccta tcacgacgag tccaccgtgg
     1201 agctgcagag cattgacgca caggccaaaa gcaagatggg ccagcagatc ttctcattcg
     1261 accaggtctt ccacccgctc tcctcgcagt cggacatctt cgagatggtc tcgccgctca
     1321 tccagtcggc cctggatggc tacaatatct gcatctttgc ctacggacag acgggcagtg
     1381 gcaagaccta cacaatggac ggagtgccgg agagtgtggg cgtcataccg cgcacggtgg
     1441 atctgctctt cgactccatc cggggatatc gcaacttggg ctgggagtac gagatcaagg
     1501 ccacctttct ggagatctac aacgaggtgc tctacgatct gctgagcaac gagcagaagg
     1561 acatggagat tcgaatggcc aagaacaaca agaacgacat ctacgtgtcc aacataacgg
     1621 aggagacggt tctggatcca aatcacctgc gccacctcat gcacacggcc aagatgaacc
     1681 gtgccaccgc ctcgacagct ggcaacgagc gctcctctcg ttcccacgcg gttaccaagc
     1741 ttgagctcat cggacgccat gccgaaaagc aagagatctc cgtgggttcc ataaacctgg
     1801 tggatttggc cggctctgag tctcccaaga cgagcacccg gatgaccgag acaaagaaca
     1861 tcaatcgctc gctatcggag ctcaccaacg taatcctggc gctgctgcag aagcaggacc
     1921 acatcccgta caggaactcc aagctgacgc acctgctgat gccctcgctg ggcggcaact
     1981 cgaaaacgct tatgttcatc aacgtctcgc cgttccaaga ctgtttccaa gagtccgtca
     2041 agtcgctgcg cttcgcggcc tccgtaaact cctgcaaaat gaccaaggcc aagcggaatc
     2101 gctacctgaa caactcggtg gccaacagca gcacacagag caacaacagc ggcagtttcg
     2161 ataaataaag aatgcattct gagcccagtt ttaacaattt tcaaatttct aacctgttat
     2221 tgcttaattt atgtgtgttt acttttagtg caaataaact aataaagtgc tggaaaaaaa
     2281 aaaaaaaaaa aaaa
//
LOCUS       ECOPHOAJ     3133 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  E.fergusonii alkaline phosphatase (phoA) gene, complete cds.
ACCESSION   M33966
KEYWORDS    alkaline phosphatase.
SOURCE      E.fergusonii (ATCC 35469) DNA.
  ORGANISM  Escherichia fergusonii
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3133)
  AUTHORS   DuBose,R.F. and Hartl,D.L.
  TITLE     The molecular evolution of bacterial alkaline phosphatase:
            Correlating variation among enteric bacteria with experimental
            manipulations of the protein
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.F.DuBose, 01-MAY-1990.
FEATURES       from  to/span     description
    pept        544     1962     alkaline phosphatase (phoA) (EC 3.1.3.1)
    site        466      486     pho box regulatory sequence
BASE COUNT      874 a    711 c    774 g    774 t
ORIGIN      
        1 aactattaat taagcaattt tttaattcgt ggaattataa tgcgcgagaa acgtttcgcg
       61 ctaatttgtg ttaagaatat ttgccaaaag tattcgttat gtcataaaaa tactactttc
      121 agactctagg aagttttgag tctaaatgtt aacgtcacca tactttggat aaggaaatgc
      181 agttatgaaa aatcttattt ctgagttgtt gattaaactt gcacaaaaag aagaagagtc
      241 aaaggagttg gttgcccagg tagaagccct ggagattatc gtcacggcta tgttgcgtaa
      301 tatggggcaa cctgaacaaa aacgcgtaat cgaccagata gaaggggctt tggccggggt
      361 aaagcctgat gccagcgttc ctgacggcga tacggaaatg ctgcgcgcat atgtaaagaa
      421 gctcttaagg catccccggc aataagacac agaaccgaca cgacactgtc ataaagttgt
      481 catcctcgct acctatagtc gctttgtttt tattttactg tatttataca tggagataat
      541 aaagtgaaac aaagcgctat tgcattggcc ctgttatctt gtctgattac cccggtaagc
      601 caggctcaaa cttctcaaaa tatcaatatc ctggaaaatc gtgccgctca gggtgatatt
      661 acaatgccag gtggcgctcg ccgtttaagt ggcgatcaaa ctgaggcatt acgtgcatct
      721 cttaatgata agccagctaa aaatattatt ttattaattg gcgatggcat gggggattct
      781 gaaattactg ctgcacgaaa ttatgcagaa ggtgcaggtg gttattttaa agggatcgat
      841 gcgttgccat taaccgggca gtacactcac tatgcattgg acaaaaaaac cggcaaacct
      901 gattacgtga cagattctgc tgcatccgca accgcatgga cgacaggcgt caaaacatac
      961 aatggtgcgc tgggtgtcga tatccatgaa aatcctcata ccactattct tgagatggca
     1021 aaagcagccg gtctggcaac aggcaacgtg tctacggctg agttgcagga tgcgacacct
     1081 gctgcgttgg tttctcatgt cacctcacgc aaatgctatg gcccttctgt taccagtgaa
     1141 aaatgcccgg gcaatgcact ggaaaagggg gggaaaggtt caatcactga acaactgttg
     1201 aatgcccgtg cggatgtgac gttgggtggt ggtgcaaaaa cgtttgctga aacggcaacc
     1261 gctggagagt ggcagggtaa aacattgcgt gagcaagctc tggctcgcgg ttatcagatt
     1321 gtcagcgatg ccgcatccct cgcagcagtg acgcaagcag ggcaggacaa acctctgctc
     1381 gggttatttg ctgaaggaaa tatgcctgta cgttggcatg ggccaaaggc gagctatcac
     1441 gggaatctcg ataagcccgc agtaacctgc acgccaaatc ctcagcgtaa cgaaactgta
     1501 ccgacactgg cacagatgac agataaagct attgaattac ttagcaaaaa tgagagaggt
     1561 ttcttcctgc aagtagaagg ggcatctata gataagcaag atcacgcggc taatccatgt
     1621 ggacagattg gtgagacggt tgatctcgac gaagctgttc aacgtgctct ggaatttgcc
     1681 aaaaaagatg gcaatacact ggtgatcgta actgcagatc atgcgcattc cagccagatt
     1741 gtggcaccag ataccaaagc gccaggttta actcaggcgc taaatacgaa agatggtgcg
     1801 gtgatggcga taagctacgg taactctgaa gaagactctc aggagcatac cggtagccag
     1861 ctgcgcatcg cagcgtatgg gccgaatgcc gctaatgtgg tcggactgac cgatcaaacg
     1921 gatcttttct acaccatgaa agccgctctg ggcttgcaat aatccatacc gacggccatt
     1981 aattgcgccg tcgggtggtt ttttcaccat cagcagccag acttataaac actatccttc
     2041 gggcgtattt tccggcccga aggaaactgg tgaagttgca acctcacaaa aggatggtgc
     2101 gatgaaaatg acattactgg ttaccctgct atttgggctg gtttttttaa ccactgtcgg
     2161 tgctgccgag agaacattaa cgccgcaaca acagcgtatg accacctgta atcagcaagc
     2221 tactgcacag gcgttaaaag gtgatgcccg aaaaacgtat atgagcgatt gcctgaaaaa
     2281 cagcaaatct cccccgaatg aaaaaagcct gacaccacaa caacaaaaga tgcgtgaatg
     2341 taatacccag gcgacggaac aatcactgaa aggtgacgac cgtaataaat ttatgagtgc
     2401 gtgtctgaag aaagctgcat agatccccct tagtgcttac gggtgagaca cgcaaacgcc
     2461 tcacccgaaa tatcatactt ctccctcacc ccccctccta taatttggga aatttgtttc
     2521 taaatattcc caaaaaaata atgaatgatg aaaacttcct taaaaaagcg gtcgcaccgg
     2581 gtgaagaacc accttcttca cctcaaaacg aacatcaacg ttccgggtta cgctttgccc
     2641 gtcgtgtgag gctaccaaga gttgtcggtt ttggcggtat gttctcccca tcgcctcggc
     2701 gctggtggag caaccgatcg ctggatggtg gtggttgatc cttgttggct ggtcattcgt
     2761 ttggccacat ctggcctggc aaatgtccag tcgtgcaatg gaccccttaa accgtgaaat
     2821 ttacaaccta aaaacggatg cgattactgc cgggatgtgg gttggcataa tgggtgttaa
     2881 cgcactacct tcaacgacga tgctgatgat tatgtgcctt aacctgatgg gcgccggagg
     2941 acttaggttg ttcgtcgccg ggctggtatt gatggtagtc gcatgtctgg tgacgctaca
     3001 gttgaccggc atccgcgtcg cgtttaccag ttcgcccctt gagtggtggc tttcactccc
     3061 tgtcattgtg ttgtatccct taattttcgc ctgggtcagc tatcagacag caactaaatt
     3121 ggcagagcat aaa
//
LOCUS       SMAPHOAA     2672 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  S.marcescens alkaline phosphatase (phoA) gene, complete cds.
ACCESSION   M33965
KEYWORDS    alkaline phosphatase.
SOURCE      S.marcescens (ATCC 13880) DNA.
  ORGANISM  Serratia marcescens
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2672)
  AUTHORS   DuBose,R.F. and Hartl,D.L.
  TITLE     The molecular evolution of bacterial alkaline phosphatase:
            Correlating variation among enteric bacteria with experimental
            manipulations of the protein
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.F.DuBose, 01-MAY-1990.
FEATURES       from  to/span     description
    pept        667     2094     alkaline phosphatase (phoA) (EC 3.1.3.1)
    site        575      595     pho box regulatory sequence
BASE COUNT      588 a    845 c    749 g    490 t
ORIGIN      
        1 ggccagtgaa ttcctggcgc tgaacccgaa cggcaagatc ccggcgatca tcgatcccga
       61 tggccccggc ggccggccgc tgccgctgtt cgaatccggc gccattctgc tctacctggc
      121 ggaaaagagc ggccgtttcc tgccgcagga cccggcgcag cgttacgaaa ccattcaatg
      181 ggtgttcttc cagatggccg ccgtcggccc gatgttcggc cagctcggtt tcttccataa
      241 attcgccggg cgcgaatatg aagacaagcg gccgctggag cgctacaaaa acgaatcgaa
      301 acgcctgctc ggcgtgctgg aaacgcgact tgaaggccgc gactggatca tgggcgcgga
      361 gtacaccatc gccgatatct cgctgctggg ttgggtgcgc aacctgatcg gcttctatga
      421 ggcgcgcgaa ctggtggagt tcgacagttt ccctcgcgtc gcgcagtggc tggagcgcgg
      481 tctggcgcgc ccggcggtgc agcgtgggct cgctatcccg gcgcgcgcgg cctgagcgtc
      541 aaacgggatg gccgccgggc cgtcccgcct tccgctgtca taaaattgtc atccctctgc
      601 tctactctcc tttcgtaaca atttgatgtt atttacattg ttgtattacg acaaggagct
      661 taccccatgc aacctgctgt atccctgatc gccggcgccg tactgtccgc cctgctctgc
      721 tcctccgcca tcgcggcgga aacctcggcc aatgccgatg gcctgaccga tcgcgccgcg
      781 cgcggcaacc tcgtcgaacc gggcggcgcc cgccgtctgg ccggcgatca gaccaccgcg
      841 ctgaaggcat cgctgtcgga caaaacggcg aaaaacgtga tcctgctgat cggcgacggc
      901 atgggggatt ccgagatcac cgccgcgcgc aactacgccg aaggcgcagg cggttacttc
      961 aagggcatcg acgccctgcc gctgaccgga caatacaccc attactcgct ggacaggaaa
     1021 acccacaagc cggactacgt gaccgactcc gccgcctccg ccaccgcctg ggccaccggg
     1081 gtgaaaacct ataacggcgc gctcggcgtg gacgtgaacg gcaaggatca accgacgctg
     1141 ctggagatcg ccaaggcggc cggcaaggcc accggcaacg tctccaccgc cgagttgcaa
     1201 gacgccacgc cggcggcgct ggtctcgcac gtcatctcac gcaagtgcta tggcccggag
     1261 gaaaccagcg agaaatgcgc cgctaacgcg ctggaaaatg gcggccgtgg ctccatcacc
     1321 gaacaactgc tgaaaacccg cgccgacgtc acgctgggcg gcggcgcgaa atcgttcaac
     1381 cagctggcaa aaagcggcga atggcaagga aaatcgctga aggaccaggc ggcggcgcag
     1441 ggttatcagt gggtgagcaa cgccgacgaa ctgcaggccg ttaccctcgc caaccagcaa
     1501 aaaccgctgc tggggctgtt cgccgacggc aatatgccgg tgcgttggct ggggccgaaa
     1561 gccagctacc atggcaacct cgacaaaccg gcagtgacct gcgaaaacaa cccggcgcgc
     1621 accgccgcca cgccaacgct ggcggcgatg accgaaaaag cgatcgcgct gttgaaagac
     1681 aacccgaacg gcttcttcct gcaggtagag ggcgcgtcga tcgataaaca ggatcacgcc
     1741 gccaacccgt gcgggcagat tggcgaaacc gtcgatctgg acgaagcggt acaaaaagcg
     1801 ctggccttcg cccgcgccga cggcaacacg ctggtgatcg tcaccgccga ccacgcgcac
     1861 gccagccaga tcgtctccgc cgacgccaaa gcgccgggcc tgacccaaaa gctgaccacc
     1921 aaggacggcg cgccgatgac cctcagctac ggcaactctg aagaggaatc tcagggacac
     1981 accggcaccc agctgcgcgt cgcggcctac ggcccgcacg ccgccaacgt agtgggcctg
     2041 accgatcaga ccgatctgtt cttcaccatg cgcgatgcga tgggtatcaa gtaaccgacc
     2101 cctgaacgat ggccgcccgc cacggcggcc atttattatt ggttaccgct aggtatttgc
     2161 aatttaaccg caattattgt gtgataacga aataacgtct aattacgcgg caattaaccc
     2221 atgaaaataa cgcggttata cggcctttct tccctctctg tttattcctg cgacagcaac
     2281 cggccgtttt ttgcttatat tcattttttt tagttacaaa agtagttttt aacattctcg
     2341 caaacaataa taccccgcca gtaacgtgaa ccgatcaatt tgcatccgga cggcctgcaa
     2401 tagttcactt ttcgttaaat accctattaa caaaaaggca ttattgctgc ctatcggctt
     2461 tcgccgggcg gtcggcttca ttatttccag ggtaactccc gctaaattct ttcacctgca
     2521 atgagtagaa tttacacaca agagatttaa ttcaatgaga aaattactcg tcctgatttt
     2581 cagtctcagc ctgtttggcg ttacgcaaca ggcggcggct gaagagaaga ataaaaccgt
     2641 cgacctgcag gcatgcaagc ttggcactgg cc
//
LOCUS       STYMALGA     1232 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  S.typhimurium inner membrane protein genes (malG and malF) genes,
            complete cds, and 3' end.
ACCESSION   M33921
KEYWORDS    .
SOURCE      S.typhimurium (strain LT-2) DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1232)
  AUTHORS   Francoz,E., Schneider,E. and Dassa,E.
  TITLE     Sequence of malG in Salmonella typhimurium
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     malG protein is proposed to form with proteins malK, malF, and malE
            a hetero-oligomeric protein complex that performs the ATP-dependent
            translocation of maltose and of maltodextrins through the inner
            membrane of enterobacteriaceae.
            
            Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Dassa, 30-MAR-1990.
               Institut Pasteur
               Unite de Programation Moleculaire et Toxicologie Genetique
               Inserm U169, CNRS URA 271
               e-mail: hofnung%pasteur.bitnet@cunyvm.cuny.edu
FEATURES       from  to/span     description
    pept    <     1       64     inner membrane protein (malF)
    pept         79      969     inner membrane protein (malG)
    binding      66       70     ribosomal binding site (put.)
    signal     1050     1073     rho-independent transcription termination
                                 signal (put.)
BASE COUNT      241 a    336 c    332 g    323 t
ORIGIN      91 minutes on the Styphimurium map.
        1 cttcctgctg gtaggcgcgc tggcaatagt gaacctgaaa gccacgcgta tgaagtttga
       61 ttaagggaga taataacaat ggctatggtc caacccaaat ctcagaaact gcgtctcctc
      121 atcacgcacc tggggctgct gattttcatc gcagcgatta tgttcccgct gctgatggtc
      181 atcgctatct cgctacgtga agggaacttc gccaccggaa gcctgatccc ggataaaatc
      241 tcctgggaac actggcggct ggcgctgggt ttcagcgtgg aacatgcgga tggtcgggta
      301 acgccgcctc ccttcccggt cctgctgtgg ctgtggaact cggtgaaaat cgccggtatt
      361 accgccatcg gcatcgtggc gctctccacc acctgcgcct acgctttcgc acgaatgcgt
      421 tttccaggta aagcaacact gctcaaaggg atgctgattt tccaaatgtt tccggcggtt
      481 ttgtctctgg tagcgctgta tgcgttattt gatcgcctgg gacaatacat tccatttatc
      541 gggctgaata cgcatggcgg cgtgatcttc gcctatcttg gcggcatcgc gctgcatgtt
      601 tggacgatta aaggctactt cgaaactatc gacagttcgc tggaagaggc cgccgcgctg
      661 gatggcgcaa ccccgtggca ggcgtttcgc ctggtgctgc tgccgctttc cgtaccgatt
      721 ctggcggtgg tgtttattct gtcgttcatc gcagcgatta ccgaagttcc ggtcgcctcg
      781 ttgttactgc gcgatgtgga cagctatacg ctggccgtgg gtatgcagca atatctcaac
      841 ccgcaaaact acctgtgggg cgactttgcc gccgccgccg tcctttcagc tattcctatc
      901 accctggtgt tcctgttggc gcaacgttgg ctggtcaacg gcctgacggc agggggtgtg
      961 aaaggttaag ttttatcgaa gtacctgccc ggcaacgggc aatgccactg ccctcaagtt
     1021 atgttatccc cacttatggt cgtattttgg cgctcttcgg agcgcttttt ttatgcgtat
     1081 agacagcggc taatttattg ccacgccata cgtccgttac ggtgtggcga cgcctattct
     1141 cgccgtagtc ggttcgaatt acataaccac aagtgataac caacaggagg atcgccgccg
     1201 aatagagtaa tacatccatt gctttatgat cc
//
LOCUS       ECOCLPPA     1236 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  Escherichia coli ATP-dependent clp protease proteolytic component
            (clpP) gene, complete cds.
ACCESSION   J05534
KEYWORDS    ATP-dependent protease.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1236)
  AUTHORS   Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J.,
            Bowers,B. and Gottesman,S.
  TITLE     Sequence and structure of ClpP, the proteolytic component of the
            ATP-dependent Clp protease of E.coli
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.Gottesman, 23-APR-1990.
FEATURES       from  to/span     description
    pept        378     1001     ATP-dependent protease (clpP)
    mRNA        279  >  1236     clpP mRNA
    pept    <     1      133     ORF
    signal      244      249     -35 region
    signal      267      272     -10 region
    signal      165      193     transcription termination signal
    binding     367      372     ribosome binding site (put.)
BASE COUNT      329 a    291 c    318 g    298 t
ORIGIN      
        1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga
       61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa
      121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc
      181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg
      241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac
      301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt
      361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg
      421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt
      481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta
      541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc
      601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc
      661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt
      721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga
      781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg
      841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat
      901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg
      961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg
     1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt
     1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc
     1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg
     1201 cgcaagctga ttgccggtcc atccgtgtat atctgc
//
LOCUS       PLYORIA       375 bp ds-DNA             VRL       28-JUN-1990
DEFINITION  Human polyomavirus BK (strain WWT) origin of replication.
ACCESSION   M34048
KEYWORDS    origin of replication.
SOURCE      Polyomavirus BK (strain WWT) DNA, from human urine.
  ORGANISM  Polyomavirus BK
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Polyomaviruses.
REFERENCE   1  (bases 1 to 375)
  AUTHORS   Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P.,
            Subramani,S. and Traavik,T.
  TITLE     At least two type of control regions can be found among naturally
            occurring BK virus strains
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by T.Johansen, 07-MAY-1990.
               University of Tromso
               P.O. Box 977
               N-9001, Tromso, NORWAY
               e-mail: terje.johansen@esuit.uit.no
FEATURES       from  to/span     description
    orgrpl       97      119     polyomavirus origin of replication
    site          1      375     regulatory region
    site        143      210     P block
    site        211      249     Q block
    site        250      302     R block
BASE COUNT      123 a     86 c     95 g     71 t
ORIGIN      
        1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
       61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
      121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
      181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
      241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt
      301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta
      361 aactggacaa aggcc
//
LOCUS       PLYORIB       424 bp ds-DNA             VRL       28-JUN-1990
DEFINITION  Human polyomavirus BK (strain UT) origin of replication.
ACCESSION   M34049
KEYWORDS    origin of replication.
SOURCE      Human polyomavirus BK (strain UT) DNA, from human urine.
  ORGANISM  Polyomavirus BK
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Polyomaviruses.
REFERENCE   1  (bases 1 to 424)
  AUTHORS   Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P.,
            Subramani,S. and Traavik,T.
  TITLE     At least two type of control regions can be found among naturally
            occurring BK virus strains
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by T.Johansen, 07-MAY-1990.
               University of Tromso
               P.O. Box 977
               N-9001, Tromso, NORWAY
               e-mail: terje.johansen@esuit.uit.no
FEATURES       from  to/span     description
    orgrpl       97      119     polyomavirus origin of replication
    site          1      424     regulatory region
    site        143      210     P block
    site        211      249     Q block
    site        250      261     R block
    site        263      315     P block
    site        316      349     Q block
    site        350      361     R block
BASE COUNT      134 a    103 c    105 g     82 t
ORIGIN      
        1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
       61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
      121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
      181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
      241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa
      301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc
      361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa
      421 ggcc
//
LOCUS       ECOUNCGA       85 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  E.coli H+-ATPase gamma subunit (uncG) gene, 3' end.
ACCESSION   M34095
KEYWORDS    H+-ATPase.
SOURCE      E.coli (strain KF12) DNA, clone pBWG1.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 85)
  AUTHORS   Iwamoto,A., Miki,J., Maeda,M. and Futai,M.
  TITLE     H+-ATPase gamma-subunit of Escherichia coli: Role of the conserved
            carboxyl-terminal region
  JOURNAL   J. Biol. Chem. 265, 5043-5048 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       84     H+-ATPase gamma subunit (uncG) (AA at 1)
BASE COUNT       21 a     24 c     23 g     17 t
ORIGIN      
        1 ctgcagttgg tatacaacaa agctcgtcag gccagcatta ctcaggaact caccgagatc
       61 gtctcggggg ccgccgcggt ttaaa
//
LOCUS       HUMPIM1B     1737 bp ds-DNA             PRI       28-JUN-1990
DEFINITION  Human protein kinase (Pim-1) promoter DNA.
ACCESSION   M34228
KEYWORDS    protein kinase; proto-oncogene.
SOURCE      Human cell line K562 DNA, clone pLH/P.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1737)
  AUTHORS   Meeker,T.C., Loeb,J., Ayres,M. and Sellers,W.
  TITLE     The human Pim-1 gene is selectively transcribed in different
            hemato-lymphoid cell lines in spite of a G+C-rich housekeeping
            promoter
  JOURNAL   Mol. Cell. Biol. 10, 1680-1688 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    mRNA       1699  >  1737     Pim-1 mRNA (alt.)
    mRNA       1700  >  1737     Pim-1 mRNA (alt.)
    mRNA       1703  >  1737     Pim-1 mRNA (alt.)
    mRNA       1705  >  1737     Pim-1 mRNA (alt.)
    mRNA       1706  >  1737     Pim-1 mRNA (alt.)
    mRNA       1732  >  1737     Pim-1 mRNA (alt.)
    mRNA       1733  >  1737     Pim-1 mRNA (alt.)
    mRNA       1735  >  1737     Pim-1 mRNA (alt.)
    binding     891      900     Sp1 binding site
    binding    1422     1431     Sp1 binding site
    binding    1433     1442     Sp1 binding site
    binding    1496     1505     Sp1 binding site
    binding    1222     1248     AP2 binding site
    binding    1290     1308     AP2 binding site
    binding    1662     1685     AP2 binding site
    binding    1340     1366     PPF-348 binding site
BASE COUNT      360 a    546 c    426 g    405 t
ORIGIN      
        1 ctttgaatac tcaggaggtg aatttgggtc atttcatctc ctctcttagc tgctgtatga
       61 ccctgtgcca tcctatttat gtttcatttt cctatctgca aaaagggagt aattcttgtt
      121 cccttgggtt ttctcctagt cataaaaaat gaatgagttc ggccacagta cacaaacaaa
      181 agaaaatatg taatattttg taggataaat tcataaaagt tgtggaggat ctgggcactt
      241 ttataaagca agcactgtag aaaactttca aaggaaacat ttaaattcac taatgacagc
      301 tatagggatc ctgatttttc ctttaaatgg caggcacttc aaaaattaaa atagaatgta
      361 gttagctttc aattacttag gccactgaca attcaattta tggattatat agtattttaa
      421 tttactcatt tcacacgtcg ttatcaactc tacatgaagt ttaaaaggac aaaggatgca
      481 agtggaattg gtcagtgttc cgggttattt ccaagggagg cagagggtgg ggggtttcct
      541 ttgagacaag acttggggtt ggccaataat tgctggtatt gcctgcctgg taataacagg
      601 ctgatgaaaa ggtgactaca aaacgtgaaa actggttaaa tcaagcgcac cctcccaccc
      661 tagttttaga tgaggaattt tccgccctca cagaaggggc tgaggcagca tctggcatca
      721 caacactaac atttgcttcg tgatttcctc tttacccggc cctttgacac acatcccttc
      781 ccagaaatca ggattcgctg gtgcttttgc atttctaaaa tgggaatccc gtggctgagc
      841 ttttagccgg ccggaacgac tgagggctgc atccctttcg caggagcggc gtcccgcctc
      901 cagttctgtt ccaggcctga ctcctccact ccctccgtga ctcatgtctg cggatccttc
      961 gcccccgacg cgccccccaa cacacaaacc cccagaatcc gcccccagcc tacagcgcga
     1021 cgtcagcccg ccccagccga cttggaggtc tcgggtctga gtcacacaga aagaccaccc
     1081 tcgtcggcat ccccacacac agtccgacac ccggcgcgcc ggcctccccg cctgacacac
     1141 taacgcccgt cgtctccgcg caacttgtta tgctccggct cgagcccttg acccaaaaac
     1201 ctcagcgaaa cggagagccg cagagccggc ctcgggcggc ctttgatggc tttgttattg
     1261 tttgggtttg aatcgatacg cccctcccca tccttcctcc ctcgcggccc tacacccagc
     1321 tcccgcctcc cctcacgccc cgcgcccctc cccctccatt ttggcgcctt ttccttcccg
     1381 ccacgtcgtg gcggcgtaga gaccattctg accgcgagag ctgggcgggg cgggggcggg
     1441 gcgcgccgag ttatgcagat caatcggcct ctggttggct ggagtagcgc tggcaggggc
     1501 ggggccgggg cgcggccaca gagcgcgcgg ggcgggggcc gaggggagtc gcccagtccc
     1561 gccgcttccc caccccctct cctccctcgg ccggcccggc agccctgctc cccgccttgg
     1621 cctcccggag aggccccgcc ccgtcccccc ccgccgcgcc ctccccgcgc gccctccccg
     1681 ccggcgcgct cctccccttt actcctggct gcggggcgag ccgggcgtct gctgcag
//
LOCUS       ONNCG       11835 bp ss-RNA             VRL       28-JUN-1990
DEFINITION  O'Nyong-nyong virus (Gulu strain) complete genome.
ACCESSION   M20303 M33999
KEYWORDS    nonstructural protein; polyprotein.
SOURCE      O'Nyong-nyong virus (strain Gulu, isolated from Anopheles gambiae),
            cDNA to viral RNA, clones 2,4,7,8,9,11,13,14,17,17s,18,22,23,27,
            passed in BHK-21 cells.
  ORGANISM  O'Nyong-nyong virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Togaviridae; Alphaviridae.
REFERENCE   1  (bases 1 to 11835)
  AUTHORS   Levinsion,R.S., Strauss,J.H. and Strauss,E.G.
  TITLE     Complete sequence of the genomic RNA of O'Nyong-nyong virus and its
            use in the construction of alphavirus phylogenetic trees
  JOURNAL   Virology 175, 110-123 (1990)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 4079 to 7621)
  AUTHORS   Strauss,E.G., Levinson,R., Rice,C.M., Dalrymple,J. and Strauss,J.H.
  TITLE     Nonstructural proteins nsP3 and nsP4 of Ross River and
            O'Nyong-nyong viruses: Sequence and comparison with those of other
            alphaviruses
  JOURNAL   Virology 164, 265-274 (1988)
  STANDARD  full staff_entry
FEATURES       from  to/span     description
    pept         80     7624     O'Nyong-nyong polyprotein A
    matp         80     1684     nonstructural protein P1
    matp       1685     4078     nonstructural protein P2
    matp       4079     5788     nonstructural protein P3
    matp       5789     7621     nonstructural protein P4
    pept       7670    11413     O'Nyong-nyong polyprotein B
    matp       7670     8449     C protein
    matp       8450     8641     E3 protein
    matp       8642     9910     E2 protein
    matp       9911    10093     6K protein
    matp      10094    11410     E1 protein
    mRNA          1    11835     O'Nyong-nyong mRNA
    RNA        7621     7669     26S RNA
    signal    11817    11835     polyA signal
BASE COUNT     3676 a   2860 c   2859 g   2440 t
ORIGIN      
        1 atagctgcgt gatacacaca cgcagcttac gggtttcata ctgctctact ctgcattgca
       61 agagattaaa gtacccatca tggattcagt gtatgtagac atagatgctg acagcgcgtt
      121 tctgaaggcg ttgcagcaag cataccccat gtttgaggtg gaaccaaagc aggtcacgcc
      181 aaatgaccat gcaaacgcta gagcattttc gcatctagca ataaaactga tagagcagga
      241 aattgatcca gactcaacca ttctagacat tggtagcgca ccagctagga ggatgatgtc
      301 tgatagaaaa taccactgcg tctgcccgat gcgcagcgca gaagaccctg agaggctcgc
      361 gaattacgcg agaaaacttg cgtcagccgc tggaaaggtg acagataaaa acatctccgg
      421 aaaaattaat gatctacaag ctgtgatggc cgtaccgaat atggaaacat ccacattctg
      481 cctacacact gatgctacat gcaaacaaag aggagacgtc gccatttatc aagacgtcta
      541 cgccgtccat gcacctacct cgctgtacca tcaggcgatt aaaggagtcc gcgtggcata
      601 ctggataggg ttcgatacga cacctttcat gtacaatgca atggctggcg catacccatc
      661 atattcaaca aactgggctg atgagcaggt actgaaagct aagaacatag ggctgtgttc
      721 aacagaccta tctgagggta gacgaggcaa actatccatc atgagaggca aaaaattgaa
      781 gccatgcgac cgagtgctat tctcggtcgg ctcaacactc taccctgaaa gtcgtaaact
      841 tctacaaagc tggcatttac catcggtatt tcatctgaag ggtaaactca gcttcacctg
      901 ccgctgtgac acgatcgtct catgcgaagg atacgttgtc aagagagtga ccatgagtcc
      961 aggcatctac ggaaagacat cggggtatgc tgtaactcat catgccggcg gcttcctgat
     1021 gtgcaagacg acagatacag tagacggcga aagggtatca ttctccgtgt gtacttacgt
     1081 accagctact atctgcgacc agatgactgg aatccttgcc actgaggtaa ccccagaaga
     1141 cgcacagaaa ctactggttg ggctaaacca acggatagtg gtcaatggca ggacgcaacg
     1201 taatacaaac accatgaaaa actacctgct cccaatagtt gctcaggcct tcagcaagtg
     1261 ggccaaagaa tgtcgaaagg acatggagga cgaaaaactc ttgggtgtcc gagagaggac
     1321 cttaacgtgc tgttgcctat gggcatttag aaagcacaag acgcatacgg tgtacaaaag
     1381 accggatacc cagtcaatcc aaaaggtccc tgccgaattt gacagctttg tgataccaag
     1441 tctgtggtcg tcaggtttat caattccgct gagaaccaga atcaagtggc tcttgagcaa
     1501 agctccaaaa tacgagcaac taccgcacag cggaaacgcc gaggaagcag cccaggctga
     1561 aacagatgcg gtagaagaac aggaggcaga gctaacccga gaagctatgc caccattgca
     1621 ggcgacacag gatgacattc aggtagaaat tgatgtagag caacttgaag accgagcagg
     1681 agcgggcata gtcgaaacac caagaggagc aatcaaagtc acagcccaac cgtcagacct
     1741 tgttgtcgga gagtacttag tactgacacc gcaggcggtc ctgcgcagcc aaaaactcag
     1801 tctgattcac gcgcttgcag agcaggtaaa aacgtgcaca catagtgggc gagcaggcag
     1861 gtacgcggtt gaagcatacg atgggcgtgt tctagtgccc tcgggctacg cgatacccca
     1921 ggaagacttc cagagcttaa gcgaaagtgc caccatggta tttaacgagc gagagtttgt
     1981 gaaccggaag ttacaccaca tcgccatgca cggcccagcg ctgaacactg atgaagagtc
     2041 atatgaactg gtaagggtag agaaaacaga acacgagtac gtctatgacg ttgatcagaa
     2101 gaaatgttgt aagagggagg aagcaacagg actagtgcta gtaggcgact taactagccc
     2161 accataccat gagttcgcct acgaaggact aaaaatccgc ccagcatgtc catacaaaac
     2221 ggcagttata ggtgtcttcg gagtaccggg ttctggcaag tcggctataa tcaaaaacct
     2281 ggtaaccagg caagacttgg tgactagtgg aaaaaaagaa aactgccaag aaatctccaa
     2341 tgacgtaatg cggcaaagga aattggagat atctgcacgt acagtcgact cactactcct
     2401 gaatggatgt aacaagccag tggaagtact gtacgtggac gaggcattcg cttgtcattc
     2461 gggaaccctg ttggcactga tagccatggt tagaccgcgt cagaaggtcg tactttgtgg
     2521 cgacccaaag cagtgcggat tcttcaatat gatgcaaatg aaggtcaact ataatcacaa
     2581 catctgcaca caggtgtacc ataaaagcat atcaaggcgg tgtacactgc ctgtaacagc
     2641 catcgtgtcc tcgttgcatt acgagagcaa gatgcgcact acaaatgagt acaaccagcc
     2701 aatcgtagtg gatactacgg gcataacaaa accagaaccc ggggacttag tgttaacgtg
     2761 tttccgggga tgggttaagc agctgcaaat agactaccgt ggaaacgaag tcatgacagc
     2821 agctgcttct caggggctga ccagaaaagg tgtttatgca gtaaggcaga aagtcaacga
     2881 aaaccctctg tatgcaccaa catcagaaca cgttaacgtg ctattgacac gcacagaggg
     2941 caagttgaca tggaagacac tctcaggcga cccatggata aagatactgc agaacccccc
     3001 aaaaggggac tttaaggcaa caatcaagga gtgggaagca gaacacgcct ccatcatggc
     3061 aggaatatgc aatcaccaga tggcttttga cacatttcag aacaaagcta atgtatgctg
     3121 ggctaaatgc ctggtcccta ttcttgacac tgctggaatc aaattaagtg acaggcagtg
     3181 gtctcagata gtgcaagctt ttaaagaaga tagggcctac tctccagaag ttgcactgaa
     3241 tgaaatatgc actcgcatat atggggtaga cctggacagc ggactattct caaagccact
     3301 gatatccgtc tactatgcag acaaccactg ggacaataga ccaggaggaa aaatgttcgg
     3361 gttcaaccct gaggtggcac ttatgcttga aaagaaatat ccctttacaa aaggtaagtg
     3421 gaacattaac aagcagatat gtataactac cagaaaggtt gacgaattta accccgaaac
     3481 caacataata ccggccaacc gcagactgcc gcactcactc gtggctgaac accactcagt
     3541 gagaggggaa agaatggaat ggctggtaaa caaaatcagc ggtcaccaca tgttgttggt
     3601 tagcggtcat aatcttatat taccaacaaa aagagtcacc tgggtagcac cgttaggcac
     3661 ccgaggtgca gactacacat ataacctgga acttggtcta ccagccacac taggcagata
     3721 tgacctggta gttatcaata tccatactcc attccgcata catcattacc agcagtgtgt
     3781 agatcacgca atgaagctcc agatgctagg gggggactct ctacggctgt taaagccggg
     3841 aggttcactt ctgattagag cttacgggta cgccgaccga accagtgaaa gggtcattag
     3901 cgtattggga cgcaagttca gatcgtccag ggctctgaaa cctcagtgca tcacgagcaa
     3961 tacagaaatg ttcttcctat ttagccgatt cgacaatgga agaaggaact tcaccacaca
     4021 tgttatgaac aaccagctga acgcagtgta tgcaggactg gccactagag cgggctgtgc
     4081 cccgtcatac cgagtgaaac ggatggacat cgcaaagaac actgaggaat gcgtggtaaa
     4141 cgccgccaat ccgcgcggag taccaggcga tggagtatgt aaagccgtgt atagaaaatg
     4201 gccagaatca ttcagaaaca gtgcaacacc agtggggact gcaaagacaa tcatgtgcgg
     4261 tcaatacccc gtcatccacg cagtaggccc taacttctca aactattctg aggctgaagg
     4321 ggatagggaa ttggcttcag tgtatagaga agtggcgaaa gaagtgtcta ggctaggagt
     4381 gagcagtgta gccatccctt tgctctcaac cggtgtgtac tcaggaggca aagacagact
     4441 gctgcaatca ctaaaccatc ttttcgcagc gatggattcg acagatgcag acgttgtcat
     4501 ctactgcagg gacaaggaat gggagaagaa gatcactgaa gccatatcac taagatccca
     4561 ggtagaatta ctagatgatc acatctcagt ggattgcgac attgtacgcg ttcatccaga
     4621 cagcagcttg gcaggccgaa aggggtacag cacagtagag ggagcactct actcgtacct
     4681 agagggaaca agattccacc aaactgcagt agatatggca gagatatata ccatgtggcc
     4741 gaaacaaact gaagccaacg aacaggtctg cctatatgct ctgggggaga gtatagagtc
     4801 cgtcaggcaa aaatgtcccg tagacgacgc cgacgcctca ttccctccga aaacagtccc
     4861 gtgcctatgc cgttatgcta tgacgcctga acgagttgca cgtctacgca tgaatcatac
     4921 caccagcatc atagtgtgct cgtcttttcc gctgccgaaa tacaaaatcg agggcgtgca
     4981 aaaagtaaaa tgttcgaaag cactcttgtt tgatcacaac gtaccgtctc gagtgagccc
     5041 gagaacgtac aggcctgcgg acgaaatcat acagacacct caaacaccaa ctgaagcgtg
     5101 tcaggacgca caactcgtgc agtcaataaa tgatgaagca gtgccagttc cctcagactt
     5161 agaggcttgt gacgcaacta tggactggcc ctctatcggc accgtatcaa caagacaaag
     5221 acacgactca tctgacagcg agtatagtgg ctccagaagt aacatacaac tagtgacggc
     5281 ggacgtgcat gcaccaatgt acgcacattc gctggcgtcc agcggaggtt caatgctgtc
     5341 gctgtccagt gaaccagctc agaacggcac aatgatacta cttgactcag aagacacaga
     5401 cagtataagc agagtaagca caccgatcgc cccgcccaga agacgtttgg gaaggaccat
     5461 aaatgtgacc tgcgacgagc gggaagggaa aatactccct atggccagcg acaggttctt
     5521 cactgctaag ccatacactg tcgcactgag cgtatcaaca gcagatatga ctgtgtatcc
     5581 catccaggca ccgctaggat tgataccacc acctaccctc gaaccgatca ctttcggaga
     5641 cttcgccgaa ggtgaaatag acaacctcct gacaggggca ttgacatttg gggacttcga
     5701 gccaggtgaa gtggaagagc tgacggatag cgagtggtca acgtgctcgg acacagacga
     5761 agagttacga ctagacagag cagggggtta catattctcc tctgacactg gtcaaggtca
     5821 tctacagcaa aaatcagtac gtcaaacgac gctaccggta aacattgttg aagaggtcca
     5881 cgaagagaaa tgctatccac ctaaattgga tgagatcaaa gagcaactac tacttaagag
     5941 acttcaggag agtgcttcca cggctaaccg gagtaggtac caatctagaa aagtggaaaa
     6001 catgaaagcc acgattatcc acagactgaa agagggttgc agactctatt tggcgtcaga
     6061 aacaccgagg gtcccatctt accgagtcac atacccggcg cccatctact cgccttcaat
     6121 caatatcaaa ctgactaacc cagagactgc agtagcagtg tgtaacgagt ttttggccag
     6181 aaactatcca actgtggcat cctaccaagt cactgacgag tacgacgcgt acttggatat
     6241 ggtagacggg tccgaaagtt gcctagacag agctacattc aacccgtcta aactcaggag
     6301 ttacccaaaa caacactctt accacgcacc caccatcaga agtgcagtgc catcaccatt
     6361 ccaaaatacg ttgcagaatg tcttggcagc ggccacaaaa agaaactgca acgtaacgca
     6421 gatgagggaa ctacccacta tggactccgc agtgtttaac gtggagtgtt ttaagaagta
     6481 cgcttgcaac caagagtact ggagagagtt cgcctcaagc cctataaggg taacgacaga
     6541 gaatctgaca atgtatgtga cgaaactaaa ggggcctaaa gcggcggcac tcttcgcaaa
     6601 aacacacaac ttgctgccgc tacaagaggt accgatggac aggttcacaa tggacatgaa
     6661 acgtgatgtg aaagtgacac caggtacaaa gcacaccgag gaaaggccga aagtacaggt
     6721 catacaggcg gcagaaccac tggcaacagc atacctgtgt ggcatacaca gagagttggt
     6781 gagaagacta aatgcagttc tgctaccgaa tgtccacaca ctgttcgata tgtcagccga
     6841 agacttcgat gcaattatag ccacccattt caaaccgggc gatgctgtac tagaaactga
     6901 catagcctca tttgataaga gtcaagacga ctcgcttgcg tcgaccgcca tgatgttgct
     6961 agaagacctt ggggtagatc aacctatcct ggatctgata gaagcagcat tcggcgaaat
     7021 atccagttgt catctaccga cgggtacgcg gttcaagttc ggcgcaatga tgaaatcagg
     7081 catgtttcta accctgtttg tcaataccct cctgaacatc accattgcca gtcgggtgtt
     7141 agaggagcga ttgactactt cagcctgtgc agctttcatt ggggacgaca acataataca
     7201 tggggttgtc tctgacgcac taatggctgc acgttgtgct acgtggatga acatggaagt
     7261 gaaaatcatc gatgcagtag tgtcagagaa ggcgccatac ttctgtgggg gatttatttt
     7321 acacgacacg gtgacaggca cgtcgtgcag agtagcagac cctttaaaga gactgttcaa
     7381 gctaggcaaa cctctggcag ctggagacga acaggatgag gacagaagac gtgctctggc
     7441 agatgaggtt actagatggc aaagaaccgg cttagtcaca gaattggaaa aagcagtata
     7501 ttcaaggtat gaagtacaag gaataacagc cgtaataaca tcaatggcta cctttgcgaa
     7561 tagcaaagaa aactttaaga aactaagagg gcccgtcgta accttgtacg gcggacctaa
     7621 ataggtacgc actacagcta cctacttaga gcatacatcg ctaatagcca tggagttcat
     7681 accagcacaa acttactaca atagaagata ccagcctaga ccctggactc aacgccctac
     7741 tatccaggtg atcaggccaa aaccacgccg aagaaggcct gcaggacaac tcgcacaact
     7801 gatatccgca gtcagcagac tagcactgcg tacagttccc cagaaaccac gccggacccg
     7861 aaaaattaag aagcaaaagc aagtaaagca agaacaacag agtactacga accagaagaa
     7921 aaaggcgccg aaacaaaagc agacccaaaa gaaaaagaga ccaggacgaa gggaaaggat
     7981 gtgcatgaag attgaaaatg actgcatctt cgaagtcaga catgaaggaa aagtaacggg
     8041 gtatgcatgc ctagtaggtg ataaggtaat gaaaccagca cacgtgaaag gaactattga
     8101 caacgcagac ctagcgaagt tggcgttcaa aagatcatcc aaatatgatc tagagtgcgc
     8161 acagatacca gtgcacatga aatcggacgc ctcaaagttc acccatgaaa aaccagaagg
     8221 ctattacaac tggcatcacg gagcagtaca gtattctgga gggaggttca cgatccctac
     8281 aggcgcagga aagcctgggg acagcggaag accaatcttt gacaacaagg ggcgtgtcgt
     8341 ggctattgtt ctaggcggag caaacgaagg aaccaggaca gcactatctg tagtgacttg
     8401 gaataaagac atagtcacaa aaatcacacc agaggggtca gttgaatgga gccttgccct
     8461 ccctgtcatg tgcctgttgg caaatacaac cttcccatgt tcccaaccgc cttgcgcgcc
     8521 gtgctgctac gaaaagaaac cggaagaaac cttgagaatg ctggaggaca acgtcatgca
     8581 accaggatat taccagttac tcgattcagc attggcctgc tcacaacgtc gtcaaaaacg
     8641 taatgcaaga gaaaacttca atgtctacaa agtcactagg ccgtacttag cccactgtcc
     8701 tgactgcggg gagggacact catgccacag cccaatagca ttagaacgga tcagaagtga
     8761 ggcaacagat ggtaccttga aaatccaggt atctctgcaa atcggaataa agacagacga
     8821 cagccacgat tggacgaagc tacggtatat ggatagccat acacctgtgg atgcagaccg
     8881 atccgggttg tttgtcagaa cgtcagcacc gtgcaccatc acgggaacga tgggacattt
     8941 catactagca cgctgtccga aaggagagac gctgacggta ggatttgtag acagtagaag
     9001 gatcagtcac acgtgcatgc acccgttccg ccacgagcca ccgctgatag ggagagagaa
     9061 gtttcactcc cgcccgcagc atggcaaaga actaccttgc agtacatacg tccataccac
     9121 agcggcaact gctgaggaaa tagaagtgca tatgccgcca gatacccctg actacacgct
     9181 gatgacacag caagcgggaa acgttaagat cacagttgac ggccagacgg tacgatacaa
     9241 gtgcaaatgt gacggctcca atgaaggatt aataaccgct gacaaagtca taaataactg
     9301 caaagtagac caatgccaca cagcggttac aaaccacaag aaatggcaat acaattcacc
     9361 gctgaccccg cggaactccg aacaaggaga tagaaaaggt aagatccata tcccatttcc
     9421 actggtgaac acaacctgca gggtaccaaa agcaagaaat ccgactgtca catacggtaa
     9481 aaacagagtc actctgctgt tacatccaga ccacccaaca ctcctttcgt accgcgccat
     9541 gggaaggatc ccggattacc atgaagagtg gataacaaac aagaaggaaa taagtatcac
     9601 agtaccagca gaaggcttag aggttacgtg gggtaataat gacccataca aatattggcc
     9661 ccaactgtct acaaatggta ctgcgcacgg gcacccacat gaaataatcc tctattacta
     9721 tgagctgtac ccaactacca caattgctgt actagctgct gcttctatcg taataacatc
     9781 tttggtaggt ctatcattag gcatgtgcat atgcgcgaga cgcaggtgca tcacgccata
     9841 tgagctgact ccaggagcta ccatcccatt cctcctaggt gtactatgct gtgccaggac
     9901 tgcaaaagca gcatcgtact acgaagctgc aacatacctc tggaatgagc aacaaccatt
     9961 attttggtta cagcttctaa tccctctgtc agctgcaatt gttgtgtgta attgcctaaa
    10021 acttttacca tgctgctgca aaacattgac ttttttagcc gtcatgagca tcggtgcccg
    10081 cactgtgacc gcgtacgagc acgcaacagt gatcccgaac acggtgggag taccgtgtaa
    10141 gactcttgtt agcagaccag ggtacagccc tatggtctta gaaatggagc tacagtcggt
    10201 cactctggaa ccagcattat ccttggatta cattacgtgt gagtataaaa caatcacacc
    10261 gtccccgtac gtaaaatgct gtggtacagc tgaatgtaag gccaagaacc tgccagatta
    10321 taactgcaaa gtattcacag gcgtctaccc atttatgtgg ggaggagcat actgcttctg
    10381 tgacgcagag aacacacagc tcagcgaggc acacgttgag aaatcagaat catgcaaaac
    10441 tgagtttgca tcagcctaca gagcccacac agcttcagta tcagctaaac tacgtgtctt
    10501 ttaccaaggg aataatatca ccgtgtctgc atacgccaat ggtgatcatg cagttacggt
    10561 ggaagacgcg aagtttgtca tcggtccact atcgtccgcc tggtcaccat ttgataataa
    10621 gatcgtggtg tacaaaggcg aagtctacaa tatggactat ccacctttcg gcgcagggag
    10681 gccaggacag ttcggtgaca tccagagccg cacgccagac agcaaggacg tctatgcgaa
    10741 tacgcagtta atactgcaaa gaccagcggc aggagcaata cacgtgcctt actcccaggc
    10801 accttcgggc tttaagtact ggctcaagga aaaaggggca tcattgcagc atactgcacc
    10861 atttggctgt cagatagcaa caaacccggt aagagcagtg aactgtgcag tgggcaacat
    10921 accagtctcc attgacatcc cagatgcagc tttcaccagg gtcactgacg ctccttccat
    10981 cacagacatg tcctgcgaag tagcttcgtg tacccattca tctgattttg gaggtgccgc
    11041 agtcataaag tacacagcta gtaaaaaagg aaaatgcgcc gtgcactctg taacaaatgc
    11101 ggtcactatc cgcgaaccta acgtagatgt caagggaaca gcacaattgc aaattgcctt
    11161 ctcgaccgca ctagctagtg cggaattcaa ggtgcagatc tgctccacac tggtacactg
    11221 ctcagcgacg tgccatcctc ctaaagacca tatagtcaat tacccgtcac ctcacaccac
    11281 actaggagtg caggacattt caacgacagc tatgtcttgg gtccagaaga ttacaggagg
    11341 agtgggactc gtggttgcta tagctgcttt gatcttaatt atagttctct gcgtatcatt
    11401 tagcagacac taaagtacac atgccccaaa gtaatatact gtacaggtat acgtgctccc
    11461 tgagcagcac gatatatgta ttatccataa aagaaaaaca aaacaaaaat ataaaaatta
    11521 taaaatacaa aagtataaaa acaggtattg gtaccccctt agaggtacat tatttaacca
    11581 gtgaaaatag gtattggtgc ccccttagag gcacattata gaaccaggta taggtgcccc
    11641 cttagaggta caccaacaat aggtataagt gcccccttag tggcacacta accaccacaa
    11701 tcggtaatat gaagagacgt aggtatgaag cttcgtaaaa gctgccgaac ttactttaag
    11761 atgtaggcgt accgaactct tcgacaattc tccgacgcag ggacgtagga gaagttattt
    11821 tgtttttaat atttc
//
LOCUS       PNGE155A      155 bp ds-DNA             INV       28-JUN-1990
DEFINITION  P.redivivus E155 gene.
ACCESSION   M33925
KEYWORDS    .
SOURCE      P.redivivus (strain SC) adult DNA, clone E155.
  ORGANISM  Panagrellus redivivus
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; 
            Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae.
REFERENCE   1  (bases 1 to 155)
  AUTHORS   de Chastonay,Y., Muller,F. and Tobler,H.
  TITLE     Two highly reiterated DNA sequences in the low C-value genome of
            Panagrellus redivivus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.De Chastonay, 26-APR-1990.
               Institue of Zoology
               Perolles, Fribourg CH-1700
               SWITZERLAND
               e-mail: stocker%cfruni51.bitnet@cunyvm.cuny.edu
BASE COUNT       43 a     33 c     30 g     49 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccaag gtggcccccg gggccatacg gtacttaaca aatttgcata aatgtgtgat
       61 ttttgacaca aaacgagctt attaacatct ttggaattgc ccatgaatct tcatttgaat
      121 gtggattata gatcttatcg ccccccaagc ttttg
//
LOCUS       PNGE167A      167 bp ds-DNA             INV       28-JUN-1990
DEFINITION  P.redivivus E167 gene.
ACCESSION   M33926
KEYWORDS    .
SOURCE      P.redivivus (strain SC) adult DNA, clone E167.
  ORGANISM  Panagrellus redivivus
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; 
            Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae.
REFERENCE   1  (bases 1 to 167)
  AUTHORS   de Chastonay,Y., Muller,F. and Tobler,H.
  TITLE     Two highly reiterated DNA sequences in the low C-value genome of
            Panagrellus redivivus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.De Chastonay, 26-APR-1990.
               Institue of Zoology
               Perolles, Fribourg CH-1700
               SWITZERLAND
               e-mail: stocker%cfruni51.bitnet@cunyvm.cuny.edu
BASE COUNT       54 a     31 c     27 g     55 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcccga tgttcagatt tgaactttgt atgaatttga gatggcccat ttgatcattg
       61 cggattcttc agaatccgaa tctaatggct cagacccaca gaataagctt taaataggct
      121 attctaacta tattgattca actaatctaa agtatgcctt aaaacat
//
LOCUS       SNDHNGP      2600 bp ss-RNA             VRL       28-JUN-1990
DEFINITION  Sendai virus hemagglutinin-neuraminidase (HN) gene, complete cds.
ACCESSION   M34033
KEYWORDS    glycoprotein; hemagglutinin; neuraminidase.
SOURCE      Parainfluenza virus type 4A (strain Toshiba) from primary monkey
            kidney cells, cDNA to viral RNA, clone pG4HF1.
  ORGANISM  Parainfluenza virus type 1
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Paramyxoviridae; Paramyxovirus.
REFERENCE   1  (bases 1 to 2600)
  AUTHORS   Bando,H., Kondo,K., Kawano,M., Komada,H., Tsurudome,M.,
            Nishio,M. and Ito,Y.
  TITLE     Molecular cloning and sequence analysis of human parainfluenza type
            4A virus HN gene: Its irregularities on structure and activities
  JOURNAL   Virology 175, 307-312 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        343     2064     hemagglutinin-neuraminidase (HN)
    mRNA         53     2600     HN mRNA
    mRNA    <     1       52     fusion protein (F) mRNA
    signal       39       52     consensus polyA signal
    signal     2587     2600     consensus polyA signal
BASE COUNT      955 a    561 c    410 g    674 t
ORIGIN      
        1 acaaaacgac acccaatcaa acaatctcca catttcaatt ttaagaaaaa aactattata
       61 ggggggaaca cacttctcag ccctgattgc tcaaggccct tgcatgtgca accgagacac
      121 cccccacaag caccggaata agacctgaca acaaagtagc agccaccacg acccaaaaac
      181 aaaattaaaa ggatccggta acagcccatc aaccagcaat catagaatcc aacaatccag
      241 agagacgtca catcaactca tccacgaatc ttcgaaggga acatcccaga caaaatcaca
      301 gcccattccc tgatcacgga taaactgaga aagatcacaa gaatgcaaga ctcacatggt
      361 aatacacaaa tactcaacca ggcaaattca atggtgaaaa gaacatggag attactattt
      421 cgaattgcaa ccttaatatt acttgtttca atatttgtgt tatcgctcat aattgtatta
      481 cagtcaacac cggggaattt gcaaaacgat atcaatataa ttagaaagga gctcaatgaa
      541 cttatggaga attttgaaac tacatctaag tcactgttaa gtgtatcaaa tcaaatcact
      601 tacgatgtat cagtacttac tcctataaga caagaagcta ttgaaacaaa catcatttca
      661 aaaataaaag atcattgcaa agatagagta attaaagaag gaagcacttg cacattgaat
      721 cgcagccctt tgcatgatgt ctctttttta aatgggttca ataaattcta tttcacatat
      781 aaagataata tgcaaattaa gtttaaatca ttattagatt accccaattt tattccaact
      841 gctacaactc cccacggatg cattcgaatt ccatcattct ccttaggtca aacccattgg
      901 tgttataccc ataatataaa cctactagga tgtgcagacc ctgcatctag caatcaatat
      961 gtatcactag gaaccttaca agtcttaaaa atgggtgacc cttattttaa agtcgagcat
     1021 agtcattatt taaatgacgg gaggaatcga aagagttgtt cagtggttgc tgtccccgac
     1081 ggatgcctgc ggaattgtgt gaccatgaca aaaaatgaga cagagaattt caaagacctc
     1141 aattggcaac acaattactt acatacatat catataatgg taccattaaa gactcgtata
     1201 ataaatccac caggatcatc cagagattgg gttcatatcg caccaggggt aggctcgggc
     1261 cttttgtatg ccaaattact tatatttcct ttgtatgggg gtctcacgga aaaatcagtg
     1321 atacataata atcaatcagg gaaatatttt ttccctaatt caactaaatt gcaatgccgt
     1381 aacagcacta tggaaaaaat aaaaggagca aaagattcat acacaataac ttacttctca
     1441 gggagactta tacagagtgc atttctggtt tgtgatctaa gacaatttct ttctgaagat
     1501 tgtgaaatct taattcctag taatgattac atgatggtcg gtgcagaggg tcgattatat
     1561 aacattgaga acaacatatt ttattatcag agaggatcca gctggtggcc ttatccgagc
     1621 ctctatagaa tcaggttaaa ccttagtaag aaatatccta gaataactga aattaaattt
     1681 acaaaaattg aaatcgcccc aagaccaggc aacaaagatt gtccaggaaa taaggcttgc
     1741 ccaaaagaat gtataacggg agtctaccaa gatatattgc cactaagtta tcccaatact
     1801 gcatttccac acttaaaaca agcgtattat acaggttttt atcttaataa ctcgctcgag
     1861 agacgcaatc caacatttta tactgctgac aatctagatt accatcaaca ggaaagatta
     1921 ggtaaattca atcttactgc tggatactct actacaactt gttttaaaca gaccactact
     1981 gcgaggttat actgtctcta cataattgaa gtgggtgact cagtcattgg ggactttcag
     2041 atcacccttt ttttagcagc ttaatagacc agactgttaa ttaatcaaca aagttattct
     2101 gtaatataaa ctgatcttat aagtgaaaag atgcctatcc aaggaggttg atagacaaat
     2161 agtaaaagta gcaattgtaa caaaactcta aggaaaaagt aattcgagaa atattataga
     2221 ctgacttcag agcaaacaca acatcgatcc ataatagtca atataatcaa taatactcta
     2281 tgagacctta cctatcaaca gcaaaaaaca cagtccatca agcggaaccc aactcgctcc
     2341 atccttaatc atccactgaa agaaaaaata tacgaaggac catcggccac cgggtccaaa
     2401 caatctagca caaaaattca aacaaccgcc aaactctgtt cggcctcaac aaacaatccg
     2461 ccaagccatc tgtcattcct ataccaacac acaaccatcc cattcctcaa aagcaattca
     2521 atccgcgacc caaagaagac tctccacata tccagctaat ccgtcgatcc gacacatcat
     2581 cgtatctttt aagaaaaaaa
//
LOCUS       YSCRP026A    2133 bp ds-DNA             PLN       28-JUN-1990
DEFINITION  S.cerevisiae RNA polymerase II sixth subunit (RP026) gene, complete
            cds.
ACCESSION   M33924
KEYWORDS    RNA polymerase II; oncogene; transmembrane protein;
            tyrosine kinase.
SOURCE      S.cerevisiae single-copy plasmid DNA, clone 2.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2133)
  AUTHORS   Archambault,J., Schappert,K.T. and Friesen,J.D.
  TITLE     A suppressor of an RNA polymerase II mutant of Saccharomyces
            cerevisiae encodes a subunit common to RNA polymerase I, II, and
            III
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.D.Friesen, 01-MAY-1990.
               The Hospital for Sick Children
               555 University Avenue
               Toronto, Ontario, CANADA M5G 1X8
               e-mail: jim@sickkids.toronto.edu
FEATURES       from  to/span     description
    pept       1026     1045     RNA polymerase II sixth subunit (RP026), exon
                                 1
               1122     1569     RNA polymerase II sixth subunit (RP026), exon
                                 2
    IVS        1046     1121     RP026 intron A
BASE COUNT      644 a    374 c    408 g    707 t
ORIGIN      Chromosome XVI.
        1 tctagaaaaa gacatatcag gacatatatg acacttccaa tttttggtta ctagtgagtc
       61 gtcatgaata atcatgtgca tttgtaaacc attttccccc acacatggtt tgctacaaat
      121 aggacatttt aatttaggat gatcattttt tatatgcgat tgcaattgtg accatatccg
      181 aaactctttg caacaaccag caaaagtaca ttggtaagga ttttctacct caggatcgtg
      241 atgtttagaa atgtgatttc ttagcctata tggcctctga aagcttttat tgcagtgtgg
      301 acaggttagt ttatgtaaat gaacagataa aatatgtgcc cttaattgtg gatgcttgta
      361 gaatcggagg ttgcatcctt cctctggaca aatgaaagat ttggtatgcg ttacttcgtg
      421 tcgcttcagt tgctggcgag tcgtcactcc ttttccacaa taagaacatt ggaatggttt
      481 cgtatcagaa tgcgtataca agtgtctctc taagtgactc tttttaacga aggattttgc
      541 acactatcac actgaaatgc tcttaaaccc tgatgtacgc ttaattggtg ttcagtcaaa
      601 attgaaggtc ttgtaaatgc ctatcacagc atcatagtca cagaaatatg tctttggcct
      661 attgctcgat gagcttctag ttgatgttaa actgttcaat gattctgaac tctctgatcg
      721 tgatatggga atggtctcct gtttaagttc ggccagtggc attccttcat tatttagaac
      781 ctctcctccc atttgacagc gatttcaaca gttactactg tgcttaatgc ccttatttga
      841 tagttagttc ttcttataat aaataatcat tgtatattga ttattcgtat agtatgatat
      901 ttaattgaaa aaaaaatttt ttttttttta agaatatcat tcaaaggatc aatcacaacc
      961 ttgaagaaag gctaaaaaga cacattttgc aggtaacagt gtaaagatta aggctacaag
     1021 tgatcatgtc agactacgag gaggcgtatg tgcattcata ttggagaaat tgcgtgcttt
     1081 agttttaatt atatttacta acttctatca ttgtgataca ggtttaacga cggtaatgag
     1141 aacttcgaag attttgatgt agagcatttt tctgatgagg agacttatga ggaaaaacct
     1201 caattcaagg atggtgaaac aaccgatgcc aacggtaaga ccatcgttac tggtggtaat
     1261 ggcccagaag attttcaaca gcatgagcaa ataagaagaa agacacttaa ggaaaaggcc
     1321 atcccaaaag accaaagagc aactactcca tacatgacca aatatgaaag ggcaagaatt
     1381 ttaggtacaa gagccctaca aatttccatg aatgcgccag tttttgtaga tttagaaggt
     1441 gaaactgatc cattgcgtat tgccatgaag gagttggctg aaaagaaaat tcctttggtt
     1501 attagaagat atttaccaga tggttccttt gaggactgga gtgtggagga actcattgtg
     1561 gatttgtgat tacacctgca ttttccttta tgtgtatttg ctttgtttgt atattattat
     1621 aactataaag aaccgattta ttttgtaatg caaataactg gacgagagaa ccacgctata
     1681 acctatttgg taataaaaat gtttgtatga ataaaaaagc tgctgactgc tttctttgat
     1741 atgtgtatat agaatattta tcggtgattg atctaagaaa agcttacctc tgaaagttct
     1801 ctcttcgtcg cttgctgatt tgtagtaaac aatttgaaaa gtttagcaaa ttcttcctca
     1861 ggattttcaa aaccggcttc ctttaaagat caatgacctc attcaaggga acgtttaaat
     1921 cgtgtcctct tccaatggct tttaaacttt cttccaaatt cttcacgctc ggggaaactg
     1981 cgataagttc ttccccatga tagataagaa tatgggaaag cttacacctt cttctccaac
     2041 ttccgcggta ctggtgtcat tgtcaggtac cattttggac cattcctcat ctgtcaatgt
     2101 cttccccaat gttgcatata tcttggtaag atc
//
LOCUS       MUS25OAS     1412 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  Mouse 2'-5' oligo A synthetase mRNA, complete cds.
ACCESSION   M33863
KEYWORDS    2'-5' oligo A synthetase.
SOURCE      Mouse cell line L-929, cDNA to mRNA, clone L3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1412)
  AUTHORS   Coccia,E.M., Nissim,A. and Chebat,J.
  TITLE     A full-length murine 2-5A synthetase cDNA transfected in NIH-3T3
            cells impairs EMCV but not VSV replication
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by E.M.Coccia, 26-APR-1990.
FEATURES       from  to/span     description
    pept         36     1139     2'-5' oligo A synthetase
    signal     1389     1394     poly-A signal
BASE COUNT      340 a    371 c    390 g    311 t
ORIGIN      
        1 ccaggctggg agacccagga agctccagac ttagcatgga gcacggactc aggagcatcc
       61 cagcctggac gctggacaag ttcatagagg attacctcct tcccgacacc acctttggtg
      121 ctgatgtcaa atcagccgtc aatgtcgtgt gtgatttcct gaaggagaga tgcttccaag
      181 gtgctgccca cccagtgagg gtctccaagg tggtgaaggg tggctcctca ggcaaaggca
      241 ccacactcaa gggcaggtca gacgctgacc tggtggtgtt ccttaacaat ctcaccagct
      301 ttgaggatca gttaaaccga cggggagagt tcatcaagga aattaagaaa cagctgtacg
      361 aggttcagca tgagagacgt tttagagtca agtttgaggt ccagagttca tggtggccca
      421 acgcccggtc tctgagcttc aagctgagcg ccccccatct gcatcaggag gtggagtttg
      481 atgtgctgcc agcctttgat gtcctgggtc atgttaatac ttccagcaag cctgatccca
      541 gaatctatgc catcctcatc gaggaatgta cctccctggg gaaggatggc gagttctcta
      601 cctgcttcac ggagctccag cggaacttcc tgaagcagcg cccaaccaag ctgaagagtc
      661 tcatccgcct ggtcaagcac tggtaccaac tgtgtaagga gaagctgggg aagccattgc
      721 ctccacagta cgccctagag ttgctcactg tctttgcctg ggaacaaggg aatggatgtt
      781 atgagttcaa cacagcccag ggcttccgga ccgtcttgga actggtcatc aattatcagc
      841 atcttcgaat ctactggaca aagtattatg actttcaaca ccaggaggtc tccaaatacc
      901 tgcacagaca gctcagaaaa gccaggcctg tgatcctgga cccagctgac ccaacaggga
      961 atgtggccgg tgggaaccca gagggctgga ggcggttggc tgaagaggct gatgtgtggc
     1021 tatggtaccc atgttttatt aaaaaggatg gttcccgagt gagctcctgg gatgtgccga
     1081 cggtggttcc tgtacctttt gagcaggtag aagagaactg gacatgtatc ctgctgtgag
     1141 cacagcagca cctgcccagg agactgctgg tcaggggcat ttgctgctct gctgcaggcc
     1201 catgacccag tgagggaggg ccccacctgg catcagactc cgtgcttctg atgcctgcca
     1261 gccatgtttg actcctgtcc aatcacagcc agccttcctc aacagattca gaaggagagg
     1321 aaagaacaca cgcttggtgt ccatctgtcc acctgttgga aggttctgtc tgacaaagtc
     1381 tgatcaacaa taaaccacag caggtgccgt ca
//
LOCUS       PSEHUTR      1604 bp ds-DNA             BCT       28-JUN-1990
DEFINITION  P.putida histidine utilization genes repressor protein (hut) gene,
            complete cds.
ACCESSION   M33922
KEYWORDS    histidine utilization genes repressor protein.
SOURCE      P.putida (ATCC 12633) DNA.
  ORGANISM  Pseudomonas putida
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1604)
  AUTHORS   Allison,S.L. and Phillips,A.T.
  TITLE     Nucleotide sequence of the gene encoding the repressor for the
            histidine utilization genes of Pseudomonas putida
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.T.Phillips, 26-APR-1990.
FEATURES       from  to/span     description
    pept        167      913     histidine utilization genes repressor protein
                                 (hut)
    pept        910     1482     protein of unknown function
    binding      46       65     hut binding site (put.)
    binding     894      898     ribosomal binding site (put.)
    site         82       95     operator site
    signal       94      122     promoter (put.)
    signal     1516     1543     transcription termination signal
BASE COUNT      295 a    484 c    513 g    312 t
ORIGIN      
        1 ggacatggct ggcccagccc gtaggcaaca gagcgcgttc ggcgaagtag gcggacatcg
       61 gtcaaatcct gttattgtta acttgtatat acatatacag gcgtttgcct gccgggtaaa
      121 ctgcggcaag ctaccgttca ttccctatgc acaaggatcc aacgccgtgc cgacacctcc
      181 tgtctccgcg ctggttgccc agatgggcga gggcccggcg ccgctgtatg cccgggtcaa
      241 acagatgatc atccagcaga tcgacaacgg cagctggccg ccgcatcacc gggtcccctc
      301 ggagagtgaa ctggtcaacg agctaggctt cagccgcatg accatcaacc gtgccctgcg
      361 cgaactcacg gccgacggcc tgctggtgcg catgcagggg gtcggcacgt tcgtagccga
      421 gccaaagggc cgttcggcgt tgttcgaagt caacaacatt gccgatgaaa ttgccgcgcg
      481 cggccatcag catagctgcc aggtgatcac gctcaccgag gaagcagccg gttccgaacg
      541 ggccctggcc ctggacatgc gtgaaggcca gcgggtgttc cactcgctga tcgtgcattt
      601 cgagaacggc gtgccggtgc agatcgagga ccgctacgtc aacgccgcga tcgcacccga
      661 ctacctcaag caggatttca cccggcagac gccatatgcc tacctgtccc aggtagcgcc
      721 gctgaccgag ggtgagcacg tggtcgaagc catcctggcc gagccggaag aatgccgcct
      781 gctgcagatc gagcggggcg aaccttgcct gctgatccgc cgtcgtactt ggtccggccg
      841 ccagccggta accgcggcgc ggctgatcca ccccggttcc cgtcatcgcc tggaaggacg
      901 tttcagcaaa tgagccagct gcagttgttg cgcgcacagg attacccgcg catgccgtgg
      961 aagaacggtg gcggtttcac cgaagagatc acccgcgaca gtggagaggg cctggacggc
     1021 tttggctggc gcctgtcgat tgccgatatc gaagagtctg gcggcttttc caccttcgcc
     1081 ggttaccagc ggatcatcac cgtgctgcag ggcgatggca tgcgcctgtt ggtcgatggc
     1141 cagcccagcc ggccgttgct gccgttcgat gcctttgcct tcagcggcga aagccaggtc
     1201 agctgcaagc tgctgggtgg ggcgatccgc gatttcaacc tgatctatgc accgcaacgg
     1261 taccgggcga ggttgcagtg gtttgatggc acgagccgtt tgtacagctc ggcgtcgaca
     1321 gtgctgttgt ttgctgccag cagtcacgtg gaagtgtcca tggcggggcg tgaggtgcag
     1381 cggttggggt tgtatgactg cctgcggctg gagggcaacg atgagttgct tgggctggaa
     1441 gttcaggggc ggttttgctt gattgagctc atttctcgct gatgggcttg gcgatacatt
     1501 ttcatcgcct gtgagatcga gcgccgcgcg ggcggcgctc gatttgcgcg ccgccgcaaa
     1561 actcaagccg gaccgacgct cgcttcaccc ccccaaaaaa aatc
//
LOCUS       MUSCR2        427 bp ss-mRNA            ROD       28-JUN-1990
DEFINITION  M.musculus complement receptor type 2 (CR2) mRNA, 3' end.
ACCESSION   M35685 J04153
KEYWORDS    complement receptor type 2.
SOURCE      M.musculus (strains BALB/c, DBA/2, CBA/J and C57B10.D2/OSN
            C5-deficient) and Swiss nude (strain nu/nu), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 427)
  AUTHORS   Fingeroth,J.D., Benedict,M.A., Levy,D.N. and Strominger,J.L.
  TITLE     Identification of murine complement receptor type 2
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 242-246 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.D.Fingeroth, 13-JAN-1989.
FEATURES       from  to/span     description
    pept    <     1      285     complement receptor type 2
BASE COUNT      148 a     95 c     76 g    108 t
ORIGIN      Chromosome 1.
        1 tgtttacaac cgaagatcaa aagagggcaa atattatcta ttttgaaaga tagttattca
       61 tataatgaca ctgtggcatt ttcttgtgaa cctggcttca ccttgaaggg caacaggagc
      121 attcgatgca atgctcatgg cacatcggag ccaccggtac cagtgtgtga aaaaggaaaa
      181 cacagagaaa gcaattatta tacaaagaca agacccaaag aaggagctct tcatttagaa
      241 acacgagaag tatattctat tgatccatat aacccagcaa gctgatgaca tgacaaatca
      301 agatgtagaa ctctcagcta cctcttcagc accatatctg cttacatgcc accaagctac
      361 cctccacgac aataatggac taaacctctg atttgtaagc cagccccaat taaatgtttt
      421 tctctat
//
LOCUS       RABCYP2C16   2006 bp ss-mRNA            MAM       28-JUN-1990
DEFINITION  Rabbit cytochrome P450IIC16 (CYP2C16) mRNA, complete cds.
ACCESSION   M29968
KEYWORDS    cytochrome P450; monooxygenase.
SOURCE      Rabbit (strain New Zealand White) adult liver, cDNA to mRNA.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 2006)
  AUTHORS   Hassett,C. and Omiecinski,C.J.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Hassett, 20-NOV-1989.
            
               Author Address [1]: C.Hasset
                               University of Washington
                               Department of Environmental Health SC-34
                               Seattle, WA 98195
FEATURES       from  to/span     description
    pept         66     1529     cytochrome P450IIC16 (CYP2C16)
    mRNA    <     1     2006     CYP2C16 mRNA
    signal     1985     1989     poly-A signal
BASE COUNT      524 a    474 c    472 g    536 t
ORIGIN      
        1 cggcatcggt accaaataag atagacagtg ctactcagaa atccaagaaa atggtggaag
       61 aaataatgga tccagttgtg gtcctggtgt tgggtctctg ctgtttgctt ctcctttcac
      121 actggaagca aaattccggg agggggaagc tccctcccgg ccccactcct ttccccatta
      181 ttggaaatat tctccagata gatgctaagg acatcagcaa atccctaact aagttctcag
      241 aacgctatgg ccccgtgttc actgtgtatc tgggcatgaa gcccgctgta gtgctgcatg
      301 gataccaggc agtgaaggag gccctggttg atcttggaga ggagtttgct ggaagaggca
      361 gttttcctat gcttgataaa gttagtaagg gactcggaat cgttttcacc aatggaaaga
      421 gatggaaaga gatccggcgc ttctcgctca tgaccctgcg gaatttcggg atggggaaga
      481 ggagcattga ggaccgagtt caagaggagg cccgctgcct ggtggaggag ctgagaaaaa
      541 ccaacgcctc accctgtgat cccaccttta tcctgggctg tgctccctgc aatgtgatct
      601 gctccattat tttccataat cgctttgatt ataaagatga ggagtttctt aaactattgg
      661 aaaaattcaa tgaaaatgtt aggattctga gttctccatg gttgcaggtc tgcaataatt
      721 tccctgctct tattgattac ttaccaggaa gtcataagac cttactaaag aattctgatt
      781 atgtgaaaaa ttttattatg gagaaagtga aggaacacca aaaattcctg gatgttaaca
      841 atcctcggga ctttatagat tgtttcttga tcaaaatgga gcaggaaaac catttggagt
      901 tcactcttga aagcttggta accactgtgt ttgatttgtt tggagctggg actgagacaa
      961 cgagcacaac gctgagatac tccctcctgc tcctgctgaa gcaccccgag gtcgcagata
     1021 aagtgcagga ggagattgag cgtgtgattg gcaggcaccg gagcccctgc atgcaggaca
     1081 ggagccgcat gccttacaca gatgccgtaa tacatgagat ccagagattc attgacctgg
     1141 tccccaataa tctgccccac acagtgaccc gtgacattaa attcagaaac tactttatcc
     1201 ccaagggtac ggacatcatg acatcactga catccgtgct acatgatgaa aaagcatttc
     1261 ctaacccaaa ggtatttgac cctggacact ttctggatga gagtggcaac ttcaagaaga
     1321 gtgactactt catgcctttc tcagcaggaa aacggatctg tgtgggagag gccctggccc
     1381 gcatggagct gtttttgttc ctgacctcca ttttgcagaa ctttaaactg caatctctgg
     1441 ttgagccaaa ggacctggac atcactgcag ttctcaatgg atttgtttct gtgccacctt
     1501 cgttccagct ctgcttcatt cctgtttgaa aaggagcaga ctggcttcta ctgtgccatc
     1561 atttcaaagg cattgcccat caccttactg catttgagac acttctttaa cttttctcac
     1621 atcttactat tcccttaaga tctagtgaaa acctaacttc tgtgggtgat cccctgagac
     1681 tgcctgccct gaccatgcaa gaggtagaga gggcatggca agccatgctc ctgggaggga
     1741 ccccacagcc tggctgctgg caggtggcgg gacccaggca catttctctc cattcctgcc
     1801 tgtcaggtaa actgctccta gctgtgtcca aagcccatca agaaagctac cgtaggctat
     1861 gtgaccttca agatgattgt aggagcatat cagtaccaat attgcctcta tcctatagaa
     1921 ttagtactgc cctgaattag ttacaccctt tctgcctgcc ctttagaaag tgtgcatgct
     1981 cattaataaa gtggatgcat tcactg
//

GenBank-Updates@genbank.bio.net (06/29/90)

LOCUS       MUSTKBLK     2094 bp ss-mRNA            ROD       29-JUN-1990
DEFINITION  Mouse B lymphocyte kinase (blk) mRNA, complete cds.
ACCESSION   M30903
KEYWORDS    B lymphocyte kinase; tyrosine kinase.
SOURCE      Mouse (strain C57B410, haplotype b) B-lymphoid hybridoma cell line
            LB27.4 + 71.67, cDNA to mRNA, clones 102, 103, 201, 205, 215.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (sites)
  AUTHORS   Dymecki,S.M., Niederhuber,J.E. and Desiderio,S.V.
  TITLE     Specific expression of a tyrosine kinase gene, blk, in B lymphoid
            cells
  JOURNAL   Science 247, 332-336 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2094)
  AUTHORS   Dymecki,S.M., Niederhuber,J.E. and Desiderio,S.V.
  JOURNAL   Unpublished (1990) Howard Hughes Med. Inst., Baltimore MD 21205
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by S.V.Desiderio, 21-DEC-1989.
FEATURES       from  to/span     description
    pept        350     1849     B lymphocyte kinase
BASE COUNT      485 a    542 c    607 g    460 t
ORIGIN      
        1 cagcctggct gaaggtcagg tgcccatgaa acttctcaga aacacctccg gggctgagga
       61 gccagaagtc tccagaagac ttgactggag ggcagagctc ccactgagct gcttcttagg
      121 gctgttcatg gcggtctgat cacaacaggg ggcacacaca cctctgtgtg ctgttggcag
      181 gaagccacaa gccactgaag ctgactgaga tgagaagaat ccatctgggg atgggattgg
      241 cttttcttaa ctgggtccag agcttctggt tgccgctaga aagttcttca accacacaga
      301 ccctgggatc tgcacagcta tgttgagaag ctactgaagc cctgcaagga tggggctgct
      361 gagcagcaag aggcaggtca gtgagaaggg caagggctgg agccccgtga agatccgcac
      421 ccaggacaag gctcccccac ccctgccacc cctggttgtc ttcaaccacc ttgccccacc
      481 atctcctaac caggacccag atgaagagga gcgttttgtg gtggctctgt ttgactatgc
      541 cgctgtgaat gacagggacc ttcaggtgct gaagggtgag aagctccagg tcttgaggag
      601 cactggagac tggtggttgg ccaggtcact cgtcacagga agagaaggtt atgtgcccag
      661 caactttgtg gccccagtag agactctgga agtagaaaaa tggttcttca ggaccatcag
      721 ccggaaggat gctgagaggc agttgctggc tccgatgaac aaggccggct cctttctcat
      781 cagagagagt gagagcaata aaggtgcctt ttccctgtcc gtgaaagata tcaccaccca
      841 gggggaggtg gtcaagcact ataagatccg atcactggac aatggaggct attacatctc
      901 cccccggatc acctttccca ccctccaggc cctggtgcag cactattcaa agaaagggga
      961 tggtttgtgt cagaagttga ctctgccctg tgtgaacctg gccccgaaga acctttgggc
     1021 ccaagatgaa tgggaaatcc ccaggcagtc tctcaagttg gtccggaaac ttgggtctgg
     1081 gcagtttggc gaagtctgga tgggttatta caaaaataac atgaaggtgg ccatcaagac
     1141 cctgaaggag ggaaccatgt caccggaagc tttcctgggc gaggccaacg tgatgaaaac
     1201 cctgcagcat gagaggctgg ttcgtctcta cgctgtggtc accagagagc ccatttacat
     1261 cgtcactgaa tacatggcca gaggatgctt gctggatttt ctgaagaccg atgaaggtag
     1321 caggttgtcc cttccaaggc tgattgacat gtcagcccag gttgcagagg ggatggctta
     1381 catagagcgc atgaattcca tccaccgtga cctgcgggca gccaacatcc tggtgtctga
     1441 gacgttgtgc tgcaaaatcg ctgacttcgg cttggccagg atcattgaca gtgaatacac
     1501 tgcccaagag ggggccaagt tccccatcaa gtggaccgcc ccggaggcca tccacttcgg
     1561 ggtgtttacc atcaaggctg atgtgtggtc cttcggagtc ttgctgatgg tgattgtcac
     1621 ctatgggcgc gttccctacc caggaatgag caaccctgag gtcatccgta gcctggagca
     1681 cggctaccga atgccatgcc cggagacatg tccaccggag ttgtacaatg atatcatcac
     1741 tgagtgctgg cggggccggc cagaggagcg gcctaccttt gagttcctgc agtcggtgtt
     1801 ggaggacttc tacacagcca cggagggcca atatgagctg cagccctagc aaccaatggt
     1861 ctcttcaggg ctcccagcct gagcctcctt ctggactctg actttgggtt aatatgttca
     1921 cacttagctc ttcttccagg taggttgcag aatcagggaa gaatggtgtg cctgcttctc
     1981 aagtgaggga gccaggagac aaagatggct tcatggtgag gcatgagctc tcaaggcaca
     2041 acctcggtcc acctttcact ggctgtgtgc ctttggccaa gttactttgc tgcg
//
LOCUS       CHKSUH2AC      75 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  Chicken and sea urchin chimeric TATA-H2A-c gene.
ACCESSION   J02525
KEYWORDS    conalbumin; histone.
SOURCE      Chicken and sea urchin DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Grosschedl,R., Wasylyk,B., Chambon,P. and Birnstiel,M.L.
  TITLE     Point mutation in the TATA box curtails expression of sea urchin
            H2A histone gene in vivo
  JOURNAL   Nature 294, 178-180 (1981)
  STANDARD  full staff_review
COMMENT     The TATA-H2A-c gene was formed by replacing the "TATA box" of sea
            urchin H2A with that of chicken conalbumin. A single base change
            from 't' to 'g' in the third position of the "TATA box" reduces the
            rate of transcription by 80%.
FEATURES       from  to/span     description
    mRNA         68  >    75     H2A-c mRNA
    recomb       23       24     sea urchin DNA/BamHI linker
    recomb       25       26     BamHI linker/chicken DNA
    mut          41       41     t in wild type; g in "TAGA" mutation
    recomb       60       61     chicken DNA/sea urchin DNA
BASE COUNT       20 a     21 c     21 g     13 t
ORIGIN      68 bp upstream of H2A-c mRNA start
        1 tcgctgaccg gtctctccga tccggagggc tgctcctcta taaaagggga agaaagagga
       61 accattcaag ccagc
//
LOCUS       CPVW8          70 bp ds-DNA             VRL       29-JUN-1990
DEFINITION  CPV-BR white pock mutant recombination junction, variant W8.
ACCESSION   M11471
KEYWORDS    recombination joint.
SOURCE      Cow pox virus (strain Brighton red; CPV-BR) DNA, passed in
            chorioallantoic membrane (CAM) of 11-day old chicken embryo.
  ORGANISM  Cowpox virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Pickup,D.J., Ink,B.S., Parsons,B.L., Hu,W. and Joklik,W.K.
  TITLE     Spontaneous deletions and duplications of sequences in the genome
            of cowpox virus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 6817-6821 (1984)
  STANDARD  full staff_review
COMMENT     In the white pock variants of cowpox virus, 32 to 38 kb of the 3'
            end (region 2) were replaced by inverted copies, 21 to 50 kb long,
            of the 5' end (region 1) of the genome.  These variants possess
            inverted terminal repeats.  The limits of the inner ends of the
            deletions appear to be defined by the location of the nearest
            essential gene on one end and the location of the gene that encodes
            pock-redness on the other.  The sequences adjacent to the
            recombination/transfer sites show no homology to each other or any
            unusual structural feature.  At the recombination site of variant
            W8 one base pair "t" is shared by the two combining sequences.
FEATURES       from  to/span     description
    recomb       36       37     region 1 end/region 2 start (crossover event)
BASE COUNT       19 a     11 c     14 g     26 t
ORIGIN      Undetermined number of bp downstream of EcoRI site.
        1 ctttttattg agtggtggta gttacggata tctaatttat ccatccagta tgggtataca
       61 acacgaattc
//
LOCUS       DRORGM108    1059 bp ds-DNA             INV       29-JUN-1990
DEFINITION  D.melanogaster 28S rRNA gene with 1.006 kb type I insertion.
ACCESSION   K03141
KEYWORDS    insertion sequence.
SOURCE      D.melanogaster ribosomal DNA, clone MB27.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1059)
  AUTHORS   Roiha,H. and Glover,D.M.
  TITLE     Duplicated rDNA sequences of variable lengths flanking the short
            type I insertions in the rDNA of Drosophila melanogaster
  JOURNAL   Nucleic Acids Res. 9, 5521-5532 (1981)
  STANDARD  simple staff_review
COMMENT     The rRNA genes of D.melanogaster can contain two types of
            non-homologous insertion sequences, which are unlike intervening
            sequences found in other genes.  These insertion sequences appear
            to have a repressive effect on the transcription of rRNA genes.
FEATURES       from  to/span     description
    rpt          24       30     rRNA duplication
    rpt        1037     1043     28S rRNA duplication
    recomb       30       31     28S rRNA end/insertion sequence start
    recomb     1036     1037     insertion sequence end/28S rRNA start
BASE COUNT      237 a    244 c    311 g    265 t      2 others
ORIGIN      164 bp upstream of BstI site.
        1 gaatggatta acgagattcc tactgtccct gtcttagctg ggagcagagg aagactcgcc
       61 tagaggagtg tttaatccag agttggcaga acagatggga cgatgacagc gaaccaggac
      121 gggtgacgca taggtttatc ccatacgtta gtcttgccta tcgggatcca agttttggat
      181 tctcgatgag gacgtctttc ctgcttacag ggcacgggtc gttcaatgca tttttgcacg
      241 ggagagccct cagcgatacc actgcttgcg catgtggcga tccatatgag gactggatgc
      301 atatcttgtg cgcttgcccc ctatatgcag atctgcggga cctagatgga cttggagtgc
      361 agcgccttgg cgaaactgga tcttcgaggg aatcctcgat gatcaagaga agactcaacg
      421 gctggcaatg tttgcggaag aagtgttcct gaggaggagg ggcgtttagc tcagtcatct
      481 ctcgcgtgtg gtttagcggg cgagaatact accacagttc cgctgttgct tgtcgtaaga
      541 gacgactaat acagcgatag gattcctcta accctgcttg tcggagcaaa agggggaggc
      601 ccaccgagcc tcttttcggt accacgggtt gagcagctat ccaagactgc tcattgaggt
      661 aggccccctg gtgggagtat cgtggtggct gtggttggta cccatatcgc gggtagagcc
      721 ttcatgctcg acgtttgagt tacggtgcta gttgcgcaaa actcgggtgc tgtgacccag
      781 agatcagtag agattttagg tagatctcgc tcctcagcaa gggggagtgc ttgcccggca
      841 agcangntac tcgaattgct accggggtgg tcgctatgta catagctata gcttctagtc
      901 cgggacgctt gtctggcgta tccagacaca tgcaccatat gctcacttgt gggtgtatag
      961 ggtgccgtgg ttgtaatccc ttcagtgtgg aacacgccac gtaaaataag ttcggaggga
     1021 tccgaaaagc atacattgtc cctatctact atctagcaa
//
LOCUS       ECOLACIS      113 bp ds-DNA             BCT       29-JUN-1990
DEFINITION  E.coli lacZ and IS1T insertion junction.
ACCESSION   K03504
KEYWORDS    insertion element IS1T; lacZ/IS1 fusion protein.
SOURCE      E.coli K12 DNA, clone pCSH11.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 113)
  AUTHORS   Malamy,M.H., Rahaim,P.T., Hoffman,C.S., Baghdoyan,D.,
            O'Connor,M.B. and Miller,J.
  TITLE     A frameshift mutation at the junction of an IS1 insertion within
            lacZ restores beta-galactosidase activity via formation of an
            active lacZ-IS1 fusion protein
  JOURNAL   J. Mol. Biol. 181, 551-555 (1985)
  STANDARD  full staff_review
COMMENT     The site of integration of IS1 into lacZ is found 18 nucleotides
            from the end of the sequence determining the C-terminus of
            beta-galactosidase.  Reversion to lac+, promoted by ICR191, results
            from the loss of a "g" from the "ggg" sequence located at the
            junction of lacZ and IS1.  As a result an active but
            temperature-sensitive lacZ/IS1 fusion protein is formed containing
            six amino acids derived from IS1 which replace six amino acids
            encoded by lacZ
FEATURES       from  to/span     description
    pept    <     1       57     lacZ/IS1 fusion protein (AA at 1)
    recomb       37       38     E.coli DNA end/IS1 DNA start
BASE COUNT       24 a     25 c     28 g     36 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccagc tgagcgccgg tcgctaccat taccaggtaa tgctgccaac ttactgattt
       61 agtgtatgat ggtgatttta aggtgcttgc gtggcttcca tttccatcag atg
//
LOCUS       ECOOMPASA      70 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  E.coli outer membrane protein ompA gene-plasmid pBR322 junction
            (synthetic), clone pOMPA21.
ACCESSION   J02534
KEYWORDS    artificial gene; membrane protein; outer membrane protein.
SOURCE      E.coli and plasmid pBR322 DNA, clone pOMPA21.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E.,
            Kurz,C. and Schaller,H.
  TITLE     Export of a protein into the outer membrane of Escherichia coli
            K12: Stable incorporation of the OmpA protein requires less than
            193 amino-terminal amino-acid residues
  JOURNAL   Eur. J. Biochem. 122, 223-231 (1982)
  STANDARD  full staff_review
COMMENT     [1] explored to what extent ompA gene can be shortened without
            impairing outer membrane incorporation of resulting protein
            fragments, and found that more than 133 NH2-terminal residues of
            the mature OmpA protein suffice but are unstable, while 193 such
            residues are stable.
FEATURES       from  to/span     description
    pept    <     1       70     OmpA' protein (modified OmpA; aa at 2)
    recomb       13       14     E.coli end/pBR322 start
BASE COUNT       15 a     17 c     24 g     14 t
ORIGIN      13 bp upstream of BamHI site
        1 gctgagcaac ctggatccac aggacgggtg tggtcgccat gatcgcgtag tcgatagtgg
       61 ctccaagtag
//
LOCUS       ECOOMPASB      66 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  E.coli outer membrane protein ompA gene / AD3 insert /
            plasmid pBR322 synthetic construct, clone pOMPA21-11.
ACCESSION   J02532
KEYWORDS    artificial gene; membrane protein; outer membrane protein.
SOURCE      E.coli, AD3 and plasmid pBR322 DNA, clone pOMPA21-11.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E.,
            Kurz,C. and Schaller,H.
  TITLE     Export of a protein into the outer membrane of Escherichia coli
            K12: Stable incorporation of the OmpA protein requires less than
            193 amino-terminal amino-acid residues
  JOURNAL   Eur. J. Biochem. 122, 223-231 (1982)
  STANDARD  full staff_review
COMMENT     See comments for first entry in this series. AD3 is a derivative of
            a synthetic lac operator.
FEATURES       from  to/span     description
    pept    <     1       42     OmpA' (modified OmpA protein)
    recomb       13       14     E.coli end/AD3 insert start
    recomb       60       61     AD3 insert end/pBR322 start
BASE COUNT       19 a     14 c     17 g     16 t
ORIGIN      13 bp upstream of BamHI site
        1 gcgatcactc ctggatccgg aattcaaatt gtgagcggat aacaatttga attccggatc
       61 gcgaag
//
LOCUS       ECOOMPASC      66 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  E.coli outer membrane protein ompA gene / AD3 insert /
            plasmid pBR322 synthetic construct, clone pOMPA21-16.
ACCESSION   J02533
KEYWORDS    artificial gene; membrane protein; outer membrane protein.
SOURCE      E. coli, AD3 and plasmid pBR322 DNA, clone pOMPA21-16.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E.,
            Kurz,C. and Schaller,H.
  TITLE     Export of a protein into the outer membrane of Escherichia coli
            K12: Stable incorporation of the OmpA protein requires less than
            193 amino-terminal amino-acid residues
  JOURNAL   Eur. J. Biochem. 122, 223-231 (1982)
  STANDARD  full staff_review
COMMENT     See comments for first entry in this series. AD3 is a derivative of
            a synthetic lac operator.
FEATURES       from  to/span     description
    pept    <     1       34     OmpA' (modified OmpA; aa at 2)
    recomb       13       14     E.coli end/AD3 insert start
    recomb       60       61     AD3 insert end/pBR322 start
BASE COUNT       22 a     14 c     15 g     15 t
ORIGIN      20 bp upstream of EcoRI site
        1 acagaccaag cacgatccgg aattcaaatt gtgagcggat aacaatttga attccggatc
       61 tgatcg
//
LOCUS       HUMIGLAM2     872 bp ss-mRNA            PRI       29-JUN-1990
DEFINITION  Human Ig rearranged lambda-chain mRNA VJC-region subgroup lambda-IV
            from heterohybridoma H6-3C4.
ACCESSION   M18645
KEYWORDS    immunoglobulin; immunoglobulin light chain; lambda-immunoglobulin;
            variable region; variable region subgroup lambda-IV.
SOURCE      Human hybridoma H6-3C4 (cell fusion of mouse myeloma NS1 cells and
            human peripheral blood lymphocytes), cDNA to mRNA, clone
            p3C4-lambda-4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 872)
  AUTHORS   Yamasaki,N., Komori,S. and Watanabe,T.
  TITLE     Complementary DNA for a human subgroup IV immunoglobulin
            lambda-chain
  JOURNAL   Mol. Immunol. 24, 981-985 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         31      732     Ig VJC-region
    recomb       90       91     V-region start
    recomb      375      376     V-region end/J-region start
    recomb      412      413     J-region end/C-region start
    recomb      729      730     C-region end
BASE COUNT      211 a    269 c    220 g    172 t
ORIGIN      254 bp upstream of BamHI site.
        1 tcagaagcag agttctgggg tgtctccacc atggcctgga cccctctctg gctcactctc
       61 ttcactcttt gcataggttc tgtggtttct tctgaactga ctcaggaccc tgttgtgtct
      121 gtggccttgg gacagacagt caggatcaca tgccaaggag acagcctcag aacctatcat
      181 gcaagctggt accagcagaa gccaagacag gcccctgtac ttgtcatcta tgatgaaaac
      241 aaccggccct cagggatccc agaccgattc tctggctcca cctcaggaaa cacagcttcc
      301 ttgaccatca ctggggctca ggcggaagat gaggctgact attactgtaa ctcccgggac
      361 agcagtggta accgtctggt attcggcgga gggaccaagc tgaccgtcct aggtcagccc
      421 aaggctgccc cctcggtcac tctgttcccg ccctcctctg aggagcttca agccaacaag
      481 gccacactgg tgtgtctcat aagtgacttc tacccgggag ccgtgacagt ggcctggaag
      541 ggagatagca gccccgtcaa ggcgggagtg gagaccacca caccctccaa acaaagcaac
      601 aacaagtacg cggccagcag ctatctgagc ctgacgcctg agcagtggaa gtcccacaga
      661 agctacagct gccaggtcac gcatgaaggg agcaccgtgg agaagacagt ggcccctaca
      721 gaatgttcat aggttctcaa ccctcacccc caccacggga gactagagct gcaggatccc
      781 aggggagggg tctctcctcc caccccaagg catcaagccc ttctccctgc actcaataaa
      841 ccctcaataa atattctcat tgtcaaggaa tt
//
LOCUS       HUMTCAXU      401 bp ss-mRNA            PRI       29-JUN-1990
DEFINITION  Human T-cell receptor active alpha-chain mRNA, clone HAP36
            (V-alpha-9.1 family; J-alpha-H family).
ACCESSION   M13737
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain.
SOURCE      Human adult PHA-stimulated peripheral blood T lymphocyte, cDNA to
            mRNA, clone HAP36.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Yoshikai,Y., Kimura,N., Toyonaga,B. and Mak,T.W.
  TITLE     Sequences and repertoire of human T-cell receptor alpha chain
            variable region genes in mature T lymphocytes
  JOURNAL   J. Exp. Med. 164, 90-103 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    ORF     <     1  >   401     T-cell receptor active alpha-chain precursor
    recomb      321      322     V region end/D-J region start
    recomb      387      388     D-J region end/C region start
BASE COUNT      115 a    102 c     87 g     97 t
ORIGIN      
        1 atgaagccca ccctcatctc agtgcttgtg ataatattta tactcagagg aacaagagcc
       61 cagagagtga ctcagcccga gaagctcctc tctgtcttta aaggggcccc agtggagctg
      121 aagtgcaact attcctattc tgggagtcct gaactcttct ggtatgtcca gtactccaga
      181 caacgcctcc agttactctt gagacacatc tctagagaga gcatcaaagg cttcactgct
      241 gaccttaaca aaggcgagac atctttccac ctgaagaaac catttgctca agaggaagac
      301 tcagccatgt attactgtgc tctaagtgtt tataaccagg gaggaaagct tatcttcgga
      361 cagggaacgg agttatctgt gaaacccaat atccagaacc c
//
LOCUS       HUMTCAYR      406 bp ss-mRNA            PRI       29-JUN-1990
DEFINITION  Human T-cell receptor active alpha-chain V-region (V-J-C) mRNA,
            partial cds, clone AA25.
ACCESSION   M17657 J02992
KEYWORDS    T-cell receptor alpha-chain; antigen receptor; constant region;
            joining region; processed gene; variable region.
SOURCE      Human peripheral blood lymphocyte, cDNA to mRNA, clone AA25.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 406)
  AUTHORS   Klein,M.H., Concannon,P., Everett,M., Kim,L.D.H.,
            Hunkapiller,T. and Hood,L.
  TITLE     Diversity and structure of human T-cell receptor alpha-chain
            variable region genes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 6884-6888 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1  >   406     T-cell receptor alpha-chain V-region (V-J-C)
                                 precursor /nomgen="TCRA" /map="14q11.2"
                                 /hgml_locus_uid="LX0123X"
    sigp          1       69     T-cell receptor alpha-chain signal peptide
    matp         70  >   403     T-cell receptor alpha-chain
    recomb      340      341     TCR V-region DNA end/TCR J-region DNA start
    recomb      400      401     TCR J-region DNA end/TCR C-region DNA start
BASE COUNT      104 a     95 c     98 g    109 t
ORIGIN      Chromosome 14q11.2.
        1 atgatgatat ccttgagagt tttactggtg atcctgtggc ttcagttaag ctgggtttgg
       61 agccaacgga aggaggtgga gcaggatcct ggacccttca atgttccaga gggagccact
      121 gtcgctttca actgtactta cagcaacagt gcttctcagt ctttcttctg gtacagacag
      181 gattgcagga aagaacctaa gttgctgatg tccgtatact ccagtggtaa tgaagatgga
      241 aggtttacag cacacgtcaa tagagccagc cagtatattt ccctgctcat cagagactcc
      301 aagctcagtg attcagccac ctacctctgt gtggtgaaca ttcgcccagg aaacacacct
      361 cttgtctttg gaaagggcac aagactttct gtgattccaa atatcc
//
LOCUS       LAMCG       48502 bp ds-DNA   circular  PHG       29-JUN-1990
DEFINITION  Bacteriophage lambda, complete genome.
ACCESSION   J02459 M17233
KEYWORDS    DNA binding protein; circular; coat protein; complete genome;
            origin of replication; repressor; unidentified reading frame.
SOURCE      Lambda wild-type and lambda strain cI857s7.
  ORGANISM  Bacteriophage lambda
            Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
REFERENCE   1  (bases 37224 to 37940)
  AUTHORS   Sauer,R.T.
  TITLE     DNA sequence of the bacteriophage lambda cI gene
  JOURNAL   Nature 276, 301-302 (1978)
  STANDARD  full staff_review
REFERENCE   2  (bases 37990 to 38982)
  AUTHORS   Schwarz,E., Scherer,G., Hobom,G. and Kossel,H.
  TITLE     Nucleotide sequence of cro, cII and part of the O gene in phage
            lambda DNA
  JOURNAL   Nature 272, 410-414 (1978)
  STANDARD  full staff_review
REFERENCE   3  (bases 34957 to 35615; r-strand)
  AUTHORS   Franklin,N.C. and Bennett,G.N.
  TITLE     The N protein of bacteriophage lambda, defined by its DNA sequence,
            is highly basic
  JOURNAL   Gene 8, 107-119 (1979)
  STANDARD  full staff_review
REFERENCE   4  (bases 37768 to 40293)
  AUTHORS   Schwarz,E., Scherer,G., Hobom,G. and Kossel,H.
  TITLE     The primary structure of the phage lambda P gene completes the
            nucleotide sequence of the plasmid lambda-dvh93
  JOURNAL   Biochem. Int. 1, 386-394 (1980)
  STANDARD  full staff_review
REFERENCE   5  (bases 27724 to 29275)
  AUTHORS   Hoess,R.H., Foeller,C., Bidwell,K. and Landy,A.
  TITLE     Site-specific recombination functions of bacteriophage lambda: DNA
            sequence of regulatory regions and overlapping structural genes for
            int and xis
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77, 2482-2486 (1980)
  STANDARD  full staff_review
REFERENCE   6  (bases 43682 to 45218)
  AUTHORS   Daniels,D.L. and Blattner,F.R.
  TITLE     Nucleotide sequence of the Q gene and the Q to S intergenic region
            of bacteriophage lambda
  JOURNAL   Virology 117, 81-92 (1982)
  STANDARD  full staff_review
REFERENCE   7  (bases 1 to 48502; fragments)
  AUTHORS   Sanger,F., Coulson,A.R., Hong,G.F., Hill,D.F. and Petersen,G.B.
  TITLE     Nucleotide sequence of bacteriophage lambda DNA
  JOURNAL   J. Mol. Biol. 162, 729-773 (1982)
  STANDARD  full staff_review
REFERENCE   8  (review; complete genome)
  AUTHORS   Daniels,D.L., Schroeder,J.L., Szybalski,W., Sanger,F. and
            Blattner,F.R.
  TITLE     Appendix I: A molecular map of coliphage lambda
  JOURNAL   (in) Hendrix,R.W., Roberts,J.W., Stahl,F.W. and
            Weisberg,R.A. (Eds.);
            Lambda II: 469-517;
            Cold Spring Harbor Laboratory, New York (1983)
  STANDARD  full staff_review
REFERENCE   9  (review; complete genome with annotation)
  AUTHORS   Daniels,D.L., Schroeder,J.L., Szybalski,W., Sanger,F.,
            Coulson,A.R., Hong,G.F., Hill,D.F., Petersen,G.B. and Blattner,F.R.
  TITLE     Appendix II: Complete annotated lambda sequence
  JOURNAL   (in) Hendrix,R.W., Roberts,J.W., Stahl,F.W. and Weisberg,R.A.
            (Eds.);
            Lambda II: 519-674;
            Cold Spring Harbor Laboratory, New York (1983)
  STANDARD  full staff_review
REFERENCE   10 (bases 45493 to 45963)
  AUTHORS   Imada,M. and Tsugita,A.
  TITLE     Amino acid sequence of lambda phage endolysin
  JOURNAL   Nature New Biol. 233, 230-231 (1971)
  STANDARD  full staff_review
REFERENCE   11 (bases 44588 to 44780)
  AUTHORS   Lebowitz,P., Weissman,S.M. and Radding,C.M.
  TITLE     Nucleotide sequence of a ribonucleic acid transcribed in vitro from
            lambda phage deoxyribonucleic acid
  JOURNAL   J. Biol. Chem. 246, 5120-5139 (1971)
  STANDARD  full staff_review
REFERENCE   12 (bases 1 to 12; both strands)
  AUTHORS   Wu,R. and Taylor,E.
  TITLE     Nucleotide sequence analysis of DNA. II. Complete nucleotide
            sequence of the cohesive ends of bacteriophage lambda DNA
  JOURNAL   J. Mol. Biol. 57, 491-511 (1971)
  STANDARD  full staff_review
REFERENCE   13 (bases 38597 to 38672)
  AUTHORS   Dahlberg,J.E. and Blattner,F.R.
  TITLE     In vitro transcription products of lambda DNA: Nucleotide sequences
            and regulatory sites
  JOURNAL   (in) Fox,C.F. and Robinson,W.S. (Eds.);
            Virus Research. Proceedings of 1973 ICN-UCLA Symposium: 533-544;
            Academic Press, New York (1973)
  STANDARD  full staff_review
REFERENCE   14 (sites; fragments at the 3'-terminus)
  AUTHORS   Weigel,P.H., Englund,P.T., Murray,K. and Old,R.W.
  TITLE     The 3'-terminal nucleotide sequences of bacteriophage lambda DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 70, 1151-1155 (1973)
  STANDARD  full staff_review
REFERENCE   15 (bases 35434 to 35618)
  AUTHORS   Dahlberg,J.E. and Blattner,F.R.
  TITLE     Sequence of the promoter-operator proximal region of the major
            leftward of bacteriophage lambda
  JOURNAL   Nucleic Acids Res. 2, 1441-1458 (1975)
  STANDARD  full staff_review
REFERENCE   16 (bases 35583 to 35600)
  AUTHORS   Kleid,D.G., Agarwal,K.L. and Khorana,H.G.
  TITLE     The nucleotide sequence in the promoter region of the gene N in
            bacteriophage lambda
  JOURNAL   J. Biol. Chem. 250, 5574-5582 (1975)
  STANDARD  full staff_review
REFERENCE   17 (bases 37945 to 38018)
  AUTHORS   Maniatis,T., Jeffrey,A. and Kleid,D.G.
  TITLE     Nucleotide sequence of the rightward operator of phage lambda
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 72, 1184-1188 (1975)
  STANDARD  full staff_review
REFERENCE   18 (bases 44588 to 44773)
  AUTHORS   Sklar,J., Yot,P. and Weissman,S.M.
  TITLE     Determination of genes, restriction sites, and DNA sequences
            surrounding the 6s template of bacteriophage lambda
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 72, 1817-1821 (1975)
  STANDARD  full staff_review
REFERENCE   19 (bases 37945 to 38027)
  AUTHORS   Maniatis,T., Ptashne,M., Backman,K., Kleid,D., Flashman,S.,
            Jeffrey,A. and Maurer,R.
  TITLE     Recognition sequences of repressor and polymerase in the operators
            of bacteriophage lambda
  JOURNAL   Cell 5, 109-113 (1975)
  STANDARD  full staff_review
REFERENCE   20 (bases 37946 to 38039)
  AUTHORS   Smith,G.R., Eisen,H., Reichardt,L. and Hedgpeth,J.
  TITLE     Deletions of lambda phage locating a p-rm mutation within the
            rightward operator
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 73, 712-716 (1976)
  STANDARD  full staff_review
REFERENCE   21 (bases 35578 to 35667; 37903 to 38027)
  AUTHORS   Ptashne,M., Bachman,K., Humayun,M.Z., Jeffrey,A., Maurer,R.,
            Meyer,B. and Sauer,R.T.
  TITLE     Autoregulation and function of a repressor in bacteriophage lambda
  JOURNAL   Science 194, 156-161 (1976)
  STANDARD  full staff_review
REFERENCE   22 (bases 37905 to 37989)
  AUTHORS   Walz,A., Pirrotta,V. and Ineichen,K.
  TITLE     Lambda repressor regulates the switch between p-r and p-rm
            promoters
  JOURNAL   Nature 262, 665-669 (1976)
  STANDARD  full staff_review
REFERENCE   23 (bases 27617 to 27934)
  AUTHORS   Landy,A. and Ross,W.
  TITLE     Viral integration and excision: structure of the lambda att sites
  JOURNAL   Science 197, 1147-1160 (1977)
  STANDARD  full staff_review
REFERENCE   24 (bases 39062 to 39170)
  AUTHORS   Denniston-Thompson,K., Moore,D.D., Kruger,K.E., Furth,M.E. and
            Blattner,F.R.
  TITLE     Physical structure of the replication origin of bacteriophage
            lambda
  JOURNAL   Science 198, 1051-1056 (1977)
  STANDARD  full staff_review
REFERENCE   25 (bases 38610 to 38732)
  AUTHORS   Scherer,G., Hobom,G. and Koessel,H.
  TITLE     DNA base sequence of the p-o promoter region of phage lambda
  JOURNAL   Nature 265, 117-121 (1977)
  STANDARD  full staff_review
REFERENCE   26 (bases 38041 to 38241)
  AUTHORS   Roberts,T.M., Shimatake,H., Brady,C. and Rosenberg,M.
  TITLE     Sequence of cro gene of bacteriophage lambda
  JOURNAL   Nature 270, 274-275 (1977)
  STANDARD  full staff_review
REFERENCE   27 (bases 27616 to 28935)
  AUTHORS   Davies,R.W., Schreier,P.H. and Buechel,D.E.
  TITLE     Nucleotide sequence of the attachment site of coliphage lambda
  JOURNAL   Nature 270, 757-760 (1977)
  STANDARD  full staff_review
REFERENCE   28 (bases 35578 to 35667)
  AUTHORS   Humayun,Z., Jeffrey,A. and Ptashne,M.
  TITLE     Completed DNA sequences and organization of repressor-binding sites
            in the operators of phage lambda
  JOURNAL   J. Mol. Biol. 112, 265-277 (1977)
  STANDARD  full staff_review
REFERENCE   29 (bases 37206 to 37263; 37914 to 37970)
  AUTHORS   Humayun,Z.
  TITLE     DNA sequence at the end of the cI gene in bacteriophage lambda
  JOURNAL   Nucleic Acids Res. 4, 2137-2143 (1977)
  STANDARD  full staff_review
REFERENCE   30 (bases 44467 to 44807)
  AUTHORS   Sklar,J.L.
  TITLE     Structure and function of two regions of DNA controlling the
            synthesis of prokaryotic RNAs
  JOURNAL   Thesis (1977) Yale University, New Haven, CT, USA
  STANDARD  full staff_review
REFERENCE   31 (bases 38470 to 39189)
  AUTHORS   Hobom,G., Grosschedl,R., Lusky,M., Scherer,G., Schwarz,E. and
            Koessel,H.
  TITLE     Functional analysis of the replicator structure of lambdoid
            bacteriophage DNAs
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 43, 165-178 (1979)
  STANDARD  full staff_review
REFERENCE   32 (bases 37938 to 38016; 35589 to 35666; comp strand)
  AUTHORS   Flashman,S.M.
  TITLE     Mutational analysis of the operators of bacteriophage lambda
  JOURNAL   Mol. Gen. Genet. 166, 61-73 (1978)
  STANDARD  full staff_review
REFERENCE   33 (bases 38212 to 38362)
  AUTHORS   Rosenberg,M., Court,D., Shimatake,H., Brady,C. and Wulff,D.L.
  TITLE     The relationship between function and DNA sequence in an
            intercistronic regulatory region in phage lambda
  JOURNAL   Nature 272, 414-423 (1978)
  STANDARD  full staff_review
REFERENCE   34 (bases 29711 to 29811; 31043 to 31058)
  AUTHORS   Davies,R.W., Schreier,P.H. and Buechel,D.E.
  TITLE     Determination of the endpoints of partial deletion mutants of the
            attachment site of bacteriophage lambda by DNA sequencing
  JOURNAL   Nucleic Acids Res. 5, 3209-3218 (1978)
  STANDARD  full staff_review
REFERENCE   35 (bases 21661 to 31129; fragments)
  AUTHORS   Hoess,R.H. and Landy,A.
  TITLE     Structure of the lambda att sites generated by int-dependent
            deletions
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75, 5437-5441 (1978)
  STANDARD  full staff_review
REFERENCE   36 (bases 38453 to 38500)
  AUTHORS   Sprague,K.U., Faulds,D.H. and Smith,G.R.
  TITLE     A single base-pair change creates a chi recombinational hotspot in
            bacteriophage lambda
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75, 6182-6186 (1978)
  STANDARD  full staff_review
REFERENCE   37 (bases 13 to 72; 48391 to 48502)
  AUTHORS   Nichols,B.P. and Donelson,J.E.
  TITLE     178-Nucleotide sequence surrounding the cos site of bacteriophage
            lambda DNA
  JOURNAL   J. Virol. 26, 429-434 (1978)
  STANDARD  full staff_review
REFERENCE   38 (bases 38597 to 39688)
  AUTHORS   Scherer,G.
  TITLE     Nucleotide sequence of the O gene and of the origin of replication
            in bacteriophage lambda DNA
  JOURNAL   Nucleic Acids Res. 5, 3141-3156 (1978)
  STANDARD  full staff_review
REFERENCE   39 (bases 38453 to 38500)
  AUTHORS   Smith,G.R., Faulds,D.H. and Sprague,K.U.
  TITLE     Nucleotide-sequence analysis of a chi site
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 43, 1067-1068 (1979)
  STANDARD  full staff_review
REFERENCE   40 (sites; transcription termination sites)
  AUTHORS   Adhya,S. and Gottesman,M.
  TITLE     Control of transcription termination
  JOURNAL   Annu. Rev. Biochem. 47, 967-996 (1978)
  STANDARD  full staff_review
REFERENCE   41 (bases 38008 to 39328)
  AUTHORS   Moore,D.D., Denniston-Thompson,K., Kruger,K.E., Furth,M.E.,
            Williams,B.G., Daniels,D.L. and Blattner,F.R.
  TITLE     Dissection and comparative anatomy of the origins of replication of
            lambdoid phages
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 43, 155-163 (1979)
  STANDARD  full staff_review
REFERENCE   42 (bases 27711 to 27826)
  AUTHORS   Ross,W., Landy,A., Kikuchi,Y. and Nash,H.
  TITLE     Interaction of int protein with specific sites on lambda att DNA
  JOURNAL   Cell 18, 297-307 (1979)
  STANDARD  full staff_review
REFERENCE   43 (bases 37305 to 37352)
  AUTHORS   Lieb,M.
  TITLE     Is5 increases recombination in adjacent regions as shown for the
            repressor gene of coliphage lambda
  JOURNAL   Gene 12, 277-280 (1980)
  STANDARD  full staff_review
REFERENCE   44 (bases 29055 to 29131)
  AUTHORS   Miller,H.I., Abraham,J., Benedik,M., Campbell,A., Court,D.,
            Echols,H., Fischer,R., Galindo,J.M., Guarneros,G., Hernandez,T.,
            Mascarenhas,D., Montanez,C., Schindler,D., Schmeissner,U. and
            Sosa,L.
  TITLE     Regulation of the integration-excision reaction by bacteriophage
            lambda
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 45, 439-445 (1981)
  STANDARD  full staff_review
REFERENCE   45 (bases 36245 to 36343)
  AUTHORS   Pirrotta,V., Ineichen,K. and Walz,A.
  TITLE     An unusual polymerase binding site in the immunity region of phage
            lambda
  JOURNAL   Mol. Gen. Genet. 180, 369-376 (1980)
  STANDARD  full staff_review
REFERENCE   46 (bases 37940 to 38016)
  AUTHORS   Rosen,E.D., Hartley,J.L., Matz,K., Nichols,B.P., Young,K.M.,
            Donelson,J.E. and Gussin,G.N.
  TITLE     DNA sequence analysis of prm- mutations of coliphage lambda
  JOURNAL   Gene 11, 197-205 (1980)
  STANDARD  full staff_review
REFERENCE   47 (bases 38212 to 38467)
  AUTHORS   Wulff,D.L., Beher,M., Izumi,S., Beck,J., Mahoney,M., Shimatake,H.,
            Brady,C., Court,D. and Rosenberg,M.
  TITLE     Structure and function of the cy control region of bacteriophage
            lambda
  JOURNAL   J. Mol. Biol. 138, 209-230 (1980)
  STANDARD  full staff_review
REFERENCE   48 (bases 38237 to 38334)
  AUTHORS   Court,D., Brady,C., Rosenberg,M., Wulff,D.L., Behr,M.,
            Mahoney,M. and Izumi,S.
  TITLE     Control of transcription termination: A rho-dependent termination
            site in bacteriophage lambda
  JOURNAL   J. Mol. Biol. 138, 231-254 (1980)
  STANDARD  full staff_review
REFERENCE   49 (bases 38102 to 38166)
  AUTHORS   Calva,E. and Burgess,R.R.
  TITLE     Characterization of a rho-dependent termination site within the cro
            gene of bacteriophage lambda
  JOURNAL   J. Biol. Chem. 255, 11017-11022 (1980)
  STANDARD  full staff_review
REFERENCE   50 (bases 28929 to 29198)
  AUTHORS   Abraham,J., Mascarenhas,D., Fischer,R., Benedik,M., Campbell,A. and
            Echols,H.
  TITLE     DNA sequence of regulatory region for integration gene of
            bacteriophage lambda
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77, 2477-2481 (1980)
  STANDARD  full staff_review
REFERENCE   51 (bases 27724 to 29525)
  AUTHORS   Davies,R.W.
  TITLE     DNA sequence of the int-xis p-i region of the bacteriophage lambda;
            overlap of the int and xis genes
  JOURNAL   Nucleic Acids Res. 8, 1765-1782 (1980)
  STANDARD  full staff_review
REFERENCE   52 (bases 27501 to 27615)
  AUTHORS   Mizuuchi,M. and Mizuuchi,K.
  TITLE     Integrative recombination of bacteriophage lambda: extent of the
            DNA sequence involved in attachment site function
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77, 3220-3224 (1980)
  STANDARD  full staff_review
REFERENCE   53 (bases 27479 to 27633)
  AUTHORS   Hsu,P.-L., Ross,W. and Landy,A.
  TITLE     The lambda phage att site: functional limits and interaction with
            int protein
  JOURNAL   Nature 285, 85-91 (1980)
  STANDARD  full staff_review
REFERENCE   54 (bases 23131 to 23248)
  AUTHORS   Rosenvold,E.C., Calva,E., Burgess,R.R. and Szybalski,W.
  TITLE     In vitro transcription from the b2 region of bacteriophage lambda
  JOURNAL   Virology 107, 476-487 (1980)
  STANDARD  full staff_review
REFERENCE   55 (bases 30493 to 30569)
  AUTHORS   Smith,G.R., Schultz,D.W. and Crasemann,J.M.
  TITLE     Generalized recombination: Nucleotide sequence homology between chi
            recombinational hotspots
  JOURNAL   Cell 19, 785-793 (1980)
  STANDARD  full staff_review
REFERENCE   56 (bases 37940 to 38023)
  AUTHORS   Meyer,B.J., Maurer,R. and Ptashne,M.
  TITLE     Gene regulation at the right operator (o-r) of bacteriophage
            lambda. II. o-r-1, o-r-2, and o-r-3: their roles in mediating the
            effects of repressor and cro
  JOURNAL   J. Mol. Biol. 139, 163-194 (1980)
  STANDARD  full staff_review
REFERENCE   57 (bases 29055 to 29124)
  AUTHORS   Abraham,J. and Echols,H.
  TITLE     Regulation of int gene transcription by bacteriophage lambda:
            location of the start generated by an int constitutive mutation
  JOURNAL   J. Mol. Biol. 146, 157-165 (1981)
  STANDARD  full staff_review
REFERENCE   58 (bases 43681 to 45634)
  AUTHORS   Daniels,D.L.
  TITLE     Control of late transcription in bacteriophage lambda
  JOURNAL   Thesis (1981) University of Wisconsin, Madison, WI, USA
  STANDARD  full staff_review
REFERENCE   59 (bases 35468 to 35819)
  AUTHORS   Horn,G.T. and Wells,R.D.
  TITLE     The leftward promoter of bacteriophage lambda
  JOURNAL   J. Biol. Chem. 256, 1998-2002 (1981)
  STANDARD  full staff_review
REFERENCE   60 (bases 32503 to 35905)
  AUTHORS   Ineichen,K., Shepherd,J.C.W. and Bickle,T.A.
  TITLE     The DNA sequence of the phage lambda genome between p-l and the
            gene bet
  JOURNAL   Nucleic Acids Res. 9, 4639-4653 (1981)
  STANDARD  full staff_review
REFERENCE   61 (bases 43860 to 45001)
  AUTHORS   Petrov,N.A., Karginov,V.A., Mikryukov,N.N., Serpinski,O.I. and
            Kravchenko,V.V.
  TITLE     Complete nucleotide sequence of the bacteriophage lambda DNA region
            containing gene Q and promoter p-r
  JOURNAL   FEBS Lett. 133, 316-320 (1981)
  STANDARD  full staff_review
REFERENCE   62 (bases 44972 to 45057)
  AUTHORS   Smith,G.R., Comb,M., Schultz,D.W., Daniels,D.L. and Blattner,F.R.
  TITLE     Nucleotide sequence of the chi recombinational hotspot chi+d in
            bacteriophage lambda
  JOURNAL   J. Virol. 37, 336-342 (1981)
  STANDARD  full staff_review
REFERENCE   63 (bases 38686 to 39224)
  AUTHORS   Moore,D.D., Denniston,K.J. and Blattner,F.R.
  TITLE     Sequence organization of the origins of DNA replication in lambdoid
            coliphages
  JOURNAL   Gene 14, 91-101 (1981)
  STANDARD  full staff_review
REFERENCE   64 (bases 35468 to 35711)
  AUTHORS   Remaut,E., Stanssens,P. and Fiers,W.
  TITLE     Plasmid vectors for high-efficiency expression controlled by the pl
            promoter of coliphage lambda
  JOURNAL   Gene 15, 81-93 (1981)
  STANDARD  full staff_review
REFERENCE   65 (bases 35468 to 35541)
  AUTHORS   Drahos,D. and Szybalski,W.
  TITLE     Antitermination and termination functions of the cloned Nutl, N and
            tl1 modules of coliphage lambda
  JOURNAL   Gene 16, 261-274 (1981)
  STANDARD  full staff_review
REFERENCE   66 (bases 27650 to 27741)
  AUTHORS   Kravchenko,V.V. and Mikryukov,N.N.
  TITLE     Localization of the promoter p-att of the binding site of
            Escherichia coli polymerase on phage lambda DNA near the
            integration site
  JOURNAL   Dokl. Biochem. 264, 148-151 (1982)
  STANDARD  full staff_review
REFERENCE   67 (bases 39219 to 39338)
  AUTHORS   Moore,D.D. and Blattner,F.R.
  TITLE     Appendix: Sequence of lambda ri c 5b
  JOURNAL   J. Mol. Biol. 154, 81-83 (1982)
  STANDARD  full staff_review
REFERENCE   68 (bases 37938 to 38018)
  AUTHORS   Hawley,D.K. and McClure,W.R.
  TITLE     Mechanism of activation of transcription initiation from the lambda
            p-rm promoter
  JOURNAL   J. Mol. Biol. 157, 493-525 (1982)
  STANDARD  full staff_review
REFERENCE   69 (bases 25157 to 27484)
  AUTHORS   Hong,G.F.
  TITLE     A systematic DNA sequencing strategy
  JOURNAL   J. Mol. Biol. 158, 539-549 (1982)
  STANDARD  full staff_review
REFERENCE   70 (bases 35577 to 35647)
  AUTHORS   Hyman,H.C. and Honigman,A.
  TITLE     The use of the plasmid pha10 in the isolation of lambda pl promoter
            mutations
  JOURNAL   Mol. Gen. Genet. 185, 515-517 (1982)
  STANDARD  full staff_review
REFERENCE   71 (bases 38262 to 38386)
  AUTHORS   Lau,L.F., Roberts,J.W. and Wu,R.
  TITLE     Transcription terminates at lambda tr1 in three clusters
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 79, 6171-6175 (1982)
  STANDARD  full staff_review
REFERENCE   72 (bases 31299 to 31408)
  AUTHORS   Luk,K.-C. and Szybalski,W.
  TITLE     Transcription termination: Sequence and function of the
            rho-independent t-l3 terminator in the major leftward operon of
            bacteriophage lambda
  JOURNAL   Gene 17, 247-258 (1982)
  STANDARD  full staff_review
REFERENCE   73 (bases 35437 to 37348)
  AUTHORS   Landsmann,J., Kroeger,M. and Hobom,G.
  TITLE     The rex region of bacteriophage lambda: Two genes under three-way
            control
  JOURNAL   Gene 20, 11-24 (1982)
  STANDARD  full staff_review
REFERENCE   74 (bases 40218 to 43972)
  AUTHORS   Kroeger,M. and Hobom,G.
  TITLE     A chain of interlinked genes in the NinR region of bacteriophage
            lambda
  JOURNAL   Gene 20, 25-38 (1982)
  STANDARD  full staff_review
REFERENCE   75 (bases 31299 to 31408)
  AUTHORS   Luk,K.-C. and Szybalski,W.
  TITLE     Characterization of the cloned terminators t-r1, t-l3 and t-i, and
            the Nutr antitermination site of coliphage lambda
  JOURNAL   Gene 20, 127-134 (1982)
  STANDARD  full staff_review
REFERENCE   76 (bases 48424 to 48500)
  AUTHORS   Miwa,T. and Matsubara,K.
  TITLE     Identification of sequences necessary for packaging DNA into lambda
            phage heads
  JOURNAL   Gene 20, 267-279 (1982)
  STANDARD  full staff_review
REFERENCE   77 (bases 18414 to 18746)
  AUTHORS   Luk,K.-C. and Szybalski,W.
  TITLE     A cluster of leftward, rho-dependent t'j terminators in the J gene
            of coliphage lambda
  JOURNAL   Gene 21, 175-191 (1983)
  STANDARD  full staff_review
REFERENCE   78 (bases 48469 to 48498)
  AUTHORS   Miwa,T. and Matsubara,K.
  TITLE     Lambda phage DNA sequences affecting the packaging process
  JOURNAL   Gene 24, 199-206 (1983)
  STANDARD  full staff_review
REFERENCE   79 (bases 33287 to 33486)
  AUTHORS   Knight,D.M. and Echols,H.
  TITLE     The cIII gene and protein of bacteriophage lambda
  JOURNAL   J. Mol. Biol. 163, 505-510 (1983)
  STANDARD  full staff_review
REFERENCE   80 (bases 33000 to 33244; 33420 to 33543; 33629 to 34080)
  AUTHORS   Luk,K.-C. and Szybalski,W.
  TITLE     The tl2 cluster of transcription termination sites between genes
            bet and ral of coliphage lambda
  JOURNAL   Virology 125, 403-418 (1983)
  STANDARD  full staff_review
REFERENCE   81 (bases 37938 to 38019)
  AUTHORS   Shih,M.-C. and Gussin,G.N.
  TITLE     Mutations affecting two different steps in transcription initiation
            at the phage lambda p-rm promoter
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 496-500 (1983)
  STANDARD  full staff_review
REFERENCE   82 (bases 1 to 56; 48474 to 48502)
  AUTHORS   Feiss,M., Kobayashi,I. and Widner,W.
  TITLE     Separate sites for binding and nicking of bacteriophage lambda DNA
            by terminase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 955-959 (1983)
  STANDARD  full staff_review
REFERENCE   83 (bases 29063 to 29140)
  AUTHORS   Benedik,M., Mascarenhas,D. and Campbell,A.
  TITLE     The integrase promoter and t1' terminator in bacteriophages lambda
            and 434
  JOURNAL   Virology 126, 658-668 (1983)
  STANDARD  full staff_review
REFERENCE   84 (bases 45901 to 46443)
  AUTHORS   Taylor,A., Benedik,M. and Campbell,A.
  TITLE     Location of the R-z gene in bacteriophage lambda
  JOURNAL   Gene 26, 159-163 (1983)
  STANDARD  full staff_review
REFERENCE   85 (sites; cohesive ends)
  AUTHORS   Hohn,B.
  TITLE     DNA sequences necessary for packaging of bacteriophage lambda DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 7456-7460 (1983)
  STANDARD  full staff_review
REFERENCE   86 (sites; Pre-promoter mutations)
  AUTHORS   Wulff,D.L., Mahoney,M., Shatzman,A. and Rosenberg,M.
  TITLE     Mutational analysis of a regulatory region in bacteriophage lambda
            that has overlapping signals for the initiation of transcription
            and translation
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 555-559 (1984)
  STANDARD  full staff_review
REFERENCE   87 (sites; attP recombination site)
  AUTHORS   Craig,N.L. and Nash,H.A.
  TITLE     E. coli integration host factor binds to specific sites in DNA
  JOURNAL   Cell 39, 707-716 (1984)
  STANDARD  full staff_review
REFERENCE   88 (sites; major leftward transcription unit)
  AUTHORS   Edlind,T.D., Cooley,T.E., Richards,S.H. and Ihler,G.M.
  TITLE     Long range base-pairing in the leftward transcription unit of
            bacteriophage lambda: Characterization by electron microscopy and
            computer-aided sequence analysis
  JOURNAL   J. Mol. Biol. 179, 351-365 (1984)
  STANDARD  full staff_review
REFERENCE   89 (sites; prohead binding)
  AUTHORS   Frackman,S., Siegele,D.A. and Feiss,M.
  TITLE     A functional domain of bacteriophage lambda terminase for prohead
            binding
  JOURNAL   J. Mol. Biol. 180, 283-300 (1984)
  STANDARD  full staff_review
REFERENCE   90 (sites; cII binding site mutations)
  AUTHORS   Place,N., Fien,K., Mahoney,M.E., Wulff,D.L., Ho,Y.-S., Debouck,C.,
            Rosenberg,M., Shih,M.-C. and Gussin,G.N.
  TITLE     Mutations that alter the DNA binding site for the bacteriophage
            lambda cII protein and affect the translation efficiency of the cII
            gene
  JOURNAL   J. Mol. Biol. 180, 865-880 (1984)
  STANDARD  full staff_review
REFERENCE   91 (sites; nutR mutations)
  AUTHORS   Warren,F. and Das,A.
  TITLE     Formation of termination-resistant transcription complex at phage
            lambda nut locus: Effects of altered translation and a ribosomal
            mutation
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 3612-3616 (1984)
  STANDARD  full staff_review
REFERENCE   92 (sites; light chain oligonucleotides)
  AUTHORS   Coleclough,C. and Erlitz,F.L.
  TITLE     Use of primer-restriction-end adapters in a novel cDNA cloning
            strategy
  JOURNAL   Gene 34, 305-314 (1985)
  STANDARD  full staff_review
REFERENCE   93 (sites; nutL antiterminator)
  AUTHORS   Peltz,S.W., Brown,A.L., Hasan,N., Podhajska,A.J. and Szybalski,W.
  TITLE     Thermosensitivity of a DNA recognition Site: Activity of a
            truncated nutL Antiterminator of coliphage lambda
  JOURNAL   Science 228, 91-93 (1985)
  STANDARD  full staff_review
REFERENCE   94 (sites; rho utilization sites A and B)
  AUTHORS   Chen,C.-Y.A. and Richardson,J.P.
  TITLE     Sequence elements essential for rho-dependent transcription
            termination at lambda-tR1
  JOURNAL   J. Biol. Chem. 262, 11292-11299 (1987)
  STANDARD  simple staff_review
REFERENCE   95 (bases 2521 to 3300)
  AUTHORS   Hong,G.F.
  TITLE     Sequencing of large double-stranded DNA using the dideoxy
            sequencing technique
  JOURNAL   Biosci. Rep. 2, 907-912 (1982)
  STANDARD  simple staff_entry
COMMENT     Contributed on tape by F.Sanger via D.L.Daniels. Most of references
            [10] through [80] are either annotated by [8] and [9], which are
            the immediate sources for the annotation below, or they are cited
            in Table 3 of [8]. Only references [1] through [9] are represented
            in the features table herein.
            
            This is the best representation to date of the wild-type lambda
            l-strand, though much of the sequence was determined for the
            cI857s7 strain and changed to wild-type [8]. All reported
            variations leading to the strains cI857s7, imm21, imm434, lac5,
            Nin5 and b2 are included in the annotation. The first twelve bases
            are the sticky ends. A significant fraction of the known mutations
            affecting replication and transcription have been annotated below;
            a large number of point mutations, deletions and substitutions have
            not. For a complete account of lambda mutations in relation to the
            sequence, see [8].
            
            Each coding sequence belongs to a reading frame (orf) whose number,
            given in parentheses, should indicate the number of amino acids
            coded. The starting points for translation are known with varying
            degrees of certainty; for example, the start site for the N
            protein, given here as 35438, may turn out to be downstream (on the
            complementary strand) at 35360. When direct empirical evidence such
            as mutation or amino acid sequence is lacking, the start point is
            said to be putative. For a summary of the evidence bearing upon the
            coding sequences, see [7],[8]. Intergenic spaces in lambda are
            typically short and overlapping: the multiple reading frames (mult)
            range between a span of 1 and a span of 103. In most cases, a start
            codon precedes a termination codon, exceptions being the m-l
            boundary (13429) and the 314-194 boundary (21973) which show the
            E.coli trp operon pattern of "translational coupling" (see
            <ecotrp>).
            
            Transcription in the central region, bases 22686 to 37940, is
            leftward off the l-strand. In our annotation, this is indicated by
            the letter "c" and the descriptive term "comp strand". Signals and
            recognition sites in this region, without judgement made about
            their polarity, are treated accordingly, hence their span should be
            read toward the left rather than toward the right. Furthermore some
            leftward transcription is located outside the central region, and
            that is also indicated by "c" and "comp strand". In general, the
            estimates for the extent or span of signals (e.g. operators),
            binding sites (e.g. Nutr, int-binding sites, etc.) and of the
            attachment site (att) vary in the literature. This annotation
            follows [8]. No attempt is made to annotate promoters as signals
            because of the indefiniteness of their span, however known promoter
            mutants are given. The cII protein is known to bind in the -35
            regions of p-i (29091) and pre(38369). Transcript termination sites
            must be understood to be conditional on the N and Q proteins and
            less than 100% efficient. There remain terminators to be found and
            some of those annotated may have significance only in vitro.
FEATURES       from  to/span     description
    pept        191      736     nu1 (DNA packaging;181)
    pept        711     2636     A (DNA packaging;641)
    pept       2633     2839     W (head-tail joining;68)
    pept       2836     4437     B (capsid component;533)
    sigp       2839     2901     leader peptide
    matp       2902     4434     processed B
    pept       4418     5737     C (capsid component;439)
    pept       5132     5737     nu3 (capsid assembly;201)
    pept       5747     6079     D (head-DNA stabilization;110)
    pept       6135     7160     E (capsid component;341)
    pept       7202     7600     Fi (DNA packaging;117)
    pept       7612     7965     Fii (head-tail joining;117)
    pept       7977     8555     Z (tail component;192)
    pept       8552     8947     U (tail component;131)
    pept       8955     9695     V (tail component;256)
    pept       9711    10133     G (tail component;140)
    pept      10115    10549     T (tail component;144)
    pept      10542    13103     H (tail component;853)
    pept      13100    13429     M (tail component;109)
    pept      13429    14127     L (tail component;232)
    pept      14276    14875     K (tail component;199)
    pept      14773    15444     I (tail component;223)
    pept      15505    18903     J (tail:host specificity;1132)
    pept      18965    19585     lom (outer host membrane;206a)
    pept      19650    20855     orf-401
    pept      21029    21973     orf-314
    pept      21973    22557     orf-194
    pept      23918    22686 (c) ea47
    pept      25399    24509 (c) ea31 (296)
    pept      26973    25396 (c) ea59 (525)
    pept      28882    27812 (c) int (integration;356)
    pept      29078    28860 (c) xis (excision;72)
    pept      29655    29374 (c) ea8.5 (93)
    pept      30395    29847 (c) ea22 (182)
    pept      32028    31348 (c) exo (exonuclease;226)
    pept      32810    32025 (c) bet (recombination;261)
    pept      33232    32816 (c) gam (recombination;138)
    pept      33330    33187 (c) kil(host-killing;54)
    pept      33463    33299 (c) cIII (antitermination;89)
    pept      33904    33536 (c) ea10 (ssb;122)
    pept      34287    34087 (c) ral(restriction alleviation;66)
    pept      35438    35037 (c) N (early gene regulator;133)
    pept      36259    35825 (c) rexb (exclusion;144)
    pept      37114    36275 (c) rexa (exclusion;279)
    pept      37940    37227 (c) cI (repressor;237)
    pept      38041    38241     cro (antirepressor; also tof;66)
    pept      38360    38653     cII (antitermination;119)
    pept      38686    39585     O (DNA replication;299)
    pept      39582    40283     P (DNA replication;233)
    pept      40280    40570     ren(exclusion;96)
    pept      40644    41084     Nin 146 (pept unknown;146)
    pept      41081    41953     Nin 290 (pept unknown;290)
    pept      41950    42123     Nin 57 (pept unknown;57)
    pept      42090    42272     Nin 60 (pept unknown;60)
    pept      42269    42439     Nin 56 (pept unknown;56)
    pept      42429    43043     Nin 204 (pept unknown;204)
    pept      43040    43246     Nin 68 (pept unknown;68)
    pept      43224    43889     Nin 221 (pept unknown;221)
    pept      43886    44509     Q (late gene regulator;207)
    pept      44621    44815     orf-64
    pept      45186    45509     S (cell lysis;107)
    pept      45493    45969     R (cell lysis;158)
    pept      45966    46427     Rz (cell lysis;153)
    mRNA      35582    18482 (c) mRNA-pl (alt.; via t'j4 terminator)
    mRNA      35582    18597 (c) mRNA-pl (alt.; via t'j3 terminator)
    mRNA      35582    18637 (c) mRNA-pl (alt.; via t'j2 terminator)
    mRNA      35582    18671 (c) mRNA-pl (alt.; via t'j1 terminator)
    mRNA      35582    27538 (c) mRNA-pl (alt.; via ti terminator)
    mRNA      35582    31262 (c) mRNA-pl (alt.; via tl3 terminator)
    mRNA      35582    33100 (c) mRNA-pl (alt.; via tl2d terminator)
    mRNA      35582    33141 (c) mRNA-pl (alt.; via tl2c terminator)
    mRNA      35582    33494 (c) mRNA-pl (alt.; via tl2b terminator)
    mRNA      35582    33930 (c) mRNA-pl (alt.; via tl2a terminator)
    mRNA      35582    34560 (c) mRNA-pl (alt.; via tl1 terminator)
    mRNA      29065    27538 (c) mRNA int (integration; 356; via ti terminator)
    mRNA      23231  < 23231 (c) mRNA-pbl
    mRNA      37940    35798 (c) mRNA-prm (via timm terminator)
    mRNA      38343    35798 (c) mRNA-pre (via timm terminator)
    mRNA      36256    35798 (c) mRNA-plit (via timm terminator)
    mRNA      38023    38135     mRNA-pr (alt.; via tr0 terminator)
    mRNA      38023    38315     mRNA-pr (alt.; via tr1a terminator)
    mRNA      38023    38337     mRNA-pr (alt.; via tr1b terminator)
    mRNA      38023    38370     mRNA-pr (alt.; via tr1c terminator)
    mRNA      38023    40624     mRNA-pr (alt.; via tr2 terminator)
    mRNA      38675    38599 (c) mRNA-oop transcription mRNA
    mRNA      44587    44780     mRNA-pr' transcription (late genes) mRNA
    binding   27583    27602     int binding site 1
    binding   27615    27634     int binding site 2
    binding   27714    27747     int binding site 3
    binding   27814    27854     int binding site 4
    binding   35534    35518 (c) Nutl(N-utilization leftward)putative binding
                                 site
    binding   38265    38281     Nutr(N-utilization rightward)putative binding
                                 site
    binding   39034    39051     ori iteron 1(O binding site)
    binding   39054    39071     ori iteron 2(O binding site)
    binding   39078    39095     ori iteron 3(O binding site)
    binding   39101    39118     ori iteron 4(O binding site)
    mut       19368    23278     lac5 substitution
    mut       21737  > 21737     b2 substitution terminating at the att site
    recomb    24389    24390     lambda::lambdoid hybridization site
    mut       27537    27537     t in sib3 , c in wild-type
    mut       27547    27547     a in hef13 , g in wild-type
    mut       27568    27568     a in sib2 , c in wild-type
    mut       27573    27573     t in sib1 , g in wild-type
    recomb    27723    27738     attachment core(att)for host chromosome
                                 insertion
    mut       29063    29063     a in xis am6 , g in wild-type
    recomb    31266    31267     lambda::lambdoid hybridization site
    mut       34378    38617     imm21 region
    mut       35528    35528     a in Nutl63,g in Nutl96,t in Nutl18,c in
                                 wild-type
    mut       35530    35530     g in wild-type deleted in Nutl3
    mut       35583    38245     imm434 region
    mut       35596    35596     a in vir2, t in v003, c in wild-type
    mut       35606    35606     c in vir101 , t in wild-type
    signal    35607    35591 (c) operator-l1 (first base on comp strand)
    mut       35621    35621     t in v305 , c in wild-type
    mut       35622    35622     t in v305 , g in wild-type
    signal    35631    35615 (c) operator-l2 (first base on comp strand)
    signal    35651    35635 (c) operator-l3 (first base on comp strand)
    mut       35940    35940     a in rex209 , g in wild-type
    mut       35947    35947     a in rex111 , g in wild-type
    mut       37287    37287     a in cIam14, c in wild-type
    mut       37308    37308     c in cIam504, g in wild-type
    mut       37313    37313     a in cIam505, g in wild-type
    variant   37589    37589     t in strain cI857s7([2]); c in wild type
    mut       37589    37589     t in ind1 , c in wild-type
    mut       37629    37629     c in cIam499, g in wild-type
    mut       37635    37635     c in cIam212, a in wild-type
    mut       37680    37680     a in cIam34, c in wild-type
    variant   37742    37742     t in strain ci857s7([2]); c in wild-type
    mut       37742    37742     t in ci857 , c in wild-type
    mut       37808    37808     a in cIam282, g in wild-type
    mut       37872    37872     c in cIam302, a in wild-type
    signal    37951    37967     operator-r3
    mut       37954    37954     t in prm-e37 , c in wild-type
    mut       37955    37955     g in vc3 , a in wild-type
    mut       37957    37957     t in or3-r1 , c in wild-type
    mut       37958    37958     t in or3-r2, a in or3-r3 mutants, g in
                                 wild-type
    mut       37965    37965     g in or3-c12 , a in wild-type
    mut       37966    37966     c in or3-c10 , t in wild-type
    mut       37971    37971     g inp-rmup-1 , a in wild-type
    mut       37973    37973     t in prm-m104, 116, u31 mutants, c in
                                 wild-type
    signal    37974    37990     operator-r2
    mut       37978    37978     t in prm-e104, g in vc3, a in wild-type
    mut       37979    37979     a in virl, t in prm-e93, c in wild-type
    mut       37985    37985     t in vn , g in wild-type
    mut       37989    37989     t deleted in mah4 mutant
    mut       37990    37990     g deleted in mch9 mutant
    mut       37991    37991     g in pr-x3 , a in wild-type
    signal    37998    38014     operator-r1
    mut       38003    38003     a in vs326 , c in wild-type
    mut       38007    38007     t in prm-uv8, a in vir3, c in wild-type
    mut       38008    38008     a in prm-uv93, m36 mutants, g in wild-type
    mut       38009    38009     c in vs387, t in vc1, g in wild-type
    site      38249    38266     rho utilization site A (rutA)
    site      38282    38301     rho utilization site B (rutB)
    mut       38302    38302     a in cin-1 , g in wild-type
    mut       38306    38306     c in cnc1 , t in wild-type
    mut       38307    38307     g in cnc8 , a in wild-type
    mut       38350    38350     g in cy3048, a in wild-type
    mut       38354    38354     c in cy2001, t in wild-type
    mut       38357    38357     t in cy3019, c in wild-type
    mut       38364    38364     g in can1 , t in wild-type
    mut       38370    38370     t in cy3003 , c in wild-type
    mut       38371    38371     t in cy42 , a in wild-type
    mut       38376    38376     g in cy844 , a in wild-type
    mut       38379    38379     a in cy3008 , g in wild-type
    mut       38380    38380     t in cy3001 , c in wild-type
    mut       38430    38430     c in cII2002 , t in wild-type
    signal    38543    38557     ice(inceptor signal for DNA replication)
    mut       39122    39122     a in ti-12 , c in wild-type
    recomb    39157    39158     lambda::lambdoid hybridization site
    recomb    39165    39166     lambda::lambdoid hybridization site
    mut       39268    39268     t in ric5b , c in wild-type
    mut       39292    39292     a in ric5b , g in wild-type
    mut       40501    43307     Nin5 substitution
    variant   43082    43082     a in strain cI857s7 ([2]); g in wild-type
    unsure    43082    43082     g or a, cited in [8]
    recomb    43884    43885     lambda::lambdoid hybridization site
    variant   45352    45352     a in strain cI857s7 ([2]); g in wild-type
    mut       45352    45352     a in sam7 , g in wild-type
BASE COUNT    12334 a  11362 c  12820 g  11986 t
ORIGIN      5' end of the l-strand of the lambda chromosome (sticky end).
        1 gggcggcgac ctcgcgggtt ttcgctattt atgaaaattt tccggtttaa ggcgtttccg
       61 ttcttcttcg tcataactta atgtttttat ttaaaatacc ctctgaaaag aaaggaaacg
      121 acaggtgctg aaagcgaggc tttttggcct ctgtcgtttc ctttctctgt ttttgtccgt
      181 ggaatgaaca atggaagtca acaaaaagca gctggctgac attttcggtg cgagtatccg
      241 taccattcag aactggcagg aacagggaat gcccgttctg cgaggcggtg gcaagggtaa
      301 tgaggtgctt tatgactctg ccgccgtcat aaaatggtat gccgaaaggg atgctgaaat
      361 tgagaacgaa aagctgcgcc gggaggttga agaactgcgg caggccagcg aggcagatct
      421 ccagccagga actattgagt acgaacgcca tcgacttacg cgtgcgcagg ccgacgcaca
      481 ggaactgaag aatgccagag actccgctga agtggtggaa accgcattct gtactttcgt
      541 gctgtcgcgg atcgcaggtg aaattgccag tattctcgac gggctccccc tgtcggtgca
      601 gcggcgtttt ccggaactgg aaaaccgaca tgttgatttc ctgaaacggg atatcatcaa
      661 agccatgaac aaagcagccg cgctggatga actgataccg gggttgctga gtgaatatat
      721 cgaacagtca ggttaacagg ctgcggcatt ttgtccgcgc cgggcttcgc tcactgttca
      781 ggccggagcc acagaccgcc gttgaatggg cggatgctaa ttactatctc ccgaaagaat
      841 ccgcatacca ggaagggcgc tgggaaacac tgccctttca gcgggccatc atgaatgcga
      901 tgggcagcga ctacatccgt gaggtgaatg tggtgaagtc tgcccgtgtc ggttattcca
      961 aaatgctgct gggtgtttat gcctacttta tagagcataa gcagcgcaac acccttatct
     1021 ggttgccgac ggatggtgat gccgagaact ttatgaaaac ccacgttgag ccgactattc
     1081 gtgatattcc gtcgctgctg gcgctggccc cgtggtatgg caaaaagcac cgggataaca
     1141 cgctcaccat gaagcgtttc actaatgggc gtggcttctg gtgcctgggc ggtaaagcgg
     1201 caaaaaacta ccgtgaaaag tcggtggatg tggcgggtta tgatgaactt gctgcttttg
     1261 atgatgatat tgaacaggaa ggctctccga cgttcctggg tgacaagcgt attgaaggct
     1321 cggtctggcc aaagtccatc cgtggctcca cgccaaaagt gagaggcacc tgtcagattg
     1381 agcgtgcagc cagtgaatcc ccgcatttta tgcgttttca tgttgcctgc ccgcattgcg
     1441 gggaggagca gtatcttaaa tttggcgaca aagagacgcc gtttggcctc aaatggacgc
     1501 cggatgaccc ctccagcgtg ttttatctct gcgagcataa tgcctgcgtc atccgccagc
     1561 aggagctgga ctttactgat gcccgttata tctgcgaaaa gaccgggatc tggacccgtg
     1621 atggcattct ctggttttcg tcatccggtg aagagattga gccacctgac agtgtgacct
     1681 ttcacatctg gacagcgtac agcccgttca ccacctgggt gcagattgtc aaagactgga
     1741 tgaaaacgaa aggggatacg ggaaaacgta aaaccttcgt aaacaccacg ctcggtgaga
     1801 cgtgggaggc gaaaattggc gaacgtccgg atgctgaagt gatggcagag cggaaagagc
     1861 attattcagc gcccgttcct gaccgtgtgg cttacctgac cgccggtatc gactcccagc
     1921 tggaccgcta cgaaatgcgc gtatggggat gggggccggg tgaggaaagc tggctgattg
     1981 accggcagat tattatgggc cgccacgacg atgaacagac gctgctgcgt gtggatgagg
     2041 ccatcaataa aacctatacc cgccggaatg gtgcagaaat gtcgatatcc cgtatctgct
     2101 gggatactgg cgggattgac ccgaccattg tgtatgaacg ctcgaaaaaa catgggctgt
     2161 tccgggtgat ccccattaaa ggggcatccg tctacggaaa gccggtggcc agcatgccac
     2221 gtaagcgaaa caaaaacggg gtttacctta ccgaaatcgg tacggatacc gcgaaagagc
     2281 agatttataa ccgcttcaca ctgacgccgg aaggggatga accgcttccc ggtgccgttc
     2341 acttcccgaa taacccggat atttttgatc tgaccgaagc gcagcagctg actgctgaag
     2401 agcaggtcga aaaatgggtg gatggcagga aaaaaatact gtgggacagc aaaaagcgac
     2461 gcaatgaggc actcgactgc ttcgtttatg cgctggcggc gctgcgcatc agtatttccc
     2521 gctggcagct ggatctcagt gcgctgctgg cgagcctgca ggaagaggat ggtgcagcaa
     2581 ccaacaagaa aacactggca gattacgccc gtgccttatc cggagaggat gaatgacgcg
     2641 acaggaagaa cttgccgctg cccgtgcggc actgcatgac ctgatgacag gtaaacgggt
     2701 ggcaacagta cagaaagacg gacgaagggt ggagtttacg gccacttccg tgtctgacct
     2761 gaaaaaatat attgcagagc tggaagtgca gaccggcatg acacagcgac gcaggggacc
     2821 tgcaggattt tatgtatgaa aacgcccacc attcccaccc ttctggggcc ggacggcatg
     2881 acatcgctgc gcgaatatgc cggttatcac ggcggtggca gcggatttgg agggcagttg
     2941 cggtcgtgga acccaccgag tgaaagtgtg gatgcagccc tgttgcccaa ctttacccgt
     3001 ggcaatgccc gcgcagacga tctggtacgc aataacggct atgccgccaa cgccatccag
     3061 ctgcatcagg atcatatcgt cgggtctttt ttccggctca gtcatcgccc aagctggcgc
     3121 tatctgggca tcggggagga agaagcccgt gccttttccc gcgaggttga agcggcatgg
     3181 aaagagtttg ccgaggatga ctgctgctgc attgacgttg agcgaaaacg cacgtttacc
     3241 atgatgattc gggaaggtgt ggccatgcac gcctttaacg gtgaactgtt cgttcaggcc
     3301 acctgggata ccagttcgtc gcggcttttc cggacacagt tccggatggt cagcccgaag
     3361 cgcatcagca acccgaacaa taccggcgac agccggaact gccgtgccgg tgtgcagatt
     3421 aatgacagcg gtgcggcgct gggatattac gtcagcgagg acgggtatcc tggctggatg
     3481 ccgcagaaat ggacatggat accccgtgag ttacccggcg ggcgcgcctc gttcattcac
     3541 gtttttgaac ccgtggagga cgggcagact cgcggtgcaa atgtgtttta cagcgtgatg
     3601 gagcagatga agatgctcga cacgctgcag aacacgcagc tgcagagcgc cattgtgaag
     3661 gcgatgtatg ccgccaccat tgagagtgag ctggatacgc agtcagcgat ggattttatt
     3721 ctgggcgcga acagtcagga gcagcgggaa aggctgaccg gctggattgg tgaaattgcc
     3781 gcgtattacg ccgcagcgcc ggtccggctg ggaggcgcaa aagtaccgca cctgatgccg
     3841 ggtgactcac tgaacctgca gacggctcag gatacggata acggctactc cgtgtttgag
     3901 cagtcactgc tgcggtatat cgctgccggg ctgggtgtct cgtatgagca gctttcccgg
     3961 aattacgccc agatgagcta ctccacggca cgggccagtg cgaacgagtc gtgggcgtac
     4021 tttatggggc ggcgaaaatt cgtcgcatcc cgtcaggcga gccagatgtt tctgtgctgg
     4081 ctggaagagg ccatcgttcg ccgcgtggtg acgttacctt caaaagcgcg cttcagtttt
     4141 caggaagccc gcagtgcctg ggggaactgc gactggatag gctccggtcg tatggccatc
     4201 gatggtctga aagaagttca ggaagcggtg atgctgatag aagccggact gagtacctac
     4261 gagaaagagt gcgcaaaacg cggtgacgac tatcaggaaa tttttgccca gcaggtccgt
     4321 gaaacgatgg agcgccgtgc agccggtctt aaaccgcccg cctgggcggc tgcagcattt
     4381 gaatccgggc tgcgacaatc aacagaggag gagaagagtg acagcagagc tgcgtaatct
     4441 cccgcatatt gccagcatgg cctttaatga gccgctgatg cttgaacccg cctatgcgcg
     4501 ggttttcttt tgtgcgcttg caggccagct tgggatcagc agcctgacgg atgcggtgtc
     4561 cggcgacagc ctgactgccc aggaggcact cgcgacgctg gcattatccg gtgatgatga
     4621 cggaccacga caggcccgca gttatcaggt catgaacggc atcgccgtgc tgccggtgtc
     4681 cggcacgctg gtcagccgga cgcgggcgct gcagccgtac tcggggatga ccggttacaa
     4741 cggcattatc gcccgtctgc aacaggctgc cagcgatccg atggtggacg gcattctgct
     4801 cgatatggac acgcccggcg ggatggtggc gggggcattt gactgcgctg acatcatcgc
     4861 ccgtgtgcgt gacataaaac cggtatgggc gcttgccaac gacatgaact gcagtgcagg
     4921 tcagttgctt gccagtgccg cctcccggcg tctggtcacg cagaccgccc ggacaggctc
     4981 catcggcgtc atgatggctc acagtaatta cggtgctgcg ctggagaaac agggtgtgga
     5041 aatcacgctg atttacagcg gcagccataa ggtggatggc aacccctaca gccatcttcc
     5101 ggatgacgtc cgggagacac tgcagtcccg gatggacgca acccgccaga tgtttgcgca
     5161 gaaggtgtcg gcatataccg gcctgtccgt gcaggttgtg ctggataccg aggctgcagt
     5221 gtacagcggt caggaggcca ttgatgccgg actggctgat gaacttgtta acagcaccga
     5281 tgcgatcacc gtcatgcgtg atgcactgga tgcacgtaaa tcccgtctct caggagggcg
     5341 aatgaccaaa gagactcaat caacaactgt ttcagccact gcttcgcagg ctgacgttac
     5401 tgacgtggtg ccagcgacgg agggcgagaa cgccagcgcg gcgcagccgg acgtgaacgc
     5461 gcagatcacc gcagcggttg cggcagaaaa cagccgcatt atggggatcc tcaactgtga
     5521 ggaggctcac ggacgcgaag aacaggcacg cgtgctggca gaaacccccg gtatgaccgt
     5581 gaaaacggcc cgccgcattc tggccgcagc accacagagt gcacaggcgc gcagtgacac
     5641 tgcgctggat cgtctgatgc agggggcacc ggcaccgctg gctgcaggta acccggcatc
     5701 tgatgccgtt aacgatttgc tgaacacacc agtgtaaggg atgtttatga cgagcaaaga
     5761 aacctttacc cattaccagc cgcagggcaa cagtgacccg gctcataccg caaccgcgcc
     5821 cggcggattg agtgcgaaag cgcctgcaat gaccccgctg atgctggaca cctccagccg
     5881 taagctggtt gcgtgggatg gcaccaccga cggtgctgcc gttggcattc ttgcggttgc
     5941 tgctgaccag accagcacca cgctgacgtt ctacaagtcc ggcacgttcc gttatgagga
     6001 tgtgctctgg ccggaggctg ccagcgacga gacgaaaaaa cggaccgcgt ttgccggaac
     6061 ggcaatcagc atcgtttaac tttacccttc atcactaaag gccgcctgtg cggctttttt
     6121 tacgggattt ttttatgtcg atgtacacaa ccgcccaact gctggcggca aatgagcaga
     6181 aatttaagtt tgatccgctg tttctgcgtc tctttttccg tgagagctat cccttcacca
     6241 cggagaaagt ctatctctca caaattccgg gactggtaaa catggcgctg tacgtttcgc
     6301 cgattgtttc cggtgaggtt atccgttccc gtggcggctc cacctctgaa tttacgccgg
     6361 gatatgtcaa gccgaagcat gaagtgaatc cgcagatgac cctgcgtcgc ctgccggatg
     6421 aagatccgca gaatctggcg gacccggctt accgccgccg tcgcatcatc atgcagaaca
     6481 tgcgtgacga agagctggcc attgctcagg tcgaagagat gcaggcagtt tctgccgtgc
     6541 ttaagggcaa atacaccatg accggtgaag ccttcgatcc ggttgaggtg gatatgggcc
     6601 gcagtgagga gaataacatc acgcagtccg gcggcacgga gtggagcaag cgtgacaagt
     6661 ccacgtatga cccgaccgac gatatcgaag cctacgcgct gaacgccagc ggtgtggtga
     6721 atatcatcgt gttcgatccg aaaggctggg cgctgttccg ttccttcaaa gccgtcaagg
     6781 agaagctgga tacccgtcgt ggctctaatt ccgagctgga gacagcggtg aaagacctgg
     6841 gcaaagcggt gtcctataag gggatgtatg gcgatgtggc catcgtcgtg tattccggac
     6901 agtacgtgga aaacggcgtc aaaaagaact tcctgccgga caacacgatg gtgctgggga
     6961 acactcaggc acgcggtctg cgcacctatg gctgcattca ggatgcggac gcacagcgcg
     7021 aaggcattaa cgcctctgcc cgttacccga aaaactgggt gaccaccggc gatccggcgc
     7081 gtgagttcac catgattcag tcagcaccgc tgatgctgct ggctgaccct gatgagttcg
     7141 tgtccgtaca actggcgtaa tcatggccct tcggggccat tgtttctctg tggaggagtc
     7201 catgacgaaa gatgaactga ttgcccgtct ccgctcgctg ggtgaacaac tgaaccgtga
     7261 tgtcagcctg acggggacga aagaagaact ggcgctccgt gtggcagagc tgaaagagga
     7321 gcttgatgac acggatgaaa ctgccggtca ggacacccct ctcagccggg aaaatgtgct
     7381 gaccggacat gaaaatgagg tgggatcagc gcagccggat accgtgattc tggatacgtc
     7441 tgaactggtc acggtcgtgg cactggtgaa gctgcatact gatgcacttc acgccacgcg
     7501 ggatgaacct gtggcatttg tgctgccggg aacggcgttt cgtgtctctg ccggtgtggc
     7561 agccgaaatg acagagcgcg gcctggccag aatgcaataa cgggaggcgc tgtggctgat
     7621 ttcgataacc tgttcgatgc tgccattgcc cgcgccgatg aaacgatacg cgggtacatg
     7681 ggaacgtcag ccaccattac atccggtgag cagtcaggtg cggtgatacg tggtgttttt
     7741 gatgaccctg aaaatatcag ctatgccgga cagggcgtgc gcgttgaagg ctccagcccg
     7801 tccctgtttg tccggactga tgaggtgcgg cagctgcggc gtggagacac gctgaccatc
     7861 ggtgaggaaa atttctgggt agatcgggtt tcgccggatg atggcggaag ttgtcatctc
     7921 tggcttggac ggggcgtacc gcctgccgtt aaccgtcgcc gctgaaaggg ggatgtatgg
     7981 ccataaaagg tcttgagcag gccgttgaaa acctcagccg tatcagcaaa acggcggtgc
     8041 ctggtgccgc cgcaatggcc attaaccgcg ttgcttcatc cgcgatatcg cagtcggcgt
     8101 cacaggttgc ccgtgagaca aaggtacgcc ggaaactggt aaaggaaagg gccaggctga
     8161 aaagggccac ggtcaaaaat ccgcaggcca gaatcaaagt taaccggggg gatttgcccg
     8221 taatcaagct gggtaatgcg cgggttgtcc tttcgcgccg caggcgtcgt aaaaaggggc
     8281 agcgttcatc cctgaaaggt ggcggcagcg tgcttgtggt gggtaaccgt cgtattcccg
     8341 gcgcgtttat tcagcaactg aaaaatggcc ggtggcatgt catgcagcgt gtggctggga
     8401 aaaaccgtta ccccattgat gtggtgaaaa tcccgatggc ggtgccgctg accacggcgt
     8461 ttaaacaaaa tattgagcgg atacggcgtg aacgtcttcc gaaagagctg ggctatgcgc
     8521 tgcagcatca actgaggatg gtaataaagc gatgaaacat actgaactcc gtgcagccgt
     8581 actggatgca ctggagaagc atgacaccgg ggcgacgttt tttgatggtc gccccgctgt
     8641 ttttgatgag gcggattttc cggcagttgc cgtttatctc accggcgctg aatacacggg
     8701 cgaagagctg gacagcgata cctggcaggc ggagctgcat atcgaagttt tcctgcctgc
     8761 tcaggtgccg gattcagagc tggatgcgtg gatggagtcc cggatttatc cggtgatgag
     8821 cgatatcccg gcactgtcag atttgatcac cagtatggtg gccagcggct atgactaccg
     8881 gcgcgacgat gatgcgggct tgtggagttc agccgatctg acttatgtca ttacctatga
     8941 aatgtgagga cgctatgcct gtaccaaatc ctacaatgcc ggtgaaaggt gccgggacca
     9001 ccctgtgggt ttataagggg agcggtgacc cttacgcgaa tccgctttca gacgttgact
     9061 ggtcgcgtct ggcaaaagtt aaagacctga cgcccggcga actgaccgct gagtcctatg
     9121 acgacagcta tctcgatgat gaagatgcag actggactgc gaccgggcag gggcagaaat
     9181 ctgccggaga taccagcttc acgctggcgt ggatgcccgg agagcagggg cagcaggcgc
     9241 tgctggcgtg gtttaatgaa ggcgataccc gtgcctataa aatccgcttc ccgaacggca
     9301 cggtcgatgt gttccgtggc tgggtcagca gtatcggtaa ggcggtgacg gcgaaggaag
     9361 tgatcacccg cacggtgaaa gtcaccaatg tgggacgtcc gtcgatggca gaagatcgca
     9421 gcacggtaac agcggcaacc ggcatgaccg tgacgcctgc cagcacctcg gtggtgaaag
     9481 ggcagagcac cacgctgacc gtggccttcc agccggaggg cgtaaccgac aagagctttc
     9541 gtgcggtgtc tgcggataaa acaaaagcca ccgtgtcggt cagtggtatg accatcaccg
     9601 tgaacggcgt tgctgcaggc aaggtcaaca ttccggttgt atccggtaat ggtgagtttg
     9661 ctgcggttgc agaaattacc gtcaccgcca gttaatccgg agagtcagcg atgttcctga
     9721 aaaccgaatc atttgaacat aacggtgtga ccgtcacgct ttctgaactg tcagccctgc
     9781 agcgcattga gcatctcgcc ctgatgaaac ggcaggcaga acaggcggag tcagacagca
     9841 accggaagtt tactgtggaa gacgccatca gaaccggcgc gtttctggtg gcgatgtccc
     9901 tgtggcataa ccatccgcag aagacgcaga tgccgtccat gaatgaagcc gttaaacaga
     9961 ttgagcagga agtgcttacc acctggccca cggaggcaat ttctcatgct gaaaacgtgg
    10021 tgtaccggct gtctggtatg tatgagtttg tggtgaataa tgcccctgaa cagacagagg
    10081 acgccgggcc cgcagagcct gtttctgcgg gaaagtgttc gacggtgagc tgagttttgc
    10141 cctgaaactg gcgcgtgaga tggggcgacc cgactggcgt gccatgcttg ccgggatgtc
    10201 atccacggag tatgccgact ggcaccgctt ttacagtacc cattattttc atgatgttct
    10261 gctggatatg cacttttccg ggctgacgta caccgtgctc agcctgtttt tcagcgatcc
    10321 ggatatgcat ccgctggatt tcagtctgct gaaccggcgc gaggctgacg aagagcctga
    10381 agatgatgtg ctgatgcaga aagcggcagg gcttgccgga ggtgtccgct ttggcccgga
    10441 cgggaatgaa gttatccccg cttccccgga tgtggcggac atgacggagg atgacgtaat
    10501 gctgatgaca gtatcagaag ggatcgcagg aggagtccgg tatggctgaa ccggtaggcg
    10561 atctggtcgt tgatttgagt ctggatgcgg ccagatttga cgagcagatg gccagagtca
    10621 ggcgtcattt ttctggtacg gaaagtgatg cgaaaaaaac agcggcagtc gttgaacagt
    10681 cgctgagccg acaggcgctg gctgcacaga aagcggggat ttccgtcggg cagtataaag
    10741 ccgccatgcg tatgctgcct gcacagttca ccgacgtggc cacgcagctt gcaggcgggc
    10801 aaagtccgtg gctgatcctg ctgcaacagg gggggcaggt gaaggactcc ttcggcggga
    10861 tgatccccat gttcaggggg cttgccggtg cgatcaccct gccgatggtg ggggccacct
    10921 cgctggcggt ggcgaccggt gcgctggcgt atgcctggta tcagggcaac tcaaccctgt
    10981 ccgatttcaa caaaacgctg gtcctttccg gcaatcaggc gggactgacg gcagatcgta
    11041 tgctggtcct gtccagagcc gggcaggcgg cagggctgac gtttaaccag accagcgagt
    11101 cactcagcgc actggttaag gcgggggtaa gcggtgaggc tcagattgcg tccatcagcc
    11161 agagtgtggc gcgtttctcc tctgcatccg gcgtggaggt ggacaaggtc gctgaagcct
    11221 tcgggaagct gaccacagac ccgacgtcgg ggctgacggc gatggctcgc cagttccata
    11281 acgtgtcggc ggagcagatt gcgtatgttg ctcagttgca gcgttccggc gatgaagccg
    11341 gggcattgca ggcggcgaac gaggccgcaa cgaaagggtt tgatgaccag acccgccgcc
    11401 tgaaagagaa catgggcacg ctggagacct gggcagacag gactgcgcgg gcattcaaat
    11461 ccatgtggga tgcggtgctg gatattggtc gtcctgatac cgcgcaggag atgctgatta
    11521 aggcagaggc tgcgtataag aaagcagacg acatctggaa tctgcgcaag gatgattatt
    11581 ttgttaacga tgaagcgcgg gcgcgttact gggatgatcg tgaaaaggcc cgtcttgcgc
    11641 ttgaagccgc ccgaaagaag gctgagcagc agactcaaca ggacaaaaat gcgcagcagc
    11701 agagcgatac cgaagcgtca cggctgaaat ataccgaaga ggcgcagaag gcttacgaac
    11761 ggctgcagac gccgctggag aaatataccg cccgtcagga agaactgaac aaggcactga
    11821 aagacgggaa aatcctgcag gcggattaca acacgctgat ggcggcggcg aaaaaggatt
    11881 atgaagcgac gctgaaaaag ccgaaacagt ccagcgtgaa ggtgtctgcg ggcgatcgtc
    11941 aggaagacag tgctcatgct gccctgctga cgcttcaggc agaactccgg acgctggaga
    12001 agcatgccgg agcaaatgag aaaatcagcc agcagcgccg ggatttgtgg aaggcggaga
    12061 gtcagttcgc ggtactggag gaggcggcgc aacgtcgcca gctgtctgca caggagaaat
    12121 ccctgctggc gcataaagat gagacgctgg agtacaaacg ccagctggct gcacttggcg
    12181 acaaggttac gtatcaggag cgcctgaacg cgctggcgca gcaggcggat aaattcgcac
    12241 agcagcaacg ggcaaaacgg gccgccattg atgcgaaaag ccgggggctg actgaccggc
    12301 aggcagaacg ggaagccacg gaacagcgcc tgaaggaaca gtatggcgat aatccgctgg
    12361 cgctgaataa cgtcatgtca gagcagaaaa agacctgggc ggctgaagac cagcttcgcg
    12421 ggaactggat ggcaggcctg aagtccggct ggagtgagtg ggaagagagc gccacggaca
    12481 gtatgtcgca ggtaaaaagt gcagccacgc agacctttga tggtattgca cagaatatgg
    12541 cggcgatgct gaccggcagt gagcagaact ggcgcagctt cacccgttcc gtgctgtcca
    12601 tgatgacaga aattctgctt aagcaggcaa tggtggggat tgtcgggagt atcggcagcg
    12661 ccattggcgg ggctgttggt ggcggcgcat ccgcgtcagg cggtacagcc attcaggccg
    12721 ctgcggcgaa attccatttt gcaaccggag gatttacggg aaccggcggc aaatatgagc
    12781 cagcggggat tgttcaccgt ggtgagtttg tcttcacgaa ggaggcaacc agccggattg
    12841 gcgtggggaa tctttaccgg ctgatgcgcg gctatgccac cggcggttat gtcggtacac
    12901 cgggcagcat ggcagacagc cggtcgcagg cgtccgggac gtttgagcag aataaccatg
    12961 tggtgattaa caacgacggc acgaacgggc agataggtcc ggctgctctg aaggcggtgt
    13021 atgacatggc ccgcaagggt gcccgtgatg aaattcagac acagatgcgt gatggtggcc
    13081 tgttctccgg aggtggacga tgaagacctt ccgctggaaa gtgaaacccg gtatggatgt
    13141 ggcttcggtc ccttctgtaa gaaaggtgcg ctttggtgat ggctattctc agcgagcgcc
    13201 tgccgggctg aatgccaacc tgaaaacgta cagcgtgacg ctttctgtcc cccgtgagga
    13261 ggccacggta ctggagtcgt ttctggaaga gcacgggggc tggaaatcct ttctgtggac
    13321 gccgccttat gagtggcggc agataaaggt gacctgcgca aaatggtcgt cgcgggtcag
    13381 tatgctgcgt gttgagttca gcgcagagtt tgaacaggtg gtgaactgat gcaggatatc
    13441 cggcaggaaa cactgaatga atgcacccgt gcggagcagt cggccagcgt ggtgctctgg
    13501 gaaatcgacc tgacagaggt cggtggagaa cgttattttt tctgtaatga gcagaacgaa
    13561 aaaggtgagc cggtcacctg gcaggggcga cagtatcagc cgtatcccat tcaggggagc
    13621 ggttttgaac tgaatggcaa aggcaccagt acgcgcccca cgctgacggt ttctaacctg
    13681 tacggtatgg tcaccgggat ggcggaagat atgcagagtc tggtcggcgg aacggtggtc
    13741 cggcgtaagg tttacgcccg ttttctggat gcggtgaact tcgtcaacgg aaacagttac
    13801 gccgatccgg agcaggaggt gatcagccgc tggcgcattg agcagtgcag cgaactgagc
    13861 gcggtgagtg cctcctttgt actgtccacg ccgacggaaa cggatggcgc tgtttttccg
    13921 ggacgtatca tgctggccaa cacctgcacc tggacctatc gcggtgacga gtgcggttat
    13981 agcggtccgg ctgtcgcgga tgaatatgac cagccaacgt ccgatatcac gaaggataaa
    14041 tgcagcaaat gcctgagcgg ttgtaagttc cgcaataacg tcggcaactt tggcggcttc
    14101 ctttccatta acaaactttc gcagtaaatc ccatgacaca gacagaatca gcgattctgg
    14161 cgcacgcccg gcgatgtgcg ccagcggagt cgtgcggctt cgtggtaagc acgccggagg
    14221 gggaaagata tttcccctgc gtgaatatct ccggtgagcc ggaggctatt tccgtatgtc
    14281 gccggaagac tggctgcagg cagaaatgca gggtgagatt gtggcgctgg tccacagcca
    14341 ccccggtggt ctgccctggc tgagtgaggc cgaccggcgg ctgcaggtgc agagtgattt
    14401 gccgtggtgg ctggtctgcc gggggacgat tcataagttc cgctgtgtgc cgcatctcac
    14461 cgggcggcgc tttgagcacg gtgtgacgga ctgttacaca ctgttccggg atgcttatca
    14521 tctggcgggg attgagatgc cggactttca tcgtgaggat gactggtggc gtaacggcca
    14581 gaatctctat ctggataatc tggaggcgac ggggctgtat caggtgccgt tgtcagcggc
    14641 acagccgggc gatgtgctgc tgtgctgttt tggttcatca gtgccgaatc acgccgcaat
    14701 ttactgcggc gacggcgagc tgctgcacca tattcctgaa caactgagca aacgagagag
    14761 gtacaccgac aaatggcagc gacgcacaca ctccctctgg cgtcaccggg catggcgcgc
    14821 atctgccttt acggggattt acaacgattt ggtcgccgca tcgaccttcg tgtgaaaacg
    14881 ggggctgaag ccatccgggc actggccaca cagctcccgg cgtttcgtca gaaactgagc
    14941 gacggctggt atcaggtacg gattgccggg cgggacgtca gcacgtccgg gttaacggcg
    15001 cagttacatg agactctgcc tgatggcgct gtaattcata ttgttcccag agtcgccggg
    15061 gccaagtcag gtggcgtatt ccagattgtc ctgggggctg ccgccattgc cggatcattc
    15121 tttaccgccg gagccaccct tgcagcatgg ggggcagcca ttggggccgg tggtatgacc
    15181 ggcatcctgt tttctctcgg tgccagtatg gtgctcggtg gtgtggcgca gatgctggca
    15241 ccgaaagcca gaactccccg tatacagaca acggataacg gtaagcagaa cacctatttc
    15301 tcctcactgg ataacatggt tgcccagggc aatgttctgc ctgttctgta cggggaaatg
    15361 cgcgtggggt cacgcgtggt ttctcaggag atcagcacgg cagacgaagg ggacggtggt
    15421 caggttgtgg tgattggtcg ctgatgcaaa atgttttatg tgaaaccgcc tgcgggcggt
    15481 tttgtcattt atggagcgtg aggaatgggt aaaggaagca gtaaggggca taccccgcgc
    15541 gaagcgaagg acaacctgaa gtccacgcag ttgctgagtg tgatcgatgc catcagcgaa
    15601 gggccgattg aaggtccggt ggatggctta aaaagcgtgc tgctgaacag tacgccggtg
    15661 ctggacactg aggggaatac caacatatcc ggtgtcacgg tggtgttccg ggctggtgag
    15721 caggagcaga ctccgccgga gggatttgaa tcctccggct ccgagacggt gctgggtacg
    15781 gaagtgaaat atgacacgcc gatcacccgc accattacgt ctgcaaacat cgaccgtctg
    15841 cgctttacct tcggtgtaca ggcactggtg gaaaccacct caaagggtga caggaatccg
    15901 tcggaagtcc gcctgctggt tcagatacaa cgtaacggtg gctgggtgac ggaaaaagac
    15961 atcaccatta agggcaaaac cacctcgcag tatctggcct cggtggtgat gggtaacctg
    16021 ccgccgcgcc cgtttaatat ccggatgcgc aggatgacgc cggacagcac cacagaccag
    16081 ctgcagaaca aaacgctctg gtcgtcatac actgaaatca tcgatgtgaa acagtgctac
    16141 ccgaacacgg cactggtcgg cgtgcaggtg gactcggagc agttcggcag ccagcaggtg
    16201 agccgtaatt atcatctgcg cgggcgtatt ctgcaggtgc cgtcgaacta taacccgcag
    16261 acgcggcaat acagcggtat ctgggacgga acgtttaaac cggcatacag caacaacatg
    16321 gcctggtgtc tgtgggatat gctgacccat ccgcgctacg gcatggggaa acgtcttggt
    16381 gcggcggatg tggataaatg ggcgctgtat gtcatcggcc agtactgcga ccagtcagtg
    16441 ccggacggct ttggcggcac ggagccgcgc atcacctgta atgcgtacct gaccacacag
    16501 cgtaaggcgt gggatgtgct cagcgatttc tgctcggcga tgcgctgtat gccggtatgg
    16561 aacgggcaga cgctgacgtt cgtgcaggac cgaccgtcgg ataagacgtg gacctataac
    16621 cgcagtaatg tggtgatgcc ggatgatggc gcgccgttcc gctacagctt cagcgccctg
    16681 aaggaccgcc ataatgccgt tgaggtgaac tggattgacc cgaacaacgg ctgggagacg
    16741 gcgacagagc ttgttgaaga tacgcaggcc attgcccgtt acggtcgtaa tgttacgaag
    16801 atggatgcct ttggctgtac cagccggggg caggcacacc gcgccgggct gtggctgatt
    16861 aaaacagaac tgctggaaac gcagaccgtg gatttcagcg tcggcgcaga agggcttcgc
    16921 catgtaccgg gcgatgttat tgaaatctgc gatgatgact atgccggtat cagcaccggt
    16981 ggtcgtgtgc tggcggtgaa cagccagacc cggacgctga cgctcgaccg tgaaatcacg
    17041 ctgccatcct ccggtaccgc gctgataagc ctggttgacg gaagtggcaa tccggtcagc
    17101 gtggaggttc agtccgtcac cgacggcgtg aaggtaaaag tgagccgtgt tcctgacggt
    17161 gttgctgaat acagcgtatg ggagctgaag ctgccgacgc tgcgccagcg actgttccgc
    17221 tgcgtgagta tccgtgagaa cgacgacggc acgtatgcca tcaccgccgt gcagcatgtg
    17281 ccggaaaaag aggccatcgt ggataacggg gcgcactttg acggcgaaca gagtggcacg
    17341 gtgaatggtg tcacgccgcc agcggtgcag cacctgaccg cagaagtcac tgcagacagc
    17401 ggggaatatc aggtgctggc gcgatgggac acaccgaagg tggtgaaggg cgtgagtttc
    17461 ctgctccgtc tgaccgtaac agcggacgac ggcagtgagc ggctggtcag cacggcccgg
    17521 acgacggaaa ccacataccg cttcacgcaa ctggcgctgg ggaactacag gctgacagtc
    17581 cgggcggtaa atgcgtgggg gcagcagggc gatccggcgt cggtatcgtt ccggattgcc
    17641 gcaccggcag caccgtcgag gattgagctg acgccgggct attttcagat aaccgccacg
    17701 ccgcatcttg ccgtttatga cccgacggta cagtttgagt tctggttctc ggaaaagcag
    17761 attgcggata tcagacaggt tgaaaccagc acgcgttatc ttggtacggc gctgtactgg
    17821 atagccgcca gtatcaatat caaaccgggc catgattatt acttttatat ccgcagtgtg
    17881 aacaccgttg gcaaatcggc attcgtggag gccgtcggtc gggcgagcga tgatgcggaa
    17941 ggttacctgg attttttcaa aggcaagata accgaatccc atctcggcaa ggagctgctg
    18001 gaaaaagtcg agctgacgga ggataacgcc agcagactgg aggagttttc gaaagagtgg
    18061 aaggatgcca gtgataagtg gaatgccatg tgggctgtca aaattgagca gaccaaagac
    18121 ggcaaacatt atgtcgcggg tattggcctc agcatggagg acacggagga aggcaaactg
    18181 agccagtttc tggttgccgc caatcgtatc gcatttattg acccggcaaa cgggaatgaa
    18241 acgccgatgt ttgtggcgca gggcaaccag atattcatga acgacgtgtt cctgaagcgc
    18301 ctgacggccc ccaccattac cagcggcggc aatcctccgg ccttttccct gacaccggac
    18361 ggaaagctga ccgctaaaaa tgcggatatc agtggcagtg tgaatgcgaa ctccgggacg
    18421 ctcagtaatg tgacgatagc tgaaaactgt acgataaacg gtacgctgag ggcggaaaaa
    18481 atcgtcgggg acattgtaaa ggcggcgagc gcggcttttc cgcgccagcg tgaaagcagt
    18541 gtggactggc cgtcaggtac ccgtactgtc accgtgaccg atgaccatcc ttttgatcgc
    18601 cagatagtgg tgcttccgct gacgtttcgc ggaagtaagc gtactgtcag cggcaggaca
    18661 acgtattcga tgtgttatct gaaagtactg atgaacggtg cggtgattta tgatggcgcg
    18721 gcgaacgagg cggtacaggt gttctcccgt attgttgaca tgccagcggg tcggggaaac
    18781 gtgatcctga cgttcacgct tacgtccaca cggcattcgg cagatattcc gccgtatacg
    18841 tttgccagcg atgtgcaggt tatggtgatt aagaaacagg cgctgggcat cagcgtggtc
    18901 tgagtgtgtt acagaggttc gtccgggaac gggcgtttta ttataaaaca gtgagaggtg
    18961 aacgatgcgt aatgtgtgta ttgccgttgc tgtctttgcc gcacttgcgg tgacagtcac
    19021 tccggcccgt gcggaaggtg gacatggtac gtttacggtg ggctattttc aagtgaaacc
    19081 gggtacattg ccgtcgttgt cgggcgggga taccggtgtg agtcatctga aagggattaa
    19141 cgtgaagtac cgttatgagc tgacggacag tgtgggggtg atggcttccc tggggttcgc
    19201 cgcgtcgaaa aagagcagca cagtgatgac cggggaggat acgtttcact atgagagcct
    19261 gcgtggacgt tatgtgagcg tgatggccgg accggtttta caaatcagta agcaggtcag
    19321 tgcgtacgcc atggccggag tggctcacag tcggtggtcc ggcagtacaa tggattaccg
    19381 taagacggaa atcactcccg ggtatatgaa agagacgacc actgccaggg acgaaagtgc
    19441 aatgcggcat acctcagtgg cgtggagtgc aggtatacag attaatccgg cagcgtccgt
    19501 cgttgttgat attgcttatg aaggctccgg cagtggcgac tggcgtactg acggattcat
    19561 cgttggggtc ggttataaat tctgattagc caggtaacac agtgttatga cagcccgccg
    19621 gaaccggtgg gcttttttgt ggggtgaata tggcagtaaa gatttcagga gtcctgaaag
    19681 acggcacagg aaaaccggta cagaactgca ccattcagct gaaagccaga cgtaacagca
    19741 ccacggtggt ggtgaacacg gtgggctcag agaatccgga tgaagccggg cgttacagca
    19801 tggatgtgga gtacggtcag tacagtgtca tcctgcaggt tgacggtttt ccaccatcgc
    19861 acgccgggac catcaccgtg tatgaagatt cacaaccggg gacgctgaat gattttctct
    19921 gtgccatgac ggaggatgat gcccggccgg aggtgctgcg tcgtcttgaa ctgatggtgg
    19981 aagaggtggc gcgtaacgcg tccgtggtgg cacagagtac ggcagacgcg aagaaatcag
    20041 ccggcgatgc cagtgcatca gctgctcagg tcgcggccct tgtgactgat gcaactgact
    20101 cagcacgcgc cgccagcacg tccgccggac aggctgcatc gtcagctcag gaagcgtcct
    20161 ccggcgcaga agcggcatca gcaaaggcca ctgaagcgga aaaaagtgcc gcagccgcag
    20221 agtcctcaaa aaacgcggcg gccaccagtg ccggtgcggc gaaaacgtca gaaacgaatg
    20281 ctgcagcgtc acaacaatca gccgccacgt ctgcctccac cgcggccacg aaagcgtcag
    20341 aggccgccac ttcagcacga gatgcggtgg cctcaaaaga ggcagcaaaa tcatcagaaa
    20401 cgaacgcatc atcaagtgcc ggtcgtgcag cttcctcggc aacggcggca gaaaattctg
    20461 ccagggcggc aaaaacgtcc gagacgaatg ccaggtcatc tgaaacagca gcggaacgga
    20521 gcgcctctgc cgcggcagac gcaaaaacag cggcggcggg gagtgcgtca acggcatcca
    20581 cgaaggcgac agaggctgcg ggaagtgcgg tatcagcatc gcagagcaaa agtgcggcag
    20641 aagcggcggc aatacgtgca aaaaattcgg caaaacgtgc agaagatata gcttcagctg
    20701 tcgcgcttga ggatgcggac acaacgagaa aggggatagt gcagctcagc agtgcaacca
    20761 acagcacgtc tgaaacgctt gctgcaacgc caaaggcggt taaggtggta atggatgaaa
    20821 cgaacagaaa agcccactgg acagtccggc actgaccgga acgccaacag caccaaccgc
    20881 gctcagggga acaaacaata cccagattgc gaacaccgct tttgtactgg ccgcgattgc
    20941 agatgttatc gacgcgtcac ctgacgcact gaatacgctg aatgaactgg ccgcagcgct
    21001 cgggaatgat ccagattttg ctaccaccat gactaacgcg cttgcgggta aacaaccgaa
    21061 gaatgcgaca ctgacggcgc tggcagggct ttccacggcg aaaaataaat taccgtattt
    21121 tgcggaaaat gatgccgcca gcctgactga actgactcag gttggcaggg atattctggc
    21181 aaaaaattcc gttgcagatg ttcttgaata ccttggggcc ggtgagaatt cggcctttcc
    21241 ggcaggtgcg ccgatcccgt ggccatcaga tatcgttccg tctggctacg tcctgatgca
    21301 ggggcaggcg tttgacaaat cagcctaccc aaaacttgct gtcgcgtatc catcgggtgt
    21361 gcttcctgat atgcgaggct ggacaatcaa ggggaaaccc gccagcggtc gtgctgtatt
    21421 gtctcaggaa caggatggaa ttaagtcgca cacccacagt gccagtgcat ccggtacgga
    21481 tttggggacg aaaaccacat cgtcgtttga ttacgggacg aaaacaacag gcagtttcga
    21541 ttacggcacc aaatcgacga ataacacggg ggctcatgct cacagtctga gcggttcaac
    21601 aggggccgcg ggtgctcatg cccacacaag tggtttaagg atgaacagtt ctggctggag
    21661 tcagtatgga acagcaacca ttacaggaag tttatccaca gttaaaggaa ccagcacaca
    21721 gggtattgct tatttatcga aaacggacag tcagggcagc cacagtcact cattgtccgg
    21781 tacagccgtg agtgccggtg cacatgcgca tacagttggt attggtgcgc accagcatcc
    21841 ggttgttatc ggtgctcatg cccattcttt cagtattggt tcacacggac acaccatcac
    21901 cgttaacgct gcgggtaacg cggaaaacac cgtcaaaaac attgcattta actatattgt
    21961 gaggcttgca taatggcatt cagaatgagt gaacaaccac ggaccataaa aatttataat
    22021 ctgctggccg gaactaatga atttattggt gaaggtgacg catatattcc gcctcatacc
    22081 ggtctgcctg caaacagtac cgatattgca ccgccagata ttccggctgg ctttgtggct
    22141 gttttcaaca gtgatgaggc atcgtggcat ctcgttgaag accatcgggg taaaaccgtc
    22201 tatgacgtgg cttccggcga cgcgttattt atttctgaac tcggtccgtt accggaaaat
    22261 tttacctggt tatcgccggg aggggaatat cagaagtgga acggcacagc ctgggtgaag
    22321 gatacggaag cagaaaaact gttccggatc cgggaggcgg aagaaacaaa aaaaagcctg
    22381 atgcaggtag ccagtgagca tattgcgccg cttcaggatg ctgcagatct ggaaattgca
    22441 acgaaggaag aaacctcgtt gctggaagcc tggaagaagt atcgggtgtt gctgaaccgt
    22501 gttgatacat caactgcacc tgatattgag tggcctgctg tccctgttat ggagtaatcg
    22561 ttttgtgata tgccgcagaa acgttgtatg aaataacgtt ctgcggttag ttagtatatt
    22621 gtaaagctga gtattggttt atttggcgat tattatcttc aggagaataa tggaagttct
    22681 atgactcaat tgttcatagt gtttacatca ccgccaattg cttttaagac tgaacgcatg
    22741 aaatatggtt tttcgtcatg ttttgagtct gctgttgata tttctaaagt cggttttttt
    22801 tcttcgtttt ctctaactat tttccatgaa atacattttt gattattatt tgaatcaatt
    22861 ccaattacct gaagtctttc atctataatt ggcattgtat gtattggttt attggagtag
    22921 atgcttgctt ttctgagcca tagctctgat atccaaatga agccataggc atttgttatt
    22981 ttggctctgt cagctgcata acgccaaaaa atatatttat ctgcttgatc ttcaaatgtt
    23041 gtattgatta aatcaattgg atggaattgt ttatcataaa aaattaatgt ttgaatgtga
    23101 taaccgtcct ttaaaaaagt cgtttctgca agcttggctg tatagtcaac taactcttct
    23161 gtcgaagtga tatttttagg cttatctacc agttttagac gctctttaat atcttcagga
    23221 attattttat tgtcatattg tatcatgcta aatgacaatt tgcttatgga gtaatctttt
    23281 aattttaaat aagttattct cctggcttca tcaaataaag agtcgaatga tgttggcgaa
    23341 atcacatcgt cacccattgg attgtttatt tgtatgccaa gagagttaca gcagttatac
    23401 attctgccat agattatagc taaggcatgt aataattcgt aatcttttag cgtattagcg
    23461 acccatcgtc tttctgattt aataatagat gattcagtta aatatgaagg taatttcttt
    23521 tgtgcaagtc tgactaactt ttttatacca atgtttaaca tactttcatt tgtaataaac
    23581 tcaatgtcat tttcttcaat gtaagatgaa ataagagtag cctttgcctc gctatacatt
    23641 tctaaatcgc cttgtttttc tatcgtattg cgagaatttt tagcccaagc cattaatgga
    23701 tcatttttcc atttttcaat aacattattg ttataccaaa tgtcatatcc tataatctgg
    23761 tttttgtttt tttgaataat aaatgttact gttcttgcgg tttggaggaa ttgattcaaa
    23821 ttcaagcgaa ataattcagg gtcaaaatat gtatcaatgc agcatttgag caagtgcgat
    23881 aaatctttaa gtcttctttc ccatggtttt ttagtcataa aactctccat tttgataggt
    23941 tgcatgctag atgctgatat attttagagg tgataaaatt aactgcttaa ctgtcaatgt
    24001 aatacaagtt gtttgatctt tgcaatgatt cttatcagaa accatatagt aaattagtta
    24061 cacaggaaat ttttaatatt attattatca ttcattatgt attaaaatta gagttgtggc
    24121 ttggctctgc taacacgttg ctcataggag atatggtaga gccgcagaca cgtcgtatgc
    24181 aggaacgtgc tgcggctggc tggtgaactt ccgatagtgc gggtgttgaa tgatttccag
    24241 ttgctaccga ttttacatat tttttgcatg agagaatttg taccacctcc caccgaccat
    24301 ctatgactgt acgccactgt ccctaggact gctatgtgcc ggagcggaca ttacaaacgt
    24361 ccttctcggt gcatgccact gttgccaatg acctgcctag gaattggtta gcaagttact
    24421 accggatttt gtaaaaacag ccctcctcat ataaaaagta ttcgttcact tccgataagc
    24481 gtcgtaattt tctatctttc atcatattct agatccctct gaaaaaatct tccgagtttg
    24541 ctaggcactg atacataact cttttccaat aattggggaa gtcattcaaa tctataatag
    24601 gtttcagatt tgcttcaata aattctgact gtagctgctg aaacgttgcg gttgaactat
    24661 atttccttat aacttttacg aaagagtttc tttgagtaat cacttcactc aagtgcttcc
    24721 ctgcctccaa acgatacctg ttagcaatat ttaatagctt gaaatgatga agagctctgt
    24781 gtttgtcttc ctgcctccag ttcgccgggc attcaacata aaaactgata gcacccggag
    24841 ttccggaaac gaaatttgca tatacccatt gctcacgaaa aaaaatgtcc ttgtcgatat
    24901 agggatgaat cgcttggtgt acctcatcta ctgcgaaaac ttgacctttc tctcccatat
    24961 tgcagtcgcg gcacgatgga actaaattaa taggcatcac cgaaaattca ggataatgtg
    25021 caataggaag aaaatgatct atattttttg tctgtcctat atcaccacaa aatggacatt
    25081 tttcacctga tgaaacaagc atgtcatcgt aatatgttct agcgggtttg tttttatctc
    25141 ggagattatt ttcataaagc ttttctaatt taacctttgt caggttacca actactaagg
    25201 ttgtaggctc aagagggtgt gtcctgtcgt aggtaaataa ctgacctgtc gagcttaata
    25261 ttctatattg ttgttctttc tgcaaaaaag tggggaagtg agtaatgaaa ttatttctaa
    25321 catttatctg catcatacct tccgagcatt tattaagcat ttcgctataa gttctcgctg
    25381 gaagaggtag ttttttcatt gtactttacc ttcatctctg ttcattatca tcgcttttaa
    25441 aacggttcga ccttctaatc ctatctgacc attataattt tttagaatgg tttcataaga
    25501 aagctctgaa tcaacggact gcgataataa gtggtggtat ccagaatttg tcacttcaag
    25561 taaaaacacc tcacgagtta aaacacctaa gttctcaccg aatgtctcaa tatccggacg
    25621 gataatattt attgcttctc ttgaccgtag gactttccac atgcaggatt ttggaacctc
    25681 ttgcagtact actggggaat gagttgcaat tattgctaca ccattgcgtg catcgagtaa
    25741 gtcgcttaat gttcgtaaaa aagcagagag caaaggtgga tgcagatgaa cctctggttc
    25801 atcgaataaa actaatgact tttcgccaac gacatctact aatcttgtga tagtaaataa
    25861 aacaattgca tgtccagagc tcattcgaag cagatatttc tggatattgt cataaaacaa
    25921 tttagtgaat ttatcatcgt ccacttgaat ctgtggttca ttacgtctta actcttcata
    25981 tttagaaatg aggctgatga gttccatatt tgaaaagttt tcatcactac ttagtttttt
    26041 gatagcttca agccagagtt gtctttttct atctactctc atacaaccaa taaatgctga
    26101 aatgaattct aagcggagat cgcctagtga ttttaaacta ttgctggcag cattcttgag
    26161 tccaatataa aagtattgtg taccttttgc tgggtcaggt tgttctttag gaggagtaaa
    26221 aggatcaaat gcactaaacg aaactgaaac aagcgatcga aaatatccct ttgggattct
    26281 tgactcgata agtctattat tttcagagaa aaaatattca ttgttttctg ggttggtgat
    26341 tgcaccaatc attccattca aaattgttgt tttaccacac ccattccgcc cgataaaagc
    26401 atgaatgttc gtgctgggca tagaattaac cgtcacctca aaaggtatag ttaaatcact
    26461 gaatccggga gcactttttc tattaaatga aaagtggaaa tctgacaatt ctggcaaacc
    26521 atttaacaca cgtgcgaact gtccatgaat ttctgaaaga gttacccctc taagtaatga
    26581 ggtgttaagg acgctttcat tttcaatgtc ggctaatcga tttggccata ctactaaatc
    26641 ctgaatagct ttaagaaggt tatgtttaaa accatcgctt aatttgctga gattaacata
    26701 gtagtcaatg ctttcaccta aggaaaaaaa catttcaggg agttgactga attttttatc
    26761 tattaatgaa taagtgctta cttcttcttt ttgacctaca aaaccaattt taacatttcc
    26821 gatatcgcat ttttcaccat gctcatcaaa gacagtaaga taaaacattg taacaaagga
    26881 atagtcattc caaccatctg ctcgtaggaa tgccttattt ttttctactg caggaatata
    26941 cccgcctctt tcaataacac taaactccaa catatagtaa cccttaattt tattaaaata
    27001 accgcaattt atttggcggc aacacaggat ctctctttta agttactctc tattacatac
    27061 gttttccatc taaaaattag tagtattgaa cttaacgggg catcgtattg tagttttcca
    27121 tatttagctt tctgcttcct tttggataac ccactgttat tcatgttgca tggtgcactg
    27181 tttataccaa cgatatagtc tattaatgca tatatagtat cgccgaacga ttagctcttc
    27241 aggcttctga agaagcgttt caagtactaa taagccgata gatagccacg gacttcgtag
    27301 ccatttttca taagtgttaa cttccgctcc tcgctcataa cagacattca ctacagttat
    27361 ggcggaaagg tatgcatgct gggtgtgggg aagtcgtgaa agaaaagaag tcagctgcgt
    27421 cgtttgacat cactgctatc ttcttactgg ttatgcaggt cgtagtgggt ggcacacaaa
    27481 gctttgcact ggattgcgag gctttgtgct tctctggagt gcgacaggtt tgatgacaaa
    27541 aaattagcgc aagaagacaa aaatcacctt gcgctaatgc tctgttacag gtcactaata
    27601 ccatctaagt agttgattca tagtgactgc atatgttgtg ttttacagta ttatgtagtc
    27661 tgttttttat gcaaaatcta atttaatata ttgatattta tatcatttta cgtttctcgt
    27721 tcagcttttt tatactaagt tggcattata aaaaagcatt gcttatcaat ttgttgcaac
    27781 gaacaggtca ctatcagtca aaataaaatc attatttgat ttcaattttg tcccactccc
    27841 tgcctctgtc atcacgatac tgtgatgcca tggtgtccga cttatgcccg agaagatgtt
    27901 gagcaaactt atcgcttatc tgcttctcat agagtcttgc agacaaactg cgcaactcgt
    27961 gaaaggtagg cggatcccct tcgaaggaaa gacctgatgc ttttcgtgcg cgcataaaat
    28021 accttgatac tgtgccggat gaaagcggtt cgcgacgagt agatgcaatt atggtttctc
    28081 cgccaagaat ctctttgcat ttatcaagtg tttccttcat tgatattccg agagcatcaa
    28141 tatgcaatgc tgttgggatg gcaattttta cgcctgtttt gctttgctcg acataaagat
    28201 atccatctac gatatcagac cacttcattt cgcataaatc accaactcgt tgcccggtaa
    28261 caacagccag ttccattgca agtctgagcc aacatggtga tgattctgct gcttgataaa
    28321 ttttcaggta ttcgtcagcc gtaagtcttg atctccttac ctctgatttt gctgcgcgag
    28381 tggcagcgac atggtttgtt gttatatggc cttcagctat tgcctctcgg aatgcatcgc
    28441 tcagtgttga tctgattaac ttggctgacg ccgccttgcc ctcgtctatg tatccattga
    28501 gcattgccgc aatttctttt gtggtgatgt cttcaagtgg agcatcaggc agacccctcc
    28561 ttattgcttt aattttgctc atgtaattta tgagtgtctt ctgcttgatt cctctgctgg
    28621 ccaggatttt ttcgtagcga tcaagccatg aatgtaacgt aacggaatta tcactgttga
    28681 ttctcgctgt cagaggcttg tgtttgtgtc ctgaaaataa ctcaatgttg gcctgtatag
    28741 cttcagtgat tgcgattcgc ctgtctctgc ctaatccaaa ctctttaccc gtccttgggt
    28801 ccctgtagca gtaatatcca ttgtttctta tataaaggtt agggggtaaa tcccggcgct
    28861 catgacttcg ccttcttccc atttctgatc ctcttcaaaa ggccacctgt tactggtcga
    28921 tttaagtcaa cctttaccgc tgattcgtgg aacagatact ctcttccatc cttaaccgga
    28981 ggtgggaata tcctgcattc ccgaacccat cgacgaactg tttcaaggct tcttggacgt
    29041 cgctggcgtg cgttccactc ctgaagtgtc aagtacatcg caaagtctcc gcaattacac
    29101 gcaagaaaaa accgccatca ggcggcttgg tgttctttca gttcttcaat tcgaatattg
    29161 gttacgtctg catgtgctat ctgcgcccat atcatccagt ggtcgtagca gtcgttgatg
    29221 ttctccgctt cgataactct gttgaatggc tctccattcc attctcctgt gactcggaag
    29281 tgcatttatc atctccataa aacaaaaccc gccgtagcga gttcagataa aataaatccc
    29341 cgcgagtgcg aggattgtta tgtaatattg ggtttaatca tctatatgtt ttgtacagag
    29401 agggcaagta tcgtttccac cgtactcgtg ataataattt tgcacggtat cagtcatttc
    29461 tcgcacattg cagaatgggg atttgtcttc attagactta taaaccttca tggaatattt
    29521 gtatgccgac tctatatcta taccttcatc tacataaaca ccttcgtgat gtctgcatgg
    29581 agacaagaca ccggatctgc acaacattga taacgcccaa tctttttgct cagactctaa
    29641 ctcattgata ctcatttata aactccttgc aatgtatgtc gtttcagcta aacggtatca
    29701 gcaatgttta tgtaaagaaa cagtaagata atactcaacc cgatgtttga gtacggtcat
    29761 catctgacac tacagactct ggcatcgctg tgaagacgac gcgaaattca gcattttcac
    29821 aagcgttatc ttttacaaaa ccgatctcac tctcctttga tgcgaatgcc agcgtcagac
    29881 atcatatgca gatactcacc tgcatcctga acccattgac ctccaacccc gtaatagcga
    29941 tgcgtaatga tgtcgatagt tactaacggg tcttgttcga ttaactgccg cagaaactct
    30001 tccaggtcac cagtgcagtg cttgataaca ggagtcttcc caggatggcg aacaacaaga
    30061 aactggtttc cgtcttcacg gacttcgttg ctttccagtt tagcaatacg cttactccca
    30121 tccgagataa caccttcgta atactcacgc tgctcgttga gttttgattt tgctgtttca
    30181 agctcaacac gcagtttccc tactgttagc gcaatatcct cgttctcctg gtcgcggcgt
    30241 ttgatgtatt gctggtttct ttcccgttca tccagcagtt ccagcacaat cgatggtgtt
    30301 accaattcat ggaaaaggtc tgcgtcaaat ccccagtcgt catgcattgc ctgctctgcc
    30361 gcttcacgca gtgcctgaga gttaatttcg ctcacttcga acctctctgt ttactgataa
    30421 gttccagatc ctcctggcaa cttgcacaag tccgacaacc ctgaacgacc aggcgtcttc
    30481 gttcatctat cggatcgcca cactcacaac aatgagtggc agatatagcc tggtggttca
    30541 ggcggcgcat ttttattgct gtgttgcgct gtaattcttc tatttctgat gctgaatcaa
    30601 tgatgtctgc catctttcat taatccctga actgttggtt aatacgcttg agggtgaatg
    30661 cgaataataa aaaaggagcc tgtagctccc tgatgatttt gcttttcatg ttcatcgttc
    30721 cttaaagacg ccgtttaaca tgccgattgc caggcttaaa tgagtcggtg tgaatcccat
    30781 cagcgttacc gtttcgcggt gcttcttcag tacgctacgg caaatgtcat cgacgttttt
    30841 atccggaaac tgctgtctgg ctttttttga tttcagaatt agcctgacgg gcaatgctgc
    30901 gaagggcgtt ttcctgctga ggtgtcattg aacaagtccc atgtcggcaa gcataagcac
    30961 acagaatatg aagcccgctg ccagaaaaat gcattccgtg gttgtcatac ctggtttctc
    31021 tcatctgctt ctgctttcgc caccatcatt tccagctttt gtgaaaggga tgcggctaac
    31081 gtatgaaatt cttcgtctgt ttctactggt attggcacaa acctgattcc aatttgagca
    31141 aggctatgtg ccatctcgat actcgttctt aactcaacag aagatgcttt gtgcatacag
    31201 cccctcgttt attatttatc tcctcagcca gccgctgtgc tttcagtgga tttcggataa
    31261 cagaaaggcc gggaaatacc cagcctcgct ttgtaacgga gtagacgaaa gtgattgcgc
    31321 ctacccggat attatcgtga ggatgcgtca tcgccattgc tccccaaata caaaaccaat
    31381 ttcagccagt gcctcgtcca ttttttcgat gaactccggc acgatctcgt caaaactcgc
    31441 catgtacttt tcatcccgct caatcacgac ataatgcagg ccttcacgct tcatacgcgg
    31501 gtcatagttg gcaaagtacc aggcattttt tcgcgtcacc cacatgctgt actgcacctg
    31561 ggccatgtaa gctgacttta tggcctcgaa accaccgagc cggaacttca tgaaatcccg
    31621 ggaggtaaac gggcatttca gttcaaggcc gttgccgtca ctgcataaac catcgggaga
    31681 gcaggcggta cgcatacttt cgtcgcgata gatgatcggg gattcagtaa cattcacgcc
    31741 ggaagtgaat tcaaacaggg ttctggcgtc gttctcgtac tgttttcccc aggccagtgc
    31801 tttagcgtta acttccggag ccacaccggt gcaaacctca gcaagcaggg tgtggaagta
    31861 ggacattttc atgtcaggcc acttctttcc ggagcggggt tttgctatca cgttgtgaac
    31921 ttctgaagcg gtgatgacgc cgagccgtaa tttgtgccac gcatcatccc cctgttcgac
    31981 agctctcaca tcgatcccgg tacgctgcag gataatgtcc ggtgtcatgc tgccaccttc
    32041 tgctctgcgg ctttctgttt caggaatcca agagctttta ctgcttcggc ctgtgtcagt
    32101 tctgacgatg cacgaatgtc gcggcgaaat atctgggaac agagcggcaa taagtcgtca
    32161 tcccatgttt tatccagggc gatcagcaga gtgttaatct cctgcatggt ttcatcgtta
    32221 accggagtga tgtcgcgttc cggctgacgt tctgcagtgt atgcagtatt ttcgacaatg
    32281 cgctcggctt catccttgtc atagatacca gcaaatccga aggccagacg ggcacactga
    32341 atcatggctt tatgacgtaa catccgtttg ggatgcgact gccacggccc cgtgatttct
    32401 ctgccttcgc gagttttgaa tggttcgcgg cggcattcat ccatccattc ggtaacgcag
    32461 atcggatgat tacggtcctt gcggtaaatc cggcatgtac aggattcatt gtcctgctca
    32521 aagtccatgc catcaaactg ctggttttca ttgatgatgc gggaccagcc atcaacgccc
    32581 accaccggaa cgatgccatt ctgcttatca ggaaaggcgt aaatttcttt cgtccacgga
    32641 ttaaggccgt actggttggc aacgatcagt aatgcgatga actgcgcatc gctggcatca
    32701 cctttaaatg ccgtctggcg aagagtggtg atcagttcct gtgggtcgac agaatccatg
    32761 ccgacacgtt cagccagctt cccagccagc gttgcgagtg cagtactcat tcgttttata
    32821 cctctgaatc aatatcaacc tggtggtgag caatggtttc aaccatgtac cggatgtgtt
    32881 ctgccatgcg ctcctgaaac tcaacatcgt catcaaacgc acgggtaatg gattttttgc
    32941 tggccccgtg gcgttgcaaa tgatcgatgc atagcgattc aaacaggtgc tggggcaggc
    33001 ctttttccat gtcgtctgcc agttctgcct ctttctcttc acgggcgagc tgctggtagt
    33061 gacgcgccca gctctgagcc tcaagacgat cctgaatgta ataagcgttc atggctgaac
    33121 tcctgaaata gctgtgaaaa tatcgcccgc gaaatgccgg gctgattagg aaaacaggaa
    33181 agggggttag tgaatgcttt tgcttgatct cagtttcagt attaatatcc attttttata
    33241 agcgtcgacg gcttcacgaa acatcttttc atcgccaata aaagtggcga tagtgaattt
    33301 agtctggata gccataagtg tttgatccat tctttgggac tcctggctga ttaagtatgt
    33361 cgataaggcg tttccatccg tcacgtaatt tacgggtgat tcgttcaagt aaagattcgg
    33421 aagggcagcc agcaacaggc caccctgcaa tggcatattg catggtgtgc tccttattta
    33481 tacataacga aaaacgcctc gagtgaagcg ttattggtat gcggtaaaac cgcactcagg
    33541 cggccttgat agtcatatca tctgaatcaa atattcctga tgtatcgata tcggtaattc
    33601 ttattccttc gctaccatcc attggaggcc atccttcctg accatttcca tcattccagt
    33661 cgaactcaca cacaacacca tatgcattta agtcgcttga aattgctata agcagagcat
    33721 gttgcgccag catgattaat acagcattta atacagagcc gtgtttattg agtcggtatt
    33781 cagagtctga ccagaaatta ttaatctggt gaagtttttc ctctgtcatt acgtcatggt
    33841 cgatttcaat ttctattgat gctttccagt cgtaatcaat gatgtatttt ttgatgtttg
    33901 acatctgttc atatcctcac agataaaaaa tcgccctcac actggagggc aaagaagatt
    33961 tccaataatc agaacaagtc ggctcctgtt tagttacgag cgacattgct ccgtgtattc
    34021 actcgttgga atgaatacac agtgcagtgt ttattctgtt atttatgcca aaaataaagg
    34081 ccactatcag gcagctttgt tgttctgttt accaagttct ctggcaatca ttgccgtcgt
    34141 tcgtattgcc catttatcga catatttccc atcttccatt acaggaaaca tttcttcagg
    34201 cttaaccatg cattccgatt gcagcttgca tccattgcat cgcttgaatt gtccacacca
    34261 ttgattttta tcaatagtcg tagtcatacg gatagtcctg gtattgttcc atcacatcct
    34321 gaggatgctc ttcgaactct tcaaattctt cttccatata tcaccttaaa tagtggattg
    34381 cggtagtaaa gattgtgcct gtcttttaac cacatcaggc tcggtggttc tcgtgtaccc
    34441 ctacagcgag aaatcggata aactattaca acccctacag tttgatgagt atagaaatgg
    34501 atccactcgt tattctcgga cgagtgttca gtaatgaacc tctggagaga accatgtata
    34561 tgatcgttat ctgggttgga cttctgcttt taagcccaga taactggcct gaatatgtta
    34621 atgagagaat cggtattcct catgtgtggc atgttttcgt ctttgctctt gcattttcgc
    34681 tagcaattaa tgtgcatcga ttatcagcta ttgccagcgc cagatataag cgatttaagc
    34741 taagaaaacg cattaagatg caaaacgata aagtgcgatc agtaattcaa aaccttacag
    34801 aagagcaatc tatggttttg tgcgcagccc ttaatgaagg caggaagtat gtggttacat
    34861 caaaacaatt cccatacatt agtgagttga ttgagcttgg tgtgttgaac aaaacttttt
    34921 cccgatggaa tggaaagcat atattattcc ctattgagga tatttactgg actgaattag
    34981 ttgccagcta tgatccatat aatattgaga taaagccaag gccaatatct aagtaactag
    35041 ataagaggaa tcgattttcc cttaattttc tggcgtccac tgcatgttat gccgcgttcg
    35101 ccaggcttgc tgtaccatgt gcgctgattc ttgcgctcaa tacgttgcag gttgctttca
    35161 atctgtttgt ggtattcagc cagcactgta aggtctatcg gatttagtgc gctttctact
    35221 cgtgatttcg gtttgcgatt cagcgagaga atagggcggt taactggttt tgcgcttacc
    35281 ccaaccaaca ggggatttgc tgctttccat tgagcctgtt tctctgcgcg acgttcgcgg
    35341 cggcgtgttt gtgcatccat ctggattctc ctgtcagtta gctttggtgg tgtgtggcag
    35401 ttgtagtcct gaacgaaaac cccccgcgat tggcacattg gcagctaatc cggaatcgca
    35461 cttacggcca atgcttcgtt tcgtatcaca caccccaaag ccttctgctt tgaatgctgc
    35521 ccttcttcag ggcttaattt ttaagagcgt caccttcatg gtggtcagtg cgtcctgctg
    35581 atgtgctcag tatcaccgcc agtggtattt atgtcaacac cgccagagat aatttatcac
    35641 cgcagatggt tatctgtatg ttttttatat gaatttattt tttgcagggg ggcattgttt
    35701 ggtaggtgag agatctgaat tgctatgttt agtgagttgt atctatttat ttttcaataa
    35761 atacaattgg ttatgtgttt tgggggcgat cgtgaggcaa agaaaacccg gcgctgaggc
    35821 cgggttattc ttgttctctg gtcaaattat atagttggaa aacaaggatg catatatgaa
    35881 tgaacgatgc agaggcaatg ccgatggcga tagtgggtat catgtagccg cttatgctgg
    35941 aaagaagcaa taacccgcag aaaaacaaag ctccaagctc aacaaaacta agggcataga
    36001 caataactac cgatgtcata tacccatact ctctaatctt ggccagtcgg cgcgttctgc
    36061 ttccgattag aaacgtcaag gcagcaatca ggattgcaat catggttcct gcatatgatg
    36121 acaatgtcgc cccaagacca tctctatgag ctgaaaaaga aacaccagga atgtagtggc
    36181 ggaaaaggag atagcaaatg cttacgataa cgtaaggaat tattactatg taaacaccag
    36241 gcatgattct gttccgcata attactcctg ataattaatc cttaactttg cccacctgcc
    36301 ttttaaaaca ttccagtata tcacttttca ttcttgcgta gcaatatgcc atctcttcag
    36361 ctatctcagc attggtgacc ttgttcagag gcgctgagag atggcctttt tctgatagat
    36421 aatgttctgt taaaatatct ccggcctcat cttttgcccg caggctaatg tctgaaaatt
    36481 gaggtgacgg gttaaaaata atatccttgg caaccttttt tatatccctt ttaaattttg
    36541 gcttaatgac tatatccaat gagtcaaaaa gctccccttc aatatctgtt gcccctaaga
    36601 cctttaatat atcgccaaat acaggtagct tggcttctac cttcaccgtt gttcggccga
    36661 tgaaatgcat atgcataaca tcgtctttgg tggttcccct catcagtggc tctatctgaa
    36721 cgcgctctcc actgcttaat gacattcctt tcccgattaa aaaatctgtc agatcggatg
    36781 tggtcggccc gaaaacagtt ctggcaaaac caatggtgtc gccttcaaca aacaaaaaag
    36841 atgggaatcc caatgattcg tcatctgcga ggctgttctt aatatcttca actgaagctt
    36901 tagagcgatt tatcttctga accagactct tgtcatttgt tttggtaaag agaaaagttt
    36961 ttccatcgat tttatgaata tacaaataat tggagccaac ctgcaggtga tgattatcag
    37021 ccagcagaga attaaggaaa acagacaggt ttattgagcg cttatctttc cctttatttt
    37081 tgctgcggta agtcgcataa aaaccattct tcataattca atccatttac tatgttatgt
    37141 tctgagggga gtgaaaattc ccctaattcg atgaagattc ttgctcaatt gttatcagct
    37201 atgcgccgac cagaacacct tgccgatcag ccaaacgtct cttcaggcca ctgactagcg
    37261 ataactttcc ccacaacgga acaactctca ttgcatggga tcattgggta ctgtgggttt
    37321 agtggttgta aaaacacctg accgctatcc ctgatcagtt tcttgaaggt aaactcatca
    37381 cccccaagtc tggctatgca gaaatcacct ggctcaacag cctgctcagg gtcaacgaga
    37441 attaacattc cgtcaggaaa gcttggcttg gagcctgttg gtgcggtcat ggaattacct
    37501 tcaacctcaa gccagaatgc agaatcactg gcttttttgg ttgtgcttac ccatctctcc
    37561 gcatcacctt tggtaaaggt tctaagctca ggtgagaaca tccctgcctg aacatgagaa
    37621 aaaacagggt actcatactc acttctaagt gacggctgca tactaaccgc ttcatacatc
    37681 tcgtagattt ctctggcgat tgaagggcta aattcttcaa cgctaacttt gagaattttt
    37741 gcaagcaatg cggcgttata agcatttaat gcattgatgc cattaaataa agcaccaacg
    37801 cctgactgcc ccatccccat cttgtctgcg acagattcct gggataagcc aagttcattt
    37861 ttcttttttt cataaattgc tttaaggcga cgtgcgtcct caagctgctc ttgtgttaat
    37921 ggtttctttt ttgtgctcat acgttaaatc tatcaccgca agggataaat atctaacacc
    37981 gtgcgtgttg actattttac ctctggcggt gataatggtt gcatgtacta aggaggttgt
    38041 atggaacaac gcataaccct gaaagattat gcaatgcgct ttgggcaaac caagacagct
    38101 aaagatctcg gcgtatatca aagcgcgatc aacaaggcca ttcatgcagg ccgaaagatt
    38161 tttttaacta taaacgctga tggaagcgtt tatgcggaag aggtaaagcc cttcccgagt
    38221 aacaaaaaaa caacagcata aataaccccg ctcttacaca ttccagccct gaaaaagggc
    38281 atcaaattaa accacaccta tggtgtatgc atttatttgc atacattcaa tcaattgtta
    38341 tctaaggaaa tacttacata tggttcgtgc aaacaaacgc aacgaggctc tacgaatcga
    38401 gagtgcgttg cttaacaaaa tcgcaatgct tggaactgag aagacagcgg aagctgtggg
    38461 cgttgataag tcgcagatca gcaggtggaa gagggactgg attccaaagt tctcaatgct
    38521 gcttgctgtt cttgaatggg gggtcgttga cgacgacatg gctcgattgg cgcgacaagt
    38581 tgctgcgatt ctcaccaata aaaaacgccc ggcggcaacc gagcgttctg aacaaatcca
    38641 gatggagttc tgaggtcatt actggatcta tcaacaggag tcattatgac aaatacagca
    38701 aaaatactca acttcggcag aggtaacttt gccggacagg agcgtaatgt ggcagatctc
    38761 gatgatggtt acgccagact atcaaatatg ctgcttgagg cttattcggg cgcagatctg
    38821 accaagcgac agtttaaagt gctgcttgcc attctgcgta aaacctatgg gtggaataaa
    38881 ccaatggaca gaatcaccga ttctcaactt agcgagatta caaagttacc tgtcaaacgg
    38941 tgcaatgaag ccaagttaga actcgtcaga atgaatatta tcaagcagca aggcggcatg
    39001 tttggaccaa ataaaaacat ctcagaatgg tgcatccctc aaaacgaggg aaaatcccct
    39061 aaaacgaggg ataaaacatc cctcaaattg ggggattgct atccctcaaa acagggggac
    39121 acaaaagaca ctattacaaa agaaaaaaga aaagattatt cgtcagagaa ttctggcgaa
    39181 tcctctgacc agccagaaaa cgacctttct gtggtgaaac cggatgctgc aattcagagc
    39241 ggcagcaagt gggggacagc agaagacctg accgccgcag agtggatgtt tgacatggtg
    39301 aagactatcg caccatcagc cagaaaaccg aattttgctg ggtgggctaa cgatatccgc
    39361 ctgatgcgtg aacgtgacgg acgtaaccac cgcgacatgt gtgtgctgtt ccgctgggca
    39421 tgccaggaca acttctggtc cggtaacgtg ctgagcccgg ccaaactccg cgataagtgg
    39481 acccaactcg aaatcaaccg taacaagcaa caggcaggcg tgacagccag caaaccaaaa
    39541 ctcgacctga caaacacaga ctggatttac ggggtggatc tatgaaaaac atcgccgcac
    39601 agatggttaa ctttgaccgt gagcagatgc gtcggatcgc caacaacatg ccggaacagt
    39661 acgacgaaaa gccgcaggta cagcaggtag cgcagatcat caacggtgtg ttcagccagt
    39721 tactggcaac tttcccggcg agcctggcta accgtgacca gaacgaagtg aacgaaatcc
    39781 gtcgccagtg ggttctggct tttcgggaaa acgggatcac cacgatggaa caggttaacg
    39841 caggaatgcg cgtagcccgt cggcagaatc gaccatttct gccatcaccc gggcagtttg
    39901 ttgcatggtg ccgggaagaa gcatccgtta ccgccggact gccaaacgtc agcgagctgg
    39961 ttgatatggt ttacgagtat tgccggaagc gaggcctgta tccggatgcg gagtcttatc
    40021 cgtggaaatc aaacgcgcac tactggctgg ttaccaacct gtatcagaac atgcgggcca
    40081 atgcgcttac tgatgcggaa ttacgccgta aggccgcaga tgagcttgtc catatgactg
    40141 cgagaattaa ccgtggtgag gcgatccctg aaccagtaaa acaacttcct gtcatgggcg
    40201 gtagacctct aaatcgtgca caggctctgg cgaagatcgc agaaatcaaa gctaagttcg
    40261 gactgaaagg agcaagtgta tgacgggcaa agaggcaatt attcattacc tggggacgca
    40321 taatagcttc tgtgcgccgg acgttgccgc gctaacaggc gcaacagtaa ccagcataaa
    40381 tcaggccgcg gctaaaatgg cacgggcagg tcttctggtt atcgaaggta aggtctggcg
    40441 aacggtgtat taccggtttg ctaccaggga agaacgggaa ggaaagatga gcacgaacct
    40501 ggtttttaag gagtgtcgcc agagtgccgc gatgaaacgg gtattggcgg tatatggagt
    40561 taaaagatga ccatctacat tactgagcta ataacaggcc tgctggtaat cgcaggcctt
    40621 tttatttggg ggagagggaa gtcatgaaaa aactaacctt tgaaattcga tctccagcac
    40681 atcagcaaaa cgctattcac gcagtacagc aaatccttcc agacccaacc aaaccaatcg
    40741 tagtaaccat tcaggaacgc aaccgcagct tagaccaaaa caggaagcta tgggcctgct
    40801 taggtgacgt ctctcgtcag gttgaatggc atggtcgctg gctggatgca gaaagctgga
    40861 agtgtgtgtt taccgcagca ttaaagcagc aggatgttgt tcctaacctt gccgggaatg
    40921 gctttgtggt aataggccag tcaaccagca ggatgcgtgt aggcgaattt gcggagctat
    40981 tagagcttat acaggcattc ggtacagagc gtggcgttaa gtggtcagac gaagcgagac
    41041 tggctctgga gtggaaagcg agatggggag acagggctgc atgataaatg tcgttagttt
    41101 ctccggtggc aggacgtcag catatttgct ctggctaatg gagcaaaagc gacgggcagg
    41161 taaagacgtg cattacgttt tcatggatac aggttgtgaa catccaatga catatcggtt
    41221 tgtcagggaa gttgtgaagt tctgggatat accgctcacc gtattgcagg ttgatatcaa
    41281 cccggagctt ggacagccaa atggttatac ggtatgggaa ccaaaggata ttcagacgcg
    41341 aatgcctgtt ctgaagccat ttatcgatat ggtaaagaaa tatggcactc catacgtcgg
    41401 cggcgcgttc tgcactgaca gattaaaact cgttcccttc accaaatact gtgatgacca
    41461 tttcgggcga gggaattaca ccacgtggat tggcatcaga gctgatgaac cgaagcggct
    41521 aaagccaaag cctggaatca gatatcttgc tgaactgtca gactttgaga aggaagatat
    41581 cctcgcatgg tggaagcaac aaccattcga tttgcaaata ccggaacatc tcggtaactg
    41641 catattctgc attaaaaaat caacgcaaaa aatcggactt gcctgcaaag atgaggaggg
    41701 attgcagcgt gtttttaatg aggtcatcac gggatcccat gtgcgtgacg gacatcggga
    41761 aacgccaaag gagattatgt accgaggaag aatgtcgctg gacggtatcg cgaaaatgta
    41821 ttcagaaaat gattatcaag ccctgtatca ggacatggta cgagctaaaa gattcgatac
    41881 cggctcttgt tctgagtcat gcgaaatatt tggagggcag cttgatttcg acttcgggag
    41941 ggaagctgca tgatgcgatg ttatcggtgc ggtgaatgca aagaagataa ccgcttccga
    42001 ccaaatcaac cttactggaa tcgatggtgt ctccggtgtg aaagaacacc aacaggggtg
    42061 ttaccactac cgcaggaaaa ggaggacgtg tggcgagaca gcgacgaagt atcaccgaca
    42121 taatctgcga aaactgcaaa taccttccaa cgaaacgcac cagaaataaa cccaagccaa
    42181 tcccaaaaga atctgacgta aaaaccttca actacacggc tcacctgtgg gatatccggt
    42241 ggctaagacg tcgtgcgagg aaaacaaggt gattgaccaa aatcgaagtt acgaacaaga
    42301 aagcgtcgag cgagctttaa cgtgcgctaa ctgcggtcag aagctgcatg tgctggaagt
    42361 tcacgtgtgt gagcactgct gcgcagaact gatgagcgat ccgaatagct cgatgcacga
    42421 ggaagaagat gatggctaaa ccagcgcgaa gacgatgtaa aaacgatgaa tgccgggaat
    42481 ggtttcaccc tgcattcgct aatcagtggt ggtgctctcc agagtgtgga accaagatag
    42541 cactcgaacg acgaagtaaa gaacgcgaaa aagcggaaaa agcagcagag aagaaacgac
    42601 gacgagagga gcagaaacag aaagataaac ttaagattcg aaaactcgcc ttaaagcccc
    42661 gcagttactg gattaaacaa gcccaacaag ccgtaaacgc cttcatcaga gaaagagacc
    42721 gcgacttacc atgtatctcg tgcggaacgc tcacgtctgc tcagtgggat gccggacatt
    42781 accggacaac tgctgcggca cctcaactcc gatttaatga acgcaatatt cacaagcaat
    42841 gcgtggtgtg caaccagcac aaaagcggaa atctcgttcc gtatcgcgtc gaactgatta
    42901 gccgcatcgg gcaggaagca gtagacgaaa tcgaatcaaa ccataaccgc catcgctgga
    42961 ctatcgaaga gtgcaaggcg atcaaggcag agtaccaaca gaaactcaaa gacctgcgaa
    43021 atagcagaag tgaggccgca tgacgttctc agtaaaaacc attccagaca tgctcgttga
    43081 agcatacgga aatcagacag aagtagcacg cagactgaaa tgtagtcgcg gtacggtcag
    43141 aaaatacgtt gatgataaag acgggaaaat gcacgccatc gtcaacgacg ttctcatggt
    43201 tcatcgcgga tggagtgaaa gagatgcgct attacgaaaa aattgatggc agcaaatacc
    43261 gaaatatttg ggtagttggc gatctgcacg gatgctacac gaacctgatg aacaaactgg
    43321 atacgattgg attcgacaac aaaaaagacc tgcttatctc ggtgggcgat ttggttgatc
    43381 gtggtgcaga gaacgttgaa tgcctggaat taatcacatt cccctggttc agagctgtac
    43441 gtggaaacca tgagcaaatg atgattgatg gcttatcaga gcgtggaaac gttaatcact
    43501 ggctgcttaa tggcggtggc tggttcttta atctcgatta cgacaaagaa attctggcta
    43561 aagctcttgc ccataaagca gatgaacttc cgttaatcat cgaactggtg agcaaagata
    43621 aaaaatatgt tatctgccac gccgattatc cctttgacga atacgagttt ggaaagccag
    43681 ttgatcatca gcaggtaatc tggaaccgcg aacgaatcag caactcacaa aacgggatcg
    43741 tgaaagaaat caaaggcgcg gacacgttca tctttggtca tacgccagca gtgaaaccac
    43801 tcaagtttgc caaccaaatg tatatcgata ccggcgcagt gttctgcgga aacctaacat
    43861 tgattcaggt acagggagaa ggcgcatgag actcgaaagc gtagctaaat ttcattcgcc
    43921 aaaaagcccg atgatgagcg actcaccacg ggccacggct tctgactctc tttccggtac
    43981 tgatgtgatg gctgctatgg ggatggcgca atcacaagcc ggattcggta tggctgcatt
    44041 ctgcggtaag cacgaactca gccagaacga caaacaaaag gctatcaact atctgatgca
    44101 atttgcacac aaggtatcgg ggaaataccg tggtgtggca aagcttgaag gaaatactaa
    44161 ggcaaaggta ctgcaagtgc tcgcaacatt cgcttatgcg gattattgcc gtagtgccgc
    44221 gacgccgggg gcaagatgca gagattgcca tggtacaggc cgtgcggttg atattgccaa
    44281 aacagagctg tgggggagag ttgtcgagaa agagtgcgga agatgcaaag gcgtcggcta
    44341 ttcaaggatg ccagcaagcg cagcatatcg cgctgtgacg atgctaatcc caaaccttac
    44401 ccaacccacc tggtcacgca ctgttaagcc gctgtatgac gctctggtgg tgcaatgcca
    44461 caaagaagag tcaatcgcag acaacatttt gaatgcggtc acacgttagc agcatgattg
    44521 ccacggatgg caacatatta acggcatgat attgacttat tgaataaaat tgggtaaatt
    44581 tgactcaacg atgggttaat tcgctcgttg tggtagtgag atgaaaagag gcggcgctta
    44641 ctaccgattc cgcctagttg gtcacttcga cgtatcgtct ggaactccaa ccatcgcagg
    44701 cagagaggtc tgcaaaatgc aatcccgaaa cagttcgcag gtaatagtta gagcctgcat
    44761 aacggtttcg ggatttttta tatctgcaca acaggtaaga gcattgagtc gataatcgtg
    44821 aagagtcggc gagcctggtt agccagtgct ctttccgttg tgctgaatta agcgaatacc
    44881 ggaagcagaa ccggatcacc aaatgcgtac aggcgtcatc gccgcccagc aacagcacaa
    44941 cccaaactga gccgtagcca ctgtctgtcc tgaattcatt agtaatagtt acgctgcggc
    45001 cttttacaca tgaccttcgt gaaagcgggt ggcaggaggt cgcgctaaca acctcctgcc
    45061 gttttgcccg tgcatatcgg tcacgaacaa atctgattac taaacacagt agcctggatt
    45121 tgttctatca gtaatcgacc ttattcctaa ttaaatagag caaatcccct tattgggggt
    45181 aagacatgaa gatgccagaa aaacatgacc tgttggccgc cattctcgcg gcaaaggaac
    45241 aaggcatcgg ggcaatcctt gcgtttgcaa tggcgtacct tcgcggcaga tataatggcg
    45301 gtgcgtttac aaaaacagta atcgacgcaa cgatgtgcgc cattatcgcc tggttcattc
    45361 gtgaccttct cgacttcgcc ggactaagta gcaatctcgc ttatataacg agcgtgttta
    45421 tcggctacat cggtactgac tcgattggtt cgcttatcaa acgcttcgct gctaaaaaag
    45481 ccggagtaga agatggtaga aatcaataat caacgtaagg cgttcctcga tatgctggcg
    45541 tggtcggagg gaactgataa cggacgtcag aaaaccagaa atcatggtta tgacgtcatt
    45601 gtaggcggag agctatttac tgattactcc gatcaccctc gcaaacttgt cacgctaaac
    45661 ccaaaactca aatcaacagg cgccggacgc taccagcttc tttcccgttg gtgggatgcc
    45721 taccgcaagc agcttggcct gaaagacttc tctccgaaaa gtcaggacgc tgtggcattg
    45781 cagcagatta aggagcgtgg cgctttacct atgattgatc gtggtgatat ccgtcaggca
    45841 atcgaccgtt gcagcaatat ctgggcttca ctgccgggcg ctggttatgg tcagttcgag
    45901 cataaggctg acagcctgat tgcaaaattc aaagaagcgg gcggaacggt cagagagatt
    45961 gatgtatgag cagagtcacc gcgattatct ccgctctggt tatctgcatc atcgtctgcc
    46021 tgtcatgggc tgttaatcat taccgtgata acgccattac ctacaaagcc cagcgcgaca
    46081 aaaatgccag agaactgaag ctggcgaacg cggcaattac tgacatgcag atgcgtcagc
    46141 gtgatgttgc tgcgctcgat gcaaaataca cgaaggagtt agctgatgct aaagctgaaa
    46201 atgatgctct gcgtgatgat gttgccgctg gtcgtcgtcg gttgcacatc aaagcagtct
    46261 gtcagtcagt gcgtgaagcc accaccgcct ccggcgtgga taatgcagcc tccccccgac
    46321 tggcagacac cgctgaacgg gattatttca ccctcagaga gaggctgatc actatgcaaa
    46381 aacaactgga aggaacccag aagtatatta atgagcagtg cagatagagt tgcccatatc
    46441 gatgggcaac tcatgcaatt attgtgagca atacacacgc gcttccagcg gagtataaat
    46501 gcctaaagta ataaaaccga gcaatccatt tacgaatgtt tgctgggttt ctgttttaac
    46561 aacattttct gcgccgccac aaattttggc tgcatcgaca gttttcttct gcccaattcc
    46621 agaaacgaag aaatgatggg tgatggtttc ctttggtgct actgctgccg gtttgttttg
    46681 aacagtaaac gtctgttgag cacatcctgt aataagcagg gccagcgcag tagcgagtag
    46741 catttttttc atggtgttat tcccgatgct ttttgaagtt cgcagaatcg tatgtgtaga
    46801 aaattaaaca aaccctaaac aatgagttga aatttcatat tgttaatatt tattaatgta
    46861 tgtcaggtgc gatgaatcgt cattgtattc ccggattaac tatgtccaca gccctgacgg
    46921 ggaacttctc tgcgggagtg tccgggaata attaaaacga tgcacacagg gtttagcgcg
    46981 tacacgtatt gcattatgcc aacgccccgg tgctgacacg gaagaaaccg gacgttatga
    47041 tttagcgtgg aaagatttgt gtagtgttct gaatgctctc agtaaatagt aatgaattat
    47101 caaaggtata gtaatatctt ttatgttcat ggatatttgt aacccatcgg aaaactcctg
    47161 ctttagcaag attttccctg tattgctgaa atgtgatttc tcttgatttc aacctatcat
    47221 aggacgtttc tataagatgc gtgtttcttg agaatttaac atttacaacc tttttaagtc
    47281 cttttattaa cacggtgtta tcgttttcta acacgatgtg aatattatct gtggctagat
    47341 agtaaatata atgtgagacg ttgtgacgtt ttagttcaga ataaaacaat tcacagtcta
    47401 aatcttttcg cacttgatcg aatatttctt taaaaatggc aacctgagcc attggtaaaa
    47461 ccttccatgt gatacgaggg cgcgtagttt gcattatcgt ttttatcgtt tcaatctggt
    47521 ctgacctcct tgtgttttgt tgatgattta tgtcaaatat taggaatgtt ttcacttaat
    47581 agtattggtt gcgtaacaaa gtgcggtcct gctggcattc tggagggaaa tacaaccgac
    47641 agatgtatgt aaggccaacg tgctcaaatc ttcatacaga aagatttgaa gtaatatttt
    47701 aaccgctaga tgaagagcaa gcgcatggag cgacaaaatg aataaagaac aatctgctga
    47761 tgatccctcc gtggatctga ttcgtgtaaa aaatatgctt aatagcacca tttctatgag
    47821 ttaccctgat gttgtaattg catgtataga acataaggtg tctctggaag cattcagagc
    47881 aattgaggca gcgttggtga agcacgataa taatatgaag gattattccc tggtggttga
    47941 ctgatcacca taactgctaa tcattcaaac tatttagtct gtgacagagc caacacgcag
    48001 tctgtcactg tcaggaaagt ggtaaaactg caactcaatt actgcaatgc cctcgtaatt
    48061 aagtgaattt acaatatcgt cctgttcgga gggaagaacg cgggatgttc attcttcatc
    48121 acttttaatt gatgtatatg ctctcttttc tgacgttagt ctccgacggc aggcttcaat
    48181 gacccaggct gagaaattcc cggacccttt ttgctcaaga gcgatgttaa tttgttcaat
    48241 catttggtta ggaaagcgga tgttgcgggt tgttgttctg cgggttctgt tcttcgttga
    48301 catgaggttg ccccgtattc agtgtcgctg atttgtattg tctgaagttg tttttacgtt
    48361 aagttgatgc agatcaatta atacgatacc tgcgtcataa ttgattattt gacgtggttt
    48421 gatggcctcc acgcacgttg tgatatgtag atgataatca ttatcacttt acgggtcctt
    48481 tccggtgatc cgacaggtta cg
//
LOCUS       MUSIGKCLO     422 bp ss-mRNA            ROD       29-JUN-1990
DEFINITION  Mouse IgMk rearranged kappa light-chain mRNA variable region
            (V-J-kappa) anti-DNA autoantibody.
ACCESSION   M20834
KEYWORDS    autoantibody; immunoglobulin light chain; kappa-immunoglobulin;
            kappa-immunoglobulin light chain; variable region.
SOURCE      Mouse (strain MRL/Mp -lpr/lpr) spleen hybridoma cell line
            MRL-DNA22, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 422)
  AUTHORS   Kofler,R., Strohal,R., Balderas,R.S., Johnson,M.E., Noonan,D.J.,
            Duchosal,M.A., Dixon,F.J. and Theofilopoulos,A.N.
  TITLE     Immunoglobulin kappa light chain variable region gene complex
            organization and immunoglobulin genes encoding anti-DNA
            autoantibodies in lupus mice
  JOURNAL   J. Clin. Invest. 82, 852-860 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            R. Kofler 28-JUL-1988
FEATURES       from  to/span     description
    pept         33  >   422     IgMk light chain precursor
    sigp         33       98     Ig light chain signal peptide
    matp         99  >   422     Ig light chain
    mRNA          1  >   422     IgMk mRNA
    recomb      386      387     V-region end/J-region start
BASE COUNT      112 a    109 c     95 g    106 t
ORIGIN      Chromosome 6.
        1 taattagcta gggaccaaaa ttcaaagaga aaatggattt tcaggtgcag attttcagct
       61 tcctgctaat cagtgcctca gtcataatgt ccagaggaca aattgttctc acccagtctc
      121 cagcaatcat gtctgcatct cctggggaac gggtcaccat gacctgcagt gccagctcaa
      181 gtgtaagttc cagctacttg tactggtacc agcagaagcc aggatcctcc cccaaactat
      241 ggatttatag cacatccaac ctggcttctg gagtccctgc tcgcttcagt ggcagtgggt
      301 ctgggacctc ttattctctc acaatcagca gcatggaggc tgaagatgct gccacttatt
      361 actgccagca gtacagtggt tacccattca cgttcggcac ggggacaaaa ttggaaataa
      421 aa
//
LOCUS       MUSIGKCLY     312 bp ss-mRNA            ROD       29-JUN-1990
DEFINITION  Mouse Ig unproductively rearranged kappa-chain VJ2C mRNA from
            plasmacytoma BFPC 61B, partial cds.
ACCESSION   M12189
KEYWORDS    immunoglobulin; immunoglobulin light chain; kappa-immunoglobulin;
            variable region.
SOURCE      Mouse plasmacytomas, cDNA to mRNA, clone BFPC 61B.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Bernard,O., Gough,N.M. and Adams,J.M.
  TITLE     Plasmacytomas with more than one immunoglobulin kappa mRNA:
            Implications for allelic exclusion
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 5812-5816 (1981)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept.ps <     1  >   312     Ig kappa-chain VJC-regions (AA at 3)
    recomb      245      246     V-region end/J2-region start
    recomb      285      286     J2-region end/C-region start
BASE COUNT       75 a     88 c     82 g     67 t
ORIGIN      
        1 ctgcaccagg ggagaaggtc acaatgactt gcagggccag ctcaagtgta agttacatgg
       61 actggtacca gcagaagcca ggatcctccc ccaaaccctg gatttatgcc acatccaccc
      121 tagtttctgg agttcctgct cgcttcagtg gcagtgggtc tgggacctct tactctctca
      181 caatcagcag agtggaggct gaagatgctg ccacttatta gtgcccgcag tggagtagta
      241 acccacccga cgttcggtgg aggcaccaag ctggaaatca aacgggctga tgctgcacca
      301 actgtatcca tc
//
LOCUS       MUSTCBXZA     396 bp ss-mRNA            ROD       29-JUN-1990
DEFINITION  Mouse T-cell receptor active beta-chain V-region V11DJ mRNA.
ACCESSION   M13676
KEYWORDS    T-cell antigen receptor; T-cell receptor;
            T-cell receptor beta-chain; antigen receptor; diversity exon;
            joining exon; processed gene; variable region.
SOURCE      Mouse (C57BL/6) T-helper hybridoma, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 390)
  AUTHORS   Behlke,M.A., Spinella,D.G., Chou,H.S., Sha,W., Hartl,D.L. and
            Loh,D.Y.
  TITLE     T-cell receptor beta-chain expression: Dependence on relatively few
            variable region genes
  JOURNAL   Science 229, 566-570 (1985)
  STANDARD  full staff_review
REFERENCE   2  (bases 391 to 396)
  AUTHORS   Behlke,M.A.
  JOURNAL   Unpublished (1985) Howard Hughes Med Inst, St.Louis, MO 63110.
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of the sequence in [2] were
            kindly provided by M.A.Behlke, 25-OCT-1985.
FEATURES       from  to/span     description
    pept          1  >   396     T-cell receptor beta-chain precursor (V11DJ)
    sigp          1       57     T-cell receptor beta-chain signal peptide
    matp         58  >   396     T-cell receptor beta-chain
    recomb      342      343     V11-region end/D-region start
    recomb      345      346     D-region end/N-region start
    recomb      348      349     N-region end/J-region start
    recomb      390      391     J-region end/C-region start
BASE COUNT      113 a     96 c     81 g    106 t
ORIGIN      Chromosome 6.
        1 atgagctgca ggcttctcct ctatgtttcc ctatgtcttg tggaaacagc actcatgaac
       61 actaaaatta ctcagtcacc aagatatcta atcctgggaa gaacaaataa gtctttggaa
      121 tgtgagcaac atctgggaca taatgctatg tactggtata aacagagcgc tgagaagccg
      181 ccagagctca tgtttctcta caatcttaaa cagttgattc gaaatgagac ggtgcccagt
      241 cgttttatac ctgaatgccc agacagctcc aagctacttt tacatatatc tgccgtggat
      301 ccagaagact cagctgtcta tttttgtgcc agcagccaga taactagtaa ccaagacacc
      361 cagtactttg ggccaggcac tcggctcctc gtgtta
//
LOCUS       P18XIS911     115 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  Plasmid pACYC184/insertion element 91 (IS91), left junction.
ACCESSION   K03543
KEYWORDS    insertion sequence.
SEGMENT     1 of 2
SOURCE      Plasmid pACYC184 and insertion element 91 DNA, clone pSU234.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 115)
  AUTHORS   Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F.
  TITLE     Transposition of IS91 does not generate a target duplication
  JOURNAL   J. Bacteriol. 169, 442-443 (1987)
  STANDARD  full staff_review
COMMENT     No duplication of an insertion target sequence seems to take place
            upon the insertion of IS91.
FEATURES       from  to/span     description
    recomb       16       17     pBR322 DNA end/IS91 DNA start
    rpt          16       24     IS91 inverted terminal repeat
BASE COUNT       26 a     26 c     29 g     34 t
ORIGIN      16 bp upstream of TaqI site.
        1 ttaacgaccc tgccctcgag taggcagcct ggcggctgcg gcttgtcatg gtctggaatt
       61 accgttataa aaaaagataa tgtcattgtc tttcaggtag ttatatggcc cgttc
//
LOCUS       P18XIS912     115 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  Plasmid pACYC184/insertion element 91 (IS91), right junction.
ACCESSION   M15256
KEYWORDS    insertion sequence.
SEGMENT     2 of 2
SOURCE      Plasmid pACYC184 and insertion element 91 DNA, clone pSU234.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 115)
  AUTHORS   Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F.
  TITLE     Transposition of IS91 does not generate a target duplication
  JOURNAL   J. Bacteriol. 169, 442-443 (1987)
  STANDARD  full staff_review
COMMENT     No duplication of an insertion target sequence seems to take place
            upon the insertion of IS91.
FEATURES       from  to/span     description
    rpt          93      100     IS91 inverted terminal repeat
    recomb      101      102     IS91 DNA end/pACYC184 DNA start
BASE COUNT       25 a     20 c     29 g     41 t
ORIGIN      About 1.85 kb after segment 1; 97 bp upstream of TaqI site.
        1 tttaatagtc gtttccgctc atatggtgca caaggggtgt tgaagaaaca tccgttttgt
       61 ggtgcttttt tagtcttttg gggatttaaa ttcctatcga tgaaccgacg accgg
//
LOCUS       PBR322IS2     115 bp ds-DNA             SYN       29-JUN-1990
DEFINITION  Plasmid pBR322/insertion element 91 (IS91), right junction.
ACCESSION   M15257
KEYWORDS    insertion sequence.
SEGMENT     2 of 2
SOURCE      Plasmid pBR322 and insertion element 91 DNA, clone pSU240.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 115)
  AUTHORS   Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F.
  TITLE     Transposition of IS91 does not generate a target duplication
  JOURNAL   J. Bacteriol. 169, 442-443 (1987)
  STANDARD  full staff_review
COMMENT     No duplication of an insertion target sequence seems to take place
            upon the insertion of IS91.
FEATURES       from  to/span     description
    rpt          93      100     IS91 inverted terminal repeat
    recomb      101      102     IS91 DNA end/pBR322 DNA start
BASE COUNT       25 a     19 c     27 g     44 t
ORIGIN      About 1.85 kb after segment 1; 97 bp upstream of TaqI site.
        1 tttaatagtc gtttccgctc atatggtgca caaggggtgt tgaagaaaca tccgttttgt
       61 ggtgcttttt tagtcttttg gggatttaaa ttcctatcga tcaaggatct taccg
//
LOCUS       RATMYCSE      903 bp ds-DNA             ROD       29-JUN-1990
DEFINITION  Rat c-myc/Ig epsilon translocation DNA.
ACCESSION   M23417
KEYWORDS    epsilon immunoglobulin; switch region; translocation.
SOURCE      Rat (strain LOU) immunocytoma cell line IR162 DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 903)
  AUTHORS   Tian,S.-S. and Faust,C.
  TITLE     A simple and precise aberrant translocation of the rat c-myc gene
            into the epsilon-heavy chain switch region of the IgE-producing
            immunocytoma, IR162
  JOURNAL   J. Biol. Chem. 264, 1846-1853 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb      298      299     Ig epsilon switch region DNA end/c-myc DNA
                                 start
    signal      617      621     TATA box
    signal      777      781     TATA box
BASE COUNT      212 a    250 c    238 g    203 t
ORIGIN      
        1 gggcccagtt cagctctgtg ccaggccccc agaaagcata ggatatttct gataaacctg
       61 attgtctcta cctaccctga tatagtctgg ctactatatc aggacaatct tcttggagaa
      121 cagttaaaaa catccataaa aaatttcgag caatctattt ttttttaaat taatttattt
      181 tttggatttt ttttaaactt acatttcaaa tgttatcact tttcccccac ccaaccaccc
      241 acccctttcc acctctctgt cctgacattc ccctacactg gtgtgtgtgt ggggtccaga
      301 ctaatacacg caggacaaga acacagttca gccgagcgct gcgcccgaac aactgtacag
      361 aaagggaaag gaccgacgcg cgagcaggag aaaatggtcg ggcgcgcagt taattcatgc
      421 tgcgctatta ctgtttacac cccgagtcgg agtactgggc tgcgggactg aggctcctcc
      481 tctttccccg gctccccact agtgcccctc ccgagttccc aaagcagagg gcgggggagc
      541 gagaggagga aaaaaataga ggtcgggagg ggagggagaa agagattctc tggctaatcc
      601 ccgcccaccc gccctttata atccgggggt ctgcgcggct aaggaccccc gggctgcgct
      661 gctctccgct gccgcctccg ccgcgcccac tccgctcgcc tcctgcctcc aaaagggcag
      721 ggcttcgccg aggcttggcg ggaaaaagaa gcgaggggag ggatccggag tcgcagtata
      781 aaagaagctt ttcgggcgtt ttttttctga ctcgctgtag taattccagc gagagacaga
      841 gggagtgagc gggcgggttg gaagagccca gtgtgcagag ccccactccg ggcttcctag
      901 gaa
//
LOCUS       RATRAFB      2719 bp ss-mRNA            ROD       29-JUN-1990
DEFINITION  Rat activated c-raf oncogene mRNA encoding a fused protein,
            complete cds.
ACCESSION   M15428
KEYWORDS    fusion protein; oncogene; protein kinase; raf oncogene.
SOURCE      Rat primary NIH 3T3 cell transformant IQ7-2, cDNA to mRNA, clone
            pA5.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2719)
  AUTHORS   Ishikawa,F., Takaku,F., Nagao,M. and Sugimura,T.
  TITLE     Rat c-raf oncogene activation by a rearrangement that produces a
            fused protein
  JOURNAL   Mol. Cell. Biol. 7, 1226-1232 (1987)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by F.Ishikawa, 29-MAY-1987.
            
            The activated rat c-raf gene (see below) is transcribed to produce
            a fused mRNA, in which the 5' half of the sequence is replaced by
            an unknown rat sequence.
            
            The normal and activated c-raf were each connected to the long
            terminal repeat of Rous sarcoma virus and transfected into NIH 3T3
            cells.  Only the activated form had transforming activity.
FEATURES       from  to/span     description
    pept        206     2014     raf fusion protein
    mRNA    <     1  >  2525     raf mRNA
    recomb      898      899     rat DNA of unknown origin end/raf DNA start
    site       1032     1032     homology to v-raf start
BASE COUNT      762 a    644 c    698 g    615 t
ORIGIN      123 bp upstream of BglI site.
        1 gccgtgcgca tagaggccgg tgcgcggccc ttgctcgttt aacgcgggac tatatttccc
       61 agggtccgtc gcgggagtct ccggcgggca ggcgcgcggg agactgcgag cgaggcgccg
      121 acggggcggc tcaggcgtct gggtccgcgc atctccttgc tccttcgctt ctccttcagc
      181 cgctgctgcc acgaccccgg ccgacatggc ggcggtgttg cagcaagtgc tggagcgccc
      241 ggagctgaac aagctgccta agtcgaccca gaacaaactt gagaagttcc tggctgaaca
      301 gcagtccgaa atcgactgcc tgaaggggcg gcacgagaaa tttaaggtgg agagtgagca
      361 acaatacttt gagatagaga agagactatc ccagagtcag gagaggcttg ttaatgaaac
      421 ccgggagtgt cagaacttga ggctggagct tgagaagcta aataaccaag taaaagtatt
      481 aactgagaaa aacaaagaac ttgaaactgc tcaagaccgc aatctaggca ttcagagcca
      541 gtttacaaga gcaaaggaag agttagaagc tgaaaaaaga gatttaatca gaaccaatga
      601 gaggttatct caggaagttg aatatttaac agaggatgtt aaacgtctaa acgaaaaact
      661 taaagaaagc aatacaacga agggtgaact tcagttaaag ctggatgaac ttcaagcttc
      721 tgatgtcact gtgaagtacc gagaaaaacg cttagaacaa gaaaaggaat tgctacacaa
      781 tcaaaattca tggctaaaca cagagttgaa aaccaaaact gatgagctat tggctctagg
      841 aagagaaaag ggaaatgaaa ttctggaact taagtgtact cttgaaaaca aaaaggaaga
      901 ggatgcaatt cgaagtcaca gtgaatcagc ctcaccttca gccctgtcca gcagccccaa
      961 caacctgagc ccaacaggct ggtcacagcc caaaacccct gtgccagcac aaagagagag
     1021 ggcgccagga tctgggaccc aggaaaaaaa caaaattagg cctcgtgggc agagagattc
     1081 aagttattac tgggaaatag aagccagtga ggtgatgctg tctactcgga ttggctcggg
     1141 ctcctttggc actgtgtaca agggcaagtg gcatggagat gttgcagtaa agatcctaaa
     1201 ggtggttgac ccaactccag agcaacttca ggccttcagg aacgaggtgg ctgttttgcg
     1261 caaaacacgg catgttaata tcctgctgtt catggggtac atgacaaagg acaacctggc
     1321 gattgtgacc cagtggtgtg aaggcagcag tctctacaaa cacctgcatg tccaggagac
     1381 caaattccag atgttccagc taattgacat tgcccggcag acagctcagg gaatggacta
     1441 tttacatgca aagaacatca tccacagaga catgaaatcc aacaatatat ttctccatga
     1501 aggcctcacg gtgaaaatcg gagattttgg tttggcaaca gtgaagtcgc gctggagtgg
     1561 ttctcagcag gttgaacagc ccactggctc tgtgctgtgg atggccccag aagtaatccg
     1621 aatgcaggat aacaacccgt tcagcttcca gtccgatgtc tactcctatg gcattgtgct
     1681 gtatgagctg atgactgggg agcttcccta ctcccacatc aacaaccgag accagatcat
     1741 cttcatggtg ggccgtgggt acgcctcccc agatcttagc aggctctaca agaactgccc
     1801 caaggcaatg aagaggttgg tggctgactg tgtgaagaaa gtcaaagaag aaaggccttt
     1861 gtttcctcag atcctgtctt ccattgagct gcttcagcac tctctgccga aaatcaacag
     1921 gagcgcctct gagccttccc tgcatcgggc agctcacact gaggacatca atgcttgtac
     1981 gctgaccaca tccccaaggc taccagtctt ctagctgacg ttatagctgt tcttaggcca
     2041 ccaggggacg aagaagagtc agcaggcacc actttctgtt tccttggggg cagaatgcat
     2101 gtttccggaa aagctgctgc taaggaccta gactactcac agggccttaa cttcatattg
     2161 ccttcttttc tacccttcct gccctggaaa tggaagctgt ccgccaagcc agcctgctcc
     2221 agaggtatac aagtcagcga gtatttttag ggcaaatggc cttggagaga gaaggcaggg
     2281 cactccggct actgcaggga catgcagttg ggaacttggc tcattgagct gtacagacag
     2341 tggtgcagtg ccagttttgc acatggagtc ctggccacct gggggagcct gctttggtac
     2401 tacagaactt cactttgtgg acacaccttc ctcttactga gtctaagatg tcctgtgcag
     2461 aggatgcttt ccaagcacgg tgctccacct tctggcagcc tcccacacgc tgaatctgtc
     2521 ttccaggagc tgccctatgg ggtgctgcag cccagcccta tctctatagt cacatccttg
     2581 tctgtaagaa agccaggaat acaggttttc ttaatgattt tgggttttaa ttttgttttt
     2641 attgagcctg ataaaataca gttatctgat ggttcctcaa ttatgttatt ttaataaaat
     2701 aaattaaatt taaaaaaaa
//
LOCUS       YSCINV         68 bp ss-mRNA            SYN       29-JUN-1990
DEFINITION  Yeast (S.cerevisiae) synthetic invertase signal peptide region
            fused to human interferon gene.
ACCESSION   M13346
KEYWORDS    gene fusion; invertase signal peptide; signal peptide.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 68)
  AUTHORS   Chang,C.N., Matteucci,M., Perry,L.J., Wulf,J.J., Chen,C.Y. and
            Hitzeman,R.A.
  TITLE     Saccharomyces cerevisiae secretes and correctly processes human
            interferon hybrid proteins containing yeast invertase signal
            peptides
  JOURNAL   Mol. Cell. Biol. 6, 1812-1819 (1986)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept         12  >    68     yeast invertase signal/human interferon fusion
                                 peptide
    sigp         12       68     invertase signal peptide
    recomb        8        9     EcoRI sequence end/yeast invertase signal
                                 start
BASE COUNT       13 a     16 c     12 g     27 t
ORIGIN      3 bp upstream of EcoRI site.
        1 ccgaattcat gatgcttttg caagctttcc ttttcctttt ggctggtttt gcagccaaaa
       61 tatctgca
//
LOCUS       ADGLS        1477 bp ds-DNA             VRL       29-JUN-1990
DEFINITION  Adenovirus type 7 (simian) left end of genome: 0.0-4.2 map units.
ACCESSION   K03325
KEYWORDS    alternate splicing; terminal repeat.
SOURCE      Adenovirus type 7 (simian) DNA.
  ORGANISM  Mastadenovirus s7
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 1477)
  AUTHORS   Kimelman,D., Miller,J.S., Porter,D. and Roberts,B.E.
  TITLE     E1a regions of the human adenoviruses and of the highly oncogenic
            simian adenovirus 7 are closely related
  JOURNAL   J. Virol. 53, 399-409 (1985)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by D.Kimelman (15-AUG-1985).
            
            The mRNA transcripts (mRNAs I and II) were determined by mRNA-DNA
            hybridization and S1 nuclease digestion.  The peptide coding
            regions were deduced by analogy to other adenoviruses and following
            the most likely mode of translation.  The intron of the mRNA III
            transcribed from the E1a of other adenoviruses has not yet been
            determined by S1 mapping.
            
            Enhancer sequences are located at positions 165-173 and 265-270.  A
            TATA box is located at position 408-413.  A polyadenylation signal
            is located at posityion 1447-1452.
FEATURES       from  to/span     description
    pept        486      837     E1a 28 K protein from mRNA I, exon 1
               1178     1407     E1a 28 K protein from mRNA I, exon 2
    pept        486     1056     E1a 24 K protein from mRNA II, exon 1
               1178     1407     E1a 24 K protein from mRNA II, exon 2
    pre-msg     439     1477     E1a mRNA
    IVS         838     1177     E1a 24K pept intron A
    IVS        1057     1177     E1a 28K pept intron A
BASE COUNT      331 a    334 c    442 g    370 t
ORIGIN      5' end of the l strand of the genome.
        1 tctatataat ataccttatt tgggaacggt gccaatatgc taatgaggtg ggcggagttt
       61 ggtgacgtat gcggaaatgg gcggagttag gggcggggtt tggcggtagg cgtggctggg
      121 ggagtgtccg ggcgtgggaa cggaagtgac gtagggggcg cgccggaggt gacgtcgtgt
      181 ggggagtttt aaaccggaag caaggtattt taaacgcttg caagcgcaat tttgtcggtt
      241 ttggcgcgaa aactgataaa aagcggaagt tcggttaatc attaattttt acgataggga
      301 ggaatattta ccgagggccg gtgaactttg agcggtgacg cggtggtttc gttacgtggc
      361 accaccacgc gactgctcaa agtccccgtt tattgtctag gtgagggtat ttaaaccggc
      421 tcagaacgtc aagaggccac tcttgagtgc ccgcgagaag agctttctcc tctttcgctg
      481 cgaaaatgag acacttggcg ttggaaatga tttctgaact gctggattta ggactggata
      541 ccattgatgg ctggctgcac accgaatttc ggccggtacc ggcgggggtg agtcataaca
      601 tgtcgctgca cgaaatgtac gacctggacg ttaccggcca ggaggatgag aacgaagagg
      661 cggtagatgg tgttttttcc gatgcgatgc tcctggccgc ggaggaggga atagaaatgc
      721 ctaatcttta ttctccggga cctctggttg ggggaggtga aatgcctgaa cttcagcctg
      781 aggaggaaga tcttttctgc tacgaagatg gcttccctcc cagtgactct gaggaaggtg
      841 agcattcgca ggtggagaca gaacgtaaaa tggcggaggc ggcggcagca ggtgcggcgg
      901 cggccgcgcg gcgggagcaa gatgactttc gcttagactg tcctagcgta cctggccatg
      961 gctgtagctc ctgtgactac catcgcaaaa ctagcggctg tcctgaaatt ctgtgctcgc
     1021 tgtgctatct gagggctaac agcatgttta tttatagtaa gtaaattttt tctactaact
     1081 ttctcgttgt gtgtttgctc gctcgctcgc tcgcaactgc tggggtgctt gctgttggga
     1141 ctgagcttac aggtattttc tctgtaattt tccataggtc cagtttctga ctctgagcca
     1201 gacgagcccg actccacaac agctgattca aatcatggca gcccgccaac ccttcgctgc
     1261 accccaccca gggacttgcc gcgacctgtg ccagtgaagg cctctcctgg caagcgccca
     1321 gcggtgaaca gcttgcatga cctcatagag gaggttgaac aaacagtacc tttggacctg
     1381 tccctaaagc gctctaggag caattagggt tataaaaccc ctccccttcc ccttaagtta
     1441 tcaggaaata aaaagattaa ctggattctt tgtgcct
//
LOCUS       ADGRS         116 bp ds-DNA             VRL       29-JUN-1990
DEFINITION  adenovirus type 7 (simian) right end of genome.
ACCESSION   V00035 J01991
KEYWORDS    terminal repeat.
SOURCE      adenovirus type 7 (simian) dna.
  ORGANISM  Mastadenovirus s7
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 116)
  AUTHORS   Broker,T.R.
  TITLE     terminal nucleotide sequences and restriction endonuclease cleavage
            maps of simian and murine adenoviruses
  JOURNAL   (in) Tooze,J. (Ed.);
            DNA Tumor Viruses, Second Edition Revised: 1037-1046;
            Cold Spring Harbor Laboratory, Cold Spring Harbor (1981).
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 116)
  AUTHORS   Tolun,A., Alestroem,P. and Pettersson,U.
  TITLE     sequence of inverted terminal repetitions from different
            adenoviruses: demonstration of conserved sequences and homology
            between sa7 termini and sv40 dna
  JOURNAL   Cell 17, 705-713 (1979)
  STANDARD  simple staff_review
COMMENT     L-strand is given, from 5' to 3'.
BASE COUNT       25 a     15 c     46 g     30 t
ORIGIN      
        1 atcaataata taccttattt gggaacggtg ccaatatgct aatgaggtgg gcggagtttg
       61 gtgacgtatg cggaaatggg cggagttagg ggcggggttt ggcggtaggc gtggct
//
LOCUS       ADGSA7PE1    2338 bp ds-DNA             VRL       29-JUN-1990
DEFINITION  Simian adenovirus SA7P, leftmost 2338 bp.
ACCESSION   X01027
KEYWORDS    E1A protein; E1b protein; inverted terminal repeat;
            overlapping genes; transforming gene.
SOURCE      Simian adenovirus SA 7P.
  ORGANISM  Mastadenovirus s7
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 2338; enum. 1 to 2338)
  AUTHORS   Dekker,B.M.M., Konings,D.A.M., Denisova,T.S., Gibadulin,R.A. and
            Van Ormondt,H.
  TITLE     The nucleotide sequence of the leftmost XhoI fragment (6%) of
            simian adenovirus SA7P
  JOURNAL   J. Gen. Virol. 65, 1699-1708 (1984)
  STANDARD  simple automatic
COMMENT     **map: leftmost XhoI fragment (6%);
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE          1    183       inverted terminal repeat (ITR)
                                            of SA7P
               PRM         409    416       TATA E1A region
               CAP         440    440       cap site E1A RNAs
               MSG         440    838       "12S" E1A RNA
               MSG         440    838       1st exon "12S" E1A RNA
               IVS         839   1175       intron "12S" E1A RNA
               MSG        1176  >1445       2nd exon "12S" E1A RNA
            
               MSG         440   1057       1st exon "13S" E1A RNA
               IVS        1058   1175       intron "13S" E1A RNA
               MSG        1176  >1445       2nd exon "13S" E1A RNA
            
               SITE       1445   1450       AATAAA E1A
               PRM        1508   1514       TATA E1B
               CAP        1538   1538       cap E1B RNA's
            
            This virus, an independent isolate from Moscow, USSR, is highly
            related to simian adenovirus 7 (SA7).  The E1A sequences are 95%
            homologous to those of SA7 see $<K03322>$ J. VVrol. 5:399-409(1985)
FEATURES       from  to/span     description
    pept        487      838     E1A 21K protein (part 1)
               1176  /  1401     E1A 21K protein (part 2)
    pept        487     1057     E1A 29K protein (part 1)
               1176  /  1401     E1A 29K protein (part 2)
    pept       1569     2111     pot. 21K E1B protein
    pept       1874  >  2338     pot. 55K E1B protein
BASE COUNT      514 a    535 c    725 g    564 t
ORIGIN      
        1 catcatcaat aatatacctt atttgggaac ggtgccaata tgctaatgag gtgggcggag
       61 tttggtgacg tatgcggaag tgggcggagc aaggggcggg gcgagaggcg gggcttttgg
      121 taggcgtggc cgggcgtggg aacggaagtg acgtcggggg cgcgccggac gtgacgtgtt
      181 ttgaggcgtt ttaaaccgga agcaaggtat tttaaacgct tgcagcgcaa ttttgccggt
      241 tttggcgcga aaactgataa aaagcggaag ttcggttaat cattaatttt tacgataggg
      301 aggaatattt accgagggcc ggtgaacttt gagcgatgac gcggtggttt cgttacgtgg
      361 caccaccacg cgactgctca aagtccccgt ttattgtcta ggtgagggta tttaaaccgg
      421 ctcagaccgt caagaggcca ctcttgagtg cccgcgagaa gagctttctc ctctttcgct
      481 gcgaaaatga gacacttggc gttggagatg atgtctgaac tgctggattt aggactggat
      541 accatcgata gctggctgca caccgaattc gcgccggtac cgacgggggt gagtcataac
      601 atgtcgctgc acgaaatgta cgacctggac gttaccggcc aggaggatga gaacgaagag
      661 gcggtagacg gtgttttttc cgatgcgatg ctcctggccg ctgaagaggg agtagaaatg
      721 cctagtcttt attctccggg acctctggtt gggggaggtg agatgcctga gctacagcct
      781 gaggaggtag atcttttctg ctacgaagat ggcttccctc ccagtgactc tgaggaaggt
      841 gagcattcgc aggtggagac ggaacgtaaa atggcggagg tggcggcggc aggtgcggcg
      901 gcggccgtcc gcggggagca agatgacttt cgcttagact gtcctagcgt acctggccat
      961 ggctgtagct cctgtgacta ccatcgcaaa aatagcggct gtcctgaaat tctgtgctcg
     1021 ctgtgctatc tgagggctaa cagcatgttt atttatagta agtgaatttt tctactaact
     1081 ttctcgctgt gtgtttgctc gctcgttcgc taactgctgg ggtgcttggt gttgggactg
     1141 agcttacagg tatttttctc tgtaattttc cacaggtcca gtttctgact ctgagccaga
     1201 cgagcctgac tccacaacag ctgattcaaa tcatggcagc ccgccaaccc ttcgctgcac
     1261 cccacccagg gacttgccgc gacctgtgcc agtgaaggcc tctcatggca agcgcccagc
     1321 ggtgaacagc ttgcatgacc tcatagagga ggttgaacaa acagtacctt tggacctgtc
     1381 cctaaagcgc tctaggagca attagggtca taaaacccct ccccttcccc ttaagttata
     1441 aggaaataaa aagattaact ggattctttg tgcctgcttt gtttgttcgt gggcggtcct
     1501 tggggattat aaaaggggtg agtcaggtta taagggtact taggcaacgc tcaggtaaca
     1561 tcgcctccat ggatctccga acggcgcttc agacttttga gagcacccgc cgcttgctgg
     1621 agctctgttc caatagaacc tcttttttgt ggaggtggtt atttggaact ccgctcagcc
     1681 ggctggttag gcaggtgaaa ttagaatacg agaaggattt tgaaagaatt ttagatcagt
     1741 gtcccggggt gtttgagtcc ctggagctgg gctatcataa ggtttttgag gagaagattg
     1801 taaaggagtt ggatttttct tctcccggtc gggcggtcgc ggctgtggcc tttgcttcct
     1861 acctgctgga tagatggaac acccggaccc acctgtcccc ggggtaccag atggattaca
     1921 tcagcctgaa cctgtggaag ttttggttgc gccggcgggt ttacaattac tcgcgggggc
     1981 tgcctcagct agggccggcg gcgccgctgg cgaggcaggg gtcgcagcag gaggagcagc
     2041 agcagcggca ggaggaggag caggtgcagg aggagatgag gtccggcctg gaccctccaa
     2101 cggagaactg atggcgcagg tggccggcgc ggcggatgcc agcgggccgc cttccaagcg
     2161 ccctaagaat gaggcagagc agatgcagga gggtttaact cagttgaccc tgagtctcat
     2221 taacaagtat cggcctgaga ccgtgtactg gaatgaactg gagcaggaat ttcagcatgg
     2281 ggagatgcac ttgcagttcc actttagctt tgagcaatta aaaactcact ggctcgag
//
LOCUS       ADGVARNA      850 bp ds-DNA             VRL       29-JUN-1990
DEFINITION  Simian adenovirus type 7 (SA7) virus-associated (VA) RNA gene.
ACCESSION   M14082
KEYWORDS    virus-associated RNA.
SOURCE      Simian adenovirus type 7 DNA.
  ORGANISM  Mastadenovirus s7
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 850)
  AUTHORS   Larsson,S., Svensson,C. and Akusjaervi,G.
  TITLE     Characterization of a low-molecular-weight virus-associated (VA)
            RNA encoded by simian adenovirus type 7 which functionally can
            substitute for adenovirus type 5 VA RNA-1
  JOURNAL   J. Virol. 60, 635-644 (1986)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by S.Larsson, 15-JAN-1987.
            
            Simian adenovirus type 7 VA RNA stimulates protein synthesis in a
            transient expression assay and can functionally substitute for VA
            RNA-I during lytic growth of human Ad5.  The SA7 genome encodes
            only one VA RNA species.
FEATURES       from  to/span     description
    RNA         181      352     virus-associated RNA (5' and 3' ends +/-2bp)
    ORF         105  <     1 (c) ORF A cds
    ORF         366  >   850     ORF B cds
BASE COUNT      140 a    260 c    338 g    112 t
ORIGIN      150 bp upstream of XbaI site; 30 mu on viral map.
        1 gtagatgtag cgcgacatcc aggtgaggcc ggcggcggtg gtggcggcgc gggtgaactc
       61 gcggacgcgg ttccagaggt tgcgcagggg ggcgaagcgt tgcatggtgg tcacgctctg
      121 gccggtgaga cgggcgcagt cctgcacgct ctagacggaa acagagagag gcgtcaccga
      181 ctcctctccg tagcttgggg gttaggtcgc aagggtgcgg tggcggggaa ccccggttcg
      241 aaaccggccg gatccgccac tcccgacgcg gcggccccgc gtccacgacc ccaccagagg
      301 ccgagaccca gccgcgacgc gcgcacccca aatacggagg ggagtctttt ggtgcttttt
      361 tgtagatgca tcccgtgctg cggcagatgc gccccacgcg gggggcctcc tcgcgggaag
      421 aagagctgga ggcgcaggag gaggcgcgcg gtggcgccgc gggccccgcc gagctggagc
      481 tggaggaggg cgaaggcttg gcgcggctgg gcgccggagt gccggagcgc cacccgcggg
      541 tgcaactggc tcgcgaccag cgccaggcct acgtgccgcc gcagaacctg tttagggacc
      601 ggagcgggga ggaggcggag gaaatgcgtg actgccgttt tcgggccggg cgggagctgc
      661 gggcgggctt ggagcgcgag cggctgctgc ggcccgagga ctttgagccc gaggagcgca
      721 acggtatcag tccggcgcgc gcgcacgtct cggccgccaa cctggtgacg gcctacgagc
      781 agacggtgaa cgaggagcgc aactttcaaa agagcttcaa caaccacgtg cgcaccctcg
      841 tggccccggg
//
LOCUS       ADZITRL       150 bp ds-DNA             VRL       29-JUN-1990
DEFINITION  adenovirus 31 inverted terminal repeat & repl ori left end.
ACCESSION   J01958
KEYWORDS    origin of replication; terminal repeat.
SOURCE      ad31 (adenovirus 31).
  ORGANISM  Mastadenovirus 31
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 150)
  AUTHORS   Stillman,B.W., Topp,W.C. and Engler,J.A.
  TITLE     conserved sequences at the origin of adenovirus dna replication
  JOURNAL   J. Virol. 44, 530-537 (1982)
  STANDARD  simple staff_review
BASE COUNT       38 a     29 c     48 g     35 t
ORIGIN      
        1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag
       61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt
      121 gtgggagccg ggcgcgccgg atgtgacgcg
//
LOCUS       ADZITRR       150 bp ds-DNA             VRL       29-JUN-1990
DEFINITION  adenovirus 31 inverted terminal repeat & repl ori right end.
ACCESSION   J01959
KEYWORDS    origin of replication; terminal repeat.
SOURCE      ad31 (adenovirus 31).
  ORGANISM  Mastadenovirus 31
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 150)
  AUTHORS   Stillman,B.W., Topp,W.C. and Engler,J.A.
  TITLE     conserved sequences at the origin of adenovirus dna replication
  JOURNAL   J. Virol. 44, 530-537 (1982)
  STANDARD  simple staff_review
BASE COUNT       40 a     28 c     47 g     35 t
ORIGIN      
        1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag
       61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt
      121 gtgggagccg ggcgcgccgg atgtgacgaa
//
LOCUS       HSKGAORA     3726 bp ds-DNA             VRL       29-JUN-1990
DEFINITION  Herpesvirus of turkey glycoprotein A (gA) gene and ORF of unknown
            function.
ACCESSION   M33303
KEYWORDS    glycoprotein A.
SOURCE      Herpesvirus of turkey DNA.
  ORGANISM  Gallid herpesvirus type 2
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 3726)
  AUTHORS   Kato,A., Sato,I., Ihara,T., Ueda,S., Ishihama,A. and Hirai,K.
  TITLE     Homologies between herpesvirus of turkey and Marek's disease virus
            type-1 DNAs within two co-linearly arranged open reading frames,
            one encoding glycoprotein A
  JOURNAL   Gene 84, 399-405 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        207     1676     glycoprotein A
    pept       2118     2756     ORF2
BASE COUNT     1016 a    884 c    815 g   1011 t
ORIGIN      
        1 tcgtgcgcat cgcccatggc tggcggaacg ctctttcgaa ccgtgaataa aactttgtat
       61 ctactaaaca ataactttgt gttttattga gcggtcgaaa acaatgagga gctgcaattt
      121 aaagctaacc gcatacgccg ggcgggtaaa gaccatttta taccatatta cgcatctatc
      181 gaaacttgtt cgagaaccgc aagtatatgg tttccaacat gcgcgttcta cgcgtactgc
      241 gcctgacggg atgggtgggc atatttctag ttctgtcttt acagcaaacc tcttgtgccg
      301 gattgcccca taacgtcgat acccatcata tcctaacttt caacccttct cccatttcgg
      361 ccgatggcgt tcctttgtca gaggtgccca attcgcctac gaccgaatta tctacaactg
      421 tcgccaccaa gacagctgta ccgacgactg aaagcactag ttcctccgaa gcgcaccgca
      481 actcttctca caaaatacct gatataatct gcgaccgaga agaagtattc gtattcctta
      541 acaatacagg aagaattttg tgtgacctta tagtcgaccc cccttcagac gatgaatggt
      601 ccaacttcgc tcttgacgtc acgttcaatc caatcgaata ccacgccaac gaaaagaatg
      661 tagaggttgc ccgagtggcc ggtctatacg gagtaccggg gtcggattat gcatacccta
      721 ggaaatcgga attaatatcc tccattcgac gggatcccca gggttctttc tggactagtc
      781 ctacaccccg tggaaataaa tatttcatat ggattaataa aacaatgcac accatgggcg
      841 tggaagttag aaatgtcgac tacaaagaca acggctactt tcaagtgata ctgcgtgata
      901 gatttaatcg cccattggta gaaaaacata tttacatgcg tgtgtgccaa cgacccgcat
      961 ccgtggatgt attggcccct ccagttctca gcggagaaaa ctacaaagca tcttgcatcg
     1021 ttagacattt ttatcccccg ggatctgtct acgtatcttg gagacgtaac ggaaacattg
     1081 ccacaccccg caaggaccgt gacgggagtt tttggtggtt cgaatctggc cgcggggcca
     1141 cactagtatc cacaataacc ctcggaaact ctggactcga atctcctcca aaggtttcct
     1201 gcttggtagc gtggaggcaa ggcgatatga taagcacatc gaatgctaca gctgtaccga
     1261 cggtatatta tcacccccgt atctctctgg catttaaaga tgggtatgca atatgtacta
     1321 tagaatgtgt tccctctggg attactgtga ggtggttagt tcatgatgaa ccccagccta
     1381 acacaactta tgatactgtg gttacaggtc tctgcaggac catcgatcgt tatagaaatc
     1441 tcgccagtcg gattccagtc caggacaact gggcgaaaac gaagtatacg tgcagactaa
     1501 ttggatatcc gttcgacgtg gatagatttc aaaattccga atattatgat gcaacgccgt
     1561 cggcaagagg aatgccgatg attgtaacaa ttacggccgt tctaggactg gccttgtttt
     1621 taggtattgg tatcattatc acagccctat gcttttacct accggggcgg aattaagatt
     1681 aaccatcgta tgtgatataa aaattattaa gtgttataac cgatcgcatt cttctgtttc
     1741 gattcacaat aaataaaatg gtattgtaat cagcaccatc gcattgtttc gtagatgact
     1801 catgttcagt ccgcgtgatg tcaaaaatac gtatttttgg tatcacgcag cggccaaaat
     1861 gcccattatg ttatttttac tccaaacgcg gtatttaaaa catcgggacg tacatcatgt
     1921 ggcgcacgtt aatcgtatac ggtgccgcta cattaaaaat cgcaagtctc cgaatatcaa
     1981 gctcacggcc aaaacgtcgg taataatctt acgcatcgaa tgtgatacgg ataccgtaca
     2041 atcgctgagt agatttccta tatagttact cagtagtgat acacaatcac aaaatcgctg
     2101 gggtatatca tataagaatg atgtcgccca cccctgaaga tgatcgcgat ctcgttgtgg
     2161 ttcgtggacg tctccgaatg atggatagcg gcacggaaac agatagagag caacgacatc
     2221 cacgtacgac ttggcgatcg atctgttgtg ggtgtacgat aggaatggta tttaccatat
     2281 tcgttctcgt agcggcagta ttgttgggat cactattcac tgtttcatac atggccatgg
     2341 aatcgggaac atgtcccgat gaatggattg gtttgggtta tagttgcatg cgcgtggccg
     2401 ggaaaaatgc aactgatctt gaggcgttgg atacatgtgc tcggcataac agcaaactta
     2461 ttgacttcgc aaacgccaaa gttctggttg aagctatcgc cccattcggt gtgccaaatg
     2521 cagcatatgg ggaagtcttc cggttaaggg acagcaaaac cacgtgtata cgacctacca
     2581 tgggaggacc cgtgtcggca gactgtcctg taacatgtac cgttatatgt cagcgaccca
     2641 ggcctctaag taccatgtct tccatcatta gagatgcccg cgtgtatctt catttagaac
     2701 gacgcgatta ttatgaagtc tacgcctctg tcctctctaa tgcgatgagt aaataaaaac
     2761 gcacctctaa cggttactgt gttttattta tccaatcaca ccatagacat tataaacata
     2821 atatggatct ttatttcata taatgtatta tacagaccgc gcatcgcgtg taaaatatta
     2881 ttcgctggac actttcagcc tacgtaaaat atgttctacc ttggacactg gtatatctcc
     2941 acgcattgtt ttttcatgaa agtatttggt tgtcggagta gacaccctgc cagattccga
     3001 aactggaggg tcattgctcg ccaaagaagg ccccccagac ccatcggtct tacgaattgc
     3061 accagtcaga ccgtttccga tgttcgcaca ttgagaactc gaccccgccc ccgcaaattc
     3121 agtcaaacca ggaaccgtat cgccacagcc acattcgaca cgatcgccta attgatgtgt
     3181 gggcgtactt cccgacttgt tcgagttttt atcttgcttc cgaatagatt tcataagccg
     3241 tgcattaaca tcgcttattc gctttggcag taattttatc cccagtagag ccagggcgtt
     3301 cttagacagt ttaacatcat cggttgcagc gcggccgaac caagtattta tgctgcactc
     3361 gagagccccc catgacacgg gcaacatgct cggaaacaaa ttcccaagct gatgttcaaa
     3421 ttggtcttgc tgcaacaggg cctcttttaa tatagtattt tccgtattac cgtccaacca
     3481 tatcacatat cccctctgag ttaaattaat taggtactga caatgatggt tgagaagtga
     3541 aaccaacgaa accactgcta gtatggcacg tgagttccga tcaccatcaa agtccaccag
     3601 gcgggtttgc gaccataata cgctggtttg tagtaagggg gcgatgacat tgcctataat
     3661 tcgacaaagc aatgatgtcg aactccattt gtctgatggt aaggaggaaa tttcttccat
     3721 gaattc
//
LOCUS       HSKTK        2039 bp ds-DNA             VRL       29-JUN-1990
DEFINITION  Herpesvirus of turkey thymidine kinase gene, complete cds.
ACCESSION   M26659
KEYWORDS    thymidine kinase.
SOURCE      Herpesvirus of turkey (strain FC126) DNA, passed in chicken embryo
            fibroblasts.
  ORGANISM  Gallid herpesvirus type 2
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 2039)
  AUTHORS   Martin,S.L., Aparisio,D.I. and Bandyopadhyay,P.K.
  TITLE     Genetic and biochemical characterization of the thymidine kinase
            gene from herpesvirus of turkeys
  JOURNAL   J. Virol. 63, 2847-2852 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.L.Martin, 03-AUG-1989.
FEATURES       from  to/span     description
    pept        608     1540     thymidine kinase (EC 2.7.1.21)
BASE COUNT      570 a    458 c    437 g    574 t
ORIGIN      649 bp upstream of XbaI site.
        1 atcgatgaaa gtctgtacat atagccttct tttttaatgg tgtcgaacgt ataactacaa
       61 gcccgcgtct cgcgacaaac actaaaacgg ggcaaatgat agtatgatca gatcccctgg
      121 gggccatgcg ctcgatcaac gttttagact ccagcaactg cttagtccct gttgcgcgtt
      181 gctcattctt gctggccgtc ttgaggttgc ggtaaaatct gcaagttttc agttcgatta
      241 tgacacagac attgttacag ccttccacca ataacccgcc agataaggtg aacatgcaaa
      301 tgcaatctgg ccttctgcgc cccaggtcta cttcaaaggc caatcgaaac gccgacgccc
      361 ctttaagtag ttcagaaggc aggatcttgc ttaatatctt ctgtaatgtt ctaccaactg
      421 tattccccgc cgacgactgg tcaaagtttt cagcagcaag ctttttgtaa aaacgattat
      481 gaccacggac acccgctttt agcaatcctg ccataaggtg gtttcgccgc gtgcttgcct
      541 cgaagacaat tgccagctaa tccagcatta ccatatttcc ttggcttgca tttggatctg
      601 cgcgtcgatg gcattgccga gaagaccgcc cacgttaacg cgagtttatc tagacggacc
      661 gtttggtata ggcaaaacgt ctatactaaa cgctatgccc gaccacacgc ccgatggggc
      721 tcctatattg aaagtgtacg aaccaatgaa atattggaga tgccagtcta ccgatttggt
      781 ggtagctgcc aacgaaacgc cagaacgtag gcgtggtgga gctttatcac gattccaatc
      841 tgacatgatc atggcatcta tacaagccag atttgccgat ccatatttgc tttttcacga
      901 acggttatca tctaaatgta gaggaaaaat agaaatatgc gatactccag caattatatt
      961 aatgctggat aggcaccctg tggcggcgat attatgtttc ccaatcactc gctatttact
     1021 tggagaatat tctttggaaa tgttgattag ctctataata agacttccgt tggaatcccc
     1081 cggatgcaac ctgacagtca caatccttcc cgacgaaaag gaacacgtta ataggatttg
     1141 ttcaagagat agaccgggtg aaacggcaga tagaaatatg ctcagaacac tcaatgccgt
     1201 atacgcatct ttggtggaca cggttaaata cgcaaatcta acatgccctt acgagaaaga
     1261 aagctgggaa atggaatggt tgggacttcc ctggtttgaa gagtcattac ttgaagaatt
     1321 catctcgcgt ccccgccctg ttatttgttc gagaactcga atgccgctgg accgaactct
     1381 cctggccatt tttaaacgga aagagctgtg tagcgaaaat ggggagctgt taactcagta
     1441 ttcttggata ttgtggggat tactgactaa actacacacc attaatgtcg aattatttga
     1501 cattagcggt atgtcacgtc gagaatgcgc cacgctataa tgcatactat gccggagaga
     1561 ttgtctactc tcgctagctg gaatgattta tgcgagcttg aagatgatgt aatttcctat
     1621 aataagggaa tgtgtaacga ggttggagcg tctcgataat tcttcttaat ctgctggtat
     1681 tggttactgc cataacttaa tattggtcca tgctagaata gtcatacgct acgatctgtt
     1741 gctatatatg actatcgcca aactgttaaa ccgcgaagaa tatatttcat ataaacctaa
     1801 gggcccctca gtctgatttt ttgtgaaaac gtgtatacat gaagttttac tgcataatcc
     1861 gtttcatgat catagcgaat ctttattcat cttaccaaat atcgcttcca ggcacatatc
     1921 catcgcaaat attgcttgac atgaagaact cgccgctcgt acgctttaat atatcgacgc
     1981 gtgattataa agacgagaca ctctggatac ggaaaaattc gacatttgtt tatatcgat
//
LOCUS       LACHSP       3095 bp ds-DNA             BCT       29-JUN-1990
DEFINITION  Plasmid pTR2030 (from L.lactis) aborts phage infection (hsp) gene,
            complete cds.
ACCESSION   M30192
KEYWORDS    .
SOURCE      Plasmid pTR2030 (from Lactococcus lactis lactis (strain ME2)) DNA.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 3095)
  AUTHORS   Hill,C., Miller,L.A. and Klaenhammer,T.R.
  TITLE     Sequence of HSP gene from pTR2030
  JOURNAL   Unpublished (1989) North Carolina State Univ., Raleigh, NC 27695
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Hill, 29-NOV-1989.
FEATURES       from  to/span     description
    pept       1105     2991     hsp gene (aborts phage infection)
BASE COUNT     1129 a    439 c    518 g   1009 t
ORIGIN      
        1 ataagagtca gaatgtgacc gtcataatga ttacgaggat agctgcaacg cttggtttcg
       61 agttaattgt cgtgcaacga ttgacggaga attgaaaaat ttcaaagttc acgatgttga
      121 gacttacgat aaaaagaaaa atagatttca tcggagattg tctgatgcct tggtcccaat
      181 aatatcttca gaagatgttg aatttgaagc cgagcaattt ttgaaattgt atttcccagt
      241 agctatggaa attcctcagc gtattgaccc gcttttgata gctgaaaaaa tgggacttac
      301 ggttgagtac catgaaattt cagaagatgg gaacattttt ggtcaaattt attttcatga
      361 tgctcttctt gacgggaaag aaatcaaggc aaaaactatc ctgattgatc ctcgtgtcat
      421 cgaaagccgt gggattggtg gacttaataa cacaataatg cacgagtgtg tccattggca
      481 taaacatagg cttgcatttg aattggttag gcttttccaa ccagagcttt cgaatattac
      541 cacaactaaa gaagagttcg atgggctcat tgagaaaaat atgactccaa cagattggct
      601 ggagattcaa gctagaaaga ttacgcctaa aatcttgatg cctaagaaga tgtttaaaca
      661 agaagttgaa acgtttatgc gtcctgatgg tggaagtgga atagtggacc agctcctcat
      721 aatcgaggga actatttcag agcttgctag ttttttcacc gtatcaaaat tatctgcaaa
      781 aattcggatg gttgagctgg gatatgagat tgcgataggt gcttcaaact atgttgatgg
      841 gcatcctgta cccccacact cttggaagca gggagctgta tctgctaatc agacttattc
      901 aattggtttt gttgatgcaa ctatagagac attaaagaac ccaagattac tcgtcgcaat
      961 aaaaaaagaa cttaacctca tatttcatag ggattgaagt tcttgttttt tatttttggt
     1021 agctaaaatg gtatccaatt aataattagc tattttattt tggtaaaata gaataaatag
     1081 atttttagaa cagggagtag gtaaatgata actttgcaac accaagattg ggaacgtgca
     1141 gtcaatatga ttaaaaacat tccaccaagt gccaaaaata aatactttca aacttttcct
     1201 ttctttttgt taagtgaaac cagctgggaa gagttacttt ctgaaaattt tttctactca
     1261 tatataaagt caggagagtt tctgacttat caagaaaacc tttcattcta cgataggaca
     1321 attcaaaaga gccatggagc ataccgacaa actcgaattg tatctccaat catctatata
     1381 tttttaatag caattgcttc tcaagtggaa agaatatatg ttgaaaagcg tactaatgat
     1441 atgtcagtat atttttcagg ttcttttgaa aaagaaaaaa acactgcaca ttataaacaa
     1501 agttataaca cttatatgac tgaacttaat gcctgtcaag aagagtttga ttattatttt
     1561 caaacagatt tttcaacatt ttttcatcta gtagatacag acaacttatt caataaaata
     1621 gaccgattag atccgaaaag tgcattagtc tactcttccc taataaaaat gattggacaa
     1681 ggaagaatgc ccatagttga tggaaattca ggtctttctt ttttaaatac tgtagtttat
     1741 ctagatgact ttgataaaga aattattgat tccttaaaaa cgatagttga aatcgaaagt
     1801 tttaaactag ttcgatatgt agatgactta catattttca taaaatgtgc aaataaagat
     1861 ttagattttt taaattataa agtatataat ttgttatgcg aaaaagcaac gaaacatcat
     1921 cttgagataa attcttcaaa aacaaaatca tttactccca caagtgaact aagcactaaa
     1981 atgaatacag atttgtataa tttctttgtg tataacgaag atgttgattt tgagcaatat
     2041 ttttcaaaaa ataccttaat agaatttcta gataaattaa ataatatgtc tgttaatgct
     2101 gatttttcag agtacgaaaa agaagtcctg tatacattgg aaaaccctga aatagtatct
     2161 gacggaagtt atatattaaa cgctattgtt tacaataaaa gtacgtggtc acaggattat
     2221 gatataaaaa ataaaattag tttattagta aatagtaatt atagaaaact aagatattct
     2281 gcaaaagcac tcataactct tgtcttgaat acaagagatg gtgatatcat aaaaggacta
     2341 cttaataatt tatttactac attcaaaaat ggtacaaatg atattattga tgaaattata
     2401 ttaattgaat acttggtaca aagaaagttt aatcataaag atttaatgac tatcttgaaa
     2461 gctgacgacc atggtattaa ggaatatatt aaggcatatc aaacttctga tttcataaaa
     2521 agtctagaaa aaaataaagt tattttttat actaatcaga aagaagttta tcctttaata
     2581 agtaaggata aaatactcaa tttcatatat tttagagcta agtattttga atccttagac
     2641 ctagttcttg aatcctttgc atattataaa aactattttg ataggtttgt tgctcacgct
     2701 atgttctgta caggaattga ctctggaaga aaacctaatt ataagctata ttatacagaa
     2761 ggaaagttaa tagatgggtt aaaacaacta aattttttat cttctgatga gattacaaaa
     2821 ataattaatg aagcacataa aattagaaat agtaaccccg taagtcactc tagtgctggc
     2881 ttacttcaaa acgaagattt tagtagatat agagttaagt caagtcttaa tgacttgaaa
     2941 attatcatag aacaactttc aacattgcta caaaataaaa acagattata aaatattatt
     3001 atctgtttta tttttttcaa aagaatcaaa gtttattaga tttttcaact gtcctgttat
     3061 ctcctcatcc gcatttggat aaagatgact ataaa
//
LOCUS       PH15RER      2016 bp ds-DNA             PHG       29-JUN-1990
DEFINITION  Bacteriophage phi-15 right early region containing genes 17,
            16.5-16.9, complete cds.
ACCESSION   M28830
KEYWORDS    DNA replication protein.
SOURCE      Bacteriophage phi-15 DNA.
  ORGANISM  Bacteriophage phi-15
            Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE   1  (bases 1 to 2016)
  AUTHORS   Benes,V., Arnold,L., Smrt,J. and Paces,V.
  TITLE     Nucleotide sequence of the right early region of Bacillus phage
            phi-15 and comparison with related phages: Reorganization of gene
            17 during evolution
  JOURNAL   Gene 75, 341-347 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        203      664     gene 17 peptide
    pept        666      992     gene 16.9 peptide (put.)
    pept        989     1306     gene 16.8 peptide (put.)
    pept       1308     1700     gene 16.7 peptide (put.)
    pept       1697     1861     gene 16.6 peptide (put.)
    pept       1858     1971     gene 16.5 peptide (put.)
BASE COUNT      744 a    374 c    382 g    516 t
ORIGIN      
        1 aaagtagggt acagcgacaa cataccccat ttccccattg accgactatc ttcgacaaca
       61 atctaacaac taaaacacaa ctatatacct atactaaata acaacatcaa tttgtcgaaa
      121 agggtagaca aactactgtt taacatggta tactataata gaagtaagga agtaagacaa
      181 ccaatcatag gaggaattac acatgaacaa ctaccaatta actattaacg aggtaatcga
      241 gatcatcaac cgcaatacag acattaataa gttagtcgct aaaaaggaca acttatatcc
      301 aacagatttg tatgacttag acaaacaaca attaatcgct atcattctca acagcgattt
      361 cgcattatct agcatcaaaa gagcattatt agaggtaact gtcgaagagc taagagaaca
      421 ggacaatgat gacgatttag atgaaataga taacgagtta tacgagggcg cagaggcggc
      481 tgatgttcca cgtgaaacaa tagtgaaagt attcgaggca gacaaatcaa tagtaacgtt
      541 caacggtgag aaactaaaac attatgttaa tgttgatgac aatagctcat ctgttgatga
      601 agtaaagaag atcgccaagg aaatatcaga acatgacttt aacgaaaaca aggagaacaa
      661 gtaacatgag cgtacaactt aatgcattta cattcatctt agaacgccgt ggatggcgca
      721 tggtatgcta cgaacaatta acaacgaacg ggacacgcat attacatttc tatctcaaag
      781 ataacccaac gttctttgct acatactcat cacaattctt atcagacacc aaaatgataa
      841 gacgctttgc atcatggagc gggcaattac ttgaaggttc aaactctgtg ttctggacta
      901 acataacacc atttgagcca attgatgaag aaacagcaga agacatcaag aatctcgaca
      961 aagtcgttga aggaatgaat tttacgttat gattgacatt attgtaaaag aggacaagcg
     1021 cttaataact gttcaaacac cagagggaga cgaagtattt tacactttgt ctttctcaga
     1081 cgagcataag ttactaaaac gatcaagtgc cagactaaga aacaacatat atgccattgg
     1141 tgtagcaaat atcagatggg tgttagttga catggataac atgatactta gtgagtatat
     1201 gcatcacgtt gacatcctaa aagacataga caggaaaatg agacaattag ggtatatagt
     1261 catttcagaa tggcaacacg caaataaaaa agggacgagg agataatatg gaagctattt
     1321 tgatgatcgg tgtaatcaca ttgtgcgtta tattcctttt atcaggacgc aataataaaa
     1381 agaaacaaga aataagggag ctagaagatt accttgaaga cctcaatcaa agaatcgtgc
     1441 aacgaacaca gatacttagt gagcttaacg aagtcatcac aaacagaagt gttgataagt
     1501 cagtcaatat gtcagcttgt gaaatcgccg tccttgacct ctatgagcag tcaaatatcc
     1561 gcattcctag tgatatcatc gaagatatgg ttaatcaacg tttacaatca gaacaagatg
     1621 tgttaaacta tatagagaca cagcggacat actggaaact ggagaatcag aaaaaactat
     1681 atcgggggtc attgaaatga agttgctcac tcacacgtgt cattactgta gcttctcttt
     1741 cttcactcga aagtttgatg tatttggtgc aataaccaag aaagatactc ctgttgtttt
     1801 ctgccccgca tgtgggaatc aatccctttc agtatcacac attgaggagg agatcatatg
     1861 aatcagaaag aattccaagc cgttttagac tggatgttgt cacacactat tatacaattc
     1921 cacgaataca actatatgct acaaaagagc ctaccgttcc tcagacgata ggcttttctt
     1981 ttgattgttt cacgtggaac atcactgtat acgcat
//
LOCUS       STLENV       4122 bp ss-RNA             VRL       29-JUN-1990
DEFINITION  Simian T-cell leukemia virus, pol-env-pX-3' LTR region.
ACCESSION   M11373 M11374
KEYWORDS    env gene; pol gene.
SOURCE      Simian T-cell leukaemia virus genomic RNA.
  ORGANISM  Simian T-cell lymphotropic virus type I
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 4122)
  AUTHORS   Watanabe,T., Seiki,M., Tsujimoto,H., Miyoshi,I., Hayami,M. and
            Yoshida,M.
  TITLE     Sequence homology of the simian retrovirus genome with human T-cell
            leukemia virus type I
  JOURNAL   Virology 144, 59-65 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        311     1777     env protein
BASE COUNT      905 a   1456 c    741 g   1020 t
ORIGIN      
        1 catttaaacg tattaaccca ttgccacaaa acccgatggc aacttcacca ctccctccgt
       61 ctgcaaccta tcccagagac acactccccc atcagtaaac aaacccattg gtattatttc
      121 aagcttcctg gtcttaatag ccgccagtgg aaaggaccac aggaagctct ccaagaagcc
      181 gccggcgctg ctctcattcc ggtaagcgct aattctgccc agtggatccc gtggaggctc
      241 ctcaagcaag ctgcatgccc aagacccgtc gggggccccg ccgatcccaa agaaaaagac
      301 caccaacacc atgggtaagt ttcttaccac cttactttta ttcctccaac tccgcccccc
      361 cattgtctgt gatcacagtc ccagctgctg cactctcaca gtcggagtct cctcatatca
      421 ttctaaaccc tgcaatcccg cccaaccagt ctgctcatgg accctcgacc tgctagctct
      481 ttcagcagac caagccctac aacccccctg ccctaatctg gtgggttact ccaactacca
      541 tgctacctat tccctatatc tattccctca ttggattaaa aagccaaacc gaaatggcgg
      601 aggctattat tcggcctctt attcagaccc ttgttcccta aagtgccctt acctaggatg
      661 ccaatcatgg acctgtccct acacaggagc cgtctccagc ccctattgga aatatcagca
      721 agatgtcaat ttcactcaag aagtctcacg cctcagcctt aatctccact tttcaaaatg
      781 cggtttctcc ttctcccttc tggtcgatgc actaggatac gaccccatct ggttccttaa
      841 caccgaacct agccaactgc cccccactgc ccttcctcta ctcccccatt ccaacttgga
      901 ccacatcctc gagccttcca taccatggaa gtcaaaactc ctaactcttg tccaattaac
      961 tctacaaagc actaactaca cttgtattgt ctgcatagat cgtgccagct tatctacttg
     1021 gcacgtctta tactctccca acatctctgt tccatcctcc ccctctactc ccctccttta
     1081 cccatcgtta gcgcttccag ccccccatct gacattacca ttcaactgga ctcactactt
     1141 tgaccctcag attcaagcca tagtctcctc cccctgtcac aactccctca tcttgccccc
     1201 cttttctctg tcacctgttc ccgccccagg gtcccgttcc cgccgggcag taccggtggc
     1261 ggtctggctc gtctctgccc tggccatggg aaccggagtt gctggcggga ttaccggctc
     1321 catgtccctc gcctcaggaa aaagtctcct acatgaagta gacaaagata tttcccaatt
     1381 gacccaagca attgtcaaaa accacaaaaa tctactcaag attgcacaat atgctgccca
     1441 aaacagacga ggccttgatc tcctgttttg ggagcaagga ggactatgta aagcactgca
     1501 agaacaatgt tgtttcctaa atattactaa ttcccatgtc tcaatacttc aagaaagacc
     1561 cccccttgaa aatcgagttc taactggttg gggcctcaac tgggaccttg gcctctcgca
     1621 atgggcccga gaggccttac aaactggaat cacccttgtc gcattactcc ttctggtcat
     1681 ccttgtaggg ccatgcatcc ttcgccaact acgacaactc ccctcgcgtg tcagataccc
     1741 ccactactct cttataaacc ccgagtcctc cctataaacc tagcagttac tgcgaccaca
     1801 ttgcttgcag cctcccctgc caatagttaa cctcccccat caaatcctcc ttttcctgca
     1861 acaacctctt ccgttcggcc tccaaggagt ctacctcact ctccagctgt ctagcataac
     1921 catcaactcc caactccttt attttttctt tcctagcatc acgctgtctc gccctcttag
     1981 ccccttgttc tgccagtctc cgcttgcgct caagccgctt ctcctcctcc cgcctcctgc
     2041 gccgagcctt cttctcttcc ttccttttca ggaactcggc aatctgttcc tcctcctctt
     2101 tctcccgctc tctttttcgc ttcctcttct cctcagcccg tcgctgccga tcacgatgcg
     2161 tccccccgcg aggtggcgtc tcctccctta gagggacccg ttgctgctgg ccattacatt
     2221 cctcttcgag ggatagtaaa ccgtcaagca cagactcctc ctccgtcctt tgtttgttta
     2281 agtcctcttc tagggataat agtccgtcca ccaagtcttc caccagcagg tcctccgggc
     2341 agggaacagg taaacatcga aacggcccta cacatacaaa attaatcata cttattatca
     2401 gcccacttcc caggttttgg acagagcctt ctttatggat acccagtcta cgtatttgga
     2461 aactgtgtgc aaggcgactg gtgccccatc tctgggggac tatgctcggc ccgcctgcac
     2521 cgtcacgccc tactggccac ctgtccagag catcagatca cctgggaccc caccgatgaa
     2581 cgcgttatcg gctcagctct gcagttcctt atccctcgac tcccctccct ccccacccaa
     2641 agaacctcta agaccctcaa ggtccttacc ccgccggcta ctcatacaac ccccaacatt
     2701 ccaccctcct tcttccaggc cgtacgaaaa tactccccct tccgaaacgg atacatggag
     2761 cccacccttg ggcaacaact cccaaccctg tcctttcccg accccggcct ccggccccaa
     2821 aacttgtata ccctctgggg aaactccgtt gtctgcatgt acctctacca actttccccc
     2881 cccatcacct ggcccctcct gccccatgta attttctgcc acccaggcca acttggggcc
     2941 tttctcacca acgttcccta caaacgaatg gaagaacttc tctataaaat tttccttaat
     3001 acaggggcca caatcattct acccgaggac tgcttgccca ccaccctatt ccgacccact
     3061 agggcacccg cctcgctgac agccaggcaa aatggcctcc ttccgttcca gtcaaccctt
     3121 actacgccag gccttatttg gacatttacc gatggcacgc ccatggtttc cgggccctgc
     3181 cctagggacg gtcagccatc tttggtgttg cagtcctcct cctttatatt tcacaaattt
     3241 caaaccaagg cctaccatcc ctctttccta ctctcacatg gcctcataca gtactcgtcc
     3301 tttcataact tacatctcct ctttgaagag tacaccaata ttcccatttc tttacttttt
     3361 aacaaagaag aggcaaatga tactgaccat gagccccaaa tgctccccgg gggcctaaaa
     3421 cctcccaatg gaaaacattt ccgcgaaaca gatgtctgaa aaaggtcagg gcccagacta
     3481 gggctctgac gtctcccccc ggagggcagc tcaacacaag gccctgacgt gtccccctgg
     3541 ggacaaatca tgagcccaga cctccgggaa gccaccgaga accacccatt tcctccccat
     3601 gtttgtcaag ccgtccccag gcgttgacga caacccctcg cctcaaaaac ttttcatggc
     3661 acgcatacag ctcaataaaa caacaggagt ctataaaagc gtggagacag ttcaggaggg
     3721 ggctagcttc tgctctacac gcgcccgccg ccctacctga ggccgccatc cacaccggtt
     3781 gagttgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgaccg ccgtctgggt
     3841 aagttcggag ctcgggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag
     3901 attcggccgg ctctccacgc tttgcctgac cctgcttgct taactccaca tctttgtttc
     3961 gttttctgtt ccgcgccgct gcagatcgaa agttccaccc ctttcccttt cattcacgtc
     4021 tgactgccgg cttggcccac ggccaagcac cggcatcctc actggctcgg agccagtagc
     4081 agcccatttt acaactctct ccagagagaa gtttagtaca ca
//
LOCUS       FIVCG        9474 bp ss-RNA             VRL       29-JUN-1990
DEFINITION  Feline immunodeficiency virus, Petaluma strain, complete
            genome.
ACCESSION   M25381 M25729
KEYWORDS    complete genome; env gene; gag gene; pol gene.
SOURCE      Feline immunodeficiency virus, Petaluma strain, clones FIV-14 [1],
            [3], and 34TF10 [2].
REFERENCE   1  (bases 1 to 355)
  AUTHORS   Olmstead,R.A., Barnes,A.K., Yamamoto,J.K., Hirsch,V.M.,
            Purcell,R.H. and Johnson,P.R.
  TITLE     Molecular cloning of feline immunodeficiency virus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 2448-2452 (1989)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 9472)
  AUTHORS   Talbott,R.L., Sparger,E.E., Lovelace,K.M., Fitch,W.M.,
            Pedersen,N.C., Luciw,P.A. and Elder,J.H.
  TITLE     Nucleotide sequence and genomic organization of feline immuno-
            deficiency virus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 5743-5747 (1989)
  STANDARD  full automatic
REFERENCE   3  (bases 1 to 9474)
  AUTHORS   Olmstead,R.A., Hirsch,V.M., Purcell,R.H. and Johnson,P.R.
  TITLE     Nucleotide sequence analysis of feline immunodeficiency virus:
            genome organization and relationship to other lentiviruses
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8088-8096 (1989)
  STANDARD  full automatic
REFERENCE   4  (bases 7175 to 7182; revision)
  AUTHORS   Elder,J.H. and Scripps Institute LaJolla,CA.
  JOURNAL   Unpublished (1990)
  STANDARD  full automatic
COMMENT     [2] sequence kindly provided in computer-readable form by John
            Elder, Scripps Institute, La Jolla, CA. [3] sequence kindly
            provided in computer-readable form by Robert Olmstead, NIAID, NIH,
            Bethesda, MD. [1] contains the LTR sequence from the complete
            genome in [3].
            
            The sequence from [3] is shown (with differences from [2]); the
            substantive difference from [2] is at position 6123 where the
            latter possesses an early stop codon in orf2, which is thought to
            encode the transactivator protein. Clone 34TF10 is, nevertheless,
            biologically active as described in [2]. A difference earlier
            reported at 7175-7182 turns out to have been a typo [2,4].  Orf1 is
            weakly homologous (PLSEARCH) with the Q orf of visna and has no
            detectable similarity to the vif protein of HIV and SIV.  This is
            consistent with the clustering of FIV with the ungulate
            lentiviruses [2],[3]. Several "urfs" are identified by [2] and [3]
            of which only orf3, possibly analogous to the L orf of visna, is
            annotated below; a small, possibly associated, orf is found at
            8955-9170. For a fuller discussion of the reading frames, see [2]
            and  [3].
FEATURES       from  to/span     description
    pept        628     1980     gag polyprotein
    pept    /  1869     5243     pol polyprotein (NH2 terminus uncertain) (AA
                                 at 1869)
    pept       5236     5991     orf1
    pept       5992     6228     orf2
    pept       6266     8836     env polyprotein
    pept       6712     6918     orf3
    LTR           1      355     5' LTR
    LTR        9120     9474     3' LTR
    rpt         217      281     R repeat 5' copy
    rpt        9336     9400     R repeat 3' copy
    binding     357      377     primer binding site
    variant      12       12     t in [3]; c in [2]
    variant     122      122     c in [3]; t in [2]
    variant     146      146     g in [3]; t in [2]
    variant     373      374     ga in [3]; a in [2]
    variant     498      501     ggac in [3]; agca in [2]
    variant    1575     1575     g in [3]; a in [2]
    variant    1840     1840     g in [3]; c in [2]
    variant    1977     1977     a in [3]; g in [2]
    variant    2028     2028     g in [3]; a in [2]
    variant    2033     2033     c in [3]; t in [2]
    variant    2288     2288     a in [3]; g in [2]
    variant    2464     2464     g in [3]; a in [2]
    variant    2490     2490     c in [3]; t in [2]
    variant    2580     2580     c in [3]; t in [2]
    variant    2631     2631     t in [3]; c in [2]
    variant    2735     2735     a in [3]; g in [2]
    variant    3733     3733     a in [3]; g in [2]
    variant    3743     3743     a in [3]; t in [2]
    variant    3812     3812     g in [3]; a in [2]
    variant    4202     4202     g in [3]; a in [2]
    variant    4211     4211     c in [3]; t in [2]
    variant    4271     4271     c in [3]; t in [2]
    variant    4493     4493     a in [3]; g in [2]
    variant    4540     4540     g in [3]; t in [2]
    variant    4706     4706     c in [3]; t in [2]
    variant    4891     4891     g in [3]; a in [2]
    variant    5332     5332     g in [3]; a in [2]
    variant    5497     5497     t in [3]; c in [2]
    variant    5559     5559     t in [3]; c in [2]
    variant    5986     5986     t in [3]; c in [2]
    variant    6000     6000     c in [3]; a in [2]
    variant    6123     6123     g in [3]; a in [2]
    variant    6149     6149     t in [3]; c in [2]
    variant    6179     6179     a in [3]; g in [2]
    variant    6238     6238     g in [3]; t in [2]
    variant    6245     6248     attt in [3]; ttt in [2]
    variant    6306     6306     t in [3]; c in [2]
    variant    6430     6430     c in [3]; t in [2]
    variant    6548     6548     a in [3]; c in [2]
    variant    6739     6739     c in [3]; t in [2]
    variant    6753     6753     t in [3]; c in [2]
    variant    6758     6758     a in [3]; g in [2]
    variant    6764     6764     a in [3]; g in [2]
    variant    6779     6779     a in [3]; g in [2]
    variant    6782     6783     aa in [3]; gg in [2]
    variant    7175     7182     acaggagg in [3]; caggagga in [2]
    variant    7381     7381     a in [3]; g in [2]
    variant    7484     7484     g in [3]; a in [2]
    variant    7512     7513     ag in [3]; ca in [2]
    variant    7618     7618     a in [3]; c in [2]
    variant    7625     7625     g in [3]; a in [2]
    variant    7670     7670     a in [3]; g in [2]
    variant    7704     7708     aaaaa in [3]; caaat in [2]
    variant    7750     7750     t in [3]; c in [2]
    variant    7808     7810     atg in [3]; gta in [2]
    variant    7824     7828     aaaag in [3]; caaaa in [2]
    variant    7996     7996     g in [3]; a in [2]
    variant    8054     8054     g in [3]; c in [2]
    variant    8129     8129     g in [3]; a in [2]
    variant    8222     8222     g in [3]; a in [2]
    variant    8238     8238     a in [3]; g in [2]
    variant    8329     8329     g in [3]; a in [2]
    variant    8377     8377     t in [3]; c in [2]
    variant    8394     8394     c in [3]; t in [2]
    variant    8548     8550     gaa in [3]; aac in [2]
    variant    8588     8588     g in [3]; a in [2]
    variant    9131     9131     t in [3]; c in [2]
    variant    9241     9241     c in [3]; t in [2]
    variant    9265     9265     g in [3]; t in [2]
BASE COUNT     3571 a   1370 c   2119 g   2414 t
ORIGIN      5' terminus of 5' LTR
        1 tgggatgagt attggaaccc tgaagaaata gaaagaatgc ttatggacta gggactgttt
       61 acgaacaaat gataaaagga aatagctgag catgactcat agttaaagcg ctagcagctg
      121 cctaaccgca aaaccacatc ctatggaaag cttgctaatg acgtataagt tgttccattg
      181 taagagtata taaccagtgc tttgtgaaac ttcgaggagt ctctttgttg aggacttttg
      241 agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta
      301 tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg
      361 cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc
      421 tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg
      481 agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata
      541 aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca
      601 acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg
      661 gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga
      721 gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt
      781 gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga
      841 gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg
      901 gtagcaggac ttttaaatat gacggtgtct actgctgctg cagctgaaaa tatgtattct
      961 caaatgggat tagacactag gccatctatg aaagaagcag gtggaaaaga ggaaggccct
     1021 ccacaggcat atcctattca aacagtaaat ggagtaccac aatatgtagc acttgaccca
     1081 aaaatggtgt ccatttttat ggaaaaggca agagaaggac taggaggtga ggaagttcaa
     1141 ctatggttta ctgccttctc tgcaaattta acacctactg acatggccac attaataatg
     1201 gccgcaccag ggtgcgctgc agataaagaa atattggatg aaagcttaaa gcaactgaca
     1261 gcagaatatg atcgcacaca tccccctgat gctcccagac cattacccta ttttactgca
     1321 gcagaaatta tgggtatagg attaactcaa gaacaacaag cagaagcaag atttgcacca
     1381 gctaggatgc agtgtagagc atggtatctc gaggcattag gaaaattggc tgccataaaa
     1441 gctaagtctc ctcgagctgt gcagttaaga caaggagcta aggaagatta ttcatccttt
     1501 atagacagat tgtttgccca aatagatcaa gaacaaaata cagctgaagt taagttatat
     1561 ttaaaacagt cattgagcat agctaatgct aatgcagact gtaaaaaggc aatgagccac
     1621 cttaagccag aaagtaccct agaagaaaag ttgagagctt gtcaagaaat aggctcacca
     1681 ggatataaaa tgcaactctt ggcagaagct cttacaaaag ttcaagtagt gcaatcaaaa
     1741 ggatcaggac cagtgtgttt taattgtaaa aaaccaggac atctagcaag acaatgtaga
     1801 gaagtgaaaa aatgtaataa atgtggaaaa cctggtcatg tagctgccaa atgttggcaa
     1861 ggaaatagaa agaattcggg aaactggaag gcggggcgag ctgcagcccc agtgaatcaa
     1921 atgcagcaag cagtaatgcc atctgcacct ccaatggagg agaaactatt ggatttataa
     1981 attataataa agtaggtact actacaacat tagaaaagag gccagaaata ctcatatttg
     2041 taaatggata tcctataaaa tttttattag acacaggagc agatataaca attttaaata
     2101 ggagagattt tcaagtaaaa aattctatag aaaatggaag gcaaaatatg attggagtag
     2161 gaggaggaaa gagaggaaca aattatatta atgtacattt agagattaga gatgaaaatt
     2221 ataagacaca atgtatattt ggtaatgttt gtgtcttaga agataactca ttaatacaac
     2281 cattattagg gagagataat atgattaaat tcaatattag gttagtaatg gctcaaattt
     2341 ctgataagat tccagtagta aaagtaaaaa tgaaggatcc taataaagga cctcaaataa
     2401 aacaatggcc attaacaaat gaaaaaattg aagccttaac agaaatagta gaaagactag
     2461 aaagagaagg gaaagtaaaa agagcagatc caaataatcc atggaataca ccagtatttg
     2521 ctataaaaaa gaaaagtgga aaatggagaa tgctcataga ttttagagaa ttaaacaaac
     2581 taactgagaa aggagcagag gtccagttgg gactacctca tcctgctggt ttacaaataa
     2641 aaaaacaagt aacagtatta gatatagggg atgcatattt caccattcct cttgatccag
     2701 attatgctcc ttatacagca tttactttac ctagaaaaaa taatgcggga ccaggaagga
     2761 gatttgtgtg gtgtagtcta ccacaaggct ggattttaag tccattgata tatcaaagta
     2821 cattagataa tataatacaa ccttttatta gacaaaatcc tcaattagat atttaccaat
     2881 atatggatga catttatata ggatcaaatt taagtaaaaa ggagcataaa gaaaaggtag
     2941 aagaattaag aaaattacta ttatggtggg gatttgaaac tccagaagat aaattacagg
     3001 aagaaccccc atatacatgg atgggttatg aattacatcc attaacatgg acaatacaac
     3061 agaaacagtt agacattcca gaacagccca ctctaaatga gttgcaaaaa ttagcaggaa
     3121 aaattaattg ggctagccaa gctattccag acttgagtat aaaagcatta actaacatga
     3181 tgagaggaaa tcaaaaccta aattcaacaa gacaatggac taaagaagct cgactggaag
     3241 tacaaaaggc aaaaaaggct atagaagaac aagtacaact aggatactat gaccccagta
     3301 aggagttata tgctaaatta agtttggtgg gaccacatca aataagttat caagtatatc
     3361 agaaggatcc agaaaagata ctatggtatg gaaaaatgag tagacaaaag aaaaaggcag
     3421 aaaatacatg tgatatagcc ttaagagcat gctataagat aagagaagag tctattataa
     3481 gaataggaaa agaaccaaga tatgaaatac ctacttctag agaagcctgg gaatcaaatt
     3541 taattaattc accatatctt aaggccccac ctcctgaggt agaatatatc catgctgctt
     3601 tgaatataaa gagagcgtta agtatgataa aagatgctcc aataccagga gcagaaacat
     3661 ggtatataga tggaggtaga aagctaggaa aagcagcaaa agcagcctat tggacagata
     3721 caggaaagtg gcaagtgatg gaattagaag gcagtaatca gaaggcagaa atacaagcat
     3781 tattattggc attaaaagca ggatcagagg agatgaatat tataacagat tcacaatatg
     3841 ttataaatat tattcttcaa caaccagata tgatggaggg aatctggcaa gaagttttag
     3901 aagaattgga gaagaaaaca gcaatattta tagattgggt cccaggacat aaaggtattc
     3961 caggaaatga ggaagtagat aagctttgtc aaacaatgat gataatagaa ggggatggga
     4021 tattagataa aaggtcagaa gatgcaggat atgatttatt agctgcaaaa gaaatacatt
     4081 tattgccagg agaggtaaaa gtaataccaa caggggtaaa gctaatgttg cctaaaggat
     4141 attggggatt aataatagga aaaagctcga tagggagtaa aggattggat gtattaggag
     4201 gggtaataga cgaaggatat cgaggtgaaa ttggagtaat aatgattaat gtatcaagaa
     4261 aatcaatcac cttaatggaa cgacaaaaga tagcacaatt aataatattg ccttgtaaac
     4321 atgaagtatt agaacaagga aaagtagtaa tggattcaga gagaggagac aatggttatg
     4381 ggtcaacagg agtattctcc tcttgggttg acagaattga ggaagcagaa ataaatcatg
     4441 aaaaatttca ctcagatcca cagtacttaa ggactgaatt taatttacct aaaatggtag
     4501 cagaagagat aagacgaaaa tgcccagtat gcagaatcag aggagaacaa gtgggaggac
     4561 aattgaaaat agggcctggt atctggcaaa tggattgcac acactttgat ggcaaaataa
     4621 ttcttgtggg tatacatgtg gaatcaggat atatatgggc acaaataatt tctcaagaaa
     4681 ctgctgactg tacagttaaa gctgtcttac aattgttgag tgctcataat gttactgaat
     4741 tacaaacaga taatggacca aattttaaaa atcaaaagat ggaaggagta ctcaattaca
     4801 tgggtgtgaa acataagttt ggtatcccag ggaacccaca gtcacaagca ttagttgaaa
     4861 atgtaaatca tacattaaaa gtttggattc ggaaattttt gcctgaaaca acctccttgg
     4921 ataatgcctt atctctcgct gtacatagtc tcaattttaa aagaagaggt aggataggag
     4981 ggatggcccc ttatgaatta ttagcacaac aagaatcctt aagaatacaa gattattttt
     5041 ctgcaatacc acaaaaattg caagcacagt ggatttatta taaagatcaa aaagataaga
     5101 aatggaaagg accaatgaga gtagaatact ggggacaggg atcagtatta ttaaaggatg
     5161 aagagaaggg atattttctt atacctagga gacacataag gagagttcca gaaccctgcg
     5221 ctcttcctga aggggatgag tgaagaagat tggcaggtaa gtagaagact ctttgcagtg
     5281 ctccaaggag gagtaaatag cgctatgcta tacatatcta ggctacctcc ggatgaaaga
     5341 gaaaagtata aaaaagactt caagaaaaga ctttttgaca cagaaacagg atttataaag
     5401 agactacgga aagctgaagg aataaaatgg agctttcata ctagagatta ttacatagga
     5461 tatgtcagag aaatggtggc aggatccact acatcattaa gtctaaggat gtatatatat
     5521 ataagtaacc cactatggca ttctcagtat cgtccaggtt tgaaaaattt caataaggaa
     5581 tggccttttg taaatatgtg gataaaaaca ggatttatgt gggatgatat tgaaaaacaa
     5641 aatatttgta taggaggaga agtttcacca ggatggggac cagggatggt aggtatagca
     5701 ataaaagctt ttagttgtgg cgaaagaaag attgaggcta ctcctgtaat gattataaga
     5761 ggagaaatag atccaaaaaa atggtgcgga gattgttgga atttaatgtg tcttagaaac
     5821 tcacctccaa agactttaca aagactcgct atgttggcgt gtggcgtgcc ggctaagaag
     5881 tggcgaggat gctgtaatca acgctttgtt tctccttaca gaacgcctgc tgatttagag
     5941 gtcattcaat ccaagcccag ctggaacctg ttatggtcgg gagaattatg aatggaagac
     6001 ataatagtat tattcaatag ggtcactgag aaactagaaa aagaattagc tatcagaata
     6061 tttgtattag cacatcaatt agaaagggac aaagctatta gattactaca aggattattt
     6121 tggagatata gatttaagaa accccgagta gattattgtt tatgttggtg gtgttgcaaa
     6181 ttctattatt ggcagttgca atctacatta tcaataacta ctgcttagaa atatttagat
     6241 taatatttca tttgcaacaa taagaatggc agaaggattt gcagccaata gacaatggat
     6301 aggactagaa gaagctgaag agttattaga ttttgatata gcaacacaaa tgagtgaaga
     6361 aggaccacta aatccaggag taaacccatt tagggtacct ggaataacag aaaaagaaaa
     6421 gcaaaactac tgtaacatat tacaacctaa gttacaagat ctaaggaacg aaattcaaga
     6481 ggtaaaactg gaagaaggaa atgcaggtaa gtttagaaga gcaagatttt taaggtattc
     6541 tgatgaaagt gtattgtccc tggttcatgc gttcatagga tattgtatat atttaggtaa
     6601 tcgaaataag ttaggatctt taagacatga cattgatata gaagcacccc aagaagagtg
     6661 ttataataat agagagaagg gtacaactga caatataaaa tatggtagac gatgttgcct
     6721 aggaacggtg actttgtacc tgattttatt tataggaata ataatatatt cacagacaac
     6781 caacgctcag gtagtatgga gacttccacc attagtagtc ccagtagaag aatcagaaat
     6841 aattttttgg gattgttggg caccagaaga acccgcctgt caggactttc ttggggcaat
     6901 gatacatcta aaagctaaga caaatataag tatacgagag ggacctacct tggggaattg
     6961 ggctagagaa atatgggcaa cattattcaa aaaggctact agacaatgta gaagaggcag
     7021 aatatggaaa agatggaatg agactataac aggaccatca ggatgtgcta ataacacatg
     7081 ttataatgtt tcagtaatag tacctgatta tcagtgttat ttagatagag tagatacttg
     7141 gttacaaggg aaaataaata tatcattatg tctaacagga ggaaaaatgt tgtacaataa
     7201 agttacaaaa caattaagct attgtacaga cccattacaa atcccactga tcaattatac
     7261 atttggacct aatcaaacat gtatgtggaa tacttcacaa attcaggacc ctgaaatacc
     7321 aaaatgtgga tggtggaatc aaatggccta ttataacagt tgtaaatggg aagaggcaaa
     7381 agtaaagttt cattgtcaaa gaacacagag tcagcctgga tcatggttta gagcaatctc
     7441 gtcatggaaa caaagaaata gatgggagtg gagaccagat tttgaaagta aaaaggtgaa
     7501 aatatctcta cagtgcaata gcacaaaaaa cctaaccttt gcaatgagaa gttcaggaga
     7561 ttatggagaa gtaacgggag cttggataga gtttggatgt catagaaata aatcaaaact
     7621 tcatgctgaa gcaaggttta gaattagatg tagatggaat gtagggagta atacctcgct
     7681 cattgataca tgtggaaaca ctcaaaaagt ttcaggtgcg aatcctgtag attgtaccat
     7741 gtattcaaat aaaatgtaca attgttcttt acaaaacggg tttactatga aggtagatga
     7801 ccttattatg catttcaata tgaaaaaggc tgtagaaatg tataatattg ctggaaattg
     7861 gtcttgtaca tctgacttgc catcgtcatg ggggtatatg aattgtaatt gtacaaatag
     7921 tagtagtagt tatagtggta ctaaaatggc atgtcctagc aatcgaggca tcttaaggaa
     7981 ttggtataac ccagtggcag gattacgaca atccttagaa cagtatcaag ttgtaaaaca
     8041 accagattac ttagtggtcc cagaggaagt catggaatat aaacctagaa ggaaaagggc
     8101 agctattcat gttatgttgg ctcttgcagc agtattatct attgccggtg cagggacggg
     8161 ggctactgct atagggatgg taacacaata ccaccaagtt ctggcaaccc atcaagaagc
     8221 tgtagaaaag gtgactgaag ccttaaagat aaacaactta agattagtta cattagagca
     8281 tcaagtacta gtaataggat taaaagtaga agctatggaa aaatttttgt atacagcttt
     8341 cgctatgcaa gaattaggat gtaatcaaaa tcaatttttc tgcaaaatcc ctcctgagtt
     8401 gtggacaagg tataatatga ctataaatca aacaatatgg aatcatggaa atataacttt
     8461 gggggaatgg tataaccaaa caaaagattt acaacaaaag ttttatgaaa taataatgga
     8521 catagaacaa aataatgtac aagggaagaa agggatacaa caattacaaa agtgggaaga
     8581 ttgggtagga tggataggaa atattccaca atatttaaag ggactattgg gaggtatctt
     8641 gggaatagga ttaggagtgt tattattgat tttatgttta cctacattgg ttgattgtat
     8701 aagaaattgt atccacaaga tactaggata cacagtaatt gcaatgcctg aagtagaagg
     8761 agaagaaata caaccacaaa tggaattgag gagaaatggt aggcaatgtg gcatgtctga
     8821 aaaagaggag gaatgatgaa gtatctcaga cttattttat aagggagata ctgtgctgag
     8881 ttcttccctt tgaggaaggt atgtcatatg aatccatttc gaatcaaatc aaactaataa
     8941 agtatgtatt gtaaggtaaa aggaaaagac aaagaagaag aagaaagaag aaagccttca
     9001 agaggatgat gacagagtta gaagatcgct tcaggaagct atttggcacg acttctacaa
     9061 cgggagacag cacagtagat tctgaagatg aacctcctaa aaaagaaaaa agggtggact
     9121 gggatgagta ttggaaccct gaagaaatag aaagaatgct tatggactag ggactgttta
     9181 cgaacaaatg ataaaaggaa atagctgagc atgactcata gttaaagcgc tagcagctgc
     9241 ctaaccgcaa aaccacatcc tatggaaagc ttgctaatga cgtataagtt gttccattgt
     9301 aagagtatat aaccagtgct ttgtgaaact tcgaggagtc tctttgttga ggacttttga
     9361 gttctccctt gaggctccca cagatacaat aaatatttga gattgaaccc tgtcgagtat
     9421 ctgtgtaatc ttttttacct gtgaggtctc ggaatccggg ccgagaactt cgca
//
LOCUS       ASTSSP12S    3341 bp ds-DNA             PLN       29-JUN-1990
DEFINITION  A.sativa 12S globulin seed storage protein gene, complete cds.
ACCESSION   J05485
KEYWORDS    12S globulin; storage protein.
SOURCE      A.sativa (genomic library lambda-Charon-34), clone lambda-OG1-E1.
  ORGANISM  Avena sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 3341)
  AUTHORS   Shotwell,M.A., Boyer,S.K., Chesnut,R.S. and Larkins,B.A.
  TITLE     Analysis of seed storage protein genes of oats
  JOURNAL   J. Biol. Chem. 265, 9652-9658 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Shotwell, 17-APR-1990.
FEATURES       from  to/span     description
    pept        983     1310     12S globulin seed storage protein, exon 1
               1428     1702     12S globulin seed storage protein, exon 2
               1829     2344     12S globulin seed storage protein, exon 3
               2445     2882     12S globulin seed storage protein, exon 4
    pre-msg     945  >  3341     12S globulin seed storage protein mRNA and
                                 introns
    IVS        1311     1427     12S globulin seed storage protein intron A
    IVS        1703     1828     12S globulin seed storage protein intron B
    IVS        2345     2444     12S globulin seed storage protein intron C
    signal      885      890     CAT box
    signal      917      921     TATA box
    signal     2915     2920     polyA addition signal
    signal     2965     2970     polyA addition signal
BASE COUNT     1044 a    677 c    606 g   1014 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattctttt tggaaagtca ttttgcctcc tgaactccag tgtttcctat ttattaaaaa
       61 aaactaaaaa ctatatttat aagtttgaaa aaatcatgaa acaaaattgt aaaaattgct
      121 agtgatatat cccacaaacg tgcaaaatct caatttgaag tgctttgtat ttcgagctac
      181 acaaaaatga caaagtgtga ctttttatgt gatttgaaat cactatacta cagatctaca
      241 attttgttct ttttgtgaag ctataaatac acattatttt gagttgagat ttttcatgtt
      301 tgtgctatga atcataggct acatcctgat ttatttttag aattttttgg aacctaaaat
      361 atgttctaga ttatttttta aaaagtggga tcattgatgc ccatacacac gaaatctcca
      421 ctcaattctt ttatacatta tctttctata tctactaacg tggattatac atcatagtaa
      481 gtttcttact acatgtgctt tcttgttaca atgtggacat gactcttcca cttttgggct
      541 ttatgttgta tggatatact catgacatgg aattttgtcc acacacgtag aaccatccat
      601 atatattgtt gtgcatagaa caaaacacaa gcaagccatg aaaaaggagt cacaggtgcc
      661 acaaactgtt gtaggaagta taactagtat gaggcctttt atttgacgtc ggacaatggc
      721 caagagctac atataaaaga tggtgctaga tttgtgagtc agcaccagct gtaggcagaa
      781 aacaacacat atcttttgag acaaagttat atctattcac atttaaaacc atgatctgtt
      841 gagtcaccat gaatatcttc ttatctatgt taataattac atgtcatcat gtttatcctg
      901 gactactttt tatggctata aaatcaaact tacaataagg aaactagcac caatccacct
      961 tctacaatct cttcaaacaa tcatggcaac tactaggttt ccatcattgt tgttttactc
     1021 ctatattttt ctcttgtgca atgggtcaat ggctcagcta ttcgggcaga gctttacccc
     1081 atggcaaagc tctcgacaag gaggtttaag ggggtgcaga tttgataggc tacaagcatt
     1141 tgaaccactt cgacaagtgg ggtcacaagc gggtatcatt gagtactttg atgagcagaa
     1201 tgagcaattt cgttgtgcag gtgtatccgt catccgtcgt gttattgagc ctcaaggcct
     1261 cttgttacct caataccaca acgctcctgg cttggtgtac atccttcaag gttagtgtct
     1321 aattgattat aaaaattgcc tttgttatac ttcacttagg atttagatgt gccaaatgtt
     1381 acaccgttca tattttttaa caatgaaaca aatgtgtttt cttttaggta ggggattcac
     1441 agggttgact ttccctggat gcccggcgac cttccaacaa cagttccaac catttgatca
     1501 agcccagttt gctgaaggtc aaagccaaag ccaaaatctt aaggatgaac accaaagagt
     1561 tcaccacatc aaacaaggag atgttgttgc tctaccggct ggcatagtac actggtgcta
     1621 caacgatggt gatgcaccga ttgtagctgt ctatgtcttc gacgtaaaca acaacgctaa
     1681 tcagcttgaa ccaaggcaaa aggtaactat acaacttaat gcacacaaaa tatatacaac
     1741 tatttacaag tgatccaact atgatttgaa gtatacatta gtgggatatt aatgaactct
     1801 gtttaacttc atcgatataa aattttagga gttcttgttg gctggtaaca acaagagaga
     1861 gcaacagttt ggacaaaaca tattcagtgg attcagtgtc caacttctta gtgaggccct
     1921 tggtataagt cagcaagtag cacaaaagat tcagagtcaa aatgaccaaa gaggtgagat
     1981 aattcgtgtg agtcaaggcc ttcaattctt gaagcctttt gtttcccaac aaggaccagt
     2041 agagcatcaa gcctaccaac caattcaaag tcaagaagaa caatcaaccc aataccaggt
     2101 agggcaatca ccacaatatc aagaaggaca atcaactcaa taccagccag gacagtcatg
     2161 ggaccaaagt ttcaatggtt tggaggagaa tttctgttca ttggaggcaa ggcaaaacat
     2221 cgaaaacccg aaacgtgccg acacgtacaa cccacgtgct ggcaggataa cacatctcaa
     2281 tagcaagaat tttcccaccc ttaacctggt gcaaatgagt gctacaagag taaatttata
     2341 ccaggtattt atgatactac attcaacaca ctatcttatt tttagatatt ctaagcttca
     2401 tacaaccgat taataatatg gcatacaaat aattgctatt gcagaatgct attctttcac
     2461 catactggaa cattaatgct cacagtgtca tgcacatgat ccaaggacgt gctcgagttc
     2521 aagttgtcaa taaccatggt cagaccgtat tcaatgacat tcttcgtcgc ggacaactgc
     2581 taatcatacc acaacactat gttgttctca agaaggcaga gcgtgaagga tgccagtata
     2641 tttcattcaa gaccaacccc aattccatgg ttagccagat cgcaggaaag acctccatcc
     2701 tacgtgcatt gcccgttgat gtcctcgcca atgcataccg catttctaga caggaagccc
     2761 aaaacctcaa aaataaccgt ggagaagagt ttgatgcatt cacccctaag tttacacaaa
     2821 cgggctccca gagttaccag gacgaggggg agtcatcttc gactgagaag gcatccgagt
     2881 gaataagtga gtgtaatgga aactagtata gtgaaataaa ggcatcgcat gtttgcagcc
     2941 tagtggtata taaccgctta tctcaataaa aaagtttctc cgtgttatat tgtttgcttg
     3001 tttcttgtac tcttcttaat tttatctttt atgatcaacc aacttacctc tcttcttcaa
     3061 ttttctgcca ctcttcacat atgcatatca aggagacgta tagtccactt gtactagatt
     3121 tattttttgg tactatcttt tgtggtttga gtatttctta ggtccgataa aaacattcaa
     3181 tttactgatg ctaaattttt tcagcaaacc tagccaccta aaattgtgaa tcacatacct
     3241 tcaaatttct atgccaaagt gtgcgatttt tatgttctat agatccaagc ttgggctgca
     3301 ggtcgactct agaggatcga tccccgggcg agctcgaatt c
//
LOCUS       ASTSSPA      1548 bp ds-DNA             PLN       29-JUN-1990
DEFINITION  A.sativa seed storage protein gene, complete cds.
ACCESSION   J05486
KEYWORDS    avenin; prolamine; storage protein.
SOURCE      A.sativa (genomic library lambda-Charon-34), clone lambda-AV45-X1.
  ORGANISM  Avena sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1548)
  AUTHORS   Shotwell,M.A., Boyer,S.K., Chesnut,R.S. and Larkins,B.A.
  TITLE     Analysis of seed storage protein genes of oats
  JOURNAL   J. Biol. Chem. 265, 9652-9658 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Shotwell, 17-APR-1990.
FEATURES       from  to/span     description
    pept        484     1113     seed storage protein
    mRNA        411  >  1548     seed storage protein mRNA
    signal      343      347     CAT box
    signal      380      384     TATA box
    signal     1189     1194     polyA addition signal
    signal     1252     1258     polyA addition signal
BASE COUNT      484 a    375 c    323 g    366 t
ORIGIN      
        1 tctagaacaa tccttcctgt tactagttta atctacctta tacacagaca agtttacaaa
       61 ctttgcaagc catgacaata tggtttatgc ctaacagcct gtagaaaaac acaaacttag
      121 tttcagaaaa tgatgcaata tagatcagtg attgacatgt aaagcgaaaa agatgagaca
      181 tattgccaac tatcaagctt tcttgatagt cgaaaggtga tacatgtaat ggcaaagcaa
      241 cttttatgat cgatccaaaa ataacatttg taggtagtgc caccaaacac aacataccaa
      301 ataatcagtt cgagaagcat acaagcattt tttaaggaaa agcaaatgca aagatgaaaa
      361 gaaactatgc catggcagat ataaatagac ctgcaccatg gagatcctcc tccatcattc
      421 atccttcaca caccgagagc acaaacatta aaacctagca agcagtatca acacaaatct
      481 accatgaaga acttcctcat ccttgccctc ctttccatgg cggcgaccat ggccactgcg
      541 cagtttgacc ctagcgaaca ataccagcca tatcctgagc aacaacagcc aattctacaa
      601 caacagttgc tgttgcagca acaacaacag atgttgttgc agcaacaacc attgttgcag
      661 gttttgcagc aacagttgaa cccatgcagg cagttcctcg tgcaacagtg cagcccggtg
      721 gcagcggtgt cattcctccg gtcgcaaatc ctgcaacaga gcagctgcca ggtgatgagg
      781 caacaatgct gccgacggct ggagcagatc cccgagcagc tccggtgccc agccatccat
      841 agcgtcgtcc aggccatcat tatgcagcaa caacaacaac agttcttcca gcctcagatg
      901 caacaacagt tcttccagcc tcagatgcaa caggtgacac agggcatctt ccagcctcag
      961 atgcaacagg tgacacaggg catcttccag actcagatgc aaggtcagat cgaggggatg
     1021 agggcgtttg cgctgcaggc cctgccggcg atgtgcgatg tatacgtccc gccgcactgc
     1081 cccgtcgcca ccaccccgct cagtggcttc taagaacact acaagagcta tagtactaca
     1141 taaataccat cagcgtttag ccgatggacc gatcttgtag cggtgacaaa taaaataaaa
     1201 gggtcatgca ctaacatgtg tgacagccga cccgtactag tccaaacttg gaaataaaag
     1261 aatgcaaagt tcttgtctgc ataacattga ttgtcgtttt tccattcatg tttataccta
     1321 accataagct catgcattaa ctagctgctt atgtagcctt cattgatgtg aacataataa
     1381 cagagtgggt gtaaaagatg gatttgaatt gaggcactat taaagtgagg agtaagaacc
     1441 aggtgcagat gatagctctt taatggtgag gcttgtgagt tgtgacctta ttggctgaca
     1501 tgcattggat tctaaagagt acatgttgat gctttcaaca atgtcgac
//
LOCUS       DOGSPC18A     847 bp ss-mRNA            MAM       29-JUN-1990
DEFINITION  Dog microsomal signal peptidase complex (SPC 18) mRNA, complete
            cds.
ACCESSION   J05466
KEYWORDS    ER membrane protein; signal peptidase complex.
SOURCE      Dog liver, cDNA to mRNA, lambda-18-[50,51].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 847)
  AUTHORS   Shelness,G.S. and Blobel,G.
  TITLE     Two subunits of the canine signal peptidase complex are homologous
            to yeast sec-11 protein
  JOURNAL   J. Biol. Chem. 265, 9512-9519 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.S.Shelness, 04-APR-1990.
FEATURES       from  to/span     description
    pept         39      578     microsomal signal peptidase complex
BASE COUNT      212 a    162 c    216 g    257 t
ORIGIN      
        1 cctctccgct gcccgtgtcc tccttcaccg gccgcgccat gctgtctcta gattttttgg
       61 acgatgtgcg gcgaatgaac aagcggcagc tgtattatca agtcctaaat tttggaatga
      121 ttgtctcctc ggcactaatg atctggaagg gactgatggt cataactgga agtgaaagtc
      181 caattgtagt ggtgctcagt ggcagcatgg agccagcatt tcatagagga gatcttctct
      241 ttttaacaaa tcgagttgaa gatcccatcc gagtgggaga aattgttgtt tttaggatag
      301 aaggaagaga gattcctata gttcaccgag tcttgaagat tcatgaaaag caaaatggac
      361 atatcaagtt tttgaccaaa ggagataata atgccgttga tgaccgaggc ctctataaac
      421 aaggacaaca ctggctagag aagaaggatg tcgtggggag agcaagggga tttgttcctt
      481 atattggaat tgtgacgatc ctcatgaatg actatccgaa atttaagtat gccgtcctct
      541 tcttgctggg tttattcgtg ctggtccatc gtgagtaaga agtctacctt gctgttcctg
      601 gaagatgccg tacttttcgt tcctgactgt ttggagtaga tattggtcta tgattggtgg
      661 agtgcagaac gtacatgttg gcgcttcttg gtagcaatgg tttgcattag tttctgtttc
      721 cacgccaggg tctgtgtggg cgggtgcacc aagatgcgca ccgaggggac tctcaatcac
      781 aggatttcat atgttgtcat tgtcacactt tcacattttt gtacatcagt gaatttttta
      841 tattaaa
//
LOCUS       DROCRPA      7226 bp ss-mRNA            INV       29-JUN-1990
DEFINITION  D.melanogaster crumbs protein mRNA, complete cds.
ACCESSION   M33753
KEYWORDS    crumbs protein.
SOURCE      D.melanogaster (strain Oregon R) 3-12 hr embryo, cDNA to mRNA,
            clone lambda-gt10.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 7226)
  AUTHORS   Tepass,U., Theres,C. and Knust,E.
  TITLE     Crumbs encodes an EGF-like protein expressed on apical membranes of
            Drosophila epithelial cells and required for organization of
            epithelia
  JOURNAL   Cell 61, 787-799 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by U.Tepab 17-APR-1990.
FEATURES       from  to/span     description
    pept        214     6633     crumb protein
BASE COUNT     1880 a   1860 c   1879 g   1607 t
ORIGIN      Chromosome 95F9-13.
        1 aaaaaaaata aaaataacct gttctaagcc aagtgccgat tgtgtgtgtg tgtgtgtgtg
       61 cgtatgaagt gtttgcgtgg agcagagaga atgggagaat gcaattacca acaacaacaa
      121 gctcaatgag tatcgtttga atagcaataa tatctagcca aaagcaaccc tttccaatac
      181 gaaaactcga cttgagccaa caaatcagcg atcatggcta aaatcgccaa tgcgtcactg
      241 tcgcaacaac aaaaacagcg gcaagcagaa acagcaacga ccacaacaac aacagtggcg
      301 gcatcaggta gaaacagcaa caacaacagc aagaagtcga gaccgcacga aaagcgccac
      361 gtcaaattac gctcgcattt gctcaagcgc cgcccatttc agtttattcg tcgccctcaa
      421 tggataccgc tcttcatact tatttactta gcaacagatg tcgcctcagt ggcggtgccg
      481 acgaaggagg cgtactttaa tggctccact tacctccgcc tcaccacgcc gatgcccatt
      541 tgggatcact cggcgattag tttccgctcg tgccgcggcg gcgagatcct cgcccagcag
      601 tacaacaaga actccattgt aatctcagtg ctcaatgact ttctgcaaat ctcactggct
      661 ggacccgccg tccatgggcc caacaaccgg ctggatgtca agctgcccta ccaactgctg
      721 gacaaccgct ggcatacgct gcagttcaag tacgagtacg gaaatctcta cctgcatgtg
      781 gatcgcgcgg caagcatatt tgccaactcc acgtacaaca gtcagttcct gactaaccag
      841 gacattggct acaaggacgc catcttgata ctgggaaact ccttctccgg ttgccttctg
      901 gatggaccag gtctgcaatt tgtgaacaac tcgacggtgc agaatgtggt ctttggtcac
      961 tgtcccttga ccccgggtcc ctgcagcgat cacgatcttt tcactcgact gccagacaac
     1021 ttttgtctga atgatccctg tatgggccat ggaacctgct cgtccagtcc agagggatac
     1081 gaatgtcggt gcacggcacg ttactcggga aaaaattgcc agaaggacaa tggctcgccg
     1141 tgtgccaaaa atccatgtga gaacggtggt tcctgcctgg agaattccga ggggaattac
     1201 cagtgcttct gtgatcccaa ccacagtggt cagcactgcg agacggaggt gaacatccac
     1261 ccactctgcc agacgaaccc ctgcctgaac aatggagcat gtgtggtgat tggcggcagt
     1321 ggagcactca cctgcgagtg tcccaaagga tacgctggtg ccaggtgcga ggtggacacg
     1381 gatgagtgcg cctcacagcc gtgtcagaac aacggaagct gcatagaccg tatcaatgga
     1441 ttcagctgcg attgcagcgg caccggttac acgggtgcgt tctgccagac gaatgtggat
     1501 gagtgcgaca agaatccgtg cctgaacggc ggcagatgct tgcataccta cggatggtac
     1561 acctgccagt gtctggatgg ttggggcggc gagatttgtg atcgacccat gacatgccag
     1621 acgcagcagt gtttcaacgg tggcacctgc ctggacaagc ccatcggctt ccagtgcctc
     1681 tgtccgccgg agtacactgg tgaactgtgc caaatcgcac ccagttgcgc acagcagtgc
     1741 cccatcgatt cggagtgcgt aggcggcaaa tgcgtttgta agccaggctc atcgggttac
     1801 aactgtcaaa cgagcacggg cgatggagct tctgctttgg ccctgacacc catcaactgc
     1861 aatgccacca atggaaagtg tctcaatgga ggaacttgct ccatgaacgg aacccactgc
     1921 tattgcgccg tcggctattc gggagatcgt tgcgagaagg ccgagaactg ttcgccactg
     1981 aactgccagg aaccgatggt atgtgtccag aatcagtgcc tctgcccgga gaacaaggtg
     2041 tgcaaccagt gcgccacaca accgtgccag aatggcggtg aatgtgtgga cctaccgaat
     2101 ggagactacg agtgcaagtg cacccgtgga tggactggac gcacctgtgg caacgacgtg
     2161 gacgagtgca ccctgcatcc gaagatctgc ggcaatggta tctgcaagaa cgagaaggga
     2221 tcgtacaaat gctactgtac gcctggattt accggagtcc actgcgattc cgatgtggac
     2281 gaatgcctta gctttccctg cctcaacgga gccacgtgcc acaacaagat aaatgcctac
     2341 gagtgcgttt gccaaccagg atatgaggga gagaactgcg aagtagatat cgatgagtgc
     2401 ggcagtaatc cctgctcgaa cggatccacc tgcatcgaca ggatcaacaa cttcacctgc
     2461 aactgcattc cgggaatgag gggccgcata tgtgacatcg acattgacga ctgtgtggga
     2521 gatccctgct tgaatggtgg tcagtgcatc gaccaattgg gtggcttccg ttgtgactgc
     2581 agtggcaccg gttatgaggg cgagaattgt gagctgaaca tagacgaatg cctctcgaat
     2641 ccgtgcacga atggtgccaa gtgcctggac agggttaagg actacttctg cgattgccac
     2701 aacggctaca agggaaagaa ctgtgagcag gacatcaacg agtgcgagag taatccctgc
     2761 cagtacaatg gcaactgcct agagcgttcg aacataacgc tttaccaaat gagtaggatc
     2821 acggatctgc ccaaggtgtt tagccagccc tttagctttg agaatgccag cggctacgag
     2881 tgcgtctgtg tgcccggcat cattggaaag aactgcgaga tcaacattaa tgagtgcgac
     2941 agtaatccat gcagcaagca cggaaattgc aatgacggga ttggaaccta tacctgcgaa
     3001 tgcgaacctg gcttcgaggg cacccactgt gagatcaaca tagatgaatg tgatcgctat
     3061 aatccctgcc agaggggcac ttgctatgac cagatagacg actacgactg cgactgcgat
     3121 gcgaactatg gaggcaaaaa ctgttccgtt cttctcaaag gctgtgacca gaatccttgt
     3181 ttaaatggcg gtgcctgctt gccgtatttg atcaacgagg taactcatct gtacacctgc
     3241 acctgcgaga acggcttcca gggcgataaa tgcgaaaaga ccactactct gtccatggtg
     3301 gccactagcc tgatttcggt taccacggaa cgcgaggaag gctacgacat taatcttcaa
     3361 tttaggacca cactgcccaa cggagttttg gctttcggaa ccaccggcga aaagaatgaa
     3421 ccagttagct atattttgga gctgattaat ggacgactga atctacattc ctcgctccta
     3481 aataagtggg agggcgtgtt tattggatcg aagctgaacg atagcaactg gcacaaggtg
     3541 tttgtggcca taaacacttc gcatctagtc ctttcggcca acgacgagca ggctattttt
     3601 ccggttggct cctatgaaac ggccaataat agccagcctt cattcccgcg cacatacctc
     3661 ggaggcacca ttcccaatct gaagtcctat ctgcgccacc tcacccacca accatcagct
     3721 tttgtgggct gcatgcaaga tataatggtc aatggtaaat ggatctttcc cgatgagcag
     3781 gatgcgaata tcagttatac caaactggaa aatgtccaga gcggttgtcc gcgcaccgaa
     3841 caatgtaaac cgaatccttg ccattccaac gttgaatgca cggacctttg gcacaccttc
     3901 gcttgtcact gtcccagacc attctttggg cacacatgtc agcacaatat gactgctgcc
     3961 acttttggcc acgagaacac aacccattcg gccgtgattg tggagaccac agatgtggct
     4021 aggcgagcta ttagatccat cctagacatt tccatgttta ttcgtacacg cgagccaact
     4081 ggtcaagtct tttacttggg cactgatcca cgcaaagccc ccactaaaaa tattggcgac
     4141 tcatatgtgg cggccaaatt acacggcggc gagctgctgg taaagatgca gttcagcggt
     4201 actccggagg cctacaccgt tggtggccag aaattggaca acggctataa ccacctgatc
     4261 gaggtggtgc gcaatcaaac gctcgtgcag gtcaagctca atggcaccga gtacttccgc
     4321 aagacgctgt ccacgacggg tctgctggac gcacagttgc tttacttggg cggacctgca
     4381 cccacacgcg agtcacttct gggcgcgacc acagaaccag gaataattcc agttccggga
     4441 gcaggaatac ccattgaaga cacaacagtg cccaaggagg cggacgacag cagggactac
     4501 tttaagggca taatccagga cgtgaaggtt agcaatggct cgctcaacct gattgtggag
     4561 atgtattccc taaacgtgac agatgtccaa gtgaatgcca aacctctggg cgctgtaacc
     4621 atcgatcgtg cctccgtgtt acctggtgag gtgtccgatg atctgtgccg aaagaatgcc
     4681 tgcctccata atgccgagtg caggaacaca tggaatgact acacttgcaa atgcccaaat
     4741 ggttacaagg gaaagaagtg tgccaggaga tcagaatttt gccaacacgt tacatgtccg
     4801 ggacagagct tgtgccaaaa tctggacgat ggctatgagt gtgttacgaa caccacattt
     4861 actgggcagg aacgcagtcc tctcgccttc ttctacttcc aggagcagca atctgacgat
     4921 attgtcagtg aagcgtctcc caagcaaact cttaaaccgg taattgatat tgccttccga
     4981 ctcgtgctgg aggtactctg cctgtacata gacaatgtgg acggattctt tgaaattgga
     5041 gtaaacggag gacgagtgac catcacctgg aagctaagtg cactgcattt tggcgagtct
     5101 gcacgctttg agaaggagaa cactgacgga gaatggagtc gcatttacct cagggcacac
     5161 aacagcaaac tggagggtgg atggaaggga tgggaatcga tggtggatcc gacgccagct
     5221 ttttccacgg acatcgacca agcggccttc cagtccctga tcgctacaag tacccaggtt
     5281 tacttgggtg gcatgccaga gtcgcgacaa gcacgaggat ccactttgtc tgcccagcag
     5341 ggctctcagt tcaagggctg tgtgggagag gcaaggctcg gcgatctttt actgccctac
     5401 ttctccatgg cggaactgta ttcgcgcacc aatgtttcag tacagcaaaa ggctcaattc
     5461 cgtctaaatg ccacacgacc tgaggagggc tgcatcctgt gcttccagtc agactgcaaa
     5521 aatgacggct tctgtcaatc tccttcagat gagtacgcct gcacctgtca gcctggattc
     5581 gagggcgatg attgcggcac ggacatcgac gagtgtctta acacggaatg cttgaacaac
     5641 ggcacctgca tcaaccaggt tgcagctttc ttctgccagt gtcagccagg attcgagggt
     5701 cagcactgtg agcaaaacat cgacgagtgt gcggatcagc cgtgccacaa cggtggcaac
     5761 tgcacggatc ttatcgcatc gtacgtgtgc gactgccctg aggactatat gggcccgcag
     5821 tgcgacgtgc tgaagcaaat gacctgcgag aacgagccat gtcggaatgg atcaacctgc
     5881 cagaatggat tcaatgcttc tactggcaat aactttacat gtacatgcgt gcccggcttc
     5941 gagggtccac tgtgtgacat acccttctgt gaaataacgc cttgcgataa cggtggcctc
     6001 tgcctgacca ctggagcggt accgatgtgc aaatgtagtc tgggatacac tggtcgcctg
     6061 tgcgagcagg acattaacga gtgcgaatcg aatccatgcc agaacggtgg tcagtgtaag
     6121 gacctcgtcg gcaggtacga gtgcgattgc agggcacgga ttcgagggat tcgctgtgaa
     6181 aatgacatcg acgagtgcaa catggaggga gattactgcg gcggattggg ccggtgtttc
     6241 aacaagcccg gatccttcca gtgcatctgc cagaaaccct attgcggagc ctactgcaac
     6301 tttacggatc cctgcaacgc tacggacctc tgtctaaacg gcggtcgctg cgtagagtcc
     6361 tgcggcgcca aaccggacta ctactgcgag tgtccggaag gtttcgcggg aaagaattgc
     6421 acagcaccga ttacggccaa ggaggacggg ccttcgacca cagacattgc catcattgta
     6481 atacccgtag tggtggtgct gctgctgatc gcgggagccc tcctgggcac cttcctggtg
     6541 atggccagga acaagcgagc aaccaggggc acctatagcc cgagcgtcgt caagagtact
     6601 gcaacccacg gctggaaatg gacaacgtac tgaagccacc gccggaagag cgactaattt
     6661 agttttgagt tttgagcatg aacgacgatt agcaaagcaa acaaaagata tttttaaatc
     6721 cgcccatata cacctagctg taggagtaac tcaatgtttt gtactaagtt ggcccctagt
     6781 tatggtttac atcttaaggt gctcaaagca atagcagcag cagcagcagc caggcttcca
     6841 cctcgactcg tcccttatcc cctgcctacg ctttagttag ttaataatgc cgttgtctat
     6901 ttattctagt agttagatga catacgtacc gccctatagt cgttatgtag ttatgttccc
     6961 gatagtttag attccgtatt cgatttctcg tatatgtaat cctaaagctg cgaagaaact
     7021 tgagctccga actgtcgatt tcccctttgc attccacaag gaagcccatc aaatgtattg
     7081 tatgagtatt tttgtaaata agattgtttg ccgactctta aagtaaatta gctgagctaa
     7141 aatcgaaatt aaaaacagaa aatcctatat gcctatatat cgcgagtata tatcagtaaa
     7201 tgtggcctta taacaaaaaa aaaaaa
//
LOCUS       ECOINCRNA     263 bp ss-RNA             BCT       29-JUN-1990
DEFINITION  E.coli antisense Inc RNA.
ACCESSION   M33752
KEYWORDS    antisense RNA.
SOURCE      E.coli clinical isolate RNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 263)
  AUTHORS   Lopez,X.X. and Rodriguez,J.C.
  TITLE     Isolation and nucleotide sequence of an incompatibility gene from
            the incFv1 plasmid pSU401: comparison to other IncF genes
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Rodriguez, 17-APR-1990.  Univ de Cantabria, Depto Biologia
            Molecular, c/Cardenal Herrera Oria s/n, 39011 Santander, SPAIN
FEATURES       from  to/span     description
    signal      179      185     -10 promoter
    signal      205      211     -35 promoter
BASE COUNT       70 a     54 c     71 g     68 t
ORIGIN      
        1 gatcgtttaa ggaattttgt ggctggccac gccgtaaggt ggcaaggaac tggttctgat
       61 gtggatgtac aggagccaga aaagcgaaaa ccccgataat cttctctaac tttggcgagt
      121 gcagaaagat taccggggcc atctaaaaac cgtatagcca acaattcagc tatgcgggga
      181 gtatagttat atgcccggaa aagttcaaga cttctttctg tgctcgctcc ttctgcgcat
      241 tgtaagtgca ggatggtgtg act
//
LOCUS       FLASL287HA   1095 bp ss-RNA             VRL       29-JUN-1990
DEFINITION  Influenza A/SL/2/87, A(H1N1) virus hemagglutinin (seg 4) RNA, 5'
            end.
ACCESSION   M33748
KEYWORDS    hemagglutinin.
SOURCE      Influenza type A/SL/2/87 passed in rhesus monkey kidney cells.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1095)
  AUTHORS   Rajakumar,A., Swierkosz,E.M. and Schulze,I.T.
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
REFERENCE   2  (bases 33 to 1095)
  AUTHORS   Rajakumar,A., Swierkosz,E.M. and Schulze,I.T.
  TITLE     Sequence of an influenza virus hemagglutinin determined directly
            from a clinical sample
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.T.Schulze 17-APR-1990.  St. Louis School of Med, Dept of
            Microbiology, 1402 S Grand Blvd St. Louis Missouri 63104
FEATURES       from  to/span     description
    pept         33  >  1095     hemagglutinin precursor protein
    sigp         33       83     signal peptide
    matp         84     1061     hemagglutinin 1 (HA1)
    matp       1066  >  1095     hemagglutinin 2 (HA2)
BASE COUNT      381 a    233 c    237 g    244 t
ORIGIN      
        1 agcaaaagca ggggaaaata aaaacaacca aaatgaaagc aaaactacta gtcctgttat
       61 gtgcatttac agctacagat gcagacacaa tatgtatagg ctaccatgcg aacaactcaa
      121 ccgacactgt tgacacagta cttgaaaaga acgtgacagt gacacactct gtcaacctac
      181 ttgaggacag tcacaacgga aaactatgtc gactaaaagg aatagcccca ctacaattgg
      241 gtaattgcag tattgccgga tggattttag gaaacccaga atgcgaatca ctgttttcta
      301 agaaatcatg gtcctacatt gcagaaacac caaactccga gaatggaaca tgttacccag
      361 ggtatttcgc tgactatgag gaactgaggg agcaattgag ttcagtatca tcattcgaga
      421 gattcgaaat attccccaaa gaaagctcat ggcccaacca caccgtaacc aaaggagtaa
      481 cggcatcatg ctcccataag gggaaaagca gtttttacag aaatttgcta tggctgacgg
      541 agaagaatgg ctcgtaccca aatctgagca agtcctatgt gaacaacaaa gagaaagaag
      601 tccttgtact atggggtgtt catcacccgt ctaacatagg ggaccaaagg gccatctatc
      661 atacagaaaa tgcttatgtc tctgtagtgt cttcacatta taacaggaga ttcaccccag
      721 aaatagcaaa aagacccaaa gtaagagatc aagaaggaag aattaactac tactggactc
      781 tgctggaacc cgaggacaca ataatatttg aggcaaatgg agatctaata gcgccatggt
      841 atgcttttgc actgagtaga ggctttgggt caggaatcat cacctcaaac gcatcaatgg
      901 atgaatgtga cgcgaagtgt caaacacccc agggagctat aaacagtagt cttcctttcc
      961 agaatgtaca cccagtcaca ataggagagt gcccaaagta tgtcaggagt acaaaattaa
     1021 gaatggttac aggactaagg aacatcccat ccattcaatc cagaggtttg tttggagcca
     1081 ttgccggttt cattg
//
LOCUS       TREKPMDA      999 bp ds-DNA             ORG       29-JUN-1990
DEFINITION  T.evansi kinetoplast DNA minicircle.
ACCESSION   M33751
KEYWORDS    .
SOURCE      T.evansi bloodstream forms DNA, clone AnTat 3/3.
  ORGANISM  Kinetoplast Trypanosoma evansi
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Trypanosoma evansi.
REFERENCE   1  (bases 1 to 999)
  AUTHORS   Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
            Steinert,M. and Hamers,R.
  TITLE     Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by E.Bajyana Songa 17-APR-1990.
FEATURES       from  to/span     description
    site        367      516     kinetoplast minicircle common region (ORI)
BASE COUNT      391 a     78 c    203 g    327 t
ORIGIN      
        1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
       61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
      121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
      181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
      241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
      301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
      361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
      421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
      481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
      541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
      601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
      661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
      721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
      781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
      841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
      901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
      961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS       RATCASPAT    1535 bp ss-mRNA            ROD       29-JUN-1990
DEFINITION  Rat cytosolic aspartate aminotransferase mRNA, complete cds.
ACCESSION   D00252
KEYWORDS    cytosolic aspartate aminotransferase.
SOURCE      Rat (strain Wistar), cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1535)
  AUTHORS   Horio,Y., Tanaka,T., Taketoshi,M., Nagashima,F., Tanase,S.,
            Morino,Y. and Wada,H.
  TITLE     Rat cytosolic aspartate aminotransferase: Molecular cloning of cDNA
            and expression in Escherichia coli
  JOURNAL   J. Biochem. 103, 797-804 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Wada, 07-SEP-1989.
FEATURES       from  to/span     description
    pept         43     1284     cytosolic aspartate aminotransferase (EC
                                 2.6.1.1)
    mRNA          1     1535     cytosolic aspartate aminotransferase mRNA
BASE COUNT      360 a    409 c    410 g    356 t
ORIGIN      
        1 agctcccagc tcgtgctcga ggtcccctca gattccatcg cgatggcccc tccatcattc
       61 tttgcccagg ttccacaggc cccgccggtt ctggtcttta agctcattgc ggacttccgg
      121 gatgatcccg atccccgcaa ggttaacctc ggcgtgggag cgtaccgcac agatgactct
      181 cagccctggg ttttgccagt agtgaggaag gtcgaacaga agattgctaa cgaccacagt
      241 ctcaaccacg agtacttgcc catcctgggc ctggcggagt tccggagctg tgcttctcag
      301 ctagtacttg gggacaacag cccagctctc agggagaatc gggttggggg tgtgcagtct
      361 ttgggaggga ccggtgcact tcgaattgga gctgacttct tagggcgatg gtacattggc
      421 acagacaaca agaacacgcc cgtctacgta tcatcgccga cctgggagaa ccataatggc
      481 gtgttttctg ccgctggttt taaagacatt cggtcctatc gctactggga tgcagagaag
      541 agaggacttg atctccaggg tttcctgaat gatctggaga atgctcctga gttctccatc
      601 tttgtcctcc acgcctgtgc acacaaccca acggggaccg acccaactga agaggagtgg
      661 aagcagatcg ccgccgtcat gaagcgccgt tttctgttcc ccttctttga ctcagcctat
      721 cagggctttg catctggaga cctagagaaa gatgcctggg ctattcgcta ttttgtgtct
      781 gaaggcttcg agctcttctg tgcccagtcc ttctccaaga acttcgggct ctacaatgag
      841 agagtgggga atctgaccgt ggtcggaaaa gagcatgaca gcgtcctgcg ggtcctttcc
      901 cagatggaga agattgtacg aatcacctgg tccaatcccc ctgcccaggg agctcggatc
      961 gtggccacca ccctctccaa ccctgagctc tttaaggagt ggaaaggaaa cgtgaagaca
     1021 atggctgacc ggattctgac catgagatcc gaactcaggg cgcgactaga agctctcaag
     1081 actcccggga cttggtctca catcactgag cagattggaa tgttcagctt tactgggttc
     1141 aaccccaagc aggtcgagta tttggtcaac gagaagcaca tctatctgat gccgagcggt
     1201 cggatcaaca tgtgcggctt gaccaccaag aacctagatt atgtggctac ctccatcaat
     1261 gaagctgtca ccaaattcca gtgaagaaac accgagtagt tcataccacc aaagcagttc
     1321 ctgtcacagc tttcctgcct gcgcaaacct agccgtacat gttgtttatt agagatgacc
     1381 accatgggga ggcagccgct gtttagctgg ccccacaaga gaagacattt cttgaacatg
     1441 aacctgggtc gggtgggggg atgactgggg ttagggcctt ttggaaacca gagcagatta
     1501 aagttattta agaataaaaa aacccgttga tatgc
//
LOCUS       RATGL5        599 bp ds-DNA             ROD       29-JUN-1990
DEFINITION  Rat NICER element (GL5-14) 3' long terminal repeat.
ACCESSION   M33534
KEYWORDS    long terminal repeat.
SOURCE      Rat (strain NEDH) liver NGS-TC12 induced pheochromocytoma cell line
            DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 599)
  AUTHORS   Cho,K.-O., Minsk,B. and Wagner,J.A.
  TITLE     NICER elements: A family of nerve growth factor inducible
            cAMP-extinguishable retrovirus-like elements
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3778-3782 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Wagner, 03-APR-1990.
FEATURES       from  to/span     description
    rpt          21      575     3' viral long terminal repeat
    recomb      575      576     viral DNA end/rat DNA start
    signal      345      349     CAAT box
    signal      402      406     TATA box
    signal      481      486     Poly-A signal
BASE COUNT      174 a    154 c    139 g    132 t
ORIGIN      
        1 tggtagaaga agaggggaat ggaaaggaaa attatacaga tttaaggttt aaaaatatga
       61 agttaaaaga gtatgtttca actcaggact aaacactgtg aaaagcaagt ccagacagcc
      121 ccgccctgcc gctagaacta acagaccata aaaggaaagg aatgcagaac agaccaggag
      181 taccggacct gactcacagg ccacctggca ggaagagata agcccccagc ccccgacatc
      241 caggacgccc caaagctgcc aatgtgtgta gctataactt attacctcat catgtgaaat
      301 agccaatcat atgtgaacat gtctttgtgc ctcgtttgaa tccaccaatc cccgtaacta
      361 tgcatctgct tctgtacgcc cacttctgct tccccaatcc ctataaaatc cccatgctgg
      421 agctgctggg tgcgcaagtc ctccgaagag tctgtgtgcc cgcaggtacc tgtgttttcc
      481 aataaaccct cttgctgatg gcatcgcagt ggactcggct cggtcattgg ggcttggact
      541 cctcctgagg gaaggtcctc tccgggggtc tttcagaaga agagaattaa cagaccaaa
//
LOCUS       RATGL5A       599 bp ds-DNA             ROD       29-JUN-1990
DEFINITION  Rat NICER element (GL5-14) 5' long terminal repeat.
ACCESSION   M33535
KEYWORDS    long terminal repeat.
SOURCE      Rat (strain NEDH, isolate GL4-14) liver NGS-TC12 induced
            pheochromocytoma cell line DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 599)
  AUTHORS   Cho,K.-O., Minsk,B. and Wagner,J.A.
  TITLE     NICER elements: A family of nerve growth factor inducible
            cAMP-extinguishable retrovirus-like elements
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3778-3782 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Wagner, 03-APR-1990.
FEATURES       from  to/span     description
    recomb       18       19     rat DNA end/viral DNA start
    rpt          19      584     5' viral long terminal repeat
    signal      344      348     CAAT box
    signal      401      405     TATA box
    mRNA        439  >   599     mRNA
    signal      479      485     Poly-A signal
    binding     579      586     primer binding site
BASE COUNT      160 a    158 c    138 g    143 t
ORIGIN      
        1 ttattatgtg tccttgaagg gaaaggaaaa ttatacagat ttaaggttta aaaatatgaa
       61 gttaaaagag tatgtttcaa ctcaggacta aacactgtga aaagcaagtc cagacagccc
      121 cgccctgccg ctagaactaa cagaccataa aaggaaagga atgcagaaca gaccaggagt
      181 accggacctg actcacaggc cacctggcag gaagagataa gcccccagcc tccgacatcc
      241 aggacgcccc aaagctgcca atgtgtgtag ctatacctta ttacctcatc atgtgaaata
      301 gccaatcata tgtgaacatg tctttgtgcc tcgtttgaat ccaccaatcc ccgtaactat
      361 gcatctgctt ctgtacgccc acttctgctt ccccaatccc tataaaagcc ccatgctgga
      421 gctgctgggt gcgcaagtcc tccgaagaga ctgtgtgccc gaaggtacct gtgttttcca
      481 ataaaccctc ttgctgattg catcgagtgg actcggctcg gtcattgggc gcttggactc
      541 ctcctgaggg aaaggtcctc tccgggggtc tttcattatg gggctcgccc cgggatttg
//
LOCUS       XELHMGCOA    2970 bp ss-mRNA            VRT       29-JUN-1990
DEFINITION  X.laevis 3-hydroxy-3-methylglutaryl-coenzyme A reductase mRNA,
            complete cds.
ACCESSION   M29258
KEYWORDS    3-hydroxy-3-methylglutaryl-coenzyme A reductase.
SOURCE      X.laevis liver, cDNA to mRNA, clone 2.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2970)
  AUTHORS   Chen,H. and Shapiro,D.J.
  TITLE     Nucleotide sequence and estrogen induction of Xenopus laevis
            3-hydroxy-3-methylglutaryl-coenzyme A reductase
  JOURNAL   J. Biol. Chem. 265, 4622-4629 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Chen, 20-OCT-1989.
FEATURES       from  to/span     description
    pept        115     2766     3-hydroxy-3-methylglutaryl-coenzyme A
                                 reductase
    mRNA    <     1     2970     3-hydroxy-3-methylglutaryl-coenzyme A
                                 reductase
BASE COUNT      821 a    676 c    714 g    759 t
ORIGIN      
        1 ggcgggaggt agaacatctg tgtgtgtcat ctccgtatcc ttgtggtatt tctctgccgc
       61 cgccgggtca gtcaccagcc agcatcggcc aaagatatcc ttaatagagc caacatgctg
      121 tccagactat ttcgcatgca tgggcagttt gtggcctctc atccctggga ggtcatcgtt
      181 gggactgtga ctctcacaat ctgcatgatg tcaatgaaca tgttcactgg gaatgacaaa
      241 atctgtggct ggaattatgc ttgccccaag tttgaagaag atgttctgag cagtgacatc
      301 atcatcctga caatcacaag atgcatcgcc atcctttaca tttatttcca gttccaaaac
      361 ctcagacagc tcggctcaaa atacatttta gggattgctg ggcttttcac catcttctca
      421 agctttgttt ttagcacagt ggttattcac ttcctggata aagaactgac aggcctaaac
      481 gaagctcttc catttttcct gcttttgatc gatctttcca aggctagtgc tttagccaaa
      541 tttgctctaa gttccaactc acaggatgaa gtccgagaca atattgcccg tggcatggca
      601 attctaggac ctacctttac ccttgaggcg cttgtagaat gtcttgtgat tggagtcggt
      661 acaatgtcag gtgtgcggca gttagaaatc atgtgttgct tcggctgcat gtctgtcctt
      721 gccaattatt ttgccttcat gaccttcttt ccagcttgtg tttcattggt gttagagctt
      781 tctagagaga gccgagaggg ccgtccaatc tggcagctca gtcagtttgc tagtgtacta
      841 gaagaagagg aggataataa acccaatcct gttactcaga gagtgaaaat gattatgtct
      901 ttgggtttgg tccttgttca tgctcatagt cggtggataa gtgaaccctc ttcccaaaac
      961 agcacctcaa tcagtgacca tgaagttacc accatgctgg atgatatgat gccaaagaga
     1021 gttgagccta gcatgcctct ctggcagttc tatctttcta gaatggtgac catggatgta
     1081 gaacagatca ttactcttgg tcttgctctc ctccttgccg tgaagtacat attctttgag
     1141 caaacagaga cagaatcaac cttctctatg aagaacccga taatctcccc cgtagcggtg
     1201 caaaagaagc aaatagagag ttgctgcaga cgggaacctg aacaggaaaa aactgtccat
     1261 gtttccacaa cagaagaagc ttcatcaaag gaggaaactg aggcagttat aaagccatta
     1321 ccccttgaga catcgcccaa agccaagttt atagtgggag attcttctcc actggaattg
     1381 tccccagagg ataaaaatac catgtttgat ctaccagagg aaccccgtcc acttgatgaa
     1441 tgtgttcgta tactcaagaa tccagataaa ggcgcacagt atctgactga tgcagaagtc
     1501 attagcctgg tgaatgccaa acacatccca gcctacaaac tggaaaccat gatggaatct
     1561 ccccgagagg gcgtggccat tcgccggcaa atgttgtctg acaagctccc ccagcgatca
     1621 gcactgcaga gcttgcctta taagaactac aactactctt tagtaatggg tgcttgttgt
     1681 gagaatgtca tcgggtacat gcccatccct gtaggtgtag ctggacctct cttgctgaac
     1741 aacaaagagt atcaggtgcc catggcaaca actgaaggat gtcttgtggc cagcacaaac
     1801 agaggttgca gagccataat gctgggagga ggtgccaaaa gtcgtgttct ggctgatggg
     1861 atgactcgtg gaccggttgt taggttacca actgcctgtg atgcagcaga agtaaaggcc
     1921 tggctagaca gcgcagaggg attcaaagtc ataaaagatg catttgatag cacaagcaga
     1981 tttgcacgac tcgggagact acaaaactgc gttgctggca ggaacctgta catccgtttc
     2041 cagtcaaaaa caggagatgc catgggaatg aacatgattt cgaaggtcac tgaacaagct
     2101 cttgctagac tacaagaaga gttccctgat ctccatgttt tggctgtgag cggcaactac
     2161 tgtacggata agaagccagc tgctattaac tggatagagg gaagaggaaa gtcagtggtt
     2221 tgtgaggcca tcattccagc caaagttgta cgggaggttc taaagagctc aacagaagcg
     2281 ttagttgaag tgaatatcaa taagaatttt attggttctg caatggcggg aagcatcgga
     2341 gggtacaatg cacatgctgc aaacattgtg actgccattt acattgcttg cggacaggat
     2401 gctgctcaga atgttgggag ctccaattgc atcacaataa tggaggctac cggtcccacg
     2461 tatgaagatt tgtacatcag ctgcacaatg ccctctatag aaattggaac tgtaggaggt
     2521 ggaactaact tggctcccca gcaagcttgc ttacagatgc tgggagtgca aggagcaagc
     2581 acagagacac cagggaaaaa cgcctgccag cttgctcaga tagtgtgtag cacagttatg
     2641 gccggggaac tgtcactaat ggcggcttta gcagcgggtc acctcgtcaa gagtcacatg
     2701 gtccacaaca ggtcaaaaat taaccttcag gacctgccag gcacgtgcac aaagaaagct
     2761 gcctgattgg ccgtaggaag agccgaatct taaaatgaac ttctctgaga aacggtgctg
     2821 tatctcatca caacacaagg tggcgccaca tctgcctgag aaatggaatt gcctcatcta
     2881 catatggatt cccaggcccg agcttcacaa tcaccatgtt aatccagtct tttattgatg
     2941 atggcatctt aaaaagaaac ttacataatt
//
LOCUS       HUMPFKMIX    1139 bp ss-mRNA            PRI       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM) alternatively spliced mRNA,
            partial cds.
ACCESSION   J05533
KEYWORDS    phosphofructokinase.
SOURCE      Human (34 week old) placenta, cDNA to mRNA, clone HPFKM1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 62; 1110 to 1139)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1139)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >  1139     muscle phosphofructokinase (AA at 3) (EC
                                 2.7.1.11) /nomgen="PFKM" /map="1cen-q32"
                                 /hgml_locus_uid="LL0081L"
BASE COUNT      285 a    262 c    337 g    255 t
ORIGIN      
        1 gaaaaccaat cacctcagaa gacatcaaga atggcagcag gatgggtgtg gaagcagtga
       61 tggcactttt ggaggggacc ccagataccc cagcctgtgt agtgagcctc tctggtaacc
      121 aggctgtgcg cctgcccctc atggaatgtg tccaggtgac caaagatgtg accaaggcca
      181 tggatgagaa gaaatttgac gaagccctga agctgagagg ccggagcttc atgaacaact
      241 gggaggtgta caagcttcta gctcatgtca gacccccggt atctaagagt ggttcgcaca
      301 cagtggctgt gatgaacgtg ggggctccgg ctgcaggcat gaatgctgct gttcgctcca
      361 ctgtgaggat tggccttatc cagggcaacc gagtgctcgt tgtccatgat ggtttcgagg
      421 gcctggccaa ggggcagata gaggaagctg gctggagcta tgttgggggc tggactggcc
      481 aaggtggctc taaacttggg actaaaagga ctctacccaa gaagagcttt gaacagatca
      541 gtgccaatat aactaagttt aacattcagg gccttgtcat cattgggggc tttgaggctt
      601 acacaggggg cctggaactg atggagggca ggaagcagtt tgatgagctc tgcatcccat
      661 ttgtggtcat tcctgctaca gtctccaaca atgtccctgg ctcagacttc agcgttgggg
      721 ctgacacagc actcaatact atctgcacaa cctgtgaccg catcaagcag tcagcagctg
      781 gcaccaagcg tcgggtgttt atcattgaga ctatgggtgg ctactgtggc tacctggcta
      841 ccatggctgg actggcagct ggggccgatg ctgcctacat ttttgaggag cccttcacca
      901 ttcgagacct gcaggcaaat gttgaacatc tggtgcaaaa gatgaaaaca actgtgaaaa
      961 ggggcttggt gttaaggaat gaaaagtgca atgagaacta taccactgac ttcattttca
     1021 acctgtactc tgaggagggg aagggcatct tcgacagcag gaagaatgtg cttggtcaca
     1081 tgcagcaggg tgggagccca accccatttg ataggaattt tgccactaag atgggcgcc
//
LOCUS       J05525         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05525
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence.  McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the same
            accession numbers.
BASE COUNT        4 a      8 c      3 g      5 t
ORIGIN      
        1 ctcatctcag atacctggcc
//
LOCUS       J05526         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05526
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence.  McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the same
            accession numbers.
BASE COUNT        5 a      6 c      5 g      4 t
ORIGIN      
        1 actcagcgag gtacttgcac
//
LOCUS       J05527         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05527
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence.  McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the same
            accession numbers.
BASE COUNT        7 a      8 c      3 g      2 t
ORIGIN      
        1 tctcaaccag acaaggaccc
//
LOCUS       J05528         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05528
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence.  McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the same
            accession numbers.
BASE COUNT        7 a      3 c      4 g      6 t
ORIGIN      
        1 catcaagaat gttcgtatga
//
LOCUS       J05529         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05529
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence.  McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the same
            accession numbers.
BASE COUNT        2 a      5 c      6 g      7 t
ORIGIN      
        1 gtcccttcag ctggtggtta
//
LOCUS       J05530         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05530
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence.  McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the same
            accession numbers.
BASE COUNT        6 a      4 c      5 g      5 t
ORIGIN      
        1 cagaattctg gtaagtcacg
//
LOCUS       J05531         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05531
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence.  McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the same
            accession numbers.
BASE COUNT        4 a      6 c      8 g      2 t
ORIGIN      
        1 gctcctgcag ggcagcagga
//
LOCUS       J05532         20 bp ds-DNA             BAD       29-JUN-1990
DEFINITION  Human muscle phosphofructokinase (PFKM).
ACCESSION   J05532
REFERENCE   1  (bases 1 to 20)
  AUTHORS   Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A.
  TITLE     Alternative splicing of the transcript encoding the human muscle
            isoenzyme of phosphofructokinase
  JOURNAL   J. Biol. Chem. 265, 9006-9010 (1990)
  STANDARD  simple staff_entry
COMMENT     Intron/exon boundary sequence.  McLachlan promises to send us full
            exon sequence before it is published in August 1990 using the same
            accession numbers.
BASE COUNT        4 a      3 c      7 g      6 t
ORIGIN      
        1 tgtccaggtg gtaagtactg
//

GenBank-Updates@genbank.bio.net (06/30/90)

LOCUS       DROGNBPSA1    441 bp ds-DNA             INV       30-JUN-1990
DEFINITION  D.melanogaster guanine nucleotide-binding protein (Gs) alpha
            subunit gene, exon 1.
ACCESSION   M33996
KEYWORDS    guanine nucleotide-binding protein.
SEGMENT     1 of 3
SOURCE      D.melanogaster (strain Canton S) embryo DNA, clones
            lambda-DGs-[4,5,7,10,11,12].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 441)
  AUTHORS   Quan,F. and Forte,M.A.
  TITLE     Two forms of Drosophila melanogaster Gs-alpha are produced by
            alternate splicing involving an unusual splice site
  JOURNAL   Mol. Cell. Biol. 10, 910-917 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pre-msg     362  >   441     guanine nucleotide-binding protein (Gs-alpha)
                                 mRNA and introns (alt.)
    pre-msg     174  >   441     Gs-alpha mRNA and introns (alt.)
    IVS         426  >   441     intron A
    signal      152      157     promoter region (put.)
    variant     365      365     c in DNA, g in cDNA
BASE COUNT      140 a     89 c     85 g    127 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcgcta ttctacatag ctcgggatac tttattttat tattaatatt cgctaaaata
       61 gcacatcgtc atttggcata gatgtttttg gcccagtaac atatctatct acatcaccgc
      121 ccgcactatc cgataggcag actgactatc gataaaagta tctacttcgt tgcatcccta
      181 ggttgataca taaaataaaa aatatataaa aagtgagtta gtactggtat taaaagtaat
      241 ttttaagcta caaattgatg aattgcgagc acacagagac taagaacatt gggaatccgg
      301 aaaacagtgg ctggcgcgtc gggtgcgata gaattgcctg tttttcagac cccaaaagtc
      361 gctcccaacc gagtcgtgca cctatagtct tggcccgaag ttgctcccat gaataaaaat
      421 gttcggtaat ttcacggtaa a
//
LOCUS       DROGNBPSA2    897 bp ds-DNA             INV       30-JUN-1990
DEFINITION  D.melanogaster guanine nucleotide-binding protein (Gs) alpha
            subunit gene, exons 2,3 and 4.
ACCESSION   M33997
KEYWORDS    guanine nucleotide-binding protein.
SEGMENT     2 of 3
SOURCE      D.melanogaster (strain Canton S) embryo DNA, clones
            lambda-DGs-[4,5,7,10,11,12].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 897)
  AUTHORS   Quan,F. and Forte,M.A.
  TITLE     Two forms of Drosophila melanogaster Gs-alpha are produced by
            alternate splicing involving an unusual splice site
  JOURNAL   Mol. Cell. Biol. 10, 910-917 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        265      540     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 2
                607      703     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 3
                760  +   880     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 4
    pep$        265      540     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 2
                607      703     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 3
                760  +   880     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 4
    pre-msg <     1  >   897     Gs-alpha mRNA and introns
    IVS     <     1       29     intron A
    IVS         541      606     intron B
    IVS         704      759     intron C
    IVS         881  >   897     intron D
BASE COUNT      266 a    225 c    226 g    180 t
ORIGIN      About 1.4 kbp downstream of segment 1.
        1 ccatctacta taccattgta cgaatgcagc ctgctgcaga gatccactga agcgctgagc
       61 caacgccaac ggtggtgcca ctgagatcgg agatcggaga tcgcagaccg gagaccgacg
      121 agcacacaag ggattgtggt ttgatccgag gaacccaagc actcagctat actggagcag
      181 aacggagaaa gcacaagggg tgccgagtcg cccgacgcga accaggaaaa ccgagcgacg
      241 tgttggtgga ccccgtgggc tgcgatgggt tgctttgggt cgcccacctc caagcagtcg
      301 gacgtgaact cggaggactc gaagagccag aagcgccgga gcgatgcaat atctagacag
      361 ttgcagaagg acaaacagct ctacagggcc acacacaggc tgctcctcct gggggcgggc
      421 gagtccggca aatcaaccat agtcaagcaa atgcgaatat tgcatgtcga cggattttct
      481 gactcggaaa agaaacagaa aattgatgat attaaaaaga atattcgaga cgctatcttg
      541 gtgagttcca tcccgccggc catctaagcc caaacacttg cttaactcct tgccaatccc
      601 ttacagacta ttacaggagc catgagcaca cttaatccac ctgtagcttt agaaaagaag
      661 gaaaatgaac ccagagtgga gtacattcag gattatgcat ctagtaagag cctctgccgc
      721 cccaacttaa gttgttatca accattatct gccccgcagg tccggacttt aattatcctc
      781 ctgaatttta tgaacataca gaagaactat ggaaagacaa gggcgttctt caaacctatg
      841 agaggtcgaa tgagtatcaa ttaatcgatt gtgcgaaata gtaagtaatg cccaccc
//
LOCUS       DROGNBPSA3   1380 bp ds-DNA             INV       30-JUN-1990
DEFINITION  D.melanogaster guanine nucleotide-binding protein (Gs) alpha
            subunit gene, exons 5,6,7,8 and 9.
ACCESSION   M33998
KEYWORDS    guanine nucleotide-binding protein.
SEGMENT     3 of 3
SOURCE      D.melanogaster (strain Canton S) embryo DNA, clones
            lambda-DGs-[4,5,7,10,11,12].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1380)
  AUTHORS   Quan,F. and Forte,M.A.
  TITLE     Two forms of Drosophila melanogaster Gs-alpha are produced by
            alternate splicing involving an unusual splice site
  JOURNAL   Mol. Cell. Biol. 10, 910-917 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +     4      132     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 5
                206      385     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 6
                445      569     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 7
                746      828     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 8
                909     1055     guanine nucleotide-binding protein
                                 (Gs-alpha-S), exon 9
    pep$    +     4      132     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 5
                206      385     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 6
                445      569     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 7
                755      828     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 8
                909     1055     guanine nucleotide-binding protein
                                 (Gs-alpha-L), exon 9
    pre-msg <     1     1115     Gs-alpha mRNA and introns (alt.)
    pre-msg <     1     1176     Gs-alpha mRNA and introns (alt.)
    pre-msg <     1     1243     Gs-alpha mRNA and introns (alt.)
    IVS     <     1        3     intron D
    IVS         133      205     intron E
    IVS         386      444     intron F
    IVS         570      745     intron G (for Gs-alpha-S) (no splice
                                 consensus)
    IVS         570      754     intron G' (for Gs-alpha-L)
    IVS         829      908     intron H
    variant     780      780     t in DNA, c in cDNA
    signal     1083     1088     polyA signal
    signal     1154     1159     polyA signal
    signal     1221     1226     polyA signal
BASE COUNT      441 a    265 c    254 g    420 t
ORIGIN      About 500 bp downstream of segment 2.
        1 aagcttcctg gaccgagtga gcacaatcaa gaatccaaac tacaccccta atgagcagga
       61 tattcttcgg tgccgtgttt tgacttctgg aatatttgaa acaagatttc aagtggacaa
      121 agtaaacttt cagtaagtgt aatcccaagt gatatcgcaa tattttagtt tctattccat
      181 tctaatgata tgcattcgtg tgcagcatgt tcgatgtcgg tggccagcgg gacgagcgta
      241 ggaaatggat tcagtgtttc aatgatgtaa ctgctatcat attcgtaact gcgtgctcaa
      301 gttataacat ggttttgcgg gaagatccca cccagaaccg acttcgagaa tctttggatt
      361 tgttcaagag tatttggaac aacaggtgaa ccgagcaatt tggcgccatc tcgacaaatg
      421 gaaactaacg agttgttctt tcagatggct tcgcacgatt tctattatac tatttttaaa
      481 taagcaagat ttgttagcag agaaaattaa ggctggaaaa agtaaattgt cggaatattt
      541 ctccgagttt aacaaatacc aaacgccaag taagtaaaat agtataaccc atagtagagg
      601 ccaattgagg gcgagggagt catcagatca accgactcag atctggccaa gaaattcgca
      661 attccttttc aaatttgcaa accgtgccca agttaactta agaaatttaa tggataatgt
      721 attttgtcta atttcaaatg tgctgtcgac acaggtgacg caataatgga atccaatgat
      781 gacccagaag taatacgagc aaaatatttc atacgagacg agtttctggt gggttatatc
      841 acatctccgt ttcttcctcc acctatctaa ctagccttaa tccttaatgg gatctctttc
      901 ttatatagcg tatatctacc gctagcggag acggaaaaca ctactgctat ccacatttca
      961 catgcgccgt tgacacagaa aacattaaac gtgtgtttaa tgattgcaga gacattattc
     1021 aaaggatgca ccttcgtcaa tatgaattgt tataggttat ccccatcgcc gtaatgcaag
     1081 taaataaaaa tattaatgac ttactgtttt aaatataatt atcaaatgta atttgtattt
     1141 aaatagtccc ttgattaaaa caaaaaattc gctgcattta gatctggtat gacctttcac
     1201 aaaactcaac attaaagaat aataaataaa catgttaata acagacacgc ttgtttcgat
     1261 tattttggct attctaccag gatgcacctg ctttattttc tatagtccct tgacatgaag
     1321 cattttagag tatttccgct atcagagtcc gtatccatat ccatgggcat ttacattcac
//
LOCUS       SYCSBEFA      850 bp ds-DNA             BCT       30-JUN-1990
DEFINITION  Synechocystis sp. cytochrome B559 alpha subunit (psbE) and beta
            subunit (psbF), and photosystem II L protein genes, complete cds.
ACCESSION   M33897 X06988
KEYWORDS    cytochrome B559 alpha subunit; cytochrome B559 beta subunit;
            photosystem II protein L.
SOURCE      Synechocystis sp. (strain PCC6803) DNA, clone pKW1261.
  ORGANISM  Synechocystis sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Chroococcales.
REFERENCE   1  (bases 1 to 553)
  AUTHORS   Pakrasi,H.B., Williams,J.G.K. and Arntzen,C.J.
  TITLE     Targeted mutagenesis of the psbE and psbF genes blocks
            photosynthetic electron transport: Evidence for a functional role
            of cytochrome B559 in photosystem II
  JOURNAL   EMBO J. 7, 325-332 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 850)
  AUTHORS   Pakrasi,H.B., Nyhus,K.J. and Granok,H.
  TITLE     Targeted deletion mutagenesis of the beta subunit of Cytochrome
            B559 protein destabilizes the reaction center of photosystem II
  JOURNAL   Z. Naturforsch., C, Biosci. 45, 423-429 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.B.Pakrasi, 01-MAY-1990.
               Washington University Dept. of Biology
               Campus Box 1137
               St. Louis, MO 63130
FEATURES       from  to/span     description
    pept        121      366     cytochrome B559 alpha-subunit (psbE)
    pept        403      537     cytochrome B559 beta subunit (psbF)
    pept        547      666     L protein of photosystem II (PS2)
    binding     109      112     ribosomal binding site (put.)
    binding     392      396     ribosomal binding site (put.)
BASE COUNT      197 a    196 c    197 g    260 t
ORIGIN      
        1 gctaccccag caccgaggtg gcaatggttc cctaggcggc tcacaaaata gtagactaga
       61 ctctacttgc tttgcatttg tcagtcaatg ttgttttgaa aaattgaagg agaacacaaa
      121 atgtcaggga ctaccggcga gcgtccattt tccgatattg tcaccagcat tcgctactgg
      181 gtgatccaca gcatcaccat cccgatgttg tttattgctg gttggttgtt tgtcagcacg
      241 ggcttagcct acgatgcttt tggcactccc cgccccgatg aatatttcac ccagacccgt
      301 caagagttgc ccattctcca ggaacgctac gacattaatc aggaaattca agagtttaat
      361 caataaaaca tttaattgtt cttttttagt tggtaattaa caatggcaac ccaaaatcct
      421 aatcaaccgg ttacttatcc catttttacg gtgcgctggc tggcggttca caccctggcg
      481 gtgccctctg tcttctttgt cggggcgatc gccgcgatgc aatttattca acgctaggag
      541 tttttcatgg acagaaattc aaacccaaac cgccaaccgg tggaattgaa ccgcacttct
      601 ttatacctgg gtctattgtt ggtggctgtg ttggggattt tgttctccag ctatttcttt
      661 aactaaactt ttttaatacg caatttagga ggcatggtat gttcgcagaa ggcagaatcc
      721 ctttgtgggt ggtgggtgta gtggccggta ttggcgccat tggtgttcta ggattatttt
      781 tctacggagc ctatgctggt ttaggttctt ccatgtaatc gagggctagc cgccacacaa
      841 tatcatggtt
//
LOCUS       MUSBFGFR     2526 bp ss-mRNA            ROD       30-JUN-1990
DEFINITION  Mouse basic fibroblast growth factor receptor (bFGF-R) mRNA,
            complete cds.
ACCESSION   M28998
KEYWORDS    fibroblast growth factor receptor; transmembrane protein;
            tyrosine kinase.
SOURCE      Mouse (strain CBA/CaH), cDNA to mRNA, clone NE2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2526)
  AUTHORS   Reid,H.H., Wilks,A.F. and Bernard,O.
  TITLE     Two forms of basic fibroblast growth factor receptor-like mRNA are
            expressed in the developing mouse brain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 1596-1600 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.Bernard, 11-OCT-1989.
FEATURES       from  to/span     description
    pept         58     2526     fibroblast growth factor-receptor precursor
    sigp         58      117     fibroblast growth factor-receptor signal
                                 peptide (put.)
    matp        118     2523     fibroblast growth factor-receptor
BASE COUNT      579 a    710 c    708 g    529 t
ORIGIN      
        1 gtggaatatc catggaggta cggagccttg ttaccaacct ctaaccgcag aactgggatg
       61 tggggctgga agtgcctcct cttctgggct gtgctggtca cagccactct ctgcactgcc
      121 aggccagccc caaccttgcc cgaacaagct cagccctggg gagtccctgt ggaagtggag
      181 tctctcctgg tccaccctgg cgacctgcta cagcttcgct gtcggcttcg cgatgatgtg
      241 cagagcatca actggctgcg ggatggggtg cagctggtgg agagcaaccg tacccgcatc
      301 acaggggagg aggtggaggt gcgggactcc atccccgctg actctggcct ctacgcttgc
      361 gtgaccagca gcccctctgg cagcgatacc acctacttct ccgtcaatgt ctcagatgca
      421 ctcccatcct cggaagatga tgacgacgac gatgactcct cctcggagga gaaagagacg
      481 gacaacacca aaccaaaccg taggcctgta gctccctact ggacatcccc agagaaaatg
      541 gagaagaaac tgcatgcggt gcccgctgcc aagacggtga agttcaagtg cccgtcgagt
      601 gggacaccca accccactct gcgctggttg aaaaatggca aagagtttaa gcctgaccac
      661 cgaattggag gctacaaggt tcgctatgcc acctggagca tcataatgga ttctgtggtg
      721 ccttctgaca agggcaacta cacctgcatc gtggagaatg agtatgggag catcaaccac
      781 acctaccagc ttgacgtcgt ggaacgatct ccgcaccgac cccatccttc agcagggctg
      841 cctgccaaca agacagtggc cctgggcagc aatgtggagt tcatgtgtaa ggtgtacagc
      901 gatccgcagc ctcacattca gtggctgaag cacatcgagg tgaacgggag taagatcggg
      961 ccagacaact tgccgtatgt ccagatcctg aagactgctg gagttaatac caccgacaag
     1021 gaaatggagg tgcttcatct acggaatgtc tcctttgagg atgcggggga gtatacgtgc
     1081 ttggcgggta actctatcgg actctcccat cactctgcat ggttgaccgt tctggaagcc
     1141 ctggaagaga gaccagctgt gatgacctca ccgctctacc tggagatcat tatctactgc
     1201 accggggcct tcctgatctc ctgcatgttg ggctctgtca tcatctataa gatgaagagc
     1261 ggcaccaaga agagcgactt ccatagccag atggctgtgc acaagctggc caagagcatc
     1321 cctctgcgca gacaggtaac agtgtcagct gactccagtg catccatgaa ctctggggtt
     1381 ctcctggttc ggccctcacg gctctcctcc agcgggaccc ccatgctggc tggagtctcc
     1441 gaatatgagc tccctgagga tccccgctgg gagctgccac gagacagact ggtcttaggc
     1501 aaaccacttg gcgagggctg cttcgggcag gtggtgttgg ctgaggccat cgggctggat
     1561 aaggacaaac ccaaccgtgt gaccaaagtg gccgtgaaga tgttgaagtc cgacgcaacg
     1621 gagaaggacc tgtcggatct gatctcggag atggagatga tgaaaatgat tgggaagcac
     1681 aagaatatca tcaaccttct gggagcgtgc acacaggatg gtcctcttta tgtcattgtg
     1741 gagtacgcct ccaaaggcaa tctccgggag tatctacagg cccggaggcc tcctgggctg
     1801 gagtactgct ataaccccag ccacaacccc gaggaacagc tgtcttccaa agatctggta
     1861 tcctgtgcct atcaggtggc tcggggcatg gagtatcttg cctctaagaa gtgtatacac
     1921 cgagacctgg ctgctaggaa cgtcctggtg accgaggata acgtaatgaa gatcgcagac
     1981 tttggcttag ctcgagacat tcatcatatc gactactaca agaaaaccac caacggccgg
     2041 ctgcctgtga agtggatggc ccctgaggcg ttgtttgacc ggatctacac acaccagagc
     2101 gatgtgtggt cttttggagt gctcttgtgg gagatcttca ctctgggtgg ctccccatac
     2161 cccggtgtgc ctgtggagga acttttcaag ctgctgaagg agggtcatcg aatggacaag
     2221 cccagtaact gtaccaatga gctgtacatg atgatgcggg actgctggca tgcagtgccc
     2281 tctcagagac ctacgttcaa gcagttggtg gaagacctgg accacattgt ggccttgacc
     2341 tccaaccagg agtatctgga cctgtccata ccgctggacc agtactcacc cagctttccc
     2401 gacacacgga gctccacctg ctcctcaggg gaggactctg tcttctctca tgagccgtta
     2461 cctgaggagc cctgtctgcc tcgacacccc acccagcttg ccaacagtgg actcaaacgg
     2521 cgctga
//
LOCUS       BSSBIODAYB   4382 bp ds-DNA             BCT       30-JUN-1990
DEFINITION  B.sphaericus bioDAYB operon genes, complete cds.
ACCESSION   M29292
KEYWORDS    adenosylmethionine-8-amino-7-oxononanoate aminotransferase; bioB;
            biotin synthase; dethiobiotin synthase.
SOURCE      B.sphaericus (strain IF03525) DNA.
  ORGANISM  Bacillus sphaericus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 456 to 511)
  AUTHORS   Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S.,
            Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U.
  TITLE     Cloning and characterization of the Bacillus sphaericus genes
            controlling the bioconversion of pimelate into dethiobiotin
  JOURNAL   Gene 87, 63-70 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 4382)
  AUTHORS   Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S.,
            Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Goleckler, 20-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept        509     1213     dethiobiotin synthase (bioD) (ttg start codon)
    pept       1194     2561     adenosylmethionine-8-amino-7-oxononanoate
                                 aminotransferase (bioA) (gtg start codon)
    pept       2566     3156     biotin synthase (bioY)
    pept       3156     4154     bioB protein (gtg start codon)
    mRNA    <   456     4364     bioDAYB mRNA
    binding     491      503     ribosome binding site
    binding    1179     1186     ribosome binding site
    binding    2545     2556     ribosome binding site
    binding    4317     4364     ribosome binding site
BASE COUNT     1376 a    746 c    952 g   1308 t
ORIGIN      
        1 aagctttgca cacttctgtt tcgtatcctc atattgaact tgatgaaacc ttcctatggc
       61 cgtatgcatt gagatttttt ctcgatgttc tgcttgcaat gttcgatatt cttcttgccg
      121 aatagctaca cgataccaaa attcataacg caacggtaaa tctcttattt cgtaagtaag
      181 caaagtattt aaaatactgc tcatttgttc atatgtatct agctttttat ctgtctcctt
      241 aaatagtcca aacattttgc caccccctgt tttgattaat actacaacct atgataaaaa
      301 ccctttaata tttcttggga aataatccaa cgttgataaa acggggtgaa tatccgatca
      361 atcgagtgaa atttaggata gaataccctc ggaaaaagca ttatctgaat catttatgta
      421 aaaatgcaaa aaaaggcatt tacaaaagga aaaagaatgt gttaacttaa aaactatagt
      481 tggttaacta aaagaggggg aggtacagtt gcaacacttt tgggttgttg gaacagatac
      541 agatgttgga aaaacatttg tcaccacatt attaatgcgt aatttgcaaa aacagggcgt
      601 acgtgtaacg ccttataaac cagtccaaac tggtgaagtg tatgatggtg aacaagccta
      661 ttacttcgac acagcgatgt atgaaaaata ttccttgcaa ttgctagaca gagagaattt
      721 aaatggctat tcatttaaag aggctgcatc gccacatttt gcggctcaac tggaggggca
      781 gcaaattgac acacagcagt tattaaagca aatgcaactt ttacagcaaa catgggatgt
      841 tgttatttgt gaaggagcgg gtgggctctt tgtgccatta gatgcatgtg gcgaaacgac
      901 attgttggat gtcattgttg aaagtaaact acccgttgtc gtggttacac gaacagcact
      961 aggaacaatt aaccatacgc tcttaacgtt agaggcattg actacacgga aaattgaagt
     1021 gcttggtctt gtatttaacg gtgatatggg gagcaggatg gagcaagaca atatccaaac
     1081 gattttacag tattatacat tgccctatat gacgatacca aagctggaag agctgtcgga
     1141 cattaatgag tatgcaatta cgggcacatc attgtttgaa aggctgatta gacgtgaaac
     1201 aagtattaac tgagctacaa gaaaaagatt tacaacatgt ctggcatcct tgctcacaaa
     1261 tgaaagatta tgaggctttt ccaccaatcg ttataaaaaa aggcgaaggt gtatggctgt
     1321 atgatgaaca gaatcaacgc tatcttgatg cggtatcttc atggtgggtc aatttatttg
     1381 gacatgccaa tccacgtatt agccaagcat taagtgaaca agcatttacg ttggagcata
     1441 caatttttgc gaatttttca catgagccag cgattaaact cgcacaaaaa ttagtagctt
     1501 taacaccaca aagtttacaa aaagtatttt ttgcagataa tggttcatct gctatagaag
     1561 tcgctttaaa aatgagtttt caatatcata tgcaaacggg gaaaacgcaa aaaaaacgct
     1621 ttttggcatt aacggatgcc taccatggtg aaacattagg tgctttatcc gtcggtggcg
     1681 tagatcttta taacgaagtg tatcaaccac tgttattgga tacggtacga gcacaaggcc
     1741 cagattgttt ccgttgccca ttcaagcatc atccggatag ttgccatgcc caatgtatta
     1801 gttttgtaga ggatcagttg cgcatgcatc ataaggaaat tacggcggtt attattgagc
     1861 cactcattca agcggcagca gggatgaaaa tgtatccagc tatttatttg cgacgtttac
     1921 gtgaactatg tacgcaatat gatgtgcatc taattgcaga cgaaattgct gtaggttttg
     1981 ggcgcacagg tacacttttt gcctgtgagc aggctaatat ctctccggat tttatgtgtt
     2041 tatcaaaagg tttaacaggt gggtatttac cactgtctgt cgtaatgacg acgaatgatg
     2101 tatatcaggc attttatgat gattatgcca cgatgaaggc gtttttacat tcacatagtt
     2161 acacagggaa tacacttgcc tgccgtgttg ctctagaggt attggcgata tttgaagaag
     2221 aacagtatat agacgttgtg caagacaaag gtgaacgcat gcgaaagcta gccttggagg
     2281 cttttagtga tttacctttt gttggtgaat atcggcaagt tgggtttgtc ggggcgattg
     2341 aacttgtggc gaatcgcgat accaaagagc cattaccgag tgaggagcgc atcggctatc
     2401 aaatatacaa aagagcttta gcaaaagggt tactgattcg tccacttggg aatgttttgt
     2461 atttcatgcc accatacatt ataacggacg atgaaatgca atttatgatt caaacaacaa
     2521 aagatacaat tgttcaattt tttgaagagc gggagggatg agggcatgtt gaaacaacag
     2581 tcaacgttat cacttgtgat gattgcgatg tttgctgcat taacagcagt tggtgccttc
     2641 attaaaattc cattaccgct cgtgccgttt acattacaaa ttgtctttgt ctttttagcg
     2701 ggttgcttac tcggtggtcg caatggattt caaagtcagc tagtttacat aggaataggt
     2761 ttagttggct tgccagtttt tacacaaggt ggaggcatta catatgtatt gcagccgact
     2821 tttggttact taataggatt tgctcttgct gcattagtaa tcggctatat gattgatcga
     2881 gtagaatcac caacgaaaaa gcatttcatt gttgccaata ttatagggct tatcattatt
     2941 tatgcagtcg cagtacctta tttatatgta gcattaaatg tatggttaaa catgaaatca
     3001 agttggtctc atgtattttt agtaggcttt gtcaatagta ttgttgcaga cttttgctta
     3061 gcaattgctt ctgccctttt agctgaacgt ctatacaaag tattccgttc cgctagagct
     3121 ataaaacttg tgcaaattga aaaggagaat gtttagtgaa ttggttacaa ttagcagatg
     3181 aagtgattgc aggcaaggta attagcgatg atgaggcact tgccatttta aatagtgatg
     3241 atgatgatat tttaaagcta atggacggcg catttgccat tcgtaagcac tattacggta
     3301 aaaaagtaaa gttaaatatg attatgaatg ctaaaagtgg ctattgccca gaggattgtg
     3361 gctattgctc gcagtcatct aaatcgaccg ctcctattga gaaatatccg ttcattacaa
     3421 aagaagaaat attagcgggg gcaaagcgtg cgtttgaaaa taaaattggt acgtattgca
     3481 tcgtcgcaag cggacgtggg ccgactcgta aagatgtcaa tgtagtgagt gaagccgttg
     3541 aagaaattaa agcaaaatat ggcttaaaag tttgcgcttg cttaggttta ctaaaagaag
     3601 aacaagcaca acaattaaaa gaagcgggtg ttgatcgcta caatcataac ttaaatacat
     3661 cagagcgtca ccattcctat attacgacga cgcacacata tgaggatcgt gttaataccg
     3721 ttgaggttgt aaagaaacat ggtatttccc catgttctgg agccattatt gggatgaaag
     3781 aaacgaaaat ggatgtcgtg gaaattgcac gcgcattgca tcagttggac gcggattcaa
     3841 ttccagttaa cttcttacat gcaattgatg gaacgaaact tgaaggaaca caggacttaa
     3901 atcctcgcta ttgcttaaaa gtattagcgt tattccgcta catgaatcct tcgaaggaaa
     3961 ttagaatttc cggtggtcgc gaagtcaatt taggattcct tcagccattt ggactgtatg
     4021 cagcaaatag tatttttgtt ggggattact taactactga aggacaagaa gccaatagcg
     4081 attatcgtat gcttgaagat ttgggctttg aaatcgagct gacacaaaag caagaagaag
     4141 cattttgttc ttaattcaac caatcattat gaaataaaat ctactactac acaatatgat
     4201 tacctcaaaa ccgtgtgagc gtcgtggaaa aggcgcacag acggtttttt ggtcgataaa
     4261 agagaaggag aaaggtaaat aaatggttcc gataatatac ctataaaatg atggttttca
     4321 caaaatgttc aatgaaagcg ttttgaaatt gaacagtttg tgaagggctt cacataaagc
     4381 tt
//
LOCUS       BSSBIOXWF    4530 bp ds-DNA             BCT       30-JUN-1990
DEFINITION  B.sphaericus bioXWF operon genes, complete cds.
ACCESSION   M29291
KEYWORDS    7-keto-8-aminopelargonic acid synthetase; bioF; bioW; bioX.
SOURCE      B.sphaericus (strain IF03525) DNA.
  ORGANISM  Bacillus sphaericus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1081 to 1151)
  AUTHORS   Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S.,
            Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U.
  TITLE     Cloning and characterization of the Bacillus sphaericus genes
            controlling the bioconversion of pimelate into dethiobiotin
  JOURNAL   Gene 87, 63-70 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 4530)
  AUTHORS   Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S.,
            Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Goleckler, 20-OCT-1989, for release after publication.
FEATURES       from  to/span     description
    pept       1149     1649     bioX protein
    pept       1630     2367     bioW protein
    pept       2364     3533     7-keto-8-aminopelargonic acid synthetase
                                 (bioF)
    mRNA    <  1081     3560     bioXWF mRNA
    binding    1132     1140     ribosome binding site
    binding    1612     1621     ribosome binding site
    binding    2350     2355     ribosome binding site
BASE COUNT     1426 a    768 c    958 g   1378 t
ORIGIN      
        1 aagctttact acaaccagga ggatgtagtt taggcgctgt tttaccatct aaaacaagtg
       61 atggtaaaac ttatgtcctt agaaattacg atttgtcacc agctatatct gatatgagac
      121 tttgtacaac taaggtaaaa gggaaatata gtcatactgg attctctgtt tcgtactttg
      181 gtcgaagtga aggcttaaat gaagagggat tctgtgtagc ttttgcctca tgcggtattc
      241 cagttggaaa acatccagga atgaaaaagc ctatcctaaa aggacttcag tttatggtaa
      301 tagtaagagc tttacttgaa aattgtaaag atgttgagga aggaattact tatttagaga
      361 atatgcctat tggaacaaat atgaatttgc tactgtcaga tgccaaaggg aatgtagctt
      421 tagtagaaac gtatgatgga gaaaagttcg tagaaagggg caatcaaaaa tcaggatttt
      481 tgattgcaac taatcacgct gtaatgccca aaataatgaa gttagaagac agaaaattag
      541 aacaatctga aatacgttat aattttttga aaaataactt agagagtgat gattttttta
      601 caaagaataa attacaacaa ttaatgttta atgaataccc taatggtgta acagtgcaca
      661 atttcgaaga aaatttctga actgttcatt caattctatt taatcttaac gacaaacaat
      721 tagatatttc tttcggttcc cctataaata ataaaatcta taaattaaaa gtaggagaaa
      781 ccttgccttt taatgaatta gaagttttca tagagaactc aaactatgga ccaaacttct
      841 ggaagctaat caaataataa tatggccaaa gaaacacttt cagctgcatg atgagccgag
      901 agtgtttttt tatcattcct ctattcaata ataaatcgct tatttcacct tcttggaggt
      961 ttgaaatcct cttgtttcat aggtttctgg cgttttcctt tttgacgaag cactgttaaa
     1021 gctctatcct tttttgaaaa aaatatagaa aaacaatttt aatctacctt cctatctata
     1081 aatgtgttaa cttaattatt attaaggtta actcaaattg aagaagttag aatgggagga
     1141 ataggaggat gcgaaagttt tctacatatg atcttgctca gatttcatta ctagcttgtc
     1201 ttattatcgt tacaggcatg tttaagattc caacaggtat tcctggatct gagtttcaat
     1261 tatcagcacc gattgccgtt gcgattgcag cagtatttgg atttaagcga tattttcttg
     1321 cgggaatcat tgcaagtcta atcttatttt tactaggtat acactccatc ttaaatgttg
     1381 aaatttcaat aattttccga ttgactgttg gtctaatcat tgttttatta ggaacttcaa
     1441 ttccggtact agttgtggca ggaccgattg gaacaatggt tgctagactt ggattggctt
     1501 ttacgttagg gaccccgttt ttgccactat tcgttttggc gattccaggg atggtcatta
     1561 cggctgtcag tgtttatcca ataacgaaaa tgttatatgc aattaataag aaagtagcag
     1621 gtgatcatca tgttagaaac gtgttatagc attcgaatgc gtgcagctga aaaaaatctc
     1681 gaaggaggag aaaagcatat atctggtggg gaacggatag ggagtgaatt tcaaatagag
     1741 ccaattgtaa aacagttatt gaacaaagca aggaatcatt cgcgcggaga tgctgacttt
     1801 attcaaatta ccgttgaaaa acttacaggt gatcagatac tgtatatgcc accgttagaa
     1861 ataacgacaa ttgatgagag ttcaattgaa agggcacata aagaagctag gagtatatta
     1921 acctcagtag gtgtttccaa gcaggcacaa aatgttgctt ttcatctact tgctagtaat
     1981 caaaatcttc gtggggctat cctccttcat agtcaaactg gcttacgact tgacaatcgc
     2041 ggactgaaag gcgttcgtgt atcacgaatc gattggcaag acgctgatgt aggttacaat
     2101 gagcgtgttc gtgaagcgct agctctggca acgaaagtgg caaattctcc gtataccatc
     2161 gcagaattat gttggtcaga tgatccagaa tacgttactg gctatgtaag caatcatgag
     2221 attggttatg tcagaattac gcctttaaaa agggaaggct gtgaaagtgg cggacgtatt
     2281 ttttttgtgt cagatgaagt tgagctagaa tcatatatac actatttaga aagagaacct
     2341 attctcatta gggggcattt aaaatgaatg atcgctttcg aagggaactg caagtaatag
     2401 aagagcaagg attgacaagg aagttacgtt tgttttcaac tggaaatgaa agtgaggtag
     2461 tgatgaatgg taagaaattt ttgctatttt catcgaataa ctacttaggc cttgcaacag
     2521 atagtcgttt gaaaaagaaa gcaactgaag gcattagtaa atacggtaca ggggctggcg
     2581 gttctcgact tacaactgga aacttcgaca ttcatgaaca gctagaatct gaaattgcag
     2641 attttaaaaa gactgaagcg gccattgtat tcagcagtgg gtatttagcg aacgtaggtg
     2701 tgatttcgag cgtgatgaag gcaggagata ctatcttttc tgatgcttgg aatcacgcga
     2761 gtattataga tggttgtcga ttaagtaaag ccaaaacgat tgtttatgaa catgcggata
     2821 tggtggattt agagcggaaa ttaaggcaat cacatgggga tggattgaag ttcatcgtaa
     2881 cggatggcgt ttttagtatg gatggtgata ttgcgccact tccaaaaata gtagagttag
     2941 ccaaggaata caaagcgtac ataatgattg atgatgcgca tgcaacaggt gttcttggca
     3001 atgatggttg tggtaccgct gattattttg gtttgaaaga tgagattgat tttacagtag
     3061 gcacgttgag taaagcgatt ggtgcagagg gtggatttgt atcgacatca tccattgcta
     3121 agaactattt gttaaataac gcccgatctt ttattttcca aacagcttta tcgccaagtg
     3181 cgattgaagc agcgcgagaa ggcatttcca tcatacagaa tgagcccgag cggagaaagc
     3241 aattgctgaa aaatgcgcag tacttacgat tgaaattaga ggaatctggt tttgtaatga
     3301 aagaagggga aacacctatt atttctctta tcattggtgg ttctcatgaa gccatgcagt
     3361 tttctgcgaa actactggat gaaggtgtct ttattccagc gattcgacca ccaacagtgc
     3421 cgaaagggtc aagtcggttg cgtataacgg taatggctac acatacaata gagcagctcg
     3481 atatggtcat tagtaaaatt aagaaaatag gaaaagaaat ggggattgta taattgtttg
     3541 agtgcctggc actcaaacaa ttttttgatt gctttttctt attaattaca tagcactaac
     3601 cttcaagttc tgcaattgcc tctaattcaa ttaagcagcc atggcttaat tctttaattg
     3661 ggataatggc gcgggcaggc ttatgactac cgaaaaattg tgcataagtt tcatttacgt
     3721 ccgcccataa atcagcagat gttatataga cgcgacacat aacaacagcc tccttcgtaa
     3781 gaccgcttgc ttgaagcacg acttccatct tttgaagcgc catgaatgtt tcagcgtgaa
     3841 tgccaccagt tggaggtttg cctgtagctg gatcgactga agtttgcccc gaaatatata
     3901 aattcccgtt actaatcatg cctggtgtat aatgcccgat tggctcttgt tctatcaaaa
     3961 ctttcttcat aataagttcc tccttataaa cgttgtaatt gatttagccc aatattgcca
     4021 cctgaaataa ggaagcaaac atggtcgttt ttcgtaatcg gtaaattgcc tcgaagagct
     4081 gctcctatcc cgatggcaga ggagggttca gctaatattt tgccttctag tagcaatgtt
     4141 ttcatcccat taagtatgga agcctcgtct acacgcacaa tgtcatcaac taatttttca
     4201 acaatcggga agtttcgctc cctaggttgt aaagttaata gagcatccgc taacgattgt
     4261 tgttcagcta atagtaatct tttaccaact tgaatacttt ctgaatagcg agggacgacg
     4321 gctggttcaa caccaaaaat tttaatatgt ggtgctaaag attttaccgc agttgcgaca
     4381 ccacttagca gaccaccacc accaatagga acaacaattg cacttacctg agagaactgt
     4441 tctaaaattt ccaatcctac tgtcccttgc cccgccataa tatcgtaatc atcataagga
     4501 tgcacaatgg cataaccatg ttctaagctt
//
LOCUS       HUMATPAS01   1458 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exons 2-5.
ACCESSION   M28284
KEYWORDS    adenosine triphosphatase.
SEGMENT     1 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 1458)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /    35      148     adenosine triphosphate alpha subunit, exon 2
                                 (EC 3.6.1.3) (AA at 35)
                308      442     adenosine triphosphate alpha subunit, exon 3
                876      993     adenosine triphosphate alpha subunit, exon 4
               1109  +  1377     adenosine triphosphate alpha subunit, exon 5
    IVS     <     1       34     adenosine triphosphate alpha subunit intron A
    IVS         149      307     adenosine triphosphate alpha subunit intron B
    IVS         443      875     adenosine triphosphate alpha subunit intron C
    IVS         994     1108     adenosine triphosphate alpha subunit intron D
    IVS        1378  >  1458     adenosine triphosphate alpha subunit intron E
BASE COUNT      267 a    493 c    429 g    269 t
ORIGIN      
        1 gtcgacggat caacatcccc acatctcccc acagctgtac ctgggcatcg tgctggcggc
       61 cgtggtgatc atcactggct gcttctccta ctaccaggag gccaagagct ccaagatcat
      121 ggagtccttc aagaacatgg tgccccaggt gaagggtgcc cagcaagggc cagagggggt
      181 gttagtgtat gggctggggc cgggcccagc aagggccaga gggggtgtta gtgtatgggc
      241 tggggccggg ccagtgaccc ccaggcagag ggagtctggg aggtgacatt actccatccc
      301 acctcagcaa gccctggtga tccgggaagg tgagaagatg caggtgaacg ctgaggaggt
      361 ggtggtcggg gacctggtgg agatcaaggg tggagaccga gtgccagctg acctgcggat
      421 catctcagcc cacggctgca aggtgggcct gtagggcccg gctttaccct ccctcgggct
      481 gcccaggagc tcaggcccag ccctcctccc tcacactcag gagtccaggt cccaacccct
      541 cctccctcag acccaggagt ccaggccccc aggccctcct ccctcagacc caggagttta
      601 ggccctgggc acctcctcac ccaacagtca ggagtccagg cccagccctc acctttcgga
      661 gatccttagg gaccctagac cttggccagc agcactgtgc cttccccacc ttcaagatcc
      721 cagcctctgg cctcccacac accactcacc cactgggcac ccaggcttct agctgtgatc
      781 tccaggcaca caggcttcag cccaagccct gtacacaaat accctcctgt cccaggccct
      841 ggactgaacc ctctctctgc tgcacctacc cccaggtgga caactcctcc ctgactggcg
      901 aatccgagcc ccagactcgc tctcccgact gcacgcacga caaccccttg gagactcgga
      961 acatcacctt cttttccacc aactgtgtgg aaggtgaggc gggtgcagag aagacacaca
     1021 gctgggcaga ccagggatgt gtcccagggg tcaggcctca gaacctccct gagccacccc
     1081 acctcagcct aacccctctg ggctgcaggc acggctcggg gcgtggtggt ggccacgggc
     1141 gaccgcactg tcatgggccg tatcgccacc ctggcatcag ggctggaggt gggcaagacg
     1201 cccatcgcca tcgagattga gcacttcatc cagctcatca ccggcgtggc tgtcttcctg
     1261 ggtgtctcct tcttcatcct ctccctcatt ctcggataca cctggcttga ggctgtcatc
     1321 ttcctcatcg gcatcatcgt ggccaatgtc ccagagggtc tgctggccac tgtcactgta
     1381 aggccaggct cctgggtctg ggtggggagg gcctgggggt ctgggctcct gggtctgagg
     1441 gaggaggggc tgggatcc
//
LOCUS       HUMATPAS02    200 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exon 6.
ACCESSION   M28285
KEYWORDS    adenosine triphosphatase.
SEGMENT     2 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 200)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    33  +   200     adenosine triphosphate alpha subunit, exon 6
                                 (EC 3.6.1.3)
    IVS     <     1       32     adenosine triphosphate alpha subunit intron E
BASE COUNT       44 a     65 c     52 g     39 t
ORIGIN      
        1 ccctgtaatt gcctgccttg ctcgtcctcc aggtgtgtct gaccgtgacc gccaagcgca
       61 tggcccggaa gaactgcctg gtgaagaacc tggaggctgt agagaccctg ggctccacgt
      121 ccaccatctg ctcagataag acagggaccc tcactcagaa ccgcatgaca gtcgccacca
      181 tgtggtttga caaccagatc
//
LOCUS       HUMATPAS03    200 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exon 7 (5' end).
ACCESSION   M28286
KEYWORDS    adenosine triphosphatase.
SEGMENT     3 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 200)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    67  /   200     adenosine triphosphate alpha subunit, exon 7
                                 (EC 3.6.1.3)
    IVS     <     1       66     adenosine triphosphate alpha subunit intron F
                                 (no splice consensus)
BASE COUNT       43 a     68 c     55 g     34 t
ORIGIN      
        1 ggccaccctc tgatcggtcc ccagctctcc atccatgaga ccgaggaccc caacgacaac
       61 cgatacctgc tggtgatgaa gggtgccccc gagcgcatcc tggaccgctg ctccaccatc
      121 ctgctacagg gcaaggagca gcctctggac gaggaaatga aggaggcctt tcagaatgcc
      181 taccttgagc tcggtggcct
//
LOCUS       HUMATPAS04    559 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exons 8 and 9.
ACCESSION   M28287
KEYWORDS    adenosine triphosphatase.
SEGMENT     4 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 559)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /   155      330     adenosine triphosphate alpha subunit, exon 8
                                 (EC 3.6.1.3) (AA at 157)
                401  +   537     adenosine triphosphate alpha subunit, exon 9
    IVS     <     1      154     adenosine triphosphate alpha subunit intron G
    IVS         331      400     adenosine triphosphate alpha subunit intron H
    IVS         538  >   559     adenosine triphosphate alpha subunit intron I
BASE COUNT      125 a    165 c    157 g    112 t
ORIGIN      
        1 acttaatata tatatagtag tatgaaggaa gtatcctaga atgaatgctg actgccgtct
       61 gctgatggga gatgaatgcg gcgatcagac atctaggcat gggtgcaggt tccgaggctg
      121 gaaccctcac accccaaccc ttccctgcca ctaggtttct gccattatta cctgcccgag
      181 gagcagtatc cccaaggctt tgccttcgac tgtgatgacg tgaacttcac cacggacaac
      241 ctctgctttg tgggcctcat gtccatgatc ggcccacccc gggcagccgt ccctgacgcg
      301 gtgggcaagt gtcgcagcgc aggcatcaag gtgtggcttg ggtgctgggg aggcaagcag
      361 gcgtggcgga gaggccatcc ctaaaaaaca atgcctgcag gtcatcatgg tcaccggcga
      421 tcaccccatc acggccaagg ccattgccaa gggtgtgggc atcatctctg agggcaacga
      481 gactgtggag gacatcgccg cccggctcaa cattcccgtc agccaggtta accccaggtg
      541 agccacccat tcccagcca
//
LOCUS       HUMATPAS05    323 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exon 10.
ACCESSION   M35821
KEYWORDS    adenosine triphosphatase.
SEGMENT     5 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 323)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    62  +   212     adenosine triphosphate alpha subunit, exon 10
                                 (EC 3.6.1.3)
    IVS     <     1       61     adenosine triphosphate alpha subunit intron I
    IVS         213  >   323     adenosine triphosphate alpha subunit intron J
BASE COUNT       59 a    106 c     97 g     61 t
ORIGIN      
        1 gatcactttg ccactcctca cacaccctga cctcagccat cgctctctct gctcttccca
       61 gggatgccaa ggcctgcgtg atccacggca ccgacctcaa ggacttcacc tccgagcaaa
      121 tcgacgagat cctgcagaat cacaccgaga tcgtcttcgc ccgcacatcc ccccagcaga
      181 agctcatcat tgtggagggc tgtcagagac aggtgggctg cgctcccgca gaggagggga
      241 cggggccttg actcctgggt cctcactgag gccggggctg gtttcctggg tctgagggag
      301 gaggggctgg gggtctgacc tgg
//
LOCUS       HUMATPAS06    428 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exon 11.
ACCESSION   M35822
KEYWORDS    adenosine triphosphatase.
SEGMENT     6 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 428)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +   147  +   315     adenosine triphosphate alpha subunit, exon 11
                                 (EC 3.6.1.3)
    IVS     <     1      146     adenosine triphosphate alpha subunit intron J
    IVS         316  >   427     adenosine triphosphate alpha subunit intron K
BASE COUNT       83 a    110 c    147 g     88 t
ORIGIN      
        1 tctagagagt ctgagcctaa ctcctagtct aggaagagag ttggacctaa cttctagtct
       61 aagagagggc tgggtcctgg ctttctggat ctgaggaggg actctgggga ctggcctctg
      121 gtgtcatcct taccctctct ccccagggtg caattgtggc tgtgaccggg gatggtgtga
      181 acgactcccc cgctctgaag aaggccgaca ttggggtggc catgggcatc gctggctctg
      241 acgtctccaa gcaggcagct gacatgatcc tgctggacga caactttgcc tccatcgtca
      301 caggggtgga ggagggtgag ttggccaggg gtggcctgga gaccaggtca ctaccggagc
      361 ctgagaccag caaggggaac tggccagggc tgcaggggga tgtgtggcag agaccaaggc
      421 ccctgcct
//
LOCUS       HUMATPAS07    939 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, intron J.
ACCESSION   M28288
KEYWORDS    adenosine triphosphatase.
SEGMENT     7 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 939)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    IVS     <     1  >   939     adenosine triphosphate alpha subunit intron K
BASE COUNT      287 a    197 c    250 g    205 t
ORIGIN      
        1 ggatcccgtc ctgagtgaca cgagccagac acaaaagcat tcttagcatc caactccaac
       61 tggagcaggc tcagcacccg tctctgggct aaaagtcagg ataatggtta tgtctaggag
      121 gagggatata gactggaaga gagcagacaa aacctaaatg gggggctcgg aatggttttc
      181 tatcttgact ggggtctggt ggttgcatgg ctgtgtacat gtgtaaaaat tgtatacacc
      241 ctctctggtt gaatatttta atgagatata atatgcccag ctcagtgtat gcatgccata
      301 tctcaataaa tattcaacca gtacaggttg tctaaagtaa acaaaataaa gattaatcca
      361 ggccaggcgg tggggctaca cctgtaatcc caagactttg gagaccgaag caggaggatt
      421 gcctgagatc aagagttcaa gaccagcctg gccaacatga tgaaacccca tctctactaa
      481 aaatacaaaa attagctggg cttggtggtg ggcacctgta accccagcta cttgggaggc
      541 tgaggcagga gaatcatttg aacctcagag gcagacgttg cagtgagccg caattgtgcc
      601 attgcatcca gcctgggcaa cagagcaaga ctccatctca aaaaaataaa ataaaataaa
      661 ataaacatta gtcaggcacg gtggctcgag cctgtggtcg cagctacttg ggaggctgag
      721 gtgggaggat cacttgagga ggtcgaggtt gcagtgactg atgattgcag cactgggcta
      781 ccagcctggg caacgaggtg gagaccttct ctctaaataa ataaataagt aaaaagatta
      841 gcccatgaac aatgaggtca atcattaaga aagaacaggc gggcacggtg gcaaacacct
      901 gtaatcccag cactttggga gcgctgaggc gggtggatc
//
LOCUS       HUMATPAS08   1619 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exons 12 and 13.
ACCESSION   M28289
KEYWORDS    adenosine triphosphatase.
SEGMENT     8 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 1619)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +  1069     1223     adenosine triphosphate alpha subunit, exon 12
                                 (EC 3.6.1.3)
               1301  +  1424     adenosine triphosphate alpha subunit, exon 13
    IVS     <     1     1068     adenosine triphosphate alpha subunit intron K
    IVS        1224     1300     adenosine triphosphate alpha subunit intron L
    IVS        1425  >  1619     adenosine triphosphate alpha subunit intron M
BASE COUNT      372 a    419 c    489 g    339 t
ORIGIN      
        1 ggatccaagg tccagctgtg gagttgtgtc tgtagctcgc cccttcttgc tgtgtagtat
       61 ttgactgcat ggatgtaccg tggtttattc atccctcacc tgttgatgga catggggttg
      121 actctggctt ttggctctga tgagtggtgc tatggtgaac attctcgtac ttgcttttcg
      181 ctgggcataa acagtcctcc tctcttttgg gtataaaacc taggagtcgg attactcggt
      241 cggtggggtg ggggatgttt gatagcaact ccgaaaccat cttccaaagt ggttatttca
      301 agaatgtttc cacaaaggcg atctggctgg atgtttatgt ggaaaggtag gtgggtgagg
      361 ggccaggtgt ggagacttat gcctgtaatc ccagcacttc gggaggctga gatggaatca
      421 cagcccagaa attagacaag cctaggaaaa gtggtgagac tctgtctcta ctaaaaatac
      481 aaaaattagc tggacgtggt ggcgcgtgac tgtagtctca gctacatggg aggctgaggc
      541 aggaggatca cttgagcctg ggagtcgagg ttgcagtgag ccgtgatcat gccactgcac
      601 tgcagcctgg gcaacagagt gagaccctgt cccaaaaaga aaataatggt gggtgggctg
      661 gggggtagaa aaggacgcag gccgggcgcg gtcgtcacgc ctgtaatccc aggcacttca
      721 ggaggccgag gagggcagat cacgaggtca ggagattgag accatgcagg ctaacacagt
      781 gaaaccccgt ctctaataaa aatacaaaaa aattagccgg gcgtggtggc aggtgcctgt
      841 agtgccagct actcgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt
      901 gcagtgagcc gagatcgcac cactgcactc cagcctgggc aacacagcga gactctgtct
      961 caaaaaaaaa aaaaaaaaaa aaaaggacgt tggatgaggg cagaggaggg gcagagggag
     1021 tggggctccc tggcatgggc gcctgctctg agcctgcctg tgccacaggc cgcctgatct
     1081 tcgacaacct aaagaagtcc attgcctaca ccctgaccag caatatcccg gagatcacgc
     1141 ccttcctgct gttcatcatg gccaacatcc cgctgcccct gggcaccatc accatcctct
     1201 gcatcgatct gggcactgac atggtgagcc ctggcagcca cccttggggc caggagggtg
     1261 gagtcctccc ctctccggct caccggcctc ctccgcctag gtccctgcca tctcactggc
     1321 gtacgaggct gccgaaagcg acatcatgaa gagacagccc aggaacccgc ggacggacaa
     1381 attggtcaat gagagactca tcagcatggc ctacgggcag attggtgagg caccggggac
     1441 tccatctcct taccaccatg ccggcctaga gcatgcctgg ccaccgtgcg tgcttgggac
     1501 cctggcattg actcagggga gcagacgtgg gcaggaccaa ccagtgagct atctgagggg
     1561 ggggtctgca ccccatcctt ctccacctcc tcctctctgc tgttgatgtg tgcggatcc
//
LOCUS       HUMATPAS09    276 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exon 14.
ACCESSION   M28290
KEYWORDS    adenosine triphosphatase.
SEGMENT     9 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 276)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    86  +   231     adenosine triphosphate alpha subunit, exon 14
                                 (EC 3.6.1.3)
    IVS     <     1       85     adenosine triphosphate alpha subunit intron M
    IVS         232  >   276     adenosine triphosphate alpha subunit intron N
BASE COUNT       46 a     83 c     81 g     66 t
ORIGIN      
        1 ccgggccctc tgaggtgccc tgggttaggc tgactggccc caatctgagc ctctgcctga
       61 ggacttacct ctctcctcct tccaggaatg atccaggctc tcggtggctt cttctcttac
      121 tttgtgatcc tggcagaaaa tggcttcttg cccggcaacc tggtgggcat ccggctgaac
      181 tgggatgacc gcaccgtcaa tgacctggaa gacagttacg ggcagcagtg ggtgagtagg
      241 gcagggtgct gtgtctctgc ccaccgtaag atcccc
//
LOCUS       HUMATPAS10    567 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exon 15.
ACCESSION   M28291
KEYWORDS    adenosine triphosphatase.
SEGMENT     10 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 567)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +   366  +   496     adenosine triphosphate alpha subunit, exon 15
                                 (EC 3.6.1.3)
    IVS     <     1      365     adenosine triphosphate alpha subunit intron N
    IVS         497  >   567     adenosine triphosphate alpha subunit intron O
BASE COUNT       95 a    179 c    161 g    132 t
ORIGIN      
        1 ctgatgccct tgccagccca tctgagcctt tgtcccactg ctgagctcgt ctgccctgcc
       61 agccttcttc cccacctctg tctgtccctt caaagcccag ttgctgtctg tctccttggg
      121 aggccacagg gtcttatcct cctccctggc ccctggtgga tgtgagatca ccattaactt
      181 tcttcttcct tgtacgtctc ctgactcctc cctctgggac tatgagcccg cagaaggaag
      241 acacacctga ggccctgagg accccatgca ggatgggcgg ggcaaagagc acggaacgtc
      301 ggggtggcgg ctagggctgc agtgccacta actgacgtcc cgtgccctgg tcaccgctcc
      361 tgcagacata cgagcagagg aaggtggtgg agttcacctg ccacacggcc ttctttgtga
      421 gcatcgttgt cgcccagtgg gccgatctga tcatctgcaa gacccggagg aactcggtct
      481 tccagcaggg catgaagtga gggccggggg acatggtgac tggacagcca tctgtctgtc
      541 cgagtcgtct gtctgtcgtg tgtactc
//
LOCUS       HUMATPAS11    212 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exon 16.
ACCESSION   M28292
KEYWORDS    adenosine triphosphatase.
SEGMENT     11 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    28  +   129     adenosine triphosphate alpha subunit, exon 16
                                 (EC 3.6.1.3)
    IVS     <     1       27     adenosine triphosphate alpha subunit intron O
    IVS         130  >   212     adenosine triphosphate alpha subunit intron P
BASE COUNT       33 a     81 c     53 g     45 t
ORIGIN      
        1 cccagccctg cccttctgtg cctccaggaa caagatcatg atcttcgggc tgtttgagga
       61 gacggccctg gctgccttcc tgtcctactg ccccggaatg gacgtggccc tgcgcatgta
      121 ccctctcaag tgagtgcccc gctgccccca gccctgccca caccagcgcc tgccatggag
      181 cctcttcctt agactaggtc tgaactaggc cc
//
LOCUS       HUMATPAS12   1047 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -activated adenosine triphosphatase alpha subunit
            gene, exon 17 and 18.
ACCESSION   M28293
KEYWORDS    adenosine triphosphatase.
SEGMENT     12 of 12
SOURCE      Human placenta DNA, clones lambda-NK-alpha[3-2,10-3].
REFERENCE   1  (bases 1 to 1047)
  AUTHORS   Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L.,
            Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E.,
            Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D.
  TITLE     The family of human Na+, K+ -ATPase genes: A partial nucleotide
            sequence related to the alpha-subunit
  JOURNAL   FEBS Lett. 213, 73-80 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +   101      192     adenosine triphosphate alpha subunit, exon 17
                                 (EC 3.6.1.3)
                474      502     adenosine triphosphate alpha subunit, exon 18
    IVS     <     1      100     adenosine triphosphate alpha subunit intron P
BASE COUNT      145 a    391 c    236 g    275 t
ORIGIN      
        1 ggatcctggg agactgcccc tctgcgtccc ctccagtccc ctgaaactct gcctctcgtt
       61 agggccccgc actcaagccc tcctgctctc ccctctgcag gcccagctgg tggttctgtg
      121 ccttccccta cagtttcctc atcttcgtct acgacgaaat ccgcaaactc atcctgcgca
      181 ggaacccagg gggtgaggga gctcggcaag gcagccgagg ggggcggggg gcagcagggt
      241 ctcaggcagc tggtcccagg ctcccctcgc cctgctggat gctctgccac ctggttccca
      301 ctcttctctc tcttcccatc tctccggcac ccactctgtc ttctcacggg tctctgtctg
      361 tgtggtttcc ttgtctctct ccctctctgt ccctctctct gctgggcggc tcgccttgcc
      421 tgtctctctc catctcttac tctgtctctt tctttccttc tttgtctctc caggttgggt
      481 ggagaaggaa acctactact gacctcagcc ccaccacatc gcccatctct tccccgtccg
      541 caggccagga ccgcccctgt cagtcccccc aattttgtat tctgggggga ggagccctct
      601 cttcctgtgg ccccaccttg gcccccaccc cctccactat ctcctgccgc ccccactctg
      661 gctggcttct ctcccctgcc ccaaacctct ctcctctctc ttttctgtgt cagtttctct
      721 ccctctcctc acccctctat ccattcctcc cgccccagcc acctccctgg gctctttttt
      781 actccccttc agccccccgg tgatgccatc tctggttctg gacaattatc aaatatatca
      841 gtggggagag agaagcggtg tgtgtgttgt gcctgctttc cagacgggga ctgcggctgg
      901 gacagcatcc cctctgttgg ggcgcgctgc caggcgactc tcgaaactgc aggaacttag
      961 cctggcactg gcttggaagt cacggaatct cagagccatc taatcacgga gttttcaact
     1021 ccacgaaagt cagagcagct tgaattc
//
LOCUS       HUMATPASE1    309 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 2, clone
            lambda-NK-alpha-RD-16.
ACCESSION   M27571
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SEGMENT     1 of 3
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 309)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /    21  +   289     Na+, K+ -ATPase alpha subunit, exon 2 (AA at
                                 23)
    IVS     <     1       20     Na+, K+ -ATPase alpha subunit intron A
    IVS         290  >   309     Na+, K+ -ATPase alpha subunit intron B
BASE COUNT       54 a     88 c     96 g     71 t
ORIGIN      
        1 gatgccccac catgttgcag gcactgccag gggcattgtg attgccacag gagaccggac
       61 ggtgatgggc cgcatagcta ctctcgcctc aggcctggag gttgggcgga cacccatagc
      121 aatggagatt gaacacttca tccagctgat cacaggggtc gctgtattcc tgggggtctc
      181 cttcttcgtg ctctccctca tcctgggcta cagctggctg gaggcagtca tcttcctcat
      241 cggcatcata gtggccaacg tgcctgaggg gcttctggcc actgtcactg tgagtgggtc
      301 aggctgagg
//
LOCUS       HUMATPASE2    239 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 3, clone
            lambda-NK-alpha-RD-16.
ACCESSION   M27576
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SEGMENT     2 of 3
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 239)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    21  +   219     Na+, K+ -ATPase alpha subunit, exon 3
    IVS     <     1       20     Na+, K+ -ATPase alpha subunit intron B
    IVS         220  >   239     Na+, K+ -ATPase alpha subunit intron C
BASE COUNT       53 a     76 c     66 g     44 t
ORIGIN      
        1 ttctcctctt tctctaccag gtgtgcctga ccctgacagc caagcgcatg gcacggaaga
       61 actgcctggt gaagaacctg gaggcggtgg agacgctggg ctccacgtcc accatctgct
      121 cggacaagac gggcaccctc acccagaacc gcatgaccgt cgcccacatg tggtttgaca
      181 accaaatcca tgaggctgac accaccgaag atcagtctgg tgattgggtg ctccagcgg
//
LOCUS       HUMATPASE3    150 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 4, clone
            lambda-NK-alpha-RD-16.
ACCESSION   M27578
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SEGMENT     3 of 3
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 150)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    21  /   130     Na+, K+ -ATPase alpha subunit, exon 4
    IVS     <     1       20     Na+, K+ -ATPase alpha subunit intron C
    IVS         131  >   150     Na+, K+ -ATPase alpha subunit intron D
BASE COUNT       31 a     49 c     38 g     32 t
ORIGIN      
        1 tccccctcat ttcctcccag gggccacttt tgacaaacga tcccctacgt ggacggccct
       61 gtctcgaatt gctggtctct gcaaccgcgc cgtcttcaag gcaggacagg agaacatctc
      121 cgtgtctaag gtagggggtc aggacacaca
//
LOCUS       HUMATPASA1    308 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 2, clone
            lambda-NK-alpha-TW-4.
ACCESSION   M27572
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SEGMENT     1 of 2
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-TW-4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 308)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /    21  /   288     Na+, K+ -ATPase alpha subunit, exon 2 (AA at
                                 22)
    IVS     <     1       20     Na+, K+ -ATPase alpha subunit intron A
    IVS         289  >   308     Na+, K+ -ATPase alpha subunit intron B
BASE COUNT       56 a     79 c     80 g     93 t
ORIGIN      
        1 tccctcccct tctttttaag gaccgcacgt ggtattgttg tctacactgg ggatcgcact
       61 gtgatgggaa gaattgccac acttgcttct gggctggaag gaggccagac ccccattgct
      121 gcagaaattg aacattttat ccacatcatc acgggtgtgg ctgtgttcct gggtgtgtct
      181 ttcttcatcc tttctctcat ccttgagtac acctggcttg aggctgtcat cttcctcatc
      241 ggtatcatcg tagccaatgt gccggaaggt ttgctggcca ctgtcacggt aagaggcagg
      301 tgatggtc
//
LOCUS       HUMATPASA2     95 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 4, 3' end, clone
            lambda-NK-alpha-TW-4.
ACCESSION   M27579
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SEGMENT     2 of 2
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-TW-4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 95)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  /    75     Na+, K+ -ATPase alpha subunit, exon 4 (AA at
                                 1)
    IVS          76  >    95     Na+, K+ -ATPase alpha subunit intron D
BASE COUNT       27 a     20 c     21 g     27 t
ORIGIN      
        1 gctctgtcca gaattgcagg tctttgtaac agggcagtgt ttcaggctaa ccaggaaaac
       61 ctacctattc ttaaggtatg ctcaagagtt aacta
//
LOCUS       HUMATPASEA    188 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 3 (5' end).
            clone lambda-NK-alpha-SW3.2.
ACCESSION   M27574
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-SW3.2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 188)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /    21  >   188     Na+, K+ -ATPase alpha subunit, exon 3 (AA at
                                 21)
    IVS     <     1       20     Na+, K+ -ATPase alpha subunit intron B
BASE COUNT       45 a     57 c     50 g     36 t
ORIGIN      
        1 gtccttccct ctcctgctag gtgaccctgt cgctgacagc aaaacggatg gccaagaaga
       61 actgcctggt gaagaacctg gaggctgtgg agacccctgg ctccacctcc atcatctgct
      121 cggacaagac tgggacactg acccagaaca ggatgacagt ggcccatctg tggttcgaca
      181 atcagatc
//
LOCUS       HUMATPASEB    141 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 3 (partial),
            clone lambda-NK-alpha-R15-1.
ACCESSION   M27575
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-R15-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 141)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   141     Na+, K+ -ATPase alpha subunit, exon 3 (AA at
                                 1)
BASE COUNT       32 a     41 c     41 g     27 t
ORIGIN      
        1 ctgtccctga cagccaagcg cctggccagt aagaactgcg tggtcaagaa cctggaggcg
       61 gtggagacat tgggctccac ttcggtgatc tgctcggaca agacagggac tctcactcag
      121 aaccgcatga ctgtgtccca t
//
LOCUS       HUMATPAB1     309 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 2,
            clone lambda-NK-alpha-R3-2.
ACCESSION   M27570
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SEGMENT     1 of 3
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 309)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /    21  +   289     Na+, K+ -ATPase alpha subunit, exon 2 (AA at
                                 23)
    IVS     <     1       20     Na+, K+ -ATPase alpha subunit intron A
    IVS         290  >   309     Na+, K+ -ATPase alpha subunit intron B
BASE COUNT       45 a    103 c     89 g     72 t
ORIGIN      
        1 ctaacccctc tggcctgcag gcacggctcg gggcgtggtg gtggccacgg gcgaccgcac
       61 tgtcatgggc cgtatcgcca ccctggcatc agggctggag gtgggcaaga cgcccatcgc
      121 catcgagatt gagcacttca tccagctcat caccggcgtg gctgtcttcc tgggtgtctc
      181 cttcttcatc ctctccctca ttctcggata cacctggctt gaggctgtca tcttcctcat
      241 cggcatcatc gtggccaatg tcccagaggg tctgctggcc actgtcactg taaggccagg
      301 ctcctgggt
//
LOCUS       HUMATPAB2     239 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 3,
            clone lambda-NK-alpha-R3-2.
ACCESSION   M27573
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SEGMENT     2 of 3
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 239)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    21  +   219     Na+, K+ -ATPase alpha subunit, exon 3 (AA at
                                 22)
    IVS     <     1       20     Na+, K+ -ATPase alpha subunit intron B
    IVS         220  >   239     Na+, K+ -ATPase alpha subunit intron C
BASE COUNT       54 a     77 c     68 g     40 t
ORIGIN      
        1 ctgccttgct cgtcctccag gtgtgtctga ccgtgaccgc caagcgcatg gcccggaaga
       61 actgcctggt gaagaacctg gaggctgtag agaccctggg ctccacgtcc accatctgct
      121 cagataagac agggaccctc actcagaacc gcatgacagt cgcccacatg tggtttgaca
      181 accagatcca cgaggctgac accactgagg accagtcagg tgagcgcagg ccccgggta
//
LOCUS       HUMATPAB3     150 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human Na+, K+ -ATPase alpha-subunit gene, exon 4,
            clone lambda-NK-alpha-R3-2.
ACCESSION   M27577
KEYWORDS    Na+, K+ -ATPase alpha-subunit.
SEGMENT     3 of 3
SOURCE      Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 150)
  AUTHORS   Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A.,
            Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V.,
            Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I.,
            Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A.
  TITLE     The family of human Na+ K+ -ATPase genes: No less than five genes
            and/or pseudogenes related to the alpha-subunit
  JOURNAL   FEBS Lett. 217, 275-278 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    21  /   130     Na+, K+ -ATPase alpha subunit, exon 4 (AA at
                                 22)
    IVS     <     1       20     Na+, K+ -ATPase alpha subunit intron C
    IVS         131  >   150     Na+, K+ -ATPase alpha subunit intron D
BASE COUNT       27 a     50 c     38 g     35 t
ORIGIN      
        1 ctcacacatg cctcccccag ggacctcatt tgacaagagt tcgcacacct gggtggccct
       61 gtctcacatc gctgggctct gcaatcgccc tgtcttcaag ggtggtcagg acaacatccc
      121 tgtgctcaag gtgggttagc tactggcctc
//
LOCUS       NGORRDA      1486 bp ss-rRNA            RNA       30-JUN-1990
DEFINITION  N.gonorrhoeae 16S ribosomal RNA.
ACCESSION   M34130
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      N.gonorrhoeae (strain 2013) ribosomal RNA.
  ORGANISM  Neisseria gonorrhoeae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1486)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
            
               Author address: C.R.Woese
               Dept. Microbiology
               131 Burrill Hall
               407 S. Goodwin Ave.
               University of Illinois
               Urbana, IL 61801
FEATURES       from  to/span     description
    rRNA          1     1486     16S ribosomal RNA
BASE COUNT      365 a    334 c    460 g    294 t     33 others
ORIGIN      
        1 nnaacataag agtttgatcc tggctcagat tgaacgctgg cggcatgctt tacacatgca
       61 agtcggacgg cagcacaggg aagcttgctt ctcgggtggc gagtggcgaa cgggtgagta
      121 acatatcgga acgtaccggg tagcggggga taactgatcg aaagatcagc taataccgca
      181 tacgtcttga gagggaaagc aggggacctn cgggccntgc gctatcngag cggccgatat
      241 ctgattagct ngtnggcggg gtaaaggccc accaaggcga cgatcagtag cgggtctgag
      301 aggatgatcc gcnacactgg gactgagaca cggccnagac tcctacggga gncagcagtg
      361 ggnnattttg gacaatgggc gcnagcctga tccagccatg cngcgtgtct gaagaaggcc
      421 ttcgggttgt aaaggacttt tgtcagggaa gaaaaggccg ttgccaatat cggcggccga
      481 tgacggtacc tgaagaataa gcaccggcta actacgtgcn agcagccncg nnnatacgta
      541 gggtgcgagc gttaatcgga attactgggc gtaaagcggg cgcagacggt tacttaagca
      601 ggatgtgaaa tcncngggct caacccggna actgcgttct gaactgggtg actcgagtgt
      661 gtcagaggga ggtggaattc cacgtgtagc agtgaaatgc gtagagatgt ggaggaatac
      721 cnatggcgaa ggcagcctcc tgggataaca ctgacgttca tgtccgaaag cgtgggtagc
      781 aaacaggatt agataccctg gtagtccacg ccctaaacga tgtcaattag ctgttgggca
      841 actnnattgc ttggtagcgt agctaacgcg tgaaattgac cgcctgggga gtacggtcgc
      901 aagattaaaa ctcaaaggaa ttgacggggn accgcacaag cggtggatga tgtggattaa
      961 ttcgatnnaa cgcgaagaac cttacctggt tttgacatgt gcggaatcct ccggagacgg
     1021 agnagtgcct tcgggagccg taacacaggt gctgcatggc tgtcgtcagc tcgtgtcgtg
     1081 agatgttggg ttaagtcccg caacgagcgc aacccttgtc attagttgcc atcattcggt
     1141 tgggcactct aatgagactg ccggtgacaa gccggaggaa ggtggggatg acgtcaagtc
     1201 ctcatggccc ttatgaccag ggcttcacac gtcatacaat ggtcggtaca gagggtagcc
     1261 aagcgcgagc ggagccaatc tcacaaaacc gatcgtagtc cggattgcac tctgcaactc
     1321 gagtgcatga agtcggaatc gctagtaatc gcaggtcagc atactgcggt gaatacgttc
     1381 ccngnncttg tacacaccgc ccgtcacacc atgggagtgg ggnataccag aagtaggtag
     1441 ggtaaccgca aggagtccgc ttaccacggt atgcttcatg actggg
//
LOCUS       SPLRRDA      1492 bp ss-rRNA            RNA       30-JUN-1990
DEFINITION  S.volutans 16S ribosomal RNA.
ACCESSION   M34131
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      S.volutans (strain ATCC 19554) ribosomal RNA.
  ORGANISM  Spirillum volutans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 1 to 1492)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
            
               Author address: C.R.Woese
               Dept. Microbiology
               131 Burrill Hall
               407 S. Goodwin Ave.
               University of Illinois
               Urbana, IL 61801
FEATURES       from  to/span     description
    rRNA          1     1492     16S ribosomal RNA
BASE COUNT      389 a    327 c    446 g    299 t     31 others
ORIGIN      
        1 tcaaatgaag agtttgatcc tggctcagat tgaacgctgn cngcatgcct tacacatgca
       61 agtcgaacgg taacagggac ttcggtccgc tgacgagtgg cgaacgggtg agtaaagcat
      121 aggaatacat ccattagtgg gggataacgc atcgaaaggt gtgctaatac cgcatacgct
      181 ctaaggagga aagcanggga tccnagacct tgcgctaatg ggctagccta tgtccgatta
      241 gctagttggt agagtaaaag cctaccaagg cgacgatcgg taactggtct gagaggatga
      301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaatt
      361 ttggacaatg ggggaaaccc tgatccagcc atgccgcgtg ggtgaagaag gccttcgggt
      421 tgtaaagccc tttcgggagg aaagaaaagc attgctctaa catagcaatg tgatgacggt
      481 acctccataa gaagcaccgg ctaactacgt gccagcngcc gcggtaatac gtagggtgcn
      541 agcgttaatc ggaattactg ggcgtaaagc gtgcgcaggc ggctagataa gccagatgtg
      601 aaatccccga gctcaacttg ggtatcgcgc ttggaactgt ctagcnagag tttggcngag
      661 gggggtagaa ttccacgtgt agcagtgaaa tgcgtagaga tgtggaggaa taccgatggc
      721 gaaggnngcc ccctgggcta aaactgacgc tcatgcncga aagcgtgggg agcaaacagg
      781 attagatacc ctggtagtcc acgccctaaa cgatgtcaac tggttgttgg gagggtaaac
      841 ctcttagtaa cgaagctaac gcgagaagtt gactgcctgg ggagtacggt cgcaagatta
      901 aaactcaaag gaattgacgg gganccgcac aagcggtgga ttatgtggat taattcgatg
      961 caacgcgaaa aaccttacct acctttgaca tgtcgagaat cctgcagaga tgcgggngtg
     1021 ccttcgggaa ctcgaacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt
     1081 gggttaagtc ccgcaacgag cgcaaccctt atcattagtn gctacgcaag ggcactctaa
     1141 tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtcaagtcct catggccctt
     1201 atgggtaggg cttcacacgt aatacaatgg tcggtacaga gggttgccaa cccgtgaggg
     1261 ggagctaatc tcagaaagcc gatcgtagtc cggattgcag tctgcaactc gactgcatga
     1321 agtcggaatc gctagtaatc gcggatcagc atgcngcggt gaatacgttc ccgggtcttg
     1381 tacacaccgc ccgtcacacc atgggagtgg gtttcaccag aagtagttag cctaaccgcn
     1441 aggagggcga ttaccacggt gngattcatg nnnnnnnnnn nnnngtaaca ag
//
LOCUS       LOAREPDNA     839 bp ds-DNA             INV       30-JUN-1990
DEFINITION  Loa loa repetitive DNA sequence.
ACCESSION   M34259
KEYWORDS    repetitive DNA.
SOURCE      Loa loa DNA.
  ORGANISM  Loa loa
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Orthorrhapha; Tabanoidea; Tabanidae.
REFERENCE   1  (bases 1 to 839)
  AUTHORS   Klion,A.D., Raghavan,N., Brindley,P.J. and Nutman,T.B.
  TITLE     Cloning and characterization of a species-specific repetitive DNA
            sequence from Loa loa
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.D.Klion, 11-MAY-1990.
            
               Author address: A.D.Klion
               NIAID/NIH, Lab Parasitic Diseases
               Bldg 4, Rm 126
               9000 Rockville Pike
               Bethesda, MD 20892
BASE COUNT      207 a    130 c    199 g    303 t
ORIGIN      
        1 gatcaaatac ctgatacgct cgtataaccg tgtcatccgg atatttatta tattaattat
       61 taaagtgtta atcaaacatg agcaaaactt tttcgacgac gcaaattaca ttcctcgttt
      121 tagtgcttta acccgaatgt aatataacat aacaaccatt tgcttaggct taggtttttc
      181 gggtttatgc ttaggcttag gtttttcggg tttatgctta ggcttaggct taggttcttc
      241 gggtttatgc ttaggcttag gcttaggttt ctcgggttta tgcttaggtt catgcttagg
      301 cttaggctta ggcttgggct taggcttagg cttaggctta ggtttttcgg gtttatgctt
      361 aggcttaggc ttaggcttgg gcttaggctt aggcttaggc ttaggttttt cgggtttatg
      421 cttaggctta ggcttaggct taggcttggg cttaggctta ggcttaggct taggcttagg
      481 cttaggtttt tcgggtttat gcttaggttt atgcttaggg tttatgctta gttttttttt
      541 gaacactgtt cgataaccat ataagtatca taaatgtaaa catgtaaaat ttcagcattt
      601 ataacaagaa gcaccaaaaa acaccgatgg atgaagcaaa agcggacgat gagcatgcca
      661 cagtgaggag ctgttaagcg acttcgtgct gctacaaatt tgcattatgc aatggattac
      721 agtaatccat tgttggatac agtgttgttt tctgatttgt agtaatccat cacagtaatt
      781 cagcaaagtg aagtgtactc tgttgaaagc gtaagtattg cattggtgat attatgatc
//
LOCUS       CFXRRDA      1486 bp ss-rRNA            BCT       30-JUN-1990
DEFINITION  C.aurantiacus 16S ribosomal RNA.
ACCESSION   M34116
KEYWORDS    16S ribosomal RNA.
SOURCE      C.aurantiacus (strain J10FL) ribosomal RNA.
  ORGANISM  Chloroflexus aurantiacus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Multicellular filamentous green bacteria; Chloroflexaceae.
REFERENCE   1  (bases 1 to 1486)
  AUTHORS   Oyaizu,H., Debrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and
            Woese,C.R.
  TITLE     The green non-sulfur bacteria: A deep branching in the eubacterial
            line of descent
  JOURNAL   Syst. Appl. Microbiol. 9, 47-53 (1987)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 15-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1486     16S ribosomal RNA
BASE COUNT      301 a    394 c    495 g    224 t     72 others
ORIGIN      
        1 acaatgnaga gtttgatcct ggctcaggac gaacgctggc ggcgtgccta atgcatgcaa
       61 gtcgaacgca gcaggcgtgc ctggctgcgt ggcgaacggc tgaggaacac gtgggtgacc
      121 tgccccggag tgggggatac cccgtcgaaa gacgggacaa tcccgcatac gctcgacgga
      181 ggaaagccga aggcgctctg nnaggggccn gcggcccatc aggtagttgg tgtggtaacg
      241 gcgcaccaag ccaatgacgg gtacccggtc tgagaggacg accgggcaga ctgggactga
      301 gacacggccc agactcctac gggaggcagc agcaaggaat tttccccaat gggcgcaagc
      361 ctgagggagc aacgccgcgt ggaggacgac ggccttcggg ttgtaaactc ctttcgggtg
      421 ggacgatgct gacggtacca ccagaagcag ccccggctaa ctctgtgcca gcagccgcgg
      481 taagacagag ggggcnagcg ttgtccggag ttactgggcg taaagggcgc gcaggcggtg
      541 ggctgcgtcg gcgctgaaag cgccccgctt aacggggcga ggcgcgccga tacgagtcca
      601 ctcgaggcaa gcagagggtg gcggaattcc gggtggagcg gtgaaatgcg tagagatccg
      661 gaggaacgcc agtggggaag ccggccacct gggcttgacc tgacgctgag gcgcgacagc
      721 gtggggagca aaccggatta gatacccggg tagtccacgc cgtaaacgat gccggctcgg
      781 cgtctggcgc acgttggtgt gctgggtgcc ttagctcacg cggtaagccg gccgcctggg
      841 gannnnnnnn nnnngcttaa aactcaaagg aattgnnnnn nnnnngcaca agnnnnnnnn
      901 nnnnnngttt aattcgacgc aacacgaaga accttacccg gacttgacat ggcgctgcat
      961 cccctggaaa cagggncgcc tgncgaggcg caccacaggt gctgcatggc tgtcgtcagc
     1021 tcgtgtcgtg agatgttggg ttcagtcccg caacgagcgc aacccgtgtc ggtagttaca
     1081 ggtgtctacc gagactgccg ntgaccggcg gaggaaggcg cggatgacgt caagtcagca
     1141 tggcccttac gtccggggcg acacacacgc tacaatggcc acgacaatgc gttgccaagc
     1201 cgcaaggtgg agctaatcgc ctaaacgtgg tctcagtgca gantcggggc tgcaactcgc
     1261 ccccgtgaag gcggagttgc tagtaaccgc gtatcagcca tggcgcggtg aatacgttcc
     1321 cgggccttgt acacaccgcc cgtcacgtca tgggagtggc caatgcttga agtccgtgtg
     1381 ctaaccccag tcggggaggc agcggccgag ggcaggggcc gcgacnnnnn nnnnnnngta
     1441 acaagnnnnn ngtaccggaa gnnnnnnnnn gatcacctcc tttcta
//
LOCUS       DVURRDA      1551 bp ds-DNA             BCT       30-JUN-1990
DEFINITION  D.desulfuricans 16S ribosomal RNA gene.
ACCESSION   M34113
KEYWORDS    16S ribosomal RNA.
SOURCE      D.desulfuricans (ATCC 27774) DNA.
  ORGANISM  Desulfovibrio desulfuricans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1551)
  AUTHORS   Oyaizu,H. and Woese,C.R.
  TITLE     Phylogenetic relationships among the sulfate respiring bacteria,
            myxobacteria and purple bacteria
  JOURNAL   Syst. Appl. Microbiol. 6, 257-263 (1985)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1551     16S ribosomal RNA
BASE COUNT      387 a    364 c    488 g    311 t      1 others
ORIGIN      
        1 tgaactggag agtntgattc tggctcagat tgaacgctgg cggcgtgctt aacacatgca
       61 agtcgaacgc gaaagggact tcggtcctga gtaaagtggc gcacgggtga gtaacgcgtg
      121 gataatctgc ccttatgatc gggataacag ttggaaacgg ctgctaatac cggatacgct
      181 caaaatgaac tttttgagga aagatggcct ctgcttgcat gctatcacgt aaggatgagt
      241 ccgcgtccca ttagcttgtt ggcggggtaa cggcccacca aggcatcgat gggtagccga
      301 tttgagagga tgatcggcca cactggaact gaaacacggt ccagactcct acgggaggca
      361 gcagtgggga atattgcgca atgggcgaaa gcctgacgca gcgacgccgc gtgagggatg
      421 aaggttttcg gatcgtaaac ctctgtcaga agggaagaaa ctacgttgtg ctaatcagca
      481 gcgtactgac ggtaccttca aaggaagcac cggctaactc cgtgccagca gccgcggtaa
      541 tacggagggt gcaagcgtta atcggaatta ctgggcgtaa agcgcacgta ggctgtagtg
      601 taagtcaggg gtgaaatccc acggctcaac cgtggaactg cctttgatac tgcacaactt
      661 gaatccggga gagggtggcg gaattccagg tgtaggagtg aaatccgtag atatctggag
      721 gaacatcagt ggcgaaggcg gccacctgga ccggtattga cgctgaggtg cgaaagcgtg
      781 gggagcaaac aggattagat accctggtag tccacgctgt aaacgatgga tgctagatgt
      841 cggggagtat tcttcggtgt cgtagttaac gcgttaagca tcccgcctgg ggagtacggt
      901 cgcaaggctg aaactcaaag aaattgacgg gggcccgcac aagcggtgga gtatgtggtt
      961 taattcgatg caacgcgaag aaccttacct aggtttgaca tccacggaac cctcccgaaa
     1021 aggaggggtg cccttcgggg agccgtgaga caggtgctgc atggctgtcg tcagctcgtg
     1081 tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ctatggatag ttgccagcaa
     1141 gtaatgttgg gcactctatt cagactgccc gggttaaccg ggaggaaggt ggggacgacg
     1201 tcaagtcatc atggccctta cgcctagggc tacacacgta ctacaatggc gcgcacaaag
     1261 gggagcgaga ccgcgaggtg gagccaatcc caaaaaacgc gtcccagtcc ggattgcagt
     1321 ctgcaactcg actgcatgaa gttggaatcg ctagtaattc gagatcagca tgctcgggtg
     1381 aatgcgttcc cgggccttgt acacaccgcc cgtcacacca cgaaagtcgg ttttacccga
     1441 agccggtgag ccaaccagca atggaggcag ccgtctacgg tagggccgat gattggggtg
     1501 aagtcgtaac aaggtagccg taggggaacc tgcggctgga tcacctcctt t
//
LOCUS       HPTRRDA      1484 bp ss-rRNA            BCT       30-JUN-1990
DEFINITION  H.aurantiacus 16S ribosomal RNA.
ACCESSION   M34117
KEYWORDS    16S ribosomal RNA.
SOURCE      H.aurantiacus (ATCC 23779) ribosomal RNA.
  ORGANISM  Herpetosiphon aurantiacus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; 
            Cytophagaceae.
REFERENCE   1  (bases 1 to 1484)
  AUTHORS   Oyaizu,H., Debrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and
            Woese,C.R.
  TITLE     The green non-sulfur bacteria: A deep branching in the eubacterial
            line of descent
  JOURNAL   Syst. Appl. Microbiol. 9, 47-53 (1987)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 15-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1484     16S ribosomal RNA
BASE COUNT      331 a    359 c    465 g    265 t     64 others
ORIGIN      
        1 nttgtcggcg agtttgatcc tggctcagga cgaacgctgg cggcgtgcct aatgcatgca
       61 agtcgaacgc attcttcgga aggagtggcg cacggctgag gaacacgtga ctaacctacc
      121 ccggtgtggg ggataacggg tcgaaagact cgctaatccc gccatacgat ngcctcggcg
      181 gaggaaagcc gtaaggcgca ctggnngggg gtcgcgtccc attagatagt tggtgtggta
      241 atggcgcacc aagtcgatga tgggtctctg gtctgagagg ncgaccagac agattgggac
      301 tgagacacgg cccaaactcc tacggggngc agcagcaagg aattttcggc aatgggcgca
      361 agcctgaccg agcaacgccg cgtggaggat gacggctctt gggttgtaaa ctccttttgg
      421 gggggacgat aatgacggta ccctccgaat caggcccggc taactacgtg ccagcagccg
      481 cggtaatacg taggggccaa gcgttgtccg gaattactgg gcgtaaagcg tgnntaggcn
      541 gtcgatgatg tgccgcgtga aagcgccgga gtaatgccgg ccaggtcgcg gcagacacgt
      601 tgactagagg ctcgcagagg aacgtggaat tcccggtgta gtggtgaaat gcgtagatat
      661 cgggaggaac accagtggcg caagcggcgt tctgggcgag acctgacact gagccacgac
      721 ggcgtgggga gcaaacagga ttagataccc tggtagtcca cgcagtaaac gatgcatacc
      781 aggtgtggga tggcgttcgc gtcgttccgt gccgcagctt acgcgatgag tatgccgcct
      841 ggggactacg agcgcaagct taaaactcaa aggaattgnn nnnnnnnngc acaagnnnnn
      901 nnnnnnnnng tttaattcga cgcaacgcga agaaccttac ctagtcttga catagcactg
      961 caagcttcgg aaatgaagtt cgccttcgag ggtgtgctac aggtgctgca tggctgtcgt
     1021 cagctcgtgt cgtgagatgt tgggttaagt cccgcaacga gcgcaacccc tgtgaggtgt
     1081 tacaagtgtc acctcagact gccgttgtca acaacggagg aaggcgggga tgacgtcaag
     1141 tccgcatggc ccttacgact agggcgacac acacgctaca atggctggga gaatgcgccg
     1201 cgacctggca acaggctgcg aatcgagaac accagtcaca gttcagattg ggggctgcaa
     1261 ctcgccccca tgaaggcgga gttgctagta atcgccggtc agcatacggc ggtgaatcag
     1321 tacccgggcc ttgtacacac cgcccgtcac gtcatggaag tgggaaacac ctgaagtccg
     1381 tggctaaccg caaggaggca gcggccgagg gtgggtctcg taactgnnnn nnnnnngtaa
     1441 caagnnnnnn nnnnnngaag nnnnnnnnng atcacctcct ttct
//
LOCUS       MXARRDB      1540 bp ds-DNA             BCT       30-JUN-1990
DEFINITION  M.xanthus 16S ribosomal RNA gene.
ACCESSION   M34114
KEYWORDS    16S ribosomal RNA.
SOURCE      M.xanthus (strain MD207) DNA.
  ORGANISM  Myxococcus xanthus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; 
            Myxococcaceae.
REFERENCE   1  (bases 1 to 1540)
  AUTHORS   Oyaizu,H. and Woese,C.R.
  TITLE     Phylogenetic relationships among the sulfate respiring bacteria,
            myxobacteria and purple bacteria
  JOURNAL   Syst. Appl. Microbiol. 6, 257-263 (1985)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1540     16S ribosomal RNA
BASE COUNT      384 a    372 c    488 g    295 t      1 others
ORIGIN      
        1 caattggaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa
       61 gtcgagcgcg aataggggca acccttagta gagcggcgca cgggtgcgta acacgtggat
      121 aatctgcctg agtgctcggg ataaccagtc gaaagattgg ctaataccgg ataagcccac
      181 ggtttcttcg gagactgagg gaaaaggtgg cctctgtata caagctatca cattcagatg
      241 agtccgcggc ccatcagcta gttggcgggg taatggccca ccaaggcaac gacgggtagc
      301 tggtctgaga ggacgatcag ccacactgga actgagacac ggtccagact cctacgggag
      361 gcagcagtgg ggaattttgc gcaatgggcg aaagcctgac gcagcaacgc cgcgtgtgtg
      421 atgaaggtct ttggattgta aagcactttc gaccgggaag aaaacccgtt ggctaacatc
      481 caacggcttg acggtaccgg gagaagaagc accggctaac tctgtgccag cagccgcggt
      541 aatacagagg gtgcaagcgt tgttcggaat tattgggcgt aaagcgcgtg taggcggcgt
      601 gacaagtcgg gtgtgaaagc cctcagctca actgaggaag tgcgcccgaa actgttgtgc
      661 ttgagtgccg gagagggtgg cggaattccc caagtagagg tgaaattcgt agatatgggg
      721 aggaacaccg gtggcgaagg cggccacctg gacggtaact gacgctgaga cgcgaaagcg
      781 tggggagcaa acaggattag ataccctggt agtccacgcc gtaaacgatg agaactaggt
      841 gtcgtgggag ttgacccccg cggtgccgaa gctaacgcat taagttctcc gcctgggaag
      901 tacggtcgca agactaaaac tcaaaggaat tgacgggggn ccgcacaagc ggtggagcat
      961 gtggtttaat tcgacgcaac gcgcagaacc ttacctggtc ttgacatcct cagaatcctt
     1021 cagagatgag ggagtgcccg caagggaact gagagacagg tgctgcatgg ctgtcgtcag
     1081 ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caaccctcgc ctttagttgc
     1141 cacgcaagtg gatctctaga gggactgccg gtgttaaacc ggaggaaggt ggggatgacg
     1201 tcaagtcctc atggccttta tgaccagggc tacacacgtg ctacaatggc cggtacagag
     1261 cgttgccaac ccgcgagggg gagctaatcg cataaaaccg gtctcagttc agattggagt
     1321 ctgcaactcg actccatgaa ggaggaatcg ctagtaatcg cagatcagca cgctgcggtg
     1381 aatacgttcc cgggccttgt acacaccgcc cgtcacacca tgggagtcga ttgctccaga
     1441 aatcatctca ccaagaggtg ctcaaggagt ggtcggtaac tggggtgaag tcgtaacaag
     1501 gtagccgtag gggaacctgc ggctggatca cctcctttct
//
LOCUS       RBLRRDA      1491 bp ss-rRNA            BCT       30-JUN-1990
DEFINITION  R.acidophila 16S ribosomal RNA.
ACCESSION   M34128
KEYWORDS    16S ribosomal RNA.
SOURCE      R.acidophila (strain 7050) ribosomal RNA.
  ORGANISM  Rhodopseudomonas acidophila
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 1491)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              Microbiology Department, 131 Burrill Hall
              University of Illinois
              407 S. Goodwill Ave.
              Urbana, IL 61801
              E-mail: ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1491     16S ribosomal RNA
BASE COUNT      364 a    341 c    449 g    300 t     37 others
ORIGIN      
        1 tacaacttga gagtttgatc ctggctcaga acgaacgctg gcggcaggct taacacatgc
       61 aagtcgaacg ggcatagcaa tatgtcagtg gcagacgggt gaggaacacg tgggaacatg
      121 ccctttggtt cggaacaact cagggaaact tgagctaata ccggataagc ctttcgaggg
      181 aaagatttat cgccgaagga gtggcccgcg tccgattagc tagttggtga ggtaatggct
      241 caccaaggcg acgatcggna gctngtctga gaggatgatc agccacattg ggactgagac
      301 acggcccaaa ctcctacggg aggcagcagt ggggaatatt ggacaatggg cgcaagccng
      361 atccagccat gccgcgtgag tgatgacggc cctagggttg taaagctctt tcacccacga
      421 cgataatgac ggtagtgggc gaagaagccc cggctaactt cgtgccagca gccgcggtaa
      481 tacgaagggg gcnagcgttg ttcggattta ctgggcgtaa agcgcacgta ggcngatctt
      541 taagtcaggg gtgaaatgcc ggagctcaac ttcggaactg cctttgatac tggagatctt
      601 gagttcgaga gaggtgagtg gaactgcgag tgtagaggtg aaattcgtag atattcgcaa
      661 gaacaccagt ggcgaaggcg gctcactggc tcgatactga cgctgaggtg cnnaagcgtg
      721 gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgga tgctagccgt
      781 taggcagctt gctgcttagt ggcgcagcta acgctttaag catcccgcct ggggagtacg
      841 gtcgcaagat taaaactcaa aggaattgac gggggcncgc acaagcggtg gagcatgtgg
      901 tttaattcga agcaacgcgc agaaccttac cagcttttga catgtcttgg acggatagca
      961 gagatgttat cttctcttcg gagccgagaa cacaggtgct gcatggctgt cgtcngctcg
     1021 tgtcgtgaga tgttgggtta agtcccgcaa cgagcgcaac ccacgtcctt agttgccatc
     1081 attaagttgg gcactctagg gagactgccg gtgataagcc gagaggaagg tgtggatgac
     1141 gtcaagtcct catggccctt acaggctggg ctacacacgt gctacaatgg cggtgacaat
     1201 gggaagcgaa agggcgacct ctagcaaatc tcaaaaaacc gtctcagttc ggattgtact
     1261 ctgcaactcg agtgcatgaa ggtggaatcg ctagtaatcg cagatcagaa cgctgcggtg
     1321 aatacgttcc cgggccttgt acacaccgcc cgtcacacca tgggagttgg ctttacccga
     1381 aggcgtttcg ctaaccgcaa ggaggcaggc gaccacggta gggtcagcgn ctnggnnnnn
     1441 nnngtaacaa gnnnnnnnnn nnngaacctg nnnnnngatc acctcctttc t
//
LOCUS       RCARRDA      1470 bp ss-rRNA            BCT       30-JUN-1990
DEFINITION  R.capsulatus 16S ribosomal RNA.
ACCESSION   M34129
KEYWORDS    16S ribosomal RNA.
SOURCE      R.capsulatus (strain B10, ATCC 33303) ribosomal RNA.
  ORGANISM  Rhodobacter capsulatus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 1470)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              Microbiology Department, 131 Burrill Hall
              University of Illinois
              407 S. Goodwill Ave.
              Urbana, IL 61801
              E-mail: ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1470     16S ribosomal RNA
BASE COUNT      343 a    330 c    423 g    275 t     99 others
ORIGIN      
        1 ttcaacttga gagtttgatc ctggctcaga acgaacgctg gcggcaggcc taacacatgc
       61 aagtcgagcg agaccttcgg gtctagcggc ggacgggtga gtaacgcgtg ggaacgtgcc
      121 ctttgctacg gaatagcccc gggaaactgg gagtaatacc gtatgtgccc ttcggggnaa
      181 agatttatcg gcaaaggatc ggcccgcgtt ggattaggta gttggtgggg taatggccta
      241 ccaagccgac gatccatagc tggtttgaga ggatgatcag ccacactggg actgagacac
      301 ggcccagact cctacgggag gcagcagtgg ggaatcttag acaatggggg aaaccctgat
      361 ctagccatgc cgcgtgagcg atgaaggcct tagggttgta aagctctttc aggtgggaag
      421 ataatgacgg taccaccaga agaagccccg gctaactccg nnnnnnnngc cgcggtaata
      481 cgnnnnnnnn nnnnnnnnnn nngaattact gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      541 nnnnngnggt gaaatcccag ggctcaacct tggaactgcc tttgaaactc ctggtcttga
      601 ggncgagaga ggtgagtgga attccgagtg tagaggtgaa attcgtagat attcggagga
      661 acaccagtgg cgaaggcggc tcactggctc gatactgacg ctgaggtgcg aaagcgtggg
      721 gagcaaacag gattagatac cctggtagtc cacgccgtaa acgatgaatg ccagtcgtcg
      781 gcaggcatgc ctgtcggtga cacactaacg gattaagcat tccgcctggg gagaacggtc
      841 gcaagattaa aactcaaagg aattgacggg ggcccgcaca agcggtggag catgtggttt
      901 aattcgaagc aacgcgcaga accttaccaa cccttgacat cgagatcgcg gttaccagag
      961 atggtttcct tcagttcggc tggatcttag acaggtgctg catggctgtc gtcagctcgt
     1021 gtcgtgagat gttgggttaa gtccggcaac gagcgcaacc cacactttca gttgccatca
     1081 ttcagttggg cactctggaa gaactgccga tgataagtcg gaggaaggtg tggaggacgt
     1141 caagtcctca tggcccttac gggttgggct acacacgtgc tacaatggtg gtgacaatgg
     1201 gccaatccca aaaagccatc tcagttcgga ttggggtctg caactcgacc ccatgaagtn
     1261 ggaatcgcta gtaatcgcgt aacagcaaga cgcggtgaat acgttcccgg gccttgtaca
     1321 caccgcccgt cacaccatgg gaattgggtc taccctaaga aggtgcgcca accagcaang
     1381 gaggcagcna gccacggtag gcacaggnng gnnnnnnnnn nngtaacaag nnnnnnnnnn
     1441 nngaacctgn nnnnngatca cctcctttct
//
LOCUS       RMVRRDA      1484 bp ss-rRNA            BCT       30-JUN-1990
DEFINITION  R.vannielii 16S ribosomal RNA.
ACCESSION   M34127
KEYWORDS    16S ribosomal RNA.
SOURCE      R.vannielii (strain EY33) ribosomal RNA.
  ORGANISM  Rhodomicrobium vannielii
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 1484)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              Microbiology Department, 131 Burrill Hall
              University of Illinois
              407 S. Goodwill Ave.
              Urbana, IL 61801
              E-mail: ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1484     16S ribosomal RNA
BASE COUNT      343 a    343 c    433 g    292 t     73 others
ORIGIN      
        1 caaaatctga gagtttgatc ctggctcaga acgaacgctg gcggcaggct taacacatgc
       61 aagtcgaacg caccgcaagg tgagtggcag acgggtgagt aacgcgtggg aaccttccct
      121 ttggtacgga ataacttcgg gaaaccgaag ctaataccgt atatctcctc cgggagaaag
      181 atttatcgcc aaaggatggg cccgcgttgg attagctagt tggtgtggta acggcgcacc
      241 aaggcgacga tccatagctg gtctgagagg atgatcagcc acactgggac tgagacacgg
      301 cccagactcc tacgggaggc ngcngtgggg aatcttggac aatgggggca accctgatcc
      361 agccatgccg cgtgagtgaa gaaggcctta gggttgtaaa gctcttttgg cggggacgat
      421 aatgacggta cccgcagaat aagccccggc taacttcgtg ccagcagccg cggtaatacg
      481 aagggggcna gcgttgttcg gaatcactgg gcgtaaagcg tacgcaggcg gattgataag
      541 tcaggggtga aatcccgggg ctcaacctcg gaattgcctt tgatactgtc tgtctcgagt
      601 tcgggagagg ttggcggaat tcctagtgta gaggtgaaat tcgtagatat taggaagaac
      661 accagtggcg aaggnngcca actggcccga tactgacgct catgtacgaa agcgtgggga
      721 gcaaacagga ttagataccc tggtagtcca cgctgtaaac tatggatgct agccgttggg
      781 gagcttgctc ttcagtggcg cagctaacgt cttaagcatc ccgcctgggg agtacggtcg
      841 caagattaaa actcaaagga attgacgggg gcccgcacaa gcggtggagc atgtggttta
      901 attcgaggca acgcgaagaa ccttaccagc tcttgacatg tcgtgctacg tggagagatt
      961 cacggttccc ttcggggacg cgaacacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt
     1021 gagatgttgg gttaagtccc gcaacgagcg caaccctcgc ccttagttgc taccatttag
     1081 ttgagcactc taaggnnacc gccggtgata agccggagga aggtggggat gacgtcaagt
     1141 catcatggcc cttacgggct gggctacaca cgtgctacaa tggcggtgac agtgggcagc
     1201 gacacagcga tgtgatgcta atcccaaaaa gccgtctcag ttcagattgc actctgcaac
     1261 tcgagtgcat gaagtcggaa tcgctagtaa tcgcggatca gcatgccgcg gtgaatacgt
     1321 tcccgggcct tgtacacacc gcccgtcaca ccatgggagt tggttttacc cgaaggcgtn
     1381 angctaaccg nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngtaa
     1441 caagnnnnnn nnnnnngaac ctgnnnnnng atcacctcct ttct
//
LOCUS       THRRRDA      1528 bp ds-DNA             BCT       30-JUN-1990
DEFINITION  T.roseum 16S ribosomal RNA gene.
ACCESSION   M34115
KEYWORDS    16S ribosomal RNA.
SOURCE      T.roseum (strain ATCC 27502) DNA.
  ORGANISM  Thermomicrobium roseum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1528)
  AUTHORS   Oyaizu,H., Debrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and
            Woese,C.R.
  TITLE     The green non-sulfur bacteria: A deep branching in the eubacterial
            line of descent
  JOURNAL   Syst. Appl. Microbiol. 9, 47-53 (1987)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 15-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1528     16S ribosomal RNA
BASE COUNT      295 a    416 c    585 g    226 t      6 others
ORIGIN      
        1 ggggatggag agtttgatcc tggctcaggg ggaacgctgg cggcgtgcct aatgcatgca
       61 agtcggacgg gangcacgcn ctctggcgtg ccgaccgtgg cggacgggtg cgtaacacgt
      121 ggggaaccct cccgggtgcg ggggataacc cggggaaact cgggctaata ccccatacgc
      181 ttggctggcg gtgggccggc caaggaaagg cgcaggcgac tgtgctgtgc tcggagggcc
      241 ctgcggccta tcagctagac ggtagggtaa cggcctaccg tggcgatgac gggtagctgg
      301 tctgagagga tggccagcca cacgggcact gagacacggg cccgactcct acgggaggca
      361 gcagcaggga atcttccgca atgggggcaa ccctgacgga gcgacgccgc gtgcgggagg
      421 aagcccttcg gggtgtaaac cgctgttcgg ggggacgatc gagcgacggt accctcggag
      481 caagtcccgg ctaactacgt gccagcagcc gcggtaagac gtagggggcg agcgttaccc
      541 ggagtcactg ggcgtaaagg gcgtgtaggc ggctgggtac gccgcgtgtg aaagtccccg
      601 gctcaaccgg ggagggtcgc gcgggacggc ctggctcgag ggcgggagag gcgggtggaa
      661 ttcccggtgt agcggtgaaa tgcgtagaga tcgggaggaa cgccggtggc gaaggcggcc
      721 cgctggcccg tacctgacgc tgaggcgcga aggcgtgggg agcgaaccgg attagatacc
      781 cgggtagtcc acgcagtaaa cgatgcgggc gaggtgtggg tggtgttgac cccatccgtg
      841 ccggcgccaa cgcagtaagc ccgccgcctg gggagtacgg ccgcaaggct aaaactcaaa
      901 ggaattgacg ggggcccgca caagcagcgg agcgtgtggt ttaattcgac gcaacgcgaa
      961 gaaccttacc agggcttgac atgccaccga acctggctga aaggctgggg tgcccggtga
     1021 gggagcggtg gcacaggtgc tgcatggctg tcgtcagctc gtgccgtgag gtgttgggtt
     1081 aagtcccgca acgagcgcaa ccctcggggt cagttacgcg ggtgtctgac ccgactgccg
     1141 gggaaagccc ggaggaagga ggggatgacg tcaagtcagc atggccctga cgccctgggc
     1201 gacacacacg ctacagtgac cgggacagtg ggcagcgaag gggcgacctg gagccaatcc
     1261 cgcaaacccg gtcgtggtgg ggatcgcagg ctgcaacccg cctgcgtgaa cgcggagttg
     1321 ctagtaaccg ccggtcagcc atacggcggt gaatacgttc ccgggccttg tacacaccgc
     1381 ccgtcacgtc acgaaagctg gcttcacctg aagctggtgg gccaaccggc gcacgctggg
     1441 ggcagccgtc gagggtgggg ctggtgattg ggacgaagtc gtaacaaggt agccgtaccg
     1501 gaaggnnnng gtggatcacc tcctttct
//
LOCUS       RATTATFL    11973 bp ds-DNA             ROD       30-JUN-1990
DEFINITION  Rat tyrosine aminotransferase (TAT) gene, 5' flank.
ACCESSION   M34257
KEYWORDS    tyrosine aminotransferase.
SOURCE      Rat (strain Wistar) adult male liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 8314 to 8413)
  AUTHORS   Boshart,M., Weih,F., Schmidt,A., Fournier,R.E.K. and Schuetz,G.
  TITLE     A cyclic AMP response element mediates repression of tyrosine
            aminotransferase gene transcription by the tissue-specific
            extinguisher locus Tse-1
  JOURNAL   Cell 61, 905-916 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 11973)
  AUTHORS   Labeit,S., Strahle,U. and Weih,F.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by M.Boshart, 11-MAY-1990.
BASE COUNT     3162 a   2618 c   2675 g   3518 t
ORIGIN      
        1 gttcatgcat gtacaagtac atgtgaatgt gtgtgaacaa cttcagatgt cattacttgg
       61 gccataatta ctgttgttga tgttgatagg gcctatcttc gcctgggggt ggccagtgag
      121 gctggctaac tgctcagtca atccagacat ttgctcatct ctccgttccc aatgctggca
      181 ttactacccg tgcctcccac accaccatgc ctggtatttt ttttttttcc tttttgtctt
      241 tttaaatctg ggctgtaagg ttgaactcag gtccttgtgc atgcaagtca agtgttttac
      301 agactgagct atctctccta gtcctgtgct ggaaattccc tcagacactt ctacatcaga
      361 ttcaaatcct ttctctaggt gagagatgcc attcatttgc caacaaggcg gaatgggtga
      421 gcttgtccaa aataccacgt ttaatcaggt tgtctgtttt ggttggctta tttgacactt
      481 acaagtcaac attgcaacat tattatatat ttgtcttctg attttaaaaa tgatgggtga
      541 tgcgcttgtg ttttaacctt ttctaggaga taaacgtcta cctgttgaat taacagactt
      601 actttttaaa catcctgtcg tacatgacac ttttttttaa acatttattc atttattata
      661 tataagtaca ctgtagctgc tcttcagaca caccagaaga gggcatcgga tctctttaca
      721 gatggttgtg agccaccacg tggttgctgg gaattgaact caggacctct ggaagagtag
      781 tcgggtgctc ttaaccgctg agccatttct ccagcccgac actttttaat actatataaa
      841 acaagttaaa taaatacgat cgttgccctt agaaatgcac aaacacgaaa tgggacagac
      901 ttttcacttt ctttcatgcg tagcacttcc acgaaaccac aagttaggaa caatgatcaa
      961 atatttacag gacccctaag catcacagtt aataattttc atgtggggac gcacctatat
     1021 gtgattggca gttgaatcaa caaagcagca aaaatcaaca actttaaaat aagatgtgta
     1081 ggtcattttg gtcaactaaa atttgagaaa tcgctgattt gtgtatgcac gtgtgttcct
     1141 aaaaccgtta ctttatgtag ttctttttca aagaaaggag atgggcaccc ccaggaaagc
     1201 tggttatacc cattgcttgg agtttctgat gactgctttg acatcagggc agcattctgt
     1261 catcatccgt gtgtactctg ggatgacagg gacctggacc tgtgtagaag agtcccctaa
     1321 ctcccttctc actccccagg gcttaaggct tgcctgggtc acctgatgct cgtttgcagt
     1381 gggcgaggta gggcaggtga ggagagtggg tccattgccg tgagacacgc aatcagtggc
     1441 aatagtgaag tcactgagcc ggacgtttct caatatttgc tctggcagag ggaggggggg
     1501 gggagggagg gagggaggga ggtagagctg tttttttgtt ctaagggact gaaaactctt
     1561 ctggaaatct acggggaatg catggagagc cagcactggg tttggggctc agggggtctc
     1621 gtggacagca tgtccagcct ttcctagtgc taatctactt ggcaaatgga gattcctcct
     1681 tcgggtgtca gctcaggggt agatgaagca ccctctgcct tttccgggct catcctgaac
     1741 ttcctataga acactgatct ataagcctct cgcctgtcct cctggagcac ctgccattca
     1801 caggattcat cagggtggtg ttcagggaag acacccattg gggtcgtcaa cacaatagag
     1861 ccaactgaag aagggaggat ctgcctccca ttttatattt attttattat tttatttgag
     1921 acagagtatg tctatgtagc cttggctagg ctggaacttt ccttttttga gcaggttggc
     1981 ctcaaactca ccaagattat ctggctatac atctcctgag tgctgggatt aaaggagtgt
     2041 accactgtac ctggcttctc tctctctctg tctctgtctc tgtctgtctc tctgtctctc
     2101 tgtctctctg tctctctctc tctctctctc tctttctctc tctctctctt ccttccttcc
     2161 tcttcctttc cttctttcct tcttttcttc tctctctctt tctttcttcc tttctttctt
     2221 tctttctttc tttctttctt tctttctttc tttctttctt tctttctttc tttctttctt
     2281 ctttctcttc tagtttttgt gtgtgtgtct gtgtgttttg gatacagtct tgctatgtga
     2341 ccctacgctc ttctcaaact tgtatttatt ttgccacagt tttcccagag ctgagagttg
     2401 ctttccttaa aaaaaacaaa aaatcacagg ggttggggat ttagctcagt ggtagcgcta
     2461 tggcctagca agcgcgaagg ccctgggttc ggtcctcagc tccgaaaaaa aagggaaaaa
     2521 aaattacata ttacttttag agaataacag gtttctgttg tgaaatttcc gtaagtattt
     2581 acaacaaatg cagtactgac tattaaacca agagccatgt gtatgctagg caaatgctct
     2641 acccctgacc tgcgtaatgc actttgatga cactcttaac ccgttgcatc ctggtcacac
     2701 tcttctcctt cctcttccca aatagataat cttctgcctt tatgcctttt tcttttctct
     2761 acatctagat tccgcatgtg agattcttcc ttagctggtc ttctcttact gaacaccgtg
     2821 ctctgtccag ttgtctgcaa atggcatcat tccgtccttt agagctgaac agaactccac
     2881 cgtgtatgcg tactgcatct tttttatcca catctctgtg gacggaccct tagctgattg
     2941 cgaagtctgc ctattacgaa tacagtgacg gcagatgcgc aggcatctac ttccgctcct
     3001 ccaaggtgcg caggcgtctc ggcagtacac cgacttccgc tccacccggg acatacctgc
     3061 gagtggtata gctaggtcac agggtgattc tatttttcga atctatacta atttccatgc
     3121 ggctatatta atttacattc ccacaacaat atatggaggt tccattccgc ctgtacatta
     3181 tatcccccct ctctttctct gcctcaagta tgctaggcaa gcacttttac cagtcagcca
     3241 acattcccag cccacttatg gttttcttgg tgatgccatt atgattggga tgaggtgaac
     3301 atctccatgt agttaagatt ggccttttcc tgattctatt tccctcctgt tcatcagaca
     3361 gaccactgaa tagcctaatc atatggggcc ctgaggatct ccagttataa aagttaatga
     3421 ttctggttgc ttaattgact taggtaacag ttaataaaac acagggcctc atgatttact
     3481 gcctatgagg cagaatagct taactaccct gctgttagag gaggccacct tggcccgagg
     3541 tttggttctg ttctttagta gctgtgtggc ctttggcaag ttattcaact ctcctatgtt
     3601 tgggtttctg cagttttcaa atgcgtttat aatagtacct accacatggt ggaatcagta
     3661 aagtaacaca gacttagaat tgccaagtgc caagaacaca agagatgctt gatcaatatt
     3721 tacgggtgac taaaattatt atttctacat ttttagtcat tcttaaaagg caatgctcct
     3781 tccccccttt ttcttctgga cagaatagaa aaaacatcgt tttcaattca taataaattt
     3841 ccattctcat aaaatatcca gtaccattta aggagttaat acaagctcct tgaccctcat
     3901 cacaggtatt aataaagctt tatgaattac attatcagtc acttgttcat tcagccatcc
     3961 tactgagcac tggctgtatg atagacacca tctcatgaaa caacgataaa taaatattct
     4021 gctctcatgg aattaatatt ccagcagggg gaaataggca ataattaaaa tatgtaaaat
     4081 gtatactata ttttactgat tctaaaacat ccattttccc agatgttaaa acccagaatc
     4141 taggtgtggt gcctcccatt tgtaatccta gtattcagga ggctgagcag aaggctgcaa
     4201 gttcaaggct tgcctcagtt aaaaaaataa attctcaggg ctggggattg tattacttgg
     4261 ttgttaggat gcttacctaa gcacacatta gaccctaaat ttgaccctca atactgcata
     4321 aaactgggtc tggtgacatc caggtggtgg aggcaggagg atcagaagtt taaagttatc
     4381 tggctacata gtgagttcca agacaggtca gccacaacct tgtctaaaaa aaaaaaaaaa
     4441 accacacaat taatattcat ttcacactga ctgaccagca tttttctttt gaacaaaaaa
     4501 caaaacaaaa caacatcaga ccaagacaac aaatgcgtca acagcctctt agttttgatg
     4561 ctggacacag ttgcggttag atatagtctc ctcctcctgc ccgcttgctg gaaggtaaaa
     4621 ccaatatcct ttgggtagca aaatcccggc accttctctc ggactttgtt ttgctcatag
     4681 ttctgtggca ggaaacttga tgtcctagac ttgggtaatt cactgtgatt tggccctcgc
     4741 tgatggtcct ctcccctctg taggtcccag cagcgagtct ggataaagat tgctgacttg
     4801 cataatctct ggcatttgtt accaggcggg gtagggaacg agtgatcctg agacttcact
     4861 gagggaggtc agcagcaaag ctctggaaca ttccatgttc cttggccaag tgcatgctcc
     4921 acaacttaca gatcatggag actgaactgg gactttctgc agcaggaagt caggctcagc
     4981 ccctacctgt ccgaatggca gaccttgtat tacccattaa gccacgggac tgggatgctc
     5041 aggattgctg tggtctaagc aggtagtgag agatgagagg ctggctctga ctggtttggg
     5101 ttattatttg tgagtgctca aaaggaaaca gatggtgcgg tcaacctagg cagtttacga
     5161 aaagggactt tcacttgtga ctgtgttcta gcaggatttg tctgtggcct cttgtgcttc
     5221 tgagcggaga ctgggaacga ttacatgggt tcacatgtgg tcacagtcat cctgggccaa
     5281 caacaggctc agatgggcta gccatccaaa atggcttctt catcggttca gtgcgccgat
     5341 agttttgcat cccctctttg acaaaacaca cgggatttct tttagtttta ttttaccttt
     5401 atattttttc atcttattaa atctttccct ccactttatt tgtattgcca tataaccttt
     5461 tccccacata ctatttcctg attttttgat ccccttccct ctgctcccag gtttccctgc
     5521 ctttcctcct atccagattc actctctttc tgtctctcac tagaaaacaa ccaggcttct
     5581 aaaggataat aataaaatat aatataacac aagaaaccaa catctaatac actgggataa
     5641 agacaaaata aaaagaagaa aagaacacaa taagagacag acacagagac ccactagttc
     5701 atacacttag gaatcccaca aaaacactaa actggaagcc ataatataga cgcaaaggac
     5761 ctgtagggta aaaggagaga aaaatatgaa ataattaaat acaaaaaaaa aaaatagccc
     5821 tggcgtggca ctgtgaaaca aggcatctcc aatgttgagt tggtttccca tggaccatct
     5881 cctgctgggc atggggcccg cccctaagag tagtttgttt ccccagcgag actcccttgg
     5941 agaaaagtaa aattttattt gccagtgttt ttaagtggtg ataacttctg gattagggat
     6001 ggggtacgtg tccaactctc ctttcaactc taggacccat ctggtgcagc cctgtgacag
     6061 ccctgcacgt gctgccacag tctctgtgag ttcctgtgtg gatcagtcct gttgaccaga
     6121 gggcttgttt tctcggcgtc ttccatgcct ctggctctcc tgcctcctct tctgcagggt
     6181 tcccagagac acccatttaa gccggggtat gccaaggttt ctcacgctct gcatgctgtc
     6241 tggctacggc tttctatttg tcccatctgc cgaagaagga agcctctctg atgatgactg
     6301 agcagggcac tgatctactt ccctgactga agcacacact ggatttcttt cacgaaagct
     6361 cagggtacta agcgagattt ggtagacgaa ctggcctgtg ttcagaacgc tcagaatgac
     6421 tcctctggct ttcatcggtc aacccagtca tggcaaactt atttcctgat aaaatgctca
     6481 ggatgaggac tggattgcag atgcggccac aaataaaaga acacgaagtt caccctggag
     6541 acaaacggct gaggtaagag tcatggttgc cacagttatg caaaacacaa aacaaataag
     6601 gtggtgctat ttgctaaact ccgtgagtgg ggcacccaga aggcacctca gtctgcaaac
     6661 agatgaagat tttatttagc ttttgaagag cagagtgctg aagtagagag aaaagccaca
     6721 acctttaaag ggtgagcagc cctttgaaaa cccactatcc ctttatacca tgaacaaaac
     6781 agcagatcta atgctggctc ttcctcccct cattaccttc tccctttttt atgtgctatt
     6841 tactgccaac tggtgagcac gtccagaccg aggcttatag gaagcaccac gtaagacaag
     6901 agggcataag tggagaacaa ggaattgatt gtatgtgcag tacccaagtg atactgtcca
     6961 ttattctgga tggaaaggaa cgacttagct ggtgaggaga tggaaaaggc aggcggatgt
     7021 ttcctgcatc agctgaagag ttagttgact attttttttc ttaatttgtt tttgtttgtg
     7081 tatgagtgct ttgcgcgcac ctagtctgtg taccacatgt gtgcaatgct caaggaggcc
     7141 agaagagggc gccgtatctc ctggagctgg cgttgcagat ggtcccgaac ggccacgggg
     7201 gtgctatgaa ttgaacaaaa agtaggccag caggaagttc atcgtctcag gaaaaagtct
     7261 cattaaaata tcctaaaata agacccatag agtaccaagc tgagagagta aggatataga
     7321 atgtacgtct ggaacaatag acacatttca acatgttaat catgggcaga aagaattgat
     7381 attgcttgtg tccaggtggt agaattactg aagaactttt aaaggtttat ttgtttttat
     7441 tttatgtgtg ggagtgtttt gcctgcagga tttgagtgtg ctacgcgcac gtctgttctt
     7501 cttggagggt gtcagatggc ctgaactgaa gttacagttg gttgtgagcc aactgtgtag
     7561 atgctggaaa ccaaatcccc atcaggtcaa acaaatgctc tgaactgctg agccatctct
     7621 ccagccgcct ggcggtagaa ttattttaaa aatgcttttg tatgccaggc atggtggctc
     7681 acagctgtaa agccagaact tgggaggtag aggcagaagg atcagcagtt tgggccatca
     7741 caggctggct acattgtgag tatgagggtc tcaaaaaaca aaacaaaaat tatcccaaag
     7801 ataaagaaca acgacaactc caaaacaaac aattaaccca cttctggcaa aagcgacagc
     7861 gtacacttag aaaagctcac tagaagtcat ttggttttta ttttgatatt agcttttact
     7921 taggtatcaa ttcgtttgtg tgtttagttt tgagataccc cagactatag ttgtttgttt
     7981 ttcaagggaa ggacttcact gtgcctgttg gtgtaacctt tagtgagtgt cagagcttgc
     8041 tgggccttga ctcgaggtct agtttatctg gtattgcaga aacttcctat acagctctgg
     8101 gtcctgtcac acactctccc accctgcctg gtctgctttc tgctgaggga aaacattagc
     8161 ttccagaggc aggtagcacc agaacttgtt tgctctgcct gtttcccctc tgacctcagt
     8221 tctgatgttg gctcccacac ccagatgaag gtgtgttcgt ggcattgacc tgatgatgca
     8281 gctgccctgg ctcccttcgc atgcccctct gccctgcagc ttctgcgtca gcgccagtat
     8341 ttgctggacc acttgccaac ccctgtgcag cctgctgctc tttgatctgt atacctgaag
     8401 ccaagccagc ccggcctgta agcagctgtt ctagagcttt gcaacccgca tcctaagcca
     8461 aggtctcatc taaggcaaaa ataagggggt tattttctac agggacaggt gcgggacact
     8521 cacagatgct tcaagaacac agtgggccga gaaaatgaac aatgtccgtc tttgattaaa
     8581 tggggcaaag gcaacatctt taatattatc attccgtcaa tcactttgag aaaagcttat
     8641 ttaagaagac aattcccaat ggttttcaaa ccaccggtta ggaaggtaat tagcctttct
     8701 gaaaatacca tcaaggagtc tgaagaactg taacgtcggg gtgtggtggc ccacaccttg
     8761 tcctcccagc acctgggagg ccgagacaaa gaagatgtag agttgggggc cagcttctgg
     8821 tactctcgat gttcaaaacc aaaacagaac ggtttaaaaa ttaagactct ggcatagaat
     8881 attctagaat caaacgtacc acaaacagat tcaggaatac tcggaattca gtataaacta
     8941 aagcaacttt ttaaaattag tagggagaac caggtgtggt ggtacacacc tttaattcca
     9001 gcacatggag gcaggggcag gcagatcttt gtgagttcaa gggcagcctg gtctacatgg
     9061 caagttccag gctagccagg gcagcatagt gaagctctgc ttacaaaagc aaaacaaaaa
     9121 accgggaaaa ccttcagttt ttgaataaga cacatatgtg agttcctaag atgcatgaac
     9181 gccacagaga ccactcaaga gagcaccaag gctcctacgc aaaggcaagt tagctccaga
     9241 gccctaggag agctgctcat gggacagggc agcgacagcc accaagctga gagtcagctt
     9301 gtgtgcccag gagggaggcg ttgggtcaga gcctctggag gacccctgaa gtctcttctc
     9361 agtgttctct atcacaggga gagctgtcag cccctggaat gtggttctat gtctagaaaa
     9421 ctatcccata aataacagga agcccaaggt ttaccaatct ctgctgtaca ggatgttcta
     9481 gctactttat ttgcaataga aaatctgaaa gtttccccat gtccaacaag actagaacaa
     9541 acaagtcctg cgtagtcgcc tgtcggtttc tgggtgtggt ggtatagccc tgtaatccca
     9601 gcatttggga agctgaggtg ggaggatcgg gagttcaagg tcagcttggg ctacttagaa
     9661 agaccttgtc tcaaaagaag tggagggggg gtggtggtgg tggtggtggt ggtgtaaaat
     9721 tgatctcttt gtatgataat gtccatacaa tatattaata ttgaaaacag tcatgccata
     9781 gaacaatttt gttcagtaaa tgtacatcac ataagtaact tcaaatttta gaagggataa
     9841 ttttaaagtc aggcaggcag cattaaagaa gaatgaggaa atcaatttaa aattcttttt
     9901 taaagactta cttactggat ttttgcctat gggtaatttg catgtatgtt ataacaccct
     9961 gtatgtttct ggttcctatg gggatcagaa gagggtatta ggtcctctgg gactggaggt
    10021 atagaccgta gtttgtgagt caccgtgtgg gttctaagaa ctgaaccggg tcttctggaa
    10081 gggcagccag tgctcttaac ctctgagcca tcttgccatc cccaataaaa ttagttaaga
    10141 aattattatt attgttgttg tcgttttcgt ccttattagt gtgtgtgttc agcagtcagt
    10201 tctgtccttc cacagtggca tctaagagtg gaacctaggt tggtggtcag gcttgtgcag
    10261 gaagtgattt ttgccggctg agcaatctca tcaggcttcg gacatttttt aactgttttt
    10321 atatttttat tttatagttt ttattgtttt aattcaaagt gtcagagaat ctttcaaaat
    10381 gttgcaatcc atataaaaat gaacgaggtg gttggcattt ttaacactaa gtcttcgcaa
    10441 tgcaacgtct cttttatatc aacaatgcat ctcactttag accagccaca tatcaaggat
    10501 caaaagtcaa ttatggctgt agccattata ggaggccgtg caactctaaa gtatgtacag
    10561 ccggcggtac agcgtttgct ttggttcaat ttcctatatc aaaaaggagt gtgtgggagg
    10621 gagttgtaga aagtgactga ggaagacacc cagtgtccta atgtcaacag gaggatccac
    10681 agcaggactc tggcaacacc cgtgtttatg aaacatctct gcgtgctgta gcttttattg
    10741 gggggtttca gtgatccgtg tgtttgggca ctagaactct gtcggaccca gggggatgtg
    10801 agccttgtgc atctaaatcg cagtccgcct tttcaggccc atcgcaggaa gaagccattg
    10861 tagagatgca gcgtggtggt gggagcactg cacatgcgca gagacgctac tatgcaaata
    10921 atagtctagc gcctcttgtg gacggtgttg tagctgcggt ctgtgcctgt tggagcaagg
    10981 agcctgtgtg gccctggaat cagattacct gctttagggt aggagagaga tcctaccact
    11041 gttgttggtt cctttgttaa cgttgtgatc agttacaagc tgctctggga attttccact
    11101 gtttaaagaa cgacttcaga tcccttaact cgctgcactc agttttgtac tctgatgaaa
    11161 atgaagctgc agagatcttt ttggctgcct tgtttcttga ttgattatta atttgtttga
    11221 actgcccagt ttgttctaac tggagtccaa aactctgtgg tgtgccaaga aaaggaaatt
    11281 aaggcatagg gcaaagaaat aataataacg actacggtga cgacgatgat gatgacgatg
    11341 taaatgccat tatcgaacga gagcatgatt ctatctgtgc tttgaaaact tattaaattt
    11401 tgtacgcatt tattctgcat ttctatttag ctatccgcat acttatctat tttggtgaat
    11461 cagtgaaatt ttttatttgt ttttcccttg gaatgcggtt gaatttttgt ggatattacc
    11521 gttgtccgta gcaaatccca catatggtag atgggtattc aaccaaactc ttatcattgc
    11581 ttatatccag gagtgagact agagtgcaca gagaaagggg taccagaagg accctcgttt
    11641 aaatgtattt ctgtactgtt aacacaattt tcataaccaa aaaagaccaa taaagttaat
    11701 cactgtcacc ccaactgccc tccaccattt tctcaaacat ctcaccaaat gactagaaag
    11761 agttaacagg attccagata cttgatgtaa ggacaaatcc cagattggaa ggtggcccag
    11821 ggttggggtg agaaacagca gagtgggggg tggggtatgg gggtaggtcc gggggaggga
    11881 cttagttctc actctcaacc aatagcacga aggcttcggg cccaacgccc attggctgaa
    11941 actatttcaa gggtcaggac tgcacctgag ctc
//
LOCUS       DRRADHA      1321 bp ds-DNA             INV       30-JUN-1990
DEFINITION  D.orena alcohol dehydrogenase (Adh) gene, 5' flank.
ACCESSION   M33488 Z00032
KEYWORDS    alcohol dehydrogenase.
SOURCE      D.orena DNA.
  ORGANISM  Drosophila orena
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1321)
  AUTHORS   Moses,K., Heberlein,U. and Ashburner,M.
  TITLE     The Adh gene promoters of Drosophila melanogaster and Drosophila
            orena are functionally conserved and share features of sequence
            structure and nuclease-protected sites
  JOURNAL   Mol. Cell. Biol. 10, 539-548 (1990)
  STANDARD  simple staff_review
BASE COUNT      409 a    295 c    260 g    357 t
ORIGIN      
        1 tctagattga ttctactacg ctgcctccag cagacacgcc cccccagcac cccatcccct
       61 cccgctggca ccagtttcag tattacacgt atggaaatta agccgaagtt caattgcgag
      121 cgcaggaaca acacgagctt gcatctttct acacttcttg gcgcgctatg cttgacattc
      181 acaaggtcga agctcttagc gttctgactc ggggtgctac actgcacaaa attacattat
      241 gcattcttca aatatcccta atataccaaa tatgtattaa aaagtgatca tttaccgatc
      301 gttcgaaaga cggtataggc ttacaaaaaa atttgccaaa gtaatttaca aattcataat
      361 ggttttttca agtatataat atgtacatca atattttctt acatgtattt tatgggatga
      421 ttatgtttta aactacactc aattttttct cagtgcactt cctggtgttc cattttctat
      481 tgggtccgta cccggcgttt gtttgcagat cacttgcttg cctattatta tagcatttta
      541 cacattacaa aattctggac gtcgctgctg cagccgctgt cgacgtcaac tgcactcgcc
      601 cccacgacaa aacggtattt aaggcgctgt gcaagtccca gtcgccgatt atgtctcagt
      661 gcagttgtca gttgcagttc agcagaccgg ctagcgagta cttgcatctc ttcaaattta
      721 cctaattgat caagtaagtg ggaaagggac ctcattatgc aatgtcgaat agtaagagat
      781 cactatcact aatggtggag cataataaaa tcaattgcat gcaatcgaaa tgaatgcaaa
      841 ccggcacaag cagtagcaaa cctactaaac aaattaaaat ttggaggctg tgccgtggcg
      901 aatatttgac ttggaaaatc acctgttgtt taaccgctaa aaataggaat tttaacatta
      961 agcatccctg ttaatcggcg ccgtgccttc gttagctatc tcaaaagcgc gcgcgtgcag
     1021 acgagcagta attttccaag catcaggcat agttgggcat aaattataaa catagaaact
     1081 gaatactaat atagaaaaag ctttgccgcc acaaaatccc aaacaaaaac aaaatatgtg
     1141 ccgaaaaata aaacaaaaat aaaccgtaaa ccgggcagcg ttgccgtcgt tgcgggctgt
     1201 gaagcttacg tgaatagccg agagatcgcg taatgataga taaagaaagc tctacgtaag
     1261 cgaagcttct gggggataga tcttcctata aatacgggac cgacgcgaac tggaaacgaa
     1321 c
//
LOCUS       M33489       1380 bp ss-mRNA            BAD       30-JUN-1990
DEFINITION  D.melanogaster alcohol dehydrogenase (Adh) mRNA, 5' flank.
ACCESSION   M33489 Z00032
KEYWORDS    alcohol dehydrogenase.
SOURCE      D.melanogaster, cDNA to mRNA.
REFERENCE   1  (bases 1 to 1380)
  AUTHORS   Moses,K., Heberlein,U. and Ashburner,M.
  TITLE     The Adh gene promoters of Drosophila melanogaster and Drosophila
            orena are functionally conserved and share features of sequence
            structure and nuclease-protected sites
  JOURNAL   Mol. Cell. Biol. 10, 539-548 (1990)
  STANDARD  simple staff_review
COMMENT     Secondary reference.
BASE COUNT      426 a    290 c    241 g    423 t
ORIGIN      
        1 tctagattga ttctacgctg cctccagcag ccacccctcc catccccatc cccatcacca
       61 tccagtcccg ttggctccca gtcacagtat tacacgtatg caaattaagc cgaagttcaa
      121 ttgcgaccgc aggaacaaca cgatctttct acacttctcc ttgctatgct tgacattcac
      181 aaggtcaaag ctcttaatat tctggcttgt ggccctacac tgtaagaaat tactatagaa
      241 ataagggtac acggaataag atattttttt tagtccatat gcttttaaca aatgtgtttt
      301 gagtttatgt tatattattg ttagaaaaca ggtgtttttt tttaaatcgg ttaaaaaatt
      361 actacgagag aaaaatacaa attttgtaaa taagattgac tctttttcga ttttggaata
      421 ttttcattca ttttatgttt ttacgttttc acttatttgt ttctcagtgc actttctggt
      481 gttccatttt ctattgggct ctttaccccg catttgtttg cagatcactt gcttgcgcat
      541 ttttattgca ttttacatat tacacattat ttgaacgccg ctgctgctgc atccgtcgac
      601 gtcgactgca ctcgccccca cgagagaaca gtatttaagg agctgcgaag gtccaagtca
      661 ccgattattg tctcagtgca gttgtcagtt gcagttcagc agacgggcta acgagtactt
      721 gcatctcttc aaatttactt aattgatcaa gtaagtagca aaagggcaca caattgaagg
      781 aaattcttgt ttaattgaat ttattatgca agtgcggaaa taaaatgaca gtattaaata
      841 gtaaatattt tgtaaaatca tatataatca aatttattca atcagaacta attcaagctg
      901 tcacaagtag tgcgaactca attaattggc atcgaattaa aatttggagg cctgttccgc
      961 atattcctct tggaaaatca cctgttagtt aacttctaaa aataggaatt ttaacataac
     1021 tcgtccctgt taatcggcgc cgtgccttcg ttagctatct caaaagcgag cgcgtgcaga
     1081 cgagcagtaa ttttccaagc atcaggcata taatatacta atactaatac taatactaat
     1141 ataagaatac taatatagaa aaaaagcttt gccggtacaa aatcccaaac aaaaacaaac
     1201 cgtgtgtgcc gaaaaataaa aataaaccat aaactaggca gcgctgccgt cgccggctga
     1261 gcagcctgcg tacatagccg agatcgcgta acggtagata atgaaaagct ctacgtaacc
     1321 gaagcttctg ctgtacggat cttcctataa atacggggcc gacacgaact ggaaaccaac
//
LOCUS       RATRIBOI     1456 bp ds-DNA             ROD       30-JUN-1990
DEFINITION  Rat ribophorin I (Rpn-I) gene, 5'end.
ACCESSION   M33508
KEYWORDS    ribophorin I.
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1456)
  AUTHORS   Behal,A., Prakash,K., D'Eustachio,P., Adesnik,M., Sabatini,D.D. and
            Kreibich,G.
  TITLE     Structure and chromosomal location of the rat ribophorin I gene
  JOURNAL   J. Biol. Chem. 265, 8252-8258 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Kreibich, 03-APR-1990.
FEATURES       from  to/span     description
    pept       1362  >  1456     ribophorin I (Rpn-I)
    mRNA       1338  >  1456     Rpn-I mRNA (alt.)
    mRNA       1341  >  1456     Rpn-I mRNA (alt.)
    mRNA       1347  >  1456     Rpn-I mRNA (alt.)
    mRNA       1352  >  1456     Rpn-I mRNA (alt.)
    mRNA       1353  >  1456     Rpn-I mRNA (alt.)
    mRNA       1359  >  1456     Rpn-I mRNA (alt.)
    binding    1314     1324     SP1 binding site
    binding    1325     1333     SP1 binding site
    binding    1247     1256     SP1 binding site
    signal     1297     1300     CAAT box
BASE COUNT      342 a    325 c    336 g    453 t
ORIGIN      
        1 ctgcaggtgc taatttcctg acttcttctc tgtagggagg agcatgtcct tccctccgct
       61 ctctcattag ccagtgtcag tgtttgaata tttccatctc cttccaaaat ttttttttct
      121 tttttttttt ttcggagctg gggaccgaac ccagggcctt agcgcttgct aggccaagct
      181 accactgagc taaatcccca acccccaaaa tttaccttta aattacatcc ttcaagcaac
      241 aatattggga catgttcatt agatcagaat tgataagagg caacacggaa atgaaagagg
      301 tgacagatct ctaccaaggc cttattgccc ttagtcagag cgaatgaagg agcagcagtc
      361 tccttttgtg agtggagact gtacaccaca atcgtgcagg gggcttcgag attttatagg
      421 gttgaaacgg gactttaggg gagcaaagtt ttatcagtag atttttcttt atcattatta
      481 tcatcattgt tgttgttgtt tttaaatggt ttcaaggccc aataatgcca gtagatgttc
      541 tctttgctac attatttttc ttatcaggtg ttatcagtat accgactaga gtttttccac
      601 tagcttgaag ttcatttgtt ttaggtcatt aattcttcag tgtgactttg ggagagacag
      661 cactagagga atccatctgc aagaagcctt tgtaaaaggt cttgccgcag agagaccctt
      721 tcctgccttt ctgtcatatg acgatatgcg attccttctt tagacaatgc aatgccaaag
      781 ttccatcttg gtagtaagga gcagccctca cctgacagct tagtcctgcc agtaacttca
      841 ccttggactt ctacaattgt gctaagtgcc aacagatact tgatgttaat atcatcagcc
      901 ttggttagtt gcttaaggtt tctgccatcg ctcatattaa aaattgaact aaatattagg
      961 ggagagactt gaatctttgg aaatttcccg tttttcttta aacttttgcc cacttagtat
     1021 tcatatcaat gtagagaagg caaacttatt tccttagact aagctgtttc tgagcggaga
     1081 ggccattttt tttttttttt ttttttttgc ataagcatga aggaaacagg agataaggaa
     1141 ggacgactac cacagaaatc actggtgagt ctagctaata agcgctcaaa gacgtgtgcc
     1201 tgcgctttaa gatttttcaa tcaggcgccc catcgcggtt gccttggaga cggggtcctt
     1261 cctgacccgg cccagccatc ttgtggtgct gttggccaat caacggttgc gccgtggcgg
     1321 gggggggcgg ggcgcgtgtc ggtggcgctc agctcgcggt catggaggcg ccgatcgtct
     1381 tgctgctgct gctatggctc gccttgcccg acgcctggca gcgcctcttc ggaggctccg
     1441 ccgctggtca acgagg
//
LOCUS       DROCYCB      2612 bp ss-mRNA            INV       30-JUN-1990
DEFINITION  D.melanogaster cyclin B mRNA, complete cds.
ACCESSION   M33192
KEYWORDS    cyclin B.
SOURCE      D.melanogaster 3-12 hour embryo, cDNA to mRNA, clone Nr 4/4.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2612)
  AUTHORS   Lehner,C.F. and O'Farrell,P.H.
  TITLE     The roles of Drosophila cyclins A and B in mitotic control
  JOURNAL   Cell 61, 535-547 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.F.Lehner, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        250     1842     cyclin B
    mRNA    <     1     2612     cyclin B mRNA
BASE COUNT      710 a    670 c    635 g    597 t
ORIGIN      Chromosome 2; map postion 59A
        1 aattcaaacg gcagcttggc ctcgcttcgt gtgcaaaact cgatcaggtt ttcggataat
       61 tgacaatccc gcactcgact tgtaaattat aaactattag taatcatata ttcggacaac
      121 aaattcaaca gctaaacaag cgttcggtca cagaaacgcg atcaaaagtc aaccaaacca
      181 agtgatagcc aagcgtctgc ctatcttcgt gttaattgtg tttgtacaga tagaaaagaa
      241 gcaatcaaaa tggtgggcac aacactgaaa atgcgtggcg atgagaacgc ttcggagaac
      301 ttcaagcaag tgcaattgaa gaaattgacg gttccttcca tggaggcaac aacaaaacgc
      361 gcggccttgg gcgatttgca gaatcgcggc ataagtcgtc ccatcgcagc gaaggatgcg
      421 gcacagaaag actccaagga tctcaagctc acagacgccc tgcgcaatgc caaagctcgg
      481 gtggacagcc actggaagaa acagccactg ggcagcacca atggcaatgg caatggcgcc
      541 gttccgccca aggtcaacga ggggggcgtg tcggcgtttt tgcgttcgaa ttcggtgcgc
      601 aatcgcgttc cgaccaagac cactgtagaa cccactaaag ttacagtcaa gtccagttct
      661 tccgagaacg tgaacgagcc caccttgaag cgcgaggaca gcaatctgtc gaagaagtcg
      721 ctgaccaaac tcggtgccgc tttggccaaa cccgtgatgg gagtttcagg aattcgacgg
      781 gaaccagtag ctgtttcccg caaagaggca gagaccaaga aggaactgcc agaaaccaag
      841 aaggactcac tggaagtgaa aaaggatgcg accaggatgc cccttattag gggcaacagt
      901 gcagtcacta cgaccacatc cacgatgccc accaccatgt ccctttccag caagcgcttg
      961 gctgggatcg aggacattga tgccaatgac aaggagaacc tggtactggt ctccgaatat
     1021 gtaaacgaca tctacgacta cttgtatcag gtggagctgg agcagcccat tcacaaggat
     1081 cacctggccg gacagaagga ggtgtcccac aagatgcgag ccgtgctgat cgattggatc
     1141 aacgaagtcc acctgcagtt ccatctggct gcagagacct tccagctggc ggtggctatc
     1201 attgatcgct acctgcaggt ggtcaaggac accaaacgca cgtacttgca attggtggga
     1261 gtgacagcac tcttcatagc caccaagtac gaggagctgt tcccgccggc aatcggagat
     1321 ttcgtcttca tcacggacga cacctacact gcccggcaga tccgacagat ggagctgcaa
     1381 atcttcaagg ccatcgactg taatctgtcg cgtccgctgc cgattcactt ccttcgacgc
     1441 tactcgaagg ctgctggcgc cgaggacgag caccatacga tgtccaagta cttcatcgag
     1501 ttagcttccg tggactacga aatggccact tacaggccat cggagattgc agctgcctca
     1561 ctgttcctgt cgctgcactt gctcaatgga aaccaccggg ccggtacagg attcaacgac
     1621 cgtcactgga cgcccactct gaccttctac tcgcgatact cggccgcgca cttgcgtccg
     1681 attacccggc tgatcgcgaa actggcccgg gacgctcctc aggccaagct gaaggccatc
     1741 tacaacaagt accagggcag caagttccag aagatcgcgc tgcgaacgga gctgaccggt
     1801 gcgctgatgg actcgattgt gggccagagc cagaggaaat agtgcggtcc aaggcggact
     1861 ggaagaccct gacttaccta gtttagttta atttgttttc atttttaaat ttgtagcgta
     1921 tttcattttc tgttcgtttc gtgttcgtta aaaaatgcgt atagttaccg tagtcgcatt
     1981 gccaactatc tttacctgca tcacccatcc ctaagatatc gtaatctgct ggagtccctt
     2041 gagcagtttt cggctactgc caagagctgg ctcccggcat ctttgcccac ggagcacaag
     2101 ttgctcgcga gccggccgct gggagtgagt tcctccggta agactaggaa ccgaactaaa
     2161 ctggagcccg tcagctcctt tcgctgggtc accatctcag tgggagcgag cggcggagcg
     2221 gtggacaccg agagagagag tgggcaacga gttcatttgc tggccgaaca catcggcgtt
     2281 gtctctccaa gaattatatt ttataattgt acccgtgaaa tcatttcatc ccctacctac
     2341 ctatcactaa ttgtatttgt cgttagtttt tagtttcgat gggaatgaat aaatccaaaa
     2401 aattgtgcaa aattattccc ataaatcgca taaaaaagag acgtagacta tttgtaattt
     2461 atatcatgta tttcgcacat tcatacaaaa acaacttatg ctagttatat tgtttaccac
     2521 taagcaacga ttaaaacacg ttttaactaa ctgatgtttc ccaaaaatgt ctctgaacaa
     2581 tacatagaaa ataaagtttt gtataatttt at
//
LOCUS       RATSV4I1       37 bp ds-DNA             ROD       30-JUN-1990
DEFINITION  Rat/SV40 5' recombination junction.
ACCESSION   K01218
KEYWORDS    .
SEGMENT     1 of 3
SOURCE      Rat (strain Fisher) SV40-transformed isogenic cell line SV14B.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 37)
  AUTHORS   Sambrook,J., Greene,R., Stringer,J., Mitchison,T., Hu,S.-L. and
            Botchan,M.
  TITLE     Analysis of the sites of integration of viral DNA sequences in rat
            cells transformed by adenovirus 2 or SV40
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 44, 569-584 (1980)
  STANDARD  simple staff_review
COMMENT     The three segments of this entry represent an integration of the
            SV40 genome in which two SV40 fragments are integrated in opposite
            orientations with a 40 bp insert of undetermined origin between
            them. The authors report that SV40 and adenovirus inserts into host
            cell DNA appears to be chaotic, with inversions, deletions, and
            recombination with segments of uncertain origin.
FEATURES       from  to/span     description
    recomb       25       26     rat DNA end/SV40 DNA start
BASE COUNT       12 a      8 c      5 g     12 t
ORIGIN      About 57 bp 3' to EcorI site.
        1 acttcacatt ctaggatgtc tggctactaa aaaatct
//
LOCUS       RATSV4I2       58 bp ds-DNA             ROD       30-JUN-1990
DEFINITION  Rat/SV40 recombination junction DNA and an insert of undetermined
            origin.
ACCESSION   K01219
KEYWORDS    .
SEGMENT     2 of 3
SOURCE      Rat (strain Fisher) SV40-transformed isogenic cell line SV14B.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 58)
  AUTHORS   Sambrook,J., Greene,R., Stringer,J., Mitchison,T., Hu,S.-L. and
            Botchan,M.
  TITLE     Analysis of the sites of integration of viral DNA sequences in rat
            cells transformed by adenovirus 2 or SV40
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 44, 569-584 (1980)
  STANDARD  simple staff_review
COMMENT     The three segments of this entry represent an integration of the
            SV40 genome in which two SV40 fragments are integrated in opposite
            orientations with a 40 bp insert of undetermined origin between
            them. The authors report that SV40 and adenovirus inserts into host
            cell DNA appears to be chaotic, with inversions, deletions, and
            recombination with segments of uncertain origin.
FEATURES       from  to/span     description
    recomb        9       10     SV40 DNA end/insert of unkown origin DNA start
    recomb       49       50     insert of unknown origin DNA end/SV40 DNA
                                 start
BASE COUNT       11 a     17 c     11 g     19 t
ORIGIN      About 3850 bp after segment 1.
        1 tttgtgatct ttgcaatccc acagattgca gcataccagg cttccctgtc cttgtgca
//
LOCUS       RATSV4I3       40 bp ds-DNA             ROD       30-JUN-1990
DEFINITION  Rat/SV40 3' recombination junction.
ACCESSION   K01220
KEYWORDS    .
SEGMENT     3 of 3
SOURCE      Rat (strain Fisher) SV40-transformed isogenic cell line SV14B.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 40)
  AUTHORS   Sambrook,J., Greene,R., Stringer,J., Mitchison,T., Hu,S.-L. and
            Botchan,M.
  TITLE     Analysis of the sites of integration of viral DNA sequences in rat
            cells transformed by adenovirus 2 or SV40
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 44, 569-584 (1980)
  STANDARD  simple staff_review
COMMENT     The three segments of this entry represent an integration of the
            SV40 genome in which two SV40 fragments are integrated in opposite
            orientations with a 40 bp insert of undetermined origin between
            them. The authors report that SV40 and adenovirus inserts into host
            cell DNA appears to be chaotic, with inversions, deletions, and
            recombination with segments of uncertain origin.
FEATURES       from  to/span     description
    recomb       16       17     SV40 DNA end/Rat cell DNA start
BASE COUNT        9 a      8 c      7 g     16 t
ORIGIN      About 770 bp after segment 2.
        1 gctttttaaa tcctcacttc aggggtaaga gtttcattct
//
LOCUS       RATSV4J1       42 bp ds-DNA             ROD       30-JUN-1990
DEFINITION  Rat/SV40 DNA recombination junction, clone pEM2.
ACCESSION   K02743
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Rat (strain Fisher) SV40 transformed cell line tsA30-N2 DNA, clone
            pEM2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 42)
  AUTHORS   Hasson,J.-F., Mougneau,E., Cuzin,F. and Yaniv,M.
  TITLE     Simian virus 40 illegitimate recombination occurs near short direct
            repeats
  JOURNAL   J. Mol. Biol. 177, 53-68 (1984)
  STANDARD  full staff_review
COMMENT     [1] reports that of two junctions between rat and SV40 DNA studied,
            one was a very abrupt transition, whereas the other (shown here)
            had an area of homology (bp 16-26) between the two sequences at the
            junction.  The more likely recombination site at base 26 is given
            in the sites table, although base 16 might be another candidate.
FEATURES       from  to/span     description
    recomb       25       26     rat DNA end/SV40 DNA (base 439) start
BASE COUNT        6 a      6 c     10 g     20 t
ORIGIN      
        1 atggctgtat gcttttttta gagcctttgc tgcaattttg tg
//
LOCUS       RATSV4J2     1357 bp ds-DNA             Rod       30-JUN-1990
DEFINITION  Rat/SV40 recombination junction, clone pEM5.
ACCESSION   K02744
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Rat (strain Fisher) SV40 transformed cell line tsA30-N2 DNA, clone
            pEM5.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1357)
  AUTHORS   Hasson,J.-F., Mougneau,E., Cuzin,F. and Yaniv,M.
  TITLE     Simian virus 40 illegitimate recombination occurs near short direct
            repeats
  JOURNAL   J. Mol. Biol. 177, 53-68 (1984)
  STANDARD  full staff_review
COMMENT     [1] reports that of two junctions between rat and SV40 DNA studied,
            one, shown here, was a very abrupt transition, whereas the other
            had an area of homology between the two sequences at the junction.
            
            A member of an interspersed middle-repetitive family bordered by 15
            bp direct repeats is present at positions 595 to 790. Short direct
            repeats (ttg)6 and (tatc)12 are present at positions 53-70 and
            310-357, respectively. [1] speculates that the presence of these
            short repeats provides a good environment for the viral integration
            process.
FEATURES       from  to/span     description
    recomb       24       25     SV40 DNA (base 705) end/rat DNA start
    unsure      306      307     tt could be ttt
BASE COUNT      336 a    273 c    228 g    520 t
ORIGIN      About 5 kb after segment 1; 823 bp upstream of HindII site.
        1 caacttgcat ctgttgctac tgttttttac tctgatgttg ctactgagaa ttttgttgtt
       61 gttgttgttg gtggttttta aagacagtct cactatgtag tccaggctgg cctggaactt
      121 gctctgtagg gtaagctggc catgaactca gtttgtatcc agtagctaga gtgagtcctt
      181 gaacttggtc ctcctgcctt tgcctctcaa ctgctggcag ctactgagaa acttaactta
      241 atctcattgc ttttttctct gacttttttt agaattggtt tttaatctta atactctatg
      301 gtgcattcct atctatctat ctatctatct atctatctat ctatctatct atctatcacc
      361 tatctctctt atatatctgt accctgtact ttttcagctt tttattatgg ggctcacgca
      421 catgcacata atacgtctgc ataccatggc acacatgtga aggccacaca tgtagactaa
      481 tcagttttct ccttctacta tgtgcatcct gggaatgaac ttaggtcatc gggcttggtg
      541 gcaggtattt ttgtttgcta agctacctca ccagcccctc tggttctttt taaattcttt
      601 tgagtttttt tttaatattt atttatttat tatacataag tacactgtag ctatcttcag
      661 acacaccaga agagggcatt ggttgtgtga gccaccatgt agttgctggg atttgaactc
      721 aggacctctg gaagagcagt cagtgctctt aaccgctgag ccatctctcc aacccttctt
      781 ttgagttttt atgtgtcttt tctttttata aatttctaaa gtcaacgtaa aaagcattgg
      841 atttgattac agcatgtttt tcttttatta aaaatattga ggcttttaac tctttagaga
      901 caataaaccc aataagcatt agcataaata ccaattctta gaacaataca ttttcaaaca
      961 cagaaacatt tactgaaaaa aggaactttt ttgcatcttt ataaatctgc tctattctag
     1021 cttataaggt aaatgcttta ggctcatgtc caatttatat tcaatgtttg ttgatgtctg
     1081 caatctatga ggaaaatcta acctcacaaa ggtgcatggt ttagaaacag aaagtttgac
     1141 tttggttttg ttgttgtttg gatagtggtc tgtttggttg tgagtgtcac atttccttac
     1201 atatgtgtgt gtcattatac ttttacatat tacttttaca tatgtgtcat tataactttg
     1261 cctaatttga ccctctcctc ccctgctacc cgactctatt gctggttcaa ttcttcaccc
     1321 caaatagtct ctcccagtat tcaggtcaca tgaattc
//
LOCUS       RATSV4M1       37 bp ds-DNA             ROD       30-JUN-1990
DEFINITION  Rat/SV40 5' recombination junction.
ACCESSION   K01093
KEYWORDS    .
SEGMENT     1 of 3
SOURCE      Rat cell line 14B DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 37)
  AUTHORS   Botchan,M., Stringer,J., Mitchison,T. and Sambrook,J.
  TITLE     Integration and excision of SV40 DNA from the chromosome of a
            transformed cell
  JOURNAL   Cell 20, 143-152 (1980)
  STANDARD  full staff_review
COMMENT     The authors report that parts of the SV40 genome are integrated in
            opposite orientations and that a segment of SV40 has been deleted
            in rat cell line 14B. There is a 40 bp insertion of unknown origin.
            [1] analyzes 3 segments with rat proviral boundaries.
FEATURES       from  to/span     description
    recomb       25       26     rat DNA end/SV40 DNA start
BASE COUNT       12 a      8 c      5 g     12 t
ORIGIN      
        1 acttcacatt ctaggatgtc tggctactaa aaaatct
//
LOCUS       RATSV4M2       58 bp ds-DNA             ROD       30-JUN-1990
DEFINITION  Rat SV40 proviral DNA with insert of undetermined origin.
ACCESSION   K01094
KEYWORDS    recombinant DNA.
SEGMENT     2 of 2
SOURCE      Rat cell line 14B DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 58)
  AUTHORS   Botchan,M., Stringer,J., Mitchison,T. and Sambrook,J.
  TITLE     Integration and excision of SV40 DNA from the chromosome of a
            transformed cell
  JOURNAL   Cell 20, 143-152 (1980)
  STANDARD  full staff_review
COMMENT     The authors report that parts of the SV40 genome are integrated in
            opposite orientations and that a segment of SV40 has been deleted
            in rat cell line 14B. There is a 40 bp insertion of unknown origin.
            [1] analyzes 3 segments with rat proviral boundaries.
FEATURES       from  to/span     description
    recomb        9       10     SV40 DNA end/DNA of unknown origin start
    recomb       49       50     DNA of unknown origin end/SV40 DNA start
BASE COUNT       19 a     11 c     17 g     11 t
ORIGIN      About 3085 bp after segment 1.
        1 tgcacaagga cagggaagcc tggtatgctg caatctgtgg gattgcaaag atcacaaa
//
LOCUS       RATSV4M3       40 bp ds-DNA             ROD       30-JUN-1990
DEFINITION  Rat/SV40 3' recombination junction.
ACCESSION   K01095
KEYWORDS    .
SEGMENT     3 of 3
SOURCE      Rat cell line 14B DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 40)
  AUTHORS   Botchan,M., Stringer,J., Mitchison,T. and Sambrook,J.
  TITLE     Integration and excision of SV40 DNA from the chromosome of a
            transformed cell
  JOURNAL   Cell 20, 143-152 (1980)
  STANDARD  full staff_review
COMMENT     The authors report that parts of the SV40 genome are integrated in
            opposite orientations and that a segment of SV40 has been deleted
            in rat cell line 14B. There is a 40 bp insertion of unknown origin.
            [1] analyzes 3 segments with rat proviral boundaries.
FEATURES       from  to/span     description
    prov    <     1       16     SV40 DNA
BASE COUNT        9 a      8 c      7 g     16 t
ORIGIN      About 818 bp after segment 2.
        1 gctttttaaa tcctcacttc aggggtaaga gtttcattct
//
LOCUS       XELXOTCH     9163 bp ss-mRNA            VRT       30-JUN-1990
DEFINITION  X.laevis Xotch protein mRNA, complete cds.
ACCESSION   M33874
KEYWORDS    Xotch.
SOURCE      X.laevis, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 9163)
  AUTHORS   Coffman,C., Harris,W. and Kintner,C.
  TITLE     Xotch, the Xenopus homolog of the Drosophila notch gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by C.R.Coffman, 26-APR-1990.
            
                        Author address: C.R.Coffman
                        University of California
                        Dept. of Biology-B022,
                        La Jolla, CA 92093
FEATURES       from  to/span     description
    pept        346     7917     Xotch protein
    mRNA    <     1     9163     Xotch mRNA
    signal     9146     9151     poly-A signal
    rpt         409     4632     EGF-like repeats
    rpt        4684     5025     lin-12/Notch repeats
    rpt        5953     6591     cdc-10/SWI-6 repeats
BASE COUNT     2499 a   2192 c   2188 g   2284 t
ORIGIN      
        1 gaattccgga cgcctggtgg aactaaacag actggaatgc actggaacac ggaagcctat
       61 gcggggatac attgtagcgc tgagcccttc cgctgtaaca ttgtagcgct gagccgctca
      121 tacactgtag cgctgtaact tcaggctgtg cgcaaactga gtgcgctatt ctctctccaa
      181 tctggaaact ctgtcgcttt tggaccaggg attgcactga cttggattat cttcaccttt
      241 ctctaaggga tttatagatt tttgccttgg gacctttatc ctccccttgt cctgggacag
      301 ggaccttcat caggatctat gttcctatag tgagactgca ggaaaatgga tcggataggg
      361 ctggctgtgc tgctctgctc cctgcctgtc ctcacacaag gtcttcgatg cactcagact
      421 gcagaaatgt gcctgaacgg tgggaggtgc gaaatgaccc ctggtggaac tggggtgtgc
      481 ctatgcggta atttgtattt cggtgaaagg tgccagttcc ccaatccctg caccataaag
      541 aatcagtgta tgaactttgg aacctgcgag cctgtgctac aggggaacgc catagacttc
      601 atctgccact gccccgtggg tttcaccgac aaagtgtgcc tgactcccgt ggataacgcc
      661 tgcgtcaaca acccgtgccg gaatggagga acctgtgaac ttttgaatag cgtgactgaa
      721 tacaaatgca ggtgtccccc aggatggaca ggggattcgt gccagcaggc agatccatgt
      781 gcctctaatc catgtgccaa tggtgggaag tgtttgccct ttgagatcca gtatatctgc
      841 aagtgtcccc ctggattcca tggagctacc tgcaagcagg acattaatga gtgcagccaa
      901 aatccctgca agaatggcgg ccagtgcatc aatgagtttg gatcataccg ctgcacctgt
      961 cagaacaggt ttactggaag aaactgtgat gagccatatg tcccctgcaa cccatcgccg
     1021 tgcttgaatg gagggacctg ccgccaaact gatgacacct cctatgattg cacttgtctt
     1081 ccaggcttta gtggacaaaa ttgtgaagag aacattgacg actgcccgag caacaactgc
     1141 cgcaatggtg ggacctgtgt ggatggagtc aacacctata actgccagtg tccacctgac
     1201 tggacaggcc aatactgcac ggaggatgtg gacgaatgcc aactgatgcc aaacgcttgc
     1261 caaaatgggg gaacctgcca caacacctac ggtggataca actgtgtttg tgtcaacggc
     1321 tggaccggag aggactgcag cgagaacatt gacgattgtg ccaacgccgc ctgccatagt
     1381 ggtgccacct gtcatgatcg cgtggcatct ttctactgcg agtgcccaca cggccggaca
     1441 ggtctccttt gccacctgga taatgcctgt atcagcaatc cctgcaacga aggctccaac
     1501 tgtgacacca acccagtcaa cgggaaagcc atttgcacct gcccacctgg gtacactggt
     1561 cctgcttgca acaatgatgt ggatgagtgc tctctgggtg ccaatccatg tgagcatggt
     1621 gggagatgta ccaacaccct gggttccttc cagtgcaact gcccccaggg atatgcagga
     1681 ccccgctgcg agattgatgt caatgaatgt ctttccaacc cgtgccaaaa tgattccaca
     1741 tgcctggacc agattggaga gtttcagtgt atatgtatgc caggttacga aggcctttac
     1801 tgtgaaacca acatagatga atgtgccagc aacccctgcc tacataatgg caagtgcatt
     1861 gataaaatca acgagttccg ctgcgactgt cccacaggct tcagtggaaa cctctgccag
     1921 catgactttg atgagtgtac cagcactccg tgcaaaaatg gagccaaatg cttggatggt
     1981 ccaaattctt acacctgcca atgtacagaa ggtttcaccg ggcgccactg cgaacaagac
     2041 atcaatgaat gcattcctga tccgtgccac tacggcacct gcaaagacgg cattgctacc
     2101 ttcacatgcc tgtgccggcc tgggtacacg gggcgtctgt gtgataatga catcaatgag
     2161 tgcctgagca agccgtgcct aaacgggggg cagtgtacag accgagagaa tggatacata
     2221 tgcacttgtc cgaaaggaac cacaggggtg aattgcgaaa ccaaaataga tgactgtgcc
     2281 agtaacctgt gcgataatgg aaagtgcatt gacaagattg atggctacga gtgcacctgc
     2341 gagcctggtt acacaggcaa gttgtgcaat attaacatta acgaatgtga cagcaatcca
     2401 tgccgcaatg gagggacttg caaagaccag atcaatggct tcacatgtgt gtgcccagat
     2461 ggctaccatg accacatgtg cctgtctgaa gtaaatgaat gcaacagcaa tccatgtatc
     2521 cacggggcgt gccatgatgg agtcaatggg tataaatgtg actgtgaagc aggatggagc
     2581 ggtagcaatt gtgacatcaa taacaatgag tgtgaatcca acccgtgtat gaatggtggt
     2641 acctgcaaag acatgaccgg agcatacatc tgtacttgca aagcaggatt tagtgggcca
     2701 aactgccaaa ctaacatcaa cgaatgttcc tctaacccct gcttgaacca tggaacgtgt
     2761 attgatgatg ttgctggtta caaatgcaac tgcatgttac cttatacagg tgccatttgt
     2821 gaggcagttt tggctccgtg tgcaggcagt ccttgtaaaa atggtggcag atgtaaagaa
     2881 tcagaagact ttgaaacctt ctcttgtgaa tgtccacccg ggtggcaagg gcaaacatgt
     2941 gagattgaca tgaatgaatg cgtaaatcgc ccttgccgta acggtgcaac gtgtcaaaac
     3001 accaatggca gctacaagtg caactgcaaa cctggttaca ctgggcggaa ctgcgagatg
     3061 gacattgatg actgccaacc aaatccttgt cacaatggag gatcctgttc tgacggcatt
     3121 aatatgtttt tctgtaactg cccggccggg ttccgggggc ctaagtgcga agaggacatt
     3181 aacgagtgtg ccagtaaccc gtgcaaaaat ggagccaact gcactgattg tgtaaacagc
     3241 tacacatgca cctgccagcc tggatttagt ggcatccact gtgaaagcaa cacccctgac
     3301 tgcaccgaga gctcgtgctt taatggtggc acctgcattg atgggatcaa tacgtttaca
     3361 tgccagtgcc cacctggatt taccggaagc tactgccaac atgatatcaa tgaatgtgac
     3421 tccaagcctt gtttgaacgg aggcacctgc caggacagtt acgggacata caagtgcacc
     3481 tgtccgcaag gatacactgg gctgaactgc cagaacctgg tgcgttggtg tgactcttct
     3541 ccctgtaaaa atggaggaaa gtgctggcag acaaacaact tttatcgttg tgagtgcaag
     3601 agtggctgga ctggtgttta ctgtgatgtc cccagtgtct cctgcgaggt tgctgctaag
     3661 cagcaaggtg tggatatcgt tcatctgtgt cgcaattctg gcatgtgtgt ggacactgga
     3721 aacacacatt tctgccgctg tcaggctggc tacacaggca gttactgcga ggaacaggtg
     3781 gacgaatgct cgccgaatcc ttgtcagaac ggtgccacat gcacagacta cttaggtggc
     3841 tactcttgcg agtgtgttgc cggctaccat ggagtaaact gctccgagga gatcaatgag
     3901 tgtttgtccc acccgtgcca gaatggagga acctgtattg atctcattaa cacctacaaa
     3961 tgctcctgcc ccagaggaac tcaaggtgtg cactgtgaaa ttaatgtgga tgattgcacg
     4021 cccttctacg actccttcac cttggaaccc aaatgcttca acaatggcaa atgtatagac
     4081 cgcgtgggtg gctataactg catctgccct ccaggctttg taggggaacg ctgtgaaggc
     4141 gatgtgaacg agtgcctgtc taatccctgt gactcccgtg gcacccagaa ctgcattcaa
     4201 ctggtgaatg attaccggtg tgagtgccgg cagggattca caggaaggcg ctgtgaatct
     4261 gttgtggatg gttgtaaggg aatgccctgc cgaaacggtg gaacatgtgc tgtcgccagc
     4321 aatacagaac gtggatttat ttgcaaatgc cctcctgggt ttgacggagc cacttgcgag
     4381 tatgattcac gaacctgcag caaccttcgc tgccagaatg gcggcacttg catttcggtg
     4441 ctgacgagtt ccaaatgcgt gtgctcggaa ggctatactg gcgccacatg tcagtaccct
     4501 gttatcagtc cttgtgcttc ccacccttgt tacaatggag gaacctgcca gttctttgct
     4561 gaggaacctt tcttccagtg cttctgtcca aaaaacttta acgggctctt ctgccatatc
     4621 ttggattatg agtttcccgg gggactaggc aagaatatta ctcctcctga taatgacgat
     4681 atctgcgaaa acgagcagtg ctctgagctg gctgacaaca aagtctgcaa cgccaactgc
     4741 aacaaccatg cttgcggatg ggatggagga gactgctctc tcaacttcaa tgacccatgg
     4801 aagaactgca cccagtcgct gcagtgctgg aaatatttca atgatggcaa atgcgattct
     4861 cagtgcaaca acactgggtg tctgtacgac gggtttgact gccagaaagt tgaggttcag
     4921 tgcaaccctt tgtatgacca gtactgcaag gatcactttc aagatggcca ctgcgaccaa
     4981 ggctgtaaca atgcagagtg cgaatgggat ggcctggact gcgccaacat gccagagaac
     5041 ctggctgaag gcaccctggt gttggtcgtc ctcatgcccc ctgaaaggct taagaacaac
     5101 tctgtcaatt tcctccgtga gctgagccga gtgctccaca caaatgtggt gttcaagaaa
     5161 gacagcaaag gcgaatacaa gatctaccct tactatggca atgaggaaga acttaaaaaa
     5221 catcacatca agaggtcaac tgattactgg tcggatgctc cttctgccat cttcagcaca
     5281 atgaaagaat ccattttact aggaaggcat cgtagggagt tggatgaaat ggaagttagg
     5341 gggtcaatcg tatacttgga aatagacaac cgccagtgtt acaaatcctc ctctcagtgc
     5401 ttcaacagtg ctacagatgt tgccgcattc cttggggctt tggcctccct tgggagtctg
     5461 gacacccttt cctataagat tgaggcagtg aaaagtgaga acatggagac ccccaagccc
     5521 tctactctct atcccatgtt atccatgttg gtcattccgt tattaatcat cttcgtcttc
     5581 atgatggtca tcgttaataa gaagcgtcgc cgtgaacacg acagctttgg ttccccgacg
     5641 gctttattcc aaaagaaccc agcaaaaaga aacggcgaga cgccttggga ggattctgtt
     5701 ggcttaaagc ccattaagaa tatgacagat ggctctttta tggacgataa tcagaatgaa
     5761 tggggagatg aggagaccct ggaaaacaag aggttcaggt ttgaagagca agtgattctc
     5821 ccagaacttg ttgacgacaa aactgacccc cgacagtgga cacggcagca ccttgacgcc
     5881 gctgatctgc gcatttcatc catggcccca acgccgccgc agggagagat tgaggccgac
     5941 tgtatggatg tcaatgtccg tggccctgat ggcttcaccc cacttatgat tgcctcctgt
     6001 agcggaggtg gattagagac tggaaacagc gaagaggaag aggatgcttc agctaatatg
     6061 atttccgact tcattgggca gggtgcccaa ctgcataacc aaaccgaccg cactggcgag
     6121 acagcacttc atctggccgc aagatacgca cgtgctgatg cagccaagcg cctgttggaa
     6181 tcgagtgcgg acgccaacgt ccaggataac atgggcagga ctcctctcca tgcagcagtg
     6241 gcggctgatg ctcagggcgt attccagatt ctcattcgga accgagcgac agacttagac
     6301 gcccgcatgt ttgatggcac tacccctctg atcctggccg ctcggctggc cgtggaaggg
     6361 atggtggagg agcttatcaa tgctcatgca gatgtcaacg ctgttgatga atttggaaaa
     6421 tctgctttgc attgggcagc ggctgtgaat aacgttgatg ctgcagctgt gcttctcaag
     6481 aatagtgcaa ataaggacat gcaaaacaac aaggaagaga catccctgtt cttggccgca
     6541 agagaaggca gctacgaaac tgccaaagtc cttttggatc actacgccaa ccgtgacatc
     6601 acagaccaca tggatcggct gcctcgtgac atcgcccaag aacgcatgca ccacgacatt
     6661 gttcacctgc tggatgaata taaccttgtg aagagcccaa cgctgcacaa tggtccgttg
     6721 ggagcaacga cattatcacc tcccatctgc tcccctaatg gttacatggg gaacatgaag
     6781 ccttctgttc agagcaagaa agcccgcaag cccagtatca aaggtaatgg ctgcaaagag
     6841 gccaaagagc tgaaagccag aaggaaaaaa tctcaagatg ggaaaacaac tctcttggat
     6901 tctggcagtt ctggagtgtt gtccccagtg gactccctgg agtcaacaca tggatactta
     6961 tcagatgtgt cttctcctcc gctgatgacc tctccgtttc agcagtctcc atccatgcct
     7021 ctgaaccact tgacaagcat gccagagtcc cagcttggca tgaatcacat aaacatggcc
     7081 accaagcagg aaatggcagc aggttccaac agaatggctt ttgatgccat ggtgccacgt
     7141 ctgacccatc tcaatgcctc aagccctaat accatcatga gcaatggatc catgcatttc
     7201 actgtgggag gagctccgac tatgaacagc caatgtgact ggttagctag gctgcagaat
     7261 gggatggtcc agaatcagta tgacccaatc agaaatggca tccaacaagg caatgctcaa
     7321 caagctcaag ctcttcagca tggccttatg acctcgctcc ataatggtct gccagcaaca
     7381 actctctccc aaatgatgac ctatcaggcc atgcccaaca caaggctagc caatcagcca
     7441 catctaatgc aagcccagca aatgcaacag cagcaaaact tgcagttgca ccagagcatg
     7501 cagcaacaac atcacaattc cagcacgacc tctactcaca tcaactcacc attctgcagc
     7561 agtgacataa gccagacgga cctgcagcaa atgtcaagca acaacattca ttcagtaatg
     7621 ccccaggaca ctcagatatt tgctgcatct ctgccttcca atcttacgca gtctatgaca
     7681 actgcacaat ttttaacccc cccttcccag catagctact cctccccaat ggacaataca
     7741 ccaagccatc aactacaagt accagaccac ccgttcctga cgccttctcc tgagtcacct
     7801 gaccagtggt caagctcctc ccctcattcc aatatgtctg actggtcaga aggaatatca
     7861 agtcctccca cgagtatgca gcctcagcgc acccacatac ctgaagcttt caagtaaaaa
     7921 aaaaaaaagt ttaaaaaaat gtaaaatatt tttaaagaca ctgagagaga ctttaagaga
     7981 ctgaaggaaa tttttatatg gtttttatac ttaaaataac agaacatttg aattttctag
     8041 tatttattta tatatacgtt tgacctaaaa cactgccctt ttatttataa gctttttttc
     8101 atttttagga caaaatattt taacttcttg ccttgaaagt ttttcagttc taaatcttat
     8161 gaaattggtt cctgcctggt attgaaaacg gcaatgtatt tattttttat ttacctgaat
     8221 agtatacagg aacaaaccac tggggtgggg gggttatcgg gatgtgtatt tagcagaaaa
     8281 aagattttct ataaaatgaa atctttcagg ttttcattta tagcactaaa aagattccag
     8341 tattaatttt aaattaaatc atgaagaaga tgctccgaat acctctcgct gtcagggagg
     8401 tgcccaatag gttttttttt tttgtttttt ttttggaaaa cctgaaactc ttggtaagta
     8461 cagaaagaaa gcataaaata caggagggca cgcctaggtc tgatgcattc tcaatgggaa
     8521 taaaatcctg aaaagtgact cattttaaaa aattatattt gttagaaaaa acaaactgcc
     8581 attttgaatc ccttgtgtcg catggggtat tggtcagaaa atgcatgcaa tttttggtac
     8641 aagcccgtgg catattttct acagtttttt tttttttttt tttttttttg taaatatcta
     8701 tggtggattt tgtttcaaaa atctatttgt atatcctgat ttatattgct tatccgagta
     8761 ggagtgcagg ctggatggaa ccatgccgaa gacaaatcgc agattggaca ttgtttttta
     8821 tttttttgcc tacagagact atttgcacat gaacaacaac aaaaaaaaga agatatattt
     8881 aaatataaaa ttttatacaa acgaaaaaac aaaaaacaaa aactgtcgat ttgcctctag
     8941 ttccacttct tctgtgatac gatgcagttg ggcagttcat tccttatgcc gtactgtaca
     9001 aagatctgta gaatttagtt gtttttcctg aacaaataaa tatgtaatgt atcgagcaga
     9061 aatgtagttt acaaaaaaaa ctatttgttg ttgttgctgt taacacttct gtaaacaaat
     9121 tttttttggt ttgtttattt gttgtaataa aattgtacaa aat
//
LOCUS       CCRFTRCX     2202 bp ds-DNA             BCT       30-JUN-1990
DEFINITION  C.crescentus transcriptional protein (ftrC) gene, complete cds.
ACCESSION   M30946
KEYWORDS    transcriptional protein.
SOURCE      C.crescentus (strain CB15) DNA.
  ORGANISM  Caulobacter crescentus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Prosthecate bacteria.
REFERENCE   1  (bases 1 to 2202)
  AUTHORS   Van Way,S.M. and Mullin,D.A.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.A.Mullin, 20-DEC-1989.
            
                          Author address: D.A.Mullin
                          Tulane University Dept. of Biology
                          2000 Perciual Stern Hall
                          New Orleans, LA 70118
FEATURES       from  to/span     description
    pept        340     1707     transcriptional protein (ftrC)
BASE COUNT      383 a    741 c    750 g    328 t
ORIGIN      
        1 gaattcggag gggccatgct ggcctccgag gctccgatcg aactcagcga caagaccgct
       61 gcggacctgg cgccggtctt cgacgttccg gtcaacatct cggccgtgct gggccgcgcc
      121 aacatgtccg tggcgcagct gctgcagctg ggccagggct cgatcctgga gctggaccgc
      181 aaggtcggcg aggcgatcga catctacgtc aataaccgcc tggtcgcccg gggcgaggtc
      241 gtcgtcgtcg acgagcgcct gggcgtgacc atgacggaaa tcatcaagga cggcgaccag
      301 ggctgacgcc cggtggtcgt tcgagaggga gagtaagaga tgcggcttct ggtcgttgga
      361 aaactgaacg ggcagctctc ggtcgccgtg aagatggcga tgaacgcggg cgcgaaggtc
      421 tcgcacgtcg aaacgacgga gcaggcgacc aacgcgctgc gggcggggca gggcgctgac
      481 cttctgatgg tcgactatgt gctcgacatc gccggtctga tcgccgccaa cgaagccgag
      541 cggatgcggg tgccggtggt ggcctgcggc gtcgacgccg atccgatgcg cgcggccaac
      601 gccatcaagg ccggggccaa ggagttcatc ccgctgccgc cggacgccga gctgatcgcc
      661 gccgtcctgg ccgccgtcac cgacgacgaa aagccgatgg tcgtccgcga cccggccatg
      721 gagcaggtca tcaagctggc cgaccaggtc gccccctccg aagcctcgat cctgatcacc
      781 ggggagagcg gctcgggtaa ggaggtcatg gcccgctacg tccacggcaa gtcgcgccgg
      841 gccaaggcgc cgttcatcag cgtcaactgc gccgccatcc ccgagaacct gctggaaagc
      901 gagctgttcg gccacgagaa gggcgccttc accggggcca tggcccgccg catcggcaag
      961 ttcgaggagg ccgacggcgg caccctgctg ctggacgaaa tcagcgaaat ggacgtgcgc
     1021 ctgcaagcca agctgctgcg cgccatccag gagcgcgaga tcgaccgcgt gggcggctcc
     1081 aagccggtca aggtcaatat ccgcatcctg gccaccagca accgcgacct ggcccaggcg
     1141 gtgaaggacg ggacgttccg ggaagacctg ctctaccgtc tgaacgtcgt gaacctgcgc
     1201 ctgccgccgc tgcgcgagcg tccggccgac gtgatcagcc tgtgcgagtt cttcgtgaag
     1261 aagtactcgg ccgccaacgg catcgaggaa aagccgatct cggccgaggc caagcgccgc
     1321 ctgatcgctc accgctggcc gggcaacgtc cgcgagctgg aaaacgccat gcaccgggcg
     1381 gtgctgctgt cggcgggccc ggagatcgag gagttcgcca tccgtctgcc cgacggccag
     1441 ccgatggccc cggcgccgga cgtcgcggtt gcccgcggcg cccagatggc cgccgacgcc
     1501 gcctcgcgcg ccttcgtcgg ctcgaccgtc gccgaggtcg agcagcagct gatcatcgac
     1561 accctggagc actgcctggg caaccgcacc catgcggcca acatcctggg catctcgatc
     1621 cgcaccctgc gcaacaagct gaaggaatat tccgacgccg gcgtgcaggt gccgccgccc
     1681 cagggcgggg tcggcgcggc cgcttaagac cagggcgtcg aggtggcggg accgaaaacc
     1741 tcgcccttcg acaagctcag ggtgaggttt tctatgcctg gcccggcgct tcattcgtcc
     1801 tcaccctgag cctgtcgaag ggcgaggacg acgcaaagct ccgggccgat ctggccgccg
     1861 ccgcgaccat cggtccggcg ccgcgccgcg aaaaatttgg ttaacaagat ggcggcgggc
     1921 cgcaaatcgc tccaagcgaa agcctagctt cccatccccg gttcaaggtt ccgaatggcc
     1981 gacgccgccg ccccgaacgc cagctcgatg cccagcgcca agtcgctgct cgacgggttg
     2041 atgcgcggcg agatgggcct ggccctgggc gtggtcggca tcatcgtgct gctgatcatc
     2101 ccggttccgg cgccgctgct ggacgtgctg ctggccattt cgctgacggg atcggtgctg
     2161 atcctgatga cggcgattct gatcaagaag ccgctggaat tc
//
LOCUS       TRCTC26       638 bp ss-mRNA            INV       30-JUN-1990
DEFINITION  T.cruzi AMP-inducible protein (TC26) mRNA, partial cds.
ACCESSION   M34062
KEYWORDS    AMP-inducible protein.
SOURCE      T.cruzi (strain X10/4) trypomastigote, cDNA to mRNA, clone G26.
  ORGANISM  Trypanosoma cruzi
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 638)
  AUTHORS   Heath,S., Hieny,S. and Sher,A.
  TITLE     A cyclic AMP inducible gene expressed during the development of
            infective stages of Trypanosoma cruzi
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.Heath, 08-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1  >   638     AMP-inducible protein (TC26) (AA at 1)
BASE COUNT      154 a    174 c    168 g    142 t
ORIGIN      
        1 accccaccag accaactggt aatggtagcg accggcgctc agcttggaat tccgtgcgct
       61 gcttctagcc ctagagaagc tgatgatccc ccgtatccga cacaggcgta caaccctgct
      121 tgtggttacg gacagtcagt ctcttctagc ggctctaaac aagggcccgc tcaatcagac
      181 agactggacg gaggatcaga tctggcggcg tctcttgaca ctgacgtgtg ctggctggtc
      241 gtgcacctgc agttttgtta cggacattgt ggagtacatg ctaacgagct tgcagatcag
      301 tatgcgagga actatggaaa gtggacaata cacggagcaa ggaatcgcac ctttatggca
      361 tacggatctg ctgacgtgtt ttactaccca gctcaccaac aagtggcgta ctaccattcg
      421 tcaagacact catcgctacc tgctttgcgg cacaaggcca tcagatctcg cggtaaggac
      481 ctgatcactc aggaagttct acaccgtcag aactggttca cctcgcaagg gcaaggtgcg
      541 gggaatctga gctctggggc cgactatctg ggccgtgaga gattgcacga accaatgccg
      601 tttctgcaac atctcaccgg aacagtctgc atatatat
//
LOCUS       HUMSATM29A    700 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human minisatellite MS29 locus 1 polymorphism.
ACCESSION   M28875
KEYWORDS    minisatellite.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 700)
  AUTHORS   Wong,Z., Royle,N.J. and Jeffreys,A.J.
  TITLE     A novel human DNA polymorphism resulting from transfer of DNA from
            chromosome 6 to chromosome 16
  JOURNAL   Genomics 7, 222-234 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.J.Jeffreys, 04-OCT-1989.
BASE COUNT      156 a    188 c    212 g    144 t
ORIGIN      Chromosome 6p25-pter.
        1 ggacagccag agtgcccaca agctcggcag gaaggaggct cacatcgccg cggaagtctc
       61 gtgatggttc tggtagaatc gtcctcttga aaccttctgc tgccattatt gggtgctgtc
      121 caatatgcct tacgcaaaac actgacaata gcatacacct ttccagaatg ccggcacgcg
      181 ctgtgcaatg cccaaactgt ccctgtgagt tgaatgcccc tggcaggatg cgtgtccact
      241 cccgcagggc cgtgggtgcc tcccttgtct cactcctctg cttcaaccct tcccagaact
      301 ctgaaggttt gagatgaaat gactgagcca cacctagtgc aggaatgagc accagggcct
      361 ctctgcaacg gctgcttaag ggccacagga taagcatagt gtggcgcccc cagttattcc
      421 ctgtttcact cattggacca ccctgatagg cttcagggag gcaggaacct cctctaaggc
      481 agcaggagag gggtggatgg ggcagccaac aggtgcatct catgcctccg tgttctccat
      541 agggagggag gtgggctaag aggcgagtgg ggcagaggcc tgggccagcg tgaagtgtca
      601 ggagtaaggg gggagaagac tggagggggc aaatccagga ggcagctgag gctcgactgt
      661 gcaaatacaa gcccggttac gtctgtgtag cttagcttcc
//
LOCUS       HUMSATM29B    700 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human minisatellite MS29 locus 2 polymorphism.
ACCESSION   M28876
KEYWORDS    minisatellite.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 700)
  AUTHORS   Wong,Z., Royle,N.J. and Jeffreys,A.J.
  TITLE     A novel human DNA polymorphism resulting from transfer of DNA from
            chromosome 6 to chromosome 16
  JOURNAL   Genomics 7, 222-234 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.J.Jeffreys, 04-OCT-1989.
BASE COUNT      182 a    189 c    146 g    183 t
ORIGIN      Chromosome 16p11.
        1 ggacagccag agtgcccaca agctcggcag gaaggaggct cacatcgccg cggaagtctc
       61 gtgatggttc tggtagaatc gtcctcttga aaccttctgc tgccattatt gggtgctgtc
      121 caatatgcct tacgcaaaac actgacaata gcatacacct ttccagaatg ccggcacgcg
      181 ctgtgcaatg cccaaactgt ccctgtgagt tgaatgcccc tggcaggatg cgtgtccact
      241 cccgcagggc cgtgggtgcc tcccttgtct cactcctctg cttcaaccct tcccagaact
      301 ctgaaggttt gagatgaaat gactgagcca gtgagtgaaa agaccagcca agaatgtgaa
      361 aaacctcatc tggacacaag gcaaaataat tattttctcc cttggttacc ttttccattc
      421 cctttcttca agggaaagaa ccatgtctta ttttacttgg catcagcagc tctcagaaca
      481 gtggcttgac caataccaag cattcaataa atgtttattg aaggaatgaa tcaatggcca
      541 cccacaacct atacctctta cagcccattt aatgtccttt agccccttcc ataggtattt
      601 gcctggaatg tttgtaagat ttttgaaaat ctcagaaaca acaccagccc tccacccatg
      661 ggaagatgtg tctcaggcta acacctgtgc tcttgttgat
//
LOCUS       HUMSATOD     1900 bp ds-DNA             PRI       30-JUN-1990
DEFINITION  Human O family dispersed repeat element.
ACCESSION   M28877
KEYWORDS    repetitive element.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1900)
  AUTHORS   Wong,Z., Royle,N.J. and Jeffreys,A.J.
  TITLE     A novel human DNA polymorphism resulting from transfer of DNA from
            chromosome 6 to chromosome 16
  JOURNAL   Genomics 7, 222-234 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.J.Jeffreys, 04-OCT-1989.
FEATURES       from  to/span     description
    rpt         410      785     O family dispersed repeat
BASE COUNT      484 a    501 c    344 g    567 t      4 others
ORIGIN      
        1 gccacctctg tctaatgttc tcatatagct ccaactctaa cacctcctta cattctacca
       61 tctgtgtctt agactgcagc aagccctgta actggctctc ggtctccagg cttctcccga
      121 cctagacttg cttacattgc tgtacaagtg aggtttttaa cattctcagt cttgtcactc
      181 actgcttaaa ctccttcaag aattttcact cccttcaggc caagggctat atgaagccct
      241 tcagatgtgg cccttgcttg tgtctccagc cccatctctc agcagtgctc ttaacactcc
      301 caaccatcat gagctaatta ctatccttcc aacatgccat gctccttcac acccctcttg
      361 ctttgaacac gctgactctt ctgcctgaaa tgtctttccc agttcccttc tgatatggtt
      421 tggctgtgtc cccacccaaa tctcatcttg aattgtagtt cccataatcc ccacatgttg
      481 tgggagggac ctggtaggag gtaattgagt catgggggtg attaccctca tgctgccatt
      541 ctcatggtag tgaacgtgag ttctcacaag atcttatggt tttataaggg gcttctctcc
      601 ctttgctcag cacttcacct tgctgccacc atgtgaagga cacgttttct tccccttcca
      661 ccatgattgt aagtttcctg aggcctcctc agccatgctg aactgtgagt caattaactt
      721 ctttccttta taaattaccc agtctcgggt atgtatttat tagcagtgtg agaatggact
      781 aatacacccc cttcactcac ttttcagtct tcaaagtaca cagccatcct ttactgcccc
      841 atgcagcacc ttcaggtctt gagctctatg cagggaggta agccacggaa gtcacaagca
      901 catcttttat actagtgcac acaactgaaa actcacagac tggtaaaagc acatctttta
      961 tactagtgca cacggctgaa aactcacaga ctggtaccaa accagcttgg ttctgttctg
     1021 cttccactgc ctgtcaagac tgtgtgctct gggcaaggta cttcaaccac tgggtcttgg
     1081 cttcctcatc tggaaaatga aaataaaact gcaactttaa aaagcttttc tgaaatcaaa
     1141 tgcagcaatg tacataaaag gcaaataagc tcaccctatg ttttacctaa ttttcttttt
     1201 atcctgctct gtgtatttct taagccaacc atcttacttc ctttttagaa tgcatcagga
     1261 tagacacaaa taaatggaaa cactgttaca aagcccctgc tcacacttca gaattcagat
     1321 tgcattacct cctccaggaa gtcttcctta atcactctct gctcgagtct catttagatg
     1381 atctcctgtg tgcttgcata tcgtctgtgc atacctctgt gttgggagga gtggtaagag
     1441 atggggctgg agactcaaga aaaggccaca tcggaagggc ctcgatcctt tcccctgaag
     1501 ggagtaaagg ctctccaaag agtttaagca gtgagtgaaa agaccagcca agaatgtgaa
     1561 aaacctcatc tggacacaag gcaaaataat tattttctcc cttggttacc ttttccattc
     1621 cctttcttca agggaaagaa ccatgtctta ttttacttgg catcagcagc tctcagaaca
     1681 gtggcttgnn nnataccaag cattcaataa atgtttattg aaggaatgaa tcaatggcca
     1741 cccacaacct atacctctta cagcccattt aatgtccttt agccccttcc ataggtattt
     1801 gcctggaatg tttgtaagat ttttgaaaat ctcagaaaca acaccagccc tccacccatg
     1861 ggaagatgtg tctcaggcta acacctgtgc tcttgttgat
//
LOCUS       BSURFOLA     4880 bp ds-DNA             BCT       30-JUN-1990
DEFINITION  B.subtilis folic acid biosynthetic operon encoding
            para-aminobenzoic acid synthase, component I (pab), glutamine
            amidotransferase (trpG), pot. p-aminobenzoate synthesis enzyme
            (pabC), and pot. dihydropteroate synthase (sul) genes, complete
            cds.
ACCESSION   M34053
KEYWORDS    H2-pteridine-CH2OH pyrophosphokinase; dihydropteroate synthase;
            glutamine amidotransferase; para-aminobenzoic acid synthase.
SOURCE      B.subtilis (strain VB157) vegetative cell DNA, clones pJS[1,2].
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 4880)
  AUTHORS   Slock,J., Stahly,D.P., Han,C.-y. and Crawford,I.P.
  TITLE     An apparent folic acid biosynthetic operon of Bacillus subtilis
            containing pab, an amphibolic trpG, a third gene required for
            synthesis of para-aminobenzoic acid, and the dihydropteroate
            synthase gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.P.Stahly, 07-MAY-1990.
            
               Author address: D.P.Stahly
               University of Iowa
               Dept. of Microbiology
               Iowa City, IA 52242
               e-mail: cmdsix@vaxa.weeg.uiowa.edu
FEATURES       from  to/span     description
    pept        313     1725     para-aminobenzoic acid synthase, component I
                                 (pab)
    pept       1739     2323     glutamine amidotransferase (trpG)
    pept       2323     3204     pot. p-aminobenzoate synthesis enzyme (pabC)
    pept       3186     4043     pot. dihydropteroate synthase (sul) (EC
                                 2.5.1.15)
    pept       4036     4398     ORF2
    binding     300      303     ribosome binding site
    binding    1724     1730     ribosome binding site
    binding    2306     2313     ribosome binding site
    binding    3175     3179     ribosome binding site
    binding    4023     4027     ribosome binding site
    binding    4381     4385     ribosome binding site
    binding    1720     1728     methyltryptophan resistance protein binding
                                 site
BASE COUNT     1512 a    939 c   1205 g   1224 t
ORIGIN      
        1 gaattctcgg aggcatctca tcaggtgcgg caatttatgc tgctcttcaa gtagcaaaaa
       61 aactcggaaa aggcaaaaaa gtattggcca tcattccaag taacggcgaa cgttatctaa
      121 gcacgccgct gtaccaattc gattaaaaaa agccaaaact cccggttcgc cgggagtttt
      181 tttatatttc gtgcatcaaa tatgccatgt ctgcgacgaa tgttattctt ggaagagaaa
      241 ttcacttttt tcactaacaa cattgcttta caattaaaaa caagtaaaaa agaaaataga
      301 ggaatgatac aaatggcaca acgcagaccg gcaggcaaaa aaataccttt tcaaaaagac
      361 tcattcttac aacaatttga gaaacttgcg caatcccgga aacatcatgt acttctcgaa
      421 agtgcaagag gcggcagata tagtatagcc ggtcttgatc caattgcgac tgtgaaagga
      481 aaagacggaa taactacaat taagcatggt gatgagatgc tgtttaaaga aggtgatcca
      541 ttacgggcct tccacagctg gtttaaaaca ctggaaacag aaacgaatca tgagttccct
      601 gactttcaag gcggggcaat cgggtttctc agctatgatt acgcacggta cattgaaaat
      661 tttaaaatgc tctcattaga tgatttagaa acaccagata tttattttct tgtttttgat
      721 gatatagcag tttatgacca tcaagaagag tctctatggc tgattactca tgttaatggt
      781 tctgatcagg aaacagcgga tgtgaagcta tctgagttag agcagatgtg gttgactgag
      841 cttcccgctg tcacttcgcg agagatgaag cctgaaacag ctggttcttt cgcggcgcca
      901 tttaccgagg atgggttctc acaagctgta gagaaaatca aacaatacat tgccagcgga
      961 gatgtgtttc aagtcaatct atcaataagg cagtcacagt cactgtctgt ccacccatat
     1021 caaatttaca aaaccttgag agaagtaaat ccttctcctt atatggcgta tttagaaaca
     1081 cctgatttcc aaatcatttg cggatcgcct gaactgcttg tcagcaaaaa gggcaagcta
     1141 ttagagacga gaccgattgc gggcacccgt tccagaggga aaacaaatga agaagacgag
     1201 gcgcttgcaa acgaattgat acacaatgaa aaagaacgcg cggaacatgt catgctggtt
     1261 gatcttgagc gaaatgatct gggaagagta tcacgttacg ggtctgtgcg cgtaaatgaa
     1321 ttcatggcaa ttgaaaaata ctcgcatgtg atgcacattg tgtctaatgt ccaaggtgaa
     1381 ctgcaggatg ggtatgatgc tgtagatatt attcatgctg tgtttcccgg aggaaccatt
     1441 actggtgcac cgaaagtaag aacgatggaa attatagaag aacttgagcc gacacgccga
     1501 gggctttata ctggatctat aggatggttt ggatataatc acgatctgca gtttaatatc
     1561 gtcattcgaa ccatttatgc aaccggaggg caggcattta tgcagtccgg tgcaggagtt
     1621 gtgattgatt ctgttccgaa gcacgaatac aaggaatcat tcaaaaaagc ttttgcgatg
     1681 caaagagcat tagagctgag cgaagaagag acaaaaatta gatgaggtga gcggagaaat
     1741 gattttaatg attgataact acgattcatt cacgtacaac ttggtacagt atttgggcga
     1801 gcttggggaa gagctggttg tgaaacgcaa tgacagcatc acaatcgatg aaattgaaga
     1861 actgtctccg gactttctga tgatatctcc cggaccgtgc agccctgatg aggcgggaat
     1921 cagcctcgaa gcaattaaac atttcgcagg gaaaattcct attttcggtg tatgtctcgg
     1981 acatcagtcc atcgcacaag tgttcggtgg tgatgttgtt agggcagaac ggcttatgca
     2041 cgggaaaacc tcggatatcg agcatgacgg caaaaccatt tttgaagggt tgaaaaatcc
     2101 ccttgttgcg acgcgatacc actcgctgat cgtaaaacct gagacgctgc caagctgttt
     2161 tacagtaaca gcacaaacga aagaaggaga aatcatggct attcgccaca atgacctccc
     2221 gatagagggt gtgcaatttc acccagagtc tattatgacc tcctttggga aagaaatgct
     2281 cagaaatttt attgagacat atcgcaagga agttattgcg tgatgatata tgtgaacggc
     2341 cggtatatgg aggagaaaga tgcagttctt tctccttttg accatgggtt tttatatggg
     2401 atcggtgttt tcgaaacctt caggctctac gaaggctgcc cgtttttgct tgattggcat
     2461 atagagaggc ttgaacgcgc gcttaaggat ctgcaaatcg aatatactgt ttcaaagcat
     2521 gagatccttg agatgctgga caagctgctt aagctaaatg atatcaagga tggcaatgcc
     2581 cgcgtcagac tgaacatatc agcgggaatc agcgataaag gctttgtcgc ccagacgtac
     2641 gataagccca ccgttttatg ctttgtgaat cagctgaaac cagaaagtct tcctttgcaa
     2701 aaagaaggaa aggttttatc gatacggaga aacactccag agggatcgtt ccggttaaaa
     2761 tctcatcatt atctaaataa tatgtatgcg aaacgggaaa ttggaaatga tccgcgtgtc
     2821 gagggaattt tcttaacgga ggatggagct gttgcagagg gtatcatatc taatgtcttc
     2881 tggagaaaag gtcgctgtat ctatacgcca tcgcttgata ccggtattct ggacggcgtc
     2941 acgagacgtt ttatcataga aaatgcaaag gatatcggcc tggagctcaa aacgggccga
     3001 tatgagctag aagctctctt gacagctgac gaagcgtgga tgacgaactc ggttcttgaa
     3061 atcatcccgt ttactaaaat agaagaagta aactatggaa gccaaagcgg agaagcaacc
     3121 tccgctcttc aattgttata caaaaaagaa ataaagaaca tgattcatga aaagggagga
     3181 agagcatggc gcagcacaca atagatcaaa cacaagtaat ccacactaag cccagcgctt
     3241 tatcatataa agagaagacg ctggtgatgg gaattttaaa cgtaacgcct gactctttct
     3301 cggacggcgg aaaatatgac agcttggaca aggcgctgct gcacgcgaaa gagatgatcg
     3361 atgatggtgc ccatatcatt gatattggag gggaatcgac aaggcctggc gctgagtgcg
     3421 tatctgagga tgaggagatg tccagagtca ttccggtgat tgagcggatt acgaaagagc
     3481 ttggtgttcc tatttctgta gacacgtaca aggcttctgt cgcagatgaa gcagtgaaag
     3541 ccggtgcatc cattatcaat gatatttggg gagccaaaca tgatccgaag atggcttccg
     3601 ttgcagctga acataatgtt ccaattgtac tcatgcataa ccgccctgaa agaaactaca
     3661 atgacttatt gccggatatg ctgtcggact taatggagag tgtaaaaatt gctgttgagg
     3721 ccggagtaga cgagaagaac attattcttg atcctggtat cggtttcccg aaaacctatc
     3781 acgataactt ggcagtgatg aacaaactag agattttcag cggattggga tatccggttc
     3841 ttctggcaac ctcccgaaaa agattcatcg gacgtgttct ggatcttccg cctgaggagc
     3901 gggctgaggg cacaggcgcg actgtgtgtc tcggcattca aaaaggctgt gacattgtca
     3961 gggtccatga tgtaaagcaa attgccagaa tggcgaaaat gatggacgcg atgctgaata
     4021 agggaggggt gcaccatgga taaagtttat gtagaaggta tggagtttta cggatatcac
     4081 ggtgtgttca cagaagaaaa caaacttggc cagcggttta aagtcgattt aaccgctgag
     4141 ctggatttaa gcaaagctgg acagacagac gaccttgagc aaacgatcaa ctatgctgag
     4201 ctctatcacg tatgtaaaga tatcgtggaa ggggagcctg tgaaattggt ggaaacgctg
     4261 gcggaacgta ttgctggcac tgttctcgga aaatttcagc ctgttcagca atgtacggtg
     4321 aaagtgatta agccagaccc gccaattccc ggacactata aatcagtagc aattgaaatt
     4381 acgagaaaaa agtcatgaac aacatagctt atattgcact tggatctaat attggagata
     4441 gagaaacgta tttaaggcaa gcagtggctt tactgcatca gcatgctgcg gtgacagtca
     4501 ctaaagtgtc gtctatttac gaaactgacc cggtcggata cgaagatcaa gctcaatttt
     4561 tgaatatggc tgttgaaatc aagacatcat tgaacccttt tgaactcctt gaactgacgc
     4621 agcagataga aaatgaatta ggcagaacaa gggaagtaag atggggaccg cggacggcag
     4681 accttgacat tttgttattt aatcgtgaaa atattgaaac agagcaacta attgttccgc
     4741 atccgagaat gtatgagcgt ttgtttgtcc ttgcgccgct tgcggaaatt tgccagcagg
     4801 ttgaaaaaga ggctacaagc gccgaaacag accaagaagg tgtaagagta tggaagcaga
     4861 aatctggggt agacgaattc
//

GenBank-Updates@genbank.bio.net (07/03/90)

LOCUS       DROANNIX     1104 bp ss-mRNA            INV       03-JUL-1990
DEFINITION  D.melanogaster annexin IX mRNA, 3' end.
ACCESSION   M34068 J05501
KEYWORDS    annexin IX.
SOURCE      D.melanogaster adult head, cDNA to mRNA, clone pD3-6.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1104)
  AUTHORS   Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and
            Suedhof,T.C.
  TITLE     Two novel annexins from Drosophila melanogaster: Cloning,
            characterization and differential expression in development
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by T.C.Suedhof,  04-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      891     annexin IX (AA at 1)
    signal     1091     1096     poly-A signal
BASE COUNT      281 a    300 c    306 g    217 t
ORIGIN      
        1 attctgcgca aggcgatgaa gggcttcggc accgacgaga aggccatcat cgagatcctg
       61 gccaggcgtg gcatcgtcca gcgtttggag atcgctgagg cgttcaagac ctcgtacggc
      121 aaggatctga tctcggacct caagtccgag ctgggcggca agttcgagga tgttatcctg
      181 gctctgatga cgccgctgcc ccagttctat gcccaggagc tgcacgacgc catctcggga
      241 ctgggaaccg acgaggaggc catcatcgag atcctctgca cgctgtccaa ctacggcatt
      301 aagaccattg cccagttcta cgagcagagc ttcggcaagt ccctagagtc cgacctaaag
      361 ggcgacacca gtggccactt caagcggctg tgcgtctcgc tcgtccaggg caaccgggat
      421 gagaaccagg gcgtggacga ggccgcggcc atcgccgatg cccaggctct gcacgacgcc
      481 ggtgagggac agtggggcac agatgagtcc accttcaact cgatcctgat cacccgctcc
      541 taccagcagc tgcgccagat cttcctcgaa tacgagaatc tgtcgggcaa cgacatcgag
      601 aaggccatca agcgggagtt tagcggctcc gtggagaagg gtttcctggc catcgtcaag
      661 tgctgcaagt ccaagatcga ctacttttcg gagcgcctgc acgactccat ggccggcttg
      721 ggcaccaagg acaagacgct gatccgcatc atcgtcagcc ggtcggagat cgatctgggt
      781 gacatcaagg aggcattcca gaacaagtac ggcaagagct tggagtcctg gatcaaggag
      841 gatgccgaga ccgatattgg atacgtcctg gtcactctta cggcttggta gacggaagca
      901 gccggaatat ccgaatatct atgagcaata ccccactgtt caagtagaaa atgccaaaaa
      961 aaaaaacgtt gcatttcccc aaaaaaaagt ataacaaaag cgaagaacaa atggagttgg
     1021 tctatataca gtagttgtga tgtgttctaa aaatccaatc tacaaaacgc ttagtatttt
     1081 ccctctgtgc aataatcgga attc
//
LOCUS       DROANNX      1192 bp ss-mRNA            INV       03-JUL-1990
DEFINITION  D.melanogaster annexin X mRNA, complete cds.
ACCESSION   M34069 J05501
KEYWORDS    annexin X.
SOURCE      D.melanogaster adult head, cDNA to mRNA, clone pD3-16.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1192)
  AUTHORS   Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and
            Suedhof,T.C.
  TITLE     Two novel annexins from Drosophila melanogaster: Cloning,
            characterization and differential expression in development
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by T.C.Suedhof , 04-MAY-1990.
FEATURES       from  to/span     description
    pept         91     1056     annexin X
    signal     1175     1184     poly-A signal
BASE COUNT      271 a    348 c    363 g    210 t
ORIGIN      Chromosome 93B or 19A-4,7.
        1 gaattccaaa agtcccagga gaaagactga ttcgtgtgaa gtcgtctact gaagagccac
       61 aaggaaccca aggaatcttc cagctgcata atggaataca aacccgtgcc cacggttaag
      121 gacgcagctc ccttcgacgc ctcccaggac gcccaggtgc tgcgggcggc gatgaaggga
      181 ttcggcaccg acgagcagga aatcatcgac gtgctcgtcg gcaggagcaa ccagcagagg
      241 cagacgatca aggcggttta cgaagcggag ttcgagcgcg acctggtgga cgatcttaag
      301 gacgagctgg gaggcaagtt cgaggacgtg atcgtgggtc taatgatgcc accagtggag
      361 tacctgtgca agcaactgca cgccgccatg gcgggcatcg gaaccgagga ggccacgctc
      421 gtcgagatcc tgtgcaccaa gaccaacgag gagatggccc agatcgtggc cgtctacgag
      481 gagcgctacc agcgcccgct ggccgagcag atgtgcagcg agacctccgg ctttttccgc
      541 cgcctgctca cgctgatcgt gaccggagta cgtgacggac tggacacgcc cgtcgacgtc
      601 ggtcaggcca aggagcaggc cgcccagctc tactcggccg gcgaggccaa gctgggaacg
      661 gacgaggagg tcttcaaccg gatcatgtcg cacgccagct tcccgcagct gcgacttgtc
      721 ttcgaggagt acaaggtgct ctccgggcag accatcgagc aggccatcaa gcacgagatg
      781 tccgacgagc tgcacgaggc catgatggcc atagttgagt gcgtccagtc accggcggcc
      841 ttcttcgcca accgcctcta caaggccatg aatggcgccg gcaccgatga cgccacgctc
      901 atccgcatca tcgtcagccg ctcggagatc gacctggaga ccattaagca ggagttcgag
      961 cggatctaca accgtacgct gcacagcgcc gtggtggacg cggagacctc tggtgactac
     1021 aagcgggccc tgacagccct acttggatcc gcctaggccc gaggatgtgg cagctggtcc
     1081 gcccaatatt ttattcgtgt taatagcttt gatcgtagtg tgccttttag gaaaatcgct
     1141 tttaatgtcg tctgcgcatg cgcacactgt tggcaataaa taaacggaat tc
//
LOCUS       NEUMPPX      2038 bp ss-mRNA            PLN       03-JUL-1990
DEFINITION  N.crassa matrix processing peptidase (MPP) mRNA, complete cds.
ACCESSION   J05484
KEYWORDS    matrix processing peptidase.
SOURCE      N.crassa, cDNA to mRNA.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 2038)
  AUTHORS   Schneider,H., Arretz,M., Wachter,E. and Neupert,W.
  TITLE     Matrix processing peptidase of mitochondria: Structure-function
            relationships
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by W.Neupert, 17-APR-1990.
FEATURES       from  to/span     description
    pept         41     1774     matrix processing peptidase
BASE COUNT      439 a    626 c    552 g    421 t
ORIGIN      
        1 cccacattac gctgccgcat cacaattcct tgttgcagcc atgctgaatc gcttccggcc
       61 agcgcggcta gtagcccaat cctccagatg cttgcccttg acgagggcgc gggcaggtcc
      121 cttgcccgtt aacaatgcca ggactttggc tacgagagcc gctgctgtca acaccaagga
      181 accgaccgaa cgcgacaaca tcaccactct ctccaatggt gtccgtgtcg cttccgagga
      241 ccttcccgat gccttctccg gtgtaggtgt ctacatcgac gcggggtccc gatatgagaa
      301 cgactatgtc cggggtgcca gtcacatcat ggaccggcta gccttcaagt ctacaagtgc
      361 gaggactgcg gacgaaatgc tcgaaactgt tgagaagctc ggtggtaaca ttcagtgcgc
      421 ttcttcgcgc gagtctatga tgtaccaggc ggccaccttc aacaaggcta ttcccaccgc
      481 tgttgagctc atggccgaga ccatccgcga tcccaagctt acggacgagg agctggaggg
      541 acagatcatg acggcgcaat atgaggtcaa cgagatctgg tccaaggccg aactgatcct
      601 gcccgagttg gtgcacatgg ctgccttcaa ggacaacact cttggcaacc cgttgctttg
      661 tcccaaggag aggttggatt acatcaaccg ggatgtcatc caaacatacc gcgacgcttt
      721 ctacaggccc gagcgccttg ttgttgcctt tgctggtgtg cctcatgaga gggccgtcaa
      781 gctcgcagag aagtactttg gtgatatgaa ggcctccgat gctcccggtc tctcgaggac
      841 aggttccgaa acctccgtcg actcgctagt gtccgagtcc agcgaggcct cgagtgaatc
      901 ttcatcatcc tcctcggact cttccgagtc gagtggcggg ctgctctcca agcttttctc
      961 tcccaaggcc aagaaagcca cccccaaccc cttcctcacc cgggtaccta ttagcaccga
     1021 agacttgact cggcctgctc actacacagg cggtttcctc accctcccat cacagccccc
     1081 accgctcaac cccaaccttc ccacatttac tcacatacag ctcgccttcg agggcctcgc
     1141 catctcggac gacgacatct acgccctcgc caccctgcag accctcctcg gcggcggcgg
     1201 ctccttctct gccggcggtc ccggcaaggg catgtactcg cgtctctaca ctaacgttct
     1261 caaccagcac ggctgggttg agtcctgcgt ggccttcaac cactcataca cggactcggg
     1321 tctcttcggc atcgccgcct cgtgctaccc gggtcgcacc ctgcccatgc tccaggtcat
     1381 gtgccgcgag ctgcacgccc tcaccaccga ccatggctac tcggccctgg gcgagctcga
     1441 ggtttcgcgc gccaagaacc agctccgcag cagcctcctg atgaacctcg agagccgcat
     1501 ggtcgagctc gaggatctgg gccgccaagt tcaggttcac ggtcgcaaga tcccggtccg
     1561 cgagatgacg cgccgtatca acgagctgac ggtcaaggac ctccgaaggg tcgctaagcg
     1621 cgtggttggt ggcatggcga ataacgccgg ccagggaagc ggtgcgccga cggtggtgct
     1681 gcaggaggcg acggtgcaag gactcaagac tacggagctg gggtgggatc agatccagga
     1741 tacaattgct cagtggaagc tcggtagacg gtaaacgttt gtcaagggga aaaaaagagt
     1801 agggcgtgga gaagttatgt aagaggagcg ctgtattgaa cttggcgaca cgcacacacc
     1861 ggaacgataa aggcgtttta ggttccccac gagcataggg aagaggctag atggttgctc
     1921 tgtacaatcg caacttttct tggtgagtta tacaagatgt gtccaggtac atctttgcct
     1981 taccatactg tacgatagca atgaagattt tctgatatat caaaagtcaa aagtcaaa
//
LOCUS       HUMCYP2DG    5503 bp ds-DNA             PRI       03-JUL-1990
DEFINITION  Human debrisoquine 4-hydroxylase mutant allele (CYP2D6-MA1) gene,
            complete cds.
ACCESSION   M33189
KEYWORDS    debrisoquine 4-hydroxylase.
SOURCE      Human individual MAGA DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 5503)
  AUTHORS   Gonzalez,F.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Gonzalez, 23-MAR-1990, for release after publication.
            
               Author address: F.Gonzalez
               National Cancer Institute
               Bldg. 37 Rm. 3E-24
               National Institute of Health
               Bethesda, Md 20892
FEATURES       from  to/span     description
    pept        814      993     debrisoquine 4-hydroxylase, exon 1
               1696     1877     debrisoquine 4-hydroxylase, exon 2
               2419     2571     debrisoquine 4-hydroxylase, exon 3
               2661     2820     debrisoquine 4-hydroxylase, exon 4
               3254     3430     debrisoquine 4-hydroxylase, exon 5
               3621     3762     debrisoquine 4-hydroxylase, exon 6
               3970     4157     debrisoquine 4-hydroxylase, exon 7
               4612     4753     debrisoquine 4-hydroxylase, exon 8
               4852     5030     debrisoquine 4-hydroxylase, exon 9
    pre-msg     726     5103     debrisoquine 4-hydroxylase mRNA and introns
    IVS         994     1695     debrisoquine 4-hydroxylase intron A
    IVS        1878     2418     debrisoquine 4-hydroxylase intron B
    IVS        2572     2660     debrisoquine 4-hydroxylase intron C
    IVS        2821     3253     debrisoquine 4-hydroxylase intron D
    IVS        3431     3620     debrisoquine 4-hydroxylase intron E
    IVS        3763     3969     debrisoquine 4-hydroxylase intron F
    IVS        4158     4611     debrisoquine 4-hydroxylase intron G
    IVS        4754     4851     debrisoquine 4-hydroxylase intron H
    signal      689      702     TATA box
BASE COUNT     1066 a   1537 c   1851 g   1049 t
ORIGIN      Chromosome 22.
        1 ggctgggaag tggggtactt ggtgccgggt ctgtatgtgt gtgtgactgg tgtgtgtgag
       61 agagaatgtg tgccctaagt gtcagtgtga gtctgtgtat gtgtgaatat tgtctttgtg
      121 tgggtgattt tctgcgtgtg taatcgtgtc cctgcaagtg tgaacaagtg gacaagtgtc
      181 tgggagtgga caagagatct gtgcaccatc aggtgtgtgc atagcgtctg tgcatgtcaa
      241 gagtgcaagg tgaagtgaag ggaccaggcc catgatgcca ctcatcatca ggagctctaa
      301 ggccccaggt aagtgccagt gacagataag ggtgctgaag gtcactctgg agtgggcagg
      361 tgggggtagg gaaagggcaa ggccatgttc tggaggaggg gttgtgacta cattagggtg
      421 tatgagccta gctgggaggt ggatggccgg gtccactgaa accctggtta tcccagaagg
      481 ctttgcaggc ttcaggagct tggagtgggg agagggggtg acttctccga ccaggcccct
      541 ccaccggcct accctgggta agggcctgga gcaggaagca ggggcaagaa cctctggagc
      601 agcccatacc cgccctggcc tgactctgcc actggcagca cagtcaacac agcaggttca
      661 ctcacagcag agggcaaagg ccatcatcag ctccctttat aagggaaggg tcacgcgctc
      721 ggtgtgctga gagtgtcctg cctggtcctc tgtgcctggt ggggtggggg tgccaggtgt
      781 gtccagagga gcccatttgg tagtgaggca ggtatggggc tagaagcact ggtgcccctg
      841 gccgtgatag tggccatctt cctgctcctg gtggacctga tgcaccggcg ccaacgctgg
      901 gctgcacgct actcaccagg ccccctgcca ctgcccgggc tgggcaacct gctgcatgtg
      961 gacttccaga acacaccata ctgcttcgac caggtgaggg aggaggtcct ggagggcggc
     1021 agaggtgctg aggctcccct accagaagca aacatggatg gtgggtgaaa ccacaggctg
     1081 gaccagaagc caggctgaga aggggaagca ggtttggggg acttcctgga gaagggcatt
     1141 tatacatggc atgaaggact ggattttcca aaggccaagg aagagtaggg caagggcctg
     1201 gaggtggagc tggacttggc agtgggcatg caagcccatt gggcaacata tgttatggag
     1261 tacaaagtcc cttctgctga caccagaagg aaaggccttg ggaatggaag atgagttagt
     1321 cctgagtgcc gtttaaatca cgaaatcgag gatgaagggg gtgcagtgac ccggttcaaa
     1381 ccttttgcac tgtgggtcct cgggcctcac tgctcaccgg catggaccat catctgggaa
     1441 tgggatgcta actggggcct ctcggcaatt ttggtgactc ttgcaaggtc atacctgggt
     1501 gacgcatcca aactgagttc ctccatcaca gaaggtgtga cccccacccc cgccccagga
     1561 tcaggaggct gggtctcctc cttccacctg ctcactcctg gtagccccgg gggtcgtcca
     1621 aggttcaaat aggactagga cctgtagtct ggggggatcc tggcttgaca agaggccctg
     1681 accctccctc tgcagttgcg gcgccgcttc ggggacgtgt tcagcctgca gctggcctgg
     1741 acgccggtgg tcgtgctcaa tgggctggcg gccgtgcgcg aggcgatggt gacccgcggc
     1801 gaggacacgg ccgaccgccc gcctgtgccc atcacccaga tcctgggttt cgggccgcgt
     1861 tcccaaggca agcagcggtg gggacagaga cagatttccg tgggacccgg gtgggtgatg
     1921 accgtagtcc gagctgggca gagagggcgc ggggtcgtgg acatgaaaca ggccagcgag
     1981 tggggacagc gggccaagaa accacctgca ctagggaggt gtgagcatgg ggacgagggc
     2041 ggggcttgtg acgagtgggc ggggccactg ccgagacctg gcaggagccc aatgggtgag
     2101 cgtggcgcat ttcccagctg gaatccggtg tcgaagtggg gggcggggac cgcacctgtg
     2161 ctgtaagctc agtgtgggtg gcgcggggcc cgcggggtct tccctgagtg caaaggcggt
     2221 cagggtgggc agagacgagg tgggcaaagc cctgccccag ccaagggagc aaggtggatg
     2281 cacaaagagt gggccctgtg accagctgga cagagccagg gactgcggga gaccaggggg
     2341 agcatagggt tggagtgggt ggtggatggt ggggctaatg ccttcatggc cacgcgcacg
     2401 tgcccgtccc acccccaggg gtgttcctgg cgcgctatgg gcccgcgtgg cgcgagcaga
     2461 ggcgcttctc cgtctccacc ttgcgcaact tgggcctggg caagaagtcg ctggagcagt
     2521 gggtgaccga ggaggccgcc tgcctttgtg ccgccttcgc caaccactcc ggtgggtgat
     2581 gggcagaagg gcacaaagcg ggaactggga aggcggggga cggggaaggc gaccccttac
     2641 ccgcatctcc cacccccaag acgccccttt cgccccaacg gtctcttgga caaagccgtg
     2701 agcaacgtga tcgcctccct cacctgcggg cgccgcttcg agtacgacga ccctcgcttc
     2761 ctcaggctgc tggacctagc tcaggaggga ctgaaggagg agtcgggctt tctgcgcgag
     2821 gtgcggagcg agagaccgag gagtctctgc agggcgagct cccgagaggt gccggggctg
     2881 gactggggcc tcggaagagc aggatttgcg tagatgggtt tgggaaagga cattccagga
     2941 gaccccactg taagaagggc ctggaggagg aggggacatc tcagacatgg tcgtgggaga
     3001 ggtgtgcccg ggtcaggggg caccaggaga ggccaaggac tctgtacctc ctatccacgt
     3061 cagagatttc gattttaggt ttctcctctg ggcaaggaga gagggtggag gctggcactt
     3121 ggggagggac ttggtgaggt cagtggtaag gacaggcagg ccctgggtct acctggagat
     3181 ggctggggcc tgagacttgt ccaggtgaac gcagagcaca ggagggattg agaccccgtt
     3241 ctgtctggtg taggtgctga atgctgtccc cgtcctcctg catatcccag cgctggctgg
     3301 caaggtccta cgcttccaaa aggctttcct gacccagctg gatgagctgc taactgagca
     3361 caggatgacc tgggacccag cccagccccc ccgagacctg actgaggcct tcctggcaga
     3421 gatggagaag gtgagagtgg ctgccacggt ggggggcaag ggtggtgggt tgagcgtccc
     3481 aggaggaatg aggggaggct gggcaaaagg ttggaccagt gcatcacccg gcgagccgca
     3541 tctgggctga caggtgcaga attggaggtc atttgggggc taccccgttc tgtcccgagt
     3601 atgctctcgg ccctgctcag gccaagggga accctgagag cagcttcaat gatgagaacc
     3661 tgcgcatagt ggtggctgac ctgttctctg ccgggatggt gaccacctcg accacgctgg
     3721 cctggggcct cctgctcatg atcctacatc cggatgtgca gcgtgagccc atctgggaaa
     3781 cagtgcaggg gccgagggag gaagggtaca ggcgggggcc catgaacttt gctgggacac
     3841 ccggggctcc aagcacaggc ttgaccagga tcctgtaagc ctgacctcct ccaacatagg
     3901 aggcaagaag gagtgtcagg gccggacccc ctgggtgctg acccattgtg gggacgcatg
     3961 tctgtccagg ccgtgtccaa caggagatcg acgacgtgat agggcaggtg cggcgaccag
     4021 agatgggtga ccaggctcac atgccctaca ccactgccgt gattcatgag gtgcagcgct
     4081 ttggggacat cgtccccctg ggtgtgaccc atatgacatc ccgtgacatc gaagtacagg
     4141 gcttccgcat ccctaaggta ggcctggcgc cctcctcacc ccagctcagc accagcccct
     4201 ggtgatagcc ccagcatggc tactgccagg tgggcccact ctaggaaccc tggccaccta
     4261 gtcctcaatg ccaccacact gactgtcccc acttgggtgg ggggtccaga gtataggcag
     4321 ggctggcctg tccatccaga gcccccgtct agtggggaga caaaccagga cctgccagaa
     4381 tgttggagga cccagcgcct gcagggagag ggggcagtgt gggtgcctct gagaggtgtg
     4441 actgcgccct gctgtggggt cggagagggt actgtggagc ttctcgggcg caggactagt
     4501 tgacagagtc cagctgtgtg ccaggcagtg tgtgtccccc gtgtgtttgg tggcaggggt
     4561 cccagcatcc tagagtccag tccccactct caccctgcat ctcctgccca gggaacgaca
     4621 ctcatcacca acctgtcatc ggtgctgaag gatgaggccg tctgggagaa gcccttccgc
     4681 ttccaccccg aacacttcct ggatgcccag ggccactttg tgaagccgga ggccttcctg
     4741 cctttctcag caggtgcctg tggggagccc ggctccctgt ccccttccgt ggagtcttgc
     4801 aggggtatca cccaggagcc aggctcactg acgcccctcc cctccccaca ggccgccgtg
     4861 catgcctcgg ggagcccctg gcccgcatgg agctcttcct cttcttcacc tccctgctgc
     4921 agcacttcag cttctcggtg cccactggac agccccggcc cagccaccat ggtgtctttg
     4981 ctttcctggt gaccccatcc ccctatgagc tttgtgctgt gccccgctag aatggggtac
     5041 ctagtcccca gcctgctccc tagccagagg ctctaatgta caataaagca atgtggtagt
     5101 tccaactcgg gtcccctgct cacgccctcg ttgggatcat cctcctcagg gcaaccccac
     5161 ccctgcctca ttcctgctta ccccaccgcc tggccgcatt tgagacaggg gtatgttgag
     5221 gctgagcaga tgtcagttac ccttgcccat aatcccatgt cccccactga cccaactctg
     5281 actgcccaga ttggtgacaa ggactacatt gtcctggcat gtggggaagg ggccagaatg
     5341 ggctgactag aggtgtcagt cagccctgga tgtggtggag agggcaggac tcagcctgga
     5401 ggcccatatt tcaggcctaa ctcagcccac cccacatcag ggacagcagt cctgccagca
     5461 ccatcacaac agtcacctcc cttcatatat gacaccccaa aac
//
LOCUS       CHKCOLCARB   1394 bp ss-mRNA            VRT       03-JUL-1990
DEFINITION  Chicken cartilage alpha-1(IX) collagen-proteoglycan mRNA, 5' end,
            clone 7 and 13.
ACCESSION   M28659 J05129
KEYWORDS    IX collagen-proteoglycan; extracellular matrix protein.
SOURCE      Chicken 17 day old embryo cartilage, cDNA to mRNA, clones 7 and 13.
  ORGANISM  Gallus domesticus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 1394)
  AUTHORS   Nishimura,I., Muragaki,Y. and Olsen,B.R.
  TITLE     Tissue-specific forms of type IX collagen-proteoglycan arise from
            the use of two widely separated promoters
  JOURNAL   J. Biol. Chem. 264, 20033-20041 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            B.R.Olsen, 12-SEP-1989, for release after publication.
FEATURES       from  to/span     description
    pept        147  >  1394     alpha(IX) collagen-proteoglycan
BASE COUNT      372 a    341 c    358 g    323 t
ORIGIN      
        1 tccctccccg ctgactgcgt ggggcaggag gagcattctg cacccattca tactctcgtt
       61 aacaggactt atgacaggga accagagagt gtgaatatat acaccaaata ttcacatgtg
      121 agacgtgaag aaaaccagca gagaagatga aaagcaactg gaaaattaca gctttcttgt
      181 atatgtgtag ttttctgggg tctttcatct cagctaccta ccagcaacaa tcaagattgc
      241 cagtcattct gggtgctcgt caaagaactg atctctgccc aacaatcagg attggcgaag
      301 atgacttgcc aggctttgac ctgatttctc agttccagat agaaaaagct gcttctcaag
      361 gaattgtcca gagagtagtg ggttctactg ctctacaagt ggcttataaa ttgggaccca
      421 atgtagactt caggattcca accagtgcaa tatattccaa tggattgcct gatgaatact
      481 cctttcttac tacttttcgg atgactggag ccacacttca gaaatactgg actatttggc
      541 agattcagga ttcttcagga aaagaacaag ttggagtgaa tctcaatggt ccaatgaaaa
      601 gcgttgagtt ttcttataaa ggagtggatg gaagtctcca gactgcatca tttttacatt
      661 tgcctttctt gtttgattcc caatggcaca agcttatgat aagtgtggaa acaaccagcg
      721 ttacactttt tattgactgt ataaaggtag aaaccctaaa cataaaacca aaggggaaaa
      781 tcagtgttga tggcttctca gtgcttggaa gactcaaaaa taatcctcaa atttcagttc
      841 cgtttgaagt ccagtggatg ccgattcact gcgatcccct gcggccccag agagaaggtt
      901 gtggtgagct cccagcccgg ataagccaga cagtgattga gagaggtctt cctggtccac
      961 caggcccccc aggtccacca gggccaccag gagttcctgg cattgatggc atcgatggag
     1021 agagaggacc taacggcccc cccggtccac cgggtccgga cggcgacgca ggcaaagcgg
     1081 gatccccggg cctgcctgga gagccaggag ctgatgggtt aacaggccct gatggatcac
     1141 caggtgccac aggaccgaaa ggacagaagg gtgagccagg acctccaggt gctcgtggac
     1201 ttccgggcaa gggtcttctt ggaccacccg gtccagctgg tgctgcagga cttcccggtg
     1261 aagtaggccg tgctggccca cctggtgatc caggaaaaag gggaccacca ggaccaccag
     1321 gaccaccagg ccctcgagga acaattggtc tgcaagacgg tgacccattg tgtcccaatg
     1381 cttgtccacc tggc
//
LOCUS       CHKCOLCARC    776 bp ss-mRNA            VRT       03-JUL-1990
DEFINITION  Chicken cartilage alpha-1(IX) collagen-proteoglycan mRNA, 5' end,
            clone YM43.
ACCESSION   M28660 J05129
KEYWORDS    IX collagen-proteoglycan; extracellular matrix protein.
SOURCE      Chicken 17 day old embryo cartilage, cDNA to mRNA, clone YM43.
  ORGANISM  Gallus domesticus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 776)
  AUTHORS   Nishimura,I., Muragaki,Y. and Olsen,B.R.
  TITLE     Tissue-specific forms of type IX collagen-proteoglycan arise from
            the use of two widely separated promoters
  JOURNAL   J. Biol. Chem. 264, 20033-20041 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            B.R.Olsen, 12-SEP-1989, for release after publication.
FEATURES       from  to/span     description
    pept        255  >   776     alpha(IX) collagen-proteoglycan
BASE COUNT      119 a    319 c    235 g    103 t
ORIGIN      
        1 gaattcccga cacccccacc tgcatcaccc cccccccatc tcgcagtccc tcgcccccat
       61 caaagcccct ttgtgccacc tccgtcgcca cccggcccca gaatagcagc acgctcacct
      121 gcaggggggg tcggagccag cgcctgccct cgtcccccgc tgctccatat taatcagccc
      181 cttcctcctc ctcctcctcc tcctcctcct gccggtccct ccgcagtccg acacttacag
      241 ccccgctccc ggccatggcc caccgcagcc ccgcgctctg cctgctgctc ctgcacgctg
      301 cctgcctctg cctggcccag ctccgggggc caccaggaga gcccggccca cgagggcccc
      361 caggtccgcc aggagtgccg ggagcggatg gcattgatgg tgacaaaggc tctcccggag
      421 cccccggctc cccaggtgcc aaaggggagc ccggagcccc gggtccggat gggcctccag
      481 ggaagccagg cttagacggt cttacgggag ccaaagggag ccggggccca tggggggggc
      541 aaggactgaa gggtcagcct ggactgccgg ggccgccggg gctccccggt ccctcgctgc
      601 caggaccacc cgggctgcca ggccaggtcg gactgcccgg ggagatcgga gtgccaggac
      661 ccaagggcga tcctggaccc gatggcccac ggggcccccc gggtccccca gggaaacccg
      721 gccccccagg acacatccaa ggagtggagg gaagcgcaga tttcttgtgc ccgacc
//
LOCUS       CHKCOLCOR     602 bp ss-mRNA            VRT       03-JUL-1990
DEFINITION  Chicken cornea alpha-1(IX) collagen-proteoglycan mRNA, 5' end.
ACCESSION   M28658 J05129
KEYWORDS    IX collagen-proteoglycan; extracellular matrix protein.
SOURCE      Chicken 8 day old embryo cornea, cDNA to mRNA, clone IN212.
  ORGANISM  Gallus domesticus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 602)
  AUTHORS   Nishimura,I., Muragaki,Y. and Olsen,B.R.
  TITLE     Tissue-specific forms of type IX collagen-proteoglycan arise from
            the use of two widely separated promoters
  JOURNAL   J. Biol. Chem. 264, 20033-20041 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            B.R.Olsen, 12-SEP-1989, for release after publication.
FEATURES       from  to/span     description
    pept         89  >   602     alpha(IX) collagen-proteoglycan
BASE COUNT      121 a    175 c    186 g    120 t
ORIGIN      
        1 tgcagctgaa aggtgaactg ggctgtaagg cacattttgg atttctgtgt attgtagcac
       61 ctaggtggct gcaaaatctg tccccactat ggcctgggct gcatggggcc ctctgcttct
      121 cgggcttttc ttgcagattt tttgcctctg ccttgctcaa agaggtcttc ctggtccacc
      181 aggcccccca ggtccaccag ggccaccagg agttcctggc attgatggca ttgatggaga
      241 gagaggacct aacggccccc ccggtccacc gggtccggac ggcgacgcag gcaaagcggg
      301 atccccgggc ctgcctggag agccaggagc tgatgggtta acaggccctg atggatcacc
      361 aggtgccaca ggaccgaaag gacagaaggg tgagccagga cctccaggtg ctcgtggacc
      421 tccgggcaag ggtcttcttg gaccacctgg tccagctggt gctgcaggac ttcccggtga
      481 agtaggccct gctggcccac ctggtgatcc aggaaaaagg ggaccaccag gaccaccagg
      541 accaccaggc cctcgaggaa caattggtct gcaagatggt gacccattgt gtcccaatgc
      601 tt
//
LOCUS       CHKCOLG1      840 bp ds-DNA             VRT       03-JUL-1990
DEFINITION  Chicken cartilage alpha-1(IX) collagen-proteoglycan gene, exon 1,
            and cornea alpha-1(IX) collagen-proteoglycan gene, 5' flank.
ACCESSION   M28662 J05129
KEYWORDS    IX collagen-proteoglycan; extracellular matrix protein.
SEGMENT     1 of 2
SOURCE      Chicken DNA.
  ORGANISM  Gallus domesticus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 840)
  AUTHORS   Nishimura,I., Muragaki,Y. and Olsen,B.R.
  TITLE     Tissue-specific forms of type IX collagen-proteoglycan arise from
            the use of two widely separated promoters
  JOURNAL   J. Biol. Chem. 264, 20033-20041 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            B.R.Olsen, 12-SEP-1989, for release after publication.
FEATURES       from  to/span     description
    pept        807  /   820     cartilage alpha(IX) collagen-proteoglycan,
                                 exon 1
    pre-msg     661  >   840     cartilage alpha(IX) collagen-proteoglycan
    IVS         821  >   840     cartilage alpha(IX) collagen-proteoglycan
                                 intron A
BASE COUNT      249 a    202 c    157 g    232 t
ORIGIN      
        1 ccacccgtga gaattcctca agtgaaaatg caaatgaaca gaaattataa attgttcaga
       61 aactgagtat atgttctcca aatttctctg aacgaggccc ctctctttgg aaagtataat
      121 gtgtgtgtga ataacaactg aacaacagga gtcctcttag taatgcctat gtgcattcct
      181 tgaaaaggtt caagtttaag cagtaaaagt ccttttaaat aattggtttt attcagaaga
      241 atcaactagg acactaccag ataggcttct ccagagacct tctgatggat aaatcaacaa
      301 gaactgaaaa tatcttcttt ataggactga tgttcttttc ttgtgaaagt ttttagcttt
      361 aacaccacag tgaagccacc agtttccaca aaatcccttg gtacatgtta ttattctttt
      421 atctgcctca ctgaacagtg cccctgccat ttggtgactg gcatcgctta actcatatag
      481 tgttaatctt tctaccctga tgtcggcata agcagcaccc ctttcttcac tctcttggct
      541 tctttatatt cagctggctc cagagatccg ccctcagacc ccaccaggat acagacgtct
      601 gtccagcccc cacctccttc cctttgcaag attaaaacca acccagcagc ctgcacctcc
      661 ctccccgctg agtcctgcgt ggggcaggag gagcattctg cacccattca tactctcgtt
      721 aacaggactt atgacaggga accagagagt gtgaatatat acaccaaata ttcacatgtg
      781 agacgtgaag aaaaccagca gagaagatga aaagcaactg gtaagagaac aagtgggatt
//
LOCUS       CHKCOLG2      840 bp ds-DNA             VRT       03-JUL-1990
DEFINITION  Chicken cartilage alpha-1(IX) collagen-proteoglycan gene, exons 6
            and 7, and cornea alpha-1(IX) collagen-proteoglycan gene, exon 1.
ACCESSION   M28661 J05129
KEYWORDS    IX collagen-proteoglycan; extracellular matrix protein.
SEGMENT     2 of 2
SOURCE      Chicken DNA, clones 13 and 26.
  ORGANISM  Gallus domesticus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 840)
  AUTHORS   Nishimura,I., Muragaki,Y. and Olsen,B.R.
  TITLE     Tissue-specific forms of type IX collagen-proteoglycan arise from
            the use of two widely separated promoters
  JOURNAL   J. Biol. Chem. 264, 20033-20041 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            B.R.Olsen, 12-SEP-1989, for release after publication.
            
            The first amino acid for the open reading frame for exon 6 which is
            indicated in the features as starting at nucleotide 9 could start
            at nucleotide 11.
FEATURES       from  to/span     description
    pept        636  /   707     cornea alpha(IX) collagen-proteoglycan, exon 1
    pept    /     9       86     cartilage alpha(IX) collagen-proteoglycan,
                                 exon 6 (AA at 9)
                750  /   770     cartilage alpha(IX) collagen-proteoglycan,
                                 exon 7
    pre-msg     541  >   840     cornea alpha(IX) collagen-proteoglycan mRNA
                                 and introns
    pre-msg <     1  >   840     cartilage alpha(IX) collagen-proteoglycan mRNA
                                 and introns
    IVS     <     1        8     cartilage alpha(IX) collagen-proteoglycan
                                 intron E
    IVS          87      749     cartilage alpha(IX) collagen-proteoglycan
                                 intron F
    IVS         771  >   840     cartilage alpha(IX) collagen-proteoglycan
                                 intron G
    IVS         708  >   840     cornea alpha(IX) collagen-proteoglycan intron
                                 A
BASE COUNT      181 a    214 c    222 g    223 t
ORIGIN      
        1 cctaccagtt tgaagtccag tggatgctga ttcactgcga tcccctgcgg ccccagagag
       61 aaggttgtgg tgagctccca gcccgggtga cccgcgttcc cagcctgaca gtgctgaact
      121 gggctgccac taaatctatg aagttcacag gagcttcatt tttccccgtc tatgtccaga
      181 gaagtctatt tcaccatacc tgactgaaat ttggtgcctt tagcaatcca gccccctgga
      241 gtagcagcct tactttaact cttccatgcc ttcctatctt ttccttctca gccagtgcta
      301 gggtcagagg cttttgaaag atatccctga cagcgaagag agactgctgt ctccttgcag
      361 actcctgggc aacctgaggg agggaaaccc ttgcctggga ggtgagggag ggtgccaaaa
      421 caacagcgag cagggcaaag ggttaaaggt actgctgtca ttcaatcctc ttcctcccag
      481 ccttcagctc tcctccaatc ccacgaccct ctcccaggca gttaataagg aactgtgagg
      541 ggtgccttgc agctgaaagg tgaactgggc tgtaaggcac attttggatt tctgtgtatt
      601 gtagcaccta ggtggctgca aaatctgtcc ccactatggc ctgggctgca tggggccctc
      661 tgcttctcgg gcttttcttg cagatttttt gcctctgcct tgctcaagta agtttattct
      721 gactttatac ctgtttttct cccttacaga taagccagac agtgattgag gtaagtgtga
      781 gggaagggat ggtgctgcat cgtaagggaa agggtttgga tgaagagggg ctgaaggctg
//
LOCUS       RATIRF1A     2078 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Rat interferon regulatory factor 1 (IRF-1) mRNA, complete cds.
ACCESSION   M34253
KEYWORDS    interferon regulatory factor 1; transcription factor.
SOURCE      Rat cell line Nb2-11c T-cell, cDNA to mRNA, clones 25,4b.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2078)
  AUTHORS   Yu-Lee,L.-Y., Hrachovy,J.A., Stevens,A.M. and Schwarz,L.A.
  TITLE     Interferon regulatory factor 1 is an immediate-early gene under
            transcriptional regulation by prolactin in Nb2 T cells
  JOURNAL   Mol. Cell. Biol. 3087, 3094 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by L.-Y.Yu-Lee, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        198     1184     interferon regulatory factor 1 (IRF-1)
BASE COUNT      542 a    544 c    540 g    452 t
ORIGIN      
        1 ctcgacgaag gagtaggacg agctctcact gtctgagcca aaccgaaccg ggccgagctg
       61 agccgaggtc agcggtggcc agaggaaccc agcatctcgg gcatcattcg ctccgtgcac
      121 gcatcgtgta cctacaccgc aactccgtgc ctcattcccg ggtaccctct gtgactcgct
      181 cctgcagcaa agccaccatg cctatcactc ggatgcgaat gagaccctgg ctagagatgc
      241 agattaattc caaccaaatt ccagggctga gctggatcaa taaagaagag atgatcttcc
      301 agatcccatg gaagcatgct gccttgcacg gttgggatat caacaaggat gcctgtctgt
      361 tccggagctg ggccattcac acaggccgat acaaagctgg ggaaaaagag ccagatccca
      421 agacttggaa ggcaaacttc cggtgtgcca tgaactccct accagacatc gaggaagtga
      481 aggaccagag caggaacaag ggcagctctg ctgtacgcgt gtaccggatg ctgccacccc
      541 tcaccaagaa ccagaggaaa gagagaaagt ccaagtccag ccgtgacact aagagcaaaa
      601 ccaagaggaa gctgtgcgga gattctagcc ctgacacctt atctgacgga ctgagcagct
      661 ctactctgcc tgatgaccac agcagttaca cagctcaggg atacctgggt caggacttgg
      721 acatggacag ggacattacc ccagctctgt caccgtgcgt cgtcagcagc agtctctctg
      781 agtggcatat gcagatggac atcatgccag acagcaccac tgatctgtac aacttgcagg
      841 tgtcgcccat gccctccacc tctgaagctg caacagatga ggatgaggaa gggaagttac
      901 ctgaggacat catgaagctc tttgaacagt ctgagtggca gccgacgcac gtggatggca
      961 agggatactt gctcaatgaa ccaggagccc aactctctac tgtctatgga gacttcagct
     1021 gcaaggagga accagagatc gacagccctg gaggggacat cgagataggc atacagcgtg
     1081 tcttcacaga gatgaagaat atggaccccg tcatgtggat ggacaccctg ctgggcaact
     1141 ctaccaggcc gccctccatt caggctattc cttgtgcacc ataatttggg tccctgaccc
     1201 gttcttgccc tcctgagtga gctaggtcca gcatcatggt ggctgtgata caacataaag
     1261 ctaaacttcc gtggacccct tgatgtggca aaacataatc ccattgccaa gcagggaagg
     1321 gaccaaacca tcctccttgg gtcagtggac tgactcttca gagcttagga ggcagggtct
     1381 aagtttttca agctggtcct gactcctagg aagatggatt ggcgttctga ggttagtgtg
     1441 aggcagagga cctggacgga agttaccttc tagctctttg aaagcttcat tgcttagaga
     1501 gggtctcacc actgggctgg cctgggggat agaccagcgc ccacagaaga gcattgcact
     1561 ggccttaggg ctggctccac actgggagac aattgcacta agtcctattc ccaaagaact
     1621 gctgcccttc ccaaccgagc cctgggatgg ttctagagcc agtgaaatgt gaaggaaaaa
     1681 atggggtcct gtgagggttg tctcccttag cctcagaggg attctgcctc actccctgct
     1741 ccagctgtgg ggctcaggaa aaaaaaatgg cactttctct gtggactttg ccacatttct
     1801 gatcagaagt gtacactaac atttctccca agtcttggcc tttgcattta tttatatagt
     1861 gccttgccct gtgcctgctg tctctcctca ggcctcagca gtcctcagca ggcccaggga
     1921 gggggttgtg agcgccttgg cgtgactctg aacattggaa acgccaccta actactaagt
     1981 tgtgtctgat ctcgtgtgga tctgtgtaaa tatgtatatt catcttttta taaaaaccta
     2041 agttgtttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS       HUMINTB1A    1146 bp ss-mRNA            PRI       03-JUL-1990
DEFINITION  Human integrin beta-1 subunit mRNA, 3' end (cytoplasmic domain).
ACCESSION   M34189
KEYWORDS    integrin; integrin beta-1 subunit cytoplasmic domain.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1146)
  AUTHORS   Altruda,F., Cervella,P., Tarone,G., Botta,C., Balzac,F.,
            Stefanuto,G. and Silengo,L.
  TITLE     A human integrin beta-1 subunit with a unique cytoplasmic domain
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by F.Altruda, 10-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      117     integrin beta-1 subunit (AA at 1) (cytoplasmic
                                 domain)
BASE COUNT      373 a    184 c    222 g    367 t
ORIGIN      
        1 aagcttttaa tgataattca tgacagaagg gagtttgcta aatttgaaaa ggagaaaatg
       61 aatgccaaat gggacacggt aagttacaaa acatccaaaa agcaaagtgg cttataaagt
      121 aaatgtaata ctcctaagac ttatgtatta gctgtcaggc tgattattaa agtcctttct
      181 aagtatttta ttcccccaaa agtttcttac tcaaggaatt tgcatttagt gaaaaacaga
      241 aagcatccta aatatatccc attgaaacaa aacattgatt ataagcatgt atattctggt
      301 tcatgtggcc gatattttta tttctttaat gattttgatc ctaaatctgc cttttcatct
      361 aatgtgaagt agaatcctaa ataatgttat ctgtgtagca agctattcaa tgggaaagct
      421 gcttctttct ttaaaacaaa caaacaaaaa aaaccttcag tggaaagcca aattccaaaa
      481 ggttatatac caagcttgtc caactcgcag ctcgtcggcc aggacatgca gcccagaata
      541 gctttgaatg tggccccaac acaaatttgt aaactttctt agaaattgta attattatta
      601 ttattttttt ttggtaactt tttttaaagc tcatcagcta tcgttagtgt attttatgtg
      661 tggcccaaga cagttcttct tcttgccagt gtggcccagg gaagccaaaa gattggacac
      721 ccctgctata tactatatga ttccatttag aggacattct ggaaaagcaa aactgtaggg
      781 gcaaaaatca gtggttgcta ggggctggaa tgggggaaag tgttgaccac agaggggcgt
      841 aagggatctt ccttgggatg acttgattgt gggtggattt atgtatttga aaactcacag
      901 aactatgtac tttaaaaaga tgtatgttcc tctatgaaaa ttatatctca gtaaactttg
      961 gcttataaaa atcttaaaag ccctaagtga ccgaaaggtt atgttagcat tgagtgcttt
     1021 gaaatatgga gtcagagggt ggggtaacca aatgttggcc tttgtgtatt catcttttga
     1081 tacaagaaag caatgccaat cttcagtatt tttaaattgt aaatgaattt tgtagttccc
     1141 gaattc
//
LOCUS       NEUAMTR      5928 bp ds-DNA             PLN       03-JUL-1990
DEFINITION  N.crassa mating type protein gene, complete cds.
ACCESSION   M33876
KEYWORDS    mating type protein.
SOURCE      N.crassa (strain 74-ORS-A) DNA.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 5928)
  AUTHORS   Grotelueschen,J., Metzenberg,R.L. and Glass,N.L.
  TITLE     The Neurospora crassa A mating type region
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by N.L.Glass, 26-APR-1990.
FEATURES       from  to/span     description
    pept       4121     4283     mating type protein, exon 1
               4343     5046     mating type protein, exon 2
    IVS        4284     4342     mating type protein intron A
    signal     3858     3906     GC signal
    signal     3945     3949     CAAT box
    signal     5319     5326     poly-A signal
BASE COUNT     1470 a   1459 c   1526 g   1473 t
ORIGIN      
        1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg
       61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga
      121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct
      181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct
      241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag
      301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa
      361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga
      421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa
      481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg
      541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg
      601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg
      661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc
      721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg
      781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc
      841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg
      901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc
      961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat
     1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa
     1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc
     1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg
     1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt
     1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa
     1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc
     1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca
     1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat
     1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac
     1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca
     1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta
     1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga
     1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt
     1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa
     1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac
     1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc
     1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc
     2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg
     2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga
     2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg
     2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca
     2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca
     2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc
     2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca
     2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat
     2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag
     2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc
     2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca
     2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg
     2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag
     2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg
     2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata
     2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg
     3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca
     3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca
     3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt
     3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt
     3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca
     3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag
     3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac
     3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta
     3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca
     3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac
     3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg
     3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca
     3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt
     3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt
     3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc
     3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa
     3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc
     4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt
     4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat
     4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc
     4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt
     4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat
     4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag
     4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat
     4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg
     4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg
     4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg
     4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag
     4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat
     4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt
     4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct
     4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc
     4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat
     4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg
     5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca
     5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc
     5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg
     5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg
     5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg
     5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg
     5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca
     5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa
     5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct
     5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg
     5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag
     5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg
     5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc
     5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc
     5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac
//
LOCUS       HUMNCADH     3451 bp ss-mRNA            PRI       03-JUL-1990
DEFINITION  Human N-cadherin mRNA, complete cds.
ACCESSION   M34064
KEYWORDS    N-cadherin; cell adhesion molecule; transmembrane protein.
SOURCE      Human muscle, cDNA to mRNA, clones lambda-[4-10,1-5,13,14].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3451)
  AUTHORS   Walsh,F.S., Barton,C.H., Putt,W., Moore,S.E., Kesell,D.,
            Spurr,N. and Goodfellow,P.N.
  TITLE     The N-cadherin gene maps to human Chromosome 18 and is not linked
            to the E-cadherin gene
  JOURNAL   J. Neurochem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by C.H.Barton, 08-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1     2247     N-cadherin (AA at 1)
BASE COUNT     1041 a    703 c    724 g    983 t
ORIGIN      Chromosome 18.
        1 gactgggtca tccctccaat caacttgcca gaaaactcca ggggaccttt tcctcaagag
       61 cttgtcagga tcaggtctga tagagataaa aacctttcac tgcggatacg tgtaactggg
      121 ccaggagctg accagcctcc aactggtatc ttcattctca accccatctc gggtcagctg
      181 tcggtgacaa agcccctgga tcgccagcag aatgcccggt ttcatttagg ggcacatgca
      241 gtagatatta atggaaatca agtggagacc cccattgaca ttgtcatcaa tgttattgac
      301 atgaatgaca acagacctga gttcttacac caggtttgga atgggacagt tcctgaggga
      361 tcaaagcctg gaacatatgt gatgaccgta acagcaattg atgctgacga tcccaatgcc
      421 ctcaatggga tgttgaggta cagaatcgtg tctcaggctc caagcacccc ttcacccaac
      481 atgtttacaa tcaacaatga gactggtgac atcatcacag tggcagctgg acttgatcga
      541 gaaaaagtgc aacagtatac gttaataatt caagctacag acatggaagg caatcccaca
      601 tatggccttt caaacacagc cacggccgtc atcacagtga cagatgtcaa tgacaatcct
      661 ccagagttta ctgccatgac gttttatggt gaagttcctg agaacagggt agacatcata
      721 gtagctaatc taactgtgac cgataaggat caaccccata caccagcctg gaacgcagtg
      781 tacagaatca gtggcggaga tcctactgga cggttcgcca tccagaccga cccaaacagc
      841 aacgacgggt tagtcaccgt ggtcaaacca atcgactttg aaacaaatag gatgtttgtc
      901 cttactgttg ctgcagaaaa tcaagtgcca ttagccaagg gaattcagca cccgcctcag
      961 tcaactgcaa ccgtgtctgt tacagttatt gacgtaaatg aaaaccctta ttttgccccc
     1021 aatcctaaga tcattcgcca agaagaaggg cttcatgccg gtaccatgtt gacaacattc
     1081 actgctcagg acccagatcg atatatgcag caaaaatatt taagatacac taaattatct
     1141 gatcctgcca attggctaaa aatagatcct gtgaatggac aaataactac aattgctgtt
     1201 ttggaccgag aatcaccaaa tgtgaaaaac aatatatata atgctacttt ccttgcttct
     1261 gacaatggaa ttcctcctat gagtggaaca ggaacgctgc agatctattt acttgatatt
     1321 aatgacaatg cccctcaagt gttacctcaa gaggcagaga cttgcgaaac tccagacccc
     1381 aattcaatta atattacagc acttgattat gacattgatc caaatgctgg accatttgct
     1441 tttgatcttc ctttatctcc agtgactatt aagagaaatt ggaccatcac tcggcttaat
     1501 ggtgattttg ctcagcttaa tttaaagata aaatttcttg aagctggtat ctatgaagtt
     1561 cccatcataa tcacagattc gggtaatcct cccaaatcaa atatttccat cctgcgcgtg
     1621 aaggtttgcc agtgtgactc caacggggac tgcacagatg tggacaggat tgtgggtgcg
     1681 gggcttggca ccggtgccat cattgccatc ctgctctgca tcatcatcct gcttatcctt
     1741 gtgctgatgt ttgtggtatg gatgaaacgc cgggataaag aacgccaggc caaacaactt
     1801 ttaattgatc cagaagatga tgtaagagat aacattttaa aatatgatga agaaggtgga
     1861 ggagaagaag accaggacta tgacttgagc cagctgcagc agcctgacac tgtggagcct
     1921 gatgccatca agcctgtggg aatccgacga atggatgaaa gacccatcca cgccgagccc
     1981 cagtatccgg tccgatctgc agccccacac cctggagaca ttggggactt cattaatgag
     2041 ggccttaaag cggctgacaa tgaccccaca gctccaccat atgactccct gttagtgttt
     2101 gactatgaag gcagtggctc cactgctggg tccttgagct cccttaattc ctcaagtagt
     2161 ggtggtgagc aggactatga ttacctgaac gactgggggc cacggttcaa gaaacttgct
     2221 gacatgtatg gtggaggtga tgactgaact tcagggtgaa cttggttttt ggacaagtac
     2281 aaacaatttc aactgatatt cccaaaaagc attcagaagc taggctttaa ctttgtagtc
     2341 tactagcaca gtgcctgctg gaggctttgg cataggctgc aaaccaattt gggctcagag
     2401 ggaatatcag tgatccatac tgtttggaaa aacactgagc tcagttacac ttgaatttta
     2461 cagtacagaa gcactgggat tttatgtgcc tttttgtacc tttttcagat tggaattagt
     2521 tttctgttta aggctttaat ggtactgatt tctgaaacga taagtaaaag acaaaatatt
     2581 ttgtggtggg agcagtaagt taaaccatga tatgcttcaa cacgcttttg ttacattgca
     2641 tttgctttta ttaaaataca aaattaaaca aacaaaaaaa ctcatggagc gattttatta
     2701 tcttggggga tgagaccatg agattggaaa atgtacatta cttctagttt tagactttag
     2761 tttgtttttt ttttttttca ctaaaatctt aaaacttact cagctggttg caaataaagg
     2821 gagttttcat atcaccaatt tgtagcaaaa ttgaattttt tcataaacta gaatgttaga
     2881 cacattttgg tcttaatcca tgtacacctt tttatttctg tatttttcca cttcactgta
     2941 aaaatagtat gtgtacataa tgttttattg gcatacgtct atggagaagt gcagaaactt
     3001 cagaacatgt gtatgtatta tttggactat ggattcaggt tttttgcatg tttatatctt
     3061 tcgttatgga taaagtattt acaaaacagt gacatttgat tcaattgttg agctgtagtt
     3121 agaatactca atttttaatt tttttaattt ttttattttt tattttcttt ttggtttggg
     3181 gagggagaaa agttcttagc acaaatgttt tacataattt gtaccaaaaa aaaaaaaaaa
     3241 ggaaaggaaa gaaaggggtg gcctgacact ggtggcacta ctaagtgtgt gtttttttaa
     3301 aaaaaaaatg gaaaaaaaaa agcctttaaa ctggagagac ttctgacaac agctttgcct
     3361 ctgtattgtg taccagaata taaatgatac acctctgacc ccagcgttct gaataaaatg
     3421 ctaattttgg ataacaaaaa aaggggaatt c
//
LOCUS       MHVNSGII      870 bp ss-RNA             VRL       03-JUL-1990
DEFINITION  Murine hepatitis virus non-structural protein gene-2 (NS2).
ACCESSION   M34035
KEYWORDS    non structural protein.
SOURCE      Murine hepatitis virus (strain MHV-JHM), cDNA to viral RNA.
  ORGANISM  Murine hepatitis virus A59
            Unclassified.
REFERENCE   1  (bases 1 to 870)
  AUTHORS   Schwarz,B., Routledge,E. and Siddell,S.G.
  TITLE     The coronavirus MHV 30 kDa non-structural protein NS2 is not
            essential for virus replication in transformed murine cells
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.G.Siddell, 04-MAY-1990.
            
              Author address: S.G.Siddell
              Inst of Virology
              Univ of Wuerzburg
              Versbacherstrasse 7
              8700 Wuerzburg
            
FEATURES       from  to/span     description
    pept         40      837     non-structural protein-2 (NS2)
BASE COUNT      269 a    147 c    187 g    267 t
ORIGIN      
        1 gcgatagcct agtaaatgtt aaataaatct atacttgtca tggctgcgag aatggccttt
       61 gctgacaagc ctaatcattt tataaacttt cctctagccc aatttagtgg ctttatgggt
      121 aagtatttaa agcttcagtc tcaacttgtg gaaatgggtt tggactgtaa attacaaaag
      181 gtaccacatg ttagtattac cctgcttgac attaaagcag accaatacaa acaggtggaa
      241 tttgcaatac aagaaataat agatgatctg gcggcatatg agggagatat tgtctttgac
      301 aaccctcata tgcttggcag atgtcttgtt cttgatgtta aaggatttga agagttgcat
      361 gaagatattg ttgaaattct ccgcagaagg ggttgcactg cagatcaatc cagacaatgg
      421 attccgcact gcactgtggc ccaatttgat gaagaaaaag aaataaaaga aatgcaattc
      481 tattttaaat tgcccttcta tctcaagcat aacaacctac ttacggatgc taggcttgag
      541 cttgtgaaga taggttcttc caaagtaggt gggttttatt gtagtgaact aagtatttgg
      601 tgtggtgaga gactttgtta caagccccca acccccaaat tcagtgatat atttggctat
      661 tgctgcatag ataaaatacg tggtgattta gaaataggag acctaccgcc agatgatgag
      721 gaagcgtggg ccgagctaag ttaccactat caaagaaaca cctacttctt cagacatgtg
      781 cacgataata gtatctattt tcgtaccgta tgtagaatga agggttgtat gtgttgattt
      841 gtttttacac tattagtgta ataaacttat
//
LOCUS       MCAMV6       1904 bp ds-DNA             VRL       03-JUL-1990
DEFINITION  Cauliflower mosaic virus (CaMV) gene six protein gene, complete
            cds.
ACCESSION   M23620
KEYWORDS    gene six protein.
SOURCE      Cauliflower mosaic virus (strain D4) DNA.
  ORGANISM  Cauliflower mosaic virus
            Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE   1  (sites)
  AUTHORS   Daubert,S. and Routh,J.
  TITLE     Determinants of symptomatology in the DNA sequence CaMV
  JOURNAL   mol plant microb interact (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1904; for [1])
  AUTHORS   Daubert,S. and Routh,J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1], [2] kindly submitte
            by S.Daubert, 08-AUG-1989, for release after publication.
FEATURES       from  to/span     description
    pept          1     1563     gene six protein
    signal     1629     1635     TATA box
    signal     1547     1550     CCAT enhancer 1
BASE COUNT      645 a    450 c    379 g    430 t
ORIGIN      bps 5774 to 7678 of genome.
        1 atggagaaca tagaaaaact cctcatgcaa gagaaaatac taatgctaga gctcgatcta
       61 gtaaaagcaa aaataagctt agcaagagct aacggctctt cgcaacaagg agaactctct
      121 ctccaccgtg aaacaccgga aaaagaagaa gcagttcatt ctgcactggc cacttttacg
      181 ccaacccaag taaaagctat tccagagcaa acggctcctg gtaaagaatc aacaaatccg
      241 ttgatggcta gtatcttgcc aaaagatatg aattcagttc agactgaaat taggctcaaa
      301 aggccatcgg acttcttacg tccttatcag ggaatttcaa tcccacaaaa atctgagctt
      361 aacagcacag ttactcttca cggagtagaa tcgggtattc aacaccctca tatcaactac
      421 tacgttgtgt ataacggtcc acacgccggt atatacgatg actggggttg tacaaaggcg
      481 gcaacaaacg gcgttcccgg agttgcacaa aagaagtttg ccactattac agaggcaaga
      541 gcagcagctg acgcatacac aacaagtcag caaacagaca ggttgaactt catccccaaa
      601 ggagaagctc aactcaagcc caagagcttt gcgaaggcct taaccagccc atcaaagcaa
      661 aaagcccact ggctcacgct aggaaccaaa aggcccagca gtgatccagc cccaaaagag
      721 atctcctttg ccccggagat caccatggac gactttctct atctctacga tctaggaaga
      781 aagttcgacg gagaaggtga cgataccatg ttcaccactg ataatgagaa gattagcctc
      841 ttcaatttca gaaagaatgc tgacccacag atggttagag aggcctacgc agcaggtctc
      901 atcaagacga tctacccgag caataatctc caggagatca aataccttcc caagaaggtt
      961 aaagatgcag tcaaaagatt caggactaac tgcatcaaga acacagagaa agatatattt
     1021 ctcaagatca gaagtactat tccagtatgg acgattcaag gcttgcttca taaaccaagg
     1081 caagtaatag aaattggagt ctctaagaaa gtagttccta ctgaatcaaa ggccatggag
     1141 tcaaaaattc agatcgagga tctaacagaa ctcgccgtga agactggcga acagttcata
     1201 cagagtcttt tacgactcaa tgacaagaag aaaatcttcg tcaacatggt ggagcacgac
     1261 actctcgtct actccaagaa tatcaaagat acagtctcag aagaccaaag ggctattgag
     1321 acttttcaac aaagggtaat atcgggaaac ctcctcggat tccattgccc agctatctgt
     1381 cacttcatcg aaaggacagt agaaaaggaa ggtggcacct acaaatgcca tcattgcgat
     1441 aaaggaaagg ctatcattca agatgcctct accgacagtg gtcccaaaga tggaccccca
     1501 cccacgagga gcatcgtgga aaaagaagac gttccaacca cgtcttcaaa gcaagtggat
     1561 tgatgtgaca tctccactga cgtaagggat gacgcacaat cccactaccc ttcgcaagac
     1621 ccttcctcta tataaggaag ttcatttcat ttggagagga cacgctgaaa tcaccagtct
     1681 ctctctacaa gactatctct ctctattttc tccagaataa tgtgtgagta gtttcccgat
     1741 aagggaatta gggttcttat agggtttcgc tcatgtgttg agcatataag aaacccttag
     1801 tatgtatttg tatttgtaaa atacttctat caataaaatt tctaattcct aaaaccaaaa
     1861 tccagtacta aaatccagat ctcctaaagt ccctatagat cttt
//
LOCUS       CREAPCYN      577 bp ss-mRNA            PLN       03-JUL-1990
DEFINITION  C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds.
ACCESSION   J05524
KEYWORDS    apoplastocyanin.
SOURCE      C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone
            PC6-2.
  ORGANISM  Chlamydomonas reinhardtii
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Volvocales; Chlamydomonadaceae.
REFERENCE   1  (bases 1 to 577)
  AUTHORS   Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and
            Bogorad,L.
  TITLE     Isolation and characterization of a complementary DNA clone for an
            algal pre-apoplastocyanin
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by K.Hill, 11-MAY-1990.
FEATURES       from  to/span     description
    pept         22      459     apoplastocyanin (PC6-2) precursor
    sigp         22      162     apoplastocyanin signal peptide
    matp        163      456     apoplastocyanin
BASE COUNT       91 a    189 c    171 g    126 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc
       61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt
      121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg
      181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag
      241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc
      301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc
      361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac
      421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat
      481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac
      541 gcagagagcg tgtagctctg taactttttc ggaattc
//
LOCUS       RATSVPIIA    4161 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Rat seminal vesicle secretion II protein (SVS II) gene, complete
            cds.
ACCESSION   J05443
KEYWORDS    seminal vesicle secretion II protein.
SOURCE      Rat (strain CHARLES RIVER) male seminal vesicle epithelial cell
            DNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4161)
  AUTHORS   Harris,S.E., Harris,M.A., Johnson,C.M., Bean,M.F., Dodd,J.G.,
            Matusik,R.J., Carr,S.A. and Crabb,J.W.
  TITLE     Structural characterization of the rat seminal vesicle secretion II
            protein and gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.E.Harris,  25-APR-1990.
FEATURES       from  to/span     description
    pept       2065     2137     seminal vesicle secretion II protein (SVS II)
                                 precursor, exon 1
               2377     3548     seminal vesicle secretion II protein
                                 precursor, exon 2
    sigp       2065     2130     seminal vesicle secretion II protein signal
                                 peptide
    matp       2131     3545     seminal vesicle secretion II protein
    signal     1934     1946     CAAT box
    signal     2013     2020     TATA box
BASE COUNT     1243 a    902 c    822 g   1194 t
ORIGIN      
        1 tttcgatcca atgtgtggat tactcaccaa gtgtctgtct ttctttcttt ctctctttct
       61 ctctttcttc ctcccttcct tccttccttc cttccttcct tccttccttc cttccttcct
      121 ttctgttcaa ttgctcgttt ctcccttcat ctctcgccag tataccgcac actcaaactt
      181 aaattttcat ttcaatgcgt tctcttctgg cacgtgcagc ataattacac tcatgattgt
      241 caactccgtg atctgtttgc acaccttacc ccccccccca aggttttatc tgcatttaaa
      301 aaaaagggat tatcaagaaa tttattctta attcagaaat gtgatcaaag ttgtcagatc
      361 cgttctttac tgcctcctgt tggaaaaaaa aaatatccag ttcctggatt tttctaaaac
      421 acagaaaaga gacctgggac aggggtatag gattgagcag gcatggtgag caattttata
      481 ctgaatagat tcattgtgac ttaccggtct cctgagggaa ataatcactt ttcccaggta
      541 gagagcagcc tagcaagaga tcagagtgca agcataaaac ccatgtgctt tataagtgta
      601 tttattttat gcattttctg tttataagga catgagtgga ctttttattt gtcccttcca
      661 tacaggacta cctagactat tgggatggga tgactgaaaa tatgttttca agtagacttc
      721 cttccggaac taccttcata tggttctgaa ggcaaagtgg aacactgcac gggtgtcctc
      781 ttctcccaag aacttggcca tggcgtcgtc gttttgagtc tatgtctgag ccacgaatgc
      841 cataacagcc cttcctgtta ctctcacagt ggcacagagc tgtttctaaa caagaaggaa
      901 gtcttccatc ttgtgtcagg atgctaatga cgtcaccaat ggcagtaagt gttcaccaca
      961 gcccgttgct aaggcaatta tgttatccct cctgtcagag tttcctgtat taaaatatac
     1021 tgagtttaat tttatgtcgg attccatgac atacattcag caaggaaacc aacagtatct
     1081 tttgttcttt caacagtgat ttcctgtcac catttaactg ttgtctcgcc cccattcttt
     1141 aaaatgtctc tgcacctcac cttgcctccc agatacactc ccaaactcat ttccctggac
     1201 acacttgaaa tgttgctgct agcaagccac agctaccacg tcttctctgt cagggttcta
     1261 gacaactcat ctaaagcagc accaggtctc tcttagaaat cagacatcgg atgtcatggt
     1321 catagtatac ctcacagcta ctttggacat tcatgggccc agtattattt tccagggctg
     1381 aggtttaact caagagcctc atgctcacat ggctggtggt ctggccacac agctatgact
     1441 cgtctccatt tattcttcaa acttttattc ggagctccgt tgtgttctgt tgtctcctgt
     1501 gcctttctat atgtgtgact gctcctttgc ctgtaaatga gaagctatgt caaattcaac
     1561 gtaaaaaagg caacttcatg ggcttctgtg agatagcatg ctaaacagtg tcagctccac
     1621 tacactgtga ccaggaaaat ttgatcaggc cctggttact ctcggagcat aaaagaagaa
     1681 aaaaaaatct cttccccgct ctactctgga ttttgtttga aaataaaagg tccaatctgt
     1741 ccttataaaa catgcataga ataaatatac tagaaaacac actttgtttg caaagggtat
     1801 gtgataaagt cagagggttg ataaagattt gctgaggctt atgacataga aaaggtccct
     1861 gacattgcat ccctgtgcaa agtacctggg aacattacca atgtccccaa ctgtgcagag
     1921 gggaggaagt tgacatttag agataatttt tttaaaaaag caggcagtgc ttttgtagtg
     1981 tcagttatat ctgtaataca tccagctaga gatatataaa tgtgaaagtc agctcagctc
     2041 tcagtgaagg tccttcttga caagatgaag tcctctgtct tcattctatc tctgttcctc
     2101 cttctggaaa gacaggcagc tgtggttgga cagtatggtg agtagggaga tggtgactag
     2161 agggaaagtc actcagggag aatgttttta agggtgctct gggagtagca gatcctttca
     2221 taggggaatt tttttttaaa tgagacctaa ttcttctcta ctgaaaacca aaacccttgt
     2281 gggaacatca atggttttat gaggaaattt tggaaatgag acttggaagg actgtgcaga
     2341 tcatgtaact taaaccttcc tcctctcaat taccaggtgg gacaaaaggt cacttccaga
     2401 gcagctcatc agggtttatg cttggtcaga aaggccacct caattttggg ctcaaaggag
     2461 gaagtgagga agcagctgaa gaaagcattt tcatgcaatc acaacaccag atgttcggcc
     2521 aggatggtgg tgacatggcg cagacaagtg tttcacaaga gcatacaggt gtaaaggggg
     2581 ccgcgatttg tcgtaaagga caagtatccc aattgaaatc ccaagaatcc caaataaaat
     2641 cctttagaca agtaaaatcc agtggacagc tgaaatctgg aggatcccaa ttaaaatcct
     2701 ttggacaagt gaaatccagt gagtcccaat taaaatcctt tggccaagtg aaagccagtg
     2761 ggtcccaatt aaaatccttc ggacaagtga aagccagtgg gtcccaatta aaatcctatg
     2821 gacaaatgaa atccagtggg tcccaagtga aatcctttgg acaaatgaaa tccagtgggt
     2881 cccaagtaaa atcctttgga caaatgaaag ccagtgagtc ccaaataaaa tcctttggac
     2941 aaagaaaatc ccaaggtggt caactacaat cctatggcca aatgaaatcc tatgggcaga
     3001 cgaaatccct agaatcccag gccaaatcct tcggacaagt aaagtcccaa agtggccaaa
     3061 tgaaatcctc ctatggtcag agaaaatcct atggtgaaga gactcaactg aagtctttcg
     3121 accaagatgc ccaactaaaa tcctatggtc aacaaaaatc ccaaaaacaa tcctccttta
     3181 gccaagtaaa atctcaaagt gcccaactaa agtcctttgg ccaacaaaaa tccctcaaag
     3241 ggttttctca acaaactcaa cagaaaggat ttgccatgga tgaagatttg tcacaagtgc
     3301 ggaaacaatt tgacgatgat gacctctctg tacaacagaa gtctacccaa cagatgaaaa
     3361 cagaggaaga cttatcccaa tttggacaac aacgacaatt tggacaagaa cgctcccaat
     3421 cctataaagg atatcttgca caatacagaa agaaattaca ggaacaacaa caacagaaaa
     3481 attttaatca ggataacttt tttacaaagg gaggggcagg cctatatcag gctcaactta
     3541 agggataaca tattcactga gcaactgaag accaagatca atgtcaaggt atgttccacc
     3601 aagtaggaag atattatcca aatttacttg tggtatatag gaatcctgga tccattatgg
     3661 attgataccc atttgttact atcagtagaa gtattgttac acacttttag aaggatgaag
     3721 aacagaccct ggtaaaatga gtccttgtag agtaaaggca gagtaagcaa gctaagttaa
     3781 caattggtcc tgaattacta cattcaggga gcacttttca gtgcttctct gagcacagac
     3841 agtttatatt attaaatgtg taccacctat gcaatcatat ttaacatttc atgatggaat
     3901 cttatttgtt cttacacttt gacttgataa aaaaaaaaga ttggtttctt gcttatattg
     3961 gtataaggtg gtattgcagc tgagctcttt ctctacacca gtgcgtgttc ttgagtcccc
     4021 tgggacctct gctttccatc acaatccatg gggttaagga ttagctgctt ttccatcaga
     4081 tggaagattt ggttacaaag atctctgcct ggagcagaca ctatattcag ttgtatgtcc
     4141 aatggtgacc ctgttgaatt c
//
LOCUS       CODCPRRKA      94 bp ss-RNA             RNA       03-JUL-1990
DEFINITION  Codium fragile chloroplast 4.5S RNA, complete cds.
ACCESSION   M35276 M15192
KEYWORDS    4.5S RNA.
SOURCE      C.fragile chloroplast RNA.
  ORGANISM  Chloroplast Codium fragile
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Brypsidales; Codiaceae; Codium fragile.
REFERENCE   1  (bases 1 to 94)
  AUTHORS   Francis,M.A., Balint,R.F. and Dudock,B.S.
  TITLE     A novel variety of 4.5 S RNA from Codium fragile chloroplasts
  JOURNAL   J. Biol. Chem. 262, 1848-1854 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    RNA           1       94     4.5 S RNA
BASE COUNT       35 a     11 c     14 g     34 t
ORIGIN      
        1 aagtcctagt tgctataaat tcttaaatca aattatgtca gatttttaat aaaaagcagc
       61 atttgtattt gaaaattgtt taggaactag gcac
//
LOCUS       HAMAPBRBD    2339 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Hamster apolipoprotein (apoB) gene, partial cds
            (LDL receptor-binding domain).
ACCESSION   M35187
KEYWORDS    apolipoprotein B.
SOURCE      Hamster DNA.
  ORGANISM  Mesocricetus auratus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 2339)
  AUTHORS   Smith,T.J., Hautamaa,D. and Maeda,N.
  TITLE     Sequence of the putative low-density lipoprotein receptor-binding
            regions of apolipoprotein B in mouse and hamster
  JOURNAL   Gene 87, 309-310 (1990)
  STANDARD  simple staff_review
COMMENT     Phone call to T.J.Smith on 26-JUN-1990 made sequence clarifications
            on line 4 and line 17 of printed sequence.  The hamster sequence
            should be 1 bp to the left on these lines.
FEATURES       from  to/span     description
    pept    <     1  >  2339     apolipoprotein (apoB) (LDL receptor-binding
                                 domain) (AA at 1)
BASE COUNT      725 a    519 c    450 g    645 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccagc ttcctcgcct ctcacacaca attgagatac ctgcttttgg cagacttcat
       61 ggaatcctga aaatccagtc tcccctcttt atattagatg caaatgccaa catacagaat
      121 gtaactactt tagagaacaa agcagagatt gtggcctcca tcgctgctac aggagagtcc
      181 gaaattgaag ctctcaattt tgattttcaa gcacaagctc aattcttgga gctaaaccct
      241 aatcctctga tcctgaagga atccatgaac ttctccagca agcatgcgag aatggagcat
      301 gagggtgaga tactattttc tggaaagttc attgagggaa aattggacac ggtcgcaagt
      361 ttacagacag agaaaaatat ggtggagttt aataatggta tgattgtcaa gataaacaat
      421 ccaatcatcc ttgacagtca cacaaagtat tttcacaagt tgagtatccc caggctggac
      481 ttctccagta aggcttcctt taacaatgaa atcaagatgc tattagaagc tggacatgta
      541 gcatggactt cttcagggac tgggtcatgg aattgggcct gtcccaactt ctcagatgag
      601 ggcacacatt cgtccaaaat tagcttcact gtagaaggac ccattgcttt ttttggcttg
      661 tctaataaca tcaacggcaa acacctgagg gttatccaga aattggctta tgaatctggc
      721 ttcctcaact attccatgtt ggaagttgag tcaaaagttg aatctcagca tgtgggttcc
      781 agcattctaa ctggcaaggg aacggtactg ctcagggagg caaaggcaga aatgactggc
      841 gagcacaatg ctgacttgaa tggaaaagtt attgggactt tgaaaaactc tctttccttt
      901 tcagcacaac catttatgat tactgcatcc acaaataatg atgggaattt gaaagttagt
      961 tttccactaa agttgactgg gaaaatagac ttcctgaata actatgcact atttttgagt
     1021 cctcatgccc agcaagcaag ctggcaagtg agtgctaggt tcaatcagta caaatataat
     1081 caaaattttt ctgctataaa caatgaacat aacatagaag cccatgtagg aatgaatgga
     1141 gatgccaacc tggatttctt aaccatacct ctaacaattc ctgaagtgaa actaccttac
     1201 atagggctca cgactccctt gctgaaggat ttctccatat gggaagaaac aggcttgaaa
     1261 gaatttttga agacaacaaa gcaatcgttt gatttaagtg taaaagctca atataaaaag
     1321 aacagagaca ggcattccat tgcgattcct ctgaatgggt tttatgagtt tattctcaac
     1381 aatgtcgact ccgggatagg gaagattggg aaagtcagag acagcgcatt agactatctt
     1441 atttcatcct ataatgaagc aaaaaacaag tttgaaaatt cccttattca gccctccagg
     1501 acctttcaaa agcgtggata cactatccca tttgtcaaca ttgaagtgac tccattcact
     1561 gtagagacac tggcctccag ccatgtgatc ccaaaagcaa taaatacccc cagtgttcac
     1621 attctgggcc ctaatgtcat tgtgccttca tacaggttag tgctgccctc cctggagctg
     1681 ccagtccttc gtgtccccag gaatctactc aagttttccc tcccagattt caaggaattg
     1741 agaacaattg acaatattta tattccagct cttggcaatt ttacctatga tttttccttt
     1801 aaatcaagtg tcatcacgct gaataccaac gttggacttt ataaccggtc agacatcgtt
     1861 gctcatttcc tttcttcctc ttcatttgtc acggatgccc tgcagtacaa attagagggt
     1921 acttcacgtc tgactcggaa aagaggattg aagctagcca cagccgactc tctcactaac
     1981 aaatttgtaa agggcaatca tgatagcacc tttagcttaa ccaagaaaaa catggaagca
     2041 tcagtgaaaa caactgcaaa cctccatgct cccattttaa caatgaactt caagcaggaa
     2101 cttaatggaa atgccaagtc aaagcccatt gtctcatcat ccattgaact aaactatgac
     2161 ttcaattcct caaagctgta ctctactgct aaaggaggtg ttgaccacaa gtttagctta
     2221 gaaagtctca cttcctactt ttccattgag tcatccacca aaggaaatat caagggatct
     2281 gtcctttccc aggaatattc aggaagtgtt gccagtgagg ccaacacata cctgaattc
//
LOCUS       MUSAPBRBD    2354 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Mouse apolipoprotein (apoB) gene, partial cds (LDL receptor-binding
            domain).
ACCESSION   M35186
KEYWORDS    apolipoprotein B.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2354)
  AUTHORS   Smith,T.J., Hautamaa,D. and Maeda,N.
  TITLE     Sequence of the putative low-density lipoprotein receptor-binding
            regions of apolipoprotein B in mouse and hamster
  JOURNAL   Gene 87, 309-310 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >  2354     apolipoprotein (apoB) (LDL receptor-binding
                                 domain) (AA at 1)
BASE COUNT      757 a    525 c    423 g    649 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccaac ttcctcacct ctcacataca attgaaatac ctgcttttgg caaactgcat
       61 agcatcctta agatccaatc tcctctcttt atattagatg ctaatgccaa catacagaat
      121 gtaacaactt cagggaacaa agcagagatt gtggcttctg tcactgctaa aggagagtcc
      181 caatttgaag ctctcaattt tgattttcaa gcacaagctc aattcctgga gttaaatcct
      241 catcctccag tcctgaagga atccatgaac ttctccagta agcatgtgag aatggagcat
      301 gagggtgaga tagtatttga tggaaaggcc attgagggga aatcagacac agtcgcaagt
      361 ttacacacag agaaaaatga agtagagttt aataatggta tgactgtcaa agtaaacaat
      421 cagctcaccc ttgacagtca cacaaagtac ttccacaagt tgagtgttcc taggctggac
      481 ttctccagta aggcttctct taataatgaa atcaagacac tattagaagc tggacatgtg
      541 gcattgacat cttcagggac agggtcatgg aactgggcct gtcccaactt ctcggatgaa
      601 ggcatacatt cgtcccaaat tagctttact gtggatggtc ccattgcttt tgttggacta
      661 tccaataaca taaatggcaa acacttacgg gtcatccaaa aactgactta tgaatctggc
      721 ttcctcaact attctaagtt tgaagttgag tcaaaagttg aatctcagca cgtgggctcc
      781 agcattctaa cagccaatgg tcgggcactg ctcaaggacg caaaggcaga aatgactggt
      841 gagcacaatg ccaacttaaa tggaaaagtt attggaactt tgaaaaattc tctcttcttt
      901 tcagcacaac catttgagat tactgcatcc acaaataatg aaggaaattt gaaagtgggt
      961 tttccactaa agctgactgg gaaaatagac ttcctgaata actatgcatt gtttctgagt
     1021 ccccgtgccc aacaagcaag ctggcaagcg agtaccagat tcaatcagta caaatacaat
     1081 caaaactttt ctgctataaa caatgaacac aacatagaag ccagtatagg aatgaatgga
     1141 gatgccaacc tggatttctt aaacatacct ttaacaattc ctgaaattaa cttgccttac
     1201 acggagttca aaactccctt actgaaggat ttctccatat gggaagaaac aggcttgaaa
     1261 gaatttttga agacaacaaa gcaatcattt gatttgagtg taaaggctca atataaaaag
     1321 aacagtgaca agcattccat tgttgtccct ctgggtatgt tttatgaatt tattctcaac
     1381 aatgtcaatt cgtgggacag aaaatttgag aaagtcagaa acaatgcttt acattttctt
     1441 accacctcct ataatgaagc aaaaattaag gttgataagt acaaaactga aaattccctt
     1501 aatcagccct ctgggacctt tcaaaatcat ggctacacta tcccagttgt caacattgaa
     1561 gtatctccat ttgctgtaga gacactggct tccaggcatg tgatccccac agcaataagc
     1621 accccaagtg tcacaatccc tggtcctaac atcatggtgc cttcatacaa gttagtgctg
     1681 ccacccctgg agttgccagt tttccatggt cctgggaatc tattcaagtt tttcctccca
     1741 gatttcaagg gattcaacac tattgacaat atttatattc cagccatggg caactttacc
     1801 tatgactttt cttttaaatc aagtgtcatc acactgaata ccaatgctgg actttataac
     1861 caatcagata tcgttgccca tttcctttct tcctcttcat ttgtcactga cgccctgcag
     1921 tacaaattag agggaacatc acgtctgatg cgaaaaaggg gattgaaact agccacagct
     1981 gtctctctaa ctaacaaatt tgtaaagggc agtcatgaca gcaccattag tttaaccaag
     2041 aaaaacatgg aagcatcagt gagaacaact gccaacctcc atgctcccat attctcaatg
     2101 aacttcaagc aggaacttaa tggaaatacc aagtcaaaac ccactgtttc atcatccatt
     2161 gaactaaact atgacttcaa ttcctcaaag ctgcactcta ctgcaacagg aggcattgat
     2221 cacaagttca gcttagaaag tctcacttcc tacttttcca ttgagtcatt caccaaagga
     2281 aatatcaaga gttccttcct ttctcaggaa tattcaggaa gtgttgccaa tgaagccaat
     2341 gtatatctga attc
//
LOCUS       RATBPTT      1035 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Rat beta-tachykinin mRNA, complete cds.
ACCESSION   M35277 M15191
KEYWORDS    neurokinin A; substance P; tachykinin.
SOURCE      Rat (Sprague-Dawley) rostral portion of the caudate putamen, cDNA
            to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1035)
  AUTHORS   Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D.
  TITLE     Three rat preprotachykinin mRNAs encode the neuropeptides substance
            P and neurokinin A
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        100      492     beta-preprotachykinin
    sigp        100      156     beta-tachykinin signal peptide (3' end could
                                 be 171)
    matp        172      204     substance P
    matp        294      321     neurokinin
    mRNA    <     1     1035     beta-preprotachykinin mRNA
BASE COUNT      289 a    212 c    245 g    289 t
ORIGIN      
        1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc
       61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg
      121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat
      181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gcccgagccc
      241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta
      301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt
      361 tatgggcatg gtcagatctc tcacaaaagg cataaaacag attcctttgt tggactaatg
      421 ggcaaaagag ctttaaattc tgtggcttat gaaagaagcg caatgcagaa ctacgaaaga
      481 aggcgtaaat aaaccctgta acgcactatc tattcatctc catctgtgtc cgcgagcagt
      541 gagcggtaaa ataaaaatgt gcgctatgag gaatgattat ttatttaata tcaaatgttg
      601 ttatgagtga aaaactcaaa aaagtgttta ttttttcata ttgtgccaat aagcattgta
      661 attctaatgt ggtgacctcc tcagacagaa gtagaaatta gttgtaactt cagcaaagca
      721 cagtgttgat ggagttgtac aagtttgcca gcgatgcaag tctccaaaga cagaaaggct
      781 gctgtgaggc agtgcaggcg gctgctgctg gaggcagaga aactcctgtg tgtcttgcgc
      841 ttcccttggt tgcttttatc ctaatgatgt actgagagtt tggtatctga ctctatttgt
      901 atcctagcag catgtttcct gtgttgtgac tatatagaga tgtttttaaa agtttcaatg
      961 tacttctctg gtcttcagtc attgtatgat gtgttgtgat agctaccatt ttaaataaaa
     1021 gaatgtatct tcagg
//
LOCUS       CHPRGIT      1051 bp ds-DNA             PRI       03-JUL-1990
DEFINITION  Chimpanzee rRNA gene internal transcribed spacer 1 (ITS1).
ACCESSION   M30947
KEYWORDS    internal transcribed spacer.
SOURCE      Chimpanzee DNA.
  ORGANISM  Pan troglodytes
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE   1  (bases 1 to 1051)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  TITLE     Ribosomal RNA gene sequences and hominoid phylogeny
  JOURNAL   Mol. Biol. Evol. 7, 203-219 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.L.Gonzalez, 20-DEC-1989.
BASE COUNT       69 a    411 c    425 g    146 t
ORIGIN      Acrocentric chromosomes 14, 15, 17, 22, 23.
        1 acggagccga aggggggcgc gaggccgcgg cggcgccgcc gcgcgcttcc ctccccccca
       61 ccccgccgca acgcggcgcg tgcgcgggcg gggcccgtgt gccgttcgtt cgttcgttcg
      121 ctgcccggcc ccgccgccgc gagagccgag aactcgggag ggcgacgttg gggggagagc
      181 gagagagaca gaaagaaggg ggcgcgtgtt cgctgcgcgt gtcgtggggc cggcggggag
      241 cggtccccgg cctcgggccc gacggacgtg tgtgtcggcg ggcgcggggg cggttctcgg
      301 cggcgtcacg gcgggtttgg gggggggggt ctcggtgccc tcctccccgc cggggcccgt
      361 cgtccggccc cgccgcgcgc cggctccccg tcgtcggggc cgggccggat tcccgtcgcc
      421 gcctccgccg cgcgccgctc cgcgccaccg ggcacggccc cgctcgctct ccccggcctt
      481 cccgctaggg cgtctcgagg gtcgggggcc ggacgccggt ccccccctcc tcgtccgccc
      541 ccgccgtcca ggtacctagc gcgttccggc gcggaggttt aaagacccct tgggggatcg
      601 cccgtccgcc cgcgggtcgg gggcggtggt gggcccgcgg gggagtcccg tcgggagggg
      661 cccggcccct cccgcgcctc ccccgcggac tccgcccccg gccggggccg cgccgcctcg
      721 ccggctcggg tcgcggcggc cgtcgggtgg gggctttacc cggcggccgt cgcgtgcgcg
      781 cgtgccgcgc gtgtggcgtg cgccccgcgc cgtgggggcg ggaacccccc gggcgcctgt
      841 ggggtggtgt ccgcgctcgc ccctgcgtgg gcggcgcgcg cctccccgtg gtgtgaaacc
      901 ttccgacccc tctccggagt ccggtcccgt ttttgctgtc tctctggccg gcctgaggca
      961 accccctctc ctctgggggg gggggacgtg ccgcgccagg agggcctccc ggtgtgtttg
     1021 tcgggagcgc cctcgccaaa tcgacctcgt a
//
LOCUS       CHPRGITX     2512 bp ds-DNA             PRI       03-JUL-1990
DEFINITION  Chimpanzee 28S ribosomal RNA gene fragment.
ACCESSION   M30950
KEYWORDS    28S ribosomal RNA.
SOURCE      Chimpanzee DNA.
  ORGANISM  Pan troglodytes
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE   1  (bases 193 to 261; 375 to 945; 1042 to 1079; 1334 to 1357;
            1742 to 1958; 2190 to 2204)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  TITLE     Ribosomal RNA gene sequences and hominoid phylogeny
  JOURNAL   Mol. Biol. Evol. 7, 203-219 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2512)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by I.L.Gonzalez, 20-DEC-1989.
FEATURES       from  to/span     description
    rRNA    <     1  >  2512     28S ribosomal RNA
BASE COUNT      397 a    837 c    924 g    354 t
ORIGIN      Chromosomes 14, 15, 17, 22, 23.
        1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg
       61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg
      121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga
      181 cccctccacc cgccctccct tccccccgcc gcccctcctc ctcctccccg gagggggcgg
      241 gctccggcgg gtgcgggggt gggcgggcgg ggccgggggt ggggtcggcg ggggaccgtc
      301 ccccgaccgg cgaccggccg ccgccgggcg catttccacc gcggcggtgc gccgcgaccg
      361 gctccgggac ggctgggaag gcccggcggg gaaggtggct cggggggccc cgtccgtcct
      421 cctcctcccc ccccgtctcc gccccccggc cccgcgtcct cccccgggag ggcgcgcggg
      481 tcggggcggt ggcggcggcg gcggcggtgg cggcggtggc ggcgggaccg aaaccccccc
      541 cgagtgttac agccccccgg cagcagcact cgccgaatcc cggggccgag ggagcgagac
      601 ccgtcgccgc gctctccccc ctcccggcgc ccacccccgc ggggatatcc tccgcgaggg
      661 gggtctcccc cgcgggggcg cgccggcgtc tcctcgtggg ggggccgggc cacccctccc
      721 acggcgcgac cgctctccca cccctcctcc ccgcaacccc cctctcccgg cgacggggag
      781 ggccgcgcgc gggtcggggg gcggggcgga ctgtccccag tgcgccccgg gcgggtcgcg
      841 ccgtcgggcc cgggggaggt tctctcgggg ccacgcgcgc gtcccccgaa gagggggacg
      901 gcggagccga gcgcacgggg tcggcggcga tgtcggccac ccacccgacc cgtcttgaaa
      961 cacggaccaa ggagtctaac acgtgcgcga gtcgggggct cgcacgaaag ccgccgtggc
     1021 gcaatgaagg tgaaggccgg cgcgctcgcc ggccgaggtg ggatccgagg cctctccagt
     1081 ccgccgaggg cgcaccaccg gcccgtctcg cccgccgcgc cggggaggtg gagcacgagc
     1141 gcacgtgtta ggacccgaaa gatggtgaac tatgcctggg cagggcaagc cagaggaaac
     1201 tctggtggag gtccgtagcg gtcctgacgt gcaaatcggt cgtccgacct gggtataggg
     1261 gcgaaagact aatcgaacca tctagtagct ggttccctcc gaagtttccc tcaggatagc
     1321 tggcgctctc gcagacccga cgcacacccc cccacgcagt tttatccggt aaagcgaatg
     1381 attagaggtc ttggggccga aacgatctca acctattctc aaactttaaa tgggtaagaa
     1441 gcccggctcg ctggcgtgga gccggggtgg aatgcgagtg cctagtgggc cacttttggt
     1501 aagcagaact ggcgctgcgg gatgaaccga acgccgggtt aaggcgcccg atgccgacgc
     1561 tcatcagacc ccagaaaagg tgttggttga tatagacagc aggacggtgg ccatggaagt
     1621 cggaatccgc taaggagtgt gtaacaactc acctgccgaa tcaactagcc ctgaaaatgg
     1681 atggcgctgg agcgtcgggc ccatacccgg ccgtcgccgg cagtcgagag tggacgggag
     1741 cggcgggggc ggcgcgggcg tgtgcgcgcg cgcgtgtgtg cgtgtgtgtc ggagggcggc
     1801 ggcggtggcg gcgggggtgg ggtcctcccc ctcccccacg ccgcctcccc tcctcccacc
     1861 caccaccgcc gccgccaccc ccgctccccg cccccggagc cccgcggacg ctacgccgcg
     1921 acgagtagga gggccgctgc ggtgagcctt gaagcctagg gcgcgggccc gggtggagcc
     1981 gccgcaggtg cagatcttgg tggtagtagc aaatattcaa acgagaactt tgaaggccga
     2041 agtggagaag ggttccatgt gaacagcagt tgaacatggg tcagtcggtc ctgagagatg
     2101 ggcgagcgcc gttccgaagg gacgggcgat ggcctccgtt gccctcggcc gatcgaaagg
     2161 gagtcgggtt cagatccccg aatccggagt ggcggagatg ggcgccgcga ggcgtccagt
     2221 gcggtaacgc gaccgatccc ggagaagccg gcgggagccc cggggagagt tctcttttct
     2281 ttgtgaaggg cagggcgccc tggaatgggt tcgccccgag agaggggccc gtgccttgga
     2341 aagcgtcgcg gttccggcgg cgtccggtga gctctcgctg gcccttgaaa atccggggga
     2401 gagggtgtaa atctcgcgcc gggccgtacc catatccgca gcaggtctcc aaggtgaaca
     2461 gcctctggca tgttggaaca atgtaggtaa gggaagtcgg caagccggat cc
//
LOCUS       GORRGIT       987 bp ds-DNA             PRI       03-JUL-1990
DEFINITION  Gorilla rRNA gene internal transcribed spacer 1 (ITS1).
ACCESSION   M30948
KEYWORDS    internal transcribed spacer.
SOURCE      Gorilla DNA.
  ORGANISM  Gorilla gorilla
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE   1  (bases 1 to 987)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  TITLE     Ribosomal RNA gene sequences and hominoid phylogeny
  JOURNAL   Mol. Biol. Evol. 7, 203-219 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.L.Gonzalez, 20-DEC-1989.
BASE COUNT       65 a    398 c    390 g    134 t
ORIGIN      Chromosomes 22 and 23.
        1 acggagcgaa gggcgaggcc gcggcggtgg cgccgccgcg tgcttccctc ccccccaccg
       61 acgcggcgcg tgcgcgggcg gggcccgtgc cgttcgttcg ttcgttcgtt cgctgcccgg
      121 ccccgccgcc gcgagagccg aggactcggg agggagacgg ggggggagaa gagaaaggag
      181 gcctgtccgt gtgtgcgtgt cgtggggccg gccgcgctgg tgagcggcgg cgaggcctcc
      241 ccggccgcgg cccgacgacg tgtgtgtcgg cgggtgcggg ggcggttctc ggcggcgtca
      301 cggcgggttt ggggcctcgg tgccctcctc cccgccgggg cccgtcgtcc ggccccgccg
      361 ccggcccccc cgtcgtcggg gccggccggg ttcccgtcgc cgccgccgcc gccgccgtcg
      421 tcgcctccgc cgcgccaccg ggaccggccc cgctcgctct ccccggcctt cccgctaggg
      481 cgtctcgagg gtcgggggcc ggacgccggt ccccccctcc tcgtccgccc ctccccgccg
      541 ttccaggtac ctagcgcgtt ccggcgcgga ggtttaaaga ccccttgggg gatcgcccgt
      601 ccgccccgtg ggtcgggggc ggtgggcccg cgggggggtc ccgtcgggag gggcccggcc
      661 cctcccgcgc ctccaccgcg gactccgccc cccggccggg gccgcggcgg ccgtcgggtg
      721 ggggctttac ccggcggccg tgcgcccccg cgccgtgggg gcgggaaccc ccgggcgcct
      781 gtggggcgtg tcagcgctcg cccccgcgtg ggcgccgcgc ctccccgtgg tgtgaaacct
      841 tccgacccct ctccggagtc cggtcccgtt tgctgtccgt ctggccggcc tgaggcaacc
      901 ccccctcctc cgtggggggg gggggacgtg ccgcgccagg agggccctcc cggtgtcggg
      961 agcgccctcg ccaaatcgac ctcgtta
//
LOCUS       GORRGITX     2467 bp ds-DNA             PRI       03-JUL-1990
DEFINITION  Gorilla 28S ribosomal RNA gene fragment.
ACCESSION   M30951
KEYWORDS    28S ribosomal RNA.
SOURCE      Gorilla DNA.
  ORGANISM  Gorilla gorilla
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE   1  (bases 193 to 261; 375 to 944; 1041 to 1079; 1334 to 1354;
            1737 to 1913; 2145 to 2159)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  TITLE     Ribosomal RNA gene sequences and hominoid phylogeny
  JOURNAL   Mol. Biol. Evol. 7, 203-219 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2467)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by I.L.Gonzalez, 20-DEC-1989.
FEATURES       from  to/span     description
    rRNA    <     1  >  2467     28S ribosomal RNA
BASE COUNT      386 a    819 c    912 g    350 t
ORIGIN      Chromosomes 22 and 23.
        1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg
       61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg
      121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga
      181 cccctccacc cgccctccct tcccccgccg cccctcctcc tcctccccgg agggggcggg
      241 ctccggcggg tgcggggggt gggcgggcgg ggccgggggt ggggtcggcg ggggaccgtc
      301 ccccgaccgg cgaccggccg ccgccgggcg catttccacc gcggcggtgc gccgcgaccg
      361 gctccgggac ggctgggaag gcccggcggg gaaggtggct cggggggccc cgtccgtccg
      421 tccgtccgtc ctcctccccc gtctccgccc cccggccccg cgtcctccct cgggaagggg
      481 cgcgcgggtc ggggcggcgg cggcggcggt ggcggcggcg gcggcggcgg cgggaccgaa
      541 acccccccga gtgttacagc cccccggcca gccagccatc gccgaatccc ggggccgagg
      601 gagcgagacc cgtcgccgcg ctctcccccc tcccggcgcc cacccccgcg ggggtccccc
      661 gcgagggggt cccccgcggg ggcgcgccgg cgtctcctcg tgggggggcc gggccacccc
      721 tcccacggcg cgaccgctct cccacccctc gcttccccgc acactccccc ggcgacgggg
      781 tgccgcgcgc gggtcggggg gcggggcgga ctgtccccag tgcgccccgg gcgggtcgcg
      841 ccgtcgggcc cgggggaggt tctcccgggg ccacgcgcgc gtcccccgaa gagggggacg
      901 gcggagcgag cgcacggggt cggcggcgat gtcggctacc cacccgaccc gtcttgaaac
      961 acggaccaag gagtctaaca cgtgcgcgag tcgggggctc gcacgaaagc cgccgtggcg
     1021 caatgaaggt gaaggccggc gcgctcgccg gccgaggtgg gatcccgagg cctctccggt
     1081 ccgccgaggg cgcaccaccg gcccgtctcg cccgccgcgc cggggaggtg gagcacgagc
     1141 gcacgtgtta ggacccgaaa gatggtgaac tatgcctggg cagggcaagc cagaggaaac
     1201 tctggtggag gtccgtagcg gtcctgacgt gcaaatcggt cgtccgacct gggtataggg
     1261 gcgaaagact aatcgaacca tctagtagct ggttccctcc gaagtttccc tcaggatagc
     1321 tggcgctctc gcagacccct cctccccccc acgcagtttt atccggtaaa gcgaatgatt
     1381 agaggtcttg gggccgaaac gatctcaacc tattctcaaa ctttaaatgg gtaagaagcc
     1441 cggctcgctg gcgtggagcc gggtggaatg cgagtgcctg tgggccactt ttggtaagca
     1501 gaactggcgc tgcgggatga accgaacgcc gggttaaggc gcccgatgcc gacgctcatc
     1561 agaccccaga aaaggtgttg gttgatatag acagcaggac ggtggccatg gaagtcggaa
     1621 tccgctaagg agtgtgtaac aactcacctg ccgaatcaac tagccctgaa aatggatggc
     1681 gctggagcgt cgggcccata cccggccgtc gccggcagtc gagagtggac gggagcggcg
     1741 ggggcggcgc gcgcgcgcgc gtgtggggtc ggagggcggc gtgtgggcgg tggggtcctc
     1801 gcccccctcc cccgcgcctc ccctcctccc acccccgctc cccgcccccg ggagccccgc
     1861 ggacgctacg ccgcgacgag taggagggcc gctgcggtga gccttgaagc ctagggcgcg
     1921 ggcccgggtg gagccgccgc aggtgcagat cttggtggta gtagcaaata ttcaaacgag
     1981 aactttgaag gccgaagtgg agaagggttc catgtgaaca gcagttgaac atgggtcagt
     2041 cggtcctgag agatgggcga gcgccgttcc gaagggacgg gcgatggcct ccgttgccct
     2101 cggccgatcg aaagggagtc gggttcagat ccccgaatcc ggagtggcgg agatgggcgc
     2161 cgcgaggcgt ccagtgcggt aacgcgaccg atcccggaga agccggcggg agccccgggg
     2221 agagttctct tttctttgtg aagggcaggg cgccctggaa tgggttcgcc ccgagagagg
     2281 ggcccgtgcc ttggaaagcg tcgcggttcc ggcggcgtcc ggtgagctct cgctggccct
     2341 tgaaaatccg ggggagaggg tgtaaatctc gcgccgggcc gtacccatat ccgcagcagg
     2401 tctccaaggt gaacagcctc tggcatgttg gaacaatgta ggtaagggaa gtcggcaagc
     2461 cggatcc
//
LOCUS       ORARGIT      1070 bp ds-DNA             PRI       03-JUL-1990
DEFINITION  Orangutan rRNA gene internal transcribed spacer 1 (ITS1).
ACCESSION   M30949
KEYWORDS    internal transcribed spacer.
SOURCE      Orangutan DNA.
  ORGANISM  Pongo pygmaeus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE   1  (bases 1 to 1070)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  TITLE     Ribosomal RNA gene sequences and hominoid phylogeny
  JOURNAL   Mol. Biol. Evol. 7, 203-219 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.L.Gonzalez, 20-DEC-1989.
BASE COUNT       60 a    438 c    438 g    134 t
ORIGIN      Chromosomes 11, 12, 13, 14, 15, 16, 17, 22, 23.
        1 acggagcgaa gagcgaggcc cgcggcggcg ccgccgcggc gtccttcctc gtcggccggc
       61 cggccgcgtt tctcccccgc ttcccgcggc gcgtgcgcgg gcggggcccg tgccgttcgc
      121 gcgcacgcgc gggcgtgcgt gcgtgcgtcg cccggccccg ccggccgcga gagccggaga
      181 acctcgggag ggagagagag gggggagaga gagagcggtg tgtgtgtgcg cgcgcgcgtg
      241 tctcgggggc ggccggcgcg gcggggagcg gtccccggcc gcggccccga cgtgtgtgtc
      301 ggcgggcgcg ggtgcggtcc tcggcggcgt cgcggcgggg tggggggtgt ctcggtgccc
      361 ctccccgccg gggcccgtcg tcccgtcccc gacccgccgg ctccgcgtcg ggggccggcc
      421 gggttcccgc cgcccccgtc gcctccgcca cgccgcgcca ccgggccggg ccggcccggc
      481 ccgccccgct cgctctcccc ggccttcccg ctagggcgtc tcgagggtcg ggggccggac
      541 gccggtcccc gcgcctcctc gtccgccccc ccctcccccc gccgtccagg tacctagcgc
      601 gttccggcgc ggaggtttaa agaccccttg ggggatcgcc cgtccgcccg tgggtcgggg
      661 gcggtgggcc cgcgtgggga gtcccgtcgg gaggggcccg gcccctcccg cgcctccacc
      721 gcggactccg cccccccggc cggggcgctg ccgccgccgc cgcggtcgcg gcggccgtcg
      781 ggtgggggct ttacccggcg gccgtcgtgc cgtccgtcgc gcgcgtgccc cgcgccgtgg
      841 gggcgggaac cccccgggcg cctgtggggt ggtgtccgcg ctcgcccccg cgtgggcggc
      901 gcgcgcctcc ccgtggtgtg cgacaccttc cgacccctct ccggagtccg gtcccgtttg
      961 ccgtctgact ggccggcctg aggcgacccc cccctgcggg ggggaagtgc cgcgccaggg
     1021 gcgagggcct cccggtgtgt cgggggcgcc ctcgcccgat cgagctcgta
//
LOCUS       ORARGITX     2487 bp ds-DNA             PRI       03-JUL-1990
DEFINITION  Orangutan 28S ribosomal RNA gene fragment.
ACCESSION   M30952
KEYWORDS    28S ribosomal RNA.
SOURCE      Orangutan DNA.
  ORGANISM  Pongo pygmaeus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE   1  (bases 193 to 263; 377 to 985; 1084 to 1120; 1376 to 1394;
            1780 to 1933; 2165 to 2179)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  TITLE     Ribosomal RNA gene sequences and hominoid phylogeny
  JOURNAL   Mol. Biol. Evol. 7, 203-219 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2487)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by I.L.Gonzalez, 20-DEC-1989.
FEATURES       from  to/span     description
    rRNA    <     1  >  2487     28S ribosomal RNA
BASE COUNT      393 a    814 c    921 g    359 t
ORIGIN      Chromosomes 11, 12, 13, 14, 15, 16, 17, 22, 23.
        1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg
       61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg
      121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga
      181 cccctccacc cgccctccct cccccgccgc ccctcctcct cctccccgcg gggagggggc
      241 gggctccggc gggtgcgggg gtgggcgggc ggggccgggg gtggggtcgg cgggggaccg
      301 tcccccgacc ggcgaccggc cgccgccggg cgcatttcca ccgcggcggt gcgccgcgac
      361 cggctccggg acggctggga aggcccggtg gggaaggtgg ctcggggggc cccgtccgtc
      421 cgtccgtccg tcctcctccc tcctcccccc tcgtcttccc cccggccccg cgtcctccct
      481 cgggagggcg cgcgggtcgg gggcggcggc gggggtggct gctgctgctg ctgcggcggc
      541 ggcgggaccg aaccccccga gtgttacagc cccggcagca gcgctcgccg aacccggggc
      601 cgagggagcg agacccgtcg ccgcgctctc ccccctcccg gcgcccaccc ccgcgggggt
      661 cccccgcgag ggggtccccc ccgcgggggc gcgccggcgt ctcctcgcgt ggggggccgg
      721 gccgcccctc ccacggcgcg accgctctcc cacccccccc ttccccgcgc acccccggcg
      781 acgggggccc gcgcgggcgg ggggggcggg gcggactgtc cccagtgcgc cccgggcggg
      841 tcgcgccgtc gggcccgggg aagagagagg gagaggaggg ggttctcctc ctcctcctcc
      901 cctctcgggg ccacgcgcgc gtccctcgaa gagggggacg gcggagccga gcgcacgggg
      961 tcggcggcga tgtcggccac ccacccgacc cgtcttgaaa cacggaccaa ggagtctaac
     1021 acgtgcgcga gtcgggggct cgcacgaaag ccgccgtggc gcaatgaagg tgaaggccgg
     1081 cgcgctcgcc ggccgaggtg ggatcccgag gcctctccag tccgccgagg gcgcaccacc
     1141 ggcccgtctc gcccgccgcg ccggggaggt ggagcacgag cgcacgtgtt aggacccgaa
     1201 agatggtgaa ctatgcctgg gcagggcgaa gccagaggaa actctggtgg aggtccgtag
     1261 cggtcctgac gtgcaaatcg gtcgtccgac ctgggtatag gggcgaaaga ctaatcgaac
     1321 catctagtag ctggttccct ccgaagtttc cctcaggata gctggcgctc tcgcagactc
     1381 gaccgaccga ccgcagtttt atccggtaaa gcgaatgatt agaggtcttg gggccgaaac
     1441 gatctcaacc tattctcaaa ctttaaatgg gtaagaagcc cggctcgctg gcgtggagcc
     1501 gggcgtggaa tgcgagtgcc tagtgggcca cttttggtaa gcagaactgg cgctgcggga
     1561 tgaaccgaac gccgggttaa ggcgcccgat gccgacgctc atcagacccc agaaaaggtg
     1621 ttggttgata tagacagcag gacggtggcc atggaagtcg gaatccgcta aggagtgtgt
     1681 aacaactcac ctgccgaatc aactagccct gaaaatggat ggcgctggag cgtcgggccc
     1741 atacccggcc gtcgccggca gtcgagagtg gacgggagcg gcgggggcgg ggtgcgtgcg
     1801 ggtgtggggg tgtgtgtggg ggggggtcct ccccccccgc cactcctcct cctcccaccc
     1861 ctcccccgga gcagccccgc ggacgctacg ccgcgacgag taggagggcc gctgcggtga
     1921 gccttgaagc ccagggcgcg ggcccgggtg gagccgccgc aggtgcagat cttggtggta
     1981 gtagcaaata ttcaaacgag aactttgaag gccgaagtgg agaagggttc catgtgaaca
     2041 gcagttgaac atgggtcagt cggtcctgag agatgggcga gcgccgttcc gaagggacgg
     2101 gcgatggcct ccgttgccct cggccgatcg aaagggagtc gggttcagat ccccgaatcc
     2161 ggagtggcgg agacgggcgc cgcgaggcgt ccagtgcggt aacgcgaccg atcccggaga
     2221 agccggcggg agccccgggg agagttctct tttctttgtg aagggcaggg cgccctggaa
     2281 tgggttcgcc ccgagagagg ggcccgtgcc ttggaaagcg tcgcggttcc ggcggcgtcc
     2341 ggtgagctct cgctggccct tgaaaatccg ggggagaggg tgtaaatctc gcgccgggcc
     2401 gtacccatat ccgcagcagg tctccaaggt gaacagcctc tggcatgttg gaacaatgta
     2461 ggtaagggaa gtcggcaagc cggatcc
//
LOCUS       ACCRRSAA     1536 bp ss-rRNA            RNA       03-JUL-1990
DEFINITION  A.calcoaceticus 16S ribosomal RNA.
ACCESSION   M34139
KEYWORDS    16S ribosomal RNA.
SOURCE      A.calcoaceticus (strain 33604) ribosomal RNA.
  ORGANISM  Acinetobacter calcoaceticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1536)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept. Microbiology
              131 Burrill Hall
              407 S. Goodwin Ave.
              Urbana, IL 61801
              (217) 333-9369
            
FEATURES       from  to/span     description
    rRNA          1     1536     16S ribosomal RNA
BASE COUNT      387 a    329 c    460 g    330 t     30 others
ORIGIN      
        1 ttaactgaag agtttgatca tggctcagat tgaacgctgg cggcaggctt aacacatgca
       61 agtcgagcgg ggaaggttgc ttcggtaact gactagcggc ggacgggtga gtaatgctta
      121 ggaatctgcc atttagtggg ggacaacatt ccgaanggaa tgctaatacc gcatacgtcc
      181 tacaggagaa agcaggggat ctccggacct tgcgctaaat gatgagccta agtcggatta
      241 gctagttggt ggggtaaagg cctaccaagg cgacgatctg tagcgggtct gagaggatga
      301 tccgccacac tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata
      361 ttggacaatg ggcgcaagcc ngatccagcc atgccgcgtg tgtgaagaag gccttttggt
      421 tgtaaagcac tttaagcgag gaggaggctc tcttagttaa tacctaagat gagtggacgt
      481 tactcgcaga ataagcaccg gctaactctg tgccagcagc cgcggtaata cagagngtgc
      541 gagcgttaat cggatttact gggcgtaaag cgtgcgtagg cggcttttta agtcggatgt
      601 gaaatccccg agcttaactt gggaattgca ttcgatactg ggaagctaga gtatgggaga
      661 ggatggtaga attccaggtg tagcggtgaa atgcgtagag atctggagga ataccgatgg
      721 cgaaggcagc catctggcct aatactgacg ctgaggtacg naagcatggg gagcaaacag
      781 gattagatac cctggtagtc catgccgtaa acgatgtcta ctagccgttg gggcctttga
      841 ggctttagtg gcgcagctaa cgcgataagt agactgcctg gggagtacgg tcgcaagact
      901 aaaactcaaa tgaattgacg ggggcncgca caagcggtgg agcatgtggt ttaattcgat
      961 gcaacgcgaa gaaccttacc tggccttgac atactagaaa ctttccagag atggattggt
     1021 gccttcggga atctagatac aggtgctgca tggctgtcgt cagctcgtgt cgtgagatgt
     1081 tgggttaagt cccgcaacga gcgcaaccct tttccttact tgccagcatt tcggatggga
     1141 actttaagga tactgccagt gacaaactgg aggaaggcgg ggacgacgtc aagtcatcat
     1201 ggcccttacg gctagggcta cacacgtgct acaatggtcg gtacaaaggg ttgctaccta
     1261 gcgataggat gctaatctca aaaagccgat cgtagttcgg attggagtct gcaactcgac
     1321 tccatgaagt cggaatcgct agtaatcgcg gatcagaatg ccgcggtgaa tacgttcccg
     1381 ggccttgtac acaccgcccg tcacaccatg ggagtttgtt gcaccagaag tagctagcct
     1441 aactgcaaag agggcggtta ccacggtgtg gccgatgact agggnnnnnn ngtaacaagn
     1501 nnnnnnnnnn ngaacctgnn nnnngatcac ctcctt
//
LOCUS       BDERRSAA     1553 bp ss-rRNA            RNA       03-JUL-1990
DEFINITION  B.stolpii 16S ribosomal RNA.
ACCESSION   M34125
KEYWORDS    16S ribosomal RNA.
SOURCE      B.stolpii (strain uki-2) ribosomal RNA.
  ORGANISM  Bdellovibrio stolpii
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 1 to 1553)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept Microbiology, 131
              131 Burrill Hall
              407 S. Goodwin Ave.
              Urbana, IL 61801
              (217) 333-9369
            
FEATURES       from  to/span     description
    rRNA          1     1553     16S ribosomal RNA
BASE COUNT      381 a    320 c    429 g    316 t    107 others
ORIGIN      
        1 agcatnnaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa
       61 gtcgaacgtg aaagtccttc gggatgagta aagtggcgca cgggtgagta acacgtaggt
      121 gacctgcctt ttagagggga ataaccagaa gaaattttgg ctaatgccgc atacgaagca
      181 cggttttaag actgtgcttg aaagaatgcc tctgcatatg ngcattcgct attagatggg
      241 cctgcgggac attagctagt tggtggggta aaggcctacc aaggcgacga tgtctatccg
      301 gtctgagagg atgatcggac acactggaac tgagacacgg tccagactcc tacgggaggc
      361 agcagtgggg aatattgcgc aatgggggaa accctgacgc agcaacgccg cgtgagtgag
      421 gaaggacttc ggtctgtaaa gctctgttaa tgtggaaaaa tggcagttgg tctaataggc
      481 cnattgtttg atggtacaca tagaggaagc accggctaac ttcgtgccag cagccgcggt
      541 aatacgaagg gtgcnagcgt tgttcggatt tattgggcgt aaagcgcgcg taggcggacc
      601 tgcaagtcag atgtgaaatc tcggggctca acctcgaaac tgcgtctgaa actacaggtc
      661 tagaatctcg gagggggaag gggaatatcg catgtagggg taaaatccgt agatatgcga
      721 tggaacacca gaggcgaagg cgccttcctg gacgagtatt gacgctgagg cncnnaagcg
      781 tggggatcaa acaggattag ataccctggt agtccacgct gtaaacgatg aacactagat
      841 attggaggat ttgacccctt cagtgtcgta gctaacgcgt caagtgttcc gcctgggaag
      901 tacggtcgca agactaaaac tcaaaggaat tgnnnnnnnn nngcacaagn nnnngattat
      961 gnngtttaat tcgnngcaac gcgcagaacc ttacctaggc ttgaaatcct acgaatccct
     1021 tttaaacgag ggagtgctct tcggagaatg tagtgacagg cgctgcatgg ctgtcgtcag
     1081 ctcgtgtcgt gagatgttgg gttaagtctc gcaacgagcg caacccccat ttttagttgc
     1141 cagcattaag ttgggcactc tagaaagact gcntgggcta accaggagga aggtggggat
     1201 gacgtcaagt cctcatggcc cttatgtcta gggctacaca cgtaatacaa tggtcggtac
     1261 aaagggatgc gaactcgcga gggggagcca atctcaaaaa accgatctca gtccggattg
     1321 gagtctgcaa ctcgactcca tgaagttgga atcgcgagta atcgcggatc agcacgccgc
     1381 ggtgaatacg ttcccgggcc ttgtacacac cgcccgtcac accatgggag ttgtttttac
     1441 ctgaagnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1501 nnnnnnngta acaagnnnnn nnnnnnngaa cctgnnnnnn gatcacctcc ttt
//
LOCUS       PLTRRSAA     1525 bp ss-rRNA            RNA       03-JUL-1990
DEFINITION  P.staleyi 16S ribosomal RNA.
ACCESSION   M34126
KEYWORDS    16S ribosomal RNA.
SOURCE      P.staleyi (strain ATCC 27377) ribosomal RNA.
  ORGANISM  Planctomyces staleyi
            Prokaryota; Bacteria; Eubacteriomycetes; Eubacteriales.
REFERENCE   1  (bases 1 to 1525)
  AUTHORS   Woese,C.R. and Oyalzu,H.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 15-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept Microbiology
              131 Burrill Hall
              407 S. Goodwin Ave.
              Urbana, IL 61801
              (217) 333-9369
            
FEATURES       from  to/span     description
    rRNA          1     1525     16S ribosomal RNA
BASE COUNT      376 a    348 c    482 g    315 t      4 others
ORIGIN      
        1 caattgaaga gtttgatcct ggctcagaat gaacgttggc ggcatggatt aggcatgcaa
       61 gtcgtgcgcg atatgtagca atacatggag agcggcgaaa gggagagtaa tacgtaggaa
      121 cctaccttcg ggtctgggat agcggcggga aactgccggt aataccagat gatgtttccg
      181 aaccaaaggt gtgattccgc ctgaagaggg gcctacgtcg tattagctag ttggtagggt
      241 aatggcctac caaggcaaag atgcgtatgg ggtgtgagag catgccccca ctcactggga
      301 ctgagacact gcccagacac ctacgggtgg ctgcagtcga gaatcttcgg caatgggcga
      361 aagcctgacc gagcgatgcc gcgtgcggga tgaaggcctt cgggttgtaa accgctgtcg
      421 taggggatga agtgctaggg ggttctccct ctagtttgag ctgaacctag gaggaagggc
      481 cggctaatct cgtgccanna gccgcggtaa tacgagaggc ccaaacgtta ttcggattta
      541 ctgggcttaa agagttcgta ggcggtcttg taagtggggt gtgaaatccc tcggctcaac
      601 cgaggaactg cgctccaaac tacaagactt gagggggata gaggtaagcg gaactgatgg
      661 tggagcggtg aaatgcgttg atatcatcag gaacaccgga ggcgaaggcg gcttactggg
      721 tcctttctga cgctgaggaa cgaaagctag gggagcaaac gggattagat accccggtag
      781 tcctagccgt aaacgatgag cactggaccg gagctctgca cagggtttcg gtcgtagcga
      841 aagtgttaag tgctccgcct ggggagtatg gtcgcaaggc tgaaactcaa aggaattgac
      901 gggggctcac acaagcggtg gaggatgtgg cttaattcga ggctacgcga agaaccttat
      961 cctagtcttg acatgcttag gaatcttcct gaaagggagg agtgctcgca agagagcctt
     1021 tgcacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgtcgggt taagtccctt
     1081 aacgagcgaa acccttgtcc ttagttacca gcgcgtcatg gcggggactc taaggagact
     1141 gccggtgtta aaccggagga aggtggggat gacgtcaagt cctcatggcc tttatgatta
     1201 gggctgcaca cgtcctacaa tggtgcacac aaagcgacgc aaactcgtga gagccagcta
     1261 atcgcaaaaa atgtacctca gttcggattg caggctgcaa ctcgcctgca tgaagctgga
     1321 atcgctagta atcgcgggtc agcataccgc ggtgaatntg ttcctgagcc ttgtacacac
     1381 cgcccntcaa gccacgaaag tgggggggac ccaacagcgc tgccgtaacc gcaaggaaca
     1441 aggcgcctaa ggtcaactcc gtgattggga ctaagtcgta acaaggtagc cgtaggggaa
     1501 cctgcggctg gatcacctcc tttct
//
LOCUS       RDCRRSAA     1478 bp ss-rRNA            BCT       03-JUL-1990
DEFINITION  R.purpureus 16S ribosomal RNA.
ACCESSION   M34132
KEYWORDS    16S ribosomal RNA.
SOURCE      R.purpureus (strain 6770) ribosomal RNA.
  ORGANISM  Rhodocyclus purpureus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 1478)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept. Microbiology
              131 Burrill Hall
              407 S. Goodwin Ave.
              Urbana, IL 61801
              (217) 333-9369
            
FEATURES       from  to/span     description
    rRNA          1     1478     16S ribosomal RNA
BASE COUNT      367 a    345 c    470 g    289 t      7 others
ORIGIN      
        1 tgaactgaag agtttgatcc tggctcagat tgaacgctgg cggcatgcct tacacatgca
       61 agtcgaacgg taacgggncc ttcgggcgcc gaacgagtgg cgaacgggtg agtaatgcat
      121 cggaacatgc cctgaagtgg gggataacgt agcgaaagtt acgctaatac cgcatattct
      181 gtgagcagga aagcagggga ccttcgggcc ttgcgctttg ggagtggccg atgtcggatt
      241 agctagttgg tggggtaaaa gcctaccaag gcaacgatcc gtagcgggtc tgagaggatg
      301 atccgccaca ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat
      361 tttggacaat gggcgaaagc ctgatccagc catgccgcgt gagtgaagaa ggccttcggg
      421 ttgtaaagct ctttcggcgg ggaagaaatc gggtttccta atacggaacc cggatgacgg
      481 tacccgaaga agaagcaccg gctaactacg tgccagcagc cgcggtaata cgtagggtgc
      541 nagcgttaat cggaattact gggcgtaaag cgtgcgcagg cggttgtgta agacagacgt
      601 gaaatccccg ggctcaacct gggaactgcg tttgtgactg cacagctaga gtacggcaga
      661 ggggggtgga attccacgtg tagcagtgaa atgcgtagag atgtggagga acaccgatgg
      721 cgaaggcagc cccctgggcc aatactgacg ctcatgcacg naagcgtggg gagcaaacag
      781 gattagatac cctggtagtc cacgccctaa acgatgtcaa ctaggtgttg gtggggttaa
      841 acccattagt gccgtagcta acgcgtgaag ttgaccgcct ggggagtacg gcggcaaggt
      901 taaaactcaa aggaattgac gggganccgc acaagcggtg gatgatgtgg attaattcga
      961 tgcaacgcga aaaaccttac ctacccttga catgtcagga atcctgagga gactcgggag
     1021 tgcccgaaag ggnacctgaa cacaggtgct gcatggcngt cgtcagctcg tgtcgtgaga
     1081 tgttgggtta agtcccgcaa cgagcgcaac ccttgtcatt aattgccatc attcagttgg
     1141 gcactttaat gaaactgccg gtgacaaacc ggaggaaggt ggggatgacg tcaagtcctc
     1201 atggccctta tgggtagggc ttcacacgtc atacaatggt cggtccatag ggttgcnaac
     1261 ccgcgagggg gagctaatcc cagaaagccg atcgtagtcc ggattgcagt ctgcaactcg
     1321 actgcatgaa gtcggaatcg ctagtaatcg cggatcagca tgtcgcggtg aatacgttcc
     1381 cgggtcttgt acacaccgcc cgtcacacca tgggagcggg ttctgccaga agtagttagc
     1441 ctaaccgcaa ggagggcgat taccacggca gcgttcgt
//
LOCUS       HUMFGF2H     3365 bp ss-mRNA            PRI       03-JUL-1990
DEFINITION  Human fibroblast growth factor receptor (FGFr) transmembrane form
            mRNA, complete cds.
ACCESSION   M34185
KEYWORDS    FGF receptor; fibroblast growth factor receptor;
            transmembrane tyrosine kinase.
SOURCE      Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
            clone h2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3365)
  AUTHORS   Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
  TITLE     Diverse forms of a receptor for acidic and basic fibroblast growth
            factors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.E.Johnson, 10-MAY-1990.
            
              Author address: D.E.Johnson
              University of California San Francisco
              4th and Parnassus
              Howard Hughes Medical Institute
              San Francisco, CA 94143
              (415) 476-4297
            
FEATURES       from  to/span     description
    pept        256     2457     fibroblast growth factor receptor (FGFr)
                                 transmembrane form
BASE COUNT      786 a    962 c    917 g    700 t
ORIGIN      
        1 gcaccgagcg ccgccgggag tcgagcgccg gccgcggagc tcttgcgacc ccgccaggac
       61 ccgaacagag cccgggggcg gcgggccgga gccggggacg cgggcacacg cccgctcgca
      121 caagccacgg cggactctcc cgaggcggaa cctccacgcc gagcgagggt cagtttgaaa
      181 aggaggatcg agctcactgt ggagtatcca tggagatgtg gagccttgtc accaacctct
      241 aactgcagaa ctgggatgtg gagctggaag tgcctcctct tctgggctgt gctggtcaca
      301 gccacactct gcaccgctag gccgtccccg accttgcctg aacaagatgc tctcccctcc
      361 tcggaggatg atgatgatga tgatgactcc tcttcagagg agaaagaaac agataacacc
      421 aaaccaaacc gtatgcccgt agctccatat tggacatccc cagaaaagat ggaaaagaaa
      481 ttgcatgcag tgccggctgc caagacagtg aagttcaaat gcccttccag tgggacccca
      541 aaccccacac tgcgctggtt gaaaaatggc aaagaattca aacctgacca cagaattgga
      601 ggctacaagg tccgttatgc cacctggagc atcataatgg actctgtggt gccctctgac
      661 aagggcaact acacctgcat tgtggagaat gagtacggca gcatcaacca cacataccag
      721 ctggatgtcg tggagcggtc ccctcaccgg cccatcctgc aagcagggtt gcccgccaac
      781 aaaacagtgg ccctgggtag caacgtggag ttcatgtgta aggtgtacag tgacccgcag
      841 ccgcacatcc agtggctaaa gcacatcgag gtgaatggga gcaagattgg cccagacaac
      901 ctgccttatg tccagatctt gaagactgct ggagttaata ccaccgacaa agagatggag
      961 gtgcttcact taagaaatgt ctcctttgag gacgcagggg agtatacgtg cttggcgggt
     1021 aactctatcg gactctccca tcactctgca tggttgaccg ttctggaagc cctggaagag
     1081 aggccggcag tgatgacctc gcccctgtac ctggagatca tcatctattg cacaggggcc
     1141 ttcctcatct cctgcatggt ggggtcggtc atcgtctaca agatgaagag tggtaccaag
     1201 aagagtgact tccacagcca gatggctgtg cacaagctgg ccaagagcat ccctctgcgc
     1261 agacaggtaa cagtgtctgc tgactccagt gcatccatga actctggggt tcttctggtt
     1321 cggccatcac ggctctcctc cagtgggact cccatgctag caggggtctc tgagtatgag
     1381 cttcccgaag accctcgctg ggagctgcct cgggacagac tggtcttagg caaacccctg
     1441 ggagagggct gctttgggca ggtggtgttg gcagaggcta tcgggctgga caaggacaaa
     1501 cccaaccgtg tgaccaaagt ggctgtgaag atgttgaagt cggacgcaac agagaaagac
     1561 ttgtcagacc tgatctcaga aatggagatg atgaagatga tcgggaagca taagaatatc
     1621 atcaacctgc tgggggcctg cacgcaggat ggtcccttgt atgtcatcgt ggagtatgcc
     1681 tccaagggca acctgcggga gtacctgcag gcccggaggc ccccagggct ggaatactgc
     1741 tacaacccca gccacaaccc agaggagcag ctctcctcca aggacctggt gtcctgcgcc
     1801 taccaggtgg cccgaggcat ggagtatctg gcctccaaga agtgcataca ccgagacctg
     1861 gcagccagga atgtcctggt gacagaggac aatgtgatga agatagcaga ctttggcctc
     1921 gcacgggaca ttcaccacat cgactactat aaaaagacaa ccaacggccg actgcctgtg
     1981 aagtggatgg cacccgaggc attatttgac cggatctaca cccaccagag tgatgtgtgg
     2041 tctttcgggg tgctcctgtg ggagatcttc actctgggcg gctccccata ccccggtgtg
     2101 cctgtggagg aacttttcaa gctgctgaag gagggtcacc gcatggacaa gcccagtaac
     2161 tgcaccaacg agctgtacat gatgatgcgg gactgctggc atgcagtgcc ctcacagaga
     2221 cccaccttca agcagctggt ggaagacctg gaccgcatcg tggccttgac ctccaaccag
     2281 gagtacctgg acctgtccat gcccctggac cagtactccc ccagctttcc cgacacccgg
     2341 agctctacgt gctcctcagg ggaggattcc gtcttctctc atgagccgct gcccgaggag
     2401 ccctgcctgc cccgacaccc agcccagctt gccaatggcg gactcaaacg ccgctgactg
     2461 ccacccacac gccctcccca gactccaccg tcagctgtaa ccctcaccca cagcccctgc
     2521 tgggcccacc acctgtccgt ccctgtcccc tttcctgctg gcaggagccg gctgcctacc
     2581 aggggccttc ctgtgtggcc tgccttcacc ccactcagct cacctctccc tccacctcct
     2641 ctccacctgc tggtgagagg tggcaaagag gcagatcttt gctgccagcc acttcatccc
     2701 ctcccagatg ttggaccaac acccctccct gccaccaggc actgcctgga gggcagggag
     2761 tgggagccaa tgaacaggca tgcaagtgag agcttcctga gctttctcct gtcggtttgg
     2821 tctgttttgc cttcacccat aagcccctcg cactctggtg gcaggtgcct tgtcctcagg
     2881 gctacagcag tagggaggtc agtgcttcgt gcctcgattg aaggtgacct ctgccccaga
     2941 taggtggtgc cagtggctta ttaattccga tactagtttg ctttgctgac caaatgcctg
     3001 gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc ctggggccca
     3061 gccccaaact gggggctctg tatatagcta tgaagaaaac acaaagtgta taaatctgag
     3121 tatatattta catgtctttt taaaagggtc gttaccagag atttacccat cgggtaagat
     3181 gctcctggtg gctgggaggc atcagttgct atatattaaa aacaaaaaag aaaaaaaagg
     3241 aaaacgtttt taaaaaggtc atatattttt tgctactttt gctgttttat ttttttaaat
     3301 tatgttctaa acctattttc agtttaggtc cctcaataaa aattgctgct gcttcaaaaa
     3361 aaaaa
//
LOCUS       HUMFGF3H     3503 bp ss-mRNA            PRI       03-JUL-1990
DEFINITION  Human fibroblast growth factor receptor (FGFr) transmembrane form
            mRNA, complete cds.
ACCESSION   M34186
KEYWORDS    FGF receptor; fibroblast growth factor receptor;
            transmembrane tyrosine kinase.
SOURCE      Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
            clone h3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3503)
  AUTHORS   Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
  TITLE     Diverse forms of a receptor for acidic and basic fibroblast growth
            factors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.E.Johnson, 10-MAY-1990.
            
              Author address: D.E.Johnson
              University of California San Francisco
              4th and Parnassus
              Howard Hughes Medical Institute
              San Francisco, CA 94143
              (415) 476-4297
            
FEATURES       from  to/span     description
    pept        527     2722     fibroblast growth factor receptor (FGFr)
                                 transmembrane form
BASE COUNT      777 a   1044 c   1005 g    677 t
ORIGIN      
        1 gcggaaccca aggacttttc tccggtccga gctcggggcg ccccgcaccg ggacggtacc
       61 cgtgctgcag tcgggcacgc cgcgggcccg ccgggggcct ccgcagggcg atggagccgg
      121 tctgcaagga aagtgaggcg ccgccgctgc gttctggagg aggggggcac aaggtctgga
      181 gaccccgggt ggcggacggg agccctcccc ccgccccgcc tccggggcac cagctccggc
      241 tccattgttc ccgcccgggc tggaggcgcc gagcaccgag cgccgccggg agtcgagcgc
      301 cggccgcgga gtcttgcgac cccgccagga cccgaacaga gcccgggggc ggcgggccgg
      361 agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc ccgaggcgga
      421 acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg tggagtatcc
      481 atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt ggagctggaa
      541 gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta ggccgtcccc
      601 gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc
      661 ctcttcagag gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac
      721 atccccagaa aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt
      781 caaatgccct tccagtggga ccccaaaccc cacactgcgc tggttggaaa atggcaaaga
      841 attcaaacct gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat
      901 aatggactct gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta
      961 cggcagcatc aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat
     1021 cctgcaagca gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat
     1081 gtgtaaggtg tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa
     1141 tgggagcaag attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt
     1201 taataccacc gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc
     1261 aggggagtat acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt
     1321 gaccgttctg gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga
     1381 gatcatcatc tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt
     1441 ctacaagatg aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa
     1501 gctggccaag agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc
     1561 catgaactct ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat
     1621 gctagcaggg gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga
     1681 cagactggtc ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga
     1741 ggctatcggg ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt
     1801 gaagtcggac gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa
     1861 gatgatcggg aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc
     1921 cttgtatgtc atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg
     1981 gaggccccca gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc
     2041 ctccaaggac ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc
     2101 caagaagtgc atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt
     2161 gatgaagata gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa
     2221 gacaaccaac ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat
     2281 ctacacccac cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct
     2341 gggcggctcc ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg
     2401 tcaccgcatg gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg
     2461 ctggcatgca gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg
     2521 catcgtggcc ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta
     2581 ctcccccagc tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt
     2641 ctctcatgag ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa
     2701 tggcggactc aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc
     2761 tgtaaccctc acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc
     2821 tgctggcagg agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact
     2881 cagctcacct ctccctccac ctcctctcca cctgctggtg agaggtggca aagaggcaga
     2941 tcttttcact gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc
     3001 accaggcact gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc
     3061 ttcctgagct ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac
     3121 tctggtggca ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc
     3181 tcgattgaag gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac
     3241 tagtttgctt tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg
     3301 ggggcagtgt tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga
     3361 agaaaacaca aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt
     3421 accagagatt tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata
     3481 tattaaaaac aaaaaaaaaa aaa
//
LOCUS       HUMFGF4H     2283 bp ss-mRNA            PRI       03-JUL-1990
DEFINITION  Human fibroblast growth factor receptor (FGFr) transmembrane form
            mRNA, complete cds.
ACCESSION   M34187
KEYWORDS    FGF receptor; fibroblast growth factor receptor;
            transmembrane tyrosine kinase.
SOURCE      Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
            clone h4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2283)
  AUTHORS   Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
  TITLE     Diverse forms of a receptor for acidic and basic fibroblast growth
            factors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.E.Johnson, 10-MAY-1990.
            
              Author address: D.E.Johnson
              University of California San Francisco
              4th and Parnassus
              Howard Hughes Medical Institute
              San Francisco, CA 94143
              (415) 476-4297
            
FEATURES       from  to/span     description
    pept        417     1325     fibroblast growth factor receptor (FGFr)
                                 transmembrane form
BASE COUNT      566 a    621 c    612 g    484 t
ORIGIN      
        1 ggagcccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga ggggggcaca
       61 aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct ccggggcacc
      121 agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc gccgccggga
      181 gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga gcccgggggc
      241 ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc
      301 ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg
      361 tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt
      421 ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta
      481 ggccgtcccc gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg
      541 atgatgactc ctcttcagag gagaaagaaa cagataacac caaaccaaac cgtatgcccg
      601 tagctccata ttggacatcc ccagaaaaga tggaaaagaa attgcatgca gtgccggctg
      661 ccaagacagt gaagttcaaa tgcccttcca gtgggacccc aaaccccaca ctgcgctggt
      721 tgaaaaatgg caaagaattc aaacctgacc acagaattgg aggctacaag gtccgttatg
      781 ccacctggag catcataatg gactctgtgg tgccctctga caagggcaac tacacctgca
      841 ttgtggagaa tgagtacggc agcatcaacc acacatacca gctggatgtc gtggagcggt
      901 cccctcaccg gcccatcctg caagcagggt tgcccgccaa caaaacagtg gccctgggta
      961 gcaacgtgga gttcatgtgt aaggtgtaca gtgacccgca gccgcacatc cagtggctaa
     1021 agcacatcga ggtgaatggg agcaagattg gcccagacaa cctgccttat gtccagatct
     1081 tgaaggtaat catggcacca gtcttcgtgg gccagtctac tgggaaggag accactgtct
     1141 cgggggctca agttcctgtg ggcaggctca gttgcccccg aatgggatca ttcctcacgc
     1201 ttcaggcaca cacactccat ctcagtaggg atctagccac atcccccagg actagtaaca
     1261 gaggtcacaa agtggaggtg agctgggaac agagggctgc agggatgggt ggtgctggtc
     1321 tgtaataagc tttgagagca acgtcactgg ggctttgggg tcagctacac aaggaaggca
     1381 tttggacccc tgccttttca ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt
     1441 cttagccctg tgttctgagt tacgtacgat ctttctggca aatggggtgc atgataagag
     1501 catctcttac gaagagttgg aaaaacaaat gccatatata aattctaagc catatgagga
     1561 cgaggagtaa tggcattttc ttcctttttc ctctcactcc cagacattca ttgtccctga
     1621 atgctccatt aatccaggga aggtaattgc ctaaatctcc agtggatctc gcaacaggaa
     1681 ggaaccagaa gctgggaaag ttgtttacct ctttgtccca gagttagacc tcatcctccc
     1741 ctagcttagc tgtctcagag atatactggc cctcccttct cttctctttg ctgctggtgc
     1801 taaaactgct ctgtaggtca ttggccactg tctccactca caacccctgc tccagtcctg
     1861 gagggagtgg gttaaacaca aatagaacat tccatttgaa gcagtgattc tttttttttt
     1921 tttttttttt taatcaaatg ctttggactt ttgaagtcca cttgttctgt acttgtaaaa
     1981 gggaaagaag gccgggcgca gtcgtcacgc ctgtaatccc agcactttag atcacttgag
     2041 gtcaggagtt tgagaccagc ccggccaaca tggtgaaacc ccatctctac taaaaataca
     2101 aaaattagct gtgcatagtg gttggcacct gtagtcccag ctactcagga ggctgaggca
     2161 agctaactgc ttgaacccag aaggcagagg ttgcagtgag ctgagatcac gccactgcac
     2221 tccagcctgg gtgacagagt gagtgagact ctgcgttaaa aaaaaaaaaa aaaaaaaaaa
     2281 aaa
//
LOCUS       HUMFGF5H     1625 bp ss-mRNA            PRI       03-JUL-1990
DEFINITION  Human fibroblast growth factor receptor (FGFr) secreted form mRNA,
            complete cds.
ACCESSION   M34188
KEYWORDS    FGF receptor; fibroblast growth factor receptor.
SOURCE      Human female placenta endothelial cell line HUVEC, cDNA to mRNA,
            clone h5.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1625)
  AUTHORS   Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
  TITLE     Diverse forms of a receptor for acidic and basic fibroblast growth
            factors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.E.Johnson, 10-MAY-1990.
            
              Author address: D.E.Johnson
              University of California San Francisco
              4th and Parnassus
              Howard Hughes Medical Institute
              San Francisco, CA 94143
              (415) 476-4297
            
FEATURES       from  to/span     description
    pept        523     1425     fibroblast growth factor receptor (FGFr)
                                 transmembrane form
BASE COUNT      368 a    480 c    489 g    288 t
ORIGIN      
        1 cggaacccaa ggacttttct ccggtccgag ctcggggcgc cccgcaggcg acggtacccg
       61 tgctgcagtc gggcacgccg cgggcccggg gcctccgcag ggcgatggag cccggtctgc
      121 aaggaaagtg aggcgccgcc gctgcgttct ggaggagggg ggcacaaggt ctggagaccc
      181 cgggtggcgg acgggagccc tccccccgcc ccgcctccgg ggcaccagct ccggctccat
      241 tgttcccgcc cgggctggag gcgccgagca ccgagcgccg ccgggagtcg agcgccggcc
      301 gcggagctct tgcgaccccg ccaggacccg aacagagccc gggggcggcg ggccggagcc
      361 ggggacgcgg gcacacgccc gctcgcacaa gccacggcgg actctcccga ggcggaacct
      421 ccacgccgag cgagggtcag tttgaaaagg aggatcgagc tcactgtgga gtatccatgg
      481 agatgtggag ccttgtcacc aacctctaac tgcagaactg ggatgtggag ctggaagtgc
      541 ctcctcttct gggctgtgct ggtcacagcc acactctgca ccgctaggcc gtccccgacc
      601 ttgcctgaac aagatgctct cccctcctcg gaggatgatg atgatgatga tgactcctct
      661 tcagaggaga aagaaacaga taacaccaaa ccaaaccccg tagctccata ttggacatcc
      721 ccagaaaaga tggaaaagaa attgcatgca gtgccggctg ccaagacagt gaagttcaaa
      781 tgcccttcca gtgggacccc aaaccccaca ctgcgctggt tgaaaaatgg caaagaattc
      841 aaacctgacc acagaattgg aggctacaag gtccgttatg ccacctggag catcataatg
      901 gactctgtgg tgccctctga caagggcaac tacacctgca ttgtggagaa tgagtacggc
      961 agcatcaacc acacatacca gctggatgtc gtggagcggt cccctcaccg gcccatcctg
     1021 caagcagggt tgcccgccaa caaaacagtg gccctgggta gcaacgtgga gttcatgtgt
     1081 aaggtgtaca gtgacccgca gccgcacatc cagtggctaa agcacatcga ggtgaatggg
     1141 agcaagattg gcccagacaa cctgccttat gtccagatct tgaaggtaat catggcacca
     1201 gtcttcgtgg gccagtctac tgggaaggag accactgtct cgggggctca agttcctgtg
     1261 ggcaggctca gttgcccccg aatgggatca ttcctcacgc ttcaggcaca cacactccat
     1321 ctcagtaggg atctagccac atcccccagg actagtaaca gaggtcacaa agtggaggtg
     1381 agctgggaac agagggctgc agggatgggt ggtgctggtc tgtaataagc tttgagagca
     1441 acgtcactgg ggctttgggg tcagctacac aaggaaggca tttggacccc tgccttttca
     1501 ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt cttagccctg tgtcctgagt
     1561 tacgtacgat ctttctggca aatggggtgc atgataagag catctcttac gaagagttgg
     1621 aaaaa
//
LOCUS       SYNLACZA     6476 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD1.27.
ACCESSION   M34296
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD1.27.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6476)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3500     3501     E.coli lacZ end/SV40 start
    recomb     3634     3635     SV40 end/synthetic start
    recomb     3657     3658     synthetic end/pUC19 start
    recomb     4076     4077     pUC19 end/C.elegans sup-7 start
    recomb     4442     4443     C.elegans sup-7 end/pUC19 start
BASE COUNT     1560 a   1646 c   1676 g   1594 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc agctgagcgc cggtcgctac cattaccagt tggtctggtg tcaaaaataa
     3241 taataaccgg gcaggccatg tctgcccgta tttcgcgtaa ggaaatccat tatgtactat
     3301 ttaaaaaaca caaacttttg gatgttcggt ttattctttt tcttttactt ttttatcatg
     3361 ggagcctact tcccgttttt cccgatttgg ctacatgaca tcaaccatat cagcaaaagt
     3421 gatacgggta ttatttttgc cgctatttct ctgttctcgc tattattcca accgctgttt
     3481 ggtctgcttt ctgacaaact cggaacttgt ttattgcagc ttataatggt tacaaataaa
     3541 gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt
     3601 tgtccaaact catcaatgta tcttatcatg tctggatcga caaagtcaaa gcggccgcct
     3661 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct
     3721 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc
     3781 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt
     3841 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa
     3901 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
     3961 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
     4021 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa
     4081 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc
     4141 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt
     4201 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt
     4261 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt
     4321 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac
     4381 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg
     4441 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt
     4501 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc
     4561 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat
     4621 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct
     4681 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca
     4741 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg
     4801 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa
     4861 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg
     4921 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga
     4981 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg
     5041 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt
     5101 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg
     5161 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc
     5221 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca
     5281 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc
     5341 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat
     5401 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc
     5461 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg
     5521 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct
     5581 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct
     5641 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct
     5701 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg
     5761 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc
     5821 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga
     5881 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg
     5941 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
     6001 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg
     6061 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg
     6121 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat
     6181 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc
     6241 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc
     6301 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa
     6361 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc
     6421 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS       SYNLACZB     6096 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD8.02.
ACCESSION   M34297
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD8.02.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6096)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     3275     3276     synthetic end/unknown DNA start
    recomb     3696     3697     unknown DNA end/C.elegans sup-7 end
    recomb     4062     4063     C.elegans sup-7 end/pUC19 start
BASE COUNT     1458 a   1571 c   1616 g   1451 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggcctact agtcggccgt acgggccctt aaggccgcct gatgcggtat tttctcctta
     3301 cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg
     3361 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt
     3421 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc
     3481 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat
     3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg
     3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc
     3661 tcatgagaca ataaccctga taaatgcttc aataatacaa ttttcagaat acgttttttg
     3721 tgggcttggg tatattgttt ttaatgttat acttgcagtc gtgaaatttg attttcaaat
     3781 ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt gtcaaaaacc aatttcaaca
     3841 aattttcgtg tgagaaatac attaccagaa ggcatttttt cacacgatta gcattttgga
     3901 ctactttatt aaatttttgc gtgtaatttt gaattaaatt gtattatatt actacttaaa
     3961 aaacaaaaaa tttgaccact gagcggatcg aacgcccaac ctttcgatct agagtcgaac
     4021 gcgctaccat tgcgccaagc agtcatgtta ttctctcttg tcattgaaaa aggaagagta
     4081 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg
     4141 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac
     4201 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg
     4261 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc
     4321 gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg
     4381 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat
     4441 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg
     4501 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg
     4561 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc
     4621 ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt
     4681 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct
     4741 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc
     4801 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca
     4861 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct
     4921 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt
     4981 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga
     5041 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca
     5101 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac
     5161 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg
     5221 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag
     5281 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac
     5341 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt
     5401 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg
     5461 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc
     5521 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc
     5581 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc
     5641 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa
     5701 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt
     5761 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg
     5821 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag
     5881 agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc
     5941 acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc
     6001 tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa
     6061 ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS       SYNLACZC     7376 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD8.33.
ACCESSION   M34298
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD8.33.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 7376)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     3244     3245     synthetic end/unknown DNA start
    recomb     3524     3425     unknown DNA end/C.elegans sup-7 end
    recomb     4555     4556     synthetic end/pUC19 start
    recomb     4976     4977     pUC19 end/C.elegans sup-7 start
    recomb     5342     5343     C.elegans sup-7 end/pUC19 start
BASE COUNT     1863 a   1793 c   1814 g   1906 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat
     3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat
     3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac
     3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg
     3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca
     3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca
     3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc
     3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt
     3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg
     3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa
     3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga
     3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa
     3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa
     4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg
     4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg
     4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa
     4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa
     4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc
     4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta
     4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa
     4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca
     4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt aaggccgcct
     4561 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct
     4621 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc
     4681 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt
     4741 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa
     4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
     4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
     4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa
     4981 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc
     5041 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt
     5101 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt
     5161 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt
     5221 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac
     5281 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg
     5341 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt
     5401 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc
     5461 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat
     5521 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct
     5581 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca
     5641 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg
     5701 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa
     5761 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg
     5821 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga
     5881 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg
     5941 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt
     6001 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg
     6061 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc
     6121 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca
     6181 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc
     6241 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat
     6301 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc
     6361 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg
     6421 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct
     6481 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct
     6541 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct
     6601 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg
     6661 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc
     6721 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga
     6781 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg
     6841 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
     6901 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg
     6961 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg
     7021 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat
     7081 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc
     7141 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc
     7201 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa
     7261 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc
     7321 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS       SYNLACZD     5730 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD16.43.
ACCESSION   M34299
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD16.43.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 5730)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     3264     3265     synthetic end/unknown DNA start
    recomb     3513     3514     unknown DNA end/synthetic start
    recomb     3524     3525     synthetic end/pUC19 start
BASE COUNT     1348 a   1488 c   1580 g   1314 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggcctact agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga
     3301 aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg
     3361 gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa
     3421 ctatgcggca tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca
     3481 cagatgcgta aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat
     3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg
     3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc
     3661 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta
     3721 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg
     3781 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg
     3841 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac
     3901 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg
     3961 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt
     4021 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg
     4081 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac
     4141 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt
     4201 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag
     4261 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc
     4321 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc
     4381 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta
     4441 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg
     4501 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga
     4561 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac
     4621 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa
     4681 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat
     4741 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc
     4801 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg
     4861 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc
     4921 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg
     4981 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg
     5041 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa
     5101 cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg
     5161 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga
     5221 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct
     5281 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca
     5341 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc
     5401 ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg
     5461 ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc
     5521 caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca
     5581 ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc
     5641 attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga
     5701 gcggataaca atttcacaca ggaaacagct
//
LOCUS       SYNLACZE     7010 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD16.51.
ACCESSION   M34300
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD16.51.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 7010)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     4544     4545     synthetic end/pUC19 start
    recomb     4793     4794     pUC19 end/synthetic start
    recomb     3244     3245     synthetic end/unknown DNA start
    recomb     4524     4525     unknown DNA end/synthetic start
    recomb     4804     4805     synthetic end/pUC19 start
BASE COUNT     1753 a   1710 c   1778 g   1769 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat
     3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat
     3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac
     3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg
     3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca
     3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca
     3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc
     3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt
     3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg
     3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa
     3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga
     3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa
     3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa
     4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg
     4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg
     4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa
     4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa
     4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc
     4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta
     4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa
     4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca
     4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt tcgtctcgcg
     4561 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct
     4621 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc
     4681 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat
     4741 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcggccttaa
     4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
     4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
     4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg
     4981 aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc
     5041 attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga
     5101 tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga
     5161 gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg
     5221 cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc
     5281 tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac
     5341 agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact
     5401 tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca
     5461 tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg
     5521 tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact
     5581 acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg
     5641 accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg
     5701 tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat
     5761 cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc
     5821 tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat
     5881 actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt
     5941 tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc
     6001 cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt
     6061 gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac
     6121 tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt
     6181 gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct
     6241 gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga
     6301 ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac
     6361 acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagcattg
     6421 agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt
     6481 cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc
     6541 tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg
     6601 gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc
     6661 ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc
     6721 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag
     6781 cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca
     6841 ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat
     6901 taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg
     6961 tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct
//
LOCUS       SYNLACZF     7088 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD18.32.
ACCESSION   M34301
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD18.32.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 7088)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      180      181     synthetic end/E.coli trpS start
    recomb      264      265     E.coli trpS end/synthetic start
    recomb      268      269     synthetic end/E.coli lacZ start
    recomb     3262     3263     E.coli lacZ end/synthetic start
    recomb     4622     4623     synthetic end/pUC19 start
    recomb     4871     4872     pUC19 end/synthetic start
    recomb     3322     3323     synthetic end/unknown DNA start
    recomb     4602     4603     unknown DNA end/synthetic start
    recomb     4882     4883     synthetic end/pUC19 start
BASE COUNT     1780 a   1724 c   1797 g   1787 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
      181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
      241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
      301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
      361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
      421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
      481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
      541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
      601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
      661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
      721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
      781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
      841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
      901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
      961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
     1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
     1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
     1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
     1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
     1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
     1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
     1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
     1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
     1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
     1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
     1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
     1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
     1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
     1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
     1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
     1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
     1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
     2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
     2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
     2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
     2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
     2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
     2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
     2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
     2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
     2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
     2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
     2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
     2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
     2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
     2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
     2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
     2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
     3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
     3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
     3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
     3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
     3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
     3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct
     3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc
     3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc
     3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata
     3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta
     3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac
     3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg
     3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa
     3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc
     3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt
     3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg
     3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta
     4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttaaa
     4081 attaaacatt tttctaaatt ttatatgatt tcttttaaat ttgcaaaaat tacttaaatt
     4141 tgaattcccg cgcaaatgag tgacttcatt ttctgcatta ttgtgttttc cggctatatt
     4201 aataggtatt tgtttgtgtt tttctttatt ttatgattcg aactccaatt tgtaaatttt
     4261 cgaacatatt tccctaaaga aaaaatatga ttaatctgga aaaattggaa aattattttt
     4321 caaataaaaa acaaagaaaa aaatgaagaa aaacctatta gtttggccat aaaacgcaaa
     4381 aatgtcgaaa atgacgtcac tcatctgcgc gggaaatcaa gaataattcg gcctttttta
     4441 tttttttgga aaatcgtaaa acatttagaa aaatttttta atagttatag tgggactgta
     4501 ttctgtcatt tagggcaaaa gccagagacg ctactccacc gttaacatga attatgaata
     4561 ttattgcgac aagacccaaa cattgataaa ccgcaaatct agcctactag tcggccgtac
     4621 gggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc
     4681 ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc
     4741 gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc agagcagatt
     4801 gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac
     4861 cgcatcaggc ggccttaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat
     4921 aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat
     4981 ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata
     5041 aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct
     5101 tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa
     5161 agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa
     5221 cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt
     5281 taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg
     5341 tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca
     5401 tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa
     5461 cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt
     5521 gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc
     5581 cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa
     5641 actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga
     5701 ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc
     5761 tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga
     5821 tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga
     5881 acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga
     5941 ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat
     6001 ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt
     6061 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct
     6121 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc
     6181 ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc
     6241 aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc
     6301 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc
     6361 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg
     6421 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata
     6481 cctacagcgt gagcattgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta
     6541 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc
     6601 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg
     6661 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt
     6721 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt
     6781 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga
     6841 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc
     6901 cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg
     6961 cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca
     7021 ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg
     7081 aaacagct
//
LOCUS       SYNLACZG     6563 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD21.28.
ACCESSION   M34302
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD21.28.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6563)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      180      181     synthetic end/E.coli trpS start
    recomb      264      265     E.coli trpS end/synthetic start
    recomb      268      269     synthetic end/E.coli lacZ start
    recomb     3262     3263     E.coli lacZ end/synthetic start
    recomb     4097     4098     synthetic end/pUC19 start
    recomb     4346     4347     pUC19 end/synthetic start
    recomb     3322     3323     synthetic end/unknown DNA start
    recomb     4077     4078     unknown DNA end/synthetic start
    recomb     4357     4358     synthetic end/pUC19 start
BASE COUNT     1587 a   1650 c   1723 g   1603 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
      181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
      241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
      301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
      361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
      421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
      481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
      541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
      601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
      661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
      721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
      781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
      841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
      901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
      961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
     1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
     1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
     1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
     1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
     1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
     1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
     1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
     1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
     1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
     1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
     1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
     1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
     1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
     1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
     1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
     1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
     1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
     2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
     2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
     2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
     2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
     2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
     2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
     2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
     2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
     2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
     2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
     2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
     2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
     2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
     2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
     2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
     2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
     3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
     3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
     3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
     3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
     3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
     3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct
     3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc
     3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc
     3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata
     3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta
     3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac
     3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg
     3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa
     3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc
     3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt
     3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg
     3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta
     4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttcct
     4081 actagtcggc cgtacgggcc ctttcgtctc gcgcgtttcg gtgatgacgg tgaaaacctc
     4141 tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga
     4201 caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct taactatgcg
     4261 gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc
     4321 gtaaggagaa aataccgcat caggcggcct taagggcctc gtgatacgcc tatttttata
     4381 ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt
     4441 gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag
     4501 acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca
     4561 tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc
     4621 agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat
     4681 cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc
     4741 aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg
     4801 gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc
     4861 agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat
     4921 aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga
     4981 gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc
     5041 ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc
     5101 aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt
     5161 aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc
     5221 tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc
     5281 agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca
     5341 ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca
     5401 ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt
     5461 ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta
     5521 acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg
     5581 agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc
     5641 ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag
     5701 cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa
     5761 gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc
     5821 cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc
     5881 gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta
     5941 caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc ccgaagggag
     6001 aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct
     6061 tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga
     6121 gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc
     6181 ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt
     6241 atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg
     6301 cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg
     6361 caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc
     6421 cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc
     6481 accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata
     6541 acaatttcac acaggaaaca gct
//
LOCUS       SYNLACZH     6562 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD22.04.
ACCESSION   M34303
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD22.04.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6562)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      179      180     synthetic end/E.coli trpS start
    recomb      263      264     E.coli trpS end/synthetic start
    recomb      267      268     synthetic end/E.coli lacZ start
    recomb     3261     3262     E.coli lacZ end/synthetic start
    recomb     4096     4097     synthetic end/pUC19 start
    recomb     4345     4346     pUC19 end/synthetic start
    recomb     3321     3322     synthetic end/unknown DNA start
    recomb     4076     4077     unknown DNA end/synthetic start
    recomb     4356     4357     synthetic end/pUC19 start
BASE COUNT     1587 a   1650 c   1722 g   1603 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt gagggtaccg agctcagaaa aaatgactgc tccaaagaag aagcgtaagg
      181 taccggtggg tgaagaccag aaacagcacc tcgaactgag ccgcgatatt gcccagcgtt
      241 tcaacgcgct gtatggcgag atcgatcccg tcgttttaca acgtcgtgac tgggaaaacc
      301 ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata
      361 gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc
      421 gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag tgcgatcttc
      481 ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac gatgcgccca
      541 tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc acggagaatc
      601 cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag gaaggccaga
      661 cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac gggcgctggg
      721 tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca tttttacgcg
      781 ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt tatctggaag
      841 atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg cataaaccga
      901 ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc agccgcgctg
      961 tactggaggc tgaagttcag atgtgcggcg agttgcgtga ctacctacgg gtaacagttt
     1021 ctttatggca gggtgaaacg caggtcgcca gcggcaccgc gcctttcggc ggtgaaatta
     1081 tcgatgagcg tggtggttat gccgatcgcg tcacactacg tctgaacgtc gaaaacccga
     1141 aactgtggag cgccgaaatc ccgaatctct atcgtgcggt ggttgaactg cacaccgccg
     1201 acggcacgct gattgaagca gaagcctgcg atgtcggttt ccgcgaggtg cggattgaaa
     1261 atggtctgct gctgctgaac ggcaagccgt tgctgattcg aggcgttaac cgtcacgagc
     1321 atcatcctct gcatggtcag gtcatggatg agcagacgat ggtgcaggat atcctgctga
     1381 tgaagcagaa caactttaac gccgtgcgct gttcgcatta tccgaaccat ccgctgtggt
     1441 acacgctgtg cgaccgctac ggcctgtatg tggtggatga agccaatatt gaaacccacg
     1501 gcatggtgcc aatgaatcgt ctgaccgatg atccgcgctg gctaccggcg atgagcgaac
     1561 gcgtaacgcg aatggtgcag cgcgatcgta atcacccgag tgtgatcatc tggtcgctgg
     1621 ggaatgaatc aggccacggc gctaatcacg acgcgctgta tcgctggatc aaatctgtcg
     1681 atccttcccg cccggtgcag tatgaaggcg gcggagccga caccacggcc accgatatta
     1741 tttgcccgat gtacgcgcgc gtggatgaag accagccctt cccggctgtg ccgaaatggt
     1801 ccatcaaaaa atggctttcg ctacctggag agacgcgccc gctgatcctt tgcgaatacg
     1861 cccacgcgat gggtaacagt cttggcggtt tcgctaaata ctggcaggcg tttcgtcagt
     1921 atccccgttt acagggcggc ttcgtctggg actgggtgga tcagtcgctg attaaatatg
     1981 atgaaaacgg caacccgtgg tcggcttacg gcggtgattt tggcgatacg ccgaacgatc
     2041 gccagttctg tatgaacggt ctggtctttg ccgaccgcac gccgcatcca gcgctgacgg
     2101 aagcaaaaca ccagcagcag tttttccagt tccgtttatc cgggcaaacc atcgaagtga
     2161 ccagcgaata cctgttccgt catagcgata acgagctcct gcactggatg gtggcgctgg
     2221 atggtaagcc gctggcaagc ggtgaagtgc ctctggatgt cgctccacaa ggtaaacagt
     2281 tgattgaact gcctgaacta ccgcagccgg agagcgccgg gcaactctgg ctcacagtac
     2341 gcgtagtgca accgaacgcg accgcatggt cagaagccgg gcacatcagc gcctggcagc
     2401 agtggcgtct ggcggaaaac ctcagtgtga cgctccccgc cgcgtcccac gccatcccgc
     2461 atctgaccac cagcgaaatg gatttttgca tcgagctggg taataagcgt tggcaattta
     2521 accgccagtc aggctttctt tcacagatgt ggattggcga taaaaaacaa ctgctgacgc
     2581 cgctgcgcga tcagttcacc cgtgcaccgc tggataacga cattggcgta agtgaagcga
     2641 cccgcattga ccctaacgcc tgggtcgaac gctggaaggc ggcgggccat taccaggccg
     2701 aagcagcgtt gttgcagtgc acggcagata cacttgctga tgcggtgctg attacgaccg
     2761 ctcacgcgtg gcagcatcag gggaaaacct tatttatcag ccggaaaacc taccggattg
     2821 atggtagtgg tcaaatggcg attaccgttg atgttgaagt ggcgagcgat acaccgcatc
     2881 cggcgcggat tggcctgaac tgccagctgg cgcaggtagc agagcgggta aactggctcg
     2941 gattagggcc gcaagaaaac tatcccgacc gccttactgc cgcctgtttt gaccgctggg
     3001 atctgccatt gtcagacatg tataccccgt acgtcttccc gagcgaaaac ggtctgcgct
     3061 gcgggacgcg cgaattgaat tatggcccac accagtggcg cggcgacttc cagttcaaca
     3121 tcagccgcta cagtcaacag caactgatgg aaaccagcca tcgccatctg ctgcacgcgg
     3181 aagaaggcac atggctgaat atcgacggtt tccatatggg gattggtggc gacgactcct
     3241 ggagcccgtc agtatcggcg gaattccaac tgagcgccgg tcgctaccat taccaacttg
     3301 tctggtgtca aaaataatag gggccgctgt catcagatcg ccatctcgcg cccgtgcctc
     3361 tgacttctaa gtccaattac tcttcaacat ccctacatgc tctttctccc tgtgctccca
     3421 ccccctattt ttgttattat caaaaaaact tcttcttaat ttctttgttt tttagcttct
     3481 tttaagtcac ctctaacaat gaaattgtgt agattcaaaa atagaattaa ttcgtaataa
     3541 aaagtcgaaa aaaattgtgc tccctccccc cattaataat aattctatcc caaaatctac
     3601 acaatgttct gtgtacactt cttatgtttt ttttacttct gataaatttt ttttgaaaca
     3661 tcatagaaaa aaccgcacac aaaatacctt atcatatgtt acgtttcagt ttatgaccgc
     3721 aatttttatt tcttcgcacg tctgggcctc tcatgacgtc aaatcatgct catcgtgaaa
     3781 aagttttgga gtatttttgg aatttttcaa tcaagtgaaa gtttatgaaa ttaattttcc
     3841 tgcttttgct ttttgggggt ttcccctatt gtttgtcaag agtttcgagg acggcgtttt
     3901 tcttgctaaa atcacaagta ttgatgagca cgatgcaaga aagatcggaa gaaggtttgg
     3961 gtttgaggct cagtggaagg tgagtagaag ttgataattt gaaagtggag tagtgtctat
     4021 ggggtttttg ccttaaatga cagaatacat tcccaatata ccaaacataa ctgtttccta
     4081 ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct
     4141 gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac
     4201 aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg
     4261 catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg
     4321 taaggagaaa ataccgcatc aggcggcctt aagggcctcg tgatacgcct atttttatag
     4381 gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg
     4441 cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga
     4501 caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat
     4561 ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca
     4621 gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc
     4681 gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca
     4741 atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg
     4801 caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca
     4861 gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata
     4921 accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag
     4981 ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg
     5041 gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca
     5101 acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta
     5161 atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct
     5221 ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca
     5281 gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag
     5341 gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat
     5401 tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt
     5461 taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa
     5521 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga
     5581 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg
     5641 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc
     5701 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag
     5761 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc
     5821 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg
     5881 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac
     5941 accgaactga gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga
     6001 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt
     6061 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag
     6121 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg
     6181 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta
     6241 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc
     6301 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc
     6361 aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc
     6421 gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca
     6481 ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa
     6541 caatttcaca caggaaacag ct
//
LOCUS       SYNLACZI     6567 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD22.11.
ACCESSION   M34304
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD22.11.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6567)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      184      185     synthetic end/E.coli trpS start
    recomb      268      269     E.coli trpS end/synthetic start
    recomb      272      273     synthetic end/E.coli lacZ start
    recomb     3266     3267     E.coli lacZ end/synthetic start
    recomb     4101     4102     synthetic end/pUC19 start
    recomb     4350     4351     pUC19 end/synthetic start
    recomb     3326     3327     synthetic end/unknown DNA start
    recomb     4081     4082     unknown DNA end/synthetic start
    recomb     4361     4362     synthetic end/pUC19 start
BASE COUNT     1587 a   1651 c   1724 g   1605 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt gcttggaggg taccgagctc agaaaaaatg actgctccaa agaagaagcg
      181 taaggtaccg gtgggtgaag accagaaaca gcacctcgaa ctgagccgcg atattgccca
      241 gcgtttcaac gcgctgtatg gcgagatcga tcccgtcgtt ttacaacgtc gtgactggga
      301 aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg
      361 taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga
      421 atggcgcttt gcctggtttc cggcaccaga agcggtgccg gaaagctggc tggagtgcga
      481 tcttcctgag gccgatactg tcgtcgtccc ctcaaactgg cagatgcacg gttacgatgc
      541 gcccatctac accaacgtaa cctatcccat tacggtcaat ccgccgtttg ttcccacgga
      601 gaatccgacg ggttgttact cgctcacatt taatgttgat gaaagctggc tacaggaagg
      661 ccagacgcga attatttttg atggcgttaa ctcggcgttt catctgtggt gcaacgggcg
      721 ctgggtcggt tacggccagg acagtcgttt gccgtctgaa tttgacctga gcgcattttt
      781 acgcgccgga gaaaaccgcc tcgcggtgat ggtgctgcgt tggagtgacg gcagttatct
      841 ggaagatcag gatatgtggc ggatgagcgg cattttccgt gacgtctcgt tgctgcataa
      901 accgactaca caaatcagcg atttccatgt tgccactcgc tttaatgatg atttcagccg
      961 cgctgtactg gaggctgaag ttcagatgtg cggcgagttg cgtgactacc tacgggtaac
     1021 agtttcttta tggcagggtg aaacgcaggt cgccagcggc accgcgcctt tcggcggtga
     1081 aattatcgat gagcgtggtg gttatgccga tcgcgtcaca ctacgtctga acgtcgaaaa
     1141 cccgaaactg tggagcgccg aaatcccgaa tctctatcgt gcggtggttg aactgcacac
     1201 cgccgacggc acgctgattg aagcagaagc ctgcgatgtc ggtttccgcg aggtgcggat
     1261 tgaaaatggt ctgctgctgc tgaacggcaa gccgttgctg attcgaggcg ttaaccgtca
     1321 cgagcatcat cctctgcatg gtcaggtcat ggatgagcag acgatggtgc aggatatcct
     1381 gctgatgaag cagaacaact ttaacgccgt gcgctgttcg cattatccga accatccgct
     1441 gtggtacacg ctgtgcgacc gctacggcct gtatgtggtg gatgaagcca atattgaaac
     1501 ccacggcatg gtgccaatga atcgtctgac cgatgatccg cgctggctac cggcgatgag
     1561 cgaacgcgta acgcgaatgg tgcagcgcga tcgtaatcac ccgagtgtga tcatctggtc
     1621 gctggggaat gaatcaggcc acggcgctaa tcacgacgcg ctgtatcgct ggatcaaatc
     1681 tgtcgatcct tcccgcccgg tgcagtatga aggcggcgga gccgacacca cggccaccga
     1741 tattatttgc ccgatgtacg cgcgcgtgga tgaagaccag cccttcccgg ctgtgccgaa
     1801 atggtccatc aaaaaatggc tttcgctacc tggagagacg cgcccgctga tcctttgcga
     1861 atacgcccac gcgatgggta acagtcttgg cggtttcgct aaatactggc aggcgtttcg
     1921 tcagtatccc cgtttacagg gcggcttcgt ctgggactgg gtggatcagt cgctgattaa
     1981 atatgatgaa aacggcaacc cgtggtcggc ttacggcggt gattttggcg atacgccgaa
     2041 cgatcgccag ttctgtatga acggtctggt ctttgccgac cgcacgccgc atccagcgct
     2101 gacggaagca aaacaccagc agcagttttt ccagttccgt ttatccgggc aaaccatcga
     2161 agtgaccagc gaatacctgt tccgtcatag cgataacgag ctcctgcact ggatggtggc
     2221 gctggatggt aagccgctgg caagcggtga agtgcctctg gatgtcgctc cacaaggtaa
     2281 acagttgatt gaactgcctg aactaccgca gccggagagc gccgggcaac tctggctcac
     2341 agtacgcgta gtgcaaccga acgcgaccgc atggtcagaa gccgggcaca tcagcgcctg
     2401 gcagcagtgg cgtctggcgg aaaacctcag tgtgacgctc cccgccgcgt cccacgccat
     2461 cccgcatctg accaccagcg aaatggattt ttgcatcgag ctgggtaata agcgttggca
     2521 atttaaccgc cagtcaggct ttctttcaca gatgtggatt ggcgataaaa aacaactgct
     2581 gacgccgctg cgcgatcagt tcacccgtgc accgctggat aacgacattg gcgtaagtga
     2641 agcgacccgc attgacccta acgcctgggt cgaacgctgg aaggcggcgg gccattacca
     2701 ggccgaagca gcgttgttgc agtgcacggc agatacactt gctgatgcgg tgctgattac
     2761 gaccgctcac gcgtggcagc atcaggggaa aaccttattt atcagccgga aaacctaccg
     2821 gattgatggt agtggtcaaa tggcgattac cgttgatgtt gaagtggcga gcgatacacc
     2881 gcatccggcg cggattggcc tgaactgcca gctggcgcag gtagcagagc gggtaaactg
     2941 gctcggatta gggccgcaag aaaactatcc cgaccgcctt actgccgcct gttttgaccg
     3001 ctgggatctg ccattgtcag acatgtatac cccgtacgtc ttcccgagcg aaaacggtct
     3061 gcgctgcggg acgcgcgaat tgaattatgg cccacaccag tggcgcggcg acttccagtt
     3121 caacatcagc cgctacagtc aacagcaact gatggaaacc agccatcgcc atctgctgca
     3181 cgcggaagaa ggcacatggc tgaatatcga cggtttccat atggggattg gtggcgacga
     3241 ctcctggagc ccgtcagtat cggcggaatt ccaactgagc gccggtcgct accattacca
     3301 acttgtctgg tgtcaaaaat aataggggcc gctgtcatca gatcgccatc tcgcgcccgt
     3361 gcctctgact tctaagtcca attactcttc aacatcccta catgctcttt ctccctgtgc
     3421 tcccaccccc tatttttgtt attatcaaaa aaacttcttc ttaatttctt tgttttttag
     3481 cttcttttaa gtcacctcta acaatgaaat tgtgtagatt caaaaataga attaattcgt
     3541 aataaaaagt cgaaaaaaat tgtgctccct ccccccatta ataataattc tatcccaaaa
     3601 tctacacaat gttctgtgta cacttcttat gtttttttta cttctgataa attttttttg
     3661 aaacatcata gaaaaaaccg cacacaaaat accttatcat atgttacgtt tcagtttatg
     3721 accgcaattt ttatttcttc gcacgtctgg gcctctcatg acgtcaaatc atgctcatcg
     3781 tgaaaaagtt ttggagtatt tttggaattt ttcaatcaag tgaaagttta tgaaattaat
     3841 tttcctgctt ttgctttttg ggggtttccc ctattgtttg tcaagagttt cgaggacggc
     3901 gtttttcttg ctaaaatcac aagtattgat gagcacgatg caagaaagat cggaagaagg
     3961 tttgggtttg aggctcagtg gaaggtgagt agaagttgat aatttgaaag tggagtagtg
     4021 tctatggggt ttttgcctta aatgacagaa tacattccca atataccaaa cataactgtt
     4081 tcctactagt cggccgtacg ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa
     4141 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag
     4201 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta
     4261 tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag
     4321 atgcgtaagg agaaaatacc gcatcaggcg gccttaaggg cctcgtgata cgcctatttt
     4381 tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa
     4441 atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca
     4501 tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc
     4561 aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc
     4621 acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt
     4681 acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt
     4741 ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg
     4801 ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact
     4861 caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg
     4921 ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga
     4981 aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg
     5041 aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa
     5101 tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac
     5161 aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc
     5221 cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca
     5281 ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga
     5341 gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta
     5401 agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc
     5461 atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc
     5521 cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt
     5581 cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac
     5641 cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct
     5701 tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact
     5761 tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg
     5821 ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata
     5881 aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga
     5941 cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag
     6001 ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg
     6061 agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac
     6121 ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca
     6181 acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg
     6241 cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc
     6301 gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa
     6361 tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt
     6421 ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt
     6481 aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg
     6541 gataacaatt tcacacagga aacagct
//
LOCUS       SYNLACZJ     7242 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD26.77.
ACCESSION   M34305
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD26.77.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 7242)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     4776     4777     synthetic end/pUC19 start
    recomb     5025     5026     pUC19 end/synthetic start
    recomb     3244     3245     synthetic end/unknown DNA start
    recomb     4756     4757     unknown DNA end/synthetic start
    recomb     5036     5037     synthetic end/pUC19 start
BASE COUNT     1780 a   1801 c   1846 g   1815 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggcgaaac aaatcatctg acaccaccac cgtctgatgg atcgttctca tctccgtctc
     3301 cacattatta tccgacgact acatcgacac cgaatcgaat ggaaacaagt ccggagtaca
     3361 tgtttaacca tgaaatggtg ggtagatgat tattaaaatg tttaagaaaa ttaaataatt
     3421 tgttttaggc accaccggtc aatgcgatgt ggtatactac accacctcct tatcaagatc
     3481 caaactatcg tcatgtgcct ccaaatactg catttcaaaa tgcagagcaa atgaatggct
     3541 ccttctactg ttaatctatt taattcatta atttttcatt tattgactgt atcccggatg
     3601 tttcttgtcc tcccaacata tctcctaact gctcggttca ttttaaatat gctcatctca
     3661 ctacatcacc cagacactgg tccccacaga gttttttgta tactatttcg ggtcattttt
     3721 cttattctag actaatattg taagctataa gttgtagaat aattattgat ccaaatcaga
     3781 ttaagagtat aagctttgtt ttttctcctt ttctttataa cttgttacaa tttttgaaat
     3841 tccctttttt gacaggcttt tattacactg taactgtgtt tcttatcttg caaacattta
     3901 atgaattgta attctttagt atcttgaggg ctttttgttt ttcgaattat tgaagctcaa
     3961 agttccagtt ttactacgat ccagcgaatt ctcctcattt cgatccgatg caattgactt
     4021 cagatcaata ttggttgcct gaaagaaata attgtgagca tttttgtcaa aaaacagaga
     4081 actcaccatt ctcgaggctc ccgttccagg agcagtactt ggtgatggac acgtagattg
     4141 attaaaccaa accaaaggtt ctttcagagt caacttacag cctcgagcgt agtccgtgat
     4201 agcttctcgc agaacactga aaattggaaa tttattggaa taaaaacttt ttctgcactt
     4261 tatagaataa aaaaatcatg aatttacccg aatttaacct ccgaatcgta gaccaaattg
     4321 tccaagtaga tggaaatcac cttgaacatc ggatgttttt catatgctga aaataaatta
     4381 atgaatttat gtaatttttt aaataattac ttttcaattt ggtgaacaat tcctgcttct
     4441 ttgcataggc atctggacga gtgagtcctt tccaatcaat caatgtggtg tcgacctcga
     4501 gggggggccc ggtacccagc ttttgttccc tttagtgagg gttaattccg agcttggcgt
     4561 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca
     4621 taggagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagg taactcacat
     4681 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt
     4741 aatgaatcgg ccaacgccta ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg
     4801 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta
     4861 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg
     4921 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg
     4981 tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcggcctt aagggcctcg
     5041 tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg
     5101 gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa
     5161 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga
     5221 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc
     5281 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg
     5341 gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc
     5401 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat
     5461 tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg
     5521 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag
     5581 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa
     5641 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc
     5701 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca
     5761 cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc
     5821 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc
     5881 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg
     5941 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta
     6001 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag
     6061 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga
     6121 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc
     6181 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa
     6241 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa
     6301 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc
     6361 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt
     6421 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc
     6481 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac
     6541 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca
     6601 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcat tgagaaagcg
     6661 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag
     6721 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt
     6781 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat
     6841 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc
     6901 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt
     6961 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag
     7021 cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca
     7081 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga
     7141 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt
     7201 gtggaattgt gagcggataa caatttcaca caggaaacag ct
//
LOCUS       SYNLACZK     6620 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD34.110.
ACCESSION   M34306
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD34.110.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6620)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      237      238     synthetic end/E.coli trpS start
    recomb      321      322     E.coli trpS end/synthetic start
    recomb      325      326     synthetic end/E.coli lacZ start
    recomb     3319     3320     E.coli lacZ end/synthetic start
    recomb     4154     4155     synthetic end/pUC19 start
    recomb     4403     4404     pUC19 end/synthetic start
    recomb     3379     3380     synthetic end/unknown DNA start
    recomb     4134     4135     unknown DNA end/synthetic start
    recomb     4414     4415     synthetic end/pUC19 start
BASE COUNT     1592 a   1665 c   1731 g   1632 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt ggagggtacc tcgagaaagc tggcaaaggg ctcttgtcct gctaatcgta
      181 ctactcttca tcgtcatctt cgttattact gttttgttcg tcataagatc taacaaggta
      241 ccggtgggtg aagaccagaa acagcacctc gaactgagcc gcgatattgc ccagcgtttc
      301 aacgcgctgt atggcgagat cgatcccgtc gttttacaac gtcgtgactg ggaaaaccct
      361 ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc
      421 gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc
      481 tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct
      541 gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc
      601 tacaccaacg taacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg
      661 acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg
      721 cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc
      781 ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc
      841 ggagaaaacc gcctcgcggt gatggtgctg cgttggagtg acggcagtta tctggaagat
      901 caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact
      961 acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta
     1021 ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct
     1081 ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc
     1141 gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa
     1201 ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac
     1261 ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat
     1321 ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat
     1381 catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg
     1441 aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac
     1501 acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc
     1561 atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc
     1621 gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg
     1681 aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat
     1741 ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt
     1801 tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc
     1861 atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc
     1921 cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat
     1981 ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat
     2041 gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc
     2101 cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa
     2161 gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc
     2221 agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat
     2281 ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg
     2341 attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc
     2401 gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag
     2461 tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat
     2521 ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac
     2581 cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg
     2641 ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc
     2701 cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa
     2761 gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct
     2821 cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat
     2881 ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg
     2941 gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga
     3001 ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat
     3061 ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc
     3121 gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc
     3181 agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa
     3241 gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg
     3301 agcccgtcag tatcggcgga attccaactg agcgccggtc gctaccatta ccaacttgtc
     3361 tggtgtcaaa aataataggg gccgctgtca tcagatcgcc atctcgcgcc cgtgcctctg
     3421 acttctaagt ccaattactc ttcaacatcc ctacatgctc tttctccctg tgctcccacc
     3481 ccctattttt gttattatca aaaaaacttc ttcttaattt ctttgttttt tagcttcttt
     3541 taagtcacct ctaacaatga aattgtgtag attcaaaaat agaattaatt cgtaataaaa
     3601 agtcgaaaaa aattgtgctc cctcccccca ttaataataa ttctatccca aaatctacac
     3661 aatgttctgt gtacacttct tatgtttttt ttacttctga taaatttttt ttgaaacatc
     3721 atagaaaaaa ccgcacacaa aataccttat catatgttac gtttcagttt atgaccgcaa
     3781 tttttatttc ttcgcacgtc tgggcctctc atgacgtcaa atcatgctca tcgtgaaaaa
     3841 gttttggagt atttttggaa tttttcaatc aagtgaaagt ttatgaaatt aattttcctg
     3901 cttttgcttt ttgggggttt cccctattgt ttgtcaagag tttcgaggac ggcgtttttc
     3961 ttgctaaaat cacaagtatt gatgagcacg atgcaagaaa gatcggaaga aggtttgggt
     4021 ttgaggctca gtggaaggtg agtagaagtt gataatttga aagtggagta gtgtctatgg
     4081 ggtttttgcc ttaaatgaca gaatacattc ccaatatacc aaacataact gtttcctact
     4141 agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga
     4201 cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa
     4261 gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca
     4321 tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta
     4381 aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat ttttataggt
     4441 taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg
     4501 cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca
     4561 ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt
     4621 ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga
     4681 aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga
     4741 actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat
     4801 gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca
     4861 agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt
     4921 cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac
     4981 catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct
     5041 aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga
     5101 gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac
     5161 aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat
     5221 agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg
     5281 ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc
     5341 actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc
     5401 aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg
     5461 gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta
     5521 atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg
     5581 tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga
     5641 tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt
     5701 ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag
     5761 agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa
     5821 ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag
     5881 tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca
     5941 gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac
     6001 cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg aagggagaaa
     6061 ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc
     6121 agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg
     6181 tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc
     6241 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc
     6301 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag
     6361 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa
     6421 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga
     6481 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc
     6541 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca
     6601 atttcacaca ggaaacagct
//
LOCUS       SYNLACZL     5808 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Cloning vector pPD16.01.
ACCESSION   M34307
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD16.01.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 5808)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      180      181     synthetic end/E.coli trpS start
    recomb      264      265     E.coli trpS end/synthetic start
    recomb      268      269     synthetic end/E.coli lacZ start
    recomb     3262     3263     E.coli lacZ end/synthetic start
    recomb     3342     3343     synthetic end/unknown DNA start
    recomb     3591     3592     unknown DNA end/synthetic start
    recomb     3602     3603     synthetic end/pUC19 start
BASE COUNT     1375 a   1502 c   1599 g   1332 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
      181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
      241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
      301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
      361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
      421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
      481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
      541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
      601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
      661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
      721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
      781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
      841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
      901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
      961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
     1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
     1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
     1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
     1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
     1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
     1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
     1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
     1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
     1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
     1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
     1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
     1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
     1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
     1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
     1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
     1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
     1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
     2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
     2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
     2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
     2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
     2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
     2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
     2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
     2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
     2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
     2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
     2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
     2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
     2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
     2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
     2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
     2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
     3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
     3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
     3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
     3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
     3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
     3301 gtctggtgtc aaaaataata ggcctactag tcggccgtac gggccctttc gtctcgcgcg
     3361 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg
     3421 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg
     3481 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat
     3541 gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc ggccttaagg
     3601 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt
     3661 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac
     3721 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa
     3781 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat
     3841 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc
     3901 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga
     3961 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg
     4021 cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc
     4081 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag
     4141 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc
     4201 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg
     4261 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg
     4321 acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac
     4381 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac
     4441 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg
     4501 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg
     4561 tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg
     4621 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac
     4681 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg
     4741 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg
     4801 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc
     4861 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc
     4921 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt
     4981 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc
     5041 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact
     5101 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac
     5161 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag
     5221 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg
     5281 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg
     5341 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga
     5401 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt
     5461 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct
     5521 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg
     5581 aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt
     5641 aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta
     5701 atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta
     5761 tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct
//
LOCUS       HUMMHDQ3L     967 bp ds-DNA             PRI       03-JUL-1990
DEFINITION  Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment,
            long terminal repeat region.
ACCESSION   M33841
KEYWORDS    major histocompatibility complex.
SOURCE      Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 967)
  AUTHORS   Kambhu,S., Falldorf,P. and Lee,J.S.
  TITLE     Endogenous retroviral long terminal repeats (LTR) within the HLA DQ
            locus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by J.S.Lee, 25-APR-1990.
FEATURES       from  to/span     description
    rpt           1        7     inverted repeat A
    rpt         961      967     inverted repeat B
    rpt         560      810     R region
    signal      792      797     poly-A signal
    site         74       79     1/2 GRE
    site         80       87     enhancer core
    site        530      536     TATA box
BASE COUNT      254 a    235 c    222 g    256 t
ORIGIN      Chromosome 6p21.3.
        1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
       61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct
      121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa
      181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
      241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc
      301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt
      361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa
      421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa
      481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg
      541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg
      601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc
      661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg
      721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc
      781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga
      841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct
      901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac
      961 ccctaca
//
LOCUS       HUMMHDQ5L     960 bp ds-DNA             PRI       03-JUL-1990
DEFINITION  Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment,
            long terminal repeat region.
ACCESSION   M33842
KEYWORDS    major histocompatibility complex.
SOURCE      Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 960)
  AUTHORS   Kambhu,S., Falldorf,P. and Lee,J.S.
  TITLE     Endogenous retroviral long terminal repeats (LTR) within the HLA DQ
            locus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by J.S.Lee, 25-APR-1990.
FEATURES       from  to/span     description
    rpt           1        7     inverted repeat A
    rpt         954      960     inverted repeat B
    rpt         555      803     R region
    signal      785      790     poly-A signal
    site         74       79     1/2 GRE
    site         80       87     enhancer core
    site        525      531     TATA box
BASE COUNT      250 a    232 c    219 g    259 t
ORIGIN      Chromosome 6p21.3.
        1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
       61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct
      121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa
      181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
      241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag
      301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa
      361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt
      421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa
      481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt
      541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca
      601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac
      661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct
      721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat
      781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg
      841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa
      901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca
//
LOCUS       MUSMHEBF1     573 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Mouses MHC class II E-beta-f gene, exon 1.
ACCESSION   M35677 M34123
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     1 of 3
SOURCE      Mouse inbred strain B10.M) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 573)
  AUTHORS   Begovich,A.B., Vu,T.H. and Jones,P.P.
  TITLE     Characterization of the molecular defects in the mouse E-beta-f and
            E-beta-q genes: Implications for the origin of MHC polymorphism
  JOURNAL   J. Immunol. 144, 1957-1964 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept        301  +   394     MHC E-beta-f, exon 1
    IVS         395  >   573     MHC E-beta-f intron A
BASE COUNT      135 a    151 c    138 g    149 t
ORIGIN      Chromosome 17.
        1 cagctgcctc tgcctcctga gtgctgggat atgaggcatg gccagcagcc cagactgtgt
       61 atccatgtaa tgaagagaac tgcaagtttc agaagggaac ctgcaaactg aatctctaac
      121 taggaactga tgatgctgaa cttctttgat gctgattggc tcccagcact ggccttaccc
      181 aatccagtgg caaagcagtg aatgccctgt ctcttattat cttagcaatg agtaaagaga
      241 ataaagttac agtctgaagc ttgccttccc ctctgactct cgtgtctcct ctcctgcagc
      301 atgatgtggc tccccagagt tccctgtgtg gcagctgtga tcctgttgct gacagtgctg
      361 agccctccag tggctttggt cagagactcc agacgtaaat gcacacctca ggtgctggga
      421 tgctcggggt cggggaagga aggagctaac attctcactg tccagtccaa gtccctcgaa
      481 actattgata tcttctgtga gcatgcacag tcctcacatg aactctaaac tatgtcccca
      541 aacagacgcc tggatgtttg tgctctcaga tct
//
LOCUS       MUSMHEBF2     495 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Mouses MHC class II E-beta-f gene, exon 2.
ACCESSION   M35678 M34123
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     2 of 3
SOURCE      Mouse inbred strain B10.M) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 495)
  AUTHORS   Begovich,A.B., Vu,T.H. and Jones,P.P.
  TITLE     Characterization of the molecular defects in the mouse E-beta-f and
            E-beta-q genes: Implications for the origin of MHC polymorphism
  JOURNAL   J. Immunol. 144, 1957-1964 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    +    61  +   330     MHC E-beta-f, exon 2
    IVS     <     1       60     MHC E-beta-f intron A
    IVS         331  >   495     MHC E-beta-f intron B
BASE COUNT      111 a    112 c    180 g     92 t
ORIGIN      About 3.0 kb after segment 1; chromosome 17.
        1 cagctgagag ggactcgggc atcttgtcgg cagagaagaa gataattctt gtctccacag
       61 catggttttt ggaatactgt aaatctgagt gtcatttcta caacgggacg cagcgcgtgc
      121 ggtttctgaa aagatacttc tacaacctgg aggagaacct gcgcttcgac agcgacgtgg
      181 gcgagttccg cgcggtgacc gagctggggc ggccagacgc cgagaactgg aacagccagc
      241 cggagatcct ggaggatgcg cgggccgcgg tggacacgta ctgcagatac aactatgaga
      301 tcttggataa attccttgtg cggcggagag gtgagacagg acagggtggg tggggcggaa
      361 ccacggtgag ggtggggctg tggggagcag cagaaggcgg tgcgcatgtg cgcaggagcc
      421 gcagggaatg ctgggttccc tgcagctgga gccacaggcg cttttaagca gcctcttggc
      481 aggggaacgg aattc
//
LOCUS       MUSMHEBF3    2155 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Mouses MHC class II E-beta-f gene, exons 3,4,5 and 6.
ACCESSION   M35679 M34123
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     3 of 3
SOURCE      Mouse inbred strain B10.M) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2155)
  AUTHORS   Begovich,A.B., Vu,T.H. and Jones,P.P.
  TITLE     Characterization of the molecular defects in the mouse E-beta-f and
            E-beta-q genes: Implications for the origin of MHC polymorphism
  JOURNAL   J. Immunol. 144, 1957-1964 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    +   122      403     MHC E-beta-f, exon 3
                971     1081     MHC E-beta-f, exon 4
               1484     1507     MHC E-beta-f, exon 5
               1802     1815     MHC E-beta-f, exon 6
    IVS     <     1      121     MHC E-beta-f intron B
    IVS         404      970     MHC E-beta-f intron C
    IVS        1082     1483     MHC E-beta-f intron D
    IVS        1508     1801     MHC E-beta-f intron E
BASE COUNT      459 a    586 c    569 g    541 t
ORIGIN      About 3.9 kb after segment 2; chromosome 17.
        1 gatccattct ggatggatag atggaggtag gcaggcaggc aggcaggcag gcaggcatgc
       61 agacagccta caaggaggac agctccaccc tcatggctcc ttctcacctc tctttctcta
      121 gttgagccta cggtgactgt gtaccccaca aagacgcagc ccctggaaca ccacaacctc
      181 ctggtctgct ctgtgagtga cttctaccct ggcaacattg aagtcagatg gttccggaat
      241 ggcaaggagg agaaaacagg aattgtgtcc acgggcctgg tccgaaatgg agactggacc
      301 ttccagacac tggtgatgct ggagacggtt cctcagagtg gagaggttta cacctgccag
      361 gtggagtatc ccagcctgac cgaccctgtc acggtcgagt ggagtgagtg gtaacttcca
      421 gactctgtga atgcccgccc gggtgggtgt ggtttatccc tgcctgtcag ctttctccac
      481 ccacacactc tttccactgg ctttgtgctg tcctgccttt caccatggct tacagtgtag
      541 gtgcgtgaag cttctacaag cacagttgcc ccctgggaag cagttatgcc cccatagact
      601 catctgagcc tgccagtgac ataacaggtc ctggaatctt cttggcccct gctgcagtct
      661 ctgccgttgc tgggttgtgt tcctcctgct gctgctgctg ctgacgatgg acaaggagca
      721 gtgcagggtc atgactgaac tcagggacat atagtcatag ctctgccttt gctacccctc
      781 agagctcagc agcttcctgt cagctcggct caggcctgtt tggttggttt ctcaacatga
      841 ccaggaatgt tgacagccag atcttctaga acacacttct tccttgggct caaagctccg
      901 agtctcaggg gtccggagtg gaaatgggat ttgggctaaa accctccaaa cctttggctt
      961 cctttctcag aagcacagtc cacatctgca cagaacaaga tgttgagtgg agttgggggc
     1021 ttcgtgctgg gcctcctctt cctcggagcg gggctgttca tctacttcag gaaccagaaa
     1081 ggtaaggagc ctggtgggag ccccaactcc atagcatttc agggaaaagc catggctttg
     1141 ttctcaggat gccattggcc ctgtgacctc aggtttcatt ggattctgaa tgcaacagtc
     1201 tgtggttact tgatttgacc ctgaggaggg ataacacatg ggagagttaa gttgattctg
     1261 gcttgagacc tgaggacaga ggaaggctgg ggggagccat gggcactgcc ggtgactgaa
     1321 gctccctaag cccctccctc tgtccatgct cctcttggtt ctgtgtgctc tgggcagtat
     1381 taccagagga atctcaggtg gcagctcaga gtctggggac atgtgtctgg ggacagatct
     1441 gccttcatgc atgtaagcat ctattttatt ctctcttttc taggacagtc tggacttcag
     1501 ccaacaggta acacccattg tcttctctca gagacagatc tgctttccct acagtatggg
     1561 ggctggggtg atggactcag ggcacaaaat ggggaagact gagatcccag ggttggccag
     1621 gcagttagca ctgagccttg ctccctgcac ttactgaagc ctgtgctctg aagcagcaat
     1681 gactcggggc atgagaagtt cctctctgct cactgccatg ctgtaaggag aggcctgaag
     1741 cagtcagaga agccactgca gagtgaggtc tggaaacagc cctgtcccct gtgctctaca
     1801 ggactcctga gctgagatga agtaacaagg ctgaaggaag gagttccccc ccgtgtctcc
     1861 atgccatgaa aacatgtcct gcttggccca catccctcca gagacactgc tcttccagga
     1921 cctggctcct cctgattctc caccctggag atctgtgctc ctgatggctg cttatccctg
     1981 acccaggcct tgcagctccc agaacagagg ccccactctt cacatctcct gtcccctttt
     2041 gtcccttgcc ttttgtctgg cacttctgag ccagtctgct gtcatatgct tttttacatt
     2101 tttctcaaat aaacaaataa tgaaagtcat ctgcttcata gagtttcaag cagaa
//
LOCUS       MUSMHEBQ1     574 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Mouses MHC class II E-beta-q gene, exon 1.
ACCESSION   M35680 M34124
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     1 of 3
SOURCE      Mouse inbred strain B10.M) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 574)
  AUTHORS   Begovich,A.B., Vu,T.H. and Jones,P.P.
  TITLE     Characterization of the molecular defects in the mouse E-beta-f and
            E-beta-q genes: Implications for the origin of MHC polymorphism
  JOURNAL   J. Immunol. 144, 1957-1964 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept        301  +   394     MHC E-beta-q, exon 1
    IVS         395  >   574     MHC E-beta-q intron A
BASE COUNT      132 a    152 c    143 g    147 t
ORIGIN      Chromosome 17.
        1 cagctgcctc tgcctcctga gtgctgggat atgaggcatg gccagcagcc cagactgagt
       61 atccatgtaa tgaagagaac tgcaagtttc agaaggggac ctgcaaactg aatctctaac
      121 tagcaactga tgatgctgga ctcctttgat gctgattggc tcccagcact ggccttaccc
      181 aatccagtgg caaagcagtg aatgccctgt ctcttattat cttagcaatg agtaaagaga
      241 ataaagttac agtctgaagc ttgccttccc ctctgactcc tgtgtctcct ctcctgcagc
      301 atggtgtggc tccccagagt tccctgtgtg gcagctgtga tcctgttgct gacagtgctg
      361 agccctccag tggctttggt cagagactcc agacgttaag tgcacacctc aggtgctggg
      421 atgctcgggg tcggggaagg aaggagctaa cattctcact gtccaggcca agtccctcgg
      481 aactattgat atcttctgtg agcatgcaca gtcctcacat gaactctaaa ctatgtcccc
      541 aaacagaagc ctggatgttt gtgctctcag atct
//
LOCUS       MUSMHEBQ2     495 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Mouses MHC class II E-beta-q gene, exon 2.
ACCESSION   M35681 M34124
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     2 of 3
SOURCE      Mouse inbred strain B10.G) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 495)
  AUTHORS   Begovich,A.B., Vu,T.H. and Jones,P.P.
  TITLE     Characterization of the molecular defects in the mouse E-beta-f and
            E-beta-q genes: Implications for the origin of MHC polymorphism
  JOURNAL   J. Immunol. 144, 1957-1964 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    +    61  +   330     MHC E-beta-q, exon 2
    IVS     <     1       60     MHC E-beta-q intron A
    IVS         331  >   495     MHC E-beta-q intron B
BASE COUNT      113 a    116 c    176 g     90 t
ORIGIN      About 3.0 kb after segment 1; chromosome 17.
        1 cagctgagag ggactcgggc atcttgtcgg cagagaagaa gataattctt gtctccacag
       61 catggttttt ggaatactgt aaatctgagt gtcatttcta caacgggacg cagcgcgtgc
      121 ggtttctgaa aagatacttc tacaacctgg aggagaacct gcgcttcgac agcgacgtgg
      181 gcgagttccg cgcggtgacc gagctggggc ggccagacgc cgagaactgg aacagccagc
      241 cggagatcct ggagcaaaag cgggccgcgg tggacacgta ctgcagacac aactatgaga
      301 tcttcgataa cttccttgtg cggcggagag gtgagacagg acagggtggc tggggcggaa
      361 ccacggtgag ggtggggctg tggggagcag cagaaggcgg tgcgcatgtg cgcaggagcc
      421 gcagggaatg ctgggttccc tgcagctgga gccacaggcg cttttaagca gcctcttggc
      481 aggggaacgg aattc
//
LOCUS       MUSMHEBQ3    2159 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Mouses MHC class II E-beta-q gene, exons 3,4,5 and 6.
ACCESSION   M35682 M34124
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     3 of 3
SOURCE      Mouse inbred strain B10.G) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2159)
  AUTHORS   Begovich,A.B., Vu,T.H. and Jones,P.P.
  TITLE     Characterization of the molecular defects in the mouse E-beta-f and
            E-beta-q genes: Implications for the origin of MHC polymorphism
  JOURNAL   J. Immunol. 144, 1957-1964 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    +   126      407     MHC E-beta-q, exon 3
                975     1085     MHC E-beta-q, exon 4
               1488     1511     MHC E-beta-q, exon 5
               1806     1819     MHC E-beta-q, exon 6
    IVS     <     1      125     MHC E-beta-q intron B
    IVS         408      974     MHC E-beta-q intron C
    IVS        1086     1487     MHC E-beta-q intron D
    IVS        1512     1805     MHC E-beta-q intron E
BASE COUNT      459 a    589 c    572 g    539 t
ORIGIN      About 3.9 kb after segment 2; chromosome 17.
        1 gatccattct ggatggatag atggaggtag gcaggcaggc aggcaggcag gcaggcaggc
       61 atgcagacag cctacaagga ggacagctcc accctcatgg ctccttctca cctctctttc
      121 tctagttgag cctacggtga ctgtgtaccc cacaaagacg cagcccctgg aacaccacaa
      181 cctcctggtc tgctctgtga gtgacttcta ccctggcaac attgaagtca gatggttccg
      241 gaatggcaag gaggagaaaa caggaattgt gtccacgggc ctggtccgaa atggagactg
      301 gaccttccag acactggtga tgctggagac ggttcctcag agtggagagg tttacacctg
      361 ccaggtggag catcccagcc tgaccgaccc tgtcacggtc gagtggagtg agtggtaact
      421 tccagactct gtgaatgccc gcccgggtgg gtgtggttta tccccgcctg tcagctttct
      481 ccacccacac actctttcca ctggctttgt gctgtcctgc ctttcaccat ggcttacagg
      541 gtaggtgcgt gaagcttcta caagcacagt tgccccctgg gaagcagtta tgcccccata
      601 gactcatctg agcctgccag tgacataaca ggtcctggaa tcttcttggc ccctgctgca
      661 gtctctgccg ttgctgggtt gtgttcctcc tgctgctgct gctgctgacg atggacaagg
      721 agcagtgcag ggtcatgact gaactcaggg acatatagtc atagctctgc ctttgctacc
      781 cctcagagct cagcagcttc ctgtcagctc ggctcaggcc tgtttggttg gtttctcaac
      841 atgaccagga atgttgactg ccagatcttc tagaacacac ttcttccttg ggctcaaagc
      901 tccgagtctc aggggtccgg agtggaaatg ggatttgggc taaaaccctc caaacctttg
      961 gcttcctttc tcagaagcac agtccacatc tgcacagaac aagatgttga gtggagttgg
     1021 gggcttcgtg ctgggcctcc tcttcctcgg agcggggctg ttcatctact tcaggaacca
     1081 gaaaggtaag gagcctggtg ggagccccaa ctccatagca tttcagggaa aagccatggc
     1141 tttgttctca ggatgccatt ggccctgtga cctcaggttt cattggattc tgaatgcaac
     1201 agtctgtggt tacttgattt gaccctgagg agggataaca catgggagag ttaagttgat
     1261 tctggcttga gacctgagga cagaggaagg ctggggggag ccatgggcac tgccggtgac
     1321 tgaagctccc taagcccctc cctctgtcca tgctcctctt ggttctgtgt gctctgggca
     1381 gtattaccag aggaatctca ggtggcagct cagagtctgg ggacatgtgt ctggggacag
     1441 atctgccttc atgcatgtaa gcatctattt tattctctct tttctaggac agtctggact
     1501 tcagccaaca ggtaacaccc attgtcttct ctcagagaca gatctgcttt ccctacagta
     1561 tgggggctgg ggtgatggac tcagggcaca aaatggggaa gactgagatc ccagggttgg
     1621 ccaggcagtt agcactgagc cttgctccct gcacttactg aagcctgtgc tctgaagcag
     1681 caatgactcg gggcatgaga agttcctctc tgctcactgc catgctgtaa ggagaggcct
     1741 gaagcagtca gagaagccac tgcagagtga ggtctggaaa cagccctgtc ccctgtgctc
     1801 tacaggactc ctgagctgag atgaagtaac aaggctgaag gaaggagttc ccccccgtgt
     1861 ctccatgcca tgaaaacatg tcctgcttgg cccacatccc tccagagaca ctgctcttcc
     1921 aggacctggc tcctcctgat tctccaccct ggagatctgt gctcctgatg gctgcttatc
     1981 cctgacccag gccttgcagc tcccagaaca gaggccccac tcttcacatc tcctgtcccc
     2041 ttttgtccct tgccttttgt ctggcacttc tgagccagtc tgctgtcata tgctttttta
     2101 catttttctc aaataaacaa ataatgaaag tcatctgctt catagagttt caagcagaa
//
LOCUS       RATHPA1      3282 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Rat haptoglobin (Hp) gene, exons 1,2 and 3.
ACCESSION   M34230
KEYWORDS    haptoglobin.
SEGMENT     1 of 3
SOURCE      Rat (strain Wistar) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3282)
  AUTHORS   Marinkovic,S. and Baumann,H.
  TITLE     Structure, hormonal regulation, and identification of the
            interleukin-6- and dexamethasone-responsive element of the rat
            haptoglobin gene
  JOURNAL   Mol. Cell. Biol. 10, 1573-1583 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1065     1069     haptoglobin (Hp), exon 1
               2797     2879     haptoglobin, exon 2
               3125  +  3226     haptoglobin, exon 3
    pre-msg    1032  >  3282     Hp mRNA and intron
    IVS        1070     2796     Hp intron A
    IVS        2880     3124     Hp intron B
    IVS        3227  >  3282     Hp intron C
BASE COUNT      858 a    743 c    830 g    831 t     20 others
ORIGIN      
        1 ctaatttaaa aacgtttttt aaaacgagtg aagccattta ggttgagcgg ctacattagc
       61 gtgaacagca ttccagaaca ggtgtcgggc tgaaacattg gttttctcct gggctgcgaa
      121 cacagcgagt ctgccattga ggactctgtc tctacactag catgtggtgt ggctttccgc
      181 taacaacaat cagaggagac acagcaggct catttcactg atttcaaatc ggaagacttt
      241 tagcaacagg aagatgtcct catgggtcgg gaagcaactg tgaaacggaa ccgatttctt
      301 tttactgttc tgtgggcgag actgcaggaa tttctacact ggatttaagt gattccgaga
      361 taagtccaga gaagggagcc agtacaaggg tcccatgtca gtctacctat agagctttag
      421 tcactctgag attgaagagc agtggaccaa gacccaataa ctcagtctgc tgcctgcaaa
      481 ttccagagct ctccacaccc aggagatggt catgcttggg caggagagtt gaaaaaagaa
      541 aagacttctt ttatagtctg agttaagggc tgggtcacaa gggtgtttaa aaaaaaaaaa
      601 aaagagggct ggggatttag ctcagtggta gagcgcttac ctaggaagca caaggcctgg
      661 gttcggtccc agctcgaaaa aaagaccaaa aaaaaaaaaa aaaaaaaaaa aaaaagagag
      721 gtctcgtccc tctcccagtt aagtatcaga ttaacagccc ctattccccg tcccactctc
      781 tggggttatc acactgcggt gggtgggagg ggtcgtgaag ttgctagatt tcttcatgat
      841 ttgtaaaata acaccacgag gagagccaag tatgaagcaa gagctcagct cttgaaaagg
      901 ggtttgcttt gtggttactg gaacagtcac tgaccttagc aaggccgaca ttgtgcaaac
      961 acagaaatgg aagaaaagga ggtggggtga aaccgaagca taaaaagggt gagcaggagt
     1021 cagcacagcg cacgccttct ggaaagaggt gagagaggcc cacgatgagg tgagtccaca
     1081 gtccacactt ttgggcacac aatgcagatg tctctgggag agtgagaaaa tgggatgcag
     1141 gaacagggcc gatgggcacc gttctgtggg agttaagccc gcagcctgca ggcgcatatg
     1201 gcgagggata gagctgtgga tgcattgcaa cacactgtaa acttacctga agcgttgtga
     1261 gacttttttt tttttttttg gtcttttttt tcggagctgg ggaccgaacc caggccttgc
     1321 tgccttccta ggcaaagtcc gctctaccac tgagctaaat tccccaaccc cgcgttgtga
     1381 gacttttgtt ttataacttg actatgcagt ttgagtgtga attttgttgg gtgaagacct
     1441 caggctgaaa tgtcaaaggc aggaagtgaa gggaccagtg acaaagcccc ttcctccctg
     1501 tgtccatgag agatgggcag gacagacagg gctttctatc tctaaggagg atctttccca
     1561 gtgagatgaa aggttttgtt ttttaccagg catgcagcag cttcctggga tgctggctgt
     1621 gctgttaaca gacttcctgc ttttaaagga acaaagacaa tagtcacaca gtctagtggc
     1681 accatcaagg catccccctt cctttttaaa atcaaaatat aaagactttg aaggttacaa
     1741 aaagactaga agcatagtgt ccaaaaggaa ttcctaactg gccagaatct acagggaatt
     1801 ggttaccgtt taagtgtggt ctgtgtacca atggtggcca caagtcatgc tgagaggaag
     1861 ccagttttct ccaggtaact tctggtttga tacacaatcc ctttttttaa aattatttat
     1921 ttatttattt gtttgtttgt ttctgtgagt acactgtcgc tgtctttaga cacaccagaa
     1981 gagggcatcg gatctcatta cagaggttgt gagccaccat gtgttgctgg gaattgaact
     2041 caggacctct ggaagagcag tcagtcgtct taaccgctgg gaattgaact caggacctct
     2101 ggaagagcag tcagtgctct taaccgctga gccctctctc cagccctgat atataacctt
     2161 aagaccaaat acttatgaag taataggagc aagcacatgt gagttatata catatgtata
     2221 tatttgggtc atagtgcaca cccagggatt ctagagctga ggcagggtga agtctgggag
     2281 ttcaggagtt gtgacagcta gaaagatgga ctgtgtctnn nnnnnnnnnn nnnnnnnnta
     2341 accttttcat tttggaattc caaaaagaga agagccaaat aaattagagc catcatcttt
     2401 aagttagcta cgatgtccta acaatgtctt catagctgga acttaatgat gcgtgcagag
     2461 gcttcccctt gctgacgttg tggtcaccac cagaggcaga ggcagaggca gaggctcact
     2521 ttgctctgtg cctcctcccc agttggttct tgttccacct cccactctcg ggcgggagac
     2581 aggcacttgt tatgtagcac tacgtaaagc cccgatcctc ctgcctcaga gtggagagct
     2641 ggggtagcac atatgcttcc acactggtgc tgctttcctt cgggtcatgg tgctcccttt
     2701 ctaagcttct acaaaattcc ccagtgacac cttgcttgcg tgtaatgcac aaatgcaaga
     2761 agaccaactc tactccttct tgccacttct ctacagagcc ctgggagctg tcgtcactct
     2821 cctgctctgg ggtcagcttt ttgctgtgga attgggcaat gatgccacag acattgaagg
     2881 tgagtctcag gggtttccca ggagctgtgc accccagcag gctgtggccc tgtctgacca
     2941 catcagtccc gcactgtatt aaggaagacc cagacctcct ctcgcctaga ccctcggggc
     3001 ctcccggcct cagcttccac tcggtgcaag ggagtctggt gttcagggca gctccgtctc
     3061 ttctggcttt gcacggggag catctgatca ccacagccct ttcctcgctt ctttctcttg
     3121 gcagatgaca gctgcccaaa gcccccagag attgcaaacg gctatgtgga acacttggtt
     3181 cgttatcgct gccgacagtt ctacaaacta cagaccgaag gagatggtaa ggctgtttga
     3241 gcgggtaggg ctaggctgtc acaccagaac ttaagtgctg ct
//
LOCUS       RATHPA2       482 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Rat haptoglobin (Hp) gene, exon 4.
ACCESSION   M34231
KEYWORDS    haptoglobin.
SEGMENT     2 of 3
SOURCE      Rat (strain Wistar) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 482)
  AUTHORS   Marinkovic,S. and Baumann,H.
  TITLE     Structure, hormonal regulation, and identification of the
            interleukin-6- and dexamethasone-responsive element of the rat
            haptoglobin gene
  JOURNAL   Mol. Cell. Biol. 10, 1573-1583 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +   296  +   370     haptoglobin (Hp), exon 4
    IVS     <     1      295     Hp intron C
    IVS         371  >   482     Hp intron D
BASE COUNT      118 a    108 c    120 g    136 t
ORIGIN      
        1 ttaacccgtg agccgtctcc agtccaggga gtgtagtcta tctacgactt tgtacagcct
       61 acattcctga caatttctaa gagcttcatt gtgtctttaa agctcccgtg gttgtcatag
      121 cctccttttg ggagagacac tctttaattc cattttttca atgaggaaac tgaggacgga
      181 gatgccaagg tagcttgtga ggggaagagt cttgatctga actctgacct cttcctgtcc
      241 aactctttca tcaggccaca ttcattttct ctgagctcac ctccttttgt ttcaggaatc
      301 tacaccttaa acagtgagaa gcaatgggtg aacccagctg ctggcgataa actccccaag
      361 tgtgaggcag gtgggtgttg aggtcttaaa gcatggggct aaaatggggc catgtttctc
      421 ttgtgtgcct gagtgagtaa gacagggtca gagagacacg ctgcaaagga ggacaatgac
      481 ta
//
LOCUS       RATHPA3      1245 bp ds-DNA             ROD       03-JUL-1990
DEFINITION  Rat haptoglobin (Hp) gene, exon 5.
ACCESSION   M34232
KEYWORDS    haptoglobin.
SEGMENT     3 of 3
SOURCE      Rat (strain Wistar) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1245)
  AUTHORS   Marinkovic,S. and Baumann,H.
  TITLE     Structure, hormonal regulation, and identification of the
            interleukin-6- and dexamethasone-responsive element of the rat
            haptoglobin gene
  JOURNAL   Mol. Cell. Biol. 10, 1573-1583 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +   329     1107     haptoglobin (Hp), exon 5
    IVS     <     1      328     Hp intron D
BASE COUNT      309 a    297 c    342 g    297 t
ORIGIN      
        1 ctgcagaggc tctggaagaa tcagccacca ctgcttgcga aaccaacagt acaggaacac
       61 tgcccttgcc acctgctccg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg
      121 tgtgtgtgtg tgtgtacgtg tgtataaata tatatatgta tatacctaca tacatatgta
      181 atcgtcatca cacatacaca ttccacaatc ctcttgaaag tcaatgacag acctgaaagc
      241 tgtgtacatt tcattcttag acaaagttgc cctgcagggg cctggtgtga actgctgctc
      301 acatcggtct ctcctcctcc ctccgcagtg tgtgggaagc ccaagcatcc tgtggaccag
      361 gtacagcgca tcatcggtgg ttccatggac gccaaaggca gctttccttg gcaggccaag
      421 atgatctcca gacatggact caccactggg gccacactga tcagtgacca gtggctgctg
      481 accactgccc aaaacctctt cctgaatcac agtgagaatg cgacagccaa ggacattgcc
      541 cctaccttaa cactctatgt ggggaaaaac cagctggtgg agattgagaa ggtagttctc
      601 caccccgagc gctctgtggt ggatatcggg ctgatcaagc tcaaacagaa agtgcttgtc
      661 actgagaaag tcatgcctat ctgcctgcct tccaaagact acgtagcgcc aggccgcatg
      721 ctatgtgtcc ggttgggggc gcggaatgtc aactttagat ttactgaacg tctcaagtat
      781 gtcatgctgc ctgtggctga ccaggagaag tgtgagctgc actatgagaa aagcacagtg
      841 cctgagaaga aaggcgctgt aactcctgtt ggggtacagc ccatcttgaa taagcatacc
      901 ttctgtgctg gccttaccaa gtatgaggaa gacacttgct atggtgacgc tggcagtgcc
      961 tttgccgtcc atgacacgga ggaggacacc tggtatgcag ctgggatcct gagctttgac
     1021 aagagttgtg ccgtagctga gtatggtgtg tatgtgaagg caactgatct gaaggactgg
     1081 gtccaggaaa caatggccaa gaactagttc agggctgact agagggctgc acacagtggg
     1141 gcagggcaat tcaccctgga agaggaagta gaagggttgg ggacataatc tgagggctgc
     1201 tagccctgca ttgctcagtc aataataaaa aacgagcttt ggacc
//
LOCUS       MUSTCAXL      331 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma LD1.
ACCESSION   M34194
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 331)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   331     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       71 a     86 c     89 g     85 t
ORIGIN      
        1 cagtcagtga cgcagcccga tgctcgtgtc actgtctctg aaggagcctc tctgcagctg
       61 agatgcaagt attcctcctc tgtgacacct tatctgttct ggtatgtcct gtacccgcgg
      121 caggggctgc agctgctcct caagtactat tccggagacc cagtggttca aggagtgaat
      181 ggctttgagg ctgagttcag caagagtaac tcttccttcc acctgcggaa agcctccgtg
      241 cactggagcg actcggctgt gtacttctgt gctgtgagca tggatggaaa tgagaaaata
      301 acttttgggg ctggaaccaa actcaccatt a
//
LOCUS       MUSTCAXM      334 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma LD3.
ACCESSION   M34196
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 334)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   334     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       97 a     84 c     75 g     78 t
ORIGIN      
        1 cagcaggtga gacaaagtcc ccaatctctg acagtctggg aaggagagac agcaattctg
       61 aactgcagtt atgaggacag cacttttgac tacttcccat ggtaccgtct gttccctggg
      121 gaaagccctg cactcctgat agccatacgt ccagtgtcca ataaaaagga agatggacga
      181 ttcacaatct tcttcaataa aagggagaaa aagctctcct tgcacatcac agactctcag
      241 cctggagact cagctaccta cttctgtgca gcaagaagta caggctttgc aagtgcgctg
      301 acatttggat ctggcacaaa agtcattgtt ctac
//
LOCUS       MUSTCAXN      327 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-14.
ACCESSION   M34198
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 327)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   327     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       92 a     80 c     85 g     70 t
ORIGIN      
        1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata
       61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga
      121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg
      181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa
      241 gagtcagact cggctgtgta ctactgtgct ctgagtgatc agcgggggaa gcttatcttt
      301 ggacagggaa ccaagttatc tatcaag
//
LOCUS       MUSTCAXO      324 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-27.
ACCESSION   M34200
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   324     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       92 a     80 c     83 g     69 t
ORIGIN      
        1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata
       61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga
      121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg
      181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa
      241 gagtcagact cggctgtgta ctactgtgct ctgaggagca actatcagtt gatctggggc
      301 tctgggacca agctaattat aaag
//
LOCUS       MUSTCAXP      297 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-6.
ACCESSION   M34202
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 297)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   297     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       85 a     73 c     74 g     65 t
ORIGIN      
        1 gtgacattat ctgaaggaac ttctctgact gtgaactgtt cctatgaaac caaacagtac
       61 ccaaccctgt tctggtatgt gcagtatccc ggagaaggtc cacagctcct ctttaaagtc
      121 ccaaaggcca acgagaaggg aagcagcaga gggtttgaag ccacatacaa taaagaagcc
      181 acctccttcc acttgcagaa agcctcagtg caagagtcag actcggctgt gtactactgt
      241 gctctgagtg atcgggggac caatacaggc aaattaacct ttggggatgg gaccgtg
//
LOCUS       MUSTCAXQ      193 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, T-cell clone V2.1.
ACCESSION   M34204
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 193)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   193     T-cell receptor alpha-chain (AA at 2)
BASE COUNT       56 a     45 c     49 g     43 t
ORIGIN      
        1 cctctttaaa gtcccaaagg ccaacgagaa gggaagcagc agagggtttg aagccacata
       61 caataaagaa gccacctcct tccacttgca gaaagcctca gtgcaagagt cagactcggc
      121 tgtgtactac tgtgctctga gtggaggcaa taataagctg acttttggtc aaggaaccgt
      181 tctgagtgtt ctg
//
LOCUS       MUSTCAXR      333 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-33.
ACCESSION   M34206
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 333)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   333     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       84 a     96 c     78 g     74 t      1 others
ORIGIN      
        1 gactccgtga cccagacaga aggcctggtc actgtcaccg aggggttgcc tgtgaagctg
       61 aactgcacct atcagactac ttatttaact attgcctttt tctggtatgt gcaatatctc
      121 aacgaagccc ctcaggtact cctgcggagc tccacagaca acaagaggac cgagcaccaa
      181 gggttccacg ccactctcna taagagcagc agctccttcc atctgcagaa gtcctcagcg
      241 cagctgtcag actctgccct gtactactgt gctctgagga atacaggagg tgcagataga
      301 ctcacctttg ggaaaggaac tcagctgatc atc
//
LOCUS       MUSTCAXS      339 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-7.
ACCESSION   M34208
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   339     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       86 a    103 c     75 g     75 t
ORIGIN      
        1 gactcagtga cccagacaga aggcctggtc actctcaccg aggggttgcc tgtgatgctg
       61 aactgcacct atcagactgc ttactcaact ttccttttct ggtatgtgca acatctcaat
      121 gaagccccta aactactcct gaagagctcc acagacaaca agaggaccga gcaccaaggg
      181 ttccacgcca ctctccataa gagcagcagc tccttccatc tgcagaagtc ctcagcgcag
      241 ctgtcagact ctgccctgta ctactgtgct ctgagtgata agactggagc taacactgga
      301 aagctcacgt ttggacacgg caccatcctt agggtccat
//
LOCUS       MUSTCAXT      342 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma P1F12C4.
ACCESSION   M34210
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 342)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   342     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       80 a    102 c     79 g     81 t
ORIGIN      
        1 gactccgtga cccagacaga aggcctggtc actctcaacg aggggttgcc tgtgatgctg
       61 aactgcacct atcagactat ttactcaaat gctttccttt tctggtatgt gcactatctc
      121 aatgaatccc cttggctact cctgcggagc tccacagaca acaagaggac cgagcaccaa
      181 gggttccacg ccactctcca taagagcagc agctccttcc atctgcagaa gtcctcagcg
      241 cagctgtcag actctgccct gtactactgt gctttgagtg agaggtctgg agctaacact
      301 ggaaagctca cgtttggaca cggcaccatc cttagggtcc at
//
LOCUS       MUSTCAXU      324 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma P1D3A6.
ACCESSION   M34212
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   324     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       82 a     80 c     81 g     81 t
ORIGIN      
        1 cagcaagtgc agcagagccc cgcgtccttg gttctgcagg agggggagaa tgcagagctg
       61 cagtgtaact tttccacatc tttgaacagt atgcagtggt tttaccaacg tcctgaggga
      121 agtctcgtca gcctgttcta caatccttct gggacaaagc agagtgggag actgacatcc
      181 acaacagtca tcaaagaacg tcgcagctct ttgcacattt cctcctccca gatcacagac
      241 tcaggcactt atctctgtgc tatggaggct actggaggca ataataagct gacttttggt
      301 caaggaaccg ttctgagtgt tata
//
LOCUS       MUSTCAXV      210 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma 1E1O.
ACCESSION   M34214
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 210)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   210     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       54 a     53 c     50 g     53 t
ORIGIN      
        1 gggggaagtc tcgtcagcct gttctacaat ccttctggga caaagcagag tgggagactg
       61 acatccacta cagtcatcaa agaacgtcgc agctctttgc acatttcctc ctcccagaca
      121 acagactcag gcacttatct ctgtgctatg gcggctactg gaggcaataa taagctgact
      181 tttggtcaag gaaccgttct gagtgttata
//
LOCUS       MUSTCAXW      234 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma 7/6AH1.
ACCESSION   M34216
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 234)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   234     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       55 a     59 c     58 g     62 t
ORIGIN      
        1 atgcagtggt tttatcaacg tcctggggga agtctcgtca gcctgttcta caatccttct
       61 gggacaaagc agagtgggag actgacatcc actacagtca tcaaagaacg tcgcagctct
      121 ttgcacattt cctcctccca gacaacagac tcaggcactt atctctgtgc tatgggtgta
      181 tctggtagct tcaataagtt gacctttgga gcagggacca gactggctgt gtgc
//
LOCUS       MUSTCAXX      312 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma 2B11.
ACCESSION   M34218
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   312     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       89 a     79 c     79 g     64 t      1 others
ORIGIN      
        1 aatccgtggg ccctgagngt ccacgagggt gaaagtgtca cggtgaattg tagttacaag
       61 acatccataa ctgccctaca gtggtacaga cagaagtcag gcgaaggccc tgcccagcta
      121 atcttaatac gttcaaatga gagagagaag cgcaatggaa gactcagagc cacccttgac
      181 acctccagcc agagcagctc cttgtccatc actgctactc ggtgtgaaga caccgctgtg
      241 tacttctgtg ctactgagac aggcaatact agaaaacaca tctttgggct ggggacaact
      301 ttgcaagtgc aa
//
LOCUS       MUSTCBYAO     153 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma LD1.
ACCESSION   M34195
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 153)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   153     T-cell receptor beta-chain (AA at 1)
    recomb      122      123     V-region end/J-region start
BASE COUNT       39 a     37 c     36 g     41 t
ORIGIN      
        1 caaataggag atgtccctga tgggtacaag gccaccagaa caacgcaaga agacttcttc
       61 ctcctgctgg aattggcttc tccctctcag acatctttgt acttctgtgc cagcagtgta
      121 ggttctggaa atacgctcta ttttggagaa gga
//
LOCUS       MUSTCBYAP     111 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma LD3.
ACCESSION   M34197
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   111     T-cell receptor beta-chain (AA at 1)
    recomb       73       74     V-region end/J-region start
BASE COUNT       18 a     37 c     25 g     31 t
ORIGIN      
        1 ttcctcctgc tggaattggc ttctccctct cagacatctt tgtacttctg tgccgcgtcc
       61 ccgacaggga acaccgacta caccttcggc tcagggacca ggcttttggt a
//
LOCUS       MUSTCBYAQ     321 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-14.
ACCESSION   M34199
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   321     T-cell receptor beta-chain (AA at 1)
    recomb      292      293     V-region end/J-region start
BASE COUNT      106 a     81 c     61 g     73 t
ORIGIN      
        1 gacccgaaaa ttatccagaa accaaaatat ctggtggcag tcacagggag cgaaaaaatc
       61 ctgatatgcg aacagtatct aggccacaat gctatgtatt ggtatagaca aagtgctaag
      121 aagcctctag agttcatgtt ttcctacagc tatcaaaaac ttatggacaa tcagactgcc
      181 tcaagtcgct tccaacctca aagttcaaag aaaaaccatt tagaccttca gatcacagct
      241 ctaaagcctg atgactcggc cacatacttc tgtgccagca gccccaagac acgtcaaaac
      301 accttgtact ttggtgcggg c
//
LOCUS       MUSTCBYAR     210 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-27.
ACCESSION   M34201
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 210)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   210     T-cell receptor beta-chain (AA at 1)
    recomb      167      168     V-region end/J-region start
BASE COUNT       57 a     56 c     43 g     54 t
ORIGIN      
        1 aagattatgt ttagctacaa taataagcaa ctcattgtaa acgaaacagt tccaaggcgc
       61 ttctcacctc agtcttcaga taaagctcat ttgaatcttc gaatcaagtc tgtagagccg
      121 gaggactctg ctgtgtatct ctgtgccagc agctatcgga caccccccta tgctgagcag
      181 ttcttcggac cagggacacg actcaccgtc
//
LOCUS       MUSTCBYAS     255 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-6.
ACCESSION   M34203
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 255)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   255     T-cell receptor beta-chain (AA at 1)
    recomb      219      220     V-region end/J-region start
BASE COUNT       57 a     68 c     77 g     53 t
ORIGIN      
        1 cagtatccct ggatgagctg gtatcagcag gatctccaaa agcaactaca gtggctgttc
       61 actctgcgga gtcctgggga caaagaggtc aaatctcttc ccggtgctga ttacctggcc
      121 acacgggtca ctgatacgga gctgaggctg caagtggcca acatgagcca gggcagaacc
      181 ttgtactgca cctgcagtgc ggggactggg ggggctacta acaccttgta ctttggtgcg
      241 ggcacccgac tatcg
//
LOCUS       MUSTCBYAT     213 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, T-cell clone V2.1.
ACCESSION   M34205
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 213)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   213     T-cell receptor beta-chain (AA at 1)
    recomb      171      172     V-region end/J-region start
BASE COUNT       48 a     59 c     58 g     48 t
ORIGIN      
        1 ctgaggctga tccattattc atatggtgct ggcagcactg agaaaggaga tatccctgat
       61 ggatacaagg cctccagacc aagccaagag aacttctccc tcattctgga gttggctacc
      121 ccctctcaga catcagtgta cttctgtgcc agcggtggcg gccgggggag ttatgctgag
      181 cagttcttcg gaccagggac acgactcacc gtc
//
LOCUS       MUSTCBYAU     207 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-33.
ACCESSION   M34207
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 207)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   207     T-cell receptor beta-chain (AA at 1)
    recomb      163      164     V-region end/J-region start
BASE COUNT       65 a     54 c     36 g     52 t
ORIGIN      
        1 ctagagttca tgttttccta cagctatcaa aaacttatgg acaatcagac tgcctcaagt
       61 cgcttccaac ctcaaagttc aaagaaaaac catttagacc ttcagatcac agctctaaag
      121 cctgatgact cggccacata cttctgtgcc agcagcaaaa gggccaacga aagattattt
      181 ttcggtcatg gaaccaagct gtctgtc
//
LOCUS       MUSTCBYAV     156 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-7.
ACCESSION   M34209
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   156     T-cell receptor beta-chain (AA at 1)
    recomb      118      119     V-region end/J-region start
BASE COUNT       30 a     48 c     40 g     38 t
ORIGIN      
        1 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg
       61 gcttctccct ctcagacatc tttgtacttc tgtgccagca gtgtccgggt ctgggggcct
      121 gaacagtact tcggtcccgg caccaggctc acggtt
//
LOCUS       MUSTCBYAW     132 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma P1F12C4.
ACCESSION   M34211
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 132)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   132     T-cell receptor beta-chain (AA at 1)
    recomb       94       95     V-region end/J-region start
BASE COUNT       35 a     39 c     28 g     30 t
ORIGIN      
        1 ccaagccaag agaacttctc cctcattctg gagttggcta ccccctctca gacatcagtg
       61 tacttctgtg ccagcggtgc cagacaggca aacacagaag tcttctttgg taaaggaacc
      121 agactcacag tt
//
LOCUS       MUSTCBYAX     303 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma P1D3A6.
ACCESSION   M34213
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 303)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   303     T-cell receptor beta-chain (AA at 1)
    recomb      259      260     V-region end/J-region start
BASE COUNT       80 a     72 c     79 g     72 t
ORIGIN      
        1 aaggtgacag taacaggagg aaacgtgaca ttgagctgtc gccagactaa tagccacaac
       61 tacatgtact ggtatcggca ggacactggg catgggctga ggctgatcca ttactcatat
      121 ggtgctggca accttcaaat aggagatgtc cctgatgggt acaaggccac cagaacaacg
      181 caagaagact tcttcctcct gctggaattg gcttctccct ctcagacatc tttgtacttc
      241 tgtgccagca gtgcaggagc tggaaatacg ctctattttg gagaaggaag ccggctcatt
      301 gtt
//
LOCUS       MUSTCBYAY     159 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma 1E1O.
ACCESSION   M34215
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 159)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   159     T-cell receptor beta-chain (AA at 1)
    recomb      113      114     V-region end/J-region start
BASE COUNT       35 a     40 c     39 g     45 t
ORIGIN      
        1 gatgtccctg atgggtacaa ggccaccaga acaacgcaag aagacttctt cctcctgctg
       61 gaattggctt ctccctctca gacatctttg tacttctgtg ccagcagtgt gggttctgga
      121 aatacgctct attttggaga aggaagccgg ctcattgtt
//
LOCUS       MUSTCBYAZ     321 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma 7/6AH1.
ACCESSION   M34217
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   321     T-cell receptor beta-chain (AA at 1)
    recomb      275      276     V-region end/J-region start
BASE COUNT       86 a     78 c     81 g     76 t
ORIGIN      
        1 acccaaagcc ctagaaacaa ggtgacagta acaggaggaa acgtgacatt gagctgtcgc
       61 cagactaata gccacaacta catgtactgg tatcggcagg acactgggca tgggctgagg
      121 ctgatccatt actcatatgg tgctggcaac cttcaaatag gagatgtccc tgatgggtac
      181 aaggccacca gaacaacgca agaagacttc ttcctcctgc tggaattggc ttctccctct
      241 cagacatctt tgtacttctg tgccagcagt gtgggttctg gaaatacgct ctattttgga
      301 gaaggaagcc ggctcattgt t
//
LOCUS       MUSTCBYBA     339 bp ss-mRNA            ROD       03-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma 2B11.
ACCESSION   M34219
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   339     T-cell receptor beta-chain (AA at 1)
    recomb      294      295     V-region end/J-region start
BASE COUNT       90 a     88 c     88 g     73 t
ORIGIN      
        1 gaggctgcag tcacccaaag ccctagaaac aaggtgacag taacaggagg aaacgtgaca
       61 ttgagctgtc gccagactaa tagccacaac tacatgtact ggtatcggca ggacactggg
      121 catgggctga ggctgatcca ttactcatat ggtgctggca accttcaaat aggagatgtc
      181 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg
      241 gcttctccct ctcagacatc tttgtacttc tgtgccagca ggagacaggg gcctagtcaa
      301 aacaccttgt actttggtgc gggcacccga ctatcggtg
//
LOCUS       CHKATHA       188 bp ss-mRNA            VRT       03-JUL-1990
DEFINITION  Chicken avian thymic hormone mRNA, partial cds.
ACCESSION   M34330
KEYWORDS    avian thymic hormone; parvalbumin.
SOURCE      Chicken thymus, cDNA to mRNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 188)
  AUTHORS   Palmisano,W.A. and Henzl,M.T.
  TITLE     Partial nucleotide sequence of the parvalbumin from chicken thymus
            designated "avian thymic hormone"
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 1286-1293 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   188     avian thymic hormone (AA at 1)
BASE COUNT       53 a     43 c     50 g     42 t
ORIGIN      
        1 ccggatcaga tcaagaaggt ttttggaatc cttgatcagg acaagagcgg cttcattgaa
       61 gaagaagagc ttcagctgtt tctgaagaac ttctcttcga gtgccagagt cctcacctct
      121 gcggagacca aagctttcct ggctgcaggt gacaccgacg gcgacgacaa aataggcgta
      181 gaagaatt
//
LOCUS       DDISGSPA     1957 bp ds-DNA             INV       03-JUL-1990
DEFINITION  D.discoideum spore germination-specific protein (270-11) gene,
            complete cds.
ACCESSION   M33862
KEYWORDS    spore germination-specific protein.
SOURCE      D.discoideum (strain AX-3) germinating spore, cDNA to mRNA, and
            DNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 1957)
  AUTHORS   Giorda,R., Ohmachi,T., Shaw,D.R. and Ennis,H.L.
  TITLE     a shared internal theronine-glutamic acid-threonine-proline repeat
            defines a family of dictyostelium discoideum spore germination-
            specific proteins
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.L.Ennis 17-APR-1990.
FEATURES       from  to/span     description
    pept        171      228     spore germination-specific protein, exon 1
                316     1856     spore germination-specific protein, exon 2
    IVS         229      315     spore germination-specific protein intron A
BASE COUNT      724 a    359 c    258 g    616 t
ORIGIN      
        1 aaataatttt attattttct tttgtaaaag taattaaata aaaaaaaaaa taaaaaataa
       61 ataaaattaa ataaagtcaa ttaaaaaaaa aaaaataata taaatatata taaaataaaa
      121 aaaaaaaaac aaaacaataa tagtttatga tataaatttt taataataat atgaaaaata
      181 tatatagttt attcttatta tttgcattaa taagtgcaac atttgcaagt aagttgaaaa
      241 aaaaaaaaaa aaaattatat tgtaaatttt aaataaaaaa caatatacta attattaatt
      301 ttaaaattaa attagataat gcatttattg tacattggaa ttcagattca atttcaaaaa
      361 aattaacggg acaaattggt gatacaatct ctttttatac aagtgatgga aattctcatg
      421 atgtaaaaag ttcagatggt tctgtttcgt caagtgtttt ctctggtagt cttacaaatc
      481 ctggaatttt caaggtaaca cttactaaag aaggtaatat tgaatttacc agttcatatg
      541 atgaaggtct ttctgcaaca atagtagttt cttctggtgg tcaaattccg attacaacaa
      601 cttcatcaac tacaactgat ggtagttcaa ccccttccac tccaacttca acaacttcag
      661 cctcaactac tacaagtggt ggtagtgcta caacaacaac aggagaacca attactgatg
      721 gttctaatgg aggcgccagt tccacaactg gcaatagcgg gacgacaggt tctgctacca
      781 ctactacttc ttcttcttcc gataattccg atggcagtgt aggtacttca actacaactt
      841 caccagctat cacaacttca agtgggtcaa taatcgatcc aacttcacca cctacaactg
      901 attcatcctc taatagtggt ggttatggtt catcatcttc aattgaaaat ggcgtagaat
      961 gtttattaac aatcactcaa gatgcatttg attcttggac atatgataat attatttaca
     1021 ccgtttatca agtaaattta acaaatattg gtacactttc agttgagtct gttattctca
     1081 ctccaaatga taactcttta atttaccata cttgggaatt ggtttatgat ggaacttcac
     1141 tcactcttcc aacctataga aaagctggtc caatcaatcc agaggaaacc attatctttg
     1201 gttatatctc tagaaatagt actgatgtta catttgcttt aagtccaaca tgttcagatt
     1261 catcaagtcc aactccaact cctactgaga ctccaactga gactccaact gagactccaa
     1321 ctgagactcc aactgagact ccaactgaaa ctccaactga aactccaact gaaactgaaa
     1381 ctccaacacc aacaccatca agctcatcta gtgatgtaga tagtggttca tcatctgaaa
     1441 ttgaaacccc aacaccaact gaaactgata ccccaacccc aacaccatca agttcttcaa
     1501 gtgaaggaag tggatcatca tcagaaactc aaccaccaat tactccacca ccaaccactg
     1561 gtacttcttg tttagcccaa gtccaacaaa aagttatcaa ctcatggatt aatggtgaag
     1621 ttgatcatta tatacaagtt gaggctacta ttgttaacca aggttcaact ccaatttcat
     1681 cttttaattt ttattctgat gctgaacaaa tttggtcagt tgaaaaaaca ggaaccaata
     1741 cctataaatt accaagttgg ttctcaacaa ttccagttgg tgggtcccat acctttggtt
     1801 atattgttaa atctgctgaa ttatctgacc tcgaaggagt tcaatataca tgttgatttt
     1861 aaaactctct ttttgtaata ataaaaaaaa aaaaaaaatt ttttggaaat aaatttaatt
     1921 ttcaaaaact agttttgatt tcactttatt taataat
//
LOCUS       DDISGSPB     3655 bp ds-DNA             INV       03-JUL-1990
DEFINITION  D.discoideum pore germination-specific protein gene, complete cds.
ACCESSION   M33861
KEYWORDS    endo-(1,4)-beta-D-glucanase; spore germination-specific protein.
SOURCE      D.discoideum (strain AX-3) germinating spore, cDNA to mRNA, and
            DNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 3655)
  AUTHORS   Giorda,R., Ohmachi,T., Shaw,D.R. and Ennis,H.L.
  TITLE     A shared internal theronine-glutamic acid-threonine-proline repeat
            defines a family of Dictyostelium discoideum spore germination-
            specific proteins
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.L.Ennis, 17-APR-1990.
FEATURES       from  to/span     description
    pept       1346     1412     spore germination-specific protein, exon 1
               1505     3555     spore germination-specific protein, exon 2
    IVS        1413     1504     spore germination-specific protein intron
BASE COUNT     1346 a    482 c    459 g   1368 t
ORIGIN      
        1 tttttttttt ttaatatttt ttattttatt ttttttttta attattatta attattaatc
       61 tttattataa acaaaatgca tatgtgttaa aattattata accaaaaatt aattaattta
      121 aaaaactaag aactatagtt ctgagatttt caatagtttt tttcaaataa tatgatttct
      181 ttttcaaggg tcattaaaat tatattatta gaactattta aaaaaaattc aaaagttaaa
      241 tatttaactt ttgcattttt aaaaccatca attataataa ttaattattt tattattttt
      301 tttttttttt tttttttttt aattattttt gttttttttt tttttttttt tttttttttt
      361 ttttattaaa aaaactatga atactttaaa ttatagtttt tcattttttt attaactgat
      421 cataatttaa tttaatttaa tttaatttat ttttttgtat ttaatactcg aaaaccacat
      481 acccatgatt aattaaaaaa aataaaaaaa aataaaaaaa aaagaaaaag tactttttca
      541 aataaaaaat gtttataaaa aaaaattttt ttttgaggcc aagttaatat ttttgggtag
      601 ttaaaatact aagatttgtt ccaatttgga tttttaatgg tttttatttt taaaaataat
      661 aatttaacat ttttctaatc aattttcaaa tttttttttt tataactgat ttcttttttt
      721 tttattttaa ttttttttta attttttttt atttaaaaaa tatttcaagt tgtacatttc
      781 cgttagaatt tcatttggaa gatattagat tttaatttaa aaacaatttt cctaaaaaat
      841 aaaataaaaa atgcgaaatt taattttttt tttttattaa taattatttt gaattaaatt
      901 tttttttttt tttttttttt ttcccagatt tccaatctta taaaaaggaa ttgtttttta
      961 tttttttttt tttcattttc aaaaaactaa tttattagat ctttaaaaaa aaaaaaaaaa
     1021 ataataataa taataaaaat aataatatta tctattatcc aaatttgttt ttgcaattaa
     1081 tttcgttatt ttttttttta aaaaactcac cacatactta cacaccaaaa aataacaaaa
     1141 ataataattc tattattata atcaatttat tgtagtataa gtttaacttt taaagttcta
     1201 ttaaaaaaaa aaaaaaaaaa aaaaaaagaa aaaaaaaaat atataaaata ataaaacttt
     1261 tgtttattat ttttatgtac tataaatttc aaattcctat atctaaattt ttaatatttc
     1321 taaattttta taaattaaaa ccaatatgaa aatattgaaa aattgtatat tattaataat
     1381 atttgggtta ttatcaactc aattaattaa tggtaaagta taaaaaaaaa aaaaaaaaaa
     1441 aaatattata tttcttaaac aaaaaaaaaa acaaaatatt aattcttaat ttttttttta
     1501 ttagcggata ccgattattg ttcattactt gaaaatgcat taatgtttta taaaatgaat
     1561 agagctggtc gtttaccaga taacgatata ccatggagag gtaattcagc attgaatgat
     1621 gcaagtccaa attcagctaa agatgccaat ggtgatggta atttaagtgg tggttatttt
     1681 gatgctggtg atggtgttaa atttggttta ccaatggctt attctatgac tatgttgggt
     1741 tggtcattca ttgaatatga atccaatatt gctcaatgtg gtttgacaag tttatacctc
     1801 gatacaatta aatatggtac cgactggctt attgcagcac atactgccga taatgaattt
     1861 gcaggccaag ttggtgatgg taatgttgat cattcttggt ggggtcctcc agaagatatg
     1921 acaatggctc gtccaactta tatgttaaca accgaagcac caggtactga aattgcaatg
     1981 gaagcagcat cagcattagc tgcagcttca atagcattta aatcttcaaa cccaacatac
     2041 gctgcaactt gcttagcaca tgctaaaact cttcataatt tcgggtacac ttatcgtggt
     2101 gtttattcag attccattac gaatgctcaa gctttttata attcatggtc tggctataag
     2161 gatgatttag tttggggtag catttggtta tataaagcaa ctcaagattc agattattta
     2221 acaaaagccg ttgcagatta tgcatcaggt ggtgttggtg gaatggcaca aggtaattct
     2281 cacgattggg ataataaagc accaggttgt tgtttattat tatctaaatt agttccaacc
     2341 acaagtactt ataaaactga tttcgaaggt tggttaaatt attggttacc aggtggaggt
     2401 gtcacttata ctccaggtgg tttagcatgg atcagacaat ggggtccagc tcgttatgct
     2461 gccactgccg ctttccttgg ttctttagct ggtactgaaa aaggcacaga tttcactcaa
     2521 aaacaagttg actatttaat tggtaataat ccaaatcaac aatcatttgt agttggtatg
     2581 ggtccaaatt atccaattaa tccacatcat cgtgctgccc atcattctac aactaatgat
     2641 ataaataatc cagttaataa tttatacctc ttaaaaggtg ctttagttgg tggaccaggt
     2701 tcaaatgatg aatatactga tgatagaact gattatattt caaatgaagt tgcaactgat
     2761 tataatgctg gtttcgttgg tgcattagct tctcttgtaa atccatcttc aacttctgtt
     2821 ccaaccacaa ctccaacagt aactgaaacc ccaacagaga ctccaactga gactccaact
     2881 gagactccaa ctgagactcc aacagagact ccaacagaaa ctccaacaga gactccaaca
     2941 gaaactccaa cagagactcc aacagaaact ccaacagaaa ctccaacaga aactccaaca
     3001 gaaactccaa cagaaactcc aacagaaact ccaaccgaga ctccaactga aactgttact
     3061 ccaaccccaa cagtaacacc aactgaaact ccatcaagtg gagaatcttt atcaatctat
     3121 aaaagtggat taaaaaatga tttccaagat tggtcatggg gtgagcattc attaactgat
     3181 acaacaaatg ttgaatctgg agaaaccaat tcaatttcat ttacaccaaa agcatatggt
     3241 gcagtatttt taggatgttt cgaatgtatt gatactgata catacaataa tattgaattt
     3301 gatattaatg gtggtagcag tggtgctcaa ttattaagaa taactgttgt taaaaatagt
     3361 aaatctgttg gttccaaatt aattaccgat cttaatggtg gaactccaat cgaagcaaat
     3421 tcatggacta aaattaaagc atcctttatt gatgacttta aagtatctgg taaagtcgat
     3481 ggtatttgga ttcaagatat caaaggtgat acccaatcaa ctgtatacat aagtaatatt
     3541 attgcaactg cttaaaaaaa tattaatatt aaatattaaa aaaagtataa ataaaataat
     3601 cttaaattaa aaaaaataag tgttttcgaa attttctata gatatatatc taaaa
//
LOCUS       ECOCYSXE     1396 bp ds-DNA             BCT       03-JUL-1990
DEFINITION  E.coli cysteine regulon 33 Kd (cysE) and 16 Kd protein (cysX)
            genes, complete cds.
ACCESSION   M34333
KEYWORDS    cysE gene; cysX gene.
SOURCE      E.coli (strain K-12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1396)
  AUTHORS   Tei,H., Murata,K. and Kimura,A.
  TITLE     Structure and expression of cysX, the second gene in the
            Escherichia coli K-12 cysE locus
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 948-955 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        221     1042     33 Kd protein (cysE)
    pept        919      527 (c) 16 Kd protein (cysX)
BASE COUNT      325 a    362 c    393 g    316 t
ORIGIN      
        1 cgcgaactgg cgcatcgctt cggcgttgaa atgccaataa ccgaggaaat ttatcaagta
       61 ttatattgcg gaaaaaacgc gcgcgaggca gcattgactt tactaggtcg tgcacgcaag
      121 gacgagcgca gcagccacta accccaggga acctttgtta ccgctatgac ccggcccgcg
      181 cagaacgggc cggtcattat ctcatcgtgt ggagtaagca atgtcgtgtg aagaactgga
      241 aattgtctgg aacaatatta aagccgaagc cagaacgctg gcggactgtg agccaatgct
      301 ggccagtttt taccacgcga cgctactcaa gcacgaaaac cttggcagtg cactgagcta
      361 catgctggcg aacaagctgt catcgccaat tatgcctgct attgctatcc gtgaagtggt
      421 ggaagaagcc tacgccgctg acccggaaat gatcgcctct gcggcctgtg atattcaggc
      481 ggtgcgtacc cgcgacccgg cagtcgataa atactcaacc ccgttgttat acctgaaggg
      541 ttttcatgcc ttgcaggcct atcgcatcgg tcactggttg tggaatcagg ggcgtcgcgc
      601 actggcaatc tttctgcaaa accaggtttc tgtgacgttc caggtcgata ttcacccggc
      661 agcaaaaatt ggtcgcggta tcatgcttga ccacgcgaca ggcatcgtcg ttggtgaaac
      721 ggcggtgatt gaaaacgacg tatcgattct gcaatctgtg acgcttggcg gtacgggtaa
      781 atctggtggt gaccgtcacc cgaaaattcg tgaaggtgtg atgattggcg cgggcgcgaa
      841 aatcctcggc aatattgaag ttgggcgcgg cgcgaagatt ggcgcaggtt ccgtggtgct
      901 gcaaccggtg ccgccgcata ccaccgccgc tggcgttccg gctcgtattg tcggtaaacc
      961 agacagcgat aagccatcaa tggatatgga ccagcatttc aacggtatta accatacatt
     1021 tgagtatggg gatgggatct aatgtcctgt gatcgtgccg gatgcgatgt aatcatctat
     1081 ccggcctaca gtaactaatc tctcaatacc gctcccggat accccaactg tcgccaggct
     1141 tcatacacca ctaccgacac cgcattggac agattcatgc tgcggctgtc cggcaccatc
     1201 ggaatgcgaa ttttttgttc agcgggcagg gcatcaagaa tgctcgctgg caggccgcgt
     1261 gtttccgggc cgaacatcag ataatcgcca tcctgatagc ttacggcgct gtgagcaggt
     1321 gtacctttcg tggtgagggc gaacaggcgc tgggattttc tgcttcgagg aacgcgcgat
     1381 agtcatgatg acgcgt
//
LOCUS       ECOTRPP        74 bp ds-DNA             SYN       03-JUL-1990
DEFINITION  Expression plasmid pDS20 derivative.
ACCESSION   M34334
KEYWORDS    .
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 74)
  AUTHORS   Latta,M., Philit,M., Maury,I., Soubrier,F., Denefle,P. and
            Mayaux,J.-F.
  TITLE     Tryptophan promoter derivatives on multicopy plasmids: A
            comparative analysis of expression potentials in Escherichia coli
  JOURNAL   DNA 9, 129-137 (1990)
  STANDARD  simple staff_review
BASE COUNT       18 a     18 c     15 g     23 t
ORIGIN      
        1 ctcaaggcgc actcccgttc tggataatgt tttttgcgcc gacatcataa cggttctggc
       61 aaatattctg aaat
//
LOCUS       HUMCYTOK     1724 bp ss-mRNA            PRI       03-JUL-1990
DEFINITION  Human cytokeratin 8 mRNA, complete cds.
ACCESSION   M34225
KEYWORDS    cytokeratin 8.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1724)
  AUTHORS   Yamamoto,R., Kao,L.-C., McKnight,C.E. and Strauss,J.F.III.
  TITLE     Cloning and sequence of cDNA for human placental cytokeratin 8.
            Regulation of the mRNA in trophoblastic cells by cAMP
  JOURNAL   Mol. Endocrinol. 4, 370-374 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         35     1486     cytokeratin 8
BASE COUNT      401 a    498 c    524 g    301 t
ORIGIN      
        1 ttcggcaatt cctacctcca ctcctgcctc caccatgtcc atcagggtga cccagaagtc
       61 ctacaaggtg tccacctctg gcccccgggc cttcagcagc cgctcctaca cgagtgggcc
      121 cggttcccgc atcagctcct cgagcttctc ccgagtgggc agcagcaact ttcgcggtgg
      181 cctgggcggc ggctatggtg gggccagcgg catgggaggc atcaccgcag ttacggtcaa
      241 ccagagcctg ctgagcccct tgtccctgga ggtggacccc aacatccagg ccgtgcgcac
      301 ccaggagaag gagcagatca agaccctgaa caacaagttt gcctccttca tagacaaggt
      361 acggttcctg gagcagcaga acaagatgct ggagaccaag tggagcctcc tgcagcagca
      421 gaagacggct cgaagcaaca tggacaacat gttcgagagc tacatcaaca accttaggcg
      481 gcagctggag actctgggcc aggagaagct gaagctggag gcggagcttg gcaacatgca
      541 ggggctggtg gaggacttca agaacaagta tgaggatgag atcaataagc gtacagagat
      601 ggagaacgaa tttgtcctca tcaagaagga tgtggatgaa gcatacatga acaaggtaga
      661 gctggagtct cgcctggaag ggctgaccga cgagatcaac ttcctcaggc agctgtatga
      721 agaggagatc cgggagctgc agtcccagat ctcggacaca tctgtggtgc tgtccatgga
      781 caacagccgc tccctggaca tggagagcat cattgctgag gtcaaggcac agtacgagga
      841 tattgccaac cgcagccggg ctgaggctga gagcatgtac cagatcaagt atgaggagct
      901 gcagagcctg gctgggaagc acggggatga cctgcggcgc acaaagactg agatctcaga
      961 gatgaaccgg aacatcagcc ggctccaggc tgagattgag ggcctcaaag gccagagggc
     1021 ttccctggag gccgccattg cagatgccga gcagcgtgga gagctggcca ttaaggatgc
     1081 caacgccaag ttgtccgagc tggaggccgc cctgcagcgg gccaagcagg acatggcccg
     1141 gcagctgcgt gagtaccagg agctgatgaa cgtcaagctg gccctggaca tcgacatcgc
     1201 cacctacagg aagctgctgg agggcgagga gagcccgctg gagtctggga tgcagaacat
     1261 gagtattcat acgaagacca ccggcggcta tgcgggtggt ttgagctcgg cctatgggga
     1321 cctcacagac cccggcctca gctacagcct gggctccagc tttggctctg gcgcgggctc
     1381 cagctccttc agccgcacca gctcctccag ggccgtggtt gtgaagaaga tcgagacacg
     1441 tgatgggaag ctggtgtctg agtcctctga cgtcctgccc aagtgaacag ctgcggcagc
     1501 ccctcccagc ctacccctcc tgcgctgccc cagagcctgg gaaggaggcc gctatgcagg
     1561 gtagcactgg gaacaggaga cccacctgag gctcagccct agccctcagc ccacctgggg
     1621 agtttactac ctggggaccc cccttgccca tgcctccagc tacaaaacaa ttcaattgct
     1681 tttttttttt ttggtcccaa aataaaacct cagctagctc tgcc
//

GenBank-Updates@genbank.bio.net (07/04/90)

LOCUS       TRBMVAT5A    1544 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  T.brucei variant surface glycoprotein (MVAT5) mRNA, complete cds.
ACCESSION   M33825
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1544)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Donelson, 23-APR-1990.
BASE COUNT      497 a    396 c    366 g    285 t
ORIGIN      
        1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt
       61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt
      121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa
      181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg
      241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt
      301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa
      361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac
      421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg
      481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc
      541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg
      601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca
      661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca
      721 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca
      781 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat
      841 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt
      901 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac
      961 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa
     1021 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac
     1081 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc
     1141 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc
     1201 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat
     1261 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca
     1321 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc
     1381 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta
     1441 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa
     1501 gaatttttgc tactttaaaa acttctgata tattttaaca ccta
//
LOCUS       TRBWRATATA   1544 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  T.brucei variant surface glycoprotein (WRATat A) mRNA, complete
            cds.
ACCESSION   M33823
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1544)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Donelson, 23-APR-1990.
FEATURES       from  to/span     description
    pept         42     1457     variant surface glycoprotein (WRATat A)
    mRNA    <     1     1544     WRATat A mRNA
BASE COUNT      545 a    387 c    345 g    267 t
ORIGIN      
        1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc
       61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt
      121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac
      181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg
      241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg
      301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc
      361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa
      421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct
      481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca
      541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg
      601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac
      661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca
      721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa
      781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac
      841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg
      901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt
      961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg
     1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca
     1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg
     1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc
     1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag
     1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt
     1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga
     1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg
     1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta
     1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg
//
LOCUS       TRBWRATATB   1582 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  T.brucei variant surface glycoprotein (WRATat B) mRNA, complete
            cds.
ACCESSION   M33824
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1582)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Donelson, 23-APR-1990.
FEATURES       from  to/span     description
    mRNA    <     1     1582     WRATat B mRNA
BASE COUNT      532 a    384 c    376 g    290 t
ORIGIN      
        1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact
       61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga
      121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa
      181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac
      241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga
      301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat
      361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga
      421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc
      481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca
      541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc
      601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg
      661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag
      721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa
      781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga
      841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact
      901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca
      961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca
     1021 cgacctcata acagctaaat ccggtacgac ataagcggcg caatcactcg gctaaagcaa
     1081 ctacaaatcg ccaaagcaaa gctaaaagca agacggctac tgctgcaaaa cagggaacgg
     1141 cagcaaacgc gacttatggc gctagcagac aagatgcaag aattgtacca agaggcctta
     1201 catgacgagg ttcaactcag gaaggaagcg cagaacaaac ctcaagaaac accagattct
     1261 gacaagcaaa aagcatgcga gaaatatcac aacaagtcaa aggaatgcaa agaaaatggt
     1321 tgccaatgga gtggaactga agaaaccact aggaaagtgc gaagctaaac ccaaagcagg
     1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac
     1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt
     1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact
     1561 tctggtatat tttaacacct tt
//
LOCUS       HUMTNC1      1085 bp ds-DNA             PRI       04-JUL-1990
DEFINITION  Human fast skeletal muscle troponin C gene, exon 1.
ACCESSION   M33771 J05502
KEYWORDS    troponin C.
SEGMENT     1 of 2
SOURCE      Human HeLa cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1085)
  AUTHORS   Gahlmann,R. and Kedes,L.
  TITLE     Cloning, structural analysis and expression of the human
            fast-twitch skeletal muscle troponin C gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by R.Gahlmann, 20-APR-1990.
FEATURES       from  to/span     description
    pept       1033  +  1035     troponin C, exon 1
    pre-msg     968  >  1085     troponin C mRNA and introns (alt.)
    pre-msg     969  >  1085     troponin C mRNA and introns (alt.)
    IVS        1036  >  1085     troponin C intron A
BASE COUNT      266 a    276 c    306 g    237 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct
       61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc
      121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa
      181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa
      241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc
      301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac
      361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc
      421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca
      481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga
      541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt
      601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc
      661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga
      721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc
      781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt
      841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt
      901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag
      961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg
     1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg
     1081 tggac
//
LOCUS       HUMTNC2      1773 bp ds-DNA             PRI       04-JUL-1990
DEFINITION  Human fast skeletal muscle troponin C gene, exons 2-6.
ACCESSION   M33772 J05502
KEYWORDS    troponin C.
SEGMENT     2 of 2
SOURCE      Human HeLa cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1773)
  AUTHORS   Gahlmann,R. and Kedes,L.
  TITLE     Cloning, structural analysis and expression of the human
            fast-twitch skeletal muscle troponin C gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by R.Gahlmann, 20-APR-1990.
FEATURES       from  to/span     description
    pept    +    51      102     troponin C, exon 2
                247      390     troponin C, exon 3
                479      593     troponin C, exon 4
                758      894     troponin C, exon 5
               1506     1537     troponin C, exon 6
    pre-msg <     1     1668     troponin C mRNA and introns
    IVS     <     1       50     troponin C intron A
    IVS         103      246     troponin C intron B
    IVS         391      478     troponin C intron C
    IVS         594      757     troponin C intron D
    IVS         895     1505     troponin C intron E
BASE COUNT      358 a    455 c    628 g    331 t      1 others
ORIGIN      About 2.5 kb after segment 1.
        1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc
       61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag
      121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg
      181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc
      241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg
      301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg
      361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat
      421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc
      481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac
      541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct
      601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag
      661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc
      721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc
      781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg
      841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg
      901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct
      961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc
     1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta
     1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat
     1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac
     1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc
     1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa
     1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa
     1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg
     1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc
     1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca
     1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg
     1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc
     1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc
     1741 ggctggcacc acccggagca cccttgggga tcc
//
LOCUS       ARRSEG11      631 bp ds-RNA             VRL       04-JUL-1990
DEFINITION  Adult diarrhea rotavirus segment 11 non-structural protein mRNA,
            complete cds.
ACCESSION   M34380
KEYWORDS    non-structural protein.
SOURCE      Adult diarrhea rotavirus, cDNA to viral RNA, clone AD63, and viral
            RNA segment 11.
  ORGANISM  Adult diarrhea rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 631)
  AUTHORS   Chen,G.-M., Hung,T. and Mackow,E.R.
  TITLE     cDNA cloning of each genomic segment of the group B rotavirus ADRV:
            Molecular characterization of the 11th RNA segment
  JOURNAL   Virology 175, 605-609 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         59      571     non-structural protein
BASE COUNT      246 a    104 c    122 g    159 t
ORIGIN      
        1 ggtatataaa agtcagtaga cggctggaaa cgttgcacgt actactcact acccagagat
       61 ggcagaagcg tctgagttca attttaccat aaagagaaag cagagaacaa tgagtgatcg
      121 aagaacgaga gaggatacaa aacaaaaaaa gattgaagag aaatctgata ttgatttggt
      181 tgattcagcc tctgtctatt cacaagagtc ttctagaagt aactatagtg atgcttacga
      241 taaattaaaa cgtgaaccaa tggttgaaga atcaaatgac gcaaagtata ggaattttga
      301 attttctgaa gatgaggaag ttcatagacc atcaagtaaa gcatcagaca aatcatacag
      361 agaaatgaaa cgtaagcatg atgacattaa tacgtcagat tctattttag agaaactgtc
      421 agaattaaat ctcgagattg aaaagataaa acaaatgaac caaccaatta caattgacgc
      481 agcattcaac atgattcttc gaaatgtcga caatctaaca ataaggcaga agcaggcctt
      541 aataaatgca attgttaatt ctatgaacta gactgtggtc tccacctgtc atctactggg
      601 gtcaacctct tgagttatat ttaaaaaacc c
//
LOCUS       BOVFESUP     1112 bp ss-mRNA            MAM       04-JUL-1990
DEFINITION  Bovine Rieske iron-sulfur protein mRNA, complete cds.
ACCESSION   M34336
KEYWORDS    Rieske iron-sulfur protein.
SOURCE      Bovine heart, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1112)
  AUTHORS   Usui,S., Yu,L. and Yu,C.-A.
  TITLE     Cloning and sequencing of a cDNA encoding the Rieske iron-sulfur
            protein of bovine heart mitochondrial ubiquinol-cytochrome c
            reductase
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 575-579 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         16      825     Rieske iron-sulfur protein precursor
    sigp         16      234     Rieske iron-sulfur protein signal peptide
    matp        235      822     Rieske iron-sulfur protein
BASE COUNT      269 a    249 c    281 g    313 t
ORIGIN      
        1 ggagctgctg tcgccatgtt gtcggttgcc gcccgctcgc gccattcgcg cccgtcctat
       61 cggccacgtc ccgcggggtg gcgcgcgctg cggccctggt acagccggtc gtccaaggag
      121 tcgcctgtgc tggacttgaa gcgctccgtc ctgtgccggg agtcgctcag aggccaggcc
      181 gcggcggctt tggtcgcttc cgtgagcctc aatgtccctg cttctgtccg ttattcccat
      241 acagacatca aagtgcctga cttctctgac tatcgtcgcc ctgaagtgtt agatagtaca
      301 aagtcttcaa aagagagcag tgaggctcgg aaaggtttct cctatttggt gactgcaaca
      361 actactgtgg gtgttgcata tgctgccaag aatgtcgtct ctcagtttgt ttccagcatg
      421 agtgcttctg ccgatgtgtt ggccatgtcg aaaattgaaa tcaagttatc cgatattcca
      481 gaggggaaga acatggcttt caaatggaga ggcaaacccc tgttcgtgcg ccatcgaacc
      541 aagaaggaaa ttgaccagga agctgcagtt gaagtgtccc agttgaggga cccacagcat
      601 gacttagaac gagtaaagaa acctgagtgg gttatcttga taggtgtttg cactcatctt
      661 ggttgtgtac ccattgcaaa tgcaggagat tttggtggtt attactgccc ttgccatggg
      721 tcacactatg atgcgtctgg cagaatcaga aaggggcctg cacctctcaa ccttgaagtt
      781 ccctcatatg agttcaccag cgatgatatg gtgattgttg gttagagaga cctggactcg
      841 agttacaggc ttctttcgtc ttcatgtcac ctcagaagag ttacttgaag aacaagcctt
      901 ctgtgctttg aattagttga tttgaaatat ttaagaattg ctaataatgt gtttgcaaac
      961 atgtaaattg aatttaatgt tgaatacttt caagcattca cctaaaaaag ccccccatta
     1021 aacattgtta tgctcaaagg gtgcagtgtc tttgatagtt aattctaatt aaaaattgta
     1081 gatgagtgta taaaatattt gtgaaatctg ta
//
LOCUS       HUMARX       3231 bp ss-mRNA            PRI       04-JUL-1990
DEFINITION  Human androgen-receptor mRNA, complete cds.
ACCESSION   M34233
KEYWORDS    androgen-receptor.
SOURCE      Human testis and prostate cancer cell line LNCaP, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3231)
  AUTHORS   Govindan,M.V.
  TITLE     Specific region in hormone binding domain is essential for hormone
            binding and trans-activation by human androgen receptor
  JOURNAL   Mol. Endocrinol. 4, 417-427 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        109     2829     androgen-receptor /hgml_locus_uid="LL0058U"
                                 /nomgen="AR" /map="Xq12"
BASE COUNT      713 a    941 c    883 g    694 t
ORIGIN      
        1 agctagctgc agcgactacc gcatcatcac agcctgttga actcttctga gcaagagaag
       61 gggaggcggg gtaagggaag taggtggaag attcagccaa gctcaaggat ggaagtgcag
      121 ttagggctgg gaagggtcta ccctcggccg ccgtccaaga cctaccgagg agctttccag
      181 aatctgttcc agagcgtccg cgaagtgatc cagaacccgg gccccaggca cccagaggcc
      241 gcgagcgcag cacctcccgg cgccagtttg ctgctgctgc agcagcagca gcagcagcag
      301 cagcagcagc agcagcagca gcagcaagag actagcccca ggcagcagca gcagcagcag
      361 ggtgaggatg gttctcccca agcccatcgt agaggcccca caggctacct ggtcctggat
      421 gaggaacagc aaccttcaca gccgcagtcg gccctggagt gccaccccga gagaggttgc
      481 gtcccagagc ctggagccgc cgtggccgcc agcaaggggc tgccgcagca gctgccagca
      541 cctccggacg aggatgactc agctgcccca tccacgttgt ccctgctggc ccccactttc
      601 cccggcttaa gcagctgctc cgctgacctt aaagacatcc tgagcgaggc cagcaccatg
      661 caactccttc agcaacagca gcaggaagca gtatccgaag gcagcagcag cgggagagcg
      721 agggaggcct cgggggctcc cacttcctcc aaggacaatt acttaggggg cacttcgacc
      781 atttctgaca acgccaagga gttgtgtaag gcagtgtcgg tgtccatggg cctgggtgtg
      841 gaggcgttgg agcatctgag tccaggggaa cagcttcggg gggattgcat gtacgcccca
      901 cttttgggag ttccacccgc tgtgcgtccc actccttgtg ccccattggc cgaatgcaaa
      961 ggttctctgc tagacgacag cgcaggcaag agcactgaag atactgctga gtattcccct
     1021 ttcaagggag gttacaccaa agggctagaa ggcgagagcc taggctgctc tggcagcgct
     1081 gcagcaggga gctccgggac acttgaactg ccgtctaccc tgtctctcta caagtccgga
     1141 gcactggacg aggcagctgc gtaccagagt cgcgactact acaactttcc actggctctg
     1201 gccggaccgc cgccccctcc gccgcctccc catccccacg ctcgcatcaa gctggagaac
     1261 ccgctggact acggcagcgc ctgggcggct gcggcggcgc agtgccgcta tggggacctg
     1321 gcgagcctgc atggcgcggg tgcagcggga cccggttctg ggtcaccctc agccgccgct
     1381 tcctcatcct ggcacactct cttcacagcc gaagaaggcc agttgtatgg accgtgtggt
     1441 ggtggtgggg gtggtggcgg cggcggcggc ggcggcggcg gcggcgaggc gggagctgta
     1501 gccccctacg gctacactcg gccccctcag gggctggcgg gccaggaaag cgacttcacc
     1561 gcacctgatg tgtggtaccc tggcggcatg gtgagcagag tgccctatcc cagtcccact
     1621 tgtgtcaaaa gcgaaatggg cccctggatg gatagctact ccggacctta cggggacatg
     1681 cgtttggaga ctgccaggga ccatgttttg cccattgact attactttcc accccagaag
     1741 acctgcctga tctgtggaga tgaagcttct gggtgtcact atggagctct cacatgtgga
     1801 agctgcaagg tcttcttcaa aagagccgct gaagggaaac agaagtacct gtgcgccagc
     1861 agaaatgatt gcactattga taaattccga aggaaaaatt gtccatcttg tcgtcttcgg
     1921 aaatgttatg aagcagggat gactctggga gcccggaagc tgaagaaact tggtaatctg
     1981 aaactacagg aggaaggaga ggcttccagc accaccagcc ccactgagga gacaacccag
     2041 aagctgacag tgtcacacat tgaaggctat gaatgtcagc ccatctttct gaatgtcctg
     2101 gaagccattg agccaggtgt agtgtgtgct ggacacgaca acaaccagcc cgactccttt
     2161 gcagccttgc tctctagcct caatgaactg ggagagagac agcttgtaca cgtggtcaag
     2221 tgggccaagg ccttgcctgg cctccgcaac ttacacgtgg acgaccagat ggctgtcatt
     2281 cagtactcct ggatggggct catggtgttt gccatgggct ggcgatcctt caccaatgtc
     2341 aactccagga tgctctactt cgcccctgat ctggttttca atgagtaccg catgcacaag
     2401 tcccggatgt acagccagtg tgtccgaatg aggcacctct ctcaagagtt tggatggctc
     2461 caaatcaccc cccaggaatt cctgtgcatg aaagccatgc tactcttcag cattattcca
     2521 gtggatgggc tgaaaaatca aaaattcttt gatgaacttc gaatgaacta catcaaggaa
     2581 ctcgatcgta tcattgcatg caaaagaaaa aatcccacat cctgctcaag acgcttctac
     2641 cagctcacca agctcctgga ctccgtgcag cctattgcga gagagctgca tcagttcact
     2701 tttgacctgc taatcaagtc acacatggtg agcgtggact ttccggaaat gatggcagag
     2761 atcatctctg tgcaagtgcc caagatcctt tctgggaaag tcaagcccat ctatttccac
     2821 acccagtgaa gcattggaaa ccctatttcc ccaccccagc tcatgccccc tttcagatgt
     2881 cttctgcctg ttataactct gcactactcc tctgcagtgc cttggggaat ttcctctatt
     2941 gatgtacagt ctgtcatgaa catgttcctg aattctatct gctgggcttt ttttttctct
     3001 ttctctcctt tctttttctt cttccctccc tatctaaccc tcccatggca ccttcagact
     3061 ttgcttccca ttgtggctcc tatctgtgtt ttgaatggtg ttgtatgcct taaatctgtg
     3121 atgatcctca tatggcccag tgtcaagttg tgcttgttta cacgcatctc tgtgccagcc
     3181 acacaaaccg tttacttact taccgcaagg gaacttagag agctagaatt c
//
LOCUS       DBIDB58      2149 bp ds-DNA             PLN       04-JUL-1990
DEFINITION  D.biflorus lectin DB58 gene, complete cds.
ACCESSION   M34271 J05280
KEYWORDS    lectin.
SOURCE      D.biflorus 3-week old leaf and stem DNA.
  ORGANISM  Dolichos biflorus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 2149)
  AUTHORS   Harada,J.J., Spadoro-Tank,J., Maxwell,J.C., Schnell,D.J. and
            Etzler,M.E.
  TITLE     Two lectin genes differentially expressed in Dolichos biflorus
            differ primarily by a 116-base pair sequence in their 5' flanking
            regions
  JOURNAL   J. Biol. Chem. 265, 4997-5001 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        902     1729     lectin DB58
    mRNA        865  >  2149     lectin DB58 mRNA
BASE COUNT      671 a    474 c    294 g    710 t
ORIGIN      
        1 gaagagatgt tgaaatttaa tatattttta ttattatgga tgaaaattta cgtgactctc
       61 ttattaaaca atgattgcct tataattttt tttttcaata attaaaagat gcgttaaaaa
      121 gaataaggaa aatcctcttt taatatatat aattactttt tataatactt tacgaaaaat
      181 attattatta ttattattat tatattatta cagtagatta taaatagtta tatcttaaaa
      241 aaaatttgtc aatatatcac tatttaaaaa ataattatag aatattcgtt atgtgagtat
      301 tttttaaaag atattaataa aaggactcaa attcattctt tacctcatat ttattttagt
      361 ttttttttaa atttcatttt aatttattgt gaatagtata atatataatt caattattac
      421 aaatgaataa ttaaattatt aggaactgat aaagtaaagt aaaaaaagtg aagaacaata
      481 tatctacatt aaaagaacat tatgatatat gtggagtttc tgcatcactt tgggaccgaa
      541 agtggactat attggtaata tgtattgtac tacataaatt atatgattac caaagttata
      601 tgcaccgtac tatctcaatt atatgtttcc aaaaccacaa aaaatataat tttttttctt
      661 tttaactcta ttataatata ttgttttatt atttaatata aataaaacca tacaataaaa
      721 gcatataaca agagaaatca ttttcaaaca cgtgtttatc tctatagatg atgtgattta
      781 gattttgagt ttctatggtt aagaatacca aaacctagat gaggttttga actataaata
      841 tacaacacag gcttctccat atgcatcacc catcaccata gttgagcaaa acatcaaagc
      901 catggcttcc tccacagtct ctgttgtcct ctccctcttc ctgctgcttc tcacccaagc
      961 atactcagct gacatccaat ccttctcctt caaaaacttc aactcatcca gcttcatcct
     1021 ccaaggtgac gccaccgtat cctccagcaa gttacgactc acaaaagtta agggcaacgg
     1081 ccttcccacc ctgtcttctc tcggccgcgc cttctactcc tcccccatcc aaatctatga
     1141 caagtccacc ggcgccgtcg ccagctgggc cacctccttc acagccaaca ttttcgctcc
     1201 caacaagtca tcctccgccg acgggattgc cttcgctttg gtacccgtcg ggtctgagcc
     1261 caaaagcaac agtggctttc taggtgtttt cgacagcgat gtctacgaca actccgccca
     1321 gactgtggca gtggagttcg acacgttctc caacaccgac tgggacccca ccagccgcca
     1381 cattgggatc gacgtgaact ccatcaagtc tatcagaacg gcgtcgtggg gtttggccaa
     1441 cggccaaaac gccgagattc tgatcaccta caacgccgcc acctcccttt tggtggcttc
     1501 tctggtccac ccttctcgca gaacgagcta catcgtctcc gaaagagtgg acatcactaa
     1561 tgaacttccg gagtatgtga gcattgggtt ctccgccacc accgggcttt ccgaaggcta
     1621 cactgaaact cacgacgtcc tctcttggtc ttttgcttcg aagctcccag atgactccac
     1681 cactgaacct ttggacattg ctagctacct cgtccgcaac gtcctctaga ctccttcttc
     1741 ctccctgtga cagtgcttcc atgccataac gctctcttcc tcttcctctt catcttcctc
     1801 ttcctcttcc cactccttct tctttcttcc taataatctt catctgtcac agtaattaat
     1861 attatcttac cttctttcta cactctctgt tcttctttta tatgttaacc tcatctccta
     1921 atcaactctg aacataatat tttaatacaa acatcacata taatatttat tataataatt
     1981 acccgttaaa atgtctttta atacatatat taaaataaat atatgaagaa aatattagta
     2041 tataattatt tgagaaatat aattaaattt ataatgtgaa ttaattgctc tatgagtgaa
     2101 tagtccaaaa caaatatttg aaaaaggaaa aaaattgaaa atggatagg
//
LOCUS       DBISLEC      2230 bp ds-DNA             PLN       04-JUL-1990
DEFINITION  D.biflorus seed lectin (SL) gene, complete cds.
ACCESSION   M34270 J05280
KEYWORDS    lectin.
SOURCE      D.biflorus 28-day postanthesis seed DNA.
  ORGANISM  Dolichos biflorus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 2230)
  AUTHORS   Harada,J.J., Spadoro-Tank,J., Maxwell,J.C., Schnell,D.J. and
            Etzler,M.E.
  TITLE     Two lectin genes differentially expressed in Dolichos biflorus
            differ primarily by a 116-base pair sequence in their 5' flanking
            regions
  JOURNAL   J. Biol. Chem. 265, 4997-5001 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1013     1840     seed lectin
    mRNA        976  >  2230     seed lectin mRNA
BASE COUNT      677 a    498 c    347 g    708 t
ORIGIN      
        1 ttaacgcctt taagttttat gaagagatgg gtgcgaagat ggttgaagaa tacaggtttt
       61 tcagaatgac tgggaaggct cttcaggatt ctggttaaac attcactgta acatgttatt
      121 taacttattc ctattttatg atttctaata agattattaa ttatttttta taatacttta
      181 cgaaaaaaaa ttattattat tattatatta ttataataga ttataaacaa ttatatctta
      241 aaagaaatta tatcactatt taaaaaataa ttatagaata ttcgttatgt gagcattttt
      301 taaaagatat taataaaagg actcaaattc attctttacg tcatattttt cgaaatgtat
      361 tattttattt tttattaaat ttcattttaa tttattgtga atagtataat atataattat
      421 gaataattaa tattattagg aagtgataaa gtaaagtaaa gaatgaagaa taatatatct
      481 acactttaaa agaacattat gatatatgtc gagtttctgc atcactttgg gaccaaaagt
      541 gtactatatt ggtaatatgt actgtggtac ataaattata tgataaccaa agttacatgg
      601 accgtactat ctcaattata tgttaccgaa atcacaaaaa atataattat tttttctttt
      661 taactctatt ataatatatt ttttattatt taatataaaa aaaaccatag gataaaagca
      721 tataacaaga gaaaacaata tttaaacacg ttatctctat agcccaatag ggacggcaca
      781 ccccccactt caagcgcacc tctgtgtcaa cagaatcact ctaagagtat agtcgtggaa
      841 tcggtgaacc acgccagctg cttagatgcg tggcgcagag gatgtgattt agattttgag
      901 tttctatggt taagattacc aaaatctaga tgaggttttc aactataaat agacaagaga
      961 ggctcctcaa tatgcatcac ccatcaccat agttgagcaa aacatcaaag ccatggcttc
     1021 ctccacagtc tctgttgtcc tctccctctt cctgctgctt ctcacccaag caaactcagc
     1081 caacatccaa tccttctcct tcaaaaactt caactcaccc agcttcatcc tccaaggtga
     1141 cgccaccgta tcctccggca agttacaact caccaaagtt aaggaaaacg gcattcccac
     1201 cccgtcttct ctcggccgcg ccttctactc ctcccccatc caaatctatg acaagtccac
     1261 aggcgccgtc gccagctggg ccacctcctt cacagtcaag atttcagctc ccagcaaggc
     1321 atccttcgcc gatgggattg ccttcgcttt ggtacccgtc gggtctgagc ccagacgcaa
     1381 cggtggctat ctaggtgttt tcgacagcga tgtctacaac aactccgccc agactgtggc
     1441 agtggagttc gacacgttct ccaacagcgg ctgggacccc agcatgaagc acattgggat
     1501 cgacgtgaac tccatcaagt ctatcgcaac ggtgtcgtgg gatttggcca acggcgaaaa
     1561 cgccgagatt ctgatcacgt acaacgccgc cacctccctt ttggtggctt ctctggtcca
     1621 cccttctcgc agaacgagct acatcctctc cgaaagagtg gacatcacta atgaacttcc
     1681 cgagtatgtg agcgttgggt tctccgccac caccgggctt tccgaaggct acattgaaac
     1741 tcacgacgtc ctctcttggt cttttgcttc caagctccca gatgactcca ccgctgaacc
     1801 tctggacctt gctagctacc tcgtccgcaa cgtcctctag actccttctt cctccctgtg
     1861 acggtgcttc catgccataa cactctcttc ctcttcctct tcctcttccc accccttctt
     1921 ctttcttcat aataataatc ttcatctgtc acagtaataa tattatctta ccttctttct
     1981 acactctccg ttcttctttt atattttaac ctcatctcct aatcaactct taacaacata
     2041 atatttaata taaataaaat tacccgttaa aatgtctttt aatacatata ttaagataaa
     2101 tatatgaaga aaagattagt atataattat ttgagaaata tgttattatg ggttaagtaa
     2161 tgaatttatt ttaagaaaat atgtaattga gtaaacctgt tttcgtaaac aactttgtaa
     2221 aatggaattc
//
LOCUS       STMBLAC1      957 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  S.badius beta-lactamase gene, complete cds.
ACCESSION   M34178
KEYWORDS    beta-lactamase.
SOURCE      S.badius DNA.
  ORGANISM  Streptomyces badius
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 957)
  AUTHORS   Forsman,M., Haeggstroem,B., Lindgren,L. and Jaurin,B.
  TITLE     Molecular analysis of beta-lactamases from four species of
            Streptomyces: Comparison of amino acid sequences with those of
            other beta-lactamases
  JOURNAL   J. Gen. Microbiol. 136, 589-598 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         16      957     beta-lactamase precursor (EC 3.5.2.6)
    sigp         16       93     beta-lactamase signal peptide
    matp         94      954     beta-lactamase
BASE COUNT      156 a    357 c    320 g    124 t
ORIGIN      
        1 caggaggttt cgcccatgca tcacccccgc gtcaggaccg ccgtcgccgg tctggtcgcc
       61 gtgctgtccc tcgtgccgct ggcggcctgc gggcagagcg actccaccgc gcccccttcg
      121 tccgccaagc ccgccacgag cgcgtcggcc tcgctgccgc ggccgaagcc gtacaccggc
      181 gacttcaaga agctggagcg cgagttcgac gcacggctgg gcgtgtacgc gatcgacacc
      241 ggcaccggac gcgaggtgac ccacaacgac cgggcgcgct tcgcctacca ctccaccttc
      301 aaggcgctcc aggccgcggt cgtcctcagc acctactccc tggacggcct cgacaagcgg
      361 gtgacgtaca cccgcgagga cctggtcgcc cactctccgg tgaccgagaa gcacgtcgac
      421 accggcatga cgctgaagga gctgtgcgac gcctccgtgc gctacagcga caacaccgcg
      481 gccaacctcc tcttcgaccc acgtgcggcc gggcccaagg gcctggacgc ctcgctggag
      541 aagctcggcg acgacatcac gcggatggac cgggaggagc cggagctgag ccgatgggtc
      601 cccggtgaga agcgcgacac ctccacgccc cgggcgctgg ccgaggatct gcgcgcgttc
      661 gtcctgggca aggccctgcg cgcacccgag cgggcgcagc tgacgacgtg gctgcggacg
      721 aacaccaccg gggacgcggt gatcagggcg ggtgtgccgg agaactgggt cgtcggtgac
      781 aagaccggga ccggcagcta ctacggggcc cgcaacgaca tcgccgtggt gtggcccccg
      841 gactccgcgc ccatcgtgat cgccatcctg tcccaccgcg gtacgaagga cgcggagccg
      901 gacgacgaac tcatcgccga agcggcctcc gtggtcgtcg actcgctgtc ctcctag
//
LOCUS       STMBLAC2      948 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  S.lavendulae beta-lactamase gene, complete cds.
ACCESSION   M34180
KEYWORDS    beta-lactamase.
SOURCE      S.lavendulae DNA.
  ORGANISM  Streptomyces lavendulae
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 948)
  AUTHORS   Forsman,M., Haeggstroem,B., Lindgren,L. and Jaurin,B.
  TITLE     Molecular analysis of beta-lactamases from four species of
            Streptomyces: Comparison of amino acid sequences with those of
            other beta-lactamases
  JOURNAL   J. Gen. Microbiol. 136, 589-598 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         31      948     beta-lactamase precursor (gtg start codon) (EC
                                 3.5.2.6)
    sigp         31      133     beta-lactamase signal peptide
    matp        134      945     beta-lactamase
BASE COUNT      136 a    373 c    333 g    106 t
ORIGIN      
        1 gaccgcaaga gccgtagggg agagcgcatc gtggggacca caggagcccg tccgagccgt
       61 cgtgccgtac tgaccgccgc cgcgggggcg gcggtggcgg ggatccccct gggcgggtcc
      121 acggccttcg cggccccgcg ggggaacccg gacgtcctgc ggcagctgcg cgccctcgaa
      181 caggaacact ccgcccgcct gggcgtgtac gcccgggaca cggccaccgg ccggaccgtc
      241 ctccaccgcg cggaggagcg ctttccgatg tgctcggtct tcaagaccct cgccgtcgcc
      301 gccgtcctgc gcgacctcga ccgggacggc gagttcctgg ccacacgcct cttctacacc
      361 gaacaggaag tgaaggactc cgggttcggg ccggtcaccg gcctgcccga gaacctcgcg
      421 gccggcatga ccgtcgagcg gctgtgcgcc gccgccatct gccagagcga caacgccgcg
      481 gcgaacctgc tgctccggga gctgggcggc ccggaggcgg tcacccgttt ctgccggtcg
      541 gtgggggacc ggaccacccg gctggaccgc tgggagcccg aactgaactc cgccgagccc
      601 ggccggctca ccgacaccac caccccgcgc gccatcgggg cgacctacgg cgaactcgtc
      661 ctcggcgacg cgctcgcccc gcgggaccgc gagcggctca ccggctggct cctggcgaac
      721 acgacgagca ccgaacgctt ccgcaagggc ctgcccgccg actggaccct gggcgacaag
      781 accggcggcg gcgcgtacgg cacgaacaac gacgccggcg tcacctggcc cccgcaccgc
      841 ccgccggtgg tgatggtcgt cctgacgacc cacgaccggc cggacgcggt cgccgacaac
      901 ccgctcgtgg ccaagacggc agcgctgctg gcgtcggccc tcggctga
//
LOCUS       STMBLAC3      951 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  S.fradiae beta-lactamase gene, complete cds.
ACCESSION   M34179
KEYWORDS    beta-lactamase.
SOURCE      S.fradiae DNA.
  ORGANISM  Streptomyces fradiae
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 951)
  AUTHORS   Forsman,M., Haeggstroem,B., Lindgren,L. and Jaurin,B.
  TITLE     Molecular analysis of beta-lactamases from four species of
            Streptomyces: Comparison of amino acid sequences with those of
            other beta-lactamases
  JOURNAL   J. Gen. Microbiol. 136, 589-598 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         31      951     beta-lactamase precursor (gtg start codon) (EC
                                 3.5.2.6)
    sigp         31      133     beta-lactamase signal peptide
    matp        134      948     beta-lactamase
BASE COUNT      133 a    352 c    361 g    105 t
ORIGIN      
        1 tcgtcctgcc gtagggagaa ggggtccatc gtggatcgca ccaccgcacg tccgaaccgc
       61 cgagccgtcc tcgccacagg ggtgggggcc gcgctggcgg ccaccgcggc agcggcgggc
      121 cccgcgcacg ccgctccggg gcggggcgcc cgcgtcgagg ggcgcctgcg ggcgctggag
      181 cgcacgcacg acgcgcggct cggcgcgttc gcgtacgaca cgggcacggg gcggacggtg
      241 gcgtaccgcg cggacgagcg cttcccgatc gcgtcgatgt tcaagacgat cgccgtcgcc
      301 gccgtgctgc gcgacctgga ccgggacggc gaggtcctcg cccggcgcgt ccactacacg
      361 gcggactacg tgaagcggtc gggctactcc cccgtcaccg ggctgccgga gaacgtggcg
      421 aacggcatga ccgtcgccga gctgtgcgag gcgacgctga cccgcagcga caacacggcg
      481 gccaacctgc tcctgcgcga tctgggcggc ccgaccgcgg tgacccgttt ctgccgctcc
      541 gtcggcgacc acgtcacccg gctggaccgc tgggagcccg agctgaactc ggccgagccc
      601 gggagggtca ccgacaccac gtcgccgcgc gccatcggcc gcacgtacgg gcggctgatc
      661 ctcggcgacc tgctggcggc gcacgaccgg gagcggctga cgcggtggat gctggacaac
      721 cggacgagtg acgagcggtt ccgcaagggc ctgcccgcgg actggctgct ggcggacaag
      781 acgggcggcg gcgactacgg caccaacaac gacgcggggg tcgcctggcc gcccggacgc
      841 ccgccggtcg tgctggccgt ccagacgacc cggttcaccc cggacgccga ggccgacaac
      901 gtcctcgtcg cggaggcggc acgcctcctg gcggaggcga tgaccgactg a
//
LOCUS       LBKRRNA      1528 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  Lactobacillus kandleri 16S small subunit ribosomal RNA.
ACCESSION   M23038
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      Lactobacillus kandleri (strain DSM 20593) rRNA.
  ORGANISM  Lactobacillus kandleri
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1528; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1528     16S rRNA
BASE COUNT      411 a    326 c    436 g    336 t     19 others
ORIGIN      
        1 nntaaattga gagtttgatc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc
       61 aagtcgaacg cactgtggtt gaaatgagat gagaagcttg cttcaagtca aatgccaaca
      121 ttgcagtgag tggcgaacgg gtgagtaaca cgtgggaaac ctncctctta gcaggggata
      181 acatctggaa acaggtgcta ataccgtata acaataaaaa ccgcatggtt ttgatttaaa
      241 agatggttct gctatcacta agagatggtc ccncggtgca ttagttagtt ggtgaggtaa
      301 aggctcacca agacgatgat gcatagccga gttgagagac tgatcggcca caatgggact
      361 gagacacggc cnatactcct acgggagnca gcagtaggga atcttccaca atggacgaaa
      421 gtctgatgga gcaacgccgn gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta
      481 agagaagaat gacattgaga gtaactgttc aatgtgtgac ggtatcttac cagaaaggaa
      541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccnagcgtta tccggattta
      601 ttgggcgtaa agcgagcgca gacggttatt taagtctgaa gtgaaagccc tcggcttaac
      661 cgagnnattg ctttggaaac tggataactt gagtgcagta gaggaaagtg gaactccatg
      721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctttctgga
      781 ctgtaactga cgttgaggct cnnaagtgtg ggtagcaaac aggattagat accctggtag
      841 tccacaccgt aaacgatgag tgctagctgt tcgagggttt ccgcccttga gtggcgaagc
      901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg
      961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaagnaacgc gaagaacctt
     1021 accaggtctt gacatccttt gaccactcca gagatggagn tttcccttcg gggtcaaagt
     1081 gacaggtggn gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccncaa
     1141 cgagcgcaac ccttattgtt agttgccagc atttagttgg gcactctagc aagactgccg
     1201 gtgacaaacc ggaggaaggc ggggatgacg tcaaatcatc atgcccctta tgacctgggc
     1261 tacacacgtg ctacaatggc aagtacaacg agcagccaac ccgcgagggt gcgcaaatct
     1321 cttaaagctt gtctcagttc ggactgtagg ctgcaactcg cctacacgaa gtcggaatcg
     1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cgggncttgt acacaccgcc
     1441 cgtcacacca tgagagtttg taacacccaa agtcggtgag gtaaccttta taggggccag
     1501 ccgcctaagg tgggacagat gatnaggg
//
LOCUS       LBMRRNA      1524 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  Lactobacillus minor, 16S small subunit ribosomal RNA.
ACCESSION   M23039
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      L.minor(strain DSM 20014) rRNA.
  ORGANISM  Lactobacillus minor
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1524; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1524     16S rRNA
BASE COUNT      414 a    320 c    431 g    345 t     14 others
ORIGIN      
        1 nncaaaatga gagtttgatc ctggctcagg atnaacgctg gcggcgtgcc taatacatgc
       61 aagtcgaacg ctttgtggtt caactgatat gaagagcttg ctcggatttg aagatgaaca
      121 ttgcaaagag tggcgaacgg gtgagtaaca cgtgggaaac ctncctctta gcgggggata
      181 acacttggaa acaagtgcta ataccgcata atactgataa ccgcatggtt atcagttgaa
      241 agatggttct gctatcacta aganatggtc ccncggtgta ttagctagtt ggtaaggtaa
      301 tggcttacca aggcaatgat acatagccga gttgagagac tgatcggcca caatgggact
      361 gagacacggc cnatactcct acgggaggca gcagtaggga atcttccaca atggacgaaa
      421 gtctgatgga gcaacgccgn gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta
      481 agagaagaat gacattgaga gtaactgttc agtgtgtgac ggtatcttac cagaaaggaa
      541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccaagcgtta tccggattta
      601 ttgggcgtaa agcgagcgca gacggttgtt taagtctgaa gtgaaagccc acagcttaac
      661 tgtggaagtg ctttggaaac tggataactt gagtgcagta gaggagagtg gaactccatg
      721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctctctgga
      781 ctgtaactga cgttgaggct cgnaagtgtg ggtagcaaac aggattagat accctggtag
      841 tccacaccgt aaacgatgag tgctagatgt ttgagggttt ccgcccttaa gtgtcgcagc
      901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg
      961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaagnaacgc gaagaacctt
     1021 accagctctt gacatccttt gaccacttca gagatgaagc tttcccttcg gggacaaagt
     1081 gacaggtggn gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa
     1141 cgagcgcaac ccttattact agttgccagc atttagttgg gcactctagt gagactgccg
     1201 gtgacaaacc ggaggaaggt ggggatgacg tcaaatcatc atgcccctta tgagctgggc
     1261 tacacacgtg ctacaatggc aagtacaacg agcagctaac ccgcgagggt acgcgaatct
     1321 cttaaaactt gtctcagttc ggattgtagg ctgcaactcg cctacatgaa gtcggaatcg
     1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cnggtcttgt acacaccgcn
     1441 cgtcacacca tgagagtttg taacacccaa agccggtgag gtaacctttt aggagccaac
     1501 cgtctaaggt gggacagatg atta
//
LOCUS       LBTRRNA      1529 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  L.halotolerans 16S small subunit ribosomal RNA.
ACCESSION   M23037
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      L.halotolerans (strain DSM 20190) rRNA.
  ORGANISM  Lactobacillus halotolerans
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1529; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1529     16S rRNA
BASE COUNT      406 a    329 c    437 g    328 t     29 others
ORIGIN      
        1 nncaaaatga gagtttgatc ctggctcagg angaacgctg gcggcgtgcc taatacatgc
       61 aagtcgaacg ccttgtcgtt ctactgattt agagagcttg ctcaatactg acgtagaact
      121 atacaaggag tggcgaacgg gtgagtaaca cgtgggaaac cnncctctta gtaggggata
      181 acacctggaa acaggngcta ataccgcata acaacgataa ccgcatggtt atcgcttgaa
      241 aggtggcgta agctaccgct aagagatggt cccgcggtgc attagttagt tggtgaggta
      301 atggctcacn nagacgatga tgcatagccg agttgagaga ctgatcggcc acaatgggac
      361 tgagacacgg ccnntnctcc tacgggagnc agcagtaggg aatcttccac aatggacgaa
      421 agtctgatgg agcaacgcng ngtgtgtgat gaagggtttc ggctcgtaaa acactgttgt
      481 aagagaagaa tgatactaag agtaactgtt tagtatgtga cggtatctta ccagaaagga
      541 acggctaaat acgtgccagc agccncggta atacgtatgt tccnagcgtt atccggattt
      601 attgggcgta aagcgagcgc agacggttat ttaagtctga agtgaaagcc cncagcttaa
      661 ctgtggnagt gctttggaaa ctggataact tgagtgcagt agaggaaagt ggaactccat
      721 gtgtagcggt gaaatgcgta gatatatgga agaacaccag cggcgaaggc ggctttctgg
      781 actgtaactg acgttgaggc tcgnaagtgt gggtagcaaa caggattaga taccctggta
      841 gtccacaccg taaacgatga gtgctagatg tttgggggtt tccgccccna agtgtcgtag
      901 ctaacgcatt aagcactccg cctggggagt acgaccgcaa ggttgaaact caaaggaatt
      961 gacggggacc ngcacaagcg gnggagcatg tggtttaatt cgaagnaacg cgaagaacct
     1021 taccaggnct tgacatcctt tgaccacctc agagatgagg ctttcccttc ggggacaaag
     1081 tgacaggtgg ngcatggttg tcgtcagctc gtgtcgtgag atgttgggtt aagtcccgca
     1141 acgagcgcaa cccttatgat tagttgccag catttagttg ggcactctaa tcagactgcc
     1201 ggtgacaaac cggaggaagg tggggatgac gtcaaatcat catgcccctt atgacctggg
     1261 ctacacacgt gctacaatgg caagtacaac gagcagctaa cccgtaaggg cacgcgaatc
     1321 tcttaaaact tgtctcagtt cggattgtgg gctgcaactc gcccncatga agtcggaatc
     1381 gctagtaatc gcggatcagc acgccgcggt gaatacgttc ccgggtcttg tacacaccgc
     1441 ncgtcacncc atgagagttt gtaacaccca aagccggtga ggtaacccgc aagggaacca
     1501 gccgtctaag gtgggacaga tgattnggg
//
LOCUS       LCORRNA      1525 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  L.confusus 16S small subunit ribosomal RNA.
ACCESSION   M23036
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      L.confusus (strain DSM 20196) rRNA.
  ORGANISM  Lactobacillus confusus
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1525; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1525     16S rRNA
BASE COUNT      407 a    325 c    440 g    339 t     14 others
ORIGIN      
        1 nccaaattga gagtttgatc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc
       61 aagtcgaacg ctttgtggtt caactgattt gaagagcttg ctcagatatg acgatggaca
      121 ttgcaaagag tggcgaacgg gtgagtaacn cgtgggaaac cnacctctta gcaggggata
      181 acatttggaa acagatgcta ataccgtata acaatgacaa ccgcatggtn gtnatttaaa
      241 agatggttct gctatcacta agagatggtc ccgcggtgca ttagctagtt ggtaaggtaa
      301 tggcttacca aggcgatgat gcatagccga gttgagagac tgatcggcca caatgggact
      361 gagacacggc ccatactcct acgggaggca gcagtaggga atcttccaca atgggcgaaa
      421 gcctgatgga gcaacgccnn gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta
      481 agagaagaat gacattgaga gtaactgttc aatgtgtgac ggtatcttac cagaaaggaa
      541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccnagcgtta tccggattta
      601 ttgggcgtaa agcgagcgca gacggttatt taagtctgaa gtgaaagccc tcagctcaac
      661 tgaggaattg ctttggaaac tggatgactt gagtgcagta gaggaaagtg gaactccatg
      721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctttctgga
      781 ctgtaactga cgttgaggct cnaaagtgtg ggtagcaaac aggattagat accctggtag
      841 tccacaccgt aaacgatgag tgctaggtgt ttgagggttt ccgcccttaa gtgccgcagc
      901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg
      961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaannaacgc gaagaacctt
     1021 accaggtctt gacatccctt gacaactcca gagatggagn gttcccttcg gggacaaggt
     1081 gacaggtggn gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa
     1141 cgagcgcaac ccttattact agttgccagc attcagttgg gcactctagt gagactgccg
     1201 gtgacaaacc ggaggaaggt ggggatgacg tcaaatcatc atgcccctta tgacctgggc
     1261 tacacacgtg ctacaatggc gtatacaacg agttgcaaac ccgcgagggt gagctaatct
     1321 cttaaagtac gtctcagttc ggattgtagg ctgcaactcg cctacatgaa gtcggaatcg
     1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cgggtcttgt acacaccgcc
     1441 cgtcacacca tgagagtttg taacacccaa agccggtggg gtaaccttcg ggagccagcc
     1501 gtctaaggtg ggacagatga ttagg
//
LOCUS       LEURRNAC     1493 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  L.cremoris 16S small subunit ribosomal RNA.
ACCESSION   M23034
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      L.cremoris (strain DSM 20346) rRNA.
  ORGANISM  Leuconostoc cremoris
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1493; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1493     16S rRNA
BASE COUNT      398 a    324 c    438 g    324 t      9 others
ORIGIN      
        1 tcaaattgag agtttgatcc tggctcagga tgaacgctgg cggcgtgcct aatacatgca
       61 agtcgaagcc acagcgaaag gtgcttgcac ctttcaagtg agtggcgaac gggtgagtaa
      121 cacgtggaca acctgcctca aggctgggga taacatttgg aaacagatgc taataccgaa
      181 taaaacttag tgtcgcatga cacaaagtta aaaggcgctt cggcgtcacc tagagatgga
      241 tccgcggtgc attagttagt tggtggggta aaggcctacc aagacaatga tgcatagccg
      301 agttgagaga ctgatcggcc acattgggac tgagacacgg cccaaactcc tacgggaggc
      361 tgcagtaggg aatcttccac aatgggcgaa agcctgatgg agcaacgccg cgtgtgtgat
      421 gaaggctttc gggtcgtaaa gcactgttgt atgggaagaa cagctagaat aggaaatgat
      481 tttagtttga cggtaccata ccagaaaggg acggctaaat acgtgccagc agccgcggta
      541 atacgtatgt cccgagcgtt atccggattt attgggcgta aagcgagcgc agacggttta
      601 ttaagtctga tgtgaaagcc cggagctcaa ctccggnatg gcattggaaa ctggttaact
      661 tgagtgcagt agaggtaagt ggaactccat gtgtngcggt ggaatgcgta gatatatgga
      721 agaacaccag tggcgaaggc ggcttactgg actgcaactg acgttgaggc tcgnaagtgt
      781 gggtagcaaa caggattaga taccctggta gtccacaccg taaacgatga acactaggtg
      841 ttaggaggtt tccgcctctt agtgccnaag ctaacgcatt aagtgttccg cctggggagt
      901 acgaccgcaa ggttgaaact caaaggaatt gacggggacc cgcacaagcg gtggagcatg
      961 tggtttaatt cgaagcaacg cgaagaacct taccaggtct tgacatcctt tgaagctttt
     1021 agagatagaa gtgttctctt cggagacaaa gtgacaggtg gtgcatggtc gtcgtcagct
     1081 cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccttattg ttagttgcca
     1141 gcattcagat gggcactcta gcgagactgc cggtgacaaa ccggaggaag gcggggacga
     1201 cgtcagatca tcatgcccct tatgacctgg gctacacacg tgctacaatg gcgtatacaa
     1261 cgagttgcca acccgcgagg gtgagctaat ctcttaaagt acgtctcagt tcggattgta
     1321 gtctgcaact cgactacatg aagtcggaat cgctagtaat cgcggatcag cacgccgcgg
     1381 tgaatacgtt cccgggtctt gtacacaccg cccgtcacac catgggagtt tgtaatgccc
     1441 aaagccggtg gcctaacctt tnaggaagga gcngtctaag gnagganaga nga
//
LOCUS       LEURRNAL     1499 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  L.lactis 16S small subunit ribosomal RNA.
ACCESSION   M23031 M23032
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      L.lactis (strain DSM 20202) rRNA.
  ORGANISM  Leuconostoc lactis
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1499; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1499     16S rRNA
BASE COUNT      396 a    320 c    435 g    316 t     32 others
ORIGIN      
        1 nntaaattga gagtttgatc ctggctcagg atnaacgctg gcggcgtgcc taatacatgc
       61 aagtcgaagc cgcagcgaaa ggtgcttgca ccnttcaagc gagtggcgaa cgggtgagta
      121 acacgtggat aacctgcctc aaggctgggg ataacatttg gaaacagatg ctaataccga
      181 ataaaactta gtatcgcatg atacaaagtt caaaggcgct acggcgncac ctagagatgg
      241 gnccgcggtg cattagttag ttggtggggt aaaggcctac caagacaatg atgcatagcc
      301 gagttgagag actnatcggc cacattggga ctgagacacg gcccnaactc ctacgggagg
      361 cagcagtagg gaatcttcca caatgggcga aagcctnatg gagcaacgcc nngtgtgtga
      421 tgaaggcttt agggtcgtaa agcacngttg tatgggaaga aatgctagaa tagggaatga
      481 ttctagttcg acggtaccat accagaaagg gacggctaaa tacgtgccag cagcnncggt
      541 aatacgtatg tccnnagcgt tatccggatt tattgggcgt aaagcgagcg cagacggttg
      601 attaagtctg atgtgaaagc cnggagctca actccggnat ggcattggaa actggttaac
      661 ttgagtgttg tagaggtaag tggaactcca tgtgtagcgg tggaatgcgt agatatatgg
      721 aagaacacca gtggcgaagg cggcttactg gacaacaact gacgttgagg ctcnnaagtg
      781 tgggtagcaa acaggattag ataccctggt agtccacacc gtaaacgatg aatactaggt
      841 gttaggaggt ttccgcctct tagtgccnaa gctaacgcat taagtattcc gcctggggag
      901 tacgaccgca aggttgaaac tcaaaggaat tgacggggac cngcacaagc ggtggagcat
      961 gtggtttaat tcgaagnaac gcgaagaacc ttaccaggtc ttgacatcct ttgaagcttc
     1021 tagagataga agtgttctct tcggagacaa agtgacaggt ggngcatggt cgtcgtcagc
     1081 tcgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccttatt gttagttgcc
     1141 agcattcagt tgggcactct agcgagactg ccggtgacaa accggaggaa ggcggggacg
     1201 acgtcagatc atcatgcccc ttatgacctg ggctacacac gtgctacaat ggcgtataca
     1261 acgagttgcc aacccgcgag ggtgagctaa tctcttaaag tacgtctcag ttcggactgc
     1321 agtctgcaac tcgactgcac gaagtcggaa tcgctagtaa tcgcggatca gcacgccgcg
     1381 gtgaatacgt tcccnggnct tgtacacacc gnnnntcacn ccatgggagt ttgtaatgcc
     1441 cnaagccggt ggcctaacct tatggaggga gccgtctaag gcaggacaga tgactaggg
//
LOCUS       LEURRNAM     1554 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  L.mesenteroides 16S small subunit ribosomal RNA.
ACCESSION   M23035
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      L.mesenteroides (DSM 20343) rRNA.
  ORGANISM  Leuconostoc mesenteroides
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1554; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1554     16S rRNA
BASE COUNT      406 a    338 c    447 g    334 t     29 others
ORIGIN      
        1 tcaaattgag agtttgatcc tggctcagga tgaacgctgg cggcgtgcct aatacatgca
       61 agtcgaagcc acagcgaaag gtgcttgcac ctttcaagtg agtggcgaac gggtgagtaa
      121 cacgtggaca acctgcctca aggctgggga taacatttgg aaacagatgc taataccgaa
      181 taaaacttag tgtcgcatga cacaaagtta aaaggcgctt cggcgtcacc tagagatgga
      241 tccgcggtgc attagttagt tggtggggta aaggcctacc aagacaatga tgcatagccg
      301 agttgagaga ctgatcggcc acattgggac tgagacacgg cccaaactcc tacgggaggc
      361 tgcagtaggg aatcttccac aatgggcgaa agcctgatgg agcaacgccg cgtgtgtgat
      421 gaaggctttc gggtcgtaaa gcactgttgt atgggaagaa cagctagaat aggaaatgat
      481 tttagtttga cggtaccata ccagaaaggg acggctaaat acgtgccagc agccgcggta
      541 atacgtatgt cccgagcgtt atccggattt attgggcgta aagcgagcgc agacggttta
      601 ttaagtctga tgtgaaagcc cggagctcaa ctccggnatg gcattggaaa ctggttaact
      661 tgagtgcagt agaggtaagt ggaactccat gtgtngcggt ggaatgcgta gatatatgga
      721 agaacaccag tggcgaaggc ggcttactgg actgcaactg acgttgaggc tcgnaagtgt
      781 gggtagcaaa caggattaga taccctggta gtccacaccg taaacgatga acactaggtg
      841 ttaggaggtt tccgcctctt agtgccnaag ctaacgcatt aagtgttccg cctggggagt
      901 acgaccgcaa ggttgaaact caaaggaatt gacggggacc cgcacaagcg gtggagcatg
      961 tggtttaatt cgaagcaacg cgaagaacct taccaggtct tgacatcctt tgaagctttt
     1021 agagatagaa gtgttctctt cggagacaaa gtgacaggtg gtgcatggtc gtcgtcagct
     1081 cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccttattg ttagttgcca
     1141 gcattcagat gggcactcta gcgagactgc cggtgacaaa ccggaggaag gcggggacga
     1201 cgtcagatca tcatgcccct tatgacctgg gctacacacg tgctacaatg gcgtatacaa
     1261 cgagttgcca acccgcgagg gtgagctaat ctcttaaagt acgtctcagt tcggattgta
     1321 gtctgcaact cgactacatg aagtcggaat cgctagtaat cgcggatcag cacgccgcgg
     1381 tgaatacgtt cccgggtctt gtacacaccg cccgtcacac catgggagtt tgtaatgccc
     1441 aaagccggtg gcctaacctt ttaggaagga gccgtctaag gcaggacaga tgaccggggn
     1501 nnnnnngtaa caagnnnnnn nnnnnngaac ctgnnnnnng atcacctcct ttct
//
LOCUS       LEURRNAO     1510 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  L.oenos 16S small subunit ribosomal RNA.
ACCESSION   M35820 M23032
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      L.oenos (strain DSM 20252) rRNA.
  ORGANISM  Leuconostoc oenos
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1510; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1510     16S rRNA
BASE COUNT      387 a    316 c    431 g    351 t     25 others
ORIGIN      
        1 nttaaattga gagtttgatc ctggctcagg atgaacgctg gtggcgtgcc taatacatgc
       61 aagtcgtacg tagccgctga attgatcttc gggtgaagtg aggcaatgac tagagtggcg
      121 aactggtgag taacacgtna gaaacctgcc ctttagtggg ggataacatt tggaaacaga
      181 tgctaatacc gcgtaacaac aaatcacacn tgtgatctgt ttgaaaggtc cttttggatc
      241 gctagaggat ggtcttgcgg cgtattagct tgttggtagg gtagaagcct accaaggcaa
      301 tgatgcgtag ccgagttgag agactggctg gccncattgg gactgagaca ctgcccnaac
      361 tcctacggga gnctgcagta gggaattttc cgcaatgcac gaaagtgtga cggagcgacg
      421 ccgcgtgtgt gatgaaggct ttcgggtcgt aaagcactgt tgtaagggaa gaataactga
      481 attcagagaa agttttcagc ttgacggtac cttaccagaa agggatggct aaatacgtgc
      541 cagcagccgc ggtaatacgt atgtcccnag cgttatccgg atttattggg cgtaaagcga
      601 gcgcagacgg tttattaagt ctgatgtgaa atcccgaggc ccaacctcgg tactgcattg
      661 gaaactgatt tacttgagtg cgatagaggc aagtggaact ccatgtgtag cggtgaaatg
      721 cgtagatatg tggaagaaca ccagtggcga aagcggcttg ctagatcgta actgacgttg
      781 aggctcgaaa gtatgggtag caaacgggat tagatacccc ggtagtccat accgtaaacg
      841 atgggtgcta gttgttaaga ggtttccgcc tcctagtgac gtagcaaacg cattaagcac
      901 ccngcctgag gagtacggcc gcaaggctga aacttaaagg aattgacggg gnnccgcaca
      961 agcggtggag catgtggttt aattcgaaga tacgcnnaaa accttaccag gtcttgacat
     1021 accaatgatc gcttttgtaa tgaaagcttt tcttcggaac attggataca ggtggtgcat
     1081 ggtcgtcgtc agctcgtgtc gtgagatgtt gggttnagtc ccgcaacgag cgcnaccctt
     1141 gttattagtt gccagcattt ngttgggcac tctaatgaga ctgccggtga tnaaccggag
     1201 gaaggtgggg acgacgtcag atcatcatgc cccttatgac ctgggcaaca cacgtgctac
     1261 aatgggaagt acaacgagtc gcaaaccggc gacggtaagc taatctctta aaacttctct
     1321 cagttcggac tggagtctgc aactcgactc cacgaaggcg gaatcgctag tnatcgcgaa
     1381 tcagcatgtc gcggtgaatn cgttcccngg ncttgtacac nccgcnngtc aaatcntggg
     1441 agtcggaagt acccaaagtc gcttggctaa cttttngagg ccggtgccta aggtaaaatc
     1501 gatgactggg
//
LOCUS       LEURRNAPS    1524 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  L.paramesenteroides 16S small subunit ribosomal RNA.
ACCESSION   M23033
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      L.paramesenteroides (strain DSM 20288) rRNA.
  ORGANISM  Leuconostoc paramesenteroides
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1524; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1524     16S rRNA
BASE COUNT      404 a    326 c    435 g    340 t     19 others
ORIGIN      
        1 nttaaaatga gagtttganc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc
       61 aagtcgaacg ctttgtcttt aactgatatg aagagcttgc tctgatttga ttttatctga
      121 caaagagtgg cgaacgggtg agtaacacgt gggtaaccnr cctcttagca ggggataaca
      181 tttggaaaca agtgctaata ccgtataata ccaacaaccg catggtngtn ggttgaaaga
      241 tggttctgct atcactaaga gatggacccn cggtgcatta gctagttggt aaggtaacgg
      301 cttaccaagg caatgatgca tagccgagtt gagagactga tcggccacaa tgggactgag
      361 acacggccca tactcctacg ggagncagca gtagggaatc ttccacaatg ggcgcaagcc
      421 tgatggagca acgccnngtg tgtgatgaag ggtttcggct cgtaaaacac tgttataaga
      481 gaagaacggc actgagagta actgttcagt gtgtgacggt atcttaccag aaaggaacgg
      541 ctaaatacgt gccagcagcc gcggtaatac gtatgttccn agcgttatcc ggatttattg
      601 ggcgtaaagc gagcgcagac ggttatttaa gtctgaagtg aaagccctca gctcaactga
      661 ggaatggctt tggaaactgg atgacttgag tgcagtagag gaaagtggaa ctccatgtgt
      721 agcggtgaaa tgcgtagata tatggaagaa caccagtggc gaaggcggct ttctggactg
      781 taactgacgt tgaggctcga aagtgtgggt agcaaacagg attagatacc ctggtagtcc
      841 acaccgtaaa cgatgagtgc tagatgttcg agggtttccg cccttgagtg tcgcagctaa
      901 cgcattaagc actccgcctg gggagtacga ccgcaaggtt gaaactcaaa ggaattgacg
      961 gggaccngca caagcggngg agcatgtggt ttaattcgaa gnaacgcgaa gaaccttacc
     1021 aggncttgac atcccttgct aatcctagaa ataggangtt cccttcgggg acaaggtgac
     1081 aggtggngca tggttgtcgt cagctcgtgt cgtgagatgt tgggttaagt cccgcaacga
     1141 gcgcaaccct tattattagt tgccagcatt cagttgggca ctctagtgag actgccggtg
     1201 acaaaccgga ggaaggtggg gatgacgtca aatcatcatg ccccttatga cctgggctac
     1261 acacgtgcta caatggcata tacaacgagt cgccaacccg cgagggtgcg ctaatctctt
     1321 aaagtatgtc tcagttcgga ttgtaggctg caactcgcct acatgaagtc ggaatcgcta
     1381 gtaatcgcgg atcagcacgc cgcggtgaat acgttcccgg gncttgtaca caccgcccgt
     1441 cacaccatga gagtttgtaa cacccaaagc cggtggggta accttttagg agccagccgt
     1501 ctaaggtggg acagatgatn aggg
//
LOCUS       LVIRRNAA     1582 bp ss-rRNA            RNA       04-JUL-1990
DEFINITION  L.viridescens 16S small subunit ribosomal RNA.
ACCESSION   M23040
KEYWORDS    16S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      L.viridescens (strain DSM 20410) rRNA.
  ORGANISM  Lactobacillus viridescens
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (sites)
  AUTHORS   Yang,D. and Woese,C.R.
  TITLE     Phylogenetic structure of the "leuconostocs": An interesting case
            of a rapidly evolving organism
  JOURNAL   Syst. Appl. Microbiol. 12, 145-149 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1582; for [1])
  AUTHORS   Yang,D. and Woese,C.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 10-MAR-1989, for release after publication.
            
               Author address: C.R.Woese
               Microbiology Department
               131 Burrill Hall
               407 S. Goodwin Ave
               Urbana, IL 61801
               Ph: 217-333-9369
               E-mail carl@ninja.life.uiuc.edu
FEATURES       from  to/span     description
    rRNA          1     1582     16S rRNA
BASE COUNT      426 a    334 c    436 g    351 t     35 others
ORIGIN      
        1 tacaaaatga gagtttgatc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc
       61 aagtcgaacg ctttgtggtc caactgattt gaagagcttg ctcagatatg acgatggaca
      121 ttgcaaagag tggcgaacgg gtgagtaaca cgtgggaaac ctacctctta gcaggggata
      181 acacttggaa acaagtgcta ataccgtata acactaataa ccgcatggtt attagttaaa
      241 agntggtctt gctatcacta agagatggtc ccacggtgta ttagctagtt ggtaaggtaa
      301 tggcttacca aggcaatgat acatagccga gttgagagac tgatcggcca caatgggact
      361 gagacacggc ccatactcct acgggaggca gcagtaggga atcttccaca atggacgaaa
      421 gtctgatgga gcaacgccgc gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta
      481 agagaagaat gacattgaga gtaactgttc agtgtgtgac ggtatcttac cagaaaggaa
      541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccaagcgtta tccggattta
      601 ttgggcgtaa agcgagcgca gacggttatt taagtccgaa gtgaaagccc acagcttaac
      661 tgtggaagtg ctttggaaac tggataactt gagtgcagta gaggagagtg gaactccatg
      721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctctctgga
      781 ctgtaactga cgttgaggct cgaaagtgtg ggtagcaaac aggattagat accctggtag
      841 tccacaccgt aaacgatgag tgctagatgt ttgagggttt ccgcccttaa gtgtcgcagc
      901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg
      961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaagcaacgc gaagaacctt
     1021 accaggtctt gacatccttt gaccacttca gagatgaagc tttcccttcg gggacaaagt
     1081 gacaggtggt gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa
     1141 cgagcgcaac ccttattact agttgccagc attcagttgg gcactctagt gagactgccg
     1201 gtgacaaacc ggaggaaggt ggggatgacg tcaaatcatc atgcccctta tgacctgggc
     1261 tacacacgtg ctacaatggc aagtacaacg agcagctaac ccgcgagggt acgcgaatct
     1321 cttaaaactt gtctcagttc ggattgtagg ctgcaactcg cctacatgaa gtcggaatcg
     1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cgggncttgt acacaccgcc
     1441 cgtcacacca tgagagtttg taacacccaa agccggtgag gtaacctttt aggagccaac
     1501 cgtctaaggt gggacagatg attagggnnn nnnngtaaca agnnnnnnnn nnnnnnnnnn
     1561 nnnnnnngat cacctccttt ct
//
LOCUS       DIRPARAM     2545 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  D.immitis paramyosin mRNA, partial cds.
ACCESSION   M29733
KEYWORDS    paramyosin.
SOURCE      D.immitis, cDNA to mRNA.
  ORGANISM  Dirofilaria immitis
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 2545)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >  2545     paramyosin (AA at 1)
BASE COUNT      949 a    420 c    544 g    632 t
ORIGIN      
        1 gcattcggta gtatgtcggt tgccgatttg ggaagtttaa ctcgcttgga ggacaagatt
       61 cgtctactgc aagaagatct cgaatccgaa cgtgaacttc gaaatagaat cgaacgagaa
      121 agagctgatc ttagtgtaca actgattgca ttaactgata gacttgaaga tgctgagggt
      181 actactgata gtcagattga atcaaatcgt aaacgtgaag cagaattgca aaaattacgt
      241 aaattattgg aagaatcaca attagaaaat gaagatgcaa tgaatgtttt acgtaaaaag
      301 catcaagatg catgtctcga ttacgctgaa caaattgaac aattacaaaa gaaaaattca
      361 aagattgatc gtgaacgtca acgtctgcaa catgaagtaa ttgagcttac tgcgacaatt
      421 gatcaacttc aaaaggataa gcatttggcg gaaaaagcag cggaacgttt tgaagcgcaa
      481 actatcgaat tgagtaataa agttgaagat ttaaatcgac atgttaatga tttagctcaa
      541 caacgtcaac gtttacaagc tgaaaataac gatcttctca aagagattca tgatcaaaaa
      601 gtacaattgg ataatttgca acacgtgaaa tatcaacttg cgcaacaact tgaagaagca
      661 cgtcgacgtt tagaagatgc tgaacgcgaa cgttctcaac ttcaagcaca acttcatcaa
      721 gtacaattag aattggattc tgtacgtacc gcattagatg aagaatcagc agctcgagca
      781 gaagctgaac ataaacttgc tttagcaaat accgagatta cccaatggaa gagtaaattt
      841 gatgctgaag ttgctcttca tcatgaagag gttgaagatt tacgaaagaa aatgctccaa
      901 aaacaggctg aatatgagga acaaattgaa ataatgttgc aaaaaatttc acaacttgaa
      961 aaagctaaat cacgcttaca aagtgaagtt gaagtattaa ttgttgattt ggaaaaagca
     1021 caaaatacaa tagcaattct tgaacgagca aaagaacaac tcgaaaaaac tgttaatgaa
     1081 ttaaaggtac gtattgatga acttactgtt gaacttgaag cagcacaacg tgaagctcgt
     1141 gctgcattag cagaattaca aaagatgaaa aatttatacg aaaaagcaat tgagcaaaaa
     1201 gaagccttag ctcgagaaaa taaaaaatta caagatgatt tgcatgaagc aaaagaagcg
     1261 cttgctgatg caaatcgtaa attgcatgag cttgatcttg aaaatgcacg attagctggt
     1321 gaaatacgag aattgcaaac agctcttaag gaatctgaag ctgcacgacg tgatgctgag
     1381 aatcgtgcac aacgtgcact tgctgaatta caacaacttc gtattgaaat ggaacgtcgt
     1441 ttgcaagaga aagaggaaga aatggaagca cttcgtaaga atatgcaatt tgaaattgat
     1501 cgacttacag ctgcattagc cgatgctgaa gcacgtatga aggcggaaat ttcacgtcta
     1561 aagaagaagt atcaagcgga aattgccgaa cttgaaatga ccgttgataa tttgaatcgg
     1621 gcaaatattg aagcgcaaaa gacgattaaa aaacaaagtg aacaacttaa aatattgcaa
     1681 gcatcacttg aggatactca acgtcaacta cagcaaactc ttgatcagta tgcattggcg
     1741 caacgtaaag tttccgcact tagcgctgaa ttagaagaat gcaaagttgc acttgataat
     1801 gcaattcgtg ctcgtaaaca agctgaaatt gatcttgaag aagcgaatgc tcgtattact
     1861 gatttggtat caattaacaa taatttaaca gcaattaaaa ataaattgga aactgaatta
     1921 tcgaccgcac aagctgattt ggatgaagca acgaaagaat tgcatgccgc ggatgagcgg
     1981 gctaatcgtg ctttagcaga tgcagctcga gctgttgaac aacttcatga ggagcaggaa
     2041 cattccatga aaattgatgc tctcaggaaa tcattagaag agcaggtgaa gcaattacaa
     2101 gtacaaattc aagaagctga agctgcagca ttattgggtg gtaaacgtgt aattgctaaa
     2161 cttgaaacac gtatacgtga tttggaaact gctttggatg aagaaacacg tcgacataag
     2221 gaaacgcaag gtgcgcttag aaagaaagat cgacgcatca aagaagttca aatgcaagtt
     2281 gatgaagagc ataagatgtt tgtgatggcg caagatactg ctgataggct tcttgaaaaa
     2341 cttaacattc aaaagtggca acttggagaa gctgaatctt taacgatggc taatcttcaa
     2401 cgagtacgtc gatatcaacg tgaattggaa gatgctgaag gtcgtgctga tcaagctgaa
     2461 agctcattac atcttattcg tgctaaacat cgttcgtcag ttgttaccgg caaaaatgct
     2521 tcagcatcta aaatttacgt tctcg
//
LOCUS       ONGPARAM1     134 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  O.volvulus paramyosin mRNA partial cds, clone OV-1.
ACCESSION   M29734
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 134)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   134     paramyosin (AA at 1)
BASE COUNT       53 a     23 c     28 g     30 t
ORIGIN      
        1 ttacaaaaga aaaattcaaa aattgaccgt gaacgtcaac gtctgcaaca tgaagtaatt
       61 gagcttacag cggcaattga tcaacttcaa aaggataagc atttggctga gaaagcagca
      121 gaacgttttg aagc
//
LOCUS       ONGPARAM2     135 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  O.volvulus paramyosin mRNA partial cds, clone OV-2.
ACCESSION   M29735
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 135)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   135     paramyosin (AA at 2)
BASE COUNT       43 a     31 c     26 g     35 t
ORIGIN      
        1 tttgcaacat gtcaaatatc aacttgcgca acaacttgaa gaagcacgtc gtcgtttgga
       61 agatgctgaa cgtgaacgtt cacaacttca agcacaactt catcaagttc agctagaatt
      121 ggattccgta cgtac
//
LOCUS       ONGPARAM3     176 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  O.volvulus paramyosin mRNA, partial cds, clone OV-3.
ACCESSION   M29736
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 176)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   176     paramyosin (AA at 1)
BASE COUNT       74 a     20 c     36 g     46 t
ORIGIN      
        1 aaagttgctc ttcatcatga agaggttgaa gatttacgaa agaaaatgtt gcaaaagcaa
       61 gctgaatatg aagagcaaat tgaaataatg ctgcaaaaaa tttcacaact tgaaaaggct
      121 aaatcacgtt tgcaaagtga agttgaagta ttgattgttg atttagaaaa agcaca
//
LOCUS       ONGPARAM4     414 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  O.volvulus paramyosin mRNA, partial cds, clone OV-4.
ACCESSION   M29737
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 414)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   414     paramyosin (AA at 3)
BASE COUNT      143 a     70 c    101 g    100 t
ORIGIN      
        1 ccttagctcg agaaaataaa aagttgcaag atgatttgca tgaagctaaa gaagcgcttg
       61 ctgatgcaaa tcgtaaattg catgaacttg atcttgagaa tgcacgatta gctggtgaaa
      121 ttcgagaatt gcaaacagca ctcaaagaat ctgaagcagc tcgacgtgat gctgagaatc
      181 gtgcacaacg tgcacttgct gaattgcaac aacttcgcat cgaaatggaa cgtcgtttgc
      241 aagagaagga agaagaaatg gaagcacttc gtaagaatat gcaatttgaa attgatcgtc
      301 ttacagctgc attagctgat gctgaagcac gtatgaaggc ggagattgca cgtttgaaga
      361 aaaagtatca ggcggaaatt gccgaacttg aaatgaccgt tgataatttg aatc
//
LOCUS       ONGPARAM5     145 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  O.volvulus paramyosin mRNA, partial cds, clone OV-5.
ACCESSION   M29738
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 145)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   145     paramyosin (AA at 3)
BASE COUNT       48 a     28 c     28 g     41 t
ORIGIN      
        1 tattgcaagc atcacttgaa gatactcaac gtcaattaca acaaactcta gatcagtatg
       61 ctttggcaca acgtaaagta tctgcgctta gtgctgaatt ggaagaatgc aaagttgcac
      121 ttgataatgc aattcgtgct cgtaa
//
LOCUS       ONGPARAM6     127 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  O.volvulus paramyosin mRNA, 3' end, clone OV-7.
ACCESSION   M29740
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 127)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       56     paramyosin (AA at 30
    mRNA    <     1      127     paramyosin mRNA
BASE COUNT       44 a     20 c     24 g     39 t
ORIGIN      
        1 ctgctggcaa aaatgcttca gcatcgaaga tttacgttct cgaagatgat cagtagatag
       61 atagatggtg ataatagttg aaatcaactg atatcatctg ttaacaatca ttaaaatgtg
      121 tctcatc
//
LOCUS       ONGPARAM7     129 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  O.volvulus paramyosin mRNA, partial cds, clone OV-6.
ACCESSION   M29739
KEYWORDS    paramyosin.
SOURCE      O.volvulus, cDNA to mRNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 129)
  AUTHORS   Limberger,R.J. and McReynolds,L.A.
  TITLE     Filarial paramyosin: cDNA sequences from Dirofilaria immitis and
            Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 38, 271-280 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   129     paramyosin (AA at 3)
BASE COUNT       48 a     20 c     31 g     30 t
ORIGIN      
        1 aagtacaaat tcaagaagct gaagcagcag cattattggg tggtaaacgt gtaattgcta
       61 aacttgaaac acgtatacgt gatttggaaa ctgctttgga tgaagaaaca cgtcgacata
      121 aggaaacgc
//
LOCUS       CORDTXRA     2529 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  C.diphtheriae toxin repressor (dtxR) gene, complete cds.
ACCESSION   M34239
KEYWORDS    regulatory protein; toxin repressor.
SOURCE      C.diphtheriae (strain C7) DNA, clone pHH2500..
  ORGANISM  Corynebacterium diphtheriae
            Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE   1  (bases 1 to 1390)
  AUTHORS   Boyd,J., Oza,M.N. and Murphy,J.R.
  TITLE     Molecular cloning and DNA sequence analysis of the diptheria tox
            regulatory element (dtxR) from Corynebacterium diptheriae
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2529)
  AUTHORS   Boyd,J., Oza,M.N. and Murphy,J.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.R.Murphy, 11-MAY-1990.
               University Hospital
               Dept Medicine-Rm E613
               88 East Newton Street
               Boston, MA 02118-2393
FEATURES       from  to/span     description
    pept        681     1361     toxin repressor protein (dtxR)
    pept         25      456     ORF1
    pept       1384     2370     ORF3
    signal      595      629     promoter sequence
    binding     664      669     ribosomal binding site (put.)
    binding    1200     1283     DNA-binding region (put.)
BASE COUNT      691 a    615 c    593 g    630 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagctttctc gaatcaaacg cgagatgtat caacagctcg gacgtgaagc aaccaacgaa
       61 gagcttgcgg aagaatctgg cattgaagaa tccaaaattg aaatgttgct tcgccagtct
      121 cgtgacccag taagcctaga tatgccagtt ggtgccgatg aagaggctcc acttggtgac
      181 tttatcgaag attccgaggc gactgatgcc gaatcagctg tagttgcgtc gttgcgtcac
      241 tctgatatca gaacagtctt ggacactttg gagcaacgtg aacaggacgt tattcgcctt
      301 cgttatgggc tcgatgatgg agttccccgc actttagatc aaattggtcg tcaatttgga
      361 ctgtctcgtg aaagagttcg ccaaattgaa cgcgaggtca tgtcgaaact gcgtgacggt
      421 gcccgtgccg aaaaacttcg agcatacgca caataaagcg tttgcttaga tatgcctacc
      481 aataaagaca taaacgccta ttaaaagcaa tctttagatt aggcgttata attaaagtct
      541 catcgaaaaa cgcgctgcgg gactacaacg caacaagaaa actattccat atttttcacg
      601 ctacaattcg ttgtagattg ataggaattg atcaccagca cacaacagtc tccatggcac
      661 tataaggaaa gaggcttaca atgaaggact tagtcgatac cacagagatg tacttgcgta
      721 ctatctatga gctggaagaa gagggagtca cccctcttcg cgctaggatc gctgagcgtc
      781 tggaacaatc tggacctaca gttagccaaa ccgttgcccg tatggagcgc gatggacttg
      841 tcgttgtcgc ctcagaccgc agtctacaaa tgacaccgac aggccgcact ttagcgactg
      901 cagttatgcg taaacatcgc ttagctgagc gccttcttac cgatatcatt ggtctagata
      961 tcaataaagt tcacgatgaa gcctgccgct gggaacacgt tatgagtgac gaagttgaac
     1021 gcaggctcgt gaaagtattg aaagatgtca gtcggtcccc cttcggaaac ccaattccag
     1081 gtctcgacga actcggcgta ggcaattctg acgcggcagc ccccggaact cgcgttattg
     1141 acgctgccac cagcatgccc cgcaaagtac gcattgttca gattaacgaa atctttcaag
     1201 ttgaaacgga tcagtttaca cagctcctcg atgctgacat ccgtgttgga tcagaagtcg
     1261 aaattgtaga tagagacggc cacatcacgt tgagccacaa tggaaaagat gtcgaactcc
     1321 tcgatgatct ggctcacact attcgtatcg aagaactcta aatactaaag gcggcgaaat
     1381 tagatgaaac tcctcgttac cggtggcgcc ggatacgtag gaagtgtctg ttccactgtt
     1441 ctgctcgaac aggggcacga agtaacaatt gtcgataatc ttacaaccgg caatcgcgat
     1501 gctgttccgc taggagccac ttttgtcgag ggtgatatca aagacgttgc ggaaaacgtt
     1561 ttatcatctg attcttttga tgcggttctt cacttcgcgg ctcgttcgct tgtaggcgaa
     1621 tcagttgaaa agccagatga atattggcag cacaacatgg tgaccacact cgcacttctt
     1681 gatgccatga aacgtaataa cgtgcgaaac attgtctttt cttctacagc tgctacatac
     1741 ggcgaacccg aaacagtccc gattactgaa gacgctccta cccacccaac gaatccatac
     1801 ggcgcaacca agctgtctat cgactacgcg atcacatcat acgcacatgc atatgggttt
     1861 gctgcgacga gtctgcggta cttcaacgta gctggcgcat atggtttggt gggtgaaaat
     1921 cgagaaatag aaacccatct cattccgctc gttttacaag tagcactggg acaccgtgac
     1981 aagattttca tgtttggtga cgactggcca actgaagatg gaacccctat ccgcgattac
     2041 attcatattc gtgatcttgc agatgcccat attttggcac ttcaaagtaa cgtcgaaggc
     2101 agccaccgga tctttaatct cggatccggt gaaggatatt cggtcaaaca ggtcattgat
     2161 acctgcagag aagttaccgg tcatcccatc cccgctgagg tagctcctcg acgggctggt
     2221 gatcctgcag tgctcattgc atcatcggct aaggcgcaat ccgaactagg atggaagcca
     2281 caacgcactg atctgcacac gattgtttct gatgcttggg cttttacgtc tcaactcgga
     2341 gacaaagccc atagcgcttc tcgtggataa caacgagtag gtttatcttt tgcagtccgc
     2401 ggtgtgatta cttaagtatt cacaccgcgg attttacatt tgttgactca ccaaaatgtt
     2461 tctgagcttc ctgactccca ttgataacaa gatcaacaat ggatccagat agacctcgat
     2521 aataagctt
//
LOCUS       ECOANSBA     1643 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  E.coli L-asparaginase II (ansB) gene, complete cds.
ACCESSION   M34234
KEYWORDS    L-asparaginase II; ansB gene.
SOURCE      E.coli (strain K12 isolate JM108) DNA, clones lambda-ansB and
            pansB7.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1643)
  AUTHORS   Bonthron,D.T.
  TITLE     L-asparaginase II of Escherichia coli K-12: Cloning and sequencing
            of the ansB gene and its position on the chromosomal map
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.T.Bonthron, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        346     1392     L-asparaginase II (ansB) (EC 3.5.1.1)
    sigp        346      411     L-asparaginase II signal peptide
    matp        412     1389     L-asparaginase II
    binding     335      339     ribosomal binding site (put.)
    site       1400     1425     transcription termination signal (put.)
BASE COUNT      442 a    404 c    405 g    392 t
ORIGIN      63.8 min on K12 map.
        1 gcggtaaccc attacagaat gtgctgggaa gcctgggggg gctgcaatcc tcaatccaaa
       61 ccgagtggaa aaagcaggaa aaagatttcc agcagtttgg caaagatgtt tgtagccgcg
      121 ttgtgactct ggaagatagc cgcaaagccc tggtcgggaa tttaaaataa tcctctattt
      181 taagacggca taatactttt ttatgccgtt taattcttcg ttttgttacc tgcctctaac
      241 tttgtagatc tccaaaatat attcacgttg taaattgttt aacgtcaaat ttcccataca
      301 gagctaaggg ataatgcgta gcgttcacgt aactggagga atgaaatgga gtttttcaaa
      361 aagacggcac ttgccgcact ggttatgggt tttagtggtg cagcattggc attacccaat
      421 atcaccattt tagcaaccgg cgggaccatt gccggtggtg gtgactccgc aaccaaatct
      481 aactacacag tgggtaaagt tggcgtagaa aatctggtta atgcggtgcc gcaactaaaa
      541 gacattgcga acgttaaagg cgagcaggta gtgaatatcg gctcccagga catgaacgat
      601 aatgtctggc tgacactggc gaaaaaaatt aacaccgact gcgataagac cgacggcttc
      661 gtcattaccc acggtaccga cacgatggaa gaaactgctt acttcctcga cctgacggtg
      721 aaatgcgaca aaccggtggt gatggtcggc gcaatgcgtc cgtccacgtc tatgagcgca
      781 gacggtccat tcaacctgta taacgcggta gtgaccgcag ctgataaagc ctccgccaac
      841 cgtggcgtgc tggtagtgat gaatgacacc gtgcttgatg gccgtgacgt caccaaaacc
      901 aacaccaccg acgtagcgac cttcaagtct gttaactacg gtcctctggg ttacattcac
      961 aacggtaaga ttgactacca gcgtaccccg gcacgtaagc ataccagcga cacgccattc
     1021 gatgtctcta agctgaatga actgccgaaa gtcggcattg tttataacta cgctaacgca
     1081 tccgatcttc cggctaaagc actggtagat gcgggctatg atggcatcgt tagcgctggt
     1141 gtgggtaacg gcaacctgta taaatctgtg ttcgacacgc tggcgaccgc cgcgaaaacc
     1201 ggtactgcag tcgtgcgttc ttcccgcgta ccgacgggcg ctaccactca ggatgccgaa
     1261 gtggatgatg cgaaatacgg cttcgtcgcc tctggcacgc tgaacccgca aaaagcgcgc
     1321 gttctgctgc aactggctct gacgcaaacc aaagatccgc agcagatcca gcagatcttc
     1381 aatcagtact aatcgcctcg ccccggtatc gtgccggggc tttttcactt cagactcacg
     1441 tccattgcca attttaatta ccctaatgat aatcaccgga ataaattatt ccgcgtgagg
     1501 tttttcgggt gaaaaagcaa tggattgttg gcacggcact gcttatgttg atgactggta
     1561 atgcctgggc agatggcgaa ccgccaactg aaaatatctt aaaagatcaa ttcaaaaagc
     1621 agtatcacgg cattctcaag ctt
//
LOCUS       HUMADE5RC    1946 bp ds-DNA             SYN       04-JUL-1990
DEFINITION  Synthetic human/adenovirus type 5 recombination junction.
ACCESSION   M34061
KEYWORDS    .
SOURCE      Human fetal bone marrow fibroblasts transformed with Adeno5/SV40
            recombinant virus (cell line H13.1) DNA, clone pH13.1SV1.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 1946)
  AUTHORS   Romani,M., De Ambrosis,A., Alhadeff,B., Purrello,M., Gluzman,Y. and
            Siniscalco,M.
  TITLE     Preferential viral integration at the highly recombinogenic
            chromosomal site 1p36 in human cells transformed with an
            Adeno5/SV40 hybrid
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Romani 08-MAY-1990.
FEATURES       from  to/span     description
    recomb      429      430     adenovirus type 5 end/Human DNA start
    rpt         430     1134     tandem Alu repeats
BASE COUNT      492 a    548 c    510 g    396 t
ORIGIN      Chromosome 1p36.
        1 tttgacccgg aacgcggaaa acacctgaga aaaacacctg ggcgagtctc cacgtaaacg
       61 gtcaaagtcc ccgcggccct agacaaatat tacgcgctat gagtaacaca aaattattca
      121 gatttcactt cctcttattc agttttcccg cgaaaatggc caaatcttac tcggttacgc
      181 ccaaaattta ctacaacatc cgcctaaaac cgcgcgaaaa ttgtcacttc ctgtgtacac
      241 cggcgcacac caaaaacgtc acttttgcca catccgtcgc ttacatgtgt tccgccacac
      301 ttgcaacatc acacttccgc cacactacta cgtcacccgc cccgttccca cgccccgcgc
      361 cacgtcacaa actccacccc ctcattatca tattggcttc aatccaaaat aaggtatatt
      421 attgatgata tataatatag aggctgaggc aggagggttg cttgagcctg ggaggctgcg
      481 gctgcagtga gccatgatca caccactgca cttcagcctg ggcaacagag cgagactccg
      541 tctcaaaaaa tagtaatagg ccaggcgcgg tggctcatgc cagtaatccc agcgctttgg
      601 gaggctgagg caggcagatc acctgaggtc aggagttcaa gaccagcctg gccaacatgg
      661 tgaaaccctg tctctactaa aaatacaaaa attagttggg catggtggcg gcgcacacca
      721 gtaatcccag ctactcaaga ggctgagggc acttgggcag cagaggaggt tgtagtgagc
      781 agagatcacg ccactgcact ctagcctggg caacagaggg agactctgtc tcaaaaaata
      841 ataggccggg cacggtggct catgcctgta atcccagcac tttgggaggc cgaggcgggt
      901 ggatcacctg agatcaggag tttaagacca gcctggccaa catggtgaaa ccccatctct
      961 actaaaaata caaaaaatta gccaggcgtg gtggcgggcg tctgtaatcc cagctacttg
     1021 ggaggctgaa gcaggagaat cacttgagcc caggaggcag aggttgcagt gagctgagat
     1081 catgccatta cactccagcc tgggcaacaa gagtgaaact ccgtctcaaa aaataataat
     1141 aataatgata ataataataa taattgcatc tattcactga gcgcctcctg tacaccaggc
     1201 accatgtggc attaggggcc catgctgtca tcatattctg ccagccaccc atggggtggt
     1261 ggggtaattc cccaggccac acctgtccac acctctcttc aggtactgcg tgggcatgtg
     1321 cggagacggc gccaatgact gtggggccct gaaggcggct gatgtcggca tctcgctgtc
     1381 ccaggcagaa gcctcagtgg tctcaccctt cacctcgagc atggccagta ttgagtgcgt
     1441 gcccatggtc atcaggtaag gcaggcaggg accggtgggt gagggctcag cagggctggt
     1501 cagccaacct gggtccccta atgtccgtgc cctgccaccc agggaggggc gctgttccct
     1561 tgacacttcg ttcagcgtct tcaagtacat ggctctgtac agcctgaccc agttcatctc
     1621 cgtcctgatc ctctacacgg tgagtatctg cagagctccg tacccagagc cgtcccagga
     1681 ctagggaggg gacacagccc tgtgccttgg agctggcagg tcccagatag atgacgtggg
     1741 cccagatcct gaccctgcca ctctccggct gtgtggttgg agaggtggca aggcttgggg
     1801 gagatgagaa aaacaccagg cacggagcct ggtacccact aggtcctcac aaatggcagc
     1861 agctgtcatg ggctaacatc ttgaccgcaa ctctgggagc tccccaatct gcagggaaga
     1921 ctgaacggag gcagggccaa aaccct
//
LOCUS       MCLPCCA      1701 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  M.laminosus phycoerythrocyanin (PCC) alpha and beta subunit genes,
            complete cds, and linker protein, 5' end.
ACCESSION   M34254
KEYWORDS    light-harvesting biliprotein; phycoerythrocyanin.
SOURCE      M.laminosus (strain Fischerella PCC 7603) DNA.
  ORGANISM  Mastigocladus laminosus
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria.
REFERENCE   1  (bases 1 to 1701)
  AUTHORS   Eberlein,M. and Kufer,W.
  TITLE     Genes encoding both subunits of phycoerythrocyanin, a light-
            harvesting, biliprotein from the cyanobacterium Mastigocladus
            laminosus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Kufer, 11-MAY-1990.
               Botanisches Institut
               Ludwig-Maximilians- Universitaet Muenchen
               Menzingerstr. 67
               D-8000 Muenchen 19
               Federal Republic of Germany
FEATURES       from  to/span     description
    pept        279      797     phycoerythrocyanin beta-subunit
    pept        884     1372     phycoerythrocyanin alpha-subunit
    pept       1528  >  1701     phycoerythrocyanin linker protein precursor
    matp       1531  >  1701     phycoerythrocyanin linker protein
BASE COUNT      526 a    312 c    383 g    480 t
ORIGIN      
        1 agctttacat ttaaggatat ggcgctcaaa gaatgggagt tttacagtta ttaaataaag
       61 ttcataaaaa agaatacaag taatatataa aatacgaaac taaggtttga aagtaattgc
      121 aaatatcaag gcaaacgaaa agataaaggc taattgttgc acgtaaaaat ggtatttttt
      181 tgatttttag gtgtagttat tggcggattt tatctggtct cggtttgtct tgattaaaaa
      241 tgcaatagct gccatgaaat cgaggagaaa atagaaaaat gcttgatgct ttttccagag
      301 tagttgaaca agctgataaa aaaggtgctt acctaagcaa tgatgaaata aatgctttgc
      361 aggctatagt tgcagatagc aacaaacgtt tggatgtagt gaatcggctt acaagcaacg
      421 cctcctcaat tgtggcgaat gcttatcgtg ctttagtagc agaacgtccg caggtattta
      481 atcctggtgg tccttgtttt caccatcgca accaagctgc ttgcatccgc gatttaggat
      541 ttattctgcg ttatgttacg tactccgtac ttgctggtga cactagtgtc atggacgatc
      601 gctgcttgaa tggtttgcgc gaaacctatc aagctcttgg tactcccggt gatgctgtag
      661 cttctggaat taaaaaaatg aaagaagctg cacttaaaat tgctaacgat cccaatggca
      721 ttaccaaagg tgattgcagt caattaatgt ctgaattagc aagttatttt gaccgagcag
      781 cggctgctgt tgcttaatgc agtcaaaggc gcgactaagt gacaaaattt tcaaactctt
      841 ttttagtaat taatttgaca agtttttgga gaagaagcga atcatgaaga ccccgttaac
      901 cgaagcgatc gccgccgccg atcttcgcgg ttcttatttg agcaacacag aattgcaagc
      961 agtatttggt cgtttcaatc gcgctcgtgc tggtttagaa gcagcaagag cattcgctaa
     1021 caacggtaag aaatgggcgg aagcagcagc caatcacgta tatcaaaaat ttccctacac
     1081 tacccaaatg caaggacctc agtatgcttc tactccagaa gggaaagcaa agtgcgtgcg
     1141 ggatatcgac cattacttac gcacaattag ctactgctgt gtagttggcg gtactggtcc
     1201 cttggatgac tatgtagttg caggcttaaa ggaatttaac agcgccctgg gtttatctcc
     1261 cagctggtac atagctgctc tagaatttgt ccgcgacaat cacggtttga ccggtgatgt
     1321 tgctggtgaa gccaacacct acatcaacta tgcaattaat gcattgagct aaatctaaac
     1381 agcgaatgtc ctttgtattt gcttaatagc taatggctga tagctgatgg ctaatggcta
     1441 atggaaaaat gcaattagca attagcagtg agttattggc cattaactgt gtttgagaaa
     1501 aatgataaaa atggagcgat tgttaacatg agtacatcag tcgcagaacg cctcgcaatt
     1561 aaagatgaag tagacaaaaa aattgaattg cgccctaatt ggagcgaaga cgagctgcaa
     1621 attgttttta aaacagccta cgagcaagtt tttggacgtc agggattata tgccagtcaa
     1681 agatttgcaa ctgccgaagc t
//
LOCUS       YSCTSDNAA     238 bp ds-DNA             PLN       04-JUL-1990
DEFINITION  S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-ii.
ACCESSION   M34310
KEYWORDS    .
SOURCE      S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-ii.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 238)
  AUTHORS   Wang,S.-S. and Zakian,V.A.
  TITLE     Sequencing of Saccharomyces telomeres cloned using T4 DNA
            polymerase reveals two domains
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-S.Wang, 16-MAY-1990.
BASE COUNT        0 a      0 c    146 g     92 t
ORIGIN      
        1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg
       61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg
      121 gtgtggtgtg tgggtgtggt gtgtgggtgt ggtgtgggtg tgggtgtggt gtgtgtgggt
      181 gtggtgtgtg ggtgtgggtg tggtgtggtg tgtgtgggtg tgtgggtgtg gtgtgtgt
//
LOCUS       YSCTSDNAB     297 bp ds-DNA             PLN       04-JUL-1990
DEFINITION  S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-i.
ACCESSION   M34311
KEYWORDS    .
SOURCE      S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-i.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 297)
  AUTHORS   Wang,S.-S. and Zakian,V.A.
  TITLE     Sequencing of Saccharomyces telomeres cloned using T4 DNA
            polymerase reveals two domains
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-S.Wang, 16-MAY-1990.
BASE COUNT        0 a      0 c    184 g    113 t
ORIGIN      
        1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg
       61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg
      121 gtgtggtgtg ggtgtgggtg tgggtgtggt gtgtgtgtgg gtgtggtgtg tgggtgtggg
      181 tgtgtgtggg tgtgggtgtg ggtgtggtgt ggtgtgtggg tgtggtgtgt gggtgtggtg
      241 tgtgtgggtg tggtgtgtgt gggtgtgggt gtgtgtgggt gtgggtgtgg tgtgtgg
//
LOCUS       YSCTSDNAC     325 bp ds-DNA             PLN       04-JUL-1990
DEFINITION  S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-ii.
ACCESSION   M34312
KEYWORDS    .
SOURCE      S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-4-ii.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 325)
  AUTHORS   Wang,S.-S. and Zakian,V.A.
  TITLE     Sequencing of Saccharomyces telomeres cloned using T4 DNA
            polymerase reveals two domains
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-S.Wang, 16-MAY-1990.
BASE COUNT        0 a      0 c    199 g    126 t
ORIGIN      
        1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg
       61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg
      121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg gtgtgtgggt gtggtgtggg tgtggtgtgt
      181 gggtgtgtgg gtgtgggtgt gtgtggtgtg tggtgtgtgt ggtgtgggtg tggtgtgtgg
      241 gtgtgggtgt ggtgtgtggg tgtgggtgtg gtgtgtgtgt gtgggtgtgg tgtgtgggtg
      301 tgggtgtggt gtgtgggtgt gtggg
//
LOCUS       YSCTSDNAD     248 bp ds-DNA             PLN       04-JUL-1990
DEFINITION  S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-i.
ACCESSION   M34313
KEYWORDS    .
SOURCE      S.cerevisiae (strain XS595-6C) extrachromosomal plasmid DNA, clone
            YLP108CA-4-i.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 248)
  AUTHORS   Wang,S.-S. and Zakian,V.A.
  TITLE     Sequencing of Saccharomyces telomeres cloned using T4 DNA
            polymerase reveals two domains
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-S.Wang, 16-MAY-1990.
BASE COUNT        0 a      0 c    151 g     97 t
ORIGIN      
        1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg
       61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg
      121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg tgtgtgtggg tgtggtgtgg gtgtggtgtg
      181 tgggtgtgtg tgggtgtggt gtgtgtgggt gtgtgggtgt ggtgtgtgtg ggtgtggtgt
      241 gtgggtgg
//
LOCUS       BOVMET       1047 bp ss-mRNA            MAM       04-JUL-1990
DEFINITION  Bovine metalloproteinase inhibitor mRNA, complete cds.
ACCESSION   M32303
KEYWORDS    metalloproteinase inhibitor.
SOURCE      Bovine aorta endothelium, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1047)
  AUTHORS   Boone,T.C., Johnson,M.J., De Clerck,Y.A. and Langley,K.E.
  TITLE     cDNA cloning and expression of a metalloproteinase inhibitor
            related to tissue inhibitor of metalloproteinases
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2800-2804 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            K.E.Langley, 23-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        291      953     metalloproteinase inhibitor precursor
    sigp        291      368     metalloproteinase inhibitor signal peptide
    matp        369      950     metalloproteinase inhibitor
    signal     1042     1047     polyA signal
BASE COUNT      220 a    350 c    312 g    165 t
ORIGIN      
        1 gaattccggc ttctatggag cactcgggac caggtccgcg gcgcgcgcac tcgctcgctc
       61 gccgcccccc agccagctct cgcttccgcg ccgccagccg cgccccgcgc ctcctcgctg
      121 caccccgcga cctagagcca agaaagtttg tgtggcgagt gagggccgga gaggagagcg
      181 cgcccgcgga gtgccgtcca gaccagcgcg gccccggcgg agaggggagc gccccgagcc
      241 caggcggcgg cggctagccc gagtccgcga cccccgcccc tccgcccgcc atgggcgccg
      301 ccgcccgcag cctgccgctc gcgttctgcc tcctgctgct ggggacgctg ctcccccggg
      361 ccgacgcctg cagctgctcc ccggtgcacc cgcaacaggc gttttgcaat gcagacatag
      421 tgatcagggc caaagcagtc aataagaagg aggtggactc tggcaacgac atctacggca
      481 accccatcaa gcggattcag tatgagatca agcagataaa gatgttcaag ggacctgatc
      541 aggacataga gtttatctac acagcccccg ccgctgccgt gtgtggggtc tcgctggaca
      601 ttggaggaaa gaaggagtat ctcattgcag ggaaggccga ggggaatggc aatatgcata
      661 tcaccctctg tgacttcatc gtgccctggg acaccctgag tgccacccag aagaagagcc
      721 tgaaccacag gtaccagatg ggctgtgagt gcaagatcac tcgatgcccc atgatcccat
      781 gctacatctc ctctccggac gagtgcctct ggatggactg ggtcacggag aagaacatca
      841 acggacacca ggccaagttc ttcgcctgca tcaagagaag cgacggctcc tgcgcctggt
      901 accgcggagc agcacccccc aagcaggagt ttctggacat cgaggacccg taagcaggcc
      961 accaggactc ctggggccaa ttgacagtgt ccaagagttc agactggtcc agctccgaca
     1021 tcccttcctg gacacagcat gaataaa
//
LOCUS       CFDCG        1291 bp ss-DNA   circular  VRL       04-JUL-1990
DEFINITION  Coconut foliar decay virus, complete genome.
ACCESSION   M29963
KEYWORDS    circular; complete genome.
SOURCE      Coconut foliar decay virus DNA, clones NMORG, pORG, and pORGSAU3A.
  ORGANISM  Coconut foliar decay virus
            Viridae; ss-DNA viruses.
REFERENCE   1  (bases 1 to 1291)
  AUTHORS   Rohde,W., Randles,J.W., Langridge,P. and Hanold,D.
  TITLE     Nucleotide sequence of a circular single-stranded DNA associated
            with coconut foliar decay virus
  JOURNAL   Virology 176, 648-651 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            W.Rhode, 15-MAR-1989, for release after publication.
FEATURES       from  to/span     description
    pept        103      975     ORF 1
    pept        314      775     ORF 2
    pept        639      797     ORF 3
    pept       1098     1286     ORF 4
    pept        987      823 (c) ORF 5
    pept        568      422 (c) ORF 6
    site         40       70     stem-loop structure
BASE COUNT      336 a    323 c    332 g    300 t
ORIGIN      
        1 cgccaaaaac ctctgctaag tcccgtgcta agtataaata gccgcggggg ctagtattac
       61 ccccgcggct ccccaacctc tgctaacccc gcttggctat aaatgggttc ctccattcgc
      121 cgctggtgct tcactctgaa ctacgagaca gaggaagagg cggcgaatgt tgtaaggcgc
      181 atcgagtccc ttaacttggt ctatgctata gttggggacg aggttgcacc ttcaactggc
      241 caacgacacc tccaaggatt catccacttg aagaccggtc ggcgactgca aggattgaag
      301 actgttcttg ggaatgacag gattcacctg gagccgaccc gtggttccga cgaacagaat
      361 agagactact gttcgaagga acgggtgctt ctcgagcacg gagtcccgac tcgtcctgga
      421 gtcaaaaggc cacgattggc ccaacgattt gctgaggaac ctgatgaact ccgcctggaa
      481 gacccaggcg gataccgaag atgcgttgta cacggagctt cggtggaatg gacaagatgg
      541 gccgctgaaa atccgttccc atttccatat cacaattggc agcttgaagt gctgtctgcg
      601 atcggagagc cagcggacga tcgcacaatc ctctggatat gcggacgaga cggaggagac
      661 gggaagtccg tgtttgccaa atatctcgga ctcaagcccg actggttcta cacatgtggt
      721 ggaaccagaa aggacgtatt gtaccagtac atcgaggacc caaaacgaaa tttaatcctc
      781 gatgtaccca ggtgtaattt agagtattta aattatgccc tgttagaatg tgttaagaac
      841 agggcattca gttcggacaa atacgaaccc cttagttatc ttgggttcga ccatgtgcat
      901 gtactcgtat ttgccaatgt cctgcctgat tatttgaaaa tcagcaggga cagaataaaa
      961 ctgtggaata tttaaagtat gtgtcatcta aattacacca atacccgccc gcccacgcgc
     1021 tatcgtttac atcttatgaa tatcctgccc aggccgaagg cctgggaggt gctacccggc
     1081 cgaaggccgg gaacaatatg aatcgagtta tgggcgggcc cacaataaaa gattccattt
     1141 ggataagaac gaatctgtta ctttgcttgc agtgcacgca accactttcc acgtcaccaa
     1201 tccaggtgag tagcttgctg gagaagaaag ccgcaagcct ctatctaccg tccatttgtt
     1261 tttgcgcgat cggacggctg agttgatctg g
//
LOCUS       HUMMET       1035 bp ss-mRNA            PRI       04-JUL-1990
DEFINITION  Human metalloproteinase inhibitor mRNA, complete cds.
ACCESSION   M32304
KEYWORDS    metalloproteinase inhibitor.
SOURCE      Human fetal aorta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1035)
  AUTHORS   Boone,T.C., Johnson,M.J., De Clerck,Y.A. and Langley,K.E.
  TITLE     cDNA cloning and expression of a metalloproteinase inhibitor
            related to tissue inhibitor of metalloproteinases
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2800-2804 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            K.E.Langley, 23-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        255      917     metalloproteinase inhibitor precursor
                                 /nomgen="TIMP" /map="Xp11.3-p11.23"
                                 /hgml_locus_uid="LJ0125P"
    sigp        255      332     metalloproteinase inhibitor signal peptide
    matp        333      914     metalloproteinase inhibitor
    signal     1011     1016     polyA signal
BASE COUNT      216 a    361 c    306 g    152 t
ORIGIN      
        1 gaattccggc ccgccgtccc ccaccccgcc gccccgcccg gcgaattgcg ccccgcgccc
       61 ctcccctcgc gcccccgaga caaagaggag agaaagtttg cgcggccgag cggggcaggt
      121 gaggagggtg agccgcgcgg gaggggcccg cctcggcccc ggctcagccc ccgcccgcgc
      181 ccccagcccg ccgccgcgag cagcgcccgg accccccagc ggcggccccc gcccgcccag
      241 ccccccggcc cgccatgggc gccgcggccc gcaccctgcg gctggcgctc ggcctcctgc
      301 tgctggcgac gctgcttcgc ccggccgacg cctgcagctg ctccccggtg cacccgcaac
      361 aggcgttttg caatgcagat gtagtgatca gggccaaagc ggtcagtgag aaggaagtgg
      421 actctggaaa cgacatttat ggcaacccta tcaagaggat ccagtatgag atcaagcaga
      481 taaagatgtt caaagggcct gagaaggata tagagtttat ctacacggcc ccctcctcgg
      541 cagtgtgtgg ggtctcgctg gacgttggag gaaagaagga atatctcatt gcaggaaagg
      601 ccgaggggga cggcaagatg cacatcaccc tctgtgactt catcgtgccc tgggacaccc
      661 tgagcaccac ccagaagaag agcctgaacc acaggtacca gatgggctgc gagtgcaaga
      721 tcacgcgctg ccccatgatc ccgtgctaca tctcctcccc ggacgagtgc ctctggatgg
      781 actgggtcac agagaagaac atcaacgggc accaggccaa gttcttcgcc tgcatcaaga
      841 gaagtgacgg ctcctgtgcg tggtaccgcg gcgcggcgcc ccccaagcag gagtttctcg
      901 acatcgagga cccataagca ggcctccaac gcccctgtgg ccaactgcaa aaaaagcctc
      961 caagggtttc gactggtcca gctctgacat cccttcctgg aaacagcatg aataaaacac
     1021 tcatccccgg aattc
//
LOCUS       HUMACEX      1141 bp ds-DNA             PRI       04-JUL-1990
DEFINITION  Human angiotensin-converting enzyme (ACE) gene, 5' end.
ACCESSION   M34434
KEYWORDS    angiotensin-converting enzyme.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1141)
  AUTHORS   Shai,S.-Y., Langford,K.G., Martin,B.M. and Bernstein,K.E.
  TITLE     Genomic DNA 5' to the mouse and human angiotensin-converting enzyme
            genes contains two distinct regions of conserved sequence
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 1128-1133 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1136  >  1141     angiotensin-converting enzyme
BASE COUNT      169 a    427 c    364 g    181 t
ORIGIN      
        1 ctgtccctgg ctccttccct gatcccaccg ccagcctcac cccacggttc ctccattgcc
       61 ccacctccca ctgcgccgcc gggcctctgc cagggtcaag gggcttcccc cctctggcag
      121 cagacgccat ggtgccgagg tggcctccac aaccgccctg tgcgccaata ggacaagact
      181 gtcctccctc ccccacactt gtcactttga gggacacgtg gatgagacag gaaaacacag
      241 gggagtgtgg agacctgagg tgacttggag caagcctctc aacctgagcg gcaatttctt
      301 catctgtaaa atgagggggt tgttctcatc tctgaggctt tgtgtcgctc tcaaagcctg
      361 ctagcctcgg gttctaggac tctgttggga tcgtgtgtga tgttttctgc tgagcgacgg
      421 cagcctgtgt cctcgggggg aaagagggca ggcgctccaa agctcctgcg ctctgtggct
      481 ccccctccct cgcagcccca agccccaggt gtgccggccg ccctgagccc ctccagcacc
      541 tcccggaggc gcctgcaaga cacctaaggt ccccgcctcc ctcctctccc ccccgccaca
      601 cccctacccc cggcaggcga cgtccccgcc cctcgaccat ggcctggtga agaagccggc
      661 caggcccgat cagccccatc cccgccgcac gagcggcgcc tgcggacagc tcctggggcc
      721 ccggccttgt cactccggag gcgggaggct ccggggggtc gggctgggaa gatcgagccg
      781 gaggccgcta ggctcccagg ccccggccga ggctgcgcgg ccgcacggtg ggcaggctcg
      841 ggtgttccgg caaactgccg ggtccccatc ttcaaaagag aggaggccct ttctccagct
      901 tcctctgcgg gagcccgacc cagccccatc ccgccacccc cgggctgcac ctcggcccct
      961 ccccggcccg cgcccctgcc cggggcgggc caggaacctc ggcccgcgcc gctggggact
     1021 ttggagcgga ggaggaagcg cggcggggcg ggggcggggg tgtgtcgggt tttataaccc
     1081 gcagggcggc cgcggcgcag gagaaggggc agagccgagc accgcgcacc gcgtcatggg
     1141 g
//
LOCUS       MURU3REG      492 bp ss-RNA             VRL       04-JUL-1990
DEFINITION  Murine retrovirus SL3-2 U3 region.
ACCESSION   M34383
KEYWORDS    long terminal repeat.
SOURCE      Murine retrovirus SL3-2, cDNA to viral RNA.
  ORGANISM  Murine retrovirus SL3-2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses.
REFERENCE   1  (bases 1 to 492)
  AUTHORS   Dai,H.Y., Etzerodt,M., Baekgaard,A.J., Lovmand,S., Jorgensen,P.,
            Kjeldgaard,N.O. and Pedersen,F.S.
  TITLE     Multiple sequence elements in the U3 region of the leukemogenic
            murine retrovirus SL3-2 contribute to cell-dependent gene
            expression
  JOURNAL   Virology 175, 581-585 (1990)
  STANDARD  simple staff_review
BASE COUNT      158 a    136 c    114 g     84 t
ORIGIN      
        1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc
       61 atgggaaaaa taccagagct gatgttctca gaaaaacaag aacaaggaag tacagagagg
      121 ctaaaaagta ccgggactag ggccaagaac agatggtccc cagaccgcta actgcagtaa
      181 cgacaggata tctgtggtta agcactaggg ccccggccca gggccaagaa cagatggtcc
      241 ccagaccgct aacgacagga tatctgtggt taagcactag ggccccggcc cagggccaag
      301 aacagatggt ccccagaaat agctaaaaca acaacagttt caagagaccc agaaactgtc
      361 tcaaggttcc ccagatgacc ggggatcaac cccaagcctc atttaaacta accaatcagc
      421 tcgcttctcg cttctgtacc cgcgcttatt gctgcccagc tctataaaaa gggtaagaac
      481 cccacactcg gc
//
LOCUS       MUSACEX      1299 bp ds-DNA             ROD       04-JUL-1990
DEFINITION  Mouse angiotensin-converting enzyme (ACE) gene, 5' end.
ACCESSION   M34433
KEYWORDS    angiotensin-converting enzyme.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1299)
  AUTHORS   Shai,S.-Y., Langford,K.G., Martin,B.M. and Bernstein,K.E.
  TITLE     Genomic DNA 5' to the mouse and human angiotensin-converting enzyme
            genes contains two distinct regions of conserved sequence
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 1128-1133 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1294  >  1299     angiotensin-converting enzyme
BASE COUNT      262 a    393 c    369 g    275 t
ORIGIN      
        1 ggatcctctt tctgaactcc ataggtacca ctctcagaat cccctcctaa ccctccacca
       61 gtacagcctg cttcctccag aaacacaaca gctcacacaa acccacagcc ctcaaatcct
      121 gggagcccaa gcaatatgga ccacctaagt tctcagcagg tggccccttg ctccttcctc
      181 ctttctccat ctctacccgt cccctgggac ctctttggga gcaagtcacc tgacaggcca
      241 gcacagtctc actctgagtc cttgcaggtc acacaatctc ccatgtgaac tccatcaccc
      301 catctggcaa ggacaagggt ccgcccacct ctttgggggc acagccctgg tgctcagagg
      361 ttaacttcac aggctgctga tgcatttaga ggacctgatt ctccttcccc caggcctgtc
      421 actttgaggg acgggtggag gagacaggga aatgactggg agtgtggaga cctgaggtga
      481 cttgaagcca atctgtctca ggggtaattt ctccattttt aaaaagcagg gggttgttgt
      541 aatctctaag gcacattttt ttttagcttt cagagcgtgc aactgcgagg ttctagggct
      601 ctgttgggac cacacatgat ggttcttgtg agttgaacag ctgtgtatcc aaagagtagg
      661 ggcaggcatg ccagtgctga tggcctagtc tatggctccc cctcatagag cagaatagac
      721 tctatttcga ggcatcccaa caaccgaagc ccaaggtgaa ctggttgctg acatctgcag
      781 cacctagcca aggcgcctgt caggacagct aaacttcccg cgcccacgca ggcgacccct
      841 ttcctccatg acctggcaac cttaagtcca tcccggtggc accagcctgc cacccccgcc
      901 cccttgtcac tcaagagccc ggaaaggctt ggagggttat gccaagcctg ctgggctctg
      961 tggcctgaga ctacgtggtg gcggccagag ggcacggttg ggccgcagca gtgtttgcag
     1021 cgggcaggga cccttttcta gcttcctctg agagagcccg atctagcccc atttctccat
     1081 ccaggctgct ccctggcccc acagggctct ccttgcccag ggcgggcggg gaatcagccg
     1141 ggttcgcgct gccggggact ttggagcgag gaggaagcgc ggaggggcgg ggaggtgggg
     1201 gtgtgtcggg aggcggtggt ggctgggttt tataatccgc agggcggtcg cggcacgaga
     1261 gaaggggcag cgccgcaccg cgcgcaccgc gccatgggg
//
LOCUS       MUSOCT3      1324 bp ss-mRNA            ROD       04-JUL-1990
DEFINITION  Mouse octamer binding transcription factor (oct-3) mRNA,
            complete cds.
ACCESSION   M34381
KEYWORDS    octamer binding transcription factor.
SOURCE      Mouse embryo carcinoma cell line P19, cDNA to mRNA, clone
            lambda-C1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1324)
  AUTHORS   Okamoto,K., Okazawa,H., Okuda,A., Sakai,M., Muramatsu,M. and
            Hamada,H.
  TITLE     A novel octamer binding transcription factor is differentially
            expressed in mouse embryonic cells
  JOURNAL   Cell 60, 461-472 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         50     1183     octamer binding transcription factor (oct-3)
    mRNA          1     1324     oct-3 mRNA
BASE COUNT      297 a    361 c    400 g    266 t
ORIGIN      
        1 gtgagccgtc tttccaccag gcccccggct cggggtgccc accttcccca tggctggaca
       61 cctggcttca gacttcgcct cctcaccccc accaggtggg ggtgatgggt cagcagggct
      121 ggagccgggc tgggtggatt ctcgaacctg gctaagcttc caagggcctc caggtgggcc
      181 tggaatcgga ccaggctcag aggtattggg gatctcccca tgtccgcccg catacgagtt
      241 ctgcggaggg atggcatact gtggacctca ggttggactg ggcctagtcc cccaagttgg
      301 cgtggagact ttgcagcctg agggccaggc aggagcacga gtggaaagca actcagaggg
      361 aacctcctct gagccctgtg ccgaccgccc caatgccgtg aagttggaga aggtggaacc
      421 aactcccgag gagtcccagg acatgaaagc cctgcagaag gagctagaac agtttgccaa
      481 gctgctgaag cagaagagga tcaccttggg gtacacccag gccgacgtgg ggctcaccct
      541 gggcgttctc tttggaaagg tgttcagcca gaccaccatc tgtcgcttcg aggccttgca
      601 gctcagcctt aagaacatgt gtaagctgcg gcccctgctg gagaagtggg tggaggaagc
      661 cgacaacaat gagaaccttc aggagatatg caaatcggag accctggtgc aggcccggaa
      721 gagaaagcga actagcattg agaaccgtgt gaggtggagt ctggagacca tgtttctgaa
      781 gtgcccgaag ccctccctac agcagatcac tcacatcgcc aatcagcttg ggctagagaa
      841 ggatgtggtt cgagtatggt tctgtaaccg gcgccagaag ggcaaaagat caagtattga
      901 gtattcccaa cgagaagagt atgaggctac aggacacctt tcccaggggg ggctgtatcc
      961 tttcctctgc ccccaggtcc ccactttggc accccaggct atggaagccc ccacttcacc
     1021 acactctact cagtcccttt tcctgagggc gaggcctttc cctctgttcc cgtcactgct
     1081 ctgggctctc ccatgcattc aaactgaggc accagccctc cctggggatg ctgtgagcca
     1141 aggcaaggga ggtagacaag agaacctgga gctttggggt taaattcttt tactgaggag
     1201 ggattaaaag cacaacaggg gtggggggtg ggatggggaa agaagctcag tgatgctgtt
     1261 gatcaggagc ctggcctgtc tgtcactcat cattttgttc ttaaataaag actggacaca
     1321 cagt
//
LOCUS       PP1SIM       1900 bp ds-DNA             PHG       04-JUL-1990
DEFINITION  Bacteriophage P1 sim region proteins, complete cds.
ACCESSION   M34382
KEYWORDS    sim region.
SOURCE      Bacteriophage P1 viral DNA.
  ORGANISM  Bacteriophage P1
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 1900)
  AUTHORS   Maillou,J. and Dreiseikelmann,B.
  TITLE     The sim gene of Escherichia coli phage P1: Nucleotide sequence and
            purification of the processed protein
  JOURNAL   Virology 175, 500-507 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        470      718     sim region ORF 1
    pept        791     1570     sim region ORF 2
BASE COUNT      636 a    323 c    360 g    581 t
ORIGIN      
        1 ccccatcacg ccccaccata attcgggcgt aacgcggttt acggacacaa aaataccgca
       61 atatcggaaa tctgcggttg tccgcactaa cattcaggct gtcaaaaccg gtcgcagaat
      121 ttgctacgac ggtggaacta taagcctgaa cgattaaaag gtcaatatga tgcgaaaaga
      181 tagcattcgc gacttaaaaa tacaaattta ttagagcatt atttgtttaa taaatacaca
      241 gttggatcta ataacctctt ttttttaaag gcgaaaatat gtaccctaaa tgagttataa
      301 ggcaggtgag gttataatga gaaaactatt actaccgtta ttatttatgg ctgggactgt
      361 taatgcagca tcaagcgtaa aggagatttg taccgattat acgaaatacc ttgggcacgt
      421 ttacgccttt gcgatcaatg actattaatc cagtgctagg agtggattta tgctgattcg
      481 tttgttttta gtgctttcct ttttaacatt taatgttttt gctgatgaag ttgacttttc
      541 gaaggtagat tgcaattcag tggaaacaag aaaagctctt attgaagaat ataacgaaat
      601 attatcgtca tatggaataa cagtggttga ttcttataat caaaaaacta ttcagaaagg
      661 aataaataaa ctggtctgtt atggggttta ccaatattca gatggctctt cggagtagtt
      721 tatttataag catacccaaa tgtctgtgaa ttaattagtg agtttaaacc gattaatgag
      781 tgaatggaaa atgaaattat ttaatgtaat aacattttgt tgtgctattt ttgctggaag
      841 cgcgatagct gataataaat tgccagattg gctttctacc tcaaagaaag attatgattt
      901 agtaagggca ttctatttgt ctggatttgc ttcgaaagca atgaacaatc aatttggtta
      961 tcatttgcca tctgagttgg ttaatgattt taaagataat gaatttgctg ctcaggaaaa
     1021 atggaacaca attccaattg tgtatggtga aataaaatcc ataagaatgg tgaataataa
     1081 accaattgta gaattattta ctccagggga aaatgcaacg cctataaatt atatcaaatt
     1141 gaaaatattg gattcaaagc aagactctct gttaaaacta aaaaaagggg atgatatata
     1201 tgcagtgtgc tccggtgcta attttagctt agtgccaatt ctgagcaact gcactccagc
     1261 aacagacgtc attgatgctg cactctcttt ttctggtgaa tatatgttcc ctgcttttga
     1321 ttctttttca cctactaagc aaaacgtcaa atatatattc acaaatcaag atcctgttca
     1381 gatgatgaat tttatagggt acctatcctt agtcgataca acgaaggata agaataaaat
     1441 ggatatggtt cgtaagtgta cgccttggaa gccggaatgt tcacaacaat tcgttgatgt
     1501 aatggaagga tttgatagca tcatgtataa atatgaggga gaatttaaaa actacataga
     1561 attaaaatag tgtttgaaag gatagtcaat ttaagtttta acaccaacgc ccattaaagg
     1621 gcttttattg ttttactcaa aacaacctga ttagtgtata accatatcat atccctcatt
     1681 ccacctacac tgattacccc cagacaacaa tattcctact caatgaacaa atgactactc
     1741 gtagaatcgg ttaacacacc agattctacg aggtttcaat gacaccacga caattactcg
     1801 aagacgtcaa aacccgcttc acacctttga ttgcggatga accggcctta ctggaatccc
     1861 tgctaagaaa agcattggga acctaccagg atagggcggg
//
LOCUS       PSEPST1      1446 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  Plasmid pPST1 (from Pseudomonas stutzeri) ORF1 gene, complete cds.
ACCESSION   M34385
KEYWORDS    .
SOURCE      Plasmid pPST1 (from Pseudomonas stutzeri, strain MO-19) DNA.
  ORGANISM  Pseudomonas stutzeri
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1446)
  AUTHORS   Fujita,M., Kubota,M., Futai,M. and Amemura,A.
  TITLE     Identification and DNA sequencing of a new plasmid (pPST1) in
            Pseudomonas stutzeri MO-19
  JOURNAL   Plasmid 22, 271-274 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        576     1034     ORF1
BASE COUNT      267 a    446 c    425 g    308 t
ORIGIN      
        1 gcatgccatc gacccccgtc aataccttga gcacatgctg cgtccagcgc gtagccatcc
       61 acatctgctc cgggctggcg taggcgtaaa cctgcttggc cttggcgttc tgcggcttga
      121 gttccagttc tgcccgaacg tggtcaggat cgcccccggt ctgcttgccc ttttcgtaga
      181 tgcgcttcat ggctgccgag gtgcgcgacc cgatgtagag cgtccgcccg tcttgctcgc
      241 ggtgagagtc gcctacatgc gtgaccttga ggccgaactt gtctgccgtc tccagaccca
      301 gcgcgtagag agagtcccac gcccccggct cgcagtagtc gatagtcacg tcagccctga
      361 gtaggttgtg ccccctgaac tcatcccgaa cgacagcagc aaaggcgggt gcccggtcgc
      421 cgcttgcaga ggcccatacg cgcgtcccga cggtgttacc gccccactgc acccgtgcaa
      481 gcacagaatc gcctctgacg acgttatagg cccgttcata gccatttttc ggggtggtcg
      541 gcgttacgtc cgaacgactc gaaagagccg gccagatgat caaggaaccc ggtcgggttt
      601 ggcctcgatg ctggcggtgt agtagtcgaa cctcattttt cacccccaga ggctctagag
      661 cgcttcgcgg ggtttttttg gctcggtgtg taggggggtt aggtgttgtg acccccgtag
      721 ttacctatgc gggggttggg cggtctgcaa acatgatcca tgagagcgac gcaaaggccc
      781 gagcatctcg ttcatgtgag cctctccaga ctcaaggcct agagaccatc ccggccctag
      841 attcaacgtg ccagcgcccg ggctttctgg caggcccgga gaaccctttg ttggacaggg
      901 ttaaccatag tcaggaactc cgagacaaga cccgacatcg tgacgccctg aacctcagaa
      961 agacccctga tcgtgtcgta cacgtcctga tcgagcgtaa ccgtgattct cggtttctgt
     1021 gtggccattg gtgatgcact ccgggcaaag tgatgcaccg aatgtacgca gccgaaggcg
     1081 aaaggcaacc cgtgcgaatt gtttacatgt gcgcatgagc gcatgagcgc atgtgatttg
     1141 tgtgcatgtg atttgtatgc atgtgcgcat actggcgcgc tacgcttgcc cgccgagccg
     1201 agttgccggc tcgtaccggc gctacgctcc cggcccgatc cgggcacctc gacccggccc
     1261 gacgaatttg tccgcgccgg tggacttatt tgtccgacac tagccgagca acgaacagcc
     1321 cgaacgactc aatatcgccg cccagtgcct ccagcttgcg gcaatagttg ccgtactgct
     1381 tagccatgaa ctcaagagcg cgctcatcat cggtaagctg gcggatcgtt ccaggcggtg
     1441 ctggct
//
LOCUS       BNACRUCA     1686 bp ss-mRNA            PLN       04-JUL-1990
DEFINITION  B.napus cruciferin mRNA, 3' end.
ACCESSION   J05233
KEYWORDS    12S storage globulin; cruciferin; seed development protein.
SOURCE      B.napus (strain Svaloefs Karat) seed, cDNA to mRNA, clone pCRU1.
  ORGANISM  Brassica napus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 1686)
  AUTHORS   Roedin,J., Ericson,M.L., Josefsson,L.-G. and Rask,L.
  TITLE     Characterization of a cDNA clone encoding a Brassica napus 12 S
            protein (cruciferin) subunit: Relationship between precursors and
            mature chains
  JOURNAL   J. Biol. Chem. 265, 2720-2723 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1     1521     cruciferin precursor (AA at 1)
    sigp    <     1       60     cruciferin signal peptide
    matp         61      948     cruciferin alpha-subunit
    matp        949     1518     cruciferin beta-subunit
    signal     1543     1548     polyA signal
    signal     1671     1676     polyA signal
BASE COUNT      462 a    455 c    415 g    354 t
ORIGIN      
        1 ccttgcgaga cagccgtcgc aacgttcggg gttctcctcg tcctcaacgg ctgtctcgca
       61 aggcagtcgc taggggttcc tcctcagcta gggaacgcgt gtaacctcga taacttagac
      121 gttctccagc ctaccgaaac tatcaagagc gaggctggtc gggtcgagta ctgggatcac
      181 aacaatcctc agatccgatg tgctggtgtc tctgtctctc gtgttataat cgaacaaggc
      241 ggtctctacc ttcctacctt cttcagctcc cccaaaattt caatcgttgt tcaaggaatg
      301 ggtattagcg gaagagtggt ccctggatgc gcggaaacct tcatggactc gcagcctatg
      361 caaggacaac aacaaggtca accatggcag ggacaacaag gacaacaggg tcagcaggga
      421 caacaaggtc aacagggtca gcagggacaa caaggtcaac agggtcagca gggtcaacag
      481 ggacagcagg gtcagcagca gcaagggttc cgtgacatgc accagaaggt cgaacatgtt
      541 cgacatggag acatcattgc cattactgca ggctcttccc attggatcta caacaccggt
      601 gaccagccac ttgtcattat ctgccttctc gacattgcca actaccaaaa ccaactcgac
      661 cgcaacccaa gaacgttccg tctggccgga aacaacccac agggcggttc ccagcagcag
      721 cagcaacaac aacagaacat gttgagcggg ttcgaccctc aggtcctagc ccaggcattg
      781 aaaatcgacg ttaggttggc tcaggagctt cagaaccaac aagacagcag aggaaacatc
      841 gttcgtgtta agggaccttt ccaggttgtg aggccgcctc ttagacagcc atacgagagt
      901 gagcagtgga gacacccccg tggcccacca caaagcccac aagacaacgg cttggaggag
      961 actatctgca gcatgaggac ccacgagaac attgatgacc cagcccgtgc tgacgtgtat
     1021 aagcccaacc tcggccgtgt gactagcgct aacagctaca ctttacccat cttgcagtat
     1081 atcagactca gcgccacccg tggcattctc cagggtaatg cgatggtgct tccgaaatac
     1141 aacatgaacg cgaacgagat cttgtactgc actcaaggac aagcaaggat tcaagtggtg
     1201 aacgacaacg gacagaacgt gctggaccag caggtgcaga agggacagct cgtggtcatc
     1261 ccacaaggat tcgcctatgt tgtccagtcc caccaaaaca acttcgaatg gatttctttc
     1321 aagacaaacg ctaacgcgat ggtcagcact ttggccggta gaacctcggc cttgagggca
     1381 ttgccactag aggtcataac caacgctttc caaatttctc tcgaggaagc tagaaggatc
     1441 aagttcaaca cgcttgagac cactttgact cgtgcgcgcg gtggacaacc ccagttgatc
     1501 gaggagatag tcgaggctta agttaaaacg tttactttta ctaataaagt agtgacatgg
     1561 ttactattgt aatggtcagt ttgtaatcat gtccactcta agtttttaac gtatgtgtaa
     1621 aatatgtgtc taaagaacac gccggcacgt ctcttgtatg taacccttct aataaatacc
     1681 cccccc
//
LOCUS       BS2ORIA      2279 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  Plasmid pBS2 origin of replication.
ACCESSION   M34386
KEYWORDS    origin of replication.
SOURCE      Plasmid pBS2 DNA.
  ORGANISM  Plasmid pBS2
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 2279)
  AUTHORS   Darabi,A., Forough,R., Bhardwaj,G., Watabe,M., Goodarzi,G.,
            Gross,S.C. and Watabe,K.
  TITLE     Identification and nucleotide sequence of the minimal replicon of
            the low-copy-number plasmid pBS2
  JOURNAL   Plasmid 22, 281-286 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        195      470     ORF1
    pept       1139     1948     ORF2
    binding     169      177     ribosomal binding site (put.)
    binding    1110     1117     ribosomal binding site (put.)
    orgrpl      865     1008     origin of replication DNA
BASE COUNT      699 a    343 c    562 g    675 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagctttgtt gaatatgggg aatagttata accagatggg aagccttttt tcggctgtac
       61 cttattatca caaggctata aaggcagcta aaataagcgg agcgaaggaa attacccagg
      121 cttattatga tttagctctg attcatttta ggaacaatga aaagattgaa ggacgcggat
      181 tctttggaaa agcaatggag caagctgaag aatttaacga tagtctgttt caggatttat
      241 taaatgttct aaaggcatta tttattgaga ctggaagcag gcaaaaggtc atgaatgccc
      301 ttgaggcgtt gagaacggga caaggctatc catattttga agagctagct ttaatcgctg
      361 cagaatttta tactatggat aagcgcatgg aagattctat atatttttac aatgagatgg
      421 tgtgcgctca aagacaaatt caaaggggcg atttcctgta tgaagtttaa aggtcttttt
      481 tcagcggtcc ttattgtaag cctgttggtt ggagcggggt attctttcgt gcaccatgat
      541 gaggtttcag ttgcttcaag gaatgcaaca tgattaagaa taaagaatcg agatttacca
      601 cttaaagata tggaaaacag atcgttattg atctgttttt ttctttgtgt gctgaccatg
      661 cgggcatgat gcggagataa cgctgggtgc tgcggagata atgcgggcat gggtgcgctg
      721 atcgcggaga ctgaaaaggc aaaatttcaa aaaaatctcc cccctacggg ggggaagaat
      781 ggttttgatc ttttgttttg ggttttaaaa aagccggctg ttttcagccg gtactttttc
      841 gattttggcg aagccgaaat cgggtctttt cttatcttga tactatatag aaacaacatc
      901 atttttcaaa atcagttcaa agtcttgtgt gtcaagggtt agatcggttt ttgacaggta
      961 aaaactcctt ctgctattat taaggtgtcg aatcaaaata atagaatgct agagaactag
     1021 ctcagaagga gtttttttgt tgatttattc atctgaaaac gattatagca tcctcgaaga
     1081 taaaaccgca acaggtaaaa agcgggattg gaaggggaaa aagagacgta cgaacctcat
     1141 ggcggagcat tacgaagcat tagagagtaa gattggggca ccttactatg gcaaaaaggc
     1201 tgaaaaacta attagttgtg cagagtatct ttcgtttaag agagacccgg agacgggcaa
     1261 gttaaaactg tatcaagccc atttttgtaa agtgaggtta tgcccgatgt gtgcgtggcg
     1321 caggtcgtta aaaattgctt atcacaataa gttgatcgta gaggaagcca atcggcagta
     1381 cggctgcgga tggatttttc tcacgctgac ggttcggaat gtaaagggag aacggctgaa
     1441 gccacaaatt tctgagatga tggaaggctt taggaaactg ttccagtaca aaaaagtaaa
     1501 aacttcggtt cttggatttt tcagagcttt agagattacc aaaaatcatg aagaagatac
     1561 ataccatcct cattttcatg tgttgatacc agtaaggaaa aattattttg ggaaaaacta
     1621 tattaagcag gcggagtgga cgagcctttg gaaaaaggcg atgaaattgg attacactcc
     1681 gattgttgat attcgtcgag tgaaaggtaa agccaagatt gacgctgaac tgattgaaaa
     1741 cgatgtgcgg gaagcaatga tggagcaaaa agctgttctc gaaatttcta aatatccggt
     1801 taaggatacg gatgttgtgc gcggcaataa ggtgactgaa gacaatctga acacggtgct
     1861 ttacttggat gatgcgttgg cagctcgaag gttaattgga tacggtggca ttttgaagga
     1921 gatacataaa gagctgaatc ttcggtgatg cggaggacgg cgatctggtc aagattgagg
     1981 aagatgatga cgaggttgca aatggtgcat tgaggttatg gcttattggc atcctggcat
     2041 taaaaattac ataatcaaat aaaaaaagca gacctttaag gcgtgctttt tttaactaac
     2101 ccatttgtat tgtgttgaaa tatgttttgt tcgtttttta tctttgtagc tttttatgat
     2161 taattgtcta tgataaccta gggaatcttt acattgtctg tgttgaatag ttacataaac
     2221 tggtttgctc ttaggatagt ttaaatatac tgttgtcatt gctccgaaaa caaaagctt
//
LOCUS       ECOCYSBA     1407 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  E.coli cysteine regulatory protein (cysB) gene, complete cds.
ACCESSION   M34332
KEYWORDS    cysB gene; cysteine regulatory protein.
SOURCE      E.coli (mutant strain CB64) DNA, clone pLB1.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1407)
  AUTHORS   Tei,H., Watanabe,K., Murata,K. and Kimura,A.
  TITLE     Analysis of the Escherichia coli K-12 cysB gene and its product
            using the method of gene fusion
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 962-969 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        349     1323     cysteine regulatory protein (cysB)
BASE COUNT      378 a    312 c    326 g    391 t
ORIGIN      
        1 ggatccggaa ggtaataaga ccatggttcg ctttagccgt aaaaccaaac agcaatatgt
       61 ctcttcggaa aaagacggaa aggcgactgg ctggtcagca ttttatgttg atggcaaatg
      121 ggttgaagga aaaaaataac ctttaattct gtcaggtttt tataaacaaa gggtcgcgaa
      181 agcggccctt ttttattgca tattattttt tcttcacacc tatacactaa ggctataaat
      241 gatatagtgg ttatagttag cacctttttt attattaaat cgtattagtc acccgccagg
      301 tgtgacgaaa aaacgatgtt ctgatggcgt ctaagtggat ggtttaacat gaaattacaa
      361 caacttcgct atattgttga ggtggtcaat cataacctga atgtctcatc aacagcggag
      421 ggactttaca catcacaacc cgggatcagt aaacaagtca gaatgctgga agacgagcta
      481 ggcattcaaa ttttttcccg aagcggcaag cacctgacgc aggtaacgcc agcagggcaa
      541 gaaataattc gtatcgctcg cgaagtcctg tcgaaagtcg atgccataaa atcggttgcc
      601 ggagagcaca cctggccgga taaaggttca ctgtatatcg ccaccacgca tacccaggca
      661 cgctacgcat taccaaacgt catcaaaggc tttattgagc gttatcctcg cgtttctttg
      721 catatgcacc agggctcgcc gacacaaatt gctgatgccg tctctaaagg caatgctgat
      781 ttcgctatcg ccacagaagc gctgcatctg tatgaagatt tagtgatgtt accgtgctac
      841 cactggaatc gggctattgt agtcactccg gatcacccgc tggcaggcaa aaaagccatt
      901 accattgaag aactggcgca atatccgttg gtgacatata ccttcggctt taccggacgt
      961 tcagaactgg atactgcctt taatcgcgca gggttaacgc cgcgtatcgt tttcacggca
     1021 acggatgctg acgtcattaa aacttacgtc cggttagggc tgggggtagg ggtcattgcc
     1081 agcatggcgg tggatccggt cgccgatccc gaccttgtgc gtgttgatgc tcacgatatc
     1141 ttcagccaca gtacaaccaa aattggtttt cgccgtagta ctttcttgcg cagttatatg
     1201 tatgatttca ttcagcgttt tgcaccgcat ttaacgcgtg atgtcgttga tgcggctgtc
     1261 gcattgcgct ctaatgaaga aattgaggtc atgtttaaag atataaaact gccggaaaaa
     1321 taatttcttg cgttattttc ggcacctttt atgtagcgaa ggtgccggaa tatattctct
     1381 tttgttactt atttctatac cgtgcac
//
LOCUS       HUMARHIII      60 bp ds-DNA             BAD       04-JUL-1990
DEFINITION  Synthetic DNA encoding angiogenin/RNase hybrid (ARH-III), partial
            cds.
ACCESSION   M34329
KEYWORDS    .
SOURCE      Human DNA, clone pAng2.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 60)
  AUTHORS   Bond,M.D. and Vallee,B.L.
  TITLE     Replacement of residues 8-22 of angiogenin with 7-21 of RNase a
            selectively affects protein synthesis inhibition and angiogenesis
  JOURNAL   Biochemistry 29, 3341-3349 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >    60     angiogenin/RNase protein (AA 4-23) (AA at 1)
BASE COUNT       17 a     18 c     13 g     12 t
ORIGIN      
        1 tcgaggtata caaaattcga acgccagcac atggactctt cgacaagcgc tgcctccgat
//
LOCUS       HUMIGIIDO    1914 bp ss-mRNA            PRI       04-JUL-1990
DEFINITION  Human interferon-gamma-inducible indoleamine 2,3-dioxygenase (IDO)
            mRNA, complete cds.
ACCESSION   M34455
KEYWORDS    indoleamine 2,3-dioxygenase.
SOURCE      Human diploid fibroblast cell line FS-4, cDNA to mRNA, clone C42.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1914)
  AUTHORS   Dai,W. and Gupta,S.L.
  TITLE     Molecular cloning, sequencing and expression of human interferon-
            gamma-inducible indoleamine 2,3-dioxygenase cDNA
  JOURNAL   Biochem. Biophys. Res. Commun. 168, 1-8 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        640     1851     indoleamine 2,3-dioxygenase (IDO) (EC
                                 1.13.11.17)
BASE COUNT      563 a    396 c    427 g    528 t
ORIGIN      
        1 tgagaagggc aaatgctatc attggaaaaa ctgacaaaag tcccaatagg aaaaataagg
       61 aagtggagag ttactatgtt tctaattttt catgtgcttc tatttttttc ctacttcaga
      121 gccattgact aatagttgag tataacacag gttgtgtttc cgggctgctg aaacatgaca
      181 ctaatatttt caaagaactg tggaagccta aaaggaagcc aatgagaaat aactaaatga
      241 gagtttagga ctgcagcctt cattttcatt caaagattta aaagtttcca taaagtaaaa
      301 tgttcttctc cggccacctg ttttcatagt tctgtgtttt ccttcaggcc tttctggctt
      361 cctatatggc agtaagaaaa tgatgtgctt aatgattaca aatttcatat ggaatacgaa
      421 ctttcagttt gtacatatga tgcacagaga tgcttttgtg gttttattgg ttttcatatt
      481 acaaacaaag aaactagaaa atgaaaccat tccaaaagtg gaagtaattt ctcactgccc
      541 ctgtgataaa ctgtggtcac tggctgtggc agcaactatt ataagatgct ctgaaactct
      601 tcagacactg agggggcacc agaggagcag actacaagaa tggcacacgc tatggaaaac
      661 tcctggacaa tcagtaaaga gtaccatatt gatgaagaag tgggctttgc tctgccaaat
      721 ccacaggaaa atctacctga tttttataat gactggatgt tcattgctaa acatctgcct
      781 gatctcatag agtctggcca gcttcgagaa agagttgaga agttaaacat gctcagcatt
      841 gatcatctca cagaccacaa gtcacagcgc cttgcacgtc tagttctggg atgcatcacc
      901 atggcatatg tgtggggcaa aggtcatgga gatgtccgta aggtcttgcc aagaaatatt
      961 gctgttcctt actgccaact ctccaagaaa ctggaactgc ctcctatttt ggtttatgca
     1021 gactgtgtct tggcaaactg gaagaaaaag gatcctaata agcccctgac ttatgagaac
     1081 atggacgttt tgttctcatt tcgtgatgga gactgcagta aaggattctt cctggtctct
     1141 ctattggtgg aaatagcagc tgcttctgca atcaaagtaa ttcctactgt attcaaggca
     1201 atgcaaatgc aagaacggga cactttgcta aaggcgctgt tggaaatagc ttcttgcttg
     1261 gagaaagccc ttcaagtgtt tcaccaaatc cacgatcatg tgaacccaaa agcatttttc
     1321 agtgttcttc gcatatattt gtctggctgg aaaggcaacc cccagctatc agacggtctg
     1381 gtgtatgaag ggttctggga agacccaaag gagtttgcag ggggcagtgc aggccaaagc
     1441 agcgtctttc agtgctttga cgtcctgctg ggcatccagc agactgctgg tggaggacat
     1501 gctgctcagt tcctccagga catgagaaga tatatgccac cagctcacag gaacttcctg
     1561 tgctcattag agtcaaatcc ctcagtccgt gagtttgtcc tttcaaaagg tgatgctggc
     1621 ctgcgggaag cttatgacgc ctgtgtgaaa gctctggtct ccctgaggag ctaccatctg
     1681 caaatcgtga ctaagtacat cctgattcct gcaagccagc agccaaagga gaataagacc
     1741 tctgaagacc cttcaaaact ggaagccaaa ggaactggag gcactgattt aatgaatttc
     1801 ctgaagactg tgagaagtac aactgagaaa tcccttttga aggaaggtta atgtaaccca
     1861 acaagagcac attttatcat agcagagaca tctgtatgca ttcctgtcat tacc
//
LOCUS       RATRPL35AA    451 bp ss-mRNA            ROD       04-JUL-1990
DEFINITION  Rat 60S ribosomal subunit protein L35 mRNA, complete cds.
ACCESSION   M34331 X51705
KEYWORDS    ribosomal protein.
SOURCE      Rat, cDNA to mRNA, clone pL35-[6,9].
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 451)
  AUTHORS   Suzuki,K., Olvera,J. and Wool,I.G.
  TITLE     The primary structure of rat ribosomal protein L35
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 1377-1382 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         48      419     ribosomal protein L35
    mRNA    <     1      451     ribosomal protein L35 mRNA
BASE COUNT      138 a    126 c    121 g     66 t
ORIGIN      
        1 tcctctttct ctcgaaccgg gcggcctcat ccgcgaacag cgcagcaatg gccaagatta
       61 aggctcggga cctgcgcggc aagaagaagg aggagctgtt gaaacaactg gatgatctga
      121 aggtggaact gtcccagctt cgcgtggcca aagtgacagg cggcgccgcg tccaagctct
      181 ccaagatacg agtcgtacgc aaatccatcg cccgtgtcct cactgtcatt aatcagactc
      241 aaaaggaaaa cctcaggaaa ttctacaagg gaaagaagta caagcccctg gacctgcgac
      301 ccaagaagac aagagccatg cgccgccggc tcaccaagca tgaagagaag ctgaagacca
      361 agaagcagca gcggaaggag cggctgtacc cactgcgcaa gtacgcagtc aaggcctgag
      421 acgacgacaa taacaataaa gtccaaaact g
//
LOCUS       SCMIMP23A    1058 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  S.mansoni 23 kDa integral membrane protein (Sm23) mRNA, complete
            cds.
ACCESSION   M34453
KEYWORDS    integral membrane protein.
SOURCE      S.mansoni (strain Puerto Rican), cDNA to mRNA, clone 3.7.2.
  ORGANISM  Schistosoma mansoni
            Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda; 
            Digenea; Strigeidida; Strigeata; Schistosomatoidea; 
            Schistosomatidae.
REFERENCE   1  (bases 1 to 1058)
  AUTHORS   Wright,M.D., Henkle,K.J. and Mitchell,G.F.
  TITLE     An immunogenic Mr 23,000 integral membrane protein of Schistosoma
            mansoni worms that closely resembles a human tumor-associated
            antigen
  JOURNAL   J. Immunol. 144, 3195-3200 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         16      672     23 kDa integral membrane protein (Sm23)
    mRNA    <     1     1058     23 kDa integral membrane protein mRNA
BASE COUNT      269 a    189 c    219 g    381 t
ORIGIN      
        1 gtcccgtttg aaagtatggc aacgttgggt actggtatgc gttgtctaaa aagttgcgta
       61 ttcgttttga acatcatctg tctgttatgt tccctggtat taattggggc tggtgcctat
      121 gtggaagtca agtttagtca atatggagac aatttacaca aagtctggca ggcagcacct
      181 atcgctatta ttgttgtcgg agtaataatc ctgatagtga gctttctggg ttgttgtgga
      241 gccataaagg aaaacgtctg catgctgtat atgtatgcgt tcttcctcgt cgtacttctg
      301 attgctgagt tggccgctgc catagttgca gttgtttaca aagataggat cgattcagaa
      361 attgatgcat taatgactgg tgctctggat aaaccgacaa aagagataac tgaattcatg
      421 aatttgattc aatcatcatt ccattgctgt ggagccaaag gccctgatga ttatcgtggt
      481 aacgtgccag cctcgtgcaa agaagagaat ttgacctata ccgaaggctg tgtatctgtc
      541 tttggagcat tcttaaaacg caacctggtg attgttgcct gtgttgcatt tggtgtatgc
      601 tttttccagt tgttaagcat tgtcatagcc tgctgtttgg gtcgccaaat aaaggagtat
      661 gagaatgtgt aaacactgca aagaaaacag tatgatgttt atcaatcccg ttctgttttc
      721 cctctggctt ttatgaaatg ttgctttttt attgcctaga taattgtgcc ttggctaata
      781 atcgtgtact cgacttcgtt tacgatataa ttattatact taaatatgat actgtcattc
      841 tcattgtttc acttatcatt cttataccaa atgagctgca ttgattccct ggtatttacc
      901 tacaaactga tgacaaatca tcttcatttt tatggtgtgt cttataacat ttctcgttta
      961 tataatttta agtctttggt ataccttttg ttttacagac cgcttatttt gccgccacag
     1021 cattagacat ttcttgtata aattaattcc gttaactt
//
LOCUS       STMMELCLAC     63 bp ds-DNA             BAD       04-JUL-1990
DEFINITION  S.antibioticus melC-lacZ translational fusion protein mRNA, 5' end.
ACCESSION   M34505
KEYWORDS    .
SOURCE      S.antibioticus DNA, clones pIF41[3,5].
  ORGANISM  Streptomyces antibioticus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 63)
  AUTHORS   Tseng,H.-C., Lin,C.-K., Hsu,B.-J., Leu,W.-M., Lee,Y.-H.W.,
            Chiou,S.-J., Hu,N.-T. and Chen,C.W.
  TITLE     The melanin operon of Streptomyces antibioticus: Expression and use
            as a marker in gram-negative bacteria
  JOURNAL   Gene 86, 123-128 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >    63     lacZ-melC fusion protein
    pept         49  >    63     melC protein
BASE COUNT       16 a     20 c     15 g     12 t
ORIGIN      
        1 atgaccatga ttacgccaag ctcgcgaggc ctcgagatct atcgatgcat gccggaactc
       61 acc
//
LOCUS       STMMELCTAC    101 bp ds-DNA             BAD       04-JUL-1990
DEFINITION  S.antibioticus tac promoter and melC gene, 5' end.
ACCESSION   M34506
KEYWORDS    .
SOURCE      S.antibioticus DNA, clone pMELA1.
  ORGANISM  Streptomyces antibioticus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 101)
  AUTHORS   Tseng,H.-C., Lin,C.-K., Hsu,B.-J., Leu,W.-M., Lee,Y.-H.W.,
            Chiou,S.-J., Hu,N.-T. and Chen,C.W.
  TITLE     The melanin operon of Streptomyces antibioticus: Expression and use
            as a marker in gram-negative bacteria
  JOURNAL   Gene 86, 123-128 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         87  >   101     melC protein
BASE COUNT       30 a     23 c     23 g     25 t
ORIGIN      
        1 ccctgttgac aattaatcat cggctcgtat aatgtgtgga attgtgagcg gataacaatt
       61 tcacacagga aacaggatca gcttgcatgc cggaactcac c
//
LOCUS       SUSSPERA     2150 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  S.purpuratus speract egg protein mRNA, complete cds.
ACCESSION   J02896
KEYWORDS    egg protein; speract.
SOURCE      Sea urchin (library lambda-Zap), cDNA to mRNA.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; 
            Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE   1  (bases 1 to 2150)
  AUTHORS   Ramarao,C.S., Burks,D.J. and Garbers,D.L.
  TITLE     A single mRNA encodes multiple copies of the egg peptide speract
  JOURNAL   Biochemistry 29, 3383-3388 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         98      988     speract protein precursor
    pept        206      988     speract protein precursor
    matp        500      529     speract protein
    matp        767      796     speract protein
    matp        800      829     speract protein
    matp        833      862     speract protein
    matp        866      895     speract protein
    matp        500      529     speract protein homolog (put.)
    matp        668      697     speract protein homolog (put.)
    matp        701      730     speract protein homolog (put.)
    matp        734      763     speract protein homolog (put.)
    matp        899      928     speract protein homolog (put.)
    matp        932      961     speract protein homolog (put.)
    signal     1320     1325     polyA signal
    signal     1463     1468     polyA signal
BASE COUNT      656 a    442 c    437 g    615 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcggct actcgacttt acgctagatc tctaccgagc tttctcgtcg aatagaacaa
       61 ccgaccgacc cacgtcgttg aagatgaagt tcatcgtatg cctcctggcc ctcgtggcgt
      121 agcttcgggc aagcctgtca tcagcggtca ggaccagaag tacacccttg ctgattccag
      181 caaggatcta gagggcattg ctcacatggc tatcgtcgac tcactttccc ctctacacat
      241 ctcgctctcc tctttggaat ctgcgtggaa caacctcctc aacattgctt ctcaggaaga
      301 ctactctatc ccggaacttt ccatacccaa aatcgacgtg aagagcatct tgtcatgcaa
      361 gcccaagtat tctcccaagt accctgtggt cttgcagtac atctctgatc attacgtcca
      421 ggtccaggac catatcgcca acgccaagga acttacagaa ggactgaaat tcgtctcgca
      481 gctcatcatg tacaagaaga tcgaccacga taccctggcc tctgtctcta aaatgttgag
      541 caattacttg actgactacg catcgacaat ctcatcactt aaatcagtcg tctgtcaaga
      601 tcaaacagcg ccatctcatc ctatggacga gagttacatg gacacacccc tgtcaatgat
      661 cttgaaaggc accatgccta ctggcgctgg agtagacaaa ggcttcgcac ttggtggtgg
      721 tggagttggt aaaggattca acttgaacgg cggtggagtt ggcaaaggct ttgacttgaa
      781 cggcggtgga gttggcaaag gattcgactt gaacggcggt ggagttggca aaggctttga
      841 cttgaacggc ggtggagttg gcaaaggctt tgacttgaac ggcggtggag ttggcaaagg
      901 tttcgctttg ggcggtggtg gagttggcaa aggattcagc ctcaccggag gaggcgtcgg
      961 cagggaagtc gaaatcaaag actggtaatc gacgttctgg cgctcaatag atctgacatc
     1021 aaatgccatg cgcccgtaca ggaatttcat aattgccatt tttttttaca ctgatatcaa
     1081 tactttggtc gtgcctgtcc aattatacaa atatgtatta aataatagca tcaaatgcaa
     1141 aaaaaaaaaa aaaaaaaaaa gggtcttgtt ttcttgagct tgcaatgatc aaccatcatc
     1201 cagtgtcgtc atgatggtct gcgtcacaag agaggcgata agcaaaattt aagactcaag
     1261 atggtatgtt aaactttgat cttaaggacg ttttctttgt ttcctcatca aatattgaaa
     1321 ataaattttt aacaaaaaaa attacatatg cctgtaaaat ggctatttca gcatttttct
     1381 ctgttttcta cgattcatat ttctttttca ttggaaaaac tagaaagaat atataaaata
     1441 taaaatattt tcaaaatagt ttaataaatt atttccaaaa gacaactttg tcaaaaagaa
     1501 ttgaaaaaag aagaaaatga attgaaagca aaattttctt ttcaaaatgt cagcaactga
     1561 tagtgatagc cgcaggtatt acatcaggca aatcaaacag gtaaaggtca tacaatgtac
     1621 cttctagatg ttactttaca agattttaga tctactttgg tatcttcttt cctttttcct
     1681 ttgattgtaa actcctttca agtaatacga attaatacca taagttgcac ctgcctagcc
     1741 tatacagtcg acactgccca caaacaaatc gagaagtata gaataacgca acataaaaaa
     1801 ggaaagattt tggtagctgt gctcttttcc tcttttctta tctttcgaat gtttactctc
     1861 ctgtactcat ttaaaatggt aaattaaaaa gagagccaga tttttatttg gaaggttgca
     1921 aatcgattga cagataggtc atcagtgtag agctttgata caagtgacaa aaacggtcca
     1981 tacgcttgtt gagcaatatg gttacttcat gaaggccttg tatatttcac ataatgataa
     2041 caaatactag aagtcaatga tgcattgctg caaagaattt tgataacgtg actttctttg
     2101 tagtctggtg cttggtaaaa tccagagacg caatgaaaat acttgaattc
//
LOCUS       TETTCBP23A    751 bp ss-mRNA            INV       04-JUL-1990
DEFINITION  T.thermophila calcium-binding protein (TCBP-23) mRNA, complete cds.
ACCESSION   J05227
KEYWORDS    calcium-binding protein.
SOURCE      T.thermophila (strain B1868), cDNA to mRNA, clone TCBP-23.
  ORGANISM  Tetrahymena thermophila
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE   1  (bases 1 to 751)
  AUTHORS   Takemasa,T., Takagi,T., Kobayashi,T., Konishi,K. and Watanabe,Y.
  TITLE     The third calmodulin family protein in Tetrahymena: Cloning of the
            cDNA for Tetrahymena calcium-binding protein of 23 kDa (TCBP-23)
  JOURNAL   J. Biol. Chem. 265, 2514-2517 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         28      651     calcium-binding protein (TCBP-23)
    signal      661      666     polyA signal
    signal      676      681     polyA signal
BASE COUNT      249 a    120 c    152 g    230 t
ORIGIN      
        1 gtagaaagca aacaaacact aaaagaaatg gaacaccaaa tcatcaccta aaacgtctac
       61 gctcctgata ctgaagccaa gcttgatgtt gctagaaagc tttttgctta gtttgattct
      121 aacaagaacg gtactttaga tcctagcgaa gttgctggac tcatcaagac tacttttgaa
      181 aatatgggtg ttaaggacta cagcgtcact gctgatgatg tcaagcttta catgaagagt
      241 gttgatgttg ataacaacgg tcttgtttcc tactctgaat acgaagaata cgtcattgct
      301 tgcctcaaga aagctggctt cgactgtgaa gttaagcaaa aggtaaaaag atctgccaaa
      361 aagagagacg ctgctactga aatgaagttg gacgttgcca gaagactctt cgctaagtac
      421 gactctgata agagtggtta attagaagaa aaggaagttt atggtgtcat tactgaaacc
      481 tataagcaaa tgggtatgga ttacaagccc actgaagctg atgttaagct ttggatgtcc
      541 atgactgata ctgataagaa tggaactgtc tctattgttg aatatgaaga tttcgtcatt
      601 tctggtctta agaaggctgg tttcatggtc aaggaattca ctcaagcttg atcttatttt
      661 aataaacata tttcaaataa aaattgaact gtatttgtgt atttcatgtc atttcttaaa
      721 attttaaatg atttttttaa tatcgttaaa a
//
LOCUS       TRNRESRA      480 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  Transposon Tn2501 putative res region.
ACCESSION   M34377
KEYWORDS    .
SOURCE      Transposon Tn2501 DNA, clone pTM41.
  ORGANISM  Transposon Tn2501
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 480)
  AUTHORS   Michiels,T. and Cornelis,G.
  TITLE     Site-specific recombinations between direct and inverted res sites
            of Tn2501
  JOURNAL   Plasmid 22, 249-255 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site        285      310     inverted repeat region I
    site        325      356     inverted repeat region II
    site        366      387     inverted repeat region III
BASE COUNT      125 a    110 c    110 g    135 t
ORIGIN      
        1 gcgattctgg atatgacgct actgtgaggg acattgcttt tgtccggaat aaatccaggt
       61 ccacgcagat aacagagtaa tacagcaaag cccaggcgat ttgctggccg ccggtgttta
      121 ttaatgagtg caatatcctg ttcattcaga aaacacattc gggtcagaat aatttcatca
      181 tctggaataa ccagcaagcg ctcctgctct tcactgctca gtatctgtcg ccgtggcata
      241 aatgcttccc tcgcgtatag tttccatatg ttatggattg gctatgaccg aaaccggtcg
      301 gtttcgtaca ccttctaata tatctgtccg aaagtaagtg aaattcattt cgtacaggcg
      361 ctataattcg gacatccatt tcgtacggaa agtttcctat gtcacgagtt tttgcttact
      421 gccgggtttc gactcttgag caaaccaccg agaatcagcg aagggaaatt gaagcggcgg
//
LOCUS       HUMALRMB     1354 bp ss-mRNA            PRI       04-JUL-1990
DEFINITION  Human aldose reductase mRNA, complete cds.
ACCESSION   J05474
KEYWORDS    aldose reductase.
SOURCE      Human adult and fetus retina, and muscle, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1354)
  AUTHORS   Nishimura,C., Matsuura,Y., Kokai,Y., Akera,T., Carper,D.,
            Morjana,N., Lyons,C. and Flynn,T.G.
  TITLE     Cloning and expression of human aldose reductase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Nishimura, 26-APR-1990.
FEATURES       from  to/span     description
    pept         33      983     aldose reductase (EC 1.1.1.21)
    mRNA    <     1     1354     aldose reductase mRNA
BASE COUNT      320 a    352 c    365 g    317 t
ORIGIN      
        1 cggtacgtgc ggccttgggg agcgcagcag ccatggcaag ccgtctcctg ctcaacaacg
       61 gcgccaagat gcccatcctg gggttgggta cctggaagtc ccctccaggg caggtgactg
      121 aggccgtgaa ggtggccatt gacgtcgggt accgccacat cgactgtgcc catgtgtacc
      181 agaatgagaa tgaggtgggg gtggccattc aggagaagct cagggagcag gtggtgaagc
      241 gtgaggagct cttcatcgtc agcaagctgt ggtgcacgta ccatgagaag ggcctggtga
      301 aaggagcctg ccagaagaca ctcagcgacc tgaagctgga ctacctggac ctctacctta
      361 ttcactggcc gactggcttt aagcctggga aggaattttt cccattggat gagtcgggca
      421 atgtggttcc cagtgacacc aacattctgg acacgtgggc ggccatggaa gagctggtgg
      481 atgaagggct ggtgaaagct attggcatct ccaacttcaa ccatctccag gtggagatga
      541 tcttaaacaa acctggcttg aagtataagc ctgcagttaa ccagattgag tgccacccat
      601 atctcactca ggagaagtta atccagtact gccagtccaa aggcatcgtg gtgaccgcct
      661 acagccccct cggctctcct gacaggccct gggccaagcc cgaggaccct tctctcctgg
      721 aggatcccag gatcaaggcg atcgcagcca agcacaataa aactacagcc caggtcctga
      781 tccggttccc catgcagagg aacttggtgg tgatccccaa gtctgtgaca ccagaacgca
      841 ttgctgagaa ctttaaggtc tttgactttg aactgagcag ccaggatatg accaccttac
      901 tcagctacaa caggaactgg agggtctgtg ccttgttgag ctgtacctcc cacaaggatt
      961 accccttcca tgaagagttt tgaagctgtg gttgcctgct cgtccccaag tgacctatac
     1021 ctgtgtttct tgcctcattt ttttccttgc aaatgtagta tggcctgtgt cactcagcag
     1081 tgggacagca acctgtagag tggccagcga gggcgtgtct agcttgatgt tggatctcaa
     1141 gagccctgtc agtagagtag aagtctcttc cagtttgctt tgcccttctt tctaccctgc
     1201 tggggaaagt acaacctgaa tacccttttc tgaccaaaga gaagcaaaat ctaccaggtc
     1261 aaaatagtgc cactaacggt tgagttttga ctgcttggaa ctggaatcct ttcagcaaga
     1321 cttctctttg cctcaaataa aaagtgcttt tgtg
//
LOCUS       CHTHYPABX    2468 bp ds-DNA             BCT       04-JUL-1990
DEFINITION  C.psittaci hypA and hypB protein genes, complete cds.
ACCESSION   M25101
KEYWORDS    .
SOURCE      C.psittaci (strain GPIC) elementary body DNA, clone pGP57.
  ORGANISM  Chlamydia psittaci
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE   1  (bases 1 to 2468)
  AUTHORS   Morrison,P.R., Belland,R.J., Lyng,K. and Caldwell,H.D.
  TITLE     Chlamydial disease pathogenesis: The chlamydial hypersensitivity
            antigen is a stress response protein
  JOURNAL   J. Exp. Med. 170, 1271-1283 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.P.Morrison, 02-JUN-1989.
FEATURES       from  to/span     description
    pept        314      622     hypA protein
    pept        673     2307     hypB protein
    signal       83       88     -35 region
    signal      120      127     -10 region
BASE COUNT      855 a    462 c    514 g    637 t
ORIGIN      
        1 gaattcttaa caaagaagat aacgctctcg aatcgtacat gaacttctta aaaagtggtg
       61 gctccgactt cccattagaa atcttgaaga agtccggatt ggatatggca accagcgagc
      121 ccatatacaa ggccttttcc ttcatagaga gaaaaattca agagttatca tctttaattt
      181 aaaacaacta aagaaaagta gcacttgata ttattaagtg ctaaaatcat tgccaaaaaa
      241 cgagagactt tggtatcgtt cctgagaaac ggcaaagtct cttttagaac aagaaacaca
      301 aggagcttat aacatgtcag atcaagcaac gacccttagg attaagcccc tgggcgatag
      361 aattttagtg aaaagagaag aagaagattc tacagcgcgc ggcggcatca ttttacctga
      421 tacagcaaag aaaaaacagg atcgagcaga ggtattagtc ctaggcactg gaaaacgaga
      481 taaagatggc aacgtcctac cttttgaagt taccgtgggt gatactgttt taatagataa
      541 atacgcggga caagaactta ccgttgatgg tgaggagtac gtcattgttc aggaaagcga
      601 agttatggca gttctcaagt aagagaaatc attatttata gattgcaaaa agttaaggag
      661 cacaaaaaaa caatggcagc aaaaaatatt aaatataacg aagacgccag aaaaaaaatc
      721 cataaaggag ttaaaaccct tgcagaagct gtaaaagtaa ccttaggtcc taaaggccgt
      781 catgtggtta tcgataaaag ctttggttct cctcaagtta ccaaagatgg cgtaactgtc
      841 gctaaagaaa ttgagctcga agacaagcat gagaacatgg gagctcaaat ggtaaaagaa
      901 gtcgctagca aaactgcaga taaagctggt gatggaacta caacagctac tgttcttgca
      961 gaagctatct acagtgaagg attgagaaac gtaactgcag gcgccaatcc tatggacctc
     1021 aaaagaggca ttgataaggc agtaaaagtc gttgtcgatg aaatcaaaaa aattagtaaa
     1081 cccgtacaac atcacaaaga aatagctcaa gtagcgacta tttctgcaaa taatgatgct
     1141 gaaatcggta atcttatcgc cgaagccatg gaaaaagttg gcaaaaacgg ctctattact
     1201 gttgaagaag ctaaaggttt cgaaactgtc ctcgacgttg tcgaaggtat gaatttcaac
     1261 cgcggatacc tatccagcta cttctctaca aatcctgaaa cacaagaatg tgttttagaa
     1321 gaagctctcg tgcttatcta tgataaaaaa atttccggaa tcaaagattt tctaccagtt
     1381 ttacaacaag tagcagaatc aggacgtccc ctacttatca ttgctgaaga tatcgaagga
     1441 gaagctttag ctactttagt agtaaacaga ctacgtgctg gattcagagt gtgtgcagta
     1501 aaagctcctg gatttggtga tagaagaaaa gctatgttag aagacatcgc tattttaact
     1561 ggtggtcaac tcatcagcga agagcttggc atgaagcttg agaacacaac tctagctatg
     1621 ttaggaaaag ctaaaaaagt catcgtttcc aaagaagata caacaattgt tgaaggtctt
     1681 ggcagcaaag aagatattga atctcgctgc gaaagtatca aaaaacaaat cgaagacagt
     1741 acttctgatt acgacaaaga aaaactccaa gaacgtttag ctaaactttc cggaggcgta
     1801 gctgtaatcc gtgtaggagc tgctacagaa atcgaaatga aagagaaaaa agacagagta
     1861 gatgatgctc agcatgcaac tcttgctgca gttgaagaag gtattctacc tggcggtggt
     1921 acagctttag ttcgctgcat ccctacttta gaagctttca ttcctattct tacaaatgaa
     1981 gatgagcaaa tcggagcacg tattgttctc aaagcattat ccgctccatt aaagcaaatt
     2041 gcagccaatg ctggtaaaga aggcgctatc atctgtcaac aagtgctttc tcgctcctct
     2101 agcgaaggct atgatgcttt acgcgatgct tacaccgaca tgattgaggc aggaattctc
     2161 gatccaacta aagttacacg ttgtgcttta gaaagcgcag cttctgtagc tgggcttcta
     2221 ttaacaacag aagctttaat tgccgatatt cctgaagaga aatcctcttc tgctcccgca
     2281 atgccaggcg caggaatgga ttattaatcc ttaatttaga gagcattttc tctaatatta
     2341 taaggtctcc tttcatccat cttgagaaga aagggggcct tttttatttt ctaatatttc
     2401 tttcttcatc tatgttggaa accaagataa atcatattct catcatgcat gtttaaactt
     2461 ttaaagaa
//
LOCUS       TRBPGKM      3306 bp ds-DNA             INV       04-JUL-1990
DEFINITION  T.brucei phosphoglycerate kinase isozyme-8 (cPGK-8, gPGK) genes,
            complete cds.
ACCESSION   M33775
KEYWORDS    phosphoglycerate kinase.
SOURCE      T.brucei (strain EATRO 164) DNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 3306)
  AUTHORS   Alexander,K., Parail,A.C. and Parsons,M.
  TITLE     An allele of Trypanosoma brucei cytoplasmic phosphoglycerate kinase
            is a mosaic of other alleles and genes
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Authorin submission for [1] kindly submitted by M.Parsons,
            18-APR-1990.
            
              Author address: M.Parsons
              Seattle Biomedical Research Institute
              4 Nickerson Street
              Seattle, WA 98109-1651
              (206) 284-8846
            
FEATURES       from  to/span     description
    pept        175     1437     phosphoglycerate kinase (cPGK-8; E.C. 2.7.2.3)
    pept       1792     3114     phosphoglycerate kinase (gPGK; E.C. 2.7.2.3)
    allele      406      406     c in allele 8; g in allele 2
    allele      466      467     cg in allele 8; gc in allele 2
    allele      603      604     ga in allele 8; gaaga in allele 2
    allele      640      640     g in allele 8; t in allele 2
    allele      729      730     tc in allele 8; ca in allele 2
    allele      783      783     g in allele 8; a in allele 2
    allele      840      840     a in allele 8; c in allele 2
    allele      944      950     aatcgaa in allele 8; atccatt in allele 2
    allele     1018     1018     g in allele 8; t in allele 2
    allele     1107     1107     g in allele 8; a in allele 2
    allele     1425     1428     cgaa in allele 8; tgat in allele 2
    allele     1439     1457     attcatcctgttacgggcc in allele 8;
                                 gtacatcaggcgaagggtt in allele 2
    allele     1483     1483     t in allele 8; g in allele 2
    allele     1512     1512     t in allele 8; c in allele 2
    allele     1537     1540     ttta in allele 8; ta in allele 2
    allele     2015     2015     a in allele 8; g in allele 2
    allele     2083     2084     cg in allele 8; gc in allele 2
    allele     2136     2136     t in allele 8; c in allele 2
    allele     2151     2151     t in allele 8; c in allele 2
    allele     2196     2199     cttt in allele 8; gttc in allele 2
    allele     2268     2268     c in allele 8; c in allele 2
    allele     2400     2400     g in allele 8; a in allele 2
    allele     2919     2919     c in allele 8; t in allele 2
    allele     3031     3031     g in allele 8; a in allele 2
    allele     3057     3057     g in allele 8; t in allele 2
    allele     3196     3197     tg in allele 8; gtttg in allele 2
    allele     3248     3248     t in allele 8; c in allele 2
    allele     3257     3257     g in allele 8; t in allele 2
    allele     3273     3274     ag in allele 8; agag in allele 2
    allele     3284     3286     ttg in allele 8; tg in allele 2
    allele     3297     3299     aat in allele 8; at in allele 2
BASE COUNT      863 a    659 c    833 g    951 t
ORIGIN      
        1 aaaccgaatt cccttcccca agtctcgcag tcacttcttt tcaacgtttt ctcacttagc
       61 caacatactt gtattttttg tttcaaaaca ttaaaaaatt gtaacaaggg agtttcttat
      121 ttttttgaaa aaactatata tatcgatata tacttatctg atcacaaatc aaatatgtca
      181 ctaaaagaga ggaagagcat taacgaatgc gatcttaagg gaaagaaggt tcttatccgt
      241 gttgacttta atgttcctct ggatgatggt aatattacca acgactaccg aatccgatca
      301 gctctgccag ccgtccaaaa agttctcaca gaaggcggca gctgtgttct catgagccac
      361 ctcgggaggc cgaaaggcgt ttctatggct gaaggcaaag aactgcggag cgctggcggt
      421 attcccgggt tcgagcagaa ggcaacactc aaaccggtag ccaagcgcct cagcgaattg
      481 ttatcgagac ccgtcacatt cgcacctgac tgtctgaacg ctgcagatgt cgtctctaag
      541 atgtctccgg gcgatgttgt tctgcttgaa aatgtacggt tctacaaaga agagggcagc
      601 aagagcactg aggaacgtga agccatggcc aagatccttg cgtcatatgg tgatgtttac
      661 atcagtgatg cttttggtac agctcaccgt gacagtgcta ccatgaccgg aattccaaag
      721 attttgggtc acggtgctgc cggttatttg atggagaagg agatttcata cttcgctaag
      781 gtgcttggta acccgccgcg tccgctggtt gctatcgttg gtggagcgaa agtgagcgaa
      841 aagatccaac ttctggataa catgttgcag cgcatcgatt atctcttaat tggtggtgca
      901 atggcataca catttctgaa ggctcagggt tacagcattg gaaaatcgaa gtgcgaggaa
      961 agtaaacttg aatttgctcg atccctgctg aagaaggcgg aggaccgcaa ggtgcaggtt
     1021 attcttccaa ttgatcatgt ttgccacacg gaattcaaag ctgtggattc tccattgata
     1081 actgaggatc aaaacatccc tgaagggcat atggctctgg atattggtcc caagactatt
     1141 gaaaaatatg ttcagacgat tgggaagtgt aagagcgcca tttggaacgg tcccatgggt
     1201 gtatttgaaa tggttcctta ttccaaaggt acatttgcaa ttgcgaaagc catgggtcga
     1261 ggaactcacg agcatggact catgagtatc atcggtggtg gtgacagcgc aagtgcagct
     1321 gagttgagcg gtgaggcgaa gcgcatgtct catgtttcaa cgggtggtgg tgcgtctttg
     1381 gaactcctcg agggcaaaac gcttcccggt gttacagtat tggacgaaaa ggagtaatat
     1441 tcatcctgtt acgggcctgt tttatggaat tgtgtttttt agtccttttt atttgttggt
     1501 taggtattgg ttcgtacgtg actattattt ttttttttta ggataacatt tatgtttttt
     1561 ctctactcat tttatttttt ggtactatgt atcgattgca cagtaatatt tcaatcgcta
     1621 tgatatttct atctttttga tcgttctact gtgtaacttt gtgtttatct ttgttacttc
     1681 actctttttt cactcaaatc gtttgggctg caggctcagc tctttggtga tatcaaagca
     1741 taattgctgc ggagatacgt ttttccacct aataagtaat tgtgatacaa gatgaccctt
     1801 aacgagaaga agagcattaa tgaatgcgat cttaagggaa agaaggttct tatccgtgtt
     1861 gactttaatg ttcccgtgaa aaacggtaag atcaccaacg actaccgaat ccgatcagct
     1921 ctgccaacgc tcaagaaggt tctcacagaa ggcggcagct gtgttctcat gagccacctc
     1981 gggaggccga aaggtattcc catggcgcaa gctgacaaaa tacggagcac tggcggtgtt
     2041 cccgggttcc aacagaaggc aacactcaaa ccggtagcca agcgcctcag cgaactgcta
     2101 ttgaggcccg tcacattcgc acctgactgc ctgaatgctg cagatgtcgt ttctaagatg
     2161 tctccgggcg atgttgttct gcttgaaaat gtacgctttt acaaagaaga gggcagcaag
     2221 aaggcaaaag accgtgaagc catggccaag atccttgcgt catatggcga tgtttacatc
     2281 agtgatgctt ttggtacagc tcaccgtgac agtgctacca tgaccggaat tccaaagatt
     2341 ttgggcaacg gtgctgccgg ttatttgatg gagaaggaga tttcatactt cgctaaggtg
     2401 cttggtaacc cgccgcgtcc gctggttgct atcgttggtg gagcgaaagt gagcgacaag
     2461 atccaacttc tggataacat gttgcagcgc atcgattatc tcttaattgg tggtgcaatg
     2521 gcatacacat ttctgaaggc tcagggttac agcattggaa aatcgaagtg cgaggaaagt
     2581 aaacttgaat ttgctcgatc cctgctgaag aaggcggagg accgcaaggt gcaggttatt
     2641 cttccaattg atcatgtttg ccacacggaa ttcaaagctg tggattctcc attgataact
     2701 gaggatcaaa acatccctga agggcatatg gctctggata ttggtcccaa gactattgaa
     2761 aaatatgttc agacgattgg gaagtgtaag agcgccattt ggaacggtcc catgggtgta
     2821 tttgaaatgg ttccttattc caaaggtaca tttgcaattg cgaaagccat gggtcgagga
     2881 actcacgagc atggactcat gagtatcatc ggtggtggcg acagcgcaag tgcagctgag
     2941 ttgagcggtg aggcgaagcg catgtctcat gtttcaactg gtggtggtgc gtctttggaa
     3001 ctcctcgagg gcaaaacgct tcccggcgtt gcagtattgg acgaaaagtc ggcggtggtg
     3061 tcgtatgcct ctgcaggtac tggaactctt tctaaccggt ggagctctct ttaaacttcc
     3121 agaaaaaata tatttctgca aaatactttt ggaagtttgt cttgtcttta tagatgaagg
     3181 atttgtttct ttttttgtga tgttttcaag gttaattagt tttgggggtt cgttatctta
     3241 attattttgg tgggtgggag taaataaagc agaggtaaat tttttggtga cacaaaaatt
     3301 gggaag
//

GenBank-Updates@genbank.bio.net (07/06/90)

LOCUS       MUSLRPA      3070 bp ss-mRNA            ROD       06-JUL-1990
DEFINITION  Mouse leukocyte common antigen-related phosphatase mRNA, complete
            cds.
ACCESSION   M36033 M33671
KEYWORDS    leukocyte common antigen-related phosphatase;
            protein-tyrosine-phosphate phosphohydrolase; tyrosine phosphatase.
SOURCE      Mouse (strain (C57BL/6 X DBA/2)F1) pre-B cell line 70Z/3, cDNA to
            mRNA, clones lambda-mLRP-[B20,C26].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3070)
  AUTHORS   Matthews,R.J., Cahir,E.D. and Thomas,M.L.
  TITLE     Identification of an additional member of the
            protein-tyrosine-phosphatase family: Evidence for alternative
            splicing in the tyrosine phosphatase domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4444-4448 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.L.Thomas, 11-APR-1990.
FEATURES       from  to/span     description
    pept         26     2515     leukocyte common antigen-related phosphatase
                                 precursor (EC 3.13.48)
    sigp         26       82     leukocyte common antigen-related phosphatase
                                 signal peptide
    matp         83     2512     leukocyte common antigen-related phosphatase
    mRNA    <     1     3070     leukocyte common antigen-related phosphatase
                                 mRNA
    site        827      936     gg in short clone lambda-mLRP-B20
BASE COUNT      874 a    754 c    732 g    710 t
ORIGIN      
        1 ccgcccagcg ccgggctcgg tcagcatgga ttcctggttc attcttgtcc tgtttggcag
       61 tggtctaata catgttagtg ccaacaatgc tactacagtt tcaccttctt taggaacgac
      121 aagattaatt aaaacatcaa caacagaatt ggctaaggaa gagaataaaa cctcaaattc
      181 aacctcttca gtaatttctc tttctgtggc accaacattc agcccaaacc tgactctgga
      241 gcccacctat gtgactactg ttaattcttc acactctgac aatgggacca ggagggcagc
      301 cagcacggaa tctggaggca ctaccatttc cccgaacgga agctggctta ttgagaacca
      361 gttcacggat gccataacag aaccctggga ggggaactcc agcactgcag caaccactcc
      421 agaaaccttc cccccggcag atgagacacc aattattgcg gtgatggtgg ccctgtcctc
      481 tctgctagta atcgtgttta ttatcatagt tctgtacatg ttaaggttta agaaatacaa
      541 gcaagctggg agtcattcca actctttccg cctgtcaaat ggccgcacgg aggatgtgga
      601 gccccaaagt gtaccacttc tggccaggtc cccaagcacc aacaggaagt acccaccact
      661 gcctgtggac aagctggaag aggagattaa ccggagaatg gctgatgaca ataagatctt
      721 cagagaagaa ttcaacgctc tccctgcttg tcctatccag gccacctgtg aggctgcctc
      781 caaggaagaa aacaaggaaa aaaaccgcta tgtaaacatc ctgccctttc tctctttagc
      841 tgtgagcaag gatgcagtga aagcactgaa caaaaccact ccattgttag aaagaaggtt
      901 tattgggaaa tcaaactcca gaggctgtct ctcagatgac cactctagag tgcacctgac
      961 acctgttgaa ggggtcccag attctgatta catcaacgct tcattcatta atggctacca
     1021 ggaaaagaac aaattcatcg ctgcacaagg accaaaagaa gaaacagtga atgacttctg
     1081 gagaatgata tgggaacaaa acacagctac tattgtcatg gtgaccaacc tgaaggagag
     1141 aaaggagtgt aaatgtgccc aatactggcc agaccaaggc tgctggacct atgggaatgt
     1201 ccgtgtgtct gtcgaggatg tgactgttct ggtggactac acagtacgga aattctgcat
     1261 ccagcaggtg ggcgacgtga ccaacaggaa accacagcgc ctcatcactc agttccactt
     1321 caccagctgg ccagactttg gggtgccttt caccccaatt ggcatgctca agttcctcaa
     1381 gaaggtgaag gcctgtaacc ctcagtacgc aggggctatc gtggtccact gcagtgcagg
     1441 tgtagggcgc actggcacct ttgttgtcat cgatgccatg ctggacatga tgcattcgga
     1501 acgcaaagtg gatgtatacg ggtttgtgag ccggatccgg gcccagcgct gccagatggt
     1561 acagacagac atgcagtacg tcttcatata ccaggccctt ctggagcatt atctgtatgg
     1621 ggacacagaa ctggaagtga cttctctaga aacccaccta caaaaaattt ataacaagat
     1681 cccagggacg agcaacaacg ggttagagga ggagtttaag aaattaactt caatcaaaat
     1741 ccagaatgac aagatgcgca cgggaaacct tccagccaac atgaagaaga accgggtttt
     1801 acagatcatt ccatatgaat ttaacagagt gatcattcca gtcaaacgag gcgaagagaa
     1861 cacagactat gtgaacgcat ccttcattga tggataccgg cagaaagact cctacattgc
     1921 cagccagggc cctcttctcc acacgattga ggacttctgg cgaatgatct gggagtggaa
     1981 gtcctgttct atcgtaatgc tgacagaact ggaagagaga ggccaggaga agtgtgccca
     2041 gtactggcca tctgatggcc tggtgtccta tggagacatc acagttgagc tgaagaagga
     2101 ggaggaatgt gaaagctaca ctgtccgaga cctcctggtc accaacacca gggagaacaa
     2161 gagtcggcaa atccggcagt tccacttcca cggctggcct gaggtgggca tccccagcga
     2221 cggcaagggc atgatcaaca tcattgcagc agtgcagaag cagcagcagc agtcggggaa
     2281 ccatcccatc actgtgcact gcagtgccgg ggcaggacgg acaggaacct tctgtgcctt
     2341 gagcacagtc ctggaacgtg tgaaagcaga aggaatttta gatgtcttcc aaactgtcaa
     2401 gagcctgcgg ctgcagaggc cacacatggt ccagacactg gaacagtatg aattctgcta
     2461 caaggtggta caggagtaca ttgacgcctt ttcagattat gccaacttca agtgacaggt
     2521 gacaaggccc acagacagga gaattgcctt taatattttg taatattctg tttttgttaa
     2581 tatacccaaa attgtatata tcttataact gttttagaaa tggcacatag gcttctatta
     2641 cctgttaggt ggagattttg tatgtaaatg tgttagcact gatagtcctt ttccagtgtt
     2701 ttattgggaa attaaatagt gtgatatttg ggttgatata atgaaatcct cagcctggaa
     2761 actgggccag attgttcctt gcttcaaata tcttttccta aagaagataa acctaagact
     2821 cattccaggt agctcagtgc caactaaaac aaagcacaaa gttctcagag ctcttgagga
     2881 aatggttgtc tccctgtccc caggcaggcc tcttcccctc cctgtcctgt aaatatccct
     2941 cccctctcca gtccaccctc atctcccacc aagatcagcc acctcaggca tggggagtaa
     3001 tgagaccaga gcgcctctct ggcaccacag cagggatcgt caggtaataa acactcttga
     3061 ttccctgagg
//
LOCUS       MUSLRPB       144 bp ds-DNA             ROD       06-JUL-1990
DEFINITION  Mouse leukocyte common antigen-related phosphatase gene, exon N.
ACCESSION   M36034 M33671
KEYWORDS    leukocyte common antigen-related phosphatase;
            protein-tyrosine-phosphate phosphohydrolase; tyrosine phosphatase.
SOURCE      Mouse (strain (C57BL/6 X DBA/2)F1) pre-B cell line 70Z/3 DNA, clone
            lambda-mLRP-B20.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 144)
  AUTHORS   Matthews,R.J., Cahir,E.D. and Thomas,M.L.
  TITLE     Identification of an additional member of the
            protein-tyrosine-phosphatase family: Evidence for alternative
            splicing in the tyrosine phosphatase domain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4444-4448 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    /    25  /   132     leukocyte common antigen-related phosphatase,
                                 exon N (EC 3.13.48) (AA at 27)
    IVS     <     1       24     leukocyte common antigen-related phosphatase
                                 intron X
    IVS         133  >   144     leukocyte common antigen-related phosphatase
                                 intron X+1
BASE COUNT       50 a     30 c     30 g     34 t
ORIGIN      
        1 gcttactcaa acgatcatac aaagttctct ctttagctgt gagcaaggat gcagtgaaag
       61 cactgaacaa aaccactcca ttgttagaaa gaaggtttat tgggaaatca aactccagag
      121 gctgtctctc aggtcagaga aaac
//
LOCUS       HUMACOLA1      34 bp ds-DNA             PRI       06-JUL-1990
DEFINITION  Human alpha-1 type IX collagen (COL9A1) gene, exon 1.
ACCESSION   M34534
KEYWORDS    alpha-1 type IX collagen.
SEGMENT     1 of 4
SOURCE      Human DNA, clone YMh304.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 34)
  AUTHORS   Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
  TITLE     The alpha-1 (IX) collagen gene gives rise to two different
            transcripts in mouse embryonic and human fetal RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            B.R.Olsen, 15-FEB-1990.
FEATURES       from  to/span     description
    pept         19  /    32     alpha-1 type IX collagen precursor, exon 1
                                 /hgml_locus_uid="LG0170H" /nomgen="COL9A1"
                                 /map="6q12-q14"
    pep$         19  /    32     alpha-1 type IX collagen precursor, exon 1
    sigp         19  /    32     alpha-1 type IX collagen signal peptide
    sig$         19  /    32     alpha-1 type IX collagen signal peptide
    IVS          33  >    34     COL9A1 intron A
BASE COUNT       14 a      5 c      9 g      6 t
ORIGIN      
        1 agaaaatcaa ctgggaaaat gaagacctgc tggt
//
LOCUS       HUMACOLA2      88 bp ds-DNA             PRI       06-JUL-1990
DEFINITION  Human alpha-1 type IX collagen (COL9A1) gene, exon 6.
ACCESSION   M32135
KEYWORDS    alpha-1 type IX collagen.
SEGMENT     2 of 4
SOURCE      Human DNA, clone YMh617.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
  TITLE     The alpha-1 (IX) collagen gene gives rise to two different
            transcripts in mouse embryonic and human fetal RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            B.R.Olsen, 15-FEB-1990.
FEATURES       from  to/span     description
    pept    /     3  /    86     alpha-1 type IX collagen, exon 6 (AA at 3)
                                 /hgml_locus_uid="LG0170H" /nomgen="COL9A1"
                                 /map="6q12-q14"
    pep$    /     3  +    86     alpha-1 type IX collagen, exon 6 (AA at 3)
    IVS     <     1        2     COL9A1 intron E
    IVS          87  >    88     COL9A1 intron F
BASE COUNT       23 a     24 c     22 g     19 t
ORIGIN      Unknown number of bp after segment 1.
        1 agtttgaact tcaatggatg ctgatccatt gtgaccccct gcggcccagg acagaaactt
       61 gccatgagct gccagccaga ataacggt
//
LOCUS       HUMACOLA3      74 bp ds-DNA             PRI       06-JUL-1990
DEFINITION  Human alpha-1 type IX collagen (COL9A1) gene, exon 1*.
ACCESSION   M32133
KEYWORDS    alpha-1 type IX collagen.
SEGMENT     3 of 4
SOURCE      Human DNA, clone Ymh617.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 74)
  AUTHORS   Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
  TITLE     The alpha-1 (IX) collagen gene gives rise to two different
            transcripts in mouse embryonic and human fetal RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            B.R.Olsen, 15-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  +    72     alpha-1 type IX collagen, exon 1* (AA at 1)
                                 /hgml_locus_uid="LG0170H" /nomgen="COL9A1"
                                 /map="6q12-q14"
    IVS          73  >    74     COL9A1 intron G*
BASE COUNT        5 a     21 c     32 g     16 t
ORIGIN      Unknown number of bp after segment 2.
        1 atggcctgga ctgcgcggga ccgcggggcc ctggggctgc tgctgttggg gctctgcttg
       61 tgcgcggctc aagt
//
LOCUS       HUMACOLA4      25 bp ds-DNA             PRI       06-JUL-1990
DEFINITION  Human alpha-1 type IX collagen (COL9A1) gene, exon 7.
ACCESSION   M32137
KEYWORDS    alpha-1 type IX collagen.
SEGMENT     4 of 4
SOURCE      Human DNA, clone YMh617.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 25)
  AUTHORS   Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
  TITLE     The alpha-1 (IX) collagen gene gives rise to two different
            transcripts in mouse embryonic and human fetal RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            B.R.Olsen, 15-FEB-1990.
FEATURES       from  to/span     description
    pept    +     3  /    23     alpha-1 type IX collagen, exon 7
                                 /hgml_locus_uid="LG0170H" /nomgen="COL9A1"
                                 /map="6q12-q14"
    pep$    +     3  /    23     alpha-1 type IX collagen (COL9A1), exon 7
    IVS     <     1        2     COL9A1 intron G*
    IVS          24  >    25     COL9A1 intron H*
    IVS     <     1        2     COL9A1 intron F
    IVS          24  >    25     COL9A1 intron G
BASE COUNT        7 a     10 c      7 g      1 t
ORIGIN      Unknown number of bp after segment 3.
        1 agcccagcca gaccaccgac gaggt
//
LOCUS       MUSACOLA1      34 bp ds-DNA             ROD       06-JUL-1990
DEFINITION  Mouse alpha-1 type IX collagen (COL9A1) gene, exon 1.
ACCESSION   M34533
KEYWORDS    alpha-1 type IX collagen.
SEGMENT     1 of 4
SOURCE      Mouse DNA, clone YMm513.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 34)
  AUTHORS   Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
  TITLE     The alpha-1 (IX) collagen gene gives rise to two different
            transcripts in mouse embryonic and human fetal RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            B.R.Olsen, 15-FEB-1990.
FEATURES       from  to/span     description
    pept         19  /    32     alpha-1 type IX collagen (COL9A1) precursor,
                                 exon 1
    pep$         19  /    32     alpha-1 type IX collagen precursor, exon 1
    sigp         19  /    32     alpha-1 type IX collagen signal peptide
    sig$         19  /    32     alpha-1 type IX collagen signal peptide
    IVS          33  >    34     COL9A1 intron A
BASE COUNT       13 a      6 c      9 g      6 t
ORIGIN      
        1 agagaaccca ctgggaaaat gaagaacttc tggt
//
LOCUS       MUSACOLA2      88 bp ds-DNA             ROD       06-JUL-1990
DEFINITION  Mouse alpha-1 type IX collagen (COL9A1), exon 6.
ACCESSION   M32134
KEYWORDS    alpha-1 type IX collagen.
SEGMENT     2 of 4
SOURCE      Mouse DNA, clones YMm[413,911,341].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
  TITLE     The alpha-1 (IX) collagen gene gives rise to two different
            transcripts in mouse embryonic and human fetal RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            B.R.Olsen, 15-FEB-1990.
FEATURES       from  to/span     description
    pept    /     3  /    86     alpha-1 type IX collagen (COL9A1), exon 6 (AA
                                 at 3)
    pep$    /     3  +    86     alpha-1 type IX collagen, exon 6 (AA at 3)
    IVS     <     1        2     COL9A1 intron E
    IVS          87  >    88     COL9A1 intron F
BASE COUNT       25 a     22 c     21 g     20 t
ORIGIN      Unknown number of bp after segment 1.
        1 agtttgaact ccagtggatg ctgattcatt gtgaccccct gagacccagg agagaaacct
       61 gtcatgagct gccaatcaga atcacagt
//
LOCUS       MUSACOLA3      80 bp ds-DNA             ROD       06-JUL-1990
DEFINITION  Mouse alpha-1 type IX collagen (COL9A1) gene, exon 1*.
ACCESSION   M32132
KEYWORDS    alpha-1 type IX collagen.
SEGMENT     3 of 4
SOURCE      Mouse DNA, clones YMm[413,911,341].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 80)
  AUTHORS   Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
  TITLE     The alpha-1 (IX) collagen gene gives rise to two different
            transcripts in mouse embryonic and human fetal RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            B.R.Olsen, 15-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1  +    78     alpha-1 type IX collagen (COL9A1), exon 1* (AA
                                 at 1)
    IVS          79  >    80     COL9A1 intron F*
BASE COUNT        8 a     19 c     31 g     22 t
ORIGIN      Unknown number of bp after segment 2.
        1 atggcctggg ctgcctgggg acgaggagtg cttgggctgt cactgatgct gtctgggctc
       61 cgcttgtgtg ctgctcaagt
//
LOCUS       MUSACOLA4      25 bp ds-DNA             ROD       06-JUL-1990
DEFINITION  Mouse alpha-1 type IX collagen (COL9A1) gene, exon 7.
ACCESSION   M32136
KEYWORDS    alpha-1 type IX collagen.
SEGMENT     4 of 4
SOURCE      Mouse DNA, clones YMm[413,911,341].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 25)
  AUTHORS   Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R.
  TITLE     The alpha-1 (IX) collagen gene gives rise to two different
            transcripts in mouse embryonic and human fetal RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            B.R.Olsen, 15-FEB-1990.
FEATURES       from  to/span     description
    pept    +     3  /    23     alpha-1 type IX collagen (COL9A1), exon 7
    pep$    +     3  /    23     alpha-1 type IX collagen, exon 7
    IVS     <     1        2     COL9A1 intron G*
    IVS          24  >    25     COL9A1 intron H*
    IVS     <     1        2     COL9A1 intron F
    IVS          24  >    25     COL9A1 intron G
BASE COUNT        8 a      7 c      7 g      3 t
ORIGIN      Unknown number of bp after segment 3.
        1 agaccagcca gaccactgat gaggt
//

GenBank-Updates@genbank.bio.net (07/07/90)

LOCUS       TRBMVAT5A    1544 bp ss-mRNA            INV       07-JUL-1990
DEFINITION  T.brucei variant surface glycoprotein (MVAT5) mRNA, complete cds.
ACCESSION   M33825
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1544)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Donelson, 23-APR-1990.
BASE COUNT      497 a    396 c    366 g    285 t
ORIGIN      
        1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt
       61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt
      121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa
      181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg
      241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt
      301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa
      361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac
      421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg
      481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc
      541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg
      601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca
      661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca
      721 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca
      781 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat
      841 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt
      901 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac
      961 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa
     1021 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac
     1081 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc
     1141 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc
     1201 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat
     1261 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca
     1321 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc
     1381 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta
     1441 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa
     1501 gaatttttgc tactttaaaa acttctgata tattttaaca ccta
//
LOCUS       TRBWRATATA   1544 bp ss-mRNA            INV       07-JUL-1990
DEFINITION  T.brucei variant surface glycoprotein (WRATat A) mRNA, complete
            cds.
ACCESSION   M33823
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1544)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Donelson, 23-APR-1990.
FEATURES       from  to/span     description
    pept         42     1457     variant surface glycoprotein (WRATat A)
    mRNA    <     1     1544     WRATat A mRNA
BASE COUNT      545 a    387 c    345 g    267 t
ORIGIN      
        1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc
       61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt
      121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac
      181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg
      241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg
      301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc
      361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa
      421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct
      481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca
      541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg
      601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac
      661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca
      721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa
      781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac
      841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg
      901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt
      961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg
     1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca
     1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg
     1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc
     1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag
     1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt
     1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga
     1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg
     1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta
     1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg
//
LOCUS       TRBWRATATB   1585 bp ss-mRNA            INV       07-JUL-1990
DEFINITION  T.brucei variant surface glycoprotein (WRATat B) mRNA, complete
            cds.
ACCESSION   M33824
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1585)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Donelson, 23-APR-1990.
BASE COUNT      535 a    384 c    376 g    290 t
ORIGIN      
        1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact
       61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga
      121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa
      181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac
      241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga
      301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat
      361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga
      421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc
      481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca
      541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc
      601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg
      661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag
      721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa
      781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga
      841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact
      901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca
      961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca
     1021 cgacctcata acagctaaat ccggtacgac ataagcggcg caatcactcg gctaaagcaa
     1081 ctacaaatcg ccaaagcaaa gctaaaagca agacggctac tgctgcaaaa cagggaacgg
     1141 cagcaaacgc gacttatggc gctagcagac aagatgcaag aattgtacca agaggcctta
     1201 catgacgagg ttcaactcag gaaggaagcg cagaacaaac ctcaagaaac accagattct
     1261 gacaagcaaa aagcatgcga gaaatatcac aacaagtcaa aggaatgcaa agaaaatggt
     1321 tgccaatgga gtggaactga agaaaccact aggaaagtgc gaagctaaac ccaaagcagg
     1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac
     1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt
     1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact
     1561 tctggtatat tttaacacct ttaaa
//
LOCUS       HUMMHB17W    3520 bp ds-DNA             PRI       07-JUL-1990
DEFINITION  Human MHC class I HLA-Bw58 gene, complete cds.
ACCESSION   M11799
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human B lymphoblastoid cell line WT49 DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3520)
  AUTHORS   Ways,J.P., Coppin,H.L. and Parham,P.
  TITLE     The complete primary structure of HLA-Bw58
  JOURNAL   J. Biol. Chem. 260, 11924-11933 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         88      160     HLA-Bw58 of MHC, exon 1 /nomgen="HLA-B"
                                 /map="6p21.3" /hgml_locus_uid="LL0111R"
                290      559     HLA-Bw58 of MHC, exon 2
                806     1081     HLA-Bw58 of MHC, exon 3
               1657     1932     HLA-Bw58 of MHC, exon 4
               2037     2153     HLA-Bw58 of MHC, exon 5
               2595     2627     HLA-Bw58 of MHC, exon 6
               2734     2777     HLA-Bw58 of MHC, exon 7
    pre-msg      67     3386     HLA-Bw58 of MHC, mRNA and introns
    IVS         161      289     HLA-Bw58 intron A
    IVS         560      805     HLA-Bw58 intron B
    IVS        1082     1656     HLA-Bw58 intron C
    IVS        1933     2036     HLA-Bw58 intron D
    IVS        2154     2594     HLA-Bw58 intron E
    IVS        2628     2733     HLA-Bw58 intron F
BASE COUNT      665 a    981 c   1099 g    775 t
ORIGIN      Chromosome 6p21.3.
        1 tctagagaag ccaatcagtg tcgccggggt cccagttcta aagtccccac gcacccaccc
       61 ggactcagaa tctcctcaga cgccgagatg cgggtcacgg cgccccgaac cgtcctcctg
      121 ctgctctggg gggcagtggc cctgaccgag acctgggccg gtgagtgcgg ggtcgggagg
      181 gaaatggcct ctgtggggag gagcgagggg accgcaggcg ggggcgcagg acctgaggag
      241 ccgcgccggg aggagggtcg ggcgggtctc agcccctcct cgcccccagg ctcccactcc
      301 atgaggtatt tctacaccgc catgtcccgg cccggccgcg gggagccccg cttcatcgca
      361 gtgggctacg tggacgacac ccagttcgtg aggttcgaca gcgacgccgc gagtccgagg
      421 acggagcccc gggcgccatg gatagagcag gaggggccgg agtattggga cggggagaca
      481 cggaacatga aggcctccgc gcagacttac cgagagaacc tgcggatcgc gctccgctac
      541 tacaaccaga gcgaggccgg tgagtgaccc cggcccgggg cgcaggtcac gactccccat
      601 cccccacgta cggcccgggg tcgccccgag tctccgggtc cgagatccgc ctccctgagg
      661 ccgcgggacc cgcccagacc ctcgaccggc gagagcccca ggcgcgttta cccggtttca
      721 ttttcagttg aggccaaaat ccccgcgggt tggtcggggc ggggcggggc tcgggggacg
      781 gggctgaccg cggggccggg gccagggtct cacatcatcc agaggatgta tggctgcgac
      841 ctggggcccg acgggcgcct cctccgcggg catgaccagt ccgcctacga cggcaaggat
      901 tacatcgccc tgaacgagga cctgagctcc tggaccgcgg cggacaccgc ggctcagatc
      961 acccagcgca agtgggaggc ggcccgtgtg gcggagcagc tgagagccta cctggagggc
     1021 ctgtgcgtgg agtggctccg cagatacctg gagaacggga aggagacgct gcagcgcgcg
     1081 ggtaccaggg gcagtgggga gccttcccca tctcctatag gtcgccgggg atggcctccc
     1141 acgagaagag gaggaaaatg ggatcagcgc tagaatgtcg ccctcccttg aatggagaat
     1201 ggcatgagtt ttcctgagtt tcctctgagg gccccctctt ctctctagga caattaaggg
     1261 atgacgtctc tgaggaaatg gaggggaaga cagtccctag aatactgatc aggggtcccc
     1321 tttgacccct gcagcagcct tgggaaccgt gacttttcct ctcaggcctt gttctctgcc
     1381 tcacactcag tgtgtttggg gctctgattc cagcacttct gagtcacttt acctccactc
     1441 agatcaggag cagaagtccc tgttccccgc tcagagactc gaactttcca atgaatagga
     1501 gattatccca ggtgcctgcg tccaggctgg tgtctgggtt ctgtgcccct tccccacacc
     1561 aggtgtcctg tccattctca ggctggtcac atgggtggtc ctagggtgtc ccatgagaga
     1621 tgcaaagcgc ctgaattttc tgactcttcc catcagaccc cccaaagaca cacgtgaccc
     1681 accaccccgt ctctgaccat gaggccaccc tgaggtgctg ggccctgggc ttctaccctg
     1741 cggagatcac actgacctgg cagcgggatg gcgaggacca aactcaggac actgagcttg
     1801 tggagaccag accagcagga gatagaacct tccagaagtg ggcagctgtg gtggtgcctt
     1861 ctggagaaga gcagagatac acatgccatg tacagcatga ggggctgccg aagcccctca
     1921 ccctgagatg gggtaaggag ggggatgagg ggtcatatct cttctcaggg aaagcaggag
     1981 cccttctgga gcccttcagc agggtcaggg cccctcgtct tcccctcctt tcccagagcc
     2041 atcttcccag tccaccatcc ccatcgtggg cattgttgct ggcctggctg tcctagcagt
     2101 tgtggtcatc ggagctgtgg tcgctactgt gatgtgtagg aggaagagct caggtaggga
     2161 aggggtgagg ggtggggtct gggttttctt gtcccactgg gggtttcaag ccccaggtag
     2221 aagtgttccc tccctcatta ctgggaagca gcatccacac aggggctaac gcagcctggg
     2281 accctgtgtg ccagcactta ctcttttgtg cagcacatgt gacaatgaag gacggatgta
     2341 tcaccttgat ggttgtggtg ttggggtcct gatttcagca ttcatgagtc aggggaaggt
     2401 ccctgctaag gacagacctt aggagggcag ttggtccagg acccacactt gctttcctcg
     2461 tgtttcctga tcctgccttg ggtctgtagt catacttctg gaaattcctt ttgggtccaa
     2521 gacgaggagg ttcctctaag atcttaaggc cctgcttcct cccagtcccc tcacaggaca
     2581 ttttcttccc acaggtggaa aaggagggag ctactctcag gctgcgtgta agtggtgggg
     2641 gtgggagtgt ggaggagctc acccacccca taattcctcc tgtcccacgt ctcctgcggg
     2701 ctctgaccag gtcctgtttt tgttctactc cagccagcga cagtgcccag ggctctgatg
     2761 tgtctctcac agcttgaaaa ggtgagattc ttggggtcta gagtgggtcg ggtggcgggt
     2821 ctgggggtgg gtggggcaga ggggaaaggc ctgggtaatg gggattcttt gattgggatg
     2881 tttcgcgtgt gtggtgggct gtttacagtg tcatcgctta ccatgactaa ccagaatttg
     2941 ttcatgactg ttgttttctg tagcctgaga cagctgtctt gtgagggact gagatgcagg
     3001 atttcttcac gcctcccctt tgtgacttca agagcctctg gcatctcttt ctgcaaaggc
     3061 acctgaatgt gtctgcgtcc ctgttagcat aatgtgagga ggtggagaca cagcccaccc
     3121 ttgtgtccac tgtgacccct gttcccatgc tgacctgtgt ttcctcccca gtcatctttc
     3181 ctgttccaga gaggtggggc tggatgtctc catctctgtc tcaactttac gtgcactgag
     3241 ctgcaacttc ttacttccct actgaaaata agaatctgaa tataaatttg ttttctcaaa
     3301 tatttgctat gagaggttga tggattaatt aaataagtca attcctggaa tttgagagag
     3361 caaataaaga cctgagaacc ttccagaatc tgcatgttcg ctgtgctgag tctgttgcag
     3421 gtggggtgtg gagaaggctg tggggggccg agtgtggacg gggcctgtgc ccatttggtg
     3481 ttgagtccat catgggcttt atgtggttag tcctcagctg
//
LOCUS       HUMTGFB1B    5075 bp ss-mRNA            PRI       07-JUL-1990
DEFINITION  Human transforming growth factor-beta 1 binding protein mRNA,
            complete cds.
ACCESSION   M34057
KEYWORDS    transforming growth factor-beta 1 binding protein.
SOURCE      Human fibroblast, cDNA to mRNA, clone BPA13.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 5075)
  AUTHORS   Kanzaki,T., Olofsson,A., Moren,A., Wernstedt,C., Hellman,U.,
            Miyazono,K., Claesson-Welsh,L. and Heldin,C.-H.
  TITLE     TGF-beta-1 binding protein: A component of the large latent complex
            of TGF-beta-1 with multiple repeat sequences
  JOURNAL   Cell 61, 1051-1061 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.-H.Heldin, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         91     4275     transforming growth factor-beta 1 binding
                                 protein precursor
    sigp         91      150     transforming growth factor-beta 1 binding
                                 protein signal peptide
    matp        151     4272     transforming growth factor-beta 1 binding
                                 protein
BASE COUNT     1353 a   1122 c   1252 g   1348 t
ORIGIN      
        1 tgaattcggc acgaggtggg ctctctcttt tcccctcttg ctcctttctt ttcttttttt
       61 ctgttttttt aaaccttcca aggcaagttc atggatacta agctgatgtg tttgttgttc
      121 tttttctccc tgcctccgct cctagtgagt aaccacactg gccgcatcaa ggtggtcttt
      181 actccgagca tctgtaaagt gacctgcacc aagggcagct gtcagaacag ctgtgagaag
      241 gggaacacca ccactctcat tagtgagaat ggtcatgctg ccgacaccct gacggccacg
      301 aacttccgag tggtaatttg ccatcttcca tgtatgaatg gtggccagtg cagttcaagg
      361 gacaaatgtc agtgccctcc aaatttcaca ggaaaacttt gtcagatccc agtccatggt
      421 gccagcgtgc ctaaacttta tcagcattcc cagcagccag gcaaggcgtt ggggacgcat
      481 gtcatccatt caacacatac cttgcctctg accgtgacta gccagcaagg agtcaaagtg
      541 aaatttcctc ctaacatagt caatatccat gtgaaacatc ctcctgaagc ttccgtccag
      601 atacatcagg tttcaagaat tgatggccca acaggccaga agacaaaaga agctcaacca
      661 ggccaatccc aagtctcgta ccaagggctt cctgtccaga agacccagac catacattcc
      721 acatactccc accagcaggt cattcctcac gtctaccccg tggctgctaa gacacagctt
      781 ggccggtgct tccaggaaac cattgggtca cagtgtggca aagcgctccc tggcctttca
      841 aagcaagagg actgctgtgg aactgtgggt acctcctggg gctttaacaa atgccagaaa
      901 tgccccaaga aaccatctta tcatggatac aaccaaatga tggaatgcct accgggttat
      961 aagcgggtta acaacacctt ttgccaagat attaatgaat gtcagctaca aggtgtatgc
     1021 cctaatggtg agtgtttgaa taccatgggc agctatcgat gtacctgcaa aataggattt
     1081 gggccggatc ctaccttttc aagttgtgtt cctgatcccc ctgtgatctc ggaagagaaa
     1141 gggccctgtt accgacttgt cagttctgga agacagtgta tgtaccctct gtctgttcac
     1201 ctcaccaagc agctctgctg ttgtagtgtg ggcaaggctg ggccacactg tgagaaatgt
     1261 ccccttccag gcacagctgc ttttaaggaa atctgtcctg gtggaatggg ttatacggtt
     1321 tctggcgttc atagacgcag gccaatccat caccatgtag gtaaaggacc tgtatttgtc
     1381 aagccaaaga acactcaacc tgttgctaaa agtactcatc ctccacctct cccagccaag
     1441 gaagagccag tggaggccct gaccttctcc cgggaacacg gggccaggag tgcggagcca
     1501 gaagtggcaa ctgcaccccc tgaaaaggaa ataccttcat tggatcaaga gaaaaccaaa
     1561 cttgagcctg gtcaacccca gctgtctcca ggcatttccg ctattcatct gcatccacag
     1621 tttccagtag tgattgaaaa aacatcacct cctgtgcctg ttgaagtagc tcctgaagct
     1681 tctacgtcta gtgccagcca agtgattgct cctactcaag tgacagaaat caatgaatgt
     1741 actgtgaacc ctgatatctg tggagcagga cactgcatta acctaccagt gagatatacc
     1801 tgtatatgct acgagggcta caggttcagt gaacaacaga ggaaatgtgt ggatattgat
     1861 gagtgtactc aggtccaaca cctctgctcc cagggccgct gtgaaaacac cgagggaagt
     1921 ttcttgtgca tttgcccagc aggatttatg gccagtgagg agggtactaa ctgcatagat
     1981 gttgacgaat gcctgaggcc ggacgtctgt ggggaggggc actgtgtcaa tactgtgggg
     2041 gccttccggt gtgaatactg tgacagcggg taccgcatga ctcagagagg ccgttgtgag
     2101 gatattgatg aatgtttgaa tccaagcact tgtccagatg agcagtgtgt gaattctcct
     2161 ggatcttacc agtgcgttcc ctgcacagaa ggattccgag gctggaatgg acagtgcctt
     2221 gatgtggacg agtgcctgga accaaacgtc tgcgcaaatg gtgattgttc caaccttgaa
     2281 ggctcctaca tgtgttcatg ccacaaaggc tatacccgga ctccggacca caagcactgt
     2341 agagatattg atgaatgtca gcaagggaat ctatgtgtaa acgggcagtg caaaaatacc
     2401 gagggctcct tcaggtgcac ctgtggacag gggtaccagc tgtcggcagc taaagaccag
     2461 tgtgaagaca ttgatgaatg ccagcaccgt catctctgtg ctcatgggca gtgcaggaac
     2521 actgagggct cttttcaatg tgtgtgtgac cagggttaca gagcatctgg gcttggagac
     2581 cactgtgaag atatcaatga atgcttggag gacaagagtg tttgccagag aggagactgc
     2641 attaatactg cagggtccta tgattgtact tgtccggatg gatttcagct agatgacaat
     2701 aaaacatgtc aagatattaa tgaatgtgaa catccagggc tctgtggtcc gcaaggggag
     2761 tgcctaaaca cagagggttc tttccattgt gtctgccagc agggtttctc aatctctgca
     2821 gatggccgta cgtgtgaaga tattgatgaa tgtgtaaaca acactgtttg tgacagtcac
     2881 gggttttgtg acaatacagc tggctccttc cgctgcctct gttatcaggg ctttcaagcc
     2941 ccacaggatg ggcaagggtg tgtggatgtg aatgaatgtg aactgctcag tggggtgtgt
     3001 ggtgaagcct tctgtgaaaa cgtggaaggg tccttcctgt gcgtgtgtgc tgatgaaaac
     3061 caagagtaca gccccatgac tgggcagtgc cgctcccgga cctccacaga tttagatgta
     3121 gatgtagatc aacccaaaga agaaaagaaa gaatgctact ataatctcaa tgacgccagt
     3181 ctctgtgata atgtgttggc ccccaatgtc acgaaacaag aatgctgctg tacatcaggc
     3241 gcgggatggg gagataactg cgaaatcttc ccctgcccgg tcttgggaac tgctgagttc
     3301 actgaaatgt gtcccaaagg gaaaggtttt gtgcctgctg gagaatcatc ttctgaagct
     3361 ggtggtgaga actataaaga tgcagatgaa tgcctacttt ttggacaaga aatctgcaaa
     3421 aatggtttct gtttgaacac tcggcctggg tatgaatgct actgtaagca agggacgtac
     3481 tatgatcctg tgaaactgca gtgctttgat atggatgaat gtcaagaccc cagtagttgt
     3541 attgatggcc agtgtgttaa tacagagggc tcttacaact gcttctgtac tcaccccatg
     3601 gtcctggatg cgtcagaaaa aagatgtata cgaccggctg agtcaaacga acaaatagaa
     3661 gaaactgatg tctaccaaga tttgtgctgg gaacatctga gtgatgaata cgtgtgtagc
     3721 cggcctcttg tgggcaagca gacaacgtac actgagtgct gctgtctgta tggagaggcc
     3781 tggggcatgc agtgtgccct ctgccccctg aaggattcag atgactatgc tcagctgtgt
     3841 aacatccccg tgacgggacg ccggcagcca tatggacggg acgccttggt tgacttcagt
     3901 gaacagtata ctccagaagc cgatccctac ttcatccaag accgttttct aaatagcttt
     3961 gaggagttac aggctgagga atgcggcatc ctcaatggat gtgaaaatgg tcgctgtgtg
     4021 agggtccagg aaggttacac ctgcgattgc ttggatgggt atcacttgga tactgccaag
     4081 atgacctgtt tcgatgtaaa tgaatgcgat gagttgaaca accggatgtc tctctgcaag
     4141 aatgccaagt gcattaacac cgatggttcc tacaagtgtt tgtgtctgcc aggctacgtg
     4201 ccttctgaca agccaaacta ctgcactccg ttgaataccg ccttgaattt agagaaagac
     4261 agtgacctgg agtgaaacag aatctacata acctaagccc atatactctg cactgtgtaa
     4321 aggaaaaggg agaaatgtat tatacttgag acattgcacc taccccggaa ggctggaaat
     4381 acagaaacag catggagttg caagtcctct gaagacaatg agaggattta ggatgagccc
     4441 gataggtgtg gcagaccaaa tggacatttc tctaaaaaac cagtatatat agtctgttca
     4501 tatgtaaaat tcaatggaag agaggtggaa cagtgctgtt attttaaaca gaaggttgta
     4561 ttattatgtt gttttgtttt ttttactatt gcttgattaa atttggcatt taaatagtgg
     4621 tggaaatatt tttatataat tttcattttt tggttgtgca gttccttggc tactgttttt
     4681 cttttacttc agttttttaa aaatctcaaa tgaaaaagtc ttcgatacaa tattgttaag
     4741 ctgtattata agtattgtta cacagggtta tgcaattccc ggcctggagc atttttgaaa
     4801 ttcaaattgt ctgtcctgtg gagcaggcag tgattttgtt ccaaaacttt gtatacacat
     4861 ttggagaaaa gtactttata ttttcagtgt tttgtctgat tttaatgtcc gttcttagcc
     4921 aagctgctag caggtgttaa ttggatccct ttccttcact gaaatggaag agtttataag
     4981 cttacgttag tattgtaata tgtaaagtaa gcccaacaaa aatttttaaa aatttgatga
     5041 tccccaatat atctaccatt gtatgttaaa taaat
//
LOCUS       MUSPLGN      2720 bp ss-mRNA            ROD       07-JUL-1990
DEFINITION  Mouse plasminogen mRNA, complete cds.
ACCESSION   J04766
KEYWORDS    plasminogen.
SOURCE      Mouse cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2720)
  AUTHORS   Degen,S.J.
  TITLE     Characterization of the cDNA coding for mouse plasminogen and
            localization of the gene to mouse chromosome 17
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable  sequence for [1] kindly
            submitted by S.J.Degen, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         25     2463     plasminogen precursor
    sigp         25       81     plasminogen signal peptide
    matp         82     2460     plasminogen
    signal     2699     2704     poly-A signal
BASE COUNT      731 a    707 c    710 g    572 t
ORIGIN      Chromosome 17.
        1 ccggtgctgt tggccagtcc caacatggac cataaggaag taatccttct gtttctcttg
       61 cttctgaaac caggacaagg ggactcgctg gatggctaca taagcacaca aggggcttca
      121 ctgttcagtc tcaccaagaa gcagctcgca gcaggaggtg tctcggactg tttggccaaa
      181 tgtgaagggg aaacagactt tgtctgcagg tcattccagt accacagcaa agagcagcaa
      241 tgcgtgatca tggcggagaa cagcaagact tcctccatca tccggatgag agacgtcatc
      301 ttattcgaaa agagagtgta tctgtcagaa tgtaagaccg gcatcggcaa cggctacaga
      361 ggaaccatgt ccaggacaaa gagtggtgtt gcctgtcaaa agtggggtgc cacgttcccc
      421 cacgtaccca actactctcc cagtacacat cccaatgagg gactagaaga gaactactgt
      481 aggaacccag acaatgatga acaagggcct tggtgctaca ctacagatcc ggacaagaga
      541 tatgactact gcaacattcc tgaatgtgaa gaggaatgca tgtactgcag tggagaaaag
      601 tatgagggca aaatctccaa gaccatgtct ggacttgact gccaggcctg ggattctcag
      661 agcccacatg ctcatggata catccctgcc aaatttccaa gcaagaacct gaagatgaat
      721 tattgccaca accctgacgg ggagccaagg ccctggtgct tcacaacaga ccccaccaaa
      781 cgctgggaat actgtgacat cccccgctgc acaacacccc cgcccccacc cagcccaacc
      841 taccaatgtc tgaaaggaag aggtgaaaat taccgaggga ccgtgtctgt caccgtgtct
      901 gggaaaacct gtcagcgctg gagtgagcaa acccctcata ggcacaacag gacaccagaa
      961 aatttcccct gcaaaaatct ggaagagaac tactgccgga acccagatgg agaaactgct
     1021 ccctggtgct ataccactga cagccagctg aggtgggagt actgtgagat tccatcctgc
     1081 gagtcctcag catcaccaga ccagtcagat tcctcagttc caccagagga gcaaacacct
     1141 gtggtccagg aatgctacca gagcgatggg cagagctatc ggggtacatc gtccactacc
     1201 atcacaggga agaagtgcca gtcctgggca gctatgtttc cacacaggca ttcgaagacc
     1261 ccagagaact tcccagatgc tggcttggag atgaactact gcaggaaccc ggatggtgac
     1321 aagggccctt ggtgctacac cactgacccg agcgtcaggt gggaatactg caacctgaag
     1381 cggtgctcag agacaggagg gagtgttgtg gaattgccca cagtttccca ggaaccaagt
     1441 gggccgagcg actctgagac agactgcatg tatgggaatg gcaaagacta tcggggcaaa
     1501 acggccgtca ctgcagctgg caccccctgc cagggatggg ctgcccagga gccccacagg
     1561 cacagcatct tcaccccaca gacaaaccca cgggcagatc tggaaaagaa ctactgccga
     1621 aacccagatg gggatgtgaa tggtccttgg tgctatacaa caaaccccag aaaactttat
     1681 gactattgtg acatccccct gtgtgcatca gcatcatcct ttgagtgcgg gaaacctcag
     1741 gtggaaccga agaaatgccc tgggagggtg gtgggtggct gcgtggccaa ccctcactcc
     1801 tggccctggc aaatcagcct tagaacaaga tttaccggac agcacttctg tggcggtact
     1861 ttaatagccc cagagtgggt tctgactgct gcccactgtt tggagaaatc ttcaagacct
     1921 gaattctaca aggttatcct gggtgcgcac gaagaatata tccgtgggtt ggatgttcag
     1981 gaaatatcag tagccaaact gatcttggag cccaacaacc gtgacattgc cctgctgaaa
     2041 ctaagccgcc cagccaccat cacggataaa gtcattccag cttgtctgcc atctccaaat
     2101 tacatggttg ctgaccggac aatatgttac atcaccggct ggggagagac tcaagggact
     2161 ttcggtgccg gtcgtctcaa ggaggctcag ctgcctgtga ttgagaacaa ggtgtgcaac
     2221 cgcgtcgagt atctgaacaa cagagtcaaa tccacggagc tctgtgccgg gcaactggct
     2281 ggtggcgtcg acagctgcca aggcgacagt ggaggacctc tggtttgctt cgagaaggac
     2341 aagtacattt tacaaggagt cacttcttgg ggtcttggct gtgctcgccc caataagcct
     2401 ggtgtctacg ttcgtgtctc acggtttgtt gattggattg aaagggagat gaggaataac
     2461 tgactaggtg gaaggccgag caaaacctct gcttactaaa gcttactgaa tatggggaga
     2521 gggcttaggg tgtttggaaa aactgacagt aatcaaactg ggacactaca ctgaaccaca
     2581 gcttcctgtc gcccctcagc ccctcccctt tttttgtatt attgtgggta aaattttcct
     2641 gtctgtggac ttctggattt tgtgacaata gaccatcact gctgtgacct ttgttgaaaa
     2701 taaactcgat acttactttg
//
LOCUS       HUMMHEA      4938 bp ds-DNA             PRI       07-JUL-1990
DEFINITION  Human MHC class I lymphocyte antigen (HLA-E) (HLA-6.2) gene,
            complete cds.
ACCESSION   M21533
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; lymphocyte antigen;
            major histocompatibility complex.
SOURCE      Human lymphoblastoid (cell line B-LCL) DNA, clone HLA-6.2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4938)
  AUTHORS   Koller,B.H., Geraghty,D.E., Shimizu,Y., DeMars,R. and Orr,H.T.
  TITLE     A novel HLA class I gene expressed in resting T lymphocytes
  JOURNAL   J. Immunol. 141, 897-904 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            H.T.Orr, 23-NOV-1988.
FEATURES       from  to/span     description
    pept        705      768     MHC HLA-E precursor, exon 1 /nomgen="HLA-E"
                                 /map="6p21.3" /hgml_locus_uid="LT0100B"
                899     1168     MHC HLA-E precursor, exon 2
               1411     1686     MHC HLA-E precursor, exon 3
               2308     2583     MHC HLA-E precursor, exon 4
               2708     2824     MHC HLA-E precursor, exon 5
               3575     3607     MHC HLA-E precursor, exon 6
               3712     3752     MHC HLA-E precursor, exon 7
    sigp        704      768     MHC HLA-E signal peptide
    matp        899     1168     MHC HLA-E
               1411     1686     MHC HLA-E
               2308     2583     MHC HLA-E
               2708     2825     MHC HLA-E
               3576     3607     MHC HLA-E
               3712     3749     MHC HLA-E
    pre-msg <   705     4479     MHC HLA-E mRNA and introns
    IVS         769      898     MHC HLA-E, intron A
    IVS        1169     1410     MHC HLA-E, intron B
    IVS        1687     2307     MHC HLA-E, intron C
    IVS        2584     2707     MHC HLA-E, intron D
    IVS        2825     3574     MHC HLA-E, intron E
    IVS        3608     3711     MHC HLA-E, intron F
    IVS        3755     3919     MHC HLA-E, intron G
BASE COUNT     1065 a   1332 c   1380 g   1161 t
ORIGIN      Chromosome 6p21.3.
        1 ggcatgatgg acgcgattag ccggtaacgg atacatgaag catgaggtct ttgaagtcat
       61 tggcggctct ctggtgtacg agtctcagct ggttgactac aaggcactct gaagcagatc
      121 ctgttttttg ttttgttttg ttttttgttc tgtttttccc cctagacatc tcactctgct
      181 gcccaggctg gagtgcagtg gtgtgatctc ggctcactgc aaccatccat cctcctgggt
      241 tcaagtagat tctccatgcc tcagcctccc aagtagccag gattacagcc atgcgccacc
      301 acgccggcta attttttgga cttttagtag agacagggtt tctccatatt ggtcgggctg
      361 gtctcgaact cccaacctca ggtgatcagc ccgccttggc ctcccaaagt gctgagatta
      421 caggcgtgag ccaccgcgcc cagccaggac taatttctaa gagtgtgcag agataccgaa
      481 acctaaaagt ttaagaactg ctgattgctg ggaaactctg cagtttcccg ttcctctcgt
      541 aacctggtca tgtgtccttc ttcctggata ctcatgacgc agactcagtt ctcattccca
      601 atgggtgtcg ggtttctaga gaagccaatc agcgtcgcca cgactcccga ctataaagtc
      661 cccatccgga ctcaagaagt tctcaggact cagaggctgg gatcatggta gatggaaccc
      721 tccttttact cctctcggag gccctggccc ttacccagac ctgggcgggt gagtgcgggg
      781 tcgggatgga aacggcctct accgggagta gagaggggcc ggcccggcgg gggcgaagga
      841 ctcggggagc cgcgccggga ggagggtcgg gccgatctca gcccctcctc gcccccaggc
      901 tcccactcct tgaagtattt ccacacttcc gtgtcccggc ccggccgcgg ggagccccgc
      961 ttcatctctg tgggctacgt ggacgacacc cagttcgtgc gcttcgacaa cgacgccgcg
     1021 agtccgagga tggtgccgcg ggcgccgtgg atggagcagg aggggtcaga gtattgggac
     1081 cgggagacac ggagcgccag ggacaccgca cagattttcc gagtgaacct gcggacgctg
     1141 cggcgctact acaatcagag cgaggccggt gagtgacccc ggccaggagc aggtcacgac
     1201 ccctccccat cccccacgga cggcgcgggt cccctcgaat cttcgggtcc cagattcacc
     1261 ccaaggctgc ggaacccgcc cagaccctag accggggaga gtctcaggcg cctttacccg
     1321 gttctttttc agtttaggcc aaaatgccca cagggtggtg gcgacggggg cggggcttgg
     1381 tgggcgggac tgactaaggg gcggggccag ggtctcacac cctgcagtgg atgcatggct
     1441 gcgagctggg gcccgacagg cgcttcctcc gcgggtatga acagttcgcc tacgacggca
     1501 aggattatct caccctgaat gaggacctgc gctcctggac cgcggtggac acggcggctc
     1561 agatctccga gcaaaagtca aatgatgcct ctgaggcgga gcaccagaga gcctacctgg
     1621 aagacacatg cgtggagtgg ctccacaaat acctggagaa ggggaaggag acgctgcttc
     1681 acctgggtaa gagggtccac agggctactc tcccatctcc ttcttgggct aggactgtgc
     1741 ccacagctga cagacctcaa acagtagaag aaacagggat ggaggccaga ataccactcc
     1801 tcccttggat caggagaggg agctgtcacc tgaggtacag gagatcctat accacagagt
     1861 gactctctta aagggccaga cctctctcag gggcaattaa ggaatctagt ctcgctggag
     1921 attccatcct tcagatgaac tgatgagcag ttctctttga ctcccagtat taggaatcac
     1981 gggggagttt ctctcgtgcc tgattctcag ccccacacca agagtttttg gaggtctgac
     2041 tccagctttt ctcagtcact cagcatccac acaggccagg accagaaatc ccttttcacc
     2101 ttctaccctg ggctagctca tcccgattct agaactttcc aaggaataag aggctatccc
     2161 agatccctaa gtccaggctg gtgtcaaggt tttgtcctct tctcctacta taattgtcct
     2221 cttccttctc aggatggtca catgggtgct gctggagtgt cccatgagag atacaaagtg
     2281 cctgaatttt ctgactcttc ccctcagagc ccccaaagac acacgtgact caccacccca
     2341 tctctgacca tgaggccacc ctgaggtgct gggccctggg cttctaccct gcggagatca
     2401 cactgacctg gcagcaggat ggggagggcc atacccagga cacggagctc gtggagacca
     2461 ggcctgcagg ggatggaacc ttccagaagt gggcagctgt ggtggtgcct tctggagagg
     2521 agcagagata cacgtgccat gtgcagcatg aggggctacc cgagcccgtc accctgagat
     2581 ggagtaagga gggggatggg aggtcatgtc tcttctcagg gaaagcggga gcccttctgg
     2641 agcccttccg cagggtcagg gctgaggcct gggggtcagg gccccttacg ttcccctctt
     2701 ttcccagagc cggcttccca gcccaccatc cccatcgtgg gcatcattgc tggcctggtt
     2761 ctccttggat ctgtggtctc tggagctgtg gttgctgctg tgatatggag gaagaagagc
     2821 tcaggtgggg aagggagaag ggtggggtct gagttttctt gtcccactgg gtgtttcaag
     2881 ccctaggtaa aagtgtgtcc tgcctcgtta ctgggaagca ccatccacac acacgagcct
     2941 acccagcctg gggccctgtg tgccagcacc tactcttttt ttttgagacg gagtcttggc
     3001 tctgtcaccc aggctggagt gcaatggcgt ggtttcagct cactgcaacc tccgcctccc
     3061 aggttcaagc aattctcctg cctcagcctc cctagtagct gggactacac atgcgtgcca
     3121 ccacacctgg ctaatttttt tttttgtatt tttagtggag atggggtttc actatgttgg
     3181 ccaggctggt ctcgaactcc tgactttgtg atctgcctgc ctcggcctcc caaagtgctg
     3241 ggattacagt cgtgagccac cgcacccagc cgcacctact cttttgtaaa gcacctgtga
     3301 caatgaagga cagatttatc accttgacga ttgtggtgat ggggacctga tcccagcagt
     3361 cacaggtcac aggggaaggt ccctgctgaa gacagacctc agaagggcag ttgatccagg
     3421 acccacacct gctttcttca cgtttcctga tcctgccctg ggtctgcagt cacagttcag
     3481 gaaacttctc tgggatccaa aactaggagg ttcctctagg accttatggc cctgcctcct
     3541 ccctggcccc tcacaggaca ttttcttcca acaggtggaa aaggagggag ctactctaag
     3601 gctgagtgta agtgcggggc gggagcgtgg aggagctcgc ccaccctata attcctcctg
     3661 caccacatct cctgtgggct ctgaccaggt cttgtttttg ttctacccca gggagcgaca
     3721 gtgcccaggg gtctgagtct cacagcttgt aaaggtgaga ttctgggggt ctgaagtggg
     3781 tggagggtgg ggcagagggg acaggactgg gttgtgggga ttttttgatt cagaattttt
     3841 gagtgtgtgg tgggctgttc agagtgtcat cacttaccgt gactgacctg aatttgttca
     3901 tgactatttt cttctgtagc ctgagacagc tgccttgtgt gcgactgaga tgcacagctg
     3961 ccttgtgtgc gactgagatg caggatttcc tcacgcctcc cctatgtgtc ttaggggact
     4021 ctggcttctc tttttgcaag ggcctctgaa tctgtctgtg tccctgttag cacaatgtga
     4081 ggaggtagag aaacagtcca cctctgtgtc taccatgacc cccttcctca cactgacctg
     4141 tgttccttcc ctgttctctt ttctattaaa aataagaacc tgggcagagt gcggcagctc
     4201 atgcctgtaa tcccagcact tagggaggcc gaggagggca gatcacgagg tcaggagatc
     4261 gaaaccatcc tggctaacac ggtgaaaccc cgtctctact aaaaaataca aaaaattagc
     4321 tgggcgcaga ggcacgggcc tgtagtccca gctactcagg aggcggaggc aggagaatgg
     4381 cgtcaacccg ggaggcggag gttgcagtga gccaggattg tgcgactgca ctccagcctg
     4441 ggtgacaggg tgaaacgcca tctcaaaaaa taaaaattaa aaaataaaaa aagaacctgg
     4501 atctcaattt aatttttcat attcttgcaa tgaaatggac ttgaggaagc taagatcata
     4561 gctagaaata cagataattc cacagcacat ctctagcaaa tttagcctat tcctattctc
     4621 tagcctattc cttaccacct gtaatcttga ccatatacct tggagttgaa tattgttttc
     4681 atactgctgt ggtttgaatg ttccctccaa cactcatgtt gagacttaat ccctaatgtg
     4741 gcaatactga aaggtggggc ctttgagatg tgattggatc gtaaggctgt gccttcattc
     4801 atgggttaat ggattaatgg gttatcacag gaatgggact ggtggcttta taagaagagg
     4861 aaaagagaac tgagctagca tgcccagccc acagagagcc tccactagag tgatgctaag
     4921 tggaaatgtg aggtgcag
//
LOCUS       HUMGSTPI1    1724 bp ds-DNA             PRI       07-JUL-1990
DEFINITION  Human glutathione S-transferase pi (GST-pi) gene, exons 1-5.
ACCESSION   X08094 M33082
KEYWORDS    glutathione S-transferase pi.
SEGMENT     1 of 3
SOURCE      Human, cell line HPB-ALL, lymphoid DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1724)
  AUTHORS   Cowell,I.G.
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 1724)
  AUTHORS   Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B.
  TITLE     The structure of the human glutathione S-transferase pi gene
  JOURNAL   Biochem. J. 255, 79-83 (1988)
  STANDARD  simple staff_entry
COMMENT     See X08094 - X08096 for GST pi gene sequence; see X06547 for GST
            mRNA sequence.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE        116    123       TRE-box
               SITE        129    134       Sp1 binding site
               SITE        138    143       Sp1 binding site
               PRM         154    157       TATA-box
               MSG         182    214       Exon 1
               MSG         499    534       Exon 2
               MSG         819    925       Exon 3
               MSG        1042   1129       Exon 4
               MSG        1488   1591       Exon 5
            
            [1]  Author address Cowell I.G., University College and Middlesex,
            School of Medicine, Windeyer Building, Cleveland Street, London,
            W1P 6DB, United Kingdom.
            
            Submitted (13-JUL-1988) on tape to the EMBL data library by:
FEATURES       from  to/span     description
    pept        214      214     glutathione S-transferase pi, exon 1
                                 /hgml_locus_uid="LQ0085R" /map="11q13-qter"
                                 /nomgen="GST3"
                499      534     glutathione S-transferase pi, exon 2
                819      925     glutathione S-transferase pi, exon 3
               1042     1129     glutathione S-transferase pi, exon 4
               1488  +  1591     glutathione S-transferase pi, exon 5
    pre-msg     182  >  1724     GST3 mRNA and introns
    IVS         215      498     GST3 intron A
    IVS         535      818     GST3 intron B
    IVS         926     1041     GST3 intron C
    IVS        1130     1487     GST3 intron D
    IVS        1592  >  1724     GST3 intron E
    revision   1532     1541     atgaatgac in [1]; gtgaatgacg in [2]
    revision   1707     1707     n in [1]; g in [2]
BASE COUNT      272 a    598 c    541 g    313 t
ORIGIN      
        1 tcgctgcgac tccgggactc cagggcgccc ctctcggccg acgcccgggg tgcagcggcg
       61 ccgggctggg gccggcggga gtccgcggga ccctccagaa gagcggccgg cgccgtgact
      121 cagcactggg gcggagcggg gcgggaccac ccttataagg ctcggaggcc gcgaggcctt
      181 cgctggagtt tcgccgccgc agtcttcgcc accagtgagt acgcgcggcc cgcgtccccg
      241 gggatggggc tcagagctcc cagcatgggg ccaacccgca gcatcaggcc cgggctcccg
      301 gcaggctcct cgcccacctc gagacccggg acggggccta ggggacccag gacgtcccca
      361 gtggccgtta gcggctttca gggggcccgg agcgcctcgg ggagggatgg gaccccgggg
      421 gcgggagggg ggggcagctc cactcaccgc gccttggcat cctccccggg ctccacaaat
      481 tttctttgtt cgctgcagtg ccgccctaca ccgtggtcta tttcccagtt cgaggtagga
      541 gcatgtgtct ggcaggaagg gaggcaggcc tggctgcagc ccacagcccc tcgcccaccc
      601 ggagagatcc gaaccccctt atccctccgt cgtgtggctt ttaccccggg cctccttcct
      661 gttccccgcc tctcccgcca tgcctgctcc ccgccccagt gttgtgtaaa atcttggagg
      721 aacctgtttc ctgttccctc ctgcactcct gaccctcccg ggttgctgcg aggcggagtg
      781 gcccggtccc cacatctcgt acttctccct ccccgcaggc cgctgcgcgg ccctgcgcat
      841 gctgctggca gatcagggcc agagctggaa ggaggaggtg gtgaccgtgg agacgtggca
      901 ggagggctca ctcaaagcct cctgcgtaag tgaccatgcc cgggcaaggg gagggggtgc
      961 tgggccttag ggggctgtgc actaggatcg ggggacgccc aagctcagtg ccccttccct
     1021 gagccatgcc tcccccaaca gctatacggg cagctcccca agttccagga cggagacctc
     1081 accctgtacc agtccaatac catcctgcgt cacctgggcc gcacccttgg tgagtcttaa
     1141 cctcaagtcc aggagtcagg caagcctctg cccccggagc ccttttgttt aaatcaggcc
     1201 ccgcagccct ctggagggga ggaaactgag acccactgag gttacgtagt ttgccaagtc
     1261 aagcctggtt gtttgcaatc cttgcctgtg ccaggctggc tcccaggtgt caggtgagct
     1321 ctgagcacct ggtgtgtggc agtctctcat ccttccacgc acatcctctt cccctcctcc
     1381 caggctgggg ctcatcagac agaaccccct tggcctggcc catccccaat gactgtgtgt
     1441 gatcagcgcc cagtcaaccc ggcctgctcc cctccaccca accccagggc tctatgggaa
     1501 ggaccagcag gaggcagccc tggtggacat ggtgaatgac ggcgtggagg acctccgctg
     1561 caaatacatc tccctcatct acaccaacta tgtgtgagca tctgcaccag gttgggcact
     1621 gggggctgaa caaagaaagg ggcttcttgt gccctcaccc cccttacccc tcaggtggct
     1681 tgggctgacc ccttcttggg tcagggggca ggggctgggt cagg
//
LOCUS       HUMGSTPI2     591 bp ds-DNA             PRI       07-JUL-1990
DEFINITION  Human glutathione S-transferase pi (GST-pi) gene, exon 6.
ACCESSION   X08095 M33083
KEYWORDS    glutathione S-transferase pi.
SEGMENT     2 of 3
SOURCE      Human, cell line HPB-ALL, lymphoid DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 591)
  AUTHORS   Cowell,I.G.
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 591)
  AUTHORS   Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B.
  TITLE     The structure of the human glutathione S-transferase pi gene
  JOURNAL   Biochem. J. 255, 79-83 (1988)
  STANDARD  simple staff_entry
COMMENT     See X08094 - X08096 for GST pi gene sequence; see X06547 for GST
            mRNA sequence.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               TRANSCR      <1   >591       primary transcript
               MSG         470    577       Exon 6
            
            [1]  Author address Cowell I.G., University College and Middlesex,
            School of Medicine, Windeyer Building, Cleveland Street, London,
            W1P 6DB, United Kingdom.
            
            Submitted (13-JUL-1988) on tape to the EMBL data library by:
FEATURES       from  to/span     description
    pept    +   470  +   577     glutathione S-transferase pi, exon 6
                                 /hgml_locus_uid="LQ0085R" /map="11q13-qter"
                                 /nomgen="GST3"
    IVS     <     1      469     GST3 intron E
    IVS         578  >   591     GST3 intron F
BASE COUNT      131 a    111 c    225 g    124 t
ORIGIN      About 300 bp after segment 1.
        1 tgggaccagt cagcagaggc agcgtgtgtg cgcgtgcgtg tgcgtgtgtg tgcgtgtgtg
       61 tgtgtacgct tgcatttgtg tcgggtgggt aaggagatag agatgggcgg gcagtaggcc
      121 caggtcccga aggccttgaa cccactggtt tggagtctcc taagggcaat gggggccatt
      181 gagaagtctg aacagggctg tgtctgaatg tgaggtctag aaggatcctc cagagaagcc
      241 agctctaaag cttttgcaat catctggtga gagaacccag caaggatgga caggcagaat
      301 ggaatagaga tgagttggca gctgaagtgg acaggatttg gtactagcct ggttgtgggg
      361 agcaagcaga ggagaatctg ggactctggt ggtctggcct ggggcagacg ggggtgtctc
      421 aggggctggg agggatgaga gtaggatgat acatggtggt gtctggcagg aggcgggcaa
      481 ggatgactat gtgaaggcac tgcccgggca actgaagcct tttgagaccc tgctgtccca
      541 gaaccaggga ggcaagacct tcattgtggg agaccaggtg agcatctggc c
//
LOCUS       HUMGSTPI3     499 bp ds-DNA             PRI       07-JUL-1990
DEFINITION  Human glutathione S-transferase pi (GST-pi) gene, exon 7.
ACCESSION   X08096 M33084
KEYWORDS    glutathione S-transferase pi.
SEGMENT     3 of 3
SOURCE      Human, cell line HPB-ALL, lymphoid DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 499)
  AUTHORS   Cowell,I.G.
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 499)
  AUTHORS   Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B.
  TITLE     The structure of the human glutathione S-transferase pi gene
  JOURNAL   Biochem. J. 255, 79-83 (1988)
  STANDARD  simple staff_entry
COMMENT     See X08094 - X08096 for GST pi gene sequence; see X06547 for GST
            mRNA sequence.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               TRANSCR      <1      ?       primary transcript
               MSG          17      ?       Exon 7
            
            [1]  Author address Cowell I.G., University College and Middlesex,
            School of Medicine, Windeyer Building, Cleveland Street, London,
            W1P 6DB, United Kingdom.
            
            Submitted (13-JUL-1988) on tape to the EMBL data library by:
FEATURES       from  to/span     description
    pept    +    17      205     glutathione S-transferase pi exon 7
                                 /hgml_locus_uid="LQ0085R" /map="11q13-qter"
                                 /nomgen="GST3"
    IVS     <     1       16     GST3 intron F
    signal      260      265     poly-A signal
    revision    127      128     tc in [1]; cg in [2]
BASE COUNT      114 a    140 c    141 g    104 t
ORIGIN      About 200 bp after segment 2.
        1 ctggcctgcc ctgcagatct ccttcgctga ctacaacctg ctggacttgc tgctgatcca
       61 tgaggtccta gcccctggct gcctggatgc gttccccctg ctctcagcat atgtggggcg
      121 cctcagcgcc cggcccaagc tcaaggcctt cctggcctcc cctgagtacg tgaacctccc
      181 catcaatggc aacgggaaac agtgagggtt ggggggactc tgagcgggag gcagagtttg
      241 ccttcctttc tccaggacca ataaatttct aagagaggct actatgagca ctgtgtttcc
      301 tgggacgggc ttagggcgtt ctcagcctcg aggtcggtgg gagggcagag cagaggacta
      361 gaaaacagct cctccagcca ccgtcagtgg cttctggagc cctcagcctg gctgtgttta
      421 ctcaaccatc acaaactaga agaggaagaa aaaaaaagag agagagtaca agagaaataa
      481 caatggtcct gtcctggag
//

GenBank-Updates@genbank.bio.net (07/10/90)

LOCUS       HUMCDR34     2412 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human cerebellar-degeneration-related antigen (CDR34) gene,
            complete cds.
ACCESSION   M31423
KEYWORDS    cerebellar-degeneration-related antigen.
SOURCE      Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2412)
  AUTHORS   Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A.,
            Chaganti,R.S.K., Posner,J.B. and Old,L.J.
  TITLE     Cerebellar degeneration-related antigen: A highly conserved
            neuroectodermal marker mapped to chromosomes X in human and mouse
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            Y.-T.Chen, 17-JAN-1990
FEATURES       from  to/span     description
    pept        503     1174     cerebellar-degeneration-related antigen
                                 (CDR34)
BASE COUNT      743 a    334 c    669 g    666 t
ORIGIN      
        1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct
       61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat
      121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg
      181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga
      241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg
      301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag
      361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg
      421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga
      481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt
      541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt
      601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga
      661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat
      721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt
      781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga
      841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct
      901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt
      961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga
     1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt
     1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat
     1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg
     1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa
     1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac
     1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa
     1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt
     1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact
     1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat
     1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct
     1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga
     1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct
     1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac
     1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg
     1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata
     1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa
     1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc
     2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt
     2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta
     2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga
     2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta
     2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat
     2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt
     2401 gagatgctgc ag
//
LOCUS       RATHGF       2485 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Rat hepatocyte growth factor mRNA, complete cds.
ACCESSION   M32987
KEYWORDS    hepatocyte growth factor.
SOURCE      Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2485)
  AUTHORS   Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M.,
            Shimizu,S. and Nakamura,T.
  TITLE     Deduced primary structure of rat hepatocyte growth factor and
            expression of the mRNA in rat tissues
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by T.Nakamura, 26-MAR-1990.
FEATURES       from  to/span     description
    pept        143     2329     hepatocyte growth factor
BASE COUNT      750 a    533 c    577 g    625 t
ORIGIN      
        1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc
       61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga
      121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct
      181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca
      241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa
      301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa
      361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc
      421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt
      481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa
      541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg
      601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct
      661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag
      721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat
      781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg
      841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga
      901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta
      961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc
     1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg
     1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca
     1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa
     1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat
     1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta
     1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg
     1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc
     1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg
     1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga
     1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca
     1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa
     1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc
     1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat
     1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct
     1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct
     1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac
     1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg
     2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt
     2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga
     2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt
     2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc
     2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa
     2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa
     2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca
     2461 ttaatatatg tggcgttttc tgttg
//
LOCUS       HUMINSGS      351 bp ds-DNA             SYN       10-JUL-1990
DEFINITION  Human (synthetic) insulin gene, complete cds.
ACCESSION   J02547 M25881
KEYWORDS    artificial gene; insulin.
SOURCE      Synthetic human DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 79 to 351)
  AUTHORS   Brousseau,R., Scarpulla,R., Sung,W., Hsiung,H.M., Narang,S.A. and
            Wu,R.
  TITLE     Synthesis of a human insulin gene: V. Enzymatic assembly, cloning
            and characterization of the human proinsulin DNA
  JOURNAL   Gene 17, 279-289 (1982)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 351)
  AUTHORS   Georges,F., Brousseau,R., Michniewicz,J., Prefontaine,G.,
            Stawinski,J., Sung,W., Wu,R. and Narang,S.A.
  TITLE     Synthesis of a human insulin gene: VII. Synthesis of
            preproinsulin-like human DNA, its cloning and expression in M13
            bacteriophage
  JOURNAL   Gene 27, 201-211 (1984)
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 351)
  AUTHORS   Narang,S.A., Brousseau,R., Georges,F., Michniewicz,J.,
            Prefontaine,G., Stawinski,J. and Sung,W.
  TITLE     The human preproinsulin gene: synthesis, cloning, gene
            modification, and expression studies
  JOURNAL   Can. J. Biochem. 62, 209-216 (1984)
  STANDARD  full staff_review
COMMENT     In places where the human insulin amino acid sequence is identical
            to the rat insulin amino acid sequence, the synthetic sequence
            follows the published nucleotide sequence for rat (see separate
            entry).
FEATURES       from  to/span     description
    pept          6      350     synthetic preproinsulin
    sigp          6       77     synthetic insulin signal peptide
    matp         90      179     synthetic insulin B-chain
    matp        186      278     synthetic insulin C-chain
    matp        285      347     synthetic insulin A-chain
BASE COUNT       65 a     93 c    100 g     93 t
ORIGIN      78 bp upstream of EcoRI site.
        1 aattcatggg cctatggatc cgtctactgc ctctgatcgc gctgctgatc ctctggggac
       61 cggatccagc tgcggccgaa ttccggatgt ttgtcaatca gcacctttgt ggttctcacc
      121 tggtggaggc tctgtacctg gtgtgtgggg aacgtggttt cttctacaca cccaagaccc
      181 gtcgtgaagc tgaagacctt caagtgggtc aagttgaact tggtgggggt cctggtgcgg
      241 gttctcttca acctttggct ctcgagggat cacttcaaaa gcgtggcatt gtggagcagt
      301 gctgcaccag catctgctcc ctctaccaac tggagaacta ctgcaactga g
//
LOCUS       TRFRRECF      212 bp ss-rRNA            RNA       10-JUL-1990
DEFINITION  Trypanosomatid (C.fasciculata) small rRNA e from the large
            ribosomal subunit.
ACCESSION   K02691 M25882
KEYWORDS    ribosomal RNA.
SOURCE      Trypanosomatid (C.fasciculata) ribosomal RNA.
  ORGANISM  Crithidia fasciculata
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Schnare,M.N., Spencer,D.F. and Gray,M.W.
  TITLE     Primary structures of four novel small ribosomal RNAs from
            Crithidia fasciculata
  JOURNAL   Can. J. Biochem. 61, 38-45 (1983)
  STANDARD  full staff_review
COMMENT     The large subunit of the ribosome of C.fasciculata contains six
            small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i)
            are found in ribosomes of other organisms.  rRNAs e,f,g, and j were
            analyzed by [1].  In rRNA e the number of "g" residues at positions
            91-93 and 123-124 were ambiguous.  At positions 77-81 three "c"s
            and two "t"s were found, but the order was unclear.  Positions 116
            and 119 gave strong "t" bands but also consistently gave weak bands
            in the "u-2" track.  [1] is not sure of the reason: sequencing
            artifact or an indication of cistron heterogeneity.  No evidence of
            this heterogeneity was found in chemical gels.
FEATURES       from  to/span     description
    rRNA          1      212     ribosomal RNA e
    modified    125      125     p (putative)
BASE COUNT       46 a     53 c     51 g     62 t
ORIGIN      5' end of mature rRNA e.
        1 tagtggaaat gcgaaacact tgccaggtga caaatcaatc ctcccacggt gagctttctt
       61 ttcaccataa tccacatctc cggctttgct gggcttgggc ctttttactt ctcgcgttgt
      121 tcggtgcggg ggcccaagat tgaaaaatgc agctctccct acgtactgtc attgttgtga
      181 gttctgcgca ttaaagcaaa aacctggggt gt
//
LOCUS       TRFRRFCF      183 bp ss-rRNA            RNA       10-JUL-1990
DEFINITION  Trypanosomatid (C.fasciculata) small rRNA f from the large
            ribosomal subunit.
ACCESSION   K02692 M25883
KEYWORDS    ribosomal RNA.
SOURCE      Trypanosomatid (C.fasciculata) ribosomal RNA.
  ORGANISM  Crithidia fasciculata
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 183)
  AUTHORS   Schnare,M.N., Spencer,D.F. and Gray,M.W.
  TITLE     Primary structures of four novel small ribosomal RNAs from
            Crithidia fasciculata
  JOURNAL   Can. J. Biochem. 61, 38-45 (1983)
  STANDARD  full staff_review
COMMENT     The large subunit of the ribosome of C.fasciculata contains six
            small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i)
            are found in ribosomes of other organisms. rRNAs e,f,g, and j are
            reported by [1].
FEATURES       from  to/span     description
    rRNA          1      183     ribosomal RNA f
BASE COUNT       41 a     49 c     57 g     36 t
ORIGIN      5' end of mature rRNA f.
        1 gtgagattgt gaagggatct cgcaggcatc gtgagggaag tatggggtag tacgagagga
       61 actcccatgc cgtgcctcta gtttctgggg tttgtcgaac ggcaagtgcc ccgaagccat
      121 cgcacggtgg ttctcggctg aacgcctcta agccagaagc caatcccaag accagatgcc
      181 ccc
//
LOCUS       TRFRRGCF      136 bp ss-rRNA            RNA       10-JUL-1990
DEFINITION  Trypanosomatid (C.fasciculata) small rRNA g from the large
            ribosomal subunit.
ACCESSION   K02693 M25884
KEYWORDS    ribosomal RNA.
SOURCE      Trypanosomatid (C.fasciculata) ribosomal RNA.
  ORGANISM  Crithidia fasciculata
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 136)
  AUTHORS   Schnare,M.N., Spencer,D.F. and Gray,M.W.
  TITLE     Primary structures of four novel small ribosomal RNAs from
            Crithidia fasciculata
  JOURNAL   Can. J. Biochem. 61, 38-45 (1983)
  STANDARD  full staff_review
COMMENT     The large subunit of the ribosome of C.fasciculata contains six
            small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i)
            are found in the ribosomes of other organisms. rRNAs e,f,g, and j
            are reported by [1].  There was some question whether rRNA g
            contained 135 or 136 bp, starting with base 1 or 2 in the sequence
            presented below.
FEATURES       from  to/span     description
    rRNA          1      136     ribosomal RNA g
BASE COUNT       31 a     37 c     40 g     28 t
ORIGIN      5' end of mature rRNA g.
        1 acaacgtccc tctccaaacg agagaatatg catgggctgg catgagcggc atgcttcact
       61 ccggtggggc tcgaggggca cttacgtccc gaggcgctga accttgaggc ctgaaatttc
      121 atgctctggg actaaa
//
LOCUS       TRFRRJCF       73 bp ss-rRNA            RNA       10-JUL-1990
DEFINITION  Trypanosomatid (C.fasciculata) small rRNA j from the large
            ribosomal subunit.
ACCESSION   K02694 M25885
KEYWORDS    ribosomal RNA.
SOURCE      Trypanosomatid (C.fasciculata) ribosomal RNA.
  ORGANISM  Crithidia fasciculata
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 73)
  AUTHORS   Schnare,M.N., Spencer,D.F. and Gray,M.W.
  TITLE     Primary structures of four novel small ribosomal RNAs from
            Crithidia fasciculata
  JOURNAL   Can. J. Biochem. 61, 38-45 (1983)
  STANDARD  full staff_review
COMMENT     The large subunit of the ribosome of C.fasciculata contains six
            small rRNAs (designated e,f,g,h,i,j) when normally only two (h,i)
            are found in the ribosomes of other organisms.  rRNAs e,f,g, and j
            are reported by [1].  There was some question whether rRNA j
            contained 72 or 73 bp, starting with base 1 or 2 in the sequence
            presented below.
FEATURES       from  to/span     description
    rRNA          1       73     ribosomal RNA j
BASE COUNT       17 a     23 c     14 g     19 t
ORIGIN      5' end of mature rRNA j.
        1 tcatcgaatc gccacctaca cgactggagc ttgctccctc gtcggcctct agtatattca
       61 tgatcacaag gta
//
LOCUS       YSCRGEA      1798 bp ds-DNA             PLN       10-JUL-1990
DEFINITION  Yeast (S.cerevisiae) 18S ribosomal RNA gene.
ACCESSION   J01353 M27607
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      Yeast (S.cerevisiae + D4) DNA, clones pY1rA3 and prYC.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1798)
  AUTHORS   Rubtsov,P.M., Musakhanov,M.M., Zakharyev,V.M., Krayev,A.S.,
            Skryabin,K.G. and Bayev,A.A.
  TITLE     The structure of the yeast ribosomal RNA genes. I. The complete
            nucleotide sequence of the 18S ribosomal RNA gene from
            Saccharomyces cerevisiae
  JOURNAL   Nucleic Acids Res. 8, 5779-5794 (1980)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1798; revises [1])
  AUTHORS   Mankin,A.S., Skryabin,K.G. and Rubtsov,P.M.
  TITLE     Identification of ten additional nucleotides in the primary
            structure of yeast 18S rRNA
  JOURNAL   Gene 44, 143-143 (1986)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    rRNA          1     1798     18S ribosomal RNA
    revision    943      943     a in [2]; g in [1]
    revision    962      962     a in [2]; g in [1]
    revision    982      983     ag in [2]; ga in [1]
    revision    988      999     tcgaagatgatc in [2]; tc in [1]
    revision   1002     1002     g in [2]; a in [1]
    revision   1122     1123     ag in [2]; aag in [1]
    revision   1742     1742     a in [2]; g in [1]
BASE COUNT      480 a    348 c    459 g    511 t
ORIGIN      9 bp upstream of Sau3A site.
        1 tatctggttg atcctgccag tagtcatatg cttgtctcaa agattaagcc atgcatgtct
       61 aagtataagc aatttataca gtgaaactgc gaatggctca ttaaatcagt tatcgtttat
      121 ttgatagttc ctttactaca tggtataacc gtggtaattc tagagctaat acatgcttaa
      181 aatctcgacc ctttggaaga gatgtattta ttagataaaa aatcaatgtc ttcggactct
      241 ttgatgattc ataataactt ttcgaatcgc atggccttgt gctggcgatg gttcattcaa
      301 atttctgccc tatcaacttt cgatggtagg atagtggcct accatggttt caacgggtaa
      361 cggggaataa gggttcgatt ccggagaggg agcctgagaa acggctacca catccaagga
      421 aggcagcagg cgcgcaaatt acccaatcct aattcaggga ggtagtgaca ataaataacg
      481 atacagggcc cattcgggtc ttgtaattgg aatgagtaca atgtaaatac cttaacgagg
      541 aacaattgga gggcaagtct ggtgccagca gccgcggtaa ttccagctcc aatagcgtat
      601 attaaagttg ttgcagttaa aaagctcgta gttgaacttt gggcccggtt ggccggtccg
      661 attttttcgt gtactggatt tccaacgggg cctttccttc tggctaacct tgagtccttg
      721 tggctcttgg cgaaccagga cttttacttt gaaaaaatta gagtgttcaa agcaggcgta
      781 ttgctcgaat atattagcat ggaataatag aataggacgt ttggttctat tttgttggtt
      841 tctaggacca tcgtaatgat taatagggac ggtcgggggc atcggtattc aattgtcgag
      901 gtgaaattct tggatttatt gaagactaac tactgcgaaa gcatttgcca aggacgtttt
      961 cattaatcaa gaacgaaagt taggggatcg aagatgatct ggtaccgtcg tagtcttaac
     1021 cataaactat gccgactaga tcgggtggtg tttttttaat gacccactcg gtaccttacg
     1081 agaaatcaaa gtctttgggt tctgggggga gtatggtcgc aaggctgaaa cttaaaggaa
     1141 ttgacggaag ggcaccacta ggagtggagc ctgcggctaa tttgactcaa cacggggaaa
     1201 ctcaccaggt ccagacacaa taaggattga cagattgaga gctctttctt gattttgtgg
     1261 gtggtggtgc atggccgttt ctcagttggt ggagtgattt gtctgcttaa ttgcgataac
     1321 gaacgagacc ttaacctact aaatagtggt gctagcattt gctggttatc cacttcttag
     1381 agggactatc ggtttcaagc cgatggaagt ttgaggcaat aacaggtctg tgatgccctt
     1441 agaacgttct gggccgcacg cgcgctacac tgacggagcc agcgagtcta accttggccg
     1501 agaggtcttg gtaatcttgt gaaactccgt cgtgctgggg atagagcatt gtaattattg
     1561 ctcttcaacg aggaattcct agtaagcgca agtcatcagc ttgcgttgat tacgtccctg
     1621 ccctttgtac acaccgcccg tcgctagtac cgattgaatg gcttagtgag gcctcaggat
     1681 ctgcttagag aagggggcaa ctccatctca gagcggagaa tttggacaaa cttggtcatt
     1741 tagaggaact aaaagtcgta acaaggtttc cgtaggtgaa cctgcggaag gatcatta
//
LOCUS       DROSHA1A     1473 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  D.melanogaster Sha12 protein mRNA, complete cds.
ACCESSION   M32660
KEYWORDS    .
SOURCE      D.melanogaster, cDNA to mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1473)
  AUTHORS   Butler,A., Wei,A. and Salkoff,L.
  TITLE     Shal, Shab, and Shaw: Three genes encoding potassium channels in
            Drosophila
  JOURNAL   Nucleic Acids Res. 18, 2173-2174 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1473)
  AUTHORS   Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L.
  TITLE     Diverse K+ currents expressed by a Drosophila extended gene family
            which is conserved in mouse
  JOURNAL   Science 248, 599-603 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Salkoff, 07-MAR-1990.
FEATURES       from  to/span     description
    pept          1     1473     Sha12 protein
BASE COUNT      302 a    434 c    432 g    305 t
ORIGIN      Chromosome 3 left arm at locus 76B.
        1 atggcctcgg tcgccgcttg gctgcccttc gcccgggcgg cggccatcgg gtgggtgccg
       61 atagccaccc acccactgcc accgcccccg atgcccaagg atcgccgcaa aacggacgac
      121 gagaagctcc tgatcaacgt ctccgggcgg cgcttcgaga cgtggcggaa tactttggag
      181 aagtatccgg acaccctttt aggttccaat gaaagggagt tcttctacga cgaggactgc
      241 aaagaatact tcttcgatcg ggacccggac atcttccggc acatactgaa ctactaccgg
      301 acgggcaagc tgcactaccc gaagcacgaa tgcctcacca gctacgacga ggagctggcc
      361 ttctttggaa taatgccgga tgtcattggc gattgctgct acgaggacta ccgggaccgg
      421 aagcgggaga acgcggagcg gctgatggac gacaagctgt cggagaacgg ggatcagaat
      481 ctgcagcagc tgaccaacat gcgccagaag atgtggcggg ccttcgagaa tccgcacacg
      541 tcgacgagcg ccctggtgtt ctactatgtt acgggtttct tcatcgccgt ctccgtgatg
      601 gccaacgtgg tggagacggt gccgtgtggc caccggccgg gcagagcggg aactctgccc
      661 tgcggcgagc gctacaagat cgtcttcttc tgcctggata ccgcctgcgt gatgatcttt
      721 acggcggagt acctacttcg actcttcgcc gcccccgatc gctgcaagtt cgtgcgctcg
      781 gtgatgagca ttattgatgt ggtggccatt atgccgtact acattggcct cgggatcacc
      841 gacaacgacg acgtgagcgg tgctttcgtc acgctgcgcg tgttccgtgt cttccgcata
      901 ttcaagttct cgcgccactc gcaaggactt cggatcctcg gctacacgct caagtcctgc
      961 gccagcgaac tgggcttcct tgtcttctcg ctggccatgg ccattatcat ctttgccacc
     1021 gtcatgttct acgccgagaa gaacgtcaat ggcaccaact tcacatcgat tccggcggcc
     1081 ttctggtata ccatcgtcac aatgacgacg ctgggatatg gcgacatggt gccagagaca
     1141 atagctggca aaattgtggg cggcgtctgc tcgcttagcg gtgtgctggt catcgcctta
     1201 cctgtacctg ttatcgtatc gaactttagt agaatctatc accagaacca gcgagcggac
     1261 aagcgcaagg cgcagcggaa agctcgcctg gcgcgcatcc gcattgccaa ggcctcgtcc
     1321 ggagccgcct ttgttagcaa gaagaaggcc gccgaggccc ggtgggctgc ccaggagtcg
     1381 ggcatcgagc tggatgacaa ctatcgggac gaggacatct tcgagctgca gcaccatcat
     1441 ttgctgcgat gtctggagaa gacaacgatg tag
//
LOCUS       DROSHABA     2778 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  D.melanogaster Shab11 protein mRNA, complete cds.
ACCESSION   M32659
KEYWORDS    .
SOURCE      D.melanogaster, cDNA to mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2778)
  AUTHORS   Butler,A., Wei,A. and Salkoff,L.
  TITLE     Shal, Shab, and Shaw: Three genes encoding potassium channels in
            Drosophila
  JOURNAL   Nucleic Acids Res. 18, 2173-2174 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2778)
  AUTHORS   Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L.
  TITLE     Diverse K+ currents expressed by a Drosophila extended gene family
            which is conserved in mouse
  JOURNAL   Science 248, 599-603 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Salkoff, 07-MAR-1990.
FEATURES       from  to/span     description
    pept          1     2775     Shab11 protein
BASE COUNT      679 a    784 c    788 g    527 t
ORIGIN      
        1 atggtcgggc aattgcaagg tggacaggct gctggccagc aacagcaaca gcaacaagcg
       61 actcagcaac agcaacactc gaagcagcag ctgcaacagc agcagcagca acagcagcaa
      121 ctgcaactca agcagcatca gcagcagcaa caggacatcc tgtatcagca acataacgag
      181 gcaattgcaa ttgcacgcgg actgcaggct gcaacacctg ccgacatcgg cgataatcag
      241 ccgtactacg atacaagcgg taatgtcgat tgggagcggg cgatgggagc cggtggagct
      301 ggtgcatatg gtggcatcgg catcggatct ctaccagcag ctggcggtgc tgcttatcac
      361 cttgggccag ctaatcccgc aggcctcgtt tctcgtcact tggattacgg tgatggcggc
      421 caccttgctg gcccatccgc cggtcttcct gctggagctg tgggatcagg agcaggagcg
      481 ggagccggtg cgggagcatc agtcacggga tcaggatcag gagcagggac aggaacagga
      541 accggagccg gatctggatc gggcagtgga gcagcaggca aggaagttcg ctacgcccct
      601 ttcccagtcg catcaccaac gcactcgatt cccacaacct cccagcagat cgttggcggc
      661 gtcggtggcg tgggcgtcgg tggtgccagc agccagtcga tttcgggcgg tgtacccacc
      721 cacagccaga gcaacaccac cggcgctctg cagcggacac attccagatc catgtcctcc
      781 ataccgccgc ccgagccgtt catgatagcc cagtcgaagg cggtcaacag ccgcgtgtcc
      841 atcaacgtgg gcggggtgag gcacgaggtc ctgtggagga cgctggagcg gctgccccac
      901 acgcggctcg ggcggctggg ggagtgcacc acccacgagg ccatcgtgga gctgtgcgac
      961 gactactcgc tggcggacaa cgagtacttc ttcgaccgac atccgaagag cttcagctcc
     1021 atcctgaact tctatcgcac cggcaagctg cacatcgtcg acgagatgtg cgtgctcgcg
     1081 tttggtgatg acctggagta ctggggcgtc gacgaactgt acctggagtc ctgctgccag
     1141 cacaagtacc accagcgcaa ggagaacgtt cacgaggaga tgcgtaagga ggccgagtcc
     1201 ctgcggcagc gcgacgagga ggaattcggc gaaggtaaat tctccgagta ccagaagtat
     1261 ctgtgggagc tcctcgagaa gcctaacact agtttcgccg cccgggttat cgcagtgata
     1321 tccatactat tcatagtcct gtctaccata gccctgacgt tgaacaccct accacaacta
     1381 caacacattg acaacggtac accacaggat aatccgcaat tggcaatggt tgaggccgtg
     1441 tgtatcacgt ggttcactct agagtacata cttaggttta gctcctcgcc ggacaagtgg
     1501 aagttcttta agggcggcct taacataatc gatctattgg caatactccc atactttgtt
     1561 tcgttatttc tattggaaac gaataagaat gcaacggacc agttccagga tgtgcgtcgg
     1621 gtggtgcagg tctttcgcat catgcgcatc ctgcgggtcc ttaagctggc ccgtcactca
     1681 acgggcctgc agtcgttagg ctttacgctg cgtaactcat ataaggaact cggtctacta
     1741 atgctgttcc tggccatggg cgttctcata ttttcttcgc tggcatattt tgccgaaaag
     1801 gatgaaaagg atacaaaatt cgtttcaata ccggaagcat tttggtgggc gggtattaca
     1861 atgacaactg ttggctacgg ggacatctgt cccacaactg cactgggaaa ggttattggt
     1921 actgtgtgtt gcatatgcgg tgttctggtg gtcgctttgc ctattcccat catcgttaac
     1981 aattttgctg aattttataa gaatcagatg cgccgcgaaa aggccctcaa gcgtcgcgag
     2041 gcactcgatc gtgccaagcg cgagggcagc attgtctcct tccatcatat caatctgaaa
     2101 gatgccttcg ccaagtccat ggatctcatc gatgtgattg tcgacacagg aaagcaaaca
     2161 aatgtcgtgc atccgaaggg taaaagacaa agcaccccca atataggcag gcagaccctc
     2221 gatgtgcaaa gcgccccagg ccacaatctc tcgcaaacgg acggcaacag caccgaaggc
     2281 gagtctacca gcggacgcaa tccggccacc accggaaccg gatgctataa gaattacgac
     2341 cacgtagcca acctgcgcaa ctccaacctg cacaaccgac gcggatccag ctctgagcag
     2401 gatgcagtgc cgccctacag cttcgacaat cccaatgccc gccagacctc aatgatggcc
     2461 atggagagct atcggcgcga cgaacaggca ctgctgcagc aacagcaaca gcagcagcaa
     2521 cagatgttgc agatgcaaca gattcagcag aaggccccga acggaaatgg aggtgcaacc
     2581 ggaggaggag tggccaacaa cctggccatg gtggccgcat caagtgccgg aacagccgtg
     2641 gccaccgcca ccaatgccag taatgccagc aataccgccc ccgggtcaga gggcgccgag
     2701 ggaggcgtga tggagatggg ggcggtgtcg atgacgacaa cctttcccag gccaagggac
     2761 tgcccatcca gatgatga
//
LOCUS       DROSHAWA     1497 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  D.melanogaster Shaw2 protein mRNA, complete cds.
ACCESSION   M32661
KEYWORDS    .
SOURCE      D.melanogaster, cDNA to mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1497)
  AUTHORS   Butler,A., Wei,A. and Salkoff,L.
  TITLE     Shal, Shab, and Shaw: Three genes encoding potassium channels in
            Drosophila
  JOURNAL   Nucleic Acids Res. 18, 2173-2174 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1497)
  AUTHORS   Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L.
  TITLE     Diverse K+ currents expressed by a Drosophila extended gene family
            which is conserved in mouse
  JOURNAL   Science 248, 599-603 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Salkoff, 07-MAR-1990.
FEATURES       from  to/span     description
    pept          1     1497     Shaw2 protein
BASE COUNT      344 a    426 c    390 g    337 t
ORIGIN      Chromosome 2 left arm at locus 24B-C.
        1 atgaatctga tcaacatgga ctcggaaaac agggtggtgc tcaatgtggg tggcattagg
       61 cacgaaacct acaaggccac gctgaagaag attccggcta cgcgattatc gcgattaaca
      121 gaggcgctgg ccaactatga tccgatactg aatgagtact tctttgatcg gcatccgggc
      181 gtcttcgcac aagtgctcaa ctattacaga actggaaagc tgcattatcc cacggatgtg
      241 tgcggtccgc tgtttgagga ggaattggag ttctggggcc tagactcgaa ccaagtggag
      301 ccctgctgtt ggatgaccta cacacagcat cgcgacaccc aggaaaccct agccgtactc
      361 gatcgtctcg atctggatac ggaaaaaccg tccgaagagg aattggcacg caaattcggc
      421 ttcgaggagg actactacaa aggcacaata tcctggtggc aggaaatgaa gccgcgcatt
      481 tggtccttgt tcgatgagcc ctacagttcc aatgcagcca agactattgg cgtggtttcg
      541 gtgttcttca tctgcatttc gatcctgtcg ttctgcctga agacccatcc cgatatgcgg
      601 gtgcccatcg tccggaacat tacagtgaaa actgcgaatg gaagtaatgg ctggtttttg
      661 gacaaaacgc agaccaatgc gcacatagcc ttcttctata tcgaatgcgt gtgcaatgcc
      721 tggtttacct ttgaaatatt ggtgcgcttt atctcatcgc cgaacaagtg ggagttcatc
      781 aagtcatctg ttaacatcat agactacata gcgacgctta gtttttatat cgatctagtg
      841 cttcagcggt tcgcatcgca cctggagaac gctgacatcc tcgagttctt ctcgatcatc
      901 cgcatcatgc gtctgttcaa gctgacgcgc cactcgtccg gactgaagat cctgatccag
      961 acgttccggg cctcggccaa ggagctgacc ctgctggtgt tcttcctcgt cctgggcatc
     1021 gtgatcttcg ccagccttgt ctactacgcg gagcgcatcc agcccaatcc gcacaacgac
     1081 ttcaacagca taccgctggg cctgtggtgg gccctggtca caatgaccac cgtcggctac
     1141 ggcgacatgg cccccaaaac ctacattggc atgttcgtgg gtgccctctg cgccctggcc
     1201 ggcgtactaa ccatcgcact gccagtgccc gtcatcgtca gcaacttcgc catgtactac
     1261 tcgcacacgc aggccagggc caaactgcca aagaagcgga gacgagtgct tcccgtcgag
     1321 cagccgcgcc agcccagact gccaggtgcc cctggtggtg tcagtggttg cggcaccccg
     1381 ggctcgggtc cccactccgg tccgatggga tccggcggaa ctggaccacg tcgcatgaac
     1441 aataaaacaa aggacctggt cagccccaag tcagatatgg ccttcagttt cgactaa
//
LOCUS       SUVSATA       332 bp ss-RNA             VRL       10-JUL-1990
DEFINITION  Subterranean clover mottle virus satellite RNA (virusoid) sequence.
ACCESSION   M33000
KEYWORDS    .
SOURCE      Subterranean clover mottle virus (isolated from Trifolium
            subterraneum) satellite RNA.
  ORGANISM  Subterranean clover mottle virus
            Viridae; ss-RNA nonenveloped viruses; 
            Velvet tobacco mottle virus group.
REFERENCE   1  (bases 1 to 332)
  AUTHORS   Davies,C., Haseloff,J. and Symons,R.H.
  TITLE     Structure, self-cleavage, and replication of two viroid-like
            satellite RNAs (virusoids) of subterranean clover mottle virus
  JOURNAL   Virology 177, 216-224 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.H.Symons, 20-MAR-1990.
FEATURES       from  to/span     description
    site         62       63     self-cleavage site
    site          1      120     high sequence homology with virusoid of
                                 subterranean clover mottle virus
    site        232      332     high sequence homology with virusoid of
                                 subterranean clover mottle virus
BASE COUNT       77 a     93 c     80 g     82 t
ORIGIN      
        1 agaggcatac cctcctcgcg gattttgaag gtgttctagc tacccaagta ttccacgctg
       61 tctgtacttg tatcagtaca ctgacgagtc cctaaaggac gaaacagcgc accgcaatct
      121 acgtataccc cgattcgact tgcttggagc aagcgttcga cagagtgccg cgcctggaat
      181 gacgcggttc tggccacact cacccgggag gccatcgggc ggattatact agttgtcaag
      241 gacctgtcgt tagttctact atacattact acactacgtg ttacttgtta ggtggcccca
      301 cctcactttc gtgaaggcta gagaacgtcc ac
//
LOCUS       SUVSATB       388 bp ss-RNA             VRL       10-JUL-1990
DEFINITION  Subterranean clover mottle virus satellite RNA (virusoid) sequence.
ACCESSION   M33001
KEYWORDS    .
SOURCE      Subterranean clover mottle virus (isolated from Trifolium
            subterraneum) satellite RNA.
  ORGANISM  Subterranean clover mottle virus
            Viridae; ss-RNA nonenveloped viruses; 
            Velvet tobacco mottle virus group.
REFERENCE   1  (bases 1 to 388)
  AUTHORS   Davies,C., Haseloff,J. and Symons,R.H.
  TITLE     Structure, self-cleavage, and replication of two viroid-like
            satellite RNAs (virusoids) of subterranean clover mottle virus
  JOURNAL   Virology 177, 216-224 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.H.Symons, 20-MAR-1990.
FEATURES       from  to/span     description
    site         63       64     self-cleavage site
    site          1      120     high sequence homology with virusoid of
                                 subterranean clover mottle virus
    site        286      388     high sequence homology with virusoid of
                                 subterranean clover mottle virus
BASE COUNT       97 a    106 c     91 g     94 t
ORIGIN      
        1 agaggcatac cctcctcgcg gattttgaag gtgtttcagc tacccaaagt attccacgct
       61 gtctgtactt atatcagtac actgacgagt ccctaaagga cgaaacagcg caccgcaact
      121 tggccagacc tcgccaatca cccccacacc aagccaaaaa ccggtcccca acgcagttta
      181 gtatcaagtc gtcgcatcca cgctcccgag ggaggaagtt tgcgccttga ggttctgcac
      241 ggtcgtggta acaggaaaag tgttggaatg tttgaaggtc ttgcggttgt caaggaccaa
      301 gtcgttagtg ttactatata ttactaccct acgtgttact ttgttaggtg gccccacctc
      361 actttcgtga aggctaggaa acgtccac
//
LOCUS       BOVCYP4SC    1073 bp ss-mRNA            MAM       10-JUL-1990
DEFINITION  Bovine cytochrome P450-scc mRNA fragment.
ACCESSION   M25920
KEYWORDS    cytochrome P450-scc.
SOURCE      Bovine adrenal gland, cDNA to mRNA, clone pBA644.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1073)
  AUTHORS   Chung,B.-C., Matteson,K.J., Morin,J.E., Mellon,S.H. and Miller,W.L.
  TITLE     An approach to the molecular biology of congenital adrenal
            hyperplasia
  JOURNAL   Ann. N.Y. Acad. Sci. 458, 238-251 (1985)
  STANDARD  simple staff_entry
COMMENT     The coding region for cytochrome P450-scc was not indicated in [1].
BASE COUNT      259 a    243 c    193 g    378 t
ORIGIN      
        1 taagtctgaa ttttgcaata aggaactcat gatttgaatt acagtcagct cccattcctg
       61 tttttgctga ctatatagag ccttctccat ttttggctgc aaaacatata atcagtctga
      121 tttggtattt atcattttgt gacataatgt gtaagagtgc ctcgtctgtt tggaaaaggt
      181 agtttctatg accagtgtgt ctcttggcaa actctgttaa cctttgtctc accacttcat
      241 tttgtattcc aaggcctttg tttctctgtt tctccaggta tctcttgact tcctactttt
      301 accttccaat cctctaggat gaaaaggaca tctttttttt tttttttggt gtagttctag
      361 aaggtcttca tagaaagggt caacttcaac ttcttaggca tcagtggtta gggcatatac
      421 ttggattact gtaatgttaa atggtttgct ttggaaacta accaagatca ttctgttgct
      481 tttgagattg cacccaaata ctgcattttg gactcttctg tttactatga ggactactcc
      541 atttaatcta aaggattctt aggccacaat agtagatata atggtcatct gaattattat
      601 aaatttatca attttcttcc attttagttc actgaattct aacttattga tgcttcattc
      661 ttgccatctc ctgcttgacc atgtttttta ccttgattca tggacctgac attccaggtt
      721 cctatgcaat attattctgt atagtgtcag acttactttc accaccagac atatccacaa
      781 ctgtatatca tttccgtttt ggcccagctg cttcactttt tctggaacta ttcatatctg
      841 ccctccactc tttcccaata gcatattgga cacattctcg aacacaggga gccgggggac
      901 aggtgctggt ttcttctggc acacctgggg cagctgaaca cagtgttgac tggcagacac
      961 agccccacac caaacgctcg ctaacactga cactgttccc gtgatggcca gggagccccc
     1021 tccccaaaaa cctgctcctg gaagctggca ggatttgtgc cattcataag ggt
//
LOCUS       BOVCYPC21     920 bp ss-mRNA            MAM       10-JUL-1990
DEFINITION  Bovine cytochrome P450-c21 mRNA fragment.
ACCESSION   M25921
KEYWORDS    cytochrome P450-c21.
SOURCE      Bovine adrenal gland, cDNA to mRNA, clone pBA4.8.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 920)
  AUTHORS   Chung,B.-C., Matteson,K.J., Morin,J.E., Mellon,S.H. and Miller,W.L.
  TITLE     An approach to the molecular biology of congenital adrenal
            hyperplasia
  JOURNAL   Ann. N.Y. Acad. Sci. 458, 238-251 (1985)
  STANDARD  simple staff_entry
COMMENT     The coding region for cytochrome P450-c21 was not indicated in [1].
BASE COUNT      185 a    278 c    203 g    254 t
ORIGIN      
        1 gttcagatgc tgtgtcccat tgggaaagtt cagcaggtta ccagggccac ggcctcagtc
       61 atcctcagaa tcgctgtccc tcttggcagg gacagagcac cgcaccgcag acagcagcac
      121 gtcttccacg ggcttcttgg gattctcctc caggctcgtc ttgatggctc cagactcaga
      181 gcaacttcca ctccaactcg tccaaagtca ggttcatgcc accaaacacc agaggtccgg
      241 ataactgagc cttgatgtca ccttcaaggt acacaaatac cgtggcagat tcctatcagg
      301 gtaactgggt atgcaggtgg ttgaaatggc tttgataaac ttgacatcag gaaacttcct
      361 ggcgaggtgc actcaagtgc tgatttatca gggcacagag gggaatccct tgtttgtaaa
      421 ggtgcaggat gacccataag ccctcaccag ctttggtaac ttcttgaaca taatcctttc
      481 cagagatttc caaaacctct ccaaatttgt tcttcagttg ggtcgctttc cattcggcca
      541 gcctttgctg cctgtacatt tcaattgcac gttcgtcttc ctcattaaat tcgtcttcat
      601 tatcctccag ttcttccaaa gtcatgtctt catatgtttt cacaatggac tgctggagga
      661 tccgctgctc ctcttcttct gcctccttct ccagatcttt caaatcttcc tttgaaggca
      721 agatgccttt tttgcgtaag atgtcattcc actcggtgtc tgcgttgggg tcctgcattt
      781 tctgtcaaat cgctagggcc ctgccggcca cagccacccg gcccgtgagc tctctaccgc
      841 gcacgcaggc gccactcgcc tcctctccca gcctgccctg agatctcgtc cgcccgttgg
      901 ccctccttct cttggcgccg
//
LOCUS       MUSINT4      3000 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse proto-oncogene Wnt-4 protein mRNA, complete cds.
ACCESSION   M32502
KEYWORDS    Wnt protein; proto-oncogene.
SOURCE      Mouse (strain BALB/c) 8.5 day old embryo, cDNA to mRNA, (library of
            B.Hogan).
REFERENCE   1  (bases 1 to 3000)
  AUTHORS   Roelink,H., Wagenaar,E., Lopes da Silva,S. and Nusse,R.
  TITLE     Wnt-3, a gene activated by proviral insertion in mouse mammary
            tumors, is homologous to int-1/Wnt-1 and is normally expressed in
            mouse embryos and adult brain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 4519, 4523 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            H. Roelink, 03-MAR-1990.
FEATURES       from  to/span     description
    pept         46     1113     Wnt-4 protein
BASE COUNT      703 a    789 c    787 g    721 t
ORIGIN      Chromosome 11.
        1 cctcttcatg atcgccggca aacttcctcc tcggcgctgc ttctaatgga gccccacctg
       61 ctcgggctgc tactcggcct cctgctcagt ggcaccaggg tcctcgctgg ctacccaatt
      121 tggtggtccc tggccctggg ccagcagtac acatctctgg cctcccagcc tctgctctgc
      181 ggctccatcc caggcctggt ccccaagcaa ctgcgcttct gccgcaatta catcgagatc
      241 atgcccagcg tagcagaagg tgtgaagctg ggcatccagg agtgccagca tcagttccgg
      301 ggccgccggt ggaactgtac caccatagat gacagcctgg ccatctttgg gcctgtcttg
      361 gacaaagcca cccgtgaatc ggccttcgtg catgccatcg cctcggctgg tgtcgccttc
      421 gcagtcacac gctcctgcgc tgagggaacc tccaccatct gcggctgtga ctcacatcat
      481 aaggggccac ctggagaagg ctggaagtgg ggcggctgca gcgaggacgc cgacttcggg
      541 gtgctggtgt cccgggaatt tgcggatgcg cgggagaaca ggccagatgc ccgctcagct
      601 atgaacaagc acaacaatga agcaggccga acgaccatcc tggaccacat gcacctaaag
      661 tgtaaatgcc acgggttgtc cggcagctgc gaggtgaaga cctgctggtg ggcccagccc
      721 gacttccgtg ccattggcga cttcctcaag gacaagtacg acagtgcctc cgagatggtg
      781 gtggagaaac accgtgagtc ccgaggctgg gtggagaccc tgcgggctaa gtacgcgctc
      841 ttcaagccac ccaccgagag ggacctggtc tactacgaga actcccccaa cttttgtgag
      901 cccaacccag agacgggctc ctttggtacc agggaccgga cttgcaatgt cacctcccac
      961 ggcatcgatg gctgcgatct gctgtgctgt ggccggggcc acaacacgag gacggagaaa
     1021 cggaaggaga aatgccattg cgtcttccac tggtgctgct atgtcagctg ccaagagtgt
     1081 attcgcatct acgatgtgca cacctgcaag tagtgagcca gggcactggg aaggggtaga
     1141 ttgtgcggct ggatccattc atcgaagtcc catgagaagc aggatctaga tccaggccag
     1201 ccttcggcac tggccagcaa ggagcatgga ctgttgccag ctgcatgtga taaacgacct
     1261 ggacccagcc ggcctcggac ggacgggcgg cttctttctc aactaacgtc tctccccctg
     1321 ctctggatgg tgtacggctt tacagagggg ctttctttat ggttttacca gggtctgctg
     1381 gggacagact cgaggcttac ctttgcacat gttaaagaaa ataaaaatga aaaaaaaaaa
     1441 tctaccgcaa cagaacaggc tgggctagtg tgagctcttg gcctggtggg aaggacaaga
     1501 ccatggcgag attctgtgtc caagctgcct ctactcgtga cattccaaga tgcctctgag
     1561 gtgggaactg tgaagtagga cagagccccg cagtcccctc ttgtccgtcg actcccattt
     1621 aaattggaca taccttgtcg ttctgagaaa agccatagat aggtgtagct gggatgtagt
     1681 gatggggagg cccctggcca acagtgggag caagatcttg agttttgaag acctcagagt
     1741 tctgggcggc ctgggaagcc atctgcagaa cagagttcct tgtgggctcc tgttttcgct
     1801 agccctgttc tgccctggag cgacagtcag atctccacgc ccctttctgt tgttctacag
     1861 tgtccacctt tactacgcgt tttttttttt tttttcatga tgaccttgta aataggtcag
     1921 atgtggaggc aggtctcttc tggctccatc caccacaccc agaaagaatg ggctgctctg
     1981 cccttctcag ccttgctaac cagcagacac cgaggagagc agcggggcac cttagagagc
     2041 aatctaaaca tggttggcag gtggggaggg taaagagtcc cacttccttt gtgttagaag
     2101 gcagactacc ctgcgtcctt ttctcccatt ggctgaagta accagaaaga caagagatcc
     2161 ttaacaagcc cttcttccca cttgtaaaag ggatagccta tctcagttcc caaggatctg
     2221 gattagatag atattcaaaa gaggcaagca gcgaatggag gcagctccca gctctgttcc
     2281 cgacgcatga tggtactggc tgggtttagt aaggtgggtg gggctgcacg gatcaatcca
     2341 tcaactccgt cttaaggaga atcagaaaga ggagataaaa tgggggaatg gggcagaaca
     2401 aagaatttgt cctttcccgc ttctgtctag ggtctgctaa tgctggcttg acgaggggtc
     2461 agccacttct ttcctgttgt gcagttggct tgccaagcag gctccagtag gcccttgcct
     2521 gcactctcta ccatgtgacc atgagcactg ctctagggac acctcccatc ccttcctagc
     2581 accccaaatg ccccttccca tctctccttc cagaagttgg aaatcaagtc aactggataa
     2641 cgcttgtgtg agacacttga gcagaacgga tacaacaatt tacaagtctc ttcatatcta
     2701 tgtattctat attaaaagtg ataaagtcat gtttccgggg cgtattcaag tagctgacaa
     2761 gtaattattt aataatagta catgagcgca ttgtaattat cctcgccata gtcaggtaat
     2821 agcatccaat gggaggtccc taccaacctg ctgtatccaa agttttgtaa aaagttgtag
     2881 aagttgttga tctttttgat tttatattca aaaagtctct ttttataaat attatttatt
     2941 atacaatgta tatacctttg agttaactaa gattatatat tatataaata tatatatatt
//
LOCUS       DRONCDA      2294 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  D.melanogaster non-claret disjunctional protein (ncd) mRNA,
            complete cds.
ACCESSION   M33932
KEYWORDS    non-claret disjunctional protein.
SOURCE      D.melanogaster (strain dp cl cn bw) 0-4 hr embryo, cDNA to mRNA,
            clone pNB40.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2294)
  AUTHORS   Mcdonald,H.B. and Goldstein,L.S.B.
  TITLE     Identification and characterization of a gene encoding a
            kinesin-like protein in Drosophila
  JOURNAL   Cell 61, 991-1000 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.B.Mcdonald, 30-APR-1990.
FEATURES       from  to/span     description
    pept        111     2168     non-claret disjunctional protein
    site        180      319     alpha helical domain
BASE COUNT      622 a    661 c    603 g    408 t
ORIGIN      1 bp upstream of EcoRI site; chromosome 3 map position 99BC.
        1 gaattgataa aatcggttgc aaggaggcag acgtatcttc taagttaggc acaacacagt
       61 tggcgatgga atcccggcta ccgaaaccgt cgggcctgaa gaaaccccaa atgccgatta
      121 aaaccgtgct gcccacagat cgaattcgcg caggattggg aggtggagcc gctggagcag
      181 gcgccttcaa tgtcaatgcc aaccagacat actgcggcaa cttattgccg cccctctcaa
      241 gggacctcaa caatctgccc caggtgctgg agcgtcgcgg aggaggagca cgtgccgcct
      301 ccccagagcc catgaagttg ggccaccggg ccaagctgag acgtagccgt agcgcttgcg
      361 acatcaacga actgcgtggt aacaagcgca ctgcggctgc tccttcattg cccagcattc
      421 ccagcaaagt atcccgcctg ggcggtgcac tcactgtttc cagccagcga ctagtgcgtc
      481 ctgcggcgcc ttcgtcaata acagcaacag ctgtcaaaag accaccagta acgcgtcctg
      541 ctccacgggc tgcaggagga gcagccgcca agaaaccagc aggaacagga gcagcagctt
      601 cgtcaggagc cgcggctgct gctcccaagc gcatcgctcc ctacgacttc aaggcccgct
      661 tccacgatct gctagagaag cacaaggtgc ttaagacaaa gtacgaaaag caaacagagg
      721 acatgggcga gctggagtcc atgcctcagc aactggagga gacgcagaac aagcttatcg
      781 agacggagtc ctcgctgaag aacacccaga gcgacaacga gtgtcttcag aggcaggtga
      841 agcagcatac cgccaaaatt gaaacaatca catcgacgct gggcaggacc aaagaggagc
      901 tatccgagct gcaagcaata catgagaaag taaaaacgga gcatgctgct ctaagcacag
      961 aagtggtgca tctgcgccag cgcaccgagg aactcctgcg ctgcaatgag cagcaggccg
     1021 ccgagctgga gacctgcaaa gagcagctct tccagtcgaa catggagcgc aaagagctgc
     1081 acaacacggt catggacctg cgcggcaaca tccgggtctt ctgtcgaata cgaccgccgc
     1141 tggagtccga ggagaaccgt atgtgttgca cctggaccta tcacgacgag tccaccgtgg
     1201 agctgcagag cattgacgca caggccaaaa gcaagatggg ccagcagatc ttctcattcg
     1261 accaggtctt ccacccgctc tcctcgcagt cggacatctt cgagatggtc tcgccgctca
     1321 tccagtcggc cctggatggc tacaatatct gcatctttgc ctacggacag acgggcagtg
     1381 gcaagaccta cacaatggac ggagtgccgg agagtgtggg cgtcataccg cgcacggtgg
     1441 atctgctctt cgactccatc cggggatatc gcaacttggg ctgggagtac gagatcaagg
     1501 ccacctttct ggagatctac aacgaggtgc tctacgatct gctgagcaac gagcagaagg
     1561 acatggagat tcgaatggcc aagaacaaca agaacgacat ctacgtgtcc aacataacgg
     1621 aggagacggt tctggatcca aatcacctgc gccacctcat gcacacggcc aagatgaacc
     1681 gtgccaccgc ctcgacagct ggcaacgagc gctcctctcg ttcccacgcg gttaccaagc
     1741 ttgagctcat cggacgccat gccgaaaagc aagagatctc cgtgggttcc ataaacctgg
     1801 tggatttggc cggctctgag tctcccaaga cgagcacccg gatgaccgag acaaagaaca
     1861 tcaatcgctc gctatcggag ctcaccaacg taatcctggc gctgctgcag aagcaggacc
     1921 acatcccgta caggaactcc aagctgacgc acctgctgat gccctcgctg ggcggcaact
     1981 cgaaaacgct tatgttcatc aacgtctcgc cgttccaaga ctgtttccaa gagtccgtca
     2041 agtcgctgcg cttcgcggcc tccgtaaact cctgcaaaat gaccaaggcc aagcggaatc
     2101 gctacctgaa caactcggtg gccaacagca gcacacagag caacaacagc ggcagtttcg
     2161 ataaataaag aatgcattct gagcccagtt ttaacaattt tcaaatttct aacctgttat
     2221 tgcttaattt atgtgtgttt acttttagtg caaataaact aataaagtgc tggaaaaaaa
     2281 aaaaaaaaaa aaaa
//
LOCUS       YSCVPS1A     2457 bp ds-DNA             PLN       10-JUL-1990
DEFINITION  S.cerevisiae GTP-binding protein (VPS1) gene, complete cds.
ACCESSION   M33315
KEYWORDS    GTP-binding protein.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2457)
  AUTHORS   Rothman,J.H., Raymond,C.K., Gilbert,T., O'Hara,P.J. and
            Stevens,T.H.
  TITLE     A putative GTP binding protein homologous to interferon-inducible
            Mx proteins performs an essential function in yeast protein sorting
  JOURNAL   Cell 61, 1063-1074 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.O'Hara, 02-APR-1990.
FEATURES       from  to/span     description
    pept        318     2432     GTP-binding protein (VPS1)
BASE COUNT      812 a    454 c    496 g    695 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcgata gatacttgaa tcctctaata gtcgaaaaat gctcgagggt aaaccacttg
       61 tgcgcttgga ctggcctagt ttccaaaacc aatgttctaa tggattgatt tcttccccaa
      121 acattattaa gtggccgggt cacccaaaga cttgggcgcc gttgattcgc gtcgctttgc
      181 catcaagaga acaacatatc ttccaagaca gaccgagata attcatctat ttactcctaa
      241 aaaagaatta gagaggcctt ttatagcacc aaaataagga ccgtacgaaa actgcacatt
      301 ttatattatc agatatcatg gatgagcatt taatttctac tattaacaag cttcaggacg
      361 ctttggcgcc cttaggagga ggatctcaat ctcctattga tttaccacag atcaatgttg
      421 tcggttccca gtcgtcagga aagtcgtccg ttttggagaa cattgttggt agggatttct
      481 tgccaagagg tactggtatt gtcaccagga gacctttagt gttacaattg attaatagga
      541 gaccaaaaaa gtcagaacat gctaaagtaa accaaactgc taatgaattg attgacttga
      601 acatcaacga tgatgacaag aaaaaggatg aatcaggaaa gcaccaggaa gagggacaat
      661 ctgaagacaa taaagaggaa tggggtgaat ttttgcattt acctggtaag aagttttata
      721 attttgacga aattagaaag gaaatcgtca aagaaactga caaagtgaca ggtgccaatt
      781 caggtatttc ttctgtgccc attaacttga gaatttattc tccgcatgtt cttactttga
      841 cgttagtgga tttgcctggg ttgacgaagg ttcccgtagg tgaccaacct cctgatattg
      901 aaagacaaat taaggacatg ttgttaaagt atatttcgaa accaaacgct atcatattat
      961 ctgttaatgc cgctaacacc gatttagcca acagcgatgg tttgaagctg gctagagagg
     1021 tcgatccaga aggaacgaga actattggtg tcttgacaaa agtcgatttg atggatcaag
     1081 gtacagatgt catagatatt ttggctggaa gagtcattcc tttgagatat ggttatatcc
     1141 cagttatcaa tagaggtcaa aaggatattg aacacaaaaa aacaatcaga gaagcccttg
     1201 aaaacgaaag aaaatttttt gagaaccatc cctcttacag ttctaaagct cattactgtg
     1261 gtacaccata tttggctaaa aagttaaact caatcttatt acaccacatt aggcaaactc
     1321 tgccagaaat caaagcgaaa atcgaagcca cattgaaaaa atatcaaaac gaacttataa
     1381 acttgggccc agaaactatg gattcagcta gttcggttgt tttgagcatg attactgatt
     1441 tttccaatga atatgccggt atcttggacg gtgaggcgaa ggagctttcc agtcaggaac
     1501 tttctggtgg tgctagaatt tcttacgtat tccatgaaac tttcaaaaat ggtgtagact
     1561 ctttggatcc attcgaccag atcaaagatt ctgatatcag aaccattatg tacaatagtt
     1621 caggttctgc cccatctttg tttgtcggta ccgaagcttt tgaagtttta gttaaacagc
     1681 aaattagaag atttgaagaa ccatctctac gtttagttac tctggtgttt gatgaacttg
     1741 ttcgtatgct aaaacagatt atttcacaac caaagtactc aaggtatcct gctctaagag
     1801 aagcgatttc taatcagttc attcagttct taaaggatgc tactattcct acgaatgagt
     1861 ttgttgtcga tatcatcaaa gctgaacaaa cttacatcaa tacagcccat cccgaccttt
     1921 tgaagggttc tcaagcaatg gttatggtgg aagaaaaatt acatcctcgc caagtcgctg
     1981 ttgacccaaa gacgggtaaa ccattaccaa cccaaccatc gtctagtaag gcgccagtta
     2041 tggaagagaa atcaggattt tttggtgggt tcttctccac taaaaacaag aagaaattgg
     2101 cagctttgga atccccacct cctgttttaa aagctactgg ccaaatgaca gagagggaaa
     2161 caatggaaac agaagtaatc aagttgttga ttagtagtta tttctctatt gtcaaaagaa
     2221 ccattgccga tattatacca aaggctttga tgcttaaatt gattgtgaaa agtaaaactg
     2281 atattcagaa agttttactc gaaaaacttt acggaaagca agatattgaa gaattaacga
     2341 aagaaaacga cataaccatt caaagaagaa aagaatgtaa gaagatggtc gagatattga
     2401 gaaacgctag tcaaatcgtc tcctctgttt aggttttcct catctatacc ggtcgac
//
LOCUS       R75RELAX       99 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  Plasmid R751 relaxation region.
ACCESSION   M33118
KEYWORDS    .
SOURCE      Plasmid R751 DNA.
  ORGANISM  Plasmid R751
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 99)
  AUTHORS   Pansegrau,W., Ziegelin,G. and Lanka,E.
  TITLE     The origin of conjugative IncP plasmid transfer: Interaction with
            plasmid-encoded products and the nucleotide sequence at the
            relaxation site
  JOURNAL   Biochim. Biophys. Acta 951, 365-374 (1988)
  STANDARD  simple staff_entry
BASE COUNT       32 a     26 c     22 g     19 t
ORIGIN      
        1 gaataaggga cagtgaagat agataaccgg ctcgccggtt agctaacttc acacatcctg
       61 cccgccttac ggcgttaata acaccaagga aagtctaca
//
LOCUS       RP4RELAX       99 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  Plasmid RP4 relaxation region.
ACCESSION   M33117
KEYWORDS    .
SOURCE      Plasmid RP4 DNA.
  ORGANISM  Plasmid RP4
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 99)
  AUTHORS   Pansegrau,W., Ziegelin,G. and Lanka,E.
  TITLE     The origin of conjugative IncP plasmid transfer: Interaction with
            plasmid-encoded products and the nucleotide sequence at the
            relaxation site
  JOURNAL   Biochim. Biophys. Acta 951, 365-374 (1988)
  STANDARD  simple staff_entry
BASE COUNT       27 a     28 c     28 g     16 t
ORIGIN      
        1 gaataaggga cagtgaagaa ggaacacccg ctcgcgggtg ggcctacttc acctatcctg
       61 cccggctgac gccgttggat acaccaagga aagtctaca
//
LOCUS       ACCTRPF      1466 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  A.calcoaceticus 5'-phosphoribosyl anthranilate isomerase (trpF)
            and tryptophan synthase (trpB) genes, complete cds and 5' end.
ACCESSION   M34485
KEYWORDS    5'-phosphoribosyl anthranilate isomerase; tryptophan synthase.
SOURCE      A.calcoaceticus DNA.
  ORGANISM  Acinetobacter calcoaceticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1466)
  AUTHORS   Ross,C.M., Kaplan,J.B., Winkler,M.E. and Nichols,B.P.
  TITLE     An evolutionary comparison of Acinetobacter calcoaceticus trpF with
            trpF genes of several organisms
  JOURNAL   Mol. Biol. Evol. 7, 74-81 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        506     1147     5'-phosphoribosyl anthranilate isomerase
    pept       1149  >  1466     tryptophan synthase (trpB) (gtg start codon)
BASE COUNT      430 a    297 c    334 g    405 t
ORIGIN      
        1 gatcaagttt agttgcatct gttgaatcat cagcaaaaac agttgttgaa gaaaacccca
       61 ttgcaattgc aatcgccccc actaaacggg taggctgaaa agaaatagac atgtattgtg
      121 ctccatacat tcaccccacg tgaatgattg agtggataga tgtaacaagc aggtctccgg
      181 actcaaatgg catctcaaaa agagacaagc atattcacct tcccacatct atgcatgcag
      241 tggcgtaagt ctaaatgact tttttaatat ggtttacatt tttaccgttg cgggggcagc
      301 actggatttg caccagtttc cctaaagcga atgcttttaa cttgttacga attgtgtaaa
      361 gtataaagtc tgagcgaaga ttaaacaatc tgaatacgat caaattcgtt caactttgac
      421 gcaaagcaca aaaattgcat tacaatactt agcccaatga tggatagatc ggctgtctgt
      481 caggcaatac aatgagcttc tttctatgcg aacgcgcgca aaaatttgcg gtattacccg
      541 ttcccaagat gtccaagcag cagtaagtgc aggtgcagat gccattggac tggttttttt
      601 cccaccaagt cctcgacatg tttctatagc gcaagcgcaa gcattgctcc agcatattcc
      661 cgcttatgtt caggtggttg gtttatttgt gaatgcaact gcggatcaaa tcaaatcagt
      721 gcttgattgt gtggctttgg atgtattaca actacatggc gatgaaacgc ctgagcaatg
      781 tcaagagatt gctctgcagt gcaagcgtcg ctggtataaa gccattcaag ttaaaccaga
      841 gcttgatgta gttgatgaag ttcagcgtta tcaggccgct ggtgcaagtg cggtattgct
      901 ggatgcgtgg catccagagc tcaaaggtgg aactggtcat caatttgatt ggtcgaagtt
      961 tcccaagctg gatattccac ttattcttgc aggcggttta acgcctgaaa atgttgtaga
     1021 tgccattcaa accacacacg cttttgcagt ggatgtgagc ggaggggtag aggccgcaaa
     1081 aggtattaaa gataaacaac tcatcgaacg atttatgcaa ggagtccaat gtggatcagc
     1141 aaaataacgt gattgactat acgcaatatc cagatgctcg tgggcatttt ggtattcatg
     1201 gcggacgttt tgtatcagaa acacttatgg cggcacttga agatttagaa aatctttaca
     1261 accgcatgaa aaatgacgaa cagtttctgg cagaatttga ccgcgatctt gcctattatg
     1321 taggtcgtcc tagtccactt tattatgctg aacgatggtc aaagaagctc ggtggtgcgc
     1381 aaatttactt aaaacgtgaa gacctgaatc atacaggttc acacaaagtt aataacacca
     1441 ttggtcaggc attattggcc aagctt
//
LOCUS       BCIGLCA      2316 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  B.circulans beta-1,3-glucanase A1 (glcA) gene, complete cds.
ACCESSION   M34503
KEYWORDS    beta-1,3-glucanase.
SOURCE      B.circulans (strain WL-12) DNA, clone pNT003.
  ORGANISM  Bacillus circulans
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2316)
  AUTHORS   Yahata,N., Watanabe,T., Nakamura,Y., Yamamoto,Y., Kamimiya,S. and
            Tanaka,H.
  TITLE     Structure of the gene encoding beta-1,3-glucanase A-1 of Bacillus
            circulans WL-12
  JOURNAL   Gene 86, 113-117 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        241     2289     beta-1,3-glucanase A1 (glcA)
BASE COUNT      705 a    489 c    538 g    584 t
ORIGIN      
        1 ggaaattcaa cccacagagt atcgacaaat gatgcgccaa aacgtagaac gtgaagtaca
       61 ataccacagt acaaatatat aaattgaatc aaaacccaaa aaattgggat ataacaaaaa
      121 taattgtacc ttttcagcag attatcctat tcgatagaat aaagatattc ccccatgtaa
      181 gcgatttcct ttatacgcat agattgggag aaactattat cctatcaaag gagggcaatt
      241 atgaaaccat ctcactttac ggagaaacgg tttatgaaaa aggtacttgg tttgttctta
      301 gtggttgtga tgctggctag tgttggcgtg ttgccaactt caaaagttca agcagctggg
      361 accacagtta cctcaatgga gtacttctca ccagcagatg gacctgttat ttcaaaatct
      421 ggcgttggca aagccagcta cggatttgtt atgcctaagt tcaatggagg ctccgctacg
      481 tggaacgatg tttacagtga cgtgggtgtc aatgtgaaag tgggtaacaa ctgggttgat
      541 attgatcaag ccggaggtta tatctataac caaaactggg ggcactggag cgatggcggt
      601 ttcaatggct attggttcac cctttccgca acaaccgaaa ttcaactgta ctccaaagcg
      661 aatggtgtta agcttgaata tcaacttgta ttccaaaaca ttaacaaaac aaccatcaca
      721 gcgatgaatc cgacacaagg gccgcaaatt acagcaagtt tcacaggcgg tgcaggcttt
      781 acatatccaa cgttcaacaa tgattctgcg gtaacctatg aagccgtagc ggatgatttg
      841 aaggtgtatg taaaacctgt aaacagcagc tcatggattg atattgacaa taatgcagcc
      901 agcggctgga tttatgatca caacttcggc caattcaccg acggtggagg aggttactgg
      961 tttaacgtaa cggaatcgat caacgtcaaa ttggaatcaa agacttcttc ggctaacctt
     1021 gtttatacaa ttacgtttaa tgaacctaca agaaattcat atgtcattac gccatacgaa
     1081 ggaacaacct tcacagcaga tgcgaatggt tccattggaa tcccgcttcc caaaattgat
     1141 gggggtgcgc caatcgccaa agaactgggc aatttcgtat atcagattaa catcaatggg
     1201 caatgggtgg atttgagtaa ctccagtcag agcaagtttg catactcggc taatggctac
     1261 aacaatatgt ctgatgccaa ccagtggggg tactgggccg attatatcta tggcctttgg
     1321 ttccagccaa tccaggaaaa tatgcaaatc cgtatcggat atccgctgaa cggacaggcg
     1381 ggtggaaata ttggcaacaa cttcgtcaac tataccttca tcggtaatcc aaatgctccg
     1441 cgtccggatg tatccgatca agaggatatc tcgatcggaa caccaactga cccggctatt
     1501 gcgggcatga atcttatctg gcaggatgaa tttaacggaa ctacactgga tacaagtaaa
     1561 tggaactatg aaacaggtta ttatctcaat aacgatcccg ctacttgggg atggggaaat
     1621 gcagagttgc agcactacac aaacagcaca caaaatgtat atgtacagga cgggaagctg
     1681 aatatcaaag ccatgaacga tagcaaatct ttcccgcagg atccgaatcg gtatgcacag
     1741 tattcttcag gtaagattaa caccaaggat aaactctcct tgaagtacgg cagagtagat
     1801 tttcgtgcca agcttcctac aggggatggc gtttggccag cgctgtggat gcttccaaaa
     1861 gattctgtat atggcacatg ggctgcatcg ggtgaaatcg atgttatgga agcaagagga
     1921 cgtcttccag ggtctgtaag cggtaccata cactttggcg gacaatggcc cgtgaaccag
     1981 tcttcgggtg gcgattatca cttcccagaa gggcaaactt ttgccaatga ttatcatgta
     2041 tactcggtag tctgggaaga ggacaatatt aaatggtatg tcgacggcaa gtttttctat
     2101 aaagtcacta accagcagtg gtattccaca gctgcaccga ataatccgaa tgctcctttc
     2161 gatgagccgt tctacctcat tatgaacttg gcagtcggcg gaaacttcga cggaggccgt
     2221 actccgaacg cgtccgatat cccggcaact atgcaagtgg attatgtacg tgtgtataaa
     2281 gaacagtaat aaaacagccg tttccgcgat tggggt
//
LOCUS       CHKAGLOB     1737 bp ds-DNA             VRT       10-JUL-1990
DEFINITION  Chicken alpha-globin gene, alpha-5HR DNA fragment.
ACCESSION   M34465
KEYWORDS    alpha-globin.
SOURCE      Chicken fibroblast DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 1737)
  AUTHORS   Kalandadze,A.G., Bushara,S.A., Vassetzky,Y.S.Jr. and Razin,S.V.
  TITLE     Characterization of DNA pattern in the site of permanent attachment
            to the nuclear matrix located in the vicinity of replication origin
  JOURNAL   Biochem. Biophys. Res. Commun. 168, 9-15 (1990)
  STANDARD  simple staff_review
BASE COUNT      356 a    589 c    447 g    345 t
ORIGIN      
        1 gcggcacggg gcggccccgg gcccggcgcg cacttactgg ccttggcggc ggggtgctcg
       61 gcgccgcgct ggaaggggaa gcggaagagc agcttgttgc cgcggctgcc cgagctcaca
      121 aggataacgc tgatggggct ggtgctctcg cccatgccgc cgcgccacag cgagcaccgg
      181 gcgggcaacg acggacgcgg ctccgcggaa ggcggcccgg cccgcgcgac ttccgcttcc
      241 gcgcctccgc cgccgccgcc ggttcccccg ggccgcggcc gagcggcggg gcggagctgc
      301 gggcacagcg ctccccgggc aggtcgcgct cagaggccgg gccgccgctt cagcgccgtg
      361 ccctcagtgc ggcccagcgc cgtgcccgca gcgctgccca cacgccctcg gggtgcccca
      421 cggctgctgc ttgctcccgg tgcccgccgt tcctcccagc acctcgcagt gcagccgtgc
      481 ctgaagtgca gcccagcacc tcacacctca gccccgggct cccagtacga ccagcaggtc
      541 acgttggagt ctcttgtcct caagactgcg cagtgtctca cctttgagcc ttgtgccccc
      601 cattcagccc agcacatcac actgtagccc ttacaccctc accacagcac agcacctcac
      661 gttcaggccc cagcacgtca agatggagcc ctgtgccccc agacagccag catggaacca
      721 tcaaatcctt agagttggaa gatgtctgaa tccttgtgcc cccagttcag cccggcacct
      781 ctcacacccc actcaacact cttcagccaa gagcctacag ctcaacccag cacctcacgc
      841 cacccagcag cactcccgcc atcagcccag tgcccccagt ccggatcggt acctctcatg
      901 cccatgcaca gtgcaccaga tcagcctagc accactagtt cattccagca cctcacgtgc
      961 ccacagccaa ccactccagc acccccggtg ccctagtcac acctctccgc tgcctcaagg
     1021 ttcattccca cctcttccca catcccctca caccccctca ttattttcat gtctcgcaat
     1081 ctcctttggt cacttggagt cattcagtta tgacaactcc agaactagaa gctgctggcc
     1141 agcagcaagt gccacaaact gtgttccccc ggcagctctt ctggctcatt tgtcttattg
     1201 tgtgtccagc tgagatcaga aagctatcgg caattatgtc agaggatggc ccagtttttc
     1261 acatagattt gtctgtattt gatagcaata tttagtattt ggtgctccga gtatccccac
     1321 tctggatttt tctctgcaag attcttccct tggacttcag gcagagaagg ggactgaaag
     1381 ggagatgagc acccgcagtg agggcttaat ctgcacggcc attctctgca aggcaggtga
     1441 taacaactga agcaagagaa gctgtcattg aggggagaga gttgttggtg agcgattaaa
     1501 gagcagtcac attatcacag cagagcattc atcgtggccc agtgctgggg agctacgtta
     1561 gaattgccca gtgtgtctgc ttcccagcat aactatgcat tcttcaatta aaaaactgca
     1621 ggcatgtttg ccatttccag ctctcggaga tgagttaaag caaagctctg gaaacctgca
     1681 agctctctga gtgctagtag aatgaaatga aagaataaag ccagatatag attctgc
//
LOCUS       HUMPDHBA     1484 bp ss-mRNA            PRI       10-JUL-1990
DEFINITION  Human pyruvate dehydrogenase beta-subunit mRNA, complete cds.
ACCESSION   M34479
KEYWORDS    pyruvate dehydrogenase.
SOURCE      Human liver, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1484)
  AUTHORS   Ho,L. and Patel,M.S.
  TITLE     Cloning and cDNA sequence of the beta-subunit component of human
            pyruvate dehydrogenase complex
  JOURNAL   Gene 86, 297-302 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1     1080     pyruvate dehydrogenase beta-subunit precursor
                                 /hgml_locus_uid="LU0223C" /nomgen="PDHB"
                                 /map="unassigned"
    sigp          1       90     pyruvate dehydrogenase beta-subunit signal
                                 peptide
    matp         91     1077     pyruvate dehydrogenase beta-subunit
    mRNA    <     1     1484     pyruvate dehydrogenase beta-subunit mRNA
BASE COUNT      414 a    287 c    369 g    414 t
ORIGIN      
        1 atggcggcgg tgtctggctt ggtgcggaga ccccttcggg aggtctccgg gctgctgaag
       61 aggcgctttc actggaccgc gccggctgcg ctgcaggtga cagttcgtga tgctataaat
      121 cagggtatgg atgaggagct ggaaagagat gagaaggtat ttctgcttgg agaagaagtt
      181 gcccagtatg atggggcata caaggttagt cgagggctgt ggaagaaata tggagacaag
      241 aggattattg acactcccat atcagagatg ggctttgctg gaattgctgt aggtgcagct
      301 atggctgggt tgcggcccat ttgtgaattt atgaccttca atttctccat gcaagccatt
      361 gaccaggtta taaactcagc tgccaagacc tactacatgt ctggtggcct tcagcctgtg
      421 cctatagtct tcaggggacc caatggtgcc tcagcaggtg tagctgccca gcactcacag
      481 tgctttgctg cctggtatgg gcactgccca ggcttaaagg tggtcagtcc ctggaattca
      541 gaggatgcta aaggacttat taaatcagcc attcgggata acaatccagt ggtggtgcta
      601 gagaatgaat tgatgtatgg ggttcctttt gaatttcctc cggaagctca gtcaaaagat
      661 tttctgattc ctattggaaa agccaaaata gaaaggcaag gaacacatat aactgtggtt
      721 tcccattcaa gacctgtggg ccactgctta gaagctgcag cagtgctatc taaagaagga
      781 gttgaatgtg aggtgataaa tatgcgtacc attagaccaa tggacatgga aaccatagaa
      841 gccagtgtca tgaagacaaa tcatcttgta actgtggaag gaggctggcc acagtttgga
      901 gtaggagctg aaatctgtgc caggatcatg gaaggtcctg cgttcaattt cctggatgct
      961 cctgctgttc gtgtcactgg tgctgatgtc cctatgcctt atgcaaagat tctagaggac
     1021 aactctatac ctcaggtcaa agacatcata tttgcaataa agaaaacatt aaatatttag
     1081 tttggacttg aatatcaagt cgttgaaatt tatttgaaat acttgctggc actgcacctg
     1141 gatttgtact gcaagacctg actattcata aaggaaaacg atttctaaag caacagcagg
     1201 tatttttgta cagggaagtt taaatgtgtt tgtgtatgga aaactctcca ctctcctccc
     1261 ctagatgcca tgcttccttt tgtctgttac ggttgccatg ttctttgaat aacaaattat
     1321 atcacatttt atcctctctc accacaagga caaagtatgg atgtggcaga gtcctgatga
     1381 aagatgtatc caaacaagat aacttatatg tataaaatta aagcatataa tacacattta
     1441 ctgttagttt gttttgataa ggaataaagg aatttctaac atga
//
LOCUS       LEIGP63A     3047 bp ds-DNA             INV       10-JUL-1990
DEFINITION  L.chagasi major surface glycoprotein (gp63) gene, complete cds.
ACCESSION   M28527
KEYWORDS    glycoprotein; protease; surface antigen.
SOURCE      L.chagasi (isolate MHOM/BR/82/BA-2C1a) DNA, clones pLc63-[1 and 2].
  ORGANISM  Leishmania chagasi
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 3047)
  AUTHORS   Miller,R.A., Reed,S.G. and Parsons,M.
  TITLE     Leishmania gp63 molecule implicated in cellular adhesion lacks an
            Arg-Gly-Asp sequence
  JOURNAL   Mol. Biochem. Parasitol. 39, 267-274 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by M.Parsons, 03-OCT-1989.
FEATURES       from  to/span     description
    pept        496     2295     gp63 protein
BASE COUNT      423 a   1195 c    925 g    504 t
ORIGIN      
        1 ggtacctccc ccaccccggc cctccggccc cgcgcccccg cctctgtgct gtgccgtgcc
       61 ctggactccc tctcctccac ctctcctcgc ttctgtcgct ccgcctcccc gagcgacccg
      121 cggcgccgcg cggtgcgtgt ctggtgcggc gagtggcggg gtgccgtccc ccctcgctgc
      181 ggcacccctc cccgcgccac cacggaggca cccgtgagca cgccaacaga ccaacgcact
      241 cacgtcccca tcgtcctccc ccctccccgc accagcaccg acgtgctctc cgctctccct
      301 ccctcaccac ctcccctcgc accctccctt gccttctccc tgtcccctcc ctccccagat
      361 ccgccaacgc atccgatccc gctacacccc cctctccccc gcccacacgc acgcgcacac
      421 cgccgtgcac aagccctcgc cctcgccctc gccaccacac cccactgccc acagcgcccc
      481 cgcgcctgca gagccatgtc cgtcgacagc agcagcacgc accggcaccg cagcgtcgcc
      541 gcgcgcctgg tgcgcctcgc ggctgccggc gccgcagtca tcgctgctgt cggcaccgcg
      601 gccgcgtggg cacacgccgg tgcggtgcag caccgctgca tccacgacgc gatgcaggca
      661 cgcgtgcggc agtcggtggc gcgccaccac acggcccccg gcgccgtgtc cgcggtgggc
      721 ctgccgtacg ttactctcga caccgcggcc gccgccgatc gccggccggg cagcgcgccc
      781 acagtcgtgc gcgccgcgaa ctggggcgcg ctgcgcatcg ccgtctccac cgaggacctc
      841 accgaccccg cctaccactg cgctcgcgtc gggcagcaca tcaagaggcg acttggcggc
      901 gtcgacatat gcacggccga ggacatcctc accgacgaga agcgcgacat cctggtcaag
      961 cacctcatcc cgcaggcgct gcagctgcac acggagcggc tgaaggtgcg gcaggtgcag
     1021 gacaagtgga aggtgacggg catgggcgac gatgtgtgca gcgacttcaa ggtgccgccg
     1081 gcgcacatca ccgatggcct gagcaacacc gacttcgtga tgtacgtcgc ctccgtgccg
     1141 agcgaggagg gtgtgctggc gtgggccacg acctgccagg tgttctctga cggccatcca
     1201 gccgtgggcg tcatcaacat ccccgcggcg aacattgcgt cgcggtacga ccagctggtg
     1261 acgcgtgtcg tcacgcacga gatggcgcac gcgctcggct tcagcgtcgg cttcttcgaa
     1321 ggcgcccgca tcctggagag catttcgaac gttcggcaca aggacttcga tgttcccgtg
     1381 atcaacagca gcacggcggt ggcgaaggcg cgcgagcagt acggctgcga caccttggag
     1441 tatctggaga tcgaggacca gggcggtgcg ggctccgccg ggtcgcacat caagatgcgc
     1501 aacgcgcagg acgagctcat ggcgcctgcc gcagctgccg ggtactacag cgccctgacc
     1561 atggccatct tccaggacct cggcttctac caggcggact tcagcaaggc cgaggtgatg
     1621 ccgtggggcc ggaacgccgg ctgcgccttc ctcagcgaga agtgcatgga gcggaacatc
     1681 acgaagtggc cggcgatgtt ctgcaatgag aacgaggtga ctatgcgctg ccccaccagt
     1741 cgtctcagcc ttggaaagtg cggtgttacc cgtcacccgg accttccgcc gtactggcag
     1801 tacttcacgg acccgtccct cgccggcatc tccgccttca tggactgctg ccctgtcgtg
     1861 gagccctacg gtgatggcag ctgcgcacag cgtgcgtctg aagcgggcgc accattcaaa
     1921 ggcttcaacg tcttctccga cgcggcgcgc tgcatcgatg gcgccttcag gccgaagacg
     1981 agtcacggca taatcaagtc gtacgccgga ctgtgcgcca acgtgcggtg cgacacggcc
     2041 acgcgcacgt acagcgtgca ggtgcacggc ggcagcggct acgccaactg cacgccgggc
     2101 ctcagagttg agctgagcac cgtgagcagc gccttcgagg agggcggcta catcacgtgc
     2161 ccgccgtacg tggaggtgtg ccagggcaac gtgcaggctg ccaaggacgg cggcaacgcc
     2221 gcggctggtc gccgtggtcc gcgcgccgcg gcgacggcgc tgctggtggc cgcgctgctg
     2281 gccgtggcgc tctagacggt ggataggacg ggtgctgatg gcgtgtcccc tgctcccccc
     2341 tccctccctc cctctcgttg tctctcggaa gagctccacg ctgtcctttc atctcctcgc
     2401 ctgttctacg cttgcttcgc tgcgccgctg caccgggccg gtcctcgccg accctcgcct
     2461 gccctctccc cctcctctct cccgccaccc caccccgctc cccgctgcgc acggtgcctg
     2521 tgcgcttgga gagaggtgca gcagcgcgcg ggagctgagg gagggagggg gtgtcgtgcg
     2581 cgggtgcgca tgccttcttt cacttcctta tttgtcttct atttgttccc tgcggcaccc
     2641 gcacaccccc acccgctggc ggccatccgc ggcatccgcg ggtgcgtgcg cggtgtgtct
     2701 gccttctctc tcctcctttc gctctgtttc cctgtcctcg gactccccgg cgccagcgtg
     2761 agctccgcag tcaccgccca cccggcgctc cggcgcggtc agcgccaccc caccccaccc
     2821 cctctccccc attcgtgcgt gtctcttctc gctttttctg tttcctcttg tagcagggcg
     2881 cgccgcgttg tgggagcggt ggcggcctct gcgcgcggac ggcatgcagg tcggccggga
     2941 gagtctcccg ccagcgcccg cgcagcgcag agccgtcgcc cacccaccgt ctcctcccac
     3001 cttcgcatgc cgccgcacta ggtgcacgtc gtcggcacga ccaccga
//
LOCUS       PFATUBB      2833 bp ds-DNA             INV       10-JUL-1990
DEFINITION  P.falciparum beta-tubulin gene, complete cds.
ACCESSION   M28398
KEYWORDS    tubulin.
SOURCE      P.falciparum (Brazilian strain 7G8, isolate 78G) nonsynchronous
            blood stage DNA, clone 768.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 2833)
  AUTHORS   Sen,K. and Godson,G.N.
  TITLE     Isolation of alpha and beta-tubulin genes of Plasmodium falciparum
            using a single oligonucleotide probe
  JOURNAL   Mol. Biochem. Parasitol. 39, 173-182 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.N.Godson, 27-SEP-1989.
FEATURES       from  to/span     description
    pept        654      749     beta-tubulin, exon 1
               1112     2064     beta-tubulin, exon 2
               2228     2516     beta-tubulin, exon 3
    IVS         750     1111     TUBB intron A
    IVS        2065     2227     TUBB intron B
BASE COUNT     1018 a    311 c    455 g   1049 t
ORIGIN      
        1 aattcctagt ttatttaatt taaaaattaa aagatcgaat gctcaacatt ttaaaaagaa
       61 atctgtgaaa catatcttaa caagaaatgg tgtaacaaaa gaaacaatat taaatgataa
      121 attaccaaag ataaatgatg aaattgacag aacatataat ggacacaaaa tggatgaaaa
      181 tttacaggat aaacaaaaaa ggaatcatgg agtaaatata aaattaataa atgaatatga
      241 aaatatcatg tgaagaataa attctcaaaa tcattgattg tatgacaaga ttcaagaatt
      301 ggttatataa aaatatattt aggaaaagta attttgggtc atatgtatca acatttacag
      361 gtgtatttgg aggtgctgca gctgttagct gtttctgcca taagtggagc ttgtataact
      421 aaatttagtg ttacattggt tccggtattt gcatgttttg ggggtgtctt tgcgattatt
      481 ataatattat taatattagg aacatggatg cttgttacat ggttatggca acacaaagaa
      541 gtagtatttt tttttttttt taatttttac ttaatatatc ctcttacaat ataaaatatt
      601 tatatattta aaaaaaaaag aaaaaatttt ctttgagatt attttattaa agaatgagag
      661 aaattgttca tattcaagct ggccaatgtg gaaatcaaat aggtgcaaag ttttgggaag
      721 tcatttctga tgagcatgga atagatccag taagtttaaa aaaaaaatat atttatttat
      781 atgaatctgt aaacatatgt atatttatat atatatatat atatatatgg aagaataatt
      841 ttgtgtgtat aatttggggt ccttcccctt tattgtattc tataaatgcc tcctttatat
      901 tgataataat ttatatatgt aaacctttaa tgacgaggct tatatataaa aaccttagat
      961 attataaata aatgtatatt atgtacatat gacgatatcg ctctctctat atatatatat
     1021 atatatatat atatatattt atttatttat atatttattt atttatttat ttatttattt
     1081 tttttttttt tttttatttt atttttttta gagtggtacc tatagtgggg acagtgactt
     1141 acagttagaa agagttgacg ttttttacaa cgaagcaaca ggaggtagat atgttccaag
     1201 agctatattg atggacttgg aacctggtac tatggatagt gttcgtgctg gcccctttgg
     1261 tcaattattt cgtccagata attttgtgtt tggtcaaaca ggtgcaggaa ataattgggc
     1321 taaaggacat tatactgaag gtgctgaatt gatagatgca gttttagatg tgcttagaaa
     1381 agaagcagaa ggttgtgatt gtttacaagg atttcagatt actcattcat taggtggtgg
     1441 tacaggtagt ggtatgggta ctttgttgat tagtaaaata agagaggagt atcctgatcg
     1501 tattatggaa acattttctg tatttccatc accaaaagtt tctgatactg ttgttgaacc
     1561 atataatgct acattatcag tccatcagtt ggttgaaaat gctgatgaag ttcaagttat
     1621 cgataatgaa gctttatatg acatatgttt taggactctt aaattaacaa caccaacata
     1681 tggagattta aatcaccttg tatcagctgc aatgtcaggt gtaacctgtt cgttaagatt
     1741 tcctggtcaa cttaacagtg acttaagaaa attagctgtt aatttgatcc cattcccacg
     1801 tttacatttc tttatgtacg ggtttgctcc tttaactagt agaggcagtc aacaatacag
     1861 agccttaact gtgccggagt taacacaaca aatgttcgac gcaaaaaata tgatgtgcac
     1921 aagtgatcca agacatggaa gatatttaac ggcatgtgct atgtttagag gaagaatgtc
     1981 cacaaaggaa gttgacgaac aaatgttaaa cgttcaaaat aaaaactcat cttattttgt
     2041 cgaatggatt cctcacaaca caaagtaaga aggaacaatt gatactagta tgcatgtttt
     2101 tttgtttata tgtatttata tatatatata tatatatgta ttcatttata tattttgaaa
     2161 tatacatttt acatataaat tttttttttt tctttttctt tttttttttt tttgtttttt
     2221 tctttagatc aagtgtttgt gatattccac cattgggatt aaaaatggct gttacttttg
     2281 taggaaactc aaccgccatt caagaaatgt ttaaaagagt ttctgatcaa tttactgcta
     2341 tgtttagaag aaaagccttt ttgcactggt acaccggaga aggtatggac gagatggaat
     2401 ttacagaagc tgaatcaaat atgaatgatt tagtttcaga atatcaacaa tatcaagatg
     2461 ctacagcaga agaggaagga gaatttgaag aagaagaagg agacgtagaa gcctaaatct
     2521 atttatattt atgaaaatat atacatatta tatatatatg tatatgtaat taacaagaat
     2581 aaaaaataaa aaataaaaaa aaaataaaat aaaaaaataa aaatacataa taaaaaagta
     2641 taaaataaat atctaatcat taattatata taacaatata atttaactct tttttttttt
     2701 attattattg aagttatgtt cgggtatata taacatatat ataaattata tatatgttgc
     2761 agtttctttt tttttttttt tttttttttt tcttatcatt tgattttaca ctcacatata
     2821 tatgacatat ata
//
LOCUS       RATADOME1    2513 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Rat S-adenosylmethionine decarboxylase pseudogene, complete cds.
ACCESSION   M34463
KEYWORDS    S-adenosylmethionine decarboxylase; pseudogene.
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2513)
  AUTHORS   Pulkka,A., Keraenen,M.-R., Salmela,A., Salmikangas,P.,
            Ihalainen,R. and Pajunen,A.
  TITLE     Nucleotide sequence of rat S-adenosylmethionine decarboxylase cDNA.
            Comparison with an intronless rat pseudogene
  JOURNAL   Gene 86, 193-199 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps     436     1379     pseudo-S-adenosylmethionine decarboxylase
BASE COUNT      771 a    478 c    518 g    746 t
ORIGIN      
        1 tctactaaac atgataaaga atttaagaaa tccatctctt cacttccagt ctatatatct
       61 ttgagatgct attcaggata ctgagttaaa aaataagatt aggcttacac agcatggcgc
      121 ggaacattag ctaactctca ctcaactctg acaagaaagc agcagactac atgagactga
      181 actgtatctg cctttagttc caacagactc acgttcaact tttcttcacg aaaacagcca
      241 gggaaatttt attagtcctt ttttaaaaat agttaatata aaattataac aacaacagca
      301 gcagcaacaa caacaaggac cctgaactta gtaacacacg tggaacaaac cgtagcagcg
      361 actggagcag tgggagaaga gatttaattt aggtgatttt tttggatttg ttggttgttg
      421 gtcagcctca cagtgatgga agttgcacat atttttttga agggactgag aagctgctag
      481 aggtctggtt ttccagacag cagtccgacg ccagccaggg acatggggat cttcatacca
      541 tcccaagatc tcagtgggat gtgcttttga ggatgtgcag tcctcaacca taagtacgac
      601 aaagatgcac aagcaggaag cttacacact cagtgagagt agcatgttta tacatttcat
      661 gtgatactac cctcttactg aaagctctgg tttccaggtt gaagctcgct agggattacc
      721 gtgggttaga ctcaattctt ttattctcat aagaatttca tgaagccctc tctccaaggg
      781 tacccacacc gaaatttcca cgaagaaatc gaatttctta atgcagtttt cccaaatgga
      841 gcagcatatt gtatgggaca aacgaattct gactgttggt acttatatac ttggatctcc
      901 agagagccga gtcatcaaac agtcagatca accctgggaa ttctgatgag tgagcttgac
      961 ctagcagtta cggaccagtt ctattgctgc aaaggatgtc actcgtgaga gtgaattcat
     1021 gacctgatat caggtcattg atgacacact gtttaatcct tgcagcttct tgatgaatgg
     1081 aatgaaatcg attggactag tcacatcgct ccagaagcag agttctctta tgttagcttt
     1141 gaaacaaacc taagtgagac atcctatgac agcccgatca ggaaagttgg gaaattcgtc
     1201 aagccaggaa aatttgtgac caccttgttt gttaatcaga gttctaaatg tcgcacaggc
     1261 cattcttcat cccagaagat tgacggtttt aaacatcatg attgccaaag tgctatgctc
     1321 aacgaagata aatgcaatat tgaatgtatc aaatgaaaag aattcagtct ctggtggagg
     1381 gggattggag caaggatgaa tcagcccact aaagaaaact ccatggaaaa gacaggctat
     1441 gcagtgcact ttaatcagct tcacacggtg cctaccatgc cttcactaac taaccaagta
     1501 gtgatagaaa tgtccactaa gtcaaagcag aaatgtaata ctaagcattc tgacctcagt
     1561 aagcaccacc attgccacca ttgccaccaa tttttactaa aggaaatttt gaatcaaatg
     1621 aggatctgta gtttccgtct gttctgaggt cggctgttct ctttggtctt cgtttcacca
     1681 tggcgctcag atgatcaaat gagtagctgc cagagggagg aatctccagg ttacttagcc
     1741 tggagaatgg atgaatggat gaaacagcac aatattatga ctgtttagaa atacaggctt
     1801 tcaagagtcg gcatgttagt ggcatttgta gatactgtgg aatttaagca gcaaagaaca
     1861 aattggacta aatttcctat taattgccct cccactgttt cttggtagtt tctggactgg
     1921 cacatcgatg tttttttttt ttttttcctt ccatatttaa aatgaagcac ttttttagca
     1981 tttctaagca aagaatgcac ttggtttgta atcaagtagt tggaacgctg tctgaatgtt
     2041 tactttatac accatgctga ttgaacgctt cattgaggaa gctttcagtc agttattggt
     2101 ctgattctgt aatgagcaca gcacgtggtt tgaattgcca tttggaggac cagtgcttat
     2161 ttaggctgga tcgcgtaaac cggtagattt tagcttgagg tttgattccc tcaccttata
     2221 aaattaagaa ttctaatgtt gaaaattgca taggtttgtg tgaaacaaag cccagaagag
     2281 aaactgtagg tagactagta atcttgtgta attataggtg agaagtttta gtgccgtaat
     2341 ttctttgttg gcgttggact tttatcagct gaaatgtatt tctgtaccac aatgtaagct
     2401 tcaataaagt ttgcttaatt gtctagtaac attaaaaaat ataagattaa tagaattgat
     2461 ctcaacagta aggaaacaaa actaccttta ttattacata acataatctt tca
//
LOCUS       RATADOMET    3102 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Rat S-adenosylmethionine decarboxylase mRNA, complete cds.
ACCESSION   M34464 M21155 J04048 M21783
KEYWORDS    AdoMet decarboxylase; S-adenosylmethionine decarboxylase.
SOURCE      Rat prostate, cDNA to mRNA, clone pSAMr1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 232 to 1821)
  AUTHORS   Pajunen,A., Crozat,A., Janne,O.A., Ihalainen,R., Laitinen,P.H.,
            Stanley,B., Madhubala,R. and Pegg,A.E.
  TITLE     Structure and regulation of mammalian S-adenosylmethionine
            decarboxylase
  JOURNAL   J. Biol. Chem. 263, 17040-17049 (1988)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3102)
  AUTHORS   Pulkka,A., Keraenen,M.-R., Salmela,A., Salmikangas,P.,
            Ihalainen,R. and Pajunen,A.
  TITLE     Nucleotide sequence of rat S-adenosylmethionine decarboxylase cDNA.
            Comparison with an intronless rat pseudogene
  JOURNAL   Gene 86, 193-199 (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by A.Crozat, 27-OCT-1988.
FEATURES       from  to/span     description
    pept        273     1274     S-adenosylmethionine decarboxylase (EC
                                 4.1.1.50)
    mRNA          1     3102     S-adenosylmethionine decarboxylase mRNA
BASE COUNT      835 a    650 c    724 g    893 t
ORIGIN      
        1 cggggaaagc agcggactac aagagactga actgtatctg cctctatttc caacggactc
       61 acgttcaact ttcgctcacg aaaatagccg ggaaaatttt attagtcctt tttttaaaaa
      121 aagttaatat aaaattatag caaaaaaaaa aaaaggaacc tgaactttag taacacagct
      181 ggaacaatcc gcagcggcgg caggagcggc gggagaagag tttaatttag ttgattttct
      241 gtggttgttg gttgttcgct agtctcacgg tgatggaagc tgcacatttt ttcgaaggga
      301 ccgagaaact gctggaggtc tggttctcca gacagcagtc cgacgcaagc cagggatctg
      361 gggaccttcg taccatccca agatccgagt gggatgtcct tctgaaggat gtgcagtgct
      421 caatcataag tgtgacaaag actgacaagc aggaagctta tgtactcagt gagagtagca
      481 tgtttgtctc caagagacgt ttcattttga agacatgtgg taccaccctc ttactgaaag
      541 cactggttcc cctgttgaag cttgctaggg actacagtgg gtttgactcg attcaaagct
      601 tcttttattc tcgtaagaat ttcatgaagc cttctcacca agggtaccca caccggaatt
      661 tccaggaaga aatcgagttt cttaatgcaa ttttcccaaa cggagcagga tattgtatgg
      721 gacgtatgaa ttctgactgt tggtacctgt acactttgga tctcccagag agccgagtaa
      781 tcaatcagcc agatcaaacc ctggaaattc tgatgagtga gcttgaccca gcagttatgg
      841 accagttcta catgaaagat ggtgttactg caaaggatgt cactcgtgag agtggaattc
      901 gtgacctgat accaggttct gtcattgatg ccacactgtt caatccttgt ggctactcaa
      961 tgaatggaat gaaatcggat ggaacatatt ggactattca catcactcca gaaccagaat
     1021 tttcttatgt tagctttgaa acaaacctaa gtcagacctc ctatgatgac ctgatcagga
     1081 aagttgtgga agtcttcaag ccaggaaaat ttgtgaccac cttgtttgtt aatcagagtt
     1141 ctaagtgtcg cacagtgctt tcttcgcccc agaagattga cggtttcaaa cgtcttgatt
     1201 gccagagcgc tatgttcaac gattacaatt ttgtttttac cagttttgct aagaaacagc
     1261 aacaacagag ttgattagga aaaatgaaaa agaaaaaacg caaaaagaga agacacacag
     1321 gaggtggtgg ctgctttcta gatgttgatc ctgggggcca tgctgaccgt gaccaccacc
     1381 ttgtagctgc agaaagccct aggtgtaatg atagtgtaat cattttgaag tgtatgcatt
     1441 attatatcaa ggagttagat atcttgcatg aatgctctct tctgtgttta ggtgttctat
     1501 gccactcttg ctgtggaact gaagtgcatg tagaaaagaa ctctgactgt atgaatcttt
     1561 acgacacttg tgaaaacgat tcgacttggt ttatgcacag cgtaatattt ctgcaggcat
     1621 cgtccaaaat cccccacaga caaggctttc gtccccatta gatgcggcct cagctgacca
     1681 ttggcgactg ttctatttgc tgccagagtt tttacatcca gttacctcca ctttctagag
     1741 catattctct actaatgttc aaaaccgatt tctacttcat acgggtgtct tatgcaatgg
     1801 caattaaagt tttcttccac aagttgagtc tttgtaagga aatgattcca gttgcttgtt
     1861 ttgtgttcta ctgttttagt aattgctcct gcatttatag tcctatggtt tttcactacc
     1921 cctgatgaag caatacacgg tcacactgtg ggcttacatt gtaatcttca ccccagatgg
     1981 gagctcagag acggtccctt gctcattttt ccctaagatg tagaatgtgg ccttgctatt
     2041 ggcatgccct tctgtggaag ataaatgatg gaagtgaaag tatcccgggg gtgagcaagg
     2101 agaaccaccc catggcagtg atgggcttgg cagtgcactc cgagctctca cagtggagtg
     2161 cccaccatgc cttcactaac tcactgagca gtgataggat gcccaccaag tcagagcaga
     2221 aatctaaccc taaggattct cacctcggta agtgccgcca ttgccaccac tttactaaag
     2281 gaagtttccg ctcagaggag agtctgtact tcccgcctgt cctaatgtca gctgttctct
     2341 ctggtctttc accatggcgt tcagatgctc aaatgaatgg ctgatcggcc gcagggagga
     2401 ctctccgggt tactgggcct ggagaatgga gaaacaggca cggtattctg acagttaatg
     2461 gcaccagaga tgcgggcttt caagagctgg cctgttagtg gcatttttaa gcagaaaaga
     2521 gcaaactaga cgaagttccc tatttattgc cctcccactg tttccttggc agtttctgga
     2581 ctggcgcaat gatgccttgt tccttccgta tttataacga agctaaaaag cgtttctaag
     2641 catggagtct acttggtttg aaatcaagtg gttggaacac tgtctggatt tttactttac
     2701 gcagtgttga ttgaacgctt cgttggggaa gccttcagtc cgcttcatcg gtctgttctg
     2761 taatgagcac agcacaccta gtttgaattg ctgtttggag ggccagtgct tatttgagct
     2821 gggtcttgta acccagtaga ttttggcttg aggtctgact cccccatctt acgaaattaa
     2881 gaattctaat gttggaaatt gcatagggtt tgcgtggaaa aaagcccagg gaaaaaaaaa
     2941 aaaaaacaga aggcggacta gtgatctagt gtgattacag gcggggaagt tttggtgcca
     3001 taatttcttt gttggtgttg gacttttaat cagctgaaat gtatttctgt accacaatgt
     3061 aagcttcaat aaaagtttgc ttaattgtct agtaacatcc ag
//
LOCUS       HUMCD38      1407 bp ss-mRNA            PRI       10-JUL-1990
DEFINITION  Human lymphocyte differentiation antigen CD38 mRNA, complete cds.
ACCESSION   M34461
KEYWORDS    cell surface glycoprotein; lymphocyte differentiation antigen CD38;
            membrane glycoprotein.
SOURCE      Human PHA-treated peripheral blood cell, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1407)
  AUTHORS   Jackson,D.G. and Bell,J.I.
  TITLE     Isolation of a cDNA encoding the human CD38 (T10) molecule, a cell
            surface glycoprotein with an unusual discontinuous pattern of
            expression during lymphocyte differentiation
  JOURNAL   J. Immunol. 144, 2811-2815 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         70      972     lymphocyte differentiation antigen CD38
                                 /hgml_locus_uid="LZ0047A" /nomgen="CD38"
                                 /map="4"
BASE COUNT      381 a    332 c    326 g    368 t
ORIGIN      
        1 ctaaagctct cttgctgcct agcctcctgc cggcctcatc ttcgcccagc caaccccgcc
       61 tggagcccta tggccaactg cgagttcagc ccggtgtccg gggacaaacc ctgctgccgg
      121 ctctctagga gagcccaact ctgtcttggc gtcagtatcc tggtcctgat cctcgtcgtg
      181 gtgctcgcgg tggtcgtccc gaggtggcgc cagacgtgga gcggtccggg caccaccaag
      241 cgctttcccg agaccgtcct ggcgcgatgc gtcaagtaca ctgaaattca tcctgagatg
      301 agacatgtag actgccaaag tgtatgggat gctttcaagg gtgcatttat ttcaaaacat
      361 ccttgcaaca ttactgaaga agactatcag ccactaatga agttgggaac tcagaccgta
      421 ccttgcaaca agattcttct ttggagcaga ataaaagatc tggcccatca gttcacacag
      481 gtccagcggg acatgttcac cctggaggac acgctgctag gctaccttgc tgatgacctc
      541 acatggtgtg gtgaattcaa cacttccaaa ataaactatc aatcttgccc agactggaga
      601 aaggactgca gcaacaaccc tgtttcagta ttctggaaaa cggtttcccg caggtttgca
      661 gaagctgcct gtgatgtggt ccatgtgatg ctcaatggat cccgcagtaa aatctttgac
      721 aaaaacagca cttttgggag tgtggaagtc cataatttgc aaccagagaa ggttcagaca
      781 ctagaggcct gggtgataca tggtggaaga gaagattcca gagacttatg ccaggatccc
      841 accataaaag agctggaatc gattataagc aaaaggaata ttcaattttc ctgcaagaat
      901 atctacagac ctgacaagtt tcttcagtgt gtgaaaaatc ctgaggattc atcttgcaca
      961 tctgagatct gagccagtcg ctgtggttgt tttagctcct tgactccttg tggtttatgt
     1021 catcatacat gactcagcat acctgctggt gcagagctga agattttgga gggtcctcca
     1081 caataaggtc aatgccagag acggaagcct ttttccccaa agtcttaaaa taacttatat
     1141 catcagcata cctttattgt gatctatcaa tagtcaagaa aaattattgt ataagattag
     1201 aatgaaaatt gtatgttaag ttacttcctt tagagcacaa tggatctcga gggatcttcc
     1261 atacctacca gttctgcgcc tgcgagtcgc ggccgcatct agaggatctt tgtgaaggaa
     1321 ccttacttct gtggtgtgac ataattggac aaactaccta tagagattta aagctctaag
     1381 gtaaatataa aatttttaag tgtataa
//
LOCUS       MUSCD28      1492 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse glycoprotein CD28 mRNA, complete cds.
ACCESSION   M34563
KEYWORDS    glycoprotein CD28.
SOURCE      Mouse lymphoma T cell line EL4, cDNA to mRNA, clone lambda-SSD1.5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1492)
  AUTHORS   Gross,J.A., St John,T. and Allison,J.P.
  TITLE     The murine homologue of the T lymphocyte antigen CD28: Molecular
            cloning and cell surface expression
  JOURNAL   J. Immunol. 144, 3201-3210 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         57      713     glycoprotein CD28 precursor
    sigp         57      113     glycoprotein CD28 signal peptide
    matp        114      710     glycoprotein CD28
BASE COUNT      401 a    355 c    332 g    404 t
ORIGIN      
        1 acacactctg ccttgctcac agaggagggg ctgcagccct ggccctcatc agaacaatga
       61 cactcaggct gctgttcttg gctctcaact tcttctcagt tcaagtaaca gaaaacaaga
      121 ttttggtaaa gcagtcgccc ctgcttgtgg tagatagcaa cgaggtcagc ctcagctgca
      181 ggtattccta caaccttctc gcaaaggaat tccgggcatc cctgtacaag ggcgtgaaca
      241 gcgacgtgga agtctgtgtc gggaatggga attttaccta tcagccccag tttcgctcga
      301 atgccgagtt caactgcgac ggggatttcg acaacgaaac agtgacgttc cgtctctgga
      361 atctgcacgt caatcacaca gatatttact tctgcaaaat tgagttcatg taccctccgc
      421 cttacctaga caacgagagg agcaatggaa ctattattca cataaaagag aaacatcttt
      481 gtcatactca gtcatctcct aagctgtttt gggcactggt cgtggttgct ggagtcctgt
      541 tttgttatgg cttgctagtg acagtggctc tttgtgttat ctggacaaat agtagaagga
      601 acagactcct tcaagtgact accatgaaca tgactccccg gaggcctggg ctcactcgaa
      661 agccttacca gccctacgcc cctgccagag actttgcagc gtaccgcccc tgacagggac
      721 ccctatccag aagcccgccg gctggtaccc gtctacctgc tcatcatcac tgctctggat
      781 aggaaaggac agcctcatct tcagccggcc actttggacc tctactgggc caccaatgcc
      841 aactatttta gagtgtctag atctaacatc atgatcatct tgagactctg gaatgaatga
      901 cagaagcttc tatggcagga taaagtctgt gtggcttgac ccaaactcaa gcttaataca
      961 tttattgact tgattgggga agttagagta gagcaatcaa aaagatcatt cattcagcct
     1021 tgggaagtca atttgcaggc tcctggatga gccctgcccc gttttcactt gccagcacat
     1081 ttcagtcatg tggtgtgata gccaaagatg ttttggacag agaagaaagg atagaaaaac
     1141 cttctctttg gctaagttgg tgtttggggt ggggataggt tagagtatag tacttaacta
     1201 tttgaaaaat aatgaaaaca cttttttcac tcatgaaatg agccacttag ctcctaaata
     1261 gtgttttcct gttagtttag aaagttgtgg acatattttt ttaatgattt ctgaccattt
     1321 ttaatcacat tgactcatgg aatggcctca aagcaccccc cagtgcttct ttcctcattc
     1381 ccggtcatgg gaactcagta ttattaatag tcacaacatg atttcagaac tagatagccc
     1441 tcccacacca agaagaatgt gagaggaagt aaggtcactt tatgtaaaaa cg
//
LOCUS       MUSIGHAAU     294 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse Ig gamma-chain (anti-insulin Ab 123) mRNA V region, partial
            cds.
ACCESSION   M34523
KEYWORDS    gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene;
            variable region.
SOURCE      Mouse (BALB/c), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 294)
  AUTHORS   Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
  TITLE     V-H and V-L gene usage by murine IgG antibodies that bind
            autologous insulin
  JOURNAL   J. Immunol. 144, 3091-3098 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   294     Ig gamma-chain V-region (AA at 1)
BASE COUNT       83 a     61 c     77 g     73 t
ORIGIN      
        1 caggtccagc tgcagcagtc tgggccagag gtggtgaggc ctggggtctc agtgaagatt
       61 tcctgcaagg gttccgacta cacattcact gattatgcta tgcactgggt gaagcagagt
      121 catgcaaaga gtctagagtg gattggagtt attagtactt acaatggtaa tacaaactac
      181 aaccagaagt ttaagggcaa ggccacaatg actgtagaca aatcctccag cacagcctat
      241 atggaacttg ccagattgac atctgaggat tctgccatgt attactgtgt acgt
//
LOCUS       MUSIGHAAV     294 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse Ig gamma-chain (anti-insulin Ab 126) mRNA V region, partial
            cds.
ACCESSION   M34524
KEYWORDS    gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene;
            variable region.
SOURCE      Mouse (BALB/c), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 294)
  AUTHORS   Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
  TITLE     V-H and V-L gene usage by murine IgG antibodies that bind
            autologous insulin
  JOURNAL   J. Immunol. 144, 3091-3098 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   294     Ig gamma-chain V-region (AA at 1)
BASE COUNT       83 a     69 c     77 g     65 t
ORIGIN      
        1 gaggtccagc tgcaacagtc tggacctgag ctggtgaagc ctggggcttc agtgaagata
       61 tcctgcaaga cttctggata cacattcact gaatacacca tgcactgggt gaagcagagc
      121 catggaaaga gccttgagtg gattggaggt attaatccta acaatggtgg ttctaactac
      181 aaccagaagt tcaagggcaa ggccacattg actgtagaca agtcctccag cacagcctac
      241 atggagctcc gcagcctgac atctgaggat tctgcagtct attactgtgc aaga
//
LOCUS       MUSIGHAAW     294 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse Ig gamma-chain (anti-insulin Ab 125) mRNA V region, partial
            cds.
ACCESSION   M34525
KEYWORDS    gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene;
            variable region.
SOURCE      Mouse (strain Balb/c), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 294)
  AUTHORS   Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
  TITLE     V-H and V-L gene usage by murine IgG antibodies that bind
            autologous insulin
  JOURNAL   J. Immunol. 144, 3091-3098 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   294     Ig gamma-chain V-region (AA at 1)
BASE COUNT       83 a     66 c     74 g     71 t
ORIGIN      
        1 cagatccagt tggtgcagtc tggacctgaa ctgaagaagc ctggagagac agtcaagatc
       61 tcctgcaagg cttctggtta taccttcaca gactattcaa tgcactgggt gaagcaggct
      121 ccaggaaagg gtttaaagtg gatggactgg ataaacactg agactggtgt gccaacatat
      181 gcagatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat
      241 ttgcagatca acgacctcaa aaatgaggac acggctacat atttctgtac taga
//
LOCUS       MUSIGHAAX     294 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse Ig gamma-chain (anti-insulin Ab 127) mRNA V region, partial
            cds.
ACCESSION   M34526
KEYWORDS    gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene;
            variable region.
SOURCE      Mouse (strain Balb/c), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 294)
  AUTHORS   Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
  TITLE     V-H and V-L gene usage by murine IgG antibodies that bind
            autologous insulin
  JOURNAL   J. Immunol. 144, 3091-3098 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   294     Ig gamma-chain V-region (AA at 1)
BASE COUNT       81 a     80 c     61 g     72 t
ORIGIN      
        1 gatgtgcagc ttcaggaggt aggacctgac ctggtgaaac cttctcagtc actttcactc
       61 acctgcactg tcactggcta ctccatcacc agtggttata gctggcactg gatccggcag
      121 tttccaggaa acaaactgga atggatgggc tacatacact acagtgatag ctctaactac
      181 aacccatctc tcaaaagtcg aatctctatc actcgagaca catccaagaa ccagttcttc
      241 ctgcagttga attctgtgac tactgaggac acagccacat attactgtgc aagg
//
LOCUS       MUSIGKABI     300 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse Ig kappa-chain (anti-insulin Ab 123) mRNA V region, partial
            cds.
ACCESSION   M34527
KEYWORDS    immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain Balb/c), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
  TITLE     V-H and V-L gene usage by murine IgG antibodies that bind
            autologous insulin
  JOURNAL   J. Immunol. 144, 3091-3098 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   300     Ig kappa-chain V-region (AA at 1)
BASE COUNT       74 a     86 c     70 g     70 t
ORIGIN      
        1 caaattgttc tcacccagtc tccagcaatc atgtctgcat ctccagggga gaaggtcacc
       61 atgacctgca gtgccagctc aagtgtaagt tacatgcact ggtaccagca gaagtcaggc
      121 acctccccca aaagatggat ttatgacaca tccaaactgg cttctggagt ccctgctcgc
      181 ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagcagcat ggaggctgaa
      241 gatgctgcca cttattactg ccagcagtgg agtagtaaac cacccatcac gttcggtgct
//
LOCUS       MUSIGKABJ     300 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse Ig kappa-chain (anti-insulin Ab 126) mRNA V region, partial
            cds.
ACCESSION   M34528
KEYWORDS    immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain Balb/c), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
  TITLE     V-H and V-L gene usage by murine IgG antibodies that bind
            autologous insulin
  JOURNAL   J. Immunol. 144, 3091-3098 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   300     Ig kappa-chain V-region (AA at 1)
BASE COUNT       77 a     80 c     66 g     77 t
ORIGIN      
        1 gatattgtgc taactcagtc tccagccacc ctgtctgtga ctccaggaga tagcgtcagt
       61 ctttcctgca gggccagcca aagtattagc aacaacctac actggtatca acaaaaatca
      121 catgagtctc caaggcttct catcaagtat gcttcccagt ccatctctgg gatcccctcc
      181 aggttcagtg gcagtggatc agggacagat ttcactctca gtatcaacag tgtggagact
      241 gaagattttg gaatgtattt ctgtcaacag agtaacagct ggcctcacac gttcggctcg
//
LOCUS       MUSIGKABK     312 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse Ig kappa-chain (anti-insulin Ab 127) mRNA V region, partial
            cds.
ACCESSION   M34529
KEYWORDS    immunoglobulin heavy-chain; kappa-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain Balb/c), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
  TITLE     V-H and V-L gene usage by murine IgG antibodies that bind
            autologous insulin
  JOURNAL   J. Immunol. 144, 3091-3098 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig kappa-chain V-region (AA at 1)
BASE COUNT       78 a     81 c     77 g     76 t
ORIGIN      
        1 gacattgtgc tgacccaatc tccagcttct ttggctgtgt ctctagggca gagggccacc
       61 atatcctgca gagccagtga aagtgttgat agttatggca atagttttat gcactggtac
      121 cagcagaaac caggacagcc acccaaactc ctcatctatc gtgcatccaa cctagaatct
      181 gggatccctg ccaggttcag tggcagtggg tctaggacag acttcaccct caccattaat
      241 cctgtggagg ctgatgatgt tgcaagctat tactgtcagc aaagtaatga ggaacctccc
      301 acgttcggag gg
//
LOCUS       MUSIGKABL     312 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse Ig kappa-chain mRNA V region, partial cds.
ACCESSION   M34530
KEYWORDS    immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain Balb/c), cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Ewulonu,U.K., Nell,L.J. and Thomas,J.W.
  TITLE     V-H and V-L gene usage by murine IgG antibodies that bind
            autologous insulin
  JOURNAL   J. Immunol. 144, 3091-3098 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig kappa-chain V-region (AA at 1)
BASE COUNT       71 a     96 c     69 g     76 t
ORIGIN      
        1 caaattgttc tcacccagtc tccaacaatc atgtctgcat ctctagggga acgggtcacc
       61 atgacctgca ctgccagctc aagtgtaagt tccagttact tgcactggta ccagcagaag
      121 ccaggatcct cccccaaact ctggatttat agtacatcca acctggcttc tggagtccca
      181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgccag cagtatcatc gttccccacc cacgttcggt
      301 gctgggacca ag
//
LOCUS       HUMINSR01    2085 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 1.
ACCESSION   M23100 M32822
KEYWORDS    Alu repetitive sequence; insulin receptor.
SEGMENT     1 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1933)
  AUTHORS   Seino,S., Seino,M., Nishi,S. and Bell,G.I.
  TITLE     Structure of the human insulin receptor gene and characterization
            of its promoter
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 114-118 (1989)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 2085)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept       1824  +  1923     human insulin receptor precursor, exon 1
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    sigp       1824     1904     human insulin receptor signal peptide
    matp       1905  +  1923     human insulin receptor
    pre-msg    1541  >  2085     hINSR mRNA and introns (alt.)
    pre-msg    1542  >  2085     hINSR mRNA and introns (alt.)
    pre-msg    1548  >  2085     hINSR mRNA and introns (alt.)
    IVS        1924  >  2085     hINSR intron A
    rpt     <     1       76     Alu repeat
BASE COUNT      417 a    631 c    702 g    335 t
ORIGIN      Chromosome 19p13.3-13.2.
        1 agatctggcc attgcactcc agcctgggca acagagaaaa actccatcta aaaaaaaaaa
       61 aaaaaaaaaa aaaaaacaga gagagagaga gagagagaga gaaggaaacg gaactggggg
      121 gaggatttgc aaaaatatgg ttagggatgg cacttcagag atgaagccat cctggagtgt
      181 tacgggcaag ggaaatgctg gggcaaagcc ccagaggcag gaataggttt ggcctgttgc
      241 atgaacagtg ggtccagctc ctagcaaact gtttattgaa tgaaagaaga atgaatgcct
      301 tgggtctagg gttgtgctgg gcgctttctt aagttttctt tcccgggtac ctccccagaa
      361 ctggcatgca ggtattatta aacccattac acaagtgaaa ctggcccaga gacagaaaag
      421 tccctggtcc aagaccacac aggagtgagg ggtggaggaa ccctcctccc attgagttct
      481 ggctttccta tactgaaagc cccttcctct cctgcagtaa ggtaggtgga accgctgtcc
      541 cgccttgttg gtgaatgtcg ttgctagact tcagacacat acaggctggt ctgctgaaaa
      601 tcagagatgt ccacctgcgc cctattcgag gtctccggcg tcttctttgg cgtcgtcttt
      661 gccctttcag aagcgtctgc acatttttcc aggtgtcatt tctccaactt gaacacaggg
      721 agcgcactgg gcacgcgggc acgtggctgt ccccaggggc ctggcttggg tctcgcccct
      781 gggccggggc gcacgcgcgg gcgggacatc tgggggcgcc cacgcgctct gggacgagtg
      841 tcgctggcca ggcccggact gaggaaaggc gagtgagaca ctactcgcct ggggtgcaaa
      901 atttaaggga gtgaaaaaaa aaaaaaaaga aagaaaccaa aaccacctcg agtcaccaaa
      961 ataaacattt taatgcagta ttttttaaaa aatcaacagg aatcctccaa agcccactat
     1021 gaacaaaata gcaaaatggt agagaaagga tctgtgccgc tgcgtcgggc ctgtggggcg
     1081 cctccggggg tctgaaactg gaggagactc ggggctgtag ggcgcgcgga tctggggcgc
     1141 gccctcggtc ccggcgcgcc cagggcctcc cgcgcggggc ccggcacagg gaggcgggga
     1201 ggcgggcggg gcggggcggg accgggcggc acctccctcc cctgcaagct ttccctccct
     1261 ctcctgggcc tctcccgggc gcagagtccc ttcctaggcc agatccgcgc cgccttttcc
     1321 cgcggcccgc acggggccca gctgacgggc cgcgttgttt acgggccgga gcagccctct
     1381 ctcccgccgc ccgcccgcca cccgccagcc caggtgcccg cccgccagtc agctagtccg
     1441 tcggtccgcg cgtccctctg tcccggagcc cgcagatcgc gacccagagc gcgcggggcc
     1501 gagagccgag agacagtccc gggcgcagcg cggagctccg ggccccgaga tcctgggacg
     1561 gggcccgggc cgcagcggcc ggggggtcgg ggccaccacc gcaagggcct ccgctcagta
     1621 tttgtagctg gcgaagccgc gcgcgccctt cccggggctg cctctgggcc ctccccggca
     1681 ggggggctgc ggcccgcggg tcgcgggcgt ggaagagaag gacgcgcggc ccccagcgcc
     1741 tcttgggtgg ccgcctcgga gcatgacccc cgcgggccag cgccgcgcgc tctgatccga
     1801 ggagaccccg cgctcccgca gccatgggca ccgggggccg gcggggagcg gcggccgcgc
     1861 cgctgctggt ggcggtggcc gcgctgctac tgggcgccgc gggccacctg taccccggag
     1921 agggtgagtc tgggggcgcg ggcgtgggcg gggagcgccg cgatggggag aggaccccac
     1981 ccaagccaaa atcgatcccc cgcttgtgga ctgagaaccc tccccagggg cggggggcgg
     2041 tggccaggac ggtagctcct gcatcgcgta gggggagcgg gaagc
//
LOCUS       HUMINSR02     928 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 2.
ACCESSION   M32823
KEYWORDS    insulin receptor.
SEGMENT     2 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 928)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   174  +   725     human insulin receptor precursor, exon 2
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   174  +   725     human insulin receptor
    pre-msg <     1  >   927     hINSR mRNA and introns
    IVS     <     1      173     hINSR intron A
    IVS         726  >   927     hINSR intron B
BASE COUNT      218 a    234 c    237 g    239 t
ORIGIN      About 25.0 kbp downstream of segment 1.
        1 tactttacag agaaagctac tcatcccggc tggctgcaga gtttacaggg cccgggatga
       61 aaacacaggg cccaggtttc ctgtccatga agccggctct gcccctgatc cttctgatgc
      121 atccaccgtg cgtctgctca cctgtcttgc tttctgttca ttttctcttg tagtgtgtcc
      181 cggcatggat atccggaaca acctcactag gttgcatgag ctggagaatt gctctgtcat
      241 cgaaggacac ttgcagatac tcttgatgtt caaaacgagg cccgaagatt tccgagacct
      301 cagtttcccc aaactcatca tgatcactga ttacttgctg ctcttccggg tctatgggct
      361 cgagagcctg aaggacctgt tccccaacct cacggtcatc cggggatcac gactgttctt
      421 taactacgcg ctggtcatct tcgagatggt tcacctcaag gaactcggcc tctacaacct
      481 gatgaacatc acccggggtt ctgtccgcat cgagaagaac aatgagctct gttacttggc
      541 cactatcgac tggtcccgta tcctggattc cgtggaggat aattacatcg tgttgaacaa
      601 agatgacaac gaggagtgtg gagacatctg tccgggtacc gcgaagggca agaccaactg
      661 ccccgccacc gtcatcaacg ggcagtttgt cgaacgatgt tggactcata gtcactgcca
      721 gaaaggtacg ccggggatac agggttctaa gcagtgtctc gtgccttgtt ctagaaagct
      781 taaaatgttt tatggcttaa aaatgttaaa tggtcattag gtaggggccg gggaatagtg
      841 ggtggtggca ttcactagcc cagggagtgg cagacatttt ctgtaaagac tcagatagta
      901 gatacttcag attttgcagg ccatatgg
//
LOCUS       HUMINSR03     639 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 3.
ACCESSION   M32824
KEYWORDS    insulin receptor.
SEGMENT     3 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 639)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   114  +   435     human insulin receptor precursor, exon 3
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   114  +   435     human insulin receptor
    pre-msg <     1  >   639     hINSR mRNA and introns
    IVS     <     1      113     hINSR intron B
    IVS         436  >   639     hINSR intron C
BASE COUNT      134 a    171 c    163 g    171 t
ORIGIN      About 25.0 kbp downstream of segment 2.
        1 gatccagaat tgctgcatat gcagacagga attggacaaa gccatttatt tatttattta
       61 tttatttatt tatttattta tttatttccc tctctctctc tctctctctc cagtttgccc
      121 gaccatctgt aagtcacacg gctgcaccgc cgaaggcctc tgttgccaca gcgagtgcct
      181 gggcaactgt tctcagcccg acgaccccac caagtgcgtg gcctgccgca acttctacct
      241 ggacggcagg tgtgtggaga cctgcccgcc cccgtactac cacttccagg actggcgctg
      301 tgtgaacttc agcttctgcc aggacctgca ccacaaatgc aagaactcgc ggaggcaggg
      361 ctgccaccaa tacgtcattc acaacaacaa gtgcatccct gagtgtccct ccgggtacac
      421 gatgaattcc agcaagtgag ttctggatgt gggtctgggg ggcagccgag aggagaagga
      481 acgtggggtt ggttgtgacg atgccgcttg ttaaaactgt gtgcaaaccc agggttaatt
      541 ggctatgagt gaggtctctg ctctcagatg ctacttttgc accctgtttt ggtcctgggc
      601 ttgggagtgg gagttgacta cctttttctc taaaggacc
//
LOCUS       HUMINSR04     663 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 4.
ACCESSION   M32825
KEYWORDS    insulin receptor.
SEGMENT     4 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 663)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   318  +   466     human insulin receptor precursor, exon 4
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   318  +   466     human insulin receptor
    pre-msg <     1  >   663     hINSR mRNA and introns
    IVS     <     1      317     hINSR intron C
    IVS         467  >   663     hINSR intron D
BASE COUNT      159 a    195 c    171 g    138 t
ORIGIN      About 15.0 kbp downstream of segment 3.
        1 ccaacatggt aaccccgtct ctactcaaaa atacaaaaat tagccaggca cggtggcggg
       61 cacctataat cccagctact gtggaggctg aggcaggaga atctcttgaa cccagaaggc
      121 agaggttgca gtgagctgag atcgcaccac tgcactccag cctgggcaac agagcgagac
      181 tctgtcacac aaacacacac acacacacaa agaaatacca tatcaggcag aaagatgcct
      241 gagatgtctg aaggaccttg gataccgtga cacccccctc ccctttctct ttctctctct
      301 ctctgctccg tccttagctt gctgtgcacc ccatgcctgg gtccctgtcc caaggtgtgc
      361 cacctcctag aaggcgagaa gaccatcgac tcggtgacgt ctgcccagga gctccgagga
      421 tgcaccgtca tcaacgggag tctgatcatc aacattcgag gaggcagtga gtgtctctgt
      481 gtgggcgtcg ggggtgcctg ttgggctcca tgtccctctg agctgtgagc ggggaagaaa
      541 agcagtgcag accctgctgc gtgctcctac agcactttta ggatggtcgt tcagtggctc
      601 ccccatggat agaaccatgc tgggagtctg cctcaaaacc tgaaatgaac agctcagtct
      661 tcc
//
LOCUS       HUMINSR05     410 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 5.
ACCESSION   M32826
KEYWORDS    insulin receptor.
SEGMENT     5 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 410)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   188  +   332     human insulin receptor precursor, exon 5
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   188  +   332     human insulin receptor
    pre-msg <     1  >   410     hINSR mRNA and introns
    IVS     <     1      187     hINSR intron D
    IVS         333  >   410     hINSR intron E
BASE COUNT      105 a     80 c    100 g    125 t
ORIGIN      About 3.0 kbp downstream of segment 4.
        1 gggcagaagt atgcttgacc catttaagga atgctaagga cttcagattg tgttctaagc
       61 atgatgagtt ttgagctggg tatgtccagt catttgcagc ctgagggtta tcttctcacc
      121 atggagaatc atgagaagat tgaaatatgt ctatagaaac ccactggata ttctctcctt
      181 tccttagaca atctggcagc tgagctagaa gccaacctcg gcctcattga agaaatttca
      241 gggtatctaa aaatccgccg atcctacgct ctggtgtcac tttccttctt ccggaagtta
      301 cgtctgattc gaggagagac cttggaaatt gggtacgtgg gcctgattgt gtgtatggcc
      361 tgagtgctaa ctaggaagtt cgtgtattag aacaacttaa ggattttttt
//
LOCUS       HUMINSR06     554 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 6.
ACCESSION   M32827
KEYWORDS    insulin receptor.
SEGMENT     6 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 554)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   189  +   403     human insulin receptor precursor, exon 6
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   189  +   403     human insulin receptor
    pre-msg <     1  >   554     hINSR mRNA and introns
    IVS     <     1      188     hINSR intron E
    IVS         404  >   554     hINSR intron F
BASE COUNT      154 a    129 c    130 g    141 t
ORIGIN      About 1.0 kbp downstream of segment 5.
        1 ggccatgaaa acttcctcaa cttcctctgt tatccacatt caacaaatat gtgttgagta
       61 tgtgccaagc aagtggagag gattaggcac gtagcactga acaagatcaa ctccgagcat
      121 ggccacacca tcttggagtt gtagaagacc agccgttgaa tgactagatg tgtgtgtttt
      181 ttccatagga actactcctt ctatgccttg gacaaccaga acctaaggca gctctgggac
      241 tggagcaaac acaacctcac catcactcag gggaaactct tcttccacta taaccccaaa
      301 ctctgcttgt cagaaatcca caagatggaa gaagtttcag gaaccaaggg gcgccaggag
      361 agaaacgaca ttgccctgaa gaccaatggg gaccaggcat cctgtaagtc actggtcccc
      421 aacctttttg gcacgaggga ccggtttagt ggaagatggt ttttccatgg actggtggtg
      481 ggtggggatg gtttcagcat gattcaagtg cattacattt actatgcact ttattcctat
      541 tatgattaca ttgt
//
LOCUS       HUMINSR07     592 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 7.
ACCESSION   M32828
KEYWORDS    insulin receptor.
SEGMENT     7 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 592)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   277  +   403     human insulin receptor precursor, exon 7
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   277  +   403     human insulin receptor
    pre-msg <     1  >   592     hINSR mRNA and introns
    IVS     <     1      276     hINSR intron F
    IVS         404  >   592     hINSR intron G
BASE COUNT      125 a    144 c    144 g    179 t
ORIGIN      About 1.0 kbp downstream of segment 6.
        1 ttgcgcgggt acagactgcg cttattcagt tgactgtctg gctgagtcaa gtcattggct
       61 tacgtgagtg tgagtggcca agttgcaaaa ctggctctta cctttgaatc ttcccccatt
      121 catactcagc caggcacatg gggaggagac ccttaaggga atagcagcat cacctctgcc
      181 ttctcacggt ccctccagga agtgtggggg tcccaggctt tggtctgaaa ctacactgaa
      241 atagctcatt tttgcctttt gttttaactt ttccaggtga aaatgagtta cttaaatttt
      301 cttacattcg gacatctttt gacaagatct tgctgagatg ggagccgtac tggccccccg
      361 acttccgaga cctcttgggg ttcatgctgt tctacaaaga ggcgtaagta gaagagttag
      421 agagacgctg aggaggcgag ggctggctgg ctctgtgctt gctacgtttg tgctccaatc
      481 tgcccctctt gggttcctgt ctatctccct cctcctcctg gaataaatat cttaggttcc
      541 tttttacaat ctcaccagtc gatggcatgc aaagtcaata gtgtctgctt tt
//
LOCUS       HUMINSR08     401 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 8.
ACCESSION   M32829
KEYWORDS    insulin receptor.
SEGMENT     8 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   124  +   374     human insulin receptor precursor, exon 8
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   124  +   374     human insulin receptor
    pre-msg <     1  >   401     hINSR mRNA and introns
    IVS     <     1      123     hINSR intron G
    IVS         375  >   401     hINSR intron H
BASE COUNT       90 a     98 c    112 g    101 t
ORIGIN      About 3.0 kbp downstream of segment 7.
        1 cattagattg ttgggtgagt aacatgtgac cctatgggat gtaacttccc aggcctcatc
       61 tgcacggcac tcagtgtgac ggtcttgtaa gggtaactgc cttctgctgt tttgtcttga
      121 aagcccttat cagaatgtga cggagttcga tgggcaggat gcgtgtggtt ccaacagttg
      181 gacggtggta gacattgacc cacccctgag gtccaacgac cccaaatcac agaaccaccc
      241 agggtggctg atgcggggtc tcaagccctg gacccagtat gccatctttg tgaagaccct
      301 ggtcaccttt tcggatgaac gccggaccta tggggccaag agtgacatca tttatgtcca
      361 gacagatgcc accagtgagt gtgtcttggg aatgtgaatt c
//
LOCUS       HUMINSR09     420 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 9.
ACCESSION   M32830
KEYWORDS    insulin receptor.
SEGMENT     9 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 420)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   106  +   273     human insulin receptor precursor, exon 9
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   106  +   273     human insulin receptor
    pre-msg <     1  >   420     hINSR mRNA and introns
    IVS     <     1      105     hINSR intron H
    IVS         274  >   420     hINSR intron I
BASE COUNT       85 a    125 c     94 g    116 t
ORIGIN      About 3.0 kbp downstream of segment 8.
        1 ggtgccctca tgatgtcttt aacttgtgtg tcccccgcca tcctcccacc agctttcttt
       61 gcacactgtt tctcatgatg gacccgtttc ctttctccct ggcagacccc tctgtgcccc
      121 tggatccaat ctcagtgtct aactcatcat cccagattat tctgaagtgg aaaccaccct
      181 ccgaccccaa tggcaacatc acccactacc tggttttctg ggagaggcag gcggaagaca
      241 gtgagctgtt cgagctggat tattgcctca aaggtgagtg caggcagctg tgctaggatc
      301 ggtggggttt gcacacgtgt gtctgatgca ctttgcttca cctctaggga agcagctatc
      361 tcttcctgtg tctcagtgtc ggaaggcaca cacacacact ccattctatc tcatatgaaa
//
LOCUS       HUMINSR10     517 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 10.
ACCESSION   M32831
KEYWORDS    insulin receptor.
SEGMENT     10 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 517)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   187  +   388     human insulin receptor precursor, exon 10
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   187  +   388     human insulin receptor
    pre-msg <     1  >   517     hINSR mRNA and introns
    IVS     <     1      186     hINSR intron I
    IVS         389  >   517     hINSR intron J
BASE COUNT       83 a     88 c    194 g    152 t
ORIGIN      About 11.0 kbp downstream of segment 9.
        1 tttgtggtgt gtgtatgtgt ggtgtgttgt gtgatgtgtg tggtgtgtgt gtgggggggt
       61 gtgtggtgtg tgtatgtgtg gtgtgtgtgg tgtgtgtgtg tggtgtgtgt gtgtgggggg
      121 ggtgtgtgtg tgtatgtgtg ttcagccgca gagacttgag cccccctttt ctgtttcttt
      181 ctccagggct gaagctgccc tcgaggacct ggtctccacc attcgagtct gaagattctc
      241 agaagcacaa ccagagtgag tatgaggatt cggccggcga atgctgctcc tgtccaaaga
      301 cagactctca gatcctgaag gagctggagg agtcctcgtt taggaagacg tttgaggatt
      361 acctgcacaa cgtggttttc gtccccaggt caggacttgg cgctgggctc tcttagtggg
      421 tgccaattgg cttggtgttg gtggaaggtc attacttagg gaccgagagg tagtgggagg
      481 gagagacggc agaaccctgg gtggagtctg aatggag
//
LOCUS       HUMINSR11     343 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 11.
ACCESSION   M32832
KEYWORDS    insulin receptor.
SEGMENT     11 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 343)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   123  +   158     human insulin receptor precursor, exon 11
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   123  +   158     human insulin receptor
    pre-msg <     1  >   343     hINSR mRNA and introns
    IVS     <     1      122     hINSR intron J
    IVS         159  >   343     hINSR intron K
BASE COUNT       68 a     97 c     98 g     80 t
ORIGIN      About 2.0 kbp downstream of segment 10.
        1 tggtccaggg tcaaagccag ggtgccctta ctcggacaca tgtggcctcc aagtgtcaga
       61 gcccagtggt ctgtctaatg aagttccctc tgtcctcaaa ggcgttggtt ttgtttccac
      121 agaaaaacct cttcaggcac tggtgccgag gaccctaggt atgactcacc tgtgcgaccc
      181 ctggtgcctg ctccgcgcag ggccggcggc gtgccaggca gatgcctcgg agaacccagg
      241 ggtttctctg gctttttgca tgcggcgggc agctgtgctg gagagcagat gcttcaccaa
      301 ttcagaaatc caatgccttc actctgaaat gaaatctggg cat
//
LOCUS       HUMINSR12     719 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 12.
ACCESSION   M32833
KEYWORDS    insulin receptor.
SEGMENT     12 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 719)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   161  +   435     human insulin receptor precursor, exon 12
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   161  +   435     human insulin receptor
    pre-msg <     1  >   719     hINSR mRNA and introns
    IVS     <     1      160     hINSR intron K
    IVS         436  >   719     hINSR intron L
BASE COUNT      137 a    198 c    195 g    189 t
ORIGIN      About 8.0 kbp downstream of segment 11.
        1 ggtcattcct ggcagtctgt attgtaatcc atgttcccca ttgctgcacc ctcctgcgct
       61 ctgatctttc ttcttaatca agccttttat tctccagtgt cactttttta aaaaaaatga
      121 tggtgatggt gtcatcatac atgtcctact gtcgttccag gccatctcgg aaacgcaggt
      181 cccttggcga tgttgggaat gtgacggtgg ccgtgcccac ggtggcagct ttccccaaca
      241 cttcctcgac cagcgtgccc acgagtccgg aggagcacag gccttttgag aaggtggtga
      301 acaaggagtc gctggtcatc tccggcttgc gacacttcac gggctatcgc atcgagctgc
      361 aggcttgcaa ccaggacacc cctgaggaac ggtgcagtgt ggcagcctac gtcagtgcga
      421 ggaccatgcc tgaaggtagg gctgctggtc cggggtccga gtgtcatggg tgggacatca
      481 aggctgactt tttgtttgag acggagcctt gctctgtcgc ccaggctgga gtacagtggt
      541 gcgacctcag ctcactccag cctctgccac ctatgtcaag tgattccctg cttcagcctc
      601 ccaagtagct gggactacag gtgtctgcca ccacgcccag ctaatttttg tatttttagt
      661 agagatgggg tttcaccata ttgcccaggc tggtcttgaa ctcctgggct caagtgatc
//
LOCUS       HUMINSR13     439 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 13.
ACCESSION   M32834
KEYWORDS    insulin receptor.
SEGMENT     13 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 439)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +    93  +   232     human insulin receptor precursor, exon 13
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +    93  +   232     human insulin receptor
    pre-msg <     1  >   439     hINSR mRNA and introns
    IVS     <     1       92     hINSR intron L
    IVS         233  >   439     hINSR intron M
BASE COUNT       98 a    114 c    105 g    122 t
ORIGIN      About 1.0 kbp downstream of segment 12.
        1 gtcaccagcc caaggttgca ccatggacag gtggcagaag tgggatctca tccaagagtt
       61 acatccctgc ctctcacttc ctctccttac agccaaggct gatgacattg ttggccctgt
      121 gacgcatgaa atctttgaga acaacgtcgt ccacttgatg tggcaggagc cgaaggagcc
      181 caatggtctg atcgtgctgt atgaagtgag ttatcggcga tatggtgatg aggtaaggcc
      241 cttgactctt gggcatgccc ctgcaccact tcagcatgcc ccttcagagt tgcacttggt
      301 acctccttcc tctgctgaaa ttttgattcc agtgcttctc tcatcaggta ctgtgctatt
      361 agtacttaaa gccttgatac ctgacttcgc aggaagatgg gtcagaaatg ccaatctacc
      421 agcttgttac ttttcttag
//
LOCUS       HUMINSR14     386 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 14.
ACCESSION   M32835
KEYWORDS    insulin receptor.
SEGMENT     14 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 386)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +    85  +   244     human insulin receptor precursor, exon 14
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +    85  +   244     human insulin receptor
    pre-msg <     1  >   386     hINSR mRNA and introns
    IVS     <     1       84     hINSR intron M
    IVS         245  >   386     hINSR intron N
BASE COUNT       62 a    123 c    115 g     86 t
ORIGIN      About 6.0 kbp downstream of segment 13.
        1 tggctgtgag ctccctgcga ggggtggaca ctcccagatg tgcaaagctc agccaccctc
       61 cttctcctcc tctcttcctc ccaggagctg catctctgcg tctcccgcaa gcacttcgct
      121 ctggaacggg gctgcaggct gcgtgggctg tcaccgggga actacagcgt gcgaatccgg
      181 gccacctccc ttgcgggcaa cggctcttgg acggaaccca cctatttcta cgtgacagac
      241 tattgtaagt ctccatggca gcctcagctg actggggctg tgcttagcac tgagcatggt
      301 gggacattgc aggggatgac ttggagaggc cgcagtgctg gccctggcct tgactctcag
      361 gcctatcagc tgctgcggtg cttgcc
//
LOCUS       HUMINSR15     429 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 15.
ACCESSION   M32836
KEYWORDS    insulin receptor.
SEGMENT     15 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 429)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +    92  +   194     human insulin receptor precursor, exon 15
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +    92  +   194     human insulin receptor
    pre-msg <     1  >   428     hINSR mRNA and introns
    IVS     <     1       91     hINSR intron N
    IVS         195  >   428     hINSR intron O
BASE COUNT      117 a     67 c     82 g    163 t
ORIGIN      About 3.0 kbp downstream of segment 14.
        1 cccacccatt ccaggagtgg atgtgatttt tgatgtgaac tttgttggaa acacattgat
       61 atgaaacata tattttctta ttctatttca gtagacgtcc cgtcaaatat tgcaaaaatt
      121 atcatcggcc ccctcatctt tgtctttctc ttcagtgttg tgattggaag tatttatcta
      181 ttcctgagaa agaggtgagt tcagtgagtt cagtggtgtg ctgggaacag ttggttctct
      241 gggggaaaac atgccttgat ataggtatag gcatatttaa gtttattatg aattttgctg
      301 atataggatg tgtaacatgc aatttacaga taattgtcat aatatgatat acacaactct
      361 ttattgtaaa ttccctctag acagttgatt ctcacagaat gtttttattg attttttttt
      421 ttgcccaaa
//
LOCUS       HUMINSR16     480 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 16.
ACCESSION   M32837
KEYWORDS    insulin receptor.
SEGMENT     16 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 480)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   261  +   328     human insulin receptor precursor, exon 16
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   261  +   328     human insulin receptor
    pre-msg <     1  >   480     hINSR mRNA and introns
    IVS     <     1      260     hINSR intron O
    IVS         329  >   480     hINSR intron P
BASE COUNT      123 a    131 c    109 g    117 t
ORIGIN      About 2.0 kbp downstream of segment 15.
        1 aaaaacaaaa acaaaaacaa aacaaaaaaa aaaccaccca gggagggatg agtgctccca
       61 tgttgatgca cttacatacc tgtctgatgg gcttccattc aaaacataaa ggtcccccat
      121 ccctgcccta gactgcatct aggattatgg ggattctgct ggtaagggct gccatttgcc
      181 ttggggagtc ttgtatgaaa cacctttctg cagagtccca tgagaatctc aagctaacgt
      241 gcctcgtttt cctcctccag gcagccagat gggccgctgg gaccgcttta cgcttcttca
      301 aaccctgagt atctcagtgc cagtgatggt gagtaccatc ccttccctgt gggtggccag
      361 aaccctactc atcagcttcc tttgccttca ccattgagtg agagtgaagg atgggttccc
      421 cagggaggcc aagaaaagcc ctcttattca tttgagcttg ccaaactgcc cttgctgcag
//
LOCUS       HUMINSR17     485 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 17.
ACCESSION   M32838
KEYWORDS    insulin receptor.
SEGMENT     17 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 485)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   136  +   380     human insulin receptor precursor, exon 17
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   136  +   380     human insulin receptor
    pre-msg <     1  >   485     hINSR mRNA and introns
    IVS     <     1      135     hINSR intron P
    IVS         381  >   485     hINSR intron Q
BASE COUNT       96 a    119 c    162 g    108 t
ORIGIN      About 1.0 kbp downstream of segment 16.
        1 cccggcatgg gtcctggatc acagaactca tttcatgagt gttttcgagg gggtttgggt
       61 gagggcttgg gtggaaggtg gctgcagacc cccaagggat cctccaagga tgctgtgtag
      121 ataagtaaga agtagtgttt ccatgctctg tgtacgtgcc ggacgagtgg gaggtgtctc
      181 gagagaagat caccctcctt cgagagctgg ggcagggctc cttcggcatg gtgtatgagg
      241 gcaatgccag ggacatcatc aagggtgagg cagagacccg cgtggcggtg aagacggtca
      301 acgagtcagc cagtctccga gagcggattg agttcctcaa tgaggcctcg gtcatgaagg
      361 gcttcacctg ccatcacgtg gtgagtccag tgggggtggg acatgggctg gctttcctga
      421 cccttccctt tctctgcctc ctcctcctgc acagagcgac agaggacaca gggtgtatcc
      481 tccta
//
LOCUS       HUMINSR18     287 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 18.
ACCESSION   M32839
KEYWORDS    insulin receptor.
SEGMENT     18 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 287)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   117  +   227     human insulin receptor precursor, exon 18
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   117  +   227     human insulin receptor
    pre-msg <     1  >   287     hINSR mRNA and introns
    IVS     <     1      116     hINSR intron Q
    IVS         228  >   287     hINSR intron R
BASE COUNT       51 a     85 c     98 g     53 t
ORIGIN      About 2.0 kbp downstream of segment 17.
        1 acgctgcatc caggccacag ggtgctgtgt gtgacataga caccagggag ggaggagaac
       61 cctggtgagt cgaatcacgg accctcctcc aagaaccctg gttgcttgct ctgcaggtgc
      121 gcctcctggg agtggtgtcc aagggccagc ccacgctggt ggtgatggag ctgatggctc
      181 acggagacct gaagagctac ctccgttctc tgcggccaga ggctgaggta agctgcttcg
      241 ggggacccag cggggtactc ggtggagcac ccgctcctgg cctcctc
//
LOCUS       HUMINSR19     322 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 19.
ACCESSION   M32840
KEYWORDS    insulin receptor.
SEGMENT     19 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 322)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +    45  +   204     human insulin receptor precursor, exon 19
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +    45  +   204     human insulin receptor
    pre-msg <     1  >   322     hINSR mRNA and introns
    IVS     <     1       44     hINSR intron R
    IVS         205  >   322     hINSR intron S
BASE COUNT       81 a     76 c     79 g     86 t
ORIGIN      About 0.5 kbp downstream of segment 18.
        1 gatcccagtg ctgctgaaac accaaccccg tgtttctgtt ttagaataat cctggccgcc
       61 ctccccctac ccttcaagag atgattcaga tggcggcaga gattgctgac gggatggcct
      121 acctgaacgc caagaagttt gtgcatcggg acctggcagc gagaaactgc atggtcgccc
      181 atgattttac tgtcaaaatt ggaggttcgt ctggctttct gctttgaaaa cataacgacc
      241 caggccaggt ttgatttcag aaggaagttg tctataatga gccgttaagt cttttctgat
      301 aatataaagg ggcaagtact tc
//
LOCUS       HUMINSR20     288 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 20.
ACCESSION   M32841
KEYWORDS    insulin receptor.
SEGMENT     20 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 288)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   115  +   244     human insulin receptor precursor, exon 20
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   115  +   244     human insulin receptor
    pre-msg <     1  >   288     hINSR mRNA and introns
    IVS     <     1      114     hINSR intron S
    IVS         245  >   288     hINSR intron T
BASE COUNT       61 a     55 c    102 g     70 t
ORIGIN      About 0.5 kbp downstream of segment 19.
        1 gacgtgggcc aggtgaaccc ctcttagggc tctgtgagag gtggggcagt caaggtggca
       61 gatgctagga ccaaggctga aggttaagag cgtgtgaacc ttttgtgttg tcagactttg
      121 gaatgaccag agacatctat gaaacggatt actaccggaa agggggcaag ggtctgctcc
      181 ctgtacggtg gatggcaccg gagtccctga aggatggggt cttcaccact tcttctgaca
      241 tgtggtgagt tgtgtgtgga tgggtggatg gacgctgggc ttgaattc
//
LOCUS       HUMINSR21     407 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 21.
ACCESSION   M32842
KEYWORDS    insulin receptor.
SEGMENT     21 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 407)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +   101  +   235     human insulin receptor precursor, exon 21
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +   101  +   235     human insulin receptor
    pre-msg <     1  >   407     hINSR mRNA and introns
    IVS     <     1      100     hINSR intron T
    IVS         236  >   407     hINSR intron U
BASE COUNT       73 a     75 c    118 g    141 t
ORIGIN      About 1.0 kbp downstream of segment 20.
        1 ttgcgtgtgt gtgtgcgttt gcgtgtgtgt gtttgcgcgc gcgcgtgtgt gtgtgtgtct
       61 aaatggcttc tttgttacta ctatcaactg tcatcggcag gtcctttggc gtggtccttt
      121 gggaaatcac cagcttggca gaacagcctt accaaggcct gtctaatgaa caggtgttga
      181 aatttgtcat ggatggaggg tatctggatc aacccgacaa ctgtccagag agagtgtaag
      241 tgtagaaagg gtttaaggtg tgtgaggtgt tcgttgaaag ggtattgccc tttacacgtg
      301 tgcttggttt tgcctttcct atgtctacac gctcaccgtg tttgcatgct gtatgttaca
      361 ggtgtgtttg tgtttgcata gcttgtcttt acatgcatgc ttgcatt
//
LOCUS       HUMINSR22     873 bp ds-DNA             PRI       10-JUL-1990
DEFINITION  Human insulin receptor (hINSR) gene, exon 22.
ACCESSION   M32972
KEYWORDS    insulin receptor.
SEGMENT     22 of 22
SOURCE      Human fetal liver DNA, clone lambda-hINSR-[1-13].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 873)
  AUTHORS   Seino,S., Seino,M. and Bell,G.I.
  TITLE     Human insulin-receptor gene: Partial sequence and amplification of
            exons by polymerase chain reaction
  JOURNAL   Diabetes 39, 123-128 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.I.Bell, 14-MAR-1990.
FEATURES       from  to/span     description
    pept    +    83      437     human insulin receptor precursor, exon 22
                                 /hgml_locus_uid="LG0007M" /nomgen="INSR"
                                 /map="19p13.3-p13.2"
    matp    +    83      434     human insulin receptor
    pre-msg <     1      873     hINSR mRNA and introns
    IVS     <     1       82     hINSR intron U
BASE COUNT      199 a    217 c    234 g    223 t
ORIGIN      About 2.0 kbp downstream of segment 21.
        1 ctgcagggac aagagtgggg gtttgggagg atgcgtggca gggcccccag actcacccag
       61 gacgtgtcct tctgccccgc agcactgacc tcatgcgcat gtgctggcaa ttcaacccca
      121 agatgaggcc aaccttcctg gagattgtca acctgctcaa ggacgacctg caccccagct
      181 ttccagaggt gtcgttcttc cacagcgagg agaacaaggc tcccgagagt gaggagctgg
      241 agatggagtt tgaggacatg gagaatgtgc ccctggaccg ttcctcgcac tgtcagaggg
      301 aggaggcggg gggccgggat ggagggtcct cgctgggttt caagcggagc tacgaggaac
      361 acatccctta cacacacatg aacggaggca agaaaaacgg gcggattctg accttgcctc
      421 ggtccaatcc ttcctaacag tgcctaccgt ggcgggggcg ggcaggggtt cccattttcg
      481 ctttcctctg gtttgaaagc ctctggaaaa ctcaggattc tcacgactct accatgtcca
      541 gtggagttca gagatcgttc ctatacattt ctgttcatct taaggtggac tcgtttggtt
      601 accaatttaa ctagtcctgc agaggattta actgtgaacc tggagggcaa ggggtttcca
      661 cagttgctgc tcctttgggg caacgacggt ttcaaaccag gattttgtgt tttttcgttc
      721 cccccacccg cccccagcag atggaaagaa agcacctgtt tttacaaatt cttttttttt
      781 tttttttttt tttttttttg ctggtgtctg agcttcagta taaaagacaa aacttcctgt
      841 ttgtggaaca aaatttcgaa agaaaaaacc aaa
//
LOCUS       BT1NAMTA     1091 bp ds-DNA             PHG       10-JUL-1990
DEFINITION  Bacteriophage T1 DNA N-6-adenine-methyltransferase (M.T1) gene,
            complete cds.
ACCESSION   J05393
KEYWORDS    DNA N-6-adenine-methyltransferase.
SOURCE      Bacteriophage T1 DNA.
  ORGANISM  Bacteriophage T1
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 1091)
  AUTHORS   Schneider-Scherzer,E., Auer,B., de Groot,E.J. and Schweiger,M.
  TITLE     Primary structure of a DNA (N-6-adenine)-methyltransferase from
            Escherichia coli virus T1
  JOURNAL   J. Biol. Chem. 265, 6086-6091 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        171      824     DNA N-6-adenine-methyltransferase (M.T1)
    pept        824     1072     pot. protein HP 83
    binding     161      164     ribosomal binding site (put.)
    binding     813      816     ribosomal binding site (put.)
    signal      141      146     TATA box
BASE COUNT      345 a    205 c    266 g    275 t
ORIGIN      
        1 aaaagggaag tttctcaaaa aggtccggga gcgtggcggc ttctctgccg tcgcatacgg
       61 attcgggcaa ttcaagatcg caatttacga aatgatgaaa tagcactttt tgttaaaact
      121 gccgggatgg aatctggcat tattatctca ccaaaacgag aggaataaaa atgaaagact
      181 ttaatgatat cgaaactatc gactttgcag aaactggttg ctcattcact cgcgaagcaa
      241 tagcatcagg cggttattat caggcattga aaacgccaac ctgtaaagag atttcagggc
      301 gtcgatacaa ggggacaaat acccctgacg ctgttcgtga tttatggtca actccgcgag
      361 aggttattgc ataccttgag ggtcgttatg ggaaatatga tctcgacgct gcggcaagcg
      421 aagaaaataa agtttgcgag aagttttact ctcaggaaac aaactgctta aaacgttggt
      481 ggggaaagaa taagcacgtt tggttaaatc ctccttatag ccgacctgat atatttgtca
      541 actctactgc gtggtttact gaagcgcggc agaacgcagc tgaaataatc tggattgaag
      601 cggacttgac tgaggatatt gacggcaatg aatacgcacg atccggtcgc ctggctttca
      661 tatccggtga aactggaaag gccgtagacg gtaataacaa aggttcggta atttttatta
      721 tgcgcgaact taaagaaggt gaggtgcaac agactcacta catcccaatc acaagcattt
      781 gcccttcggt gaaaaacaaa cgagcaaagg tgaggaaagt atgatgagcg aaaaaatggt
      841 tcctgttaaa ttaactgagc aaggtttatg gctactttat cgagctacgt gctgcgaaat
      901 tatggagcga aacggattga ctcaggatgt tattggttgc gatctgtggg agttcactag
      961 ttctcttgat atgcttttcg atgagataaa aaatgaatac atagagaact ggccttcaat
     1021 catacagaaa gacgtggaag aacttaaagc tgatacaatc gtacagcact aattgctaaa
     1081 actacccggc g
//
LOCUS       STVBLSG      1130 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  Streptoverticillum sp. blasticidin S-acetyltransferase (bls) gene,
            complete cds.
ACCESSION   M34537
KEYWORDS    blasticidin S-acetyltransferase.
SOURCE      Streptoverticillum sp. (strain JCM4673) DNA.
  ORGANISM  Streptoverticillum sp.
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 1130)
  AUTHORS   Perez-Gonzalez,J.A., Ruiz,D., Esteban,J.A. and Jimenez,A.
  TITLE     Cloning and characterization of the gene encoding a blasticidin S
            acetyltransferase from Streptoverticillum sp
  JOURNAL   Gene 86, 129-134 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        147      557     blasticidin S-acetyltransferase (bls) (147
                                 could be 225)
    binding     136      139     ribosomal binding site (put.)
BASE COUNT      151 a    420 c    394 g    165 t
ORIGIN      
        1 gatcagcgcc ggcccccacc ggcactgtgc atcagcgtac ggccggggta cgacaacgga
       61 agcggattgg caaaactgcc tggccccggt gtttatggtg agctttatgt tcagtattga
      121 ggcggtgaac gacccggaac gacgcgatgt tgtccttgcc acggttgcag accgtcaacg
      181 acgaacgttc gcccgccctg cgggcgttgc ggcgcacgcc ggtgatggag gcgcggccgc
      241 tggaggtgta cgccacgtac gcctgcggcg agcgcgggga gctggcgggc gggctcgtcg
      301 gtcatgtgca gtggcaatgg ctgcacgtgg acctgctgtg ggtggacgcg ggggcccgcg
      361 gggcggggct gggctcgcgg ttgatcgcgc gggcggaggc ccgcgcccgg gaggagttcg
      421 gctgcatcgg cagccaggtg gagacctggg acttccaggc gccggggttc taccagcggg
      481 tggggtatcg cctcgcggcg agcatcccgg actatccgcc cgggatcacg agccacctgc
      541 tggtgaagga gctttgaggc gccccgtcag gggcgcgggg ccgttactcc ggggctgcgc
      601 cccggacccc cgggtggcgc gtcgactgcg ggccggtggg ggcttgtcgc gcagttcccc
      661 gcgcccctta cggggcgcct ggtcgcgccc acgcggcgga gccgcatatc gagcacagcc
      721 ccgcgcccct tacggggcgc tgctctaggc cacccgccgt gccccctccc ccgccgccgt
      781 gccgaacagt cgtgccgtcc ccagtgcctc ggtgaccacc ttggtcaccc tttcctcatc
      841 tgccccatcc accaaggcga ttgccgagcc gccgaagccg ccgcccgtca tccgggcccc
      901 cagggccccc gccttcaccg ccgtctccac caccacgtcc aattccgcac aggacacccg
      961 gaagtcgtcg cgcagcgagg cgtgcccctc cgtcagcagt gggcccacag ccctcgcatc
     1021 ccccgcggcc agcagggccg cgacccgctc cacccggtcg ttctccgtca ccacgtgacg
     1081 gaccaaggcg cgctccgcgg caggcaactc acccagtgcc gcctgcagac
//
LOCUS       HUMGAPDH     1268 bp ss-mRNA            PRI       10-JUL-1990
DEFINITION  Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) mRNA,
            complete cds.
ACCESSION   M33197
KEYWORDS    glyceraldehyde-3-phosphate dehydrogenase.
SOURCE      Human lung cancer cell, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1268)
  AUTHORS   Tokunaga,K., Nakamura,Y., Sakata,K., Fujimori,K., Ohkubo,M.,
            Sawada,K. and Sakiyama,S.
  TITLE     Enhanced expression of a glyceraldehyde-3-phosphate dehydrogenase
            gene in human lung cancers
  JOURNAL   Cancer Res. 47, 5616-5619 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         61     1068     glyceraldehyde-3-phosphate dehydrogenase (EC
                                 1.2.1.12) /hgml_locus_uid="LM0055R"
                                 /nomgen="GAPD" /map="12p13"
    mRNA    <     1     1268     GAPDH mRNA
BASE COUNT      295 a    385 c    326 g    262 t
ORIGIN      
        1 gttcgacagt cagccgcatc ttcttttgcg tcgccagccg agccacatcg ctcagacacc
       61 atggggaagg tgaaggtcgg agtcaacgga tttggtcgta ttgggcgcct ggtcaccagg
      121 gctgctttta actctggtaa agtggatatt gttgccatca atgacccctt cattgacctc
      181 aactacatgg tttacatgtt ccaatatgat tccacccatg gcaaattcca tggcaccgtc
      241 aaggctgaga acgggaagct tgtcatcaat ggaaatccca tcaccatctt ccaggagcga
      301 gatccctcca aaatcaagtg gggcgatgct ggcgctgagt acgtcgtgga gtccactggc
      361 gtcttcacca ccatggagaa ggctggggct catttgcagg ggggagccaa aagggtcatc
      421 atctctgccc cctctgctga tgcccccatg ttcgtcatgg gtgtgaacca tgagaagtat
      481 gacaacagcc tcaagatcat cagcaatgcc tcctgcacca ccaactgctt agcacccctg
      541 gccaaggtca tccatgacaa ctttggtatc gtggaaggac tcatgaccac agtccatgcc
      601 atcactgcca cccagaagac tgtggatggc ccctccggga aactgtggcg tgatggccgc
      661 ggggctctcc agaacatcat ccctgcctct actggcgctg ccaaggctgt gggcaaggtc
      721 atccctgagc tgaacgggaa gctcactggc atggccttcc gtgtccccac tgccaacgtg
      781 tcagtggtgg acctgacctg ccgtctagaa aaacctgcca aatatgatga catcaagaag
      841 gtggtgaagc aggcgtcgga gggccccctc aagggcatcc tgggctacac tgagcaccag
      901 gtggtctcct ctgacttcaa cagcgacacc cactcctcca cctttgacgc tggggctggc
      961 attgccctca acgaccactt tgtcaagctc atttcctggt atgacaacga atttggctac
     1021 agcaacaggg tggtggacct catggcccac atggcctcca aggagtaaga cccctggacc
     1081 accagcccca gcaagagcac aagaggaaga gagagaccct cactgctggg gagtccctgc
     1141 cacactcagt cccccaccac actgaatctc ccctcctcac agttgccatg tagacccctt
     1201 gaagagggga ggggcctagg gagccgcacc ttgtcatgta ccatcaataa agtaccctgt
     1261 gctcaacc
//
LOCUS       MUSMK2P       728 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Mouse retinoic acid-responsive protein (MK) mRNA, complete cds.
ACCESSION   M35833 J05447
KEYWORDS    MK protein; retinoic acid-responsive protein.
SOURCE      Mouse (strain BALB/c) adult liver, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (sites)
  AUTHORS   Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T.
  TITLE     Structure of a retinoic acid-responsive gene, MK, which is
            transiently activated during the differentiation of embryonal
            carcinoma cells and the mid-gestation period of mouse embryogenesis
  JOURNAL   J. Biol. Chem. 265, 9441-9443 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 728; for [1])
  AUTHORS   Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            M.Shyuichiro, 20-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         44      466     retinoic acid-responsive protein MK precursor
    sigp         44      109     retinoic acid-responsive protein MK signal
                                 peptide
    matp        110      463     retinoic acid-responsive protein MK
    mRNA          1      728     MK2 mRNA
BASE COUNT      184 a    211 c    206 g    127 t
ORIGIN      
        1 caggccggag cgggagggag cgaagcatcg agcagtgagc gagatgcagc accgaggctt
       61 cttccttctc gcccttcttg ccctcttggt ggtcacgtcc gcggtggcca aaaaaaaaga
      121 gaaggtgaag aagggcagcg agtgttcgga gtggacctgg gggccctgca cccccagcag
      181 caaggactgc ggcatgggct tccgcgaggg tacctgtggg gcccagaccc agcgcgtcca
      241 ttgcaaggtg ccctgcaact ggaagaagga atttggagcc gactgcaaat acaagtttga
      301 gagctggggg gcgtgtgatg ggagcactgg caccaaagcc cgccaaggga ccctgaagaa
      361 ggcgcggtac aatgcccagt gccaggagac catccgcgtg actaagccct gcacctccaa
      421 gaccaagtca aagaccaaag ccaagaaagg aaaaggaaag gactaagtca ggaggccaga
      481 gagcctccgg cctcgcctgg agcctgaacg gagccctcct ctcccacagg cccaagatat
      541 aacccaccag tgccttttgt cttcctgtca gctctgtcaa tcacgcctgt cctctcacgc
      601 ccacaccaag tgcccaaagt ggggagggac aagagattct ggaaagtgag cctccccata
      661 ccctcttttg ttctccccac cctgatactt gttattaaga aatgaataaa ataaactcac
      721 ttttttcc
//
LOCUS       MUSMKPG      2929 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse retinoic acid-responsive protein (MK) gene, complete cds.
ACCESSION   M34094 J05447
KEYWORDS    MK protein; alternative splicing; retinoic acid-responsive protein.
SOURCE      Mouse (strain BALB/c) adult liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2929)
  AUTHORS   Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T.
  TITLE     Structure of a retinoic acid-responsive gene, MK, which is
            transiently activated during the differentiation of embryonal
            carcinoma cells and the mid-gestation period of mouse embryogenesis
  JOURNAL   J. Biol. Chem. 265, 9441-9443 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            M.Shyuichiro, 20-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept       1298     1373     retinoic acid-responsive protein (MK)
                                 precursor, exon 1
               1491     1649     retinoic acid-responsive protein (MK)
                                 precursor, exon 2
               1766     1927     retinoic acid-responsive protein (MK)
                                 precursor, exon 3
               2631     2656     retinoic acid-responsive protein (MK)
                                 precursor, exon 4
    sigp       1298     1363     retinoic acid-responsive protein (MK) signal
                                 peptide
    matp       1364     1373     retinoic acid-responsive protein (MK)
               1491     1649     retinoic acid-responsive protein (MK)
               1766     1927     retinoic acid-responsive protein (MK)
               2631     2653     retinoic acid-responsive protein (MK)
    pre-msg     463     2918     MK3 mRNA and introns (minor alt.)
    pre-msg    1007     2918     MK2 mRNA and introns (major alt.)
    pre-msg    1048     2918     MK1 mRNA and introns (minor alt.)
    IVS         816     1296     MK3 intron A
    IVS        1052     1296     MK2 intron A
    IVS        1374     1490     MK1 intron A, and MK2 and MK3 intron B
    IVS        1650     1765     MK1 intron B, and MK2 and MK3 intron C
    IVS        1928     2630     MK1 intron C, and MK2 and MK3 intron D
    signal       85       91     GC box
    signal      143      149     GC box
    signal      274      280     GC box
    signal      598      604     GC box
    signal      852      858     GC box
    signal      910      916     GC box
    signal      939      945     GC box
BASE COUNT      660 a    771 c    930 g    568 t
ORIGIN      
        1 tggccaccaa catctcagat cacttcggga gatgggtctg ccccgatcct gacctctgcc
       61 tagggcctta ggctcacagc gcctggggcg gagctgattt tcccgctcct gcagggatga
      121 taacaatgaa agtaaaagag gtggggcggg ggccaggctt gggttctttg gtcttttggc
      181 cctgtgccct ggagcagtcc cctccccctg gcttgtactg gggggggggg gggggatctg
      241 cttgaggtga gcctgaggcc ccagggtcag gggtgggcgg ttatcacctc cgggggaagc
      301 ccggtctgga acttctcaga cagctcttgt cagcgacaag atttaccaaa ctcatttcta
      361 tgtgcttccc catccccccc aacgcccttc cctcctcctc ctcccccaaa cctgcactag
      421 aaaaaggctc tcgagccttg ctcacccgga gccatctgag gtcccaggta cccagctccc
      481 tgccacatca gagacccttc ttgcactctg agtgaactga ttaaaaaaaa aaaaaaaaaa
      541 aaaaaaccaa gccggaggtg agccgggcct cgaagggaag gttcgcgggt gcggtggccg
      601 ccccgagcct gtgacaccag gacatactcc cggggcccgc ggtgggcaag cgaagtggtg
      661 acctgagagc tgacaggctg cgagagggaa aagtatagac aggcctagac caggggaagg
      721 ggaggggata gagagctggg cctgctacga ggggacctga gccagaagcg cactggtaaa
      781 accgaactcc aggaccagag acccagagat cagaggtgag aggcacagac gcgggagtcc
      841 cggctcggcg aggggcggga gtggaggcgg ggactagggg ggtctgggga ggtgcgggtt
      901 tggggggagg gggcgggtcc ttccacggga tggggggagg ggcgggggcc catgtgaccg
      961 gctcagaccg gttctggaga caaaaggggc cttagcggcc ttagcgggac aggccggagc
     1021 gggagggagc gaagcatcga gcagtgagcg agtgagcgca cgcagtggct gtggccccag
     1081 tcccttcagg cggctgctct gccaccaagg gggctgaggt gggggtgggg gtacgctgag
     1141 acatcggttc caagtcctcc ctccgtctcc cccttgtcgg tccgacgttt tgggcctgga
     1201 aagtgggaca agtcagtcaa gggtgggagg tccttcccgc ggttcctagc ggagaagaga
     1261 ctaggcgaga aactctaacc caggttttac ccctaggatg cagcaccgag gcttcttcct
     1321 tctcgccctt cttgccctct tggtggtcac gtccgcggtg gccaaaaaaa aaggtgatgg
     1381 gataggatgg gctcaggagt aaaagctggg gtgggcaggt gaggcaggcc gtgtgaccaa
     1441 gtgctggtcc ggcacgccat gtccttaact ttgttccttg cgccctgtag agaaggtgaa
     1501 gaagggcagc gagtgttcgg agtggacctg ggggccctgc acccccagca gcaaggactg
     1561 cggcatgggc ttccgcgagg gtacctgtgg ggcccagacc cagcgcgtcc attgcaaggt
     1621 gccctgcaac tggaagaagg aatttggagg tgaggtggcg cgcgggagga gggcgggaag
     1681 ccagagggta tgtccttata aaccggaggc agggaggaca tccacaaccc tcctgtctct
     1741 caccgtgggg ccactctccc atcagccgac tgcaaataca agtttgagag ctggggggcg
     1801 tgtgatggga gcactggcac caaagcccgc caagggaccc tgaagaaggc gcggtacaat
     1861 gcccagtgcc aggagaccat ccgcgtgact aagccctgca cctccaagac caagtcaaag
     1921 accaaaggtc agcgaatatg gtggggttgt gggccaggct actccatgct ctgtctctgc
     1981 agagcagtct taaagttagg aatgggcagg cacttgaggg ccactctcag gagatgctaa
     2041 accctctgcc caagtaggaa ctactctttc tgttggatca tccgacctgg gttcctggga
     2101 aaggcttgtc tttgtcaact gaggaaggtg gggtgggatc agggaggagt taactctgcg
     2161 cttaaaacta tggaaaggcc tgtcccaaag gtacatgctg ctacctgact cccaacagct
     2221 attgaggcca gcagggcaga ggtgactctg cccatttccc cggtgaggaa cttggagtac
     2281 tctgatccta gatgaaaata gaaagttgaa agtcaggctt ggtagctcgt gcctgtaaaa
     2341 agcggcactt caggactgag gcagtaacac tgccttgagt tcaaggttac agactgagag
     2401 acttgagagt ctgtctttaa aggggggggg ggggcgcgag ggttaaaaag ttgaacgaat
     2461 aaagaaagat ttcatatcac atggctgccc tttcccacca cttccaggtg aactggtcag
     2521 tcaccactag ggggcaggat tttctctcct tgatggacat gtctgcgttg tctggtgagt
     2581 ccgagctagg tcacccaccg cactaatgca tctccgttat tgttttccag ccaagaaagg
     2641 aaaaggaaag gactaagtca ggaggccaga gagcctccgg cctcgcctgg agcctgaacg
     2701 gagccctcct ctcccacagg cccaagatat aacccaccag tgccttttgt cttcctgtca
     2761 gctctgtcaa tcacgcctgt cctctcacgc ccacaccaag tgcccaaagt ggggagggac
     2821 aagagattct ggaaagtgag cctccccata ccctcttttg ttctccccac cctgatactt
     2881 gttattaaga aatgaataaa ataaactcac ttttttccaa taaaagctt
//
LOCUS       MUSCRRY01     676 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 1.
ACCESSION   M34164
KEYWORDS    complement receptor.
SEGMENT     1 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 676)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        414  +   531     complement receptor (Crry; liver) precursor,
                                 exon 1
    sigp        414      530     complement receptor (liver) signal peptide
                                 (put.)
    matp        531  +   531     complement receptor (liver)
    pep$        414  +   660     complement receptor (spleen) precursor, exon 1
    sigp        414      530     complement receptor (spleen) signal peptide
                                 (put.)
    matp        531  +   660     complement receptor (spleen)
    IVS         532  >   676     Crry intron A
    IVS         661  >   676     Crry intron A'
BASE COUNT      148 a    162 c    193 g    173 t
ORIGIN      
        1 atccgaattc atcataagga aataggttct tactgtatac tagacagggt atgcaactgt
       61 cagctcactg ttgcagatta gggttaggct ccacccttgc agatttttaa aaggagtaag
      121 gccgggctat atgccaaacc gagttcccat aatgccttgt tttctttgga gtcgaaggtt
      181 cctgcaagtg gaaaacttcc tggagctgac ctactaggta ttgaaccagt ttctgcattg
      241 ctgaatcaat ctcccaaggg taattccaca gaaatcccag gggcttggag taaacaagac
      301 cgcgcctagc ccagctagag gaagttttat tccggaaccc agcgccattt ctgggtggga
      361 ctgctttcta caccatttgc cgtaaaacgt tgtttgagaa cggtgtgagg ggaatggagg
      421 tctcttctcg gagttcagag cctctggatc cggtgtggct ccttgtagcc ttcggccggg
      481 gaggagtcaa gctagaagtt ttgctgctgt tcttgctgcc atttactttg ggtgagctgc
      541 ggggaggcct ggggaagcac ggacacacgg ttcaccggga acccgcggta aataggctct
      601 gcgcagactc caaacgctgg tctgggctgc ctgtgagtgc tcagcgcccc tttcccatgg
      661 gtgagcgtgg ggcgcc
//
LOCUS       MUSCRRY02     200 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 2.
ACCESSION   M34165
KEYWORDS    complement receptor.
SEGMENT     2 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 200)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +   190     complement receptor (Crry; liver) precursor,
                                 exon 2
    matp         11  +   190     complement receptor (liver)
    pep$    +    11  +   190     complement receptor (spleen) precursor, exon 2
    matp    +    11  +   190     complement receptor (spleen)
    IVS     <     1       10     Crry intron A
    IVS     <     1       10     Crry intron A'
    IVS         191  >   200     Crry intron B
BASE COUNT       60 a     51 c     35 g     54 t
ORIGIN      Undetermined number of base pairs after segment 1.
        1 cattcaacag gtcactgccc agccccatca cagcttcctt ctgccaaacc tataaatcta
       61 actgatgaat ccatgtttcc cattggaaca tatttgttgt atgaatgtct cccaggatat
      121 atcaagaggc agttctctat cacctgcaaa caagactcaa cctggacgag tgctgaagat
      181 aagtgtatac gtgagtaact
//
LOCUS       MUSCRRY03     120 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 3.
ACCESSION   M34166
KEYWORDS    complement receptor.
SEGMENT     3 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 120)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +   110     complement receptor (Crry; liver) precursor,
                                 exon 3
    matp    +    11  +   110     complement receptor (liver)
    pep$    +    11  +   110     complement receptor (spleen) precursor, exon 3
    matp    +    11  +   110     complement receptor (spleen)
    IVS     <     1       10     Crry intron B
    IVS         111  >   120     Crry intron C
BASE COUNT       36 a     21 c     25 g     38 t
ORIGIN      Undetermined number of base pairs after segment 2.
        1 tttttcatag gaaaacaatg taaaactcct tcagatcctg agaatggctt ggtacatgta
       61 cacacaggca ttcagtttgg atcccgtatt aattatactt gtaatcaagg gtgagttggc
//
LOCUS       MUSCRRY04     104 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 4.
ACCESSION   M34167
KEYWORDS    complement receptor.
SEGMENT     4 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 104)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +    96     complement receptor (Crry; liver) precursor,
                                 exon 4
    matp    +    11  +    96     complement receptor (liver)
    pep$    +    11  +    96     complement receptor (spleen) precursor, exon 4
    matp    +    11  +    96     complement receptor (spleen)
    IVS     <     1       10     Crry intron C
    IVS          97  >   104     Crry intron D
BASE COUNT       20 a     20 c     27 g     37 t
ORIGIN      Undetermined number of base pairs after segment 3.
        1 ctgtgtgtag ataccgcctc attggttcct cctctgctgt atgtgtcatc actgatcaaa
       61 gtgttgattg ggatactgag gcacctattt gtgagtgtaa gttg
//
LOCUS       MUSCRRY05     422 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 5.
ACCESSION   M34168
KEYWORDS    complement receptor.
SEGMENT     5 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 422)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +   412     complement receptor (Crry; liver) precursor,
                                 exon 5
    matp    +    11  +   412     complement receptor (liver)
    pep$    +    11  +   412     complement receptor (spleen) precursor, exon 5
    matp    +    11  +   412     complement receptor (spleen)
    IVS     <     1       10     Crry intron D
    IVS         413  >   422     Crry intron E
BASE COUNT      106 a    100 c    104 g    112 t
ORIGIN      Undetermined number of base pairs after segment 4.
        1 ctttgcccag ggattccttg tgagataccc ccaggcattc ccaatggaga tttcttcagt
       61 tcaaccagag aagactttca ttatggaatg gtggttacct accgctgcaa cactgatgcg
      121 agagggaagg cgctctttaa cctggtgggt gagccctcct tatactgtac cagcaacgat
      181 ggtgaaattg gagtctggag cggccctcct cctcagtgca ttgaactcaa caaatgtact
      241 cctcctccct atgttgaaaa tgcagtcatg ctgtctgaga acagaagctt gttttcctta
      301 agggatattg tggagtttag atgtcaccct ggctttatca tgaaaggagc cagcagtgtg
      361 cattgtcagt ccctaaacaa atgggagcca gagttaccaa gctgcttcaa gggtaagctc
      421 ga
//
LOCUS       MUSCRRY06     206 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 6.
ACCESSION   M34169
KEYWORDS    complement receptor.
SEGMENT     6 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 206)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +   196     complement receptor (Crry; liver) precursor,
                                 exon 6
    matp    +    11  +   196     complement receptor (liver)
    pep$    +    11  +   196     complement receptor (spleen) precursor, exon 6
    matp    +    11  +   196     complement receptor (spleen)
    IVS     <     1       10     Crry intron E
    IVS         197  >   206     Crry intron F
BASE COUNT       61 a     33 c     58 g     54 t
ORIGIN      Undetermined number of base pairs after segment 5.
        1 ctaattgcag gagtgatatg tcgtctccct caggagatga gtggattcca gaaggggttg
       61 ggaatgaaaa aagaatatta ttatggagag aatgtaacct tggaatgtga ggatgggtat
      121 actctagaag gcagttctca aagccagtgc cagtctgatg gcagctggaa tcctcttctg
      181 gccaaatgtg tatctcgtaa gtacaa
//
LOCUS       MUSCRRY07      44 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 7.
ACCESSION   M34170
KEYWORDS    complement receptor.
SEGMENT     7 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 44)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +    34     complement receptor (Crry; liver) precursor,
                                 exon 7
    matp    +    11  +    34     complement receptor (liver)
    pep$    +    11  +    34     complement receptor (spleen) precursor, exon 7
    matp    +    11  +    34     complement receptor (spleen)
    IVS     <     1       10     Crry intron F
    IVS          35  >    44     Crry intron G
BASE COUNT        8 a      8 c     10 g     18 t
ORIGIN      Undetermined number of base pairs after segment 6.
        1 tctctttcag gctcaatcag tggtctaatt gttggtaagt tctg
//
LOCUS       MUSCRRY08      96 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 8.
ACCESSION   M34171
KEYWORDS    complement receptor.
SEGMENT     8 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 96)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +    86     complement receptor (Crry; liver) precursor,
                                 exon 8
    matp    +    11  +    86     complement receptor (liver)
    pep$    +    11  +    86     complement receptor (spleen) precursor, exon 8
    matp    +    11  +    86     complement receptor (spleen)
    IVS     <     1       10     Crry intron G
    IVS          87  >    96     Crry intron H
BASE COUNT       27 a     10 c     19 g     40 t
ORIGIN      Undetermined number of base pairs after segment 7.
        1 tcctgtttag gaattttcat tgggataatc gtctttattt tagtcatcat tgttttcatt
       61 tggatgattc tgaagtataa aaaacggtga gtaaag
//
LOCUS       MUSCRRY09     125 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 9.
ACCESSION   M34172
KEYWORDS    complement receptor.
SEGMENT     9 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 125)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +   115     complement receptor (Crry; liver) precursor,
                                 exon 9
    matp    +    11  +   115     complement receptor (liver)
    pep$    +    11  +   115     complement receptor (spleen) precursor, exon 9
    matp    +    11  +   115     complement receptor (spleen)
    IVS     <     1       10     Crry intron H
    IVS         116  >   125     Crry intron I
BASE COUNT       46 a     24 c     25 g     30 t
ORIGIN      Undetermined number of base pairs after segment 8.
        1 taccaattag caataccaca gatgaaaagt ataaagaagt gggtattcat ttaaattata
       61 aagaagacag ctgtgtccgc cttcagtctc tgctcacaag tcaggagaac agcaggtaca
      121 tatgc
//
LOCUS       MUSCRRY10     128 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry) gene, exon 10.
ACCESSION   M34173
KEYWORDS    complement receptor.
SEGMENT     10 of 10
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 128)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11       56     complement receptor (Crry; liver) precursor,
                                 exon 10
    matp    +    11       53     complement receptor (liver)
    pep$    +    11       56     complement receptor (spleen) precursor, exon
                                 10
    matp    +    11       53     complement receptor (spleen)
    IVS     <     1       10     Crry intron I
BASE COUNT       41 a     30 c     20 g     37 t
ORIGIN      Undetermined number of base pairs after segment 9.
        1 tttgctgaag taccactagc ccagcacgga attcactcac tcaagaagtc tcctaaatag
       61 cagcaacgtg aaatgagaac atgctctgtc tgtatcactt ttaaaataaa ctgtttcctt
      121 ttaagatc
//
LOCUS       MUSCRRYPS    1272 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse complement receptor (Crry-ps) pseudogene DNA fragment.
ACCESSION   M34174
KEYWORDS    complement receptor; pseudogene.
SOURCE      Mouse (strain Balb/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1272)
  AUTHORS   Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H.
  TITLE     The murine complement receptor gene family: III. The genomic and
            transcriptional complexity of the Crry and Crry-ps genes
  JOURNAL   J. Immunol. 144, 1988-1996 (1990)
  STANDARD  simple staff_review
BASE COUNT      377 a    256 c    276 g    363 t
ORIGIN      
        1 tgcccagccc catcacagct tccttctgcc aaacctataa atctaactga tgaatccatg
       61 tttcccattg gaacatctgt gaaatatgaa tgtctcccag gatatatcaa gaggcagttc
      121 tctatcacct gcaaacaaga ctcaacctgg acgagtgctg aagataagtg tatacgaaaa
      181 caatgtaaaa ctcctttaga tcctcagaat ggcttggtac atgtacacac aggcattcag
      241 tttggatccc gtattaatta tacttgtaat aaaggatacc gcctcattgg ttcctcctct
      301 gctgtatgtg tcatcactga tcaaagtgtt gattgggata ctgaggcacc tatttgtgag
      361 tggattcctt gtgatatacc cccaggcatt cccaatggag atttcttcag ttcaactaga
      421 gaagactttc attatggaat ggtggttacc taccgctgca acactgatgc gagagggaag
      481 gcgctcttta acctggtggt tatactgtac cagcaacgat ggtgaaattg gagtctggag
      541 tggccctcct cctcagtgca ttggattcaa caaatgtact cctcctccct atgttgaaaa
      601 tgcagtcatg ctgtctgaga acagaagctt gttttcctta agggatattg tggagtttag
      661 atgtcaccct ggctttatca tgaaaggagc cagcagtgtg cattgtcagt ccctaaacaa
      721 atgggagcca gagttaccaa gctgcttcaa gggagtgata tgtcgtctcc ctcaggagat
      781 gagtggattc cagaaggggt tgggaatgaa aaaagaatat tattatggag agaatgtaac
      841 cttggaatgc gaggatgggt atactctaga aggcagttct caaagccagt gtcagtctga
      901 tggcagctgg aatcctcttc tggccaaaag tgtatcgcgc tcaatcagtg gtctaattgt
      961 tggaattttc attgggatga tcatctttat tttattcatc attgttttca tttggatgat
     1021 tctgaagtat aaaaaacgca ataccacaga tgaaaagtat aaagaagtgg gtattcattt
     1081 aaattataaa ggagacagct gtgtctgcct tcagtctctg ctcacaagtc aggagaacag
     1141 cactaccact agcccagcac agaattcact cgctcaagaa gtctcctaaa tagcagcaac
     1201 gtgaaatgag aacatgtctt tctgtatcat ttttaaaata aactatttct tttaagaaaa
     1261 aaaagaaaga aa
//
LOCUS       BSURGRRNB    7430 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  B.subtilis rrnB operon with 23S rRNA, 16SrRNA, 5S rRNA and tRNA
            gene cluster: Val-, Thr-, Lys-, Leu-cug-, Gly-ggc-, Leu-uua-, Arg-,
            Pro-, Ala-, Met-, Ile-, Ser-uca-, Met-f-, Asp-, Phe-, His-,
            Gly-gga-, Ile-, Asn-, Ser-agc- and Glu-tRNA.
ACCESSION   K00637 M10606 X00007
KEYWORDS    23S ribosomal RNA; 5S ribosomal RNA; ribosomal RNA; transfer RNA;
            transfer RNA-Ala; transfer RNA-Arg; transfer RNA-Asn;
            transfer RNA-Asp; transfer RNA-Glu; transfer RNA-Gly;
            transfer RNA-His; transfer RNA-Ile; transfer RNA-Leu;
            transfer RNA-Lys; transfer RNA-Met; transfer RNA-Phe;
            transfer RNA-Pro; transfer RNA-Ser; transfer RNA-Thr;
            transfer RNA-Val.
SOURCE      B.subtilis 168 DNA, library of Ferrari et al, clone pBC204 [1];
            clone pGS227 [2]; clone pGS332 [3].
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 4897 to 7430)
  AUTHORS   Green,C.J. and Vold,B.S.
  TITLE     Sequence analysis of a cluster of twenty-one tRNA genes in Bacillus
            subtilis
  JOURNAL   Nucleic Acids Res. 11, 5763-5774 (1983)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 1168)
  AUTHORS   Stewart,G.C. and Bott,K.
  TITLE     DNA sequence of the tandem ribosomal RNA promoter for B.subtilis
            operon rrnB
  JOURNAL   Nucleic Acids Res. 11, 6289-6300 (1983)
  STANDARD  simple staff_review
REFERENCE   3  (bases 1 to 7430; revises [1],[2])
  AUTHORS   Green,C.J., Stewart,G.C., Hollis,M.A., Vold,B.S. and Bott,K.F.
  TITLE     Nucleotide sequence of the Bacillus subtilis ribosomal RNA operon,
            rrnB
  JOURNAL   Gene 37, 261-266 (1985)
  STANDARD  simple staff_review
REFERENCE   4  (sites for [1],[2] and [3])
  AUTHORS   Su,S.L. and Dubnau,D.
  TITLE     Binding of Bacillus subtilis ermC' methyltransferase to 23S rRNA
  JOURNAL   Biochemistry 29, 6033-6042 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and sequence in computer readable form for [1],[2],[3]
            kindly provided by K.F.Bott, 26-DEC-1985.
            
            The RNAs, encoded by the sequence presented below, are probably
            transcribed as one polycistronic unit, including the tRNA region,
            because there are no obvious terminator stem loop structures until
            after the end of the tRNA region at positions 7245-7272 and
            7392-7413 [3].
            
            [1] notes that though the Ile-tRNA-nau sequence has the methionine
            anticodon "cau", it is highly homologous to Ile-tRNA-gau; the "c"
            in the wobble position may be post-transcriptionally modified to
            recognize "aua" codons.
            
            Promoter P1 is located at positions 184-189 (-35 region) and
            207-213 (-10 region), and P2 at 276-281 (-35 region) and 299-304
            (-10 region).  A third promoter region could be at positions
            5517-5522.  A potential stem-loop structure, necessary for
            processing of the mature 16S rRNA, is found at positions 327-360
            [2].
FEATURES       from  to/span     description
    rRNA        485     2034     16S rRNA
    rRNA       2203     5129     23S rRNA
    rRNA       5185     5300     5S rRNA
    tRNA       5322     5397     Val-tRNA
    tRNA       5430     5504     Thr-tRNA
    tRNA       5543     5618     Lys-tRNA
    tRNA       5629     5715     Leu-tRNA-cug
    tRNA       5721     5795     Gly-tRNA-ggc
    tRNA       5810     5895     Leu-tRNA-uua
    tRNA       5905     5981     Arg-tRNA
    tRNA       5997     6073     Pro-tRNA
    tRNA       6079     6151     Ala-tRNA
    tRNA       6172     6248     Met-tRNA
    tRNA       6251     6327     Ile-tRNA-nau
    tRNA       6334     6425     Ser-tRNA-uca
    tRNA       6443     6519     Met-tRNA-f
    tRNA       6531     6607     Asp-tRNA
    tRNA       6620     6695     Phe-tRNA
    tRNA       6712     6788     His-tRNA
    tRNA       6799     6872     Gly-tRNA-gga
    tRNA       6888     6964     Ile-tRNA-gau
    tRNA       6975     7049     Asn-tRNA
    tRNA       7053     7143     Ser-tRNA-agc
    tRNA       7169     7240     Glu-tRNA
    revision    504      504     c in [3]; t in [2]
    revision    571      573     tcc in [3]; tc in [2]
    revision   5029     5031     gga in [3]; ga in [1]
    anticdn    5355     5357     Val-tRNA anticodon tac
    anticdn    5463     5465     Thr-tRNA anticodon tgt
    anticdn    5576     5578     Lys-tRNA anticodon ttt
    anticdn    5663     5665     Leu-tRNA-cug anticodon cag
    anticdn    5753     5755     Gly-tRNA-ggc anticodon gcc
    anticdn    5844     5846     Leu-tRNA-uua anticodon taa
    anticdn    5939     5941     Arg-tRNA anticodon acg
    anticdn    6031     6033     Pro-tRNA anticodon tgg
    anticdn    6112     6114     Ala-tRNA anticodon tgc
    revision   6165     6167     act in [3]; at in [1]
    anticdn    6206     6208     Met-tRNA anticodon cat
    anticdn    6285     6287     Ile-tRNA-nau anticodon cat
    anticdn    6370     6372     Ser-tRNA-uca anticodon tga
    anticdn    6477     6479     Met-tRNA-f anticodon cat
    anticdn    6565     6567     Asp-tRNA anticodon gtc
    anticdn    6653     6655     Phe-tRNA anticodon gaa
    anticdn    6746     6748     His-tRNA anticodon gtg
    anticdn    6831     6833     Gly-tRNA-gga anticodon tcc
    anticdn    6922     6924     Ile-tRNA-gau anticodon gat
    anticdn    7007     7009     Asn-tRNA anticodon gtt
    anticdn    7087     7089     Ser-tRNA-agc anticodon gct
    anticdn    7202     7204     Glu-tRNA anticodon ttc
BASE COUNT     1906 a   1694 c   2125 g   1705 t
ORIGIN      65 bp upstream of MboI site; 280 degrees on the B.subtilis map.
        1 ctttaatgct ccccttgtgg tcatcagtat ttagttcgtt tcacatacaa gaaaacgaaa
       61 aaaacaacaa gatcacatga ctgatgtata tgttctttta agaaacttat atgatacaca
      121 cgctttagaa atcatggcga ggattatagt ttatttgttt tatagatttt ttttaaaaaa
      181 ctattgcaat aaataaatac aggtgttata ttattaaacg tcgctgatgc acagcggaca
      241 caactagatg cttcaaaaca acttgaaaaa agttgttgac aaaaaagaag ctgaatgtta
      301 tattagtaaa gctgcttcat tgagaagtaa cgaaatgatc tttgaaaact aaacaagaca
      361 aaacgtacct gttaattcag tttttaaaaa tcgcactgcg atgtgcgtat catcaaacag
      421 ggcctgcacg acgcaggtca cacaggtgtc gccgcaggat gcggtgaact taacctgtga
      481 tccatttatc ggagagtttg atcctggctc aggacgaacg ctggcggcgt gcctaataca
      541 tgcaagtcga gcggacaggt gggagcttgc tccgatgtta gcggcggacg ggtgagtaac
      601 acgtgggtaa cctgcctgta agactgggat aactccggga aaccggggct aataccggat
      661 ggttgtttga accgcatggt tcaaacataa aaggtggctt cggctaccac ttacagatgg
      721 acccgcggcg cattagctag ttggtgaggt aacggctcac caaggcaacg atgcgtagcc
      781 gacctgagag ggtgatcggc cacactggga ctgagacacg gcccagactc ctacgggagg
      841 cagcagtagg gaatcttccg caatggacga aagtctgacg gagcaacgcc gcgtgagtga
      901 tgaaggtttt cggatcgtaa agctctgttg ttagggaaga acaagtaccg ttcgaacagg
      961 gcggtacctt gacggtacct aaccagaaag ccacggctaa ctacgtgcca gcagccgcgg
     1021 taatacgtag gtggcaagcg ttttccggaa ttattgggcg taaagggctc gcaggcggtt
     1081 tcttaagtct gatgtgaaag cccccggctc aaccggggag ggtcattgga aactggggaa
     1141 cttgagtgca gaagaggaga gtggaattcc acgttgtagc ggtgaaatgc gtagagatgt
     1201 ggaggaacac cagtggcgaa ggcgactctc tggtctgtaa ctgacgctga ggagcgaaag
     1261 cgtggggagc gaacaggatt agataccctg gtagtccacg ccgtaaacga tgagtgctaa
     1321 gtgttagggg gtttccgccc cttagtgctg cagctaacgc attgagcact ccgcctgggg
     1381 agtacggtcg caagactgaa actcaaagga attgacgggg gcccgcacaa gcggtggagc
     1441 atgtggttta attcgaagca acgcgaagaa ccttactagg tcttgacatc ctctgacaat
     1501 cctagagata ggacgtcccc ttcggggcag agtgacaggt ggtgcatggt tgtcgtcagc
     1561 tcgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccttgat cttagttgcc
     1621 agcattcagt tgggcactct aaggtgactg ccggtgacaa accggaggaa ggtggggatg
     1681 acgtcaaatc atcatgcccc ttatgacttg ggctacacac gtgctacaat ggacagaaca
     1741 aagggcagcg aaccgcgagg ttaagccaat cccacaaatc tgttctcagt tcggatcgca
     1801 gtctgcaact cgactgcgtg aagctggaat cgctagtaat cgcggatcag catgccgcgg
     1861 tgaatacgtt cccgggcctt gtacacaccg cccgtcacac cacgagagtt tgtaacaccc
     1921 gaagtcggtg aggtaacctt ttaggagcca gccgccgaag gtgggacaga tgattggggt
     1981 gaagtcgtaa caaggtagcc gtatcggaag gtgcggctgg atcacctcct ttctaaggat
     2041 attatacgga atataagacc caaggtctta taaacagaac gttccctgtc ttgtttagtt
     2101 ttgaaggatc attccttcga aacgtgttct ttgaaaacta gataacagta gacatcacat
     2161 tcaattagta acacaagata tcacatagtg attcttttta acggttaagt tagaaagggc
     2221 gcacggtgga tgccttggca ctaggagccg atgaaggacg ggacgaacac cgatatgctt
     2281 cggggagctg taagcaagct ttgatccgga gatttccgaa tggggaaacc caccactcgt
     2341 aatggagtgg tatccatatc tgaattcata ggatatgaga aggcagaccc ggggaactga
     2401 aacatctaag tacccggaga agagaaagca aatgcgattc cctgagtagc ggcgacgaac
     2461 acgggatcag cccaaaccaa gaggcttgcc tctgtggttg taggacactc tgtacggagt
     2521 tacaaaagaa cgaggtagat gaagaggtct ggaaagggcc cgccatagga ggtaacagcc
     2581 ctgtagtcaa aacttcgttc tctcctgagt ggatcctgag tacggcggaa cacgtgaaat
     2641 tccgtcggaa tccgggagga ccatctccca aggctaaata ctccctagtg accgatagtg
     2701 aaccagtacc gtgagggaaa ggtgaaaagc accccggaag gggagtgaaa gagatcctga
     2761 aaccgtgtgc ctacaagtag tcagagcccg ttaacggtga tggcgtgcct tttgtagaat
     2821 gaaccggcga gttacgatcc cgtgcaaggt taagcagaag atgcggagcc gcagcgaaag
     2881 cgagtctgaa tagggcgcat gagtacgtgg tcgtagaccc gaaaccaggt gatctaccca
     2941 tgtccagggt gaagttcagg taacactgaa tggaggcccg aacccacgca cgttgaaaag
     3001 tgcggggatg aggtgtgggt aggggtgaaa tgccaatcga acctggagat agctggttct
     3061 ctccgaaata gctttagggc tagcctcaag gtaagagtct tggaggtaga gcactgattg
     3121 gactaggggc cctcaccggg ttaccgaatt cagtcaaact ccgaatgcca atgacttatc
     3181 cttgggagtc agactgcgag tgataagatc cgtagtcgaa agggaaacag cccagaccgc
     3241 cagctaaggt cccaaagtat acgttaagtg gaaaaggatg tggagttgct tagacaacca
     3301 ggatgttggc ttagaagcag ccaccattta aagagtgcgt aatagctcac tggtcgagtg
     3361 actctgcgcc gaaaatgtac cggggctaaa cgtatcaccg aagctgcgga ctgttcttcg
     3421 aacagtggta ggagagcgtt ctaagggctg tgaagccaga ccggaaggac tggtggacgg
     3481 cttagaagtg agaatgccgg tatgagtagc gaaaagaggg gtgagaatcc ctccaccgaa
     3541 tgcctaaggg ttcctgagga aggctcgtcc gctcagggtt agtcgggacc taagccgagg
     3601 ccgaaaggcg taggcgatgg acaacaggtt gatattcctg taccacctcc tcaccatttg
     3661 agcaatgggg ggtcgcagga ggatagggta agcgcggtat tggatatccg cgtccaagca
     3721 gttaggctgg gaaataggca aatccgtttc ccataaggct gagctgtgat ggcgagcgaa
     3781 atatagtagc gaagttcctg attccacact gccaagaaaa gcctctagcg aggtgagagg
     3841 tgcccgtacc gcaaaccgtc acaggtaggc gaggagagaa tcctaaggtg atcgagagaa
     3901 ctctcgttaa ggaactcggc aaaatgaccc cgtaacttcg ggagaagggg tgctctgtta
     3961 gggtgcaagc ccgagagagc cgcagtgaat aggcccaggc gactgtttag caaaaacaca
     4021 ggtctctgcg aagccgtaag gcgaagtata ggggctgacg cctgcccggt gctggaaggt
     4081 taagaggagc gcttagcgta agcgaaggtg cgaattgaag ccccagtaaa cggcggccgt
     4141 aactataacg gtcctaaggt agcgaaattc cttgtcgggt aagttccgac ccgcacgaaa
     4201 ggcgcaacga tctgggcgct gtctcaacga gagactcggt gaaattatag tacctgtgaa
     4261 gatgcaggtt acccgcgaca ggacggaaag accccgtgga gctttactgc agcctgatat
     4321 tgaatgttgg tacagcttgt acaggatagg taggagcctt ggaaaccgga gcgccagctt
     4381 cggtggaggc atcggtggga tactaccctg gctgtattga ccttctaacc ccccgccctt
     4441 atcgggcggg gagacagtgt caggtgggca gtttgactgg ggcggtcgcc tcctaaaagg
     4501 taacggaggc gcccaaaggt tccctcagaa tggttggaaa tcattcgcag agtgtaaagg
     4561 cacaagggag cttgactgcg agacctacaa gtcgagcagg gacgaaagtc gggcttagtg
     4621 atccggtggt tccgcatgga agggccatcg ctcaacggat aaaagctacc ccggggataa
     4681 caggcttatc tcccccaaga gctccacatc gacggggagg tttggcacct cgatgtcggc
     4741 tcatcgcatc ctggggctgt agtcggtccc aagggttggg ctgttcgccc attaaagcgg
     4801 tacgcgagct gggttcagaa cgtcgtgaga cagttcggtc cctatccgtc gcgggcgctg
     4861 gaaatttgag aggagctgtc cttagtacga gaggaccggg atggacgcac cgctggtgta
     4921 ccagttgttc tgccaagggc atcgctgggt agctatgtgc ggacgggata agtgctgaaa
     4981 gcatctaagc atgaagcccc cctcaagatg agatttccca ttccgcaagg aagtaagatc
     5041 cctgaaagat gatcaggttg ataggtctga ggtggaagtg tggcaacaca tggagctgac
     5101 agatactaat cgatcgagga cttaaccata tttttgaatg atgtcacacc tgttatctag
     5161 ttttgagaga acactctcaa tttgtttggt ggcgatagcg aagaggtcac acccgttccc
     5221 ataccgaaca cggaagttaa gctcttcagc gccgatggta gtcgggggtt tccccctgtg
     5281 agagtaggac gccgccaagc aattgcacgt tagtgcaata tggaggatta gctcagctgg
     5341 gagagcatct gccttacaag cagagggtcg gcggttcgag cccgtcatcc tccaccattt
     5401 ttcattatac atatcggttt tacatatatg ccggtgtagc tcaattggta gagcaactga
     5461 cttgtaatca gtaggttggg ggttcaagtc ctcttgccgg caccactttt atatgatata
     5521 atattcaagt ctattgtaag aagagccatt agctcagttg gtagagcatc tgacttttaa
     5581 tcagagggtc gaaggttcga gtccttcatg gctcaccatt tacatgttgc ggatgtggcg
     5641 gaattggcag acgcgctaga atcaggctct agtgtcttta cagacgtggg ggttcaagtc
     5701 ccttcatccg caccatttct gcggaagtag ttcagtggta gaacaccacc ttgccaaggt
     5761 gggggtcgcg ggttcgaatc ccgtcttccg ctccaactat accatccacg ccggggtggt
     5821 ggaattggca gacacacagg acttaaaatc ctgcggtagg tgactaccgt gccggttcaa
     5881 gtccggccct cggcattaag ttttgcgccc gtagctcaat tggatagagc gtttgactac
     5941 ggatcaaaag gttaggggtt cgactcctct cgggcgcgcc atgatctata tgaaatcggg
     6001 aagtagctca gcttggtaga gcacatggtt tgggaccatg gggtcgcagg ttcgaatcct
     6061 gtcttcccga ccattcttgg ggccttagct cagctgggag agcgcctgct ttgcacgcag
     6121 gaggtcagcg gttcgatccc gctaggctcc acttgatttc aaaaactatt tggcggtgta
     6181 gctcagctgg ctagagcgta cggttcatac ccgtgaggtc gggggttcga tcccctccgc
     6241 cgctaccaat ggacctttag ctcagttggt tagagcagac ggctcataac cgtccggtcg
     6301 taggttcgag tcctacaagg tccaccacta tacggaggaa tacccaagtc tggctgaagg
     6361 gatcggtctt gaaaaccgac agggtgtcaa agcccgcggg ggttcgaatc cctcttcctc
     6421 cgccatacat attcctaatc atcgcggggt ggagcagttc ggtagctcgt cgggctcata
     6481 acccgaaggt cgcaggttca aatcctgccc ccgcaaccaa attttaaaat ggtccggtag
     6541 ttcagttggt tagaatgcct gcctgtcacg caggaggtcg cgggttcgag tcccgtccgg
     6601 accgccattt aaatacttag gctcggtagc tcagttggta gagcaacgga ctgaaaatcc
     6661 gtgtgtcggc ggttcgattc cgtcccgagc caccatttat caatatgctt tggcggttgt
     6721 ggcgaagtgg ttaacgcacc agattgtggc tctggcattc gtgggttcga ttcccatcaa
     6781 tcgccccaaa taaaaattgc gggtgtagtt tagtggtaaa acctcagcct tccaagctga
     6841 tgtcgtgggt tcgattccca tcacccgctc catttctata tcgtcatggg cctgtagctc
     6901 agctggttag agcgcacgcc tgataagcgt gaggtcgatg gttcgagtcc attcaggccc
     6961 accatgactt ttgttccaca gtagctcagt ggtagagcta tcggctgtta accgatcggt
     7021 cgcaggttcg aatcctgcct gtggagccaa atggagaagt actcaagtgg ctgaagaggc
     7081 gcccctgcta agggtgtagg tcgtgtaagc ggcgcgaggg ttcaaatccc tccttctccg
     7141 ccatatgatt acagatatca taattatcgg cccgttggtc aagcggttaa gacaccgccc
     7201 tttcacggcg gtaacacggg ttcgaatccc gtacgggtca tcccagaagc cttgcatatc
     7261 ctgcaaggtt tttttgtttt tataaatcat gtatatgtct tagattttgt tctttatttt
     7321 aaaaacagac tacaaaaatc tccatatatt tcgtttttct tcagaaaatg aagttaattg
     7381 tctataagta taagccgttt cagggaaagg gctttttttt atttcttcga
//
LOCUS       ECOAROCX     1690 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  E.coli chorismate synthase (aroC) gene, complete cds.
ACCESSION   M33021
KEYWORDS    aroC gene; chorismate synthase.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1690)
  AUTHORS   White,P.J., Millar,G. and Coggins,J.R.
  TITLE     The overexpression, purification and complete amino acid sequence
            of chorismate synthase from Escherichia coli K12 and its comparison
            with the enzyme from Neurospora crassa
  JOURNAL   Biochem. J. 251, 313-322 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        492     1562     chorismate synthase (EC 4.6.1.4)
BASE COUNT      403 a    467 c    466 g    354 t
ORIGIN      
        1 gtcgacgcgg tggatatctc tccagacgcg ctggcggttg ctgaacagaa catcgaagaa
       61 cacggtctga tccacaacgt cattccgatt cgttccgatc tgttccgcga cttgccgaaa
      121 gtgcagtacg acctgattgt cactaacccg ccgtatgtcg atgcgaagat atgtccgacc
      181 tgccaaacaa taccgccacg agccggaact gggcctggca tctggcactg acggcctgaa
      241 actgacgcgt cgcattctcg gtaacgcggc agattacctt gctgatgatg gcgtgttgat
      301 ttgtgaagtc ggcaacagca tggtacatct tatggaacaa tatccggatg ttccgttcac
      361 ctggctggag tttgataacg gcggcgatgg tgtgtttatg ctcaccaaag agcagcttat
      421 tgccgcacga gaacatttcg cgatttataa agattaagta aacacgcaaa cacaacaata
      481 acggagccgt gatggctgga aacacaattg gacaactctt tcgcgtaacc accttcggcg
      541 aatcgcacgg gctggcgctc ggctccatcg tcgatggtgt tccgccagcc attccgctga
      601 cggaagcgga cctgcaacat gacctcgacc gtcgtcgccc tgggacatcg cgctatacca
      661 cccagcgccg cgagccggat caggtcaaaa ttctctccgg tgtttttgaa ggcgttacta
      721 ccggcaccag cattggcttg ttgatcgaaa acactgacca gcgctctcag gattacagtg
      781 cgattaagga cgttttccgt ccaggccatg ccgattacac ctacgaacaa aaatacggtc
      841 tgcgcgatta tcgcggcggt ggacgttctt ccgcccgcga aaccgccatg cgcgtggcgg
      901 caggagctat tgccaaaaaa tatctcgccg agaaatttgg tattgaaatc cgtggctgcc
      961 tgacccagat gggcgacatt ccgctggata tcaaagactg gtcgcaggtc gagcaaaatc
     1021 cgtttttttg cccggacccc gacaaaatcg acgcgttaga cgagttgatg cgtgcgctga
     1081 aaaaagaggg cgactccatc ggcgctaaag tcaccgttgt tgccagtggc gttcctgccg
     1141 gacttggcga gccggtcttt gaccgcctgg atgctgacat cgcccatgcg ctgatgagca
     1201 tcaacgcggt gaaaggcgtg gaaattggcg acggctttga cgtggtggcg ctgcgcggca
     1261 gccagaaccc cgatgaaatc accaaagacg gtttccagag caaccatgcg ggcggcattc
     1321 tcggcggtat cagcagcggg cagcaaatca ttgcccatat ggcgctgaaa ccgacctcca
     1381 gcattaccgt gccgggtcgt accattaacc gctttggcca agaagttgag atgatcacca
     1441 aaggccgtca cgatccctgt gtcgggatcc gcgcagtgcc gatcgcagaa gcgaatgctg
     1501 gcgatcgttt taatggatca cctgttacgg caacgggcgc aaaatgccga tgtgaagact
     1561 gatattccac gctggtaaaa aatgaataaa accgcgattg cgctgctggc tctgcttgcc
     1621 agtagcgcca gcctggcagc gacggcgtgg caaaaaataa cccaacctgt gccgggtagc
     1681 gccaaatcga
//
LOCUS       PFAMSA2       819 bp ds-DNA             INV       10-JUL-1990
DEFINITION  P.falciparum 45 kD merozoite surface antigen (MSA 2) gene,
            complete cds.
ACCESSION   M28891
KEYWORDS    integral membrane protein; surface antigen.
SOURCE      P.falciparum DNA, clone 3D7.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 819)
  AUTHORS   Smythe,J.A., Peterson,M.G., Coppel,R.L., Saul,A.J., Kemp,D.J. and
            Anders,R.F.
  TITLE     Structural diversity in the 45-kilodalton merozoite surface antigen
            of Plasmodium falciparum
  JOURNAL   Mol. Biochem. Parasitol. 39, 227-234 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            provided by J.A. Smythe, 06-OCT-1989.
FEATURES       from  to/span     description
    pept          1      819     45 kD merozoite surface antigen precursor
    sigp          1       60     45 kD merozoite surface antigen signal peptide
    matp         61      819     45,000 merozoite surface antigen
    rpt         157      228     12 base repeat
    rpt         301      321     9 base repeat
BASE COUNT      304 a    157 c    143 g    215 t
ORIGIN      
        1 atgaaggtaa ttaaaacatt gtctattata aatttcttta tttttgttac ctttaatatt
       61 aaaaatgaaa gtaaatatag caacacattc ataaacaatg cttataatat gagtataagg
      121 agaagtatgg cagaaagtaa gccttctact ggtgctggtg gtactgctgg tggtagtgct
      181 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg ttctggtgat
      241 ggtaatggtg cagatgctga gggaagttca agtactcccg ctactaccac aactaccaaa
      301 actaccacaa ctaccacaac tactaatgat gcagaagcat ctaccagtac ctcttcagaa
      361 aatccaaatc ataaaaatgc cgaaacaaat ccaaaaggta aaggagaagt tcaagaacca
      421 aatcaagcaa ataaagaaac tcaaaataac tcaaatgttc aacaagactc tcaaactaaa
      481 tcaaatgttc cacccactca agatgcagac actaaaagtc ctactgcaca acctgaacaa
      541 gctgaaaatt ctgctccaac agccgaacaa actgaatccc ccgaattaca atctgcacca
      601 gagaataaag gtacaggaca acatggacat atgcatggtt ctagaaataa tcatccacaa
      661 aatacttctg atagtcaaaa agaatgtacc gatggtaaca aagaaaactg tggagcagca
      721 acatccctct taaataactc tagtaatatt gcttcaataa ataaatttgt tgttttaatt
      781 tcagcaacac ttgttttatc ttttgccata ttcatataa
//
LOCUS       PFAMSA2X      864 bp ds-DNA             INV       10-JUL-1990
DEFINITION  P.falciparum 45,000 merozoite surface antigen (MSA2) gene,
            complete cds.
ACCESSION   M28892
KEYWORDS    integral membrane protein; surface antigen.
SOURCE      P.falciparum (isolate Indochina 1) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 864)
  AUTHORS   Smythe,J.A., Peterson,M.G., Coppel,R.L., Saul,A.J., Kemp,D.J. and
            Anders,R.F.
  TITLE     Structural diversity in the 45-kilodalton merozoite surface antigen
            of Plasmodium falciparum
  JOURNAL   Mol. Biochem. Parasitol. 39, 227-234 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            provided by J.A. Smythe, 06-OCT-1989.
FEATURES       from  to/span     description
    pept          1      864     45 kD merozoite surface antigen precursor
    sigp          1       60     45 kD merozoite surface antigen signal peptide
    matp         61      864     45 kD merozoite surface antigen
    rpt         169      312     12 base repeat sequence
    rpt         379      397     9 base repeat sequence
BASE COUNT      288 a    157 c    179 g    240 t
ORIGIN      
        1 atgaaggtaa ttaaaacatt gtctattata aatttcttta tttttgttac ctttaatatt
       61 aaaaatgaaa gtaaatatag caacacattc ataaacaatg cttataatat gagtataagg
      121 agaagtatga cagaaagtaa tcctcctact ggtgctagtg gtagtgctgg tggtagtgct
      181 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg tggtagtgct
      241 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg tggtagtgct
      301 ggtggtagtg ctggttctgg tgatggtaat ggtgctaatc ctggtgcaga tgctgagaga
      361 agtccaagta ctcccgctac taccacaact accacaacta ctaatgatgc agaagcatct
      421 accagtacct cttcagaaaa tccaaatcat aataatgccg aaacaaatca agcaaataaa
      481 gaaactcaaa ataactcaaa cgttcaacaa gactctcaaa ctaaatcaaa tgttccaccc
      541 actcaagatg cagacactag aagtcctact gcacaacctg aacaagctga aaattctgct
      601 ccaacagccg aacaaactga atcccccgaa ttacaatctg caccagagaa taaaggtaca
      661 ggacaacatg gacatatgca tggttctaga aataatcatc cacaaaatac ttctgatagt
      721 caaaaagaat gtaccgatgg taacaaagaa aactgtggag cagcaacatc cctcttaaat
      781 aactctagta atattgcttc aataaataaa tttgttgttt taatttcagc aacacttgtt
      841 ttatcttttg ccatattcat ataa
//
LOCUS       XELRASX      1143 bp ss-mRNA            VRT       10-JUL-1990
DEFINITION  X.laevis ras protein mRNA, complete cds.
ACCESSION   M34657
KEYWORDS    ras protein.
SOURCE      X.laevis defolliculated oocyte, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1143)
  AUTHORS   Andeol,Y., Gusse,M. and Mechali,M.
  TITLE     Characterization and expression of a Xenopus ras during oogenesis
            and development
  JOURNAL   Dev. Biol. 139, 24-34 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        196      756     ras protein
    mRNA    <     1     1143     ras protein mRNA
BASE COUNT      346 a    275 c    279 g    243 t
ORIGIN      
        1 gaattcgcca gtgttacaga atgggagttc tgaggcgctg tgactaatcc cccccacccc
       61 cgcatattgg ggaaatccac cggcgggcag aaagccagag ggagaactaa ggggggccaa
      121 accaaaggaa aacgcaggag ccaaagcctc cagaaacaca gggatccgtg acgagcccga
      181 gtcggtgctg gtgaaatgac ggagtacaaa ctggtggtgg ttggtgctgg aggcgtgggg
      241 aagagcgcac tcacaatcca gctcattcag aaccattttg tggacgagta tgatcctact
      301 attgaggact cgtataggaa gcaggtggtg atagacgggg agacctgcct cctagatatc
      361 ctggacactg cggggcaaga ggaatacagc gctatgaggg atcagtacat gcgcacggga
      421 gaaggctttc tctgtgtctt tgctattaac aacacaaagt ccttcgagga cgtccatcat
      481 tacagggaac agattaacag agttaaagat tccgatgacg ttcccatggt gttagttggt
      541 aacaaatgcg acctcccatc ccggactgtg gacacaaagc aagcgcagga actggcaaag
      601 agctatggta ttccttttat agagacctct gccaaaacta gacagggagt cgaagacgcc
      661 ttctataccc tagtccgtga aatccgcaag cacaaggaga agatcagcaa cgggaaaaag
      721 aaaaagtcct ccaaaaggaa gtgtgtcgtt ctttaacgtg ccaacctgcc cccccctgcc
      781 atcctcgtgg atcagagaaa accgtgccat cacacacctg aagtcaaaga aaaaaaaagt
      841 gtggactttt gtcgttgctg tggaaaccat tgaattgcca tgaaatttaa aaaaaaaacc
      901 aaaacattga ccacttattt taacacaacc gataaatggc acaggctgtg ccccaatcgt
      961 gtatatattc ttcatgaaca aactgtttta tcagaaagac agatgcaata gccccttctt
     1021 tttaccccaa ttaaccctcc tcctggtttc tatttctccc tggaaaagac gttggtcgac
     1081 cagaggggaa gaacctgccc aggcctttct tacagcccca tttgaataaa gattgaaaca
     1141 ctc
//
LOCUS       HUMSPTB      6765 bp ss-mRNA            PRI       10-JUL-1990
DEFINITION  Human beta-spectrin (SPTB) mRNA, complete cds.
ACCESSION   J05500
KEYWORDS    beta-spectrin; spectrin.
SOURCE      Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and
            V252.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 6765)
  AUTHORS   Winkelmann,J.C., Chang,J.G., Tse,W.T., Marchesi,V.T. and
            Forget,B.G.
  TITLE     Full length sequence of the cDNA for human erythroid beta-spectrin
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Winkelmann, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         96     6509     beta-spectrin /nomgen="SPTB" /map="14"
                                 /hgml_locus_uid="LS0033T"
    mRNA    <     1     6765     SPTB mRNA
    signal     6716     6722     poly-A signal
BASE COUNT     1627 a   1822 c   2144 g   1172 t
ORIGIN      Chromosome 14q23-q24.
        1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag
       61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa
      121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg
      181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg
      241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg
      301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca
      361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc
      421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac
      481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc
      541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag
      601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag
      661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta
      721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca
      781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc
      841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct
      901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc
      961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg
     1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca
     1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca
     1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca
     1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag
     1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg
     1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc
     1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata
     1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga
     1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg
     1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc
     1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg
     1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg
     1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga
     1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg
     1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc
     1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg
     1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg
     2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc
     2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc
     2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt
     2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga
     2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc
     2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg
     2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc
     2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat
     2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgtcaggc ctgcgggagc
     2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg
     2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga
     2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca
     2761 ggttcgacat cctggaccag gagatgaaga ccttcatgac tcagattgat ggtgtgaacc
     2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc
     2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg
     2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt
     3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag
     3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg
     3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga
     3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat
     3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg
     3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca
     3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg
     3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc
     3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg
     3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg
     3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat
     3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga
     3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg
     3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg
     3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc
     3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct
     3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc
     4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc
     4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc
     4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa
     4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg
     4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcgatcagac gacccgggca
     4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga
     4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg
     4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa
     4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg
     4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta
     4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg
     4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg
     4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc
     4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt
     4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct
     4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc
     4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg
     5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg
     5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca
     5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa
     5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc
     5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca
     5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg
     5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc
     5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga
     5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca
     5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg
     5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg
     5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc
     5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct
     5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc
     5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca
     5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc
     6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt
     6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc
     6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg
     6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca
     6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg
     6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga
     6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg
     6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac
     6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg
     6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg
     6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag
     6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa
     6721 attttattgt ataaatatat caccatttac atgaggggaa acact
//
LOCUS       STYEUTBC     2526 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes,
            complete cds.
ACCESSION   J05518
KEYWORDS    ethanolamine ammonia-lyase.
SOURCE      S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2526)
  AUTHORS   Faust,L.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M.
  TITLE     Cloning, sequencing and expression of the genes encoding the
            alcohol-dependent ethanolamine ammonia-lyase of Salmonella
            typhimurium
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.M.Babior, 08-MAY-1990.
FEATURES       from  to/span     description
    pept        141     1499     ethanolamine ammonia-lyase (eutB)
    pept       1518     2378     ethanolamine ammonia-lyase (eutC)
    binding     130      133     ribosome binding site
    binding    1507     1510     ribosome binding site
BASE COUNT      563 a    687 c    779 g    497 t
ORIGIN      
        1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga
       61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt
      121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt
      181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg
      241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa
      301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc
      361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca
      421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc
      481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc
      541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc
      601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg
      661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc
      721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga
      781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc
      841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct
      901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt
      961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct
     1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg
     1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc
     1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga
     1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc
     1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc
     1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg
     1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg
     1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat
     1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg
     1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc
     1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg
     1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc
     1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg
     1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg
     1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc
     1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg
     1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga
     2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact
     2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca
     2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg
     2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc
     2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg
     2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga
     2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg
     2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt
     2521 gcctcc
//
LOCUS       XELPCNA      1018 bp ss-mRNA            VRT       10-JUL-1990
DEFINITION  X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete
            cds.
ACCESSION   M34080
KEYWORDS    nuclear protein; proliferating cell nuclear antigen.
SOURCE      X.laevis oocyte, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1018)
  AUTHORS   Leibovici,M., Gusse,M., Bravo,R. and Mechali,M.
  TITLE     Characterization and developmental expression of Xenopus
            proliferating cell nuclear antigen (PCNA)
  JOURNAL   Dev. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Leibovici, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         28      813     proliferating cell nuclear antigen (PCNA)
    mRNA    <     1     1018     PCNA mRNA
BASE COUNT      284 a    223 c    237 g    274 t
ORIGIN      
        1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc
       61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca
      121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact
      181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag
      241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg
      301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt
      361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa
      421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc
      481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca
      541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa
      601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat
      661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca
      721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg
      781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc
      841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa
      901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa
      961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac
//
LOCUS       YSCAAC2A     1333 bp ds-DNA             PLN       10-JUL-1990
DEFINITION  S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete
            cds.
ACCESSION   M34076 J05542
KEYWORDS    ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE      S.cerevisiae (strain W303-1B) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1333)
  AUTHORS   Kolarov,J., Kolarova,N. and Nelson,N.
  TITLE     A third ADP/ATP-translocator in yeast
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Nelson, 08-MAY-1990.
FEATURES       from  to/span     description
    pept        235     1158     ADP/ATP-translocator protein (AAC2)
BASE COUNT      388 a    209 c    301 g    435 t
ORIGIN      
        1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc
       61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg
      121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa
      181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt
      241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt
      301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat
      361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt
      421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat
      481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg
      541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct
      601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc
      661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg
      721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc
      781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct
      841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg
      901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga
      961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa
     1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg
     1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc
     1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc
     1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag
     1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc
     1321 agccactgtt gtt
//
LOCUS       YSCAAC3      1308 bp ds-DNA             PLN       10-JUL-1990
DEFINITION  S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete
            cds.
ACCESSION   M34075 J05542
KEYWORDS    ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE      S.cerevisiae (strain W303-1B) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1308)
  AUTHORS   Kolarov,J., Kolarova,N. and Nelson,N.
  TITLE     A third ADP/ATP-translocator in yeast
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Nelson, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         78     1034     ADP/ATP-translocator protein (AAC3)
BASE COUNT      353 a    228 c    263 g    464 t
ORIGIN      
        1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata
       61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc
      121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg
      181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag
      241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca
      301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta
      361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt
      421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg
      481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat
      541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg
      601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac
      661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga
      721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt
      781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa
      841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga
      901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag
      961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta
     1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc
     1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata
     1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa
     1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc
     1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt
//
LOCUS       ECODKSA      1273 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  E.coli dnaK suppressor (dksA) gene, complete cds.
ACCESSION   M34945
KEYWORDS    dnaK suppressor.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1273)
  AUTHORS   Kang,P.J. and Craig,E.A.
  TITLE     Identification and characterization of a new Escherichia coli gene
            that is a dosage-dependent suppressor of a dnaK deletion mutation
  JOURNAL   J. Bacteriol. 172, 2055-2064 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        229      441     ORF 1
    pept        619     1074     dnaK suppressor (dksA)
BASE COUNT      343 a    301 c    333 g    296 t
ORIGIN      
        1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa
       61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc
      121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata
      181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg
      241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt
      301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg
      361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa
      421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg
      481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg
      541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc
      601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt
      661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa
      721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa
      781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac
      841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac
      901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc
      961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg
     1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag
     1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga
     1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct
     1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt
     1261 acgcatagaa gat
//
LOCUS       STFCYCLI     2180 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds.
ACCESSION   M33880
KEYWORDS    cyclin B.
SOURCE      Starfish (A.pectinifera) egg, cDNA to mRNA, clone
            lambda-gt10-cyc10.
  ORGANISM  Asterina pectinifera
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa; 
            Stelleroidea; Asteroidea; Spinulosida; Asterinidae.
REFERENCE   1  (bases 1 to 2180)
  AUTHORS   Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T.
  TITLE     The starfish egg mRNA responsible for meiosis reinitiation encodes
            cyclin
  JOURNAL   Dev. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by T.Kishimoto, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        126     1313     cyclin B (CYC)
    mRNA    <   126     2180     cyclin B mRNA
BASE COUNT      650 a    440 c    482 g    608 t
ORIGIN      
        1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg
       61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga
      121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt
      181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc
      241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag
      301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca
      361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa
      421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct
      481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac
      541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt
      601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc
      661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc
      721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc
      781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg
      841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt
      901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca
      961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca
     1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag
     1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc
     1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat
     1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca
     1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg
     1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt
     1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt
     1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg
     1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa
     1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt
     1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga
     1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc
     1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca
     1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac
     1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat
     1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag
     1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat
     2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg
     2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt
     2161 gtaataaagt taatggaaat
//
LOCUS       CHKMTTGHA      90 bp ds-DNA             ORG       10-JUL-1990
DEFINITION  Chicken mitochondrial His-tRNA gene.
ACCESSION   M34496 M34497
KEYWORDS    transfer RNA-His.
SOURCE      Chicken (strain white leghorn) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae; Gallus gallus.
REFERENCE   1  (bases 1 to 90)
  AUTHORS   L'Abbe,D., Lang,B.F., Desjardins,P. and Morais,R.
  TITLE     Histidine tRNA from chicken mitochondria has an uncoded 5'-terminal
            guanylate residue
  JOURNAL   J. Biol. Chem. 265, 2988-2992 (1990)
  STANDARD  simple staff_entry
COMMENT     the "n"s in the tRNA sequence are probably modified bases.
FEATURES       from  to/span     description
    tRNA         11       79     His-tRNA
    anticdn      41       43     His-tRNA anticodon gtg
    variant      10       10     t in DNA, n in tRNA
    variant      18       18     t in DNA; n in tRNA
    variant      19       19     a in DNA; n in tRNA
    variant      26       26     c in DNA; n in tRNA
    variant      27       27     c in DNA; n in tRNA
    variant      35       35     t in DNA; n in tRNA
    variant      45       45     t in DNA; n in tRNA
BASE COUNT       27 a     22 c     17 g     24 t
ORIGIN      
        1 acccctctat gcaaacatag tttaacccaa acattagatt gtgattctaa aaataggagt
       61 ttaaccctcc ttgttcgccg aggggaggcc
//
LOCUS       DDISAS1A     2145 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  D.discoideum GTP-binding protein (SAS1) gene, complete cds.
ACCESSION   M34456
KEYWORDS    GTP-binding protein.
SOURCE      D.discoideum, cDNA to mRNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 2145)
  AUTHORS   Saxe,S.A. and Kimmel,A.R.
  TITLE     SAS1 and SAS2, GTP-binding protein genes in Dictyostelium
            discoideum with sequence similarities to essential genes in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 2367-2378 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept       1095     1706     GTP-binding protein (SAS1)
BASE COUNT      929 a    249 c    238 g    727 t      2 others
ORIGIN      
        1 gggaattatt aggacatcag gtttaaaacc tattcagaca ccagaataca atttgaattg
       61 agcggcaacg ttcctttcac tctgcactac atcagcatta ttagagagaa aggttgaaaa
      121 acctctatcg aaggtggtgg aattgctgag aagtaacagc aataaataaa acattcaaac
      181 cgatagatga gaggttcaaa atccatctag ttagtagggc taaaaaacta caaatcataa
      241 acccgatccg atacctaaga ctcctttttt tttttttttt tttttaataa atcaaataat
      301 cacatgacct tggagtcttg gtctgcccac gaatttaaag tgcaaagttt attttattta
      361 aactgggtgc atgcaaacat tactctatcg accgatttat ccaattttaa tactaaaatc
      421 ttaaaaacca gaaagaanna ataataataa taataataat aataataata ataataataa
      481 taataataat aataataata ataataataa taataataat aataataata ataataataa
      541 taataataat aataataata ataataacaa ccttatttga aaattcaaat taaaaaaaaa
      601 agaaatagct ttacatttta aaattaaaat tcataaataa aaccattata aaaatattga
      661 agtatatcaa taggtttaat ttaattattg tttatttaat aaaaaaaaaa aaaaaaaaaa
      721 aaaattattt aatcggttca atttaacttt ttcgaagaat tatttttttt aagaaaacat
      781 ttcaacccaa aaaaataaaa aaaataaaaa aataaaaatt taaatcgaat ggttgaaatg
      841 ttttcttaaa aaaacaaaaa ttaaaataaa ttttattttt tttgaattaa atttcaattc
      901 agcaattcaa taattttaac gttttcactt catcaaaaat tataaataga atattaaaca
      961 caacacaaca caactatcca aactaaaaca attaaaatca aaactctaat tttttataaa
     1021 aatttattta ttttctcatc tcaataaaaa catttaaaaa cataattggt aatatagata
     1081 tttttttcaa aataatgact tctccagcaa caaataaacc agcagcctac gattttttag
     1141 ttaaattact tttaattggg gatagtggtg taggaaagtc atgtctttta ttacgttttt
     1201 ctgatggttc tttcacacca agtttcatcg ctactattgg tatcgatttc aaaattcgta
     1261 caattgaatt agagggtaaa agaattaaat tacaaatttg ggacactgca ggtcaagaaa
     1321 gattcagaac tatcactaca gcatactatc gcggtgctat gggtatccta ttggtttatg
     1381 atgtcactga tgaaaaatct tttggtagca ttagaaattg gattagaaat atcgagcaac
     1441 atgcttcaga ctcagttaat aaaatgttaa tcggtaataa atgtgatatg accgaaaaga
     1501 aagttgttga tagctcaaga ggtaaatcac ttgcagacga atatggtatt aaatttttag
     1561 aaacttctgc caaaaacagt gtaaatgtag aggaagcctt tattggttta gcaaaagata
     1621 ttaaaaaacg tatgattgat acaccaaatg atcctgatca taccatatgc attactccaa
     1681 acaataagaa aaatacttgt tgttaaattg gggccatttt aattttcaca ttattagatg
     1741 aaaaaaaaaa aaaaaaaaaa ctaaaattaa aagtaaaaaa cacttttttt tatttaaaaa
     1801 tattattttt cattagtcat gaatggttac gtctaaacga tctaatattt ctctatagta
     1861 gtgaattatt gcttcatgaa ttttagtgaa aagtttagct taataataat aataataata
     1921 ataataataa taataataat aataataata ataataataa ataataataa caattttaaa
     1981 attaaatatc caatgttgaa tattttaagt caaaaataat aataataatt ggaatgtatt
     2041 ttaaaattaa aattcataaa taaactatta attattgttt attgccttta atggctaacc
     2101 tattttttat agtttaaaaa taatttataa ttaatttttt taaat
//
LOCUS       DDISAS2A      989 bp ds-DNA             INV       10-JUL-1990
DEFINITION  D.discoideum GTP-binding protein (SAS2) gene, complete cds.
ACCESSION   M34457
KEYWORDS    GTP-binding protein.
SOURCE      D.discoideum DNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 989)
  AUTHORS   Saxe,S.A. and Kimmel,A.R.
  TITLE     SAS1 and SAS2, GTP-binding protein genes in Dictyostelium
            discoideum with sequence similarities to essential genes in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 2367-2378 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         49      675     GTP-binding protein (SAS2)
BASE COUNT      421 a    141 c    142 g    285 t
ORIGIN      
        1 atcaatcaat aaactacaaa tttataatat agatattttt tcgaaataat gacttctcca
       61 gcaacaaata aatcagcagc ctacgattat ttaattaaat tacttttaat cggtgatagt
      121 ggtgtaggta aatcatgtct tttattacgt ttttctgaag attctttcac accaagtttc
      181 atcactacta ttggtatcga tttcaaaatt cgtacaattg aattggaagg taaaagaatt
      241 aaattacaaa tttgggatac tgcaggtcaa gaaagattca gaactatcac tacagcatac
      301 tatcgtggtg ctatgggtat cctattggtt tatgatgtca ctgatgaaaa atcttttggt
      361 aacattagaa attggattag aaatatcgag caacatgcta cagactctgt taataaaatg
      421 ttaatcggta ataaatgtga tatggctgaa aagaaagttg ttgatagctc aagaggtaaa
      481 tcacttgcag acgaatatgg tattaaattt ttagaaacct cagccaaaaa cagtataaat
      541 gtagaggaag cctttattag tttagcaaaa gatattaaaa aacgtatgat tgatacacca
      601 aatgaacaac cacaagttgt tcaaccaggt acaaatcttg gtgcaaataa caataagaaa
      661 aaagcttgtt gttaaattgg gtgctatttt aattttcaca ttatattatt agataaaaat
      721 aaaaaaaaaa aaaaaaatct taaaaaaaaa aaaaaaagtc atcaaaatta ttcacctaaa
      781 aaaataacat ataaaccctg ggtttcaagg cagaggatga ttcacttaca acaacaacaa
      841 caacaaccaa caacaacaac aacaaccaac aacaacaact aacaacaaca acaaataata
      901 ataataataa aaataataat aataaatccc caagttgtga agttgtgttg aaattaataa
      961 gagtgggagg tttatatcgc ataaataac
//
LOCUS       HUMLAMBA     2850 bp ss-mRNA            PRI       10-JUL-1990
DEFINITION  Human lamin B mRNA, complete cds.
ACCESSION   M34458
KEYWORDS    intermediate filament; lamin B.
SOURCE      Human T-cell line MOLT-4, cDNA to mRNA, clone LAM-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2850)
  AUTHORS   Pollard,K.M., Chan,E.K.L., Grant,B.J., Sullivan,K.F., Tan,E.M. and
            Glass,C.A.
  TITLE     In vitro posttranslational modification of lamin B cloned from a
            human T-cell line
  JOURNAL   Mol. Cell. Biol. 10, 2164-2175 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        342     2102     lamin B
    mRNA    <   342     2850     lamin B mRNA
    signal     2834     2839     polyA signal
BASE COUNT      776 a    614 c    748 g    712 t
ORIGIN      
        1 cgcgagcagg agacggcggc gggcgaaccc tgctgggcct ccagtcaccc tcgtcttgca
       61 ttttcccgcg tgcgtgtgtg agtgggtgtg tgtgttttct tacaaagggt atttcgcgat
      121 cgatcgattg attcgtagtt cccccccgcg cgcctttgcc ctttgtgctg taatcgagct
      181 cccgccatcc caggtgcttc tccgttcctc taaacgccag cgtctggacg tgagcgcagg
      241 tcgccggttt gtgccttcgg tccccgcttc gccccctgcc gtcccctcct tatcacggtc
      301 ccgctcgcgg cctcgccgcc ccgctgtctc cgccgcccgc catggcgact gcgacccccg
      361 tgccgccgcg gatgggcagc cgcgctggcg gccccaccac gccgctgagc cccacgcgcc
      421 tgtcgcggct ccaggagaag gaggagctgc gcgagctcaa tgaccggctg gcggtgtaca
      481 tcgacaaggt gcgcagcctg gagacggaga acagcgcgct gcagctgcag gtgacggagc
      541 gcgaggaggt gcgcggccgt gagctcaccg gcctcaaggc gctctacgag accgagctgg
      601 ccgacgcgcg acgcgcgctc gacgacacgg cccgcgagcg cgccaagctg cagatcgagc
      661 tgggcaagtg caaggcggaa cacgaccagc tgctcctcaa ctatgctaag aaggaatctg
      721 atcttaatgg cgcccagatc aagcttcgag aatatgaagc agcactgaat tcgaaagatg
      781 cagctcttgc tactgcactt ggtgacaaaa aaagtttaga gggagatttg gaggatctga
      841 aggatcagat tgcccagttg gaagcctcct tagctgcagc caaaaaacag ttagcagatg
      901 aaactttact taaagtagat ttggagaatc gttgtcagag ccttactgag gacttggagt
      961 ttcgcaaaag catgtatgaa gaggagatta acgagaccag aaggaagcat gaaacgcgct
     1021 tggtagaggt ggattctggg cgtcaaattg agtatgagta caagctggcg caagcccttc
     1081 atgagatgag agagcaacat gatgcccaag tgaggctgta taaggaggag ctggagcaga
     1141 cttaccatgc caaacttgag aatgccagac tgtcatcaga gatgaatact tctactgtca
     1201 acagtgccag ggaagaactg atggaaagcc gcatgagaat tgagagcctt tcatcccagc
     1261 tttctaatct acagaaagag tctagagcat gtttggaaag gattcaagaa ttagaggact
     1321 tgcttgctaa agaaaaagac aactctcgtc gcatgctgac agacaaagag agagagatgg
     1381 cggaaataag ggatcaaatg cagcaacagc tgaatgacta tgaacagctt cttgatgtaa
     1441 agttagccct ggacatggaa atcagtgctt acaggaaact cttagaaggc gaagaagaga
     1501 ggttgaagct gtctccaagc ccttcttccc gtgtgacagt atcccgagca tcctcaagtc
     1561 gtagtgtacg tacaactaga ggaaagcgga agagggttga tgtggaagaa tcagaggcga
     1621 gtagtagtgt tagcatctct cattccgcct cagccactgg aaatgtttgc atcgaagaaa
     1681 ttgatgttga tgggaaattt atccgcttga agaacacttc tgaacaggat caaccaatgg
     1741 gaggctggga gatgatcaga aaaattggag acacatcagt cagttataaa tatacctcaa
     1801 gatatgtgct gaaggcaggc cagactgtta caatttgggc tgcaaacgct ggtgtcacag
     1861 ccagcccccc aactgacctc atctggaaga accagaactc gtggggcact ggcgaagatg
     1921 tgaaggttat attgaaaaat tctcagggag aggaggttgc tcaaagaagt acagtcttta
     1981 aaacaaccat acctgaagaa gaggaggagg aggaagaagc agctggagtg gttgttgagg
     2041 aagaactttt ccaccagcag ggaaccccaa gagcatccaa tagaagctgt gcaattatgt
     2101 aaaattttca actgtcttcc tcaaaataaa gaagtatggt aatctttacc tgtatacagt
     2161 gcagagcctt ctcagaagca cagaatattt ttatatttcc tttatgtgaa tttttaagct
     2221 gcaaatctga tggccttaat ttcctttttg acactgaaag ttttgtaaaa gaaatcatgt
     2281 ccatacactt tgttgcaaga tgtgaattat tgacactgaa cttaataact gtgtactgtt
     2341 cggaaggggt tcctcaaatt ttttgacttt ttttgtatgt gtgttttttc ttttttttta
     2401 agttcttatg aggaggggag ggtaaataaa ccactgtgcg tcttggtgta atttgaagat
     2461 tgccccatct agactagcaa tctcttcatt attctctgct atatataaaa cggtgctgtg
     2521 agggagggga aaagcatttt tcaatatatt gaacttttgt actgaatttt tttgtaataa
     2581 gcaatcaagg ttataatttt ttttaaaata gaaattttgt aagaaggcaa tattaaccta
     2641 atcaccatgt aagcactctg gatgatggat tccacaaaac ttggttttat ggttacttct
     2701 tctcttagat tcttaattca tgaggagggt gggggaggga ggtggaggga gggaagggtt
     2761 tctctattaa aatgcattcg ttgtgttttt taagatagtg taacttgctt aaatttctta
     2821 tgtgacatta acaaataaaa aagctctttt
//
LOCUS       VIBANGRA     4379 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  V.anguillarum trans-acting transcriptional activator (angR),
            S-acyl fatty acid synthesis thioesterase-like protein genes,
            complete cds, and outer membrane protien (omp), 3' end.
ACCESSION   M34504
KEYWORDS    S-acyl fatty acid synthesis thioesterase-like protein;
            outer membrane protein; trans-acting transcriptional activatior.
SOURCE      V.anguillarum DNA, clone pJHC-A103.
  ORGANISM  Vibrio anguillarum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 4379)
  AUTHORS   Farrell,D.H., Mikesell,P., Actis,L.A. and Crosa,J.H.
  TITLE     A regulatory gene, angR, of the iron uptake system of Vibrio
            anguillarum: Similarity with phage P22 cro and regulation by iron
  JOURNAL   Gene 86, 45-51 (1990)
  STANDARD  simple staff_entry
COMMENT     Fur protein is a product of the ferric uptake regulatory gene
            (fur).
FEATURES       from  to/span     description
    pept    <     1      275     outer membrane receptor protein
    pept        361     3507     trans-acting transcriptional activator (angR)
    pept       3504     4262     S-acyl fatty acid synthase thioesterase-like
                                 protein (ORF6)
    binding     154      158     Fur binding site
    binding     348      353     ribosome binding site (put.)
    binding    3488     3493     ribosome binding site (put.)
    signal       34       39     -35a region (put.)
    signal       63       68     -10a region (put.)
    signal      109      114     -35b region (put.)
    signal      126      131     -10b region (put.)
    signal     3387     3392     -35c region (put.)
    signal     3405     3410     -10c region (put.)
    signal     3443     3448     -35d region (put.)
    signal     3456     3461     -10d region (put.)
BASE COUNT     1320 a    914 c    902 g   1243 t
ORIGIN      
        1 ggaacctacc agtgatgcgt caacttactc ttattggtca agcaaattac atgtcagagc
       61 aatatattga tgcacaaaac actcaatcac tgtctgcaca gactattttt gatttaggtg
      121 ctcgctataa ctctaccatc gccaatcaaa gtgtcatttg gcgtcttgcg gtcaacaacg
      181 taaccgatga agcatattgg actaccaccc attacgctag ccttgcgttg ggtgcccctc
      241 gtacggtgat gctatctgct acagcggatt tttaatctcg gtcaattttg cccttgacct
      301 ttctggttaa gggcattcgt cttccccttc cccccatttg gctttttatg agaatttaga
      361 atgaatcaaa atgaacatcc cttcgctttc cctgagacaa aattaccttt aacctccaat
      421 caaaattggc agttatcaac ccaaagacag cgtactgaaa aaaaatcgat taccaatttt
      481 acgtatcagg aatttgatta cgaaaacatt tcgagggaca cattagaacg ctgcctcaca
      541 acaataatta agcatcaccc aatattcgga gctaagctca gtgacgactt ctacctccat
      601 tttccgagca aaactcacat tgaaaccttt gcagttaatg acttaagtaa tgccttaaaa
      661 caagatattg ataaacagtt ggccgatacg cgttctgcag taacgaaaag ccgctcacaa
      721 gcgataatct ctatcatgtt tagtatattg ccaaaaaaca taatcaggct tcatgtacgc
      781 ttcaactcag ttgttgtaga taatccaagt gttacgcttt tttttgagca gcttactcag
      841 ttattatcgg gaagtcccct ttctttttta aatcaagaac agactatctc cgcatacaat
      901 cacaaagtta ataatgagtt gcttagtgtt gatcttgagt ccgcaagatg gaatgaatat
      961 attctaacac tacctagttc agcaaacctt cccacaattt gtgaacccga aaaactggat
     1021 gaaaccgata tcactcgcag gtgcattaca ctgtcacaaa ggaagtggca gcagttggtt
     1081 actgttagca aaaaacataa tgtcacaccg gagataactc ttgccagtat attttcgacc
     1141 gttttatcac tctgggggca tcaaaaatac ctcatgatga gatttgatat caccaaaatc
     1201 aatgactaca cgggcatcat aggccagttt accgaacctt tattagtggg tatgtccggc
     1261 tttgagcaga gctttctttc tcttgttaaa aacaaccaaa aaaagttcga agaagcttat
     1321 cattatgacg ttaaagtacc tgtttttcag tgtgttaata aattatctaa tatttcggat
     1381 tctcaccgtt atcctgctaa tatcactttt tctagcgagc ttttaaacac aaaccatagc
     1441 aaaaaagctg tatggggatg tcgtcaatca gccaatactt ggctttcttt acatgctgta
     1501 atcgagcaag aacaacttgt cttacaatgg gacagccaag acgcaatctt cccaaaagac
     1561 atgatcaaag atatgttaca tagttacacc gatttattag acttactcag ccaaaaagat
     1621 gtcaactggg cacagccttt accaactttg ctgccaaaac atcaggagtc catacgcaat
     1681 aaaataaatc aacagggaga cctagaacta actaaagaac tcctccatca gcgttttttt
     1741 aaaaacgtag agtccacccc taatgctctt gcgattatcc atggtcaaga gtcattagat
     1801 tatataactt tagcaagcta cgccaagagt tgtgcgggtg cactaaccga agctggagta
     1861 aaatcaggag accgcgttgc tgtcactatg aataaaggca ttggtcaaat agtggcagta
     1921 ttgggaatat tatatgctgg ggctatttat gttcctgtct ctctagatca accacaagaa
     1981 aggcgggaaa gtatttatca aggtgctgga attaacgtta ttcttattaa cgaatcagat
     2041 agtaaaaatt ccccttcaaa tgatcttttc tttttcctgg actggcaaac agcgataaag
     2101 agtgagccaa tgcgtagccc tcaagatgtc gcgccaagtc aaccagccta tattatctac
     2161 acatcaggct caacaggaac ccctaaggga gtggtgattt ctcaccaagg cgctcttaat
     2221 acatgtatcg cgatcaatcg acgttatcaa attgggaaaa atgatcgagt attggctctt
     2281 tcagcactac attttgacct ttcggtatac gacatctttg gcctactttc tgccggcggc
     2341 actatcgtat tagtcagtga gcttgaaaga cgtgacccga ttgcttggtg tcaagcaatt
     2401 gaggagcata atgtcaccat gtggaatagc gtcccagcat tatttgatat gttattaact
     2461 tacgctactt gctttaactc tatcgctccc tcaaaactcc gtttaaccat gctttcggga
     2521 gactggattg gattagattt accgcagcgt tatcgcaatt atcgtgtaga tggccaattt
     2581 attgcgatgg gaggagccac cgaagcatcg atatggtcaa acgtctttga cgtagagaaa
     2641 gttccgatgg agtggcgctc tatcccttat ggctatcctc tacctagaca acaatatcga
     2701 gttgtcgatg acttggggcg agattgccca gattgggtag ctggcgaact ttggattggt
     2761 ggtgacggta tcgcactggg gtattttgac gatgaattga aaacgcaagc tcagttttta
     2821 catattgatg gccatgcttg gtatcgtact ggtgacatgg gctgttattg gccagatggt
     2881 actcttgagt tcttggggcg aagagacaag caggtcaaag taggaggtta cagaattgag
     2941 ttgggagaaa tcgaagttgc actcaataat ataccggggg tgcagcgtgc ggttgctatc
     3001 gcagtgggca ataaagacaa aactctagca gcattcatcg ttatggattc ggagcaagca
     3061 ccaatagtta cagcgccgtt ggatgcagaa gaagttcaac ttttgttgaa caaacaactg
     3121 cctaactaca tggttcccaa acgcataatt ttccttgaaa ccttccccct aaccgctaat
     3181 ggtaaagtcg atcataaagc tctaactcga atgactaacc gagaaaagaa aacatctcaa
     3241 agcataaata aacctattat tactgcgagt gaagatagag tagccaaaat ttggaatgac
     3301 gttcttggtc ctacagaact ctataaatcg agtgatttct ttttgtcggg aggagatgca
     3361 tacaacgcaa tagaggtagt caaacgttgt cataaagctg gctatctaat caagctatca
     3421 atgttgtacc gttattctac gattgaagct ttcgctatta tcatggaccg ttgtcgatta
     3481 gcacctcagg aagaggctga gttatgagcc ctttaatcaa acttgcagcc tcttcgaggc
     3541 tgcatgatgc aactcattat gttttatgcc cttttgcagg aggtggtagt ggtgcattta
     3601 gacactggcg tacattatcc cttgaaaatg aagtgatttc ggtaatgctt tatcctggta
     3661 gagaatttcg tatagacgac ccaacagtca taaacatcgg cacattagca gaagaaatga
     3721 tccaagcttt aaaaacctgt aatcaacgaa tagaagatac gatcattgtc ggtcatagta
     3781 tgggcgcgca agtggcgtat gaagcaagta aaaaactagt aaatcagggg ctatttctga
     3841 aagggctgat catctctggt tgtcaagctc ctcatatcaa agggcgaagg ttactaggtg
     3901 aatgcgatga taaaaccttt attcataatc tagtcgagat tggagggtgt gatccaagtt
     3961 tagctaaaag tccagagtgg tggccgatat ttctgccagc tttgagggcg gactttacgg
     4021 ctacagaaca gtatattttc acatcacttc caaatgataa ggaaggcctt cctatcccaa
     4081 ctctattgat ttcaggtgat caagatagag aagctaactt ttcagaaata gaagagtgga
     4141 aactttggtg taataaagtc gttgatcatt tagtggtcga gggcgggcat ttctatataa
     4201 cagagcaacc tcaaatgatg cttgaatgca tccgggcttt atcaaccgaa acgactgcct
     4261 aatactaagg ttcggttgat agatttttag acaaacaact tcaaacgaca agggtatgca
     4321 tttaagcaat gcataccctg ggcttttcga tcaacactat tacttggttt ccggaattc
//
LOCUS       VIBLUXABG    3200 bp ds-DNA             BCT       10-JUL-1990
DEFINITION  P.leiognathi luciferase alpha (luxA), beta (luxB) subunit, and
            gamma protein (luxG) genes, complete cds.
ACCESSION   M34564
KEYWORDS    gamma protein; luciferase.
SOURCE      P.leiognathi (strain 554) DNA, clone pPHL[6,11,12].
  ORGANISM  Photobacterium leiognathi
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 3200)
  AUTHORS   Illarionov,B.A., Blinov,V.M., Donchenko,A.P., Protopopova,M.V.,
            Karginov,V.A., Mertvetsov,N.P. and Gitelson,J.I.
  TITLE     Isolation of bioluminescent functions from Photobacterium
            leiognathi: Analysis of luxA, luxB, luxG and neighboring genes
  JOURNAL   Gene 86, 89-94 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      145     ORF1
    pept        182     1246     luciferase alpha-subunit (luxA)
    pept       1295     2272     luciferase beta-subunit (luxB)
    pept       2293     2979     gamma protein (luxG)
    pept       3081  >  3200     ORF2
    binding     171      174     ribosomal binding site (put.)
    binding    1284     1287     ribosomal binding site (put.)
    binding    2281     2284     ribosomal binding site (put.)
    binding    3072     3075     ribosomal binding site (put.)
BASE COUNT     1092 a    616 c    586 g    906 t
ORIGIN      
        1 tcgagcagcc attggcttag acagtgaagt gattgattta gttgatgata ttagtgagcc
       61 aaactttgaa gatctcacca ttattacagt taatgaacgt cgtttgaaaa ataaaattga
      121 aaacgaaatg ttcgctagcg cttaaaccaa tacctattca agtcatcaaa aggaaaagat
      181 aatgaaattt ggcaatattt gtttctcata ccagccacca ggtgaatctc ataaagaagt
      241 catggatcgc tttgttcgtc ttggcgttgc ttcagaagaa ttaaacttcg acaccttctg
      301 gacacttgag caccacttca ctgaattcgg cctaacaggt aacttatatg ttgcttgtgc
      361 caatattctt ggtcgtacca aaaaacttaa cgtcggcaca atgggtatcg tactaccaac
      421 agctcaccct gctcgccaaa tggaagatct actgctactg gatcaaatgt caaaaggacg
      481 ttttaacttt ggtgtagtac gtggtctata ccataaagat ttccgggtat ttggtgttac
      541 gatggaagat tctcgttcga tcactgaaga tttccataaa atgatcatgg acggctctaa
      601 atcaggcgtt ttacacactg atggtaaaaa cattgaattc ccagatgtaa atgtctatcc
      661 agaggcctac ctagacaaga tccctacttg tatgacagcg gaatctgcgg cgacaacgac
      721 ctggctagca gaacgtggtt tgccaatggt actgagctgg atcatcacca ccagcgagaa
      781 aaaagcacag atggaactat acaatgaaat tgcagctgag catgggcacg atattcacaa
      841 tatcgaccac agcatgacct tcatctgttc cgttaatgaa gatccagaaa aagcagaaag
      901 tgtctgccgt gacttcctat caaactggta cgagtcctac accaatgcga ccaatatctt
      961 taaagacagt aaccaaactc gtggttatga ctatcacaaa ggtcaatggc gtgactttgt
     1021 actacaaggc cataccgata cccgtcgtcg tcttgattac agtaataacc taaaccctgt
     1081 tggtacacct gaaaaatgta ttgaaattat ccagcgagat atcgatgcaa cagggatcaa
     1141 caacatcacc cttggttttg aagcaaacgg ttctgagcaa gaaatcatcg catcgatgga
     1201 acgcttcatg acacaagtgg cgccatacct aaaagatccg aaataaactg ccacattaaa
     1261 gccattgaat taaattataa ataaggaaaa aaacatgaat tttggattat tctttctgaa
     1321 ctttcagctc aaaggtatga catctgaagc agtactagac aacatgatcg atactattgc
     1381 tttggttgat aaagacgagt accacttcaa aaccgcattt gtgaacgaac accatttttc
     1441 taaaaacggt atcgttgggg cacctatgac agctgcaagt tttctactag gtttaactga
     1501 acgccttcat attggttcat tgaatcaagt gatcaccact caccacccag tccgtattgc
     1561 agaagaagct agcttacttg atcaaatgtc agatgggcgt tttattcttg ggttaagtga
     1621 ttgtgttagt gatttcgaga tggacttctt taaacgccaa cgagatagcc aacaacaaca
     1681 attcgaagcc tgttacgaaa ttctaaatga cggtatcact accaactact gttatgcgaa
     1741 taatgacttt tataacttcc caaaaatctc tatcaaccca cactgtatta gtaaagaaaa
     1801 cctaaaacag tatattttag cgaccagcat gggcgtggtg gaatgggctg cgaaaaaagg
     1861 gttaccactg acttaccgct ggagtgatac gctggcagaa aaagaaaatt actatcaacg
     1921 ttatttaact gtcgccgctg aaaataatgt cgacattact catgttgatc accaattccc
     1981 attacttgtt aacattaatc cggatcgtga tattgctaaa caagaaatgc gtgactatat
     2041 ccgtggttat attgctgaag cttacccaaa tacagatcaa gaagaaaaaa ttgaagagct
     2101 aattaagcaa catgcggttg gtacagaaga tgaatattat gaatcatcta aatatgcttt
     2161 agaaaaaaca ggttcaaaga atgtattgct atcttttgaa tcaatgaaaa ataaagccgc
     2221 tgtcatcgac cttattaata tggttaatga aaaaatcaag aaaaatctat aataaataac
     2281 aggataataa aaatgacaaa atggaattat ggcgtcttct tccttaattt ttaccatgta
     2341 ggacagcaag agccatcatt aaccatgagc aatgcgttag aaacattacg tattatagat
     2401 gaagatacat ctatctatga tgttgttgca tttagcgaac accacataga taaaagctac
     2461 aatgatgaaa cgaaattagc gccatttgtt agccttggca aacaaattca tattttagcc
     2521 accagccctg aaacggttgt aaaagcggct aaatatggga tgccactact gtttaaatgg
     2581 gatgatagtc aacaaaagcg tatcgaatta ttaaaccatt accaagcagc tgcggctaaa
     2641 tttaatgtcg atattgcagg tgttcgtcat cgattaatgt tatttgtcaa tgttaatgac
     2701 aacccaacgc aagccaaagc tgagcttagc atttacttag aagattacct ctcttacacc
     2761 caagcagaaa catccattga tgaaatcatc aatagcaatg ctgcaggcaa cttcgatacg
     2821 tgtttacatc acgttgctga aatggctcaa ggtttaaata ataaagtcga tttcttattt
     2881 tgctttgaat cgatgaaaga tcaagagaat aaaaaatcac taatgattaa ctttgataaa
     2941 cgcgttatta attatagaaa agaacacaac cttaactaat tcagttaagt caatttaaat
     3001 taaaacttcg tcaatcattg tcattattaa tggcagtgtg gcttcttacg ctgccattaa
     3061 attttttatt aaggtgtaat atgactactt tattagatat tgatactaac gatattattg
     3121 ttagttcaga actcgatgat attattttct catcatcacc gtttacatta acctttgatg
     3181 agcaagaaaa attaaagcaa
//
LOCUS       YSCSLP1A     3456 bp ss-mRNA            PLN       10-JUL-1990
DEFINITION  S.cerevisiae vacuolar function expression protein (SLP1) gene,
            complete cds.
ACCESSION   M34474
KEYWORDS    .
SOURCE      S.cerevisiae, cDNA to mRNA, clone pYKK101.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3456)
  AUTHORS   Wada,Y., Kitamoto,K., Kanbe,T., Tanaka,K. and Anraku,Y.
  TITLE     The SLP1 gene of Saccharomyces cerevisiae is essential for vacuolar
            morphogenesis and function
  JOURNAL   Mol. Cell. Biol. 10, 2214-2223 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        692     2767     SLP1 protein
    signal      536      544     TATA box
BASE COUNT     1136 a    588 c    691 g   1041 t
ORIGIN      
        1 ctgcagctaa tcacgtgctc acatctttac tcaatgagat tgatggtgtt gaagagttaa
       61 agggtgtagt tattgtagcg gcgacgaata gacctgatga aatagatgct gctcttctaa
      121 ggcctggtag gttagataga cacatttacg ttggccctcc agacgtaaac gcccgcttgg
      181 aaatcttaaa gaagtgcaca aagaaattta atacagaaga gtctggagtc gatcttcatg
      241 aattggcaga ccgtacagaa ggttattccg gagctgaagt tgtgctgctt tgtcaagaag
      301 cgggcttggc tgccataatg gaagatttag atgtcgcaaa agtggaatta cgtcattttg
      361 agaaagcttt taaaggaatt gctaggggca ttactccaga aatgctctct tattatgaag
      421 agtttgctct aagaagcggt tcatcttcgt aagcttgttc atagtcaatt cttttccttt
      481 gtgtgctcaa taatagtaga tagaaattat actgaactcc ggtcattttg tataatatat
      541 taatcacttc acacgaacat acataaataa aatatcataa aggttagcaa attggaacta
      601 gttatatgtt aattagttaa aagatagaaa attcgagaaa ggaagaaaaa gctgatattg
      661 cccatctcca actttatcaa atcatttcac gatgaataga ttttggaata ctaagaaatt
      721 ttcattaaca aatgccgatg gactatgtgc taccttaaat gagatatctc aaaatgatga
      781 agttcttgtg gttcaaccaa gtgtattgcc agtactcaat agtttgctaa ctttccaaga
      841 tttgactcaa tcaactcctg taaggaaaat tacgttactc gatgatcagc taagtgacga
      901 tttaccgagt gccttaggca gcgttccgca aatggatctt atttttctta ttgatgtcag
      961 aacatctctc cgactccctc cacaactgct tgatgctgct caaaagcaca atttatcatc
     1021 tttgcatata atatactgtc gatggaaacc gtctttccaa aatactttgg aggatacaga
     1081 gcaatggcaa aaggatggtt tcgatttgaa ttcaaaaaaa acacatttcc ctaacgtcat
     1141 tgaatctcag ttaaaggagc tatcgaacga atataccctt tacccttggg atctcttgcc
     1201 cttcccacag attgatgaaa atgttctatt gactcattcc ctttataaca tggaaaatgt
     1261 aaacatgtat tatcccaact tacgttcttt gcagagtgcc acagagtcaa tactggttga
     1321 tgatatggtc aattcgttgc agagcttgat ttttgaaact aatagtatca taacaaatgt
     1381 tgtgtcgata ggtaatctgt ctaagagatg tagccatctt ttgaagaaac gaatcgatga
     1441 gcatcaaaca gagaatgatt tattcatcaa gggtacgctt tatggtgaac gaaccaactg
     1501 tggactagaa atggacttga ttatcttgga aaggaatacc gatcctataa cgccattgtt
     1561 gacacaactt acgtatgcag gaatactaga tgatctatat gaattcaatt ctggcataaa
     1621 gataaaggag aaagacatga acttcaatta taaggaagat aaaatatgga atgatttgaa
     1681 atttttaaat tttgggtcga ttgggccgca gttaaataaa ttggcaaagg aactacaaac
     1741 gcaatatgat acaaggcata aagccgagag cgtacatgaa atcaaagaat tcgttgattc
     1801 cttaggttct ttgcaacaaa ggcaagcttt tttgaaaaat cacacaacct tatcatccga
     1861 cgttttgaaa gtggtagaga ctgaagagta cggatctttc aataaaatct tagagttaga
     1921 gctggaaatt ttgatgggaa atacacttaa taacgacatt gaagatatta tactcgagtt
     1981 gcagtaccag tacgaggttg atcaaaagaa gattctcaga ttaatctgtt tattgtctct
     2041 ttgtaaaaat tcacttcgag aaaaggatta tgaatatcta agaaccttta tgatcgactc
     2101 ttggggcatt gaaaaatgct ttcaacttga atcattggct gagttaggat ttttcactag
     2161 caaaacggga aaaactgatt tgcatattac aacaagtaag tcaacaagat tacagaaaga
     2221 ataccgttat atttcacaat ggttcaatac agtacccata gaagacgagc atgctgccga
     2281 taaaatcaca aatgagaacg atgacttctc ggaagccact tttgcttaca gtggtgtagt
     2341 gcccttgaca atgagactgg ttcagatgtt atatgatagg tctatcttgt tccataatta
     2401 ttcctcgcag cagcctttta tactgtcaag agaacctaga gtttctcaaa cggaggattt
     2461 aattgaacag ttatatggag actcacatgc gatcgaagag agtatatggg tcccgggaac
     2521 cattacaaaa aagatcaatg caagcatcaa gagcaataat agacggtcca tagacggatc
     2581 taatgggaca tttcatgctg cagaggatat tgcactcgta gtattcctcg gaggtgtaac
     2641 aatgggtgaa atagctataa tgaagcattt gcaaaaaata ctaggtaaaa aaggtatcaa
     2701 taaaaggttt atcatcatcg ccgatggctt gatcaatggc acaaggatca tgaactctat
     2761 atcttaatta ttatatgata gatttgttaa ttttttgtat atgcaaatgt gcttttttca
     2821 ccaaacggtt tgcaccaatc atacgagaga agtgttcggt gtttacggaa aagctagggg
     2881 actaagaaaa attgaaaata aaggctgaca gcagtagaaa ccattgtgct ggcttagtga
     2941 tttataagaa tggttaatta gttttgtatc ctttattttc tagatagagc cacagagcaa
     3001 actaaacaga aaagttatcc atttccatta cgcaatgttg tgccaacaga tgattagaac
     3061 gacagctaag agaagtagca atatcatgac cagacctatt atcatgaaga ggtcagtaca
     3121 cttcaaagac ggtgtgtatg aaaatatccc attcaaagtc aaaggaagaa agacacctta
     3181 cgccttatct catttcgggt tcttcgctat tggatttgct gttccatttg ttgcctgcta
     3241 tgttcaattg aaaaagtcag gtgcttttta aaacaccccc ctaagttgaa ggatagatgt
     3301 gtgtacatag cgtgcttggt tgagacgttt tagagtgtgt tctttgctat tcctaggtgc
     3361 gcatatcatc gttttattta tttgtacaat tttcttttca tatattcata atcctctcct
     3421 tgtgccttcg tattgagacg gcgggaaaga aggatc
//
LOCUS       CHKMHBLBA    2405 bp ds-DNA             VRT       10-JUL-1990
DEFINITION  Chicken MHC class II B-LBII-beta gene, complete cds.
ACCESSION   M29763
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Chicken (haplotype B12) DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 2405)
  AUTHORS   Zoorob,R., Behar,G., Kroemer,G. and Auffray,C.
  TITLE     Organization of a functional chicken class II B gene
  JOURNAL   Immunogenetics 31, 179-187 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 16; 2332 to 2405)
  AUTHORS   Zoorob,R., Behar,G., Kroemer,G. and Auffray,C.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Zoorob, 06-NOV-1989, for release after publication.
            
               Author address: R.Zoorob
               Institut d'Embryologie Cellulaire et Moleculaire du CNRS
               et du College de France
               49bis av. de la Belle Gabrielle
               F-94736 Nogent sur Marne
               France
FEATURES       from  to/span     description
    pept        828      918     MHC B-LII-beta chain, exon 1
               1127     1396     MHC B-LII-beta chain, exon 2
               1483     1764     MHC B-LII-beta chain, exon 3
               1847     1957     MHC B-LII-beta chain, exon 4
               2049     2072     MHC B-LII-beta chain, exon 5
               2175     2188     MHC B-LII-beta chain, exon 6
    pre-msg     808     2331     MHC B-LII-beta chain mRNA and intron
    IVS         919     1126     MHC B-LII-beta chain intron A
    IVS        1397     1482     MHC B-LII-beta chain intron B
    IVS        1765     1846     MHC B-LII-beta chain intron C
    IVS        1958     2048     MHC B-LII-beta chain intron D
    IVS        2073     2174     MHC B-LII-beta chain intron E
BASE COUNT      380 a    728 c    902 g    395 t
ORIGIN      
        1 ggatccatgg gtgacgtaag gatgaggttc cagcacatat tggacccttc tgcgtttgca
       61 tggagggatc ttcgggggat ctttgtgatc ttcagtgatt ttcagtggtc tttggtggtc
      121 ttcagtgctc ttcgttggtc tttgacaaag atgcagagga gcaccgctcc cagacggacc
      181 ccccggggac cccatttgtc gccatcccca ctgggacatg cagccattga ccacagccct
      241 ccggctgcga ccacccaact gattccttat ccaaagtcca ctctttgcac acttacctcc
      301 aatttagtga taaggatgtg gcgtgggacc gtcccaatgg ccgcacacaa gtccaggtag
      361 atgatatggg atgaccatga agggatcaca gagaggaaca cggggtgacc acgaggagca
      421 acgaaggaaa cgctgagtga ccacgggcag aaaatggtgt gaccattagg ggacaacgag
      481 agggaacaga agtagtaagg agtgagaatg gggtgacaaa gaggtgacca tggcataact
      541 ttgataagac cattgggtga ccgcagggtg atggccatac catggggtga gcactggatg
      601 accatggagg tcattggagg accatcgggt gggacgaggg ccgtggggac acccgtgggg
      661 cggtgggacg ggggcagagt gtcagaagga gccccgcggc gcagaactct gcctggagac
      721 gggtgacgcc gcccggcgcc gccgccgctc attggccctc cccgcccggc cccgggctcg
      781 cggctggcgc ggggtgccgg gtcccccatc gtccggcggc agcagccatg gggagcgggc
      841 gcgtcccggc ggcgggggcc gtgctggtgg cactgctggc gctgggagcc cggccggccg
      901 ccggcacgcg gccctcgggt gagctcggag ccgcggcgcg gggacggcgc tgcgtccccc
      961 ccggagaaac ccccggagcc cttctggccg tgcgcagcgc tcggggctgc ggggggacgg
     1021 agggcggggg ggggcggcgg agccgtgggg ggcagcgggg ccggggaggg ggcggggggt
     1081 gtggcggggg gcggctgtgt gccctgaccg tgccctctgc ccgcagcgtt cttcttctgc
     1141 ggtgcgatat ccgagtgcca ctacctgaac ggcaccgagc gggtgaggta tctgcaaagg
     1201 tacatctaca accggcagca gttcacgcac ttcgacagcg acgtggggaa atttgtggcc
     1261 gattcaccgc tgggtgagcc gcaagctgaa tactggaaca gcaacgccga gcttctggag
     1321 aaccgaatga atgaagtgga caggttctgc cggcacaact acgggggtgt ggagtccttc
     1381 acggtgcaga ggagcggtga gtgccgcggg gcgcagcgcg gacggacggg caggcgccgc
     1441 gctctggcgg tcggtccgca gcgctccccc cgtgccccgc agtggagccc aaggtgaggg
     1501 tctcggcgct gcagtcgggc tccctgcccg aaaccgaccg tctggcgtgc tacgtgacgg
     1561 gcttctaccc gccggagatc gaggtgaagt ggttcctgaa cgggcgggag gagacggagc
     1621 gcgtggtgtc cacggacgtg atgcagaacg gggactggac gtaccaggtg ctggtggtgc
     1681 tggagaccgt cccgcggcgc ggggacagct acgtgtgccg ggtggagcac gccagcctgc
     1741 ggcagcccat cagccaggcg tggggtaagg cccccgggcc ctgccccgcc gcggggggag
     1801 cgggagcgcg gcccgccgcg ctgagccgcc gccttcgtcc ccgcagagcc gccggcggac
     1861 gcgggcagga gcaagctgct gacgggcgtg gggggcttcg tgctggggct cgtcttcctg
     1921 gcgctggggc tcttcgtgtt cctgcgcggt cagaaaggtg agcgctgggg aggggggctg
     1981 cgcggggggg gtcgggagcg gggggtgggg ggcagcgtcc gcgctgacct cgtctcgctg
     2041 tgtttcaggg cgccccgtcg ccgccgctcc aggtaacgtc ccgttcccat tcccgttccc
     2101 gttcccgttc ccgttccgcg ctgcgagcgg ccccgatccc ggcgcggggc tcagctctgc
     2161 ccgtctcccc gcagggatgc tgaattagct gctgccccgc cgagccgctg cacccgcacc
     2221 ccccgctctc ccggccgtcg cctcggctct ccctcgggct gccaccgcgt ccgttggaga
     2281 tgtcgccacg atgcacgctt cgtccccatc ctaataaacg cgctgacttt gaccccgctg
     2341 ttcgctgccc gtgaatcatt ggggactttc cgtcgcgtgg gaggagggga gggaagtgaa
     2401 agctt
//
LOCUS       CHKMHBLIIB    444 bp ds-DNA             VRT       10-JUL-1990
DEFINITION  Chicken MHC class II B-LBIII-beta gene, exon 1.
ACCESSION   M29764
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Chicken (haplotype B12) DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 444)
  AUTHORS   Zoorob,R., Behar,G., Kroemer,G. and Auffray,C.
  TITLE     Organization of a functional chicken class II B gene
  JOURNAL   Immunogenetics 31, 179-187 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Zoorob, 06-NOV-1989, for release after publication.
FEATURES       from  to/span     description
    pept        125  /   215     MHC B-LIII-beta chain, exon 1
    pre-msg     105  >   444     MHC B-LIII-beta chain mRNA and intron
    IVS         216  >   444     MHC B-LIII-beta chain intron A
BASE COUNT       50 a    156 c    176 g     62 t
ORIGIN      
        1 ctgatcgggg tacccgcaac ggagatctgc ctggagacgg gtgatgccgc ccagcccagg
       61 cactcactgc tccagagcag cggcgcgggc tgccggcacc cttcctcctc ctccggcagc
      121 agccatgggg agcggccgtg tcctggtggc cggggccgtg ctggtagcac tggtggcgct
      181 gggagcacgg caggccgccg gcacgcggcc ctcaggtgag ctcggagtcc cggtgtgggg
      241 atggtgcagg gtggtccctc ccggtgtctc ccggcgccca ccccagcccc gtgcgcagcg
      301 ctcggagctc cgcggctcag gatgccggcg acagcgcgtc cgcagccgtc gtgggcgtgg
      361 ggggcacggg acggagcgcg gacgggagtg gctttcgggt ctgccgaggg gcagctggct
      421 cctgacggtg ccccctcccc gcag
//
LOCUS       RATLY6A      1221 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Rat Ly6-A antigen gene, exon 2.
ACCESSION   M30692
KEYWORDS    antigen.
SOURCE      Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA and DNA,
            (library of Clontech), clone RK6.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1221)
  AUTHORS   Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U.
  TITLE     Analysis of three distinct Ly6-A-related cDNA sequences isolated
            from rat kidney
  JOURNAL   Immunogenetics 31, 104-111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Friedman, 14-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept    /   497      834     Ly6-A antigen, exon 2 (put.) (AA at 499)
    IVS     <     1      496     Ly6-A intron A (no splice consensus)
    signal     1204     1209     poly-A signal
BASE COUNT      286 a    315 c    314 g    306 t
ORIGIN      
        1 gtagtccggc tgctggctga gttgtaaggc aggagggagg ctgggtgtgt tttgtcttgc
       61 atgtagccct ctctgcagag ggcctggctt cactcacaca agcctggtaa catctggtac
      121 atcgaactct aagaatcggc aagcccactg ctgccgtctc cttaagagtt catttaggga
      181 gtctgtcagg aacttgggca ggagtccaca ctaagggaag cttacttccc aaacagtggt
      241 gctgggtgga aagtggagga ctcatgagaa cccctagttt aagactttta gagaagcagt
      301 ctgaagcact gtggagatgt ggtcccatcg ccatcctgga gtagggataa ttttgcccag
      361 gagccccagc aatgggtcag aggagcaaaa cgacgacagc tgtaagtggt ctcagaagat
      421 gctagaggaa acagaagatg aactggcagc tgagacttgg cggtaactta ctggcttcga
      481 cactatgcgt gttactctca gggcctaaac tgctacaatt gcacgatgat cccatttggt
      541 aatacctgct catcaactgc tacctgcccc taccctgatg gagtctgtgc tattcaggtg
      601 gcagaagttg ttatgagctc tgtaagacag aaagtaaagg accatatttg ccttcccgtc
      661 tgcccaacga gtcctcaaac aaccgagatc ctgggtactg ttgtcgacat gaagatttcc
      721 tgttgcaata cagatctttg caacgcagca gggcccactg gaggcagcac ctggaccatg
      781 gcaagggtgc ttctgttcag cctgggctca ttcctcctgc agaccttgct gtaatggctc
      841 ctccaaggcc ccgccaccct tgtcctttta tcctcatgtg taatcactcc tccctggagc
      901 cctctagtga taaattctga gtaatagaaa ctctgaggtg ggggtagggt gtggaacacc
      961 ttgtttcaac tctatagccc ctgctgggta ggtgccccac tcccctctct agggctttca
     1021 gatatgtact tcctggaatg ccattatgtt gtggtttgct gctcttggcc ctggaggcat
     1081 gtggacagca cggggaagag acagaaaccc aaggcactgt gtgaccacct ccatccatac
     1141 ataaaaatct ggggtcctgc agggttccca cacatgcctc tcaacatccc cctatttgag
     1201 tccaataaac tctctgttct c
//
LOCUS       RATLY6B       905 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Rat Ly6-B antigen mRNA, complete cds.
ACCESSION   M30689
KEYWORDS    antigen.
SOURCE      Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA, (library of
            Clontech), clone RK10.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 905)
  AUTHORS   Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U.
  TITLE     Analysis of three distinct Ly6-A-related cDNA sequences isolated
            from rat kidney
  JOURNAL   Immunogenetics 31, 104-111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Friedman, 14-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept        103      510     Ly6-B antigen (put.)
    signal      878      882     poly-A signal
    signal      599      604     poly-A signal
BASE COUNT      206 a    248 c    213 g    238 t
ORIGIN      
        1 ctcttgctct cctccagcca caagtggtct cagaagatgc tagaatgtag aggaaacaga
       61 agatgaactg gcaggttttg cctgtgcgcc ccttctcaga ggatgaacag atcttgtgct
      121 atgaagtcct gtgtgctcat ccttctcctg gccctactgt gtgcagaaag agctcagggg
      181 ctaaactgct acaattgcac gatgatccca tttggtaata cctgctcatc aactgctacc
      241 tgcccctacc ctgatggagt ctgcactatt caggtggcag aagttgttgt gagctctgta
      301 agactgaaag taaagagcaa tctctgcctt cccggctgcc ccaagagtcc tcaaacacct
      361 gaggtcctcg gtaccgttgt ccatgtgaat actgactgtt gcaatacaga tctttgcaac
      421 gcagcaggtc ccactggagg cagcacgtgg accatggcag gggtgcttct gttcatcctg
      481 ggctcagtcc tcctgcagac cttgctgtga tggaccctcc aaggccctgc cacccttgtc
      541 cttttatcct tatgtgtaat cactccttcc tggagccctc tagtgataaa ttctgagtaa
      601 taaaaattca gaggggggat tgagtgtgga acaccttgtt gcaactctat agccactgct
      661 ggataggttc cccactcccc tctctagggc tttcagatat gtacttccta gaatgccatt
      721 gtgttttggt ttgctgctct tggccctgga ggcaggggac agcacgggga agaggcagaa
      781 acccaaggca ctgtgacacc acctccatcc atacataaaa atctggggtt ctgcagggtt
      841 cccacacatg cctctgaaca tccccctatt tgagtccaat aaactctctg ttctcccacg
      901 gaatt
//
LOCUS       RATLY6C       931 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Rat Ly6-C antigen gene, complete cds.
ACCESSION   M30690
KEYWORDS    antigen.
SOURCE      Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA and DNA,
            (library of Clontech), clone RK3.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 931)
  AUTHORS   Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U.
  TITLE     Analysis of three distinct Ly6-A-related cDNA sequences isolated
            from rat kidney
  JOURNAL   Immunogenetics 31, 104-111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Friedman, 14-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept         76      262     Ly6C antigen, exon 1 (put.)
                342      559     Ly6C antigen, exon 2 (put.)
    IVS         263      341     Ly6C antigen intron A (no splice consensus)
    signal      927      931     poly-A signal
BASE COUNT      202 a    257 c    224 g    248 t
ORIGIN      
        1 gccctgggac gtaattggaa gtctattaac tggctccaat ttccaaggtt ttctctgtgc
       61 accccttctc tgaggatgaa cagttcttgc gctatgaagt cctgtatgct catctttttc
      121 ctggccctac tgtgtgcaga aagagctcag ggcctaaagt gctacagttg catagaagtc
      181 ccacttaatg ctaactgctc aacagctacc tgcccctact ctgatggagt gtgtgtttct
      241 caggtgttag aagctgtaga gggtctccta gatgcaactt cccagggaac tgcaagagtc
      301 tgagaggctg gttgcccttt ttgctctgcc actgagtgat cgctctgtaa gacggacagc
      361 aaagagcaat ctctgccttc caatctgccc caagtttcct caaagaaccg agatcctggg
      421 taccgttgtc tacacgaagg tttcctgttg caatacagat ctttgcaatg cagcaggtcc
      481 cactggaggc agcacctgga ccgtggcagg ggtgcttctg ttcagcctgg gctcagtcct
      541 cctggagacc ttgctgtgat ggcccctcca aggccccgcc acccttgtcc ttttagcctc
      601 atgtgtaatc actcctctga agccctctag tgataaattc tgagtaatag aaactcccag
      661 gtgggggtag ggtgtggaac accttgattc aactctatag cccctgctgg gtaggtgccc
      721 cactcccctc tctaggactt tcagatctgt acttcctgga atgccattgt gttgtggttt
      781 gctgctcttg gccctggagg cacatggaca gcacagggaa gaggcagaaa cccaaggcac
      841 tgtgacacca cccccatcca tacataaaaa tctggggttc tgcagggttc ccacacatgc
      901 ctctcaaggt tcccctattt tagtccaata a
//
LOCUS       RATLY6CA      783 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Rat Ly6-C antigen mRNA, exon 2.
ACCESSION   M30691
KEYWORDS    antigen.
SOURCE      Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA, (library of
            Clontech), clone RK11.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 783)
  AUTHORS   Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U.
  TITLE     Analysis of three distinct Ly6-A-related cDNA sequences isolated
            from rat kidney
  JOURNAL   Immunogenetics 31, 104-111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Friedman, 14-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept    /     1      403     Ly6-C antigen, exon 2 (put.) (AA at 2)
    signal      771      783     poly-A signal
BASE COUNT      173 a    219 c    187 g    204 t
ORIGIN      
        1 gaacagttct tgcgctatga agtcctgtat gctcatcttt ttcctggccc tactgtgtgc
       61 agaaagagct cagggcctaa agtgctacag ttgcatagaa gtcccactta atgctaactg
      121 ctcaacagct acctgcccct actctgatgg agtgtgtgtt tctcaggtgt tagaagctgt
      181 agagggctct gtaagacgga cagcaaagag caatctctgc cttccaatct gccccaagtt
      241 tcctcaaaga accgagatcc tgggtaccgt tgtctacacg aaggtttcct gttgcaatac
      301 agatctttgc aatgcagcag gtcccactgg aggcagcacc tggaccgtgg caggggtgct
      361 tctgttcagc ctgggctcag tcctcctgga gaccttgctg tgatggcccc tccaaggccc
      421 cgccaccctt gtccttttag cctcatgtgt aatcactcct ctgaagccct ctagtgataa
      481 attctgagta atagaaactc ccaggtgggg gtagggtgtg gaacaccttg attcaactct
      541 atagcccctg ctgggtaggt gccccactcc cctctctagg actttcagat ctgtacttcc
      601 tggaatgcca ttgtgttgtg gtttgctgct cttggccctg gaggcacatg gacagcacag
      661 ggaagaggca gaaacccaag gcactgtgac accaccccca tccatacata aaaatctggg
      721 gttctgcagg gttcccacac atgcctctca aggttcccct attttagtcc aataaactct
      781 ctg
//
LOCUS       RATTAG1      5040 bp ss-mRNA            ROD       10-JUL-1990
DEFINITION  Rat axonal glycoprotein (TAG-1), mRNA, complete cds.
ACCESSION   M31725
KEYWORDS    glycoprotein.
SOURCE      Rat 13 day old embryo spinal cord axon, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5040)
  AUTHORS   Furley,A.J., Morton,S.B., Manalo,D., Karagogeos,D., Dodd,J. and
            Jessell,T.M.
  TITLE     The axonal glycoprotein TAG-1 is an immunoglobulin superfamily
            member with neurite outgrowth-promoting activity
  JOURNAL   Cell 61, 157-170 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Furley,A.J.W., 30-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept        224     3346     axonal glycoprotein (TAG-1) precursor
    sigp        224      313     axonal glycoprotein signal peptide
    matp        314     3343     axonal glycoprotein
BASE COUNT     1144 a   1486 c   1397 g   1013 t
ORIGIN      
        1 gaattcccgc ccgctgccgc cacgccagga cagccagtgg ctaaggccgg cggggcaagc
       61 agccctgagg ctggcagcag ggtctgctca ccaggcggcc gcagcagtgc cccagccaac
      121 acccttcccg cactctaggt gtgcctgagt ctccagttga ttctcccgga gcggagctgc
      181 ggctcctctc ttttggactc tgcctctgcc tgaaagaccc accatgggga cacacgccag
      241 gaaaaaggca agcttgctgc tgctggtgct ggccacagtg gccctggtct cctctccagg
      301 atggagtttt gcccagggaa ccccagctac ctttggaccc atcttcgaag agcaacccat
      361 tggcctgcta ttcccagagg agtctgcaga ggatcaggtg acactggcgt gccgtgcccg
      421 tgctagccct ccagccacct acaggtggaa gatgaatggc acagatatga acctggaacc
      481 tggctcccgt caccagctga tggggggcaa cctggtcatc atgagcccca ccaagacaca
      541 ggatgctggt gtctaccagt gcctagcctc caacccagta ggcactgtgg tcagcaagga
      601 ggctgtcctc cgctttggct ttctacagga attctccaag gaggagagag accctgtgaa
      661 aacccatgag ggctggggag tgatgctgcc ctgtaacccg cctgcccatt acccaggttt
      721 gtcctaccgc tggctcctca acgagttccc caacttcatc ccaacggatg ggcgacactt
      781 cgtgtcccag actacaggaa acctgtacat cgcccggacc aatgcctcag acctgggcaa
      841 ctactcttgt ttggctacca gccacatgga cttttccacc aagagtgtct tcagcaaatt
      901 tgcgcagctc aacctggctg cggaagatcc ccgactcttc gctcccagta tcaaagctcg
      961 gttccccccg gagacctacg cactagttgg gcagcaagtc accctggagt gctttgcctt
     1021 tgggaacccg gttccccgga tcaagtggcg caaagtggat ggttccttgt cccctcagtg
     1081 ggccacagct gagcccaccc tgcagatccc cagcgtgagc tttgaagacg agggtaccta
     1141 tgaatgtgag gcagagaact ccaagggtcg tgacaccgtc cagggacgca tcatcgtgca
     1201 agctcagcct gagtggctaa aggtgatctc agacacagag gccgacattg gctccaactt
     1261 acgttggggc tgtgcagcag caggcaaacc ccggcccatg gtgcgctggc tgagaaacgg
     1321 ggaacctctg gcctcccaga accgggtgga ggtcttggct ggggacctgc gattctctaa
     1381 gctgagcctg gaggactctg gcatgtacca gtgtgtggct gaaaacaagc atggcaccat
     1441 ctatgccagt gctgagctgg ctgtacaagc tctggcccca gacttcaggc agaaccctgt
     1501 gagacggctg atccctgcag ctcgaggcgg agagatcagc atcctgtgcc agcctcgcgc
     1561 agccccaaaa gctacaatac tttggagcaa gggtactgag attttgggga acagtaccag
     1621 agtgactgtc acttccgatg gcaccttgat catcagaaac atcagccgat ccgatgaagg
     1681 caaatatacc tgctttgctg agaacttcat gggcaaagcc aacagtaccg ggatcctgtc
     1741 cgtgcgcgat gcaaccaaga tcaccctggc tccctccagt gctgacatca acgtgggtga
     1801 caacctgacc ctacaatgtc atgcctcgca cgaccccact atggacctca cgttcacctg
     1861 gaccctggat gatttcccta ttgactttga taagcctgga ggtcactacc ggagagccag
     1921 tgcgaaggag accattgggg acctgactat cctcaacgcc cacgtacgcc atggagggaa
     1981 gtacacatgc atggcccaga ctgtggtaga tggtacatcc aaggaggcca cagtcctggt
     2041 ccgaggtccc ccaggtcccc cagggggtgt ggtggtgaga gacatcggag acaccaccgt
     2101 tcagcttagc tggagtcgtg gctttgacaa ccacagcccc attgccaagt acacgctgca
     2161 agctcgtact ccaccctcgg ggaaatggaa gcaggttcgg accaatcctg tgaatatcga
     2221 gggtaatgcc gagactgccc aggtgctggg tctcatgcct tggatggact atgagtttcg
     2281 ggtttcagct agcaacatct tgggcactgg ggagcccagc gggccctcca gcaaaatccg
     2341 cactaaggaa gcagtcccct cagtggcacc atcgggactc agtggagggg gaggagcccc
     2401 tggagagctc atcatcaact ggactcccgt gtcacgggag taccagaacg gagacggctt
     2461 cggctacctg ctgtccttcc gcaggcaagg cagctccagc tggcagactg cccgggtgcc
     2521 tggcgctgat gcgcagtact tcgtctacgg caatgacagc atccagccct acacaccctt
     2581 tgaggtcaag atccgaagct acaatcgccg gggggatggg cccgagagcc tcactgcgtt
     2641 agtgtactca gcagaggaag agcccagggt ggcccctgcc aaggtctggg ccaaggggtc
     2701 ctcatcttca gagatgaacg tgagctggga gcctgtgcta caagacatga acggcattct
     2761 cctgggatat gagattcgct actggaaagc cggggacaac gaagcagccg ctgaccgagt
     2821 gaggacagca gggctagaca ccagtgcccg agtcactggc ctgaacccca acaccaaata
     2881 ccacgtaact gtgagggcct acaaccgggc cggcactgga cccgctagcc cttcagctga
     2941 tgccatgacc gtgaagcccc cgccacggag acctcctggc aacatctcct ggactttctc
     3001 aagctccagt ctcagcctta agtgggaccc tgtggttcct ctccgaaatg aatctacggt
     3061 cactggctac aagatgctgt atcagaatga tttgcaccca actcctacgc tccacctcac
     3121 cagcaagaac tggatagaaa taccagtacc cgaagacatt ggccacgctc tggtacagat
     3181 tcgaaccaca gggcctggag gggatgggat ccccgcagaa gtccacattg tgagaaatgg
     3241 aggcacaagc atgatggtgg agagcgccgc cgcccgccct gcccatcccg gacctgcgtt
     3301 ctcctgcatg gtgatattga tgctcgctgg ctaccagaag ctctgatctc aacactgccc
     3361 gccacgccca agctggacac ccaccctaac agacacagcg gctgaccaca gctccctttc
     3421 gtccaaggtg gtccaacact gtgcctgagc gtggttggct tagacaccta ctcccaacag
     3481 taccctttat gtaggaggta ggatattcct attctgccac aggatagaac catgcgagga
     3541 aattttcttt aagtcaagag gcactgggca gtgacttcca tgataatagt actaggccta
     3601 atgcctggac cccttggggt cttggtcgaa aggaacgggc ctttgattaa gcagatggtc
     3661 ctttggggcc acaagtggca ctgccatctg agatcagagt accaggccca gcaggaacat
     3721 gggcagcagt ggggtattgt tttccctcta tgaagcagag ggacctcttc tagtcctcac
     3781 tggagaagca ccatggttgg tcccgacacg gtcttccatg actccctggc ttcctcggta
     3841 gccaaggaca aggccctggg ttactgggga tagaagctca aaagggttga gaggctaccc
     3901 cacccgatgg aaaggggcac cagcctaagc ccattggcca tcctggtggc actgccctct
     3961 cagccagcac tgccaagcca atcctgtcgt cctccagatg gaatggtgga gtgacagagc
     4021 cacttcaggt ggctatgtga ctaaagggct tgcctcgagg agttgccttg cctcatcaag
     4081 atgcttcctt catggaccct ccagggtacg ggcaggagat gtccatctga acgctactct
     4141 cttcccttca gctctgctgc aaacttgtgc ctgcctccac ctcccacaac tgcaggcccc
     4201 agaaatcagc tctcaacaca gcatccattc tttgtcctgg gatagagagg catccgagaa
     4261 gggccagcat caaagtggcc ctgcctgctt ccaggaatat cctccatcac ctggccacac
     4321 ctgctcccca gaactgcctg gactactctc ttcagtcccc acaagaaaaa gggttaataa
     4381 gggggggggg ggtggcctgc cttgagttct gggtagttac cagggataga ccagactacg
     4441 ggagctgaag aagccttata acttgactta tccgtaccct acacttaaca gacgaggaaa
     4501 tggaggtgca gaagggttag ggacttcttg ggggtcacat ggtctgtaag gacaaggcat
     4561 ggtcagcaca gggtctcctc cccacctgtg ggaggctcta tagagagagg gaggatgttg
     4621 agcagtcaca gcctgtcctc taggactctg gaggactctg gaggaggagc cctctgcttc
     4681 aagaggttct ggctggtgag atggacaaat gagctccaac caaggcatag gcagattcca
     4741 ggagtcaatg gcctggggca gccttctgct gggaactcgg cagggagcac tgtctggaag
     4801 cctctcgggc ttgctcattt caagaagagg ccaaagcaag gacagagttc cttagacgag
     4861 gaccctgcag cagcacgacc agaaaacccc agtgtccacg ccctcagccc acgggggcag
     4921 cagagcaggc atttcaagat gcacttgccc tgctgctcct taggccattt ctgtagttta
     4981 cagttagagc tctattttgt tatgggtttt taaacttcaa gccttgctct gtttttctgg
//
LOCUS       MUSADAM01    2308 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 1 (non-coding).
ACCESSION   M34242 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     1 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2308)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pre-msg     885  >  2308     adenosine deaminase (ADA) mRNA and introns
    IVS        1008  >  2308     ADA intron A
BASE COUNT      479 a    615 c    698 g    516 t
ORIGIN      Chromosome 2.
        1 cccacctcaa ggtgcgcaca agttacttaa ggaacttgct acaatatagc cctgctcccg
       61 cccccaaaat cccaccaaac ctagagtatg gttctaaaca gctcacctgt taagtctcct
      121 tggccaatcc tctagaagtt gaccatagta tgaagttttc tgcagcgtag tttttttctg
      181 cccccctttc actactgtgt ctgagcacat gtgctgtgct ttgtagctga aactggcttt
      241 attgctgcag aaaccagtcc actgtattta cccacagcac tgatgtgagc attctaaata
      301 catctcgatg cgtgggcata tttatccagc gtaactgccc caggagagat gaactgtgtg
      361 ttcctgtcca ccccctgtat cagcacctga gactagtctc agagtctctc tcacacacaa
      421 cagtgttctc tgcatcccac ccgccctcac ctggtgaact ccggcagtcg ccgctaaatc
      481 tccctaatta cacacttctt ctgccttgtg attctgcaac aagtgggtct atccctcaaa
      541 atccagcccc ataaggcttc aggactgtgt ggctccagct tcagcctgca caaagtaggc
      601 gcccaagcaa cactggaagc ctcggtactg aaggggcccg gaaggggcag gtgagacatt
      661 ggagtcacgt ctgcaggggg ctcacctggg agcttcctag ggtgtagcca gcagggaagg
      721 tctggggttc agaattccgg gaaatgcgcg ccagagttgc aggcgggggg gggggggggg
      781 ggggggcggg gccgtggctc cggaaggcgg ggtctctctg tgggcgtagc gtgggcgggg
      841 ctgtgcgggg cagcccggta aaaaagagcg tggcgggccg cggtctctga gagccatcgg
      901 gaagcgaccc tgccagcgag ccaacgcaga cccagagagc ttcggcggag agaaccggga
      961 acacgctcgg aaccatggcc cagacacccg cattcaacaa acccaaagta agcaccgagg
     1021 ggctccgttg ccagggttct gtcgggctgt cccggggctt agcggggccc acctttggcg
     1081 cctttaacct agaagcatgg agtggcaggg ggactcccgc aggcatctcc cctcgaccca
     1141 ggccttagct tgcttccggg atgtcgagcg agagacgatg tggcagggag tgtccagaag
     1201 ggctccgttg ccagggttct gtcgggctgt cccggggctt agcggggccc acctttggcg
     1261 cctttaacct agaagcatgg agtggcaggg ggactcccgc aggcatctcc cctcgaccca
     1321 ggccttagct tgcttccggg atgtcgagcg agagacgatg tggcagggag tgtccagaac
     1381 ctgggggtgt ctctggtcgg ccttcgggtt cggctgctgt ctatgcgaac ctgggagtgc
     1441 ctccagtcgg ccttcgggtt cggctgctgc ctatgccctg tgccctggag gtctcagcct
     1501 cgctgtctgc caatgggcat ccagtgcggc ggggctgcac agctgtgtgg gactgggcta
     1561 ggacctgggt gtctgagccc cagtagaatg gggcccaggg tctctagctg ttaaatgttc
     1621 agtgtatggc tttatactta agtgttatga ttactttctg ggcaacaggt aacctaggtt
     1681 tgtgggtgcg cccgtgggaa aatctatgat ccaaaccaga aaaggaaggg atagaggctt
     1741 cagggtgcca ggaggaaccc ctacacatac tgaccgtttg gccatatggg tttatttggg
     1801 atgaagtttt agcccattga ccccagagga gaacccttta tctgtctttc tgcaagctgt
     1861 ggcttcttgg aaacagggag actccaggtc cccaaggcca gatttgcagc ccttacagat
     1921 tctgtctagt cagccaggca aattgaactg gtcagcagaa gtgtgggact gagaactcag
     1981 ggggagggat cagagacagt cacccttaga cttacccctc caagaaacag atgctgagtg
     2041 gggggcgggg tggcagacgt atgaatcccg tgtgcatgtt gtgtcatata tgcgtgcatg
     2101 gagggagcgg gagggaagat gggcagtggg cctgtattcc atgcacttac catagggaac
     2161 acactctgcc cctctagcta gaggctagaa gggcagggca agtcttccta cccaaccaat
     2221 gcctgctgca catcttgtct ggtggctcct gaccacagtt ggtgctctta gacatcaaag
     2281 ggtgagtttt cttttgatgg tctgaatt
//
LOCUS       MUSADAM02     207 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 2 (non-coding).
ACCESSION   M34243 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     2 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 207)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >   207     adenosine deaminase (ADA) mRNA and introns
    IVS     <     1       78     ADA intron A
    IVS         141  >   207     ADA intron B
BASE COUNT       39 a     58 c     54 g     56 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 1.
        1 gctcctcggg ctctgtggtg gcttctgagg tgtcctctgg ctctgtggta tctcacgctc
       61 tttttctgtc ccttgcaggt agagttacac gtccacctgg atggagccat caagccagaa
      121 accatcttat actttggcaa gtaagtccaa ggacaaccac agaccttccc aggattgcag
      181 agcgtgtaca gctcttcttg gggggcc
//
LOCUS       MUSADAM03     382 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 3 (first expressed
            exon).
ACCESSION   M34244 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     3 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 382)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        235  +   299     adenosine deaminase (ADA, EC 3.5.4.4), exon 3
                                 (first expressed exon)
    IVS     <     1      176     ADA intron B
    IVS         300  >   382     ADA intron C
BASE COUNT       86 a    117 c     94 g     85 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 2.
        1 aacacacaca tgcctgatgc cagcaaagga ggcctgaagg cattggtacc cctggaatta
       61 gagttacagc tggtcatggg cctccatgtg ggtctcgtct tctgcaagaa cagccagtgt
      121 gctcttaccc accaagccct ggtgcagccc ctcacccttg actttatttt taggaggaag
      181 agaggcatcg ccctcccggc agatacagtg gaggagctgc gcaacattat cggcatggac
      241 aagcccctct cgctcccagg cttcctggcc aagtttgact actacatgcc tgtgattgcg
      301 taagttgctc cccaaccctt gtgccccaca gtagcatcca tccctataac caaggtcagg
      361 cctgagctgc tgctgtacaa gg
//
LOCUS       MUSADAM04     346 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 4.
ACCESSION   M34245 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     4 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 346)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pept    +    93  +   227     adenosine deaminase (ADA), exon 4
    IVS     <     1       92     ADA intron C
    IVS         228  >   346     ADA intron D
BASE COUNT       77 a     85 c    111 g     73 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 3.
        1 acagttgtag ttacctcgtt ggctactaga cgtcccaagg agctgagaaa ggttgccaac
       61 ctgtgttctt cttcccttcc caggggctgc agagaggcca tcaagaggat cgcctacgag
      121 tttgtggaga tgaaggcaaa ggagggcgtg gtctatgtgg aagtgcgcta tagcccacac
      181 ctgctggcca attccaaggt ggacccaatg ccctggaacc agactgagtg agtgacatca
      241 ctggaggggg ctgtgctgag cggggctctg agctgaggat ggagtgctta gagccctggc
      301 ctggtccatg gactcagagc gactcagctc agtcctaagt gcacga
//
LOCUS       MUSADAM05     385 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 5.
ACCESSION   M34246 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     5 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 385)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pept    +   115  +   230     adenosine deaminase (ADA), exon 5
    IVS     <     1      114     ADA intron D
    IVS         231  >   385     ADA intron E
BASE COUNT       84 a    115 c     94 g     92 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 4.
        1 tctccatcta gaaatagaag ggcagagaga catcactaca tccctgctcc agttccatgg
       61 ctgcccatgg tcttcccttg gcctaaagtc ctccctcttc ctctctccac acagagggga
      121 cgtcacccct gatgacgttg tggatcttgt gaaccagggc ctgcaggagg ggaggcaagc
      181 atttggcatc aaggtccggt ccattctgtg ctgcatgcgc caccagccca gtgagtaccg
      241 ccgcaccctg ctggctgcct ggcctataac aaggtggacc gactatccag cgtccccacc
      301 tcgtatttct agagttttct aaaaaacacc tgtgaacttt tggtgactct ggtgagtcct
      361 taacaggaaa ttgggacttg cacag
//
LOCUS       MUSADAM06     189 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 6.
ACCESSION   M34247 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     6 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 189)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pept    +    18  +   145     adenosine deaminase (ADA), exon 6
    IVS     <     1       17     ADA intron E
    IVS         146  >   189     ADA intron F
BASE COUNT       37 a     39 c     75 g     38 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 5.
        1 ggcccgtgcc cctgcaggct ggtcccttga ggtgttggag ctgtgtaaga agtacaatca
       61 gaagaccgtg gtggctatgg acttggctgg ggatgagacc attgaaggaa gtagcctctt
      121 cccaggccac gtggaagcct atgaggtggg cctgagaagg ggagggtggc cctgggggag
      181 cttgggtag
//
LOCUS       MUSADAM07     307 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exons 7 and 8.
ACCESSION   M34248 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     7 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 307)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pept    +    11       82     adenosine deaminase (ADA), exon 7
                164  +   265     adenosine deaminase, exon 8
    IVS     <     1       10     ADA intron F
    IVS          83      163     ADA intron G
    IVS         266  >   307     ADA intron H
BASE COUNT       68 a     86 c     85 g     68 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 6.
        1 tcccttccag ggcgcagtaa agaatggcat tcatcggacc gtccacgctg gcgaggtggg
       61 ctctcctgag gttgtgcgtg aggtaaggag ccagtgaccc cgggcctctt cttcctgatt
      121 ctgttcctgt ccctggactc acctcctctc tgcttctcca caggctgtgg acatcctcaa
      181 gacagagagg gtgggacatg gttatcacac catcgaggat gaagctctct acaacagact
      241 actgaaagaa aacatgcact ttgaggtgag acgccaaggc agagagagtg agctctggct
      301 accccgt
//
LOCUS       MUSADAM08     249 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 9.
ACCESSION   M34249 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     8 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 249)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pept    +   112  +   176     adenosine deaminase (ADA), exon 9
    IVS     <     1      111     ADA intron H
    IVS         177  >   249     ADA intron I
BASE COUNT       56 a     77 c     60 g     56 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 7.
        1 ctgaggcaat gaagcacaaa gctatccaga atagaacctc agctgggctc agccctgacc
       61 agtctggccc cggccactat gccagccagc cacacatcct gccccttgca ggtctgcccc
      121 tggtccagct acctcacagg cgcctgggat cccaaaacga cgcatgcggt tgttcggtga
      181 gatctggttc cgggacccat tttgttttga ttccggaatt cacctatagt gagtcgtata
      241 aattcgtaa
//
LOCUS       MUSADAM09     340 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 10.
ACCESSION   M34250 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     9 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 340)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pept    +   141  +   270     adenosine deaminase (ADA), exon 10
    IVS     <     1      140     ADA intron I
    IVS         271  >   340     ADA intron J
BASE COUNT       81 a     77 c     92 g     90 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 8.
        1 ttaatacgag aatgcaaccc tttgtgttgt ctaaggttgt ataaagatgg aagagggagg
       61 tggtggaagg gcagtgatgg ttcttggagt gaagaggctc tctctctctc tcttttcttc
      121 ctgcctggcc cctcccccag cttcaagaat gataaggcca actactcact caacacagac
      181 gaccccctca tcttcaagtc caccctagac actgactacc agatgaccaa gaaagacatg
      241 ggcttcactg aggaggagtt caagcgactg gtgagtatgt gtgagctatg agcctgacac
      301 tggcccaggt gtgtgtgtgt gtgtatatgt gtgtgtgtgt
//
LOCUS       MUSADAM10     279 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 11.
ACCESSION   M34251 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     10 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 279)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pept    +    88      171     adenosine deaminase (ADA), exon 11
    IVS     <     1       87     ADA intron J
    IVS         188  >   279     ADA intron K
BASE COUNT       67 a     88 c     61 g     63 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 9.
        1 ggatctgttt cccccactat gatgcccttg cccttgctaa cagggctgct tccttccttg
       61 tcctgactcc atgtttcccc cttctagaac atcaacgcag cgaagtcaag cttcctccca
      121 gaggaagaga agaaggaact tctggaacgg ctctacagag aataccaata gccaccacag
      181 actgacggta cgcttgtgca gggcgcaata accaccccac cacactgtcc tccttaactc
      241 tgtgcgattg tggcagaagt cttgggcagg agcacacct
//
LOCUS       MUSADAM11     442 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 12 (non-coding).
ACCESSION   M34252 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     11 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 442)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >   287     adenosine deaminase (ADA) mRNA and introns
    IVS     <     1       69     ADA intron K
    signal      282      287     poly-A signal
BASE COUNT       97 a    111 c    114 g    120 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 10.
        1 ttctgtgctt ctaccatgcc ttacatgtca tgagacctga cctttctatt tctctgactt
       61 gaccagcagg gcgggtcccc tgaagatggc aaggccactt ctctgagcct catcctgtgg
      121 ataaagtctt tacaactctg acatattgac cttcattcct tccagacctt ggagaggcca
      181 ggtctgtcct ctgattggat atcctggcta ggtcccaggg gacttgacaa tcatgcacat
      241 gaattgaaaa ccttccttct aaagctaaaa ttatggtgtt caataaagca gctggtgact
      301 ggtatcttgc agcacatggt gaatacggtc tcggggctgc tggctaggat gctaagaaag
      361 gaggagcctg ggccctacgc tgagtgtcag gtctggggag ctagggtctc ttccgcaggt
      421 cgactctaga gatccccggg ct
//
LOCUS       TRBMVAT5A    1664 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  T.brucei MVAT5-like variant surface glycoprotein mRNA, complete
            cds.
ACCESSION   M33825
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1664)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. 169, 730-736 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Donelson, 23-APR-1990.
FEATURES       from  to/span     description
    pept         43     1596     MVAT5-like variant surface glycoprotein
BASE COUNT      533 a    435 c    395 g    301 t
ORIGIN      
        1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt
       61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt
      121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa
      181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg
      241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt
      301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa
      361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac
      421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg
      481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc
      541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg
      601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca
      661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca
      721 gcaaacttcg gccagtcgat ccctttcgca ggcggctatc taacaaggaa cccgacagcc
      781 aacacagcca gcagcgccga cggtacggac tttgtaagca accccgaaga cagcaagata
      841 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca
      901 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat
      961 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt
     1021 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac
     1081 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa
     1141 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac
     1201 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc
     1261 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc
     1321 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat
     1381 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca
     1441 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc
     1501 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta
     1561 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa
     1621 gaatttttgc tactttaaaa acttctgata tattttaaca ccta
//
LOCUS       TRBWRATATA   1544 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  T.brucei WRATat A variant surface glycoprotein mRNA, complete
            cds.
ACCESSION   M33823
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1544)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. 169, 730-736 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Donelson, 23-APR-1990.
FEATURES       from  to/span     description
    pept         42     1457     WRATat A variant surface glycoprotein
    mRNA    <     1     1544     WRATat A mRNA
BASE COUNT      545 a    387 c    345 g    267 t
ORIGIN      
        1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc
       61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt
      121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac
      181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg
      241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg
      301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc
      361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa
      421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct
      481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca
      541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg
      601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac
      661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca
      721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa
      781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac
      841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg
      901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt
      961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg
     1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca
     1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg
     1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc
     1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag
     1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt
     1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga
     1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg
     1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta
     1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg
//
LOCUS       TRBWRATATB   1585 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  T.brucei WRATat B variant surface glycoprotein mRNA, complete
            cds.
ACCESSION   M33824
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1585)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. 169, 730-736 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Donelson, 23-APR-1990.
FEATURES       from  to/span     description
    pept         41     1504     WRATat B variant surface glycoprotein
BASE COUNT      535 a    384 c    376 g    290 t
ORIGIN      
        1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact
       61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga
      121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa
      181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac
      241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga
      301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat
      361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga
      421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc
      481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca
      541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc
      601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg
      661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag
      721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa
      781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga
      841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact
      901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca
      961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca
     1021 cgacctcata acagctaaat ccggtacgac actaagcggc gcaatcactc ggctaaagca
     1081 actacaaatc gccaaagcaa agctaaaagc aagacggcta ctgctgcaaa acagggaacg
     1141 gcagcaaacg cgacttatgg cgctagcaga caagatgcaa gaattgtacc aagaggcctt
     1201 acatgacgag gttcaactca ggaaggaagc gcagaacaaa cctcaagaaa caccagattc
     1261 tgacaagcaa aaagcatgcg agaaatatca caacaagtca aaggaatgca aagaaaatgg
     1321 ttgccaatgg agtggaactg aagaaaccat aggaaagtgc gaagctaaac ccaaagcagg
     1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac
     1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt
     1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact
     1561 tctggtatat tttaacacct ttaaa
//

GenBank-Updates@genbank.bio.net (07/11/90)

LOCUS       CDCXYNAB     6067 bp ds-DNA             BCT       11-JUL-1990
DEFINITION  C.saccharolyticum xylanase A (XynA), beta-xylosidase (XynB) and
            acetyl esterase (XynC) genes, complete cds.
ACCESSION   M34459
KEYWORDS    acetyl esterase; beta-xylosidase; xylanase.
SOURCE      C.saccharolyticum DNA, clone pNZ1400.
  ORGANISM  Caldocellum saccharolyticum
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 6067)
  AUTHORS   Luethi,E., Love,D.R., McAnulty,J., Wallace,C., Caughey,P.A.,
            Saul,D. and Bergquist,P.L.
  TITLE     Cloning, sequence analysis, and expression of genes encoding
            xylan-degrading enzymes from the thermophile "Caldocellum
            saccharolyticum"
  JOURNAL   Appl. Environ. Microbiol. 56, 1017-1024 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        195     1223     xylanase/beta-xylosidase (XynA) precursor
    sigp        195      293     xylanase/beta-xylosidase signal peptide
    matp        294     1220     xylanase/beta-xylosidae
    pept       1257     2057     acetyl esterase (XynC)
    pept       2198     2491     ORF 3
    pept       2491     3429     ORF 4
    pept       3445     4911     beta-xylosidase (XynB) (3445 could be 3463)
    pept       5439  >  6067     ORF 6
BASE COUNT     2230 a    787 c   1243 g   1807 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccccgc aaagcctaaa ataagtacat ttagaatgat ggcagaaaat ggttatatta
       61 cccttgaatt tacgttaagt aaaaatgctg tggtgctttt tgaggtaagc aaggttgtag
      121 atgagtcaga tacttatata ggacttgacg atagtaaaat accaggttat tagttgcttt
      181 ataaaataaa aggaatgagg tgtttaattg tgtgcgaaaa tttagagatg ctaaacttat
      241 cattagcaaa aacatacaaa gattacttta aaataggtgc tgcagtaact gcgaaagatt
      301 tagaaggagt tcatagggat attcttttga agcattttaa tagcctcaca ccagaaaatg
      361 ccatgaagtt tgaaaatatt catccagaag agcagagata taattttgaa gaggttgcca
      421 ggataaaaga gtttgcaatt aaaaatgaca tgaagttaag aggacataca tttgtttggc
      481 ataatcaaac tccggggtgg gtgtttttag ataagaatgg ggaagaagcc tcaaaagagt
      541 tagttattga aaggttaaga gagcatataa aaactttgtg tgagagatac aaggatgtag
      601 tatatgcgtg ggatgtggtg aacgaagcag tagaagataa aacagaaaag cttttgcgag
      661 aatcaaactg gagaaaaatt attggagatg attatattaa aattgctttt gagatagcaa
      721 gagaatatgc aggagatgca aagttatttt ataacgatta taacaatgaa atgccttata
      781 aattagaaaa aacctacaaa gttctaaaag agcttttaga aagaggtact ccaatagatg
      841 gaattggtat acaagcacac tggaatatat gggataaaaa tcttgttagt aatttaaaaa
      901 aggctataga agtatatgct tccttaggtt tagaaattca tattacagaa cttgacattt
      961 cagtatttga gtttgaagat aagaggactg acttgtttga accaaccccg gaaatgcttg
     1021 aactacaagc aaaagtatat gaagatgtat ttgcagtttt tcgagaatat aaagatgtaa
     1081 taacttctgt tacattatgg ggtattagcg acagacacac atggaaagat aacttccctg
     1141 taaagggtcg aaaagattgg cctctcttat tcgacgtaaa tggaaaacca aaagaagcct
     1201 tgtacaggat attaagattt taaagatttt ttaacgaaga aaggggttct tttaatatgg
     1261 ctatcatgca aatcaacttt tattcaaaga tgttgaaaaa gaacacaaca attttggcca
     1321 ttttacccgt agataaacca gataagaaat tccagaaaga tgttgatagt gaaaatttga
     1381 aaaccttata tcttttgcat ggttatgctg gtaactacat ggattggttg tgtggagccc
     1441 gaattgttga attatcaatg cgatataatg ttgctgtgtt tctgccatca ggtgaaaata
     1501 gtttttattt agatgatgaa gaaaaggaag aatattttgg tgaatttgtg ggaaatgaaa
     1561 ttatagaatt tacaagaagc gtttttccta ttcctcaaaa aagggaaaaa acttttattg
     1621 gcggtttatc aatgggaggt tacggtgctc ttagaaatgg gcttaaatat aacaagaatt
     1681 ttgtaggtat aatagcttta tcatcagcac taataattca taagattgca ggtattccta
     1741 aggattatag gaatgcttat gcaagttata actattatag acgagtgttt ggagacctaa
     1801 actctttaat aggtagcgat aaagacataa atgccttagt tactaagcta aaacaagaaa
     1861 aaggtagtat tccaaaaata tacatggcat gcggcagaga tgacttttta gttcaagaaa
     1921 acagagattt atttaatttt ttgaaaaatg aaggtataga cgtggtttat gaggaagacg
     1981 aaggtggaca tgactgggat ttttggaaca aatatattgc aaatgctttt gagtggatga
     2041 gtaaggtttc tgattaagtc ttcacgtacc ctgttttaag ttttacaaat agatttgtgg
     2101 ggtgaatagg tttttttaac actattttat taaggaagag gatgaaaaat aaaaaaagtg
     2161 gacaaatttc ttgttaattg taattacatg cattgcaatg gttttctttt ttacatcgtg
     2221 tactattcag tctgctatag agcagaagaa aactgttgag gaaatcttgg gaaaaatagg
     2281 tgagagtgag gacaaaacaa attcaagggg gcaaccagca acaatgaaag aggatgaagt
     2341 tgaagataat cctttaaaag atgtatataa agattatttc ctggttggag cagcaattaa
     2401 tggctattct gttgaaactg ctgctatcaa tcatcctggt atggctgcaa ttttgaaaaa
     2461 aactttaaca gtacaaccct atctaatttg atgaaacaac aatacctttt agattatgaa
     2521 gctacaaaag caagtaaaaa tggaatgcca gtgtgtaaat ttgacagctg cattcctgct
     2581 ttacaatttt gtaaggaaaa tggcataaaa atgagaggac atgtgttagt atggcataat
     2641 cagacaccag aatggttttt ccacaaagac tatgatgtat cgaaaccact tgtagatgct
     2701 gctactatgg aacgccggtt ggaaagttat atcaaacagg taattgaatt ttgtcaaaaa
     2761 aattatcccg gtgtagtcta ttgctgggat gttgttaacg aagctatact tgatgatggt
     2821 tcatggagag aaatcaataa taattggtat accattatga aagaaaagta tgtggaaaag
     2881 gcattttatt atgcaagaaa atatgccaaa aaagatgttg ccctgtttta caatgattac
     2941 aatgtttttc tccctgcaaa gagagaagca atttataatc ttgctcagaa acttaaagaa
     3001 aaaggattga ttgacgggtt gggtcttcaa cctacagtag gcttgaatta tcctgaatta
     3061 gattctgatg atatagattc attcaaaacg acattagaaa catttgcaaa acttggctta
     3121 caaattcata ttactgagtt aaattttgaa ataaagggag atgagagcaa tcgtactcct
     3181 gaaaatctca aaaaacaagc agataggtat tacgaaatga tgaagttatt attgaaggaa
     3241 gatactgata atggtgggcc ttgcaacata acttgtgtta ctgtttttgg tatctgtgac
     3301 gattatccac tatataaaaa ttttaagcag tgcatgtatc tttgggataa aaattgcaat
     3361 cctaaaccat gtttttattc atttctccaa gcaggtttag actggaaagc atctttatta
     3421 agcaaataag aatgaacaac acttatggag aggaggaaaa taatgaaaat aactattaat
     3481 tatggaaaga gacttgggaa aataaacaaa ttttgggcaa aatgtgttgg aagctgtcat
     3541 gctacaactg cgttaagaga agactggcga aagcaattaa aaaaatgtcg tgacgaactt
     3601 ggttttgagt atattcgatt tcatggttgg ttgaatgatg atatgagtgt ttgttttaga
     3661 aatgatgatg ggctactttc attctcattc ttcaacatag attctataat tgattttctt
     3721 ttggagatag gtatgaaacc atttattgaa ctgagcttta tgccagaagc gttagcgtca
     3781 ggtacaaaga cagttttcca ttacaaagga aatataacac cgccgaaatc ttatgaagaa
     3841 tggggtcagc tgattgagga gttagcaagg catcttatta gcagatatgg gaaaaatgaa
     3901 gtaagagaat ggttttttga ggtatggaac gaaccaaatc taaaggattt cttctgggca
     3961 ggaacaatgg aagaatattt taagctttac aaatatgctg cttttgcaat aaagaaagtg
     4021 gactctgaac taagggtagg tggaccagct actgcaatcg atgcatggat acctgaacta
     4081 aaagattttt gtacaaaaaa tggtgttcca atagatttta tttcaacgca tcaatatcca
     4141 acagatttag cattcagtac aagctcaaat atggaagagg ctatggcaaa agcaaagaga
     4201 ggtgaattag cagagagggt aaaaaaggct ttagaggaag catatccatt gcctgtttac
     4261 tacactgaat ggaataactc tccaagtcct cgagacccat atcacgacat accttacgat
     4321 gctgctttta ttgtaaaaac aataattgac attatagatt taccacttgg gtgttattct
     4381 tattggacat ttacagatat ctttgaagaa tgtggacaga gttctttacc ttttcatggg
     4441 ggattcgggc ttctaaatat tcatggtata ccaaaaccat cctatagagc atttcaaatt
     4501 ttagataaac taaacggtga gaggattgag atagagtttg aagataaaag cccaaccatt
     4561 gattgtatag ctgtccagaa tgagagagag ataatacttg tgatctcaaa ccataatgtt
     4621 ccgctgtctc ctattgatac cgaaaatata aaagttgttt taaaaggtat tgagaattgc
     4681 cgagaagttt ttgttgagag aatagatgaa tataatgcca atccaaaaag agtatggctt
     4741 gaaatgggca gtcctgcgta tctcaataga gaacagattg aggagttgat aaaagcatca
     4801 gaactaaaga aagagaaagt ttcatggggg attgtgaata ataatgaaat tacatttgat
     4861 ttaagtgttt tacctcactc agttgtggct gttacaatta agaatggtta gtgaaatgtt
     4921 aagagagaaa agcaattttg tatatctctt ttaattttta cctttgacac atcaaacaat
     4981 ctaaattaaa attaaagtat agtgttttgc atactcaaca tagtataaat tatataaggg
     5041 taacattaat accctttttg tttttgtaag ggggtgtttt tgtggcaaag cacacgcaaa
     5101 aaggtaaatc agctgccaca gccgccgtgt cagacaaaga aaaagcaagg tttgttccta
     5161 aaaatattca agctgagata aaagaaaaga ttaaagacac tggtgaaaaa gtagcaaagg
     5221 ctgagggtaa ggacaaagca cttttacagt taaagctgga gagcaacaaa aaggttgata
     5281 agaaaaaatt caaaaaggat agaagtgttg agaggaataa aacttcatta aatagatttt
     5341 taagtttaga taaaattaaa tccctatatt caaaagagat acataataaa ctttcacaca
     5401 tctttgaaga tgcagtttct gaggtttata gaattttaat ggggctaaag tatatcaaaa
     5461 aggcgccaaa ttacaccgaa attgttctga aggcaaagat attttcaacc ttgattttga
     5521 tgattgtaat attattttta atcaacaaaa tgccttctac atacaaaaaa gcgtatgcag
     5581 ttgttttgaa caatcagatt gtagggtatg tgaaggacaa gactgaagca caaaaccttc
     5641 ttacccagat taaaaaagaa gtagaggaaa gacacaatac agacagtttc attttacaaa
     5701 gtaagcttca actaaagagc attgagcctg gtcaatatcg tgagacaagg gttgatgagc
     5761 tgaaaaatac tatcatagaa aaggggaagg tccttgtaaa aaggtatgct atttttgtta
     5821 attcaaaacc atattttgta tttgaaaatc cacaaactcc aaataatatt cttaacaagc
     5881 taaaaaaggt ctattataat gacaaggcat cacaggcaaa attcttagag aaggtagaaa
     5941 taaaaccagt ttatgtctca ccagctatta aagtagctga tgaagctact gccttaacaa
     6001 agattatgtt tgggaaagac caggtaatag aatatacagt caaggaagga gatactcttt
     6061 gggatcc
//
LOCUS       PFAAMA1      2307 bp ds-DNA             INV       11-JUL-1990
DEFINITION  P.fragile apical membrane antigen 1 (AMA1/AG352) gene, complete
            cds.
ACCESSION   M29898
KEYWORDS    apical membrane antigen.
SOURCE      P.fragile (Nilgiri strain) DNA, from Macaca mulatta, clone
            AG352VATV1.
  ORGANISM  Plasmodium fragile
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 2250)
  AUTHORS   Peterson,M.G., Nguyen-Dinh,P., Marshall,V.M., Elliott,J.F.,
            Collins,W.E., Anders,R.F. and Kemp,D.J.
  TITLE     Apical membrane antigen of Plasmodium fragile
  JOURNAL   Mol. Biochem. Parasitol. 39, 279-284 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 2251 to 2307)
  AUTHORS   Peterson,M.G., Nguyen-Dinh,P., Marshall,V.M., Elliott,J.F.,
            Collins,W.E., Anders,R.F. and Kemp,D.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by V.Marshall, 15-NOV-1989.
FEATURES       from  to/span     description
    pept        275     1963     apical membrane antigen 1 (AMA1/AG352)
                                 precursor
    sigp        284      322     put. apical membrane antigen 1 signal peptide
    matp        323     1960     apical membrane antigen 1
BASE COUNT      794 a    445 c    499 g    569 t
ORIGIN      
        1 taagttccct ttctacaccc ggatgcctcc tagagcaaat aggagtttca agcgtttaca
       61 tgtaatttac caagcgtttg taattttgca actttgcaat ttttctactg cgcaagtttg
      121 taaccgtgaa gctgctcacc tgtgtgacgg ccaattttta ccaacggtta aacctgttag
      181 tggctatttt tttctcgccc ccctcctgat tgatgtgcag agggagagaa ccaaatagct
      241 gcctttttct tgagtcacaa tttaacaaca caatatgaat aaaatatact gcatactgtt
      301 tttaagtgcc cagtgccttg tgcacatggg taagtgcgag ccaaaccaga agccgagcag
      361 gctgacccgc agcgctaaaa acgttttgtt ggaacaggag cctatggttg agagaagtac
      421 acgaatgagt aacccatgga aagcattcat ggaaaagtac gatatcgaaa aaacacacag
      481 ttctggtatt cgagtagatt taggggaaga tgcagaagtg ggaaattcca gctatagaat
      541 accagcagga aaatgtcctg tttttggaaa gggtatcgtt atacagaatt ctgaggttag
      601 tttcttaaca cctgtagcta caggcaatca aaagttgaag gatggaggtt tcgcctttcc
      661 acaagcaaat gatcatattt cccctatatc cataaaaaac cttagagaaa ggtataaaga
      721 gaatccagat ttgatgaagc taaacgattt agctttgtgt aaaactcatg cagccagctt
      781 tgtaatggaa atggataaaa attcgtccta tagacaccca gctgtatatg atgaagataa
      841 aaaaatatgt tacatgttgt atttatcagc gcaagaaaat atgggtccaa gatactgtag
      901 taaagatgca gaaaataaag atgctatgtt ttgcttcaag ccagataaaa atgaaacatt
      961 tgaccatctt gcctatttaa gcaaaaatgt ggttaatgat tggcaaaaca aatgcccccg
     1021 taaaaattta ggaaattcta aatttggatt atgggtggat ggaaactgtg aagaaatccc
     1081 atacgttcaa gacgtgcagg caaaggatct acgcgaatgt aacagaatcg ttttcgaagc
     1141 tagcgcttca gatcaaccaa ctcagtacga agaagaacta accgattatc aaaaaataca
     1201 agaaggcttt agacaaaacg atcagggtat gattaaaagt gcttttcttc cagtaggtgc
     1261 attcaactcg gacaatttta agagtaaagg aagaggatat aactgggcaa atttcgatac
     1321 tgaaaataag gtttgttacc tttttaatgc caaacccact tgcctcatta atgacaaaaa
     1381 ctttatcgca acaacagcgt tatctcatcc ccaagaagta gacaatgagt ttccatgcag
     1441 catatacaaa gatgaaatgg aaagggaaat gaggaaagaa tcgaggaaca tgagtctgta
     1501 caatgttgat aaggcacgga ttgttctgcc aaggatattt atctccaacg ataaggacag
     1561 tctcaaatgt ccatgcgcac cagaacacat taccaacagt acctgcaact tttacgtttg
     1621 taactgtgta gagaaaaggg cagaaattaa agaaaataac gaagtggcca taaaggaaga
     1681 atttaagcaa gattaccaat acgcgcaagg tgaatccaaa aatcagatgc tcctaattat
     1741 tatcggaata actggaggtg tgtgtgtggt cgcactggct tccatgtttt acttcaggaa
     1801 gaaagctcac aatgataagt atgacaagat ggagcaggca gacgggtacg ggaaacccac
     1861 caccaggaaa gacgagatgc tcgaccccga ggcgtccttc tggggtgaag aaaagcgggc
     1921 ctcccacacc acccctgtgc tgatggagaa gccttactac tgagcgggga agcaaccgaa
     1981 ttggtgaggg cctctttggt cgtaaacaaa gtgggggtgc ctcacaatgc atattttcaa
     2041 cccgcgtcat gtaaaaaaga aaaacgagac acacccagct ggccaacaaa ttgcccacaa
     2101 gggaggagaa atggagcaag ctaaaattgg gctattgtca tcatcaccag ttaccgagga
     2161 aatgaaaaca acaacaaaaa aaaacgtaac acatggtaaa gtaactgatt ggttaagcaa
     2221 agccgagtga aaatttaccc cacttgcgat ttaaaagcat gatttgcctc caccaaatgg
     2281 acctctccac tattaatatt accggag
//
LOCUS       RICAAMYA     1553 bp ss-mRNA            PLN       11-JUL-1990
DEFINITION  Rice alpha-amylase mRNA, complete cds, clone pOS103.
ACCESSION   M24286
KEYWORDS    1,4-alpha-D-glucan glucanohydrolase; alpha-amylase.
SOURCE      Rice (strain M202), cDNA to mRNA, clone pOS103.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1553)
  AUTHORS   O'Neil,S.D., Kumagai,M.H., Majumdar,A., Huang,N., Sutliff,T.D. and
            Rodriguez,R.L.
  TITLE     The alpha-amylase genes in Oryza sativa: Characterization of cDNA
            clones and mRNA expression during seed germination
  JOURNAL   Mol. Gen. Genet. 221, 235-244 (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.H.Kumagai, 25-APR-1989.
            
                          Author address: M.H.Kumagi
                          UC Davis, Dept. of Genetics,
                          Davis, Ca. 95616
FEATURES       from  to/span     description
    pept         34     1338     alpha-amylase (EC 3.2.1.1)
BASE COUNT      340 a    486 c    446 g    281 t
ORIGIN      
        1 atcaatcatc catctccgaa gtgtgtctgc agcatgcagg tgctgaacac catggtgaac
       61 aaacacttct tgtccctttc ggtcctcatc gtcctccttg gcctctcctc caacttgaca
      121 gccgggcaag tcctgtttca gggattcaac tgggagtcgt ggaaggagaa tggcgggtgg
      181 tacaacttcc tgatgggcaa ggtggacgac atcgccgcag ccggcatcac ccacgtctgg
      241 ctccctccgc cgtctcactc tgtcggcgag caaggctaca tgcctgggcg gctgtacgat
      301 ctggacgcgt ctaagtacgg caacgaggcg cagctcaagt cgctgatcga ggcgttccat
      361 ggcaagggcg tccaggtgat cgccgacatc gtcatcaacc accgcacggc ggagcacaag
      421 gacggccgcg gcatctactg cctcttcgag ggcgggacgc ccgactcccg cctcgactgg
      481 ggcccgcaca tgatctgccg cgacgacccc tacggcgatg gcaccggcaa cccggacacc
      541 ggcgccgact tcgccgccgc gccggacatc gaccacctca acaagcgcgt ccagcgggag
      601 ctcattggct ggctcgactg gctcaagatg gacatcggct tcgacgcgtg gcgcctcgac
      661 ttcgccaagg gctactccgc cgacatggca aagatctaca tcgacgccac cgagccgagc
      721 ttcgccgtgg ccgagatatg gacgtccatg gcgaacggcg gggacggcaa gccgaactac
      781 gaccagaacg cgcaccggca ggagctggtc aactgggtcg atcgtgtcgg cggcgccaac
      841 agcaacggca cggcgttcga cttcaccacc aagggcatcc tcaacgtcgc cgtggagggc
      901 gagctgtggc gcctccgcgg cgaggacggc aaggcgcccg gcatgatcgg gtggtggccg
      961 gccaaggcga cgaccttcgt cgacaaccac gacaccggct cgacgcagca cctgtggccg
     1021 ttcccctccg acaaggtcat gcagggctac gcatacatcc tcacccaccc cggcaaccca
     1081 tgcatcttct acgaccattt cttcgattgg ggtctcaagg aggagatcga gcgcctggtg
     1141 tcaatcagaa accggcaggg gatccacccg gcgagcgagc tgcgcatcat ggaagctgac
     1201 agcgatctct acctcgcgga gatcgatggc aaggtgatca caaagattgg accaagatac
     1261 gacgtcgaac acctcatccc cgaaggcttc caggtcgtcg cgcacggtga tggctacgca
     1321 atctgggaga aaatctgagc gcacgatgac gagactctca gtttagcaga tttaacctgc
     1381 gatttttacc ctgaccggta tacgtatata cgtgccggca acgagctgta tccgatccga
     1441 attacggatg caattgtcca cgaagtactt cctccgtaaa taaagtagga tcagggacat
     1501 acatttgtat ggttttacga ataatgctat gcaataaaat ttgcactgct taa
//
LOCUS       RICAAMYB     1682 bp ss-mRNA            PLN       11-JUL-1990
DEFINITION  Rice alpha-amylase mRNA, complete cds, clone pOS137.
ACCESSION   M24287
KEYWORDS    1,4-alpha-D-glucan glucanohydrolase; alpha-amylase.
SOURCE      Rice (strain M202), cDNA to mRNA, clone pOS137.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1682)
  AUTHORS   O'Neil,S.D., Kumagai,M.H., Majumdar,A., Huang,N., Sutliff,T.D. and
            Rodriguez,R.L.
  TITLE     The alpha-amylase genes in Oryza sativa: Characterization of cDNA
            clones and mRNA expression during seed germination
  JOURNAL   Mol. Gen. Genet. 221, 235-244 (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.H.Kumagai, 25-APR-1989.
            
                       Author address: M.H.Kumagi
                       UC Davis, Dept. of Genetics,
                       Davis, Ca. 95616
FEATURES       from  to/span     description
    pept         78     1382     alpha-amylase (EC 3.2.1.1)
BASE COUNT      355 a    491 c    519 g    317 t
ORIGIN      
        1 atccatcatc tacaagagat cgatcagtag tggttagcag caactcacta tcgaacacgg
       61 tttcagctta cacagatatg aagaacacca gcagcttgtg tttgctgctc ctcgtggtgc
      121 tctgcagctt gacctgtaac tcgggtcaag cacaggtcct cttccagggt ttcaactggg
      181 agtcgtggaa gcagcagggt ggctggtaca acatgttgaa aggccaagtc gacgacatcg
      241 ccaaggccgg ggtcacccac gtctggctgc cgccgccgtc gcactccgtg gcgcgagggt
      301 acatgccggg gcgtctctac gacctggacg cgtccaagta cggcacggcg gcggagctca
      361 agtcgctgat cgcggcgttc cacgggaagg gcgtccagtg cgtcgccgac gtcgtgatca
      421 accaccggtg cgccgagaag aaggacgccc gcggcgtgta ctgcgtgttc gagggcggga
      481 cgcgcgaccg cctcgactgg ggccccggca tgatctgcag cgacgacacg cagtactccg
      541 acggcacggg ccaccgcgac accggcgagg ggttcggcgc ggcgcccgac atcgaccacc
      601 tcaacccgcg cgtccagcgg gagctcaccg actggctcaa ctggctcaag tccgacgtcg
      661 gcttcgacgg ctggcgcctc gacttcgcca agggatactc cacggacatc gctaagatgt
      721 acgtcgagag ctgcaagccg ggcttcgtcg tcgccgagat atggaactcg ctgagctaca
      781 acggcgacgg caagccggcg gccaaccagg accagggccg gcaggagctg gtgaactggg
      841 tgaacgccgt cggcgggccg gcgatgacgt tcgacttcac caccaagggc ctcctgcagg
      901 cgggcgtcca gggcgagctg tggcggctgc gcgacggcaa cggcaaggcg cccggcatga
      961 tcgggtggct gccagagaag gccgtcacgt tcgtcgacaa ccacgacacc ggctcgacgc
     1021 agaagctttg gccgttcccc tccgacaagg tcatgcaggg ctacgcctac atcctcaccc
     1081 accccggagt cccctgcatc ttctacgacc acatgttcga ctggaacctg aagcaggaga
     1141 taaccgcgct ggcggcgatc agggagagga acggcatcaa cgccgggagc aagctccgga
     1201 tcgtcgtcgc cgacgccgac gcatacgtcg ccgtcgtcga cgagaaggtc atggtgaaga
     1261 tcgggacgag gtacgacgtg ggcaacgcgg tgccgtcgga tttccatcag acggtgcacg
     1321 gcaaggacta cagcgtctgg gagaaggggt ccctccgcgt cccggcgggg cggcacctat
     1381 agcgggctca agccctaaac tgaacgggat agtcatgctc aaaccagttt ctacacggca
     1441 agaatttact gattcttata ctttttcagt caattaaatt atggttttta tatatgtaat
     1501 tttgtatccg attgtagcgt tcgaataagt aggcaggctc tctagcctct aggttaattg
     1561 cgggcatatg tagcttgcca gttaattgtg tttgtatcac gcagtttgta accgttggtg
     1621 catatatatg tcaggttcag gatgcagtaa aaaatcatac tgcaccgatc agtgagtttt
     1681 ta
//
LOCUS       HUMCEAPX      494 bp ss-mRNA            PRI       11-JUL-1990
DEFINITION  Human cell adhesion protein (SQM1) mRNA, complete cds.
ACCESSION   M33374
KEYWORDS    cell adhesion protein.
SOURCE      Human squamous carcinoma cell line SCC25, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 494)
  AUTHORS   Wong,Y.-C., Tsao,S.-W., Kakefuda,M. and Bernal,S.D.
  TITLE     cDNA cloning of a novel cell adhesion protein expressed in human
            squamous carcinoma cells
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 984-992 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         36      443     cell adhesion protein (SQM1)
    mRNA    <     1      494     SQM1 mRNA
BASE COUNT      107 a    154 c    163 g     70 t
ORIGIN      
        1 ccctcggtgc tgcagggatc tgcaggactg cagccatggg ggcgcacctg gtccggcgct
       61 acctgggcga tgcttcggtg gagcccgacc ccctgcagat gccaaccttc ccgccagact
      121 acggcttccc cgaacgcaag gagcgcgaga tggtggccac acagcaggag atgatggacg
      181 cgagtgaggc tcagctgcgg gactactgcg cccaccacct catccggctg ctcaagtgca
      241 agcgtgacag cttcccaagt tgctggcctg caagcaggaa gcggcacgac tcgggactac
      301 tgcgcaccgc aagctatgtg atgcgcatga aggagtttga gcgggacgag ggctgctcca
      361 gcggaagaag cggcgggaga agaaggcggc aaatctgcaa aggccaggga cccggggaag
      421 tggaccccaa ggtggccctg taggggtgca ccccccaccc tatggaccag tcaaataaaa
      481 ccttcaggcc cctc
//
LOCUS       REOCEAP1     1463 bp ds-RNA             VRL       11-JUL-1990
DEFINITION  Reovirus sp. (serotype ST1) sigma-1 protein gene, complete cds.
ACCESSION   M32860
KEYWORDS    cell attachment protein; sigma-1 protein.
SOURCE      Reovirus sp. (serotype ST1), cDNA to viral RNA.
  ORGANISM  Reovirus sp.
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 1463)
  AUTHORS   Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K.
  TITLE     Identification of conserved domains in the cell attachment proteins
            of the three serotypes of Reovirus
  JOURNAL   Virology 174, 399-409 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         14     1426     sigma-1 protein
BASE COUNT      426 a    291 c    369 g    377 t
ORIGIN      
        1 gctattcgcg cctatggatg catctctcat tacagagata cggaaaatag tactccaact
       61 atctgtatca agcaatggct cccagtcaaa agaaatcgag gaaatcaaga aacaagtcca
      121 ggtcaacgtt gatgatatca gggctgccaa tattaaactc gacggacttg gaagacagat
      181 tgctgacatc agcaatagca tctcaaccat tgagtcaaga ttgggtgaga tggataatcg
      241 acttgtgggt atctcgagtc aggtcacgca attatctaac tcagttagcc agaacactca
      301 gagcatatcc tcattgggtg acagaatcaa tgctgtcgaa ccacgagttg acagtctgga
      361 tacggtcacg tctaatctca ctggacgaac atccactttg gaggcagatg ttggaagctt
      421 acggacagaa ctagcagcgc taacaacacg ggtgacaact gaggttacaa ggttagatgg
      481 tctaatcaat agtggccaga attcgattgg tgagctatcc acaagactat ccaatgtgga
      541 gacgtctatg gtgacgacgg ctggacgggg actgcagaaa aacggaaaca ccttgaacgt
      601 cattgtaggt aatggaatgt ggtttaatag ttctaatcaa ttgcagctcg acctttcggg
      661 gcaatcaaaa ggggtgggat ttgtcggcac aggaatggtg gttaagattg atactaatta
      721 ttttgcttac aatagtaatg gagagattac attggtgagt caaatcaatg aattgccatc
      781 gcgcgtatca acactggaat cagcgaaaat cgattcagtt ttacctccat taaccgtacg
      841 cgaagcgagc ggcgtacgta ccctgagctt tggttatgat acgagcgatt ttacaatcat
      901 caactccgta ctgtcgttac ggtcacgttt gactcttccg acatacaggt accctctgga
      961 gctcgacaca gcaaataata gagtgcaggt ggcagatcgt tttggcatgc gcacgggtac
     1021 ttggacggga caattgcaat atcagcaccc acaattgagt tggagagcaa atgtcacttt
     1081 gaatttgatg aaggtggatg attggttggt gttgagcttt tctcagatga cgactaactc
     1141 aataatggca gatgggaaat ttgtgattaa ttttgtgtct gggttatctt ctggatggca
     1201 gacgggggat actgaaccat cgtcaactat tgatccattg tctacgacat ttgccgcggt
     1261 ccaatttcta aataacggtc aacgcattga tgcgtttagg atcatgggag tatcggaatg
     1321 gacggatgga gaattagaga ttaagaatta tggtggcaca tacaccggtc atactcaagt
     1381 atattgggct ccgtggacga tcatgtatcc atgcaatgtg aggtgaatct agcgcgaacc
     1441 ctcggcacaa ggggtcaatc atc
//
LOCUS       REOCEAP2     1440 bp ss-RNA             VRL       11-JUL-1990
DEFINITION  Reovirus sp. (serotype ST2) sigma-1 protein gene, complete cds.
ACCESSION   M32861
KEYWORDS    cell attachment protein; sigma-1 protein.
SOURCE      Reovirus sp. (serotpe ST2), cDNA to viral RNA.
  ORGANISM  Reovirus sp.
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 1440)
  AUTHORS   Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K.
  TITLE     Identification of conserved domains in the cell attachment proteins
            of the three serotypes of reovirus
  JOURNAL   Virology 174, 399-409 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         14     1402     sigma-1 protein
BASE COUNT      384 a    316 c    381 g    359 t
ORIGIN      
        1 gctattcgca ctcatgtcgg atctagtgca gctcataaga agggagatct tactgttaac
       61 tgggaatgga gaatcagcca actcgaaaca cgagatcgag gaaattaaga aacaaattaa
      121 agacatctct gctgatgtca acaggatcag taacatcgtt gattcaatcc aaggacaact
      181 gggtggatta tctgtacgcg tgtcagccat tgaatcggga gttagtgaga acggcaatcg
      241 aattgataga ctcgagcgag atgtctccgg catatcggct agcgttagcg gaatcgattc
      301 gcgtttatcc gagctgggtg accgagtcaa tgttgcagaa cagcgaattg gccagttgga
      361 tacagtcacg gataatctcc ttgagcgagc atcaagactg gaaactgaag tatcagccat
      421 tactaatgac cttggatcat tgaatacgag gctgacgact gaattgaacg atgtccgcca
      481 aactattgct gcgatagaca cgcgtctcac gacactggag accgatgccg tgacgtcggt
      541 tggtcaaggg cttcagaaga ctgggaactc gattaaggtt attgtgggta cggggatgtg
      601 gttcgaccgc aataatgttc tgcagttatt cttatcgaac cagcagaaag ggttgggatt
      661 catagacaat ggaatggtag tgaaaataga tacccagtat ttcagcttcg atagcaatgg
      721 caacataact ctgaacaaca acataagtgg tctgccggcg cgaacaggtt ccctcgaggc
      781 atctcgtatc gatgtggtag cgccaccgct tgtgatacag tctactggta gcactcggct
      841 actgcgtctc atgtacgagg ctgtggactt cgtggttact aacaacgttc tcacactgag
      901 aaatcgatcg gtcacgccaa cattcaagtt tcctctggag ttgaatagtg ctgataactc
      961 agtgagcatt catagaaatt accgcattag acttgggcaa tggtcaggtc aattggaata
     1021 tcacacgccg agtttgcgtt ggaatgctcc cgtcacggtt aatttgatgc gagtagacga
     1081 ttggctcatt ttgagtttta ctcggttttc gacgagcggc atcttagcgt caggaaagtt
     1141 tgtattgaac ttcgtaactg gtttgtctcc agggtgggcg actgggagta ccgagccctc
     1201 gacaactact aacccactgt caacgacgtt tgctgcaatt cagttcatca atgggtcatc
     1261 tcgcgtagac gcctttagaa tcttgggagt cgcagagtgg aatgccgggg aactagagat
     1321 cacgaatcat ggcggaacat atacagcgca taccaatgtc gactgggcgc cgatgaccat
     1381 tatgtaccca tgtctgggct gaggatccgg gtgctccact cggcacagtg gcgactcatc
//
LOCUS       REOCEAP3     1416 bp ss-RNA             BAD       11-JUL-1990
DEFINITION  Reovirus sp. (serotype ST3) sigma-1 protein gene, complete cds.
ACCESSION   M32862
KEYWORDS    cell attachment protein; sigma-1 protein.
SOURCE      Reovirus sp. (serotype ST3) viral DNA.
  ORGANISM  Reovirus sp.
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 1416)
  AUTHORS   Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K.
  TITLE     Identification of conserved domains in the cell attachment proteins
            of the three serotypes of reovirus
  JOURNAL   Virology 174, 399-409 (1990)
  STANDARD  simple staff_review
COMMENT     Secondary reference.  Please see: Proc. Natl. Acad. Sci. U.S.A.
            82, 24-28 (1985), accession m10262.
FEATURES       from  to/span     description
    pept         13     1380     sigma-1 protein
BASE COUNT      376 a    301 c    365 g    374 t
ORIGIN      
        1 gctattggtc ggatggatcc tcgcctacgt gaagaagtag tacggctgat aatcgcatta
       61 acgagtgata atggagcatc actgtcaaaa gggcttgaat caagggtctc ggcgctcgag
      121 aagacgtctc aaatacactc tgatactatc ctccggatca cccagggact cgatgatgca
      181 aacaaacgaa tcatcgctct tgagcaaagt cgggatgact tggttgcatc agtcagtgat
      241 gctcaacttg caatctccag attggaaagc tctatcggag ccctccaaac agttgtcaat
      301 ggacttgatt cgagtgttac ccagttgggt gctcgagtgg gacaacttga gacaggactt
      361 gcagacgtac gcgttgatca cgacaatctc gttgcgagag tggatactgc agaacgtaac
      421 attggatcat tgaccactga gctatcaact ctgacgttac gagtaacatc catacaagcg
      481 gatttcgaat ctaggatatc cacgttagag cgcacggcgg tcactagcgc gggagctccc
      541 ctctcaatcc gtaataaccg tatgaccatg ggattaaatg atggactcac gttgtcaggg
      601 aataatctcg ccatccgatt gccaggaaat acgggtctga atattcaaaa tggtggactt
      661 cagtttcgat ttaatactga tcaattccag atagttaata ataacttgac tctcaagacg
      721 actgtgtttg attctatcaa ctcaaggata ggcgcaactg agcaaagtta cgtggcgtcg
      781 gcagtgactc ccttgagatt aaacagtagc acgaaggtgc tggatatgct aatagacagt
      841 tcaacacttg aaattaattc tagtggacag ctaactgtta gatcgacatc cccgaatttg
      901 aggtatccga tagctgatgt tagcggcggt atcggaatga gtccaaatta taggtttagg
      961 cagagcatgt ggataggaat tgtctcctat tctggtagtg ggctgaattg gagggtacag
     1021 gtgaactccg acatttttat tgtagatgat tacatacata tatgtcttcc agcttttgac
     1081 ggtttctcta tagctgacgg tggagatcta tcgttgaact ttgttaccgg attgttacca
     1141 ccgttactta caggagacac tgagcccgct tttcataatg acgtggtcac atatggagca
     1201 cagactgtag ctatagggtt gtcgtcgggt ggtgcgcctc agtatatgag taagaatctg
     1261 tgggtggagc agtggcagga tggagtactt cggttacgtg ttgagggggg tggctcaatt
     1321 acgcactcaa acagtaagtg gcctgccatg accgtttcgt acccgcgtag tttcacgtga
     1381 ggatcagacc accccgcggc actggggcat ttcatc
//
LOCUS       RATGLYSN     2386 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Rat glycogen synthase mRNA, complete cds.
ACCESSION   J05446
KEYWORDS    UDP glucose:glycogen 4-alpha-D-glucosyltransferase;
            glycogen synthase.
SOURCE      Rat adult liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2386)
  AUTHORS   Bai,G., Zhang,Z., Werner,R., Nuttall,F.Q., Tan,A.W.H. and
            Lee,E.Y.C.
  TITLE     The primary structure of rat liver glycogen synthase deduced by
            cDNA cloning: Absence of phosphorylation sites 1a and 1b
  JOURNAL   J. Biol. Chem. 265, 7843-7848 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Y.C.Lee, 16-MAR-1990.
FEATURES       from  to/span     description
    pept         46     2157     glycogen synthase (EC 2.4.1.11)
    mRNA    <     1     2386     glycogen synthase mRNA
    signal     2365     2370     poly-A signal
BASE COUNT      657 a    581 c    579 g    569 t
ORIGIN      
        1 ctgcaaccgg tccccttcgg caccagacac acagctggac gaagaatgct caggggccgc
       61 tccttgtctg tgacgtccct tggtgggctc cctgcatggg aagctgaaag actccccgtg
      121 gaagacttat tgctttttga agtttcctgg gaagtgacca acaaagttgg gggcatctgt
      181 actgtgatcc agagcaaagc caaaaccaca gccaatgaat ggggagagaa ttacttcctg
      241 ataggtccgt attttgagca taatgtgaag actcaagtag agccatgcag gcccgccaac
      301 gatgccgtca gaaaagctgt ggatgccatg aacaaacatg gctgccaggt gcattttgga
      361 agatggctga tagaagggag tccgtatgtg gtgctttttg acatcagctc ctcagtgtgg
      421 aacctggaca ggtggaaggg agacttctgg gaagcatgtg gcgttggcat ccctcacgac
      481 gaccgagaag ccaatgacat gctcatattt gggtctttaa ctgcctggtt cttaaaggag
      541 gtgacggacc atgcagacgg taaacacgtc attgcccaat tccatgaatg gcaggctgga
      601 actgggctga tcctttctcg tgccaggaaa ctccccatcg ccacaatatt tacaacccat
      661 gccacactgc tggggcggta tctctgtgca gcaaatattg acttctacaa ccagcttgat
      721 aagttcaaca tagacaaaga ggccggggag aggcagattt atcaccgcta ctgcatggag
      781 cgggcttccg tgcactgtgc gcacgtgttt accacagtgt cagaaatcac agccatcgag
      841 gcggacgaca tgctgaagag gaagcctgat gtggtgactc caaacggctt gaacgttaag
      901 aagttttctg cggtgcacga atttcaaaat ctccatgcca catacaaggc caggatacag
      961 gattttgttc gaggtcattt ctatggccac ctggacttcg atcttgaaaa gacgttattt
     1021 cttttcattg ctgggaggta tgagttctcc aacaagggag cagacatctt cctagaatcc
     1081 ttatccaggc tcaatttcct cctaaggatg cataagagta acgtcactgt ggtagtgttt
     1141 ttcatcatgc ctgccaagac aaacaatttc aacgtggaaa ccctgaaggg ccaggcggtg
     1201 cggaaacagc tgtgggacac tgtgcactgt atgaaggaaa agtttggcaa gaaactctac
     1261 gatgggttat taagaggaga aatacccgac atgaatagta ttttggatcg agatgactta
     1321 acaattatga aaagagccat tttttcaact cagagacact ctttgcctcc tgtgaccact
     1381 cacaatatga tcgacgattc cacggatccc atcctcagca ccattcgacg aattggactt
     1441 ttcaacaatc gcacagacag agtcaaggtg attttacacc cagaattcct gtcctccacc
     1501 agccccctac taccaatgga ttatgaagag tttgtccgag gctgtcacct tggggtattt
     1561 ccatcatact atgagccctg gggttacacg ccagccgaat gcacagtgat gggcatcccc
     1621 agtgtgacta cgaacctctc tggtttcggg tgtttcatgc aggagcatgt ggctgaccct
     1681 accgcgtacg gtatttatat cgtcgacagc gtccgctctc cagatgattc ttgcaaccag
     1741 ctgactcagt ttctctatgg gttctgtaaa cagtcccgcc gccaaagaat catccagagg
     1801 aaccgcaccg agaggctctc agatcttctg gactggagat acctgggcag atattaccag
     1861 catgccagac atctgacact gagcagggct tttccagaca aattctacct ggagcccaca
     1921 tccccaccaa cgacggatgg ctttaagtat cccaggccct cctcagtacc accttcccca
     1981 tcaggatccc agacttcaag tcctcagagc agcgatgtgg aaaacgaagg ggatgaggat
     2041 gagagatatg atgaggaaga ggaggctgag agggaccggc taaacatcaa gtcaccattt
     2101 tccctgaacc acatcccaaa ggggaagaaa aagcttcatg gagaatataa gaactgagct
     2161 caaatgaaat gattccaaat ccacaagaaa atgagctgag cccaagtcca tccctgatgc
     2221 ataccgacag atatttacag aatgacgtcg gaaatctaga atctgtgtcc agatcactga
     2281 tagtaacttg tagccaccga catgtgtcac cgtactgtga tggtactttt gttgtctaat
     2341 tggaaatttc aatctgttat tgataataaa ttaccaaatc taaatg
//
LOCUS       RABCYP2C16   2006 bp ss-mRNA            MAM       11-JUL-1990
DEFINITION  Rabbit cytochrome P450IIC16 (CYP2C16) mRNA, complete cds.
ACCESSION   M29968
KEYWORDS    cytochrome P450; monooxygenase.
SOURCE      Rabbit (strain New Zealand White) adult liver, cDNA to mRNA.
  ORGANISM  Oryctolagus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 2006)
  AUTHORS   Hassett,C. and Omiecinski,C.J.
  TITLE     Sequence and gene expression of rabbit cytochrome P450 IIC16:
            Comparison ti highly related family members
  JOURNAL   Nucleic Acids Res. 18, 1429-1434 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Hassett, 20-NOV-1989.
            
               Author Address [1]: C.Hasset
                               University of Washington
                               Department of Environmental Health SC-34
                               Seattle, WA 98195
FEATURES       from  to/span     description
    pept         66     1529     cytochrome P450IIC16 (CYP2C16)
    mRNA    <     1     2006     CYP2C16 mRNA
    signal     1985     1989     poly-A signal
BASE COUNT      524 a    474 c    472 g    536 t
ORIGIN      
        1 cggcatcggt accaaataag atagacagtg ctactcagaa atccaagaaa atggtggaag
       61 aaataatgga tccagttgtg gtcctggtgt tgggtctctg ctgtttgctt ctcctttcac
      121 actggaagca aaattccggg agggggaagc tccctcccgg ccccactcct ttccccatta
      181 ttggaaatat tctccagata gatgctaagg acatcagcaa atccctaact aagttctcag
      241 aacgctatgg ccccgtgttc actgtgtatc tgggcatgaa gcccgctgta gtgctgcatg
      301 gataccaggc agtgaaggag gccctggttg atcttggaga ggagtttgct ggaagaggca
      361 gttttcctat gcttgataaa gttagtaagg gactcggaat cgttttcacc aatggaaaga
      421 gatggaaaga gatccggcgc ttctcgctca tgaccctgcg gaatttcggg atggggaaga
      481 ggagcattga ggaccgagtt caagaggagg cccgctgcct ggtggaggag ctgagaaaaa
      541 ccaacgcctc accctgtgat cccaccttta tcctgggctg tgctccctgc aatgtgatct
      601 gctccattat tttccataat cgctttgatt ataaagatga ggagtttctt aaactattgg
      661 aaaaattcaa tgaaaatgtt aggattctga gttctccatg gttgcaggtc tgcaataatt
      721 tccctgctct tattgattac ttaccaggaa gtcataagac cttactaaag aattctgatt
      781 atgtgaaaaa ttttattatg gagaaagtga aggaacacca aaaattcctg gatgttaaca
      841 atcctcggga ctttatagat tgtttcttga tcaaaatgga gcaggaaaac catttggagt
      901 tcactcttga aagcttggta accactgtgt ttgatttgtt tggagctggg actgagacaa
      961 cgagcacaac gctgagatac tccctcctgc tcctgctgaa gcaccccgag gtcgcagata
     1021 aagtgcagga ggagattgag cgtgtgattg gcaggcaccg gagcccctgc atgcaggaca
     1081 ggagccgcat gccttacaca gatgccgtaa tacatgagat ccagagattc attgacctgg
     1141 tccccaataa tctgccccac acagtgaccc gtgacattaa attcagaaac tactttatcc
     1201 ccaagggtac ggacatcatg acatcactga catccgtgct acatgatgaa aaagcatttc
     1261 ctaacccaaa ggtatttgac cctggacact ttctggatga gagtggcaac ttcaagaaga
     1321 gtgactactt catgcctttc tcagcaggaa aacggatctg tgtgggagag gccctggccc
     1381 gcatggagct gtttttgttc ctgacctcca ttttgcagaa ctttaaactg caatctctgg
     1441 ttgagccaaa ggacctggac atcactgcag ttctcaatgg atttgtttct gtgccacctt
     1501 cgttccagct ctgcttcatt cctgtttgaa aaggagcaga ctggcttcta ctgtgccatc
     1561 atttcaaagg cattgcccat caccttactg catttgagac acttctttaa cttttctcac
     1621 atcttactat tcccttaaga tctagtgaaa acctaacttc tgtgggtgat cccctgagac
     1681 tgcctgccct gaccatgcaa gaggtagaga gggcatggca agccatgctc ctgggaggga
     1741 ccccacagcc tggctgctgg caggtggcgg gacccaggca catttctctc cattcctgcc
     1801 tgtcaggtaa actgctccta gctgtgtcca aagcccatca agaaagctac cgtaggctat
     1861 gtgaccttca agatgattgt aggagcatat cagtaccaat attgcctcta tcctatagaa
     1921 ttagtactgc cctgaattag ttacaccctt tctgcctgcc ctttagaaag tgtgcatgct
     1981 cattaataaa gtggatgcat tcactg
//
LOCUS       HUMGAPA      4307 bp ss-mRNA            PRI       11-JUL-1990
DEFINITION  Human GTPase-activating protein ras p21 (GAP) mRNA, complete cds.
ACCESSION   M23379
KEYWORDS    GTPase-activating protein.
SOURCE      Human placenta, cDNA to mRNA, clone 101.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4307)
  AUTHORS   Trahey,M., Wong,G., Halenbeck,R., Rubinfeld,B., Martin,G.A.,
            Ladner,M., Long,C.M., Crosier,W.J., Watt,K., Koths,K. and
            McCormick,F.
  TITLE     Molecular cloning of two types of GAP complementary DNA from human
            placenta
  JOURNAL   Science 242, 1697-1700 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by C.M.Long, 31-MAR-1989.  For sequence of clone 16 refer to
            M23612.
FEATURES       from  to/span     description
    pept        119     3262     ras p21 GTP-ase-activating protein (GAP)
BASE COUNT     1280 a    856 c    957 g   1214 t
ORIGIN      
        1 cctcagcctg gggagctgaa ggggagacgc gtctgggtgg ggctgctcgg agcccgggcc
       61 tggtggcccc tggggctccc gggcgggcag ggtagggcag agtagagcgg gcttcaacat
      121 gatggcggcc gaggccggca gtgaggaggg cggcccggta acagccggag ctggaggagg
      181 cggcgcggca gcgggctcca gtgcctatcc cgcagtgtgt cgggtgaaga tacccgcggc
      241 cctgcctgtg gcagccgccc cctatcctgg gctggtggag accggagtgg ctggaactct
      301 gggtggcgga gccgctttgg ggtcagagtt cctaggagcc gggtctgtgg caggggcact
      361 ggggggagct ggactgacag ggggaggtac tgctgctggc gtagctggtg ctgctgctgg
      421 cgtggccggt gctgctgttg ctggacctag tggagacatg gctctcacca aactgcccac
      481 ttcgttgctt gctgagactc tcgggccagg cggcggtttt ccccctctgc cccctccccc
      541 ttacctgccc cctttggggg cgggcctcgg gacagtggac gaaggtgact ctctggatgg
      601 accagaatac gaggaggaag aggtggccat accgttgacc gctcctccaa ctaaccagtg
      661 gtatcacgga aaacttgaca gaacgatagc agaagaacgc ctcaggcagg cagggaagtc
      721 tggcagttat cttataagag agagtgatcg gaggccaggg tcctttgtac tttcatttct
      781 tagccagatg aatgttgtca accattttag gattattgct atgtgtggag attactacat
      841 tggtggaaga cgtttttctt cactgtcaga cctaataggt tattacagtc atgtttcttg
      901 tttgcttaaa ggagaaaaat tactttaccc agttgcacca ccagagccag tagaagatag
      961 aaggcgtgta cgagctattc taccttacac aaaagtacca gacactgatg aaataagttt
     1021 cttaaaagga gatatgttca ttgttcataa tgaattagaa gatggatgga tgtgggttac
     1081 aaatttaaga acagatgaac aaggccttat tgttgaagac ctagtagaag aggtgggccg
     1141 ggaagaagat ccacatgaag gaaaaatatg gttccatggg aagatttcca aacaggaagc
     1201 ttataattta ctaatgacag ttggtcaagt ctgcagtttt cttgtgaggc cctcagataa
     1261 tactcctggc gattattcac tttatttccg gaccaatgaa aatattcagc gatttaaaat
     1321 atgtccaacg ccaaacaatc agtttatgat gggaggccgg tattataaca gcattgggga
     1381 catcatagat cactatcgaa aagaacagat tgttgaagga tattatctta aggaacctgt
     1441 accaatgcag gatcaagaac aagtactcaa tgacacagtg gatggcaagg aaatctataa
     1501 taccatccgt cgtaaaacaa aggatgcctt ttataaaaac attgttaaga aaggttatct
     1561 tctgaaaaag ggcaaaggaa aacgttggaa aaatttatat tttatcttag agggtagtga
     1621 tgcccaactt atttattttg aaagcgaaaa acgagctacc aaaccaaaag gattaataga
     1681 tctcagtgta tgttctgtct atgtcgttca tgatagtctc tttggcaggc caaactgttt
     1741 tcagatagta gttcagcact ttagtgaaga acattacatc ttttactttg caggagaaac
     1801 tccagaacaa gcagaggatt ggatgaaagg tctgcaggca ttttgcaatt tacggaaaag
     1861 tagtccaggg acatccaata aacgccttcg tcaggtcagc agccttgttt tacatattga
     1921 agaagcccat aaactcccag taaaacattt tactaatcca tattgtaaca tctacctgaa
     1981 tagtgtccaa gtagcaaaaa ctcatgcaag ggaagggcaa aacccagtat ggtcagaaga
     2041 gtttgtcttt gatgatcttc ctcctgacat caatagattt gaaataactc ttagtaataa
     2101 aacaaagaaa agcaaagatc ctgatatctt atttatgcgc tgccagttga gccgattaca
     2161 gaaagggcat gccacagatg aatggtttct gctcagctcc catataccat taaaaggtat
     2221 tgaaccaggg tccctgcgtg ttcgagcacg atactctatg gaaaaaatca tgccagaaga
     2281 agagtacagt gaatttaaag agcttatact gcaaaaggaa cttcatgtag tctatgcttt
     2341 atcacatgta tgtggacaag accgaacact actggccagc atcctactga ggatttttct
     2401 tcacgaaaag cttgaatcgt tgttgttatg cacactaaat gacagagaaa taagcatgga
     2461 agatgaagcc actaccctat ttcgagccac aacacttgca agcaccttga tggagcagta
     2521 tatgaaagcc actgctacac agtttgttca tcatgctttg aaagactcta ttttaaagat
     2581 aatggaaagc aagcagtctt gtgagttaag tccatcaaag ttagaaaaaa atgaagatgt
     2641 gaacactaat ttaacacacc tattgaacat actttcagag cttgtggaga aaatattcat
     2701 ggcttcagaa atacttccac cgacattgag atatatttat gggtgtttac agaaatctgt
     2761 tcagcataag tggcctacaa ataccaccat gagaacaaga gttgttagtg gttttgtttt
     2821 tcttcgactc atctgtcctg ccatcctgaa tccacggatg ttcaatatca tctcagattc
     2881 tccatctcct attgctgcaa gaacactgat attagtggct aaatctgtgc agaacttagc
     2941 aaatcttgtg gaatttggag ctaaggagcc ctacatggaa ggtgtcaatc cattcatcaa
     3001 aagcaacaaa catcgtatga tcatgttttt agatgaactt gggaatgtac ctgaacttcc
     3061 ggacactaca gagcattcta gaacggacct gtcccgtgat ttagcagcat tgcatgagat
     3121 ttgcgtggct cattcagatg aacttcgaac gctcagtaat gagcgtggtg cacagcagca
     3181 cgtattgaaa aagcttctgg ctataacaga actgcttcaa caaaaacaaa accagtatac
     3241 aaaaaccaat gatgtcaggt agcagccttc gccccagtgt tctgcatgga ttcagcatgt
     3301 ccaacatggt aattcacttc agtttaatgt ctcctttgct cttgccaaaa aatagcacac
     3361 ttttccacat tccagtgatg tgtgagctat gcaaacaaaa tccaagattc tgctggtgaa
     3421 taactatgcc agcaaccttg taagctatct gtgcaggata tttgcactat ttccacatgg
     3481 aatcaatctt taacaacctc tgagccttgg tgtacagacc acctttcaca aaacgaaatg
     3541 ctatgactgt atcttgatat ctcgaacttt caaaatatat tttcagtaca cccagttgcc
     3601 aaagttttgc tgtctcttag agaaagaact atgaaatcaa ctgacaagaa acacattctt
     3661 attgacaatt gtgtataact ggattgcaga ctgttcttac tgtaactact tcctgattag
     3721 gaatatgacc atttgactgt tcaatgatta tttgtattta cagtttccag agtttgtcat
     3781 tataatagga acaatctttg ctgtatactt ttaaaaaata ctctgctatt tctcttgctg
     3841 gaactgttga aagaaaatat atagaatgat ctattgctca tcagctttat tttttaaaca
     3901 tacgacttat tttgttgaaa ttgtcaaaga ctgtatttag atctcataat gctttgttaa
     3961 atgtttacaa gtaaatagtt tgaattcagt aaatattatt ggttgttgta ttgatcaatg
     4021 catgttaccc attcaaccat tttatagact accaatttct tttatgttaa ctagaatgct
     4081 tttgttaaaa gttatttgtt cattatttgt gctacccctt tgattatgca gacaacctca
     4141 tcagctgcct aacttatcca tctttgaact tctgactact tgttgtatct gctggatatt
     4201 tagttcaact gtatagtttt atttacttct gtatgtgtat ttttgtgaag tattcacaaa
     4261 ggttaagtta aaataaaacc aagggatatc ttgcaaaaaa aaaaaaa
//
LOCUS       HUMGAPB      3456 bp ss-mRNA            PRI       11-JUL-1990
DEFINITION  Human GTPase-activating protein ras p21 (GAP) mRNA, complete cds.
ACCESSION   M23612
KEYWORDS    GTP-ase-activating protein.
SOURCE      Human placenta, cDNA to mRNA, clone 16.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3456)
  AUTHORS   Trahey,M., Wong,G., Halenbeck,R., Rubinfeld,B., Martin,G.A.,
            Ladner,M., Long,C.M., Crosier,W.J., Watt,K., Koths,K. and
            McCormick,F.
  TITLE     Molecular cloning of two types of GAP complementary DNA from human
            placenta
  JOURNAL   Science 242, 1697-1700 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by C.M.Long, 31-MAR-1989.  For sequence of clone 101 refer to
            M23379.
FEATURES       from  to/span     description
    pept        100     2712     ras p21 GTP-ase-activating protein (GAP)
    site         49       51     5' in frame termination codon
BASE COUNT     1134 a    640 c    687 g    995 t
ORIGIN      
        1 ggaagaggtg gccataccgt tgaccgctcc tccaactaac cagtaagtta agactgctgt
       61 tcaggaattt gggaagctgg ctccagaaaa gaagtggaaa tgaaggggtg gtatcacgga
      121 aaacttgaca gaacgatagc agaagaacgc ctcaggcagg cagggaagtc tggcagttat
      181 cttataagag agagtgatcg gaggccaggg tcctttgtac tttcatttct tagccagatg
      241 aatgttgtca accattttag gattattgct atgtgtggag attactacat tggtggaaga
      301 cgtttttctt cactgtcaga cctaataggt tattacagtc atgtttcttg tttgcttaaa
      361 ggagaaaaat tactttaccc agttgcacca ccagagccag tagaagatag aaggcgtgta
      421 cgagctattc taccttacac aaaagtacca gacactgatg aaataagttt cttaaaagga
      481 gatatgttca ttgttcataa tgaattagaa gatggatgga tgtgggttac aaatttaaga
      541 acagatgaac aaggccttat tgttgaagac ctagtagaag aggtgggccg ggaagaagat
      601 ccacatgaag gaaaaatatg gttccatggg aagatttcca aacaggaagc ttataattta
      661 ctaatgacag ttggtcaagt ctgcagtttt cttgtgaggc cctcagataa tactcctggc
      721 gattattcac tttatttccg gaccaatgaa aatattcagc gatttaaaat atgtccaacg
      781 ccaaacaatc agtttatgat gggaggccgg tattataaca gcattgggga catcatagat
      841 cactatcgaa aagaacagat tgttgaagga tattatctta aggaacctgt accaatgcag
      901 gatcaagaac aagtactcaa tgacacagtg gatggcaagg aaatctataa taccatccgt
      961 cgtaaaacaa aggatgcctt ttataaaaac attgttaaga aaggttatct tctgaaaaag
     1021 ggcaaaggaa aacgttggaa aaatttatat tttatcttag agggtagtga tgcccaactt
     1081 atttattttg aaagcgaaaa acgagctacc aaaccaaaag gattaataga tctcagtgta
     1141 tgttctgtct atgtcgttca tgatagtctc tttggcaggc caaactgttt tcagatagta
     1201 gttcagcact ttagtgaaga acattacatc ttttactttg caggagaaac tccagaacaa
     1261 gcagaggatt ggatgaaagg tctgcaggca ttttgcaatt tacggaaaag tagtccaggg
     1321 acatccaata aacgccttcg tcaggtcagc agccttgttt tacatattga agaagcccat
     1381 aaactcccag taaaacattt tactaatcca tattgtaaca tctacctgaa tagtgtccaa
     1441 gtagcaaaaa ctcatgcaag ggaagggcaa aacccagtat ggtcagaaga gtttgtcttt
     1501 gatgatcttc ctcctgacat caatagattt gaaataactc ttagtaataa aacaaagaaa
     1561 agcaaagatc ctgatatctt atttatgcgc tgccagttga gccgattaca gaaagggcat
     1621 gccacagatg aatggtttct gctcagctcc catataccat taaaaggtat tgaaccaggg
     1681 tccctgcgtg ttcgagcacg atactctatg gaaaaaatca tgccagaaga agagtacagt
     1741 gaatttaaag agcttatact gcaaaaggaa cttcatgtag tctatgcttt atcacatgta
     1801 tgtggacaag accgaacact actggccagc atcctactga ggatttttct tcacgaaaag
     1861 cttgaatcgt tgttgttatg cacactaaat gacagagaaa taagcatgga agatgaagcc
     1921 actaccctat ttcgagccac aacacttgca agcaccttga tggagcagta tatgaaagcc
     1981 actgctacac agtttgttca tcatgctttg aaagactcta ttttaaagat aatggaaagc
     2041 aagcagtctt gtgagttaag tccatcaaag ttagaaaaaa atgaagatgt gaacactaat
     2101 ttaacacacc tattgaacat actttcagag cttgtggaga aaatattcat ggcttcagaa
     2161 atacttccac cgacattgag atatatttat gggtgtttac agaaatctgt tcagcataag
     2221 tggcctacaa ataccaccat gagaacaaga gttgttagtg gttttgtttt tcttcgactc
     2281 atctgtcctg ccatcctgaa tccacggatg ttcaatatca tctcagattc tccatctcct
     2341 attgctgcaa gaacactgat attagtggct aaatctgtgc agaacttagc aaatcttgtg
     2401 gaatttggag ctaaggagcc ctacatggaa ggtgtcaatc cattcatcaa aagcaacaaa
     2461 catcgtatga tcatgttttt agatgaactt gggaatgtac ctgaacttcc ggacactaca
     2521 gagcattcta gaacggacct gtcccgtgat ttagcagcat tgcatgagat ttgcgtggct
     2581 cattcagatg aacttcgaac gctcagtaat gagcgtggtg cacagcagca cgtattgaaa
     2641 aagcttctgg ctataacaga actgcttcaa caaaaacaaa accagtatac aaaaaccaat
     2701 gatgtcaggt agcagccttc gccccagtgt tctgcatgga ttcagcatgt ccaacatggt
     2761 aattcacttc agtttaatgt ctcctttgct cttgccaaaa aatagcacac ttttccacat
     2821 tccagtgatg tgtgagctat gcaaacaaaa tccaagattc tgctggtgaa taactatgcc
     2881 agcaaccttg taagctatct gtgcaggata tttgcactat ttccacatgg aatcaatctt
     2941 taacaacctc tgagccttgg tgtacagacc acctttcaca aaacgaaatg ctatgactgt
     3001 atcttgatat ctcgaacttt caaaatatat tttcagtaca cccagttgcc aaagttttgc
     3061 tgtctcttag agaaagaact atgaaatcaa ctgacaagaa acacattctt attgacaatt
     3121 gtgtataact ggattgcaga ctgttcttac tgtaactact tcctgattag gaatatgacc
     3181 atttgactgt tcaatgatta tttgtattta cagtttccag agtttgtcat tataatagga
     3241 acaatctttg ctgtatactt ttaaaaaata ctctgctatt tctcttgctg gaactgttga
     3301 aagaaaatat atagaatgat ctattgctca tcagctttat tttttaaaca tacgacttat
     3361 tttgttgaaa ttgtcaaaga ctgtatttag atctcataat gctttgttaa atgtttacaa
     3421 gtaaatagtt tgaattcagt aaatattaaa aaaaaa
//
LOCUS       YSCSDH       1665 bp ds-DNA             PLN       11-JUL-1990
DEFINITION  S.cerevisiae succinate dehydrogenase iron-protein subunit (SDH)
            gene, complete cds.
ACCESSION   J05487
KEYWORDS    succinate dehydrogenase iron-protein subunit.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Lombardo,A., Carine,K. and Scheffler,I.E.
  TITLE     Cloning and characterization of the iron-sulfur subunit gene of
            succinate dehydrogenase from Saccharomyces cerevisiae
  JOURNAL   J. Biol. Chem. 265, 10419-10423 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.E.Scheffler, 13-APR-1990.
FEATURES       from  to/span     description
    pept        738     1538     succinate dehydrogenase iron-protein subunit
                                 (SDH) (EC 1.3.99.1)
    signal      585      589     CAAT box
    signal      616      622     TATA box
BASE COUNT      532 a    349 c    345 g    439 t
ORIGIN      Chromosome VII.
        1 atcttacaag taacttaagt caaggcgtga aaagtaccac cactgtgctt gacatgcaaa
       61 agggttgcag agtgcgtcta ccaaggtacg tggaccatga tcaaatcatc aagccttatg
      121 atctacgtga ggcccaagga caatactggc tcaagaccgt gaatggagga gtattatgaa
      181 tgatgaaatc ctgtcgcacg tatattgcca ggcaaagaac tagcagtaat tgtgtcatgt
      241 cagcacattg ctgaggtgca aatggccacc caagagctta ttggagcaca ggatatcttc
      301 atcagggaat tacattggaa agatccggtc ttcaaattaa ctcaagtctc aatccgaata
      361 cttcattccc atcagcgatc ctgaagaacg tcggtccttg tacaggaaca tcgccattgc
      421 tgttagagaa tacaataagt actgtgaagc tatcctatga tcacatatga aagtatatac
      481 ccgcttttgt acactatgta gctataattc aatcgtatta ttgtacgtcc gcacgaccat
      541 gccttagaaa tatccgcagc gcgcaaaagg cggcctcgca ttggcccaat tagctccggt
      601 gtaaaaaggg caaactatat aagggattaa tgactttcta tgagaatgcc aaaaaatgtt
      661 aggctaaagg aagggattga aaggaatata gttgagctat actttcttga aatactggag
      721 tatacatatt tatagggatg ttgaacgtgc tattgagaag gaaggccttt tgtttggtga
      781 cgaagaaggg tatggctact gccacaacag ctgcagctac gcataccccc agattgaaaa
      841 cttttaaagt ttacagatgg aatccagacg agccaagtgc taaacctcat ttacagtcat
      901 atcaagtgga tctgaatgac tgtgggccca tggtacttga tgcgctgtta aagatcaaag
      961 acgaacagga ttctacccta acttttagaa gatcatgtag agaaggtatc tgcggttcat
     1021 gtgccatgaa cattggcggt agaaacacgc tagcttgtat atgtaagatc gaccagaacg
     1081 aatccaaaca actcaagatc tatccattac cccacatgtt tattgtcaaa gatttggtac
     1141 ctgatttaac taacttctac caacaataca aatctatcca accttactta cagagatcat
     1201 cgtttccaaa ggatggaacg gaagtgctac aaagtattga agatcgtaag aaactggatg
     1261 gtctttacga atgtattctg tgtgcatgct gctctacttc atgtccatcg tactggtgga
     1321 accaagaaca gtatttgggc cctgccgtgc taatgcaagc ctaccgttgg ctaattgact
     1381 ctagagacca agctacaaag acaagaaagg ccatgctaaa caactccatg tcattgtaca
     1441 gatgtcacac catcatgaac tgtactagaa cttgtccaaa gggcttgaat cctggtttgg
     1501 ctattgctga aattaagaaa tctttggcat ttgcctagac tatcagaaaa acagctagcc
     1561 ccgaagaact cagaagcctc tcaaatgatt ttggcactaa taaaagcacc aactattatt
     1621 attattattt tcaaggacga aactcaccat tctcacacat tcctt
//
LOCUS       BOVPDEAP      585 bp ss-mRNA            MAM       11-JUL-1990
DEFINITION  Bovine cone photoreceptor cyclic nucleotide phosphodiesterase
            alpha'-subunit (PDE), partial cds.
ACCESSION   M33140 M29465
KEYWORDS    cone photoreceptor cyclic nucleotide phosphodiesterase.
SOURCE      Bovine dark-adapted frozen retina, cDNA to mRNA, clone BC-alpha-1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 585)
  AUTHORS   Charbonneau,H., Prusti,R.K., LeTrong,H., Sonnenburg,W.K.,
            Mullaney,P.J., Walsh,K.A. and Beavo,J.A.
  TITLE     Identification of a noncatalytic cGMP-binding domain conserved in
            both the cGMP-stimulated and photoreceptor cyclic nucleotide
            phosphodiesterases
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 288-292 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   585     cone photoreceptor cyclic nucleotide
                                 phosphodiesterase alpha'-subunit (AA at 1)
BASE COUNT      198 a    115 c    126 g    146 t
ORIGIN      
        1 agagaagtca tcttttataa aatcatcgat tacattttac atggaaaaga agagatcaaa
       61 gtcattccga cacctcccat ggaccactgg actctcatta gtgggttgcc aacatatgtt
      121 gctgaaaatg gatttatctg caacatgctg aacgccccgg cggatgaata cttcacgttt
      181 cagaaaggac ctgtagatga aactggctgg gtcattaaaa atgtcttgtc cctgcctatt
      241 gtcaacaaaa aggaagacat cgtgggcgta gctacatttt acaacaggaa ggatggaaag
      301 ccttttgatg aatatgatga gcacatcgct gagactctca cacagtttct tggatggtct
      361 ctcttaaata ctgacaccta tgagaaaatg aataagctgg agaacagaaa ggacatagcc
      421 caggaaatgc tcatgaacca caccaaggct acacctgatg agatcaagtc tattttgaaa
      481 tttaaagaga agttaaatat agatgtaatt gaagactgtg aagaaaaaca gcttgtcaca
      541 attttgaagg aggacctgcc agacccacgg actgcagacc tgtat
//
LOCUS       CHKG1CLSE     240 bp ds-DNA             VRT       11-JUL-1990
DEFINITION  Chicken delta-1-crystallin gene, intron 3 lens-specific enhancer
            cor segments B3 and B4.
ACCESSION   M33954
KEYWORDS    delta-1-crystallin.
SOURCE      Chicken DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 240)
  AUTHORS   Goto,K., Okada,T.S. and Kondoh,H.
  TITLE     Functional cooperation of lens-specific and nonspecific elements in
            the delta-1-crystallin enhancer
  JOURNAL   Mol. Cell. Biol. 10, 958-964 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site         17      116     core segment B3
    site        112      235     core segment B4
BASE COUNT       61 a     52 c     53 g     74 t
ORIGIN      
        1 gtcagtgagg tgtgctcagc atgacctgcc ctcccaccct cttcagactg aacattcctg
       61 aggaattgtt tcagtatgaa ttaggaatat tctttttcca atggcacttg ggatcccttt
      121 gtgtctggct gcctgagtta gtagaagaca atgcacaata ttgtataggg gtgaagaaga
      181 gtcagccact aagcactttt tctgaaatat tcattgttgt tgctcaccta ccatggacaa
//
LOCUS       CHKOVAL      9206 bp ds-DNA             VRT       11-JUL-1990
DEFINITION  Chicken ovalbumin gene, complete cds.
ACCESSION   J00895
KEYWORDS    ovalbumin.
SOURCE      Chicken oviduct DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1343 to 8906)
  AUTHORS   Woo,S.L.C., Beattie,W.G., Catterall,J.F., Dugaiczyk,A., Staden,R.,
            Brownlee,G.G. and O'Malley,B.W.
  TITLE     Complete nucleotide sequence of the chicken chromosomal ovalbumin
            gene and its biological significance
  JOURNAL   Biochemistry 20, 6437-6446 (1981)
  STANDARD  full staff_review
REFERENCE   2  (bases 1043 to 1562; 2675 to 4732; 8885 to 9206)
  AUTHORS   Benoist,C., O'Hare,K., Breathnach,R. and Chambon,P.
  TITLE     The ovalbumin gene-sequence of putative control regions
  JOURNAL   Nucleic Acids Res. 8, 127-142 (1980)
  STANDARD  full staff_review
REFERENCE   3  (bases 1357 to 1389; 2941 to 3052; and ivs junctions)
  AUTHORS   Breathnach,R., Benoist,C., O'Hare,K., Gannon,F. and Chambon,P.
  TITLE     Ovalbumin gene: evidence for a leader sequence in mRNA and DNA
            sequences at the exon-intron boundaries
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75, 4853-4857 (1978)
  STANDARD  full staff_review
REFERENCE   4  (bases 1282 to 1420; 2952 to 2999)
  AUTHORS   Gannon,F., O'Hare,K., Perrin,F., LePennec,J.P., Benoist,C.,
            Cochet,M., Breathnach,R., Royal,A., Garapin,A., Cami,B. and
            Chambon,P.
  TITLE     Organisation and sequences at the 5' end of a cloned complete
            ovalbumin gene
  JOURNAL   Nature 278, 428-434 (1979)
  STANDARD  full staff_review
REFERENCE   5  (bases 1343 to 8906; exons only)
  AUTHORS   McReynolds,L., O'Malley,B.W., Nisbet,A.D., Fothergill,J.E.,
            Givol,D., Fields,S., Robertson,M. and Brownlee,G.G.
  TITLE     Sequence of chicken ovalbumin mRNA
  JOURNAL   Nature 273, 723-728 (1978)
  STANDARD  full staff_review
REFERENCE   6  (bases 2675 to 5042)
  AUTHORS   Robertson,M.a., Staden,R., Tanaka,Y., Catterall,J.F.,
            O'Malley,B.W. and Brownlee,G.G.
  TITLE     Sequence of three introns in the chick ovalbumin gene
  JOURNAL   Nature 278, 370-372 (1979)
  STANDARD  full staff_review
REFERENCE   7  (bases 1 to 1042)
  AUTHORS   Heilig,R., Muraskowsky,R. and Mandel,J.L.
  TITLE     The ovalbumin gene family: The 5' end region of the X and Y genes
  JOURNAL   J. Mol. Biol. 156, 1-19 (1982)
  STANDARD  full staff_review
REFERENCE   8  (bases 5576 to 5624)
  AUTHORS   Lai,E.C., Woo,S.L.C., Dugaiczyk,A. and O'Malley,B.W.
  TITLE     The ovalbumin gene: Alleles created by mutations in the intervening
            sequences of the natural gene
  JOURNAL   Cell 16, 201-211 (1979)
  STANDARD  simple staff_entry
REFERENCE   9  (bases 443 to 611)
  AUTHORS   Schweers,L.A., Frank,D.E., Weigel,N.L. and Sanders,M.M.
  TITLE     The steroid-dependent regulatory element in the ovalbumin gene does
            not function as a typical steroid response element
  JOURNAL   J. Biol. Chem. 265, 7590-7595 (1990)
  STANDARD  simple staff_entry
COMMENT     Eight exons reported. Sequence homologies with adenovirus early 1a,
            adenovirus major late, chicken conalbumin, chkx and chky genes
            noted for 5' flanking sequence.
            
            OV1.3 is identical to OV1.8 except that intron E is shorter by 522
            nucleotides [8].
FEATURES       from  to/span     description
    pept       2996     3163     ovalbumin, exon 1
               3415     3465     ovalbumin, exon 2
               4047     4175     ovalbumin, exon 3
               4576     4693     ovalbumin, exon 4
               5652     5794     ovalbumin, exon 5
               6126     6281     ovalbumin, exon 6
               7864     8259     ovalbumin, exon 7
    pre-msg    1343     8906     oval mRNA and introns
    IVS        1390     2978     oval intron A
    IVS        3164     3414     oval intron B
    IVS        3466     4046     oval intron C
    IVS        4176     4575     oval intron D
    IVS        4694     5651     oval intron E
    IVS        5795     6125     oval intron F
    IVS        6282     7863     oval intron G
    allele     1282     1282     g may be c [1]
    allele     1309     1309     g may be a [1]
    allele     1376     1376     g may be c [1],[7]
    allele     1385     1385     g may be a [1],[7]
    allele     1393     1393     a may be g [1]
    allele     3010     3010     t may be c [1]
    allele     3154     3154     a may be g [1]
    allele     5747     5747     g may be a [1]
    allele     8032     8032     can be g [5]
    conflict   1471     1471     c in [1]; t in [2],[7]
    conflict   1523     1523     a in [1]; t in [2],[7]
    conflict   1538     1539     at in [1]; a in [2]
    conflict   2681     2681     t in [1],[6]; g in [2]
    conflict   2696     2696     a in [1],[6]; g in [2]
    conflict   3552     3552     a in [1],[6]; g in [2]
    conflict   3683     3683     c in [1],[6]; t in [2]
    conflict   3934     3934     a in [1],[6]; c in [2]
    conflict   3960     3960     a in [1],[6]; g in [2]
    conflict   3994     3994     t in [1],[6]; c in [2]
    conflict   4207     4208     tt in [1],[6]; ttt in [2]
    conflict   4396     4396     t in [1],[6]; c in [2]
    conflict   5636     5636     c in [1]; g in [3]
    conflict   6100     6100     t in [1]; tc in [3]
    conflict   6119     6119     t in [1]; tg in [3]
    conflict   8492     8505     gactcacagtactg in [1]; g in [5]
    site        443      611     steroid-dependent regulatory element [9]
BASE COUNT     2994 a   1749 c   1721 g   2742 t
ORIGIN      5 bp upstream of PstI site.
        1 ctgcagactg acatgcattt cataggtaga gataacattt actgggaagc acatctatca
       61 tcataaaaag caggcaagat tttcagactt tcttagtggc tgaaatagaa gcaaaagacg
      121 tgattaaaaa caaaatgaaa caaaaaaaat cagttgatac ctgtggtgta gacatccagc
      181 aaaaaaatat tatttgcact accatcttgt cttaagtcct cagacttggc aaggagaatg
      241 tagatttcta cagtatatat gttttcacaa aaggaaggag agaaacaaaa gaaaatggca
      301 ctgactaaac ttcagctagt ggtataggaa agtaattctg cttaacagag attgcagtga
      361 tctctatgta tgtcctgaag aattatgttg tacttttttc ccccattttt aaatcaaaca
      421 gtgctttaca gaggtcagaa tggtttcttt actgtttgtc aattctatta tttcaataca
      481 gaacaatagc ttctataact gaaatatatt tgctattgta tattatgatt gtccctcgaa
      541 ccatgaacac tcctccagct gaatttcaca attcctctgt catctgccag gccattaagt
      601 tattcatgga agatctttga ggaacactgc aagttcatat cataaacaca tttgaaattg
      661 agtattgttt tgcattgtat ggagctatgt tttgctgtat cctcagaaaa aaagtttgtt
      721 ataaagcatt cacacccata aaaagataga tttaaatatt ccagctatag gaaagaaagt
      781 gcgtctgctc ttcactctag tctcagttgg ctccttcaca tgcatgcttc tttatttctc
      841 ctattttgtc aagaaaataa taggtcacgt cttgttctca cttatgtcct gcctagcatg
      901 gctcagatgc acgttgtaga tacaagaagg atcaaatgaa acagacttct ggtctgttac
      961 tacaaccata gtaataagca cactaactaa taattgctaa ttatgttttc catctctaag
     1021 gttcccacat ttttctgttt tcttaaagat cccattatct ggttgtaact gaagctcaat
     1081 ggaacatgag caatatttcc cagtcttctc tcccatccaa cagtcctgat ggattagcag
     1141 aacaggcaga aaacacattg ttacccagaa ttaaaaacta atatttgctc tccattcaat
     1201 ccaaaatgga cctattgaaa ctaaaatcta acccaatccc attaaatgat ttctatggcg
     1261 tcaaaggtca aacttctgaa gggaacctgt gggtgggtca caattcaggc tatatattcc
     1321 ccagggctca gccagtgtct gtacatacag ctagaaagct gtattgcctt tagcagtcaa
     1381 gctcgaaagg taagcaactc tctggaatta ccttctctct atattagctc ttacttgcac
     1441 ctaaacttta aaaaattaac aattattgtg ctatgtgttg tatctttaag ggtgaagtac
     1501 ctgcgtgata ccccctataa aaacttctca cctgtgtatg cattctgcac tattttatta
     1561 tgtgtaaaag ctttgtgttt gttttcagga ggcttattct ttgtgcttaa aatatgtttt
     1621 taatttcaga acatcttatc ctgtcgttca ctatctgata tgctttgcag tttgcttgat
     1681 taacttctag ccctacagag tgcacagaga gcaaaatcat ggtgttcagt gaattctggg
     1741 gagttatttt aatgtgaaaa ttctctagaa gtttaattcc tgcaaagtgc agctgctgat
     1801 cactacacaa gataaaaatg tggggggtgc ataaacgtat attcttacaa taatagatac
     1861 atgtgaactt atatacagaa aagaaaatga gaaaaatgtg tgtgtgtata ctcacacacg
     1921 tggtcagtaa aaacttttga ggggtttaat acagaaaatc caatcctgag gccccagcac
     1981 tcagtacgca tataaagggc tgggctctga aggacttctg actttcacag attatataaa
     2041 tctcaggaaa gcaactagat tcatgctggc tccaaaagct gtgctttata taagcacact
     2101 ggctatacaa tagttgtaca gttcagctct ttataataga aacagacaga acaagtataa
     2161 atcttctatt ggtctatgtc atgaacaaga attcattcag tggctctgtt ttatagtaaa
     2221 cattgctatt ttatcatgtc tgcatttctc ttctgtctga atgtcaccac taaaatttaa
     2281 ctccacagaa agtttatact acagtacaca tgcatatctt tgagcaaagc aaaccatacc
     2341 tgaaagtgca atagagcaga atatgaatta catgcgtgtc tttctcctag actacatgac
     2401 cccatataaa ttacattact tatctattct gccatcacca aaacaaaggt aaaaatactt
     2461 ttgaagatct actcatagca agtagtgtgc aacaaacaga tatttctcta catttatttt
     2521 tagggaataa aaataagaaa taaaatagtc agcaagcctc tgctttctca tatatctgtc
     2581 caaacctaaa gtttactgaa atttgctctt tgaatttcca gttttgcaag cctatcagat
     2641 tgtgttttaa tcagaggtac tgaaaagtat caatgaattc tagctttcac tgaacaaaaa
     2701 tatgtagagg caactggctt ctgggacagt ttgctaccca aaagacaact gaatgcaaat
     2761 acataaatag atttatgaat atggttttga acatgcacat gagaggtgga tatagcaaca
     2821 gacacattac cacagaatta ctttaaaact acttgttaac atttaattgc ctaaaaactg
     2881 ctcgtaattt actgttgtag cctaccatag agtaccctgc atggtactat gtacagcatt
     2941 ccatccttac attttcactg ttctgctgtt tgctctagac aactcagagt tcaccatggg
     3001 ctccatcggt gcagcaagca tggaattttg ttttgatgta ttcaaggagc tcaaagtcca
     3061 ccatgccaat gagaacatct tctactgccc cattgccatc atgtcagctc tagccatggt
     3121 atacctgggt gcaaaagaca gcaccaggac acaaataaat aaggtgagcc tacagttaaa
     3181 gattaaaacc tttgccctgc tcaatggagc cacagcactt aattgtatga taatgtccct
     3241 tggaaactgc atagctcaga ggctgaaaat ctgaaaccag agttatctaa aagtgtggcc
     3301 acctccaact cccagagtgt tacccaaatg cactagctag aaatcttgaa actggattgc
     3361 ataacttctt tttgtcataa ccattatttc agctactatt attttcaatt acaggttgtt
     3421 cgctttgata aacttccagg attcggagac agtattgaag ctcaggtaca gaaataattt
     3481 cacctccttc tctatgtccc tttcctctgg aagcaaaata cagcagatga agcaatctct
     3541 tagctgttcc aagccctctc tgatgagcag ctagtgctct gcatccagca gttgggagaa
     3601 cactgttcat aagaacagag aaaaagaagg aagtaacagg ggattcagaa caaacagaag
     3661 ataaaactca ggacaaaaat accgtgtgaa tgaggaaact tgtggatatt tgtacgctta
     3721 agcaagacag ctagatgatt ctggataaat gggtctggtt ggaaaagaag gaaagcctgg
     3781 ctgatctgct ggagctagat tattgcagca ggtaggcagg agttccctag agaaaagtat
     3841 gagggaatta cagaagaaaa acagcacaaa attgtaaata ttggaaaagg accacatcag
     3901 tgtagttact agcagtaaga cagacaggat gaaaaatagt tttgtaaaca gaagtatcta
     3961 actactttac tctgttcata cactacgtaa aacttactaa gtaataaaac tagaataaca
     4021 acatctttct ttctctttgt attcagtgtg gcacatctgt aaacgttcac tcttcactta
     4081 gagacatcct caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta
     4141 gactttatgc tgaagagaga tacccaatcc tgccagtaag ttgctctaaa atctgatctg
     4201 agtgtattcc atgccaaagc tctaccattc tgtaatgcaa aaacagtcag agttccacat
     4261 gtttcactaa gaaaatttct ttttctcttg tttttacaaa tgaaagagag gacaaataac
     4321 atttctctat caccgacctg aaactctaca gtcttcagag aatgaatggc ttgctaaaag
     4381 aatgtcaaat cttactatac agctatttca tattacacta ctaaatacac tataaggcat
     4441 agcatgtagt aatacagtgt aaaatagctt tttacactac tatattatta atatctgtta
     4501 attccagtct tgcatttcac atttgcaaaa cgttttgaaa ttcgtatctg aaagctgaat
     4561 actcttgctt tacaggaata cttgcagtgt gtgaaggaac tgtatagagg aggcttggaa
     4621 cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc ctgggtagaa
     4681 agtcagacaa atggtaaggt agaacatgct ttgtacatag tgagagttgg ttcaccctaa
     4741 tactgagaac ttggatatag ctcagccagc gtgctttgcg ttcaagctta ccagagctgt
     4801 tgtatgcctg ttaagcaggg catacagtca tgaggctctt gaaaaatctt aacagacaaa
     4861 gggcaatgga aaatcggagt taagggatgg tagggataaa atgcatagaa agaggtacca
     4921 caattttgat ttttgcccta atgcctctct gcgtggttcc tcaatttttc tacttcattc
     4981 ctcatctcct cagagcattc ctttccctca tgcttgaaac acagatgaaa gactgtgaat
     5041 tctaactgag atgaaaacat ccacaaccac acaacctctg gtgtggagtc acattctgtg
     5101 aaggcaaaaa ctaggccacg taatctatgc gtgcaagcta cgcgtaagct atgtgtgtga
     5161 caggacaatg tgaggaacat actatgtgca caaggactgc agaataaaca ggagcaaagt
     5221 ttttgaagaa aacagagtaa aatcctgttt tcctcttttg ttacattctt tacatatatc
     5281 tcaaatttcc tctttggtta gaagcaagta atatttatgt ttcttggtac tgtttgggtt
     5341 gaagaccatt ctgggataag agaaattcca gtggttcttc ccctaatcat aaaatgtcag
     5401 gtttagtttt tttgtaacac agaaatctct tcatctttta tcttttgttg tgattcttga
     5461 tagagagaga aacaagactt actgacaata gcagcaagaa aatcaatctt ggaagaacaa
     5521 gattgcaatt gcaaaaacaa accaatgtcc ttgcccctac atcctcttcc ccataaattc
     5581 tacattctct atctaccttg tgcttgccaa catgatatac gtaaactctc ttttcctatt
     5641 cattcttaaa ggaattatca gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc
     5701 aatggttctg gttaatgcca ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga
     5761 agacacacaa gcaatgcctt tcagagtgac tgaggtatat gggcatacct tagagatgta
     5821 atctagaatt tatgaagaga gtagacatgt tgttatatga acactgcatt agcgtatctg
     5881 ctcatttgtc tgcatctctt tcagacactg tgttaaaagc agggaatttt ccttatgtct
     5941 ctctcgtcac aatattcctg acattgcaaa gctcctgaga aataacttca gattccactt
     6001 ttcctaggaa ggcttctgga tgagaactaa tcatcttaac tgtaactaga catttctgca
     6061 tccaagaata atctttgtta aaactatatt ctctctctct tttttttttt tttttggttc
     6121 tccagcaaga aagcaaacct gtgcagatga tgtaccagat tggtttattt agagtggcat
     6181 caatggcttc tgagaaaatg aagatcctgg agcttccatt tgccagtggg acaatgagca
     6241 tgttggtgct gttgcctgat gaagtctcag gccttgagca ggtatggccc tagaagttgg
     6301 cttcagaata ttaaaaacac atggaaattt agctgttgta aagctctttt caacacagtt
     6361 atcctaaaac atttaaccag cacaaatttc atcatgattc aatatgtgat tgttgcatag
     6421 aagtgtagat ttgtcccact gggtcctgca atagcccatg ctgagcatgg cttgctgaaa
     6481 gaactgcttt agagggtgaa aagtttgaca cagcagacaa gatgattctc acctaagcag
     6541 ctgttactgt agtggcttga actctaaagg tcttgtatct ccattcctgt gcactgagga
     6601 gcttcttgga aagttcatat aaggtttact agttctaact attatctcat ttggtggcac
     6661 tcaatgtgct ttgttcacgt cttcataaat taatctatct aaaaattgga tgtggttaaa
     6721 gcaatttcag aaataacatg tacataatgt acaattattg atatgaacag aacacaggca
     6781 tagcatattg taattaggag gactgtagtt attttgaata ggaaacacaa tgtaataaat
     6841 gagaattcat tgaaatgtta gtatgctaac tcaatctaaa ttataaagat aaagaggcat
     6901 ttaatcacag ctagatttcc atcacttgtg acagacaggc atatgaatga ttatgtacag
     6961 ctctaggaaa aaaagtatgt aggaaaacta gtacattttg attagaaagt ctgaaaatga
     7021 ggtgccttga tcaaagagaa tacgtgtgtt tgagaaaaaa aaagtttgga tagaggtggt
     7081 aagagagaat atattgaaat ggtgtttcta caaactgcca tggccagatt tgtgtaagag
     7141 acattcagta agtaggcaag gaaagaaata ttactaggta caaagcaaca tcagtaatac
     7201 caaaagaaac caattattcc agatgccaat ctcgtaatag ggttaagaga tttccacccc
     7261 tctagtggtc accagtgcaa ccagtaactt tgctaattta cattttcttt ttttaaatgg
     7321 cagatatagc tttgaactga gtgatcatga actggtactg tgtaatagat gaagacatac
     7381 ttgacgacta aacttctgat ttttaaaaac tcaaattctc ttgaaagatc agttcccagt
     7441 ctagtaacag ctgatagttt aagtatcagt aattggctac cattaacaac tggctcctga
     7501 gaggtcttaa atgtagagac agctttaaac tcaaaagcac agagtgattt ttagaataga
     7561 tttcccaagc aaagaaaata aacagggagg agctttaagg gagtagccat ctcattatta
     7621 ttattattta aagaaatggc agcaagccta caaaagaaaa ataagacaga gcagagaaga
     7681 aagagtcatg gtatgctttt ctatcttagc aaaattaatc tctacatgcc taggaaaaag
     7741 ccatgacaag agcaatcagt tcaaaaggtg tatgcaaaaa accacataat agtaactagt
     7801 actgcattgc caggaaggaa gttatgtcgc cattccatgg atctcattct catttccttg
     7861 cagcttgaga gtataatcaa ctttgaaaaa ctgactgaat ggaccagttc taatgttatg
     7921 gaagagagga agatcaaagt gtacttacct cgcatgaaga tggaggaaaa atacaacctc
     7981 acatctgtct taatggctat gggcattact gacgtgttta gctcttcagc caatctgtct
     8041 ggcatctcct cagcagagag cctgaagata tctcaagctg tccatgcagc acatgcagaa
     8101 atcaatgaag caggcagaga ggtggtaggg tcagcagagg ctggagtgga tgctgcaagc
     8161 gtctctgaag aatttagggc tgaccatcca ttcctcttct gtatcaagca catcgcaacc
     8221 aacgccgttc tcttctttgg cagatgtgtt tccccttaaa aagaagaaag ctgaaaaact
     8281 ctgtcccttc caacaagacc cagagcactg tagtatcagg ggtaaaatga aaagtatgtt
     8341 atctgctgca tccagacttc ataaaagctg gagcttaatc tagaaaaaaa atcagaaaga
     8401 aattacactg tgagaacagg tgcaattcac ttttccttta cacagagtaa tactggtaac
     8461 tcatggatga aggcttaagg gaatgaaatt ggactcacag tactgagtca tcacactgaa
     8521 aaatgcaacc tgatacatca gcagaaggtt tatgggggaa aaatgcagcc ttccaattaa
     8581 gccagatatc tgtatgacca agctgctcca gaattagtca ctcaaaatct ctcagattaa
     8641 attatcaact gtcaccaacc attcctatgc tgacaaggca attgcttgtt ctctgtgttc
     8701 ctgatactac aaggctcttc ctgacttcct aaagatgcat tataaaaatc ttataattca
     8761 catttctccc taaactttga ctcaatcatg gtatgttggc aaatatggta tattactatt
     8821 caaattgttt tccttgtacc catatgtaat gggtcttgtg aatgtgctct tttgttcctt
     8881 taatcataat aaaaacatgt ttaagcaaac acttttcact tgtagtattt gaagtacagc
     8941 aaggttgtgt agcagggaaa gaatgacatg cagaggaata agtatggaca cacaggctag
     9001 cagcgactgt agaacaagta ctagtgggtg agaagttgaa caagagtccc ctacaagcaa
     9061 cttaatctaa taagctagtg gtctacatca gctaaaagag catagtgagg gatgaaattg
     9121 gttctccttt ctaagcatca cctgggacaa ctcatctgga gcagtgtgtc caatctgccg
     9181 ctgccctgat ctcggctggg gtgatg
//
LOCUS       PMUCEN        150 bp ds-DNA             PHG       11-JUL-1990
DEFINITION  Bacteriophage Mu wild type DNA fragment with a gyrase cleavage
            site.
ACCESSION   M32302
KEYWORDS    .
SOURCE      Bacteriophage Mu (wild type) DNA.
  ORGANISM  Bacteriophage mu
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 150)
  AUTHORS   Pato,M., Howe,M. and Higgins,P.
  TITLE     DNA gyrase binds to a centrally located replication enhancer (CEN)
            in the bacteriophage Mu genome
  JOURNAL   Unpublished (1990) In Press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by M.L.Pato, 23-FEB-1990.
FEATURES       from  to/span     description
    site         49       50     DNA gyrase cleavage site in complimentary
                                 strand
    mut          44       44     a in wt; g in Mu nuB103
    mut          50       50     g in wt; c in Mu nuB1
BASE COUNT       40 a     38 c     27 g     45 t
ORIGIN      Map position at 18.0 kb.
        1 acgcgtcagc gccgctctga ggcaataaac agaatcaggc ataaaatcag ccgcacagat
       61 tttttaaaac gcgccacggg atttttaaac cggtatttaa cggtgtatga atcccgtttt
      121 atcttccttt cactttcttt ctccagtact
//
LOCUS       RATRNRTR     2577 bp ds-DNA             ROD       11-JUL-1990
DEFINITION  Rat snRNP-associated polypeptide N, complete cds.
ACCESSION   J05497
KEYWORDS    snRNP-associated polypeptide N.
SOURCE      Rat male adult (Fisher) DNA, clones rgV and rgIII2.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2577)
  AUTHORS   Schmauss,C. and Lerner,M.R.
  TITLE     The closely related small nuclear ribonucleoprotein polypeptides N
            and B/B' are distinguishable by antibodies as well as by
            differences in their mRNAs and gene structures
  JOURNAL   J. Biol. Chem. 265, 10733-10739 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by C.Schmauss, 13-APR-1990.
FEATURES       from  to/span     description
    pept       1314     2036     snRNP-associated polypeptide N
    signal      774      779     TATA box
    signal      698      702     CAAT box
BASE COUNT      822 a    537 c    581 g    637 t
ORIGIN      
        1 taactagaga actgagaaca gaatccctgt tagaggaatt agagaccaaa ttaaaagagg
       61 tgaaggggct tgcaacccca ttagaacaac aatgccaacc aaccagagct cccagggact
      121 aaaccactac ccaaagacta cacatggctc caactgcata tgtagcagag gatggcctag
      181 ttgggcacaa tggaaagaga agcccttgga cctgcccagg ctggacccac cagtgtagag
      241 taatgtctga ggggtagaag caggtggttg ggaatgggaa tacccttata tgtgaaggga
      301 agcaggatga ggtagggaac ttatgttcgt aaaccaggaa agggaataac gtttgaaatg
      361 taaataaaaa tatatccagt gaaaaaaaaa ctgaagtcta taataaaagc ttttaatcct
      421 ctcagccctt aataaaagtt aattatatta cttatgttaa aaaaaacata aaacagcatg
      481 gtattgtact tttttttttc agacaaaagg tctatggcac acagccaaat cagtgacctc
      541 tggggcacaa tttccagaaa tcaacatcct agagttgacc tctggcttcc atgtatacgt
      601 gcatgcacac acacatatgc atatacatac aaaattacat atatgcagtt gtctaaatca
      661 tatgaagaat ttcaagttgt tttaagttta atatcagcaa atacatgcaa atgtgattat
      721 aagaagctgg atggaatcct gagttgttga ctaaagagct aagaaggggc aattataaaa
      781 caaaaatgac acatgaaatt ccacccgagg ttagaaataa ttaaagaagg ccattgcggc
      841 aagtctagca cagagagtag agggtgctgg aggatgacag acggttggtt ctgaggaggg
      901 attttgcaac gaatggagcg aggaagggat cgtttacact tgagaagaac tactgaacag
      961 cacgtcccag agattgaggt ccaggtcaaa cgtagaagga cagcctcact gagcaaccaa
     1021 gagtgtcact tgtacccacg gcattctcag caacagcaaa ttcctgtggt ggatttccag
     1081 gcagaactga gacaggcgtt cttagctgag acaccaagag gtggttaaag cagtattgga
     1141 acttcaaggt ggtggaagtc aacaaacaca ggacctatcc actgattgtg aaactttggt
     1201 caagcttaca ctgtgttaat aaccctgcat caaaccttta tttattgccc ttccccaagt
     1261 tttaaggatc ttgtaatttt agtgttgaca actgctattg tggaacagca atcatgactg
     1321 tgggtaagag tagcaagatg ctgcagcata ttgactatag aatgagatgt atcctgcaag
     1381 atggaagatt cttcattggc acctttaagg cttttgacaa gcatatgaat ttgatcctct
     1441 gtgattgtga tgagttcagg aagatcaagc caaagaatgc aaaacagcca gaacgtgaag
     1501 aaaaacgggt tttgggtctg gtcttgctac gtggagagaa cttggtttcc atgacagtgg
     1561 agggtccacc tcctaaagat actggcattg ctcgtgtgcc acttgctagt gctgcaggtg
     1621 gccctggtgt tggaagagca gctggcagag gagtaccagc aggtgtacct attccccaag
     1681 ctcctgctgg attagcaggc cctgtccgag gagtgggagg cccatcccag caggtcatga
     1741 ccccacaggg aagaggcact gttgcagctg ctgctgttgc tgctactgct agcattgcag
     1801 gagccccaac ccagtacccg ccaggacggg gaactccacc tccacctgta ggcagagcaa
     1861 ccccacctcc aggcattatg gctcctccac ctggaatgag accacccatg ggcccaccaa
     1921 ttggacttcc ccctgctcaa gggagaccta taggcatgcc ccctccagga atgagactcc
     1981 ctcctccagg aattagaggc ccacctcccc caggaatgcg tccaccaaga ccctaagata
     2041 cagttgataa atctcagccc ttctctttcc ctacaatgct tcttgtgaaa ttgtgtcgcc
     2101 tgcaagcttt tgacccctct tactgcatta actatagata ataaatacat agcgcaattg
     2161 aattgaaaaa aaaagaaata attaaagaaa gtaagtcaca atgactattt gctattgaca
     2221 ttttttttaa atgcccgaat gagagccagt ggagacgata gaaagtccag aagaagctaa
     2281 gataatttca aaacacataa tgtcagtaga acgagggaag gtaagaaccc acagaacaca
     2341 agaaaccact catgaaactc ctcacacaca ggaagaaaag gaagaatgta atttttaaaa
     2401 aaaaagttat agtcaagtta aactatattt tctcattggt ttttttttgt gactttgtat
     2461 ttatttttat gtttctttgt gtatattgta catgtctcag tcaaaggcca acggtgagtg
     2521 ttttcctcta aaaaacctta ttgtttaaga cagggtctct tcctgagctc agaattc
//
LOCUS       WUCSSP       1323 bp ds-DNA             INV       11-JUL-1990
DEFINITION  W.bancrofti species specific DNA fragment.
ACCESSION   M27140
KEYWORDS    .
SOURCE      W.bancrofti DNA, clone IWb35.
  ORGANISM  Wuchereria bancrofti
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 1323)
  AUTHORS   Dissanayake,S. and Piessens,W.F.
  TITLE     Cloning and characterization of a Wuchereria bancrofti-specific DNA
            sequence
  JOURNAL   Mol. Biochem. Parasitol. 39, 147-150 (1990)
  STANDARD  simple staff_entry
BASE COUNT      399 a    204 c    251 g    469 t
ORIGIN      1288 bp upstream of SacI site.
        1 gatctctgtt tcattatacc gagtaaatat tggagaaaag aaaaatttgt tcaacgtgtt
       61 aaagattaac ttgctttcta tataatggaa acattttgca tattggatta gtcagtaaat
      121 taataatgga caattgtgat aagtaaaact aaaaagacat cgtcactctc ttccttatta
      181 tagcatttcc ttgcttaaaa ccacttgcga cgtcactttt tgttataaat catatggtga
      241 atacttttcc tcatttaaga tcgtttatta gcttttgcat tacaaattgt tcattttagt
      301 tgtgaacgca ttttgtacat ttaaatgctt gctttagaat tttaggtttc aactggtacg
      361 tttatgccgt ttatatgaaa ttatgggata acaaagaaaa ataaagataa agaagtaaaa
      421 attcgaatga ttaaatgaat tattagtacc ctgattgcta tagccctttt ctacgttttg
      481 gcaagaagtc ccaaattggt tctcactttt cagaatgaaa atttttagtt gtttatagcg
      541 ccaaaagaaa tgattaacag cagtttggct ttgtggacgg aatgatatgc ttttctgcat
      601 acctttcata aattggaaaa aacaaaataa tttggctaag agtgaatgga gtattcgttc
      661 gtttgtgata ttttcaatgt ttgttgatgt atattcgaag cgtctctgct cactactgtc
      721 aaaccctttt taagaacgtt gcttctacgg tcactgggca gctactacgt attgagtgag
      781 cgatatgaaa agaatataca gtatctaatg actgccaatg tcaaataaat ttttgtatcg
      841 tcactcagcg gtcacaaatg tttcataaat atttcacatg cattctattt taggttcaaa
      901 tatgctttta aaattctgct aaatttgcaa actaacgaga ttttgtttgg cagctcttct
      961 tatgataacg cagttcaatc ctggtggtga agaatttgcc acagtcttcg cattttggat
     1021 gaggttcatg cgtttgtttg tgtttgtgaa atgttgattt atggtcgaat gtccgcccac
     1081 aaccgggtac tttgcattca tagatgaaag gctgaccgtg tgtttcctat gtgttatata
     1141 ttcgttgtaa ttgttcgtga tcaataggaa acaactggca ggatggcaga ttttaataca
     1201 accatatcaa taattatatt aaatgtaaat gttctagctg ggtagagtgg cgtgcatctg
     1261 tagtctcggc cacttggaag actgagctca ggaagattac ttgcacccag gagcttgagg
     1321 agc
//
LOCUS       YSCHXT2      2890 bp ds-DNA             PLN       11-JUL-1990
DEFINITION  S.cerevisiae high affinity hexose transporter-2 (HXT2) gene,
            complete cds.
ACCESSION   M33270
KEYWORDS    high affinity hexose transporter-2.
SOURCE      S.cerevisiae (isogenic strain to S288C) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2890)
  AUTHORS   Kruckeberg,A.L. and Bisson,L.F.
  TITLE     The HXT2 gene of Saccharomyces cerevisiae is required for high
            affinity glucose transport
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.L.Kruckeberg, 26-MAR-1990.
FEATURES       from  to/span     description
    pept        818     2443     high affinity hexose transporter-2
BASE COUNT      808 a    571 c    535 g    976 t
ORIGIN      
        1 aaaaagaaat attattcatt actatcaaga taccgtagaa aagaaaaaga accggggatg
       61 aataataaca aaacgggctg ctttttcttt ttctctttct ttttcatttg gtccctctcc
      121 actctttctc cacgtggctt tgcttcccgt atttttcttc gtcagagaga ctacatgata
      181 gtccaaagaa aagaaacagg ggggacgaag aagaggagag gaaaaaccaa aatataattt
      241 tccgtgaaat agattctttt tctccactgc acgacttctt ctcctcccac aaaaaatgac
      301 gcctcataga cagccccgca gcttcacttt taagtttctt tttctcctca cggcgcaacc
      361 gctaacttaa gctaatcctt atgaatccgg agaaaagcgg ggtcttttaa ctcaataaaa
      421 ttttccgaaa tcctttttcc tacgcgtttt cttcgggaac tagataggtg gctcttccac
      481 ctgtttttcc atcattttag tttttcgcaa gccatgcgtg ccttttcgtt tttgcgatgg
      541 cgaacgaggg ctggaaaaat taacggtacg ccgcctaacg atagtaatag gccacgcaac
      601 tggcgtggac gacaacaata agtcgcccat tttttatgtt ttcaaaacct agcaaccccc
      661 accaaacttg tcatcgttcc cggattcaca aatgatataa aaagcgatta caattctaca
      721 ttctaaccag atttgagatt tcctctttct caattcctct tatattagat tataagaaca
      781 acaaattaaa ttacaaaaag acttataaag caacataatg tctgaattcg ctactagccg
      841 cgttgaaagt ggctctcaac aaacttctat ccactctact ccgatagtgc agaaattaga
      901 gacggatgaa tctcctattc aaaccaaatc tgaatacact aacgctgaac tcccagcaaa
      961 gccaatcgcc gcatattgga ctgttatctg tttatgtcta atgattgcat ttggtgggtt
     1021 tgtctttggt tgggatactg gtaccatctc tggttttgtt aatcaaaccg atttcaaaag
     1081 aagatttggt caaatgaaat ctgatggtac ctattatctt tcggacgtcc ggactggttt
     1141 gatcgttggt atcttcaata ttggttgtgc ctttggtggg ttaaccttag gacgtctggg
     1201 tgatatgtat ggacgtagaa ttggtttgat gtgcgtcgtt ctggtataca tcgttggtat
     1261 tgtgattcaa attgcttcta gtgacaaatg gtaccaatat ttcattggta gaattatctc
     1321 tggtatgggt gtcggtggta ttgctgtcct atctccaact ttgatttccg aaacagcacc
     1381 aaaacacatt agaggtacct gtgtttcttt ctatcagtta atgatcactc taggtatttt
     1441 cttaggttac tgtaccaact atggtactaa agactactcc aattcagttc aatggagagt
     1501 gcctttgggt ttgaactttg ccttcgctat tttcatgatc gctggtatgc taatggttcc
     1561 agaatctcca agattcttag tcgaaaaagg cagatacgaa gacgctaaac gttctttggc
     1621 aaaatctaac aaagtcacca ttgaagatcc aagtattgtt gctgaaatgg atacaattat
     1681 ggccaacgtt gaaactgaaa gattagccgg taacgcttct tggggtgagt tattctccaa
     1741 caaaggtgct attttacctc gtgtgattat gggtattatg attcaatcct tacaacaatt
     1801 aactggtaac aattacttct tctattatgg tactactatt ttcaacgccg tcggtatgaa
     1861 agattctttc caaacttcca tcgttttagg tatagtcaac ttcgcatcca ctttcgtggc
     1921 cttatacact gttgataaat ttggtcgtcg taagtgtcta ttgggtggtt ctgcttccat
     1981 ggccatttgt tttgttatct tctctactgt cggtgtcaca agcttatatc caaatggtaa
     2041 agatcaacca tcttccaagg ctgccggtaa cgtcatgatt gtctttacct gtttattcat
     2101 tttcttcttc gctattagtt gggccccaat tgcctacgtt attgttgccg aatcctatcc
     2161 tttgcgtgtc aaaaatcgtg ctatggctat tgctgttggt gccaactgga tttggggttt
     2221 cttgattggt ttcttcactc ccttcattac aagtgcaatt ggattttcat acgggtatgt
     2281 cttcatgggc tgtttggtat tttcattctt ctacgtgttt ttctttgtct gtgaaaccaa
     2341 gggcttaaca ttagaggaag ttaatgaaat gtatgttgaa ggtgtcaaac catggaaatc
     2401 tggtagctgg atctcaaaag aaaaaagagt ttccgaggaa taagagatta tacttaaact
     2461 agcactgatt tttttaaggc taatggctac taatacttta atagatgatc ttcatacttt
     2521 tttatttaac gatttttaat gatgttttta tttgtaccac tcatttatct agattttttt
     2581 aatactgatc aaatcttacg gactcgacgt taaaaagttc ctacatacgt ctggtacttg
     2641 aaacgctgct tcgaggtatt gacactataa gaatacgatc caaatactta caccgcatgt
     2701 aaaaatatgc cgacaatatg aatacttgtt gatgaatgat atttgatttt aatccggcaa
     2761 tttacctcct ttatataatc caataattgt tgataattag tggttaggtt gcagtactaa
     2821 taagaattaa gacaaatatt cttctactat ataaaaggtg caaacaaaac acacgccgat
     2881 cggccatact
//
LOCUS       RATGAH       1003 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Rat L-glutamine amidohydrolase mRNA, 3' end.
ACCESSION   J05499
KEYWORDS    L-glutamine amidohydrolase.
SOURCE      Rat (strain Sprague-Dawley) liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1003)
  AUTHORS   Smith,E.M. and Watford,M.
  TITLE     Molecular cloning of a cDNA for rat hepatic glutaminase: Sequence
            similarity to kidney-type glutaminase
  JOURNAL   J. Biol. Chem. 265, 10631-10636 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Waterford, 08-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      660     L-glutamine amidohydrolase (EC 3.5.1.2)
BASE COUNT      270 a    243 c    258 g    232 t
ORIGIN      
        1 ttccggatgt gtctgtcacc cccgttagac aagctgggga acagccacag gggcatcagc
       61 ttctgccaga agttggtgtc tctgtttaac ttccacaact acgacaacct gcggcactgt
      121 gctcggaagt tagacccacg gagggaaggg ggggaagttc ggaacaagac cgtggtgaac
      181 ctgttatttg ctgcatatag tggagatgtc tcagctcttc gaaggtttgc cttgtctgcc
      241 gtggatatgg agcagaagga ctatgattcc cgcacagccc tacatgtggc ggcagcggaa
      301 ggacacattg acgttgtcaa gtttctgatc gaggcttgca aagtgaatcc ttttgtcaag
      361 gacaggtggg gcaacattcc cctggatgat gccctgcagt tcaatcacct ggaggtggtc
      421 aaactgcttc aggattacca tgactcctac atgctgtctg agactcaagc tgaggtacag
      481 ctgagactct gtcaaaagag aactgagaga gcatgtgtga gcacaggcca gggcagcccg
      541 tgctcaagaa aaagcatgag cgggccacaa tttaacccaa ggccaccaaa aatactattg
      601 caagctgctt cagtgggatc aacacagcca tctggtgaca caggccagtg ttttctgtga
      661 gaatcaaaat gccccattcc ctcatcggac agcacagaga aaagcttcag tggacacctg
      721 agcagagcta gccacggaga cctcaaggta tagcttaagt gacatcctcc accagaaagt
      781 agcccaggct tttacccagg tccccatttc aacttccttg gagagcgtct agctacatgc
      841 atatgtatct gtcacagagc aagagaggtg ggtgagagcc caatcacctg gctttagaaa
      901 tctgcagaga tctgtccatc ttagccaaga catgctgcta ctgctgacag gagttttata
      961 gacaaagtat tttgtgttca aataaacttt aattaccgga att
//
LOCUS       CEACAEVA      264 bp ds-DNA             VRL       11-JUL-1990
DEFINITION  Caprine arthritis-encephalitis lentivirus tat protein gene,
            complete cds.
ACCESSION   M34092
KEYWORDS    tat protein.
SOURCE      Caprine arthritis-encephalitis lentivirus (strain Cork) DNA, from
            goat synovial membrane, clone pCol.9.
  ORGANISM  Caprine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
            Robinson,M.M. and Cheevers,W.P.
  TITLE     Genetic structure of the pol-env region of the Caprine arthritis-
            encephalitis lentivirus genome: Possible role in trans-activation
            of the viral long terminal repeat
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.P.Cheevers, 08-MAY-1990.
            
              Author address: W.P.Cheevers
              Washington State University
              Dept Veterinary Microbiology
              Pullman, WA 94164-7040
              email: b384@wsuvmsl.bitnet
            
FEATURES       from  to/span     description
    pept          1      264     tat protein
BASE COUNT       97 a     40 c     80 g     47 t
ORIGIN      
        1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta
       61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg
      121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg
      181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta
      241 tgtaacccgg ggtggggaac ataa
//
LOCUS       CEACAEVB      264 bp ds-DNA             VRL       11-JUL-1990
DEFINITION  Caprine arthritis-encephalitis lentivirus tat protein gene,
            complete cds.
ACCESSION   M34093
KEYWORDS    tat protein.
SOURCE      Caprine arthritis-encephalitis lentivirus (strain G63) DNA, from
            goat synovial membrane, clone pC63-49.
  ORGANISM  Caprine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
            Robinson,M.M. and Cheevers,W.P.
  TITLE     Genetic structure of the pol-env region of the Caprine arthritis-
            encephalitis lentivirus genome: Possible role in trans-activation
            of the viral long terminal repeat
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.P.Cheevers, 08-MAY-1990.
            
              Author address: W.P.Cheevers
              Washington State University
              Dept Veterinary Microbiology
              Pullman, WA 94164-7040
              email: b384@wsuvmsl.bitnet
            
FEATURES       from  to/span     description
    pept          1      264     tat protein
BASE COUNT       99 a     43 c     75 g     47 t
ORIGIN      
        1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata
       61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg
      121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg
      181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta
      241 tgtaacccgg ggtgggaaac ataa
//
LOCUS       FLAPR834HA    540 bp ss-RNA             VRL       11-JUL-1990
DEFINITION  Influenza A/PR/8/34, hemagglutinin (seg 4) gene. partial cds.
ACCESSION   M34335
KEYWORDS    glycoprotein; hemagglutinin.
SOURCE      Influenza A/PR/8/34 RNA, passed in bovine MBDK cells, originally
            from human.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 540)
  AUTHORS   Bressoud,A., Whitcomb,J., Pourzand,C., Haller,O. and Cerutti,P.
  TITLE     Rapid detection of influenza virus H1 by the polymerase chain
            reaction
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 425-430 (1990)
  STANDARD  simple staff_review
COMMENT     Sequence reported is + strand.
FEATURES       from  to/span     description
    pept    <     1  >   540     hemagglutinin (AA at 2)
BASE COUNT      179 a    121 c    123 g    117 t
ORIGIN      
        1 cctactggtc ctgttatgtg cacttgcagc tgcagatgca gacacaatat gtataggcta
       61 ccatgcgaac aattcaaccg acactgttga cacagtactc gagaagaatg tgacagtgac
      121 acactctgtt aacctgctcg aagacagcca caacggaaaa ctatgtagat taaaaggaat
      181 agccccacta caattgggga aatgtaacat cgccggatgg ctcttgggaa acccagaatg
      241 cgacccactg cttccagtga gatcatggtc ctacattgta gaaacaccaa actctgagaa
      301 tggaatatgt tatccaggag atttcatcga ctatgaggag ctgagggagc aattgagctc
      361 agtgtcatca ttcgaaagat tcgaaatatt tcccaaagaa agctcatggc ccaaccacaa
      421 cacaaacgga gtaacggcag catgctccca tgaggggaaa agcagttttt acagaaattt
      481 gctatggctg acggagaagg agggctcata cccaaagctg aaaaattctt atgtgaacaa
//
LOCUS       HUMGPPSBAA    355 bp ds-DNA             PRI       11-JUL-1990
DEFINITION  Human pregnancy-specific beta-1 glycoprotein C-D gene, intron C1.
ACCESSION   M34422
KEYWORDS    beta-1 glycoprotein.
SOURCE      Human placenta, clone PS-beta-G C.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 355)
  AUTHORS   Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
  TITLE     structure, evolution and chromosomal localization of the human
            pregnancy-specific beta-1 glycoprotein gene family
  JOURNAL   Genomics 6, 579-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    IVS           1      355     pregnancy-specific beta-1 glycoprotein intron
                                 C1
BASE COUNT      110 a     64 c     62 g    119 t
ORIGIN      Chromosome 19.
        1 gtaagtggat cccagcatcg ttggcaatag ggttttaggt ggagtctatc tggcattcag
       61 agaagagtca ggaaaacaat tgtattccca gcctgtgtcc catgggcaca agcaaatccc
      121 aaattctcct cctgaaccct ccaaatttgt ctaagaactt cgaaaacttt aacaaacagg
      181 ctgatatctt cataatattc ccagcctaga ccaagcagga agaacattga tttcattgaa
      241 ataattgata ataatgaaga taatgttttt atgattttta tttgaaaatt tgctgattct
      301 ttaaatggtt tgttttctac attgatggaa tttttctctt ttaatctatc tacag
//
LOCUS       HUMGPPSBD    1418 bp ss-mRNA            PRI       11-JUL-1990
DEFINITION  Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION   M34421
KEYWORDS    beta-1 glycoprotein.
SOURCE      Human placenta, cDNA to mRNA, clone PS-beta-G B.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1418)
  AUTHORS   Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
  TITLE     Structure, evolution and chromosomal localization of the human
            pregnancy-specific beta-1 glycoprotein gene family
  JOURNAL   Genomics 6, 579-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         79     1359     pregnancy-specific beta-1 glycoprotein
                                 precursor /hgml_locus_uid="LG0073P"
                                 /nomgen="PSBG1" /map="19q13.1-q13.2"
    sigp         79      180     pregnancy-specific beta-1 glycoprotein signal
                                 peptide
    matp        181     1356     pregnancy-specific beta-1 glycoprotein
    mRNA          1     1418     pregnancy-specific beta-1 glycoprotein mRNA
BASE COUNT      408 a    398 c    296 g    316 t
ORIGIN      Chromosome 19.
        1 cagctgacag ccgtgctcag acagcttctg gatcctaggc tcatctccac agaggagaac
       61 acgcaggcag cagagaccat ggggcccctc ccagcccctt cctgcacaca gcgcatcacc
      121 tggaaggggc tcctgctcac agcatcactt ttaaacttct ggaacccgcc caccactgcc
      181 gaagtcacga ttgaagccca gccacccaaa gtttctgagg ggaaggatgt tcttctactt
      241 gtccacaatt tgccccagaa tcttcctggc tacttctggt acaaagggga aatgacggac
      301 ctctaccatt acattatatc gtatatagtt gatggtaaaa taattatata tgggcctgca
      361 tacagtggaa gagaaacagt atattccaac gcatccctgc tgatccagaa tgtcacccgg
      421 aaggatgcag gaacctacac cttacacatc ataaagcgag gtgatgagac tagagaagaa
      481 attcgacatt tcaccttcac cttatacttg gagactccca agccctacat ctccagcagc
      541 aacttaaacc ccagggaggc catggaggct gtgcgcttaa tctgtgatcc tgagactctg
      601 gacgcaagct acctatggtg gatgaatggt cagagcctcc ctgtgactca caggttgcag
      661 ctgtccaaaa ccaacaggac cctctatcta tttggtgtca caaagtatat tgcaggaccc
      721 tatgaatgtg aaatacggaa cccagtgagt gccagtcgca gtgacccagt caccctgaat
      781 ctcctcccga agctgcccat cccctacatc accatcaaca acttaaaccc cagggagaat
      841 aaggatgtct tagccttcac ctgtgaacct aagagtgaga actacaccta catttggtgg
      901 ctaaacggtc agagcctccc cgtcagtccc ggggtaaagc gacccattga aaacaggata
      961 ctcattctac ccagtgtcac gagaaatgaa acaggaccct atcaatgtga aatacgggac
     1021 cgatatggtg gcctccgcag taacccagtc atcctaaatg tcctctatgg tccagacctc
     1081 cccagaattt acccttcatt cacctattac cgttcaggag aaaacctcga cttgtcctgc
     1141 ttcacggaat ctaacccacc ggcagagtat ttttggacaa ttaatgggaa gtttcagcaa
     1201 tcaggacaaa agctctttat cccccaaatt actagaaatc atagcgggct ctatgcttgc
     1261 tctgttcata actcagccac tggcaaggaa atctccaaat ccatgacagt caaagtctct
     1321 ggtccctgcc atggagacct gacagagtct cagtcatgac tgcaacaact gagacactga
     1381 gaaaaagaac aggctgatac cttcatgaaa ttcaagac
//
LOCUS       HUMGPPSBE    1856 bp ss-mRNA            PRI       11-JUL-1990
DEFINITION  Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION   M34420
KEYWORDS    beta-1 glycoprotein.
SOURCE      Human placenta, cDNA to mRNA, clone PS-beta-G A.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1856)
  AUTHORS   Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
  TITLE     Structure, evolution and chromosomal localization of the human
            pregnancy-specific beta-1 glycoprotein gene family
  JOURNAL   Genomics 6, 579-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         82     1368     pregnancy-specific beta-1 glycoprotein
                                 precursor /hgml_locus_uid="LG0073P"
                                 /nomgen="PSBG1" /map="19q13.1-q13.2"
    sigp         82      183     pregnancy-specific beta-1 glycoprotein signal
                                 peptide
    matp        184     1365     pregnancy-specific beta-1 glycoprotein
    mRNA          1     1856     pregnancy-specific beta-1 glycoprotein mRNA
BASE COUNT      544 a    476 c    364 g    472 t
ORIGIN      Chromosome 19.
        1 gcacagctga gagccatgct caggaagttt ctggatccta ggctcagctc cacagaggag
       61 aacacgcagg cagcagagac catggggccc ctctcagccc ctccctgcac acagcgcatc
      121 acctggaagg ggctcctgct cacagcatca cttttaaact tctggaaccc gcctaccact
      181 gcccaagtca cgattgaagc cgagccaacc aaagtttcca aggggaagga cgttcttcta
      241 cttgtccaca atttgcccca gaatcttgct ggctacatct ggtacaaagg gcaaatgaag
      301 gacctctacc attacattac atcatacgta gtagatggtc aaataattat atatgggcct
      361 gcatacagtg gacgagaaac agtatattcc aatgcatccc tgctgatcca gaatgtcacc
      421 cgggaggacg caggatccta caccttacac atcgtaaagc gaggtgatgg gactagagga
      481 gaaactggac atttcacctt caccttatac ctggagactc ccaagccctc catctccagc
      541 agcaacttat accccaggga ggacatggag gctgtgagct taacctgtga tcctgagact
      601 ccggacgcaa gctacctgtg gtggatgaat ggtcagagcc tccctatgac tcacagcttg
      661 cagttgtcca aaaacaaaag gaccctcttt ctatttggtg tcacaaagta cactgcagga
      721 ccctatgaat gtgaaatacg gaacccagtg agtgccagcc gcagtgaccc agtcaccctg
      781 aatctcctcc cgaagctgcc caagccctac atcaccatca acaacttaaa ccccagggag
      841 aataaggatg tcttagcctt cacctgtgaa cctaagagtg agaactacac ctacatttgg
      901 tggctaaatg gtcagagcct cccggtcagt cccagggtaa agcgacccat tgaaaacagg
      961 atcctcattc tacccagtgt cacgagaaat gaaacaggac cctatcaatg tgaaatacag
     1021 gaccgatatg gtggcatccg cagttaccca gtcaccctga atgtcctcta tggtccagac
     1081 ctccccagaa tttacccttc attcacctat taccattcag gagaaaacct ctacttgtcc
     1141 tgcttcgcgg actctaaccc accagcagaa tattcttgga caattaatgg gaagtttcag
     1201 ctatcaggac aaaagctctt tatcccccag attactacaa agcatagcgg gctctatgct
     1261 tgctctgttc gtaactcagc cactggcatg gaaagctcca aatccatgac agtcaaagtc
     1321 tctgctcctt caggaacagg acatcttcct ggccttaatc cattatagca gccgtgatgt
     1381 catttctgta tttcaggaag actggcagac agttgctttc attcttcctc aaagtattta
     1441 ccatcagcta cagtccaaaa ttgctttttg ttcaaggaga tttatgaaaa gactctgaca
     1501 aggactcttg aatacaagtt cctgataact tcaagatcat accactggac taagaacttt
     1561 caaaatttta atgaacaggc tgatacttca tgaaattcaa gacaaagaaa aaaacccaat
     1621 tttattggac taaatagtca aaacaatgtt ttcataattt tctatttgaa aatgtgctga
     1681 ttctttgaat gttttattct ccagatttat gcactttttt tcttcagcaa ttggtaaagt
     1741 atacttttgt aaacaaaaat tgaaacattt gcttttgctc cctaagtgcc ccagaattgg
     1801 gaaactattc aggagtattc atatgtttat ggtaataaag ttatctgcac aagttc
//
LOCUS       HUMGPPSBF    2004 bp ss-mRNA            PRI       11-JUL-1990
DEFINITION  Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION   M23575
KEYWORDS    beta-1 glycoprotein.
SOURCE      Human placenta, cDNA to mRNA, clone pSP1-i.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2004)
  AUTHORS   Rooney,B.C., Horne,C.H.W. and Hardman,N.
  TITLE     Molecular cloning of a cDNA for human pregnancy-specific
            beta-1-glycoprotein: Homology with human carcinoembryonic antigen
            and related proteins
  JOURNAL   Gene 71, 439-449 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        232     1518     pregnancy-specific beta-1 glycoprotein
                                 precursor /hgml_locus_uid="LG0073P"
                                 /nomgen="PSBG1" /map="19q13.1-q13.2"
    sigp        232      333     pregnancy-specific beta-1 glycoprotein signal
                                 peptide
    matp        334     1515     pregnancy-specific beta-1 glycoprotein
BASE COUNT      579 a    510 c    406 g    509 t
ORIGIN      
        1 gggcgggaca actggtctga gtactatggc tgattttcgc tgtctggcat tgagaagcca
       61 cacgcccctt ttgcttagga ggcctctctg ctggaggatg acgatggcat ggtttatcta
      121 aggccactga caagtcatca atataggaca gcacagctga gagccatgct caggaagttt
      181 ctggatccta ggctcagctc cacagaggag aacacgcagg cagcagagac catggggccc
      241 ctctcagccc ctccctgcac acagcgcatc acctggaagg ggctcctgct cacagcatca
      301 cttttaaact tctggaaccc gcctaccact gcccaagtca cgattgaagc cgagccaacc
      361 aaagtttcca aggggaagga cgttcttcta cttgtccaca atttgcccca gaatcttgct
      421 ggctacatct ggtacaaagg gcaaatgaag gacctctacc attacattac atcatacgta
      481 gtagatggtc aaataattat atatgggcct gcatacagtg gacgagaaac agtatattcc
      541 aatgcatccc tgctgatcca gaatgtcacc cgggaggacg caggatccta caccttacac
      601 atcgtaaagc gaggtgatgg gactagagga gaaactggac atttcacctt caccttatac
      661 ctggagactc ccaagccctc catctccagc agcaacttat accccaggga ggacatggag
      721 gctgtgagct taacctgtga tcctgagact ccggacgcaa gctacctgtg gtggatgaat
      781 ggtcagagcc tccctatgac tcacagcttg cagttgtcca aaaacaaaag gaccctcttt
      841 ctatttggtg tcacaaagta cactgcagga ccctatgaat gtgaaatacg gaacccagtg
      901 agtgccagcc gcagtgaccc agtcaccctg aatctcctcc cgaagctgcc caagccctac
      961 atcaccatca acaacttaaa ccccagggag aataaggatg tcttagcctt cacctgtgaa
     1021 cctaagagtg agaactacac ctacatttgg tggctaaatg gtcagagcct cccggtcagt
     1081 cccagggtaa agcgacccat tgaaaacagg atcctcattc tacccagtgt cacgagaaat
     1141 gaaacaggac cctatcaatg tgaaatacag gaccgatatg gtggcatccg cagttaccca
     1201 gtcaccctga atgtcctcta tggtccagac ctccccagaa tttacccttc attcacctat
     1261 taccattcag gagaaaacct ctacttgtcc tgcttcgcgg actctaaccc accagcagaa
     1321 tattcttgga caattaatgg gaagtttcag ctatcaggac aaaagctctt tatcccccag
     1381 attactacaa agcatagcgg gctctatgct tgctctgttc gtaactcagc cactggcatg
     1441 gaaagctcca aatccatgac agtcaaagtc tctgctcctt caggaacagg acatcttcct
     1501 ggccttaatc cattatagca gccgtgatgt catttctgta tttcaggaag actggcagac
     1561 agttgctttc attcttcctc aaagtattta ccatcagcta cagtccaaaa ttgctttttg
     1621 ttcaaggaga tttatgaaaa gactctgaca aggactcttg aatacaagtt cctgataact
     1681 tcaagatcat acatggacta agaactttca aaattttaat gaacaggctg atacttcatg
     1741 aaattcaaga caaagaaaaa aacccaattt tattggacta aatagtcaaa acaatgtttt
     1801 cataattttc tatttgaaaa tgtgctgatt ctttgaatgt tttattctcc agatttatgc
     1861 actttttttc ttcagcaatt ggtaaagtat acttttgtaa acaaaaattg aaacatttgc
     1921 ttttgctccc taagtgcccc agaattggga aactattcag gagtattcat atgtttatgg
     1981 taataaagtt atctgcacaa accc
//
LOCUS       HUMLEUELA     920 bp ss-mRNA            PRI       11-JUL-1990
DEFINITION  Human elastase/medullasin mRNA, complete cds.
ACCESSION   M34379
KEYWORDS    elastase; medullasin.
SOURCE      Human leukemic cell line ML3, cDNA to mRNA, clone pSRHLE.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 920)
  AUTHORS   Okano,K., Aoki,Y., Shimizu,H. and Naruto,M.
  TITLE     Functional expression of human leukocyte elastase (HLE)/medullasin
            in eukaryotic cells
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 1326-1332 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept         39      842     elastase/medullasin precursor (EC 3.4.21.37)
    sigp         39      119     elastase/medullasin signal peptide
    matp        126      839     elastase/medullasin
BASE COUNT      141 a    329 c    287 g    163 t
ORIGIN      
        1 gcacggaggg gcagagaccc cggagcccca gccccaccat gaccctcggc cgccgactcg
       61 cgtgtctttt cctcgcctgt gtcctgccgg ccttgctgct ggggggcacc gcgctggcct
      121 cggagattgt ggggggccgg cgagcgcggc cccacgcgtg gcccttcatg gtgtccctgc
      181 agctgcgcgg aggccacttc tgcggcgcca ccctgattgc gcccaacttc gtcatgtcgg
      241 ccgcgcactg cgtggcgaat gtaaacgtcc gcgcggtgcg ggtggtcctg ggagcccata
      301 acctctcgcg gcgggagccc acccggcagg tgttcgccgt gcagcgcatc ttcgaaaacg
      361 gctacgaccc cgtaaacttg ctcaacgaca tcgtgattct ccagctcaac gggtcggcca
      421 ccatcaacgc caacgtgcag gtggcccagc tgccggctca gggacgccgc ctgggcaacg
      481 gggtgcagtg cctggccatg ggctggggcc ttctgggcag gaaccgtggg atcgccagcg
      541 tcctgcagga gctcaacgtg acggtggtga cgtccctctg ccgtcgcagc aacgtctgca
      601 ctctcgtgag gggccggcag gccggcgtct gtttcgggga ctccggcagc cccttggtct
      661 gcaacgggct aatccacgga attgcctcct tcgtccgggg aggctgcgcc tcagggctct
      721 accccgatgc ctttgccccg gtggcacagt ttgtaaactg gatcgactct atcatccaac
      781 gctccgagga caacccctgt ccccaccccc gggacccgga cccggccagc aggacccact
      841 gagaagggct gcccgggtca cctcagctgc ccacacccac actctccagc atctggcaca
      901 ataaacattc tctgttttgt
//
LOCUS       MSGIS6110    1360 bp ds-DNA             BCT       11-JUL-1990
DEFINITION  M.tuberculosis-50 complex IS6110 insertion sequence-like element.
ACCESSION   M29899
KEYWORDS    insertion sequence.
SOURCE      M.tuberculosis (strain H37RV) DNA (cosmid library pHC79), clone
            I21.
  ORGANISM  Mycobacterium tuberculosis
            Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae.
REFERENCE   1  (bases 1 to 1360)
  AUTHORS   Thierry,D., Cave,M.D., Eisenach,K.D., Crawford,J.T., Bates,J.H.,
            Gicquel,B. and Guesdon,J.L.
  TITLE     IS6110 an IS-like element of Mycobacterium tuberculosis-50 complex
  JOURNAL   Nucleic Acids Res. 18, 188-188 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Thierry 15-NOV-1989.
BASE COUNT      269 a    439 c    432 g    220 t
ORIGIN      
        1 cgatgaaccg ccccggcatg tccggagact ccagttcttg gaaaggatgg ggtcatgtca
       61 ggtggttcat cgaggaggta cccgccggag ctgcgtgagc gggcggtgcg gatggtcgca
      121 gagatccgcg gtcagcacga ttcggagtgg gcagcgatca gtgaggtcgc ccgtctactt
      181 ggtgttggct gcgcggagac ggtgcgtaag tgggtgcgcc aggcgcaggt cgatgccggc
      241 gcacggcccg ggaccacgac cgaagaatcc gctgagctga agcgcttagc ggcgggacaa
      301 cgccgaattg cgaagggcga acgcgatttt aaagaccgcg tcggctttct tcgcggccga
      361 gctcgaccgg ccagcacgct aattaacggt tcatcgccga tcatcagggc caccgcgagg
      421 gccccgatgg tttgcggtgg ggtgtcgagt cgatctgcac acagctgacc gagctgggtg
      481 tgccgatcgc cccatcgacc tactacgacc acatcaaccg ggagcccagc cgccgcgagc
      541 tgcgcgatgg cgaactcaag gagcacatca gccgcgtcca cgccgccaac tacggtgttt
      601 acggtgcccg caaagtgtgg ctaaccctga accgtgaggg catcgaggtg gccagatgca
      661 ccgtcgaacg gctgatgacc aaactcggcc tgtccgggac cacccgcggc aaagcccgca
      721 ggaccacgat cgctgatccg gccacagccc gtcccgccga tctcgtccag cgccgcttcg
      781 gaccaccagc acctaaccgg ctgtgggtag cagacctcac ctatgtgtcg acctgggcag
      841 ggttcgccta cgtggccttt gtcaccgacg cctacgtcgc aggatcctgg gctggcgggt
      901 cgcttccacg atggccacct ccatggtcct cgacgcgatc gagcaagcca tctggacccg
      961 ccaacaagaa ggcgtactcg acctgaaaga cgttatccac catacggata ggggatctca
     1021 gtacacatcg atccggttca gcgagcggct cgccgaggca ggcatccaac cgtcggtcgg
     1081 agcggtcgga agctcctatg acaatgcact agccgagacg atcaacggcc tatacaagac
     1141 cgagctgatc aaacccggca agccctggcg gtccatcgag gatgtcgagt tggccaccgc
     1201 gcgctgggtc gactggttca accatcgccg cctctaccag tactgcggcg acgtcccgcc
     1261 ggtcgaactc gaggctgcct actacgctca acgccagaga ccagccgccg gctgaggtct
     1321 cagatcagag agtctccgga ctcaccgggg cggttcacga
//
LOCUS       MUSIGHAAT     348 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Mouse Ig J558 family active H-chain mRNA V-J3 region
            from hybridoma CE5, partial cds.
ACCESSION   M34119
KEYWORDS    diversity exon; immunoglobulin heavy chain; processed gene.
SOURCE      Mouse (Balb/c) hybridoma CE5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
  TITLE     Identical D region sequences expressed by murine monoclonal
            antibodies specific for a human tumor-associated antigen
  JOURNAL   J. Immunol. 144, 1965-1968 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   348     Ig H-chain V-J3 region (AA at 1)
    recomb      294      295     J-region end/D-region start
    recomb      303      304     D-region end/J-region start
BASE COUNT       89 a     82 c     97 g     80 t
ORIGIN      Chromosome 12.
        1 caggttcagc tgcagcagtc tggagctgaa ctgatgaagc ctggggcctc agtgaagata
       61 tcctgcaagg ctactggcta cacattcagt aagtactgga tagagtgggt aaagcagagg
      121 cctggacatg gccttgagtg gattggagag attttacctg gaagtggtag tactaaccat
      181 gatgagaagt tcaagggcaa ggccacattc actgcagata catcctccaa cacagcctac
      241 atgcaactca gcagcctgac atctgaggac tctgccgtct attactgtgc aagagacggt
      301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS       MUSIGKCSR     321 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Mouse Ig active kappa-chain mRNA V-region from hybridoma GA733,
            partial cds.
ACCESSION   M34120
KEYWORDS    immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE      Mouse (Balb/c) hybridoma GA733, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
  TITLE     Identical D region sequences expressed by murine monoclonal
            antibodies specific for a human tumor-associated antigen
  JOURNAL   J. Immunol. 144, 1965-1968 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   321     Ig kappa-chain (AA at 1)
BASE COUNT       87 a     83 c     74 g     77 t
ORIGIN      Chromosome 6.
        1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagtgtcagc
       61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
      121 ggacaatctc ctaaactact gatttactcg gcatccgacc ggtacactgg agtccctgat
      181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
      241 gaagacctgg cagtttatta ctgtcaccaa cattatatta ctcctcggac gttcggtgga
      301 ggcaccaaac tggaaatcaa a
//
LOCUS       MUSIGKCSS     321 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Mouse Ig active kappa-chain mRNA V-region from hybridoma
            C017-1A, partial cds.
ACCESSION   M34121
KEYWORDS    immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE      Mouse (Balb/c) hybridoma C017-1A, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
  TITLE     Identical D region sequences expressed by murine monoclonal
            antibodies specific for a human tumor-associated antigen
  JOURNAL   J. Immunol. 144, 1965-1968 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   321     Ig kappa-chain (AA at 1)
BASE COUNT       89 a     77 c     78 g     77 t
ORIGIN      Chromosome 6.
        1 aacattgtaa tgacccaatc tcccaaatcc atgtccatgt cagtaggaga gagggtcacc
       61 ttgacctgca aggccagtga gaatgtggtt acttatgttt cctggtatca acagaaacca
      121 gagcaatctc ctaaactctt gatttacggg gcctccaacc ggtacactgg ggtccccgat
      181 cgcttcacag gtagtggatc tgcaacagat ttcactctga ccattagtag tgtgcaagct
      241 gaagaccttg cagattatca ctgtggacag ggttacagct atccgtacac gttcggaggg
      301 gggaccaagc tggaaataaa a
//
LOCUS       MUSIGKCST     318 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Mouse Ig active kappa-chain mRNA V-region from hybridoma
            CE5, partial cds.
ACCESSION   M34122
KEYWORDS    immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE      Mouse (Balb/c) hybridoma CE5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
  TITLE     Identical D region sequences expressed by murine monoclonal
            antibodies specific for a human tumor-associated antigen
  JOURNAL   J. Immunol. 144, 1965-1968 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   318     Ig kappa-chain (AA at 1)
BASE COUNT       84 a     78 c     80 g     76 t
ORIGIN      Chromosome 6.
        1 gacattgtga tgacccagtc tcagaaattc atgtccacat cagtaggaga cagggtcggc
       61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaatca
      121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgag
      181 cgcttcgctg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
      241 gaagacctgg cagtttatta ctgtcatcaa cattatagta ctcggacgtt cggtggaggc
      301 accaagctgg aaatcaaa
//
LOCUS       PSERRSAA     1517 bp ss-rRNA            RNA       11-JUL-1990
DEFINITION  P.aeruginosa 16S ribosomal RNA.
ACCESSION   M34133
KEYWORDS    16S ribosomal RNA; ribosomal RNA; small subunit ribosomal RNA.
SOURCE      P.aeruginosa (strain 25330) ribosomal RNA.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1517)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept of Microbiology
              407 S. Goodwin Avenue
              Urbana, IL 61801
              email: carl@ninja.life.uiuc.edu
            
FEATURES       from  to/span     description
    rRNA          1     1517     16S rRNA
BASE COUNT      379 a    334 c    470 g    301 t     33 others
ORIGIN      
        1 ntactgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa
       61 gtcgagcgga tgaagggagc ttgctcctgg attcagcggc ggacgggtga gtaatgccta
      121 ggaatctgcc tgatagtggg ggataacgtc cggaaacggg cgctaatacc gcatacgtcc
      181 tgagggagaa agggggggat cctcggacct cacgctatcn gatgagccta ggtcggatta
      241 gctagttggt ggggtaaagg cctaccaagg cgacgatccg taactggtct gagaggacga
      301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaata
      361 ttggacaatg ggcgaaagcc ngatccagcc atgccgcgtg tgtgaagaag gtcttcggat
      421 tgtaaagcac tttaagttgg gaggaagggc agtaagttaa taccttgctg ttttgacgtt
      481 accaacagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac gaagggtgcg
      541 agcgttaatc ggaattactg ggcgtaaagc gcgcgtaggt ggttcagcaa gttggatgtg
      601 aaatccccgg gctcaacctg ggaactgcat ccnaaactac tgagctagag tacggtagag
      661 ggtggtggaa tttcctgtgt agcggtgaaa tgcgtagata taggaaggaa caccagtggc
      721 gaaggcgacc acctggactg atactgacac tgaggtgcga aagcgtgggg agcaaacagg
      781 attagatacc ctggtagtcc acgccgtaaa cgatgtcgac tagccgttgg gatccttgag
      841 atcttagtgg cgcagctaac gcgataagtc gaccgcctgg ggagtacggc cgcaaggtta
      901 aaactcaaat gaattgacgg gggcnngcac aagcggtgga gcatgtggtt taattcgaag
      961 caacgcgaag aaccttacct ggccttgaca tgctgagaac tttccagaga tggattggtg
     1021 ccttcgggaa ctcagacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt
     1081 gggttaagtc ccgtaacgag cgcaaccctt gtccttagtt accagcacct cgggtgggca
     1141 ctctaaggag actgccggtg acaaaccgga ggaaggtggg gatgacgtca agtcatcatg
     1201 gcccttacgg cnagggctac acacgtgcta caatggtcgg tacaaagggt tgcgaagccg
     1261 cgaggtggag ctaatcccat aaaaccgatc gtagtccgga tcgcagtctg caactcgact
     1321 gcgtgaagtc ggaatcgcta gtaatcgtga atcagaatgt cacggtgaat acgttcccgg
     1381 gccttgtaca caccgcccgt cacaccatgg gagtgggttg ctccagaagt agctagtcta
     1441 accgcaaggg ggacggttac cacggagtga ttcatgnnnn nnnnnnnnnn gtaacaagnn
     1501 nnnnnnnnnn gaacctg
//
LOCUS       RATNESTIN    5946 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Rat nestin mRNA, complete cds.
ACCESSION   M34384
KEYWORDS    intermediate filament protein; nestin.
SOURCE      Rat (strain E15) embryo central nervous system, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5946)
  AUTHORS   Lendahl,U., Zimmerman,L.B. and McKay,R.D.G.
  TITLE     CNS stem cells express a new class of intermediate filament protein
  JOURNAL   Cell 60, 585-595 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        128     5545     nestin (128 could be 161)
    mRNA          1     5495     nestin mRNA
BASE COUNT     1600 a   1401 c   1871 g   1074 t
ORIGIN      
        1 tgctggagtt ctccgcttcc gctgggtcac tgtcgccgct acttcttttc aacccctaaa
       61 agctccacgg gccactccct tctctagtgc tccacgtccg cttgccctcg ggggccagac
      121 cagcgacatg gagggttgcg tcggggaaga atcttttcag atgtgggagc tcaatcgacg
      181 cctggaggcc tacctgaccc gggtcaagac gctagaagag caaaaccagc tgctcagcgc
      241 cgagcttggg ggactccggg cgcagtccgg agacacctcc tggagagccc gagccgatga
      301 cgagctggca tccctgcgga tcctcgtcga tcagcgctgg cgggagaagc tcgaggctga
      361 ggtgcagcgc gacaaccttg cggaagagct ggagagcgtg gcgggccggt gccagcaggt
      421 gcggcttgct cgggagcgga ccgtccagga ggccgcctgc agccggcgcg cactcgaggc
      481 ggagaagaat gcgcggggct ggctgagcac ccaggcggcc gagctggagc gggagttaga
      541 ggctctgcga gccgcgcacg aggaggagcg cgcacacctg aacgcccagg ccgcctgtgc
      601 gcctcgccgg ccccccgcac cgccccaccg gatccccggt ccggcccccg aagtcgagga
      661 tctggccagg cgactaggcg aagtgtggcg cggggcggtg cgtgactacc aggagcgcgt
      721 ggctcacatg gaaagctcgc tgggtcaggc acgcgagcgg ctgagccaag ccgtgcgggg
      781 cgctcgggag tgtcgcttag aggtgcaaca gctgcaggct gatcgcgaca gcctccagga
      841 gcgcagagaa gcgctggaac agagattgga aggccgctgg caggaccggc tgcaggccac
      901 tgataagttc cagctggctg tggaagccct ggagcaggag aagcaaggtc tacagagtca
      961 gatcgctcag atcctggaag gtgggcagca actggcacac ctcaagatgt cccttagtct
     1021 ggaggtggct acatacagga ctctgctgga ggctgagaac tctcggttgc agacacctgg
     1081 acgaggttcc caggcttctc ttggctttct ggaccccaag ctgaagccga atttccttgg
     1141 gataccagag gaccagtacc tgggatctgt gctccctgcc ctcagcccca catccttccc
     1201 ttcccccttg cctaataccc ttgagactcc tgtgacagcc ttcctgaaga ctcaggagtt
     1261 ccttcaggcc agaaccccca ccttggccag cactcccatc ccacctatat ctgaggctcc
     1321 ctgtcctcca aatgcagagg tgagagccca ggaggtccct ctttctctgc tccagacaca
     1381 ggctccagag cccctttggc tgaaggccac agtgcctagt tcttctgcta tcctcccaga
     1441 actagaggaa cctgggggca agcagcaggg tcacttccct gatgatctga cctccttagc
     1501 cacaaacctc aaccctcacc accctacttt agaggctaaa gatggagaat ccagtgagtc
     1561 tagagtttct agcatattcc aggaagatga ggggcaaatc tgggaactgg tagagaaaga
     1621 agcagatata gaggtaaaag tagaaaacag ctcagcccag aaaacacaag aaagtggtct
     1681 ggacacagaa gaaacccagg attcccaggg acctttgcag aaggaaacac tgaaggctct
     1741 aggagaggag ccactgatgt ctctgaaaat ccagaactat gagacagcag ggaaagagaa
     1801 ttgcaattct tctacagaag gccacctggg aacactagaa ggcccagaaa aagaaaagca
     1861 aataccacta aagtctttag aagaaaagaa tgtagagtca gagaaaactc tagaaaatgg
     1921 ggttcctgta ctatctgagc ttttaggaaa agaagacaca agaacagagg atcaagaatt
     1981 aatgtctcct aaaggtacac taaagagatt ttcatctcta ggaaaggaaa gtcaagaagt
     2041 agtgaggcct tcaaaagagg ggaacctaga atcatggaca gcttttaaag aggagagcca
     2101 acacccactg ggatttccag gagctgagga ccagatgctt gagagactgg tagagaaaga
     2161 ggatcagagc ttcccaaggt ctccagagga agaggaccag gaggcatgta gacctctgca
     2221 gaaagagaat caggaaccac tagggtatga agaagcagag ggccagatac ttgagagact
     2281 gatagaaaaa gagagtcagg agtccctgag gtctccagaa gaagaggacc aggaggcagg
     2341 tagatctctg cagaaagaga atcaggagcc actagggtat gaagaagcag aggaccagat
     2401 gcttgagaga ctgatagaaa aagagagtca ggagtccctg aagtctccag aagaaaacca
     2461 gaggattggg aagcctctag aaagagagaa tcagaaatct ctgaggtatc ttgaagaaaa
     2521 ccaggagact tttgtaccac tagaaagcag gaaccagagg ccactgagat ctctagaagt
     2581 agaagaggag gagcagagaa ttgtgaaacc tctagaaaaa gtgagtcagg attccctcgg
     2641 atctctagca gaagagaatg tgcagccact gaggtatctg gaagaagatg actgcataaa
     2701 taagagcctt ctagaagaca agactcacaa gtccttgggg tctcttgaag atagaaatgg
     2761 ggatagcatt attataccac aagaaagtga gacccaggtt tcattgaggc ctccagaaga
     2821 ggaggaccag aggattgtga accatctaga aaaagaaagt caggagttct cgaggtcttc
     2881 agaagaagaa gagcaggtga tggagagatc tctagaagga gagaaccatg aatcactgag
     2941 ttctgtagaa aaagaggacc agatggttga gagccaacta gagaaagaga gtcaggactc
     3001 agggaagtct cttgaagatg agagccagga gacctttgga cctctggaaa aagagaatgc
     3061 agagtccctg agatctctag caggacagga ccaagaggaa cagaagcttg aacaagagac
     3121 ccaacaaaca ctgagggctg tagggaatga gcagatggca gtgagcccac cagaaaaggt
     3181 ggatccagag ttaccgaagc ctcttggaaa tgaccaggaa atagctagat ctcttggaaa
     3241 agagaatcaa gagtcactag tgtcactgaa agaaaaaggt atagagacag tgaagtcttt
     3301 agaaacagag atcatagaac cactggagac tgcagaagag gacctggaaa gaaggaagtc
     3361 tatagatact caggagccat tgtggtctac tgaagtggct agagagacag tagaacctcc
     3421 agaagatgag cccccaggat cgctagggtc tgtggatgag aaccgagaga cactgacatc
     3481 ccttgaaaag gagagtcaag aactgagctc tctgggcaag tggaacgtag agaccagggt
     3541 agaggacagt cagcagtgcc tgcaagtaga agagggtctg caggaggaac agcaccaaga
     3601 gtctctgaga gaggtgaagc aggagctgcc tagctctgga aatcaacagc ggtgggagga
     3661 tgtggtggag ggcaaagcag tgggtcagga agcacctctg gcaaccacag gagtgggaac
     3721 tgaggataag gcagagttgc atctgagggg gcaaggtgga gaggaagaag ctgcagcaga
     3781 gggagagctg ttgcaggata ttgtggggga ggcctggagt ctggggagct ctgagcccaa
     3841 ggagcagagg gtccctgctg aggccctcga caacctggaa ggaggggcct tagaggtccc
     3901 agttgctcag tcaatgccag aggtgacaga gcgagatgag gatagagccc aagcaggtga
     3961 acaagactcc atagaggtga cccttgggtt agaggctgcc agaactggac tggaactcga
     4021 gcaggaagtg gtagggctag aggacccaag gcattttgcc agggaggagg ccattccccc
     4081 atccctgggg gaggaaagtg tgaaggcaaa gatagctcag ggcttggaag ggcctggaaa
     4141 ggaaccaaaa gaggcaggtg ctctggactc ggggatcctt gaattgccca agactagcag
     4201 cgaggctctg gaatgccagg gccatgaaga gtctgagtcc atggagggct gggaagaaga
     4261 ggaggcctca ctggagactt cagatcatga gggcagtgat gcccctcagc ccaggccccc
     4321 agaaacagaa gaagatgagg gtgcacaggc agcactgaca gcccctggtc ccaagctctt
     4381 ggaaccctgt tcacccatcc caatcctgac agatgcccat gagctgcagc cccaggctga
     4441 ggggatccag gaggctggct ggcagccaga agctgggtct gaagcactag aaagggtaga
     4501 aaatgagcca gagtttggtc ttggggagat cccggagggc ctccaggatt gggaagaggg
     4561 cagagaagaa agcgaggcag atgatctagg ggaaactctc cctgactcta ctcccctggg
     4621 cctctacctg aggtcccctg cttctccaaa gtgggatctg gctggagaac agaggctttc
     4681 ccctcaaggg gatgccggga aggaagactg gggtcctgct gtccccgctg cccagggcct
     4741 cagtggtcca ccggaagagg aggaggagca aggccatggc tctgacctat catctgagga
     4801 gtttgaggac ctagggactg aggcctctct tcttccaggg gttcccaagg aggtggcaga
     4861 tcacgtgggc caagtgcccc cggtactgca gcctgcatgc tgggatcagg gtggggaatc
     4921 tgatgggttt gctgatgagg aagaaagtgg ggaggaggga gaggaagaag atgctgatga
     4981 ggaaggagca gagtcaggag ctcagtggtg ggggtcaggg gcctctggtg gaggctgcaa
     5041 ggtccaggat attgcccaaa gaggagaccc ggtacaggag tctgtgggtg tcagtggtct
     5101 ctgggatgat ggcttgagag gtgctgcagc taatgttcct gccctagaga tggtatctca
     5161 ggacagtgct gagccttctg ggtcagagga gtctgagtct gcttccttgg agggggagga
     5221 aggtcaagtg actgaccatt tagatgctcc ccaggaggtg accagcatgg tcccgggggt
     5281 aggagatgcc tttgacattg gtggccagag ccccaacttg gactcagaac aagtgaatgg
     5341 gaaaatggag aatggactag aacaggctga ggggcaggtg gtcctggatg gggacgagga
     5401 tcaagaactc ctattacagg gacaggaggt gggtgctcta aaggttcctt tggtagcatc
     5461 tcctgtgcat ctaggcccaa gccagcccct gaagttcact ctgagtgggg tagatgggga
     5521 ttcctggtcc tcaggggaag actagaaact gcccctctgg ctctgaggat gtactggtgg
     5581 ggatgtccct ccctgctctg ggtgaccact cttagctttg ataacttgac ccatggtatt
     5641 tgtcctggag agttgtggct gggctgagca agggaggtga gatcctcctg aaggctcagg
     5701 agttccaggc ctatagttct accccctctt tcttctgtgg ctcacctgct ggaagaggcc
     5761 tgggcccaga gctttcccac aaggctgttc tggccacagc ttgctagcct tgcctaccac
     5821 ctgcacaagg tctggtctgg tgtatgacca ggggagctga gggcagcatt tatctgaccc
     5881 ttcatctcag cctgctgaga gcttgttcct ctcttcctcc ctgaataaag ccgtatccct
     5941 acctac
//
LOCUS       SYNCMPA      1885 bp ds-DNA             BCT       11-JUL-1990
DEFINITION  Synechococcus sp. 42-kD membrane protein (cmpA) gene, complete cds.
ACCESSION   M32999
KEYWORDS    membrane protein.
SOURCE      Synechococcus sp. (strain PCC 7942) DNA.
  ORGANISM  Synechococcus sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Chroococcales.
REFERENCE   1  (bases 1 to 1885)
  AUTHORS   Omata,T., Carlson,T.J., Ogawa,T. and Pierce,J.
  TITLE     Sequencing and modification of the gene encoding the 42 kilodalton
            protein in the cytoplasmic membrane of Synechococcus PCC 7942
  JOURNAL   Plant Physiol. 93, 305-311 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Pierce, 20-MAR-1990.
FEATURES       from  to/span     description
    pept        141     1493     49-kD membrane protein (cmpA)
BASE COUNT      452 a    508 c    487 g    438 t
ORIGIN      
        1 ggttatcagc cttatcggtc tggaataacc agttggccta aagtcatgca gacagagcgt
       61 ttctgcgcct ctcgtgaagc aattcgcaca acttgtccat ctttagaggc atctcctgtt
      121 gtgggatgta ggggagacgt atgaacgaat ttcaaccagt caatcgtcgt cagtttctgt
      181 tcacgctcgg agcaaccgct gctagcgcta ttttgctgaa gggttgcggt aatcctcctt
      241 ccagtagcgg cggcgggact tctagtacaa ctcagccaac tgctgcaggg gcgagtgatc
      301 tggaagtcaa gacaatcaaa ttgggctaca tccccatctt tgaagcggct ccactgatca
      361 ttggccgcga aaaaggcttt tttgccaaat atggcttgga tgttgaagtc tcgaaacaag
      421 ccagctgggc agctgctcgc gataacgtca ttctcggttc tgctggtggc ggcatcgatg
      481 gcggtcagtg gcaaatgccg atgcctgcct tgctaacgga aggtgcgatc agcaacggtc
      541 aaaaagttcc catgtatgtc ttggcttgct tgagcaccca aggcaatggc atcgctgttt
      601 ccaatcagct caaggcccaa aatctgggct tgaagctagc gcccaaccgc gactttatcc
      661 tcaactaccc gcaaactagc ggccggaagt tcaaagcatc ctacaccttc ccgaacgcca
      721 accaagactt ctggattcgc tattggtttg cagctggcgg tatcgatcct gataaagaca
      781 ttgaactctt gaccgttccc agcgcagaaa ctctacaaaa tatgcgcaat ggcacgatcg
      841 attgcttcag taccggcgat ccctggccgt cgcggattgc caaagatgac atcggctatc
      901 aagctgcgct gacaggtcaa atgtggcctt accaccccga ggaattcttg gcgctgcgag
      961 cagactgggt agacaaacat ccgaaagcta cgctcgcctt gctgatgggc ttgatggaag
     1021 cgcagcaatg gtgcgatcag aaagcaaatc gggcagagat ggccaagatc ctctccggtc
     1081 gcaacttctt taacgtgccg gtttcgatcc tgcagccgat tctggaaggt caaatcaaag
     1141 ttggagcaga cggaaaagat ctcaacaact ttgatgccgg cccgctcttc tggaagagtc
     1201 cgcgcggcag tgtctcctat ccctacaaag ggctcaccct ctggttcttg gtggagtcga
     1261 tccgctgggg cttcaacaag caagtgctac ctgacattgc agccgcccag aaactcaacg
     1321 atcgcgtgac tcgtgaagac ctctggcaag aggcagccaa gaaattaggg gtgcccgctg
     1381 cggatatccc aaccggatcg actcgcggta ccgagacctt ctttgatggc atcacctaca
     1441 acccagacag tccgcaagct tatctccaaa gcttgaagat taaacgggca taagtagggg
     1501 cttcaatcat caaccttagt tcagtcacta tcaggagata gacagaccat ggttactgca
     1561 cgggaaacaa gacgaaacgg aagtcgtcct tctggcttaa aaaaatggcg tcagaaactc
     1621 gatggcatct tgctaccgct agcaggaatt ttgggtttcc tcatcatttg gcagatcttt
     1681 tctagcacgg gcaacccgct tgcccggccc tgctcagtct cttcacagaa gagagaacac
     1741 gcgagttgct gccctatccc ttcttggatc gcggcgggct tgataaaggt ctgttctggc
     1801 agacgtatcg cttagttctg acgcgggtgg cccagggctt ttcgatccgc agccatcatc
     1861 ggcatcggca tttccgttgg aattc
//
LOCUS       ECOOXYR      1264 bp ds-DNA             BCT       11-JUL-1990
DEFINITION  E.coli oxyR regulatory protein gene, complete cds.
ACCESSION   J04553
KEYWORDS    oxyR gene; regulatory protein.
SOURCE      E.coli (strain K12, CSH50) DNA, clones pAQ17 and pMomR1200.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1264)
  AUTHORS   Christman,M.F., Storz,G. and Ames,B.N.
  TITLE     Oxyr, a positive regulator of hydrogen peroxide-inducible genes in
            Escherichia coli and Salmonella typhimurium, is homologous to a
            family of bacterial regulatory proteins
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 3484-3488 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            G.Storz, 23-MAY-1989.
FEATURES       from  to/span     description
    pept        203     1120     oxyR protein
    mRNA        170  >  1264     oxyR mRNA
    signal      133      138     -35 region
    signal      157      162     -10 region
    binding     191      195     ribosome binding site
BASE COUNT      289 a    308 c    368 g    299 t
ORIGIN      89.6 minutes on K12 map.
        1 ggatcctgga gatccgcaaa agttcacgtt ggctttagtt attcgagttg agaaactctc
       61 gaaacgggca gtgacttcaa gggttaaaag aggtgccgct ccgtttctgt gagcaattat
      121 cagtcagaat gcttgatagg gataatcgtt cattgctatt ctacctatcg ccatgaacta
      181 tcgtggcgat ggaggatgga taatgaatat tcgtgatctt gagtacctgg tggcattggc
      241 tgaacaccgc cattttcggc gtgcggcaga ttcctgccac gttagccagc cgacgcttag
      301 cgggcaaatt cgtaagctgg aagatgagct gggcgtgatg ttgctggagc ggaccagccg
      361 taaagtgttg ttcacccagg cgggaatgct gctggtggat caggcgcgta ccgtgctgcg
      421 tgaggtgaaa gtccttaaag agatggcaag ccagcagggc gagacgatgt ccggaccgct
      481 gcacattggt ttgattccca cagttggacc gtacctgcta ccgcatatta tccctatgct
      541 gcaccagacc tttccaaagc tggaaatgta tctgcatgaa gcacagaccc accagttact
      601 ggcgcaactg gacagcggca aactcgattg cgtgatcctc gcgctggtga aagagagcga
      661 acgattcatt gaagtgccgt tgtttgatga gccaatgttg ctggctatct atgaagatca
      721 cccgtgggcg aaccgcgaat gcgtaccgat ggccgatctg gcaggggaaa aactgctgat
      781 gctggaagat ggtcactgtt tgcgcgatca ggcaatgggt ttctgttttg aagccggggc
      841 ggatgaagat acacacttcc gcgcgaccag cctggaaact ctgcgcaaca tggtggcggc
      901 aggtagcggg atcactttac tgccagcgct ggctgtgccg ccggagcgca aacgcgatgg
      961 ggttgtttat ctgccgtgca ttaagccgga accacgccgc actattggcc tggtttatcg
     1021 tcctggctca ccgctgcgca gccgctatga gcagctggca gaggccatcc gcgcaagaat
     1081 ggatggccat ttcgataaag ttttaaaaca ggcggtttaa accgtttaac gcagctaccc
     1141 gatagcttcc gccatcgtcg ggtagttaaa ggtggtgttg acgaagtact caatagtgtt
     1201 gccgccacct ttctgttcca taatcgcctg accgatatga ataatttcgg cgagcgcgct
     1261 cgcc
//
LOCUS       CLLRRE       1860 bp ss-rRNA            RNA       11-JUL-1990
DEFINITION  C.sapidus 18S rRNA, 3' end.
ACCESSION   M34360
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      C.sapidus rRNA.
  ORGANISM  Callinectes sapidus
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; 
            Brachyrhyncha; Portunoidea; Portunidae.
REFERENCE   1  (bases 1 to 1860)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-113 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1860     18S ribosoma RNA
BASE COUNT      359 a    375 c    421 g    367 t    338 others
ORIGIN      
        1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct
       61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt
      121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta
      181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc
      241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc
      301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc
      361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac
      421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac
      481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat
      541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct
      601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg
      661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc
      721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata
      841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc
      901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc
      961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg
     1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt
     1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt
     1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn
     1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac
     1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg
     1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn
     1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct
     1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc
     1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga
     1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag
     1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc
     1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc
     1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn
     1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
//
LOCUS       HUMNCSRC      138 bp ds-DNA             PRI       11-JUL-1990
DEFINITION  Human membrane-associated tyrosine protein kinase (C-SRC) gene,
            exons 3, 4, NI, and NII, partial cds.
ACCESSION   M34469
KEYWORDS    membrane-associated tryosine protein kinase.
SOURCE      Human adult brain DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 138)
  AUTHORS   Pyper,J.M. and Bolen,J.B.
  TITLE     Identification of a novel neuronal C-SRC exon expressed in human
            brain
  JOURNAL   Mol. Cell. Biol. 10, 2035-2040 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       29     membrane-associated tyrosine protein kinase
                                 (C-SRC), exon 3 (AA at 1)
                 30       47     membrane-associated tyrosine protein kinase,
                                 exon NI
                 48       80     membrane-associated tyrosine protein kinase,
                                 exon NII
                 81  >   136     membrane-associated tyrosine protein kinase,
                                 exon 4
    variant     117      117     a or g
    variant     135      135     c or t
BASE COUNT       36 a     40 c     40 g     22 t
ORIGIN      
        1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc
       61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga
      121 cagacaggct acatcccc
//
LOCUS       MHVAPEPA     1000 bp ss-RNA             VRL       11-JUL-1990
DEFINITION  Murine coronavirus peplomer (S) protein gene.
ACCESSION   M34435
KEYWORDS    peplomer protein.
SOURCE      Murine (strain JHM-DL) RNA.
  ORGANISM  Murine coronavirus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 1000)
  AUTHORS   Banner,L.R., Keck,J.G. and Lai,M.M.C.
  TITLE     A clustering of RNA recombination sites adjacent to a hypervariable
            region of the peplomer gene of Murine coronavirus
  JOURNAL   Virology 175, 548-555 (1990)
  STANDARD  simple staff_review
BASE COUNT      258 a    192 c    227 g    317 t      6 others
ORIGIN      
        1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag
       61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa
      121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct
      181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa
      241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta
      301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt
      361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca
      421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac
      481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg
      541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga
      601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg
      661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt
      721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt
      781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa
      841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa
      901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag
      961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa
//
LOCUS       MHVAPEPB      843 bp ss-RNA             VRL       11-JUL-1990
DEFINITION  Murine coronavirus peplomer (S) protein gene.
ACCESSION   M34436
KEYWORDS    peplomer protein.
SOURCE      Murine (strain A59) RNA.
  ORGANISM  Murine coronavirus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 843)
  AUTHORS   Banner,L.R., Keck,J.G. and Lai,M.M.C.
  TITLE     A clustering of RNA recombination sites adjacent to a hypervariable
            region of the peplomer gene of Murine coronavirus
  JOURNAL   Virology 175, 548-555 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site        400      401     deletion site
BASE COUNT      227 a    153 c    183 g    280 t
ORIGIN      
        1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag
       61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa
      121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct
      181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa
      241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta
      301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt
      361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt
      421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc
      481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc
      541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat
      601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt
      661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc
      721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact
      781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc
      841 gaa
//
LOCUS       MHVSP        3780 bp ss-mRNA            VRL       11-JUL-1990
DEFINITION  Mouse hepatitis virus surface protein S mRNA, complete cds.
ACCESSION   X04797 M34437
KEYWORDS    glycoprotein; membrane glycoprotein; surface glycoprotein;
            surface projection glycoprotein.
SOURCE      Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones
            pJMS1010, pJS112, and pJS92.
  ORGANISM  Murine hepatitis virus A59
            Unclassified.
REFERENCE   1  (bases 1 to 3780)
  AUTHORS   Schmidt,I., Skinner,M. and Siddell,S.
  TITLE     Nucleotide Sequence of the Gene Encoding the Surface Projection
            Glycoprotein of Coronavirus MHV-JHM
  JOURNAL   J. Gen. Virol. 68, 47-56 (1987)
  STANDARD  simple automatic
COMMENT        EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE         24     30       put. region of homology preceeding
                                            mRNA 5' initiation  codons
            
               SITE        121    129       pot. N-glycosylation site
               SITE        208    216       pot. N-glycosylation site
               SITE        430    438       pot. N-glycosylation site
               SITE        604    612       pot. N-glycosylation site
               SITE       1099   1107       pot. N-glycosylation site
               SITE       1333   1341       pot. N-glycosylation site
               SITE       1636   1644       pot. N-glycosylation site
               SITE       1732   1740       pot. N-glycosylation site
               SITE       1756   1764       pot. N-glycosylation site
               SITE       1825   1833       pot. N-glycosylation site
               SITE       1900   1914       put. proteolytic cleavage site
               SITE       1972   1980       pot. N-glycosylation site
               SITE       2023   2031       pot. N-glycosylation site
               SITE       2293   2301       pot. N-glycosylation site
               SITE       2440   2448       pot. N-glycosylation site
               SITE       3139   3147       pot. N-glycosylation site
               SITE       3301   3309       pot. N-glycosylation site
               SITE       3331   3339       pot. N-glycosylation site
               SITE       3388   3396       pot. N-glycosylation site
               SITE       3436   3444       pot. N-glycosylation site
               SITE       3499   3507       pot. N-glycosylation site
               SITE       3553   3654       pot. transmembrane domain
               SITE       3622   3675       cysteine-rich region
               SITE       3655   3660       charge cluster
               SITE       3715   3723       pot. N-glycosylation site
FEATURES       from  to/span     description
    pept         31     3738     surface protein S precursor
    sigp         31       60     surface protein S signal peptide
    matp         61     3736     surface protein S
BASE COUNT     1030 a    718 c    800 g   1232 t
ORIGIN      
        1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc
       61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat
      121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact
      181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct
      241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg
      301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac
      361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt
      421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct
      481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat
      541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag
      601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt
      661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg
      721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct
      781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt
      841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt
      901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt
      961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa
     1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact
     1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg
     1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt
     1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt
     1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt
     1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat
     1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat
     1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt
     1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct
     1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc
     1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct
     1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt
     1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat
     1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc
     1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt
     1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc
     1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat
     2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt
     2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat
     2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct
     2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct
     2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac
     2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag
     2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa
     2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg
     2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct
     2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta
     2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac
     2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag
     2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg
     2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca
     2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc
     2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc
     3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata
     3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca
     3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg
     3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat
     3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg
     3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt
     3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc
     3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag
     3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa
     3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt
     3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa
     3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt
     3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt
//
LOCUS       MUSIGLAZ      713 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds.
ACCESSION   M34598 M29013 J03562
KEYWORDS    constant region; immunoglobulin; immunoglobulin light chain;
            joining exon; lambda-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone
            Y31.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 713)
  AUTHORS   Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A.
  TITLE     Structure of a third murine immunoglobulin lambda light chain
            variable region that is expressed in laboratory mice
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 2 to 450)
  AUTHORS   Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
  TITLE     Mouse V-lambda-x gene sequence generates no junctonal diversity and
            is conserved in mammalian species
  JOURNAL   J. Immunol. 144, 2816-2820 (1990)
  STANDARD  simple staff_review
REFERENCE   3  (bases 266 to 429)
  AUTHORS   Sanchez,P. and Cazenave,P.-A.
  TITLE     A new variable region in mouse immunoglobulin lambda light chains
  JOURNAL   J. Exp. Med. 166, 265-270 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         12  >   713     Ig lambda chain precursor V-x,J-2,C-2 region
    sigp         12       68     Ig lambda chain signal peptide
    matp         69  >   713     Ig lambda chain
    recomb      379      380     V-region end/J2-region start
    recomb      414      415     J2-region end/C2-region start
BASE COUNT      175 a    190 c    159 g    189 t
ORIGIN      
        1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt
       61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag
      121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc
      181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca
      241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta
      301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa
      361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc
      421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca
      481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga
      541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca
      601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca
      661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct
//
LOCUS       MUSIGLVD      681 bp ds-DNA             ROD       11-JUL-1990
DEFINITION  Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds.
ACCESSION   M34597
KEYWORDS    constant region; germline; immunoglobulin light chain;
            joining exon; lambda-immunoglobulin; variable region.
SOURCE      Mouse (strain Balb/c AnPt) liver DNA, clone 30X2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 681)
  AUTHORS   Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
  TITLE     Mouse V-lambda-x gene sequence generates no junctonal diversity and
            is conserved in mammalian species
  JOURNAL   J. Immunol. 144, 2816-2820 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         59      104     Ig lambda-chain precursor Vx-J2-C2 region,
                                 exon 1
                209      531     Ig lambda-chain precursor Vx-J2-C2 region,
                                 exon 2
    sigp         59      104     Ig lambda-chain signal peptide
                209      219     Ig lambda-chain signal peptide
    matp        220      528     Ig lambda-chain
    IVS         105      208     Ig lambda-chain Vx-J2-C2 region intron A
    recomb      530      531     Vx-region end/J2-region start
    recomb      565      566     J2-region end/C2-region start
BASE COUNT      179 a    156 c    136 g    210 t
ORIGIN      Chromosome 16.
        1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat
       61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt
      121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc
      181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca
      241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag
      301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg
      361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct
      421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa
      481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa
      541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta
      601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa
      661 tcctcttgga cttcgtaggg c
//
LOCUS       PAERRE       1877 bp ss-rRNA            RNA       11-JUL-1990
DEFINITION  P.kadiakensis 18S rRNA, 3' end.
ACCESSION   M34359
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.kadiakensis rRNA.
  ORGANISM  Palaemonetes kadiakensis
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; 
            Palaemonoidea; Palaemonidae.
REFERENCE   1  (bases 1 to 1877)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-113 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1877     18S ribosoma RNA
BASE COUNT      339 a    321 c    399 g    313 t    505 others
ORIGIN      
        1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt
       61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt
      121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac
      181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg
      241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca
      301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct
      361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna
      421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga
      481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac
      541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn
      601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac
      841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna
      901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta
      961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta
     1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa
     1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt
     1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn
     1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc
     1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt
     1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn
     1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn
     1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg
     1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna
     1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg
     1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca
     1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact
     1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag
     1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn
     1861 nnnnnnnnnn nnnnnnn
//
LOCUS       PBESVBRA      584 bp ds-DNA             INV       11-JUL-1990
DEFINITION  P.berghei telomeric repeat region subfragment alpha DNA.
ACCESSION   M34601
KEYWORDS    .
SOURCE      P.berghei DNA, clone pTel.1.
  ORGANISM  Plasmodium berghei
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 584)
  AUTHORS   Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
  TITLE     Organization of subtelomeric repeats in Plasmodium berghei
  JOURNAL   Mol. Cell. Biol. 10, 2423-2427 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt         461      541     27 bp repeats
BASE COUNT      205 a     41 c     85 g    251 t      2 others
ORIGIN      
        1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac
       61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc
      121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata
      181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat
      241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat
      301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa
      361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt
      421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg
      481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg
      541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga
//
LOCUS       PBESVBRB      593 bp ds-DNA             INV       11-JUL-1990
DEFINITION  P.berghei telomeric repeat region subfragment a DNA.
ACCESSION   M34602
KEYWORDS    .
SOURCE      P.berghei DNA, clone pTel.1.
  ORGANISM  Plasmodium berghei
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 593)
  AUTHORS   Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
  TITLE     Organization of subtelomeric repeats in Plasmodium berghei
  JOURNAL   Mol. Cell. Biol. 10, 2423-2427 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt         484      510     27 bp repeat motif
BASE COUNT      209 a     40 c     97 g    247 t
ORIGIN      
        1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca
       61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac
      121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa
      181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt
      241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat
      301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat
      361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat
      421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg
      481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt
      541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg
//
LOCUS       PEURRE       1902 bp ss-rRNA            RNA       11-JUL-1990
DEFINITION  P.aztecus 18S rRNA, 3' end.
ACCESSION   M34362
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.aztecus rRNA.
  ORGANISM  Penaeus aztecus
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea; 
            Penaeidae.
REFERENCE   1  (bases 1 to 1902)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1902     18S ribosoma RNA
BASE COUNT      340 a    369 c    427 g    332 t    434 others
ORIGIN      
        1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt
       61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt
      121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg
      181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc
      241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg
      301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc
      361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn
      421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag
      481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga
      541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga
      601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg
      661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc
      721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn
      781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn
      841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg
      901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg
      961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna
     1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn
     1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg
     1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa
     1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn
     1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg
     1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac
     1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta
     1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct
     1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg
     1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact
     1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg
     1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt
     1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn
//
LOCUS       POCRRE       1874 bp ss-rRNA            RNA       11-JUL-1990
DEFINITION  P.ascensionis 18S rRNA, 3' end.
ACCESSION   M34358
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.ascensionis rRNA.
  ORGANISM  Procaris ascensionis
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; 
            Procaridoidea; Procarididae.
REFERENCE   1  (bases 1 to 1874)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1874     18S ribosoma RNA
BASE COUNT      298 a    291 c    331 g    266 t    688 others
ORIGIN      
        1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct
       61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt
      121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac
      181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt
      241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac
      301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn
      361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc
      481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa
      541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn
      601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt
      781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg
      841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat
      901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat
      961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga
     1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac
     1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt
     1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn
     1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca
     1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt
     1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta
     1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct
     1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag
     1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca
     1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn
     1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg
     1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg
     1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1861 nnnnnnnnnn nnnn
//
LOCUS       PRARRE       1869 bp ss-rRNA            RNA       11-JUL-1990
DEFINITION  P.leonensis 18S rRNA, 3' end.
ACCESSION   M34363
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.leonensis rRNA.
  ORGANISM  Procambarus leonensis
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea; 
            Astacoidea; Cambaridae.
REFERENCE   1  (bases 1 to 1869)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1869     18S ribosoma RNA
BASE COUNT      409 a    412 c    459 g    419 t    170 others
ORIGIN      
        1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt
       61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt
      121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat
      181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt
      241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc
      301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca
      361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg
      421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt
      481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact
      541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt
      601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag
      661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc
      721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn
      781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa
      841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg
      901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac
      961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg
     1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg
     1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg
     1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn
     1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa
     1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta
     1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact
     1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg
     1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg
     1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac
     1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa
     1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn
     1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt
     1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa
     1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn
     1861 nnnnnnnnn
//
LOCUS       PVIC1RPTA     711 bp ds-DNA             INV       11-JUL-1990
DEFINITION  P.vivax circumsporozoite protein gene, partial cds.
ACCESSION   M28745 M25758
KEYWORDS    circumsporozoite protein.
SOURCE      P.vivax sporozoite (isolate VK247) sporozoite DNA.
  ORGANISM  Plasmodium vivax
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 711)
  AUTHORS   Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
            Waters,A.P. and Prasittisuk,C.
  TITLE     Circumsporozoite protein heterogeneity in the human malaria
            parasite Plasmodium vivax
  JOURNAL   Science 245, 973-976 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Lanar, 25-JUN-1989.
FEATURES       from  to/span     description
    pept    <     1  >   711     circumsporozoite protein (AA at 1)
    site          1       15     region 1
    site        697      711     region 2
    site        502      597     post repeat variable region
    site        598      696     post repeat constant region
BASE COUNT      261 a    151 c    226 g     73 t
ORIGIN      
        1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat
       61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca
      121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca
      181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga
      241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc
      301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa
      361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga
      421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat
      481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca
      541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga
      601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta
      661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c
//
LOCUS       PVIC1RPTB     657 bp ds-DNA             INV       11-JUL-1990
DEFINITION  P.vivax circumsporozoite protein gene, partial cds.
ACCESSION   M28746 M25759
KEYWORDS    circumsporozoite protein.
SOURCE      P.vivax sporozoite (isolate VK210) DNA.
  ORGANISM  Plasmodium vivax
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 657)
  AUTHORS   Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
            Waters,A.P. and Prasittisuk,C.
  TITLE     Circumsporozoite protein heterogeneity in the human malaria
            parasite Plasmodium vivax
  JOURNAL   Science 245, 973-976 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Lanar, 25-JUN-1989.
FEATURES       from  to/span     description
    pept    <     1  >   657     circumsporozoite protein (AA at 1)
    site          1       15     region 1
    site        643      657     region 2
    rpt          16      477     27 bp repeats
    site        478      544     3' post repeat variable region
    site        545      642     3' post repeat constant region
BASE COUNT      229 a    135 c    224 g     69 t
ORIGIN      
        1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat
       61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa
      121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca
      181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat
      241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca
      301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga
      361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca
      421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga
      481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca
      541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa
      601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc
//
LOCUS       SHV2A         554 bp ss-RNA             VRL       11-JUL-1990
DEFINITION  Simian hepatitis A virus segment 2A-encoded protein mRNA,
            partial cds.
ACCESSION   M34085
KEYWORDS    .
SOURCE      Simian hepatitis A virus (strain PA21), cDNA to viral RNA.
  ORGANISM  Simian hepatitis A virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Picornaviridae.
REFERENCE   1  (bases 1 to 554)
  AUTHORS   Brown,E.A., Jansen,R.W. and Lemon,S.M.
  TITLE     Characterization of a Simian hepatitis A virus (HAV): Antigenic and
            genetic comparison with human HAV
  JOURNAL   Unpublished (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by E.A.Brown, 04-MAY-1990.
            
              Author address: E.A.Brown
              547 Burnett-Womack
              CB# 7030
              Chapel Hill, NC 27599
            
FEATURES       from  to/span     description
    pept    <     1  >   554     segment 2A-encoded protein (AA at 1)
BASE COUNT      184 a     71 c    138 g    161 t
ORIGIN      Segment 2A; map position 3108-3662.
        1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg
       61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag
      121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa
      181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg
      241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg
      301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg
      361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat
      421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt
      481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg
      541 tcaggaatac agga
//
LOCUS       SHVVP1CP     2373 bp ss-RNA             VRL       11-JUL-1990
DEFINITION  Simian hepatitis A virus capsid protein VP1 mRNA, partial cds.
ACCESSION   M34084
KEYWORDS    capsid protein VP1.
SOURCE      Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed
            in cwll line BS-C-1.
  ORGANISM  Simian hepatitis A virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Picornaviridae.
REFERENCE   1  (bases 1 to 2373)
  AUTHORS   Brown,E.A., Jansen,R.W. and Lemon,S.M.
  TITLE     Characterization of a Simian hepatitis A virus (HAV): Antigenic and
            genetic comparison with human HAV
  JOURNAL   J. Virol. 63, 4932-4937 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by E.A.Brown, 04-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1  >  2373     capsid protein VP1 (AA at 1)
BASE COUNT      661 a    447 c    491 g    774 t
ORIGIN      Segment P1; map position 735-3107.
        1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg
       61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact
      121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca
      181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga
      241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa
      301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg
      361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca
      421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct
      481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc
      541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat
      601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct
      661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca
      721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt
      781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat
      841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca
      901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag
      961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa
     1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg
     1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt
     1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc
     1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg
     1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag
     1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat
     1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct
     1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct
     1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag
     1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta
     1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca
     1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg
     1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt
     1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg
     1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg
     1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca
     1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg
     2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat
     2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc
     2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg
     2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg
     2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat
     2341 gatcctcgaa ctgaagagga tcgtaaattt gaa
//
LOCUS       STNRRE       1885 bp ss-rRNA            RNA       11-JUL-1990
DEFINITION  S.hispidus 18S rRNA, 3' end.
ACCESSION   M34361
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      S.hispidus rRNA.
  ORGANISM  Stenopus hispidus
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea; 
            Stenopodidae.
REFERENCE   1  (bases 1 to 1885)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1885     18S ribosoma RNA
BASE COUNT      327 a    336 c    430 g    368 t    424 others
ORIGIN      
        1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt
       61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact
      121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn
      181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta
      241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag
      301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt
      361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga
      421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn
      481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag
      541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag
      601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn
      661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc
      721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg
      841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg
      901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg
      961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt
     1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt
     1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg
     1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat
     1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg
     1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt
     1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga
     1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng
     1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt
     1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc
     1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg
     1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg
     1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn
     1861 nnnnnnnnnn nnnnnnnnnn nnnnn
//
LOCUS       MSQMUD76A     124 bp ds-DNA             BAD       11-JUL-1990
DEFINITION  A.dirus DNA probe pMU-D76.
ACCESSION   M34656
KEYWORDS    .
SOURCE      A.dirus (Strain D) wild-caught female DNA, clone pMU-D76.
  ORGANISM  Anopheles dirus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 124)
  AUTHORS   Panyim,O., Yasothornsrikul,S., Tungpradubkul,S., Baimai,V.,
            Rosenberg,R., Andre,R.G. and Green,C.A.
  TITLE     Identification of isomorphic malaria vectors using a DNA probe
  JOURNAL   Am. J. Trop. Med. Hyg. 38, 47-49 (1988)
  STANDARD  simple staff_review
BASE COUNT       32 a     33 c     34 g     25 t
ORIGIN      
        1 gatctgcact cggcgtgaat ttggttacca tcgaatgtgc ggaaaaagtt ttaccccgtg
       61 cgcagtgcgg aacacgccag acttgttaca cacggaaacg gaccacgaac gtgttacgcg
      121 cacg
//
LOCUS       ACCCITSYN    1895 bp ds-DNA             BCT       11-JUL-1990
DEFINITION  A.anitratum citrate synthase gene, complete cds.
ACCESSION   M33037
KEYWORDS    citrate synthase.
SOURCE      A.anitratum DNA, clone pLJD1.
  ORGANISM  Acinetobacter anitratum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1895)
  AUTHORS   Donald,L.J. and Duckworth,H.W.
  TITLE     Expression and base sequence of the citrate synthase gene of
            Acinetobacter anitratum
  JOURNAL   Biochem. Cell Biol. 65, 930-938 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        264     1538     citrate synthase precursor
    matp        267     1535     citrate synthase
    signal      120      125     -35 signal
    signal      141      146     -10 signal
    signal     1560     1585     pot. transcription termination signal
BASE COUNT      534 a    406 c    357 g    598 t
ORIGIN      
        1 gtactcaacg cttaattttt ttctgcacgt tcttcttgaa ttgacttatg ataccatccc
       61 gatgcagtga ttttactgac tttttttgct cgggtcttga tgactaactc tctgtgggaa
      121 cgtcattttt tatccataag tataattgac aaaatttcag tactcactaa tcttatagca
      181 aattttgaca ccgtctgatt cgcacatgag aaaattagga tttcgagtca gataatcatt
      241 caccaggaca ggagatctat tgaatgtctg aagcaactgg caaaaaagcc gtattacatc
      301 ttgatggcaa agaaattgaa ttaccaattt acagtggcac attaggtccc gatgtaatcg
      361 acgttaaaga tgtattggcc tcaggtcact ttacttttga tcctggtttt atggcgacag
      421 cttcatgcga gtctaaaatc acatttatcg atggtgacaa aggtatttta ttacaccgcg
      481 gttacccgat tgaccagtta gcgactcaag cagactacct tgaaacttgt tatttattat
      541 taaatggcga gttaccaact gctgaacaaa aagttgagtt cgatgcgaaa gttcgtgctc
      601 atactatggt tcatgatcaa gttagccgtt tcttcaatgg tttccgtcgt gatgctcacc
      661 ctatggcaat catggttggt gtagtaggcg cattatctgc tttctatcac aacaaccttg
      721 acattgaaga catcaaccac cgcgaaatta ctgcgattcg tttgattgct aaaattccaa
      781 cgcttgctgc ttggagctac aaatatactg taggtcagcc attcatctat ccacgtaatg
      841 acttaaatta cgcggaaaac ttcttacaca tgatgtttgc aactcctgca gaccgtgact
      901 acaaagtaaa ccctgttctt gctcgtgcaa tggatcgtat ctttacgctt cacgctgacc
      961 acgaacaaaa cgcgtctact tctacagttc gtcttgctgg ttctactggt gcgaatccat
     1021 atgcgtgtat ctctgctggt atctctgctc tttggggtcc tgcacacggt ggtgcgaacg
     1081 aagcagttct taaaatgctt gatgaaatcg gtagcgttga aaatgttgct gagttcatgg
     1141 aaaaagttaa acgcaaagaa gttaaactta tgggcttcgg tcaccgcgtt tacaaaaact
     1201 tcgatccacg cgctaaagtg atgaagcaaa cttgtgacga agttcttgaa gcattaggta
     1261 tcaatgatcc tcaattagcg cttgctatgg aacttgaacg tattgcattg aacgacccgt
     1321 actttgttga acgtaaactt taccctaacg tagacttcta ctctggtatc atccttaaag
     1381 cgattggtat cccaacagaa atgtttaccg ttatcttcgc tcttgcacgt acagttggct
     1441 ggatcagtca ctggttagaa atgcacagcg gtccttacaa aattggtcgt cctcgtcagc
     1501 tttacactgg tgaagtgcaa cgtgacatca agcgttaata ttcgaaagaa tattaatgta
     1561 aaaagctgcc taatggcagt tttttttata aataagtttt aaaagttatt cttcttcaaa
     1621 catatttaat aagtgatgac taataccatc agctcttagc caagccaact cataacttgc
     1681 ttcggccaaa gctaaaatac gtctttcaaa ctcagtccat acttgtttaa cttgcgcttc
     1741 tgaatcccta aaccactgtc atagctaaat gcttattctt ttcacatatt tttaaggcat
     1801 ggtagagttt agccctttac tcgccccttc attaacctga cacgtttacc taatataaat
     1861 ccttctacat gctgtagact gggaacatag gtacc
//
LOCUS       ECOGUAC      1991 bp ds-DNA             BCT       11-JUL-1990
DEFINITION  E.coli GMP reductase (guaC) gene, complete cds.
ACCESSION   M33020
KEYWORDS    GMP reductase.
SOURCE      E.coli (strain K12) DNA, clone pDS89.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1991)
  AUTHORS   Andrews,S.C. and Guest,J.R.
  TITLE     Nucleotide sequence of the gene encoding the GMP reductase of
            Escherichia coli K12
  JOURNAL   Biochem. J. 255, 35-43 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        210     1250     GMP reductase (guaC) (E.C. 1.6.6.8)
    mRNA         25  <  1250     guaC mRNA (put.)
    mRNA         90  <  1250     guaC mRNA (put.)
    mRNA        126  <  1250     guaC mRNA (put.)
BASE COUNT      493 a    494 c    508 g    496 t
ORIGIN      
        1 gaattcatca tgattatcaa aacgttaaaa atgagtgcac gaaagcgaaa ttgatgaaac
       61 gttcgctcac tatttaccag gtaaatttat gggattgtag cgtaaaaaaa gacaatttcg
      121 cagtcttgcg ccgcattgat tagtgcgtat gatagcgtca ctggagttgc gctcttaccc
      181 ttatagccat taaccccagg aatccgcaca tgcgtattga agaagatctg aagttaggtt
      241 ttaaagacgt tctcatccgc cctaaacgct ccactcttaa aagccgttcc gatgttgaac
      301 tggaacgtca attcaccttc aaacattcag gtcagagctg gtccggcgtg ccgattatcg
      361 ccgcaaatat ggacaccgta ggcacatttt ctatggcctc tgcgctggct tcttttgata
      421 ttttgactgc tgtgcataaa cactattctg tcgaagagtg gcaagcgttt atcaacaatt
      481 cttccgctga tgtgctgaaa catgtgatgg tttctaccgg tacgtctgat gcggatttcg
      541 aaaaaactaa acagattctc gacctgaacc cggcattaaa cttcgtttgt attgacgtgg
      601 cgaatggtta ttccgaacac ttcgtgcagt tcgttgcgaa agcgcgtgaa gcgtggccga
      661 ccaaaaccat ttgtgctggt aacgtagtga ctggtgaaat gtgtgaggag cttatcctct
      721 caggtgccga tatcgttaaa gttggcattg gcccaggttc tgtttgtaca actcgcgtca
      781 aaacaggcgt cggttatccg caactttctg cggtaatcga atgtgccgat gctgcgcacg
      841 gtctgggcgg aatgatcgtc agcgatggtg gctgcaccac gccgggcgat gtggcgaaag
      901 cctttgcgcg tgccgatttc gtcatgcttg gcggcatgct ggcgggccac gaagagagcg
      961 gcggtcgcat cgttgaggag aacggcgaga aatttatgct gttctacggc atgagctccg
     1021 agtctgcgat gaaacgtcac gttggcggcg ttgcggaata tcgcgcagca gaaggtaaaa
     1081 ccgttaagct gccgctgcga ggcccggttg aaaataccgc gcgagatatt ttgggcggcc
     1141 tgcgttcagc ttgtacatac gttggggctt cacgcctgaa agagctgacc aagcgcacca
     1201 cgtttattcg tgtgcaggaa caagaaaacc gcatcttcaa caacctgtaa tctcccaacg
     1261 ctggcgtgga gcaacacgcc acggttatcc catcccactc atcgcatcgc ctaaatggaa
     1321 aattggcaga tacattgcca ccaccagcgt accaataatt cctcccgtta tgatcagcaa
     1381 cgcggttcag taaggctgcg aggttatccg ccagcgccat tgtgttttcc cgatgatgat
     1441 gggcgaggtt gtctaacatg agatccagag agccggatgc ctctcctgtt ctcactaatt
     1501 gcaaacagag cgggctaaac tcaccggtat tttttagcgc cagccagatg ggttgaccgt
     1561 tactgatatc gtgctggatt tgtgtcagaa gttgcaccca gtacgggcag cgcattgttt
     1621 ctctgacgct ctctacgccc tgtaaaaaag taatgcctgc actttgtgtc agcgccagaa
     1681 tcgtaaagat ctgcgtgagt ttttgtcccc gcatcagtga acccataatc gggatgcgta
     1741 acagcaattt ctgccgcact ataagccagg tcggtcggcg catcagcaac ttattggcta
     1801 tcgccagcag aaagccgaac acaccagcag ccagctccat tcgccactaa agtctgccag
     1861 cgtcatgatc ccctgcgtta gtgccggtag tggggtgttg aaggtcttat agatagcggc
     1921 aaactccggc agacacaaaa tgcagcattg ccacaaccac catgattagc catcgctaaa
     1981 atgatgatgg g
//
LOCUS       HUMDKERB     8815 bp ds-DNA             PRI       11-JUL-1990
DEFINITION  Human cytokeratin 8 (CK8) gene, complete cds.
ACCESSION   M34482
KEYWORDS    cytokeratin 8.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 8815)
  AUTHORS   Krauss,S. and Franke,W.W.
  TITLE     Organization and sequence of the human gene encoding cytockeratin 8
  JOURNAL   Gene 86, 241-249 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1113     1436     cytokeratin 8, exon 1
               3972     4180     cytokeratin 8, exon 2
               4809     4869     cytokeratin 8, exon 3
               5344     5439     cytokeratin 8, exon 4
               5958     6248     cytokeratin 8, exon 5
               7113     7333     cytokeratin 8, exon 6
               7492     7550     cytokeratin 8, exon 7
               8380     8567     cytokeratin 8, exon 8
    IVS        1437     3971     CK8 intron A
    IVS        4181     4808     CK8 intron B
    IVS        4870     5343     CK8 intron C
    IVS        5440     5957     CK8 intron D
    IVS        6249     7112     CK8 intron E
    IVS        7334     7491     CK8 intron F
    IVS        7551     8379     CK8 intron G
    signal     1007     1011     TATA box
    signal     8779     8784     poly-A signal
BASE COUNT     1868 a   2324 c   2481 g   2142 t
ORIGIN      
        1 tcaacggatc tcgctctttt ttttctttgg agatggaatc tcgctctgtc gcccaggctg
       61 gagtgcagtg gcaagtctca gctcactgca actctgcctc ccgggttcaa gtgattctcc
      121 tgcctcagcc tcctgagtag ctgggattac accatggcca gctaattttt gtatttttag
      181 tagagatggg gtttcaccat gttggtcagg cttgtcttga actcctgacc tcgtgatccg
      241 cctacctcag cctcccaaag tgctgggatt acaggcgtgc acagcgtgcc ctggccttgg
      301 atctcttttt atcttgcacc ttcagatgta gagggacgac agccactgtg tgtgtatgtg
      361 tatgtgtgtg tgtgtgtgtg tgtgtgcgcg tgtgatgttt attcactcat ttatttattc
      421 attcattcat tccacaaata tctacccaga ccctcttggc actgcaccag gtcgtagggg
      481 tagaacagta acctggaaag atgaggcaaa tggttgattt cagattcaag gctttggact
      541 ccagctgttc tgtcatccag ctcaggcagg ccctcataat cgcttcaatc agggagaaca
      601 caggagagtt tctctggggt gtcggcagct cagaggagac ccaaatacta ggagacccct
      661 tttcccatgc ttcccagtcc tccagtttat ttcccccagg aaggagggag acaagaccca
      721 gagtcagggt tgtagtggct gggcggccca ggcaagtctg cttgttacac gacttgtgcc
      781 aggacaggat ttcttccagt ttcatattca ctgaactgcc ttttcctggg tttctggggg
      841 tggtgctgga gtgggctcca gggttggaac gggcccttgc gacgcgtctc tgctgccccc
      901 acctgagtct gccccgaggt ggcaggtgac gggttcacgc gacgcctctg gcctagccac
      961 tcaggtacga ggcctttccc ccactccccg gggctgggat ctcttttata aaaggccatt
     1021 cctgagagct ctcctcacca agaagcagct tctccgctcc ttctaggatc tccgcctggt
     1081 tcggcccgcc tgcctccact cctgcctcta ccatgtccat cagggtgacc cagaagtcct
     1141 acaaggtgtc cacctctggc ccccgggcct tcagcagccg ctcctacacg agtgggcccg
     1201 gttcccgcat cagctcctcg agcttctccc gagtgggcag cagcaacttt cgcggtggcc
     1261 tgggcggcgg ctatggtggg gccagcggca tgggaggcat caccgcagtt acggtcaacc
     1321 agagcctgct gagccccctt gtcctggagg tggaccccaa catccaggcc gtgcgcaccc
     1381 aggagaagga gcagatcaag accctcaaca acaagtttgc ctccttcata gacaaggtga
     1441 gggtcccctg cgtggctgac tgtgccccgc agcccctttc tcctggtagt cccggtccct
     1501 atgcacatct ccagccccca gctggcgtcc tgctgggcct cacccgccct gggcacactc
     1561 tcccttccat cctccgacct cacccctccc gtgcaccttg gtttgggctg ggtgagggtg
     1621 gggagagggt ctggacagcc gggatgaatc ctggggcttc cttcttccct tttaaactgg
     1681 agggtcttgg aagagagaga caacttaagg gtacagccta gttcccacca cccctctcta
     1741 caaatcccgt tcttcctcag gtcattctgt cccaaattat aaaaaataat agcggttatt
     1801 gttctcaccc caacccagtt ctgaccgtct tttaacgtat gcctgcggca gtcccagctg
     1861 ttcgggacta ccctcctcca ggttcgcctc ttcgccagca ctacccaagg ctccccagtg
     1921 gtgcctttgt gatttttttt ctttcttttt tttacatagg ggtttggtgt gattctagca
     1981 ttctaggaga aggaagtggg tgtctcggtt caaacgggca aatattgatt gaggcctttg
     2041 gccgccggag gcctgagtgc gggggtcaca gaatgagtca tacggcccct ggcccggcag
     2101 cgtgggcggg gccgagggcg gggtgagggc tgcgggcagc agtctgcggg acgctctcct
     2161 ccactggcgg agctcggcgt cgggggcggt gtgggtgggg tggggtgggg tggggtgggc
     2221 tggggtgggg tggaggaggc gagggcctgg cctcggaaag cccatgcagg attcaaagtc
     2281 tcctgggacg ccgcccgggg tttacgtcct gttaagttta tggcttcaga taacgcggtc
     2341 gcccaccaac gcccctcgcc cattcagccc gtgtcccttt ctcggcgtcc tgtccctgct
     2401 gcccccagcc tcggctccac tttccacaca gcaggagcca gggccgggtt ttgcagcctg
     2461 ggactccgct gcctgagccc cggcccccgg cggccccgag gattgggccc ttcacgctga
     2521 ctggctcctg ggaggcattg tgggaacggg aggagggaaa tcctggggca gagtaagccg
     2581 ggaggaaccg gagccccagg aacccagtgg tcgggggccc tcgctgtcca agcgcctgga
     2641 cttgacttgt tgactgcgtt ttgctagccc tggggtcctt atagagagca gctaagcata
     2701 ggctttggaa tctgaattct tggtctgcac tcgtctgccg gttcctggtt atggactccc
     2761 ttgccaagtc ttatttcctc atctataaaa tgaatatgag agcccctaaa tccatatagc
     2821 aaaagttttt gccttattca aacttacata tgtaaagagt tcagcagtgc ttggcccaca
     2881 ttccattagg ataagatgtt ataatcactt ttttttaaaa aataattttg gggcagaatg
     2941 actggggaag aaagcgattt gcagagagtg gtggagggaa ctaggctgta cccttaaaag
     3001 atttctgtcc cctccagttt agaaggagtt acaagttttt ttgtttgttt gagacagagt
     3061 tactctgtgc ccaggctgga gtgcagtggt gtgatctcag ctcactgcaa cgctccgctt
     3121 cctgggttca agcgattctc ctgcctcagc caccgagtag ctgggactac aagtgcgtgc
     3181 acagcccggt taattttgta attattgtag gcaaggttca atatgttggc aggctggtct
     3241 cgaactctga cttcagaaat ccgcctgcct tgaccaccca aagtgctgga attacagcgt
     3301 gagcctccac gcccggcctc tttttcaatc ttaacatctt tagaaaggtt ggctattttt
     3361 ggccgggcgc gggcttacgc ctataatccc agcactttgg gaggccaagg cgggccaatc
     3421 acaaggtcag gagttcgaga ccatcctgcc taagacggtg aaaccctgtc tctactaaaa
     3481 atacaaaaaa attagtgggg cgtggtggca cgcacggctg cctgtagccc cagccactcg
     3541 ggaggctgag gcaggggcag gagaatggca tgaacttggg aggcggagct tgcagtgagc
     3601 tgagatcttg cactgcactc tagcctgggc cggagactcc caaagaaagc ttggctattt
     3661 ttattgatgt gtaatataca acctatgtaa atgaagttag gcctattggt ttgcaaatgc
     3721 agctttaaca taattacctt acctgtctcc ttcccctacc caatgctgag ggacattgct
     3781 ccccacctca ccatcatgcc atgctttctc cccctggtca taggtgatct ttccagaaca
     3841 gctaaccagg tgcctggggt ctggagactt actgcttgag gagtgaatta agagaaaaga
     3901 ctgcttgctt tcctccagac tttgagccct ggcctgatgt agaccttttt gctctctcct
     3961 ccttcgtata ggtacggttc ctggagcagc agaacaagat gctggagacc aagtggagcc
     4021 tcctgcagca gcagaagacg gctcgaagca acatggacaa catgttcgag agctacatca
     4081 acaaccttag gcggcagctg gagactctgg gccaggagaa gctgaagctg gaggcggagc
     4141 ttggcaacat gcaggggctg gtggaggact tcaagaacaa gtgagcaact ccaccctcca
     4201 cccaactgaa gtcacctgct ctcctccacc ccttgacctt gggactaagt ccatggccct
     4261 ctgttgtggg aagtgcagtc ctatctaatt agggtgacca cctgatgagg tttctcggac
     4321 agtctgtgtt tatgccaggt tctagcacat tgttgatagt acccacccct ttcaatctaa
     4381 ctgtctggat ttgaagaaca aattatgtgt caatgttgac atggtaaacc tgagacggga
     4441 gagataggca gcctgtgggc ctcacttttg tacttaacat tctggcccct ctttagtctt
     4501 gacccttgac ctctagcaaa ctctagaaag ttctgtctga ggtctcatgt caggccctgc
     4561 tgttaacact ctcaaggtgt ccaatccgat gtgtattcat ggatttggag agagatttcc
     4621 tgcttcccac gggctaaggg aggggtgagg gtggagaggg cagctgggga aggcagaagg
     4681 accagccttc tcatatcctc atctctgtga actgaatttc ctgatttcac aacgcccctg
     4741 tctcccaaaa gaccaagggc aacctccctt ttgccttcat cctctaattg taagtctttt
     4801 cctcacaggt atgaggatga gatcaataag cgtacagaga tggagaacga atttgtcctc
     4861 atcaagaagg tgagggagtc tcccttctcc tatctggaca ctggaggctg gggctcagag
     4921 actcagacca agaagctttc tgggttttgt ccctaaatat tcctaagtag tgggacaaac
     4981 tcatttatgt aaacatttgg gtgcacagaa aggtagacaa ggatggagtg gtaggtgcat
     5041 ttggacagaa ctcttgacat cggtgttggg acatggttca gaaaacagag cagtagaact
     5101 ggagatctgg ctctagaagg ctccctagag aaggaggtgg aagagggtgt gttgcaggaa
     5161 gcagaggtga aggtgtgtgg gctgagaatg cacatgtgat gggcagaggc tgggctggaa
     5221 gatcaatcca caaagtggca actagaaagt cctgtgacca ggccattggg tggaccttgg
     5281 gagccccttg gttggggttg ggtgtggaaa cccagctcag gctcccctct cctcatcccc
     5341 caggatgtgg atgaagctta catgaacaag gtagagctgg agtctcgcct ggaagggctg
     5401 accgacgaga tcaacttcct caggcagcta tatgaagagg tatgttcctg gtcgcaggag
     5461 agtgagggtc cccagccttg tcagcgcctc caccctgaga ctcaaccaga ggctcctccc
     5521 agcccccagc acactaataa gacaaaggac cccactgctg actaattaca gccaccaata
     5581 tttgctcggc tagtatttat tgggtctata tgttctgtcc ctcgcatgag gtgagtcatt
     5641 accccatttc acagacgaga aagtgggctc agagaagtga aataacgtat ccaaggtcat
     5701 catagggtgt ggtgattcag cagcaactct gtccccaaag cccttgttcc taatctttga
     5761 gctgcattgg atccctctgt gcacctagta ttggtgaccc agttcctttt tcaggaactt
     5821 tgcccctctc cctgaccctg actcccacct gctcctctcc tctgctgccc ctgtcttata
     5881 cctaagaaag gctgttgtgg aaaagggggc tcctgtgtgc agagacaggg cctcaccact
     5941 tgccctcttc cccacaggag atccgggagc tgcagtccca gatctcggac acatctgtgg
     6001 tgctgtccat ggacaacagc cgctccctgg acatggacag catcattgct gaggtcaagg
     6061 cacagtacga ggatattgcc aaccgcagcc gggctgaggc tgagagcatg taccagatca
     6121 agtatgagga gctgcagagc ctggctggga agcacgggga tgacctgcgg cgcacaaaga
     6181 ctgagatctc tgagatgaac cggaacatca gccggctcca ggctgagatt gagggcctca
     6241 aaggccaggt atgggccggg ttgggggtgg gagggttcct tggacacaat cctggtgaga
     6301 ggagataatg taggaagagt gaagtttctg ggagtcgggg aaggaatcct agaccagggt
     6361 tcaggagttg gaggggcagc cacagttcag cttctcagtc tgcttctgag aagcaaaggg
     6421 atgcagggaa ggtcccttgg gccaggacag aggtgaaagg ggactggggc aggtatgttg
     6481 gggactcgtg atacatgctc caagcctgct ttaatcagtc atatgcatca ggggtaaggt
     6541 tgagctctgc tgctttaagg aaagtctaga acccagggat ctagtccagt tagggtaggg
     6601 ggaccttaca gtgtcgcagg tcgagaaggg tgtggagggg aagcacctgg aaactgctca
     6661 tgtctccctg atctgcttcc ttagtctcgt ttatttattt atttattttt gagacagagt
     6721 cttgctctgt cgcccaggct ggagtgcagt ggcgtgatct cggctcactg caagctccgc
     6781 ctcctgggtt cacactattc tcctgactca gcctcctgag tagctgggac tacaggcgcc
     6841 cgcaccaggc tggctaattt tttttgtatt tttgctagag acggggtttc actgtgttag
     6901 ccaggactcg tcgatctcct gaccttgtga tctgcccgcc tcgcctccca aagtgctggg
     6961 attacaggca tgagcactgt gcccggccct tagtctcatt aattgagctg gggagtcagc
     7021 ctagtgtgtg gaggacctga gggagggtgg acgcacggag gaagagaagg catacccaac
     7081 ctgacctact tacctgtccc ctacccacag agagggcttc cctggaggcc gccattgcag
     7141 atgccgagca gcgtggagag ctggccatta aggatgccaa cgccaagttg tccgagctgg
     7201 aggccgccct gcagcgggcc aagcaggaca tggcgcggca gctgcgtgag taccaggagc
     7261 tgatgaacgt caagctggcc ctggacatcg agatcgccac ctacaggaag ctgctggagg
     7321 gcgaggagag ccggtgggtg tgggtacctc tgaccggacc tgcttcccta tccctgggac
     7381 ctggggtggg gacggtggga gccccctgaa gccccttgga cttggggtcc tgttgttctg
     7441 ggccaagaag ggctaggagt tggtcctgac accccatttg acagggtaca ggctggagtc
     7501 tgggatgcag aacatgagta ttcatacgaa gaccaccggc ggctatgcag gtggtgtccc
     7561 agggccctgg atgagggcgg gaggcagggc cagggaggct cagctccagg gagggggctg
     7621 tgctcagtcg ctcacagtga cctcagcctg agcactcatg ttcttgggag aatcctaggg
     7681 tggggaggca catattcagg gaactccagt aataacttta ttacttagta acttcatatt
     7741 agaagataca ccaataacca tagctgtgtg ccaggcactt gcgtaagtat cctacaggtt
     7801 ttatgtgatt tattttattt attaatttaa tttaattttt ttgagacgaa gtctcgctgt
     7861 caccaagctg agtgcagtgc tgatctcagc tcactgtaac ctcacctcct gggttcaaga
     7921 gattctcctc cgtcaggcct cccaagtagc tgggactaca ggcgcatacc accatgccca
     7981 tgctaatttt tgtattttta gtagagacgg ggtttcactg tgttgggcag gctggtctcg
     8041 aactcctgac cttgtgatca gtgctgggat tacaggcatg agacactggg cctggctgta
     8101 atttattttt tatatgacac ctgtaaacgt cttcagttga ggaaggctga ggtgcagcta
     8161 aatgtccaag ctgacacagg ctatatatat ggcagctgtt ttccaccctg ctcctggttt
     8221 tccctgacag ttctggagta gtgaaccatg caatcactga tcaggagagc tgggttaacc
     8281 tccatccctg gggctatgtt gggaatgagc agggagaagg gcatggagcc tgccatggtg
     8341 ggcttctgta ctcatgtggc tacctctgtc cctcaccagg tggtctgagc tcggcctatg
     8401 ggggctcaca agccggcctc agctacagcc tgggctccag ctttggctct ggcgcgggct
     8461 ccagctcctt cagccgcacc agctcctcca gggccgtggt tgtgaagaag atcgagacac
     8521 gtgatgggaa gctggtgtct gagtcctctg acgtcctgcc caagtgaaca gctgcggcag
     8581 cccctcccag cctacccctc ctgcgctgcc ccagagcctg ggaaggaggc cgctatgcag
     8641 ggtagcactg ggaacaggag acccacctga ggctcagccc tagccctcag cccacctggg
     8701 gagtttacta cctggggacc ccccttgccc atgcctccag ctacaaaaca attcaattgc
     8761 tttttttttt tggtccaaaa taaaacctca gctagctcgc cgaatgtcct tgctt
//
LOCUS       HUMSRU30S     179 bp ss-RNA             RNA       11-JUL-1990
DEFINITION  Human 30S small nuclear ribonucleotide protein pre-mRNA complex,
            exons 1 and 2 (partial).
ACCESSION   M34493
KEYWORDS    small nuclear ribonucleoprotein.
SOURCE      Human Hela cell pre-mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 179)
  AUTHORS   Pruzan,R., Furneaux,H., Lassota,P., Hong,G.Y. and Hurwitz,J.
  TITLE     Assemblage of the prespliceosome complex with separated fractions
            isolated from Hela cells
  JOURNAL   J. Biol. Chem. 265, 2804-2813 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >    55     small nuclear ribonucleotide protein (snRNP),
                                 exon 1 (AA at 2)
                142  >   179     small nuclear ribonucleotide protein, exon 2
    pre-msg <     1  >   179     snRNP pre-mRNA complex
    IVS          56      141     30S small nuclear RNA intron A
BASE COUNT       30 a     48 c     46 g     55 t
ORIGIN      
        1 aatacacgga attcactctc ttccgcatcg ctgtctgcga gggccagctg ttggggtgag
       61 tgtgacctgc acgtctaggg cgcagtagtc cagggtttcc ttgatgatgt catacttatc
      121 ctgtcccttt tttttccaca gctcgcggtt gaggacaaac tcttcgcggt ctttccagt
//
LOCUS       K5TPA1PRO     307 bp ds-DNA             PHG       11-JUL-1990
DEFINITION  Bacteriophage BK5-T promotor DNA.
ACCESSION   M34486
KEYWORDS    .
SOURCE      Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1266.
  ORGANISM  Bacteriophage BK5-T
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 307)
  AUTHORS   Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
  TITLE     Molecular characterization of promoters of the Lactococcus lactis
            subsp. cremoris temperate bacteriophage BK5-T and identification of
            a phage gene implicated in the regulation of promoter activity
  JOURNAL   Appl. Environ. Microbiol. 56, 934-942 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        304  >   307     bacteriophage BK5-2 mRNA
BASE COUNT       80 a     56 c     43 g    128 t
ORIGIN      
        1 gatcaaggtg tgtaggtgta atctctagct taggaacgct tttgatacag aacgtgtgat
       61 tgtccgtttt taactttctt gttttgtcat cttcataaac tcacaaagtt tatttttgga
      121 acaaattttt cttttttatc gtatgacgta acttttttca tttggtccat cataagcttt
      181 tttaatattg tcagcttttg ctttttcgac gttctctacc gacgctttca aaatctttaa
      241 tgaaaaaaac cgtaaccatc gaatttttct tccatatttt caaagaatcc gttactatct
      301 aacgatc
//
LOCUS       K5TPA3PRO     182 bp ds-DNA             PHG       11-JUL-1990
DEFINITION  Bacteriophage BK5-T promotor Pa3 DNA.
ACCESSION   M34488
KEYWORDS    .
SOURCE      Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1268.
  ORGANISM  Bacteriophage BK5-T
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 182)
  AUTHORS   Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
  TITLE     Molecular characterization of promoters of the Lactococcus lactis
            subsp. cremoris temperate bacteriophage BK5-T and identification of
            a phage gene implicated in the regulation of promoter activity
  JOURNAL   Appl. Environ. Microbiol. 56, 934-942 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        169  >   182     promotor region mRNA
BASE COUNT       57 a     21 c     24 g     80 t
ORIGIN      
        1 ttttcagaat atgaagttaa aagttctcta atatttttat ccgttaaaga gtatcctata
       61 aataaaattg gggattctgt taagtttgac aatatttttc gcatttacta atgctaattt
      121 agattcatta tttttataat cctcactagt tatacatata gtatttgggt ttttgactga
      181 tc
//
LOCUS       K5TPF1PRO     177 bp ds-DNA             PHG       11-JUL-1990
DEFINITION  Bacteriophage BK5-T promotor Pf1 DNA.
ACCESSION   M34490
KEYWORDS    .
SOURCE      Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1262.
  ORGANISM  Bacteriophage BK5-T
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 177)
  AUTHORS   Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
  TITLE     Molecular characterization of promoters of the Lactococcus lactis
            subsp. cremoris temperate bacteriophage BK5-T and identification of
            a phage gene implicated in the regulation of promoter activity
  JOURNAL   Appl. Environ. Microbiol. 56, 934-942 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        149  >   177     ORF
    mRNA        110  >   177     ORF mRNA
BASE COUNT       63 a     29 c     37 g     48 t
ORIGIN      
        1 cctttattct tcgtgcaagg aggcgcaaga tggtcaaaac ttacaaaccg attgatttta
       61 acagaaaatg taagattgga gttactaaaa cagtaactta ctccaactgg aggtaagatt
      121 gaaaaaattg acccaggaac ggttttaaat gttcgatttc gcggctaaaa tgagatc
//
LOCUS       K5TPF2PRO    1209 bp ds-DNA             PHG       11-JUL-1990
DEFINITION  Bacteriophage BK5-T promotor Pf2 and an ORF, partial cds.
ACCESSION   M34487
KEYWORDS    .
SOURCE      Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1261.
  ORGANISM  Bacteriophage BK5-T
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 1209)
  AUTHORS   Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
  TITLE     Molecular characterization of promoters of the Lactococcus lactis
            subsp. cremoris temperate bacteriophage BK5-T and identification of
            a phage gene implicated in the regulation of promoter activity
  JOURNAL   Appl. Environ. Microbiol. 56, 934-942 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        249  >  1209     bacteriophage BK5-2 mRNA
BASE COUNT      377 a    213 c    235 g    384 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattctgaa tatggttcgt aaccctatgg catttctcaa tactctttca tctaaaactg
       61 aaactagcgg aagtgatagt gctgctggac ttactattcc gcaagatatc cgtactatga
      121 ttaacacatt ggttcgccaa tatgactcac tacaacaata tgtacgtgtt gagagtgttt
      181 ctacttcaaa cggtagtcgt gtatatgaaa aatggactga tgtaactccg ttgactgtaa
      241 tggatgcaga agatggaaaa attcctgatc ttgataatcc acgtttggac aattattaaa
      301 tacttgatta aacgttatgc gggaatcatc aatgccaact aatacattgc ttaaagatac
      361 agcagaaaat attcttgcat ggttatcaag ctggattgct aagaaagtgg ttgtgactcg
      421 taaccaagcg attattgcag caatgggtac agttcctaaa aaaccaacaa tcgctaaatt
      481 tgatgatgtt attactatga ttaatacatc tgttgatcct gcgattatcg ccacttcaag
      541 tcttttgact aaccagtcag ggttgaataa acttgctttg gttaaaactg ctgaaggtaa
      601 atatttgctc gaaccagacc caacaaaacc taattcatat ctaattaaag gtaaaaaagt
      661 tattgttgtt gcagatcgct ggcttccaaa tagtggatca acagtttatc cactttacta
      721 tggagatatg tcgcaagcta ttacattgtt tgaccgtgaa aacatgtcat tacttccaac
      781 aaatattggt gctggtgcat ttgaaactga tactactaaa attcgtgtaa tcgatcgctt
      841 cgatgttaaa actgctgact cagaagcttt agttgctggt tcacttactg caattgcaga
      901 ccaagtaggt aattttactg caggaaagta ggtaatttat gacagtaact gttgatgact
      961 tactagatca gttatcagaa gatgatgatc gcaaaccgca acttcaaatt tatttgatac
     1021 agcaaaagca tatgtgaaaa atgcagtgag ttctgataca gttgatgctc catttttcag
     1081 tgtagaaaac gtttatccga tttatgatgt agctgttctt agctattcta tggatttgtg
     1141 gattaatcgt tctacgacta tgccgcctac tacggctgta gatcacatgg ttggtcagtt
     1201 gagaggcct
//
LOCUS       K5TPG2PRO     195 bp ds-DNA             PHG       11-JUL-1990
DEFINITION  Bacteriophage BK5-T promotor Pg2 DNA.
ACCESSION   M34489
KEYWORDS    .
SOURCE      Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1265.
  ORGANISM  Bacteriophage BK5-T
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 195)
  AUTHORS   Lakshmidevi,G., Davidson,B.E. and Hillier,A.J.
  TITLE     Molecular characterization of promoters of the Lactococcus lactis
            subsp. cremoris temperate bacteriophage BK5-T and identification of
            a phage gene implicated in the regulation of promoter activity
  JOURNAL   Appl. Environ. Microbiol. 56, 934-942 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        154  >   195     promotor region mRNA
BASE COUNT       71 a     28 c     40 g     56 t
ORIGIN      
        1 agagatttac gaaaagttga gtgctttagc tgaaattgat agacttttcc attggtctag
       61 ccatttacat caagaacgat tacaatttgt tagtaaatat ccaaatgtta tggaaaaata
      121 cagacaagca aactaaggag ggtatattga atgaccgaca aactaatatc gctggtcatc
      181 aaagtgtgtg actgg
//
LOCUS       MUSH2A       1805 bp ds-DNA             ROD       11-JUL-1990
DEFINITION  Mouse (H-2a haplotype) DNA fragment.
ACCESSION   D90007
KEYWORDS    .
SOURCE      Mouse (strain B10.A, haplotype H-2a) DNA, clone B10.A.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1805)
  AUTHORS   Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T.,
            Steinmetz,M. and Moriwaki,K.
  TITLE     Recombinational hotspot specific to female meiosis in the mouse
            major histocompatibility complex
  JOURNAL   Immunogenetics 31, 79-88 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Toshihiko
            Shiroishi National Institute of Genetics Yata-1111 Mishima,
            Shizuoka 411 Japan Phone:  0559-75-0771
FEATURES       from  to/span     description
    site          1      288     MT-consensus
    rpt        1227     1242     TCTG repeat
    LTR        1533     1568     Xenotropic solitary LTR sequence
BASE COUNT      391 a    400 c    409 g    605 t
ORIGIN      Chromosome 17.
        1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt
       61 ggcgccattc ctgggctggc agtcctggtt tctataagaa agcaggctga gcaagtgatg
      121 aggacgcccc tccatggcct ctgcatcagc tcctgcctcc agattcctgt cctgatttct
      181 tcggtgacta acagctatgt ggaagtgtaa acaggatgaa cgctttcctt cccaggtagc
      241 tttggtcctg gggtttcatt gcagtaatag taaccctaga tgggacaaga ctttgatcaa
      301 gtgttccctt tcattgtccc cttcctgtag acatgacttc tcttcctata gacagtctct
      361 cctctgcttt cctggacatg taattttttt ttttgagaca aggtcattct tgttgtctat
      421 tcttgactgg ctttgaattc agaatctgca ggctctgcct ctctggtaac atgtaacatt
      481 ttccatatgt aacattttta ccagccattt cccagtaaat gagttacttc atttgaggtt
      541 ttgtcttaaa tccccgtgag caatgttttg ttagtttcca aagcacgagg attctaagtg
      601 tctatttgtt gctaagttgc caggctgtta cagagcacag tttctgggac cctggctctc
      661 tgaaactgac tagggattgc tttagtataa acataaacca ctgggactct ggctctttga
      721 aactgactag ggattgcttt agtacaagta taaaccactc agtcctggtc ctacttggct
      781 tcaaaagttg aatatcgctt ttggtatttg agatggagat ttaaagatgg aattttatta
      841 gtcttctgcc tggttttctt tctttctttg ctcttactgc cttgtggctc agaaccagct
      901 gttgcctgtt tgatagtttg tgaccaatac ctgtactgtt aaattggcca tttgagaact
      961 caaaaagtcc caacttgtag tgttttcggt ttccatggtc ttagatattt ccactgcaga
     1021 caacatcaag ttgccagtgg ttaacaactg tctttcagaa ctctcaagta tttcggtggg
     1081 tctgccagcc cttgtaacgt agcgccacgt ggtatatgct tatttgtctg tctgtctgtc
     1141 tgttgtgcaa gatgcctgtg tgccctgagg tcagaggaca gcttcaaggg ctctccattc
     1201 ttccctgacc acgtggatcc agggaataga actttgacca ttacccacgg gccatgttat
     1261 ttcttgacag ttctgttgta catttgtttt agtctttggc tttatttatt tttctcaccc
     1321 tcagtttccc tttgtctcag atgctttttt ttttttttta aatcttgcct tgggagatgt
     1381 ttcaaactct tggaacgaat gatacagttg tttgattgat agaacgaagc cttccagtgt
     1441 gaatgcgttt gcatttcagc ttgttgctgg ctggctgtgt ggtgctggtt cagacatgtc
     1501 acaggcttga ggtgttaagg ctaactgagt tcggagagtc cccacctgac cccttctccg
     1561 ttcccctcac cagggagacc tccctcctgg ctgcagttga gcagggtgca ccggggctgg
     1621 tttcagggca ggctggtagt cttctgactc tgctcactgg ccactttcag ttcctgcttt
     1681 ctgaatccta tccagagttc tcagtggtca tcagactctg gagaggacga ggggaagggg
     1741 tgggctctta aactatcatt tatatttaaa aaaaattaaa caacagagtt agaagcagat
     1801 ccagg
//
LOCUS       MUSH2B       1634 bp ds-DNA             ROD       11-JUL-1990
DEFINITION  Mouse (H-2b haplotype) DNA fragment.
ACCESSION   D90008
KEYWORDS    .
SOURCE      Mouse (strain C57BL/10, haplotype H-2b) DNA, clone B10.30.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1634)
  AUTHORS   Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T.,
            Steinmetz,M. and Moriwaki,K.
  TITLE     Recombinational hotspot specific to female meiosis in the mouse
            major histocompatibility complex
  JOURNAL   Immunogenetics 31, 79-88 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Toshihiko
            Shiroishi National Institute of Genetics Yata-1111 Mishima,
            Shizuoka 411 Japan Phone:  0559-75-0771
FEATURES       from  to/span     description
    site          1      285     MT-consensus
    rpt        1128     1143     TCTG repeat
    LTR        1534     1569     Xenotropic solitary LTR sequence
BASE COUNT      340 a    373 c    367 g    554 t
ORIGIN      Chromosome 17.
        1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt
       61 ggcgccattc ctgggctggc agtcctggct tctataagaa agcaggctga gcaagtgatg
      121 acgcccctcc atggcctctg catcagctcc tgcctccaga ttcctgtcct gatttcttcg
      181 gtgactaaca gctatgtgga agtgtaaaca ggatgaacgc tttccttccc aggtagcttt
      241 ggtcctgggg tttcattgca gtaatagtaa ccctagatgg gacaagactt tgatcaagcg
      301 ttccctttca ttgtcccctt cctgtagaca tgacttctct tcctatagac agtctcccct
      361 ctgctttcct ggacacggaa tttttttttt tttttttttg agacaaggtc tttcttgtct
      421 attctcgact ggctttgaat tcagaatctg cagctctgcc tctctagtaa catgtagcat
      481 tttccatatg taacattttt accagccatt tcccagtaaa tgagttactt catttggggt
      541 tttatcctaa atccccgtga gcaatgtttt gttagtttcc aaagcacgag gattctaagt
      601 gtctatttgt tgccaagttg ccaggctgtt acagagcaca gtttctggga ccctggctct
      661 ctgaaactga ctagggattg ctttagtata aacataaacc actgggactc tggctctttg
      721 aaactgacta gggattgctt tagtacaagt ataaaccact cagtcctggt cctacttggc
      781 ttcaaaagtt gaatatcgca tttggtattt gagatggaga tttaaagacg gaattttatt
      841 agtcttctgc ctggttttct ttctttcttt gctcttactg ccttgtggct cagaaccagc
      901 tgttgcctgt ttgatagttt gtgaccaata cctgtactgt taaattggcc atttgagaac
      961 tcaaaaagtc ccaacttgta gtgttttcgg tttccatggt cttagatatt tccactgcag
     1021 acaacatcaa gttgccagtg gttaacaact gtctttcaga actctcaagt gtttcggtgg
     1081 gtctgccagc ccttgtaacg tagcgccacg tggtatatgc ttatttgtct gtctgtctgt
     1141 ctgttgtgca agatgcctgt gtgccctgag gtcagaggac agcttcaagg gctctgcatt
     1201 cttccctgac cacgtggatc cagggaatag aactttgacc attacccacg ggccatgtta
     1261 tttcttgaca gttctgttgt acatttgttt tagtctttgg ctttatttat ttttctcacc
     1321 ctcagtttcc ctttgtctca gatgcttttt tttttttttt aatcttgcct ctgggagatg
     1381 tttcaaactc ttggaacgaa tgatacagtt gtttgattga tagaacgaag ccttccagtg
     1441 tgaatgcgtt tgcatttcag cttgttgctg gctggctgtg tggtgctggt tcagacatgt
     1501 cacaggcttg aggtgttaag gctaactgag ttcggagagt ccccacctga ccccttctcc
     1561 gttcccctca ccagggagac ctccctcctg gctgcagttg agcagggtgc accggggctg
     1621 gtttcagggc atgc
//
LOCUS       MUSH2WM7     1630 bp ds-DNA             ROD       11-JUL-1990
DEFINITION  Mouse (H-2wm7 haplotype) DNA fragment.
ACCESSION   D90009
KEYWORDS    .
SOURCE      Mouse (strain B10.MOL-SGR, haplotype H-2wm7) DNA, clone SGR.31.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1630)
  AUTHORS   Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T.,
            Steinmetz,M. and Moriwaki,K.
  TITLE     Recombinational hotspot specific to female meiosis in the mouse
            major histocompatibility complex
  JOURNAL   Immunogenetics 31, 79-88 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Toshihiko
            Shiroishi National Institute of Genetics Yata-1111 Mishima,
            Shizuoka 411 Japan Phone:  0559-75-0771
FEATURES       from  to/span     description
    site          1      288     MT-consensus
    rpt        1126     1141     TCTG repeat
    LTR        1533     1568     Xenotropic solitary LTR sequence
BASE COUNT      342 a    363 c    370 g    555 t
ORIGIN      Chromosome 17.
        1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt
       61 ggcgccattc ctgggctggc agtcctggtt tctataagaa agcaggctga gcaagtgatg
      121 aggacgcccc tccatggcct ctgcatcagc tcctgcctcc agattcctgt cctgatttct
      181 tcggtgacta acagctatgt ggaagtgtaa acaggatgaa cgctttcctt cccaggtagc
      241 tttggtcctg gggtttcatt gcagtaatag taaccctaga tgggacaaga ctttgatcaa
      301 gtgttccctt tcattgtccc cttcctgtag acatgacttc tcttcctata gacagtctct
      361 cctctgcttt actggacatg taattttttt tttgagacaa ggtcattctt gttgtctatt
      421 cttgactggc tttgaattca gaatctgcag gctctgcctc tctggtaaca tgtaacattt
      481 tccatatgta acatttttac cagccatttc ccagtaaatg agttacttca tttgaggttt
      541 tgtcttaaat ccccgtgagc aatgttttgt tagtttccaa agcacgagga ttctaagtgt
      601 ctatttgttg ctaagttgcc aggctgttac agagcacagt ttctgggacc ctggctctct
      661 gaaactgact agggattgct ttagtataaa cataaaccac tgggactctg gctctttgaa
      721 actgactagg gattgcttta gtacaagtat aaaccactca gtcctggtcc tacttggctt
      781 caaaagttga atatcgcttt tggtatttga gatggagatt taaagatgga attttattag
      841 tcttctgcct ggttttcttt ctttctttgc tcttactgcc ttgtggctca gaaccagctg
      901 ttgcctgttt gatagtttgt gaccaatacc tgtactgtta aattggccat ttgagaactc
      961 aaaaagtccc aacttgtagt gttttcggtt tccatggtct tagatatttc cactgcagac
     1021 aacatcaagt tgccagtggt taacaactgt ctttcagaac tctcaagtgt ttcggtgggt
     1081 ctgccagccc ttgtaacgta gcgccacgtg gtatatgctt atttgtctgt ctgtctgtct
     1141 gttgtgcaag atgccggtgt gccctgaggt cagaggacag cttcaagggc tctgcattct
     1201 tccctgacca cgtggatcca gggaacagaa ctttgaccat tatccacggg ccatgttatt
     1261 tcttgacagt tctgttgtac atttgtttta gtctttggct ttatttattt ttctcaccct
     1321 cagtttccct ttgtctcaga tgcttttttt ttttttttta atcttgcctc tgggagatgt
     1381 ttcaaactct tggaacgaat gatacagttg tttgattgat agaacgaagc cttccagtgt
     1441 gaatgcgttt gcatttcagc ttgttgctgg ctggctgtgt ggtgctggtt cagacatgtc
     1501 acaggcttga ggtgttaagg ctaactgagt tcggagagtc cccacctgac cccttctccg
     1561 ttcccctcac cagggagacc tccctcctgg ctgcagttga gcagggtgca ccggggctgg
     1621 tttcagggca
//
LOCUS       MUSMHH2IE     576 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Mouse MHC class II I-E-beta-1 (haplotype H2b/K) gene, partial cds.
ACCESSION   M28408
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane glycoprotein; major histocompatibility complex.
SOURCE      Mouse (strain B10 (3R)) adult spleen (haplotype H2b/k), cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 576)
  AUTHORS   Gorski,J. and Hayes,C.E.
  TITLE     The I-J-disparate mouse strains B10.A(3R) and B10.A(5R) have
            identical I-E beta sequences
  JOURNAL   Immunogenetics 39, 127-129 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted J.Gorski, 27-SEP-1989.
            
            The sequence for Mouse (strain B10.A (5R)) is identical to [1].
FEATURES       from  to/span     description
    pept    <     1  >   576     H2-I-E-beta (AA at 1)
BASE COUNT      140 a    149 c    178 g    109 t
ORIGIN      Chromosome 17.
        1 gtcagagact ccagaccatg gtttttggaa tactgtaaat ctgagtgtca tttctacaac
       61 gggacgcagc gcgtgcggct tctggaaaga tacttctaca acctggagga gaacctgcgc
      121 ttcgacagcg acgtgggcga gttccgcgcg gtgaccgagc tggggcggcc agacgccgag
      181 aactggaaca gccagccgga gttcctggag caaaagcggg ccgaggtgga cacggtgtgc
      241 agacacaact atgagatctc ggataaattc cttgtgcggc ggagagttga gcctacggtg
      301 actgtgtacc ccacaaagac gcagcccctg gaacaccaca acctcctggt ctgctctgtg
      361 agtgacttct accctggcaa cattgaagtc agatggttcc ggaatggcaa ggaggagaaa
      421 acaggaattg tgtccacggg cctggtccga aatggagact ggaccttcca gacactggtg
      481 atgctggaga cggttcctca gagtggagag gtttacacct gccaggtgga gcatcccagc
      541 ctgaccgacc ctgtcacggt cgagtggaaa gcacac
//
LOCUS       RATFAPS      1271 bp ss-mRNA            ROD       11-JUL-1990
DEFINITION  Rat testis-specific farnesyl pyrophosphate synthetase mRNA,
            complete cds.
ACCESSION   M34477
KEYWORDS    farnesyl pyrophosphate synthetase.
SOURCE      Rat adult (Sprague-Dawley), cDNA to mRNA, clone TF1.4.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1271)
  AUTHORS   Teruya,J.H., Kutsunai,S.Y., Spear,D.H., Edwards,P.A. and
            Clarke,C.F.
  TITLE     Testis-specific transcriptional initiation sites of rat farnesyl
            pyrophosphate synthetase mRNA
  JOURNAL   Mol. Cell. Biol. 10, 2315-2326 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        158     1219     farnesyl pyrophosphate synthetase
    mRNA          1     1271     farnesyl pyrophosphate synthetase mRNA
BASE COUNT      329 a    319 c    347 g    276 t
ORIGIN      
        1 ttatatttgg gttctgccta ctgagccggg agtctgggaa ctacaactcc cagagtgctg
       61 agcggatgca cgctctgctt ttaggtgtaa gccgcaaaca tcttggaccc cgggagaatc
      121 cgcgttgaag cacagagcat ttagctcctc tgtcagaatg aatggggacc agaaactgga
      181 tgttcataac caagaaaagc agaatttcat ccagcacttc tcccagattg tcaaggtgct
      241 gactgaggat gaactgggac acccagagaa gggagatgct attacccgga tcaaagaggt
      301 cctggagtac aacactgtag gaggcaagta caatcggggt ctgacggtgg tacagacctt
      361 ccaggaactg gtggaaccaa ggaaacagga tgctgagagc ctacagcggg ccctgacggt
      421 gggctggtgt gtagaactgc tccaggcttt cttcctcgtg ttagatgaca tcatggactc
      481 ttcccacact cgccgggggc agatctgctg gtatcagaag ccgggcatag gcttggatgc
      541 catcaacgat gctctgcttc tggaagccgc tatctaccgc ctgcttaagt tctactgcag
      601 ggagcagccc tactacctca acctgctgga gctctttcta cagagttcct atcagactga
      661 gatcgggcag actctcgacc tcatcacagc accccagggc caagtggatc ttggtagata
      721 cactgaaaag aggtacaaat ctatcgtcaa gtacaagaca gctttctact ctttctacct
      781 gcctatcgcg gctgccatgt acatggctgg aattgatggg gagaaggaac acgctaatgc
      841 cctgaagatc ctgctggaga tgggcgagtt cttccagatc caggacgact accttgatct
      901 ctttggagac cccagtgtga ccggaaaggt cggcactgac atccaggaca acaaatgcag
      961 ctggctggtg gttcagtgtc tgctacgagc cactcctcag cagcgccaga tcttagagga
     1021 gaattatggg cagaaggacc cagaaaaagt ggcgcgggtg aaagcactgt acgaggagct
     1081 ggatctgcgg agtgtgttct tcaagtacga ggaagacagt tacaaccgcc tcaagagtct
     1141 catagagcag tgctccgcgc ccctgccccc atccatcttc ctggaactag caaacaagat
     1201 ctacaagcgg agaaagtaac ctcgaattgt agaggctgcg agggaggggt ctcaataaat
     1261 tattgttcaa c
//
LOCUS       TTHRPEGL     2340 bp ds-DNA             BCT       11-JUL-1990
DEFINITION  Thermus thermophilus trpL, anthranilate synthase I and II (trpE and
            trpG) genes, complete cds.
ACCESSION   X07744
KEYWORDS    anthranilate synthase I; anthranilate synthase II; trpE gene;
            trpG gene; trpL gene.
SOURCE      Thermus thermophilus (strain HB8 (ATCC 27634) DNA.
  ORGANISM  Thermus thermophilus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 2340)
  AUTHORS   Sato,S., Nakada,Y., Kanaya,S. and Tanaka,T.
  TITLE     Molecular cloning and nucleotide sequence of Thermus thermophilus
            HB8 trpE and trpG
  JOURNAL   Biochim. Biophys. Acta 950, 303-312 (1988)
  STANDARD  simple automatic
COMMENT        EMBL features not translated to GenBank features:
               key        from     to       description
            
               PRM          37     42       pot. -35 region
               PRM          60     65       pot. -10 region [1]
            
            Author address: Sato S., Mitsubishi Kasei, Institute of Life
            Sciences, 11, Minamiooya Machida-Shi, Tokyo, Japan.
            
            Submitted (24-MAY-1988) on tape to the EMBL data library.
FEATURES       from  to/span     description
    pept         72      107     trpL protein
    pept        169     1557     anthranilate synthase I (trpE) (EC 4.1.3.27)
    pept       1603     2217     anthranilate synthase II (trpG)
BASE COUNT      351 a    764 c    855 g    370 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccgggc cctggagggg cggccccttt agcccctgga cagggccccc gtgtcccgct
       61 atcctgaggc catggccctt ccctccgccc tctggtggcc cggctaggcc ccggggcggg
      121 aggcctttcc ccggggcaca ccccggggct ttgtttttgg gggacggcat ggagcggatc
      181 cgaccttacc gcaaaacctt cctcgcggac ctggagaccc cggtgaccgc ctacctgaag
      241 cttgccgaga aggctccggt gagcttcctt ttggagtcgg tggagcgggg gcgccaaagc
      301 cgcttctcca tcgtgggggt gggggcgcgg cgcaccttcc gcctgaagga cggggtcttc
      361 acggtgaacg gggagcgggt ggaaacccgt gatcccttgc gcgccctcta cgagagggtc
      421 tacgccccct tggagcgcca ccccgacctc ccccccttct tcggcggggt ggtgggctac
      481 gccgcctacg acctcgtccg ctactacgaa aggcttccga gcctcaagcc cgacgacctc
      541 ggcctccccg acctcctctt cgtggagccc gaggtggtgg ccgtctttga ccacctgaag
      601 aacctcctcc acctcgtggc cccagggagg gaccccgagg aggcggaggc ccgcctcttt
      661 tgggcggaga ggcggctcaa gggccccttg cccggggtgc cgggggagag ggcggggggg
      721 agggcccgct tccaggcgga cttttcccgg gaggcctacc tggaggcggt gaggagggcc
      781 ctggactaca tccgggcggg ggacatcttc caggtggtcc tctccttgag gctctcctcc
      841 cccctcaccg tccacccctt cgccctctac cgggcgctga ggagcgtgaa cccgagcccc
      901 tacatgggct acctggacct gggggaggtg gtcttggtct cggcgagccc ggaaagcctc
      961 ctccgctcgg acggccgaag ggtggtcacc cggcccatcg cgggcacgag gccgaggggg
     1021 aaggacgagg aggaggacaa aaggcttgcc gaggagctcc ttagggacga gaaggaggtc
     1081 gcggagcacg tgatgcttct ggacctctcc cgcaacgaca tcggccgggt cgccgccttc
     1141 ggcacggtgc gggtcctcga gcccctccac gtggagcact actcccacgt gatgcacctg
     1201 gtctccacgg tggagggcat cttggccgag gggaagaccc ccctggacgc cctggccagc
     1261 gtgctgccca tggggacggt ctccggggcc ccgaagatcc gggccatgga gatcattgaa
     1321 gaactggagc cccaccgccg ggggccctac gggggaagct tcggctacct cgcctacgac
     1381 ggggccatgg acatggccct caccctgcgc accttcgtgg tggcgaaggg gtggatgcac
     1441 gtccaggcgg gggcggggat cgtggcggac tcggtgccgg agagggagta cgaggagtgc
     1501 tggaacaagg cgcgggcgct cctcaaggcg gtggagatgg cggaggcggg gctgtgatcc
     1561 caccccatgc cggcaggggc ccggtaagga ggcctggtag gcatggctgc taacggagcg
     1621 aaggggagaa aggttatgag ggtcttggtg gtggacaact acgacagctt cacctacaac
     1681 ctggtgcagt acctggggga gctcggggcg gagcccatcg tgtggcggaa cgaccgcttc
     1741 cggctggagg aggtggaggc cctggacccg gaccggatcc tcatcagccc ggggccttgc
     1801 accccctttg aggcggggct ttccgtcccc ttggtccagc gctacgcccc ccgctacccc
     1861 atcctggggg tctgcctcgg acaccaggcc atcggggcgg ccttcggggg gaaggtggtc
     1921 cccgcccccg tcctcatgca cggcaaggtg agccccatcc accacgacgg caccggggtc
     1981 ttccgggggc tagatagccc cttccccgcc acccgctacc actccctggc ggtggtggag
     2041 gtgccggagg ccctcgtggt gaacgcctgg gcggaggagg cgggggggcg gacggtgatg
     2101 ggcttccgcc accgggacta ccccacccac ggggtgcagt tccacccgga aagctacctt
     2161 acggaggcgg gtaaactcat cctcaagaac ttcctggagg acccatggac gcggtgaaga
     2221 aggccattct gggcgaggtt ttggaggaag aggaggccta cgaggtcatg cgggccctga
     2281 tggcggggga ggtctccccg gtgcgggcgg cggggctttt ggtggccttg agcctgaggg
//
LOCUS       XELGBBBLI    6777 bp ds-DNA             VRT       11-JUL-1990
DEFINITION  X.laevis beta-L-I globin gene, upstream region.
ACCESSION   M34470
KEYWORDS    beta-L-I.
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 6777)
  AUTHORS   Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
            Knoechel,W.
  TITLE     Sequence analysis of the upstream regions of Xenopus laevis beta-
            globin genes and arrangement of repetitive elements within the
            globin gene clusters
  JOURNAL   Mol. Biol. Rep. 14, 17-26 (1990)
  STANDARD  simple staff_review
BASE COUNT     2121 a   1247 c   1228 g   2181 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcaaag cttttttttt tattaaacag ttttattgta ttttcaaacg aaaaacaagc
       61 agaggtaaga cagtcaacag gttaacatta atgctgcgtg aagggtacta tacattgtgt
      121 ttacattaca acttgttgga cattgatatg tcacttctgt gaatttgaag ctttacattt
      181 aactaaaatt tgtgatggaa tgtctaacct gcatcccagt ccaaagaaat ttcaaagtag
      241 aagatgacat aatgttggta gagatagtga tgagcggatt ttttgccagg tatggatttt
      301 ggcaaaattc cgcgcttctt cgtctgcaat tttttttccc aaaactgcag caaaaatcca
      361 ccataaccaa aaagtcacaa agacaaaatt gtcgcagaga caagaaagtc acagtaagac
      421 ttgatgctcc tgattcactt gcactgacca caccactgta ttaaagggca gagaggggac
      481 tataatgcag agacccatgg tccaggctcc tatgaccatg gggtctgctg tatagctgag
      541 ctagttacac cagtaaggcc aaaacaaaat ggacttgcat gctggaataa tatgcaaaac
      601 tgctgcagtg cctgttttta atctgctggg taaagagtgc aattacaaat gttaggataa
      661 ttgcttatca tactctgctg catacactta ggggcccatt tacttagctc gagtgaagga
      721 atagaggaaa aaaacttaga atttcgattg ttttttttgg ctacttcgac ttcgacctta
      781 gacttcgaat cgaacgattc aaactaaaaa tcgtttgact attcgaccat tcgatagtca
      841 aagtactgtc tctttaagaa aaaactttga ccacctagtt cgccacatta aagctaccga
      901 agtcaatgtt agcctatggg gaaggtcccc atatgctttg ctagcttttt ttggtcaaaa
      961 ataaaccatt cgatcgatgg attaaaatcc ttcgaatcga tcgaacgaat aatgctaaat
     1021 cctttgactt cgatattcga actcgaagga tttaacttcg acagtcgaaa atcgagggtt
     1081 aattaaccct cgatattcga ccttaagtaa atttgcactt attattgcaa atatttgggt
     1141 ccatgacaga gtcatctgta tataatgtga aattacaaat actggtgcct cccctgtttt
     1201 actttgctct atgtgagaaa aataatggag tcagtgccat acatatcctt gtgtgtatgg
     1261 tggaaattgt agatgtcttg ggggcaaatt tactaaaggg cgaagtggct aacgctaggg
     1321 aaaattcgcc agcgttacgt caatttgcca cttcgacaat ttagtttacg gttaccatgg
     1381 cgaaaattcg ctagcaatgt aaatagacca gcgcaacttc acaccctaac gctggcgaag
     1441 tcaggatgcc cacattcccc ctacatttcc taacatatgg cacctgaatt atactagggg
     1501 cacatgtgta gggctttttt taagtttccc tgggcctctg tagtgttatg tatttgctgc
     1561 agcaatatac atgtatacaa atttccaatc ggtagcgtaa cctcgaaccg ctgatcgtaa
     1621 catcactagc gcaacttcgc aaatgattgg taacttgtgt gcaacttcgg atcttcgtga
     1681 atttgcgcag ccactgcgaa gctatgcctg gcgaagtgcg gcgaatgcaa gtctcgggat
     1741 ctccgcaggt aagtaaattt gccccatggt cagaggcaag gccagattat gtactaggtg
     1801 acctaagaat caatactgtc cattctaaaa gtgcaagttc ataagtgccc gcaactacag
     1861 aaacaatagg ggagaactaa caatctgttg taaacaacat tacaaggttg gctccctcat
     1921 tgtttatatt atagctgtat aactgtaatg atgagtacga tctaagatat aatgaatctt
     1981 attgcaggca aaacaatcct gttgattaat taatgcttaa attatcagaa attacaaaaa
     2041 cctcaggtcc tgtgcattct ggataacagg tcccatacct gtactaaaac atgggaccag
     2101 ggtgtctgca ttgatcaatc acctctttta tgattgtttt gggccatcac tctacttcaa
     2161 gatgctgatg atatattacc aataaatgtt atattatata cttaaaaatc ttaattgaat
     2221 taatatagtc aaatccttga tggagacaga cctagtagta tcatggataa taaaactagc
     2281 aacagcaagc attggcccga cttgccatct tggagtcttg aaggaatctt ccacctttga
     2341 ggaaaattgg agacagcttg tctatttttc aacctcttct aatatctaat tgaagaagat
     2401 ccttacatac tgtatgtggt ggaaaatgca tgtttcttta aagatatgct gattgttgca
     2461 ccaatctttg ctcaaagatc ttataagaaa tctttaagca tgactgtctg caactatgac
     2521 tattataaaa tcctttccat gtagagtttt catccttttt gtgggtcaaa ggctgcccct
     2581 cagcaatatc aggggaatga aattaaagtc acaaagagca aaacaattcg caccaatagg
     2641 actaaaaatc cacatctcgc aatgcaatat tgttccttaa actgttattg taattgcgaa
     2701 ttttaattgg ccattgcgga ttttaattgc gcactcttaa gaagtgcttg aagttgtcgt
     2761 aatcttttgg agcaaacata acgacttttt cattaagagg tttaattaca ttgacgcatt
     2821 ggcgcaaact ataaaatttg caaatggtct tccactgtcg gaagtggtcg caaaacagtt
     2881 tctgggctcg caaaagctat attaaatttg cgaaagcaaa atgtgttcgc gcaaaggtat
     2941 aacttttgca ttgcgaatag ttttccgtta gcaactttta ttgcattccc ctgtaaatat
     3001 ctaataagca tggcctcgag cccaaaagac acctttttag gtaaagaaat aaatggggat
     3061 ttcattctat aagtaattga atttgcacta aatattagta agtcggtttc ttgccctact
     3121 ccaaccaaac tcaagaactt tcatttatta aagcacaaga aactctaact cacatattaa
     3181 caaatagtta tagttggtca aattgtagct cagttaaggg tatattatat atttctgttt
     3241 gttcgggtgt ggtgggccag tttttgaaac agtcaactgt tttacttaca gcagatgtcc
     3301 aggtggcttg ccatatcttt gtcaaaaaca aatatattgt cagtattgtt ttttcaacat
     3361 ctgccttagt tagataagaa ttgacaatat agaaccaagg gtatctaaaa atgctgctct
     3421 gccttgcaat ctaatggtgg gtgggtccaa tgatttgtgt atttgcctga aaaaaaggga
     3481 atattgttct ctcccttacc ttttttccca aagaaattgt ttcttttaat gtgtccaaaa
     3541 tacagcaact tcagtcttgt gatttgagct tcaagtgaga tcagagacat gatttgctca
     3601 gggatccatt tgtttgtgtt cctttcttcc acagtattct caaaagtctt tactaaaacc
     3661 aaaatttatt agtacatttc cttgtactgc cacatttaca tctattaaga gtgacatcaa
     3721 atactataac tggacaattc ccaaagtaac ctcagtacat gttaaaatat cgttgacgtc
     3781 ttccatgtct cattctaagt gtcaatctgc tacttgacta taagattttt gttgtttata
     3841 agtgacccag taaggcaaaa gctatacata actagctgcc cacaaactgg ccaatataaa
     3901 gggagaagga aaattgttgt tcccactgga gttgttcccc tggttgggga aaaaatacta
     3961 ttttgtatac aaaatgctgt tctggggtca ccaggagaga gcttctgatg ttcagggcca
     4021 ggtagtgaca taagcctgag aataagactt aggggcacat tcaagctcgg gtgaatgaat
     4081 agagggaaaa aaactcctcg actatcgaat tggcgtaaat tcgcctgagt agaatgattc
     4141 aaatagattg agcgaaaaaa cgctgcgact attcgcccat cgatagtcga agtattgtct
     4201 cttttaaaaa tcatttgact gcctacttcg ccagataaaa cctaccgaat tgctttaaaa
     4261 gcctatggga aagtcccata ggcttctttt ctacgttttt gatcgaataa aaaggcattc
     4321 gatcgaatat tcgatcgaat gaaaatcctt cgattgaata ttcgatcgtg cccattcgat
     4381 tattcgccag cgcgtaaatt tgcccgaatt ccctattcga ttccattctc cagtcgaatt
     4441 tcgagggatt taacccctcg aaattcgacc cttgatacat ctgcccctta gtgtgccaac
     4501 ttgctcattg tgtgcatgtg tgtgacatgc cataaggctc tcttattaag cgcatgtatg
     4561 tgatgaaaca taaccatccc cactgggagc tccttcatgg tttagcagaa tagcgctcac
     4621 taccagcttt ttattcaaaa actgatattg tttccctcaa ccagagtata agctctatta
     4681 gcttgcacca tcagtggggg aatttttttt cccctattag gtttccttta agctgcaaac
     4741 ttgacctctc cttcccatct gcagtatatt gaccaatata agggaccaac cccacagtaa
     4801 gatatctatt gtgtatgttt caaaatccca ttaggtaagg acagtacatt tatgtggtcc
     4861 ctataggccc tcattatgat ctaattattg ggtcaatccg tcgtttttgg tacagtggtg
     4921 ccagccttga actagagtgg taaaagaggg ctttgttggc tctttgagca tatcatagag
     4981 ccttcagcaa aagttcactt tttaaatgta caccaatgaa tggagatttt tgaggccccc
     5041 aaaattgtat tgctgtagat cctgcaacag ccaatgatcc ctttatctgc tctgaaatct
     5101 tttttgtcgc tgctgctgct actggttaaa tacagtatag ttgaaaaaat ataggctttg
     5161 agaataaaac ctgatgttca tttgcttttt aattattact ttacatcccc tttaaaaata
     5221 tatacacatc actattccat gcattacact catttttaat tagacaaatc tataagaaat
     5281 tctgcgagat gacacttttc atgataagca ttttgtaaaa ttgtaatatg ttcagttttt
     5341 ttttttaaaa gttcaatgcc acactttatt tcaaaatgta ttaaggtgca gtaattatat
     5401 taaataaatg tattgtaggg tacatgaata tatgtaacat ttaaaatgtg tgtttatgca
     5461 cttctttcaa gtacagtaca tttgcactgt gatcaaatat taatttgaac tttaacagtc
     5521 ctatctctac acctttatct tgtcctgggg atcagtctgt tttttagtga tatcttgtaa
     5581 cacagaactt taaacaaaag ggctccgttt tgcacgtaga cctgtttgtg aatccatggc
     5641 aattctgcca cctaaagcat acataacatt tagcatcttt ttttggtgtt ttttagacag
     5701 atgatggtat agccatttgt gcaaataaaa tcagatattt tatcccaaat tatttgtgct
     5761 gttagttgta tagggtttca acaaaatatc ttatttatca tttagagcaa atacttatgt
     5821 gttacagtat ctgcaagtag tcaagtttga gcttaaaatt cccataattc ataattaagg
     5881 ggatggctta gtataaaaaa acgtggaaaa aaaaacgtgt acagttatgc ttttatattg
     5941 ccttgtaagt tcttttttat actattatta ttttaatgac cacgttttga attattgcat
     6001 ggatttatga aaaccagttt aattgcaaag aggctcctaa aaattattta ttataagtta
     6061 aaatttagta tatgcgtgca tgtatatgta acaatgcact ctcatatcta gtaaaaatca
     6121 aagttgaagt aaagtgtata actaagtttg acctttctca ggcattaatg atcccagagg
     6181 aaggccacac tatgtgacca aaacattgga ctacatttat taaatacatt taccttgatt
     6241 tcttcaacac aatttgaaag ttcctccatg agctaatata aatttataaa gagagagagt
     6301 gagtaaaaca tttttatcag aaaacagtgg cagagtaaat tctttcatac ttacaaaaga
     6361 gtgctactat gcgcaacatt aacttgacat ttttgaattg tacctaatgc aattcatgat
     6421 atttaaattg aatacattaa ttttaattat ttaattgtcc tgaaatctct acaggttcaa
     6481 aaaaataatt ccatttatta catttatttt gtacacttaa ttatctactg ttaagtgtca
     6541 caattgccct catttgatgt gggtttaagt ttcatgttgt tataaagaat caactttaca
     6601 atttaagaac tatatggcat tccacatata caaaagatat attagcttaa ggttaaaaat
     6661 ttattttgaa ggcaataggg tggggtggag gaaaaaaaat atgacacagc agaaatgcac
     6721 aatgggtgtg actcagcatg gccatataaa gcaaggccaa caactcaaag gaacagc
//
LOCUS       XELHBBBAI    2027 bp ds-DNA             VRT       11-JUL-1990
DEFINITION  X.laevis beta-A-I globin gene, upstream region.
ACCESSION   M34471
KEYWORDS    beta-A-I.
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2027)
  AUTHORS   Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
            Knoechel,W.
  TITLE     Sequence analysis of the upstream regions of Xenopus laevis beta-
            globin genes and arrangement of repetitive elements within the
            globin gene clusters
  JOURNAL   Mol. Biol. Rep. 14, 17-26 (1990)
  STANDARD  simple staff_review
BASE COUNT      792 a    266 c    284 g    685 t
ORIGIN      1 bp upstream of BglII site.
        1 agatcttgat acgttaactt tactagaaaa taatttaaac cccaatagcc tggttttgct
       61 tccaatatgg tttaattata ccttagtttt caggataatg gatctttctg taatttggat
      121 cttcatgcct taactgtacc agaaaatcat ttaaacttta aataaaccca atttgcttcc
      181 agtacagttt aattatatct tagtttggat aagtacaagg tactgtttta ttattacagt
      241 gaaaaaggta atcattttaa aaaaaaaata tatattattt ggataaaatg gagtctatgt
      301 gtgatggcct ttccgtaatt ctcggtttct ggcaaacgga tctcatacct gtaataggta
      361 tataaaaaac acacattaaa aaatactaca tatatattta tattcttttt tttttttaaa
      421 gtgtgtaaat tcatgtcttt aaaataataa aatgtattta tatatatata tatatatata
      481 tatatatata tatatatata tatatatata tatatatata tacttcaaca aaaaatttgc
      541 caaattcata catacaaaaa aaataaaata ataattttaa ataattgaat ctgtctagct
      601 gtttatattc tctgctctgc tggatctgac tcctgaaaaa atgtgcagaa gccatttgat
      661 ttacagagct ggaggagaat ggctacatta gtttaaaagc cagaaccagg agaggatgca
      721 ggcaacaaaa atggatacac acaaattaac gtctattaca attatattta caaataacct
      781 taaagccaac ttttttaaaa attattatat attgtaaagt tgcttagaaa ccaatttttt
      841 acttataggg agcaaaaaat agggagatcc tgtaaaacag aagctgcacc aaacatagat
      901 caagctatcg agctttccat acgtatacat ttatttgaaa ggcactgtta aggagccacg
      961 gtgctgtaca gtgcataaaa gtacaatata tatatataaa agtatacaca gggaagacaa
     1021 atcacacaat gaatatacac agagctcata tcagaacaaa cagcttaagt gctttgtggt
     1081 aagagacaca gtgggaagga ggtccctgtc ccgtagagct tacagtctta cagagctcag
     1141 ttcaatccat atgactcaac catttaatta ataaaataat ttgcctttta atcattaatt
     1201 aattccacac ttccatgtat aaatggaata tatatgtaag atttatatgt aatagctata
     1261 tatgtaagat ttgatatttt ttttgtagga ataaaatgaa aatcaggcaa ataaaaaaca
     1321 acatatatat gtttaaaaac ggtgttaatt tctatgcaac atgacatgaa aaagactttt
     1381 caatattttt acatatgtat acataagata tatgattgaa ccatttgaat aataaaatag
     1441 cttttgcctt accatcatta attattctac catactgtat gtaaaaagca ttgctatatg
     1501 taagatttga ttatattttg ttgtaggaat aaaatgaatt ccaggcatat aaaaacacat
     1561 ttataaaaaa catttataaa aaacactaca tatacatata catatatata tatatatata
     1621 tatatatata tatatatatg tatatatata agtttaaaaa gtgtgttaat ttataatgtc
     1681 tttctggaaa tagaatttca cacttcattg tatacaaaat tattaatatt tgtaatattt
     1741 gattatatta tgttgtaggg ataaaatgaa taccaggcat ataaaaacac actttaaaaa
     1801 aaaaaaaata catagataat aatttaataa tttgtattta ttttttctta atattctagc
     1861 tctgctgtaa taaaaaaaac atgcatctaa aagtggtgcc aaatgggagg gtacaaatgg
     1921 gctgggcaaa tgtaacgtgt gcttatccta gccaatcaac aggcagagtg gaaaggggca
     1981 gtgcatcctt acagctacat aaagtctgat ggatggagaa ttagagc
//
LOCUS       XELHBBBLII    910 bp ds-DNA             VRT       11-JUL-1990
DEFINITION  X.laevis beta-L-II globin gene, upstream region.
ACCESSION   M34472
KEYWORDS    beta-L-II.
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 910)
  AUTHORS   Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
            Knoechel,W.
  TITLE     Sequence analysis of the upstream regions of Xenopus laevis beta-
            globin genes and arrangement of repetitive elements within the
            globin gene clusters
  JOURNAL   Mol. Biol. Rep. 14, 17-26 (1990)
  STANDARD  simple staff_review
BASE COUNT      322 a    152 c    128 g    308 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcccttg tctggaaacc agttatccag agggctccaa attatggaaa ggccatctcc
       61 catagactca attttaatca aattattatt attttttttt tacaaattaa tgcctttttc
      121 aatgtattaa taaaacagta ccttgatccc aaaattggag gcaaaacaat ccagtttgtt
      181 ttatttaatg tttaaatatt ttttttaata ttgttttgat ccaaattaca gaaaggcccc
      241 ttatccagaa aaacctccat ttaggataag gataacaggt ccaatacatt cataccctgt
      301 acaaatctat gctatgttta attacttata aatagatcca catttcaatg gatatttcta
      361 gaatatcgta ataacggtat atacttgttc aaagacaaac acatttaatg acctatgcct
      421 aactggaata acagtcaagg aaatttaatg gaataatagg tatttcggag ctttccattt
      481 attaacccta caaacaacta gttgttgttt caggaaacag cagtagttct atttggctta
      541 catcttgaac aaaagcaaag ttgctatagt tttctttttc gtgtaaggaa agaaatgact
      601 tgtgtcttta tctctacatt aaaaatgtat ctgccacaca gaatactttc tttttttaac
      661 ttatctatag ataacgtatg tgcacccaaa ttgtagctgt gttacatcag cataattaag
      721 tgcacacatg aagaaaaaaa atgacagatt gacaaaatgt tatattatat ggtaaggtct
      781 cttggataat agcccttatc agtcataact ggttacaaat acagaaaaaa tgaggtgaca
      841 cagcataaat gatatgaata cgtcactaac ttacacccct ataaatcaca aggttaaaat
      901 attttttttt
//
LOCUS       CLONEUR      4835 bp ds-DNA             BCT       11-JUL-1990
DEFINITION  C.botulinum neurotoxin gene, complete cds.
ACCESSION   M30196
KEYWORDS    neurotoxin.
SOURCE      C.botulinum (strain 62A, subtype A) DNA.
  ORGANISM  Clostridium botulinum
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 4835)
  AUTHORS   Binz,T., Kurazono,H., Wille,M., Frevert,J., Wernars,K. and
            Niemann,H.
  TITLE     The complete sequence of the botulinum type A neurotoxin and its
            comparison with other Clostridial neurotoxins
  JOURNAL   J. Biol. Chem. 265, 9153-9158 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Niemann, 29-NOV-1989.
FEATURES       from  to/span     description
    pept        358     4248     neurotoxin
    mRNA        240  >  4835     neurotoxin mRNA
    signal      344      349     ribosome binding site
    site       4400     4432     potential terminator
BASE COUNT     1934 a    517 c    756 g   1628 t
ORIGIN      
        1 aagcttctaa atttaaatta ttaagtataa atccaaataa acaatatgtt caaaaacttg
       61 atgaggtaat aatttctgta ttagataata tggaaaaata tatagatata tctgaagata
      121 atagattgca actaatagat aacaaaaata acgcaaagaa gatgataatt agtaatgata
      181 tatttatttc caattgttta accctatctt ataacggtaa atatatatgt ttatctatga
      241 aagatgaaaa ccataattgg atgatatgta ataatgatat gtcaaagtat ttgtatttat
      301 ggtcatttaa ataattaata atttaattaa ttttaaatat tataagaggt gttaaatatg
      361 ccatttgtta ataaacaatt taattataaa gatcctgtaa atggtgttga tattgcttat
      421 ataaaaattc caaatgcagg acaaatgcaa ccagtaaaag cttttaaaat tcataataaa
      481 atatgggtta ttccagaaag agatacattt acaaatcctg aagaaggaga tttaaatcca
      541 ccaccagaag caaaacaagt tccagtttca tattatgatt caacatattt aagtacagat
      601 aatgaaaaag ataattattt aaagggagtt acaaaattat ttgagagaat ttattcaact
      661 gatcttggaa gaatgttgtt aacatcaata gtaaggggaa taccattttg gggtggaagt
      721 acaatagata cagaattaaa agttattgat actaattgta ttaatgtgat acaaccagat
      781 ggtagttata gatcagaaga acttaatcta gtaataatag gaccctcagc tgatattata
      841 cagtttgaat gtaaaagctt tggacatgaa gttttgaatc ttacgcgaaa tggttatggc
      901 tctactcaat acattagatt tagcccagat tttacatttg gttttgagga gtcacttgaa
      961 gttgatacaa atcctctttt aggtgcaggc aaatttgcta cagatccagc agtaacatta
     1021 gcacatgaac ttatacatgc tggacataga ttatatggaa tagcaattaa tccaaatagg
     1081 gtttttaaag taaatactaa tgcctattat gaaatgagtg ggttagaagt aagctttgag
     1141 gaacttagaa catttggggg acatgatgca aagtttatag atagtttaca ggaaaacgaa
     1201 tttcgtctat attattataa taagtttaaa gatatagcaa gtacacttaa taaagctaaa
     1261 tcaatagtag gtactactgc ttcattacag tatatgaaaa atgtttttaa agagaaatat
     1321 ctcctatctg aagatacatc tggaaaattt tcggtagata aattaaaatt tgataagtta
     1381 tacaaaatgt taacagagat ttacacagag gataattttg ttaagttttt taaagtactt
     1441 aacagaaaaa catatttgaa ttttgataaa gccgtattta agataaatat agtacctaag
     1501 gtaaattaca caatatatga tggatttaat ttaagaaata caaatttagc agcaaacttt
     1561 aatggtcaaa atacagaaat taataatatg aattttacta aactaaaaaa ttttactgga
     1621 ttgtttgaat tttataagtt gctatgtgta agagggataa taacttctaa aactaaatca
     1681 ttagataaag gatacaataa ggcattaaat gatttatgta tcaaagttaa taattgggac
     1741 ttgtttttta gtccttcaga agataatttt actaatgatc taaataaagg agaagaaatt
     1801 acatctgata ctaatataga agcagcagaa gaaaatatta gtttagattt aatacaacaa
     1861 tattatttaa cctttaattt tgataatgaa cctgaaaata tttcaataga aaatctttca
     1921 agtgacatta taggccaatt agaacttatg cctaatatag aaagatttcc taatggaaaa
     1981 aagtatgagt tagataaata tactatgttc cattatcttc gtgctcaaga atttgaacat
     2041 ggtaaatcta ggattgcttt aacaaattct gttaacgaag cattattaaa tcctagtcgt
     2101 gtttatacat ttttttcttc agactatgta aagaaagtta ataaagctac ggaggcagct
     2161 atgtttttag gctgggtaga acaattagta tatgatttta ccgatgaaac tagcgaagta
     2221 agtactacgg ataaaattgc ggatataact ataattattc catatatagg acctgcttta
     2281 aatataggta atatgttata taaagatgat tttgtaggtg ctttaatatt ttcaggagct
     2341 gttattctgt tagaatttat accagagatt gcaatacctg tattaggtac ttttgcactt
     2401 gtatcatata ttgcgaataa ggttctaacc gttcaaacaa tagataatgc tttaagtaaa
     2461 agaaatgaaa aatgggatga ggtctataaa tatatagtaa caaattggtt agcaaaggtt
     2521 aatacacaga ttgatctaat aagaaaaaaa atgaaagaag ctttagaaaa tcaagcagaa
     2581 gcaacaaagg ctataataaa ctatcagtat aatcaatata ctgaggaaga gaaaaataat
     2641 attaatttta atattgatga tttaagttcg aaacttaatg agtctataaa taaagctatg
     2701 attaatataa ataaattttt gaatcaatgc tctgtttcat atttaatgaa ttctatgatc
     2761 ccttatggtg ttaaacggtt agaagatttt gatgctagtc ttaaagatgc attattaaag
     2821 tatatatatg ataatagagg aactttaatt ggtcaagtag atagattaaa agataaagtt
     2881 aataatacac ttagtacaga tatacctttt cagctttcca aatacgtaga taatcaaaga
     2941 ttattatcta catttactga atatattaag aatattatta atacttctat attgaattta
     3001 agatatgaaa gtaatcattt aatagactta tctaggtatg catcaaaaat aaatattggt
     3061 agtaaagtaa attttgatcc aatagataaa aatcaaattc aattatttaa tttagaaagt
     3121 agtaaaattg aggtaatttt aaaaaatgct attgtatata atagtatgta tgaaaatttt
     3181 agtactagct tttggataag aattcctaag tattttaaca gtataagtct aaataatgaa
     3241 tatacaataa taaattgtat ggaaaataat tcaggatgga aagtatcact taattatggt
     3301 gaaataatct ggactttaca ggatactcag gaaataaaac aaagagtagt ttttaaatac
     3361 agtcaaatga ttaatatatc agattatata aacagatgga tttttgtaac tatcactaat
     3421 aatagattaa ataactctaa aatttatata aatggaagat taatagatca aaaaccaatt
     3481 tcaaatttag gtaatattca tgctagtaat aatataatgt ttaaattaga tggttgtaga
     3541 gatacacata gatatatttg gataaaatat tttaatcttt ttgataagga attaaatgaa
     3601 aaagaaatca aagatttata tgataatcaa tcaaattcag gtattttaaa agacttttgg
     3661 ggtgattatt tacaatatga taaaccatac tatatgttaa atttatatga tccaaataaa
     3721 tatgtcgatg taaataatgt aggtattaga ggttatatgt atcttaaagg gcctagaggt
     3781 agcgtaatga ctacaaacat ttatttaaat tcaagtttgt atagggggac aaaatttatt
     3841 ataaaaaaat atgcttctgg aaataaagat aatattgtta gaaataatga tcgtgtatat
     3901 attaatgtag tagttaaaaa taaagaatat aggttagcta ctaatgcatc acaggcaggc
     3961 gtagaaaaaa tactaagtgc attagaaata cctgatgtag gaaatctaag tcaagtagta
     4021 gtaatgaagt caaaaaatga tcaaggaata acaaataaat gcaaaatgaa tttacaagat
     4081 aataatggga atgatatagg ctttatagga tttcatcagt ttaataatat agctaaacta
     4141 gtagcaagta attggtataa tagacaaata gaaagatcta gtaggacttt gggttgctca
     4201 tgggaattta ttcctgtaga tgatggatgg ggagaaaggc cactgtaatt aatctcaaac
     4261 tacatgagtc tgtcaagaat tttctgtaaa catccataaa aattttaaaa ttaatatgtt
     4321 taagaataac tagatatgag tattgtttga actgcccctg tcaagtagac aggtaaaaaa
     4381 ataaaaatta agatactatg gtctgatttc gatattctat cggagtcaga ccttttaact
     4441 tttcttgtat cctttttgta ttgtaaaact ctatgtattc atcaattgca agttccaatt
     4501 agtcaaaatt atgaaacttt ctaagataat acatttctga ttttataatt tcccaaaatc
     4561 cttccatagg accattatca atacatctac caactcgaga catactttga gttgcgccta
     4621 tctcattaag tttattcttg aaagatttac ttgtatattg aaaaccgcta tcactgtgaa
     4681 aaagtggact agcatcagga ttggaggtaa ctgctttatc aaaggtttca aagacaagga
     4741 cgttgttatt tgattttcca agtacatagg aaataatgct attatcatgc aaatcaagta
     4801 tttcactcaa gtacgccttt gtttcgtctg ttaac
//

GenBank-Updates@genbank.bio.net (07/12/90)

LOCUS       BOVB1A        781 bp ss-mRNA            MAM       12-JUL-1990
DEFINITION  Cow beta-crystallin (p-Beta 25/23) mRNA, complete cds.
ACCESSION   M33010
KEYWORDS    beta-crystallin; crystallin.
SOURCE      Cow lens cortex, cDNA to mRNA, clone p-Beta 25/23.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 781)
  AUTHORS   Gorin,M.B. and Horwitz,J.
  TITLE     Cloning and characterization of a cow beta crystallin cDNA
  JOURNAL   Curr. Eye Res. 3, 939-948 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          6      653     beta-crystallin
BASE COUNT      217 a    185 c    187 g    192 t
ORIGIN      
        1 tccagatgga gacccagact gtgcagcagg agctgaaatc ccttccaacc accaagatgg
       61 ctcaaactaa ccccatgccg gggtctgtgg ggccatggaa gattaccatc tatgaccagg
      121 agaacttcca gggcaagaga atggaattca ccagctcctg cccaaatgtc tctgagcgca
      181 attttgacaa cgtccggtct ctcaaggtgg aatgtggcgc ctgggttggt tatgagcata
      241 ccagcttctg tgggcaacag tttgtcctgg agagaggaga gtaccctcgc tgggatgcct
      301 ggagcgggag taatgcctat cacattgagc gcctcatgtc cttccgcccc atctgttcag
      361 ctaatcataa ggagtctaag attacaattt ttgagaaaga aaatttcatt ggacgccaat
      421 gggaaatctg tgatgactac ccctccttgc aagccatggg ttggcccaac aacgaagttg
      481 gctctatgaa gatacaatgt ggagcctggg tttgctacca gtatcctggg taccgtggct
      541 atcagtatat cttggaatgt gaccatcatg gaggagacta caaacactgg agagagtggg
      601 gttctcatgc ccagacttcc cagattcaat ccattcgccg tatccaacag tagtggatta
      661 aaagctccaa gtaagaattc ctcaagcatg agaccttcct aaacaatcta gaataaaata
      721 tatgttctgc tgatattgct tccaaatgtt agctgctgaa atccacaata aatgtcatta
      781 a
//
LOCUS       CFICENB       439 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  C.fimi endoglucanase B (cenB) gene, 5' end.
ACCESSION   M33026
KEYWORDS    endoglucanase; endoglucanase B.
SOURCE      C.fimi DNA.
  ORGANISM  Cellulomonas fimi
            Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE   1  (bases 1 to 439)
  AUTHORS   Owolabi,J.B., Beguin,P., Kilburn,D.G., Miller,R.C.Jr. and
            Warren,R.A.J.
  TITLE     Expression in Escherichia coli of the Cellulomonas fimi structural
            gene for endoglucanase B
  JOURNAL   Appl. Environ. Microbiol. 54, 518-523 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        275  >   439     endoglucanase B (cenB) precursor
    sigp        275      373     endoglucanase B signal peptide
    matp        374  >   436     endoglucanase B
BASE COUNT       58 a    173 c    154 g     54 t
ORIGIN      
        1 ggatcccgcg cccggcgcga gcccgcaacc cacgcgccca cggatcgggc ctcacgagcc
       61 cgacgttggc ggccgggccg gggggcgacc tcgagaccga ggagcccccg cgtgaggcga
      121 cgttggccgc gcacgccgct ggtgagcggg ctgaatcgtt tagggcgttg acctgcggac
      181 ggacccgtct ggacgatgcg ccaggcgtcg tgcgggtgcg actgcggaca gcacgggtcg
      241 ccgaccacca ctcccgtgcc cggaagagga ccccatgctc cgccaagtcc cacgcacgct
      301 cgtcgcgggt ggctccgccc tcgccgtcgc cgtcggggtg ctcgtcgccc cgctcgcgac
      361 cggcgcggcc gccgcgccca cctacaacta cgccgaggcc ctgcagaagt cgatgttctt
      421 ctaccaggcg cacggctcc
//
LOCUS       RATLACTAS     250 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Rat lactase-phlorizin hydrolase mRNA, partial cds.
ACCESSION   M34730
KEYWORDS    lactase; lactase-phlorizin hydrolase.
SOURCE      Rat (strain Sprague-Dawley CD) newborn, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 250)
  AUTHORS   Bueller,H.A., Kothe,M.J.C., Goldman,D.A., Grubman,S.A., Sasak,W.V.,
            Matsudaira,P.T., Montgomery,R.K. and Grand,R.J.
  TITLE     Coordinate expression of lactate-phlorizin hydrolase mRNA and
            enzyme levels in rat intestine during development
  JOURNAL   J. Biol. Chem. 265, 6978-6983 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   250     lactase-phlorizin hydrolase (AA at 2)
BASE COUNT       66 a     62 c     66 g     56 t
ORIGIN      
        1 agaaaggatc ttctaccaca aaacctatat caacgaggct ctgaaagcct acaagctgga
       61 tggtgtggac cttcgagggt actctgcctg gacgctgatg gacgacttcg agtggctgct
      121 tggctacacc atgagatttg gattgtatca cgttgacttt aatcatgtga gcagacctcg
      181 cacagcaaga gcctcagcca gatactatgc agaggtcatt gccaacaatg gcatgcccct
      241 ggccgggaag
//
LOCUS       BOVARRB      1945 bp ss-mRNA            MAM       12-JUL-1990
DEFINITION  Cow beta-arrestin mRNA, complete cds.
ACCESSION   M33601
KEYWORDS    beta-arrestin; inhibitor.
SOURCE      Cow adult brain cortex, cDNA to mRNA, clone pBARRESTIN-1/1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (sites)
  AUTHORS   Lohse,M.J., Benovic,J.L., Codina,J., Caron,M.G. and Lefkowitz,R.J.
  TITLE     Beta-arrestin: A protein that regulates beta-adrenergic receptor
            function
  JOURNAL   Science 248, 1547-1550 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1945; for [1])
  AUTHORS   Lohse,M.J., Benovic,J.L., Codina,J., Caron,M.G. and Lefkowitz,R.J.
  JOURNAL   Unpublished (1990) See COMMENT for author address.
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.J.Lohse, 06-APR-1990, for release after publication.
               Author address [1]:
               M.J.Lohse
               Howard Hughes Medical Institute, Box 3821,
               Duke University Medical center
               Durham, NC 27710
FEATURES       from  to/span     description
    pept         97     1353     beta-arrestin
BASE COUNT      419 a    590 c    556 g    380 t
ORIGIN      
        1 gttccgggaa ccggctggcc cgcgcccctc ctgtcggccg gggattttcc agcctgggcg
       61 ctgacgccgc ggacctcccc gcggccgcct cggaccatgg gcgacaaagg gacgcgggtg
      121 ttcaagaagg cgagccccaa tggaaagctc accgtctatc tgggaaagcg ggactttgtg
      181 gaccacatcg acctcgtgga gcccgtggat ggagtggttc ttgtggatcc ggagtatctc
      241 aaggagagga gagtctatgt gacgctgacc tgcgccttcc gctacggccg ggaggacctg
      301 gatgtcctgg gcctgacctt tcgcaaggac ctgtttgtgg ccaacgtgca gtctttcccg
      361 ccggcccctg aggacaagaa gcccctgacg cggctgcagg agcgcctcat caagaagctg
      421 ggcgagcatg cctacccttt cacctttgag atccctccga acctcccatg ctctgtgact
      481 ttgcagccgg gacctgaaga tacagggaag gcctgcggtg tggactacga agtgaaagcc
      541 ttctgtgcgg agaacctgga ggagaagatc cacaagcgga attctgtgcg cctggtcatc
      601 cggaaggttc agtatgcccc agagaggcct ggcccccagc ccacggccga gaccaccagg
      661 cagttcctca tgtcagacaa gcccttgcat ctggaggcct ccctggacaa ggagatctac
      721 taccacggag aacccatcag tgtcaacgtc catgtcacca acaacaccaa caagacggtg
      781 aagaagatca agatctcggt gcgccagtat gcagacatct gtctgttcaa cacagcccag
      841 tacaagtgcc ctgtggccat ggaagaggct gatgacacag tggcacccag ctctacgttc
      901 tgcaaggtct acacgctgac ccccttcctg gccaacaatc gagagaagcg gggcctcgcc
      961 ctggacggga agctcaaaca cgaggacacg aacctggcct ccagcaccct gttgagggaa
     1021 ggagccaacc gggagatcct gggcatcatt gtttcctaca aagtgaaagt gaagctggtg
     1081 gtgtctcgtg gcggcctgtt gggagatctt gcatccagtg atgtggccgt ggaactgcct
     1141 ttcaccctaa tgcaccccaa gcccaaagag gaacccccac accgggaagt tccagagcac
     1201 gagacgccgg tagataccaa tctcatagaa cttgacacca acgatgacga cattgtgttt
     1261 gaggactttg cccgccagag actaaaaggc atgaaggatg acaaggagga agaggaggat
     1321 ggtaccggct ctccgcggct caacgacaga tagactgggg ctgccctccc tccgggcagc
     1381 tccaggtcca ctctcatgca ctaggatgct tgttcgtctt cttcctgtcc tggctccccc
     1441 tcccctttgt tcttccagtt tctaccaggg ggccccagcg gtcttccagg tcacggtggc
     1501 gaacccctgg cctcaggatt ggcccccatc accatgccaa cagggccaca ggcagcaccc
     1561 tcaccctctc actgcatcac ttctccattc cccctctttt cctattgacc cccagacagg
     1621 ccagcacagc tctggccttc ggatttgact cgggatgggg agcagaaagg ggaagatggg
     1681 gcacaagggc ttggcgaggt ggggatgggg gctcaagacg cgtgagagga tgtggccact
     1741 gtcccaggtg atgaatacag ttctggcagc taaaacatga ccgctttgaa ggccaccctc
     1801 ctctggctgg gaggggacag acccatggat agattgtcca cacagatttg ctcgaagttc
     1861 agacctacca aacagctgtc ttcttcttcc ctcgtccctg ccccctgttc ctctgtggct
     1921 gacagtgacc ttggtgaagg tttgt
//
LOCUS       BBVRNA3       389 bp ss-RNA             VRL       12-JUL-1990
DEFINITION  Black beetle virus RNA3 proteins B1 and B2 genes, complete cds.
ACCESSION   M33065
KEYWORDS    .
SOURCE      Black beetle virus.
  ORGANISM  Black beetle virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Nodaviridae.
REFERENCE   1  (bases 1 to 389)
  AUTHORS   Guarino,L.A., Ghosh,A., Dasmahapatra,B., Dasgupta,R. and
            Kaesberg,P.
  TITLE     Sequence of the black beetle virus subgenomic RNA and its location
            in the viral genome
  JOURNAL   Virology 139, 199-203 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         10      318     B1 protein
    pept         20      340     B2 protein
BASE COUNT      118 a    120 c     98 g     53 t
ORIGIN      
        1 tcgttaccaa tgttaaacga tgccaagcaa actcgcgcta atccaggaac ttcccgaccc
       61 cattcaaacg gcggtggaag cagccatggg aatgagctac caagacgcac cgaacaacgt
      121 gcgcagggac ctcgacaacc tgcacgcttg cctaaacaag gcaaaactaa cggtaagtcg
      181 gatggtaaca tcactgctgg agaaacccag cgtggtggca tacctagagg gaaaggcccc
      241 cgaggaggca aaaccaacac tcgaagaacg cctccgaaag ctggagctca gccacagcct
      301 tccaacaacc ggaagtgacc ccccacccgc aaaactgtag gtggctctta ggagcaccca
      361 cacccgttct agcccgaaag ggcagaggt
//
LOCUS       MUSURNAA       54 bp ss-uRNA            ROD       12-JUL-1990
DEFINITION  Mouse small nuclear RNA.
ACCESSION   M34036
KEYWORDS    small nuclear RNA.
SOURCE      Mouse plasmacytoma cell line P301 small nuclear RNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 54)
  AUTHORS   Chernokhvostov,V.V. and Georgiev,G.P.
  TITLE     Complexes of nuclear matrix DNA with proteins tightly bound to DNA
            contain a specific small-size RNA of a novel type
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by V.V.Chernokhvostov, 04-MAY-1990.
            
            or address:V.V.Chernokhvostov . of Molecular Biology, USSR Acad.
            Sci.
               Vavilova str., 32
               117984, Moscow
               USSR
FEATURES       from  to/span     description
    uRNA          1       54     small nuclear RNA
BASE COUNT       19 a     12 c     13 g     10 t
ORIGIN      
        1 agaagacacc ctgatttaac ttctggtatc ggaagatgca agagccgaac caga
//
LOCUS       RATCYP2A1   18820 bp ds-DNA             ROD       12-JUL-1990
DEFINITION  Rat hepatic steroid hydroxylase IIA1 (CYP2A1) gene, complete cds.
ACCESSION   M33312
KEYWORDS    B2 repetitive sequence; LINE repetitive sequence; cytochrome P450;
            hepatic steroid hydroxylase IIA1.
SOURCE      Rat (strain Sprague Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 18820)
  AUTHORS   Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J.
  TITLE     Structure and in vitro transcription of the rat CYP2A1 and CYP2A2
            genes and regional localization of the CYP2A gene subfamily on
            mouse chromosome 7
  JOURNAL   Biochemistry 29, 1329-1341 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       4573     4749     hepatic steroid hydroxylase IIA1 (CYP2A1),
                                 exon 1
               5050     5212     hepatic steroid hydroxylase IIA1, exon 2
               7638     7787     hepatic steroid hydroxylase IIA1, exon 3
               8005     8165     hepatic steroid hydroxylase IIA1, exon 4
               9386     9562     hepatic steroid hydroxylase IIA1, exon 5
              12760    12898     hepatic steroid hydroxylase IIA1, exon 6
              13340    13527     hepatic steroid hydroxylase IIA1, exon 7
              13960    14101     hepatic steroid hydroxylase IIA1, exon 8
              17010    17191     hepatic steroid hydroxylase IIA1, exon 9
    pre-msg    4545    17380     CYP2A1 mRNA and introns
    IVS        4750     5049     CYP2A1 intron A
    IVS        5213     7637     CYP2A1 intron B
    IVS        7788     8004     CYP2A1 intron C
    IVS        8166     9385     CYP2A1 intron D
    IVS        9563    12759     CYP2A1 intron E
    IVS       12899    13339     CYP2A1 intron F
    IVS       13528    13959     CYP2A1 intron G
    IVS       14102    17009     CYP2A1 intron H
    rpt        3750     4026     B2 repeat
BASE COUNT     5274 a   4402 c   3695 g   5449 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattctagt acggtagccc tggctttcat caactagtta gtgccaaata tttgagaaaa
       61 gttacaggtt caagctaata aaagttgcag agagtataaa agaatgcaga ttagacaaga
      121 aaaaattaat tagagccctt ctagccaaca aagcctcaga tccaggagaa aagactacca
      181 tagaaatggc caaaggctta tttatcaaag aaactgggct cagtggcagc aggatgacca
      241 ccttgcctgt gtttattgtt gccacagcac tgttggataa agatgcaaat aaattaactt
      301 tgggacagaa gttgatcatg actgctcctc cccctgcccc cgcaattctg attgaggctc
      361 agtaatgcct acatgcttca ttatcacact ttactaatca gccctggctg agatattttc
      421 cagccacctg tttccctgaa ccctgcgact cttccaccca accctgactt gggctgtcca
      481 cttcatcaat tcgatgaggt tcaggcccag atacacaata ccagacctta cttgaggaac
      541 tctcatccat cagaaacaga gcatacctgg ttcacggaca gaagtagctt catccataag
      601 ggtcagagga gaacaggggc agcaataaca acagaaggaa aggtaatctg gactcagtct
      661 cttccttcca ggcacttcaa ctcaaaagaa caaactaaga acattaacac aagtcctcat
      721 catgggaaaa ggactggctg ttagcatctg cagggacagc cagatatgca tttacaacta
      781 ctcatgtgca cggagccatt tacaaggaac aacagctcct aacagcagaa ggaaaaactg
      841 tcagaaataa agataaaact ctacaggaca tcaaaaagaa accaatggct agagataaca
      901 gtctgactga taaggcttct aagaaggtag ccttaaagga aacagccaac actagattgg
      961 ccattgtcct acctgaacca cctagagtaa ctgataaatg cagaaaaaga aattaaatgg
     1021 gctggtgatg gtcaactgaa ggtaagcgaa tattgcccac ttctcagaaa gatccacagt
     1081 cactcacttg ggagtaaaat gaatgacaaa ctttaaagtt tgccagctga ccaacactca
     1141 caggaagccc aaacatccaa attcctgact gcgagttaag agacttggag cctactggga
     1201 aaattgattt ttttcagaaa tcaagcaaga aagatatggc tcaaaatatc tgctgatatt
     1261 tgtagacatt ttttttcgag atagatggat agatgtacta ggagagatct ttccgatgtt
     1321 tggagcacct aaggtaacag gatcagacaa tgggcctgtc ttcatatctc aggtaagtca
     1381 gggacttgct aagatcttgg ggactaattg gaaactccat tgttcatatc atccccagag
     1441 ttcagggcag gtagaaagga tgaatagaac tctaaaagag accttaacag aattagcctt
     1501 ggagactggt ggggactggg tgaggctctt tccctttgcc ctatattagg tgtacatggc
     1561 attctagctc ccattgtatc tagcctacag ttggtagcta ttacagaact gaaaaatgat
     1621 aatttaagat ttaaggtcag agctaccaaa taggctcatg aatttgtttg ggcctaaatt
     1681 atgtaccttc tgtgaagcag gcctggttcc agaaccacac aagtcaaaag agactgggtc
     1741 tctatgaaga gatttcacca aggtgcgact aaacccatgt gaaaatggca attcatcatc
     1801 ctgttgatca tgatcaccac ctgggtgtac aacaaccaca ccagaccagt tcctccaatg
     1861 aagaactctg cctggctcca gctgtaccaa aatagagggt tcaaaaggac accaagccct
     1921 tcaagttaaa gttgactcag tctcagtcct gagtctcttg cccctgctaa ctctatgtct
     1981 atatatactg tatgtcttag atccccccct gttaggaagg taccctagct ggatccttga
     2041 taattttact tttatttctg acttttggcc cctgtatttt aagttgctta gtagtttata
     2101 agagaattca gtcaagttaa ttatcttaag gcaacactat ctacagctgg aagcagggaa
     2161 gcaagcatat gagttagaag actataagct tcaagatcaa agctatgcta aaagaaaagg
     2221 ggggaatgaa aagccagagt tggggtcaat ctgaggccaa tgagaaaaac ccaccattaa
     2281 catccaagca cagaacgacc cttctcttcc agaaagagta aagctagttt agttcctgga
     2341 acagctacaa gccaaactgt tgaacaaagc cacatgtaac tccccatcca acctccagaa
     2401 agtcccagaa tggcacactg accacaagtc attttggagg ttacttcacc ccactaatag
     2461 tagtactctt cctagttact gttgtgcaaa ttctgcccca attgtttgta aggtatatac
     2521 agacccagtt agagtctgct cagggtcttc tctttctgaa agggagtcaa ccccgacgca
     2581 ttaaaataaa gctagtcttg gttttgcatt gattagcacc tccttgagtc tcactcaagg
     2641 ggtcccggaa agggtcagat tagacctcat atacctctga gcacagcttg tatggtgact
     2701 aagatacagg atacccacag gctgggatta gagagtttaa accaaagatc tttcatccat
     2761 gtgctccatg cctgccctgt gcccaggggg aaacatggat tctaattaca gaagcctccc
     2821 taaggatctt aatgggaacc aagtaggaga cttttccagt tagaagcctt ctgacaactg
     2881 gggtttcccc atattggtag tttaggttgt tatttcacaa aactacaatt ccttcaccaa
     2941 ctggagttct gagttattct cctctagtct ggaaaatgat ctgctaaaat atagctgtgg
     3001 ttttctaccc ttttcaaagc catacataga cagggaaggt tgcccatcct tccctgaagt
     3061 tgaagatcct tttagaagtc aatgcaccca tcagtggtga taaatgcctt taatcccagt
     3121 atgcagcaaa ctctgtgagt ttgacgccaa attggtctac agagtgtgtt ctagaacagt
     3181 cagagctaaa gagagaaaca ctctgtggaa aagaaagaaa gaaaaaagaa aggaaggaaa
     3241 gaaggaagga aggaaggaag gaaggaagga aggaaggaag gaaggaaaaa gaaaggaagg
     3301 aaggaaaaag gtacagagag agggaaagag ggagggagaa aaataacata tatgaagaca
     3361 cagtacagga ccaatctggg ctcaggtgcc cactttagtc tcctactgga attttcatcc
     3421 acttgtacca gaaactcagc acccacagat ccttcttgcc atgtgacctt ccagtccata
     3481 gtttggaatc tttcctgttt tccttactaa tatttttctc ctaataaaaa gactaaacca
     3541 tctagactct aggactccag agatgactct gtgggtaaga gcacttgttg ctcttgtaga
     3601 agacccaggt tttattccta gaacccacat ggtggcttac aaccatgtgt gaccccattt
     3661 ccaaaggatt cctctaaatc ttttggcatc tttggacagt gtgcccaatt gttacccaga
     3721 cttaaatgga agagaaacct tcatattaca taaaaaatta acacaataag atccataaag
     3781 aaatatataa aagaagataa tcttttttaa aaaggataca ttggatacat tgccaggcct
     3841 ggagagatgg ctcagtggtt aagagcactg acagctcttc cagaggtcct gagttcaaat
     3901 tcccagcaac cacatggtga ctcacaatta tctgtaatgg gattcaatgc ccactactgg
     3961 tgtgtctgaa gacagtgaca gtgtactcat atacatgaaa gaaataaatg aatcttgaga
     4021 acaacattta atgcctgaag ccatgtttcc tgtattgttc cagtcaaatc taagaatgtg
     4081 aattctatca cagaccacaa catttacatc tatgagggct ttcttcatga gctcaaccat
     4141 acaaatatag atttttagtt ctagatttga tctggtggac ccagaaatgg acagcctcct
     4201 gataatagcc acagtcccca atacagcacc aatctcatca tagcatttga agagtgtatt
     4261 atatgttggc ctgttcacct tgtccactaa aaccctcagc ttggtccacc aaagcctctt
     4321 tgactgcatt gtatcaacac accaaagcac accgaggctt taagaatttt gaagtaagcc
     4381 tgccacccag aggtcttcct atttgcccat gttgtgggtg ttgcaacaaa gacagggtca
     4441 gtgttaggag ataggattgg agggtaaaag actcaactag acaaacagga gcaaaggcca
     4501 tcctgtgtcc ctgggagtat aaaggtacta tctcagcctt ggctatcagc ctgtcaatcc
     4561 tcactggcca ctatgctgga cacaggactg cttctggtgg tcatattggc ctccctgagc
     4621 gtcatgctct tggtgtccct ctggcagcag aaaatcaggg ggagattgcc tccaggaccc
     4681 actcctttgc ctttcattgg aaattatctg cagctgaata caaaagacgt atacagttcc
     4741 atcacacagg tatcactgga tgaggggatg gatgggacat gggagcacaa gaggctgtga
     4801 tgttttgcat gttttgtggc agaagattca tagaggaatc caaagtcttg tattagtgga
     4861 gtttagaaag ataaggagct atttcaagtc tttggtttgt tgtttgttgt ttgttttgtt
     4921 tgttgtttgt ttctttgttt ctttgttttt tcaatcattt atttgtagag taacacataa
     4981 tctgacctct gtgtactggt ccagttcagt gaataagtca tctaacagcc cccatctacc
     5041 ccacatcagc tcagtgagcg ctatggtcct gtgttcacca tccaccttgg gcctcgccgg
     5101 gttgtggtgc tttatggata cgatgcagtc aaagaggctt tggtggacca agctgaggag
     5161 ttcagtggac gaggcgaaca ggctacctac aatacactct tcaaaggcta tggtgaggag
     5221 gataccacat tggggaacat gcccaaggac atttgttggc gtcatttaag tagccttcat
     5281 actaactcat ctctccctca aggctgtaca gagttctctg aatttctctc catatccatg
     5341 ttgaatgttg gctctcattg tgaccctccc tagcatttct gagattgaaa acagactttt
     5401 gcaaattctg tgggttcttt cttccatcct tctctaccgt tttcttccgc cctttctacc
     5461 acctatcact agataggaaa gaaaaggaga tagaggtgaa aggggacatt actgttagat
     5521 tatttcctgc tgattaggag tgacgagctc cttagggaaa gttttatctt ctctgtcagg
     5581 atatctaatt tcttcttgtt gttatttctt tacataagac tacttaacaa atcacaagca
     5641 acagcaacta accaatagcc aaaaccaatt tctcagggtc cttgcattta cacaaccttg
     5701 aggagtccca gtatcctgag tgtcacacac tctcagaaac tatctgcagc tggcaaaatc
     5761 ataacctcct gctttggaca acctgaacca gccccatatg ccatacctgg gagtaaacag
     5821 aaacatattt ctataatagt tctgtatttt tcaaagaaat caaatttctt actacatctg
     5881 gccattgctg ctcttctctc tctctctctc tctctctctc tctctctctc tctcacacac
     5941 acacacacac acacacacac aaacacacac acacaaacac acacacacaa acacacacac
     6001 acaaacacac acacacacaa acacacacac acaaacacac acacacgcac gcacacacac
     6061 acaacctctc ggcattctcc tagatggatg actccttttt aatttagctg atatttttat
     6121 ccttcttaaa catttatcca cacacagagc atcagttgca ggtctcaggc attcactcct
     6181 gatgcctctg gattggtttt ttagattctt tgttcttact tttccatcta tgggtgctgg
     6241 gctctcaagc acatctctgc acagtgtgtg tgcctggtgc ccatggaagc aaaaagatgg
     6301 agtcagatct cctgaactcc aggggttccc tgagttccag ggttatgagc tgccaggtga
     6361 gtgctggggt acaagcacag gtcctctgca aggtcagcca gtgctcttga gtgcagagcc
     6421 agctttgctg ccccccactg cctatatttt taaatgctgt tttacatact ccatgtgttg
     6481 tccctaagat gtgtataatg cttatagaac gtcacagtct ggtaagtgct ggccaaagct
     6541 acagaagtat aaaatggcct tgaacagcaa aacactggtt ataagcaaga aaggtcaaaa
     6601 taaagagaaa atccacaaag agccaaatat ctttataaca ttaattctgt agttaaaatt
     6661 taacacagag agtgtatctc gttccttgaa gaactgaagg acacacaaat gactacttct
     6721 acctagggtc aaaatatagc ggtgactaca gctcaagaca cacaaaacca gagtcaagaa
     6781 tcagggagtg gtaataaaat aataaaaaat cctggctcag ggtttcttcc cacctttccc
     6841 tgatgaaagg cacacacagc ctttatattt tagtctgcct tatgcagcac aatagctggg
     6901 cagctgccta ccctccatgc tgttagaatc cattttccta ttgaaagccc caagttaata
     6961 ctttacaagt ttctttatac catatttgct attcttgacc caactgagga gcccttttgg
     7021 ccacactgtc ttggcccata gcacatggtg tctctccttc taccttctgc tctttcttct
     7081 tccatggctt ccacagaggc tcctcaatcc cattctcctt cctcatgctc tctagcccca
     7141 gaaaactaag caccacaagt ctcttctccc agctattagc tgctgacatc tttatttacc
     7201 aatcagaatg aactgcgggc aggatcactc agacaaacta cagactccaa atcttagagg
     7261 ccaacactta ctgttatagg aaacaataaa agacaaaaac ctcaacacca gggtatgttt
     7321 ctgggtaggc tgtccttgct ttaatgggga tttgctgttt tcagaaaatg ctcaatattg
     7381 attgattttg ccatttccag gaccctttgc tgcattctgt ctgtaagtct ctttttattt
     7441 gcctggctga cttgtttcaa ctttctttct ctgactgtgt ctgatgcaca gtctgtgttt
     7501 gtgtcttttg tgtccttgcc atttctatcc aactttgtct cttttctttc ccccttagaa
     7561 cccctttcca gggtgggcct catccatcct cagcctcagt ctacttctcc tgacccctta
     7621 tatttatatc tctacaggcg tggcattcag cagtggggag cgggcaaaac aactcaggcg
     7681 cctctctata gccacattga gagattttgg tgtgggcaag cgtggtgtag aggagcgtat
     7741 cctggaggag gcaggctatt tgatcaagat gttgcagggc acttgtggta agcaagagac
     7801 cattaagtgt ttgggcaaga gaaagaacat ccctgacacc tagaccctat gggttgtgga
     7861 taagaagggc ggggaagacc gcctaccaaa ccatccccag aatctggtgc tgagagattg
     7921 gtgcctcact ccaattccca caccatctgc taactcttct ccctcataat gccaatgtct
     7981 tccaaacaat gtcacccctc tcaggagccc ccattgaccc caccatctac ctgagcaaaa
     8041 cagtctccaa tgttattagc tccattgtct tcggggaacg cttcgactat gaggacacgg
     8101 agttcctgtc actgctgcag atgatgggtc aaatgaacag atttgcagct tcacccacag
     8161 ggcaggtaac agatccagct ctgccaattg tccttatagt gtcccacatt gaccatacca
     8221 acaaagggca aggaccaccc tgactctcat ggctacaaac aaaagctccc ctcaaaaaca
     8281 gaagctcccc tcaaaaccag cctttacttc agaaaactga acctttacat cagagcccac
     8341 agaagctatc cagtgctcac aatctaatgt cctctggata tctcagtagc ctgagaacac
     8401 agccctctgc ttgactctct tccctgggca ggtttctcca gcttaacctc taataaatcc
     8461 tctatgtggt cctcctgaaa atttagacaa ctgcccaagg gatacaagtg accacctctg
     8521 gccccctcct ccaatcctga acacctacct agttctgcaa aactgtggtc agtaaagcta
     8581 ttcagtccat acacccagtt ctccccaaag atcccactga cacaatggca caaaagtcac
     8641 ctgttgtctc aggtaaattc aggaatgagt agacaggcac ctcaaccaag gcaaccaagc
     8701 acagacctct ggatggactg tttccccaaa cacccatatg tctcccagct acacacaacc
     8761 cacatcaaga caatatctga caggtgtgtc tcacacctta taacctgaac caccccacca
     8821 tgaagacctg actatgtgaa aaaccgattc taatctcaaa caaatatcaa gacatctaat
     8881 cttagccctc tcaaatgccc aaacatatag atacttgatt cactgcgaca ctcatgtcct
     8941 gaatactaga aacctggagt aatggtctga tccaaaaatc agttaaataa ctgaatgtct
     9001 actaatgttc ccttttgatc cagttcattg ggattgtaag acaatgacct tcattcttta
     9061 aatcacctag aaaactgtgg tctctggggc ctctgacagt tcagtggttt aagagcatgc
     9121 actgctcatc ctgaggaccg agttcagttc ccactaccta tgctgaacat ttcaaaactc
     9181 tatgggagta cacctgcacc gtgcacataa ttaaaagtaa aatattcaaa cgaatataaa
     9241 gagttctttc aagagtggag gtgctgtttg ttgcaattca tcctaacata aatacatgaa
     9301 cacctggatg gatcccttga gactcgaccc actcccacgg gtgttgccac tgacaagcct
     9361 tttcttttct cctcccaccc cccagctcta tgacatgttc cattcagtga tgaagtacct
     9421 gcctggacca cagcaacaga tcatcaaggt tactcagaaa ctggaagact tcatgataga
     9481 gaaagtgagg cagaaccata gtaccctgga ccccaattcc ccaaggaact tcattgactc
     9541 ctttctcatc cgcatgcaag aggtgatccc aatcatggtg gatggaatgt ctaaaacagg
     9601 gcagctctaa atcatcctag aaaaggagga ggaatatagg cccattaagt gcccatgatt
     9661 ctcctcacag tcccggttat agttaaacct cactctttca cctgttgagc cttatccaag
     9721 ccagggtatg ggttagcaaa ttaccatgac aaccgatatt ccagtgttcc cctatgagac
     9781 actgttttca gtgttcaact acttagcatg cactgaagct actgtcgaag accctgtgga
     9841 gcctaaactt cgcaaagagg gaaagtgtgc ccagacttgc atgctgactt tatggagaca
     9901 gaaaactata cagccttgcc tctatggctc tcaggctttt actattagcc acatggtctc
     9961 tagcatttca tatctctgtt aggaaataca catcagtaca catcagtggc ctaagacctg
    10021 ggtttttttt tcttttgtct gttctagtaa tttttttatt gtttttcatt tttgtgtttt
    10081 tttcttttat tggatttttt atttctattt cagatattat cccctttctt ggtttccctt
    10141 ccagaaacct gctatctcct catgcttcta tgaggattct ctcccaccca cacaacactc
    10201 cctgccacct ccctgtgctg acattcccct acactggggc atcgagccca gacaggacca
    10261 agggtctctc ctcccattga tacccaacaa ggccatcctc tgttatacat atggctgaag
    10321 caataggtac atccctgtgt actcttggga tggtttagtc actgggagct ctggtgggtc
    10381 tggttggtta atattgttgt tcttcttata gggtggcaaa ccccttcagc tccttcagtc
    10441 ctttctctaa ctcctccata tgggaccatt ttctcagttc aatggttgac tgcaagcatc
    10501 tgcctctgta attgtcacgc tctgcagagc ctctcaggag acagctatat gaggatcctg
    10561 tcaacatata tttcttggca tccacaatat tgtgtgagtt tagaggatgt caatgggatg
    10621 aatccacctg tagggcagtc tctgaatggc ctttccttca gactctgctc caaactttgt
    10681 ctttgtattt ccttctttga gtatttttgt tccccctttc aagaaggact gaagcatact
    10741 cacttgagtc tttcttcttc ttgagtttca tgtggtctct gaattctatc ttgggtattc
    10801 caagtttttg gactaatatt tacttctcag tgagtgcata ccatgtgttg ggttacctca
    10861 cttaggatga tattttttag ttccatccat ttgcctaaga atttcatgaa gtcattattt
    10921 ttaatagcag tgtagtactc cattgtgtaa atttactata ttttttgtat atatttctct
    10981 gttgaagaac atctagtttc tttccagctt ctggctatta taaataaggc tgttatgaac
    11041 atagtggaga gtgtgtcttt gttatatgtt ggagcatctt ttgagtatat gcccaggaat
    11101 ggtatagctg agtcctcaca taatactatg tccaattttc tgaggaacct ccaggatgat
    11161 atccagagtg gttgtatcaa attacaatcc accaacaatg gaggagtgtt actctttctc
    11221 cacatcctta ccagcatctg ttgtcacctt cgtttttgac ctttgccatt ctaactggtg
    11281 tgaggtggaa tctcagagtt gttttgattt gcatttccct gatgactaag gaggttgaac
    11341 atttctttag gtacttctca accatattcc taagctgaga attctttgct tagctcttta
    11401 ctccattttt aatggggtta tttgattctc tggagtctaa cttcttgagt tctttgtata
    11461 tatttaacat tagccctcta tcggatgtgg gattggtaaa gatcttttcc caatctgttg
    11521 gttgtcgatt tgtcctaatg acagtgtcct ttgccttaca gaagctttgc aactttatga
    11581 agtagtattt gtcaattctt gatcttagag cataagccat tggtgttttg tttaggaaac
    11641 tctccctggt gcccatgtgt tcaagaccct ttcccacttt ctgttctatt agttccagtg
    11701 tatctggttt tattttagtt taattttatt tttcttggat aattatgtat tacacatcaa
    11761 atgttattcc ctttgtcccc tctctcatat ccccttcccc tccctctgcc tctatgggga
    11821 tgctaccacc cccatccacc cactcccacc tcaaccccct agcattccct tacattgaga
    11881 aaaagagcct tcactagacc aagggctttt cctcctattg atgctggaca atgccatcct
    11941 ttgctacata tgcagctgaa gccacgggtc cttccatggg tacgctttgg ttggtggttt
    12001 aggccctggg agctctcgtg gagtctggtt ggttagttga tattattctt ccatccctaa
    12061 aatgaatgac agtcacctag acagagaaat gagcaaagct tctcatgcaa acccaagact
    12121 gctaacacag cctggagatc tttttccaac gattggtctg gaccctatga gaactagatc
    12181 caaaggaaat tgcagaagtg ctgcctattg catccctctc ctccatgagg aacttaatcc
    12241 acagttgacg gctgtttaga gacgatgaaa taatattcct ttgcagtgtg gctactagta
    12301 aattgacctt tctcaagtaa agaacccctc gcccatatgc atgcagccac acctaattat
    12361 aagcagttac ccacaacacc cccaacaaac aggaaaatag gaaggagact tattaggaat
    12421 aagaaatggt tcaaaaaaat ggaaagtaga aaataataga ggggaatacg tttaaagtgc
    12481 atttcatgta tacgtctgaa aaataaggac tcaaggttca gtgggtatgg aaggggattc
    12541 atctgggagg gtttggagga ggggtatgaa tatattcaca atacaataaa tgaaattctc
    12601 aaagaattaa taaaattatt tataaaagaa ttactagaaa tgtttcagaa aattaaaacc
    12661 cttaatgttc cccaaggatg acaaaatgat agatttatgc cctctctctc tctctctctc
    12721 tctctctctc tctctctctc tctctctctc tctctgcagg agaaaaatgg caattcagag
    12781 ttccacatga agaacctagt gatgacaaca ctaagcctct tctttgctgg gtctgagaca
    12841 gtcagctcca cactacgcta cggcttcctt ctactcatga agcatccaga tgtggagggt
    12901 gaggctggct atgtggcagg gaagttggga accgcagact ctccaactgc ttacaaccta
    12961 acaatgaccc tcacttctcc caggttcctg gatgctcagt catgctcagc tatgcagaga
    13021 caggggcata ttaaatgcat aaacacagtt ctcacaaact taaaatatta gacattccca
    13081 aattgatttc actctgactt ccagatctct gctctctgtt ctcttccctg actcctgctt
    13141 cttctcccca ccatgattct gtcacgaaaa ggataaaatg accctgtcca gcatttaggt
    13201 atggatatat gtttaaatgg tttaaatgca tgttatttac agagacatgt aatacatgca
    13261 gtggtacaca tgtgaactat tccacctgct ttgaggcctc tggattttta aaaatacccc
    13321 atctccgctt gtctttcagc caaggtccat gaggaaattg agcaggtgat cggcaggaac
    13381 cgacagcctc agtatgagga ccacatgaag atgccctaca cccaggctgt gatcaatgag
    13441 atccaaagat tttctaactt ggctcccttg ggcattcctc gaaggattat caagaacaca
    13501 accttccgtg gcttcttcct ccccaaggta gcagccatgc ccatccagga ggggcctcca
    13561 gcccacttac tgatgcttca gggcttcttt ccatctgtag ctatctaact ccactctaat
    13621 tcctccaacc aaagaattca tccacatgtc cccaaattct tgtccagctg ctttgaactc
    13681 cattttctat ctactcttct gccttgctac cttccaatct ctcaactcct gggctagagg
    13741 caaaggcctg ctgtcacact aacaccctat cttagcacat gatcccctgg agctcaaatc
    13801 tccaattgct gatggcacat atcgtagccc ctcaaatctc ctattcccta atgccttttc
    13861 ctgaggagac ctccaactct gtgccttgca gttgtctata tttggacatc ctttctccat
    13921 caacccatct tctaaaatct cctttcttcc ctcttccagg gcaccgatgt gttccctata
    13981 ttaggttctc tgatgacaga cccaaagttc ttccctagcc ccaaagactt cgacccccag
    14041 aacttcctgg atgacaaggg acagttgaag aaaaatgctg ctttcctccc tttctccact
    14101 ggtaaggaga cagtgggtta ttgaactact gttcacacca acatgggtag cacatgccag
    14161 cttccctgtc tgtgatgctg cctagaatca ggctaaccag gtatagcccc tgcacctccc
    14221 aagcaccaga catgctggat gcaggtgaga ggatccctgg gaccagtgat ctgtgtcaga
    14281 gaccggggag gggttgggaa taccaacttt cctaggtgat gctcatgcaa gcaatttctt
    14341 cacactcttt ctaatgcagc ttttaaataa ttgtttgttt ttctttattt tttaagtaat
    14401 ttatttaatg tgcaatggtg tgaggttgtc agatgccttg gaactgaact tatagatgat
    14461 tatgagctgc catgtggctg ctgagaattg aaccttggat cttcagaaga acagacagtg
    14521 ctcttaacca atgagccatc tcccagcccc atcttcagac tcttaaaagt gggataacaa
    14581 ccaggtggta taggtgcatg cctttaacca cagtactggt ggatatctga gttcaacacc
    14641 agcctgggac tatagagtga gttacaggac aacccaggct acatggagga aaccatgact
    14701 tcaaaaacta aaaataaata aataataggt aggtagatag atagatagat agatacatac
    14761 atacatagat acatagatac atagatacat agatacatag atacatagat acatagatac
    14821 atagatacat agatacatag atagatgcat agatagatac atagatagat agagacatag
    14881 atagatgcat agatagatac atagatagat agatgcatag atagatagat acatagatag
    14941 aaagatgcat gtatacatac atgcatgcat acatagataa atagatgact cataaaaaat
    15001 taaaagaata aaaaaataaa caaggccaca gcagagcatc tacatttgag aggataatta
    15061 ataattgata gaggaagcat ctgtactcca tattgctcca gcctaaaatg agttgtccca
    15121 cgttgtgtgt agggacacca gggttttaag agggttagga gcctttccta atgatccctc
    15181 atgctccagt atagcagccc cttctccttt tttttttctt tttttcttta ttaacttgag
    15241 tatttcttat taacatttcg agtgttattc cctttcccgg tttccaggcc aacatccccc
    15301 taatccctcc ccctcccctt ctttatgggt gttcccctcc ccaccctccc cccattgccg
    15361 cgctcccccc aacaatcaca ttcacagggg gttcagtctt agcaggacca aggacttccc
    15421 cttccattgg tgctcttact aggctattca ttgctaccta tgaggttgga gtccagggtc
    15481 agtccatgta tagtctttag gtagtggctt agtccctgga agctctggtt ggttggcatt
    15541 gttgttcata tggggtttcg agtcccttca agctcttcca gttctttctc tgattccttc
    15601 aacgggggtc ctattctccc acccttcccc cactgccgcc ctccccccaa caatcacgtt
    15661 cactggggct gaaccccatt tttaataggg ttatttgtct ccctgcggtc taacttcttg
    15721 agttctttgt atattttgga tataagccct ctatctgttg taggattggt aaagatcttt
    15781 tcccaatctg ttggttgccg ttttgtccta accacagtgt ctttgcctta cagaagcttt
    15841 gcagttttat gagatcccat ttgtcgattc ttgatcttag agcataagcc attggtgttt
    15901 tgttcaggaa attttctcca gtgcccatgt gttcaagatg cttccccact ttttttccta
    15961 ttagtttgag tgtatctggt ttgatgtgga ggtccttgat ccacttggac ttaagctttg
    16021 tacagcgtga taagcatgga tcaatctgca ttcttctaca tgttgacctc cagttgaacc
    16081 agcaccattt gctgaaaatg ctatcttttt tccattgaat ggttttggcc cctttgtcaa
    16141 aaatcaagtg accataggta ggtgggttca tttctgagtc ttcaattcta ttccattgat
    16201 ctatctgtct gtctctgtac caataccatg cagtttttat cactattgct ctgtaatact
    16261 gcttgagttc agggatagtg attccccctg aagtcctttt attgttgagg atagttttag
    16321 ctatcctggg ttttttgtta ttctagatga atttgcaaat tgttctgtct aactctttga
    16381 agaattggat tggtattttg atggggattg cattgaatct gtagatcgct tttggtaaaa
    16441 tggtcatttt tactagatta atcctgccaa tccatgaaca tgggagatct ttccatcttc
    16501 tgaggtcttc ttcaatttct ttcttcagcg tcttgaagtt cttattgtac agatctttta
    16561 cttgcctggt taaagtcaca ccaaggtatt ttatattatt tgggactatt atgaagggta
    16621 tcgtttccct aatttctttc tcggcttgtt tctcttttgt gtagaggaag gcaactgatt
    16681 tatttgagtt aattttatac ccagccactt tgctgaagtt gtttatcagc tttagtagtt
    16741 ctctggtgga acttttggga tcacttaaat acactatcat gtcatctgca aatagtgata
    16801 ttttgacttc ttcttttcca atctttatcc ccttgatctc cttttgttgt ctgattgctc
    16861 tggcttgaac ttcaagaact atattgaata agtagggaga gagtgcagcc ccttctcttt
    16921 aagagaacac agctttgcac ttggcactga ggcaaggcag cggtgagagc ttccttccca
    16981 actgtgctcc ttccctctct cctcttcagg gaagcgattc tgcttgggag atggcctggc
    17041 taagatggag ctcttcctgc tgctcaccac tattttacag aacttccgtt tcaagttccc
    17101 aatgaaacta gaagacatca acgagtcccc caaacccttg gggtttacca ggatcatacc
    17161 aaagtacacc atgagcttca tgcccatctg attctgagtt gaatcaaggt ggggcaagag
    17221 ggagagagag cctgaagtgg ggccagggtg caggtggaga gaacagggga ggtgaagatg
    17281 agggttaaga agggaccaca cccatggaag aaacacaaaa gacttctcac tttggtaaaa
    17341 ttgtaacagt cctaataaaa agaaagaaat actcagtggg cagcagtaac aacaactgag
    17401 actcatgggg caaaggtggc tcacctctgc agaagctgtc ctgtccttct ctcagtcctc
    17461 tacacaagag cagcatgtcc ccaagtccaa cgtacaggtt gcaaagatgg aacttacaaa
    17521 tttgaaccta aactgaggtg gaaaaaactc aagttagcta ggattgatgt tttggactct
    17581 atcaccagca ttcaggaggg agggaacatg gctctctacc atgtctgcca ggactacaca
    17641 gtgagagctt atctcaaaag aaaaaagaaa aaaagaaaaa aatttatata tatatatata
    17701 tatatgtata tatatgtata tatatatatg gagagagaga gagagagaga gagagagaga
    17761 gagagagaga gagagagaga gagtttgcat tgtacatgat cagggaaata ataaaaacta
    17821 gtttgacagt cacataccag tgggttctaa tttatcaaac tccaccccca cccccactgc
    17881 cactgctgcc ctatgaagga actgaacaga agcttaactt tccttgggcc atttcgacag
    17941 ctgttgtgtc atcaaggctt ctgttttcct atggagacac tacacatggg acagagagga
    18001 taacagggag ctcatgactg agagaccttc aggccaaagc acttgaacct ttgtttatcc
    18061 tgtttattct gaattttctg cttctgggct ctcatttccc caccattaaa atgagaatat
    18121 caatatttac agctgcactg catctctttt tggagtgatt cctggtaact aagaaataag
    18181 tagaaaatgg aaggatgaaa tccaccagga ggtttgagta aattccactg tgggaaacac
    18241 aggggactgt gggatggcaa ggatgagagc tggaaagaat gcaaggccac actatgtctc
    18301 atgcatattt tatatctttt ttatattctt tatatctttg tagtgttttt attagcctac
    18361 aaagaaatac atttctcact ggcaacttct tacatatata tcactaccta tgttctcatt
    18421 cactttcctt cgctggtctt ggcctcttcg caaaattatt caccggtaat ttattcacac
    18481 tttctaattt ttgagcatgg tgcattccag taagatttaa tctctgtggc catggtgttt
    18541 cacagctctg taacactgaa gcacattcat catcaactgc actgaagtca tcaacttaag
    18601 aagcaaagga ggattcttct ggtctccatc tgcgcccaga gctaagtctg ccccacaacc
    18661 ctccagattc aaaacctccc cagacagagc tagtcctcca ggagtgctct cactactaag
    18721 gccacaagtg agaccccatt tcccttcaat accgatccaa agaggagccc accagatacc
    18781 aggtaccaaa gttaaatgag gatccgttga cctgcaggtc
//
LOCUS       RATCYP2A21   7247 bp ds-DNA             ROD       12-JUL-1990
DEFINITION  Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 1 and 2.
ACCESSION   M33313
KEYWORDS    LINE repetitive sequence; cytochrome P450;
            hepatic steroid hydroxylase IIA2.
SEGMENT     1 of 3
SOURCE      Rat (strain Sprague Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 7247)
  AUTHORS   Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J.
  TITLE     Structure and in vitro transcription of the rat CYP2A1 and CYP2A2
            genes and regional localization of the CYP2A gene subfamily on
            mouse chromosome 7
  JOURNAL   Biochemistry 29, 1329-1341 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       5556     5732     hepatic steroid hydroxylase IIA2 (CYP2A2),
                                 exon 1
               6198  +  6360     hepatic steroid hydroxylase IIA2, exon 2
    pre-msg    5528  >  7247     CYP2A2 mRNA and introns
    IVS        5733     6197     CYP2A2 intron A
    IVS        6361  >  7247     CYP2A2 intron B
    rpt     <     1      649     LINE repeat
    rpt        1120     2122     LINE repeat
    rpt        5852     6037     dre repeat
    rpt        5852     5863     5' direct repeat
    rpt        6026     6037     3' direct repeat
BASE COUNT     2306 a   1563 c   1530 g   1848 t
ORIGIN      
        1 aactatcctc aacaataaaa ggacttctca gggaatcact atccctgaac tcaagcagta
       61 ttacagagca atagtgatta aaaactgcat ggtattggta cagagacata cagatagacc
      121 aatggaatag aactgaagac ccaaaaatga acccaagcac ctatggtcac ttgatttttg
      181 acaaaggaac caaaaccatc caatggaaaa aagatagcat tttcagcaaa tggtgctggt
      241 tcaactggag gtcagtatgt agaagaatgc agatcaatac attcttatca ccctgtacaa
      301 agcttaagtc caaatggatc aaggacctcc acatcaaacc agatacactc aaactaatag
      361 gagaaaaagt ggggaagcat ctcgaacaca tgggcactgg agaaaaatcc ctgaacaaaa
      421 taccagtggc ctatgctcta agatcaagaa tcgacaaatg ggatttcata aaactacaaa
      481 gcttctgtaa ggccaaggac actgttgtta ggacaaaacg gcaaccaaca gattgtgaaa
      541 acatctttac caatcctaaa actgatagag gctcatatcc aaaatataca aagaactcat
      601 gaagttagag tgcagggaga caaataaccc tattaaaaaa tggggttcat gggtgtagat
      661 ctctcctgag agacacaccc agaatacagc atattcatat gcgaatgcca gcagcaatcc
      721 actgaactga gaatgggacc cccgttgaag gaatcagaga aaggactgga agagcttgaa
      781 ggggctcgag accccatatg aacaataatg tcaaccaacc agagcttcca gggactaagc
      841 tattacccaa agactgtaca tggagtgacc ctgggctcca actgcataag tagcaatgaa
      901 tagcctagta agagcacagt ggaaagggaa gcccttagtc ctgccaagac tgaaccccca
      961 gtgaatgtga ttgttggggg gaggacagta atgggtggag gatggggagg ggaacaccaa
     1021 tatagagggg agggggagga gttaggggga atgttggcct ggaaactggg aaagggagta
     1081 acaatcgaaa tgtaaataag aaatactcaa gttgataaag ataaaaaaaa agtgaggttc
     1141 agagctaaac aatgaattca cagctgagga atgccaaatg gctgagaagc accaaagaaa
     1201 tgttcaacat ctttagtcat aagggaaatg caaatcaaaa caaccctgag attctacctc
     1261 acaccagtca gaatggctaa gatcaaaaac tcaggtgaca ccaaatgctg gcgaggatgt
     1321 ggagaaagag gaatactcct ccattgttgg taggattgca gactgctaca accattctgg
     1381 aaatcagtct ggaggttcct cataaaattg gacatagatc tacctgagga cccagctcta
     1441 cctctcttgg gcatataccc aaaagatgca ccaacatata acaaagacac atgctccact
     1501 gtgttcatag cagccttatt tataatgggc agaagctgga aagaacccag atgcccttca
     1561 acagaggaat ggatacagaa aatgtagtac atctacacaa tggaatacta ctccgctatc
     1621 aaaaacaatg actttatgat attcataggc aaatggatgg aactcgaaaa tatcatcctt
     1681 agtgaggtaa cccaatcaca gaaaaacata catggcatgc actcattggt aagtggatat
     1741 tagcccaaat gctcaaatta ccctagatgc acagaacaca tgaaactcaa gaaggatgac
     1801 caaaatgcgg atgcttcact ccttctttaa aacaggaaca agaataccct tgggagagga
     1861 tagggaggca aagtttagaa cagaggcaga acgaacaccc attcagagcc tgcccacatg
     1921 tggcccatac atatatagcc accaaactag ataagatgga tgaagcaaag aagtgcaggc
     1981 tgacaggaga tctatgtaga tagatctctc ctgaaagaca cagccagaat acagcaaata
     2041 cataggcgaa taccagcagc aaaccactga actgagaatg ggaccctgtt gaaggaatta
     2101 gagaaaggac tgaatgttgt tgtaaaaata taaaaataaa gagtaatgtt ggtcttttac
     2161 cccgctaggt atcttggcgg aaacacatcc cagccacgca ctttcctaca ctcaaaccct
     2221 cacataaaag aacacacaac acaataatct ttgacccaat tggtaagata taattgccta
     2281 cttaaacata caaagcccgg taccatccat cccttgagaa cattaataac aatttgtaaa
     2341 tacacagagc agaatcttaa catcaccagc tatcttgtcc tgccacggct tctccgcccc
     2401 tctctccctc ctgtctcttc ctctctccct tagtctcctc ctcttcctta aaacttctct
     2461 cccgcccatc cttccttctc ctccaatgac aggcctcctt ctatcctgta cctgcccctc
     2521 accagtactt tacaaattca gtggagaggt ggttctggtg aagtcacctg agttctgagt
     2581 ccttgactag gcagctgtcc ttggggcagt ggaattagca tcaaaataca gtaacttcag
     2641 ggcaaaccag aataactgaa agagcttgaa ggggcttgaa accccatatg aacaacaatg
     2701 tcaaccaacc agagcttcca gggactaaga ctatacaagg actgaccctg ggctccaact
     2761 gcataggtag caatgaatag cctagtaagg ccaccagtgg aaggggaagc ccttggtact
     2821 gccaagactg aacccccagt gaatgtgatt gttgggggag gacggtaatg gggcgaggat
     2881 ggggagggga acaccaatat agaggggagg gggagaggga gggggatgtt ggcctgaaaa
     2941 cctggaaagg gaataacaat tgtaatgtaa ataagaaatg gctcagtggt taggagcact
     3001 gactgctctc ccataggttc tgagttcaaa tcccagcaac cacatggtgg ctcacagcca
     3061 tttgtatggg atccgattcc ctcttctggt gtgtctgaag acagcaacat tgtacttata
     3121 aatgaataaa caaataaata aatctttaaa aaaaagaaat acccaattta ataaagatgg
     3181 agaacaaaaa acaagaagat acattgctag ggctagagac atggctcagc agttaagagc
     3241 actgactgct cttccagagg tcctgagttc aattcccaga accacatgat ggctcacaac
     3301 aatctgcaat gggattcaaa gatcacttct ggtgtgtcta aagacagtga caatgtactc
     3361 atatacatga aagaaagaat gaaatcttta aaactttcaa aagctgaaga catgctccct
     3421 atattattcc aggcaaatcg aagaatttga attctatcac aaactacaat actcacatca
     3481 atgagggttc ttttcatgtg ctcaaccaca caaatgtaga tttttagtta tggatttgat
     3541 ctggggaacc tagacatgga cagtctccag ataatgccca cagttaccaa tacagcatcc
     3601 ctctcaacat agcctttgaa gagtgtgcta taggttggcc tgttcacctc atccacttaa
     3661 ctcctcagct tggtctccca aagcctcttt gactgcatcc cattcataaa ggaccacaac
     3721 ccagtgaggc tttaagaatt ttgaagtact ggcagcagcc tatgccctgg ggacccctga
     3781 gcatctcacc agttccaggt cggagactcg gctacatacg atggcaccga acccagatac
     3841 tcactggaaa ggaccgtacc tggtgctgct gaccaccctg acagccatca actctcagcc
     3901 ctcaccagcc gtgtactagc tgttggggct gagagctggg acctagagct gggaccagtt
     3961 cttcaaaaag ctccctagac ttaatttcat gtttgccccg ggttttatca agataggtgt
     4021 ggggataggc ttgatttcta ttacaaatga tgtaacattg catatgttag tactcctaac
     4081 acttcttggg actgtgcctc agggatcaca atctgtataa gtttagaagt tctaaaagct
     4141 agtcatgacc ttggtgtgta ggtttagata gtgtccagat tggaatcctg atgctaaaga
     4201 cttagtaaga cacaaaaaaa ggagttgaga attacttagg gctaaggcta tctaggtgct
     4261 gcaagggcag cacaaggaca tctgctgttg caatgcaagg cttatagaga attcagaact
     4321 gccatttagg agtaattaaa gactccatga ataaacttag agaaaggtta gacaaaaggc
     4381 agacagagaa gcgcatcagg gatggtttga gagctggttt agtagatctc cttggatgac
     4441 tactctggta ttttccctta tgggaccctt cttagttttg cttctgcttc tgattatagg
     4501 tccatgtgtg ttagagaaac tagttaatag gtttgactcc tacaaaaaga tagagacgct
     4561 caacaaggtt ggtttgagtc ttggttcact cggtctccct ggatgactac cctactctct
     4621 gctatatggc tgggccatta ctaataattt tcttggtttt agtttttgga ccctgcgtga
     4681 caaacaggtt aattgctttt gttacaaatc gagtgagtgc tgtgcggttg gttctgagac
     4741 aacagtacca gtcagttagg acaactggtg agaccaaata agagacttga tatcaaaatt
     4801 ctaagattag aattacttag tagaagaaga ggggaatgaa aggaaaatta tacagattta
     4861 aggtttaaaa atatgaagtt aaaagagtat gtttcaactc aggactaaac actgtgaaaa
     4921 gcaagtccag gcagccccgc cctgccgcta gaactaacag accataaaag gaaaggaatg
     4981 cagaacagac caggagtacc ggatctgact cacaggccac ctggcaggaa gagataagcc
     5041 cccagccccc gacatccagg acgccccaaa cctgccaatg tgtgtagcta taccttatta
     5101 cctcatcatg tgaaatagcc aatcatatgt gaacatgtct atgtgcctcg tttgaatcca
     5161 ccaatccccg taactatgca tctgcttctg tacgcccgct tctgcttccc caatccctat
     5221 aaaagcccca tgctggagct gctgggcgcg caagtcctcc gaagagactg tgtgcctgca
     5281 ggtacctgtg ttttccaata aaccctcttg ctgattgcaa aaaaaaaaaa aaaaaaaaaa
     5341 aaaaaaaaaa gaattttgaa gtaagcctgc cacctttctt cctatttgcc catgttgtgg
     5401 gtgttgcaac aaagactggg tcaatgttag aaaatagggt tgggaggcaa aagactcaac
     5461 tagacaaaca ggagcaaagg ccatcctgtg tccctgggag tataaaggta ctatctcagc
     5521 cttggctatc agtctgtcca tcctcactgg ctactatgct ggacacagga ctgctcctgg
     5581 tggtcatact ggcctcccta agtgtcatgt tcttggtgtc cctctggcag cagaaaatca
     5641 gggagagatt gcctccagga cccactcctt tgcctttcat tggaaattat ctgcagctga
     5701 atatgaaaga cgtatacagt tccatcacac aggtatcact ggatgagggg atggatggga
     5761 catgggagtc caagaagctg ggttgttttg catgttttgt ggcagaagat tcatagagta
     5821 aatccaaagt cttgtattca tggagtttag aaagataagg agcgggctgg agagatggct
     5881 cagcggttaa gagcaccatg tgctcttcca aaggtcctga gttcaaatcc cagtaaccac
     5941 atggtggctc acaaccatct ataatgagat ctggtgccct cttcttgtat tcttaatcat
     6001 aataaataaa taaatctaaa aaaataagat aaggagctat ttcaactctt tggtttgttg
     6061 tttgcattcg tttgtttgtt tgtttgtttg tttgtttttc aatcatttat ttgtagaata
     6121 acacataatc tgacctctgt gtactggtct agttcagtga ataagtcatc taacagcctc
     6181 catctaccca acatcagctc agtgagcgct atggtcctgt gttcaccatt caccttgggc
     6241 ctcgacggat tgttgtgctt tatggatacg atgcagtcaa agaggctttg gtggaccaag
     6301 ctgaggagtt cagtggacgt ggcgaactgc ctacctttaa tatactcttc aaaggctatg
     6361 gtgaggagga taccacattg gggagcatgc ccaaggacat ttgttggcct catttaagta
     6421 gccttcatcc taactcatct ttcccctcaa ggctgtacat agtcctctga tttttctctc
     6481 catattcaag ttgaatgttg cttcttattg tgacccttcc tagtctttct atgattctct
     6541 gtgggtgctt cctttcattc ttcttcaccc ttttcttcca ttctttaacc ctcataatac
     6601 taggtaggag ataaaaagag atagaggaaa aaggggacac tattgttaga ctacttcctt
     6661 ctgagaggta atgagttcct tagggcaagt ttgatcatct cagtcaggat atctaatttc
     6721 ttcttcctgt tgttactttg cacaaggcga cttaacaaag cacagccaac agcaaccaac
     6781 caacaaccaa aaccaatctc tcaaggccct tgcattaaaa taacctctga ggaatcccca
     6841 gtatcctaag ggtcacactc tcagaaacta tctgcagtag gcaaaatcat acccctgcta
     6901 gagcacaaaa taaatcatag gtctctgctt tggacaatct gattcatccc catattgcat
     6961 acctggaatt aaaaaaacat attcctataa tatttctgta tttgtcaaaa aaaaacaaaa
     7021 ttcttttttt tttatcttta agtaatactc caactttatt gaataaagga ataaatggag
     7081 ttttcaagtt ttcccatcat ggttattttt aaagccacct gatacatgac agtacttatc
     7141 aaaacaagat gtttatctat ttttgtcatt tgtatttttg cttaatttta tattcataat
     7201 atatttaaat taactaatag ttcatggtaa cacttggcca cacaggt
//
LOCUS       RATCYP2A22   4753 bp ds-DNA             ROD       12-JUL-1990
DEFINITION  Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 3,4 and
            5.
ACCESSION   M33325
KEYWORDS    cytochrome P450; hepatic steroid hydroxylase IIA2.
SEGMENT     2 of 3
SOURCE      Rat (strain Sprague Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4753)
  AUTHORS   Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J.
  TITLE     Structure and in vitro transcription of the rat CYP2A1 and CYP2A2
            genes and regional localization of the CYP2A gene subfamily on
            mouse chromosome 7
  JOURNAL   Biochemistry 29, 1329-1341 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +  1815     1964     hepatic steroid hydroxylase IIA2 (CYP2A2),
                                 exon 3
               2182     2342     hepatic steroid hydroxylase IIA2, exon 4
               3567  +  3743     hepatic steroid hydroxylase IIA2, exon 5
    IVS     <     1     1814     CYP2A2 intron B
    IVS        1965     2181     CYP2A2 intron C
    IVS        2343     3566     CYP2A2 intron D
    IVS        3744  >  4753     CYP2A2 intron E
BASE COUNT     1258 a   1204 c    911 g   1380 t
ORIGIN      About 5 kb after segment 1.
        1 aatcaatagt ttttaagcta ctaacccttt ctagagatga tgaaaataga aaactggaag
       61 aatgcctagg tagcaaatga ccttggaagt tagggactaa aaatttaagt ccacatctgt
      121 gcaagataaa aattaactct tagtttgcat aagctcttat ttttttcata agtcttattt
      181 gtttttttat ctttattaac ttgagtattt cttatttaca tttcgattat tattcccctt
      241 cccagtttct gggtggatga ctccttttta acttagctga tatttttatt cttcttaaac
      301 atttatccac acacagagca tcagtcgcag gtctgaggca caccctgcta gtgcctctgg
      361 attgttttta aagatcattt gctcttactt ttctatctat gggtgttttg cttatgtgta
      421 tatgtgtaca caagtctggt gcccatggaa gcaaaaagat ggagtcagat ctcctgaact
      481 ccaggggttc catgagttct ataagctgtc aggcgagtgc tggggttcaa gcacaggtcc
      541 tctgcaaggt cagccagtgc tcttgagtgc agagccagct ttgctgtcca tccccccgcc
      601 cccgcgcatg tatttttaaa tgttgtttta catatgtcat gtgttgtccc taagatgtgt
      661 ataatgctta tagaacatta cagtctggta agtgctggcc aaagttacag aagtataaaa
      721 tggccttgag cagcaaaaca ttggttataa gcaagaaagt tcaaaataaa gagaaaatcc
      781 acaaagagcc aaatatcttt ataacattaa ttctgtggtt gcgatttaac accaaggggg
      841 tatctgtttc cctgaactaa ggggcacaga aatggctact actacttagg gtcaaaatag
      901 tgactacagc tcaggacaca taagcaaaac cagagccaaa gaccagggag tggtaataaa
      961 ataataaaaa atcctggctc agggattcgt cccacctttc cctggtgaaa gacacacaca
     1021 gcctttatat tttagtctgc cttatgcagc acaatagctg ggcagctgcc taccctccat
     1081 gttgttagaa tccatttccc tatcaatagc cttgagttga tactttacaa atttccatat
     1141 tccatttttg ctgttcttaa cccaatttaa cagccttctg ggccacaatc tcttggccct
     1201 tagcacatgg tatctctcct ttgcccttct tctctttctt cttccttggc ttccacggaa
     1261 gctcctcggt cccattctcc ttcctcatgc tctagccaag gaaacctaaa cccctcctat
     1321 gtcccttctc cccagctatt agctgctggc atctttattt accaaccaaa gtaaatgggg
     1381 gcagagtccc ccaggctaag ggcagattcc aaatcttaga aggcagcacg aagcagtata
     1441 gtaaacagta aaagaaaaaa acgcaacacc agagtacgtt tctatgtatg ctgtccttgc
     1501 tttaatgtgg agtttctgtt ttcagaaaat gctcaaattt ggttctttta gccatgtcag
     1561 cgacctggag cagcattctg agtctctctg cttctgtctg taactctctg tttccttgcc
     1621 tggctgactt gttccaactt tcttactctg actgtgtctg ctgcagagcc tctgttcgtt
     1681 tcttcagtgt tcttgccatc tcaatcccat ctttgtctct tttctttcct ctaagaaggc
     1741 ctttccagca tgggcctggg ccttcctcag cctcagacta cctcacccca acacccatgt
     1801 tcatgtctct acaggttttt cattgagcaa tgtggaacag gccaagcgta tcaggcgctt
     1861 caccatagcc acattgagag attttggtgt gggcaagcgt gatgtacagg agtgtatcct
     1921 ggaggaggca ggctatttga tcaagacgtt gcagggcact tgtggtaagc aagagaccat
     1981 taagtgtttg ggcaagagaa agaacatccc tgacacctag accctatggg ttgtggagaa
     2041 ggaggacggc gaagaccgcc taccaaacca tctccagaat ctggtgctga gagattggtg
     2101 cctcactcca attcccacac catctgctaa ctcttctccc tcataatgcg aatgtcatcc
     2161 aaacaatgtc acccctctca ggagccccca ttgacccttc catctacctg agcaaaacag
     2221 tctccaatgt cattaactcc attgtcttcg ggaaccgctt cgactatgag gacaaagagt
     2281 tcttgtcact gttggagatg atcgatgaaa tgaatatatt tgcagcctca gccacagggc
     2341 aggtaaaaga ttccagctct gccaattgtg cttataatgt cctacattgg ccataccgac
     2401 aaagggcaag gactacccca acgctcatgt ccacaaacat tcccctcaaa aacagaagct
     2461 cccctcaaaa ccaaccttta ccttcagaaa actgaacctt tacatcagag cccacaggag
     2521 ctatccagtg ctcacaatct aatgacctct ggatatctca agggcctgag aacaaagccc
     2581 tctgcttggc tctcttccct gggcaggttt cccccgctta aattctgaca gatcctctgt
     2641 gtggtcgtcc tgaaagttga gacacctgcc caagggagac aagtgatcac ctcaggcccc
     2701 ctcctccaat cctgagcacc tacctggttc tgcaaaacta tggccagtaa agtcattcac
     2761 actggacaca ctgctctccc aaaagatctc actggcacca tgacacgaga gtcacctgct
     2821 tgtctcaggt aaattcagga atgagtagac aggaacctca accaaggcaa ccaagcacag
     2881 acctctagat ggactgtttc cccaaacacc catacgactg ccaaccagcc acacacagtc
     2941 caattcaaaa aggtctgaca ggtgtgtccc acaccttata acccgaacca tcttatcctg
     3001 aatactttac tatgtggaaa acagattcta atctcaaaca aatatcaaga gatctaaatt
     3061 cagccttctt tggtgcccaa acatctaaat acttgagtca ctgtgataac cctggcctga
     3121 acacaggaaa cctggattaa tggtctaatc aaaaaatcaa ttgaatagtt gaatgtctgc
     3181 taatgtcccc ttttgatcca gctcatccag attgtaggac aatgaccctc attctttaaa
     3241 tcaactagaa aattgcagtc tctggggctt cagactgttc agtagtttaa gagcatgtac
     3301 tgctcatcct gaggacctga gttcagttcc cagtacgtat gctggacatt gcacagctca
     3361 aggggagtac acctgcactc gtgcacataa ttaaaagtaa aatattcaaa tgaatataaa
     3421 gagttctttc aagagtggag gtgctgtttg ttgcaattca tcctaacata aatacatgaa
     3481 cacctggatg aatgacttaa tacaagtgcc actcccactc aatgttgcca ctgacaagcc
     3541 ttttcttttc tcctcccacc ccccagctct atgacatgtt ccattcagtg atgaagtacc
     3601 tgcctggacc acagcaacag atcatcaagg ttactcagaa actggaagac ttcatgatag
     3661 agaaagtgag gcagaaccat agtaccctgg accccaattc cccaaggaac ttcattgact
     3721 cctttctcat ccgcatgcaa gaggtgatcc caatcatggt ggatggaatg tctaagactg
     3781 agcagctgga aatcacccta gaaaaggagg aggaatataa gcccattaag tgcccatgat
     3841 tctcctcaca gtcccggtta tagttaaacc tcactctttc acctgttgag ccttatccaa
     3901 gccagggtat gggttagcaa attaccatga caaccgatat tccagtgttc ccctatgaga
     3961 cactgttttc agtattcaac tacttagcat gcactgaagc aactgtcgaa gaccctgtgg
     4021 agcctaaatt tcgcaaggag ggaaagtgtg cccagacttg catgctaact tcatgcagac
     4081 agaaaactgc ttgcctctat ggctctcagg attttactat tagccacctg gactctagca
     4141 tttcatatct ctgttagaaa atacatatca atacacaacc ctgaactggg caacctgggt
     4201 tgttgtattt tttcttctat tatctgctct agtaattatg tattgttttt tattttaatg
     4261 ttgtttttct tttttttttc atctttatta aattgaagat ttcttattta catttaaatt
     4321 gttattcccc ttcccggttt ccaggccaac attctctaac ccctcccctt ccccttctat
     4381 atgggcttcc ccttcatatc ctccccccat taccaccctt cccccaacaa tcacgttcac
     4441 tgggtgttca gtcttggcag gacccggggc ttccccttcc actggtgctc ttacaagcct
     4501 cattgcttcc tatgaggttg gagcccaggg tcagtccatg tgtagtcgtc gggtagtggc
     4561 ttagtccctg gaagctctgg ttgcttagca ttgttgttca tatagggtct cgaccccttc
     4621 aagctcttac actcctttcg ctgattcctt caacgggggt cccgttctca gttcagtggt
     4681 ttgctcctgg catttgccta tgtatttgct gtattctggc tgtgtctctc aggagagatc
     4741 cgttgacctg cag
//
LOCUS       RATCYP2A23   5080 bp ds-DNA             ROD       12-JUL-1990
DEFINITION  Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 6,7,8 and
            9.
ACCESSION   M34392
KEYWORDS    cytochrome P450; hepatic steroid hydroxylase IIA2.
SEGMENT     3 of 3
SOURCE      Rat (strain Sprague Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5080)
  AUTHORS   Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J.
  TITLE     Structure and in vitro transcription of the rat CYP2A1 and CYP2A2
            genes and regional localization of the CYP2A gene subfamily on
            mouse chromosome 7
  JOURNAL   Biochemistry 29, 1329-1341 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +  1322     1460     hepatic steroid hydroxylase IIA2 (CYP2A2),
                                 exon 6
               1876     2063     hepatic steroid hydroxylase IIA2, exon 7
               2496     2637     hepatic steroid hydroxylase IIA2, exon 8
               3636     3817     hepatic steroid hydroxylase IIA2, exon 9
    pre-msg <     1     4588     CYP2A2 mRNA and introns
    IVS     <     1     1321     CYP2A2 intron E
    IVS        1461     1875     CYP2A2 intron F
    IVS        2064     2495     CYP2A2 intron G
    IVS        2638     3635     CYP2A2 intron H
BASE COUNT     1470 a   1191 c    970 g   1449 t
ORIGIN      About 15 kb after segment 1.
        1 gaattctttg tatatattgg acaatagccc tctatcagat gtacaattgg taaagagctt
       61 ttcccaatct gttggttgtc gttttgtcct aataacagtg tcctttgcct tacagaagct
      121 ttgcaatttt atgaagtccc atttgttgat tcttgatctt agagcataag cccttggtgt
      181 tctcttcagg aaattctccc atgtgccctg tgttcaaggc tcttacccgc tttctcttct
      241 attagtttca gtgcatctgg ttttatttta attttgtttt atttttcttg tatatttttg
      301 tacttacact tcaaatgcta tctcctttgt acattctctg atatctcctc cctgtcccca
      361 tgcttctatg aggatgctct cacttccacc cacccactcc cacctcaatg ccttgacatt
      421 cacctacatt ggggaaatgg gcctttactg gaccaaggac ttttcctcct attaatgatg
      481 gacaatgcca tcctctgcta tatatacagc tgaagccatg cttccctcca tttgtactct
      541 ttggttgggg gtttagtctc tgggagctct gagggaagag tctggttggt tgataatttt
      601 gctcttccag ccatgaaatg aaagacagtc acctatacag agaaacaagc aaagcttctc
      661 ctgcaaacca aagattccaa acacaacctg gacattgctt ttccaaccat tggtctggac
      721 actttgagaa ctagatacaa agaaaattcc agaagtgctg ccacttgggt ccatttctga
      781 ggaatttaat ccacagttga tggctgctta gagatgatga aatcatattc ctttgcagtg
      841 tggctactag taaattgccc tttctcaagt gaagaaccac tcacccatat gcatgcagcc
      901 acacctaatt ataagcagat ctccccccaa ataaaaacag gaaaatatga ggaagactta
      961 ttagaaatta gaaatggttc aataaaataa aaatagagat aatggagggg aatatgttta
     1021 aggtgcattt cacatatatg tctgaaaaat gaagactcaa gattcagtgg gtatggaatg
     1081 ggattcatct gggagggctt gagggagggg tgtgaatgta ttcacagtac aataaatgaa
     1141 attctcaaag aaataataaa aatatttata caataatgac tagaaatgtt ttagaaaatt
     1201 aaaaccctta gtgttcccca aaaggagtac aaaatgataa atagatttgc gttctctctc
     1261 tctctgtctc tgtctctgtc tctgtctctc tgtctctctc tctctctctc tcccccccca
     1321 ggagaaatat gttaattcag aattccacat gaacaaccta gtgatgtcat cattaggcct
     1381 cctctttgct gggactgggt cagtcagctc cacgctatac catggtttcc tgctactcat
     1441 gaagcatcca gatgtggaag gtgaggctgg ctgtgtggca aggaagttgg gaaccccaga
     1501 ttctccaacc tgacaatgac cctcacctct cccagatccc tggatgctca gacatcctga
     1561 ctatgcagac acagaggcat attaaatgca taaacagagt actaagttaa aatattaaac
     1621 attctgaaat tgatttccca ctgactgcca gatccctgtt ctctgttccc tgacttctcc
     1681 ttctccccac catgatttgg tcatgaaaag gataaaatga tcctggccag catttaggta
     1741 tggatgtatg tatagatggt ctaaatgcat gtttacagag acatgtaata catacagtgg
     1801 tacacatgtg aactattcca catgctttga ggtctctgga tttttagaaa cagcccatct
     1861 tcctttgtct tccagccaag gtccatgagg aaattgagcg agtgatcggc aggaaccgac
     1921 agcctcagta tgaggaccac atgaagatgc cctacaccca ggctgtgatc aatgagatcc
     1981 aaagattttc taacttggct cccttgggca ttcctcgaag gattatcaag aacacaacct
     2041 tccgtggctt cttcctcccc aaggtgcagc caggcccacc caagtagggg cctccaaccc
     2101 actccctgat gcttcagggc ctctttccat ctacagccat ctaactcaac tctaattcct
     2161 ccaaccaaag aattcaccca catgtcccca acttcttgtc acactgcttt gaactccaag
     2221 ttctatctga tcttctgcct tactactatc caatctctca actcctgggc taacacacta
     2281 acacattatc tcagaacatg attcccctag agctcaaatc tccaatttct ggtggcacgc
     2341 atcacagccc ctcaaaactc ctattcccta atgccctttc ctcaggagac ccccaactct
     2401 gtgcctttcc gttctcttca tttggacact agcaccactt ggggtccttt ctccatcaac
     2461 ccatcttctc aaatttcctt tctttcctct tccagggcac cgatgtattc cctataatag
     2521 gttctctgat gacagaacca aagttcttcc ctaaccacaa agacttcaac ccccagcact
     2581 tcctggatga caagggacag ttgaagaaga atgctgcatt tctccctttt tccattggta
     2641 aggagacagt gggttattag accactgctc ataccaacag ggataactca tgccagttcc
     2701 catctctgtg attctgccta gcatcaggct aaccaggtac aatccctgca cctcccaagc
     2761 accacgactc aggtcaaagt atcaatgaga tcagtgatct ctttcagaga ctgggaagcg
     2821 gttcagaaca ccaaatttcc caggtcatgc tcatgcaagc aatttcttca tactcttttt
     2881 aaagcagttt taaatgattt ttttgttatt ttttaataat tcatctaatg tgcattggtg
     2941 tgaggttgtc agattcatta gaactggact tatagacatt ttatctgcca tgtgggtgct
     3001 gagaattgaa ccttggttct tcagaagagc agacagtgct cttaaccagt gagccatctc
     3061 ccagccccat attcaaattt taaaagggga taacaaccag gtggtggtgg tacatgtctt
     3121 taaacccagt actcaagaag cagaagcagg tggatatcta agttcaatgc cagctggatc
     3181 tatagagtaa gttagaagaa aacccagact aaatggagga aaccctgact taaaaaacta
     3241 aaaataaata aataatagat agatagatgc atgcatgtat acatacatat atgcatacct
     3301 acatgcatgc atacatagat acatagatga ctcagagata attagatgaa taaataaata
     3361 aacaagacca cagcaggcat ccacatctga gaataaaatt aataattggt agaggaagca
     3421 tctggactcc atattgcttc agcctacaat gagttgcccc actttgtgtg tagggacact
     3481 ggggttctga gagggttagg aacctttcct aatgatcact catgctccag gttagcaccc
     3541 cttttcccta agagaacaag gctgctcact gggtactgag ggaaagaagt gagatcttgc
     3601 tccaagtctg tgctccttac ttctctcctc tttaggaaag cgattctgct tgggagatag
     3661 cctggctaaa atggagctct tcctgctgct caccaccatc ttgcagaact tccgttttaa
     3721 gttcccaatg aatctagaag acatcaacga gtaccccagt cccatagggt ttaccaggat
     3781 cataccaaat tacaccatga gcttcatgcc catctgattc tgagttgaat caaggtgggg
     3841 caagagggag ggagagcctg aagtggggcc agggtgcagg tggagagaac agagaagatg
     3901 aagatgaggg ttaagaaggg accacaccca tggaagaaac acaaaagact tctcagtttg
     3961 gtaaaattgt aacagtccta ataaaaagaa agaaacaccc agtaggcagc agtaacaaca
     4021 actgagactc atggggcaaa ggtggctcac ctctgcagaa gctgtcctgc ccttctctca
     4081 ctcagtcctc tacacaagag cagcatgtcc ccaagcccaa cgtacaggtt caaaagatag
     4141 aacttaaaaa atttgaacct aaactgaggt ggaaaagaca cagttagcta ggattgacac
     4201 attggactct atcaccagca ttcaggaggg agggaacatg gctccctagg aggcctgcca
     4261 gaattacaaa gtgaaactca tctcaaaaaa ggaacaacag aaaataaaat ttcaaattga
     4321 tttctcttag accataagag tccagatctg tatccaaagc tatttggtta tattttttgt
     4381 tattgttgtt ttgtttacac attgtgtttt tctttcggtt tgtaagtctg tttgggatat
     4441 ttaatttaca tttactgatt agtgtgggtg gtagggcata ccatggctca aatgtggaaa
     4501 ccaaagaaaa gcttttggaa gtgtcatctc ccttacaata cgtgtgtcca agaactcaaa
     4561 ttcagacaat aaagcttgat agcaagcact tctacctact gagacatcta actggccaat
     4621 ttagggagtt tattttaatt tatttactta ctaatttata tgaatataag tcctctatct
     4681 gcatggccac ctgcgtggca gacgaaggca tcagatcact ttacagaagg ttgagtccac
     4741 ccagtggtgg atggaaattg aactcaggac ttctagaagc cgtcaaattt tgagccacct
     4801 cttcaacccc ttaaacaagt ttcttaaggt caccctttcc tcaaatgaaa caacaaggac
     4861 ttggaatatt ttaacataac ctgagtcctc ctacctgagg tgttgtttct acaagcctgg
     4921 caggcaactg atctacctcc aacatacact ttccaacagt cttgctttct catccacacc
     4981 ttaatcacct gacacctgtt ggcctcagcc cctgtgccag gtaagtccat tttgtctgac
     5041 tcagtcagtc tgggagacaa aaatcccttt gacagaattc
//
LOCUS       ECOUGRE       108 bp ds-DNA             SYN       12-JUL-1990
DEFINITION  Synthetic uteroglobin (UG) mRNA expressed in E.coli, 5' end.
ACCESSION   M34596
KEYWORDS    uteroglobin.
SOURCE      E.coli DNA, clone pLE103-1.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 108)
  AUTHORS   Miele,L., Cordella-Miele,E. and Mukherjee,A.B.
  TITLE     High level bacterial expression of uteroglobun, a dimeric
            eukaryotic protein with two interchain disulfide bridges, in its
            natural quaternary structure
  JOURNAL   J. Biol. Chem. 265, 6427-6435 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         91  >   108     synthetic uteroglobin
    binding      77       81     ribosomal binding site (put.)
    signal       10       26     phi-10 promoter
BASE COUNT       37 a     23 c     19 g     29 t
ORIGIN      
        1 gatccaaatt aatacgactc actataggga gaccacaacg gtttccctct agaaataatt
       61 ttgtttaact ttaagaagga gatatacacc atggctgcag ccaagctt
//
LOCUS       HCVCG3PE    12283 bp ss-RNA             VRL       12-JUL-1990
DEFINITION  Hog cholera virus polyprotein mRNA, complete cds.
ACCESSION   M31768
KEYWORDS    envelope glycoprotein E1.
SOURCE      Hog cholera virus (strain Brescia), cDNA to viral RNA, passed in
            SK-6 cells.
  ORGANISM  Hog cholera virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Togaviridae; Mucosal disease virus group.
REFERENCE   1  (bases 1 to 12283)
  AUTHORS   Moormann,R.J.M., Warmerdam,P.A.M., van der Meer,B., Schaper,W.M.M.,
            Wensvoort,G. and Hulst,M.M.
  TITLE     Molecular cloning and nucleotide sequence of Hog cholera virus
            strain Brescia and location in the genome of the sequence encoding
            envelope protein E1
  JOURNAL   Virology 177, 184-198 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Moorman,R.J.M., 01-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        361    12057     hog cholera virus protein precursor
    matp       2428     3538     envelope glycoprotein E1 (put.)
BASE COUNT     3850 a   2559 c   3182 g   2692 t
ORIGIN      
        1 agttcattct cgtgtacatg attggacaaa tcaaaatctc aatttggttc agggcctccc
       61 tccagcgacg gccgagctgg gctagccatg cccacagtag gactagcaaa cggagggact
      121 agccgtagtg gcgagctccc tgggtggtct aagtcctgag tacaggacag tcgtcagtag
      181 ttcgacgtga gcagaagccc acctcgagat gctatgtgga cgagggcatg cccaagacac
      241 accttaacct agcgggggtc gttagggtga aatcacacca tgtgatggga gtacgacctg
      301 atagggtgct gcagaggccc actattaggc tagtataaaa atctctgctg tacatggcac
      361 atggagttga atcattttga acttttatac aaaacaaaca aacaaaaacc aatgggagtg
      421 gaggaaccgg tatacgatgt aacggggaga ccattgtttg gagacccaag tgaggtacac
      481 ccacaatcaa cattgaagct accacatgat agggggagag gcaacatcaa aacaacactg
      541 aagaatctac ctaggagagg tgactgcagg agtggcaacc acctaggccc ggttagtggg
      601 atatatgtaa agcccggccc tgtcttttat caggactaca tgggcccagt ctatcataga
      661 gcccctctgg agttttttga cgaagcacag ttctgtgagg tgaccaaaag gataggtagg
      721 gtgacaggta gtgacggaaa gctttaccat atatacgtgt gcatcgatgg ttgcatcctg
      781 ctgaagctag ccaagagggg cgagccaaga accctgaagt ggattagaaa tctcaccgac
      841 tgtccattgt gggttaccag ttgttctgat gatggtgcaa gtgcaagtaa agagaagaaa
      901 ccagatagga tcaacaaggg taaattaaag atagccccaa aagagcatga gaaggacagc
      961 aggactaagc cacctgatgc tacgattgta gtggaaggag taaaatacca ggtcaaaaag
     1021 aaaggtaaag ttaagggaaa gaatacccaa gacggcctgt accacaacaa gaataaacca
     1081 ccagaatcta ggaagaaatt agaaaaagcc ctattggcat gggcagtgat agcaattatg
     1141 ttataccaac ctgttgcagc cgaaaatata actcaatgga acctgagaga caacggtacc
     1201 aatggtatcc agcacgctat gtaccttaga ggagtcagca gaagcttgca tgggatctgg
     1261 ccagaaaaaa tatgcaaagg agtccccacc tacctggcca cagacacgga actgagagaa
     1321 atacagggaa tgatggtagc cagcgagggg acaaactata cgtgctgtaa gttacagaga
     1381 catgaatgga acaaacatgg atggtgtaac tggtataaca tagacccctg gatacagtta
     1441 atgaatagaa cccaagcaaa cttggcagaa ggccctccga gcaaggagtg cgccgtgact
     1501 tgcaggtacg ataaaaatgc tgacattaac gtggtcaccc aggccagaaa caggccaacc
     1561 accctaactg gctgcaagaa agggaccaat ttttcttttg cgggtacagt tatagagggc
     1621 ccatgtaatt tcaacgtttc tgtcgaggat atcttatatg gggatcatga gtgtggcagt
     1681 ctactccagg atacggctct atacctagta gatggaatga ccaacactat agagagagcc
     1741 aggcagggag ccgcgagggt gacatcttgg ctagggaggc aactccgcat tgccgggaag
     1801 aggttggagg gcagaagcaa aacctggttt ggtgcctatg ccctatcacc ttattgtaat
     1861 gtgacaacga aaatagggta catatggtac actaacaact gtaccccggc ttgcctcccc
     1921 aaaaatacaa agataatagg ccccggtaaa tttgacacta acgcggaaga cggaaagatt
     1981 ctccatgaga tggggggcca cctatcagaa tttctgctgc tctctctggt cgttctgtct
     2041 gacttcgccc ctgaaacagc cagcgcgtta tacctcattt tgcactacgt gatccctcaa
     2101 tcccatgaag aacctgaagg ctgtgacaca aaccagctga atttaacagt ggaactcagg
     2161 actgaagacg tgataccatc atcagtctgg aatgttggca aatatgtgtg tgttagacca
     2221 gactggtggc catatgaaac caaggtggct ttgttatttg aagaggcagg acaggtcgta
     2281 aagttagcct tgcgggcact gagggattta accagggtct ggaatagcgc atcaaccacg
     2341 gcattcctca tctgcttgat aaaagtatta agaggacagg tcgtgcaagg tgtgatatgg
     2401 ctgttactgg taactggggc acaaggccgg ctagcctgca aggaagatca caggtacgct
     2461 atatcaacaa ccaatgagat agggctacat ggggccgaag gtctcactac cacctggaaa
     2521 gaatacaacc acaatttgca actggatgat gggaccgtca aggccatctg catggcaggt
     2581 tcctttaaag tcacagcact taatgtggtt agtaggaggt atctggcatc attacataag
     2641 gacgctttac ccacttccgt gacattcgag ctcctgttcg acgggaccag cccattgacc
     2701 gaggaaatgg gagatgactt cgggttcgga ctgtgtccgt atgatacgag ccctgtagtc
     2761 aagggaaaat acaacacaac cttgttgaat ggtagtgcat tctacctagt ttgcccaata
     2821 gggtggacgg gtgttataga gtgcacggca gtgagcccga caactctgag aacagaagtg
     2881 gtaaagacct tcagaagaga gaaacccttt ccgtacagaa gggattgtgt gaccactaca
     2941 gtggaaaatg aagatctatt ctactgtaaa tgggggggca attggacatg tgtgaaaggt
     3001 gaaccagtga cctacacggg ggggccagta aaacaatgca gatggtgtgg cttcgacttc
     3061 aatgagcctg acggactccc acactacccc ataggtaagt gcattttggc aaatgagaca
     3121 ggttacagaa tagtggattc aacggactgt aacagagatg gcgttgtaat cagcacagag
     3181 gggagtcatg agtgcttgat tggtaacaca actgtcaagg tgcatgcatt agatgaaaga
     3241 ctaggcccta tgccatgcag gcctaaggag atcgtctcta gtgcgggacc tgtaaggaaa
     3301 acttcctgta cattcaacta cgcaaaaact ctgaggaaca ggtattatga gcccagggac
     3361 agctatttcc aacaatatat gctcaagggc gagtatcagt actggtttga tctggatgtg
     3421 accgaccgcc actcagatta cttcgcagaa ttcattgtct tggtggtggt ggcactgttg
     3481 ggaggaagat atgtcctgtg gctaatagtg acctacatag ttctaacaga acaactcgcc
     3541 gctggtctac agttaggcca gggtgaggta gtgttaatag ggaacttaat cacccacaca
     3601 gatattgagg ttgtagtata tttcttactg ctctatttgg tcatgagaga tgagcctata
     3661 aagaaatgga tactactgct gttccatgct atgaccaaca atccagttaa gaccataaca
     3721 gtggcactgc tcatggttag cggggttgcc aagggtggaa agatagatgg tggttggcag
     3781 cggctgccgg agaccaactt tgatatccaa ctcgcgctga cagttatagt agtcgctgtg
     3841 atgttgctgg caaagaaaga tccgactacc gtccccttgg ttataacggt ggcaaccctg
     3901 agaacggcta agataactaa tggacttagt acagatctag ccatagctac agtgtcaaca
     3961 gctttgctaa cctggaccta cattagtgac tattataaat acaagacctt gctacagtac
     4021 cttattagca cagtgacagg tatcttcttg ataagggtac tgaagggggt aggtgagtta
     4081 gatttacaca ccccaacctt accatcttac agacccctct tcttcatcct cgtgtacctc
     4141 atttccactg cagtggtaac aagatggaat ctggacatag ccggattgct gctgcagtgt
     4201 gtcccaaccc ttttaatggt tttcacgatg tgggcagaca tccttaccct gatcctcata
     4261 ctgcctactt acgagttgac aaaactatat tacctcaagg aagtgaagat tggggcagaa
     4321 aggggctggt tgtggaagac caacttcaag agggtaaatg acatatacga agttgaccaa
     4381 gctggtgagg gggtgtacct tttcccatca aaacaaaaga caggtacaat aacaggtact
     4441 atgttgccac tgatcaaagc catactcata agttgcatca gcaataagtg gcaatttata
     4501 tatctattgt acttgatatt cgaagtgtct tactaccttc acaagaagat catagatgaa
     4561 atagcaggag ggaccaactt catctcgaga cttgtagccg ctctgatcga agccaattgg
     4621 gcctttgaca acgaagaagt tagaggttta aagaagttct tcttgctgtc tagtagggtt
     4681 aaagaactga tcatcaaaca caaagtgagg aatgaagtga tggtccactg gtttggcgac
     4741 gaagaggtct atgggatgcc gaagctggtt ggcttagtca aggcagcaac actgagtaaa
     4801 aataaacatt gtattttgtg caccgtctgt gaaaacagag agtggagagg agaaacctgc
     4861 ccaaaatgcg gccgttttgg gccaccagtg acctgtggca tgaccctagc cgactttgaa
     4921 gaaaaacact ataagaggat tttctttaga gaggatcaat cagaagggcc ggttagggag
     4981 gagtatgcag ggtatctgca atatagagcc agagggcaat tattcctgag gaatctcccg
     5041 gtgctagcaa caaaagtcaa gatgctcctg gtcggaaatc ttgggacgga ggtgggggat
     5101 ttggaacacc ttggctgggt gctcagaggg cctgccgttt gcaagaaggt taccgaacat
     5161 gagaaatgca ccacatccat aatggacaaa ttaactgctt tcttcggtgt tatgccaagg
     5221 ggcaccacac ctagagcccc tgtgagattc cccacctctc tcttaaagat aagaaggggg
     5281 ctggaaactg gctgggcgta cacacaccaa ggtggcatca gttcagtgga ccatgtcact
     5341 tgtgggaaag acttactggt atgtgacact atgggccgga caagggttgt ttgccaatca
     5401 aataacaaga tgacagacga gtccgagtat ggagttaaaa ctgactccgg atgcccggag
     5461 ggagctaggt gttacgtgtt caaccgagag gcagttaata tatccgggac taaaggagct
     5521 atggtccact tacaaaaaac tggaggagaa ttcacctgtg tgacagcatc agggactccg
     5581 gccttctttg atctcaagaa cctcaaaggc tggtcagggc taccgatatt tgaggcatca
     5641 agtggaagag tagtcggcag ggttaaggtc gggaagaatg aggactctaa accaaccaag
     5701 cttatgagtg gaatacaaac agtctccaaa agtaccacag acttgacaga aatggtaaag
     5761 aaaataacaa ccatgaacag gggagaattc agacaaataa cccttgccac aggtgccgga
     5821 aaaaccacgg aactccctag atcagtcata gaagagatag gaaggcataa gagggtcttg
     5881 gtcttgatcc ctctgagggc ggcagcagag tcagtatacc aatatatgag acaaaaacac
     5941 ccaagcatag cattcaactt gaggataggg gagatgaagg aaggggacat ggccacaggg
     6001 ataacctatg cctcatatgg ttacttctgt cagatgccac aacctaagct gcgagccgcg
     6061 atggttgagt actccttcat attccttgat gagtaccact gttccacccc cgaacaattg
     6121 gctatcatgg gaaagatcca cagattttca gagaacctgc gggtagtagc catgaccgca
     6181 acaccagcag gcacggtaac aactacaggg caaaaacacc ctatagaaga atacatagcc
     6241 ccagaagtga tgaaggggga agacttaggt ccagagtact tggacatagc tggactaaag
     6301 ataccagtag aggagatgaa gagtaacatg ctggtctttg tgcccacaag gaacatggct
     6361 gtagagacgg caaagaaact gaaagctaag ggttataact caggctacta ttatagtgga
     6421 gaggatccat ctaacctgag ggtggtaaca tcacagtccc cgtacgtggt ggtagcaacc
     6481 aacgcaatag aatcaggtgt tactctccca gacttggatg tggtcgtcga cacagggctt
     6541 aagtgtgaaa agaggatacg gctgtcacct aagatgccct tcatagtgac gggcctgaag
     6601 agaatggctg tcacgattgg ggaacaagcc cagagaaggg ggagagttgg gagagtgaag
     6661 cctgggagat actacaggag tcaagaaacc cccgttggtt ccaaagatta ccattacgac
     6721 ctactgcaag cacagaggta cggtatagaa gatgggataa acatcaccaa atcttttaga
     6781 gagatgaatt atgattggag cctttatgag gaggatagtc tgatgattac acaattggaa
     6841 atcctcaaca atctgttgat atcagaagag ctaccaatgg cagtaaaaaa tataatggcc
     6901 aggactgacc acccagaacc aatccaactg gcgtacaaca gctacgaaac gcaggtgcca
     6961 gttctattcc caaaaataaa aaatggagag gtgactgaca gttacgataa ctataccttc
     7021 ctcaacgcaa gaaagctggg ggatgatgta ccaccctacg tgtatgccac agaggatgag
     7081 gacttagcgg tagagctgct gggcttagac tggccggacc ctgggaacca aggaaccgtg
     7141 gaggctggta gagcactaaa acaagtagtt ggtctatcaa cagctgagaa cgccctgtta
     7201 gtagctttat tcggctatgt aggatatcag gcactctcaa agaggcatat accagtagtc
     7261 acagacatat attcaattga agatcacagg ttggaagaca ccacacacct acagtatgcc
     7321 ccgaatgcta tcaagacgga ggggaaggag acagaattga aggagctagc tcagggggat
     7381 gtgcagagat gtatggaagc tatgactaat tatgcaagag atggcatcca attcatgaag
     7441 tctcaggcac tgaaagtgaa agaaaccccc acttacaaag agacaatgga caccgtggcg
     7501 gactatgtaa agaagttcat ggaggcactg gcggacagca aagaagacat cataaaatat
     7561 gggttgtggg ggacgcacac agccttatat aagagcatcg gtgctaggct tgggaacgag
     7621 actgcgttcg ctaccctggt cgtgaaatgg ctggcatttg ggggagaatc aatagcagac
     7681 catgtcaaac aagcggccac agacttggtc gtttactata tcatcaacag acctcagttc
     7741 ccaggagaca cggagacaca acaggaagga aggaaatttg tagccagcct actggtctca
     7801 gccctggcta cttacactta caaaagctgg aattacaata atctgtccaa gatagttgaa
     7861 ccggctttgg ctactctgcc ctatgccgcc acagctctca agctattcgc ccccactcga
     7921 ttggagagcg ttgtcatact gagtaccgca atctacaaaa cctacctatc aatcaggcgc
     7981 ggaaaaagcg atggtttgct aggcacaggg gttagtgcgg ctatggaaat catgtcacaa
     8041 aacccagtat ctgtgggtat agcggtcatg ctaggggtgg gggccgtagc ggcccacaat
     8101 gcaatcgaag ccagtgagca gaagagaaca ctactcatga aagtttttgt aaagaacttc
     8161 ttggatcagg cagccactga tgaattagtc aaggagagcc ctgagaaaat aataatggct
     8221 ttgtttgaag cagtgcagac agtcggcaac cctcttagac tggtatacca cgtttacgga
     8281 gttttttaca aagggtggga ggcaaaagag ttggcccaaa ggacagccgg taggaatctt
     8341 ttcactttga taatgtttga ggctgtggaa ctactgggag tagatagcga aggaaagatc
     8401 cgccagctat caagcaatta catactagag ctcctgtata agttccgtga cagtatcaag
     8461 tccagcgtga ggcagatggc aatcagctgg gcccctgccc cttttagttg tgattggaca
     8521 ccgacggatg acagaatagg gcttccccaa gataatttcc tccgagtgga gacaaaatgc
     8581 ccctgtggtt acaagatgaa agcagttaag aattgtgctg gggagttgag actcttagag
     8641 gaggaaggct catttctctg caggaataaa ttcgggagag gttcacggaa ctacagggtg
     8701 acaaaatact atgatgacaa tctatcagaa ataaagccag tgataagaat ggaaggacat
     8761 gtggaactct actacaaggg agccactatt aaactggatt tcaacaacag taaaacaata
     8821 ttggcaaccg ataaatggga ggtcgatcac tccactctgg tcagggtgct caagaggcac
     8881 acaggggctg gatattgtgg ggcatacctg ggtgagaaac cgaaccacaa acatctgata
     8941 gagagggact gcgcaaccat caccaaagat aaggtttgtt ttctcaagat gaagagaggg
     9001 tgtgcattta cttatgactt atcccttcac aaccttaccc ggctgattga attggtacac
     9061 aagaataact tggaagacaa agagattcct gccgttacgg tcacaacctg gctggcttac
     9121 acatttgtaa atgaagatat agggaccata aaaccagcct tcggggagaa aataacacca
     9181 gagatgcagg aggagataac cttgcagcct gctgtattgg tggatgcaac tgacgtgacc
     9241 gtgaccgtgg taggggaaac ccctactatg actacagggg agaccccaac aacgttcacc
     9301 agctcaggtc cagacccgaa aggccaacaa gttttaaaac tgggtgtagg tgaaggccaa
     9361 taccccggga ctaatccaca gagagcaagc ctgcacgaag ccatacaaag cgcagatgaa
     9421 aggccctctg tgctgatatt ggggtctgat aaagccacct ctaatagagt gaaaactgta
     9481 aagaatgtga aggtatacag aggcagggac ccactagaag tgagagatat gatgaggagg
     9541 ggaaagatcc tagtcatagc cctgtctagg gttgataatg ctctattgaa atttgtagat
     9601 tacaaaggca cctttttaac tagagagacc ctggaggcat taagtttggg taggccaaaa
     9661 aagaaaaaca taaccaaggc agaagcacag tggttgctgc gcctcgaaga ccaaatggaa
     9721 gagctacccg attggttcgc agccggggaa cccatttttc tagaggccaa tattaaacat
     9781 gacaggtatc atctggtagg ggatatagct actatcaaag agaaagccaa acaattgggg
     9841 gctacagact ctacaaagat atccaaggag gttggtgcaa aagtatattc tatgaaattg
     9901 agtaattggg tgatgcaaga agaaaacaaa cagagcaact tgaccccctt atttgaagag
     9961 ctcctacagc agtgtccacc cggaggccaa aacaaaactg cacatatggt ctctgcttac
    10021 caactagctc aagggaactg gatgccaacc agctgccatg tttttatggg gaccatatct
    10081 gccagaagga ctaagaccca tccatatgaa gcatatgtca agttaaggga gttggtagag
    10141 gaacacaaga tgaaaacatt gtgtcccgga tcaagtctgc gtaacgacaa tgaatgggta
    10201 attggcaaga tcaaatacca gggcaacctg aggaccaaac acatgttgaa ccccggcaag
    10261 gtggcagagc aactgcacag agaaggacac agacacaatg tgtataacaa gacaataggc
    10321 tcagtgatga cagctactgg catcaggttg gagaagttgc ccgtggttag ggcccagaca
    10381 gacacaacca acttccacca agcaataagg gataagatag acaaggaaga gaatctacag
    10441 accccgggtt tacataagaa actaatggaa gttttcaatg cattgaaacg acccgagtta
    10501 gagtcctcct atgacgctgt ggaatgggag gaattggaga gaggaataaa cagaaagggt
    10561 gctgctggtt tctttgaacg caaaaacata ggggagatat tggattcaga gaaaattaaa
    10621 gtagaagaga ttattgacaa tctgaaaaag ggtagaaata tcaaatacta tgaaaccgca
    10681 atcccaaaaa atgaaaagag ggatgtcaat gatgactgga ccgcaggtga ctttgtggac
    10741 gagaagaaac ccagagtcat acaataccct gaagcaaaaa caaggctggc catcaccaag
    10801 gtgatgtata agtgggtgaa gcagaagcca gtagtcatac ccgggtatga agggaagaca
    10861 cctctgttcc aaatttttga caaagtaaag aaggaatggg atcaattcca aaatccagtg
    10921 gcagtgagct tcgacactaa ggcgtgggac acccaggtga ccacaaatga tctggagctg
    10981 ataaaggaca tacaaaagta ctacttcaag aagaaatggc ataaatttat tgacaccctg
    11041 actatgcata tgtcagaagt acccgtaatc actgctgatg gggaggtgta tataaggaaa
    11101 gggcaaagag gtagtggaca gcccgacaca agcgcaggca acagcatgct aaatgtgtta
    11161 acaatggttt atgccttctg cgaggccaca ggggtaccct acaagagttt tgacagggtg
    11221 gcaaaaattc atgtgtgtgg ggacgatggt ttcctgatca cagagagagc tctcggcgag
    11281 aaattcgcaa gcaagggagt ccaaatcctg tatgaagctg ggaagcccca gaagatcact
    11341 gaaggggaca aaatgaaagt ggcctaccaa tttgctgata ttgagttttg ctcccataca
    11401 ccaatacaag taaggtggtc agataacact tctagctaca tgccagggag aaatacaacc
    11461 acaatcctgg ctaaaatggc cacaaggtta gattccagtg gtgagagggg taccatagcg
    11521 tacgagaaag cagtagcatt cagcttcctg ctaatgtatt cctggaaccc actaatcaga
    11581 aggatttgct tattggtact atcaactgaa ctgcaagtga aaccagggaa gtcaaccact
    11641 tactattatg aaggggaccc gatatctgcc tacaaggaag tcatcggcca caatcttttc
    11701 gatctcaaga gaacaagctt cgagaagctg gccaagttaa atctcagcat gtccgtactc
    11761 ggggcctgga ctagacacac cagcaaaaga ctactacaag actgtgtcaa tatgggtgtt
    11821 aaagagggca actggttagt caatgcagac agactggtga gtagtaagac tggaaatagg
    11881 tatgtacctg gagaaggcca caccctgcaa gggagacatt atgaagaact ggcgttggca
    11941 agaaaacaga tcaacagctt ccaagggaca gacaggtaca atctaggccc aatagtcaac
    12001 atggtgttaa ggaggctgag agtcatgatg atgaccctga tagggagagg ggtatgagtg
    12061 cgggtgaccc gcgatctgga cccgtcagta ggaccctatt gtagataaca ctaatttttt
    12121 atttatttag atattactat ttatttattt atttatttat tgaatgagta agaactggta
    12181 caaactacct catgttacca cactacactc attttaacag cactttagct ggaaggaaaa
    12241 ttcctgacgt ccacagttgg actaaggtaa tttctaacgg ccc
//
LOCUS       HUMC6A2A1    2159 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human alpha-2 collagen type VI, alpha-2 collagen type VI-a, and
            alpha-2 collagen type VI-a' gene, exons 6,5,4 and 3.
ACCESSION   M34571
KEYWORDS    alpha-2 collagen type VI; alternative splice.
SEGMENT     1 of 3
SOURCE      Human leukocyte DNA, clone D1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2159)
  AUTHORS   Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L.
  TITLE     Alternative splicing of the human alpha-2(VI) collagen gene
            generates multiple mRNA transcripts which predict three protein
            variants with distinct carboxyl termini
  JOURNAL   J. Biol. Chem. 265, 6473-6480 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /    75      120     alpha-2 collagen type VI, exon 6
                273      425     alpha-2 collagen type VI, exon 5
                590     1042     alpha-2 collagen type VI, exon 4
               1307  +  1345     alpha-2 collagen type VI-a, exon 3
    pept    /    75      120     alpha-2 collagen type VI-a, exon 6
                273      425     alpha-2 collagen type VI-a, exon 5
                590     1042     alpha-2 collagen type VI-a, exon 4
               1307  +  1345     alpha-2 collagen type VI-a', exon 3
    pept    /    75      120     alpha-2 collagen type VI-a', exon 5
                273      425     alpha-2 collagen type VI-a', exon 4
                590     1042     alpha-2 collagen type VI-a', exon 3
               1307  +  1345     alpha-2 collagen type VI-a', exon 2
    pre-msg <     1  >  2159     alpha-2cVI mRNA and introns
    IVS     <     1       74     intron E
    IVS         121      272     intron D
    IVS         426      589     intron C
    IVS        1043     1306     intron B
    IVS        1346  >  2159     intron A
BASE COUNT      391 a    653 c    749 g    366 t
ORIGIN      
        1 tgtccggacc ccagccagac tgctgtgaac tcttctgggc ccggggactg ccctgcctgc
       61 cgtgtgcatt gcaggagtgt gacgtcatga cctacgtgag ggagacctgc gggtgctgcg
      121 gtgaggcact gcccacggca gggtcggggc ccatgcaccg ggtggagggc gggagtgcag
      181 cagggctggg tcatcgctgg gtcctgcatg tgcacgtgac cctagggtct gaggtctccc
      241 ggtacccccc gatgaccctg ccaccccccc agactgtgag aagcgctgtg gcgccctgga
      301 cgtggtcttc gtcatcgaca gctccgagag cattgggtac accaacttca cactggagaa
      361 gaacttcgtc atcaacgtgg tcaacaggct gggtgccatc gctaaggacc ccaagtccga
      421 gacaggtcac ggggcagggc gggtgcagca ttgcgggggg ccgcgggcgc gtgggaggcg
      481 atgagatggg agaagtccag acgcgtccct ccaacgaggg cctctgcatg gctggggatg
      541 ccccagaccc cgaggcctct ggcaacgacc tcacgcgtgc ggcttgcagg gacgcgtgtg
      601 ggcgtggtgc agtacagcca cgagggcacc tttgaggcca tccagctgga cgacgaacat
      661 atcgactccc tgtcgagctt caaggaggct gtcaagaacc tcgagtggat tgcaggtggc
      721 acctggacac cctcagccct caagtttgcc tacgaccgcc tcatcaagga gagccggcgc
      781 cagaagacac gtgtgtttgc ggtggtcatc acggacgggc gccacgaccc tcgggacgat
      841 gacctcaact tgcgggcgct gtgcgaccgc gacgtcacag tgacggccat cggcatcggg
      901 gacatgttcc acgagaagca cgagagtgaa aacctctact ccatcgcctg cgacaagcca
      961 cagcaggtgc gcaacatgac gctgttctcc gacctggtcg ctgagaagtt catcgatgac
     1021 atggaggacg tcctctgccc gggtgtacgt gtgggcgcgg ggcagtcagg ccgaggagca
     1081 gcaggcccca gccgcgtcta gcgtgaccgc cagggacacc cctcacctga gggacgaatg
     1141 tgcagcccaa ggatcttggg ctgtgggtgg gaaggggtcg gcctctcggg ctgcagggca
     1201 gacgcgccag ctcgaccctg agcctgtcta ggcagatcag tgaacggccg ctgagggttc
     1261 gctagggact gaccctggcc tggccggcct ctctcctctc ttccagaccc tcagatcgtg
     1321 tgcccagacc ttccctgcca aacaggtaat gcagggacct gagccaccac cccagactag
     1381 caaagcagcc ctggtgtcct tcctcctcga gggccgggct gggggagggg ccgtgcaggg
     1441 acccgggggc ggcggacgac tgcggaggct gctccttagg gagatggccc caggatggca
     1501 gcacagggga ggaggggctt ggggaaggca ggctcccagg aacgcaggaa cagcatcacg
     1561 aggccatgag gtgggtgctg ctagcctggc gctgtgctcg gcatgtggcc actggtcttg
     1621 aaggcccacc atgggcttgc agtctccctc agctgccgcc cagctcccat gggctggccg
     1681 tgcatgtgcc accggaggaa gccctggatc agtgagtgaa accatcccgg ggtggaagca
     1741 ctgacacccc ccagcaccag caggtcttgc tccaaccctg gcctgcctcg atcgagctgc
     1801 agctgcggct ctcatctctg ggagtggggg agcccatgtc cggatgattg gcccagcgtg
     1861 gtgtgaagct ggagctgggg gtgccgttca gctgctgctg gactggtgct gcccccatgg
     1921 tgcactgctg caaccgttgc tgggcccaca ggaggtcccc gggggcggtt atgtagctga
     1981 gtccccctca ttgagccgtc cccttccagg agtgtgaggg tggggatgcc atggagacag
     2041 ggtgggaggg tccagactga gaggaccaca gggtaggaaa cctccaaggg tctgctggta
     2101 ctaagtcagc ccttctcagc actcgggatc gcgatgtgcg atcgagagtc catggggag
//
LOCUS       HUMC6A2A2    1348 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human alpha-2 collagen type VI and alpha-2 collagen type VI-a gene,
            exons 2a and 2b.
ACCESSION   M34572
KEYWORDS    alpha-2 collagen type VI; alternative splice.
SEGMENT     2 of 3
SOURCE      Human leukocyte DNA, clone D1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1348)
  AUTHORS   Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L.
  TITLE     Alternative splicing of the human alpha-2(VI) collagen gene
            generates multiple mRNA transcripts which predict three protein
            variants with distinct carboxyl termini
  JOURNAL   J. Biol. Chem. 265, 6473-6480 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +   437      462     alpha-2 collagen type VI, exon 2a
    pept    +   730     1025     alpha-2 collagen type VI-a, exon 2b
    pre-msg <     1     1336     alpha-2cVI mRNA and introns
    IVS     <     1      436     intron A (alt. splice site)
    IVS     <     1      729     intron A (alt. splice site)
    signal     1285     1291     polyA signal
BASE COUNT      238 a    457 c    398 g    255 t
ORIGIN      About 1.0 kbp downstream from segment 1.
        1 tctggctact ggtgacacac tgctgtgcct gccctggcct tctccagaca gccctgtcca
       61 cccaaagccc agccaccctg gcctgcagca ggcctgtgga gttctcagtt gcgtggggac
      121 cagagggtgc tggagaaaca aaccagacgc agctgaaggc agtcagggca gggcgcaatc
      181 agcgataaga gctgcatagg ggccacagcg taacctgagc tccagtcggt ggaaagaaaa
      241 ggcagagacg ttgcagaggc caggtctgct caggggaaga cagttctggg tgtagaggac
      301 tcacatccca gagaggctga ggaagggttt accacgcaag cttctcattc gggactcttg
      361 aggggtggct ggggtcttcc tggcgacggg ctgcggcact gaagccctac tggagtttgg
      421 cctgtctccg gcacaggttt ggacggagct gttttgtgct gaaaggtttt ctcggggtcc
      481 gtggtgtccc ccaaaggtgc caccgtgcgg gtctcctagc tccctgccag cttcctgtcc
      541 ctgtgctcac tgcccccacg cctcctgcca aggccgagcc acacacccgc tccacctgca
      601 tttcctctac cgactcgcca gcccaaatgc cgctcttcac tctggcctcg ctgagcggct
      661 gcccgaggag gagctctagg ccgacgccca ccgcaggcct tacagtcgtc tctggacgct
      721 cccttgcaga tgcaccgtgg cctggcggcg agcccccggt caccttcctc cgcacggaag
      781 aggggccgga cgccaccttc cccaggacca ttcccctgat ccaacagttg ctaaacgcca
      841 cggagctcac gcaggacccg gccgcctact cccagctggt ggccgtgctg gtctacaccg
      901 ccgagcgggc caagttcgcc accggggtag agcggcagga ctggatggag ctgttcattg
      961 acacctttaa gctggtgcac agggacatcg tgggggaccc cgagaccgcg ctggccctct
     1021 gctaaagccc gggcacccgc ccagccgggc tgggccctcc ctgccacact agcttcccag
     1081 ggctgccccc gacaggctgg ctctcagtgg aggccgagag atctggaatc ggggtcagcg
     1141 gggctacagt ccttccaggg gctctggggc agctcccagc ctcttcccat gctggtggcc
     1201 accgtgtccc ttgctgcggc tgcatcttcc agtctctcct ccgtcttcca gtggccgctc
     1261 tctttataag aaccctggtc attgaattta aggcccaccc caagtccaga atgacctcgc
     1321 aagaccctta actcactccc gtctgcag
//
LOCUS       HUMC6A2A3    1174 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human alpha-2 collagen type VI-a' gene, exon 1.
ACCESSION   M34573
KEYWORDS    alpha-2 collagen type VI; alternative splice.
SEGMENT     3 of 3
SOURCE      Human leukocyte DNA, clone D1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1174)
  AUTHORS   Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L.
  TITLE     Alternative splicing of the human alpha-2(VI) collagen gene
            generates multiple mRNA transcripts which predict three protein
            variants with distinct carboxyl termini
  JOURNAL   J. Biol. Chem. 265, 6473-6480 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +   140      738     alpha-2 collagen type VI-a', exon 1
    pre-msg <     1     1028     alpha-2cVI mRNA and introns
    IVS     <     1      139     intron A (alt. splice site)
    signal     1010     1015     polyA signal
BASE COUNT      189 a    439 c    364 g    182 t
ORIGIN      About 2.1 kbp downstream of segment 2.
        1 ctgcagaaac gccccgcaga gcccagtggt ctgtgaggtt gcaggcaggg tgcgaatgga
       61 agggacaggt gcggggctgg cacctgcccg gtcctgccca cctctcctcc gcccagcccg
      121 cacctgcggt ctcccacaga gctgtccgtg gcacagtgca cgcagcggcc cgtggacatc
      181 gtcttcctgc tggacggctc cgagcggctg ggtgagcaga acttccacaa ggcccggcgc
      241 ttcgtggagc aggtggcgcg gcggctgacg ctggcccgga gggacgacga ccctctcaac
      301 gcacgcgtgg cgctgctgca gtttggtggc cccggcgagc agcaggtggc cttcccgctg
      361 agccacaacc tcactgccat ccacgaggcg ctggagacca cacaatacct gaactccttc
      421 tcgcacgtgg gcgcaggcgt ggtgcacgcc atcaatgcca tcgtgcgcag cccgcgtggc
      481 ggggcccgga ggcacgcaga gctgtccttc gtgttcctca cggacggcgt cacgggcaac
      541 gacagtctgc acgagtcggc gcactccatg cgcaacgaga acgtggtacc caccgtgctg
      601 gccttgggca gcgacgtgga catggacgtg ctcaccacgc tcagcctggg tgaccgcgcc
      661 gccgtgttcc acgagaagga ctatgacagc ctggcgcaac ccggcttctt cgaccgcttc
      721 atccgctgga tctgctagcg ccgccgcccg ggccccgcag tcgagggtcg tgagcccacc
      781 ccgtccatgg tgctaagcgg gcccgggtcc cacacggcca gcaccgctgc tcactcggac
      841 gacgccctgg gcctgcacct ctccagctcc tcccacgggg tccccgtagc cccggccccc
      901 gcccagcccc aggtctcccc aggccctccg caggctgccc ggcctccctc cccctgcagc
      961 catcccaagg ctcctgacct acctggcccc tgagctctgg agcaagccca ataaaggctt
     1021 tgaacccatt gcgtgcctgc gagcttctgt gcgcaggaga gacctcaaag gtgtcttgtg
     1081 gccaggaggg aaacactgca gctgtcgctc gcccaccagg gtcaatggct cccccgggcc
     1141 cagcctgacc tcctaggaca tcaactgcag gtgc
//
LOCUS       HUMC6A2AA     888 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human alpha-2 collagen type VI mRNA, 3' end.
ACCESSION   M34570
KEYWORDS    alpha-2 collagen type VI.
SOURCE      Human fibroblast, cDNA to mRNA, clone F221.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 888)
  AUTHORS   Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L.
  TITLE     Alternative splicing of the human alpha-2(VI) collagen gene
            generates multiple mRNA transcripts which predict three protein
            variants with distinct carboxyl termini
  JOURNAL   J. Biol. Chem. 265, 6473-6480 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       54     alpha-2 collagen type VI
BASE COUNT      136 a    329 c    249 g    174 t
ORIGIN      
        1 gtgtgcccag accttccctg ccaaacaggt ttggacggag ctgttttgtg ctgaaaggtt
       61 ttctcggggt ccgtggtgtc ccccaaaggt gccaccgtgc gggtctccta gctccctgcc
      121 agcttcctgt ccctgtgctc actgccccca cgcctcctgc caaggccgag ccacacaccc
      181 gctccacctg catttcctct accgactcgc cagcccaaat gccgctcttc actctggcct
      241 cgctgagcgg ctgcccgagg aggagctcta ggccgacgcc caccgcaggc cttacagtct
      301 tctctggacg ctcccttgca gatgcaccgt ggcctggcgg cgagcccccg gtcaccttcc
      361 tccgcacgga agaggggccg gacgccacct tccccaggac cattcccctg atccaacagt
      421 tgctaaacgc cacggagctc acgcaggacc cggccgccta ctcccagctg gtggccgtgc
      481 tggtctacac cgccgagcgg gccaagttcg ccaccggggt agagcggcag gactggatgg
      541 agctgttcat tgacaccttt aagctggtgc acagggacat cgtgggggac cccgagaccg
      601 cgctggccct ctgctaaagc ccgggcaccc gcccagccgg gctgggccct ccctgccaca
      661 ctagcttccc agggctgccc ccgacaggct ggctctcagt ggaggcccag agatctggaa
      721 tcggggtcag cggggctaca gtccttccag gggctctggg gcagctccca gcctcttccc
      781 atgctggtgg ccaccgtgtc ccttgctgcg gctgcatctt ccagtctctc ctccgtcttc
      841 cagtggccgc tctctttata agaaccctgg tcattgaatt taaggccc
//
LOCUS       PPH47CG      7726 bp ds-DNA             VRL       12-JUL-1990
DEFINITION  Human papillomavirus type 47 (HPV-47) +-sense strand.
ACCESSION   M32305
KEYWORDS    .
SOURCE      Human papillomavirus type 47 DNA, clone pTZ18R.
  ORGANISM  Human papillomavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Papillomavirus.
REFERENCE   1  (bases 1 to 7726)
  AUTHORS   Kiyono,T., Adachi,A. and Ishibashi,M.
  TITLE     Genome organization and taxonomic position of human papillomavirus
            type 47 inferred from its DNA sequence
  JOURNAL   Virology 177, 401-405 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            T.Kiyono, 23-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        966      981     E1/E4 fusion protein, exon 1
               3324     4000     E1/E4 fusion protein, exon 2
    pept        208      678     ORF E6
    pept        668      979     ORF E7
    pept        966     2783     ORF E1
    pept       2725     4245     ORF E2
    pept       3086     4000     ORF E4
    pept       4334     5890     ORF L2
    pept       5903     7447     ORF L1
    pre-msg     198     4465     HPV-47-1 mRNA and intron
    IVS         982     3323     HPV-47-1 intron
    pre-msg <     1     4465     HPV-47-2 mRNA and intron
    IVS        1359     2677     HPV-47-2 intron
    signal     4424     4429     polyA signal
BASE COUNT     2369 a   1517 c   1727 g   2113 t
ORIGIN      
        1 aacggtaagt ttgcattaat gtaccaggtg cggtacagat catttcacaa tggatattat
       61 tgttgccaac taccatagtc ataatcaagt tcttgcctgt atcgttttcg taccttacct
      121 acagtatttt atattaatat ataaataaat aaatatataa atgtgtattt atttctcagg
      181 ctcagttctt tgcaattatt aagacaaatg gctcagaagg ctttggaaca gactacagtt
      241 aaagaggaaa agctagaact acctactact attagaggct tagctcaatt gttagacata
      301 cctttagtag attgtttgct accttgcaac ttttgtggca gatttcttga ctatttagaa
      361 gtttgtgaat ttgattataa aaagcttact ttaatttgga aagactacag tgtttatgcc
      421 tgctgccgtt tgtgctgctc agcaactgcc acatatgaat ttaatgtttt ttatcaacaa
      481 acagtgttag gtagagatat tgagctagct acaggccttt ccatttttga gattgacata
      541 aggtgtcata cctgcctgtc atttcttgac attattgaaa agttagatag ctgtggaaga
      601 ggacttccct ttcacaaagt aagaaacgcc tggaagggtg tttgtaggca gtgtaagcat
      661 ttttacaatg attggtaaag aggtcaccgt gcgagatatt gttctggagt taagtgaggt
      721 tcaacctgaa gtattaccag ttgacctgtt ttgcgacgag gaattaccaa atgaacaaca
      781 ggcggaggag gagctagaca tcgacagagt cgttttcaaa gtgattgcac cgtgcggttg
      841 cagctgctgc gaggtcaagc ttcgcatttt tgtgaacgca acaaaccgtg gcatcaggac
      901 atttcaggaa cttttgactg gtgatctgca gctcctctgc ccagagtgcc gtgggaactg
      961 caaacatggc ggattctaaa ggtagtacat ctaaagaagg gtttggtgat tggtgtattt
     1021 tggaagctga ctgtagtgat gttgaggatg atttgggaca attatttgag agagatacag
     1081 actcagatat ctcggacctg ttagacaatt gtgacctgga tcagggcaat tcacgggaac
     1141 tatttcatca acaggagtgt aagcaaagcg aggagcaatt acaaaaacta aaacgaaagt
     1201 atcttagtcc aaaagctgtc gcgcagctta gtccgcgtct tgagtcaatt tcattgtcac
     1261 ctcagcagaa atccaagaga aggctctttg cagagcaaga cagcggactc gagttaacct
     1321 ttaacaatga agctgaagat gttactcctg aggtggaggt accggctata gactctcggc
     1381 cggatgatga tgagggagga tcaggggatg tagatattca ttatacagca ttgttgcgtt
     1441 ccagcaacca aaaggccaca ttactggcaa aattcaaaca agcgtttggg gtaggcttta
     1501 atgaattgac aagacaattc aaaagctaca aaacctgctg taatcattgg gttgtatccg
     1561 tatatgcagt ccatgatgat ctatttgaaa gctcaaagca gctgttgcaa cagcattgtg
     1621 actatatatg ggtccgtggg atagatgcaa tgtcattata tctattgtgt tttaaggcgg
     1681 gaaaaaatcg tgggacagtt cataagctaa ttaccacaat gttaaatgtg catgagcaac
     1741 agatattgtc tgagcctcca aagttaagaa atacagctgc tgcattattt tggtacaaag
     1801 gatgtatggg acctggagtg ttcacccacg gtccttaccc tgaatggatt gcacaattaa
     1861 ccattttggg ccataagagt gctgaggcaa gtgcgtttga tctgtcagtc atggttcaat
     1921 gggcatttga taacaatctg tttgaggagg cagacattgc atacggatat gcaagactgg
     1981 caccagagga tagcaatgca gttgcatggc ttgcacataa taaccaagct aaatatgtta
     2041 gagaatgtgc tatgatggtt cgatactaca aaaaggggca aatgagagat atgagcatgt
     2101 ctgagtggat atatacaagg atacatgaag tagagggaga aggacagtgg tctagcattg
     2161 ttaaattttt aagatatcaa gaaataaatt ttatttcatt tttggctgct ttaaaagatt
     2221 tattacattc agtacctaaa cgcaattgta ttttattcca tggccctcca aatacaggaa
     2281 agtcatcgtt tggaatgtcc ttaataaaag ttctaagggg gagagtatta tcatttgtaa
     2341 actccaaaag tcagttttgg ttgcagcctc ttggagaatg taaaatagca ttattagatg
     2401 atgttacaga tccatgttgg gtgtatatgg atcaatattt aagaaatggg ttagatgggc
     2461 attttgtgtc tttggattgt aaatatagag cacccatgca aacaaagttt ccacctttaa
     2521 tacttacatc taatattaat gtacatgcag agaccaatta tagataccta catagtagaa
     2581 ttaagggttt tgaatttaaa aatccatttc ctatgaaagc agataataca cctcaatttg
     2641 agttaactga ccaaagctgg aaatcttttt ttacaaggct ttggacacac ttagacctga
     2701 gtgaccaaga agacgagggc gaacatggag aatctcagcg agcgtttcaa tgctctgcaa
     2761 gaacagctaa tgaacattta tgaagctgca gaacagacat taaaggcaca aattttacat
     2821 tggcagacat tgcgaaaaga agctgtgaca ctctactttg ctaggcagaa aggcataaat
     2881 aggttgggat accaaccagt gcctgcatta gcaatatctg aggcaagggc caaagaggct
     2941 atatatatgg tgttgcagtt agagtcgcta caaaaatcag cgtttgcttt ggagccttgg
     3001 accttagtgg acactagtac agagactttt aagagtgctc cagaaaatca ttttaaaaag
     3061 gggcctgtac ctgtggaggt gatatatgac aaagatgaag caaatgctaa tttgtatact
     3121 atgtggacat ttgtgtatta catggattca gatgatgtgt ggcataagac aacaagtggg
     3181 gtcaatcaaa ctggcattta ctacctatat ggaacattta aacactatta tgtgttattt
     3241 gctgatgatg caaagagata tagtgctact ggagaatggg aagttaaagt taataaggaa
     3301 actgtgttta ctcctgtcac tagctccaca ccaccagggt caccaggagg acaaacagac
     3361 ccagacacct cctccaagac ccccaccacc accacagccg ccactgacac ctcgcccaga
     3421 cgccaatcca tcaataaaca gtcacaacaa accgaaacca aacgaagagg gtacggacgg
     3481 agaccatcaa gcagaacaag gcgaccgcaa acgcaccaaa ggcgatccag atccagatcc
     3541 cggtcgcggt ccagttctca aacccactct tccaccacca ccaccaccac cacctacagg
     3601 tccaggtcta cgtcgctcaa caagactcgt gctcgttcca ggtcaaggtc cacctccaga
     3661 tctaccagca ccaccagtag aaggggaggt agagggtcat ccacaaggca aagatcgcga
     3721 tcaccctcca cctacacctc aaaacggtca cgggaaggaa acacaagggg cagagggagg
     3781 gggagacaag ggagagcagg gagcagtggg gggagagagc agcgacggag aaggagatca
     3841 ttctcaacct cccctgactc ctccaaacga gtcagacggg agtctcctaa ataccgtggc
     3901 gtgtctccta gcgaggtggg aaagcaactt cgatcagttg gtgcaaaaca ttcagggcga
     3961 cttggaaggt tattggagga agctagggac cccccagtaa ttcttgtgcg aggggacgca
     4021 aacacattaa aatgctttcg caacagagca aggaacaaat atagagggct ttttagatca
     4081 ttcagcacta cattttcctg ggtagctgga gatagcattg agcgtctagg caggtccaga
     4141 atgctcatta gcttttcctg cctcactcag agaagggatt ttgatgatgc tgtcaaatat
     4201 ccaaaaggag tcgagtggtc atatggtagt cttgatagcc tttaacaagc attaacgctg
     4261 ctttgctact aactgctatt aacaaccaca gctttttttt tacgtttttt tattttactg
     4321 attttgtact gcaatggcgc gtgctagaag ggtcaaacgt gactctgtaa cacatatata
     4381 tcagacctgc aaacaggcag gcacttgccc ctcggacgtt gttaataaag ttgagcaaac
     4441 aacagttgct gacaatattt tgaaatatgg cagtgctggt gtcttttttg gaggccttgg
     4501 cataggaaca ggccgaggga ctgggggtgc tactgggtac gtgccacttg gggaaggtcc
     4561 tggtgtccgt gtgggaggaa ccccaacggt tgtaaggcct tctcttgttc ctgaagcaat
     4621 tggaccagtt gatattttac ccattgacac aatcgcacct gtcgagccta ctgcttcatc
     4681 tttagtccca ttaacagagt cgtctggtgc tgatttactt cccggtgaag ttgaaactat
     4741 agccgaaata catcctattc ctgaaggtcc gacaatcgac tcccctgtag tcaccacaac
     4801 gacaggttcc agtgctgttc tggaagtggc tccagaacct gtacccccta cacgtgttag
     4861 aattgctaga acacaatatc ataatccctc ttttcagata ctcactgaat caacacctgc
     4921 gcagggcgag agttctcttg ctgaccatat tttggtcacc tcagggtctg gtggacaaag
     4981 gataggcggt gatataacag acgaaattga acttactgag tttccaagca gatatacatt
     5041 tgaaatagaa gaacccaccc ctccacgaaa aagtagcaca ccattacaaa ctgtagcctc
     5101 tgcagtaagg cgacggggct tctcattaac aaatagaaga ttggtacaac aagtagctgt
     5161 agacaatcct ttatttttaa gtcaaccttc taagatggta agattctcat ttgacaatcc
     5221 agcttttgaa gaagaggtta ccaatatttt tgaacaggat gttaacagct ttgaagaacc
     5281 tccagacagg gattttcttg atattaaaca attgggccgt cctcaatatt ctacaacacc
     5341 agcaggttat attagggtaa gcagactagg aactcgaggc accattcgca ctcgttctgg
     5401 tgcacaaata ggttctcagg tacactttta tagagattta agttctataa atactgagga
     5461 tccaatagaa ctacagcttt tagggcagca ttctggagat gctactattg ttcaaggtcc
     5521 tgtagaaagc acatttatag atatggacat tgctgaaaac cctttatctg aaacaataga
     5581 tgcttcatct aatgatttac ttttggatga gactgtggag gattttagtg ggtcccaatt
     5641 agtaattgga aatcgaagga gtacaacatc atatactgtt cccagatttg agactactag
     5701 aagtagttcc tattatgttc aagacacaga tggttattat gttgcttacc cagagtcacg
     5761 ggacactatt gatattattt accctacacc tgaattacct gtagttgtca ttcacaccca
     5821 tgacaattct ggagactttt acttacatcc tagtcttaga aggcgtaagc gtaaaagaaa
     5881 atatttgtga tttgcattgc agatggcagt gtggcactcg gctaacggta aagtatacct
     5941 tcctccatca acaccagtgg ccagggttca aagcacggat gaatacatac aaaggactaa
     6001 tatctattat catgcaaata ctgaccgcct tttaacagta ggacatccat atttcaatgt
     6061 atacaataat aatggaacta cattagaggt tccaaaagta tcaggtaatc agcatagggt
     6121 gtttcgctta aaattgccag atcctaatag atttgctcta gcggacatgt ctgtatacaa
     6181 ccctgacaaa gaacgcttgg tgtgggcctg caggggtcta gaaattggaa ggggtcaacc
     6241 tttaggtgtt ggcagtactg gtcacccata ttttaataag gtaaaagata cagaaaacag
     6301 taattcctat atcacaaact caaaagatga cagacaagac acctcttttg atcctaaaca
     6361 aatacagatg tttattgtgg gctgcactcc atgtattggc gaacactggg ataaggcaga
     6421 gccttgtggg gaacagcaaa ctggtctttg tcctcctatt gaattaaaaa acacatacat
     6481 tcaggatggc gacatggcag acattggttt tggcaacatt aatttcaagg ccttacaaca
     6541 cagtaggtct gatgttagtc ttgacattgt aaatgaaact tgcaagtacc cggattttct
     6601 caaaatgcaa aatgatgttt atggggatgc ttgctttttt tatgctcgta gagagcaatg
     6661 ttatgccaga catttttttg ttagaggggg aaaaacaggt gatgacatac caggagcaca
     6721 ggttggcaat ggtaatatga aaaatcaatt ttacattcct ggtgctacgg gtcaggctca
     6781 gagcactata ggtaatgcca tgtatttccc aactgtcagt ggctcactag tctctagtga
     6841 tgctcaactg tttaacaggc cattctggct ccaaagggct cagggtcata ataatggcat
     6901 tctgtgggct aatcaaatgt ttgtcacagt tgtagacaac acaagaaata caaatttcag
     6961 catctctgtt tactctcagg caggggacat aaaggatata caggattata atgcagacaa
     7021 ttttagagag tatcaaagac atgtggagga atatgaaatt tctgtaatat tacaattgtg
     7081 caaagttcct ttaaaagcag aagttttagc acaaattaat gccatgaatt cgtctctttt
     7141 agaggaatgg cagttaggat ttgtgcctac tccagacaac cctattcagg atacatatag
     7201 atatctagaa tctttggcca ctaggtgtcc tgaaaagtct cctccaaaag agaaggttga
     7261 cccctacaaa ggtttaaact tttgggatgt cgatatgaca gagcgccttt ccctggattt
     7321 agatcaatat tcattaggta gaaagttctt attccaggct ggattacagc agacgaccgt
     7381 aaacggtaca aaaacaactc cttacagggg gtccatcaga ggaacaaagc gcaaacgaaa
     7441 aaattgaaga tgaccgtttt cggtacagat tgtttaactt ttacacagta ttcaaggaat
     7501 gtctgtttac tgtgactaag tgtaactctg ccaaagaaac aaccgcaccc ggtacacgta
     7561 ttcagcttgt tgccaaaaca gataagcttg gcagtcagaa cacaccgtgt tcgtcgcaac
     7621 acgctcggat taggtcttct gccaaaagaa atttaatctt gttatcgttt ttggcgatca
     7681 catttggcac cgcgggcagc tgttttggca ctacaagaca accgtt
//
LOCUS       RUBCG        9755 bp ss-RNA             VRL       12-JUL-1990
DEFINITION  Rubella virus complete genome encoding nonstructural protein,
            capsid protein, glycoproteins E1 and E2, complete cds.
ACCESSION   M15240 M18901 M32735
KEYWORDS    C protein; glycoprotein; glycoprotein E1; glycoprotein E2;
            hemagglutinin.
SOURCE      Rubella virus (strain Therien) cDNA to genomic RNA and cDNA to
            mRNA, clones pRUB1025[1010,1012,1002,1006,1015,1001].
  ORGANISM  Rubella virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Rubivirus.
REFERENCE   1  (bases 8155 to 9754)
  AUTHORS   Frey,T.K., Marr,L.D., Hemphill,M.L. and Dominguez,G.
  TITLE     Molecular cloning and sequencing of the region of the rubella virus
            genome coding for glycoprotein E1
  JOURNAL   Virology 154, 228-232 (1986)
  STANDARD  full staff_review
REFERENCE   2  (bases 5917 to 9754; revises [1])
  AUTHORS   Frey,T.K. and Marr,L.D.
  JOURNAL   Unpublished (1987)
  STANDARD  full staff_review
REFERENCE   3  (bases 5247 to 8366)
  AUTHORS   Frey,T.K. and Marr,L.D.
  TITLE     Sequence of the region coding for virion proteins C and E2 and the
            carboxy terminus of the nonstructural proteins of rubella virus:
            comparison with alphaviruses
  JOURNAL   Gene 62, 85-99 (1988)
  STANDARD  full staff_review
REFERENCE   4  (bases 1 to 9755)
  AUTHORS   Domminguez,G., Wang,C.-Y. and Frey,T.K.
  TITLE     Sequence of the genome RNA of rubella virus: Evidence for genetic
            rearrangement during togavirus evolution
  JOURNAL   Virology 177, 225-258 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable copy of sequence in [2] kindly
            provided by T.K.Frey, 01-JUN-1987.
            
            Draft entry and computer-readable sequence for [4] kindly submitted
            by G.Dominguez, 09-MAR-1990, for release after publication.
            
            Glycoprotein E1 contains the viral hemagglutinin activity.
            Multiple copies of the C protein comprise the nucleocapsid.
FEATURES       from  to/span     description
    pept         39     6656     nonstructural polyprotein precursor
    pept       6505     9696     structural polyprotein precursor
    matp       6505     7404     capsid protein (C)
    matp       7405     8250     glycoprotein E2
    matp       8251     9693     glycoprotein E1
    mRNA       6428     9755     subgenomic RNA
BASE COUNT     1457 a   3781 c   3007 g   1510 t
ORIGIN      
        1 atggaagcta tcggacctcg cttaggactc ccattcccat ggagaaactc ctagatgagg
       61 ttcttgcccc cggtgggcct tataacttaa ccgtcggcag ttgggtaaga gaccacgtcc
      121 gatcaattgt cgagggcgcg tgggaagtgc gcgatgttgt taccgctgcc caaaagcggg
      181 ccatcgtagc cgtgataccc agacctgtgt tcacgcagat gcaggtcagt gatcacccag
      241 cactccacgc aatttcgcgg tatacccgcc gccattggat cgagtggggc cctaaagaag
      301 ccctacacgt cctcatcgac ccaagcccgg gcctgctccg cgaggtcgct cgcgttgagc
      361 gccgctgggt cgcactgtgc ctccacagga cggcacgcaa actcgccacc gccctggccg
      421 agacggccag cgaggcgtgg cacgctgact acgtgtgcgc gctgcgtggc gcaccgagcg
      481 gccccttcta cgtccaccct gaggacgtcc cgcacggcgg tcgcgccgtg gcggacagat
      541 gcttgctcta ctacacaccc atgcagatgt gcgagctgat gcgtaccatt gacgccaccc
      601 tgctcgtggc ggttgacttg tggccggtcg cccttgcggc ccacgtcggc gacgactggg
      661 acgacctggg cattgcctgg catctcgacc atgacggcgg ttgccccgcc gattgccgcg
      721 gagccggcgc tgggcccacg cccggctaca cccgcccctg caccacacgc atctaccaag
      781 tcctgccgga caccgcccac cccgggcgcc tctaccggtg cgggccccgc ctgtggacgc
      841 gcgattgcgc cgtggccgaa ctctcatggg aggttgccca acactgcggg caccaggcgc
      901 gcgtgcgcgc cgtgcgatgc accctcccta tccgccacgt gcgcagcctc caacccagcg
      961 cgcgggtccg actcccggac ctcgtccatc tcgccgaggt gggccggtgg cggtggttca
     1021 gcctcccccg ccccgtgttc cagcgcatgc tgtcctactg caagaccctg agccccgacg
     1081 cgtactacag cgagcgcgtg ttcaagttca agaacgccct gtgccacagc atcacgctcg
     1141 cgggcaatgt gctgcaagag gggtggaagg gcacgtgcgc cgaggaagac gcgctgtgcg
     1201 catacgtagc cttccgcgcg tggcagtcta acgccaggtt ggcggggatt atgaaaggcg
     1261 cgaagtgcgc cgccgactct ttgagcgtgg ccggctggct ggacaccatt tgggacgcca
     1321 ttaagcggtt cctcggtagc gtgcccctcg ccgagcgcat ggaggagtgg gaacaggacg
     1381 ccgcggtcgc cgccttcgac cgcggccccc tcgaggacgg cgggcgccac ttggacaccg
     1441 tgcaaccccc aaaatcgccg ccccgccctg agatcgccgc gacctggatc gtccacgcag
     1501 ccagcgaaga ccgccattgc gcgtgcgctc cccgctgcga cgtcccgcgc gaacgtcctt
     1561 ccgcgcccgc cggccagccg gatgacgagg cgctcatccc gccgtggctg ttcgccgagc
     1621 gccgtgccct ccgctgccgc gagtgggatt tcgaggctct ccgcgcgcgc gccgatacgg
     1681 cggccgcgcc cgccccgccg gctccacgcc ccgcgcggta ccccaccgtg ctctaccgcc
     1741 accccgccca ccacggcccg tggctcaccc ttgacgagcc gggcgaggct gacgcggccc
     1801 tggtcttatg cgacccactt ggccagccgc tccggggccc tgaacgccac ttcgccgccg
     1861 gcgcgcatat gtgcgcgcag gcgcgggggc tccaggcttt tgtccgtgtc gtgcctccac
     1921 ccgagcgccc ctgggccgac gggggcgcca gagcgtgggc gaagttcttc cgcggctgcg
     1981 cctgggcgca gcgcttgctc ggcgagccag cagttatgca cctcccatac accgatggcg
     2041 acgtgccaca gctgatcgca ctggctttgc gcacgctggc ccaacagggg gccgccttgg
     2101 cactctcggt gcgtgacctg cccgggggtg cagcgttcga cgcaaacgcg gtcaccgccg
     2161 ccgtgcgcgc tggcccccgc cagtccgcgg ccgcgtcacc gccacccggc gaccccccgc
     2221 cgccgcgccg cgcacggcga tcgcaacggc actcggacgc tcgcggcact ccgccccccg
     2281 cgcctgcgcg cgacccgccg ccgcccgccc ccagcccgcc cgcgccaccc cgcgctggtg
     2341 acccggtccc tcccattccc gcggggccgg cggatcgcgc gcgtgacgcc gagctggagg
     2401 tcgcctgcga gccgagcggc ccccccacgt caaccagggc agacccagac agcgacatcg
     2461 ttgaaagtta cgcccgcgcc gccggacccg tgcacctccg agtccgcgac atcatggacc
     2521 caccgcccgg ctgcaaggtc gtggtcaacg ccgccaacga ggggctactg gccggctctg
     2581 gcgtgtgcgg tgccatcttt gccaacgcca cggcggccct cgctgcaaac tgccggcgcc
     2641 tcgccccatg ccccaccggc gaggcagtgg cgacacccgg ccacggctgc gggtacaccc
     2701 acatcatcca cgccgtcgcg ccgcggcgtc ctcgggaccc cgccgccctc gaggagggcg
     2761 aagcgctgct cgagcgcgcc taccgcagca tcgtcgcgct agccgccgcg cgtcggtggg
     2821 cgtgtgtcgc gtgccccctc ctcggcgctg gcgtctacgg ctggtctgct gcggagtccc
     2881 tccgagccgc gctcgcggct acgcgcaccg agcccgtcga gcgcgtgagc ctgcacatct
     2941 gccaccccga ccgcgccacg ctgacgcacg cctccgtgct cgtcggcgcg gggctcgctg
     3001 ccaggcgcgt cagtcctcct ccgaccgagc ccctcgcatc ttgccccgcc ggtgacccgg
     3061 gccgaccggc tcagcgcagc gcgtcgcccc cagcgacccc ccttggggat gccaccgcgc
     3121 ccgagccccg cggatgccag gggtgcgaac tctgccggta cacgcgcgtc accaatgacc
     3181 gcgcctatgt caacctgtgg ctcgagcgcg accgcggcgc caccagctgg gccatgcgca
     3241 ttcccgaggt ggttgtctac gggccggagc acctcgccac gcattttcca ttaaaccact
     3301 acagtgtgct caagcccgcg gaggtcaggc ccccgcgagg catgtgcggg agtgacatgt
     3361 ggcgctgccg cggctggcat ggcatgccgc aggtgcggtg caccccctcc aacgctcacg
     3421 ccgccctgtg ccgcacaggc gtgccccctc gggcgagcac gcgaggcggc gagctagacc
     3481 caaacacctg ctggctccgc gccgccgcca acgttgcgca ggctgcgcgc gcctgcggcg
     3541 cctacacgag tgccgggtgc cccaagtgcg cctacggccg cgccctgagc gaagcccgca
     3601 ctcatgagga cttcgccgcg ctgagccagc ggtggagcgc gagccacgcc gatgcctccc
     3661 ctgacggcac cggagatccc ctcgaccccc tgatggagac cgtgggatgc gcctgttcgc
     3721 gcgtgtgggt cggctccgag catgaggccc cgcccgacca cctcctggtg tcccttcacc
     3781 gtgccccaaa tggtccgtgg ggcgtagtgc tcgaggtgcg tgcgcgcccc gaggggggca
     3841 accccaccgg ccacttcgtc tgcgcggtcg gcggcggccc acgccgcgtc tcggaccgcc
     3901 cccacctctg gcttgcggtc cccctgtctc ggggcggtgg cacctgtgcc gcgaccgacg
     3961 aggggctggc ccaggcgtac tacgacgacc tcgaggtgcg ccgcctcggg gatgacgcca
     4021 tggcccgggc ggccctcgca tcagtccaac gccctcgcaa aggcccttac aatatcaggg
     4081 tatggaacat ggccgcaggc gctggcaaga ctacccgcat cctcgctgcc ttcacgcgcg
     4141 aagaccttta cgtctgcccc accaatgcgc tcctgcacga gatccaggcc aaactccgcg
     4201 cgcgcgatat cgacatcaag aacgccgcca cctacgagcg ccggctgacg aaaccgctcg
     4261 ccgcctaccg ccgcatctac atcgatgagg cgttcactct cggcggcgag tactgcgcgt
     4321 tcgttgccag ccaaaccacc gcggaggtga tctgcgtcgg tgatcgggac cagtgcggcc
     4381 cacactacgc caataactgc cgcacccccg tccctgaccg ctggcctacc gagcgctcgc
     4441 gccacacttg gcgcttcccc gactgctggg cggcccgcct gcgcgcgggg ctcgattatg
     4501 acatcgaggg cgagcgcacc ggcaccttcg cctgcaacct ttgggacggc cgccaggtcg
     4561 accttcacct cgccttctcg cgcgaaaccg tgcgccgcct tcacgaggct ggcatacgcg
     4621 catacaccgt gcgcgaggcc cagggtatga gcgtcggcac cgcctgcatc catgtaggca
     4681 gagacggcac ggacgttgcc ctggcgctga cacgcgacct cgccatcgtc agcctgaccc
     4741 gggcctccga cgcactctac ctccacgagc tcgaggacgg ctcactgcgc gctgcggggc
     4801 tcagcgcgtt cctcgacgcc ggggcactgg cggagctcaa ggaggttccc gctggcattg
     4861 accgcgttgt cgccgtcgag caggcaccac caccgttgcc gcccgccgac ggcatccccg
     4921 aggcccaaga cgtgccgccc ttctgccccc gcactctgga ggagctcgtc ttcggccgtg
     4981 ccggccaccc ccattacgcg gacctcaacc gcgtgactga gggcgaacga gaagtgcggt
     5041 acatgcgcat ctcgcgtcac ctgctcaaca agaatcacac cgagatgccc ggaacggaac
     5101 gcgttctcag tgccgtttgc gccgtgcggc gctaccgcgc gggcgaggat gggtcgaccc
     5161 tccgcactgc tgtggcccgc cagcacccgc gcccttttcg ccagatccca cccccgcgcg
     5221 tcactgctgg ggtcgcccag gagtggcgca tgacgtactt gcgggaacgg atcgacctca
     5281 ctgatgtcta cacgcagatg ggcgtggccg cgcgggagct caccgaccgc tacgcgcgcc
     5341 gctatcctga gatcttcgcc ggcatgtgta ccgcccagag cctgagcgtc cccgccttcc
     5401 tcaaagccac cttgaagtgc gtagacgccg ccctcggccc cagggacacc gaggactgcc
     5461 acgccgctca ggggaaagcc ggccttgaga tccgggcgtg ggccaaggag tgggttcagg
     5521 ttatgtcccc gcatttccgc gcgatccaga agatcatcat gcgcgccttg cgcccgcaat
     5581 tccttgtggc cgctggccat acggagcccg aggtcgatgc gtggtggcag gcccattaca
     5641 ccaccaacgc catcgaggtc gacttcactg agttcgacat gaaccagacc ctcgctactc
     5701 gggacgtcga gctcgagatt agcgccgctc tcttgggcct cccttgcgcc gaagactacc
     5761 gcgcgctccg cgccggcagc tactgcaccc tgcgcgaact gggctccact gagaccggct
     5821 gcgagcgcac aagcggcgag cccgccacgc tgctgcacaa caccaccgtg gccatgtgca
     5881 tggccatgcg catggtcccc aaaggcgtgc gctgggccgg gattttccag ggtgacgata
     5941 tggtcatctt cctccccgag ggcgcgcgca gcgcggcact caagtggacc cccgccgagg
     6001 tgggcttgtt tggcttccac atcccggtga agcacgtgag cacccctacc cccagcttct
     6061 gcgggcacgt cggcaccgcg gccggcctct tccatgatgt catgcaccag gcgatcaagg
     6121 tgctttgccg ccgtttcgac ccagacgtgc ttgaagaaca gcaggtggcc ctcctcgacc
     6181 gcctccgggg ggtctacgcg gctctgcctg acaccgttgc cgccaatgct gcgtactacg
     6241 actacagcgc ggagcgcgtc ctcgctatcg tgcgcgaact taccgcgtac gcgggggcgc
     6301 ggcctcgacc acccggccac catcggcgcg ctcgaggaga ttcagacccc ctacgcgcgc
     6361 gccaatctcc acgacgccga ctaacgcccc tgtacgtggg gcctttaatc ttacctactc
     6421 taaccaggtc atcacccacc gttgtttcgc cgcatctggt gggtacccaa cttttgccat
     6481 tcgggagagc cccagggtgc ccgaatggct tctactaccc ccatcaccat ggaggacctc
     6541 cagaaggccc tcgaggcaca atcccgcgcc ctgcgcgcgg aactcgccgc cggcgcctcg
     6601 cagtcgcgcc ggccgcggcc gccgcgacag cgcgactcca gcacctccgg agatgactcc
     6661 ggccgtgact ccggagggcc ccgccgccgc cgcggcaacc ggggccgtgg ccagcgcagg
     6721 gactggtcca gggccccgcc ccccccggag gagcggcaag aaactcgctc ccagactccg
     6781 gccccgaagc catcgcgggc gccgccacaa cagcctcaac ccccgcgcat gcaaaccggg
     6841 cgtgggggct ctgccccgcg ccccgagctg gggccaccga ccaacccgtt ccaagcagcc
     6901 gtggcgcgtg gcctgcgccc gcctctccac gaccctgaca ccgaggcacc caccgaggcc
     6961 tgcgtgacct cgtggctttg gagcgagggc gaaggcgcgg tcttttaccg cgtcgacctg
     7021 catttcacca acctgggcac ccccccactc gacgaggacg gccgctggga ccctgcgctc
     7081 atgtacaacc cttgcgggcc cgagccgccc gctcacgtcg tccgcgcgta caatcaacct
     7141 gccggcgacg tcaggggcgt ttggggtaaa ggcgagcgca cctacgccga gcaggacttc
     7201 cgcgtcggcg gcacgcgctg gcaccgactg ctgcgcatgc cagtgcgcgg cctcgacggc
     7261 gacagcgccc cgcttccccc ccacaccacc gagcgcattg agacccgctc ggcgcgccat
     7321 ccttggcgca tccgcttcgg tgccccccag gccttccttg ccgggctctt gctcgccacg
     7381 gtcgccgttg gcaccgcgcg cgccgggctc cagccccgcg ctgatatggc ggcacctcct
     7441 acgctgccgc agcccccctg tgcgcacggg cagcattacg gccaccacca ccatcagctg
     7501 ccgttcctcg ggcacgacgg ccatcatggc ggcaccttgc gcgtcggcca gcattaccga
     7561 aacgccagcg acgtgctgcc cggccactgg ctccaaggcg gctggggttg ctacaacctg
     7621 agcgactggc accagggcac tcatgtctgt cataccaagc acatggactt ctggtgtgtg
     7681 gagcacgacc gaccgccgcc cgcgaccccg acgcctctca ccaccgcggc gaactccacg
     7741 accgccgcca cccccgccac tgcgccggcc ccctgccacg ccggcctcaa tgacagctgc
     7801 ggcggcttct tgtctgggtg cgggccgatg cgcctgcgcc acggcgctga cacccggtgc
     7861 ggtcggttga tctgcgggct gtccaccacc gcccagtacc cgcctacccg gtttggctgc
     7921 gctatgcggt ggggccttcc cccctgggaa ctggtcgtcc ttaccgcccg ccccgaagac
     7981 ggctggactt gccgcggcgt gcccgcccat ccaggcgccc gctgccccga actggtgagc
     8041 cccatgggac gcgcgacttg ctccccagcc tcggccctct ggctcgccac agcgaacgcg
     8101 ctgtctcttg atcacgccct cgcggccttc gtcctgctgg tcccgtgggt cctgatattt
     8161 atggtgtgcc gccgcgcctg tcgccgccgc ggcgccgccg ccgccctcac cgcggtcgtc
     8221 ctgcaggggt acaacccccc cgcctatggc gaggaggctt tcacctacct ctgcactgca
     8281 ccggggtgcg ccactcaagc acctgtcccc gtgcgcctcg ctggcgtccg ttttgagtcc
     8341 aagattgtgg acggcggctg ctttgcccca tgggacctcg aggccactgg agcctgcatt
     8401 tgcgagatcc ccactgatgt ctcgtgcgag ggcttggggg cctgggtacc cgcagcccct
     8461 tgcgcgcgca tctggaatgg cacacagcgc gcgtgcacct tctgggctgt caacgcctac
     8521 tcctctggcg ggtacgcgca gctggcctct tacttcaacc ctggcggcag ctactacaag
     8581 cagtaccacc ctaccgcgtg cgaggttgaa cctgccttcg gacacagcga cgcggcctgc
     8641 tggggcttcc ccaccgacac cgtgatgagc gtgttcgccc ttgctagcta cgtccagcac
     8701 cctcacaaga ccgtccgggt caagttccat acagagacca ggaccgtctg gcaactctcc
     8761 gttgccggcg tgtcgtgcaa cgtcaccact gaacacccgt tctgcaacac gccgcacgga
     8821 caactcgagg tccaggtccc gcccgacccc ggggacctgg ttgagtacat tatgaattac
     8881 accggcaatc agcagtcccg gtggggcctc gggagcccga attgccacgg ccccgattgg
     8941 gcctccccgg tttgccaacg ccattcccct gactgctcgc ggcttgtggg ggccacgcca
     9001 gagcgccccc ggctgcgcct ggtcgacgcc gacgaccccc tgctgcgcac tgcccctgga
     9061 cccggcgagg tgtgggtcac gcctgtcata ggctctcagg cgcgcaagtg cggactccac
     9121 atacgcgctg gaccgtacgg ccatgctacc gtcgaaatgc ccgagtggat ccacgcccac
     9181 accaccagcg acccctggca tccaccgggc cccttggggc tgaagttcaa gacagttcgc
     9241 ccggtggccc tgccacgcac gttagcgcca ccccgcaatg tgcgtgtgac cgggtgctac
     9301 cagtgcggta cccccgcgct ggtggaaggc cttgcccccg ggggaggcaa ttgccatctc
     9361 accgtcaatg gcgaggacct cggcgccgtc ccccctggga agttcgtcac cgccgccctc
     9421 ctcaacaccc ccccgcccta ccaagtcagc tgcgggggcg agagcgatcg cgcgaccgcg
     9481 cgggtcatcg accccgccgc gcaatcgttt accggcgtgg tgtatggcac acacaccact
     9541 gctgtgtcgg agacccggca gacctgggcg gagtgggctg ctgcccattg gtggcagctc
     9601 actctgggcg ccatttgcgc cctcccactc gctggcttac tcgcttgctg tgccaaatgc
     9661 ttgtactact tgcgcggcgc tatagcgcct cgctagtggg cccccgcgcg aaacccgcac
     9721 taggccacta gatccccgca cctgttgctg tatag
//
LOCUS       XEL68KSA     2009 bp ss-mRNA            VRT       12-JUL-1990
DEFINITION  X.laevis 68 kDa serum albumin mRNA, complete cds.
ACCESSION   M18350
KEYWORDS    serum albumin.
SOURCE      X.laevis adult liver hepatocyte (lambda-ZAP library), cDNA to mRNA,
            clone pX1A14.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2009)
  AUTHORS   Moskaitis,J.E., Sargent,T.D., Smith,L.H.Jr., Pastori,R.L. and
            Schoenberg,D.R.
  TITLE     Xenopus laevis serum albumin: Sequence of the cDNAs encoding the 68
            and 74 kDa peptides, relationship of the 74 kDa albumin to
            alpha-fetoprotein, and the regulation of albumin gene expression by
            thyroid hormone during development
  JOURNAL   Mol. Endocrinol. 3, 464-473 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1842 to 2009)
  AUTHORS   Schoenberg,D.R., Moskaitis,J.E., Smith,L.H. and Pastori,R.L.
  TITLE     Extranuclear estrogen-regulated destabilization of Xenopus laevis
            serum albumin mRNA
  JOURNAL   Mol. Endocrinol. 3, 805-814 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by D.Schoenberg, 14-NOV-1988.  Draft entry and computer-readable
            sequence for [2] kindly provided by D.Schoenberg, 18-DEC-1989.
FEATURES       from  to/span     description
    pept         39     1859     68 kDa serum albumin precursor
    sigp         39      110     68 kDa serum albumin signal peptide
    matp        111     1856     68 kDa serum albumin
    signal     1986     1994     poly-A signal
    site          1        1     cap site
BASE COUNT      699 a    388 c    393 g    529 t
ORIGIN      87 bp upstream of HinfI site.
        1 aggcttctca gaggtcccca cccaatacat ctccagtcat gaagtggatc accctcattt
       61 gtctgttaat tagctccact ttaatagaat caagaataat tttcaaaaga gatacagatg
      121 tagaccatca caagcatatt gctgacatgt acaatttatt gactgagcgg accttcaaag
      181 gacttacatt ggctattgtc tcacagaatc tccagaaatg ttcattggag gagctgtcta
      241 aactggtgaa tgaaattaat gactttgcca aatcctgtac aggaaacgac aaaactcctg
      301 agtgtgaaaa acccataggc accctgtttt atgacaaact ctgcgcagat ccaaaagtgg
      361 gtgttaatta tgagtggagc aaagagtgct gttctaagca agatccagag agagcacagt
      421 gcttcagggc acatagagtt tttgaacata atccagtaag gcctaaacct gaggaaactt
      481 gtgcattatt caaagaacac cctgatgatc ttctctcagc attcatacat gaagaggcga
      541 gaaaccatcc agacctttat cccccagcag tactattatt aacacagcaa tatggcaaac
      601 ttgttgaaca ttgttgtgaa gaagaagaca aggataaatg ctttgcagaa aagatgaagg
      661 aactgatgaa acacagtcat tctattgaag ataagcaaaa acatttctgc tggattgtaa
      721 ataattatcc tgaaagagtt attaaagcac taaatttggc cagagtgagc cacagatatc
      781 ctaagcctga tttcaagctt gcccataaat ttaccgagga gactacacac ttcattaagg
      841 attgttgtca tggggacatg tttgaatgca tgacagagag gctggagctt tctgagcata
      901 cctgtcaaca taaagatgag ttatcaacaa aacttgaaaa atgctgtaac ttacctttgc
      961 ttgagcgtac atactgcatt gtcaccttgg aaaatgatga cgttcctgct gaattatcaa
     1021 agccaattac agaatttaca gaggaccctc atgtttgtga gaagtatgct gagaataaaa
     1081 gtttcttaga gatatctcca tggcagagtc aagaaacacc agaattgtct gaacaattcc
     1141 ttttgcaatc tgcaaaagaa tatgaatctt tgctgaacaa gtgctgcttt tcagacaatc
     1201 ctcctgaatg ctacaaggat ggagctgaca gatttatgaa tgaagccaag gagagatttg
     1261 catatttgaa acaaaactgt gatatcttgc atgaacatgg agaatatctc tttgaaaatg
     1321 aattgctcat aagatacaca aagaaaatgc cccaagtgtc agatgaaaca ttgattggaa
     1381 tagcacacca aatggcagat attggtgagc actgctgtgc cgtacctgaa aatcaaagga
     1441 tgccatgtgc agaaggagac cttaccattc tcattggaaa aatgtgtgaa aggcaaaaga
     1501 agacatttat aaataaccac gttgctcatt gctgcactga ctcatattct gggatgcgtt
     1561 catgctttac tgctcttggt ccagatgagg actatgtacc acccccagtt actgatgaca
     1621 catttcactt tgacgacaag atatgcactg ctaatgataa agaaaaacag catatcaaac
     1681 agaaattcct tgtgaagctg attaaagtta gtcctaaatt ggaaaaaaat cacattgatg
     1741 aatggctgct ggaattcctt aagatggtac agaaatgctg tactgcagat gaacaccagc
     1801 catgttttga tacagagaaa ccagtactga ttgaacactg tcaaaaactc catccataag
     1861 agtccataag agcaaagacc agtcttcaaa ctcactgagg aacaccttcc atctctcaaa
     1921 cacaagaaaa aaaagttcct tcagctgaaa agagcatttg cttagagcat tcaactgtgt
     1981 gttgtaataa ataaagcatt ttaaaaaat
//
LOCUS       XEL74KSA     1957 bp ss-mRNA            VRT       12-JUL-1990
DEFINITION  Xenopus laevis 74 kDa serum albumin mRNA, complete cds.
ACCESSION   M21442
KEYWORDS    serum albumin.
SOURCE      X.laevis adult liver hepatocyte, cDNA to mRNA, clone pX1A74.1.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1957)
  AUTHORS   Moskaitis,J.E., Sargent,T.D., Smith,L.H.Jr., Pastori,R.L. and
            Schoenberg,D.R.
  TITLE     Xenopus laevis serum albumin: Sequence of the complementary
            deoxyribonucleic acids encoding the 68- and 74-kilodalton peptides
            and the regulation of albumin gene expression by thyroid hormone
            during development
  JOURNAL   Mol. Endocrinol. 3, 464-473 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1801 to 1957)
  AUTHORS   Schoenberg,D.R., Moskaitis,J.E., Smith,L.H. and Pastori,R.L.
  TITLE     Extranuclear estrogen-regulated destabilization of Xenopus laevis
            serum albumin mRNA
  JOURNAL   Mol. Endocrinol. 3, 805-814 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by D.Schoenberg, 14-NOV-1988.  Draft entry and computer-readable
            sequence for [2] kindly provided by D.Schoenberg, 18-DEC-1989.
FEATURES       from  to/span     description
    pept    <     1     1818     74 kDa serum albumin (AA at 1)
    sigp    <     1       66     74 kDa serum albumin signal peptide
    matp         67     1815     74 kDa serum albumin
    signal     1935     1944     poly-A signal
BASE COUNT      667 a    383 c    382 g    525 t
ORIGIN      
        1 tggatcaccc tgatttgtct gttaattagc tcctctttca ttgaatcaag gatacttttc
       61 aaaagagata cagatgcaga ccatcacaag catattgctg atgtatacac cgcattgact
      121 gagcggacct tcaaaggact tacattggct attgtctctc agaatctcca gaaatgttcg
      181 ttggaggagt tatctaagct ggtgaatgaa ataaatgact ttgccaaatc ctgtattaat
      241 gacaaaactc ctgagtgtga aaaaccagtg ggcaccctgt tttttgacaa actctgtgca
      301 gatccagcag tgggtgttaa ttatgagtgg agcaaagagt gctgtgccaa gcaagatcca
      361 gagagggctc agtgcttcaa ggcgcacaga gatcatgaac atacttcaat aaagcctgaa
      421 cctgaggaaa cctgcaaatt actcaaagaa caccctgatg atcttctctc agcgttcatt
      481 catgaagagg caagaaacca tccagacctt tatccaccag cagtattagc attaaccaag
      541 caatatcaca aacttgctga acattgttgt gaagaagaag acaaggaaaa atgcttctca
      601 gaaaagatga agcaacttat gaaacaatct cattccattg aagataagca acatcatttc
      661 tgctggattc tggataattt tcctgaaaaa gttcttaaag cactaaattt ggccagagtg
      721 agccacagat atcctaaagc tgaattcaag cttgcccata attttactga ggaggttaca
      781 cactttatta aagattgttg ccatgacgac atgtttgaat gcatgactga gaggctggag
      841 cttactgagc atacctgtca acataaagat gagttatcat caaaacttga aaaatgctgt
      901 aatatacctt tgcttgagcg tacatactgc attgtcacct tggaaaatga tgacgttcct
      961 gctgaattgt ctcagccaat tacagaattt acagaggacc ctcatgtgtg tgagaagtat
     1021 gctgagaata acgaagtttt cttaggaaga tatctccatg ctgtgtcaag aaaacaccag
     1081 gaattgtctg aacaattcct tttgcaatct gcaaaagaat atgaatcttt gctgaacaag
     1141 tgctgcaaaa cagacaatcc tcctgaatgc tacaaggatg gagctgacag atttatgaat
     1201 gaagccaagg agagatttgc atatttgaaa caaaactgtg atatcttgca tgaacatgga
     1261 gaatatctct ttgaaaatga attgctcata agatacacaa agaaaatgcc ccaagtgtca
     1321 gatgaaacat tgattggaat agcacaccaa atggcagata ttggtgagca ctgctgtgcc
     1381 gtacctgaaa atcaaaggat gccatgtgca gaaggagacc ttaccattct cattggaaaa
     1441 atgtgtgaaa ggcaaaagaa gacatttata aataaccacg ttgctcattg ctgcactgac
     1501 tcatattctg ggatgcgttc atgctttact gctcttggtc cagatgagga ctatgtacca
     1561 cccccagtta ctgatgacac atttcacttt gacgacaaga tatgcactgc taatgataaa
     1621 gaaaaacagc atatcaaaca gaaattcctt gtgaagctga ttaaagttag tcctaaattg
     1681 gaaaaaaatc acattgatga atgttctgct gaattcctta agatggtaca gaaatgctgt
     1741 actgcagatg aacaccagcc atgttttgat acagagaaac cagtactgat tgaacactgt
     1801 caaaaactcc atccataaga gtccattaga gcaaaggcca gccttcaaac tcactgagga
     1861 acatcttcca tctctcacat gaaaaaagtt tcctccatct gaaaagaaaa tttgttcatt
     1921 caactgtctg ttgaaataaa taaagcgttt aaaatat
//
LOCUS       MUSHOX28A     216 bp ds-DNA             ROD       12-JUL-1990
DEFINITION  Mouse homeobox protein gene Hox-2.8, partial cds.
ACCESSION   M34004
KEYWORDS    homeobox protein.
SOURCE      Mouse (strain CBA) DNA, clone YNOTHOX-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 216)
  AUTHORS   Rubock,M.J., Larin,Z., Cook,M., Papalopulu,N., Krumlauf,R. and
            Lehrach,H.
  TITLE     A yeast artificial chromosome containing the mouse homeobox cluster
            Hox-2
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4751-4755 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Krumlauf, 04-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept    <     1  >   216     Hox-2.8 homeobox protein (AA at 1)
    site         34      216     homeobox Hox-2
BASE COUNT       51 a     68 c     66 g     31 t
ORIGIN      Chromosome 11D.
        1 ggccccggat tgccagaatg cggcggcagc ggctcccgca gactgcgcac ggcctacacc
       61 aacacgcaac tgctggagct ggagaaggag ttccacttca ataagtacct gtgccggccg
      121 cgtcgcgtcg agatcgctgc cttgctggac ctcaccgaaa ggcaggtcaa agtctggttc
      181 cagaaccgac gcatgaaaca caagcggcag acggag
//
LOCUS       MUSHOX29A     183 bp ds-DNA             ROD       12-JUL-1990
DEFINITION  Mouse homeobox protein gene Hox-2.9, partial cds.
ACCESSION   M34005
KEYWORDS    homeobox protein.
SOURCE      Mouse (strain CBA) DNA, clone YNOTHOX-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 183)
  AUTHORS   Rubock,M.J., Larin,Z., Cook,M., Papalopulu,N., Krumlauf,R. and
            Lehrach,H.
  TITLE     A yeast artificial chromosome containing the mouse homeobox cluster
            Hox-2
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4751-4755 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Krumlauf, 04-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept    <     1  >   183     Hox-2.9 homeobox protein (AA at 1)
    site          1      183     homeobox Hox-2
BASE COUNT       50 a     53 c     54 g     26 t
ORIGIN      Chromosome 11D.
        1 cccggcggtc tccgcacaaa cttcaccacg cgccagctga cggagctgga gaaggaattt
       61 catttcaaca aatacctgag ccgtgcccgg agggtggaga tcgccgccac cctggagctc
      121 aatgaaacgc aggtgaagat ctggttccag aaccggcgca tgaagcagaa gaaacgcgag
      181 cga
//
LOCUS       BCCIPMD      1101 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  B.coagulans 3-isopropylmalate dehydrogenase gene, complete cds.
ACCESSION   M33099
KEYWORDS    3-isopropylmalate dehydrogenase.
SOURCE      B.coagulans (ATCC 7051) DNA.
  ORGANISM  Bacillus coagulans
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1101)
  AUTHORS   Sekiguchi,T., Ortega-Cesena,J., Nosoh,Y., Ohashi,S., Tsuda,K. and
            Kanaya,S.
  TITLE     DNA and amino-acid sequences of 3-isopropylmalate dehydrogenase of
            Bacillus coagulans. Comparison with the enzymes of Saccharomyces
            cerevisiae and Thermus thermophilus
  JOURNAL   Biochim. Biophys. Acta 867, 36-44 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1     1101     3-isopropylmalate dehydrogenase
BASE COUNT      288 a    255 c    328 g    230 t
ORIGIN      
        1 atgaaaatga aactggccgt actgcccggc gatgggatcg ggccggaagt gatggatgca
       61 gcgatccgcg ttttaaaaac agtgttggac aatgacgggc atgaagccgt ttttgaaaat
      121 gcgctgattg ggggcgccgc cattgatgaa gcggggacgc ccctaccgga agaaacgctt
      181 gacatttgcc gcaggagcga tgccattttg ctcggcgcgg taggggggcc gaaatgggat
      241 cataacccgg cttccctccg cccggaaaaa ggcctgctcg ggctccggaa agaaatgggg
      301 ctgtttgcga acctgcgccc ggttaaagca tatgccacac ttttaaacgc atcgccttta
      361 aaacgggaac gtgtggaaaa cgtcgatctt gttattgtcc gcgaactgac gggcggcctc
      421 tattttgggc gcccgagtga aaggcgcggg ccgggcgaga atgaagtggt agacacgctt
      481 gcctatacaa gggaagagat tgaaagaatt attgagaaag cattccagct tgcccaaatc
      541 agaagaaaaa aactggcatc cgtcgataag gcgaatgtgc tggaatcaag cagaatgtgg
      601 cgcgaaattg cggaagaaac cgcgaaaaag tatccggacg tggaattgag ccatatgctt
      661 gtcgactcaa cttcgatgca gctgattgca aatccgggcc aatttgatgt cattgtaaca
      721 gagaatatgt tcggcgatat tttaagcgat gaagcgtccg tgattaccgg cagcctcggc
      781 atgttgccat ccgcaagcct ccgttccgac cggttcggca tgtatgaacc ggtccacggc
      841 tccgcgccgg atattgccgg gcagggaaaa gccaacccgc tcgggacagt gctgtcagcg
      901 gctttgatgc tccgttattc gttcgggctt gagaaagaag cggcggccat tgaaaaagca
      961 gtggatgatg tgcttcaaga cggctattgt acaggcgatt tgcaggtggc aaacggaaaa
     1021 gtggtcagta caattgagct cacagaccgg ctgatcgaaa aattaaataa cagcgcagcc
     1081 ggtccgcgca tttttcaata a
//
LOCUS       DROSGS3A      151 bp ds-DNA             SYN       12-JUL-1990
DEFINITION  D.melanogaster synthetic Sgs-3 glue protein gene/Adh gene, 5'
            flank.
ACCESSION   M34726
KEYWORDS    alcohol dehydrogenase; glue protein.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 151)
  AUTHORS   Roark,M., Raghavan,K.V., Todo,T., Mayeda,C.A. and Meyerowitz,E.M.
  TITLE     Cooperative enhancement at the Drosophila Sgs-3 locus
  JOURNAL   Dev. Biol. 139, 121-133 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        131  >   151     synthetic Sgs-3 glue protein mRNA
    recomb      130      131     Adh DNA end/Sgs-3 synthetic DNA start
BASE COUNT       51 a     25 c     36 g     39 t
ORIGIN      
        1 gtcgacccaa aagtatcaaa caaaggggag aaggcttgtg tttgcataat cgaaatactg
       61 actccatttt tagaattgca gtttcagtga aagcgtacct ataaaaaggt gaggtatccg
      121 caagaaaagt atcagtttgt ggtaccgagc t
//
LOCUS       MZESOD3A     1037 bp ss-mRNA            PLN       12-JUL-1990
DEFINITION  Z.mays manganese superoxide dismutase (SOD-3) mRNA, complete cds.
ACCESSION   M33119
KEYWORDS    manganese superoxide dismutase; superoxide dismutase.
SOURCE      Z.mays (strain W64A), cDNA to mRNA, clone pSod3.1c.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1037)
  AUTHORS   White,J.A. and Scandalios,J.G.
  TITLE     Isolation and characterization of a cDNA for mitochondrial
            manganese superoxide dismutase (SOD-3) of maize and its relation to
            other manganese superoxide dismutases
  JOURNAL   Biochim. Biophys. Acta 951, 61-70 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         46      753     manganese superoxide dismutase (SOD-3) (EC
                                 1.15.1.1)
BASE COUNT      237 a    259 c    296 g    245 t
ORIGIN      
        1 gaattccacg cacccaggag atacagcgag cgagcgacca aagccatggc tctccgcacc
       61 ctggcatcga agaaggtcct atccttcccg ttcggcggcg cgggccggcc gttggcggcg
      121 gcggcgtctg cgaggggggt gacgacggtc acactccccg acctctccta cgacttcggc
      181 gcgctggaac cggccatctc gggggagatc atgcgcttgc accaccaaaa gcaccacgcc
      241 acctacgtcg ccaactacaa caaggcgctg gagcagcttg aaactgccgt ctccaagggc
      301 gacgcctccg ctgtcgtcca gctgcaggcg gcgatcaagt tcaacggcgg cggtcatgtg
      361 aaccattcaa tcttctggaa gaacctcaag cccattagcg aaggtggcgg ggagccgcct
      421 catgggaaac ttggctgggc catcgatgag gattttggtt cgtttgaggc acttgtaaag
      481 aagatgaatg cagaaggcgc tgctttccaa gggtctggat gggtgtggtt agctttggat
      541 aaagaggcaa aaaaggtttc agttgaaaca acagctaatc aggatcctct ggtgactaaa
      601 ggtgcaagct tggttccgct gttggggatt gatgtctggg aacatgcata ctacctgcag
      661 tacaagaatg ttaggccgga ttacctgaac aacatctgga aggtgatgaa ctggaaatat
      721 gctggagagg tgtacgaaaa tgttcttgct tgaattgtct taacggacaa tacacatctg
      781 cgcgcgcggg tttcggctgt ttgatcatgt gaaataaaga tggacctgtc tagcggctgg
      841 accttgtgta catttcactg agatagacta atggacggcc tgccgatttt gttcgtcctg
      901 cttgcgtgct actctgtctc tgctcctagt ttttggcatc atgtttatgt tgagcaaggt
      961 gatgcccaag ggaagccatt cccactcttg tctccattaa taaaatcagc tgagcttccg
     1021 atgtttgctt ggaattc
//
LOCUS       RATA2UGLBA    300 bp ds-DNA             ROD       12-JUL-1990
DEFINITION  Rat alpha-2u-globulin gene, 5' end.
ACCESSION   M33213
KEYWORDS    alpha-2u-globulin.
SOURCE      Rat DNA, clone 91.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Kurtz,D.T., McCullough,L., Bishop,D.K. and Manos,M.M.
  TITLE     DNA sequences required for hormonal induction of rat
            alpha-2u-globulin genes
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 47, 985-988 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        292  >   300     alpha-2u-globulin
    mRNA        233  >   300     alpha-2u-globulin mRNA
BASE COUNT      108 a     58 c     76 g     58 t
ORIGIN      
        1 acccactaat ttttcgtggg aatatgtttt gcgaaatgta tgagtgatag aatcaatcca
       61 taggagatga catcgccaag tttcaaaagg gcaggaacaa tcgtggcttc acatcagtac
      121 atggaaaaca ttccacaaag cctgagaaga atggaaggcc catatgagaa ggaaaaaaaa
      181 acaccgaaac ccagagagag tataaagacg agcaaagtgc tggaggtgga gtgtgggcac
      241 catcagcaga gggattgtcc cgacagagag gcaattctat tccctaccaa catgaagctg
//
LOCUS       SHRRGBA       409 bp ds-DNA             INV       12-JUL-1990
DEFINITION  Brine shrimp 5.8S ribosomal RNA gene.
ACCESSION   M33097
KEYWORDS    5.8S ribosomal RNA; ribosomal RNA.
SOURCE      Brine shrimp DNA.
  ORGANISM  Artemia sp.
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Branchiopoda; Sarsostraca; Anostraca; Artimiidae.
REFERENCE   1  (bases 1 to 409)
  AUTHORS   Vaughn,J.C., Sperbeck,S.J. and Hughes,M.J.
  TITLE     Molecular cloning and characterization of ribosomal RNA genes from
            the brine shrimp: Nucleotide sequence analysis and evolution of the
            5.8 S rRNA gene region and its flanking nucleotides
  JOURNAL   Biochim. Biophys. Acta 783, 144-151 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA        177      339     5.8S ribosomal RNA
BASE COUNT      104 a     82 c    116 g    107 t
ORIGIN      
        1 ggtgaaaaat agtcatattg gggacgagag tggcttcttg tgattcaagg atcatggata
       61 ccactccgcg agactaaagg gagtgaaggt gagcttgccc caacagagca tggcttgagg
      121 tgtgcaaggg tgcaattgca ttggccttgt ttgagggaga atttgaaaca ttcaatagaa
      181 tgacccttga ggatggatca cttggctcac attacgaaga cgaacgcagc tagacgcgtg
      241 attccatgcg aactgcagga cacatggaac gtctatattt tgaacgcaaa ttgcatgtcc
      301 agcctttgag cttggactac gtctggctga gagacggatg tttttatcat tcggtcatct
      361 gggtataccg tcactgcgag gctccttgct tctatagggc cgttgatcg
//
LOCUS       GLATSAA      3053 bp ss-mRNA            INV       12-JUL-1990
DEFINITION  G.lamblia trophozoite surface antigen (TSA 417) mRNA, complete cds.
ACCESSION   M33641
KEYWORDS    major surface protein; trophozoite surface antigen.
SOURCE      G.lamblia (strain WB, ATCC 30957), cDNA to mRNA, clone pFDG417.
  ORGANISM  Giardia lamblia
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE   1  (bases 1 to 3053)
  AUTHORS   Gillin,F.D., Hagblom,P., Harwood,J., Aley,S.B., Reiner,D.S.,
            McCaffery,M., So,M. and Guiney,D.
  TITLE     Isolation and expression of the gene for a major surface protein of
            Giardia lamblia
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4463-4467 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by S.B.Aley, 08-APR-1990.
FEATURES       from  to/span     description
    pept        205     2346     trophozoite surface antigen protein precursor
                                 (TSA 417)
    sigp        205      255     trophozoite surface antigen protein signal
                                 peptide
    matp        256     2343     trophozoite surface antigen
    binding     188      193     ribosomal binding site (put.)
    signal       67       72     TATA box
    signal     2359     2364     misc. signal
    signal     2831     2837     polyA signal
BASE COUNT      785 a    777 c    865 g    626 t
ORIGIN      
        1 gaattcttac gctatgtacg gcttatattg acaggattgc tacaggctat gaatactatg
       61 ctagagtata aacatgtatc cacggcgatc tgggggtctt ctcggagact agtggccagt
      121 taccatggac acgcaagaag ctgtctgtgg tagcctggcc ccgggctttg cgttggaagc
      181 gccacccagc aggtcggcgg cctaatgttc ggcagatttt tgctcgcgat cgtcatcctt
      241 cagctggcac ggacagcctg cacccaagaa gctgacgatg gaaagtgtaa aacgtgtggc
      301 gtcaccattg gtcaagacac ttggtgctct gagtgcaacg gagcaaacta cgcccccgtg
      361 aacggccagt gtgtagacgt caacgctgag gggccaagca aaacgctttg tccgcaacat
      421 agcgcaggga agtgcacgca gtgcggaggc aactcattca tgtacaagga cggctgttat
      481 tccagcggag aaggccttcc tggacacagc ctgtgcttaa gttccgacgg agatggcgta
      541 tgcaccgagg cggccccggg gtactttgct ccggtgggag cggcgaacac tgaacagtct
      601 gtgatcgcat gtggcgatac aactggagta acaatagcag ctggcggaaa cacatacaag
      661 ggcattgctg actgcgcaga atgcagcgcc cctgacgcaa cagccggcgc tgaggccggc
      721 aaggttgcaa cgtgtaccaa gtgtggagtc agtaagtatc tcaaggataa cgtgtgcgta
      781 gataaagccc aatgtaattc tggtagcact aataagttcg ttgcagttga tgattctgag
      841 aatggcaaca agtgtgtttc ttgcagcgat aacctcaatg gtggcgttgc caattgcgac
      901 acctgtagct acgatgagca atctaagaag atcaagtgta caaaatgcac cgataacaac
      961 tacctgaaaa ccacaagcga aggcacgtcg tgcgtacaaa aagaccaatg caaagacggc
     1021 ttcttcccca aggatgacag cagtgcagga aataaatgcc tcccttgtaa tgacagcacc
     1081 gacggaattg ccaattgcgc cacgtgtgct ctggttagtg gccgatcagg ggctgccctc
     1141 gttacatgct ccgcctgcac ggatggatac aagcctagtg ccgacaaaac tacgtgcgag
     1201 gcggtaagca actgcaagac ccccggatgc aaggcgtgca gcaacgaagg aaaggagaac
     1261 gaggtctgca cagactgtga tggtagcaca tacctcacgc cgacaagcca gtgcatagac
     1321 agctgcgcta agattggaaa ctactatgga gccaccgaag gagcaaagaa actctgtaaa
     1381 gagtgcactg cggctaactg caagacttgc gatgatcagg ggcagtgcca agcatgcaac
     1441 gacgggttct ataaaaacgg cgacgcgtgc tctccgtgcc acgaaagctg caagacatgc
     1501 agcgcaggca ctgccagcga ctgcaccgag tgtcccaccg gaaaagcact caggtacggg
     1561 gacgacggta ctaagggcac gtgcggagaa ggctgcacaa cgggcacagg agcaggagca
     1621 tgcaagacgt gtgggctcac tatcgatggc gctagctact gctctgagtg cgccacaacg
     1681 acagaatatc ctcaaaatgg cgtctgtgca ccaaaggcta gccgcgccac acctacgtgc
     1741 aacgactcgc ctattcagaa tggtgtttgt ggaacgtgtg ccgataacta ctttaagatg
     1801 aacggagggt gctatgaaac agtcaagtat cccggtaaga cggtttgcat tagtgcacca
     1861 aatggtggta cgtgtcaaaa agctgcagat ggttacaagt tggattcagg tacccttaca
     1921 gtttgttctg aagggtgtaa ggaatgtgct agcagtaccg actgtactac gtgtctggac
     1981 ggatatgtaa agagtgcaag tgcgtgcaca aagtgtgacg ctagctgcga aacatgtaat
     2041 ggagcagcta caacatgtaa ggcgtgtgct acgggatact acaagaccgc atcaggagaa
     2101 ggtgcgtgca cgtcttgtga aagtgatagc aacggagtca ctggtattaa gggctgccta
     2161 aactgcgccc ctccgcccaa caataaaggt tccgtcctct gctacctcat aaaggatagc
     2221 ggtagcacca acaagagcgg gctctccact ggtgccatag cgggtatctc cgtcgctgtc
     2281 atcgttgttg tcggcggcct catcggcttc ctctgctggt ggttcctctg cagggggaag
     2341 gcgtagatgt acttagatag taaaccgtca tcgatgggtc tgctcggtgt ctgttcctgc
     2401 tagcacagac agcagggtct cagccagtgc accaagcatc aggcgtgtgg atgaatgttt
     2461 ggcttatcca gtagcgccct tgcgtgtcca cgggctcaca tgtgaccaac agtgctgtac
     2521 aggtaggtag agaccagacc acggatccca tgcactgaat gcaactcctt tgcagccgtg
     2581 atgggtcagt tgtggcaatt tataagacaa aacgagggcc ctgtccatcg cacagtccct
     2641 tgcagcgctt ccagacgcgg agctggcggc ggtcctgcac tacctcgccg agttccgtgg
     2701 gccagaggtc ttcggggact gccttcagac cttgctcagc tcgacgagcc ccgggacgag
     2761 accctcaggc ttgcggcaca gaaagacata cgcggcttcc tcgaaaggat cgacagaaag
     2821 gactcagctc aataaatgcc actcttcacg tcctcgttcc gcggtacatc gtgtagctgt
     2881 acatccagtg gaacttttcg actcagaagt gaagttgatg agctctgtgg tagataactt
     2941 ctcatggcct ggagtggcag acactgtgag cagctgattg gcatgcaatt cacaccctag
     3001 acgcggtgga gagatacccc cgtccatcca ttacaaacaa gtcccacaag ctt
//
LOCUS       HUMPLG01     1272 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 1.
ACCESSION   M33272 J05286
KEYWORDS    plasminogen.
SEGMENT     1 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1272)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept       1077  +  1125     plasminogen precursor, exon 1
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    sigp       1077  +  1125     plasminogen signal peptide
    pre-msg     957  >  1272     PLG mRNA and introns
    IVS        1126  >  1272     PLG intron A
BASE COUNT      391 a    231 c    269 g    381 t
ORIGIN      
        1 gaattccgca gacattccac ccaagaccat tgggctccca cctctactct tttgccagtt
       61 aatgaatagg caggaatttc actgcctgga aagaggaaca atgctttctg gtccttattt
      121 cacatctaaa atagagaggt caattgattt attcctaaat atctttgaac actaaaatag
      181 aagttttaca gcatatatac tacctggttg ctctagactt aagccaggga aaagtacaga
      241 ttcaacattt aaaattgaga tagacgcttt ccacttaatg ctaccagtct tgctttattt
      301 catgagaatg agaatataat aatatggcat acgttcattt gggggaaaga ttgatgtctt
      361 ataacataat ttataattac agaaaacatg tgagttcact gggaataaat aaattttgaa
      421 gataataaga tactttcact tatgtcataa tttctatgtc atttggtgta ggatgtagag
      481 atattaacgt ttacacctaa ctcaagtttg tcatctaaga cctgaaaggg ttttgtctat
      541 cagctgcacc cctgggtaga gacacaacct tggggaaggc ctcagcccca tccctcgtac
      601 agcaggaatg agaacagccc tgcctgttgg gaagcttgag ggaggctatg gacgtgcagc
      661 gcttggcaga aggtctcgtc atggaaggtt ccagcaaatg tgagatactt ttatgatttc
      721 attttctcca aaagaaaggg aataagagaa gaggggagga aataagacta attgcgagag
      781 ataaagtaca agggtgaggg aaggaataag gagacatgac ggcagcgtgg agcagccgag
      841 gggggagatt gctttcacca cttcccagca tctattgcag attccaccct caaacatttt
      901 gtaaggactc tttattcaag gtaacgtttg aaccctgctg agccagtggc atgggtctct
      961 gagagaatca ttaacttaat ttgactatct ggtttgtgga tgcgtttact ctcatgtaag
     1021 tcaacaacat cctgggattg ggacccactt tctgggcact gctggccagt cccaaaatgg
     1081 aacataagga agtggttctt ctacttcttt tatttctgaa atcaggtaag acatagtttt
     1141 tttaaattat aataattatt ttttctccca caatgtagta aaaatacata tgccatggct
     1201 ttatgtgcaa ttcatttaat ttttgattca tgaaacttcc agttgaaaat cttgtataag
     1261 attgaggaat tc
//
LOCUS       HUMPLG02      161 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, intron A (partial).
ACCESSION   M33273 J05286
KEYWORDS    plasminogen.
SEGMENT     2 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 161)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    IVS     <     1  >   161     plasminogen intron A /hgml_locus_uid="LW0013Z"
                                 /nomgen="PLG" /map="6q26-q27"
BASE COUNT       52 a     46 c     20 g     43 t
ORIGIN      Unknown number of base pairs after segment 1.
        1 gaattcaccc atttaggcat acaatccaat ggatttcaag atattgagag ttgtgcagcc
       61 accatcagaa taaattttaa aactattcat acccccaaaa acgcactcca ctctccttag
      121 ctgttacccc aatctgcagc ttctggcaac cactaatcta c
//
LOCUS       HUMPLG03      376 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 2.
ACCESSION   M33274 J05286
KEYWORDS    plasminogen.
SEGMENT     3 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 376)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +   196  +   331     plasminogen (PLG) precursor, exon 2
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    sigp        196      203     plasminogen signal peptide
    matp        204  +   331     plasminogen
    IVS     <     1      195     PLG intron A
    IVS         332  >   376     PLG intron B
BASE COUNT      114 a     74 c     78 g    110 t
ORIGIN      Unknown number of base pairs after segment 2.
        1 tctttattta tgtccaaatg cccgactgtg tgttcttaac taaacatttt gattcatagc
       61 tacccattct acttccagta aacagaaagt tttatttggt taatgctaac caaatagatt
      121 aaaaggaagt catgacaatt agacattgac attgatttac tgaccattta ttccacttgg
      181 atctcccacc tctaggtcaa ggagagcctc tggatgacta tgtgaatacc cagggggctt
      241 cactgttcag tgtcactaag aagcagctgg gagcaggaag tatagaagaa tgtgcagcaa
      301 aatgtgagga ggacgaagaa ttcacctgca ggtatttcca ttgtcgttgc acctacgcag
      361 gaatctgtaa ttcaga
//
LOCUS       HUMPLG04      291 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 3.
ACCESSION   M33275 J05286
KEYWORDS    plasminogen.
SEGMENT     4 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    46  +   152     plasminogen (PLG) precursor, exon 3
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    46  +   152     plasminogen
    IVS     <     1       45     PLG intron B
    IVS         153  >   291     PLG intron C
BASE COUNT       88 a     60 c     51 g     92 t
ORIGIN      Unknown number of base pairs after segment 3.
        1 taaataaaga aaaatactta ttggatttcc tgcttcgttc tgcagggcat tccaatatca
       61 cagtaaagag caacaatgtg tgataatggc tgaaaacagg aagtcctcca taatcattag
      121 gatgagagat gtagttttat ttgaaaagaa aggtgagtac attttcttcc tcctcctcct
      181 actgtcctcc ccatcctccc actcttcctc tttctctatt ctatctttaa tttatgagac
      241 cagaggagga aggcactatc gtgttataaa actgaattct gagttaggac a
//
LOCUS       HUMPLG05       69 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, intron C (partial).
ACCESSION   M33276 J05286
KEYWORDS    plasminogen.
SEGMENT     5 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 69)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    IVS     <     1  >    69     plasminogen intron C /hgml_locus_uid="LW0013Z"
                                 /nomgen="PLG" /map="6q26-q27"
BASE COUNT       26 a      9 c     13 g     21 t
ORIGIN      Unknown number of base pairs after segment 4.
        1 aagtgcagat taaatctaaa ctttatctgg tgaagttatt agttcttaca agtagcaagc
       61 aaacggtaa
//
LOCUS       HUMPLG06       57 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, intron C (partial).
ACCESSION   M33277 J05286
KEYWORDS    plasminogen.
SEGMENT     6 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 57)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    IVS     <     1  >    57     plasminogen intron C /hgml_locus_uid="LW0013Z"
                                 /nomgen="PLG" /map="6q26-q27"
BASE COUNT       18 a     12 c      7 g     20 t
ORIGIN      Unknown number of base pairs after segment 5.
        1 agtgcaacat ctacaataat tactttcctt atttttgaag tggaccatat ctcgaca
//
LOCUS       HUMPLG07      341 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 4.
ACCESSION   M33278 J05286
KEYWORDS    plasminogen.
SEGMENT     7 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 341)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +   167  +   281     plasminogen (PLG) precursor, exon 4
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +   167  +   281     plasminogen
    IVS     <     1      166     PLG intron C
    IVS         282  >   341     PLG intron D
BASE COUNT       89 a     73 c     77 g    102 t
ORIGIN      Unknown number of base pairs after segment 6.
        1 tggctcagtt tactgcagcc tttttgcaga tgcaaaagat gatcttttag aaagcagaaa
       61 cagggggtct ggtgcatgag atctttttct caacgtgact atgctgtgca gaccttcatg
      121 tggtgtcttg tgaaagactt tgaccactgt gtggacttcc cttcagtgta tctctcagag
      181 tgcaagactg ggaatggaaa gaattacaga gggacgatgt ccaaaacaaa aaatggcatc
      241 acctgtcaaa aatggagttc cacttctccc cacagaccta ggtaagacat tccctttcat
      301 ctttgtgttc atctactgta aagttgtccc tctgtgtctg t
//
LOCUS       HUMPLG08      354 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 5.
ACCESSION   M33279 J05286
KEYWORDS    plasminogen.
SEGMENT     8 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    69  +   208     plasminogen (PLG) precursor, exon 5
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    69  +   208     plasminogen
    IVS     <     1       68     PLG intron D
    IVS         209  >   354     PLG intron E
BASE COUNT      101 a     83 c     72 g     98 t
ORIGIN      Unknown number of base pairs after segment 7.
        1 ttctgccttg ctaatagcaa gctgattttt agaatatagt ctaagtgctt cttttccatc
       61 ctccccagat tctcacctgc tacacacccc tcagagggac tggaggagaa ctactgcagg
      121 aatccagaca acgatccgca ggggccctgg tgctatacta ctgatccaga aaagagatat
      181 gactactgcg acattcttga gtgtgaaggt caggagtggt tctagaaaat gttttcattt
      241 ctgcccttca cctgtaaaat aatttgttgt aaagcccctt cccacaggga tgttattaat
      301 aattgagtaa cgtattcacc tctgggaaag aagcaaaacc ccagaattaa cctg
//
LOCUS       HUMPLG09      206 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 6.
ACCESSION   M33280 J05286
KEYWORDS    plasminogen.
SEGMENT     9 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 206)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    56  +   176     plasminogen (PLG) precursor, exon 6
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    56  +   176     plasminogen
    IVS     <     1       55     PLG intron E
    IVS         177  >   206     PLG intron F
BASE COUNT       51 a     57 c     39 g     59 t
ORIGIN      Unknown number of base pairs after segment 8.
        1 ttcatccatt tcagttttct tcttcctctc tgtccttcct tcccactctg tccagaggaa
       61 tgtatgcatt gcagtggaga aaactatgac ggcaaaattt ccaagaccat gtctggactg
      121 gaatgccagg cctgggactc tcagagccca cacgctcatg gatacattcc ttccaagtaa
      181 gtctcactgg gaaaaacatt ccatgt
//
LOCUS       HUMPLG10      100 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, intron F (partial).
ACCESSION   M33281 J05286
KEYWORDS    plasminogen.
SEGMENT     10 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    IVS     <     1  >   100     plasminogen intron F /hgml_locus_uid="LW0013Z"
                                 /nomgen="PLG" /map="6q26-q27"
BASE COUNT       29 a     17 c     26 g     28 t
ORIGIN      Unknown number of base pairs after segment 9.
        1 ccaaaatgat aaggtcactg attctgttga gtgattttta cacatgtaaa ctgttagaaa
       61 aacagtgctt ggcagccggg catggtggca catgctgtag
//
LOCUS       HUMPLG11      247 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 7.
ACCESSION   M33282 J05286
KEYWORDS    plasminogen.
SEGMENT     11 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 247)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    68  +   186     plasminogen (PLG) precursor, exon 7
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    68  +   186     plasminogen
    IVS     <     1       67     PLG intron F
    IVS         187  >   247     PLG intron G
BASE COUNT       70 a     63 c     51 g     63 t
ORIGIN      Unknown number of base pairs after segment 10.
        1 cttgaaaaag agtcttatcc atgaatgtaa atgttcagtg ctactaaaat ctttcttgtc
       61 cattcagatt tccaaacaag aacctgaaga agaattactg tcgtaacccc gatagggagc
      121 tgcggccttg gtgtttcacc accgacccca acaagcgctg ggaactttgc gacatccccc
      181 gctgcagtga gtatgatgca cacccagatt ccaggatttg gacctgccct gttcttgaaa
      241 tcaaaag
//
LOCUS       HUMPLG12      244 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 8.
ACCESSION   M33283 J05286
KEYWORDS    plasminogen.
SEGMENT     12 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 244)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    47  +   209     plasminogen (PLG) precursor, exon 8
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    47  +   209     plasminogen
    IVS     <     1       46     PLG intron G
    IVS         210  >   244     PLG intron H
BASE COUNT       68 a     72 c     45 g     59 t
ORIGIN      Unknown number of base pairs after segment 11.
        1 ctcaaaaaat atatatattc attgtaactt attttgccca ttcaagcaac acctccacca
       61 tcttctggtc ccacctacca gtgtctgaag ggaacaggtg aaaactatcg cgggaatgtg
      121 gctgttaccg tgtccgggca cacctgtcag cactggagtg cacagacccc tcacacacat
      181 aacaggacac cagaaaactt tccctgcaag taagtcccct ccagtctcat tctgctgcta
      241 tgga
//
LOCUS       HUMPLG13      217 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 9.
ACCESSION   M33284 J05286
KEYWORDS    plasminogen.
SEGMENT     13 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 217)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    36  +   181     plasminogen (PLG) precursor, exon 9
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    36  +   181     plasminogen
    IVS     <     1       35     PLG intron H
    IVS         182  >   217     PLG intron I
BASE COUNT       64 a     56 c     52 g     45 t
ORIGIN      Unknown number of base pairs after segment 12.
        1 ttggaaagct aaactcacaa tcacttcttt ttcagaaatt tggatgaaaa ctactgccgc
       61 aatcctgacg gaaaaagggc cccatggtgc catacaacca acagccaagt gcggtgggag
      121 tactgtaaga taccgtcctg tgactcctcc ccagtatcca cggaacaatt ggctcccaca
      181 ggtaagcaag ggtatgggag cttactgagg gcccaag
//
LOCUS       HUMPLG14      409 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 10.
ACCESSION   M33285 J05286
KEYWORDS    plasminogen.
SEGMENT     14 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 409)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +   132  +   291     plasminogen (PLG) precursor, exon 10
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +   132  +   291     plasminogen
    IVS     <     1      131     PLG intron I
    IVS         292  >   409     PLG intron J
BASE COUNT      116 a    102 c     73 g    118 t
ORIGIN      Unknown number of base pairs after segment 13.
        1 tctgtctgct aatacagaaa agagaacagt cataattctc agaggctacc gtactgtttt
       61 tgtcataaat tgcttcatgc ttcttttttt tcagtaattg ttaagcttga tttcttttat
      121 tttaatttca gcaccacctg agctaacccc tgtggtccag gactgctacc atggtgatgg
      181 acagagctac cgaggcacat cctccaccac caccacagga aagaagtgtc agtcttggtc
      241 atctatgaca ccacaccggc accagaagac cccagaaaac tacccaaatg cgtatgtctt
      301 tgatttttac tgtaagaggg gcatcagcca actgaaattt ctgttaaaag agccatgctt
      361 catgcttcaa gccaacttcc taggaccaaa tttctcttag acccagaat
//
LOCUS       HUMPLG15      266 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 11.
ACCESSION   M33286 J05286
KEYWORDS    plasminogen.
SEGMENT     15 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 266)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    60  +   241     plasminogen (PLG) precursor, exon 11
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    60  +   241     plasminogen
    IVS     <     1       59     PLG intron J
    IVS         242  >   266     PLG intron K
BASE COUNT       66 a     68 c     65 g     67 t
ORIGIN      Unknown number of base pairs after segment 14.
        1 ctgggtgccc ctgaatattc tcccacctct tgtgacctgt attgttttgg aatttccagt
       61 ggcctgacaa tgaactactg caggaatcca gatgccgata aaggcccctg gtgttttacc
      121 acagacccca gcgtcaggtg ggagtactgc aacctgaaaa aatgctcagg aacagaagcg
      181 agtgttgtag cacctccgcc tgttgtcctg cttccaaatg tagagactcc ttccgaagaa
      241 ggtaagaaat ctgtggctgg acatct
//
LOCUS       HUMPLG16      224 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 12.
ACCESSION   M33287 J05286
KEYWORDS    plasminogen.
SEGMENT     16 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 224)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    26  +   174     plasminogen (PLG) precursor, exon 12
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    26  +   174     plasminogen
    IVS     <     1       25     PLG intron K
    IVS         175  >   224     PLG intron L
BASE COUNT       57 a     56 c     56 g     55 t
ORIGIN      Unknown number of base pairs after segment 15.
        1 aatcatccat tttttccctg tacagactgt atgtttggga atgggaaagg ataccgaggc
       61 aagagggcga ccactgttac tgggacgcca tgccaggact gggctgccca ggagccccat
      121 agacacagca ttttcactcc agagacaaat ccacgggcgg gtctggaaaa aaatgtaagc
      181 cactttgatt tggactcttt ggccttttgc tcaccaatct ttgc
//
LOCUS       HUMPLG17      223 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 13.
ACCESSION   M33288 J05286
KEYWORDS    plasminogen.
SEGMENT     17 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 223)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    31  +   124     plasminogen (PLG) precursor, exon 13
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    31  +   124     plasminogen
    IVS     <     1       30     PLG intron L
    IVS         125  >   223     PLG intron M
BASE COUNT       56 a     44 c     58 g     65 t
ORIGIN      Unknown number of base pairs after segment 16.
        1 gctggagctt acatgccttc ttgttttcag tactgccgta accctgatgg tgatgtaggt
       61 ggtccctggt gctacacgac aaatccaaga aaactttacg actactgtga tgtccctcag
      121 tgtggtaggt tgccttcttt ttggtaagga aactgcttac ttaatatgga tttgcaacaa
      181 aaaaggaaaa gggcttctga gcagactgct tctggggagg aga
//
LOCUS       HUMPLG18      296 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 14.
ACCESSION   M33289 J05286
KEYWORDS    plasminogen.
SEGMENT     18 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 296)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +   126  +   246     plasminogen (PLG) precursor, exon 14
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +   126  +   246     plasminogen
    IVS     <     1      125     PLG intron M
    IVS         247  >   296     PLG intron N
BASE COUNT       74 a     72 c     69 g     81 t
ORIGIN      Unknown number of base pairs after segment 17.
        1 atgattttac tatttagttc ggcctttaag atgtcaaaaa ctcagtgctt ggaatttgtc
       61 tcgaattaca ccacaaaatt gctaccttgt ctcaaatggg atttctttcc caccttgtgc
      121 cacagcggcc ccttcatttg attgtgggaa gcctcaagtg gagccgaaga aatgtcctgg
      181 aagggttgta ggggggtgtg tggcccaccc acattcctgg ccctggcaag tcagtcttag
      241 aacaaggtaa gaacaggccc agaaacgatt tatactgtcc ctccacgtaa gccctg
//
LOCUS       HUMPLG19      361 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 15.
ACCESSION   M33290 J05286
KEYWORDS    plasminogen.
SEGMENT     19 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 361)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    66  +   140     plasminogen (PLG) precursor, exon 15
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    66  +   140     plasminogen
    IVS     <     1       65     PLG intron N
    IVS         141  >   361     PLG intron O
BASE COUNT       93 a     77 c     80 g    111 t
ORIGIN      Unknown number of base pairs after segment 18.
        1 ttctgtacaa tggagcagaa caaagtatca atttaactaa aatttgaact aaatcctctt
       61 tccaggtttg gaatgcactt ctgtggaggc accttgatat ccccagagtg ggtgttgact
      121 gctgcccact gcttggagaa gtatgtttag gggacaattg acatgaagtc ttgtcttaaa
      181 tactttttct gtccttcttt tcctcctttc ctcctttcct ttctcactct tcctcccttc
      241 cttctctggc tgtgacacta gggaccaggc cagggcaatt ggataagaga gaagggaagg
      301 gtttctagaa agaaactgca gaggaaagac acagtacaga tgattttgtg ggcctgaata
      361 a
//
LOCUS       HUMPLG20      331 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 16.
ACCESSION   M34272 J05286
KEYWORDS    plasminogen.
SEGMENT     20 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 331)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    41  +   181     plasminogen (PLG) precursor, exon 16
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    41  +   181     plasminogen
    IVS     <     1       40     PLG intron O
    IVS         182  >   331     PLG intron P
BASE COUNT       80 a     81 c     79 g     91 t
ORIGIN      Unknown number of base pairs after segment 19.
        1 ctggaccata ttttcctctt gacatcctca tcttttctag gtccccaagg ccttcatcct
       61 acaaggtcat cctgggtgca caccaagaag tgaatctcga accgcatgtt caggaaatag
      121 aagtgtctag gctgttcttg gagcccacac gaaaagatat tgccttgcta aagctaagca
      181 ggtactcgtt cacctgtggt cttcacccca cgctggtgaa gatatttgct ttatgtctgg
      241 gttttatggg ccatggcact gcatggcagt ggggaggaac tgtctatcac atgaaaggct
      301 caagggcttt ggggacagca tcaatcttca a
//
LOCUS       HUMPLG21      251 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 17.
ACCESSION   M34273 J05286
KEYWORDS    plasminogen.
SEGMENT     21 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 251)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    41  +   147     plasminogen (PLG) precursor, exon 17
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    41  +   147     plasminogen
    IVS     <     1       40     PLG intron P
    IVS         148  >   251     PLG intron Q
BASE COUNT       71 a     61 c     51 g     68 t
ORIGIN      Unknown number of base pairs after segment 20.
        1 gcagagcagt caaacataac tgctgatgct tttctttcag tcctgccgtc atcactgaca
       61 aagtaatccc agcttgtctg ccatccccaa attatgtggt cgctgaccgg accgaatgtt
      121 tcatcactgg ctggggagaa acccaaggtg agataaattc cattgcccac ataacgaatt
      181 ggttttgacc tacagtccat gtgacaaaat gatcattttg gagaaagctg tgcaaattcc
      241 tatccatgaa t
//
LOCUS       HUMPLG22      101 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, intron Q (partial).
ACCESSION   M34274 J05286
KEYWORDS    plasminogen.
SEGMENT     22 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 101)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    IVS     <     1  >   101     plasminogen intron Q /hgml_locus_uid="LW0013Z"
                                 /nomgen="PLG" /map="6q26-q27"
BASE COUNT       24 a     34 c     24 g     19 t
ORIGIN      Unknown number of base pairs after segment 21.
        1 agaagggtgc tccctcacac aactacagca gtccaggtga tgcacccact gcccaatgct
       61 tggtagtcaa gaggagcttc ctccctgcag ctctgcccag a
//
LOCUS       HUMPLG23      254 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 18.
ACCESSION   M34275 J05286
KEYWORDS    plasminogen.
SEGMENT     23 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 254)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    69  +   214     plasminogen (PLG) precursor, exon 18
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    69  +   214     plasminogen
    IVS     <     1       68     PLG intron Q
    IVS         215  >   254     PLG intron R
BASE COUNT       61 a     54 c     63 g     76 t
ORIGIN      Unknown number of base pairs after segment 22.
        1 tgttctggaa tatcctcctg aatgtgtttt gggtgcagtt gccatttctt tcatcttttt
       61 aaacacaggt acttttggag ctggccttct caaggaagcc cagctccctg tgattgagaa
      121 taaagtgtgc aatcgctatg agtttctgaa tggaagagtc caatccaccg aactctgtgc
      181 tgggcatttg gccggaggca ctgacagttg ccaggtaagc aaagatcaag agaccaaagt
      241 tagtcttgtg ctct
//
LOCUS       HUMPLG24     1236 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human plasminogen gene, exon 19.
ACCESSION   M34276 J05286
KEYWORDS    plasminogen.
SEGMENT     24 of 24
SOURCE      Human leukocyte and lung fibroblast DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1236)
  AUTHORS   Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W.
  TITLE     Characterization of the gene for human plasminogen, a key proenzyme
            in the fibrinolytic system
  JOURNAL   J. Biol. Chem. 265, 6104-6111 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ichinose, 26-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept    +    41      202     plasminogen (PLG) precursor, exon 19
                                 /hgml_locus_uid="LW0013Z" /nomgen="PLG"
                                 /map="6q26-q27"
    matp    +    41      199     plasminogen
    pre-msg <     1      427     PLG mRNA and introns (alt.)
    pre-msg <     1      458     PLG mRNA and introns (alt.)
    pre-msg <     1     1184     PLG mRNA and introns (alt.)
    IVS     <     1       40     PLG intron R
BASE COUNT      365 a    233 c    297 g    341 t
ORIGIN      Unknown number of base pairs after segment 23.
        1 agcctaaccc tcacatgcat ttttctctcc ctctgtatag ggtgacagtg gagggcctct
       61 ggtttgcttc gagaaggaca aatacatttt acaaggagtc acttcttggg gtcttggctg
      121 tgcacgcccc aataagcctg gtgtctatgt tcgtgtttca aggtttgtta cttggattga
      181 gggagtgatg agaaataatt aattggacgg gagacagagt gacgcactga ctcacctaga
      241 ggctgggacg tgggtaggga tttagcatgc tggaaataac tggcagtaat caaacgaaga
      301 cactgtcccc agctaccagc tacgccaaac ctcggcattt tttgtgttat tttctgactg
      361 ctggattctg tagtaaggtg acatagctat gacatttgtt aaaaataaac tctgtactta
      421 actttgattt gagtaaattt tggttttggt cttcaacatt ttcatgctct ttgttcaccc
      481 caccaatttt aaatgggcag atggggggat ttagctgctt ttgataagga acagctgcac
      541 aaaggactga gcaggctgca aggtcacaga ggggagagcc aagaagttgt ccacgcattt
      601 acctcatcag ctaacgaggg cttgacatgc atttttactg tctttattcc tgacactgag
      661 atgaatgttt tcaaagctgc aacatgcatg gggagtcatg cgaaccgatt ctgttattgg
      721 gaatgaaatc tgtcaccgac tgcttgactt gagcccaggg gacacagagc agagagctgt
      781 atatgatgga gtgaaccggt ccatggatgt gtaacacaag accaactgag agtctgaatg
      841 ttattctggg gcacacgtga gtctaggatt ggtgccaaga gcatgtaaat gaacaacaag
      901 caaatattga aggtggacca cttatttccc attgctaatt gcctgcccgg ttttgaaaca
      961 gtctgcagta cacacggtga caggagaatg acctgtggga gagatacatg tttagaagga
     1021 agagaaagga caaaggcaca cgttttacca tttaaaatat tgttaccaaa caaaaatatc
     1081 cattcaaaat acaatttaac aatgcaacag tcatcttaca gcagagaaat gcagagaaaa
     1141 gcaaaactgc aagtgactgt gaataaaggg tgaatgtagt ctcaaatcct caaagagctg
     1201 tgtttatttc attgacaaat agattatttg tattca
//
LOCUS       PARGANTI1     162 bp ds-DNA             INV       12-JUL-1990
DEFINITION  P.primaurelia G surface antigen gene, 5' end.
ACCESSION   M11194
KEYWORDS    G surface antigen; surface antigen.
SEGMENT     1 of 2
SOURCE      P.primaurelia macronucleus DNA.
  ORGANISM  Paramecium primaurelia
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae.
REFERENCE   1  (bases 1 to 162)
  AUTHORS   Meyer,E., Caron,F. and Baroin,A.
  TITLE     Macronuclear structure of the G surface antigen gene of Paramecium
            primaurelia and direct expression of its repeated epitopes in
            Escherichia coli
  JOURNAL   Mol. Cell. Biol. 5, 2414-2422 (1985)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            F.Caron, 24-OCT-1985.
FEATURES       from  to/span     description
    pept         19  >   162     G surface antigen
BASE COUNT       55 a     23 c     25 g     59 t
ORIGIN      
        1 tgaattttaa tacttttaat gaataataaa ttcatcatat tctcattgtt gcttgcttta
       61 gtagcaagtc aaacatacag tttaacatca tgcacatgtg cataattgtt atcagaagga
      121 gattgcatca aaaatgtttc acttggatgt tcatgggata ca
//
LOCUS       PARGANTI2     798 bp ds-DNA             INV       12-JUL-1990
DEFINITION  P.primaurelia G surface antigen gene, partial cds.
ACCESSION   M11193
KEYWORDS    G surface antigen; surface antigen.
SEGMENT     2 of 2
SOURCE      P.primaurelia macronucleus DNA.
  ORGANISM  Paramecium primaurelia
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae.
REFERENCE   1  (bases 1 to 798)
  AUTHORS   Meyer,E., Caron,F. and Baroin,A.
  TITLE     Macronuclear structure of the G surface antigen gene of Paramecium
            primaurelia and direct expression of its repeated epitopes in
            Escherichia coli
  JOURNAL   Mol. Cell. Biol. 5, 2414-2422 (1985)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            F.Caron, 24-OCT-1985.
FEATURES       from  to/span     description
    pept    <     1  >   798     G surface antigen (AA at 1)
    rpt           1      222     direct repeat 1
    rpt         223      444     direct repeat 2
    rpt         445      666     direct repeat 3
    rpt         667  >   798     direct repeat 4
BASE COUNT      262 a    153 c    177 g    206 t
ORIGIN      
        1 tgtgcttcaa ttactggaac aggattaacc actgctattt gtggaactta tgatgcaggt
       61 tgtgtggcaa atgttaacgg aacagcttgt taagaaaaat tagcaacatg tgatttgtat
      121 ttaactcaaa actcttgttc tacctcggca gctgcagcaa cagcagataa atgtgcatgg
      181 agtggaaccg cttgccttgc agttacaact gttggtaccc attgtgctta tgttactgga
      241 actggactta ctgatttaat atgtgcagca tataatgcaa attgtacagc taataaagct
      301 ggaacagcat gtcaggagaa aaaggctact tgcaatttat acacaacaga agccacctgt
      361 tcaacatcag cagctgcagc aacagcagat aaatgcgcat ggagtggagc agcttgcctt
      421 gcagtaacaa ctgttgctac agagtgtgct tatgttactg gaactggact tactgattta
      481 atatgtgcag catataatgc aaattgtaca gctaataaag ctggaacagc atgtcaggag
      541 aaaaaggcta cttgcaattt atacacaaca gaagccacct gttcaacatc agcagctgca
      601 gcaacagcag ataaatgcgc atggagtgga gcagcttgcc ttgcagtaac aactgttgct
      661 acagagtgtg cttatgttac tggaactgga ctaacaaatg caatatgtgc agcatataat
      721 gcaaattgta cagctaataa agctggaaca gcatgtcagg agaaaaaggc tacttgcaat
      781 ttatacacaa cagaagcc
//
LOCUS       BOVCASA      1123 bp ss-mRNA            MAM       12-JUL-1990
DEFINITION  Bovine alpha-s1-casein mRNA, complete cds.
ACCESSION   M33123
KEYWORDS    alpha-s1-casein.
SOURCE      Bovine (strain Holstein) lactating mammary gland, cDNA to mRNA,
            clone p-alpha-s1 C228.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1123)
  AUTHORS   Nagao,M., Maki,M., Sasaki,R. and Chiba,R.
  TITLE     Isolation and sequence analysis of bovine alpha-s1-casein cDNA
            clone
  JOURNAL   Agric. Biol. Chem. 48, 1663-1667 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         64      708     alpha-s1-casein precursor
    sigp         64      108     alpha-s1-casein signal peptide
    matp        109      705     alpha-s1-casein
    mRNA    <     1     1123     alpha-s1-casein mRNA
BASE COUNT      331 a    247 c    223 g    322 t
ORIGIN      
        1 tcacttcgac catcaaccca gcttgctgtt cttcccagtc ttgggttcaa gatcttgaca
       61 accatgaaac ttctcatcct tacctgtctt gtggctgttg ctcttgccag gcccaaacat
      121 cctatcaagc accaaggact ccctcaagaa gtcctcaatg aaaatttact caggtttttt
      181 gtggcacctt ttccagaagt gtttggaaag gagaaggtca atgaactgag caaggatatt
      241 gggagtgaat caactgagga tcaagccatg gaagatatta agcaaatgga agctgaaagc
      301 atttcgtcaa gtgaggaaat tgttcccaat agtgttgagc agaagcacat tcaaaaggaa
      361 gatgtgccct ctgagcgtta cctgggttat ctggaacagc ttctcagact gaaaaaatac
      421 aaagtacccc agctggaaat tgttcccaat agtgctgagg aacgacttca cagtatgaaa
      481 gagggaatcc atgcccaaca gaaagaacct atgataggag tgaatcagga actggcctac
      541 ttctaccctg agcttttcag acaattctac cagctggatg cctatccatc tggtgcttgg
      601 tattacgttc cactaggcac acaatacact gatgccccat cattctctga catccctaat
      661 cccattggct ctgagaacag tgaaaagact actatgccac tgtggtgaag agtcaagtga
      721 attctgaggg actccacagt tatggtcttt gatgggtctg aaaattccat gctctacatg
      781 tcgcctcatc tacatgtcaa accattcatc caaaggcttc aactgctgtt ttagaacagg
      841 gcaatctcaa actgaggcac tccttgatgc tctactgtat tttagatagt gtaacatcct
      901 taagtgaaat tgtcctaaca gcttgttacc taaattccag tagtatcatg ctggtataaa
      961 ggccactgag tcaaagggaa ttaaagtctt cattaaattt ctgtatggaa aatgttttaa
     1021 aagcctttga atcacttctc ctgtaagtgc catcatatca aataattgtg tgcattaact
     1081 gagattttgt ctttcttctt ttcaataaat tacattttaa ggc
//
LOCUS       BPHINTXIS    1741 bp ds-DNA             PHG       12-JUL-1990
DEFINITION  Bacteriophage phi-11 integrase (int) and excisionase (xis) genes,
            complete cds.
ACCESSION   M34832
KEYWORDS    excisionase; integrase.
SOURCE      Bacteriophage phi-11 DNA.
  ORGANISM  Bacteriophage phi-11
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 1741)
  AUTHORS   Ye,Z.-H., Buranen,S.L. and Lee,C.Y.
  TITLE     Sequence analysis and comparison of int and xis genes from
            Staphylococcal bacteriophages l54a and phi-11
  JOURNAL   J. Bacteriol. 172, 2568-2575 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        267       67 (c) excisionase (xis)
    pept        379     1425     integrase (int)
BASE COUNT      650 a    264 c    263 g    564 t
ORIGIN      
        1 cctatgccag caccagtgaa actctattat gcatggtatt aaaatcgaag agtacaattc
       61 gataattcaa acattatttg acgaaatagc taagctgtct aatgtatata agtctcttaa
      121 taaacagtaa gcaaaatcgg attcttcatt acataccgaa tattcatcat aaacactgac
      181 tgcatcttct aagacatttt ttaaaattct aatgtcttca ttcgttaaaa ctaattcatt
      241 gaaattatga ttgtttttaa atgtcataac atcacctact ttttatttta ttatatcaca
      301 tttagtacct agtactaaat ttcgggtagc ccgcctaccc ttattatttt ttgccaattt
      361 tgaggaggga gaagcaaaat gccagtatat aaggatgata atacaggtaa atggtatttt
      421 tccattagat ataaagatgt atacggtaat aacaaacgaa aaatgaagcg tgggtttgaa
      481 cgtaagaaag atgccaaact agctgaaagc gaatttatac aaaatgttaa atatggatac
      541 tcggacaatc aaccctttga atatatattt tttgatcgtt taaaaaatga aaatctttct
      601 gcacgctcaa tagaaaagcg aactacagaa tataatactc acataaaaga aaggttcgga
      661 aatatcccta ttggcaaaat cactactacg caatgtactg ctttcaggaa ttatttgtta
      721 aacgatgcag gtctttctgt tgactatgca cgatctgtgt gggcaggttt taaagcagtt
      781 atcaattacg ccaaaaagca ttacaagctc ttatacgacc ccacattatc ggtaactcct
      841 attcccagaa caaaaccaca agctaaattt atcactcgtg aagaatttga tgaaaaagta
      901 gaacaaatca caaatgatac ttctcgtcag ctaactagac tgttatttta ttctggtctt
      961 agaataggag aagctttagc tttgcagtgg aaagattacg ataaaataaa aggcgaaatt
     1021 gacgtaaata agaaaatcaa tttaagtaat agaaaaattg aatataatct aaaaaaagaa
     1081 agctctaaag ggataatacc tgtaccaaat ttaattagag agatgcttaa aaacatgtat
     1141 aatgaatctt ctaaaagata taaatatttt gacgaaaact attttatatt cgggggttta
     1201 gaacctatta gatacgttac ttattcgtat cattttaaat ctgtattccc gaatctaaaa
     1261 atacaccatt taagacactc gtacgctagc tatttaatta ataatggtgt agatatgtat
     1321 ttattaatgg aattaatgag gcattctaac attacagaaa caattcaaac gtactctcat
     1381 ttatatactg ataaaaaaca tcaagctatg agcatatttg attaaacggt atcaaattgg
     1441 tatcaaataa caattaagga gtttataaaa tgcgtaataa caagcctaaa ataagtattc
     1501 aaaacgaccc atgggaagtg aaatttatat acatttaaat ttcatgagac aataaacgtt
     1561 gatttaatgc gtttttttgc cttttttatt ttccttattt tttctgtttt acaacaaaat
     1621 ggtatcaaaa atggtatcat ttgtagttat tttagcttca catattaaaa caaccacact
     1681 cctaaattaa taggtggtgt ggttttgttg gttgtgtggg gataaaaata accgcatcag
     1741 t
//
LOCUS       BSTNPRAS     3510 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  B.stearothermophilus neutral protease (nprS), and transcriptional
            activator (nprA) genes, complete cds.
ACCESSION   M34237
KEYWORDS    neutral protease; transcriptional activator.
SOURCE      B.stearothermophilus (strain TELNE) DNA, clone pSP53.
  ORGANISM  Bacillus stearothermophilus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 3510)
  AUTHORS   Nishiya,Y. and Imanaka,T.
  TITLE     Cloning and nucleotide sequences of the neutral protease gene and
            its transcriptional activator gene from Bacillus stearothermophilus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by Y.Nishiya, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        181     1401     transcriptional activator (nprA)
    pept       1750     3405     neutral protease (nprS) precursor
    sigp       1570     2274     neutral protease (nprS) signal and propeptide
    matp       2275     3219     neutral protease (nprS)
BASE COUNT     1177 a    581 c    737 g   1015 t
ORIGIN      
        1 tacggtcttc agacatttct attcctatag cccaaatgag tagttccttt tggaggagaa
       61 aatgtgtata atttttagta aatttatatt agtaaaaaat taagaaggag taggtattat
      121 ttgaagattg gtgatcgctt aaaattttcc cgtatcaaac ataagttaac gcaagaggaa
      181 gtggctgacg gaattatttc cgtatcatat ttatcaaaaa ttgaaaacaa tcaagtggtt
      241 ccaagtgaag aagtgcttcg cctcctttgt caacggttgg gaatcaacaa tatcctgaaa
      301 aatagacaag atgaattaac aagtaaattg ttattatggt acaaaacgat tacggataaa
      361 aaccgacagg aagcagcccg gatgtacgag gaaatcaaac gaactttcga tgacgtccag
      421 ggggcggaat ccatcgctta ctttctgttg tttgaaatgc gctatcactt gttattaaaa
      481 gatattcata ctgtcgaagc gttgttgatc aaattaaggg aattgtatga cacctttgat
      541 gatgtgatga agtattatta ttataaattt ttaggtctac tttactattg caaggaaaaa
      601 tatgaagatg ctttggaata ttataaaaag gcggagcagc gatttcgaag ccaatcattt
      661 gaaaaatggg aagaagctga tttgcattat ttactagcgc ttgtttatag ccggctctgg
      721 agaatattag gctgtattaa ctatgcgcag catgctttag cgatttacca atccgaatac
      781 gatttaaagc gaagcgctga atgccacatt ttacttggta tttgttacag aaggtacgga
      841 gaagtagatc aagcgatcga atgctattca ttggcccata aaattgccca aatcattaat
      901 gataccgaat tattaggtac gattgagcat aacctaggct acttaatgtc aatgaaacat
      961 gagcattatg aagccattca gcattataag aagagtttgc tgtataagcg aaactcttca
     1021 ttacaagcta gatttattac gttgttttct ctcatcaaag aatattatgt ttccaaaaac
     1081 tataaaaaag cattagccaa tgtagaggaa agtttgcagc ttctcaagag ggaaaaagat
     1141 gggatgacaa cgtattatga atattatctt catttcacag tttatcaata tttactatca
     1201 gaagatattt cggaaaatga atttgaaaca tttatgaaag atcgagtgct cccttatttt
     1261 caaaggttta aaaaatatga agatgttgca caatacgctg aatacttggc aatctattac
     1321 gagaaacgtc ataagtataa actagcaagc aaattctata aaatgagtta tcaatttcta
     1381 aaaaatatga taaatattta ggagggattt ttttgaaaaa gcttttatta ggaatcatga
     1441 cgtttggtat tatgagttta cttgttctca ttggtagtga ccaagaacca aaatatgtgg
     1501 caaaagacga acatccgcct ccaaccatca tcattgcagc gaaagatgaa catccaccag
     1561 caacgattat ttgaagagga ataagcaaaa agacagctag ttttctagct gtcttttttc
     1621 atgcatagga aaatgtgaaa aaaacgtagg gaattatcaa ctatatcaga ctctattttt
     1681 cccaatacaa aatactgtaa aatattgtgt ttaatattct aaatacaaag aataaaggag
     1741 gatgaaaaaa tgaaaaggaa aatgaaaatg aaattagtac gttttggtct tgcagcagga
     1801 ctagcggccc aagtattttt tttaccttac aatgcgctgg cttcaacgga acacgttaca
     1861 tggaaccaac aatttcaaac ccctcaattc atctccggtg atctgctgaa agtgaatggc
     1921 acatccccag aagaactcgt ctatcaatat gttgaaaaaa acgaaaacaa gtttaaattt
     1981 catgaaaacg ctaaggatac tctacaattg aaagaaaaga aaaatgataa ccttggtttt
     2041 acgtttatgc gcttccaaca aacgtataaa gggattcctg tgtttggagc agtagtaact
     2101 gcgcacgtga aagatggcac gctgacggcg ctatcaggga cactgattcc gaatttggac
     2161 acgaaaggat ccttaaaaag cgggaagaaa ttgagtgaga aacaagcgcg tgacattgct
     2221 gaaaaagatt tagtggcaaa tgtaacaaag gaagtaccgg aatatgaaca gggaaaagac
     2281 accgagtttg ttgtttatgt caatggggac gaggcttctt tagcgtacgt tgtcaattta
     2341 aactttttaa ctcctgaacc aggaaactgg ctgtatatca ttgatgccgt agacggaaaa
     2401 attttaaata aatttaacca acttgacgcc gcaaaaccag gtgatgtgaa gtcgataaca
     2461 ggaacatcaa ctgtcggagt gggaagagga gtacttggtg atcaaaaaaa tattaataca
     2521 acctactcta cgtactacta tttacaagat aatacgcgtg gaaatgggat tttcacgtat
     2581 gatgcgaaat accgtacgac attgccggga agcttatggg cagatgcaga taaccaattt
     2641 tttgcgagct atgatgctcc agcggttgat gctcattatt acgctggtgt gacatatgac
     2701 tactataaaa atgttcataa ccgtctcagt tacgacggaa ataatgcagc tattagatca
     2761 tccgttcatt atagccaagg ctataataac gcattttgga acggttcgca aatggtgtat
     2821 ggcgatggtg atggtcaaac atttattcca ctttctggtg gtattgatgt ggtcgcacat
     2881 gagttaacgc atgcggtaac cgattataca gccggactca tttatcaaaa cgaatctggt
     2941 gcaattaatg aggcaatatc tgatattttt ggaacgttag tcgaatttta cgctaacaaa
     3001 aatccagatt gggaaattgg agaggatgtg tatacacctg gtatttcagg ggattcgctc
     3061 cgttcgatgt ccgatccggc aaagtatggt gatccagatc actattcaaa gcgctataca
     3121 ggcacgcaag ataatggcgg ggttcatatc aatagcggaa ttatcaacaa agccgcttat
     3181 ttgattagcc aaggcggtac gcattacggt gtgagtgttg tcggaatcgg acgcgataaa
     3241 ttggggaaaa ttttctatcg tgcattaacg caatatttaa caccaacgtc caactttagc
     3301 caacttcgtg ctgccgctgt tcaatcagcc actgacttgt acggttcgac aagccaggaa
     3361 gtcgcttctg tgaagcaggc ctttgatgcg gtaggggtga aataaagtgg tatctcatca
     3421 gtgggggatt ttttcctcca ctgatgtttt gtttgtgatc ttttaatgat gtattggggt
     3481 gcaaaatgcc caaaggctta taatgttgat
//
LOCUS       HSEGP14      3347 bp ds-DNA             VRL       12-JUL-1990
DEFINITION  Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete
            cds.
ACCESSION   M34861
KEYWORDS    glycoprotein 14.
SOURCE      Equine herpesvirus type 1 DNA.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 3347)
  AUTHORS   Guo,P.
  TITLE     Characterization of the gene and an antigenic determinant of equine
            herpesvirus type-1 glycoprotein 14 with homology to gB-equivalent
            glycoproteins of other herpesviruses
  JOURNAL   Gene 87, 249-255 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        300     3239     glycoprotein 14 (gp14)
BASE COUNT      885 a    891 c    851 g    720 t
ORIGIN      
        1 tacaacggtt gaaacgtggt gtacgcatct caagagacta gctcgtttat gataactgcg
       61 gctaaaggtg aattggtcaa ttagcgaagt ttcaaaggtt ttattgcttt gaagggagtg
      121 acaggtgtga cggccacgca gcggctggcg tgaaatatat cggggagctc atcctagccg
      181 ccgcagtatt ctcctcggtt ttccactgtg gagaggtgcc tcctgcgcgc agatcgtacc
      241 tacccggact ccgcgccaca gtgctgcgtg agcggcattt acataaccta cgaggcgtca
      301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg
      361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct
      421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa
      481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct
      541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt
      601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac
      661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc
      721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg
      781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct
      841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat
      901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga
      961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata
     1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt
     1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg
     1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg
     1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt
     1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc
     1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag
     1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact
     1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg
     1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tcttacttta
     1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag
     1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta
     1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc
     1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg
     1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca
     1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa
     1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg
     1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac
     2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg
     2101 cgtggtgtcc cctccaaaac aaagagcggc ccctatggaa cgaaatggtg aagattaccc
     2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg
     2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc
     2281 gctcgatgga cagtaacact tgctactccc gcccccccgt aacatttaca attactaaga
     2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg
     2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag
     2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg
     2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc
     2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac
     2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg
     2701 acaataccgc agtgattatc aggggatcgc cagctttttc aagggcctgg gtaaagtggg
     2761 ggaggccgtg ggaacgctcg ttctcggcgc gcggcgctgt tgtttcaacc gtatctggaa
     2821 tagcttgctt tttaaacaac ccatttgggg ggctagccat cggcctgctg gtaatcgccg
     2881 gcctggtagc tgcgtttttt gcttacagat atgtaatgca gatccgcagt aaccccatga
     2941 aagctctata ccccataaca acaaaggcct tgaaaaacaa agccaaaact tcctacggcc
     3001 agaacgagga ggacgatggg agcgactttg atgaggccaa gcttgaagag gctcgcgaaa
     3061 tgatcaaata catgtctatg gtttcggccc tggaaaagca ggaaaagaaa gctataaaga
     3121 aaaacagtgg ggttggcctg atcgccagta acgtctcaaa gctggccctg cgaaggcgcg
     3181 gtcccaaata tacccgactc caacagaacg ataccatgga aaatgaaaaa atggtttaaa
     3241 catgtttaat aaatattatg acacgtactc aaagtgtgac ctcatatttg cataaccact
     3301 tctagttccg gcccaaggat atttaagcct agtatctccg ccgaagg
//
LOCUS       HUMHBGAA     1227 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human A-gamma-globin gene, 3' end.
ACCESSION   M33200
KEYWORDS    A-gamma-globin.
SOURCE      Human (hereditary persistence of fetal hemoglobin individual II-1)
            DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1227)
  AUTHORS   Gelinas,R.E., Rixon,M., Magis,W. and Stamatoyannopoulos,G.
  TITLE     Gamma gene promoter and enhancer structure in Seattle variant of
            hereditary persistence of fetal hemoglobin
  JOURNAL   Blood 71, 1108-1112 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1        3     A-gamma-globin (AA at 1)
                                 /hgml_locus_uid="LK0092S" /nomgen="HBG1"
                                 /map="11p15.5"
    mut         794      794     t in wt; c in mutant
    mut         970      970     c in wt; a in mutant
    mut        1186     1186     a in wt; g in mutant
BASE COUNT      366 a    204 c    294 g    363 t
ORIGIN      
        1 tgagcctctt gcccatgatt cagagctttc aaggataggc tttattctgc aagcaataca
       61 aataataaat ctattctgct gagagatcac acatgatttt cttcagctct tttttttaca
      121 tctttttaaa tatatgagcc acaaagggtt tatattgagg gaagtgtgta tgtgtatttc
      181 tgcatgcctg tttgtgtttg tggtgtgtgc atgctcctca tttattttta tatgagatgt
      241 gcattttgtt gagcaaataa aagcagtaaa gacacttgta cacgggagtt ctgcaagtgg
      301 gagtaaatgg tgtaggagaa atccggtggg aagaaagacc tctataggac aggacttctc
      361 agaaacagat gttttggaag agatgggaaa aggttcagtg aagacctggg ggctggattg
      421 attgcagctg agtagcaagg atggttctta atgaagggaa agtgttccaa gctttaggaa
      481 ttcaaggttt agtcaggtgt agcaattcta ttttattagg aggaatacta tttctaatgg
      541 cacttagctt ttcacagccc ttgtggatgc ctaagaaagt gaaattaatc ccatgccctc
      601 aagtgtgcag attggtcaca gcatttcaag ggagagacct cattgtaaga ctctggggga
      661 ggtggggact taggtgtaag aaatgaatca gcagaggctc acaagtcagc atgagcatgt
      721 tatgtctgag aaacagacca gcactgtgag atcaaaatgt agtgggaaga atttgtacaa
      781 cattaattgg aaggtttact taatggaatt tttgtatagt tggatgttag tgcatctcta
      841 taagtaagag tttaatatga tggtgttacg gacctaatgt ttgtgtctcc tcaaaattca
      901 catgctgaat ccccaactcc caactgacct tatctgtggg ggaggctttt gaaaagtaat
      961 taggtttagc tgagctcata agagcagatc cccatcataa aattattttc cttatcagaa
     1021 gcagagagac aagccatttc tctttcctcc cggtgaggac acagtgagaa gtccgccatc
     1081 tgcaatccag gaagagaacc ctgaccacga gtcagccttc agaaatgtga gaaaaaactc
     1141 tgttgttgaa gccacccagt cttttgtatt ttgttatagc accttacact gagtaaggca
     1201 gatgaagaag gagaaaaaaa taagctt
//
LOCUS       HUMHBQ1A     1114 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human theta-1-globin gene, complete cds.
ACCESSION   M33022
KEYWORDS    theta-1-globin.
SOURCE      Human black female with alpha-thal-2 heterozygosity white cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1114)
  AUTHORS   Gonzalez-Redondo,J.M., Han,I.S., Gu,Y.-C. and Huisman,T.H.J.
  TITLE     Nucleotide sequence of the human theta-1-globin gene
  JOURNAL   Biochem. Genet. 26, 207-211 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        359      453     theta-1-globin, exon 1
                                 /hgml_locus_uid="LV0155X" /nomgen="HBQ1"
                                 /map="16p13.3"
                538      742     theta-1-globin, exon 2
                852      980     theta-1-globin, exon 3
    IVS         454      537     theta-1-globin intron A
    IVS         743      851     theta-1-globin intron B
BASE COUNT      166 a    386 c    393 g    169 t
ORIGIN      
        1 atcccagtta ctcgggaggc tgaggcagga gaatcgtttg aacccgggag gcggaggttg
       61 cagtgagccg gaatggcgcc actgcactca ccgcacccgg ccaatttttg tgtttttagt
      121 agagactaaa taccatatag tgaacaccta agacgggggg ccttggatcc agggcgattc
      181 agagggcccc ggtcggagct gtcggagatt gagcgcgcgc ggtcccggga tctccgacga
      241 ggccctggac ccccgggcgg cgaagctgcg gcgcggcgcc ccctggaggc cgcgggaccc
      301 ctggccggtc cgcgcaggcg cagcggggtc gcagggcgcg gcgggttcca gcggggggat
      361 ggcgctgtcc gcggaggacc gggcgctggt gcgcgccctg tggaagaagc tgggcagcaa
      421 cgtcggcgtc tacacgacag aggccctgga aaggtgcggc aggctgggcg cccccgcccc
      481 caggggccct ccctccccaa gccccccgga cgcgcctcac ccacgttcct ctcgcaggac
      541 cttcctggct ttccccgcca cgaagaccta cttctcccac ctggacctga gccccggctc
      601 ctcacaagtc agagcccacg gccagaaggt ggcggacgcg ctgagcctcg ccgtggagcg
      661 cctggacgac ctaccccacg cgctgtccgc gctgagccac ctgcacgcgt gccagctgcg
      721 agtggacccg gccagcttcc aggtgagcgg ctgccgtgct gggcccctgt ccccgggagg
      781 gccccggcgg ggtgggtgcg gggggcgtgc ggggcgggtg caggcgagtg agccttgagc
      841 gctcgccgca gctcctgggc cactgcctgc tggtaaccct cgcccggcac taccccggag
      901 acttcagccc cgcgctgcag gcgtcgctgg acaagttcct gagccacgtt atctcggcgc
      961 tggtttccga gtaccgctga actgtgggtg ggtggccgcg ggatccccag gcgaccttcc
     1021 ccgtgtttga gtaaagcctc tcccaggagc agccttcttg ccgtgctctc tcgaggtcag
     1081 gacgcgagag gaaggcgccg cccctcccca agga
//
LOCUS       HUMITIH1A    1149 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human inter-alpha-trypsin inhibitor heavy chain mRNA, partial cds.
ACCESSION   M33033
KEYWORDS    inter-alpha-trypsin inhibitor heavy chain.
SOURCE      Human, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1149)
  AUTHORS   Salier,J.-P., Diarra-Mehrpour,M., Sesbouee,R., Bourguignon,J. and
            Martin,J.-P.
  TITLE     Human inter-alpha-trypsin inhibitor: Isolation and characterization
            of heavy (H) chain cDNA clones coding for a 383 amino-acid sequence
            of the H chain
  JOURNAL   Biol. Chem. Hoppe-Seyler 369, 15-18 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >  1149     inter-alpha-trypsin inhibitor heavy chain (AA
                                 at 1) /hgml_locus_uid="LE0221G"
                                 /nomgen="ITIH1" /map="3p21.2-p21.1"
BASE COUNT      332 a    292 c    253 g    272 t
ORIGIN      
        1 ggaggcacaa acatcaacga agcactccta cgggcaatct tcattttgaa tgaagccaat
       61 aacttgggac tgttagaccc caactccgtc tcgctgatca ttttggtttc tgatggagat
      121 ccaacagtgg gcgaactaaa actgtcaaaa attcagaaaa acgttaagga gaacatccaa
      181 gacaatatct ccttgttcag tttgggcatg ggatttgatg tggactatga ttttttgaag
      241 agactgtcca atgaaaacca tggaattgca caaaggattt atggaaacca ggacacgtct
      301 tcccagctta agaaattcta caaccaggtc tccactccat tgctccggaa tgttcagttc
      361 aactatcccc atacatcagt cacggacgtc actcaaaaca atttccataa ctactttgga
      421 ggctcagaga ttgtggtggc aggaaaattt gaccctgcta aattggatca aatagagagc
      481 gttatcacgg cgacttcggc taacacgcag ttagtcttgg agaccctggc ccagatggac
      541 gacttgcagg attttctatc gaaagacaag catgcagatc ccgatttcac caggaaactg
      601 tgggcctatc taaccatcaa ccaactgcta gctgaacgaa gcctggctcc tacagctgcc
      661 gccaagagaa gaattacaag atcgatcctg cagatgtctc tagaccacca cattgtgact
      721 ccgctgacct cgctggtgat cgagaacgag gctggggatg agcgcatgct ggcggatgcc
      781 ccaccgcagg atccctcctg ctgctcaggg gccctgtatt acggcagcaa agtggttcca
      841 gattccaccc cgtcttgggc caatccttca gcaacgcccg tgatctccat gctggcacaa
      901 ggatctcagg tgctagagtc cacgccaccc ccacatgtga tgagagttga aaatgaccca
      961 cattccatca tttatctacc aaaaagccaa aagaacattt gtttcaatat tgactcagaa
     1021 cctggaaaaa tcctcgacct ggcttctgac ccagaatcag gaattgtagt caacggtcag
     1081 cttgttggtg ccaagaagcc caacaatgga aaactaagca cctattttgg aaaactggga
     1141 ttttatttc
//
LOCUS       HUMPTHROM     327 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human thrombin mRNA, 5'end.
ACCESSION   M33031
KEYWORDS    serine protease; thrombin.
SOURCE      Human, cDNA to mRNA, clone pIIH13.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 327)
  AUTHORS   MacGillivray,R.T.A., Irwin,D.M., Guinto,E.R. and Stone,J.C.
  TITLE     Recombinant genetic approaches to functional mapping of thrombin
  JOURNAL   Ann. N.Y. Acad. Sci. 485, 73-79 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         28  >   327     thrombin precursor /hgml_locus_uid="LD0134L"
                                 /nomgen="F2" /map="11p11-q12"
    sigp         28      156     thrombin signal peptide
    matp        157  >   327     prothrombin
BASE COUNT       60 a     97 c    109 g     61 t
ORIGIN      
        1 ccgtagtgac ccaggagctg acacactatg gcccgcatcc gaggcttgca gctgcctggc
       61 tgcctggccc tggctgccct gtgtagcctt gtgcacagcc agcatgtgtt cctggctcct
      121 cagcaagcac ggtcgctgct ccagcgggtc cggcgagcca acaccttctt ggaggaggtg
      181 cgcaagggca acctggagcg agagtgcgtg gaggagacgt gcagctacga ggaggccttc
      241 gaggctctgg agtcctccac ggctacggat gtgttctggg ccaagtacac agcttgtgag
      301 acagcgagga cgcctcgaga taagctt
//
LOCUS       MUSCC3A       312 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Mouse complement component C3 mRNA, partial cds.
ACCESSION   M33032
KEYWORDS    complement component C3.
SOURCE      Mouse liver, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Fey,G.H., Wiebauer,K. and Domdey,H.
  TITLE     Amino acid sequences of mouse complement C3 derived from nucleotide
            sequences of cloned cDNA
  JOURNAL   Ann. N.Y. Acad. Sci. 421, 307-312 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     complement component C3 precursor (AA at 1)
    matp    <     1       27     complement component C3-beta subunit (AA at 1)
    matp         40      273     complement component C3-alpha subunit
    matp        274  >   312     complement component C3-alpha' subunit
BASE COUNT       90 a     77 c     90 g     55 t
ORIGIN      
        1 gatcttgagt gcaccaagcc agcagcccgc cgccgtcgct cagtacagtt gatggaaaga
       61 aggatggaca aagctggtca gtacactgac aagggtcttc ggaagtgttg tgaggatggt
      121 atgcgggata tccctatgag atacagctgc cagcgccggg cacgcctcat cacccagggc
      181 gagaactgca taaaggcctt catagactgc tgcaaccaca tcaccaagct gcgtgaacaa
      241 cacagaagag accacgtgct gggcctggcc aggagtgaat tggaggaaga cataattcca
      301 gaagaagata tt
//
LOCUS       MUSN038A     1260 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Mouse nucleolar protein N038 mRNA, complete cds.
ACCESSION   M33212
KEYWORDS    nucleolar protein N038.
SOURCE      Mouse teratocarcinoma stem cell line F9, cDNA to mRNA, clone
            lambda-FML-185.19.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1260)
  AUTHORS   Schmidt-Zachmann,M.S. and Franke,W.W.
  TITLE     DNA cloning and amino acid sequence determination of a major
            constituent protein of mammalian nucleoli: Correspondence of the
            nucleoplasmin-related protein N038 to mammalian protein B23
  JOURNAL   Chromosoma 96, 417-426 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         79      957     nucleolar protein N038
    mRNA    <     1     1260     nucleolar protein N038 mRNA
BASE COUNT      419 a    214 c    306 g    321 t
ORIGIN      
        1 ggcgcgtctg ttctgtggaa caggaggcag ttgttttccg tccggcttct cccacaccga
       61 agtgcgcgcc tccacctcat ggaagactcg atggatatgg acatgagtcc tcttaggcct
      121 cagaactacc ttttcggctg tgaactaaag gctgacaaag actatcactt taaagtggat
      181 aatgatgaaa atgagcacca gttgtcatta agaacggtca gtttaggagc aggggcaaaa
      241 gatgagttac acatcgtaga ggcagaagca atgaactatg aaggcagtcc aattaaagta
      301 acactggcaa ctttgaaaat gtctgtacaa ccaacagttt ccctaggggg ctttgaaatt
      361 acaccacctg tggtcttacg gttgaagtgt ggttcagggc ctgtgcacat tagtggacag
      421 catctagtag ctgtagagga agatgcagag tctgaagatg aagatgagga ggacgtaaaa
      481 ctcttaggca tgtctggaaa gcgatctgct cctggaggtg gtaacaaggt tccacagaaa
      541 aaagtaaaac ttgatgaaga tgatgaggac gatgatgagg acgatgagga tgatgaggat
      601 gatgatgatg atgattttga tgaagaggaa actgaagaaa aggtcccagt gaagaaatct
      661 gtacgagata ccccagccaa aaatgcacaa aaatcaaacc aaaatggaaa agacttaaaa
      721 ccatcaacac cgagatcaaa gggtcaagag tccttcaaaa aacaggaaaa gactcctaaa
      781 acaccaaaag gacctagttc tgtagaagac attaaggcaa aaatgcaagc aagtatagaa
      841 aaaggcggtt ctcttcccaa agtggaagcc aagttcatta attatgtgaa gaattgtttc
      901 cggatgactg accaggaggc tattcaagat ctctggcagt ggaggaaatc tctttaagaa
      961 aagggtttaa acagtttgaa atattctgtc ttcatttctg taatagttaa tatctggctg
     1021 tcctttttat aatgcaaagt gagaactttc cctactgtgt ttgataaatg ttgtccaggt
     1081 tcacttgcca agaatgtgtt gtctaaaatg cctgtttagt tttcaaggat ggaactccac
     1141 cctttacttg gttttaagta tgtatggaat gttatgatag gacatagtaa tagtggtcag
     1201 atgtggaaat ggtagggaga caaatataca tgtgaaataa actcagtatt ttaataaagt
//
LOCUS       RATPOS       1804 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Rat type-2A protein phosphatase catalytic subunit mRNA, complete
            cds.
ACCESSION   M33114
KEYWORDS    type-2A protein phosphatase catalytic subunit.
SOURCE      Rat liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1804)
  AUTHORS   Kitagawa,Y., Tahira,T., Ikeda,I., Kikuchi,K., Tsuiki,S.,
            Sugimura,T. and Nagao,M.
  TITLE     Molecular cloning of cDNA for the catalytic subunit of rat liver
            type 2A protein phosphatase, and detection of high levels of
            expression of the gene in normal and cancer cells
  JOURNAL   Biochim. Biophys. Acta 951, 123-129 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        114     1043     type-2A protein phosphatase catalytic subunit
    mRNA    <     1     1804     type-2A protein phosphatase catalytic subunit
                                 mRNA
BASE COUNT      482 a    391 c    437 g    494 t
ORIGIN      
        1 ctggggccgc aggaagcacc ccggggagcg gcggcggcgt gtgcgtgtgg cccgggtgcg
       61 ggcggcggcg cgggagcagc gcagagcggc agccggttcg ggcgggcggc atcatggacg
      121 agaagttgtt caccaaggag ctggaccagt ggatcgagca gctgaacgag tgcaagcagc
      181 tctccgagtc ccaggtcaag agcctctgcg agaaggctaa agaaatcctg acaaaagaat
      241 ctaatgttca ggaggttcga tgtccagtca ctgtgtgtgg agatgtgcat gggcaatttc
      301 atgacctcat ggaactcttt agaattggtg gtaaatcacc agatacaaat tacttgttta
      361 tgggagacta tgtggacaga ggatattact cagttgaaac agttacactg cttgtagctc
      421 ttaaggttcg ttaccgagag cgtatcacca tactccgagg gaatcacgag agcagacaga
      481 tcacacaagt ttatggtttc tacgatgagt gtttaaggaa atacggaaat gcaaatgttt
      541 ggaaatactt cacagacctt tttgactacc ttcctctcac tgccttggtg gatgggcaga
      601 tcttctgtct acatggtggt ctttcaccat ccatagacac actggatcac atccgagcac
      661 ttgatcgcct acaagaagtt cctcatgagg gtccaatgtg tgacttgctg tggtcagatc
      721 cagatgaccg tggtggctgg gggatatctc ctcggggagc tggttatacc tttggccaag
      781 atatttctga gacatttaat catgccaatg gcctcacgtt ggtgtccaga gctcaccagc
      841 tggtgatgga gggatataac tggtgccatg accggaatgt agtaacaatt ttcagtgctc
      901 caaactattg ctatcgttgt ggtaaccaag ctgcaatcat ggaacttgat gacactctta
      961 agtattcttt cttgcagttc gatccagcac ctcgtagagg cgagccacat gtcactcgtc
     1021 gtaccccaga ctacttcctg taatgaaagt ttaaccttgt acagtattgc catgaacacc
     1081 gtctgttgac ctaatggaat cgggaagagc agcagtaact ccaaagtgtc agaaatagtt
     1141 aacattcaaa cttgtttcca cacggaccaa aagatgtgcc atataaaata caaagcctct
     1201 tgtcatcaac agccgtgacc actttagaat gaaccagttc attgcatgct gacgcgacat
     1261 tgttggtcaa gaatccagtt tctggcatag cgctatttgt agttactttt gctttcttga
     1321 gagactgcag atctaggatg taacattaac acctgtgagt ccagttgact tccacttagc
     1381 tgtagcttac tcagcatgac tgtagatgag gatagcaaac aatcattgga gcttaatgaa
     1441 catttttaaa tgagtaccaa ggcctcccct cttgttgtgt tctttcaggg atactattaa
     1501 tttaattgta tgatttctct gcactcagtt tctcccttct caaatctcgg ccccgcgttg
     1561 ttctttgtta ctgtcagaaa acctggtgag ttgttttgaa cagaactgtc tccctcctgt
     1621 aagatgatgt actgcacaag tcaccgcagt gttttcataa taaacttgag aactgagaaa
     1681 gtcaggtttg aattgtatca gtgggcacga ctggtgctgt ttattaaaca agataaatct
     1741 attgatcaat ttcagaattt gtagaattcc aggtaaagaa aaataaagat caaggccact
     1801 atat
//
LOCUS       RATSCP2      1409 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Rat sterol carrier protein-2 (SCP-2)mRNA, complete cds.
ACCESSION   M34728
KEYWORDS    sterol carrier protein-2.
SOURCE      Rat liver, cDNA to mRNA, clone SP43.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1409)
  AUTHORS   Billheimer,J.T., Strehl,L.L., Davis,G.L., Strauss,J.F.III. and
            Davis,L.G.
  TITLE     Characterization of a cDNA encoding rat sterol carrier protein-2
  JOURNAL   DNA Cell Biol. 9, 159-165 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        307     1128     sterol carrier protein-2 (SCP-2)
    mRNA    <     1     1409     sterol carrier protein-2 mRNA
BASE COUNT      387 a    300 c    390 g    332 t
ORIGIN      2 bp upstream of EcoRI site.
        1 ggaattccga acaaaggttg aacactttgc aaaaattgga tggaaaaatc ataaacactc
       61 agttaataac ccgtattccc agttccaaga tgaatacagc ttagatgaga taatgaaatc
      121 aaggccagtt ttcgattttc tgactgtctt acaatgctgt cccacctcag atggtgccgc
      181 agcagcaatt gtgtctagtg aggagtttgt gcagaagcat ggcctgcagt ccaaagctgt
      241 ggaaattgtg gcacaggaga tggtgactga catgcccagt acatttgaag aaaaagtgtt
      301 attaaaatgg ttggctatga tatgagtaaa gaagctgcca ggaagtgcta tgagaagtcc
      361 ggcctgggtc ccagtgatgt cgacgtgata gagcttcacg attgcttctc taccaatgaa
      421 ctcctgactt atgaagcact ggggctctgt ccagaaggac aaggtggagc actggtggac
      481 agaggggaca acacttacgg aggaaagtgg gtcataaacc ctagtggagg cctcatctcc
      541 aagggacacc cactgggtgc cacaggtctg gctcagtgcg cggagctctg ctggcagctg
      601 agaggcgaag ccggaaagag gcaggttcct ggggcaaagg tggctctgca gcacaattta
      661 ggccttggag gagctgctgt tgtcaccctc tacagaatgg gttttcccga agctgccagc
      721 tccttcagaa cgcaccagat ttcagctgct cccaccagct ctgcagggga tggattcaag
      781 gcaaatctca tttttaagga aatcgagaag aagcttgaag aggaagggga agagttcgtg
      841 aagaaaatcg gtggcatttt tgccttcaaa gtgaaggatg gccccggggg caaagaagct
      901 acgtgggtgg tggacgtgaa gaacggcaaa ggatcggtgc ttccggattc agataagaag
      961 gctgactgca caatcaccat ggctgactca gacttgctgg ctttgatgac tggtaaaatg
     1021 aaccctcagt cggccttctt tcaaggtaaa ctgaaaattg ccggtaacat gggcctggcc
     1081 atgaaactgc aaagcctgca gcttcagccg gacaaagcta agctgtgaag agtccctttg
     1141 gcaacctcag gacatcaaga tgagatgtgt ggatacgtag aaatccacgt ctccctgtca
     1201 ggacttagac tgacacttcc tgaatagcat gagatagatt tcttgctagg tggctatggc
     1261 caattgtatt tcccccaagc tgggggtgca aagggcctcc caggctacac tgctgctttg
     1321 aggacttgca ttctactgtg cttcatgaag ctactatgtt aatgatggtt tggggtaaac
     1381 ttgagtttca gaataaagtt cagaatagt
//
LOCUS       SYNPSBAII     556 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  Synechococcus sp. photosystem II D1 protein (psbAII) gene, 5' end.
ACCESSION   M34833
KEYWORDS    D1 protein; photosystem II.
SOURCE      Synechococcus (strain PCC 7942) DNA.
  ORGANISM  Synechococcus sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Chroococcales.
REFERENCE   1  (bases 1 to 556)
  AUTHORS   Bustos,S.A., Schaefer,M.R. and Golden,S.S.
  TITLE     Different and rapid responses of four cyanobacterial psbA
            transcripts to changes in light intensity
  JOURNAL   J. Bacteriol. 172, 1998-2004 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         81      425     ORF1
    pept        527  >   556     photosystem II D1 protein (psbAII)
    mRNA         59  >   556     psbAII mRNA (alt.)
    mRNA        478  >   556     psbAII mRNA (alt.)
BASE COUNT      136 a    145 c    142 g    133 t
ORIGIN      
        1 ttccgtgacg gctactgcca gcatgccgag cctgatgtgt gacacctaag atcactccag
       61 ttctctttgg aaactggctg atgagtgaag acaccatctt tggcaagatc atccggcgcg
      121 agattccagc agacattgtt tatgaagatg atctctgtct ggcttttcga gatgtggcac
      181 cccaagcgcc ggttcacatt ctggtgattc ccaagcaacc aattgccaac cttttggaag
      241 cgacagcaga acatcaagcg ctgctgggtc atttgttgct gactgtaaag gcgatcgcgg
      301 cccaagaagg actcaccgag ggctaccgca ccgtgattaa cacgggccct gcgggtgggc
      361 aaaccgttta ccacctgcat attcacttac tgggcgggcg atcgctggct tggccgcccg
      421 gctgagaaaa gtctgaaagt tctttacaaa actcaatctg cttgttagat tttactcacg
      481 aggctattaa gtctcgtaaa tagttcaact aaggactcat cgcaaaatga cgactgcatt
      541 gcagcggcgc gagagc
//
LOCUS       ABCAARAA     1624 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  A.aceti acetic acid resistance protein (aarA) gene, complete cds.
ACCESSION   M34830
KEYWORDS    acetic acid resistance protein.
SOURCE      A.aceti (strain 10-8) DNA, clone pAR1611.
  ORGANISM  Acetobacter aceti
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Azotobacteraceae.
REFERENCE   1  (bases 1 to 1624)
  AUTHORS   Fukaya,M., Takemura,H., Okumura,H., Kawamura,Y., Horinouchi,S. and
            Beppu,T.
  TITLE     Cloning of genes responsible for acetic acid resistance in
            acetobacter aceti
  JOURNAL   J. Bacteriol. 172, 2096-2104 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        185     1495     acetic acid resistance protein (aarA)
    signal     1508     1545     transcription termination signal
    binding     171      176     ribosomal binding site (put.)
BASE COUNT      400 a    446 c    404 g    374 t
ORIGIN      
        1 gcatgcattt gcacacattc gcgcgaccct aagcccaaaa aactgtggtt ttccaagcat
       61 actcctttcc gataacgctt cgtttatcgc tggcaacctt ccggtttcct tttgaatgag
      121 tgacaaagtg tgacgagcag gccgcagcag cgaccgtggc ccaaccatgc agaaggaaac
      181 actaatgagc gcgtcgcaga aagaaggtaa gctatctacc gctaccattt cggttgatgg
      241 aaaatccgcc gaaatgcctg tgctttcagg cactctggga ccggatgtta tcgacatccg
      301 caaacttccg gcgcaactgg gcgttttcac gtttgaccca ggttacgggg aaacagcggc
      361 ctgcaacagc aaaatcacct ttattgatgg tgataaaggc gttctgctgc accgtggtta
      421 ccctattgcg cagctggacg aaaatgcttc ctacgaagaa gttatttatc tgcttttgaa
      481 tggcgaactg cccaacaagg tgcagtacga caccttcacc aacaccctta caaaccatac
      541 gctgctgcac gagcagatcc gtaacttctt taacggcttc cggcgtgatg cccacccaat
      601 ggccattctg tgtggtacgg ttggggcttt gtctgccttc tacccagatg ccaacgatat
      661 tgccattccc gccaatcggg atctggccgc catgcggctg attgccaaaa tcccaaccat
      721 tgcggcatgg gcttacaaat acacgcaggg tgaagccttt atctacccgc ggaatgatct
      781 gaactacgca gaaaacttcc tgtccatgat gttcgcgcgc atgtccgaac cttacaaggt
      841 caaccctgtt ctggcccgcg ccatgaaccg gattctgatt ctgcatgccg atcatgagca
      901 gaatgcctct acctccaccg tacgtctggc tggttctaca ggggccaatc cgtttgcctg
      961 tattgctgcg ggcattgccg ctctgtgggg acctgcacat ggtggcgcaa acgaagctgt
     1021 gctgaaaatg ctggcccgta ttggcaagaa agaaaatatt cctgccttta tcgcacaggt
     1081 gaaggacaag aacagcggcg taaagctgat gggctttggc caccgcgttt acaagaactt
     1141 cgacccacgt gcgaagatca tgcagcagac ctgccacgaa gtgctgacag aacttggcat
     1201 taaggatgat ccgctgctgg atctggcggt tgagctggaa aagattgctc tgagcgatga
     1261 ttacttcgtg cagcgcaaac tttacccgaa tgtggatttc tactctggca tcattctcaa
     1321 ggccatgggc atccccacca gtatgtttac tgtgctgttt gccgtagccc gcaccaccgg
     1381 ctgggtgagc cagtggaagg aaatgattga agaaccgggc cagcgtatca gccgccctcg
     1441 ccagctttat attggcgcac cgcagcgtga ctatgtgccg cttgccaaac gctaaaacag
     1501 actaacccaa aaagccgact tcccgtaagg aaagtcggct ttttgtttgc acgctgtttc
     1561 caaaaaaata gggcggcaga gcgaataaac gctacctagc cttcaggcat aaaaaaacgc
     1621 atgc
//
LOCUS       BOVBADPTA     708 bp ss-mRNA            MAM       12-JUL-1990
DEFINITION  Cow beta adaptin mRNA, partial cds.
ACCESSION   M34177 J05273
KEYWORDS    beta adaptin.
SOURCE      Cow brain, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 708)
  AUTHORS   Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and
            Parham,P.
  TITLE     Conservation and diversity in families of coated vesicle adaptins
  JOURNAL   J. Biol. Chem. 265, 4814-4820 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   708     beta adaptin (AA at 1)
BASE COUNT      198 a    159 c    175 g    176 t
ORIGIN      
        1 gctgtgaaga aagtgattgc tgctatgact gtggggaaag acgttagctc tctctttcca
       61 gatgtagtga actgtatgca gacggataat ctggaactga agaagcttgt gtatctctac
      121 ttgatgaact atgccaagag tcagccagac atggccatca tggctgtcaa cagctttgtg
      181 aaggattgtg aagatcccaa tcctctgatt cgagctttgg cagtcagaac catggggtgc
      241 atccgggtgg acaagataac agagtatctc tgtgagcccc tccgcaagtg cttaaaggat
      301 gaagatccct acgtccggaa gacagcagca gtctgcgtgg caaaactcca tgacatcaat
      361 gcccagatgg tggaagatca gggatttctg gattctctgc gggatctcat agcagattca
      421 aatccaatgg tggtggctaa tgctgtagca gcactatctg aaatcagtga atctcacccc
      481 aacagcaact tactcgatct gaatccacag aacattaata agctactgac agccctgaat
      541 gagtgcaccg aatggggcca gattttcatc ctggactgct tatctaatta caatcctaaa
      601 gatgaccggg aggctcagag catctgtgag cgggtaactc cccggttatc tcatgccaac
      661 tcagcagtgg tgctttcagc agtaaaagtc ctaatgaaat ttttggaa
//
LOCUS       BSUSENSA     1773 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  B.subtilis transcription regulatory protein (senS) gene, complete
            cds.
ACCESSION   M34826 M30611
KEYWORDS    transcription regulatory protein.
SOURCE      B.subtilis (strain DB2) DNA, clone pWL[77,80].
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1773)
  AUTHORS   Wang,L.-F. and Doi,R.H.
  TITLE     Complex character of senS, a novel gene regulating expression of
            extracellular-protein genes of Bacillus subtilis
  JOURNAL   J. Bacteriol. 172, 1939-1947 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.H.Doi, 11-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept       1486     1683     transcription regulatory protein senS
    binding    1470     1477     ribosomal binding site (put.)
    site       1455     1474     transcription termination signal
    site       1671     1698     rho-independent transcription terminator
BASE COUNT      460 a    397 c    388 g    528 t
ORIGIN      
        1 agttcttgga aattctgatt ttcgatatct ggcgaattta cgtagtctcc catcgtttct
       61 ttcgaaaggg acgttctcag cccctcaatc cagcggacat tttgtctttt ttctccaggg
      121 gatgtccagt ttgttaagta ttcctgggcg atgattgcgt cacgataata aaatgccgtt
      181 tggtcgggag cgacccgtcc ggctgccccg ccgagtgctt gctgccagac actggcgttt
      241 tgattcggag cgtgctctaa aaagtgtttt attgttgaga tcgcacgttc tgataatggc
      301 ttttcaatga aagagccgga gcgtttcatt ttttgaggct gattgcctcc cgggctgtta
      361 aaaaaggtta ccgcttcaat gaatggcgtt gtttttacca ttccgcttga cggacttcct
      421 gctttcaata aaggctttaa cagttttttt aactctgttt ttggcccgac aaattggccg
      481 agggcttcta tgcggtttac ttctttaggc caaaactcta ttgatgatgt aagccggtca
      541 tctgtatacg gggcccagtt ctgccacgtg ttatatactt cctcaaaatc atcccatccc
      601 catgtaatag aaaaaatcga cacttgagag atgggcactg ctttaaatgt catggaggtg
      661 actatgccga aattgcctcc tccgcctccc tgagacgccc aaaatgtgga tgatttgaac
      721 agctgactgt aatcagatca gcgccctctt tttcgtctgc tacgatcatc tcaagctgca
      781 cgaggctgtc gcaagtaaga ccggcagccc ttgttaaaag tccaattccc cctccgagag
      841 ttaaacctgt gagccctaca ttagcaatgg tgcctgcggg aagcgtcagg ccgtattgcc
      901 agagtgtccg atagacttct cccaattcag cccccgcttc aatataggcc agctttttat
      961 cctgattcac agttattttt ttcatctcgc ttaaatcaat aacaagaccg ttatttaaaa
     1021 gggaaaagtt ctcatagctg tgtctgccgc ctctaatacg gaaaggcaca cggttttcac
     1081 gcgcccattt cagcgcattg agtgcatcct gtttgttttg gcaaaacaca atgatgtcag
     1141 atcctttcta agcttaggtt aatattggtt cttgcttcgt tatagtccgg atcatcccgt
     1201 gtcacgatac gtccggtcaa ttttgtcttt tccacactcc cacatctctt tctctcgtat
     1261 tctagtttct ctagcttatg cgtcagggga aaagagtgta taaggaaaaa gcggggatgc
     1321 aatctgatac agtgtcaaca ccctcaaaaa atagttgaca ggtcggtatt gtatgaatta
     1381 acatggtcag tacaaatttt tcaaatttat cgcgctgatc ggaacaccga aggctcttat
     1441 cgtttagata agggcctttt ttgtatgaaa aaggggggat tattgatggg agtcaaaaaa
     1501 gaaaagggga gaaaacgatt caggaagcga aaaacctacg ggaatcagat tttgccgctt
     1561 gagctgctga ttgaaaaaaa caaacgagag attataaaca gcgcggaact catggaagaa
     1621 atttatatga agattgatga gaagcatacg caatgtgtaa ctaaatataa aaaaacccgc
     1681 tgactacaac gggtttttgc atttctccat taagaatctt ttttaatcgg caatccaagg
     1741 ccttctgcca cgcgttttcc gtattcagga tcc
//
LOCUS       CHITDNA       176 bp ds-DNA             INV       12-JUL-1990
DEFINITION  C.thummi telomeric DNA.
ACCESSION   M33211
KEYWORDS    telomeric DNA.
SOURCE      C.thummi heat-shocked larvae, cDNA to mRNA, clone lambda-Cth5.
  ORGANISM  Chironomus thummi
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Chironomidae.
REFERENCE   1  (bases 1 to 176)
  AUTHORS   Carmona,M.J., Morcillo,G., Galler,R., Martinez-Salas,E.,
            de la Campa,A.G., Diez,J.L. and Edstroem,J.E.
  TITLE     Cloning and molecular characterization of a telomeric sequence from
            a temperature-induced Balbiani ring
  JOURNAL   Chromosoma 92, 108-115 (1985)
  STANDARD  simple staff_entry
BASE COUNT       63 a     34 c     27 g     52 t
ORIGIN      Chromosome III.
        1 aattctagaa aaatcgagtt ttttcgaaaa catgaaaatt ttttttctct catcctagaa
       61 caagtgtttt agacctcaaa acagatgtga acataaaagt gatgtattga caaaagttgc
      121 tccaaactga gatgcatcca acgtgatatc gatatcccat gtacccccct atggaa
//
LOCUS       ECOSUHBA     1017 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  E.coli extragenic suppressor (suhB) gene, complete cds.
ACCESSION   M34828
KEYWORDS    extragenic suppressor; suhB gene.
SOURCE      E.coli DNA, clone pRY61.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1017)
  AUTHORS   Yano,R., Nagai,H., Shiba,K. and Yura,T.
  TITLE     A mutation that enhances synthesis of sigma-32 and suppresses
            temperature-sensitive growth of the rpoH15 mutant of Escherichia
            coli
  JOURNAL   J. Bacteriol. 172, 2124-2130 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        194      997     suhB protein
    signal      122      127     -35 region
    signal      145      150     -10 region
BASE COUNT      244 a    278 c    259 g    236 t
ORIGIN      55 min on K12 map.
        1 catggcacgg gcaacagaac ccatattgcc ggtgtgtgac gtctccacca gcacaattcg
       61 aatattttgc agcattgtct ttcttcatct aaagattatt cacgcatctt atcataaaac
      121 gaagacagat gccgatctcg ctgctatact ctgcgccgtt ttcccgttct ttaacatcca
      181 gtgagagaga ccgatgcatc cgatgctgaa catcgccgtg cgcgcagcgc gcaaggcggg
      241 taatttaatt gccaaaaact atgaaacccc ggacgctgta gaagcgagcc agaaaggcag
      301 taacgatttc gtgaccaacg tagataaagc tgccgaagcg gtgattatcg acacgattcg
      361 taaatcttac ccacagcaca ccatcatcac cgaagaaagc ggtgaacttg aaggtactga
      421 tcaggatgtt caatgggtta tcgatccact ggatggcact accaacttta tcaaacgtct
      481 gccgcacttc gcggtatcta tcgctgttcg tatcaaaggc cgcaccgaag ttgctgtggt
      541 atacgatcct atgcgtaacg aactgttcac cgccactcgc ggtcagggcg cacagctgaa
      601 cggctaccga ctgctcggca gcaccgctcg cgatctcgac ggtactattc tggcgaccgg
      661 cttcccgttc aaagcaaaac agtacgccac tacctacatc aacatcgtcg gcaaactgtt
      721 caacgaatgt gcagacttcc gtcgtaccgg ttctgcggcg ctggatctgg cttacgtcgc
      781 tgcgggtcgt gttgacggtt tctttgaaat cggtctgcgc ccgtgggact tcgccgcagg
      841 cgagctgctg gttcgtgaag cgggcggcat cgtcagcgac ttcaccggtg gtcataacta
      901 catgctgacc ggtaacatcg ttgctggtaa cccgcgcgtt gttaaagcca tgctggcgaa
      961 catgcgtgac gagttaagcg acgctctgaa gcgttaatga ctcaggcggg tgatatc
//
LOCUS       HUMBADPTA    5701 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human beta adaptin mRNA, complete cds.
ACCESSION   M34175 J05273
KEYWORDS    beta adaptin.
SOURCE      Human fibroblast, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 5701)
  AUTHORS   Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and
            Parham,P.
  TITLE     Conservation and diversity in families of coated vesicle adaptins
  JOURNAL   J. Biol. Chem. 265, 4814-4820 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        178     2991     beta adaptin
    mRNA    <     1     5701     beta adaptin mRNA
    signal     5683     5688     polyA signal
BASE COUNT     1528 a   1373 c   1284 g   1516 t
ORIGIN      
        1 ctgcccacca tctttgtccc tggcaaagtg ggttttgcgc agtggcttag acctagaaaa
       61 gaatcgtgac gggcaggaaa ccattacacc accacctggg ctgtgctctc cggctcccgc
      121 cgccaccccc gccctcgcct tcgcctccgc tccggtgcac attaaagatc caaagtcatg
      181 actgactcca agtatttcac aaccaataaa aaaggagaaa tatttgaact aaaagctgaa
      241 ctcaacaatg aaaagaaaga aaagagaaag gaggctgtga agaaagtgat tgctgctatg
      301 accgtgggga aggatgttag ttctctcttt ccagacgtag tgaactgtat gcagactgac
      361 aatctggaac taaagaagct tgtgtatctc tacttgatga actacgccaa gagtcagcca
      421 gacatggcca tcatggctgt aaacagcttt gtgaaggact gtgaagatcc taatcctttg
      481 attcgagcct tggcagtcag aaccatgggg tgcatccggg tagacaaaat tacagaatat
      541 ctctgtgagc cgctccgcaa gtgcttgaag gatgaggatc cctatgttcg gaaaacagca
      601 gcagtctgcg tggcaaaact ccatgatatc aatgcccaaa tggtggaaga tcagggattt
      661 ctggattctc tacgggatct catagcagat tcaaatccaa tggtggtggc taatgccgta
      721 gcggcattat ctgaaatcag tgagtctcac ccaaacagca acttacttga tctgaaccca
      781 cagaacatta ataagctgct gacagccctg aatgaatgca ctgaatgggg ccagattttc
      841 atcctggact gcctgtctaa ttacaaccct aaagatgatc gggaggctca gagcatctgt
      901 gagcgggtaa ctccccggct atcccatgcc aactcagcag tggtgctttc agcggtaaaa
      961 gtcctaatga agtttctaga attgttacct aaggattctg actactacaa tatgctgctg
     1021 aagaagttag cccctccact tgtcactttg ctgtctgggg agccagaagt gcagtatgtc
     1081 gccctgagga acatcaactt aattgtccag aaaaggcctg aaatcttgaa gcaggaaatc
     1141 aaagtcttct ttgtgaagta caatgatccc atctatgtta aactagagaa gttggacatc
     1201 atgattcgtt tggcatctca agccaacatt gctcaggttc tggcagaact gaaagaatat
     1261 gctacagagg tggatgttga ctttgttcga aaagctgtgc gggccattgg acggtgtgcc
     1321 atcaaggtgg agcaatctgc agagcgctgt gtaagcacat tgcttgatct aatccagacc
     1381 aaagtgaatt atgtggtcca agaagcaatt gttgtcatca gggacatctt ccgcaaatac
     1441 cccaacaagt atgaaagtat catcgccact ctgtgtgaga acttagactc gctggatgag
     1501 ccagatgctc gagcagctat gatttggatt gtgggagaat atgctgaaag aattgacaat
     1561 gcagatgagt tactagaaag cttcctggag ggttttcacg atgaaagcac ccaggtgcag
     1621 ctcactctgc ttactgccat agtgaagctg tttctcaaga aaccatcaga aacacaggag
     1681 ctagtccagc aggtcttgag tttggcaaca caggattctg ataatcctga ccttcgagac
     1741 cggggctata tttattggcg ccttctctca actgaccctg ttacagctaa agaagtagtc
     1801 ttgtctgaga agccactgat ctctgaggag acggacctta ttgagccaac tctgctggat
     1861 gagctaatct gccacattgg ttctttggcc tctgtgtatc ataagcctcc caatgctttt
     1921 gtggaaggaa gtcatggaat tcatcgtaaa cacttgccaa ttcatcatgg gagcactgat
     1981 gcaggtgaca gccctgttgg cactaccact gcaacgaacc tggaacagcc tcaggttatc
     2041 ccctctcaag gtgatcttct aggggatctt ttaaaccttg acctcggtcc cccagtcaat
     2101 gtgccacagg tgtcctccat gcagatggga gcagtggatc tcctaggagg aggactagat
     2161 agtctggtgg gacaatcctt catcccatca tcggtgcctg caacctttgc tccttcacct
     2221 acacctgctg tggtcagcag tggactgaat gacctgtttg aactctccac agggataggc
     2281 atggcacctg gtggatatgt ggctcctaag gctgtctggc tacctgcagt aaaggctaaa
     2341 ggcttggaga tttccggaac atttactcac cgccaagggc acatctatat ggaaatgaac
     2401 ttcaccaata aagctctgca gcacatgaca gattttgcaa tccagtttaa caaaaatagc
     2461 tttggtgtca tccccagcac tcctctggcc atccatacac cactgatgcc aaaccagagc
     2521 attgatgtct ccctgcctct caataccttg ggcccagtca tgaagatgga acctctgaat
     2581 aacctccagg tggctgtgaa aaacaatatc gatgtcttct acttcagctg cctcatccca
     2641 ctcaatgtgc tttttgtaga agatggcaaa atggagcgcc aggtcttcct tgcaacatgg
     2701 aaggatattc ccaatgaaaa tgaacttcag tttcagatta aggaatgtca tttaaatgct
     2761 gacactgttt ccagcaagtt gcaaaacaac aatgtttata ctattgccaa gaggaatgtg
     2821 gaagggcagg acatgctgta ccaatccctg aagctcacta atggcatttg gattttggcc
     2881 gaactacgta tccagccagg aaaccccaat tacacgctgt cactgaagtg tagagctcct
     2941 gaagtctctc aatacatcta tcaggtctac gacagcattt tgaaaaacta acaagactgg
     3001 tccagtaccc ttcaaccatg ctgtgatcgg tgcaagtcaa gaactcttaa ctggaagaaa
     3061 ttgtattgct gcgtagaatc tgaacacact gaggccacct agcaaggtag taactagtct
     3121 aacctgtgct aacattaggg cacaacctgt tggatagttt tagcttcctg tgaacatttg
     3181 taaccactgc ttcagtcacc tcccacctct tgccacctgc tgctgctatc tgtccttact
     3241 tgtgggcttc tccatgctgt gccaatggct ggctttttct acaccctctt ttgagtgtag
     3301 tttggtattt tgtaattgag agctcatttc aaaagcagaa aaagacaaca aatattaaag
     3361 caaggaaaag tgtaactgaa acactgcact ttactgtttt atacttttgt acatatgaga
     3421 aatcaaggga ttagtgcaac cagtagaagg cattgaaatg actgtcatta accacacagt
     3481 cctggaggca gagatgcagt tacctaccct agcttttgat gggttctctt acctgtagta
     3541 gccttatccc tggtcatttg gattttcagt ttgctttttt ctttttttcc cctccaaact
     3601 ccttttcctt ggccaagcct tcatgcttcc ccctttccat attataatct catttgattg
     3661 ctctgcagtt gggaacggtg atcttcttga atgatgtttc agtgtgcaaa aactatagag
     3721 cctgtcagca ccaaagctga cagaagttat accttactcc tttcctttcc cctgaacaaa
     3781 cctgctaatc ccactaattc aggaatttga gtagagatgg ggaacaagaa cccagatgct
     3841 gtcccctcac cccctctcct gtatttctca ggtccagttc aaatctaaaa ttctactttt
     3901 agagttgaaa cagagtaata acttatctaa ccctcttttc ctacaaagga gaaagataaa
     3961 aggcacaaag gttaccgcca aggcccgtca gctgtgtagt ggcaaagccg agaccgagtc
     4021 tcctaagtcc ccgtcagtgt ggttttcacc acaggactgt ctcttgtcgt tttcccctaa
     4081 tgccttctcc tgccttttct gtgcctagtt tttggctctt cacatattcc atattgattt
     4141 tgacgctctg tatattggca tcaggtggca gctgaatatc ttttgaatta ctcgaaggta
     4201 aagccagatg ccagaatgaa ggtgtagcca gtgtttccca tatgcccctg gagccccact
     4261 tattgaggcc agcagaatag gtgcagagat gaagtgagct tagagatgtt gcaaatgctc
     4321 tttatccctt cagctctctg atctgctctt tcttcatgat acttagtctg cagggcatat
     4381 taagatcatc ccagaggttc aggcagttcc tgtcatctct gaaaagactg ggggatatga
     4441 aatcttcccc ctaccccact taatgcgttg gatatgattt ttcaaagaat gcttcatgcc
     4501 caaaatacca gcctgtttag cagtgttaca ctgtttgatc tgcgggcact tgttgcattg
     4561 cctggcaccc aatattcagg gtccatgact aagactggtc ttctcagatg ccctgcttaa
     4621 atcaggggca cttcaggctc cacaggcgtc atgttggact gagacctaac tcactggact
     4681 cagaggagga atcgtggaaa acaagagcaa aactacccca cacccctatt tcatgtctga
     4741 aataaccctg tttcatacca gttgcaaagc ttgtggggag cggtcccaca aagcactttc
     4801 ttaaaccttg agaatctcca agagaaaaat atttggggaa ggagggagga aatatgtccc
     4861 ttgcacacca cccctgaagc acatggcagt aggaaacagc ataggattgt atgtgggagg
     4921 tggataggtc ggtgatgtgt ggagcggaaa agcaggttgg taaagttccc ttcttgggac
     4981 ttattcctgg agtcagtgga tacaagtagt gcagaaggtt cacactgcaa atagtgttct
     5041 catctcaaag caaactatca ttccagaagg aaaagtgtgt cagggcaagc agacaacaca
     5101 atttcctatc agaatatgtc cctcaacccc cgaaacaagg cttctctcag cctccccacc
     5161 agtgatggat aacagctcct attctcagct gacctgactg agccaaccca tgaactcttc
     5221 actccttggg gaagccacct cccatcacac ccctgagcag agttagggag gaattctact
     5281 tcccataaaa ggacctctcc tgagaggcaa aacctgttgc ctccaccacg gcttccctct
     5341 tggctcattc caagcttggc caaattgggg aagtgggatg gaggttgccc tgcatccccc
     5401 ctcctctgcc tgagtgtgtc tttgtaatgt cagctggcat catacaaaga gcaggagaag
     5461 caaacaccca gaactctttt gctggtcaga gattccctga gtgtctgtcc tcacccaagc
     5521 ctgctctgtg tctgtgttgt gaagcttgag actctggaaa gaaatgggga gggggggcag
     5581 gggaaatgtt gccctaagaa tgcttctcat tcctctgttc ttattgggtc ctgtttttcg
     5641 ggagggtggg ggttggggga agcttgacct tgtgtcttcg tcaataaact cacatttaca
     5701 c
//
LOCUS       HUMCD59A     1671 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human lymphocytic antigen CD59/MEM43 mRNA, complete cds.
ACCESSION   M34671 X15861
KEYWORDS    CD59 antigen; cell surface antigen; integral membrane protein.
SOURCE      Human peripheral blood monocyte, cDNA to mRNA, clone R18..
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 108 to 443)
  AUTHORS   Sawada,R., Ohashi,K., Okano,K., Hattori,M., Minato,N. and Naruto,M.
  TITLE     Complementary DNA sequence and deduced peptide sequence for
            CD59/MEM43 antigen, the human homologue of murine lymphocyte
            antigen Ly-6c
  JOURNAL   Nucleic Acids Res. 17, 6728-6728 (1989)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 1671)
  AUTHORS   Sawada,R., Ohashi,K., Anaguchi,H., Okazaki,H., Hattori,M.,
            Minato,N. and Naruto,M.
  TITLE     Isolation and expression of the full-length cDNA encoding CD59
            antigen of human lymphocytes
  JOURNAL   DNA 9, 213-220 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer readable copy for sequence [1] kindly
            provided by Naruto,M., 17-JUL-1989.
            
            [1]  Author address:  Naruto,M.
                 Basic Research Laboratories
                 Toray Industries Inc
                 1111 Tebiro Kamakura 248, Japan.
FEATURES       from  to/span     description
    pept         30      416     antigen CD59 precursor (CD59)
                                 /hgml_locus_uid="LY0169B" /nomgen="CD59"
                                 /map="11pter-p13"
    sigp         30      104     CD59 signal peptide
    matp        105      413     CD59 protein
    mRNA    <     1     1671     CD59 mRNA
    signal      527      532     polyA signal
BASE COUNT      434 a    347 c    390 g    500 t
ORIGIN      
        1 ggcgccgcca ggttctgtgg acaatcacaa tgggaatcca aggagggtct gtcctgttcg
       61 ggctgctgct cgtcctggct gtcttctgcc attcaggtca tagcctgcag tgctacaact
      121 gtcctaaccc aactgctgac tgcaaaacag ccgtcaattg ttcatctgat tttgatgcgt
      181 gtctcattac caaagctggg ttacaagtgt ataacaagtg ttggaagttt gagcattgca
      241 atttcaacga cgtcacaacc cgcttgaggg aaaatgagct aacgtactac tgctgcaaga
      301 aggacctgtg taactttaac gaacagcttg aaaatggtgg gacatcctta tcagagaaaa
      361 cagttcttct gctggtgact ccatttctgg cagcagcctg gagccttcat ccctaagtca
      421 acaccaggag agcttctccc aaactccccg ttcctgcgta gtccgctttc tcttgctgcc
      481 acattctaaa ggcttgatat tttccaaatg gatcctgttg ggaaagaata aaattagctt
      541 gagcaacctg gctaagatag aggggctctg ggagactttg aagaccagtc ctgtttgcag
      601 ggaagcccca cttgaaggaa gaagtctaag agtgaagtag gtgtgacttg aactagattg
      661 catgcttcct cctttgctct tgggaagacc agctttgcag tgacagcttg agtgggttct
      721 ctgcagccct cagattattt ttcctctggc tccttggatg tagtcagtta gcatcattag
      781 tacatctttg gagggtgggg caggagtata tgagcatcct ctctcacatg gaacgctttc
      841 ataaacttca gggatcccgt gttgccatgg aggcatgcca aatgttccat atgtgggtgt
      901 cagtcaggga caacaagatc cttaatgcag agctagagga cttctggcag ggaagtgggg
      961 aagtgttcca gatagcaggg catgaaaact tagagaggta caagtggctg aaaatcgagt
     1021 ttttcctctg tctttaaatt ttatatgggc tttgttatct tccactggaa aagtgtaata
     1081 gcatacatca atggtgtgtt aaagctattt ccttgccttt ttttattgga atggtaggat
     1141 atcttggctt tgccacacac agttacagag tgaacactct actacatgtg actggcagta
     1201 ttaagtgtgc ttattttaaa tgttactggt agaaaggcag ttcaggtatg tgtgtatata
     1261 gtatgaatgc agtggggaca ccctttgtgg ttacagtttg agacttccaa aggtcatcct
     1321 taataacaac agatctgcag gggtatgttt taccatctgc atccagcctc ctgctaactc
     1381 ctagctgact cagcatagat tgtataaaat acctttgtaa cggctcttag cacactcaca
     1441 gatgtttgag gctttcagaa gctcttctaa aaaatgatac acacctttca caagggcaaa
     1501 ctttttcctt ttccctgtgt attctagtga atgaatctca agattcagta gacctaatga
     1561 catttgtatt ttatgatctt ggctgtattt aatggcatag gctgactttt gcagatggag
     1621 gaatttcttg attaatgttg aaaaaaaacc cttgattata ctctgttgga c
//
LOCUS       HUMKER19PA   1586 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human keratin K19 pseudogene.
ACCESSION   M33101
KEYWORDS    keratin K19; pseudogene.
SOURCE      Human, cDNA to mRNA, clone IF7.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1586)
  AUTHORS   Savtchenko,E.S., Schiff,T.A., Jiang,C.-K., Freedberg,I.M. and
            Blumenberg,M.
  TITLE     Embryonic expression of the human 40-kD keratin: Evidence from a
            processed pseudogene sequence
  JOURNAL   Am. J. Hum. Genet. 43, 630-637 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept.ps     120     1310     keratin 19 pseudogene
    signal     1412     1418     polyA signal
BASE COUNT      377 a    405 c    470 g    334 t
ORIGIN      
        1 attgataaac atataatctg atatttatgt aaagtagcta ttttttaaaa aaagtatggc
       61 tcctccctcg aatcgcagcc tctgggacca gggtcgctcc atccgtcgtc cgcctcgcca
      121 tgacttccta cacgtatcgc cagtcgtagg ccaagtagtc cttctggggc ctgggtggtg
      181 gctccgtgag ttttgtggca gaggttgcct ttcgcgcgct cagcatgcac tgggcctctg
      241 gaggccgcgg cgtgtccgtg tcctccgccc gcttcgtgtc tgtcctcgtc ctccttgggg
      301 ggctacggcg gcgtcttggc cgtgtcctac gggctgctgg cgggcaacga gaagctcaat
      361 atgcagaacc tcagcgaccc tctggcctcc tacctggaca aggtgggcgc cctggaggac
      421 gccaacggca aactggaggt gaagatccgc gactggtacc agaagcaggg gcccgggcct
      481 cccgtgacta cagccactct acaagactat ccaggacctg cggtacaaga ttcttggtgc
      541 caccattgag aactccagga ttgtcctgga gatcgacaac gcccgtctgg ctgcagatga
      601 cttccgaacc aagagtgaga cggagcaggc tctgcgcatg agcggaggcc gacatcaacg
      661 gcctgcgcag ggtgctggac gagctgaccc tggccattac cgacctggag atgcagatct
      721 aaggcctgaa ggaagagctg gcctacctga agaagaacca tgagaaggaa atcagtgggc
      781 tgaggggcca agtgggaggc caggtcagtg gggaggtgga ttcggctcag ggcacctatc
      841 tcgccaagat cctgagttac atgcgaacgc aatacgaggt catggcggac aacaactgga
      901 aggatgctga agcctggttc accagccgga ctgaagaatt gaaccgggag gtcgctggcc
      961 acacagatca gctccagatg agccggtcca aggtcgctga cctgcggcgc accctccagg
     1021 gtcttgagct ggagctgcag tcacggctga gcatgaaagc cgccttggaa gccacactgg
     1081 cagaaacgga ggcgcgcttt ggagtccact tggcgcagat ccagccgctg atcaactgta
     1141 ttgaagccca gctgggcgat gtgcgagctg atagtgagcg gcagaatcag gattaacagc
     1201 agttcatgga catcaagtcg cggctggagc aggagatctc cacctaccgc agcctgctcg
     1261 agggccagaa ggatcactac aacaacctgt ccgcctccaa ggtcctctga ggcagcaggc
     1321 taaggggctt ctactgtcct ttggagggtg tctcctgggt agggggatgg gaaggaaggg
     1381 acccttaccc cctgctcttc ccctgatctg ccaataaaat tttatggtcc aaggggaaaa
     1441 aaaaaaaaaa aaaaaatata tatatatata tatatatata tatatatgtg tgtgtgtgtg
     1501 tgtgtgtgtg tgtatatata cgtgtgtgtg tatatatata tatatgaaaa acaatacatg
     1561 ctcgttgtag aaatgtggaa acatgg
//
LOCUS       HUMLOX15A    2671 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human 15-lipoxygenase mRNA, complete cds.
ACCESSION   M23892
KEYWORDS    15-lipoxygenase.
SOURCE      Human reticulocyte, cDNA to mRNA, clone 15LOX.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2671)
  AUTHORS   Sigal,E., Craik,C.S., Highland,E., Grunberger,D., Costello,L.L.,
            Dixon,R.A.F. and Nadel,J.A.
  TITLE     Molecular cloning and primary structure of human 15-lipoxygenase
  JOURNAL   Biochem. Biophys. Res. Commun. 157, 457-464 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          4     1992     15-lipoxygenase
    mRNA    <     1     2671     15-lipoxygenase mRNA
BASE COUNT      580 a    743 c    718 g    630 t
ORIGIN      
        1 aagatgggtc tctaccgcat ccgcgtgtcc actggggcct cgctctatgc cggttccaac
       61 aaccaggtgc agctgtggct ggtcggccag cacggggagg cggcgctcgg gaagcgactg
      121 tggcccgcac ggggcaagga gacagaactc aaggtggaag taccggagta tctggggccg
      181 ctgctgtttg tgaaactgcg caaacggcac ctccttaagg acgacgcctg gttctgcaac
      241 tggatctctg tgcagggccc cggagccggg gacgaggtca ggttcccttg ttaccgctgg
      301 gtggagggca acggcgtcct gagcctgcct gaaggcaccg gccgcactgt gggcgaggac
      361 cctcagggcc tgttccagaa acaccgggaa gaagagctgg aagagagaag gaagttgtac
      421 cggtggggaa actggaagga cgggttaatt ctgaatatgg ctggggccaa actatatgac
      481 ctccctgtgg atgagcgatt tctggaagac aagagagttg actttgaggt ttcgctggcc
      541 aaggggctgg ccgacctcgc tatcaaagac tctctaaatg ttctgacttg ctggaaggat
      601 ctagatgact tcaaccggat tttctggtgt ggtcagagca agctggctga gcgcgtgcgg
      661 gactcctgga aggaagatgc cttatttggg taccagtttc ttaatggcgc caaccccgtg
      721 gtgctgaggc gctctgctca ccttcctgct cgcctagtgt tccctccagg catggaggaa
      781 ctgcaggccc agctggagaa ggagctggag ggaggcacac tgttcgaagc tgacttctcc
      841 ctgctggatg ggatcaaggc caacgtcatt ctctgtagcc agcagcacct ggctgcccct
      901 ctagtcatgc tgaaattgca gcctgatggg aaactcttgc ccatggtcat ccagctccag
      961 ctgccccgca caggatcccc accacctccc cttttcttgc ctacggatcc cccaatggcc
     1021 tggcttctgg ccaaatgctg ggtgcgcagc tctgacttcc agctccatga gctgcagtct
     1081 catcttctga ggggacactt gatggctgag gtcattgttg tggccaccat gaggtgcctg
     1141 ccgtcgatac atcctatctt caagcttata attccccacc tgcgatacac cctggaaatt
     1201 aacgtccggg ccaggactgg gctggtctct gacatgggaa ttttcgacca gataatgagc
     1261 actggtgggg gaggccacgt gcagctgctc aagcaagctg gagccttcct aacctacagc
     1321 tccttctgtc cccctgatga cttggccgac cgggggctcc tgggagtgaa gtcttccttc
     1381 tatgcccaag atgcgctgcg gctctgggaa atcatctatc ggtatgtgga aggaatcgtg
     1441 agtctccact ataagacaga cgtggctgtg aaagacgacc cagagctgca gacctggtgt
     1501 cgagagatca ctgaaatcgg gctgcaaggg gcccaggacc gagggtttcc tgtctcttta
     1561 caggctcggg accaggtttg ccactttgtc accatgtgta tcttcacctg caccggccaa
     1621 cacgcctctg tgcacctggg ccagctggac tggtactctt gggtgcctaa tgcaccctgc
     1681 acgatgcggc tgcccccgcc aaccaccaag gatgcaacgc tggagacagt gatggcgaca
     1741 ctgcccaact tccaccaggc ttctctccag atgtccatca cttggcagct gggcagacgc
     1801 cagcccgtta tggtggctgt gggccagcat gaggaggagt atttttcggg ccctgagcct
     1861 aaggctgtgc tgaagaagtt cagggaggag ctggctgccc tggataagga aattgagatc
     1921 cggaatgcaa agctggacat gccctacgag tacctgcggc ccagcgtggt ggaaaacagt
     1981 gtggccatct aagcgtcgcc accctttggt tatttcagcc cccatcaccc aagccacaag
     2041 ctgacccctt cgtggttata gccctgccct cccaagtccc accctcttcc catgtcccac
     2101 cctccctaga ggggcacctt ttcatggtct ctgcacccag tgaacacatt ttactctaga
     2161 ggcatcacct gggaccttac tcctctttcc ttccttcctc ctttcctatc ttccttcctc
     2221 tctctcttcc tctttcttca ttcagatcta tatggcaaat agccacaatt atataaatca
     2281 tttcaagact agaatagggg gatataatac atattactcc acacctttta tgaatcaaat
     2341 atgatttttt tgttgttgtt aagacagagt ctcactttga cacccaggct ggagtgcagt
     2401 ggtgccatca ccacggctca ctgcagcctc agcgtcctgg gctcaaatga tcctcccacc
     2461 tcagcctcct gagtagctgg gactacaggc tcatgccatc atgcccagct aatatttttt
     2521 tattttcgtg gagacggggc ctcactatgt tgcctaggct ggaaatagga ttttgaaccc
     2581 aaattgagtt taacaataat aaaaagttgt tttacgctaa agatggaaaa gaactaggac
     2641 tgaactattt taaataaaat attggcaaaa g
//
LOCUS       MUSBPGALA     334 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Mouse beta-galactoside-binding lectin (L-14.5) mRNA, 5' end.
ACCESSION   M33214
KEYWORDS    beta-galactoside-binding lectin.
SOURCE      Mouse (strain C57BL/6) 12 day old embryo melanoma cell line
            UV-2237-IP, cDNA to mRNA, clone L3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 334)
  AUTHORS   Raz,A., Carmi,P. and Pazerini,G.
  TITLE     Expression of two different endogenous galactoside-binding lectins
            sharing sequence homology
  JOURNAL   Cancer Res. 48, 645-649 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         16  >   334     14 kDa beta-galactoside-binding lectin
                                 (L-14.5)
BASE COUNT       86 a     94 c     91 g     63 t
ORIGIN      
        1 gaattgggta caatcatggc ctgtggtctg gtggatcagc aagctgaatc tcaaactggg
       61 gcaatgtctc aaagttcggg gcagaggtgg acctcggacg acaggagctt tgtgctgacc
      121 ctgggaaaag acagcaacaa ccgttgccta cacttcaatc ctcgcttcaa tgcccatgga
      181 gacgccaaca ccattctgtg taacaccaag gaagatggga cctggggaac cgaacaccgg
      241 gaacctgcct tccccttcca gcccgggagc atcacagagt gtgcatgcac ctttgaccag
      301 gctgacctga ccatgcaagc tgccagacgg acat
//
LOCUS       MUSBPGALB     621 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Mouse beta-galactoside-binding lectin (L-34) mRNA, 3' end.
ACCESSION   M33215
KEYWORDS    beta-galactoside-binding lectin.
SOURCE      Mouse (strain C57BL/6) 12 day old embryo melanoma cell line
            UV-2237-IP3, cDNA to mRNA, clone M5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 621)
  AUTHORS   Raz,A., Carmi,P. and Pazerini,G.
  TITLE     Expression of two different endogenous galactoside-binding lectins
            sharing sequence homology
  JOURNAL   Cancer Res. 48, 645-649 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      420     34 kDa beta-galactoside-binding lectin (L-34)
BASE COUNT      164 a    171 c    154 g    132 t
ORIGIN      
        1 cccagggcaa cctggggcac ctggggccat ccccagtgct cctggaggct atcctgctgc
       61 tggcccttat ggtgtccccg ctggaccact gacgtgccct atgacctgcc cttgcctgga
      121 ggagtcatgc cccgcatgct gatcacaatc atgggcacag tgaaacccaa cgcaaacagg
      181 attgttctag atttcaggag agggaatgat gttgccttcc actttaaccc ccgcttcaat
      241 gagaacaaca gaagactcat tgtgtgtaac acgaagcagg acaataactg gggaaaggaa
      301 gaaagacagt cagccttccc ctttgagagt ggaaaaccat tcaaaataca agtcctggtt
      361 gcagctgacc attcaggttg cggtcacgat gctcactact gcagtacaac catcggatga
      421 agaacctccg ggaaatcagc caactggcga tcagtggtga cataaccctg caccagcgct
      481 gaaccagcgc catgatctaa gccagaaggg gcggcaccga aaccggccct gtgtgcctta
      541 ggagtgggaa actttgcatt tctctctcct tatccttctt gtaagacatc catttaataa
      601 agtctcatgc tgagagaaaa g
//
LOCUS       MUSP32A      1510 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Mouse tumor-induced 32 kD protein (p32) mRNA, complete cds.
ACCESSION   M33203
KEYWORDS    tumor-induced protein.
SOURCE      Mouse (strain BALB/c) fibroblast cell line 3T3 A31, cDNA to mRNA,
            clone pMp32S.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1510)
  AUTHORS   Kageyama,H., Hiwasa,T., Tokunaga,K. and Sakiyama,S.
  TITLE     Isolation and characterization of a complementary DNA clone for a
            M-r 32,000 protein which is induced with tumor promoters in BALB/c
            3T3 cells
  JOURNAL   Cancer Res. 48, 4795-4798 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         75      944     32 kD protein (p32)
    mRNA    <     1     1510     p32 mRNA
    signal     1491     1496     poly-A signal
BASE COUNT      352 a    420 c    381 g    357 t
ORIGIN      
        1 ccgcgcagag ccgtctcgag catagcccgg agcctgaatc gagcagaacc agcctgaact
       61 agcccagtcc ggtgatggag cgtccacagc ccgacagcat gccccaggat ttgtctgagg
      121 ccttgaagga ggccaccaag gaggtacaca tccaagccga gaatgctgag ttcatgaaga
      181 actttcagaa gggtcaggtg tccagagaag gctttaagct ggtgatggct tccttgtacc
      241 atatctacac ggccctggaa gaggagatag agcgcaacaa gcagaaccca gtctatgccc
      301 cactctactt ccctgaggag ctgcaccgaa gggctgccct ggagcaggac atggccttct
      361 ggtatgggcc tcactggcag gaaatcatcc cttgcacgcc agccacacag cactatgtaa
      421 agcgtctcca cgaggtgggg cgcactcacc ctgagctgct ggtggcccac gcatataccc
      481 gctacctggg tgacctctca gggggtcagg tcctgaagaa gattgcacag aaggccatgg
      541 ccttgcccag ctctggggag ggcctggctt tttttacctt cccgaacatc gacagcccca
      601 ccaagttcaa acagctctat cgtgctcgaa tgaacactct ggagatgaca cctgaggtca
      661 agcacagggt gacagaagag gctaagaccg ccttcctgct caacattgag ctgtttgagg
      721 agctgcaggt gatgctgaca gaggaacaca aagaccagag tccctcacag atggcgtcac
      781 ttcgtcagag gcctgctagc ctggtgcaag atactgcccc tgcagagaca ccccgaggga
      841 aaccccagat cagcactagc tcatcccaga caccgctcct ccagtgggtc ctcactctca
      901 gcttcctgtt ggcaacagtg gcagtgggaa tttatgccat gtaaatgcaa tactggcccc
      961 caggggctgt gaactctgtc caatgtggcc ttctctctgt aagggagaat cttgcctggc
     1021 tctcttctct tgggcctcta agaaagcttt tggggtccct agcccactcc ctgtgtttcc
     1081 tttctctctg gaatggaggg agatacctga cacagttccc tcaccaaaag cacatccagc
     1141 cagtggcctg aactttgaaa ccagcagccc caaatcctgc agcagagccc caaaactggc
     1201 ctgtaaaagc agctgttctg agcccagtgc ccatggttgt aagcatccat gttgactgac
     1261 cacgactgct gtcccccagt gccatggcca ctttgatatc cgtttccaga catttctgtc
     1321 tcgtatttct gtcttgtttt ttattatttc cccagttcta ccagagtaat ggtattttgt
     1381 tgttttgttt tgtcttgttt ttcctaacaa agtggggcta tcttttgagg ggtgggtggg
     1441 aaagaattat ttaatagttg taaccttggt ctctaacttc tgtgtgaaat aataaatggc
     1501 attatctaac
//
LOCUS       PASLKTCABD   7742 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  P.haemolytica leukotoxin gene cluster, complete cds.
ACCESSION   M24197 M34943 M34944
KEYWORDS    LktA membrane protein; cytolysin; hemolysin; leukotoxin.
SOURCE      P.haemolytica (strain PHL101) DNA, clones lambda-sh132 and pSH224.
  ORGANISM  Pasteurella haemolytica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 7742)
  AUTHORS   Highlander,S.K., Chidambaram,M., Engler,M.J. and Weinstock,G.M.
  TITLE     DNA sequence of the Pasteurella haemolytica leukotoxin gene cluster
  JOURNAL   DNA 8, 15-28 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 46 to 276 and 3576 to 3813)
  AUTHORS   Highlander,S.K., Engler,M.J. and Weinstock,G.M.
  TITLE     secretion and expression of the Pasteurella haemolytica leukotoxin
  JOURNAL   J. Bacteriol. 172, 2343-2350 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.K.Highlander, 24-APR-1989.
FEATURES       from  to/span     description
    pept        216      719     leukotoxin (LktC)
    pept        735     3596     leukotoxin membrane protein (LktA)
    pept       3670     5796     leukotoxin (LktB)
    pept       5808     7244     leukotoxin (LktD)
    mRNA        186  >  3596     lktCA mRNA
    signal      173      178     -10 region promoter
    binding     205      209     lktC ribosomal binding site (put.)
    binding     722      726     lktA ribosomal binding site (put.)
    binding    3660     3665     lktB ribosomal binding site (put.)
    binding    5797     5802     lktD ribosomal binding site (put.)
BASE COUNT     2573 a   1357 c   1568 g   2244 t
ORIGIN      
        1 taatattaca atgtaattat tttgtttaat ttccctacat tttgtataac tttaaaacac
       61 tcctttttct cttctgatta tataaaagac aaaaaataca atttaagcta caaaaaacaa
      121 caaaaaacaa caaaaaacac gacaataaga tcgagtaatg attatattat gttataattt
      181 ttgacctaat ttagaataat tatcgagtgc aaattatgaa tcaatcttat tttaacttac
      241 taggaaacat tacttggcta tggatgaact cctccctcca caaagaatgg agctgtgaac
      301 tactagcacg caatgtgatt cctgcaattg aaaatgaaca atatatgcta cttatagata
      361 acggtattcc gatcgcttat tgtagttggg cagatttaaa ccttgagact gaggtgaaat
      421 atattaagga tattaattcg ttaacaccag aagaatggca gtctggtgac agacgctgga
      481 ttattgattg ggtagcacca ttcggacatt ctcaattact ttataaaaaa atgtgtcaga
      541 aataccctga tatgatcgtc agatctatac gcttttatcc aaagcagaaa gaattaggca
      601 aaattgccta ctttaaagga ggtaaattag ataaaaaaac agcaaaaaaa cgttttgata
      661 catatcaaga agagctggca acacgactta aaaatgaatt taattttatt aaaaaataga
      721 aggagacatc ccttatggga actagactta caaccctatc aaatgggcta aaaaacactt
      781 taacggcaac caaaagtggc ttacataaag ccggtcaatc attaacccaa gccggcagtt
      841 ctttaaaaac tggggcaaaa aaaattatcc tctatattcc ccaaaattac caatatgata
      901 ctgaacaagg taatggttta caggatttag tcaaagcggc cgaagagttg gggattgagg
      961 tacaaagaga agaacgcaat aatattgcaa cagctcaaac cagtttaggc acgattcaaa
     1021 ccgctattgg cttaactgag cgtggcattg tgttatccgc tccacaaatt gataaattgc
     1081 tacagaaaac taaagcaggc caagcattag gttctgccga aagcattgta caaaatgcaa
     1141 ataaagccaa aactgtatta tctggcattc aatctatttt aggctcagta ttggctggaa
     1201 tggatttaga tgaggcctta cagaataaca gcaaccaaca tgctcttgct aaagctggct
     1261 tggagctaac aaattcatta attgaaaata ttgctaattc agtaaaaaca cttgacgaat
     1321 ttggtgagca aattagtcaa tttggttcaa aactacaaaa tatcaaaggc ttagggactt
     1381 taggagacaa actcaaaaat atcggtggac ttgataaagc tggccttggt ttagatgtta
     1441 tctcagggct attatcgggc gcaacagctg cacttgtact tgcagataaa aatgcttcaa
     1501 cagctaaaaa agtgggtgcg ggttttgaat tggcaaacca agttgttggt aatattacca
     1561 aagccgtttc ttcttacatt ttagcccaac gtgttgcagc aggtttatct tcaactgggc
     1621 ctgtggctgc tttaattgct tctactgttt ctcttgcgat tagcccatta gcatttgccg
     1681 gtattgccga taaatttaat catgcaaaaa gtttagagag ttatgccgaa cgctttaaaa
     1741 aattaggcta tgacggagat aatttattag cagaatatca gcggggaaca gggactattg
     1801 atgcatcggt tactgcaatt aataccgcat tggccgctat tgctggtggt gtgtctgctg
     1861 ctgcagccgg ctcggttatt gcttcaccga ttgccttatt agtatctggg attaccggtg
     1921 taatttctac gattctgcaa tattctaaac aagcaatgtt gagcacgttg caaataaaaa
     1981 ttcataacaa aattgtagaa tgggaaaaaa ataatcacgg taagaactac tttgaaaatg
     2041 gttacgatgc ccgttatctt gcgaatttac aagataatat gaaattctta ctgaacttaa
     2101 acaaagagtt acaggcagaa cgtgtcatcg ctattactca gcagcaatgg gataacaaca
     2161 ttggtgattt agctggtatt agccgtttag gtgaaaaagt ccttagtggt aaagcctatg
     2221 tggatgcgtt tgaagaaggc aaacacatta aagccgataa attagtacag ttggattcgg
     2281 caaacggtat tattgatgtg agtaattcgg gtaaagcgaa aactcagcat atcttattca
     2341 gaacgccatt attgacgccg ggaacagagc atcgtgaacg cgtacaaaca ggtaaatatg
     2401 aatatattac caagctcaat attaaccgtg tagatagctg gaaaattaca gatggtgcag
     2461 caagttctac ctttgattta actaacgttg ttcagcgtat tggtattgaa ttagacaatg
     2521 ctggaaatgt aactaaaacc aaagaaacaa aaattattgc caaacttggt gaaggtgatg
     2581 acaacgtatt tgttggttct ggtacgacgg aaattgatgg cggtgaaggt tacgaccgag
     2641 ttcactatag ccgtggaaac tatggtgctt taactattga tgcaaccaaa gagaccgagc
     2701 aaggtagtta taccgtaaat cgtttcgtag aaaccggtaa agcactacac gaagtgactt
     2761 caacccatac cgcattagtg ggcaaccgtg aagaaaaaat agaatatcgt catagcaata
     2821 accagcacca tgccggttat tacaccaaag ataccttgaa agctgttgaa gaaattatcg
     2881 gtacatcaca taacgatatc tttaaaggta gtaagttcaa tgatgccttt aacggtggtg
     2941 atggtgtcga tactatttac ggtaacgacg gcaatgaccg cttatttggt ggtaaaggcg
     3001 atgatattct cgatggtgga aatggtgatg attttatcga tggcggtaaa ggcaacgacc
     3061 tattacacgg tggcaagggc gatgatattt tcgttcaccg taaaggcgat ggtaatgata
     3121 ttattaccga ttctgacggc aatgataaat tatcattctc tgattcgaac ttaaaagatt
     3181 taacatttga aaaagttaaa cataatcttg tcatcacgaa tagcaaaaaa gagaaagtga
     3241 ccattcaaaa ctggttccga gaggctgatt ttgctaaaga agtgcctaat tataaagcaa
     3301 ctaaagatga gaaaatcgaa gaaatcatcg gtcaaaatgg cgagcggatc acctcaaagc
     3361 aagttgatga tcttatcgca aaaggtaacg gcaaaattac ccaagatgag ctatcaaaag
     3421 ttgttgataa ctatgaattg ctcaaacata gcaaaaatgt gacaaacagc ttagataagt
     3481 taatctcatc tgtaagtgca tttacctcgt ctaatgattc gagaaatgta ttagtggctc
     3541 caacttcaat gttggatcaa agtttatctt ctcttcaatt tgctagagca gcttaatttt
     3601 taatgattgg caactctata ttgtttcaca cattatagat tgccgtttta ttttataaaa
     3661 ggagacaata tggaagctaa ccatcaaagg aatgatcttg gtttagttgc cctcactatg
     3721 ttggcacaat accataatat ttcgcttaat ccggaagaaa taaaacataa atttgatctt
     3781 gacggaaaag ggctttcttt aactgcttgg cttttagctg caaaatcgtt agcgttgaaa
     3841 gcgaaacaca ttaaaaaaga gatttcccgc ttacacttgg tgaatttacc ggcattagtt
     3901 tggcaagata acggtaaaca ttttttattg gtaaaagtgg ataccgataa taaccgctat
     3961 ttaacttaca atttggaaca agatgctcca caaattctgt caacagacga atttgaagcc
     4021 tgctatcaag ggcagttaat tttggtcacg tccagagctt ccgtagtagg tcaattagca
     4081 aagttcgatt tcacctggtt tattccggcg gtgatcaaat accgaaaaat ctttctagaa
     4141 accttgattg tttcgatctt tttgcaaatt tttgccctaa ttacaccgct attcttccaa
     4201 gttgttatgg ataaagtact ggtgcatcga ggtttttcaa ccttgaatat cattacggtt
     4261 gccttagcta ttgtgatcat ctttgaaatt gtactaagtg gtttgagaac ctatgttttt
     4321 tctcatagca ctagccgtat tgatgttgaa ttaggcgcta aattatttcg acatttatta
     4381 tcactaccca tttcttattt tgaaaacaga cgagttggag atacagtcgc tagggttaga
     4441 gaattagatc aaattcgtaa tttccttacc ggacaagcat taacctcggt gttagatctc
     4501 ttattctctt ttatcttttt tgccgtaatg tggtattaca gcccaaaatt aaccttggta
     4561 attcttggtt cattgccctg ctatatttta tggtcaattt ttattagtcc gattttaaga
     4621 cggcgtttag atgagaaatt tgcccgaagt gctgataacc aagcattctt agttgagtcg
     4681 gtaacagcca tcaatatgat taaagcgatg gcggttgctc cacaaatgac ggatacatgg
     4741 gataaacagc tggcaagcta tgttttcatc agtttccgtg tcaccgtatt agcaaccatt
     4801 gggcaacaag gtgtacaact tattcaaaaa accgttatgg tgattaacct ttggttaggg
     4861 gcacacttag ttatttcagg cgatctgagt attgggcaat taattgcctt taatatgcta
     4921 tcagggcaag tgattgcacc ggtgattcgg ctggctcagc tctggcaaga tttccaacaa
     4981 gttgggattt ccgtcactcg cttaggtgat gttttaaact ctccaaccga acaatatcaa
     5041 ggcaaattat cactaccaga aataaaaggc gatatctcat ttaaaaatat ccgctttaga
     5101 tataaaccag atgcaccaac tattttaaat aatgtgaatt tagaaattag gcaaggagaa
     5161 gtgattggga ttgttggacg ttccggttca ggcaaaagta ctctgactaa attactgcaa
     5221 cgtttttata ttcctgaaaa tgggcaggtt ttgattgatg gacatgatct agccttagct
     5281 gatccaaact ggctacgccg tcaaataggt gtagtgctgc aagataatgt gttattaaac
     5341 cgcagtatcc gagaaaatat tgcgctatca gatccaggaa tgccaatgga gcgagtaatt
     5401 tatgcagcaa aattagcagg ggctcacgat tttatttcag aattgcgtga aggttatacc
     5461 accattgtgg gtgaacaagg agcggggctt tcaggcgggc aacgccaacg gattgcgatt
     5521 gctcgagctt tggtaaacaa cccgaaaatc ctgatttttg atgaggcaac cagtgccctc
     5581 gattacgaat ctgagcatat tattatgcaa aatatgcaaa aaatatgcca aggcagaacc
     5641 gtgattttga ttgcacatcg tttatcgacc gtcaaaaatg cggatcgaat tattgtgatg
     5701 gaaaaggggg aaattgttga gcaaggcaag caccacgaat tactgcaaaa cagtaacgga
     5761 ctttattcct acttacacca attacaactt aattaagaag gaaaacaatg aaaatatggc
     5821 ttagtggtat ttatgaattt ttcctacgct ataaaaacat ttgggcagaa gtatggaaaa
     5881 ttcgtaaaga attagaccac ccaaacagaa aaaaagacga aagtgaattt ttaccggcac
     5941 atttagaact gattgaaacc ccggtttcta aaaaaccacg tctaattgct tatttgatta
     6001 tgctattttt agttgtggca attgtgcttg ccagtgtaag caaagttgaa attgtggcga
     6061 ctgctcccgg taaattaact tttagtggca gaagtaaaga aattaaaccg attgaaaacg
     6121 ccattgtaca agaaattttc gttaaagatg ggcagtttgt ggaaaaaggg caattattag
     6181 tcagcttaac tgcattgggt tctgatgcag atatcaaaaa gaccatggct tcactttctt
     6241 tagctaaact ggagacctat cgctaccaaa ctttgcttac tgccattgaa aaagagtcct
     6301 tgccggtgat tgatttatct agaaccgaat ttaaagattc atcggaagaa gatcgactac
     6361 gtattaaaca cttaattgag gagcaataca ccacttggca aaaacaaaaa acacagaaaa
     6421 ctttagcgta taagcgtaaa gaggctgaaa aacaaacaat atttgcctat gtccgtaaat
     6481 atgaaggtgc aacacgtatt gaacaagaaa aattaaaaga ctttaaggca ctttataaac
     6541 agaagtcttt atctaagcac gaacttcttg cgcaagaaaa taaattaatt gaggctcaga
     6601 atgcagtagc tgtttatcgc tcaaaattaa atgaattaga aaatgatcta ctcaatgtaa
     6661 aagaagaact tgaattgatc acgcaattct ttaaaagcga tgtgttggaa aaattaaagc
     6721 aacatattga aaatgaacgc caacttcggc tcgagttaga aaaaaataat caacgcagac
     6781 aggcctcgat gatcagagca ccggtttccg gtacggttca gcaactgaaa attcacacta
     6841 taggtggtgt tgttacgact gctgaaacct tgatgatcat tgtgccggaa gacgatgtgt
     6901 tagaggccac cgctctggtt ccaaacaaag atatcggctt tgttgcagca gggcaggagg
     6961 tgattattaa agtggaaact ttcccttata cacgctatgg ttatctaact ggtcgaatta
     7021 aacatattag cccggatgcg attgaacaac ctaatgtagg cttagttttt aatgcaacta
     7081 tagctataga taggaagaat ctaacatcgc ctgatgggcg aaaaattgat ttgagttcag
     7141 gtatgacaat aactgctgaa atcaaaaccg gtgaacggag tgtaatgagt tatttactca
     7201 gcccattaga agaatctgtc acagaaagtt taagggaacg ctaatcgaac caaaacaaag
     7261 ccataaaagc cattttgagc ttttatggct ttatttttta gtccacaagc ggacaaaaaa
     7321 gcccaatttt ttacactttt ataacaaatt gttctaacta aaaattacta attcttttct
     7381 tttatagcga tctctatttc atttcattaa cattgactag aagggattat gagcctaagc
     7441 attacgaatc tttctcttgg ctaccgcaaa aatcagcaaa ggcttatttg aaaagcacgg
     7501 tgtcgaggtg gaaaaaccgg tgatgtttcg cagctgggct cagttggtgg aagcttttta
     7561 agtggcaatg tgaacgtggt gcatctgctt tcgcctatga gtttgtgggc gaaatatgga
     7621 gcaaatgctc cggtgaaagc ggtaatgtgg aatcacttgg caggttcggc tttaacggtt
     7681 cgccctgaaa tcaacagtat tgccgaactc tccggcaaaa cggtagaact tccgttttgg
     7741 ta
//
LOCUS       RATBADPTA    3477 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Rat beta adaptin mRNA, complete cds.
ACCESSION   M34176 J05273
KEYWORDS    beta adaptin.
SOURCE      Rat lymphocyte, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3477)
  AUTHORS   Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and
            Parham,P.
  TITLE     Conservation and diversity in families of coated vesicle adaptins
  JOURNAL   J. Biol. Chem. 265, 4814-4820 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         72     2885     beta adaptin
    mRNA    <     1     3477     beta adaptin mRNA
BASE COUNT      914 a    851 c    822 g    890 t
ORIGIN      
        1 cggggctgtg ctctctgact gccgccgcca ccccgcccct tgcctccggt tcacgctgaa
       61 gatccagaat catgactgac tccaagtact tcacaaccaa taagaaggga gaaatctttg
      121 aattaaaagc tgaactcaac aatgaaaaga aagaaaagag gaaggaggct gtgaagaaag
      181 tgattgctgc tatgactgtg gggaaagacg ttagctctct cttcccagat gtggtgaact
      241 gtatgcagac tgacaacctg gaactaaaga agcttgtgta cctctatctg atgaactatg
      301 ccaagagtca gccagacatg gccatcatgg ctgtcaacag ctttgtgaag gattgtgaag
      361 accccaatcc tttgattcga gccttggcag ttagaaccat gggatgcatc cgggtggaca
      421 agattacaga gtatctctgt gaacccctcc gcaagtgctt gaaggatgaa gacccctatg
      481 ttcggaaaac agcagcagta tgcgtggcaa aactccatga tatcaatgcc cagatggtgg
      541 aagatcaggg atttctggat tctctgcggg atctcatagc agattcaaac ccaatggtgg
      601 tggctaatgc tgtagcagca ttgtctgaga tcagtgagtc tcacccaaac agcaacttac
      661 ttgatctgaa ccctcagaat atcaataagc tgctcacagc cctgaatgag tgcactgagt
      721 ggggccagat tttcatcttg gactgcctgt ctaattacaa ccctaaagat gaccgggaag
      781 ctcagagcat ctgtgagcga gtgacgcctc ggctctctca tgccaattct gcagtggtgc
      841 tttcagcagt aaaagttctg atgaagtttc tagagttgtt acccaaggac tctgactact
      901 acaatatgct gctaaagaag ctagcgcctc cacttgtcac tttgctctct ggggagccag
      961 aagtgcagta tgttgccctg aggaacatca acctaattgt ccagaaaagg cctgaaatct
     1021 tgaagcagga aatcaaggtc ttctttgtga agtacaatga tcctatctat gttaaactag
     1081 agaagttaga catcatgatt cgtcttgcat cccaagccaa cattgctcag gttctggcag
     1141 aactgaagga atatgccact gaagttgatg tggactttgt tcgcaaagct gtgagggcca
     1201 ttggacggtg tgccatcaaa gtggagcaat cagcagaacg ctgtgtgagc acactgcttg
     1261 atctaatcca gaccaaagta aattatgtgg tccaagaggc aattgttgtc atcagggaca
     1321 tcttccgaaa ataccccaac aagtatgaga gcattatcgc cacgctgtgt gagaacttgg
     1381 actccctgga tgaacccgat gcccgagcgg ctatgatttg gattgtagga gagtatgctg
     1441 aaagaatcga taatgccgat gagttactag agagcttcct ggaaggtttt catgatgaaa
     1501 gcacccaggt gcagctcacg ttgcttaccg ccatagtgaa actgtttctc aagaagccat
     1561 cagaaacaca ggagctggtc caacaggtct tgagcttggc cacacaggat tctgataatc
     1621 ctgaccttcg agatcggggt tatatttatt ggcgccttct ttcaactgac cctgtgacag
     1681 ccaaagaagt agtgttgtct gagaagccat tgatctctga ggaaacagac ctcattgaac
     1741 ctaccctcct ggatgagctc atctgccaca ttggttcttt ggcctccgtg taccataaac
     1801 ctccgaatgc ttttgtggaa gggagccatg gcattcatcg caaacacttg ccaattcacc
     1861 atgggagcac tgatgcaggt gatagccctg ttggcaccac cactgcaacc aacctggaac
     1921 agcctcaggt catcccctct caaggtgacc ttctggggga tcttttaaat cttgacctgg
     1981 gtcccccagt gaatgtcccg caagtgtcct ccatgcagat gggagcagtg gatcttttag
     2041 gaggaggact ggatagcctg gtaggacagt ccttcatccc gtcatcagtg cctgcaacct
     2101 tcgctccttc acctactcct gctgtggtca gcagtggtct gaatgacctg tttgagcttt
     2161 ccactgggat aggcatggca cctggcggat atgtggctcc taaggcagtc tggctacctg
     2221 ctgtaaaggc taaaggcttg gagatttcgg ggacgtttac tcaccgccaa gggcacatct
     2281 atatggaaat gaacttcacc aacaaagctc tgcagcacat gacggatttt gccatccagt
     2341 ttaacaagaa tagcttcggt gtcatcccga gcactccctt ggccatacat actccgctga
     2401 tgccaaacca gagcattgat gtgtctctgc ctctcaacac cttgggccca gtcatgaaga
     2461 tggagcctct gaataacttg caggtggctg ttaaaaacaa tattgatgtc ttctacttca
     2521 gctgcctcat cccactcaat gtgctttttg tagaagatgg caaaatggaa cgccaggtct
     2581 tccttgcgac gtggaaggat attcccaatg aaaatgagct ccaatttcag attaaggagt
     2641 gtcatttaaa cgctgacaca gtttccagca agttgcaaaa caacaatgtt tacactatcg
     2701 ccaagaggaa tgtggagggg caggacatgc tgtaccagtc cctgaagctc actaatggca
     2761 tttggatttt ggcagagctg cggatccagc caggaaaccc caattatacg ctgtcgctga
     2821 agtgtagagc ccctgaagtc tctcagtaca tctatcaggt ctacgacagc attttgaaaa
     2881 actaataaat gggtccagtc agcctgtaat cagtgcaagc cacgaactct taactgaaag
     2941 acactgtatt gttgtgtaga gcctgaaccc aaaccctgcg gtacccaccc cggtagtggc
     3001 cagtcatttt gtgctgatat tagcactcac cccattggta ggttagcttc ccgtgacatc
     3061 tccttccact atcgcccacc tctgccacct gccgctgctc tctgtcctta gttgtgagtt
     3121 cctctgtgct gtgccaatgg ctagcctttt ctacaccctc ttttgagtgt agtttgatat
     3181 tttgtaatcg aaagctcatt tcacaagcag aaaaaggcaa caagttaatt agagcgagga
     3241 agagtgtcac tgaaacatac actgcacctt attgttttat atttttgtac agatgagata
     3301 gatattgagg tagaacgctg agtagaaagg gtgactgacc ctcctcagac acagtcttat
     3361 tggagacata tggccctggc cccttctggg caaggagagg cgaccccact cctggtcttt
     3421 tgcattttca ccttggccac gccttccagc tctcttatgc ccatgctctc tcatttg
//
LOCUS       RATPSPB      1620 bp ss-mRNA            ROD       12-JUL-1990
DEFINITION  Rat pulmonary surfactant-associated glycoprotein A (SP-A) mRNA,
            complete cds.
ACCESSION   M33201
KEYWORDS    pulmonary surfactant protein A.
SOURCE      Rat fetal lung, cDNA to mRNA, clone SP-A [0.9, 1.6].
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1620)
  AUTHORS   Fisher,J.H., Emrie,P.A., Shannon,J., Sano,K., Hattler,B. and
            Mason,R.J.
  TITLE     Rat pulmonary surfactant protein A is expressed as two differently
            sized mRNA species which arise from differential polyadenylation of
            one transcript
  JOURNAL   Biochim. Biophys. Acta 950, 338-345 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         56      802     pulmonary surfactant protein A (56 could be
                                 29)
BASE COUNT      425 a    400 c    396 g    399 t
ORIGIN      
        1 cagatatcca cacagcctgc aggtctgtat gtggaagcca ctggggatag tagccatgtc
       61 actgtgttct ttggccttca ccctcttctt gactgttgtc gctggtatca agtgcaatgt
      121 gacagacgtt tgtgctggaa gccctgggat ccctggagct cctggaaacc atggtctgcc
      181 tggcagagac gggagagacg gtgtcaaagg agaccctgga cctccaggtc ccatgggccc
      241 tcctggagga atgccaggtc ttcctggacg cgatgggctg cccggaggac ctggtgcacc
      301 tggaggacgt ggagacaagg gagagcctgg agaaaggggc ctgccaggat ttccagctta
      361 cctggatgag gagctccaga ctgaactcta tgagatcaaa catcagattc tgcaaacaat
      421 gggagtcctc agcttgcaag gatccatgct gtcagtgggg gataaagtct tttccaccaa
      481 tgggcagtca gtcaactttg ataccattaa agagatgtgt accagagcag gaggcaacat
      541 tgctgtcccg aggactcctg aggagaacga ggccattgca agtattgcga agaagtacaa
      601 caactatgtc tacttgggca tgattgaaga ccagactcct ggagacttcc actacctgga
      661 tggggcttct gtgaactaca ccaactggta cccaggagaa cccaggggtc agggcaaaga
      721 aaagtgtgta gaaatgtata cagatgggac atggaatgat aggggctgcc tgcagtaccg
      781 gctggctgtt tgtgaatttt gatcaagcaa ttagacgaaa agatgaaccc tcacactgcc
      841 tctatcctga tgattcatct ggtctgtaaa accctgcaac tacctttact tgtggccttc
      901 agtaattaga agcatctttt gtcacccccg ctcccacata gttcccaaac acttctccat
      961 attcattagc aatcctgagt gtttccctag agtcccatct gagcgttcat tcaaggtagc
     1021 cattgtaaac cttggccttg accatgagat ggatagatac ttcctttttc ctcactttat
     1081 ccagtcttca tttataaatg gtggccatga agacccagca tggaaggacc ctctaactaa
     1141 gtgctgccct ctgacctttc cacccttctg tagctcggtg tcccaggatt tagaagtcca
     1201 ggttaaacat aggggatttc tgggaaagcc tagtatgtgg gtgcaggcca cattcatgcc
     1261 atctgtatcc atggctttca aggcaaacat tgtctctaag aagccagaga accaggagaa
     1321 ccaggtagga ccaggtagta ctgggggaac ataaactcac ttggtttggc atgtatggct
     1381 cctccttggg tctggaggtg ccatcttgac cttgaactaa cagcagccac cctgggtttt
     1441 gagagaacga ccttcccagc ccagacccca actcaagtaa tttcctgcta acagacacag
     1501 cctcagttca ctttacatca ctgaggcatt catgatacga actgcaatct gttttctcct
     1561 ctcgtgagtt caatcagcta ttcattaaag tcaactgcat tcaaaaaaaa aaaaaaaaaa
//
LOCUS       FSBCRYGM1     613 bp ss-mRNA            VRT       12-JUL-1990
DEFINITION  Carp gamma-crystallin (gamma-m1) mRNA, complete cds.
ACCESSION   X12902 M33115
KEYWORDS    crystallin; gamma-crystallin.
SOURCE      Cyprinus carpio lens, cDNA to mRNA.
  ORGANISM  Cyprinus carpio
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; 
            Actinopterygii; Cypriniformes; Cyprinoidei; Cyprinidae.
REFERENCE   1  (bases 1 to 613)
  AUTHORS   Chang,T., Jiang,Y.-J., Chiou,S.-H. and Chang,W.-C.
  TITLE     Carp gamma-crystallins with high methionine content: Cloning and
            sequencing of the complementary DNA
  JOURNAL   Biochim. Biophys. Acta 951, 226-229 (1988)
  STANDARD  simple staff_review
COMMENT     [1]  Author address Chang W.-G., Institute of Biological Chemistry,
            Academia Sinica, P.O. Box 23-106, Taipeh 10098, Taiwan R.O.C..
            
            Submitted (09-SEP-1988) on tape to the EMBL data library.
FEATURES       from  to/span     description
    pept         34      570     gamma-crystallin (gamma-m1)
BASE COUNT      166 a    133 c    167 g    147 t
ORIGIN      
        1 ctgaagcact gagataaaca accctctacc atcatgggca agatcatctt ctacgaggac
       61 aggaacttcc agggccgcag ctatgactgc atgagcgact gctctgatat ctcctcttac
      121 ctcagccgcg ttggttcaat cagggtggag agtggttgtt tcatggtcta tgagcgcaac
      181 agctacatgg ggaaccagtt cttcctgagg aggggcgagt accatgatat gcagcgcatg
      241 atgagcatgg gcatgatgtt tgacactatc agatcctgcc gcatgattcc tccatacagg
      301 ggttcctaca gaatgaggat ctacgagagg gacaccttcg gaggacagat gcacgaggtg
      361 atggatgact gtgacaacat catggaacgt taccgtatgt ctgactggca gtcttgtcat
      421 gtgatggacg gccactggct cttctatgag cagccacact acagaggcag aatgtggtac
      481 ttcaggcctg gagagtacag gagcttcaga gatatgggat acagcaacat gagattcatg
      541 agcatgaggc gtatcactga tatgtgttaa actgctagaa tatagaagga attaaagtgt
      601 tattctcaga act
//
LOCUS       FSBCRYGM2     554 bp ss-mRNA            VRT       12-JUL-1990
DEFINITION  Carp gamma-crystallin (gamma-m2) mRNA, complete cds.
ACCESSION   X12903 M33116
KEYWORDS    crystallin; gamma-crystallin.
SOURCE      Cyprinus carpio lens, cDNA to mRNA.
  ORGANISM  Cyprinus carpio
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; 
            Actinopterygii; Cypriniformes; Cyprinoidei; Cyprinidae.
REFERENCE   1  (bases 1 to 554)
  AUTHORS   Chang,T., Jiang,Y.-J., Chiou,S.-H. and Chang,W.-C.
  TITLE     Carp gamma-crystallins with high methionine content: Cloning and
            sequencing of the complementary DNA
  JOURNAL   Biochim. Biophys. Acta 951, 226-229 (1988)
  STANDARD  simple staff_review
COMMENT     [1]  Author address Chang W.-G., Institute of Biological Chemistry,
            Academia Sinica, P.O. Box 23-106, Taipeh 10098, Taiwan R.O.C..
            
            Submitted (09-SEP-1988) on tape to the EMBL data library.
FEATURES       from  to/span     description
    pept          7      528     gamma-crystallin gamma-m2 (AA 1 - 173)
BASE COUNT      142 a    124 c    150 g    138 t
ORIGIN      
        1 tggcccatga aggtcacctt ttatgaggac aggaacttcc agggtcgctc ttatgactgt
       61 atgagcgact gtgccgattt ctcctcctac atgagccgct gtcactcttg cagagtgcac
      121 agcggatgct ggatgatgta cgatcaaccc aactacatgg gaaatcagta tttctttagg
      181 aggggagagt atgctgatta catgtctatg tttggaatga gcaactgcat caggtcctgc
      241 cgtatgatcc ctatgcacag gggatcctac agaatgagga tctacgagag ggagaacttc
      301 atgggccaga tgtacgaaat ggccgatgac tgtgacagta tcatggaccg ttaccgcatg
      361 cctcactgcc agtcctgcca tgtgatggac ggccactggc tcatgtatga gcagccccac
      421 tacagaggca ggatgtggta cttcaggcct ggagagtaca ggagcttcag caatatgggt
      481 ggaatgagat tcatgagcat gaggcgtatc atggactcct ggtactagag tttatattaa
      541 taaaataact cctc
//
LOCUS       HUMIL2A1      940 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human interleukin 2 gene, exons 1 and 2.
ACCESSION   M33199
KEYWORDS    interleukin; interleukin 2.
SEGMENT     1 of 2
SOURCE      Human DNA, clones Lm HIG[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 940)
  AUTHORS   Nishino,N., Obaru,K., Maeda,S., Shimada,K. and Onoue,K.
  TITLE     Organization of the DNA regions flanking the human interleukin 2
            gene
  JOURNAL   Biomed. Res. 6, 197-205 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        629      775     interleukin 2, exon 1 /nomgen="IL2"
                                 /map="4q26-q27" /hgml_locus_uid="LT0164X"
                865  /   924     interleukin 2, exon 2
    IVS         776      864     IL2 intron A
    IVS         925  >   940     IL2 intron B
BASE COUNT      313 a    181 c    137 g    309 t
ORIGIN      
        1 cttcaactca ataagcattt taagtattct aatcttagta tttctctagc tgacatgtaa
       61 gaagcaatct atcttattgt atgcaattag ctcattgtgt ggataaaaag gtaaaaccat
      121 tctgaaacag gaaaccaata cacttcctgt ttaatcaaca aatctaaaca tttattcttt
      181 tcatctgttt actcttgctc ttgtccacca caatatgcta ttcacatgtt cagtgtagtt
      241 ttatgacaaa gaaaattttc tgagttactt ttgtatcccc acccccttaa agaaaggagg
      301 aaaaactgtt tcatacagaa ggcgttaatt gcatgaatta gagctatcac ctaagtgtgg
      361 gctaatgtaa caaagaggga tttcacctac atccattcag tcagtctttg ggggtttaaa
      421 gaaattccaa agagtcatca gaagaggaaa aatgaaggta atgttttttc agacaggtaa
      481 agtctttgaa aatatgtgta atatgtaaaa cattttgaca cccccataat atttttccag
      541 aattaacagt ataaattgca tctcttgttc aagagttccc tatcactctc tttaatcact
      601 actcacagta acctcaactc ctgccacaat gtacaggatg caactcctgt cttgcattgc
      661 actaagtctt gcacttgtca caaacagtgc acctacttca agttctacaa agaaaacaca
      721 gctacaactg gagcatttac ttctggattt acagatgatt ttgaatggaa ttaatgtaag
      781 tatatttcct ttcttactaa aattattaca tttagtaatc tagctggaga tcatttctta
      841 taacaatgca ttatactttc ttagaattac aagaatccca aactcaccag gatgctcaca
      901 tttaagtttt acatgcccaa gaaggtaagt acaatatttt
//
LOCUS       HUMIL2A2      569 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human interleukin 2 (IL-2) gene, 3' flank.
ACCESSION   M33198
KEYWORDS    Alu repetitive sequence; interleukin; interleukin 2.
SEGMENT     2 of 2
SOURCE      Human DNA, clones Lm HIG[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 569)
  AUTHORS   Nishino,N., Obaru,K., Maeda,S., Shimada,K. and Onoue,K.
  TITLE     Organization of the DNA regions flanking the human interleukin 2
            gene
  JOURNAL   Biomed. Res. 6, 197-205 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt         136      449     Alu-repeat /nomgen="IL2" /map="4q26-q27"
                                 /hgml_locus_uid="LT0164X"
    rpt         130      135     5' insertion target sequence
    rpt         450      455     3' insertion target sequence
BASE COUNT      204 a    131 c     96 g    138 t
ORIGIN      Unknown number of bp after segment 1.
        1 agcttcaata agatccaatg aatattctag attctatttg tcttctgaag acagcttaat
       61 ctaatttaga taaaaataac atcatccaga gcctctacac tatttcagac acatgtagca
      121 tcagcttaaa aattatgaaa cctactggct aacacgtgaa accttgtcac taccaaaaat
      181 acaaaaaaaa aaaaattagc tgagtgtggt ggcgggcgcg tagtcccagc tactcaggag
      241 gctgaggcag gagaatggcg tgaacttggt aggcagagct gcagtgagcc aagatcgtgc
      301 cattgcactc cagcctgggt gacagagcaa gactccatct caaaaaaaaa aaaaaaaaaa
      361 aagagacctg ctaacacaca cacacacaca cacacacaca ctctctctct ctctctctct
      421 ctctctctct ctctctctct ctctctctca aattaagttg ggcggcaagg ggaaacaata
      481 aacatctcca acataggatt caagtgtagt tataagatac agactttaac taatataata
      541 tgttcaagaa aataaagcat catatctag
//
LOCUS       TOMCPKA       103 bp ss-rRNA            ORG       12-JUL-1990
DEFINITION  Tomato chloroplast 4.5S ribosomal RNA.
ACCESSION   M33098
KEYWORDS    4.5S ribosomal RNA; ribosomal RNA.
SOURCE      Tomato (strain Mill) chloroplast ribosomal RNA.
  ORGANISM  Chloroplast Lycopersicon esculentum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae; Lycopersicon esculentum.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Zhen-Qi,C., Xiao,X. and E,-Sheng.W.
  TITLE     The nucleotide sequence of 4.5 S rRNA from tomato chloroplasts
  JOURNAL   Biochim. Biophys. Acta 866, 89-91 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      103     4.5S ribosomal RNA
BASE COUNT       29 a     19 c     32 g     23 t
ORIGIN      
        1 gaaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagtg
       61 atgtatgcag ctgaggcatc ctaacagatc ggtagacttg aac
//
LOCUS       EBOMAY        157 bp ss-RNA             VRL       12-JUL-1990
DEFINITION  Ebola virus 3' proximal protein gene, 5' end.
ACCESSION   M33062
KEYWORDS    .
SOURCE      Ebola virus (strain MAY; Zaire 1976) RNA.
  ORGANISM  Ebola virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae.
REFERENCE   1  (bases 1 to 157)
  AUTHORS   Kiley,M.P., Wilusz,J., McCormick,J.B. and Keene,J.D.
  TITLE     Conservation of the 3' terminal nucleotide sequences of Ebola and
            Marburg virus
  JOURNAL   Virology 149, 251-254 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         53  >   157     3'proximal protein
BASE COUNT       56 a     22 c     31 g     48 t
ORIGIN      
        1 gggcacacaa aaagaaagaa gaatttttag gatcttttgt gtgcgaataa ctatgaggaa
       61 gattaataat ttcctctcat tgaaatttga tgatcggaat ttgaaattga aattgttgat
      121 ctgtaatcac accgttgatt cagagccaca cacaagt
//
LOCUS       ECOBISCASD   3337 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  E.coli biotin sulfoxide reductase (bisC) gene, complete cds.
ACCESSION   M34827
KEYWORDS    biotin sulfoxide reductase; bisC gene.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3337)
  AUTHORS   Pierson,D.E. and Campbell,A.
  TITLE     Cloning and nucleotide sequence of bisC, the structural gene for
            biotin sulfoxide reductase in Escherichia coli
  JOURNAL   J. Bacteriol. 172, 2194-2198 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        577     2757     biotin sulfoxide reductase (bisC)
BASE COUNT      764 a    856 c    927 g    790 t
ORIGIN      
        1 tatccccgct gcgggttacg ctaacaccag tgccgcgcat tttgtcgcgc agttcgcttc
       61 ctgcacatcc atgtaataac caacgccgcc gcccagagct gcgcctgctg ctgcgccaat
      121 cagcgcgcct ttaccgcgat ctttcttcga agaagagagc gcaccaatac ccgcgcccac
      181 gagagagccc agacctgcgc cgatagcaga tttacctgct tcgcgttcgc cggtgtaagg
      241 gttagttgtg cagccagata ccgccagagc gccactcact acggcggcaa taagataaac
      301 acgtttcttc attgttaatc cttaataacc tttttattct ttgccacggg ttccgtggcg
      361 ggagattatg ccgcgtgaac atgaagatta ttcctgggaa tactcggaaa tttgtaagta
      421 atatttaact gctcaataca tctaaccttt caggagtctt cggtttggcc aactcatcct
      481 cacgatattc cgttctgact gccgccattg ggggcccatg ctggttgaaa ccgacggcga
      541 aaccgtgttt agctgcgtgg cgcgttagcc acaggaatgg aaaactcctt gcagagcgcg
      601 gttcgcgacc aggttcacag caatacgcgg gtacgatttc caatggtgcg aaaaggcttt
      661 cttgcgtcac cggaaaaccc gcaaggcatt cgtgggcagg atgaatttgt tcgcgtgagt
      721 tgggatgagg cgctggatct tattcaccaa caacataaac gcattcgtga ggcttatggt
      781 ccggcatcga tttttgctgg ttcctacggc tggcgttcaa acggcgtgct gcataaggcc
      841 tcgacattat tacaacgcta tatggcgctg gcaggcggtt ataccgggca tctgggggat
      901 tattcgaccg gcgcggcaca ggcgatcatg ccgtatgtcg tgggtggtag tgaagtttat
      961 caacagcaga ccagttggcc gctggtgctg gaacatagcg atgtcgtggt gctgtggagt
     1021 gctaacccac tcaatacgct gaaaattgcg tggaatgcat ccgatgagca ggggctttct
     1081 tacttttctg cactgcgtga cagcgggaaa aagctgatct gcattgatcc aatgcgatcg
     1141 gaaaccgtcg atttctttgg cgataaaatg gagtgggtgg caccgcacat gggcaccgat
     1201 gttgcgctga tgctggggat cgcccatacg ctggtggaaa atggttggca cgacgaagcg
     1261 tttctggcgc gttgcaccac aggttatgcc gtcttcgcct cttatttgct gggcgagagt
     1321 gacggaatag cgaaaaccgc cgaatgggca gcagagattt gtggtgttgg cgcagcgaaa
     1381 atccgcgagc tggcggctat tttccaccaa aataccacca tgctgatggc aggctgggga
     1441 atgcagcgcc aacagtttgg tgagcaaaaa cactggatga tcgtcacgct ggcagcaatg
     1501 ttggggcaaa tcggcacacc cggcggcggt tttggtcttt cttaccattt tgccaatggt
     1561 ggtaacccca cgcggcgttc tgcggtgctc tcttccatgc agggcagctt gccgggtggc
     1621 tgcgatgcgg tggataaaat ccctgttgcc cgcattgttg aagcactgga aaaccctggt
     1681 ggcgcatatc aacacaacgg tatgaaccga catttcccgg atattcgttt tatctggtgg
     1741 gcgggcggtg ccaactttac tcatcatcag gataccaatc gcctgatccg tgcctggcaa
     1801 aaaccggagc tggtggtgat ctctgaatgc ttctggacgg cggcggcaaa acacgcggat
     1861 atcgttctgc ctgcgactac ctcttttgag cgtaatgatc tcaccatgac cggtgattac
     1921 agtaatcagc atctggtgcc gatgaagcaa gtggtgccgc cacgctatga agcgcgtaat
     1981 gattttgatg tttttgccga gttaagtgaa cgctgggaga agggcggtta tgcacgtttt
     2041 acggaaggaa aaagtgagct gcaatggctg gaaacgtttt ataacgttgc ccgacagcgc
     2101 ggggcaagcc agcaggttga attgccgcca tttgctgagt tctggcaagc caaccagtta
     2161 attgagatgc cggaaaaccc ggacagcgag cggtttattc gcttcgctgc atttttgccg
     2221 cgatccgctg gcgatccgtt aaaaacgcag cgcaagattg aaatcttctc acagcgtatt
     2281 gccgattacg gttacccgga ttgccctggg catccaatgt ggctggagcc ggacgaatgg
     2341 cagggcaatg ccgaaccaga acagttgcag gtactttctg cccatccggc gcaccgcctg
     2401 cacagccagc tgaattacag ttctctgcgc gaattgtacg cggtggcaaa tcgtgagcct
     2461 gtcaccattc atcctgacga tgcccaggag cgcggcatac aagatggcga tactgttcgg
     2521 ttgtggaacg cacgcgggca aattcttgcc ggagcggtca ttagcgaggg aattaaacct
     2581 ggcgtgattt gcattcacga aggggcatgg ccggatctgg atttaaccgc tgacggtatt
     2641 tgtaaaaacg gcgcagtgaa cgtgctgacc aaagatctcc ccagctcgcg gctgggaatg
     2701 gctgtgcggg taatacggcg ctggcatggc tggaaaaata caacggtccg gaactgacac
     2761 ttacagcgtt tgaaccaccg gccagctcat aatccatgtg ggtagttggg tttcatcctg
     2821 ccatgcgcaa tcgacaatgt gaaaaccctg tgcctggtaa aaatttatcg ccggttgatt
     2881 tttttgataa acctccagca tcaggtgggg atggcgctgc tgcacatact gcatcagcgc
     2941 cttaccaata ccgcgcctga cggccttcgg tgcgacaaac atcgctgcca gaaatcggcc
     3001 ttccataatg ctgacaaaac cgagaagctt accgtcttct tcccagaccc agttttgcgc
     3061 gttggcaaga taggcatccg caccagcgga atgcagtcac gccagtaatt cgcttttata
     3121 aagggatgcc cccaggttgt actttccagc cacagttcga ggatcgcggg gagttctgaa
     3181 cgttgcgctt cccgaatcat ggtttatttc ccggatagca acagcagcca accacatgat
     3241 cattcaccag cccacatgcc tgcataaagg gagtaacaga ttgtggtgcc gacaaactta
     3301 aaaccacgtt ttttcagtgc cttagatagg gcgttcg
//
LOCUS       HUMARXA      1335 bp ss-mRNA            PRI       12-JUL-1990
DEFINITION  Human aldose reductase mRNA, complete cds.
ACCESSION   M34720
KEYWORDS    aldehyde reductase; aldose reductase.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1335)
  AUTHORS   Grundmann,U., Bohn,H., Obermeier,R. and Amann,E.
  TITLE     Cloning and prokaryotic expression of a biologically active human
            placental aldose reductase
  JOURNAL   DNA Cell Biol. 9, 149-157 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         14      964     aldose reductase (EC 1.1.1.21)
    mRNA    <     1     1335     aldose reductase mRNA
BASE COUNT      319 a    347 c    356 g    313 t
ORIGIN      
        1 gagcgcagca gccatggcaa gccgtctcct gctcaacaac ggcgccaaga tgcccatcct
       61 ggggttgggt acctggaagt cccctccagg gcaggtgact gaggccgtga aggtggccat
      121 tgacgtcggg taccgccaca tcgactgtgc ccatgtgtac cagaatgaga atgaggtggg
      181 ggtggccatt caggagaagc tcagggagca ggtggtgaag cgtgaggagc tcttcatcgt
      241 cagcaagctg tggtgcacgt accatgagaa gggcctggtg aaaggagcct gccagaagac
      301 actcagcgac ctgaagctgg actacctgga cctctacctt attcactggc cgactggctt
      361 taagcctggg aaggaatttt tcccattgga tgagtcgggc aatgtggttc ccagtgacac
      421 caacattctg gacacgtggg cggccatgga agagctggtg gatgaagggc tggtgaaagc
      481 tattggcatc tccaacttca accatctcca ggtggagatg atcttaaaca aacctggctt
      541 gaagtataag cctgcagtta accagattga gtgccaccca tatctcactc aggagaagtt
      601 aatccagtac tgccagtcca aaggcatcgt ggtgaccgcc tacagccccc tcggctctcc
      661 tgacaggccc tgggccaagc ccgaggaccc ttctctcctg gaggatccca ggatcaaggc
      721 gatcgcagcc aagcacaata aaactacagc ccaggtcctg atccggttcc ccatgcagag
      781 gaacttggtg gtgatcccca agtctgtgac accagaacgc attgctgaga actttaaggt
      841 ctttgacttt gaactgagca gccaggatat gaccacctta ctcagctaca acaggaactg
      901 gagggtctgt gccttgttga gctgtacctc ccacaaggat taccccttcc atgaagagtt
      961 ttgaagctgt ggttgcctgc tcgtccccaa gtgacctata cctgtgtttc ttgcctcatt
     1021 tttttccttg caaatgtagt atggcctgtg tcactcagca gtgggacagc aacctgtaga
     1081 gtggccagcg agggcgtgtc tagcttgatg ttggatctca agagccctgt cagtagagta
     1141 gaagtctctt ccagtttgct ttgcccttct ttctaccctg ctggggaaag tacaacctga
     1201 ataccctttt ctgaccaaag agaagcaaaa tctaccaggt caaaatagtg ccactaacgg
     1261 ttgagttttg actgcttgga actggaatcc tttcagcaag acttctcttt gcctcaaata
     1321 aaaagtgctt ttgtg
//
LOCUS       HUMARXB       652 bp ds-DNA             PRI       12-JUL-1990
DEFINITION  Human aldose reductase gene, partial cds.
ACCESSION   M34721
KEYWORDS    aldehyde reductase; aldose reductase.
SOURCE      Human placenta DNA, clone lambda-gt11-10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 652)
  AUTHORS   Grundmann,U., Bohn,H., Obermeier,R. and Amann,E.
  TITLE     Cloning and prokaryotic expression of a biologically active human
            placental aldose reductase
  JOURNAL   DNA Cell Biol. 9, 149-157 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       27     aldose reductase, exon X (AA at 1) (EC
                                 1.1.1.21)
                626  >   652     aldose reductase, exon X+1
    IVS          28      625     aldose reductase intron X
BASE COUNT      142 a    164 c    184 g    162 t
ORIGIN      
        1 gccaagcaca ataaaactac agcccaggta cagccacttc aggtgttgct gaccgtccac
       61 aactgcctgc attcctgaca gtcctgttag ccaagaggag gaagtgactg agcctgttac
      121 accctcacag gaagtatggt taggggtcct caagtacaga gtggaaaggg cacagatcgg
      181 ggttttagaa gactctggca tgggctctta gattaatagt gcctgccccc actactgcaa
      241 gggtgactgc cacgagggcc agcgcttgtt cattcatgtg gaacctcatc tgtacaaatg
      301 taagagctct tagccgtgca gggaatgttc tttctcctga gtggtagtgt gcatttctag
      361 ccagtggagg gcctcatgtg gtctcatgat atgcctgaga cactgaagcg tgtggcacag
      421 tggctagcgc aggactctgg agtcagatct ggacctgaat gcgtcgccta cctgttgcta
      481 gctgtgacct gacatcttgg agcccctctc tgatcacctg tggagttcta gcacgtcctt
      541 ctgcaggttg tgtgtgtgag agactgagat gatgggtgcg agtgcctggc atgtatacac
      601 actcactgtc tccttgggct cacaggtcct gatccggttc cccatgcaga gg
//
LOCUS       MRV3TERM       59 bp ss-RNA             VRL       12-JUL-1990
DEFINITION  Marburg virus 3'terminal region of genome.
ACCESSION   M36065
KEYWORDS    .
SOURCE      Marburg virus RNA.
  ORGANISM  Marburg virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae.
REFERENCE   1  (bases 1 to 59)
  AUTHORS   Kiley,M.P., Wilusz,J., McCormick,J.B. and Keene,J.D.
  TITLE     Conservation of the 3' terminal nucleotide sequences of Ebola and
            Marburg virus
  JOURNAL   Virology 149, 251-254 (1986)
  STANDARD  simple staff_review
BASE COUNT       15 a     10 c      5 g     29 t
ORIGIN      
        1 tctgtgtgtt ttgttctcta ctactaaaac acatagtata tttatttctt cttataatc
//
LOCUS       RATQRED1      431 bp ds-DNA             ROD       12-JUL-1990
DEFINITION  Rat quinone reductase gene, exon 1.
ACCESSION   M33038
KEYWORDS    quinone reductase.
SEGMENT     1 of 2
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 431)
  AUTHORS   Bayney,R.M. and Pickett,C.B.
  TITLE     Rat liver NAD(P)H:quinone reductase: Isolation of a quinone
            reductase structural gene and prediction of the NH2 terminal
            sequence of the protein by double-stranded sequencing of exons 1
            and 2
  JOURNAL   Arch. Biochem. Biophys. 260, 847-850 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        244  +   250     quinone reductase, exon 1
    pre-msg     170  >   431     quinone reductase mRNA and introns
    IVS         251  >   431     quinone reductase intron A
BASE COUNT       91 a    125 c    119 g     96 t
ORIGIN      
        1 taacttggta tcctcccccc agcgcctctg ggctggcaat ccagccccgc cctcgctggc
       61 tgccctgcac agtgggctgg gccggaaaag caagatataa agcctgaaag tgctcagtac
      121 agctcgcact agcctaggct gtggcacgca ggatctttcc gaagcatttc agggtcgtcc
      181 tggcaaccag ctgctcagcc aatcagcgct tgacactacg atccgccccc aacttctgga
      241 gccatggcgg gtgagtatgg ctccaactcc agcctaattc atcctgagga ggatgtaggg
      301 gcttgctatg gggtttgttc cttgcctcga agttgaaaag tgtagagatt aggatcctgg
      361 atgagcctcg gtgagtcccc ggaaggagag cttcttctca gaaccatagg tgcagattat
      421 tctgcagccc c
//
LOCUS       RATQRED2      410 bp ds-DNA             ROD       12-JUL-1990
DEFINITION  Rat quinone reductase gene, exon 2.
ACCESSION   M33039
KEYWORDS    quinone reductase.
SEGMENT     2 of 2
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 410)
  AUTHORS   Bayney,R.M. and Pickett,C.B.
  TITLE     Rat liver NAD(P)H:quinone reductase: Isolation of a quinone
            reductase structural gene and prediction of the NH2 terminal
            sequence of the protein by double-stranded sequencing of exons 1
            and 2
  JOURNAL   Arch. Biochem. Biophys. 260, 847-850 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +   108  /   272     quinone reductase, exon 2
    IVS     <     1      107     quinone reductase intron A
    IVS         273  >   410     quinone reductase intron B
BASE COUNT      103 a    111 c    102 g     94 t
ORIGIN      
        1 agaaactaag gtggggaacg tgtctggtcc caagcacttt tagattaggg actcacccgt
       61 cctgtttgga ttttctttcc tcacctcctc acgtacgcct taaacagtga gaagagccct
      121 gattgtattg gcccacgcag agaggacatc attcaactat gccatgaagg aggctgctgt
      181 ggaggctctg aagaagaaag gatgggaggt ggtcgaatct gacctctatg ctatgaactt
      241 taaccccctc atttccagaa acgacatcac aggtaagaat cgtctccctc cactgacagt
      301 ggaccacgtg acccagcctc agcccctctt gcctcccaac aggggagccg aaggactcgg
      361 agaactttca gtaccctgtt gagtcatctc tggcgtataa ggaaggccgc
//
LOCUS       RHAFIXA      1040 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  A.caulinodans nitrogen fixation protein (nifO and fixA) genes,
            complete cds and 5'end.
ACCESSION   M35122
KEYWORDS    nitrogen fixation protein.
SOURCE      A.caulinodans (strain ORS571) DNA.
  ORGANISM  Azorhizobium caulinodans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 1040)
  AUTHORS   Kaminski,P.A., Norel,F., Desnoues,N., Kush,A., Salzano,G. and
            Elmerich,C.
  TITLE     Characterization of the fixABC region of Azorhizobium caulinodans
            ORS571 and identification of a new nitrogen fixation gene
  JOURNAL   Mol. Gen. Genet. 214, 496-502 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        328      657     nitrogen fixation protein (nifO)
    pept        939  >  1040     nitrogen fixation protein (fixA)
BASE COUNT      181 a    349 c    328 g    182 t
ORIGIN      
        1 gagctcggcc tctatgacat cgacgccagc gcggtgaacg tcgcgcacgt gcccgtcatt
       61 ccggacgaga acgaggtgag cggcgtcgat atcgtcgtcc gcctgcgtcg cacgggccgc
      121 tgagggacgc tccgcctgtc gccttcgggg cacccgcatc cgcgtagcag cgcggccgcc
      181 tcccgcggac tggccatcgc cagtactggc acgggcattg cttggacctc atccgtgccc
      241 cgacatcggg gcaacgggtt cgcccgccaa agcgaccgga tgagttcacc tcatccgatt
      301 acgcaccaga ctttcaggag acggagcatg gcgaccgccg gcggcatcct cgatcagctc
      361 aacaaggcat ccagcgcgga agacttcttc gcgctgctcg aggtcgatta cgatccccaa
      421 gtggtgaatg tggtgcgcct gcatatcctg cggcgcatgg gccagtatct ggtcagcgag
      481 aatttcgaag gccaggcgga tgacgccatc cgcgcccggt gcaaagaggt gctggaacag
      541 gcctatgcgg acttcctcgc ctcctcgccc ttgcaggagc gggtgttcaa ggtgctgaag
      601 gaggccgccc agccgccgaa gcccaagccg atggtatcgc tcaccgttct caagtgacgt
      661 tccccccctc ccgcgtcctt caaggcggcc tgcacccggc aggccgccct tcgcgtttca
      721 gggcgcgggc gggtggtgag gggccacggg caagacgcgc ctgtcgcatt ccgacgcggg
      781 tggcggacgt tcctgtcggc ggcggagccg gggcggaaag cgcattgtgg catgccagac
      841 agccctttga tttcatgcgc gttttcgggc tgaaagacag ttggtacgac acttgctcat
      901 tcctccccaa gagcccaacc gttccgggag cgaacgcaat gcacatcgtc gtctgcatca
      961 agcaggttcc tgactccgcg cagatccgcg tgcaccccgt gacgaacacc atcatgcgtc
     1021 agggtgtgcc cacgatcatc
//
LOCUS       STMPPG        200 bp ds-DNA             BCT       12-JUL-1990
DEFINITION  S.griseus brown pigment production gene, 5' flank.
ACCESSION   M35117
KEYWORDS    brown pigment production protein.
SOURCE      S.griseus (strain TK21) DNA, clone pARC1.
  ORGANISM  Streptomyces griseus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 200)
  AUTHORS   Horinouchi,S., Nishiyama,M., Nakamura,A. and Beppu,T.
  TITLE     Construction and characterization of multicopy expression-vectors
            on Streptomyces spp
  JOURNAL   Mol. Gen. Genet. 210, 468-475 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        112  >   200     brown pigment production protein mRNA
BASE COUNT       22 a     79 c     71 g     28 t
ORIGIN      
        1 gatcgtccat ggtggccatc ccaccatccg ccgcgccggg gcggcgagcg cgtttcgctg
       61 ggcggacacg ctccccttgc cggtgctagc gcgaccgcgc tagcgtggtc gggtgcccaa
      121 gatccgtatg acgcccctga ccgaccggcg ttcggccggt tcctgaagca cgcccccgac
      181 cgcgcggccg gccgggccgg
//
LOCUS       SYNGPCNA      111 bp ds-DNA             SYN       12-JUL-1990
DEFINITION  Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end.
ACCESSION   M35111
KEYWORDS    nucleoprotein.
SOURCE      Synthetic DNA, clone pACRP1-LCM WE N.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
  TITLE     Expression of the S-coded genes of lymphocytic choriomeningitis
            arenavirus using a baculovirus vector
  JOURNAL   J. Gen. Virol. 67, 1515-1529 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         92  >   111     nucleoprotein
BASE COUNT       37 a     17 c     21 g     36 t
ORIGIN      
        1 tggagataat taaaatgata accatctcgc aaataaaccg gatcctaggc atttgattgc
       61 gcttttattt ggaaattcat tgtgtgacaa aatgtctttg tccaaagaag t
//
LOCUS       SYNGPCNB      111 bp ds-DNA             SYN       12-JUL-1990
DEFINITION  Lymphocytic choriomeningitis virus glycoprotein precursor gene,
            5' end.
ACCESSION   M35112
SOURCE      Synthetic DNA, clone pACRP1-LCM WE G.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
  TITLE     Expression of the S-coded genes of lymphocytic choriomeningitis
            arenavirus using a baculovirus vector
  JOURNAL   J. Gen. Virol. 67, 1515-1529 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        109  >   111     glycoprotein precursor
BASE COUNT       33 a     20 c     25 g     33 t
ORIGIN      
        1 tggagataat taaaatgata accatctcgc aaataaaccg gatcctaggc tttttggatt
       61 gcgctttcct ttaggacaac tgggtgctgg attctatcca gtaaaaggat g
//
LOCUS       SYNGPCNC      131 bp ds-DNA             SYN       12-JUL-1990
DEFINITION  Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end.
ACCESSION   M35113
KEYWORDS    nucleoprotein.
SOURCE      Synthetic DNA, clone pACRP5-LCM WE N.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 131)
  AUTHORS   Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
  TITLE     Expression of the S-coded genes of lymphocytic choriomeningitis
            arenavirus using a baculovirus vector
  JOURNAL   J. Gen. Virol. 67, 1515-1529 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        112  >   131     nucleoprotein
BASE COUNT       41 a     19 c     24 g     47 t
ORIGIN      
        1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgccg
       61 gatcctaggc atttgattgc gcttttattt ggaaattcat tgtgtgacaa aatgtctttg
      121 tccaaagaag t
//
LOCUS       SYNGPCND      131 bp ds-DNA             SYN       12-JUL-1990
DEFINITION  Lymphocytic choriomeningitis virus glycoprotein precursor gene,
            5' end.
ACCESSION   M35114
KEYWORDS    glycoprotein precursor.
SOURCE      Synthetic DNA, clone pACRP5-LCM WE G.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 131)
  AUTHORS   Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
  TITLE     Expression of the S-coded genes of lymphocytic choriomeningitis
            arenavirus using a baculovirus vector
  JOURNAL   J. Gen. Virol. 67, 1515-1529 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        129  >   131     glycoprotein precursor
BASE COUNT       37 a     22 c     28 g     44 t
ORIGIN      
        1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgccg
       61 gatcctaggc tttttggatt gcgctttcct ttaggacaac tgggtgctgg attctatcca
      121 gtaaaaggat g
//
LOCUS       SYNGPCNE      155 bp ds-DNA             SYN       12-JUL-1990
DEFINITION  Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end.
ACCESSION   M35115
KEYWORDS    nucleoprotein.
SOURCE      Synthetic DNA, clone pACRP6-LCM WE N.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 155)
  AUTHORS   Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
  TITLE     Expression of the S-coded genes of lymphocytic choriomeningitis
            arenavirus using a baculovirus vector
  JOURNAL   J. Gen. Virol. 67, 1515-1529 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        136  >   155     nucleoprotein
BASE COUNT       53 a     22 c     26 g     54 t
ORIGIN      
        1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgtaa
       61 cagttttgta ataaaaaaac cccggatcct aggcatttga ttgcgctttt atttggaaat
      121 tcattgtgtg acaaaatgtc tttgtccaaa gaagt
//
LOCUS       SYNGPCNF      155 bp ds-DNA             SYN       12-JUL-1990
DEFINITION  Lymphocytic choriomeningitis virus glycoprotein precursor gene,
            5' end.
ACCESSION   M35116
KEYWORDS    glycoprotein precursor.
SOURCE      Synthetic DNA, clone pACRP6-LCM WE G.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 155)
  AUTHORS   Matsuura,Y., Possee,R.D. and Bishop,D.H.L.
  TITLE     Expression of the S-coded genes of lymphocytic choriomeningitis
            arenavirus using a baculovirus vector
  JOURNAL   J. Gen. Virol. 67, 1515-1529 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        153  >   155     glycoprotein precursor
BASE COUNT       49 a     25 c     30 g     51 t
ORIGIN      
        1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgtaa
       61 cagttttgta ataaaaaaac cccggatcct aggctttttg gattgcgctt tcctttagga
      121 caactgggtg ctggattcta tccagtaaaa ggatg
//
LOCUS       XELD7        1051 bp ss-mRNA            VRT       12-JUL-1990
DEFINITION  X.laevis pot. developmental protein (D7) mRNA, complete cds.
ACCESSION   M35119
KEYWORDS    developmental protein D7.
SOURCE      X.laevis, cDNA to mRNA, clones D7.1 and D7.0.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1051)
  AUTHORS   Smith,R.C., Dworkin,M.B. and Dworkin-Rastl,E.
  TITLE     Destruction of a translationally controlled mRNA in Xenopus oocytes
            delays progesterone-induced maturation
  JOURNAL   Genes Dev. 2, 1296-1306 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         27      863     pot. developmental protein D7
BASE COUNT      318 a    240 c    222 g    271 t
ORIGIN      
        1 gaaaaccggg acgtttgggc tgcaatatgg aatttgatga gctgatgcag tgcccatatg
       61 acaaaaatca tatgattcgg cccagccggt ttccctacca ccttgttaaa tgcagagaga
      121 ataatcgtgc agcagctaaa attctagcaa cttgcccata taatgcccgc cacagagtcc
      181 ctaaacagga gcttgatctg cacatggcca gctgtgaata cagggtgacc atggagccca
      241 tttctgctgc attttcacat cagaaggtgg agacctcaac atggcaaagc cctccttgtg
      301 aagaggtctg ggaaactgac gaagatcccg tgtcaaggcc aaagcccttt attttaaatg
      361 attttactcc ttctcagcct tttaatatgt cagaaggtga tggaaatatg ccgtatactg
      421 gaataagcag caactacaga cctgaagtcc aacctatgaa ttcagtcatg caagtaaagc
      481 aaaatcaacc tgaacctgag ccttttacct ccagtgagcg aaactatgat ccacgatcca
      541 aggaaccacc caatccaaag caacctgcag tgaatggcta caaacctgca actacaaata
      601 caaacccatg gtgcaggcaa acgggaggat cgaggggagc tgctcctcca aagttgggtg
      661 ctaaatcctc agatgagggg ccaagaaata aggaatttcc cactccaaag gcgaacttga
      721 tgaatgagta cgtacctgta gcagcaaatg caaatccatg gtgcaggcaa ccaggagggt
      781 ccagtgctgc ttcagaacct ttgggtgttg actccttcga tgagtggcca tgccttggac
      841 gccagccatg ggttagaaag taaatcttca ctttaaaaac aggactttca tctgaacctg
      901 ttcctgactt gtccaactcc tggattttta aaatttgttg tgaagttgcc atttagtatt
      961 tttgtacaaa attttaacag ccttcatttt tacatattaa gctttttatc acaaatataa
     1021 tactaattta cttgaatgtt atttgttaac c
//
LOCUS       YSCNUP1      4986 bp ds-DNA             PLN       12-JUL-1990
DEFINITION  S.cerevisiae nucleoporin (NUP1) gene, complete cds..
ACCESSION   M33632
KEYWORDS    nuclear pore complex protein; nucleoporin.
SOURCE      S.cerevisiae (strain S288C) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 4986)
  AUTHORS   Davis,L.I. and Fink,G.R.
  TITLE     The NUP1 gene encodes an essential component of the yeast nuclear
            pore complex
  JOURNAL   Cell 61, 965-978 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by L.I.Davis, 06-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept       1001     4231     nucleoporin (NUP1) (put.)
BASE COUNT     1590 a   1036 c    991 g   1369 t
ORIGIN      Chromosome XV; 14 cm prox. to ADE2.
        1 gaattcatca gtgaactctt catcattcaa aaacacccaa tcatagttga acttggagtt
       61 aaatctatct tccacggatt taatagactc agccaacgaa tatagatctc tattacgcac
      121 tagagtgaca aaagtggcct tttcccgagg accggtatac tttggtagtt ccactttcga
      181 gtatttatag cccgagccgg aagatagcac ggtactgtac tttgggcttg gggcagagcc
      241 tctcgcatac tgagctccat gaaagaacac atacacagta aacacggcga ccagaagaag
      301 tcctaatttt ttgtaaacag gctgcttgct agctgggatc ataatcttcg ccattttggt
      361 tattgactct atcccttaaa aactcttctg atggagtact ttacttcgat tgcttaacga
      421 aatctttgtg aggaaaaaga tatctcttaa aattagaaag tacaatagtc tagcgtatta
      481 tactaagaat ctgcaaaaaa gaagcaagaa ggcaccacct attatagacc tttgacacga
      541 agtctctctg gagtgctttg gcctacgtgt gcggtactcg tttacatggg acaaccacgg
      601 tttttttttt ggtgttactg gaggtataca gtgcgtatat ccacttgtac gacaagagat
      661 ttacactaca ccgcgtaaag aaaacgccga caccaaatat aagtcacgtg tatgcaaagc
      721 ctattttatg ccctaatttt caagccccgg tttttacgcc ctagttttta tatttagggt
      781 ttgtcgttgc acgtgatcaa tggttcgtat tatgtgacat tgaaatgctt tttcatttta
      841 atttttttct ttgacgaaat ttcgtaatgt caagaaacac ttaaagaaaa taagtgatga
      901 ggaactcaat aaggacacta cgtagcggtg caaatacgat aggatattag cctcgaaagg
      961 gttataggga cagagagtga gcgacaattt ttagtcattc atgtcttcaa acacttcttc
     1021 tgtgatgtct tctccacgtg tcgaaaagag atcgttttct tccactttaa aatcattctt
     1081 cacaaacccc aataaaaaac ggccatcgag caagaaagtt ttcagttcaa acctctcata
     1141 cgcgaatcat ttggaggaat cagatgttga agacacactg catgttaata agagaaagag
     1201 ggtgtccggt acatcacagc atagcgacag cttaactcag aacaacaata atgcgccaat
     1261 tataatatat ggaaccgaaa acactgagag accgccgctt ttgccaattt tgcccattca
     1321 aagactgagg ttattaaggg agaagcagag ggtgagaaat atgcgtgagc ttggattaat
     1381 tcaatcaact gaatttccat ctattacatc gtcggttata ttgggctctc aaagtaaaag
     1441 cgatgaggga ggatcgtacc tatgcacatc atctactcct tcccctatta aaaacggttc
     1501 ttgcactagg cagttggccg gaaaaagcgg tgaagacacc aatgttggac tacccattct
     1561 caaatcattg aaaaatagat ccaatagaaa aaggtttcat agtcagtcaa aggggaccgt
     1621 gtggtcagca aattttgaat atgatttgtc agaatatgac gctatacaaa aaaaggataa
     1681 caaggataag gaaggtaacg ctggcggtga tcagaagaca agcgagaata gaaataatat
     1741 taagagtagt atttcaaatg gcaatctggc tacaggccct aacctgacaa gcgaaattga
     1801 agacctacgt gcagacatca actctaatag gttatcgaat cctcaaaaaa atctactttt
     1861 aaaaggacca gcttccacag ttgcaaaaac tgcccctatt caggagagct ttgttcccaa
     1921 ttcagagcgc tctggtacgc ctacgttaaa gaaaaatatt gagcccaaaa aggacaaaga
     1981 aagtattgtt ttgcccaccg taggttttga ctttatcaag gacaatgaga ctccatctaa
     2041 gaaaacttct cctaaggcaa cttcttctgc aggtgcagtc tttaaatcga gtgtagaaat
     2101 gggaaaaacc gataagtcaa cgaaaactgc cgaggcgcct accttatcat tcaattttag
     2161 ccaaaaggct aataaaacta aggctgtcga caatactgtc ccttccacaa ctttattcaa
     2221 ttttggtggt aaatcagata ccgttacttc tgccagtcaa ccttttaaat ttggaaagac
     2281 atccgaaaaa agtgaaaatc atacagaatc agacgcgcct ccaaaatcaa ctgctccaat
     2341 attttctttt ggtaaacaag aagagaatgg tgatgaaggt gatgatgaaa atgagcccaa
     2401 aagaaaaagg cgtttacctg ttagcgagga tacaaacacc aagcctttat tcgatttcgg
     2461 caagaccggt gatcaaaagg agaccaaaaa gggagagtca gaaaaggacg catcagggaa
     2521 accaagcttt gtctttggtg caagtgataa gcaagctgaa ggtacaccat tatttacatt
     2581 cggaaaaaaa gctgatgtaa caagcaatat tgactcctct gcacaattta cctttggtaa
     2641 agccgccacc gcgaaagaaa cacacaccaa accatctgag acacctgcca caatagtcaa
     2701 gaagcctact tttacttttg ggcagtcaac aagtgaaaat aagatctctg agggaagtgc
     2761 gaaacctaca ttctctttct ctaagtcaga ggaggaacgt aagagtagtc caatttcaaa
     2821 cgaagcagct aaaccctcgt tttcgtttcc gggcaagcct gttgatgttc aagcaccgac
     2881 ggatgataag actctcaagc caactttttc ttttactgaa cctgctcaaa aagattcatc
     2941 tgttgtttcg gaacctaaaa agccctcctt tacgtttgcg tcttcaaaaa cctcacaacc
     3001 aaagccattg ttttcatttg gtaagtcaga tgcagctaaa gaaccaccag gctctaacac
     3061 ctcattttct ttcactaaac ctcctgctaa tgagacagat aaaagaccta caccgccatc
     3121 tttcaccttt ggcggttcca caacaaataa tacaacaacc actagcacaa aaccatcttt
     3181 tagttttggg gctcccgagt cgatgaagtc gacagcaagt acagcggcag caaatacgga
     3241 gaagctatca aatggctttt cctttacaaa gttcaatcac aataaagaaa agtcaaactc
     3301 tccaacttct ttcttcgatg gttctgcttc ctcaacgccg attcctgtct tgggtaagcc
     3361 aacagacgct actggtaata caacatctaa atctgcattt tcattcggta ctgctaacac
     3421 caatggtacc aatgcctcag caaactccac atcattctcg tttaacgccc ctgctactgg
     3481 taacggcaca actactactt ccaatacctc aggaaccaat atagcgggta catttaacgt
     3541 aggaaaaccg gatcaaagta tcgctagtgg caatacgaat ggagcgggct cggcatttgg
     3601 cttttcgagc tcaggaacag cagcaactgg tgcagcttct aatcaatctt catttaattt
     3661 tggaaacaat ggtgcagggg gtctcaatcc ttttacatca gcaacttcgt caactaatgc
     3721 taatgctggt ttattcaata aacctccttc cacgaatgca caaaatgtca atgttccctc
     3781 tgcttttaat tttacgggaa ataattcaac gcctggtggc ggctctgtat ttaatatgaa
     3841 cggcaacact aatgctaata cggtgtttgc cggctctaat aaccaaccac atcaatcgca
     3901 aaccccatct ttcaatacaa acagctcatt cacgccatca acagttccta atattaattt
     3961 tagcggattg aatggcggaa ttactaatac cgcgaccaat gcattaaggc caagtgatat
     4021 atttggtgcg aatgctgcct ctggttccaa ttcaaacgta acaaatccat catccatttt
     4081 tgggggggca ggtggtgtgc cgacaacttc ttttgggcag ccgcagtcag cccctaatca
     4141 gatggggatg ggaacaaata atggcatgag catgggcggt ggtgttatgg cgaacagaaa
     4201 gattgcaaga atgaggcact ctaaaaggta aatatcggtt atgtaattag gtattgtgtt
     4261 gcttttctga aggatatata ttcctatatt ccttcaactt atacactgca tatgaaactt
     4321 cttgagaagt tttaaataat tggtcttttt tttaatcggg tacaggcgag ataatggata
     4381 cctgttctag ggtaggcaaa tttggaggca agagcccaaa attgttttgt tttgacctgg
     4441 tttctctcag ttaacatatt atgaatagca tcttacaaat aatttacgag cgaaactttc
     4501 gatctgcaaa gcgctccttt tttctttccg aaaaaataca gtaaaataag gtctagtaca
     4561 cagcctgatc tacaattagt gttcttttct gtttacttcg atcagctgtt tagctttgcc
     4621 acttttctag tagctccaca cgctgtgaaa gtaacgagac atagaagtca cggatggatt
     4681 tgaaaaggga ttggttacga tggaaaatta caataggtag tgggcctggc agtattgtgc
     4741 tggatttccc atcatttttg gtagggtgcg tattcacaac aatgatgggg ccaatattac
     4801 agaagcttat aggtaaactt ttggtaggtt taataacggt atgcaaattc ttagttatta
     4861 ttggaagcat tgtcttcgtt atcggagttg catcaaagaa atacacatat gacgatttca
     4921 aagtaagtat aaaaagatca ggagagccag gggaatcaca tgatatgagg actgaaccaa
     4981 aaagaa
//

GenBank-Updates@genbank.bio.net (07/13/90)

LOCUS       TMGMPCPA     1413 bp ss-RNA             VRL       13-JUL-1990
DEFINITION  Tobacco mild green mosaic virus movement and coat protein genes,
            complete cds.
ACCESSION   M34236
KEYWORDS    coat protein; movement protein.
SOURCE      Tobacco mild green mosaic virus (strain PV 228), cDNA to viral RNA.
  ORGANISM  Tobacco mild green mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; 
            Tobamovirus.
REFERENCE   1  (bases 1 to 1413)
  AUTHORS   Nejidat,A., Cellier,F., Holt,C.A., Gafny,R., Eggenberger,A.L. and
            Beachy,R.N.
  TITLE     Examination of movement function complementation between two
            tobamoviruses in a recombinant virus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Nejidat, 11-MAY-1990.
FEATURES       from  to/span     description
    pept         51      851     movement protein
    pept        857     1336     coat protein
BASE COUNT      456 a    214 c    331 g    412 t
ORIGIN      
        1 tgttaacaaa tttttgtgtg ataaattttt atttagaact ttgtttttaa atggctgtta
       61 gtctcagaga tactgtcaaa attagcgagt tcattaatct ttcgaaacag gatgagatac
      121 ttccggcatt catgactaag gtcaagagtg ttagaatatc gactgtggac aagattatgg
      181 ctgttaagaa tgatagtctt tctgatgtag atttacttaa aggtgttaag ttagttaaga
      241 atggttatgt atgcttagct ggtttggtag tgtctgggga gtggaatctc ccggataact
      301 gccgtggtgg tgtcagtgtt tgtattgtag acaagagaat gaaaagaagt aatgaggcaa
      361 cgctgggtgc ttatcacgcc cctgcttgca aaaagaattt ttcctttaag ctaatcccta
      421 attattcaat aacatccgag gatgctgaga agaacccgtg gcaagtgtta gtgaatatca
      481 aaggagtggc tatggaagaa ggatactgtc ctttatcttt ggagttcgtt tcaatttgtg
      541 tagtacataa aaataatgta aagaaaggtt tgagggaacg tattttgaga gtaacagacg
      601 actcgccaat tgaactcact gaaaaagttg ttgaggagtt cgtggatgaa gtaccaatgg
      661 ctgtgaaact cgaaaggttc cggaaaacaa aaaagggaaa aaaaagaaaa aaagaaaaaa
      721 aaaagagagt ggtaggtaat agtgttaata ataagaaaat aaataatagt ggtaagaagg
      781 gtttgaaagt tgaggaaatt gaggataatg taagtgatga cgagtctatc gcgtcatcga
      841 gtacgtttta atcaacatgc cttatacaat caactctccg agccaatttg tttacttaag
      901 ttccgcttat gcagatcctg tgcagctgat caatctgtgt acaaatgcat tgggtaacca
      961 gtttcaaacg caacaagcta ggacaacagt ccaacagcaa tttgcggatg cctggaaacc
     1021 tgtgcctagt atgacagtga gatttcctgc atcggatttc tatgtgtata gatataattc
     1081 gacgcttgat ccgttgatca cggcgttatt aaatagtttt gatactagaa atagaataat
     1141 agaggttgat aatcaacccg caccgaatac tactgaaatc gttaacgcga ctcagagggt
     1201 agacgatgcg actgtagcta taagggcttc aatcaataat ttagctaatg aactggttcg
     1261 tggaactggc atgttcaatc aagcaggctt tgagactgct agtggacttg tctggaccac
     1321 aactccggct acttagctat tgttgtgaga tttcctaaaa taaagtcgct gatactttaa
     1381 attcagggtg gctgatacca aaatcagcag tgg
//
LOCUS       CLOFTHFSA    1965 bp ds-DNA             BCT       13-JUL-1990
DEFINITION  C.thermoaceticum formyltetrahydrofolate synthetase (FTHFS) gene,
            complete cds.
ACCESSION   J02911
KEYWORDS    forymltetrahydrofolate synthetase.
SOURCE      C.thermoaceticum DNA.
  ORGANISM  Clostridium thermoaceticum
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1965)
  AUTHORS   Lovell,C.R., Przybyla,A. and Ljungdahl,L.G.
  TITLE     Primary structure of the thermostable formyltetrahydrofolate
            synthetase from Clostridium thermoaceticum
  JOURNAL   Biochemistry 29, 5687-5694 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Lovell, 13-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         56     1735     formyltetrahydrofolate synthetase (FTHFS) (ttg
                                 start codon) (EC 6.3.4.3)
    mRNA    <     1     1760     FTHFS mRNA
    binding      41       44     ribosomal binding site (put.)
    signal     1761     1785     pot. transcription termination signal
BASE COUNT      459 a    576 c    532 g    398 t
ORIGIN      
        1 aatatgtgac gtgcaaccgg atgttcactt tttcgaatga gaggagtgga ttacattgtc
       61 caaggtaccc agtgatattg agattgccca ggcagccaaa atgaaaccgg tcatggaact
      121 ggcccgggga ctgggcatcc aagaggacga ggtcgagctt tatggtaagt acaaggccaa
      181 gatctccctc gatgtctatc gtcgcctcaa agacaagcct gacgggaaac taatcctggt
      241 aaccgccatt acccctactc cggccggcga agggaaaact actaccagtg tcggtctcac
      301 cgatgccctg gctcgcctgg ggaaaagggt gatggtctgc ctgcgggagc cctccctggg
      361 acccagcttt ggtatcaaag gcggtgccgc cggcggtggt tatgcccagg tagtacccat
      421 ggaagatatc aacctgcact tcaccggcga tatccacgcc gtcacctatg cccacaacct
      481 gctggcggcc atggtggata accacctgca gcagggtaac gtcctgaata ttgatccccg
      541 taccatcacc tggcgccggg tcatcgacct taatgaccgg gctctgagga acatagtcat
      601 cggcctgggt ggcaaagcca acggcgtacc gcgggagaca gggtttgaca tctccgttgc
      661 ctcggaggtt atggcctgcc tgtgcctggc cagcgacctc atggatctca aggaacgttt
      721 cagccgcatt gttgtcggct acacctatga cggcaaaccg gtcaccgccg gcgatctgga
      781 ggcccagggt tccatggctc ttctcatgaa ggacgccatt aaacccaacc tggtccaaac
      841 cctggagaat acgccggcct ttatccacgg tggtcccttc gccaatatcg cccacggttg
      901 caacagcatt atcgcaacca agacggccct gaaactggcg gattatgtcg tgacggaagc
      961 cggtttcggt gccgacctgg gtgccgagaa gttctatgac gttaaatgcc gttatgccgg
     1021 ctttaaaccc gatgccacag tcatcgtggc taccgtccgc gccctcaaga tgcacggcgg
     1081 cgtacccaaa tcagacctgg ccactgaaaa cctggaagcc ctgcgggaag gctttgccaa
     1141 cctggagaaa cacatcgaaa atatcggcaa gttcggcgta ccggcagtcg tggccatcaa
     1201 tgccttcccc accgataccg aggccgagct aaatctcctc tacgagttgt gcgccaaagc
     1261 tggggccgaa gttgccctct cggaagtctg ggctaagggc ggcgaaggcg gtctggaact
     1321 tgcccggaag gtgttgcaga ccctggagag caggccatcc aacttccatg tcctctacaa
     1381 cctggacctg agtattaaag acaaaattgc caaaatcgcc accgagatct acggggccga
     1441 cggcgtcaac tatacggccg aagccgacaa agctatccag cgttatgaat ccctgggcta
     1501 cggcaacctg ccggtggtca tggccaagac ccaatactcc ttttccgatg acatgaccaa
     1561 gctcgggcgg ccgcggaact ttaccatcac cgtgcgcgag gtgcgcctct cggccggagg
     1621 caggcttatc gtccccatca ccggcgccat aatgaccatg cccgggctgc ccaaacgccc
     1681 ggcggcctgc aacatcgaca tcgatgccga cggcgtcatt accggtcttt tctagtatgc
     1741 ctggagtaaa gcctatccct accccactcc tccagggagt ggggtttttg ctgtctatat
     1801 gagaatgata taactaagaa gctggttttc caagccgtac ctatcatagg ggctgcgcca
     1861 ccgccatgaa ccacgaaaat gcaaaagata tcattcatgt tagaactctg ttaccgagag
     1921 gaacgattat gtctggcatg acgagataga gtataccgta gcgat
//
LOCUS       HUMGLIBSA     176 bp ds-DNA             PRI       13-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region A1.
ACCESSION   M34816
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone A1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 176)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      31       53     human zinc finger protein binding site
    binding      94      116     human zinc finger protein binding site
    binding     157      176     human zinc finger protein binding site
BASE COUNT       68 a     50 c     36 g     22 t
ORIGIN      
        1 aagaacacag gaaccaactg caagagctcc cgaagaccac ccacaatgat ggttgtatgt
       61 cagaaggaca caggaaccaa ctgcaagagc tcccaaagac cacccacaat gatggttgta
      121 tgtcagaagg acacaggaac caactgcaag agctcccaaa aaccacccac aatgat
//
LOCUS       HUMGLIBSB     166 bp ds-DNA             PRI       13-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region A2.
ACCESSION   M34817
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone A2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 166)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      27       49     human zinc finger protein binding site
    binding      90      112     human zinc finger protein binding site
BASE COUNT       60 a     47 c     37 g     22 t
ORIGIN      
        1 acacaggaac caactgcaag agctcccgaa gaccacccac aatgatggtg gtatgttgaa
       61 aggacacagg aaccaactgc aagagctccc aaagaccacc cacaatgatg gttgtgtgtc
      121 aaaaggacac aggaaccaac tgtaagagct cctgaagacc acccac
//
LOCUS       HUMGLIBSC     313 bp ds-DNA             PRI       13-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region A3.
ACCESSION   M34818
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone A3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 313)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding       1       23     human zinc finger protein binding site
    binding      64       86     human zinc finger protein binding site
    binding     127      149     human zinc finger protein binding site
    binding     190      212     human zinc finger protein binding site
    binding     253      275     human zinc finger protein binding site
BASE COUNT      110 a     79 c     75 g     49 t
ORIGIN      
        1 caaagaccac ccacaatgat ggttgtatgt cagaaggaca caggaaccaa ctgcaagagc
       61 tcccaaaaac cacccacaat gatggtggta tgtcagatgg acacaggaac caactgcaag
      121 agctcccgaa gaccacccac aatgatggtg gtatgttgaa aggacacagg aaccaactgc
      181 aagagctccc aaagaccacc cacgatgatg gttgtgtgtc aaaaggacac aggaaccaac
      241 tgtaagagct cctgaagacc acccacaatg atggttttat gtcagaagga tacaggaacc
      301 agctgcaagg gct
//
LOCUS       HUMGLIBSD     184 bp ds-DNA             PRI       13-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region A4.
ACCESSION   M34819
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone A4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 184)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      32       54     human zinc finger protein binding site
    binding      95      117     human zinc finger protein binding site
    binding     158      180     human zinc finger protein binding site
BASE COUNT       67 a     46 c     41 g     30 t
ORIGIN      
        1 aaaagacaca ggaaccaact gcaagagctc ccaaagacca cccacaatga tggttgtgtg
       61 tcaaaaggac acaggaacca actgtaagag ctcctgaaga ccacccacaa tgatggtttt
      121 atgtcagaag gatacaggaa ccagctgcaa gagctcctga agaccaccca caatgatggt
      181 tgta
//
LOCUS       HUMGLIBSE      62 bp ds-DNA             PRI       13-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region B1.
ACCESSION   M34820
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone B1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 62)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      11       33     human zinc finger protein binding site
BASE COUNT       18 a     26 c     11 g      7 t
ORIGIN      
        1 aaaccagacg cgtggaccac ccaagacgaa attcacactc gaccctagcc ccgccagccc
       61 tt
//
LOCUS       HUMGLIBSF      67 bp ds-DNA             PRI       13-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region C1.
ACCESSION   M34821
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone C1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 67)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      16       29     human zinc finger protein binding site
BASE COUNT       15 a     26 c     15 g     11 t
ORIGIN      
        1 agacccaggc acacagatag accacccagc ttcaggtggg ggaccaccct tgcatcccct
       61 ctctgct
//
LOCUS       MLVU3RA       383 bp ss-RNA             VRL       13-JUL-1990
DEFINITION  Murine leukemia virus CWM-T-15.4 U3 region genomic RNA.
ACCESSION   M34722
KEYWORDS    U3 region; promoter region.
SOURCE      Murine leukemia virus (strain CWM-T-15.4), cDNA to genomic RNA,
            from mice (strain CWB).
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 383)
  AUTHORS   Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
  TITLE     Oncogenicity and U3 region sequences of class II recombinant MuLVs
            of CWD mice
  JOURNAL   Virology 176, 166-177 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    signal      112      178     MuLV promoter region
    binding     131      144     CCAAT-binding transcription factor
    binding     147      152     Moloney leukemia virus binding protein b
    binding     156      167     enhancer core binding protein
    binding     350      362     CCAAT-binding transcription factor
    binding     363      372     glucocorticoid response element
BASE COUNT      127 a    105 c     84 g     67 t
ORIGIN      
        1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat cttgcaaggc
       61 atggaaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc
      121 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccctagc
      181 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg
      241 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc
      301 gcttattgct gcccagctct ataaaaaggg taaaaacccc acactcggcc ggcccagggc
      361 caagaacaga tggtccccag aaa
//
LOCUS       MLVU3RB       348 bp ss-RNA             VRL       13-JUL-1990
DEFINITION  Murine leukemia virus CWM-T-15 U3 region genomic RNA.
ACCESSION   M34723
KEYWORDS    U3 region; promoter region.
SOURCE      Murine leukemia virus (strain CWM-T-15), cDNA to genomic RNA, from
            mice (strain CWB).
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
  TITLE     Oncogenicity and U3 region sequences of class II recombinant MuLVs
            of CWD mice
  JOURNAL   Virology 176, 166-177 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    signal       77      177     MuLV promoter region
    binding      96      109     CCAAT-binding transcription factor
    binding     112      117     Moloney leukemia virus binding protein b
    binding     121      132     enhancer core binding protein
    binding     142      154     CCAAT-binding transcription factor
    binding     155      164     glucocorticoid response element
BASE COUNT      115 a     97 c     76 g     60 t
ORIGIN      
        1 ctgcagtaac gccatcttgc aaggcatgga aaaataccag agctgatgtt ctcagaaaaa
       61 caagaacaag gaagtacaga gaggctggaa agtaccggga ctagggccaa acaggatatc
      121 tgtggtcaaa cacctgggcc ccggcccagg gccaagaaca gatggtcccc agaaatagct
      181 aaaacaacaa cagtttcaag agacccaaaa actgtctcaa ggttccccag atgaccgggg
      241 atcaacccca agcctcattt aaactaacca atcagctcgc ttctcgcttc tgtacccgcg
      301 cttattgctg cccagctcta taaaaagggt aaaaacccca cactcggc
//
LOCUS       MLVU3RC       445 bp ss-RNA             VRL       13-JUL-1990
DEFINITION  Murine leukemia virus CWM-T-25a U3 region genomic RNA.
ACCESSION   M34724
KEYWORDS    U3 region; promoter region.
SOURCE      Murine leukemia virus (strain CWN-T-25a), cDNA to genomic RNA, from
            mice (strain CWB).
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 445)
  AUTHORS   Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
  TITLE     Oncogenicity and U3 region sequences of class II recombinant MuLVs
            of CWD mice
  JOURNAL   Virology 176, 166-177 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    signal       77      274     MuLV promoter region
    binding      96      109     CCAAT-binding transcription factor
    binding     168      179     CCAAT-binding transcription factor
    binding     238      252     CCAAT-binding transcription factor
    binding     138      143     Moloney leukemia virus binding protein b
    binding     209      214     Moloney leukemia virus binding protein b
    binding     147      158     enhancer core binding protein
    binding     218      229     enhancer core binding protein
    binding     180      189     glucocorticoid response element
    binding     252      263     glucocorticoid response element
BASE COUNT      142 a    126 c    104 g     73 t
ORIGIN      
        1 ctgcagtaac gccattttgc aaggcatgaa aaagtaccag agctgaagtt ctcagaaaaa
       61 caagaacaag gaagtacaga gaggctaaaa agtacccggc ccagggccaa gaacagatgg
      121 tcccagaccg ctaacgacag gatatctgtg gttaagcact agggccccgc ccagggccaa
      181 gaacagatgg tccccagacc gctaacgaca ggatatctgt ggttaagcac tagggccccg
      241 gcccagggcc aagaacagat ggtccccaga aatagctaaa acaacaacag tttcaagaga
      301 cgcagaaact gtctcaaggt tccccagatg accggggatc aaccccaagc ctcatttaaa
      361 ctaaccaatc agctcgcttc tcgcttctgt acccgcgctt attgctgccc agctctataa
      421 aaagggtaag aaccccacac tcggc
//
LOCUS       MLVU3RD       409 bp ss-RNA             VRL       13-JUL-1990
DEFINITION  Murine leukemia virus CWM-T-25b U3 region genomic RNA.
ACCESSION   M34725
KEYWORDS    U3 region; promoter region.
SOURCE      Murine leukemia virus (strain CWN-T-25b), cDNA to genomic RNA, from
            mice (strain CWB).
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 409)
  AUTHORS   Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
  TITLE     Oncogenicity and U3 region sequences of class II recombinant MuLVs
            of CWD mice
  JOURNAL   Virology 176, 166-177 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    signal      112      238     MuLV promoter region
    binding     131      144     CCAAT-binding transcription factor
    binding     174      179     Moloney leukemia virus binding protein b
    binding     183      194     enhancer core binding protein
    binding     204      215     CCAAT-binding transcription factor
    binding     216      225     glucocorticoid response element
BASE COUNT      133 a    114 c     91 g     71 t
ORIGIN      
        1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc
       61 atgggaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc
      121 taaaaggtac ccggcccagg gccaagaaca gatggtcccc agaccgctaa cgacaggata
      181 tctgtggtta agcactaggg ccccggccag ggccaagaac agatggtccc cagaaatagc
      241 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg
      301 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc
      361 gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggc
//
LOCUS       TTHISOR      6434 bp ds-DNA             BCT       13-JUL-1990
DEFINITION  T.thermophilus insertion sequences Is1000A and Is1000B.
ACCESSION   M33159
KEYWORDS    insertion sequence.
SOURCE      T.thermophilus (strain HB8, ATCC 27634), clones pNZ1557 and 1559.
  ORGANISM  Thermus thermophilus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 6434)
  AUTHORS   Ashby,M.K. and Bergquist,P.L.
  TITLE     Cloning and sequence of IS1000 from Thermus thermophilus HB8
  JOURNAL   Plasmid (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.K.Ashby, 22-MAR-1990.
FEATURES       from  to/span     description
    pept        666      106 (c) ORFA
    pept    <   706     1347     ORFB
    pept    <  1251     1961     ORFC (AA at 1251)
    pept    <  2007     3026     ORFD (AA at 2007)
    pept       3711     4664     ORF 1A
    pept       5173     6126     ORF 1B
    trns       3516     4711     IS1000A
    trns       4978     6173     IS1000B
BASE COUNT     1060 a   2367 c   2037 g    970 t
ORIGIN      
        1 ctcgccgccc tcctcctcgg cgggggccaa gcccacctgc tcttcgccct cgccctcagc
       61 ctcggcctcc tttggccccc ccggtgagct cggccaggac cgccttcaag gccctttcca
      121 ggcctcgagg ctccccgtgt tctccaagac ccaggtcgcc cgcttgcgct tctcctcctc
      181 gggcatctgg gcccgctccc gggccaggac ctcctcccgg gaaagccccg agcgcgccat
      241 cacccgcctc acccgctcct ccaggggggc cgccaccaag agggtcccgt ggagcctccc
      301 ctcccacccc ttctcaaaga ggagggggat ctccagaaag acgagggggg cctctaggcg
      361 ggaaagctcc tccataagaa gccgcctgac ctccgggtgg accacggcct cgagggcctt
      421 gagcctctcc gggtcggaaa agacgagccg ggcgagggcc ctccggtcca gcctcccccc
      481 caccaccgcc tcggggaaga ggcgcttcag ctcctcctcc ttgttctccc gggcccgggc
      541 ggcgagcgcg tccaggtcca agacggggta gccccaggac ctgaggaggg cggccacggt
      601 gctcttgccg ctgccgatgt tcccggtgat gccgataata atggggtgct tcgcctcgtg
      661 gcccatatgg acttcccctt ttataccccc aaggacgcct tccccgtggg cggggcggtg
      721 cgggacctcc tcctgggaag aaggccaacg gacctggact acgccgcctt ggaccccgaa
      781 ggggcggccg aggaggcaaa gaggcgcctc gggggaagcc tcttccccct ggaccccaag
      841 cggggccact accgcctggt ggtgggggaa aggaccctgg acttcacccc cttggagggg
      901 aggctggagg aggacctcct ccggcgggac taccgggtga acgccctcct ctggaagggg
      961 ggcgccgtct tcggcctaaa gggggtggag gaggacctga ggcggcgcct cctcgtcccc
     1021 gtgcgggagg aaaacctcta ccaggaccac ctgcgaagcc tccggggggt ccgcctcgcc
     1081 gccaccttgg gcttcgggct tccccagagg acccgggagg cccttggccg ccacgcccgg
     1141 ttcctccagg cccaccccga ggccctcccc gcgcgggaaa gggtcaagga ggagcttgcc
     1201 cggctcctcc tctccccgag ggcggcttcg gcctgcgcct tctggagagg gtggggctcc
     1261 ttggggtcta cctccccgag ctcgccctcc tcgtgggcct ccatcagggc ggggtgcacc
     1321 accttcccgc ctgggaccac accctgagcg cggtcttcca cctcctctgg ctctggcccg
     1381 aggcccccct cgaggcccgc ctcgccgccc tcttccacga cgtgggcaag cccctcaccc
     1441 gccgctttga ccccgaggtg ggccgcttcc gcttcctggg gcacgccgag gtgggggcgg
     1501 agatcgcccg ggcaagcctc ttttggctcc gcttccccaa ggaggtggtg gagcgggcgg
     1561 cgggcctggt gcgccgccac atggaccgcc tccccgagga gaggaaggcc ctgcgccgct
     1621 tcttcctgag gcgccaggac ctcctcccgg acctcgtcta cctcatggcg gcggaccgcc
     1681 tggccacgag gggcgtggag cgcgaggcct gggaggtgct cgggcgctac gaagaggtcc
     1741 tgaaagaccc cctgccgcaa aggcccctcc tctccgggga ggaggtgatg gcccttctgg
     1801 gcctccaaga aggccccgag gtgggaaggg ccctaaaggc cctcctcgag gcccaggccg
     1861 aaggccgggt ggggaccaag gaggaggccc gggcctttct cctatattgg aggggtggaa
     1921 gggaggctca ggcttcggga accccagatc acccccattg aggggggctt cctggtctcc
     1981 gaccctacgg ggtctacgag aagcccctgg ccctcaccga gggcgggctt ttcctcctct
     2041 ccctcatgga ggggaggacc ctggaggagg tgcaggagga ggtgttcaag cgccacgggg
     2101 tcctggtgcc gaagaaggag ctggaagacc tggcgaaggc cctggaggag gcgggcctcc
     2161 tcctcacgga gaaggtggag gcgaggctca aagaggagga ggaaaagctc aagcgggagc
     2221 ggcccatgcg cctcgccggg ctctcctatc ccgagggcga gcgggaggcc cgggccttcc
     2281 tcgaggcctt ccgggcgagc tacccggggg aaggcgagga ggcccgggtc ctcctcatgc
     2341 cccacctgga gccctcccgc gtccccgagg tctacggggc ggccctcgcc gccctggaga
     2401 agaccccgcc ccctgagcgc atctacctgg tgggggtcgc ccacaggccg ctgaaggaga
     2461 aggccgccgc ccttcccgtg cccttccaga cccccttcgg ccccgccctg ccggaccttc
     2521 ccgccctcca ggccttagac gccctccttc cctttgagct cttcaacacg cccctcgcct
     2581 tccgggagga gcacagcctg gagcttcccc tcttcttcct gaaggggcgg ttccccgagg
     2641 cccgcgtcct ccccctcctc gtggccaggc gaagcccgga gctcggggag gccctgaagg
     2701 tggtcctgcg ggacttcccg ggccttctcg tcctcgccgt ggacctctcc cacgtggggc
     2761 cccgcttcgg ggacaccccc ctcacccgca ccctggcgga ggaggcgagg aggcgggacc
     2821 tgggcttcct ggaaaggctc gcggaagggg agccggaggc cgccctcgcc ttcctggggg
     2881 ccaaccccac ccgcatagac ggggtggagg tggtggcaag cctcctcccc ctcctccggg
     2941 aaaggaaggg gaaggtcctg gcccaccgcc tggacctcga ggcccccacc ctaagcgccg
     3001 tgggggcggg caccctggtc ctctaaagag ccccagattt ggaacgacgc gaaggcccgg
     3061 gcgccgaacc gcccgggccc cttttctggg agcgcccctt tgggaaaggg cccggaaggt
     3121 gggtgggcag gcgcgccggg gcctgcgtga aagcctccta gcgcgagggg tacaggcctt
     3181 cacccacgac aaaggggtcg cttgatgggc aagacagtaa tccccttacg gggctcaatc
     3241 ccttgcaacc cttaggagct acctttttgc gcttcccgtg acgagcgtcg caatcccctt
     3301 acggggctca atcccttgca acttccttct aaggaggaat gggcatcatt cacaatcagc
     3361 aagtcgcaat ccccttacgg ggctcaatcc cttgcaacgg attcgcctcg gcccgcctag
     3421 ggactggatc tatgaccccg tcgcaatccc cttacggggc tcaatccctt gcaaccattg
     3481 gtggaggcgc gtgataagcg agtggaggtg aggtggtcgc atgtatcttg ccccctgagg
     3541 gaggccccac cgccccggtt gccgggccac cacgcccgcg ggattgatcg gtagccaccc
     3601 accccggcct ccccgccgca cagtatcccg gtcgtcaaag accgcatcca catggaaggc
     3661 ccacagggtc gggtggccac ctccctcagg ttaccaaaag gagagagccc atgaccttcg
     3721 ccggcattga cgtcagcaaa acccacctgg acctggccct cgtctccaac tcccccaaac
     3781 ccacccgcct ccgcttcccc aactcccctg aaggccgtca agccctcctc gccgccctcg
     3841 cccaccacaa ccccgcctgg gtcgccctgg agcccacggg cgcttaccac ctccccctcc
     3901 tcaagctcct ggcagaaaac cgcctccagg tggccctggt caacccctac catctcgccg
     3961 ccttccgcaa ggccaaggga gaacgccaga agaccgaccg ccaagacgcc ctcctcctcg
     4021 cccgctacgc ccaggtctac cacggggagc tctgggccta caccctcccc ccagaagccc
     4081 tccgtgagct caaagccctg gtgggctacc gggaggactt ggccgggcgg gaaagggcca
     4141 tcctcaacca gatggaggcg gcggagtggg cggggagcaa ggaggtcctc gccctcctcc
     4201 aaaaggagct ggcctgcgtg aaggggcttc tcggggaggt ggaggccagg atccaggccc
     4261 tcctcgccac cctccccgag gccgaggtcc tgatggccct gcccggggtg gggccccagg
     4321 tggcggcagc ggtgctggcc ctcctgcccc cagagctctg gggccgggcg aagagggcgg
     4381 cctcctacgc ggggctcatc cccgagcggg aggagtcggg aaagagcgtg gagaggagtc
     4441 ggctctccaa aaaagggcct cccctcctgc ggcgaaagct ctacatgggc gccctggtgg
     4501 cggtgcgcca tgacccggag atgcgggcct tctaccaccg cctgctctcg cggggaaaga
     4561 gaaagaagca ggcgttggtg gccgtggccc acaagctcct caggcggatg atgggaaggc
     4621 tcagggagta ctacgcgacc cagctagatc aaggggtcgc ttgacaggca agacagtatc
     4681 aatcccctta cggggctcaa tcccttgcaa ctccgacggc aagatcatcg gggtgcggcg
     4741 gctcctcgcc gtcgcaatcc ccttacgggg ctcaatccct tgcaacgcca gtctatgacc
     4801 aagcccctag gctccttcac tttgaagtgt cgcaatcccc ttacggggct caatcccttg
     4861 caacacctca tgggggagta agataggcgg gaaaggaggg aggtcgcaat ccccttacgg
     4921 ggctcaatcc cttgcaacca cgtgaagacg tccctggcct cggtcgtgag ggacgtggtc
     4981 gcatgtatct tgccccctga gggaggcccc accgccccgg ttgccgggcc accacgcccg
     5041 cgggattgat cggtagccac ccaccccggc ctccccgccg cacagtatcc cggtcgtcaa
     5101 agaccgcatc cacatggaag gcccacaggg tcgggtggcc acctccctca ggttaccaaa
     5161 aggagagagc ccatgacctt cgccggcatt gacgtcagca aaacccacct ggacctggcc
     5221 ctcgtctcca actcccccaa acccacccgc ctccgcttcc ccaactcccc tgaaggccgt
     5281 caagccctcc tcgccgccct cgcccaccac aaccccgcct gggtcgccct ggagcccacc
     5341 agcgcctacc acctccccct cctcaagctc ctggcagaaa accgcctcca ggtggccctg
     5401 gtcaacccct accacctcgc cgccttccgc aaggccaagg gagaacgcca gaagaccgac
     5461 cgccaagacg ccctcctcct cgcccgctac gcccaggtct accacgaaga cctccgggcc
     5521 tacaccctac ccccagaaac cctccgggag ctcaaagccc tggtgggcta ccgggaggac
     5581 ttggccgggc gggaaaggac catcctcaac cagatggagg cggcggagtg ggcggggagc
     5641 aaggaggtcc tcgccctcct ccagaaggag ctggcctgcg tgaaggggct tctcggggag
     5701 gtggaggcca ggatccaggc cctcctcgcc accctccccg aggccgaggt cctgatggcc
     5761 ctgcccgggg tggggcccca ggtggcggca gcggtgctgg ccctcctgcc cccagagctc
     5821 tggggccggg cgaagagggc ggcctcctac gcggggctca tccccgagcg ggaggagtcg
     5881 ggaaagagcg tggagaggag tcggctctcc aaaaaagggc ctcccctcct gcggcgaaag
     5941 ctctacatgg gcgccctggt ggcggtgcgc catgacccgg agatgcgggc cttctaccac
     6001 cgcctgctct cgcggggaaa gagaaagaag caggcgttgg tggccgtggc ccacaagctc
     6061 ctcaggcgga tgatgggaag gctcagggag tactacgcaa cccagctaga tcaaggggtc
     6121 gcttgacagg caagacagta tcaatcccct tacggggctc aatcccttgc aacgaacctc
     6181 ctcgggccca ccgagggggt aatcctcgca gagtcgcaat ccccttacgg ggctcaatcc
     6241 cttgcaacct gaaccagaag gccctctggg cggcggcgta cgccgtggcg tcgcaatccc
     6301 cttacggggc tcaatccctt gcaacgcgat tggcgtcgcc ctccccgtta tcgcttacga
     6361 gtcgcatgta tcttgccccc tgagggaggc cccaccgccc cggttgccgg gccaccacgc
     6421 ccgcgggatt gatc
//
LOCUS       MLUDNAA      4171 bp ds-DNA             BCT       13-JUL-1990
DEFINITION  M.luteus ribonuclease P (rnpA), 50S ribosomal subunit protein L34
            (rpmH), DNA biosynthesis initiation protein (dnaA), and DNA
            polymerase III beta subunit (dnaN) genes, complete cds.
ACCESSION   M34006
KEYWORDS    50S ribosomal subunit protein L34;
            DNA biosynthesis initiation protein;
            DNA polymerase III beta subunit; ribonuclease P.
SOURCE      M.luteus (strain IF03333) DNA.
  ORGANISM  Micrococcus luteus
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Micrococcaceae.
REFERENCE   1  (bases 1 to 4171)
  AUTHORS   Fujita,M.Q., Yoshikawa,H. and Ogasawara,N.
  TITLE     Structure of the dnaA region of Micrococcus luteus: Conservation
            and variations among eubacteria
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by N.Ogasawara, 05-MAY-1990.
FEATURES       from  to/span     description
    pept        435       37 (c) ribonuclease P (rnpA)
    pept        601      464 (c) 50S ribosomal subunit protein L34 (rpmH)
    pept        944     2491     DNA biosynthesis initiation protein (dnaA)
    pept       3242  >  4171     DNA polymerase III beta subunit (dnaN)
    signal      843      848     dnaA -35 signal
    signal      866      871     dnaA -10 signal
BASE COUNT      672 a   1558 c   1361 g    580 t
ORIGIN      
        1 ctcgagcacg acgaagggcg agggcgtgac ggtcatctac gcgtgctccg aggcggcggg
       61 caggtgcggc ttcagggcct tgccgagcgc gccgacggtc tcgcggcgca gcagcgcgta
      121 gtccgcctcg gcggcggcag gcagcgcgcg gacctgcacc agaaccggca ggtcgcgcag
      181 cgggggcagg cgcatctgct ccgcgacgac ggcgcgcagc cgccgcttca cccggttgcg
      241 ggtcaccgcg ttccccacgg ccttggacac gacgaacccg gcccgcggcc gaggggcgct
      301 cggcgacgtg gaccgggtct gatcggggtc cgtggcgacg ctcaccacga cggtgcgccg
      361 acccgcgcgg gtgccggtgc gaccgaggtg acggaactcc gcgggcgtgc ggacgcgacg
      421 gtcacggggc agcacggggc gcgccgtccc tggaccgagg tcctcaggcg gacagctcgg
      481 cgcggttctt gccgcggcga gcggacagga tggcgcggcc ggcgcgggtg cgcatgcggg
      541 cacggaagcc gtgcttgcgg gcacgacggc ggttgttggg ctggaacgtg cgcttggtca
      601 ctgtgatctc tccacgacgt gttgtgtgcc ttcgaccgtc gacctgagca ggtggaccag
      661 ggggaaggac cggtgaggct ctgacatgaa atccccggcg ggcacgatca accgacaccc
      721 gggggaccag acaactctag tggccggggt acgcaggggt caaacgggtg cgcgcggatc
      781 ccggacgccc gtgggcgccc gggacggtcg tccacaggat gataggggtt gtccaccgtc
      841 tgtgcacaac cctgtggaca accgttagac tccgagcacc gtcgtccgtc tcgcgcagcc
      901 gcccccgtcc ccggggacgt ctgccccagg aaaggaagcc tccgtggtgg cagaccaggc
      961 cgtgctcagc tcgtggcgct ccgtcgtggg ctccctcgag gacgacgcgc gggtcagcgc
     1021 ccgcctcatg ggcttcgtct acctggccca gccgcagggc ctcatcggca atacgctcct
     1081 gctggccgtg ccgaacgaga ccacccgcga gacgctccag ggcacccagg tggccgacgc
     1141 cctcacggac gccctgaccc aggagttccg cgaggagatc ctgctggcca tctcgatcga
     1201 cgcgaacctg cagccgccgc ggaccccctc ctccgaggcc cgccgctcct ccctcgccgg
     1261 cgggccgtcc ggcgccgcgg cgccggacgt cgagctgccc ccggccgcga ccgccgccac
     1321 ctcccgccgc gccgtcgccg aggagctgcc gggtttccgc atcgagccgc cggccgacgt
     1381 cgtgcccgcc gcgaacgcgg ccccgaacgg gaacggcaag ccgacccccg cgccgccgtc
     1441 gacgtccgcg gagaccagcc gcctcaacga ccgctaccac ttcgagacgt tcgtgatcgg
     1501 ctcgtccaac cggttcgcgc atgcggccgc gaacgcggtg gccgaggcgc cggcgaaggc
     1561 gtacaacccg ctgttcatct acggcgagtc cggcctgggc aagacgcacc tgctgcacgc
     1621 gatcgggcac tacgcccgcc gcctctaccc gggcttgcgg gtgcggtacg tgaactcgga
     1681 ggagttcacc aacgacttca tcaactccat ccgccacgac gagggcgcct cgttcaagca
     1741 ggtctaccgc aacgtggaca tcctgctgat cgacgacatc cagttcctgg cggacaagga
     1801 ggcgacggtc gaggagttct tccacacctt caacacgctc tacaacaaca acaagcaggt
     1861 ggtcatcacc tcggacctgc cgcccaagca gctctccggg ttcgaggacc ggctgcgctc
     1921 ccgcttcgag tgggggctga tcacggacat ccagccgccg gacctcgaga cgcgcatcgc
     1981 gatcctccgg aagaaggcgg aggccgaggg gctcgtggcc ccgccggagg cgctggagta
     2041 catcgcctcg cgcatctcca cgaacatccg cgagctcgag ggtgcgctga tccgcgtgac
     2101 cgcgttcgcc tcgctcaacc ggcagaccgt ggacatcgag ctggccgagc atgtgctcaa
     2161 ggacctgatc acggacgaga cggcgcacga gatcacgccg gagctgatcc tgcacgccac
     2221 gggggagtac ttcaatctca ccctcgagga gctgaccagc aagtcccgca cccgcacgct
     2281 ggtgacggcg cggcagatcg ccatgtacct gctgcgcgag ctgaccgaga tgtcgctgcc
     2341 caagatcgga caggtcctgg gcggccgcga ccacaccacc gtcatccacg cggaccgcaa
     2401 gatccgcgag ctgatggccg agcggcggac gatctacaac caggtcaccg agctcaccaa
     2461 cgagatcaag cggaaacagc gcggcgcctg agccacccgc cgtcccaccc tcgagtcccg
     2521 gttcgaacct tcgaccggga ctcgagcctt cgtccacatt gtccaccggt gtggacaact
     2581 ctgtggagaa cccccgtcgc tgtggacaac ctccgcccct gtggatggcg gaatcaggcc
     2641 gctcacgccg cgtccacacc cctgggggcg acgacggtgc acaggctggt gaatcacacg
     2701 gatgccattc ggcggtccac acgtatccac agatctccac agccgaaaac cgcatgatct
     2761 cgcgggttcc gacgggtggg gacgcaccct gtggacagga atgtgggcaa ctggggggtg
     2821 gatccgggga cgagtctgga caactccggt cgaacctgtg cagagcaggg ggaggacctc
     2881 cggggctcgt ccacaccgga tgcacacccc gaacgagcgg gcgtccacac ggttccccca
     2941 accggatccg catgctgacg cggcagaacg gcgttctcca caggatccac acccgctacc
     3001 actccttctg atcccccttc agagattccg ccaagcaaca tcatctccgc cgcctcgcac
     3061 cgagctgagg gccccgagag cccgggctgc gcccgagcac tcagaatcct tggaatcaca
     3121 cgcttgtcat tcgcttccga cctgacaccc gccgcgcgga ccgcctatgc tgaggcccgc
     3181 gtgtcgccgt tcccccgggc ccgcaacccc cgtcatcacc cccagagagg cagtgcagct
     3241 cgtgaagttc accgtcgaac gcgacatcct caccgacgcc gtctcctggg ccgcccgctc
     3301 cctgtccccg cgcccgccgg tgccggtgct ctcgggtctg ctgatcacgg ccgaggccgg
     3361 cgtcgtgtcc atcgcgagct tcgactacga gacctccgcg cgcctggaga tcgaggcgga
     3421 cgtcgagacc gcgggccagg tgctcgtctc cggccgtctg ctgaacgaca tcgtccgctc
     3481 cctgccgcag gcccaggtca cggtggagct cgacggcggc aaggtgctcg tcacctgccg
     3541 atcctcccgc ttctccctgg ccacgatgcc cgtgggcgac tacccggccc tgcccgagct
     3601 gccggccccc gcaggcaccg tggacggggc cgcgttcgcc cacgccgtcg cccaggtcac
     3661 cgtcgccgcc tccaaggacg acaccctgcc gatcctcacc gccgtgaagg tggagatcga
     3721 gggcgacacc ataaccttcc tggccacgga ccgctaccgg ctggccatga aggagatccg
     3781 ctggaccccg gcggacccgt ccatctcgac gtcgctgctc atcaaggcgc gcacgctcac
     3841 cgaggtcgcc aagtccctcg gctccggcgg cgacctggag atcctcctcg gtcagaccgc
     3901 cgacctcgtg ggcttcgcct ccggtggccg ccgcaccacc tccgtgctcg tggacggcga
     3961 gtaccccaag atccgctcgc tcttccccga gtccagcccc atccaggccg tggtggacac
     4021 cgccgccctg gtcgaggcgt cccgccgcgt ggccctcgtg gccgagcgca acaccgccct
     4081 gcgcatggtg ttcacggagg gccaggtcac cctggacgcc ggcaccggcg atgacgcgag
     4141 cgccaacgag tccgtgccgt gcacgctcga g
//
LOCUS       CE1CG        6646 bp ds-DNA   circular  BCT       13-JUL-1990
DEFINITION  Plasmid ColE1, complete genome.
ACCESSION   J01566 M33100
KEYWORDS    cea gene; colicin; colicin E1; colicin E1 immunity;
            complete genome; kil gene; origin of replication.
SOURCE      Plasmid ColE1 DNA.
  ORGANISM  Plasmid Colicin E1
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1008 to 1370)
  AUTHORS   Tomizawa,J.-I., Ohmori,H. and Bird,R.E.
  TITLE     Origin of replication of colicin E1 plasmid DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 74, 1865-1869 (1977)
  STANDARD  full staff_review
REFERENCE   2  (bases 1410 to 1599)
  AUTHORS   Bastia,D.
  TITLE     Determination of restriction sites and the nucleotide sequence
            surrounding the relaxation site of ColE1
  JOURNAL   J. Mol. Biol. 124, 601-639 (1978)
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 1613; 6577 to 6646)
  AUTHORS   Oka,A., Nomura,N., Morita,M., Sugisaki,H., Sugimoto,K. and
            Takanami,M.
  TITLE     Nucleotide sequence of small ColE1 derivatives: Structure of the
            regions essential for autonomous replication and colicin E1
            immunity
  JOURNAL   Mol. Gen. Genet. 172, 151-159 (1979)
  STANDARD  full staff_review
REFERENCE   4  (bases 623 to 832)
  AUTHORS   Morita,M. and Oka,A.
  TITLE     The structure of a transcriptional unit on colicin E1 plasmid
  JOURNAL   Eur. J. Biochem. 97, 435-443 (1979)
  STANDARD  full staff_review
REFERENCE   5  (bases 1 to 192; 384 to 519; 6644 to 6646)
  AUTHORS   Patient,R.K.
  TITLE     Characterization of in vitro transcription initiation and
            termination sites in Col E1 DNA
  JOURNAL   Nucleic Acids Res. 6, 2647-2665 (1979)
  STANDARD  full staff_review
REFERENCE   6  (bases 4783 to 5353)
  AUTHORS   Ebina,Y., Kishi,F., Miki,T., Kagamiyama,H., Nakazawa,T. and
            Nakazawa,A.
  TITLE     The nucleotide sequence surrounding the promoter region of colicin
            E1 gene
  JOURNAL   Gene 15, 119-126 (1981)
  STANDARD  full staff_review
REFERENCE   7  (bases 1 to 6646)
  AUTHORS   Chan,P.T., Ohmori,H., Tomizawa,J.-I. and Lebowitz,J.
  TITLE     Nucleotide sequence and gene organization of ColE1 DNA
  JOURNAL   J. Biol. Chem. 260, 8925-8935 (1985)
  STANDARD  full staff_review
REFERENCE   8  (bases 57 to 123)
  AUTHORS   Zhiang,S., Yan,L. and Zubay,G.
  TITLE     Regulation of gene expression in plasmid ColE1: Delayed expression
            of the kil gene
  JOURNAL   J. Bacteriol. 170, 5460-5467 (1988)
  STANDARD  simple staff_entry
REFERENCE   9  (bases 639 to 753)
  AUTHORS   Tomizawa,J.-I.
  TITLE     Control of ColE1 plasmid replication: The process of binding of RNA
            I to the primer transcript
  JOURNAL   Cell 38, 861-870 (1984)
  STANDARD  full staff_review
REFERENCE   10 (bases 5046 to 5110)
  AUTHORS   Parker,R.C.
  TITLE     Mitomycin C-induced bidirectional transcription from the colicin E1
            promoter region in plasmid ColE1
  JOURNAL   Biochim. Biophys. Acta 868, 39-44 (1986)
  STANDARD  full staff_entry
COMMENT     The L (light) strand is shown.  This sequence was kindly supplied
            over computer network by J.Lebowitz (04-APR-1985).  The open
            reading frames noted by [7] are named in FEATURES by the molecular
            mass of their predicted products or by the names given by [7].
FEATURES       from  to/span     description
    pept        400       59 (c) colicin E1 immunity protein (imm)
    pept        448      585     kil protein
    pept       1609     1746     mob1 protein (gtg start codon)
    pept       1803     1612 (c) RNA I inhibition modulator protein (rom; gtg
                                 start codon)
    pept       4362     3943 (c) entry exclusion protein 2 (exc2)
    pept       4862     4443 (c) entry exclusion protein 1 (exc1)
    pept       5140       62     colicin E1 protein (cea)
    RNA         641  >   641     II RNA
    RNA         751      645 (c) I RNA ((3' end +/- 1 bp) [4]
    mRNA       1449  >  3936     mob mRNA (5' end putative) [5],[7]
    mRNA       5056  >   585     colE1-kil mRNA [7]
    mRNA        409  <    59 (c) imm mRNA [5],[7]
    mRNA       4887  <  3943 (c) exc mRNA [5],[7]
    binding    5063     5093     LexA binding site [7]
    ORF         605      895     10.2 kd ORF [7] (ttg start codon)
    ORF        1241      870 (c) 13.8 kd ORF [7] (gtg start codon)
    ORF        1189     1350     6.0 kd ORF [7]
    ORF        1408     1238 (c) 6.4 kd ORF [7]
    ORF        1347     1526     6.9 kd ORF [7] (gtg start codon)
    ORF        1709     1500 (c) 7.6 kd ORF [7]
    ORF        1609     1746     mob1 ORF (gtg start codon)
    ORF        1843     2190     mob2 ORF
    ORF        2180     3733     mob3 ORF
    ORF        2214     2399     mob4 ORF (gtg start codon)
    ORF        2868     3386     mob6 ORF
    ORF        3393     3626     mob7 ORF (gtg start codon)
    ORF        3657     3818     mob8 ORF
    ORF        3766     3936     mob9 ORF (gtg start codon)
    ORF        4258     4647     14.5 kd ORF [7]
    ORF        5435     5653     8.8 kd ORF [7]
    ORF        5852     6013     6.7 kd ORF [7]
    conflict     83       84     aa in [3],[7]; a in [5]
    conflict     99      100     aa in [3],[7]; a in [5]
    conflict    135      135     a in [3],[7]; g in [5]
    conflict    144      144     a in [3],[7]; g in [5]
    conflict    174      174     a in [3],[7]; g in [5]
    conflict    484      485     aa in [3],[7]; a in [5]
    conflict    491      492     tt in [3],[7]; t in [5]
    conflict    506      512     ctaacta in [3],[7]; caca in [5]
    conflict    517      519     cct in [3],[7]; cgg in [5]
    conflict    528      528     c in [7]; g in [3]
    conflict    652      653     aa in [3],[7]; aaa in [4]
    conflict   1111     1111     g in [7]; t in [1],[3]
    orgrpl     1197     1197     replication origin (+/- 1 bp) [1]
    conflict   1366     1366     a in [3],[7]; g in [1]
    conflict   1416     1417     cc in [3],[7]; c in [2]
    conflict   1454     1454     c in [3],[7]; t in [2]
    cutss      1466     1465 (c) relaxation (nic) cut site [2]
    conflict   1581     1582     aa in [3],[7]; a in [2]
    conflict   4784     4787     ctcc in [7]; cgc in [6]
BASE COUNT     1760 a   1421 c   1798 g   1667 t
ORIGIN      2 bp downstream of EcoRI site.
        1 ttctatgctc ctatattgat aagaataaac ttaatactat aaatgaggtg ttagggattt
       61 aattattctt tattgatata aaaagtccta gcaatccaaa tgggattgct aggaccaaac
      121 aaagtagatt atatagcata aataggttta attttgctac gggggcgtta tttaggtttt
      181 ttcttctttc gaaaaaatct ttctttatga agttaaaagc tatgtattca atagcatatt
      241 ttgaatatgg acatagaata gtgcttatca ctattgcata tagcatctta tctgacacaa
      301 ggaaataata cccttcgctg ttttttgtta taaggtatat atatataagt gtgcagtaca
      361 ggccaaataa aatatttttt atgtagtatc ttaagctcat aaattaaacc tcgccatata
      421 ttcttttcat tttataagga tcgagttatg aggaaaagat tttttgtggg aatattcgcg
      481 ataaacctcc ttgttggatg tcaggctaac tatatacctg atgttcaggg agggaccatc
      541 gcaccatcct cctcttctaa actgacgggg atcgcggttc agtagaaaag attaaaggat
      601 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc
      661 taccaacggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg
      721 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag tcgggccact
      781 acttcaagaa ctctgtagca ccgtttgtgc catcatcgct ctgctaatcc ggttaccagt
      841 ggctgctgcc agtggcgtta aggcgtgcct taccgggttg gactcaagac gatagttacc
      901 ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg
      961 aacgacctac accgaactga gataccaaca gcgtgagcta tgagaaagcg ccacgcttcc
     1021 cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac
     1081 gagggagctt ccagggggaa acgcctggta gctttatagt cctgtcgggt ttcgccacct
     1141 ctgacttgag cgtctatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc
     1201 ctgctacgtg gccttcttcc tgttcctggt cttttgctca catgttcttt ccggccttat
     1261 cccctgattc tgtggataac tgtgttaccg tttttgtgtg agtcagtacc gctcgccgca
     1321 gtcgaacgac cgagcgtagc gagtcagtga gcgaggaagc ggaaaagcgc ctggacgtgc
     1381 attttctcct tacgcatctg tgcggcattt cacacccggc atggcgtact tttcatacaa
     1441 tccgcactga tgccgcatgg ttaagccagt atacactccg ctatcgctac gtgactgggt
     1501 cagggctgcg ccccgacacc cgctaaaacc tgctgacgcg ccctgacggg cttgtcagct
     1561 cccggcatcc gctcacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt
     1621 ttcaccgtca tccccgaaac gtgcgaggca gctgcggtaa agctcatcgg cgtggtcgtg
     1681 aagcgattca caaatatcgg cctgttcatc tgcgtccagt tcgttgagct tctccagcag
     1741 cgttaatgtc tggcttctga taaagcgggc catgttaagg gcggtttttt cctgtttagt
     1801 cactgatgcc tccgtgtaag ggggatttct gttcatgggg taatgatacc gatgaaacgc
     1861 gagaggatgc tcacaatacg ggttactgat gatgaacatg cccggttact tgaacgctgt
     1921 gagggtaaac aactggcggt atggatgcgg cgggtctgcc tgggggagcc ggttgcccgt
     1981 tccggaaaac tgccgacact ggcaccgccg ttactgcgtc agctggccgc catcggaaat
     2041 aacctgaatc agacagcccg taaggtgaac agcgggcagt ggtcttccgg tgaccgggtt
     2101 caggtggtgg ccgcactgat ggccatcggg gatgagctgc gccggctgcg tctggctgtc
     2161 agggaacagg gggcgcggga tgatagttaa atttcatgcc aggggaaaag gtggtggcag
     2221 tggtccggtt gattacctgc tggggaggga gcgtaaccgc gaaggcgcaa cggtgcttca
     2281 gggtataccg gaagaagtcc gggaactcat cgatgccacg ccatttgcga agaaatacac
     2341 gtccggtgtt ctgtcgttcg cggagaagga gctgccgccg ggaggacgtg aaaaagtgat
     2401 ggcgagcttt gagcgtgtac tgatgcccgg tctcgaaaaa aatcagtaca gcatcctgtg
     2461 ggtggagcac caggacaagg gacggcttga gctgaatttt gtcattccga acatggagct
     2521 acagaccgga aaacgcctcc agccgtacta cgaccgcgca gacaggccta gaattgatgc
     2581 ttggcagacg ctgttaaatc accattacgg gctgcatgac ccgaacgccc cggagaaccg
     2641 caggacgctg acactccctg ataacctgcc tgaaacgaaa caggcgcttg ctgagggcgt
     2701 cacgcgaggt atagatgcac tttaccatgc cggagagata aaaggccgtc aggatgtgat
     2761 tcaggcgctc actgaggcgg ggctggaagt ggtcagggtg acgcgaagca gtatcagcat
     2821 tgcagatccg aacggcggga agaatatcag gctgaaagga gcattttatg agcaatcttt
     2881 tgcagacggg cgcggagttc gagaaaaagc tgaaagagag agccgaatct acagagaaaa
     2941 tgctgaacaa cgagttcagg aggctcggcg aatctgtaag cgaggctgtg acatcaaacg
     3001 agacgaaaat cagagacgct atagccctgt tcacagcctc gacagaggaa tcgctggaaa
     3061 aacaccggga aggggtgaaa gaggcgatga tgcagcacag gagggacgtg ttaaagctgg
     3121 cagggaatac gggcatgatg ttactgggga tagtctttct cctgtttacc gcgagtggcg
     3181 ggacgctctg gtatcttgga gggaggatac aggcgaacct ggaagaaatc aggaagcagg
     3241 aagagacatt gcagaaactg aacgcgaaga catggggcgt ggagtttgtg caggacggga
     3301 acaggaaatt ccttgtcctt ccgtacggga aatcagcgga ggtgattccc tttcagggga
     3361 aagagtgggt acatctgaag gagtgacaca gagtgacaga gctggaaaca catttgctga
     3421 acgccttaga gcagctgcaa caggactata tgcagcggct gagcgaatgg gagagcgcct
     3481 tcgtggaatt gcagaagatg ttttcgctta cgcaacggga caacgcgatg ctgaacgagc
     3541 gggtcatgca gttgagtcag caggtgcaac acttgagcga gcagacagaa cgcttgagcc
     3601 agttatacag cgagaactgg agataagaga ggaacggctg atacaggagc gcgaacatgt
     3661 gttatcccta gaacgggagc gtcagccgga aatacaggaa cgcacgctgg atggcccttc
     3721 gctgggatgg tgaaaccatg aaaaatggca gcttcagtgg attaagtggg ggtaatgtgg
     3781 cctgtaccct ctggttgcat aggtattcat acggttaaaa tttatcaggc gcgatcgcgc
     3841 agtttttagg gtggtttgtt gccattttta cctgtctgct gccgtgatcg cgctgaacgc
     3901 gttttagcgg tgcgtacaat taagggatta tggtaaatcc acttactgtc tgccctcgta
     3961 gccatcgaga taaaccgcag aaatcgtgtc agccagcagc ctggattttt tcggggtaag
     4021 gtctttgctg cggtaatcag tacccgcaaa ttgagttgtg ccccggagtg agtttaaaaa
     4081 ttcttggctg tgaaactgct gaatgcgttg ttgcacatca ctccgtgatt ttcctgtcgc
     4141 tttgtcttta gctcccatat cccagaactg ccgaaaaaaa ggcaccatca ttcgtgttgt
     4201 gtcggctttt tgggtggaaa agttgggatc aaaaccgtca tcagaagcat atacgtaatg
     4261 ccgggtatgt tgttccacgc tgccctttgg ggtacaggct gtgagagcta ctgttgctgt
     4321 tatcatggcg ataagacaaa gtttgtttcc ggtacgtctc atgagtgtct ccctacctta
     4381 aagtatttat tctcggtgag ttattatccg aagactattt tatcatcagt ctctgacagc
     4441 gttcagcgac cgaacaacct ggaccagaat ccccgtttct tttccagttc cagtgcctgc
     4501 ctgagctggg ctatctcatt ttgtagctgt tcccgttctg cttcctggcg tctgcgatcc
     4561 atatcctgtg cctgtttatc ctcaagcatc agcgtcaggc attgtttcag ctcattcagt
     4621 tcccggagaa tgcgttctgt ctgctgatca tgtggatttt ctgcatgtcc ctcactgtgc
     4681 ctttctggtg tctcattctg ctttaattcg ccgtatgccc ggatcagttc actggtttca
     4741 aactcccgtc gtccgtccct gccggtacgg taactgacaa ggcctccgcg catatcacgg
     4801 taaaactgtg aacgcgatct gcctgtcatt tttagtgcgt cccggagtgt gtgccatgcc
     4861 ataaagtgac agtgtcccat agatgtctca tctcatagtt tcagtaaaac ataatgaggt
     4921 ctgagaacgg taatgtttgt gctggttttt gtggcatcgg gcgagaatag cgcgtggtgt
     4981 gaaagactgt ttttttgatc gttttcacaa aaatggaagt ccacagtctt gacagggaaa
     5041 atgcagcggc gtagctttta tgctgtatat aaaaccagtg gttatatgta cagtatttat
     5101 ttttaactta ttgttttaaa agtcaaagag gattttataa tggaaaccgc ggtagcgtac
     5161 tataaagatg gtgttcctta tgatgataag ggacaggtaa ttattactct tttgaatggt
     5221 actcctgacg ggagtggctc tggcggcgga ggtggaaaag gaggcagtaa aagtgaaagt
     5281 tctgcagcta ttcatgcaac tgctaaatgg tctactgctc aattaaagaa aacacaggca
     5341 gagcaggctg cccgggcaaa agctgcagcg gaagcacagg cgaaagcaaa ggcaaacagg
     5401 gatgcgctga ctcagcgcct gaaggatatc gtgaatgagg ctcttcgtca caatgcctca
     5461 cgtacgcctt cagcaacaga gcttgctcat gctaataatg cagctatgca ggcggaagca
     5521 gagcgtttgc gccttgcgaa agcagaagaa aaagcccgta aagaagcgga agcagcagaa
     5581 aaggcttttc aggaagcaga acaacgacgt aaagagattg aacgggagaa ggctgaaaca
     5641 gaacgccagt tgaaactggc tgaagctgaa gagaaacgac tggctgcatt gagtgaagaa
     5701 gctaaagctg ttgagatcgc ccaaaaaaaa ctttctgctg cacaatctga agtggtgaaa
     5761 atggatggag agattaagac tctcaattct cgtttaagct ccagtatcca tgcccgtgat
     5821 gcagaaatga aaacgctcgc tggaaaacga aatgaactgg ctcaggcatc cgctaaatat
     5881 aaagaactgg atgagctggt caaaaaacta tcaccaagag ccaatgatcc gcttcagaac
     5941 cgtccttttt ttgaagcaac cagacgacgg gttggggccg gtaagattag agaagaaaaa
     6001 caaaaacagg taacagcatc agaaacacgt attaaccgga taaatgctga tataactcag
     6061 atccagaagg ctatttctca ggtcagtaat aatcgtaatg ccggtatcgc tcgtgttcat
     6121 gaagctgaag aaaatttgaa aaaagcacag aataatctcc ttaattcaca gattaaggat
     6181 gctgttgatg caacagttag cttttatcaa acgctgactg aaaaatatgg tgaaaaatat
     6241 tcgaaaatgg cacaggaact tgctgataag tctaaaggta agaaaatcgg caatgtgaat
     6301 gaagctctcg ctgcttttga aaaatacaag gatgttttaa ataagaaatt cagcaaagcc
     6361 gatcgtgatg ctatttttaa tgcgttggca tcggtgaagt atgatgactg ggctaaacat
     6421 ttagatcagt ttgccaagta cttgaagatt acggggcatg tttcttttgg atatgatgtg
     6481 gtatctgata tcctaaaaat taaggataca ggtgactgga agccactatt tcttacatta
     6541 gagaagaaag ctgcagatgc aggggtgagt tatgttgttg ctttactttt tagcttgctt
     6601 gctggaacta cattaggtat ttggggtatt gctattgtta caggaa
//
LOCUS       COGTUB1A     2672 bp ds-DNA             PLN       13-JUL-1990
DEFINITION  C.graminicola beta-tubulin-1 (TUB1) gene, complete cds.
ACCESSION   M34491
KEYWORDS    beta-tubulin-1.
SOURCE      C.graminicola DNA.
  ORGANISM  Colletotrichum graminicola
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Diaporthales; Diaporthaceae.
REFERENCE   1  (bases 1 to 2672)
  AUTHORS   Panaccione,D.G. and Hanau,R.M.
  TITLE     Characterization of two divergent beta-tubulin genes from
            Colletotrichum graminicola
  JOURNAL   Gene 86, 163-170 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        662      673     beta-tubulin-1 (TUB1), exon 1
                740      763     beta-tubulin-1, exon 2
                818      843     beta-tubulin-1, exon 3
                905      946     beta-tubulin-1, exon 4
               1014     1068     beta-tubulin-1, exon 5
               1124     1908     beta-tubulin-1, exon 6
               1962     2355     beta-tubulin-1, exon 7
    IVS         674      739     TUB1 intron A
    IVS         764      817     TUB1 intron B
    IVS         844      904     TUB1 intron C
    IVS         947     1013     TUB1 intron D
    IVS        1069     1123     TUB1 intron E
    IVS        1909     1961     TUB1 intron F
BASE COUNT      632 a    754 c    660 g    626 t
ORIGIN      
        1 ggatccggct cggcgttacg gactggaaac ccaaagggca acaatgagta gagcacttag
       61 caaccgtgca acaacaacac catagccagg cagctgtgta tccaggggaa cgaacccatt
      121 ggtccaaggt gttaccaagc ctctaggcaa taatctccca gttacggcca tacaaacctg
      181 attcatgatg cctagaattc tgagcaaagc aaaacagagc atccatgtac caatcatttc
      241 tgtggatata tactcaacgc ggactagctt ggtctctttt gggcgccgcc tccgagagaa
      301 tgggcatcag ccataggtcc ctttcttttt tccttttccc atccccctcc cctccctacc
      361 catcactctc tcttccaaag agtaagcaaa actgtgctgc cggagtaggc agccgtcgaa
      421 tcagaaaggc agccattcaa cgatgcgcaa acatgtgggg attggaaatc cagaccgcaa
      481 agttacactc tgctcgggcc tatccgatac gcagcgataa gttgtcggtc cttggatagg
      541 ggatttgttg tgatttacag gcatatgtat ctataagatg accacctgtc ttgagtagcg
      601 gggaaaatct ctgctttcca ccatcaactt ttgacaatcc acaccattcc caaagttcat
      661 aatgcgtgag attgtgagtt gtaccttgtc cacgtaccta ttgaggtatt gattctcagc
      721 tgactcgggg atccaacaga tacacctcca aaccggccaa tgtgtacgta tctttcgctg
      781 ccgaatggat gagccctgac tgacgaagaa tctgtagggc aaccaagtcg gtacagcatt
      841 ctggtatgag gatgggaccg tcgatgcacc ttcaagagat gagactgatg ttggtcacta
      901 ccaggcaaac cattcacggc gagcacgggc ttgatcaaga tggagtgtga gtctcaccgc
      961 catgtaacgc gacgtcacct cgctcatcga gcgtcaagtt gacattttca tagattccgc
     1021 ggctctgatg agcagcagtc cgagaggctg agcgtctact tcaccgaggt aactcgatac
     1081 agtcgtcgcg cacttctccc tattcctttc tgacacgatg caggctgcca aacaaaagta
     1141 cgttcctcgt gcagtcctgg ttgaccttga gcccgcaacc atggatgcaa tccgttccgg
     1201 tccccttggt gacttcttcc gtcccgacaa catggtctac ggacagtccg gcgcgggtaa
     1261 caactgggca aagggccact acaccgaagg cgccgagctg gttgaccaag tcctcgacgt
     1321 cgtgcgccgc gaagccgaag cctgcgacag tctgcagggt ttccagatca cgcactccct
     1381 cggtggcggc accggctctg gcatgggcac cctcctcatc gcaaaggtcc gcgaggaatt
     1441 ccctgatcgt atgatggcga ccttctctgt cctgccgtct gccaaggtct ctgaggtcgt
     1501 tgtcgagccg tacaatgcga ccttgtcgat tcatcagctc gtcgaaaact cggacgagac
     1561 gttctgtatc gacaacgagg ccctgtatga tatctgccgc cgcaccctga agcaagcgca
     1621 cccgtcctac ggacatctca accacctggt gtccagggtc atgtccggct tgacgacggg
     1681 cttccgcttc ccggggcaat tgaacgccga cctccgcaag ctggccgtga acctcgtccc
     1741 cttccccaga cttcacttct tcaccgtcgg attcgcgccg ctcacctcct cggcctcctt
     1801 ctcaaacctg ggtatcgccg agctcacgca gcagatgttc gacccgaaga acgtcatgtt
     1861 ggcgtctgat ttccgtgacg gacgcttctt gacttgctcc accatgttgt gagttggtcc
     1921 cctcatgtcg cttgatgaat tctttgctaa cgtaccttta gccgtggaaa ggtgtccatg
     1981 aagcaggtcg aggaacagat ccaggccatc aagaacaaga actcagccaa cttcgtggag
     2041 tggatcccga ataacatcca gactgctcat tgctctgtgc cgcccaaggg actggacgta
     2101 tcatcgacgt tcattggaaa ctccaccgca atccagaaca gtttccgcag agtcggcgac
     2161 cagttcagcc tcatgttccg gagaaaggct ttcttgcatt ggtacactgg agagggcatg
     2221 gatgagatgg aattcactga ggccgagtct aacatgaacg atctggtatc cgagtaccag
     2281 cagtaccagg atgcaggcat ggatgacgag tacggcgaag agtacgaaga cgaggctcct
     2341 gctgaggagg aatgagaagg caagacaact cttcgatgtc cttcccgagg tgaagttact
     2401 ttatattcaa ggtagttgat ggtctttgaa ccttttccaa tcactgtaca cacccaatcc
     2461 accatcatag tccgagcagc tcacttgaag ttattttccc tttcgactga gaaggcaata
     2521 gagttgttct atctcgttgc ttgtgcagac aacgaagtgg atcgttacgc aacccctagt
     2581 cttgacttgc aaatcatatc gtcccaagat ccagatggct attagttcat ggatcggtac
     2641 ccttgtagat ccacctgaag tggctccccg gg
//
LOCUS       COGTUB1B     2614 bp ds-DNA             PLN       13-JUL-1990
DEFINITION  C.graminicola beta-tubulin-2 (TUB2) gene, complete cds.
ACCESSION   M34492
KEYWORDS    beta-tubulin-2.
SOURCE      C.graminicola DNA.
  ORGANISM  Colletotrichum graminicola
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Diaporthales; Diaporthaceae.
REFERENCE   1  (bases 1 to 2614)
  AUTHORS   Panaccione,D.G. and Hanau,R.M.
  TITLE     Characterization of two divergent beta-tubulin genes from
            Colletotrichum graminicola
  JOURNAL   Gene 86, 163-170 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        441      452     beta-tubulin-2 (TUB2), exon 1
                655      678     beta-tubulin-2, exon 2
                738      763     beta-tubulin-2, exon 3
                840      881     beta-tubulin-2, exon 4
                942      996     beta-tubulin-2, exon 5
               1068     1858     beta-tubulin-2, exon 6
               1917     2310     beta-tubulin-2, exon 7
    IVS         453      654     TUB2 intron A
    IVS         679      737     TUB2 intron B
    IVS         764      839     TUB2 intron C
    IVS         882      941     TUB2 intron D
    IVS         997     1067     TUB2 intron E
    IVS        1859     1916     TUB2 intron F
BASE COUNT      561 a    802 c    607 g    644 t
ORIGIN      
        1 cccgggtctt ggatcggaga gaacagagga gcagaatagg gattattggg taggcaggta
       61 cgtgcccagg taacacgtcg tgcctgccaa tcaacgccga tctgagtcct cgacttgttg
      121 cgacccacca tcaaatagtg cctgggaatg gtctttcccc aatcaggatc gtgcacggat
      181 cccaatagta aacaagcgag ctgcaccctt ttcctctctg gcctgtctct gggcacctgg
      241 cctgtggcgg tgagcgaaat cacgtttacc cgcaaaacaa aaatcaacaa cctttcccct
      301 acctaaccac accttaatct catccacttt ccaaccacgt ccaccttgga agcttcgcgc
      361 agctctcaag cacctcccac tcgtcctctc ctttgctcca tcgtcggcct agtcagctca
      421 agagcttttc tccattcaca atgcgtgaga ttgtaagtcc ttcccctcaa tcattcgtaa
      481 caaataaacc tgcgaccgac gcgtttggcg acgaatcgtc ggccttgccc ctgaacgtac
      541 cccgccgaca tttccaccca acactggtcc tcaccgaaga cgaccacgat tgccatcacc
      601 gacagtatgc accttgggga tatatcgctg accattgatt gttatactcg ataggttcac
      661 ctccagaccg gccagtgcgt aagtcttctc tgatcccaac caacaatcca aggtgcgggg
      721 ctaacttctt tgaatagggt aaccagattg gtgctgcctt ttggtgcgta gccagaccga
      781 catcatcgac ttcggcgaga ttggcctcga aaagacattg gatattaata cgggcacagg
      841 caaaacatct ctggcgagca cggcctcgac agcaatggcg tgtatgttgc caacctccag
      901 atctggccac ttcctcgagt tcaccgctaa tttctcaaca gttacaatgg cacctctgag
      961 ctccagctcg agcgcatgag cgtctacttc aacgaagttt gttatcctag ccccccccag
     1021 gaagcagaca aacctattga tgaatactga ccttgtcacg tacccaggct tccggcaaca
     1081 agtatgtccc tcgcgccgtc ctcgtcgact tggagcccgg taccatggat gctgttcgcg
     1141 ccggcccctt cggccagctt ttccgccccg acaacttcgt ctttggccag tccggtgccg
     1201 gcaacaactg ggccaagggt cactacaccg aaggagctga gcttgtcgac caggtccttg
     1261 acgtcgtccg ccgcgaggct gagggctgcg actgccttca gggcttccag attacccact
     1321 cccttggtgg aggtactggt gccggtatgg gtactctgtt gatctccaag attcgcgagg
     1381 agttccccga ccgcatgatg gctacctttt ccgtcgttcc ctcccctaag gtttccgaca
     1441 ccgttgttga gccctacaac gccactctct ccgtccacca gctggttgag aactccgatg
     1501 agaccttctg cattgataac gaggctctct acgacatctg catgcgtact cttaagctct
     1561 ctaacccctc gtacggcgac ctgaaccatc tcgtctctgc cgtcatgtcc ggtgtcacta
     1621 cctgcctgcg tttccctggt cagctgaact ctgacctgcg taagctggcc gtcaacatgg
     1681 ttcctttccc ccgtcttcac ttcttcatgg tcggattcgc tcccctgacc agccgtggtg
     1741 cccactcttt ccgcgctgtc agcgttcctg agctcaccca gcagatgttc gaccccaaga
     1801 acatgatggc tgcctctgac ttccgcaacg gtcgctacct gacctgctct gccatcttgt
     1861 aagtgtcatt tccagccaac ctacaatgtc agcacttgct aacagctgcc tctcagccgt
     1921 ggtaaggtcg ccatgaagga cgtcgaggac cagatgcgca acgtcctgaa caagaactct
     1981 tcttacttcg tcgagtggat ccccaacaat gtccagaccg ccctctgctc cattcctccc
     2041 cgcggcctca agatgtcctt cacctttgtc ggtaactcta ccgccatcca ggagctcttc
     2101 aagcgtgtcg gtgagcagtt cactgccatg ttccgtcgca aggctttctt gcattggtac
     2161 actggtgagg gtatggacga gatggagttc actgaggctg agtccaacat gaacgacttg
     2221 gtctctgagt accagcagta ccaggacgct ggtgttgatg aggaggagga ggagtacgag
     2281 gatgacgccc ccctggagga ggaggtttaa gcgttgtctg aaaatgctgt gccaccttgg
     2341 ccatgtcttc acccaacccg tctgcggtgg catttcgctt cactattcca gctctgcact
     2401 ggaaatgggc ttctagatat acctctctta gtagttcgcc tggcgtatca aaatgagtac
     2461 gaagaatcag agattactct gtacaaatta ttggcaacat caaatgcata gttttatggc
     2521 aattgcgaca cctctaatct tgccagagtt caagtattcg tatcttcttt cgtgactgac
     2581 agattatcta tgtttaaacg tggacaccct cgag
//
LOCUS       DDICNPA      6372 bp ds-DNA             INV       13-JUL-1990
DEFINITION  D.discoideum cyclic nucleotide phosphodiesterase gene, complete
            cds.
ACCESSION   M23449
KEYWORDS    cyclic nucleotide phosphodiesterase.
SOURCE      D.discoideum (strain NC4; cell line AX3-K) DNA, clones pGP-1 and
            pC34.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 6372)
  AUTHORS   Podgorski,G.J., Franke,J., Faure,M. and Kessin,R.H.
  TITLE     The cyclic nucleotide phosphodiesterase gene of Dictyostelium
            discoideum utilizes alternate promoters and splicing for the
            synthesis of multiple mRNAs
  JOURNAL   Mol. Cell. Biol. (1989) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 3380 to 4379)
  AUTHORS   Faure,M., Franke,J., Hall,A.L., Podgorski,G.J. and Kessin,R.H.
  TITLE     The cyclic nucleotide phosphodiesterase gene of Dictyostelium
            discoideum contains three promoters specific for growth,
            aggregation, and late development
  JOURNAL   Mol. Cell. Biol. 10, 1921-1930 (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by R.H.Kessin, 23-MAR-1989.
FEATURES       from  to/span     description
    pept       4127     4146     cyclic nucleotide phosphodiesterase, first
                                 expressed exon (EC 3.1.4.17)
               4297     5635     cyclic nucleotide phosphodiesterase, exon 2
    pre-msg    1163     6060     CNP mRNA and introns
    IVS        1780     2266     CNP intron A (no splice consensus at 2266)
    IVS        2377     4077     CNP intron B
    IVS        4147     4296     CNP intron C
    signal     6015     6020     poly-A signal
BASE COUNT     2615 a    750 c    504 g   2503 t
ORIGIN      
        1 tgatcatttt gtaaatcttt caattaaatt tcaacttaca cattaataat aataaaaata
       61 ataataataa taatatcaga cattcacagg atttttataa atccaatttt atttttcatt
      121 gatgtcgttt tattaattga cattcaaatt gtgaataact gacaataaat aaaacagttc
      181 atatatttat ttgtttgaaa aaaaaaaaaa aaaaaaaaaa tcaaagttgc atgtgcgagt
      241 gtaaattggg gtttttttta taaatctaaa ttagttattt tacaatttta gaaccactac
      301 caaattacaa aaaaaaaaaa aaaaaaacca ctcatttatt tttttttttt tttttttttt
      361 ttttatcaca tagattccaa taaaattgtg aaacataaag aggaaaacga taatatggat
      421 tattagaaaa taatcaatac tttttttttt tttttttttt tgtgttttat tggtttttaa
      481 ttaaaagttt tgggggtttt ttattttaat tttttttttt tttgaaaaac atactcactc
      541 acacaaaacg caatcaataa atctattttt tgaaaataaa aaaaatcttt tttttttttt
      601 tttttttttt ttttctagtt ttcaaaaaat tcatttaatc gcattacatt ataaaataaa
      661 aaaattcggt gtcttaaatc tcccaaaatc ttgaatgatt tgtgtgtgtt aaaataataa
      721 tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaatt ccaactttta gatccccaat
      781 cacataggtg tgatatatct aagttattga aaattgtgtt gatattgttt ttcttacttt
      841 tttttttttt ttaaaataac cactttactt tctaaatatt gcatcgcaat gggggtatta
      901 aaagttttgt tatttttatt tacacatctt gatttcaaga taaaaatttt tagtgtaaat
      961 atatcaaaca ccatagagac ttttcacatt atcaagtgtt tttgactctc tatcacactc
     1021 acaaatctga tttttatttt tatttttatt tttttttttt ttttatttta tttttttttt
     1081 attttatttt atttttctcc aaccacaatt gatttcaaaa aaaaaaaagt ctataaaatc
     1141 aaaaaaaaat ttttttttta attaatcaaa aacaaaaaca aaaaacaaaa ataaataaaa
     1201 aaaaaaaaaa aaaacttgta acattaataa taataatttg taattttaaa taccaacttc
     1261 aattccatat tacttttttt gtatcattat ttatatatct caatctattt taaatcacca
     1321 ttgtatcact actaaaaaaa aaaaaaaaaa aaaaatcaaa aatcaaaaat ccaaatttat
     1381 aaaaaaaaaa aaaaaaatca aaattgtaaa aaaataacaa tatcaccctt ttttctgtat
     1441 tattacttgt atttgtaaca ctataaatta aataaataaa aatttaattt tgtatcatta
     1501 ccacactatt aatttttttt ttacaatttt tgatcaataa taaatcgtta aaaaaaaaaa
     1561 aaaaaaaaat ttttaataat aaaataaatt aataccaaaa ataatcaacc aaccaaccaa
     1621 aaattattta attatttatt aaccatttta tcataattaa taaataaaaa aaaaaaaaaa
     1681 ataaaaaaaa aacatcaaaa aaaaaaaata aaaaaacaat accatatcaa attatctctc
     1741 gttctgtact attactcctt aaaaaattta ttaaaaaagg tatgtaaata atataattat
     1801 tattatttta tttattatta tgattccatt taaatgtttc aaaatttggt ttttttattt
     1861 tttccttatt ttcgttcgct ttgaccaatc attttttttt attattttaa tttttaatta
     1921 tttttcgggt tcattccaac atgacaatgg aaaaaattaa aaaagggtat aggaaaagcg
     1981 caaatcaaaa ggaaaaaaaa aaatatcttg cgtttgccaa tttttatttt tttttatttt
     2041 tttttttatt tttttttttt ttgaaatcaa tcaaaatttc caaccttata ataatccaat
     2101 ttaatcacaa atccagtgat tttatcgttg tgtgtgtgtg gtctgatttt aaaatcacca
     2161 actgtttttt tttataatta tttttttttt attttatttt tcaatttctt ttcttttttt
     2221 tttttttatt ttttaatatc ttttattttt tttttttttt ttttttttat tattattaat
     2281 aatcataaaa ctgtaatagt acttaaattg taacaaaata aaaatataaa aataataata
     2341 ataatttaac aataagaata gcaacaatta ttcattgttt gtatttaatc aatatttaga
     2401 atttttatta tttttttttt atatttatta tttattatct atctagaata caagtagaaa
     2461 atataattat ataattttat aatacccatt ataatatatt ttattttatt ttattttatt
     2521 ttattttttt ttctcctaaa atcacccaaa caccccacac acaattttat gtaatattca
     2581 ataacaacat tgattttcac accactcact gcccaattca attcaaacga tagtgagtaa
     2641 ttaattttgt gtttagatgg tagatagaga tgatgtattt tattttttta tatcatccca
     2701 agttccatat aaatcaacat tttatgatgt ttatttgttt ttcaatataa catttttatt
     2761 taataaaaaa atttaattaa atcatatata acactactta ataaaaacat caataatcac
     2821 accacccact accaataata aaaaataaat aaaaaaaata aataaataaa aaattaatta
     2881 aatgtttata tttttataac accacacata cgcccacact ccctctttct ccccactact
     2941 aaacttgata tttttattgt tttttttttt atacaataaa acattgattg ttatattttt
     3001 aattttatga ttaaaaaaaa aaaaaaaaaa caaaaacaaa aactatatat aaaaataccc
     3061 aaaaataaaa tatttttaag ggtagagtga taataattaa tcatacacaa aataatgaaa
     3121 gtcacaagta gttataaaaa aataaaaaaa aaaaaaaaaa aaaggaaaaa aaaaaaaagt
     3181 aataataata ataagaataa taaaaataat aaaaataatt taaaccctat atattgacat
     3241 aatggatagc tctataatgt ataatagtag tatatttatt ttaaattata tataaaaata
     3301 atttacgatt tcaaagatcg aaaccaatta taacactact gttatgaaac aaatgtgtat
     3361 attttacact aacctcgtcg atatctattg tgtgtaacaa ttaggggttt tattatttcc
     3421 ttttttttta aaaaaatttt tttttttttt ttatttacta ctatttttaa aaaaaaaaaa
     3481 aaaaaaaaaa aaaaaaaaaa gataatttat tttgatcttt tatgtgttga tacacctttt
     3541 catgtttaca cacaaacaaa aaacactcga taaattttat tattattttt tatttgtttt
     3601 taatttaatt gatattaaaa aaaaaaaaat taaaaaaaaa aaaaaaaaaa aattttaagt
     3661 gatatataat ttttaaaatt ttttatttat ttattttttt aaaaaattaa tttttcaaac
     3721 aataatacat tcaaaattaa aacaataata acagattcaa aatatcattt ggttttgttt
     3781 tttttttttt ttttatttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaattata
     3841 catataaata tttataacaa taacccacaa acgccacaca ctcacttatt ttttttaatt
     3901 ttaaatttta tttattttat ttatttattt atttatttaa tcaaaaaaaa aaaaaaaaaa
     3961 aaaactaaaa ttaaaattac taacttattt ttttttttat aaatatatat aaaaaaaaaa
     4021 aaaaaataaa attttttttt tttccaatat ctttatcttt tttttttatt ataaaagaag
     4081 aaagaattaa tatagttcga ataactacaa aaaatataca aaaaaaatgg cattaaataa
     4141 aaaattgtaa gtggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaattta aaaatagaaa
     4201 ttgatttgtt gtttatacta attttttttt tttttttttg gaattttttt tttggaaatt
     4261 tttatttttt attttttttt tttttcccaa aaatagaatt agtttattac ttttaatttt
     4321 tataatttta aatattgtaa attcccatca acaagaagat tgtgatgatg acgatgaaga
     4381 tataggaatt tcagcagaga gatcagaaag aagatcagtt aaaaacagta acgatggtag
     4441 taatttttat aatttaaatg attattatac accagagaat tggaatcatt atagtggatc
     4501 atttgcaact aaagattgta gggatgcatc atacataacc ataccattgg gtacaactgg
     4561 tggattggat gaaggtaatc tctcatcatt cctcttgaca aagaaaggta gtaatctttt
     4621 cattgcattg gatgcaggta cagtatggca aggtgttaga agattaacca ccttcaaata
     4681 tttcaataca ctcttcaata ttacctatcc atcatgggca gttttaccag agcaaagaac
     4741 ctcatggttc ctcaagaaac acgttatgag ttatttcatt ggtcacagtc atttagatca
     4801 tgtaggtggt ttaattttag tttcaccaga ggattattta gcaaagaatt ggatcgatgt
     4861 tcaaccacca atcaacaatg gtatcatggg tctcattcgt aaattagggt tcaaaccaac
     4921 tgatttcacc tcttcaagta tcctccaaaa gaaaaccatc atgggtttac catcaaccat
     4981 caattcaatc tctactaatc ttttcaataa tcaagtttgg ccaaatcttc catcatttgg
     5041 tcgttatcaa tactttagtt tagcaagtgg cattgaatat ccattcacag agttggtccc
     5101 atacaatgcc accactatgt ctttggtcgc caatgaattc cctttctctg tcaaagttaa
     5161 accatttgaa ttgtgtcacg ataatttaat atcaacttct ttcctcttca ctgatagtat
     5221 ctctggtgaa caaatcgctt tcttctctga tactggtgtt ccatcctctg tcgcttgtga
     5281 ttgggaaggt aaaatctatg ccgtctggaa acaaattaaa atcgataaat taaaagctat
     5341 ctacatcgaa acttctttcc caaataatac tccagactct gctatgtttg gtcatctccg
     5401 tccaagagat gtcatgaaat taatggatca actcttggtt caatccattc aaacctctcc
     5461 accaatgact aatctcaaac atgttaaatt aatcatcgaa cacattaaac cacaagttgc
     5521 tgaagatcca aatggttgga caactcaaag agtaatctat caacaattaa aagaagcaaa
     5581 taataatggt gttagaatta ttattccaaa tcaaggtgat ccaatttgta tttaaataat
     5641 taattaatat aatatataat aattaaacat atttatttat ttattccacc aaaaaaaaaa
     5701 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa attaaaaaaa aaaaaaaaaa aaaaaaatta
     5761 aataataata ataactaaaa tagaaaacaa taccaaaaaa aaaaacaaaa aaaaaaaaaa
     5821 aaaccaatcg acaatttttt tttttttttt tctctaaaac aaatacataa aaaaaatcat
     5881 cttataaata actcctcctc ctttcccctc cttctataat tattcagttt cattaattgt
     5941 atagattttt tttttttaag tcaaccaacc tccttttttt ttaaaccaca aaaaaaaaaa
     6001 aaaaaaaaaa aaaaaataaa ccattttatt tttttttttc caaaaaaatt ttccagtttt
     6061 tttttttttt tattttagaa acattttatt tttactttta aaaaatatgg gtgtttattt
     6121 tgagaaatat attttttatt taataacaac accctttaag gaaagataga taataattaa
     6181 ttatattttt attttacata atgcattttt tttttttttt ttgggtttta ttttattatt
     6241 ttttttttta ttgttttaat attattattt ttttttcata aaactttttg agaatcacta
     6301 tggaagtcac ttaaattatt attttctctt tcgttatcat cttcttggaa atcaacttgt
     6361 tcatcttgat ca
//
LOCUS       HUMCPGISL    2520 bp ds-DNA             PRI       13-JUL-1990
DEFINITION  Human nested gene protein gene, complete cds.
ACCESSION   M34677
KEYWORDS    coagulation factor VIII; nested gene protein.
SOURCE      Human 4X B cell line (ACTT GM1202) DNA, clone p542.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2520)
  AUTHORS   Levinson,B., Kenwrick,S., Lakich,D., Hammonds,G.Jr. and
            Gitschier,J.
  TITLE     A transcribed gene in an intron of the human factor VIII gene
  JOURNAL   Genomics 7, 1-11 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        585     1268     CpG island protein, exon 1
               1271     1681     CpG island protein, exon 2
    IVS     <     1  >  2520     coagulation factor VIII, intron 22
    mRNA        528     2265     CpG island protein mRNA (5' end = end of cDNA)
    binding     523      532     Sp1 binding site
    binding     543      552     AP4 binding site
BASE COUNT      418 a    843 c    761 g    498 t
ORIGIN      Chromosome X.
        1 cccgggaaag tcctcagcat gatggtgttc gttggggcgg gtgggcctca agaccatgac
       61 tgcagtgttg cgacaggacg gacttctaag gcaaccgggg tggggacgga gaggtggacc
      121 cggtgcttgg caaggggccc tggaccggaa gacggaggac tgagtagtcg gggcgggggg
      181 ggggggggac aacagtctag aaatgcgcag acatggaatt agggtcacga gtttcgtttg
      241 ggtcatgttc agatggagag gctgtgggat gcctttagca acctagacca cttgcattgc
      301 agccgagcta gcagcagctg gtgccccgcc aggccccgcg taccttccca gggtcttgga
      361 tccgcatccc cgagaccctc gatttgcata cgccgctcac agcccggcca ggccccgcct
      421 ccgatcccgc tctttgcacc tcatcagggc aaggggcagg ctcgggctgg ggagccggcg
      481 cgggcacggc ggcagcccgc cgagggggac gcagggcacg tcgccccgcc ccggcccgcc
      541 cgccagctgc ggcggcgcgt cgggaagtgc ttggggcggc gagcatggcg gcagcggctg
      601 caggcctggg cggcggcggc gccggcccgg gacccgaggc cggggacttc ctggcccgct
      661 accggctggt atcgaacaag ctgaagaagc ggttcctgcg gaagccgaac gtggcggagg
      721 ccggcgagca gttcggacag ctgggccggg agctgcgcgc ccaggagtgt ctgccctacg
      781 cggcctggtg ccagctggcg gtggcgcgct gccagcaggc gctcttccac gggcccgggg
      841 aggcgctggc cctcaccgag gccgcccgcc tcttcctgcg gcaggagcgc gacgcgccag
      901 cgctggtctg ccccgccgcc tacggggagc cgctgcaggc cgccgccagc gccctgggcg
      961 ccgcggtgcg tctgcacctc gagctgggcc agccggccgc cgccgccgcc ctctgcctcg
     1021 agctggccgc cgccctgcgc gacctgggcc agccggccgc cgccgccggt cacttccagc
     1081 gcgccgccca gctccagctg ccccagctgc ccctggccgc gctgcaggcg cttggcgagg
     1141 ccgcctcctg ccagctgctg gcgcgcgact acaccggcgc cctggcggtc ttcacgcgca
     1201 tgcagcgcct ggcgcgggag cacggcagcc acccggtgca gtcactgccg ccgcccccgc
     1261 cgccggcacc ccagcccggg cccggggcga cgcccgccct accggccgcg ctgcttcctc
     1321 cgaactccgg ctcggcggcg ccctctcccg cgccctgggc gccttctcgg acgtgctggt
     1381 ccgctgcgag gtgtcccgcg tgctgctgct gctcctcctg caaccaccgc ccgccaagct
     1441 gctgccggag cacgcccaga ccctggagaa gtactcctgg gaggcttttg acagccacgg
     1501 gcaggagagc agcggccagc ttcccgagga gctctttctg ctgctccagt ctttggtcat
     1561 ggctacccac gaaaaggaca cggaagccat caagtcgctg caggtggaga tgtggccact
     1621 gttgactgct gagcagaacc acctccttca cctcgttctg caagaaacca tctcccccta
     1681 ggacagggag tctgatccat cccattcacc cagtgacttc tttttgccca ggcctggact
     1741 ttttgcatca gtcacgttaa ccagatgact ttgcctgtta ccaaacctca tgcatccacg
     1801 tttgcgtctg gggaggaata aaaagacatc gttcccgctt ctgcgttttg ttattcctac
     1861 tgccgccata ggaattattt cgttggctga acgttaccag caccccgaga acacattttg
     1921 atagaatcag agtagaggac atggctgtct tctaaaaagc cacgacatga aaatgacaat
     1981 ccctttcgtc tccttcctcc gctgcttcca cctaacgcag cctcctgcct ccgcctttgt
     2041 ttcatagtga ggattttatt ttgcacggcg ccctccctct aaatacctac cctagatgat
     2101 ttcatcctgc ccctcacttc ttaacatatt cctgtgtctt tgtaatggcc aaatttctcc
     2161 ttccactcgt ccgcacggta cgtcttcatg ggagtcattt tattccttac agcttccttg
     2221 taaccacagc cctaaatcca tgataaagct actcttgtac tcttcactca tgtgtggttt
     2281 tggtgtccgt cgtctataaa atgtgactct cttacctact tgtcaggagg ccagcagcag
     2341 aaccagatag ctgtttgggc tccaggatgt attcgttcct ttgaaatttg cgctatgcct
     2401 cctatgtggg caacgaacct gcagagatca tctatgatgt cctccgtact cgggtgactg
     2461 caatgcagat acatccagtg cagcgggtct ggtgccgatg catcgatttg cattgaattc
//
LOCUS       HUMIGHCY      372 bp ss-mRNA            PRI       13-JUL-1990
DEFINITION  Human Ig active mu-chain mRNA Vv-D-J6-region, partial cds.
ACCESSION   M34678
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; variable region.
SOURCE      Human CD5+ EBV transformed B-cell, cDNA to mRNA, clone EBV 383.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 372)
  AUTHORS   van der Heijden,R.W.J., Bunschoten,H., Pascual,V.,
            Uytdehaag,F.G.C.M., Osterhaus,A.D.M.E. and Capra,J.D.
  TITLE     Nucleotide sequence of a human monoclonal anti-idiotypic antibody
            specific for a rabies virus-neutralizing monoclonal idiotypic
            antibody reveals extensive somatic variability suggestive of an
            antigen-driven immune response
  JOURNAL   J. Immunol. 144, 2835-2839 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   372     Ig mu-cahin Vv-D-J6-region (AA at 1)
                                 /hgml_locus_uid="LW0029F" /nomgen="IGHM"
                                 /map="14q32.33"
    recomb      294      295     Vv-region end/D-region start
    recomb      316      317     D-region end/J6-region start
BASE COUNT       84 a    107 c    108 g     73 t
ORIGIN      
        1 gaggtgcagc tggtgcagtc tggagcagaa gtgaaaaagc cgggggagtc tctgaagatc
       61 tcctgtaagg gttctggata cagctttagc acctcctgga tcggctgggt gcgccagatg
      121 cccgggaaag gcctcgagtg gctggggacc atccatcctg gtgactctga cacaagaaat
      181 agcccctcct tccaaggcca ggtcaccatc tcagtcgaca agtccatcag cacggcctcc
      241 ctgcagtggg gcagcctgaa ggcctcggac accggtatat attactgtgc gagactgata
      301 acaggtcctc gacactacta ctactacgct atggacgtct ggggccaagg gaccacggtc
      361 accgtctcct ca
//
LOCUS       HUMIGHCZ      306 bp ds-DNA             PRI       13-JUL-1990
DEFINITION  Human Ig expressed mu-chain gene Vv-region.
ACCESSION   M34680
KEYWORDS    immunoglobulin heavy chain; mu-immunoglobulin; variable region.
SOURCE      Human CD5+ EBV transformed B-cell DNA, clone EBV 383ex.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 306)
  AUTHORS   van der Heijden,R.W.J., Bunschoten,H., Pascual,V.,
            Uytdehaag,F.G.C.M., Osterhaus,A.D.M.E. and Capra,J.D.
  TITLE     Nucleotide sequence of a human monoclonal anti-idiotypic antibody
            specific for a rabies virus-neutralizing monoclonal idiotypic
            antibody reveals extensive somatic variability suggestive of an
            antigen-driven immune response
  JOURNAL   J. Immunol. 144, 2835-2839 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   305     Ig mu-chain Vv-region precursor (AA at 1)
                                 /hgml_locus_uid="LW0029F" /nomgen="IGHM"
                                 /map="14q32.33"
    sigp    <     1       12     Ig mu-chain Vv-region signal peptide
    matp         13  >   305     Ig mu-chain Vv-region
BASE COUNT       66 a     81 c     97 g     62 t
ORIGIN      
        1 ggagtctgtg ccgaggtgca gctggtgcag tctggagcag aggtgaaaaa gcccggggag
       61 tctctgaaga tctcctgtaa gggttctgga tacagcttta gcaccttctg gatcggctgg
      121 gtgcgccaga tgcccgggaa aggcctggag tggctgggga ccatccatcc tggtgactct
      181 gacacaagaa atagcccgtc cttccaaggc caggtcacca tctcagtcga caagtccatc
      241 agcacggcct tcctgcagtg gggcagcctg aaggcctcgg acaccggtat atattactgt
      301 gcgaga
//
LOCUS       HUMIGHVV      306 bp ds-DNA             PRI       13-JUL-1990
DEFINITION  Human Ig germline mu-chain gene Vv-region.
ACCESSION   M34679
KEYWORDS    immunoglobulin heavy chain; mu-immunoglobulin; variable region.
SOURCE      Human CD5+ EBV transformed B-cell DNA, clone 251.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 306)
  AUTHORS   van der Heijden,R.W.J., Bunschoten H Pascual,V.,
            Uytdehaag,F.G.C.M., Osterhaus,J.D.M.E. and Capra,J.D.
  TITLE     Nucleotide sequence of a human monoclonal anti-idiotypic antibody
            specific for a rabies virus-neutralizing monoclonal idiotypic anti-
            body reveals extensive somatic variability suggestive of an
            antigen-driven immune response
  JOURNAL   J. Immunol. 144, 2835-2839 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   305     Ig mu-chain Vv-region precursor (AA at 1)
                                 /hgml_locus_uid="LW0029F" /nomgen="IGHM"
                                 /map="14q32.33"
    sigp    <     1       12     Ig mu-chain Vv-region signal peptide
    matp         13  >   305     Ig mu-chain Vv-region
BASE COUNT       67 a     83 c     95 g     61 t
ORIGIN      
        1 ggagtctgtg ccgaggtgca gctggtgcag tctggagcag aggtgaaaaa gcccggggag
       61 tctctgaaga tctcctgtaa gggttctgga tacagcttta ccagctactg gatcggctgg
      121 gtgcgccaga tgcccgggaa aggcctggag tggatgggga tcatctatcc tggtgactct
      181 gataccagat acagcccgtc cttccaaggc caggtcacca tctcagccga caagtccatc
      241 agcaccgcct acctgcagtg gagcagcctg aaggcctcgg acaccgccat gtattactgt
      301 gcgaga
//
LOCUS       MUSPLII       874 bp ss-mRNA            ROD       13-JUL-1990
DEFINITION  Mouse placental lactogen II (PL-II) mRNA, complete cds.
ACCESSION   M14647
KEYWORDS    placental lactogen II.
SOURCE      Mouse (strain Balb/c) placenta, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 874)
  AUTHORS   Jackson,L.L., Colosi,P., Talamantes,F. and Linzer,D.I.H.
  TITLE     Molecular cloning of mouse placental lactogen cDNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83, 8496-8500 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         60      728     placental lactogen II (mPL-II) precursor
    sigp         60      152     placental lactogen II (mPL-II) signal peptide
    matp        153      725     placental lactogen II (mPL-II)
    mRNA    <     1      874     mPL-II mRNA
BASE COUNT      234 a    212 c    190 g    238 t
ORIGIN      
        1 agcactcggg gaacagcagc cttctggtgt tgtcagcact tcaagagaac tcctcagaga
       61 tgaagctgtc tttgagtcaa ccatgctcct tctctggggc actcctgttg ctggcagtgt
      121 caaacctact tgtctgggag aaggtgactt ccttaccaaa ttatcgatta cccactgaaa
      181 gcctatacca acgtgtgatt gtggtgtcac acaacgccca tgatcttgct tcaaaagcct
      241 tcatggaatt tgaaatgaag tttggtagga cagcttggac ctatggcctg atgttaagcc
      301 cctgccacac tgctgcaatc cttactccag aaaacagcga gcaagtccac cagacaacat
      361 cggaagacct tctgaaagtg tccatcacta ttttacaagc ctgggaagag cctctgaaac
      421 acatggtggc agcagtggct gctcttccac atgtacctga tactctgctg tcaagaacaa
      481 aggagttgga ggaaagaatt caagggcttc tggaaggact gaagatcata ttcaataggg
      541 tttacccagg agctgttgca agtgactata ctttctggtc tgcatggtca gatttgcagt
      601 catctgatga atccactaag aacagtgctc ttagaacctt atggcggtgc gtgcgcaggg
      661 atacacataa agttgacaat tacctcaagg tcctgaagtg tcgtgatgtt cataacaaca
      721 actgctgaac atctatccct cttctctgtc tcttagaagg tccctcgtga tctagacctt
      781 caaagcacca atgaatttcc ccttctttgg tgcctttcta gattgagttg tcatcttacc
      841 cccaaataaa ctgattcttt gcaaatgcta aatt
//
LOCUS       MUSRARGA     2715 bp ss-mRNA            ROD       13-JUL-1990
DEFINITION  Mouse retinoic acid receptor gamma (mRAR-gamma-A) mRNA, complete
            cds.
ACCESSION   M34476
KEYWORDS    retinoic acid receptor gamma.
SOURCE      Mouse embryo, cDNA to mRNA, clone lambda-mE13.20.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2715)
  AUTHORS   Giguere,V., Shago,M., Zirngibl,R., Tate,P., Rossant,J. and
            Varmuza,S.
  TITLE     Identification of a novel isoform of the retinoic acid receptor
            gamma expressed in the mouse embryo
  JOURNAL   Mol. Cell. Biol. 10, 2335-2340 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        253     1629     retinoic acid receptor gamma
    mRNA          1     2715     RAR-gamma-A mRNA
BASE COUNT      578 a    839 c    741 g    557 t
ORIGIN      
        1 cggggctctg cggccttcct tcccccgccc tcccctccag cagtttccac caggtccctc
       61 acctcagcct ggcccagtat gtaggaggga atctctgcag aggccagagg gatccttgga
      121 aacccactgg acagaccagg cagggtgggc acggagcctc ccaggccagg gcagtgggca
      181 tgggcggggg ctgtagctga agaccacccc cgcctgctgc agagtccaag ggattcccac
      241 gccgcagcta ccatggccac caataaggag agactctttg cgcccggtgc cctggggcct
      301 ggatctggtt acccaggagc aggcttccca ttcgccttcc caggtgcact cagagggtcg
      361 ccaccatttg agatgctgag ccctagcttc cggggcctgg gccagcctga cctccccaag
      421 gagatggctt ctctctcggt ggagacacag agcaccagct cggaggagat ggtacccagc
      481 tctccctcac ccccaccacc tcctcgggtc tataagccat gctttgtatg caatgacaag
      541 tcttctggct accactatgg ggtcagctcc tgtgaaggct gcaagggctt cttcagacgc
      601 agcattcaga aaaacatggt gtatacatgt caccgtgaca aaaactgtat catcaacaag
      661 gtcaccagaa atcgatgcca gtactgcagg ctacaaaagt gtttcgaagt gggcatgtcc
      721 aaggaagctg taaggaacga tcgaaacaag aagaaaaagg aggtaaaaga ggagggctcg
      781 cccgacagct atgaactgag tccacagtta gaggaactca tcaccaaggt cagcaaagcc
      841 caccaggaga cttttccctc actctgccag ctgggcaagt acaccacgaa ctccagtgca
      901 gatcaccggg tgcagctgga cctggggctg tgggacaagt tcagcgagct ggccaccaaa
      961 tgcatcatca agattgtgga gtttgcgaag cggctgcctg gttttacagg gctcagcatt
     1021 gccgaccaga tcacgctgct caaggctgct tgtctggaca tcctaatgct gcggatctgt
     1081 acaaggtata ccccagagca ggacactatg acattctcgg atgggctgac cctgaaccga
     1141 acccagatgc acaatgctgg ctttgggccc cttacagacc tcgtctttgc ctttgccggg
     1201 cagctgctgc ccctggagat ggatgacacc gagactgggc tacttagtgc tatctgcctc
     1261 atctgtggag accgaatgga cctggaagag cccgagaagg tggacaagct gcaggagccc
     1321 ctgctggaag ccctgaggct ctatgcccgg cgacgagacc cagccaaacc ctacatgttc
     1381 ccaaggatgc tgatgaaaat caccgacctc cggggcatca gcactaaggg agcagaaagg
     1441 gctataaccc tgaagatgga gattccaggc ccgatgccac ccctgatccg agagatgctg
     1501 gagaacccgg agatgtttga ggacgactcc tcgaagcctg gcccccaccc caaggcttcc
     1561 agtgaggacg aagctccagg gggccagggc aaaaggggcc aaagtcccca acctgaccag
     1621 gggccctgac ctaccccgtt gtggggttgg gccccaggca gcagactgac catttcccag
     1681 ataccgccag tgactggggg aggacctgcc ccgcccactc tccacccttt taatgagctc
     1741 gttattttgc caaagtttct aggggtgcct gtgttcatcc ctgtcctgtt ctaactggct
     1801 ccctctacag tcccgcgggg atctgctgca cacctaccag aagagctgga gaaggggcga
     1861 gcctgggtct agactctaaa atctcagcac tgcctttcgg attcaggcca caggctcccg
     1921 acgcaagagg aagccccgcc ttcccatagc cttttcctct gccaggtgct tgggcctctg
     1981 ggagcaaaca ggaacactag agaccaaaag gggggccccc cccgagggga gggctgagcc
     2041 acccttgtgc cccaccctga gcgcctgact ctgtgtgatg aatctgccgc gctcagtgcc
     2101 cccggtgccc catcctaatc ctcctgccgg gtgtaggggc aggctggggc ctgcgttttc
     2161 tagggtgggg cctgagggga agggagaatg tgagcccgag cccgtctgta cctctgctga
     2221 agtacagcac tcatcctggg ccctccatta tataagcctc tcccttactt ctgtctcgtg
     2281 ccttggcctc ccgccctcat ctcagccatc aggcagggcc ctcctacact acagaggggc
     2341 caggggtccc tctcctagtg ccttctaccc ttgatccccc cagagcagct tggccccggg
     2401 gaggggagat gctgcttagc tgatcccacc ttgacccaga ggaagcctct atttatttat
     2461 tagcttttgt ttacaccctg gaaatgaccc cttcctccag gggtcttggg agggggagcc
     2521 cagggcctct gtgacccctt cctttctttc cccaagcccc cagtttatat ttagctgcca
     2581 aataagattc ctgcaggctc ccccatttct ctggggggtt agggtgctgt cccctcccct
     2641 ctgtttatat ctcctacccc gctgttcgca tattgctgag ttttctattt ttgcaaaata
     2701 aagtgatgga aactc
//
LOCUS       MUSRARGB     2496 bp ss-mRNA            ROD       13-JUL-1990
DEFINITION  Mouse retinoic acid receptor gamma (RAR-gamma-B) mRNA, complete
            cds.
ACCESSION   M34475
KEYWORDS    retinoic acid receptor gamma.
SOURCE      Mouse embryo, cDNA to mRNA, clone lambda-mE7.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2496)
  AUTHORS   Giguere,V., Shago,M., Zirngibl,R., Tate,P., Rossant,J. and
            Varmuza,S.
  TITLE     Identification of a novel isoform of the retinoic acid receptor
            gamma expressed in the mouse embryo
  JOURNAL   Mol. Cell. Biol. 10, 2335-2340 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         67     1410     retinoic acid receptor gamma
    mRNA          1     2496     RAR-gamma-B mRNA
BASE COUNT      533 a    751 c    684 g    528 t
ORIGIN      
        1 aggaggggga atgggctgcg ggtccttggt gttctagcac ccagtttctc tccaagccgg
       61 gtcgcgatgt acgactgcat ggaatcgttt gtcccgggtc cgcgacggct gtatggggcg
      121 gccgggcccg gggccggctt actacgcaga gccactggca gctcctgctt cgccggactt
      181 gagtcttttg cctgggcaca gcctgccagt ctacaatcgg tggagacaca gagcaccagc
      241 tcggaggaga tggtacccag ctctccctca cccccaccac ctcctcgggt ctataagcca
      301 tgctttgtat gcaatgacaa gtcttctggc taccactatg gggtcagctc ctgtgaaggc
      361 tgcaagggct tcttcagacg cagcattcag aaaaacatgg tgtatacatg tcaccgtgac
      421 aaaaactgta tcatcaacaa ggtcaccaga aatcgatgcc agtactgcag gctacaaaag
      481 tgtttcgaag tgggcatgtc caaggaagct gtaaggaacg atcgaaacaa gaagaaaaag
      541 gaggtaaaag aggagggctc gcccgacagc tatgaactga gtccacagtt agaggaactc
      601 atcaccaagg tcagcaaagc ccaccaggag acttttccct cactctgcca gctgggcaag
      661 tacaccacga actccagtgc agatcaccgg gtgcagctgg acctggggct gtgggacaag
      721 ttcagcgagc tggccaccaa atgcatcatc aagattgtgg agtttgcgaa gcggctgcct
      781 ggttttacag ggctcagcat tgccgaccag atcacgctgc tcaaggctgc ttgtctggac
      841 atcctaatgc tgcggatctg tacaaggtat accccagagc aggacactat gacattctcg
      901 gatgggctga ccctgaaccg aacccagatg cacaatgctg gctttgggcc ccttacagac
      961 ctcgtctttg cctttgccgg gcagctgctg cccctggaga tggatgacac cgagactggg
     1021 ctacttagtg ctatctgcct catctgtgga gaccgaatgg acctggaaga gcccgagaag
     1081 gtggacaagc tgcaggagcc cctgctggaa gccctgaggc tctatgcccg gcgacgagac
     1141 ccagccaaac cctacatgtt cccaaggatg ctgatgaaaa tcaccgacct ccggggcatc
     1201 agcactaagg gagcagaaag ggctataacc ctgaagatgg agattccagg cccgatgcca
     1261 cccctgatcc gagagatgct ggagaacccg gagatgtttg aggacgactc ctcgaagcct
     1321 ggcccccacc ccaaggcttc cagtgaggac gaagctccag ggggccaggg caaaaggggc
     1381 caaagtcccc aacctgacca ggggccctga cctaccccgt tgtggggttg ggccccaggc
     1441 agcagactga ccatttccca gataccgcca gtgactgggg gaggacctgc cccgcccact
     1501 ctccaccctt ttaatgagct cgttattttg ccaaagtttc taggggtgcc tgtgttcatc
     1561 cctgtcctgt tctaactggc tccctctaca gtcccgcggg gatctgctgc acacctacca
     1621 gaagagctgg agaaggggcg agcctgggtc tagactctaa aatctcagca ctgcctttcg
     1681 gattcaggcc acaggctccc gacgcaagag gaagccccgc cttcccatag ccttttcctc
     1741 tgccaggtgc ttgggcctct gggagcaaac aggaacacta gagaccaaaa ggggggcccc
     1801 ccccgagggg agggctgagc cacccttgtg ccccaccctg agcgcctgac tctgtgtgat
     1861 gaatctgccg cgctcagtgc ccccggtgcc ccatcctaat cctcctgccg ggtgtagggg
     1921 caggctgggg cctgcgtttt ctagggtggg gcctgagggg aagggagaat gtgagcccga
     1981 gcccgtctgt acctctgctg aagtacagca ctcatcctgg gccctccatt atataagcct
     2041 ctcccttact tctgtctcgt gccttggcct cccgccctca tctcagccat caggcagggc
     2101 cctcctacac tacagagggg ccaggggtcc ctctcctagt gccttctacc cttgatcccc
     2161 ccagagcagc ttggccccgg ggaggggaga tgctgcttag ctgatcccac cttgacccag
     2221 aggaagcctc tatttattta ttagcttttg tttacaccct ggaaatgacc ccttcctcca
     2281 ggggtcttgg gagggggagc ccagggcctc tgtgacccct tcctttcttt ccccaagccc
     2341 ccagtttata tttagctgcc aaataagatt cctgcaggct cccccatttc tctggggggt
     2401 tagggtgctg tcccctcccc tctgtttata tctcctaccc cgctgttcgc atattgctga
     2461 gttttctatt tttgcaaaat aaagtgatgg aaactc
//

GenBank-Updates@genbank.bio.net (07/14/90)

LOCUS       RATCYP7      3535 bp ss-mRNA            ROD       14-JUL-1990
DEFINITION  Rat cholesterol 7-alpha-hydroxylase (CYP7) mRNA, complete cds.
ACCESSION   J05430
KEYWORDS    cholesterol 7-alpha-hydroxylase.
SOURCE      Rat (Sprague Dawley) adult male liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3535)
  AUTHORS   Jelinek,D.F., Andersson,S., Slaughter,C.A. and Russell,D.W.
  TITLE     Cloning and regulation of cholesterol 7-alpha-hydroxylase, the rate
            -limiting enzyme in bile acid biosynthesis
  JOURNAL   J. Biol. Chem. 265, 8190-8197 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W.Russell, 07-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         30     1541     cholesterol 7-alpha hydroxylase (CYP7) (EC
                                 1.14.13.17)
    rpt        2195     2313     Alu repeat
BASE COUNT     1035 a    705 c    696 g   1099 t
ORIGIN      
        1 cgctttggaa attttcctgc ttttgcaaaa tgatgactat ttctttgatt tggggaattg
       61 ccgtgttggt gagctgttgc atatggttta ttgttggaat aaggagaagg aaagctggtg
      121 aacctccttt ggagaacggg ttgattccgt acctgggctg tgctctgaaa tttggatcta
      181 atcctcttga gttcctaaga gctaatcaaa ggaagcatgg tcacgttttt acctgcaaac
      241 tgatggggaa atatgtccat ttcatcacaa actccctgtc ataccacaaa gtcttatgtc
      301 atggaaaata ttttgactgg aaaaaatttc attacactac ttctgcgaag gcatttggac
      361 acagaagcat tgacccaaat gatggaaata ccacggaaaa tataaacaac acttttacca
      421 aaaccctcca gggagatgct ctgtgttcac tttctgaagc catgatgcaa aacctccaat
      481 ctgtcatgag acctcctggc cttcctaaat caaagagcaa tgcctgggtc acggaaggga
      541 tgtatgcctt ctgttaccga gtgatgtttg aagctggcta tctaacactg tttggcagag
      601 atatttcaaa gacagacaca caaaaagcac ttattctaaa caaccttgac aacttcaaac
      661 aatttgacca agtctttccg gcactggtgg caggccttcc tattcacttg ttcaagaccg
      721 cacataaagc tcgggaaaag ctggctgagg gattgaagca caagaacctg tgtgtgaggg
      781 accaggtctc tgaactgatc cgtctacgta tgtttctcaa tgacacgctc tccacctttg
      841 acgacatgga gaaggccaag acgcacctcg ctattctctg ggcatctcaa gcaaacacca
      901 ttcctgcaac cttttggagc ttatttcaaa tgatcaggag tcctgaagca atgaaagcag
      961 cctctgaaga agtgagtgga gctttacaga gtgctggcca agagctcagc tctggaggga
     1021 gtgccattta cttggatcaa gtgcaactga atgacctgcc ggtactagac agcatcatca
     1081 aggaggctct gaggctttcc agtgcatcct tgaatatccg cacagctaag gaggacttca
     1141 ctctccatct tgaggacggt tcctataaca tccgaaaaga tgacatgata gctctttatc
     1201 cacagttaat gcacttggat cctgaaatct acccagaccc tttgactttc aaatatgacc
     1261 ggtaccttga tgaaagcggg aaagcaaaga ccaccttcta cagtaatgga aacaagctga
     1321 agtgtttcta catgcccttc ggatcaggcg cgacaatatg tcctggaaga ctctttgccg
     1381 tccaagaaat caagcagttt ttgatcctga tgctctcctg ctttgaactg gagtttgtgg
     1441 agagccaagt caagtgtccc cctctagacc agtcccgggc aggcttggga attttgccac
     1501 cactacatga tattgagttt aaatataaac tgaaacactg atacgtggtt ggaagaagcg
     1561 aacactggat gatgtcactt ggcggctgag agtcatcact aaacaggcct tcgggaccaa
     1621 tgctcactga tgcgccctag cgactggatt agtgggaaga actttgttct cgctgcccac
     1681 attcctgggt gttcacatag ctggggccag agcttcatca ctttcagaaa gcaatgtctt
     1741 ttgtatttat tttcaaaatg aagatattcc aattggcagg atatttttcc taaggaaatt
     1801 gctttatatt tttatgaaaa ctaccaatta attatgaaag ggcttgaaat tcacgtttta
     1861 gtgaaattac tgatttttca ctagtaaggt tcttcaggtg tgaaactgta ttataaaaat
     1921 gttgtaatgg gtcacactgt gctttgcata aaggtaaagg aaactatgtt tcagcctttt
     1981 ctgtgtctat gagcttcgaa aataatctta ctgttctaga aacactgggg aggtttcgac
     2041 atgctctcgc tatattttat tttactgttg ctagaaattt tcattccagt tttcaactac
     2101 cttatctttc ccccattttg acatgcatgc caatgagaag agtatttttt aggaattaac
     2161 aaggcacctc ccagaaccct accctgagac ttttaagcct ttaatcccag cactcgagaa
     2221 gtagagccag gcagatctct gagtctgagg ttattctggt ctacatcagc tccagacaag
     2281 ccaggactac agaatgggat cttgtctaaa aaatacagct aatctttatg tcataactga
     2341 ttatgaatca acctaaaaga taaattttca atcaggactc agagaaaatg agcaattaaa
     2401 aaacttagct ctgaggtatg tggaattcat taagtacaag ttgacattac atgttcttta
     2461 aaaatagttt atgttttatc tctaaatgcc ctgcagatga agaataataa tgaaaagttg
     2521 aataatactg tttaaacact aagtgcaata atgctttggt aatgtacttt aagagaatca
     2581 ttagccgtgc cagttttact aaaatatatt tatatgtaaa ttatatttat ctttttctta
     2641 taccataaat ataaaaatat tgcaacattt agtaatttta aaattatata cctttcagaa
     2701 aatgatgtat gatgtttgta tgttttttaa ctttgaacag aacatttaaa ttattcatct
     2761 acggtgattt ttatcttatt tatttctttt tgtctcattc atatcttgaa gaaatccaaa
     2821 aatatctgaa ggaatcgctc actcaaatgt ctccctatgg ttacagaaaa attcaatacc
     2881 atgtttttgt cctcggggac tgaagcaggg tgtcgtgggt gcgagcagag gctcctgctg
     2941 cagcgagctt tatccacggg actccttaaa cttttaaaat cttatcacta ttatcatgca
     3001 tttattacct aagtaggata tttccctttc ctttttcatt tcagcagagt cccttagcaa
     3061 cccaggctga ctgggaccct ccatgtagct taagctgtga actcactgta cttcctgttt
     3121 tcacttattt taggaagtaa ttttccctat cagaaatttt aattgtttag atgatgtata
     3181 agagtaacac aattctgtta tatactaatc tgtagtaaac taaatttgtt cttagaacaa
     3241 gtttgatgac tctcaaattg aatgtatcca tacatctttc catggcttct tgaatgccca
     3301 tttctcatac acagaatgat gggtttcacg gtgatgtctt cctttcatgt ctttattctt
     3361 gtgcggtgat ggttggcaaa tgatacccat ggagcaaggt tactcttcct atttctgtgc
     3421 agcctaagtg ttaagaataa tttttaaata cttggaggga aggcacattt tgtgtcatat
     3481 gtgaagtgac atgtgacaca cagactagca aatccatgag taaaatttta ttggg
//
LOCUS       PDECYT550    1047 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  P.denitrificans cytochrome c550 gene, complete cds, and
            iso-cytochrome oxidase subunit I (iso-COI) gene, 5' end.
ACCESSION   M27304
KEYWORDS    cytochrome c550; cytochrome oxidase.
SOURCE      P.denitrificans (strain NCIB 8944) DNA, clone pRS59.
  ORGANISM  Paracoccus denitrificans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1047)
  AUTHORS   Van Spanning,R.J.M., Wansell,C., Harms,N., Oltmann,L.F. and
            Stouthamer,A.H.
  TITLE     Mutagenesis of the gene encoding cytochrome c-550 of Paracoccus
            denitrificans and analysis of the resultant physiological effects
  JOURNAL   J. Bacteriol. 172, 986-996 (1990)
  STANDARD  full staff_review
REFERENCE   2  (sites; amino acid correction for [1])
  AUTHORS   Van Spanning,R.J.M., Wansell,C., Harms,N., Oltmann,L.F. and
            Stouthamer,A.H.
  TITLE     Errata: Mutagenesis of the gene encoding cytochrome c-550 of
            Paracoccus denitrificans and analysis of the resultant
            physiological effects
  JOURNAL   J. Bacteriol. 172, 3534-3534 (1990)
  STANDARD  full staff_review
COMMENT     Printed sequence for [1] kindly submitted by R.J.M.Van Spanning,
            25-AUG-1989.
FEATURES       from  to/span     description
    pept        235      702     cytochrome c550 precursor
    pept        874  >  1047     iso-cytochrome oxidase subunit I
    sigp        235      294     signal peptide cytochrome c550
    matp        295      699     cytochrome c550
    ORF     <     1       90     open reading frame (AA at 1)
    rpt         734      743     inverted repeat copy A
    rpt         757      766     inverted repeat copy B
BASE COUNT      215 a    340 c    309 g    183 t
ORIGIN      1 bp upstream of SalI site.
        1 gtcgacatgg gcctgccggt cggcatggat gacctggacc tggccctgcg ccgcagcttc
       61 gcacggaact ttccgccgct cgccggctga ttttcggccc tgcgacatat tctccctcag
      121 ccgttctacg acgtttttcc tgtattctgc ccgcttggca catgatagcc ctgtcaatcg
      181 gggaagcgga gaggtccgtt gcccggacat gagcgatacc caagaggaaa cgcgatgaag
      241 atcagcatct atgccactct cgccgccatc accctcgccc tgcccgctgc ggcccaggat
      301 ggcgacgccg ccaaaggcga gaaagaattc aacaagtgca aggcttgcca catgatccag
      361 gcgccggacg gcaccgacat catcaagggc ggcaagaccg ggcccaacct ttacggcgtc
      421 gtcggccgca agatcgcctc ggaggagggc ttcaaatacg gcgaaggcat cctcgaggtc
      481 gccgaaaaga accccgacct gacctggacc gaggccgacc tgatcgaata cgtcaccgac
      541 cccaagccct ggctggtcaa gatgaccgac gacaagggcg ccaagaccaa gatgaccttc
      601 aagatgggca agaaccaggc cgacgtggtg gccttcctgg cccagaactc gcccgatgcg
      661 ggcggcgacg gcgaggctgc ggccgagggc gaatcgaact gatcccgacg cctccgagat
      721 gtctcgacgg ccccgcccct gcgtgcgaca aaagcgcgca ggggcgcgac ttttagtcct
      781 ttcttcgcat tgcgaggggt aggattgtgg tctagaaaca ggcgagtccg tcggcctttg
      841 cgccggccag ccgcatgtct agggagtcca cgcatggcag acgcagccgt tcacggccac
      901 ggtgaccatc atgacacccg cgggttcttc acccgctggt tcatgtcaac aaaccacaag
      961 gatatcggta tcctttacct gttcacggcc ggcatcgtcg gcctgatctc ggtatgcttc
     1021 accgtctata tgcggatgga actgcag
//
LOCUS       ECOSPEC      2330 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  E.coli ornithine decarboxylase (speC) and cyclic AMP receptor
            protein genes, complete cds.
ACCESSION   M33766
KEYWORDS    cyclic AMP receptor protein; ornithine decarboxylase; speC gene.
SOURCE      E.coli (strain K12) DNA, clone pLC20-5.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2330)
  AUTHORS   Barroso,L., Moore,R., Wright,J., Patel,T. and Boyle,S.M.
  TITLE     Analysis and sequence of the speC (ornithine decarboxylase) gene of
            Escherichia coli
  JOURNAL   Unpublished (1990)
  STANDARD  unannotated staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.M.Boyle, 22-MAY-1990.
            
               Author address: S.M.Boyle
               Virginia Polytechnic Institute and State University
               College of Veterinary Medicine, VPI & SU
               Blacksburg, VA 24061
FEATURES       from  to/span     description
    pept         86     2281     ornithine decarboxylase (speC) (EC 4.1.1.17)
    pept        719      630 (c) cyclic AMP receptor protein (CRP)
    mRNA         15     2289     speC mRNA (alt., primer extension)
    mRNA         16     2289     speC mRNA (alt., S1 mapping)
    mRNA         27     2289     speC mRNA (alt., primer extension)
    mRNA         30     2289     speC mRNA (alt., S1 mapping)
    mRNA         41     2289     speC mRNA (alt., S1 mapping)
    mRNA         42     2289     speC mRNA (alt., primer extension)
    signal       14       19     -10 region
    signal      733      728 (c) -10 region
    signal      755      750 (c) -35 region
    signal     2289     2320     rho-independent transcription termination
                                 signal
    binding     364      367     pyridoxal-PO4 binding site
    binding      66       72     GTP phosphoryl binding site
    binding     263      266     GTP phosphoryl binding site
    binding     379      382     GTP guanine binding site
BASE COUNT      508 a    546 c    664 g    612 t
ORIGIN      
        1 ctgcaggaca acagtaaaat cagagcgttt ctgcttttac tgatgtctgg cggtcggagc
       61 tggtgaccag tttgacccat atctcatggg gcagggtttt ccaccttgtc cggtattctt
      121 acttccccga aacgggtttg cgcttatgaa atcaatgaat attgccgcca gtagtgaact
      181 ggtatcccga ctttcttctc atcgtcgcgt ggtggcgttg ggagatactg attttacgga
      241 cgtcgcggca gtcgtcatta ccgctgcgga tagtcgcagt ggcattcttg cgttgcttaa
      301 gcgcaccggt tttcatctac cggtgttttt gtattccgaa catgctgttg aattacctgc
      361 gggcgttacg gcggtaatca acggcaacga gcagcagtgg ctggagctgg aatccgcagc
      421 ctgtcagtat gaagagaatt tgctgccacc gttttatgac acgctgacgc agtacgttga
      481 gatgggcaac agcacctttg cttgccctgg acatcaacat ggtgcgtttt ttaaaaagca
      541 tcctgccgga cgccattttt acgatttctt tggtgagaac gtctttcgcg ccgatatgtg
      601 taacgctgac gtaaaattgg gcgatctgct tattcatgaa ggatcggcga aagatgcgca
      661 gaaattcgca gccaaagtct ttcatgccga taaaacctat tttgtgctga acggcacatc
      721 ggcagcgaat aaagtggtga cgaatgcgct gttaacgcgt ggcgatctgg tgctcttcga
      781 ccgtaacaac cataagtcga atcatcacgg cgcgctgatt caggcggggg cgacgccggt
      841 ctatctggaa gcttcacgca acccgtttgg tttcattggc ggtattgatg cgcactgttt
      901 taatgaagag tatctgcgcc agcaaattcg cgacgttgcg ccagaaaaag ccgacctgcc
      961 gcgcccgtat cgcctggcga ttattcagct gggaacctat gacggcactg tctataacgc
     1021 ccgtcaggtg atcgataccg ttgggcatct gtgtgattac attctgtttg attccgcgtg
     1081 ggtcggttat gaacaattta tcccgatgat ggcggatagc tcgccgctgc tgttagaact
     1141 taacgaaaac gatccgggga tctttgtgac tcagtcggtg cacaaacagc aggcgggatt
     1201 ctcacagacg tcgcagatcc ataaaaaaga taaccatatc cgcggacagg cgcgtttttg
     1261 cccgcataag cggttgaata acgcctttat gctccatgct tctaccagcc ctttctatcc
     1321 gctgtttgct gcactggatg ttaacgccaa aattcatgaa ggggagagtg ggcgtcggct
     1381 gtgggctgag tgtgttgaga tagggattga agcgcgcaag gctattcttg cgcgctgtaa
     1441 gctgttccgc ccgtttatcc cgcccgttgt tgatggcaaa ttgtggcagg attatccgac
     1501 atcagtgtta gccagcgacc gccgtttttt cagttttgag ccgggggcga agtggcacgg
     1561 ctttgaagga tatgccgcgg atcagtattt tgttgatccg tgcaagctgt tactcactac
     1621 accaggtatc gatgccgaaa ccggcgaata tagcgacttt ggcgttccgg cgacgattct
     1681 ggcgcactat ctgcgtgaga acggcattgt gccggagaag tgcgatctca actccattct
     1741 gtttttatta actccggcgg aaagccacga gaagctggca caactggtgg cgatgctggc
     1801 gcaatttgaa cagcatattg aggatgactc gccgctggtt gaggtgttgc cgagcgttta
     1861 taacaagtat ccggtgcgct atcgcgacta caccctgcgc cagttgtgtc aggagatgca
     1921 cgatctgtat gtcagtttcg acgtcaaaga cctacaaaaa gcgatgttcc gccagcagag
     1981 tttcccgtca gtggtgatga acccccagga tgcgcatagc gcttatattc gcggtgacgt
     2041 ggagttggtg cggattcgtg atgccgaagg gcgaattgcg gcagaagggg cgttgcctta
     2101 tccacctggc gtgctttgcg tggtacccgg ggaagtctgg ggtggggcgg ttcaacgtta
     2161 tttccttgca ctggaagaag gggtgaattt gttgccggga ttttcgccgg agctgcaagg
     2221 tgtttatagc gaaaccgatg cggatggcgt gaaacggttg tacggttatg tgttgaagta
     2281 agaataaaaa aaacgggtca ccttctggcg acccgttttt ctttgcgtaa
//
LOCUS       HUMLPLAS      557 bp ss-mRNA            PRI       14-JUL-1990
DEFINITION  Human L-plastin mRNA, 5' end.
ACCESSION   M34426
KEYWORDS    L-plastin.
SOURCE      Human, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 557)
  AUTHORS   Lin,C.-S., Aebersold,R.H. and Leavitt,J.
  TITLE     Correction of the N-terminal sequences of the human plastin
            isoforms by using anchored polymerase chain reaction:
            Identification of a potential calcium-binding domain
  JOURNAL   Mol. Cell. Biol. 10, 1818-1821 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        138  >   557     L-plastin
BASE COUNT      174 a    111 c    133 g    139 t
ORIGIN      
        1 acttcctgcc ttgtgaccac acacccaggc ttgacaaagc tgttctgcag atcagaaaga
       61 aggggttcct ggtcatacac cagtactacc aaggacagct tttttcctgc aagatctgtt
      121 acctaaagca ataaaaaatg gccagaggat cagtgtccga tgaggaaatg atggagctca
      181 gagaagcttt tgccaaagtt gatactgatg gcaatggata catcagcttc aatgagttga
      241 atgacttgtt caaggctgct tgcttgcctt tgcctgggta tagagtacga gaaattacag
      301 aaaacctgat ggctacaggt gatctggacc aagatggaag gatcagcttt gatgagttta
      361 tcaagatttt ccatggccta aaaagcacag atgttgccaa gacctttaga aaagcaatca
      421 ataagaagga agggatttgt gcaatcggtg gtacttcaga gcagtctagc gttggcaccc
      481 aacactccta ttcagaggaa gaaaagtatg cctttgtcaa ctggataaac aaagccctgg
      541 aaaatgatcc tgattgt
//
LOCUS       HUMTPLAS      516 bp ss-mRNA            PRI       14-JUL-1990
DEFINITION  Human T-plastin mRNA, 5' end.
ACCESSION   M34427
KEYWORDS    T-plastin.
SOURCE      Human, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 516)
  AUTHORS   Lin,C.-S., Aebersold,R.H. and Leavitt,J.
  TITLE     Correction of the N-terminal sequences of the human plastin
            isoforms by using anchored polymerase chain reaction:
            Identification of a potential calcium-binding domain
  JOURNAL   Mol. Cell. Biol. 10, 1818-1821 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         88  >   516     T-plastin
BASE COUNT      169 a     85 c    125 g    137 t
ORIGIN      
        1 aaagattccg aggtgcagaa gttgtctgag tgggttggtc ggcggcagtc gggccagacc
       61 caggactctg cgactttaca tctttaaatg gatgagatgg ctaccactca gatttccaaa
      121 gatgagcttg atgaactcaa agaggccttt gcaaaagttg atctcaacag caacggattc
      181 atttgtgact atgaacttca tgagctcttc aaggaagcta atatgccatt accaggatat
      241 aaagtgagag aaattattca gaaactcatg ctggatggtg acaggaataa agatgggaaa
      301 ataagttttg acgaatttgt ttatattttt caagaggtaa aaagtagtga tattgccaag
      361 accttccgca aagcaatcaa caggaaagaa ggtatttgtg ctctgggtgg aacttcagag
      421 ttgtccagcg aaggaacaca gcattcttac tcagaggaag aaaaatatgc ttttgttaac
      481 tggataaaca aagctttgga aaatgatcct gattgt
//
LOCUS       MUSICSBP     2544 bp ss-mRNA            ROD       14-JUL-1990
DEFINITION  Mouse interferon consensus sequence binding protein mRNA, complete
            cds.
ACCESSION   M32489
KEYWORDS    DNA binding protein; interferon consensus sequence binding protein.
SOURCE      Mouse (strain Balb/c; haplotype H-2d) neonatal spleen and lung,
            cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2544)
  AUTHORS   Driggers,P.H., Ennist,D.L., Gleason,S.L., Mak W,-H., Marks,M.S.,
            Levi,B.-Z., Flanagan,J.R., Appella,E. and Ozato,K.
  TITLE     An interferon gamma-regulated protein that binds the
            interferon-inducible enhancer element of major histocompatibility
            complex class I genes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3743-3747 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Ozato, 23-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         21     1295     interferon consensus sequence binding protein
    binding      21      362     DNA binding site
BASE COUNT      584 a    680 c    739 g    541 t
ORIGIN      
        1 gcgtgggaac cggcggcagg atgtgtgacc ggaacggcgg gcggcggctg cggcagtggc
       61 tgatcgaaca gatcgacagc agcatgtacc cggggctgat ctgggaaaat gatgagaaga
      121 ccatgttccg tatcccctgg aagcatgccg gcaagcagga ttacaatcag gaggtggatg
      181 cttccatctt caaggcctgg gcagttttta aagggaagtt taaagaggga gacaaagctg
      241 aaccagccac gtggaagacg aggttacgct gtgctctgaa caagagccca gattttgaag
      301 aagtgactga ccggtcccag ctggacattt ctgagccata taaagtttac cgaattgtcc
      361 ccgaggaaga acaaaaatgc aagctgggcg tggcacctgc aggctgcatg agcgaagttc
      421 ctgagatgga gtgtggccgc tcagagattg aggagctgat caaggaacct tctgtggatg
      481 agtacatggg tatgaccaag aggagcccat ccccaccaga ggcctgcagg agccagatcc
      541 tccctgactg gtgggtccag cagcccagtg caggcctgcc actggtgacc ggatatgccg
      601 cctatgacac acaccattca gctttctccc agatggtcat cagcttctac tacgggggca
      661 agctggtggg ccaggccacc accacctgcc ttgaaggctg ccgtctctcc ctgagccagc
      721 cggggctgcc taagttgtat gggccggatg gcctggaacc cgtgtgcttt ccgacggccg
      781 acaccatccc cagtgagcgg cagaggcagg tgacccggaa gctgtttggg cacctggaac
      841 gtggcgtgct actgcacagc aaccgcaagg gcgtgttcgt gaagcggctg tgccagggcc
      901 gcgtgttctg cagcggcaac gcggtggtgt gcaagggcag gcccaacaag ctggagcggg
      961 acgaggtggt gcaggtcttt gacaccaacc agttcatccg agagctgcag caattctacg
     1021 ccacccagag ccgcctacct gacagcaggg tggtcctgtg cttcggggag gagtttccgg
     1081 acactgtgcc cttgcgctcc aaactcattc tggtgcaggt agagcagctg tatgccaggc
     1141 agctggtgga ggaagcgggc aagagctgcg gtgctggctc cctgatgcca gccctggagg
     1201 agccccagcc ggaccaggct ttccgcatgt ttccggatat ctgtacctca caccagagac
     1261 ccttttttag agaaaatcaa cagatcaccg tctaagcctc agtccgggca ccccacctcg
     1321 cctgagctca agcttcaaga gtctgtgact aagagaattc cgaaaggatg tggagccctc
     1381 tgactggggt gggcgggtgt cctccaaggg gcctccggaa gcccacagag ggatgcgctc
     1441 ctgctcaggc aggtgtcaga agcttgcagg ggctgtggcc gcaacctgtg attaaagcat
     1501 tccttttcct tacactattt cacccctcac cactaatggc tggcctttct gtgtgctgag
     1561 gtctttcgac agttcaaatc atctggtggc agcagactcg cctttgccct tctgcggccg
     1621 agggcggaga tttatgactt tctctgcttg gttggagaag aagaatcttt actattcagc
     1681 ttcttttctt ttggccagaa ctctgaaaaa aaaaaaaaac tcttttttaa gaccaatatt
     1741 tgtattctca caggctcagc tgtcaatcac ttgagacctt ccctgtaaag tggggcagat
     1801 tttaaatatg ggtgtagata ctcgttgcag ccttcgcagg aattttggtt gtggttcatt
     1861 gattcacaca gactctgtgt cagctgacag ggctgtgtgg ggcatcaaag gaggaccagg
     1921 cactgtggag aagacccatt cactggcatc tcacccttcc ttgtccagct ccatacccag
     1981 tcctaagacc cagtgaaaag ccacgtccaa actgtgctct gggctcatca gtgcccaccc
     2041 acgtaccagg gaaaggcaca cacccatccc agtgggcaca gagcggaatg tccccctacc
     2101 gcaccatttg cgcccccaat ctggctgtcc aacctagttt gtaagtaatc taaatcagtg
     2161 actatagccc cgcctaaggg acacttcccg gaggagggag ccgctgaaaa ggagttagtt
     2221 tgagggtcag tacacaacag gggcagaaag ccaagcagat gtgggggcag ggaggtcatc
     2281 atctgctttt gtctgagaga aggagagctt ctccgtttgt tcaactttgt aacaagctgg
     2341 gttacatgct ccacgcagct agagaagcct aggtgctctg cattccctgg ggaactgcag
     2401 gaaagcctta cctgctgact gttgctctgg ggaaaagcct gagggtccag agcagctaca
     2461 agctacaggc cataccttac aacctgaaaa gctaaggacc acggtgacct tcccggctac
     2521 tgtgtgaagg tgctgggtgg ggcc
//
LOCUS       DDIUMPCMP     694 bp ss-mRNA            INV       14-JUL-1990
DEFINITION  D.discoideum UMP-CMP kinase mRNA, complete cds.
ACCESSION   M34568
KEYWORDS    UMP-CMP kinase.
SOURCE      D.discoideum, cDNA to mRNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 694)
  AUTHORS   Wiesmueller,L., Noegel,A.A., Barzu,O., Gerisch,G. and Schleicher,M.
  TITLE     cDNA-derived sequence of UMP-CMP kinase from Dictyostelium
            discoideum and expression of the enzyme in Escherichia coli
  JOURNAL   J. Biol. Chem. 265, 6339-6345 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         54      638     UMP-CMP kinase
    mRNA    <     1      694     UMP-CMP kinase mRNA
BASE COUNT      287 a     74 c    110 g    223 t
ORIGIN      
        1 tttaaatata taaattttgt aaaacaactt caaatttttt aaattcaaga atgatggaaa
       61 aaagtaaacc aaatgttgta tttgttttag gtggaccagg ttcaggtaaa ggtactcaat
      121 gcgccaacat tgtcagagat tttggatggg tacatttatc agcaggtgat ttattacgtc
      181 aagaacaaca atctggttca aaagatggtg aaatgattgc aacaatgatc aaaaatggtg
      241 aaatcgttcc atcaattgta actgtaaaat tattaaagaa tgctattgat gctaatcaag
      301 gtaaaaactt tttagttgat ggtttcccac gtaatgaaga aaataataat tcttgggaag
      361 aaaatatgaa agattttgtt gatacaaaat ttgttttatt ctttgattgt ccagaagaag
      421 ttatgactca aagattatta aagagaggtg aatcaagtgg tagatcagat gataatattg
      481 aatcaattaa aaagagattc aatacattca acgttcaaac taaattagtt atcgatcatt
      541 ataataaatt tgacaaagtt aaaatcattc cagcaaatag agatgttaat gaagtttata
      601 atgatgttga aaaccttttt aaatcaatgg gtttttaaaa aaaaaaaaaa aaaaaccaat
      661 aaattttaaa attactaaat aataaatata cccc
//
LOCUS       HUMBGALXX    2409 bp ss-mRNA            PRI       14-JUL-1990
DEFINITION  Human beta-galactosidase (GLB1) mRNA, complete cds.
ACCESSION   M34423
KEYWORDS    beta-galactosidase.
SOURCE      Human testis, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2409)
  AUTHORS   Yamamoto,Y., Hake,C.A., Martin,B.M., Kretz,K.A.,
            Ahern-Rindell,A.J., Naylor,S.L., Mudd,M. and O'Brien,J.S.
  TITLE     Isolation, characterization, and mapping of a human acid beta-
            galactosidase cDNA
  JOURNAL   DNA 9, 119-127 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         61     2094     beta-galactosidase precursor (EC 3.2.1.23)
                                 /nomgen="GLB1" /map="3pter-p21"
                                 /hgml_locus_uid="LA0059G "
    sigp         61      129     beta-galactosidase signal peptide (put.)
    matp        130     2091     beta-galactosidase
BASE COUNT      566 a    635 c    613 g    595 t
ORIGIN      
        1 gaattcgggc gcgaagcggc cggcctgggc gccgactgca gagccgggag gctggtggtc
       61 atgccggggt tcctggttcg catcctcctt ctgctgctgg ttctgctgct tctgggccct
      121 acgcgcggct tgcgcaatgc cacccagagg atgtttgaaa ttgactatag ccgggactcc
      181 ttcctcaagg atggccagcc atttcgctac atctcaggaa gcattcacta ctcccgtgtg
      241 ccccgcttct actggaagga ccggctgctg aagatgaaga tggctgggct gaacgccatc
      301 cagacgtatg tgccctggaa ctttcatgag ccctggccag gacagtacca gttttctgag
      361 gaccatgatg tggaatattt tcttcggctg gctcatgagc tgggactgct ggttatcctg
      421 aggcccgggc cctacatctg tgcagagtgg gaaatgggag gattacctgc ttggctgcta
      481 gagaaagagt ctattcttct ccgctcctcc gacccagatt acctggcagc tgtggacaag
      541 tggttgggag tccttctgcc caagatgaag cctctcctct atcagaatgg agggccagtt
      601 ataacagtgc aggttgaaaa tgaatatggc agctactttg cctgtgattt tgactacctg
      661 cgcttcctgc agaagcgctt tcgccaccat ctgggggatg atgtggttct gtttaccact
      721 gatggagcac ataaaacatt cctgaaatgt ggggccctgc agggcctcta caccacggtg
      781 gactttggaa caggcagcaa catcacagat gctttcctaa gccagaggaa gtgtgagccc
      841 aaaggaccct tgatcaattc tgaattctat actggctggc tagatcactg gggccaacct
      901 cactccacaa tcaagaccga agcagtggct tcctccctct atgatatact tgcccgtggg
      961 gcgagtgtga acttgtacat gtttataggt gggaccaatt ttgcctattg gaatggggcc
     1021 aactcaccct atgcagcaca gcccaccagc tacgactatg atgccccact gagtgaggct
     1081 ggggacctca ctgagaagta ttttgctctg cgaaacatca tccagaagtt tgaaaaagta
     1141 ccagaaggtc ctatccctcc atctacacca aagtttgcat atggaaaggt cactttggaa
     1201 aagttaaaga cagtgggagc agctctggac attctgtgtc cctctgggcc catcaaaagc
     1261 ctttatccct tgacatttat ccaggtgaaa cagcattatg ggtttgtgct gtaccggaca
     1321 acacttcctc aagattgcag caacccagca cctctctctt cacccctcaa tggagtccac
     1381 gatcgagcat atgttgctgt ggatgggatc ccccagggag tccttgagcg aaacaatgtg
     1441 atcactctga acataacagg gaaagctgga gccactctgg accttctggt agagaacatg
     1501 ggacgtgtga actatggtgc atatatcaac gattttaagg gtttggtttc taacctgact
     1561 ctcagttcca atatcctcac ggactggacg atctttccac tggacactga ggatgcagtg
     1621 cgcagccacc tggggggctg gggacaccgt gacagtggcc accatgatga agcctgggcc
     1681 cacaactcat ccaactacac gctcccggcc ttttatatgg ggaacttctc cattcccagt
     1741 gggatcccag acttgcccca ggacaccttt atccagtttc ctggatggac caagggccag
     1801 gtctggatta atggctttaa ccttggccgc tattggccag cccggggccc tcagttgacc
     1861 ttgtttgtgc cccagcacat cctgatgacc tcggccccaa acaccatcac cgtgctggaa
     1921 ctggagtggg caccctgcag cagtgatgat ccagaactat gtgctgtgac gttcgtggac
     1981 aggccagtta ttggctcatc tgtgacctac gatcatccct ccaaacctgt tgaaaaaaga
     2041 ctcatgcccc cacccccgca aaaaaacaaa gattcatggc tggaccatgt atgatgatga
     2101 aagcctgtgt ctttgaggga ttctaccctg aacatacctc acagatcctc cctgtcatgc
     2161 cacatttcac tgattggaat gtggaaatgg aaaaggaatt taggatgtgc attttcacct
     2221 gaggtttccc tgcatccctg cagtgccaaa gccccacctt cagggaccac ctggaatgtg
     2281 tgaggggctg acagcacagt aacgtgcata catatctgca gggctggaat ggaagcttta
     2341 aaggtggtag tgatttttat tttggaagaa tcatgttacc tttttgttaa ataaaatttg
     2401 cccgaattc
//
LOCUS       HUMGAA       3441 bp ss-mRNA            PRI       14-JUL-1990
DEFINITION  Human acid alpha-glucosidase (GAA) mRNA, complete cds.
ACCESSION   M34424
KEYWORDS    acid alpha-glucosidase; acid maltase.
SOURCE      Human, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3441)
  AUTHORS   Martiniuk,F., Mehler,M., Tzall,S., Meredith,G. and Hirschhorn,R.
  TITLE     Sequence of the cDNA and 5'-flanking region for human acid alpha-
            glucosidase, detection of an intron in the 5' untranslated leader
            sequence, definition of 18-bp polymorphisms, and differences with
            previous cDNA and amino acid sequences
  JOURNAL   DNA 9, 85-94 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         33     2891     acid alpha-glucosidase /nomgen="GAA"
                                 /map="17q23" /hgml_locus_uid="LZ0096H"
BASE COUNT      612 a   1170 c   1027 g    632 t
ORIGIN      
        1 gcctgtagga gctgtccagg ccatctccaa ccatgggagt gaggcacccg ccctgctccc
       61 accggctcct ggccgtctgc gccctcgtgt ccttggcaac cgctgcactc ctggggcaca
      121 tcctactcca tgatttcctg ctggttcccc gagagctgag tggctcctcc ccagtcctgg
      181 aggagactca cccagctcac cagcagggag ccagcagacc agggccccgg gatgcccagg
      241 cacaccccgg ccgtcccaga gcagtgccca cacagtgcga cgtccccccc aacagccgct
      301 tcgattgcgc ccctgacaag gccatcaccc aggaacagtg cgaggcccgc ggctgctgct
      361 acatccctgc aaagcagggg ctgcagggag cccagatggg gcagccctgg tgcttcttcc
      421 cacccagcta ccccagctac aagctggaga acctgagctc ctctgaaatg ggctacacgg
      481 ccaccctgac ccgtaccacc cccaccttct tccccaagga catcctgacc ctgcggctgg
      541 acgtgatgat ggagactgag aaccgcctcc acttcacgat caaagatcca gctaacaggc
      601 gctacgaggt gcccttggag accccgcatg tccacagccg ggcaccgtcc ccactctaca
      661 gcgtggagtt ctccgaggag cccttcgggg tgatcgtgcg ccggcagctg gacggccgcg
      721 tgctgctgaa cacgacggtg gcgcccctgt tctttgcgga ccagttcctt cagctgtcca
      781 cctcgctgcc ctcgcagtat atcacaggcc tcgccgagca cctcagtccc ctgatgctca
      841 gcaccagctg gaccaggatc accctgtgga accgggacct tgcgcccacg cccggtgcga
      901 acctctacgg gtctcaccct ttctacctgg cgctggagga cggcgggtcg gcacacgggg
      961 tgttcctgct aaacagcaat gccatggatg tggtcctgca gccgagccct gcccttagct
     1021 ggaggtcgac aggtgggatc ctggatgtct acatcttcct gggcccagag cccaagagcg
     1081 tggtgcagca gtacctggac gttgtgggat acccgttcat gccgccatac tggggcctgg
     1141 gcttccacct gtgccgctgg ggctactcct ccaccgctat cacccgccag gtggtggaga
     1201 acatgaccag ggcccacttc cccctggacg tccagtggaa cgacctggac tacatggact
     1261 cccggaggga cttcacgttc aacaaggatg gcttccggga cttcccggcc atggtgcagg
     1321 agctgcacca gggcggccgg cgctacatga tgatcgtgga tcctgccatc agcagctcgg
     1381 gccctgccgg gagctacagg ccctacgacg agggtctgcg gaggggggtt ttcatcacca
     1441 acgagaccgg ccagccgctg attgggaagg tatggcccgg gtccactgcc ttccccgact
     1501 tcaccaaccc cacagccctg gcctggtggg aggacatggt ggctgagttc catgaccagg
     1561 tgcccttcga cggcatgtgg attgacatga acgagccttc caacttcatc aggggctctg
     1621 aggacggctg ccccaacaat gagctggaga acccacccta cgtgcctggg gtggttgggg
     1681 ggaccctcca ggcggccacc atctgtgcct ccagccacca gtttctctcc acacactaca
     1741 acctgcacaa cctctacggc ctgaccgaag ccatcgcctc ccacagggcg ctggtgaagg
     1801 ctcgggggac acgcccattt gtgatctccc gctcgacctt tgctggccac ggccgatacg
     1861 ccggccactg gacgggggac gtgtggagct cctgggagca gctcgcctcc tccgtgccag
     1921 aaatcctgca gtttaacctg ctgggggtgc ctctggtcgg ggccgacgtc tgcggcttcc
     1981 tgggcaacac ctcagaggag ctgtgtgtgc gctggaccca gctgggggcc ttctacccct
     2041 tcatgcggaa ccacaacagc ctgctcagtc tgccccagga gccgtacagc ttcagcgagc
     2101 cggcccagca ggccatgagg aaggccctca ccctgcgcta cgcactcctc ccccacctct
     2161 acacactgtt ccaccaggcc cacgtcgcgg gggagaccgt ggcccggccc ctcttcctgg
     2221 agttccccaa ggactctagc acctggactg tggaccacca gctcctgtgg ggggaggccc
     2281 tgctcatcac cccagtgctc caggccggga aggccgaagt gactggctac ttccccttgg
     2341 gcacatggta cgacctgcag acggtgccaa tagaggccct tggcagcctc ccacccccac
     2401 ctgcagctcc ccgtgagcca gccatccaca gcgaggggca gtgggtgacg ctgccggccc
     2461 ccctggacac catcaacgtc cacctccggg ctgggtacat catccccctg cagggccctg
     2521 gcctcacaac cacagagtcc cgccagcagc ccatggccct ggctgtggcc ctgaccaagg
     2581 gtggggaggc ccgaggggag ctgttctggg acgatggaga gagcctggaa gtgctggagc
     2641 gaggggccta cacacaggtc atcttcctgg ccaggaataa cacgatcgtg aatgagctgg
     2701 tacgtgtgac cagtgaggga gctggcctgc agctgcagaa ggtgactgtc ctgggcgtgg
     2761 ccacggcgcc ccagcaggtc ctctccaacg gtgtccctgt ctccaacttc acctacagcc
     2821 ccgacaccaa ggtcctggac atctgtgtct cgctgttgat gggagagcag tttctcgtca
     2881 gctggtgtta gccgggcgga gtgtgttagt ctctccagag ggaggctggt tccccaggga
     2941 agcagagcct gtgtgcgggc agcagctgtg tgcgggcctg ggggttgcat gtgtcacctg
     3001 gagctgggca ctaaccattc caagccgccg catcgcttgt ttccacctcc tgggccgggg
     3061 ctctggcccc caacgtgtct aggagagctt tctccctaga tcgcactgtg ggccggggcc
     3121 tggagggctg ctctgtgtta ataagattgt aaggtttgcc ctcctcacct gttgccggca
     3181 tgcgggtagt attagccacc cccctccatc tgttcccagc accggagaag ggggtgctca
     3241 ggtggaggtg tggggtatgc acctgagctc ctgcttcgcg cctgctgctc tgccccaacg
     3301 cgaccgctgc ccggctgccc agagggctgg atgcctgccg gtccccgagc aagcctggga
     3361 actcaggaaa attcacagga cttgggagat tctaaatctt aagtgcaatt atttttaata
     3421 aaaggggcat ttggaatcaa a
//
LOCUS       HUMGAAX       298 bp ds-DNA             PRI       14-JUL-1990
DEFINITION  Human acid alpha-glucosidase (GAA) gene, 5' flank.
ACCESSION   M34425
KEYWORDS    acid alpha-glucosidase; acid maltase.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 298)
  AUTHORS   Martiniuk,F., Mehler,M., Tzall,S., Meredith,G. and Hirschhorn,R.
  TITLE     Sequence of the cDNA and 5'-flanking region for human acid alpha-
            glucosidase, detection of an intron in the 5' untranslated leader
            sequence, definition of 18-bp polymorphisms, and differences with
            previous cDNA and amino acid sequences
  JOURNAL   DNA 9, 85-94 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    IVS         274  >   298     acid alpha-glucosidase, intron A
                                 /hgml_locus_uid="LZ0096H" /nomgen="GAA"
                                 /map="17q23"
BASE COUNT       38 a     88 c    128 g     44 t
ORIGIN      
        1 cgtgcggagg tgagccgggc cggggctgcg gggcttccct gagcgcgggc cgggtcggtg
       61 gggcggtcgg ctgcccgcgc ggcctctcag tgggaaagct gaggttgtcg ccggggccgc
      121 gggtggaggt cggggatgag gcagcaggta ggacagtgac ctcggtgacg cgaaggaccc
      181 cggccacctc taggttctcc tcgtccgccc gttgttcagc gagggaggct ctgggcctgc
      241 cgcagctgac ggggaaactg aggcacggag cgggtgagac acctgacgtc tgccccgc
//
LOCUS       HUMMAK        171 bp ds-DNA             PRI       14-JUL-1990
DEFINITION  Human male germ cell-associated kinase (mak) gene, exon N.
ACCESSION   M35863 M34569
KEYWORDS    kinase.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 171)
  AUTHORS   Matsushime,H., Jinno,A., Takagi,N. and Shibuya,M.
  TITLE     A novel mammalian protein kinase gene (mak) is highly expressed in
            testicular germ cells at and after meiosis
  JOURNAL   Mol. Cell. Biol. 10, 2261-2268 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    25  /   157     male germ cell-associated kinase (mak), exon N
                                 (AA at 27)
    IVS     <     1       24     mak intron X
    IVS         158  >   171     mak intron X+1
BASE COUNT       48 a     30 c     38 g     55 t
ORIGIN      
        1 tttttttctc cgtatatcat caaggctttt ttcataggga catgaaacca gaaaacttgc
       61 tttgtatggg tccagagctt gtgaaaattg ctgattttgg acttgcaaga gaattaaggt
      121 cacagccacc atacactgac tatgtatcta ccagatggtg agtaggggtt a
//
LOCUS       RATMAK       2372 bp ss-mRNA            ROD       14-JUL-1990
DEFINITION  Rat male germ cell-associated kinase (mak) mRNA, complete cds.
ACCESSION   M35862 M34569
KEYWORDS    kinase.
SOURCE      Rat, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2372)
  AUTHORS   Matsushime,H., Jinno,A., Takagi,N. and Shibuya,M.
  TITLE     A novel mammalian protein kinase gene (mak) is highly expressed in
            testicular germ cells at and after meiosis
  JOURNAL   Mol. Cell. Biol. 10, 2261-2268 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        237     2105     male germ cell-associated kinase (mak)
    mRNA    <     1     2372     mak mRNA
BASE COUNT      683 a    575 c    574 g    540 t
ORIGIN      
        1 tctgaatatg tggagactat tatagcaaaa tggactcagt gtctctttgc cagattcctt
       61 attgctgtat cccatgagcc tacctgacct tgaggaagag ccgtgtttaa tgatgacaag
      121 cctgtctccc gtgtagtgac aaaaaggagg aggaccacaa ggaaagacgg atttagattt
      181 ctactcagca aggaagcgag ctctgtcagc aacctctctg gattaccctt cccaagatga
      241 accgatatac aaccatgagg cagctggggg acggcacgta cgggagtgtg cttatgggca
      301 agagcaatga gtctggggag ctggtggcaa tcaagaggat gaagagaaag ttctattctt
      361 gggatgagtg tatgaacttg cgagaagtta agtccctgaa gaaactcaat catgccaatg
      421 tgattaaact aaaagaagtt atcagagaaa atgaccatct ttattttata tttgaatata
      481 tgaaagaaaa cctctatcag ctaatgaaag acagaaacaa actgttccct gagtcagtca
      541 tcagaaatat tatgtatcaa atactacagg ggctggcatt tatccacaaa catggctttt
      601 ttcacaggga catgaaacct gagaatttgc tttgcatggg tccagagctg gtgaagattg
      661 ctgattttgg acttgcgaga gaattaagat cacagccacc atatactgac tatgtgtcta
      721 ccagatggta ccgtgctcct gaagttttgc taaggtcttc agtgtacagc tctcccattg
      781 acgtgtgggc cgtgggaagt ataatggccg agctatatac gtttagaccg cttttcccag
      841 ggaccagtga agttgatgag atctttaaaa tttgccaagt gttagggact cccaagaaaa
      901 gtgactggcc ggaggggtac cagctggcat cctccatgaa cttccgcttt ccccagtgca
      961 ttcctataaa cctgaaaact ctcattccca atgccagtag tgaggctatt cagcttatga
     1021 cagaaatgct taactgggat ccaaagaaac ggccaactgc aagccaggca ctgaagcacc
     1081 catattttca agtcggtcag gtattgggcc cttctgcaca ccatctggat gcaaaacaga
     1141 ctttgcacaa gcagctgcag cctccagagc caaagccatc ttcctctgaa cgggatccta
     1201 agcctttgcc aaacatcctt gatcagcctg ccgggcagcc ccagccaaaa cagggccacc
     1261 aaccactgca ggccattcag ccaccacaga acacagtggt tcagccacct ccaaagcagc
     1321 agggtcacca taagcaacca caaacgatgt ttccaagtat cgtcaaaacc ataccaacga
     1381 atccagtcag cacagtaggc cataaggggg cccggagacg gtggggtcag acagtcttca
     1441 agtctggaga cagctgtgac aacatcgagg actgcgactt gggagcctcc cactccaaga
     1501 agccgagcat ggatgccttc aaggaaaaaa agaagaagga gtctccattt cggtttccag
     1561 aagcaggact cccagtctcc aaccacttga agggggaaaa tagaaattta catgcatcct
     1621 taaaatctga cacaaacttg tcaactgctt caaccgctaa gcagtactat ttgaaacaat
     1681 caagatacct tccgggtgtg aaccccaaga acgtgtcttt ggtagctggc ggcaaggata
     1741 taaattcaca ctcttggaat aatcagctat ttcctaagtc tctgggatcc atgggggcgg
     1801 acctcgcttt caagaggagt aacgcagcag ggaaccttgg gagctacagc gcttacagcc
     1861 agacgggatg cgtgccttcc tttctcaaga aagaagtggg atcagctggc cagaggatcc
     1921 acttggcgcc tctgggtgcg tcggctgccg actatacctg gagcacaaaa actggccgag
     1981 gccagttttc aggacgaact tacaatccca cagccaaaaa tctcaatatt gtgaaccgca
     2041 cacagccagt cccctcggtg cacgggagga cagactgggt ggctaagtac ggaggccacc
     2101 ggtaggagaa gtggaagcct gaagcattgc tccgtagagg acaatcacgc cccttgatcc
     2161 tgggagacgt ctacagagtc tatttctacc gagttccaca ggacgcacac aaccgtgggc
     2221 gcctcagaga ccggaagtca gctcccgatt tcttcctttt ccggaaatgc aatgcatttt
     2281 cttaacttgt ggccagcagt gctgatgcag caccgtgctg aaacttttga gcgggacttt
     2341 taaagagtat tgaataaatt atttgccaaa gt
//
LOCUS       BNRCFIA      1930 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  B.fragilis imipenem-cefoxitin-hydrolyzing enzyme (cfiA) gene,
            complete cds.
ACCESSION   M34831
KEYWORDS    imipenem-cefoxitin-hydrolyzing enzyme.
SOURCE      B.fragilis (strain TAL2480) DNA.
  ORGANISM  Bacteroides fragilis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Anaerobic gram-negative straight, curved and helical rods; 
            Bacteroidaceae.
REFERENCE   1  (bases 1 to 1930)
  AUTHORS   Thompson,J.S. and Malamy,M.H.
  TITLE     Sequencing the gene for an imipenem-cefoxitin-hydrolyzing enzyme
            (cfiA) from Bacteroides fragilis TAL2480 reveals strong similarity
            between cfiA and Bacillus cereus beta-lactamase II
  JOURNAL   J. Bacteriol. 172, 2584-2593 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        533     1282     imipenem-cefoxitin-hydrolyzing enzyme
                                 precursor
    sigp        533      586     imipenem-cefoxitin-hydrolyzing enzyme signal
                                 peptide
    matp        587     1279     imipenem-cefoxitin-hydrolyzing enzyme
BASE COUNT      563 a    435 c    434 g    498 t
ORIGIN      
        1 gatcaataag cagttcatct tgtctgcagt attgaaatca taggagttgt ccagaagcag
       61 atttataggt gatgttctta caagtcagtt cttcgatagc acgcaatatg gtgtctgcgc
      121 tgcaagtgcg aagagttgga tgaagagaca aatgtttcat caggtgagtt gtaacatcct
      181 caatacatga gccgccacaa agatatacgc acatcagaga gcgtagaatt tcgctatatt
      241 gataaccaaa catagtgcat ctcaatccca aggtggaatc tatggtttga gctaaaagag
      301 catcaaattg ctccataata gaaaaaattc ctccaaaagg agtgagtttc tcagatttta
      361 tttgtacttt tgccatgtca tattagagtt ttgcttgtct tcttttcgca acactaaggt
      421 aagtgaaaat tctgacatgg caaaatcctg ggcaactttt tgttgctcag gcacttataa
      481 ataatgttaa actatagtgt tgcggaatta aggcatcata taaaagaata aaatgaaaac
      541 agtatttatc cttatctcca tgcttttccc tgtcgcagtt atggcacaga aaagcgtaaa
      601 aatatccgat gacatcagta tcacccaact ctcggacaaa gtgtacactt atgtatccct
      661 cgccgaaatc gaaggatggg gtatggtacc ttccaacggg atgattgtta tcaacaacca
      721 ccaggcagcg ttgctggaca caccgatcaa tgacgcacaa acggaaatgc tggtcaactg
      781 ggtgacagac tctttgcatg ccaaagtcac cacgtttatc ccgaaccact ggcacggcga
      841 ttgtattggc ggactgggtt acctgcaaag gaaaggtgtc caatcatacg cgaaccagat
      901 gacgatagac ctcgccaagg aaaaagggtt gcccgtaccg gaacatggat tcaccgattc
      961 actgaccgtc agcttggacg gcatgcctct ccaatgttat tatttaggag gcggacatgc
     1021 gaccgacaat atcgtggttt ggctgccgac agagaatatc ctttttggcg gatgtatgct
     1081 taaagacaac caggcgacaa gcatcggcaa catctcggac gcggacgtga cggcatggcc
     1141 gaaaactctc gataaggtaa aagccaagtt cccctcggcc cgttacgtcg tgcccggaca
     1201 tggcgactat ggcggaaccg aactgataga gcataccaag cagatcgtga accaatatat
     1261 agaaagcact tcaaagccat agcccgaatt gtaggcatct acccggtatt ccatgaacga
     1321 agaggcgaaa gcttttatcc cgtccacaga atgtaacccg gccacaaacg agacataaaa
     1381 atatcaattg aacaacacaa cagatccttg tttcatctac ggcgtcgcag ttaggtcttt
     1441 ttttatcaat tgttaagcgt ttcttcccga ttattagcga tctttgccat caaaaaataa
     1501 acgatatgaa ttggattatc ctaacgattg ccggatgctt tgaggtgctg ttcactttct
     1561 gcatggggaa agtgaaagaa acaaccggaa cgaaatgtac ctgtggttca ccggattcct
     1621 gatcgcaact tttataagta tgttactact ggtgaaagcc acccagaatc tgcccatcgg
     1681 cacagcttat gccgtgtgga caggaatagg cgctgtgggt acggtgctcg taggtatcct
     1741 tgtgtttagc gaacccgtca gttttggaga ctcttcttta tagtgatgct gatcagttca
     1801 ataatcgggc tgaagatggt ttcgcattga gggggaaatt catccctcag cacctccggc
     1861 aattccgcca acgaacggat catcagatac tcctccgacg atgaatctcc tcgtgttcac
     1921 atcgctcagc
//
LOCUS       CLTSTRRA      142 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION   M34684
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 1-A.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 142)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt          36       77     STRR1 sequence
    rpt          78      119     STRR1 sequence
BASE COUNT       28 a     64 c     19 g     31 t
ORIGIN      
        1 tatttattcc catcacccaa tccccaatcc ccagtcccca ttacccctta tcccagaggg
       61 gccccgagtt ccccaatccc cattacccct tatcccagag ggtccccgag ttccccagtc
      121 cccagtcccc aatcctgact gg
//
LOCUS       CLTSTRRB       73 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION   M34685
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 1-B.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 73)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT       19 a     33 c      5 g     16 t
ORIGIN      
        1 ggagttctcc aatccccaat ccccagtccc caatccccat tccccattcc ccattcccct
       61 caaactaaag aat
//
LOCUS       CLTSTRRC       64 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION   M34686
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 1-C.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 64)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT       16 a     26 c      2 g     20 t
ORIGIN      
        1 tttttccaaa tccccaatcc ctagtcccca atccctagtc cccaatccct aatcatttca
       61 attc
//
LOCUS       CLTSTRRD       49 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION   M34687
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 1-C.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 49)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT        9 a     23 c      5 g     12 t
ORIGIN      
        1 ttcctctgct tccccagtcc ccagtcccca gtccccaccc ataaaagtt
//
LOCUS       CLTSTRRE       74 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 1 (STRR1).
ACCESSION   M34688
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 1-E.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 74)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT        6 a     33 c      3 g     32 t
ORIGIN      
        1 aggttttgta ttttcccttt cccctttcac cttttcccct tcccctttca cctttcccct
       61 ttccccaccc ctat
//
LOCUS       CLTSTRRF       75 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION   M34689
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 2-A.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT       14 a      8 c     15 g     38 t
ORIGIN      
        1 ctggtttgtt ttgttattgg tcattggtca tttgtcattg gtcatttgtc attagttatt
       61 ggtaattcat atcaa
//
LOCUS       CLTSTRRG       64 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION   M34690
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 2-B.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 64)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT       10 a     10 c     11 g     33 t
ORIGIN      
        1 tttttgttgt ttgtcatttg tcattggtca tttgtcatta gtcctttgtc attcaaaccg
       61 agtt
//
LOCUS       CLTSTRRH       77 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION   M34691
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 2-C.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 77)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT       18 a      6 c     17 g     36 t
ORIGIN      
        1 ccaatttgta ttagtcattt gttattagtc atttgtcatt tgtcatttgg ggtttgtaat
       61 tggtaatagg aggaatt
//
LOCUS       CLTSTRRI       85 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION   M34692
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 2-D.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 85)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT       25 a      0 c     25 g     35 t
ORIGIN      
        1 atggggtaat tggtaattgg taattggtaa ttggtaattg gtaattggta attggtaatt
       61 ggtaattggt aattggtaat agatt
//
LOCUS       CLTSTRRJ       80 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 2 (STRR2).
ACCESSION   M34693
KEYWORDS    s.
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 2-E.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 80)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT       13 a     13 c     17 g     37 t
ORIGIN      
        1 tcccatgtca tttgtcattt gtcattggtt attggttatt ggtcatttgt catttggggt
       61 ttggtaattg acctcaactc
//
LOCUS       CLTSTRRK       76 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Callothrix sp. short tandem repeat family 3 (STRR3).
ACCESSION   M34694
KEYWORDS    .
SOURCE      Callothrix sp. (strain PCC 7601) DNA, clone 3.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N.
  TITLE     Highly repetitive DNA sequences in Cyanobacterial genomes
  JOURNAL   J. Bacteriol. 172, 2755-2761 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT       25 a     15 c     15 g     21 t
ORIGIN      
        1 ttttaagtct gtgaacagtc aacagtcaac agtcaacagt taacagttga aaggacacag
       61 cagtgctatg tccttt
//
LOCUS       CPACM559B    1076 bp ds-DNA             ORG       14-JUL-1990
DEFINITION  C.paradoxa cyanelle cytochrome b-559 (psbE, psbF, psbI, and psbJ)
            genes, complete cds.
ACCESSION   M35129
KEYWORDS    cytochrome b-559.
SOURCE      C.paradoxa cyanelle DNA.
  ORGANISM  Cyanelle Cyanophora paradoxa
            Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; 
            Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa.
REFERENCE   1  (bases 1 to 1076)
  AUTHORS   Cantrell,A. and Bryant,D.A.
  TITLE     Nucleotide sequence of the genes encoding cytochrome b-559 from the
            cyanelle genome of Cyanophora paradoxa
  JOURNAL   Photosyn. Res. 16, 65-81 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        246      470     cytochrome b-559 apoprotein (psbE)
    pept        510      638     cytochrome b-559 apoprotein (psbF)
    pept        648      764     cytochrome b-559 apoprotein (psbI)
    pept        787      909     cytochrome b-559 apoprotein (psbJ)
    signal      926      951     pot. transcription termination signal
BASE COUNT      367 a    142 c    160 g    407 t
ORIGIN      
        1 tctttaagta gtcagatatc tggaaatact tggaaaaaag aaacaagtac agacaatatt
       61 ccttcaaatt tctataaaat tagttttata gataaagaag taggtttata ctaggaaatc
      121 agggaacact tttacgatat gtatcattat aatagaataa ttaatatctt ataatttgaa
      181 taaaatcttt tattatacat aaaagcatca tatatatata tatataattt ttaggagtga
      241 tttatatgtc tggaggaact actggcgaac gcccattttc tgacattgtt actagtattc
      301 gttattgggt tattcatact gtaactattc cattctttat tgttgcaggt tggctttttg
      361 taagtactgg tttagcttat gatgtatttg gtactccaag accagatgaa tacttcactg
      421 aagaacgtca agaagtacca attattaatc aacgtttttc aactaattaa tcgttattaa
      481 tttatagttt ctaaaagagg aaaaaggaaa tgaataatcc taatcaaccg gtttcttatc
      541 caatttttac agttagatgg ttagcaattc atgctattgg aattccagct gtatttttta
      601 ttggatctat tactgcaatg caatttattc aacgatagga gatatatatg gttagccaaa
      661 atcctaatag acaaaaagtt gaattaaatc gtacttccct attctgggga ttacttttaa
      721 ttttcgtatt agcaatttta ttctctagct acatttttaa ctaaatttct tcaataagga
      781 gtctttatgg caaatactgg tggacgcatt cctttatggc ttgttgctac agttgcaggt
      841 ttagcagcta ttggtgtact aggaatcttt ttctatggtg gttactctgg tttaggttcc
      901 tctatttaat aaaatcaatt taagattttt tgtaataaat taattaaaaa aaaatagaga
      961 gttaaatttt tttaactctc tattctttag acatttataa ctaataaaag aaattaagca
     1021 atttcatctg tttcaatata aacaaataat aatgccataa cattgctggt aagaca
//
LOCUS       KVSM121M      612 bp ds-RNA             VRL       14-JUL-1990
DEFINITION  Killer virus of S.cerevisiae (class M1, type 1), M1-2 region mRNA
            fragment.
ACCESSION   M35150
KEYWORDS    .
SOURCE      Killer virus of S.cerevisiae (strain A364A x S7) class M1 type 1,
            cDNA to viral RNA, clones pEH-[2,3].
  ORGANISM  Killer virus of S.cerevisiae
            Viridae; ds-RNA nonenveloped viruses.
REFERENCE   1  (bases 1 to 612)
  AUTHORS   Georgopoulos,D.E., Hannig,E.M. and Leibowitz,M.J.
  TITLE     Sequence of the M1-2 region of killer virus double-stranded RNA
  JOURNAL   Basic Life Sci. 40, 203-213 (1986)
  STANDARD  simple staff_review
BASE COUNT      205 a    140 c    141 g    125 t      1 others
ORIGIN      
        1 aaagaaagaa cnaaaagaaa cagaaaaaga gagaacagga caacaaacgc aacaaaacac
       61 aaacacaagc acactcacct tgagtctaac tggtggcacg cagcatatct caccctgaga
      121 ctaactggcg gcaggcgacc gtgagcatac agcatgcccc actcgattcg agacgcgatt
      181 cgcgctcgta ggtatcgagc ggctacgttg agctattatg gcagtgacat gcgattcgcg
      241 cactgccaag atcagctcag caaagttaag accagtatcg gatatggtag actactacaa
      301 ttcgcacagg tatgagattc tcagtctagt gtatggatga gtagttgagc caatgaatct
      361 agggtttaaa ttactatgca ttgacatata acaggtacaa gcgtagataa tacttactag
      421 gccccagccg gtacaccctg tattgaataa atacgactat ttggccaggt ctggacgggg
      481 cagtcgaatt actaggttga gcacacacac gtgaatcaca caacataaca gtgtaggaac
      541 ataatgtgcc attcgtagtc tgagacgccg ctagcctggt ttaatgcaac agcatagaag
      601 aaacacacat ca
//
LOCUS       PIGHMG2      1153 bp ss-mRNA            MAM       14-JUL-1990
DEFINITION  Pig non-histone chromosomal protein (HMG2) mRNA, complete cds.
ACCESSION   J02895
KEYWORDS    non-histone chromosomal protein.
SOURCE      Pig thymus, cDNA to mRNA, clone pCD-MK02.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1153)
  AUTHORS   Shirakawa,H., Tsuda,K.-i. and Yoshida,M.
  TITLE     Primary structure of non-histone chromosomal protein HMG2 revealed
            by the nucleotide sequence
  JOURNAL   Biochemistry 29, 4419-4423 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by M.Yoshida, 07-DEC-1989.
FEATURES       from  to/span     description
    pept        144      776     non-histone protein HMG2 precursor
    matp        147      773     non-histone protein HMG2
    mRNA    <     1     1153     HMG2 mRNA
BASE COUNT      361 a    224 c    301 g    267 t
ORIGIN      53 bp upstream of HaeIII site.
        1 acaaaccagt tcacgccgga gccgcaggta ggcagcgtcg cggtcggacc cggccgctgc
       61 gcgggagcct gaggagaagc taccaccagg caagagaccc ctccgggccc cggtggacgc
      121 gtcgtcgccg gcgccgcagc accatgggga agggcgaccc caacaagccg cggggcaaga
      181 tgtcctcgta cgccttcttc gtgcagacct gccgggagga gcacaagaag aaacaccccg
      241 attcctcggt caacttcgcc gagttctcca agaagtgctc cgagcgatgg aagactatgt
      301 ctgccaagga aaagtccaag tttgaagata tggcaaaaag tgacaaagct cgctatgacc
      361 gggagatgaa aaattacgtc cctcccaagg gtgacaagaa gggcaagaaa aaagatccca
      421 atgctcccaa aaggcctcca tctgccttct tcctgttttg ctctgaacat cgcccaaaga
      481 tcaaaagtga acaccctggc ttatccattg gggatactgc aaagaaattg ggtgaaatgt
      541 ggtctgagca gtcagccaaa gataaacaac cgtatgaaca gaaagcagct aagctaaagg
      601 aaaaatatga aaaggatatt gctgcatacc gtgccaaggg taagggtgaa gcaggaaaga
      661 agggccctgg caggccaaca ggctctaaga agaagaatga accagaagat gaggaggaag
      721 aggaagagga ggaagaagat gaagacgagg aggaagaaga tgaagatgag gaataaatgg
      781 ctatcctgta atgttgtgtg tggagtgtgt gtgtgtgctc aggcaattat tttgctaaga
      841 atgtgaattc aagtgcagct caatattagc ttcagtataa aaactgtaca gatttttgta
      901 tagctaataa gattctttgt agagaaaata ctttttttaa aagtgcaggt tgcagctttt
      961 tgaggggcta ctacatacag ttagatttta aagcttctga tgttgaatgt ttctaaatat
     1021 ttaatggttt ctttaatttc ttgtgtatgg taacagcaaa ctcgtagaaa taagtatcaa
     1081 tagcaaattt tgggttttct agaatgttgc attttgtttt ttaaaaaaat tttgtaataa
     1141 aattatgtat att
//
LOCUS       PSEHUTH      2249 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  P.putida histidine ammonia-lyase (hutH) gene, complete cds.
ACCESSION   M35140
KEYWORDS    histidase; histidine ammonia-lyase.
SOURCE      P.putida (ATCC 12633) DNA, clone pMC4.
  ORGANISM  Pseudomonas putida
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 2249)
  AUTHORS   Consevage,M.W. and Phillips,A.T.
  TITLE     Sequence analysis of the hutH gene encoding histidine ammonia-lyase
            in Pseudomonas putida
  JOURNAL   J. Bacteriol. 172, 2224-2229 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        316     1848     histidine ammonia-lyase (hutH) precursor (gtg
                                 start codon (E.C. 4.3.1.3)
    matp        319     1845     histidine ammonia-lyase (hutH)
BASE COUNT      392 a    777 c    699 g    381 t
ORIGIN      
        1 ccatggtggt ggcgtgggca tgggcttctc tcagcactcg ggcatggtca tcgtctgcga
       61 cggcaccgat gaagccgccg agcgcatcgc ccgtgtactg accaacgacc cagggactgg
      121 cgtcatgcgc cacgccgatg ccggttatga catcgccatc gactgcgcca aggagcaggg
      181 cctggacctg ccgatgatca ctggctgatt gccacgcttt ggatcgcacc ccgccagggg
      241 tgatactgaa caacaagaag gagcgcgcag gcacccacga acggcctgcg gctccacgcg
      301 atttggagta gtaacgtgac cgaactcacc ctcaagcccg gcaccctgac cctggcccag
      361 ctgcgcgcga tccatgccgc gcccgtgcgc ctgcaactgg atgccagcgc cgcgccggcc
      421 atcgatgcca gcgtcgcctg tgtcgagcag atcattgccg aagaccgcac tgcctacggc
      481 atcaacaccg gtttcggcct gctggcctcg acccgcatcg ccagccacga cctggaaaac
      541 ctgcagcgct cgctggtgct gtcccacgcc gctggcattg gcgcgccgct ggatgacgat
      601 ctggtgcggt tgatcatggt gctgaaaatc aacagcctca gccgtggctt ctccggcatt
      661 cgccgcaaag tcatcgatgc gctgatcgcc ctggtcaacg ccgaagtcta cccgcacatc
      721 ccgctgaaag gctcggtggg tgcttccggc gacctggcgc cgctggcgac catgtcgctg
      781 gtgctgctgg gcgaaggcaa agcccgctac aagggccagt ggctgtcggc caccgaagcc
      841 ctggcggttg ccggcctcga accgctgacc ctggctgcca aagagggcct ggccctgctc
      901 aacggcaccc aggcgtccac cgcctatgcc ctgcgtggcc tgttctatgc cgaagacttg
      961 tacgctgcgg ctatcgcctg cggcggcctg agcgtcgaag ccgtactggg ctcgcgttcg
     1021 ccgttcgatg cgcgtattca cgaagcgcgt ggccagcgcg gccagatcga cactgccgcg
     1081 tgcttccgcg acctgctggg cgattccagc gaagtgtcgc tgtcgcacaa gaactgcgac
     1141 aaggtccagg acccgtactc gctgcgctgc cagccgcagg tcatgggcgc gtgcctgacc
     1201 cagctgcgcc aggctgccga ggtgctgggc atcgaagcca acgccgtgtc ggacaacccg
     1261 ctggtgttcg ctgccgaggg tgacgtgatt tccggtggca acttccacgc agaaccggtg
     1321 gccatggccg ccgacaacct ggccctggcc atcgccgaaa tcggttcgct cagcgagcgc
     1381 cgcatctcgc tgatgatgga caagcacatg tcccagctgc cgccgttcct ggtggaaaac
     1441 ggtggggtca actccggctt catgatcgcc caggtcaccg ctgccgctct ggccagcgag
     1501 aacaaggccc tgtcgcaccc gcacagcgtc gacagcctgc cgacctcggc caaccaggaa
     1561 gaccacgtat cgatggcccc ggctgccggc aagcgcctgt gggaaatggc cgaaaacacc
     1621 cgtggcgtgc ctgccatcga atggctgggc gcatgccagg gcctggacct gcgcaaaggc
     1681 ctgaagactt cggccaagct ggagaaagca cgccaggcgc tgcgcagcga agtggcgcac
     1741 tacgaccgtg accgtttctt cgcgccggac atcgaaaagg ctgtggaact gttggccaag
     1801 ggtagtttga ccggtttgct gccggcaggt gtgctgccaa gcctgtaatg ccccctgggg
     1861 cgctgcgcct tcgcagcaca aggctgctcc tgcaaaagcg cacgcgccct tgtacggaca
     1921 gcctgtgctg cgatgggctg catgccagcc ccctgacctc acgaccgcag gacgattcaa
     1981 ggacgccgac acgcaacaag ctcaaggtct caagtgcggg ctaaatgccc gccacatccg
     2041 tttcatggcc ctcggttctg ccatcggaac cgggctgttc tacggctccg cctcagccat
     2101 ccagatggcc ggcccggccg tgctgctggc ttacctgatc ggcggtgccg ctgtgttcat
     2161 ggtcatgcgc gccctcggcg aaatggccgt gcacaacccg gtggccggct ccttcggcca
     2221 ctacgccagt acctacctcg gccccatgg
//
LOCUS       RATGABAAD    1799 bp ss-mRNA            ROD       14-JUL-1990
DEFINITION  Rat GABA-A receptor delta subunit mRNA, complete cds.
ACCESSION   M35162
KEYWORDS    GABA-A receptor delta subunit; gamma-aminobutyric acid receptor.
SOURCE      Rat adult brain, cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1799)
  AUTHORS   Zhao,Z.-Y. and Joho,R.H.
  TITLE     Isolation of distantly related members in a multigene family using
            the polymerase chain reaction technique
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 174-182 (1990)
  STANDARD  simple staff_review
REFERENCE   2  (sites; corrections to figure 3 for [1])
  AUTHORS   Zhao,Z.-Y. and Joho,R.H.
  TITLE     Isolation of distantly related members in a multigene family using
            the polymerase chain reaction technique
  JOURNAL   Biochem. Biophys. Res. Commun. 168, 887-887 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         50     1399     GABA-A receptor delta subunit
BASE COUNT      385 a    512 c    500 g    402 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccctg gggctagccg gacctgtccc gcgcacagcc cgcaaggcca tggacgttct
       61 gggctggctg ctgctgccgc tccttctgct gtgcacgcag ccgcaccatg gcgccagagc
      121 aatgaatgac attggggact acgtgggctc caacctggag atatcctggc tccccaacct
      181 ggatggacta atggagggct acgcccgaaa cttccgacca ggcattggag gtcctccagt
      241 gaatgtggcg cttgccctag aggtggccag cattgaccac atctcagaag caaatatgga
      301 atacaccatg acagtgttcc tgcaccagag ctggcgagac agcaggctgt cctacaacca
      361 taccaacgag accctgggcc tggatagccg cttcgtggac aagctgtggc tccctgacac
      421 cttcattgtg aatgccaagt ctgcctggtt ccatgatgtg accgtggaaa acaagcttat
      481 ccgcctacag cccgacggtg tgattttata cagcatccgc atcacctcca cagtggcctg
      541 tgacatggac cttgccaagt accccatgga cgagcaggag tgcatgctgg acctggagag
      601 ctatggctac tcttctgagg acattgtcta ttattggtca gaaaaccagg agcagatcca
      661 cgggctggac aggctgcaac tggcccagtt cactatcacc agttaccgct tcaccacgga
      721 gctgatgaac ttcaaatcag ctggccagtt ccctcgactc agcttacact tccagcttcg
      781 gaggaaccgg ggtgtctaca tcatccagtc ttacatgccc tctgtcctcc tggttgccat
      841 gtcctgggtc tccttctgga ttagccaagc agcagtgcct gccagagtat ctctaggcat
      901 caccactgtg ctgacaatga ccacactcat ggttagtgcc cgctcctccc tcccgcgggc
      961 ttctgctatc aaggctctgg atgtgtattt ctggatctgc tatgtcttcg tgtttgctgc
     1021 cctggtggag tatgcatttg cccacttcaa tgctgactac aggaagaaac ggaaagccaa
     1081 ggtcaaggtc acgaagccaa gggcagagat ggacgtgagg aacgccattg tcctcttctc
     1141 cctctctgct gctggggtca gccaggagtt ggctatctcc cgccgtcaag gccgggtccc
     1201 tgggaacctc atgggttcct ataggtctgt agaagtggag gcaaagaagg agggggggtc
     1261 ccgcccagga ggcccaggag gcatccgttc cagactcaaa cccatcgatg cagacaccat
     1321 cgacatctat gcccgcgctg tgttcccggc agcctttgca gcagtcaaca tcatctactg
     1381 ggcggcgtat accatgtgac ggcagtgcgt acaccacatg acggctttac atgtaccacg
     1441 tgcggtggtt gccacaaact cctggaggaa cagccctctc cgtggatccc aaagtgggac
     1501 agagaggagc gtccgctccc tgggctttac ctcaattttg ctctaccagc ctgcacttcc
     1561 tgtagatgga gcaggatggg ctttgcacag agggccatta gttggcctgt ccctgaagct
     1621 gagcctgcac taaggtgccc ttgggagctg tgaagtttta gactcaggat gggcctgatt
     1681 tctgggcctt tgctctgtgg gatcaggatc agagagaaag agagtagggg tggatgacca
     1741 ctctcatggt gaactggagg agaagggcat tatgaggcct ctttggtccc agggaattc
//
LOCUS       RPVCTSI      2195 bp ds-DNA             VRL       14-JUL-1990
DEFINITION  Raccoonpox virus Sal1 end fragment DNA.
ACCESSION   M23018
KEYWORDS    .
SOURCE      Raccoonpox virus (strain V71-I-85A) DNA, passed in human tk- 143
            cells.
  ORGANISM  Raccoonpox virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 2195)
  AUTHORS   Parsons,B.L. and Pickup,D.J.
  TITLE     Tandemly repeated sequences are present at the ends of the DNA of
            Raccoonpox virus
  JOURNAL   Virology 161, 45-53 (1987)
  STANDARD  simple staff_review
BASE COUNT      880 a    225 c    373 g    717 t
ORIGIN      
        1 ataattaatt taattttaaa tattttattt aatgtctaga aaaaaatgtg taacccagac
       61 tgtagagaac tctctaatgt ggtgaaaaaa tcaatccctt aatcgagact atcagaaaga
      121 gacgaaacat tttttagtga gacattagaa agagaaagag acgaaacatt ttttagtgag
      181 acattagaaa gagaaagaga cgaaacattt ttagtgagac attagaaaga gaaatatttt
      241 tttgtaaaac ttttttatga ctccattaga aagagatgaa acatttttag tgagacatta
      301 gaaagagaaa tatttttttg taaaactttt ttatgactcc attagaaaga gacgaaacat
      361 ttttagtgag acattagaaa gagaaatatt tttttgtaaa acttttttat gactccatta
      421 gaaagagacg aaacattttt agtgagacat tagaaagaga aatatttttt tgtaaaacgt
      481 tttcttaggt caaaagtcaa tgttttaaaa tatggactag aatatgtctg tgtaaaaact
      541 tgtgaaattt tgtaccaaag aaaaacgtga gcagtatccc ctacatggat tttactagat
      601 catttatgta acaaaaaatt atattattcg tgtacgttaa ttataaaaat attgtatgat
      661 atatcgaggt ccattagcaa aaggtgttga tagacaatta atcgtgtgtg atgtaaccat
      721 gattattaca gcagaaagag taagagtatt ttttgtaaaa cttttttatg actccattag
      781 aaagagaaag agacgaaaca tttttagtga gacattagaa agagacgaaa catttttttt
      841 gtaaaacttt tttatgactc cattagaaag agaaagagac gaaacatttt tagtgagaca
      901 ttagaaagag aaatattttt tgtaaaactt tttttatgac tccattagaa agagaaatat
      961 tttttgtaaa acttttttat gactccatta gaaagagacg aaacattttt attgagacat
     1021 tagaaagaga aagagacgaa acattttttt gtaaaacttt tttatgactc cattagaaag
     1081 agaaagagac gaaacatttt tagtgataca ttagaaagag aaatattttt tgtaaaactt
     1141 ttttatgact ccattagaaa gagatgaaac atttttagtg agacattaga aagagaaaga
     1201 gacgaaacat tttttttgta aaactttttt tatgactcca ttagaaagag aaagagacga
     1261 aacattttta gtgagacatt agaaagataa agagacgaaa catttttttt gtaaaacttt
     1321 ttttatgact ccattagaaa gagaaagaga cgaaacattt ttagtgagac attagaaaga
     1381 gacgaaacat ttttagtgag acattagaaa gataaagaga cgaaacattt tttttgtaaa
     1441 acttttttat gactccatta gaaagagaaa gagacgaaac atttttagtg agacattaga
     1501 aagagacgaa acattttttt tgtaaaactt tttttatgac tccattagaa agagacgaaa
     1561 catttttagt gagacattag aaagataaag agacgaaaca ttttttttgt aaaacttttt
     1621 tatgactcca ttagaaagag aaagagacga aacattttta gtgagacatt agaaagagaa
     1681 atattttttg taaaactttt ttatgactcc attagaaaga gatgaaacat ttttagtgag
     1741 acattagaaa gagaaatatt tttttgtaaa acttttttat gactccatta gaaagagatg
     1801 aaacattttt agtgagacat tagaaagaga aagagacgaa acattttttt ttgtaaaact
     1861 tttttatgac tccattagaa agagatgaaa catttttagt gagacattag aaagagaaag
     1921 agacgaaaca tttttttttt gtaaaacttt ttatgactcc attagaaaga gatgaaacat
     1981 ttttagtgag acattagaaa gagaaagagt gacacgataa aacttgttac ttattgttag
     2041 agaaaaagaa tcgactttaa tgcgaataca tttttaaatt tcaaaccgga aataagtgtc
     2101 cggtacactt ttaaaattaa acgtatgtcg gaatccacaa ctaatcctga ggattacacg
     2161 attcagacac acgatttaag ttttgtgtcg tcgac
//
LOCUS       STYFLICDA     612 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  S.muenchen phase 1 flagellin (antigen-determining region) (fliCd)
            gene, partial cds.
ACCESSION   M34824
KEYWORDS    phase 1 flagellin antigen d.
SOURCE      S.muenchen (strain RKS3121) DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 612)
  AUTHORS   Smith,N.H., Beltran,P. and Selander,R.K.
  TITLE     Recombination of Salmonella phase 1 flagellin genes generates new
            serovars
  JOURNAL   J. Bacteriol. 172, 2209-2216 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   612     phase 1 flagellin (fliCd) (AA at 1)
BASE COUNT      207 a    114 c    137 g    154 t
ORIGIN      
        1 attgatttaa aagaaattag ctctaaaaca ctgggacttg ataagcttaa tgtccaggat
       61 gcctacaccc cgaaagaaac tgctgtaacc gttgataaaa ctacctataa aaatggtaca
      121 gatactatta cagcccagag caatactgat atccaaactg caattggcgg tggtgcaacg
      181 ggggttactg gggctgatat caaatttaaa gatggtcaat actatttaga tgttaaaggc
      241 ggtgcttctg ctggtgttta taaagccact tatgatgaaa ctacaaagaa agttaatatt
      301 gatacgactg ataaaactcc gttagcaact gcggaagcta cagctattcg gggaacggcc
      361 actataaccc acaaccaaat tgctgaagta acaaaagagg gtgttgatac gaccacagtt
      421 gcggctcaac ttgctgctgc aggggttact ggtgccgata aggacaatac tagccttgta
      481 aaactatcgt ttgaggataa aaacggtaag gttattgatg gtggctatgc agtgaaaatg
      541 ggcgacgatt tctatgccgc tacatatgat gagaaacagg tacaattact gctaaacaac
      601 cactatacag at
//
LOCUS       STYFLICI      585 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  S.typhimurium phase 1 flagellin (antigen-determining region)
            (fliCi) gene, partial cds.
ACCESSION   M34822
KEYWORDS    phase 1 flagellin antigen i.
SOURCE      S.typhimurium (strain RKS3333) DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 585)
  AUTHORS   Smith,N.H., Beltran,P. and Selander,R.K.
  TITLE     Recombination of Salmonella phase 1 flagellin genes generates new
            serovars
  JOURNAL   J. Bacteriol. 172, 2209-2216 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   585     phase 1 flagellin (fliCi) (AA at 1)
BASE COUNT      180 a    108 c    142 g    155 t
ORIGIN      
        1 atcgatctga agcagatcaa ctctcagacc ctgggtctgg atacgctgaa tctgcaacaa
       61 aaatataagg tcagcgatac ggctgcaact gttacaggat atgccgatac tacgattgct
      121 ttagacaata gtacttttaa agcctcggct actggtcttg gtggtactga ccagaaaatt
      181 gatggcgatt taaaatttga tgatacgact ggaaaatatt acgccaaagt taccgttacg
      241 gggggaactg gtaaagatgg ctattatgaa gtttccgttg ataagacgaa cggtgaggtg
      301 actcttgctg gcggtgcgac ttccccgctt acaggtggac tacctgcgac agcaactgag
      361 gatgtgaaaa atgtacaagt tgcaaatgct gatttgacag aggctaaagc cgcattgaca
      421 gcagcaggtg ttaccggcac agcatctgtt gttaagatgt cttatactga taataacggt
      481 aaaactattg atggtggttt agcagttaag gtaggcgatg attactattc tgcaactcaa
      541 aataaagatg gttccataag tattaatact acgaaataca ctgca
//
LOCUS       STYFLICR      579 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  S.heidelberg phase 1 flagellin (antigen-determining region) (fliCr)
            gene, partial cds.
ACCESSION   M34823
KEYWORDS    phase 1 flagellin antigen r.
SOURCE      S.heidelberg (strains RKS539) DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 579)
  AUTHORS   Smith,N.H., Beltran,P. and Selander,R.K.
  TITLE     Recombination of Salmonella phase 1 flagellin genes generates new
            serovars
  JOURNAL   J. Bacteriol. 172, 2209-2216 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   579     phase 1 flagellin (fliCr) (AA at 1)
BASE COUNT      181 a    118 c    126 g    154 t
ORIGIN      
        1 atcgatctga agcagatcaa ctctcagacc ctgggtctgg atacgctgaa tgttcaacaa
       61 aaatataagg tcagcgatac ggctgcaact gtcactggct atacagattc tgctactgct
      121 attgacaaat ctacgtttgc tgcatcagca actaccttag gtggtactcc tgctattact
      181 ggtgatctga agtttgatga tactactgga aaatattacg ctgatgtttc aggtactacg
      241 gctaaagatg gtgtttatga agtaacagtt gcagccgatg gaaaagtcac tttaactggc
      301 acaccaacag gaccaattac tgctggcttc ccttcaactg caacaaaaga tgttaaacaa
      361 actcagcaag aaaacgctga tttgacagag gccaaagccg cattgacagc agcgggtgtt
      421 gcagcggccg gcacagcatc tgttgttaag atgtcttata ctgataataa cggtaaaact
      481 attgatggtg gtttagcagt taaggtaggc gatgattact attctgcaac tcaaaataaa
      541 gatggttcca taagtattaa tactacgaaa tacactgca
//
LOCUS       HUMFGF1A     3343 bp ss-mRNA            PRI       14-JUL-1990
DEFINITION  Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds.
ACCESSION   M34641
KEYWORDS    FGF receptor-1; fibroblast growth factor receptor;
            transmembrane glycoprotein; tyrosine kinase.
SOURCE      Human teratocarcinoma cell line Tera-2, clone 13, cDNA to mRNA,
            clone 1:1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3343)
  AUTHORS   Wennstroem,S., Sandstroem,C. and Claesson-Welsh,L.
  TITLE     cDNA cloning and expression of a human FGF receptor which binds
            acidic and basic FGF
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by L.Claesson-Welsh, 25-MAY-1990.
            
              Author address: L.Claesson-Welsh
              Ludwig Institute for Cancer Research
              Biomedical Center
              Box 595
              S-751 24 Uppsala
              SWEDEN
            
FEATURES       from  to/span     description
    pept         10     2472     FGF receptor-1 precursor
    sigp         10       72     FGF receptor-1 signal peptide
    matp         73     2469     FGF receptor-1
BASE COUNT      766 a    957 c    911 g    709 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca
       61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct
      121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg
      181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac
      241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc
      301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat
      361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag
      421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa
      481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct
      541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct
      601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct
      661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc
      721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca
      781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg
      841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag
      901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc
      961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat
     1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg
     1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc
     1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg
     1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag
     1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct
     1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg
     1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc
     1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg
     1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac
     1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg
     1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc
     1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca
     1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac
     1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc
     1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata
     1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac
     1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac
     2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc
     2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg
     2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca
     2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc
     2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc
     2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag
     2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc
     2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc
     2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg
     2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct
     2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc
     2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg
     2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc
     2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc
     2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc
     2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac
     3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc
     3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat
     3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc
     3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga
     3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt
     3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc
//
LOCUS       BSUSPCR      2852 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  B.subtilis spectinomycin resistance (spc) genes, complete cds.
ACCESSION   M31102
KEYWORDS    spectinomycin resistance.
SOURCE      B.subtilis (strain 1A241) DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2852)
  AUTHORS   Yoshikawa,H. and Doi,R.H.
  TITLE     The nucleotide sequence of the Bacillus subtilis spectinomycin
            resistance gene region
  JOURNAL   Nucleic Acids Res. 18, 1647-1647 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.H.Doi, 04-JAN-1990.
FEATURES       from  to/span     description
    pept         52      549     spc ORF1
    pept        563      742     spc ORF2
    pept        773     1213     spc ORF3
    pept       1215     2450     spc ORF4 (ttg start codon)
    pept       2505  >  2852     spc ORF5
BASE COUNT      830 a    535 c    673 g    814 t
ORIGIN      
        1 tgatgcgctc gtgaagctgg acttaaattt taataaaaga aggagggaca catgattatg
       61 cgtcgtattg acccaagcaa attagagtta gaagaacgct tagttacggt taaccgcgta
      121 gcgaaaggtg gtcgtcgttt ccgcttcgca gctctagtcg ttgtcggtga caaaaacgga
      181 cacgtaggat tcggtactgg taaagcacaa gaagtaccag aagcgattcg caaagctgtt
      241 gaagatgcga aaaagaattt gattgaagta ccaatggttg gaactacaat tccacacgaa
      301 atcatcggac gtttcggtgc aggtaacatc ttgttaaaac ctgcttctga aggtactgga
      361 gttatcgctg gaggccctgt acgtgcggta cttgagctag ctggtgtagc tgatatcctt
      421 tctaagtctt taggttctaa cacaccgatc aacatgattc gtgcaacact tcaaggttta
      481 agtgaactta aacgtgctga agacgttgcg aagcttcgtg gaaaatctgt agaagaactg
      541 ttaggataag gagggaacat taatggctaa attagaaatt accctcaaac gaagtgtaat
      601 cggtcgcccg gaagaccaac gcgttactgt aagaacactt ggtttaaaga aaacaaacca
      661 aactgttgtt catgaagaca acgctgcgat ccgtggtatg attaataaag tatctcattt
      721 agtttctgtt aaagaacaat aaaaatattt gatcggataa ggaggtgtcc taatgaaact
      781 tcatgaatta aaaccttcag aaggttcacg caaaacgcgt aatcgcgtag gtcgtggtat
      841 tggttctggc aacggtaaaa cagctggtaa aggtcacaaa ggtcaaaacg ctcgttctgg
      901 cggcggtgta cgccctggat tcgagggggg acaaatgcct ttattccaac gtcttcctaa
      961 acgtggtttc actaacatca accgcaagga atacgctgta gtcaacctag acaaattgaa
     1021 cggttttgca gaaggaacgg aagtcactcc tgaacttctt ctagagactg gtgttattag
     1081 caaacttaat gcaggagtaa agattcttgg caacggtaaa ttagagaaaa aattaactgt
     1141 aaaagccaat aaattctctg cttctgcgaa agaagctgtt gaagctgctg gcggtacagc
     1201 tgaggtgatc taacttgttt aaaacaatct ccaactttat gcgtgtgagt gatatcagga
     1261 ataaaatcat attcacttta ctcatgctta tcgtctttcg cataggtgcg tttattcctg
     1321 tgccttacgt taacgctgaa gcgttacagg cacagtctca aatgggtgtt tttgatctcc
     1381 ttaatacatt tggcggcggt gcgctttacc aattttccat tttcgcaatg ggaattactc
     1441 cttatatcac ggcttcgatc atcattcagc tgcttcagat ggatgtggta ccgaagttta
     1501 ccgagtggtc taagcaaggt gaagttggcc gccgtaaatt agctcagttc acaaggtact
     1561 ttacgattgt gcttggtttc atccaagcgt taggtatgtc atatggattc aacaatctgg
     1621 caaacggtat gctgatcgaa aaatccggtg tatcgacata tcttatcatt gctttagtgc
     1681 tcactggcgg aactgccttt ttaatgtggc ttggggaaca aattacttct catggagtag
     1741 gcaacggaat atcgatcatt atcttcgcgg ggattgtgtc tagtattcca aaaacaattg
     1801 ggcaaatata tgagactcaa tttgtcggca gcaacgatca gttgtttatt catattgtga
     1861 aagtcgcact tcttgtgatt gcgattttag cagttattgt tggagttatt ttcattcagc
     1921 gaggtcagtc tacacacctt ccattgaaag tgaatcctgc aggggttatt ccggtaatct
     1981 ttgcggttgc gtttttgata acgccgcgga cgatcgcgtc attctttgga acaaacgatg
     2041 tgacaaagtg gattcaaaac aactttgata atacgcatcc ggtgggtatg gcgatatatg
     2101 ttgcgttgat tattgccttt acgtactttt atgcttttgt acaggtaaac cctgaacaaa
     2161 tggctgataa ccttaaaaaa cagggtggct atatcccggg ggttcgtcca gggaaaatga
     2221 ctcaagatag aattacgagc attttgtatc gacttacgtt tgtgggttct atattcttag
     2281 ccgtgatttc cattcttcct atctttttca ttcaattcgc tggattgcct caaagtgcac
     2341 aaattggcgg aacatctttg ttaattgttg tcggggtagc cttggagaca atgaaacaac
     2401 tagaaagcca gttggtgaaa cgaaactacc gtggatttat gaaaaactag aggaaatgga
     2461 tttatccatt ccctcttaat aaagagagga cgggggaaat ccgaatgaac ttagtcttaa
     2521 tggggcttcc tggtgccggt aaaggcacac agggcgaacg aattgttgag gattatggga
     2581 ttcctcatat ctcaacagga gatatgttcc gtgctgctat gaaagaagaa acaccactcg
     2641 gactcgaagc aaaatcttat attgataagg gagagctcgt acctgacgaa gtcacaatcg
     2701 gaattgtaaa agagagactt ggcaaagatg attgtgaaag aggttttctt ctggacggat
     2761 ttccgcgaac agtcgctcaa gccgaagctc ttgaagaaat tcttgaggaa tacggcaagc
     2821 cgattgatta tgtcattaac attgaagtcg ac
//
LOCUS       RATHDNFNT    1020 bp ss-mRNA            ROD       14-JUL-1990
DEFINITION  Rat neurotrophin-3 (HDNF/NT-3) mRNA, complete cds.
ACCESSION   M34643
KEYWORDS    neurotrophic factor; neurotrophin-3.
SOURCE      Rat adult (strain Sprague-Dawley) hippocampus, cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1020)
  AUTHORS   Ernfors,P., Ibanez,C.F., Ebendal,T., Olson,L. and Persson,H.
  TITLE     Molecular cloning and neurotrophic activities of a protein with
            structural similarities to b-nerve growth factor: Developmental and
            topographical expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.P.Ernfors, 18-MAY-1990.
FEATURES       from  to/span     description
    pept         73      849     neurotrophin-3 (HDNF/NT-3) precursor
    sigp         73      120     neurotrophic factor/neurotrophin-3 (HDNF/NT-3)
                                 signal peptide
    matp        490      846     neurotrophic factor/neurotrophin-3
BASE COUNT      320 a    229 c    226 g    245 t
ORIGIN      
        1 gtcgacgtcc ctggaaatag tcatacggat gccatggtta cttctgccac gatcttacag
       61 gtgaacaagg tgatgtccat cttgttttat gtgatatttc ttgcttatct ccgtggcatc
      121 caaggcaaca acatggatca aaggagtttg ccagaagact ctctcaattc cctcattatc
      181 aagttgatcc aggcggatat cttgaaaaac aagctctcca agcagatggt agatgttaag
      241 gaaaattacc agagcaccct gcccaaagca gaggcaccca gagaaccaga gcagggagag
      301 gccaccaggt cagaattcca gccgatgatt gcaacagaca cagaactact acggcaacag
      361 agacgctaca attcaccccg ggtcctgctg agtgacagca cccctttgga gccccctccc
      421 ttatatctaa tggaagatta tgtgggcaac ccggtggtaa ccaatagaac atcaccacgg
      481 aggaaacgct atgcagagca taagagtcac cgaggagagt actcagtgtg tgacagtgag
      541 agcctgtggg tgaccgacaa gtcctcagcc attgacattc ggggacacca ggttacagtg
      601 ttgggagaga tcaaaaccgg caactctcct gtgaaacaat atttttatga aacgaggtgt
      661 aaagaagcca ggccagtcaa aaacggttgc agggggattg atgacaaaca ctggaactct
      721 cagtgcaaaa cgtcgcaaac ctacgtccga gcactgactt cagaaaacaa caaactcgta
      781 ggctggcgct ggatacgaat agacacttcc tgtgtgtgtg ccttgtcaag aaaaatcgga
      841 agaacatgaa ttggcatctg tccccacata taaattatta ctttaaatta tatgatatgc
      901 atgtagcata taaatgttta tattgttttt atatattata agttgacctt tatttattaa
      961 acttcagcaa cccttacagt atataagctt ttttcataat cgggctgctc aaaaaaaaaa
//
LOCUS       ABCRRAA       118 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  Acetobacter sp. (strain MB 58) 5S ribosomal RNA, complete sequence.
ACCESSION   M34766
KEYWORDS    5S ribosomal RNA.
SOURCE      Acetobacter sp. (strain MB 58) rRNA.
  ORGANISM  Acetobacter sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Azotobacteraceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      118     5S ribosomal RNA
BASE COUNT       27 a     40 c     32 g     17 t      2 others
ORIGIN      
        1 gatctggtgg ccatggcggg agcaaatcag ccgatcccat cccgaactcg gccgtcaaat
       61 gccccagcgc ccatgatact ctgcctcaag gcacggaaaa gtcggtcgcc gccagayy
//
LOCUS       ABCRRAB       118 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  Acetobacter sp. (strain 914) 5S ribosomal RNA, complete sequence.
ACCESSION   M34767
KEYWORDS    5S ribosomal RNA.
SOURCE      Acetobacter sp. (strain 914) rRNA.
  ORGANISM  Acetobacter sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Azotobacteraceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      118     5S ribosomal RNA
BASE COUNT       27 a     42 c     29 g     18 t      2 others
ORIGIN      
        1 nntctgatgg ccatggcggg agcaaatcac ccgatcccat cccgaactcg gccgtcaaat
       61 gccccagcgc ccatgatact ctgcctcaag gcacggaaaa gtcggtcgcc gccagatc
//
LOCUS       ANCRRAA       117 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  A.aquaticus 5S ribosomal RNA, complete sequence.
ACCESSION   M34775
KEYWORDS    5S ribosomal RNA.
SOURCE      A.aquaticus (strain Ma) rRNA.
  ORGANISM  Ancylobacter aquaticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Nonprosthecate bacteria.
REFERENCE   1  (bases 1 to 117)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      117     5S ribosomal RNA
BASE COUNT       23 a     40 c     32 g     22 t
ORIGIN      
        1 cctggtggcc attgcgagga gccccaaccc gatcccatcc cgaactcggc cgttaaactc
       61 ctcagcgccg atggtactat gtctcaagac ctgggagagt aggtcgctgc caggcct
//
LOCUS       BLTRRAA       118 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  B.viscosus 5S ribosomal RNA, complete sequence.
ACCESSION   M34774
KEYWORDS    5S ribosomal RNA.
SOURCE      B.viscosus (strain D7) rRNA.
  ORGANISM  Blastobacter viscosus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Nonprosthecate bacteria.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      118     5S ribosomal RNA
BASE COUNT       23 a     38 c     35 g     22 t
ORIGIN      
        1 gcctggtggc tatggcgagg agtccgaacc cgatcccatc ccgaactcgg ccgttaaact
       61 cctcagcgcc aatggtactt cgtctcaaga cgcgggagag taggtcgctg ccaggcct
//
LOCUS       HYVRRAA       119 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  H.vulgare 5S ribosomal RNA, complete sequence.
ACCESSION   M34768
KEYWORDS    5S ribosomal RNA.
SOURCE      H.vulgare (strain NP-160) rRNA.
  ORGANISM  Hyphomicrobium vulgare
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Prosthecate bacteria.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      119     5S ribosomal RNA
BASE COUNT       22 a     34 c     37 g     26 t
ORIGIN      
        1 gacctggtga ttatggcggg tggtctgcac ccgatctcat tccgaactcg gccgtgaaac
       61 cgcccagcgc caatggtact tcgtcttaag acgcgggaga gtaggtcgtc gccaggtct
//
LOCUS       HYVRRAB       119 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  Hyphomicrobium sp. 5S ribosomal RNA, complete sequence.
ACCESSION   M34769
KEYWORDS    5S ribosomal RNA.
SOURCE      Hyphomicrobium sp. (strain G10) rRNA.
  ORGANISM  Hyphomicrobium sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Prosthecate bacteria.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      119     5S ribosomal RNA
BASE COUNT       20 a     34 c     39 g     26 t
ORIGIN      
        1 gacctggtga ttatggcggg gtggctgcac ccgatcccat tccgaactcg gccgtgaaac
       61 gcccctgcgc cgatggtact tcgtcttaag acgcgggaga gtaggtcgtt gccaggtct
//
LOCUS       MAGRRAA       119 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.agile 5S ribosomal RNA, complete sequence.
ACCESSION   M34759
KEYWORDS    5S ribosomal RNA.
SOURCE      M.agile (strain A20) rRNA.
  ORGANISM  Methylomonas agile
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Methylococcaceae.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      119     5S ribosomal RNA
BASE COUNT       29 a     29 c     36 g     22 t      3 others
ORIGIN      
        1 gcttggtggc catagcgagc gtgaaccacc cgatcccatc ccgaactcgg aagtgaaacc
       61 gcttagcgcc gatgatagtg cgacaggttg tcgtgcgaaa gtagggaatt gtmaagcnn
//
LOCUS       MAGRRAB       119 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.rubra 5S ribosomal RNA, complete sequence.
ACCESSION   M34760
KEYWORDS    5S ribosomal RNA.
SOURCE      M.rubra (strain 15S) rRNA.
  ORGANISM  Methylomonas rubra
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Methylococcaceae.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      119     5S ribosomal RNA
BASE COUNT       30 a     31 c     33 g     23 t      2 others
ORIGIN      
        1 gcttggtgac catagcaagc gtgaaccacc cgatcccatc ccgaactcgg aagtgaaacc
       61 gcttagcgcc gatgatagtg tggcaggttg ccatgtgaaa gtaggtcatc gcmaagctn
//
LOCUS       MAGRRAC       117 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.methanica 5S ribosomal RNA, complete sequence.
ACCESSION   M34761
KEYWORDS    5S ribosomal RNA.
SOURCE      M.methanica (strain 12) rRNA.
  ORGANISM  Methylomonas methanica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Methylococcaceae.
REFERENCE   1  (bases 1 to 117)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      117     5S ribosomal RNA
BASE COUNT       27 a     30 c     32 g     24 t      4 others
ORIGIN      
        1 ncttggtgac catagcgagc gtgraccacc cgatcccatc ccgaactcgg aagtgaaacc
       61 gctttgcgcc gatgatagtg tggcagtttg ccatgtgaaa gtaggtcatc gcmaagn
//
LOCUS       MLCRRAA       118 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.capsulatus (strain ATCC 19069) 5S ribosomal RNA, complete
            sequence.
ACCESSION   M34758
KEYWORDS    5S ribosomal RNA.
SOURCE      M.capsulatus (strain ATCC 19069) rRNA.
  ORGANISM  Methylococcus capsulatus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Methylococcaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      118     5S ribosomal RNA
BASE COUNT       30 a     36 c     36 g     16 t
ORIGIN      
        1 gcctggcggc cagagcgagc gggaaccacc cgatcccatc ccgacctcgg aagtgaaacc
       61 gcttagcgcc gatgatagtg caagaatacc tgtgcgaaag tagggaactg ccaggctc
//
LOCUS       MLCRRAB       117 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.capsulatus (strain Y) 5S ribosomal RNA, complete sequence.
ACCESSION   M34762
KEYWORDS    5S ribosomal RNA.
SOURCE      M.capsulatus (strain Y) rRNA.
  ORGANISM  Methylococcus capsulatus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Methylococcaceae.
REFERENCE   1  (bases 1 to 117)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      117     5S ribosomal RNA
BASE COUNT       31 a     28 c     31 g     23 t      4 others
ORIGIN      
        1 nyttggtgac catagcgagc gtgacccacc cgatcccatc ccgaactcgg aagtgaaacc
       61 gcttagtgcc gatgatagtg tggcaatttg ccatgcgaaa gtagggaatt gcmaaan
//
LOCUS       MTBRRAA       115 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.organophilum 5S ribosomal RNA, complete sequence.
ACCESSION   M34770
KEYWORDS    5S ribosomal RNA.
SOURCE      M.organophilum (strain NP-220) rRNA.
  ORGANISM  Methylobacterium organophilum
            Prokaryota; Bacteria; Methylmonadaceae.
REFERENCE   1  (bases 1 to 115)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      115     5S ribosomal RNA
BASE COUNT       23 a     41 c     34 g     17 t
ORIGIN      
        1 ccggcggcct gagcggtgtg ccagaacccg atcccatctc gaactcggcc gttaaacaca
       61 ccagcgccca tggtactgtg ctccaagaca cgggagagtc ggtcgccgcc aggct
//
LOCUS       MTBRRAB       118 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.extorquens 5S ribosomal RNA, complete sequence.
ACCESSION   M34771
KEYWORDS    5S ribosomal RNA.
SOURCE      M.extorquens (strain AM1) rRNA.
  ORGANISM  Methylobacterium extorquens
            Prokaryota; Bacteria; Methylmonadaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      118     5S ribosomal RNA
BASE COUNT       24 a     39 c     37 g     18 t
ORIGIN      
        1 gcctggcggc ctgagcggtg tggaagaacc cgatcccatc tcgaactcgg ccgttaaaca
       61 caccagcgcc gatggtactg tgtctcaaga cacgggagag tcggtcgccg ccaggccc
//
LOCUS       MYLRRAA       119 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.trichosporium 5S ribosomal RNA, complete sequence.
ACCESSION   M34763
KEYWORDS    5S ribosomal RNA.
SOURCE      M.trichosporium (strain 4E) rRNA.
  ORGANISM  Methylosinus trichosporium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Methylococcaceae.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      119     5S ribosomal RNA
BASE COUNT       28 a     29 c     33 g     24 t      5 others
ORIGIN      
        1 nncttggtga ctatagtgag cgtgaaccac ccgatcccat cccgaactcg gaagtgaaac
       61 cgcttagcgc cgatgatagt gtggccgttt gccatgcgaa agtagggcat tgcmaagyn
//
LOCUS       MYMRRAA       115 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.methylotrophus 5S ribosomal RNA, complete sequence.
ACCESSION   M34765
KEYWORDS    5S ribosomal RNA.
SOURCE      M.methylotrophus (strain NCIB 10515) rRNA.
  ORGANISM  Methylophilus methylotrophus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Methylococcaceae.
REFERENCE   1  (bases 1 to 115)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      115     5S ribosomal RNA
BASE COUNT       29 a     33 c     28 g     25 t
ORIGIN      
        1 cttggggaca atagcggttt ggaaccaccc cttcccatct cgaacagggc cgtgaaacga
       61 acttgcgcca atgatagtgt actcttcgta tgcgaaagta ggtcatcccc aagcc
//
LOCUS       MYPRRAA       119 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  M.parvus 5S ribosomal RNA, complete sequence.
ACCESSION   M34764
KEYWORDS    5S ribosomal RNA.
SOURCE      M.parvus (strain 492) rRNA.
  ORGANISM  Methylocystis parvus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      119     5S ribosomal RNA
BASE COUNT       28 a     32 c     33 g     22 t      4 others
ORIGIN      
        1 nncttggtga ccatagcgag cgtgaaccac ccgatcccat cccgaactcg gaagtgaaac
       61 cgcttagcgc cgatgatagt gtggccgttt gccatgcgaa agtagggcat tgcmaagcn
//
LOCUS       PSERRAA       118 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  P.stutzeri 5S ribosomal RNA, complete sequence.
ACCESSION   M34776
KEYWORDS    5S ribosomal RNA.
SOURCE      P.stutzeri (strain 8) rRNA.
  ORGANISM  Pseudomonas stutzeri
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      118     5S ribosomal RNA
BASE COUNT       23 a     35 c     35 g     25 t
ORIGIN      
        1 gcctggtggc tatggcgagg aggcctaacc cgatcccatc ccgaactcgg ccgttaaact
       61 cctcagtgcc gatggtacta tgtcttaaga cctgggagag taggtcgtcg ccaggcct
//
LOCUS       RECRRAA       116 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  R.vacuolatum 5S ribosomal RNA, complete sequence.
ACCESSION   M34773
KEYWORDS    5S ribosomal RNA.
SOURCE      R.vacuolatum (strain Rv) rRNA.
  ORGANISM  Renobacter vacuolatum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Nonprosthecate bacteria.
REFERENCE   1  (bases 1 to 116)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      116     5S ribosomal RNA
BASE COUNT       23 a     36 c     32 g     25 t
ORIGIN      
        1 ctggtggcca ttgcgaggag ccccaattcg atctcactcc gaactcggcc gttaaactcc
       61 tcagcgccga tggtactatg tctcaagacc tgggagagta ggtcgctgcc aggcct
//
LOCUS       TUMRRAA       116 bp ss-rRNA            RNA       14-JUL-1990
DEFINITION  T.mutans 5S ribosomal RNA, complete sequence.
ACCESSION   M34772
KEYWORDS    5S ribosomal RNA.
SOURCE      T.mutans (strain U2) rRNA.
  ORGANISM  Tuberoidobacter mutans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Nonprosthecate bacteria.
REFERENCE   1  (bases 1 to 116)
  AUTHORS   Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I.,
            Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M.
  TITLE     Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA
            sequencing
  JOURNAL   J. Gen. Microbiol. 136, 441-446 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      116     5S ribosomal RNA
BASE COUNT       23 a     36 c     33 g     24 t
ORIGIN      
        1 ctggtggcca ttgcgaggag cctgaacccg actccactcc gaactcggcc gttaaactcc
       61 tcagcgccga tggtactatg tctcaagacc tgggagagta ggtcgctgcc aggtct
//
LOCUS       ANAPETF1A    1682 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  A.variabilis ferredoxin (petF1) gene, complete cds.
ACCESSION   M35126
KEYWORDS    ferredoxin.
SOURCE      A.variabilis (ATCC 29413; PCC 7937) DNA, clone pVA1.
  ORGANISM  Anabaena variabilis
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Nostocaceae.
REFERENCE   1  (bases 1 to 1682)
  AUTHORS   Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M.,
            Van Arkel,G. and Weisbeek,P.
  TITLE     Genes encoding ferredoxins from Anabaena sp. PCC 7937 and
            Synechococcus sp. PCC 7942: Structure and regulation
  JOURNAL   Photosyn. Res. 18, 179-204 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        794     1093     ferredoxin (petF1)
    binding     782      785     ribosomal binding site (put.)
BASE COUNT      480 a    353 c    322 g    527 t
ORIGIN      
        1 aagcttgatg taagccacct aaaagttgaa tgtgttcgac gtggagtaat ttttgctgtg
       61 ctttggctaa ggcaatcagt tcttctgctt cagttaaatc tacagataaa ggatattcta
      121 caattacgtg tttgccggca gttagagcag cacggcgatc gcaccatgat cacgattgat
      181 ggtacaaata accactaaat ctatatcttc tcgttctact aactgttgcc agccagttat
      241 cgcttcagcc tggtaatctt gagccaaagc ctgggttcgc tccagtgtac tacccgctat
      301 ggcgactagg tgcgatcgct tatcctccaa aaaggcttct gcccgcagtt ttgctgcata
      361 ccctgtccaa ctatgccaat acgtattgtt gcttgtgcca aagctacctc cgactcctat
      421 ataatcctca ccagtcctgc taccaagcac agtaataaat ttttattttt caaaaaaagc
      481 taggttaacg cagtaccagc ctaactgaag gttatccctg ttccagaact tttttatcat
      541 attcttgatt caagccataa atttttctaa taactgtggg tttaacgtct ttaaagtata
      601 aaaaaaattt attgttatct gctaactaaa tttcattact aatcgtggtg gatttcaagt
      661 tatatacttt gatttttccc gtagtatcag aattgaacta aatttaattc tgtggctaat
      721 cccctgagaa tagccgctaa gttctgcttt agcataactt atactgccga ttacaaaaga
      781 gaggattacg gaaatggcaa cttttaaagt tacattgatc aacgaagcag aaggaaccag
      841 caacacaatt gacgttcctg atgatgagta tattttagac gctgccgaag aacagggtta
      901 tgacctaccc ttttcctgtc gtgcaggtgc ttgctccacc tgcgccggta aactagtatc
      961 cggtactgtt gaccagtctg accaatcatt cttggatgac gatcaaatcg aagctggata
     1021 tgtattgacc tgtgttgctt atccaacctc tgatgtaacc atccaaaccc acaaagaaga
     1081 agacctctac taagagtcaa gttagcttta aaaatagagg ctagaggcta gcagctacag
     1141 actagtccct agccttttta ttttttttca ttagggaaca gtctattggg actgggtgta
     1201 agggtgtaag ggtatagggg tgtaatagtt tcaaacattt atactctttt caacccttga
     1261 tatttcgttt tcatccgtaa gtcccacaaa aaataaatta tccaaaattg atggtttggt
     1321 agggtgcgtc aatagaaatc atttctgagt gtatttaggc tctatcgcac tgacggacac
     1381 tacattttgg atattttttg atctgaaagt cccttaaaat ttgatttctt cattcaggct
     1441 gtgaagttgt gtaccaggat aataaaattg gagaattttg ctgttagacc aacctagttt
     1501 agctaaagtt tgagcgccag tttgacttaa gcccactcca tgtcctaaac caccaccaat
     1561 aaaagcgtat cccacaactc ttctttgcct ttgttgagag gtagtatgta aaaaagcgtg
     1621 cttcttggcg cagcaaaggc actacgcact tcatctttgt gtagggtaaa aatgcccata
     1681 tc
//
LOCUS       ANIPETF1A    1074 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  A.nidulans ferredoxin (petF1) gene, complete cds.
ACCESSION   M35127
KEYWORDS    ferredoxin.
SOURCE      A.nidulans (strain R2, PCC7942) DNA, clone pA612.
  ORGANISM  Anacystis nidulans
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria.
REFERENCE   1  (bases 1 to 1074)
  AUTHORS   Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M.,
            Van Arkel,G. and Weisbeek,P.
  TITLE     Genes encoding ferredoxins from Anabaena sp. PCC 7937 and
            Synechococcus sp. PCC 7942: Structure and regulation
  JOURNAL   Photosyn. Res. 18, 179-204 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        504      803     ferredoxin (petF1)
    binding     493      496     ribosomal binding site (put.)
BASE COUNT      234 a    298 c    281 g    261 t
ORIGIN      
        1 ctgcagccgc tgcatgcgca gggcgttcga tttgagggct ttttgtggaa catcttggcg
       61 cacatcatcg aagctctaac accgaggctg tctgagttgg ctagtcctgg cagctgggca
      121 atctttagcg gcttgctaac cagtcaagcc gacactgtca gcgtcacttt ggaagagtac
      181 ttttgggtga tccgcgatcg cgccagtcag ggagattggt gtcgtttggt cgcggatttt
      241 cggccggaac gataaatctc actaatgctt agcttagagg gcttactggg agcgggccga
      301 gtttgagccg tgattacccc tacgaacttt ccggccacgc tccattgctt agacataaaa
      361 ttcccttatg tctagactgg cgattgatag catttctcgc ggcgcagttc gccctttggc
      421 aacccatagt atcaatggga aaggtacggg caggctgtca atcgatgagc tctgccaccc
      481 caaaagcgat agaggacacg ctcatggcaa cctacaaggt tacgctcgtc aatgctgccg
      541 aaggcttgaa caccacgatc gacgtggctg acgataccta catcttggac gccgctgaag
      601 agcaaggcat tgacctgcct tactcctgcc gtgctggtgc ttgctcgacc tgtgctggca
      661 aagtcgtctc tggtaccgtc gaccaatcgg atcaatcctt cttggatgac gaccaaattg
      721 cagcaggctt tgtcctgacc tgcgtcgcct atccgacctc cgatgtgacg atcgaaaccc
      781 acaaagaaga agacctctac taagtcttgc ttcgactgct tcaatcctta gaattcaaat
      841 caaattgcgg cttccaaatt gggagtcgct tttttatcgc cgtagtcagc agcgatcgcc
      901 tcactaccgt ccagaaacag cctgcgatcg ccctgactta acgcccctcc cctagcaatc
      961 ctgactcggt agagttcagg ccaaggactc aaggttgagg cgatcgttca gcaacgtctt
     1021 gctctaggcg tccggcgtga ctgttgtggg gaagaggcga tcgccttcct gcag
//
LOCUS       ASAASPPA     1764 bp ds-DNA             PLN       14-JUL-1990
DEFINITION  A.awamori aspergillopepsin A (pepA) gene, complete cds.
ACCESSION   M34454
KEYWORDS    aspartic proteinase; aspergillopepsin.
SOURCE      A.awamori (NRRL3112 mutant strain UVK143f) DNA.
  ORGANISM  Aspergillus awamori
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 1764)
  AUTHORS   Berka,R.M., Ward,M., Wilson,L.J., Hayenga,K.J., Kodama,K.H.,
            Carlomagno,L.P. and Thompson,S.A.
  TITLE     Molecular cloning and deletion of the gene encoding
            aspergillopepsin A from Aspergillus awamori
  JOURNAL   Gene 86, 153-162 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        177      496     aspergillopepsin (pepA), exon 1
                548      825     aspergillopepsin, exon 2
                878     1126     aspergillopepsin, exon 3
               1186     1523     aspergillopepsin, exon 4
    sigp        177      308     aspergillopepsin signal peptide
    matp        455      496     aspergillopepsin
                548      825     aspergillopepsin
                878     1126     aspergillopepsin
               1186     1523     aspergillopepsin
    pre-msg     124     1764     pepA mRNA and introns (put.)
    pre-msg     125     1764     pepA mRNA and introns (put.)
    pre-msg     127     1764     pepA mRNA and introns (put.)
    IVS         497      547     pepA intron A
    IVS         826      877     pepA intron B
    IVS        1127     1185     pepA intron C
    signal       54       58     TATA box
BASE COUNT      387 a    515 c    445 g    417 t
ORIGIN      
        1 gtcgacttgg atgatggaga gttgatcgag gtcaatgagg agaggcttgc aagtataaga
       61 agagactgct cgaccagcag aatggatctt cttgttcatc aaccaagagt ccaaggcttc
      121 tttgtctggt tctatctctt ctccgaactc tcttgcttga cattctcgtg gtcaaaatgg
      181 tcgtcttcag caaaaccgct gccctcgttc tgggtctgtc ctccgccgtc tctgcggcgc
      241 cggctcctac tcgcaagggc ttcaccatca accagattgc ccggcctgcc aacaagaccc
      301 gcaccatcaa cctgccaggc atgtacgccc gttccctggc caagtttggc ggtacggtgc
      361 cccagagcgt gaaggaggct gccagcaagg gtagtgccgt gaccacgccc cagaacaatg
      421 acgaggagta cctgactccc gtcactgtcg gaaagtccac cctccatctg gactttgaca
      481 ccggatctgc agatctgtaa gcttccctgc tcgggtgttc gggcaaatcg tgactaacct
      541 ggactagctg ggtcttctcg gacgagctcc cttcctcgga gcagaccggt cacgatctgt
      601 acacgcctag ctccagcgcg accaagctga gcggctacac ttgggacatc tcctacggtg
      661 acggcagctc ggccagcgga gacgtgtacc gggatactgt cactgtcggc ggtgtcacca
      721 ccaacaagca ggctgttgaa gcagccagca agatcagctc cgagttcgtt cagaacacgg
      781 ccaatgacgg ccttttggga ctggccttta gctccatcaa cactggtgag tcaatcctac
      841 atcagccggg ttgacctacc tgctgaccga tagacagtcc agccaaaggc gcagacaacc
      901 ttcttcgaca ccgtcaagtc ccagctggac tctccccttt tcgccgtgca gctgaagcac
      961 gacgcccccg gtgtttacga ctttggctac atcgatgact ccaagtacac cggttctatc
     1021 acctacacgg atgccgatag ctcccagggt tactggggct tcagcaccga cggctacagt
     1081 atcggtgacg gcagctccag ctccagcggc ttcagcgcca ttgctggtaa gaaccgcctt
     1141 catttaacac acaacttgtc cacctcttta ctaactagtg tatagacacc ggtaccaccc
     1201 tcatcctcct cgatgacgaa atcgtctccg cctactacga gcaggtttct ggcgcctcag
     1261 gagagacgga agccggtggc tacgttttct cttgctcgac caacccccct gacttcactg
     1321 tcgtgattgg cgactacaag gccgttgttc cgggcaagta catcaactac gctcccatct
     1381 cgactggcag ctccacctgc tttggcggta tccagagcaa cagcggtctg ggactgtcca
     1441 tcctgggtga tgttttcttg aagagccagt acgtggtctt caactctgag ggccctaagc
     1501 tgggattcgc cgctcaggct tagattatcc actgaagtgg agtctatgat ctgctgattg
     1561 atccctcgac gatgaactac atgtggaaat gcatagcaga cgagggtgat ggtgatgatg
     1621 ttgatttgat gatgacccgt acatacttga tgaagctcgg tacatatgca atgtgactgt
     1681 atctatgtga tgatatatgt atcatctcat aacttttggc tatgagtgca ggataacact
     1741 gaaccagtag tagtactttc cact
//
LOCUS       BSUPSIF2A    2758 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  B.subtilis protein synthesis initiation factor 2 (infB) gene,
            complete cds.
ACCESSION   M34836
KEYWORDS    protein synthesis initiation factor 2.
SOURCE      B.subtilis (strain RS410) DNA, clones lambda-JET[1,2],pUK, and
            pJET1.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2758)
  AUTHORS   Shazand,K., Tucker,J., Chiang,R., Stansmore,K.,
            Sperling-Petersen,H.U., Grunberg-Manago,M., Rabinowitz,J.C. and
            Leighton,T.
  TITLE     Isolation and molecular genetic characterization of the Bacillus
            subtilis gene (infB) encoding protein synthesis initiation factor 2
  JOURNAL   J. Bacteriol. 172, 2675-2687 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        381     2531     protein synthesis initiation factor 2 (infB)
    binding     369      375     ribosomal binding site (put.)
BASE COUNT      917 a    535 c    662 g    644 t
ORIGIN      
        1 atcgatgacc agattttcga tgaattgctg gaactggcgg aaaaggtgaa aaaataaaat
       61 gtctggaatg gaatggtttc ccttgctggg tctggccaat cgagctcgta aggtcgtgtc
      121 aggcgaagac ttggtaataa aagaaatcag gaatgcgcgt gcaaagcttg tcctgcttac
      181 agaggatgca tcatctaaca cagcaaaaaa agtaactgac aagtgcaatt attataaagt
      241 cccttataaa aaagtcgaga gtcgcgcggt tcttggacgc tctatcggta aagaagcccg
      301 tgtcgttgtc gccgtcactg accaaggttt tgcgaataag ctgatcagct tgctcgatta
      361 atatttttgg gggtgaacga atggctaaaa tgagagtata cgaatatgca aaagcgttaa
      421 atgtttcaag taaggaaatt ttgaccgcac tgaagaacat ggatttagaa gtgaataatc
      481 acatggccat gcttgaagaa aaggccatta aaaagctaga tgccaaatat aaaaaaggcg
      541 gcgcacgtgc taaatctcaa aagccagcag aaacgaacaa aaacaaacag ccacaagggg
      601 ttaatcagca atcagctgga aatcaaccaa ataaaattcg agacggaaag aagaatgacg
      661 tgcagaataa tcaatttaac aaaaacaaga agaataacaa caacaaaaaa aataaacgca
      721 acaacaacaa taataaaaac caacatcagc aaaagcctgt aaagccgaaa aaagagcttc
      781 ctgagaaaat tacattctct ggcactttaa cagttggcgc acttgctgaa gagcttggca
      841 aagagccttc agaactcatt aaaaagctga tgcttcttgg cgtaatggca accattaacc
      901 aagagcttga taaagacaca atcgaactca ttgcatcaga atatggtgtt gaaacagaag
      961 aggtcattgt gcttgaagaa acagagctgg aaaaatacga agagcctgat aatgaagagg
     1021 atcttgaaat tcgtcctcct gtcgtgacaa tcatgggcca cgttgaccat gggaaaacaa
     1081 cgcttcttga cagcatccgt aaaacaaagg ttgttgaagg ggaagcaggc ggaatcactc
     1141 agcatatcgg cgcttatcaa attgaagaga acggcaagaa aatcacgttc ttggatacac
     1201 cgggccacgc cgcatttaca acaatgcgcg cacgcggtgc agaagtaact gatattacga
     1261 ttcttgtcgt agctgccgat gacggtgtca tgccgcaaac agttgaagcg attaaccatg
     1321 caaaagcagc agaggttcca atcatcgttg ctgtgaataa aatagataaa gaatccgcaa
     1381 accctgaccg tgtaatgcaa gaactgacgg aatacggact tgtccctgaa gcttggggcg
     1441 gagaaaccat ttttgtgccg ctttccgctt taactggaaa aggcattgat gagctcgttg
     1501 aaatgatttt gcttgtcagt gaggtagagg aactgaaagc gaatccgaat cgtcaggcaa
     1561 aaggaacggt tattgaagct gaactcgata aaggcagagg gtcagtcgcg acgttgctcg
     1621 tacagactgg aacactgcat gtcggtgatc cgatcgtagt cggcaataca tttggccgtg
     1681 tccgtgcaat ggtcaacgac attggccgcc gtgtgaaaac tgccggcccg tcaactccgg
     1741 ttgaaattac cggtttgaat gatgtccctc aagcgggaga ccaattcctt gtctttaagg
     1801 atgaaaaaac agctcgttct gtcggtgaag cccgtgcttc gaaacagctt gaagagcagc
     1861 gcagcgataa agcgaagctc agtcttgatg atttatttga gcaaattaag caaggtgatg
     1921 taaaagacat caacctcatc gtaaaagctg acgttcaagg atctgctgaa gctttaacgg
     1981 ctgcacttca aaaaattgaa gtagaaggcg ttaaagtgaa aatcatccat acaggcgttg
     2041 gtgcgattac tgaatcagac attatcttgg catctgcttc caatgcaatt gttatcgggt
     2101 ttaatgtgag accggacgga aatgctaaga gtacggctga agctgaaaat gtagatattc
     2161 gacttcaccg tatcatttac aaagtaatcg acgagattga agctgccatg aaaggtatgc
     2221 ttgatcctga atatgaagaa aaagtaattg gtcaagtaga agtacgccaa acattcaaag
     2281 tatctaaaat cggtacaatt gccggcggat atgttactga aggaaccatt acgcgcgaca
     2341 gcggcctccg tttaattcgt gacggcgtcg tcatctttga aggcgaagta gatgttctga
     2401 aacgctttaa agacgatgtg aaagaagttt cacaaggcta tgaatgtggt attacaatta
     2461 agaaatacaa tgacattcgt gaaggtgaca tccttgaagc gtttgtcatg caagaaattg
     2521 aaagaacgtg atcggatttg cggagtgtga atgcatcatt tatgatgcag gatcgctaaa
     2581 agaaaagcgt gccgttctga agcggatttt aaccagggtt caaaacaagt tcaatgtttc
     2641 gatttcggag attggctatc aggacacctg gcaaagaacc agcttcggaa tcgccgctgt
     2701 ttcttcctct cgcgttcaaa cagaaaaaga actgcagcgc gtcctagcgt ttatcgat
//
LOCUS       CIBRCRA      1845 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Plasmid Colicin Ib-P9 replication control region replicon initiator
            protein (repZ) gene, complete cds.
ACCESSION   M34837
KEYWORDS    replicon initiator protein.
SOURCE      Plasmid Colicin Ib-P9 (from E.coli) DNA, clone pCH10-delta-1.
  ORGANISM  Plasmid Colicin Ib-P9
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1845)
  AUTHORS   Hama,C., Takizawa,T., Moriwaki,H., Urasaki,Y. and Mizobuchi,K.
  TITLE     Organization of the replication control region of plasmid Co1Ib-P9
  JOURNAL   J. Bacteriol. 172, 1983-1991 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        455     1486     repZ protein
    mRNA        157     1762     repZ protein mRNA (alt.)
    mRNA        158     1762     repZ protein mRNA (alt.)
    mRNA        363  <     1 (c) inc RNA
    binding     442      448     ribosomal binding site (put.)
    signal      377      372 (c) inc -10 region
    signal      400      395 (c) inc -35 region
    mut         353      353     c in wild type; t in inc3
    mut         374      374     t in wild type; c in inc2
    mut         400      400     a in wild type; c in inc1
BASE COUNT      505 a    467 c    490 g    383 t
ORIGIN      
        1 gatctaaagc agaaaaatct gcttttaaaa atagatttta tttttttgac gcaggtcaag
       61 attaacttat tagaggtatc agtgaggagg cactggaaga gaagagatcg ttgtaatgct
      121 tttcaaatta acgtaaagcg ggtatatttc ggctgttatt agctgtgcag agggtggcac
      181 tctgtggagc aaagcggcga aagccggacg gcagaatgcg ccataaggca ttcaggagag
      241 atggcatgta cgggcagtaa gtcagaagac tgaagatgtt ccggaagcca taaaaggaaa
      301 acccccacta tctttcttac gaacttggcg gaaagacgaa agatagtggg ggcctcacag
      361 aatacgggta aagtataatg aaaccgtacc agagattcaa ccctgtgcag tgtataaata
      421 cacggcacaa tcgctccgcc ataagcgaca gcttgtggca ggtctgaaga atactccata
      481 taacgcagta cactggagtc agttagcacc cgaagagcag atccgtttct gggaagacta
      541 tgaagcggga agggcgacca ctttcctggt tgaaccggaa aggaagcgca cgaagcgtcg
      601 tcgcggtgag cactccacca aacccaaatg cgaaaatccg tcctggtatc gtcctgagcg
      661 ctataaagcg ctgaaagggc agctcgggca cgcctataac cgtctggtga aaaaggaccc
      721 ggtgaccggc gagcagagcc tgcgcatgcg catgtcgcgg catccttttt atgtacagaa
      781 acggacgttc gtcggtcgca aatacgcttt ccggccggaa aaacaacgtc tcctcgatgc
      841 catctggccg gttctggtca gcttcagtga tgccggcaca cacaccgtgg ggatgagcgt
      901 cactcggctg gctgaagaaa tcagcccgaa agacagcgag ggacatgtta ttccggaact
      961 ggaagtgacg gtctcccgtc tttcccgtct gctggccgaa caggtgcgct ttggtgtgct
     1021 gggtgtgtca gaggaaacaa tgtgggaccg tgaacatcgt cagcgtctgc cacgttacgt
     1081 ctggataaca ccggcaggct ggcagatgct gggcgtcgac atggtaaaac ttcacgaaca
     1141 gcagcagaaa cgactgcgtg aaagtgaaat ccgccagcag ctcattcggg aaggtgttct
     1201 gcgtgaggat gaagatatct ccgttcatgc ggccagaaaa cgctggtatc tgcagcgcag
     1261 ccaggatgcc ctgaagaagc gccgagaaaa agccgcagcc agcaagcgtg ccaaccgcct
     1321 gaagaaatta cctgttgacc agcagattta tgagatggct gagtatctcc ggaagcgtct
     1381 gccgccggat gaagcctatt tctgttccga tgaccatctg aagcgactgg ccatcaggga
     1441 gttgcgtcag cttgaactga cgctggctgc cccgccaccg cactagacag catcattccc
     1501 tcagcactga atcatcgcca gcccctccgg ggctttcggc gcagattccg ctcagcccaa
     1561 aatccttagt agtcacctta aatcccctca gaggggcata tctgcccata aaaccacgca
     1621 tcagtcatca gaacgtggcc acgttgtttc agttatccac ataaatccgc aaacaaagaa
     1681 ttttaagaag ctgcaaacct gaaacagcaa acctgcaata tattcttaac cccatcattt
     1741 aatcccctgc gttgcttcgc cgcagggaaa gtctttatct ctgaaaccac tgtgaacaaa
     1801 tacaaaaagg ccttcgcttg cagcgggcaa ggccgcgccg gaatt
//
LOCUS       ECOHEMBA     1085 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  E.coli porphobilinogen synthase (hemB) gene, complete cds.
ACCESSION   M35121
KEYWORDS    5-aminolevulinic acid dehydratase; hemB gene;
            porphobilinogen synthase.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1085)
  AUTHORS   Echelard,Y., Dymetryszyn,J., Drolet,M. and Sasarman,A.
  TITLE     Nucleotide sequence of the hemB gene of Escherichia coli K12
  JOURNAL   Mol. Gen. Genet. 214, 503-508 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         80     1054     porphobilinogen synthase (hemB)
    signal       15       20     hemB -35 region
    signal       41       46     hemB -10 region
BASE COUNT      254 a    277 c    301 g    253 t
ORIGIN      
        1 tcgcgacaac tttcgtaaaa catccctacc ctgcttcagg tatactatgc ccctcgattc
       61 cacaaacatc aggcagacca tgacagactt aatccaacgc cctcgtcgcc tgcgcaaatc
      121 tcctgcgctg cgcgtatgtt tgaagagaca acacttagcc ttaacgacct ggtgttgccg
      181 atctttgttg aagaagaaat tgacggacta caaagccgtt gaagccatgc caggcgtgat
      241 gcgcattcca gagaaacatc tggcacgcga aattgaacgc atcgccaacg ccggtattcg
      301 ttccgtgatg acttttggca tctctcacca taccgatgaa accggcagcg atgcctggcg
      361 ggaagatgga ctggtggcgc gtatgtcgcg catctgcaag cagaccgtgc cagaaatgat
      421 cgttatgtca gacacctgct tctgtgaata cacttctcac ggtcactgcg gtgtgctgtg
      481 cgagcatggc gtcgacaacg acgcgactct ggaaaattta ggcaagcaag ccgtggttgc
      541 agctgctgca ggtgcagact tcatcgcccc ttccgccgcg atggacggcc aggtacaggc
      601 gattcgtcag gcgctggacg ctgcgggatt taaagatacg gcgattatgt cgtattcgac
      661 caagttcgcc tcctcctttt atggcccgtt ccgtgaagct gccggaagcg cattaaaagg
      721 cgaccgcaaa agctatcaga tgaacccaat gaaccctcgt gaggcgattc gtgaatcact
      781 gctggatgaa gcccagggcg cagactgcct gatggttaaa cctgctggag cgtacctcga
      841 catcgtgcgt gagctgcgtg aacgtactga attgccgatt ggcgcgtatc aggtgagcgg
      901 tgagtatgcg atgattaagt tcgccgcgct ggcgggtgct atagatgaag agaaagtcgt
      961 gctcgaaagc ttaggttcga ttaagcgtgc gggtgcggat ctgattttca gctactttgc
     1021 gctggatttg gctgagaaga agattctgcg ttaattttgt ttcctctcag aaagggggag
     1081 ttaac
//
LOCUS       ECOSTMA      1418 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  E.coli F factor (stm) gene, complete cds.
ACCESSION   M35124
KEYWORDS    F factor; stm gene.
SOURCE      E.coli DNA, clones pEG[157,159,175,176,220].
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1418)
  AUTHORS   Golub,E.I. and Panzer,H.A.
  TITLE     The F factor of Escherichia coli carries a locus of stable plasmid
            inheritance stm, similar to the parB locus of plasmid RI
  JOURNAL   Mol. Gen. Genet. 214, 353-357 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        830     1009     F factor (stm)
    signal     1013     1039     pot. transcription termination signal
    signal     1074     1105     pot. transcription termination signal
BASE COUNT      279 a    364 c    429 g    345 t      1 others
ORIGIN      
        1 atcgctgtct gacctgcgat ttttcatgcc gtccctgacc gcagaagaac tgcacggcaa
       61 ccgtcttcag tggctgtatg gcatcgatgt gctgattgaa acacagggcg aggtctgcct
      121 ccttcccctt cccggtgatg cagcagagcg gtcgtttccg tcggtgcggt ttcgtgtcag
      181 ggagcgcagt cgccataaat ccgcgctggt catgcagaag tacagccgcc agcaggcacg
      241 ggaggcagag cagaaagccc gtgcgtacca ggcgctggtc gctcaggctg aaatcgaact
      301 ggcattccat tcgccggaaa cagtcggaag ctggcatgca cgctggagtg accgggtggc
      361 agagcatgac ctggaaaccc tgttctggca gtggggcgag cgttttccgt cactggccgg
      421 aatggaacgc tggcagtggc aggatatgcc gttcgtggca ggttattgcg gaacgcagtc
      481 ttgcagcaag agaggccggt cacgctgtcc gtgagatgag cgtgatgtgc cgaacaaact
      541 ccggaggcag cgtgatgcgg gcaacaatca cacagattac ccgtaaacag cctgaatgag
      601 cgggttattt tcaggaaaag tatgtgtggt cagcgtgcag ggatatggct atgatgtgcc
      661 cggcgcttga ggctttctgc ctcatgacgt gaaggtggtt tgttgccgtg ttgtgtggca
      721 gaaggacaaa agccccatag ttaatttttc attaacccac gaggcccctg catgcttaga
      781 caacatcagg atagcctctt actgcgcttt cgacgaggag aagaaggcca tgaaactacc
      841 acgcagctct cttgtctggt gtgtgttgat cgtgtgtctc acactgttga tattcactta
      901 tctgacacga aaatcgctgt gcgagattcg ttacagagac ggatacaggg aggtngcggc
      961 cggatacagg gaggtggcgg ctttcatggc ttacgaatcc ggtaagtagc aacctggagg
     1021 cgggcgcagg cttttcaggg ctgatgctgg tctgactgca ctgatgcctt tataaagggg
     1081 ctgctggttc gccggtggcc cctttctcct tgctggcatt gtacgggcat gaacaaccag
     1141 actgccagga aggccgtttc cctctgggaa cggggaagaa cgtgcgcctg cggccgttct
     1201 tttcccccgc cttctctgtt tatgacctgt cagaaatcac atcaccttcc tttgattatt
     1261 ctctttttct ctgttctgac tctgattact gctgccctga ttcctgtctg ttttgttgtt
     1321 ctcctggcag tgtcttttcc gggcgtcctg gcggggcggc tggcgtgaac ggagccagcg
     1381 aagctgtctc tggcgggctt ccatccctga cgggaaac
//
LOCUS       HUMGPIIBA    3333 bp ss-mRNA            PRI       14-JUL-1990
DEFINITION  Human platelet glycoprotein IIb (GPIIb) mRNA, complete cds.
ACCESSION   M34480
KEYWORDS    platelet glycoprotein IIb.
SOURCE      Human megakaryocytes, cDNA to mRNA, clone IIb[3,4].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3333)
  AUTHORS   Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and
            Marguerie,G.
  TITLE     GPIIb and GPIIIa amino acid sequences deduced from human
            megakaryocyte cDNAs
  JOURNAL   Mol. Biol. Rep. 14, 27-33 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         33     3152     platelet glycoprotein IIb (GPIIb)
    mRNA          1     3333     GPIIb mRNA
BASE COUNT      626 a    998 c   1040 g    669 t
ORIGIN      
        1 attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag
       61 ccctctggct tctggagtgg gtgctgctgc tcttgggagc ttgtgctgcc cctccagcct
      121 gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt
      181 ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg
      241 ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga
      301 gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg
      361 taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca
      421 gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga
      481 ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc
      541 gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg
      601 attttagctg ggacaagcgt tactgtgaag cgggcttcag ctcggtggtc actcaggccg
      661 gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc
      721 cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc
      781 agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg
      841 tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca
      901 cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc
      961 ggctgcgcgc agagcagatg gcgtcgtatt ttgggcattc agtcgctgtc actgacgtca
     1021 acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggac agccgggcag
     1081 accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg
     1141 cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg
     1201 ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc
     1261 cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc
     1321 tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct
     1381 tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag
     1441 cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc
     1501 agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca
     1561 agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc
     1621 ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc
     1681 ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctggacctg gatctgggcg
     1741 gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc
     1801 gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg
     1861 gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg
     1921 tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga
     1981 cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca
     2041 acgagggcga gggggcctat gaagcagagc tggcggtgca cctgccccag ggcgcccact
     2101 acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg
     2161 agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga
     2221 taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct
     2281 tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg
     2341 acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc
     2401 tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca
     2461 aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc
     2521 tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac
     2581 agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact
     2641 gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca
     2701 gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa
     2761 gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc
     2821 agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc
     2881 tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg
     2941 tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct
     3001 tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc
     3061 tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggcaca
     3121 ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga
     3181 gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt
     3241 ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc
     3301 ccctcctgct gcctaataaa gagactgagc cct
//
LOCUS       HUMGPIIIAA   4487 bp ss-mRNA            PRI       14-JUL-1990
DEFINITION  Human platelet glycoprotein IIIa (GPIIIa) mRNA, complete cds.
ACCESSION   M35999
KEYWORDS    platelet glycoprotein IIIa.
SOURCE      Human megakaryocytes, cDNA to mRNA, clone IIIa[1,2,3].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4487)
  AUTHORS   Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and
            Marguerie,G.
  TITLE     GPIIb and GPIIIa amino acid sequences deduced from human
            megakaryocyte cDNAs
  JOURNAL   Mol. Biol. Rep. 14, 27-33 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         19     2385     plate glycoprotein IIIa (GPIIIa)
    mRNA    <     1     2603     GPIIIa mRNA
BASE COUNT     1046 a   1090 c   1186 g   1165 t
ORIGIN      
        1 ccgcgggagg cggacgagat gcgagcgcgg ccgcggcccc ggccgctctg ggtgactgtg
       61 ctggcgctgg gggcgctggc gggcgttggc gtaggagggc ccaacatctg taccacgcga
      121 ggtgtgagct cctgccagca gtgcctggct gtgagcccca tgtgtgcctg gtgctctgat
      181 gaggccctgc ctctgggctc acctcgctgt gacctgaagg agaatctgct gaaggataac
      241 tgtgccccag aatccatcga gttcccagtg agtgaggccc gagtactaga ggacaggccc
      301 ctcagcgaca agggctctgg agacagctcc caggtcactc aagtcagtcc ccagaggatt
      361 gcactccggc tccggccaga tgattcgaag aatttctcca tccaagtgcg gcaggtggag
      421 gattaccctg tggacatcta ctacttgatg gacctgtctt actccatgaa ggatgatctg
      481 tggagcatcc agaacctggg taccaagctg gccacccaga tgcgaaagct caccagtaac
      541 ctgcggattg gcttcggggc atttgtggac aagcctgtgt caccatacat gtatatctcc
      601 ccaccagagg ccctcgaaaa cccctgctat gatatgaaga ccacctgctt gcccatgttt
      661 ggctacaaac acgtgctgac gctaactgac caggtgaccc gcttcaatga ggaagtgaag
      721 aagcagagtg tgtcacggaa ccgagatgcc ccagagggtg gctttgatgc catcatgcag
      781 gctacagtct gtgatgaaaa gattggctgg aggaatgatg catcccactt gctggtgttt
      841 accactgatg ccaagactca tatagcattg gacggaaggc tggcaggcat tgtccagcct
      901 aatgacgggc agtgtcatgt tggtagtgac aatcattact ctgcctccac taccatggat
      961 tatccctctt tggggctgat gactgagaag ctatcccaga aaaacatcaa tttgatcttt
     1021 gcagtgactg aaaatgtagt caatctctat cagaactata gtgagctcat cccagggacc
     1081 acagttgggg ttctgtccat ggattccagc aatgtcctcc agctcattgt tgatgcttat
     1141 gggaaaatcc gttctaaagt cgagctggaa gtgcgtgacc tccctgaaga gttgtctcta
     1201 tccttcaatg ccacctgcct caacaatgag gtcatccctg gcctcaagtc ttgtatggga
     1261 ctcaagattg gagacacggt gagcttcagc attgaggcca aggtgcgagg ctgtccccag
     1321 gagaaggaga agtcctttac cataaagccc gtgggcttca aggacagcct gatcgtccag
     1381 gtcacctttg attgtgactg tgcctgccag gcccaagctg aacctaatag ccatcgctgc
     1441 aacaatggca atgggacctt tgagtgtggg gtatgccgtt gtgggcctgg ctggctggga
     1501 tcccagtgtg agtgctcaga ggaggactat cgcccttccc agcaggacga gtgcagcccc
     1561 cgggagggtc agcccgtctg cagccagcgg ggcgagtgcc tctgtggtca atgtgtctgc
     1621 cacagcagtg actttggcaa gatcacgggc aagtactgcg agtgtgacga cttctcctgt
     1681 gtccgctaca agggggagat gtgctcaggc catggccagt gcagctgtgg ggactgcctg
     1741 tgtgactccg actggaccgg ctactactgc aactgtacca cgcgtactga cacctgcatg
     1801 tccagcaatg ggctgctgtg cagcggccgc ggcaagtgtg aatgtggcag ctgtgtctgt
     1861 atccagccgg gctcctatgg ggacacctgt gagaagtgcc ccacctgccc agatgcctgc
     1921 acctttaaga aagaatgtgt ggagtgtaag aagtttgacc ggggagccct acatgacgaa
     1981 aatacctgca accgttactg ccgtgacgag attgagtcag tgaaagagct taaggacact
     2041 ggcaaggatg cagtgaattg tacctataag aatgaggatg actgtgtcgt cagattccag
     2101 tactatgaag attctagtgg aaagtccatc ctgtatgtgg tagaagagcc agagtgtccc
     2161 aagggccctg acatcctggt ggtcctgctc tcagtgatgg gggccattct gctcattggc
     2221 cttgccgccc tgctcatctg gaaactcctc atcaccatcc acgaccgaaa agaattcgct
     2281 aaatttgagg aagaacgcgc cagagcaaaa tgggacacag ccaacaaccc actgtataaa
     2341 gaggccacgt ctaccttcac caatatcacg taccggggca cttaatgata agcagtcatc
     2401 ctcagatcat tatcagcctg tgccaggatt gcaggagtcc ctgccatcat gtttacagag
     2461 gacagtattt gtggggaggg atttcggggc tcagagtggg gtaggttggg agaatgtcag
     2521 tatgtggaag tgtgggtctg tgtgtgtgta tgtgggggtc tgtgtgttta tgtgtgtgtg
     2581 ttgtgtgtgg gagtgtgtaa tttaaaattg tgatgtgtcc tgataagctg agctccttag
     2641 cctttgtccc agaatgcctc ctgcagggat tcttcctgct tagcttgagg gtgactatgg
     2701 agctgagcag gtgttcttca ttacctcagt gagaagccag ctttcctcat caggccattg
     2761 tccctgaaga gaagggcagg gctgaggcct ctcattccag aggaagggac accaagcctt
     2821 ggctctaccc tgagttcata aatttatggt tctcaggcct gactctcagc agctatggta
     2881 ggaactgctg gcttggcagc ccgggtcatc tgtacctctg cctcctttcc cctccctcag
     2941 gccgaaggag gagtcaggga gagctgaact attagagctg cctgtgcctt ttgccatccc
     3001 ctcaacccag ctatggttct ctcgcaaggg aagtccttgc aagctaattc tttgacctgt
     3061 tgggagtgag gatgtctggg ccactcaggg gtcattcatg gcctggggga tgtaccagca
     3121 tctcccagtt cataatcaca acccttcaaa gatttgcctt attggcagct ctactctgga
     3181 ggtttgttta gaagaagtgt gtcaccctta ggccagcacc atctctttac ctcctaattc
     3241 cacaccctca ctgctgtaga catttgctat gacctgggga tgtctctcat gaccaaatgc
     3301 ttttcctcaa agggagagag tgctattgta gagccagagg tctggcccta tgcttccggc
     3361 ctcctgtccc tcatccatag cacctccaca tacctggccc tgagccttgg tgtgctgtat
     3421 ccatccatgg ggctgattgt atttaccttc tacctcttgg ctgccttgtg aaggaattat
     3481 tcccatgagt tggctgggaa taagtgccag gatggaatga tgggtcagtt gtatcagcac
     3541 gtgtggcctg ttcttctatg ggttacaacc tcatttaact cagtctttaa tctgagaggc
     3601 cacagtgcaa ttttatttta tttttctcat gatgaggttt tcttaactta aaagaacatg
     3661 tatataaaca tgcttgcatt atatttgtaa atttatgtgt atggcaaaga aggagagcat
     3721 aggaaaccac acagacttgg gcagggtaca gacactccca cttggcatca ttcacagcaa
     3781 gtcactggcc agtggctgga tctgtgaggg gctctctcat gatagaaggc tatggggata
     3841 gatgtgtgga cacattggac ctttcctgag gaagagggac tgttcttttg tcccagaaaa
     3901 gcagtggctc cattggtgtt gacatacatc caacattaaa agccaccccc aaatgcccaa
     3961 gaaaaaaaga aagacttatc aacatttgtt ccatgagcag aaaactggag ctctggcctc
     4021 agtgttacag ctaaataatc tttaattaag gcaagtcact ttcttcttct taaagctgtt
     4081 tctagtttga gaaatgatgg gattttagca gccagtcttg aaggtctctt tcagtatcaa
     4141 cattctaaga tgctgggact tactgtgtca tcaaatgtgc ggttaagatt ctctgggata
     4201 ttgatactgt ttgtgttttt agttgggaga tctgagagac ctggctttgg caagagcaga
     4261 tgtcattcca tatcaccttt ctcaatgaaa gtctcattct atcctctctc caaacccgtt
     4321 ttccaacatt tgttaatagt tacgtctctc ctgatgtagc acttaagctt catttagtta
     4381 ttatttcttt cttcactttg cacacatttg catccacata ttagggaagg aataagtagc
     4441 tgcaaactat ctattcctgt attattgtgt taacattgag ataaacc
//
LOCUS       MUSCDX1A     1597 bp ss-mRNA            ROD       14-JUL-1990
DEFINITION  Mouse Cdx-1 homeobox protein mRNA, complete cds.
ACCESSION   M35125
KEYWORDS    Cdx-1 homeobox protein.
SOURCE      Mouse 8.5 day embryo, cDNA to mRNA, clone BH8.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1597)
  AUTHORS   Duprey,P., Chowdhury,K., Dressler,G.R., Balling,R., Simon,D.,
            Guenet,J.-L. and Gruss,P.
  TITLE     A mouse gene homologous to the Drosophila gene caudal is expressed
            in epithelial cells from the embryonic intestine
  JOURNAL   Genes Dev. 2, 1647-1654 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        158      736     Cdx-1 homeobox protein
    site        389      571     Cdx-1 homeobox
BASE COUNT      326 a    523 c    444 g    304 t
ORIGIN      
        1 gccgcgcccg ctgggccctc cgacctatgc gccccctggc ccggcgcccg cgcccgcagt
       61 accccgactt cgcgggttac acgcacgtgg agccggcgcc cgcgccctcc gacctgggct
      121 gcgcttccct gcgcccaagg acgactgggc agctgctatg gcccgggccc cacggcctca
      181 gcccgcagcc cggccccgct ggccttcggg ccctccggac tttagcccgg tgcccgcgcc
      241 tcccgggcct ggtcccggca tcctagcgca gtccctcggg gctccgggcg caccgtcctc
      301 gccaggagcg ccaaggcgga cgccctacga atggatgcgg cgcagcgtgg cggctgcagg
      361 cggctgtggc agcggtaaga cccgaaccaa ggacaagtca cgtgtggtct acacagacca
      421 ccaacgccta gagctggaaa aggagtttca ctacagccgg tacatcacta tccggcgcaa
      481 gtccgagctg gctgctaacc tggggctcac agagcggcag gtaaagatct ggttccagaa
      541 ccgccgggcc aaggagcgca aagtaaacaa gaagaaacag cagcagcagc agcccctgcc
      601 tcccacacag ctgcccctgc ccctggatgg cactcccaca ccatcagggc cacccctagg
      661 aagtctatgc cctactaatg ctggccttct gggcaccccc tccccagtgc ccgtcaagga
      721 ggagtttcta ccctagcccc ttgcagcctg gggtctaggg atctagggac ttgaatgctg
      781 ggcagctggc cttttcgggg cccaaggacg tgttctgagt gcagccctgc cttggccctt
      841 tggggtcact gtggacaaac tgcctaccta ggacaagtta gcttgccctc ttcctgcctt
      901 ccgttggctg ggctgtgtgg tgagcctgtt ggaccaagga gctcttccag ctcttgtgtc
      961 ctagggcaca cagggtgatg gggaagccca attagaagtc tcaatcacct ggacatttcc
     1021 caaagattca gaggctcaac gagctgtcaa gacaagattg aggcttcagc gtctcctcca
     1081 agggttccag agtgaggtgg gaggctggta tctggaccag actggctctg gagagtaggc
     1141 ggccattgaa acaaggtggt acaccacagt ctagactccc atgtgaaggg aagatgctat
     1201 ctgcccacct gtctcctctt ccagcctcac tttcacctgc cctagcaccc catgctctgg
     1261 ccctccccag cctggagcta caaagccatc aggactgaca tgaggtagag gctgggcaat
     1321 tggccactgc tcttgagaga gaactcgccc ccaccttagt gtccagggag gcctgcggat
     1381 ggagagaagg cactatgagt ctacctacct gcacatgccc accttcccac aggctgccca
     1441 tcacacccca tttagccccc acaccatggg ggactcgcag ctgtacaggg cttatgtagg
     1501 agtgtaatct cttggatgca gcttcagaaa taagttttcc cccttttaaa caatttataa
     1561 aaatcataca aggcattaaa gaatttttaa aaagttt
//
LOCUS       MUSHSP70B    3518 bp ds-DNA             ROD       14-JUL-1990
DEFINITION  Mouse heat shock protein 70.1 (hsp70.1) gene, complete cds.
ACCESSION   M35021
KEYWORDS    heat shock protein 70.1.
SOURCE      Mouse (strain AJ) kidney DNA, clone pM[1.2,2.3].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3518)
  AUTHORS   Hunt,C. and Calderwood,S.
  TITLE     Characterization and sequence of a mouse hsp70 gene and its
            expression in mouse cell lines
  JOURNAL   Gene 87, 199-204 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        806     2734     heat shock protein 70.1 (hsp70.1)
    signal      507      512     inverted CCAAT box
    signal      547      552     TATA box
BASE COUNT      827 a    993 c   1014 g    684 t
ORIGIN      
        1 gatctcttct atttccctat tcaaacctaa aatgaagagg gagggggaga catggacaag
       61 caagcattcc acaggcgccc ctgcccaacg ctgtcactca aaccaggacc caatcacaga
      121 ctttttagcc aagccttatc ccgcctctct tgagaaactt tctgcgtccg ccatcctgta
      181 ggaaggattt gtacacttta aactccctcc ctggtctgag tcccacactc tcaccaccca
      241 gcaccttcag gagctgaccc ttaacagctt cacccacagg gaccccgaag ttgcgtcgcc
      301 tccgcaacag tgtcaatagc agcaccagca cttccccaca ccctccccct caggaatccg
      361 tactctctag cgaaccccag aaacctctgg agagttctgg acaagggcgg aacccacaac
      421 tccgattact caagggaggc ggggaagctc caccagacgc gaaactgctg gaagattcct
      481 ggccccaagg cctcctccgg ctcgctgatt ggcccagcgg agagtgggcg gggccggtga
      541 agactcctta aaggcgcagg gcggcgagca gggcaccaga cgctgacagc tactcagaat
      601 caaatctggt tccatccaga gacaagcgaa gacaagagaa gcagagcgag cggcgcgttc
      661 ccgatcctcg gccaggacca gccttcccca gagcatccac gccgcggagc gcaaccttcc
      721 caggagcatc cctgccgcgg agcgcaactt tccccggagc atccacgccg cggagcgcag
      781 ccttccagaa gcagagcgcg gcgccatggc caagaacacg gcgatcggca tcgacctggg
      841 caccacctac tcgtgcgtgg gcgtgttcca gcacggcaag gtggagatca tcgccaacga
      901 ccagggcaac cgcacgaccc ccagctacgt ggccttcacc gacaccgagc gcctcatcgg
      961 ggacgccgcc aagaaccagg tggcgctgaa cccgcagaac accgtgttcg acgcgaagcg
     1021 gctgatcggc cgcaagttcg gcgatgcggt ggtgcagtcc gacatgaagc actggccctt
     1081 ccaggtggtg aacgacggcg acaagcccaa ggtgcaggtg aactacaagg gcgagagccg
     1141 gtcgttcttc ccggaggaga tctcgtccat ggtgctgacg aagatgaagg agatcgctga
     1201 ggcgtacctg ggccacccgg tgaccaacgc ggtgatcacg gtgcccgcct acttcaacga
     1261 ctctcagcgg caggccacca aggacgcggg cgtgatcgcc ggtctaaacg tgctgcggat
     1321 catcaacgag cccacggcgg ccgccatcgc ctacgggctg gaccggaccg gcaagggcga
     1381 gcgcaacgtg ctcatcttcg acctgggggg cggcacgttc gacgtgtcca tcctgacgat
     1441 cgacgacggc atcttcgagg tgaaggccac ggcgggcgac acgcacctgg gaggggagga
     1501 cttcgacaac cggctggtga gccacttcgt ggaggagttc aagaggaagc acaagaagga
     1561 catcagccag aacaagcgcg cggtgcggcg gctgcgcacg gcgtgtgaga gggccaagag
     1621 gacgctgtcg tccagcaccc aggccagcct ggagatcgac tctctgttcg agggcatcga
     1681 cttctacaca tccatcacgc gggcgcggtt cgaagagctg tgctcggacc tgttccgcgg
     1741 cacgctggag cccgtggaga aggccctgcg cgacgccaag atggacaagg cgcagatcca
     1801 cgacctggtg ctggtgggcg gctcgacgcg catccccaag gtgcagaagc tgctgcagga
     1861 cttcttcaac gggcgcgacc tgaacaagag catcaacccg gacgaggcgg tggcctacgg
     1921 ggcggcggtg caggcggcca tcctgatggg ggacaagtcg gagaacgtgc aggacctgct
     1981 gctgctggac gtggcgccgc tgtcgctggg cctggagact gcgggcggcg tgatgacggc
     2041 gctcatcaag cgcaactcca ccatccccac caagcagacg cagaccttca ccacctactc
     2101 ggacaaccag cccggggtgc tgatccaggt gtacgagggc gagagggcca tgacgcgcga
     2161 caacaacctg ctggggcgct tcgagctgag cggcatcccg ccggcgccca ggggcgtgcc
     2221 gcagatcgag gtgaccttcg acatcgacgc caacggcatc ctgaacgtca cggccaccga
     2281 caagagcacc ggcaaggcca acaagatcac catcaccaac gacaagggcc gcctgagcaa
     2341 ggaggagatc gagcgcatgg tgcaggaggc cgagcgctac aaggccgagg acgaggtgca
     2401 gcgcgacagg gtggccgcca agaacgcgct cgagtcctat gccttcaaca tgaagagcgc
     2461 cgtggaggac gagggtctca agggcaagct cagcgaggct gacaagaaga aggtgctgga
     2521 caagtgccag gaggtcatct cctggctgga ctccaacacg ctggccgaca aggaggagtt
     2581 cgtgcacaag cgggaggagc tggagcgggt gtgcagcccc atcatcagtg ggctgtacca
     2641 gggtgcgggt gctcctgggg ctgggggctt cggggcccag gcgccgccga aaggagcctc
     2701 tggctcagga cccaccatcg aggaggtgga ttagaggcct ctgctggctc tcccggtgtg
     2761 gtctagaaaa cagactcttt gcacttgata gctgcttggg caccgattac tgtcaaggtt
     2821 atttaaagtc ttcttcatgg ttcagtttaa agttacagtc tttcttaagg taattgcgtt
     2881 gactgttaaa ttttgtatgc atatatatat atatatatat atatatatat atattcaaat
     2941 atattcaaag taatgttggg agcagcactg tgcactgtac caggggatta tgttttatag
     3001 ctaatgatgt gtaaagtcta aagatttttt tgtaattttt atatcagtgt tccagtagcc
     3061 tgggaagaca tatagtctag ctgcccagtt ccctggagat ggtcatctct aagacaaagt
     3121 gtcttaaaca aacgtcttgg cactgtgtac tacataactt tactcttttg tacttaaaac
     3181 tttatctgct tgtccatgtt aaggttttgt ggtataacca gtatgttctt tgcatttaat
     3241 ctaagtaggt taaagatggt gtatccttcc tgcatacatg tctacactgc caccctgtgt
     3301 acattttttt ctttgcatca ctacaaacta atgaaaaaaa cttttatgac ttaaatattc
     3361 aaaataaaag gttacaagta tattttgtct gtttgtatgt tggaagggct aatggattct
     3421 gggcttctgt ggatttctta agtttttttt aagatttatt attatatgtg aacacattgt
     3481 agctatcttc agacacacca gaaaagggca tcagatct
//
LOCUS       RDVS10       1319 bp ds-RNA             VRL       14-JUL-1990
DEFINITION  Rice dwarf virus segment S10 mRNA, encoding an ORF, complete cds.
ACCESSION   D00241
KEYWORDS    S10 RNA.
SOURCE      Rice dwarf virus, cDNA to viral RNA.
  ORGANISM  Rice dwarf virus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus.
REFERENCE   1  (bases 1 to 1319)
  AUTHORS   Omura,T., Minobe,Y. and Tsuchizaki,T.
  TITLE     Nucleotide sequence of segment S10 of the rice dwarf virus genome
  JOURNAL   J. Gen. Virol. 69, 227-231 (1988)
  STANDARD  simple automatic
COMMENT     Three contiguous termination codons were detected at the third
            triplet downstream from the first stop codon.
            
            Corresponds to DDBJ (rel3.0) entry <RDVS1O>(D00241) dated
            20-JUN-1988
FEATURES       from  to/span     description
    pept         27     1088     ORF
BASE COUNT      354 a    279 c    284 g    402 t
ORIGIN      5' end of the plus-sense strand of segment S10.
        1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc
       61 atcatgagct cctttgtgct cacgaaggac caagtattat ttccaagttt gatgcaatta
      121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag
      181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact
      241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg
      301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt
      361 ttatctttat ccctgagact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc
      421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt
      481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc
      541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag
      601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat
      661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac
      721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt
      781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc
      841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc
      901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca
      961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta
     1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca
     1081 gttcctgatc gcactgatga tagtgctttt catttatcaa aatattacgg tggggtcatc
     1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac
     1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccataggtg ggttagagtc
     1261 gtgtgcgcac gcacttagtt ctactaatgc gtgcccccac actgcgcagg gattctgat
//
LOCUS       RDVS10A      1320 bp ds-RNA             VRL       14-JUL-1990
DEFINITION  Rice dwarf virus segment 10 mRNA, encoding an ORF, complete cds.
ACCESSION   M35118
KEYWORDS    .
SOURCE      Rice dwarf virus, cDNA to viral RNA, clones pRD[545,112,367].
  ORGANISM  Rice dwarf virus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus.
REFERENCE   1  (bases 1 to 1320)
  AUTHORS   Uyeda,I., Matsumura,T., Sano,T., Ohshima,K. and Shikata,E.
  TITLE     Nucleotide sequence of rice dwarf virus genome segment 10
  JOURNAL   Proc. Jpn. Acad. 63, 227-230 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         27     1088     ORF
BASE COUNT      357 a    280 c    282 g    401 t
ORIGIN      
        1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc
       61 atcatgagct cctttctgct cacgaaggac caagtattat ttccaagttt gatgcaatta
      121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag
      181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact
      241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg
      301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt
      361 ttatctttat ccctgaaact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc
      421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt
      481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc
      541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag
      601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat
      661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac
      721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt
      781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc
      841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc
      901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca
      961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta
     1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca
     1081 gttcctaatc gcactgatga tagtgcttat catttatcaa aatattacgg tggggtcatc
     1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac
     1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccatagggt gggttagagt
     1261 cgtgtgcgca cgccacttag ttctactaat gcgtgccccc acatgcgcag ggattctgat
//
LOCUS       SRVAGSM      3651 bp ss-RNA             VRL       14-JUL-1990
DEFINITION  Sapporo rat virus mRNA M segment, encoding an ORF, complete cds.
ACCESSION   M34882
KEYWORDS    .
SOURCE      Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero
            E6 cell line C1008.
  ORGANISM  Sapporo rat virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Bunyaviridae; Hantavirus.
REFERENCE   1  (bases 1 to 3651)
  AUTHORS   Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and
            Schmaljohn,C.S.
  TITLE     Coding properties of the S and the M genome segments of Sapporo rat
            virus: Comparison to other causative agents of hemorrhagic fever
            with renal syndrome
  JOURNAL   Virology 176, 114-125 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         47     3448     M segment ORF
BASE COUNT     1119 a    692 c    755 g   1085 t
ORIGIN      
        1 tagtagtaga ctccgcaaga aacagcagtt aaagaacaat aggatcatgt ggagtttgct
       61 attactggcc gctttagttg gccaaggctt tgcattaaaa aatgtatttg acatgagaat
      121 tcagttgccc cactcagtca actttgggga aacaagtgtg tcaggctata cagaatttcc
      181 cccactctca ttacaggagg cagaacagct agtgccagag agctcatgca acatggacaa
      241 ccaccagtca ctctcaacaa taaataaatt aaccaaggtc atatggcgga aaaaagcaaa
      301 tcaggaatca gcaaaccaga attcatttga agttgtggaa agtgaagtca gctttaaagg
      361 gttgtgtatg ttaaagcata gaatggttga agaatcatat agaaatagga gatcagtaat
      421 ctgttatgat ctagcctgta atagtacatt ctgtaaacca actgtttata tgattgttcc
      481 tatacatgct tgcaacatga tgaaaagctg tttgattggc cttggcccct acagaatcca
      541 ggttgtctat gaaaggacat actgcactac gggtatattg acagaaggaa aatgctttgt
      601 ccctgacaag gctgttgtca gtgcattgaa aagaggcatg tatgctatag caagcataga
      661 gacaatctgc ttttttattc atcagaaagg gaatacatat aagatagtga ctgccattac
      721 atcagcaatg ggctccaaat gtaataatac agatactaaa gttcaaggat attatatctg
      781 tattattggt ggaaactccg cccctgtata tgcccctgct ggtgaagact tcagagcaat
      841 ggaggttttt tctgggatta ttacatcacc acatggagaa gaccatgacc tacccggcga
      901 agaaatcgca acgtaccaga tttcagggca gatagaggca aaaatccctc atacagtgag
      961 ctccaaaaac ttaaaattga ctgcttttgc aggtattcca tcatactcat caactagtat
     1021 attggctgct tcagaagatg gtcgtttcat atttagtcct ggtttatttc ctaacctaaa
     1081 tcagtcagtc tgtgacaaca atgcactccc tttaatctgg aggggcctaa ttgatttaac
     1141 gggatactat gaggcagtcc acccttgcaa tgtgttctgt gtcttatcag gaccaggtgc
     1201 ttcatgtgag gccttttcag aaggaggtat tttcaatatt acttctccaa tgtgtctggt
     1261 gtctaagcaa aatagattta gagcagctga gcagcagatt agctttgtct gccaaagagt
     1321 tgatatggat attatagtgt actgtaatgg tcagaaaaaa acaatcctaa caaaaacatt
     1381 agttataggc caatgtattt atactattac aagtctcttt tcactgttac caggggttgc
     1441 ccattctatt gctattgagt tgtgtgttcc agggtttcat ggctgggcca cagctgcact
     1501 tttgattaca ttctgcttcg gctgggtatt gattcctgca tgtacattag ctattctttt
     1561 agtccttaag ttctttgcaa atatccttca tacaagcaat caagagaacc gattcaaagc
     1621 cattctacgg aaaataaagg aggagtttga aaaaacaaag ggttccatgg tttgtgagat
     1681 ctgtaagtat gagtgtgaaa cattaaagga attgaaggca cataacctat catgtgttca
     1741 aggagagtgc ccatattgct ttacccactg tgaaccgaca gaaactgcaa ttcaggcaca
     1801 ttacaaagtt tgtcaagcca cccaccgatt cagagaagat ttaaaaaaga ctgtaactcc
     1861 tcaaaatatt gggccaggct gttaccgaac actaaatctt tttaggtata aaagtaggtg
     1921 ttatattctg acaatgtgga ctcttcttct cattattgaa tccatcctct gggcagcaag
     1981 tgcagcagaa atcccccttg tccctctctg gacagataat gctcatggcg ttgggagtgt
     2041 tcctatgcat acggatcttg aattagactt ctctttgcca tccagttcta agtacacata
     2101 caaaagacat ctcacaaacc cagttaatga ccaacagagt gtctcattgc atatagaaat
     2161 tgaaagtcaa ggcattggtg ctgctgttca tcatcttgga cattggtatg atgcaagatt
     2221 gaatctaaaa acctcatttc attgttatgg tgcctgcaca aaatatcaat acccatggca
     2281 cactgcaaaa tgccattttg agaaagatta tgagtatgaa aatagctggg cttgcaaccc
     2341 cccagattgc ccaggggttg gtacaggttg tactgcttgt ggattatatc tagatcaatt
     2401 gaagccggta ggaacagcct ttaaaattat aagtgtaaga tacagtagaa aagtgtgcgt
     2461 gcagtttggt gaagaacacc tttgtaaaac aattgatatg aatgattgct ttgtgactag
     2521 gcatgccaaa atatgtataa ttgggactgt atctaagttt tctcaaggtg acactctact
     2581 atttctgggg cccatggaag gaggtggtat aatctttaaa cactggtgta catctacctg
     2641 tcactttgga gaccctggtg atgtcatggg tccaaaagat aaaccattta tttgccctga
     2701 attcccaggg caatttagga aaaaatgtaa ctttgccaca actccagttt gtgaatatga
     2761 tggaaacatt atatcaggct ataagaaagt acttgcaaca attgattctt tccaatcatt
     2821 taacacaagc aatatacact tcactgatga gagaattgaa tggagagacc ctgatggcat
     2881 gcttcgggat catattaata ttgttatttc taaagatatt gattttgaaa atttggctga
     2941 gaatccttgt aaagtagggc tccaggcagc aaacatagaa ggtgcctggg gttcaggtgt
     3001 cgggtttaca ctcacatgca aggtgtctct cacagaatgc ccaacatttc ttacatcaat
     3061 aaaggcctgt gacatggcaa tttgttatgg tgcagaaagt gtgacactct cacgaggaca
     3121 aaatactgtc aaaattaccg ggaaaggtgg ccatagtggt tcttcattca aatgctgtca
     3181 tgggaaagaa tgttcatcaa ctggcctcca agccagtgca ccacatctgg ataaggtaaa
     3241 tggtatctct gagttagaaa acgagaaagt ttatgatgac ggtgcacctg aatgtggcat
     3301 tacttgttgg tttaaaaaat caggtgaatg ggttatgggt ataatcaatg ggaactgggt
     3361 tgtcctaatt gtcttgtgtg tactgctgct cttttctctt atcctgttga gcatcttgtg
     3421 tcctgttaga aagcataaaa aatcataaat cccacctaac aatcttcaca tcatgtatcg
     3481 attttcaaac actttatcat ttagaactta acttggcact actatctgat aactgacttt
     3541 catttttatt tttatatgga ttaattacta aaaaaaatac tctcttctat ctcccaatct
     3601 tttattgatt caccggggtg ctgtcttgac atctggcggc gtctactact a
//
LOCUS       SRVAGSS      1769 bp ss-RNA             VRL       14-JUL-1990
DEFINITION  Sapporo rat virus mRNA S segment, encoding an ORF, complete cds.
ACCESSION   M34881
KEYWORDS    .
SOURCE      Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero
            E6 cell line C1008.
  ORGANISM  Sapporo rat virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Bunyaviridae; Hantavirus.
REFERENCE   1  (bases 1 to 1769)
  AUTHORS   Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and
            Schmaljohn,C.S.
  TITLE     Coding properties of the S and the M genome segments of Sapporo rat
            virus: Comparison to other causative agents of hemorrhagic fever
            with renal syndrome
  JOURNAL   Virology 176, 114-125 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         43     1332     S segment ORF
BASE COUNT      569 a    331 c    410 g    459 t
ORIGIN      
        1 tagtagtaga ctccctaaag agctactaca ctaacaagaa aaatggcaac tatggaagaa
       61 atccagagag aaatcagtgc tcacgagggg cagcttgtga tagcacgcca gaaggtcaag
      121 gatgcagaaa agcagtatga gaaggatcct gatgacttaa acaagagggc actgcatgat
      181 cgggagagtg tcgcagcttc aatacaatca aaaattgatg aattgaagcg ccaacttgcc
      241 gacagattgc agcagggaag aacatccggg caggaccggg atcctacagg ggtagagcca
      301 ggtgatcatc ttaaggaaag atcagcacta agctacggga atacactgga cctgaatagt
      361 cttgacattg atgaacctac aggacagaca gctgattggc tgaccataat tgtctatctg
      421 acatcattcg tggtcccgat catcttgaag gcactgtaca tgttaacaac acgaggtagg
      481 cagacttcaa aggacaacaa ggggatgagg atcagattca aggatgacag ctcatatgag
      541 gatgtcaatg gaatcagaaa gcccaaacat ctgtatgtgt caatgccaaa cgcccaatcc
      601 agcatgaagg ctgaagagat aacaccagga agattccgca ctgcagtatg tggactatat
      661 cctgcacaga taaaggcaag gaatatggta agccctgtca tgagtgtagt tgggttcttg
      721 gcactggcaa aagactggac atcgagaatt gaagaatggc tcggtgcacc ctgcaaattc
      781 atggcggagt ctcttattgc cgggagttta tctgggaatc ctgtgaatcg tgactatatc
      841 agacagagac aaggtgcact tgcagggatg gagccaaagg aatttcaagc cctcaggcaa
      901 cattcaaagg atgctggatg tacactagtt gaacatattg agtcaccatc atcaatatgg
      961 gtgtttgctg gggcccctga taggtgtcca ccaacatgct tgtttgtcgg agggatggct
     1021 gaattaggtg ccttcttttc tatacttcag gatatgagga acacaatcat ggcttcaaaa
     1081 actgtgggca cagctgatga aaagcttcga aagaaatcat cattctatca atcatacctc
     1141 agacgcacac aatcaatggg aatacaactg gaccagagga taattgttat gtttatggtt
     1201 gcctggggaa aggaggcagt ggacaacttt catctcggtg atgacatgga tccagagctt
     1261 cgtagcctgg ctcagatctt gattgaccag aaagtgaagg aaatctcaaa ccaggaacct
     1321 atgaaattat aagtacataa atatataatc aatactaact ataggttaag aaatactaat
     1381 cattagttaa taagaatata gatttattga ataatcatat taaataatta ggtaagttaa
     1441 ctagtattta gttaagttag ctaattgatt tatatgattg tcacaattaa atgtaatcat
     1501 aagcacaatc actgccatgt ataatcacgg gtatacgggt ggttttcata tggggaacag
     1561 ggtgggctta gggccaggtc accttaagtg accttttttt gtatatatgg atgtagattt
     1621 caattgatcg aatactaatc ctactgtcct cttttctttt cctttctcct tctttactaa
     1681 caacaacaaa ctacctcaca ccttaatata tactacttta ttaagttgtt aagttgtgtc
     1741 tttttgggga gtaagggagt ctactacta
//
LOCUS       TRNSTIIA     1315 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  Transposon Tn4521 (from E.coli) heat-stable toxin II gene,
            complete cds.
ACCESSION   M35123
KEYWORDS    heat-stable toxin II; insertion sequence; transposase.
SOURCE      Transposon Tn4521 (from Escherichia coli) DNA.
  ORGANISM  Transposon Tn4521
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1315)
  AUTHORS   Hu,S.T. and Lee,C.H.
  TITLE     Characterization of the transposon carrying the STII gene of
            enterotoxigenic Escherichia coli
  JOURNAL   Mol. Gen. Genet. 214, 490-495 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        879      400 (c) heat-stable toxin II (STII)
    trns         88      607     insertion sequence IS2 (bases 17-536)
    trns        608      970     insertion sequence IS2 (bases 969-1327)
BASE COUNT      313 a    349 c    320 g    333 t
ORIGIN      
        1 gccgccggca gatggagctg gatattctaa aaaagacccg ggcatcagta tcagtcacct
       61 gaacaacaga gaaaaaacaa agcttgcttt ccagacacct gttatcactt aacccattac
      121 tggcccgctg ccgtagatat tcccgtggcg agcgataacc cagtgcacta tgcggatgcc
      181 attcgttata atgctcgaac gcctctgcaa ggttctttgc tgccgttaac ccgtctggtt
      241 tgggcatgat actgatgtaa tcacgcttta tcgttttcac gaagctctct gctatgccgt
      301 tactctccgg actccgcacc gccgtgttct tcggttcaag tcccaacatc cgggcgaact
      361 ggcgtgtttc attagcccgg tagcatgaac cattatccgt cagccattcc actggcgata
      421 tcggaagctc gctgccgaag gggcgtacca ccgctcccag catgacgtcc tgtactgttt
      481 cactgtcgaa gccgcccgta gtaaccgccc agtgcagtgc ctcacggtca cagcagtcca
      541 gcgcgaacgt gactcgcagt ttttctccgt tatcacagcg gaactcgaac ccgtcagagc
      601 accatcgttt tttttgccga gcaggcgctg gagttcttta atctgcttca tggcagcagc
      661 aagttcagag gcaggaacga cctgttctcc tgcggccaca gcagtaagac ttcccttctg
      721 gtattgcttg cgccagagaa ataactggct ggctgccgca ccgtgttgcc gggtaacaag
      781 ggagcccgtc attcccggtt caaagctctg ctgaacaata gcgatctttt cctgtgtggt
      841 acgccgtctg cgtttctccg gccctaagac atcactcatc tgctctccaa tgactagtct
      901 aaaaactagt attaagacta tcacttaaat aagtgatatt ggttgtctgg agattcaggg
      961 ggccagtcta cggcctatgt cggtctggtg tcaaaacagt caggctccgg agataaagtt
     1021 cgactactgg gaataagcaa gtgtggtgat acttatctcc gaacattatt tattcatgga
     1081 gcaagggcgg cagcattgct ggtcaaagaa ctagggccat ggatacacaa cttaagatac
     1141 aacgactgtc tggtgtttca gggggccagc tcaaacattt ggcaaagata tacgtcttct
     1201 ttacaatccg gaggtgttaa cggctctact tgtatcagct acgttctggt tatacgtaat
     1261 agcataagaa tggtcgtgcg catacacgct ctggaaagca gaactagctg gacga
//
LOCUS       AVOCYP       1500 bp ss-mRNA            PLN       14-JUL-1990
DEFINITION  Avocado cytochrome P-450LXXIA1 (cyp71A1) mRNA, complete cds.
ACCESSION   M32885
KEYWORDS    cytochrome P-450; cytochrome P-450LXXIA1.
SOURCE      Avocado (strain Hass) ripe fruit mesocarp, cDNA to mRNA, clone
            pAVOd8 and pAVOc8, and DNA (bases 1-16).
  ORGANISM  Persea americana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Magnoliidae; Laurales; Lauraceae.
REFERENCE   1  (bases 1 to 1500)
  AUTHORS   Bozak,K.R., Yu,H., Sirevag,R. and Christoffersen,R.E.
  TITLE     Sequence analysis of ripening-related cytochrome P-450 cDNAs from
            avocado fruit
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3904-3908 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.E.Christoffersen, 16-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         21     1436     cytochrome P-450LXXIA1 (cyp71A1)
BASE COUNT      412 a    343 c    354 g    391 t
ORIGIN      
        1 tcactctaga gtaataatcc atggctatct tagtgtctct gctcttctta gccattgctc
       61 tcaccttctt cctcctaaaa ctcaacgaaa aacgagagaa gaaaccgaac ctaccccctt
      121 ctcctcccaa ccttcccatc atcggaaacc tccatcagct cggtaatctc ccacaccgct
      181 ctcttcgctc ccttgcaaac gaactcggac ccttgatact cctccatctg ggtcacatcc
      241 ccactctcat agtctccacc gctgagattg ccgaagagat cttgaaaacc catgatctca
      301 tctttgctag ccgaccatcc acaactgcag ctcgacgaat cttctatgac tgcaccgacg
      361 tggcgttctc tccctacggc gaatattgga ggcaagtaag gaagatctgt gtactcgagc
      421 tccttagcat aaagagagtc aactcctacc gttcgatcag ggaagaagag gtgggcctca
      481 tgatggagag gatctctcaa tcttgctcga caggtgaagc tgttaatcta tcagagctgt
      541 tgctattgct atcaagcggc acaataacaa gggttgcttt tgggaagaag tacgaaggag
      601 aagaagaaag gaagaacaag tttgcggatc ttgcaactga gttgacaact cttatgggag
      661 ctttcttcgt gggagactac tttccttcgt ttgcatgggt tgatgttcta acagggatgg
      721 atgcgaggtt gaagagaaat catggtgaat tggatgcttt tgtggatcac gtaattgatg
      781 accatctcct tagtagaaaa gcgaacggct cagatggagt ggagcagaaa gatttagtgg
      841 atgtgctgct acatctgcag aaggattcct cactcggcgt ccatctcaac agaaataacc
      901 ttaaagccgt catcttggac atgttctctg gtggaactga tacgacggct gtgaccttag
      961 aatgggctat ggcagagctt ataaagcatc ccgatgtgat ggagaaagcc caacaagagg
     1021 taagaagagt tgtggggaaa aaagcaaagg tggaagaaga agatcttcat cagttgcact
     1081 acttaaaact aatcatcaaa gagactctgc gcttgcatcc tgttgctcca ttattagttc
     1141 cacgcgagtc cacaagggat gtagttataa ggggctatca tattcctgca aagacaagag
     1201 tctttattaa tgcatgggca ataggaagag accccaagtc atgggagaat gctgaggaat
     1261 ttctcccaga gagatttgtc aataattctg ttgatttcaa ggggcaagat tttcaactta
     1321 ttccttttgg agcagggagg aggggctgtc cggggattgc atttggcatc agttcagttg
     1381 agatttctct tgccaatctc ttgtactggt tcaactggga attacctggg atctaaccaa
     1441 agaagatctg gacacgctcg aagccgttgg gataaccgtt cacatgaagt ttcctctgca
//
LOCUS       ECODGTP      2760 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  E.coli dGTP triphosphohydrolase (dgt) gene, complete cds,
            and periplasmic protease (htrA) gene, 5'end.
ACCESSION   M31772
KEYWORDS    dGTP triphosphohydrolase; dGTPase;
            deoxyguanosine triphosphate triphosphohydrolase; dgt gene;
            htrA gene; periplasmic protease.
SOURCE      E.coli (strain K12, isolate HR42) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2760)
  AUTHORS   Wurgler,S.M. and Richardson,C.C.
  TITLE     Structure and regulation of the gene for dGTP triphosphohydrolase
            from Escherichia coli
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2740-2744 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.M. Wurgler, 01-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept       1066     2583     dGTP triphosphohydrolase (dgt) (EC 3.1.5.1)
    pept       2713  >  2760     periplasmic protease (hrtA)
    pept        958      323 (c) ORF
    signal      991      996     -35 region
    signal     1014     1019     -10 region
    site       1025     1046     regulatory signal
    binding    1054     1065     ribosome binding site
    signal     2590     2630     pot. transcription termination signal
BASE COUNT      711 a    665 c    706 g    678 t
ORIGIN      3.6 minutes on K12 map.
        1 cagctggtca acctgccgct cggcattacc tccacgccag gcaatcacca gatcgggttt
       61 cagcgcgaca atgcgttcca gattcatccc ctgccaggcg gaaacctgct caatcttttg
      121 cgcttgtgga ggatagtcgg aatagctgct gaccccaacc ggcgtgatcc cggcggcaaa
      181 ggcaagttca gtgttggcgg gagaaagcgt gatgacgcgc ggcgcggcgt tgagccacag
      241 tggcgcaaga aaagacatgg ggcgaccagc gccctgaaca gtgacttagc catgtgcaag
      301 tttctgcacc agtgactaac catcaggctg gactgtttag cggcaacagc aggaactcat
      361 cgaagctaag atgagactgt tgatcggcca cgtcggagat ggcgcgtacg acaacaaacg
      421 ggacgttgaa attgtggcag acatgggcga ttgccgtcgc ttccatctct acagcaatgg
      481 cctgtgggaa gttgtggcgg attttcgcca gaccaacaga accgttgatg aaagcgtcgc
      541 cgctaacaat caggccacgt acagcgttaa gattcagttc ggcaatgcag gcctcagcgg
      601 cagcgatcag tttatcgtca gctttaaagc ctgccggaca gcctggtaac tgaccgtatt
      661 cataaccaaa tgccgtgaca tccgcgtcgt gataacgtgc ttcgtccgag acaacgatat
      721 cgcccacttt caacgttggt gccaggccac cggcagaacc ggtgttaata atcacatctg
      781 gcttgcagtg ttccaacagc aaagtggcac ccagcgccgc agcgacttta ccgatgcccg
      841 atttcagaag cgcaacctcg gttccattca gttggccggt atagatttcg caaccgccga
      901 gactgatagt ttgacggttt tcgattttgt cacgcagcag cgtaacttct tcttccattg
      961 caccaatgat gccgattttc atagatttac tcgcgataag cccgatttga aggcatagtt
     1021 taccatgcgc ttacggggaa gcgtatttct cacgcgggag aggacatggc acagattgat
     1081 ttccgaaaaa aaataaactg gcatcgtcgt taccgttcac cgcagggcgt taaaaccgaa
     1141 catgagatcc tgcggatctt cgagagcgat cgcgggcgta tcatcaactc tccggcaatt
     1201 cgtcgtctgc aacaaaagac ccaggttttt ccactggagc gcaatgccgc cgtgcgcacg
     1261 cgtcttaccc actcgatgga agtccagcag gtggggcgct acatcgccaa agaaatttta
     1321 agccgtctga aagagcttaa attactggaa gcatacggcc tggatgaact gaccggtccc
     1381 tttgaaagca ttgttgagat gtcatgcctg atgcacgata tcggcaatcc gccgtttggt
     1441 cattttggcg aagcggcgat aaatgactgg tttcgccaac gtttgcaccc ggaagatgcc
     1501 gaaagccagc ctctgactga cgatcgctgc agcgtggcgg cactacgttt acgggacggg
     1561 gaagaaccgc ttaacgagct gcggcgcaag attcgtcagg acttatgtca ttttgagggg
     1621 aatgcacaag gcattcgcct ggtgcataca ttgatgcgga tgaatctcac ctgggcacag
     1681 gttggcggta ttttaaaata tacccgtccg gcgtggtggc gtggcgaaac gcctgagaca
     1741 catcactatt taatgaaaaa gccgggttat tatctttctg aagaagccta tattgcccgg
     1801 ttgcgtaaag aacttaattt ggcgctttac agtcgttttc cattaacgtg gattatggaa
     1861 gctgccgacg acatctccta ttgtgtggca gaccttgaag atgcggtaga gaaaagaata
     1921 tttaccgttg agcagcttta tcatcatttg cacgaagcgt ggggccagca tgagaaaggt
     1981 tcgctctttt cgctggtggt tgaaaatgcc tgggaaaaat cacgctcaaa tagtttaagc
     2041 cgcagtacgg aagatcagtt ttttatgtat ttacgggtaa acaccctaaa taaactggta
     2101 ccctacgcgg cacaacgatt tattgataat ctgcctgcga ttttcgccgg aacgtttaat
     2161 catgcattat tggaagatgc cagcgaatgc agcgatcttc ttaagctata taaaaatgtc
     2221 gctgtaaaac atgtgtttag ccatccagat gtcgagcggc ttgaattgca gggctatcgg
     2281 gtcattagcg gattattaga gatttatcgt cctttattaa gcctgtcgtt atcagacttt
     2341 actgaactgg tagaaaaaga acgggtgaaa cgtttcccta ttgaatcgcg cttattccac
     2401 aaactctcga cgcgccatcg gctggcctat gtcgaggctg tcagtaaatt accgtcagat
     2461 tctcctgagt ttccgctatg ggaatattat taccgttgcc gcctgctgca ggattatatc
     2521 agcggtatga ccgacctcta tgcgtgggat gaataccgac gtctgatggc cgtagaacaa
     2581 taaccaggct tttgtaaaga cgaacaataa atttttacct tttgcagaaa ctttagttcg
     2641 gaacttcagg ctataaaacg aatctgaaga acacagcaat tttgcgttat ctgttaatcg
     2701 agactgaaat acatgaaaaa aaccacatta gcactgagtc gactggctct gagtttaggt
//
LOCUS       RATHMGCOA    1994 bp ss-mRNA            ROD       14-JUL-1990
DEFINITION  Rat mitochondrial 3-hydroxy-3-methylglutaryl-CoA synthase mRNA,
            complete cds.
ACCESSION   M33648
KEYWORDS    3-hydroxy-3-methylglutaryl-CoA synthase.
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            lambda-cMS1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1994)
  AUTHORS   Ayte,J., Gil-Gomez,G., Haro,D., Marrero,P.F. and Hegardt,F.G.
  TITLE     Rat mitochondrial and cytosolic 3-hydroxy-3-methylglutaryl-CoA
            synthases are encoded by two different genes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3874-3878 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.G.Hegardt, 06-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         50     1576     3-hydroxy-3-methylglutaryl-CoA synthase
                                 precursor (EC 4.1.3.5)
    sigp         50      160     3-hydroxy-3-methylglutaryl-CoA synthase signal
                                 peptide
    matp        161     1573     3-hydroxy-3-methylglutaryl-CoA synthase
    mRNA          1     1994     3-hydroxy-3-methylglutaryl-CoA synthase mRNA
BASE COUNT      493 a    563 c    505 g    433 t
ORIGIN      
        1 atctctccca ggggctgtgg actgctggct ttctgttgat accttagaga tgcagcggct
       61 tttggctcca gcaaggcggg tcctgcaagt gaagagagtc atgcaggaat cttcgctctc
      121 acccgctcac ctgctccccg cagcccagca gaggttttct acaatccctc ctgctcccct
      181 ggccaaaact gatacatggc caaaagatgt gggcatcctt gccctggagg tctactttcc
      241 agcccaatat gtggaccaaa ctgacctgga gaagttcaac aatgtggaag cagggaagta
      301 cacagtgggc ttgggccaga cccgtatggg cttctgttcg gtccaggagg acatcaactc
      361 cttgtgcctc acagtggtgc agaggctgat ggaacgcaca aagctgccat gggatgccgt
      421 aggccgcctg gaagtgggca cggaaaccat cattgacaag tccaaggctg tcaagacagt
      481 gctcatggag ctcttccagg attcaggcaa cactgacatc gagggcatag ataccaccaa
      541 cgcctgctat ggtggcactg cctccctctt caacgctgcc aactggatgg agtccagcta
      601 ctgggatggt cgctatgccc tggtggtctg tggtgatatc gcagtctacc caagtggtaa
      661 cccccgcccc acaggtggtg ccggggctgt ggcaatgctg attgggccca aggccccgct
      721 agtcctggaa caagggctga ggggaaccca catggagaac gcctatgact tctacaaacc
      781 aaacttggcc tcagagtatc cactggtgga tgggaagctg tctatccagt gctacctgcg
      841 ggccttggac cgatgctatg cagcttaccg caggaaaatc cagaatcagt ggaagcaagc
      901 tggaaacaac cagcctttca ccctcgatga cgtgcaatat atgatcttcc acacaccctt
      961 ttgcaagatg gtccagaaat ccctagctcg gctgatgttc aatgacttcc tgtcatctag
     1021 cagtgacaag cagaacaact tatacaaggg tctagaggcc ttcaagggtc taaagctgga
     1081 agaaacctac accaacaagg atgttgacaa ggctctgctg aaggcctccc tggacatgtt
     1141 caacaagaaa accaaggcct ccctttacct ctccacaaac aatgggaaca tgtacacctc
     1201 gtccctctac gggtgcctgg cctcacttct ctcccaccac tctgcccaag aattggccgg
     1261 ctccaggatt ggagccttct cctacggctc aggcttagca gcaagtttct tctcatttcg
     1321 agtgtccaag gacgcttccc caggttcccc tctggagaag ctggtgtcta gtgtgtcaga
     1381 tctgcccaaa cgtctagact cccggagacg catgtcccct gaggaattca cagaaataat
     1441 gaatcagaga gagcaatttt accacaaggt gaacttctct ccccctggtg acacaagcaa
     1501 cctcttccca ggcacttggt accttgaacg agtggatgag atgcaccgca gaaaatatgc
     1561 ccggcgtccc gtctaaggag accaatccat acaaccattc cccggggaaa gaatgtgagc
     1621 agagccgtta cccaaacggc ttccacttaa aattccaccc acagcagtga acggtgaata
     1681 gacacagcga ccccatagga tctgctccgc ggtgaagggc ctccctctgt ggatcctggg
     1741 tgaccctccc tgaagcagtg agcaccacag gttctgctgt ggaccagagc ccccctgtgg
     1801 agagggagaa agaaagggga gccgctgacc tgcagggata cagaccttcc ccacagcctg
     1861 gcagccgccc gtttgttgca gcttattatc agactgtggg ctatcatagt tcatgctcgt
     1921 ttcttaaagt ttcccgagaa tttctaaaat tttgtatcta aacttttaat atggcgatta
     1981 aaaggagaga agga
//
LOCUS       ECOSLTOXI    1761 bp ds-DNA             BCT       14-JUL-1990
DEFINITION  E.coli Shiga-like toxin II (SLTIIva) gene, complete cds.
ACCESSION   M29153
KEYWORDS    Shiga-like toxin II.
SOURCE      E.coli (serotype O128:B12, isolate H.I.8) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1761)
  AUTHORS   Gannon,V.P.J., Teerling,C., Masri,S.A. and Gyles,C.L.
  TITLE     Molecular cloning and nucleotide sequence of another variant of the
            Escherichia coli Shiga-like toxin II family
  JOURNAL   J. Gen. Microbiol. 136, 1125-1135 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by V.P.J.Gannon, 20-SEP-1989, for release after publication.
FEATURES       from  to/span     description
    pept        254     1213     Shiga-like toxin II (SLTIIva)-A subunit
                                 precursor
    sigp        254      319     Shiga-like toxin II-A subunit signal peptide
    matp        320     1210     Shiga-like toxin II-A subunit
    pept       1226     1489     Shiga-like toxin II-B subunit precursor
    sigp       1226     1282     Shiga-like toxin II-B subunit signal peptide
    matp       1283     1486     Shiga-like toxin II-B subunit
    signal       61       66     -35 region
    signal       82       87     -10 region
    binding     241      247     A subunit binding site
    binding    1215     1221     B subunit binding site
    signal     1659     1265     rho-independent transcription termination
                                 signal
BASE COUNT      487 a    316 c    396 g    562 t
ORIGIN      
        1 atatgtgaga gattctctat tctataaaaa tataaaccat atacccgtaa tcttaaataa
       61 ttgatactat tttccggaag ctatagtgca tcattatctg ctttctgttc ctgataaaca
      121 gggaagtaat acattgtgca gtcatttcct gtgaggggat ttgtttgtaa taaatttctg
      181 ctaaaaacat tccggacttc ttgcgaggta ttattcctgg cgctgccggt tcagactgat
      241 ggagtgttaa tatatgcgac atatattatt aaagctggtg ttgttttttt gtgtttgctt
      301 gtcttcagca tcttatgcag atgagtttac tgtggatttc tcttcgcaaa agagctatgt
      361 tgattcattg aatagtataa ggtcggcaat atccactcca cttggaaata tatctcaggg
      421 tggtgtttct gtttcagtaa ttaatcatgt tctaggcgga aactatatat cattgaatgt
      481 tagaggcctt gatccatata gcgagagatt taaccacctc cgtttaataa tggaacggaa
      541 taacttatat gttgcaggct ttattaatac tgaaacgaat atcttttaca gattctccga
      601 tttctcacat atttcagtgc ctgatgtgat aactgtttcc atgacgacgg acagcagtta
      661 ttcatcatta cagcgaatcg cagatctgga acgtacaggg atgcagattg ggcgtcattc
      721 actggttggt tcatatctgg atttaatgga gttcagagga cgttccatga cccgcgcatc
      781 atccagagct atgctgcgtt ttgtcacagt gatagcagaa gctctgcgat tcagacaaat
      841 acagcgggga ttccgaccgg cgctgtctga ggcatctccg ctttatacaa tgacggctca
      901 ggatgttgac cttaccctga actggggaag aataagtaat gttcttccag agtacagagg
      961 agaggaaggg gtaagaatcg gtaggatatc ttttaatagt ctttctgcga ttctcggaag
     1021 tgttgcggtc atccttaatt gccactcaac cggaagttat tcagttcgtt ccgtgagcca
     1081 aaaacagaaa acagaatgcc agattgttgg agacagggcg gccattaaag taaataatgt
     1141 tttgtgggaa gcgaatacaa tcgctgcttt attaaatcgc aagcctcagg atcttactga
     1201 accaaaccaa taacaggggg tgaatatgaa gaagatgatt attgcagttt tattcggtct
     1261 cttttctgct aattccatgg cggcggattg tgctgtagga aaaattgagt tttccaagta
     1321 taatgaggat gataccttta ctgtgaaggt gtcaggaaga gaatactgga cgaacagatg
     1381 gaatttgcag ccattgttac aaagtgctca gctgacaggg atgactgtaa caatcatatc
     1441 taatacctgc agttcaggct caggctttgc ccaggtgaag tttaactgag aatctacggt
     1501 ttatttatgc gcgtcttttg tttctggacg cagatattat tagtgttgtg gatgctgatt
     1561 aaatttggtc agtgttttcg ttaaagtcat ataaatacag gggcgtcacg cccctttttg
     1621 gtctgtagtt gggtgaagat catcacttta ctttgctcaa ataactcagc acttgccggt
     1681 ttacccttaa ggcaatacct acgttccata aaagaagcat caattaggat gctcgtatac
     1741 ttgttatgtg ctttaaatgt t
//
LOCUS       RATCHOL7H    3545 bp ss-mRNA            ROD       14-JUL-1990
DEFINITION  Rat cholesterol 7-alpha-hydroxylase mRNA, complete cds.
ACCESSION   J05460
KEYWORDS    cholesterol 7-alpha-hydroxylase; cholesterol 7-alpha-monooxygenase.
SOURCE      Rat (strain Wistar King) male 8 week old liver, cDNA to mRNA, clone
            p7-alpha-11.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3545)
  AUTHORS   Noshiro,N., Nishimoto,M. and Okuda,K.
  TITLE     Rat liver cholesterol 7-alpha-hydroxylase: Pretranslational
            regulation for circadian rhythm
  JOURNAL   J. Biol. Chem. 265, 10036-10041 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and  printed sequence for [1] kindly submitted by
            M.Noshiro, 30-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         36     1547     cholesterol 7-alpha-hydroxylase (EC
                                 1.14.13.17)
BASE COUNT     1035 a    710 c    697 g   1103 t
ORIGIN      
        1 ggtctcccct ttggaaattt tcctgctttt gcaaaatgat gactatttct ttgatttggg
       61 gaattgccgt gttggtgagc tgttgcatat ggtttattgt tggaataagg agaaggaaag
      121 ctggtgaacc tcctttggag aacgggttga ttccgtacct gggctgtgct ctgaaatttg
      181 gatctaatcc tcttgagttc ctaagagcta atcaaaggaa gcatggtcac gtttttacct
      241 gcaaactgat ggggaaatat gtccatttca tcacaaactc cctgtcatac cacaaagtct
      301 tatgtcatgg aaaatatttt gactggaaaa aatttcatta cactacttct gcgaaggcat
      361 ttggacacag aagcattgac ccaaatgatg gaaataccac ggaaaatata aacaacactt
      421 ttaccaaaac cctccaggga gatgctctgt gttcactttc tgaagccatg atgcaaaacc
      481 tccaatctgt catgagacct cctggccttc ctaaatcaaa gagcaatgcc tgggtcacgg
      541 aagggatgta tgccttctgt taccgagtga tgtttgaagc tggctatcta acactgtttg
      601 gcagagatat ttcaaagaca gacacacaaa aagcacttat tctaaacaac cttgacaact
      661 tcaaacaatt tgaccaagtc tttccggcac tggtggcagg ccttcctatt cacttgttca
      721 agaccgcaca taaagctcgg gaaaagctgg ctgagggatt gaagcacaag aacctgtgtg
      781 tgagggacca ggtctctgaa ctgatccgtc tacgtatgtt tctcaatgac acgctctcca
      841 cctttgacga catggagaag gccaagacgc acctcgctat tctctgggca tctcaagcaa
      901 acaccattcc tgcaaccttt tggagcttat ttcaaatgat caggagtcct gaagcaatga
      961 aagcagcctc tgaagaagtg agtggagctt tacagagtgc tggccaagag ctcagctctg
     1021 gagggagtgc catttacttg gatcaagtgc aactgaatga cctgccggta ctagacagca
     1081 tcatcaagga ggctctgagg ctttccagtg catccttgaa tatccgcaca gctaaggagg
     1141 acttcactct ccatcttgag gacggttcct ataacatccg aaaagatgac atgatagctc
     1201 tttatccaca gttaatgcac ttggatcctg aaatctaccc agaccctttg actttcaaat
     1261 atgaccggta ccttgatgaa agcgggaaag caaagaccac cttctacagt aatggaaaca
     1321 agctgaagtg tttctacatg cccttcggat caggcgcgac aatatgtcct ggaagactct
     1381 ttgccgtcca agaaatcaag cagtttttga tcctgatgct ctcctgcttt gaactggagt
     1441 ttgtggagag ccaagtcaag tgtccccctc tagaccagtc ccgggcaggc ttgggaattt
     1501 tgccaccact acatgatatt gagtttaaat ataaactgaa acactgatac gtggttggaa
     1561 gaagcgaaca ctggatgatg tcacttggcg gctgagagtc atcactaaac aggccttcgg
     1621 gaccaatgct cactgatgcg ccctagcgac tggattagtg ggaagaactt tgttctcgct
     1681 gcccacattc ctgggtgttc acatagctgg ggccagagct tcatcacttt cagaaagcaa
     1741 tgtcttttgt atttattttc aaaatgaaga tattccaatt ggcaggatat ttttcctaag
     1801 gaaattgctt tatattttta tgaaaactac caattaatta tgaaagggct tgaaattcac
     1861 gttttagtga aattactgat ttttcactag taaggttctt caggtgtgaa actgtattat
     1921 aaaaatgttg taatgggtca cactgtgctt tgcataaagg taaaggaaac tatgtttcag
     1981 ccttttctgt gtctatgagc ttcgaaaata atcttactgt tctagaaaca ctggggaggt
     2041 ttcgacatgc tctcgctata ttttatttta ctgttgctag aaattttcat tccagttttc
     2101 aactacctta tctttccccc attttgacat gcatgccaat gagaagagta ttttttagga
     2161 attaacaagg cacctcccag aaccctaccc tgagactttt aagcctttaa tcccagcact
     2221 cgagaagtag agccaggcag atctctgagt ctgaggttat tctggtctac atcagctcca
     2281 gacaagccag gactacagaa tgggatcttg tctaaaaaat acagctaatc tttatgtcat
     2341 aactgattat gaatcaacct aaaagataaa ttttcaatca ggactcagag aaaatgagca
     2401 attaaaaaac ttagctctga ggtatgtgga attcattaag tacaagttga cattacatgt
     2461 tctttaaaaa tagtttatgt tttatctcta aatgccctgc agatgaagaa taataatgaa
     2521 aagttgaata atactgttta aacactaagt gcaataatgc tttggtaatg tactttaaga
     2581 gaatcattag ccgtgccagt tttactaaaa tatatttata tgtaaattat atttatcttt
     2641 ttcttatacc ataaatataa aaatattgca acatttagta attttaaaat tatatacctt
     2701 tcagaaaatg atgtatgaat gtttgtatgt tttttaactt tgaacagaac atttaaatta
     2761 ttcatctacg gtgattttta tcttatttat ttctttttgt ctcattcata tcttgaagaa
     2821 atccaaaaat atctgaagga atcgctcact caaatgtctc cctatggtta cagaaaaatt
     2881 caataccatg tttttgtcct cggggactga agcagggtgt cgtgggtgcc gagcagaggc
     2941 tcctgctgca gcgagcttta tccacgggac tccttaaact tttaaaatct tatcactatt
     3001 atcatgcatt tattacctaa gtaggatatt tccctttcct ttttcatttc agccgagtcc
     3061 cttagcaacc caggctgact gggaccctcc atgtagctta agctgtgaac tcactgtact
     3121 tcctgttttc acttatttta ggaagtaatt ttccctatca gaaattttaa ttgtttagat
     3181 gatgtataag agtaacacaa ttctgttata tactaatctg tagtaaacta aatttgttct
     3241 tagaacaagt ttgatgactc tcaaattgaa tgtatccata catctttcca tggcttcttg
     3301 aatgcccatt tctcatacac agaatgatgg gtttcacggt gatgtcttcc tttcatgtct
     3361 ttattcttgt gcggtgatgg ttggcaaatg atacccatgg agcaaggtta ctcttcctat
     3421 ttctgtgcag cctaagtgtt aagaataatt tttaaatact tggagggaag gcacattttg
     3481 tgtcatatgt gaagtgacat gtgacacaca gactagcaaa tccttgagta aaattttatt
     3541 gggat
//
LOCUS       HUMCTLA1A    4751 bp ds-DNA             PRI       14-JUL-1990
DEFINITION  Human granzyme B (CTLA-1) gene, complete cds.
ACCESSION   M28879
KEYWORDS    cytotoxic T-lymphocyte-associated serine esterase 1; granzyme B.
SOURCE      Human (patient B) blood DNA, clone lambda-B30.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4751)
  AUTHORS   Haddad,P., Clement,M.-V., Bernard,O., Larsen,C.-J., Degos,L.,
            Sasportes,M. and Mathieu-Mahul,D.
  TITLE     Structural organization of the hCTLA-1 gene encoding human granzyme
            B
  JOURNAL   Gene 87, 265-271 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Haddad, 11-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept       1161     1215     granzyme B, exon 1 /hgml_locus_uid="LQ0170R"
                                 /nomgen="CTLA1" /map="14q11-q13"
               2256     2403     granzyme B, exon 2
               2857     2992     granzyme B, exon 3
               3200     3460     granzyme B, exon 4
               4105     4248     granzyme B, exon 5
    IVS        1216     2255     CTLA-1 intron A
    IVS        2404     2856     CTLA-1 intron B
    IVS        2993     3199     CTLA-1 intron C
    IVS        3461     4104     CTLA-1 intron D
    signal     4338     4343     poly-A signal
BASE COUNT     1211 a   1292 c   1119 g   1129 t
ORIGIN      
        1 gaattctata ttttgagata taccattcct catagaaaaa tttcctcaca gaaaatataa
       61 aggtggaaac aaatcacaag aatcgaacca tgtagagaga cttagttgtc ttttaacaga
      121 attgggcacg ggctgttcag aaacaacaat ctttcacatc cattataatg atagcattag
      181 tgtagtttgt ttagcaaatg tttactgcga gcctgttatg tgctgagcct gctatgtaag
      241 aagtgtggct ctctggacag gagacagaat actaaacaac acaactactg atctttggct
      301 gcctggcatg cttcctcact tcatatggta tcagcaattt agcaccacaa acgtccttta
      361 gagaccagcc ctttctcatt cttggttcta gtggcttgag tagactgacc ccactaccca
      421 agtggatttg actcctagca attcattaat ctagcccata aatgtcaagt acaggacttt
      481 attgaagcat tcagaaagag gaatagggga tgttagaatc tctagaaagg aagctatgat
      541 aataaatggg ttgctagatg ggtctagtag atggtggcca tgctttgtta ctgccttgtg
      601 tattgtgcta ccatagccct ccccaaactg tactctggct cctggcattt ccgtctcttc
      661 aaccagatgg tcagctctct aagtgaagga gacacatctc caacatgctt ggttctagca
      721 caacagaagg gctcaaacac atacctgcta aagaaactat cctgatggat ttagcagcat
      781 ggccatgagg cattggcggt tctatcactg ggaactcagg tttctggtgc tccagtacct
      841 ctactggctg ataccacatc ctacagttca cttcataggc ttgggttcct gctctgggct
      901 gaataggtgg tccactctga gtcatcagct gtggtgatga tgtggtcact gcatgattct
      961 cacacaagca cccagaggac gtcatcaggc agaggcagtg ggggtgggca gcatttacag
     1021 aaaatctgtg atgagacacc acaaaaccag aggggaacat gaagtcactg agcctgctcc
     1081 acctctttcc tctcccaaga gctaaaagag agcaaggagg aaacaacagc agctccaacc
     1141 agggcagcct tcctgagaag atgcaaccaa tcctgcttct gctggccttc ctcctgctgc
     1201 ccagggcaga tgcaggtgag tgaccgtctt ccaacctcgg ggcccaaccc atcccacagg
     1261 tctcctgccc tttctccaca ttcctgatcc atctatctac caggaatgtt ctgaactcca
     1321 gctcccattc taccaagacc ccccaagtgt gatgctggat aagctatcag caggaatggc
     1381 agagcagcag gccattctca agaagagcca gtgggtacta tcccttcccc agagcccacc
     1441 tttgtcacct ggagagtagg actttcctag aagtaaatgg cagaggatgg gaaactagaa
     1501 aagagaaata ttaaattatt ctagagtagg cctggcttct gtttctggga taagacaggt
     1561 gcttctctca ctgtacttag gagagaaacc cagagctcag ctgacagcag aattggtaca
     1621 atcactgtcc tcagaacact gttaatgtgt ttgctcagtc ccattctcca actctgcttt
     1681 tcttccctgg cctttggtgg ctcccctctt tccaaggatg aggcactacg gcaggcccca
     1741 gcttccctgc tttctagaat tccaccagca ctgctctacc agccctcatc cagaggctaa
     1801 ctggagccag tccatcatgc agccatgaac atttactggg cacccactac atgtcaggct
     1861 ctaggaaaca ggatatgaca gtatctagat ccctccactt acaccctggc cattagaaag
     1921 cagcactatc ctagacacca caggactcat aagggtcttg gaaactcacc tgaaacaaag
     1981 caaagtcagg agaggaatga tcaggagcct ctgggatttc actgtcccta agacaggtat
     2041 gctcgccttc aactacatat ggaagaaaga tttacagacc aaagtctgct gttcttccct
     2101 ttttcagagc aggaaattga agccccttcc tccaggccac tcccaactcc aggctatccc
     2161 aggctcccaa atgcccagga gttctggagc cactaagcag gtgcccaccc agcagattcc
     2221 atgggtgccc acaagcagac agacttttcc ttcaggggag atcatcgggg gacatgaggc
     2281 caagccccac tcccgcccct acatggctta tcttatgatc tgggatcaga agtctctgaa
     2341 gaggtgcggt ggcttcctga tacaagacga cttcgtgctg acagctgctc actgttgggg
     2401 aaggtgagga gcagaaaaca gcccacaccc tcctggaaac actccacaga gacccctgcc
     2461 ttcttcccaa ggagctccct gggctcctgt gaacacacat gccaggaggt ctccttagag
     2521 ggtgagaaaa gggcagttaa gtttgtggag agaggggaag gttggttcca gaggtgctgc
     2581 tgaagtaaga aacagcagag tgaccaagcc tgccatattt agaactgggg gcatactttg
     2641 gcatagaata caaactgaag caattccacc tgtgtttcta gggggaaccg aaccctgaga
     2701 aacctggtgc aattaccaga attccaattc ctggggaccg actgtcctta atttcccctc
     2761 agctgcagcc ctgccccagc tgtcacctgc tcttcactgt ctctgggctg tatacactgt
     2821 gactccaccc ccatcctcac tctgctctct gtgcagctcc ataaatgtca ccttgggggc
     2881 ccacaatatc aaggaacagg agccgaccca gcagtttatc cctgtgaaaa gagccatccc
     2941 ccatccagcc tataatccta agaacttctc caatgacatc atgctactgc aggtgaggca
     3001 cactcctgcc actcttgctc ttcttggtcc agttggttcc actccccctg gaatgccggc
     3061 ccttccctcc tttccatcct ggcctcttgg ttagttccta tgcctcagag gagagaggga
     3121 agattgtgca gccccatcac tgtgtcgggg cccagaagtt cgttggctga cctggacttt
     3181 cttgcctctt ccccaccagc tggagagaaa ggccaagcgg accagagctg tgcagcccct
     3241 caggctacct agcaacaagg cccaggtgaa gccagggcag acatgcagtg tggccggctg
     3301 ggggcagacg gcccccctgg gaaaacactc acacacacta caagaggtga agatgacagt
     3361 gcaggaagat cgaaagtgcg aatctgactt acgccattat tacgacagta ccattgagtt
     3421 gtgcgtgggg gacccagaga ttaaaaagac ttcctttaag gtaagactat gcacctgcct
     3481 ggattggctc ttgggagaaa gatgtttggg gaatatctga gacctggaga ctcaagtagt
     3541 gggggactcc ttcacccact agactgtgat atttctctct ggaaagagaa aaggggacta
     3601 gactgagctg gggagaaatt agggcctctg caaacttacc aagaggcctt atggtggatg
     3661 gtgccttctt tggaaggatg aatttgcaac actccaccca ctccaggtca cagatattag
     3721 gaaactgtgc ccatgggggt gcagctaatt ataaccaggt gtgtcttcag aggctggtac
     3781 ccaacgtggt taatgggctg gtcctccatg gtggacatca gccctccttg cccacttctg
     3841 ggtccttaaa cagccaacgg tcccacatac ctccgatctc aggatctggg ggacatgacg
     3901 gaggctggcc cctgggatga ggtgaagcag taacaatgtc cagggccaga gcttggcagc
     3961 tggggccacc agcggcctgc cctgccctct ggtctcccac atgtaggctg tgcaagttgg
     4021 ccttttctaa aagggggctt gagatggaag agagggcagg acccggagga gcatcagctc
     4081 agtccttcca ctctctattc acagggggac tctggaggcc ctcttgtgtg taacaaggtg
     4141 gcccagggca ttgtctccta tggacgaaac aatggcatgc ctccacgagc ctgcaccaaa
     4201 gtctcaagct ttgtacactg gataaagaaa accatgaaac gctactaact acaggaagca
     4261 aactaagccc ccgctgtaat gaaacacctt ctctggagcc aagtccagat ttacactggg
     4321 agaggtgcca gcaactgaat aaatacctct tagctgagtg gaaagctggt ttcttgttta
     4381 ttcattgacc ctcattctca ggcaccacat ctgcgctatg caggccaatg acacaatttt
     4441 gctgttttct gctttctcct ctcccctcac cccttgccac ctccccaaac ccccacatga
     4501 agctgatact cagctccttc ctatccacac cagtttctcc agggcctgcc ttctgccaag
     4561 gctgaagctg agcaccatca ggagacaaca tggaccactt tggtcctggg gctttgggta
     4621 aacttcttac ctccttctcc agtgttacat tgacagagaa aaaagggata ataccatggg
     4681 acctaactcc tcatcccact ggggctcctc attctcccct gggcttagtt tctctaccct
     4741 cctctgagct c
//

GenBank-Updates@genbank.bio.net (07/17/90)

LOCUS       BPEP69OMP    3000 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  B.pertussis omp69A gene for P.69 outer membrane protein, complete
            cds.
ACCESSION   J04560
KEYWORDS    membrane protein; outer membrane protein.
SOURCE      B.pertussis (CN2992) DNA.
  ORGANISM  Bordetella pertussis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Alcaligenaceae.
REFERENCE   1  (bases 1 to 3000)
  AUTHORS   Charles,I.G., Dougan,G., Pickard,D., Chatfield,S., Smith,M.,
            Novotny,P., Morrissey,P. and Fairweather,N.F.
  TITLE     Molecular cloning and characterization of protective outer membrane
            protein P.69 from Bordetella pertussis
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 3554-3558 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by N.F.Fairweather 26-JUL-1989.
FEATURES       from  to/span     description
    pept        145     2877     P.69 outer membrane protein precursor
    sigp        145      246     P.69 outer membrane protein signal peptide
    matp        247     2277     P.69 outer membrane protein
BASE COUNT      453 a   1019 c   1096 g    432 t
ORIGIN      2 bp upstream of ClaI site.
        1 atcgatgata cgtcgctgta acacgacaaa tagcgtgcgt tgcagcggtt ctggatggcg
       61 ttattcgtac ttttgctgcg cccattcttc cctgttccat cgcggtgcgg gcatggcggg
      121 cgtctgctct ccacctggca tccaatgaac atgtctctgt cacgcattgt caaggcggcg
      181 cccctgcgcc gcaccacgct ggccatggcg ctgggcgcgc tgggcgccgc cccggcggcg
      241 catgccgact ggaacaacca gtccatcgtc aagaccggtg agcgccagca tggcatccat
      301 atccagggct ccgacccggg cggcgtacgg accgccagcg gaaccaccat caaggtaagc
      361 ggccgtcagg cccagggcat cctgctagaa aatcccgcgg ccgagctgca gttccggaac
      421 ggcagtgtca cgtcgtcggg acagttgtcc gacgatggca tccggcgctt tctgggcacc
      481 gtcaccgtca aggccggcaa gctggtcgcc gatcacgcca cgctggccaa cgttggcgac
      541 acctgggacg acgacggcat cgcgctctat gtggccggcg aacaggccca ggccagcatc
      601 gccgacagca ccctgcaggg cgctggcggc gtgcagatcg agcgcggcgc caatgtcacg
      661 gtccaacgca gcgccatcgt cgacgggggc ttgcatatcg gcgccctgca gtcattgcag
      721 ccggaagacc ttccgcccag ccgggtggtg ctgcgcgaca ccaacgtgac cgccgtgccc
      781 gccagcggcg cgcccgcggc ggtgtctgtg ttgggggcca gtgagcttac gctcgacggc
      841 gggcacatca ccggcgggcg ggcagcgggg gtggcggcca tgcaaggggc ggtcgtgcat
      901 ctgcagcgcg cgacgatacg gcgcggggac gcgcttgccg gcggtgcggt tcccggcggt
      961 gcggttcccg gtggtgcggt tcccggcggc ttcggtcccg gcggcttcgg tcccgtcctc
     1021 gacggctggt atggcgtgga cgtatcgggc tccagcgtgg agctcgccca gtcgatcgtc
     1081 gaggcgccgg agctgggcgc cgcaatccgg gtgggccgcg gcgccagggt gacggtgccg
     1141 ggcggcagct tgtccgcacc gcacggcaat gtcatcgaga ccggcggcgc gcgtcgcttt
     1201 gcgcctcaag ccgcgcccct gtcgatcacc ttgcaggccg gcgcgcatgc ccaggggaaa
     1261 gcgctgctgt accgggtcct gccggagccc gtgaagctga cgctgaccgg gggcgccgat
     1321 gcgcagggcg acatcgtcgc gacggagctg ccctccattc ccggcacgtc gatcgggccg
     1381 ctcgacgtgg cgctggccag ccaggcccga tggacgggcg ctacccgcgc ggtcgactcg
     1441 ctgtccatcg acaacgccac ctgggtcatg acggacaact cgaacgtcgg tgcgctacgg
     1501 ctggccagcg acggcagcgt cgatttccag cagccggccg aagctgggcg gttcaaggtc
     1561 ctgacggtca atacgctggc gggttcgggg ctgttccgca tgaatgtctt cgcggacctg
     1621 gggctgagcg acaagctggt cgtcatgcag gacgccagcg gccagcacag gctgtgggtc
     1681 cgcaacagcg gcagcgagcc ggccagcgcc aacaccctgc tgctggtgca gacgccacta
     1741 ggcagcgcgg cgacctttac ccttgccaac aaggacggca aggtcgatat cggtacctat
     1801 cgctatcgat tggccgccaa cggcaatggg cagtggagcc tggtgggcgc gaaggcgccg
     1861 ccggcgccca agcccgcgcc gcagccgggt ccccagccgc cgcagccgcc gcagccgcag
     1921 ccggaagcgc cggcgccgca accgccggcg ggcagggagt tgtccgccgc cgccaacgcg
     1981 gcggtcaaca cgggtggggt gggcctggcc agcacgctct ggtacgccga aagcaatgcg
     2041 ttgtccaagc gcctgggcga gttgcgcctg aatccggacg ccggcggcgc ctggggccgc
     2101 ggcttcgcgc aacgccagca gctggacaac cgcgccgggc ggcgcttcga ccagaaggtg
     2161 gccggcttcg agctgggcgc cgaccacgcg gtggcggtgg ccggcggacg ctggcacctg
     2221 ggcgggctgg ccggctatac gcgcggcgac cgcggcttca ccggcgacgg cggcggccac
     2281 accgacagcg tgcatgtcgg gggctatgcc acatatatcg ccgacagcgg tttctacctg
     2341 gacgcgacgc tgcgcgccag ccgcctggag aatgacttca aggtggcggg cagcgacggg
     2401 tacgcggtca agggcaagta ccgcacccat ggggtgggcg cctcgctcga ggcgggccgg
     2461 cgctttaccc atgccgacgg ctggttcctc gagccgcagg ccgagctggc ggtattccgg
     2521 gccggcggcg gtgcgtaccg cgcggccaac ggcctgcggg tgcgcgacga aggcggcagc
     2581 tcggtgctgg gtcgcctggg cctggaggtc ggcaagcgca tcgaactggc aggcggcagg
     2641 caggtgcagc catacatcaa ggccagcgtg ctgcaggagt tcgacggcgc gggtacggta
     2701 cacaccaacg gcatcgcgca ccgcaccgaa ctgcgcggca cgcgcgccga actgggcctg
     2761 ggcatggccg ccgcgctggg ccgcggccac agcctgtatg cctcgtacga gtactccaag
     2821 ggcccgaagc tggccatgcc gtggaccttc cacgcgggct accggtacag ctggtaaagc
     2881 gaggagggtc tatcccccgc ggaggagttt ttcctggagc ttggccggtg ccagtctcca
     2941 ggctcaggcg gccagggcct gcgggccggg caggccgcgc tggtgctggc cgaaccattg
//
LOCUS       ATHTRPB      4225 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  A.thaliana tryptophan synthase beta subunit (trpB) gene,
            complete cds.
ACCESSION   M23872
KEYWORDS    tryptophan synthase.
SOURCE      A.thaliana (strain Landsberg erecta) DNA, clone lambda-M1-g, and
            cDNA to mRNA (strain Columbia), clone lambda-CD-2.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 882)
  AUTHORS   Berlyn,M.B.
  JOURNAL   Unpublished (1989) Yale Univ, 165 Prospect, New Haven CT, 06511.
  STANDARD  full staff_review
REFERENCE   2  (bases 883 to 4225)
  AUTHORS   Berlyn,M.B., Last,R.L. and Fink,G.R.
  TITLE     A gene encoding the tryptophan synthase beta subunit of Arabidopsis
            thaliana
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 4604-4608 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by M.B.Berlyn, 18-APR-1989.
FEATURES       from  to/span     description
    pept       1517     1858     tryptophan synthase beta subunit, exon 1 (EC
                                 4.2.1.20)
               2109     2465     tryptophan synthase beta subunit, exon 2
               2549     2991     tryptophan synthase beta subunit, exon 3
               3084     3178     tryptophan synthase beta subunit, exon 4
               3275     3450     tryptophan synthase beta subunit, exon 5
    IVS        1859     2108     tryptophan synthase intron A
    IVS        2466     2548     tryptophan synthase intron B
    IVS        2992     3083     tryptophan synthase intron C
    IVS        3179     3274     tryptophan synthase intron D
    site       3643     3645     poly-A addition site
BASE COUNT     1252 a    733 c    817 g   1423 t
ORIGIN      
        1 gaattctttc atatctcctg caaagttctt gatatcaata ctccagcagt aactaagact
       61 tagactcttg agcgtaggag agtttgataa caaagactcg gcctctgtga gcttgatcca
      121 accaatagag agctttctag gcaatcccga gtttttgaac ttggagggat caagcccaca
      181 cgcgtaaatc tttagtgatt cgagatttgt gtttaaaatc cgaattaaaa cctaatcaaa
      241 ttaaaactaa accaaaccaa atacaatcca aaattaaact aattttggtt gagtttggtt
      301 atagttttac taaatccaaa ttaacagaac ataaccaaac ccgaagattt ttagagtctt
      361 tagaatttta aggtgatttt agataaaaga gattaaacac aaatctcgaa aactaaagaa
      421 agagtttttg aaaattttta agtgttttca tgtaaagtgg atttctctgt gttttctgca
      481 ttctgcggat tataactcct atgttttttt tctccgtcaa ttatatgtgt ttattttctc
      541 tattttcttt tatttttatt tttattctct atattagggt ttagtttatg aaaacttttt
      601 gttatctata taggcttggg ggatgtattt aaattagaat ttaaagtgat ttgagttctt
      661 tgagttttta aataatttta acgattttaa aaaagttcgt atgatttttg taaaatctat
      721 taaaatctca ccttaaatca tgggatttgg atttctgtat tttgaactaa gaaaatcctc
      781 tcaaatcctc caaaatcatt aaaattcaaa tccacaaatt gttctgaata acagtgaatt
      841 ttaaggtgga ttttgaaata attagttcaa taacactgaa tttcatgaga ttttttaaaa
      901 tacatgtttg aataacatat gatttataaa ttctacacaa atcttttaaa attctaattt
      961 caatacattg tttttgaaag tgttattgac tcttgccaat atagtatccc aattcccaac
     1021 ttgtgtttca ttttttcatc tatctaataa acaattagat gaacacaaaa aaatattggt
     1081 aggtgatggc tcaattggat atgtttttga aaaccatgtg ttaaaaactt aaaatactat
     1141 ccaacttacc ccagtcctac caactttttt tttcttctct tggtctgctt acatgtgtct
     1201 gcttatatct ccaaaaggaa atagatatat aaaaattcaa atttaaatat ttgcgatttg
     1261 ttaaatttta atcaatattt aatttttgtt tttttttgtt tttttttatg aagacaacaa
     1321 ataaccaaat ttatcaaatc tgatcaaagc agatttagga ttttacaaat atattttttt
     1381 aatatgaatt ttgtggtcag attttgacca attctctttg aaaaaaaaaa aaatctatct
     1441 ataaaaacat gtgttacttt gaaaggatat ttcaaggaga agaatatatt tgactcagag
     1501 agagattcat tcagtaatgg cagcctcagg cacctctgct actttcagag cctccgtttc
     1561 ttcagctcct tcctcttctt cccaattgac ccatttgaaa tcacccttca aagctgtcaa
     1621 atatacgcct ctgccatcgt ctcgctccaa gtcatcatcc ttctccgtct cctgcaccat
     1681 cgccaaggac ccgcctgttc tcatggccgc cggatctgac ccggccctgt ggcaacgacc
     1741 cgattcgttc ggtcggtttg ggaagtttgg tgggaagtat gtacctgaaa cccttatgca
     1801 cgctctatct gagcttgaat ccgctttcta tgctcttgcc accgacgatg atttccaggt
     1861 agagaccttg ttcttcgtga ttagataaaa atgagacatt ttgcgttgtg ggtcgttgga
     1921 attgtagcgt tatcatttgc gggttgtgat aaagttcgga tcttgttgat gggatatgtt
     1981 gattctaggg catttgaatg aaattggttc tgttcattta tgaaatggtt agaaatatgg
     2041 ttttggttcc tctggttcaa tgtctcattc atgtcctttg tgtatttgtg tgaatgaatc
     2101 tttgttagag agagttggct ggaatcttga aggactatgt gggtagagaa agtcctctgt
     2161 attttgcaga gaggcttacg gagcattaca ggcgcgagaa tggcgaaggg cctcttatat
     2221 acttgaagag agaagacttg aatcacacag gagctcacaa gattaacaac gctgtggctc
     2281 aggctcttct tgctaagcgg ttggggaaga agaggattat tgctgagaca ggagcgggtc
     2341 aacatggtgt ggctacagct actgtgtgtg cccgttttgg tttggagtgt attatctata
     2401 tgggtgctca agatatggag agacaagcac tcaatgtgtt cagaatgcga cttcttggtg
     2461 ccgaggtgat gcttaggatt catttccatc tcttttaaat gtgataaact ttgttactga
     2521 accttttttt gggtgtctgg tgctataggt gagaggagtc cactctggaa cagcgacatt
     2581 gaaggatgcg acatctgaag cgataagaga ttgggtgaca aatgtggaga ctacccatta
     2641 catattggga tctgtggcgg gtcctcatcc ttaccccatg atggtcagag actttcacgc
     2701 tgtgattggt aaagaaacaa ggaaacaagc gttggagaaa tggggcggga agccggatgt
     2761 cttggtggct tgtgtcggtg gtggttcaaa cgctatggga ctcttccatg aatttgtgaa
     2821 tgacacagag gtccggatga tcggtgtgga agcagcggga ttcggattgg acagtggcaa
     2881 acacgctgct acattgacaa agggagatgt tggtgtcctc catggagcta tgagttactt
     2941 gctgcaagat gatgatggac aaatcattga accacactcc atcagtgcag ggtaagtgaa
     3001 attctctatt cagatccggt aaccataata agtctactca gttagccacg gctgcatgaa
     3061 catggtttga ttttctctta cagattggac taccctggag tcggacccga gcacagtttc
     3121 tttaaagaca tgggacgggc tgaatactat agcataaccg acgaagaagc gttggaaggt
     3181 aatgtaccga tgcatagtct ttagttctgt atttgaaaat ctgttctttg tttcataact
     3241 tacctttgca ttccctttgg tttgtggtct gtagcgttca agagagtgtc gcggttagag
     3301 ggaatcattc cagcactgga gacctcacac gcactagctt acctcgagaa gctatgtccc
     3361 acattatcag acgggacgag agtggtcttg aacttcagcg gaagaggaga taaagatgtt
     3421 cagacagtgg ctaaatatct tgatgtttga agagagaaac acttatgttt gtttttcagt
     3481 tattcagaca gaatgaaatc ctttttgtac ttgtaatctg tttgtacttt acttctctct
     3541 ggtcagtacg tttccaacaa aattggaaat agaaaaagat tacgaagaga acgaaatttg
     3601 aataaatgta ttatcaataa aataaaaaac atttcatatt ggaaatagtc ttatgttcct
     3661 caatctgctt tctttttcgt tttcaaaacg aaatcctact tttgcagata caaacatgag
     3721 atatgatctc agaactctaa tagcttattt atctcaaatt tcatacgaga tgttacattg
     3781 gaaagaggtc caaattgcta gaagacatat atattcgcat atagtatacc aatctgccaa
     3841 agttatgttc cccttgtgaa ttgtgatcct taagatttaa gaaaaaatag aagtggtgac
     3901 tttgatgtat gaaaaataat aatataagca aagttgcaaa atccaataga gaaagaatct
     3961 tcttttagta tatccttcac caaacatatc tctctgcctc ttctgaaatc ccaagagaca
     4021 tgtctctctc tctcttttcc aagtccgaac caaatctaat gtatcaacat tacttcttct
     4081 gaagtcccaa gagacattgc tctctctctt ttccaagtct gaaccaaatc taatgcatca
     4141 acattacttt tgtaataaaa ttcaataagg attagctgat tactgttgtc actctttggt
     4201 ttcttttcct ttgttaatag aattc
//
LOCUS       HS5E1P       2590 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Murine cytomegalovirus early transcription unit e1 protein gene,
            complete cds.
ACCESSION   M35146
KEYWORDS    .
SOURCE      Murine cytomegalovirus (salivary strand strain Smith; ATCC VR-194)
            DNA, passed in mouse (BALB/c) embryonal fibroblast cell line B25.
  ORGANISM  Murine cytomegalovirus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Betaherpesvirinae.
REFERENCE   1  (bases 1 to 2590)
  AUTHORS   Buehler,B., Keil,G.M., Weiland,F. and Koszinowski,U.H.
  TITLE     Characterization of the murine cytomegalovirus early transcription
            unit e1 that is induced by immediate-early proteins
  JOURNAL   J. Virol. 64, 1907-1919 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        163      955     early transcription unit e1, exon 1
               1049     1225     early transcription unit e1, exon 2
               1552     1574     early transcription unit e1, exon 3
    pre-msg      44     2548     e1 mRNA and introns
    IVS         956     1048     e1 intron A
    IVS        1226     1551     e1 intron B
    signal       16       20     TATA signal
    signal     2548     2553     Poly-A signal
BASE COUNT      564 a    732 c    751 g    543 t
ORIGIN      
        1 gcgtgatgca gactttataa atcgcaagcc ggagcggatc ctaggctcat ttcttctgcg
       61 atcttcgcat ttagcagacc aaatgctgat agttcctgcg tcgcggtaga ttacgtgccc
      121 acttttctcg tcgcgaccgg tgaaaagacc ttcgttcgga ccatggccgc gccagatcga
      181 cgcggatcgc ccatcgtttc gagacgacgt tacttcacct tccggaactc gagtcggacg
      241 ctgcatcaga atgtcacccg catgttcgac ctccgtcagt ggacctacga gagcgcccgc
      301 gttttggact gcacgaacgg cgagggccgc accgagaact gggggcccgg ctggctgtgc
      361 gcgaccatca tgcagtctcc ggacagcggc tcgtcgtccg gatcggcggc ccagggatgc
      421 atgtctctgg acatcacctc ggacgacgga ccggagaagg tcatgaacat gttccaccgc
      481 gggagcatcg tctgtaacaa gaccgtctct tcggtcgcga cgccgccggg gtccgacggc
      541 acgggcgcca gcctcttgac tctggtggcg gacggcagct tgctgcaagt catcctggtc
      601 gaacactcgc cgggcgccca ccgcgccgaa gccgacgatg tcgccgagag cgtcggcaac
      661 gccgcggtgg ccgccgccgt gcagtcggcc acttcgatgg gtagctcgag ctccgtcggc
      721 ggcgccggtg gcggcggctt caattcttcg gaagaacgtc gccgcgataa agactcgtcc
      781 aaatctcacg atgacgaccg acgtaagaga ttcgaactct gcaacggtac ccttcatagg
      841 gagcggacct cgggtagctc cggatcgagc ggcggcggcg gcggcagcgg cagcggaggc
      901 agcagcggcc tcagtacaaa gcagaaggag cgcagcagga ggctcgaaga ggaatgtaag
      961 tagttctgat cggggtttta ctggtcgctt tccccctcga ccttcggtga accgtcacgg
     1021 gactaatgtt gttactctgt tcggaaaggt tctccacgca gcggggggga gcccaagcgc
     1081 cagaagaccc accacgatcc gagaccggag agggatctgg aaccgcctag gagcagtacc
     1141 actgtcgatg gtaatgcaga cgaagctgga gcggtctcct ttttgaacag ttatgctgct
     1201 agttcgctgt ccgccgtcag cgatggtgcg ttaccaccct cctctccggt gactatagag
     1261 acaccaaggt ttaatgactt gatagaatcc gaggaggaag acgatggaac tcggtgttcg
     1321 tctgccgatg gtacggcgac ggtaaccgag accggggcgt ctgccgagat cgagcaaacg
     1381 ttcaccgacg gtgggctagc atgcatgagg tccttaattg actcgcgggc cctcatggat
     1441 ctcgccttta cgagccgcat gacggcggtg aagtcgggat ctgagggagc gggcaacacc
     1501 tgtcctgggg aacagaggct ttctaactgc aatcttcctc cgaattcaca ggaggacaac
     1561 aacccgcgtt ttgaaggcgt ctttaattcc ctgatgagac tcttagacga gtataaagac
     1621 aagagcacca gctcgagttc cgccgccgta ccgggatcca gttgtacagg caccacgcag
     1681 acagagggga tgcctcacca caggggtatc ttcgacaggg acacatccgc cgctcagcag
     1741 aaacctgtat gtgagatacg gcccttcatc gacatagctc gctgtgtcga gccggcggcg
     1801 gcgccgaccg cttcgcgctc cagagccacc acggccgccg ctcgcaggcg gggaagaggc
     1861 tcgcagcctc ggaggaacag tcgggtcgcg gcgcgcgtcg cccaggtcga tccgaccgac
     1921 ccacccgcag ctgctgctgc cgccgccgcc ggtcccacca cgcggggaag gagcaggagt
     1981 aggaggggaa gacgaggccc agacctcacg gaggacgggt tggagattgt cgagacgggc
     2041 gccaccgccg gcactctggc cgtttccgag gaggaaaccg cgatggccgc cgctatgtta
     2101 gaggatatgg tcgacttgga caatgtgttc gatgatctta attgatctca tgaaagatgg
     2161 ataaagaaat gatgctctgt aaactctaac tgactgttgt tatcgatcgc ctcgtctcgc
     2221 tcgggtcttc tgccgatgct gccgtctgta agcacgcaag caagcactcg ctctatagca
     2281 gtagaatatg tatggatgat atacttactg ttgttgttca cttactatta tcacctatat
     2341 aaaacttgta ttcatgatac ttaccactga aaaaattgtg tatgtctttt tatacttttg
     2401 accaccgtta ttgatgataa gtatactctt ctaactgtgt gtgtctctgt ctaccgggcc
     2461 gatggagata ttatccctgg tccccctctg tctgtgctaa caacataaca tccatgtaaa
     2521 taaaattaat atttttaata ttttatcaat aaaaaccaca catttgttac aataaaccgc
     2581 gctcgccttt
//
LOCUS       HSE14GP      3351 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete
            cds.
ACCESSION   M35145
KEYWORDS    glycoprotein 14.
SOURCE      Equine herpesvirus type 1 DNA.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 3351)
  AUTHORS   Guo,P., Goebel,S., Perkus,M.E., Taylor,J., Norton,E., Allen,G.,
            Languet,B., Desmettre,P. and Paoletti,E.
  TITLE     Coexpression by vaccinia virus recombinants of equine herpesvirus 1
            glycoproteins gp13 and gp14 results in potentiated immunity
  JOURNAL   J. Virol. 64, 2399-2406 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        300     3242     glycoprotein 14 (gp14) precursor
    sigp        300      554     glycoprotein 14 (gp14) signal peptide
    matp        555     3239     glycoprotein 14 (gp14)
    signal     3251     3256     Poly-A signal
BASE COUNT      882 a    892 c    858 g    719 t
ORIGIN      
        1 aacgttgggt tgttaccgca tctcaaggag gaactagctc ggtttatgat tactgcggct
       61 aaaggtaatt ggtcaattag cgagtttcaa aggttttatt gctttgaggg agtgacaggt
      121 gtgacggcca cgcagcggct ggcgtggaaa tatatcgggg agctcatcct agccgccgca
      181 gtattctcct cggttttcca ctgtggagag gtgcgcctcc tgcgcgcaga tcgtacctac
      241 ccggactcca gcggcgcaca gcgctgcgtg agcggcattt acataaccta cgaggcgtca
      301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg
      361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct
      421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa
      481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct
      541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt
      601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac
      661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc
      721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg
      781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct
      841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat
      901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga
      961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata
     1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt
     1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg
     1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg
     1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt
     1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc
     1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag
     1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact
     1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg
     1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tctacgttta
     1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag
     1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta
     1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc
     1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg
     1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca
     1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa
     1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg
     1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac
     2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg
     2101 cgtggtgtac cctccaaaac aaagagcgga ccctatggaa cgaaatggtg aagattaacc
     2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg
     2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc
     2281 gctcgatgga cagtaacacg tgctactccc gcccccccgt aacatttaca attactaaga
     2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg
     2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag
     2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg
     2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc
     2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac
     2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg
     2701 acaataccgc agtgattatg caggggatcg ccagcttttt caagggcctg ggtaaagtgg
     2761 gggaggccgt gggaacgctc gttctcggcg ccgccggcgc tgttgtttca accgtatctg
     2821 gaatagcttc gtttttaaac aacccatttg gggggctagc catcggcctg ctggtaatcg
     2881 ccggcctggt agctgcgttt tttgcttaca gatatgtaat gcagatccgc agtaacccca
     2941 tgaaagctct ataccccata acaacaaagg ccttgaaaaa caaagccaaa acttcctacg
     3001 gccagaacga ggaggacgat gggagcgact ttgatgaggc caagcttgaa gaggctcgcg
     3061 aaatgatcaa atacatgtct atggtttcgg ccctggaaaa gcaggaaaag aaagctataa
     3121 agaaaaacag tggggttggc ctgatcgcca gtaacgtctc aaagctggcc ctgcgaaggc
     3181 gcggtcccaa atatacccga ctccaacaga acgataccat ggaaaatgaa aaaatggttt
     3241 aaacatgttt aataaatatt atgacacgta ctcaaagtgt gacctcatat ttgcataacc
     3301 actttctagt tccggcccca aggatattta agcctagtat ctccgccgag g
//
LOCUS       MLV3ENV       232 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Murine leukemia virus p15-gag protein gene, complete cds.
ACCESSION   M35144
KEYWORDS    p15-gag protein.
SOURCE      Murine leukemia virus (strain Emv-3) DNA.
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 232)
  AUTHORS   Mercer,J.A., Lee,K.H., Nexo,B.A., Jenkins,N.A. and Copeland,N.G.
  TITLE     Mechanism of chemical activation of expression of the endogenous
            ecotropic murine leukemia provirus Emv-3
  JOURNAL   J. Virol. 64, 2245-2249 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         17  >   232     p15-gag protein
BASE COUNT       51 a     57 c     53 g     50 t     21 others
ORIGIN      
        1 gaccgcttct aaaaacatgg gaccgaccgt aaccacccct ctgagtctga ccctagaaca
       61 ctgggaagat gtccagcgca tcgcgtccaa tcagtccgta gatgtcaaga agagacgctg
      121 ggtcaccttc tgctctgccg agtggccaac tttcggtgta gggtggccac aagatggtac
      181 ttttaatnnn nnnnnnnnnn nnnnnnnnaa atctaaggtg ttctctcctg gt
//
LOCUS       CTRTPDC      1740 bp ss-mRNA            PLN       17-JUL-1990
DEFINITION  C.roseus tryptophan decarboxylase mRNA, complete cds.
ACCESSION   M25151 J04521
KEYWORDS    decarboxylase; tryptophan decarboxylase.
SOURCE      C.roseus leaf, cDNA to mRNA.
  ORGANISM  Catharanthus roseus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Gentianales; Apocynaceae.
REFERENCE   1  (bases 1 to 1740)
  AUTHORS   De Luca,V., Marineau,C. and Brisson,N.
  TITLE     Molecular cloning and analysis of cDNA encoding a plant tryptophan
            decarboxylase: Comparison with animal dopa decarboxylases
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 2582-2586 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Brisson, 01-JUN-1989
FEATURES       from  to/span     description
    pept         70     1572     tryptophan decarboxylase (EC 4.1.1.28)
    mRNA          1     1740     tryptophan decarboxylase mRNA
BASE COUNT      505 a    355 c    359 g    521 t
ORIGIN      Unreported.
        1 ctctctctct ctctctctaa gactttctct ctctacacat acacctacac cagaaaaaag
       61 aaaaaaataa tgggcagcat tgattcaaca aatgtagcca tgtccaattc tccagttgga
      121 gaatttaagc cacttgaagc tgaggaattc cgaaaacaag cccatcgtat ggtagatttc
      181 atagccgatt attacaaaaa tgtggaaaca tatccggtcc ttagcgaagt cgaacctgga
      241 tatctccgaa aacgtatccc cgaaaccgct ccttacctcc ccgaaccact tgacgacatc
      301 atgaaagata ttcagaagga tattatccca ggaatgacaa attggatgag ccctaatttt
      361 tatgcatttt ttcctgccac tgttagttca gctgcctttt taggagaaat gttgtctact
      421 gccctaaatt cagtaggctt tacttgggtt tcttcaccag ccgccaccga attagaaatg
      481 attgttatgg attggttggc tcagatcctt aaactcccca aatctttcat gttttcaggt
      541 accggtggcg gcgtcatcca aaacaccact agcgagtcca ttctttgtac aatcattgcc
      601 gcccgggaaa gggccctgga gaagctcggt cccgatagta ttggaaaact tgtctgttac
      661 ggatccgatc aaacccatac catgttcccc aaaacttgca aattggcggg aatttatccg
      721 aataatatta ggttaatacc tacgaccgtc gaaacggatt tcggcatctc acctcaagtt
      781 ctacgaaaaa tggtcgagga tgacgtggcg gccggatatg taccgctgtt cttatgcgct
      841 accctgggta ccacctcgac cacggctacc gatcctgtgg actcactttc tgaaatcgct
      901 aacgagtttg gtatttggat ccacgtggat gctgcttatg cgggaagcgc ctgtatatgt
      961 cccgagttta gacattactt ggatggaatc gaacgagttg actcactgag tctgagtcca
     1021 cacaaatggc tactcgctta cttagattgc acttgcttgt gggtcaagca accacatttg
     1081 ttactaaggg cactcactac gaatcctgag tatttaaaaa ataaacagag tgatttagac
     1141 aaagttgtgg acttcaaaaa ttggcaaatc gcaacgggac gaaaatttcg gtcgctgaaa
     1201 ctttggctca ttttacgtag ctatggagtt gttaatttac agagtcatat tcgttctgac
     1261 gtcgcaatgg gcaaaatgtt cgaagaatgg gttagatcag actccagatt cgaaattgtg
     1321 gtaccgagaa acttttctct tgtttgtttt agattaaaac ctgacgtttc gagtttacat
     1381 gtagaagaag tgaataagaa acttttggac atgcttaact cgacgggacg agtttatatg
     1441 actcatacta ttgtgggagg catatacatg ctaagactgg ctgttggctc atcgctaact
     1501 gaagaacatc atgtacgccg tgtttgggat ttgattcaaa aattaaccga tgatttgctc
     1561 aaagaagctt gatgaataag taagggtttt tttttaattt ttttttaaat tttatatttg
     1621 ctgattgttt gaagagttta aaaataaagt gatttgtaaa ggtttattgt actcaaacaa
     1681 tcatgcaatt aattatatgt attaattatg acatgagaat aaaatagaat ttgtgtgtgc
//
LOCUS       WHTAMYA      2397 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  Wheat alpha-amylase gene, complete cds.
ACCESSION   M16991
KEYWORDS    alpha-amylase.
SOURCE      Wheat (strain Chinese Spring) distal half-grain DNA, clone
            lambda-Amy3/33.
  ORGANISM  Triticum aestivum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 2397)
  AUTHORS   Baulcombe,D.C., Huttly,A.K., Martienssen,R.A., Barker,R.F. and
            Jarvis,M.G.
  TITLE     A novel wheat alpha-amylase gene (alpha-Amy3)
  JOURNAL   Mol. Gen. Genet. 209, 33-40 (1987)
  STANDARD  full staff_review
COMMENT     Draft entry and printed copy of sequence for [1] kindly provided by
            D.C.Baulcombe, 13-AUG-1987.
            
            The region bounded by nucleotides 975 and 1032 might be a signal
            peptide.
FEATURES       from  to/span     description
    pept        963     1052     alpha-amylase, exon 1
               1138     2028     alpha-amylase, exon 2
               2133     2393     alpha-amylase, exon 3
    pre-msg     908  >  2397     alpha-amylase mRNA and introns (5' end +/- 3
                                 bp)
    IVS        1053     1137     alpha-amylase intron A
    IVS        2029     2132     alpha-amylase intron B
BASE COUNT      613 a    615 c    599 g    570 t
ORIGIN      874 bp upstream of SstI site
        1 aacattcaca atgcaaaatc tatatcatca gatccatcat gaaatatatt ttcatattat
       61 atattttttg gtattgtagc tgcttttttc cttgaataca cacgagtgtg atatatatat
      121 ttttggtatt gtagctgctt taattatata tatatatata tatatataca tatatatata
      181 tatatataat taaagaagat gatgggctaa gaagcccttg gtgttgtaga tgttgatatt
      241 ttttgatata attttggtca aactttataa agtttggatt ttcagaaatc taacacagag
      301 agtaaaaaga aacggaagga gtatataata agagttctag cagccgctcc tccaactgca
      361 cccaactgca ccctcgcctg agccggagcg gcgaggggga ggctaggtcg ccgcgccacc
      421 atgtaaaaag attctttttt tatctttacc gactctagcc tcaccttgta cacacaaaaa
      481 caagccaatc gaggtttttt tctagaatca acatcaaacc atccggtttt atgtgaaccg
      541 cccggttcat cggttttttt tcaaaaaacg tctgattcga ccgggttttc ctggttggac
      601 tcaatcggtc caccagactt aaagaactgc aggaatcgcc ggttccggtt ttcaaaacta
      661 tgagcagagt gcttgtctag tgcagatata catggagagt aattaaacgg cagtaattaa
      721 tagacaaaac tttagctcct aatccatttt atctcagaag ccagcggtcc ttatccatgc
      781 ggatatcttt ggagaaaaat taaggagaat tgatagtttt tactaggcac ctactaatct
      841 ctaagctcca agtacctata aatacaagcg agctcaagca ttgccaaacc aaggagtagt
      901 cgtcggtcca ggagtccaga tttcatagga agtagaggcg agctgcactg cagactgtag
      961 ctatgggcaa gcactctgct actctctgtg gccttctggt tgttgtgctt tgtctcgcct
     1021 ccagcttagc acaggctcaa attcttttcc aggtgagaaa atgcatgcat cttcaagttt
     1081 catttgcacc cagaatcttt catatgatga aatgtgcttt ggtgtttcgt tggccagggg
     1141 tttaattggg aatcgtggaa gacacaaggc gggtggtaca agttcatgca agggaaggtg
     1201 gaggagatcg ccagcaccgg cgccacccac gtctggctcc ccccgccgtc gcaatccgtg
     1261 tcgccggagg gctatctgcc ggggcagctc tacaacctca actccaagta cggcagcggg
     1321 gccgacctca agtcgctgat ccaggcgttc cgcggcaaga acatctcgtg cgtcgccgac
     1381 atcgtcatca accaccgctg cgccgacaag aaggacggcc gcggcgtcta ctgcatcttc
     1441 gagggcggga cgtccgacaa ccgcctcgac tggggccccg acgagatctg cagcgacgac
     1501 accaagtact ccaacggccg cggccacagg gacaccggcg gtggcttcga cgccgcgccc
     1561 gacatcgacc acctcaaccc gcgggtccag agggagctct ccgcctggct caactggctc
     1621 aaaaccgacc tcggcttcga cggatggcgc ctcgacttcg ccaagggata ctccgcggcc
     1681 atggccaaga tctacgtcga caacagcaag ccggcgttcg tcgttggcga gctctacgac
     1741 cgggaccgac agctgctcgc gaactgggtg cggggcgtcg gcgggccggc cacggcgttt
     1801 gacttcccca ccaagggcgt tctccaggag gccgtccagg gcgatctggg gaggatgcgc
     1861 ggcagcgacg gcaaggcgcc cggtatgatt gggtggatgc cggagaagac cgtcacgttc
     1921 atcgacaacc acgacactgg gtcgacgcag aggctgtggc cattcccctc ggacaaggtc
     1981 atgcagggct acgcctacat cctcacacac ccgggcatac catgcatcgt aagtagtagc
     2041 acactacaca acctcaccat aacatttcgc atcaaacgta ccccacgatg tttgtgatct
     2101 gaacttacaa ctacttggtt ttgcgcgcgc agttctacga ccatgtgttc gactggaaac
     2161 tgaagcagga gatcaccgca ctggctacgg tcaggtcaag gaacgggatc catccgggta
     2221 gcacactgga tatcctcaaa gccgagggcg atctctatgt tgccaagatt gggggcaagg
     2281 ttataaccaa gatcgggtca agatacaata ttggcgacaa tgtgatcccc tcgggtttca
     2341 agattgcagc taaaggcaac aactattgtg tctgggagaa gagcggcctc tgaattc
//
LOCUS       STAFEMA      3446 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  S.aureus factor essential for expression of methicillin resistance
            (femA) gene, complete cds, and trpA gene, 3' end.
ACCESSION   M23918
KEYWORDS    femA protein.
SOURCE      S.aureus (strain NCTC 8325; organism BB270) DNA.
  ORGANISM  Staphylococcus aureus
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Micrococcaceae.
REFERENCE   1  (bases 1 to 3446)
  AUTHORS   Berger-Baechi,B., Barberis-Maino,L., Straessle,A. and Kayser,F.H.
  TITLE     FemA, a host mediated factor essential for methicillin resistance
            in Staphylococcus aureus: Molecular cloning and characterization
  JOURNAL   J Antimicrob Chemother 23, 671-673 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by B.Berger-Baechi, 11-APR-1989.
FEATURES       from  to/span     description
    pept    <     1      294     trpA protein (AA at 1)
    pept        577     1878     FemA protein
    pept       1897     3156     ORF 419 protein
    pre-msg     514  >  3446     FemA mRNA
    rpt         361      390     large inverted repeat, left end
    rpt         405      434     large inverted repeat, right end
    rpt         416      423     small inverted repeat, left end
    rpt         430      437     small inverted repeat, right end
BASE COUNT     1281 a    448 c    617 g   1100 t
ORIGIN      3 bp upstream of EcoRV site; segment 18 of S.aureus chromosome.
        1 gatatcgtat cccatgcgga aggctttatt tatactgtga cgatgaatgc gacaacaggg
       61 caaaacggtg cgtttcatcc agaattaaaa cgaaaaattg agtcaattaa agcgatagcc
      121 aatgtgccag ttgtcgcagg atttggtata agaacaccac aacatgttgc agatataaaa
      181 gaggttgcag atggcattgt cattggtagc gaaatcgtta agcgatttaa atctaacacg
      241 cgtgaggaaa tcattaaata tttacaatct atccaacaaa cattgaataa ttaagtttac
      301 ttgatttaaa aaaattaggc gaatactgtt tgaaaaagtg aaaaacggtg aattataaaa
      361 ttgaatacaa tttcaaaaaa agtaatatga gcaaacccaa acgttcatat tacttttttt
      421 gaaattgtat tcaaaaatct aaatattact ataaaagtat acgcaattaa agcgtttatg
      481 ttttagtttt aacattaact attgtatact tatttagatt agatttatta tttttgacat
      541 ttgcagaggg gaaatagaaa aactgcaaat acggaaatga aattaattaa cgagagacaa
      601 ataggagtaa tgataatgaa gtttacaaat ttaacagcta aagagtttgg tgcctttaca
      661 gatagcatgc catacagtca tttcacgcaa actgttggcc actatgagtt aaagcttgct
      721 gaaggttatg aaacacattt agtgggaata aaaaacaata ataacgaggt cattgcagct
      781 tgcttactta ctgctgtacc tgttatgaaa gtgttcaagt atttttattc aaatcgcggt
      841 ccagtgattg attatgaaaa tcaagaactc gtacactttt tctttaatga attatcaaaa
      901 tatgttaaaa aacatcgttg tctataccta catatcgatc catatttacc atatcaatac
      961 ttgaatcatg atggcgagat tacaggtaat gctggtaatg attggttctt tgataaaatg
     1021 agtaacttag gatttgaaca tactggattc cataaaggat ttgatcctgt gctacaaatt
     1081 cgttatcact cagtgttaga tttaaaagat aaaacagcag atgacatcat taaaaatatg
     1141 gatggactta gaaaaagaaa cacgaaaaaa gttaaaaaga atggtgttaa agtaagattt
     1201 ttatctgaag aagaactacc aatttttaga tcatttatgg aagatacgtc agaatcaaaa
     1261 gcttttgctg atcgtgatga caaattttac tacaatcgct taaaatatta caaagaccgt
     1321 gtgttagtac ctttagcgta tatcaacttt gatgaatata ttaaagaact aaacgaagag
     1381 cgtgatattt taaataaaga tttaaataaa gcgttaaagg atattgaaaa acgtcctgaa
     1441 aataaaaaag cacataacaa gcgagataac ttacaacaac aacttgatgc aaatgagcaa
     1501 aagattgaag aaggtaaacg tctacaagaa gaacatggta atgaattacc tatctctgct
     1561 ggtttcttct ttatcaatcc atttgaagtt gtttattatg ctggtggtac atcaaatgca
     1621 ttccgtcatt ttgccggaag ttatgcagtg caatgggaaa tgattaatta tgcattaaat
     1681 catggcattg accgttataa tttctatggt gttagtggta aatttacaga agatgctgaa
     1741 gatgctggtg tagttaaatt caaaaaaggt tacaatgctg aaattattga atatgttggt
     1801 gactttatta aaccaattaa taaacctgtt tacgcagcat ataccgcact taaaaaagtt
     1861 aaagacagaa ttttttagga agggaattat caaaacatga aatttacaga gttaactgtt
     1921 accgaatttg acaactttgt acaaaatcca tcattggaaa gtcattattt ccaagtaaaa
     1981 gaaaatatag ttacccgtga gaatgatggc tttgaagtag ttttattagg tattaaagac
     2041 gacaataaca aagtaattgc agcaagcctt ttctctaaaa ttcctactat gggaagttat
     2101 gtttactatt cgaatcgtgg tccagtaatg gatttttcag atttaggatt agttgattat
     2161 tatttaaaag agttagataa atatttacag caacatcaat gtttatatgt taaattagat
     2221 ccgtattggt tatatcatct atatgataaa gatatcgtgc catttgaagg tcgcgagaaa
     2281 aatgatgccc tagtaaactt gtttaaatca catggttacg agcatcatgg ctttacaact
     2341 gagtatgata catcgagcca agtacgatgg atgggcgtat taaaccttga aggtaaaaca
     2401 cccgaaacat tgaaaaagac atttgatagt caacgtaaac gtaatattaa taaagcgata
     2461 aactatggtg ttaaagtcag attccttgaa cgtgatgagt tcaatctttt cttagattta
     2521 tatcgtgaaa ctgaagagcg tgctggattt gtatcaaaaa cagatgatta tttttataac
     2581 tttattgaca catatggaga taaagtatta gtaccattag catatattga ccttgatgaa
     2641 tatgtgttaa agttgcaaca ggaattgaat gacaaagaaa atcgtcgtga tcaaatgatg
     2701 gcgaaagaaa acaaatcaga taagcaaatg aagaaaattg cagaattaga taagcaaatt
     2761 gatcatgatc agcatgaatt attgaatgca agtgaattga gcaaaacgga cggcccaatt
     2821 ctaaaccttg cttctggcgt ttattttgca aatgcatatg aagtgaatta tttctctggt
     2881 ggttcatcag aaaaatataa tcaatttatg ggaccataca tgatgcattg gtttatgatt
     2941 aactattgct tcgataatgg ctatgatcgt tataatttct atggtttatc aggtgatttt
     3001 acggaaaaca gtgaagatta tggcgtatac cgctttaaac gtggatttaa tgtacaaatc
     3061 gaagaattaa taggggattt ctataaacca attcataaag tgaaatattg gttgttcaca
     3121 acattggata aattacgtaa aaaattaaag aaatagatgt aaatgatgtt agggcttata
     3181 gttattgata ctataggctc ttttttatat gttttttaat aaagcctcga ataatgatat
     3241 attaaatgac aagtttaaca agtgattgca ataaatataa atatgaattg cacaacctat
     3301 ttgtaaatga ttgagttgag tatacttttt aattcattca atatgactca aaacaagcca
     3361 atatgtatta tatttttggt gaataaacct tagcataaca atttatgtct gattatatat
     3421 gtgtgtgttt gcaagtgtgt gatatc
//
LOCUS       WTVPNS11A    1172 bp ds-RNA             VRL       17-JUL-1990
DEFINITION  Wound Tumor Virus genomic segment S10 encoding Pns11 nonstructural
            polypeptide, complete cds.
ACCESSION   M24114 J04352
KEYWORDS    polypeptide Pns11.
SOURCE      Wound Tumor Virus, cDNA to viral RNA.
  ORGANISM  Wound tumor virus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus.
REFERENCE   1  (bases 1 to 1172)
  AUTHORS   Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L.
  TITLE     Complete nucleotide sequence of wound tumor virus genomic segments
            encoding nonstructural polypeptides
  JOURNAL   Virology 171, 222-228 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by D.L.Nuss, 20-APR-1989.
FEATURES       from  to/span     description
    pept         25     1068     polypeptide Pns11
BASE COUNT      359 a    222 c    236 g    355 t
ORIGIN      
        1 ggtatttttg ttggatctgc tgtgatggat gcttctgtcg acaggatcac aaatctacat
       61 tttgaaatcc tcgctaaagc gggtggacat gaaatacacc aaaaatatga agcaattagg
      121 aagttgaacc ttacgggtga ttcttcaaag agtaatatct ctgtatctgc aaggtcagca
      181 atattgaaat gggctgacgc taaacaagga tacatagcca gccaactgga cgatcgagat
      241 tatggcgacc tcatcgccaa agctgtaatc tttgtcccga tgtctgttat aacaggaggg
      301 aaaaatccga aggatttaat accttacgga gttgtcgctg cagtgttaat ttttgttcct
      361 gaaacgttga cactactgga tgagatagtt attaatttaa tgcatgataa gaaacctttg
      421 tcatcaattt tattgacaaa gatactcaga gacatgaaaa ttgatgtctg tggtagtaat
      481 tttgattcat tttattactg cccaatttcg aggtataata gacatattat taagctagct
      541 ggtgcattac ctcaaatgcc gactagcgtt agattatcag ttaatgattt agctcgagta
      601 gctatttccg aagtacataa ccagcttatt tcggataaac agatgttttt caaactacca
      661 actggttttt cgccaaaagt gcattgttta aaagtacttt gcactactga aatggaaatt
      721 tttcagaaat gggttagaac gttcatgtct gatagaccga acgaattcat ctattctgac
      781 cagtttaata tattatctag gaccacctac ttttcttcag atgatccttt ttcatttttt
      841 actctttgga ggggatggtc aacctacaaa gaaatactct ctcaagatca agcttcaagc
      901 tttcttgaag ccataggaag tggaaagccg ttgagaagtt caatcgctac ttttccatca
      961 atgtttgatg aaggtgctat ttacatcaga tatgaatgga taactccaaa agattcagca
     1021 aattcgaaga aggctggctc ttcagcaccg agcgcgccta aaatgtagaa gatttgtcct
     1081 tgagattcgc atctggagca gcccggccta gtcacccgga gccagatcga gctaaaactc
     1141 gagttgaatt ccccagctcc aacaaacatg at
//
LOCUS       WTVPNS4A     2565 bp ds-RNA             VRL       17-JUL-1990
DEFINITION  Wound Tumor Virus genomic segment S4 encoding Pns4 nonstructural
            polypeptide, complete cds.
ACCESSION   M24117 J04352
KEYWORDS    polypeptide Pns4.
SOURCE      Wound Tumor Virus, cDNA to viral RNA.
  ORGANISM  Wound tumor virus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus.
REFERENCE   1  (bases 1 to 2565)
  AUTHORS   Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L.
  TITLE     Complete nucleotide sequence of wound tumor virus genomic segments
            encoding nonstructural polypeptides
  JOURNAL   Virology 171, 222-228 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by D.L.Nuss, 20-APR-1989.
FEATURES       from  to/span     description
    pept         64     2262     polypeptide Pns4
BASE COUNT      809 a    463 c    625 g    668 t
ORIGIN      
        1 ggtattgatc ccactttcta gatcagtgag ctagtgaagt aaggttcaaa actaatcatc
       61 gccatgaagc aagtagccgg gtttattggt atgagtcaaa agaacaaagg aattcagcaa
      121 aaccagtggc attccggtcc gccacaggga ctgctttcag gacaaacgaa agctgaacaa
      181 ggtaccagca gtcagcaagc tggagtgaat caaggagaga attccaaatc tggggcggta
      241 gtccagagga aaatgccaat gaggagagac gttaattctg ccgcacaacg tcaacatgta
      301 aggaggttgg tgtacacacc atactttcca aatgaaacat ggaatatatc aacattcaaa
      361 aatggagcta aagacgtaga acggtcggtt atatcgacgc tgaacaacgt cgctgagaat
      421 agagtcatgg acaactgtac ttccagagtc atttttgaga tgactcaaat acaatttgag
      481 tcacttccag atataataag aaatgaattt actagagttg gggatgacgc cttgaaatgg
      541 gcggtccctg aagatttgaa atctgcagat cttgaccata tgatggtcgt gaaattatca
      601 acagaaggca cgatataccc aacgacgctt atttttccag gtggatgtag tgggatggct
      661 aaattgaaat cagtctactc attcttggaa agtcagctag aaagaatagt cacaccgaca
      721 ccctcggtgt ctttgaagta tgttacaagc tgggctgagc acttatttga tctgtgctcc
      781 ggacagctga tcaattcaca aaacgagcgt gtagacaaac tgcttggata catgatctgg
      841 gatattgaaa aagctatcac tttgactgat caagtgatcg catgttacaa ccaccctgaa
      901 gttgtgctga ggaggctagg ggcctcggat attgcatgcg ctgtgttagc tggggagtct
      961 gtggtgaagc tcacccgcct agcgttatcc aagtctcctg ttgatggctg ctcatgctgt
     1021 agaatattgg agctgattct aaatttacca tcacgtaagc ccaacgataa ggtgccgcag
     1081 gtaccattgg acatactatt cgcatctgtc tatagatacg tatcagccat gtgcatgggg
     1141 cgagtgctga atgggcgcat tgacgcatcg gggatacagt cgactgatca tgcgactgct
     1201 tcgatcaagc tgaatgatat aattgtaaat gatcttgagt taaggagtat gggtgttgat
     1261 aaaacgtcat cattcagagg aacgcaatct atgagggcgt tctacgttcc tgagaattta
     1321 gcaggctcta tactagatag gattaacgtg ttggtcatgc gccattttgg aatcctgcat
     1381 atgtggggtt ttaatggtgt agtacttcaa aatcaagagg ggtactgtga ctatcacatt
     1441 ataacaggat taaaccactt aacaactata acgcatacta actccatggt tgcagtacat
     1501 tggggtactg aatcaaggat ggataatata tttgaaataa aagcaagaac tcttccaacc
     1561 gcatccgaga cgatgataac attgatcgaa aatgccttga aggagcaact gacatcaata
     1621 gtgaaagacg gcctcaggaa gggagtaagt ttttcagtga aaagaaatat aaatgactct
     1681 cgctttggat ttgagactaa ttcatcccca gcgatatttc tgaaactcag agacatgcta
     1741 aaaagagcta agcctttctc agacctgtta tccttggcat tgtcaaaggt aataaaaaaa
     1801 gagaacgcta tgatacaacg gtcaatcacc acagtggaag tggccgttgc cattaaaatg
     1861 aaagtctatg ggttggatga gtatgtcagc ctgatgaaag ttgagaagaa ggaagtggaa
     1921 agtggaagcc ttccacttca agaatttctg aaattaaaat caaatgcagc aggggcacag
     1981 tcatcgactg ttgcagttaa gatgaaagaa gaagaagtca atagcaaagc ctattgcctg
     2041 atatcagaaa ctattgttgt aaacatggac gcagtgagaa gtgcatgtgg agtagtgcaa
     2101 tcagagaatc tggtcataaa gtcagaactt tcaggaccag aattgagtga aagtgtaact
     2161 tcagggttga tggagttact aggaagaaat gccggaccaa gtaaatcttg ggctgaccaa
     2221 gttgaagaag cggaaaatga agaggaaaaa caaaaggagt gagtgtgctc catttctgga
     2281 tcggagtgtc ttgcgaggct cgggtctata gatttataaa catgtattat aattaatatt
     2341 attagtggta atatattgca tttgtaaata gtgtgcttat ctcattaggc ctgccattag
     2401 ggcggggtcg tgatgaaccc tcaggactat accacgtccc attttagctc gctatagcaa
     2461 atttgagtat caacataagt gactggagtt taaaccctct tagctttatt cttgagcgaa
     2521 gctataatga cgatggttgg tctagggctg atggatcatc atgat
//
LOCUS       WTVPNS7A     1700 bp ds-RNA             VRL       17-JUL-1990
DEFINITION  Wound Tumor Virus genomic segment S6 encoding Pns7 nonstructural
            polypeptide, complete cds.
ACCESSION   M24116 J04352
KEYWORDS    polypeptide Pns7.
SOURCE      Wound Tumor Virus, cDNA to viral RNA.
  ORGANISM  Wound tumor virus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus.
REFERENCE   1  (bases 1 to 1700)
  AUTHORS   Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L.
  TITLE     Complete nucleotide sequence of wound tumor virus genomic segments
            encoding nonstructural polypeptides
  JOURNAL   Virology 171, 222-228 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by D.L.Nuss, 20-APR-1989.
FEATURES       from  to/span     description
    pept         45     1607     polypeptide Pns7.
BASE COUNT      551 a    298 c    338 g    513 t
ORIGIN      
        1 ggtattttct cctgaaatac ttgacctggc tagggttcgg tgaaatgaag atttccgact
       61 tctgtttcgc ctctgcaaat gatggatcat acactttaaa agcattttcg gaacttaatg
      121 aatacaagga tgtagtgaaa cttgtctctg atgaaaagat tggagttgga tttcactgct
      181 acaatttagg tctgatgaat atagttgaag atttttccgg aaatcttgac aacgaatctt
      241 acttgaccag taaggtaggc aaaagaatgg cttccgagct agtcactgca tactcaaaat
      301 ttgggtctac tagcagccgg accttacata gctcattaaa tctaccagtc gtaaatataa
      361 cttcattacc aacatcccaa gctaaagatc tcaaaccgaa tcattcattg gatgataagg
      421 gatctatgct acggactcaa atacattcaa ttctgactgg aaacggtccg ctaactataa
      481 aacgaaggat agatgcgttt tattatagcg cctcaagtat attcactaga cacatgactt
      541 cgaaatatgc taatcctgga tcaaatgtgc ctcaaaggtt ttcatttata ccagattgtg
      601 caatgaataa gaaaccaacg ctatttctag agaatagaga taacgaactc caagattcca
      661 tgaccatcat gttaatgttg ggacaggtct tttcggatgc tttaacatat tatctaaatg
      721 cctcaatact atacggtatc ctaggtagaa ttgaatccaa agtgcaggtt gatttaccag
      781 ccatcactct tgaatccgtt catgttacaa ataatttgga aatctctccg gccgcatttg
      841 ctctaatagc gtcagtctgg cttgataaag ctgaaatatt atcaaaatta aatgcgatag
      901 acttcatagt gtctcctgaa gataatgaag atagaatatc aaatttactg aaactgatgc
      961 tgccagttca gtctaataat ataacggttg aaaaatctga cactaggttc tcggtaaccc
     1021 attcagatgg atttatgagg tattatatgt gtttttctaa acacgaattt gattatggag
     1081 accaccttga gtcttttggc atcccagtgc tgcgtgtaag actaggtaaa ccaatatcaa
     1141 atgaactgaa caagccgatg ctggtaatgt tcaagaagca tgaatcaata tctagcataa
     1201 atgttagata tcaagtaaga ggtggaagta tacctaaatt caggacctca gaatttagaa
     1261 gagatatagg aatgttagtt gccaattctc ggtttatggc gactgatatt actttgatat
     1321 tgtctacatt ttatccattc actcaggaaa cggataaatt atttattgaa caacacataa
     1381 aagaaatttt tcttgatatg tatccatgga ttgataaact aacttcagcg gacgctaaat
     1441 ctgaggttaa tatatcatat gggaatttag tgctatactc atatggagag ttagtcaaaa
     1501 atagtatatt tattgcaatg atggacaatt gcaaggacgc ccggaatagt ttttctcggg
     1561 ctgatatgag agagatccaa gcatttgtag cagcgtttac acagtgagta agattttttc
     1621 atgaacctgg agcttgcgag gccaccagca gctaaaaccc gtaggtggag gttgtataca
     1681 tggtgagtgg ggagcatgat
//
LOCUS       ATUVIRD      3662 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  A.tumefaciens virD1/D2 protein genes, complete cds.
ACCESSION   M33673
KEYWORDS    virulence protein.
SOURCE      A.tumefaciens plasmid pTiC58 DNA, from Nicotiana tabacum.
  ORGANISM  Agrobacterium tumefaciens
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 3662)
  AUTHORS   Wang,K., Herrera-Estrella,A.H. and Van Montagu,M.
  TITLE     Overexpression of virD1/D2 genes in A.tumefaciens enhances
            T-complex formation and plant transformation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by A.Herrera-Estrella, 11-APR-1990.
            
               Author address [1]: A.Herrera-Estrella
                               Rijksuniversiteit Gent
                               Department of Genetics
                               Ledeganckstraat 35, Ghent, East Flanders
                               Belgium B-9000
               e-mail [1]: jecop@gengenp.rug.ac.be
FEATURES       from  to/span     description
    pept        648     1091     unknown protein 1
    pept       1125     2468     unknown protein 2
    pept       2483     3079     unknown protein 3
    ORF        3118  >  3662     unknown protein 4
BASE COUNT      870 a    975 c   1042 g    774 t      1 others
ORIGIN      
        1 gtcgacaatg cttcatcgat atcgagcgga gttaacatgg tcgggatcag aagcaggttt
       61 gagctggcaa tgatcgtgtt gttgagttcg ctcgaaccac catgcgtatc ggccagcgca
      121 taatcaaatc cttggagttc ggcgtcctca taggccgcct caaggagtgc catttcctcg
      181 gcggcgtaga cttcgcagaa ggagccctag gtattgctgc gaagggcgtt ttctttccat
      241 cgcgttagtg gtcggttttc atcagcgtcg aagagagcca atcgtttgcc gtcacttgca
      301 aaggcggcgc aaagacccat gagtgccgtg gttttgccgg ctcctccctt gaaggagcaa
      361 aatgtcagaa gtttcatgtc cttatcctgt cgattttgtg aagcggaagt gtgtctgtac
      421 ttttatttgt gtgtatgatt ttgcgataat tcataagtaa tgtagtaatt acctgatttt
      481 atatttcaat tttattgtaa tataatttca attgtaataa tataaaaata aatatccctt
      541 atgtgttctt gatttcgttt tgtatatggc tagattccca tctgccacga cgaggaaatg
      601 ctacggcggg gcaagttcag atctttccgt cttctatgga ggaagctatg tcgcaaggca
      661 gtaggcccac ctcaagtgac attgccgtca accagcgcga atgcgtgaag gttgaaggct
      721 tcaaggtcgt cagtacccga ttaagatcgg ccgaatatga gagtttttct catcaggcac
      781 gcttgctggg cctctccgac agcatggcca tacgggttgc ggtgcgccgc attggtggct
      841 ttcttgaaat cgacgcagag actcgtcata ggatggaggc catactacaa tccataggaa
      901 cactctcaag caacattgcc gcgctgctat ctgcctatgc cgaaaatccg acaatggatt
      961 tggaggcttt gcgagctgaa cgtatcgcct tcggtaaatc tttcgctgac ctcgacggct
     1021 tgctccgttc cattttgtcc gtatcacggc ggcggatcga cggttgctcg ctgctgaaag
     1081 acgccttgta gcactgacgt agcacttggc ggggaacata ttcgatgccc gatcgagctc
     1141 aagttatcat tcgcattgtg ccgggaggtg gcaccaagac ccttcaacaa attatcaatc
     1201 agttggagta tctatcccgg aagggcaggc tggagctgca gcgttcagcc cgacatctcg
     1261 atattcccct gccaccggat caaatccacg aacttgcccg aagctgggtt caagagactg
     1321 gaacttatga cgaaagtcag ccagacgagg aaaggcaaca ggagttgacc acccatatta
     1381 ttgtaagctt ccccgccggt acaagccagg tagcggctta tgcggcgagc cgggagtggg
     1441 cagccgagat gtttgggtca ggcgcagggg ggggccgata caactatctt acggccttcc
     1501 acatcgatcg cgaccaccca catctgcatg tcgtcgtcaa tcggcgcgaa cttttaggac
     1561 acggctggct gaagatatct cggcgccatc cccaactgaa ttacgacgcc ctgcgcataa
     1621 agatggccga gatttcactt cgtcatggca ttgccctcga tgcgagccga cgagcagaac
     1681 gtggcatcac cgagcggccg atcacttatg cccaatatcg gcgccttgag cgggagcagg
     1741 ctcgccaaat ccgtttcgaa gacgcggatt tggaacagtc gtcgccgcaa ggagatcatc
     1801 cagaattcag ccaacctttc gatacatccc catttgaagc atccgcgggc ggaccggagg
     1861 acatgcctcg gcccaacaat cggcagaatg agtcgcaagt tcatctccag gagccagctg
     1921 gtgtcagcaa cgaagccggt gtccttgtgc gggttgcatt ggagacggag cgccttgctc
     1981 aaccattcgt ttccgaaacc attctcgcgg acgacatagg gagcggctct tcgcgtgttg
     2041 ccgagggccg tgtggagagc gcaaaccgca ctcccgatat tcctcgcgca gcaactgaag
     2101 ctgccacgca cacgacacac gaccggcagc ggcgtgcaaa gcgtcctcat gatgacgacg
     2161 gagggccgag tggagcaaaa cgtgtgacat tggaaggcat cgcggttggg ccccagcgaa
     2221 cggccggcga acaggatggc agtagtggcc ccttagaacg gcaagctgga acgtctcggc
     2281 catctccacc gacggccacg acgcgggcca gcaccgcaac cgattcattg tctgctacag
     2341 cccacctcca gcaacggaga ggtgtccttt caaagcgtcc gcgtgaagat gatgatggag
     2401 aaccgagtga acgcaaacgc gagagagatg agcgcagcaa ggacgggcgt gggggaaata
     2461 ggagatagag aaggaccgag tgatggcaaa tgaagagttc accagacact atgcgtggcc
     2521 cgttcctgtg gcttcgaatg atgaagggcg tgggaccgcg cggatcccca tccaggcaca
     2581 atcaatcgtt gctggagaag acggtcggga cacttcggtc ccaacggctt tgtcgcgacc
     2641 gccaattgaa gatatgccgc acggcgtcca agaaacatcg gcgagtggcg gacgactggg
     2701 agcggcccgt ctgcgggatt ccgtaatccc gccaggaata tccgaagccc gcacggacct
     2761 atccgcaatt ttgcggaaaa aaagcggttc tttccgcacc ggtatgcagt atctgcgtgg
     2821 gcttgaacgg gaaaattttg ataaacaaga cagggaagcc agtgcgttgc cagatttaag
     2881 tgcaaggggc ataaagcgac cgcgcgaaat tgagtatccc ggcaatgcaa gcggattaac
     2941 cataaagagg caagacggct taggcataga gatcaatact atctcggcat cttcgcccgt
     3001 gaaccggggc cgcgcattcg tcgaactggc aaggcgcgcc ggaaccgggc gtgtacaatg
     3061 ttcagccatc ggcagataga gcacagaact ctgcgcagga aagttcgacg tttcccgatg
     3121 gcacttctgt ttccgctctc tattcaggcc cactcgcaga atggttcgag agggatactg
     3181 gcagcgagac gaccaggaat tcaggcaaca ccatttcgtc accgctccga gggctggagg
     3241 agttcggcga ttccgcggac agccggtatc ttgggcgcga agctcagagt ctttcagtta
     3301 ccgtaacaac gcctaattcg aatgctgagg caagttctca tagcgcacac actgaaactc
     3361 tcgacgatgt cagcagtgac cgctcgagcg aacaggggag gggtcccctt ggcgccgcaa
     3421 tccttggatc ccatcatgac cttttcnccg cgtgcgcaga agttatcaca aacaaaccgc
     3481 gattcccctg aactgaccga cgccgaccta gcaaaagtcg atgcggtgtt cgaatctctc
     3541 tccaaggggc ccccggcggg ggagagcgcc gcaccggact ttcgtgagcg cggaccaggc
     3601 agcgctttcc agaaagaagg cgtgtccgac agggcgaacg gggtgcccac taactgggag
     3661 gt
//
LOCUS       CHKFIBAB     2730 bp ss-mRNA            VRT       17-JUL-1990
DEFINITION  Chicken alpha-fibrinogen mRNA, 3' end, and ORF, complete cds.
ACCESSION   M34096
KEYWORDS    alpha-fibrinogen; fibrinogen.
SOURCE      Chicken (strain White Leghorn) liver parenchymal cell line, cDNA to
            mRNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 2730)
  AUTHORS   Weissbach,L. and Grieninger,G.
  TITLE     Bipartite mRNA for chicken alpha-fibrinogen potentially encodes an
            amino acid sequence homologous to beta- and gamma-fibrinogens
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5198-5202 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Grieninger, 08-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1     1518     alpha-fibrinogen (AA at 1)
    ORF        1830     2582     ORF
    mRNA    <     1     2730     alpha-fibrinogen mRNA
BASE COUNT      825 a    574 c    678 g    653 t
ORIGIN      
        1 acgatactct gtgtgttgct ctgcctcaat ttagcttggg ctcaggatgg gaagaccacc
       61 tttgaaaagg agggcggagg cgggcgtggc cccaggatcc tagagaacat gcacgagtcg
      121 tcctgcaagt atgagaagaa ctggcctatc tgtgtggatg atgactgggg tacaaaatgt
      181 ccatcatgct gcagaatgca aggaataatt gacgacacag accagaatta cagtcaaaga
      241 atagacaata ttaggcagca actcgcagac agtcaaaaca agtataaaac atccaatcgg
      301 gtaattgtgg aaaccataaa tatactaaag cctggtctgg agggtgccca gcagctcgat
      361 gagaattacg gtcatgtgtc aacagaactg aggaggagaa ttgtgacatt aaagcagcga
      421 gttgccactc aagtgaacag aattaaagct ctgcagaaca gcatccagga gcaggtggtg
      481 gaaatgaagc gcttggaggt ggacattgat attaagatac gagcttgcaa agggagctgt
      541 gctagaagtt ttgattacca ggtggacaaa gaaggctatg acaacatcca gaagcatctt
      601 acccaggcca gctccattga tatgcaccca gactttcaaa caaccacctt gagcacactg
      661 aaaatgaggc ctcttaagga ctcgaatgtt cctgaacatt ttaagctcaa gccttcacca
      721 gaaatgcaag ctatgagtgc atttaataac atcaaacaga tgcaagtggt attagaaaga
      781 ccagaaacag accacgtggc agaagcaagg ggggattctt caccttccca cactggcaaa
      841 ttaattactt cttctcacag gagagagagc cctagcctgg tagacaaaac ctcctctgcc
      901 tcctctgtcc atagatgcac cagaactgtc accaagaaag tcattagtgg ccctgatggt
      961 cctagagaag aaatagttga aaaaatggtt tcctctgatg gctcagactg ctcccatcta
     1021 caaggaggac gagaaggaag cacgtaccat tttagtggga caggtgactt tcacaagcta
     1081 gacaggcttc tgccagatct agagtcattc ttcacccatg actctgtatc cactagtagt
     1141 aggcactcaa ttggctcaag caccagcagc catgtgactg gggcaggcag cagccaccta
     1201 ggcactggag ggaaagacaa attcacagat ttaggggagg aggaagaaga tgactttgga
     1261 ggacttcagc catctggatt tgctgctggc agtgcgagtc actccaagac tgtactgacc
     1321 agctcctctt ctagtttcaa caagggaggc tccacttttg aaaccaagtc actaaagacc
     1381 cgtgaaacat ctgagcagct aggtggggtg caacatgatc agagtgcaga ggacacccca
     1441 gactttaagg cgcgcagctt cagaccagca gcaatgagta caaggagatc ctacaatggg
     1501 aaaggtacac agaagtagta attgaggtag tggagccaaa ttcaattcat aaccaaactg
     1561 acagaatgtt tttagatact gcagtataat agtgtgatga taaaatgcca gaactgtgtc
     1621 tgtttgttgc cactttggag ataaaagaga actatatcac taatagtcag gtattgcata
     1681 gacaattagc aatataagat ctgtgtaagg ttttcctctc tcaattcttt ttaatgttgc
     1741 ctgtcactaa accttatgtt gtccaagaca catttaacaa attttcctca gatctattgc
     1801 tgcttggatt cctctgaagt ttttattaag ctttactgat aacttctgtc aaaccagatc
     1861 aacttttttt tttagactgc gatgatatcc gccagaaaca cacttctggt gccaaaagtg
     1921 gcattttcaa aatcaagcca gagggatcca ataaggtttt gtcagtttat tgcgaccaag
     1981 agaccacttt gggaggatgg ctattgatcc aacagagaat ggatggatca gtgaatttta
     2041 accgtacgtg gcaagactac aggagaggtt tcggcagcgt ggatggcaaa gggcaaggag
     2101 agttgtggct gggcaatgag aatatccact tgctgactca gaacgatact ctgcttcgag
     2161 tagagttaga ggactgggat ggaaatgctg catatgcaga gtacatcgtg caggtaggga
     2221 ctgaagcaga aggctatgcc ctgactgtgt cctcctatga ggggactgct ggggatgctc
     2281 tggttgctgg ctggctggag gagggctctg agtacacatc ccatgcccag atgcagttca
     2341 gcacttttga ccgggaccag gaccactggg aggagagctg tgcagaggtg tacgggggtg
     2401 gctggtggta caacagctgc caggcagcca acctcaatgg catttactat ccagggggcc
     2461 attatgaccc caggtacaat gttccatatg agatcgagaa tggagtagtc tggataccat
     2521 ttagagcctc cgattattcc ctcaaagttg ttagaatgaa aatcagaccc ctggaaaccc
     2581 tgtagaagga cgggcattta atatatgtgt cacaactaaa agttggaaac atcttttata
     2641 tacctgtgtg cacgatgtac ctttacccta tgaatttgaa ggcagcatgg cacatctatg
     2701 gcatacaaaa taaacggatt aattattaac
//
LOCUS       ECOHYA       6023 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  E.coli hya operon encoding hydrogenase isozyme 1 large and small
            subunit (hyaB and A) genes and hyaC-F protein genes, complete cds.
ACCESSION   M34825
KEYWORDS    hyaA gene; hyaB gen; hyaC gene; hyaD gene; hyaE gene; hyaF gene;
            hydrogenase.
SOURCE      E.coli (strain LE392) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 6023)
  AUTHORS   Menon,N.K., Robbins,J., Peck,H.D.Jr., Chatelus,C.Y., Choi,E.-S. and
            Przybyla,A.E.
  TITLE     Cloning and sequencing of a putative Escherichia coli [NiFe]
            hydrogenase-1 operon containing six open reading frames
  JOURNAL   J. Bacteriol. 172, 1969-1977 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        188     1306     hydrogenase-1 small subunit (hyaA) precursor
    sigp        188      322     hydrogenase-1 small subunit (hyaA) signal
                                 peptide
    matp        323     1303     hydrogenase-1 small subunit (hyaA)
    pept       1303     3096     hydrogenase-1 large subunit (hyaB)
    pept       3115     3822     hyaC protein
    pept       3819     4406     hyaD protein
    pept       4403     4801     hyaE protein
    pept       4798     5655     hyaF protein
    signal     5657     5683     pot. transcription termination signal
BASE COUNT     1334 a   1607 c   1720 g   1362 t
ORIGIN      
        1 ttttgtgcaa aagtttcact acgctttatt aacaatactt tctggcgacg tgcgccagtg
       61 cagaaggatg agctttcgtt ttcagcatct cacgtgaagc gatggtttgc cttgctacag
      121 ggacgtcgct tgccgaccat aagcgcccgg tgtcctgccg gtgtcgcaag gaggagagac
      181 gtgcgatatg aataacgagg aaacatttta ccaggccatg cggcgtcagg gcgttacccg
      241 gcgcagcttt ctcaaatatt gtagtctggc tgccacgtcg ctgggattag gcgcgggaat
      301 ggcaccaaag attgcctggg cgctggagaa caaaccgcgc attccggtgg tatggatcca
      361 cggtctggaa tgcacctgct gtaccgaatc ttttatccgc tccgctcacc cactggcgaa
      421 ggacgtcatc ctttccctga tttccctcga ttacgacgat actttgatgg ctgccgccgg
      481 aacccaggcg gaagaagtct ttgaagacat catcacgcaa tacaatggca aatatatcct
      541 cgcagtagaa ggtaatccgc cgctgggcga gcaggggatg ttctgtatca gcagcggtcg
      601 accgtttatt gagaaactca aacgtgccgc tgccggagcc agcgcgatta tcgcctgggg
      661 aacctgcgcg tcctggggct gcgtgcaggc cgcgcgaccc aatccgacgc aggcaacgcc
      721 tatcgacaaa gtcatcaccg acaaacccat tatcaaagta cctggctgcc cgccgatccc
      781 ggatgtgatg agcgccatca ttacttacat ggtgaccttt gatcgcttgc cagatgtcga
      841 cagaatgggc cgtccgctga tgttctatgg tcagcgaatc cacgataaat gctatcgccg
      901 cgcccacttc gacgccggag agttcgtcca gagttgggat gatgacgctg cccgcaaagg
      961 ttactgcctg tacaaaatgg gctgcaaagg gcctaccacc tataacgcct gttcctccac
     1021 acgctggaat gatggcgttt ctttcccaat ccagtctggt cacggctgcc tgggctgtgc
     1081 ggaaaatggt ttctgggatc gcggttcgtt ctacagccgc gtggtcgata ttccgcaaat
     1141 gggtactcat tccaccgccg ataccgtcgg tttaaccgcg cttggcgtgg tggcagcggc
     1201 tgttggtgtg cacgcagtcg ccagcgccgt tgaccagcgc agacgtcata accagcaacc
     1261 tacagaaacc gaacatcagc caggcaatga ggataaacag gcatgagcac tcagtacgaa
     1321 actcagggat acaccatcaa taatgccgga cgccgcctgg tggtcgaccc gattacgcgc
     1381 atcgaaggcc acatgcgctg cgaagtgaat attaacgatc agaatgtgat caccaatgcc
     1441 gtctcctgcg gcaccatctt tcgcgggctg gagatcatcc tacaagggcg cgacccgcgc
     1501 gatgcgcggg cgttcgttga acgtatctgc ggcgtctgta ctggcgtaca cgccctggct
     1561 tcggtttacg ccatcgaaga tgctatcggt attaaagtgc cggacaacgc caatatcatc
     1621 cgcaacatta tgctggcaac gctctggtgc cacgatcatc tggtgcactt ctatcagctt
     1681 gccgggatgg actggatcga tgtgttagat gcgctgaaag ccgacccgcg gaaaacctcc
     1741 gaactggcgc aaagtctctc ctcttggccg aaatcatccc ctggctattt cttcgacgta
     1801 caaaaccgcc tgaaaaaatt tgttgaaggc gggcagttgg ggatcttccg caatggctac
     1861 tgggggcacc cgcagtacaa actgccgcca gaagctaacc tgatgggctt tgcccactat
     1921 ctcgaagctc tcgatttcca gcgtgaaatt gtcaaaatcc acgcggtctt tggcggtaaa
     1981 aacccgcatc caaactggat tgtcggcggg atgccttgcg ccatcaacat tgacgaaagc
     2041 ggcgcggtcg gggcagtcaa tatggaacgc ctgaacctgg tgcagtcaat tatcacccgc
     2101 acggcggact tcattaacaa cgtgatgatc cccgacgcct tagccatcgg tcagttcaac
     2161 aaaccgtgga gcgaaatcgg cactggtctt tctgataaat gcgttctcag ctacggcgca
     2221 ttcccggata ttgccaacga ctttggcgag aaaagtctgc tgatgcctgg cggcgcggtg
     2281 attaacggcg acttcaacaa tgtgctgcca gtggatttgg ttgatccgca gcaggtgcag
     2341 gagtttgtcg accacgcctg gtatcgatat cccaacgatc aggtcgggcg tcatccgttc
     2401 gatggcatca ccgacccgtg gtacaacccc ggcgatgtca aaggcagcga taccaacatt
     2461 cagcagctga atgaacagga acgctactcg tggatcaaag cgccacgctg gcgcggtaac
     2521 gcgatggaag tggggccgct ggcgcgcacg ttaatcgctt atcacaaagg cgatgctgcg
     2581 accgttgagt cggtcgatcg catgatgtcg gcgttgaacc tgccgctttc cggtatccag
     2641 tcaacgttag gccgcatttt gtgccgcgcg cacgaagcgc agtgggccgc aggtaagttg
     2701 cagtatttct tcgacaagct gatgaccaac ctgaaaaacg gcaatctcgc cactgcttcc
     2761 acggaaaaat gggaacctgc aacctggccg acagagtgcc gtggtgtcgg ttttaccgaa
     2821 gcgccgcgcg gggcgttagg ccactgggcc gccattcgcg atggcaagat tgatctctac
     2881 cagtgcgtgg tgccgaccac ctggaacgcc agcccgcgcg atcccaaagg gcagattggc
     2941 gcttatgaag cggcgctgat gaacaccaaa atggcgatcc ccgagcaacc gctggagatc
     3001 ctgcgtactc tgcacagctt tgacccgtgc ctcgcctgtt caacacacgt gctgggcgac
     3061 gacggtagcg agctgatctc cgtgcaggtg cgttaacagc gaaggagaat catcatgcaa
     3121 cagaaaagcg acaacgttgt cagccactat gtctttgaag cgccagtgcg catctggcac
     3181 tggttgacgg tgttatgcat ggcggtgttg atggtcaccg gatactttat cggcaagccg
     3241 ctaccttccg tcagcggcga ggcgacgtat ctgttctata tgggctacat caggttaatt
     3301 cacttcagcg ccgggatggt ttttaccgtg gttttgctga tgcggatcta ctgggctttt
     3361 gttggcaatc gatactcccg cgagctgttt atcgtgccgg tatggcgtaa aagctggtgg
     3421 cagggcgtgt ggtatgaaat ccgctggtat ctgtttctgg caaaacgtcc gagtgccgat
     3481 ataggccata atcccatcgc ccaggcggcg atgttcggct atttcctgat gtcggtcttt
     3541 atgatcatca ctggttttgc gctgtacagc gaacacagcc agtacgctat ttttgcgccg
     3601 ttccgttatg tggtggaatt tttctactgg acgggtggca actcaatgga cattcacagc
     3661 tggcatcggc tggggatgtg gctgattggc gcgtttgtga tcggtcatgt ctacatggcg
     3721 ctgcgtgaag acatcatgtc cgacgacacg gtgatctcca ccatggtcaa cggctaccgt
     3781 agccacaaat ttggcaaaat aagtaacaag gagcgttcat gagcgagcaa cgcgtggtgg
     3841 tcatggggct gggcaacctg ctgtgggccg atgaaggctt cggcgtgcgg gtggcggaac
     3901 ggctgtatgc ccattaccac tggcccgagt atgtggagat tgtcgatggc ggtactcagg
     3961 gactgaactt gctggggtat gtcgaaagcg ccagccatct gttgattctc gatgccattg
     4021 actacgggct ggaacctgga acgctgcgaa cctatgccgg agaacgcatt ccggcttatc
     4081 tcagcgcgaa gaaaatgagc ctgcatcaga acagtttctc cgaagtgttg gcgctggcgg
     4141 atatccgcgg acatctgcca gcacatattg ccctcgtcgg tctgcaaccc gcaatgctcg
     4201 acgactacgg cggtagcctg agcgaactgg cacgggagca actgcccgct gcggaacagg
     4261 cggcgctggc gcagcttgct gcgtggggaa ttgtgccgca accggctaat gaatcgcgct
     4321 gtctcaatta tgactgtctg tcgatggaaa attacgaagg cgttcgcttg cgccagtacc
     4381 ggatgacaca ggaggagcag ggatgagcaa cgacacgcca tttgatgcgt tgtggcaacg
     4441 aatgctggcg cgcggctgga cgccagtcag tgaatcccgt cttgacgact ggcttacgca
     4501 agcgccagac ggcgtggtgt tattaagcag tgacccgaaa cgcacgccag aggtcagcga
     4561 taatccggta atgattggcg aattactgcg cgagtttccc gactatacat ggcaggtggc
     4621 gattgctgac cttgagcaga gcgaagccat cggcgatcgc tttggcgtct ttcgctttcc
     4681 tgccacttta gtgtttaccg gcggaaacta tcgcggcgtg ctgaatggta ttcacccgtg
     4741 ggcggaactg ataaacctga tgcgcgggct tgtcgaaccg cagcaggagc gtgcctcatg
     4801 agcgaaactt ttttccatct gctggggcca ggaacgcaac cgaacgatga cagtttcagc
     4861 atgaatccac tgccgatcac ctgtcaggtg aatgatgaac cgagtatggc ggccctggag
     4921 caatgtgctc acagcccgca ggtgattgcg ctgttaaacg agttacaaca tcaactaagc
     4981 gaacgccaac cgccgttggg cgaggtgctg gcagtcgatc tgttaaatct caacgccgac
     5041 gatcgtcact ttatcaatac gcttctcggg gaaggggaag tgtcagtgcg cattcagcag
     5101 gctgacgaca gtgaaagtga aatacaggag gcgatcttct gcggattatg gcgggtgcgc
     5161 agacgtcgcg gcgaaaagtt gctggaggac aaactggagg ctggctgcgc gccgctggcg
     5221 ttgtggcagg cggcaacgca aaatctcttg ccgacagatt cgctgttacc gccgcccatt
     5281 gatggcctga tgaatggcct accgttggcg catgagttac tggcacatgt acgtaacccc
     5341 gacgcgcagc cgcacagcat taatctgacg caattaccca tcagcgaggc tgatcggctt
     5401 tttctctcac gtctctgtgg gccgggaaat attcagattc gtaccattgg ctatggcgag
     5461 agctatatca acgccacggg gttacgccat gtctggcatt tacgctgtac ggacacctta
     5521 aaaggcccgt tactggaaag ttatgaaatc tgcccaatac cggaagtggt gctggcagcg
     5581 ccagaagatt tggtcgactc tgcgcagcgg cttagcgagg tatgtcagtg gctggcggaa
     5641 gctgcaccga cgtaaaaaga cggtaagtat cgctttcagt cttatgaata tcgcaatcgg
     5701 cgaatacctc tggtcgtaga gtttcaggat aaagagggag atctaccatt atcgggttat
     5761 ttttctctct tcgcctacag gagtgcgcat gtgggatgtc attgatttat cgcgctggca
     5821 gtttgctctg accgcgctgt atcacttttt atttgtaccc cttaccctgg gctgattttt
     5881 ttgctggcta ttatggaaac catttacgtg gtcaccggca aaaacaatct accggcaaaa
     5941 caatctaccg cgatatgacg cgcttctggg gtaagctctt cggtatcaat tttgctcttg
     6001 gcgtggctac ccggcctaac cat
//
LOCUS       HUMFDXC      1174 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human ferredoxin mRNA, complete cds.
ACCESSION   M34788
KEYWORDS    ferredoxin.
SOURCE      Human, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1174)
  AUTHORS   Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
  TITLE     Structure, sequence, chromosomal location, and evolution of the
            human ferredoxin gene family
  JOURNAL   DNA Cell Biol. 9, 205-212 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         19      573     ferredoxin
BASE COUNT      337 a    211 c    267 g    359 t
ORIGIN      Chromosome 11q13-qter.
        1 agttcccgac cgcgggcgat ggctgccgct gggggcgccc ggctgctgcg cgccgcttct
       61 gctgtcctcg gcggcccggc cggccggtgg ctgcaccacg ctgggtcccg cgctggatcc
      121 agcggcctgc tgaggaaccg ggggccgggc ggtagcgcgg aggcgagccg gtcgctgagc
      181 gtgtcggcgc gggcccggag cagctcagaa gataaaataa cagtccactt tataaaccgt
      241 gatggtgaaa cattaacaac caaaggaaaa gttggtgatt ctctgctaga tgttgtggtt
      301 gaaaataatc tagatattga tggctttggt gcatgtgagg gaaccctggc ttgttcaacc
      361 tgtcacctca tctttgaaga tcacatatat gagaagttag atgcaatcac tgatgaggag
      421 aatgacatgc tcgatctggc atatggacta acagacagat cacggttggg ctgccaaatc
      481 tgtttgacaa aatctatgga caatatgact gttcgagtgc ctgaaacagt ggctgatgcc
      541 agacaatcca ttgatgtggg caagacctcc tgaactagaa caaataggaa tattttcatg
      601 gaattttacc tatttttata attattattt cttaaagtga ttaaatgaga acatggatga
      661 gtggacttca tattatgact agctttacta ttttaattca ccttgcataa ctactgaatt
      721 ttgtcattct tgaaagtatg caatttttat tttggttata ttacaaaaat gtcaatcaaa
      781 tattaaaaaa tagttaatgt gatagaaaaa ccttacatat ttttttctta tgtttgttta
      841 gcgactttag caaaatgttt tcatataatc tcatctgttt acctagaaga taggttaagg
      901 aaatatatta ttattcctgt ttgatgtggg tgaaggcaga gatctaacct ggcttgttta
      961 gggccatacc actaattaga aaatctgtgc tagaacctgt gtcttattcc tataagctat
     1021 gtgttcagac tgaaactgga gaaattatga ctattttatt tatagtagta gttaaatctg
     1081 aatgtgtatg gacaaaaata tttaattgct gagtaaactg cttaacttca aagatagtta
     1141 ttgaccttat aaataaatat ttcaaaattt tgat
//
LOCUS       HUMFDXPSA     676 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human ferredoxin pseudogene, complete cds.
ACCESSION   M34787
KEYWORDS    ferredoxin; pseudogene.
SOURCE      Human DNA, clone h2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 676)
  AUTHORS   Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
  TITLE     Structure, sequence, chromosomal location, and evolution of the
            human ferredoxin gene family
  JOURNAL   DNA Cell Biol. 9, 205-212 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps      19      569     pseudo-ferredoxin
BASE COUNT      171 a    149 c    192 g    164 t
ORIGIN      Chromosome 20.
        1 agttcccgac cgcgggcgat ggctgccgct gggggcgccc ggctgctgcg cgccgcttct
       61 gctgtcctcg gcgacccggc cggccggtgg ctgcaccacg ccgggtcccg cgctggagcc
      121 agcggcctgc tgaggagccg gggaccgggc cggagcgcgg aggcaagccg gccgctgagc
      181 gtgtcggcgg gggcgcggag cagctcagaa gataaagcga cagtccactt tataaactgt
      241 gatggtgaaa cattaacaac caaaggaaaa gttggtgatt ctctgctaga cgttgtggtt
      301 gaaaataatc cagatattga tggctttggt gcatgtgagg gaactctaac ttgtttaacc
      361 tgtcatctca tctttgaaga tcacatatat gagaagttag atgcaatcac tgatgaggag
      421 aatcacatgc tcgatctggc atatggacta acagatcaca gttgggctgc caaatctgtt
      481 tgacaaaatc tatggacaat atgactgttc gagtgcctga aacagtggct gatgccagac
      541 aatccattga tgtgggcaag acctcctgaa ctagaacaaa taggaatatt ttcatggaat
      601 tttacctatt tttataatta ttatttaagt gataaaatga gtacatggat gagtggattc
      661 tgtattatga ctagct
//
LOCUS       HUMFDXPSB     976 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human ferredoxin pseudogene, complete cds.
ACCESSION   M34789
KEYWORDS    ferredoxin; pseudogene.
SOURCE      Human DNA, clone h3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 976)
  AUTHORS   Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
  TITLE     Structure, sequence, chromosomal location, and evolution of the
            human ferredoxin gene family
  JOURNAL   DNA Cell Biol. 9, 205-212 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps <     1      386     pseudo-ferredoxin
BASE COUNT      340 a    156 c    168 g    312 t
ORIGIN      Chromosome 21.
        1 ttcagtgaag ggcccagagc agctcggaag ataaaataac agtccacttt ataaaccaac
       61 agtgaaacat taacaaacaa aggaaaagtt gatgactctt tgctagatgt tgtggttgaa
      121 aataatgtag atattgatgg gtttgttgca tggcagggaa ccttggctct gcttgtcacc
      181 tcatttttga aaagcacata tttgagaagt tagacacaat cactgacaag gagattgaca
      241 tgcttgatct ggcatatgga ctgacagaca ggtcagagtt gtattgccaa atctgtttga
      301 caaaatctat gggccatatg actgtctgag tacctgatgg agtggccagt gccagacaat
      361 ccactgatat gggcaagacc acctaagcag aataaatagc aatgctttca ctaaatttta
      421 cctattttat aattatttct tatataatta aataaaaaca tacatgaatg aacttattat
      481 tatgaccaac tttactgttt taattcacct tgtataacta ctgaatttta tagttctgaa
      541 aatacgcaat ttttatattt gttacattag aaaaaggtca gttaaatatt agaaaataat
      601 taatatgaca ataccttaca aattttgcct tacgtttgtt tagcaacttt tagcaaaatg
      661 ttttcacatg gccttatgtc tgcttaccta gaaagtaaat tttttaaaaa agatattatt
      721 atccctgttt tatgtgagtg aaggcaaaga cctaaaatgg cttgttaagg gccatccaac
      781 taattagaaa atatgtacac acctgcatct tgttactata aggtatatgt tagactgaaa
      841 ctggagaaat tatgactatc ttacagcagc aattagatct gactgcatat ggacaaaaac
      901 attaattgct cagtgaactg cttaactgac agttattaac cttacacata aagatttcaa
      961 aattttgatt cagtga
//
LOCUS       HUMFRDXA1     350 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human ferredoxin gene, exon 3.
ACCESSION   M34783
KEYWORDS    ferredoxin.
SEGMENT     1 of 2
SOURCE      Human DNA, clone h4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 350)
  AUTHORS   Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
  TITLE     Structure, sequence, chromosomal location, and evolution of the
            human ferredoxin gene family
  JOURNAL   DNA Cell Biol. 9, 205-212 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    45  +   174     ferredoxin, exon 3
    IVS     <     1       44     ferredoxin intron B
    IVS         175  >   350     ferredoxin intron C
BASE COUNT       99 a     56 c     78 g    117 t
ORIGIN      Chromosome 11q13-qter.
        1 ctgatgtaga agggactatg ttcagtgttt gttgcttttg tcaggtgcat gtgagggaac
       61 cctggcttgt tcaacctgtc acctcatctt tgaagatcac atatatgaga agttagatgc
      121 aatcactgat gaggagaatg acatgctcga tctggcatat ggactaacag acaggtaaga
      181 tttttggact gcttcaattg taataataat ctgggaacat agatgttgta ttatgttgtc
      241 tatgtaagac cagacccata aatgtaatag tgttctacca ggttaaataa cagtcacaga
      301 ttttgagtat cagatatgtg gcatttctct ttgttttcac tgttctacag
//
LOCUS       HUMFRDXA2    1260 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human ferredoxin gene, exon 4.
ACCESSION   M34785
KEYWORDS    ferredoxin.
SEGMENT     2 of 2
SOURCE      Human DNA, clone h4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1260)
  AUTHORS   Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
  TITLE     Structure, sequence, chromosomal location, and evolution of the
            human ferredoxin gene family
  JOURNAL   DNA Cell Biol. 9, 205-212 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    33      147     ferredoxin, exon 4
    IVS     <     1       32     ferredoxin intron C
BASE COUNT      410 a    196 c    241 g    413 t
ORIGIN      About 5.3 kb after segment 1; chromosome 11q13-qter.
        1 actaaaccat accttccccc ttttccatac agatcacggt tgggctgcca aatctgtttg
       61 acaaaatcta tggacaatat gactgttcga gtgcctgaaa cagtggctga tgccagacaa
      121 tccattgatg tgggcaagac ctcctgaact agaacaaata ggaatatttt catggaattt
      181 tacctatttt tataattatt atttcttaaa gtgattaaat gagaacatgg atgagtggac
      241 ttcatattat gactagcttt actattttaa ttcaccttgc ataactactg aattttgtca
      301 ttcttgaaag tatgcaattt ttattttggt tatattacaa aaatgtcaat caaatattaa
      361 aaaatagtta atgtgataga aaaaccttac atattttttt cttatgtttg tttagcgact
      421 ttagcaaaat gttttcatat aatctcatct gtttacctag aagataggtt aaggaaatat
      481 attattattc ctgtttgatg tgggtgaagg cagagatcta acctggcttg tttagggcca
      541 taccactaat tagaaaatct gtactagaac ctgtgtctta ttcctacaag ctatgtgttc
      601 agactgaaac tggagaaatt atgactattt tatttatagt agtagttaaa tctgaatgtg
      661 tatggacaaa aatatttaat tgctcagtaa actgcttaac ttcaaagata gttattaacc
      721 ttataaataa atatttcaaa attttgattc ggaagactaa gtctggacgt agacattata
      781 atgctatcaa agaagtttga tctctgtttt gactaaacta gaggaaaaat gattggatgt
      841 gtttattctt ttctaagcag aatggtttaa ctttgtactc tttgaaaaat aatgctgatt
      901 tataaatctc tgcctataac agaatggaaa ccttatgaat gaattgtgtt tctctgtcct
      961 gagctggaga agggaatgag caggctgaca cgttgcacag ccccaggtgg cgccattctc
     1021 tcacgcaagg atggggctgc agggtgagca gcgtgggctg cagtgtgtca gtcccaggag
     1081 tgagggagtg gcaagcacca cagattacca cgtatgtgtg gaagacattc gtacccttat
     1141 ctttactata aataaattca taaaagttaa caaaggggta cacagtatgg tctttggaaa
     1201 tataataaaa catcaactaa cttggactaa ttgtgaggaa gagcagaaca aattagtaga
//
LOCUS       HUMFRDXB1     350 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human ferredoxin gene, exon 3.
ACCESSION   M34784
KEYWORDS    ferredoxin.
SEGMENT     1 of 2
SOURCE      Human DNA, clone h1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 350)
  AUTHORS   Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
  TITLE     Structure, sequence, chromosomal location, and evolution of the
            human ferredoxin gene family
  JOURNAL   DNA Cell Biol. 9, 205-212 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    45  +   174     ferredoxin, exon 3
    IVS     <     1       44     ferredoxin intron B
    IVS         175  >   350     ferredoxin intron C
BASE COUNT       99 a     56 c     78 g    117 t
ORIGIN      Chromosome 11q13-qter.
        1 ctgatgtaga agggactatg ttcagtgttt gttgcttttg tcaggtgcat gtgagggaac
       61 cctggcttgt tcaacctgtc acctcatctt tgaagatcac atatatgaga agttagatgc
      121 aatcactgat gaggagaatg acatgctcga tctggcatat ggactaacag acaggtaaga
      181 tttttggact gcttcaattg taataataat ctgggaacat agatgttgta ttatgttgtc
      241 tatgtaagac cagacccata aatgtaatag tgttctacca ggttaaataa cagtcacaga
      301 ttttgagtat cagatatgtg gcatttctct ttgttttcac tgttctacag
//
LOCUS       HUMFRDXB2    1269 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human ferredoxin gene, exon 4.
ACCESSION   M34786
KEYWORDS    ferredoxin.
SEGMENT     2 of 2
SOURCE      Human DNA, clone h1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1269)
  AUTHORS   Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C.
  TITLE     Structure, sequence, chromosomal location, and evolution of the
            human ferredoxin gene family
  JOURNAL   DNA Cell Biol. 9, 205-212 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    32      146     ferredoxin, exon 4
    IVS     <     1       31     ferredoxin intron C
BASE COUNT      407 a    197 c    246 g    419 t
ORIGIN      About 5.3 kb after segment 1; chromosome 11q13-qter.
        1 actaaaccat accttcccct tttccataca gatcacggtt gggctgccaa atctgtttga
       61 caaaatctat ggacaatatg actgttcgag tgcctgaaac agtggctgat gccagacaat
      121 ccattgatgt gggcaagacc tcctgaacta gaacaaatag gaatattttc atggaatttt
      181 acctattttt ataattatta tttcttaaag tgattaaatg agaacatgga tgagtggact
      241 tcatattatg actagcttta ctattttaat tcaccttgca taactactga attttgtcat
      301 tcttgaaagt atgcaatttt tattttggtt atattacaaa aatgtcaatc aaatattaaa
      361 aaatagttaa tgtgatagaa aaaccttaca tatttttttc ttatgtttgt ttagcgactt
      421 tagcaaaatg ttttcatata atctcatctg tttacctaga agataggtta aggaaatata
      481 ttattattcc tgtttgatgt gggtgaaggc agagatctaa cctggcttgt ttagggccat
      541 accactaatt agaaaatctg tgctagaacc tgtgtcttat tcctataagc tatgtgttca
      601 gactgaaact ggagaaatta tgactatttt atttatagta gtagttaaat ctgaatgtgt
      661 atggacaaaa atatttaatt gctgagtaaa ctgcttaact tcaaagatag ttattgacct
      721 tataaataaa tatttcaaaa ttttgattcg gaagactaag tctggacgta gacattataa
      781 tgctatcaaa gaagtttgat ctctgttttg actaaactag aggaaaaatg attggatgtg
      841 tttattcttt tctaagcaga atggtttaac tttgtactct ttgaaaaata atgctgattt
      901 ataaatctct gcctataaca gaatggaaac cttatgaatt gtttgtgttt ctctgtcctg
      961 agctcggtac ccctggagaa gggaatgagc aggctgacac gttgcacagc cccaggtggc
     1021 gccattctct cacgcaagga tggggctgca gggtgagcag cgtgggctgc agtgtgtcag
     1081 tcccaggagt gagggagtgg caagcaccac agattaccac gtatgtgtgg aagacattcg
     1141 tactcttatc tttactataa ataaattcat aaaagttaac aaaggggtac acagtatggt
     1201 ctttggaaat ataataaaac atcaactaac ttggactaat tgtgaggaag agcagaacaa
     1261 attagtaga
//
LOCUS       RATC1H4SY    3146 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Rat cytoplasmic C-1-tetrahydrofolate synthase mRNA, complete cds.
ACCESSION   J05519
KEYWORDS    C-1-tetrahydrofolate synthase.
SOURCE      Rat adult liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3146)
  AUTHORS   Thigpen,A.E., West,M.G. and Appling,D.R.
  TITLE     Rat C-1-tetrahydrofolate synthase: cDNA isolation, tissue-specific
            levels of the mRNA, and expression of the protein in yeast
  JOURNAL   J. Biol. Chem. 265, 7907-7913 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.R.Appling, 18-MAY-1990.
FEATURES       from  to/span     description
    pept        121     2928     C-1-tetrahydrofolate synthase
    mRNA    <     1     3146     C-1-tetrahydrofolate synthase mRNA
    signal     3138     3143     poly-A signal
BASE COUNT      845 a    759 c    845 g    697 t
ORIGIN      
        1 cgcactgagt ttccccggga gatccctgag cacgtgggtt gtgctgtccg gtaagcgcac
       61 ggagggactg gaacctgaat tttgctggtg ttcttcgtgg gtcgcagact agctaaggcc
      121 atggcaccag cgggaatcct gaatggaaaa gtggtctccg cgcaaatcag gaatctactg
      181 aagacccagt ttactcagat gcaggagcag gtacctggct tcacccctgg cctggcaatc
      241 ctgcaggttg gcgacagaga tgattccaat ctttatataa atgtgaagct gaaggctgct
      301 caagagatcg ggatcaaagc cactcacatt aaattaccaa gaacttccac agagtcggag
      361 gtgttaaaat acgtcatctc cctgaatgag gatgccactg tgcacggatt catagtacag
      421 ctgcctttag actccgagaa ttccattaac acagaggcag tcatcaatgc cattgcccct
      481 gagaaggatg tggacgggtt gacgagcatc aatgccggga agcttgccag aggtgatcta
      541 aaggactgct tcattccgtg cacacccaaa ggatgcttgg aactcatcaa agagacagga
      601 gtgcagattg caggaaggca cgccgtggtg gtcgggcgaa gtaaaatagt tggtgcaccc
      661 atgcacgacc tgcttctgtg gaacaatgcc acggtgacca cctgccactc caagactgcc
      721 gatctggaca aggaggtaaa taaaggagac attctggtgg ttgcaacagg acagcctgag
      781 atggtgaaag gggagtggat caagcctgga gctgtggtca tagactgtgg gatcaattat
      841 gttccagatg atacaaaacc aaatggaagg aaagttgtgg gtgacgtagc atatgacgag
      901 gccaaggaga aggcgagctt tatcacgcct gtccccggtg gcgtggggcc catgaccgtg
      961 gcaatgctga tgcagagcac agtagagagc gcacagcgct tcctgaagaa atttaagcca
     1021 gggaagtgga caattcagta taacaagctg aacctcaaga cgcctgtacc aagtgacatt
     1081 gctatatcac gatcttgcaa acccaagctc atcggtaacc tggcccgaga aattgggcta
     1141 ctcactgagg aggtggaatt gtatggagaa acaaaggcca aggtcttact gtcagcacta
     1201 gatcgcctga agcatcagcc agatgggaaa tacgttgtgg tgactggaat tactccaaca
     1261 cccctgggag aagggaagag cacaaccacc atcgggcttg tgcaagccct tggcgcccac
     1321 ctgcatcaga atgtttttgc gtgtgtgcga cagccttctc agggccccac ctttgggata
     1381 aaaggtggcg ctgcaggagg cggctattcc caggtcattc ctatggaaga gtttaatctc
     1441 cacctcactg gtgacatcca cgccatcact gccgctaata accttgtggc tgctgctatt
     1501 gacgctcgga tatttcatga gctgacccag acagacaagg ctctctttaa tcgtctggta
     1561 ccctcagtaa atggagtaag aaagttctct gacatccaaa tccgcaggtt acggaggcta
     1621 ggcattgaga agactgaccc cgctgcgctg acagatgatg agataaacag atttgcaaga
     1681 ctagatattg atccagaaac cataacatgg cagagagtgc tggatactaa cgatagattc
     1741 ctgaggaaga tcaccattgg acaggctcca accgagaaag gccacacacg cacggcccag
     1801 tttgatatct ctgtggccag tgaaatcatg gcagtcctgg ccctcactag ttctctggaa
     1861 gacatgagag caaggctggg caaaatggtg gtagcgtcca gtaagaaagg ggagcctatc
     1921 agctgtgaag acctgggcgt gagcggggcg ctgacggtgc tcatgaagga tgcgatcaag
     1981 cccaatctca tgcagacctt agagggcacg ccggtatttg tccatgctgg gccttttgcc
     2041 aacattgcac atgggaattc ctccatcatt gcagaccgga ttgcactcaa gctggttggc
     2101 cctgagggct ttgtagtgac agaagcagga ttcggagcag acataggaat ggaaaagttc
     2161 ttcaacatca agtgccggta ttctggtctc cagcctcatg tggtggttct tgttgccact
     2221 gtcagggctc ttaagatgca cgggggtggc cccacggtca ccgctggact gcctcttccc
     2281 aaggcttaca cagaagagga cctggacctg gtggaaaagg gcttcagtaa cttgaggaaa
     2341 cagatagaaa atgctagaat gtttggagtg cctgtcgtgg tggccatgaa tgcattcaag
     2401 acagatacag atactgagct ggacctcatc ggccgcctct ccagagagca tggggctttc
     2461 gatgctgtca agtgcaccca ctgggcagaa gggggccagg gagccttagc cctggctcag
     2521 gctgtccaga gagcttcaca ggcccccagc agcttccagc tcctctatga cctcaagctc
     2581 tcagttgaag ataaaatcag gattattgca cagaagatct acggggccga tgacatcgaa
     2641 ttgctccctg aagcacaaaa caaagcagaa atctacacaa agcagggctt tgggaatcta
     2701 cccatctgca tggccaaaac acacttgtct ttatctcaca acccagagca aaaaggggtg
     2761 cctactggct tcgttctgcc catccgggac atccgggcca gcgttggggc tggtttcctg
     2821 taccctttag taggaacgat gagcacaatg cctggactcc ctactcgacc ctgtttttat
     2881 gatatcgatt tggaccctga aactgaacaa gtgaatggat tgttttaaac agatcttcca
     2941 tctccaagag gctactctgt ctggccagtg tctgttcagg cccactgagg aagtgtgcag
     3001 agaagtcttg aaagtctgtg cctgccctga agagcttcag aaatagtgga agtttctcta
     3061 aagcctttca tagccttaat tcagatcatg tataaattaa cataaatcat gcctatttac
     3121 atagtaaaag tccacagaat aaatga
//
LOCUS       NGOARGF      1260 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  N.gonorrhoeae ornithine transcarbamoylase (argF) gene, complete
            cds.
ACCESSION   M34930
KEYWORDS    ornithine transcarbamoylase.
SOURCE      N.gonorrhoeae (strain CDC 50) DNA.
  ORGANISM  Neisseria gonorrhoeae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1260)
  AUTHORS   Martin,P.R., Cooperider,J.W. and Mulks,M.H.
  TITLE     Sequence of the argF gene encoding ornithine transcarbamoylase from
            Neisseria gonorrhoeae
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.H.Mulks, 06-JUN-1990.
            
                Author address [1]: M.H.Hulks
                               Michigan State University
                               Department of Microbiology and Public Health
                               367 Giltner Hall, East Lansing, MI 43324
                e-mail [1]:    21667CFM@MSU.bitnet
FEATURES       from  to/span     description
    pept        141     1136     ornithine transcarbamoylase (argF) (EC
                                 2.1.3.3)
    binding     128      133     ribosome binding site
    signal     1172     1204     transcription termination signal
    site         39       48     DNA uptake sequence
    site         54       63     DNA uptake sequence
    site        713      722     DNA uptake sequence
    site       1175     1184     DNA uptake sequence
    site       1192     1201     DNA uptake sequence
BASE COUNT      350 a    350 c    319 g    241 t
ORIGIN      
        1 tccccgtcat ccgtatggag taagggattg accgcaatgc cgtctgaaca accttcagac
       61 ggcattgcaa cattccgcta acccttcttt ccgcaaacgc tgcaaatacg gcgttcacgc
      121 cccacataaa ggaaacgaca gtgaacctga aaaaccgcca ttttctgaaa cttttggact
      181 tcacgccgga agaaatcacc gcctacctcg accttgccgc cgagttgaaa gacgccaaaa
      241 aggcagggcg cgagattcag cggatgaaag ggaaaaacat cgccctgatt tttgaaaaaa
      301 catccacgcg cacacgctgt gcgtttgaag tcgccgcacg cgaccaaggc gcggatcgaa
      361 cctatctgga accgtccgcc agccaaatcg ggcacaaaga aagcatcaaa gacacggcgc
      421 gcgtcttagg cagaatgtac gatgccatcg aatatcgcgg cttcgctcag gaaactgtcg
      481 aagaattggc aaaatatgcg ggcgtacccg tgttcaacgg gctgaccaac gagttccatc
      541 ccacacaaat gcttgccgac gcactgacta tgcgcgaaca cagcggcaaa cctttgaacc
      601 aaaccgcgtt tgcctacgtc ggcgacgcgc gttacaacat gggcaattcc ctgctgattt
      661 taggggcaaa attggggatg gacgtgcgta tcggcgcacc tcaaagcctg tggccgtctg
      721 aaggcattat cgccgccgca cacgccgccg ccaaagaaac cggtgcaaaa attaccctga
      781 ccgaaaacgc gcatgaagcc gtcaaaggtg tcggtttcat tcatactgac gtatgggtca
      841 gcatgggcga gccgaaagaa gtctggcagg aacgcatcga tttgctgaaa gattaccgcg
      901 ttacgcccga actgatggcg gcatcgggca atccgcaagt caaattcatg cactgcctgc
      961 ccgccttcca caaccgcgaa accaaagtcg gcgaatggat ttacgaaacc ttcgggctga
     1021 acggtgtgga agttacagaa gaagtattcg aaagtccggc cggcatcgtg ttcgatcagg
     1081 cggaaaaccg tatgcacacg attaaagcgg taatggtcgc ggctctgggc gactgacaga
     1141 actgtgcctg tttaaattca tccgcaacac agataccgtc tgaacacgat gttcagacgg
     1201 tatccatata acaaactgcc tacacgatgt gtaggcagtc ccgtttgaaa acaatcagtt
//
LOCUS       PFACSPA      1218 bp ds-DNA             INV       17-JUL-1990
DEFINITION  P.falciparum CS gene encoding circumsporozoite protein, complete
            cds.
ACCESSION   M22982 J04650
KEYWORDS    circumsporozoite protein.
SOURCE      P.falciparum (strain NF54) DNA, clone 1.3A.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 1218)
  AUTHORS   Caspers,P., Gentz,R., Matile,H., Pink,J.R. and Sinigaglia,F.
  TITLE     The circumsporozoite protein gene from NF54, a Plasmodium
            falciparum isolate used in malaria vaccine trials
  JOURNAL   Mol. Biochem. Parasitol. 35, 185-190 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            provided by P.Caspers, 14-MAR-1989.
FEATURES       from  to/span     description
    pept          1     1218     circumsporozoite protein
BASE COUNT      531 a    259 c    168 g    260 t
ORIGIN      
        1 atgatgagaa aattagctat tttatctgtt tcttcctttt tatttgttga ggccttattc
       61 caggaatacc agtgctatgg aagttcgtca aacacaaggg ttctaaatga attaaattat
      121 gataatgcag gcactaattt atataatgaa ttagaaatga attattatgg gaaacaggaa
      181 aattggtata gtcttaaaaa aaatagtaga tcacttggag aaaatgatga tggaaataac
      241 gaagacaacg agaaattaag gaaaccaaaa cataaaaaat taaagcaacc agcggatggt
      301 aatcctgatc caaatgcaaa cccaaatgta gatcccaatg ccaacccaaa tgtagatcca
      361 aatgcaaacc caaatgtaga tccaaatgca aacccaaatg caaacccaaa tgcaaaccca
      421 aatgcaaacc caaatgcaaa cccaaatgca aacccaaatg caaacccaaa tgcaaaccca
      481 aatgcaaacc caaatgcaaa cccaaatgca aacccaaatg caaacccaaa tgcaaaccca
      541 aatgcaaacc caaatgcaaa cccaaatgca aaccccaatg caaatcctaa tgcaaaccca
      601 aatgcaaacc caaacgtaga tcctaatgca aatccaaatg caaacccaaa cgcaaacccc
      661 aatgcaaatc ctaatgcaaa ccccaatgca aatcctaatg caaatcctaa tgccaatcca
      721 aatgcaaatc caaatgcaaa cccaaacgca aaccccaatg caaatcctaa tgccaatcca
      781 aatgcaaatc caaatgcaaa cccaaatgca aacccaaatg caaaccccaa tgcaaatcct
      841 aataaaaaca atcaaggtaa tggacaaggt cacaatatgc caaatgaccc aaaccgaaat
      901 gtagatgaaa atgctaatgc caacagtgct gtaaaaaata ataataacga agaaccaagt
      961 gataagcaca taaaagaata tttaaacaaa atacaaaatt ctctttcaac tgaatggtcc
     1021 ccatgtagtg taacttgtgg aaatggtatt caagttagaa taaagcctgg ctctgctaat
     1081 aaacctaaag acgaattaga ttatgcaaat gatattgaaa aaaaaatttg taaaatggaa
     1141 aaatgttcca gtgtgtttaa tgtcgtaaat agttcaatag gattaataat ggtattatcc
     1201 ttcttgttcc ttaattag
//
LOCUS       NPA25K        900 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Autographa californica nuclear polyhedrosis virus 25K protein gene,
            complete cds.
ACCESSION   M22537 M23428
KEYWORDS    25K protein.
SOURCE      Autographa californica nuclear polyhedrosis virus DNA, clone
            HindIII-I.
  ORGANISM  Autographa californica nuclear polyhedrosis virus
            Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; 
            Nuclear polyhedrosis viruses.
REFERENCE   1  (bases 1 to 900)
  AUTHORS   Beames,B. and Summers,M.D.
  TITLE     Location and nucleotide sequence of the 25K protein missing from
            baculovirus few polyhedra (FP) mutants
  JOURNAL   Virology 168, 344-353 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.D.Beames, 10-FEB-1989.
            
            The 25K gene product is not needed for replication of the virus in
            tissue culture.  It is believed to be important in the viral
            occlusion process but this has not yet been experimentally shown.
            Disruption of the gene leads to few polyhedra (FP) phenotype.
FEATURES       from  to/span     description
    pept        115      753     25K protein
    recomb       63       64     IFP1.6 insertion site
    recomb      586      587     IFP2.2 insertion site
BASE COUNT      334 a    151 c    165 g    250 t
ORIGIN      Map position 36.6 to 37.5.
        1 aagaaaacat gattacgtct gtacacgaac gcgtattaac gcagagtgca aagtataaga
       61 gggttaaaaa atatatttta cgcaccatat acgcatcggg ttgatatagt taatatggat
      121 caatttgaac agttgattaa cgtgtctctg ctcaagtcgt tgatcaaaac gcaaatcgac
      181 gaaaatgtgt cggacaatat caagtcgatg agcgaaaaac taaaaaggct agaatacgac
      241 aatctcacag acagcgttga gatatacggt attcacgaca gcaggctgaa taataaaaaa
      301 attagaaact attatttaaa aaagatttgc gctttactcg acctaaactt taaacacgtc
      361 atagaatctt cgtttgacaa aaaccacatt gtggccaagc tgtgtgacgc gacgcgcgct
      421 aaagaatggc aaaccaagtc gcgcgagcgt cgacttaaaa attttaattt aaacattaat
      481 tacgacgggc ccgtaaaaat atttgtggcc gccacagcgg agcaaaagct gttacttaaa
      541 aaaactcgcg acgctctgtt gccgttttac aaatacattt ccatttgtaa aaacggtgtt
      601 atggtgagac gcgacgaaaa gagtcgcgtg tttattgtta aaaatgagca aaatatagaa
      661 tatctcaaag ccaacaaata ttacgctttt catagcgata gcgtcgataa ttttgagtct
      721 gaaaacgata gcgaaaaaat gcttcaaatt taattaaatt taattaataa ttaattttag
      781 tatcaagttg aatctcgtaa tgtaagaaaa aaaacactaa tattggttgt gattcttgcc
      841 agacgtccat tttatgtgat aaataaaaca caaacacaaa gtttttatat tgtcttttat
//
LOCUS       SHPHSKB3C     413 bp ds-DNA             MAM       17-JUL-1990
DEFINITION  Sheep BIIIB4 high sulfur keratin gene, 3' end.
ACCESSION   M21101 J03062
KEYWORDS    BIIIB high-sulphur wool keratin.
SOURCE      Sheep (Dorset horn strain) DNA, clone pSWK31.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 413)
  AUTHORS   Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E.
  TITLE     The keratin BIIIB gene family: Isolation of cDNA clones and
            structure of a gene and a related pseudogene
  JOURNAL   Genomics 4, 182-191 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            M.J.Frenkel, 24-OCT-1989.
FEATURES       from  to/span     description
    pept    <     1       51     BIIIB4 high-sulfur keratin (AA at 1)
BASE COUNT      112 a     99 c     84 g    118 t
ORIGIN      15 bp upstream of PvuII site.
        1 actacctaca ctcagcccag ctgtgagccc tgcatcccaa gctgctgctg accgacggct
       61 acctcaccca gtgcctgaca gagtcaaccc agaagcttta gtgctcacct gtctcagtac
      121 ctgcaactaa ttatgtctcc gctttcaaag ttggaacaag gcattatcac agacaaccct
      181 cacaaaaaac aaaccaagag actttcgatg gccgtgtagt ggacatcagt gaacaaggac
      241 agctggagta ggtagatgcc tacaggtttc ccagagttgt tcagttcgtt agtgttaaat
      301 tgtatctttc tttggtgctt tgggaaattc gtttccagtc ttgaatggta tctttctgga
      361 aattaaggaa cttcttcatg attattcaat aaagtttaca tctctggtat aat
//
LOCUS       SHPHSKB3D    1080 bp ss-mRNA            MAM       17-JUL-1990
DEFINITION  Sheep BIIIB3 high-sulfur keratin pseudogene mRNA.
ACCESSION   M21102 J03062
KEYWORDS    BIIIB high-sulphur wool keratin.
SOURCE      Sheep (Dorset horn strain), cDNA to mRNA, clone psi-BIIIB3A.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1080)
  AUTHORS   Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E.
  TITLE     The keratin BIIIB gene family: Isolation of cDNA clones and
            structure of a gene and a related pseudogene
  JOURNAL   Genomics 4, 182-191 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            M.J.Frenkel, 24-OCT-1989.
FEATURES       from  to/span     description
    pept.ps /   191      459     BIIIB3 high-sulfur keratin
BASE COUNT      310 a    226 c    195 g    349 t
ORIGIN      74 bp upstream of AvaII site.
        1 aaatgtcaaa ttgattataa aaattttagt ggaaaacggt ggcataaata aaaaagagag
       61 gaaaactttt ggtggaccaa cacctgatga agcggatata taaagagccc caaagtgaga
      121 ggagacattt cgtcctggat agttcgtctt tcactccaag atgccaaaag aaaccaaggt
      181 tcccggtgcc acggcttgct gagctcccct ctgctgcagt gttcccacca gccccgccac
      241 tactatctgc tcctctgaca aattctgggc atgagtctgc ctgcccagct cctgctcaca
      301 tatggtttgg ttactggaga caacctgctg tgacagctgc ccaaagcctg cacattcctc
      361 aaccctatgt gccaacctgc ttcctgctca actctaccca gcccaccctg gaccaggaaa
      421 ccatcagcct cacagccctg tatcccaagc tgctgctaac tgatgtttga ctcactcagt
      481 gcctggcaat aacacagaag ctatctgttt ggcattcact tgctcagtat tttatcacat
      541 attgaggtag acccaatggc atagatatgg aaggcctact ttttatctta atggaaagaa
      601 aataaataaa ctttatgctt atttggctaa caattttttc atttgggcag gtgaatgtca
      661 tctatgtcaa aatagtatta aagtctataa gacttcagac cctattttat tggtcatatt
      721 gcttcctgga ccccatttat tgtattcggg attttcatag aggagaaata attttgatgg
      781 ttttcctaat aagctatttc tgtggcaaaa aagagaaaga gatgtgaaat tttgttgagt
      841 gctttttctg tatctatgga agtaattatt tagttttttt tatttaatct tttacatttt
      901 aaagtttata actatgttca ttttgattga ttttaaaaat gttaaaccaa ccttcttttc
      961 ctacgaaacc cagattgata taatgtattt tccctttcat acatgattga ctttgatttg
     1021 ctacattaaa aacatttatt tatttggcta cactggatct tagttgcagc agatgagatc
//
LOCUS       SHPHSKB3E    1047 bp ss-mRNA            MAM       17-JUL-1990
DEFINITION  Sheep BIIIB4 high-sulfur keratin mRNA, complete cds.
ACCESSION   M21103 J03062
KEYWORDS    BIIIB high-sulphur wool keratin.
SOURCE      Sheep (Dorset horn strain), cDNA to mRNA, clone BIIIB4.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1047)
  AUTHORS   Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E.
  TITLE     The keratin BIIIB gene family: Isolation of cDNA clones and
            structure of a gene and a related pseudogene
  JOURNAL   Genomics 4, 182-191 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            M.J.Frenkel, 24-OCT-1989.
FEATURES       from  to/span     description
    pept        289      585     BIIIB4 high-sulfur keratin
BASE COUNT      298 a    294 c    193 g    262 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttgtgt tccagtgaca aatagcaaga gtaatttgtt cagatttagc agtgaactca
       61 aacaaaggtt aattaggaaa ataaagactc tttacaagca tctaaatgtc aaaccagtca
      121 aaggataatg ggtacactaa cataaacaca caggaaagaa gccttccggt tggaccaaca
      181 ccactggaga gggtatataa gagcctcaga gcagaaagag agattctcac gccaggagac
      241 tcgtcttctt taaaaccaaa tcaataaaaa acacagcttc ccaacaccat ggcttgctgt
      301 gcccgcctct gctgcagcgt ccccaccacc cccgccacca ccatctgctc ctctgacaaa
      361 ttctgcagat gtggagtctg tctgcccagc acctgcccac acacagtctg gttcctggag
      421 ccaacctgct gtgacaaccg ccccccacct tgccacattc ctcagccctc tgtgcccacc
      481 tgcttcctgc tcaactcttc ccagcccacc ccaggcctgg aaagcatcaa cctcacaacc
      541 tacactcagc ccagctgtga gccctgcatc ccaagctgct gctgaccgac ggctgcctca
      601 cccactgcct gacagagtca acccagaagc tttagtgctc acctgtctca gtacctgcaa
      661 ctaattatgt ctccgctttc aaagttggaa caaggcatta tcacagacaa ccctcacaaa
      721 aaacaaacca agagactttc gatggccgtg tagtggacat cagtgaacaa ggacagctgg
      781 agtaggtaga tgcctacagg tttcccagcg ttgttcagtt ccttcgtgtt aaattgtatc
      841 tttctttggt gctttgggaa ttctgtttcc agtcttgaat cgtatctttc tggaaattga
      901 ggagcttctt catgattatt ctaataaagt ttccatctct ggcataacat aaatgtctat
      961 aggtatttcc atttattttt gttaacacat caaatctata tcttaactcc aaaggacacc
     1021 attttaacaa actgagaaca tctgcag
//
LOCUS       HUMRARG2      250 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human retinoic acid receptor gamma 2 mRNA, 5' end.
ACCESSION   M32074
KEYWORDS    retinoic acid receptor gamma.
SOURCE      Human fetal skin, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 250)
  AUTHORS   Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
            Leroy,P., Staub,A. and Chambon,P.
  TITLE     Murine isoforms of retinoic acid receptor gamma with specific
            pattern of expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            Chambon,P., 13-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept         62  >   250     retinoic acid receptor gamma 2
    mRNA          1  >   250     retinoic acid receptor gamma 2 mRNA
BASE COUNT       38 a     77 c     87 g     48 t
ORIGIN      Chromosome 12.
        1 ggggaatcgg ctgcgggtcc ttggtgtttc cagcacccag tttcccttaa gccgggtcgc
       61 gatgtacgac tgtatggaaa cgtttgcccc gggtccgcga cggctgtacg gggcggccgg
      121 gcccggggcc ggcttgctgc gcagagccac cggcggctcc tgtttcgccg gacttgaatc
      181 ttttgcctgg ccgcaacccg ccagcctgca atcggtggag acacagagca ccagctcaga
      241 ggagatggtg
//
LOCUS       MUSRARG1      620 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse retinoic acid receptor gamma 1 mRNA, 5' end.
ACCESSION   M32068
KEYWORDS    retinoic acid receptor gamma.
SOURCE      Mouse F9 cell, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 620)
  AUTHORS   Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
            Leroy,P., Staub,A. and Chambon,P.
  TITLE     Murine isoforms of retinoic acid receptor gamma with specific
            pattern of expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            Chambon,P., 13-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        399  >   620     retinoic acid receptor gamma 1
    mRNA          1  >   620     retinoic acid receptor gamma 1 mRNA
BASE COUNT      108 a    217 c    192 g    102 t      1 others
ORIGIN      Chromosome 15.
        1 attttggggc ggtcacgtgg gcgggcgggc tccgagaggc cccgggattg tcccagccta
       61 gagccgtgcc cccctgaagc cccccattac cgcgagtcac taacaccgcg gtgctccatc
      121 cccgagaccg cccgacgccg ggacctgggg ctctgcggcc ttccttcccc cgccctcccc
      181 tccagcagtt tccaccaggt ccctcacctc agcctggccc agtatgtagg agggactctc
      241 tgcangaggc cagagggatc cttggaaacc cactggacag accaggcagg gtgggcacgg
      301 agcctcccag gccagggcag tgggcatggg cgggggctgt agctgaagac cacccccgcc
      361 tgctgcagag tccaagggat tcccacgccg cagctaccat ggccaccaat aaggagagac
      421 tctttgcgcc cggtgccctg gggcctggat ctggttaccc aggagcaggc ttcccattcg
      481 ccttcccagg tgcactcaga gggtcgccac catttgagat gctgagccct agcttccggg
      541 gcctgggcca gcctgacctc cccaaggaga tggcttctct ctcggtggag acacagagca
      601 ccagctcgga ggagatggta
//
LOCUS       MUSRARG2      479 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse retinoic acid receptor gamma 2 mRNA, 5' end.
ACCESSION   M32069
KEYWORDS    retinoic acid receptor gamma.
SOURCE      Mouse F9 L cell, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 479)
  AUTHORS   Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
            Leroy,P., Staub,A. and Chambon,P.
  TITLE     Murine isoforms of retinoic acid receptor gamma with specific
            pattern of expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            P.Chambon, 13-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        291  >   479     retinoic acid receptor gamma 2
    mRNA          1  >   479     retinoic acid receptor gamma 2 mRNA
BASE COUNT       75 a    163 c    143 g     98 t
ORIGIN      Chromosome 15.
        1 tctaggcttt gcttttacaa aaggtctcca cagagctagc tgcctaggca ccctaagtga
       61 ggcagactga ggcagaggct acagcttcgc cttcgcgccc cgcccagttc ttttcccttg
      121 ctggggatcc ccctcctccc cgcccatccc ttacccccca tgcatctcag cgccctacgc
      181 tagccctccc cctcccccct ccaggagcgg ggcgctgcgg ggggaggagg gggaatgggc
      241 tgcgggtcct tggtgttcta gcacccagtt tctctccaag ccgggtcgcg atgtacgact
      301 gcatggaatc gtttgtcccg ggtccgcgac ggctgtatgg ggcggccggg cccggggccg
      361 gcttactacg cagagccact ggcagctcct gcttcgccgg acttgagtct tttgcctggg
      421 cacagcctgc cagtctacaa tcggtggaga cacagagcac cagctcggag gagatggta
//
LOCUS       MUSRARG3      646 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse retinoic acid receptor gamma 3 mRNA, 5' end.
ACCESSION   M32070
KEYWORDS    retinoic acid receptor gamma.
SOURCE      Mouse 11.5 day embryo F9 L cell, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 646)
  AUTHORS   Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
            Leroy,P., Staub,A. and Chambon,P.
  TITLE     Murine isoforms of retinoic acid receptor gamma with specific
            pattern of expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.Chambon, 13-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        496  >   646     retinoic acid receptor gamma 3
    mRNA    <     1  >   646     retinoic acid receptor gamma 3 mRNA
BASE COUNT      151 a    150 c    184 g    161 t
ORIGIN      Chromosome 15.
        1 tccccacccc caccgccacc aactgcttcc agggaactct gggattacta gagtcaggag
       61 tgagccctaa ccttttcagt tctatgcccc tccccgcccc ctttaaaaat gtgtatggtg
      121 ttctgtctat atgtattttc gcgtgccatt tttgtgcctg gtgcccatgg aagatggaag
      181 agggaaccgg ttctgttaga actggcgatt acaaatggtt gtaaactacc atgtagatgc
      241 tgggaatgga accctggtcc tctgggagag cagccactgc tcttacctgc tgagtcccaa
      301 ccaatcttca actttatgga gcagaagcag agaagttaac tccttggcct ggaatttgca
      361 gaagtgaacg ttaatgtaga agagttggct ttatgggggt ggggatgggg tagggggcag
      421 tggtggggcc tgaaatccca acaagctaca aagagtggtg gtctgggctt tccagggagt
      481 acctgttaag ggcttatgca caagggtgac aacagcggtc accagcaggt cccaagaaag
      541 agaggccatg ggatgagggt gcttctgctc agcttctgct tatcttctca tgctgctttt
      601 cattcagcgg tggagacaca gagcaccagc tcggaggaga tggtac
//
LOCUS       MUSRARG4      309 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse retinoic acid receptor gamma 4 mRNA, 5' end.
ACCESSION   M32071
KEYWORDS    retinoic acid receptor gamma.
SOURCE      Mouse 9.5 day embryo, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 309)
  AUTHORS   Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
            Leroy,P., Staub,A. and Chambon,P.
  TITLE     Murine isoforms of retinoic acid receptor gamma with specific
            pattern of expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.Chambon, 13-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        300  >   309     retinoic acid receptor gamma 4
    mRNA    <     1  >   309     retinoic acid receptor gamma 4 mRNA
BASE COUNT       77 a     81 c    100 g     51 t
ORIGIN      Chromosome 15.
        1 gatgacaatc gtaaggcccc caaatatcca tcttccccct ctctgtaatg ggagatgcca
       61 ggcttggggc ccaggaaccc ctttccagag cggcaataaa aagctgggga tggagaggca
      121 gtaaaaataa agtccagatg agaagcggct ttgaagcctg ggccacagtg cctggcgggg
      181 gtgggctgtt ggaaggggct gagagaggag cctgggacac cccccccccc aaatttactg
      241 ggggttggca ccgcactcgc agtctagcgg tggagacaca gagcaccagc tcggaggaga
      301 tggtaccca
//
LOCUS       MUSRARG5      111 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse retinoic acid receptor gamma 5 mRNA, 5' end.
ACCESSION   M32072
KEYWORDS    retinoic acid receptor gamma.
SOURCE      Mouse F9 cell, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
            Leroy,P., Staub,A. and Chambon,P.
  TITLE     Murine isoforms of retinoic acid receptor gamma with specific
            pattern of expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            Chambon,P., 13-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        102  >   111     retinoic acid receptor gamma 5
    mRNA          1  >   111     retinoic acid receptor gamma 5 mRNA
BASE COUNT       23 a     26 c     37 g     25 t
ORIGIN      Chromosome 15.
        1 tgaagcttct ccttgggggc tggattttgg attgaagagt cagcccctgg caccttgggg
       61 atatccttgc ggtggagaca cagagcacca gctcggagga gatggtaccc a
//
LOCUS       MUSRARG6      420 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse retinoic acid receptor gamma 6 mRNA, 5' end.
ACCESSION   M32073
KEYWORDS    retinoic acid receptor gamma.
SOURCE      Mouse 9.5 day embryo F9, P19, and L cells, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 420)
  AUTHORS   Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A.,
            Leroy,P., Staub,A. and Chambon,P.
  TITLE     Murine isoforms of retinoic acid receptor gamma with specific
            pattern of expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            Chambon,P., 13-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        407  >   420     retinoic acid receptor gamma 6
    mRNA          1  >   420     retinoic acid receptor gamma 6 mRNA
BASE COUNT       84 a    121 c    128 g     87 t
ORIGIN      Chromosome 15.
        1 tgaggcagcg ctctgtgggt agggagggcg tcagtgcagg agagagtacc acccacgctc
       61 aggcctgtgg ggaccccagg atgggctgaa agtgagggcc ggaaaggcct tccaggcttc
      121 cccaaacctc ccagcaccta ccattcaggc aaccccaccc ccagtcttga ataaactccc
      181 tgcacccttc cgcccccttt cttttgaggg ggaatctaac tccagcagga ttcttatgct
      241 aattgggtgc gtggggggtg gggtgggtgg agaaggcttc cctcttgtaa ggtggggagc
      301 tggtctggaa cccccaacct ataggctctt ctgtcctctc actaccttgg gtctcagtat
      361 ggacttgtga ccagcggtgg agacacagag caccagctcg gaggagatgg tacccagctc
//
LOCUS       PMGMTF1       190 bp ds-DNA             ORG       17-JUL-1990
DEFINITION  P.magellanicus mitochondrial fragment I DNA.
ACCESSION   M34468
KEYWORDS    .
SEGMENT     1 of 3
SOURCE      P.magellanicus mitochondrial DNA.
  ORGANISM  Mitochondrion Placopecten magellanicus
            Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia; 
            Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea; 
            Pectinidae; Placopecten magellanicus.
REFERENCE   1  (bases 1 to 190)
  AUTHORS   La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E.
  TITLE     Molecular characterization of a repeat element causing large-scale
            size variation in the mitochondrial DNA of the sea scallop
            Placopecten magellanicus
  JOURNAL   Mol. Biol. Evol. 7, 45-64 (1990)
  STANDARD  simple staff_review
BASE COUNT       66 a     63 c     14 g     47 t
ORIGIN      
        1 ctttcaaaca caaaaaacac aaaaagctcg ccaataaact ttccaatttt ttaacactac
       61 agctaagttt tccccattct ttcacctcaa gcgaagggaa atctatatcc ccactcccaa
      121 caatctactg gcatactact tatactacca aagaaactca tctacccccc atccacccca
      181 cattggtacc
//
LOCUS       PMGMTF2      1449 bp ds-DNA             ORG       17-JUL-1990
DEFINITION  P.magellanicus mitochondrial fragment II DNA.
ACCESSION   M34466
KEYWORDS    .
SEGMENT     2 of 3
SOURCE      P.magellanicus mitochondrial DNA.
  ORGANISM  Mitochondrion Placopecten magellanicus
            Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia; 
            Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea; 
            Pectinidae; Placopecten magellanicus.
REFERENCE   1  (bases 1 to 1449)
  AUTHORS   La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E.
  TITLE     Molecular characterization of a repeat element causing large-scale
            size variation in the mitochondrial DNA of the sea scallop
            Placopecten magellanicus
  JOURNAL   Mol. Biol. Evol. 7, 45-64 (1990)
  STANDARD  simple staff_review
BASE COUNT      405 a    403 c    193 g    448 t
ORIGIN      About 500 bp after segment 1.
        1 ctgcagcttt ttttttcctt ccccgaaatt ttacccccca ataaaaatgg ataaaaaccc
       61 cctccaaaaa ataagaaaaa accggaaaac gcaattttcg tgtaaaaaaa aggtgttttt
      121 cgaaaaaaaa aaataaaatc tgtacctcgc cgaccaaaaa actttttcat tttttcaaca
      181 aatttttcaa ctttttaatc atcaaaccga gcacttttta taaacataaa ccacgaccaa
      241 ctgaagggaa aaaaaacact ttattttggg cacttttctc aaaaaagctt cagctagaat
      301 tgccagaaat ccttgcccaa attcgacaaa ttttaggtac ctttgttttt actgcgaagg
      361 taaagttaaa cttcgcccaa tcccctctac cagcgagggg ggaataatcc ccactttcag
      421 gctataagcc taacgttctt actacactac ctcgaccccc cattcctact gcccagcccc
      481 ttctctcact aaaccatcca ccccccactt tccaattttt ttaacactac agctaagttt
      541 tccccattct ttcacctcaa gcgagaaggg aaatctatat ccccactccc aacaatctac
      601 tggcatacta cttatactac caaaagaaac tcatctaccc cccatccacc ccacattggt
      661 accagaacgt tcctacaaac cctttcttcg ctataggggg aaggaacctt cttccttccc
      721 ctcggacccc ttcctcgccc cctcccccct cccatcccca acccagctat cctttcccct
      781 cccccctagc ccccctacac gcttaactgg tagtcccccc ccttcctccc cccctccctg
      841 tgcgggtcaa cgaacctgtc tctactccgc tcttgatcaa acctcgaaaa aaaaaatctt
      901 tttttttttt tttttttcgg tctccaagga ctccgtggcc gagaccagga ttctaccctg
      961 tagggaaagt ggagggtaag gggttaaata tttatcccgg gcggggggaa cccggaatta
     1021 tcctttataa ccaatttcaa cccaatttat ggctccgtac caaaacgcta tttgccccac
     1081 ccccctctga ggctttttga ttcagaaaac acgagattta gtgatatttt tcataatatg
     1141 gcacaagcaa ttcactgaag gcctaaattt ccgaccctgg aacaattttt ttcaaaaatt
     1201 tgacatctaa aagtcacctg aatttacagc cttttttttg attttcaaat ttttgccgtt
     1261 tttggacccc gttttttttg actttttttt tgtgtttttt tgtgtttttt tttcaccctt
     1321 ttttttgcca aaaattacca tttttgcttg gctacaataa gtttacaaag ccggctataa
     1381 aaaattttta gaaaaacccg aaaatatcag ttttgagttt tgaaatgaac tttttttttt
     1441 tgactgcag
//
LOCUS       PMGMTF3       746 bp ds-DNA             ORG       17-JUL-1990
DEFINITION  P.magellanicus mitochondrial fragment III DNA.
ACCESSION   M34467
KEYWORDS    .
SEGMENT     3 of 3
SOURCE      P.magellanicus mitochondrial DNA.
  ORGANISM  Mitochondrion Placopecten magellanicus
            Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia; 
            Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea; 
            Pectinidae; Placopecten magellanicus.
REFERENCE   1  (bases 1 to 746)
  AUTHORS   La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E.
  TITLE     Molecular characterization of a repeat element causing large-scale
            size variation in the mitochondrial DNA of the sea scallop
            Placopecten magellanicus
  JOURNAL   Mol. Biol. Evol. 7, 45-64 (1990)
  STANDARD  simple staff_review
BASE COUNT      260 a    213 c     99 g    174 t
ORIGIN      About 500 bp after segment 2.
        1 ctgcagcttt ttttttcctt ccccgaaatt ttacccccca ataaaaatgg ataaaaaccc
       61 cctccaaaaa ataagaaaaa accggaaaac gcaattttcg tgtaaaaaaa aggtgttttt
      121 cgaaaaaaaa aaataaaatc tgtacctcgc cgaccaaaaa actttttcat tttttcaaca
      181 aatttttcaa ctttttaatc atcaaaccga gcacttttta taaacataaa ccacgaccaa
      241 ctgaagggaa aaaaaacact ttattttggg cacttttctc aaaaaagctt cagctagaat
      301 tgccagaaat ccttgcccaa attcgacaaa ttttaggtac ctttgttttt actgcgaagg
      361 taaagttaaa cttcgcccaa tcccctctac cagcgagggg ggaataatcc ccactttcag
      421 gctataagcc taacgttctt actacactac ctcgaccccc cattcctact gcccagcccc
      481 ttctctcact aaaccatcca ccccccactt tcctccggga aaaaaaaaaa aaagaaggga
      541 tctcatataa ccccatctta ccgctccggc acaaccaata gacgattcac aagtagagcc
      601 cagccctcct tgcgacccta accctgaaag taccgaagag acccgcaacc atcacaaaca
      661 aaaacatccg gcctaaagga accaaacccc aacaccagtc cgaggggtta tcgtttcaat
      721 accgcatgca tagaacaaac ctgcag
//
LOCUS       HUMCREB      1026 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human active transcription factor CREB mRNA, complete cds.
ACCESSION   M34356
KEYWORDS    active transcription factor CREB; alternative splicing.
SOURCE      Human T-cell, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1026)
  AUTHORS   Berkowitz,L.A. and Gilman,M.Z.
  TITLE     Two distinct forms of active transcription factor CREB
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Z.Gilman, 16-MAY-1990.
FEATURES       from  to/span     description
    pept          1     1026     active transcription factor CREB-B
    pept          1      261     active transcription factor CREB-A
                304     1026     active transcription factor CREB-A
BASE COUNT      332 a    249 c    233 g    212 t
ORIGIN      
        1 atgaccatgg aatctggagc cgagaaccag cagagtggag atgcagctgt aacagaagct
       61 gaaaaccaac aaatgacagt tcaagcccag ccacagattg ccacattagc ccaggtatct
      121 atgccagcag ctcatgcaac atcatctgct cccaccgtaa ctctagtgca gctgcccaat
      181 gggcagacag ttcaagtcca tggagtcatt caggcggccc agccatcagt tattcagtct
      241 ccacaagtcc aaacagttca gtcttcctgt aaggacttaa aaagactttt ctccggaaca
      301 cagatttcaa ctattgcaga aagtgaagat tcacaggagt cagtggatag tgtaactgat
      361 tcccaaaagc gaagggaaat tctttcaagg aggccttcct acaggaaaat tttgaatgac
      421 ttatcttctg atgcaccagg agtgccaagg attgaagaag agaagtctga agaggagact
      481 tcagcacctg ccatcaccac tgtaacggtg ccaactccaa tttaccaaac tagcagtgga
      541 cagtatattg ccattaccca gggaggagca atacagctgg ctaacaatgg taccgatggg
      601 gtacagggcc tgcaaacatt aaccatgacc aatgcagcag ccactcagcc gggtactacc
      661 attctacagt atgcacagac cactgatgga cagcagatct tagtgcccag caaccaagtt
      721 gttgttcaag ctgcctctgg agacgtacaa acataccaga ttcgcacagc acccactagc
      781 actattgccc ctggagttgt tatggcatcc tccccagcac ttcctacaca gcctgctgaa
      841 gaagcagcac gaaagagaga ggtccgtcta atgaagaaca gggaagcagc tcgagagtgt
      901 cgtagaaaga agaaagaata tgtgaaatgt ttagaaaaca gagtggcagt gcttgaaaat
      961 caaaacaaga cattgattga ggagctaaaa gcacttaagg acctttactg ccacaaatca
     1021 gattaa
//
LOCUS       PFAATUBII    3706 bp ds-DNA             INV       17-JUL-1990
DEFINITION  P.falciparum alpha-tubulin II gene, complete cds.
ACCESSION   M34390
KEYWORDS    alpha-tubulin II.
SOURCE      P.falciparum haploid intraerythrocytic cell line K1 (Thailand) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 2779)
  AUTHORS   Holloway,S.P., Gerousis,M., Delves,C.J., Sims,P.F.G.,
            Scaife,J.G. and Hyde,J.E.
  TITLE     The tubulin genes of the human malaria parasite Plasmodium
            falciparum: Their chromosomal location and sequence analysis of the
            alpha-tubulin II gene
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 2780 to 3706)
  AUTHORS   Holloway,S.P., Gerousis,M., Delves,C.J., Sims,P.F.G.,
            Scaife,J.G. and Hyde,J.E.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Hyde, 22-MAY-1990.
FEATURES       from  to/span     description
    pept        749      810     alpha-tubulin II, exon 1
               1022     1142     alpha-tubulin II, exon 2
               1262     2431     alpha-tubulin II, exon 3
    IVS         811     1021     alpha-tubulin II intron A
    IVS        1143     1261     alpha-tubulin II intron B
BASE COUNT     1390 a    347 c    502 g   1467 t
ORIGIN      Chromosome 4.
        1 gaattcctta tatatgtact cgttttatat ttttttccta taatttcaac atattaaaaa
       61 gtactgaata taacattata tatttacata ttcaagttat atttattttc tttttctttt
      121 ttcttgatat attatattat ttgattattt ttttttcttt taatatgtgg cttatattta
      181 caaaaatagt aatatatata tgttaatatg taaagaaaca gaaatgccat tgtatgaaat
      241 ataaaagttt caattttttt aaaaaatata aaaaaagaaa gaaaataaca aataaaaata
      301 aatatattat atatatgtgt ttatattttt atttttatat ataattatat tttttcttta
      361 tgatatatat tattatatct ttataaagaa gacataaaag aaacagattg attgaagttt
      421 ttcttgcttt aatataagaa agtaatttca taatgaaaat aatatatata tatgtatatg
      481 atattttaat ttataattat aatgagtttg taaaaatata ttttatataa gtttatatgt
      541 attttttata tatatatata tatatatata tggctttgga taatataata tatatatata
      601 aataaataaa tatatatata tatatatata tatatatgta tatttattta catatttatt
      661 taatataggt acaaacatag acctacatat ttttatattt atttatttta ttttatatta
      721 acatttttta tctaacgttt tacgaacgat gagagaagtc attagtattc atgttggaca
      781 ggctggtatt caaataggaa atgcttgctg gtaaaaaaaa aaaaaaaaaa aaaaaaaaaa
      841 aaaaaatata tatatatata tataaatgca tgtgtgttgt atatttttct tcttcatttt
      901 tatttttaaa gagatacata atgtttaatg tattattcaa tgtgttgtgg taaaaacatg
      961 tgtataaatt ataattaaaa atatacatat atatatatat atatatattt tttttttata
     1021 gggaattgtt ttgccttgaa catggaattc aaccggatgg gcagatgcca agtgaccaag
     1081 tcgttgctgg tggtgatgat gcctttaata catttttctc agaaacggga gctggaaaac
     1141 atgtaataat aaaaaaaaaa aaaaaaaaat atatacatat atatatatat atatatatat
     1201 atttatttat ttatttattt acttatacat gtgtgcttct ttttgtcgtt actcatttta
     1261 ggtaccacgt tgtgtgttcg ttgatttaga acccaccgtc gttgacgaag ttcgaacagg
     1321 aacgtatcgt cagctgtttc accctgaaca actaatatct ggaaaagagg atgcagcaaa
     1381 taatttcgca aggggacatt ataccatagg aaaagaaatt gttgatgtat gtttggatag
     1441 ggttcgaaag ttggctgata attgcactgg attacaagga tttttgatgt ttaatgcagt
     1501 aggtggaggt acaggtagtg gtcttggttg tttattatta gaaaggttgg caatagatta
     1561 tggaaagaaa tcaaaattaa atttttgttc gtggccatct cctcaagtat cgacagctgt
     1621 tgtagagcct tataattctg tattatcaac acattcattg ttagaacata cagatgtggc
     1681 aattatgctc gataacgaag caatatatga tatatgtaag aaaaatttag atatagaaag
     1741 gccaacctat actaacttga atagattgat tgctcaagtt atctcttcat taacagcatc
     1801 tttaagattt gatggtgctt tgaatgttga tgtaacagaa tttcagacta atttagtacc
     1861 atatcctaga attcacttta tgttatcatc atatgctcca atcataagtg ctgagaaggc
     1921 atatcacgag caattgtcgg tttctgaaat aacgaattct gcctttgagc ctgcatctat
     1981 gatggcaaag tgtgatccca gacatggaaa atatatggct tgttgtttaa tgtatagagg
     2041 agatgtagta ccaaaggatg ttaatgctgc cgtcgcaact attaagacta agagatctat
     2101 acaattcgtt gattggtgtc ctacgggatt taaatgtgga atcaattatc agccccctac
     2161 cgtggttcca ggaggagatt tagcaaaagt tatgagagct gtttgcatga tcagcaactc
     2221 aacagcaatt gccgaagtat tctcacgaat ggaccaaaaa tttgatttga tgtatgcgaa
     2281 aagggctttc gttcattggt atgtaggtga aggtatggaa gaaggagaat ttagtgaagc
     2341 tagagaggat ttggctgcct tggaaaagga ttatgaagag gtaggaatag aatcgaatga
     2401 tggagaggga gaagatgagg gatatgaatg aacataaata taaatggata aataaataaa
     2461 tatatatata tatatatata tatatatata tatatatatg tatatctact tataattatt
     2521 tttttttttt atctcatacg tcctatattc ccaaaatatt atatatatgt tcataataat
     2581 tatatggaac atatatatcc tcttatcata ttgtatattt attcgttgta aaaaataaaa
     2641 aaaaatatac aagaatatta atattttttt ctatgtattt acaccttcat tagtttaata
     2701 ttcaagaaaa ataatactta ctagaaataa ataaaatata tatatatata tatatgtatg
     2761 tatatatgtg tgtatgtacc attaattagt aagtagtaat ttattatgca atttatttat
     2821 tatatatata tatatatata tatatatata gtatcaaatt aaatatacga caattcgtaa
     2881 gtacattctt gaaaaaataa atgtatagat ataatattat tttttttata caaatttata
     2941 tttgtagaat atatattttt ttccctttat tattatgcat tatataataa tatatacatg
     3001 ataatatgca tacatataca tattatatat atatatgtat gtatgtatat atgtcgtgtg
     3061 ataatttatc tttatgctaa taatacatta cacatataaa tacttattta tatctatata
     3121 tatttttttt atatacgaca ataatttatg taaacattaa gttacatatt aaagttcgat
     3181 taataaatat ataataattt tttttatttt attatatcta tttttccaat gtataaataa
     3241 tatatgaaat atactatatg ctatataata tgagcacaca taagatttgg tgttatataa
     3301 atgtagaagt aatattaata gtatcagacg aaaagaacat tttttttttt tttttttttt
     3361 tttttttggt ccttttaaat tgatataaaa aactaaactt tagagtttct ttaaaatgag
     3421 acatttttta caacttaata taatttaatt tttttttttt tttttttcat atttattaga
     3481 aatgaatata taggttaaca catatatatt atatatatat atattaacat gaaaaggtta
     3541 aaaaaaaata tggaaaatac aaacatataa ttatatacat atatattata tatatgtatg
     3601 tatctttttt atgttttaat ttggtcacgt ggttaatttt atatgatgct aaaaaaattt
     3661 tcttccattt aagaaacagg aattaacaaa aaatttaagg gaattc
//
LOCUS       XELFIMB1     1341 bp ss-mRNA            VRT       17-JUL-1990
DEFINITION  X.laevis integumentary mucin B.1 (FIM-B.1) mRNA, complete cds.
ACCESSION   J02910
KEYWORDS    integumentary mucin; mucin.
SOURCE      X.laevis skin, cDNA to mRNA, clones pREP1218 and pXGL-3'-1.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1341)
  AUTHORS   Probst,J.C., Gertzen,E.-M. and Hoffmann,W.
  TITLE     An integumentary mucin (FIM-B.1) from Xenopus laevis homologous
            with von Willebrand factor
  JOURNAL   Biochemistry 29, 6240-6244 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Hoffmann, 26-APR-1990.
FEATURES       from  to/span     description
    pept    <     1     1197     integumentary mucin (FIM-B.1; AA at 1)
    mRNA    <     1     1341     FIM-B.1 mRNA
    signal     1319     1324     poly-A signal
    rpt           1      279     repeat sequence
BASE COUNT      472 a    290 c    268 g    311 t
ORIGIN      
        1 cctgcacctt cagaaacaac aggagagtca acacctgcac cttcagaaac aacaggagag
       61 tcaacacctg caccttcaga aacaacagga gagtcaacac ctgcaccttc agaaacaaca
      121 ggagagtcaa cacctgcacc ttcagaaaca acaggagagt caacacctgc accttcagaa
      181 acaacagtac catcaggaga gtcaacacct gcaccttcag aaacaacagt accatcagta
      241 ccatcaggag agtcaacacc tgcaccttca gaaacaacag agctgagaat catcccacca
      301 gaagtatcta cagttgcagt gccagtaaca actggacaga taactcctgc agtaacaact
      361 gaacacagca cagaagaaat actgacactt cctccgccag tggtaggacc agttcttcca
      421 gcaaagccaa cagtagatat ctccaaatat acaaacacta caactactaa atcgactgtc
      481 cctacaacta ccattcctcc taaagctact tgctgtggat cgtctggaga gtcagtacaa
      541 gcaggtcata tgtggcaaac tggttgtgat gtgtgcacat gcaatggaac ctctggaaaa
      601 acacaatgtg caccacgcca gtgtgaaaag gagataatct gtaagagcga cgaaaggagg
      661 gtactgagaa aacctgggaa atcatgctgt ggatattgcg agccgctgac atgtaaacat
      721 aatggaacag aatataagct tggggctacc tttatagaca aatcgaatcc ttgtatcact
      781 tatagatgtg acgcatcagg cttgacagtg aatgttaaat catgcccaaa tgaacaagtt
      841 tgcagtaaga gtgaaagaac ctatgattca gatggctgtt gtttctcatg tgacacatca
      901 tgtaaaccag tccctgcgac agtgggtatc caaggggagt acgattatca gaatgaaaaa
      961 acaaactgct ctgctaatat cataatggca aaatgttctg gtcaatgcca acataaattg
     1021 acctatgata caattgacaa taaagttgtt accaaatgtc ggtgttgtaa ggcagaccgt
     1081 gttgaaccaa gaaaagctca tctggtgtgt gataatggaa agaaaaaaat atataaatac
     1141 aaacatatca catcttgcaa atgcacatct tgtaccgctt acaatataag actgtgaaga
     1201 ctttcatgca tttgctaact gtcagctttt gcatgtacaa taaaataaac atgggggata
     1261 tattggaggc agagatatct gttatggtac ataaagtatt ttttgtctgt tattgaaaaa
     1321 taaatggttt ttttcagctg t
//
LOCUS       YSCANBI      2156 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  S.cerevisiae ANB1 locus encoding protein synthesis initiation
            factor (eIF-4D) gene, complete cds.
ACCESSION   J05455
KEYWORDS    protein synthesis initiation factor eIF-4D.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2156)
  AUTHORS   Mehta,K.D., Leung,D., Lefebvre,L. and Smith,M.
  TITLE     The ANB1 locus of Saccharomyces cerevisiae encodes the protein
            synthesis initiation factor eIF-4D
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Dr. K.D.Mehta, 16-MAR-1990.
FEATURES       from  to/span     description
    pept        569     1042     protein synthesis initiation factor (eIF-4D)
    mRNA        516  >  1042     protein synthesis initiation factor mRNA
                                 (alt.)
    mRNA        526  >  1042     protein synthesis initiation factor mRNA
                                 (alt.)
    mRNA        540  >  1042     protein synthesis initiation factor mRNA
                                 (alt.)
    signal      395      400     TATA box
BASE COUNT      602 a    422 c    369 g    763 t
ORIGIN      3 bp upstream of SmaI site.
        1 cccgggaatt ttagattcag gtaggaaatt gattacataa atactgttac cctgaatcat
       61 attcgacgat gtcgtctcac acggaaatat aattcatttc ttggttttcc aaaaaaattt
      121 tcattttttt tcactttttt gtttcgtcct cctttttttt tttttttatt ttttttcctg
      181 tgttcacctt tttttttttt ttttcagttt acaactttct gcattctttt ctgtgttttt
      241 tttttttcgt ttttccattg ttcgttcgtt gcctgttttt ttgccctatt gttctcgagc
      301 ctaaaaattt tttcctttcc tgctttcctt tcttcgttca aagtttccta ttccattgtt
      361 ctcttcggta aactcattgt tgtcggaact cagatatatt caggtcaatt tactgtactt
      421 caattgactt ttttcttgaa atttcaactt gccttttcaa cttgttcttc ttttttaatc
      481 ttattctaca ctttagttcc cttaccttgt tcctaattat tgtctagcaa aaagaaaaca
      541 tacacctatt tcattcacac actaaaacat gtctgacgaa gaacacacct ttgaaaatgc
      601 tgacgctggt gcctccgcca cctacccaat gcaatgttct gccttgagaa agaatggttt
      661 cgttgtcatc aaaggtagac catgtaagat tgtcgacatg tccacttcca agaccggtaa
      721 gcacggtcac gccaaggtcc atttggttac ccttgatatc ttcactggta agaagttgga
      781 agatttgtct ccatccactc acaacttgga agttccattt gtcaagagaa gcgaatacca
      841 attgttggat attgatgatg gctacttgtc cttgatgacc atggacggtg aaaccaaaga
      901 cgatgtcaag gcaccagaag gtgaattggg tgacagcatg caagctgctt tcgatgaagg
      961 taaggatttg atggtcacta tcatttctgc catgggtgaa gaagctgcca tctccttcaa
     1021 ggaagctcca agatctgatt agatatttta gatttctata gcttacaaaa tacgcgatgt
     1081 ttttttgatc tagaaatctt atttatacat tttatataat attataataa tcataatcat
     1141 aattattcat cctaactaac ttcgttctgt ttcagctacg aagttgcctt cattatcggt
     1201 acgttcgtgt taattggtgt ttgataccat agtcacttat gacgacttac agaacttttt
     1261 tttttgacgg atattttccc ctcaatatgc cgaggatgat aactttataa attgccttct
     1321 ttcggttcct tttttctttt taatttagct gacgggaaca tgtgtatata ccccagaaga
     1381 atttaaacag aaattttttt gagctatttt acgaaaatat taaaaactag ggctacagat
     1441 atcagagaaa aaattactta tatgatgaca ataacttaat atcacagtca ctatcgttct
     1501 cgcctttata tgcgcaatga cagcattcga cttttcctta gaagggatcc gatcaacgaa
     1561 acttcaaaaa gagtagaagt cgaaatattt tttgcacgac gagggtgaca aaccgtgcta
     1621 tgtccacaaa tgaggccctg ctttcccaag ataaattgcg taaacatata attgttacaa
     1681 ggatccgatc aacgaaactt caaaaagagt agaagtcgaa atattttttg cacgacgagg
     1741 gtgacaaacc gtgctatgtc cacaaatgag gccctgcttt cccaagataa attgcgtaaa
     1801 catataattg ttacaaatcc aaggctctgt gatttcttca ccaagtgtag ttgtgatgta
     1861 tagtgtgtag gataggtaat gaccggtaaa tcaagctggt aaaaagtcct aataacaagc
     1921 ccgaatctgg aataatggta atattcccat gccaggtgga cagtgttgtc cttacacaaa
     1981 ctcaagagat taaaggcaac ttggccgagt ggttaaggcg aaagattaga aatcttttgg
     2041 gctttgcccg cgcaggttcg agtcctgcag ttgtcgttat tttttatttc cctttctttt
     2101 cgtacataca caatgcacag tcttcaaagt aaaataccaa cgtatgtatc aagatc
//
LOCUS       BSUBMRX      1402 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  B.subtilis multidrug-resistant protein (BMR) gene, complete cds.
ACCESSION   M33768
KEYWORDS    multidrug-resistance.
SOURCE      B.subtilis DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1402)
  AUTHORS   Neyfakh,A.A., Bidnenko,V.E. and Chen,L.B.
  TITLE     Efflux-mediated multidrug resistance in bacteria: Similarities and
            dissimilarities with mammalian system
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.A.Neyfahh, 20-APR-1990.
FEATURES       from  to/span     description
    pept        195     1364     multidrug-resistance protein (BMR)
    signal      130      135     -35 promoter
    signal      155      160     -10 promoter
BASE COUNT      353 a    300 c    323 g    426 t
ORIGIN      About 216 min. on B.subtilis map.
        1 ggcggatatg gatggagaag tttatacaca tacacccgca gtcattcagg tcctgccaca
       61 gcatatcgac atgcttgttc ccgcaaatga ataactgtca taaggcttta gaaagatttt
      121 gcaaatccgt tgactctccc ctaggaggag gtcttacagt ataagggata cacacttgaa
      181 aaaggggaag tcatatggag aagaaaaata ttaccttaac tatattatta accaatttat
      241 ttattgcttt tttggggatc gggcttgtga ttccagtaac gccgaccatt atgaatgaat
      301 tgcatttatc ggggaccgcg gtcggctata tggttgcctg cttcgctatt acacagctca
      361 ttgtctcacc aatagccgga cgatgggttg atcgcttcgg gcgcaagatc atgatcgtaa
      421 tcggcctgtt gttctttagt gtgtcggagt ttttgttcgg cattggaaaa acagttgaga
      481 tgttatttat cacccgtatg ctgggcggta tcagcgcacc gttcattatg cccggggtca
      541 cggcttttat tgcagatatc acgaccatta aaacacggcc aaaagcgctc ggttatatgt
      601 cagccgctat ttcaacagga tttattatcg gccccggcat cgggggattt ttagcagaag
      661 tccattcccg gctgcctttt ttctttgcgg cagcttttgc actgttagca gccattttat
      721 caatcctcac gctgcgcgag ccggaacgaa accctgaaaa tcaggaaata aaaggacaga
      781 agacaggctt taaacgaatt tttgccccca tgtatttcat agcttttctc attatcttaa
      841 tttcgtcttt tggtttagca tcatttgaat ctttatttgc attattcgtg gatcataaat
      901 tcggatttac ggccagcgac attgccatta tgattacagg aggagcgatt gttggcgcca
      961 ttacgcaagt cgtcttattc gaccgcttca caagatggtt tggcgaaatt catttaattc
     1021 ggtacagctt aattctctcg acgagtctgg tattcttgct gacaacggta cattcatatg
     1081 ttgcgattct gctggtgaca gtcaccgtat ttgtcggatt tgatctcatg cggcctgcgg
     1141 taacgactta cctgtcaaag attgcgggaa atgaacaggg gtttgccggc ggtatgaatt
     1201 caatgtttac aagtatcggc aatgtattcg ggcctattat cggcggaatg ctgttcgata
     1261 tagatgtaaa ctatcctttc tactttgcaa cggtcacctt agccataggg attgcactga
     1321 ccattgcttg gaaagcgcct gcacatctta aagccagcac gtgataagaa gcgcattctt
     1381 tgtgtactgc aaagaatgcg ct
//
LOCUS       ECOMOR       1500 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  E.coli morphology and auto-aggregation control protein (mor)
            gene, complete cds.
ACCESSION   M34102
KEYWORDS    auto-aggregation control protein; morphology control protein.
SOURCE      E.coli (strain K12) DNA, clones pSRW[220,226].
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1500)
  AUTHORS   Warne,S.R., Varley,J.M., Boulnois,G.J. and Norton,M.G.
  TITLE     Identification and characterization of a gene that controls colony
            morphology and auto-aggregation in Escherichia coli K12
  JOURNAL   J. Gen. Microbiol. 136, 455-462 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        203     1120     morphology control protein (put.)
    signal      133      138     -35 region
    signal      157      162     -10 region
    rpt        1135     1164     region of dyad symmetry
    rpt        1167     1211     region of dyad symmetry
BASE COUNT      338 a    364 c    428 g    370 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcctgga gatccgcaaa agttcacgtt ggctttagtt attcgagttg agaaactctc
       61 gaaacgggca gtgacttcaa gggttaaaag aggtgccgct ccgtttctgt gagcaattat
      121 cagtcagaat gcttgatagg gataatcgtt cattgctatt ctacctatcg ccatgaacta
      181 tcgtggcgat ggaggatgga taatgaatat tcgtgatctt gagtacctgg tggcattggc
      241 tgaacaccgc cattttcggc gtgcggcaga ttcctgccac gttagccagc cgacgcttag
      301 cgggcaaatt cgtaagctgg aagatgagct gggcgtgatg ttgctggagc ggaccagccg
      361 taaagtgttg ttcacccagg cgggaatgct gctggtggat caggcgcgta ccgtgctgcg
      421 tgaggtgaaa gtccttaaag agatggcaag ccagcagggc gagacgatgt ccggaccgct
      481 gcacattggt ttgattccca cagttggacc gtacctgcta ccgcatatta tccctatgct
      541 gcaccagacc tttccaaagc tggaaatgta tctgcatgaa gcacagaccc accagttact
      601 ggcgcaactg gacagcggca aactcgattg cgtgatcctc gcgctggtga aagagagcga
      661 agcattcatt gaagtgccgt tgtttgatga gccaatgttg ctggctatct atgaagatca
      721 cccgtgggcg aaccgcgaat gcgtaccgat ggccgatctg gcaggggaaa aactgctgat
      781 gctggaagat ggtcactgtt tgcgcgatca ggcaatgggt ttctgttttg aagccggggc
      841 ggatgaagat acacacttcc gcgcgaccag cctggaaact ctgcgcaaca tggtggcggc
      901 aggtagcggg atcactttac tgccagcgct ggctgtgccg ccggaggcca aacgcgatgg
      961 ggttgtttat ctgccgtgca ttaagccgga accacgccgc actattggcc tggtttatcg
     1021 tcctggctca ccgctgcgca gccgctatga gcagctggca gaggccatcc gcgcaagaat
     1081 ggatggccat ttcgataaag ttttaaaaca ggcggtttaa accgtttaac gcagctaccc
     1141 gatagcttcg ccatcgtcgg gtagttaaag gtggtgttga cgaagtactc aatagtgttg
     1201 ccgccacctt tctgttccat aatcgcctga ccgatatgaa taatttcggc agcgcgctcg
     1261 ccaaagcagt gaatacccag aatctctttt gtttcccgat ggaacaaaat tttcagcgtg
     1321 cccacgttca tgccgacgat ttgtgcgcgt gccagatgtt taaactgggc gcggcccact
     1381 tcatatggca ctttcattgc ggtcagctgc tgttcggttt tgcccacaga gctgatttcc
     1441 gggatggtgt aaataccggt agggatatct tcaatcagat gtgcggtggc ttcgcctttt
//
LOCUS       ECOTGMETY     800 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  E.coli Met-tRNA-f2 (metY) gene, 5' end.
ACCESSION   M28401
KEYWORDS    transfer RNA-Met.
SOURCE      E.coli (strain K12) DNA, clones p50L and p50.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 800)
  AUTHORS   Granston,A.E., Tompson,D.L. and Friedman,D.I.
  TITLE     Identification of a second promoter for the metY-nusA-infB operon
            of Escherichia coli
  JOURNAL   J. Bacteriol. 172, 2336-2342 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.E.Granston, 27-SEP-1989.
FEATURES       from  to/span     description
    tRNA        682      758     Met-tRNA-f2 (metY)
    anticdn     716      718     Met-tRNA anticodon atg
    signal      560      565     -35 region promoter P1
    signal      584      589     -10 region promoter P1
    signal      638      643     -35 region promoter P0
    signal      661      666     -10 region promoter P0
    mRNA        596  >   800     P1-associated mRNA
BASE COUNT      210 a    191 c    190 g    209 t
ORIGIN      5 bp upstream of KpnI site.
        1 ggtaccagtc acggcgcggc cagcgcgtcg ttttgaaata ggtcaggccg ccggtggtgt
       61 tatgaaatgc gccacactga atagcggcaa taccttcggc caccagttgt ttgcggcagt
      121 cgatcagacg tgcgttctcc gcgccgtatt ccatggcacg acgagggatc gcatcataat
      181 cctcttcgtc tggctggccc aggtttgcag tatatgcata aggaaccgct cccttttgtc
      241 gcatccacag cagtgcggca ctggtgtcca gaccgccaga aaaagcgata ccaatacgtt
      301 gacctaccgg gagatgcttg agaatcgtcg tcataaaata acaccctgct taattaactg
      361 atgatgagcc tggatttccg ctctcactga atttttatgc aaaataaatg agttttcatt
      421 taatcatctt ttatcggaga caggaagagt ttagtgtgtt ttttgtaaaa taatgcgctt
      481 aagggagagc aggagaaggc aaaagtattc aacaaatgaa agtgaactgg atattcattc
      541 acatgattag caataaacgt tgacaaaatg tggcgtggat cactataatg cctgcagatt
      601 ttacgtcccg tctcggtaca ccaaatccca gcagtatttg cattttttac ccaaaacgag
      661 tagaatttgc cacgtttcag gcgcggggtg gagcagcctg gtagctcgtc gggctcataa
      721 cccgaagatc gtcggttcaa atccggcccc cgcaaccact ttcccttaga gtcctttttc
      781 aaatatactg tgaagacttc
//
LOCUS       RAT5HT1A     2314 bp ds-DNA             ROD       17-JUL-1990
DEFINITION  Rat 5-hydroxytryptamine-1a receptor (5-HT-1a) gene, complete cds.
ACCESSION   J05276
KEYWORDS    5-hydroxytryptamine-1a receptor.
SOURCE      Rat DNA, clone D.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2314)
  AUTHORS   Albert,P.R., Zhou,Q.-Y., Van Tol,H.H.M., Bunzow,J.R. and Civelli,O.
  TITLE     Cloning, functional expression, and mRNA tissue distribution of the
            rat 5-hydroxytryptamine-1a receptor gene
  JOURNAL   J. Biol. Chem. 265, 5825-5832 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        123     1391     5-hydroxytryptamine-1a receptor (15-HT-1a)
BASE COUNT      528 a    617 c    591 g    578 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcccctc cctctgaaac tccccagaaa agaaggaaga ccctaggggg aggggcgagg
       61 gagatcccct tgcagttctt tcccccctcc cttcccccac cccagcggga gtgcaggcag
      121 gcatggatgt gttcagtttt ggccagggca acaacaccac agcgtcccag gagccctttg
      181 ggaccggcgg caacgttact agcatctccg acgtgacctt cagctaccaa gtgatcacct
      241 ctctgttgct gggtactctc attttctgcg cggtgctcgg caatgcctgc gtggttgctg
      301 ccatcgccct ggagcgctcc ctccagaatg tggccaacta tctcatcggc tccttggcgg
      361 ttactgatct catggtatca gtgctggtgc tgcccatggc tgctctgtac caggtgctca
      421 acaagtggac cctgggccag gtcacctgcg acctgtttat cgccctggat gtactgtgct
      481 gcacctcgtc catcctgcac ctgtgcgcca tcgcgctaga caggtactgg gctatcaccg
      541 accctataga ctatgtgaac aaaaggacgc cccggcgcgc cgctgcgctg atctcgctca
      601 cttggctcat tggctttctc atctccatcc cgcccatgct gggctggcgc accccggaag
      661 accgatcgga ccccgacgcg tgcaccatca gcaaggacca cggctacacc atctactcca
      721 ctttcggcgc tttctatatc ccgctgttgc tcatgctggt tctctacggg cgcatcttca
      781 gagccgcacg cttccgaatc cgcaagactg tcaggaaggt ggaaaagaag ggagcaggca
      841 ccagcttagg aacttcgtcg gcaccgcccc ccaagaagag cctgaacgga cagccaggta
      901 gtggggactg gaggcgctgt gctgagaaca gggcggtggg gactccgtgc actaatgggg
      961 cagtgaggca gggtgacgac gaggccaccc tggaggtgat cgaggtgcac cgagtgggca
     1021 actccaaaga gcaccttcct ctgcccagcg agtcaggatc taactcctat gcccccgcct
     1081 gcttggagag aaaaaatgag cgcaatgctg aagcaaagcg caagatggcc ttggcccgtg
     1141 aaaggaagac ggtgaagact ctgggcatca tcatgggcac cttcatcctc tgctggctgc
     1201 cgtttttcat tgtggctctg gtcctgcctt tctgtgaaag cagctgccac atgcctgcgt
     1261 tgttgggtgc cataattaac tggctaggct actccaactc cctgctcaat ccggttattt
     1321 atgcttattt caacaaagac tttcaaaacg cttttaagaa gataatcaag tgcaagttct
     1381 gccgccgatg atgatgatga tggtggtggt aatagtggtg actatggaga aggaggagga
     1441 ggaggaagag gaggaggaag aagaggaaga ggaggaagag tgtagggttt acaggaccct
     1501 ccccattcac tcagtaccct agccctggaa gcaacaccta agataacttg cttttcctag
     1561 aacatgcagc ttctcagcac atggctttca gaccttgttc tctctatatc tcgtttcaag
     1621 gggacagtac tctttgcaga aatcttagag aaccactagt atctgtgtga aaagttattg
     1681 gcttggtctt tggccctagg atcagttttg gtagttgtaa taacatcctc cctattcaaa
     1741 atcccaattc ttcacgatgg aattttaaac cttggtgatt caaggtagaa agccgaaaca
     1801 gaaaatctaa actgtcctga catctgccag gcccactaga actgtcactc tctgccctac
     1861 ttctggaact ttgaggatca agactgctct gtctctcact ccctcttgct ctagaaacat
     1921 ctgttccctt tgggtgtcaa tctacactgt tgaactggtc tccatgctgg tctggaaaag
     1981 cagttggaag taactactct gggttatgtg ctctggaaat ttttgaatag caaaagatcc
     2041 cccccacccc tgcacaacac ccactttctg tatttttttt aaattttttt ttaatttttg
     2101 aactcaaaat caagtggcaa aagtgtctgt tggagtggac ttataagtag gtctggctcg
     2161 taggaagagc caggaaggtc tgtggatggt ttcttgtcct ggtgttaaca ctcaccaaag
     2221 aaactgacaa tcagactatt tggaaggaag agttttattc ctacagtttg ttggcctttc
     2281 tgtaagttct ggcaaggctc tgtgtcttct gcag
//
LOCUS       RATFCGRE     1341 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Rat Fc-gamma receptor mRNA, complete cds.
ACCESSION   M32062
KEYWORDS    Fc-gamma receptor.
SOURCE      Rat (strain Fisher 344) natural killer cell, cDNA to mRNA, clone
            6a6.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1341)
  AUTHORS   Zeger,D.L., Hogarth,P.M. and Sears,D.W.
  TITLE     Characterization and expression of an Fc-gamma receptor cDNA cloned
            from rat natural killer cells
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3425-3429 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.W. Sears, 12-FEB-1990.
FEATURES       from  to/span     description
    pept         50      853     Fc-gamma receptor
BASE COUNT      374 a    346 c    319 g    302 t
ORIGIN      
        1 ccagcaattt catctcctag acctcatcag actctgaccc agttcttgaa tgactttgga
       61 gacccagatg tttcagaatg cacattctgg aagccaatgg ctactcccac cactgacaat
      121 gttgctgctg tttgcttttg cagacaggca gacgggagat cttctgaagg ctgtggtgaa
      181 acgtgatccc ccatggatcc aggtgctcaa ggacgacact gtgacgctga cgtgcgaagg
      241 gacccacaat cctggaaact cttctaccca gtggttccac aaccagagct ccacctgggg
      301 ccaggtccaa gccagctaca cgtttaaggc cacagtcaat gacagtggag aataccggtg
      361 ccgaatggcg cacaccagcc tcagcgaccc catacatctg gaagtgattt ctgactggct
      421 gctgctccag acccctcaac tggtgtttga ggaaggggaa accatcacat taaggtgcca
      481 tagctggaag aacaaacagc tgactaaggt cttgttgttc cagaatggaa aacctgtgag
      541 gtattattat caaagcagca atttttcaat ccccaaagcc aaccacagtc acagtgggaa
      601 ctactactgc aaagcatatc taggaaggac aatgcatgtg tccaagcctg tcaccatcac
      661 tgtccaaggt tcagcaaccg cgtccaccag ctctctagtg tggttccatg ccgctttctg
      721 cctagtgatg tgcctcctgt ttgcagtgga caccggcctg tatttctgtg tacggagaaa
      781 tcttcaaacc tcgggggagg actggaggaa atccctgtca gtcggaaagt acaaggctcc
      841 acaggacaaa tgacatccca tcgtatggct aaacagcggc agcttctttt cagccacacc
      901 gccgcttcat ctacagcctt cctttgaaag caacttacca ccaggctgga tatttggtcc
      961 tgcaatcaca actgctagga ggctgaggca ggaggatcac caaaggctgc ccgggtttta
     1021 gagaagagag agtgcaagtc tatcctggat aacccaataa gaccctgggt ttagatggct
     1081 tatcaggaaa gagcacccat tgctaagctc acagacaaga ccacagattg atccctaagg
     1141 acacagggca aagggggaga acaaattcct gaaagttgtc ctctgacctc cacacgtcca
     1201 ccatggcaga tgcacacaat aaacaaatta aaatgtaata tatatatttt aaacaagagg
     1261 tggggaaagg ctgggggttg gtggcacagt ttttatccag gacacatgat gctctggatt
     1321 ttgacctcct ataataaaac g
//
LOCUS       YEPYOPYER    1527 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  Y.pestis virulence determinant (yopE) and yopE regulator (yerA)
            genes, complete cds.
ACCESSION   M34279
KEYWORDS    virulence detrminant; yopE regulator.
SOURCE      Y.pestis DNA, clone pYV019.
  ORGANISM  Yersinia pestis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1527)
  AUTHORS   Forsberg,A. and Wolf-Watz,H.
  TITLE     Genetic analysis of the yopE region of Yersinia spp.:
            Identification of a novel conserved locus, yerA, regulating yopE
            expression
  JOURNAL   J. Bacteriol. 172, 1547-1555 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        476       84 (c) yopE regulator (yerA)
    pept        670     1329     virulence determinant (yopE)
BASE COUNT      426 a    329 c    339 g    433 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccgaca ttactaagtg agcgttgtat ctaatcctgg gggcaggtca ttccgtataa
       61 taaggcaaca accaaaaatc tactcaacta aatgaccgtg gtggtgagat tagtgatgag
      121 gtttgtagcc gttcagcccc ctgcaccagc atctcaagct gagtatatag tgagttatta
      181 tccaggctgt tcaatggttg tcgattccat aacactgggt gccccccaac ctcgtcccag
      241 gataagatgg gttttaatat atcttgactg aatatattat ggctaagtaa ggtttccttt
      301 tcatcattat tgtcaagaga aggtagggta aacattaata tttgcccgac aggatgctct
      361 gttatatggc aggcgaattc cccaactttg acaccgataa ccggttcaat agtatctgga
      421 atagacaacg aaagttgttg aaataattga gtgatagctt gttcaaatga atacattatg
      481 atctcataat agttagataa aatatcaact taaccaaagc actctcggca gaccatcaat
      541 tttagcctat aatttttagt ttttgttttg tctaatataa caacaaaaac agcagcgatt
      601 ttttatatag ccatcggcta ttttcccact aagataacct tgttttaata gccaaggtaa
      661 taaatagtca tgaaaatatc atcatttatt tctacatcac tgcccctgcc gacatctgtg
      721 tcaggatcta gcagcgtagg agaaatgtct gggcgctcag tctcacagca aacaagtgat
      781 caatatgcaa acaatctggc cgggcgcact gaaagccctc agggttccag cttagccagc
      841 cgtatcattg agaggttatc atcagtggcc cactctgtga ttgggtttat ccaacgcatg
      901 ttctcggagg ggagccataa accggtggtg acaccagcac ccacacctgc acaaatgcca
      961 agtcctacgt ctttcagtga cagtatcaag caacttgctg ctgagacgct gccaaaatac
     1021 atgcagcagt tgaatagctt ggatgcagag atgctgcaga aaaatcatga tcagttcgct
     1081 acgggcagcg gccctcttcg tggcagtatc actcaatgcc aagggctgat gcagttttgt
     1141 ggtggggaat tgcaagctga ggccagtgcc atcttaaaca cgcctgtttg tggtattccc
     1201 ttctcgcagt ggggaactat tggtggggcg gccagcgcgt acgtcgccag tggcgttgat
     1261 ctaacgcagg cagcaaatga gatcaaaggg ctggcgcaac agatgcagaa attactgtca
     1321 ttgatgtgat atggataaaa acaaggggat agtgtttccc cctttttcta tcaatattgc
     1381 gaatatcttc gtccctgatc tttcaggggc gaatcgtttt ttagcatgct cattgttaga
     1441 atttctgact tatctctctt ctgtattact actcatgctc tggaaaatcc tgaacatgta
     1501 tatctatgga ttgatgcagc actcgag
//
LOCUS       YEPYOPYER1   1762 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  Y.enterocolitica virulence determinant (yopE) and yopE regulator
            (yerA) genes, complete cds.
ACCESSION   M34278
KEYWORDS    virulence determinant; yopE regulator.
SEGMENT     1 of 2
SOURCE      Y.enterocolitica DNA, clone pYV8081.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1762)
  AUTHORS   Forsberg,A. and Wolf-Watz,H.
  TITLE     Genetic analysis of the yopE region of Yersinia spp.:
            Identification of a novel conserved locus, yerA, regulating yopE
            expression
  JOURNAL   J. Bacteriol. 172, 1547-1555 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        609      217 (c) yopE regulator (yerA)
    pept        803     1462     virulence determinant (yopE)
    rpt        1468  >  1762     insertion sequence
    rpt        1464     1467     5' insertion target sequence
BASE COUNT      497 a    395 c    382 g    488 t
ORIGIN      
        1 ccggtactcc tcattaccaa tgaggaggcg gtgattccac caagtgaatg ggcttaatac
       61 aagtctttta cattaaaaat tgttaaaaaa cgggagataa aagtcaacac tccaacttgg
      121 tgttaacttt tactgagcga aatctgatat tgctggcacc acaaatttat aggttatcgc
      181 tatttccgta taataaggca acaaccaaaa atctactcaa ctaaatgacc gtggtggtga
      241 gattagtgat gaggtttgta gccgttcagc cccctgtacc agcatttcaa gctgagtata
      301 tagcgagtta ttatccaggt tgttcaatgg ttggcgattc cataacactg ggtgcccccc
      361 gacctcgtcc caagataaga tgggttttaa tatatcttga ctgaatatat tatggctaag
      421 taaggtttcc ttttcattat tattgtcaag agaaggcagg gtaaacatta atatttgccc
      481 gacaggatgc tctgttatat ggcaggcgaa ttccccaact ttgacaccga taaccggttc
      541 aatagtatct ggaatagaca gcgaaagttg ttgaaataat tgagtgatag cttgttcaaa
      601 tgaatacatt atgatctcct aatagttaga taaaatatca agttaaccaa agcactctcg
      661 gcagaccatc aatttgagcc tataattttt agtttttatt ttgtctaata taacaacaaa
      721 aacagcagcg attttttata tagccaccgg ctattttccc actaagataa ccttgtttta
      781 atagccaagg gaataaatag tcatgaaaat atcatcattt atttctacat cactgcccct
      841 gccgacatca gtgtcaggat ctagcagcgt aggagaaatg tctgggcgct cagtctcaca
      901 gcaaaaaagt gagcaatatg caaacaatct ggctgggcgc actgaaagcc ctcagggttc
      961 cagcttagcc agccgtatca ctgagaagtt atcatcaatg gcccactctg cgattgaatt
     1021 tatcaaacgc atgttctcgg aggggagcca taaaccggtg gtgacaccag cacccacgcc
     1081 tgcacaaatg ccaagtccta cgtctttcag tgatagtatc aagcaacttg ctgctgagac
     1141 gctgccaaaa tacatgcagc agttgagtag cttggatgca gagacgctgc agaaaaatca
     1201 tgatcagttc gccacgggca gcggccctct tcgtggcagt atcactcaat gccaagggct
     1261 gatgcagttt tgtggtgggg aattgcaagc tgaggccagt gccatcttaa acacgcctgt
     1321 ttgtggtatt cccttctcgc agtggggaac tattggtggg gcggccagcg cgtacgtcgc
     1381 cagtggcgtt gatctaacgc aggcagcaaa tgagctcaaa gggctggcgc aacagatgca
     1441 tcaattactg tcattgatgt gatatggtga ctagtcctgc tataggttga cacttttcag
     1501 ccttaaaacg cctgatgaac ttcatcgggc gttttgtatt ttaacgacag atgcggtcgc
     1561 cgcgtgttgt agatttccac tgactccctg accatctttc ttgcctggcc aatatcttcc
     1621 ggcttaacca gcagatattc catttttaat atcccattga cccgctccgc cagcgcattc
     1681 tgataacaat catatccatc cgtcatcgaa cagatgacac cgtgacgttg atgtaacgcc
     1741 tgatattctg ttgaacagta ct
//
LOCUS       YEPYOPYER2    546 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  Y.enterocolitica DNA with insertion element, 3' end.
ACCESSION   M34280
KEYWORDS    insertion element.
SEGMENT     2 of 2
SOURCE      Y.enterocolitica DNA, clone pYV8081.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 546)
  AUTHORS   Forsberg,A. and Wolf-Watz,H.
  TITLE     Genetic analysis of the yopE region of Yersinia spp.:
            Identification of a novel conserved locus, yerA, regulating yopE
            expression
  JOURNAL   J. Bacteriol. 172, 1547-1555 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt     <     1      242     insertion element
    rpt         243      246     3' insertion target sequence
BASE COUNT      144 a    119 c    109 g    174 t
ORIGIN      About 650 bp after segment 1.
        1 gatccgtgac ccagaaagac aaatccgtca ttaacttgcg tagccacttc aggacggtag
       61 aacggccttg aataccgttg cggtcttgcg ctgtcggtaa gtcatttccc ttttttcgac
      121 ctgttcgaca acggccaatt taaaggatag aggataatgc cgttgcgtgc gtttaacata
      181 ttggttcatc acattttcct aagtacatga gttaaatgtg tcaacgctat ttaggacggg
      241 tcatggataa aaacaagggg atagtgtttc cccctttttc tctcaatatt gcgaatatct
      301 tcgtccctga tctttcaggg gcgaatcgtt ttttagcatg ctcattgtta gaatttctga
      361 cttatctctc ttctgtatta ctactcatgc tctggaaaat cctgggcgtc tatattctat
      421 ggattgatgc aacactcgag aaatcaaaat atcattgcta agcgttatat agtatatacc
      481 gtgcttttta tatgaaaacg gcgcatgtca gagcaaatcc agttacactc agcccctaac
      541 tctaga
//
LOCUS       HUMCDR34     2412 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human cerebellar-degeneration-related antigen (CDR34) gene,
            complete cds.
ACCESSION   M31423
KEYWORDS    cerebellar-degeneration-related antigen.
SOURCE      Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2412)
  AUTHORS   Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A.,
            Chaganti,R.S.K., Posner,J.B. and Old,L.J.
  TITLE     Cerebellar degeneration-related antigen: A highly conserved
            neuroectodermal marker mapped to chromosomes X in human and mouse
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            Y.-T.Chen, 17-JAN-1990
FEATURES       from  to/span     description
    pept        503     1174     cerebellar-degeneration-related antigen
                                 (CDR34)
BASE COUNT      743 a    334 c    669 g    666 t
ORIGIN      
        1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct
       61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat
      121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg
      181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga
      241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg
      301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag
      361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg
      421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga
      481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt
      541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt
      601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga
      661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat
      721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt
      781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga
      841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct
      901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt
      961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga
     1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt
     1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat
     1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg
     1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa
     1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac
     1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa
     1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt
     1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact
     1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat
     1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct
     1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga
     1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct
     1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac
     1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg
     1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata
     1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa
     1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc
     2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt
     2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta
     2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga
     2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta
     2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat
     2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt
     2401 gagatgctgc ag
//
LOCUS       RATHGF       2485 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Rat hepatocyte growth factor mRNA, complete cds.
ACCESSION   M32987
KEYWORDS    hepatocyte growth factor.
SOURCE      Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2485)
  AUTHORS   Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M.,
            Shimizu,S. and Nakamura,T.
  TITLE     Deduced primary structure of rat hepatocyte growth factor and
            expression of the mRNA in rat tissues
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Nakamura, 26-MAR-1990.
FEATURES       from  to/span     description
    pept        143     2329     hepatocyte growth factor
BASE COUNT      750 a    533 c    577 g    625 t
ORIGIN      
        1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc
       61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga
      121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct
      181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca
      241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa
      301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa
      361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc
      421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt
      481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa
      541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg
      601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct
      661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag
      721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat
      781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg
      841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga
      901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta
      961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc
     1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg
     1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca
     1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa
     1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat
     1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta
     1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg
     1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc
     1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg
     1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga
     1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca
     1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa
     1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc
     1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat
     1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct
     1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct
     1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac
     1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg
     2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt
     2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga
     2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt
     2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc
     2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa
     2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa
     2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca
     2461 ttaatatatg tggcgttttc tgttg
//
LOCUS       MUSIMPD      1707 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse IMP dehydrogenase mRNA, complete cds.
ACCESSION   M33934
KEYWORDS    IMP dehydrogenase.
SOURCE      Mouse, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1707)
  AUTHORS   Tiedemann,A.A. and Smith,J.M.
  TITLE     Isolation and DNA sequence of a cDNA encoding mouse IMP
            dehydrogenase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.M.Smith 27-APR-1990.
            
               Author address:
               Dr. J.M. Smith
               Seattle Biomedical Research Institute
               4 Nickerson St.
               Seattle, WA 98109
FEATURES       from  to/span     description
    pept         67     1611     IMP dehydrogenase (EC 1.2.1.14)
BASE COUNT      457 a    408 c    451 g    391 t
ORIGIN      
        1 cctctgcggc gcggtcctcg gagcggcgcg gttctcggag ccacgcgtct gtcttcctcc
       61 gtggtcatgg cggactacct gattagcgga ggcacctctt acgtgccgga cgacgggctc
      121 acagcgcagc agctcttcaa ctgcggggac ggcctcacct acaatgattt tctcattctt
      181 cctgggtata tcgacttcac tgcagatcag gtggacttga cgtctgctct aactaagaag
      241 attacactaa agaccccatt ggtttcctca cccatggaca ctgtcacaga ggctggaatg
      301 gccatcgcga tggcgcttac aggaggtatt ggtttcatcc accacaactg cacacctgaa
      361 ttccaggcca atgaagttcg gaaagtgaag aaatacgaac agggattcat cactgacccc
      421 gtggtcctta gccccaagga tcgtgtacgc gatgtttttg aggccaaagc caggcatggc
      481 ttctgtggta tccccatcac agatacaggc cggatgggga gtcgattggt gggcatcatc
      541 tcctcaaggg acattgattt cctcaaggag gaagagcatg accggttctt ggaagagatc
      601 atgactaaga gggaagattt ggtggtcgcc cctgccggcg tcactctgaa agaggcaaat
      661 gagattctgc agcgaagtaa aaagggaaag ttgcccattg tgaatgaaaa tgatgagctg
      721 gtagccatca ttgcccggac agacctaaag aagaatcgtg attaccccct ggcctccaaa
      781 gatgccaaga agcaactgct gtgtggggca gccattggca ctcatgagga tgacaagtat
      841 aggctggact tactggccct tgctggtgtg gatgtagtgg ttttggactc ttcccaggga
      901 aactccatct tccaaatcaa tatgatcaaa tacatcaagg agaagtatcc cagtctacag
      961 gtcattggag gcaatgtagt cactgctgcg caagccaaga acctcataga tgcaggtgta
     1021 gatgctttgc gagtcggcat gggaagtggt tccatctgca tcacccagga agtgttggcc
     1081 tgtgggcggc cccaagccac agcagtgtac aaggtctctg agtatgcccg tcgctttggt
     1141 gttcctgtta ttgctgatgg aggaatccaa aatgtgggtc atattgccaa agctttggct
     1201 cttggggctt ccacagtcat gatgggctcc ctcctggctg ccaccactga ggcccctggc
     1261 gagtacttct tctcagatgg gatccggctg aagaaatacc gaggtatggg ttctcttgat
     1321 gccatggaca aacatctcag cagccagaac cgatacttca gtgaagctga caaaatcaaa
     1381 gtggcccaag gagtttcagg ggcagtgcag gacaaggggt ctatccacaa gttcgttcct
     1441 tacctgattg ctggcatcca gcattcctgt caagacattg gtgccaagag tttaacccaa
     1501 gtcagagcca tgacgtactc gggggagctt aaatttgaga agaggacatc ctctgctcag
     1561 gtggaaggtg gcgtccacag cctccattcg tacgagaaac ggcttttctg aaaacagatc
     1621 cagtatatgc cttgaatttt tcaataaaag tttgggaaaa aaaaagtgaa aaaaaaaaaa
     1681 aaaaaaaaaa aaaaaaaaaa aaaaaaa
//
LOCUS       NEUPSAS      3681 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  N.crassa arginine-specific carbamoyl phosphate synthetase (arg-2)
            small subunit gene, complete cds.
ACCESSION   J05512
KEYWORDS    carbamoyl phosphate synthetase.
SOURCE      N.crassa (strain 74-OR23-IVA)asexual phase DNA, clone pAR231, and
            cDNA to mRNA, clone pAB1.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 3681)
  AUTHORS   Orbach,M.J., Sachs,M.S. and Yanofsky,C.
  TITLE     The Neurospora crassa arg-2 locus: Structure and expression of the
            gene encoding the small subunit of arginine-specific carbamoyl
            phosphate synthetase
  JOURNAL   J. Biol. Chem. 265, 10981-10987 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.S.Sachs, 07-MAY-1990.
FEATURES       from  to/span     description
    pept       1451     1630     arginine-specific carbamoyl phosphate
                                 synthetase small subunit, exon 1 (arg-2) (EC
                                 6.4.5.5)
               1687     2868     arginine-specific carbamoyl phosphate
                                 synthetase small subunit, exon 2
    pept        870      902     ORF, exon 1
               1270     1282     ORF, exon 2
               1359     1387     ORF, exon 3
    pre-msg     752     3082     arg-2 mRNA and introns
    pre-msg     809     3082     arg-2 mRNA and introns (alt.)
    IVS        1631     1686     arg-2 intron A
    IVS         903     1269     ORF intron A
    IVS        1283     1358     ORF intron B
    signal      161      166     TGACTC box
    signal      568      573     TGACTC box
    signal     1008     1013     TGACTC box
    signal     1307     1312     TGACTC box
BASE COUNT      760 a   1093 c    863 g    965 t
ORIGIN      
        1 ggatccagac gctatatttg attgttccct cttctgtcag cgtgcggcct ccgttgtccg
       61 tgcagccgat tacgacggcc ggctctggcg caacctgggt ttgggagctt cagaagcttg
      121 aaagcgaggc gaatcgcgac tgtacaagtc cccgcaactt gagtcaccga aagcgcgggg
      181 tatccaccat ctttatctgc ggcggtaggt aggcggggac tttgcggggg ctttgcggga
      241 cccgaccact ccaactccga gattcgttgg gggtgccccg ggcaggagat gccgagctgg
      301 agtcgcgttg tttgtgaatg tcgtggctgg ggttgtgggc ataggttgcg atgtgtagtg
      361 tacgtagtac atagtaggta tgttgaacac cggatgggtg attccagttg tgaggctccg
      421 ggactagtag gggtggtaac cgtgatggtc aaagggcatg tggaacttgc agcgtcccaa
      481 cgaagtccag gtccccagtc atcccaacgc cgcgatcaat cacccctcgg tccccgtaaa
      541 tcggcctttg ggactgccta gcagcgggag tcaccgagtc ctggaccaat caggtcggcg
      601 ctgaccctgt cactgcggca gaaaaaagac aaagagctca aaaaattcta ccagatccaa
      661 tcaaatgtcg ccctttttct atttgcctca cccactttcg caccctcttt tatctccaag
      721 tttctcccct cgacactaca actcactccg tcatcaactc ttgcccatct acaaaacaca
      781 caataacccc aatcgcttcc cccgactttc ggactctttc gtccaacctt ctttcttcag
      841 ctgaacttgt cttgtcgcaa tctgccctta tgaacgggcg cccgtcagtc ttcacctctc
      901 aggtacgata acgcgatagc ccctccatcg tgctggtatc cagccaccat ccacccagtc
      961 catcaaagaa gttcccccgg atgagcaatg aagcccccat accagcctga ctcgcgatgc
     1021 tgtatcggag tagttgggac gacccgccag taacccacca acctgggtgc ttggcatcat
     1081 gccgctctga gttggttttg ggagatgtgg ccagcggtga tccccaaagt tcagctcttc
     1141 acggattccg attgcatatc tctcaagatc gcgcagacag ccatcctggg ggagacttct
     1201 tctcctcgct tctgccgctt gccgtctgtt tgtagtgtct gatcaaaact aattctgttg
     1261 cctttgtagg attacctctc aggtccgcat ccccgccgac tcccgttgac tcaccctttt
     1321 ccggctggaa tactgacgat gaattttgtg tgttacagac catctgtgga gagcccttaa
     1381 cgcataagag cctctcatca cccagcagcc gtaccaatca ccaccgcacc ccatcaccat
     1441 tcaagtcaag atgttctctc gcttggccgc tcgtctcccc aaggcttctg ccctcaatgg
     1501 cgtcgccgcc cgtcaggttc gcaacctgag ccagcccgcc attaccggca gcaagggcag
     1561 gaacatgccc gcccgtgagc cgcgcactac tgccgctgcc accggcgccg aggccacctt
     1621 cactatcagg gtgcgtacct acaactagtc ttctctcgtg gctaaaccta acattttctc
     1681 gctcaggatg gccccgtttt ccagggtacc gccttcggtg ccaacaccaa catttctggt
     1741 gaagccgttt tcaccacctc ccttgttggt taccccgagt cgatgaccga tccctcgtac
     1801 cgcggccaga ttttggtctt cacccagccc ttgattggca actacggtgt cccttcgaat
     1861 gagcgtgacg agttcaacct cctaagtact tcgagtctcc ccacatccag tgcgccggta
     1921 tcgttgtctc cgatgtcgct acccagtaca gccactggac tgctgttcag agcttgggcg
     1981 agtggtgcgc cagcgagggc attcccgcca ctctccggtg ttgacactcg tgcgattgtc
     2041 acctacctcc gtgagcaggg ttcctctctc gccaggatct ccattggcga cgagtacgac
     2101 gccgatgagg atgagggctt cattgaccct ggccagatca accttgtcaa gcgcgtgtcc
     2161 accaaggctc cctttgtggt taccaacccc aacgccaagt tccacgtcgc tctgatcgac
     2221 tgcggtgtca aggagaacat ccttcgtagc ttggtcagcc gtggcgcctc cgtcaccgtc
     2281 ttcccctaca attaccccat ccacaaggtt gccgagaact tcgatggtgt cttcatttcc
     2341 aacggccctg gtgatcccac ccactgccag gagactgttt acaacctcgc caagcttatg
     2401 gagacttccc ctattcccat catgggtatc tgccttggtc accagcttct tgctctcgct
     2461 gttggtgcca agaccatcaa gctaagtatg gtaatccgtg cccacaacat cccggctctt
     2521 gacttgacca ctggtcagtg ccacatcacc tcgcagaacc acggttacgc cgtcgatatc
     2581 agcaccctcc ctagcgactt caaggagtat ttcgtcaacc ttaacgacgg ctccaacgag
     2641 ggtatgatgc acaagacccg ccccatcttc tctacccagt tccaccccga ggctaagggt
     2701 ggtcccatgg actcttccta cctctttgac aagtacatgg agaatgtcga gcttttcaag
     2761 agcaactccc aggtctaccg tgacaacagg cctactcagt tcatgattga cattctcagc
     2821 aaggagcgtg tcggcgttga gcccactcct ctttctaacg ctgcttaacg gacgagaatg
     2881 tctttgcatt ttcttagtga acggtgtttg tcattggtct tgggttattg ttttccagca
     2941 caaaggatat gggttttgta aaagggactt tcttgggcga cgtcctctcg ttattaatgg
     3001 tttagggtgt atatgatatc ttttgttact acttttgaat gcagaatcag ataagtttag
     3061 tcggcattag gctgatatgg cgcagagagg acaatatcta gttgatacat acccatgtgc
     3121 tatcacgtct aaacgtcttc cagtcgttgt attttaccca ttcctggtga ttgccgtatg
     3181 tcttccagtc cagtacctcc cctaattcgg tgcccttcaa cattcagggt tatgttgaca
     3241 gcgtgctgta taagcatggt tgccggggaa catggtgtca ttgactggta agcagggcag
     3301 cctctggttc tgtgacacct ggttttccct ggtcttgatt tcaaactatt atcaaacaaa
     3361 tcagcgtcgt cttagtagat ggcaggtatc ttagaacaga aagtagttat tcttcgtcgg
     3421 atacgactcc aaaatcatat acgaacaagt ggatgcttta tactatgctt ctttttactg
     3481 taatcttggt ctcgctcttc gcttagcctt cttcacctgg taaccggtat aggtaccttc
     3541 gcaccctcaa ttggaaggat gcatcaaccc ctccatgcgc tgcttatcct gctatctatc
     3601 catacatttc tatgttttta cagtgtagaa tatttcttgt gtatcagggt ccgtcccgcg
     3661 atgctacccc ctgtcctgca g
//
LOCUS       HUMOSTRO     1425 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human osteopontin mRNA, complete cds.
ACCESSION   J04765
KEYWORDS    osteopontin; phosphoprotein.
SOURCE      Human adult osteoblast, cDNA to mRNA, clone HOP-10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1425)
  AUTHORS   Young,M.F., Fisher,L.W., McBride,O.W. and Termine,J.D.
  TITLE     cDNA cloning, chromosomal location and RFLP analysis of human
            osteopontin
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Young, 04-MAY-1990.
FEATURES       from  to/span     description
    pept         68     1012     osteopontin precursor
                                 /hgml_locus_uid="LK0223T" /nomgen="OPN"
                                 /map="unassigned"
    sigp         68      115     osteopontin signal peptide
    matp        176     1012     osteopontin
    mRNA    <     1     1425     osteopontin mRNA
BASE COUNT      449 a    293 c    296 g    387 t
ORIGIN      Chromosome 4q13.
        1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact
       61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc
      121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga
      181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca
      241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc
      301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga
      361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca
      421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac
      481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg
      541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga
      601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt
      661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga
      721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca
      781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc
      841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt
      901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct
      961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa
     1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa
     1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg
     1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa
     1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg
     1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact
     1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag
     1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag
//
LOCUS       MUS25OAS     1412 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse 2'-5' oligo A synthetase mRNA, complete cds.
ACCESSION   M33863
KEYWORDS    2'-5' oligo A synthetase.
SOURCE      Mouse cell line L-929, cDNA to mRNA, clone L3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1412)
  AUTHORS   Coccia,E.M., Nissim,A. and Chebat,J.
  TITLE     A full-length murine 2-5A synthetase cDNA transfected in NIH-3T3
            cells impairs EMCV but not VSV replication
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.M.Coccia, 26-APR-1990.
FEATURES       from  to/span     description
    pept         36     1139     2'-5' oligo A synthetase
    signal     1389     1394     poly-A signal
BASE COUNT      340 a    371 c    390 g    311 t
ORIGIN      
        1 ccaggctggg agacccagga agctccagac ttagcatgga gcacggactc aggagcatcc
       61 cagcctggac gctggacaag ttcatagagg attacctcct tcccgacacc acctttggtg
      121 ctgatgtcaa atcagccgtc aatgtcgtgt gtgatttcct gaaggagaga tgcttccaag
      181 gtgctgccca cccagtgagg gtctccaagg tggtgaaggg tggctcctca ggcaaaggca
      241 ccacactcaa gggcaggtca gacgctgacc tggtggtgtt ccttaacaat ctcaccagct
      301 ttgaggatca gttaaaccga cggggagagt tcatcaagga aattaagaaa cagctgtacg
      361 aggttcagca tgagagacgt tttagagtca agtttgaggt ccagagttca tggtggccca
      421 acgcccggtc tctgagcttc aagctgagcg ccccccatct gcatcaggag gtggagtttg
      481 atgtgctgcc agcctttgat gtcctgggtc atgttaatac ttccagcaag cctgatccca
      541 gaatctatgc catcctcatc gaggaatgta cctccctggg gaaggatggc gagttctcta
      601 cctgcttcac ggagctccag cggaacttcc tgaagcagcg cccaaccaag ctgaagagtc
      661 tcatccgcct ggtcaagcac tggtaccaac tgtgtaagga gaagctgggg aagccattgc
      721 ctccacagta cgccctagag ttgctcactg tctttgcctg ggaacaaggg aatggatgtt
      781 atgagttcaa cacagcccag ggcttccgga ccgtcttgga actggtcatc aattatcagc
      841 atcttcgaat ctactggaca aagtattatg actttcaaca ccaggaggtc tccaaatacc
      901 tgcacagaca gctcagaaaa gccaggcctg tgatcctgga cccagctgac ccaacaggga
      961 atgtggccgg tgggaaccca gagggctgga ggcggttggc tgaagaggct gatgtgtggc
     1021 tatggtaccc atgttttatt aaaaaggatg gttcccgagt gagctcctgg gatgtgccga
     1081 cggtggttcc tgtacctttt gagcaggtag aagagaactg gacatgtatc ctgctgtgag
     1141 cacagcagca cctgcccagg agactgctgg tcaggggcat ttgctgctct gctgcaggcc
     1201 catgacccag tgagggaggg ccccacctgg catcagactc cgtgcttctg atgcctgcca
     1261 gccatgtttg actcctgtcc aatcacagcc agccttcctc aacagattca gaaggagagg
     1321 aaagaacaca cgcttggtgt ccatctgtcc acctgttgga aggttctgtc tgacaaagtc
     1381 tgatcaacaa taaaccacag caggtgccgt ca
//
LOCUS       ACARGA       1212 bp ds-DNA             INV       17-JUL-1990
DEFINITION  A.castellanii 5S RNA.
ACCESSION   M34003
KEYWORDS    5S RNA.
SOURCE      A.castellanii (strain Neff) trophozoite DNA, clone pAc5S.1.
  ORGANISM  Acanthamoeba castellanii
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina; 
            Acanthamoebidae.
REFERENCE   1  (bases 1 to 1212)
  AUTHORS   Zwick,M.G., Wiggs,M. and Paule,M.R.
  TITLE     Cloning, sequencing and genomic organization of a 5S RNA gene from
            the eukaryotic protist Acanthamoeba castellanii
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Paule, 04-MAY-1990.
            
                  Author address: M.R.Paule
                  Colorado State University, Dept. Biochemistry,
                  Fort Collins, CO 80523
FEATURES       from  to/span     description
    RNA         489      610     5S RNA precursor
    RNA         489      607     5S RNA
BASE COUNT      195 a    374 c    329 g    314 t
ORIGIN      
        1 ctgcaggaga ttcgtcgggc tcgtgccgcg atcgtcttga tgatcgagac cgacggcccg
       61 ttcctcactc cgcgcacctt gcgccttgcc cgcgccgcaa cgagccgtct tacctcaccc
      121 acgtcctaca gagtacttac atcctccaat gtgttgtgtg cgtgtcatgt gcccaatgta
      181 tgtggtgtgt gcggtgtgtg cggtgtgtga ggtgtgtggt gtgcgcgtgt catgtctgtt
      241 acacgttctg ttgaccaaca ttttttatgg tgtgtacagc ggtcgccgac tgcaccggac
      301 gatcggtaga ggaggtggcc ggtgaaacca cgcagaccgc ggtgcgcttc ttcggtctgg
      361 accaaatccg taaggagtga atgtagcaca cccctcccct ccatgtcatt gaagcccaag
      421 ttagtgtgca atttaaatga tggtgtgacc aagagggatc acatgggcga agtccctggt
      481 aggctgatgg atacggccat actgcgcaga aagcaccgct tcccatccga acagcgaagt
      541 taagctgcgc caggcggtgt tagtactggg gtgggcgacc acccgggaat ccaccgtgcc
      601 gtatcctcct ttttgcttct tcactctcgc cactattgct tcaggtcgcc tcatcgcctg
      661 ccacagctga ctacaaactg ggtgagccaa acctgccttt tgcactggtc cagtgtacaa
      721 ggctggaaca actttgtttc acacttgctg aggtcaatca atccagcctt gttggttgag
      781 gccacaggag ggatctgcca cctttcacac catctcgtcg agaaggctgc ttgacgcttc
      841 ttcgctgtcc ttcggggctg atgtctcgac tgtgtgtccg tcttcatcgc gattgtcact
      901 tgcgttgttg ttgttgcggc tttcgtccct gtcctcatcg ccgtcgtcgt cttcatcgct
      961 gttgtcgcgc gactcactcc gcaaaccgtc tggccgtgtg ctgctcctcc tcttcatcgt
     1021 tgctgtcatc tcggtcgctg cggccacgcc tcgaagaaga cagcccgccc tcttcggccg
     1081 cctcggtccc gcccctcgca aagcggatct cgcctcgtcg gctgccgctg gccgtcgctc
     1141 cgcctggcgt cgttgtcttc cctcgcatcg tcctcaaact tgtcgtgcgt ccacttgtcg
     1201 gcaaatgtcg ac
//
LOCUS       ENTOMPX       664 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  E.cloacae outer membrane protein X (OmpX) gene, complete cds.
ACCESSION   M33878
KEYWORDS    outer membrane protein X.
SOURCE      E.cloacae (strain 2249-1) DNA.
  ORGANISM  Enterobacter cloacae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 664)
  AUTHORS   Stoorvogel,J., van Bussel,M.J.A.W.M., Tommassen,J. and
            van de Klundert,J.A.M.
  TITLE     Molecular characterization of an Enterobacter cloacae outer
            membrane protein OmpX
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Stoorvogel, 26-APR-1990.
            
                     Author address: J.Stoorvogel
                     Dept of Medical Microbiol, Univ Hospital,
                     P.O.Box 9600, 2300 RC Leiden,
                     THE NETHERLANDS
FEATURES       from  to/span     description
    pept         83      601     outer membrane protein X precursor
    sigp         83      151     outer membrane protein X signal peptide
    matp        152      598     outer membrane protein X
    signal       18       25     -35 region
    signal       43       48     -10 region
    binding      74       77     ribosome binding site
    signal      620      649     transcription termination signal
BASE COUNT      171 a    156 c    162 g    175 t
ORIGIN      
        1 atcgtgaatg acaaagagtt tcacagattt ttttgatata tttaaaactt acggacttac
       61 ttgaagcaca tttgaggtgg ttatgaaaaa aattgcatgt ctttcagcac tggcagctgt
      121 tctggctgtt tccgcaggta ccgctgtagc ggcaacttct actgtaactg gtggttacgc
      181 tcagagcgat atgcagggcg tgatgaacaa aaccaacggt ttcaacctga agtaccgtta
      241 cgagcaagac aacaacccgc tgggtgtgat cggttctttc acttacaccg agaaagatcg
      301 tactgaaaat ggctcttaca ataaaggtca gtactacggc atcaccgcgg gtcctgctta
      361 ccgcctgaat gactgggcaa gcatctacgg tgttgtaggt gttggctacg gtaaattcca
      421 gcagaccgaa aaccagggtc tgaaccgtac tgcaagcaac agcgactacg gtttctccta
      481 tggcgcaggt atgcagttca acccaatcga aaacgttgct ctggacttct cctatgagca
      541 gagccgtatc cgcaacgttg acgttggcac ctggatcgcg ggcgtaggtt accgcttcta
      601 atcacttcgg tgagccagta aaaaatccgc ccaatgtggc ggattttttt tgtttgtacc
      661 ctct
//
LOCUS       NEUMPPX      2038 bp ss-mRNA            PLN       17-JUL-1990
DEFINITION  N.crassa matrix processing peptidase (MPP) mRNA, complete cds.
ACCESSION   J05484
KEYWORDS    matrix processing peptidase.
SOURCE      N.crassa, cDNA to mRNA.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 2038)
  AUTHORS   Schneider,H., Arretz,M., Wachter,E. and Neupert,W.
  TITLE     Matrix processing peptidase of mitochondria: Structure-function
            relationships
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Neupert, 17-APR-1990.
FEATURES       from  to/span     description
    pept         41     1774     matrix processing peptidase
BASE COUNT      439 a    626 c    552 g    421 t
ORIGIN      
        1 cccacattac gctgccgcat cacaattcct tgttgcagcc atgctgaatc gcttccggcc
       61 agcgcggcta gtagcccaat cctccagatg cttgcccttg acgagggcgc gggcaggtcc
      121 cttgcccgtt aacaatgcca ggactttggc tacgagagcc gctgctgtca acaccaagga
      181 accgaccgaa cgcgacaaca tcaccactct ctccaatggt gtccgtgtcg cttccgagga
      241 ccttcccgat gccttctccg gtgtaggtgt ctacatcgac gcggggtccc gatatgagaa
      301 cgactatgtc cggggtgcca gtcacatcat ggaccggcta gccttcaagt ctacaagtgc
      361 gaggactgcg gacgaaatgc tcgaaactgt tgagaagctc ggtggtaaca ttcagtgcgc
      421 ttcttcgcgc gagtctatga tgtaccaggc ggccaccttc aacaaggcta ttcccaccgc
      481 tgttgagctc atggccgaga ccatccgcga tcccaagctt acggacgagg agctggaggg
      541 acagatcatg acggcgcaat atgaggtcaa cgagatctgg tccaaggccg aactgatcct
      601 gcccgagttg gtgcacatgg ctgccttcaa ggacaacact cttggcaacc cgttgctttg
      661 tcccaaggag aggttggatt acatcaaccg ggatgtcatc caaacatacc gcgacgcttt
      721 ctacaggccc gagcgccttg ttgttgcctt tgctggtgtg cctcatgaga gggccgtcaa
      781 gctcgcagag aagtactttg gtgatatgaa ggcctccgat gctcccggtc tctcgaggac
      841 aggttccgaa acctccgtcg actcgctagt gtccgagtcc agcgaggcct cgagtgaatc
      901 ttcatcatcc tcctcggact cttccgagtc gagtggcggg ctgctctcca agcttttctc
      961 tcccaaggcc aagaaagcca cccccaaccc cttcctcacc cgggtaccta ttagcaccga
     1021 agacttgact cggcctgctc actacacagg cggtttcctc accctcccat cacagccccc
     1081 accgctcaac cccaaccttc ccacatttac tcacatacag ctcgccttcg agggcctcgc
     1141 catctcggac gacgacatct acgccctcgc caccctgcag accctcctcg gcggcggcgg
     1201 ctccttctct gccggcggtc ccggcaaggg catgtactcg cgtctctaca ctaacgttct
     1261 caaccagcac ggctgggttg agtcctgcgt ggccttcaac cactcataca cggactcggg
     1321 tctcttcggc atcgccgcct cgtgctaccc gggtcgcacc ctgcccatgc tccaggtcat
     1381 gtgccgcgag ctgcacgccc tcaccaccga ccatggctac tcggccctgg gcgagctcga
     1441 ggtttcgcgc gccaagaacc agctccgcag cagcctcctg atgaacctcg agagccgcat
     1501 ggtcgagctc gaggatctgg gccgccaagt tcaggttcac ggtcgcaaga tcccggtccg
     1561 cgagatgacg cgccgtatca acgagctgac ggtcaaggac ctccgaaggg tcgctaagcg
     1621 cgtggttggt ggcatggcga ataacgccgg ccagggaagc ggtgcgccga cggtggtgct
     1681 gcaggaggcg acggtgcaag gactcaagac tacggagctg gggtgggatc agatccagga
     1741 tacaattgct cagtggaagc tcggtagacg gtaaacgttt gtcaagggga aaaaaagagt
     1801 agggcgtgga gaagttatgt aagaggagcg ctgtattgaa cttggcgaca cgcacacacc
     1861 ggaacgataa aggcgtttta ggttccccac gagcataggg aagaggctag atggttgctc
     1921 tgtacaatcg caacttttct tggtgagtta tacaagatgt gtccaggtac atctttgcct
     1981 taccatactg tacgatagca atgaagattt tctgatatat caaaagtcaa aagtcaaa
//
LOCUS       XELXOTCH     9163 bp ss-mRNA            VRT       17-JUL-1990
DEFINITION  X.laevis Xotch protein mRNA, complete cds.
ACCESSION   M33874
KEYWORDS    Xotch.
SOURCE      X.laevis, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 9163)
  AUTHORS   Coffman,C., Harris,W. and Kintner,C.
  TITLE     Xotch, the Xenopus homolog of the Drosophila notch gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Coffman, 26-APR-1990.
            
                        Author address: C.R.Coffman
                        University of California
                        Dept. of Biology-B022,
                        La Jolla, CA 92093
FEATURES       from  to/span     description
    pept        346     7917     Xotch protein
    mRNA    <     1     9163     Xotch mRNA
    signal     9146     9151     poly-A signal
    rpt         409     4632     EGF-like repeats
    rpt        4684     5025     lin-12/Notch repeats
    rpt        5953     6591     cdc-10/SWI-6 repeats
BASE COUNT     2499 a   2192 c   2188 g   2284 t
ORIGIN      
        1 gaattccgga cgcctggtgg aactaaacag actggaatgc actggaacac ggaagcctat
       61 gcggggatac attgtagcgc tgagcccttc cgctgtaaca ttgtagcgct gagccgctca
      121 tacactgtag cgctgtaact tcaggctgtg cgcaaactga gtgcgctatt ctctctccaa
      181 tctggaaact ctgtcgcttt tggaccaggg attgcactga cttggattat cttcaccttt
      241 ctctaaggga tttatagatt tttgccttgg gacctttatc ctccccttgt cctgggacag
      301 ggaccttcat caggatctat gttcctatag tgagactgca ggaaaatgga tcggataggg
      361 ctggctgtgc tgctctgctc cctgcctgtc ctcacacaag gtcttcgatg cactcagact
      421 gcagaaatgt gcctgaacgg tgggaggtgc gaaatgaccc ctggtggaac tggggtgtgc
      481 ctatgcggta atttgtattt cggtgaaagg tgccagttcc ccaatccctg caccataaag
      541 aatcagtgta tgaactttgg aacctgcgag cctgtgctac aggggaacgc catagacttc
      601 atctgccact gccccgtggg tttcaccgac aaagtgtgcc tgactcccgt ggataacgcc
      661 tgcgtcaaca acccgtgccg gaatggagga acctgtgaac ttttgaatag cgtgactgaa
      721 tacaaatgca ggtgtccccc aggatggaca ggggattcgt gccagcaggc agatccatgt
      781 gcctctaatc catgtgccaa tggtgggaag tgtttgccct ttgagatcca gtatatctgc
      841 aagtgtcccc ctggattcca tggagctacc tgcaagcagg acattaatga gtgcagccaa
      901 aatccctgca agaatggcgg ccagtgcatc aatgagtttg gatcataccg ctgcacctgt
      961 cagaacaggt ttactggaag aaactgtgat gagccatatg tcccctgcaa cccatcgccg
     1021 tgcttgaatg gagggacctg ccgccaaact gatgacacct cctatgattg cacttgtctt
     1081 ccaggcttta gtggacaaaa ttgtgaagag aacattgacg actgcccgag caacaactgc
     1141 cgcaatggtg ggacctgtgt ggatggagtc aacacctata actgccagtg tccacctgac
     1201 tggacaggcc aatactgcac ggaggatgtg gacgaatgcc aactgatgcc aaacgcttgc
     1261 caaaatgggg gaacctgcca caacacctac ggtggataca actgtgtttg tgtcaacggc
     1321 tggaccggag aggactgcag cgagaacatt gacgattgtg ccaacgccgc ctgccatagt
     1381 ggtgccacct gtcatgatcg cgtggcatct ttctactgcg agtgcccaca cggccggaca
     1441 ggtctccttt gccacctgga taatgcctgt atcagcaatc cctgcaacga aggctccaac
     1501 tgtgacacca acccagtcaa cgggaaagcc atttgcacct gcccacctgg gtacactggt
     1561 cctgcttgca acaatgatgt ggatgagtgc tctctgggtg ccaatccatg tgagcatggt
     1621 gggagatgta ccaacaccct gggttccttc cagtgcaact gcccccaggg atatgcagga
     1681 ccccgctgcg agattgatgt caatgaatgt ctttccaacc cgtgccaaaa tgattccaca
     1741 tgcctggacc agattggaga gtttcagtgt atatgtatgc caggttacga aggcctttac
     1801 tgtgaaacca acatagatga atgtgccagc aacccctgcc tacataatgg caagtgcatt
     1861 gataaaatca acgagttccg ctgcgactgt cccacaggct tcagtggaaa cctctgccag
     1921 catgactttg atgagtgtac cagcactccg tgcaaaaatg gagccaaatg cttggatggt
     1981 ccaaattctt acacctgcca atgtacagaa ggtttcaccg ggcgccactg cgaacaagac
     2041 atcaatgaat gcattcctga tccgtgccac tacggcacct gcaaagacgg cattgctacc
     2101 ttcacatgcc tgtgccggcc tgggtacacg gggcgtctgt gtgataatga catcaatgag
     2161 tgcctgagca agccgtgcct aaacgggggg cagtgtacag accgagagaa tggatacata
     2221 tgcacttgtc cgaaaggaac cacaggggtg aattgcgaaa ccaaaataga tgactgtgcc
     2281 agtaacctgt gcgataatgg aaagtgcatt gacaagattg atggctacga gtgcacctgc
     2341 gagcctggtt acacaggcaa gttgtgcaat attaacatta acgaatgtga cagcaatcca
     2401 tgccgcaatg gagggacttg caaagaccag atcaatggct tcacatgtgt gtgcccagat
     2461 ggctaccatg accacatgtg cctgtctgaa gtaaatgaat gcaacagcaa tccatgtatc
     2521 cacggggcgt gccatgatgg agtcaatggg tataaatgtg actgtgaagc aggatggagc
     2581 ggtagcaatt gtgacatcaa taacaatgag tgtgaatcca acccgtgtat gaatggtggt
     2641 acctgcaaag acatgaccgg agcatacatc tgtacttgca aagcaggatt tagtgggcca
     2701 aactgccaaa ctaacatcaa cgaatgttcc tctaacccct gcttgaacca tggaacgtgt
     2761 attgatgatg ttgctggtta caaatgcaac tgcatgttac cttatacagg tgccatttgt
     2821 gaggcagttt tggctccgtg tgcaggcagt ccttgtaaaa atggtggcag atgtaaagaa
     2881 tcagaagact ttgaaacctt ctcttgtgaa tgtccacccg ggtggcaagg gcaaacatgt
     2941 gagattgaca tgaatgaatg cgtaaatcgc ccttgccgta acggtgcaac gtgtcaaaac
     3001 accaatggca gctacaagtg caactgcaaa cctggttaca ctgggcggaa ctgcgagatg
     3061 gacattgatg actgccaacc aaatccttgt cacaatggag gatcctgttc tgacggcatt
     3121 aatatgtttt tctgtaactg cccggccggg ttccgggggc ctaagtgcga agaggacatt
     3181 aacgagtgtg ccagtaaccc gtgcaaaaat ggagccaact gcactgattg tgtaaacagc
     3241 tacacatgca cctgccagcc tggatttagt ggcatccact gtgaaagcaa cacccctgac
     3301 tgcaccgaga gctcgtgctt taatggtggc acctgcattg atgggatcaa tacgtttaca
     3361 tgccagtgcc cacctggatt taccggaagc tactgccaac atgatatcaa tgaatgtgac
     3421 tccaagcctt gtttgaacgg aggcacctgc caggacagtt acgggacata caagtgcacc
     3481 tgtccgcaag gatacactgg gctgaactgc cagaacctgg tgcgttggtg tgactcttct
     3541 ccctgtaaaa atggaggaaa gtgctggcag acaaacaact tttatcgttg tgagtgcaag
     3601 agtggctgga ctggtgttta ctgtgatgtc cccagtgtct cctgcgaggt tgctgctaag
     3661 cagcaaggtg tggatatcgt tcatctgtgt cgcaattctg gcatgtgtgt ggacactgga
     3721 aacacacatt tctgccgctg tcaggctggc tacacaggca gttactgcga ggaacaggtg
     3781 gacgaatgct cgccgaatcc ttgtcagaac ggtgccacat gcacagacta cttaggtggc
     3841 tactcttgcg agtgtgttgc cggctaccat ggagtaaact gctccgagga gatcaatgag
     3901 tgtttgtccc acccgtgcca gaatggagga acctgtattg atctcattaa cacctacaaa
     3961 tgctcctgcc ccagaggaac tcaaggtgtg cactgtgaaa ttaatgtgga tgattgcacg
     4021 cccttctacg actccttcac cttggaaccc aaatgcttca acaatggcaa atgtatagac
     4081 cgcgtgggtg gctataactg catctgccct ccaggctttg taggggaacg ctgtgaaggc
     4141 gatgtgaacg agtgcctgtc taatccctgt gactcccgtg gcacccagaa ctgcattcaa
     4201 ctggtgaatg attaccggtg tgagtgccgg cagggattca caggaaggcg ctgtgaatct
     4261 gttgtggatg gttgtaaggg aatgccctgc cgaaacggtg gaacatgtgc tgtcgccagc
     4321 aatacagaac gtggatttat ttgcaaatgc cctcctgggt ttgacggagc cacttgcgag
     4381 tatgattcac gaacctgcag caaccttcgc tgccagaatg gcggcacttg catttcggtg
     4441 ctgacgagtt ccaaatgcgt gtgctcggaa ggctatactg gcgccacatg tcagtaccct
     4501 gttatcagtc cttgtgcttc ccacccttgt tacaatggag gaacctgcca gttctttgct
     4561 gaggaacctt tcttccagtg cttctgtcca aaaaacttta acgggctctt ctgccatatc
     4621 ttggattatg agtttcccgg gggactaggc aagaatatta ctcctcctga taatgacgat
     4681 atctgcgaaa acgagcagtg ctctgagctg gctgacaaca aagtctgcaa cgccaactgc
     4741 aacaaccatg cttgcggatg ggatggagga gactgctctc tcaacttcaa tgacccatgg
     4801 aagaactgca cccagtcgct gcagtgctgg aaatatttca atgatggcaa atgcgattct
     4861 cagtgcaaca acactgggtg tctgtacgac gggtttgact gccagaaagt tgaggttcag
     4921 tgcaaccctt tgtatgacca gtactgcaag gatcactttc aagatggcca ctgcgaccaa
     4981 ggctgtaaca atgcagagtg cgaatgggat ggcctggact gcgccaacat gccagagaac
     5041 ctggctgaag gcaccctggt gttggtcgtc ctcatgcccc ctgaaaggct taagaacaac
     5101 tctgtcaatt tcctccgtga gctgagccga gtgctccaca caaatgtggt gttcaagaaa
     5161 gacagcaaag gcgaatacaa gatctaccct tactatggca atgaggaaga acttaaaaaa
     5221 catcacatca agaggtcaac tgattactgg tcggatgctc cttctgccat cttcagcaca
     5281 atgaaagaat ccattttact aggaaggcat cgtagggagt tggatgaaat ggaagttagg
     5341 gggtcaatcg tatacttgga aatagacaac cgccagtgtt acaaatcctc ctctcagtgc
     5401 ttcaacagtg ctacagatgt tgccgcattc cttggggctt tggcctccct tgggagtctg
     5461 gacacccttt cctataagat tgaggcagtg aaaagtgaga acatggagac ccccaagccc
     5521 tctactctct atcccatgtt atccatgttg gtcattccgt tattaatcat cttcgtcttc
     5581 atgatggtca tcgttaataa gaagcgtcgc cgtgaacacg acagctttgg ttccccgacg
     5641 gctttattcc aaaagaaccc agcaaaaaga aacggcgaga cgccttggga ggattctgtt
     5701 ggcttaaagc ccattaagaa tatgacagat ggctctttta tggacgataa tcagaatgaa
     5761 tggggagatg aggagaccct ggaaaacaag aggttcaggt ttgaagagca agtgattctc
     5821 ccagaacttg ttgacgacaa aactgacccc cgacagtgga cacggcagca ccttgacgcc
     5881 gctgatctgc gcatttcatc catggcccca acgccgccgc agggagagat tgaggccgac
     5941 tgtatggatg tcaatgtccg tggccctgat ggcttcaccc cacttatgat tgcctcctgt
     6001 agcggaggtg gattagagac tggaaacagc gaagaggaag aggatgcttc agctaatatg
     6061 atttccgact tcattgggca gggtgcccaa ctgcataacc aaaccgaccg cactggcgag
     6121 acagcacttc atctggccgc aagatacgca cgtgctgatg cagccaagcg cctgttggaa
     6181 tcgagtgcgg acgccaacgt ccaggataac atgggcagga ctcctctcca tgcagcagtg
     6241 gcggctgatg ctcagggcgt attccagatt ctcattcgga accgagcgac agacttagac
     6301 gcccgcatgt ttgatggcac tacccctctg atcctggccg ctcggctggc cgtggaaggg
     6361 atggtggagg agcttatcaa tgctcatgca gatgtcaacg ctgttgatga atttggaaaa
     6421 tctgctttgc attgggcagc ggctgtgaat aacgttgatg ctgcagctgt gcttctcaag
     6481 aatagtgcaa ataaggacat gcaaaacaac aaggaagaga catccctgtt cttggccgca
     6541 agagaaggca gctacgaaac tgccaaagtc cttttggatc actacgccaa ccgtgacatc
     6601 acagaccaca tggatcggct gcctcgtgac atcgcccaag aacgcatgca ccacgacatt
     6661 gttcacctgc tggatgaata taaccttgtg aagagcccaa cgctgcacaa tggtccgttg
     6721 ggagcaacga cattatcacc tcccatctgc tcccctaatg gttacatggg gaacatgaag
     6781 ccttctgttc agagcaagaa agcccgcaag cccagtatca aaggtaatgg ctgcaaagag
     6841 gccaaagagc tgaaagccag aaggaaaaaa tctcaagatg ggaaaacaac tctcttggat
     6901 tctggcagtt ctggagtgtt gtccccagtg gactccctgg agtcaacaca tggatactta
     6961 tcagatgtgt cttctcctcc gctgatgacc tctccgtttc agcagtctcc atccatgcct
     7021 ctgaaccact tgacaagcat gccagagtcc cagcttggca tgaatcacat aaacatggcc
     7081 accaagcagg aaatggcagc aggttccaac agaatggctt ttgatgccat ggtgccacgt
     7141 ctgacccatc tcaatgcctc aagccctaat accatcatga gcaatggatc catgcatttc
     7201 actgtgggag gagctccgac tatgaacagc caatgtgact ggttagctag gctgcagaat
     7261 gggatggtcc agaatcagta tgacccaatc agaaatggca tccaacaagg caatgctcaa
     7321 caagctcaag ctcttcagca tggccttatg acctcgctcc ataatggtct gccagcaaca
     7381 actctctccc aaatgatgac ctatcaggcc atgcccaaca caaggctagc caatcagcca
     7441 catctaatgc aagcccagca aatgcaacag cagcaaaact tgcagttgca ccagagcatg
     7501 cagcaacaac atcacaattc cagcacgacc tctactcaca tcaactcacc attctgcagc
     7561 agtgacataa gccagacgga cctgcagcaa atgtcaagca acaacattca ttcagtaatg
     7621 ccccaggaca ctcagatatt tgctgcatct ctgccttcca atcttacgca gtctatgaca
     7681 actgcacaat ttttaacccc cccttcccag catagctact cctccccaat ggacaataca
     7741 ccaagccatc aactacaagt accagaccac ccgttcctga cgccttctcc tgagtcacct
     7801 gaccagtggt caagctcctc ccctcattcc aatatgtctg actggtcaga aggaatatca
     7861 agtcctccca cgagtatgca gcctcagcgc acccacatac ctgaagcttt caagtaaaaa
     7921 aaaaaaaagt ttaaaaaaat gtaaaatatt tttaaagaca ctgagagaga ctttaagaga
     7981 ctgaaggaaa tttttatatg gtttttatac ttaaaataac agaacatttg aattttctag
     8041 tatttattta tatatacgtt tgacctaaaa cactgccctt ttatttataa gctttttttc
     8101 atttttagga caaaatattt taacttcttg ccttgaaagt ttttcagttc taaatcttat
     8161 gaaattggtt cctgcctggt attgaaaacg gcaatgtatt tattttttat ttacctgaat
     8221 agtatacagg aacaaaccac tggggtgggg gggttatcgg gatgtgtatt tagcagaaaa
     8281 aagattttct ataaaatgaa atctttcagg ttttcattta tagcactaaa aagattccag
     8341 tattaatttt aaattaaatc atgaagaaga tgctccgaat acctctcgct gtcagggagg
     8401 tgcccaatag gttttttttt tttgtttttt ttttggaaaa cctgaaactc ttggtaagta
     8461 cagaaagaaa gcataaaata caggagggca cgcctaggtc tgatgcattc tcaatgggaa
     8521 taaaatcctg aaaagtgact cattttaaaa aattatattt gttagaaaaa acaaactgcc
     8581 attttgaatc ccttgtgtcg catggggtat tggtcagaaa atgcatgcaa tttttggtac
     8641 aagcccgtgg catattttct acagtttttt tttttttttt tttttttttg taaatatcta
     8701 tggtggattt tgtttcaaaa atctatttgt atatcctgat ttatattgct tatccgagta
     8761 ggagtgcagg ctggatggaa ccatgccgaa gacaaatcgc agattggaca ttgtttttta
     8821 tttttttgcc tacagagact atttgcacat gaacaacaac aaaaaaaaga agatatattt
     8881 aaatataaaa ttttatacaa acgaaaaaac aaaaaacaaa aactgtcgat ttgcctctag
     8941 ttccacttct tctgtgatac gatgcagttg ggcagttcat tccttatgcc gtactgtaca
     9001 aagatctgta gaatttagtt gtttttcctg aacaaataaa tatgtaatgt atcgagcaga
     9061 aatgtagttt acaaaaaaaa ctatttgttg ttgttgctgt taacacttct gtaaacaaat
     9121 tttttttggt ttgtttattt gttgtaataa aattgtacaa aat
//
LOCUS       TRCTC26       638 bp ss-mRNA            INV       17-JUL-1990
DEFINITION  T.cruzi AMP-inducible protein (TC26) mRNA, partial cds.
ACCESSION   M34062
KEYWORDS    AMP-inducible protein.
SOURCE      T.cruzi (strain X10/4) trypomastigote, cDNA to mRNA, clone G26.
  ORGANISM  Trypanosoma cruzi
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 638)
  AUTHORS   Heath,S., Hieny,S. and Sher,A.
  TITLE     A cyclic AMP inducible gene expressed during the development of
            infective stages of Trypanosoma cruzi
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Heath, 08-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1  >   638     AMP-inducible protein (TC26) (AA at 1)
BASE COUNT      154 a    174 c    168 g    142 t
ORIGIN      
        1 accccaccag accaactggt aatggtagcg accggcgctc agcttggaat tccgtgcgct
       61 gcttctagcc ctagagaagc tgatgatccc ccgtatccga cacaggcgta caaccctgct
      121 tgtggttacg gacagtcagt ctcttctagc ggctctaaac aagggcccgc tcaatcagac
      181 agactggacg gaggatcaga tctggcggcg tctcttgaca ctgacgtgtg ctggctggtc
      241 gtgcacctgc agttttgtta cggacattgt ggagtacatg ctaacgagct tgcagatcag
      301 tatgcgagga actatggaaa gtggacaata cacggagcaa ggaatcgcac ctttatggca
      361 tacggatctg ctgacgtgtt ttactaccca gctcaccaac aagtggcgta ctaccattcg
      421 tcaagacact catcgctacc tgctttgcgg cacaaggcca tcagatctcg cggtaaggac
      481 ctgatcactc aggaagttct acaccgtcag aactggttca cctcgcaagg gcaaggtgcg
      541 gggaatctga gctctggggc cgactatctg ggccgtgaga gattgcacga accaatgccg
      601 tttctgcaac atctcaccgg aacagtctgc atatatat
//
LOCUS       DROANNIX     1104 bp ss-mRNA            INV       17-JUL-1990
DEFINITION  D.melanogaster annexin IX mRNA, 3' end.
ACCESSION   M34068 J05501
KEYWORDS    annexin IX.
SOURCE      D.melanogaster adult head, cDNA to mRNA, clone pD3-6.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1104)
  AUTHORS   Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and
            Suedhof,T.C.
  TITLE     Two novel annexins from Drosophila melanogaster: Cloning,
            characterization and differential expression in development
  JOURNAL   J. Biol. Chem. 265, 11382-11388 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Suedhof,  04-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      891     annexin IX (AA at 1)
    signal     1091     1096     poly-A signal
BASE COUNT      281 a    300 c    306 g    217 t
ORIGIN      
        1 attctgcgca aggcgatgaa gggcttcggc accgacgaga aggccatcat cgagatcctg
       61 gccaggcgtg gcatcgtcca gcgtttggag atcgctgagg cgttcaagac ctcgtacggc
      121 aaggatctga tctcggacct caagtccgag ctgggcggca agttcgagga tgttatcctg
      181 gctctgatga cgccgctgcc ccagttctat gcccaggagc tgcacgacgc catctcggga
      241 ctgggaaccg acgaggaggc catcatcgag atcctctgca cgctgtccaa ctacggcatt
      301 aagaccattg cccagttcta cgagcagagc ttcggcaagt ccctagagtc cgacctaaag
      361 ggcgacacca gtggccactt caagcggctg tgcgtctcgc tcgtccaggg caaccgggat
      421 gagaaccagg gcgtggacga ggccgcggcc atcgccgatg cccaggctct gcacgacgcc
      481 ggtgagggac agtggggcac agatgagtcc accttcaact cgatcctgat cacccgctcc
      541 taccagcagc tgcgccagat cttcctcgaa tacgagaatc tgtcgggcaa cgacatcgag
      601 aaggccatca agcgggagtt tagcggctcc gtggagaagg gtttcctggc catcgtcaag
      661 tgctgcaagt ccaagatcga ctacttttcg gagcgcctgc acgactccat ggccggcttg
      721 ggcaccaagg acaagacgct gatccgcatc atcgtcagcc ggtcggagat cgatctgggt
      781 gacatcaagg aggcattcca gaacaagtac ggcaagagct tggagtcctg gatcaaggag
      841 gatgccgaga ccgatattgg atacgtcctg gtcactctta cggcttggta gacggaagca
      901 gccggaatat ccgaatatct atgagcaata ccccactgtt caagtagaaa atgccaaaaa
      961 aaaaaacgtt gcatttcccc aaaaaaaagt ataacaaaag cgaagaacaa atggagttgg
     1021 tctatataca gtagttgtga tgtgttctaa aaatccaatc tacaaaacgc ttagtatttt
     1081 ccctctgtgc aataatcgga attc
//
LOCUS       DROANNX      1192 bp ss-mRNA            INV       17-JUL-1990
DEFINITION  D.melanogaster annexin X mRNA, complete cds.
ACCESSION   M34069 J05501
KEYWORDS    annexin X.
SOURCE      D.melanogaster adult head, cDNA to mRNA, clone pD3-16.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1192)
  AUTHORS   Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and
            Suedhof,T.C.
  TITLE     Two novel annexins from Drosophila melanogaster: Cloning,
            characterization and differential expression in development
  JOURNAL   J. Biol. Chem. 265, 11382-11388 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Suedhof , 04-MAY-1990.
FEATURES       from  to/span     description
    pept         91     1056     annexin X
    signal     1175     1184     poly-A signal
BASE COUNT      271 a    348 c    363 g    210 t
ORIGIN      Chromosome 93B or 19A-4,7.
        1 gaattccaaa agtcccagga gaaagactga ttcgtgtgaa gtcgtctact gaagagccac
       61 aaggaaccca aggaatcttc cagctgcata atggaataca aacccgtgcc cacggttaag
      121 gacgcagctc ccttcgacgc ctcccaggac gcccaggtgc tgcgggcggc gatgaaggga
      181 ttcggcaccg acgagcagga aatcatcgac gtgctcgtcg gcaggagcaa ccagcagagg
      241 cagacgatca aggcggttta cgaagcggag ttcgagcgcg acctggtgga cgatcttaag
      301 gacgagctgg gaggcaagtt cgaggacgtg atcgtgggtc taatgatgcc accagtggag
      361 tacctgtgca agcaactgca cgccgccatg gcgggcatcg gaaccgagga ggccacgctc
      421 gtcgagatcc tgtgcaccaa gaccaacgag gagatggccc agatcgtggc cgtctacgag
      481 gagcgctacc agcgcccgct ggccgagcag atgtgcagcg agacctccgg ctttttccgc
      541 cgcctgctca cgctgatcgt gaccggagta cgtgacggac tggacacgcc cgtcgacgtc
      601 ggtcaggcca aggagcaggc cgcccagctc tactcggccg gcgaggccaa gctgggaacg
      661 gacgaggagg tcttcaaccg gatcatgtcg cacgccagct tcccgcagct gcgacttgtc
      721 ttcgaggagt acaaggtgct ctccgggcag accatcgagc aggccatcaa gcacgagatg
      781 tccgacgagc tgcacgaggc catgatggcc atagttgagt gcgtccagtc accggcggcc
      841 ttcttcgcca accgcctcta caaggccatg aatggcgccg gcaccgatga cgccacgctc
      901 atccgcatca tcgtcagccg ctcggagatc gacctggaga ccattaagca ggagttcgag
      961 cggatctaca accgtacgct gcacagcgcc gtggtggacg cggagacctc tggtgactac
     1021 aagcgggccc tgacagccct acttggatcc gcctaggccc gaggatgtgg cagctggtcc
     1081 gcccaatatt ttattcgtgt taatagcttt gatcgtagtg tgccttttag gaaaatcgct
     1141 tttaatgtcg tctgcgcatg cgcacactgt tggcaataaa taaacggaat tc
//
LOCUS       ECODKSA      1273 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  E.coli dnaK suppressor (dksA) gene, complete cds.
ACCESSION   M34945
KEYWORDS    dnaK suppressor.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1273)
  AUTHORS   Kang,P.J. and Craig,E.A.
  TITLE     Identification and characterization of a new Escherichia coli gene
            that is a dosage-dependent suppressor of a dnaK deletion mutation
  JOURNAL   J. Bacteriol. 172, 2055-2064 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        229      441     ORF 1
    pept        619     1074     dnaK suppressor (dksA)
BASE COUNT      343 a    301 c    333 g    296 t
ORIGIN      
        1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa
       61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc
      121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata
      181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg
      241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt
      301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg
      361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa
      421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg
      481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg
      541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc
      601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt
      661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa
      721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa
      781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac
      841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac
      901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc
      961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg
     1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag
     1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga
     1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct
     1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt
     1261 acgcatagaa gat
//
LOCUS       STFCYCLI     2180 bp ss-mRNA            INV       17-JUL-1990
DEFINITION  Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds.
ACCESSION   M33880
KEYWORDS    cyclin B.
SOURCE      Starfish (A.pectinifera) egg, cDNA to mRNA, clone
            lambda-gt10-cyc10.
  ORGANISM  Asterina pectinifera
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa; 
            Stelleroidea; Asteroidea; Spinulosida; Asterinidae.
REFERENCE   1  (bases 1 to 2180)
  AUTHORS   Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T.
  TITLE     The starfish egg mRNA responsible for meiosis reinitiation encodes
            cyclin
  JOURNAL   Dev. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Kishimoto, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        126     1313     cyclin B (CYC)
    mRNA    <   126     2180     cyclin B mRNA
BASE COUNT      650 a    440 c    482 g    608 t
ORIGIN      
        1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg
       61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga
      121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt
      181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc
      241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag
      301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca
      361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa
      421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct
      481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac
      541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt
      601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc
      661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc
      721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc
      781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg
      841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt
      901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca
      961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca
     1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag
     1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc
     1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat
     1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca
     1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg
     1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt
     1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt
     1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg
     1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa
     1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt
     1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga
     1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc
     1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca
     1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac
     1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat
     1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag
     1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat
     2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg
     2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt
     2161 gtaataaagt taatggaaat
//
LOCUS       MUSH2A1X      929 bp ds-DNA             ROD       17-JUL-1990
DEFINITION  Mouse histone H2A.1 gene, complete cds.
ACCESSION   M33988
KEYWORDS    histone H2A.1.
SOURCE      Mouse (BALB/c) DNA, clone 53.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 929)
  AUTHORS   Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and
            Sch|mperli,D.
  TITLE     Structure of a mouse histone gene cluster
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Sch|mperli, 02-MAY-1990.
            
                      Author address: Dr.Schmperli
                      Zoologisches Institut, Baltzerstrasse 4
                      CH-3012 Bern
                      SWITZERLAND
FEATURES       from  to/span     description
    pept        164      556     histone H2A.1
    mRNA        119      624     histone H2A.1 mRNA
BASE COUNT      238 a    247 c    222 g    222 t
ORIGIN      
        1 gagctcaaat tctggctttc tattgggtac gatatattaa ccaatgggag aaacacaaac
       61 agaatacctc cagttagtat aaatgcttgc tgttcagttg cagaatttac tatatattct
      121 tttcctttct ctgctttgcc tttactgata cttaaacgca tacatgtctg gacgcggaaa
      181 gcaagggtgc aaggcccgcg ctaaggccaa gacccgctcc tcccgggccg gcctgcagtt
      241 ccccgtgggc cgcgtgcacc ggctgctccg caagggcaac tactcggagc gcgtgggcgc
      301 cggcgccccg gtgtacctgg cggctgtgct ggagtacctg acggccgaga tcctggagct
      361 ggcgggcaat gcggcccgcg acaacaagaa gacgcgcatc atcccgcgcc acctgcagct
      421 ggccatccgc aacgacgagg agctcaacaa gctgctgggc cgcgtgacca tcgcgcaggg
      481 cggcgtcctg cccaacatcc aggccgtgct gctgcccaag aagaccgaga gccaccacaa
      541 ggccaagggg aagtaatctg gcgattgtct gtactgccca gttgaaagtt aaccaaaaca
      601 aaggctcttt tcagagccac ccacatcttt ccataaaatg agctgccacc tcgtgaaacg
      661 ttcttccact acagttttta tactacatat gaaaaagtta cgaagtagct ttcaatctta
      721 gtaaattgat tttaatactg ttagtccctg cgataaatct tacgaccttc cttagtttga
      781 gtcaaaagtg tgtaagagat gaaaccttta gaacatacta taaattttta gtagaaattt
      841 ggcacccagg tttgtcattc acgtcacgat tgtctagagc ataatggtag taagggctaa
      901 gggccattaa atcccacttc catagtttc
//
LOCUS       MUSH2A1XX     693 bp ds-DNA             ROD       17-JUL-1990
DEFINITION  Mouse histone H3.2 gene, complete cds.
ACCESSION   M33989
KEYWORDS    histone H3.2.
SOURCE      Mouse (strain BALB/c) DNA, clone 53.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 693)
  AUTHORS   Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and
            Sch|mperli,D.
  TITLE     Structure of a mouse histone gene cluster
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Sch|mperil, 02-MAY-1990.
            
                       Author address: D.Schlmperil
                       Zoologisches Institut, Baltzerstrasse 4
                       CH-3012 Bern
                       SWITZERLAND
FEATURES       from  to/span     description
    pept        151      561     histone H3.2
    mRNA        116      611     histone H3.2 mRNA
BASE COUNT      159 a    206 c    184 g    144 t
ORIGIN      
        1 caggatttag aagcagaggc tgaccaatcc caacaaagcg cgggcccttt gaatgttctt
       61 cggtccaata gcggatagtc tgattgtata aaaggtggac agcgccttgc agctcactat
      121 agtgtcagtc tattttccct tgttaaagtc atggctcgta ctaagcagac cgctcgtaag
      181 tccaccggtg gcaaggcccc gcgtaagcag ctggccacca aggccgcccg caagagcgcc
      241 ccggccaccg gcggcgtgaa gaagcctcac cgctaccgtc ccggcaccgt ggcgctgcgc
      301 gagatccggc gctaccagaa gtcgaccgag ctgctgatcc gcaagctgcc gttccagcgc
      361 ctggtgcgcg agatcgcgca ggacttcaag accgacctgc gcttccagag ctcggctgtc
      421 atggctctgc aggaggcgag cgaggcctac cttgtgggtc tgtttgagga caccaacctg
      481 tgcgccatcc acgccaagcg tgtcaccatc atgcccaagg acatccagct cgcccgccgc
      541 attcgtgggg agagggctta aacgaaacaa agtgcaaact ggaacccaaa ggctcttttc
      601 agagccaccc acagtttctg taaaagcagc tgttactttt ctactgtctt ggaaactttg
      661 tttttcaaag atttatctag ttacaaagta tac
//
LOCUS       RATGGSLEC    1358 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Rat Gal/GalNAc-specific lectin mRNA, complete cds.
ACCESSION   J05495
KEYWORDS    Gal/GalNAc-specific lectin.
SOURCE      Rat (strain Wistar) adult peritoneal macrophage, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1358)
  AUTHORS   Kurata,M.Ii., Itoh,H., Yamashina,N. and Kawasaki,I.T.
  TITLE     Molecular cloning and sequence analysis of cDNA encoding the
            macrophage lectin specific for galactose and N-acetylgalactosamine
  JOURNAL   J. Biol. Chem. 265, 11295-11298 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Kawasaki, 20-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        172     1092     Gal/GalNAc-specific lectin precursor
    sigp        172      195     Gal/GalNAc-specific lectin signal peptide
    matp        196     1089     Gal/GalNAc-specific lectin
    signal     1330     1335     poly-A signal
BASE COUNT      340 a    325 c    365 g    328 t
ORIGIN      
        1 ccacctctgg ccactgctgc acagggaagc cacgatttca cacagttcca ggacttacct
       61 cagcttcaca ctgcttctga cactctcctt ggtcccagct cagtatctgt ctggattctg
      121 gtgtctcggt ttcagttcag atagccctag aggccttgtg tttgtgcaag catgacaatg
      181 gcatatgaaa acttccagaa cttggggagc gaagagaaaa accaagaggc tggtaaagct
      241 cctccccagt ccttcctgtg caatatcctc tcttggaccc acctcctcct gttctccctg
      301 ggccttagcc tcctgctgct ggtggttatc tccgtgattg gatcccaaaa ttcccagtta
      361 aggagggacc tagaaaccct gagaaccact ttagacaaca ccacctccaa caccaaggct
      421 gaactacagg ccctggcctc caggggtgac agcttgcaaa caggaatcaa ttctctgaaa
      481 gtggaggtgg atgatcatgg gcaggaactg caggcaggcc gaggcttgag ccagaaggtg
      541 gcttctctgg agagcacagt ggagaagaag gagcagactc tcagaacaga tctatctgaa
      601 ataaccgatc gtgtgcaaca gctggggaag gacttgaaga ccctgacatg tcagctggcc
      661 agcctcaaga acaacggctc agcagtggcc tgctgccccc ttcactggat ggagcatgaa
      721 ggcagctgct actggttctc tcagtctggg aagccgtggc ctgaagctga caagtactgc
      781 cagctggaga attctaacct agtggtggtc aactccctgg cggagcagaa ttttctacag
      841 actcacatgg gcagtgtggt cacttggatt ggcctaacgg accaaaatgg gccctggcga
      901 tgggtggacg ggaccgacta tgagaaaggc tttacgcact gggccccaaa gcagccagat
      961 aactggtatg gacacgggct gggaggaggc gaggactgtg cacacttcac ctcagatggt
     1021 cgctggaatg atgacgtttg ccagaggccc taccgctggg tctgtgagat gaagctggcc
     1081 aaggacagct aggagcactc tccaagtaat ttatttcttc agtggtttta gctggggaag
     1141 cctggagttt ggactcctcc tatcccattt caatgttatt tgggactttt ctttttgtca
     1201 aaaaatttaa gtcaaggaga aaggattgtg tttgagatgg tagtgtggtg tccggaagaa
     1261 tggggttatt gaaatctgtg gtattttgta cagagtgccg cttattgtag gactttttaa
     1321 tgtaacaaaa ataaaatact gttggtttta aaaaaaac
//
LOCUS       MLUDNAA      4171 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  M.luteus ribonuclease P (rnpA), 50S ribosomal subunit protein L34
            (rpmH), DNA biosynthesis initiation protein (dnaA), and DNA
            polymerase III beta subunit (dnaN) genes, complete cds.
ACCESSION   M34006
KEYWORDS    50S ribosomal subunit protein L34;
            DNA biosynthesis initiation protein;
            DNA polymerase III beta subunit; ribonuclease P.
SOURCE      M.luteus (strain IF03333) DNA.
  ORGANISM  Micrococcus luteus
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Micrococcaceae.
REFERENCE   1  (bases 1 to 4171)
  AUTHORS   Fujita,M.Q., Yoshikawa,H. and Ogasawara,N.
  TITLE     Structure of the dnaA region of Micrococcus luteus: Conservation
            and variations among eubacteria
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Ogasawara, 05-MAY-1990.
FEATURES       from  to/span     description
    pept        435       37 (c) ribonuclease P (rnpA)
    pept        601      464 (c) 50S ribosomal subunit protein L34 (rpmH)
    pept        944     2491     DNA biosynthesis initiation protein (dnaA)
    pept       3242  >  4171     DNA polymerase III beta subunit (dnaN)
    signal      843      848     dnaA -35 signal
    signal      866      871     dnaA -10 signal
BASE COUNT      672 a   1558 c   1361 g    580 t
ORIGIN      
        1 ctcgagcacg acgaagggcg agggcgtgac ggtcatctac gcgtgctccg aggcggcggg
       61 caggtgcggc ttcagggcct tgccgagcgc gccgacggtc tcgcggcgca gcagcgcgta
      121 gtccgcctcg gcggcggcag gcagcgcgcg gacctgcacc agaaccggca ggtcgcgcag
      181 cgggggcagg cgcatctgct ccgcgacgac ggcgcgcagc cgccgcttca cccggttgcg
      241 ggtcaccgcg ttccccacgg ccttggacac gacgaacccg gcccgcggcc gaggggcgct
      301 cggcgacgtg gaccgggtct gatcggggtc cgtggcgacg ctcaccacga cggtgcgccg
      361 acccgcgcgg gtgccggtgc gaccgaggtg acggaactcc gcgggcgtgc ggacgcgacg
      421 gtcacggggc agcacggggc gcgccgtccc tggaccgagg tcctcaggcg gacagctcgg
      481 cgcggttctt gccgcggcga gcggacagga tggcgcggcc ggcgcgggtg cgcatgcggg
      541 cacggaagcc gtgcttgcgg gcacgacggc ggttgttggg ctggaacgtg cgcttggtca
      601 ctgtgatctc tccacgacgt gttgtgtgcc ttcgaccgtc gacctgagca ggtggaccag
      661 ggggaaggac cggtgaggct ctgacatgaa atccccggcg ggcacgatca accgacaccc
      721 gggggaccag acaactctag tggccggggt acgcaggggt caaacgggtg cgcgcggatc
      781 ccggacgccc gtgggcgccc gggacggtcg tccacaggat gataggggtt gtccaccgtc
      841 tgtgcacaac cctgtggaca accgttagac tccgagcacc gtcgtccgtc tcgcgcagcc
      901 gcccccgtcc ccggggacgt ctgccccagg aaaggaagcc tccgtggtgg cagaccaggc
      961 cgtgctcagc tcgtggcgct ccgtcgtggg ctccctcgag gacgacgcgc gggtcagcgc
     1021 ccgcctcatg ggcttcgtct acctggccca gccgcagggc ctcatcggca atacgctcct
     1081 gctggccgtg ccgaacgaga ccacccgcga gacgctccag ggcacccagg tggccgacgc
     1141 cctcacggac gccctgaccc aggagttccg cgaggagatc ctgctggcca tctcgatcga
     1201 cgcgaacctg cagccgccgc ggaccccctc ctccgaggcc cgccgctcct ccctcgccgg
     1261 cgggccgtcc ggcgccgcgg cgccggacgt cgagctgccc ccggccgcga ccgccgccac
     1321 ctcccgccgc gccgtcgccg aggagctgcc gggtttccgc atcgagccgc cggccgacgt
     1381 cgtgcccgcc gcgaacgcgg ccccgaacgg gaacggcaag ccgacccccg cgccgccgtc
     1441 gacgtccgcg gagaccagcc gcctcaacga ccgctaccac ttcgagacgt tcgtgatcgg
     1501 ctcgtccaac cggttcgcgc atgcggccgc gaacgcggtg gccgaggcgc cggcgaaggc
     1561 gtacaacccg ctgttcatct acggcgagtc cggcctgggc aagacgcacc tgctgcacgc
     1621 gatcgggcac tacgcccgcc gcctctaccc gggcttgcgg gtgcggtacg tgaactcgga
     1681 ggagttcacc aacgacttca tcaactccat ccgccacgac gagggcgcct cgttcaagca
     1741 ggtctaccgc aacgtggaca tcctgctgat cgacgacatc cagttcctgg cggacaagga
     1801 ggcgacggtc gaggagttct tccacacctt caacacgctc tacaacaaca acaagcaggt
     1861 ggtcatcacc tcggacctgc cgcccaagca gctctccggg ttcgaggacc ggctgcgctc
     1921 ccgcttcgag tgggggctga tcacggacat ccagccgccg gacctcgaga cgcgcatcgc
     1981 gatcctccgg aagaaggcgg aggccgaggg gctcgtggcc ccgccggagg cgctggagta
     2041 catcgcctcg cgcatctcca cgaacatccg cgagctcgag ggtgcgctga tccgcgtgac
     2101 cgcgttcgcc tcgctcaacc ggcagaccgt ggacatcgag ctggccgagc atgtgctcaa
     2161 ggacctgatc acggacgaga cggcgcacga gatcacgccg gagctgatcc tgcacgccac
     2221 gggggagtac ttcaatctca ccctcgagga gctgaccagc aagtcccgca cccgcacgct
     2281 ggtgacggcg cggcagatcg ccatgtacct gctgcgcgag ctgaccgaga tgtcgctgcc
     2341 caagatcgga caggtcctgg gcggccgcga ccacaccacc gtcatccacg cggaccgcaa
     2401 gatccgcgag ctgatggccg agcggcggac gatctacaac caggtcaccg agctcaccaa
     2461 cgagatcaag cggaaacagc gcggcgcctg agccacccgc cgtcccaccc tcgagtcccg
     2521 gttcgaacct tcgaccggga ctcgagcctt cgtccacatt gtccaccggt gtggacaact
     2581 ctgtggagaa cccccgtcgc tgtggacaac ctccgcccct gtggatggcg gaatcaggcc
     2641 gctcacgccg cgtccacacc cctgggggcg acgacggtgc acaggctggt gaatcacacg
     2701 gatgccattc ggcggtccac acgtatccac agatctccac agccgaaaac cgcatgatct
     2761 cgcgggttcc gacgggtggg gacgcaccct gtggacagga atgtgggcaa ctggggggtg
     2821 gatccgggga cgagtctgga caactccggt cgaacctgtg cagagcaggg ggaggacctc
     2881 cggggctcgt ccacaccgga tgcacacccc gaacgagcgg gcgtccacac ggttccccca
     2941 accggatccg catgctgacg cggcagaacg gcgttctcca caggatccac acccgctacc
     3001 actccttctg atcccccttc agagattccg ccaagcaaca tcatctccgc cgcctcgcac
     3061 cgagctgagg gccccgagag cccgggctgc gcccgagcac tcagaatcct tggaatcaca
     3121 cgcttgtcat tcgcttccga cctgacaccc gccgcgcgga ccgcctatgc tgaggcccgc
     3181 gtgtcgccgt tcccccgggc ccgcaacccc cgtcatcacc cccagagagg cagtgcagct
     3241 cgtgaagttc accgtcgaac gcgacatcct caccgacgcc gtctcctggg ccgcccgctc
     3301 cctgtccccg cgcccgccgg tgccggtgct ctcgggtctg ctgatcacgg ccgaggccgg
     3361 cgtcgtgtcc atcgcgagct tcgactacga gacctccgcg cgcctggaga tcgaggcgga
     3421 cgtcgagacc gcgggccagg tgctcgtctc cggccgtctg ctgaacgaca tcgtccgctc
     3481 cctgccgcag gcccaggtca cggtggagct cgacggcggc aaggtgctcg tcacctgccg
     3541 atcctcccgc ttctccctgg ccacgatgcc cgtgggcgac tacccggccc tgcccgagct
     3601 gccggccccc gcaggcaccg tggacggggc cgcgttcgcc cacgccgtcg cccaggtcac
     3661 cgtcgccgcc tccaaggacg acaccctgcc gatcctcacc gccgtgaagg tggagatcga
     3721 gggcgacacc ataaccttcc tggccacgga ccgctaccgg ctggccatga aggagatccg
     3781 ctggaccccg gcggacccgt ccatctcgac gtcgctgctc atcaaggcgc gcacgctcac
     3841 cgaggtcgcc aagtccctcg gctccggcgg cgacctggag atcctcctcg gtcagaccgc
     3901 cgacctcgtg ggcttcgcct ccggtggccg ccgcaccacc tccgtgctcg tggacggcga
     3961 gtaccccaag atccgctcgc tcttccccga gtccagcccc atccaggccg tggtggacac
     4021 cgccgccctg gtcgaggcgt cccgccgcgt ggccctcgtg gccgagcgca acaccgccct
     4081 gcgcatggtg ttcacggagg gccaggtcac cctggacgcc ggcaccggcg atgacgcgag
     4141 cgccaacgag tccgtgccgt gcacgctcga g
//
LOCUS       HUMLYAM1     1192 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human leukocyte adhesion molecule-1 (LAM-1), exon 1.
ACCESSION   M32406
KEYWORDS    leukocyte adhesion molecule-1.
SEGMENT     1 of 9
SOURCE      Human mature lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1192)
  AUTHORS   Ord,D.C., Ernst,T.J., Zhou,L.-J., Rambaldi,A., Spertini,O.,
            Griffin,J. and Tedder,T.F.
  TITLE     Structure of the gene encoding the human leukocyte adhesion
            molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
  JOURNAL   J. Biol. Chem. 265, 7760-7767 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Tedder, 01-MAR-1990.
FEATURES       from  to/span     description
    pept       1040  +  1042     leukocyte adhesion molecule-1 (LAM-1), exon 1
    mRNA        931  >  1192     leukocyte adhesion molecule-1 mRNA
    IVS        1043  >  1192     leukocyte adhesion molecule-1 intron A
BASE COUNT      410 a    185 c    326 g    271 t
ORIGIN      Chromosome 1q22-25.
        1 agagagctgt tataaagatt aaacaatata ataaatatgg cgcgtgagct tcagagtttt
       61 tgttgttgtt attattattt tcctaaaaat gcaaatctga tttgcatttg actcattgac
      121 tcacatcagt gggtcttcct ttttattgtc cttcatcata tgggtcctaa tttcacatgc
      181 agtcttataa aaccatctca ttttatagtc caagaatatt aaaggtactt gtaggctccc
      241 aaacctacac ggtgaaaagc tagagagcat gggctctctt caggggttaa cttcaggaag
      301 tgccactaac aaggacgtcc actaggtggt gagcaaggaa agacggaggt gaaggaaccg
      361 aaacgagtca agtccactgc ttagctctac tgaagttttg caaacatcat aaatatgtct
      421 gaaatgcagt tttgatttgt agtatttgca atttccaagg gccatttacc acaggtagcc
      481 aagagttagt ttagcattta tgaaaaagat aggggagggt ggtggttaag aaggaggtgg
      541 aggagagagt gaaggaggaa gaggagaaca agaaccaaac aaaaacaaga acaagaacaa
      601 gtagaagaag aggagcaggg aggaaaaaga agaggaagaa gaacagcaac aacaatgagt
      661 gaaggaggag gagggtaagg aaagatgcat aggagaatgg aaggaaggat agaaaggagg
      721 gaaggaagag agaatctagt cacattactt tctgatcagc agttcatttt tgtctcagtg
      781 ggaggcaata gaggccagtc taggaaaggg gtggggaaag aggaaagaga agtgcaggag
      841 gaaggggagg cccaagggga ggaggaggag gatgtgagac tgggttagag aaatgaaaga
      901 aagcaaggct ttctgttgac attcagtgca gtctacctgc agcacagcac actccctttg
      961 ggcaaggacc tgagaccctt gtgctaagtc aagaggctca atgggctgca gaagaactag
     1021 agaaggacca agcaaagcca tggtgagcct ttcagcctaa aagacgttta gatgctcaga
     1081 tagaaactct tggggttgta gaggcaggtg gcaaggatag gaatcacccc atttcaattc
     1141 tggttttaaa taatatagaa actaaacatt ttctcagacc ctcaaaaaaa gt
//
LOCUS       HUMLYAM2      363 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human leukocyte adhesion molecule-1 (LAM-1), exon 2.
ACCESSION   M32407
KEYWORDS    leukocyte adhesion molecule-1.
SEGMENT     2 of 9
SOURCE      Human mature lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 363)
  AUTHORS   Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
            Griffin,J. and Tedder,T.F.
  TITLE     Structure of the gene encoding the human leukocyte adhesion
            molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
  JOURNAL   J. Biol. Chem. 265, 7760-7767 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Tedder, 01-MAR-1990.
FEATURES       from  to/span     description
    pept    +   229  +   310     leukocyte adhesion molecule-1 (LAM-1), exon 2
    mRNA    <     1  >   363     leukocyte adhesion molecule-1 mRNA
    IVS     <     1      228     leukocyte adhesion molecule-1 intron A
    IVS         311  >   363     leukocyte adhesion molecule-1 intron B
BASE COUNT      115 a     51 c     69 g    128 t
ORIGIN      Chromosome 1q22-25.
        1 cactgagact aagcgtaaaa taaatagaac aaacaaactg tgcatcagtt ctgatgtaaa
       61 tttgaagtaa ttttcatcta tgtctgagaa acctgttacc tcagacaggg ttagtagaca
      121 tatgtgtttt attctgatta ttaagaaagt tgtaagcacc acctcaaagg ctataaatgt
      181 gtggtttaag ggtatacatc taaatataat tttgtatttc atttgcagat atttccatgg
      241 aaatgtcaga gcacccagag ggacttatgg aacatcttca agttgtgggg gtggacaatg
      301 ctctgttgtg gtatgttatg atatttatat atcactaagt ctattttact tatattcatt
      361 ttt
//
LOCUS       HUMLYAM3      531 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human leukocyte adhesion molecule-1 (LAM-1), exon 3.
ACCESSION   M32408
KEYWORDS    leukocyte adhesion molecule-1.
SEGMENT     3 of 9
SOURCE      Human mature lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 531)
  AUTHORS   Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
            Griffin,J. and Tedder,T.F.
  TITLE     Structure of the gene encoding the human leukocyte adhesion
            molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
  JOURNAL   J. Biol. Chem. 265, 7760-7767 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Tedder, 01-MAR-1990.
FEATURES       from  to/span     description
    pept    +    73  +   459     leukocyte adhesion molecule-1 (LAM-1), exon 3
    mRNA    <     1  >   531     leukocyte adhesion molecule-1 mRNA
    IVS     <     1       72     leukocyte adhesion molecule-1 intron B
    IVS         460  >   531     leukocyte adhesion molecule-1 intron C
BASE COUNT      172 a    106 c    138 g    115 t
ORIGIN      Chromosome 1q22-25.
        1 ctggagtagt gctaggttct ttttagctgt aacattatgt aagtctgcat aggtcacact
       61 gatgtcttgc agatttcctg gcacatcatg gaaccgactg ctggacttac cattattctg
      121 aaaaacccat gaactggcaa agggctagaa gattctgccg agacaattac acagatttag
      181 ttgccataca aaacaaggcg gaaattgagt atctggagaa gactctgcct ttcagtcgtt
      241 cttactactg gataggaatc cggaagatag gaggaatatg gacgtgggtg ggaaccaaca
      301 aatctctcac tgaagaagca gagaactggg gagatggtga gcccaacaac aagaagaaca
      361 aggaggactg cgtggagatc tatatcaaga gaaacaaaga tgcaggcaaa tggaacgatg
      421 acgcctgcca caaactaaag gcagccctct gttacacagg tagggagtga caagacggct
      481 atgctgcctc agactcagga agggccacgg ttaagagaat actcagattt a
//
LOCUS       HUMLYAM4      832 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human leukocyte adhesion molecule-1 (LAM-1), exon 4.
ACCESSION   M32409
KEYWORDS    leukocyte adhesion molecule-1.
SEGMENT     4 of 9
SOURCE      Human mature lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 832)
  AUTHORS   Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
            Griffin,J. and Tedder,T.F.
  TITLE     Structure of the gene encoding the human leukocyte adhesion
            molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
  JOURNAL   J. Biol. Chem. 265, 7760-7767 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Tedder, 01-MAR-1990.
FEATURES       from  to/span     description
    pept    +   161  +   268     leukocyte adhesion molecule-1 (LAM-1), exon 4
    mRNA    <     1  >   832     leukocyte adhesion molecule-1 mRNA
    IVS     <     1      160     leukocyte adhesion molecule-1 intron C
    IVS         269  >   832     leukocyte adhesion molecule-1 intron D
BASE COUNT      241 a    163 c    150 g    278 t
ORIGIN      Chromosome 1q22-25.
        1 aaaattttag ccatatgatt tttatgctat gaatttacca aataaacctt tcctgattat
       61 ttaaatcatc tcagacaaaa ggttatctat gtctaaagaa atgactttga gtactaaaat
      121 gtaatcacat taaaatattt tttttctgac ctccttaaag cttcttgcca gccctggtca
      181 tgcagtggcc atggagaatg tgtagaaatc atcaataatt acacctgcaa ctgtgatgtg
      241 gggtactatg ggccccagtg tcagtttggt aagtctcttt cctttctttg cttcttctta
      301 ggtaaagtca caggaatcat tatagcttat catgaagctg gttggaacaa aatgatacta
      361 gccactctga gaaatgggaa gttttgatca gaaagctctg ctttcacaat attgttacct
      421 ttccgtaaag atttcataag tcagcatgaa gtttcgattc acttctcaac aagtcttttt
      481 gagtaccaca agaagcacag tgttgggata aagctgtcag ggttacaata aggaattagc
      541 atggtagatt cccgctctca agaagctcac gatctaatga gcttgttaga ttaattagaa
      601 ctctaaggtc tggaagaaac tatgccattt atcattagga ggctgagtta cccagaaagt
      661 atcttgcttt ttccttctag tagttccttt ccttcttgca gttctccaca cttaacacat
      721 gtgctctgta gcacactgac tttgctggtg gccttctctc tcattttgca catggccaaa
      781 aaacatgtca tctttaagac attgttcaaa gacagtttct tctaggaagc tt
//
LOCUS       HUMLYAM5      712 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human leukocyte adhesion molecule-1 (LAM-1), exon 5.
ACCESSION   M32410
KEYWORDS    leukocyte adhesion molecule-1.
SEGMENT     5 of 9
SOURCE      Human mature lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 712)
  AUTHORS   Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
            Griffin,J. and Tedder,T.F.
  TITLE     Structure of the gene encoding the human leukocyte adhesion
            molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
  JOURNAL   J. Biol. Chem. 265, 7760-7767 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Tedder, 01-MAR-1990.
FEATURES       from  to/span     description
    pept    +   377  +   562     leukocyte adhesion molecule-1 (LAM-1), exon 5
    mRNA    <     1  >   712     leukocyte adhesion molecule-1 mRNA
    IVS     <     1      376     leukocyte adhesion molecule-1 intron D
    IVS         563  >   712     leukocyte adhesion molecule-1 intron E
BASE COUNT      200 a    145 c    139 g    228 t
ORIGIN      Chromosome 1q22-25.
        1 ctctgatgtg atagttattt cccgactaag ctggtcattc ccagttacac ctatttggct
       61 ttaaggattc tcactacaga taatactgaa gataataata tgaagactag ctaatgttta
      121 cttagaattt ctgatgagtc aggctttgtt ctaacgtcct tgacttatgc taattgaatt
      181 acatttagtt tccatatcaa tttgataaag ataacacaat ttcattattc ctcttatata
      241 gatgaagaaa ctgaagttgg agggttcaag taaccttgtt taaaggcaca tggttatcaa
      301 gtggcagggc taggattcaa atccaggcgt cagttcctct taactcttcc ccatactgtt
      361 tctttcccta ttgaagtgat tcagtgtgag cctttggagg ccccagagct gggtaccatg
      421 gactgtactc accctttggg aaacttcagc ttcagctcac agtgtgcctt cagctgctct
      481 gaaggaacaa acttaactgg gattgaagaa accacctgtg gaccatttgg aaactggtca
      541 tctccagaac caacctgtca aggtgagtaa cttcagacta gaggttttgt catgcaatcc
      601 tgggcttaca gtcagaacat tcagtagaag tttgctgaga agtcaaactt aggatcctaa
      661 tttaacctaa cttttgttta acctactgtg atgtttctca aaggacttat tc
//
LOCUS       HUMLYAM6      451 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human leukocyte adhesion molecule-1 (LAM-1), exon 6.
ACCESSION   M32411
KEYWORDS    leukocyte adhesion molecule-1.
SEGMENT     6 of 9
SOURCE      Human mature lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 451)
  AUTHORS   Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
            Griffin,J. and Tedder,T.F.
  TITLE     Structure of the gene encoding the human leukocyte adhesion
            molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
  JOURNAL   J. Biol. Chem. 265, 7760-7767 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Tedder, 01-MAR-1990.
FEATURES       from  to/span     description
    pept    +   240  +   425     leukocyte adhesion molecule-1 (LAM-1), exon 6
    mRNA    <     1  >   451     leukocyte adhesion molecule-1 mRNA
    IVS     <     1      239     leukocyte adhesion molecule-1 intron E
    IVS         426  >   451     leukocyte adhesion molecule-1 intron F
BASE COUNT      121 a    107 c     89 g    134 t
ORIGIN      Chromosome 1q22-25.
        1 gagggtcacc ttagctaggg cagcagcctg gagtagctac tcctctcccc acagctttca
       61 atgcttcctt gccttcatct ctcattcacc acccaccatc attctcaaga aaataaagcc
      121 tggaagcaat atcacaagta atgtagtcag gcagctttgg ctaaaaatcc aaagctcaag
      181 ggagggtctc tactcagaaa tactgttttg tctttttttt tttttctttt tcattgaagt
      241 gattcagtgt gagcctctat cagcaccaga tttggggatc atgaactgta gccatcccct
      301 ggccagcttc agctttacct ctgcatgtac cttcatctgc tcagaaggaa ctgagttaat
      361 tgggaagaag aaaaccattt gtgaatcatc tggaatctgg tcaaatccta gtccaatatg
      421 tcaaagtgag taagtttgtc ctggaactga a
//
LOCUS       HUMLYAM7      543 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human leukocyte adhesion molecule-1 (LAM-1), exon 7.
ACCESSION   M32412
KEYWORDS    leukocyte adhesion molecule-1.
SEGMENT     7 of 9
SOURCE      Human mature lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 543)
  AUTHORS   Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
            Griffin,J. and Tedder,T.F.
  TITLE     Structure of the gene encoding the human leukocyte adhesion
            molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
  JOURNAL   J. Biol. Chem. 265, 7760-7767 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Tedder, 01-MAR-1990.
FEATURES       from  to/span     description
    pept    +   300  +   428     leukocyte adhesion molecule-1 (LAM-1), exon 7
    mRNA    <     1  >   543     leukocyte adhesion molecule-1 mRNA
    IVS     <     1      299     leukocyte adhesion molecule-1 intron F
    IVS         429  >   543     leukocyte adhesion molecule-1 intron G
BASE COUNT      174 a     99 c    122 g    148 t
ORIGIN      Chromosome 1q22-25.
        1 tatcagaact aagaaagctt gggctgcagg tcgactctag gtgcattttc aggaactcta
       61 tgaaccacaa atctgggcat tgagattctg taggcattag actagcaagg ctggtcagtc
      121 tttgcctatg ctgtagactc atcaggggcc ttcccatgcc agtttcctca tctgtcaaat
      181 ggcatcattt gggctactac tgggagatgt aaggaggaaa aaagtcaaat atcatgagat
      241 agactaagga aataatgctg gtggtctcat gctatgtgcc ttactgattt ctctttcaga
      301 attggacaaa agtttctcaa tgattaagga gggtgattat aaccccctct tcattccagt
      361 ggcagtcatg gttactgcat tctctgggtt ggcatttatc atttggctgg caaggagatt
      421 aaaaaaaggt atgtgagttt aacttcacat gaaaagaaca caactttaaa gtgaaaaaga
      481 aaaaaaaaag aaacccacag gaaattaaat gtgatagatc aacacaagca ggatgccaag
      541 ctt
//
LOCUS       HUMLYAM8      524 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human leukocyte adhesion molecule-1 (LAM-1), exon 8.
ACCESSION   M32413
KEYWORDS    leukocyte adhesion molecule-1.
SEGMENT     8 of 9
SOURCE      Human mature lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 524)
  AUTHORS   Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
            Griffin,J. and Tedder,T.F.
  TITLE     Structure of the gene encoding the human leukocyte adhesion
            molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
  JOURNAL   J. Biol. Chem. 265, 7760-7767 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Tedder, 01-MAR-1990.
FEATURES       from  to/span     description
    pept    +   380  +   398     leukocyte adhesion molecule-1 (LAM-1), exon 8
    mRNA    <     1  >   524     leukocyte adhesion molecule-1 mRNA
    IVS     <     1      379     leukocyte adhesion molecule-1 intron G
    IVS         399  >   524     leukocyte adhesion molecule-1 intron H
BASE COUNT      146 a    104 c    103 g    171 t
ORIGIN      Chromosome 1q22-25.
        1 tagtttacag tattagcagc tgtccctcaa ggaagaatct gcaggtagat gagatgcaga
       61 ttgggtggga taaacacttg aatgacatat tgggtcttgc caccaggcaa tttagcaatt
      121 ctgtcttctt gagtagcacg gagatggaat ggaacctcag gaggcatctg catcaacatg
      181 tctgttctgt attagtgtct accactgttt attaagccag ttcctcaaat ctcctttgac
      241 acagataggg tccacctaac aaatacctaa tatacttcaa aagacagttt tgagagtggg
      301 agtcttcctt ctcccttact tgaaaaactt taaattgtct aatttttgct aatgcctttt
      361 tctctatttt ctatttcagg caagaaatcc aagagaaggt aagttttatt agtggcgagg
      421 agtttccaca tctgctgatt cattctctac ttcttaagtt acttctgctc tagctagaca
      481 catacccata gtagttatta ctgggtctat caatgacaga tagg
//
LOCUS       HUMLYAM9     1696 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human leukocyte adhesion molecule-1 (LAM-1), exon 9.
ACCESSION   M32414
KEYWORDS    leukocyte adhesion molecule-1.
SEGMENT     9 of 9
SOURCE      Human mature lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1696)
  AUTHORS   Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O.,
            Griffin,J. and Tedder,T.F.
  TITLE     Structure of the gene encoding the human leukocyte adhesion
            molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils
  JOURNAL   J. Biol. Chem. 265, 7760-7767 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Tedder, 01-MAR-1990.
FEATURES       from  to/span     description
    pept    +   310      328     leukocyte adhesion molecule-1 (LAM-1), exon 9
    mRNA    <     1     1414     leukocyte adhesion molecule-1 mRNA
    IVS     <     1      309     leukocyte adhesion molecule-1 intron H
BASE COUNT      443 a    387 c    292 g    574 t
ORIGIN      Chromosome 1q22-25.
        1 ataagcatca ctaaagagct tgttaggggt gcagaatctc aggctccact cagacctact
       61 gaatcagagt ctgcatttta acaccatctc tgagtggtaa ggacatgaaa atctgagaag
      121 tgctgctact agggtttgct tacatttgtt catcttcaga ggttcctaaa gcctggcctc
      181 ttgtctgaga tttccagctg aaagcatttc cttgctcctc ttctcatctc taatgaatat
      241 ttacctttac tactaacact ccaagttttg caatttttaa actcttatta tcttttgttt
      301 ttctttcagt atgaatgacc catattaaat cgcccttggt gaaagaaaat tcttggaata
      361 ctaaaaatca tgagatcctt taaatccttc catgaaacgt tttgtgtggt ggcacctcct
      421 acgtcaaaca tgaagtgtgt ttccttcagt gcatctggga agatttctac ctgaccaaca
      481 gttccttcag cttccatttc acccctcatt tatccctcaa cccccagccc acaggtgttt
      541 atacagctca gctttttgtc ttttctgagg agaaacaaat aagaccataa agggaaagga
      601 ttcatgtgga atataaagat ggctgacttt gctctttctt gactcttgtt ttcagtttca
      661 attcagtgct gtacttgatg acagacactt ctaaatgaag tgcaaatttg atacatatgt
      721 gaatatggac tcagttttct tgcagatcaa atttcgcgtc gtcttctgta tacgtggagg
      781 tacactctat gaagtcaaaa gtctacgctc tcctttcttt ctaactccag tgaagtaatg
      841 gggtcctgct caagttgaaa gagtcctatt tgcactgtag cctcgccgtc tgtgaattgg
      901 accatcctat ttaactggct tcagcctccc caccttcttc agccacctct ctttttcagt
      961 tggctgactt ccacacctag catctcatga gtgccaagca aaaggagaga agagagaaat
     1021 agcctgcgct gttttttagt ttgggggttt tgctgtttcc ttttatgaga cccattccta
     1081 tttcttatag tcaatgtttc ttttatcacg atattattag taagaaaaca tcactgaaat
     1141 gctagctgca actgacatct ctttgatgtc atatggaaga gttaaaacag gtggagaaat
     1201 tccttgattc acaatgaaat gctctccttt cccctgcccc cagacctttt atccacttac
     1261 ctagattcta catattcttt aaatttcatc tcaggcctcc ctcaacccca ccacttcttt
     1321 tataactagt cctttactaa tccaacccat gatgagctcc tcttcctggc ttcttactga
     1381 aaggttaccc tgtaacatgc aattttgcat ttgaataaag cctgcttttt aagtgttaac
     1441 tagtttgcct agtttgttat tttgaaaatt gatcatatgt tttgttttct ccccagtgag
     1501 ttacatgctc cttcagggca gagtttgtgt cagatccctg gagtatctag tgcattactt
     1561 gacactcaat aaatgaatgt tcaaataaat cagaaagagc atacagtgca ctgctgatat
     1621 aagtttcagc atccctcttt ctctatggca tctgatgacc tgggtcagat atcacctaat
     1681 gtcaacagct gaattc
//
LOCUS       RATALADPS    1612 bp ds-DNA             ROD       17-JUL-1990
DEFINITION  Rat delta-aminolevulinate dehydratase processed pseudogene,
            complete cds.
ACCESSION   J04764
KEYWORDS    delta-aminolevulinate dehydratase; pseudogene.
SOURCE      Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1612)
  AUTHORS   Bishop,T.R., Frelin,L.P. and Boyer,S.H.
  TITLE     Isolation and characterization of a rat delta-aminolevulinate
            dehydratase processed pseudogene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.R.Bishop, 20-APR-1990.
FEATURES       from  to/span     description
    pept.ps     271     1264     pseudo-delta-aminolevulinate dehydratase
    rpt         101      108     8 bp flanking direct repeat
    rpt        1385     1392     8 bp flanking direct repeat
BASE COUNT      404 a    411 c    418 g    379 t
ORIGIN      
        1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt
       61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt
      121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca
      181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta
      241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc
      301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa
      361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct
      421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga
      481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca
      541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg
      601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat
      661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg
      721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat
      781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac
      841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat
      901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga
      961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca
     1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct
     1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa
     1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc
     1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga
     1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa
     1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga
     1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta
     1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt
     1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg
     1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at
//
LOCUS       BOVPDEAP      585 bp ss-mRNA            MAM       17-JUL-1990
DEFINITION  Bovine cone photoreceptor cyclic nucleotide phosphodiesterase
            alpha'-subunit (PDE), partial cds.
ACCESSION   M33140 M29465
KEYWORDS    cone photoreceptor cyclic nucleotide phosphodiesterase.
SOURCE      Bovine dark-adapted frozen retina, cDNA to mRNA, clone BC-alpha-1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 585)
  AUTHORS   Charbonneau,H., Prusti,R.K., LeTrong,H., Sonnenburg,W.K.,
            Mullaney,P.J., Walsh,K.A. and Beavo,J.A.
  TITLE     Identification of a noncatalytic cGMP-binding domain conserved in
            both the cGMP-stimulated and photoreceptor cyclic nucleotide
            phosphodiesterases
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 288-292 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   585     cone photoreceptor cyclic nucleotide
                                 phosphodiesterase alpha'-subunit (AA at 1)
BASE COUNT      198 a    115 c    126 g    146 t
ORIGIN      
        1 agagaagtca tcttttataa aatcatcgat tacattttac atggaaaaga agagatcaaa
       61 gtcattccga cacctcccat ggaccactgg actctcatta gtgggttgcc aacatatgtt
      121 gctgaaaatg gatttatctg caacatgctg aacgccccgg cggatgaata cttcacgttt
      181 cagaaaggac ctgtagatga aactggctgg gtcattaaaa atgtcttgtc cctgcctatt
      241 gtcaacaaaa aggaagacat cgtgggcgta gctacatttt acaacaggaa ggatggaaag
      301 ccttttgatg aatatgatga gcacatcgct gagactctca cacagtttct tggatggtct
      361 ctcttaaata ctgacaccta tgagaaaatg aataagctgg agaacagaaa ggacatagcc
      421 caggaaatgc tcatgaacca caccaaggct acacctgatg agatcaagtc tattttgaaa
      481 tttaaagaga agttaaatat agatgtaatt gaagactgtg aagaaaaaca gcttgtcaca
      541 attttgaagg aggacctgcc agacccacgg actgcagacc tgtat
//
LOCUS       CHKG1CLSE     240 bp ds-DNA             VRT       17-JUL-1990
DEFINITION  Chicken delta-1-crystallin gene, intron 3 lens-specific enhancer
            cor segments B3 and B4.
ACCESSION   M33954
KEYWORDS    delta-1-crystallin.
SOURCE      Chicken DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 240)
  AUTHORS   Goto,K., Okada,T.S. and Kondoh,H.
  TITLE     Functional cooperation of lens-specific and nonspecific elements in
            the delta-1-crystallin enhancer
  JOURNAL   Mol. Cell. Biol. 10, 958-964 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site         17      116     core segment B3
    site        112      235     core segment B4
BASE COUNT       61 a     52 c     53 g     74 t
ORIGIN      
        1 gtcagtgagg tgtgctcagc atgacctgcc ctcccaccct cttcagactg aacattcctg
       61 aggaattgtt tcagtatgaa ttaggaatat tctttttcca atggcacttg ggatcccttt
      121 gtgtctggct gcctgagtta gtagaagaca atgcacaata ttgtataggg gtgaagaaga
      181 gtcagccact aagcactttt tctgaaatat tcattgttgt tgctcaccta ccatggacaa
//
LOCUS       CHKOVAL      9206 bp ds-DNA             VRT       17-JUL-1990
DEFINITION  Chicken ovalbumin gene, complete cds.
ACCESSION   J00895
KEYWORDS    ovalbumin.
SOURCE      Chicken oviduct DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1343 to 8906)
  AUTHORS   Woo,S.L.C., Beattie,W.G., Catterall,J.F., Dugaiczyk,A., Staden,R.,
            Brownlee,G.G. and O'Malley,B.W.
  TITLE     Complete nucleotide sequence of the chicken chromosomal ovalbumin
            gene and its biological significance
  JOURNAL   Biochemistry 20, 6437-6446 (1981)
  STANDARD  full staff_review
REFERENCE   2  (bases 1043 to 1562; 2675 to 4732; 8885 to 9206)
  AUTHORS   Benoist,C., O'Hare,K., Breathnach,R. and Chambon,P.
  TITLE     The ovalbumin gene-sequence of putative control regions
  JOURNAL   Nucleic Acids Res. 8, 127-142 (1980)
  STANDARD  full staff_review
REFERENCE   3  (bases 1357 to 1389; 2941 to 3052; and ivs junctions)
  AUTHORS   Breathnach,R., Benoist,C., O'Hare,K., Gannon,F. and Chambon,P.
  TITLE     Ovalbumin gene: evidence for a leader sequence in mRNA and DNA
            sequences at the exon-intron boundaries
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75, 4853-4857 (1978)
  STANDARD  full staff_review
REFERENCE   4  (bases 1282 to 1420; 2952 to 2999)
  AUTHORS   Gannon,F., O'Hare,K., Perrin,F., LePennec,J.P., Benoist,C.,
            Cochet,M., Breathnach,R., Royal,A., Garapin,A., Cami,B. and
            Chambon,P.
  TITLE     Organisation and sequences at the 5' end of a cloned complete
            ovalbumin gene
  JOURNAL   Nature 278, 428-434 (1979)
  STANDARD  full staff_review
REFERENCE   5  (bases 1343 to 8906; exons only)
  AUTHORS   McReynolds,L., O'Malley,B.W., Nisbet,A.D., Fothergill,J.E.,
            Givol,D., Fields,S., Robertson,M. and Brownlee,G.G.
  TITLE     Sequence of chicken ovalbumin mRNA
  JOURNAL   Nature 273, 723-728 (1978)
  STANDARD  full staff_review
REFERENCE   6  (bases 2675 to 5042)
  AUTHORS   Robertson,M.a., Staden,R., Tanaka,Y., Catterall,J.F.,
            O'Malley,B.W. and Brownlee,G.G.
  TITLE     Sequence of three introns in the chick ovalbumin gene
  JOURNAL   Nature 278, 370-372 (1979)
  STANDARD  full staff_review
REFERENCE   7  (bases 1 to 1042)
  AUTHORS   Heilig,R., Muraskowsky,R. and Mandel,J.L.
  TITLE     The ovalbumin gene family: The 5' end region of the X and Y genes
  JOURNAL   J. Mol. Biol. 156, 1-19 (1982)
  STANDARD  full staff_review
REFERENCE   8  (bases 5576 to 5624)
  AUTHORS   Lai,E.C., Woo,S.L.C., Dugaiczyk,A. and O'Malley,B.W.
  TITLE     The ovalbumin gene: Alleles created by mutations in the intervening
            sequences of the natural gene
  JOURNAL   Cell 16, 201-211 (1979)
  STANDARD  simple staff_entry
REFERENCE   9  (bases 443 to 611)
  AUTHORS   Schweers,L.A., Frank,D.E., Weigel,N.L. and Sanders,M.M.
  TITLE     The steroid-dependent regulatory element in the ovalbumin gene does
            not function as a typical steroid response element
  JOURNAL   J. Biol. Chem. 265, 7590-7595 (1990)
  STANDARD  simple staff_entry
COMMENT     Eight exons reported. Sequence homologies with adenovirus early 1a,
            adenovirus major late, chicken conalbumin, chkx and chky genes
            noted for 5' flanking sequence.
            
            OV1.3 is identical to OV1.8 except that intron E is shorter by 522
            nucleotides [8].
FEATURES       from  to/span     description
    pept       2996     3163     ovalbumin, exon 1
               3415     3465     ovalbumin, exon 2
               4047     4175     ovalbumin, exon 3
               4576     4693     ovalbumin, exon 4
               5652     5794     ovalbumin, exon 5
               6126     6281     ovalbumin, exon 6
               7864     8259     ovalbumin, exon 7
    pre-msg    1343     8906     oval mRNA and introns
    IVS        1390     2978     oval intron A
    IVS        3164     3414     oval intron B
    IVS        3466     4046     oval intron C
    IVS        4176     4575     oval intron D
    IVS        4694     5651     oval intron E
    IVS        5795     6125     oval intron F
    IVS        6282     7863     oval intron G
    allele     1282     1282     g may be c [1]
    allele     1309     1309     g may be a [1]
    allele     1376     1376     g may be c [1],[7]
    allele     1385     1385     g may be a [1],[7]
    allele     1393     1393     a may be g [1]
    allele     3010     3010     t may be c [1]
    allele     3154     3154     a may be g [1]
    allele     5747     5747     g may be a [1]
    allele     8032     8032     can be g [5]
    conflict   1471     1471     c in [1]; t in [2],[7]
    conflict   1523     1523     a in [1]; t in [2],[7]
    conflict   1538     1539     at in [1]; a in [2]
    conflict   2681     2681     t in [1],[6]; g in [2]
    conflict   2696     2696     a in [1],[6]; g in [2]
    conflict   3552     3552     a in [1],[6]; g in [2]
    conflict   3683     3683     c in [1],[6]; t in [2]
    conflict   3934     3934     a in [1],[6]; c in [2]
    conflict   3960     3960     a in [1],[6]; g in [2]
    conflict   3994     3994     t in [1],[6]; c in [2]
    conflict   4207     4208     tt in [1],[6]; ttt in [2]
    conflict   4396     4396     t in [1],[6]; c in [2]
    conflict   5636     5636     c in [1]; g in [3]
    conflict   6100     6100     t in [1]; tc in [3]
    conflict   6119     6119     t in [1]; tg in [3]
    conflict   8492     8505     gactcacagtactg in [1]; g in [5]
    site        443      611     steroid-dependent regulatory element [9]
BASE COUNT     2994 a   1749 c   1721 g   2742 t
ORIGIN      5 bp upstream of PstI site.
        1 ctgcagactg acatgcattt cataggtaga gataacattt actgggaagc acatctatca
       61 tcataaaaag caggcaagat tttcagactt tcttagtggc tgaaatagaa gcaaaagacg
      121 tgattaaaaa caaaatgaaa caaaaaaaat cagttgatac ctgtggtgta gacatccagc
      181 aaaaaaatat tatttgcact accatcttgt cttaagtcct cagacttggc aaggagaatg
      241 tagatttcta cagtatatat gttttcacaa aaggaaggag agaaacaaaa gaaaatggca
      301 ctgactaaac ttcagctagt ggtataggaa agtaattctg cttaacagag attgcagtga
      361 tctctatgta tgtcctgaag aattatgttg tacttttttc ccccattttt aaatcaaaca
      421 gtgctttaca gaggtcagaa tggtttcttt actgtttgtc aattctatta tttcaataca
      481 gaacaatagc ttctataact gaaatatatt tgctattgta tattatgatt gtccctcgaa
      541 ccatgaacac tcctccagct gaatttcaca attcctctgt catctgccag gccattaagt
      601 tattcatgga agatctttga ggaacactgc aagttcatat cataaacaca tttgaaattg
      661 agtattgttt tgcattgtat ggagctatgt tttgctgtat cctcagaaaa aaagtttgtt
      721 ataaagcatt cacacccata aaaagataga tttaaatatt ccagctatag gaaagaaagt
      781 gcgtctgctc ttcactctag tctcagttgg ctccttcaca tgcatgcttc tttatttctc
      841 ctattttgtc aagaaaataa taggtcacgt cttgttctca cttatgtcct gcctagcatg
      901 gctcagatgc acgttgtaga tacaagaagg atcaaatgaa acagacttct ggtctgttac
      961 tacaaccata gtaataagca cactaactaa taattgctaa ttatgttttc catctctaag
     1021 gttcccacat ttttctgttt tcttaaagat cccattatct ggttgtaact gaagctcaat
     1081 ggaacatgag caatatttcc cagtcttctc tcccatccaa cagtcctgat ggattagcag
     1141 aacaggcaga aaacacattg ttacccagaa ttaaaaacta atatttgctc tccattcaat
     1201 ccaaaatgga cctattgaaa ctaaaatcta acccaatccc attaaatgat ttctatggcg
     1261 tcaaaggtca aacttctgaa gggaacctgt gggtgggtca caattcaggc tatatattcc
     1321 ccagggctca gccagtgtct gtacatacag ctagaaagct gtattgcctt tagcagtcaa
     1381 gctcgaaagg taagcaactc tctggaatta ccttctctct atattagctc ttacttgcac
     1441 ctaaacttta aaaaattaac aattattgtg ctatgtgttg tatctttaag ggtgaagtac
     1501 ctgcgtgata ccccctataa aaacttctca cctgtgtatg cattctgcac tattttatta
     1561 tgtgtaaaag ctttgtgttt gttttcagga ggcttattct ttgtgcttaa aatatgtttt
     1621 taatttcaga acatcttatc ctgtcgttca ctatctgata tgctttgcag tttgcttgat
     1681 taacttctag ccctacagag tgcacagaga gcaaaatcat ggtgttcagt gaattctggg
     1741 gagttatttt aatgtgaaaa ttctctagaa gtttaattcc tgcaaagtgc agctgctgat
     1801 cactacacaa gataaaaatg tggggggtgc ataaacgtat attcttacaa taatagatac
     1861 atgtgaactt atatacagaa aagaaaatga gaaaaatgtg tgtgtgtata ctcacacacg
     1921 tggtcagtaa aaacttttga ggggtttaat acagaaaatc caatcctgag gccccagcac
     1981 tcagtacgca tataaagggc tgggctctga aggacttctg actttcacag attatataaa
     2041 tctcaggaaa gcaactagat tcatgctggc tccaaaagct gtgctttata taagcacact
     2101 ggctatacaa tagttgtaca gttcagctct ttataataga aacagacaga acaagtataa
     2161 atcttctatt ggtctatgtc atgaacaaga attcattcag tggctctgtt ttatagtaaa
     2221 cattgctatt ttatcatgtc tgcatttctc ttctgtctga atgtcaccac taaaatttaa
     2281 ctccacagaa agtttatact acagtacaca tgcatatctt tgagcaaagc aaaccatacc
     2341 tgaaagtgca atagagcaga atatgaatta catgcgtgtc tttctcctag actacatgac
     2401 cccatataaa ttacattact tatctattct gccatcacca aaacaaaggt aaaaatactt
     2461 ttgaagatct actcatagca agtagtgtgc aacaaacaga tatttctcta catttatttt
     2521 tagggaataa aaataagaaa taaaatagtc agcaagcctc tgctttctca tatatctgtc
     2581 caaacctaaa gtttactgaa atttgctctt tgaatttcca gttttgcaag cctatcagat
     2641 tgtgttttaa tcagaggtac tgaaaagtat caatgaattc tagctttcac tgaacaaaaa
     2701 tatgtagagg caactggctt ctgggacagt ttgctaccca aaagacaact gaatgcaaat
     2761 acataaatag atttatgaat atggttttga acatgcacat gagaggtgga tatagcaaca
     2821 gacacattac cacagaatta ctttaaaact acttgttaac atttaattgc ctaaaaactg
     2881 ctcgtaattt actgttgtag cctaccatag agtaccctgc atggtactat gtacagcatt
     2941 ccatccttac attttcactg ttctgctgtt tgctctagac aactcagagt tcaccatggg
     3001 ctccatcggt gcagcaagca tggaattttg ttttgatgta ttcaaggagc tcaaagtcca
     3061 ccatgccaat gagaacatct tctactgccc cattgccatc atgtcagctc tagccatggt
     3121 atacctgggt gcaaaagaca gcaccaggac acaaataaat aaggtgagcc tacagttaaa
     3181 gattaaaacc tttgccctgc tcaatggagc cacagcactt aattgtatga taatgtccct
     3241 tggaaactgc atagctcaga ggctgaaaat ctgaaaccag agttatctaa aagtgtggcc
     3301 acctccaact cccagagtgt tacccaaatg cactagctag aaatcttgaa actggattgc
     3361 ataacttctt tttgtcataa ccattatttc agctactatt attttcaatt acaggttgtt
     3421 cgctttgata aacttccagg attcggagac agtattgaag ctcaggtaca gaaataattt
     3481 cacctccttc tctatgtccc tttcctctgg aagcaaaata cagcagatga agcaatctct
     3541 tagctgttcc aagccctctc tgatgagcag ctagtgctct gcatccagca gttgggagaa
     3601 cactgttcat aagaacagag aaaaagaagg aagtaacagg ggattcagaa caaacagaag
     3661 ataaaactca ggacaaaaat accgtgtgaa tgaggaaact tgtggatatt tgtacgctta
     3721 agcaagacag ctagatgatt ctggataaat gggtctggtt ggaaaagaag gaaagcctgg
     3781 ctgatctgct ggagctagat tattgcagca ggtaggcagg agttccctag agaaaagtat
     3841 gagggaatta cagaagaaaa acagcacaaa attgtaaata ttggaaaagg accacatcag
     3901 tgtagttact agcagtaaga cagacaggat gaaaaatagt tttgtaaaca gaagtatcta
     3961 actactttac tctgttcata cactacgtaa aacttactaa gtaataaaac tagaataaca
     4021 acatctttct ttctctttgt attcagtgtg gcacatctgt aaacgttcac tcttcactta
     4081 gagacatcct caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta
     4141 gactttatgc tgaagagaga tacccaatcc tgccagtaag ttgctctaaa atctgatctg
     4201 agtgtattcc atgccaaagc tctaccattc tgtaatgcaa aaacagtcag agttccacat
     4261 gtttcactaa gaaaatttct ttttctcttg tttttacaaa tgaaagagag gacaaataac
     4321 atttctctat caccgacctg aaactctaca gtcttcagag aatgaatggc ttgctaaaag
     4381 aatgtcaaat cttactatac agctatttca tattacacta ctaaatacac tataaggcat
     4441 agcatgtagt aatacagtgt aaaatagctt tttacactac tatattatta atatctgtta
     4501 attccagtct tgcatttcac atttgcaaaa cgttttgaaa ttcgtatctg aaagctgaat
     4561 actcttgctt tacaggaata cttgcagtgt gtgaaggaac tgtatagagg aggcttggaa
     4621 cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc ctgggtagaa
     4681 agtcagacaa atggtaaggt agaacatgct ttgtacatag tgagagttgg ttcaccctaa
     4741 tactgagaac ttggatatag ctcagccagc gtgctttgcg ttcaagctta ccagagctgt
     4801 tgtatgcctg ttaagcaggg catacagtca tgaggctctt gaaaaatctt aacagacaaa
     4861 gggcaatgga aaatcggagt taagggatgg tagggataaa atgcatagaa agaggtacca
     4921 caattttgat ttttgcccta atgcctctct gcgtggttcc tcaatttttc tacttcattc
     4981 ctcatctcct cagagcattc ctttccctca tgcttgaaac acagatgaaa gactgtgaat
     5041 tctaactgag atgaaaacat ccacaaccac acaacctctg gtgtggagtc acattctgtg
     5101 aaggcaaaaa ctaggccacg taatctatgc gtgcaagcta cgcgtaagct atgtgtgtga
     5161 caggacaatg tgaggaacat actatgtgca caaggactgc agaataaaca ggagcaaagt
     5221 ttttgaagaa aacagagtaa aatcctgttt tcctcttttg ttacattctt tacatatatc
     5281 tcaaatttcc tctttggtta gaagcaagta atatttatgt ttcttggtac tgtttgggtt
     5341 gaagaccatt ctgggataag agaaattcca gtggttcttc ccctaatcat aaaatgtcag
     5401 gtttagtttt tttgtaacac agaaatctct tcatctttta tcttttgttg tgattcttga
     5461 tagagagaga aacaagactt actgacaata gcagcaagaa aatcaatctt ggaagaacaa
     5521 gattgcaatt gcaaaaacaa accaatgtcc ttgcccctac atcctcttcc ccataaattc
     5581 tacattctct atctaccttg tgcttgccaa catgatatac gtaaactctc ttttcctatt
     5641 cattcttaaa ggaattatca gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc
     5701 aatggttctg gttaatgcca ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga
     5761 agacacacaa gcaatgcctt tcagagtgac tgaggtatat gggcatacct tagagatgta
     5821 atctagaatt tatgaagaga gtagacatgt tgttatatga acactgcatt agcgtatctg
     5881 ctcatttgtc tgcatctctt tcagacactg tgttaaaagc agggaatttt ccttatgtct
     5941 ctctcgtcac aatattcctg acattgcaaa gctcctgaga aataacttca gattccactt
     6001 ttcctaggaa ggcttctgga tgagaactaa tcatcttaac tgtaactaga catttctgca
     6061 tccaagaata atctttgtta aaactatatt ctctctctct tttttttttt tttttggttc
     6121 tccagcaaga aagcaaacct gtgcagatga tgtaccagat tggtttattt agagtggcat
     6181 caatggcttc tgagaaaatg aagatcctgg agcttccatt tgccagtggg acaatgagca
     6241 tgttggtgct gttgcctgat gaagtctcag gccttgagca ggtatggccc tagaagttgg
     6301 cttcagaata ttaaaaacac atggaaattt agctgttgta aagctctttt caacacagtt
     6361 atcctaaaac atttaaccag cacaaatttc atcatgattc aatatgtgat tgttgcatag
     6421 aagtgtagat ttgtcccact gggtcctgca atagcccatg ctgagcatgg cttgctgaaa
     6481 gaactgcttt agagggtgaa aagtttgaca cagcagacaa gatgattctc acctaagcag
     6541 ctgttactgt agtggcttga actctaaagg tcttgtatct ccattcctgt gcactgagga
     6601 gcttcttgga aagttcatat aaggtttact agttctaact attatctcat ttggtggcac
     6661 tcaatgtgct ttgttcacgt cttcataaat taatctatct aaaaattgga tgtggttaaa
     6721 gcaatttcag aaataacatg tacataatgt acaattattg atatgaacag aacacaggca
     6781 tagcatattg taattaggag gactgtagtt attttgaata ggaaacacaa tgtaataaat
     6841 gagaattcat tgaaatgtta gtatgctaac tcaatctaaa ttataaagat aaagaggcat
     6901 ttaatcacag ctagatttcc atcacttgtg acagacaggc atatgaatga ttatgtacag
     6961 ctctaggaaa aaaagtatgt aggaaaacta gtacattttg attagaaagt ctgaaaatga
     7021 ggtgccttga tcaaagagaa tacgtgtgtt tgagaaaaaa aaagtttgga tagaggtggt
     7081 aagagagaat atattgaaat ggtgtttcta caaactgcca tggccagatt tgtgtaagag
     7141 acattcagta agtaggcaag gaaagaaata ttactaggta caaagcaaca tcagtaatac
     7201 caaaagaaac caattattcc agatgccaat ctcgtaatag ggttaagaga tttccacccc
     7261 tctagtggtc accagtgcaa ccagtaactt tgctaattta cattttcttt ttttaaatgg
     7321 cagatatagc tttgaactga gtgatcatga actggtactg tgtaatagat gaagacatac
     7381 ttgacgacta aacttctgat ttttaaaaac tcaaattctc ttgaaagatc agttcccagt
     7441 ctagtaacag ctgatagttt aagtatcagt aattggctac cattaacaac tggctcctga
     7501 gaggtcttaa atgtagagac agctttaaac tcaaaagcac agagtgattt ttagaataga
     7561 tttcccaagc aaagaaaata aacagggagg agctttaagg gagtagccat ctcattatta
     7621 ttattattta aagaaatggc agcaagccta caaaagaaaa ataagacaga gcagagaaga
     7681 aagagtcatg gtatgctttt ctatcttagc aaaattaatc tctacatgcc taggaaaaag
     7741 ccatgacaag agcaatcagt tcaaaaggtg tatgcaaaaa accacataat agtaactagt
     7801 actgcattgc caggaaggaa gttatgtcgc cattccatgg atctcattct catttccttg
     7861 cagcttgaga gtataatcaa ctttgaaaaa ctgactgaat ggaccagttc taatgttatg
     7921 gaagagagga agatcaaagt gtacttacct cgcatgaaga tggaggaaaa atacaacctc
     7981 acatctgtct taatggctat gggcattact gacgtgttta gctcttcagc caatctgtct
     8041 ggcatctcct cagcagagag cctgaagata tctcaagctg tccatgcagc acatgcagaa
     8101 atcaatgaag caggcagaga ggtggtaggg tcagcagagg ctggagtgga tgctgcaagc
     8161 gtctctgaag aatttagggc tgaccatcca ttcctcttct gtatcaagca catcgcaacc
     8221 aacgccgttc tcttctttgg cagatgtgtt tccccttaaa aagaagaaag ctgaaaaact
     8281 ctgtcccttc caacaagacc cagagcactg tagtatcagg ggtaaaatga aaagtatgtt
     8341 atctgctgca tccagacttc ataaaagctg gagcttaatc tagaaaaaaa atcagaaaga
     8401 aattacactg tgagaacagg tgcaattcac ttttccttta cacagagtaa tactggtaac
     8461 tcatggatga aggcttaagg gaatgaaatt ggactcacag tactgagtca tcacactgaa
     8521 aaatgcaacc tgatacatca gcagaaggtt tatgggggaa aaatgcagcc ttccaattaa
     8581 gccagatatc tgtatgacca agctgctcca gaattagtca ctcaaaatct ctcagattaa
     8641 attatcaact gtcaccaacc attcctatgc tgacaaggca attgcttgtt ctctgtgttc
     8701 ctgatactac aaggctcttc ctgacttcct aaagatgcat tataaaaatc ttataattca
     8761 catttctccc taaactttga ctcaatcatg gtatgttggc aaatatggta tattactatt
     8821 caaattgttt tccttgtacc catatgtaat gggtcttgtg aatgtgctct tttgttcctt
     8881 taatcataat aaaaacatgt ttaagcaaac acttttcact tgtagtattt gaagtacagc
     8941 aaggttgtgt agcagggaaa gaatgacatg cagaggaata agtatggaca cacaggctag
     9001 cagcgactgt agaacaagta ctagtgggtg agaagttgaa caagagtccc ctacaagcaa
     9061 cttaatctaa taagctagtg gtctacatca gctaaaagag catagtgagg gatgaaattg
     9121 gttctccttt ctaagcatca cctgggacaa ctcatctgga gcagtgtgtc caatctgccg
     9181 ctgccctgat ctcggctggg gtgatg
//
LOCUS       PMUCEN        150 bp ds-DNA             PHG       17-JUL-1990
DEFINITION  Bacteriophage Mu wild type DNA fragment with a gyrase cleavage
            site.
ACCESSION   M32302
KEYWORDS    .
SOURCE      Bacteriophage Mu (wild type) DNA.
  ORGANISM  Bacteriophage mu
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 150)
  AUTHORS   Pato,M., Howe,M. and Higgins,P.
  TITLE     DNA gyrase binds to a centrally located replication enhancer (CEN)
            in the bacteriophage Mu genome
  JOURNAL   Unpublished (1990) In Press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.L.Pato, 23-FEB-1990.
FEATURES       from  to/span     description
    site         49       50     DNA gyrase cleavage site in complimentary
                                 strand
    mut          44       44     a in wt; g in Mu nuB103
    mut          50       50     g in wt; c in Mu nuB1
BASE COUNT       40 a     38 c     27 g     45 t
ORIGIN      Map position at 18.0 kb.
        1 acgcgtcagc gccgctctga ggcaataaac agaatcaggc ataaaatcag ccgcacagat
       61 tttttaaaac gcgccacggg atttttaaac cggtatttaa cggtgtatga atcccgtttt
      121 atcttccttt cactttcttt ctccagtact
//
LOCUS       RATRNRTR     2577 bp ds-DNA             ROD       17-JUL-1990
DEFINITION  Rat snRNP-associated polypeptide N, complete cds.
ACCESSION   J05497
KEYWORDS    snRNP-associated polypeptide N.
SOURCE      Rat male adult (Fisher) DNA, clones rgV and rgIII2.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2577)
  AUTHORS   Schmauss,C. and Lerner,M.R.
  TITLE     The closely related small nuclear ribonucleoprotein polypeptides N
            and B/B' are distinguishable by antibodies as well as by
            differences in their mRNAs and gene structures
  JOURNAL   J. Biol. Chem. 265, 10733-10739 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Schmauss, 13-APR-1990.
FEATURES       from  to/span     description
    pept       1314     2036     snRNP-associated polypeptide N
    signal      774      779     TATA box
    signal      698      702     CAAT box
BASE COUNT      822 a    537 c    581 g    637 t
ORIGIN      
        1 taactagaga actgagaaca gaatccctgt tagaggaatt agagaccaaa ttaaaagagg
       61 tgaaggggct tgcaacccca ttagaacaac aatgccaacc aaccagagct cccagggact
      121 aaaccactac ccaaagacta cacatggctc caactgcata tgtagcagag gatggcctag
      181 ttgggcacaa tggaaagaga agcccttgga cctgcccagg ctggacccac cagtgtagag
      241 taatgtctga ggggtagaag caggtggttg ggaatgggaa tacccttata tgtgaaggga
      301 agcaggatga ggtagggaac ttatgttcgt aaaccaggaa agggaataac gtttgaaatg
      361 taaataaaaa tatatccagt gaaaaaaaaa ctgaagtcta taataaaagc ttttaatcct
      421 ctcagccctt aataaaagtt aattatatta cttatgttaa aaaaaacata aaacagcatg
      481 gtattgtact tttttttttc agacaaaagg tctatggcac acagccaaat cagtgacctc
      541 tggggcacaa tttccagaaa tcaacatcct agagttgacc tctggcttcc atgtatacgt
      601 gcatgcacac acacatatgc atatacatac aaaattacat atatgcagtt gtctaaatca
      661 tatgaagaat ttcaagttgt tttaagttta atatcagcaa atacatgcaa atgtgattat
      721 aagaagctgg atggaatcct gagttgttga ctaaagagct aagaaggggc aattataaaa
      781 caaaaatgac acatgaaatt ccacccgagg ttagaaataa ttaaagaagg ccattgcggc
      841 aagtctagca cagagagtag agggtgctgg aggatgacag acggttggtt ctgaggaggg
      901 attttgcaac gaatggagcg aggaagggat cgtttacact tgagaagaac tactgaacag
      961 cacgtcccag agattgaggt ccaggtcaaa cgtagaagga cagcctcact gagcaaccaa
     1021 gagtgtcact tgtacccacg gcattctcag caacagcaaa ttcctgtggt ggatttccag
     1081 gcagaactga gacaggcgtt cttagctgag acaccaagag gtggttaaag cagtattgga
     1141 acttcaaggt ggtggaagtc aacaaacaca ggacctatcc actgattgtg aaactttggt
     1201 caagcttaca ctgtgttaat aaccctgcat caaaccttta tttattgccc ttccccaagt
     1261 tttaaggatc ttgtaatttt agtgttgaca actgctattg tggaacagca atcatgactg
     1321 tgggtaagag tagcaagatg ctgcagcata ttgactatag aatgagatgt atcctgcaag
     1381 atggaagatt cttcattggc acctttaagg cttttgacaa gcatatgaat ttgatcctct
     1441 gtgattgtga tgagttcagg aagatcaagc caaagaatgc aaaacagcca gaacgtgaag
     1501 aaaaacgggt tttgggtctg gtcttgctac gtggagagaa cttggtttcc atgacagtgg
     1561 agggtccacc tcctaaagat actggcattg ctcgtgtgcc acttgctagt gctgcaggtg
     1621 gccctggtgt tggaagagca gctggcagag gagtaccagc aggtgtacct attccccaag
     1681 ctcctgctgg attagcaggc cctgtccgag gagtgggagg cccatcccag caggtcatga
     1741 ccccacaggg aagaggcact gttgcagctg ctgctgttgc tgctactgct agcattgcag
     1801 gagccccaac ccagtacccg ccaggacggg gaactccacc tccacctgta ggcagagcaa
     1861 ccccacctcc aggcattatg gctcctccac ctggaatgag accacccatg ggcccaccaa
     1921 ttggacttcc ccctgctcaa gggagaccta taggcatgcc ccctccagga atgagactcc
     1981 ctcctccagg aattagaggc ccacctcccc caggaatgcg tccaccaaga ccctaagata
     2041 cagttgataa atctcagccc ttctctttcc ctacaatgct tcttgtgaaa ttgtgtcgcc
     2101 tgcaagcttt tgacccctct tactgcatta actatagata ataaatacat agcgcaattg
     2161 aattgaaaaa aaaagaaata attaaagaaa gtaagtcaca atgactattt gctattgaca
     2221 ttttttttaa atgcccgaat gagagccagt ggagacgata gaaagtccag aagaagctaa
     2281 gataatttca aaacacataa tgtcagtaga acgagggaag gtaagaaccc acagaacaca
     2341 agaaaccact catgaaactc ctcacacaca ggaagaaaag gaagaatgta atttttaaaa
     2401 aaaaagttat agtcaagtta aactatattt tctcattggt ttttttttgt gactttgtat
     2461 ttatttttat gtttctttgt gtatattgta catgtctcag tcaaaggcca acggtgagtg
     2521 ttttcctcta aaaaacctta ttgtttaaga cagggtctct tcctgagctc agaattc
//
LOCUS       WUCSSP       1323 bp ds-DNA             INV       17-JUL-1990
DEFINITION  W.bancrofti species specific DNA fragment.
ACCESSION   M27140
KEYWORDS    .
SOURCE      W.bancrofti DNA, clone IWb35.
  ORGANISM  Wuchereria bancrofti
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 1323)
  AUTHORS   Dissanayake,S. and Piessens,W.F.
  TITLE     Cloning and characterization of a Wuchereria bancrofti-specific DNA
            sequence
  JOURNAL   Mol. Biochem. Parasitol. 39, 147-150 (1990)
  STANDARD  simple staff_entry
BASE COUNT      399 a    204 c    251 g    469 t
ORIGIN      1288 bp upstream of SacI site.
        1 gatctctgtt tcattatacc gagtaaatat tggagaaaag aaaaatttgt tcaacgtgtt
       61 aaagattaac ttgctttcta tataatggaa acattttgca tattggatta gtcagtaaat
      121 taataatgga caattgtgat aagtaaaact aaaaagacat cgtcactctc ttccttatta
      181 tagcatttcc ttgcttaaaa ccacttgcga cgtcactttt tgttataaat catatggtga
      241 atacttttcc tcatttaaga tcgtttatta gcttttgcat tacaaattgt tcattttagt
      301 tgtgaacgca ttttgtacat ttaaatgctt gctttagaat tttaggtttc aactggtacg
      361 tttatgccgt ttatatgaaa ttatgggata acaaagaaaa ataaagataa agaagtaaaa
      421 attcgaatga ttaaatgaat tattagtacc ctgattgcta tagccctttt ctacgttttg
      481 gcaagaagtc ccaaattggt tctcactttt cagaatgaaa atttttagtt gtttatagcg
      541 ccaaaagaaa tgattaacag cagtttggct ttgtggacgg aatgatatgc ttttctgcat
      601 acctttcata aattggaaaa aacaaaataa tttggctaag agtgaatgga gtattcgttc
      661 gtttgtgata ttttcaatgt ttgttgatgt atattcgaag cgtctctgct cactactgtc
      721 aaaccctttt taagaacgtt gcttctacgg tcactgggca gctactacgt attgagtgag
      781 cgatatgaaa agaatataca gtatctaatg actgccaatg tcaaataaat ttttgtatcg
      841 tcactcagcg gtcacaaatg tttcataaat atttcacatg cattctattt taggttcaaa
      901 tatgctttta aaattctgct aaatttgcaa actaacgaga ttttgtttgg cagctcttct
      961 tatgataacg cagttcaatc ctggtggtga agaatttgcc acagtcttcg cattttggat
     1021 gaggttcatg cgtttgtttg tgtttgtgaa atgttgattt atggtcgaat gtccgcccac
     1081 aaccgggtac tttgcattca tagatgaaag gctgaccgtg tgtttcctat gtgttatata
     1141 ttcgttgtaa ttgttcgtga tcaataggaa acaactggca ggatggcaga ttttaataca
     1201 accatatcaa taattatatt aaatgtaaat gttctagctg ggtagagtgg cgtgcatctg
     1261 tagtctcggc cacttggaag actgagctca ggaagattac ttgcacccag gagcttgagg
     1321 agc
//
LOCUS       YSCHXT2      2890 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  S.cerevisiae high affinity hexose transporter-2 (HXT2) gene,
            complete cds.
ACCESSION   M33270
KEYWORDS    high affinity hexose transporter-2.
SOURCE      S.cerevisiae (isogenic strain to S288C) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2890)
  AUTHORS   Kruckeberg,A.L. and Bisson,L.F.
  TITLE     The HXT2 gene of Saccharomyces cerevisiae is required for high
            affinity glucose transport
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.L.Kruckeberg, 26-MAR-1990.
FEATURES       from  to/span     description
    pept        818     2443     high affinity hexose transporter-2
BASE COUNT      808 a    571 c    535 g    976 t
ORIGIN      
        1 aaaaagaaat attattcatt actatcaaga taccgtagaa aagaaaaaga accggggatg
       61 aataataaca aaacgggctg ctttttcttt ttctctttct ttttcatttg gtccctctcc
      121 actctttctc cacgtggctt tgcttcccgt atttttcttc gtcagagaga ctacatgata
      181 gtccaaagaa aagaaacagg ggggacgaag aagaggagag gaaaaaccaa aatataattt
      241 tccgtgaaat agattctttt tctccactgc acgacttctt ctcctcccac aaaaaatgac
      301 gcctcataga cagccccgca gcttcacttt taagtttctt tttctcctca cggcgcaacc
      361 gctaacttaa gctaatcctt atgaatccgg agaaaagcgg ggtcttttaa ctcaataaaa
      421 ttttccgaaa tcctttttcc tacgcgtttt cttcgggaac tagataggtg gctcttccac
      481 ctgtttttcc atcattttag tttttcgcaa gccatgcgtg ccttttcgtt tttgcgatgg
      541 cgaacgaggg ctggaaaaat taacggtacg ccgcctaacg atagtaatag gccacgcaac
      601 tggcgtggac gacaacaata agtcgcccat tttttatgtt ttcaaaacct agcaaccccc
      661 accaaacttg tcatcgttcc cggattcaca aatgatataa aaagcgatta caattctaca
      721 ttctaaccag atttgagatt tcctctttct caattcctct tatattagat tataagaaca
      781 acaaattaaa ttacaaaaag acttataaag caacataatg tctgaattcg ctactagccg
      841 cgttgaaagt ggctctcaac aaacttctat ccactctact ccgatagtgc agaaattaga
      901 gacggatgaa tctcctattc aaaccaaatc tgaatacact aacgctgaac tcccagcaaa
      961 gccaatcgcc gcatattgga ctgttatctg tttatgtcta atgattgcat ttggtgggtt
     1021 tgtctttggt tgggatactg gtaccatctc tggttttgtt aatcaaaccg atttcaaaag
     1081 aagatttggt caaatgaaat ctgatggtac ctattatctt tcggacgtcc ggactggttt
     1141 gatcgttggt atcttcaata ttggttgtgc ctttggtggg ttaaccttag gacgtctggg
     1201 tgatatgtat ggacgtagaa ttggtttgat gtgcgtcgtt ctggtataca tcgttggtat
     1261 tgtgattcaa attgcttcta gtgacaaatg gtaccaatat ttcattggta gaattatctc
     1321 tggtatgggt gtcggtggta ttgctgtcct atctccaact ttgatttccg aaacagcacc
     1381 aaaacacatt agaggtacct gtgtttcttt ctatcagtta atgatcactc taggtatttt
     1441 cttaggttac tgtaccaact atggtactaa agactactcc aattcagttc aatggagagt
     1501 gcctttgggt ttgaactttg ccttcgctat tttcatgatc gctggtatgc taatggttcc
     1561 agaatctcca agattcttag tcgaaaaagg cagatacgaa gacgctaaac gttctttggc
     1621 aaaatctaac aaagtcacca ttgaagatcc aagtattgtt gctgaaatgg atacaattat
     1681 ggccaacgtt gaaactgaaa gattagccgg taacgcttct tggggtgagt tattctccaa
     1741 caaaggtgct attttacctc gtgtgattat gggtattatg attcaatcct tacaacaatt
     1801 aactggtaac aattacttct tctattatgg tactactatt ttcaacgccg tcggtatgaa
     1861 agattctttc caaacttcca tcgttttagg tatagtcaac ttcgcatcca ctttcgtggc
     1921 cttatacact gttgataaat ttggtcgtcg taagtgtcta ttgggtggtt ctgcttccat
     1981 ggccatttgt tttgttatct tctctactgt cggtgtcaca agcttatatc caaatggtaa
     2041 agatcaacca tcttccaagg ctgccggtaa cgtcatgatt gtctttacct gtttattcat
     2101 tttcttcttc gctattagtt gggccccaat tgcctacgtt attgttgccg aatcctatcc
     2161 tttgcgtgtc aaaaatcgtg ctatggctat tgctgttggt gccaactgga tttggggttt
     2221 cttgattggt ttcttcactc ccttcattac aagtgcaatt ggattttcat acgggtatgt
     2281 cttcatgggc tgtttggtat tttcattctt ctacgtgttt ttctttgtct gtgaaaccaa
     2341 gggcttaaca ttagaggaag ttaatgaaat gtatgttgaa ggtgtcaaac catggaaatc
     2401 tggtagctgg atctcaaaag aaaaaagagt ttccgaggaa taagagatta tacttaaact
     2461 agcactgatt tttttaaggc taatggctac taatacttta atagatgatc ttcatacttt
     2521 tttatttaac gatttttaat gatgttttta tttgtaccac tcatttatct agattttttt
     2581 aatactgatc aaatcttacg gactcgacgt taaaaagttc ctacatacgt ctggtacttg
     2641 aaacgctgct tcgaggtatt gacactataa gaatacgatc caaatactta caccgcatgt
     2701 aaaaatatgc cgacaatatg aatacttgtt gatgaatgat atttgatttt aatccggcaa
     2761 tttacctcct ttatataatc caataattgt tgataattag tggttaggtt gcagtactaa
     2821 taagaattaa gacaaatatt cttctactat ataaaaggtg caaacaaaac acacgccgat
     2881 cggccatact
//
LOCUS       CHKARCASR    2392 bp ss-mRNA            VRT       17-JUL-1990
DEFINITION  Chicken 5-aminoimidazole ribonucleotide (AIR) carboxylase-5-
            aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR)
            synthetase, complete cds.
ACCESSION   M31764
KEYWORDS    5-aminoimidazole ribonucleotide carboxylase;
            5-aminoimidazole-4-N-succinocarboxamide ribonucleotide synthetase.
SOURCE      Chicken liver, cDNA to mRNA, clone pZD1.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 2392)
  AUTHORS   Chen,Z., Dixon,J.E. and Zalkin,H.
  TITLE     Cloning of a chicken liver cDNA encoding 5-aminoimidazole
            ribonucleotide carboxylase and
            5-aminoimidazole-4-n-succinocarboxamide ribonucleotide synthetase
            by functional complementation of Escherichia coli pur mutants
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3097-3101 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Zalkin, 01-FEB-1990.
FEATURES       from  to/span     description
    pept         59     1339     5-aminoimidazole ribonucleotide (AIR)
                                 carboxylase-5-
                                 aminoimidazole-4-N-succinocarboxamide
                                 ribonucleotide (SAICAR) synthetase
    matp         59      835     SAICAR synthetase domain
    matp        836     1336     AIR carboxylase domain
    site       2341     2347     poly-A signal
BASE COUNT      696 a    466 c    527 g    702 t      1 others
ORIGIN      
        1 ggcgctcagt ttctttcctg tgccttaggt gcctgtcgcc gccgccgctg tcgccgccat
       61 ggcccccgcc gcctcagaac tgaaacttgg taaaaaagtt aatgagggta aaacgaaaga
      121 agtgtatgag ctgccggata tcccgggatg cgttctgatg cagtccaagg accaaataac
      181 agcggggaat gccgctagga aggaccggat ggaggggaag gctgccatct ccaacaccac
      241 caccagctgc gtgttccagc tgctgcagga agcaggaatc aaaacggctt ttgtcaggaa
      301 acagagtgac acagctttca tagcagctca ctgtgaaatg atcccaattg aatgggtctg
      361 cagaagaatt gctactggct ctttcctcaa aagaaaccct ggtgtcaaag aaggctataa
      421 gttttaccca cctaaaattg agatgtttta caaggatgat gccaataatg atccacagtg
      481 gtctgaggag cagctaattg aagcaaaatt ctcttttgct ggacttacta ttggcaagac
      541 tgaagtggat attatggctc gttctactca agctattttt gagatcctgg aaaaatcatg
      601 gcagccccaa aactgcactc tggtggacct gaagattgaa tttggtgtta atattttgac
      661 caaagaaatt gttcttgctg atgttattga taatgattca tggagactgt ggccatcggg
      721 agacagaagc cagcagaagg acaaacagtc ctatcgagat ctgaaggaag tgactcctga
      781 agcattgcag atggttaaga gaaactttga atgggttgca gaaagagtag agttgcttct
      841 gaaaacaaag agccaaggta gagttgtggt attgatggga tctacttctg accttggcca
      901 ctgtgagaaa ataaagaagg catgtgcaac ctttggaatt ccttgtgagt taagagtaac
      961 ctccgctcac aaagggccag atgaaactct gaggatcaaa gcagaatatg aaggagatgg
     1021 aatcccgact gtgtttgttg cagtagctgg cagaagcaat ggtttagggc cagtaatgtc
     1081 tggtaacact gcttaccctg ttgtcaactg tcctcccctc tcatctgact ggggtgctca
     1141 ggatgtgtgg tcctctctca gactgcccag tggtcttggc tgtcctacta ctctgtcacc
     1201 tgaaggagct gctcagtttg ctgcccagat ttttggttta aacaaccact tggtatgggc
     1261 caaactgcga tcaaacatgt taaatacatg gatctctttg aagcaggctg acaaaaaatt
     1321 gcgggagtgc accttgtaag tccaaccaac aagtaactcc tcctcgctac acaaagatag
     1381 tggcgtgcat atgcatttgt atcaggattt gcagttggat gatgtgagac atcttttctt
     1441 acagaaggaa gcatcttcag agcagcagct gagtctgcta tgtcctcatt ccttttcttg
     1501 tgtatgtttt tggttttttt agtatagcca cttagtaaaa gatgagaagg taatatttta
     1561 gttatccttc tgttgatcca aaatagtgct gttctgctct ttagtgtata ctgttaacta
     1621 ctgatgatgg ttatgtcaac acaatgattg atttggtgtg atttaaatag aaaccccctt
     1681 ttttttcccc ctgtaacagt aacattcttc catagtaata atctatgtcc tgaagcttat
     1741 taactcaatg ggactgctaa ctaaaactag tgctgatgta gtgctgtaat aatggtatgc
     1801 tttcttttag caataaaatg gcaacatatg ctgtattaat aagaaaatag gctaagaatg
     1861 agtaaaagcc tgaaagaaat aactgcctta tttaaaagaa aaacatatat tttttcatgt
     1921 gtatgtgtgt atgtatatac accccatcag taactttacc acatcatttg agtgagttaa
     1981 cagttctctt gtttgtgaga agctctaatt ttgaatggaa ccataactag gcttgttgcc
     2041 tacttgctgt ataactcttg actggtccag tcttttgtcc ttcaaatagt agacagatct
     2101 actgtgtatg tatgtgatgc tgtcaaatta tccttataca catcaatgta tctatattcc
     2161 ctcactctgt ttctatggga attaaactga actgaaggtg ggaataaaca gtaaagcaac
     2221 agaatacacg ggcaagccta agcatgactg tcttgntctt tgcacacaac attctttaaa
     2281 gtaggtactg attccaagaa atatgcaatt actcaatgta tgtacaattt atgcttttat
     2341 aattaaataa actatttaat gcaaggaatt cgatatcaag cttatcgata cc
//
LOCUS       ECORAFRO     1289 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  E.coli raffinose repressor (rafR) gene, complete cds, and
            alpha-galactosidase (rafA) gene, 5' end.
ACCESSION   M29849
KEYWORDS    alpha-galactosidase; raffinose repressor.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1289)
  AUTHORS   Aslanidis,C. and Schmitt,R.
  TITLE     Regulatory elements of the raffinose operon: Nucleotide sequences
            of operator and repressor genes
  JOURNAL   J. Bacteriol. 172, 2178-2180 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Schmitt, 13-NOV-1989.
FEATURES       from  to/span     description
    pept        166     1176     raffinose repressor
    pept       1270  >  1289     alpha-galactosidase
    binding     156      161     ribosome binding site
    binding    1258     1263     ribosome binding site
    signal      120      126     -35 region
    signal      144      149     -10 region
    signal     1212     1216     -35 region
    signal     1235     1240     -10 region
    site       1196     1213     raf operator
    site       1217     1234     raf operator
BASE COUNT      317 a    283 c    326 g    363 t
ORIGIN      
        1 aagcttatta tgcttccatc ggaaacaatg atctggcagc ctgagttcac agataaaata
       61 tctctccagg aaacccgggg ctgttcatca tgcaagtctg tcgattactg gctttggtga
      121 cggaattttc tggatttccg gcttagaacc acagcaggag ataatatgtc acttaaagcg
      181 attgccacga cactcggtat ttctgtcacc actgtcagtc gggctcttgg aggcttttca
      241 gatgtggctg cttctacccg tgagcgcgtg gaagcggaag cacgtcgacg aggttaccgc
      301 cctaatacac aggcaagaag actcaaaacc ggtaaaaccg atgctatcgg tctggtttat
      361 cctgaaaatg atgtgccgtt taacagcggt gtttttatgg atatggtcag ttgcatcagc
      421 agggaacttg cttatcatga tattgactta ctgctgatcg ctgatgatga gcatgcagac
      481 tgccacagct atatgcggct tgttgaaagt cgcagaattg atgctcttat cattgcacat
      541 actctggatg acgatccccg tatcacacat cttcataaag caggtattcc gtttctggct
      601 cttggacggg taccgcaggg cttgccctgt gcgtggtttg actttgataa tcatgccgga
      661 acctggcagg caacccagaa gctgattgct ttgggacata agagtattgc gctgttgagc
      721 gagaacactt cacattctta tgttattgca agacgtcagg gatggcttga tgcactgcat
      781 gagcatggac tgaaagatcc attgttgcgg ctggtttctc ccacgcgacg agcgggctat
      841 ctggctgtga tggagttaat gtcattaccg gcgccaccaa cagctattat tactgacaat
      901 gacctgagtg gagatggtgc ggctatggcg ctgcagttga gagggcgtct ttcagggaaa
      961 gaagctgtat ctctggttgt atatgatggt ttgcctcagg acagcattat tgagctggat
     1021 gtggctgctg ttattcagtc aacacgaagt ctcgttggtc gtcagatttc tgacatggtg
     1081 tatcagataa tcaatggtgc atcaccagaa tcactgcaga taacctggac accgatattt
     1141 taccctggta gcacggttca ttctccttcc ttctgatttt ttatccagat cacacaaccg
     1201 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg
     1261 agatgtctaa tgatttcaaa gtactgcag
//
LOCUS       HSEDILSTD     351 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Equine herpesvirus type 1 subfragments from near the left terminus
            of STD DNA.
ACCESSION   M34145
KEYWORDS    defective interfering particle.
SOURCE      Equine herpesvirus type 1 (strain Kentucky A) DNA, clone Sma2.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 351)
  AUTHORS   Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
            O'Callaghan,D.J.
  TITLE     Identification of the site of recombination in the generation of
            the genome of DI particles of Equine herpesvirus type 1
  JOURNAL   Virology 175, 448-455 (1990)
  STANDARD  simple staff_review
BASE COUNT       55 a    117 c    127 g     52 t
ORIGIN      
        1 cgccgctcac gtcacactcc tccccaaaga gggtattcgg ggggtcgctc ggcggctcca
       61 ggaaccgctg ctcgtaccgg tcctcggagt tcagtgcccg cagcgtggcg cgcagcggga
      121 agctctgctc gggcagcagc tgcaggaggg actgccagca ctcgagtgtg gggcgcgtgt
      181 gcgtgtaccc catggcgtaa aaatccagca gcgccctgcg cagggtgcgc gagtcctcgg
      241 tcacgtggat cgaggaggtg gatatccccc tgaacaagcg gttgacgtcc cccgcgaggc
      301 ggttcacatc ggggcgccac ggcggggcgc agaacgcccc tggcccccgg g
//
LOCUS       HSEDISA       633 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Equine herpesvirus type 1 defective interfering particle DNA.
ACCESSION   M34143
KEYWORDS    defective interfering particle.
SOURCE      Equine herpesvirus type 1 (strain Kentucky A) DNA, clone pHSII.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 633)
  AUTHORS   Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
            O'Callaghan,D.J.
  TITLE     Identification of the site of recombination in the generation of
            the genome of DI particles of Equine herpesvirus type 1
  JOURNAL   Virology 175, 448-455 (1990)
  STANDARD  simple staff_entry
BASE COUNT      124 a    184 c    202 g    123 t
ORIGIN      
        1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac
       61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa
      121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt
      181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa
      241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc acgtcacact
      301 cctccccaaa gagggtattc ggggggtcgc tcggcggctc caggaaccgc tgctcgtacc
      361 ggtcctcgga gttcagtgcc cgcagcgtgg cgcgcagcgg gaagctctgc tcgggcagca
      421 gctgcaggag ggactgccag cactcgagtg tggggcgcgt gtgcgtgtac cccatggcgt
      481 aaaaatccag cagcgccctg cgcagggtgc gcgagtcctc ggtcacgtgg atcgaggagg
      541 tggatatccc cctgaacaag cggttgacgt cccccgcgag gcggttcaca tcggggcgcc
      601 acggcggggc gcagaacgcc cctggccccc ggg
//
LOCUS       HSEDISTD      290 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Equine herpesvirus type 1 subfragments from the inverted repeat
            of STD DNA.
ACCESSION   M34144
KEYWORDS    defective interfering particle.
SOURCE      Equine herpesvirus type 1 (strain Kentucky A) DNA, clone xbaG..
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 290)
  AUTHORS   Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and
            O'Callaghan,D.J.
  TITLE     Identification of the site of recombination in the generation of
            the genome of DI particles of Equine herpesvirus type 1
  JOURNAL   Virology 175, 448-455 (1990)
  STANDARD  simple staff_review
BASE COUNT       69 a     72 c     77 g     72 t
ORIGIN      
        1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac
       61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa
      121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt
      181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa
      241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc
//
LOCUS       HUMAR        1230 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human amphiregulin (AR) mRNA, complete cds.
ACCESSION   M30704
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SOURCE      Human cell line MCF-7 breast carcinoma, cDNA to mRNA, clones
            lambda-AR1 and lambda-AR2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1230)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept        210      968     amphiregulin precursor
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    sigp        210      266     amphiregulin signal peptide
    matp        510      962     amphiregulin (alt.)
    matp        528      962     amphiregulin (alt.)
    mRNA          1     1230     amphiregulin mRNA
BASE COUNT      375 a    273 c    285 g    297 t
ORIGIN      
        1 agacgttcgc acacctgggt gccagcgccc cagaggtccc gggacagccc gaggcgccgc
       61 gcccgccgcc ccgagctccc caagccttcg agagcggcgc acactcccgg tctccactcg
      121 ctcttccaac acccgctcgt tttgcggcag ctcgtgtccc agagaccgag ttgccccaga
      181 gaccgagacg ccgccgctgc gaaggaccaa tgagagcccc gctgctaccg ccggcgccgg
      241 tggtgctgtc gctcttgata ctcggctcag gccattatgc tgctggattg gacctcaatg
      301 acacctactc tgggaagcgt gaaccatttt ctggggacca cagtgctgat ggatttgagg
      361 ttacctcaag aagtgagatg tcttcaggga gtgagatttc ccctgtgagt gaaatgcctt
      421 ctagtagtga accgtcctcg ggagccgact atgactactc agaagagtat gataacgaac
      481 cacaaatacc tggctatatt gtcgatgatt cagtcagagt tgaacaggta gttaagcccc
      541 cccaaaacaa gacggaaagt gaaaatactt cagataaacc caaaagaaag aaaaagggag
      601 gcaaaaatgg aaaaaataga agaaacagaa agaagaaaaa tccatgtaat gcagaatttc
      661 aaaatttctg cattcacgga gaatgcaaat atatagagca cctggaagca gtaacatgca
      721 aatgtcagca agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca
      781 tgattgacag tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg
      841 tgatcctcac agctgttgct gttattacag tccagcttag aagacaatac gtcaggaaat
      901 atgaaggaga agctgaggaa cgaaagaaac ttcgacaaga gaatggaaat gtacatgcta
      961 tagcataact gaagataaaa ttacaggata tcacattgga gtcactgcca agtcatagcc
     1021 ataaatgatg agtcggtcct ctttccagtg gatcataaga caatggaccc tttttgttat
     1081 gatggtttta aactttcaat tgtcactttt tatgctattt ctgtatataa aggtgcacga
     1141 aggtaaaaag tattttttca agttgtaaat aatttattta atatttaatg gaagtgtatt
     1201 tattttacag ctcattaaac ttttttaacc
//
LOCUS       HUMAR1        968 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human amphiregulin (AR) gene, exon 1.
ACCESSION   M30698
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     1 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 968)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept        859  +   919     amphiregulin precursor, exon 1
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    sigp        859      915     amphiregulin signal peptide
    pre-msg     649  >   968     amphiregulin mRNA and introns
    IVS         920  >   968     amphiregulin intron A
BASE COUNT      206 a    280 c    257 g    225 t
ORIGIN      
        1 gaattcatat ccacctggct ttgaacatta tcggctgtga gatggtgtag gtaaaatttt
       61 aagtgcataa tttggcaata ataaatcatc aataaatatt aatgttgatg aggcccctgg
      121 gccacataaa gaaataggga gtgaggggat ttgaaattct ggccacttca cagaaatggg
      181 tgggaagggg ctcttgattg agatagaagc ccatcctaca tgaagcaatt cctcattgag
      241 ttctctcgtc ctttatcctt gttggaaaca tcaggcaaag tcactcttgg tcttaaagta
      301 cttttacatc taaatacgga actcttctat ttaatccctg tctgttgtag atgttaagta
      361 tacaaagagg ttgtcagagt ttgaaacatc tggacttctg tcaggtacta gctccggaac
      421 tccagtcctg ctcgccctca aaaacggctt gcagctagag gtttaagttc cacttcctct
      481 cagcgaatcc ttacgcacga gggaggcggg gcgtgtgtcc tccgcgcgtg gttttcgggt
      541 agcaccttct ggggcgccgc ctgcctccac ccacggccgg gccttgacgt catgggctgc
      601 ggccccctcc cggctgagcc tataaagcgg caggtgcgcg ccgccctaca gacgttcgca
      661 cacctgggtg ccagcgcccc agaggtcccg ggacagcccg aggcgccgcg cccgccgccc
      721 cgagctcccc aagccttcga gagcggcgca cactcccggt ctccactcgc tcttccaaca
      781 cccgctcgtt ttgcggcagc tcgtgtccca gagaccgagt tgccccagag accgagacgc
      841 cgccgctgcg aaggaccaat gagagccccg ctgctaccgc cggcgccggt ggtgctgtcg
      901 ctcttgatac tcggctcagg tgaggattca acggcgctga actgctgggc tctcctccca
      961 tggcaggt
//
LOCUS       HUMAR2        348 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human amphiregulin (AR) gene, exon 2.
ACCESSION   M30699
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     2 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept    +    51  +   299     amphiregulin precursor, exon 2
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    matp        290  +   299     amphiregulin
    pre-msg <     1  >   348     amphiregulin mRNA and introns
    IVS     <     1       50     amphiregulin intron A
    IVS         300  >   348     amphiregulin intron B
BASE COUNT       88 a     75 c     86 g     99 t
ORIGIN      
        1 agcaccctac tttacctttt cgttttcttc ctttattccc tcccctgcag gccattatgc
       61 tgctggattg gacctcaatg acacctactc tgggaagcgt gaaccatttt ctggggacca
      121 cagtgctgat ggatttgagg ttacctcaag aagtgagatg tcttcaggga gtgagatttc
      181 ccctgtgagt gaaatgcctt ctagtagtga accgtcctcg ggagccgact atgactactc
      241 agaagagtat gataacgaac cacaaatacc tggctatatt gtcgatgatt cagtcagagg
      301 tgagtagggg ataaagcaaa aatatggcct gtgagatgtg ggtttata
//
LOCUS       HUMAR3        302 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human amphiregulin (AR) gene, exon 3.
ACCESSION   M30700
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     3 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 302)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept    +    52  +   252     amphiregulin precursor, exon 3
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    matp    +    52  +   252     amphiregulin
    matp         60  +   252     amphiregulin (alt.)
    pre-msg <     1  >   302     amphiregulin mRNA and introns
    IVS     <     1       51     amphiregulin intron B
    IVS         253  >   302     amphiregulin intron C
BASE COUNT      125 a     46 c     54 g     77 t
ORIGIN      
        1 aattatattc aagtttgaga gactcttgtc aataaatctt ttctttttta gttgaacagg
       61 tagttaagcc cccccaaaac aagacggaaa gtgaaaatac ttcagataaa cccaaaagaa
      121 agaaaaaggg aggcaaaaat ggaaaaaata gaagaaacag aaagaagaaa aatccatgta
      181 atgcagaatt tcaaaatttc tgcattcacg gagaatgcaa atatatagag cacctggaag
      241 cagtaacatg cagtaagttt tcctaaagca tatagatttt tgtatttcta gcaccatgtc
      301 tg
//
LOCUS       HUMAR4        254 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human amphiregulin (AR) gene, exon 4.
ACCESSION   M30701
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     4 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 254)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept    +    51  +   204     amphiregulin precursor, exon 4
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    matp    +    51       91     amphiregulin (both alt.)
    pre-msg <     1  >   254     amphiregulin mRNA and introns
    IVS     <     1       50     amphiregulin intron C
    IVS         205  >   254     amphiregulin intron D
BASE COUNT       81 a     42 c     53 g     78 t
ORIGIN      
        1 cacaccgcac gtgagtgtga ttataatttt taaatgtgaa ttgcttgcag aatgtcagca
       61 agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca tgattgacag
      121 tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg tgatcctcac
      181 agctgttgct gttattacag tccagtaagt atgacataac ttacaaattc ttaataaaat
      241 aatgggaggt taat
//
LOCUS       HUMAR5        217 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human amphiregulin (AR) gene, exon 5.
ACCESSION   M30702
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     5 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 217)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pept    +    50      143     amphiregulin precursor, exon 5
                                 /hgml_locus_uid="LW0168B" /nomgen="AREG"
                                 /map="4q13-q21"
    pre-msg <     1  >   217     amphiregulin mRNA and introns
    IVS     <     1       49     amphiregulin intron D
BASE COUNT       89 a     26 c     41 g     61 t
ORIGIN      
        1 tatagatgaa tagaaccttg ataacattag aatgccttgt tctctgaagg cttagaagac
       61 aatacgtcag gaaatatgaa ggagaagctg aggaacgaaa gaaacttcga caagagaatg
      121 gaaatgtaca tgctatagca taactgaaga taaaattaca ggtttgagtt ttaaaatata
      181 tctttagatc atatcctata attttgaaaa atttaac
//
LOCUS       HUMAR6        354 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human amphiregulin (AR) gene, 3' flanking region.
ACCESSION   M30703
KEYWORDS    amphiregulin; growth regulator; tumor inhibitory factor.
SEGMENT     6 of 6
SOURCE      Human breast carcinoma cell line MCF-7 DNA, clones
            lambda-ARH[6,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G.,
            Disteche,C.M., Todaro,G.J. and Shoyab,M.
  TITLE     The amphiregulin gene encodes a novel epidermal growth factor-
            related protein with tumor-inhibitory activity
  JOURNAL   Mol. Cell. Biol. 10, 1969-1981 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.Plowman, 18-DEC-1989.
FEATURES       from  to/span     description
    pre-msg <     1      294     amphiregulon mRNA and intron
BASE COUNT      104 a     48 c     52 g    150 t
ORIGIN      
        1 gtaacatttt gttttatttt attattttat tttattttat tttctcacag gatatcacat
       61 tggagtcact gccaagtcat agccataaat gatgagtcgg tcctctttcc agtggatcat
      121 aagacaatgg accctttttg ttatgatggt tttaaacttt caattgtcac tttttatgct
      181 atttctgtat ataaaggtgc acgaaggtaa aaagtatttt ttcaagttgt aaataattta
      241 tttaatattt aatggaagtg tatttatttt acagctcatt aaactttttt aaccaaacaa
      301 attgagagtt tgaatattag ttctgatatt gcaagactcc agtgtacttt tctc
//
LOCUS       HUMROSSAA    1890 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human Ro ribonucleoprotein autoantigen (Ro/SS-A), complete cds.
ACCESSION   M32294
KEYWORDS    Ro ribonucleoprotein autoantigen.
SOURCE      Human T-cell line Jurkat, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1890)
  AUTHORS   McCauliffe,D.P., Lux,F.A., Lieu,T.-S., Sanz,I., Hanke,J.,
            Newkirk,M.M., Bachinski,L.L., Itoh,Y., Siciliano,M.J., Reichlin,M.,
            Sontheimer,R.D. and Capra,J.D.
  TITLE     Molecular cloning, expression and Chromosome 19 localization of a
            human Ro/SS-A auto antigen
  JOURNAL   J. Clin. Invest. 85, 1379-1391 (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            D.P.McCauliffe, 23-FEB-1990.
FEATURES       from  to/span     description
    pept         67     1320     Ro ribonucleoprotein autoantigen (Ro/SS-A)
                                 precursor
    sigp         67      117     Ro ribonucleoprotein autoantigen signal
                                 peptide
    matp        118     1317     Ro ribonucleoprotein autoantigen
    site       1881     1886     poly-A signal (put.)
BASE COUNT      464 a    497 c    520 g    409 t
ORIGIN      Chromosome 19.
        1 ccgtactgca gagccgctgc cggagggtcg ttttaaaggg ccgcgttgcc gccccctcgg
       61 cccgccatgc tgctatccgt gccgctgctg ctcggcctcc tcggcctggc cgtcgccgag
      121 cccgccgtct acttcaagga gcagtttctg gacggagacg ggtggacttc ccgctggatc
      181 gaatccaaac acaagtcaga ttttggcaaa ttcgttctca gttccggcaa gttctacggt
      241 gacgaggaga aagataaagg tttgcagaca agccaggatg cacgctttta tgctctgtcg
      301 gccagtttcg agcctttcag caacaaaggc cagacgctgg tggtgcagtt cacggtgaaa
      361 catgagcaga acatcgactg tgggggcggc tatgtgaagc tgtttcctaa tagtttggac
      421 cagacagaca tgcacggaga ctcagaatac aacatcatgt ttggtcccga catctgtggc
      481 cctggcacca agaaggttca tgtcatcttc aactacaagg gcaagaacgt gctgatcaac
      541 aaggacatcc gttgcaagga tgatgagttt acacacctgt acacactgat tgtgcggcca
      601 gacaacacct atgaggtgaa gattgacaac agccaggtgg agtccggctc cttggaagac
      661 gattgggact tcctgccacc caagaagata aaggatcctg atgcttcaaa accggaagac
      721 tgggatgagc gggccaagat cgatgatccc acagactcca agcctgagga ctgggacaag
      781 cccgagcata tccctgaccc tgatgctaag aagcccgagg actgggatga agagatggac
      841 ggagagtggg aacccccagt gattcagaac cctgagtaca agggtgagtg gaagccccgg
      901 cagatcgaca acccagatta caagggcact tggatccacc cagaaattga caaccccgag
      961 tattctcccg atcccagtat ctatgcctat gataactttg gcgtgctggg cctggacctc
     1021 tggcaggtca agtctggcac catctttgac aacttcctca tcaccaacga tgaggcatac
     1081 gctgaggagt ttggcaacga gacgtggggc gtaacaaagg cagcagagaa acaaatgaag
     1141 gacaaacagg acgaggagca gaggcttaag gaggaggaag aagacaagaa acgcaaagag
     1201 gaggaggagg cagaggacaa ggaggatgat gaggacaaag atgaggatga ggaggatgag
     1261 gaggacaagg aggaagatga ggaggaagat gtccccggcc aggccaagga cgagctgtag
     1321 agaggcctgc ctccagggct ggactgaggc ctgagcgctc ctgccgcaga gcttgccgcg
     1381 ccaaataatg tctctgtgag actcgagaac tttcattttt ttccaggctg gttcggattt
     1441 ggggtggatt ttggttttgt tcccctcctc cactctcccc caccccctcc ccgccctttt
     1501 tttttttttt tttaaactgg tattttatcc tttgattctc cttcagccct cacccctggt
     1561 tctcatcttt cttgatcaac atcttttctt gcctctgtgc cccttctctc atctcttagc
     1621 tcccctccaa cctggggggc agtggtgtgg agaagccaca ggcctgagat ttcatctgct
     1681 ctccttcctg gagcccagag gagggcagca gaagggggtg gtgtctccaa ccccccagca
     1741 ctgaggaaga acggggctct tctcatttca cccctccctt tctcccctgc ccccaggact
     1801 gggccacttc tgggtggggc agtgggtccc agattggctc acactgagaa tgtaagaact
     1861 acaaacaaaa tttctattaa attaaatttt
//
LOCUS       PRD1XII      1701 bp ds-DNA             PHG       17-JUL-1990
DEFINITION  Bacteriophage PRD1 early protein genes XII and XIX, complete cds.
ACCESSION   M30146
KEYWORDS    early protein XII; early protein XIX.
SOURCE      Bacteriophage PRD1 (Viradea strain) DNA.
  ORGANISM  Bacteriophage PRD1
            Viridae; ds-DNA nonenveloped viruses; Tectiviridae.
REFERENCE   1  (bases 1 to 1701)
  AUTHORS   Gerendasy,D. and Ito,J.
  TITLE     Nucleotide sequence and transcription of the right early region of
            bacteriophage PRD1
  JOURNAL   J. Bacteriol. 172, 1889-1898 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Gerendasy, 27-NOV-1989.
FEATURES       from  to/span     description
    pept        242      724     early protein XII
    pept        797      955     early protein XIX (put.)
    mRNA        179      732     early protein XII mRNA
    mRNA        181     1100     early protein XIX mRNA
    binding     230      233     ribosome binding site
    binding     785      788     ribosome binding site
    rpt           1      111     inverted terminal repeat
    site        145      172     right early promoter
BASE COUNT      419 a    447 c    390 g    445 t
ORIGIN      
        1 gggggatacg tgcccctccc cacctacccg cgcccctaac atttttattt ccgtctgtca
       61 ataccccctg catccgatag gcccgaacta tcacaaacgg aaaagcgata ggaataaaat
      121 tttgatttat tttcaccata actattgcac gtcctaaaaa tgtgcgtata atttcaatca
      181 tcgaaagggc aaacacaaaa cccccgccga taacttccac ttaactttaa aggtaactat
      241 catggaaatc gtaagcaagc tgactctgaa aaccattggc gctcaaccca agccacatag
      301 cgtaaaagaa aataccgcgc tggcttccat ctatggccgc gttcgcggta agaaagttgg
      361 tcaatccacc tttggcgact tcatcaagtt tgaaggtgaa tttgaaggcg tgaatatcgc
      421 cactggtgaa gtgttccgtt ccggtgcgct gattctgccg aaggtactgg aaagcctgct
      481 tgccggtgcc gtggatggtg aaaacacggt tgattttgcg gttgaaattt gggccaagcc
      541 ttccgaaaag ggcaacactg gttatgaata cggtgtcaag ccgctgattg aacccgccgc
      601 atcggatgaa ctggccgcgc ttcgcaatca ggttaaggcc gcgctgcctg cccctgccgc
      661 tgccggtgaa gccgctgccg aagccaagcc cgccgccaag gccaaggcca aggccgaagc
      721 ctaaacagcg caccacggcc ccggctgata ccagcgcggg gccattcttt acccttcaaa
      781 ttcaaaggta acaaacatgg aaaagcaaac cgaaaatacc cgccctgaat gcccgaaggc
      841 attttatttt gtatccattc ccggcgactt tgggcaaacc ccttttgctt catcgcttat
      901 gtatggtagc accgcgcttg ccgctgttat caagtcaagg gcgctatccg cgtagtaagc
      961 gaacattttg atttgcggtt tgctgataac ggttttacgc ctgctggcgt tacgcaagct
     1021 gaatggcttg gcaagctgat tactgaaact ttcggctttc gcttggaact gtttctttaa
     1081 accagttttt tatttcttga atttgaaggc cgcatgattc ccttgcggcc ttttcttttg
     1141 cgtatgcttc caccaaatcc cgccacgtta taaacggata aacaggcgcg gcgcattcag
     1201 ttagcaggct tgacggcgga accaaataaa ccgtttcaat tttcggggct gttgtcgaac
     1261 aagccgcgaa catcggcagg cataggacta tcaagaattt gtttagtttc atcgcttacc
     1321 cttcccaatc cgctaattgt tcggctaagt ttgccgctga taattgtctg gccttcctga
     1381 attgccgcga ttgctttcgc atcgcttgcg cgtactgatt gaagggtttc caccgccttg
     1441 gcgtttgcgg tcgccttttc ttcaatggct ttaatttcat tttgaagccc cgttattttt
     1501 ttagccgcta aaattgaata tccaaagtta gcaagcatta aaagcgcaac cgcaatagcg
     1561 gccacgcttt tcaggttaga caaaacccat ttaatcattg aaggtacgcc ccggcttgtc
     1621 attgtcggaa tctatcggcc ccggctgatt gataggggca atcttgcttt gcacaaattg
     1681 cgagaaaacc gccttgacaa a
//
LOCUS       PSEALALPQ    3141 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  P.aeruginosa exopolysaccharide alginate regulatory protein (algP
            and algQ) genes, complete cds.
ACCESSION   M32077
KEYWORDS    alginate regulatory protein P; alginate regulatory protein Q.
SOURCE      P.aeruginosa (strain PAO, isolate PA02003) DNA, from patient
            PA02003.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 3141)
  AUTHORS   Konyecsni,W.M. and Deretic,V.
  TITLE     DNA sequence and expression analysis of algP and algQ, components
            of the multigene system transcriptionally regulating mucoidy in
            Pseudomonas aeruginosa: algP contains multiple direct repeats
  JOURNAL   J. Bacteriol. 172, 2511-2520 (1990)
  STANDARD  full staff_review
COMMENT     draft entry and computer-readable sequence [1] kindly submitted by
            V.Deretic, 16-FEB-1990.
FEATURES       from  to/span     description
    pept        781     1263     alginate regulatory protein Q
    pept       2079     3137     alginate regulatory protein P (put.)
    mRNA        661     1310     algQ mRNA
    mRNA       2054     3141     algP mRNA
BASE COUNT      607 a   1026 c   1009 g    499 t
ORIGIN      1 bp upstream of BamHI site; map position 9 min.
        1 ggatccgcca tgcccctggc cagcccccgt cagcttttcc ttctcgcgtt cctggcctgc
       61 gtcgccatca tgggcggggc gctgtacctg gaacatgtgg ttggcctgga ggcctgcccg
      121 ctgtgcgtcg tgcagcggat cttcttcatc ctgatcggcc tgacctgcct tgctggcgcg
      181 atccaggggc ccggcctgcg tgggcggcgt atctactccg tgctggtgtt cctgctcgct
      241 ctcggcggcg gggccacggc cgcccgccag gtatggttgc agaccgttcc gctggaccaa
      301 ctgccggcct gcctgcccag cctcgactac atgatgcagg cgcttccctt ccaggaagtg
      361 atccgcctgg tcctgcatgg caccgcggat tgtgcccagg tgagctggac gctattcacc
      421 ctgagcattc ccgaatggag cctgctggcg ttcgttgcct atctcggctt ctccatcgtg
      481 cagttcctcc gacgtgcctg atgcgggctg tcgcgtggtc tttcgtcagc tctcgagcgt
      541 cggccgtatc gatttggcga aaaccagagc ggatgcggga taccgatcaa acatccgtat
      601 gaatttttct gtacgggcgt cttgatgggg tcatcccggg ggcataatct ggctttccat
      661 agcgccggag tccgcccctt gcacaggcgt tctcccacag agcgcagaga gtgcgtcagc
      721 aatgcgtctc ggagcggacg atgattctcc gggacgtccg aagagggaag cgaggacaac
      781 atgctcgaaa gctgccgtaa tgcccaagag cggtgggggg gtgtgcacca actgatcgat
      841 cgctggctgc atgagcgcca gcaactggtg caggccttcg atgccctgag cggtattcag
      901 gcaccggcgc ccaacgccga agaactccag catttctgcc aattgctgct ggactatgta
      961 tctgccggac acttcgaggt ctacgagcaa ctgacggcgg aaggcaaggc cttcggcgat
     1021 cagcgcggcc tggagctggc caagcagatc ttcccccggc tggaagccat caccgaatcc
     1081 gcgctgaact tcaacgaccg ctgcgacaac ggcgattgcc gtgaaggagc ctgcctcatc
     1141 gcggagctga aggtcctgcg gcaacagttg cacgaacgct tcgagctgga ggactgcctc
     1201 atcgaagtcc tgcacaacgc ccacagccag agcggcgcag agggttcggc agtaccggtc
     1261 tgattcaccg gcgagtcgtt acgaaacggc gcgtctccat gcgggagagc gccgtttttg
     1321 catttcagcg gaagcccagc aggtcgatct cgaagaccag cggagcgtcg ggcgggatca
     1381 ggtcgccagc gccttcgtga ccataggctt gcgccgaagg aatcaccact cgccagcgcg
     1441 cgccgacagg catcgctcgc aacgctgtcc gccatccctc tatcacactg tccagggcga
     1501 accactcggc gctttcgctc tggtcgaaca cttgaccgtc ggcaagcagg ccgcggtaac
     1561 ggacgtgtac ctgggttgcc gcgccgatgc cgttaccttg gccacgacgt aattcgctga
     1621 cgaggacgcc gccggtcaat tcgcggacgc cgaagcgggc tttttcgttg gccaggaagc
     1681 gggcctcggc gggtgtggtc cgcggtgttt ccgtcgtggc gttctcgtgt tgcagcagaa
     1741 gttgttcgat gcgttcgggg gggatttcaa gtgcctcgcc gcgataggcc tggcgcagtc
     1801 cgagaagtag ctcggatagc tctaggccgg gcatttcctg ctgtaggcgc atacccaggc
     1861 gggcgcctac ggcataggcc agttcgtcct tgggaggcgc ggcgaaaagc ggcagcgcag
     1921 tagaaaaaga ccgatgaaaa ggcggcgcgg catgcggttt ctccaaacga atcggcgggc
     1981 aatcctggcg cacgtcgctt gtagcacatc gaacccgttg gcgagagggg gtgtgcgggt
     2041 ctagtatggg cgcaaccacg tccgcctgga ggcacgtcat gtcggccaac aagaagcccg
     2101 tcaccacccc cttgcacctg ttgcagcaac tttcccacag ccttgtcgag cacctggaag
     2161 atgcgtgcaa acaagcgctg gtcgattcgg aaaagctcct ggccaaactt gaaaagcaac
     2221 gtggcaaagc ccaggaaaag ctgcacaagg ctcgcaccaa gctgcaggat gctgccaagg
     2281 ccggcaagac caaggcacag gccaaggcgc gcgagaccat cagcgacctg gaagaggcgt
     2341 tggataccct gaaggcccgg caggcggaca cccgtaccta catcgtcggc ctcaagcgtg
     2401 acgtacagga aagcctcaag ctggcgcagg gtgtcggcaa ggtgaaggaa gctgctggca
     2461 aggctctgga gagccgcaag gcgaaacccg cgaccaaacc tgctgcgaag gcggcagcca
     2521 agcctgcggt gaaaaccgta gcggcgaagc ctgcggccaa gccggctgcg aagcctgctg
     2581 cgaaaccggc ggccaagcct gcgacgaaaa ccgcggcagg caagcccgca gccaagccga
     2641 cggcgaagcc tgctgcgaaa ccggcggcca agcccgcggc gaaaaccgca gccgcgaagc
     2701 ccgcagccaa gccggcggcg aagcctgtgg cgaaaccggc ggccaagcct gcggcgaaaa
     2761 ccgcagccgc gaagcccgcc gccaagccgg cagcgaagcc tgtcgcgaaa ccgacggcca
     2821 agcccgcggc gaaaaccgca gccgcgaagc ccgcagccaa gccagctgcg aagcctgcgg
     2881 cgaaaccggc ggccaagcct gtggcgaaat ccgcggccgc gaagcctgca gccaagccgg
     2941 ctgcgaagcc tgcggcgaaa ccggcggcca agcctgcggc gaaacccgta gccgcgaaac
     3001 ctgccgcaac caagcccgcc accgctcctg ctgcgaagcc tgcggcgact cccagcgccc
     3061 cggcagccgc ctccagcgct gcttcggcaa cgcctgccgc gggcagcaac ggcgccgccc
     3121 cgaccagcgc ctcctaaggc g
//
LOCUS       STYAROF      1235 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  S.typhimurium tyrosine sensitive 3-deoxy-d-arabino-heptulosonate
            7-phosphate synthase (aroF) gene, complete cds.
ACCESSION   M31302
KEYWORDS    3-deoxy-d-arabino-heptulosonate 7-phosphate synthase.
SOURCE      S.typhimurium (strain LT2) DNA, clone pGM59.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1235)
  AUTHORS   Muday,G.K. and Herrmann,K.M.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 165 to 1235)
  AUTHORS   Muday,G.K. and Herrmann,K.M.
  TITLE     Regulation of the Salmonella typhimurium aroF gene in Escherichia
            coli
  JOURNAL   J. Bacteriol. 172, 2259-2266 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.M.Herrmann, 13-JAN-1990.
            
              Author address: K.M.Herrmann
                Purdue University
                Biochemistry department
                West Lafayette IN, 47907
                (317) 494-1653
            
FEATURES       from  to/span     description
    pept        165     1235     3-deoxy-d-arabino-heptulosonate 7-phosphate
                                 synthase (E.C. 4.1.2.15)
    site          1       20     operator aroFo2
    site         52       71     operator aroFo1
    site         69      113     promoter
    site         75       94     TyrR box-1
BASE COUNT      309 a    286 c    343 g    297 t
ORIGIN      
        1 gtgtaaagtt tttgatacga aatgagggag attaatcgac gattgcgcat ggtgtaaagt
       61 ttattttacg aaatatggat tgaaatcttt actttttgtg gtatggttta aacatcctcg
      121 tggaggatca actatcgcaa acaagaaaag acaggatcgc catcatgcaa aaagacgcgc
      181 tgaataacgt acgtatcacc gatgaacagg tattaatgac gccggagcag cttaaagcgg
      241 cctttccctt gagcctggcg caggaagccc agatagcgca gtcccgggga atcatttctg
      301 acattattgc cgggcgcgat ccgcgtctct tggtggtatg cggtccttgt tctattcacg
      361 atcctgaaac cgctctggaa tatgcccgtc gatttaaagc ccttgccgca gaggtcagcg
      421 atagcctcta tctggtaatg cgcgtctatt ttgaaaagcc gcgaactacc gtcggctgga
      481 aagggctgat taacgatcct cacatggatg gctcatttga tgtggaagcc gggttgaaaa
      541 tagcgcgtca gctactggtg gaactggtga atatggggtt gccattggcg accgaagcgt
      601 tggatccgaa cagcccgcaa tacctgggcg atctgtttag ctggtcggcg ataggggcgc
      661 gcacaaccga atcgcaaacc caccgcgaaa tggcgtctgg tctttctatg ccggtcgcgt
      721 ttaaaaacgg cacggatggc agcctggcga cagcgattaa cgccatgcgc gccgctgcgc
      781 aacctcatcg ttttgttggc attaaccagg ccggtcaggt tgcgttattg caaacccagg
      841 gaaatccgca tggccatgtg attctgcgtg gcggcaaagc gccgaactat agcccggcag
      901 atgtcgctca gtgtgaaaaa gagatggaac aggcgggact acgtccttcg ctgatggtag
      961 attgcagtca tggtaactcc aataaagatt atcgccgcca gccagccgtt gccgaatctg
     1021 tggttgcgca gattaaagat ggcaatcgtt caatcattgg cttaatgatt gaaagtaata
     1081 ttcatgaggg taaccagtct tccgaacagc cgcgcagcga aatgaagtat ggcgtttccg
     1141 tcaccgatgc ttgtattagc tgggagatga ccgatgccct gttacgtgaa attcataaag
     1201 atttgagcgg ccagctggcg gtgcgcgtcg cataa
//
LOCUS       TRBANTAT    18598 bp ds-DNA             INV       17-JUL-1990
DEFINITION  T.brucei expression site variant specific antigen 4 - 7 genes,
            complete cds.
ACCESSION   M33720
KEYWORDS    expression site variant specific antigen.
SOURCE      T.brucei (strain EATRO 1125), cDNA to mRNA, clone AnTat 1.3A.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 18598)
  AUTHORS   Pays,E., Tebabi,P., Pays,A., Coquelet,H., Revelard,P.,
            Salmon,D. and Steinert,M.
  TITLE     The genes and transcripts of an antigen gene expression site from
            T.brucei
  JOURNAL   Cell 57, 835-845 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Pays, 20-JAN-1989.
FEATURES       from  to/span     description
    pept       1174     2196     expression site variant specific antigen 7
    pept       3528     4733     expression site variant specific antigen 6
    pept       8519     9772     expression site variant specific antigen 5
    pept      14450    18157     expression site variant specific antigen 4
    mRNA        986     2676     ESAG-7 mRNA
    mRNA       3483     5056     ESAG-6 mRNA
    mRNA       8260    11304     ESAG-5 mRNA
    trns       5305     5808     rime retroposon
BASE COUNT     5298 a   3468 c   4707 g   5125 t
ORIGIN      
        1 gcatgcgaag agcaggggtg caacggaaga gtgaaaggat tagaggaaat agtggaatta
       61 gcatgattgg agattgtgag ggttaggaaa aacgtgacga gggaaatggt aactggacat
      121 cgtgcgcgag gatggcgaag gcttgaagcg tgtagagagc gtgtgtaaat ggcccaaacg
      181 gtgtgggtgg ggtaataacg cagagtgctg aggtgtttga caagttcccg cgaggtgctg
      241 gggaggagga agcaagacag taaaatcgtc gtgagacgtg gctgtgctcg ggaatccgta
      301 caaatatact gaaaattatt gatggttgat atgcgtggga acgacgcagg atgatgctgc
      361 atcgccaaga tgcttcggga gaacaggaaa ataacgacga atcaacactg gtgagttgat
      421 actataaaaa tcgcgtgggc tttgctgctt cgctttcata aatgccacct gcaatgaatt
      481 tggagggtgg aggtgagctg ccctgtgtat gggtgggtat tgcagatgca gcggcgcttt
      541 tgggtgattt ttggtttaca gtttgctgtt gttgggaagg ttggtggtgg caacaaaatt
      601 ggagaaaatg atttcatctg gaaacctact tcaaaacccc ataaacaatc ggacctcgaa
      661 gtgcaagtgt ctcccagaac gtaaacaagg ccttgaagac gaccagcaga tgaagcacag
      721 caacagcgac agaggtggac tgagacgaaa gtataccata tgactatgcc acaagaattg
      781 atgggtgatt ctgccaacca cggacgcagt caaaaacata gaaggagctg tagtgaatat
      841 aaaaccactc atattttcta caaatattag agctttatta cttaagaacc tgttttagca
      901 ccaaggttgc gtgtgtattg ccgttgctgt tacttattgt tgagttggta tttgaggaaa
      961 atgttttttc aggtgcgtcg ttgttatata cggctgtgag ttggtgatgt ttatttgaat
     1021 gatagcttga taggataaga tgaaataata ctaagcatgt attcttctgg cgtgtcgccg
     1081 aaatggtaat ctatgttacc catagcacga aaaaattttt ttactcagta acgaagcgga
     1141 aagttttaaa aagcataatt taaaaaagtg ataatgatga agttttggtt tgtgctgttg
     1201 gctcttttgg gaaaagaaac acatgcatat tatgaaaata aaaggaatgc attaaatgca
     1261 accgccgcta ataaagtgtg tgggctatcg acctatctta aaggaatagc gcacagagta
     1321 aacagcgaga gtgccgtggt tacggaaaaa ctatcagatt tgaaaatgag aagcatccag
     1381 ttgcagttat cagtaatgcg gaacagagtg ccttctggcg agcaggattg taaagacatc
     1441 aggacactct tgaaaacagt attgaggaat gagtttacat tccagcagga gttggaggaa
     1501 atgaggaacg catccgcgtt agcagcagct gcagctggga tagcagctgg aagactggaa
     1561 gagtggattt ttgtatttgc acaggcagca ggcgggtcat cacagttttg cataagtgtg
     1621 gggaagcata tcccggctga gcacggcaac ttgcaagaat gttttgatgg aataattgga
     1681 cctgaaaccc tttacaaaat tgaggattca cgtgtaaaag agtcggcgca gaaaagcttg
     1741 caactccatg aagtgttatc atccatttcc ttcagtagct tgggtgcgga aagtattgtt
     1801 gagcaaggag agaacagagg atgtaaccta atgcggacgg ctgacggagg tttgctgaaa
     1861 gatatttgtt tgaattgcaa cttcacatgg ggtggtggag tgatgaattt cggatcctgt
     1921 gtggcgggga acctaaaaat aaaaggggga gaatacggtg atgtcagttc ccacgatgtg
     1981 gtacggtgga ccgaagatcc cagtaaagtg agcatattta aggatgtcat ccgcctgttt
     2041 gcgcgcttcc aagaagcaaa aaatgcagtg atgaataaaa taaaaactac tgtggatgaa
     2101 ttggcgaaat gcatagggca gaaagaggtt gagctcacta atgatcagct ttacgaggaa
     2161 ttcgaggcaa tacaaaagta cctagggtct ctgtgaaatg gagtaaaagc gaattcaact
     2221 atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg gcagcagtac
     2281 atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga gtgctgtgaa
     2341 ggaaggatgc gacagaagct gcgctgctta gcgtgaaaga ttatggtaat ggagggttgt
     2401 gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg ggcaaaagta
     2461 acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa ggaaagatgt
     2521 gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg agatgtaagg
     2581 ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa tttagttaca
     2641 acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt ctactgttta
     2701 atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc gaatttcact
     2761 tctccagaat ctttatttta gtgcgctata gatgatagtc caaagagaag gaattacaat
     2821 tattataacc catccaaagc tatgtcagag actcctcttc tccgagtgat tcttttctat
     2881 gtcacttgtg gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg aaatggggcg
     2941 tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt atgatgcata
     3001 gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta ggtggaagcg
     3061 gctaatgatt tgtcggcacc accaacagag gaagttaatt gttgacgtag ccgtttgtga
     3121 atgagataga aggttattgg gtggaacatt atcttggaaa gagtttttct tccactaaaa
     3181 atgtgtagag tggtggtttg gagaaaatgg tttagatatt gaggaggaaa caaggatttg
     3241 aggtgcaagt tgatgcgctg taagatttgt attgccgttg ctgttactta ttgttgagtt
     3301 ggtatttgag gaaaatgttt tttcaggtgt gtcgttgtta tatacggctg tgagttggtg
     3361 atgtttattt gaatgatagc ttgataggat aagatgaaat aatactaagc atgtattctt
     3421 ctggcgtgtc gccgaaatgg taatctatgt tacccatagc acgaaaaaat ttttttactc
     3481 agtaacgaag cggaaagttt taaaaagcat aatttaaaaa agtgataatg atgaagtttt
     3541 ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa aataaaagga
     3601 atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgacctat cttaaaggaa
     3661 tagcgcacag agtaaacagc gagagtgccg tggttacgga aaaactatca gatttgaaaa
     3721 tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct ggcgagcagg
     3781 attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt acattccagc
     3841 aggagttgga ggaaatgagg aacgcatccg cgttagcagc agctgcagct gggatagcag
     3901 ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg tcatcacagt
     3961 tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa gaatgttttg
     4021 atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta aaagagtcgg
     4081 cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat agcttgggtg
     4141 cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga acagatactg
     4201 acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt ggtggagtga
     4261 tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa tacggtgatg
     4321 ctacagagta cggtgcagta cggtggaccg aagatcccag taaagtgagc atatttaagg
     4381 atgtcatccg cctgtttgcg cgcttccaag aagcaaaaaa tgaagtgatg aataaaataa
     4441 aaactactgt ggatgaattg gcgaaatgca tagggcagaa agaggttgag ctcactgatg
     4501 atcagcttta cgaggaattt atctgggaga ccatacacag attggagctg tcaaagagag
     4561 tgagtgaaca actctcttta ggtgaagaag aggagacgat cctgaaatct aactatactg
     4621 cagaaccggt ccgtggacct ttcacggtag cggggtccaa cgcggcagca gtacatttga
     4681 gtgtttctac cgctgcactt tgtttttcag ttttattgtt gggagtgctg tgaaggaagg
     4741 atgcgacaga agctgcgctg cttagcgtga aagattatgg taatggaggg ttgtgaaaga
     4801 ttgggggaac aaaaacctat ttcttttatt tgggggaaca aatgggcaaa agtaacgtaa
     4861 gtttccagtg ggagtggtat gtgtgtgtgt atggggctgg ctaaggaaag atgtgagttc
     4921 ggcatgtggt atgtacaagc tacgaaaacg tgtgaaacaa aacgagatgt aaggggaaaa
     4981 tgtaacaacc aactatgtta aatttcagga gactattttt caaatttagt tacaacaaag
     5041 taaatgtcaa ataatgccaa ctataaagga aaactagact aattctactg tttaatgtga
     5101 cgagaattgt ggcgtgaaac ctagtaactg cttatcatgg cgtcgaattt cacttctcca
     5161 gaatctttat tttagtgcgc tatagatgat agtccaaaga gaaggaatta caattattat
     5221 aacccatcca aagctatgtc agagactcct cttctccgag tgattctttt ctatgtctct
     5281 tgtggaaggt tgatattttt gctaccccct ggcgatgccg gccacctcaa cgtggtgcca
     5341 gggtccagtg ccccgtatca ttgggggaag ccaagagcca gcagcgttcc tttcatgggg
     5401 aacactgctg tgctccggct acggcatcat acagcacagg gatcagcagc gtcttgctgg
     5461 gacaccgttt ttcatttgtc ggtccctggg cacgtgccag cgtgccatca gcagtatcat
     5521 ccgcgctaag atgctgctgt ccggtgatgt ggacctccaa aaaaggattg ccatttggca
     5581 tcttttggag agtccagggt gggaggcttc tcgccccatc tgctgtattc cgttcaactg
     5641 cggagctaca acaaaaaatt atagagggtg tgttaggatg aatgaaaaag ggagactctg
     5701 ccacagtcgc cagaccgata gcatctcagg gctctacggt gttggctgat ggccgcgcca
     5761 gaggggggga aactctcacg aaggcacgaa gaaaattcaa aaaaaaattt ttgctaccac
     5821 aggctgtaag gataacgctg gagatggagc cgactgtggt ttttcttgtg aatgcgtatc
     5881 ttcgtaaccg gttgttaaga agggatcgga cgaaggaatg tgtctgagta ggaggtaatg
     5941 tcgatgtttt ctttccattg gatgtagtaa taatctccaa aatttgttgg aaacaatata
     6001 atattattgc aataaatact tgaaaagtta aattattgat ggattgactt gagtatcatg
     6061 attccaatga acactctcat gatgcttaga atagcaaacg tatgaacaat caacgtagcg
     6121 tcatgtagac atggacattt tttgtaatga tacgactgct attgcacatc tgaaaatttc
     6181 tatgacggac cagtgccgtt aatcaccgta caattgaatt caggatgaaa aatgataaat
     6241 cagtgctcta atgtatgatg cagctgtcaa ggcaatgttt ctgcacaaaa agagtttaac
     6301 acactgccat ttgcatgcga ataaaacaat gagtatatga agcggaggat cgcatggcaa
     6361 tgataaaatc gaacactgtg gctgcgagac gcagggccac ccggaagcac cgggtaacaa
     6421 agcgagttca ggagtcgtcg aaagtggcaa aagtcagcac caaaccaagt tgctttttta
     6481 aaaagatcag caaacgtatt tctcactgac ttatacagca ttcaaaggaa atgaacttag
     6541 cagcaggaca agggtgcaaa ctaaggcatc gattctgtat ggctgcaaaa gtgaagacat
     6601 caagcaaaca aaggtaactt atttaccctt ataggagcag tggaaaaagc agcggactcg
     6661 tatggagata acctgacgaa aatactacca gttttagtca caggaagcga tatcctgtca
     6721 aaatgccggc agccactcaa tggcggtcaa aacttgatta aacagttgaa aaaaacagcg
     6781 gaatcaaaag gaaccttgcc cgtgctcacg gtggtagata aagcaacgaa aaatgtaaag
     6841 gtagaattta acggtaacaa ttcgaacaaa accgtactat cgtgcaaaaa tatttcagca
     6901 caaaatcccg acaaactttg cggcgacaac acagcagaca acatggccaa gtgcacgaaa
     6961 gcttccaaag cagcttttgg cgctgatggt gtatatttca aaagaaaaaa gggcaaacct
     7021 tgactcagca gctataataa agtcagctag aaccggcgaa acagcggaag cgctgagcaa
     7081 acttaaaaca ctggacatta tcggtgattt tatattgcgt ttatttgtcg aaaagctcat
     7141 ggacatgacc gtggatgagt acacgactca aagagagact aacgacaaaa aacgctgcag
     7201 tggcaaaagt agcgtgcctg tacaggggac ggagaacttg ataataatgg taatattaat
     7261 cttagaagaa tgttgtgagt atgtgtcact tttacagtga tagttgtgaa atattcggaa
     7321 tgacagttgt aatggaggtg aaagaagctg ttaggttgaa tttggggaac catctatgta
     7381 tacaaagaag cacaagggtt gtgatagctt atttatatga tggagaagcc atgcatggta
     7441 aaaaaaaaca acaagtgata tgtcggtgtt tgtggctaat gtagcataaa gactaacagc
     7501 tacagcagtg catcaaccga tggagagacg ctttcagaat tttcaaatgt aagaaagtca
     7561 gtaaattaca aagaagtggt tgagttgata ctcagaaaag tgggactaac aaactgttga
     7621 tgaatgaagg tgatgccata cgtaccaggt acacaggtca gcaatatgat gctcactgaa
     7681 aaagagagat ggtgtgctac tggcgggatg tcgctggaaa agtcttagca agtgtagttg
     7741 cagcaccgag cacacacact catgggagtt atcaaagtat gaaaacgcaa aggaagctga
     7801 tgtgtgtgtg tgtgtttttg tgtgtgtttg caggaaacaa atgtatcagc ttcgacatct
     7861 ttccgcaaga agtagaagtt tcagttgtgg tcactgacgc attacgtgga ttgttgggtt
     7921 gaaacaacac aaatgtggca accatttagg atgttttggc gctttggcga cggttgtact
     7981 gcagtttgta tgccccttca agaaacaagg aatggtgtga aagcgttgtt tttttttaag
     8041 tcaagtatgt gcattcaaga aatcatatca cagtgttacg cagtgctttg tcgtcatggg
     8101 aggaccttcc tgcattgaga gtgcatgtaa ggaaacggca acaggatgct atggctccga
     8161 atttttttta agtatctgct tttcagacca cttgcgttga tcgacttcag ttatgttaca
     8221 ctccgcacat cttttttgct atgttttttt ctactttagt tgcacttagt gtagccaaca
     8281 ttaacttaaa gtacaaactc tatcctaagc aggattcggt gttatttcta ttcgacgatg
     8341 tcattggtat cgagcgcacc cgttttggtg ttcatttccg ttgtaacact tctgtgttgt
     8401 atcatcggct ctgggcaacg gtctagtagt ccctttacgg aaccggcact caaaccaggg
     8461 ggcattaagg ttgctattca agaggcggct gttacgccat tgttaccagt acttgcgtat
     8521 gagttcgaaa ggtttatgga gaatatcacc attcccgagc aaaagggtta accgtgtgag
     8581 tgtagaagag acttattttc gtaacgtgac ggtcggaagc gcgacagtta aattcgaaga
     8641 accaaataaa atcattttga agttttggaa cgtgagcgct actgtgccgt ttacgcggtt
     8701 tgtataccac tcgttttggt gttacctgta tccttgcagt gggtcggcac aggcggaaat
     8761 acgcaacggc agtgtggcac tgtggctcga cgtgtccgca ggaagggggg gactactaga
     8821 catacatgta gggagttcgg aaattggcat gcgtgaccca ctgataactc ttatcggtga
     8881 agggaaatca aagatgccta agtggcttgg cgggagagtt aaagatatgt atgataagga
     8941 tgtgcttccc aagcttgggc atcatataat tactgcggtg aatcaaattt tagccaacaa
     9001 agcaaaggag atatttcata tgtttccaat tgtctttgtc aacagttcta aaattgaata
     9061 tggacaaatg agacttgaac tggtcgtgct tccagatgcc gcagacaaat taatgcttac
     9121 ggagaatgtt tttttcccgc ggcacgcttt tccaaacttt cctgttgctg ttgtttcttc
     9181 gtttacggct ttgaataata tgcttcgttt gatgattaaa agtggccact tggtggtgcg
     9241 tgtgcccttt ccgctgaggt atgttatctc cagcaacgcg ctcagcgtca gttggaccgt
     9301 cttttgctct ggatgtgcct ctgaggctac attcgaacta aaaacagccc cgtggttgaa
     9361 gtccctaaat aagaaactgt ttacatttaa ctatcgggat gtcgaagttg ccgtggacct
     9421 gctgccaagg ggtggtgccc ccatttcctt gtttagtatg ttaatgaatg tgtctgccca
     9481 ggcggcgcat atcgctcttg ttgacggtgc cacccatgcg aatttggatt ctgttgatac
     9541 taatgtgagt gtcacatcct cacgtatcga tggcctggac tcaagtacaa tgaacacaaa
     9601 aattcgtgat ttgatcaact tgtcttggat caacctgaat gttacatata catttcctgc
     9661 tcccttcgac ttgtgtacga aacacgtgaa tattacaagt gaatgctatg tagcgggatt
     9721 taaccttgtg cgagctcttg gttcactcag tatactccct cacctccgat gaaggattgc
     9781 gacattcctg gtcactttgt aaaacacact tccatgcgct tatgtggaag ggggtaacac
     9841 tcaaacctat cttgctttcc cctgtttacg cacaacttca catacgcact ttctctttta
     9901 cgagatgttt gcatgccgca acaaccaccc tgcgcaatcg acgagaagct atccgtgttt
     9961 acgggtgatc cttcgaatta tggcggagga gtaaaaacct catgatatgt gaagtgctgt
    10021 ggctgaagaa ggaaataaga taaaaagggt atttttcatt gacccgacca ttactggttt
    10081 tgactttttg cccgaggcga gaaggcaaat ccattgttga ttggctcatg aatgtttccg
    10141 tgactatacg cgccaagcct tggatatgtc gtgggctttc ggcatatgtt gataatataa
    10201 cgactgccag ttggctgagt gacggctcaa agtgttagag gcgagtgtca atggaagata
    10261 tatctattga tttggtaaat aatattggat tcgagattaa ctgttcaagt gtgctacaac
    10321 accagatagc atgttcgtag tgtaggggtg gcaatagaag tgatggattt atcacgacta
    10381 attgttctat tggcaacgct tgtgtctcgt gactgctatt ggttgattgc tgatgtttcg
    10441 atttgagttg tttcgtggcg gcacgaaaag tatcatcatg tgaagtaggt taaggggaaa
    10501 agaaacagat atataaagag cctttcgtgt tgctcccaag gaattgtata aatctctgag
    10561 cgagatttgt gctcggtaat gggttcaaga gtgtgtatat acaaatacta tattaagagt
    10621 ggtgattgag accaagtgga gggtcgaaag agatccactt aaagccatgg aaagtgtgcg
    10681 gttgtagagg gtagaaaact aagaaaaggc aaatgataaa tagtattata ggaaatgcag
    10741 agtgatagca agagagaggg agatggagta accaatggtt taaaaacaag catgacgatg
    10801 ggcagtatta tacccataag gatcaaagag ggtcaaaaaa atacaaaatg ggttcagtgc
    10861 acaagagatg tgaagatgct acaactgata cggagaagcc tagaatgtga aaattcagaa
    10921 aataatacta tatgaaaata gtgagagtga caactcacaa tacaaaacaa actgatacgg
    10981 gagaagatta tgctaaacat aaaagtgaaa aaagctagaa tataagctga taaaacacca
    11041 taaactacat tgatttgagt ttcatgatat aaaacgtgga tgctctattc agcagaggat
    11101 tctagtaaat tttttaatat atcttaaagt aatagtgaac gtagaataaa tttctgttcc
    11161 aatacacaga cgagaaaaat aaagaattga acgggtagca aatttaagta acttgcgaca
    11221 atatttttgc tgaagctaac ttctaaaaca aaaatgcaaa tacgtgttca aaaatatagt
    11281 gaggaggata atcttttaat tacaaacgca tacatcatga taatttcaag gcattgtttg
    11341 catatgtaaa cattgcttta ttaactgagt gcagttttct ggtaacctat ttcttttcat
    11401 agttttcttt cataaccctc tctaatattt taaggtatat taggaaatga aatttgttac
    11461 cttgtgaact ttactattac caattatatc acatttagtg tgtttaagtg caatatacct
    11521 tcaagatgaa gaagtaagtg acatgacaaa tatgacacat aatggtacta agaaacccag
    11581 agactttgtg gttgtcgttc aaactcatgc aacacaagga tggtgtaggc tcatattgag
    11641 tgcgaaagta agccatgtgg aaattattac aattgaaatg ggaggtttgt acagtcatgt
    11701 caaaaggcct aaatgggtta tggagttttt agagaaacag aatgcaagtg atgaggatat
    11761 tttaatagct aaggatggat ctgatgttat agttggtgat ggaaataagt atgagagttc
    11821 tgtggaatat tttgtgtaaa aaagcaccct acagtaagga caaattttat gaagaggata
    11881 ttcagaaaaa ggatataggg taatgttatt gatttttgag atggccacaa aagtggagga
    11941 aaagaaaagt ctaccaactc tactaaaatg gcctttaact caaaaagcat acttaaatgg
    12001 aggtggctta attgccagag tttgggcatt tagaaggttt gaatatgcat ttggagagtt
    12061 actaaaggga agtgaggagt ggtggtcaga gcggagtatt tacaaacctt tgttaatatg
    12121 gagtgcaatt caggaagaag ctgttggaca gagatttgtt ttgaagagag gaatgattgg
    12181 attggattac gaagagagat ttttttctat agaaagtagt ggggtaattg gagaagcccc
    12241 attcattcat tttcctggac agcctattgc ttgggaggag aaagcgagac tgattgtgaa
    12301 aaatcttagt tgatataaaa ggttgaaagg aagtgaattt aattcgaagg atatggggaa
    12361 aagtagaaac ttatgtagta gatggagaga attgggaatt tcgatatgaa agaatttgtg
    12421 gggacgcggt caaggagaat gacttgttta aggcaaagaa actgtaaggg atacaaaact
    12481 aaagattttc ttgcggctga gtcagtagtc taagcaaatg tgggaccaac acgcatggcc
    12541 aagatggccg cacaagtcta tgttgacatg aaaatcctgg ggcagcaaca aagagaagct
    12601 ggaagacaaa cctcaggggt accctaacta ttcggctgaa atggtatgtc gtgacggcaa
    12661 tatcgatatt ggtttcgggg gtggtaacac gatggacgca aaatttagaa gcttctatta
    12721 cctctccata aaccatgatc tcaaagctag agaagcacaa aacaataaca caactagtga
    12781 cactatgcgt tttttatgca cctgcacaga agaagtaacc gacgcttgga ccgaagatgc
    12841 gaatacatat tcggcatctg cagtataaaa ggtaattcag ctgctagaag gaaaaaattc
    12901 ctgaagaccc cgctgtgcca gagatatgta aacaactgtt tgaaatacat gcactcaata
    12961 tgtcactatc agacaagacc tggcaggaca agttcataaa ggctcaacga agagccactg
    13021 cagtggggtg acgaagccaa caaacataag accactaata cggagtggca cgaagtgtgg
    13081 cagcgctggg tctaagccag gcaaagacta aaaaataacg aatactgagg gcgataaagt
    13141 aaggaaaaat gccgcctcta aatggcaaag ttcaaaagcg gcacaggccc gagtcaagct
    13201 aaactctttg ctgatccatg cggaagcctt gaacgctgag ctggaaaaaa agggcaaata
    13261 atggctaacg gcaaaaaaca atagaagtgg acgttcttgc agccctctac ggaacgccca
    13321 cggcaaaagc agcgtttggg agcagtggct tattgtcatc atcatttttt tgttgttttc
    13381 gccactagtg gttttctttg tttgcgtatt tgatatctcc ttcgtcacta ttcttatctt
    13441 tatcgtctgg aatatgggtg agatatcatt cccggtgtgt tttgcagtca tatggcgtaa
    13501 catccacctg tgttgatatt ctttacagca tatcttacat gcttctcttt ccaacagtgc
    13561 ttcctgtagt ggtgtgcttg gctccccttt actcttctcc ggtatgcact tatatctggt
    13621 gagccaggtt ttgcttttta tagacctctt ggaagcagag acatttgtat gcttctttta
    13681 ctttctcatg ttcaacttcg ctgcttattt tcctgcattt ccgtttgttg tgtcacttca
    13741 gtttgctttt ctacgatgtt taaggcgtat atgttcctag tcaactagcc atacgtggca
    13801 tactgctttg ggcaacattg ccacttgtag acttttttac cgaattatct tcttctcgct
    13861 tccttcctct tttgacgctc tgtccttact actggagttt taggaggctg ttgtggcatg
    13921 tgccttgtgc ttcttttatg accaacaagt gttattttta cagtcgtttt tgcactggtc
    13981 gtgtgtggcg aaaaatacgc ccccgctcct ccaaccaatt agtggcaaag gggcggctcc
    14041 gttgaggcag cactgcttgt cctcttaatg gtacattgtt gttttaccta tcgctggttc
    14101 actccaccgc acgcggtgca taccttggga gttctgtact gttttggggg tggttttgtg
    14161 cattgtaaca gccgcctctg acacaacaag cacagtttca cgcctgcgtt actggcattt
    14221 tatgcctgtg gtggaacaac gtgtgtgttg acgtccgtaa atcttaatat ggcttgttgc
    14281 gtgctaccgt ttatattttt cccccttcga tatgggcatc ccgttcttgt cgttacgtgg
    14341 ttcccagttt cacctaacaa cccttgccct ctcttaccca gcagccgctc cctttcgttc
    14401 gttccagtgt aacaaccact ttcacgcgta gactatcaaa atcactgcga tgaatatgct
    14461 tcacttgagc gaccgcaatg cctcactcgc accgagtggc ggggaacatt cccttccgac
    14521 gggaggagct gtgtgccgtg acgccatgga catacttccc gtgatactga gagctccggt
    14581 cgcgttgtta cttctgctcg tcgtcctgcc gcaattgtca gtgggcgccg aggctaatgt
    14641 gacagtcaag gttctctccg ccacgtggaa ctggtacatg ccgcggaagt atgtaaccgc
    14701 gattaatgct ggcttcaacg cctcactgaa gtcacgaaac tggacagtcg ctggaagcgt
    14761 caacgttcag gtagtttacc ccagcaacct tgacctgatg cccgaagatt tcattaaaaa
    14821 gcagttggaa ttggaaactg accagaataa gattgttata gtgtatggac ccctcggtga
    14881 caaaagtgta atgcactcga ttccacattt gatgaatcac agagtagtgg cctttgggtt
    14941 gataactggg tccactttta tccgacaatg gaatccgtac ctttactttc tgcgtgccga
    15001 ccccgctgct gaaacattgg ttctcattcg gtattcgcta tgccagttgc gcgtgctgcg
    15061 cttgggcttc atgtaccttc agggtgttca ttatggcgac gaggagtatg cattgacagt
    15121 gaacgtgatg tctcggatgg gttatgaact gcatggtgtg ttcaccgtga tgagtccaga
    15181 cggtaaaccc gctcctgatg ctgagtttaa agaggtgttt gagcgcttcg ccactgccct
    15241 tcctcaggcc attattgtct ttggtgcgcc agtggatgat accgcgaagt tcttgatgat
    15301 gatggcagtg gatgagcgaa tcgctcggtc gtatattctt agtccttcat ctgtccagtt
    15361 gtcactcata gaaatgtggc aactcgctct ggaagctgca ggggcctcat tcgcgcccgg
    15421 acagctactt ttcacaggca ccaatccgct tgcgaaggac agccagtaca cagcaatcaa
    15481 gcgttttcaa gaagttatga gtgaatacct aaaggctcac gtcggcgaaa ccaacatcac
    15541 tgaagcagat tatttcctga cccacgatct tgagggagag ttgatggtgt atgggtggat
    15601 ttctggggag gtgttatctc aggcacttag caaccttgaa tggctcaagg accgcgctac
    15661 atttgtcagg tcgctatacg accaacggcg ctacgtcatc aatgacattg tgatcggtga
    15721 ctatggcggt acgtgtgagg gggacgcggc caagcatggc gccacctgtg agtgtaacca
    15781 aggcagcaag gcggtgtatg tgaaagagat gttggaaaat gggcagaaga cgtcggtgcg
    15841 aagtggcttc acagttctga aagcttcgct gtgctatacg gattcttcgg agcttcacgg
    15901 gccactggat ggactcgtcg ttttcatgaa agacgacgac atagcgtcga aagcagctgc
    15961 gctgtggcag aaggggacct cccatcttgt cggcaagggt gatttgggat actcggacag
    16021 gttctttctg cacgcattta atactaccat agctgaggca gcgaacgatc ttcggcgtga
    16081 ccaaggtgaa agaatcgtaa ccgctgtgtt tggccccgtg accgaggcga tgttggacac
    16141 gccgaatata acctttatcg atcctctgga acttaaacca cggctgaata agttcagaag
    16201 gaatgtgatt catctctcgc ccaccctgga acaacaactt tacgtccttt catcgtacct
    16261 cgcgggcgcc ggggtcggta atgttgatgc cgtgatctgc agcaacgaag cggatggaat
    16321 tgcagatttt ctgcggagtt cactaactga attcgctgtc tcgctacgtt ccgctgtcat
    16381 acgcgaagat ggggaagatg ttggtaaata ccttccgatg agtgggaccg tgtttgtcat
    16441 tggcctttcc gttccagatg tgaaggaaat cgcgaggaaa cttgaagaac gcaacgattt
    16501 gcgtgtgatt gttctcttcg gggaattttc attcctgtat gatctgtttg ccacggccct
    16561 caataacacc gcaggcgccg cacgtcttgt gtttgccacc agcttgccac attggggtga
    16621 tactgaaacc tcatcgaaga ctgctcaact gttccatgat gttgaaaaag actcacggtt
    16681 atggacaccc ctatcagttc tggcattcgc aaccggtcga ttgatgcggg taatacttct
    16741 tcatgtagaa gaaatgagtc cagaaacgtt ggtaaatttc ttctataccg attcttccat
    16801 cgtctccgat gacatgcgct acggtgtatt cgacgacaca aaatgcgtcg acaccgcaaa
    16861 taaactttcg aagaatggtt gtgcctcaaa ctatggtgca acgcagatat cggtgtggtc
    16921 tatggcccgt gctctgaacg cttctattcc tccgcttaca aatcccatga caccgtcaat
    16981 gactttcaga aactctaacg cgggaaggat ctccggggcg tcgctggtgg gcatcatcat
    17041 cggtggtgcc cttgctttgt ttcttgtggt ggctctgggc gtggttccct actttttcct
    17101 gcgcaacacc gtgataacaa tctgcaccaa agatgacaga cccgtaacgc taatatttac
    17161 tgacattgag agcagtacgg cgttgtgggc tgcacaccct gaggtgatgc ctgatgccgt
    17221 tgcgacacat caccgcttaa ttcgcacatt gatttccaag tatgaatgct acgaagtcaa
    17281 gactgttgga gattctttta tgatagcgag taagagtcct ttcgctgccg tccaactcgc
    17341 acaggaacta cagctgtgtt tcttgcacca tgactgggga acaaatgcga ttgatgaatc
    17401 ctaccagcag tttgagcagc agcgtgcgga agatgacagt gattacacac cgccaactgc
    17461 tcggttggac cctaaagtgt acagtcgttt gtggaatggc cttcgagtgc gtgttggaat
    17521 ccacaccggg ctgtgcgaca tccgacgcga tgaagtaacg aagggatatg actattatgg
    17581 acgtacatcc aatatggcag caaggacgga gagtgtagca aatggtggtc aggtgctgat
    17641 gacgcatgcg gcgtacatgt cgctgtcagc tgaggagcgt cagcaaattg atgtcactgc
    17701 acttggtgat gtgccgctac gtggtgtgcc aaagcctgtg gaaatgtacc ggttgaatgc
    17761 cgtgcctggt cgtacctttt ccgttctccg tcttgagctt gagcttctta acgatgatga
    17821 ggatcagact acaacttcct gtagtgacca cagttcttca cggacagatc tgagtgtagc
    17881 tgcacaaact attgctgctt cccttcagtc gctcttgggt accttcaccc ctgctcagcg
    17941 ccagaaggca ttaataccct tttgcgaacg gtggcgtgtt ccgcttcctc aaaaggtcgg
    18001 gaacgtgtgg gacgatgacg gttgtcagga agttgtacgg cgtgttgctg caaaggtggg
    18061 ccgtgttatg gattttggaa cgaggaagcc ctccagcagt gtgacatcgt ggaaaggggt
    18121 ggaagtgtct tctcaggtgg aggagcggct gctataatgc ttgcttcatc ctcaagttcc
    18181 tcctgcgttg atgggcattg tgttaccgta caattgattg acttggaaaa tgacagcgct
    18241 accacctaat attaccctca tatcattgct gtatttgcgc tctgctattg aagtgaatga
    18301 atgagttatt ttatatgatg tgtgcaatct tttctttata ttcaattcgt ataaagtgga
    18361 cattataact ttccctcccc attcttgtgc ctgaaatgat cttctcaaac ggtaaatgct
    18421 ttaaatacat ttgtacatga agatggtttt tgtttctgac atggctactc tccttttgag
    18481 aactgtgttt gaagattgta cattttcttg tgtttgatgt gcctttgaat gtgagtatcc
    18541 tcgccacttt tgataatctc cctcgtttaa ataatatatt tattgactga ataagctt
//
LOCUS       JS3PRSBS       81 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  Plasmid pJS37 promoter region DNA.
ACCESSION   M34567
KEYWORDS    .
SOURCE      Plasmid pJS37 DNA, from B.subtilis (strain MB11).
  ORGANISM  Plasmid pJS37
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 81)
  AUTHORS   Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M.
  TITLE     Comparative expression of the pC194 cat gene in Streptococcus
            pneumoniae, Bacillus subtilis and Escherichia coli
  JOURNAL   Gene 86, 71-79 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         61       81     leader peptide (put.)
    mRNA         37  >    81     cat mRNA
    mRNA         64  >    81     cat mRNA
    binding      46       51     ribosomal binding site (put.)
    signal       34       62     promoter region 1
    signal        5       34     promoter region 2
    signal       57       62     P1 -10 region
    signal       34       39     P1 -35 region
    signal       29       34     P2 -10 region
    signal        5       10     P2 -35 region
BASE COUNT       36 a      5 c     11 g     29 t
ORIGIN      
        1 atttttaaaa tatatattta tgttacagta atattgactt ttaaaaaagg attgattcta
       61 atgaagaaag cagacaagta a
//
LOCUS       JS3PRSEC       62 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  Plasmid pJS37 promoter region DNA.
ACCESSION   M34566
KEYWORDS    .
SOURCE      Plasmid pJS37 DNA, from E.coli (strain C600).
  ORGANISM  Plasmid pJS37
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 62)
  AUTHORS   Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M.
  TITLE     Comparative expression of the pC194 cat gene in Streptococcus
            pneumoniae, Bacillus subtilis and Escherichia coli
  JOURNAL   Gene 86, 71-79 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA         52  >    62     cat mRNA
    signal       21       43     promoter region PT
    signal       43       48     PT -10 region
    signal       21       26     PT -35 region
BASE COUNT       23 a      3 c     14 g     22 t
ORIGIN      
        1 ccgatgatga agaaaagaat ttgaaactta gtttatatgt ggtaaaatgt tttaattggg
       61 at
//
LOCUS       JS3PRSSP      109 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  Plasmid pJS37 promoter region DNA.
ACCESSION   M34565
KEYWORDS    .
SOURCE      Plasmid pJS37 DNA, from S.pneumoniae (strain 708).
  ORGANISM  Plasmid pJS37
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 109)
  AUTHORS   Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M.
  TITLE     Comparative expression of the pC194 cat gene in Streptococcus
            pneumoniae, Bacillus subtilis and Escherichia coli
  JOURNAL   Gene 86, 71-79 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         33       53     leader peptide (put.)
    signal  <     1        6     promoter region 1
    signal        6       34     promoter region 2
    binding      18       23     ribosomal binding site (put.)
BASE COUNT       47 a     13 c     16 g     33 t
ORIGIN      
        1 taatattgac ttttaaaaaa ggattgattc taatgaagaa agcagacaag taagcctcct
       61 aaattcactt tagataaaaa tttaggaggc atatcaaatg aactttaat
//
LOCUS       HUMTNC1      1085 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human fast skeletal muscle troponin C gene, exon 1.
ACCESSION   M33771 J05502
KEYWORDS    troponin C.
SEGMENT     1 of 2
SOURCE      Human HeLa cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1085)
  AUTHORS   Gahlmann,R. and Kedes,L.
  TITLE     Cloning, structural analysis and expression of the human
            fast-twitch skeletal muscle troponin C gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Gahlmann, 20-APR-1990.
FEATURES       from  to/span     description
    pept       1033  +  1035     troponin C, exon 1
    pre-msg     968  >  1085     troponin C mRNA and introns (alt.)
    pre-msg     969  >  1085     troponin C mRNA and introns (alt.)
    IVS        1036  >  1085     troponin C intron A
BASE COUNT      266 a    276 c    306 g    237 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct
       61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc
      121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa
      181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa
      241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc
      301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac
      361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc
      421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca
      481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga
      541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt
      601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc
      661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga
      721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc
      781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt
      841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt
      901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag
      961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg
     1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg
     1081 tggac
//
LOCUS       HUMTNC2      1773 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human fast skeletal muscle troponin C gene, exons 2-6.
ACCESSION   M33772 J05502
KEYWORDS    troponin C.
SEGMENT     2 of 2
SOURCE      Human HeLa cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1773)
  AUTHORS   Gahlmann,R. and Kedes,L.
  TITLE     Cloning, structural analysis and expression of the human
            fast-twitch skeletal muscle troponin C gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Gahlmann, 20-APR-1990.
FEATURES       from  to/span     description
    pept    +    51      102     troponin C, exon 2
                247      390     troponin C, exon 3
                479      593     troponin C, exon 4
                758      894     troponin C, exon 5
               1506     1537     troponin C, exon 6
    pre-msg <     1     1668     troponin C mRNA and introns
    IVS     <     1       50     troponin C intron A
    IVS         103      246     troponin C intron B
    IVS         391      478     troponin C intron C
    IVS         594      757     troponin C intron D
    IVS         895     1505     troponin C intron E
BASE COUNT      358 a    455 c    628 g    331 t      1 others
ORIGIN      About 2.5 kb after segment 1.
        1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc
       61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag
      121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg
      181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc
      241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg
      301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg
      361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat
      421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc
      481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac
      541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct
      601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag
      661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc
      721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc
      781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg
      841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg
      901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct
      961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc
     1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta
     1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat
     1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac
     1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc
     1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa
     1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa
     1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg
     1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc
     1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca
     1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg
     1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc
     1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc
     1741 ggctggcacc acccggagca cccttgggga tcc
//
LOCUS       HUMMHDQ3L     967 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment,
            long terminal repeat region.
ACCESSION   M33841
KEYWORDS    major histocompatibility complex.
SOURCE      Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 967)
  AUTHORS   Kambhu,S., Falldorf,P. and Lee,J.S.
  TITLE     Endogenous retroviral long terminal repeats (LTR) within the HLA DQ
            locus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.S.Lee, 25-APR-1990.
FEATURES       from  to/span     description
    rpt           1        7     inverted repeat A
    rpt         961      967     inverted repeat B
    rpt         560      810     R region
    signal      792      797     poly-A signal
    site         74       79     1/2 GRE
    site         80       87     enhancer core
    site        530      536     TATA box
BASE COUNT      254 a    235 c    222 g    256 t
ORIGIN      Chromosome 6p21.3.
        1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
       61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct
      121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa
      181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
      241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc
      301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt
      361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa
      421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa
      481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg
      541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg
      601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc
      661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg
      721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc
      781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga
      841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct
      901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac
      961 ccctaca
//
LOCUS       HUMMHDQ5L     960 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment,
            long terminal repeat region.
ACCESSION   M33842
KEYWORDS    major histocompatibility complex.
SOURCE      Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 960)
  AUTHORS   Kambhu,S., Falldorf,P. and Lee,J.S.
  TITLE     Endogenous retroviral long terminal repeats (LTR) within the HLA DQ
            locus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.S.Lee, 25-APR-1990.
FEATURES       from  to/span     description
    rpt           1        7     inverted repeat A
    rpt         954      960     inverted repeat B
    rpt         555      803     R region
    signal      785      790     poly-A signal
    site         74       79     1/2 GRE
    site         80       87     enhancer core
    site        525      531     TATA box
BASE COUNT      250 a    232 c    219 g    259 t
ORIGIN      Chromosome 6p21.3.
        1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
       61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct
      121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa
      181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
      241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag
      301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa
      361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt
      421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa
      481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt
      541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca
      601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac
      661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct
      721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat
      781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg
      841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa
      901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca
//
LOCUS       SYNLACZA     6476 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD1.27.
ACCESSION   M34296
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD1.27.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6476)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3500     3501     E.coli lacZ end/SV40 start
    recomb     3634     3635     SV40 end/synthetic start
    recomb     3657     3658     synthetic end/pUC19 start
    recomb     4076     4077     pUC19 end/C.elegans sup-7 start
    recomb     4442     4443     C.elegans sup-7 end/pUC19 start
BASE COUNT     1560 a   1646 c   1676 g   1594 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc agctgagcgc cggtcgctac cattaccagt tggtctggtg tcaaaaataa
     3241 taataaccgg gcaggccatg tctgcccgta tttcgcgtaa ggaaatccat tatgtactat
     3301 ttaaaaaaca caaacttttg gatgttcggt ttattctttt tcttttactt ttttatcatg
     3361 ggagcctact tcccgttttt cccgatttgg ctacatgaca tcaaccatat cagcaaaagt
     3421 gatacgggta ttatttttgc cgctatttct ctgttctcgc tattattcca accgctgttt
     3481 ggtctgcttt ctgacaaact cggaacttgt ttattgcagc ttataatggt tacaaataaa
     3541 gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt
     3601 tgtccaaact catcaatgta tcttatcatg tctggatcga caaagtcaaa gcggccgcct
     3661 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct
     3721 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc
     3781 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt
     3841 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa
     3901 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
     3961 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
     4021 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa
     4081 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc
     4141 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt
     4201 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt
     4261 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt
     4321 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac
     4381 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg
     4441 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt
     4501 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc
     4561 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat
     4621 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct
     4681 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca
     4741 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg
     4801 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa
     4861 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg
     4921 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga
     4981 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg
     5041 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt
     5101 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg
     5161 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc
     5221 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca
     5281 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc
     5341 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat
     5401 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc
     5461 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg
     5521 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct
     5581 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct
     5641 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct
     5701 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg
     5761 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc
     5821 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga
     5881 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg
     5941 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
     6001 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg
     6061 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg
     6121 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat
     6181 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc
     6241 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc
     6301 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa
     6361 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc
     6421 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS       SYNLACZB     6096 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD8.02.
ACCESSION   M34297
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD8.02.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6096)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     3275     3276     synthetic end/unknown DNA start
    recomb     3696     3697     unknown DNA end/C.elegans sup-7 end
    recomb     4062     4063     C.elegans sup-7 end/pUC19 start
BASE COUNT     1458 a   1571 c   1616 g   1451 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggcctact agtcggccgt acgggccctt aaggccgcct gatgcggtat tttctcctta
     3301 cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg
     3361 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt
     3421 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc
     3481 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat
     3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg
     3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc
     3661 tcatgagaca ataaccctga taaatgcttc aataatacaa ttttcagaat acgttttttg
     3721 tgggcttggg tatattgttt ttaatgttat acttgcagtc gtgaaatttg attttcaaat
     3781 ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt gtcaaaaacc aatttcaaca
     3841 aattttcgtg tgagaaatac attaccagaa ggcatttttt cacacgatta gcattttgga
     3901 ctactttatt aaatttttgc gtgtaatttt gaattaaatt gtattatatt actacttaaa
     3961 aaacaaaaaa tttgaccact gagcggatcg aacgcccaac ctttcgatct agagtcgaac
     4021 gcgctaccat tgcgccaagc agtcatgtta ttctctcttg tcattgaaaa aggaagagta
     4081 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg
     4141 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac
     4201 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg
     4261 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc
     4321 gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg
     4381 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat
     4441 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg
     4501 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg
     4561 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc
     4621 ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt
     4681 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct
     4741 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc
     4801 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca
     4861 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct
     4921 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt
     4981 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga
     5041 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca
     5101 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac
     5161 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg
     5221 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag
     5281 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac
     5341 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt
     5401 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg
     5461 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc
     5521 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc
     5581 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc
     5641 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa
     5701 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt
     5761 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg
     5821 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag
     5881 agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc
     5941 acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc
     6001 tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa
     6061 ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS       SYNLACZC     7376 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD8.33.
ACCESSION   M34298
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD8.33.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 7376)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     3244     3245     synthetic end/unknown DNA start
    recomb     3524     3425     unknown DNA end/C.elegans sup-7 end
    recomb     4555     4556     synthetic end/pUC19 start
    recomb     4976     4977     pUC19 end/C.elegans sup-7 start
    recomb     5342     5343     C.elegans sup-7 end/pUC19 start
BASE COUNT     1863 a   1793 c   1814 g   1906 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat
     3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat
     3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac
     3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg
     3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca
     3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca
     3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc
     3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt
     3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg
     3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa
     3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga
     3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa
     3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa
     4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg
     4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg
     4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa
     4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa
     4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc
     4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta
     4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa
     4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca
     4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt aaggccgcct
     4561 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct
     4621 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc
     4681 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt
     4741 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa
     4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
     4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
     4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa
     4981 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc
     5041 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt
     5101 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt
     5161 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt
     5221 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac
     5281 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg
     5341 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt
     5401 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc
     5461 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat
     5521 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct
     5581 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca
     5641 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg
     5701 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa
     5761 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg
     5821 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga
     5881 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg
     5941 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt
     6001 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg
     6061 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc
     6121 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca
     6181 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc
     6241 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat
     6301 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc
     6361 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg
     6421 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct
     6481 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct
     6541 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct
     6601 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg
     6661 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc
     6721 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga
     6781 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg
     6841 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
     6901 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg
     6961 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg
     7021 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat
     7081 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc
     7141 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc
     7201 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa
     7261 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc
     7321 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct
//
LOCUS       SYNLACZD     5730 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD16.43.
ACCESSION   M34299
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD16.43.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 5730)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     3264     3265     synthetic end/unknown DNA start
    recomb     3513     3514     unknown DNA end/synthetic start
    recomb     3524     3525     synthetic end/pUC19 start
BASE COUNT     1348 a   1488 c   1580 g   1314 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggcctact agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga
     3301 aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg
     3361 gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa
     3421 ctatgcggca tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca
     3481 cagatgcgta aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat
     3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg
     3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc
     3661 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta
     3721 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg
     3781 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg
     3841 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac
     3901 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg
     3961 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt
     4021 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg
     4081 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac
     4141 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt
     4201 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag
     4261 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc
     4321 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc
     4381 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta
     4441 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg
     4501 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga
     4561 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac
     4621 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa
     4681 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat
     4741 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc
     4801 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg
     4861 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc
     4921 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg
     4981 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg
     5041 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa
     5101 cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg
     5161 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga
     5221 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct
     5281 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca
     5341 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc
     5401 ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg
     5461 ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc
     5521 caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca
     5581 ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc
     5641 attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga
     5701 gcggataaca atttcacaca ggaaacagct
//
LOCUS       SYNLACZE     7010 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD16.51.
ACCESSION   M34300
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD16.51.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 7010)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     4544     4545     synthetic end/pUC19 start
    recomb     4793     4794     pUC19 end/synthetic start
    recomb     3244     3245     synthetic end/unknown DNA start
    recomb     4524     4525     unknown DNA end/synthetic start
    recomb     4804     4805     synthetic end/pUC19 start
BASE COUNT     1753 a   1710 c   1778 g   1769 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat
     3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat
     3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac
     3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg
     3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca
     3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca
     3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc
     3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt
     3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg
     3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa
     3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga
     3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa
     3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa
     4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg
     4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg
     4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa
     4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa
     4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc
     4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta
     4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa
     4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca
     4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt tcgtctcgcg
     4561 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct
     4621 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc
     4681 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat
     4741 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcggccttaa
     4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac
     4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
     4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg
     4981 aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc
     5041 attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga
     5101 tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga
     5161 gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg
     5221 cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc
     5281 tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac
     5341 agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact
     5401 tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca
     5461 tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg
     5521 tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact
     5581 acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg
     5641 accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg
     5701 tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat
     5761 cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc
     5821 tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat
     5881 actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt
     5941 tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc
     6001 cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt
     6061 gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac
     6121 tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt
     6181 gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct
     6241 gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga
     6301 ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac
     6361 acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagcattg
     6421 agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt
     6481 cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc
     6541 tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg
     6601 gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc
     6661 ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc
     6721 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag
     6781 cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca
     6841 ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat
     6901 taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg
     6961 tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct
//
LOCUS       SYNLACZF     7088 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD18.32.
ACCESSION   M34301
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD18.32.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 7088)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      180      181     synthetic end/E.coli trpS start
    recomb      264      265     E.coli trpS end/synthetic start
    recomb      268      269     synthetic end/E.coli lacZ start
    recomb     3262     3263     E.coli lacZ end/synthetic start
    recomb     4622     4623     synthetic end/pUC19 start
    recomb     4871     4872     pUC19 end/synthetic start
    recomb     3322     3323     synthetic end/unknown DNA start
    recomb     4602     4603     unknown DNA end/synthetic start
    recomb     4882     4883     synthetic end/pUC19 start
BASE COUNT     1780 a   1724 c   1797 g   1787 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
      181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
      241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
      301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
      361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
      421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
      481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
      541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
      601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
      661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
      721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
      781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
      841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
      901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
      961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
     1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
     1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
     1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
     1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
     1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
     1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
     1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
     1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
     1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
     1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
     1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
     1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
     1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
     1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
     1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
     1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
     1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
     2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
     2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
     2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
     2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
     2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
     2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
     2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
     2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
     2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
     2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
     2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
     2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
     2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
     2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
     2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
     2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
     3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
     3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
     3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
     3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
     3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
     3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct
     3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc
     3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc
     3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata
     3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta
     3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac
     3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg
     3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa
     3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc
     3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt
     3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg
     3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta
     4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttaaa
     4081 attaaacatt tttctaaatt ttatatgatt tcttttaaat ttgcaaaaat tacttaaatt
     4141 tgaattcccg cgcaaatgag tgacttcatt ttctgcatta ttgtgttttc cggctatatt
     4201 aataggtatt tgtttgtgtt tttctttatt ttatgattcg aactccaatt tgtaaatttt
     4261 cgaacatatt tccctaaaga aaaaatatga ttaatctgga aaaattggaa aattattttt
     4321 caaataaaaa acaaagaaaa aaatgaagaa aaacctatta gtttggccat aaaacgcaaa
     4381 aatgtcgaaa atgacgtcac tcatctgcgc gggaaatcaa gaataattcg gcctttttta
     4441 tttttttgga aaatcgtaaa acatttagaa aaatttttta atagttatag tgggactgta
     4501 ttctgtcatt tagggcaaaa gccagagacg ctactccacc gttaacatga attatgaata
     4561 ttattgcgac aagacccaaa cattgataaa ccgcaaatct agcctactag tcggccgtac
     4621 gggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc
     4681 ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc
     4741 gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc agagcagatt
     4801 gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac
     4861 cgcatcaggc ggccttaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat
     4921 aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat
     4981 ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata
     5041 aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct
     5101 tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa
     5161 agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa
     5221 cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt
     5281 taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg
     5341 tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca
     5401 tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa
     5461 cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt
     5521 gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc
     5581 cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa
     5641 actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga
     5701 ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc
     5761 tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga
     5821 tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga
     5881 acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga
     5941 ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat
     6001 ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt
     6061 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct
     6121 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc
     6181 ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc
     6241 aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc
     6301 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc
     6361 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg
     6421 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata
     6481 cctacagcgt gagcattgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta
     6541 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc
     6601 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg
     6661 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt
     6721 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt
     6781 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga
     6841 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc
     6901 cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg
     6961 cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca
     7021 ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg
     7081 aaacagct
//
LOCUS       SYNLACZG     6563 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD21.28.
ACCESSION   M34302
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD21.28.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6563)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      180      181     synthetic end/E.coli trpS start
    recomb      264      265     E.coli trpS end/synthetic start
    recomb      268      269     synthetic end/E.coli lacZ start
    recomb     3262     3263     E.coli lacZ end/synthetic start
    recomb     4097     4098     synthetic end/pUC19 start
    recomb     4346     4347     pUC19 end/synthetic start
    recomb     3322     3323     synthetic end/unknown DNA start
    recomb     4077     4078     unknown DNA end/synthetic start
    recomb     4357     4358     synthetic end/pUC19 start
BASE COUNT     1587 a   1650 c   1723 g   1603 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
      181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
      241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
      301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
      361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
      421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
      481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
      541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
      601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
      661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
      721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
      781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
      841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
      901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
      961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
     1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
     1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
     1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
     1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
     1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
     1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
     1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
     1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
     1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
     1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
     1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
     1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
     1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
     1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
     1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
     1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
     1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
     2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
     2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
     2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
     2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
     2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
     2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
     2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
     2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
     2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
     2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
     2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
     2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
     2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
     2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
     2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
     2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
     3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
     3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
     3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
     3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
     3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
     3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct
     3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc
     3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc
     3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata
     3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta
     3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac
     3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg
     3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa
     3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc
     3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt
     3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg
     3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta
     4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttcct
     4081 actagtcggc cgtacgggcc ctttcgtctc gcgcgtttcg gtgatgacgg tgaaaacctc
     4141 tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga
     4201 caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct taactatgcg
     4261 gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc
     4321 gtaaggagaa aataccgcat caggcggcct taagggcctc gtgatacgcc tatttttata
     4381 ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt
     4441 gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag
     4501 acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca
     4561 tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc
     4621 agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat
     4681 cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc
     4741 aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg
     4801 gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc
     4861 agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat
     4921 aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga
     4981 gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc
     5041 ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc
     5101 aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt
     5161 aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc
     5221 tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc
     5281 agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca
     5341 ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca
     5401 ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt
     5461 ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta
     5521 acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg
     5581 agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc
     5641 ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag
     5701 cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa
     5761 gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc
     5821 cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc
     5881 gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta
     5941 caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc ccgaagggag
     6001 aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct
     6061 tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga
     6121 gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc
     6181 ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt
     6241 atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg
     6301 cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg
     6361 caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc
     6421 cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc
     6481 accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata
     6541 acaatttcac acaggaaaca gct
//
LOCUS       SYNLACZH     6562 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD22.04.
ACCESSION   M34303
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD22.04.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6562)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      179      180     synthetic end/E.coli trpS start
    recomb      263      264     E.coli trpS end/synthetic start
    recomb      267      268     synthetic end/E.coli lacZ start
    recomb     3261     3262     E.coli lacZ end/synthetic start
    recomb     4096     4097     synthetic end/pUC19 start
    recomb     4345     4346     pUC19 end/synthetic start
    recomb     3321     3322     synthetic end/unknown DNA start
    recomb     4076     4077     unknown DNA end/synthetic start
    recomb     4356     4357     synthetic end/pUC19 start
BASE COUNT     1587 a   1650 c   1722 g   1603 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt gagggtaccg agctcagaaa aaatgactgc tccaaagaag aagcgtaagg
      181 taccggtggg tgaagaccag aaacagcacc tcgaactgag ccgcgatatt gcccagcgtt
      241 tcaacgcgct gtatggcgag atcgatcccg tcgttttaca acgtcgtgac tgggaaaacc
      301 ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata
      361 gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc
      421 gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag tgcgatcttc
      481 ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac gatgcgccca
      541 tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc acggagaatc
      601 cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag gaaggccaga
      661 cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac gggcgctggg
      721 tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca tttttacgcg
      781 ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt tatctggaag
      841 atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg cataaaccga
      901 ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc agccgcgctg
      961 tactggaggc tgaagttcag atgtgcggcg agttgcgtga ctacctacgg gtaacagttt
     1021 ctttatggca gggtgaaacg caggtcgcca gcggcaccgc gcctttcggc ggtgaaatta
     1081 tcgatgagcg tggtggttat gccgatcgcg tcacactacg tctgaacgtc gaaaacccga
     1141 aactgtggag cgccgaaatc ccgaatctct atcgtgcggt ggttgaactg cacaccgccg
     1201 acggcacgct gattgaagca gaagcctgcg atgtcggttt ccgcgaggtg cggattgaaa
     1261 atggtctgct gctgctgaac ggcaagccgt tgctgattcg aggcgttaac cgtcacgagc
     1321 atcatcctct gcatggtcag gtcatggatg agcagacgat ggtgcaggat atcctgctga
     1381 tgaagcagaa caactttaac gccgtgcgct gttcgcatta tccgaaccat ccgctgtggt
     1441 acacgctgtg cgaccgctac ggcctgtatg tggtggatga agccaatatt gaaacccacg
     1501 gcatggtgcc aatgaatcgt ctgaccgatg atccgcgctg gctaccggcg atgagcgaac
     1561 gcgtaacgcg aatggtgcag cgcgatcgta atcacccgag tgtgatcatc tggtcgctgg
     1621 ggaatgaatc aggccacggc gctaatcacg acgcgctgta tcgctggatc aaatctgtcg
     1681 atccttcccg cccggtgcag tatgaaggcg gcggagccga caccacggcc accgatatta
     1741 tttgcccgat gtacgcgcgc gtggatgaag accagccctt cccggctgtg ccgaaatggt
     1801 ccatcaaaaa atggctttcg ctacctggag agacgcgccc gctgatcctt tgcgaatacg
     1861 cccacgcgat gggtaacagt cttggcggtt tcgctaaata ctggcaggcg tttcgtcagt
     1921 atccccgttt acagggcggc ttcgtctggg actgggtgga tcagtcgctg attaaatatg
     1981 atgaaaacgg caacccgtgg tcggcttacg gcggtgattt tggcgatacg ccgaacgatc
     2041 gccagttctg tatgaacggt ctggtctttg ccgaccgcac gccgcatcca gcgctgacgg
     2101 aagcaaaaca ccagcagcag tttttccagt tccgtttatc cgggcaaacc atcgaagtga
     2161 ccagcgaata cctgttccgt catagcgata acgagctcct gcactggatg gtggcgctgg
     2221 atggtaagcc gctggcaagc ggtgaagtgc ctctggatgt cgctccacaa ggtaaacagt
     2281 tgattgaact gcctgaacta ccgcagccgg agagcgccgg gcaactctgg ctcacagtac
     2341 gcgtagtgca accgaacgcg accgcatggt cagaagccgg gcacatcagc gcctggcagc
     2401 agtggcgtct ggcggaaaac ctcagtgtga cgctccccgc cgcgtcccac gccatcccgc
     2461 atctgaccac cagcgaaatg gatttttgca tcgagctggg taataagcgt tggcaattta
     2521 accgccagtc aggctttctt tcacagatgt ggattggcga taaaaaacaa ctgctgacgc
     2581 cgctgcgcga tcagttcacc cgtgcaccgc tggataacga cattggcgta agtgaagcga
     2641 cccgcattga ccctaacgcc tgggtcgaac gctggaaggc ggcgggccat taccaggccg
     2701 aagcagcgtt gttgcagtgc acggcagata cacttgctga tgcggtgctg attacgaccg
     2761 ctcacgcgtg gcagcatcag gggaaaacct tatttatcag ccggaaaacc taccggattg
     2821 atggtagtgg tcaaatggcg attaccgttg atgttgaagt ggcgagcgat acaccgcatc
     2881 cggcgcggat tggcctgaac tgccagctgg cgcaggtagc agagcgggta aactggctcg
     2941 gattagggcc gcaagaaaac tatcccgacc gccttactgc cgcctgtttt gaccgctggg
     3001 atctgccatt gtcagacatg tataccccgt acgtcttccc gagcgaaaac ggtctgcgct
     3061 gcgggacgcg cgaattgaat tatggcccac accagtggcg cggcgacttc cagttcaaca
     3121 tcagccgcta cagtcaacag caactgatgg aaaccagcca tcgccatctg ctgcacgcgg
     3181 aagaaggcac atggctgaat atcgacggtt tccatatggg gattggtggc gacgactcct
     3241 ggagcccgtc agtatcggcg gaattccaac tgagcgccgg tcgctaccat taccaacttg
     3301 tctggtgtca aaaataatag gggccgctgt catcagatcg ccatctcgcg cccgtgcctc
     3361 tgacttctaa gtccaattac tcttcaacat ccctacatgc tctttctccc tgtgctccca
     3421 ccccctattt ttgttattat caaaaaaact tcttcttaat ttctttgttt tttagcttct
     3481 tttaagtcac ctctaacaat gaaattgtgt agattcaaaa atagaattaa ttcgtaataa
     3541 aaagtcgaaa aaaattgtgc tccctccccc cattaataat aattctatcc caaaatctac
     3601 acaatgttct gtgtacactt cttatgtttt ttttacttct gataaatttt ttttgaaaca
     3661 tcatagaaaa aaccgcacac aaaatacctt atcatatgtt acgtttcagt ttatgaccgc
     3721 aatttttatt tcttcgcacg tctgggcctc tcatgacgtc aaatcatgct catcgtgaaa
     3781 aagttttgga gtatttttgg aatttttcaa tcaagtgaaa gtttatgaaa ttaattttcc
     3841 tgcttttgct ttttgggggt ttcccctatt gtttgtcaag agtttcgagg acggcgtttt
     3901 tcttgctaaa atcacaagta ttgatgagca cgatgcaaga aagatcggaa gaaggtttgg
     3961 gtttgaggct cagtggaagg tgagtagaag ttgataattt gaaagtggag tagtgtctat
     4021 ggggtttttg ccttaaatga cagaatacat tcccaatata ccaaacataa ctgtttccta
     4081 ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct
     4141 gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac
     4201 aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg
     4261 catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg
     4321 taaggagaaa ataccgcatc aggcggcctt aagggcctcg tgatacgcct atttttatag
     4381 gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg
     4441 cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga
     4501 caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat
     4561 ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca
     4621 gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc
     4681 gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca
     4741 atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg
     4801 caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca
     4861 gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata
     4921 accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag
     4981 ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg
     5041 gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca
     5101 acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta
     5161 atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct
     5221 ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca
     5281 gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag
     5341 gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat
     5401 tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt
     5461 taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa
     5521 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga
     5581 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg
     5641 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc
     5701 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag
     5761 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc
     5821 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg
     5881 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac
     5941 accgaactga gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga
     6001 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt
     6061 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag
     6121 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg
     6181 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta
     6241 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc
     6301 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc
     6361 aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc
     6421 gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca
     6481 ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa
     6541 caatttcaca caggaaacag ct
//
LOCUS       SYNLACZI     6567 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD22.11.
ACCESSION   M34304
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD22.11.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6567)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      184      185     synthetic end/E.coli trpS start
    recomb      268      269     E.coli trpS end/synthetic start
    recomb      272      273     synthetic end/E.coli lacZ start
    recomb     3266     3267     E.coli lacZ end/synthetic start
    recomb     4101     4102     synthetic end/pUC19 start
    recomb     4350     4351     pUC19 end/synthetic start
    recomb     3326     3327     synthetic end/unknown DNA start
    recomb     4081     4082     unknown DNA end/synthetic start
    recomb     4361     4362     synthetic end/pUC19 start
BASE COUNT     1587 a   1651 c   1724 g   1605 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt gcttggaggg taccgagctc agaaaaaatg actgctccaa agaagaagcg
      181 taaggtaccg gtgggtgaag accagaaaca gcacctcgaa ctgagccgcg atattgccca
      241 gcgtttcaac gcgctgtatg gcgagatcga tcccgtcgtt ttacaacgtc gtgactggga
      301 aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg
      361 taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga
      421 atggcgcttt gcctggtttc cggcaccaga agcggtgccg gaaagctggc tggagtgcga
      481 tcttcctgag gccgatactg tcgtcgtccc ctcaaactgg cagatgcacg gttacgatgc
      541 gcccatctac accaacgtaa cctatcccat tacggtcaat ccgccgtttg ttcccacgga
      601 gaatccgacg ggttgttact cgctcacatt taatgttgat gaaagctggc tacaggaagg
      661 ccagacgcga attatttttg atggcgttaa ctcggcgttt catctgtggt gcaacgggcg
      721 ctgggtcggt tacggccagg acagtcgttt gccgtctgaa tttgacctga gcgcattttt
      781 acgcgccgga gaaaaccgcc tcgcggtgat ggtgctgcgt tggagtgacg gcagttatct
      841 ggaagatcag gatatgtggc ggatgagcgg cattttccgt gacgtctcgt tgctgcataa
      901 accgactaca caaatcagcg atttccatgt tgccactcgc tttaatgatg atttcagccg
      961 cgctgtactg gaggctgaag ttcagatgtg cggcgagttg cgtgactacc tacgggtaac
     1021 agtttcttta tggcagggtg aaacgcaggt cgccagcggc accgcgcctt tcggcggtga
     1081 aattatcgat gagcgtggtg gttatgccga tcgcgtcaca ctacgtctga acgtcgaaaa
     1141 cccgaaactg tggagcgccg aaatcccgaa tctctatcgt gcggtggttg aactgcacac
     1201 cgccgacggc acgctgattg aagcagaagc ctgcgatgtc ggtttccgcg aggtgcggat
     1261 tgaaaatggt ctgctgctgc tgaacggcaa gccgttgctg attcgaggcg ttaaccgtca
     1321 cgagcatcat cctctgcatg gtcaggtcat ggatgagcag acgatggtgc aggatatcct
     1381 gctgatgaag cagaacaact ttaacgccgt gcgctgttcg cattatccga accatccgct
     1441 gtggtacacg ctgtgcgacc gctacggcct gtatgtggtg gatgaagcca atattgaaac
     1501 ccacggcatg gtgccaatga atcgtctgac cgatgatccg cgctggctac cggcgatgag
     1561 cgaacgcgta acgcgaatgg tgcagcgcga tcgtaatcac ccgagtgtga tcatctggtc
     1621 gctggggaat gaatcaggcc acggcgctaa tcacgacgcg ctgtatcgct ggatcaaatc
     1681 tgtcgatcct tcccgcccgg tgcagtatga aggcggcgga gccgacacca cggccaccga
     1741 tattatttgc ccgatgtacg cgcgcgtgga tgaagaccag cccttcccgg ctgtgccgaa
     1801 atggtccatc aaaaaatggc tttcgctacc tggagagacg cgcccgctga tcctttgcga
     1861 atacgcccac gcgatgggta acagtcttgg cggtttcgct aaatactggc aggcgtttcg
     1921 tcagtatccc cgtttacagg gcggcttcgt ctgggactgg gtggatcagt cgctgattaa
     1981 atatgatgaa aacggcaacc cgtggtcggc ttacggcggt gattttggcg atacgccgaa
     2041 cgatcgccag ttctgtatga acggtctggt ctttgccgac cgcacgccgc atccagcgct
     2101 gacggaagca aaacaccagc agcagttttt ccagttccgt ttatccgggc aaaccatcga
     2161 agtgaccagc gaatacctgt tccgtcatag cgataacgag ctcctgcact ggatggtggc
     2221 gctggatggt aagccgctgg caagcggtga agtgcctctg gatgtcgctc cacaaggtaa
     2281 acagttgatt gaactgcctg aactaccgca gccggagagc gccgggcaac tctggctcac
     2341 agtacgcgta gtgcaaccga acgcgaccgc atggtcagaa gccgggcaca tcagcgcctg
     2401 gcagcagtgg cgtctggcgg aaaacctcag tgtgacgctc cccgccgcgt cccacgccat
     2461 cccgcatctg accaccagcg aaatggattt ttgcatcgag ctgggtaata agcgttggca
     2521 atttaaccgc cagtcaggct ttctttcaca gatgtggatt ggcgataaaa aacaactgct
     2581 gacgccgctg cgcgatcagt tcacccgtgc accgctggat aacgacattg gcgtaagtga
     2641 agcgacccgc attgacccta acgcctgggt cgaacgctgg aaggcggcgg gccattacca
     2701 ggccgaagca gcgttgttgc agtgcacggc agatacactt gctgatgcgg tgctgattac
     2761 gaccgctcac gcgtggcagc atcaggggaa aaccttattt atcagccgga aaacctaccg
     2821 gattgatggt agtggtcaaa tggcgattac cgttgatgtt gaagtggcga gcgatacacc
     2881 gcatccggcg cggattggcc tgaactgcca gctggcgcag gtagcagagc gggtaaactg
     2941 gctcggatta gggccgcaag aaaactatcc cgaccgcctt actgccgcct gttttgaccg
     3001 ctgggatctg ccattgtcag acatgtatac cccgtacgtc ttcccgagcg aaaacggtct
     3061 gcgctgcggg acgcgcgaat tgaattatgg cccacaccag tggcgcggcg acttccagtt
     3121 caacatcagc cgctacagtc aacagcaact gatggaaacc agccatcgcc atctgctgca
     3181 cgcggaagaa ggcacatggc tgaatatcga cggtttccat atggggattg gtggcgacga
     3241 ctcctggagc ccgtcagtat cggcggaatt ccaactgagc gccggtcgct accattacca
     3301 acttgtctgg tgtcaaaaat aataggggcc gctgtcatca gatcgccatc tcgcgcccgt
     3361 gcctctgact tctaagtcca attactcttc aacatcccta catgctcttt ctccctgtgc
     3421 tcccaccccc tatttttgtt attatcaaaa aaacttcttc ttaatttctt tgttttttag
     3481 cttcttttaa gtcacctcta acaatgaaat tgtgtagatt caaaaataga attaattcgt
     3541 aataaaaagt cgaaaaaaat tgtgctccct ccccccatta ataataattc tatcccaaaa
     3601 tctacacaat gttctgtgta cacttcttat gtttttttta cttctgataa attttttttg
     3661 aaacatcata gaaaaaaccg cacacaaaat accttatcat atgttacgtt tcagtttatg
     3721 accgcaattt ttatttcttc gcacgtctgg gcctctcatg acgtcaaatc atgctcatcg
     3781 tgaaaaagtt ttggagtatt tttggaattt ttcaatcaag tgaaagttta tgaaattaat
     3841 tttcctgctt ttgctttttg ggggtttccc ctattgtttg tcaagagttt cgaggacggc
     3901 gtttttcttg ctaaaatcac aagtattgat gagcacgatg caagaaagat cggaagaagg
     3961 tttgggtttg aggctcagtg gaaggtgagt agaagttgat aatttgaaag tggagtagtg
     4021 tctatggggt ttttgcctta aatgacagaa tacattccca atataccaaa cataactgtt
     4081 tcctactagt cggccgtacg ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa
     4141 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag
     4201 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta
     4261 tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag
     4321 atgcgtaagg agaaaatacc gcatcaggcg gccttaaggg cctcgtgata cgcctatttt
     4381 tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa
     4441 atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca
     4501 tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc
     4561 aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc
     4621 acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt
     4681 acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt
     4741 ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg
     4801 ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact
     4861 caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg
     4921 ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga
     4981 aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg
     5041 aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa
     5101 tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac
     5161 aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc
     5221 cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca
     5281 ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga
     5341 gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta
     5401 agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc
     5461 atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc
     5521 cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt
     5581 cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac
     5641 cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct
     5701 tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact
     5761 tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg
     5821 ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata
     5881 aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga
     5941 cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag
     6001 ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg
     6061 agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac
     6121 ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca
     6181 acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg
     6241 cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc
     6301 gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa
     6361 tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt
     6421 ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt
     6481 aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg
     6541 gataacaatt tcacacagga aacagct
//
LOCUS       SYNLACZJ     7242 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD26.77.
ACCESSION   M34305
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD26.77.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 7242)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      102      103     synthetic end/E.coli trpS start
    recomb      186      187     E.coli trpS end/synthetic start
    recomb      190      191     synthetic end/E.coli lacZ start
    recomb     3184     3185     E.coli lacZ end/synthetic start
    recomb     4776     4777     synthetic end/pUC19 start
    recomb     5025     5026     pUC19 end/synthetic start
    recomb     3244     3245     synthetic end/unknown DNA start
    recomb     4756     4757     unknown DNA end/synthetic start
    recomb     5036     5037     synthetic end/pUC19 start
BASE COUNT     1780 a   1801 c   1846 g   1815 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta
       61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac
      121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc
      181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
      241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
      301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg
      361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc
      421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc
      481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg
      541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat
      601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac
      661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc
      721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg
      781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat
      841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt
      901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa
      961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt
     1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa
     1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa
     1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg
     1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt
     1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt
     1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc
     1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat
     1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg
     1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac
     1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg
     1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg
     1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt
     1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac
     1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc
     1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg
     1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac
     1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag
     2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc
     2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca
     2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa
     2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac
     2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa
     2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa
     2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt
     2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc
     2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac
     2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag
     2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat
     2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg
     2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg
     2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa
     2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac
     2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg
     3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa
     3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg
     3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg
     3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa
     3241 taggcgaaac aaatcatctg acaccaccac cgtctgatgg atcgttctca tctccgtctc
     3301 cacattatta tccgacgact acatcgacac cgaatcgaat ggaaacaagt ccggagtaca
     3361 tgtttaacca tgaaatggtg ggtagatgat tattaaaatg tttaagaaaa ttaaataatt
     3421 tgttttaggc accaccggtc aatgcgatgt ggtatactac accacctcct tatcaagatc
     3481 caaactatcg tcatgtgcct ccaaatactg catttcaaaa tgcagagcaa atgaatggct
     3541 ccttctactg ttaatctatt taattcatta atttttcatt tattgactgt atcccggatg
     3601 tttcttgtcc tcccaacata tctcctaact gctcggttca ttttaaatat gctcatctca
     3661 ctacatcacc cagacactgg tccccacaga gttttttgta tactatttcg ggtcattttt
     3721 cttattctag actaatattg taagctataa gttgtagaat aattattgat ccaaatcaga
     3781 ttaagagtat aagctttgtt ttttctcctt ttctttataa cttgttacaa tttttgaaat
     3841 tccctttttt gacaggcttt tattacactg taactgtgtt tcttatcttg caaacattta
     3901 atgaattgta attctttagt atcttgaggg ctttttgttt ttcgaattat tgaagctcaa
     3961 agttccagtt ttactacgat ccagcgaatt ctcctcattt cgatccgatg caattgactt
     4021 cagatcaata ttggttgcct gaaagaaata attgtgagca tttttgtcaa aaaacagaga
     4081 actcaccatt ctcgaggctc ccgttccagg agcagtactt ggtgatggac acgtagattg
     4141 attaaaccaa accaaaggtt ctttcagagt caacttacag cctcgagcgt agtccgtgat
     4201 agcttctcgc agaacactga aaattggaaa tttattggaa taaaaacttt ttctgcactt
     4261 tatagaataa aaaaatcatg aatttacccg aatttaacct ccgaatcgta gaccaaattg
     4321 tccaagtaga tggaaatcac cttgaacatc ggatgttttt catatgctga aaataaatta
     4381 atgaatttat gtaatttttt aaataattac ttttcaattt ggtgaacaat tcctgcttct
     4441 ttgcataggc atctggacga gtgagtcctt tccaatcaat caatgtggtg tcgacctcga
     4501 gggggggccc ggtacccagc ttttgttccc tttagtgagg gttaattccg agcttggcgt
     4561 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca
     4621 taggagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagg taactcacat
     4681 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt
     4741 aatgaatcgg ccaacgccta ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg
     4801 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta
     4861 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg
     4921 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg
     4981 tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcggcctt aagggcctcg
     5041 tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg
     5101 gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa
     5161 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga
     5221 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc
     5281 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg
     5341 gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc
     5401 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat
     5461 tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg
     5521 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag
     5581 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa
     5641 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc
     5701 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca
     5761 cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc
     5821 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc
     5881 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg
     5941 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta
     6001 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag
     6061 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga
     6121 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc
     6181 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa
     6241 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa
     6301 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc
     6361 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt
     6421 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc
     6481 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac
     6541 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca
     6601 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcat tgagaaagcg
     6661 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag
     6721 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt
     6781 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat
     6841 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc
     6901 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt
     6961 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag
     7021 cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca
     7081 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga
     7141 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt
     7201 gtggaattgt gagcggataa caatttcaca caggaaacag ct
//
LOCUS       SYNLACZK     6620 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD34.110.
ACCESSION   M34306
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD34.110.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6620)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      237      238     synthetic end/E.coli trpS start
    recomb      321      322     E.coli trpS end/synthetic start
    recomb      325      326     synthetic end/E.coli lacZ start
    recomb     3319     3320     E.coli lacZ end/synthetic start
    recomb     4154     4155     synthetic end/pUC19 start
    recomb     4403     4404     pUC19 end/synthetic start
    recomb     3379     3380     synthetic end/unknown DNA start
    recomb     4134     4135     unknown DNA end/synthetic start
    recomb     4414     4415     synthetic end/pUC19 start
BASE COUNT     1592 a   1665 c   1731 g   1632 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt ggagggtacc tcgagaaagc tggcaaaggg ctcttgtcct gctaatcgta
      181 ctactcttca tcgtcatctt cgttattact gttttgttcg tcataagatc taacaaggta
      241 ccggtgggtg aagaccagaa acagcacctc gaactgagcc gcgatattgc ccagcgtttc
      301 aacgcgctgt atggcgagat cgatcccgtc gttttacaac gtcgtgactg ggaaaaccct
      361 ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc
      421 gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc
      481 tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct
      541 gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc
      601 tacaccaacg taacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg
      661 acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg
      721 cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc
      781 ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc
      841 ggagaaaacc gcctcgcggt gatggtgctg cgttggagtg acggcagtta tctggaagat
      901 caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact
      961 acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta
     1021 ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct
     1081 ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc
     1141 gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa
     1201 ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac
     1261 ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat
     1321 ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat
     1381 catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg
     1441 aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac
     1501 acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc
     1561 atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc
     1621 gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg
     1681 aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat
     1741 ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt
     1801 tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc
     1861 atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc
     1921 cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat
     1981 ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat
     2041 gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc
     2101 cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa
     2161 gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc
     2221 agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat
     2281 ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg
     2341 attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc
     2401 gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag
     2461 tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat
     2521 ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac
     2581 cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg
     2641 ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc
     2701 cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa
     2761 gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct
     2821 cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat
     2881 ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg
     2941 gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga
     3001 ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat
     3061 ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc
     3121 gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc
     3181 agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa
     3241 gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg
     3301 agcccgtcag tatcggcgga attccaactg agcgccggtc gctaccatta ccaacttgtc
     3361 tggtgtcaaa aataataggg gccgctgtca tcagatcgcc atctcgcgcc cgtgcctctg
     3421 acttctaagt ccaattactc ttcaacatcc ctacatgctc tttctccctg tgctcccacc
     3481 ccctattttt gttattatca aaaaaacttc ttcttaattt ctttgttttt tagcttcttt
     3541 taagtcacct ctaacaatga aattgtgtag attcaaaaat agaattaatt cgtaataaaa
     3601 agtcgaaaaa aattgtgctc cctcccccca ttaataataa ttctatccca aaatctacac
     3661 aatgttctgt gtacacttct tatgtttttt ttacttctga taaatttttt ttgaaacatc
     3721 atagaaaaaa ccgcacacaa aataccttat catatgttac gtttcagttt atgaccgcaa
     3781 tttttatttc ttcgcacgtc tgggcctctc atgacgtcaa atcatgctca tcgtgaaaaa
     3841 gttttggagt atttttggaa tttttcaatc aagtgaaagt ttatgaaatt aattttcctg
     3901 cttttgcttt ttgggggttt cccctattgt ttgtcaagag tttcgaggac ggcgtttttc
     3961 ttgctaaaat cacaagtatt gatgagcacg atgcaagaaa gatcggaaga aggtttgggt
     4021 ttgaggctca gtggaaggtg agtagaagtt gataatttga aagtggagta gtgtctatgg
     4081 ggtttttgcc ttaaatgaca gaatacattc ccaatatacc aaacataact gtttcctact
     4141 agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga
     4201 cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa
     4261 gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca
     4321 tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta
     4381 aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat ttttataggt
     4441 taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg
     4501 cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca
     4561 ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt
     4621 ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga
     4681 aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga
     4741 actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat
     4801 gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca
     4861 agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt
     4921 cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac
     4981 catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct
     5041 aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga
     5101 gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac
     5161 aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat
     5221 agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg
     5281 ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc
     5341 actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc
     5401 aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg
     5461 gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta
     5521 atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg
     5581 tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga
     5641 tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt
     5701 ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag
     5761 agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa
     5821 ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag
     5881 tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca
     5941 gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac
     6001 cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg aagggagaaa
     6061 ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc
     6121 agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg
     6181 tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc
     6241 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc
     6301 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag
     6361 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa
     6421 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga
     6481 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc
     6541 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca
     6601 atttcacaca ggaaacagct
//
LOCUS       SYNLACZL     5808 bp ds-DNA             SYN       17-JUL-1990
DEFINITION  Cloning vector pPD16.01.
ACCESSION   M34307
KEYWORDS    lacZ.
SOURCE      Cloning vector pPD16.01.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 5808)
  AUTHORS   Fire,A.Z., Harrison,S. and Dixon,D.
  TITLE     A modular set of lac-Z fusion vectors for studying gene expression
            in C.elegans
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Z.Fire, 11-MAY-1990.
            
              Author address: A.Z.Fire
              Carnegie Inst of Washington
              Dept Embryology
              115 West Univ Parkway
              Baltimore, MD 21210
              email: AZF@JHUIGF.BITNET
            
FEATURES       from  to/span     description
    recomb       57       58     pUC19 end/synthetic start
    recomb      180      181     synthetic end/E.coli trpS start
    recomb      264      265     E.coli trpS end/synthetic start
    recomb      268      269     synthetic end/E.coli lacZ start
    recomb     3262     3263     E.coli lacZ end/synthetic start
    recomb     3342     3343     synthetic end/unknown DNA start
    recomb     3591     3592     unknown DNA end/synthetic start
    recomb     3602     3603     synthetic end/pUC19 start
BASE COUNT     1375 a   1502 c   1599 g   1332 t
ORIGIN      
        1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat
       61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag
      121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag
      181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt
      241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac
      301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat
      361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg
      421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt
      481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc
      541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat
      601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag
      661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg
      721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc
      781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa
      841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg
      901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct
      961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt
     1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt
     1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg
     1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc
     1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa
     1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag
     1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg
     1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg
     1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac
     1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa
     1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg
     1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc
     1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt
     1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg
     1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac
     1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag
     1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat
     1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat
     2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg
     2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg
     2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg
     2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag
     2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta
     2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag
     2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg
     2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt
     2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg
     2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg
     2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc
     2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc
     2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt
     2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat
     2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc
     2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg
     3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc
     3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac
     3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg
     3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc
     3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt
     3301 gtctggtgtc aaaaataata ggcctactag tcggccgtac gggccctttc gtctcgcgcg
     3361 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg
     3421 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg
     3481 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat
     3541 gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc ggccttaagg
     3601 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt
     3661 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac
     3721 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa
     3781 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat
     3841 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc
     3901 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga
     3961 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg
     4021 cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc
     4081 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag
     4141 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc
     4201 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg
     4261 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg
     4321 acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac
     4381 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac
     4441 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg
     4501 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg
     4561 tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg
     4621 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac
     4681 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg
     4741 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg
     4801 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc
     4861 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc
     4921 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt
     4981 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc
     5041 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact
     5101 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac
     5161 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag
     5221 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg
     5281 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg
     5341 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga
     5401 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt
     5461 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct
     5521 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg
     5581 aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt
     5641 aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta
     5701 atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta
     5761 tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct
//
LOCUS       MUSTCAXL      331 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma LD1.
ACCESSION   M34194
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 331)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   331     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       71 a     86 c     89 g     85 t
ORIGIN      
        1 cagtcagtga cgcagcccga tgctcgtgtc actgtctctg aaggagcctc tctgcagctg
       61 agatgcaagt attcctcctc tgtgacacct tatctgttct ggtatgtcct gtacccgcgg
      121 caggggctgc agctgctcct caagtactat tccggagacc cagtggttca aggagtgaat
      181 ggctttgagg ctgagttcag caagagtaac tcttccttcc acctgcggaa agcctccgtg
      241 cactggagcg actcggctgt gtacttctgt gctgtgagca tggatggaaa tgagaaaata
      301 acttttgggg ctggaaccaa actcaccatt a
//
LOCUS       MUSTCAXM      334 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma LD3.
ACCESSION   M34196
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 334)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   334     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       97 a     84 c     75 g     78 t
ORIGIN      
        1 cagcaggtga gacaaagtcc ccaatctctg acagtctggg aaggagagac agcaattctg
       61 aactgcagtt atgaggacag cacttttgac tacttcccat ggtaccgtct gttccctggg
      121 gaaagccctg cactcctgat agccatacgt ccagtgtcca ataaaaagga agatggacga
      181 ttcacaatct tcttcaataa aagggagaaa aagctctcct tgcacatcac agactctcag
      241 cctggagact cagctaccta cttctgtgca gcaagaagta caggctttgc aagtgcgctg
      301 acatttggat ctggcacaaa agtcattgtt ctac
//
LOCUS       MUSTCAXN      327 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-14.
ACCESSION   M34198
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 327)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   327     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       92 a     80 c     85 g     70 t
ORIGIN      
        1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata
       61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga
      121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg
      181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa
      241 gagtcagact cggctgtgta ctactgtgct ctgagtgatc agcgggggaa gcttatcttt
      301 ggacagggaa ccaagttatc tatcaag
//
LOCUS       MUSTCAXO      324 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-27.
ACCESSION   M34200
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   324     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       92 a     80 c     83 g     69 t
ORIGIN      
        1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata
       61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga
      121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg
      181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa
      241 gagtcagact cggctgtgta ctactgtgct ctgaggagca actatcagtt gatctggggc
      301 tctgggacca agctaattat aaag
//
LOCUS       MUSTCAXP      297 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-6.
ACCESSION   M34202
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 297)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   297     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       85 a     73 c     74 g     65 t
ORIGIN      
        1 gtgacattat ctgaaggaac ttctctgact gtgaactgtt cctatgaaac caaacagtac
       61 ccaaccctgt tctggtatgt gcagtatccc ggagaaggtc cacagctcct ctttaaagtc
      121 ccaaaggcca acgagaaggg aagcagcaga gggtttgaag ccacatacaa taaagaagcc
      181 acctccttcc acttgcagaa agcctcagtg caagagtcag actcggctgt gtactactgt
      241 gctctgagtg atcgggggac caatacaggc aaattaacct ttggggatgg gaccgtg
//
LOCUS       MUSTCAXQ      193 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, T-cell clone V2.1.
ACCESSION   M34204
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 193)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   193     T-cell receptor alpha-chain (AA at 2)
BASE COUNT       56 a     45 c     49 g     43 t
ORIGIN      
        1 cctctttaaa gtcccaaagg ccaacgagaa gggaagcagc agagggtttg aagccacata
       61 caataaagaa gccacctcct tccacttgca gaaagcctca gtgcaagagt cagactcggc
      121 tgtgtactac tgtgctctga gtggaggcaa taataagctg acttttggtc aaggaaccgt
      181 tctgagtgtt ctg
//
LOCUS       MUSTCAXR      333 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-33.
ACCESSION   M34206
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 333)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   333     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       84 a     96 c     78 g     74 t      1 others
ORIGIN      
        1 gactccgtga cccagacaga aggcctggtc actgtcaccg aggggttgcc tgtgaagctg
       61 aactgcacct atcagactac ttatttaact attgcctttt tctggtatgt gcaatatctc
      121 aacgaagccc ctcaggtact cctgcggagc tccacagaca acaagaggac cgagcaccaa
      181 gggttccacg ccactctcna taagagcagc agctccttcc atctgcagaa gtcctcagcg
      241 cagctgtcag actctgccct gtactactgt gctctgagga atacaggagg tgcagataga
      301 ctcacctttg ggaaaggaac tcagctgatc atc
//
LOCUS       MUSTCAXS      339 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma MT1-7.
ACCESSION   M34208
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   339     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       86 a    103 c     75 g     75 t
ORIGIN      
        1 gactcagtga cccagacaga aggcctggtc actctcaccg aggggttgcc tgtgatgctg
       61 aactgcacct atcagactgc ttactcaact ttccttttct ggtatgtgca acatctcaat
      121 gaagccccta aactactcct gaagagctcc acagacaaca agaggaccga gcaccaaggg
      181 ttccacgcca ctctccataa gagcagcagc tccttccatc tgcagaagtc ctcagcgcag
      241 ctgtcagact ctgccctgta ctactgtgct ctgagtgata agactggagc taacactgga
      301 aagctcacgt ttggacacgg caccatcctt agggtccat
//
LOCUS       MUSTCAXT      342 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma P1F12C4.
ACCESSION   M34210
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 342)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   342     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       80 a    102 c     79 g     81 t
ORIGIN      
        1 gactccgtga cccagacaga aggcctggtc actctcaacg aggggttgcc tgtgatgctg
       61 aactgcacct atcagactat ttactcaaat gctttccttt tctggtatgt gcactatctc
      121 aatgaatccc cttggctact cctgcggagc tccacagaca acaagaggac cgagcaccaa
      181 gggttccacg ccactctcca taagagcagc agctccttcc atctgcagaa gtcctcagcg
      241 cagctgtcag actctgccct gtactactgt gctttgagtg agaggtctgg agctaacact
      301 ggaaagctca cgtttggaca cggcaccatc cttagggtcc at
//
LOCUS       MUSTCAXU      324 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma P1D3A6.
ACCESSION   M34212
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   324     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       82 a     80 c     81 g     81 t
ORIGIN      
        1 cagcaagtgc agcagagccc cgcgtccttg gttctgcagg agggggagaa tgcagagctg
       61 cagtgtaact tttccacatc tttgaacagt atgcagtggt tttaccaacg tcctgaggga
      121 agtctcgtca gcctgttcta caatccttct gggacaaagc agagtgggag actgacatcc
      181 acaacagtca tcaaagaacg tcgcagctct ttgcacattt cctcctccca gatcacagac
      241 tcaggcactt atctctgtgc tatggaggct actggaggca ataataagct gacttttggt
      301 caaggaaccg ttctgagtgt tata
//
LOCUS       MUSTCAXV      210 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma 1E1O.
ACCESSION   M34214
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 210)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   210     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       54 a     53 c     50 g     53 t
ORIGIN      
        1 gggggaagtc tcgtcagcct gttctacaat ccttctggga caaagcagag tgggagactg
       61 acatccacta cagtcatcaa agaacgtcgc agctctttgc acatttcctc ctcccagaca
      121 acagactcag gcacttatct ctgtgctatg gcggctactg gaggcaataa taagctgact
      181 tttggtcaag gaaccgttct gagtgttata
//
LOCUS       MUSTCAXW      234 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma 7/6AH1.
ACCESSION   M34216
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 234)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   234     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       55 a     59 c     58 g     62 t
ORIGIN      
        1 atgcagtggt tttatcaacg tcctggggga agtctcgtca gcctgttcta caatccttct
       61 gggacaaagc agagtgggag actgacatcc actacagtca tcaaagaacg tcgcagctct
      121 ttgcacattt cctcctccca gacaacagac tcaggcactt atctctgtgc tatgggtgta
      181 tctggtagct tcaataagtt gacctttgga gcagggacca gactggctgt gtgc
//
LOCUS       MUSTCAXX      312 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active alpha-chain mRNA V-region,
            partial cds, from hybridoma 2B11.
ACCESSION   M34218
KEYWORDS    T-cell receptor; T-cell receptor alpha-chain; variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   312     T-cell receptor alpha-chain (AA at 1)
BASE COUNT       89 a     79 c     79 g     64 t      1 others
ORIGIN      
        1 aatccgtggg ccctgagngt ccacgagggt gaaagtgtca cggtgaattg tagttacaag
       61 acatccataa ctgccctaca gtggtacaga cagaagtcag gcgaaggccc tgcccagcta
      121 atcttaatac gttcaaatga gagagagaag cgcaatggaa gactcagagc cacccttgac
      181 acctccagcc agagcagctc cttgtccatc actgctactc ggtgtgaaga caccgctgtg
      241 tacttctgtg ctactgagac aggcaatact agaaaacaca tctttgggct ggggacaact
      301 ttgcaagtgc aa
//
LOCUS       MUSTCBYAO     153 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma LD1.
ACCESSION   M34195
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 153)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   153     T-cell receptor beta-chain (AA at 1)
    recomb      122      123     V-region end/J-region start
BASE COUNT       39 a     37 c     36 g     41 t
ORIGIN      
        1 caaataggag atgtccctga tgggtacaag gccaccagaa caacgcaaga agacttcttc
       61 ctcctgctgg aattggcttc tccctctcag acatctttgt acttctgtgc cagcagtgta
      121 ggttctggaa atacgctcta ttttggagaa gga
//
LOCUS       MUSTCBYAP     111 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma LD3.
ACCESSION   M34197
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   111     T-cell receptor beta-chain (AA at 1)
    recomb       73       74     V-region end/J-region start
BASE COUNT       18 a     37 c     25 g     31 t
ORIGIN      
        1 ttcctcctgc tggaattggc ttctccctct cagacatctt tgtacttctg tgccgcgtcc
       61 ccgacaggga acaccgacta caccttcggc tcagggacca ggcttttggt a
//
LOCUS       MUSTCBYAQ     321 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-14.
ACCESSION   M34199
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   321     T-cell receptor beta-chain (AA at 1)
    recomb      292      293     V-region end/J-region start
BASE COUNT      106 a     81 c     61 g     73 t
ORIGIN      
        1 gacccgaaaa ttatccagaa accaaaatat ctggtggcag tcacagggag cgaaaaaatc
       61 ctgatatgcg aacagtatct aggccacaat gctatgtatt ggtatagaca aagtgctaag
      121 aagcctctag agttcatgtt ttcctacagc tatcaaaaac ttatggacaa tcagactgcc
      181 tcaagtcgct tccaacctca aagttcaaag aaaaaccatt tagaccttca gatcacagct
      241 ctaaagcctg atgactcggc cacatacttc tgtgccagca gccccaagac acgtcaaaac
      301 accttgtact ttggtgcggg c
//
LOCUS       MUSTCBYAR     210 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-27.
ACCESSION   M34201
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 210)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   210     T-cell receptor beta-chain (AA at 1)
    recomb      167      168     V-region end/J-region start
BASE COUNT       57 a     56 c     43 g     54 t
ORIGIN      
        1 aagattatgt ttagctacaa taataagcaa ctcattgtaa acgaaacagt tccaaggcgc
       61 ttctcacctc agtcttcaga taaagctcat ttgaatcttc gaatcaagtc tgtagagccg
      121 gaggactctg ctgtgtatct ctgtgccagc agctatcgga caccccccta tgctgagcag
      181 ttcttcggac cagggacacg actcaccgtc
//
LOCUS       MUSTCBYAS     255 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-6.
ACCESSION   M34203
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 255)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   255     T-cell receptor beta-chain (AA at 1)
    recomb      219      220     V-region end/J-region start
BASE COUNT       57 a     68 c     77 g     53 t
ORIGIN      
        1 cagtatccct ggatgagctg gtatcagcag gatctccaaa agcaactaca gtggctgttc
       61 actctgcgga gtcctgggga caaagaggtc aaatctcttc ccggtgctga ttacctggcc
      121 acacgggtca ctgatacgga gctgaggctg caagtggcca acatgagcca gggcagaacc
      181 ttgtactgca cctgcagtgc ggggactggg ggggctacta acaccttgta ctttggtgcg
      241 ggcacccgac tatcg
//
LOCUS       MUSTCBYAT     213 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, T-cell clone V2.1.
ACCESSION   M34205
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 213)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   213     T-cell receptor beta-chain (AA at 1)
    recomb      171      172     V-region end/J-region start
BASE COUNT       48 a     59 c     58 g     48 t
ORIGIN      
        1 ctgaggctga tccattattc atatggtgct ggcagcactg agaaaggaga tatccctgat
       61 ggatacaagg cctccagacc aagccaagag aacttctccc tcattctgga gttggctacc
      121 ccctctcaga catcagtgta cttctgtgcc agcggtggcg gccgggggag ttatgctgag
      181 cagttcttcg gaccagggac acgactcacc gtc
//
LOCUS       MUSTCBYAU     207 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-33.
ACCESSION   M34207
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 207)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   207     T-cell receptor beta-chain (AA at 1)
    recomb      163      164     V-region end/J-region start
BASE COUNT       65 a     54 c     36 g     52 t
ORIGIN      
        1 ctagagttca tgttttccta cagctatcaa aaacttatgg acaatcagac tgcctcaagt
       61 cgcttccaac ctcaaagttc aaagaaaaac catttagacc ttcagatcac agctctaaag
      121 cctgatgact cggccacata cttctgtgcc agcagcaaaa gggccaacga aagattattt
      181 ttcggtcatg gaaccaagct gtctgtc
//
LOCUS       MUSTCBYAV     156 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma MT1-7.
ACCESSION   M34209
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   156     T-cell receptor beta-chain (AA at 1)
    recomb      118      119     V-region end/J-region start
BASE COUNT       30 a     48 c     40 g     38 t
ORIGIN      
        1 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg
       61 gcttctccct ctcagacatc tttgtacttc tgtgccagca gtgtccgggt ctgggggcct
      121 gaacagtact tcggtcccgg caccaggctc acggtt
//
LOCUS       MUSTCBYAW     132 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma P1F12C4.
ACCESSION   M34211
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 132)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   132     T-cell receptor beta-chain (AA at 1)
    recomb       94       95     V-region end/J-region start
BASE COUNT       35 a     39 c     28 g     30 t
ORIGIN      
        1 ccaagccaag agaacttctc cctcattctg gagttggcta ccccctctca gacatcagtg
       61 tacttctgtg ccagcggtgc cagacaggca aacacagaag tcttctttgg taaaggaacc
      121 agactcacag tt
//
LOCUS       MUSTCBYAX     303 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma P1D3A6.
ACCESSION   M34213
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 303)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   303     T-cell receptor beta-chain (AA at 1)
    recomb      259      260     V-region end/J-region start
BASE COUNT       80 a     72 c     79 g     72 t
ORIGIN      
        1 aaggtgacag taacaggagg aaacgtgaca ttgagctgtc gccagactaa tagccacaac
       61 tacatgtact ggtatcggca ggacactggg catgggctga ggctgatcca ttactcatat
      121 ggtgctggca accttcaaat aggagatgtc cctgatgggt acaaggccac cagaacaacg
      181 caagaagact tcttcctcct gctggaattg gcttctccct ctcagacatc tttgtacttc
      241 tgtgccagca gtgcaggagc tggaaatacg ctctattttg gagaaggaag ccggctcatt
      301 gtt
//
LOCUS       MUSTCBYAY     159 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma 1E1O.
ACCESSION   M34215
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 159)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   159     T-cell receptor beta-chain (AA at 1)
    recomb      113      114     V-region end/J-region start
BASE COUNT       35 a     40 c     39 g     45 t
ORIGIN      
        1 gatgtccctg atgggtacaa ggccaccaga acaacgcaag aagacttctt cctcctgctg
       61 gaattggctt ctccctctca gacatctttg tacttctgtg ccagcagtgt gggttctgga
      121 aatacgctct attttggaga aggaagccgg ctcattgtt
//
LOCUS       MUSTCBYAZ     321 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma 7/6AH1.
ACCESSION   M34217
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   321     T-cell receptor beta-chain (AA at 1)
    recomb      275      276     V-region end/J-region start
BASE COUNT       86 a     78 c     81 g     76 t
ORIGIN      
        1 acccaaagcc ctagaaacaa ggtgacagta acaggaggaa acgtgacatt gagctgtcgc
       61 cagactaata gccacaacta catgtactgg tatcggcagg acactgggca tgggctgagg
      121 ctgatccatt actcatatgg tgctggcaac cttcaaatag gagatgtccc tgatgggtac
      181 aaggccacca gaacaacgca agaagacttc ttcctcctgc tggaattggc ttctccctct
      241 cagacatctt tgtacttctg tgccagcagt gtgggttctg gaaatacgct ctattttgga
      301 gaaggaagcc ggctcattgt t
//
LOCUS       MUSTCBYBA     339 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse T-cell receptor active beta-chain mRNA V-J-region,
            partial cds, from hybridoma 2B11.
ACCESSION   M34219
KEYWORDS    T-cell receptor; T-cell receptor beta-chain; joining exon;
            variable region.
SOURCE      Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J.
  TITLE     Structurally diverse T cells can recognize an influenza antigen/MHC
            complex in the same common orientation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.H.Taylor, 16-MAY-1990.
            
              Author address: A.H.Taylor
              Wistar Institute of Anatomy and Biology
              3601 Spruce Street
              Philadelphia, PA 19104
              Tel: (215) 898-3839
            
FEATURES       from  to/span     description
    pept    <     1  >   339     T-cell receptor beta-chain (AA at 1)
    recomb      294      295     V-region end/J-region start
BASE COUNT       90 a     88 c     88 g     73 t
ORIGIN      
        1 gaggctgcag tcacccaaag ccctagaaac aaggtgacag taacaggagg aaacgtgaca
       61 ttgagctgtc gccagactaa tagccacaac tacatgtact ggtatcggca ggacactggg
      121 catgggctga ggctgatcca ttactcatat ggtgctggca accttcaaat aggagatgtc
      181 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg
      241 gcttctccct ctcagacatc tttgtacttc tgtgccagca ggagacaggg gcctagtcaa
      301 aacaccttgt actttggtgc gggcacccga ctatcggtg
//
LOCUS       CEACAEVA      264 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Caprine arthritis-encephalitis lentivirus tat protein gene,
            complete cds.
ACCESSION   M34092
KEYWORDS    tat protein.
SOURCE      Caprine arthritis-encephalitis lentivirus (strain Cork) DNA, from
            goat synovial membrane, clone pCol.9.
  ORGANISM  Caprine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
            Robinson,M.M. and Cheevers,W.P.
  TITLE     Genetic structure of the pol-env region of the Caprine arthritis-
            encephalitis lentivirus genome: Possible role in trans-activation
            of the viral long terminal repeat
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.P.Cheevers, 08-MAY-1990.
            
              Author address: W.P.Cheevers
              Washington State University
              Dept Veterinary Microbiology
              Pullman, WA 94164-7040
              email: b384@wsuvmsl.bitnet
            
FEATURES       from  to/span     description
    pept          1      264     tat protein
BASE COUNT       97 a     40 c     80 g     47 t
ORIGIN      
        1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta
       61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg
      121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg
      181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta
      241 tgtaacccgg ggtggggaac ataa
//
LOCUS       CEACAEVB      264 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Caprine arthritis-encephalitis lentivirus tat protein gene,
            complete cds.
ACCESSION   M34093
KEYWORDS    tat protein.
SOURCE      Caprine arthritis-encephalitis lentivirus (strain G63) DNA, from
            goat synovial membrane, clone pC63-49.
  ORGANISM  Caprine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G.,
            Robinson,M.M. and Cheevers,W.P.
  TITLE     Genetic structure of the pol-env region of the Caprine arthritis-
            encephalitis lentivirus genome: Possible role in trans-activation
            of the viral long terminal repeat
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.P.Cheevers, 08-MAY-1990.
            
              Author address: W.P.Cheevers
              Washington State University
              Dept Veterinary Microbiology
              Pullman, WA 94164-7040
              email: b384@wsuvmsl.bitnet
            
FEATURES       from  to/span     description
    pept          1      264     tat protein
BASE COUNT       99 a     43 c     75 g     47 t
ORIGIN      
        1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata
       61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg
      121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg
      181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta
      241 tgtaacccgg ggtgggaaac ataa
//
LOCUS       FLAPR834HA    540 bp ss-RNA             VRL       17-JUL-1990
DEFINITION  Influenza A/PR/8/34, hemagglutinin (seg 4) gene. partial cds.
ACCESSION   M34335
KEYWORDS    glycoprotein; hemagglutinin.
SOURCE      Influenza A/PR/8/34 RNA, passed in bovine MBDK cells, originally
            from human.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 540)
  AUTHORS   Bressoud,A., Whitcomb,J., Pourzand,C., Haller,O. and Cerutti,P.
  TITLE     Rapid detection of influenza virus H1 by the polymerase chain
            reaction
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 425-430 (1990)
  STANDARD  simple staff_review
COMMENT     Sequence reported is + strand.
FEATURES       from  to/span     description
    pept    <     1  >   540     hemagglutinin (AA at 2)
BASE COUNT      179 a    121 c    123 g    117 t
ORIGIN      
        1 cctactggtc ctgttatgtg cacttgcagc tgcagatgca gacacaatat gtataggcta
       61 ccatgcgaac aattcaaccg acactgttga cacagtactc gagaagaatg tgacagtgac
      121 acactctgtt aacctgctcg aagacagcca caacggaaaa ctatgtagat taaaaggaat
      181 agccccacta caattgggga aatgtaacat cgccggatgg ctcttgggaa acccagaatg
      241 cgacccactg cttccagtga gatcatggtc ctacattgta gaaacaccaa actctgagaa
      301 tggaatatgt tatccaggag atttcatcga ctatgaggag ctgagggagc aattgagctc
      361 agtgtcatca ttcgaaagat tcgaaatatt tcccaaagaa agctcatggc ccaaccacaa
      421 cacaaacgga gtaacggcag catgctccca tgaggggaaa agcagttttt acagaaattt
      481 gctatggctg acggagaagg agggctcata cccaaagctg aaaaattctt atgtgaacaa
//
LOCUS       HUMGPPSBAA    355 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human pregnancy-specific beta-1 glycoprotein C-D gene, intron C1.
ACCESSION   M34422
KEYWORDS    beta-1 glycoprotein.
SOURCE      Human placenta, clone PS-beta-G C.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 355)
  AUTHORS   Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
  TITLE     structure, evolution and chromosomal localization of the human
            pregnancy-specific beta-1 glycoprotein gene family
  JOURNAL   Genomics 6, 579-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    IVS           1      355     pregnancy-specific beta-1 glycoprotein intron
                                 C1
BASE COUNT      110 a     64 c     62 g    119 t
ORIGIN      Chromosome 19.
        1 gtaagtggat cccagcatcg ttggcaatag ggttttaggt ggagtctatc tggcattcag
       61 agaagagtca ggaaaacaat tgtattccca gcctgtgtcc catgggcaca agcaaatccc
      121 aaattctcct cctgaaccct ccaaatttgt ctaagaactt cgaaaacttt aacaaacagg
      181 ctgatatctt cataatattc ccagcctaga ccaagcagga agaacattga tttcattgaa
      241 ataattgata ataatgaaga taatgttttt atgattttta tttgaaaatt tgctgattct
      301 ttaaatggtt tgttttctac attgatggaa tttttctctt ttaatctatc tacag
//
LOCUS       HUMGPPSBD    1418 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION   M34421
KEYWORDS    beta-1 glycoprotein.
SOURCE      Human placenta, cDNA to mRNA, clone PS-beta-G B.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1418)
  AUTHORS   Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
  TITLE     Structure, evolution and chromosomal localization of the human
            pregnancy-specific beta-1 glycoprotein gene family
  JOURNAL   Genomics 6, 579-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         79     1359     pregnancy-specific beta-1 glycoprotein
                                 precursor /hgml_locus_uid="LG0073P"
                                 /nomgen="PSBG1" /map="19q13.1-q13.2"
    sigp         79      180     pregnancy-specific beta-1 glycoprotein signal
                                 peptide
    matp        181     1356     pregnancy-specific beta-1 glycoprotein
    mRNA          1     1418     pregnancy-specific beta-1 glycoprotein mRNA
BASE COUNT      408 a    398 c    296 g    316 t
ORIGIN      Chromosome 19.
        1 cagctgacag ccgtgctcag acagcttctg gatcctaggc tcatctccac agaggagaac
       61 acgcaggcag cagagaccat ggggcccctc ccagcccctt cctgcacaca gcgcatcacc
      121 tggaaggggc tcctgctcac agcatcactt ttaaacttct ggaacccgcc caccactgcc
      181 gaagtcacga ttgaagccca gccacccaaa gtttctgagg ggaaggatgt tcttctactt
      241 gtccacaatt tgccccagaa tcttcctggc tacttctggt acaaagggga aatgacggac
      301 ctctaccatt acattatatc gtatatagtt gatggtaaaa taattatata tgggcctgca
      361 tacagtggaa gagaaacagt atattccaac gcatccctgc tgatccagaa tgtcacccgg
      421 aaggatgcag gaacctacac cttacacatc ataaagcgag gtgatgagac tagagaagaa
      481 attcgacatt tcaccttcac cttatacttg gagactccca agccctacat ctccagcagc
      541 aacttaaacc ccagggaggc catggaggct gtgcgcttaa tctgtgatcc tgagactctg
      601 gacgcaagct acctatggtg gatgaatggt cagagcctcc ctgtgactca caggttgcag
      661 ctgtccaaaa ccaacaggac cctctatcta tttggtgtca caaagtatat tgcaggaccc
      721 tatgaatgtg aaatacggaa cccagtgagt gccagtcgca gtgacccagt caccctgaat
      781 ctcctcccga agctgcccat cccctacatc accatcaaca acttaaaccc cagggagaat
      841 aaggatgtct tagccttcac ctgtgaacct aagagtgaga actacaccta catttggtgg
      901 ctaaacggtc agagcctccc cgtcagtccc ggggtaaagc gacccattga aaacaggata
      961 ctcattctac ccagtgtcac gagaaatgaa acaggaccct atcaatgtga aatacgggac
     1021 cgatatggtg gcctccgcag taacccagtc atcctaaatg tcctctatgg tccagacctc
     1081 cccagaattt acccttcatt cacctattac cgttcaggag aaaacctcga cttgtcctgc
     1141 ttcacggaat ctaacccacc ggcagagtat ttttggacaa ttaatgggaa gtttcagcaa
     1201 tcaggacaaa agctctttat cccccaaatt actagaaatc atagcgggct ctatgcttgc
     1261 tctgttcata actcagccac tggcaaggaa atctccaaat ccatgacagt caaagtctct
     1321 ggtccctgcc atggagacct gacagagtct cagtcatgac tgcaacaact gagacactga
     1381 gaaaaagaac aggctgatac cttcatgaaa ttcaagac
//
LOCUS       HUMGPPSBE    1856 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION   M34420
KEYWORDS    beta-1 glycoprotein.
SOURCE      Human placenta, cDNA to mRNA, clone PS-beta-G A.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1856)
  AUTHORS   Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G.
  TITLE     Structure, evolution and chromosomal localization of the human
            pregnancy-specific beta-1 glycoprotein gene family
  JOURNAL   Genomics 6, 579-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         82     1368     pregnancy-specific beta-1 glycoprotein
                                 precursor /hgml_locus_uid="LG0073P"
                                 /nomgen="PSBG1" /map="19q13.1-q13.2"
    sigp         82      183     pregnancy-specific beta-1 glycoprotein signal
                                 peptide
    matp        184     1365     pregnancy-specific beta-1 glycoprotein
    mRNA          1     1856     pregnancy-specific beta-1 glycoprotein mRNA
BASE COUNT      544 a    476 c    364 g    472 t
ORIGIN      Chromosome 19.
        1 gcacagctga gagccatgct caggaagttt ctggatccta ggctcagctc cacagaggag
       61 aacacgcagg cagcagagac catggggccc ctctcagccc ctccctgcac acagcgcatc
      121 acctggaagg ggctcctgct cacagcatca cttttaaact tctggaaccc gcctaccact
      181 gcccaagtca cgattgaagc cgagccaacc aaagtttcca aggggaagga cgttcttcta
      241 cttgtccaca atttgcccca gaatcttgct ggctacatct ggtacaaagg gcaaatgaag
      301 gacctctacc attacattac atcatacgta gtagatggtc aaataattat atatgggcct
      361 gcatacagtg gacgagaaac agtatattcc aatgcatccc tgctgatcca gaatgtcacc
      421 cgggaggacg caggatccta caccttacac atcgtaaagc gaggtgatgg gactagagga
      481 gaaactggac atttcacctt caccttatac ctggagactc ccaagccctc catctccagc
      541 agcaacttat accccaggga ggacatggag gctgtgagct taacctgtga tcctgagact
      601 ccggacgcaa gctacctgtg gtggatgaat ggtcagagcc tccctatgac tcacagcttg
      661 cagttgtcca aaaacaaaag gaccctcttt ctatttggtg tcacaaagta cactgcagga
      721 ccctatgaat gtgaaatacg gaacccagtg agtgccagcc gcagtgaccc agtcaccctg
      781 aatctcctcc cgaagctgcc caagccctac atcaccatca acaacttaaa ccccagggag
      841 aataaggatg tcttagcctt cacctgtgaa cctaagagtg agaactacac ctacatttgg
      901 tggctaaatg gtcagagcct cccggtcagt cccagggtaa agcgacccat tgaaaacagg
      961 atcctcattc tacccagtgt cacgagaaat gaaacaggac cctatcaatg tgaaatacag
     1021 gaccgatatg gtggcatccg cagttaccca gtcaccctga atgtcctcta tggtccagac
     1081 ctccccagaa tttacccttc attcacctat taccattcag gagaaaacct ctacttgtcc
     1141 tgcttcgcgg actctaaccc accagcagaa tattcttgga caattaatgg gaagtttcag
     1201 ctatcaggac aaaagctctt tatcccccag attactacaa agcatagcgg gctctatgct
     1261 tgctctgttc gtaactcagc cactggcatg gaaagctcca aatccatgac agtcaaagtc
     1321 tctgctcctt caggaacagg acatcttcct ggccttaatc cattatagca gccgtgatgt
     1381 catttctgta tttcaggaag actggcagac agttgctttc attcttcctc aaagtattta
     1441 ccatcagcta cagtccaaaa ttgctttttg ttcaaggaga tttatgaaaa gactctgaca
     1501 aggactcttg aatacaagtt cctgataact tcaagatcat accactggac taagaacttt
     1561 caaaatttta atgaacaggc tgatacttca tgaaattcaa gacaaagaaa aaaacccaat
     1621 tttattggac taaatagtca aaacaatgtt ttcataattt tctatttgaa aatgtgctga
     1681 ttctttgaat gttttattct ccagatttat gcactttttt tcttcagcaa ttggtaaagt
     1741 atacttttgt aaacaaaaat tgaaacattt gcttttgctc cctaagtgcc ccagaattgg
     1801 gaaactattc aggagtattc atatgtttat ggtaataaag ttatctgcac aagttc
//
LOCUS       HUMGPPSBF    2004 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds.
ACCESSION   M23575
KEYWORDS    beta-1 glycoprotein.
SOURCE      Human placenta, cDNA to mRNA, clone pSP1-i.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2004)
  AUTHORS   Rooney,B.C., Horne,C.H.W. and Hardman,N.
  TITLE     Molecular cloning of a cDNA for human pregnancy-specific
            beta-1-glycoprotein: Homology with human carcinoembryonic antigen
            and related proteins
  JOURNAL   Gene 71, 439-449 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        232     1518     pregnancy-specific beta-1 glycoprotein
                                 precursor /hgml_locus_uid="LG0073P"
                                 /nomgen="PSBG1" /map="19q13.1-q13.2"
    sigp        232      333     pregnancy-specific beta-1 glycoprotein signal
                                 peptide
    matp        334     1515     pregnancy-specific beta-1 glycoprotein
BASE COUNT      579 a    510 c    406 g    509 t
ORIGIN      
        1 gggcgggaca actggtctga gtactatggc tgattttcgc tgtctggcat tgagaagcca
       61 cacgcccctt ttgcttagga ggcctctctg ctggaggatg acgatggcat ggtttatcta
      121 aggccactga caagtcatca atataggaca gcacagctga gagccatgct caggaagttt
      181 ctggatccta ggctcagctc cacagaggag aacacgcagg cagcagagac catggggccc
      241 ctctcagccc ctccctgcac acagcgcatc acctggaagg ggctcctgct cacagcatca
      301 cttttaaact tctggaaccc gcctaccact gcccaagtca cgattgaagc cgagccaacc
      361 aaagtttcca aggggaagga cgttcttcta cttgtccaca atttgcccca gaatcttgct
      421 ggctacatct ggtacaaagg gcaaatgaag gacctctacc attacattac atcatacgta
      481 gtagatggtc aaataattat atatgggcct gcatacagtg gacgagaaac agtatattcc
      541 aatgcatccc tgctgatcca gaatgtcacc cgggaggacg caggatccta caccttacac
      601 atcgtaaagc gaggtgatgg gactagagga gaaactggac atttcacctt caccttatac
      661 ctggagactc ccaagccctc catctccagc agcaacttat accccaggga ggacatggag
      721 gctgtgagct taacctgtga tcctgagact ccggacgcaa gctacctgtg gtggatgaat
      781 ggtcagagcc tccctatgac tcacagcttg cagttgtcca aaaacaaaag gaccctcttt
      841 ctatttggtg tcacaaagta cactgcagga ccctatgaat gtgaaatacg gaacccagtg
      901 agtgccagcc gcagtgaccc agtcaccctg aatctcctcc cgaagctgcc caagccctac
      961 atcaccatca acaacttaaa ccccagggag aataaggatg tcttagcctt cacctgtgaa
     1021 cctaagagtg agaactacac ctacatttgg tggctaaatg gtcagagcct cccggtcagt
     1081 cccagggtaa agcgacccat tgaaaacagg atcctcattc tacccagtgt cacgagaaat
     1141 gaaacaggac cctatcaatg tgaaatacag gaccgatatg gtggcatccg cagttaccca
     1201 gtcaccctga atgtcctcta tggtccagac ctccccagaa tttacccttc attcacctat
     1261 taccattcag gagaaaacct ctacttgtcc tgcttcgcgg actctaaccc accagcagaa
     1321 tattcttgga caattaatgg gaagtttcag ctatcaggac aaaagctctt tatcccccag
     1381 attactacaa agcatagcgg gctctatgct tgctctgttc gtaactcagc cactggcatg
     1441 gaaagctcca aatccatgac agtcaaagtc tctgctcctt caggaacagg acatcttcct
     1501 ggccttaatc cattatagca gccgtgatgt catttctgta tttcaggaag actggcagac
     1561 agttgctttc attcttcctc aaagtattta ccatcagcta cagtccaaaa ttgctttttg
     1621 ttcaaggaga tttatgaaaa gactctgaca aggactcttg aatacaagtt cctgataact
     1681 tcaagatcat acatggacta agaactttca aaattttaat gaacaggctg atacttcatg
     1741 aaattcaaga caaagaaaaa aacccaattt tattggacta aatagtcaaa acaatgtttt
     1801 cataattttc tatttgaaaa tgtgctgatt ctttgaatgt tttattctcc agatttatgc
     1861 actttttttc ttcagcaatt ggtaaagtat acttttgtaa acaaaaattg aaacatttgc
     1921 ttttgctccc taagtgcccc agaattggga aactattcag gagtattcat atgtttatgg
     1981 taataaagtt atctgcacaa accc
//
LOCUS       HUMLEUELA     920 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human elastase/medullasin mRNA, complete cds.
ACCESSION   M34379
KEYWORDS    elastase; medullasin.
SOURCE      Human leukemic cell line ML3, cDNA to mRNA, clone pSRHLE.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 920)
  AUTHORS   Okano,K., Aoki,Y., Shimizu,H. and Naruto,M.
  TITLE     Functional expression of human leukocyte elastase (HLE)/medullasin
            in eukaryotic cells
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 1326-1332 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept         39      842     elastase/medullasin precursor (EC 3.4.21.37)
    sigp         39      119     elastase/medullasin signal peptide
    matp        126      839     elastase/medullasin
BASE COUNT      141 a    329 c    287 g    163 t
ORIGIN      
        1 gcacggaggg gcagagaccc cggagcccca gccccaccat gaccctcggc cgccgactcg
       61 cgtgtctttt cctcgcctgt gtcctgccgg ccttgctgct ggggggcacc gcgctggcct
      121 cggagattgt ggggggccgg cgagcgcggc cccacgcgtg gcccttcatg gtgtccctgc
      181 agctgcgcgg aggccacttc tgcggcgcca ccctgattgc gcccaacttc gtcatgtcgg
      241 ccgcgcactg cgtggcgaat gtaaacgtcc gcgcggtgcg ggtggtcctg ggagcccata
      301 acctctcgcg gcgggagccc acccggcagg tgttcgccgt gcagcgcatc ttcgaaaacg
      361 gctacgaccc cgtaaacttg ctcaacgaca tcgtgattct ccagctcaac gggtcggcca
      421 ccatcaacgc caacgtgcag gtggcccagc tgccggctca gggacgccgc ctgggcaacg
      481 gggtgcagtg cctggccatg ggctggggcc ttctgggcag gaaccgtggg atcgccagcg
      541 tcctgcagga gctcaacgtg acggtggtga cgtccctctg ccgtcgcagc aacgtctgca
      601 ctctcgtgag gggccggcag gccggcgtct gtttcgggga ctccggcagc cccttggtct
      661 gcaacgggct aatccacgga attgcctcct tcgtccgggg aggctgcgcc tcagggctct
      721 accccgatgc ctttgccccg gtggcacagt ttgtaaactg gatcgactct atcatccaac
      781 gctccgagga caacccctgt ccccaccccc gggacccgga cccggccagc aggacccact
      841 gagaagggct gcccgggtca cctcagctgc ccacacccac actctccagc atctggcaca
      901 ataaacattc tctgttttgt
//
LOCUS       MSGIS6110    1360 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  M.tuberculosis-50 complex IS6110 insertion sequence-like element.
ACCESSION   M29899
KEYWORDS    insertion sequence.
SOURCE      M.tuberculosis (strain H37RV) DNA (cosmid library pHC79), clone
            I21.
  ORGANISM  Mycobacterium tuberculosis
            Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae.
REFERENCE   1  (bases 1 to 1360)
  AUTHORS   Thierry,D., Cave,M.D., Eisenach,K.D., Crawford,J.T., Bates,J.H.,
            Gicquel,B. and Guesdon,J.L.
  TITLE     IS6110 an IS-like element of Mycobacterium tuberculosis-50 complex
  JOURNAL   Nucleic Acids Res. 18, 188-188 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Thierry 15-NOV-1989.
BASE COUNT      269 a    439 c    432 g    220 t
ORIGIN      
        1 cgatgaaccg ccccggcatg tccggagact ccagttcttg gaaaggatgg ggtcatgtca
       61 ggtggttcat cgaggaggta cccgccggag ctgcgtgagc gggcggtgcg gatggtcgca
      121 gagatccgcg gtcagcacga ttcggagtgg gcagcgatca gtgaggtcgc ccgtctactt
      181 ggtgttggct gcgcggagac ggtgcgtaag tgggtgcgcc aggcgcaggt cgatgccggc
      241 gcacggcccg ggaccacgac cgaagaatcc gctgagctga agcgcttagc ggcgggacaa
      301 cgccgaattg cgaagggcga acgcgatttt aaagaccgcg tcggctttct tcgcggccga
      361 gctcgaccgg ccagcacgct aattaacggt tcatcgccga tcatcagggc caccgcgagg
      421 gccccgatgg tttgcggtgg ggtgtcgagt cgatctgcac acagctgacc gagctgggtg
      481 tgccgatcgc cccatcgacc tactacgacc acatcaaccg ggagcccagc cgccgcgagc
      541 tgcgcgatgg cgaactcaag gagcacatca gccgcgtcca cgccgccaac tacggtgttt
      601 acggtgcccg caaagtgtgg ctaaccctga accgtgaggg catcgaggtg gccagatgca
      661 ccgtcgaacg gctgatgacc aaactcggcc tgtccgggac cacccgcggc aaagcccgca
      721 ggaccacgat cgctgatccg gccacagccc gtcccgccga tctcgtccag cgccgcttcg
      781 gaccaccagc acctaaccgg ctgtgggtag cagacctcac ctatgtgtcg acctgggcag
      841 ggttcgccta cgtggccttt gtcaccgacg cctacgtcgc aggatcctgg gctggcgggt
      901 cgcttccacg atggccacct ccatggtcct cgacgcgatc gagcaagcca tctggacccg
      961 ccaacaagaa ggcgtactcg acctgaaaga cgttatccac catacggata ggggatctca
     1021 gtacacatcg atccggttca gcgagcggct cgccgaggca ggcatccaac cgtcggtcgg
     1081 agcggtcgga agctcctatg acaatgcact agccgagacg atcaacggcc tatacaagac
     1141 cgagctgatc aaacccggca agccctggcg gtccatcgag gatgtcgagt tggccaccgc
     1201 gcgctgggtc gactggttca accatcgccg cctctaccag tactgcggcg acgtcccgcc
     1261 ggtcgaactc gaggctgcct actacgctca acgccagaga ccagccgccg gctgaggtct
     1321 cagatcagag agtctccgga ctcaccgggg cggttcacga
//
LOCUS       MUSIGHAAT     348 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig J558 family active H-chain mRNA V-J3 region
            from hybridoma CE5, partial cds.
ACCESSION   M34119
KEYWORDS    diversity exon; immunoglobulin heavy chain; processed gene.
SOURCE      Mouse (Balb/c) hybridoma CE5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
  TITLE     Identical D region sequences expressed by murine monoclonal
            antibodies specific for a human tumor-associated antigen
  JOURNAL   J. Immunol. 144, 1965-1968 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   348     Ig H-chain V-J3 region (AA at 1)
    recomb      294      295     J-region end/D-region start
    recomb      303      304     D-region end/J-region start
BASE COUNT       89 a     82 c     97 g     80 t
ORIGIN      Chromosome 12.
        1 caggttcagc tgcagcagtc tggagctgaa ctgatgaagc ctggggcctc agtgaagata
       61 tcctgcaagg ctactggcta cacattcagt aagtactgga tagagtgggt aaagcagagg
      121 cctggacatg gccttgagtg gattggagag attttacctg gaagtggtag tactaaccat
      181 gatgagaagt tcaagggcaa ggccacattc actgcagata catcctccaa cacagcctac
      241 atgcaactca gcagcctgac atctgaggac tctgccgtct attactgtgc aagagacggt
      301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS       MUSIGKCSR     321 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig active kappa-chain mRNA V-region from hybridoma GA733,
            partial cds.
ACCESSION   M34120
KEYWORDS    immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE      Mouse (Balb/c) hybridoma GA733, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
  TITLE     Identical D region sequences expressed by murine monoclonal
            antibodies specific for a human tumor-associated antigen
  JOURNAL   J. Immunol. 144, 1965-1968 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   321     Ig kappa-chain (AA at 1)
BASE COUNT       87 a     83 c     74 g     77 t
ORIGIN      Chromosome 6.
        1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagtgtcagc
       61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
      121 ggacaatctc ctaaactact gatttactcg gcatccgacc ggtacactgg agtccctgat
      181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
      241 gaagacctgg cagtttatta ctgtcaccaa cattatatta ctcctcggac gttcggtgga
      301 ggcaccaaac tggaaatcaa a
//
LOCUS       MUSIGKCSS     321 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig active kappa-chain mRNA V-region from hybridoma
            C017-1A, partial cds.
ACCESSION   M34121
KEYWORDS    immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE      Mouse (Balb/c) hybridoma C017-1A, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
  TITLE     Identical D region sequences expressed by murine monoclonal
            antibodies specific for a human tumor-associated antigen
  JOURNAL   J. Immunol. 144, 1965-1968 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   321     Ig kappa-chain (AA at 1)
BASE COUNT       89 a     77 c     78 g     77 t
ORIGIN      Chromosome 6.
        1 aacattgtaa tgacccaatc tcccaaatcc atgtccatgt cagtaggaga gagggtcacc
       61 ttgacctgca aggccagtga gaatgtggtt acttatgttt cctggtatca acagaaacca
      121 gagcaatctc ctaaactctt gatttacggg gcctccaacc ggtacactgg ggtccccgat
      181 cgcttcacag gtagtggatc tgcaacagat ttcactctga ccattagtag tgtgcaagct
      241 gaagaccttg cagattatca ctgtggacag ggttacagct atccgtacac gttcggaggg
      301 gggaccaagc tggaaataaa a
//
LOCUS       MUSIGKCST     318 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig active kappa-chain mRNA V-region from hybridoma
            CE5, partial cds.
ACCESSION   M34122
KEYWORDS    immunoglobulin light chain; kappa-immunoglobulin; processed gene.
SOURCE      Mouse (Balb/c) hybridoma CE5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H.
  TITLE     Identical D region sequences expressed by murine monoclonal
            antibodies specific for a human tumor-associated antigen
  JOURNAL   J. Immunol. 144, 1965-1968 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   318     Ig kappa-chain (AA at 1)
BASE COUNT       84 a     78 c     80 g     76 t
ORIGIN      Chromosome 6.
        1 gacattgtga tgacccagtc tcagaaattc atgtccacat cagtaggaga cagggtcggc
       61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaatca
      121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgag
      181 cgcttcgctg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
      241 gaagacctgg cagtttatta ctgtcatcaa cattatagta ctcggacgtt cggtggaggc
      301 accaagctgg aaatcaaa
//
LOCUS       PSERRSAA     1517 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  P.aeruginosa 16S ribosomal RNA.
ACCESSION   M34133
KEYWORDS    16S ribosomal RNA; ribosomal RNA; small subunit ribosomal RNA.
SOURCE      P.aeruginosa (strain 25330) ribosomal RNA.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1517)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept of Microbiology
              407 S. Goodwin Avenue
              Urbana, IL 61801
              email: carl@ninja.life.uiuc.edu
            
FEATURES       from  to/span     description
    rRNA          1     1517     16S rRNA
BASE COUNT      379 a    334 c    470 g    301 t     33 others
ORIGIN      
        1 ntactgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa
       61 gtcgagcgga tgaagggagc ttgctcctgg attcagcggc ggacgggtga gtaatgccta
      121 ggaatctgcc tgatagtggg ggataacgtc cggaaacggg cgctaatacc gcatacgtcc
      181 tgagggagaa agggggggat cctcggacct cacgctatcn gatgagccta ggtcggatta
      241 gctagttggt ggggtaaagg cctaccaagg cgacgatccg taactggtct gagaggacga
      301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaata
      361 ttggacaatg ggcgaaagcc ngatccagcc atgccgcgtg tgtgaagaag gtcttcggat
      421 tgtaaagcac tttaagttgg gaggaagggc agtaagttaa taccttgctg ttttgacgtt
      481 accaacagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac gaagggtgcg
      541 agcgttaatc ggaattactg ggcgtaaagc gcgcgtaggt ggttcagcaa gttggatgtg
      601 aaatccccgg gctcaacctg ggaactgcat ccnaaactac tgagctagag tacggtagag
      661 ggtggtggaa tttcctgtgt agcggtgaaa tgcgtagata taggaaggaa caccagtggc
      721 gaaggcgacc acctggactg atactgacac tgaggtgcga aagcgtgggg agcaaacagg
      781 attagatacc ctggtagtcc acgccgtaaa cgatgtcgac tagccgttgg gatccttgag
      841 atcttagtgg cgcagctaac gcgataagtc gaccgcctgg ggagtacggc cgcaaggtta
      901 aaactcaaat gaattgacgg gggcnngcac aagcggtgga gcatgtggtt taattcgaag
      961 caacgcgaag aaccttacct ggccttgaca tgctgagaac tttccagaga tggattggtg
     1021 ccttcgggaa ctcagacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt
     1081 gggttaagtc ccgtaacgag cgcaaccctt gtccttagtt accagcacct cgggtgggca
     1141 ctctaaggag actgccggtg acaaaccgga ggaaggtggg gatgacgtca agtcatcatg
     1201 gcccttacgg cnagggctac acacgtgcta caatggtcgg tacaaagggt tgcgaagccg
     1261 cgaggtggag ctaatcccat aaaaccgatc gtagtccgga tcgcagtctg caactcgact
     1321 gcgtgaagtc ggaatcgcta gtaatcgtga atcagaatgt cacggtgaat acgttcccgg
     1381 gccttgtaca caccgcccgt cacaccatgg gagtgggttg ctccagaagt agctagtcta
     1441 accgcaaggg ggacggttac cacggagtga ttcatgnnnn nnnnnnnnnn gtaacaagnn
     1501 nnnnnnnnnn gaacctg
//
LOCUS       RATNESTIN    5946 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Rat nestin mRNA, complete cds.
ACCESSION   M34384
KEYWORDS    intermediate filament protein; nestin.
SOURCE      Rat (strain E15) embryo central nervous system, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5946)
  AUTHORS   Lendahl,U., Zimmerman,L.B. and McKay,R.D.G.
  TITLE     CNS stem cells express a new class of intermediate filament protein
  JOURNAL   Cell 60, 585-595 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        128     5545     nestin (128 could be 161)
    mRNA          1     5495     nestin mRNA
BASE COUNT     1600 a   1401 c   1871 g   1074 t
ORIGIN      
        1 tgctggagtt ctccgcttcc gctgggtcac tgtcgccgct acttcttttc aacccctaaa
       61 agctccacgg gccactccct tctctagtgc tccacgtccg cttgccctcg ggggccagac
      121 cagcgacatg gagggttgcg tcggggaaga atcttttcag atgtgggagc tcaatcgacg
      181 cctggaggcc tacctgaccc gggtcaagac gctagaagag caaaaccagc tgctcagcgc
      241 cgagcttggg ggactccggg cgcagtccgg agacacctcc tggagagccc gagccgatga
      301 cgagctggca tccctgcgga tcctcgtcga tcagcgctgg cgggagaagc tcgaggctga
      361 ggtgcagcgc gacaaccttg cggaagagct ggagagcgtg gcgggccggt gccagcaggt
      421 gcggcttgct cgggagcgga ccgtccagga ggccgcctgc agccggcgcg cactcgaggc
      481 ggagaagaat gcgcggggct ggctgagcac ccaggcggcc gagctggagc gggagttaga
      541 ggctctgcga gccgcgcacg aggaggagcg cgcacacctg aacgcccagg ccgcctgtgc
      601 gcctcgccgg ccccccgcac cgccccaccg gatccccggt ccggcccccg aagtcgagga
      661 tctggccagg cgactaggcg aagtgtggcg cggggcggtg cgtgactacc aggagcgcgt
      721 ggctcacatg gaaagctcgc tgggtcaggc acgcgagcgg ctgagccaag ccgtgcgggg
      781 cgctcgggag tgtcgcttag aggtgcaaca gctgcaggct gatcgcgaca gcctccagga
      841 gcgcagagaa gcgctggaac agagattgga aggccgctgg caggaccggc tgcaggccac
      901 tgataagttc cagctggctg tggaagccct ggagcaggag aagcaaggtc tacagagtca
      961 gatcgctcag atcctggaag gtgggcagca actggcacac ctcaagatgt cccttagtct
     1021 ggaggtggct acatacagga ctctgctgga ggctgagaac tctcggttgc agacacctgg
     1081 acgaggttcc caggcttctc ttggctttct ggaccccaag ctgaagccga atttccttgg
     1141 gataccagag gaccagtacc tgggatctgt gctccctgcc ctcagcccca catccttccc
     1201 ttcccccttg cctaataccc ttgagactcc tgtgacagcc ttcctgaaga ctcaggagtt
     1261 ccttcaggcc agaaccccca ccttggccag cactcccatc ccacctatat ctgaggctcc
     1321 ctgtcctcca aatgcagagg tgagagccca ggaggtccct ctttctctgc tccagacaca
     1381 ggctccagag cccctttggc tgaaggccac agtgcctagt tcttctgcta tcctcccaga
     1441 actagaggaa cctgggggca agcagcaggg tcacttccct gatgatctga cctccttagc
     1501 cacaaacctc aaccctcacc accctacttt agaggctaaa gatggagaat ccagtgagtc
     1561 tagagtttct agcatattcc aggaagatga ggggcaaatc tgggaactgg tagagaaaga
     1621 agcagatata gaggtaaaag tagaaaacag ctcagcccag aaaacacaag aaagtggtct
     1681 ggacacagaa gaaacccagg attcccaggg acctttgcag aaggaaacac tgaaggctct
     1741 aggagaggag ccactgatgt ctctgaaaat ccagaactat gagacagcag ggaaagagaa
     1801 ttgcaattct tctacagaag gccacctggg aacactagaa ggcccagaaa aagaaaagca
     1861 aataccacta aagtctttag aagaaaagaa tgtagagtca gagaaaactc tagaaaatgg
     1921 ggttcctgta ctatctgagc ttttaggaaa agaagacaca agaacagagg atcaagaatt
     1981 aatgtctcct aaaggtacac taaagagatt ttcatctcta ggaaaggaaa gtcaagaagt
     2041 agtgaggcct tcaaaagagg ggaacctaga atcatggaca gcttttaaag aggagagcca
     2101 acacccactg ggatttccag gagctgagga ccagatgctt gagagactgg tagagaaaga
     2161 ggatcagagc ttcccaaggt ctccagagga agaggaccag gaggcatgta gacctctgca
     2221 gaaagagaat caggaaccac tagggtatga agaagcagag ggccagatac ttgagagact
     2281 gatagaaaaa gagagtcagg agtccctgag gtctccagaa gaagaggacc aggaggcagg
     2341 tagatctctg cagaaagaga atcaggagcc actagggtat gaagaagcag aggaccagat
     2401 gcttgagaga ctgatagaaa aagagagtca ggagtccctg aagtctccag aagaaaacca
     2461 gaggattggg aagcctctag aaagagagaa tcagaaatct ctgaggtatc ttgaagaaaa
     2521 ccaggagact tttgtaccac tagaaagcag gaaccagagg ccactgagat ctctagaagt
     2581 agaagaggag gagcagagaa ttgtgaaacc tctagaaaaa gtgagtcagg attccctcgg
     2641 atctctagca gaagagaatg tgcagccact gaggtatctg gaagaagatg actgcataaa
     2701 taagagcctt ctagaagaca agactcacaa gtccttgggg tctcttgaag atagaaatgg
     2761 ggatagcatt attataccac aagaaagtga gacccaggtt tcattgaggc ctccagaaga
     2821 ggaggaccag aggattgtga accatctaga aaaagaaagt caggagttct cgaggtcttc
     2881 agaagaagaa gagcaggtga tggagagatc tctagaagga gagaaccatg aatcactgag
     2941 ttctgtagaa aaagaggacc agatggttga gagccaacta gagaaagaga gtcaggactc
     3001 agggaagtct cttgaagatg agagccagga gacctttgga cctctggaaa aagagaatgc
     3061 agagtccctg agatctctag caggacagga ccaagaggaa cagaagcttg aacaagagac
     3121 ccaacaaaca ctgagggctg tagggaatga gcagatggca gtgagcccac cagaaaaggt
     3181 ggatccagag ttaccgaagc ctcttggaaa tgaccaggaa atagctagat ctcttggaaa
     3241 agagaatcaa gagtcactag tgtcactgaa agaaaaaggt atagagacag tgaagtcttt
     3301 agaaacagag atcatagaac cactggagac tgcagaagag gacctggaaa gaaggaagtc
     3361 tatagatact caggagccat tgtggtctac tgaagtggct agagagacag tagaacctcc
     3421 agaagatgag cccccaggat cgctagggtc tgtggatgag aaccgagaga cactgacatc
     3481 ccttgaaaag gagagtcaag aactgagctc tctgggcaag tggaacgtag agaccagggt
     3541 agaggacagt cagcagtgcc tgcaagtaga agagggtctg caggaggaac agcaccaaga
     3601 gtctctgaga gaggtgaagc aggagctgcc tagctctgga aatcaacagc ggtgggagga
     3661 tgtggtggag ggcaaagcag tgggtcagga agcacctctg gcaaccacag gagtgggaac
     3721 tgaggataag gcagagttgc atctgagggg gcaaggtgga gaggaagaag ctgcagcaga
     3781 gggagagctg ttgcaggata ttgtggggga ggcctggagt ctggggagct ctgagcccaa
     3841 ggagcagagg gtccctgctg aggccctcga caacctggaa ggaggggcct tagaggtccc
     3901 agttgctcag tcaatgccag aggtgacaga gcgagatgag gatagagccc aagcaggtga
     3961 acaagactcc atagaggtga cccttgggtt agaggctgcc agaactggac tggaactcga
     4021 gcaggaagtg gtagggctag aggacccaag gcattttgcc agggaggagg ccattccccc
     4081 atccctgggg gaggaaagtg tgaaggcaaa gatagctcag ggcttggaag ggcctggaaa
     4141 ggaaccaaaa gaggcaggtg ctctggactc ggggatcctt gaattgccca agactagcag
     4201 cgaggctctg gaatgccagg gccatgaaga gtctgagtcc atggagggct gggaagaaga
     4261 ggaggcctca ctggagactt cagatcatga gggcagtgat gcccctcagc ccaggccccc
     4321 agaaacagaa gaagatgagg gtgcacaggc agcactgaca gcccctggtc ccaagctctt
     4381 ggaaccctgt tcacccatcc caatcctgac agatgcccat gagctgcagc cccaggctga
     4441 ggggatccag gaggctggct ggcagccaga agctgggtct gaagcactag aaagggtaga
     4501 aaatgagcca gagtttggtc ttggggagat cccggagggc ctccaggatt gggaagaggg
     4561 cagagaagaa agcgaggcag atgatctagg ggaaactctc cctgactcta ctcccctggg
     4621 cctctacctg aggtcccctg cttctccaaa gtgggatctg gctggagaac agaggctttc
     4681 ccctcaaggg gatgccggga aggaagactg gggtcctgct gtccccgctg cccagggcct
     4741 cagtggtcca ccggaagagg aggaggagca aggccatggc tctgacctat catctgagga
     4801 gtttgaggac ctagggactg aggcctctct tcttccaggg gttcccaagg aggtggcaga
     4861 tcacgtgggc caagtgcccc cggtactgca gcctgcatgc tgggatcagg gtggggaatc
     4921 tgatgggttt gctgatgagg aagaaagtgg ggaggaggga gaggaagaag atgctgatga
     4981 ggaaggagca gagtcaggag ctcagtggtg ggggtcaggg gcctctggtg gaggctgcaa
     5041 ggtccaggat attgcccaaa gaggagaccc ggtacaggag tctgtgggtg tcagtggtct
     5101 ctgggatgat ggcttgagag gtgctgcagc taatgttcct gccctagaga tggtatctca
     5161 ggacagtgct gagccttctg ggtcagagga gtctgagtct gcttccttgg agggggagga
     5221 aggtcaagtg actgaccatt tagatgctcc ccaggaggtg accagcatgg tcccgggggt
     5281 aggagatgcc tttgacattg gtggccagag ccccaacttg gactcagaac aagtgaatgg
     5341 gaaaatggag aatggactag aacaggctga ggggcaggtg gtcctggatg gggacgagga
     5401 tcaagaactc ctattacagg gacaggaggt gggtgctcta aaggttcctt tggtagcatc
     5461 tcctgtgcat ctaggcccaa gccagcccct gaagttcact ctgagtgggg tagatgggga
     5521 ttcctggtcc tcaggggaag actagaaact gcccctctgg ctctgaggat gtactggtgg
     5581 ggatgtccct ccctgctctg ggtgaccact cttagctttg ataacttgac ccatggtatt
     5641 tgtcctggag agttgtggct gggctgagca agggaggtga gatcctcctg aaggctcagg
     5701 agttccaggc ctatagttct accccctctt tcttctgtgg ctcacctgct ggaagaggcc
     5761 tgggcccaga gctttcccac aaggctgttc tggccacagc ttgctagcct tgcctaccac
     5821 ctgcacaagg tctggtctgg tgtatgacca ggggagctga gggcagcatt tatctgaccc
     5881 ttcatctcag cctgctgaga gcttgttcct ctcttcctcc ctgaataaag ccgtatccct
     5941 acctac
//
LOCUS       CLLRRE       1860 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  C.sapidus 18S rRNA, 3' end.
ACCESSION   M34360
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      C.sapidus rRNA.
  ORGANISM  Callinectes sapidus
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; 
            Brachyrhyncha; Portunoidea; Portunidae.
REFERENCE   1  (bases 1 to 1860)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-113 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1860     18S ribosoma RNA
BASE COUNT      359 a    375 c    421 g    367 t    338 others
ORIGIN      
        1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct
       61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt
      121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta
      181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc
      241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc
      301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc
      361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac
      421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac
      481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat
      541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct
      601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg
      661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc
      721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata
      841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc
      901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc
      961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg
     1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt
     1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt
     1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn
     1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac
     1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg
     1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn
     1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct
     1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc
     1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga
     1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag
     1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc
     1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc
     1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn
     1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
//
LOCUS       HUMNCSRC      138 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human membrane-associated tyrosine protein kinase (C-SRC) gene,
            exons 3, 4, NI, and NII, partial cds.
ACCESSION   M34469
KEYWORDS    membrane-associated tryosine protein kinase.
SOURCE      Human adult brain DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 138)
  AUTHORS   Pyper,J.M. and Bolen,J.B.
  TITLE     Identification of a novel neuronal C-SRC exon expressed in human
            brain
  JOURNAL   Mol. Cell. Biol. 10, 2035-2040 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       29     membrane-associated tyrosine protein kinase
                                 (C-SRC), exon 3 (AA at 1)
                 30       47     membrane-associated tyrosine protein kinase,
                                 exon NI
                 48       80     membrane-associated tyrosine protein kinase,
                                 exon NII
                 81  >   136     membrane-associated tyrosine protein kinase,
                                 exon 4
    variant     117      117     a or g
    variant     135      135     c or t
BASE COUNT       36 a     40 c     40 g     22 t
ORIGIN      
        1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc
       61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga
      121 cagacaggct acatcccc
//
LOCUS       MHVAPEPA     1000 bp ss-RNA             VRL       17-JUL-1990
DEFINITION  Murine coronavirus peplomer (S) protein gene.
ACCESSION   M34435
KEYWORDS    peplomer protein.
SOURCE      Murine (strain JHM-DL) RNA.
  ORGANISM  Murine coronavirus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 1000)
  AUTHORS   Banner,L.R., Keck,J.G. and Lai,M.M.C.
  TITLE     A clustering of RNA recombination sites adjacent to a hypervariable
            region of the peplomer gene of Murine coronavirus
  JOURNAL   Virology 175, 548-555 (1990)
  STANDARD  simple staff_review
BASE COUNT      258 a    192 c    227 g    317 t      6 others
ORIGIN      
        1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag
       61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa
      121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct
      181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa
      241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta
      301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt
      361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca
      421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac
      481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg
      541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga
      601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg
      661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt
      721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt
      781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa
      841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa
      901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag
      961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa
//
LOCUS       MHVAPEPB      843 bp ss-RNA             VRL       17-JUL-1990
DEFINITION  Murine coronavirus peplomer (S) protein gene.
ACCESSION   M34436
KEYWORDS    peplomer protein.
SOURCE      Murine (strain A59) RNA.
  ORGANISM  Murine coronavirus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 843)
  AUTHORS   Banner,L.R., Keck,J.G. and Lai,M.M.C.
  TITLE     A clustering of RNA recombination sites adjacent to a hypervariable
            region of the peplomer gene of Murine coronavirus
  JOURNAL   Virology 175, 548-555 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site        400      401     deletion site
BASE COUNT      227 a    153 c    183 g    280 t
ORIGIN      
        1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag
       61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa
      121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct
      181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa
      241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta
      301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt
      361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt
      421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc
      481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc
      541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat
      601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt
      661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc
      721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact
      781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc
      841 gaa
//
LOCUS       MHVSP        3780 bp ss-mRNA            VRL       17-JUL-1990
DEFINITION  Mouse hepatitis virus surface protein S mRNA, complete cds.
ACCESSION   X04797 M34437
KEYWORDS    glycoprotein; membrane glycoprotein; surface glycoprotein;
            surface projection glycoprotein.
SOURCE      Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones
            pJMS1010, pJS112, and pJS92.
  ORGANISM  Murine hepatitis virus A59
            Unclassified.
REFERENCE   1  (bases 1 to 3780)
  AUTHORS   Schmidt,I., Skinner,M. and Siddell,S.
  TITLE     Nucleotide Sequence of the Gene Encoding the Surface Projection
            Glycoprotein of Coronavirus MHV-JHM
  JOURNAL   J. Gen. Virol. 68, 47-56 (1987)
  STANDARD  simple automatic
COMMENT        EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE         24     30       put. region of homology preceeding
                                            mRNA 5' initiation  codons
            
               SITE        121    129       pot. N-glycosylation site
               SITE        208    216       pot. N-glycosylation site
               SITE        430    438       pot. N-glycosylation site
               SITE        604    612       pot. N-glycosylation site
               SITE       1099   1107       pot. N-glycosylation site
               SITE       1333   1341       pot. N-glycosylation site
               SITE       1636   1644       pot. N-glycosylation site
               SITE       1732   1740       pot. N-glycosylation site
               SITE       1756   1764       pot. N-glycosylation site
               SITE       1825   1833       pot. N-glycosylation site
               SITE       1900   1914       put. proteolytic cleavage site
               SITE       1972   1980       pot. N-glycosylation site
               SITE       2023   2031       pot. N-glycosylation site
               SITE       2293   2301       pot. N-glycosylation site
               SITE       2440   2448       pot. N-glycosylation site
               SITE       3139   3147       pot. N-glycosylation site
               SITE       3301   3309       pot. N-glycosylation site
               SITE       3331   3339       pot. N-glycosylation site
               SITE       3388   3396       pot. N-glycosylation site
               SITE       3436   3444       pot. N-glycosylation site
               SITE       3499   3507       pot. N-glycosylation site
               SITE       3553   3654       pot. transmembrane domain
               SITE       3622   3675       cysteine-rich region
               SITE       3655   3660       charge cluster
               SITE       3715   3723       pot. N-glycosylation site
FEATURES       from  to/span     description
    pept         31     3738     surface protein S precursor
    sigp         31       60     surface protein S signal peptide
    matp         61     3736     surface protein S
BASE COUNT     1030 a    718 c    800 g   1232 t
ORIGIN      
        1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc
       61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat
      121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact
      181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct
      241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg
      301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac
      361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt
      421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct
      481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat
      541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag
      601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt
      661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg
      721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct
      781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt
      841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt
      901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt
      961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa
     1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact
     1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg
     1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt
     1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt
     1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt
     1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat
     1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat
     1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt
     1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct
     1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc
     1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct
     1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt
     1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat
     1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc
     1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt
     1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc
     1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat
     2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt
     2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat
     2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct
     2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct
     2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac
     2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag
     2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa
     2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg
     2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct
     2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta
     2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac
     2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag
     2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg
     2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca
     2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc
     2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc
     3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata
     3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca
     3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg
     3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat
     3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg
     3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt
     3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc
     3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag
     3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa
     3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt
     3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa
     3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt
     3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt
//
LOCUS       MUSIGLAZ      713 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds.
ACCESSION   M34598 M29013 J03562
KEYWORDS    constant region; immunoglobulin; immunoglobulin light chain;
            joining exon; lambda-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone
            Y31.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 713)
  AUTHORS   Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A.
  TITLE     Structure of a third murine immunoglobulin lambda light chain
            variable region that is expressed in laboratory mice
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 2 to 450)
  AUTHORS   Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
  TITLE     Mouse V-lambda-x gene sequence generates no junctonal diversity and
            is conserved in mammalian species
  JOURNAL   J. Immunol. 144, 2816-2820 (1990)
  STANDARD  simple staff_review
REFERENCE   3  (bases 266 to 429)
  AUTHORS   Sanchez,P. and Cazenave,P.-A.
  TITLE     A new variable region in mouse immunoglobulin lambda light chains
  JOURNAL   J. Exp. Med. 166, 265-270 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         12  >   713     Ig lambda chain precursor V-x,J-2,C-2 region
    sigp         12       68     Ig lambda chain signal peptide
    matp         69  >   713     Ig lambda chain
    recomb      379      380     V-region end/J2-region start
    recomb      414      415     J2-region end/C2-region start
BASE COUNT      175 a    190 c    159 g    189 t
ORIGIN      
        1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt
       61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag
      121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc
      181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca
      241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta
      301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa
      361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc
      421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca
      481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga
      541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca
      601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca
      661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct
//
LOCUS       MUSIGLVD      681 bp ds-DNA             ROD       17-JUL-1990
DEFINITION  Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds.
ACCESSION   M34597
KEYWORDS    constant region; germline; immunoglobulin light chain;
            joining exon; lambda-immunoglobulin; variable region.
SOURCE      Mouse (strain Balb/c AnPt) liver DNA, clone 30X2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 681)
  AUTHORS   Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
  TITLE     Mouse V-lambda-x gene sequence generates no junctonal diversity and
            is conserved in mammalian species
  JOURNAL   J. Immunol. 144, 2816-2820 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         59      104     Ig lambda-chain precursor Vx-J2-C2 region,
                                 exon 1
                209      531     Ig lambda-chain precursor Vx-J2-C2 region,
                                 exon 2
    sigp         59      104     Ig lambda-chain signal peptide
                209      219     Ig lambda-chain signal peptide
    matp        220      528     Ig lambda-chain
    IVS         105      208     Ig lambda-chain Vx-J2-C2 region intron A
    recomb      530      531     Vx-region end/J2-region start
    recomb      565      566     J2-region end/C2-region start
BASE COUNT      179 a    156 c    136 g    210 t
ORIGIN      Chromosome 16.
        1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat
       61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt
      121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc
      181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca
      241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag
      301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg
      361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct
      421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa
      481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa
      541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta
      601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa
      661 tcctcttgga cttcgtaggg c
//
LOCUS       PAERRE       1877 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  P.kadiakensis 18S rRNA, 3' end.
ACCESSION   M34359
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.kadiakensis rRNA.
  ORGANISM  Palaemonetes kadiakensis
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; 
            Palaemonoidea; Palaemonidae.
REFERENCE   1  (bases 1 to 1877)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-113 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1877     18S ribosoma RNA
BASE COUNT      339 a    321 c    399 g    313 t    505 others
ORIGIN      
        1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt
       61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt
      121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac
      181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg
      241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca
      301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct
      361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna
      421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga
      481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac
      541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn
      601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac
      841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna
      901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta
      961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta
     1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa
     1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt
     1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn
     1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc
     1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt
     1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn
     1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn
     1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg
     1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna
     1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg
     1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca
     1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact
     1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag
     1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn
     1861 nnnnnnnnnn nnnnnnn
//
LOCUS       PBESVBRA      584 bp ds-DNA             INV       17-JUL-1990
DEFINITION  P.berghei telomeric repeat region subfragment alpha DNA.
ACCESSION   M34601
KEYWORDS    .
SOURCE      P.berghei DNA, clone pTel.1.
  ORGANISM  Plasmodium berghei
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 584)
  AUTHORS   Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
  TITLE     Organization of subtelomeric repeats in Plasmodium berghei
  JOURNAL   Mol. Cell. Biol. 10, 2423-2427 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt         461      541     27 bp repeats
BASE COUNT      205 a     41 c     85 g    251 t      2 others
ORIGIN      
        1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac
       61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc
      121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata
      181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat
      241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat
      301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa
      361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt
      421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg
      481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg
      541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga
//
LOCUS       PBESVBRB      593 bp ds-DNA             INV       17-JUL-1990
DEFINITION  P.berghei telomeric repeat region subfragment a DNA.
ACCESSION   M34602
KEYWORDS    .
SOURCE      P.berghei DNA, clone pTel.1.
  ORGANISM  Plasmodium berghei
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 593)
  AUTHORS   Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
  TITLE     Organization of subtelomeric repeats in Plasmodium berghei
  JOURNAL   Mol. Cell. Biol. 10, 2423-2427 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt         484      510     27 bp repeat motif
BASE COUNT      209 a     40 c     97 g    247 t
ORIGIN      
        1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca
       61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac
      121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa
      181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt
      241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat
      301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat
      361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat
      421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg
      481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt
      541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg
//
LOCUS       PEURRE       1902 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  P.aztecus 18S rRNA, 3' end.
ACCESSION   M34362
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.aztecus rRNA.
  ORGANISM  Penaeus aztecus
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea; 
            Penaeidae.
REFERENCE   1  (bases 1 to 1902)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1902     18S ribosoma RNA
BASE COUNT      340 a    369 c    427 g    332 t    434 others
ORIGIN      
        1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt
       61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt
      121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg
      181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc
      241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg
      301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc
      361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn
      421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag
      481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga
      541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga
      601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg
      661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc
      721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn
      781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn
      841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg
      901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg
      961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna
     1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn
     1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg
     1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa
     1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn
     1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg
     1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac
     1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta
     1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct
     1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg
     1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact
     1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg
     1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt
     1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn
//
LOCUS       POCRRE       1874 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  P.ascensionis 18S rRNA, 3' end.
ACCESSION   M34358
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.ascensionis rRNA.
  ORGANISM  Procaris ascensionis
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; 
            Procaridoidea; Procarididae.
REFERENCE   1  (bases 1 to 1874)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1874     18S ribosoma RNA
BASE COUNT      298 a    291 c    331 g    266 t    688 others
ORIGIN      
        1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct
       61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt
      121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac
      181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt
      241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac
      301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn
      361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc
      481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa
      541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn
      601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt
      781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg
      841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat
      901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat
      961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga
     1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac
     1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt
     1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn
     1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca
     1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt
     1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta
     1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct
     1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag
     1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca
     1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn
     1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg
     1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg
     1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1861 nnnnnnnnnn nnnn
//
LOCUS       PRARRE       1869 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  P.leonensis 18S rRNA, 3' end.
ACCESSION   M34363
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.leonensis rRNA.
  ORGANISM  Procambarus leonensis
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea; 
            Astacoidea; Cambaridae.
REFERENCE   1  (bases 1 to 1869)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1869     18S ribosoma RNA
BASE COUNT      409 a    412 c    459 g    419 t    170 others
ORIGIN      
        1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt
       61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt
      121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat
      181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt
      241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc
      301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca
      361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg
      421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt
      481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact
      541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt
      601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag
      661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc
      721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn
      781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa
      841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg
      901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac
      961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg
     1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg
     1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg
     1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn
     1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa
     1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta
     1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact
     1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg
     1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg
     1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac
     1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa
     1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn
     1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt
     1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa
     1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn
     1861 nnnnnnnnn
//
LOCUS       PVIC1RPTA     711 bp ds-DNA             INV       17-JUL-1990
DEFINITION  P.vivax circumsporozoite protein gene, partial cds.
ACCESSION   M28745 M25758
KEYWORDS    circumsporozoite protein.
SOURCE      P.vivax sporozoite (isolate VK247) sporozoite DNA.
  ORGANISM  Plasmodium vivax
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 711)
  AUTHORS   Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
            Waters,A.P. and Prasittisuk,C.
  TITLE     Circumsporozoite protein heterogeneity in the human malaria
            parasite Plasmodium vivax
  JOURNAL   Science 245, 973-976 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Lanar, 25-JUN-1989.
FEATURES       from  to/span     description
    pept    <     1  >   711     circumsporozoite protein (AA at 1)
    site          1       15     region 1
    site        697      711     region 2
    site        502      597     post repeat variable region
    site        598      696     post repeat constant region
BASE COUNT      261 a    151 c    226 g     73 t
ORIGIN      
        1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat
       61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca
      121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca
      181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga
      241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc
      301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa
      361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga
      421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat
      481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca
      541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga
      601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta
      661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c
//
LOCUS       PVIC1RPTB     657 bp ds-DNA             INV       17-JUL-1990
DEFINITION  P.vivax circumsporozoite protein gene, partial cds.
ACCESSION   M28746 M25759
KEYWORDS    circumsporozoite protein.
SOURCE      P.vivax sporozoite (isolate VK210) DNA.
  ORGANISM  Plasmodium vivax
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 657)
  AUTHORS   Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
            Waters,A.P. and Prasittisuk,C.
  TITLE     Circumsporozoite protein heterogeneity in the human malaria
            parasite Plasmodium vivax
  JOURNAL   Science 245, 973-976 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Lanar, 25-JUN-1989.
FEATURES       from  to/span     description
    pept    <     1  >   657     circumsporozoite protein (AA at 1)
    site          1       15     region 1
    site        643      657     region 2
    rpt          16      477     27 bp repeats
    site        478      544     3' post repeat variable region
    site        545      642     3' post repeat constant region
BASE COUNT      229 a    135 c    224 g     69 t
ORIGIN      
        1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat
       61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa
      121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca
      181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat
      241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca
      301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga
      361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca
      421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga
      481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca
      541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa
      601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc
//
LOCUS       SHV2A         554 bp ss-RNA             VRL       17-JUL-1990
DEFINITION  Simian hepatitis A virus segment 2A-encoded protein mRNA,
            partial cds.
ACCESSION   M34085
KEYWORDS    .
SOURCE      Simian hepatitis A virus (strain PA21), cDNA to viral RNA.
  ORGANISM  Simian hepatitis A virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Picornaviridae.
REFERENCE   1  (bases 1 to 554)
  AUTHORS   Brown,E.A., Jansen,R.W. and Lemon,S.M.
  TITLE     Characterization of a Simian hepatitis A virus (HAV): Antigenic and
            genetic comparison with human HAV
  JOURNAL   Unpublished (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.A.Brown, 04-MAY-1990.
            
              Author address: E.A.Brown
              547 Burnett-Womack
              CB# 7030
              Chapel Hill, NC 27599
            
FEATURES       from  to/span     description
    pept    <     1  >   554     segment 2A-encoded protein (AA at 1)
BASE COUNT      184 a     71 c    138 g    161 t
ORIGIN      Segment 2A; map position 3108-3662.
        1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg
       61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag
      121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa
      181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg
      241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg
      301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg
      361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat
      421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt
      481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg
      541 tcaggaatac agga
//
LOCUS       SHVVP1CP     2373 bp ss-RNA             VRL       17-JUL-1990
DEFINITION  Simian hepatitis A virus capsid protein VP1 mRNA, partial cds.
ACCESSION   M34084
KEYWORDS    capsid protein VP1.
SOURCE      Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed
            in cwll line BS-C-1.
  ORGANISM  Simian hepatitis A virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Picornaviridae.
REFERENCE   1  (bases 1 to 2373)
  AUTHORS   Brown,E.A., Jansen,R.W. and Lemon,S.M.
  TITLE     Characterization of a Simian hepatitis A virus (HAV): Antigenic and
            genetic comparison with human HAV
  JOURNAL   J. Virol. 63, 4932-4937 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.A.Brown, 04-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1  >  2373     capsid protein VP1 (AA at 1)
BASE COUNT      661 a    447 c    491 g    774 t
ORIGIN      Segment P1; map position 735-3107.
        1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg
       61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact
      121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca
      181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga
      241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa
      301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg
      361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca
      421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct
      481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc
      541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat
      601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct
      661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca
      721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt
      781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat
      841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca
      901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag
      961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa
     1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg
     1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt
     1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc
     1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg
     1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag
     1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat
     1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct
     1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct
     1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag
     1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta
     1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca
     1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg
     1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt
     1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg
     1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg
     1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca
     1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg
     2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat
     2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc
     2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg
     2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg
     2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat
     2341 gatcctcgaa ctgaagagga tcgtaaattt gaa
//
LOCUS       STNRRE       1885 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  S.hispidus 18S rRNA, 3' end.
ACCESSION   M34361
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      S.hispidus rRNA.
  ORGANISM  Stenopus hispidus
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea; 
            Stenopodidae.
REFERENCE   1  (bases 1 to 1885)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustraceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1885     18S ribosoma RNA
BASE COUNT      327 a    336 c    430 g    368 t    424 others
ORIGIN      
        1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt
       61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact
      121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn
      181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta
      241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag
      301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt
      361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga
      421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn
      481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag
      541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag
      601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn
      661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc
      721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg
      841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg
      901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg
      961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt
     1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt
     1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg
     1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat
     1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg
     1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt
     1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga
     1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng
     1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt
     1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc
     1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg
     1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg
     1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn
     1861 nnnnnnnnnn nnnnnnnnnn nnnnn
//
LOCUS       SIVGAGFA1     541 bp ss-RNA             VRL       17-JUL-1990
DEFINITION  Simian immunodeficiency virus gag protein (smmgagFA1) gene, partial
            cds.
ACCESSION   M33754
KEYWORDS    gag protein.
SOURCE      Simian immunodeficiency virus (isolate smm) RNA, from Cercocebus
            atys peripheral blood mononuclear cells.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 541)
  AUTHORS   Villinger,F., Powell,J.D., Jehuda-Cohen,T., Neckelmann,N.,
            Vuchetich,M., De,B.K., Folks,T.M., McClure,H.M. and Ansari,A.A.
  TITLE     Detection of occult SIVsmm infection in asymptomatic seronegative
            nonhuman primates and evidence for variation in SIV gag sequence
            between endogenous and in vitro propagated virus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.J.Villinger, 18-APR-1990.
            
               Author address: F.J.Villinger
               Emory University,
               Department of Biology
               Winship, Cancer Center
               Atlanta GA, 30322
               Ph: (404) 248-4555.
FEATURES       from  to/span     description
    pept    <     1  >   541     gag protein (AA at 1)
BASE COUNT      191 a    109 c    126 g    115 t
ORIGIN      
        1 tttcaagcgc tatcagaagg ctgcactccc tatgacatca atcaaatgct aaattgtgta
       61 ggagaacatc aggcagccat gcaaattatt agagagatta taaatgaaga agctgctgat
      121 tgggatctac aacacccgca gccaggtcca ataccagcag tacaactcag agagccaaga
      181 ggatcagata ttgcaggaac cactagtaca gtggacgagc aaattcagtg gatgtatagg
      241 caacaaaatc ctgtaccagt gggaaatatt tacaggagat ggatccaact tggattacaa
      301 aagtgtgtca ggatgtataa tccaacaaac attctggacg taaaacaggg gccgaaagag
      361 ccattccaga gctatgtaga caggttctac aaaagcttaa gagcagaaca gacagatcca
      421 gcagtgaaga attggatgac ccaaacactg ctgattcaaa atgctaaccc agattgtaaa
      481 ttggtgctca agggtctggg tatgaatccc actttagaag aaatgctgac agcctgtcag
      541 g
//
LOCUS       SIVGAGFA2     335 bp ss-RNA             VRL       17-JUL-1990
DEFINITION  Simian immunodeficiency virus gag protein (smmgagFA2) gene, partial
            cds.
ACCESSION   M33755
KEYWORDS    gag protein.
SOURCE      Simian immunodeficiency virus (isolate smm) RNA, from Cercocebus
            atys peripheral blood mononuclear cells.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 335)
  AUTHORS   Villinger,F., Powell,J.D., Jehuda-Cohen,T., Neckelmann,N.,
            Vuchetich,M., De,B.K., Folks,T.M., McClure,H.M. and Ansari,A.A.
  TITLE     Detection of occult SIVsmm infection in asymptomatic seronegative
            nonhuman primates and evidence for variation in SIV gag sequence
            between endogenous and in vitro propagated virus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.J.Villinger, 18-APR-1990.
            
               Author address: F.J.Villinger
               Emory University,
               Department of Biology
               Winship, Cancer Center
               Atlanta GA, 30322
FEATURES       from  to/span     description
    pept    <     1  >   335     gag protein (AA at 1)
BASE COUNT      121 a     70 c     72 g     72 t
ORIGIN      
        1 tttcaagcgc tatcagaagg ctgcactccc tatgacatca atcaaatgct aaattgtgta
       61 ggagaacatc aggcagccat gcaaattatt agagagatta taaataaaga agctgctgat
      121 tgggatttac aacacccgca gccaggtcca ataccagcag gacaactcag agagccaaga
      181 ggatcagata ttgcaggaac cactagtaca gtggacgagc aaattcagtg gatgtatagg
      241 caacaaaatc ctgtaccagt cggaaacatt tacaggagat ggatccaact tgcattacaa
      301 aagtgtgtca ggatgtacaa tccaacaaac attct
//
LOCUS       NEUAMTR      5928 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  N.crassa mating type protein gene, complete cds.
ACCESSION   M33876
KEYWORDS    mating type protein.
SOURCE      N.crassa (strain 74-ORS-A) DNA.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 5928)
  AUTHORS   Grotelueschen,J., Metzenberg,R.L. and Glass,N.L.
  TITLE     The Neurospora crassa A mating type region
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.L.Glass, 26-APR-1990.
FEATURES       from  to/span     description
    pept       4121     4283     mating type protein, exon 1
               4343     5046     mating type protein, exon 2
    IVS        4284     4342     mating type protein intron A
    signal     3858     3906     GC signal
    signal     3945     3949     CAAT box
    signal     5319     5326     poly-A signal
BASE COUNT     1470 a   1459 c   1526 g   1473 t
ORIGIN      
        1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg
       61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga
      121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct
      181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct
      241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag
      301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa
      361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga
      421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa
      481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg
      541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg
      601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg
      661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc
      721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg
      781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc
      841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg
      901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc
      961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat
     1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa
     1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc
     1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg
     1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt
     1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa
     1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc
     1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca
     1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat
     1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac
     1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca
     1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta
     1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga
     1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt
     1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa
     1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac
     1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc
     1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc
     2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg
     2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga
     2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg
     2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca
     2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca
     2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc
     2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca
     2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat
     2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag
     2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc
     2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca
     2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg
     2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag
     2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg
     2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata
     2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg
     3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca
     3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca
     3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt
     3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt
     3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca
     3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag
     3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac
     3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta
     3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca
     3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac
     3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg
     3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca
     3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt
     3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt
     3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc
     3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa
     3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc
     4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt
     4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat
     4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc
     4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt
     4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat
     4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag
     4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat
     4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg
     4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg
     4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg
     4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag
     4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat
     4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt
     4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct
     4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc
     4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat
     4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg
     5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca
     5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc
     5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg
     5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg
     5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg
     5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg
     5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca
     5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa
     5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct
     5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg
     5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag
     5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg
     5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc
     5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc
     5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac
//
LOCUS       HUMNCADH     3451 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human N-cadherin mRNA, complete cds.
ACCESSION   M34064
KEYWORDS    N-cadherin; cell adhesion molecule; transmembrane protein.
SOURCE      Human muscle, cDNA to mRNA, clones lambda-[4-10,1-5,13,14].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3451)
  AUTHORS   Walsh,F.S., Barton,C.H., Putt,W., Moore,S.E., Kesell,D.,
            Spurr,N. and Goodfellow,P.N.
  TITLE     The N-cadherin gene maps to human Chromosome 18 and is not linked
            to the E-cadherin gene
  JOURNAL   J. Neurochem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.H.Barton, 08-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1     2247     N-cadherin (AA at 1)
BASE COUNT     1041 a    703 c    724 g    983 t
ORIGIN      Chromosome 18.
        1 gactgggtca tccctccaat caacttgcca gaaaactcca ggggaccttt tcctcaagag
       61 cttgtcagga tcaggtctga tagagataaa aacctttcac tgcggatacg tgtaactggg
      121 ccaggagctg accagcctcc aactggtatc ttcattctca accccatctc gggtcagctg
      181 tcggtgacaa agcccctgga tcgccagcag aatgcccggt ttcatttagg ggcacatgca
      241 gtagatatta atggaaatca agtggagacc cccattgaca ttgtcatcaa tgttattgac
      301 atgaatgaca acagacctga gttcttacac caggtttgga atgggacagt tcctgaggga
      361 tcaaagcctg gaacatatgt gatgaccgta acagcaattg atgctgacga tcccaatgcc
      421 ctcaatggga tgttgaggta cagaatcgtg tctcaggctc caagcacccc ttcacccaac
      481 atgtttacaa tcaacaatga gactggtgac atcatcacag tggcagctgg acttgatcga
      541 gaaaaagtgc aacagtatac gttaataatt caagctacag acatggaagg caatcccaca
      601 tatggccttt caaacacagc cacggccgtc atcacagtga cagatgtcaa tgacaatcct
      661 ccagagttta ctgccatgac gttttatggt gaagttcctg agaacagggt agacatcata
      721 gtagctaatc taactgtgac cgataaggat caaccccata caccagcctg gaacgcagtg
      781 tacagaatca gtggcggaga tcctactgga cggttcgcca tccagaccga cccaaacagc
      841 aacgacgggt tagtcaccgt ggtcaaacca atcgactttg aaacaaatag gatgtttgtc
      901 cttactgttg ctgcagaaaa tcaagtgcca ttagccaagg gaattcagca cccgcctcag
      961 tcaactgcaa ccgtgtctgt tacagttatt gacgtaaatg aaaaccctta ttttgccccc
     1021 aatcctaaga tcattcgcca agaagaaggg cttcatgccg gtaccatgtt gacaacattc
     1081 actgctcagg acccagatcg atatatgcag caaaaatatt taagatacac taaattatct
     1141 gatcctgcca attggctaaa aatagatcct gtgaatggac aaataactac aattgctgtt
     1201 ttggaccgag aatcaccaaa tgtgaaaaac aatatatata atgctacttt ccttgcttct
     1261 gacaatggaa ttcctcctat gagtggaaca ggaacgctgc agatctattt acttgatatt
     1321 aatgacaatg cccctcaagt gttacctcaa gaggcagaga cttgcgaaac tccagacccc
     1381 aattcaatta atattacagc acttgattat gacattgatc caaatgctgg accatttgct
     1441 tttgatcttc ctttatctcc agtgactatt aagagaaatt ggaccatcac tcggcttaat
     1501 ggtgattttg ctcagcttaa tttaaagata aaatttcttg aagctggtat ctatgaagtt
     1561 cccatcataa tcacagattc gggtaatcct cccaaatcaa atatttccat cctgcgcgtg
     1621 aaggtttgcc agtgtgactc caacggggac tgcacagatg tggacaggat tgtgggtgcg
     1681 gggcttggca ccggtgccat cattgccatc ctgctctgca tcatcatcct gcttatcctt
     1741 gtgctgatgt ttgtggtatg gatgaaacgc cgggataaag aacgccaggc caaacaactt
     1801 ttaattgatc cagaagatga tgtaagagat aacattttaa aatatgatga agaaggtgga
     1861 ggagaagaag accaggacta tgacttgagc cagctgcagc agcctgacac tgtggagcct
     1921 gatgccatca agcctgtggg aatccgacga atggatgaaa gacccatcca cgccgagccc
     1981 cagtatccgg tccgatctgc agccccacac cctggagaca ttggggactt cattaatgag
     2041 ggccttaaag cggctgacaa tgaccccaca gctccaccat atgactccct gttagtgttt
     2101 gactatgaag gcagtggctc cactgctggg tccttgagct cccttaattc ctcaagtagt
     2161 ggtggtgagc aggactatga ttacctgaac gactgggggc cacggttcaa gaaacttgct
     2221 gacatgtatg gtggaggtga tgactgaact tcagggtgaa cttggttttt ggacaagtac
     2281 aaacaatttc aactgatatt cccaaaaagc attcagaagc taggctttaa ctttgtagtc
     2341 tactagcaca gtgcctgctg gaggctttgg cataggctgc aaaccaattt gggctcagag
     2401 ggaatatcag tgatccatac tgtttggaaa aacactgagc tcagttacac ttgaatttta
     2461 cagtacagaa gcactgggat tttatgtgcc tttttgtacc tttttcagat tggaattagt
     2521 tttctgttta aggctttaat ggtactgatt tctgaaacga taagtaaaag acaaaatatt
     2581 ttgtggtggg agcagtaagt taaaccatga tatgcttcaa cacgcttttg ttacattgca
     2641 tttgctttta ttaaaataca aaattaaaca aacaaaaaaa ctcatggagc gattttatta
     2701 tcttggggga tgagaccatg agattggaaa atgtacatta cttctagttt tagactttag
     2761 tttgtttttt ttttttttca ctaaaatctt aaaacttact cagctggttg caaataaagg
     2821 gagttttcat atcaccaatt tgtagcaaaa ttgaattttt tcataaacta gaatgttaga
     2881 cacattttgg tcttaatcca tgtacacctt tttatttctg tatttttcca cttcactgta
     2941 aaaatagtat gtgtacataa tgttttattg gcatacgtct atggagaagt gcagaaactt
     3001 cagaacatgt gtatgtatta tttggactat ggattcaggt tttttgcatg tttatatctt
     3061 tcgttatgga taaagtattt acaaaacagt gacatttgat tcaattgttg agctgtagtt
     3121 agaatactca atttttaatt tttttaattt ttttattttt tattttcttt ttggtttggg
     3181 gagggagaaa agttcttagc acaaatgttt tacataattt gtaccaaaaa aaaaaaaaaa
     3241 ggaaaggaaa gaaaggggtg gcctgacact ggtggcacta ctaagtgtgt gtttttttaa
     3301 aaaaaaaatg gaaaaaaaaa agcctttaaa ctggagagac ttctgacaac agctttgcct
     3361 ctgtattgtg taccagaata taaatgatac acctctgacc ccagcgttct gaataaaatg
     3421 ctaattttgg ataacaaaaa aaggggaatt c
//
LOCUS       HUMFGF2H     3365 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human fibroblast growth factor receptor (FGFr) transmembrane form
            mRNA, complete cds.
ACCESSION   M34185
KEYWORDS    FGF receptor; fibroblast growth factor receptor;
            transmembrane tyrosine kinase.
SOURCE      Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
            clone h2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3365)
  AUTHORS   Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
  TITLE     Diverse forms of a receptor for acidic and basic fibroblast growth
            factors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Johnson, 10-MAY-1990.
            
              Author address: D.E.Johnson
              University of California San Francisco
              4th and Parnassus
              Howard Hughes Medical Institute
              San Francisco, CA 94143
              (415) 476-4297
            
FEATURES       from  to/span     description
    pept        256     2457     fibroblast growth factor receptor (FGFr)
                                 transmembrane form
BASE COUNT      786 a    962 c    917 g    700 t
ORIGIN      
        1 gcaccgagcg ccgccgggag tcgagcgccg gccgcggagc tcttgcgacc ccgccaggac
       61 ccgaacagag cccgggggcg gcgggccgga gccggggacg cgggcacacg cccgctcgca
      121 caagccacgg cggactctcc cgaggcggaa cctccacgcc gagcgagggt cagtttgaaa
      181 aggaggatcg agctcactgt ggagtatcca tggagatgtg gagccttgtc accaacctct
      241 aactgcagaa ctgggatgtg gagctggaag tgcctcctct tctgggctgt gctggtcaca
      301 gccacactct gcaccgctag gccgtccccg accttgcctg aacaagatgc tctcccctcc
      361 tcggaggatg atgatgatga tgatgactcc tcttcagagg agaaagaaac agataacacc
      421 aaaccaaacc gtatgcccgt agctccatat tggacatccc cagaaaagat ggaaaagaaa
      481 ttgcatgcag tgccggctgc caagacagtg aagttcaaat gcccttccag tgggacccca
      541 aaccccacac tgcgctggtt gaaaaatggc aaagaattca aacctgacca cagaattgga
      601 ggctacaagg tccgttatgc cacctggagc atcataatgg actctgtggt gccctctgac
      661 aagggcaact acacctgcat tgtggagaat gagtacggca gcatcaacca cacataccag
      721 ctggatgtcg tggagcggtc ccctcaccgg cccatcctgc aagcagggtt gcccgccaac
      781 aaaacagtgg ccctgggtag caacgtggag ttcatgtgta aggtgtacag tgacccgcag
      841 ccgcacatcc agtggctaaa gcacatcgag gtgaatggga gcaagattgg cccagacaac
      901 ctgccttatg tccagatctt gaagactgct ggagttaata ccaccgacaa agagatggag
      961 gtgcttcact taagaaatgt ctcctttgag gacgcagggg agtatacgtg cttggcgggt
     1021 aactctatcg gactctccca tcactctgca tggttgaccg ttctggaagc cctggaagag
     1081 aggccggcag tgatgacctc gcccctgtac ctggagatca tcatctattg cacaggggcc
     1141 ttcctcatct cctgcatggt ggggtcggtc atcgtctaca agatgaagag tggtaccaag
     1201 aagagtgact tccacagcca gatggctgtg cacaagctgg ccaagagcat ccctctgcgc
     1261 agacaggtaa cagtgtctgc tgactccagt gcatccatga actctggggt tcttctggtt
     1321 cggccatcac ggctctcctc cagtgggact cccatgctag caggggtctc tgagtatgag
     1381 cttcccgaag accctcgctg ggagctgcct cgggacagac tggtcttagg caaacccctg
     1441 ggagagggct gctttgggca ggtggtgttg gcagaggcta tcgggctgga caaggacaaa
     1501 cccaaccgtg tgaccaaagt ggctgtgaag atgttgaagt cggacgcaac agagaaagac
     1561 ttgtcagacc tgatctcaga aatggagatg atgaagatga tcgggaagca taagaatatc
     1621 atcaacctgc tgggggcctg cacgcaggat ggtcccttgt atgtcatcgt ggagtatgcc
     1681 tccaagggca acctgcggga gtacctgcag gcccggaggc ccccagggct ggaatactgc
     1741 tacaacccca gccacaaccc agaggagcag ctctcctcca aggacctggt gtcctgcgcc
     1801 taccaggtgg cccgaggcat ggagtatctg gcctccaaga agtgcataca ccgagacctg
     1861 gcagccagga atgtcctggt gacagaggac aatgtgatga agatagcaga ctttggcctc
     1921 gcacgggaca ttcaccacat cgactactat aaaaagacaa ccaacggccg actgcctgtg
     1981 aagtggatgg cacccgaggc attatttgac cggatctaca cccaccagag tgatgtgtgg
     2041 tctttcgggg tgctcctgtg ggagatcttc actctgggcg gctccccata ccccggtgtg
     2101 cctgtggagg aacttttcaa gctgctgaag gagggtcacc gcatggacaa gcccagtaac
     2161 tgcaccaacg agctgtacat gatgatgcgg gactgctggc atgcagtgcc ctcacagaga
     2221 cccaccttca agcagctggt ggaagacctg gaccgcatcg tggccttgac ctccaaccag
     2281 gagtacctgg acctgtccat gcccctggac cagtactccc ccagctttcc cgacacccgg
     2341 agctctacgt gctcctcagg ggaggattcc gtcttctctc atgagccgct gcccgaggag
     2401 ccctgcctgc cccgacaccc agcccagctt gccaatggcg gactcaaacg ccgctgactg
     2461 ccacccacac gccctcccca gactccaccg tcagctgtaa ccctcaccca cagcccctgc
     2521 tgggcccacc acctgtccgt ccctgtcccc tttcctgctg gcaggagccg gctgcctacc
     2581 aggggccttc ctgtgtggcc tgccttcacc ccactcagct cacctctccc tccacctcct
     2641 ctccacctgc tggtgagagg tggcaaagag gcagatcttt gctgccagcc acttcatccc
     2701 ctcccagatg ttggaccaac acccctccct gccaccaggc actgcctgga gggcagggag
     2761 tgggagccaa tgaacaggca tgcaagtgag agcttcctga gctttctcct gtcggtttgg
     2821 tctgttttgc cttcacccat aagcccctcg cactctggtg gcaggtgcct tgtcctcagg
     2881 gctacagcag tagggaggtc agtgcttcgt gcctcgattg aaggtgacct ctgccccaga
     2941 taggtggtgc cagtggctta ttaattccga tactagtttg ctttgctgac caaatgcctg
     3001 gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc ctggggccca
     3061 gccccaaact gggggctctg tatatagcta tgaagaaaac acaaagtgta taaatctgag
     3121 tatatattta catgtctttt taaaagggtc gttaccagag atttacccat cgggtaagat
     3181 gctcctggtg gctgggaggc atcagttgct atatattaaa aacaaaaaag aaaaaaaagg
     3241 aaaacgtttt taaaaaggtc atatattttt tgctactttt gctgttttat ttttttaaat
     3301 tatgttctaa acctattttc agtttaggtc cctcaataaa aattgctgct gcttcaaaaa
     3361 aaaaa
//
LOCUS       HUMFGF3H     3503 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human fibroblast growth factor receptor (FGFr) transmembrane form
            mRNA, complete cds.
ACCESSION   M34186
KEYWORDS    FGF receptor; fibroblast growth factor receptor;
            transmembrane tyrosine kinase.
SOURCE      Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
            clone h3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3503)
  AUTHORS   Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
  TITLE     Diverse forms of a receptor for acidic and basic fibroblast growth
            factors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Johnson, 10-MAY-1990.
            
              Author address: D.E.Johnson
              University of California San Francisco
              4th and Parnassus
              Howard Hughes Medical Institute
              San Francisco, CA 94143
              (415) 476-4297
            
FEATURES       from  to/span     description
    pept        527     2722     fibroblast growth factor receptor (FGFr)
                                 transmembrane form
BASE COUNT      777 a   1044 c   1005 g    677 t
ORIGIN      
        1 gcggaaccca aggacttttc tccggtccga gctcggggcg ccccgcaccg ggacggtacc
       61 cgtgctgcag tcgggcacgc cgcgggcccg ccgggggcct ccgcagggcg atggagccgg
      121 tctgcaagga aagtgaggcg ccgccgctgc gttctggagg aggggggcac aaggtctgga
      181 gaccccgggt ggcggacggg agccctcccc ccgccccgcc tccggggcac cagctccggc
      241 tccattgttc ccgcccgggc tggaggcgcc gagcaccgag cgccgccggg agtcgagcgc
      301 cggccgcgga gtcttgcgac cccgccagga cccgaacaga gcccgggggc ggcgggccgg
      361 agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc ccgaggcgga
      421 acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg tggagtatcc
      481 atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt ggagctggaa
      541 gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta ggccgtcccc
      601 gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc
      661 ctcttcagag gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac
      721 atccccagaa aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt
      781 caaatgccct tccagtggga ccccaaaccc cacactgcgc tggttggaaa atggcaaaga
      841 attcaaacct gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat
      901 aatggactct gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta
      961 cggcagcatc aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat
     1021 cctgcaagca gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat
     1081 gtgtaaggtg tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa
     1141 tgggagcaag attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt
     1201 taataccacc gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc
     1261 aggggagtat acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt
     1321 gaccgttctg gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga
     1381 gatcatcatc tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt
     1441 ctacaagatg aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa
     1501 gctggccaag agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc
     1561 catgaactct ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat
     1621 gctagcaggg gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga
     1681 cagactggtc ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga
     1741 ggctatcggg ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt
     1801 gaagtcggac gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa
     1861 gatgatcggg aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc
     1921 cttgtatgtc atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg
     1981 gaggccccca gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc
     2041 ctccaaggac ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc
     2101 caagaagtgc atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt
     2161 gatgaagata gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa
     2221 gacaaccaac ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat
     2281 ctacacccac cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct
     2341 gggcggctcc ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg
     2401 tcaccgcatg gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg
     2461 ctggcatgca gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg
     2521 catcgtggcc ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta
     2581 ctcccccagc tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt
     2641 ctctcatgag ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa
     2701 tggcggactc aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc
     2761 tgtaaccctc acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc
     2821 tgctggcagg agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact
     2881 cagctcacct ctccctccac ctcctctcca cctgctggtg agaggtggca aagaggcaga
     2941 tcttttcact gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc
     3001 accaggcact gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc
     3061 ttcctgagct ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac
     3121 tctggtggca ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc
     3181 tcgattgaag gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac
     3241 tagtttgctt tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg
     3301 ggggcagtgt tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga
     3361 agaaaacaca aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt
     3421 accagagatt tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata
     3481 tattaaaaac aaaaaaaaaa aaa
//
LOCUS       HUMFGF4H     2283 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human fibroblast growth factor receptor (FGFr) transmembrane form
            mRNA, complete cds.
ACCESSION   M34187
KEYWORDS    FGF receptor; fibroblast growth factor receptor;
            transmembrane tyrosine kinase.
SOURCE      Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA,
            clone h4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2283)
  AUTHORS   Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
  TITLE     Diverse forms of a receptor for acidic and basic fibroblast growth
            factors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Johnson, 10-MAY-1990.
            
              Author address: D.E.Johnson
              University of California San Francisco
              4th and Parnassus
              Howard Hughes Medical Institute
              San Francisco, CA 94143
              (415) 476-4297
            
FEATURES       from  to/span     description
    pept        417     1325     fibroblast growth factor receptor (FGFr)
                                 transmembrane form
BASE COUNT      566 a    621 c    612 g    484 t
ORIGIN      
        1 ggagcccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga ggggggcaca
       61 aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct ccggggcacc
      121 agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc gccgccggga
      181 gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga gcccgggggc
      241 ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc
      301 ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg
      361 tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt
      421 ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta
      481 ggccgtcccc gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg
      541 atgatgactc ctcttcagag gagaaagaaa cagataacac caaaccaaac cgtatgcccg
      601 tagctccata ttggacatcc ccagaaaaga tggaaaagaa attgcatgca gtgccggctg
      661 ccaagacagt gaagttcaaa tgcccttcca gtgggacccc aaaccccaca ctgcgctggt
      721 tgaaaaatgg caaagaattc aaacctgacc acagaattgg aggctacaag gtccgttatg
      781 ccacctggag catcataatg gactctgtgg tgccctctga caagggcaac tacacctgca
      841 ttgtggagaa tgagtacggc agcatcaacc acacatacca gctggatgtc gtggagcggt
      901 cccctcaccg gcccatcctg caagcagggt tgcccgccaa caaaacagtg gccctgggta
      961 gcaacgtgga gttcatgtgt aaggtgtaca gtgacccgca gccgcacatc cagtggctaa
     1021 agcacatcga ggtgaatggg agcaagattg gcccagacaa cctgccttat gtccagatct
     1081 tgaaggtaat catggcacca gtcttcgtgg gccagtctac tgggaaggag accactgtct
     1141 cgggggctca agttcctgtg ggcaggctca gttgcccccg aatgggatca ttcctcacgc
     1201 ttcaggcaca cacactccat ctcagtaggg atctagccac atcccccagg actagtaaca
     1261 gaggtcacaa agtggaggtg agctgggaac agagggctgc agggatgggt ggtgctggtc
     1321 tgtaataagc tttgagagca acgtcactgg ggctttgggg tcagctacac aaggaaggca
     1381 tttggacccc tgccttttca ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt
     1441 cttagccctg tgttctgagt tacgtacgat ctttctggca aatggggtgc atgataagag
     1501 catctcttac gaagagttgg aaaaacaaat gccatatata aattctaagc catatgagga
     1561 cgaggagtaa tggcattttc ttcctttttc ctctcactcc cagacattca ttgtccctga
     1621 atgctccatt aatccaggga aggtaattgc ctaaatctcc agtggatctc gcaacaggaa
     1681 ggaaccagaa gctgggaaag ttgtttacct ctttgtccca gagttagacc tcatcctccc
     1741 ctagcttagc tgtctcagag atatactggc cctcccttct cttctctttg ctgctggtgc
     1801 taaaactgct ctgtaggtca ttggccactg tctccactca caacccctgc tccagtcctg
     1861 gagggagtgg gttaaacaca aatagaacat tccatttgaa gcagtgattc tttttttttt
     1921 tttttttttt taatcaaatg ctttggactt ttgaagtcca cttgttctgt acttgtaaaa
     1981 gggaaagaag gccgggcgca gtcgtcacgc ctgtaatccc agcactttag atcacttgag
     2041 gtcaggagtt tgagaccagc ccggccaaca tggtgaaacc ccatctctac taaaaataca
     2101 aaaattagct gtgcatagtg gttggcacct gtagtcccag ctactcagga ggctgaggca
     2161 agctaactgc ttgaacccag aaggcagagg ttgcagtgag ctgagatcac gccactgcac
     2221 tccagcctgg gtgacagagt gagtgagact ctgcgttaaa aaaaaaaaaa aaaaaaaaaa
     2281 aaa
//
LOCUS       HUMFGF5H     1625 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human fibroblast growth factor receptor (FGFr) secreted form mRNA,
            complete cds.
ACCESSION   M34188
KEYWORDS    FGF receptor; fibroblast growth factor receptor.
SOURCE      Human female placenta endothelial cell line HUVEC, cDNA to mRNA,
            clone h5.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1625)
  AUTHORS   Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T.
  TITLE     Diverse forms of a receptor for acidic and basic fibroblast growth
            factors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Johnson, 10-MAY-1990.
            
              Author address: D.E.Johnson
              University of California San Francisco
              4th and Parnassus
              Howard Hughes Medical Institute
              San Francisco, CA 94143
              (415) 476-4297
            
FEATURES       from  to/span     description
    pept        523     1425     fibroblast growth factor receptor (FGFr)
                                 transmembrane form
BASE COUNT      368 a    480 c    489 g    288 t
ORIGIN      
        1 cggaacccaa ggacttttct ccggtccgag ctcggggcgc cccgcaggcg acggtacccg
       61 tgctgcagtc gggcacgccg cgggcccggg gcctccgcag ggcgatggag cccggtctgc
      121 aaggaaagtg aggcgccgcc gctgcgttct ggaggagggg ggcacaaggt ctggagaccc
      181 cgggtggcgg acgggagccc tccccccgcc ccgcctccgg ggcaccagct ccggctccat
      241 tgttcccgcc cgggctggag gcgccgagca ccgagcgccg ccgggagtcg agcgccggcc
      301 gcggagctct tgcgaccccg ccaggacccg aacagagccc gggggcggcg ggccggagcc
      361 ggggacgcgg gcacacgccc gctcgcacaa gccacggcgg actctcccga ggcggaacct
      421 ccacgccgag cgagggtcag tttgaaaagg aggatcgagc tcactgtgga gtatccatgg
      481 agatgtggag ccttgtcacc aacctctaac tgcagaactg ggatgtggag ctggaagtgc
      541 ctcctcttct gggctgtgct ggtcacagcc acactctgca ccgctaggcc gtccccgacc
      601 ttgcctgaac aagatgctct cccctcctcg gaggatgatg atgatgatga tgactcctct
      661 tcagaggaga aagaaacaga taacaccaaa ccaaaccccg tagctccata ttggacatcc
      721 ccagaaaaga tggaaaagaa attgcatgca gtgccggctg ccaagacagt gaagttcaaa
      781 tgcccttcca gtgggacccc aaaccccaca ctgcgctggt tgaaaaatgg caaagaattc
      841 aaacctgacc acagaattgg aggctacaag gtccgttatg ccacctggag catcataatg
      901 gactctgtgg tgccctctga caagggcaac tacacctgca ttgtggagaa tgagtacggc
      961 agcatcaacc acacatacca gctggatgtc gtggagcggt cccctcaccg gcccatcctg
     1021 caagcagggt tgcccgccaa caaaacagtg gccctgggta gcaacgtgga gttcatgtgt
     1081 aaggtgtaca gtgacccgca gccgcacatc cagtggctaa agcacatcga ggtgaatggg
     1141 agcaagattg gcccagacaa cctgccttat gtccagatct tgaaggtaat catggcacca
     1201 gtcttcgtgg gccagtctac tgggaaggag accactgtct cgggggctca agttcctgtg
     1261 ggcaggctca gttgcccccg aatgggatca ttcctcacgc ttcaggcaca cacactccat
     1321 ctcagtaggg atctagccac atcccccagg actagtaaca gaggtcacaa agtggaggtg
     1381 agctgggaac agagggctgc agggatgggt ggtgctggtc tgtaataagc tttgagagca
     1441 acgtcactgg ggctttgggg tcagctacac aaggaaggca tttggacccc tgccttttca
     1501 ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt cttagccctg tgtcctgagt
     1561 tacgtacgat ctttctggca aatggggtgc atgataagag catctcttac gaagagttgg
     1621 aaaaa
//
LOCUS       HUMINTB1A    1146 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human integrin beta-1 subunit mRNA, 3' end (cytoplasmic domain).
ACCESSION   M34189
KEYWORDS    integrin; integrin beta-1 subunit cytoplasmic domain.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1146)
  AUTHORS   Altruda,F., Cervella,P., Tarone,G., Botta,C., Balzac,F.,
            Stefanuto,G. and Silengo,L.
  TITLE     A human integrin beta-1 subunit with a unique cytoplasmic domain
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Altruda, 10-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      117     integrin beta-1 subunit (AA at 1) (cytoplasmic
                                 domain)
BASE COUNT      373 a    184 c    222 g    367 t
ORIGIN      
        1 aagcttttaa tgataattca tgacagaagg gagtttgcta aatttgaaaa ggagaaaatg
       61 aatgccaaat gggacacggt aagttacaaa acatccaaaa agcaaagtgg cttataaagt
      121 aaatgtaata ctcctaagac ttatgtatta gctgtcaggc tgattattaa agtcctttct
      181 aagtatttta ttcccccaaa agtttcttac tcaaggaatt tgcatttagt gaaaaacaga
      241 aagcatccta aatatatccc attgaaacaa aacattgatt ataagcatgt atattctggt
      301 tcatgtggcc gatattttta tttctttaat gattttgatc ctaaatctgc cttttcatct
      361 aatgtgaagt agaatcctaa ataatgttat ctgtgtagca agctattcaa tgggaaagct
      421 gcttctttct ttaaaacaaa caaacaaaaa aaaccttcag tggaaagcca aattccaaaa
      481 ggttatatac caagcttgtc caactcgcag ctcgtcggcc aggacatgca gcccagaata
      541 gctttgaatg tggccccaac acaaatttgt aaactttctt agaaattgta attattatta
      601 ttattttttt ttggtaactt tttttaaagc tcatcagcta tcgttagtgt attttatgtg
      661 tggcccaaga cagttcttct tcttgccagt gtggcccagg gaagccaaaa gattggacac
      721 ccctgctata tactatatga ttccatttag aggacattct ggaaaagcaa aactgtaggg
      781 gcaaaaatca gtggttgcta ggggctggaa tgggggaaag tgttgaccac agaggggcgt
      841 aagggatctt ccttgggatg acttgattgt gggtggattt atgtatttga aaactcacag
      901 aactatgtac tttaaaaaga tgtatgttcc tctatgaaaa ttatatctca gtaaactttg
      961 gcttataaaa atcttaaaag ccctaagtga ccgaaaggtt atgttagcat tgagtgcttt
     1021 gaaatatgga gtcagagggt ggggtaacca aatgttggcc tttgtgtatt catcttttga
     1081 tacaagaaag caatgccaat cttcagtatt tttaaattgt aaatgaattt tgtagttccc
     1141 gaattc
//
LOCUS       RATIRF1A     2078 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Rat interferon regulatory factor 1 (IRF-1) mRNA, complete cds.
ACCESSION   M34253
KEYWORDS    interferon regulatory factor 1; transcription factor.
SOURCE      Rat cell line Nb2-11c T-cell, cDNA to mRNA, clones 25,4b.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2078)
  AUTHORS   Yu-Lee,L.-Y., Hrachovy,J.A., Stevens,A.M. and Schwarz,L.A.
  TITLE     Interferon regulatory factor 1 is an immediate-early gene under
            transcriptional regulation by prolactin in Nb2 T cells
  JOURNAL   Mol. Cell. Biol. 3087, 3094 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.-Y.Yu-Lee, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        198     1184     interferon regulatory factor 1 (IRF-1)
BASE COUNT      542 a    544 c    540 g    452 t
ORIGIN      
        1 ctcgacgaag gagtaggacg agctctcact gtctgagcca aaccgaaccg ggccgagctg
       61 agccgaggtc agcggtggcc agaggaaccc agcatctcgg gcatcattcg ctccgtgcac
      121 gcatcgtgta cctacaccgc aactccgtgc ctcattcccg ggtaccctct gtgactcgct
      181 cctgcagcaa agccaccatg cctatcactc ggatgcgaat gagaccctgg ctagagatgc
      241 agattaattc caaccaaatt ccagggctga gctggatcaa taaagaagag atgatcttcc
      301 agatcccatg gaagcatgct gccttgcacg gttgggatat caacaaggat gcctgtctgt
      361 tccggagctg ggccattcac acaggccgat acaaagctgg ggaaaaagag ccagatccca
      421 agacttggaa ggcaaacttc cggtgtgcca tgaactccct accagacatc gaggaagtga
      481 aggaccagag caggaacaag ggcagctctg ctgtacgcgt gtaccggatg ctgccacccc
      541 tcaccaagaa ccagaggaaa gagagaaagt ccaagtccag ccgtgacact aagagcaaaa
      601 ccaagaggaa gctgtgcgga gattctagcc ctgacacctt atctgacgga ctgagcagct
      661 ctactctgcc tgatgaccac agcagttaca cagctcaggg atacctgggt caggacttgg
      721 acatggacag ggacattacc ccagctctgt caccgtgcgt cgtcagcagc agtctctctg
      781 agtggcatat gcagatggac atcatgccag acagcaccac tgatctgtac aacttgcagg
      841 tgtcgcccat gccctccacc tctgaagctg caacagatga ggatgaggaa gggaagttac
      901 ctgaggacat catgaagctc tttgaacagt ctgagtggca gccgacgcac gtggatggca
      961 agggatactt gctcaatgaa ccaggagccc aactctctac tgtctatgga gacttcagct
     1021 gcaaggagga accagagatc gacagccctg gaggggacat cgagataggc atacagcgtg
     1081 tcttcacaga gatgaagaat atggaccccg tcatgtggat ggacaccctg ctgggcaact
     1141 ctaccaggcc gccctccatt caggctattc cttgtgcacc ataatttggg tccctgaccc
     1201 gttcttgccc tcctgagtga gctaggtcca gcatcatggt ggctgtgata caacataaag
     1261 ctaaacttcc gtggacccct tgatgtggca aaacataatc ccattgccaa gcagggaagg
     1321 gaccaaacca tcctccttgg gtcagtggac tgactcttca gagcttagga ggcagggtct
     1381 aagtttttca agctggtcct gactcctagg aagatggatt ggcgttctga ggttagtgtg
     1441 aggcagagga cctggacgga agttaccttc tagctctttg aaagcttcat tgcttagaga
     1501 gggtctcacc actgggctgg cctgggggat agaccagcgc ccacagaaga gcattgcact
     1561 ggccttaggg ctggctccac actgggagac aattgcacta agtcctattc ccaaagaact
     1621 gctgcccttc ccaaccgagc cctgggatgg ttctagagcc agtgaaatgt gaaggaaaaa
     1681 atggggtcct gtgagggttg tctcccttag cctcagaggg attctgcctc actccctgct
     1741 ccagctgtgg ggctcaggaa aaaaaaatgg cactttctct gtggactttg ccacatttct
     1801 gatcagaagt gtacactaac atttctccca agtcttggcc tttgcattta tttatatagt
     1861 gccttgccct gtgcctgctg tctctcctca ggcctcagca gtcctcagca ggcccaggga
     1921 gggggttgtg agcgccttgg cgtgactctg aacattggaa acgccaccta actactaagt
     1981 tgtgtctgat ctcgtgtgga tctgtgtaaa tatgtatatt catcttttta taaaaaccta
     2041 agttgtttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS       ACCRRSAA     1536 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  A.calcoaceticus 16S ribosomal RNA.
ACCESSION   M34139
KEYWORDS    16S ribosomal RNA.
SOURCE      A.calcoaceticus (strain 33604) ribosomal RNA.
  ORGANISM  Acinetobacter calcoaceticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1536)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept. Microbiology
              131 Burrill Hall
              407 S. Goodwin Ave.
              Urbana, IL 61801
              (217) 333-9369
            
FEATURES       from  to/span     description
    rRNA          1     1536     16S ribosomal RNA
BASE COUNT      387 a    329 c    460 g    330 t     30 others
ORIGIN      
        1 ttaactgaag agtttgatca tggctcagat tgaacgctgg cggcaggctt aacacatgca
       61 agtcgagcgg ggaaggttgc ttcggtaact gactagcggc ggacgggtga gtaatgctta
      121 ggaatctgcc atttagtggg ggacaacatt ccgaanggaa tgctaatacc gcatacgtcc
      181 tacaggagaa agcaggggat ctccggacct tgcgctaaat gatgagccta agtcggatta
      241 gctagttggt ggggtaaagg cctaccaagg cgacgatctg tagcgggtct gagaggatga
      301 tccgccacac tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata
      361 ttggacaatg ggcgcaagcc ngatccagcc atgccgcgtg tgtgaagaag gccttttggt
      421 tgtaaagcac tttaagcgag gaggaggctc tcttagttaa tacctaagat gagtggacgt
      481 tactcgcaga ataagcaccg gctaactctg tgccagcagc cgcggtaata cagagngtgc
      541 gagcgttaat cggatttact gggcgtaaag cgtgcgtagg cggcttttta agtcggatgt
      601 gaaatccccg agcttaactt gggaattgca ttcgatactg ggaagctaga gtatgggaga
      661 ggatggtaga attccaggtg tagcggtgaa atgcgtagag atctggagga ataccgatgg
      721 cgaaggcagc catctggcct aatactgacg ctgaggtacg naagcatggg gagcaaacag
      781 gattagatac cctggtagtc catgccgtaa acgatgtcta ctagccgttg gggcctttga
      841 ggctttagtg gcgcagctaa cgcgataagt agactgcctg gggagtacgg tcgcaagact
      901 aaaactcaaa tgaattgacg ggggcncgca caagcggtgg agcatgtggt ttaattcgat
      961 gcaacgcgaa gaaccttacc tggccttgac atactagaaa ctttccagag atggattggt
     1021 gccttcggga atctagatac aggtgctgca tggctgtcgt cagctcgtgt cgtgagatgt
     1081 tgggttaagt cccgcaacga gcgcaaccct tttccttact tgccagcatt tcggatggga
     1141 actttaagga tactgccagt gacaaactgg aggaaggcgg ggacgacgtc aagtcatcat
     1201 ggcccttacg gctagggcta cacacgtgct acaatggtcg gtacaaaggg ttgctaccta
     1261 gcgataggat gctaatctca aaaagccgat cgtagttcgg attggagtct gcaactcgac
     1321 tccatgaagt cggaatcgct agtaatcgcg gatcagaatg ccgcggtgaa tacgttcccg
     1381 ggccttgtac acaccgcccg tcacaccatg ggagtttgtt gcaccagaag tagctagcct
     1441 aactgcaaag agggcggtta ccacggtgtg gccgatgact agggnnnnnn ngtaacaagn
     1501 nnnnnnnnnn ngaacctgnn nnnngatcac ctcctt
//
LOCUS       BDERRSAA     1553 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  B.stolpii 16S ribosomal RNA.
ACCESSION   M34125
KEYWORDS    16S ribosomal RNA.
SOURCE      B.stolpii (strain uki-2) ribosomal RNA.
  ORGANISM  Bdellovibrio stolpii
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 1 to 1553)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept Microbiology, 131
              131 Burrill Hall
              407 S. Goodwin Ave.
              Urbana, IL 61801
              (217) 333-9369
            
FEATURES       from  to/span     description
    rRNA          1     1553     16S ribosomal RNA
BASE COUNT      381 a    320 c    429 g    316 t    107 others
ORIGIN      
        1 agcatnnaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa
       61 gtcgaacgtg aaagtccttc gggatgagta aagtggcgca cgggtgagta acacgtaggt
      121 gacctgcctt ttagagggga ataaccagaa gaaattttgg ctaatgccgc atacgaagca
      181 cggttttaag actgtgcttg aaagaatgcc tctgcatatg ngcattcgct attagatggg
      241 cctgcgggac attagctagt tggtggggta aaggcctacc aaggcgacga tgtctatccg
      301 gtctgagagg atgatcggac acactggaac tgagacacgg tccagactcc tacgggaggc
      361 agcagtgggg aatattgcgc aatgggggaa accctgacgc agcaacgccg cgtgagtgag
      421 gaaggacttc ggtctgtaaa gctctgttaa tgtggaaaaa tggcagttgg tctaataggc
      481 cnattgtttg atggtacaca tagaggaagc accggctaac ttcgtgccag cagccgcggt
      541 aatacgaagg gtgcnagcgt tgttcggatt tattgggcgt aaagcgcgcg taggcggacc
      601 tgcaagtcag atgtgaaatc tcggggctca acctcgaaac tgcgtctgaa actacaggtc
      661 tagaatctcg gagggggaag gggaatatcg catgtagggg taaaatccgt agatatgcga
      721 tggaacacca gaggcgaagg cgccttcctg gacgagtatt gacgctgagg cncnnaagcg
      781 tggggatcaa acaggattag ataccctggt agtccacgct gtaaacgatg aacactagat
      841 attggaggat ttgacccctt cagtgtcgta gctaacgcgt caagtgttcc gcctgggaag
      901 tacggtcgca agactaaaac tcaaaggaat tgnnnnnnnn nngcacaagn nnnngattat
      961 gnngtttaat tcgnngcaac gcgcagaacc ttacctaggc ttgaaatcct acgaatccct
     1021 tttaaacgag ggagtgctct tcggagaatg tagtgacagg cgctgcatgg ctgtcgtcag
     1081 ctcgtgtcgt gagatgttgg gttaagtctc gcaacgagcg caacccccat ttttagttgc
     1141 cagcattaag ttgggcactc tagaaagact gcntgggcta accaggagga aggtggggat
     1201 gacgtcaagt cctcatggcc cttatgtcta gggctacaca cgtaatacaa tggtcggtac
     1261 aaagggatgc gaactcgcga gggggagcca atctcaaaaa accgatctca gtccggattg
     1321 gagtctgcaa ctcgactcca tgaagttgga atcgcgagta atcgcggatc agcacgccgc
     1381 ggtgaatacg ttcccgggcc ttgtacacac cgcccgtcac accatgggag ttgtttttac
     1441 ctgaagnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1501 nnnnnnngta acaagnnnnn nnnnnnngaa cctgnnnnnn gatcacctcc ttt
//
LOCUS       PLTRRSAA     1525 bp ss-rRNA            RNA       17-JUL-1990
DEFINITION  P.staleyi 16S ribosomal RNA.
ACCESSION   M34126
KEYWORDS    16S ribosomal RNA.
SOURCE      P.staleyi (strain ATCC 27377) ribosomal RNA.
  ORGANISM  Planctomyces staleyi
            Prokaryota; Bacteria; Eubacteriomycetes; Eubacteriales.
REFERENCE   1  (bases 1 to 1525)
  AUTHORS   Woese,C.R. and Oyalzu,H.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 15-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept Microbiology
              131 Burrill Hall
              407 S. Goodwin Ave.
              Urbana, IL 61801
              (217) 333-9369
            
FEATURES       from  to/span     description
    rRNA          1     1525     16S ribosomal RNA
BASE COUNT      376 a    348 c    482 g    315 t      4 others
ORIGIN      
        1 caattgaaga gtttgatcct ggctcagaat gaacgttggc ggcatggatt aggcatgcaa
       61 gtcgtgcgcg atatgtagca atacatggag agcggcgaaa gggagagtaa tacgtaggaa
      121 cctaccttcg ggtctgggat agcggcggga aactgccggt aataccagat gatgtttccg
      181 aaccaaaggt gtgattccgc ctgaagaggg gcctacgtcg tattagctag ttggtagggt
      241 aatggcctac caaggcaaag atgcgtatgg ggtgtgagag catgccccca ctcactggga
      301 ctgagacact gcccagacac ctacgggtgg ctgcagtcga gaatcttcgg caatgggcga
      361 aagcctgacc gagcgatgcc gcgtgcggga tgaaggcctt cgggttgtaa accgctgtcg
      421 taggggatga agtgctaggg ggttctccct ctagtttgag ctgaacctag gaggaagggc
      481 cggctaatct cgtgccanna gccgcggtaa tacgagaggc ccaaacgtta ttcggattta
      541 ctgggcttaa agagttcgta ggcggtcttg taagtggggt gtgaaatccc tcggctcaac
      601 cgaggaactg cgctccaaac tacaagactt gagggggata gaggtaagcg gaactgatgg
      661 tggagcggtg aaatgcgttg atatcatcag gaacaccgga ggcgaaggcg gcttactggg
      721 tcctttctga cgctgaggaa cgaaagctag gggagcaaac gggattagat accccggtag
      781 tcctagccgt aaacgatgag cactggaccg gagctctgca cagggtttcg gtcgtagcga
      841 aagtgttaag tgctccgcct ggggagtatg gtcgcaaggc tgaaactcaa aggaattgac
      901 gggggctcac acaagcggtg gaggatgtgg cttaattcga ggctacgcga agaaccttat
      961 cctagtcttg acatgcttag gaatcttcct gaaagggagg agtgctcgca agagagcctt
     1021 tgcacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgtcgggt taagtccctt
     1081 aacgagcgaa acccttgtcc ttagttacca gcgcgtcatg gcggggactc taaggagact
     1141 gccggtgtta aaccggagga aggtggggat gacgtcaagt cctcatggcc tttatgatta
     1201 gggctgcaca cgtcctacaa tggtgcacac aaagcgacgc aaactcgtga gagccagcta
     1261 atcgcaaaaa atgtacctca gttcggattg caggctgcaa ctcgcctgca tgaagctgga
     1321 atcgctagta atcgcgggtc agcataccgc ggtgaatntg ttcctgagcc ttgtacacac
     1381 cgcccntcaa gccacgaaag tgggggggac ccaacagcgc tgccgtaacc gcaaggaaca
     1441 aggcgcctaa ggtcaactcc gtgattggga ctaagtcgta acaaggtagc cgtaggggaa
     1501 cctgcggctg gatcacctcc tttct
//
LOCUS       RDCRRSAA     1478 bp ss-rRNA            BCT       17-JUL-1990
DEFINITION  R.purpureus 16S ribosomal RNA.
ACCESSION   M34132
KEYWORDS    16S ribosomal RNA.
SOURCE      R.purpureus (strain 6770) ribosomal RNA.
  ORGANISM  Rhodocyclus purpureus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 1478)
  AUTHORS   Woese,C.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Woese, 09-MAY-1990.
            
              Author address: C.R.Woese
              University of Illinois
              Dept. Microbiology
              131 Burrill Hall
              407 S. Goodwin Ave.
              Urbana, IL 61801
              (217) 333-9369
            
FEATURES       from  to/span     description
    rRNA          1     1478     16S ribosomal RNA
BASE COUNT      367 a    345 c    470 g    289 t      7 others
ORIGIN      
        1 tgaactgaag agtttgatcc tggctcagat tgaacgctgg cggcatgcct tacacatgca
       61 agtcgaacgg taacgggncc ttcgggcgcc gaacgagtgg cgaacgggtg agtaatgcat
      121 cggaacatgc cctgaagtgg gggataacgt agcgaaagtt acgctaatac cgcatattct
      181 gtgagcagga aagcagggga ccttcgggcc ttgcgctttg ggagtggccg atgtcggatt
      241 agctagttgg tggggtaaaa gcctaccaag gcaacgatcc gtagcgggtc tgagaggatg
      301 atccgccaca ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat
      361 tttggacaat gggcgaaagc ctgatccagc catgccgcgt gagtgaagaa ggccttcggg
      421 ttgtaaagct ctttcggcgg ggaagaaatc gggtttccta atacggaacc cggatgacgg
      481 tacccgaaga agaagcaccg gctaactacg tgccagcagc cgcggtaata cgtagggtgc
      541 nagcgttaat cggaattact gggcgtaaag cgtgcgcagg cggttgtgta agacagacgt
      601 gaaatccccg ggctcaacct gggaactgcg tttgtgactg cacagctaga gtacggcaga
      661 ggggggtgga attccacgtg tagcagtgaa atgcgtagag atgtggagga acaccgatgg
      721 cgaaggcagc cccctgggcc aatactgacg ctcatgcacg naagcgtggg gagcaaacag
      781 gattagatac cctggtagtc cacgccctaa acgatgtcaa ctaggtgttg gtggggttaa
      841 acccattagt gccgtagcta acgcgtgaag ttgaccgcct ggggagtacg gcggcaaggt
      901 taaaactcaa aggaattgac gggganccgc acaagcggtg gatgatgtgg attaattcga
      961 tgcaacgcga aaaaccttac ctacccttga catgtcagga atcctgagga gactcgggag
     1021 tgcccgaaag ggnacctgaa cacaggtgct gcatggcngt cgtcagctcg tgtcgtgaga
     1081 tgttgggtta agtcccgcaa cgagcgcaac ccttgtcatt aattgccatc attcagttgg
     1141 gcactttaat gaaactgccg gtgacaaacc ggaggaaggt ggggatgacg tcaagtcctc
     1201 atggccctta tgggtagggc ttcacacgtc atacaatggt cggtccatag ggttgcnaac
     1261 ccgcgagggg gagctaatcc cagaaagccg atcgtagtcc ggattgcagt ctgcaactcg
     1321 actgcatgaa gtcggaatcg ctagtaatcg cggatcagca tgtcgcggtg aatacgttcc
     1381 cgggtcttgt acacaccgcc cgtcacacca tgggagcggg ttctgccaga agtagttagc
     1441 ctaaccgcaa ggagggcgat taccacggca gcgttcgt
//
LOCUS       RATSVPIIA    4161 bp ds-DNA             ROD       17-JUL-1990
DEFINITION  Rat seminal vesicle secretion II protein (SVS II) gene, complete
            cds.
ACCESSION   J05443
KEYWORDS    seminal vesicle secretion II protein.
SOURCE      Rat (strain CHARLES RIVER) male seminal vesicle epithelial cell
            DNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4161)
  AUTHORS   Harris,S.E., Harris,M.A., Johnson,C.M., Bean,M.F., Dodd,J.G.,
            Matusik,R.J., Carr,S.A. and Crabb,J.W.
  TITLE     Structural characterization of the rat seminal vesicle secretion II
            protein and gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.E.Harris,  25-APR-1990.
FEATURES       from  to/span     description
    pept       2065     2137     seminal vesicle secretion II protein (SVS II)
                                 precursor, exon 1
               2377     3548     seminal vesicle secretion II protein
                                 precursor, exon 2
    sigp       2065     2130     seminal vesicle secretion II protein signal
                                 peptide
    matp       2131     3545     seminal vesicle secretion II protein
    signal     1934     1946     CAAT box
    signal     2013     2020     TATA box
BASE COUNT     1243 a    902 c    822 g   1194 t
ORIGIN      
        1 tttcgatcca atgtgtggat tactcaccaa gtgtctgtct ttctttcttt ctctctttct
       61 ctctttcttc ctcccttcct tccttccttc cttccttcct tccttccttc cttccttcct
      121 ttctgttcaa ttgctcgttt ctcccttcat ctctcgccag tataccgcac actcaaactt
      181 aaattttcat ttcaatgcgt tctcttctgg cacgtgcagc ataattacac tcatgattgt
      241 caactccgtg atctgtttgc acaccttacc ccccccccca aggttttatc tgcatttaaa
      301 aaaaagggat tatcaagaaa tttattctta attcagaaat gtgatcaaag ttgtcagatc
      361 cgttctttac tgcctcctgt tggaaaaaaa aaatatccag ttcctggatt tttctaaaac
      421 acagaaaaga gacctgggac aggggtatag gattgagcag gcatggtgag caattttata
      481 ctgaatagat tcattgtgac ttaccggtct cctgagggaa ataatcactt ttcccaggta
      541 gagagcagcc tagcaagaga tcagagtgca agcataaaac ccatgtgctt tataagtgta
      601 tttattttat gcattttctg tttataagga catgagtgga ctttttattt gtcccttcca
      661 tacaggacta cctagactat tgggatggga tgactgaaaa tatgttttca agtagacttc
      721 cttccggaac taccttcata tggttctgaa ggcaaagtgg aacactgcac gggtgtcctc
      781 ttctcccaag aacttggcca tggcgtcgtc gttttgagtc tatgtctgag ccacgaatgc
      841 cataacagcc cttcctgtta ctctcacagt ggcacagagc tgtttctaaa caagaaggaa
      901 gtcttccatc ttgtgtcagg atgctaatga cgtcaccaat ggcagtaagt gttcaccaca
      961 gcccgttgct aaggcaatta tgttatccct cctgtcagag tttcctgtat taaaatatac
     1021 tgagtttaat tttatgtcgg attccatgac atacattcag caaggaaacc aacagtatct
     1081 tttgttcttt caacagtgat ttcctgtcac catttaactg ttgtctcgcc cccattcttt
     1141 aaaatgtctc tgcacctcac cttgcctccc agatacactc ccaaactcat ttccctggac
     1201 acacttgaaa tgttgctgct agcaagccac agctaccacg tcttctctgt cagggttcta
     1261 gacaactcat ctaaagcagc accaggtctc tcttagaaat cagacatcgg atgtcatggt
     1321 catagtatac ctcacagcta ctttggacat tcatgggccc agtattattt tccagggctg
     1381 aggtttaact caagagcctc atgctcacat ggctggtggt ctggccacac agctatgact
     1441 cgtctccatt tattcttcaa acttttattc ggagctccgt tgtgttctgt tgtctcctgt
     1501 gcctttctat atgtgtgact gctcctttgc ctgtaaatga gaagctatgt caaattcaac
     1561 gtaaaaaagg caacttcatg ggcttctgtg agatagcatg ctaaacagtg tcagctccac
     1621 tacactgtga ccaggaaaat ttgatcaggc cctggttact ctcggagcat aaaagaagaa
     1681 aaaaaaatct cttccccgct ctactctgga ttttgtttga aaataaaagg tccaatctgt
     1741 ccttataaaa catgcataga ataaatatac tagaaaacac actttgtttg caaagggtat
     1801 gtgataaagt cagagggttg ataaagattt gctgaggctt atgacataga aaaggtccct
     1861 gacattgcat ccctgtgcaa agtacctggg aacattacca atgtccccaa ctgtgcagag
     1921 gggaggaagt tgacatttag agataatttt tttaaaaaag caggcagtgc ttttgtagtg
     1981 tcagttatat ctgtaataca tccagctaga gatatataaa tgtgaaagtc agctcagctc
     2041 tcagtgaagg tccttcttga caagatgaag tcctctgtct tcattctatc tctgttcctc
     2101 cttctggaaa gacaggcagc tgtggttgga cagtatggtg agtagggaga tggtgactag
     2161 agggaaagtc actcagggag aatgttttta agggtgctct gggagtagca gatcctttca
     2221 taggggaatt tttttttaaa tgagacctaa ttcttctcta ctgaaaacca aaacccttgt
     2281 gggaacatca atggttttat gaggaaattt tggaaatgag acttggaagg actgtgcaga
     2341 tcatgtaact taaaccttcc tcctctcaat taccaggtgg gacaaaaggt cacttccaga
     2401 gcagctcatc agggtttatg cttggtcaga aaggccacct caattttggg ctcaaaggag
     2461 gaagtgagga agcagctgaa gaaagcattt tcatgcaatc acaacaccag atgttcggcc
     2521 aggatggtgg tgacatggcg cagacaagtg tttcacaaga gcatacaggt gtaaaggggg
     2581 ccgcgatttg tcgtaaagga caagtatccc aattgaaatc ccaagaatcc caaataaaat
     2641 cctttagaca agtaaaatcc agtggacagc tgaaatctgg aggatcccaa ttaaaatcct
     2701 ttggacaagt gaaatccagt gagtcccaat taaaatcctt tggccaagtg aaagccagtg
     2761 ggtcccaatt aaaatccttc ggacaagtga aagccagtgg gtcccaatta aaatcctatg
     2821 gacaaatgaa atccagtggg tcccaagtga aatcctttgg acaaatgaaa tccagtgggt
     2881 cccaagtaaa atcctttgga caaatgaaag ccagtgagtc ccaaataaaa tcctttggac
     2941 aaagaaaatc ccaaggtggt caactacaat cctatggcca aatgaaatcc tatgggcaga
     3001 cgaaatccct agaatcccag gccaaatcct tcggacaagt aaagtcccaa agtggccaaa
     3061 tgaaatcctc ctatggtcag agaaaatcct atggtgaaga gactcaactg aagtctttcg
     3121 accaagatgc ccaactaaaa tcctatggtc aacaaaaatc ccaaaaacaa tcctccttta
     3181 gccaagtaaa atctcaaagt gcccaactaa agtcctttgg ccaacaaaaa tccctcaaag
     3241 ggttttctca acaaactcaa cagaaaggat ttgccatgga tgaagatttg tcacaagtgc
     3301 ggaaacaatt tgacgatgat gacctctctg tacaacagaa gtctacccaa cagatgaaaa
     3361 cagaggaaga cttatcccaa tttggacaac aacgacaatt tggacaagaa cgctcccaat
     3421 cctataaagg atatcttgca caatacagaa agaaattaca ggaacaacaa caacagaaaa
     3481 attttaatca ggataacttt tttacaaagg gaggggcagg cctatatcag gctcaactta
     3541 agggataaca tattcactga gcaactgaag accaagatca atgtcaaggt atgttccacc
     3601 aagtaggaag atattatcca aatttacttg tggtatatag gaatcctgga tccattatgg
     3661 attgataccc atttgttact atcagtagaa gtattgttac acacttttag aaggatgaag
     3721 aacagaccct ggtaaaatga gtccttgtag agtaaaggca gagtaagcaa gctaagttaa
     3781 caattggtcc tgaattacta cattcaggga gcacttttca gtgcttctct gagcacagac
     3841 agtttatatt attaaatgtg taccacctat gcaatcatat ttaacatttc atgatggaat
     3901 cttatttgtt cttacacttt gacttgataa aaaaaaaaga ttggtttctt gcttatattg
     3961 gtataaggtg gtattgcagc tgagctcttt ctctacacca gtgcgtgttc ttgagtcccc
     4021 tgggacctct gctttccatc acaatccatg gggttaagga ttagctgctt ttccatcaga
     4081 tggaagattt ggttacaaag atctctgcct ggagcagaca ctatattcag ttgtatgtcc
     4141 aatggtgacc ctgttgaatt c
//
LOCUS       CREAPCYN      577 bp ss-mRNA            PLN       17-JUL-1990
DEFINITION  C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds.
ACCESSION   J05524
KEYWORDS    apoplastocyanin.
SOURCE      C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone
            PC6-2.
  ORGANISM  Chlamydomonas reinhardtii
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Volvocales; Chlamydomonadaceae.
REFERENCE   1  (bases 1 to 577)
  AUTHORS   Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and
            Bogorad,L.
  TITLE     Isolation and characterization of a complementary DNA clone for an
            algal pre-apoplastocyanin
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Hill, 11-MAY-1990.
FEATURES       from  to/span     description
    pept         22      459     apoplastocyanin (PC6-2) precursor
    sigp         22      162     apoplastocyanin signal peptide
    matp        163      456     apoplastocyanin
BASE COUNT       91 a    189 c    171 g    126 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc
       61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt
      121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg
      181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag
      241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc
      301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc
      361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac
      421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat
      481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac
      541 gcagagagcg tgtagctctg taactttttc ggaattc
//
LOCUS       MHVNSGII      870 bp ss-RNA             VRL       17-JUL-1990
DEFINITION  Murine hepatitis virus non-structural protein gene-2 (NS2).
ACCESSION   M34035
KEYWORDS    non structural protein.
SOURCE      Murine hepatitis virus (strain MHV-JHM), cDNA to viral RNA.
  ORGANISM  Murine hepatitis virus A59
            Unclassified.
REFERENCE   1  (bases 1 to 870)
  AUTHORS   Schwarz,B., Routledge,E. and Siddell,S.G.
  TITLE     The coronavirus MHV 30 kDa non-structural protein NS2 is not
            essential for virus replication in transformed murine cells
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.G.Siddell, 04-MAY-1990.
            
              Author address: S.G.Siddell
              Inst of Virology
              Univ of Wuerzburg
              Versbacherstrasse 7
              8700 Wuerzburg
            
FEATURES       from  to/span     description
    pept         40      837     non-structural protein-2 (NS2)
BASE COUNT      269 a    147 c    187 g    267 t
ORIGIN      
        1 gcgatagcct agtaaatgtt aaataaatct atacttgtca tggctgcgag aatggccttt
       61 gctgacaagc ctaatcattt tataaacttt cctctagccc aatttagtgg ctttatgggt
      121 aagtatttaa agcttcagtc tcaacttgtg gaaatgggtt tggactgtaa attacaaaag
      181 gtaccacatg ttagtattac cctgcttgac attaaagcag accaatacaa acaggtggaa
      241 tttgcaatac aagaaataat agatgatctg gcggcatatg agggagatat tgtctttgac
      301 aaccctcata tgcttggcag atgtcttgtt cttgatgtta aaggatttga agagttgcat
      361 gaagatattg ttgaaattct ccgcagaagg ggttgcactg cagatcaatc cagacaatgg
      421 attccgcact gcactgtggc ccaatttgat gaagaaaaag aaataaaaga aatgcaattc
      481 tattttaaat tgcccttcta tctcaagcat aacaacctac ttacggatgc taggcttgag
      541 cttgtgaaga taggttcttc caaagtaggt gggttttatt gtagtgaact aagtatttgg
      601 tgtggtgaga gactttgtta caagccccca acccccaaat tcagtgatat atttggctat
      661 tgctgcatag ataaaatacg tggtgattta gaaataggag acctaccgcc agatgatgag
      721 gaagcgtggg ccgagctaag ttaccactat caaagaaaca cctacttctt cagacatgtg
      781 cacgataata gtatctattt tcgtaccgta tgtagaatga agggttgtat gtgttgattt
      841 gtttttacac tattagtgta ataaacttat
//
LOCUS       HUMFGF1A     3343 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds.
ACCESSION   M34641
KEYWORDS    FGF receptor-1; fibroblast growth factor receptor;
            transmembrane glycoprotein; tyrosine kinase.
SOURCE      Human teratocarcinoma cell line Tera-2, clone 13, cDNA to mRNA,
            clone 1:1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3343)
  AUTHORS   Wennstroem,S., Sandstroem,C. and Claesson-Welsh,L.
  TITLE     cDNA cloning and expression of a human FGF receptor which binds
            acidic and basic FGF
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Claesson-Welsh, 25-MAY-1990.
            
              Author address: L.Claesson-Welsh
              Ludwig Institute for Cancer Research
              Biomedical Center
              Box 595
              S-751 24 Uppsala
              SWEDEN
            
FEATURES       from  to/span     description
    pept         10     2472     FGF receptor-1 precursor
    sigp         10       72     FGF receptor-1 signal peptide
    matp         73     2469     FGF receptor-1
BASE COUNT      766 a    957 c    911 g    709 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca
       61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct
      121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg
      181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac
      241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc
      301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat
      361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag
      421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa
      481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct
      541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct
      601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct
      661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc
      721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca
      781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg
      841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag
      901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc
      961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat
     1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg
     1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc
     1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg
     1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag
     1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct
     1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg
     1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc
     1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg
     1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac
     1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg
     1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc
     1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca
     1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac
     1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc
     1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata
     1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac
     1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac
     2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc
     2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg
     2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca
     2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc
     2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc
     2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag
     2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc
     2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc
     2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg
     2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct
     2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc
     2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg
     2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc
     2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc
     2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc
     2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac
     3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc
     3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat
     3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc
     3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga
     3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt
     3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc
//
LOCUS       ECOFADA      1594 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  E.coli 3-ketoacyl-coenzyme A thiolase (fadA) DNA.
ACCESSION   J05498
KEYWORDS    3-ketoacyl-coenzyme A thiolase.
SOURCE      E.coli (strain K12) DNA, clone pK52.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1594)
  AUTHORS   Yang,S.-Y., Yang,X.-Y.H., Healy-Louie,G., Schulz,H. and Elzinga,M.
  TITLE     Nucleotide sequence of the fadA gene: Primary structure of 3-
            ketoacyl-coenzyme A thiolase from Escherichia coli and the
            structural organization of the fadAB operon
  JOURNAL   J. Biol. Chem. 265, 10424-10429 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-Y.Yang, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        110     1276     3-ketoacyl-coenzyme A thiolase (fadA)
                                 precursor (E.C. 2.3.1.16)
    sigp        110      139     3-ketoacyl-coenzyme A thiolase signal peptide
    matp        140     1273     3-ketoacyl-coenzyme A thiolase
BASE COUNT      329 a    425 c    489 g    351 t
ORIGIN      86 min on K12 map.
        1 tgaagtgccg gaaggtctgc gtaataaagc gcgtcataac gaaccgtact atcctccggt
       61 tgagccagcc cgtccggttg gcgacctgaa aacggcttaa ggagtcacaa tggaacaggt
      121 tgtcattgtc gatgcaattc gcaccccgat gggccgttcg aagggcggtg cttttcgtaa
      181 cgtgcgtgca gaagatctct ccgctcattt aatgcgtagc ttgctggcgc gtaacccggc
      241 gctggaagcg gcggccctcg acgatattta ctggggttgt gtgcagcaga cgctggagca
      301 gggttttaat atcgcccgta acgcggcgct gctggcagaa gtaccacact ctgtcccggc
      361 ggttaccgtt aatcgcttgt gtggttcatc catgcaggca ctgcatgacg cagcacgaat
      421 gatcatgact ggcgatgcgc aggcatgtct ggttggcggc gtggagcata tgggccatgt
      481 gccgatgagt cacggcgtcg attttcaccc cggcctgagc cgcaatgtcg ccaaagcggc
      541 gggcatgatg ggcttaacgg cagaaatgct ggcgcgtatg cacggtatca gccgtgaaat
      601 gcaggatgcc tttgccgcgc ggtcacacgc ccgcgcctgg gccgccacgc agtcggccgc
      661 atttaaaaat gaaatcatcc cgaccggtgg tcacgatgcc gacggcgtcc tgaagcagtt
      721 taattacgac gaagtgattc gcccggaaac caccgtggaa gccctcgcca cgctgcgtcc
      781 ggcgtttgat ccagtaaacg gtatggtaac ggcgggcaca tcttctgcac tttccgatgg
      841 cgcagctgcc atgctggtga tgagtgaaag ccgcgcccat gaattaggtc ttaagccgcg
      901 cgctcgtgtg cgttcgatgg cggtcgttgg ttgtgaccca tcgattatgg gttacggccc
      961 ggttccggcc tcgaaactgg cgctgaaaaa agcggggctt tctgccagcg atatcggcgt
     1021 gtttgaaatg aacgaagcct ttgccgcgca gatcctgcca tgtattaaag atctgggact
     1081 aattgagcag attgacgaga agatcaacct caacggtggc gcgatcgcgc tgggtcatcc
     1141 gctgggttgt tccggtgcgc gtatcagcac cacgctgctg aatctgatgg aacgcaaaga
     1201 cgttcagttt ggtctggcgg acggatgtgt atcgggtctg ggtcagggta ttgcgacggt
     1261 gtttgagcgg gtttaaccaa cttgccggat tgcggccgtt gaacggcctt atccggtcta
     1321 ccgatccggc accaatgtag gcctgataag acgcgccagc gtcgcatcag gcatcatgca
     1381 ccaattgccg gatgcggcgt aaacgcctta tccggctacg gttcggacta tttgtaggca
     1441 cgataagacg cgacagcgtc gcatcaggca tcatgcacca attgccggat gcggcgtaaa
     1501 cgccttatcc ggcctacggt tcggactatt tgtaggcacg ataagacgcg acagcgtcgc
     1561 atcgggcatg gtttgccaga tacgtcatag atct
//
LOCUS       HSSCLS       1645 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Herpesvirus saimiri collagen-like protein mRNA, complete cds.
ACCESSION   M31964
KEYWORDS    collagen-like protein.
SOURCE      Herpesvirus saimiri (strain 484-77), cDNA to mRNA.
  ORGANISM  Herpesvirus saimiri
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 1645)
  AUTHORS   Geck,P., Whitaker,S.A., Medveczky,M.M. and Medveczky,P.G.
  TITLE     Expression of collagen-like sequences by a tumor virus, herpesvirus
            saimiri
  JOURNAL   J. Virol. 64, 3509-3515 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P. Medveczky, 07-FEB-1990.
FEATURES       from  to/span     description
    pept        101      400     collagen-like protein
    pept        580     1224     ORF2
    mRNA         56     1248     collagen-like protein mRNA
BASE COUNT      545 a    363 c    393 g    344 t
ORIGIN      Map position 98-100.
        1 gagtttccaa aatgtagtaa gctaacatat aaatatcagc agaaaataga gtaacacaag
       61 aaacagctaa caagagcaac acaaacacgc ttgaaagaaa atggcaagcg agccaaacct
      121 aagatatcca actgaagaaa caggagatcg aggacctcaa ggacctccag gacctccagg
      181 accccaagga cctccaggac cccaaggacc tccaggaccc caaggacctc caggacccca
      241 aggacctcca ggaccccaag gacctccagg accccaagga cctccaggac ccccaggacc
      301 tccaggacct tcagactcca ggattgtttg taacaaactt attgcttgga atcataattt
      361 tactcttatt aattatagtt gcgatcttac tggtgtctaa attagtagta aactaagaac
      421 agagcaagca gtatagaaga gagtaattat gctagcttaa tttcatctag tagtcagttg
      481 tgtagaactt tttattagta aatctagaag tgttaagaaa aaggtataga aagtagtgac
      541 ctttattagc tgcatcttca tatctctaaa ttagtaagta tggaaaacca gagagaagaa
      601 atagaactaa cagaaattcc agagactgag aaaaaaagaa cagctgaaga aaaattatta
      661 tcatgctctg ctgaaactgc tgaagaaaag gtatcattat gctctgaaga aactacagac
      721 acgtcaagca gttctagcag tgagcaaaca cctgctccta tagaagtaaa cgttaacatt
      781 caaacgagca catacttgcc tcagaacgcg gcaacaaact tgaacagttt gtacacaagt
      841 tttgaagacg caagggcaca aggaaaagga ctggtgcgac acaattcgga cgatttaaaa
      901 agttttttag aaaaataccc accagattat agaaaaccta aacgggactt gtcagaatca
      961 tgggatccag gtatgccgaa accaactttg cctcctaggc ctgcgaacct aggtgcaagc
     1021 caggcaagta ctgtaagacg gcatgtaaga gaacaaaatt ttaaacagct tagagaacgt
     1081 aaagctaatg aagggaaaat agtaaaagat ttgaaaagac ttgagtacaa agttaatata
     1141 attctttgtt tagtagttgt tatattagct ataatattat tactaactgg gctaagtata
     1201 ctgttcataa gaataaaaag ttaatttaac aaagaattta acaaatatgt gtgtgcatgg
     1261 tcattaaaag aagcgggccc caagcagtgg gcgctgggca gctggaaaaa tgtcctgggc
     1321 gtggctagag cgagctcggg gggcaaagcg agcagcgagc ggctgcggag ggagggggaa
     1381 gggattgagc tgccccgccc ctctgggctg agattccaga gcccctgttg tgtccccagg
     1441 ccaccgcagg cccggggctg tgcccctgtg cttggagcca gtccccagag ccctggctag
     1501 gccgaggcgc tgctagagta agggtcccac aactgttcag gcccccccag ggggagagcc
     1561 cagcgcagcg ctctgccggg tagggggggg gtagctggct ccccccccac ctctgcgtga
     1621 agcagagtgc atgcagcccc tgcag
//
LOCUS       MUSAGP1A      777 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  M.caroli alpha-1-acid glycoprotein (AGP) mRNA.
ACCESSION   M34648 J05488
KEYWORDS    acute phase serum protein; alpha-1-acid glycoprotein;
            non-specific immunosuppressant; transmembrane protein;
            tyrosine kinase.
SOURCE      M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-1A.
  ORGANISM  Mus caroli
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 777)
  AUTHORS   Prowse,K.R. and Baumann,H.
  TITLE     Molecular characterization and acute-phase expression of the
            multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
            Differences in glucocorticoid stimulation and regulatory elements
            between the rat and mouse AGP genes
  JOURNAL   J. Biol. Chem. 265, 10201-10209 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.R.Prowse, 18-MAY-1990.
FEATURES       from  to/span     description
    pept         39      662     alpha-1-acid glycoprotein (AGP) precursor
    sigp         39       92     alpha-1-acid glycoprotein (AGP) signal peptide
    matp         93      659     alpha-1-acid glycoprotein (AGP)
    signal      747      752     AGP poly-A signal
BASE COUNT      225 a    189 c    186 g    177 t
ORIGIN      Chromosome 4.
        1 atcttttcca agccctggtg cctctgagtg tcctaagcat ggcactgcac atgattcttg
       61 tcatgttgag cctcctgccg ctgttggaag ctcagaaccc agaacatgtc aacatcacta
      121 taggcgaacc tatcaccaat gagaccctga gctggctctc tgacaaatgg tttttcattg
      181 gtgcggctgt cctaaaccct gattaccggc aggaaattca aaagatgcag atggtatttt
      241 ttaacattac ccccaacttg ataaatgaca cgatggagct tcgagagtat cacaccatag
      301 atgaccactg tgtctataac tctacacatc taggaatcca gagagagaat gggaccctct
      361 ccaagtatgt aggaggagta aaaatctttg cagacctgat agtgctgagg aaacatgggg
      421 ccttcatgct tgcctttgac ctgaaggatg agaagaaacg gggactgtcc ctcaatgcca
      481 aaaggccaga tatcaccccg gagctgcggg aagtattcca gaaggctgtc aaacacgtgg
      541 gcatggatga atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg
      601 agaagaagca gcttgagctg gagaaggaga ccaagaaaga tcctgaggaa ggccaggcat
      661 gaactcagct ctctgaactc catgggctgt ccccaggctc accaaacccc accccctcct
      721 gtgcactttg attctgtctc tgcaacaata aaggtttgct gacacagtca aaaaccg
//
LOCUS       MUSAGP1X      778 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  M.caroli alpha-1-acid glycoprotein (AGP) mRNA.
ACCESSION   M34647 J05488
KEYWORDS    acute phase serum protein; alpha-1-acid glycoprotein;
            non-specific immunosuppressant; transmembrane protein;
            tyrosine kinase.
SOURCE      M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-1X.
  ORGANISM  Mus caroli
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 778)
  AUTHORS   Prowse,K.R. and Baumann,H.
  TITLE     Molecular characterization and acute-phase expression of the
            multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
            Differences in glucocorticoid stimulation and regulatory elements
            between the rat and mouse AGP genes
  JOURNAL   J. Biol. Chem. 265, 10201-10209 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.R.Prowse, 18-MAY-1990.
FEATURES       from  to/span     description
    pept         39      662     alpha-1-acid glycoprotein (AGP) precursor
    sigp         39       92     alpha-1-acid glycoprotein (AGP) signal peptide
    matp         93      659     alpha-1-acid glycoprotein (AGP)
    signal      747      752     AGP poly-A signal
BASE COUNT      224 a    191 c    187 g    176 t
ORIGIN      Chromosome 4.
        1 atctcttcca ggccctggtg cctctgagtg tcctaagcat ggcactgcac atgattcttg
       61 tcatgttgag cctcctgccg ctgttggaag ctcagaaccc agaacatgtc aacatcacta
      121 taggcgaacc tatcaccaat gagaccctag gctggctatc tgacaaatgg tttttcattg
      181 gtgcggctgt cctaaaccct gattaccggc aggaaattca aaagatgcag atggtatttt
      241 ttaacattac ccccaacttg ataaatgaca cgatggagct tcgagagtat cacaccatag
      301 atgaccactg tgtctataac tccacacatc taggaatcca gagagagaat gggaccctct
      361 ccaagtatgt aggaggagta aaaatctttg cagacctgat agtgctgagg aaacatgggg
      421 ccttcatgct tgcctttgac ctgaaggatg agaagaaacg gggactgtcc ctcaatgcca
      481 aaaggccaga tatcaccccg gagctgcggg aagtattcca gaaggctgtc aaacacgtgg
      541 gcatggatga atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg
      601 agaagaagca gcttgagctg gagaaggaga ccaagaaaga tcctgaggaa ggccaggcat
      661 gaactcagct ctctgaactc catgggctgt ccccaggctc accaaacccc accccctcct
      721 gtgcactttg attctgtctc tgcaacaata aaggtttgct gacactgtca aaaacccg
//
LOCUS       MUSAGP2AX     765 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  M.caroli alpha-1-acid glycoprotein (AGP) mRNA.
ACCESSION   M34649 J05488
KEYWORDS    acute phase serum protein; alpha-1-acid glycoprotein;
            non-specific immunosuppressant; transmembrane protein;
            tyrosine kinase.
SOURCE      M.caroli adult hepatocyte, cDNA to mRNA, clones lambda-[2A,2X].
  ORGANISM  Mus caroli
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 765)
  AUTHORS   Prowse,K.R. and Baumann,H.
  TITLE     Molecular characterization and acute-phase expression of the
            multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
            Differences in glucocorticoid stimulation and regulatory elements
            between the rat and mouse AGP genes
  JOURNAL   J. Biol. Chem. 265, 10201-10209 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.R.Prowse, 18-MAY-1990.
FEATURES       from  to/span     description
    pept         29      652     alpha-1-acid glycoprotein (AGP) precursor
    sigp         29       82     alpha-1-acid glycoprotein (AGP) signal peptide
    matp         83      649     alpha-1-acid glycoprotein (AGP)
    signal      735      740     AGP poly-A signal
BASE COUNT      226 a    185 c    185 g    169 t
ORIGIN      Chromosome 4.
        1 gggcctggtg cctctgagtg ccctcagcat ggcactgcac acggttctta tcatgttgag
       61 cctcctgccg atgttggaag ctcagaaccc agaacatgcc aacatcacta taggcgaacc
      121 tatcaccaat gagaccctag gctggctctc tgacaaatgg tttttcatgg gcgcagcttt
      181 cagaaaactc gagtacaggc aggcaattca aatgatgcag actgaatttt tttaccttac
      241 caccaacttg ataaacgaca caatagagct tcgggagtct caaacaatag gtgaccagtg
      301 tgtctataac tccacccatc taggattcca gagagaaaat gggaccttct ccaagtatga
      361 aggaggagta gaaacctttg cccacctgat agtgctaagg aaacatgggg ccttcatgct
      421 tgcctttgat ctcaatgatg agaagaaacg gggactgtcc ctctatgcca aaaggccaga
      481 tatgaccctg gagctacggg aagtattcca gaaggctgtc aaacacgtgg gcatggatga
      541 atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg agaagaagca
      601 gcttgagttg gggaaggaga ccaagaaaga tcctgaggaa ggccaggcat gaactcagct
      661 ctgaactcca tgggctatcc ccaggctcac caaaccccac cccctcctgt gcactttgat
      721 tctgtctctg caacaataaa gatttgctga aagagtcaaa aaccg
//
LOCUS       MUSAGP3X      766 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  M.caroli alpha-1-acid glycoprotein (AGP) mRNA.
ACCESSION   M34646 J05488
KEYWORDS    acute phase serum protein; alpha-1-acid glycoprotein;
            non-specific immunosuppressant; transmembrane protein;
            tyrosine kinase.
SOURCE      M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-3X.
  ORGANISM  Mus caroli
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 766)
  AUTHORS   Prowse,K.R. and Baumann,H.
  TITLE     Molecular characterization and acute-phase expression of the
            multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
            Differences in glucocorticoid stimulation and regulatory elements
            between the rat and mouse AGP genes
  JOURNAL   J. Biol. Chem. 265, 10201-10209 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.R.Prowse, 18-MAY-1990.
FEATURES       from  to/span     description
    pept         31      654     alpha-1-acid glycoprotein (AGP) precursor
    sigp         31       84     alpha-1-acid glycoprotein (AGP) signal peptide
    matp         85      651     alpha-1-acid glycoprotein (AGP)
    signal      737      742     AGP poly-A signal
BASE COUNT      228 a    186 c    185 g    167 t
ORIGIN      Chromosome 4.
        1 cggggcctgg tgcctctgag tgccctcagc atggcactgc acacggttct tatcatgttg
       61 agcctcctgc cgatgttgga agctcagaac ccagaacatg ccaacatcac tataggcgaa
      121 cctatcacca atgagaccct aggctggctc tctgacaaat ggtttttcat gggcgcacgt
      181 ttcagaaaac tcgagtacag gcaggcaatt caaatgatgc agactgaatt tttttacctt
      241 accaccaact tgataaacga cacaatagag cttcgggagt ctcaaacaat aggtgaccag
      301 tgtgtctata actccaccca tctaggattc cagagagaaa atgggacctt ctccaagtat
      361 gaaggaggag tagaaacctt tgcccacctg atagtgctaa ggaaacatgg ggccttcatg
      421 cttgcctttg acctcaatga tgagaagaaa cggggactgt ccctctatgc caaaaggcca
      481 gatatgaccc tggagctacg ggaagtattc cagaaggctg tcaaacacgt gggcatggat
      541 gaatcagaaa tcatatttgt cgactggaaa aaggataagt gtggtcagca ggagaagaag
      601 cagcttgagt tggggaagga gaccaagaaa gatcctgagg aaggccaggc atgaactcag
      661 ctctgaactc catgggctat ccccaggctc accaaacccc accccctcct gtgcactttg
      721 attctgtctc tgcaacaata aagatttgct gaaagaaaaa aaaccg
//
LOCUS       MUSAGP7A     1348 bp ds-DNA             ROD       17-JUL-1990
DEFINITION  M.caroli alpha-1-acid glycoprotein (AGP) gene 7, 5' flank and
            exon 1.
ACCESSION   M34645 J05488
KEYWORDS    alpha-1-acid glycoprotein; transmembrane protein; tyrosine kinase.
SOURCE      M.caroli adult hepatocyte DNA, clones lambda-[11B,27A].
  ORGANISM  Mus caroli
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1348)
  AUTHORS   Prowse,K.R. and Baumann,H.
  TITLE     Molecular characterization and acute-phase expression of the
            multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
            Differences in glucocorticoid stimulation and regulatory elements
            between the rat and mouse AGP genes
  JOURNAL   J. Biol. Chem. 265, 10201-10209 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.R.Prowse, 18-MAY-1990.
FEATURES       from  to/span     description
    pept       1232  /  1348     alpha-1-acid glycoprotein (AGP), exon 1
    mRNA       1194  >  1348     AGP mRNA
    signal     1161     1166     TATA box
BASE COUNT      312 a    316 c    356 g    364 t
ORIGIN      Chromosome 4.
        1 gaattcacac aggccttgct catctggcct ctgcaggcaa aggacaggag aagcagggga
       61 ggcacatgtg agcagaagcc tgggcagact ctggaatggt atgcagcaag taccctgggg
      121 agaccttagc tggatgtcca cttccctgcc tgcacccttg cccttagaat ccacaggctc
      181 tggatacaca gcacagccac aggaagcttt ggggatttct tttccctatc ttaagtttat
      241 tgttttcttc atttggttgt ataacaaaaa gagattttta aatttgctaa tgggtgcaag
      301 tgtgagtgtg tgtgtgtgtg tgtgtgtgtt tctgtgtgtg tgtgtgtgtg tgtgtgtgtg
      361 tgtgtgtgtg tgtgtgtgtg agagagagag agagagagga agggacagag tgaaatagat
      421 catgacactg tgagcatgtt ctaacacgtt tgctgtattg atgttttgaa gcagctcact
      481 gtgatctcaa gctcccagca atgtctctgt gcctcagcgg gagtcaccag gcctgtccct
      541 gcaatacttc tagagattgt tgattatgta gacaaagatc aggtaggaac tagaaccaga
      601 tattgactgg atgctgggca cagaccctgg gttgggcata cactgtccta ggcagatcct
      661 ttcctgctgt agatactggg agctttgctg aactcgatgg tcaactcaga ttcacccctt
      721 tttgggcact tggctgcctt caggctgggt aggagtccgc aggaacatca cactcctttg
      781 aaaaccatcc atctttattc ttggccctca acttgagccc ctaagtgtct tctatgttca
      841 ctatgaaccc tgacctggga ccccttctta tcacgcttcg ggggggtgga gttgatgtag
      901 tgtaggtttc actcctgtta gacagcttca tgggataaga gagaaagggg accgctgtct
      961 gggacctaag tatcatcagg ctagcccttt acctaccttg accatgaatc agccactctg
     1021 gtgtaggcag aagccgactg agggagctgc acaaacctgg cttgagggaa catttggggc
     1081 aagacattta tcaagtgctg gtgactttgt ggcactgctc taacgccctg ctgcagtcac
     1141 atgccctccc cacatcctgt tataaaagtc actgcaccct ccaggcacca gttatctctt
     1201 cccagcactg gtgcctctga gtgccgtcag catggagcta cacacagttc ttgtgatgtt
     1261 gagcctcctg ctgctgttgg aagctcagaa cccagaacat gccaacatca acataggcaa
     1321 acctatcaac aatgagaccc tgagctgg
//
LOCUS       MUSAGP8A     2094 bp ds-DNA             ROD       17-JUL-1990
DEFINITION  M.caroli alpha-1-acid glycoprotein (AGP) gene 8, 5' flank and
            exon 1.
ACCESSION   M34644 J05488
KEYWORDS    alpha-1-acid glycoprotein; transmembrane protein; tyrosine kinase.
SOURCE      M.caroli adult hepatocyte DNA, clones lambda-[15A,11B].
  ORGANISM  Mus caroli
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2094)
  AUTHORS   Prowse,K.R. and Baumann,H.
  TITLE     Molecular characterization and acute-phase expression of the
            multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes:
            Differences in glucocorticoid stimulation and regulatory elements
            between the rat and mouse AGP genes
  JOURNAL   J. Biol. Chem. 265, 10201-10209 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.R.Prowse, 18-MAY-1990.
FEATURES       from  to/span     description
    pept       1978  /  2094     alpha-1-acid glycoprotein (AGP), exon 1
    mRNA       1940  >  2094     AGP mRNA
    signal     1907     1912     TATA box
BASE COUNT      554 a    502 c    464 g    574 t
ORIGIN      Chromosome 4.
        1 gaattcacac aggccttgct catctggcct ctgcaggcaa aggacaggag aagcagggga
       61 ggcacatgtg agcagaagcc tgggcagact ctggaatggt atgcagcaag taccctgggg
      121 agaccttagc tggtgtccac tccctgcctg cacccttgcc cttagaatcc acaggctctg
      181 gatgcacagc acagccacag gaagctttgg agatatctaa tccgtttctt aagtttatta
      241 ttttccatca tttgctttta taacaaaaaa tagatatttt atatctgcca gtgtgtgcaa
      301 atatgtttat gtgcacgtgt acgtgtgcgc ctgtgtgtaa gagacataac agtgacaaag
      361 aggcacaaac agagagatta tgacttctgg gcactgctaa cttgggacag aacatatatc
      421 attggtgctt ttgggctcct catggttatg ctgtttctcc tctgggaaga gagaaacttt
      481 cttaagactg tataaataca acaaaacaca tatgttttgg gtctccagct cagcaacttc
      541 tcctaggccc tgtacagccc actccatggt cagggggctc tcaggaccat ctgtggcccc
      601 ttccttttgc cctaggtggt cactgtcact gacctgatta atatcattac aattagtctg
      661 atcctctaga acgccctgta aacaggctca cacaacccca cctctgggcg atctgctggc
      721 aataatgttg tcgagctgca tttgtatggt caaatctatc agtctttcat ttctctctgt
      781 tgctaagcaa caccatgctt gcatgaaatg tcacaaccca tttctcattt tcctggtggt
      841 aggcacttaa gagcttggag tttgtggcta ctttgaataa tgccattaga acatttacac
      901 acatgatact ggaggtgact catgttttgt tttcctttct gttggtagac taatttttaa
      961 aggcaatatg ttgctctgac acttataaag aggggaagca taaaatacaa aatgccacag
     1021 gataaaagaa ggttctcgtg aaattagaac acagtaagtg aatgacaatt atttatcatt
     1081 aaaacaaaaa ccctgaatac acagaaacat gagaaattgt ggaggacaca ggaagtagat
     1141 tactcaactg tgagcgtgtt ctcatatgtt tggtgctttt gatgttttga aagagctcac
     1201 ggtgatctca agctcctagc aatgactccg agcctcagtg agagtcacca ggcctgtccc
     1261 tagcaataca tttcgagagt gttgattatg tagacaaaaa tcaggctata tctatcaagc
     1321 aggaaccaga accagatatt gactggttgc tgtgcacaga ccctggattg ggcacacact
     1381 attctagaca gatcctttcc tgctgtagat acagggagct ttgctgaact acattttcaa
     1441 ctcagattca tccctctttt ttgggcattt ggctgcctct agcgtgcgta ggggtcccca
     1501 gaaacatcac actcctttgg aaactaatcc atctttgtcc ttggccctta acttgagccc
     1561 ctaaatgact tctaagttca ctatgaaccc tgaccaggac cccttcatag tccccttggt
     1621 gataaataga ataggtctca ctcctgctag gcagcttcat gggataagac agatatggga
     1681 ccatgactgg gatctaagta ttatcaggct agcccagtat ccaccttgac catgaatcag
     1741 ccactctggt gtaggtcagg agcctgtgtc ggggccggct ggctgaggga gctgcacaaa
     1801 gctggcttga gggaacattt tgcgcaagac atttcccaag tgctggggag attgtgccag
     1861 ggctctagag gccctgctgc agtcccatgc cctccccaca ttctgttata aaagccactg
     1921 caccctccat ccaccaatta tctcttccaa gccctggtgc ctctgagtgt cctaagcatg
     1981 gcactgcaca tgattcttgt catgttgagc ctcctgccgc tgttggaagc tcagaaccca
     2041 gaacatgtca acatcactat aggcgaacct atcaccaatg agaccctagg ctgg
//
LOCUS       MUSMK1        933 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse retinoic acid-responsive protein (MK-1) mRNA, complete cds.
ACCESSION   M19662 M34326 J05473
KEYWORDS    DNA-binding protein; regulatory protein;
            retanoic acid-responsive protein; secreted protein.
SOURCE      Mouse (strain 129/Sv) HM-1 embryonal carcinoma, cDNA to mRNA, clone
            pO1-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 33 to 932)
  AUTHORS   Kadomatsu,K., Tomomura,M. and Muramatsu,T.
  TITLE     cDNA cloning and sequencing of a new gene intensely expressed in
            early differentiation stages of embryonal carcinoma cells and in
            mid-gestation period of mouse embryogenesis
  JOURNAL   Biochem. Biophys. Res. Commun. 151, 1312-1318 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 933)
  AUTHORS   Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T.
  TITLE     A retinoic acid responsive gene, MK, found in the teratocarcinoma
            system: Heterogeneity of the transcript and the nature of the
            translation
  JOURNAL   J. Biol. Chem. 265, 10765-10770 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Kadomatsu, 08-JUL-1988.  Draft entry and computer-readable
            sequence for [2] kindly submitted by M.Tomomura and T.Muramatsu,
            16-MAY-1990.
            
            The expression of MK-1 mRNA increases in the early stages of
            retinoic acid-induced differentiation of embryonal carcinoma cells.
            The putative peptide product may act as a DNA-binding protein
            regulating gene expression.
            
FEATURES       from  to/span     description
    pept        249      671     retanoic acid-responsive protein (MK-1)
                                 precursor
    sigp        249      314     retanoic acid-responsive protein signal
                                 peptide
    matp        315      668     retanoic acid-responsive protein
    mRNA    <     1      910     MK-1 mRNA
BASE COUNT      216 a    272 c    270 g    175 t
ORIGIN      
        1 gagtgagcgc acgcagtggc tgtggcccca gtcccttcag gcggctgctc tgccaccaag
       61 ggggctgagg tgggggtggg ggtacgctga gacatcggtt ccaagtcctc cctccgtctc
      121 ccccttgtcg gtccgacgtt ttgggcctgg aaagtgggac aagtcagtca agggtgggag
      181 gtccttcccg cggttcctag cggagaagag actaggcgag aaactctaac ccaggtttta
      241 cccctaggat gcagcaccga ggcttcttcc ttctcgccct tcttgccctc ttggtggtca
      301 cgtccgcggt ggccaaaaaa aaagagaagg tgaagaaggg cagcgagtgt tcggagtgga
      361 cctgggggcc ctgcaccccc agcagcaagg actgcggcat gggcttccgc gagggtacct
      421 gtggggccca gacccagcgc gtccattgca aggtgccctg caactggaag aaggaatttg
      481 gagccgactg caaatacaag tttgagagct ggggggcgtg tgatgggagc actggcacca
      541 aagcccgcca agggaccctg aagaaggcgc ggtacaatgc ccagtgccag gagaccatcc
      601 gcgtgactaa gccctgcacc tccaagacca agtcaaagac caaagccaag aaaggaaaag
      661 gaaaggacta agtcaggagg ccagagagcc tccggcctcg cctggagcct gaacggagcc
      721 ctcctctccc acaggcccaa gatataaccc accagtgcct tttgtcttcc tgtcagctct
      781 gtcaatcacg cctgtcctct cacgcccaca ccaagtgccc aaagtgggga gggacaagag
      841 attctggaaa gtgagcctcc ccataccctc ttttgttctc cccaccctga tacttgttat
      901 taagaaatga ataaaataaa ctcacttttt tcc
//
LOCUS       MUSMK2        730 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse retinoic acid-responsive protein (MK-2) mRNA, complete cds.
ACCESSION   M34327 J05473
KEYWORDS    DNA-binding protein; regulatory protein;
            retanoic acid-responsive protein; secreted protein.
SOURCE      Mouse HM-1 embryonal carcinoma, cDNA to mRNA, clone pB2-25.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 730)
  AUTHORS   Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T.
  TITLE     A retinoic acid responsive gene, MK, found in the teratocarcinoma
            system: Heterogeneity of the transcript and the nature of the
            translation
  JOURNAL   J. Biol. Chem. 265, 10765-10770 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Tomomura and T.Muramatsu, 16-MAY-1990.
FEATURES       from  to/span     description
    pept         44      466     retanoic acid-responsive protein (MK-2)
                                 precursor
    sigp         44      109     retanoic acid-responsive protein signal
                                 peptide
    matp        110      463     retanoic acid-responsive protein
    mRNA    <     1      705     MK-2 mRNA
BASE COUNT      186 a    211 c    206 g    127 t
ORIGIN      
        1 caggccggag cgggagggag cgaagcatcg agcagtgagc gagatgcagc accgaggctt
       61 cttccttctc gcccttcttg ccctcttggt ggtcacgtcc gcggtggcca aaaaaaaaga
      121 gaaggtgaag aagggcagcg agtgttcgga gtggacctgg gggccctgca cccccagcag
      181 caaggactgc ggcatgggct tccgcgaggg tacctgtggg gcccagaccc agcgcgtcca
      241 ttgcaaggtg ccctgcaact ggaagaagga atttggagcc gactgcaaat acaagtttga
      301 gagctggggg gcgtgtgatg ggagcactgg caccaaagcc cgccaaggga ccctgaagaa
      361 ggcgcggtac aatgcccagt gccaggagac catccgcgtg actaagccct gcacctccaa
      421 gaccaagtca aagaccaaag ccaagaaagg aaaaggaaag gactaagtca ggaggccaga
      481 gagcctccgg cctcgcctgg agcctgaacg gagccctcct ctcccacagg cccaagatat
      541 aacccaccag tgccttttgt cttcctgtca gctctgtcaa tcacgcctgt cctctcacgc
      601 ccacaccaag tgcccaaagt ggggagggac aagagattct ggaaagtgag cctccccata
      661 ccctcttttg ttctccccac cctgatactt gttattaaga aatgaataaa ataaactcac
      721 ttttttccaa
//
LOCUS       MUSMK3       1039 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse retinoic acid-responsive protein (MK-3) mRNA, complete cds.
ACCESSION   M34328 J05473
KEYWORDS    DNA-binding protein; regulatory protein;
            retanoic acid-responsive protein; secreted protein.
SOURCE      Mouse HM-1 embryonal carcinoma, cDNA to mRNA, clones pB3-30 and
            FK-3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1039)
  AUTHORS   Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T.
  TITLE     A retinoic acid responsive gene, MK, found in the teratocarcinoma
            system: Heterogeneity of the transcript and the nature of the
            translation
  JOURNAL   J. Biol. Chem. 265, 10765-10770 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Tomomura and T.Muramatsu, 16-MAY-1990.
FEATURES       from  to/span     description
    pept        355      777     retanoic acid-responsive protein (MK-3)
                                 precursor
    sigp        355      420     retanoic acid-responsive protein signal
                                 peptide
    matp        421      774     retanoic acid-responsive protein
    mRNA    <     1     1016     MK-3 mRNA
BASE COUNT      278 a    292 c    302 g    167 t
ORIGIN      
        1 cccaggtacc cagctccctg ccacatcaga gacccttctt gcactctgag tgaactgatt
       61 aaaaaaaaaa aaaaaaaaaa aaaaccaagc cggaggtgag ccgggcctcg aagggaaggt
      121 tcgcgggtgc ggtggccgcc ccgagcctgt gacaccagga catactcccg gggcccgcgg
      181 tgggcaagcg aagtggtgac ctgagagctg acaggctgcg agagggaaaa gtatagacag
      241 gcctagacca ggggaagggg aggggataga gagctgggcc tgctacgagg ggacctgagc
      301 cagaagcgca ctggtaaaac cgaactccag gaccagagac ccagagatca gaggatgcag
      361 caccgaggct tcttccttct cgcccttctt gccctcttgg tggtcacgtc cgcggtggcc
      421 aaaaaaaaag agaaggtgaa gaagggcagc gagtgttcgg agtggacctg ggggccctgc
      481 acccccagca gcaaggactg cggcatgggc ttccgcgagg gtacctgtgg ggcccagacc
      541 cagcgcgtcc attgcaaggt gccctgcaac tggaagaagg aatttggagc cgactgcaaa
      601 tacaagtttg agagctgggg ggcgtgtgat gggagcactg gcaccaaagc ccgccaaggg
      661 accctgaaga aggcgcggta caatgcccag tgccaggaga ccatccgcgt gactaagccc
      721 tgcacctcca agaccaagtc aaagaccaaa gccaagaaag gaaaaggaaa ggactaagtc
      781 aggaggccag agagcctccg gcctcgcctg gagcctgaac ggagccctcc tctcccacag
      841 gcccaagata taacccacca gtgccttttg tcttcctgtc agctctgtca atcacgcctg
      901 tcctctcacg cccacaccaa gtgcccaaag tggggaggga caagagattc tggaaagtga
      961 gcctccccat accctctttt gttctcccca ccctgatact tgttattaag aaatgaataa
     1021 aataaactca cttttttcc
//
LOCUS       YSCVPS33P    2695 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  S.cerevisiae vacuolar biogenesis and vacuolar protein sorting
            protein (VPS33/SLP1), complete cds.
ACCESSION   M34638
KEYWORDS    vacular biogenesis protein; vacuolar protein sorting protein.
SOURCE      S.cerevisiae (strain GRF88) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2695)
  AUTHORS   Banta,L.M., Vida,T.A., Herman,P.K. and Emr,S.D.
  TITLE     Characterization of the yeast Vps33p, a protein required for
            vacuolar protein sorting and vacuole biogenesis
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.M.Banta, 18-MAY-1990.
            
              Author address: L.M.Banta
              California Insitute of Technology
              147-75 Div of Biology
              Pasadena, CA 91125
            
FEATURES       from  to/span     description
    pept        401     2476     Vps33p protein
    signal      245      252     TATA box
    binding     480      498     ATP-binding domain A (put.)
    binding     661      679     ATP-binding domain B (put.)
BASE COUNT      916 a    445 c    522 g    812 t
ORIGIN      
        1 gtcaagaagc gggcttggct gccataatgg aagatttaga tgtcgcaaaa gtggaattac
       61 gtcattttga gaaagctttt aaaggaattg ctaggggcat tactccagaa atgctctctt
      121 attatgaaga gtttgctcta agaagcggtt catcttcgta agcttgttca tagtcaattc
      181 ttttcctttg tgtgctcaat aatagtagat agaaattata ctgaactccg gtcattttgt
      241 ataatatatt aatcacttca cacgaacata cataaataaa atatcataaa ggttagcaaa
      301 ttggaactag ttatatgtta attagttaaa agatagaaaa ttcgagaaag gaagaaaaag
      361 ctgatattgc ccatctccaa ctttatcaaa tcatttcacg atgaatagat tttggaatac
      421 taagaaattt tcattaacaa atgccgatgg actatgtgct accttaaatg agatatctca
      481 aaatgatgaa gttcttgtgg ttcaaccaag tgtattgcca gtactcaata gtttgctaac
      541 tttccaagat ttgactcaat caactcctgt aaggaaaatt acgttactcg atgatcagct
      601 aagtgacgat ttaccgagtg ccttaggcag cgttccgcaa atggatctta tttttcttat
      661 tgatgtcaga acatctctcc gactccctcc acaactgctt gatgctgctc aaaagcacaa
      721 tttatcatct ttgcatataa tatactgtcg atggaaaccg tctttccaaa atactttgga
      781 ggatacagag caatggcaaa aggatggttt cgatttgaat tcaaaaaaaa cacatttccc
      841 taacgtcatt gaatctcagt taaaggagct atcgaacgaa tatacccttt acccttggga
      901 tctcttgccc ttcccacaga ttgatgaaaa tgttctattg actcattccc tttataacat
      961 ggaaaatgta aacatgtatt atcccaactt acgttctttg cagagtgcca cagagtcaat
     1021 actggttgat gatatggtca attcgttgca gagcttgatt tttgaaacta atagtatcat
     1081 aacaaatgtt gtgtcgatag gtaatctgtc taagagatgt agccatcttt tgaagaaacg
     1141 aatcgatgag catcaaacag agaatgattt attcatcaag ggtacgcttt atggtgaacg
     1201 aaccaactgt ggactagaaa tggacttgat tatcttggaa aggaataccg atcctataac
     1261 gccattgttg acacaactta cgtatgcagg aatactagat gatctatatg aattcaattc
     1321 tggcataaag ataaaggaga aagacatgaa cttcaattat aaggaagata aaatatggaa
     1381 tgatttgaaa tttttaaatt ttgggtcgat tgggccgcag ttaaataaat tggcaaagga
     1441 actacaaacg caatatgata caaggcataa agccgagagc gtacatgaaa tcaaagaatt
     1501 cgttgattcc ttaggttctt tgcaacaaag gcaagctttt ttgaaaaatc acacaacctt
     1561 atcatccgac gttttgaaag tggtagagac tgaagagtac ggatctttca ataaaatctt
     1621 agagttagag ctggaaattt tgatgggaaa tacacttaat aacgacattg aagatattat
     1681 actcgagttg cagtaccagt acgaggttga tcaaaagaag attctcagat taatctgttt
     1741 attgtctctt tgtaaaaatt cacttcgaga aaaggattat gaatatctaa gaacctttat
     1801 gatcgactct tggggcattg aaaaatgctt tcaacttgaa tcattggctg agttaggatt
     1861 tttcactagc aaaacgggaa aaactgattt gcatattaca acaagtaagt caacaagatt
     1921 acagaaagaa taccgttata tttcacaatg gttcaataca gtacccatag aagacgagca
     1981 tgctgccgat aaaatcacaa atgagaacga tgacttctcg gaagccactt ttgcttacag
     2041 tggtgtagtg cccttgacaa tgagactggt tcagatgtta tatgataggt ctatcttgtt
     2101 ccataattat tcctcgcagc agccttttat actgtcaaga gaacctagag tttctcaaac
     2161 ggaggattta attgaacagt tatatggaga ctcacatgcg atcgaagaga gtatatgggt
     2221 cccgggaacc attacaaaaa agatcaatgc aagcatcaag agcaataata gacggtccat
     2281 agacggatct aatgggacat ttcatgctgc agaggatatt gcactcgtag tattcctcgg
     2341 aggtgtaaca atgggtgaaa tagctataat gaagcatttg caaaaaatac taggtaaaaa
     2401 aggtatcaat aaaaggttta tcatcatcgc cgatggcttg atcaatggca caaggatcat
     2461 gaactctata tcttaattat tatatgatag atttgttaat tttttgtata tgcaaatgtg
     2521 ctttttttca ccaaacggtt tgcaccaatc atacgagaga agtgttcggt gtttacggaa
     2581 aagctagggg actaagaaaa attgaaaata aaggctgaca gcagtagaaa ccattgtgct
     2641 ggcttagtga tttataagaa tggttaatta gttttgtatc ctttaatttt ctaga
//
LOCUS       BOVCASA      1123 bp ss-mRNA            MAM       17-JUL-1990
DEFINITION  Bovine alpha-s1-casein mRNA, complete cds.
ACCESSION   M33123
KEYWORDS    alpha-s1-casein.
SOURCE      Bovine (strain Holstein) lactating mammary gland, cDNA to mRNA,
            clone p-alpha-s1 C228.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1123)
  AUTHORS   Nagao,M., Maki,M., Sasaki,R. and Chiba,R.
  TITLE     Isolation and sequence analysis of bovine alpha-s1-casein cDNA
            clone
  JOURNAL   Agric. Biol. Chem. 48, 1663-1667 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         64      708     alpha-s1-casein precursor
    sigp         64      108     alpha-s1-casein signal peptide
    matp        109      705     alpha-s1-casein
    mRNA    <     1     1123     alpha-s1-casein mRNA
BASE COUNT      331 a    247 c    223 g    322 t
ORIGIN      
        1 tcacttcgac catcaaccca gcttgctgtt cttcccagtc ttgggttcaa gatcttgaca
       61 accatgaaac ttctcatcct tacctgtctt gtggctgttg ctcttgccag gcccaaacat
      121 cctatcaagc accaaggact ccctcaagaa gtcctcaatg aaaatttact caggtttttt
      181 gtggcacctt ttccagaagt gtttggaaag gagaaggtca atgaactgag caaggatatt
      241 gggagtgaat caactgagga tcaagccatg gaagatatta agcaaatgga agctgaaagc
      301 atttcgtcaa gtgaggaaat tgttcccaat agtgttgagc agaagcacat tcaaaaggaa
      361 gatgtgccct ctgagcgtta cctgggttat ctggaacagc ttctcagact gaaaaaatac
      421 aaagtacccc agctggaaat tgttcccaat agtgctgagg aacgacttca cagtatgaaa
      481 gagggaatcc atgcccaaca gaaagaacct atgataggag tgaatcagga actggcctac
      541 ttctaccctg agcttttcag acaattctac cagctggatg cctatccatc tggtgcttgg
      601 tattacgttc cactaggcac acaatacact gatgccccat cattctctga catccctaat
      661 cccattggct ctgagaacag tgaaaagact actatgccac tgtggtgaag agtcaagtga
      721 attctgaggg actccacagt tatggtcttt gatgggtctg aaaattccat gctctacatg
      781 tcgcctcatc tacatgtcaa accattcatc caaaggcttc aactgctgtt ttagaacagg
      841 gcaatctcaa actgaggcac tccttgatgc tctactgtat tttagatagt gtaacatcct
      901 taagtgaaat tgtcctaaca gcttgttacc taaattccag tagtatcatg ctggtataaa
      961 ggccactgag tcaaagggaa ttaaagtctt cattaaattt ctgtatggaa aatgttttaa
     1021 aagcctttga atcacttctc ctgtaagtgc catcatatca aataattgtg tgcattaact
     1081 gagattttgt ctttcttctt ttcaataaat tacattttaa ggc
//
LOCUS       BPHINTXIS    1741 bp ds-DNA             PHG       17-JUL-1990
DEFINITION  Bacteriophage phi-11 integrase (int) and excisionase (xis) genes,
            complete cds.
ACCESSION   M34832
KEYWORDS    excisionase; integrase.
SOURCE      Bacteriophage phi-11 DNA.
  ORGANISM  Bacteriophage phi-11
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 1741)
  AUTHORS   Ye,Z.-H., Buranen,S.L. and Lee,C.Y.
  TITLE     Sequence analysis and comparison of int and xis genes from
            Staphylococcal bacteriophages l54a and phi-11
  JOURNAL   J. Bacteriol. 172, 2568-2575 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        267       67 (c) excisionase (xis)
    pept        379     1425     integrase (int)
BASE COUNT      650 a    264 c    263 g    564 t
ORIGIN      
        1 cctatgccag caccagtgaa actctattat gcatggtatt aaaatcgaag agtacaattc
       61 gataattcaa acattatttg acgaaatagc taagctgtct aatgtatata agtctcttaa
      121 taaacagtaa gcaaaatcgg attcttcatt acataccgaa tattcatcat aaacactgac
      181 tgcatcttct aagacatttt ttaaaattct aatgtcttca ttcgttaaaa ctaattcatt
      241 gaaattatga ttgtttttaa atgtcataac atcacctact ttttatttta ttatatcaca
      301 tttagtacct agtactaaat ttcgggtagc ccgcctaccc ttattatttt ttgccaattt
      361 tgaggaggga gaagcaaaat gccagtatat aaggatgata atacaggtaa atggtatttt
      421 tccattagat ataaagatgt atacggtaat aacaaacgaa aaatgaagcg tgggtttgaa
      481 cgtaagaaag atgccaaact agctgaaagc gaatttatac aaaatgttaa atatggatac
      541 tcggacaatc aaccctttga atatatattt tttgatcgtt taaaaaatga aaatctttct
      601 gcacgctcaa tagaaaagcg aactacagaa tataatactc acataaaaga aaggttcgga
      661 aatatcccta ttggcaaaat cactactacg caatgtactg ctttcaggaa ttatttgtta
      721 aacgatgcag gtctttctgt tgactatgca cgatctgtgt gggcaggttt taaagcagtt
      781 atcaattacg ccaaaaagca ttacaagctc ttatacgacc ccacattatc ggtaactcct
      841 attcccagaa caaaaccaca agctaaattt atcactcgtg aagaatttga tgaaaaagta
      901 gaacaaatca caaatgatac ttctcgtcag ctaactagac tgttatttta ttctggtctt
      961 agaataggag aagctttagc tttgcagtgg aaagattacg ataaaataaa aggcgaaatt
     1021 gacgtaaata agaaaatcaa tttaagtaat agaaaaattg aatataatct aaaaaaagaa
     1081 agctctaaag ggataatacc tgtaccaaat ttaattagag agatgcttaa aaacatgtat
     1141 aatgaatctt ctaaaagata taaatatttt gacgaaaact attttatatt cgggggttta
     1201 gaacctatta gatacgttac ttattcgtat cattttaaat ctgtattccc gaatctaaaa
     1261 atacaccatt taagacactc gtacgctagc tatttaatta ataatggtgt agatatgtat
     1321 ttattaatgg aattaatgag gcattctaac attacagaaa caattcaaac gtactctcat
     1381 ttatatactg ataaaaaaca tcaagctatg agcatatttg attaaacggt atcaaattgg
     1441 tatcaaataa caattaagga gtttataaaa tgcgtaataa caagcctaaa ataagtattc
     1501 aaaacgaccc atgggaagtg aaatttatat acatttaaat ttcatgagac aataaacgtt
     1561 gatttaatgc gtttttttgc cttttttatt ttccttattt tttctgtttt acaacaaaat
     1621 ggtatcaaaa atggtatcat ttgtagttat tttagcttca catattaaaa caaccacact
     1681 cctaaattaa taggtggtgt ggttttgttg gttgtgtggg gataaaaata accgcatcag
     1741 t
//
LOCUS       BSTNPRAS     3510 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  B.stearothermophilus neutral protease (nprS), and transcriptional
            activator (nprA) genes, complete cds.
ACCESSION   M34237
KEYWORDS    neutral protease; transcriptional activator.
SOURCE      B.stearothermophilus (strain TELNE) DNA, clone pSP53.
  ORGANISM  Bacillus stearothermophilus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 3510)
  AUTHORS   Nishiya,Y. and Imanaka,T.
  TITLE     Cloning and nucleotide sequences of the neutral protease gene and
            its transcriptional activator gene from Bacillus stearothermophilus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Nishiya, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        181     1401     transcriptional activator (nprA)
    pept       1750     3405     neutral protease (nprS) precursor
    sigp       1570     2274     neutral protease (nprS) signal and propeptide
    matp       2275     3219     neutral protease (nprS)
BASE COUNT     1177 a    581 c    737 g   1015 t
ORIGIN      
        1 tacggtcttc agacatttct attcctatag cccaaatgag tagttccttt tggaggagaa
       61 aatgtgtata atttttagta aatttatatt agtaaaaaat taagaaggag taggtattat
      121 ttgaagattg gtgatcgctt aaaattttcc cgtatcaaac ataagttaac gcaagaggaa
      181 gtggctgacg gaattatttc cgtatcatat ttatcaaaaa ttgaaaacaa tcaagtggtt
      241 ccaagtgaag aagtgcttcg cctcctttgt caacggttgg gaatcaacaa tatcctgaaa
      301 aatagacaag atgaattaac aagtaaattg ttattatggt acaaaacgat tacggataaa
      361 aaccgacagg aagcagcccg gatgtacgag gaaatcaaac gaactttcga tgacgtccag
      421 ggggcggaat ccatcgctta ctttctgttg tttgaaatgc gctatcactt gttattaaaa
      481 gatattcata ctgtcgaagc gttgttgatc aaattaaggg aattgtatga cacctttgat
      541 gatgtgatga agtattatta ttataaattt ttaggtctac tttactattg caaggaaaaa
      601 tatgaagatg ctttggaata ttataaaaag gcggagcagc gatttcgaag ccaatcattt
      661 gaaaaatggg aagaagctga tttgcattat ttactagcgc ttgtttatag ccggctctgg
      721 agaatattag gctgtattaa ctatgcgcag catgctttag cgatttacca atccgaatac
      781 gatttaaagc gaagcgctga atgccacatt ttacttggta tttgttacag aaggtacgga
      841 gaagtagatc aagcgatcga atgctattca ttggcccata aaattgccca aatcattaat
      901 gataccgaat tattaggtac gattgagcat aacctaggct acttaatgtc aatgaaacat
      961 gagcattatg aagccattca gcattataag aagagtttgc tgtataagcg aaactcttca
     1021 ttacaagcta gatttattac gttgttttct ctcatcaaag aatattatgt ttccaaaaac
     1081 tataaaaaag cattagccaa tgtagaggaa agtttgcagc ttctcaagag ggaaaaagat
     1141 gggatgacaa cgtattatga atattatctt catttcacag tttatcaata tttactatca
     1201 gaagatattt cggaaaatga atttgaaaca tttatgaaag atcgagtgct cccttatttt
     1261 caaaggttta aaaaatatga agatgttgca caatacgctg aatacttggc aatctattac
     1321 gagaaacgtc ataagtataa actagcaagc aaattctata aaatgagtta tcaatttcta
     1381 aaaaatatga taaatattta ggagggattt ttttgaaaaa gcttttatta ggaatcatga
     1441 cgtttggtat tatgagttta cttgttctca ttggtagtga ccaagaacca aaatatgtgg
     1501 caaaagacga acatccgcct ccaaccatca tcattgcagc gaaagatgaa catccaccag
     1561 caacgattat ttgaagagga ataagcaaaa agacagctag ttttctagct gtcttttttc
     1621 atgcatagga aaatgtgaaa aaaacgtagg gaattatcaa ctatatcaga ctctattttt
     1681 cccaatacaa aatactgtaa aatattgtgt ttaatattct aaatacaaag aataaaggag
     1741 gatgaaaaaa tgaaaaggaa aatgaaaatg aaattagtac gttttggtct tgcagcagga
     1801 ctagcggccc aagtattttt tttaccttac aatgcgctgg cttcaacgga acacgttaca
     1861 tggaaccaac aatttcaaac ccctcaattc atctccggtg atctgctgaa agtgaatggc
     1921 acatccccag aagaactcgt ctatcaatat gttgaaaaaa acgaaaacaa gtttaaattt
     1981 catgaaaacg ctaaggatac tctacaattg aaagaaaaga aaaatgataa ccttggtttt
     2041 acgtttatgc gcttccaaca aacgtataaa gggattcctg tgtttggagc agtagtaact
     2101 gcgcacgtga aagatggcac gctgacggcg ctatcaggga cactgattcc gaatttggac
     2161 acgaaaggat ccttaaaaag cgggaagaaa ttgagtgaga aacaagcgcg tgacattgct
     2221 gaaaaagatt tagtggcaaa tgtaacaaag gaagtaccgg aatatgaaca gggaaaagac
     2281 accgagtttg ttgtttatgt caatggggac gaggcttctt tagcgtacgt tgtcaattta
     2341 aactttttaa ctcctgaacc aggaaactgg ctgtatatca ttgatgccgt agacggaaaa
     2401 attttaaata aatttaacca acttgacgcc gcaaaaccag gtgatgtgaa gtcgataaca
     2461 ggaacatcaa ctgtcggagt gggaagagga gtacttggtg atcaaaaaaa tattaataca
     2521 acctactcta cgtactacta tttacaagat aatacgcgtg gaaatgggat tttcacgtat
     2581 gatgcgaaat accgtacgac attgccggga agcttatggg cagatgcaga taaccaattt
     2641 tttgcgagct atgatgctcc agcggttgat gctcattatt acgctggtgt gacatatgac
     2701 tactataaaa atgttcataa ccgtctcagt tacgacggaa ataatgcagc tattagatca
     2761 tccgttcatt atagccaagg ctataataac gcattttgga acggttcgca aatggtgtat
     2821 ggcgatggtg atggtcaaac atttattcca ctttctggtg gtattgatgt ggtcgcacat
     2881 gagttaacgc atgcggtaac cgattataca gccggactca tttatcaaaa cgaatctggt
     2941 gcaattaatg aggcaatatc tgatattttt ggaacgttag tcgaatttta cgctaacaaa
     3001 aatccagatt gggaaattgg agaggatgtg tatacacctg gtatttcagg ggattcgctc
     3061 cgttcgatgt ccgatccggc aaagtatggt gatccagatc actattcaaa gcgctataca
     3121 ggcacgcaag ataatggcgg ggttcatatc aatagcggaa ttatcaacaa agccgcttat
     3181 ttgattagcc aaggcggtac gcattacggt gtgagtgttg tcggaatcgg acgcgataaa
     3241 ttggggaaaa ttttctatcg tgcattaacg caatatttaa caccaacgtc caactttagc
     3301 caacttcgtg ctgccgctgt tcaatcagcc actgacttgt acggttcgac aagccaggaa
     3361 gtcgcttctg tgaagcaggc ctttgatgcg gtaggggtga aataaagtgg tatctcatca
     3421 gtgggggatt ttttcctcca ctgatgtttt gtttgtgatc ttttaatgat gtattggggt
     3481 gcaaaatgcc caaaggctta taatgttgat
//
LOCUS       HSEGP14      3347 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete
            cds.
ACCESSION   M34861
KEYWORDS    glycoprotein 14.
SOURCE      Equine herpesvirus type 1 DNA.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 3347)
  AUTHORS   Guo,P.
  TITLE     Characterization of the gene and an antigenic determinant of equine
            herpesvirus type-1 glycoprotein 14 with homology to gB-equivalent
            glycoproteins of other herpesviruses
  JOURNAL   Gene 87, 249-255 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        300     3239     glycoprotein 14 (gp14)
BASE COUNT      885 a    891 c    851 g    720 t
ORIGIN      
        1 tacaacggtt gaaacgtggt gtacgcatct caagagacta gctcgtttat gataactgcg
       61 gctaaaggtg aattggtcaa ttagcgaagt ttcaaaggtt ttattgcttt gaagggagtg
      121 acaggtgtga cggccacgca gcggctggcg tgaaatatat cggggagctc atcctagccg
      181 ccgcagtatt ctcctcggtt ttccactgtg gagaggtgcc tcctgcgcgc agatcgtacc
      241 tacccggact ccgcgccaca gtgctgcgtg agcggcattt acataaccta cgaggcgtca
      301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg
      361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct
      421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa
      481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct
      541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt
      601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac
      661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc
      721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg
      781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct
      841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat
      901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga
      961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata
     1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt
     1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg
     1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg
     1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt
     1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc
     1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag
     1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact
     1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg
     1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tcttacttta
     1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag
     1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta
     1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc
     1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg
     1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca
     1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa
     1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg
     1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac
     2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg
     2101 cgtggtgtcc cctccaaaac aaagagcggc ccctatggaa cgaaatggtg aagattaccc
     2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg
     2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc
     2281 gctcgatgga cagtaacact tgctactccc gcccccccgt aacatttaca attactaaga
     2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg
     2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag
     2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg
     2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc
     2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac
     2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg
     2701 acaataccgc agtgattatc aggggatcgc cagctttttc aagggcctgg gtaaagtggg
     2761 ggaggccgtg ggaacgctcg ttctcggcgc gcggcgctgt tgtttcaacc gtatctggaa
     2821 tagcttgctt tttaaacaac ccatttgggg ggctagccat cggcctgctg gtaatcgccg
     2881 gcctggtagc tgcgtttttt gcttacagat atgtaatgca gatccgcagt aaccccatga
     2941 aagctctata ccccataaca acaaaggcct tgaaaaacaa agccaaaact tcctacggcc
     3001 agaacgagga ggacgatggg agcgactttg atgaggccaa gcttgaagag gctcgcgaaa
     3061 tgatcaaata catgtctatg gtttcggccc tggaaaagca ggaaaagaaa gctataaaga
     3121 aaaacagtgg ggttggcctg atcgccagta acgtctcaaa gctggccctg cgaaggcgcg
     3181 gtcccaaata tacccgactc caacagaacg ataccatgga aaatgaaaaa atggtttaaa
     3241 catgtttaat aaatattatg acacgtactc aaagtgtgac ctcatatttg cataaccact
     3301 tctagttccg gcccaaggat atttaagcct agtatctccg ccgaagg
//
LOCUS       HUMHBGAA     1227 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human A-gamma-globin gene, 3' end.
ACCESSION   M33200
KEYWORDS    A-gamma-globin.
SOURCE      Human (hereditary persistence of fetal hemoglobin individual II-1)
            DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1227)
  AUTHORS   Gelinas,R.E., Rixon,M., Magis,W. and Stamatoyannopoulos,G.
  TITLE     Gamma gene promoter and enhancer structure in Seattle variant of
            hereditary persistence of fetal hemoglobin
  JOURNAL   Blood 71, 1108-1112 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1        3     A-gamma-globin (AA at 1)
                                 /hgml_locus_uid="LK0092S" /nomgen="HBG1"
                                 /map="11p15.5"
    mut         794      794     t in wt; c in mutant
    mut         970      970     c in wt; a in mutant
    mut        1186     1186     a in wt; g in mutant
BASE COUNT      366 a    204 c    294 g    363 t
ORIGIN      
        1 tgagcctctt gcccatgatt cagagctttc aaggataggc tttattctgc aagcaataca
       61 aataataaat ctattctgct gagagatcac acatgatttt cttcagctct tttttttaca
      121 tctttttaaa tatatgagcc acaaagggtt tatattgagg gaagtgtgta tgtgtatttc
      181 tgcatgcctg tttgtgtttg tggtgtgtgc atgctcctca tttattttta tatgagatgt
      241 gcattttgtt gagcaaataa aagcagtaaa gacacttgta cacgggagtt ctgcaagtgg
      301 gagtaaatgg tgtaggagaa atccggtggg aagaaagacc tctataggac aggacttctc
      361 agaaacagat gttttggaag agatgggaaa aggttcagtg aagacctggg ggctggattg
      421 attgcagctg agtagcaagg atggttctta atgaagggaa agtgttccaa gctttaggaa
      481 ttcaaggttt agtcaggtgt agcaattcta ttttattagg aggaatacta tttctaatgg
      541 cacttagctt ttcacagccc ttgtggatgc ctaagaaagt gaaattaatc ccatgccctc
      601 aagtgtgcag attggtcaca gcatttcaag ggagagacct cattgtaaga ctctggggga
      661 ggtggggact taggtgtaag aaatgaatca gcagaggctc acaagtcagc atgagcatgt
      721 tatgtctgag aaacagacca gcactgtgag atcaaaatgt agtgggaaga atttgtacaa
      781 cattaattgg aaggtttact taatggaatt tttgtatagt tggatgttag tgcatctcta
      841 taagtaagag tttaatatga tggtgttacg gacctaatgt ttgtgtctcc tcaaaattca
      901 catgctgaat ccccaactcc caactgacct tatctgtggg ggaggctttt gaaaagtaat
      961 taggtttagc tgagctcata agagcagatc cccatcataa aattattttc cttatcagaa
     1021 gcagagagac aagccatttc tctttcctcc cggtgaggac acagtgagaa gtccgccatc
     1081 tgcaatccag gaagagaacc ctgaccacga gtcagccttc agaaatgtga gaaaaaactc
     1141 tgttgttgaa gccacccagt cttttgtatt ttgttatagc accttacact gagtaaggca
     1201 gatgaagaag gagaaaaaaa taagctt
//
LOCUS       HUMHBQ1A     1114 bp ds-DNA             PRI       17-JUL-1990
DEFINITION  Human theta-1-globin gene, complete cds.
ACCESSION   M33022
KEYWORDS    theta-1-globin.
SOURCE      Human black female with alpha-thal-2 heterozygosity white cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1114)
  AUTHORS   Gonzalez-Redondo,J.M., Han,I.S., Gu,Y.-C. and Huisman,T.H.J.
  TITLE     Nucleotide sequence of the human theta-1-globin gene
  JOURNAL   Biochem. Genet. 26, 207-211 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        359      453     theta-1-globin, exon 1
                                 /hgml_locus_uid="LV0155X" /nomgen="HBQ1"
                                 /map="16p13.3"
                538      742     theta-1-globin, exon 2
                852      980     theta-1-globin, exon 3
    IVS         454      537     theta-1-globin intron A
    IVS         743      851     theta-1-globin intron B
BASE COUNT      166 a    386 c    393 g    169 t
ORIGIN      
        1 atcccagtta ctcgggaggc tgaggcagga gaatcgtttg aacccgggag gcggaggttg
       61 cagtgagccg gaatggcgcc actgcactca ccgcacccgg ccaatttttg tgtttttagt
      121 agagactaaa taccatatag tgaacaccta agacgggggg ccttggatcc agggcgattc
      181 agagggcccc ggtcggagct gtcggagatt gagcgcgcgc ggtcccggga tctccgacga
      241 ggccctggac ccccgggcgg cgaagctgcg gcgcggcgcc ccctggaggc cgcgggaccc
      301 ctggccggtc cgcgcaggcg cagcggggtc gcagggcgcg gcgggttcca gcggggggat
      361 ggcgctgtcc gcggaggacc gggcgctggt gcgcgccctg tggaagaagc tgggcagcaa
      421 cgtcggcgtc tacacgacag aggccctgga aaggtgcggc aggctgggcg cccccgcccc
      481 caggggccct ccctccccaa gccccccgga cgcgcctcac ccacgttcct ctcgcaggac
      541 cttcctggct ttccccgcca cgaagaccta cttctcccac ctggacctga gccccggctc
      601 ctcacaagtc agagcccacg gccagaaggt ggcggacgcg ctgagcctcg ccgtggagcg
      661 cctggacgac ctaccccacg cgctgtccgc gctgagccac ctgcacgcgt gccagctgcg
      721 agtggacccg gccagcttcc aggtgagcgg ctgccgtgct gggcccctgt ccccgggagg
      781 gccccggcgg ggtgggtgcg gggggcgtgc ggggcgggtg caggcgagtg agccttgagc
      841 gctcgccgca gctcctgggc cactgcctgc tggtaaccct cgcccggcac taccccggag
      901 acttcagccc cgcgctgcag gcgtcgctgg acaagttcct gagccacgtt atctcggcgc
      961 tggtttccga gtaccgctga actgtgggtg ggtggccgcg ggatccccag gcgaccttcc
     1021 ccgtgtttga gtaaagcctc tcccaggagc agccttcttg ccgtgctctc tcgaggtcag
     1081 gacgcgagag gaaggcgccg cccctcccca agga
//
LOCUS       HUMITIH1A    1149 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human inter-alpha-trypsin inhibitor heavy chain mRNA, partial cds.
ACCESSION   M33033
KEYWORDS    inter-alpha-trypsin inhibitor heavy chain.
SOURCE      Human, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1149)
  AUTHORS   Salier,J.-P., Diarra-Mehrpour,M., Sesbouee,R., Bourguignon,J. and
            Martin,J.-P.
  TITLE     Human inter-alpha-trypsin inhibitor: Isolation and characterization
            of heavy (H) chain cDNA clones coding for a 383 amino-acid sequence
            of the H chain
  JOURNAL   Biol. Chem. Hoppe-Seyler 369, 15-18 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >  1149     inter-alpha-trypsin inhibitor heavy chain (AA
                                 at 1) /hgml_locus_uid="LE0221G"
                                 /nomgen="ITIH1" /map="3p21.2-p21.1"
BASE COUNT      332 a    292 c    253 g    272 t
ORIGIN      
        1 ggaggcacaa acatcaacga agcactccta cgggcaatct tcattttgaa tgaagccaat
       61 aacttgggac tgttagaccc caactccgtc tcgctgatca ttttggtttc tgatggagat
      121 ccaacagtgg gcgaactaaa actgtcaaaa attcagaaaa acgttaagga gaacatccaa
      181 gacaatatct ccttgttcag tttgggcatg ggatttgatg tggactatga ttttttgaag
      241 agactgtcca atgaaaacca tggaattgca caaaggattt atggaaacca ggacacgtct
      301 tcccagctta agaaattcta caaccaggtc tccactccat tgctccggaa tgttcagttc
      361 aactatcccc atacatcagt cacggacgtc actcaaaaca atttccataa ctactttgga
      421 ggctcagaga ttgtggtggc aggaaaattt gaccctgcta aattggatca aatagagagc
      481 gttatcacgg cgacttcggc taacacgcag ttagtcttgg agaccctggc ccagatggac
      541 gacttgcagg attttctatc gaaagacaag catgcagatc ccgatttcac caggaaactg
      601 tgggcctatc taaccatcaa ccaactgcta gctgaacgaa gcctggctcc tacagctgcc
      661 gccaagagaa gaattacaag atcgatcctg cagatgtctc tagaccacca cattgtgact
      721 ccgctgacct cgctggtgat cgagaacgag gctggggatg agcgcatgct ggcggatgcc
      781 ccaccgcagg atccctcctg ctgctcaggg gccctgtatt acggcagcaa agtggttcca
      841 gattccaccc cgtcttgggc caatccttca gcaacgcccg tgatctccat gctggcacaa
      901 ggatctcagg tgctagagtc cacgccaccc ccacatgtga tgagagttga aaatgaccca
      961 cattccatca tttatctacc aaaaagccaa aagaacattt gtttcaatat tgactcagaa
     1021 cctggaaaaa tcctcgacct ggcttctgac ccagaatcag gaattgtagt caacggtcag
     1081 cttgttggtg ccaagaagcc caacaatgga aaactaagca cctattttgg aaaactggga
     1141 ttttatttc
//
LOCUS       HUMPTHROM     327 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human thrombin mRNA, 5'end.
ACCESSION   M33031
KEYWORDS    serine protease; thrombin.
SOURCE      Human, cDNA to mRNA, clone pIIH13.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 327)
  AUTHORS   MacGillivray,R.T.A., Irwin,D.M., Guinto,E.R. and Stone,J.C.
  TITLE     Recombinant genetic approaches to functional mapping of thrombin
  JOURNAL   Ann. N.Y. Acad. Sci. 485, 73-79 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         28  >   327     thrombin precursor /hgml_locus_uid="LD0134L"
                                 /nomgen="F2" /map="11p11-q12"
    sigp         28      156     thrombin signal peptide
    matp        157  >   327     prothrombin
BASE COUNT       60 a     97 c    109 g     61 t
ORIGIN      
        1 ccgtagtgac ccaggagctg acacactatg gcccgcatcc gaggcttgca gctgcctggc
       61 tgcctggccc tggctgccct gtgtagcctt gtgcacagcc agcatgtgtt cctggctcct
      121 cagcaagcac ggtcgctgct ccagcgggtc cggcgagcca acaccttctt ggaggaggtg
      181 cgcaagggca acctggagcg agagtgcgtg gaggagacgt gcagctacga ggaggccttc
      241 gaggctctgg agtcctccac ggctacggat gtgttctggg ccaagtacac agcttgtgag
      301 acagcgagga cgcctcgaga taagctt
//
LOCUS       MUSCC3A       312 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse complement component C3 mRNA, partial cds.
ACCESSION   M33032
KEYWORDS    complement component C3.
SOURCE      Mouse liver, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Fey,G.H., Wiebauer,K. and Domdey,H.
  TITLE     Amino acid sequences of mouse complement C3 derived from nucleotide
            sequences of cloned cDNA
  JOURNAL   Ann. N.Y. Acad. Sci. 421, 307-312 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     complement component C3 precursor (AA at 1)
    matp    <     1       27     complement component C3-beta subunit (AA at 1)
    matp         40      273     complement component C3-alpha subunit
    matp        274  >   312     complement component C3-alpha' subunit
BASE COUNT       90 a     77 c     90 g     55 t
ORIGIN      
        1 gatcttgagt gcaccaagcc agcagcccgc cgccgtcgct cagtacagtt gatggaaaga
       61 aggatggaca aagctggtca gtacactgac aagggtcttc ggaagtgttg tgaggatggt
      121 atgcgggata tccctatgag atacagctgc cagcgccggg cacgcctcat cacccagggc
      181 gagaactgca taaaggcctt catagactgc tgcaaccaca tcaccaagct gcgtgaacaa
      241 cacagaagag accacgtgct gggcctggcc aggagtgaat tggaggaaga cataattcca
      301 gaagaagata tt
//
LOCUS       MUSN038A     1260 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse nucleolar protein N038 mRNA, complete cds.
ACCESSION   M33212
KEYWORDS    nucleolar protein N038.
SOURCE      Mouse teratocarcinoma stem cell line F9, cDNA to mRNA, clone
            lambda-FML-185.19.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1260)
  AUTHORS   Schmidt-Zachmann,M.S. and Franke,W.W.
  TITLE     DNA cloning and amino acid sequence determination of a major
            constituent protein of mammalian nucleoli: Correspondence of the
            nucleoplasmin-related protein N038 to mammalian protein B23
  JOURNAL   Chromosoma 96, 417-426 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         79      957     nucleolar protein N038
    mRNA    <     1     1260     nucleolar protein N038 mRNA
BASE COUNT      419 a    214 c    306 g    321 t
ORIGIN      
        1 ggcgcgtctg ttctgtggaa caggaggcag ttgttttccg tccggcttct cccacaccga
       61 agtgcgcgcc tccacctcat ggaagactcg atggatatgg acatgagtcc tcttaggcct
      121 cagaactacc ttttcggctg tgaactaaag gctgacaaag actatcactt taaagtggat
      181 aatgatgaaa atgagcacca gttgtcatta agaacggtca gtttaggagc aggggcaaaa
      241 gatgagttac acatcgtaga ggcagaagca atgaactatg aaggcagtcc aattaaagta
      301 acactggcaa ctttgaaaat gtctgtacaa ccaacagttt ccctaggggg ctttgaaatt
      361 acaccacctg tggtcttacg gttgaagtgt ggttcagggc ctgtgcacat tagtggacag
      421 catctagtag ctgtagagga agatgcagag tctgaagatg aagatgagga ggacgtaaaa
      481 ctcttaggca tgtctggaaa gcgatctgct cctggaggtg gtaacaaggt tccacagaaa
      541 aaagtaaaac ttgatgaaga tgatgaggac gatgatgagg acgatgagga tgatgaggat
      601 gatgatgatg atgattttga tgaagaggaa actgaagaaa aggtcccagt gaagaaatct
      661 gtacgagata ccccagccaa aaatgcacaa aaatcaaacc aaaatggaaa agacttaaaa
      721 ccatcaacac cgagatcaaa gggtcaagag tccttcaaaa aacaggaaaa gactcctaaa
      781 acaccaaaag gacctagttc tgtagaagac attaaggcaa aaatgcaagc aagtatagaa
      841 aaaggcggtt ctcttcccaa agtggaagcc aagttcatta attatgtgaa gaattgtttc
      901 cggatgactg accaggaggc tattcaagat ctctggcagt ggaggaaatc tctttaagaa
      961 aagggtttaa acagtttgaa atattctgtc ttcatttctg taatagttaa tatctggctg
     1021 tcctttttat aatgcaaagt gagaactttc cctactgtgt ttgataaatg ttgtccaggt
     1081 tcacttgcca agaatgtgtt gtctaaaatg cctgtttagt tttcaaggat ggaactccac
     1141 cctttacttg gttttaagta tgtatggaat gttatgatag gacatagtaa tagtggtcag
     1201 atgtggaaat ggtagggaga caaatataca tgtgaaataa actcagtatt ttaataaagt
//
LOCUS       RATPOS       1804 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Rat type-2A protein phosphatase catalytic subunit mRNA, complete
            cds.
ACCESSION   M33114
KEYWORDS    type-2A protein phosphatase catalytic subunit.
SOURCE      Rat liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1804)
  AUTHORS   Kitagawa,Y., Tahira,T., Ikeda,I., Kikuchi,K., Tsuiki,S.,
            Sugimura,T. and Nagao,M.
  TITLE     Molecular cloning of cDNA for the catalytic subunit of rat liver
            type 2A protein phosphatase, and detection of high levels of
            expression of the gene in normal and cancer cells
  JOURNAL   Biochim. Biophys. Acta 951, 123-129 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        114     1043     type-2A protein phosphatase catalytic subunit
    mRNA    <     1     1804     type-2A protein phosphatase catalytic subunit
                                 mRNA
BASE COUNT      482 a    391 c    437 g    494 t
ORIGIN      
        1 ctggggccgc aggaagcacc ccggggagcg gcggcggcgt gtgcgtgtgg cccgggtgcg
       61 ggcggcggcg cgggagcagc gcagagcggc agccggttcg ggcgggcggc atcatggacg
      121 agaagttgtt caccaaggag ctggaccagt ggatcgagca gctgaacgag tgcaagcagc
      181 tctccgagtc ccaggtcaag agcctctgcg agaaggctaa agaaatcctg acaaaagaat
      241 ctaatgttca ggaggttcga tgtccagtca ctgtgtgtgg agatgtgcat gggcaatttc
      301 atgacctcat ggaactcttt agaattggtg gtaaatcacc agatacaaat tacttgttta
      361 tgggagacta tgtggacaga ggatattact cagttgaaac agttacactg cttgtagctc
      421 ttaaggttcg ttaccgagag cgtatcacca tactccgagg gaatcacgag agcagacaga
      481 tcacacaagt ttatggtttc tacgatgagt gtttaaggaa atacggaaat gcaaatgttt
      541 ggaaatactt cacagacctt tttgactacc ttcctctcac tgccttggtg gatgggcaga
      601 tcttctgtct acatggtggt ctttcaccat ccatagacac actggatcac atccgagcac
      661 ttgatcgcct acaagaagtt cctcatgagg gtccaatgtg tgacttgctg tggtcagatc
      721 cagatgaccg tggtggctgg gggatatctc ctcggggagc tggttatacc tttggccaag
      781 atatttctga gacatttaat catgccaatg gcctcacgtt ggtgtccaga gctcaccagc
      841 tggtgatgga gggatataac tggtgccatg accggaatgt agtaacaatt ttcagtgctc
      901 caaactattg ctatcgttgt ggtaaccaag ctgcaatcat ggaacttgat gacactctta
      961 agtattcttt cttgcagttc gatccagcac ctcgtagagg cgagccacat gtcactcgtc
     1021 gtaccccaga ctacttcctg taatgaaagt ttaaccttgt acagtattgc catgaacacc
     1081 gtctgttgac ctaatggaat cgggaagagc agcagtaact ccaaagtgtc agaaatagtt
     1141 aacattcaaa cttgtttcca cacggaccaa aagatgtgcc atataaaata caaagcctct
     1201 tgtcatcaac agccgtgacc actttagaat gaaccagttc attgcatgct gacgcgacat
     1261 tgttggtcaa gaatccagtt tctggcatag cgctatttgt agttactttt gctttcttga
     1321 gagactgcag atctaggatg taacattaac acctgtgagt ccagttgact tccacttagc
     1381 tgtagcttac tcagcatgac tgtagatgag gatagcaaac aatcattgga gcttaatgaa
     1441 catttttaaa tgagtaccaa ggcctcccct cttgttgtgt tctttcaggg atactattaa
     1501 tttaattgta tgatttctct gcactcagtt tctcccttct caaatctcgg ccccgcgttg
     1561 ttctttgtta ctgtcagaaa acctggtgag ttgttttgaa cagaactgtc tccctcctgt
     1621 aagatgatgt actgcacaag tcaccgcagt gttttcataa taaacttgag aactgagaaa
     1681 gtcaggtttg aattgtatca gtgggcacga ctggtgctgt ttattaaaca agataaatct
     1741 attgatcaat ttcagaattt gtagaattcc aggtaaagaa aaataaagat caaggccact
     1801 atat
//
LOCUS       RATSCP2      1409 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Rat sterol carrier protein-2 (SCP-2)mRNA, complete cds.
ACCESSION   M34728
KEYWORDS    sterol carrier protein-2.
SOURCE      Rat liver, cDNA to mRNA, clone SP43.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1409)
  AUTHORS   Billheimer,J.T., Strehl,L.L., Davis,G.L., Strauss,J.F.III. and
            Davis,L.G.
  TITLE     Characterization of a cDNA encoding rat sterol carrier protein-2
  JOURNAL   DNA Cell Biol. 9, 159-165 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        307     1128     sterol carrier protein-2 (SCP-2)
    mRNA    <     1     1409     sterol carrier protein-2 mRNA
BASE COUNT      387 a    300 c    390 g    332 t
ORIGIN      2 bp upstream of EcoRI site.
        1 ggaattccga acaaaggttg aacactttgc aaaaattgga tggaaaaatc ataaacactc
       61 agttaataac ccgtattccc agttccaaga tgaatacagc ttagatgaga taatgaaatc
      121 aaggccagtt ttcgattttc tgactgtctt acaatgctgt cccacctcag atggtgccgc
      181 agcagcaatt gtgtctagtg aggagtttgt gcagaagcat ggcctgcagt ccaaagctgt
      241 ggaaattgtg gcacaggaga tggtgactga catgcccagt acatttgaag aaaaagtgtt
      301 attaaaatgg ttggctatga tatgagtaaa gaagctgcca ggaagtgcta tgagaagtcc
      361 ggcctgggtc ccagtgatgt cgacgtgata gagcttcacg attgcttctc taccaatgaa
      421 ctcctgactt atgaagcact ggggctctgt ccagaaggac aaggtggagc actggtggac
      481 agaggggaca acacttacgg aggaaagtgg gtcataaacc ctagtggagg cctcatctcc
      541 aagggacacc cactgggtgc cacaggtctg gctcagtgcg cggagctctg ctggcagctg
      601 agaggcgaag ccggaaagag gcaggttcct ggggcaaagg tggctctgca gcacaattta
      661 ggccttggag gagctgctgt tgtcaccctc tacagaatgg gttttcccga agctgccagc
      721 tccttcagaa cgcaccagat ttcagctgct cccaccagct ctgcagggga tggattcaag
      781 gcaaatctca tttttaagga aatcgagaag aagcttgaag aggaagggga agagttcgtg
      841 aagaaaatcg gtggcatttt tgccttcaaa gtgaaggatg gccccggggg caaagaagct
      901 acgtgggtgg tggacgtgaa gaacggcaaa ggatcggtgc ttccggattc agataagaag
      961 gctgactgca caatcaccat ggctgactca gacttgctgg ctttgatgac tggtaaaatg
     1021 aaccctcagt cggccttctt tcaaggtaaa ctgaaaattg ccggtaacat gggcctggcc
     1081 atgaaactgc aaagcctgca gcttcagccg gacaaagcta agctgtgaag agtccctttg
     1141 gcaacctcag gacatcaaga tgagatgtgt ggatacgtag aaatccacgt ctccctgtca
     1201 ggacttagac tgacacttcc tgaatagcat gagatagatt tcttgctagg tggctatggc
     1261 caattgtatt tcccccaagc tgggggtgca aagggcctcc caggctacac tgctgctttg
     1321 aggacttgca ttctactgtg cttcatgaag ctactatgtt aatgatggtt tggggtaaac
     1381 ttgagtttca gaataaagtt cagaatagt
//
LOCUS       SYNPSBAII     556 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  Synechococcus sp. photosystem II D1 protein (psbAII) gene, 5' end.
ACCESSION   M34833
KEYWORDS    D1 protein; photosystem II.
SOURCE      Synechococcus (strain PCC 7942) DNA.
  ORGANISM  Synechococcus sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Chroococcales.
REFERENCE   1  (bases 1 to 556)
  AUTHORS   Bustos,S.A., Schaefer,M.R. and Golden,S.S.
  TITLE     Different and rapid responses of four cyanobacterial psbA
            transcripts to changes in light intensity
  JOURNAL   J. Bacteriol. 172, 1998-2004 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         81      425     ORF1
    pept        527  >   556     photosystem II D1 protein (psbAII)
    mRNA         59  >   556     psbAII mRNA (alt.)
    mRNA        478  >   556     psbAII mRNA (alt.)
BASE COUNT      136 a    145 c    142 g    133 t
ORIGIN      
        1 ttccgtgacg gctactgcca gcatgccgag cctgatgtgt gacacctaag atcactccag
       61 ttctctttgg aaactggctg atgagtgaag acaccatctt tggcaagatc atccggcgcg
      121 agattccagc agacattgtt tatgaagatg atctctgtct ggcttttcga gatgtggcac
      181 cccaagcgcc ggttcacatt ctggtgattc ccaagcaacc aattgccaac cttttggaag
      241 cgacagcaga acatcaagcg ctgctgggtc atttgttgct gactgtaaag gcgatcgcgg
      301 cccaagaagg actcaccgag ggctaccgca ccgtgattaa cacgggccct gcgggtgggc
      361 aaaccgttta ccacctgcat attcacttac tgggcgggcg atcgctggct tggccgcccg
      421 gctgagaaaa gtctgaaagt tctttacaaa actcaatctg cttgttagat tttactcacg
      481 aggctattaa gtctcgtaaa tagttcaact aaggactcat cgcaaaatga cgactgcatt
      541 gcagcggcgc gagagc
//
LOCUS       ACCPCACGH    2391 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  A.calcoaceticus gamma-carboxymuconolactone decarboxylase (pcaC) and
            protocatechuate 3,4-dioxygenase alpha and beta subunit (pcaGH)
            genes, complete cds.
ACCESSION   M33798
KEYWORDS    gamma-carboxymuconolactone decarboxylase;
            protocatechuate 3,4-dioxygenase.
SOURCE      A.calcoaceticus DNA, clone pZR2.
  ORGANISM  Acinetobacter calcoaceticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 2391)
  AUTHORS   Hartnett,C., Neidle,E.L., Ngai,K.-L. and Ornston,N.
  TITLE     DNA sequences of genes encoding Acinetobacter calcoaceticus
            protocatechuate 3,4-dioxygenase: Evidence indicating shuffling of
            genes and of DNA sequences within genes during their evolutionary
            divergence
  JOURNAL   J. Bacteriol. 172, 956-966 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        324      728     gamma-carboxymuconolactone decarboxylase
                                 (pcaC) (EC 4.1.1.44)
    pept        750     1463     protocatechuate 3,4-dioxygenase, alpha subunit
                                 (pcaH) (EC 1.13.11.3)
    pept       1491     2120     protocatechuate 3,4-dioxygenase, beta subunit
                                 (pcaG) (EC 1.13.11.3)
    ORF     <     1      203     ORF (AA at 3)
    binding     312      315     ribosomal binding site (put.)
    binding     737      740     ribosomal binding site (put.)
    binding    1478     1482     ribosomal binding site (put.)
BASE COUNT      670 a    463 c    533 g    725 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttatca aacccaacgt tacttgctct gtttcatttt atgtgcaggt attgccgtca
       61 atggtgctca atcttcaatg ccagtattga gtgcgcgttt ctatccaacg caatgtcgtg
      121 caacaggtgt ggcatggatg tctggaattg gccgttttgg tcgccgtatt tggtgcatgg
      181 attggtgcag ttttattagg taataactgg tcatttacca tgatcctgag catgttgata
      241 tcctgctcgt gccgctgcaa ttgcaatttt tgttaaatcc cttgttggca catacagatg
      301 caacttgatg tgaggtttct cttatgaatg atgaacaacg ttataaacaa gggctggaag
      361 tccgtacaga agttctgggt gaaaagcatg ttaaccgttc acttgagaac ctaaacgact
      421 tcaatcagga ttttcaaaac tttattagcc gttttgcatg gggagaggtt tggtctcgtc
      481 ctggacttcc tcgtcatacg cgtagtttgg tcacaattgc agtgttgttg gcgcttgggc
      541 gtgaagatga gctgcgtatc gatttacgtg cctgtttcaa caatggtgtg accaaagatg
      601 aattaaaaga gttgattcta cattgttcac tttatgcagg cttacccgct tcaaatgctg
      661 cgatgcatat ggcagaagag gtctttaagg acttgggaat tgcacctgaa aaagtaaaca
      721 aggattaatg gatttagagg gagtagaaaa tgtctcaaat tatttgggga gcttacgctc
      781 agtcgaatac cgaagatcat ccgccagctt atcgaccagg ttataaaacc agtgtgttac
      841 gctcgcctaa gaatcgactc atttctattg cagaaacttt aagtgaagtc actgcaccac
      901 attttagtgc tgataaattt ggtccaaaag acaatgactt aatcttgaac tatgccaaag
      961 atggtttgcc gattggtgaa cgtgtcattg tgcatggtta tgtacgtgac cagtttggtc
     1021 gacctgtaaa aaatgcactt gttgaagtat ggcaagccaa tgcatctggt cgttatcgtc
     1081 atccaaatga tcaatatatc ggtgccatgg atcctaattt cggtggttgt ggccgtatgt
     1141 tgactgatga taatggttat tatgtattcc gtacgattaa accaggtcca tacccatggc
     1201 gtaaccgtat caatgaatgg cgtcctgctc acattcactt ctctttaatt gcagatggtt
     1261 gggcacaacg tttaatttcg cagttctatt ttgaaggcga tacattgatt gattcttgcc
     1321 caatcctgaa aaccattcct tctgagcaac aacgtcgtgc gctaattgca ttagaagaca
     1381 agagcaattt cattgaagct gacagccgtt gttatcgctt tgacattacg cttcgtggtc
     1441 gcgcggactt acttcgaaaa tgacttaact taatggatgg agtaaaaaac atgaatggtt
     1501 ggaattttca ggaactcaaa gaaaccccat ctcaaacagg tggtccatat gtccatattg
     1561 gtcttttgcc aaaacaagcc aatatcgaag tatttgaaca caatttaggt aacaaccttg
     1621 tacaggacaa tacgcaaggt caacgcatcc gattagaagg tcaagtgttt gatgggctta
     1681 gtttacctct acgtgacgta ctgattgaaa tctggcaagc agataccaat ggcgtttacc
     1741 caagtcaagc tgatactcaa ggtaaacaag tcgatccgaa ctttttgggt tggggccgta
     1801 caggtgcaga ttttggtaca ggcttctgga gttttaatac cattaagcca ggtgctgttc
     1861 caggccgtaa aggttcaact caagcaccac atatttcact gattattttt gcgcgtggta
     1921 tcaacattgg tcttcacact cgtgtctatt ttgatgacga agcagaagca aatgcaaaag
     1981 atcctgtttt aaacagtatc gagtgggcga cacgtcgtca aacactggtt gccaaacgtg
     2041 aagagcgtga tggtgaagtg gtttatcgtt ttgatattcg tattcaaggt gaaaacgaaa
     2101 cggtattttt cgatatctaa atttttgact atggaacggg ccgtatatct ttcggtatac
     2161 agcctccata attttacaag gaaacgtaaa tgttcaatac aaacaaactg gttaaacaac
     2221 tttgtctaag tggtgcgttg ttgatgtcat tgcctatatt gtcaacgacc tatgcggctg
     2281 aaaatactgt accagcatct aaatcaactt atgttgtaaa aaatttaaac attggtgatt
     2341 tacctgttaa aaccctcgtg ccaatcactg ctaaaacaag agagcaagct t
//
LOCUS       HAMSAA3A      530 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  M.auratus serum amyloid A (SAA3) mRNA, complete cds.
ACCESSION   M33431
KEYWORDS    serum amyloid A protein.
SOURCE      M.auratus (strain LVG) embryo fibroblast, cDNA to mRNA, clone
            pAA825.
  ORGANISM  Mesocricetus auratus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 530)
  AUTHORS   Gervais,C. and Suh,M.
  TITLE     Serum amyloid A protein-related mRNA expression in herpes simplex
            virus type 2 transformed hamster cells
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Suh, 30-MAR-1990.
FEATURES       from  to/span     description
    pept         42      410     serum amyloid A
    signal      510      515     polyA signal
BASE COUNT      147 a    122 c    140 g    121 t
ORIGIN      
        1 ggggacctct agagtcactc accaggcatc tcgaaagcag gatgaagcct ttccttgcca
       61 tcattttttg cttcttgatc ctgggagttg acagccaaag gtggttccag ttcatgaaag
      121 aagctggcca agggagtaca gacatgtgga gagcttactc tgacatgaga gaggccaact
      181 ggaaaaactc agataaatac ttccatgccc gggggaacta tgatgctgcc aaaaggggac
      241 caggaggagc ctgggctgct aaagtgatca gtgatgccag agagggtatt cagaggttca
      301 caggacgtgg agcagcggac tcaagagctg accagtttgc caacaaatgg ggccggagtg
      361 gtaaagaccc caaccacttt cgacctgctg gcctgcctag taaatactga gtttcatctt
      421 catgctattc ccagccatgc accccaaggt tgcatttact agattaagtt agttcctgag
      481 ctctggacgt gtaagcaccc gaaatttaga ataaatacct gtgaaatgct
//
LOCUS       HEIBEXBA      795 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  H.influenza (strain RM153) capsulation protein (bexB) gene, 5' end.
ACCESSION   M33787
KEYWORDS    capsulation protein.
SOURCE      H.influenza (type b, strain RM153) DNA.
  ORGANISM  Haemophilus influenzae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 795)
  AUTHORS   Kroll,J.S. and Moxon,E.R.
  TITLE     Capsulation in distantly related strains of Haemophilus influenzae
            type b: genetic drift and gene transfer at the capsulation locus
  JOURNAL   J. Bacteriol. 172, 1374-1379 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >   795     capsulation protein (bexB)
BASE COUNT      188 a    133 c    177 g    297 t
ORIGIN      
        1 atgcaatatg gtgataaaac aacatttaaa cagtcattag ctattcaagg acgggtgatt
       61 aacgcgttgc ttatgcgtga aataatcact cgttatgggc gtcaaaatat tggctttttt
      121 tggctatttg ttgagccttt gctcatgact ttttttattg taatgatgtg gaaatttatc
      181 cgtgctgata aattttctac tttaaatatg attgcctttg tgatgacggg gtatccaatg
      241 gcaatgatgt ggcgaaatgc ttcaaaccgt gcgattggtt ccatttctgc aaacttgagt
      301 ttactttatc accgaaatgt tcgagtgcta gataccattt ttactcgcgt gttgcttgag
      361 gttgctggtg cttctattgc tcaaattctt tttatggcaa tcttagtcat gattgattgg
      421 attgatgcac cgcatgatgt gttttatatg ctcattgcat ggttcctcat ggcaatgttt
      481 gcttttggct tggggttaat catttgtgcg attgctcagc aatttgatgt gtttggtaaa
      541 atttggggaa cactcagttt cgttttatta ccaatttctg gcgcattctt cttcgtccat
      601 aaccttccag cacaggcaca atctattgca ctttggtttc cgatgattca tggtacagaa
      661 atgttccgcc atggctattt tggcgataca gtggttactt acgaaagtat aggctttttg
      721 gttgtgagtg atctagcctt gttgttactg ggcttagtga tggttaaaaa ctttagtaaa
      781 ggagttgagc cacaa
//
LOCUS       HEIBEXBB      795 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  H.influenza (strain RM926) capsulation protein (bexB) gene, 5' end.
ACCESSION   M33788
KEYWORDS    capsulation protein.
SOURCE      H.influenza (type b, strain RM926 ATCC 8468) DNA, clones
            lambda-926.[8,11,14].
  ORGANISM  Haemophilus influenzae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 795)
  AUTHORS   Kroll,J.S. and Moxon,E.R.
  TITLE     Capsulation in distantly related strains of Haemophilus influenzae
            type b: genetic drift and gene transfer at the capsulation locus
  JOURNAL   J. Bacteriol. 172, 1374-1379 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >   795     bexB protein
BASE COUNT      188 a    128 c    178 g    301 t
ORIGIN      
        1 atgcaatatg gtgatcaaac aacgtttaaa cagtcattag cgattcaagg aagagtgatt
       61 aatgcattac ttatgcgtga aataatcact cgttatgggc gcaaaaatat tggctttcta
      121 tggctatttg ttgagccttt actcatgact ttttttattg tgatgatgtg gaaatttatt
      181 cgtgctgata aattttcaac tttaaatatg attgcttttg tcatgaccgg gtatcctatg
      241 gcgatgatgt ggcgaaatgc ttctaatcga gctattggtt cgatttctgc aaacttgagt
      301 ttactttatc accgaaatgt tcgagtgcta gatactatct ttactcgagt attgcttgag
      361 gttgcaggtg cttcgattgc tcaaatcctt tttatggcag ttttagtgct tattggttgg
      421 attgatgcac cacgtgatgt gttttatatg cttatggcat ggtttctcat ggctatgttt
      481 gcttttgcct taggccttat tatttgtgcg gtagctcaac aatttgatgt gttcggtaaa
      541 atttggggaa ccttaagttt cgtattgtta ccaatttctg gtgcattttt ctttgttcac
      601 aaccttccat cacaggctca atctatcgcg ctttggctcc caatgattca tggtacggaa
      661 atgtttcgtc atggctactt tggggatacg gtcgttactt atgagagtat cggattttta
      721 gtcgtaagtg atttggcctt attgctaatg ggcttggtga tggttaaaaa ctttagtaag
      781 gggattgagc cacaa
//
LOCUS       PSEARCDA     1502 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  P.aeruginosa membrane protein arcD (complete cds) and arcA (5' end)
            genes.
ACCESSION   M33223
KEYWORDS    membrane protein.
SOURCE      P.aeruginosa (strain PAO1) DNA, clone pME183.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1502)
  AUTHORS   Luethi,E., Baur,H., Gamper,M., Brunner,F., Villeval,D.,
            Mercenier,A. and Haas,D.
  TITLE     The arc operon for anaerobic arginine catabolism in Pseudomonas
            aeruginosa contains an additional gene, arcD, encoding a membrane
            protein
  JOURNAL   Gene 87, 37-43 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Haas, 26-MAR-1990.
FEATURES       from  to/span     description
    pept         24     1472     membrane protein (arcD)
    pept       1494  >  1502     membrane protein (arcA)
    binding       8       15     ribosome binding site (put.)
BASE COUNT      247 a    507 c    443 g    305 t
ORIGIN      
        1 attcctatag ggagattccg aaaatgtccc aagaatccag ccaaaaactc cgactaggag
       61 cgctaacggc tctggttgtc ggctcgatga tcggaggggg catcttctcc ctcccgcaaa
      121 acatggcagc cagcgccgat gtcggcgccg tcctgatcgg atgggcgatc accgccgtcg
      181 gcatgctcac cctcgctttc gtcttccaga ccctcgccaa ccgcaagccc gaactcgacg
      241 gcggcgtgta cgcctacgcc aaggccggct tcggcgacta catgggtttc tcctcggcct
      301 ggggctactg gatcagcgcc tggctgggca acgtcggcta cttcgtcctg ttgttcagca
      361 ccctcggcta cttcttcccg atcttcggca agggcgacac cgtcgcggcg atcgtctgcg
      421 cctcggtcct gctctgggcc ctgcacttcc tggtgctgcg cgggatcaag gaggcggcgt
      481 tcatcaacac cgtcaccacc gtggccaagg tcgtgccgct gttcctgttc atcctgatct
      541 gcctgttcgc cttcaagctg gacatcttca ccgccgacat ctggggcaag agcaacccgg
      601 acctgggcag cgtgatgaac caggtgcgca acatgatgct ggtcaccgtc tgggtgttca
      661 tcggcatcga gggcgcgagc atcttctcct cccgcgcgga aaaacgttcc gacgtcggca
      721 aggccaccgt gatcggcttc atcaccgtcc tgctcctgct ggtgctggtc aacgtgctgt
      781 ccatgggcgt gatgacccag ccggaactgg ccaagctgca gaacccgtcg atggcgctgg
      841 tactcgagca tgtggtcggc cactggggcg ccgtgctgat cagcgtcggc ctgctgatct
      901 cgctgctggg cgcgctgctc tcctgggtgc tgctgtgcgc cgagatcatg ttcgccgccg
      961 ccaaggacca caccatgccg gagttcctgc gccgcgagaa cgccaaccag gtgccggcca
     1021 acgccctgtg gctgaccaac atctgcgtac aggtgttcct ggtggtggtg ttcttcacct
     1081 cgggcgaccc ggacggcatg gacccgtaca ccaagatgct gctcctggcc acctcgatga
     1141 tcctgatccc gtacttctgg tccgccgcct atggcctgct gctgaccctg aagggcgaga
     1201 cctacgagaa cgatgcccgc gaacgcagca aggacctggt catcgccggc atcgccgtgg
     1261 cctacgcggt ctggctgctc tacgcggggg gcctgaagta cctgctgctg tccgccctgc
     1321 tgtatgcgcc cggcgcgatc cttttcgcca aggccaagca cgaggtcggc cagccgatct
     1381 tcaccggaat cgaaaaactg atcttcgccg cagtcgtcat tggcgccctg gtggctgcct
     1441 acggcctcta cgacggtttc ctcaccctct gattactcca ctggagacgt gaaatgagca
     1501 cg
//
LOCUS       TOMCD4A      5429 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  Tomato ATP-dependent protease (CD4A) gene, complete cds.
ACCESSION   M32603
KEYWORDS    ATP-dependent protease.
SOURCE      Tomato (L.esculentum) DNA.
  ORGANISM  Lycopersicon esculentum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 5429)
  AUTHORS   Gottesman,S. and Pichersky,E.
  TITLE     A conserved regulatory subunit for ATP-dependent protease in
            prokaryotes and eukaryotes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Pichersky,E., 08-MAR-1990.
FEATURES       from  to/span     description
    pept        787     1357     ATP-dependent protease (CD4A), exon 1
               1461     1576     ATP-dependent protease (CD4A), exon 2
               1828     1950     ATP-dependent protease (CD4A), exon 3
               2034     2219     ATP-dependent protease (CD4A), exon 4
               2853     3068     ATP-dependent protease (CD4A), exon 5
               3248     3465     ATP-dependent protease (CD4A), exon 6
               3768     3843     ATP-dependent protease (CD4A), exon 7
               3926     4015     ATP-dependent protease (CD4A), exon 8
               4125     5309     ATP-dependent protease (CD4A), exon 9
    pre-msg       1     5429     CD4A mRNA and introns
    IVS        1358     1460     CD4A intron A
    IVS        1577     1827     CD4A intron B (no splice consensus)
    IVS        1951     2033     CD4A intron C
    IVS        2220     2852     CD4A intron D
    IVS        3069     3247     CD4A intron E
    IVS        3466     3767     CD4A intron F
    IVS        3844     3925     CD4A intron G (no splice consensus)
    IVS        4016     4124     CD4A intron H
BASE COUNT     1521 a    972 c   1221 g   1715 t
ORIGIN      
        1 tcgaagttgt aaaggttggt cttttgcttc tacttgtttg gactagtttc tttccaagta
       61 atggggttat taagttggta ttgttgtgga ttttgacagg atatgggggt atgatgtagc
      121 tctcatgttt atattttcaa ttaagcaatg tagttctggt tgactagttg tggtgctaga
      181 attcccaatt aacctcttca ctgaattact cctagtgctg aataaaactg ccttatctca
      241 agggtgtggg ttgttgttgt attctagagg gtttgtttat ataatcccat tggaaccaag
      301 ttaacctatg caataggaat tatatattca agattctgct agtttagaaa ctaaagtcct
      361 tgcagcttgg ctgctttatc ctgtatgaaa tagttaccat cctgtggttg catagttttc
      421 caggtcttct tgttttgagg ctttattatt cattttactg atgatataac ttttatgatg
      481 aaatttgaat ttagttttaa gcttatgcgc taggtgtatg ttgctcaaca tgcattcgca
      541 gagttttttg aagtatggca ttctcataat gatcctattt gtctacatta tgtcatctaa
      601 agaaattaac cttcaacggt taatggttta caatggtcca aaaaaagtag ccttcaatgg
      661 ttattagtga tggttttgtt gatgtctgct aactatatgt atatgttatg taatatgatc
      721 cttggtactt cttccattgc cactaatgat cttttcttat atgctatcat gtcaggttgc
      781 aggtccatga tggctagagc tttagttcag tcaacaaaca ttctaccttc agttgctggt
      841 gaaagggctg gacaattcaa tggatcccgg aaagatcaaa gaactgttag aatgctatgt
      901 aatgtaaaat gttgttcctc aaggctgaac aattttgcag gactgcgagg gtgcaacgca
      961 ttagatacac tactagtaaa atctggagaa actctccatt cgaaagtggc agctgcaact
     1021 tttgtcagac ggccacgagg gtgccgattt gtcccaaaag caatgtttga gcggttcact
     1081 gagaaagcaa taaaagtcat tatgcttgcc caagaagagg ccagacgact aggtcacaac
     1141 tttgttggga cagaacagat cctgctgggt cttattggtg agggaactgg tattgctgct
     1201 aaggttctta aatccatggg aattaatttg aaagatgctc gtgtggaagt agagaagatc
     1261 attggcaggg gtagtggatt tattgcagtg gaaatccctt ttactcctcg tgcaaagcgt
     1321 gttctagaac tatctctgga ggaagcccgc caactaggta cccttatcta tcatttcagc
     1381 cttttgttgg atggcctaac tgtcatgcga actcaaaatt catttctttg ttgcatctgt
     1441 tgattggttt tgtgatgcag ggcataatta tattggctcc gagcacttgt tacttggact
     1501 gttgcgcgaa ggtgaaggtg tggctgcccg tgttcttgaa aacttgggtg ctgaccccac
     1561 taacattcgc acacaggcaa gtgattttcc tttctcaaac tggtttgaaa tgcttttctt
     1621 gaagccaagg gtctaatcag aaacagcctc tctacctcca caagtagggg taaggtctgc
     1681 atacatctta ccttcccaga tgccacctgt gggattacac tgggtatgtt gttgttgttg
     1741 tttacttctg atgggggtga gggtatttta aattgcttga cttcattgta atcagtttct
     1801 cttttttctt gtaaaataaa taaataggtg atacggatgg ttggcgagag tagtgaggct
     1861 gttggtgcta gtgttggagg tggaacttct ggcctaaaga tgcctacatt ggaggagtac
     1921 ggcaccaatt tgacaaagtt agctgaagag gtacatagaa agataagtcc gtcataatgt
     1981 ggcaagtgat tggctctctt caattatttg atatggtcat attatatttt caggggaaac
     2041 tagatcctgt agttggaaga caggctcaaa tcgagcgtgt tactcaaatc ttgggtcgcc
     2101 gtaccaaaaa caacccttgc cttattggag aaccaggtgt tggaaaaact gctattgccg
     2161 aagggctagc tcaaagaatt gctaatggtg atgtccctga aacaatagaa ggaaagaagg
     2221 tcacaggggt gtatgataaa ttgcataaca tgttttatgc ttgagctttt tttctcttta
     2281 tcaatttcat ttcaactgtg tggcagatcc agaagctgtg ctacattctt agatagaaag
     2341 acttcaaatc atcattttta taatgtctta gaatttgatt ttatacagag ggtaattttt
     2401 taaattggaa gtaaatattc tgcagacaaa tatatataca caaatattgt tatctgatcc
     2461 cccatataac acccccaccc cccaccccaa aagcctctag gccaaaaatt tacgcgattc
     2521 taagagtatc caatcaatat ttttggtagg cgtgccaaaa atttacagaa acatttgaat
     2581 gtcccactaa cttgtacgaa caggttttca aatttcaacc gccttgagcc actacatatt
     2641 cttcattgtc tgatatatgg actcaaggca tatatcaggt ttagttccct cttacttcaa
     2701 ttctagtcta actgaagtca tagggaagat gaacctattt atctaaatct ttttatatgc
     2761 aactattttg atgaggtata ccctcctttt tcaaaaaatg tgtatatcct tgtcattata
     2821 cgagtgtatg acttgctttg ggtgatggtc aggtcataac acttgatatg ggattgcttg
     2881 ttgctgggac aaaataccgt ggagagtttg aggaaagact gaagaaacta atggaggaaa
     2941 ttaaacaaag tgatgaaata atactgttta ttgatgaagt acacacattg attggagctg
     3001 gagctgcaga gggagcaatt gatgctgcaa acatcttgaa acctgccctg gctagaggcg
     3061 agctacaggt tcgttatcgt ttcgacctgt agtatgttgt tccttggaat ttcaatgagc
     3121 actatgaaat ctaccttaaa tttggtgttt aatcattatc ttggtccgag tacatcttgg
     3181 taaaactaca aaagaatttc aaagttggtg ttaattaaga agtattttgc ttggtttcac
     3241 ttgtcagtgt attggagcca caaccctgga tgagtacaga aagcacattg agaaagaccc
     3301 tgcactagag agaaggttcc agccagttaa agtccctgaa ccttctgtgg atgaaaccat
     3361 acagatcttg aaagggcttc gggagaggta tgagattcat cacaagctcc attacactga
     3421 tgaggcaata gaggccgctg ccaagctttc tcaccagtac atcaggtatt attattaatc
     3481 atatattcat tttccaaagt atcttccgta gtcaagttgt tggtacattt tgactttggg
     3541 atttcacggc aaactaattt attcttgtat gatttaagat atatccttct caataataaa
     3601 actcttagga taagcggctt ttcttttgta tcagtgtctg aataggctga cgcacggttt
     3661 atctcctgca aattgcacat ttatctccac tccaaaacaa tcatcttttt ccgtgattta
     3721 tctgaagtaa agcattaata tgtttttgag ttgaaccttt attccagtga ccgctttctg
     3781 cctgataaag caattgactt gattgatgaa gctggttccc gtgttcgact tcgtcatgca
     3841 caggcataga gtactctttt tgtcttactc tgtcctgcca tttcttccta agggagtcat
     3901 gacccgttat attgatacta tctagctccc tgaggaagca agagagctgg aaaaggagct
     3961 tcgccagatt acaaaggaga aaaatgaagc tgtccgcggt caagattttg agaaggtact
     4021 ggtttcctct cgtcctgtaa aaaaattcct ctggactgat tatttcttag aaaacattga
     4081 gattttcagg tgtgcaattt taatcctctt tatgcattgg acaggctggg gagttgcgtg
     4141 atagagaaat ggatcttaag gcacaaatct cagccttgat agacaaaaac aaagagaaga
     4201 gcaaggcaga atctgaggca ggagatgcag caggtcccat agtgacagag gcagatattc
     4261 agcacattgt ctcttcctgg accggcattc ctgtagagaa ggtctcaact gacgaatctg
     4321 atcgcctcct aaaaatggaa gaaacacttc atacccgagt cattggccag gatgaagcag
     4381 taaaagccat tagtcgcgct atcagacgtg cccgtgttgg cctcaagaat cccaaccgac
     4441 ctattgccag tttcatcttt tctggtccaa ctggtgttgg gaagtcagaa ctggcaaagt
     4501 ctttagcaac gtactacttt ggttctgaag aagcaatgat tcggcttgat atgagtgagt
     4561 ttatggaaag acacactgtg tctaaactca tcggttcacc ccctgggtat gttggttaca
     4621 ctgaaggtgg tcaattaact gaagctgtta ggcgtcgacc ttatacagtt gtgctctttg
     4681 atgagattga gaaggctcat cctgatgtct tcaacatgat gcttcaaatt cttgaagatg
     4741 gaagattgac agacagcaaa ggcagaactg tcgacttcaa gaatacactc ctcatcatga
     4801 catcgaatgt tggaagcagt gtgatagaga aaggaggtcg tcgtataggt tttgatctag
     4861 attttgacga gaaggatagt agttacaacc gtatcaagag cttggtgact gaggagttga
     4921 aacagtactt caggccagag tttttaaaca gattgagtga gatgattgta ttccgtcagc
     4981 tcactaagtt ggaggtgaag gagattgccg atatcatgct taaggaggtc tttgtgaggt
     5041 tgaagaataa ggagatagaa ctccaagtga cagagaggtt tagagatagg gtagttgatg
     5101 aaggatataa cccaagctat ggagctagac cattgaggag agctattatg agactgctag
     5161 aggatagtat ggccgagaag atgcttgcag gtgagatcaa agaaggtgat tcagtaattg
     5221 tggatgtgga ttcagatggc aacgtcaccg ttctcaatgg cactagtggt gctccctcag
     5281 attcagctcc tgagcctatc cttgtgtaga tcaattcttc atttagctct acaaatttgt
     5341 tcttcgtaat gttgcttttg gatgtcttgg ctaagcaaca tgtgagtaat tttttgggtg
     5401 acaggctgat atattcttcc aagttgact
//
LOCUS       TOMCD4B      5846 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  Tomato ATP-dependent protease (CD4B) gene, complete cds.
ACCESSION   M32604
KEYWORDS    ATP-dependent protease.
SOURCE      Tomato (L.esculentum) DNA.
  ORGANISM  Lycopersicon esculentum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 5846)
  AUTHORS   Gottesman,S. and Pichersky,E.
  TITLE     A conserved regulatory subunit for ATP-dependent protease in
            prokaryotes and eukaryotes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Pichersky,E., 08-MAR-1990.
FEATURES       from  to/span     description
    pept       1111     1675     ATP-dependent protease (CD4B), exon 1
               1896     2011     ATP-dependent protease (CD4B), exon 2
               2133     2255     ATP-dependent protease (CD4B), exon 3
               2326     2511     ATP-dependent protease (CD4B), exon 4
               3020     3235     ATP-dependent protease (CD4B), exon 5
               3445     3662     ATP-dependent protease (CD4B), exon 6
               3978     4053     ATP-dependent protease (CD4B), exon 7
               4138     4227     ATP-dependent protease (CD4B), exon 8
               4354     5535     ATP-dependent protease (CD4B), exon 9
    pre-msg       1     5846     CD4B mRNA and introns
    IVS        1676     1895     CD4B intron A
    IVS        2012     2132     CD4B intron B (no splice consensus)
    IVS        2256     2325     CD4B intron C
    IVS        2512     3019     CD4B intron D
    IVS        3236     3444     CD4B intron E
    IVS        3663     3977     CD4B intron F
    IVS        4054     4137     CD4B intron G (no splice consensus)
    IVS        4228     4353     CD4B intron H
BASE COUNT     1592 a   1067 c   1283 g   1904 t
ORIGIN      
        1 aagcttctgt agcccaaagt tttgtccttt cacttcttgt ctactcctca tactgtctct
       61 tccaaattca acagttgcca ctgtcgattt ttcagcagta aatttacctt gctcaatatc
      121 aactccccat agcccaccca aacttctctt catttttata ataatgacgt tgcatgtttc
      181 atacctcgcc taggaatgtg ttcatcgaat ttccgtcatt gtcacatcat atctgttctt
      241 gtcctttact ctttttcctt ttctttcctg tactctcgtt cgtgttgtta ggatttgtac
      301 tgaaaccgtt gggttatcag ttttacagac ctttaagcag tctctctagg gtttactaag
      361 agctggaaat ttgaataatc tgatcaaaat tgctgaggtt cttttaatcc ttctaaagat
      421 ctctgtagca gagtggtgaa aagcggaaca gggggaatac tcggtacaaa tggtgttgtg
      481 ttcaattgca aagaccaaac ctcccattct aatgtgttgc aacaaactat taggggagtt
      541 ttaaggaaga gaaaatgggg gataatgcta cacaatgcta acctattata atcatctatt
      601 tagtttgcaa accccacata tctctgtgtg tattgtccta tttgttcaat ctattcatga
      661 aaccttggct actcaatcct ggattgcgca cgtactactc agcagttgcg tagtgcccta
      721 gttcttatat cttggagggt agactgaatt tggtttatgt caatgaatca tgtttcacac
      781 agatggttac atattttatt acatagaatc tatatgtcat gcatcgactc ctgtcgagta
      841 tggatacttc ctacttcttc ttggaactct ctgattgagt tatctttctg ctaatggact
      901 gtgcctcact atttttatct tcctctggat tggaaaatat ggacattcac ttgttatcaa
      961 tttttgaatt aataatttgt tgatatgttg gctgattgat gtgtcattga cttcatatac
     1021 tgttatgggt ttgtcatgtg gaagatttga tattttcgat tatctaagga gagctttttt
     1081 gattattttc ttatcctggg caggtccatc atggctagag ctctagttca gtcaacaagc
     1141 atcccatcgt cagttgctgg tgaaaggaca accaaattca atggatctgg gaaaacaaaa
     1201 agagctgtta caatgctatg caatgcacaa tcatcttcac ttacactgag ggattttaca
     1261 ggattgcgag gatgcaacgc aatagataca ctagttagat ctggagaaac tctccaatcc
     1321 aaagtagctg ctgcaactta tgtcagacgg ccacgaggtt gccgatttgt accaaaagca
     1381 atgtttgagc gcttcacaga gaaggcgata aaagtcatta tgcttgcaca agaagaggcc
     1441 agacgacttg gtcacaattt tgttggcact gagcagatct tgttgggtct aattggtgag
     1501 ggaactggta ttgctgccaa ggttcttaaa tcgatgggga tcaatttgaa agatgctcgt
     1561 gtggaagtgg aaaagataat tggaaggggt agtgggtttg ttgctgttga gatccctttt
     1621 acccctcgtg ccaagcgtgt tctggaactc tctctggagg aagcccgcca gctaggtatc
     1681 cttttttgat tgtctgtatt tgttgtatta cctttgggac atgtctaaaa gtggttctcc
     1741 ggttgagtct tattctcatg ttgctcgttg acacctttgg tttttcccaa aattcattga
     1801 ttttctttct ctttgcttgc ctttctggtc ttctacatgc accagcatga ttatttgtct
     1861 aaatatttat tccatttata aactggttgg ttcagggcat aactatattg gttcggagca
     1921 cttgctactt ggattgctac gtgaaggtga aggtgtggct gcccgtgttc ttgaaaactt
     1981 gggtgctgac cccagcaaca tccgcactca ggcaagtagt agatctcaca tgcttcctct
     2041 gggtagtcta atattgtgct tttgtgggaa gttggatttt ttggtttcat catgaccagt
     2101 gttcattggt tttattcttc ttctaaaaat aggtgatccg gatggttggt gagagtaatg
     2161 aggctgttgg tgctagcgtt ggaggtggaa cttctggcca aaaaatgcca acactggagg
     2221 agtatggaac aaatttgaca aaattggcgg aagaggtaat ctaaaaaact atatttgtga
     2281 tgatatgtga acagttatga ataattgtcc atttttggtt tgtagggaaa attggacccc
     2341 gttgttggaa gacagccgca aattgaacgg gtcactcaaa tcttgggtcg gcggactaag
     2401 aacaaccctt gtcttattgg tgaaccaggt gttggcaaaa cagctattgc tgagggtttg
     2461 gcacaaagaa ttgcaaatgg tgatgtccct gaaacaattg aggggaagaa ggtacgtcat
     2521 gtggatagct tgcagaatgt gatgtttacc tgtgcccttt cttctcccat ttgttgtatc
     2581 atatggtgaa cccagaagct gagcagggtt atgaggtgca gtctagtggt attttcgatt
     2641 cacaaggatt gggttcaaat gaatgtgcat tctatctcca ctaaattgaa atatgtcaaa
     2701 ttatattttc agtgtctgtg ttctcttgaa tttttttata tataaattct tccgtgctac
     2761 ttgttctctt ggcatatttc tttttagtag attgatttgt cgatctatgc ttataactaa
     2821 ttgagaaaaa acgattagta tcccttgttg aacttaattt cttagctcaa actatcaaac
     2881 tgaagtgatt tcaagtgaga aaggtccacc acctattctt gtgaatctaa tccatctgaa
     2941 actttgtcaa atgtttcgtg ctatgatata gctacctttc tcgttgttcc taagagtcgc
     3001 tttttttggt gataattagg tgataactct tgatatggga ttgcttgttg ctggaacaaa
     3061 ataccgtgga gagtttgagg aaaggctaaa aaagttgatg gaggaaatta agcagagtga
     3121 tgaaataata ctatttatcg atgaagtgca cacattgatt ggagctggag cagcagaggg
     3181 ggccatcgat gctgcaaaca tcttgaaacc tgccctagct cgaggtgaac tacaggtaag
     3241 ttcctccttc gcttgttctt tcctttctgt tctttctaga aattaaaaat gagaactaag
     3301 aatatttctg ttaacaactc aatgttttag aagatatatc attgttttaa catgcatttt
     3361 accgcctcat aagaagttaa aatgactgct cagttaaagc aggagaaaaa acttctacat
     3421 gtgtatttac ttggcttctt gcagtgtatc ggagctacta cactggatga atacagaaag
     3481 catattgaga aagatcctgc actagagagg agattccaac cagttaaggt ccctgaacct
     3541 actgttgatg aaactataca gattctgaaa gggcttcgtg agaggtatga aattcatcac
     3601 aaacttcgtt acactgacga agacttagtg gctgctgctc agctctcata ccagtacatc
     3661 aggtatctaa tctctaccat tttcctatgt cttcttgtac gtttttactc ttttctcgtc
     3721 ctgaattgtt gatgcattgg taccttgaag aaatatgttt gctttaagac ggtagacctg
     3781 aagttcaact tattttatga tagttaagtg tgataataga agttggttta cttttgttta
     3841 tcagcatcta attaggtttt tatgcttatc tctagaaaag attgtcctag attccacgat
     3901 atagattttc gtataaagtt tttttttcct tgttctgcta agtgatacat cttttcactt
     3961 gcgttttcac atttcagtga ccgatttctg cctgacaaag caattgattt gattgatgaa
     4021 gctggttctc gtgttcgact tcgccatgct caggcatgaa atgctctctt agtctctctt
     4081 ttaatctgtc attgccctac tatttgttga gtgacaatgc tctgttgttg cctccagctc
     4141 cccgaggaag caaaagagct cgagaaagaa cttcgtcaga taacaaagga gaagaatgaa
     4201 gctgttcgtg gtcaagattt cgaaaaggta ctgagtttgt catgtacatc taaagaaccc
     4261 ctcaaaaatg gttgttatat agcccaacta ttttcttatc aagtattata attaaagaat
     4321 tttctaatga catggttttg gattaatgga caggctgggg aattacgtga tagagaaatg
     4381 gatctgaagg cacagatcac agccctcata gacaaaaaca aagaggtgag caaggctgaa
     4441 agtgaggctg cagatacagg tccacttgtg acagaagcag atattcagca catcgtctct
     4501 tcttggactg gtatccctgt tgagaaggtc tccactgacg aatctgaccg tctcctaaaa
     4561 atggaagaaa cacttcacac acgaatcatt ggccaggatg aagctgtcaa agccattagt
     4621 cgcgctattc gacgtgcacg agttgggctc aagaatccca acagacctat tgctagtttc
     4681 atcttttccg gtcccactgg tgttgggaaa tcagaactgg caaaggcact ggctgcatat
     4741 tactttggtt ctgaagaagc aatgatccgg cttgatatga gtgagttcat ggaaagacac
     4801 actgtctcta aactcattgg atccccccct ggttatgttg gttacacaga gggtggtcaa
     4861 ttgactgaag ctgttaggcg tcgaccttat acagttgtgc tctttgatga gattgagaag
     4921 gctcatcctg atgtcttcaa catgatgctt caaattcttg aagatggaag attgacagac
     4981 agcaagggta gaactgtaga tttcaagaac acacttctca tcatgacatc aaatgttgga
     5041 agtagtgtaa ttgagaaagg tggccgtcgt ataggttttg atctcgacct tgatgagaag
     5101 gatagcagtt acaatcgcat caagagctta gtgacagaag aattgaagca gtacttcagg
     5161 ccagagtttt tgaacagatt ggatgagatg attgtattcc gtcagctcac taagttagag
     5221 gttaaggaga tagctgatat catgcttaag gaggtctttg agaggttgaa agttaaggag
     5281 atagaacttc aagtgacaga gaggtttaga gacagggtgg ttgacgaggg atacaaccct
     5341 agctatggag cacgacctct gagaagagct attatgagac ttttagagga cagcatggct
     5401 gagaaaatgc ttgcaaatga gattaaagaa ggtgattcag ttatcgtgga cgttgattca
     5461 gatggcaatg tgactgtcct caatggaagt agtggtactc cctcggatcc ggctcctgag
     5521 cctatccccg tttagatcag gtgatcacat tttagctcaa cagtctctgt gtggtctttt
     5581 aagagttcat ttgtgttggc cactaagctt tgtggggctt atgaagcaac ttttgagttt
     5641 atctctggga atgctcaggt aaatcttgcc acgtcgagaa ggcagcatcc tttctatgtt
     5701 gatcttaatg attttacaga agtatactgg attaagattg tagttggtca caagaaagta
     5761 ttgtattaac aagacactcc ttatcaaaaa catatctagt ttattgtagg cgacagttga
     5821 atcgttaatc ctcgtattgt ctatcc
//
LOCUS       PLYORIA       375 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Human polyomavirus BK (strain WWT) origin of replication.
ACCESSION   M34048
KEYWORDS    origin of replication.
SOURCE      Polyomavirus BK (strain WWT) DNA, from human urine.
  ORGANISM  Polyomavirus BK
            Unclassified.
REFERENCE   1  (bases 1 to 375)
  AUTHORS   Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P.,
            Subramani,S. and Traavik,T.
  TITLE     At least two type of control regions can be found among naturally
            occurring BK virus strains
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Johansen, 07-MAY-1990.
               University of Tromso
               P.O. Box 977
               N-9001, Tromso, NORWAY
               e-mail: terje.johansen@esuit.uit.no
FEATURES       from  to/span     description
    orgrpl       97      119     polyomavirus origin of replication
    site          1      375     regulatory region
    site        143      210     P block
    site        211      249     Q block
    site        250      302     R block
BASE COUNT      123 a     86 c     95 g     71 t
ORIGIN      
        1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
       61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
      121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
      181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
      241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt
      301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta
      361 aactggacaa aggcc
//
LOCUS       PLYORIB       424 bp ds-DNA             VRL       17-JUL-1990
DEFINITION  Human polyomavirus BK (strain UT) origin of replication.
ACCESSION   M34049
KEYWORDS    origin of replication.
SOURCE      Human polyomavirus BK (strain UT) DNA, from human urine.
  ORGANISM  Polyomavirus BK
            Unclassified.
REFERENCE   1  (bases 1 to 424)
  AUTHORS   Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P.,
            Subramani,S. and Traavik,T.
  TITLE     At least two type of control regions can be found among naturally
            occurring BK virus strains
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Johansen, 07-MAY-1990.
               University of Tromso
               P.O. Box 977
               N-9001, Tromso, NORWAY
               e-mail: terje.johansen@esuit.uit.no
FEATURES       from  to/span     description
    orgrpl       97      119     polyomavirus origin of replication
    site          1      424     regulatory region
    site        143      210     P block
    site        211      249     Q block
    site        250      261     R block
    site        263      315     P block
    site        316      349     Q block
    site        350      361     R block
BASE COUNT      134 a    103 c    105 g     82 t
ORIGIN      
        1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
       61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
      121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
      181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
      241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa
      301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc
      361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa
      421 ggcc
//
LOCUS       ECOCLPPA     1236 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  Escherichia coli ATP-dependent clp protease proteolytic component
            (clpP) gene, complete cds.
ACCESSION   J05534
KEYWORDS    ATP-dependent protease.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1236)
  AUTHORS   Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J.,
            Bowers,B. and Gottesman,S.
  TITLE     Sequence and structure of ClpP, the proteolytic component of the
            ATP-dependent Clp protease of E.coli
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Gottesman, 23-APR-1990.
FEATURES       from  to/span     description
    pept        378     1001     ATP-dependent protease (clpP)
    mRNA        279  >  1236     clpP mRNA
    pept    <     1      133     ORF
    signal      244      249     -35 region
    signal      267      272     -10 region
    signal      165      193     transcription termination signal
    binding     367      372     ribosome binding site (put.)
BASE COUNT      329 a    291 c    318 g    298 t
ORIGIN      
        1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga
       61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa
      121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc
      181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg
      241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac
      301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt
      361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg
      421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt
      481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta
      541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc
      601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc
      661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt
      721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga
      781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg
      841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat
      901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg
      961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg
     1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt
     1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc
     1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg
     1201 cgcaagctga ttgccggtcc atccgtgtat atctgc
//
LOCUS       LACNISC      1222 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  L.lactis insertion sequence IS981.
ACCESSION   M33933
KEYWORDS    insertion sequence.
SOURCE      L.lactis (strain LM0230) DNA, clone pKM40.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 1222)
  AUTHORS   Polzin,K.M. and McKay,L.L.
  TITLE     Identification, DNA sequence and distribution of IS981, a new, high
            copy number insertion sequence in the lactococci related to the
            IS2/IS3 family of IS-elements
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.M.Polzin 27-APR-1990.
               University of Minnesota
               Dept. of Food Science
               1334 Eckles Avenue, St. Paul, MN 55108
               e-mail: kayla@molbio.cbs.umn.edu
FEATURES       from  to/span     description
    pept         58      318     ORF1
    pept        330     1169     ORF2
    binding      44       50     ribosomal binding site (put.)
    rpt           1       40     5' inverted repeat
    rpt        1183     1222     3' inverted repeat
    signal       33       38     -10 region
    signal      269      274     -35 region
    signal      294      299     -10 region
BASE COUNT      442 a    233 c    206 g    341 t
ORIGIN      132 bp upstream of PvuII site.
        1 gatgttatcc ttaaatctta gagtcactat tgtataattt agacaaagga caaaaacatg
       61 caaaaacgct actcaaaaga atttaaagaa acccttatcg ccttctatca ttctggtcaa
      121 tccgtcaccc agctgtctaa agaatatgac gtggcccctg caacaattta taaatggata
      181 gacctctact ctaaatctaa tgaaagctcc gtctctaaag ctgattttct agaattaaaa
      241 agacaactgg ctaaagttaa ggaagaacga gacatcttaa aaaaagtatt gaccatattc
      301 gccgagaaaa agaagtgagt gctgcggata tggctcaaac catacaaact ttagcactca
      361 atgtcagact aagctgtcaa ctccttgatg ttcctgaatc aagttattat gaacggatta
      421 accgacaccc atctaaaact caattaagga gacaatacct gtcactcaaa atttctcaac
      481 tcttcaatgc taaccgagga atctatggtg ctcctaaaat tcatcatctt ctacttaaac
      541 aaggggaaaa agtcgggtta aaactggtac agaagctaat gaagcaactt caactcaagt
      601 ctgtagtcat taagaaattt aagcctggat actcactaag tgatcacatc aatcgaaaaa
      661 atctcataca gactgaacct acaaagaaaa ataaggtttg gtcaaccgac attacttata
      721 ttcctactca acaaggatgg gcttatctct caaccattat ggatcgttat actaaaaaag
      781 tcattgcttg ggatttgggc aagcgaatga ctgtagaatt agtgcaaaga actttaaata
      841 aggccattaa atcacaagac tatccagaag ctgttattct tcattctgac caaggaagcc
      901 agtatacgag tctagagtat gaagagttgc ttaagtatta tgggatgact cactctttca
      961 gtcgaagggg atacccttat cataatgcca gtcttgaatc ttggcatgga catttaaaaa
     1021 gagagtgggt gtaccaattt aaatataaga actttgaaga agcctatcag agtattttct
     1081 ggtacatcga agccttttat aattcaaaac gaatccatca aagtttaggg tatcttacac
     1141 ctaatcaatt tgaaaaggta agtgcttaaa ataaatagat taaaattcta cgtttgttac
     1201 tctaaaaact tgacttaacg tc
//
LOCUS       RATSHIIA     3064 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Rat K+ channel protein (KShIIA) mRNA, complete cds.
ACCESSION   M34052
KEYWORDS    K+ channel protein.
SOURCE      Rat brain, cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3064)
  AUTHORS   McCormack,T., Vega-Saenz de Miera,E.V. and Bernardo,R.
  TITLE     Molecular cloning of a member of a novel class of K+ channel genes
            in mammals
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Bernardo, 07-MAY-1990.
FEATURES       from  to/span     description
    pept         80     1771     K+ channel protein (KShIIa)
BASE COUNT      777 a    718 c    729 g    840 t
ORIGIN      
        1 gccacggtga acatctggcc cacgagagct ttaacttggt gctgtgttcg ccttccctag
       61 tcatgtctga gccacagaga tgggcaagat cgagaacaac gagagggtga tcctcaatgt
      121 cggaggcacc aggcacgaaa cctaccgcag cactctcaag acccttcctg gaactcgcct
      181 ggcccttctc gcctcctctg aacctcaggg cgactgcctg actgctgcgg gtgacaagct
      241 gcagccgctg ccccctccgc tgtctccacc gccgcgaccg cctcccttgt cccctgtccc
      301 cagcggctgc ttcgagggcg gcgcaggcaa ctgcagttcg cacggtggca atggcagcga
      361 ccaccctggg ggaggccgcg aattcttctt cgatcgccac ccaggagtct tcgcctatgt
      421 gctcaactac taccgcacgg gcaagctgca ctgccccgcc gacgtgtgtg gaccgctctt
      481 cgaggaagag ctggcattct ggggcatcga tgagaccgac gtggagccct gctgctggat
      541 gacctacagg cagcaccggg acgcggagga ggccctggat atcttcgaga cacccgacct
      601 catcggaggc gaccctggtg atgatgagga cctagggggc aagagactgg gcattgagga
      661 tgctgcgggg ctgggaggac ccgatggcaa gtctggccgc tggaggaagc tgcagcctcg
      721 catgtgggct ctctttgagg acccctattc atccagagcc gctaggttta ttgcttttgc
      781 ttctctgttc ttcattttgg tttccatcac aaccttttgc ctggagacac acgaagcttt
      841 caatattgtt aaaaacaaga cagagccagt catcaacggc accagcgctg ttctccagta
      901 tgaaatcgaa acggatcctg ccttgacata tgtggaagga gtgtgtgtgg tgtggtttac
      961 ttttgaattt ttagtccgta ttgttttctc gcccaataaa cttgagttca tcaaaaatct
     1021 attgaacatc attgactttg tggccatcct ccccttctac ttagaggtgg gactcagcgg
     1081 gctgtcttcc aaagcggcta aagatgtgct cggctttctc agggtggtta ggtttgtgag
     1141 gatcctgaga atcttcaagc ttacccgcca tttcgtaggt ctgagagtgc tcggacacac
     1201 tcttcgtgcg agcaccaatg aatttttgtt gctgatcatc tttctggctc tgggagtttt
     1261 gatattcgct acgatgatct actacgctga gcgagtaggg gctcaaccta atgatccctc
     1321 agcgagtgag cacacacagt tcaaaaacat ccccattggt ttctggtggg ctgtggtgac
     1381 catgactacc ttaggctatg gggatatgta cccccaaaca tggtcaggga tgttggtggg
     1441 ggccttgtgt gctctggctg gagtgctgac catagctatg cctgtgcccg tcattgtcaa
     1501 caattttggg atgtactact ccttggcaat ggcgaagcag aaacttccaa gaaaaagaaa
     1561 gaagcacatt cctcctgccc ctctggcaag ctcacctaca ttttgcaaga cagaattaaa
     1621 catggcttgt aacagtaccc agagtgacac atgtctgggc aaagaaaacc ggcttctgga
     1681 acataacaga tcagtgttat caggtgacga cagtacagga agtgagccgc cattatcacc
     1741 ttccggaaag gctccccatc agacgctcta gtaccagaga caaaaacaga agaggggaaa
     1801 catgtttcct gttgacgaca ggtgattaca cgtgcgcttc tgatggagga atcaggaaag
     1861 ataactgcaa agatgttgtc attactggtt acacgcaagc cgaggccaga tctcttactt
     1921 aatgacttgg gaaaggcaca aaacatgaaa gaaagtgttg tacagaattt atcatggatt
     1981 tttgcctgct gaaaatggga cattgagatt tagccattca aggattgtac tggaaacttc
     2041 tgctactgaa tgtgaccgga tgtgaccggt ttgtgttcag aagagttccc gcattctctg
     2101 aggcatttaa agcttgtaaa gaaatggtgg ctggaactga aatgggcttt gccaaggaaa
     2161 tattctgctt gtcgacagac cagagtccac aaaacaactg taaataccaa cgtgtgcatg
     2221 ggtccaccct tctcacagtc tcatagcaca gggagccaca ttcattgtca gcatctgtga
     2281 agcttcaaga aatgcccaca aggctctgaa tggctcctct cgggctcatg gtggttttgc
     2341 tgcaaattac agagggcagt gaatccaatg tagtataggt ctgttttcat ttcttttatt
     2401 tgtacattac gtgttgtatc tacaaacacc gattgcctca ttttcttgca aaccacttat
     2461 atatccgctt gtgaactatg ttgtaaactt ttagtgtcta catagaaatc taaccatttc
     2521 ttagtcatgt attcatttag ttctggaacg ctgttactat ctagaagaca gtcatcacca
     2581 gtattcatcg tataagtctt ttgtcattgt tacctcaatt ataaatatta aaaatataaa
     2641 ttctggcaat gagaatattt ttttattaaa tgatcaagga aaatgtcagt atatagtaga
     2701 atattatcaa attatatcct aaaatgtcta ttttgcataa aagagatatt cttcaatcga
     2761 ttcctttttt tcgtgagctt tgtggctaat gaagcgtgtg tttgtcttca tcactgctgt
     2821 agttgaaact gtgtaagagt ttttcatctt gcttaatcaa tatttccaga atctcttagt
     2881 tcccctggga ttctgaatat actctatagc ctaattataa acccctgtat cgtgtacctt
     2941 ttgtgaccat ctcaaggtgc atgcctaatc gttgtgataa gccaatggac atgttactaa
     3001 ctgaaatgaa gaataaaagt caaacgatct ggggataaac ttgaatccta tctgattaaa
     3061 tgat
//
LOCUS       HUMROS1      7375 bp ss-mRNA            PRI       17-JUL-1990
DEFINITION  Human transmembrane tyrosine-specific protein kinase (ROS1) mRNA,
            complete cds.
ACCESSION   M34353
KEYWORDS    oncogene; transmembrane tyrosine-specific protein kinase.
SOURCE      Human glioblastoma cell line SW-1088, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 7375)
  AUTHORS   Birchmeier,C., O'Neil,K., Riggs,M. and Wigler,M.
  TITLE     Characterization of ROS1 cDNA from a human glioblastoma cell line
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4799-4803 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.H.Wigler, 15-MAY-1990.
FEATURES       from  to/span     description
    pept        207     7250     transmembrane tyrosine-specific protein kinase
                                 precursor /hgml_locus_uid="LN0124W"
                                 /nomgen="ROS1" /map="6q21-q22"
    sigp        207      314     transmembrane tyrosine-specific protein kinase
                                 signal peptide
    matp        208     7247     transmembrane tyrosine-specific protein kinase
BASE COUNT     2203 a   1496 c   1605 g   2071 t
ORIGIN      
        1 ccgcattcaa gctttcaagc attcaaaggt ctaaatgaaa aaggctaagt attatttcaa
       61 aaggcaagta tatcctaata tagcaaaaca aacaaagcaa aatccatcag ctactcctcc
      121 aattgaagtg atgaagccca aataattcat atagcaaaat ggagaaaatt agaccggcca
      181 tctaaaaatc tgccattggt gaagtgatga agaacattta ctgtcttatt ccgaagcttg
      241 tcaattttgc aactcttggc tgcctatgga tttctgtggt gcagtgtaca gttttaaata
      301 gctgcctaaa gtcgtgtgta actaatctgg gccagcagct tgaccttggc acaccacata
      361 atctgagtga accgtgtatc caaggatgtc acttttggaa ctctgtagat cagaaaaact
      421 gtgctttaaa gtgtcgggag tcgtgtgagg ttggctgtag cagcgcggaa ggtgcatatg
      481 aagaggaagt actggaaaat gcagacctac caactgctcc ctttgcttct tccattggaa
      541 gccacaatat gacattacga tggaaatctg caaacttctc tggagtaaaa tacatcattc
      601 agtggaaata tgcacaactt ctgggaagct ggacttatac taagactgtg tccagaccgt
      661 cctatgtggt caagcccctg caccccttca ctgagtacat tttccgagtg gtttggatct
      721 tcacagcgca gctgcagctc tactcccctc caagtcccag ttacaggact catcctcatg
      781 gagttcctga aactgcacct ttgattagga atattgagag ctcaagtccc gacactgtgg
      841 aagtcagctg ggatccacct caattcccag gtggacctat tttgggttat aacttaaggc
      901 tgatcagcaa aaatcaaaaa ttagatgcag ggacacagag aaccagtttc cagttttact
      961 ccactttacc aaatactatc tacaggtttt ctattgcagc agtaaatgaa gttggtgagg
     1021 gtccagaagc agaatctagt attaccactt catcttcagc agttcaacaa gaggaacagt
     1081 ggctcttttt atccagaaaa acttctctaa gaaagagatc tttaaaacat ttagtagatg
     1141 aagcacattg ccttcggttg gatgctatat accataatat tacaggaata tctgttgatg
     1201 tccaccagca aattgtttat ttctctgaag gaactctcat atgggcgaag aaggctgcca
     1261 acatgtctga tgtatctgac ctgagaattt tttacagagg ttcaggatta atttcttcta
     1321 tctccataga ttggctttat caaagaatgt atttcatcat ggatgaactg gtatgtgtct
     1381 gtgatttaga gaactgctca aacatcgagg aaattactcc accctctatt agtgcacctc
     1441 aaaaaattgt ggctgattca tacaatgggt atgtctttta cctcctgaga gatggcattt
     1501 atagagcaga ccttcctgta ccatctggcc ggtgtgcaga agctgtgcgt attgtggaga
     1561 gttgcacgtt aaaggacttt gcaatcaagc cacaagccaa gcgaatcatt tacttcaatg
     1621 acactgccca agtcttcatg tcaacatttc tggatggctc tgcttcccat ctcatcctac
     1681 ctcgcatccc ctttgctgat gtgaaaagtt ttgcttgtga aaacaatgac tttcttgtca
     1741 cagatggcaa ggtcattttc caacaggatg ctttgtcttt taatgaattc atcgtgggat
     1801 gtgacctgag tcacatagaa gaatttgggt ttggtaactt ggtcatcttt ggctcatcct
     1861 cccagctgca ccctctgcca ggccgcccgc aggagctttc ggtgctgttt ggctctcacc
     1921 aggctcttgt tcaatggaag cctcctgccc ttgccatagg agccaatgtc atcctgatca
     1981 gtgatattat tgaactcttt gaattaggcc cttctgcctg gcagaactgg acctatgagg
     2041 tgaaagtatc cacccaagac cctcctgaag tcactcatat tttcttgaac ataagtggaa
     2101 ccatgctgaa tgtacctgag ctgcagagtg ctatgaaata caaggtttct gtgagagcaa
     2161 gttctccaaa gaggccaggc ccctggtcag agccctcagt gggtactacc ctggtgccag
     2221 ctagtgaacc accatttatc atggctgtga aagaagatgg gctttggagt aaaccattaa
     2281 atagctttgg cccaggagag ttcttatcct ctgatatagg aaatgtgtca gacatggatt
     2341 ggtataacaa cagcctctac tacagtgaca cgaaaggcga cgtttttgtg tggctgctga
     2401 atgggacgga tatctcagag aattatcacc tacccagcat tgcaggagca ggggctttag
     2461 cttttgagtg gctgggtcac tttctctact gggctggaaa gacatatgtg atacaaaggc
     2521 agtctgtgtt gacgggacac acagacattg ttacccacgt gaagctattg gtgaatgaca
     2581 tggtggtgga ttcagttggt ggatatctct actggaccac actctattca gtggaaagca
     2641 ccagactaaa tggggaaagt tcccttgtac tacagacaca gccttggttt tctgggaaaa
     2701 aggtaattgc tctaacttta gacctcagtg atgggctcct gtattggttg gttcaagaca
     2761 gtcaatgtat tcacctgtac acagctgttc ttcggggaca gagcactggg gataccacca
     2821 tcacagaatt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactata
     2881 gtggtcggct gttctggatc aatggcttta ggattatcac aactcaagaa ataggtcaga
     2941 aaaccagtgt ctctgttttg gaaccagcca gatttaatca gttcacaatt attcagacat
     3001 cccttaagcc cctgccaggg aacttttcct ttacccctaa ggttattcca gattctgttc
     3061 aagagtcttc atttaggatt gaaggaaatg cttcaagttt tcaaatcctg tggaatggtc
     3121 cccctgcggt agactggggt gtagttttct acagtgtaga atttagtgct cattctaagt
     3181 tcttggctag tgaacaacac tctttacctg tatttactgt ggaaggactg gaaccttatg
     3241 ccttatttaa tctttctgtc actccttata cctactgggg aaagggcccc aaaacatctc
     3301 tgtcacttcg agcacctgaa acagttccat cagcaccaga gaaccccaga atatttatat
     3361 taccaagtgg aaaatgctgc aacaagaatg aagttgtggt ggaatttagg tggaacaaac
     3421 ctaagcatga aaatggggtg ttaacaaaat ttgaaatttt ctacaatata tccaatcaaa
     3481 gtattacaaa caaaacatgt gaagactgga ttgctgtcaa tgtcactccc tcagtgatgt
     3541 cttttcaact tgaaggcatg agtcccagat gctttattgc cttccaggtt agggccttta
     3601 catctaaggg gccaggacca tatgctgacg ttgtaaagtc tacaacatca gaaatcaacc
     3661 catttcctca cctcataact cttcttggta acaagatagt ttttttagat atggatcaaa
     3721 atcaagttgt gtggacgttt tcagcagaaa gagttatcag tgccgtttgc tacacagctg
     3781 ataatgagat gggatattat gctgaagggg actcactctt tcttctgcac ttgcacaatc
     3841 gctctagctc tgagcttttc caagattcac tggtttttga tatcacagtt attacaattg
     3901 actggatttc aaggcacctc tactttgcac tgaaagaatc acaaaatgga atgcaagtat
     3961 ttgatgttga tcttgaacac aaggtgaaat atcccagaga ggtgaagatt cacaatagga
     4021 attcaacaat aatttctttt tctgtatatc ctcttttaag tcgcttgtat tggacagaag
     4081 tttccaattt tggctaccag atgttctact acagtattat cagtcacacc ttgcaccgaa
     4141 ttctgcaacc cacagctaca aaccaacaaa acaaaaggaa tcaatgttct tgtaatgtga
     4201 ctgaatttga gttaagtgga gcaatggcta ttgatacctc taacctagag aaaccattga
     4261 tatactttgc caaagcacaa gagatctggg caatggatct ggaaggctgt cagtgttgga
     4321 gagttatcac agtacctgct atgctcgcag gaaaaaccct tgttagctta actgtggatg
     4381 gagatcttat atactggatc atcacagcaa aggacagcac acagatttat caggcaaaga
     4441 aaggaaatgg ggccatcgtt tcccaggtga aggccctaag gagtaggcat atcttggctt
     4501 acagttcagt tatgcagcct tttccagata aagcgtttct gtctctagct tcagacactg
     4561 tggaaccaac tatacttaat gccactaaca ctagcctcac aatcagatta cctctggcca
     4621 agacaaacct cacatggtat ggcatcacca gccctactcc aacatacctg gtttattatg
     4681 cagaagttaa tgacaggaaa aacagctctg acttgaaata tagaattctg gaatttcagg
     4741 acagtatagc tcttattgaa gatttacaac cattttcaac atacatgata cagatagctg
     4801 taaaaaatta ttattcagat cctttggaac atttaccacc aggaaaagag atttggggaa
     4861 aaactaaaaa tggagtacca gaggcagtgc agctcattaa tacaactgtg cggtcagaca
     4921 ccagcctcat tatatcttgg agagaatctc acaagccaaa tggacctaaa gaatcagtcc
     4981 gttatcagtt ggcaatctca cacctggccc taattcctga aactcctcta agacaaagtg
     5041 aatttccaaa tggaaggctc actctccttg ttactagact gtctggtgga aatatttatg
     5101 tgttaaaggt tcttgcctgc cactctgagg aaatgtggtg tacagagagt catcctgtca
     5161 ctgtggaaat gtttaacaca ccagagaaac cttattcctt ggttccagag aacactagtt
     5221 tgcaatttaa ttggaaggct ccattgaatg ttaacctcat cagattttgg gttgagctac
     5281 agaagtggaa atacaatgag ttttaccatg ttaaaacttc atgcagccaa ggtcctgctt
     5341 atgtctgtaa tatcacaaat ctacaacctt atacttcata taatgtcaga gtagtggtgg
     5401 tttataagac gggagaaaat agcacctcac ttccagaaag ctttaagaca aaagctggag
     5461 tcccaaataa accaggcatt cccaaattac tagaagggag taaaaattca atacagtggg
     5521 agaaagctga agataatgga tgtagaatta catactatat ccttgagata agaaagagca
     5581 cttcaaataa tttacagaac cagaatttaa ggtggaagat gacatttaat ggatcctgca
     5641 gtagtgtttg cacatggaag tccaaaaacc tgaaaggaat atttcagttc agagtagtag
     5701 ctgcaaataa tctagggttt ggtgaatata gtggaatcag tgagaatatt atattagttg
     5761 gagatgattt ttggatacca gaaacaagtt tcatacttac tattatagtt ggaatatttc
     5821 tggttgttac aatcccactg acctttgtct ggcatagaag attaaagaat caaaaaagtg
     5881 ccaaggaagg ggtgacagtg cttataaacg aagacaaaga gttggctgag ctgcgaggtc
     5941 tggcagccgg agtaggcctg gctaatgcct gctatgcaat acatactctt ccaacccaag
     6001 aggagattga aaatcttcct gccttccctc gggaaaaact gactctgcgt ctcttgctgg
     6061 gaagtggagc ctttggagaa gtgtatgaag gaacagcagt ggacatctta ggagttggaa
     6121 gtggagaaat caaagtagca gtgaagactt tgaagaaggg ttccacagac caggagaaga
     6181 ttgaattcct gaaggaggca catctgatga gcaaatttaa tcatcccaac attctgaagc
     6241 agcttggagt ttgtctgctg aatgaacccc aatacattat cctggaactg atggagggag
     6301 gagaccttct tacttatttg cgtaaagccc ggatggcaac gttttatggt cctttactca
     6361 ccttggttga ccttgtagac ctgtgtgtag atatttcaaa aggctgtgtc tacttggaac
     6421 ggatgcattt cattcacagg gatctggcag ctcgaaattg ccttgtttcc gtgaaagact
     6481 ataccagtcc acggatagtg aagattggag actttggact cgccagagac atctataaaa
     6541 atgattacta tagaaagaga ggggaaggcc tgctcccagt tcggtggatg gctccagaaa
     6601 gtttgatgga tggaatcttc actactcaat ctgatgtatg gtcttttgga attctgattt
     6661 gggagatttt aactcttggt catcagcctt atccagctca ttccaacctt gatgtgttaa
     6721 actatgtgca aacaggaggg agactggagc caccaagaaa ttgtcctgat gatctgtgga
     6781 atttaatgac ccagtgctgg gctcaagaac ccgaccaaag acctactttt catagaattc
     6841 agaaccaact tcagttattc agaaattttt tcttaaatag catttatcag tgcagagatg
     6901 aagcaaacaa cagtggagtc ataaatgaaa gctttgaagg tgaagatggc gatgtgattt
     6961 gtttgaattc agatgacatt atgccagttg ttttaatgga aacgaagaac cgagaagggt
     7021 taaactatat ggtacttgct acagaatgtg gccaaggtga agaaaagtct gagggtcctc
     7081 taggctccca ggaatctgaa tcttgtggtc tgaggaaaga agagaaggaa ccacatgcag
     7141 acaaagattt ctgccaagaa aaacaagtgg cttactgccc ttctggcaag cctgaaggcc
     7201 tgaactatgc ctgtctcact cacagtggat atggagatgg gtctgattaa tagcgttgtt
     7261 tgggaaatag agagttgaga taaacactct cattcagtag ttactgaaag aaaactctgc
     7321 tagaatgata aatgtcatgg tggtctataa ctccaaataa acaatgcaac gttcc
//
LOCUS       MBOPIVML      969 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  M.lacunata pilin gene inverting protein (PivML) gene, complete cds.
ACCESSION   M34367
KEYWORDS    pilin gene inverting protein.
SOURCE      M.lacunata (ATCC 17956) DNA, clone pMxL1.
  ORGANISM  Moraxella lacunata
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 969)
  AUTHORS   Marrs,C.F., Rozsa,F.W., Hackel,M., Stevens,S.P. and Glasgow,A.C.
  TITLE     Identification, cloning and sequencing of Piv, a new gene involved
            in inverting the pilin gene of M.lacunata
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.F.Marrs, 17-MAY-1990.
FEATURES       from  to/span     description
    pept          1      969     pilin gene inverting protein (PivML)
BASE COUNT      318 a    236 c    165 g    250 t
ORIGIN      
        1 atgtctaaaa cttacattgg gattgatatc gccaaaaaca catttgatgc ctgctttatt
       61 acccataaca cttggcaaaa ctgtacattc acaaataacc aacagggttt tatagagcta
      121 acattgtgga ttcaagcaca tcattataac acatctacct tacatttgat tattgaggca
      181 acgggggctt attgggaaaa attagctcat tgggcaatct ctcatcatca taaggtaagt
      241 attgtcaacc ccttatatat tcatgcatac gccaaaagcc taggcatacg caccaagaca
      301 gataagcagg atgccattct cttggcacgc tatggtgcaa aagaaaatcc acctttatgg
      361 cagcctaaat cagacaacga aataaaactg actgcccttc taaaacaacg tgaacaccac
      421 aaacggcaac tcatcaaaga acgcactcga caggaagctc tgtctattta tgtgaaatcg
      481 tatactgatg acaacattcg tcattggagt gacagcatca cacagctcga tcatcaaatt
      541 tggcaactca tcaattgcac gcccgagcta aattaccgtg ccagcctact tgccacaata
      601 cctggaattg gcaaaaagac actgccacat ctactggttg tcattggaga tggttcatca
      661 tttcaaagtg ccaaacatct agcaagctat gctgggcttg cccctcgtca tcatcaatca
      721 ggcattagta ttcacaagca gtcttccatt ggatttagtg gacaaaaaga actacgttca
      781 gcccttttta tgcccgctgt catagtcagc tttggtcgat atcccgcttt tcaaaagttc
      841 gtcaaacgca tggaacaaaa aggcaaaacc aaaaaacaaa ttattatcgc catcatgcgc
      901 aaattactca ccatcagcta tgccgttatt cgccaaaatc gcccttttga caaacgcatt
      961 catgaataa
//
LOCUS       STYMKAD       915 bp ds-DNA             BCT       17-JUL-1990
DEFINITION  Plasmid pEX102 mouse killing agent D (mkaD) gene, complete cds.
ACCESSION   M34355
KEYWORDS    mouse killing agent D.
SOURCE      Plasmid pEX102 (from Salmonella typhimurium, strain TML R66) DNA.
  ORGANISM  Plasmid pEX102
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 915)
  AUTHORS   Taira,S. and Rhen,M.
  TITLE     Nucleotide sequence of mkaD, a virulence associated gene of
            Salmonella typhimurium containing variable and constant region
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.P.Taira, 16-MAY-1990.
FEATURES       from  to/span     description
    pept        132      857     mouse killing agent D (mkaD)
BASE COUNT      273 a    181 c    213 g    248 t
ORIGIN      
        1 ggaccgaaat atcatttttt cgcctgataa aaaatgaggt tttctggata actaatcgtt
       61 ttattaaaaa aaaactgaga atttatatct aataatatgg cgatatatcc atatcgcaaa
      121 ggagatttcc catgcccata aataggccta atctaaatct aaacatccct cctttgaata
      181 ttgtagctgc ttatgatggg gcggaaatac catctacaaa taagcacctg aaaaataatt
      241 tcaactcctt gcacaaccaa atgcggaaga tgccggtatc ccactttaaa gaggcgctgg
      301 atgtgcctga ctattcaggg atgcgccaga gtggtttctt tgctatgagc caaggttttc
      361 agctgaataa ccatggttac gatgttttca tccatgctcg tcgagaatca cctcagtctc
      421 agggcaaatt tgccggtgac aagttccaca tcagtgtgct cagggatatg gtgccacaag
      481 catttcaagc gctgtccgga ttgctgtttt cagaggacag tccggtagat aagtggaaag
      541 tgaccgatat ggagaaggtc gttcaacaag cccgtgttag cctgggcgct cagttcacgt
      601 tgtatataaa accagaccag gaaaattcgc agtacagtgc gtcgtttctc cacaagacac
      661 ggcaatttat agagtgtctg gaatccagac tatccgaaaa tggggttatt tcaggacagt
      721 gtcctgagtc agacgttcat cctgaaaatt ggaaatatct cagttatcgt aatgaactac
      781 gaagtgggcg tgatggtggc gaaatgcaga gacaggcttt acgtgaggaa ccgttttatc
      841 gtttgatgac agagtaagta tgggtttggg gagcaacgga acagtaaacg ccgttaaaca
      901 gctattttaa atgct
//
LOCUS       TOBEXT       3994 bp ds-DNA             PLN       17-JUL-1990
DEFINITION  N.plumbaginifolia extensin (ext) gene, complete cds.
ACCESSION   M34371
KEYWORDS    cell wall protein; extensin.
SOURCE      N.plumbaginifolia leaf DNA.
  ORGANISM  Nicotiana plumbaginifolia
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 3994)
  AUTHORS   De Loose,M.R.R., Gheyson,G., Tire,C., Gielen,J., Villaroel,R.,
            Genetello,C., Van Montagu,M., Depicker,A. and Inze,D.
  TITLE     The extensin signal peptide allows secretion of heterologous
            proteins from protoplasts
  JOURNAL   Unpublished (1990)
  STANDARD  unannotated staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.R.De Loose, 18-MAY-1990.
            
               Author address: M.R.R.De Loose
               Laboratorium voor Genetika, Wetenschappen,
               K.L.Ledeganckstraat, Gent,
               Oost-Vlaanderen, BELGIUM
FEATURES       from  to/span     description
    pept       1590     2840     extensin precursor
    sigp       1590     1652     extensin signal peptide (could end at 1667)
    matp       1653     2837     extensin (could start at 1668)
    mRNA       1560     3342     extensin mRNA
    rpt        1740     1824     tandem 85 bp repeats
    rpt        1740     1758     19 bp repeats
    signal     1532     1537     TATA box
    signal     3323     3329     poly-A signal
BASE COUNT     1264 a    992 c    550 g   1188 t
ORIGIN      
        1 cgcccgggga tcctcctgga tgccattttt cctttgcttt tggatacctt ttcgcaactt
       61 tagtccattg cttgaacatg gttttttcta cctcttgtct gttcccatga tgataaacta
      121 tgatagctaa cattttcagt aatattggat tgaattagat atactttgaa ttattgagct
      181 acaaaattcg ttgaatattt tgccctatcc cccacccaaa agaaaataat tcataccttt
      241 atttggtaat aaaagtgggt cacatgcaca gtttttctcg tcttctctag attaaactct
      301 ttggaaatga cctactgaaa atactacaca taaaattccc cactcccctc caagttgggg
      361 gagggtgggg gttggtgggt tttttttccc cttgaatcca ttcagccggc ttatgcctcc
      421 cgaggtggtg ggaggtttaa tttgaaccct taccctatag ttaatatcga aatagataat
      481 acaaaggagg ggaacataaa accaaaacct ccgacaaaac atcaaagtgt aggagttcat
      541 gatatacgct tgattgaaag tgcacaatgg tctttgcata ctatcaaagt atcaattggt
      601 tcttgaatta tatctcttaa tataatgtat tgtgtttaat ttctctacta tcccctattt
      661 tataagacta aaagatcctg acatgcttct tgaacacatg tgaaggttag ttaactggtc
      721 agaagtacac aagaattaat ttgtacacct atgtttctca ccagaatgcc cataaattac
      781 atcaattctt caacatcaag agatcgaaaa acttaacaat tgttctaata gttgaactga
      841 gtcgtccgat ccatatccga ttcttcacta gaagcattaa tcatatagag caatttaact
      901 taatttacta tattggtaat catttacata gttaagttat aaaaccttgg agcgacacaa
      961 ggatttacac taatcatgat tgaatattaa ctttactcaa tttatcaatg tctgatacaa
     1021 gtgaattaat ttctctcttt gtgatttcgg tagaaaatta atgtcaagtt tcaaagtttt
     1081 tctttttgaa attagtcata catgtgaata gaacattaat ttaagttaaa gaatcatata
     1141 aaaaagttag ccgctttgat ttctgtaact gaaatcgtgc agagatgagg catccacact
     1201 tgtttttcaa agcttcagta ctatttaaaa acaaagacag taaaaaggag actttttctt
     1261 tgacattatt gcatcagaaa tagtatagct gccataatag tttattcctt tgcttagctt
     1321 gcagcctcta tcaaacaaaa aaatacacca gctcaagtca atttgagccg acaacatgac
     1381 aaaaccaaat caaatatgca tttttttttt tttttttttt tttttttttt tttttttttt
     1441 tttacacttt ggtaggtgtt aagtaatcta gtgagacttt taccttcatt tatgaaaatc
     1501 ttgaaaaggg taattgtcta attgaaagct atataaaggg gtcgaagtga agcttaagag
     1561 gacaacaact tttctcattt gtttcaaaga tgggaaaaat ggcttctcta tttgccacat
     1621 ttttagtggt tttagtgtca cttagcttag cttctgaaag ctcagcaaat tatcaatact
     1681 catctccacc accacctaag aagccatacc acccttcacc aacaccatat tatcctgcac
     1741 cagtttataa atctccacca ccaccaattc cggtttataa gtcaccacca ccacccaaga
     1801 agccatacta ccctccacac accccagttt acaaatctcc accaccacca actccagttt
     1861 acaagtcgcc accaccaccc aagaagccac actaccctcc acacacaccc gtttacaagt
     1921 ctccaccacc acctactcca gtttacaagt caccaccatc acccaagaag ccacactatc
     1981 ctccacacac tccggtttac aagtcgccac caccaccaac tcccgtttac aagtcgccac
     2041 caccacccaa gaagccacac tatccaccac acaccccggt ttacaagtcg ccaccaccac
     2101 caactcccgt ttacaagtcg ccaccaccac caaagaagcc acactatcct ccacacacac
     2161 cagtttacaa gtcaccacca ccacccaaga agccatacta tccaccacac accccagttt
     2221 acaagtcgcc accaccacca actccagttt acaagtcacc accaccacct aagaagccat
     2281 accacccttc accaacaccg tatcatcctt caccagttta caagtcccct ccaccaccaa
     2341 ctccagttta caagtcacca ccaccaccta agaagccata ccacccttca ccaacaccgt
     2401 atcatccatc accagtttac aagtcccctc caccaccaac tccagtttac aagtcaccac
     2461 caccacctaa gaagccatac cacccttcac caacaccgta tcatccatca ccagtttaca
     2521 agtcccctcc accaccaact ccagtttaca agtcaccacc accacctaag aagccatacc
     2581 acccttcacc aacaccgtat catcctgcac cagtttacaa gtccccacca ccaccaactc
     2641 cagtttacaa gtcaccaccg ccaccagtga agccatacca tccttcacca acaccatacc
     2701 atcctgcacc agtttataag tctccaccac caccaactcc tgtttacaag tcacctccac
     2761 caccaactcc agtctacaag tctccaccac cacaccaccc ctatgtttac gcttctcctc
     2821 ctcctcccta ccattactaa gaagtgacat cacaaagctg aaggtaatta aggcaaatta
     2881 attaaaacat tttcaaaatg atatgtcagg aatgcaaaga aaattgcata actttgttta
     2941 tgcaaaacat taatataata taaagaactg cgctctctgc ttattaacta tttttttagc
     3001 ttcatttatt agttactagc tgttcagaat tttcagtatc ttttgatatt acaaagaacc
     3061 taatcagaca atgtatattc ttatgcagga aaagcagaat gttgagctaa aagaaaggct
     3121 ttttccattt tcgagaggca acgagaagaa aagaagaaga aaaaatagag taaataataa
     3181 gccccacggg aggcgaagtt cttttgtagc ttcatgttgt ctaagctatt gatattgttt
     3241 gtaccctata ttttatttct ctgtctttgt gtatgttttg ttcaatttcg atctccttgc
     3301 aaaatgcaga gattatgagc tgaataaagt aagttaatat tattatacgt gttaattttc
     3361 tactcctctc tctagccagc cttttgtttt ctctttttct tatttcattt tctttatatc
     3421 aaaccatttt aggagagggc caaggagtga tccagtaaaa cacgaagatt agaagaaact
     3481 tcccttcttt ttttttatct gaaaaaaatt taaagtcgag atttatctct ttttgtaatg
     3541 gaatcatttc cacagttatg actattttca aattgatatc gccagaccta acaaattaaa
     3601 ttactccttc cgctcatctg gccgtacact tctgctccat ctggccgtac agttctgctc
     3661 catctggcag tgtacggaga atggataact taatattcag tatgaatttg gacataaaat
     3721 cttctaagat tgctagtgaa ttttgtgagc gttattgtgt taaaacaaaa gacagaaaat
     3781 tattagaaga attaaagaaa tatgaaaata gtatagaatc agaaatattt cgagtccaca
     3841 agttttttgt gtgttcttaa ggaatgtttt aacccctcac acgttgccaa ggtaatggat
     3901 taaatcctcc caagataaaa cggaataaac cttcctgcaa cagtggcaat acaaactgca
     3961 ggtcgactct agggatcccc gggcgagcga attc
//
LOCUS       MUSIGHABN     336 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34574
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 336)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   336     Ig heavy-chain V-D-J region (AA at 1)
    recomb      276      277     V-region DNA end/D-region DNA start
    recomb      285      286     D-region DNA end/J-region DNA start
BASE COUNT       86 a     76 c     95 g     79 t
ORIGIN      Chromosome 12.
        1 actggaggag gcttggtgca acctgggggg tcacggggac tctcttgtga aggctcaggg
       61 ttcactttta gtggcttctg gatgagctgg gttcgacaga cacctgggaa gaccctggag
      121 tggattggag acattaattc tgatggcagt gcaataaact acgcaccatc cataaaggat
      181 cgattcacta tcttcagaga caatgacaag agcaccctgt acctgcagat gagcaatgtg
      241 cgatcggagg acacagccac gtatttctgt atgagatata gtaactactg gtacttcgat
      301 gtctggggca caaggaccac ggtcaccgtc tcctca
//
LOCUS       MUSIGHABO     357 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34575
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH9.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 357)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   357     Ig heavy-chain V-D-J-region (AA at 1)
    recomb      294      295     V-region DNA end/D-region DNA start
    recomb      306      307     D-region DNA end/J-region DNA start
BASE COUNT       89 a     77 c    106 g     85 t
ORIGIN      Chromosome 12.
        1 gaagtgcagc tgttggagac tggaggaggc ttggtgcaac ctggggggtc acggggactc
       61 tcttgtgaag gctcagggtt cacttttagt ggcttctgga tgagctgggt tcgacagaca
      121 cctgggaaga ccctggagtg gattggagac attaattctg atggcagtgc aataaactac
      181 gcagcatcca taaaggatcg attcactatc ttcagagaca atgacaagag caccctgtac
      241 ctgcagatga gcaatgtgcg atcggaggac acagccacgt atttctgtat gagatatgat
      301 ggttactact ggtacttcga tgtctggggc acaaggacca cggtcaccgt ctcctca
//
LOCUS       MUSIGHABP     354 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34576
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH34.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   354     Ig heavy-chain V-D-J-region (AA at 1)
    recomb      294      295     V-region DNA end/D-region DNA start
    recomb      303      304     D-region DNA end/J-region DNA start
BASE COUNT       88 a     78 c    104 g     83 t      1 others
ORIGIN      Chromosome 12.
        1 gaantgcagc tgttggagac tggaggaggc ttggtgcaac ctggggggtc acggggactc
       61 tcttgtgaag gctcagggtt cacttttagt ggcttctgga tgagctgggt tcgacagaca
      121 cctgggaaga ccctggagtg gattggagac attaattctg atggcagtgc aataaactac
      181 gcaccatcca taaaggatcg attcactatc ttcagagaca atgacaagag caccctgtac
      241 ctgcagatga gcaatgtgcg atcggaggac acagccacgt atttctgtat gagatatggt
      301 aactactggt acttcgatgt ctggggcaca gggaccacgg tcaccgtctc ctca
//
LOCUS       MUSIGHABQ     366 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34577
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 366)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   366     Ig heavy chain V-D-J-region (AA at 1)
    recomb      300      301     V-region DNA end/D-region DNA start
    recomb      318      319     D-region DNA end/J-region DNA start
BASE COUNT       89 a     84 c     97 g     96 t
ORIGIN      Chromosome 12.
        1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagtctc
       61 tcctgtgcag cttctggatt caccttcact gattactaca tgagctgggt ccgccagcct
      121 ccagggaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca
      181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc
      241 ctctatcttc aaatgaatgc cctgagagct gaggacagtg ccacttatta ctgtgcaaga
      301 tattactacg gtagtagcta tgctatggac tactggggtc aaggaacctc agtcaccgtc
      361 tcctca
//
LOCUS       MUSIGHABR     360 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34578
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   360     Ig heavy chain V-D-J-region (AA at 1)
    recomb      300      301     V-region DNA end/D-region DNA start
    recomb      315      316     D-region DNA end/J-region DNA start
BASE COUNT       85 a     87 c     93 g     95 t
ORIGIN      Chromosome 12.
        1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagtctc
       61 tcctgtgcag cttctggatt caccttcact gattactaca tgagctgggt ccgccagcct
      121 ccagggaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca
      181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc
      241 ctctatcttc aaatgaatgc cctgagagct gaggacagtg ccacttatta ctgtgcaaga
      301 tccctctatg atggttactt tgactactgg ggccaaggca ccactctcac agtctcctca
//
LOCUS       MUSIGHABS     360 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34579
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   360     Ig heavy chain V-D-J-region (AA at 1)
    recomb      294      295     V-region DNA end/D-region DNA start
    recomb      309      310     D-region DNA end/J-region DNA start
BASE COUNT       90 a     93 c     93 g     83 t      1 others
ORIGIN      Chromosome 12.
        1 caggtccaac tgcagcngcc tggggctgag ctggtgaggc ctgggtcttc agtgaagctg
       61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcattgggt gaagcagagg
      121 cctatacaag gccttgaatg gattggtaac attgaccctt ctgatagtga aactcactac
      181 aatcaaaagt tcaaggacaa ggccacattg actgtagaca aatcctccag cacagcctac
      241 atgcagctca gcagcctgac atctgaggac tctgcggtct attactgtgc aagagggatt
      301 tatagtaact actggtactt cgatgtctgg ggcacaggga ccacggtcac cgtctcctca
//
LOCUS       MUSIGHABT     357 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34580
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH31.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 357)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   357     Ig heavy chain V-D-J-region (AA at 1)
    recomb      294      295     V-region DNA end/D-region DNA start
    recomb      306      307     D-region DNA end/J-region DNA start
BASE COUNT       90 a     94 c     89 g     82 t      2 others
ORIGIN      Chromosome 12.
        1 caggtccaac tgcancancc tggggctgag ctggtgaggc ctgggtcttc agtgaagctg
       61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcattgggt gaagcagagg
      121 cctatacaag gccttgaatg gattggtaac attgaccctt ctgatagtga aactcactac
      181 aatcaaaagt tcaaggacaa ggccacattg actgtagaca aatcctccag cacagcctac
      241 atgcagctca gcagcctgac atctgaggac tctgcggtct attactgtgc aagatactat
      301 ggtaactact ggtacttcga tgtctggggc acagggacca cggtcaccgt ctcctca
//
LOCUS       MUSIGHABU     366 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34581
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH12.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 366)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   366     Ig heavy chain V-D-J-region (AA at 1)
    recomb      294      295     V-region DNA end/D-region DNA start
    recomb      321      322     D-region DNA end/J-region DNA start
BASE COUNT       95 a     93 c     96 g     81 t      1 others
ORIGIN      Chromosome 12.
        1 caggtccaac tgcagcagcc tgggactgaa ctggtnaagc ctggggcttc agtgaagctg
       61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg
      121 cctggacaag gccttgagtg gattggaaat attaatccta gcaatggtgg tactaactac
      181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aatcctccag cacagcctac
      241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagagattac
      301 tacggtagta gctgggggta ctactttgac tactggggcc aaggcaccac tctcacagtc
      361 tcctca
//
LOCUS       MUSIGHABV     321 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34582
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH27.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   321     Ig heavy chain V-D-J-region (AA at 1)
    recomb      258      259     V-region DNA end/D-region DNA start
    recomb      270      271     D-region DNA end/J-region DNA start
BASE COUNT       81 a     90 c     63 g     81 t      6 others
ORIGIN      Chromosome 12.
        1 aaaccctnac agtnactctt nnttacctgc tctattactg ntttccccat caccagtggt
       61 tactactgga tctggatccg tcagtcacct gggaaacccc tagaatggat ggggtacatc
      121 actcatagtg gggaaacttt ctacaaccca tctctccaga gccccatctc cattactaga
      181 gaaacgtcaa agaaccagtt cttcctccaa ttgaactctg tgaccacaga ggacacagcc
      241 atgtattact gtncaggaga ctatgatggt tactggtact tcgatgtctg gggcacaggg
      301 accacggtca ccgtctcctc a
//
LOCUS       MUSIGHABW     336 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig heavy-chain mRNA V-D-J region, partial cds.
ACCESSION   M34583
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH32.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 336)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   336     Ig heavy chain V-D-J-region (AA at 1)
    recomb      273      277     V-region DNA end/D-region DNA start
    recomb      285      286     D-region DNA end/J-region DNA start
BASE COUNT       82 a     99 c     71 g     82 t      2 others
ORIGIN      Chromosome 12.
        1 ggacctggcc tggtgaaacc ctcacagtna ctcttcctta cctgctctat tactggtttc
       61 cccatcacca gtggttacta ctggatctgg atccgtcagt cacctgggaa acccctagaa
      121 tggatggggt acatcactca tagtggggaa actttctaca acccatctct ccagagcccc
      181 atctccatta ctagagaaac gtcaaagaac cagttcttcc tccaattgaa ctctgtgacc
      241 acagaggaca cagccatgta ttactgtnca ggagacaccg atggttactg gtacttcgat
      301 gtctggggca cagggaccac ggtcaccgtc tcctca
//
LOCUS       MUSIGKABQ     306 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION   M34586
KEYWORDS    immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 306)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   306     Ig kappa-chain V-J-region (AA at 1)
    recomb      284      285     V-region DNA end/J-region DNA start
BASE COUNT       75 a     85 c     77 g     69 t
ORIGIN      Chromosome 6.
        1 caaattgttc tcacccagtc tccagcaatc atgtctgcat ctccagggga gaaggtcacc
       61 atgacctgca gtgccagctc aagtgtaagt tacatgcact ggtaccagca gaagtcaggc
      121 acctccccca aaagatggat ttatgacaca tccaaactgg cttctggagt ccctgctcgc
      181 ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagcagcat ggaggctgaa
      241 gatgctgcca cttattactg ccagcagtgg agtagtaacc cgtggacgtt cggtggaggc
      301 accaag
//
LOCUS       MUSIGKABR     339 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION   M34588
KEYWORDS    immunoglobulin heavy-chain; joining exon;
            kappa-immunoglobulin: processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   339     Ig kappa-chain V-J-region (AA at 1)
    recomb      302      303     V-region DNA end/J-region DNA start
BASE COUNT       89 a     81 c     81 g     86 t      2 others
ORIGIN      Chromosome 6.
        1 gatgttttga tgacccaaac tccactctcc ctncctgtca gtcttggaga tcaagcctcc
       61 atctcttgca gatctagtca gagcattgta catagtaatg gaaacaccta tttagaatgg
      121 tacctgcaga aaccaggcca gtctccaaag ctcctnatct acaaagtttc caaccgattt
      181 tctggggtcc cagacaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc
      241 agcagagtgg aggctgagga tctgggagtt tattactgct ttcaaggttc acatgttccg
      301 tggacgttcg gtggaggcac caagctggaa atcaaacgg
//
LOCUS       MUSIGKABS     339 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION   M34589
KEYWORDS    immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH31.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   339     Ig kappa-chain V-J-region (AA at 1)
    recomb      302      303     V-region DNA end/J-region DNA start
BASE COUNT       89 a     79 c     82 g     86 t      3 others
ORIGIN      Chromosome 6.
        1 gatgttttga tgacncaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc
       61 atctcttgca gatctagtca gagcattgta catagtaatg gaaacaccta tttagaatgg
      121 tacctgcaga aaccaggcca gtctccaaag ctcctnatct acaaagtttc caaccgattt
      181 tctggggtcc caganaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc
      241 agcagagtgg aggctgagga tctgggagtt tattactgct ttcaaggttc acatgttccg
      301 tggacgttcg gtggaggcac caagctggaa atcaaacgg
//
LOCUS       MUSIGKABT     323 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION   M34590
KEYWORDS    immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH12.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 323)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   323     Ig kappa-chain V-J-region (AA at 1)
    recomb      288      289     V-region DNA end/J-region DNA start
BASE COUNT       96 a     81 c     70 g     76 t
ORIGIN      Chromosome 6.
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa
      241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctccgac gttcggtgga
      301 ggcaccaagc tggaaatcaa acg
//
LOCUS       MUSIGKABU     339 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION   M34591
KEYWORDS    immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH27.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   339     Ig kappa-chain V-J-region (AA at 1)
    recomb      290      291     V-region DNA end/J-region DNA start
BASE COUNT       84 a     91 c     81 g     82 t      1 others
ORIGIN      Chromosome 6.
        1 gagaatgtgc tcacccagtc tccaaccacc atggctgcat ctcccggnga gaagatcact
       61 atcacctgca gtgccagctc aagtataagt tccaattact tgcattggta tcagcagaag
      121 ccaggattct cccctaaact cttgatttat aggacatcca atctggcttc tggagtccca
      181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaattgg caccatggag
      241 gctgaagatg ttgccactta ctactgccag cagggtagta gtataccacg cacgttcggc
      301 tcggggacaa agttggaaat aaaacgggct gatgctgca
//
LOCUS       MUSIGKABV     327 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION   M34592
KEYWORDS    immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH32.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 327)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   327     Ig kappa-chain V-J-region (AA at 1)
    recomb      290      291     V-region DNA end/J-region DNA start
BASE COUNT       82 a     87 c     80 g     78 t
ORIGIN      Chromosome 6.
        1 gagaatgtgc tcacccagtc tccaaccacc atggctgcat ctcccgggga gaagatcact
       61 atcacctgca gtgccagctc aagtataagt tccaattact tgcattggta tcagcagaag
      121 ccaggattct cccctaaact cttgatttat aggacatcca atctggcttc tggagtccca
      181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaattgg caccatggag
      241 gctgaagatg ttgccactta ctactgccag cagggtagta gtataccgta cacgttcgga
      301 ggggggacca agctggaaat aaaacgg
//
LOCUS       MUSIGKABW     324 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig kappa-chain mRNA V-J region, partial cds.
ACCESSION   M34593
KEYWORDS    immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH34.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig kappa-chain V-J-region (AA at 1)
    recomb      287      288     V-region DNA end/J-region DNA start
BASE COUNT       96 a     76 c     80 g     72 t
ORIGIN      Chromosome 6.
        1 gacatcaaga tgacccagtc tccatcctcc atgtatgcat cgctgggaga gagagtcact
       61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca
      121 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca
      181 agattcagtg gcagtggatc tgggcaagat tattctctaa ccatcagcag cctggagtct
      241 gacgatacag caacttatta ctgtctacag catggtgaga gcccgtacac gttcggaggg
      301 gggaccaagc tggaaataaa acgt
//
LOCUS       MUSIGLBLP     349 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig lambda-chain mRNA V-J region, partial cds.
ACCESSION   M34587
KEYWORDS    immunoglobulin heavy-chain; joining exon; lambda-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH9.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 349; revises [1])
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by C.A.Pennell, 20-APR-1990.
            
FEATURES       from  to/span     description
    pept    <     1  >   349     Ig lambda-chain V-J-region (AA at 1)
    recomb      293      294     V-region DNA end/J-region DNA start
BASE COUNT       87 a     94 c     81 g     86 t      1 others
ORIGIN      Chromosome 16.
        1 caggctgttg tgactcagga atctgcactc accacatcac ctggtggaac agtcatactc
       61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa
      121 aaaccagatc atttattcac tggtctaata ggtggtacca gcaaccgagc tccaggtgtt
      181 cctgtcagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca
      241 cagactgagg atgatgcaat gtatttctgt gctctatggt acagcaccca ttatgttttc
      301 ggcggtggaa ccaaggtcac tgtcctaggt cagcccaagt ccantccca
//
LOCUS       MUSIGLBN      330 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig lambda-chain mRNA V-J region, partial cds.
ACCESSION   M34584
KEYWORDS    immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 330)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 330; revises [1])
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   330     Ig lambda-chain V-J-region (AA at 1)
    recomb      291      292     V-region DNA end/J-region DNA start
BASE COUNT       87 a     87 c     79 g     77 t
ORIGIN      Chromosome 16.
        1 caggctgttg tgactcagga atctgcactc accacatcac ctggtgaaac agtcacactc
       61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa
      121 aaaccagatc atttattcac tggtctaata ggtggtacca acaaccgagc tccaggtgtt
      181 cctgccagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca
      241 cagactgagg atgaggcaat atatttctgt gctctatggt acagcaacca ctgggtgttc
      301 ggtggaggaa ccaaactgac tgtcctaggc
//
LOCUS       MUSIGLBO      330 bp ss-mRNA            ROD       17-JUL-1990
DEFINITION  Mouse Ig lambda-chain mRNA V-J region, partial cds.
ACCESSION   M34585
KEYWORDS    immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA
            to mRNA, clone CH2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 330)
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  TITLE     Restricted Ig variable region gene expression among Ly-1+ B cell
            lymphomas
  JOURNAL   J. Immunol. 141, 2788-2796 (1988)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 330; revises [1])
  AUTHORS   Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by C.A.Pennell, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   330     Ig lambda-chain V-J-region (AA at 1)
    recomb      291      292     V-region DNA end/J-region DNA start
BASE COUNT       87 a     86 c     78 g     79 t
ORIGIN      Chromosome 16.
        1 caggctgttg tgactcagga atctgcactc accacatcac ctggtgaaac agtcacactc
       61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa
      121 aaaccagatc atttattcac tggtctaata ggtggtacca acaaccgagc tccaggtgtt
      181 cctgccagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca
      241 cagactgagg atgaggcaat atatttctgt gctctatggt acagcaacca tttggtgttc
      301 ggtggaggaa ccaaactgac tgtcctaggc
//

GenBank-Updates@genbank.bio.net (07/18/90)

LOCUS       HUMELA2       906 bp ss-mRNA            PRI       18-JUL-1990
DEFINITION  Human elastase 2 mRNA, complete cds.
ACCESSION   M16631
KEYWORDS    elastase.
SOURCE      Human pancreas, cDNA to mRNA (library of R.Weiss), clones
            hpe2-lambda-[4,10].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 906)
  AUTHORS   Fletcher,T.S., Shen,W.-F. and Largman,C.
  TITLE     Primary structure of human pancreatic elastase 2 determined by
            sequence analysis of the cloned mRNA
  JOURNAL   Biochemistry 26, 7256-7261 (1987)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by T.S.Fletcher, 05-AUG-1987.
            
            A poly-adenylation signal is located at positions 891-896.
FEATURES       from  to/span     description
    pept         22      831     elastase 2 precursor
    sigp         22       69     elastase 2 signal peptide
    matp         70      828     elastase 2
BASE COUNT      197 a    274 c    253 g    182 t
ORIGIN      161 bp upstream of PstI site.
        1 aaacagtccc agggacacac catgataagg acgctgctgc tgtccacttt ggtggctgga
       61 gccctcagtt gtggggaccc cacttaccca ccttatgtga ctagggtggt tggcggtgaa
      121 gaagcgaggc ccaacagctg gccctggcag gtctccctgc agtacagctc caatggcaag
      181 tggtaccaca cctgcggagg gtccctgata gccaacagct gggtcctgac ggctgcccac
      241 tgcatcagct cctccaggac ctaccgcgtg gggctgggcc ggcacaacct ctacgttgcg
      301 gagtccggct cgctggcagt cagtgtctct aagattgtgg tgcacaagga ctggaactcc
      361 aaccaaatct ccaaagggaa cgacattgcc ctgctcaaac tggctaaccc cgtctccctc
      421 accgacaaga tccagctggc ctgcctccct cctgccggca ccattctacc caacaactac
      481 ccctgctacg tcacgggctg gggaaggctg cagaccaacg gggctgttcc tgatgtcctg
      541 cagcagggcc ggttgctggt tgtggactat gccacctgct ccagctctgc ctggtggggc
      601 agcagcgtga aaaccagtat gatctgtgct gggggtgatg gcgtgatctc cagctgcaac
      661 ggagactctg gcgggccact gaactgtcag gcgtctgacg gccggtggca ggtgcacggc
      721 atcgtcagct tcgggtctcg cctcggctgc aactactacc acaagccctc cgtcttcacg
      781 cgggtctcca attacatcga ctggatcaat tcggtgattg caaataacta accaaaagaa
      841 gtccctggga ctgtttcaga cttggaaagg tcacagaagg aaaataatat aataaagtga
      901 caactc
//
LOCUS       CLOFTHFSA    1965 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  C.thermoaceticum formyltetrahydrofolate synthetase (FTHFS) gene,
            complete cds.
ACCESSION   J02911
KEYWORDS    formyltetrahydrofolate synthetase.
SOURCE      C.thermoaceticum DNA.
  ORGANISM  Clostridium thermoaceticum
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1965)
  AUTHORS   Lovell,C.R., Przybyla,A. and Ljungdahl,L.G.
  TITLE     Primary structure of the thermostable formyltetrahydrofolate
            synthetase from Clostridium thermoaceticum
  JOURNAL   Biochemistry 29, 5687-5694 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.R.Lovell, 13-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         56     1735     formyltetrahydrofolate synthetase (FTHFS) (ttg
                                 start codon) (EC 6.3.4.3)
    mRNA    <     1     1760     FTHFS mRNA
    binding      41       44     ribosomal binding site (put.)
    signal     1761     1785     pot. transcription termination signal
BASE COUNT      459 a    576 c    532 g    398 t
ORIGIN      
        1 aatatgtgac gtgcaaccgg atgttcactt tttcgaatga gaggagtgga ttacattgtc
       61 caaggtaccc agtgatattg agattgccca ggcagccaaa atgaaaccgg tcatggaact
      121 ggcccgggga ctgggcatcc aagaggacga ggtcgagctt tatggtaagt acaaggccaa
      181 gatctccctc gatgtctatc gtcgcctcaa agacaagcct gacgggaaac taatcctggt
      241 aaccgccatt acccctactc cggccggcga agggaaaact actaccagtg tcggtctcac
      301 cgatgccctg gctcgcctgg ggaaaagggt gatggtctgc ctgcgggagc cctccctggg
      361 acccagcttt ggtatcaaag gcggtgccgc cggcggtggt tatgcccagg tagtacccat
      421 ggaagatatc aacctgcact tcaccggcga tatccacgcc gtcacctatg cccacaacct
      481 gctggcggcc atggtggata accacctgca gcagggtaac gtcctgaata ttgatccccg
      541 taccatcacc tggcgccggg tcatcgacct taatgaccgg gctctgagga acatagtcat
      601 cggcctgggt ggcaaagcca acggcgtacc gcgggagaca gggtttgaca tctccgttgc
      661 ctcggaggtt atggcctgcc tgtgcctggc cagcgacctc atggatctca aggaacgttt
      721 cagccgcatt gttgtcggct acacctatga cggcaaaccg gtcaccgccg gcgatctgga
      781 ggcccagggt tccatggctc ttctcatgaa ggacgccatt aaacccaacc tggtccaaac
      841 cctggagaat acgccggcct ttatccacgg tggtcccttc gccaatatcg cccacggttg
      901 caacagcatt atcgcaacca agacggccct gaaactggcg gattatgtcg tgacggaagc
      961 cggtttcggt gccgacctgg gtgccgagaa gttctatgac gttaaatgcc gttatgccgg
     1021 ctttaaaccc gatgccacag tcatcgtggc taccgtccgc gccctcaaga tgcacggcgg
     1081 cgtacccaaa tcagacctgg ccactgaaaa cctggaagcc ctgcgggaag gctttgccaa
     1141 cctggagaaa cacatcgaaa atatcggcaa gttcggcgta ccggcagtcg tggccatcaa
     1201 tgccttcccc accgataccg aggccgagct aaatctcctc tacgagttgt gcgccaaagc
     1261 tggggccgaa gttgccctct cggaagtctg ggctaagggc ggcgaaggcg gtctggaact
     1321 tgcccggaag gtgttgcaga ccctggagag caggccatcc aacttccatg tcctctacaa
     1381 cctggacctg agtattaaag acaaaattgc caaaatcgcc accgagatct acggggccga
     1441 cggcgtcaac tatacggccg aagccgacaa agctatccag cgttatgaat ccctgggcta
     1501 cggcaacctg ccggtggtca tggccaagac ccaatactcc ttttccgatg acatgaccaa
     1561 gctcgggcgg ccgcggaact ttaccatcac cgtgcgcgag gtgcgcctct cggccggagg
     1621 caggcttatc gtccccatca ccggcgccat aatgaccatg cccgggctgc ccaaacgccc
     1681 ggcggcctgc aacatcgaca tcgatgccga cggcgtcatt accggtcttt tctagtatgc
     1741 ctggagtaaa gcctatccct accccactcc tccagggagt ggggtttttg ctgtctatat
     1801 gagaatgata taactaagaa gctggttttc caagccgtac ctatcatagg ggctgcgcca
     1861 ccgccatgaa ccacgaaaat gcaaaagata tcattcatgt tagaactctg ttaccgagag
     1921 gaacgattat gtctggcatg acgagataga gtataccgta gcgat
//
LOCUS       HUMGLIBSA     176 bp ds-DNA             PRI       18-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region A1.
ACCESSION   M34816
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone A1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 176)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      31       53     human zinc finger protein binding site
    binding      94      116     human zinc finger protein binding site
    binding     157      176     human zinc finger protein binding site
BASE COUNT       68 a     50 c     36 g     22 t
ORIGIN      
        1 aagaacacag gaaccaactg caagagctcc cgaagaccac ccacaatgat ggttgtatgt
       61 cagaaggaca caggaaccaa ctgcaagagc tcccaaagac cacccacaat gatggttgta
      121 tgtcagaagg acacaggaac caactgcaag agctcccaaa aaccacccac aatgat
//
LOCUS       HUMGLIBSB     166 bp ds-DNA             PRI       18-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region A2.
ACCESSION   M34817
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone A2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 166)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      27       49     human zinc finger protein binding site
    binding      90      112     human zinc finger protein binding site
BASE COUNT       60 a     47 c     37 g     22 t
ORIGIN      
        1 acacaggaac caactgcaag agctcccgaa gaccacccac aatgatggtg gtatgttgaa
       61 aggacacagg aaccaactgc aagagctccc aaagaccacc cacaatgatg gttgtgtgtc
      121 aaaaggacac aggaaccaac tgtaagagct cctgaagacc acccac
//
LOCUS       HUMGLIBSC     313 bp ds-DNA             PRI       18-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region A3.
ACCESSION   M34818
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone A3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 313)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding       1       23     human zinc finger protein binding site
    binding      64       86     human zinc finger protein binding site
    binding     127      149     human zinc finger protein binding site
    binding     190      212     human zinc finger protein binding site
    binding     253      275     human zinc finger protein binding site
BASE COUNT      110 a     79 c     75 g     49 t
ORIGIN      
        1 caaagaccac ccacaatgat ggttgtatgt cagaaggaca caggaaccaa ctgcaagagc
       61 tcccaaaaac cacccacaat gatggtggta tgtcagatgg acacaggaac caactgcaag
      121 agctcccgaa gaccacccac aatgatggtg gtatgttgaa aggacacagg aaccaactgc
      181 aagagctccc aaagaccacc cacgatgatg gttgtgtgtc aaaaggacac aggaaccaac
      241 tgtaagagct cctgaagacc acccacaatg atggttttat gtcagaagga tacaggaacc
      301 agctgcaagg gct
//
LOCUS       HUMGLIBSD     184 bp ds-DNA             PRI       18-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region A4.
ACCESSION   M34819
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone A4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 184)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      32       54     human zinc finger protein binding site
    binding      95      117     human zinc finger protein binding site
    binding     158      180     human zinc finger protein binding site
BASE COUNT       67 a     46 c     41 g     30 t
ORIGIN      
        1 aaaagacaca ggaaccaact gcaagagctc ccaaagacca cccacaatga tggttgtgtg
       61 tcaaaaggac acaggaacca actgtaagag ctcctgaaga ccacccacaa tgatggtttt
      121 atgtcagaag gatacaggaa ccagctgcaa gagctcctga agaccaccca caatgatggt
      181 tgta
//
LOCUS       HUMGLIBSE      62 bp ds-DNA             PRI       18-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region B1.
ACCESSION   M34820
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone B1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 62)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      11       33     human zinc finger protein binding site
BASE COUNT       18 a     26 c     11 g      7 t
ORIGIN      
        1 aaaccagacg cgtggaccac ccaagacgaa attcacactc gaccctagcc ccgccagccc
       61 tt
//
LOCUS       HUMGLIBSF      67 bp ds-DNA             PRI       18-JUL-1990
DEFINITION  Human zinc finger protein (GLI) DNA binding region C1.
ACCESSION   M34821
KEYWORDS    GLI protein binding site.
SOURCE      Human DNA, clone C1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 67)
  AUTHORS   Kinzler,K.W. and Vogelstein,B.
  TITLE     The GLI gene encodes a nuclear protein which binds specific
            sequences in the human genome
  JOURNAL   Mol. Cell. Biol. 10, 634-642 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      16       29     human zinc finger protein binding site
BASE COUNT       15 a     26 c     15 g     11 t
ORIGIN      
        1 agacccaggc acacagatag accacccagc ttcaggtggg ggaccaccct tgcatcccct
       61 ctctgct
//
LOCUS       MLVU3RA       383 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Murine leukemia virus CWM-T-15.4 U3 region genomic RNA.
ACCESSION   M34722
KEYWORDS    U3 region; promoter region.
SOURCE      Murine leukemia virus (strain CWM-T-15.4), cDNA to genomic RNA,
            from mice (strain CWB).
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 383)
  AUTHORS   Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
  TITLE     Oncogenicity and U3 region sequences of class II recombinant MuLVs
            of CWD mice
  JOURNAL   Virology 176, 166-177 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    signal      112      178     MuLV promoter region
    binding     131      144     CCAAT-binding transcription factor
    binding     147      152     Moloney leukemia virus binding protein b
    binding     156      167     enhancer core binding protein
    binding     350      362     CCAAT-binding transcription factor
    binding     363      372     glucocorticoid response element
BASE COUNT      127 a    105 c     84 g     67 t
ORIGIN      
        1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat cttgcaaggc
       61 atggaaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc
      121 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccctagc
      181 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg
      241 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc
      301 gcttattgct gcccagctct ataaaaaggg taaaaacccc acactcggcc ggcccagggc
      361 caagaacaga tggtccccag aaa
//
LOCUS       MLVU3RB       348 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Murine leukemia virus CWM-T-15 U3 region genomic RNA.
ACCESSION   M34723
KEYWORDS    U3 region; promoter region.
SOURCE      Murine leukemia virus (strain CWM-T-15), cDNA to genomic RNA, from
            mice (strain CWB).
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
  TITLE     Oncogenicity and U3 region sequences of class II recombinant MuLVs
            of CWD mice
  JOURNAL   Virology 176, 166-177 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    signal       77      177     MuLV promoter region
    binding      96      109     CCAAT-binding transcription factor
    binding     112      117     Moloney leukemia virus binding protein b
    binding     121      132     enhancer core binding protein
    binding     142      154     CCAAT-binding transcription factor
    binding     155      164     glucocorticoid response element
BASE COUNT      115 a     97 c     76 g     60 t
ORIGIN      
        1 ctgcagtaac gccatcttgc aaggcatgga aaaataccag agctgatgtt ctcagaaaaa
       61 caagaacaag gaagtacaga gaggctggaa agtaccggga ctagggccaa acaggatatc
      121 tgtggtcaaa cacctgggcc ccggcccagg gccaagaaca gatggtcccc agaaatagct
      181 aaaacaacaa cagtttcaag agacccaaaa actgtctcaa ggttccccag atgaccgggg
      241 atcaacccca agcctcattt aaactaacca atcagctcgc ttctcgcttc tgtacccgcg
      301 cttattgctg cccagctcta taaaaagggt aaaaacccca cactcggc
//
LOCUS       MLVU3RC       445 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Murine leukemia virus CWM-T-25a U3 region genomic RNA.
ACCESSION   M34724
KEYWORDS    U3 region; promoter region.
SOURCE      Murine leukemia virus (strain CWN-T-25a), cDNA to genomic RNA, from
            mice (strain CWB).
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 445)
  AUTHORS   Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
  TITLE     Oncogenicity and U3 region sequences of class II recombinant MuLVs
            of CWD mice
  JOURNAL   Virology 176, 166-177 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    signal       77      274     MuLV promoter region
    binding      96      109     CCAAT-binding transcription factor
    binding     168      179     CCAAT-binding transcription factor
    binding     238      252     CCAAT-binding transcription factor
    binding     138      143     Moloney leukemia virus binding protein b
    binding     209      214     Moloney leukemia virus binding protein b
    binding     147      158     enhancer core binding protein
    binding     218      229     enhancer core binding protein
    binding     180      189     glucocorticoid response element
    binding     252      263     glucocorticoid response element
BASE COUNT      142 a    126 c    104 g     73 t
ORIGIN      
        1 ctgcagtaac gccattttgc aaggcatgaa aaagtaccag agctgaagtt ctcagaaaaa
       61 caagaacaag gaagtacaga gaggctaaaa agtacccggc ccagggccaa gaacagatgg
      121 tcccagaccg ctaacgacag gatatctgtg gttaagcact agggccccgc ccagggccaa
      181 gaacagatgg tccccagacc gctaacgaca ggatatctgt ggttaagcac tagggccccg
      241 gcccagggcc aagaacagat ggtccccaga aatagctaaa acaacaacag tttcaagaga
      301 cgcagaaact gtctcaaggt tccccagatg accggggatc aaccccaagc ctcatttaaa
      361 ctaaccaatc agctcgcttc tcgcttctgt acccgcgctt attgctgccc agctctataa
      421 aaagggtaag aaccccacac tcggc
//
LOCUS       MLVU3RD       409 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Murine leukemia virus CWM-T-25b U3 region genomic RNA.
ACCESSION   M34725
KEYWORDS    U3 region; promoter region.
SOURCE      Murine leukemia virus (strain CWN-T-25b), cDNA to genomic RNA, from
            mice (strain CWB).
  ORGANISM  Murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 409)
  AUTHORS   Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C.
  TITLE     Oncogenicity and U3 region sequences of class II recombinant MuLVs
            of CWD mice
  JOURNAL   Virology 176, 166-177 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    signal      112      238     MuLV promoter region
    binding     131      144     CCAAT-binding transcription factor
    binding     174      179     Moloney leukemia virus binding protein b
    binding     183      194     enhancer core binding protein
    binding     204      215     CCAAT-binding transcription factor
    binding     216      225     glucocorticoid response element
BASE COUNT      133 a    114 c     91 g     71 t
ORIGIN      
        1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc
       61 atgggaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc
      121 taaaaggtac ccggcccagg gccaagaaca gatggtcccc agaccgctaa cgacaggata
      181 tctgtggtta agcactaggg ccccggccag ggccaagaac agatggtccc cagaaatagc
      241 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg
      301 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc
      361 gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggc
//
LOCUS       SIVPBJA       125 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Simian immunodeficiency virus upstream enhancer region.
ACCESSION   M31325
KEYWORDS    .
SOURCE      Simian (mangabey) immunodeficiency virus (isolate PBj14), clone
            4.41.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 125)
  AUTHORS   Dewhurst,S., Embretson,J.E., Anderson,D.C., Mullins,J.I. and
            Fultz,P.N.
  TITLE     Sequence analysis and acute pathogenicity of molecularly cloned SIV
  JOURNAL   Nature 345, 636-640 (1990)
  STANDARD  simple staff_review
BASE COUNT       40 a     25 c     38 g     22 t
ORIGIN      
        1 gaagaaaaca agctaagaca gcagggactt tccacaaaga cagcagggac tttccacaaa
       61 ggggatgtta cggggaggta ctggggagga gctggctgga acgcccacgt attctctgta
      121 taaat
//
LOCUS       CPARRAA       119 bp ss-rRNA            RNA       18-JUL-1990
DEFINITION  C.paradoxa cellular 5S ribosomal RNA.
ACCESSION   M33029
KEYWORDS    5S ribosomal rRNA.
SOURCE      C.paradoxa rRNA.
  ORGANISM  Cyanophora paradoxa
            Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; 
            Cryptomonadales; Kathablepharidaceae.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Maxwell,E.S., Liu,J. and Shively,J.M.
  TITLE     Nucleotide sequence of Cyanophora paradoxa cellular and cyanelle-
            associated 5S ribosomal RNAs
  JOURNAL   Ann. N.Y. Acad. Sci. 503, 559-561 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      119     5S ribosomal RNA
BASE COUNT       25 a     35 c     37 g     22 t
ORIGIN      
        1 gtgtacggct atactaccgg aaaagcgccc gttcccgtcc gattacgaga gcctagcccg
       61 gtcaggcccg actagtacta gggtggggga ccacctggga acatcgggtg ctgtacact
//
LOCUS       CPARRAB       118 bp ss-rRNA            RNA       18-JUL-1990
DEFINITION  C.paradoxa cyanellar 5S ribosomal RNA.
ACCESSION   M33030
KEYWORDS    5S ribosomal rRNA.
SOURCE      C.paradoxa cyanellar rRNA.
  ORGANISM  Cyanelle Cyanophora paradoxa
            Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; 
            Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Maxwell,E.S., Liu,J. and Shively,J.M.
  TITLE     Nucleotide sequence of Cyanophora paradoxa cellular and cyanelle-
            associated 5S ribosomal RNAs
  JOURNAL   Ann. N.Y. Acad. Sci. 503, 559-561 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      118     5S ribosomal RNA
BASE COUNT       31 a     26 c     31 g     30 t
ORIGIN      
        1 cctggtgctt attgcgtggt ggaaccacac cgatccatac cgaactcggt tgtgaaacat
       61 tactgcggcg acaatacttg aagggtaact ttctggaaag atagcttagt gccaggat
//
LOCUS       SNDZSTR     15384 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Sendai virus nucleoprotein (NP), phosphoprotein (P), matrix protein
            (M), F protein, hemagglutinin-neuraminidase (HN), and L protein
            genes, complete cds.
ACCESSION   M30202 M30203 M30204 M19661
KEYWORDS    F protein; L protein; hemagglutinin-neuraminidase; matrix protein;
            nucleoprotein; phosphoprotein.
SOURCE      Sendai virus (Z strain, F1-R mutant, and ts-f1 mutant) RNA.
  ORGANISM  Parainfluenza virus type 1
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Paramyxoviridae; Paramyxovirus.
REFERENCE   1  (sites)
  AUTHORS   Tashiro,M., Pritzer,E., Khoshnan,M.A., Yamakawa,M., Kuroda,K.,
            Klenk,H.-D., Rott,R. and Seto,J.T.
  TITLE     Characterization of a pantropic variant of Sendai virus derived
            from a host range mutant
  JOURNAL   Virology 165, 577-583 (1988)
  STANDARD  full staff_review
REFERENCE   2  (bases 4866 to 6560; for [1])
  AUTHORS   Tashiro,M., Pritzer,E., Khoshnan,M.A., Yamakawa,M., Kuroda,K.,
            Klenk,H.-D., Rott,R. and Seto,J.T.
  JOURNAL   Unpublished (1988)
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 15384)
  AUTHORS   Middleton,Y., Tashiro,M., Thai,T., Oh,J., Seymour,J., Pritzer,E.,
            Klenk,H.-D., Rott,R. and Seto,J.T.
  TITLE     Nucleotide sequence analyses of the genes encoding the HN, M, NP,
            P, and L proteins of two host range mutants of Sendai virus
  JOURNAL   Unpublished (1989) California State University at Los Angeles,
            Dept. of Microbiology, Los Angeles, CA 90032-8745
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] and [2] kindly
            submitted by J.T.Seto, 08-JUL-1988, for [3], 27-NOV-1989.
            
               Author Address: J.T.Seto
                               California State University at Los Angeles
                               Department of Microbiology
                               Los Angeles, CA 90032-8745
FEATURES       from  to/span     description
    pept        120     1694     nucleoprotein (NP)
    pept       1844     3550     phosphoprotein (P)
    pept       3669     4715     matrix protein (M)
    pept       4866     6563     F protein
    pept       6693     8420     hemagglutinin-neuraminidase (HN)
    pept       8556    15242     L protein
    variant     310      310     c in strain Z and F1-R mutant; t in ts-f1
                                 mutant
    variant     350      350     c in strain Z and F1-R mutant; t in ts-f1
                                 mutant
    variant    1580     1580     t in strain Z and F1-R mutant; c in ts-f1
                                 mutant
    variant    1586     1586     g in strain Z and ts-f1 mutant; a in F1-R
                                 mutant
    variant    2307     2307     g in strain Z; a in F1-R mutant and ts-fl
                                 mutant
    variant    4020     4020     a in strain Z and F1-R mutant; c in ts-fl
                                 mutant
    variant    4051     4051     g in strain Z and F1-R mutant; a in ts-fl
                                 mutant
    variant    4297     4297     c in strain Z and F1-R mutant; t in ts-fl
                                 mutant
    variant    5053     5053     t in strain Z and F1-R mutant; g in ts-fl
                                 mutant
    variant    5176     5176     g in strain Z and F1-R mutant; a in ts-fl
                                 mutant
    variant    5208     5208     c in strain Z and F1-R mutant; t in ts-fl
                                 mutant
    variant    5212     5212     a in strain Z and F1-R mutant; g in ts-fl
                                 mutant
    variant    5700     5700     a in strain Z and F1-R mutant; g in ts-fl
                                 mutant
    variant    5708     5708     t in strain Z and F1-R mutant; c in ts-fl
                                 mutant
    variant    6530     6530     a in strain Z and F1-R mutant; c in ts-fl
                                 mutant
    variant   10429    10429     g in strain Z; a in F1-R mutant and ts-fl
                                 mutant
    variant   15262    15262     c in strain Z and F1-R mutant; a in ts-fl
                                 mutant
BASE COUNT     4595 a   3329 c   3756 g   3704 t
ORIGIN      
        1 accaaacaag agaaaaaaca tgtatgggat atgtaatgaa gttatacagg attttagggt
       61 caaagtatcc accctgagga gcaggttcca gaccctttgc tttgctgcca aagttcacga
      121 tggccgggtt gttgagcacc ttcgatacat ttagctctag gaggagcgaa agtattaata
      181 agtcgggagg aggtgctgtt atccccggcc agaggagcac agtctcagtg ttcgtactag
      241 gcttaagtgt gactgatgat gcagacaagt tattcattgc aactaccttc ctagctcact
      301 cattggacac agataagcag cactctcaga gaggagggtt cctcgtctcc ctgcttgcca
      361 tggcttacag tagtccagaa ttgtacttga caacaaacgg agtaaacgcc gatgtcaaat
      421 atgtgatcta caacatagag aaagacccta agaggacgaa gacagacgga ttcattgtga
      481 agacgagaga tatggaatat gagaggacca cagaatggct gtttggacct atggtcaaca
      541 agagcccact cttccagggt caacgggatg ctgcagaccc tgacacactc cttcaaatct
      601 atgggtatcc tgcatgccta ggagcaataa ttgtccaagt ctggattgtg ctggtgaagg
      661 ccatcacaag cagcgccggc ttaaggaaag ggttcttcaa caggttagag gcgttcagac
      721 aagacggcac cgtgaaaggt gccttagttt tcactgggga gacagttgag gggataggct
      781 cggttatgag atctcagcaa agccttgtat ctctcatggt tgagaccctt gtgactatga
      841 atactgcaag atctgatctc accacattag agaagaacat ccagatagtt gggaactaca
      901 tccgagatgc agggctggct tccttcatga acactattaa atatggggtg gagacaaaga
      961 tggcagctct aacgttgtca aacctgaggc ccgatattaa taagcttaga agcctcatag
     1021 acacctacct gtcaaaaggc cccagagctc cctttatctg tatcctcaag gaccctgttc
     1081 atggtgaatt tgctccaggc aattatcctg cactatggag ttacgccatg ggagtcgccg
     1141 tcgtacagaa caaggcaatg cagcagtacg tcacagggag gacatacctt gatatggaaa
     1201 tgttcttact aggacaagcc gtggcaaagg atgctgaatc gaagatcacg agtgccttgg
     1261 aagatgagtt aggagtgacg gatacagcca aggggaggct cagacatcat ctggcaaact
     1321 tgtccggtgg ggatggtgct taccgcaaac caacaggcgg tggtgcaatt gaggtagctc
     1381 tagacaatgc cgatatcgac ctagaaacaa aagctcatgc ggaccaggac gctaggggtt
     1441 ggggtggaga tagtggtgaa agatgggcac gtcaggtgag tggtggccac tttgtcacac
     1501 tacatggggc tgaacggtta gaggaggaaa ccaatgatga ggatgtatca gacatagaga
     1561 gaagaatagc catgagactt gcagagagac ggcaagagga ttctgcaacc catggagatg
     1621 aaggccgcaa taacggtgtc gatcacgacg aagatgacga taccgcagca gtagctgggg
     1681 taggaggaat ctaggatcat acgaggcttc aaggtacttg atccgtagta agaaaaactt
     1741 agggtgaaag ttcatccacc gatcggctca ggcaaggcca cacccaaccc caccgaccac
     1801 acccagcagt cgagacagcc acggcttcgg ctacacttac cgcatggatc aagatgcctt
     1861 cattcttaaa gaagattctg aagttgagag ggaggcgcca ggaggacgag agtcgctctc
     1921 ggatgttatc ggattcctcg atgctgtcct gtcgagtgaa ccaactgaca tcggagggga
     1981 cagaagctgg ctccacaaca ccatcaacac tccccaagga ccaggctctg ctcatagagc
     2041 caaaagtgag ggcgaaggag aagtctcaac accgtcgacc caagataatc gatcaggtga
     2101 ggagagtaga gtctctggga gaacaagcaa gccagaggca gaagcacatg ctggaaacct
     2161 tgataaacaa aatatacacc gggcctttgg gggaagaact ggtacaaact ctgtatctca
     2221 ggatctgggc gatggaggag actccggaat ccttgaaaat cctccaaatg agagaggata
     2281 tccgagatca ggtattgaag atgaaagcag agagatggct gcgcaccctg ataagagggg
     2341 agaagaccaa gctgaaggac ttccagaaga ggtacgagga ggtacatccc tacctgatga
     2401 aggagaaggt ggagcaagta ataatggaag aagcatggag cctggcagct cacatagtgc
     2461 aagagtaact ggggtcctgg tgattcctag ccccgaactt gaagaggctg tgctacggag
     2521 gaacaaaaga agacctacca acagtgggtc caaacctctt actccagcaa ccgtgcctgg
     2581 cacccggtcc ccaccgctga atcgttacaa cagcacaggg tcaccaccag gaaaaccccc
     2641 atctacacag gatgagcaca tcaactctgg ggacaccccc gccgtcaggg tcaaagaccg
     2701 gaaaccacca atagggaccc gctctgtctc agattgtcca gccaacggcc gcccaatcca
     2761 cccgggtcta gagaccgact caacaaaaaa gggcatagga gagaacacat catctatgaa
     2821 agagatggct acattgttga cgagtcttgg tgtaatccag tctgctcaag aattcgaatc
     2881 atcccgagac gcgagttatg tgtttgcaag acgtgcccta aagtctgcaa actatgcaga
     2941 gatgacattc aatgtatgcg gcctgatcct ttctgccgag aaatcttccg ctcgtaaggt
     3001 agatgagaac aaacaactgc tcaaacagat ccaagagagc gtggagtcat tccgggatat
     3061 ttataagaga ttctctgagt atcagaaaga acagaactca ttgctgatgt ccaacctatc
     3121 tacacttcat atcatcacag atagaggtgg caagactgac aacacagact cccttacaag
     3181 gtccccctcc gtttttgcaa aatcaaaaga gaacaagact aaggctacca ggtttgaccc
     3241 atctatggag accctagaag atatgaagta caaaccggac ctaatccgag aggatgaatt
     3301 tagagatgag atccgcaacc cggtgtacca agagagggac acagaaccta gggcctcaaa
     3361 cgcatcacgt ctcctcccct ccaaagagaa gcccacaatg cactctctca ggctcgtcat
     3421 agagagcagt cccctaagca gagctgagaa agcagcatat gtgaaatcat tatccaagtg
     3481 caagacagac caagaggtta aggcagtcat ggaactcgta gaagaggaca tagagtcact
     3541 gaccaactag atcccgggtg aggcatccta ccatcctcag tcatagagag atccaatcta
     3601 ccatcagcat cagccagtaa agattaagaa aaacttaggg tgaaagaaat ttcacctaac
     3661 acggcgcaat ggcagatatc tatagattcc ctaagttctc atatgaggat aacggtactg
     3721 tggagcccct gcctctgaga actggtccgg ataagaaagc catcccccac atcaggattg
     3781 tcaaggtagg agtccctcct aaacatggag tgagatacct agatttattg ctcttgggtt
     3841 tctttgagac accgaaacaa acaaccaatc tagggagcgt atctgacttg acagagccga
     3901 ccagctactc aatatgcggc tccgggtcgt tacccatagg tgtggccaaa tactacggga
     3961 ctgatcagga actcttaaag gcctgcaccg atctcagaat tacggtgagg aggactgtta
     4021 gagcaggaga gatgatcgta tacatggtgg gttcgattgg tgctccactc ctaccatggt
     4081 caggcaggct gagacaggga atgatattta atgcaaacaa ggtcgcacta gctccccaat
     4141 gcctccctgt ggacaaggac ataagactca gagtggtgtt tgtcaatggg acatctctag
     4201 gggcaatcac catagccaag atcccaaaga cccttgcaga ccttgcattg cccaactcta
     4261 tatccgttaa tttactggtg acactcaaga ccgggacctc cacagaacaa aagggggtac
     4321 tcccagtact tgatgatcaa ggggagaaaa agctcaattt tatggtgcac ctcgggttga
     4381 tcaggagaaa ggtcgggaag atatactctg ttgagtactg caagagcaag attgagagaa
     4441 tgcggctgat tttctcactt gggttaatcg gcggtataag cttccatgtt caggttattg
     4501 ggacactatc taagacattc atgagtcagc tcgcatggaa gagggcagtc tgcttcccat
     4561 taatggatgt gaatccccat atgaacatgg tgatttgggc ggcatctgta gaaatcacag
     4621 gcgtcgatgc ggtgttccaa ccggccatcc ctcgtgattt ccgctactac cctaatgttg
     4681 tggctaagaa catcggaagg atcagaaagc tgtaaatgtg cacccatcag agacctgcga
     4741 caatgcccca agcagacacc acctggcagt cggagccacc gggtcactcc ttgtcttaaa
     4801 taagaaaaac ttagggataa agtcccttgt gagtgcttgg ttgcaaaact ctccccttgg
     4861 gaaacatgac agcatatatc cagagatcac agtgcatctc aacatcacta ctggttgttc
     4921 tcaccacatt ggtctcgtgt cagattccca gggataggct ctctaacata ggggtcatag
     4981 tcgatgaagg gaaatcactg aagatagctg gatcccacga atcgaggtac atagtactga
     5041 gtctagttcc ggtggtagac tttgagaatg ggtgcggaac agcccaggtt atccagtaca
     5101 agagcctact gaacaggctg ttaatcccat tgagggatgc cttagatctt caggaggctc
     5161 tgataactgt caccagtgat acgacacaaa atgccggtgc tccacagccg aaattcttcg
     5221 gtgctgtgat tggtactatc gcacttggag tggcgacatc agcacaaatc accgcaggga
     5281 ttgcactagc cgaagcgagg gaggccaaaa gagacatagc gctcatcaaa gaatcgatga
     5341 caaaaacaca caagtctata gaactgctgc aaaacgctgt gggggaacaa attcttgctc
     5401 taaagacact ccaggatttc gtgaatgatg agatcaaacc cgcaataagc gaattaggct
     5461 gtgagactgc tgccttaaga ctgggtataa aattgacaca gcattactcc gagctgttaa
     5521 ctgcgttcgg ctcgaatttc ggaaccatcg gagagaagag cctcacgctg caggcgctgt
     5581 cttcacttta ctctgctaac attactgaga ttatgaccac aatcaggaca gggcagtcta
     5641 acatctatga tgtcatttat acagaacaga tcaaaggaac ggtgatagat gtggatctaa
     5701 agagatatat ggtcaccctg tctgtgaaga tccctattct ttctgaagtc ccaggtgtgc
     5761 tcatacacaa ggcatcatct atttcttaca acatagacgg ggaggaatgg tatgtgactg
     5821 tccccagcca tatactcagt cgtgcttctt tcttaggggg tgcagacata accgattgtg
     5881 ttgagtccag attgacctat atatgcccca gggatcccgc acaactgata cctgacagcc
     5941 agcaaaagtg tatcctgggg gacacaacaa ggtgtcctgt cacaaaagtt gtggacagcc
     6001 ttatccccaa gtttgctttt gtgaatgggg gcgttgttgc taactgcata gcatccacat
     6061 gtacctgcgg gacaggccga agaccaatca gtcaggatcg ctctaaaggt gtagtattcc
     6121 taacccatga caactgtggt cttataggtg tcaatggggt agaattgtat gctaaccgga
     6181 gagggcacga tgccacttgg ggggtccaga acttgacagt cggtcctgca attgctatca
     6241 gacccgttga tatttctctc aaccttgctg atgctacgaa tttcttgcaa gactctaagg
     6301 ctgagcttga gaaagcacgg aaaatcctct cggaggtagg tagatggtac aactcaagag
     6361 agactgtgat tacgatcata gtagttatgg tcgtaatatt ggtggtcatt atagtgatca
     6421 tcatcgtgct ttatagactc agaaggtcaa tgctaatggg taatccagat gaccgtatac
     6481 cgagggacac atacacatta gagccgaaga tcagacatat gtacacaaaa ggtgggtttg
     6541 atgcaatggc tgagaaaaga tgatcacgac cattatcaga tgtcttgtaa agcaggcatg
     6601 gtattcgttg agatctgtat ataataagaa aaacttaggg tgaaagtgag gtcgcgcggt
     6661 actttagctt tcacctcaaa caagcacaga tcatggatgg tgataggggc aaacgtgact
     6721 cgtactggtc tacttctcct agtggtagca ctacaaaatt agcatcaggt tgggagaggt
     6781 caagtaaagt tgacacatgg ttgctgattc tctcattcac ccagtgggct ttgtcaattg
     6841 ccacagtgat catctgtatc ataatttctg ctagacaagg gtatagtatg aaagagtact
     6901 caatgactgt agaggcattg aacatgagca gcagggaggt gaaagagtca cttaccagtc
     6961 taataaggca agaggttata gcaagggctg tcaacattca gagctctgtg caaaccggaa
     7021 tcccagtctt gttgaacaaa aacagcaggg atgtcatcca gatgattgat aagtcgtgca
     7081 gcagacaaga gctcactcag cactgtgaga gtacgatcgc agtccaccat gccgagggaa
     7141 ttgccccact tgagccacat agtttctgga gatgccctgt cggagaaccg tatcttagct
     7201 cagatcctga aatctcattg ctgcctggtc cgagcttgtt atctggttct acaacgatct
     7261 ctggatgtgt taggctccct tcactctcaa ttggcgaggc aatctatgcc tattcatcaa
     7321 atctcattac acaaggttgt gctgacatag ggaaatcata tcaggtcctg cagctagggt
     7381 acatatcact caattcagat atgttccctg atcttaaccc cgtagtgtcc cacacttatg
     7441 acatcaacga caatcggaaa tcatgctctg tggtggcaac cgggactagg ggttatcagc
     7501 tttgctccat gccgactgta gacgaaagaa ccgactactc tagtgatggt atcgaggatc
     7561 tggtccttga tgtcctggat ctcaaaggga gaactaagtc tcaccggtat cgcaacagcg
     7621 aggtagatct tgatcacccg ttctctgcac tataccccag tgtaggcaac ggcattgcaa
     7681 cagaaggctc attgatattt cttgggtatg gtggactaac cacccctctg cagggtgata
     7741 caaaatgtag gacccaagga tgccaacagg tgtcgcaaga cacatgcaat gaggctctga
     7801 aaattacatg gctaggaggg aaacaggtgg tcagcgtgat catccaggtc aatgactatc
     7861 tctcagagag gccaaagata agagtcacaa ccattccaat cactgaaaac tatctcgggg
     7921 cggaaggtag attattaaaa ttgggtgatc gggtgtacat ctatacaaga tcatcaggct
     7981 ggcactctca actgcagata ggagtacttg atgtcagcca ccctttgact atcaactgga
     8041 cacctcatga agccttgtct agaccaggaa ataaagagtg caattggtac aataagtgtc
     8101 cgaaggaatg catatcaggc gtatacactg atgcttatcc attgtcccct gatgcagcta
     8161 acgtcgctac cgtcacgcta tatgccaata catcgcgtgt caacccaaca atcatgtatt
     8221 ctaacactac taacattata aatatgttaa ggataaagga tgttcaatta gaggctgcat
     8281 ataccacgac atcgtgtatc acgcattttg gtaaaggcta ctgctttcac atcatcgaga
     8341 tcaatcagaa gagcctgaat accttacagc cgatgctctt taagactagc atccctaaat
     8401 tatgcaaggc cgagtcttaa atttaactga ctagcaggct tgtcggcctt gctgacacta
     8461 gagtcatctc cgaacatcca caatatctct cagtctctta cgtctctcac agtattaaga
     8521 aaaacccagg gtgaatggga agcttgccat aggtcatgga tgggcaggag tcctcccaaa
     8581 acccttctga catactctat ccagaatgcc acctgaactc tcccatagtc agggggaaga
     8641 tagcacagtt gcacgtcttg ttagatgtga accagcccta cagactgaag gacgacagca
     8701 taataaatat tacaaagcac aaaattagga acggaggatt gtcccctcgt caaattaaga
     8761 tcaggtctct gggtaaggct cttcaacgca caataaagga tttagaccga tacacgtttg
     8821 aaccgtaccc aacctactct cacgaattac ttaggcttga tataccagag atatgtgaca
     8881 aaatccgatc cgtcttcgcg gtctcggatc ggctgaccag ggagttatct agtgggttcc
     8941 aggatctttg gttgaatatc ttcaagcaac taggcaatat agaaggaaga gaggggtacg
     9001 atccgttgca ggatatcggc accatcccgg agataactga taagtacagc aggaatagat
     9061 ggtataggcc attcctaact tggttcagca tcaaatatga catgcggtgg atgcagaaga
     9121 ccagaccggg gggacccctt gatacctcta attcacataa cctcctagaa tgcaaatcat
     9181 acactctagt aacatacgga gatcttgtca tgatactgaa caagttgaca ttgacagggt
     9241 atatcctaac ccctgagctg gtcttgatgt attgtgatgt tgtagaagga aggtggaata
     9301 tgtctgctgc agggcatcta gataagaagt ccattgggat aacaagcaaa ggtgaggaat
     9361 tatgggaact agtggattcc ctcttctcaa gtcttggaga ggaaatatac aatgtcatcg
     9421 cactattgga gcccctatca cttgctctca tacaactaaa tgatcctgtt atacctctac
     9481 gtggggcatt tatgaggcat gtgttgacag agctacagac tgttttaaca agtagagacg
     9541 tgtacacaga tgctgaagca gacactattg tggagtcgtt actcgccatt ttccatggaa
     9601 cctctattga tgagaaagca gagatctttt ccttctttag gacatttggc caccccagct
     9661 tagaggctgt cactgccgcc gacaaggtaa gggcccatat gtatgcacaa aaggcaataa
     9721 agcttaagac cctatacgag tgtcatgcag ttttttgcac tatcatcata aatgggtata
     9781 gagagaggca tggcggacag tggcccccct gtgacttccc tgatcacgtg tgtctagaac
     9841 taaggaacgc tcaagggtcc aatacggcaa tctcttatga atgtgctgta gacaactata
     9901 caagtttcat aggcttcaag tttcggaagt ttatagaacc acaactagat gaagatctca
     9961 caatatatat gaaagacaaa gcactatccc ccaggaagga ggcatgggac tctgtatacc
    10021 cggatagtaa tctgtactat aaagccccag agtctgaaga gacccggcgg cttattgaag
    10081 tgttcataaa tgatgagaat ttcaacccag aagaaattat caattatgtg gagtcaggag
    10141 attggttgaa agacgaggag ttcaacatct cgtacagtct caaagagaaa gagatcaagc
    10201 aagagggtcg tctattcgca aaaatgactt ataagatgcg agccgtacag gtgctggcag
    10261 agacactact ggctaaagga ataggagagc tattcaggga aaatgggatg gttaagggag
    10321 agatagacct acttaaaaga ttgactactc tttctgtctc aggcgtcccc aggactgatt
    10381 cagtgtacaa taactctaaa tcatcagaga agagaaacga aggcatggga aataagaact
    10441 ctggggggta ctgggacgaa aagaagaggt ccagacatga attcaaggca acagattcat
    10501 caacagacgg ctatgaaaca ttaagttgct tcctcacaac agacctcaag aaatactgct
    10561 taaactggag atttgagagt actgcattgt ttggtcagag atgcaacgag atatttggct
    10621 tcaagacctt ctttaactgg atgcatccag tccttgaaag gtgtacaata tatgttggag
    10681 atccttactg tccagtcgcc gaccggatgc atcgacaact ccaggatcat gcagactctg
    10741 gcattttcat acataatcct agggggggca tagaaggtta ctgccagaag ctgtggacct
    10801 taatctcaat gagtgcaatc cacctagcag ctgtgagagt gggtgtcagg gtctctgcaa
    10861 tggttcaggg tgacaatcaa gctatagccg tgacatcaag agtacctgta gctcagactt
    10921 acaagcagaa gaaaaatcat gtctatgagg agatcaccaa atatttcggt gctctaagac
    10981 acgtcatgtt tgatgtaggg cacgagctaa aattgaacga gaccatcatt agtagcaaga
    11041 tgtttgtcta tagtaaaagg atatactatg atgggaagat tttaccacag tgcctgaaag
    11101 ccttgaccaa gtgtgtattc tggtccgaga cactggtaga tgaaaacaga tctgcttgtt
    11161 cgaacatctc aacatccata gcaaaagcta tcgaaaatgg gtattctcct atactaggct
    11221 actgcattgc gttgtataag acctgtcagc aggtgtgcat atcactaggg atgactataa
    11281 atccaactat cagcccgacc gtaagagatc aatactttaa gggtaagaat tggctgagat
    11341 gtgcagtgtt gattccagca aatgttggag gattcaacta catgtctaca tctagatgct
    11401 ttgttagaaa tattggagac cccgcagtag cagccctagc tgatctcaaa agattcatca
    11461 gagcggatct gttagacaag caggtattat acagggtcat gaatcaagaa cccggtgact
    11521 ctagttttct agattgggct tcagaccctt attcgtgtaa cctcccgcat tctcagagta
    11581 taactacgat tataaagaat atcactgcta gatctgtgct gcaggaatcc ccgaatcctc
    11641 tactgtctgg tctcttcacc gagactagtg gagaagagga tctcaacctg gcctcgttcc
    11701 ttatggaccg gaaagtcatc ctgccgagag tggctcatga gatcctgggt aattccttaa
    11761 ctggagttag ggaggcgatt gcagggatgc ttgatacgac caagtctcta gtgagagcca
    11821 gcgttaggaa aggaggatta tcatatggga tattgaggag gcttgtcaat tatgatctat
    11881 tgcagtacga gacactgact agaactctca ggaaaccggt gaaagacaac atcgaatatg
    11941 agtatatgtg ttcagttgag ctagctgtcg gtctaaggca gaaaatgtgg atccacctga
    12001 cttacgggag acccatacat gggctagaaa caccagaccc tttagagctc ttgaggggaa
    12061 tatttatcga aggttcagag gtgtgcaagc tttgcaggtc tgaaggagca gaccccatct
    12121 atacatggtt ctatcttcct gacaatatag acctggacac gcttacaaac ggaagtccgg
    12181 ctataagaat cccctatttt ggatcagcca ctgatgaaag gtcggaagcc caactcgggt
    12241 atgtaagaaa tctaagcaaa cccgcaaagg cggccatccg gatagctatg gtgtatacgt
    12301 gggcctacgg gactgatgag atatcgtgga tggaagccgc tcttatagcc caaacaagag
    12361 ctaatctgag cttagagaat ctaaagctgc tgactcctgt ttcaacctcc actaatctat
    12421 ctcataggtt gaaagatacg gcaacccaga tgaagttctc tagtgcaaca ctagtccgtg
    12481 caagtcggtt cataacaata tcaaatgata acatggcact caaagaagca ggggagtcga
    12541 aggatactaa tctcgtgtat cagcagatta tgctaactgg gctaagcttg ttcgagttca
    12601 atatgagata taagaaaggt tccttaggga agccactgat attgcactta catcttaata
    12661 acgggtgctg tataatggag tccccacagg aggcgaatat ccccccaagg tccacattag
    12721 atttagagat tacacaagag aacaataaat tgatctatga tcctgatcca ctcaaggatg
    12781 tggaccttga gctatttagc aaggtcagag atgttgtaca cacagttgac atgacttatt
    12841 ggtcagatga tgaagttatc agagcaacca gtatctgtac tgcaatgacg atagctgata
    12901 caatgtctca attagataga gacaacttaa aagagatgat cgcactagta aatgacgatg
    12961 atgtcaacag cttgattact gagtttatgg tgattgatgt tcctttattt tgctcaacgt
    13021 tcgggggtat tctagtcaat cagtttgcat actcactcta cggcttaaac atcagaggaa
    13081 gggaagaaat atggggacat gtagtccgga ttcttaaaga tacctcccac gcagttttaa
    13141 aagtcttatc taatgctcta tctcatccca aaatcttcaa acgattctgg aatgcaggtg
    13201 tcgtggaacc tgtgtatggg cctaacctct caaatcagga taagatactc ttggccctct
    13261 ctgtctgtga atattctgtg gatctattca tgcacgattg gcaagggggt gtaccgcttg
    13321 agatctttat ctgtgacaat gacccagatg tggccgacat gaggaggtcc tctttcttgg
    13381 caagacatct tgcataccta tgcagcttgg cagagatatc tagggatggg ccaagattag
    13441 aatcaatgaa ctctctagag aggctcgagt cactaaagag ttacctggaa ctcacatttc
    13501 ttgatgaccc ggtactgagg tacagtcagt tgactggcct agtcatcaaa gtattcccat
    13561 ctactttgac ctatatccgg aagtcatcta taaaagtgtt aaggacaaga ggtataggag
    13621 tccctgaagt cttagaagat tgggatcccg aggcagataa tgcactgtta gatggtatcg
    13681 cggcagaaat acaacagaat attcctttgg gacatcagac tagagcccct ttttgggggt
    13741 tgagagtatc caagtcacag gtactgcgtc tccgggggta caaggagatc acaagaggtg
    13801 agataggcag atcaggtgtt ggtctgacgt taccattcga tggaagatat ctatctcacc
    13861 agctgaggct ctttggcatc aacagtacta gctgcttgaa agcacttgaa cttacctacc
    13921 tattgagccc cttagttgac aaggataaag ataggctata tttaggggaa ggagctgggg
    13981 ccatgctttc ctgttatgac gctactcttg gcccatgcat caactattat aactcagggg
    14041 tatactcttg tgatgtcaat gggcagagag agttaaatat atatcctgct gaggtggcac
    14101 tagtgggaaa gaaattaaac aatgttacta gtctgggtca aagagttaaa gtgttattca
    14161 acgggaatcc tggctcgaca tggattggga atgatgagtg tgaggctttg atttggaatg
    14221 aattacagaa tagctcgata ggcctagtcc actgtgacat ggagggagga gatcataagg
    14281 atgatcaagt tgtactgcat gagcattaca gtgtaatccg gatcgcgtat ctggtggggg
    14341 atcgagacgt tgtgcttata agcaagattg ctcccaggct gggcacggat tggaccaggc
    14401 agctcagcct atatctgaga tactgggacg aggttaacct aatagtgctt aaaacatcta
    14461 accctgcttc cacagagatg tatctcctat cgaggcaccc caaatctgac attatagagg
    14521 acagcaagac agtgttagct agtctcctcc ctttgtcaaa agaagatagc atcaagatag
    14581 aaaagtggat cttaatagag aaggcaaagg ctcacgaatg ggttactcgg gaattgagag
    14641 aaggaagctc ttcatcaggg atgcttagac cttaccatca agcactgcag acgtttggct
    14701 ttgaaccaaa cttgtataaa ttgagcagag atttcttgtc caccatgaac atagctgata
    14761 cacacaactg catgatagct ttcaacaggg ttttgaagga tacaatcttc gaatgggcta
    14821 gaataactga gtcagataaa aggcttaaac taactggtaa gtatgacctg tatcctgtga
    14881 gagattcagg caagttgaag acaatttcta gaagacttgt gctatcttgg atatctttat
    14941 ctatgtccac aagattggta actgggtcat tccctgacca gaagtttgaa gcaagacttc
    15001 aattgggaat agtttcatta tcatcccgtg aaatcaggaa cctgagggtt atcacaaaaa
    15061 ctttattaga caggtttgag gatattatac atagtataac gtatagattc ctcaccaaag
    15121 aaataaagat tttgatgaag attttagggg cagtcaagat gttcggggcc aggcaaaatg
    15181 aatacacgac cgtgattgat gatggatcac tgggtgatat cgagccatat gacagctcgt
    15241 aataattagt ccctatcgtg ccgaacgatc gaagctccgc ggtacctgga agtcttggac
    15301 ttgtccatat gacaatagta agaaaaactt acaagaagac aagaaaattt aaaaggatac
    15361 atatctctta aactcttgtc tggt
//
LOCUS       HUMKRASPO     111 bp ds-DNA             PRI       18-JUL-1990
DEFINITION  Human K-ras proto-oncogene, exon 1.
ACCESSION   M34904
KEYWORDS    K-ras proto-oncogene.
SOURCE      Human normal fetal liver DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Santos,E., Martin-Zanca,D., Reddy,E.P., Pierotti,M.A.,
            Della Porta,G. and Barbacid,M.
  TITLE     Malignant activation of a K-ras oncogene in lung carcinoma but not
            in normal tissue of the same patient
  JOURNAL   Science 223, 661-664 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  /   111     K-ras p21 protein, exon 1
BASE COUNT       35 a     16 c     30 g     30 t
ORIGIN      
        1 atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg
       61 atacagctaa ttcagaatca ttttgtggac gaatatgatc caacaataga g
//
LOCUS       HUMGLYE       498 bp ss-mRNA            PRI       18-JUL-1990
DEFINITION  Human glycophorin E mRNA, complete cds.
ACCESSION   M29610 J05183 M29620
KEYWORDS    glycophorin.
SOURCE      Human erythroleukemia proerythroblast cell line K562, cDNA to mRNA,
            clone pHGpE-7.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 498)
  AUTHORS   Kudo,S. and Fukuda,M.
  TITLE     Identification of a novel human glycophorin, glycophorin E, by
            isolation of genomic clones and complementary DNA clones utilizing
            polymerase chain
  JOURNAL   J. Biol. Chem. 265, 1102-1110 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Kudo 31-OCT-1989.
FEATURES       from  to/span     description
    pept         57      293     glycophorin E
    signal      470      481     polyA signal
BASE COUNT      128 a    107 c     94 g    169 t
ORIGIN      
        1 agttgtcttt ggtagttttt ttgcactaac ttcaggagcc agctcgtgat ctcaggatgt
       61 atggaaaaat aatctttgta ttactattgt cagaaattgt gagcatatca gcatcaagta
      121 ccactggtgt ggcaatgcac acttcaacct cttcttcagt cacaaagagt tacatctcat
      181 cacagacaaa tgggataaca ctcattaatt ggtgggcgat ggctcgtgtt atttttgagg
      241 tgatgcttgt tgttgttgga atgatcatct taatttctta ctgtattcga tgactgataa
      301 aggcatgagg atgtggcctg catgctgcct gatcttgcct agaaccagct gcacctgctg
      361 ttctcttgtt atgcaaactg gctgcacctg ctattccttt gcttatgccc caacccttgg
      421 ctatcctaac tccctgttct cctgcctatt actgtattct ctacttctaa ataaaaataa
      481 aacaaaatac aaattatt
//
LOCUS       BAMSYNAMY      85 bp ds-DNA             SYN       18-JUL-1990
DEFINITION  Synthetic B.amyloliquefaciens alpha-amylase gene promoter region.
ACCESSION   M34931
KEYWORDS    .
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 85)
  AUTHORS   Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
  TITLE     Two tandemly located promoters, artificially constructed, are
            active in a Bacillus subtilis alpha-amylase secretion vector
  JOURNAL   J. Biochem. 99, 1181-1190 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         72  >    85     synthetic alpha-amylase
BASE COUNT       34 a      8 c     19 g     24 t
ORIGIN      
        1 actcggggtt gttattattt tatcgatatg taaaatataa tttctagaag aaaagaaggt
       61 ggagaggaaa catgatccaa aaacg
//
LOCUS       PMP5END       139 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Papaya mosaic virus, 5' end.
ACCESSION   M35054
KEYWORDS    .
SOURCE      Papaya mosaic virus RNA.
  ORGANISM  Papaya mosaic potexvirus
            Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; 
            Potexvirus.
REFERENCE   1  (bases 1 to 139)
  AUTHORS   Lok,S. and Abouhaidar,M.G.
  TITLE     The nucleotide sequence of the 5' end of papaya mosaic virus RNA:
            Site of in vitro assembly initiation
  JOURNAL   Virology 153, 289-296 (1986)
  STANDARD  simple staff_entry
BASE COUNT       58 a     23 c     30 g     27 t      1 others
ORIGIN      
        1 ggaaaagaaa cacaaagcaa agcaaagcaa agcaactcaa ataaancata tttggtcaag
       61 gcacttggta atcaaacggg cacaacccta gattaatgat atggcaaatt tgaggagtgt
      121 tttcgaacag ttgaacgag
//
LOCUS       MZEEMA        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Z.mexicana 180-bp knob-specific repeat region.
ACCESSION   M32533
KEYWORDS    .
SOURCE      Z.mexicana (Teosinte) 6-day-old seedling DNA, clone pEm1.
  ORGANISM  Zea mexicana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
COMMENT     Zea mexicana is also known as Zea mays subspecies mexicana.
BASE COUNT       58 a     38 c     39 g     45 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccattttc aataatatcg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat
//
LOCUS       MZEEMB        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Z.mexicana 180-bp knob-specific repeat region.
ACCESSION   M32534
KEYWORDS    .
SOURCE      Z.mexicana (Teosinte) 6-day-old seedling DNA, clone pEm2.
  ORGANISM  Zea mexicana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
COMMENT     Zea mexicana is also known as Zea mays subspecies mexicana.
BASE COUNT       60 a     40 c     38 g     42 t
ORIGIN      
        1 ggccacacaa accccatttt tgtcgaaaat agccatgaac gaccattttc aataatatcg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctacgac aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat
//
LOCUS       MZEZMA        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32521
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm4-5.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       58 a     40 c     39 g     43 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat
//
LOCUS       MZEZMB        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32522
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm4-21.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       58 a     40 c     39 g     43 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat
//
LOCUS       MZEZMC        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32523
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm4-10.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       58 a     41 c     38 g     43 t
ORIGIN      
        1 ggccacacaa cccccatttt tttcgaaaat agccatgaac gaccattttc aataataccg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gacgatcaat
//
LOCUS       MZEZMD        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32524
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm4-25.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       58 a     39 c     40 g     43 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcgtctggtc gatgatcaat
//
LOCUS       MZEZME        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32525
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm2-3(I).
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       59 a     40 c     38 g     43 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctagtc gatgatcaat
//
LOCUS       MZEZMF        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32526
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm4-7.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       59 a     40 c     40 g     41 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aaaaataccg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctgtggtc gacgatcaat
//
LOCUS       MZEZMG        179 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32527
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm4-26.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 179)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       59 a     38 c     38 g     44 t
ORIGIN      
        1 ggccacaaac ccccattttt gtcgaaaata gccatgaacg accattttca ataataccga
       61 aggctaacac ctatggattt ttgaccaaga aatggtctcc accagaaatc caagaatgtg
      121 atctatggca aggaaacata tgtggggtga ggtgtatgag cctctagtcg atgatcaat
//
LOCUS       MZEZMH        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32528
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm4-14.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       61 a     39 c     38 g     42 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccatttac aataataccg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaaaaatgt
      121 gatctatgga aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat
//
LOCUS       MZEZMI        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32529
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm4-13.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       61 a     40 c     36 g     43 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat aaccatgaac gaccattttc aataatatcg
       61 aaggctaaca cctatggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 aatctacgac aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat
//
LOCUS       MZEZMJ        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32530
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm4-1.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       59 a     35 c     38 g     48 t
ORIGIN      
        1 ggccacacaa cccgcatttt ttttgaaaat agccatgaat gaccattttc aataatatta
       61 gaggctaaca cctacgagtt tttaaccaag aaatggtctc caccagaaat tcaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gacgatcaat
//
LOCUS       MZEZMK        179 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32531
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm2-1(I).
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 179)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       62 a     39 c     37 g     41 t
ORIGIN      
        1 ggccacacaa cccccatttt ttcaaaaata gccatgaacg accattttca aaaatacgaa
       61 aggctaacac gtacggattt ttgaccatga aatggtctcc accagaaatc cgaaaatgag
      121 atctacgaca aggaaacata tgtggggtga ggtgtatgag cctctggtcg atgatcaat
//
LOCUS       MZEZML        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Maize 180-bp knob-specific repeat region.
ACCESSION   M32532
KEYWORDS    .
SOURCE      Maize 6-day-old seedling DNA, clone pZm2-1(II).
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       59 a     34 c     40 g     47 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccattttc aaaaatacga
       61 aaggctaaca cgtacggatt tttgaccatg aaatggtttt caccagaaat tcaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat
//
LOCUS       TRD2AA        177 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  T.dactyloides 180-bp knob-specific repeat region.
ACCESSION   M32535
KEYWORDS    .
SOURCE      T.dactyloides 6-day-old seedling DNA, clone 2.
  ORGANISM  Tripsacum dactyloides
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 177)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       60 a     38 c     40 g     39 t
ORIGIN      
        1 ggccacacaa cccccattta tgtcggaaat agcgatgaac gaccattttc aataatacga
       61 aggtaacacc tacgaaattt tgaccaagat aggtctcaac cagaaatcca agaatgtgac
      121 ctatggcaag gaaacatatg tggggtgagg tgtacgagcc tctggtcgat gatcaat
//
LOCUS       TRD2AB        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  T.dactyloides 180-bp knob-specific repeat region.
ACCESSION   M32536
KEYWORDS    .
SOURCE      T.dactyloides 6-day-old seedling DNA, clone 4.
  ORGANISM  Tripsacum dactyloides
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       59 a     40 c     40 g     41 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agcgatgaac gaccattttc aataatacca
       61 aaggctaaca cctatggatt ttggaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gacctatggc aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat
//
LOCUS       TRD2AC        180 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  T.dactyloides 180-bp knob-specific repeat region.
ACCESSION   M32537
KEYWORDS    .
SOURCE      T.dactyloides 6-day-old seedling DNA, clone 7.
  ORGANISM  Tripsacum dactyloides
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       60 a     41 c     37 g     42 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtttaaaat agccaagaac gaccattttc aataataccg
       61 aatgctaaca cctatggatt attgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gacctatggc aaggaaacat atgtggggtg aggtgtacga gcctccggtc gatgatcaat
//
LOCUS       TRD2AD        179 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  T.dactyloides 180-bp knob-specific repeat region.
ACCESSION   M32538
KEYWORDS    .
SOURCE      T.dactyloides 6-day-old seedling DNA, clone 8.
  ORGANISM  Tripsacum dactyloides
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 179)
  AUTHORS   Dennis,E.S. and Peacock,W.J.
  TITLE     Knob heterochromatin homology in maize and its relatives
  JOURNAL   J. Mol. Evol. 20, 341-350 (1984)
  STANDARD  simple staff_review
BASE COUNT       58 a     39 c     38 g     44 t
ORIGIN      
        1 ggccacacaa cccccatttt ttttgaaaat agccatgaac gaccattttc aataataccg
       61 aaggtaacac ctacggattt ttgaccaaga aatggtctcc accagaaatc caagaatgtg
      121 acctatggca aggaaacata tgtggggtga ggtgtatgag cctctggtcg atgatcaat
//
LOCUS       VSVNNJ       1329 bp ss-mRNA            VRL       18-JUL-1990
DEFINITION  Vesicular stomatitis virus (New Jersey) nucleocapsid protein mRNA,
            complete cds.
ACCESSION   K02379 M35062
KEYWORDS    nucleocapsid protein; protein N.
SOURCE      VSV (New Jersey serotype; Ogden strain), cDNA to mRNA, clones 53
            and 77.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Banerjee,A.K., Rhodes,D.P. and Gill,D.S.
  TITLE     Complete nucleotide sequence of the mRNA coding for the N protein
            of vesicular stomatitis virus (New Jersey serotype)
  JOURNAL   Virology 137, 432-438 (1984)
  STANDARD  full staff_review
REFERENCE   2  (bases 1234 to 1329)
  AUTHORS   Rhodes,D.P. and Banerjee,A.K.
  TITLE     Poly(A)-adjacent sequence of the 14.5 S mRNA of vesicular
            stomatitis virus (New Jersey serotype)
  JOURNAL   Virology 105, 297-300 (1980)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         14     1282     nucleocapsid protein
    mRNA          1     1329     N mRNA
    conflict   1264     1264     a in [1]; g in [2]
BASE COUNT      443 a    271 c    281 g    334 t
ORIGIN      182 bp upstream of BstNI site.
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcca taattcagcc
       61 taaactaccg gccaatgagg atccggtaga gtatccagct gattacttca aaaataacac
      121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttaaggg cctatgtcta
      181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc
      241 gttaaaggga gtagaaggaa cattagacag agactgggtt agctttggaa gaacgatcgg
      301 aaagagggaa gagaatgtca aaatcttcga tctagtcaaa gttgaggaac tgaagacagc
      361 tcttcctgat gggaaatcgg atccagatcg atcagctgaa gatgataaat ggcttcctat
      421 ttacatccta ggcctatatc gagtgggtag atccaaagtt acagattaca gaaagaaact
      481 cctggacggt ctagaaaatc agtgtagagt ggcatcaacc agatttgaaa gccttgttga
      541 agatggtctt gacttctttg acatctggga aaatgatccc aacttcacca aaatagttgc
      601 tgcagtggat atgttttttc acatgttcaa gaagcatgaa cgtgctccaa tcagatatgg
      661 aaccatagtc tcacgattca aagactgtgc agcacttgct acatttggac atctcagcaa
      721 ggttagcgga ctttcaattg aagacctaac aacatgggtt ctgaatcgag aagtcgcaga
      781 cgaactatgc caaatgatgt acccaggcca agaaattgac aaggcagatt catacatgcc
      841 atatatgatt gatttcgggt tatcccaaaa atctccttat tcatcagtga agaatccagc
      901 tttccacttc tggggtcaac tggccgcact cttgctaaga tcaactcgag caaaaaatgc
      961 tcggcaaccc gatgacattg agtacacttc gctgacttgt gcaagtttac tactgtcatt
     1021 tgcagttgga tcttcagcag acattgaaca acaattctac atcggagagg acaaatacac
     1081 aacagagaaa gatgacagtc tcaagaaatc tgatgtccca ccgaaaggaa gaaatgttgt
     1141 agactggctc ggttggtatg atgacaatgg agggaaaccc acaccggata tgctcaactt
     1201 cgcacgaaga gcagtcagtt cgttgcagtc actccgcgag aaaacaattg gcaaatacgc
     1261 aaaagtagaa tttgacaaat gaccaggttt gacatatcaa tatcaatatt atattctgtg
     1321 ctatatatg
//
LOCUS       TNCPOLY      3660 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Tobacco necrosis virus RNA-dependent RNA polymerase and coat
            protein genes, complete cds.
ACCESSION   M33002
KEYWORDS    RNA-dependent RNA polymerase; coat protein; polymerase.
SOURCE      Tobacco necrosis virus (strain A, isolate TNV-A-FM1B) Wk6 cell
            line, cDNA to viral RNA, clones pFM2[0,1,2,3].
  ORGANISM  Tobacco necrosis virus
            Viridae; ss-RNA nonenveloped viruses.
REFERENCE   1  (bases 1 to 3660)
  AUTHORS   Meulewaeter,F., Seurinck,J. and van Emmelo,J.
  TITLE     Genome structure of tobacco necrosis virus strain A
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Meulewaeter, 20-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         60      665     RNA-dependent RNA polymerase (put.)
                                 /transl_except=(666..668)
                669     2234     RNA-dependent RNA polymerase
    pept       2218     2436     ORF 3
    pept       2440     2610     ORF 4
    pept       2613     3443     coat protein
    pept       3467     3646     ORF 6
    site        666      668     readthrough codon
BASE COUNT     1034 a    890 c    895 g    841 t
ORIGIN      
        1 agtattcata ccaagaatac caaataggtg caaggcctta ctcagctaaa gagtctaaaa
       61 tggagctacc aaaccaacac aagcaaacgg ccgccgaggg tttcgtatct ttcctaaact
      121 ggctatgcaa cccatggaga cgacagcgaa cagtcaacgc tgcagttgcg ttccaaaaag
      181 atcttctcgc cattgaggat tccgagcatt tggatgacat caatgagtgt ttcgaggagt
      241 ctgctggggc acaatctcag cgaactaagg ttgtcgccga cggagcatat gcccccgcaa
      301 aatccaacag gacccgccga gttcgtaagc agaagaagca caagtttgta aaatatcttg
      361 tcaacgaagc tcgtgccgag tttggattgc ccaaaccaac tgaggcaaac agacttatgg
      421 tccaacattt cttgctcaga gtgtgcaagg attggggcgt tgttactgcc cacgtacacg
      481 gcaatgttgc actagctttg ccactggtgt tcatcccaac ggaagatgat ctgctatcac
      541 gagcattgat gaacacacat gctactagag ccgctgtacg aggcatggac aatgtccaag
      601 gggaggggtg gtggaacaat aggttgggga ttgggggcca ggtcggactg gccttccggt
      661 ccaaataggg gtgccttgaa aggaggccag gattctccac gtccgtttcg cgtggggaac
      721 atcctgatct ggtggtcata ccatcagggc gccctgagaa acagcgtcag ttgttacgct
      781 atagtggtat aggcggccat ttattaatcg gcatccacaa caactctctt tccaacctgc
      841 gtaggggctt gatggaaaga gtattctatg tcgaggggcc caatgggctt caagacgccc
      901 ctaagcccgt caagggagct tttcgaaccc ttgataagtt tcgtgatctc tatactaaaa
      961 atagttggcg tcatacccct gtaactagtg aacaattcct aatgaattac acgggcagga
     1021 aactgactat ttacagagag gcggttgata gtttgtcgca tcaacccctt agctcacgag
     1081 atgcgaaact aaagacattc gtgaaggccg aaaaattaaa tctttctaag aagcctgacc
     1141 ctgctcccag ggtcatccaa cctagatcgc ctcggtataa cgtttgtttg ggcaggtacc
     1201 tccgacatta tgagcatcac gcgtttaaaa ccattgccaa gtgctttggg gaaatcacgg
     1261 tcttcaaagg gtttactctg gagcaacaag gggaaatcat gcgctcgaag tggaataaat
     1321 atgttaatcc cgtcgcagtc ggactcgacg ccagtcgttt cgaccaacac gtgtctgttg
     1381 aagcactcga gtatgagcat gaattttacc tcagagacta cccaaatgat aaacagctaa
     1441 aatggctgct aaagcagcaa ttgtgcaacg taggaacggc attcgccagt gacggcatta
     1501 taaaatacaa gaagaagggt tgtagaatga gcggagacat gaacacgagc ttgggcaact
     1561 gcattctaat gtgcgccatg gtctacgggt tgaaagaaca cttaaacatc aatttgtccc
     1621 ttgcaaataa tggggatgac tgcgtcattg tctgtgagaa agcggattta aagaaattga
     1681 caagcagcat cgagccatat ttcaagcagt ttggattcaa gatggaagtg gaaaaacccg
     1741 tggatatatt tgagcgcata gaattttgcc aaacccaacc tgtgttcgat ggatcccagt
     1801 acatcatggt acgcaaacct tctgtggtaa catctaaaga cgtcactagc cttatcccat
     1861 gtcaaacgaa agcacaatac gcagaatggc tgcaagctgt aggtgagtgt ggcatgagca
     1921 ttaacggtgg gattcctgtc atgcagaatt tctaccaaaa gctccaaact ggcatccgcc
     1981 gcacaaaatt caccaagacc ggcgagttcc agacgaacgg attggggtat cactctagat
     2041 atatgcatag agtggcccgg gttccttcgc ctgaaacccg tttatccttc tatctagctt
     2101 tcggtatcac accagacctc caagaagcat tggagatctt ctatgatacc cacaagcttg
     2161 agttggatga tgttatccca actgatacct accaagtgtc aggagagcat ttgatcaatg
     2221 gattaccaaa ctgatgtaac ggaggacaat gtgcaaatac gcggtcgggc taggagcgtt
     2281 gagggtaaga aacacaatgg ttcgggatta actggcgtta agcgtcacgc ggtgagcgaa
     2341 acatctcaga aatcacagca aggtactggc aatggaacta tgaccaatat agccgaagaa
     2401 cagaccatta ccgtgacata caactttaac ttttaagtta tggctgcgtg tcgctgttgt
     2461 gatacttcac caggtattac actattccct tactttgcaa ttctcatcct tatattggca
     2521 atacttgttg tagggactcc caatcaacaa tatcaccatt ctccaagcac ttacgagtac
     2581 aagactcaac acatttcgat cgcaaaatag acatggcagg aaagaagaac aacaacaacg
     2641 gtcagtatat aatactgcgt actccagagc aacaggtgga gatagaccag cgcaacgccc
     2701 gtcgtgctca aatgggtcgc atgaagaagg ctagacagcc cgttcagcga tacttacagc
     2761 aacacgggtt gcgaaacgga ttgtccggta gagggggcta catagtggct cccacctccg
     2821 ggggggttgt cactcgaccc atagtgccga aattctccaa caggggagat tccactatag
     2881 tccgtaacac tgagattttg aacaaccaaa tcttagcggc gctaggcgca ttcaatacaa
     2941 caaactccgc actgattgca gcagcaccat catggctggc tagcatcgct gatctttaca
     3001 gtaaatacag atggctctca tgtgagatca tctacattcc aaaatgcccc accaccacca
     3061 gtggatcaat tgccatggct ttcacatacg acagaaatga cgctgcaccc accgcaaggg
     3121 ctcagctgtc acaatcttac aaggccatca attttccacc gtatgcggga tacgacggag
     3181 cagcatattt gaattcgaac cagggagctg ggtcagccat cgccgttcaa cttgatgtta
     3241 ccaagttgga caagccatgg taccccacta tctcctctgc cggcttcggg gcgctcagcg
     3301 tcctcgatca gaaccaattc tgccccgcgt cccttgtggt cgctagcgat gggggacccg
     3361 ctactgctac tccagcaggg gaccttttca tcaagtacgt gattgagttc attgaaccaa
     3421 tcaacccaac aatgaacgtc tagttctttg tactgtaact tggctaatgc ctaaggtgga
     3481 gtcacaccat tggagacgga gacggatcct gggaaacagg cttgacgggc ggggggtggt
     3541 gcccccgacg acgcatcact ccggatacca atggtacacc actatggcag ggtctgccaa
     3601 ggtcttgtgc accaagaacc cctggaaacg ggggggaggg gggtagcaca tatcatccag
//
LOCUS       BLYPROZ        75 bp ss-mRNA            PLN       18-JUL-1990
DEFINITION  Barley protein Z mRNA, partial cds.
ACCESSION   M35065
KEYWORDS    albumin; protein Z.
SOURCE      Barley, cDNA to mRNA, clone paz1-1.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Rasmussen,S.K., Hopp,H.E., Brandt,A., Svendsen,I. and Hejgaard,J.
  TITLE     A cDNA clone for protein Z, a major barley endosperm albumin
  JOURNAL   Carlsberg Res. Commun. 49, 385-390 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >    75     protein Z (AA at 1)
BASE COUNT       12 a     19 c     19 g     25 t
ORIGIN      
        1 cccctgaagg tggatttggt ggattttgtc gcgaatcacc ctttcctctt ccttattcgg
       61 gaagacattg ccggt
//
LOCUS       BSURPOF      3078 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB),
            complete cds.
ACCESSION   M34995
KEYWORDS    RNA polymerase; minor sigma-37 factor.
SOURCE      B.subtilis (strains 168 and Marburg), clones
            lambda-[21,gt11,1,11,21,31].
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 368 to 2297)
  AUTHORS   Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W.
  TITLE     Gene encoding the 37,000-dalton minor sigma factor of Bacillus
            subtilis RNA polymerase: Isolation, nucleotide sequence,
            chromosomal locus, and cryptic function
  JOURNAL   J. Bacteriol. 169, 771-778 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3078)
  AUTHORS   Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W.
  TITLE     Similar organization of the sigB and spoIIA operons encoding
            alternate sigma factors of Bacillus subtilis RNA polymerase
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.W.Price, 18-FEB-1987.  Draft entry and computer-readable
            sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990.
FEATURES       from  to/span     description
    pept        200      529     ORF V
    pept        526     1008     ORF W
    pept        968     1762     37 kd minor sigma factor (rpoF, sigB; ttg
                                 start codon)
    pept       1762     2361     ORF X
    mRNA        131     2368     sigB operon mRNA
    signal      131      168     sigB operon promoter
    signal     2368     2401     sigB transcription termination signal
    binding     188      194     ORF V ribosomal binding site
    binding     511      519     ORF W ribosomal binding site
    binding     954      961     sigB ribosomal binding site
    binding    1750     1753     ORF X ribosomal binding site
BASE COUNT      999 a    575 c    755 g    749 t
ORIGIN      40 degrees on B.subtilis map.
        1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt
       61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt
      121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg
      181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat
      241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct
      301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat
      361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg
      421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt
      481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct
      541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg
      601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc
      661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg
      721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa
      781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac
      841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat
      901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg
      961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga
     1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg
     1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca
     1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc
     1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg
     1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc
     1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga
     1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg
     1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac
     1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca
     1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct
     1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat
     1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga
     1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta
     1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct
     1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac
     1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc
     1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg
     2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt
     2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc
     2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata
     2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta
     2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat
     2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt
     2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg
     2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac
     2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga
     2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac
     2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca
     2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag
     2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa
     2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt
     2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt
     2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc
     3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg
     3061 caaactgctg cgaagctt
//
LOCUS       HUMERPR      1624 bp ss-mRNA            PRI       18-JUL-1990
DEFINITION  Human erythropoietin receptor mRNA, complete cds.
ACCESSION   M34986
KEYWORDS    erythropoietin receptor.
SOURCE      Human cell line OCI-M1, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1624)
  AUTHORS   Ehrenman,K.St. and St John,T.
  TITLE     Sequence of the human erythropoietin gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.St.John, 07-JUN-1990.
            
              Author address: T.St.John
              Hutchinson Cancer Res Center
              1124 Columbia St.
              Seattle, WA
              Email: TSTJOHN@FHCRCVAX
            
FEATURES       from  to/span     description
    pept          1     1527     erythropoietin receptor precursor
    sigp          1       72     erythropoietin receptor signal peptide
    matp         73     1524     erythropoietin receptor
BASE COUNT      276 a    537 c    498 g    313 t
ORIGIN      
        1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct
       61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc
      121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg
      181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc
      241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt
      301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta
      361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat
      421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc
      481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac
      541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag
      601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc
      661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg
      721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg
      781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag
      841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac
      901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc
      961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg
     1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc
     1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg
     1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc
     1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct
     1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca
     1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct
     1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta
     1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg
     1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata
     1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc
     1621 agga
//
LOCUS       HUMGFP40H    4379 bp ds-DNA             PRI       18-JUL-1990
DEFINITION  Human P40 T-cell and mast cell growth factor (hP40) gene, complete
            cds.
ACCESSION   M30135
KEYWORDS    T-cell growth factor; cytokine; mast cell growth factor;
            megakeryoblastic leukemia cell growth factor.
SOURCE      Human (cell line CESS) DNA, clone lambda-H40.3a1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4379)
  AUTHORS   Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
            Van Snick,J.
  TITLE     Human P40/Il9: Expression in activated CD4+ cells, genomic
            organization, and comparison with the mouse gene
  JOURNAL   J. Immunol. 144, 4235-4241 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Van Snick, 12-DEC-1989.
FEATURES       from  to/span     description
    pept        796      909     P40 T-cell and mast cell growth factor
                                 precursor, exon 1
               1022     1057     P40 T-cell and mast cell growth factor
                                 precursor, exon 2
               1140     1172     P40 T-cell and mast cell growth factor
                                 precursor, exon 3
               2461     2592     P40 T-cell and mast cell growth factor
                                 precursor, exon 4
               4096     4215     P40 T-cell and mast cell growth factor
                                 precursor, exon 5
    sigp        796      849     P40 T-cell and mast cell growth factor signal
                                 peptide
    matp        850      909     P40 T-cell and mast cell growth factor
               1022     1057     P40 T-cell and mast cell growth factor
               1140     1172     P40 T-cell and mast cell growth factor
               2461     2592     P40 T-cell and mast cell growth factor
               4096     4212     P40 T-cell and mast cell growth factor
    IVS         910     1021     hP40 intron A
    IVS        1058     1139     hP40 intron B
    IVS        1173     2460     hP40 intron C
    IVS        2593     4095     hP40 intron D
    signal      742      747     TATA box
    signal     4354     4359     polyA signal
    rpt        2965     3265     Alu repeat
    binding     370      376     AP-2 binding site
    binding     624      629     IRF-1 binding site
    binding     627      633     AP-1 binding site
BASE COUNT     1208 a    921 c    904 g   1346 t
ORIGIN      
        1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt
       61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc
      121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca
      181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct
      241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact
      301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc
      361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag
      421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa
      481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt
      541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt
      601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa
      661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt
      721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc
      781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc
      841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac
      901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat
      961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta
     1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac
     1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg
     1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg
     1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat
     1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt
     1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt
     1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag
     1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa
     1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc
     1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg
     1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc
     1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc
     1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat
     1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac
     1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc
     1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg
     1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt
     2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc
     2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga
     2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc
     2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa
     2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa
     2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt
     2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag
     2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg
     2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac
     2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt
     2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc
     2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc
     2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct
     2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag
     2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa
     2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg
     3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg
     3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat
     3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca
     3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc
     3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt
     3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac
     3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg
     3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc
     3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt
     3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca
     3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca
     3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact
     3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga
     3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt
     3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc
     3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc
     3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg
     4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa
     4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac
     4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg
     4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc
     4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga
     4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa
//
LOCUS       MEFDNABP     1710 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  M.fervidus DNA binding protein HMfB gene, complete cds.
ACCESSION   M34778
KEYWORDS    DNA binding protein HMfB.
SOURCE      M.fervidus DNA.
  ORGANISM  Methanothermus fervidus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanothermaceae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
  TITLE     DNA binding protein HMf, isolated from the hypothermophilic
            archaeal Methanothermus fervidus is most closely related to
            histones
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1710)
  AUTHORS   Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Sandman, 01-JUN-1990.
            
               Author address: K.Sandman
               Department of Microbiology
               Ohio State University
               484 W. 12th Ave.
               Columbus, Ohio 43210
               E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU
FEATURES       from  to/span     description
    pept        578      787     DNA binding protein HMfB
    pept       1662      805 (c) ORF285
    signal      450      457     promoter
    signal      523      530     promoter
    signal      790      814     transcription termination signal
BASE COUNT      619 a    263 c    265 g    561 t      2 others
ORIGIN      1 bp upstream of HindIII site.
        1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa
       61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag
      121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag
      181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg
      241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat
      301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat
      361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa
      421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt
      481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt
      541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat
      601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct
      661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag
      721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa
      781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat
      841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc
      901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt
      961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat
     1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa
     1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg
     1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata
     1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc
     1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc
     1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc
     1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat
     1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc
     1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt
     1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt
     1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa
     1681 gttttgtttt acccattgct tctaaatatt
//
LOCUS       MUSINT1A     5607 bp ds-DNA             ROD       18-JUL-1990
DEFINITION  Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds.
ACCESSION   K02593 M34750
KEYWORDS    Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene.
SOURCE      Mouse (BALB/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1098 to 5606)
  AUTHORS   Van Ooyen,A. and Nusse,R.
  TITLE     Structure and nucleotide sequence of the putative mammary oncogene
            int-1; Proviral insertions leave the protein-encoding domain intact
  JOURNAL   Cell 39, 233-240 (1984)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 5607)
  AUTHORS   Nusse,R., Theunisseen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A.,
            Otte,A., Schuuring,E. and Van Ooyen,A.
  TITLE     The promoter of the Wnt-1 (int-1) oncogene and its mechanism of
            activation by insertion of proviral DNA of the mouse mammary tumor
            virus
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [2] kindly submitted
            by R.Nusse, 31-MAY-1990.
FEATURES       from  to/span     description
    pept       1779     1882     Wnt-1 protein (exon 1)
               2452     2705     Wnt-1 protein (exon 2)
               3279     3544     Wnt-1 protein (exon 3)
               4002     4490     Wnt-1 protein (exon 4)
    IVS        1883     2451     Wnt-1 cds intron A
    IVS        2706     3278     Wnt-1 cds intron B
    IVS        3545     4001     Wnt-1 cds intron C
    site       1133     1134     MMTV insertion site in tumor 17 [1]
    site       1394     1395     MMTV insertion site in tumor 35 [1]
    site       1631     1632     MMTV insertion site in tumor 102 [1]
    site       4495     4496     MMTV insertion site in tumor 53 [1]
    mRNA       1428     5398     Wnt-1 mRNA (alt.)
    mRNA       1595     5398     Wnt-1 mRNA (alt.)
    signal     1562     1566     TATA box
    signal     1573     1577     TATA box
BASE COUNT     1188 a   1681 c   1519 g   1219 t
ORIGIN      Chromosome 15.
        1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc
       61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac
      121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac
      181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa
      241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg
      301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg
      361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac
      421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct
      481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact
      541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac
      601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag
      661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg
      721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga
      781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca
      841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact
      901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc
      961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca
     1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg
     1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga
     1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg
     1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg
     1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc
     1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc
     1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg
     1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt
     1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa
     1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag
     1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact
     1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt
     1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc
     1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg
     1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg
     1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag
     1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct
     2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc
     2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct
     2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt
     2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg
     2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg
     2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg
     2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt
     2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga
     2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat
     2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg
     2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa
     2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc
     2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag
     2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg
     2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg
     2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc
     3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag
     3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag
     3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact
     3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac
     3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat
     3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc
     3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg
     3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc
     3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg
     3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg
     3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag
     3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag
     3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt
     3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca
     3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac
     3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt
     3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc
     4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc
     4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc
     4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg
     4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat
     4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt
     4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc
     4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg
     4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc
     4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg
     4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc
     4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga
     4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc
     4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac
     4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac
     4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc
     4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa
     4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc
     5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc
     5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac
     5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga
     5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac
     5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct
     5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat
     5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca
     5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga
     5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat
     5581 gaccctaaat gagaccaact agggatc
//
LOCUS       MUSP40M      3808 bp ds-DNA             ROD       18-JUL-1990
DEFINITION  Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete
            cds.
ACCESSION   M30136
KEYWORDS    T-cell growth factor; cytokine; mast cell growth factor.
SOURCE      Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210
            DNA, clone lambda-S40.1a.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3808)
  AUTHORS   Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
            Van Snick,J.
  TITLE     Human P40/Il9: Expression in activated CD4+ cells, genomic
            organization, and comparison with the mouse gene
  JOURNAL   J. Immunol. 144, 4235-4241 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Van Snick, 21-NOV-1989.
FEATURES       from  to/span     description
    pept        644      757     P40 T-cell and mast cell growth factor
                                 precursor, exon 1
                846      881     P40 T-cell and mast cell growth factor
                                 precursor, exon 2
                965      997     P40 T-cell and mast cell growth factor
                                 precursor, exon 3
               2184     2315     P40 T-cell and mast cell growth factor
                                 precursor, exon 4
               3393     3512     P40 T-cell and mast cell growth factor
                                 precursor, exon 5
    sigp        644      697     P40 T-cell and mast cell growth factor signal
                                 peptide
    matp        698      757     P40 T-cell and mast cell growth factor
                846      881     P40 T-cell and mast cell growth factor
                965      997     P40 T-cell and mast cell growth factor
               2184     2315     P40 T-cell and mast cell growth factor
               3393     3509     P40 T-cell and mast cell growth factor
    IVS         758      845     mP40 intron A
    IVS         882      964     mP40 intron B
    IVS         998     2183     mP40 intron C
    IVS        2316     3392     mP40 intron D
    signal      589      594     TATA box
    binding     227      233     AP-2 binding site
    binding     475      480     IRF-1 binding site
    binding     478      484     AP-1 binding site
BASE COUNT     1012 a    903 c    869 g   1024 t
ORIGIN      Chromosome 13.
        1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca
       61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt
      121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag
      181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc
      241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga
      301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt
      361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt
      421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga
      481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg
      541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta
      601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat
      661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg
      721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga
      781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac
      841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag
      901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt
      961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc
     1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct
     1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat
     1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga
     1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa
     1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac
     1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac
     1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc
     1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca
     1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta
     1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac
     1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc
     1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct
     1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta
     1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat
     1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg
     1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc
     1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg
     2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa
     2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt
     2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt
     2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa
     2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca
     2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc
     2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag
     2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg
     2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct
     2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag
     2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg
     2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag
     2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat
     2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc
     2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct
     2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc
     3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga
     3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg
     3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga
     3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa
     3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt
     3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc
     3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc
     3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga
     3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg
     3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag
     3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga
     3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac
     3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat
     3781 aggtaggatg gagcaaagac ttggtaat
//
LOCUS       PIGMHDQAA    1105 bp ss-mRNA            MAM       18-JUL-1990
DEFINITION  Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete
            cds.
ACCESSION   M29938
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone
            ZQ30-1.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1105)
  AUTHORS   Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
            LeGuern,C.
  TITLE     Class II genes of miniature swine: III. Characterization of an
            expressed pig class II gene homologous to HLA-DQA
  JOURNAL   Immunogenetics 31, 52-56 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept         13      777     MHC SLA-DQ alpha chain precursor
    sigp         13       81     MHC SLA-DQ alpha chain signal peptide
    matp         82      774     MHC SLA-DQ alpha chain
    mRNA    <     1     1105     MHC SLA-DQ mRNA
BASE COUNT      254 a    290 c    255 g    306 t
ORIGIN      
        1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc
       61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta
      121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag
      181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa
      241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg
      301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact
      361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac
      421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt
      481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc
      541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat
      601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag
      661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc
      721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc
      781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa
      841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc
      901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg
      961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc
     1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
     1081 aaacaataaa tcacttcatg aggtt
//
LOCUS       PIGMHDQAB    1174 bp ss-mRNA            MAM       18-JUL-1990
DEFINITION  Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete
            cds.
ACCESSION   M29939
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1174)
  AUTHORS   Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
            LeGuern,C.
  TITLE     Class II genes of miniature swine: III. Characterization of an
            expressed pig class II gene homologous to HLA-DQA
  JOURNAL   Immunogenetics 31, 52-56 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept         69      836     MHC SLA-DQ alpha chain precursor
    sigp         69      137     MHC SLA-DQ alpha chain signal peptide
    matp        138      833     MHC SLA-DQ alpha chain
BASE COUNT      273 a    313 c    266 g    322 t
ORIGIN      
        1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg
       61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga
      121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg
      181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat
      241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta
      301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca
      361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt
      421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca
      481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt
      541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca
      601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata
      661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga
      721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct
      781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca
      841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac
      901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct
      961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga
     1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc
     1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
     1141 aaacaataaa tcacttcatg aggtttttgt actg
//
LOCUS       PIGMHDQBA     212 bp ds-DNA             MAM       18-JUL-1990
DEFINITION  Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end.
ACCESSION   M29944
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Pig (haplotype c/c) adult spleen DNA, clone D26.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F.,
            Gustafsson,K. and LeGuern,C.
  TITLE     Class II genes of miniature swine: II. Molecular identification and
            characterization of B (beta) genes from SLA-c haplotype
  JOURNAL   Immunogenetics 31, 1-6 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept    <     1       12     MHC SLA-DO beta chain
BASE COUNT       38 a     58 c     52 g     64 t
ORIGIN      
        1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg
       61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc
      121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct
      181 cttacccttc tgcctttctt cctcctactt gt
//
LOCUS       PP2LEND       266 bp ds-DNA             PHG       18-JUL-1990
DEFINITION  Bacteriophage P2 left end DNA.
ACCESSION   M34756
KEYWORDS    .
SOURCE      Bacteriophage P2 DNA.
  ORGANISM  Bacteriophage P2
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 266)
  AUTHORS   Ziermann,R. and Calendar,R.
  TITLE     Characterization of the cos sites of Bacteriophages P2 and P4
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Ziermann, 05-JUN-1990.
            
              Author address: R.Ziermann
              University of California
              Dept. Mol. Cell Biology
              Berkeley, CA 94720
              Email: calendar@garnet.berkeley.edu
            
FEATURES       from  to/span     description
    site          1       19     cohesive end of phage DNA
    site         20      266     left end of phage DNA
BASE COUNT       41 a     58 c     99 g     68 t
ORIGIN      Map position 0.0-0.8%.
        1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct
       61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt
      121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg
      181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc
      241 gagccagccg tttatctcgc ggatcc
//
LOCUS       RCABCHH       314 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  R.capsulata bacteriochlorophyll biosynthesis protein (bchH and
            F108) genes, 3' end, and 5' end respectively.
ACCESSION   M34843
KEYWORDS    .
SOURCE      R.capsulata DNA.
  ORGANISM  Rhodobacter capsulatus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Yang,Z. and Bauer,C.E.
  TITLE     Rhodobacter capsulatus genes involved in early steps of the
            bacteriochlorophyll biosynthetic pathway
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.E.Bauer, 04-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1       79     bacteriochlorophyll biosynthesis protein
                                 (bchH) (AA at 2)
    pept        108  >   314     bacteriochlorophyll biosynthesis protein
                                 (F108)
BASE COUNT       59 a     87 c    119 g     49 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga
       61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg
      121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg
      181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg
      241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca
      301 agcgcgtgct gcag
//
LOCUS       TREANTAT      999 bp ds-DNA             ORG       18-JUL-1990
DEFINITION  T.evansi AnTat 3/3 kinetoplast DNA minicircle.
ACCESSION   M34848
KEYWORDS    .
SOURCE      T.evansi AnTat 3/3 kinetoplast minicircle DNA.
  ORGANISM  Kinetoplast Trypanosoma evansi
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Trypanosoma evansi.
REFERENCE   1  (bases 1 to 999)
  AUTHORS   Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
            Steinert,M. and Hamers,R.
  TITLE     Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Bajyana Songa, 05-JUN-1990.
FEATURES       from  to/span     description
    site        367      516     kinetoplast minicircle common region (ORI)
BASE COUNT      391 a     78 c    203 g    327 t
ORIGIN      
        1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
       61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
      121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
      181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
      241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
      301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
      361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
      421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
      481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
      541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
      601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
      661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
      721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
      781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
      841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
      901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
      961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS       VIBSCRRE     1524 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  V.alginolyticus Scr repressor protein gene (scrR), complete cds,
            and sucrose transport (scrA) protein, 5' end.
ACCESSION   M35009
KEYWORDS    Scr repressor protein; sucrose transport protein.
SOURCE      V.alginolyticus DNA.
  ORGANISM  Vibrio alginolyticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 1524)
  AUTHORS   Blatch,G.L. and Woods,D.R.
  TITLE     Nucleotide sequence and analysis of the Vibrio alginolyticus Scr
            repressor-encoding gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.L.Blatch, 10-JUN-1990.
            
              Author address: G.L.Blatch
              University of Cape Town
              Dept. Microbiology
              Private Bag
              Rondebosch
              7700 SOUTH AFRICA
              Email: bltgre01.uctvax@f4.n494.z5.fidonet.org
            
FEATURES       from  to/span     description
    pept         36  <     1 (c) sucrose transport protein (scrA)
    pept        271      555     Scr repressor protein (scrR)
    pept        791     1090     ORF5
    binding     264      267     scrR ribosomal binding site
    binding     780      785     ORF5 ribosomal binding site
    signal      178      183     scrA -10 region
    signal      203      208     scrA -35 region
    signal      190      195     scrR -10 region
    signal      162      167     scrR -10 region
    signal      762      767     ORF5 -35 region
    signal      742      747     ORF5 -35 region
    site         78       91     repressor operator site (put.)
    site        229      242     repressor operator site (put.)
    site         12       71     ScrR helix-turn-helix motif
BASE COUNT      466 a    344 c    321 g    393 t
ORIGIN      
        1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc
       61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg
      121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt
      181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc
      241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg
      301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa
      361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc
      421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg
      481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt
      541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga
      601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt
      661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac
      721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag
      781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt
      841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga
      901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg
      961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga
     1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta
     1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc
     1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt
     1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt
     1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat
     1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct
     1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca
     1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa
     1501 gccaattgtg atattgaaga agct
//
LOCUS       XELNRP1A     2431 bp ss-mRNA            VRT       18-JUL-1990
DEFINITION  X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA,
            complete cds.
ACCESSION   M34894
KEYWORDS    RNA-binding protein; nervous system-specific protein.
SOURCE      X.laevis stage 24, cDNA to mRNA, clones 24-39.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2431)
  AUTHORS   Richter,K., Good,P.J. and Dawid,I.B.
  TITLE     A developmentally regulated, nervous system-specific gene in
            Xenopus encodes a putative RNA-binding protein
  JOURNAL   New Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.Good, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         74     1117     pot. RNA-binding protein (nrp-1B)
    mRNA    <     1     2431     nrp-1B mRNA
    signal     2407     2418     poly-A signal
    site         22       27     RNP2 consensus sequence
    site         58       68     RNP1 consensus sequence
    site        111      116     RNP2 consensus sequence
    site        147      157     RNP1 consensus sequence
    site        921      941     alternative exon (put.)
BASE COUNT      663 a    522 c    543 g    703 t
ORIGIN      
        1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt
       61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc
      121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact
      181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt
      241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa
      301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt
      361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt
      421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga
      481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac
      541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa
      601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt
      661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat
      721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc
      781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc
      841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc
      901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag
      961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag
     1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc
     1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct
     1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct
     1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga
     1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat
     1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg
     1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat
     1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta
     1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa
     1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat
     1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac
     1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg
     1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc
     1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat
     1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc
     1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga
     1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct
     2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg
     2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc
     2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat
     2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg
     2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca
     2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata
     2401 gttcaaaata aactatgaat tttattcatt t
//
LOCUS       XELNRP1B     2324 bp ss-mRNA            VRT       18-JUL-1990
DEFINITION  X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA,
            complete cds.
ACCESSION   M34895
KEYWORDS    RNA-binding protein; nervous system-specific protein.
SOURCE      X.laevis stage 24, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2324)
  AUTHORS   Richter,K., Good,P.J. and Dawid,I.B.
  TITLE     A developmentally regulated, nervous system-specific gene in
            Xenopus encodes a putative RNA-binding protein
  JOURNAL   New Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.Good, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         29     1051     pot. RNA-binding protein (nrp-1B)
    mRNA    <     1     2324     nrp-1B mRNA
    signal     2306     2311     poly-A signal
    site         22       27     RNP2 consensus sequence
    site         58       68     RNP1 consensus sequence
    site        111      116     RNP2 consensus sequence
    site        147      157     RNP1 consensus sequence
BASE COUNT      647 a    467 c    520 g    690 t
ORIGIN      
        1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc
       61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac
      121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt
      181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca
      241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga
      301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat
      361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca
      421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg
      481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt
      541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc
      601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat
      661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag
      721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac
      781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg
      841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc
      901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat
      961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat
     1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg
     1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct
     1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat
     1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta
     1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa
     1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa
     1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct
     1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa
     1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc
     1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact
     1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg
     1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt
     1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct
     1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag
     1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag
     1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt
     1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta
     2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac
     2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt
     2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat
     2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa
     2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc
//
LOCUS       YSJSACDHY    2569 bp ds-DNA             PLN       18-JUL-1990
DEFINITION  Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds.
ACCESSION   M34929
KEYWORDS    saccharopine dehydrogenase.
SOURCE      Y.lipolytica (strain W29) DNA.
  ORGANISM  Yarrowia lipolytica
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2569)
  AUTHORS   Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and
            Gaillardin,C.
  TITLE     Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia
            lipolytica
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.M.Fournier, 06-JUN-1990.
            
            ORF1 is probably not translated, since a lacZ fusion/expression
            experiment did not produce results.
FEATURES       from  to/span     description
    pept        649     1929     ORF1
    pept       1764      655 (c) saccharopine dehydrogenase
    mRNA        582     2060     ORF1 mRNA (alt.)
    mRNA        598     2060     ORF1 mRNA (alt.)
    mRNA        599     2060     ORF1 mRNA (alt.)
    mRNA        602     2060     ORF1 mRNA (alt.)
    mRNA        606     2060     ORF1 mRNA (alt.)
    mRNA        609     2060     ORF1 mRNA (alt.)
    mRNA       1807      639 (c) ORF2 mRNA (alt.)
    mRNA       1809      639 (c) ORF2 mRNA (alt.)
    mRNA       1811      639 (c) ORF2 mRNA (alt.)
    mRNA       1812      639 (c) ORF2 mRNA (alt.)
    mRNA       1814      639 (c) ORF2 mRNA (alt.)
    signal      521      528     TATA box
    signal      565      575     TATA box
BASE COUNT      668 a    573 c    666 g    662 t
ORIGIN      
        1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag
       61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt
      121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca
      181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa
      241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag
      301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc
      361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc
      421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt
      481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca
      541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt
      601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc
      661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt
      721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc
      781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc
      841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac
      901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt
      961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca
     1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt
     1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc
     1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt
     1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata
     1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc
     1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc
     1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct
     1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg
     1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg
     1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc
     1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt
     1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag
     1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg
     1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag
     1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg
     1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga
     1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa
     2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca
     2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg
     2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc
     2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct
     2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag
     2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga
     2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat
     2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat
     2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc
//
LOCUS       MVISAA1A      772 bp ss-mRNA            MAM       18-JUL-1990
DEFINITION  Mink serum amyloid A1 mRNA, complete cds.
ACCESSION   M34953 J05445
KEYWORDS    serum amyloid A1.
SOURCE      Mink liver, cDNA to mRNA, clone pmiSAA1.
  ORGANISM  Mustela vison
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Musteloidea; Mustelidae.
REFERENCE   1  (bases 1 to 772)
  AUTHORS   Marhaug,G., Husby,G. and Dowton,S.B.
  TITLE     Mink serum amyloid A protein--expression and primary structure
            based on cDNA sequences
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.O.Marhaug, 07-JUN-1990.
FEATURES       from  to/span     description
    pept         29      418     serum amyloid A1
    mRNA    <     1      772     serum amyloid A1
BASE COUNT      176 a    205 c    200 g    191 t
ORIGIN      
        1 cagctctgct ccaccagaac tcagcaccat gaagcttttc acgggcctca tcttctgctc
       61 cttggtcctg ggggtcagca gtcaatggta ttcattcatt ggtgaggctg ctcaaggagc
      121 ttgggacatg tacagagcct actctgacat gatagaagcc aagtacaaaa attcggacaa
      181 atacttccat gcccgcggga actatgatgc tgcacaaagg ggccctgggg gtgcgtgggc
      241 tgctaaagtc atcagtgacg ccagagagcg ttctcagaga atcacagacc ttatcaagta
      301 tggagacagc ggccacggag tggaggactc gaaggctgac caagctgcca acgaatgggg
      361 ccggagtggc aaagacccca accacttccg accccctggc ctgcctgaca agtactgagc
      421 tttcccttgg ctctgccctg gggagatggg ctgtgaggcc cctgaaggca agaacagttg
      481 ctgagttaga gttcctgaat tgtataccct tcctactgaa tactttaaag agcacattaa
      541 aatgtttagt aaatgcttgt gaatgggcac ctggatggcc cagttggtta agcggctgtc
      601 ttcggctcat gtcatgatcc tagggtcctg ggaccgagtt ctccatgggg ctccttcctg
      661 ggcagggagt ctgcttctct gcctgcctct ctgcctctgc ctgccactct gcctgcttgt
      721 gtgctctctc tccctctttc tctgacaaat aaataaataa ataaaatctt tt
//
LOCUS       MVISAA2A      552 bp ss-mRNA            MAM       18-JUL-1990
DEFINITION  Mink serum amyloid A2 mRNA, complete cds.
ACCESSION   M34954 J05445
KEYWORDS    serum amyloid A2.
SOURCE      Mink liver, cDNA to mRNA, clone pmiSAA2.
  ORGANISM  Mustela vison
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Musteloidea; Mustelidae.
REFERENCE   1  (bases 1 to 552)
  AUTHORS   Marhaug,G., Husby,G. and Dowton,S.B.
  TITLE     Mink serum amyloid a protein - expression and primary structure
            based on cDNA sequences
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.O.Marhaug, 07-JUN-1990.
FEATURES       from  to/span     description
    pept         31      420     serum amyloid A2 precursor
    sigp         31       84     serum amyloid A2 signal peptide
    matp         85      417     serum amyloid A2
    mRNA    <     1      552     serum amyloid A2 mRNA
BASE COUNT      129 a    144 c    161 g    118 t
ORIGIN      
        1 agcagctctg ctccaccaga actcagcacc atgaagcttt tcacgggcct catcttctgc
       61 tccttggtcc tgggggtcag cagtcaatgg tattcattca ttggtgaggc tgttcaaggg
      121 gcttgggaca tgtacagagc ctactctgac atgagagaag ccaattacaa aaattcggac
      181 aaatacttcc atgcccgtgg gaactatgat gctgcacaaa ggggccctgg gggcgcctgg
      241 gctgctaaag tcatcagtga cgccagagag cgttctcaga gagtcacaga ccttttcaag
      301 tatggagaca gcggccacgg agtggaggac tcgaaggctg accaggctgc caacgaatgg
      361 ggccggagtg gcaaagaccc caaccacttc cgaccctctg gcttgcctga caagtactga
      421 gcttcccctt ggctctgtcc tggggggatg ggctgtgagg accctgggga cagggacacc
      481 cagtcattga gttctctgtc cccagaggct agtggagggc acctaatagg tgtctaataa
      541 atgcttactg ag
//
LOCUS       TFERUBISCO   2172 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  Pot. Thiobacillus sp. ribulose 1,5 bisphosphate carboxylase
            (RUBISCO) large and small subunit genes, complete cds.
ACCESSION   M34536
KEYWORDS    ribulose 1,5 bisphosphate carboxylase.
SOURCE      Pot. Thiobacillus sp. DNA, clones pAH5.0 and pAH196.
  ORGANISM  Thiobacillus sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Colorless sulfur bacteria.
REFERENCE   1  (bases 1 to 2172)
  AUTHORS   Stein,J.L., Haygood,M. and Felbeck,H.
  TITLE     Nucleotide sequence and expression of a deep sea ribulose 1,5
            bisphosphate carboxylase gene cloned from a chemoautotrophic
            bacterial endosymbiont
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.L.Stein, 23-MAY-1990.
            
                        Author address: J.L.Stein
                        Scripps Institution of Oceanography
                        UC San Diego, A-002
                        A-002, Marine Biol Res Div.
                        La Jolla, CA 92093
            
            The taxonomy of this organism could not be determined.  It can only
            live as a symbiont of Alvinoconcha hessleri, a gastropod.
FEATURES       from  to/span     description
    pept        201     1616     ribulose 1,5 bisphosphate carboxylase large
                                 subunit (EC 4.1.1.39)
    pept       1685     2050     ribulose 1,5 bisphosphate carboxylase small
                                 subunit
    signal       19       24     pot. -35 region
    signal       43       49     pot. -10 region
    binding     188      194     ribosome binding site
    binding    1673     1676     ribosome binding site
BASE COUNT      441 a    741 c    641 g    349 t
ORIGIN      
        1 atccttatcc tgaccctcaa tacataatcg tgagcttatg gctataatca gaaagtctga
       61 gttcacttta tagatgacct ctcccatgct cactctcaat cctgagtgat tcactcggat
      121 tggacccgat aaccgtgcgg gctgtcggac gctaggcgtc cggtgcccct cgaacaccga
      181 gcctctggag aggacaagca atggcgaaga aatacgatgc gggcgtaaaa gactaccgcg
      241 aaacctactg gatgccggac tacacgccga aggagaccga cctgctggcg tgcttcaaga
      301 tcattccgca gccgggcgtg ccgcgcgagg aagcccgggc cgccgtggcc gccgagtcgt
      361 ccaccggcac ctggaccacc gtgtggaccg acctgctgac cgacctcgac cactacaagg
      421 gccgtgccta cgccatcgag gacgtgccgg gcgacgagga ggccttctat gccttcatcg
      481 cctaccccat cgacctgttc gaggaaggct cggtggtcaa cgtgtttacc tctctggtgg
      541 gcaacgtgtt cggcttcaag gccatccgcg ccctgcgtct ggaggacgtg cgcttcccca
      601 tcgcatacgt catgacctgc aacggtccgc cgcacggcat ccaggtcgag cgcgacatca
      661 tgaacaagta cggccgtccg ctgttgggct gcaccatcaa gcccaagctg ggcctgtcgg
      721 ccaagaacta cggtcgcgcc gtgtacgaat gcctgcgcgg cggcctggac ttcaccaagg
      781 acgacgagaa cgtcaactcc cagcccttca tgcgctggcg gcacggtttc gacttcgtga
      841 tggaggccat cgagaaggcc gagcgcgaga ccggcgagcg caagggtcac tacctgaacg
      901 tcaccgcgcc caccccggac gagatgttca agcgtgccga gtacgccaag gagatcggcg
      961 cgccgatcat catgcacgac tacatcaccg gcggcttcac cgccaacacc ggcctggcgc
     1021 agtggtgccg tgacaacggc gtgctgctgc acatccaccg cgccatgcac gccgtgctcg
     1081 accgcaaccc gcaccacggc atccacttcc gcgtgctgac caagatcctg cgcctgtccg
     1141 gcggcgacca cctgcacacc ggcaccgtgg tcggcaagct ggagggcgac cgcgaggcga
     1201 ccctcggctg gatcgacctg ctgcgcgagt cctacatcaa ggaagaccgt tcgcgcggca
     1261 tcttcttcga ccaggactgg ggctccatgc ccggcgtctt tgcggcctgc tccggcggta
     1321 tccacgtctg gcacatgccg gcgctggtca ccatcttcgg cgagcatgcc gtgctccagt
     1381 tcggtggcgg caccctgggt cacccctggg gcaacgccgg cgccgcggcc aaccgggtgg
     1441 cgctggaagc ctgcgtcgag gctcgcaacg agggccacga gctggagaaa gagggcaagg
     1501 acatcctcat ccaggccgcc aagcacagcc ccgaactcaa gaccgcgatg gagacctgga
     1561 aggaaatcaa gttcgagttc gacaccgtgg acaaactgga cgtggcgcac aaataagcgc
     1621 cccgccaccc cgggccggtc gtggctggcc gcgccacgca tccccaattt tcgaggtatc
     1681 aacaatgagc gaaatccaag actacaactc gagcgtcagc gatccgtcga gccgcaagtt
     1741 cgagaccttc tcctacctgc ccgaactcgg cgtggaaaag atccgcaagc aggtcgagta
     1801 catcgtcagc aagggctgga acccggccgt cgagcacacc gagccggaga acgccttcga
     1861 ccactactgg tacatgtgga agctgccgat gttcggcgaa accgacgtgg acgccatcct
     1921 ggccgaggcc gaggcatgcc acaaggcgca tccctcgcat cacgtgcgcc tgatcggcta
     1981 cgacaactat gcccagtcgc aaggcactgc catggtgatc ttccgcggcc ccatcagcgc
     2041 caagtgctga gccaccgggg ccgtaaacac gcgactgccg aggtctgccg gcccccgtcg
     2101 catggcgacc cccgagggca tccgccaggt gcgtcacctc gcgatgttct tccgaacatt
     2161 ccaataagcc ag
//
LOCUS       LSVGPCN      3417 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Lassa virus glycoprotein (GPC) and nucleoprotein (N) genes,
            complete cds.
ACCESSION   M36544 M33879
KEYWORDS    glycoprotein; nucleoprotein.
SOURCE      Lassa virus (strain GA391), cDNA to viral RNA.
  ORGANISM  Lassa virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Arenaviridae.
REFERENCE   1  (bases 1 to 3417)
  AUTHORS   Wilson,S.M. and Clegg,J.C.S.
  TITLE     Sequence analysis of the S RNA of Lassa virus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by J.C.S.Clegg, 26-APR-1990.
            
                Author address:J.C.S.Clegg
                PHLS Centre for Applied Microbiology and Research
                Porton Down, Salisbury SP4 OJG
                UNITED KINGDOM
            
FEATURES       from  to/span     description
    pept         71     1543     glycoprotein (GPC)
    pept       3315     1603 (c) nucleoprotein (N)
BASE COUNT      944 a    801 c    706 g    966 t
ORIGIN      
        1 gcaccgggga tcctaggcat ttaggattgc gcattttaaa acctcctttt tggaaagtgt
       61 cgcaatcagg atgggacaga ttgtgacatt cttccaagaa gttcctcatg ttattgagga
      121 agtgatgaat attgtcctta ttgcactatc catcctagca attctgaagg gactatacaa
      181 tgttgccacg tgtggcttga tagggcttgt cacattcctt ctactttcag gaaggtcatg
      241 ctcactgatc tacaaaggga cttacgagct gcaaaccctt gagttgaata tggagactct
      301 taatatgacc atgccgctat cctgcacaaa gaacaacagt catcattata taagggtggg
      361 gaatgagaca ggacttgagc tcaccttgac caacaccagc attcttaatc acaaattctg
      421 taacctctct gatgcccaca aaaggaatct ttatgatcac agcctcatga gtatcatctc
      481 tacctttcat ctgtccattc ccaacttcaa tcaatatgag gcaatgagct gcgatttcaa
      541 tggggggaaa atcactgtgc aatacaacct gagtcatagc ttcgcagtgg atgcagcagg
      601 tcactgcggc acacttgcaa atggtgtctt acaaacattt atgagaatgg cttggggagg
      661 gagttatatt gctcttgatt ctggacgcgg taactgggac tgtataatga ccagttacca
      721 atatctaatc attcagaata caacctggga tgaccactgc caattttcca gaccatcacc
      781 tattggctac cttgggcttc tctcacaaag aactagagac atatacatta gcagaaggtt
      841 gttggggaca ttcacctgga cactatcaga ctcagaggga aatgaaacac cagggggata
      901 ctgccttact agatggatgt tgattgaggc cgaattaaag tgctttggaa acactgcagt
      961 agccaagtgc aatgagaaac atgatgaaga attttgtgac atgctaaggt tgttcgattt
     1021 caacaaacag gccataagga ggctcaaaac agaggcccaa atgagcatac agctgatcaa
     1081 caaggctgtc aatgctttaa taaatgatca gctcatcatg aagaaccact tgagagacat
     1141 catgggcata ccatattgta actacagcag atattggtac cttaaccaca catcaacagg
     1201 aaagacctca ctaccaaggt gttggcttat ctcaaatgga tcatatctca atgagaccaa
     1261 gttttcagat gacatcgaac aacaagctga caacatgata acagagatgc tacaaaagga
     1321 atacatagat agacagggca aaactccact ggggttagtt gacctatttg tttttagcac
     1381 aagtttctat ctgataagca tctttctcca cctggtcaaa ataccaaccc atagacacat
     1441 tgtaggtaaa ccttgcccaa aaccccacag gctgaaccac atgggcatct gctcctgtgg
     1501 tctatacaaa cagccaggtg tgcctgtcag atggaagagg tgaaatccca cagggccccc
     1561 gtgacccacc gccaattggc ggtgggtcac gggggcgtcc atctacagga cgactttagg
     1621 tgttgaagtt ctgaacacca tgtctctagg gagcacaact ctcaggactt ttgcatcaag
     1681 tcctcctgaa actgctgcat caaacataat gcaatccatc agtgcacaat gaggagttat
     1741 ttcctcttta ccacctctct tctttttctc cacaactacc ccagtgtgca tgtgacatag
     1801 atccttgaat tgatcccaaa cagcattctc aaactttctt gaatctgctt tactaagagt
     1861 gatgtcaatc agttttatgt ctctcctccc ttgtgactca aggagttttc tgatatcctc
     1921 tgatccttgg caagtgatga ccatgttccg aggaagggct tctatcactg cactggttaa
     1981 cccaggttgg gcagcaaaca aatcagtcac atccctacca tgtgaatact ttgcatcttg
     2041 tttgaattgc ttcaaatctg ttggctccct aaagaaatgt acatagcaac ccgagctcgg
     2101 ttgataaagg gctatctcaa ctgggtcttc tggtcttcct tcaatatcca tccaggtttt
     2161 tgcgttggga atcaagttga agcacttgaa atctttgaga gtcatcaact gagagtagat
     2221 taatcctgca gcaaagcctg cagactgtaa tgacttgttg gaccccccat ttccaatttt
     2281 ctggggcttg ttgtcttgct caaggtccac cacagtattt tcccatgctc ttcccacaat
     2341 cgaggtcctt gatgcaatat aggaccatcc gtctcctgag agacagatct tgtataggat
     2401 gttctcataa gggttccttt cacccggtgt gtctgataca aacattccca gactcttctt
     2461 aactttcaag attgacttca agataccatc catggtctga ggtgaaacct taatagtctc
     2521 taacatgtta ccaccatcaa gcatgcaggc ccctgctttg acagcagcac ccaaactgaa
     2581 attgtaacca gagatgttga gtgaactttt cttagtgtca atcatattca gaattggatg
     2641 actctgagac aatctgtcga gatcagatga gttggggtat ttggctgtgt aaatcagccc
     2701 taaatctgtc aaagcttgaa cggcatcatt caggtccact tgcccctgtt tggtcaggca
     2761 tgctaaagtt aggcttggca ttgttccgaa ctgattgttg agtaactctg catttttgac
     2821 atcccaaact ctcacaatgc cattactggc accctttccc cctccactta ctccagtcat
     2881 gccaatcatg ttcaaaaggg ctctcctttg atcaagctgt tgtgaactca aattccccat
     2941 ataaactcct gaactcaaag gcctttctgt tctggtgact tttgatttca gtttttctaa
     3001 atcagcggcc aggattagta gatcgtctga acttaaggtt ccaactctta agacactttt
     3061 ctgctgtgtg gatttgagct caacaagatt gttgactgct tgattgagat ctctcagtcg
     3121 ttttaggtcg ccatcatctc ttttctgctt gcgcatcaat ctctgaacat tactgacctc
     3181 ggagaagtca agaccatgaa ggagagcttg agcgtcttta actacctgca actttatgtt
     3241 ggaacagtag ccagatagtt cccttcttag ggattgagtc cacaagcatg acctcacttc
     3301 cttggaagca ctcattgtcg tgatggttgt ctgacccttg agtgggtctt gaatgtggtc
     3361 actccaaagg tttgattagt gcaaagcgca atccaatagc ctaggatcca ctgtgcg
//
LOCUS       MOPGPCNC     3419 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Mopeia virus glycoprotein (GPC) and nucleocapsid protein genes,
            complete cds.
ACCESSION   M33879
KEYWORDS    glycoprotein; nucleocapsid protein; small RNA segment.
SOURCE      Mopeia virus (strain 800150), cDNA to viral RNA.
  ORGANISM  Mopeia virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Arenaviridae.
REFERENCE   1  (bases 1 to 3419)
  AUTHORS   Wilson,S.M. and Clegg,J.C.S.
  TITLE     Sequence analysis of the S RNA of the African arenavirus Mopeia: An
            unusual secondary structure feature in the intergenic region
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by J.C.S.Clegg, 26-APR-1990.
            
                Author address: J.C.S.Clegg
                PHLS Centre for Applied Microbiology and Research
                Porton Down, Salisbury SP4 OJG,
                UNITED KINGDOM
FEATURES       from  to/span     description
    pept         47     1516     glycoprotein
    pept       3352     1640 (c) nucleocapsid protein
BASE COUNT      875 a    819 c    736 g    989 t
ORIGIN      
        1 gtggatccta ggctttttgg ttgcgcattt ctagagcatc tcggagatgg ggcagatagt
       61 caccttcttt caagaggtgc cacacatcct tgaagaagtg atgaacattg tgctgatgac
      121 cctctcaatc ttggccatcc taaagggcat ctacaatgtg atgacctgtg gaatcatcgg
      181 tttgataaca tttttgtttt tgtgtgggag atcatgctca agcatctata aggacaacta
      241 tgagttcttc tctctcgacc tcgacatgtc ttcactgaat gcaacgatgc ctctctcctg
      301 ctcaaagaac aactcccatc actacatcca agttgggaat gagacaggcc tagagctgac
      361 attgacaaac actagcataa ttaatcataa gttttgcaac ctgtctgatg ctcacaggag
      421 gaatctttat gacaaagcac ttatgtcaat cttgacaaca ttccacctaa gcattccaga
      481 ttttaaccaa tatgaggcaa tgtcatgtga ttttaacgga gggaagatct cagttcaata
      541 caacctgtcc cactcaaatt atgtggatgc cggcaaccac tgtggcacta ttgcaaatgg
      601 cattatggat gtttttagaa gaatgtattg gagcacctcc ctttcggttg cctctgacat
      661 aagtgggact caatgcatac agaccgatta taagtatttg attattcaga acacatcatg
      721 ggaggatcat tgcatgttct caagaccctc acccatggga tttttaagcc ttctgtcaca
      781 gagaaccagg aatttctaca tctcaagaag acttttgggt ctttttacat ggactttgag
      841 tgactcggaa gggaacgaca tgccaggtgg ttactgtctc acaaggtcta tgctaatagg
      901 gcttgatctg aaatgctttg ggaacactgc cattgcgaag tgtaatcagg cacatgatga
      961 agaattctgt gacatgctcc gcctttttga ttttaataaa caagcaatta gcaaactgag
     1021 gtctgaagtt cagcagagca taaatttgat aaataaagct gtcaacgccc tcatcaatga
     1081 tcaactggtc atgaggaacc atttgagaga tctaatgggt attccttatt gcaactactc
     1141 taagttttgg tacttgaatg atactaggac agggagaaca tccctcccaa agtgttggct
     1201 tgtgaccaat gggtcatacc taaatgaaac ccagttctca acagaaattg agcaagaagc
     1261 caataacatg ttcactgaca tgttgaggaa ggagtatgag aaaaggcaga gcacaacacc
     1321 tctggggtta gtagatcttt ttgttttctc cactagcttt tatttgatct ccgtgttcct
     1381 ccacctaatc aaaattccaa cacatagaca cataaaaggc aagccttgcc cgaaaccaca
     1441 cagactcaat cacatggcaa tctgttcctg tggcttctat aagcaaccag gtctccccac
     1501 acaatggaaa aggtgaagaa caggggcacc cccgagaccc accgccttcg gcggtgggtc
     1561 tcgggggcct ggagtgactg ttccttgatg gcctccacga cccaccgcca gaggcggtgg
     1621 gtcgtggagg tgtcccttct tacaggacaa ctctgggagg acctgttcta aacaccaggt
     1681 ctctactcaa gactggtctg ggggttggga tttgtggact ccctatgact gctgcttcaa
     1741 acatgagaca atccagcaat gcacagtgtg gtgtcacttc ttctttgcca cctctcttct
     1801 tcttttgcgt tacaatcccc gtatgcattc tacaaaggtg tttgtattca tcccagatct
     1861 tatcctcaaa ttttcttgca tcgtctttct gcatggacac atcaatcagt tttatgtccc
     1921 tcctgttctg ggagtcaaga agctttctga tgtcatcagc accttgacac gacaagacca
     1981 tgttctttgg caggctctca attacagcac ttgtcaaccc cggctgaact gagaataggt
     2041 cttggatgtc aatgccatga gagtgtttgg agtcttgttt gaattgttta atgtctgttg
     2101 gttccctgta aaaatgaata tactgaccat tattaggttg atagatagct atctccacgg
     2161 ggtcttcagg tctcccttct atgtcaatcc atgtcttagc atttggatcc aattgcaaca
     2221 tgctgtcttt gagttccatt gtctgagagt aggtcaaacc agcaggcatg ccttgggcct
     2281 gagcgagtct gttggagtta cctttgtcag gctttgcatt ctgttgcaca tcaccactca
     2341 gatcaacagt tgtgttatcc caagccctgc cgacgatcga tgttcttgat gctatgtaag
     2401 gccaaccctc tccagaaaga catagtttgt agaggagatt ttcataaggg ttcctttccc
     2461 ctggcgtgtc agagacaaac attcccaaag aacgcttgac tttcaaggca gccttcaaaa
     2521 ttccttccaa attggatttt gttactttga tggtctccag catgtttccg ccgtctatta
     2581 gacaagcacc agctttcacc gcagctgaca ggctaaaatt gtaccctgat atgtttatgg
     2641 cacttttctt gggctcaatg atccccaaga ttgggtgggt ctgagagagt ttgtctaagt
     2701 cagacatgtt tggatacttt gcagtgtaca caagccccaa gtctgacaac gattggatca
     2761 catcattcag atctgcctgc ccttgtttgc acatgcaagc aatcgtcagg cttggcattg
     2821 tcccaaactg attgtttaaa agctctgggt ttcgaacatc ccacactcta acaatgccat
     2881 ctccgagagt gtttcctgcc cggaacccgc cacccattcc gaccatctgt agcaaaaccc
     2941 tcctctgttc tagctgctga gctgttaggt tgcccatata gactccagca gcaagaggcc
     3001 tctcacctct gatgactttt gctttgagtc tgtccaggtc ggcagcaagg acgaggaggt
     3061 catcagaggt tagtgtcccc actctcaaaa cattcttctg ttggactgac tttaactcaa
     3121 ctagattgtt cactgcctgg tttaggtccc tcaatctttt caggtcagag tcatccctct
     3181 tctcctttct catcaacctt tgaacattgg caacttcaga gaagtccagc ccatgaagaa
     3241 gtgcttgagc atccttgatg acttggatct ttatgttgga gcagtaccct gagagttctc
     3301 tcctcaggct ctgtgtccac aagaaggact tcacctcctt tgaattggac attgttggag
     3361 gcttttcggt gcagagctga ttccaaaggc gcaatcaatt agcctaggat ccactgtgc
//
LOCUS       HIV2D194     9399 bp ss-RNA             VRL       18-JUL-1990
DEFINITION  Human immunodeficiency virus type 2 (HIV-2), complete proviral
            genome.
ACCESSION   J04542
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 2, isolate D194.
  ORGANISM  Human immunodeficiency virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Kuehnel,H., von Briesen,H., Dietrich,U., Adamski,M., Mix,D.,
            Biesert,L., Kreutz,R., Immelmann,A., Henco,K., Meichsner,C.,
            Andreesen,R., Gelderblom,H. and Ruebsamen-Waigmann,H.
  TITLE     Molecular cloning of two West African human immunodeficiency virus
            type 2 isolates that replicate well in macrophages: A Gambian
            isolate, from a patient with neurologic acquired immunodeficiency
            syndrome, and a highly divergent Ghanian isolate
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 2383-2387 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 9399)
  AUTHORS   Kuehnel,H., von Briesen,H., Dietrich,U., Adamski,M., Mix,D.,
            Biesert,L., Kreutz,R., Immelmann,A., Henco,K., Meichsner,C.,
            Andreesen,R., Gelderblom,H. and Ruebsamen-Waigmann,H.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     The gag coding sequence appears to be distorted at the 5' end by a
            frameshift; the env cds has a premature stop codon.  The 5' ends of
            the pol and gag genes have not been determined.
FEATURES       from  to/span     description
    pept    <   851     2053     gag polyprotein (AA at 851)
    pept    <  1657     4878     pol polyprotein (AA at 1657)
    pept       4808     5455     vif protein
    pept       5283     5621     vpX protein
    pept       5621     5938     vpR protein
    pept       5784     6079     tat protein, exon 2 (first expressed exon)
               8212     8308     tat protein, exon 3 (AA at 8213)
    pept       6010     6079     rev protein, exon 2 (first expressed exon)
               8212     8453     rev protein, exon 3 (AA at 8214)
    pept       6086     6886     env ORF
    pept       8471     9244     nef protein
    LTR           1      239     5' LTR
    LTR        8856     9399     3' LTR
    rpt           1      113     R repeat 5' copy
    binding     243      259     primer (Lys-tRNA) binding site
BASE COUNT     3206 a   1924 c   2357 g   1911 t      1 others
ORIGIN      
        1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
       61 gctccacgct tgcttgctta aagacctctt aataaagctg ccagttagaa gcaagttaag
      121 tgtgtgttcc catctctcct agtcgccgcc tggtcattcg gtgttcatct gagtaacaag
      181 accctggtct gttaggaccc ttcccgcttt gagaatccaa ggcaggaaaa tccctagcag
      241 gttggcgccc gaacagggac ttgaaagagg actgagaagc cctggaacac ggctgagtga
      301 aggcagtaag ggcggcagga acaaaccacg acggagtgct cctagaaaag cgcgggccga
      361 ggtaccgaag cggcgtgtgg agcgggagtg aaagaggcct ccgggtgaag gtaagtacct
      421 acaccgaaaa ctgtagccag aaaaggcttg ttatcctacc tttagacagg tagaagattg
      481 tgggagatgg gcgcgagaaa ctccgtcttg agagggaaaa aagcagacga attagaaaaa
      541 gttaggttac ggcccaacgg aaagaaaaga tacaggttaa aacatgttgt gtgggcagcg
      601 aatgaattag acagattcgg attggcagag agcctgttgg aatcaaaaga aggttgccaa
      661 aagattctta aagttttaga accattagta ccaacagggt cagaaaattt aaaaagcctt
      721 tttaataccg tctgcgtcat ttggtgcttg cacgcagaag agaaagtgaa agatactgaa
      781 gaagcaaaga aactagcaca gagacatcta gtggcagaaa caggaactgc agagaaaatg
      841 ccaaatataa gtagaccaac agcaccacct agtgggaaag ggaggaaact tccccgtgca
      901 acaggcaggc ggcaactata tccatgtgcc gctgagcccc cgaactctaa atgcttgggt
      961 aaaattagta gaggaaaaga agttcggggc agaagtagtg ccaggatttc aggcactctc
     1021 agaaggctgc acgccctatg atatcaatca aatgcttaat tgtgtgggcg atcaccaagc
     1081 agctatgcaa ataatcagag aaattattaa tgaggaagca gcagattggg atgcgcagca
     1141 cccaatacca ggccccttac cagcagggca gcttagagac ccaagggggt ctgacatagc
     1201 aggaacaaca agcacagtag atgaacagat ccagtggatg tataggcaac caaatcccgt
     1261 gccggtaggg aacatctaca ggagatggat ccagataggg ctacagaaat gtgtcaggat
     1321 gtacaaccca actaacatct tagatgtgaa gcagggacca aaagaatcgt tccagagcta
     1381 tgtagacaga ttctacaaaa gcctaagggc agaacaaaca gacccggctg taaaaaattg
     1441 gatgacccaa acgctgctaa tacagaatgc caacccagac tgcaagttag tattaaaagg
     1501 actagggatg aatcccaccc tagaggagat gctgactgcc tgccagggag taggcggacc
     1561 aagccagaaa gccagactaa tggctgaagc cctaaaggag gctttgacgc cagcccctat
     1621 cccatttgca gcagcccaac aaagaagggc aattaggtgt tggaattgtg gaaaggaggg
     1681 acactcggcg aaacagtgcc gagcacccag aagacagggc tgctggaagt gtggcaagtc
     1741 aggacacatc atggcaaact gcccggaaag acaggcaggt tttttaggga tgggcccacg
     1801 gggaaagcag ccccgcaact tccccgcggc ccaagctcct caggggctga taccaacagc
     1861 acccccaata gatccagcag tggacctgtt ggagaaatat atgcagcaag ggagaaagca
     1921 gagagagcag agggagagac catacaagga ggtgacggag gacttactgc acctcgagca
     1981 gggagagacg ccccacagag gggcgacaga ggacttgcta cacctcaatt ctctctttgg
     2041 aaaagaccag tagtcacagc attcatcgag gatcagccgg tagaagtctt actagacaca
     2101 ggagctgatg actcaatagt agcaggaata gagttagggg acaattacac tccaaaaata
     2161 gtggggggaa tagggggatt cataaatacc aaagaatata aaaatgtaga aataaaggta
     2221 ctaaataaaa gagtaagagc caccataatg acaggagata ccccaatcaa catttttggc
     2281 agaaatattc tggcaacctt aggcatgtca ttaaacctac cagtcgccaa gttagaccca
     2341 ataaaagtaa cattgaagcc agggaaagat ggaccaaggc tgaaacaatg gcccctaaca
     2401 aaagaaaaaa tagaagcact aaaagaaatt tgtgaaaaaa tggaaaggga gggccaacta
     2461 gaagaagcac ctccaactaa tccttataat acccccacat ttgcaattaa gaaaaaggac
     2521 aagaacaaat ggagaatgct aatagatttt agagaactaa acagggtgac tcaagatttc
     2581 acagaaattc agctaggaat tccacacccg gcaggattag ccaaaaagaa aaggattact
     2641 gtactagatg taggggatgc ctacttttcc ataccactac atgaagattt taggcaatat
     2701 actgcattta ccctaccatc agtaaacaat gcagagccag aaaaaagata tgtatataag
     2761 gtcttaccac aaggatggaa aggatcacca gcaatctttc aattcatgat gaggcaaatc
     2821 ttagaacctt tcagaaaagc aaacccagac gtcattctca tccaatacat ggatgatatc
     2881 ttaatagcta gtgacaggac gggtttagag catgacaaag tagtcctgca actaaaagaa
     2941 cttctgaatg gcctagggtt ctctacccca gatgagaagt tccaaaagga ccctccgttt
     3001 caatggatgg gctatgaatt gtggccaact aaatggaaac tgcagaaaat acaattacct
     3061 cagaaagaaa tatggacagt caatgacatc caaaaactag taggagtttt gaactgggcg
     3121 gcgcagatct atccagggat aaaaaccaag catttatgta aattgattag aggaaaaatg
     3181 acactcacag aggaagtaca gtggacagag ttagcagagg cagaactaga agaaaacaaa
     3241 attatcttaa gtcaggaaca agagggatcc tactatcagg aagaagaaga actagaagca
     3301 acagtcatca aaagccaaga caatcagtgg gcatacaaaa tacaccaggg agagagggtt
     3361 ctaaaagtag gaaagtatgc gaagataaaa aatactcata ccaatggggt cagactacta
     3421 gcacaagtag tccaaaaaat aggaaaggaa gcactggtca tttggggacg agtgccaaaa
     3481 tttcacctac cggtagagag agacacctgg gagcaatggt gggataacta ctggcaagta
     3541 acatgggtcc cagagtggga cttcgtatct accccaccac tggtcaggtt gacatttaac
     3601 ttggtaggag atcctatacc aggcacagag accttttaca cagatggatc atgcaataga
     3661 cagtcaaaag aaggaaaagc aggatatgta acagatagag ggagagacag ggtaagagta
     3721 ttagagcaaa catccaatca gcaagcagaa ctagaagcct ttgcgatggc actggcagac
     3781 tcaggtccca aggttaatat catagtagac tcacagtatg taatggggat agtagcaggc
     3841 caaccaacag agtcagaaaa tagaatagta aaccaaatca ttgaggacat gataaagaaa
     3901 gaagcagtct atgttgcatg ggtcccagcc cataaaggca taggaggaaa ccaggaagta
     3961 gaccatttag taagtcaggg catcagacaa gtattattcc tggaaaagat agagcccgct
     4021 caagaagaac acgaaaaata tcatagcaat ataaaagaac taacccataa atttggaata
     4081 ccccaactag tggcaagaca gatagtaaac acatgtgccc aatgccaaca gaaaggagaa
     4141 gccatacatg ggcaagtaaa tgcagaaata ggcgtttggc aaatggactg cacacactta
     4201 gaaggaaaaa tcattatagt agcagtgcat gttgcaagtg gattcataga agcagaagtc
     4261 atcccacagg aatcaggaag gcagacagca ctcttcctat taaaactggc cagtaggtgg
     4321 ccaataacac acttgcacac agacaatggc cccaacttca cttcacagga agtgaagatg
     4381 gtggcatggt ggataggtat agagcaatcc tttggagtac cttacaatcc acaaagccag
     4441 ggagtagtag aagcaatgaa tcaccaccta aaaaatcaga taagtagaat tagagaacag
     4501 gcaaatacaa tagaaacaat agtactaatg gcagttcatt gcatgaattt taaaagaagg
     4561 ggaggaatag gggatatgac cccagcagaa agactaatta acatgatcac cacagaacaa
     4621 gaaatacaat tcctccaaag aaaaaattca aattttaaaa aattccaggt ctattacaga
     4681 gaaggcagag atcagctgtg gaaaggacct ggagagctac tgtggaaggg agacggagca
     4741 gtcatagtca aggtaggggc ggacataaaa gtagtaccaa gaaggaaggc caagattatc
     4801 agggactatg gaggaaggca agaactggat agtagttccc acctggaggg tgccagggag
     4861 gatggagagg tggcatagcc ttgtcaagca cctgaagtac agaacaaaag acttagagga
     4921 ggtgcgctat gttccccatc acaaggtagg atgggcatgg tggacttgca gcagggtaat
     4981 attcccacta gaaggagaaa gtcatctaga gatacaggca tattggaacc taacaccaga
     5041 aaaaggatgg ctctcctctc attcagtaag gttaacctgg tatacagaaa agttctggac
     5101 agatgttacc ccagactgtg cagactccct aatacacagc acttatttct cttgctttac
     5161 ggcaggtgaa gtaagaagag ccatcagagg ggaaaagtta ttgtcctgct gcaactaccc
     5221 ccaagctcat aaagcacagg taccatcact tcaataccta gccctagtgg tagtgcaaca
     5281 aaatggcaga ccccagagaa agggtgccgc caggaaacag tggagaagag accattggag
     5341 aggccttcga gtggctagac aggactatag aagccttaaa cagggaggca gtgaaccatc
     5401 tgccccgaga gctcattttc caggtgtggc aaaggtcctg ggcatattgg catgatgaac
     5461 aagggatgtc aacaagttac acaaagtata gatatttgtg cataatgcag aaagctgtgt
     5521 atatacattt caagaagggg tgcacttgcc tggggagagg acatggcccg ggaggatgga
     5581 gaccaggacc tccccctcct ccccctccag gtctagtcta atgactgaag caccaacaga
     5641 gtttccccca gaagatggga ccccacggag agagctaggg agtacctggg taatagaaac
     5701 tctgaaggaa atcaaggaag aagccttaaa acattttgat ccctgcttgc taattgctct
     5761 tggcaactat atctataata gacatggaga cacccttgaa ggagccagag agctcattag
     5821 agtcctacaa cgagccctct tcgtgcacat cagagcggga tgtgaccgct caagaaaggg
     5881 ccaaacaagg agaagagctc cttgcccagc tgcaccgacc cctagaggca tgcactaact
     5941 catgctattg taagcagtgc agttaccatt gccagctgtg tttcttgaaa aaagggctcg
     6001 ggatatggta tgcgcgacag ggcagacgaa gaaggactcc aagaaaaact aagactcatc
     6061 cgcctcctgc atcagataag taagtatgga gcctggtagg aatcagctgc ttgttgccat
     6121 tttattaact agtgcttgct taatatattg caaacaatat gtgactgttt tctatggcat
     6181 acccgcgtgg agaaatgcat ctattcccct attttgtgca accaaaaata gagatacttg
     6241 ggggaccatc cagtgcttgc cagacaatga tgattatcag gaaataacct taaatgtgac
     6301 agaagctttt gatgcatggg ataatacagt aacagaacaa gcaatagaag atgtctggag
     6361 actgtttgag acatcaataa aaccatgtgt caagttgacg cccctatgtg tggcgatgaa
     6421 ttgtaatata acttcaggga ctaccgcgac cccgagtcca ccaaacatta caataataga
     6481 tgaaaattct acctgtatag gcgacaacaa ctgcacagga ttagggaaag aagaggtggt
     6541 tgagtgtgag ttcaatatga cggggctaga acaagataag aaaaggaagt ataatgacgc
     6601 atggtactca agagatgtgg tttgtgacaa gacaaacgga acaggcacat gttacatgag
     6661 acattgcaac acatcagtca tcaaagagtc atgtgacaag cactattggg atgctatgaa
     6721 gtttagatac tgtgcaccac cgggttttgc cctactaaga tgcaatgata ccaactattc
     6781 aggctttgaa cctaagtgct ctaaagtagt agctgcttca tgcacaagga tgatggaaac
     6841 gcaaacttct acttggtttg gctttaatgg cactagagca gaatagaaca tatatctatt
     6901 ggcatggtaa gataatagga ctatcattag cttaaacagt attataatct cacaatgcat
     6961 tgtaagagac caggaaataa gacagttgta ccaataacac ttatgtcagg gcgaaggttt
     7021 cactctcggc cagtctacaa caaaaaacct gggcaggcat ggtgttggtt tcaaggcaac
     7081 tggatagaag ccatgcggga ggtgaagcaa acccttgcaa aacatcccag gtacggagga
     7141 acaaatgata caggaaaaat taactttacg aagccaggaa taggttcaga cccagaagtg
     7201 acatacatgt ggactaactg cagaggagaa tttctctact gtaatatgac ttggttcctc
     7261 aattgggtag aaaataagac gaaccaaaca cacggcaact atgcgccatg ccatataagg
     7321 cagataatta acacctggca taaggtaggg acaaatgtat atttgcctcc tagggaaggg
     7381 gagttgacct gcaattcaac agtaaccagc ataattgcta acattgactc agatggaaat
     7441 cagaccaaca ttacctttag tgcagaagtg gcagaactgt accgattaga attgggggac
     7501 tacaaattga tagaagtaac accaattccg ttcgcaccta caaaagagaa aagatattcc
     7561 tcggctccag tgaggaacaa aagaggtgtg ttcgtgctag ggttcttggg ttttctcgca
     7621 gcagcaggtt ctgcaatggg cggcncgtcc ttgacgctgt cggctcagtc ccggacttta
     7681 ctggccggga tagtgcagca acagcaacag ctgttggacg tggtcaagag acaacaagaa
     7741 atgttgcgat tgaccgtctg gggaacgaaa aatctccagg caagagtcac tgctatcgag
     7801 aaatacttaa aggaccaggc acagctaaat tcatggggat gtgcgtttag gcaggtctgc
     7861 cacactactg taccatgggt aaatgactcc ttaacacctg actggaacaa tatgacatgg
     7921 caggaatggg aaaaacgagt ccactaccta gaggcaaata tcagtcaaag tttagaacag
     7981 gcacaaattc aacaagaaaa gaatatgtat gaactacaaa aactaaatag ctgggatgtc
     8041 tttggcaact ggtttgattt gacctcctgg atcaaatata ttcaatatgg agtttatata
     8101 gtagtaggaa taataggttt aagaatagcc atatatatag tgcaattgtt aagtagactt
     8161 agaaagggct ataggcctgt tttctcctcc ccccccggtt atctccaaca gatccatatc
     8221 cacacggaca ggggacagcc agccaacgaa gaaacagaag aagacgccgg agacgacagt
     8281 ggtttcggct tgtggccttg gccactaaac tacatacaat tcctgatcca cctactgact
     8341 cgcctcttga ccgggctata caacagctgc aggggcttac tatccaagaa ctccccgacc
     8401 cgccgactga tctcccagag tctaacagca atcagggact ggctgagact taaggcggcc
     8461 tacctgcaat atgggtgcga gtggatccaa gaagcgttcc gagcattcgc aaggactgcg
     8521 agagagacta ttgcgggcgc gtggaggggg ttatgtgaag cagcgcaacg catcgggagg
     8581 ggaatcctcg cagtcccaag aaggatcagg cagggagcag aaatcgccct cctgtgaggg
     8641 acagcagtat cagcagggag agttcatgaa caccccatgg agaaccccag cagcaatagg
     8701 gcagaaaaat tcatataagc agcaaaatat ggatgatgta gattctgatg atgatgacct
     8761 agtgggagtt cctgttatgc caagagtacc gctgagagaa atgacctata aactggcaat
     8821 agatatgtca cattttataa aagaaaaagg aggactggaa gggatatttt acagtaggga
     8881 gagacataga atcctagact tgttcctaga aaaggaggaa gggataatac cagattggca
     8941 gaattatact catgggccag gaacaaggta cccaatgtac ttcgggtggc tgtggaaact
     9001 agtaccagta gacatctcac aagaggcaga ggaagtagag accaactgct tagtacaccc
     9061 agcacaaaca agcagatatg atgacgagca tggggagaca ctagtttggc ggtttgaccc
     9121 catgctggcc tatagttaca aggccttcat tctgcaccca gaagaatttg ggcacaagtc
     9181 aggattgcca gagaaagagt ggaaggcaaa actgaaagca agagggatac catatagtga
     9241 ataacaggaa caaccatact tggtcagggc aggaaatagc tactaagaac agctgagact
     9301 gcagggactt tccagaaggg gctgtaacca agggagggac atgggaggag ctggtgtcat
     9361 attctctgta taaatgtacc cgcttcttgc attgtattc
//
LOCUS       ANAHETA      2608 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  Anabaena sp. heterocyst differentiation protein (hetA) gene,
            complete cds.
ACCESSION   M31722
KEYWORDS    differentiation protein; heterocyst differentiation protein.
SOURCE      Anabaena sp. (strain PCC 7120) DNA.
  ORGANISM  Anabaena sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Nostocaceae.
REFERENCE   1  (bases 54 to 2608)
  AUTHORS   Holland,D. and Wolk,C.P.
  TITLE     Identification and characterization of hetA, a gene that acts early
            in the process of morphological differentiation of heterocysts
  JOURNAL   J. Bacteriol. 172, 3131-3137 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2608)
  AUTHORS   Holland,D. and Wolk,C.P.
  JOURNAL   Unpublished (1990) MSU-DOE Plant Res. Lab., M.S.U., Lansing, MI
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Wolk,C.P., 30-JAN-1990.
FEATURES       from  to/span     description
    pept        198     2021     heterocyst differentiation protein (hetA)
    ORF        2077  >  2608     unidentified reading frame (ORF2)
    rpt        1973     2000     direct repeat
    rpt        2075     2158     direct repeat
BASE COUNT      793 a    510 c    513 g    792 t
ORIGIN      
        1 taaaaatcct atataggact catatttgat ttctgaaaaa ccaaattggc tagaagcttg
       61 ctatacagag ctttgtttct cagtatacgt agcaaaattc aagtacgatt cctatagatg
      121 aattactaca ctttagttaa aaattaggtt gtcacggttc aaaaaattgt cttagaaaat
      181 atacaacgca tttaaaaatg ccaaaatcac cacataaatt atttaaagct aatagcttct
      241 ggaaagagaa caatttaata ttgcgagaaa taaaacattt tcgcaaaata gctatattag
      301 ccgtaatatt ttcattttta gctgccagtt ttgaaggtgt tagtattggt tttttactgt
      361 catttttaca aaaattgact agtcctaatg atccaattca aacaggaatt agctgggttg
      421 acatgatttt ggctgctgat gcttggccga ttccccccat atacaggata tctctactaa
      481 ttttgttgag tacctggatg cgtgctacct tcaattattt tggcggagta tacactgaat
      541 cagcccaact taatttagca gatcgcttac ataagcaaat ttttgagcaa ttacaagctc
      601 tgaggttgag ttactttgct caaactcgtt cgggtgaact cattaatacg attactacag
      661 aaattgaaag aatcaaacag ggttttagtg ggttagcttt tgtgttaacc agaatcatga
      721 ccgtttgcgt ttactttgtt gtgatgtttt ctatatcatg gcaactttcg attatttctg
      781 tgctgatatt tttactgtta gcagtgggat tatctacgct gaataagcga gtcagagaaa
      841 ctagctttgg catttctcat gctaatgctc aatttacggc tgtagctgta gagttcatca
      901 atgggattcg gacaattcaa gcttttggga cacaagaatt tgaacggcaa cgtttctata
      961 aagcgagtac taatcaactg aatgcagcta tcaaagttgt tttagcttgg acactagtaa
     1021 aacctatagc tgaaggtata gctactacgg ttttaattag tttgattgtg atttccttcg
     1081 cgacttttac attaccagtt gcctcattat tgacattctt ctttgtttta gtcagggtga
     1141 ttcctaatat tcaagatatt aatggtacgg ttgcttttct gagtacttta caagggtcat
     1201 cagaaaacat taaaaatatt ttgcagacga ataataaacc ttacttgaaa aatggcaaac
     1261 ttcatttcca agggttgaag cgttcaattg atttagtatc cgtagatttt ggttatactg
     1321 ctgataattt agtgttgaat aatatcactc tgacaattga gcgtggtaaa acaacagctc
     1381 tggtgggtgc atcgggcgct ggtaaaacca cattagctga tttaattccc cgattttacg
     1441 atccgacaga gggacagatt ttagtagatg ggcttgatgt acagtacttt gaaatcaatt
     1501 ccctccgccg caaaatggct gtagttagtc aagatacatt tattttcaac acttctatta
     1561 gagacaatat cgcctacggt acatctgggg cgagtgaagc ggaaattaga gaagtagcgc
     1621 ggctagcaaa tgcgttgcaa tttatcgaag aaatgcccga agggtttgat actaagttag
     1681 gcgatcgcgg tgtccgttta tctggaggac agagacaacg gattgcgatc gctcgtgcat
     1741 tactccgaga tcccgaaatc ctcattcttg acgaagccac cagcgcccta gattcagtct
     1801 ccgagcgatt aattcaggag tctatagaaa aactttccgt gggtagaaca gtaattgcga
     1861 tcgctcacag actctccaca attgccaaag cagataaggt tgtggtgatg gaacaagggc
     1921 gaattgttga gcagggaaat tatcaagaac ttctagaaca acgcggaaag ctctggaaat
     1981 atcaccagat gcaacacgaa tcaggacaga ctaattcgta atatcaattc aaaattcaaa
     2041 attcaaaatt caaaattagg gaagccgagc agaatcatgg ttttggggta tgtatctgtc
     2101 ccattctttt ttcaaatcgg tataactccc caatccccaa tccccaatct ccagtcccca
     2161 atccccaatc cccaatcccc aatccccaat ccccagtccc caatccccaa tcccatgaaa
     2221 atttccgtca tcatctcgaa ttacaactat gctcgttatc tttctagagc aatcaactct
     2281 gttctcgctc aaactcactc agacattgaa atcgttatcg tagatgatgg ttctacagat
     2341 aacagccgtg atgttattac ccaactgcaa gaacaagcac cggataaaat caagcccatc
     2401 tttcaagcaa atcaaggaca gggaggcgct ttcaatgcgg ggtttgcggc ggcgactggc
     2461 gaagtcgtag cttttcttga tgcagacgat gtgtggaaac ctcataaatt acagcgtatt
     2521 gttgaggtat ttcagacatc ggatgtagtt ggtgtcatgc accatttgga tatcatagat
     2581 ggcaatgaca aaacaattga tcaagctt
//
LOCUS       BOVTRLA        85 bp ss-tRNA            RNA       18-JUL-1990
DEFINITION  Bovine amber suppressor Leu-tRNA-cag.
ACCESSION   M35066
KEYWORDS    transfer RNA-Leu.
SOURCE      Calf liver tRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 85)
  AUTHORS   Valle,R.P.C., Morch,M.-D. and Haenni,A.-L.
  TITLE     Novel amber suppressor tRNAs of mammalian origin
  JOURNAL   EMBO J. 6, 3049-3055 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    tRNA          1       85     Leu-tRNA
    anticdn      35       37     Leu-tRNA anticodon cag
    modified      6        6     m2g
    modified     10       10     m2g
    modified     12       12     ac4c
    modified     17       17     gm
    modified     19       19     d
    modified     21       21     p
    modified     27       27     m22g
    modified     33       33     um
    modified     38       38     m1g
    modified     39       39     p
    modified     45       45     um
    modified     47       47     p
    modified     50       50     m3c
    modified     57       57     m5c
    modified     64       64     p
    modified     67       67     m1a
BASE COUNT       14 a     25 c     28 g     18 t
ORIGIN      5' end of mature tRNA.
        1 gtcaggatgg ccgagcggtc taaggcgctg cgttcaggtc gcagtctccc tggaggcgtg
       61 ggttcgaatc ccacttctga cacca
//
LOCUS       BOVTRLB        86 bp ss-tRNA            RNA       18-JUL-1990
DEFINITION  Bovine amber suppressor Leu-tRNA-caa.
ACCESSION   M36600
KEYWORDS    transfer RNA-Leu.
SOURCE      Calf liver tRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 86)
  AUTHORS   Valle,R.P.C., Morch,M.-D. and Haenni,A.-L.
  TITLE     Novel amber suppressor tRNAs of mammalian origin
  JOURNAL   EMBO J. 6, 3049-3055 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    tRNA          1       86     Leu-tRNA
    anticdn      35       37     Leu-tRNA anticodon caa
    modified      6        6     m2g
    modified     10       10     m2g
    modified     12       12     ac4c
    modified     19       19     d
    modified     21       21     p
    modified     27       27     m22g
    modified     33       33     um
    modified     38       38     m1g
    modified     39       39     p
    modified     45       45     um
    modified     47       47     p
    modified     50       50     m3c
    modified     58       58     m5c
    modified     65       65     p
    modified     68       68     m1a
BASE COUNT       16 a     24 c     28 g     18 t
ORIGIN      5' end of mature tRNA.
        1 gtcaggatgg ccgagcggtc taaggcgctg cgttcaagtc gcagtctcgc atggaggcgt
       61 gggttcgaat cccacttctg acacca
//
LOCUS       BRPMBOI        62 bp ds-DNA             INV       18-JUL-1990
DEFINITION  B.malayi MboI repeat consensus sequence DNA.
ACCESSION   M34369
KEYWORDS    .
SOURCE      B.malayi (strain sub-periodic) DNA.
  ORGANISM  Brugia malayi
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Gnathostomatidae.
REFERENCE   1  (bases 1 to 62)
  AUTHORS   Natarajan,S., Werner,C., Cameron,M. and Rajan,T.V.
  TITLE     Isolation and characterization of a repetitive DNA element from the
            genome of the human filarial parasite, Brugia malayi
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.Natarajan, 18-MAY-1990.
            
                            author address: S.Natarajan
                            University of Connecticut
                            Dept. of Pathology
                            Farmington, CT 06032
BASE COUNT       24 a     12 c      4 g     22 t
ORIGIN      
        1 ccatttctct acagatataa caatatcact agaagacatt ttgattaatt cattaactca
       61 ta
//
LOCUS       DUKARGIN     1568 bp ss-mRNA            VRT       18-JUL-1990
DEFINITION  Duck argininosuccinate lyase/delta 2-crystallin mRNA, complete cds.
ACCESSION   M35132
KEYWORDS    argininosuccinate lyase/delta 2-crystallin.
SOURCE      Duck 14-day old embryo lens, cDNA to mRNA, clone DC5-1.
  ORGANISM  Anas platyrhynchos
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Anseriformes; Anatidae.
REFERENCE   1  (bases 1 to 1568)
  AUTHORS   Wistow,G. and Piatigorsky,J.
  TITLE     Gene conversion and splice-site slippage in the argininosuccinate
            lyase/delta-crystallins of the duck lens. Members of an enzyme
            superfamily
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.J.Wistow, 26-MAR-1990.
FEATURES       from  to/span     description
    pept         74     1480     argininosuccinate lyase/delta 2-crystallin
    signal     1538     1543     poly-A signal
    site         86       91     6 inserted bases at splice site slip
BASE COUNT      389 a    398 c    439 g    342 t
ORIGIN      
        1 ccggggctgc actgcggaca cggttgcacc aggtgctggg atcgccgcac acgcgagcaa
       61 aacgtcctct gagatggcat ctgaggcgag gggagataaa ctttggggag gaagattcag
      121 tggaagcaca gatcccatca tggagaagct caactcttcc attgcctatg accagaggct
      181 gtctgaagtt gatatccagg ggagcatggc ttatgccaaa gctctggaga aggctgggat
      241 cctaactaag actgagctgg agaagatcct gagtggcctg gaaaagatct ctgaggaatg
      301 gtctaaggga gtctttgtgg tgaaacagag cgatgaggat atccacactg ccaacgaacg
      361 cagactgaag gagctgattg gtgatatcgc tggaaagctg cacaccggaa gaagcaggaa
      421 tgatcaggtt gtgactgact tgaagctgtt catgaagaat tccctctcta tcatctccac
      481 tcacctgctg cagctcatca agaccctggt ggaacgtgct gccatagaaa ttgatgttat
      541 cttgcctggc tacacccacc tgcagaaggc ccagcccatc agatggagcc agttcttgct
      601 cagccatgct gttgccctga cccgtgactc tgagcgcctg ggagaggtga agaagaggat
      661 caacgtcctg cccctgggaa gcggtgctct ggctggcaac cccctggaca ttgacagaga
      721 gatgctgcgt agcgagctgg agtttgcttc catcagcctg aacagcatgg atgccattag
      781 tgagagagac tttgtggtgg aattcctctt ccttgccacc ctgctgatga tccaccttag
      841 caagatggct gaagatctca taatctacag caccagcgag tttggcttcc tgaccctttc
      901 tgatgccttc agcactggca gcagcctgat gcctcagaag aagaaccctg atagcctgga
      961 actgatccgc agcaaggctg gacgtgtgtt tggacggctg gcttcaattc tcatggttct
     1021 caaaggactt ccaagcacct acaacaagga cctgcaggag gacaaggagg ctgtctttga
     1081 tgttgtggac accctgactg ctgtgctcca ggttgccact ggggtgatct ctaccctcca
     1141 gatcagcaag gagaacatgg agaaggctct gacccctgag atgctggcta ctgacctggc
     1201 tctctacttg gttcgcaaag gagtgccgtt cagacaagcc cacactgctt cggggaaggc
     1261 cgtccacctt gccgagacca aaggcatcac catcaataaa ctcagcctgg aggacctgaa
     1321 gagcatcagc ccccagttca gcagcgatgt ctcccaggtc ttcaactttg tcaacagcgt
     1381 ggagcaatac actgccctgg cgggtaccgc caagagcagc gtgactaccc agatcgagca
     1441 gctgagggag ctgatgaaga agcagaagga acaagcttag agtgtgggga gatttcccgt
     1501 ggatgtagtg ttttgcttat cacactaatg cagagttaat aaacactgtg gtgtattgta
     1561 gttcactc
//
LOCUS       DUKARGINX    1562 bp ss-mRNA            VRT       18-JUL-1990
DEFINITION  Duck argininosuccinate lyase/delta 1-crystallin mRNA, complete cds.
ACCESSION   M35133
KEYWORDS    argininosuccinate lyase/delta 1-crystallin.
SOURCE      Duck 14-day old embryo lens, cDNA to mRNA, clone DC6-2.
  ORGANISM  Anas platyrhynchos
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Anseriformes; Anatidae.
REFERENCE   1  (bases 1 to 1562)
  AUTHORS   Wistow,G. and Piatigorsky,J.
  TITLE     Gene conversion and splice-site slippage in the argininosuccinate
            lyase/delta-crystallins of the duck lens. Members of an enzyme
            superfamily
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.J.Wistow, 26-MAR-1990.
FEATURES       from  to/span     description
    pept         71     1471     argininosuccinate lyase/delta 1-crystallin
    signal     1529     1534     poly-A signal
BASE COUNT      393 a    400 c    427 g    342 t
ORIGIN      
        1 gggctccact gcagacacgg ttgtaccagg tgccgggatc gccgcacacc cgagcaaaac
       61 gtcgtctgag atggcatccg agggggataa acttatggga ggaaggtttg tcggaagcac
      121 ggatcccatc atgcagatgc tcagcacttc catatccact gaacagaggc tgtctgaggt
      181 tgatatccag gcaagcattg cttatgccaa agccctggag aaggctggga tcctaactaa
      241 gactgaactg gagaagatcc tgagtggcct ggaaaagatc tctgaggaat tgtctaaggg
      301 agtcattgta gtgacccaaa gcgatgagga tatccaaact gccaatgaac gcagactgaa
      361 ggagctgatt ggtgatattg ctggaaagct gcacactgga agaagcagga acgaacaggt
      421 tgtgactgac ttgaagctgt tcatgaagaa ttccctctct atcatctcca ctcacctgct
      481 gcagctcatc aagaccctgg tggaacgtgc tgccatagaa attgatgtta tcttgcctgg
      541 ctacacccac ctgcagaagg cccagcccat cagatggagc cagttcttgc tcagccatgc
      601 tgttgccctg acccgtgact ctgagcgcct gggagaggtg aagaagagga tcaacgtcct
      661 gcccctggga agcggtgctc tggctggcaa ccccctggac attgacagag agatgctgcg
      721 tagcgagctg gagtttgctt ccatcagcct gaacagcatg gatgccatta gtgagagaga
      781 ctttgtggtg gaattcctct ctgttgccac cctgctgctg atccacctta gcaagatggc
      841 tgaagatctc atcatctaca gcaccagcga gtttggcttc ctgacccttt ctgatgcctt
      901 cagcactggc agcagcctga tgcctcagaa gaagaaccct gatagcctgg aactgatccg
      961 cagcaaggct ggacgtgtgt ttggacggct ggcttcaatt ctcatggttc tcaaaggact
     1021 tccaagcacc tacaacaagg acctgcagga agacaaggag gctgtcattg atgttgtgga
     1081 caccctgact gctgtgctcc aggttgccac tggggtgatc tctaccctcc agatcagcaa
     1141 ggagaacatg gagaaggctc tgacccctga gatgctggct actgacctgg ctctctactt
     1201 ggttcgcaaa ggaatgccat tcagacaagc ccacactgct tcggggaagg ccgtccacct
     1261 tgccgagacc aaaggcattg ccatcaataa tctcaccctg gaggacctga agagcatcag
     1321 ccccctgttc agcagcgatg tgtcccaggt cttcaacttt gtcaacagcg tggagcagta
     1381 cactgccctg ggcggtaccg ccaagagcag cgtgactacc cagattgagc agctgaggga
     1441 gctgatgaag aagcagaagg aacaagctta gagtgtgggg agatttcccg tggatgcagc
     1501 attgtgctta tcacactaat gcagagttaa taaacactgt ggtgtattgt agttcactca
     1561 aa
//
LOCUS       VACFGENE     6185 bp ds-DNA             VRL       18-JUL-1990
DEFINITION  Vaccinia virus F1L, F2L, F3L, F4L, F5L, F6L, F7L, F8L, and F9L
            mRNAs, complete cds.
ACCESSION   M34368
KEYWORDS    .
SOURCE      Vaccinia virus (strain WR), cDNA to viral mRNA, clone HindIII F.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 6185)
  AUTHORS   Roseman,N.A. and Slabaugh,M.B.
  TITLE     The vaccinia virus HindIII D fragment: Nucleotide sequence of the
            left 6.2-kb
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.A.Roseman, 17-MAY-1990.
            
               Author address: N.A.Roseman
               Oregon State University
               Biochemistry and Biophysics
               Corvallis, Oregon 97331
FEATURES       from  to/span     description
    pept        119      757     F9L protein
    pept        817     1014     F8L protein
    pept       1162     1404     F7L protein
    pept       1420     1644     F6L protein
    pept       1674     2642     F5L protein
    pept       2674     3633     F4L protein
    pept       3644     5086     F3L protein
    pept       5110     5553     F2L protein
    pept       5565  >  6185     F1L protein
BASE COUNT     2175 a    946 c   1138 g   1926 t
ORIGIN      
        1 gaattcatca tgtgtaccaa aacagactgt gataaatata gattaaaggt ttccattctt
       61 cacccaatta gtttcttgga aaaatttatt atgagagaca ttttctcaga ctggataaat
      121 ggcggaaact aaagagttta aaactttgta taatcttttt atagatagtt atttacaaaa
      181 attagctcaa cattctatcc ctactaatgt cacttgtgct attcatatag gagaggttat
      241 aggacagttt aaaaattgcg cgctccgaat aactaacaaa tgcatgagta attctcgact
      301 tagtttcaca ctcatggttg aatcatttat tgaagtgatt tcattgcttc cggaaaagga
      361 tagaagacgt atcgctgaag aaataggaat agatctagac gatgtaccta gtgcggtatc
      421 caagctagaa aagaactgta atgcgtatgc ggaggttaat aatattatag atatacagaa
      481 attagatatc ggagaatgtt cggctccgcc cggtcaacat atgcttttac agatagttaa
      541 tacaggatcc gcggaacgaa attgtggttt acagacaatt gttaagtcct taaataaaat
      601 atacgttcca cctattatcg aaaaccgatt gccgtattac gatccgtggt ttctagtggg
      661 tgtagcaatt attctagtta tttttactgt agctatttgt tctattagac gaaatctggc
      721 tcttaaatac agatacggaa cgtttttata cgtttaatta ataaaaaaat ttaattacaa
      781 ggtatataat agtactccat ctacgcaatc gcgataatgg agggatctaa acgcaaacac
      841 gacagtcggc gactacaaca agaacaggag cagcttcgtc cacgtacacc gccatcatat
      901 gaagaaattg caaaatatgg acactcattt aacgtgaaaa gatttacgaa tgaagaaatg
      961 tgtcttaaga atgattatcc acgaattata tcatataatc ctccaccaaa atagagtata
     1021 tatatatcat catttcatga tgtatactac tgacatagtt tcaatgtgaa cttttcactt
     1081 tcttgccggt tatgaagaat attttttatt ttaatggtca ttactaatcg tatattataa
     1141 ttgaaaatgg attagtttaa tatgacgctc gtcatgggat cctgctgtgg tagattctgt
     1201 gacgctaaga ataagaataa gaaggaagat gtagaagagg gaagagaagg atgttacaat
     1261 tataagaacc ttaatgatct ggatgaatcc gaagcacgtg tagaatttgg accattatat
     1321 atgataaatg aagaaaaatc agacataaat acattggata taaaaagaag atatagacac
     1381 acgatagagt ctgtatattt ctaaaagttt ttataaaaaa tgagtaaaat actcacgttt
     1441 gttaaaaata agataattga cttgattaat aatgaccaaa ttaaatattc tagagttata
     1501 atgatagaag agtccgatag tcttttaccg gttgatgagg tgcatgctaa ccacggattt
     1561 gactgtgtgg agatgataga tgaaaatata agcaatgaga atatcgaaca gtataaaacc
     1621 gaatcttttt ttacaataaa ttgaaatcaa aacatttatt aaaccgcatc aagatgggta
     1681 cgaacggcgt tagagtattt gtcattttat atttgttggc tgtatgcgga tgtatcgaat
     1741 acgacgtaga cgataatgta catatttgta cccacactaa cgtgtcacat attaatcaca
     1801 ctagttggta ttataatgat aaggttatag cgctagccac cgaggataaa acttctggtt
     1861 atatatcatc attcataaaa cgcgttaata tctcattaac ttgtttaaat atatcgagtt
     1921 tgcggtacga agattctggt acatacaaag gagtatcgca tctaaaagat ggagtcatcg
     1981 ttacaactac tatgaatata tctgtaaagg ctaatatcat tgacttgact ggtagagtgc
     2041 gttatctaac cagaaattat tgcgaagtta aaatacgatg cgaaataaca tctttcgcgc
     2101 ttaatggttc tactacacca ccacatatga tattaggaac agtagataaa tggaaatatc
     2161 ttccatttcc tacagatgat tatagatacg taggggaact gaaaagatat atatctggaa
     2221 acccatatcc aacagagtcg ctagcgttag aaattagctc gacgtttaat cggtttacta
     2281 tcgttaaaaa tttgaacgat gacgagtttt cttgttatct gttttcacaa aattatagtt
     2341 tccataaaat gttgaatgtg cgtaacattt gtgaatccga atgggaggcg ttaaataata
     2401 ataacgataa ttcatcctcc atgcccgctt cccacaacaa tctcgcaaac gatttatcta
     2461 gtatgatgtc acaattacaa aatgataatg atgataataa cgattattca gcacccatga
     2521 atgtcgataa tttaattatg atagtactaa taacaatgct atcaataata cttgtaatta
     2581 ttgtagtgat tgcggcgata tcgatgtaca aaagatccaa gtacaggcat atagataact
     2641 gaaaaaaaat ttattgttat tgttaattta gttatggaac ccatccttgc accaaatcca
     2701 aatagatttg ttattttccc aatccaatat tatgacatct ggaacatgta taaaaaggca
     2761 gaggcatcat tttggacagt ggaagaagta gatatatcta aagatatcaa tgattggaat
     2821 aaactaacac cagacgaaaa atattttata aaacatgtat tggcgttttt tgcagccagt
     2881 gacggaatag tgaatgaaaa tttggcggaa cgattttgta cagaagtaca gattaccgag
     2941 gctagatgtt tctacggatt tcagatggcc attgaaaaca ttcattcgga aatgtatagt
     3001 cttttgatcg atacttatgt taaagatagt aatgaaaaaa actatctctt taatgccata
     3061 gaaacgatgc cttgtgtaaa aaagaaggcc gattgggctc aaaagtggat acatgacagc
     3121 gccggttatg gagagagact tattgccttt gctgcagtag aaggaatctt cttttctgga
     3181 tcattcgctt ccatattttg gcttaaaaag cgtggcctaa tgcccggact cacgttttcc
     3241 aacgaactaa ttagtagaga cgagggtctg cactgcgatt tcgcatgttt gatgtttaaa
     3301 catttattgc atccaccgag tgaagaaacc gttagatcta ttataacaga tgcggtatcc
     3361 attgaacaag aatttcttac tgcggctctt ccagttaaac ttataggaat gaattgtgaa
     3421 atgatgaaaa catatataga attcgtcgcg gatagattga tttctgaatt gggatttaaa
     3481 aaaatttata atgttaccaa tccgtttgat ttcatggaaa atatatcatt ggaaggaaaa
     3541 actaattttt tcgaaaaacg tgtgggtgaa taccaaaaaa tgggagttat gtctcaagaa
     3601 gataatcatt tttctttaga tgttgacttt taaagaaaca taaatgccga tatttgttaa
     3661 tactgtgtac tgtaagaata tattagcatt gtctatgact aagaaattca aaacaattat
     3721 tgatgctata ggtggcaata taatagtcaa ttctacgata ttgaaaaagt tatctcctta
     3781 ctttcgcaca catttacgtc aaaaatacac gaaaaataaa gatccagtta cttgggtttg
     3841 tctagacctt gacattcaca gtttaacttc tatagttatt tactcgtata ctggaaaggt
     3901 atatatagat agtcataacg tcgtcaattt attacgtgct tctatattaa cctctgtaga
     3961 atttatcatc tacacttgta taaactttat cttacgagat tttagaaagg aatattgtgt
     4021 cgagtgttac atgatgggta tagaatacgg actatccaat ctcttatgtc atactaaaaa
     4081 ctttattgcc aaacactttt tggaactgga agatgacatc atagacaatt ttgattatct
     4141 atctatgaaa cttattctag aaagcgatga actaaatgtt ccagatgagg attatgtagt
     4201 tgattttgtc attaagtggt atataaagcg aagaaataaa ttaggaaatc tgctactcct
     4261 tatcaaaaat gtaatcaggt caaattatct ttctcccaga ggtataaata atgtaaaatg
     4321 gatactagac tgtaccaaaa tatttcattg tgataaacaa ccacgcaaat catacaagta
     4381 tccattcata gagtatccta tgaacatgga tcaaattata gatatattcc atatgtgtac
     4441 aagtactcat gttggagaag tagtatatct catcggtgga tggatgaaca atgaaataca
     4501 taacaatgct atagcggtaa attatatatc aaacaattgg attccaattc ctccgatgaa
     4561 tagccccaga ctgtatgcta gcgggatacc cgctaacaat aaattatacg tagtaggagg
     4621 tctaccaaat cccacatctg ttgagcgttg gttccacggg gatgctgctt gggttaatat
     4681 gccgagtctt ctgaaaccta gatgtaatcc agcagtggca tccataaaca atgttatata
     4741 cgtaatggga ggacattctg aaactgatac aactacagaa tatttgctac ccaatcatga
     4801 tcagtggcag tttggaccat ccacttatta tcctcattat aaatcatgcg cgttagtgtt
     4861 cggtagaagg ttattcttgg ttggtagaaa tgcggaattt tattgtgaat ccagcaatac
     4921 atggactctg atagatgatc ctatttatcc gagggataat ccagaattga tcatagtgga
     4981 taataaactg ctattgatag gaggatttta tcgtgaatcg tatatagata ctatagaagt
     5041 gtacaatcat cacacttatt catggaatat atgggatggt aaataatttt gaaataaaat
     5101 attagtttta tgttcaacat gaatattaac tcaccagtta gatttgttaa ggaaactaac
     5161 agagctaaat ctcctactag gcaatcacct tacgccgccg gatatgattt atatagcgct
     5221 tacgattata ctatccctcc aggagaacga cagttaatta agacagatat tagtatgtcc
     5281 atgcctaagt tctgctatgg tagaatagct cctaggtctg gtctgtccct aaaaggcatt
     5341 gatataggag gcggtgtaat agacgaagat tataggggaa acataggagt cattcttatt
     5401 aataatggaa aatgtacgtt taatgtaaat actggagata gaatagctca gctaatctat
     5461 caacgtatat attatccaga actggaagaa gtacaatctc tagatagtac aaatagagga
     5521 gatcaagggt ttggatcaac aggacttaga taataaacaa tagtatgttg tcgatgttta
     5581 tgtgtaataa tatcgtagat tatgtagatg atatagataa tggtatagta caggatatag
     5641 aagatgaggc tagcaataat gttgatcacg actatgtata tccacttcca gaaaatatgg
     5701 tatatagatt tgacaagtcc actaacatac tcgattatct atcaacggaa cgggaccatg
     5761 taatgatggc tgttcgatac tatatgagta aacaacgttt agacgacttg tatagacagt
     5821 tgcccacaaa gactagatca tatatagata ttatcaacat atattgtgat aaagttagta
     5881 atgattataa tagggacatg aatatcatgt atgatatggc atctacaaaa tcatttacag
     5941 tttatgacat aaataacgaa gttaatacta tactaatgga taacaagggg ttgggtgtaa
     6001 gattggcgac aatttcattt ataaccgaat tgggtagacg atgtatgaac ccagtagaaa
     6061 cgataaaaat gtttactcta ctatcgcata ctatatgcga tgattatttt gtagattata
     6121 taacggacat ttcaccacca gataatacca tccctaacac tagcacgcgt gaatatctaa
     6181 agctt
//
LOCUS       BSUCCCA      1172 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  B.subtilis 13 kD cytochrome c-550 (cccA) gene, complete cds.
ACCESSION   J05569
KEYWORDS    cytochrome c-550.
SOURCE      B.subtilis (strain w168) DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1172)
  AUTHORS   von Wachenfeldt,C. and Hederstedt,L.
  TITLE     Bacillus subtilis 13 kDa cytochrome c-550 encoded by cccA, consists
            of a membrane-anchor and a heme-domain
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.O.S.von Wachenfeldt, 12-JUN-1990.
FEATURES       from  to/span     description
    pept        542      904     cytochrome c-550 (cccA)
BASE COUNT      400 a    170 c    264 g    338 t
ORIGIN      223 min on B.subtilis map.
        1 cagctggaag atgtgcttga tacgttaact gatcgtgaag aaaatgtatt gcgtcttcga
       61 ttcggtcttg atgacggccg tacaagaaca ttagaagagg tcggcaaagt atttggagta
      121 acgagagagc gtattcgaca aatcgaagcc aaagcgttgc ggaaactaag acatcctagc
      181 agaagtaaac gtttgaaaga tttccttgaa taagatggaa cgggtcttga agatccgttc
      241 ttcttttttt aaaaagatat atggataata tgcctttatt ttactgaaaa atgatgtcat
      301 ttgcaaatga acattgtggt gaaaaatttc aaaatctaat tccatatttt ctattgtaag
      361 cgtatacaat acattataca atagaataaa aaggatatta gagatttagg catgtttcta
      421 ttcaattttg tataaagtgt gaataaaaaa cttttgtata gcaatccatt tactttttgt
      481 aaaaataagt tagaattaga agtgtttaca tagggggaag gatcaaaaag ggggaaggga
      541 aatgaaatgg aacccgctta ttccattttt gctgatcgct gttttaggaa ttggtctaac
      601 tttcttttta tcagtaaaag gacttgatga ctctcgggag attgcgagcg gaggagaaag
      661 caaatctgct gaaaagaaag atgcaaacgc ttcaccagaa gaaatttaca aggcaaactg
      721 catcgcatgc cacggtgaaa attatgaagg tgtttcgggt cctagcttaa aaggtgtagg
      781 ggataagaaa gatgtcgctg aaattaaaac aaagatcgaa aaaggcggta atggaatgcc
      841 ttccggtctt gttccggccg ataagctgga tgatatggca gagtgggtgt caaaaattaa
      901 ataaaagaac tatttttctc tttatacatt cacaggccgc cgagatgttc tcgacagcct
      961 tttatttttg actgctgcgt gcagacccgc atcaaaagaa ataatttcaa aatcatttgc
     1021 ttcagcttgg attccgggct gattttttat atgatagtaa aggaaaaaat gattgagaaa
     1081 agttgtgatg acaagtgaac gaattaaaat tatctaaacg attgcaaaca gtagcggagt
     1141 acataccgaa tggagcggtg atggccgata tc
//
LOCUS       ECOGACAR     1630 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  E.coli glutamate and aspartate carrier (gltP) gene, complete cds.
ACCESSION   M32488
KEYWORDS    glutamate and aspartate carrier.
SOURCE      E.coli (strain K12-BK9MDG) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1630)
  AUTHORS   Wallace,B., Yang,Y.-J., Hong,J. and Lum,D.
  TITLE     Cloning and sequencing of a gene encoding a glutamate and aspartate
            carrier of E.coli K-12
  JOURNAL   J. Bacteriol. 172, 3214-3220 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            Y. Hong, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        151     1338     glutamate and aspartate carrier (gltP)
BASE COUNT      330 a    391 c    472 g    437 t
ORIGIN      
        1 caagggttgc gcaacatacc gcgcaaatga tactgatcat aagcgttaaa aaaatctaca
       61 aaccaacgca acacaattca tgccctggca gtatgtcacg ttctcgcgtt tctgaacggg
      121 gaacggcgct ccattgagga agtcattcat atgaaaaata taaatttcag cctggcctgg
      181 cagattctgt ttgctatggt gctgggcatt ctcctgggaa gctacctgca ctaccatagc
      241 gacagccgcg actggctggt cgtcaatttg ctctctccgg cgggtgatat cttcatccat
      301 ctgattaaaa tgattgttgt gccgattgtg atctccacgc tggtggtggg tatcgcgggt
      361 gttggtgatg ccaaacagct cgggcgtatt ggcgcgaaaa ccattatcta cttcgaggtg
      421 atcaccaccg tcgccatcat tttgggatca ctctggcgaa ccgtcttcca gcccggtgcc
      481 ggggtggata tgtcgcagtt ggcgaccgtc gatatctcga aatatcagag cactacggaa
      541 gcggtacaaa gcagttccca cggcattatg ggcacgattt tgtcgctggt gccgacgaac
      601 attgtggcgt cgatggcgaa aggcgaaatg ctgccgatca tctttttctc ggtgctgttt
      661 ggtctggggc tttcttccct gcccgcgacg catcgtgaac cgctggtgac cgtgttccgc
      721 tccatctctg aaaccatgtt taaagtgact cacatggtga tgcgttatgc accggtgggt
      781 gtgtttgcgc tgattgcggt gacggtggct aactttggtt tctcgtctct gtggccactg
      841 gcgaaactgg tgctgctggt gcatttcgcc attctgttct tcgcgctggt agtgctggga
      901 attgtggcgc gcctgtgcgg gttaagcgtc tggatcctga ttcgtattct gaaagatgag
      961 ctgattctgg cgtactccac tgccagctct gaaagcgtcg tgccgcgaca tatgagaaga
     1021 tggaagccta cggagcaccg gtgtcgatca ccagtttcgt ggtgccgacc ggttaacctc
     1081 ttttaccttg atggttcgac gctgtatcaa agtattgccg ctatcttcat cgcgcagttg
     1141 tatggcattg acctgtccat ctggcaggaa atcattctgg tcgtgacgct gatggtgacc
     1201 tcgaaaggga ttgctggcgt gcctggcgtg tcgtttgtgg tgttgctggc aacgctgggt
     1261 agcgtaggta tccgctggaa ggtctggcgt ttattgctgg tgttgaccgt atcctcgaca
     1321 tggcgcgtac tgccgtgaac gtggtgggta atgcgctggc ggtgctggtg attgccaagt
     1381 gggaacacaa atttgaccgt aagaaagcgc tggcttatga gcgtgaagtg ctgggcaaat
     1441 ttgataaaac tgcggatcaa taattgaaga ttgccgggga tatccacccg gcaatgtgtg
     1501 aatgcctgat gcgacgcttg ccgcgtctta tcaggcctac gccagacagc gcaatagcct
     1561 gatttagcgt gattttgtag gtcggataag gcgtttatgc cgcatccgac atcaacgcct
     1621 gatgcgacgc
//
LOCUS       ECOLYSU      3034 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  E.coli lysyl-tRNA synthetase (lysU) gene, complete cds.
ACCESSION   M30630
KEYWORDS    lysyl-tRNA synthetase.
SOURCE      E.coli (strain K12) isolate CS520 DNA, clone pLC4-5.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3034)
  AUTHORS   Clark,R.L. and Neidhardt,F.C.
  TITLE     Roles of the two lysyl-tRNA synthetases of Escherichia coli:
            Analysis of nucleotide sequences and mutant behavior
  JOURNAL   J. Bacteriol. 172, 3237-3243 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.L.Clark, 12-DEC-1989.
FEATURES       from  to/span     description
    pept        316     1821     lysyl-tRNA synthetase (lysU) (E.C. 6.1.1.6)
    signal      194      199     -35 signal
    signal      216      221     -10 signal
BASE COUNT      850 a    654 c    665 g    865 t
ORIGIN      93.55 min on K12 map.
        1 cgatcatcgt cgtattggcc tttgctaccc gttttctgtt tagcacgccg acgaatatga
       61 tacaggagag caacgattaa tgtcgtaaaa acaattggtt atgaataaat taactttcag
      121 tttataacac taacaaatcg agattaataa gtcatttaaa ccattttgat ggttatttat
      181 tagtgatatc aacttgaggt aagcgttagt ttcgataaga taaactgagt tactaatagt
      241 cgaggcagat aatacagtgt accgatctgt ctctttatct acgctaaatt gaaagctgga
      301 tttagaggaa ccaaaatgtc tgaacaagaa acacggggag ccaatgaggc tattgatttt
      361 aacgatgaac tgagaaatcg ccgcgaaaaa ctggcggcac tacgtcagca aggtgtggcg
      421 tttcccaatg attttcgccg cgaccatacc tctgaccagt tgcacgaaga gtttgatgcg
      481 aaggataacc aggaactgga atccttaaac attgaagtct cggttgctgg ccgaatgatg
      541 acccgtcgta tcatggggaa agcctccttt gtaacgttgc aggatgtcgg tggccgtatt
      601 caactgtacg ttgcaagaga tagcctgcca gaaggtgttt ataacgatca gtttaaaaaa
      661 tgggatctgg gtgacattat cggtgccggt acgctgttta agacgcaaac gggtgagctt
      721 tccattcact gtactgagct gcgcctgctg actaaagcac tacgtccttt accagataaa
      781 ttccatggtc tgcaggatca ggaagtccgt tatcgtcaac gttatctgga cctcatcgct
      841 aacgataaat cccgtcaaac gtttgttgtc cgttcaaaaa ttctggccgc tatccgtcaa
      901 ttcatggtcg cgcgcggctt tatggaagta gaaaccccga tgatgcaggt aattccaggt
      961 ggggcatctg ctcgcccgtt tattacccat cataatgctc tggatttaga tatgtatgct
     1021 cgtatcgcgc cggagctgta tctgaaacgt ctggttgtag gcggttttga acgggtattc
     1081 gaacacgtaa ctcgtaatga aggtattcgt cgccataatc ctgagttcac aatgatggaa
     1141 ctctacatgg cgtatgcgga ttaccacgat ttgattgaac tgacagagtc actgttccgc
     1201 accctggcac aagaggttct gggtaccact aaagtcactt atggcgagca tgtgtttgat
     1261 ttcggcaaac cgtttgaaaa actcaccatg cgcgaagcaa tcaaaaaata tcgtccagaa
     1321 accgatatgg ccgacctgga taattttgat gctcgtaaag cattagctga atctatcggt
     1381 attacggtag agaaaagctg ggggttggga cgtagtgtca cagagatctt tgatgaagtg
     1441 gtagaaggac atgtgattca gccatccttt attacggaat atccggcaga agtgtccccg
     1501 ctggcacgtc gtaatgatgt taacccggaa atcaccgacc gttttgaatt cttcatcggt
     1561 ggtcgtgaaa tcggtaatgg ttttagcgaa ttaaacgacg cagaagatca ggctgaacgt
     1621 ttccaggaac aggttaatgc taaagctgca ggtgacgacg aagccatgtt ctatgacgaa
     1681 gattacgtga ctgcgctgga atatggtctg ccgccaaccg ctggtctggg tattggtatc
     1741 gaccgaatga ttatgctgtt tactaacagc catactattc gcgacgttat tctcttcccg
     1801 gcgatgcgcc cacagaaata aatttcactt taatgaacga agcagtcagg cgactgcttc
     1861 gttcatcgtt ccgcaaatcc agttttgagg agagacgctt cgtttcacga taaccagcgg
     1921 gtccactcaa ctacagctcc taaaagagag aaaaaagtaa ctcgacaggc agactcattg
     1981 gccaggttat tccgaccagg aatgcactaa gtaaacgaat atgtgttttc taatgataga
     2041 gataaaccag agtaatcgca ggttattcac gccgaatacg ttcacacatc acataaaaaa
     2101 ttaatatttt agagaatagt gccatttttt atgttcttct tatccttcct ggctacttgt
     2161 aaaactgacg gaaatagtca tcaactgtca taatgttgag gtcaaaagca atatccggta
     2221 agtctaagca tttgagcctg attaaatctt cgatttcact tgcggctaac gatgtcgtta
     2281 ttgcgaagga gttttcattc aattgtcgtg gtatcccata ttcatcagcc acgatagagg
     2341 agaaattcat atcatacata catcaatgtt cttaatctta agaagagtgg ggaataagat
     2401 gatggataat caaaagaaac aacataagta ttgaacttgt ttttacctcc ataatatgac
     2461 tcctatcaat attaaacgct ctggggacgc aaaacttatt aaccccgatg atgattctca
     2521 tataggagat attctttttt ttgctcagta aaggggatgg acttgggaca aaaaaacgcg
     2581 ctctctgccc attctttgtg catcccccta cccttcagat gctctccttg catcacatcg
     2641 cggagagaat gaaaaatatc gtgactctgt actctcccat caactgcgaa tatcatcata
     2701 ctcccgcgtt ttatcaaatt catgtttcgc aaatgggcat aatgggataa tttttcgttt
     2761 ttcccgacgc attttttcca cgactttcgc aaccagctgt taccaatccc cgttgccttt
     2821 caggctttca tcgacatcgt atgttcgata actcgctaaa tctccgtcgg tcggcacaaa
     2881 gacaatttca gcgatttgct tgccttgttt gtcattaatg taaaatttat tgtggccttc
     2941 gcgtatttcc attttaccct cgcttattta tgacggtatt tcagcgcgcc gctcgggcac
     3001 gtatcatcac tttaaccaca gtggcgacgt cgac
//
LOCUS       PSEBCD       4007 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  P.aeruginosa pilin biogenesis protein (pilB, pilC and pilD) genes,
            complete cds.
ACCESSION   M32066
KEYWORDS    pilin biogenesis protein.
SOURCE      P.aeruginosa (strain PAK) DNA.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 4007)
  AUTHORS   Nunn,D., Bergman,S. and Lory,S.
  TITLE     Products of three accessory genes, pilB, pilC and pilD are required
            for biogenesis of Pseudomonas aeruginosa pili
  JOURNAL   J. Bacteriol. 172, 2911-2919 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.N.Nunn, 12-FEB-1990.
FEATURES       from  to/span     description
    pept        199     1899     pilin biogenesis protein (pilB)
    pept       1903     3123     pilin biogenesis protein (pilC)
    pept       3127     3999     pilin biogenesis protein (pilD)
    mRNA          8  <     1 (c) pilA mRNA
BASE COUNT      882 a   1244 c   1110 g    771 t
ORIGIN      
        1 cgcctaaccc taccaagcac ttaccatgcc aaataccgtt cacatagcga gcaaggccgt
       61 cttatgccct tcccgaggag actcaaactt aaaacggaca ggatgtgaca ctttttgaca
      121 gccggactcc cccgatttgg cagaccctgt tatctgcgtt ataaggcact tcaatagccc
      181 tctgcgattt ccttctccat gaacgacagc atccaactga gcggtctgtc ccgacagctc
      241 gtccaagcca atctactcga cgaaaagacc gccctgcagg cccagacgca ggcgcagcgc
      301 aacaagctgt cgctggtgac gcacctggtg cagaacaagc tggtaagcgg cctggccctg
      361 gccgaattgt ccgccgagca gttcggcatc gcctattgcg acctgaacag cctggacagg
      421 gaaagctttc cgcgggacgc catcagcgaa aagctggtcc gccagcaccg ggtcattccc
      481 ctgtggcggc ggggcaacaa gttgttcgta ggcatctccg acgccgccaa ccaccaggcc
      541 atcaacgacg tccagttcag cactggcctg acgaccgagg ccattctggt cgaggacgac
      601 aagctcggcc ttgccatcga caagctattc gagaatgcaa ccgacggctt ggcaggtctc
      661 gatgacgtgg acctggaagg actggatgta ggcgtcaagg aaacgtccgg acaagaagat
      721 actggcgcgg aggcagacga cgctcccgtc gtccgcttcg tcaacaagat gctgctggac
      781 gctatcaaag gcgggtcttc cgacttgcac ttcgagccct acgagaagat ctaccgggtg
      841 cgcttccgta ccgacggcat gcttcacgaa gtggccaagc cgccgatcca gttggccagt
      901 cgtatctctg ctcgtctcaa ggtaatggcc ggcctggata tctccgaacg acgcaagccg
      961 caggacggac ggatcaagat gcgcgtgtcg aagaccaagt ccatcgactt ccgcgtcaac
     1021 accctgccga ccctgtgggg cgagaagatc gtgatgcgga tcctcgactc ctccagcgcg
     1081 cagatgggca tcgacgccct gggctacgag gaggaccaga aggaactcta cctggccgca
     1141 ctcaagcagc cgcagggcat gatcctggtc accggcccca ccggctcggg caagacggta
     1201 tcgctgtaca ccggcctgaa catcctcaat accaccgaca tcaacatttc caccgccgaa
     1261 gacccggtgg agatcaacct ggaaggcatc aaccaggtca acgtcaatcc gcgccagggc
     1321 atggacttct cccaggcgct gcgcgccttc ctgcgccagg acccggacgt gatcatggtc
     1381 ggcgagatcc gcgacctgga gaccgccgag atcgccatca aggcggcgca gaccgggcat
     1441 atggtgatgt ccaccctgca caccaacagc gccgccgaga ccctgacccg cctgctgaac
     1501 atgggcgtac cggcgttcaa cctggcgacc tcggtgaacc tgatcatcgc ccagcgcctt
     1561 gcgcgaaaac tctgttcgca ctgcaagaaa gagcacgacg taccgaagga aaccctgctt
     1621 cacgaaggtt tcccagaaga gttgatcggc accttcaagc tgtattcacc agtgggctgc
     1681 gaccattgca agaacggtta caagggccgt gtcggtattt atgaagtagt taaaaacacc
     1741 ccggccctgc agcggattat catggaggaa ggcaactcca tcgagatcgc cgagcaagcc
     1801 cgcaaagaag gcttcaacga tctgcgcacc tcaggcctgc tgaaagccat gcaggggatc
     1861 accagcctgg aggaagtcaa ccgcgtgacc aaggattaat ccatggcgga caaagcgtta
     1921 aaaaccagcg ttttcatctg ggagggcacc gacaagaaag gcgccaaggt caagggcgaa
     1981 ctgaccgggc agaatcccat gctggtgaaa gcccatctgc gcaagcaagg catcaatccg
     2041 ctcaaggtac gcaagaaagg tatctccctg ctgggcgcag gcaagaaagt gaaacccatg
     2101 gacatcgccc tgttcacccg gcagatggcg accatgatgg gcgctggcgt tcccctcctg
     2161 caatcgttcg acatcatcgg cgagggcttc gacaacccca acatgcgcaa gcttgtggat
     2221 gaaatcaaac aggaagtttc ctcaggtaac agcctagcca actccttgag aaaaaagccc
     2281 cagtattttg acgagcttta ttgcaacctg gtagatgcag gggaacagtc tggcgccttg
     2341 gaaaaccttc tcgatcgggt ggcaacctat aaagaaaaga cggaatcact gaaagccaag
     2401 atcaaaaagg cgatgaccta tcccattgcc gtcatcattg tcgcactgat tgtatctgcg
     2461 atcctcctga ttaaagtggt tccacaattt cagtcggtct ttgaaggttt cggcgcggaa
     2521 cttcccgcct ttacccagat gattgtcaat ctatcggagt tcatgcagga gtggtggttc
     2581 ttcatcatac tggcgatagc gatatttggc tttgcattca aagaattgca taaacgctca
     2641 caaaaattcc gtgacacact cgacagaacg atcctcaaac ttcccatttt cggaggcatc
     2701 gtctacaaat ctgcggtcgc ccgttatgca cggaccttgt ccacgacctt cgccgcgggt
     2761 gttcccctgg tcgatgcgct cgactccgtc tccggagcga ccggcaatat cgtgttcaag
     2821 aacgcggtca gcaagatcaa gcaagacgtt tccaccggca tgcagctcaa cttctccatg
     2881 cgcaccacca gcgtctttcc caacatggcg atccagatga ccgccatcgg cgaggagtcc
     2941 ggttcgctcg atgagatgct gagcaaagtc gccagctact acgaagagga agtcgacaac
     3001 gccgtggaca acctcaccac gctcatggaa ccgatgatca tggccgttct cggcgtactg
     3061 gttggcggtc tgatcgtggc catgtacctt ccgatcttcc aactcggcaa cgtcgtcgga
     3121 taactgatgc ccctcctcga ctacctggcc agccatccgc tggcctttgt cttgtgcgcc
     3181 atcctccttg gcctgctggt cggcagcttc ctcaacgtcg tcgtgcatcg cctgccgaag
     3241 atgatggagc gcaactggaa ggccgaagcc cgcgaagcgc tcgggctgga gccggaaccc
     3301 aagcaggcga cctacaacct ggtgctgccg aattcggcgt gcccgcgctg cggccacgag
     3361 atccgcccct gggaaaacat cccgctggtc agctacctgg cgctgggcgg caagtgctcg
     3421 tcctgcaagg ccgccatcgg caaacgctac cccctggtgg aactggccac cgcgctgctt
     3481 tccggctacg tcgcctggca tttcggcttc acctggcagg cgggcgcgat gctcctgctg
     3541 acctggggtc tgctggcgat gagcctgatc gatgccgatc accaactgct gccggacgtg
     3601 ctggtgctgc cgctgctctg gctgggcctg atcgccaacc atttcggcct cttcgccagc
     3661 ctcgacgatg cgctgttcgg cgcggtgttc ggctacctga gcctgtggtc ggtgttctgg
     3721 ctgttcaagc tggtgaccgg caaggagggc atgggctacg gcgacttcaa gctgctggcc
     3781 atgctcggtg cctggggtgg ctggcagatc ctgccgctga ccatcctgct gtcgtcgctg
     3841 gtcggggcga tcctcggggt gatcatgctg cgcctgcgca acgccgaaag cggcacgccg
     3901 atccccttcg gtccctatct ggctattgcc gggtggattg ctttgctctg gggtgatcaa
     3961 ataacccgga cctatctgca attcgccgga ttcaaatgac ccaaccc
//
LOCUS       RADHRPER      955 bp ds-DNA             SYN       18-JUL-1990
DEFINITION  Synthetic horseradish peroxidase isoenzyme C (HRP-C) subunit
            alpha-1 DNA, complete cds.
ACCESSION   J05552
KEYWORDS    peroxidase.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 955)
  AUTHORS   Smith,A.T., Santama,N., Dacey,S., Edwards,M., Bray,R.C.,
            Thornely,R.N.F. and Burke,J.F.
  TITLE     Expression of a synthetic gene for horseradish peroxidase C in
            E.coli and folding and activation of the recombinant enzyme with
            calcium and heme
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.T.Smith, 13-JUN-1990.
FEATURES       from  to/span     description
    pept         11      940     synthetic horseradish peroxidase isoenzyme C
                                 (HRP-C) subunit alpha-1 (E.C. 1.11.1.7)
BASE COUNT      246 a    269 c    221 g    219 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttaacc atgcagttaa cccctacatt ctacgacaat agctgtccca acgtgtccaa
       61 catcgttcgc gacacaatcg tcaacgagct cagatccgat cccaggatcg ctgcttcaat
      121 attacgtctg cacttccatg actgcttcgt gaatggttgc gacgctagca tattactgga
      181 caacaccacc agtttccgca ctgaaaagga tgcattcggg aacgctaaca gcgccagggg
      241 ctttccagtg atcgatcgca tgaaggctgc cgttgagtca gcatgcccac gaacagtcag
      301 ttgtgcagac ctgctgacta tagctgcgca acagagcgtg actcttgcag gcggaccgtc
      361 ctggagagtg ccgctcggtc gacgtgactc cctacaggca ttcctagatc tggccaacgc
      421 caacttgcct gctccattct tcaccctgcc ccagctgaag gatagcttta gaaacgtggg
      481 tctgaatcgc tcgagtgacc ttgtggctct gtccggagga cacacatttg gaaagaacca
      541 gtgtaggttc atcatggata ggctctacaa tttcagcaac actgggttac ctgaccccac
      601 gctgaacact acgtatctcc agacactgag aggcttgtgc ccactgaatg gcaacctcag
      661 tgcactagtg gactttgatc tgcggacccc aaccatcttc gataacaagt actatgtgaa
      721 tctagaggag cagaaaggcc tgatacagag tgatcaagaa ctgtttagca gtccaaacgc
      781 cactgacacc atcccactgg tgagaagttt tgctaactct actcaaacct tctttaacgc
      841 cttcgtggaa gccatggacc gtatgggtaa cattacccct ctgacgggta cccaaggcca
      901 gattcgtctg aactgcagag tggtcaacag caactcttaa taaggatccg aattc
//
LOCUS       RK2KORC      1701 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  Plasmid RK2 kil regulatory protein (korC) (complete cds), kleA
            (5' end), and Tn1 bla protein (3' end) genes.
ACCESSION   M32794
KEYWORDS    Tn1 bla protein; kil regulatory protein; kil-kor regulon;
            kleA protein; korC protein.
SOURCE      Plasmid RK2 DNA, passed in E.coli MV10.
  ORGANISM  Plasmid RK2
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1701)
  AUTHORS   Kornacki,J.A., Burlage,R.S. and Figurski,D.H.
  TITLE     The kil-kor regulon of broad host-range Plasmid RK2: Nucleotide
            sequence, polypeptide product and expression of regulatory gene
            korC
  JOURNAL   J. Bacteriol. 172, 3040-3050 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by D.H.Figurski, 13-MAR-1990.
FEATURES       from  to/span     description
    pept    <     1      320     Tn1 bla protein
    pept       1169     1426     korC protein
    pept       1652  >  1701     kleA protein
    pept.ps     468     1146     interrupted ORFX
    trns    <     1      467     Tn1 remnant
    mRNA       1153     1509     korC mRNA
    mRNA       1637  >  1701     kleA mRNA
    binding    1153     1161     korC ribosome binding site
    binding    1637     1641     kleA ribosome binding site
    signal     1568     1597     kleA promoter
    signal     1509     1526     pot. transcription termination signal
    site       1459     1471     KorB box
    site       1552     1562     KorA box
    site       1577     1593     KorC box
BASE COUNT      359 a    481 c    523 g    338 t
ORIGIN      Map position 4.3'-6.0'.
        1 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt
       61 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct
      121 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtggatctc
      181 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca
      241 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct
      301 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt
      361 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga
      421 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccttt tatttccgaa
      481 acgggctatc ggtcccatta cgaccatgcg cgcggcggca tgacggtgga ccaggtagcg
      541 gacggggttt tgcgcgccct gttgcgctcg catcgccggt atctggacgc ccgcgaccag
      601 gaccgccttg ccgacgagcc actgccggcg tggctcgcgg gtatcacgcc accgccgcgc
      661 cgcgtgcggg ccgtggtcga ggactggcgg aagccggacg aactgccgcc gggcttcgcc
      721 tgggttgatg ccgtgctgcc ggcgcaccag gcattcatcg cccgcaaatg ggcggccagc
      781 gccaaggcga agctcgcggc cgcgcgtgcg aaagctcagg agccggccgg gcagcgcagg
      841 gagccggtta caccggccaa gccggagccg gagccggcca aggacgagga cgcgccggcg
      901 tggccggcga cgttctttcc tggcctgcgg tgcgagattg tgagcgtgca tcacccggtt
      961 tttgccaagg aaatcgggaa gcacgtcatc attacgaaga tcagtcccga aacgcgacag
     1021 gtgtgggcgc acgacgacaa gccgccgcgc tatcgcatca atcgcaacgg tagaaaggtt
     1081 tgtgaatatg acccgcgttg catcgagtcg tgctacggct acgaccaatt gcgggccgcc
     1141 atttgataac gattaggaga aaagcagcat gagcgacgtg aatatccggc ttgagtgcct
     1201 gcgcccggcg gaacgctggg tgcagccgac cggcgcagaa atccgggaag tcttgcactt
     1261 ggccggcctc accggcggac aggctgcgcg catcttgggc ttgggtgcca agggcgaccg
     1321 cacggtgcgg cgttgggttg gcgaggattc gccgatcccc tatgccgcct gggcgatcct
     1381 ttgcgatcta gcggggattg gggcgatctg gaaaggccag ggctgacgcc cgcaggactt
     1441 tacgccaagg gagaggactt tagcggctaa aaccgccaac cctgatcgtt gcgatatgct
     1501 gccagcagcc cggcccgcaa gccgggtttt ttcgtgcgcc cgccgcaaat tgtttagcta
     1561 aatttccttg actatctagg gcataatgcc ctaatatagc aatccaaggc cgggcacttc
     1621 gcccaggtca gcaaccggag gatcaacccc gatgaaaagc aagatcatgt cttggctcga
     1681 tgagctgccc ggcgcggccg c
//
LOCUS       STMSUAC      1380 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  S.griseolus herbicide-inducible cytochrome P-450-SU1 (suaC) gene,
            complete cds.
ACCESSION   M32238
KEYWORDS    cytochrome P-450-SU1.
SOURCE      S.griseolus (strain ATCC 11796) DNA, subclone pCAO302.
  ORGANISM  Streptomyces griseolus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 1380)
  AUTHORS   Omer,C.A., Lenstra,R., Litle,P.J., Dean,C., Tepperman,J.M.,
            Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
  TITLE     Genes for two herbicide-inducible cytochromes P-450 from
            Streptomyces griseolus
  JOURNAL   J. Bacteriol. 172, 3335-3345 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Omer, 23-FEB-1990.
FEATURES       from  to/span     description
    pept        128     1348     cytochrome P-450-SU1 (suaC)
    binding     117      120     ribosomal binding site
BASE COUNT      236 a    503 c    445 g    196 t
ORIGIN      
        1 gcggacaggg ggactcctga agatgtctga tagaggccgt tgcgttctct acgggggcaa
       61 gtctatgctc cgaaatagag aacatggcgt tctttaaagg tgagaattct tgaatcggag
      121 tggaccgatg accgataccg ccacgacgcc ccagaccacg gacgcacccg ccttcccgag
      181 caaccggagc tgtccctacc agttaccgga cggctacgcc cagctccggg acacccccgg
      241 ccccctgcac cgggtgacgc tctacgacgg ccgtcaggcg tgggtggtga ccaagcacga
      301 ggccgcgcgc aaactgctcg gcgacccccg gctgtcctcc aaccggacgg acgacaactt
      361 ccccgccacg tcaccgcgct tcgaggccgt ccgggagagc ccgcaggcgt tcatcggcct
      421 ggacccgccc gagcacggca cccggcggcg gatgacgatc agcgagttca ccgtcaagcg
      481 gatcaagggc atgcgccccg aggtcgagga ggtggtgcac ggcttcctcg acgagatgct
      541 ggccgccggc ccgaccgccg acctggtcag tcagttcgcg ctgccggtgc cctccatggt
      601 gatctgccga ctcctcggcg tgccctacgc cgaccacgag ttcttccagg acgcgagcaa
      661 gcggctggtg cagtccacgg acgcgcagag cgcgctcacc gcgcggaacg acctcgcggg
      721 ttacctggac ggcctcatca cccagttcca gaccgaaccg ggcgcgggcc tggtgggcgc
      781 tctggtcgcc gaccagctgg ccaacggcga gatcgaccgt gaggaactga tctccaccgc
      841 gatgctgctc ctcatcgccg gccacgagac cacggcctcg atgacctccc tcagcgtgat
      901 caccctgctg gaccaccccg agcagtacgc cgccctgcgc gccgaccgca gcctcgtgcc
      961 cggcgcggtg gaggaactgc tccgctacct cgccatcgcc gacatcgcgg gcggccgcgt
     1021 cgccacggcg gacatcgagg tcgaggggca cctcatccgg gccggcgagg gcgtgatcgt
     1081 cgtcaactcg atagccaacc gggacggcac ggtgtacgag gacccggacg ccctcgacat
     1141 ccaccgctcc gcgcgccacc acctcgcctt cggcttcggc gtgcaccagt gcctgggcca
     1201 gaacctcgcc cggctggagc tggaggtcat cctcaacgcc ctcatggacc gcgtcccgac
     1261 gctgcgactg gccgtccccg tcgagcagtt ggtgctgcgg ccgggtacga cgatccaggg
     1321 cgtcaacgaa ctcccggtca cctggtgacg ggggagaggg gcaaggacat gaccatgcgg
//
LOCUS       STMSUBC      1441 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  S.griseolus cytochrome P-450-SU2 (subC) gene, complete cds.
ACCESSION   M32239
KEYWORDS    cytochrome P-450-SU2.
SOURCE      S.griseolus (strain ATCC 11796) DNA, subclone pCAO304.
  ORGANISM  Streptomyces griseolus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 1441)
  AUTHORS   Omer,C.A., Lenstra,R., Litle,P.J., Dean,C., Tepperman,J.M.,
            Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
  TITLE     Genes for two herbicide-inducible cytochromes P-450 from
            Streptomyces griseolus
  JOURNAL   J. Bacteriol. 172, 3335-3345 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Omer, 23-FEB-1990.
FEATURES       from  to/span     description
    pept        195     1406     cytochrome P-450-SU2 (subC)
    binding     182      186     ribosomal binding site
BASE COUNT      207 a    605 c    432 g    197 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccggcc accgcccgac ccgtccgcac tccgccccgc cgaccgtcgt ccatccgccc
       61 ctgcggccat gcggtttgag ccaacctcgg tgctgccgcg atctgccctt ccctcccccg
      121 ccgggcctgc gttagcgtga cgacatctta attacctaag ttaggtaatt agctcacgcg
      181 gaaggaccgg ccgcatgacg accgcagaac gcaccgctcc ccccgacgcc ctcaccgtcc
      241 cggccagccg cgcccccggc tgccccttcg accccgcgcc cgacgtcacc gaggcggccc
      301 gcaccgaacc ggtcacccgg gccaccctct gggacggctc ctcctgctgg ctggtgacgc
      361 gccatcagga cgtccgcgcg gtcctcggcg acccgcgctt cagcgccgac gcccaccgca
      421 ccggcttccc cttcctgacc gccggcggcc gcgagatcat cggcaccaac ccgaccttcc
      481 tgcgcatgga cgacccggag cacgcccgac tgcgccggat gctcaccgcc gacttcatcg
      541 tcaagaaggt cgaggcgatg cgccccgagg tgcagcgcct cgccgacgac ctggtcgacc
      601 ggatgaccac cggacgcacc tccgccgacc tggtcaccga gttcgcgctg ccgctgccgt
      661 ccctggtgat ctgcctgctg ctcggcgtcc cctacgagga ccacgcgttc ttccaggagc
      721 gcagccgggt cctgctcacc ctgcggtcca ctcccgagga agtccgggcc gcccaggacg
      781 agttgctgga gtacctcgcc cggctcgccc ggaccaagcg ggagcggccg gacgacgcca
      841 tcatcagccg cctggtcgcc cgcggcgagc tcgacgacac ccagatcgcc accatgggac
      901 gcctgttgct ggtcgccggc cacgagacga ccgccaacat gaccgcgctc tccaccctcg
      961 tgctgctgcg caaccccgac caactcgccc ggctgcgcgc cgaacccgcg ctcgtcaagg
     1021 gcgccgtcga ggagctgctg cgctacctga cgatcgtgca caacggcgtt ccccggatcg
     1081 ccaccgagga cgtgctcatc ggcggccgca ccatcgccgc cggcgagggc gtcctgtgca
     1141 tgatcagctc cgccaaccgg gacgccgagg tgttccccgg cggcgacgac ctcgacgtgg
     1201 cccgcgacgc ccgccgccac gtggccttcg gcttcggcgt ccaccagtgc ctgggacagc
     1261 cgttggccag ggtggagctc cagatcgcca tcgaaacgct gctgcgccgc ctgccggacc
     1321 tgcggctggc cgtgccccac gaggagatcc cgttccgcgg cgacatggcg atctacgggg
     1381 tccactcgct gccgatcgcc tggtagcccg ggcgccccca ccaccgacca ccacgcaccc
     1441 a
//

GenBank-Updates@genbank.bio.net (07/19/90)

LOCUS       HUMPPPB1A    3215 bp ss-mRNA            PRI       19-JUL-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) mRNA, complete
            cds.
ACCESSION   M33689
KEYWORDS    protein phosphotyrosyl phosphatase.
SOURCE      Human placenta, cDNA to mRNA, (library of Clontech), clone
            lambda-16-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3215)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept         73     1380     protein phosphotyrosyl phosphatase 1B (EC
                                 3.1.3.48)
BASE COUNT      818 a    828 c    801 g    768 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 gcgcgacgcg gcctagagcg gcagacggcg cagtgggccg agaaggaggc gcagcagccg
       61 ccctggcccg tcatggagat ggaaaaggag ttcgagcaga tcgacaagtc cgggagctgg
      121 gcggccattt accaggatat ccgacatgaa gccagtgact tcccatgtag agtggccaag
      181 cttcctaaga acaaaaaccg aaataggtac agagacgtca gtccctttga ccatagtcgg
      241 attaaactac atcaagaaga taatgactat atcaacgcta gtttgataaa aatggaagaa
      301 gcccaaagga gttacattct tacccagggc cctttgccta acacatgcgg tcacttttgg
      361 gagatggtgt gggagcagaa aagcaggggt gtcgtcatgc tcaacagagt gatggagaaa
      421 ggttcgttaa aatgcgcaca atactggcca caaaaagaag aaaaagagat gatctttgaa
      481 gacacaaatt tgaaattaac attgatctct gaagatatca agtcatatta tacagtgcga
      541 cagctagaat tggaaaacct tacaacccaa gaaactcgag agatcttaca tttccactat
      601 accacatggc ctgactttgg agtccctgaa tcaccagcct cattcttgaa ctttcttttc
      661 aaagtccgag agtcagggtc actcagcccg gagcacgggc ccgttgtggt gcactgcagt
      721 gcaggcatcg gcaggtctgg aaccttctgt ctggctgata cctgcctctt gctgatggac
      781 aagaggaaag acccttcttc cgttgatatc aagaaagtgc tgttagaaat gaggaagttt
      841 cggatggggc tgatccagac agccgaccag ctgcgcttct cctacctggc tgtgatcgaa
      901 ggtgccaaat tcatcatggg ggactcttcc gtgcaggatc agtggaagga gctttcccac
      961 gaggacctgg agcccccacc cgagcatatc cccccacctc cccggccacc caaacgaatc
     1021 ctggagccac acaatgggaa atgcagggag ttcttcccaa atcaccagtg ggtgaaggaa
     1081 gagacccagg aggataaaga ctgccccatc aaggaagaaa aaggaagccc cttaaatgcc
     1141 gcaccctacg gcatcgaaag catgagtcaa gacactgaag ttagaagtcg ggtcgtgggg
     1201 ggaagtcttc gaggtgccca ggctgcctcc ccagccaaag gggagccgtc actgcccgag
     1261 aaggacgagg accatgcact gagttactgg aagcccttcc tggtcaacat gtgcgtggct
     1321 acggtcctca cggccggcgc ttacctctgc tacaggttcc tgttcaacag caacacatag
     1381 cctgaccctc ctccactcca cctccaccca ctgtccgcct ctgcccgcag agcccacgcc
     1441 cgactagcag gcatgccgcg gtaggtaagg gccgccggac cgcgtagaga gccgggcccc
     1501 ggacggacgt tggttctgca ctaaaaccca tcttccccgg atgtgtgtct cacccctcat
     1561 ccttttactt tttgcccctt ccactttgag taccaaatcc acaagccatt ttttgaggag
     1621 agtgaaagag agtaccatgc tggcggcgca gagggaaggg gcctacaccc gtcttggggc
     1681 tcgccccacc cagggctccc tcctggagca tcccaggcgg gcggcacgcc agacagcccc
     1741 ccccttgaat ctgcagggag caactctcca ctccatattt atttaaacaa ttttttcccc
     1801 aaaggcatcc atagtgcact agcattttct tgaaccaata atgtattaaa attttttgat
     1861 gtcagccttg catcaagggc tttatcaaaa agtacaataa taaatcctca ggtagtactg
     1921 ggaatggaag gctttgccat gggcctgctg cgtcagacca gtactgggaa ggaggacggt
     1981 tgtaagcagt tgttatttag tgatattgtg ggtaacgtga gaagatagaa caatgctata
     2041 atatataatg aacacgtggg tatttaataa gaaacatgat gtgagattac tttgtcccgc
     2101 ttattctgct ccctgttatc tgctagatct agttctcaat cactgctccc ccgtgtgtat
     2161 tagaatgcat gtaaggtctt cttgtgtcct gatgaaaaat atgtgcttga aatgagaaac
     2221 tttgatctct gcttactaat gtgccccatg tccaagtcca acctgcctgt gcatgacctg
     2281 atcattacat ggctgtggtt cctaagcctg ttgctgaagt cattgtcgct cagcaatagg
     2341 gtgcagtttt ccaggaatag gcatttgcct aattcctggc atgacactct agtgacttcc
     2401 tggtgaggcc cagcctgtcc tggtacagca gggtcttgct gtaactcaga cattccaagg
     2461 gtatgggaag ccatattcac acctcacgct ctggacatga tttagggaag cagggacacc
     2521 ccccgccccc cacctttggg atcagcctcc gccattccaa gtcgacactc ttcttgagca
     2581 gaccgtgatt tggaagagag gcacctgctg gaaaccacac ttcttgaaac agcctgggtg
     2641 acggtccttt aggcagcctg ccgccgtctc tgtcccggtt caccttgccg agagaggcgc
     2701 gtctgcccca ccctcaaacc ctgtggggcc tgatggtgct cacgactctt cctgcaaagg
     2761 gaactgaaga cctccacatt aagtggcttt ttaacatgaa aaacacggca gctgtagctc
     2821 ccgagctact ctcttgccag cattttcaca ttttgccttt ctcgtggtag aagccagtac
     2881 agagaaattc tgtggtggga acattcgagg tgtcaccctg cagagctatg gtgaggtgtg
     2941 gataaggctt aggtgccagg ctgtaagcat tctgagctgg cttgttgttt ttaagtcctg
     3001 tatatgtatg tagtagtttg ggtgtgtata tatagtagca tttcaaaatg gacgtactgg
     3061 tttaacctcc tatccttgga gagcagctgg ctctccacct tgttacacat tatgttagag
     3121 aggtagcgag ctgctctgct atgtccttaa gccaatattt actcatcagg tcattatttt
     3181 ttacaatggc catggaataa accattttta caaaa
//
LOCUS       HUMPPPB1A1    276 bp ds-DNA             PRI       19-JUL-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x.
ACCESSION   M33688
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     1 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 276)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    /    34  +   243     protein phosphotyrosyl phosphatase 1B, exon x
                                 (EC 3.1.3.48) (AA at 34)
    pre-msg <     1  >   276     PTP1B mRNA and introns
    IVS     <     1       33     PTP1B intron x-1
    IVS         244  >   276     PTP1B intron x
BASE COUNT       56 a     77 c     69 g     74 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 ctttagaatc tactagatga ttttctcttt cagacccaag aaactcgaga gatcttacat
       61 ttccactata ccacatggcc tgactttgga gtccctgaat caccagcctc attcttgaac
      121 tttcttttca aagtccgaga gtcagggtca ctcagcccgg agcacgggcc cgttgtggtg
      181 cactgcagtg caggcatcgg caggtctgga accttctgtc tggctgatac ctgcctcttg
      241 ctggtaagga ggcctcgcgg gtgccctggg gagctc
//
LOCUS       HUMPPPB1A2    453 bp ds-DNA             PRI       19-JUL-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+1.
ACCESSION   M33687
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     2 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 453)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +   236  +   397     protein phosphotyrosyl phosphatase 1B, exon
                                 x+1 (EC 3.1.3.48)
    pre-msg <     1  >   453     PTP1B mRNA and introns
    IVS     <     1      235     PTP1B intron x
    IVS         398  >   453     PTP1B intron x+1
BASE COUNT      104 a    118 c    111 g    120 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 ggggaggtcc cagactctta accagatctc ttgtgaatgc attgcctcag ggaggcacca
       61 agcctttcat gaggacctgt ccccctgacc cagacacctc ccacccagcc ccacctccaa
      121 cactagggat cacatttcag catgagattg ggaggggaca gacatctaac ggtgttatta
      181 acgttgccct tgagaattgg acctggctga cttatatctc ctctctggct ttcagatgga
      241 caagaggaaa gacccttctt ccgttgatat caagaaagtg ctgttagaaa tgaggaagtt
      301 tcggatgggg ctgatccaga cagccgacca gctgcgcttc tcctacctgg ctgtgatcga
      361 aggtgccaaa ttcatcatgg gggactcttc cgtgcaggtc agcattgcct ttgtttgaat
      421 ccaggtgtga ccattttaac ttttttgtct ttg
//
LOCUS       HUMPPPB1A3    426 bp ds-DNA             PRI       19-JUL-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+2.
ACCESSION   M33686
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     3 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 426)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +    46  +   269     protein phosphotyrosyl phosphatase 1B, exon
                                 x+2 (EC 3.1.3.48)
    pre-msg <     1  >   426     PTP1B mRNA and introns
    IVS     <     1       45     PTP1B intron x+1
    IVS         270  >   426     PTP1B intron x+2
BASE COUNT      115 a    122 c    109 g     80 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 gaagtgaaca ctaatagact tccttcctct tgctgctctt tcaaggatca gtggaaggag
       61 ctttcccacg aggacctgga gcccccaccc gagcatatcc ccccacctcc ccggccaccc
      121 aaacgaatcc tggagccaca caatgggaaa tgcagggagt tcttcccaaa tcaccagtgg
      181 gtgaaggaag agacccagga ggataaagac tgccccatca aggaagaaaa aggaagcccc
      241 ttaaatgccg caccctacgg catcgaaagg taatatattg ggtccagctt gttggggtga
      301 ggggaaatga cttctgttct agaaacacac gctggtactg aaaccctgtg atgcagcctc
      361 tgttggcaag cagcgcttcg catccttggg aacagggcgc tggaccaaca cccactccac
      421 tggtgg
//
LOCUS       HUMPPPB1A4    732 bp ds-DNA             PRI       19-JUL-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+3.
ACCESSION   M33685
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     4 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 732)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +   402  +   597     protein phosphotyrosyl phosphatase 1B, exon
                                 x+3 (EC 3.1.3.48)
    pre-msg <     1  >   732     PTP1B mRNA and introns
    IVS     <     1      401     PTP1B intron x+2
    IVS         598  >   732     PTP1B intron x+3
BASE COUNT      164 a    193 c    205 g    170 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 tctgtagctc taaagaatga gatctggtgt actgatgtgg ccagacattg caattgcagt
       61 acatgagaag gcaaatcata cagtagtgtg tacaccagtg agtcctccag ccagataaat
      121 cctcacagtg accagtcgcc caggcacctt gtgaacccta ccctgggtgt gggtgctatc
      181 tgaagtacct gggggagggg gtgacaagtg gacttcaggc tgatgtggcc ctggcctggc
      241 cctccctcca agcagagggg gctggcacgc tggaaggtta acatcatcca actctgtcta
      301 cacgtggctt gttttttcct agaattcctg ccacaatagc agcatccttg ccattcattt
      361 tctccaaagt gagtacccat ctctgccctc tgattcctca gcatgagtca agacactgaa
      421 gttagaagtc gggtcgtggg gggaagtctt cgaggtgccc aggctgcctc cccagccaaa
      481 ggggagccgt cactgcccga gaaggacgag gaccatgcac tgagttactg gaagcccttc
      541 ctggtcaaca tgtgcgtggc tacggtcctc acggccggcg cttacctctg ctacagggta
      601 tgtttccact gacagacgcg ctgggcagat gctcgtgtgc agagagcact ggccgctagc
      661 ccgatggtag gattcagttc tgtggtgcat ctgagccagt ctcagaagaa acagatcaag
      721 gttttaagtc tg
//
LOCUS       HUMPPPB1A5    365 bp ds-DNA             PRI       19-JUL-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+4.
ACCESSION   M33684
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     5 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 365)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +   266      289     protein phosphotyrosyl phosphatase 1B, exon
                                 x+4 (EC 3.1.3.48)
    pre-msg <     1  >   365     PTP1B mRNA and introns
    IVS     <     1      265     PTP1B intron x+3
BASE COUNT       80 a    101 c     88 g     96 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 tacctcctaa gacttttacg gttttaaata ttttacctct ttccaggtgg catctgagta
       61 catcagatgg ttttgcaaaa tgcaaacaat tttttccttg gggatgattt ttggggagag
      121 ggggctactg taaaaaataa aaccaaaacc ccctttgctc cctcggaggt tgaagttgcc
      181 ggggggtgtg gccggggtca tgcatgaggc gacagcactg caggtgcggg tctgggctca
      241 tctgaactgt ttggtttcat tccagttcct gttcaacagc aacacatagc ctgaccctcc
      301 tccactccac ctccacccac tgtccgcctc tgcccgcaga gcccacgccc gactagcagg
      361 catgc
//
LOCUS       HB3HBLA      1319 bp ds-DNA             PHG       19-JUL-1990
DEFINITION  Bacteriophage HB-3 amidase (hbl) gene, complete cds.
ACCESSION   M34652
KEYWORDS    amidase.
SOURCE      Bacteriophage HB-3 (host Streptococcus pneumoniae) DNA.
  ORGANISM  Bacteriophage HB-3
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 1319)
  AUTHORS   Romero,A., Lopez,R. and Garcia,P.
  TITLE     Sequence of the Streptococcus pneumoniae bacteriophage HB-3 amidase
            reveals high homology with the major host autolysin
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Romero, 25-MAY-1990.
               Consejo superior de Investigaciones Clentificas
               Centro de Investigaciones Biologicas
               Velazquez 144 Madrid, 28006 SPAIN
FEATURES       from  to/span     description
    pept        298     1254     amidase (hbl)
    binding     283      289     ribosomal binding site (put.)
    site       1285     1305     transcription stop sequence
BASE COUNT      412 a    268 c    334 g    305 t
ORIGIN      
        1 aagcttttta acagtagcag taggcggtat tgtaaaagca gtaaaagatt atcttttgcg
       61 taaaggcgga gagaaagcgg tgatcatcgc tgaaattcta gctaaaatgc agttcatgcc
      121 gttgagcaag tagcttcaga gactggctat aagggcgaag aaaagctgga gcaggctcgt
      181 gctaaagtcc gtgctgagct tacaaaatac aatattagca tgactgacaa aaacttagac
      241 accttcgtag agtcagcagt gaagcagatg aatgacgcat ggaaaggacg atagggaatg
      301 gatatcgata gaaacagact acgtacaggc ttgccccagg ttggggtgca gccttatcga
      361 caagtacatg ctcactcaac aggtaaccgc aactcaaccg tacagaatga agcggattat
      421 cactggcgga aagacccaga attaggtttt ttctcgcacg ttgttgggaa ctttcgcatc
      481 atgcaggtcg gacctgtgaa caacggaagt tgggatgttg ggggcggttg gaatgctgag
      541 acctatgcag cggttgaact gattgaaagc cattcaacta aggaagagtt tatggctgac
      601 tatcgcctct atatcgaatt gctacgcaat ctagcggacg aagcaggctt gccgaagact
      661 cttgatacag acgacttggc aggtatcaag acgcatgaat actgtaccaa taaccaacca
      721 aacaaccact cagaccatgt ggatccatat ccatatcttg caagttgggg cattagccgt
      781 gaacagttta agcaagacat cgaaaacggc ttgagcgctg caacaggctg gcagaaaaat
      841 ggcactggct actggtacgt acattcagac ggctcttatt caaaagataa gtttgagaaa
      901 atcaacggta cctggtatta tttcgatggc tcaggctata tgctttcaga ccgctggaag
      961 aagcacacag acggtaattg gtactacttt gaccaatcag gcgaaatggc cacaggctgg
     1021 aagaaaatcg ctgacaagtg gtactatttt gatgtagaag gtgccatgaa gacaggctgg
     1081 gtcaagtaca aggacacttg gtactactta gacgctaaag aaggcgccat ggtatcaaat
     1141 gcctttatcc agtcagcgga cggaacaggc tggtactacc tcaaaccaga cggaacactg
     1201 gcagacaagc cagagttcac agtagagcca gatggcttga ttacagttaa ataaatagaa
     1261 aggaaacttt ctaaattgtt cttcaccgca gctcaggctt acggtttttt tgttttaaa
//
LOCUS       FIBGLUC      1426 bp ds-DNA             BCT       19-JUL-1990
DEFINITION  F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene,
            complete cds.
ACCESSION   M33676
KEYWORDS    1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase.
SOURCE      F.succinogenes (strain S85) DNA, clone PJI5.
  ORGANISM  Fibrobacter succinogenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1426)
  AUTHORS   Teather,R.M. and Erfle,J.D.
  TITLE     DNA sequence of a Fibrobacter succinogenes mixed linkage
            beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene
  JOURNAL   J. Bacteriol. 172, 3837-3841 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.M.Teather, 11-APR-1990.
FEATURES       from  to/span     description
    pept        145     1194     1,3-1,4-beta-D-glucan 4-glucanohydrolase
                                 precursor (EC 3.2.1.73)
    sigp        145      225     1,3-1,4-beta-D-glucan 4-glucanohydrolase
                                 signal peptide
    matp        226     1191     1,3-1,4-beta-D-glucan 4-glucanohydrolase
    binding     132      137     ribosome binding site
    signal       62       66     -35 region
    signal       85       90     -10 region
BASE COUNT      371 a    346 c    335 g    374 t
ORIGIN      
        1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt
       61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt
      121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc
      181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt
      241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg
      301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc
      361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc
      421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct
      481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac
      541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac
      601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt
      661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat
      721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat
      781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt
      841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc
      901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa
      961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct
     1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac
     1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt
     1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg
     1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt
     1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa
     1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt
     1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa
//
LOCUS       PP1BOFFO      931 bp ds-DNA             PHG       19-JUL-1990
DEFINITION  Bacteriophage P1 regulatory protein (bof) gene, complete cds.
ACCESSION   M33224
KEYWORDS    regulatory protein.
SOURCE      Bacteriophage P1 viral DNA.
  ORGANISM  Bacteriophage P1
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 931)
  AUTHORS   Schaefer,T.S. and Hays,J.B.
  TITLE     The bof gene of bacteriophage P1. DNA sequence and evidence for
            roles in regulation of phage c1 and ref genes
  JOURNAL   J. Bacteriol. 172, 3269-3277 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.B.Hayes, 26-MAR-1990.
FEATURES       from  to/span     description
    pept    >   544      789     Bof regulatory protein
    site        541      543     potential ttg start codon for Bof
BASE COUNT      260 a    211 c    243 g    217 t
ORIGIN      Map position 9-10.
        1 gggtaactgg tggattatcg agacaaaaca caacgtggcg gacgttctgg ccgtcatcca
       61 acaatacgca taacaggagc gcccggttcg cgctgcgcat aatatggcca cactatctga
      121 tacaataaaa ccgaataaaa catatcttga ggcggtactg cgtacggcat tattaggaaa
      181 gacagaagac gaatacgttg atttcttcct gtcagggcta cgcgggcgat tactgaaaaa
      241 tccccgcctg taccgcagct atggcccata ctggcggaaa ttaaaaaatt attactggag
      301 cgacggttat ggtaatttcg gtcgtctcgt tgaccgtgac gttcgcaaat tttaccgtta
      361 tgaccgcccg gcgctaacac tcatagccgc gacgctctac agccatgagc gttttgataa
      421 tggtcagata tactcagcct ggcatttact gccagtccct gaagaagttg acgaccagga
      481 ctatgagttt gagtcttacg atttggaagt tgaagccttg gcacaggctg gagagaaaac
      541 ttgaaaaagc gatactacac agtaaagcat gggacgctac gagcattaca agagtttgct
      601 gacaagcata acgttgaggt gcgcagggaa gggggaagta aagctctgcg catgtaccgt
      661 ccggacggga aatggcggac ggtcgtcgat ttcaaaacaa acagtgttcc ccagggcgtc
      721 cgtgaccggg cattcgaaga atgggagcag atcatcatag ataatgcatt gcttctcaat
      781 gcggattaaa cttccccaaa ttagggctgt ttgctcaccg agcatcgctc aaagaagcac
      841 gattcttcaa acatatagat agtgatagtg ccacaacttc tggctctaac gggctgggga
      901 ggcggcgctt tgttgctaaa tgatctggtt t
//
LOCUS       STRTEE6      2508 bp ds-DNA             BCT       19-JUL-1990
DEFINITION  S.pyogenes trypsin-resistant surface T6 protein (tee6) gene,
            complete cds.
ACCESSION   M32978
KEYWORDS    surface protein; trypsin-resistant surface T6 protein.
SOURCE      S.pyogenes (strain D471, sub-species M-type 6) DNA.
  ORGANISM  Streptococcus pyogenes
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Streptococcaceae.
REFERENCE   1  (bases 1 to 2508)
  AUTHORS   Schneewind,O., Jones,K.F. and Fischetti,V.A.
  TITLE     Sequence and structural characteristics of the trypsin-resistant T6
            surface protein of group A streptococci
  JOURNAL   J. Bacteriol. 172, 3310-3317 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.Schneewind, 18-MAR-1990.
FEATURES       from  to/span     description
    pept        719     2332     trypsin-resistant surface T6 protein (tee6)
                                 precursor
    sigp        719      784     trypsin-resistant surface T6 protein signal
                                 peptide
    matp        785     2329     trypsin-resistant surface T6 protein
    binding     706      709     ribosome binding site
    signal     2353     2382     transcription terminator
BASE COUNT      929 a    364 c    480 g    735 t
ORIGIN      
        1 aagcttcaga tgaagcctat gagaagtata aggataacga aggaagatat agcgaaatgg
       61 gagattccga tactgattat ggaaccaacc aaactagttc tggaaaaggt ggtttgcctt
      121 ctaattcaga tgcttcggtt aattatatgg cagatggtcg tgaacagaaa ttaccttata
      181 agcacccagt gattcaggtc aaaacagtac caatcacgtt taccaaagta gatgctgaca
      241 acaaccagaa aaaacttgca ggtgttgagt ttgaactccg taaagaggac aagaagatcg
      301 tctgggaaaa gggaacaaca ggttcaaatg gccaactcaa ctttaagtac cttcaaaaag
      361 gcaaaaccta ttatctgtat gagacgaagg caaaacttgg atacactctt ccagaaaatc
      421 catgggaagt tgccgttgct aacaacggtg atataaaagt aaaacacccg attgaaggtg
      481 aattgaagtc aaaagatggc tcttacatga ttaaaaatta taagatttat cagttgccat
      541 cgtctggggg aagaggaagt caaattttca ttatagttgg tagcatgaca gcaactgtag
      601 cattattatt ttatagacgc caacacagga aaaagcaata ttaaattaat gatcatattt
      661 attgacaaac aggagagaaa cagtgagaga gaagatatta ataacagcaa aaaaactaat
      721 gctagcttgt ttagctatct tagcggtagt agggcttgga atgacaagag tatcagcttt
      781 atcaaaagat gatactgcac aactaaagat aacaaatatt gaaggtgggc caacagtaac
      841 actttataaa ataggagaag gtgtttacaa cactaatggt gattctttta ttaactttaa
      901 atatgctgag ggggtttctt taactgaaac aggacctaca tcacaagaaa ttactactat
      961 tgcaaatggt attaatacgg gtaaaataaa gccttttagt actgaaaacg ttagtatttc
     1021 taatggaaca gcaacttata atgcgagagg tgcatctgtt tatattgcat tattaacagg
     1081 tgcgacagat ggccgtacct acaatcctat tttattagct gcatcttata atggtgaggg
     1141 aaatttagtt actaaaaata ttgattccaa atctaattat ttatatggac aaacaagtgt
     1201 tgcaaaatca tcattaccat ctattacaaa gaaagtaacc gggacaatag atgacgtgaa
     1261 taaaaagact acctcgttag gaagtgtatt gtcttattcg ctgacatttg aattaccaag
     1321 ttataccaaa gaagcagtca ataaaacagt atatgtttct gataatatgt cggaaggtct
     1381 tacttttaac tttaatagtc ttacagtaga atggaaaggt aagatggcta atattactga
     1441 agatggttca gtaatggtag aaaatacaaa aatcggaata gctaaggagg ttaataacgg
     1501 ttttaattta agttttattt atgatagttt agaatctata tcaccaaata taagttataa
     1561 agctgttgta aacaataaag ctattgttgg tgaagagggt aatcctaata aagctgaatt
     1621 cttctattca aataatccaa caaaaggtaa tacatacgat aatttagata ggaagcctga
     1681 taaagggaat ggtattacat ccaaagaaga ttctaaaatt gtttatactt atcaaatagc
     1741 gtttagaaaa gttgatagtg ttagtaagac cccacttatt ggtgcaattt ttggagttta
     1801 tgatactagt aataaattaa ttgatattgt tacaaccaat aaaaatggat atgctatttc
     1861 aacacaagta tcttcaggaa aatataaaat taaggaatta aaagctccta aaggttattc
     1921 attgaataca gaaacttatg aaattacggc aaattgggta actgctacag tcaagacaag
     1981 tgctaattca aaaagtacta cttatacatc tgataaaaat aaggcgacag ataattcaga
     2041 gcaagtagga tggttaaaaa atggtatatt ctattctata gatagtagac ctacaggaaa
     2101 tgatgttaaa gaggcttata ttgaatctac taaggcttta actgatggaa caactttctc
     2161 aaaatcgaat gaaggttcag gtacagtatt attagaaact gacatcccta acaccaagct
     2221 aggtgaatta ccttcgacag gtagcattgg tacttacctc tttaaagcta ttggttcggc
     2281 tgctatgatt ggtgcaattg gtatttatat tgttaaacgt cgtaaagctt aatgctttca
     2341 aaagtcgaaa tcaatcgaga ctgtctttat gcggtctcga tttttaatga taaggaactg
     2401 ctatgacaga aagactaaaa aatctaggga tactcttatt atttttattg ggaacagcca
     2461 tttttcttta ccctacgcta agtagtcagt ggaatgccta tcgtgatc
//
LOCUS       HALHPA       1317 bp ds-DNA             BCT       19-JUL-1990
DEFINITION  H.volcanii histidinol-phosphate-aminotransferase (hisC) gene,
            complete cds.
ACCESSION   M33161
KEYWORDS    histidinol-phosphate-aminotransferase.
SOURCE      H.volcanii (strain DSM 3757) cell line WFD 18 DNA, clone 477.
  ORGANISM  Halobacterium volcanii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 1317)
  AUTHORS   Conover,R.K. and Doolittle,W.F.
  TITLE     Characterization of a gene involved in histidine biosynthesis in
            Halobacterium (Haloferax) volcanii: Isolation and rapid mapping by
            transformation of an auxotroph with cosmid DNA
  JOURNAL   J. Bacteriol. 172, 3244-3249 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.K.Conover, 22-MAR-1990.
FEATURES       from  to/span     description
    pept        121     1206     histidinol-phosphate-aminotransferase (hisC)
                                 (EC 2.6.1.9)
BASE COUNT      234 a    504 c    410 g    169 t
ORIGIN      
        1 agtcgttcgg gcggcgctcg gctgacggcc gtcggtcgtc gcgtccccaa cccgaccccc
       61 taccgccacg tccgacccgg agtacgcacc cttaagaacc gcgacccgca ttttccgacc
      121 atgcaaccac gggacctctc cgcgcacgct ccctacgtac ccggccgcgg gacagaggag
      181 gtcgcccgcg aactcggaat ggaccccgag gacctgacga aactctcctc gaacgagaac
      241 ccccacggcc cgagtccgaa ggcggtcgcc gccatcgaag acgccgcgcc gaccgtgagc
      301 gtctacccga agaccgccca cacggacctg accgaacgcc tcgccgacaa gtggggcctc
      361 gcacccgaac aggtgtgggt gtctcccggc gcggacggct ctatcgacta cctgacccgc
      421 gcggtgctcg aaccggacga ccggattctc gaacccgcgc ccggcttttc gtactactcg
      481 atgagcgccc gctaccacca cggcgacgcc gtccagtacg aggtgtcgaa ggacgacgac
      541 ttcgaacaga ccgccgacct cgtcctcgac gcctacgacg gcgagcgcat ggtctacctc
      601 acaacgccgc acaaccccac cggttccgtg ctcccgcggg aggaactcgt cgaactggcc
      661 gagtcggtcg aagagcacac gctcctcgtc gtcgacgagg cctacggcga gttcgccgag
      721 gagccgtcgg ccatcgacct cttgtcggag tacgacaacg tcgcggccct gcggacgttc
      781 tcgaaggcgt acgggctggc cggcctccgc atcggctacg cctgcgtgcc cgaggcgtgg
      841 gccgacgcct acgcccgcgt gaacacgccg ttcgccgcca gcgaggtcgc ctgccgcgcc
      901 gcgctcgccg cgctcgacga cgaggaacac gtcgagaaat ccgtcgagtc ggcccggtgg
      961 tcccgcgact atctccgcga acacctcgac gcgccgacgt gggaaagcga gggcaacttc
     1021 gtcctcgtcg aggtcggcga cgccacggcc gtcaccgagg ccgcccagcg cgagggcgtc
     1081 atcgtccgcg actgcgggag cttcggcctg ccggagtgca tccgcgtctc ctgcggcacg
     1141 gaaacccaga ccaagcgcgc cgtggacgtg ctcaaccgca tcgtctcgga ggtgccgacg
     1201 gcgtgagaga cgacgacacc ggcacgcccg gcaccggaaa gaccacggcg accgagccgg
     1261 tcgccgccga cctcgacctc gacgtggtcc acctcaaccg actcgtgaaa gacgagg
//
LOCUS       BOVGOA        472 bp ss-mRNA            MAM       19-JUL-1990
DEFINITION  B.taurus go-alpha mRNA, 3' end.
ACCESSION   J02900
KEYWORDS    go-alpha.
SOURCE      B.taurus retina, cDNA to mRNA, clone GO3.1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (sites)
  AUTHORS   Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M.,
            Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M.
  TITLE     Multiple forms of go-alpha mRNA: Analysis of the 3'-untranslated
            regions
  JOURNAL   Biochemistry 29, 5069-5076 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 472)
  AUTHORS   Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M.,
            Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by S.R.Price, 12-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1        3     go-alpha (AA at 1)
BASE COUNT      130 a    133 c     88 g    121 t
ORIGIN      
        1 tgacctcttg tcctgtatag caacctattt ggtaatgatt ccagcactca cagaaaagct
       61 tgcacacata cacacacacc ccacccctcc ccactaacaa atgcaagttg gtaaacaaat
      121 tccaaaaagg cataacaaac cttatatata tagacaaata tatattaaag ttttttagtc
      181 tgtactagaa agagcttcag acagaactga ccaccattcc attgctcatc aatttcctgg
      241 gacagcacct gagcgtgcgc ttacgcgcgt acacacacat agacacgcac tgcgatacaa
      301 gtcctgattt gggagtccgt ccttttaaaa acagccacat gctttcacgc tctgagaccc
      361 acccgtttct gtgagcaggg ggagggcaag gaaagccctg gcctcagtcc agccttttct
      421 ctgcttccac ctgctcaggc tgtgtgctct tggttctgtc ctgcacttgt gt
//
LOCUS       CAJCAT       1334 bp ds-DNA             BCT       19-JUL-1990
DEFINITION  C.coli plasmid C-589 chloramphenicol acetyltransferase (cat) gene,
            complete cds.
ACCESSION   M35190
KEYWORDS    chloramphenicol acetyltransferase.
SOURCE      C.coli plasmid C-589 DNA.
  ORGANISM  Campylobacter coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 1 to 1334)
  AUTHORS   Wang,Y. and Taylor,D.E.
  TITLE     Chloramphenicol resistance in Campylobacter coli, nucleotide
            sequence, expression and cloning vector construction
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Taylor, 15-JUN-1990.
FEATURES       from  to/span     description
    pept        309      932     chloramphenicol acetyltransferase (cat)
    mRNA        277  >   932     chloramphenicol acetyltransferase mRNA
    signal      242      271     promoter
    binding     297      301     ribosome binding site
    signal      960     1006     transcriptional termination signal
BASE COUNT      433 a    232 c    282 g    387 t
ORIGIN      
        1 attcccacaa cgccggaaac aagccgtgcc acgagcttat aataaaagag ggaagagaag
       61 cgtatttttc ctcacttccg gtgaaggata tcgagaaaaa tctaaatgat aacggaattc
      121 cgtcgtcggt atcgtatgga gcggacaacg agtaaaagag tgaccgccga gataacccat
      181 tgctcggcgg tgttcctttc caagttaatt gcgtgatata gattgaaaag tggatagatt
      241 tatgatatag tggatagatt tatgatataa tgagttatca acaaatcgga atttacggag
      301 gataaatgat gcaattcaca aagattgata taaataattg gacacgaaaa gagtatttcg
      361 accactattt tggcaatacg ccctgcacat atagtatgac ggtaaaactc gatatttcta
      421 agttgaaaaa ggatggaaaa aagttatacc caactctttt atatggagtt acaacgatca
      481 tcaatcgaca tgaagagttc aggaccgcat tagatgaaaa cggacaggta ggcgtttttt
      541 cagaaatgct gccttgctac acagtttttc ataaggaaac tgaaaccttt tcgagtattt
      601 ggactgagtt tacagcagac tatactgagt ttcttcagaa ctatcaaaag gatatagacg
      661 cttttggtga acgaatggga atgtccgcaa agcctaatcc tccggaaaac actttccctg
      721 tttctatgat accgtggaca agctttgaag gctttaactt aaatctaaaa aaaggatatg
      781 actatctact gccgatattt acgtttggga agtattatga ggagggcgga aaatactata
      841 ttcccttatc gattcaagtg catcatgccg tttgtgacgg ctttcatgtt tgccgttttt
      901 tggatgaatt acaagacttg ctgaataaat aaaatcccag tttgtcgcac tgataaaaac
      961 cctttaggaa ctaaagggcg cacttctata ctctctgtcg agagtagtgc gtcctgcgga
     1021 gcttcattcc cggtcagcgc gcttatcaat atatctatag aatgggcaaa gcataaaaac
     1081 ttgcatggac taatgcttga aacccaggac aataacctta tagcttgtaa attctatcat
     1141 aattgtggtt tcaaaatcgg ctccgtcgat actatgttat acgccaactt tgaaaacaac
     1201 tttgaaaaag ctgttttctg gtatttaagg ttttagaatg caaggaacag tgaattggag
     1261 ttcgtcttgt tattaattag cttcttgggg tatctttaaa tactgtagaa agaggaagga
     1321 aataataaat ggct
//
LOCUS       CLOCBA       5120 bp ds-DNA             BCT       19-JUL-1990
DEFINITION  C.acetobutylicum beta-D-galactosidase (cbgA) and
            beta-D-galactosidase regulatory protein (cbgR) genes, complete cds.
ACCESSION   M35107
KEYWORDS    beta-D-galactosidase; beta-D-galactosidase regulatory protein.
SOURCE      C.acetobutylicum (strain NCIB2951) DNA.
  ORGANISM  Clostridium acetobutylicum
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1260 to 5120)
  AUTHORS   Hancock,K.R., Rockman,E., Pearce,L., Maddox,I.S. and Scott,D.B.
  TITLE     Clostridium acetobutylicum beta-galactosidase gene, cbgA, is
            positively regulated in Escherichia coli by a novel regulatory
            gene, cbgR
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 5120)
  AUTHORS   Scott,D.B., Hancock,K.R., Pearce,L. and Maddox,I.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by D.B.Scott, 11-JUN-1990.
            
               Author address:D.B.Scott:
               Molecular Genetics Unit
               Department of Microbiology and Genetics
               Massey University
               Palmerston North, New Zealand
               E-mail:D.B.Scott@massey.ac.nz
FEATURES       from  to/span     description
    pept       1560     4253     beta-D-galactosidase (cbgA)
    pept       4500     4805     beta-D-galactosidase regulatory protein (cbgR)
BASE COUNT     1921 a    683 c    876 g   1640 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccttt tcatatatat ctttaatatt tctactggaa tagaagaggt tgctcaatac
       61 aaaaaatgct tctttaaaac tatttgaaac tacttctgaa atattttcta gcttactaaa
      121 tagagaatta taatttttat catcaaaatt tagaattaca actatgattt cgttttcaat
      181 attagcaatt tgtatattat aattgctatt taatccgtct aaagaaaatt ctttgccgat
      241 ttctgaaatt gtaaaatcaa taatttcatg gcgtttgcta taattatcat atatttcttt
      301 gcgtttaaac caaataagca aaatgattga aaagtaaata tgtatcaaag tagttaaagt
      361 caggatcatg tcaaaacctg atataaggcg atttaaggcg ctattagtga gacttaaaga
      421 gtttccttct aaagtatttc ttttcatttt tattgaaatc ttttttagag tacttaataa
      481 ctcagaagga tttagagaag gttttaaaat ataatcaaca gcaccatttt gaaaagatga
      541 tttaacatat tcaaaatcgc tataactact taagatgata attcttatct taggatattt
      601 gtcctgcaca aatttagcta attcaacccc atttatttgg ggcattacaa catcagaaat
      661 tataatgtca ggaatatcct tttttatcat ttccagagct tcttgaccat tagaagcctg
      721 tcctataatt tgaaagcctt ctttttccca atcaatcata tgagttatgc cttgccgcat
      781 aataaattca tcatcaacaa ctaaattttt actatattcg ttcaatagta tagcacccct
      841 tattctaaaa ttaccacaac atagataaat attgcttaat actattatac cttatagatt
      901 tattgtatgt atctgtatac gttacgttaa ttcatctaca aatttatatg agttttggtt
      961 gcacttttag agaaaatctt tttgtctatg gtcttattgt cctataatgg tcaaatcatc
     1021 tttaccaaag tctcttgatt taaagagata aaaacaccac tgatccatta ttcctcattt
     1081 tggtaatgaa cctatgcggt tgaagatatt aatcagatgt ctaaatactt tagaaaaaaa
     1141 gacctttact aatatcttca atatttacac ccctattcta aaattaccac aagatagata
     1201 aatattgctt aatactgatt ataccttata gattaaaggt tttcaattaa acaataaatt
     1261 actttagtaa agtttagtaa aatataattg attttttact aaaaagataa taaaatgaaa
     1321 ctataaattt agttaatagc ataaatctaa catcagaaga taggataaat taaagaagta
     1381 atgtaattga ttacgaaaca aaatctcata ttaatattag cccataattt ttttattctc
     1441 atatatgttt aagtattaat taaatgtgac tttataaaaa ggttgcattt agttaatacg
     1501 attaacaact ttaatttaaa aaagcaataa ctctacaaag tgaaagtgag ggggtaagta
     1561 tgattaataa taaaccgtca ttagattggc tagaaaatcc ggaaatattt agagttaata
     1621 gaatagatgc tcattctgat acttggtttt atgaaaaatt tgaggatgtt aaattagaag
     1681 acaccatgcc tcttaagcaa aatttaaatg gaaaatggag attttcatat agtgaaaatt
     1741 catcattaag aattaaagag ttttataagg atgagtttga cgtaagttgg attgattata
     1801 ttgaagttcc aggtcatatt cagcttcaag gatatgataa atgtcaatat attaatacta
     1861 tgtatccttg ggaaggtcac gatgaattaa gaccacctca tatttcaaaa acatataatc
     1921 cggtgggaag ctatgtaaca ttttttgaag ttaaagatga actcaaaaat aagcagactt
     1981 ttatttcttt tcaaggtgtt gaaacagcat tttacgtatg ggtaaatgga gaatttgtag
     2041 gatatagcga agatacattt acaccatcag aatttgatat tactgattat ttaagagagg
     2101 gagaaaataa acttgcagtt gaggtttata aaaggagtag cgcaagttgg atagaagatc
     2161 aagatttctg gagattttca ggcatcttta gagatgtata tttatatgca gttccagaaa
     2221 ctcatgtaaa tgatatattt ataaaaacag atttatatga cgatttcaaa aacgcaaagt
     2281 taaatgctga acttaaaatg attggaaatt cagaaacaac agttgaaaca tatttagaag
     2341 ataaagaagg aaataaaata gctatatctg aaaagattcc gttctctgat gagttgactt
     2401 tatatttaga tgcgcaaaat ataaacctat ggagtgcaga agagcctaac ttatatacac
     2461 tttatatttt agtgaataaa aaagatggta atttaattga ggttgtaact caaaagatag
     2521 ggtttaggca ctttgaaatg aaggataaaa ttatgtgtct aaaatggaaa cgtattatct
     2581 ttaaaggcgt aaaccgtcac gaatttagcg caagacgtgg acgctcaatt acgaaagagg
     2641 acatgttgtg ggatattaag ttcttgaaac aacacaatat taatgctgtt agaacatcac
     2701 attatccaaa tcaaagttta tggtacagac tttgcgatga atacgggatt tatttaatag
     2761 atgaaacaaa tttagaaagc catggttcat ggcaaaagat ggggcagatt gaaccatcat
     2821 ggaatgtgcc aggaagtctt ccacagtggc aggcagcagt tttagatcga gcatcatcaa
     2881 tggttgaaag agataaaaat catccatctg tacttatttg gtcatgtggt aatgaatcct
     2941 atgcgggtga agatatttat cagatgtcta aatactttag aaaaaaagat ccttcacgtt
     3001 tagtgcacta tgaaggggta actagatgca gagaatttat gacacgacga catgaaagta
     3061 gaatgtatgc aaaggcagca gaaatagaag aatatcttaa tgataatccg aagaaacctt
     3121 atatacagct gcgatacatg cactcaatgg gtaactcaac tggtggaatg atgaaataca
     3181 cagaacttga agataaatat ttgatgtatc aaggtggatt catttgggat tacggcgatc
     3241 aggcgttgta tagaaaactt ccagatggaa aagaagttct agcttatgga ggagacttta
     3301 cagatcgtcc aacagactat aatttctctg gaaatggttt gatttatgca gatagaacta
     3361 tatcacctaa agcacaggaa gttaagtatc tatatcaaaa cgtaaaatta gaaccagatg
     3421 aaaaaggggt gactattaag aatcaaaatc tttttgttaa tactgataaa tatgatttat
     3481 actatatcgt tgaaagagat ggaaaactaa taaaagatgg ttatctaaat gtatctgtag
     3541 ctccagatga agaaaaatat atagaacttc caataggaaa ttacaatttt cctgaagaaa
     3601 ttgtacttac aacctcatta agattagcac aagctacact ttgggcagaa aaaggatatg
     3661 aaatagcatt tggacaaaag gttattaaag aaaaatcaga tatgaataat cataattcag
     3721 agtctaaaat gaagatcatt catggagatg taaacatagg ggttcacgga aaagatttca
     3781 aggctatatt ctctaaacaa gagggaggaa tcgtatcctt gagatataat aataaggagt
     3841 ttataacgag aacgccaaaa actttctatt ggagagcaac aacagataat gatagaggaa
     3901 atagacatga atttagatgc agtcaatggc tggctgctac tatggggcag aagtatgtgg
     3961 atttttcagt tgaggaattt gatgagaaga ttacattata ttatacttat caattgccaa
     4021 cagtgccatc tactaatgtt aagataactt atgaagtatc tggagaagga ataattaaag
     4081 taaatgttaa gtataaagga gttagcggat tacctgaatt gcctgtacta ggaatggatt
     4141 ttaaattatt agccgaattt aattcattta gctggtatgg aatggggcca gaagaaaact
     4201 atatagacag atgtgaaggt gcaaaacttg gaatatatga gagtacacaa tagaaaatct
     4261 atcaaggtat ttagtaccac aagaatgtgg taacaggata ggaactagat gggtagtagt
     4321 taaaaatcat aagaatgaag gtcttaaatt tacttatgtt aaagttccat ttgaatttag
     4381 tgttttacca tacagcagca tggaattaga aaattcactt catatagaag aattaccatc
     4441 tgttaatttt acacattgtg aatataatag gtaaacaaat gggtgttggc ggagatgcaa
     4501 tgctggggag caccatgata cctaaattct gtatagattc aagtaaggat ttagaatata
     4561 gttttataat ttctaaaatt atactacgca catatgggaa ctatagatat ccaaaacaaa
     4621 acttagactt atgcaataat ttacgaaagg acaggtactc tgttgtttcg gttactaaga
     4681 ataagttgag gctttctaac atcataagtt gcaccatttc agcatgctcc cgagacaagc
     4741 tcgtgacaag caaaaatgga acaacttatg atgaagaaat gcctgcaaca tattctttaa
     4801 tgtaacactg cacaaaagag tacctgtcct ttctgatata gcagattttt caagctataa
     4861 gtatatctca cgaaatcata aatattttga ttccgaaaag ctatgaaaat atcgctgaag
     4921 gttctaagca gctggttgtg tgcaccttag catgctccaa ctttcagttt gacaagctaa
     4981 aatggaacaa tctacagctc aagaaacttt aacagctcat tttcaaatgt tttctacaca
     5041 aatatattta tatttctagt gaagatatga aattaaattt ttagcgactt tgtaaatatg
     5101 ttaatctaat atacgaattc
//
LOCUS       ECOPNCB      1490 bp ds-DNA             BCT       19-JUL-1990
DEFINITION  E.coli nicotinic acid phosphoribosyl transferase (pncB) gene,
            complete cds.
ACCESSION   J05568
KEYWORDS    nicotinic acid phosphoribosyl transferase.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1490)
  AUTHORS   Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J.,
            Meesters,H.A.R. and Witholt,B.
  TITLE     Variation of cofactor levels in Escherichia coli: Sequence analysis
            and expression of the pncB gene encoding nicotinic acid
            phosphoribosyl transferase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1490; revises [1])
  AUTHORS   Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J.,
            Meesters,H.A.R. and Witholt,B.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by P.Terpstra, 31-MAY-1990.
FEATURES       from  to/span     description
    pept        216     1418     nicotinic acid phosphoribosyl transferase
                                 (pncB) (EC 2.4.2.11)
    mRNA        158  >  1490     nicotinic acid phosphoribosyl transferase mRNA
    signal      124      129     -35 region
    signal      146      151     -10 region
    rpt         170      185     inverted repeat
    binding     197      202     ribosome binding site
    signal     1426     1450     rho-independent transcription termination
                                 signal
    revision     56       57     gc in [2]; cg in [1]
    revision    191      191     t in [2]; tt in [1]
BASE COUNT      348 a    374 c    364 g    404 t
ORIGIN      
        1 tgttgcgtaa tgcgtatgca gaatcttcat cttttcaggt acaaacgcct ttattgctac
       61 atttttataa catacagcgc gtaatgccat cgaccagaaa ggtggcatat ggtgtgatcg
      121 gggttcaata aattgcgaaa caaggtatac tccagcagtt cctgaagatg tttattgtac
      181 taaacgctcc tgtacgagga cgctactgcg cacctatgac acaattcgct tctcctgttc
      241 tgcactcgtt gctggataca gatgcttata agttgcatat gcagcaagcc gtgtttcatc
      301 actattacga tgtgcatgtc gcggcggagt ttcgttgccg aggtgacgat ctgctgggta
      361 tttatgccga tgctattcgt gaacaggttc aggcgatgca gcacctgcgc ctgcaggatg
      421 atgaatatca gtggctttct gccctgcctt tctttaaggc cgactatctt aactggttac
      481 gcgagttccg ctttaacccg gaacaagtca ccgtgtccaa cgataatggc aagctggata
      541 ttcgtttaag cggcccgtgg cgtgaagtca tcctctggga agttcctttg ctggcggtta
      601 tcagtgaaat ggtacatcgc tatcgctcac cgcaggccga cgttgcgcaa gccctcgaca
      661 cgctggaaag caaattagtc gacttctcgg cgttaaccgc cggtcttgat atgtcgcgct
      721 tccatctgat ggattttggc acccgtcgcc gtttttctcg cgaagtacaa gaaaccatcg
      781 ttaagcgtct gcaacaggaa tcctggtttg tgggcaccag caactacgat ctggcgcgtc
      841 ggctttccct cacgccgatg ggaacacagg cacacgaatg gttccaggca catcagcaaa
      901 tcagcccgga tctagccaac agccagcgag ctgcacttgc tgcctggctg gaagagtatc
      961 ccgaccaact tggcattgca ttaaccgact gcatcactat ggatgctttc ctgcgtgatt
     1021 tcggtgtcga gttcgctagt cggtatcagg gcctgcgtca tgactctggc gacccggttg
     1081 aatggggtga aaaagccatt gcacattatg aaaagctggg aattgatcca cagagtaaaa
     1141 cgctggtttt ctctgacaat ctggatttac gcaaagcggt tgagctatac cgccacttct
     1201 cttcccgcgt gcaattaagt tttggtattg ggactcgcct gacctgcgat atcccccagg
     1261 taaaacccct gaatattgtc attaagttgg tagagtgtaa cggtaaaccg gtggcgaaac
     1321 tttctgacag ccctggcaaa actatctgcc atgataaagc gtttgttcgg gcgctgcgca
     1381 aagcgttcga ccttccgcat attaaaaaag ccagttaata tcatcaggga gctaatcggc
     1441 tccctttttt tacctttaat tccgaaatct ttcgctgcat ttgcgaattc
//
LOCUS       NEUCCON13    2728 bp ds-DNA             PLN       19-JUL-1990
DEFINITION  N.crassa conidiation-specific protein (con-13) gene, complete cds.
ACCESSION   M35120
KEYWORDS    conidiation-specific protein.
SOURCE      N.crassa (strain 74-OR23-1A) DNA, clone pCon10a.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 2728)
  AUTHORS   Hager,K.M. and Yanofsky,C.
  TITLE     Genes expressed during conidiation in Neurospora crassa: Molecular
            characterization of con-13
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.M.Hager, 12-JUN-1990.
            
               Author address:K.M.Hager:
               Dept. of Physiology
               UCLA Medical School
               10833 Le Conte Avenue
               Los Angeles, CA 90024-1751
               E-mail:COTRAN%VXBIO.SPAN@STAR.STANFORD.EDU
FEATURES       from  to/span     description
    pept       1009     1275     conidiation-specific protein (con-13), exon 1
               1333     1847     conidiation-specific protein, exon 2
               1910     2150     conidiation-specific protein, exon 3
    pre-msg     922     2367     con-13 mRNA and introns (alt.)
    pre-msg     927     2367     con-13 mRNA and introns (alt.)
    pre-msg     936     2367     con-13 mRNA and intron (alt.)
    pre-msg     946     2367     con-13 mRNA and intron (alt.)
    IVS        1276     1332     con-13 intron A
    IVS        1848     1909     con-13 intron B
    site       2364     2367     polyadenylation site
BASE COUNT      653 a    695 c    720 g    660 t
ORIGIN      Linkage group IV.
        1 gatctcatca tctgaaacgc cgcctgagtc aatgactctt ggcaatcggg ctctgcgtcc
       61 ggctagatag acagcgtccc actgatacag acttggtaag ctgccacagt tgccaagttt
      121 ttatatcgat tattctttga acttccaagg acagtcttca agggcgcttt ctgtctcagc
      181 atcgggagat atgacgcccg tggttcgtat accaatggtt cggcactaag gcgctgcatt
      241 tgactcggag atattgacgc ctgccccctt ttgagaggag actgagtgag cgaggcccaa
      301 tactatcacc acagttgcgg ttagctgccg agacttatcg gtcaacaccg aaatattggc
      361 ccagaagggc aacaaaacgg gctgtcgatg gcttgcaacc attgatatcc ctgattgcca
      421 ttcctacact accgcccatt cttcattcaa acctgactct cttactccct ttacagtcta
      481 gcagatctgg acgtacctgc atgtaatgcg gccaacgggg ctggtaagct gaacacacca
      541 ttcggagcgg ctggcaagtc tgtcatgccc gatcgacagc acatgtacta gactatctta
      601 agcctagttc cgtgttcaga aacatccggt ttgattgcga atcaacagta cattgatgtt
      661 catccaccgg actctaaacc gatcagctaa ttgttggcgg agcggagttc atcgcgggcg
      721 taggaaacaa ggttgatgtt acccgtaaat ggaaatcgtg cttcgctcac ggcgttgctc
      781 cgaagtaggg tgaagaggtc cgttggctgt gatggtttgc gctggtgtgt gtcaacgctt
      841 agtgatgctg gtgatccaac tccgatccaa atgacaaagc aatgcatata agaaggactg
      901 ggcatcacca acagcgcaac ggcggcagac acgaagccct agctcgacaa gcagccttca
      961 taccccgacc aaaaagtcac acttgtcgta ccgtaacctc gtcgcaagat gccccaggct
     1021 catttcttcg cgttgctgct tgcagccgtt gtaccggccg ttttggcgga cggtcccccg
     1081 gaatcgatgg gcgagaagtt cagcggcctc aacgttctgg atgggaacgg cggacttcaa
     1141 agtttgaccc cgacacccta caccataagt caatggcctt ggggtactgt acccaagctg
     1201 tgctatgaca cgtctgtcaa caacaagtac tgcaacccgt acgatctcga agtatacgat
     1261 gtcagataca cggatgtagg taaaagactt gcctcggatt cggaacctgt gcttacctta
     1321 acttgacaat agtgccccat tcccaccacc gtctgccgat gcaagaactc acctatggcc
     1381 atagacacca ttgcgcagcg tgtcggccaa ctccctgtca aggctcgcca gtataatggc
     1441 tatgtgtcca gctttgcggg agacatgtgc tcagcctaca gcgatagctt caacaactac
     1501 ttctttggcg actgcggcaa ttccgagtcc gtcttcttcc atgagctcag ccacaacctt
     1561 gaccgtcacg ttgcaggggc gtccatcaac gattggtact ccctttcgca agactggaag
     1621 gataccgttg ccaaggacac ttgcgtcgca gaccactatt ccaaggccag ctggctcgag
     1681 gcatatgccc aggtgggagt catggctgga tacgatgcta cggtacagtc tatctatacc
     1741 caaaatgtcg gctgtatggt caatcaggtc aagaaggtgg ttggacagtt gaacagtgtc
     1801 tggcgtaaac agcctgggca gatgtgcgat cgttactgga tcaaggagta agtttctttc
     1861 aacaagaccc attttcttga tgaccctgtg ctgaccggaa tgtaaacagc accacggttt
     1921 gcatgggacc tgatgcggaa gccagtggcc actgtcaagc atccaaagct gatgtcgcgg
     1981 cggagtctgg tggtgtaaac ccagtgttgc cggacgggca gcagaagaag cacgacgcct
     2041 tggtcaagga gcttcagcgt cacgccgagg ccgcggccgg catttcttcc ggaaaaccgg
     2101 cggccgatag aaagaccaag ggtaagaagg gtaccaaatt cagggtctga agcgggaact
     2161 atgatcgatt ccaggtcctg ggctctagct gtgagttcag tcagggtgtt gaggaagttg
     2221 cgaggcctca gttgtgagcg acgtcatcaa accgtctcct tttgggataa tgataacctt
     2281 ttatttctgg ataactggga caggttaggc tgtctttgtc gatagactag gtacgtaaga
     2341 attgatttga tgcttgttcg atgcttttaa gttgttgtcg cttgtggttg cgaggtagtc
     2401 ggcaggtttg tttggataga cgggagacgc ccactcgcac ccagggcgat gaataacgaa
     2461 ggccgatggc tctttccatg tgggaaatac acaagtctgg cattgtccac ttgtttgtct
     2521 tcgagcgggg ttacgatttc tgtcaagccc tttgctcctt tcttccgaga acaaaggaag
     2581 ttttcgatcc agatcgccaa catccgaaaa gggaggaata gttcgatcga tgtaccttga
     2641 cggctcggcc atcgatctga tctgcatttc ccactctgga ttccagggga agggtcatat
     2701 gatggaaacg agatcgaaac ccattgag
//
LOCUS       VVUVVHAB     2237 bp ds-DNA             BCT       19-JUL-1990
DEFINITION  V.vulnificus cytolysin (vvhA) and vvhB gene (pot.), complete cds.
ACCESSION   M34670
KEYWORDS    cytolysin; cytotoxin; hemolysin; toxin.
SOURCE      V.vulnificus (strain EDL174) DNA, clone pCVD702.
  ORGANISM  Vibrio vulnificus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 2237)
  AUTHORS   Yamamoto,K., Wright,A.C., Kaper,J.B. and Morris,J.G.
  TITLE     The cytolysin gene of Vibrio vulnificus: Sequence and relationship
            to Vibrio cholerae El Tor hemolysin
  JOURNAL   Infect. Immun. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.B.Kaper, 29-MAY-1990.
FEATURES       from  to/span     description
    pept        745     2160     cytolysin (vvhA) precursor
    sigp        745      804     cytolysin signal peptide
    matp        805     2157     cytolysin
    pept        237      743     pot. cytolysin (vvhB)
    signal       55       60     -35 region
    signal       80       85     -10 region
    signal       87       92     -35 region
    signal      110      115     -10 region
    signal      184      189     -35 region
    signal      206      211     -10 region
    signal     2185     2219     transcription termination signal
    binding      54       69     CRP binding site
    binding      59       74     Fur binding site
    binding     185      199     Fur binding site
    binding     226      231     vvhB ribosome binding site
    binding     730      735     vvhA ribosome binding site
BASE COUNT      639 a    498 c    509 g    591 t
ORIGIN      
        1 tatattagat cacttttaaa acaataatag atcagatatt aatctgttga ttttgtgata
       61 atgagccaaa aaatactttt attttattta tatgaaatat tttcaggatt attaataaat
      121 agccaacagg attttggtgc atatctattc tcaaggacga accaaacaat ctccatacaa
      181 atattaatgt tatggagaaa ataacaataa taacccttac tcgtaatgag gaatctatgc
      241 ttaataacaa aaatagaaat gtaggacgcc ttaccctact ctgctgtttg tttgcggcga
      301 atacttttgc tgatgttcaa attttgggca gcgaaagtga gctttcacaa accattgccg
      361 atcagtacca acaaaatgtc acgctgttta acggccagct aaacagtaat gatgtgttgt
      421 atgtcaatgt aggaacagca accgatgacg aaatcactca agcaaaaagt catatcatct
      481 ccggtagcac cgtggtgatt gatttgactc aaattgctgg tgacgacgca aggcttgatt
      541 ggagccaaaa actcactggt ttaggactgt cagcgcctgt tgtggttacg ggggtttatc
      601 aaggcgacgc cttagtcaat gcgattgtca gcgatgtcac cgacgagaat gacaacccaa
      661 tcaacgatcc ccaagccgag ttagagagcg ttaaactttc tctcactcat gccctagacc
      721 gcttccaatc tgagggaaaa taagatgaaa aaaatgactc tgtttaccct ttctctttta
      781 cgtaccgcgg tacaggttgg cgcacaagaa tatgtgccga ttgttgagaa acctatttac
      841 atcaccagct caaagattaa gtgtgtgttg cacacaagcg gtgatttcaa cgccacacga
      901 gactggtgta atgcgggtgc ttccatcgat gttcgcgtca atgtggcaca aatgcgctcg
      961 gtacaatcgg caacgtcaga tggttttact cctgacgcca aaattgtccg tttcaccgtc
     1021 gatgccgaca agcctggcac gggtattcat ttggttaacg agctacagca agatcacagc
     1081 tggttccaga gttgggcaaa ccgccgcact tacattggtc cattcgccag cagttacgac
     1141 ctttgggtga aacccgtttc tggttacaca ccgaaaaaag cccgtgacct accgcagaat
     1201 gagaacaaaa actaccaaca ccgcgatact tacggttact ccatcggtat taacggcaaa
     1261 gtaggtgcgg aagtgaacaa agacggcccg aaagtgggtg gcgaagtcag tggctcattt
     1321 acctacaact actcgaagac cttggtgttt gatacaaaag actatcgcat caacaaccgt
     1381 tcatcattga gtgattttga tatttcattc gagcgtgaat ttggggaatg tgatgaactg
     1441 cgccgccaag agcttggatg ctatttcacc gccgctcact ggggcagtgg ctgggtattt
     1501 gataagacga agttcaaccc tatctcttat tccaacttca aaccgaacta tgacgttttg
     1561 tacgaagcgc ccgtgtctga aactggcgta acggattttg agatgggcgt gaaactcaac
     1621 tatcgtgcac gctttggtac cgttcttcct tcagcgctgt tttcggttta cggctctgcg
     1681 ggctcgtcaa ccaacagcag tactgtgaaa caacgtattc gcatcgactg gaatcaccca
     1741 ctgtttgaag cggaacgaca cgttacactg cagtcactga gcaacaacga tctctgcctg
     1801 gatgtttatg gtgagaacgg tgacaaaacg gttgcgggtg gttcggttaa cggctggagc
     1861 tgtcacggca gttggaacca agtttggggc ctagataaag aagaacgtta tcgtagccga
     1921 gtggcatccg atcgttgttt gaccgtaaac gcagacaaaa cgctcacagt cgaacagtgt
     1981 ggtgcgaact tagcacagaa atggtattgg gaaggcgata agctcattag ccgctatgtt
     2041 gatggcagta atactcgcta ccttctaaac attgttggtg gtcgtaatgt tcaagtaacc
     2101 cctgaaaatg aagcaaatca ggcgcgttgg aaacccacat tacaacaagt caaactctag
     2161 gctctgttga ccttagcgat atccaaacgc tccctgtata ctagggagcg tttttcttta
     2221 ttcgccatct attcgtc
//
LOCUS       CHKMTPEPCK   3571 bp ss-mRNA            ORG       19-JUL-1990
DEFINITION  Chicken mitochondrial phosphoenolpyruvate carboxykinase (PEPCK-M)
            mRNA, complete cds.
ACCESSION   J05419
KEYWORDS    phosphoenolpyruvate carboxykinase.
SOURCE      Chicken 3-day old liver mitochondrion, cDNA to mRNA.
  ORGANISM  Mitochondrion Gallus domesticus
            Unclassified.
REFERENCE   1  (bases 1 to 3571)
  AUTHORS   Weldon,S.L., Rando,A., Matathias,A.S., Hod,Y., Kalonick,P.A.,
            Savon,S., Cook,J.S. and Hanson,R.W.
  TITLE     Mitochondrial phosphoenolpyruvate carboxykinase from the chicken:
            Comparison of the cDNA and protein sequences with the cytosolic
            isozyme
  JOURNAL   J. Biol. Chem. 265, 7308-7317 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.L.Weldon, 17-MAY-1990.
FEATURES       from  to/span     description
    pept         28     1950     phosphoenolpyruvate carboxykinase precursor
                                 (EC4.1.1.32)
    matp         28      126     phosphoenolpyruvate carboxykinase signal
                                 peptide
    matp        127     1947     phosphoenolpyruvate carboxykinase
    rpt        2025     2123     large repeat copy A
    rpt        2124     2223     large repeat copy B
    rpt        2224     2316     large repeat copy C
    rpt        2317     2416     large repeat copy D
    rpt        2417     2508     large repeat copy E
    rpt        2519     2543     small repeat copy A
    rpt        2547     2577     small repeat copy B
    rpt        2580     2611     small repeat copy C
    rpt        2745     2777     small repeat copy D
    rpt        2780     2811     small repeat copy E
    rpt        2847     2878     small repeat copy F
    rpt        2040     2050     GCCAAGATGGC 11 bp repeat
    rpt        2105     2115     GCCAAGATGGC 11 bp repeat
    rpt        2205     2215     GCCAAGATGGC 11 bp repeat
    rpt        2298     2308     GCCAAGATGGC 11 bp repeat
    rpt        2072     2082     TCCAAGATGGC 11 bp repeat
    rpt        2139     2149     TCCAAGATGGC 11 bp repeat
    rpt        2265     2275     TCCAAGATGGC 11 bp repeat
    rpt        2332     2342     TCCAAGATGGC 11 bp repeat
    rpt        2424     2434     TCCAAGATGGC 11 bp repeat
    rpt        2524     2534     TCCAAGATGGC 11 bp repeat
    rpt        2558     2568     TCCAAGATGGC 11 bp repeat
    rpt        2657     2667     TCCAAGATGGC 11 bp repeat
    rpt        2691     2701     TCCAAGATGGC 11 bp repeat
    rpt        2724     2734     TCCAAGATGGC 11 bp repeat
    rpt        2758     2768     TCCAAGATGGC 11 bp repeat
    rpt        2792     2802     TCCAAGATGGC 11 bp repeat
    rpt        2825     2835     TCCAAGATGGC 11 bp repeat
    rpt        2859     2869     TCCAAGATGGC 11 bp repeat
    rpt        2926     2936     TCCAAGATGGC 11 bp repeat
    rpt        2983     2993     TCCAAGATGGC 11 bp repeat
    rpt        3023     3033     TCCAAGATGGC 11 bp repeat
    rpt        3057     3067     TCCAAGATGGC 11 bp repeat
    rpt        3114     3124     TCCAAGATGGC 11 bp repeat
    rpt        3234     3244     TCCAAGATGGC 11 bp repeat
BASE COUNT      589 a   1077 c   1197 g    708 t
ORIGIN      
        1 tcctcgccta tactgggaca atttataatg ttttggttaa gagggggggc gcagagttgt
       61 aggggggggg aaactgagga cagaatgcag cgcgggatgt ggggcgtggg cctggcccgg
      121 cgcaggctga gcacgtcgct gtcggcgctg ccggcggccg cgcgggattt cgtggaggag
      181 gcggtccggc tgtgcaggcc cagggaggtt ctgctgtgcg atgggtccga ggaggagggg
      241 aaggagctgc tcagagggct gcaggacgac ggggtgctgc atccgctgcc caaatacgac
      301 aactgctggt tggctcgcac cgacccccgg gacgtggctc gggtgcaaag caagacggtg
      361 ttggtaaccc ccgaacagag cgacgccgtc cccccacccc ccccatccgg gtccccccaa
      421 ttggggaact ggatgagccc caatgctttc caggcagctg tgcaggagcg tttccccgga
      481 tgcatggcag gccgccccct ctacgtcatc ccattcagca tgggcccccc cacgtccccc
      541 ttggccaaac tgggggttca ggtgaccgac tccccctacg tggtgctctc catgcgcatt
      601 atgacccgcg tgggccccgc ggtgctgcag cgcctcgacg acgacttcgt ccgctgcctc
      661 cactctgtgg ggcggcctct gcccctcacc gagcccctgg tgagctcgtg gccgtgcgac
      721 cggtcccgtg tcctggttgc ccacatcccc tcggagcgcc ggatcgtctc cttcggttcg
      781 ggatacggcg gcaattcgct gctgggcaag aagtgcttcg cgctggccat cgcgtcccgc
      841 atggcccagc agcagggctg gctggccgag cacatgctga ttttgggggt gacgtccccc
      901 agcggtgaga agcgttacat ggcggcggcc tttcccagcg cctgcgggaa aaccaacctg
      961 gccatgatga cccccagcct gccgggttgg cggatccact gcgttgggga cgacattgcg
     1021 tggatgaagt tcgatgatcg agggcgcctc cgcgccatca accccgagcg tggctttttt
     1081 ggggtggccc cggggacgtc gtcgcgcacc aaccccaacg ccatggccac catcgcccgc
     1141 aacaccatct tcaccaacgt ggggctgcga agcgatggcg gcgtctactg ggacggcctg
     1201 gatgagccca cggagcccgg ggtcacctac acctcctggc tgggcaagcc gtggaagcac
     1261 ggtgaccccg agccgtgcgc ccaccccaac tcccgtttct gcgccccggc cgatcagtgc
     1321 cccattatgg acccgcgttg ggacgacccg gaaggagttc ccatcgacgc catcatcttc
     1381 ggggggcgcc gaccccgcgg agtgccgttg gtggtggagg cctttgggtg gcgccacgga
     1441 gttttcatgg gcagcgcaat gaggagcgaa gccaccgccg ccgccgagca caaaggcggc
     1501 cgtttgatgc acgacccctt cgccatgagc ccctttttcg gctacaacgc ggggcgttac
     1561 ctggaacatt ggctgtctac gggtctccgg agcaacgccc gcctcccccg tctgttccac
     1621 gttaattggt tcctccgaga taacgaaggt cgcttcgttt ggcccggctt cggtcacaac
     1681 gcccgcgtct tggcttggat cttcgggagg atccagggga gggacactgc ccggcccacc
     1741 cccatcggtt gggtacccaa agaaggggat ttggacctgg gggggctgcc gggggtcgat
     1801 tactcccaac tgttccctat ggagaagggc ttttgggagg aggagtgcag gcagctgagg
     1861 gagtattacg gggagaactt cggggccgat ctgcccaggg atgtcatggc ggagctggag
     1921 ggcctggagg agagggtgag gaagatgtga ggggtcgggg tggggctgag ggaaaggatg
     1981 gggggaggtt gggggggctg tggggggcga ggtgggggct ggcggtgggg gttggtgagg
     2041 ccaagatggc ccatcggtat gggttggccg ttccaagatg gctgccgccg ctatgagttg
     2101 gtcagccaag atggccgccg acagtgtggg ttggtgggtc caagatggct gccatcggta
     2161 tgggttggcc gttccaagat gctgccgccg ctacgagttg gtcagccaag atggccgccg
     2221 acagcgtggg tccaagaagg ccgccatcat tacgggttgg ccgttccaag atggctgccg
     2281 ccactacgag ttggtcagcc aagatggcca ccgacagtgt gggttggtgg gtccaagatg
     2341 gctgccatca gtatgggttg gccattccaa gatcgtgccg ccgctacgag ttggtcagcc
     2401 aagatggctg ccgacagcgt gggtccaaga tggccgccat cattacgggt tggccgttcc
     2461 aagatcgtgc cgccactacg agttggtcag ccaagatggc caccgacagc gttggttggt
     2521 gggtccaaga tggctgccat cattgtgggt tggccgttcc aagatggccg ccatcactgt
     2581 gggttggccg ttccaaggtg gctgccatct ttgtgggtcg gtgggcccat gatggctgcc
     2641 atcgtgggtt ggctgttcca agatggctgc cagcagcgtg agatgactgt tccaagatgg
     2701 ctgccaccac tatcagttgg ccatccaaga tggccgccaa cagcgtgggt tggtgggtcc
     2761 aagatggccg ccatcactgt gggttggccg ttccaagatg gctgccgcca ctatgagttg
     2821 gccatccaag atggctgcca gcaggatggg ttggtgggtc caagatggct gccaccataa
     2881 tgcattggcc agacaagatg gccaccagca gcatgggatt gccgatccaa gatggccgcc
     2941 ctacctggga aggagccccc tgcctgctca ttggctgagc gctccaagat ggctgccatt
     3001 ccacgtcctc gttggttgac catccaagat ggctgccacc cccacagagt ggccgatcca
     3061 agatggccgc cccgcctggg agggatcctc ctgccctctc attggctgag cgttccaaga
     3121 tggctgccat tccacgtcct cattggttga ccatccacga tggctgctgc cttcctctcc
     3181 attggctacc catctaagat ggctgctctc ctttgtcctg attggctggc caccccaaga
     3241 tggctgctcg tgcccatcct ggctgctcat tggttcctgc agagctgtgg tgcctcccaa
     3301 ttggtcgggg ccatttgata gtgggacttc tgggcgccat cttggagtga cgtcacactg
     3361 tgagcaacgc tgcgttccta ctggcttgcc gcagcctccc atgaccaatg gctgtgtccg
     3421 cttggttgcg aacgccctcg cctaatcaca gcgtcccgtt ggccgagcgg agcgtcctga
     3481 ttggccgagc tcttcccctt gtccaaacgg cagcttccca ttggctgtgc tcatctcaat
     3541 ggcctatcag agccgcccgt ggacctcaga a
//
LOCUS       HUMPANMU     4139 bp ss-mRNA            PRI       19-JUL-1990
DEFINITION  Human pancreatic mucin mRNA, complete cds.
ACCESSION   J05582
KEYWORDS    pancreatic mucin; tumor-associated antigen.
SOURCE      Human pancreatic tumor cell line HPAF-CD11, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4139)
  AUTHORS   Lan,M., Batra,S., Qi W,-N., Metzgar,R. and Hollingsworth,M.
  TITLE     Cloning and sequencing of a human pancreatic tumor mucin
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Hollingsworth, 08-JUN-1990.
FEATURES       from  to/span     description
    pept         74     3841     pancreatic mucin precursor
    sigp         74      136     pancreatic mucin signal peptide
    matp        137     3838     pancreatic mucin
    mRNA    <     1     4139     pancreatic mucin mRNA
    rpt         453     2880     tandem repeat
    rpt         299      452     5' degenerate tandem repeat
    rpt        2881     2957     3' degenerate tandem repeat
    signal     4118     4123     poly-A signal
BASE COUNT      632 a   1910 c   1055 g    542 t
ORIGIN      
        1 ccgctccacc tctcaagcag ccagcgcctg cctgaatctg ttctgccccc tccccaccca
       61 tttcaccacc accatgacac cgggcaccca gtctcctttc ttcctgctgc tgctcctcac
      121 agtgcttaca gttgttacag gttctggtca tgcaagctct accccaggtg gagaaaagga
      181 gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg ctgtgagtat
      241 gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca ctcagggaca
      301 ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca cctggggaca
      361 ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc cgccagccca
      421 cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc ccccagccca
      481 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      541 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      601 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      661 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      721 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      781 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      841 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      901 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      961 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1021 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1081 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1141 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1201 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1261 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1321 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1381 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1441 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1501 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1561 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1621 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1681 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1741 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1801 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1861 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1921 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     1981 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2041 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2101 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2161 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2221 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2281 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2341 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2401 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2461 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2521 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2581 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2641 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2701 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2761 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2821 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
     2881 tggtgtcacc tcggccccgg acaacaggcc cgccttgggc tccaccgccc ctccagtcca
     2941 caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg tgcacaacgg
     3001 cacctctgcc agggctacca caaccccagc cagcaagagc actccattct caattcccag
     3061 ccaccactct gatactccta ccacccttgc cagccatagc accaagactg atgccagtag
     3121 cactcaccat agctcggtac ctcctctcac ctcctccaat cacagcactt ctccccagtt
     3181 gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc agtttaattc
     3241 ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca tttctgaaat
     3301 gtttttgcag atttataaac aagggggttt tctgggcctc tccaatatta agttcaggcc
     3361 aggatctgtg gtggtacaat tgactctggc cttccgagaa ggtaccatca atgtccacga
     3421 cgtggagaca cagttcaatc agtataaaac ggaagcagcc tctcgatata acctgacgat
     3481 ctcagacgtc agcgtgagtg atgtgccatt tcctttctct gcccagtctg gggctggggt
     3541 gccaggctgg ggcatcgcgc tgctggtgct ggtctgtgtt ctggttgcgc tggccattgt
     3601 ctatctcatt gccttggctg tctgtcagtg ccgccgaaag aactacgggc agctggacat
     3661 ctttccagcc cgggatacct accatcctat gagcgagtac cccacctacc acacccatgg
     3721 gcgctatgtg ccccctagca gtaccgatcg tagcccctat gagaaggttt ctgcaggtaa
     3781 cggtggcagc agcctctctt acacaaaccc agcagtggca gccgcttctg ccaacttgta
     3841 gggcacgtcg ccgctgagct gagtggccag ccagtgccat tccactccac tcaggttctt
     3901 caggccagag cccctgcacc ctgtttgggc tggtgagctg ggagttcagg tgggctgctc
     3961 acagcctcct tcagaggccc caccaatttc tcggacactt ctcagtgtgt ggaagctcat
     4021 gtgggcccct gaggctcatg cctgggaagt gttgtggggg ctcccaggag gactggccca
     4081 gagagccctg agatagcggg gatcctgaac tggactgaat aaaacgtggt ctcccactg
//
LOCUS       DOGSRP9A     1271 bp ss-mRNA            MAM       19-JUL-1990
DEFINITION  C.lupus signal recognition particle 9 protein (SRP9) mRNA,
            complete cds.
ACCESSION   M34952
KEYWORDS    signal recognition particle protein.
SOURCE      C.lupus (strain Madin-Darby) kidney, cDNA to mRNA.
  ORGANISM  Canis lupus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 1271)
  AUTHORS   Strub,K. and Walter,P.
  TITLE     Assembly of the alu domain of the signal recognition particle
            (SRP): Dimerization of the two protein components is required for
            efficient binding to SRP RNA
  JOURNAL   Mol. Cell. Biol. 10, 777-784 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Strub, 07-JUN-1990.
FEATURES       from  to/span     description
    pept         59      319     signal recognition particle 9 protein
BASE COUNT      372 a    232 c    254 g    413 t
ORIGIN      
        1 gcccacctac cacctacctc gggcggccag aaaccgatgc ggggggccca gcggcaagat
       61 ggcgcagtac cagacttggg aggagttcag ccgcgcggcc gagaaactct acctcgccga
      121 ccctatgaag gcacgtgtgg ttctcaaata taggcattct gatgggagtt tgtgtattaa
      181 agtaacagat gatttagttt gtttggtgta tagaacagac caagcccaag atgtaaagaa
      241 gattgagaaa ttccacagtc aactaatgcg actcatggta gccaaggaat cccgcagtgt
      301 tgccatggaa acggactgac gggtttgaaa tgaagatcct tcatgttctt aggagtaaat
      361 atcttttgaa tcagaaaaag tgttgggaaa gaaaatatgt aactaagtgg gctcttcaga
      421 agtggggaga tcattttttg tactttgttt tttaatgttt actttagaga gctaggaacg
      481 tacatgcttt cggtgaaagc ctttatttat ttttggaaat tcagtaaaag gcagttcttc
      541 cttaaattta gttaatctgt ctttaaaaga aaattaaatt taaccatttt gctggattgt
      601 tgtatttctt ttggagcata aaatttgtgc tattgatgac caacaaacaa acataaaata
      661 tagtaattgg aattacctgt gcacagcagt gtacctatgt ataatatagt aattagtctc
      721 agttctatct aaaagtaatc atggaaatga gtatgcttta cctaaaactt ttccaaactt
      781 aaactgtatt tttgaatgta aggaatttgt agtatcgtta gcttgttgag cagggacttg
      841 ctttaatcta gtttccagtg ctcaaaaaca actgcattta cttgaagtgc atgaacagat
      901 gatcactagt ggactgaacc accatattac gcaagtattt gcctgcagat ttcccatcta
      961 tattttctca gaagggctaa agattatttg aactgttaaa tctttgccat atgtctgtgc
     1021 cactcctgcc tgtttctccc tgtacttaac caaggtgttg aacatgactg tcacaactgt
     1081 tagttaaatc tttgcatatg tctgtgccac tcctgcctgt ttctccctgt acttaaccaa
     1141 ggtgttgaac atgactgtca caactgttat ttttttcatt aagtcagaag gatatcattt
     1201 gatatttatc atataattgt aacctcagtt ttaccatctc aatgtaatgt tcacatgttg
     1261 ttcctacatt a
//
LOCUS       PCHPMMMSA    6409 bp ss-mRNA            INV       19-JUL-1990
DEFINITION  P.chabaudi major merozoite surface antigen mRNA, complete cds.
ACCESSION   M34947
KEYWORDS    major merozoite surface antigen; surface antigen.
SOURCE      P.chabaudi chabaudi (strain IPP-C1), cDNA to mRNA, and DNA, clone
            IPP-C1/C.
  ORGANISM  Plasmodium chabaudi
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 6409)
  AUTHORS   Deleersnijder,W., Hendrix,D., Bendahman,N., Hanegreefs,J.,
            Brijs,L., Hamers-Casterman,C. and Hamers,R.
  TITLE     Molecular cloning and sequence analysis of the gene encoding the
            major surface antigen of Plasmodium chabaudi chabaudi
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Deleersnijder, 06-JUN-1990.
FEATURES       from  to/span     description
    pept        667     6024     major merozoite surface antigen precursor
    sigp        667      723     major merozoite surface antigen signal peptide
    matp        724     6021     major merozoite surface antigen
    rpt         324      365     tandem repeat region
BASE COUNT     2681 a    991 c    922 g   1814 t      1 others
ORIGIN      
        1 tctagataat atattttttg tatgcatgct aaaattaatt atacatatat taaatagatt
       61 tgtgcgaatc tttatgtgtg caagttattt tttttaataa taattatcca tataccacat
      121 tatttatttg tgtaccgtta aatatttatt ttctaagcga tttttctcct taaattatat
      181 tttttntgat catttttttt ttttttttgg aaaatcggga gcataaaaaa tatatattac
      241 actttataaa ttttttatac acatttgttt attttatttt atatatattt tttaacacat
      301 ttttattttg aaatgatatg atcaattata aaaaaacaat aacataataa tagtaataat
      361 ttttttttgt acgatatata aaattatgca tttttatttt tatagtaagt taaaaagtgt
      421 attatatgta cgtattttgt ttaacagaac ggaaattaga aaaaacacaa taaaacttat
      481 atatatatgt gtaattagtg tatgtgtata tatttgtcaa cattataaat gatataattg
      541 aacttcaata tttattttta cacaaattag tactaatata aaatgcaaaa gtaatgtacc
      601 tttgtgtgta ttaattttag cattataatt tattccactc tgtatattag ttaagtttcg
      661 ttgaaaatga aggcgatcgg acttttgttt tctttcgttt tttttgctat atattgcaaa
      721 tctgaaacaa taggagttta caatgatctc gttcataagt tagaaaagtt agaagaatta
      781 tcagtagaag gattagaact atttcaaaaa agtcaagtaa ttgtaaatgc acaatcacca
      841 gaaacacctg ttgatccatt tacaaaccct gaatttgcac aaaagttaca accatttatt
      901 ttaaaatttg aagaattagg atttacagaa caaacagagt tagtcaattt aataaaaact
      961 ttaggcccaa ataaatatgg actaaaatat ttaattgaaa gtaaagaaga atttaacgaa
     1021 ttaatgcacg caataaattt ttactatgac gtgcttagag ataaattgaa tgatatgtgt
     1081 gcaaataact attgtgaaat tcctgaacat cttaaaatta atgttgaaga aatcgaaatg
     1141 cttaagaaag ttgtcttagg ctatagaaaa ccaattgaaa atattcaaga tgatcttgta
     1201 aaattagaag aatatattgc aagaaataaa gcaactgctg aaaccttaaa cactcttatt
     1261 actgaagaaa caaaaaaaat aacacctgaa gaagaaacag attgcaacga tactaattgc
     1321 gacaatacta aatatggaaa gaaaaaagca atatatcaag ctatgtacaa tgttatattt
     1381 tacaaaaagc aattagctga aataaaaaaa gtcatcgaag tcttagaaaa gagagttgct
     1441 acattaaaga agaacgaagc cataaaacca ttgttacaac aaatcgaagc tatcagaggt
     1501 ccacctgctg tcactgaagg acaaatagct acagaaggaa gcagcgaaga aacaaaacaa
     1561 aatagtacag aatcatctaa cacaaaaacg actactactg acaaagctgt tacaacccaa
     1621 accgctacta aagcaactgg tacagaaaca aatactggta cagaaacaaa tactggcaca
     1681 gaaacaaata ctgccacagg aacaactact gccacaggaa caactactgc cacaggaaca
     1741 cctactgtca ctgaaccagt tcaagtgcca gccgttcaag ttcttacaga agaagaaaaa
     1801 gcaaaaaaaa tagctgaact ttatgctcaa attaaagaaa ttgcaaaaac tataaaattc
     1861 aatttagacg gaatatttgt cgatccagtt gaattagaat attacaaaaa agaaaaaaaa
     1921 aatgaaagct gccattcaac ttcatcttgc cacaaaaata aaacacctga aactgtaata
     1981 ccattaaatg tacgttatcc aaatggtatt agctacccat taactgaaga agttgtttac
     2041 agcaaaattg ctcataatgc cgctgaaaca acttatggtg atttaacaaa tgtcgataat
     2101 acagccataa cagaagattt aaccacaaat gaacaagcaa gaaaaaattt aattaaagct
     2161 attaaaaaga aaatcgaagc agaagaacaa aaattagtag aattaaaaga tgattatgat
     2221 actaaacttg cagcatttaa tggacaaaaa actccattca aagaagcagc taaaaaattt
     2281 tatgaatcca aatttagaaa taaattgact actgacattt ttgacgattt taaaacaaaa
     2341 agaactgaat atatgaacaa gaaagctgca ttagtaggtt gtgaatatgg aaatactcaa
     2401 caactcatta ataaattaaa taaacaactt aattatttac aagattatgg attaagaaaa
     2461 gaaatagtta acactgaaat tgaatatttt tcaaacaaaa aatcagaatt acaatataat
     2521 attaatagat tagcaaatgc tgttcaagca aaacaaaata tattagttgc atcaaaacat
     2581 attccacttt caacacttgt tgaattacaa atccaaaaat ctttattaac aaaactaatt
     2641 gaacaattaa ataaaactga attttcttta aataaagctc acttaaaaga caagatatac
     2701 gttccacaaa catatggtaa agaaggaaaa ccagaaccat actacttaat agctataaaa
     2761 aaagaaattg acagacttgc caaatttatt cctaaaattg atgatatgat tgagaaagag
     2821 aaacaaaaaa tggaacaaga acatgtagct accggagaat ctgaacaagc ctcttctgcc
     2881 tctggtactg gatcatccac agaaaccaca tcacaaacag caccagccgt tccagctgca
     2941 cccgcaccag cagaaaaggc aaaagaagga acagaatcaa cagaagaaac cccagcagca
     3001 tcaaaaccag ccgaaggtgc agcatccaca ggtgcaacca ccccaacaga acaagaagct
     3061 gcaccaacag aacaagaagc acaacctgca gcacctgaaa caccagcaga ggtaccagca
     3121 ccaaccacgc ctgcagctcc agcaactcca gccgcaccag cagcacccgc aaaaccagtt
     3181 atgacaaaat tatattacct tgaaaaatta aagaaatttt tagcattctc atatgcatgc
     3241 cataaatatg ttttattaca aaactctacc ataaacaaag atgctttaag caaatatgct
     3301 cttacaccag aagaagataa aataagaaca ttaaagagat gcagtgaatt agatgtatta
     3361 ttagctattc aaaataatat gcctactatg tattcacttt atgaaaatgt agttgatggt
     3421 ttacaaaaca tttacactga attatatgaa aaagaaatga tgtatcatat atataactta
     3481 aaagataaaa acccagctgt taaagcttta ttagtaaaag ctggcgtcat tgatccagaa
     3541 ccagtagccc caacaccagc agtaccagca ccagaaactg caccagaaac tgcaccagaa
     3601 actgcaccag aaacaccagc acaagaagct ccacaacaac cagaatcggc acaagcacca
     3661 gaagcagcaa ctgaaacaac aacaccagcc gaatcggcat caacagaacc aacaccaaaa
     3721 gcacctacag caacacccac atctgaaaca gtaacacaag aaggaacaac accagcagca
     3781 ccaaaagcac aagaaggagc atcatcatca gcaccagcac aaccagcccc agcaaaacca
     3841 gcacctgcac aaacagtaac agggcaatca acaaacgttg aaggaagtac tcaagtaaga
     3901 gcagaaagtg aagacgaaat gtttgtcgat gattttgaag tagacaattt ttacaaatct
     3961 tacttacaac aagttgatgg aaataatact caattcatag attttataaa atctaaaaaa
     4021 gaattaatca atgcattgac ccctgaaaaa gttaaccaat tatatcttga tattgcacac
     4081 ttaaaggaat tatcagaaca ttactataat cgttattata aatataaatt aaaattagaa
     4141 agattatatc aaaaacatga acaaattgaa gcagctaacc aaaaagttaa agaaattagc
     4201 gtattaaaat cccgattatt aaaaagaaaa aaatatatta atggtacatt ttatgtatta
     4261 tctggttttg caaatttctt taacaagaga agagaagctg aaaagcaata tgtagataac
     4321 gcaataaaaa atactgatat gttattgaaa tactacaaag ctcgtagtaa atattttact
     4381 tctgaagctg ttcctttaaa aacattaact aaaacatcaa ttgacagaga agccaactac
     4441 ttgaaaatcg aaaaattcag agcatacagc cgattagaat taagattaaa gaaaaatatt
     4501 aacttaggaa aagaaagaat tacatatgta tctggtggtt tacaccatgt atttgaagaa
     4561 tttaaagaac ttttaaaaaa taaaggttat accggaaaaa ctaaccctga aaatgctcct
     4621 gaagttatca aggcattcga acaatataaa gaattacttc caaagggagc aacaactcca
     4681 gctccagtag ttgcacctgt agttgctcca gccccagcta cagcagcccc agcagctgac
     4741 gcaccagtac cagcagccgc agccgcagcc gcatcaggat caggatcagc agccacaaca
     4801 gaaggagaag ccgctacaac agtagttgca agcagcgata atgatgatga tgacgatgat
     4861 gatatggatc aaattgcaaa tgctcaatcc acagacgaag aagtaaaaga tattcttgat
     4921 gcatttaaaa gtgaaaatga atatatatac acaaagagct taggtaacac atataaatca
     4981 tttaaaaaac acatgttaaa agaattttca atgattaaag aagacataat gactggatta
     5041 aactataaat tagaaaaaag aaatgatttc cttgatgtat taagctatga attagcttta
     5101 ttcaaagata taaataccaa caaatttgtt gttaaaaacc cataccaatt attagataat
     5161 gataagaaag acaaacaaat gataaactta aaatatgcca ttaaaggtgt aactgaagat
     5221 atcgaaacag ctactgatgg aattgaattc tttaacaaaa tgattgaatt atacaaacct
     5281 caattaaacg cagttaatga acaaattgct gccataggaa cagaacctac cgatgccgaa
     5341 aaaaagaaat acgctccaat ctttgaagat cttaaaggat tatatgaaac catattgaac
     5401 ggagcagaag aattttcaga attattacaa cacaaacttg aaaactataa aattgaaaaa
     5461 gctggatttg acattttaat ggcaaattta gaaacataca taagaattga cgaaaaactt
     5521 gaagacttcg tagaaagtgc agaaaaaaat aaacacattg cctcaatagc tttaaataac
     5581 ttaaacaaat ctggtttagt aactgaaggt gaatcaaaga aaatattagc aaaaatgctt
     5641 aacatggatg ccatggattt attaggtata ggttctaatc atgtatgtat tagtacaagt
     5701 actcctgaca atgctggatg ctttagatat gatgatggta cagaagaatg gagatgttta
     5761 ttaggtttca aaaaagatga tgatggtaat agatgtgtag cagatgatgc tcctgtttgt
     5821 aataacaaca atggtggatg tgataaaaat gctgattgta gagaagtaga aaatacagat
     5881 agggatcctt ccaaaaaaat tgtatgtact tgtaaagaac caaacccaaa tgcatattat
     5941 gctggtgtat tctgtagttc ttccggattt atgggattat caattttatt gatcatcaca
     6001 ttaattgtat ttaatttatt ttaaataaat gattaaaata tttgttgcat tttatatttt
     6061 tcctatatat attttaaaag ttgtataata catttgaaat atatattttg gcataaattg
     6121 tatatttttt attatataaa aaaatatata tatataattt ttaataaaca tttttaaata
     6181 aacgtacatg tgttttagta taggaaattt tgtatgactt taaaatatga tgatactatt
     6241 ttttttaaat gtatagtaaa ttaatttatt tttatttttt atacaatata ttgtatgtgt
     6301 gttctttatt actattattt tataagtata taaaataaag ctattttttt ttttttttta
     6361 acttcaaaca tatttagtaa cttttttatt taaagaatag ccggaattc
//
LOCUS       SHPMHCA       588 bp ss-mRNA            MAM       19-JUL-1990
DEFINITION  Sheep MHC class I protein gene, 3' end.
ACCESSION   M34672
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Sheep 8-week old, cDNA to mRNA, clone SC17.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 588)
  AUTHORS   Grossberger,D., Hein,W. and Marcuz,A.
  TITLE     Class I major histocompatibility complex cDNA clones from sheep
            thymus: Alternative splicing could make a long cytoplasmic tail
  JOURNAL   Immunogenetics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Grossberger, 30-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      287     MHC protein (AA at 3)
BASE COUNT      120 a    161 c    157 g    150 t
ORIGIN      
        1 ccaggaagtg ggcggccctg gtggttcctt ctggagagga gcacacatac acgtgccgtg
       61 tgcagcacga ggggcttcag gagcctaccc tgagatggga acctcctcag acctccttcc
      121 tcaccattgg catcattggt ctggatctcc tcgtggttgc tgtggtggct ggagctgtga
      181 gctggatgaa gaagctctca ggtgaaaaaa gacggacgta cacacaggct gcaagcagtg
      241 acagtgccca gggctctgat gtgtctctca cggtccctaa agtgtgaaac gctgccttgt
      301 gggactgagt gatgctgcat cccgcaatgt gacgtcagat cctggacccc tctttctcgg
      361 ctgcatccga atgtgtctgt gctcctagta gcataacatg aggagctggg gagactggtc
      421 acccctgccc accacacccc cttctccgct gacctgtgtt ctcctccctg atacactgtc
      481 ctgttccagc agagacaggg ctgggccgtg tcatcgctgt ctttgcttca tatgcactta
      541 gtaatgatgt cttatttcat ctttgaaaat aaaatctgta tatatatc
//
LOCUS       SHPMHCB       841 bp ss-mRNA            MAM       19-JUL-1990
DEFINITION  Sheep MHC class I protein gene, 3' end.
ACCESSION   M34673
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Sheep 8-week old, cDNA to mRNA, clone SCI89.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 841)
  AUTHORS   Grossberger,D., Hein,W. and Marcuz,A.
  TITLE     Class I major histocompatibility complex cDNA clones from sheep
            thymus: Alternative splicing could make a long cytoplasmic tail
  JOURNAL   Immunogenetics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Grossberger, 30-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      543     MHC protein (AA at 1)
BASE COUNT      179 a    235 c    238 g    189 t
ORIGIN      
        1 gaggactacc tggagggccg gtgcgtggag tggctccgca gatacctgga gaccgggaag
       61 gacacgctgc tgccgcagac ccttccaaag gcacatgtga cccgacaccc catctctgag
      121 cgtgaggtac ccttgaggtg ctgggccctg ggcttctacc ctgaggagat ctcactgacc
      181 tggcagcgca atggggagga ccagacccag gacatggagc tcgtggagac caggccttca
      241 ggagatggaa ccttccagaa gagggcggcc ctggtggtgc cttctgaaga ggagcagaga
      301 tacacgtgcc atgtgcagca cgaggggctt caggagctca ccctgagatg ggaacctcct
      361 cagacctcct tcctcaccaa gggcatcatt gttggcctgg ttctcctcgt gctggctgtg
      421 gtggctggag ctgtgatctg gaggaagaag tgctcaggtg aaaaaagagg cacctatacc
      481 caggcttcaa acaatgacat gtgcccaggc tctgatgtgt ctctcacagt tcctaaagtg
      541 tgagacgctg ccttgtggga ctgagtgatg ctgtatccca ctatgtgatg tcagatccct
      601 gacccctctt tctgcagctg catctgaacg ttgtctgtgc tccatgtagc ataacgtgag
      661 gagctgggga gattggtcac ccctgcccac cacaccccct cccgcctgga cctatgtctc
      721 ctccctgata cactgtccta atccagcaga gagggcctgg ctgtctccat ccctgtcttg
      781 cttcatgtgc actgagtaat gatgtcttat acccttattg aaaataaaat ctgtatatat
      841 g
//
LOCUS       SHPMHCC       995 bp ss-mRNA            MAM       19-JUL-1990
DEFINITION  Sheep MHC class I protein gene, 3' end.
ACCESSION   M34674
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Sheep 8-week old, cDNA to mRNA, clone PSCI16.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 995)
  AUTHORS   Grossberger,D., Hein,W. and Marcuz,A.
  TITLE     Class I major histocompatibility complex cDNA clones from sheep
            thymus: Alternative splicing could make a long cytoplasmic tail
  JOURNAL   Immunogenetics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Grossberger, 30-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      537     MHC protein (AA at 1)
    site        466      626     unspliced intron
BASE COUNT      203 a    248 c    288 g    254 t      2 others
ORIGIN      
        1 gaccctccaa aggcacatgt ggcccatcac cccatctctg accgtgaggt caccctgagg
       61 tgctgggccc tgggcttcta ccctgaggag atctcactga cctggcagcg tgacggggag
      121 gaccagactc aggacatgga gtttgtggag accaggcctt caggggatgg aaccttccag
      181 aagtgggcgg ccctggtggt gccttctgga gaggagcaga gatacacgtg ccgtgtgcag
      241 cacgaggggc ttcaggagcc cctcaccctg agatgggaat ctcctcagcc ctccgtcctc
      301 accatgggca tcattgttgg cctggttctc ctcgtggtgg ctgtggtggc tggagctgtg
      361 atctggatga agaagcgctc aggtgaaaaa ggacggatct acacccaggc tgcaagcatg
      421 tacagtgccc agggctctga tgtgtctctc acggttccta aaggtgaggc cctggagtgt
      481 ctagattgga aggagcattg gggcagaggg gacacactgg gtggcggggg tctctgagtg
      541 ggacatgtga gcatgtcggg ggctgtggag aatatcagcc cttacatgac tgacctgaac
      601 tggctcctga ttcttttctc tcacagtgtg agacagctgc cttgtgggga ctgagtgatg
      661 cttggtccca ctttgtgatg tcagatcgcc ggacccctct ttcttcagct gcatctgaat
      721 gtgtctgtgc tcctattagc ataacatgag aagttgggga gactggtcac ccttgcccac
      781 tgtacgctgt ccccaccctg acctgtgttc tcctccctga tccaccatcc tgttcagcga
      841 gacgggctgg gccatcttca ttgctatctt tgcttcacat gcactgagta atgatgtctt
      901 atttccttat tgaaaataaa ttctgtatat atatgaatct attttttcta attggtgcca
      961 tgaaagggnn ttggataata aaatgagaat tcgat
//
LOCUS       SHPMHCD      1050 bp ss-mRNA            MAM       19-JUL-1990
DEFINITION  Sheep MHC class I protein gene, 3' end.
ACCESSION   M34675
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Sheep 8-week old, cDNA to mRNA, clone PSCI11.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1050)
  AUTHORS   Grossberger,D., Hein,W. and Marcuz,A.
  TITLE     Class I major histocompatibility complex cDNA clones from sheep
            thymus: Alternative splicing could make a long cytoplasmic tail
  JOURNAL   Immunogenetics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Grossberger, 30-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      747     MHC protein (AA at 1)
BASE COUNT      216 a    292 c    315 g    227 t
ORIGIN      
        1 ggcgggtctc acaccatcca ggcgatgtac ggctgcgaag tgggacctga cgggcgtctc
       61 ctccgcgggt atgagcagtt cgcctacgaa ggcagagatt acctcgccct gaacgaggac
      121 ctgcgctcct ggaccgcggc ggacacggcg gctcagatca ccaagcgcaa gtgggaggcg
      181 gcaggtgagg cggcgcgtgt gaggatctac ctggagggca cgtgcgtgga gtggctccgc
      241 agacacctgg agaccgggaa ggacacgctg ctgcccgcag accctccaaa ggcacatgtg
      301 acccaacacc ccatcactga gcgtgaggtc accctgaggt gctgggcctt gggcttctac
      361 cctgaggaga tctcactaac ctggcagcac aatgaggagg accagaccca ggacatggag
      421 cttgtgaaga ccaggccttc aggggatgga accttccaga agtgggcagc cctggtggtg
      481 ccttctggaa aggagcagag atacacgtgc cgtgtgcagc acgaggggct tcaggagccc
      541 ctcaccctga gatgggcacc tcctcagacc tccttcctca ccatgggcat cattgttggc
      601 ctggttctcc tcgtggtgac tgtggtggct ggagctgtga tctggaggaa gaagcgctca
      661 ggtgaaaaaa gacagaccta tacccaggct gcaagcagtg acagtgccca gggctctgat
      721 gtgtctctta tggttcctaa agtgtgagac agctgccttg tggggactga gtgatgcttg
      781 gtcccattct gtgacatcag atcttgggac ccctctttct gcaggggcat ctgaatgtgt
      841 ctgtgctcct attagtataa catgaggagt tggggagact ggtcacccct gcccactgca
      901 caccgtcccc accctgacct gtgttctcct tcctgatcca ctgtcctgtt gcagcagaga
      961 cgcctgggcc ctctccatca ctgtctttgc ttcatatgca ctgagtaatg atgtgttatt
     1021 tcctttttga aaataaaatc tgtatatatg
//
LOCUS       SHPMHCE      1396 bp ss-mRNA            MAM       19-JUL-1990
DEFINITION  Sheep MHC class I protein gene, complete cds.
ACCESSION   M34676
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Sheep 8-week old, cDNA to mRNA, clone PSCI12.
  ORGANISM  Ovis aries
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1396)
  AUTHORS   Grossberger,D., Hein,W. and Marcuz,A.
  TITLE     Class I major histocompatibility complex cDNA clones from sheep
            thymus: Alternative splicing could make a long cytoplasmic tail
  JOURNAL   Immunogenetics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Grossberger, 30-MAY-1990.
FEATURES       from  to/span     description
    pept          4     1110     MHC protein precursor
    sigp          4       89     MHC protein signal peptide
    matp         90     1107     MHC protein
BASE COUNT      278 a    403 c    434 g    281 t
ORIGIN      
        1 cccatgacca gaggattgcg agtaatgggg ccgcgaaccc tcctgttgct gctctcggga
       61 gtcctggtcc tgaccgagat ccgggcgggc ccccactcca tgaggtattt cagcaccgcc
      121 gtgtcccgcg ccggcgccgg ggagccccgg tacctggaag tcggctacgt ggacgacacg
      181 cagttcgtgc ggttcgacag cgacgccccg gatccgaaga tggagcagag ggagccgtgg
      241 atgaagcagg tggggccgga gtattgggat cggaacacgc gaaatcccaa gggcaacgca
      301 cagactttcc gagtgggcct gaccatcctg cgcggctact acaaccagag cgagaccggg
      361 tctcacacct ggcagtgtat gtacggctgc gacgtggggc cggacgggcg tctcctccgc
      421 gggttcatgc agttcggcta cgacggcaga gattacatcg ccctgaacga ggacctgcgc
      481 tcctggaccg cggcggacac ggcggctcag gtcacccagc gcaagtggga gaaggaaggt
      541 gcggcggacc actacaggaa ctacgtggag ggcacgtgcg tggagtgcgt gcgcagatac
      601 ctggagatcg ggaaggaaca gctgcagcga gcagaccctc caaaggcaca tgtgacccat
      661 caccccatct ctggccatga tgtcaccctg aggtgctggg ccctgggctt ctaccctgag
      721 gagatctcac tgacctggca gcgcaatggg gaggaccagt tgcaggacat ggagcttgtg
      781 gagactaggc cttcagggga tggaaccttc cagaagtggg cggcccttgg tggtgcttct
      841 ggagaggagc agagatacac gtgccatgtg cagcatgagg ggcttcagga gcccctcacc
      901 ctgagatggg aacctcctca gacctccttc ctcacttcct caatgggcat cattgttggc
      961 ctggttctcc tcgtcatggt ggctgtggtg gctgcagctg tgatctggag gaagaagtgc
     1021 tcaggtgaaa aaagagggac ctatacccag gcttcaagca atgacagtgc ccagggttct
     1081 gatgtgtctc tcacggttca taaagtgtga gacagtgatg ctgcatcccg ctatgtgcca
     1141 tcagatcccc ggacccctct ttctgaagct gcatctgcac gtgtctgtgc tcctagtagc
     1201 ataacgtgag gagttgggga gaccgttcac ccctgcccac cgcgccccct cctgccctga
     1261 cctgtgttct cctccctgat ccactgtcct gttccagcag cagacagggc tgggccgtct
     1321 ccatccctgt ctttgcttcg tatgcactga gtaatgatgt cttatttcct tattgaaaat
     1381 aaaatctgta tgtatg
//
LOCUS       YSPNMT1A     3787 bp ds-DNA             PLN       19-JUL-1990
DEFINITION  S.pombe no message in thiamine protein (nmt1) gene, complete cds.
ACCESSION   J05493
KEYWORDS    .
SOURCE      S.pombe DNA.
  ORGANISM  Schizosaccharomyces pombe
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3787)
  AUTHORS   Maundrell,K.
  TITLE     nmt1 of fission yeast: A highly transcribed gene completely
            repressed by thiamine
  JOURNAL   J. Biol. Chem. 265, 10857-10864 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Maundrell, 07-JUN-1990.
FEATURES       from  to/span     description
    pept       1499     2539     no message in thiamine protein (nmt1)
    mRNA       1430     2681     nmt1 mRNA
    signal     1396     1403     TATA box
BASE COUNT     1218 a    657 c    690 g   1222 t
ORIGIN      
        1 ggctcattta taatctagca ctttatacct tttacctgac tgttgggttg tttatctgac
       61 ctcataaaag aaagtgtcgt tttggaaaaa ttagcattac attgagggtc ctccgctaat
      121 gctcctgcga aaaatgattt taattttgga tgttttttca gaaataaaat gaaaattagc
      181 ttgatataat atcaaccggc agcgagtaat agatttaaaa taaatttgat taattaaaaa
      241 aatttgttgt tttaagcaag ccattttgct aaaatcaaag gtaatggaag agtatttccg
      301 aaaaatctca acacatgtga atgatcagaa aattatcgcc ataaaagaca gaataagtca
      361 tcagcggttg tttcatttcc tatatttttt ttttattttt ttatttttta ataagggaaa
      421 atttaacgtc taaggataca gaagattgtt agcacattaa agtaataaag gcttaagtag
      481 taagtgcctt agcatgttat tgtatttcaa aggacataat ctaaaataat aacaatatca
      541 tttctcacaa gttattcaat tttctttttt ttttctaata atatcaagaa tgtattattt
      601 gtttgacata agtcaactaa tttatttaat atgctggatt aatcttgcag acatgtaaat
      661 taacaagttt tagtcaaata acgttgaagt ttcaatgaac tcaaataatt tctctttttt
      721 tttatataac catatgtcta atctgattta tattttccgc aggatcaact gaagttatga
      781 catttggatt ggatcactta taaccttggt cgccaaataa tacaaaaatc agcgttataa
      841 aacaaagaag gtttttgtta agaaattaat cctctttctt gataagaaag ttgaaccgaa
      901 attgcagata ctgatatatg aaaataatac ccacaatttt gggaatagcg caagcctcaa
      961 tttaaacaat aggtgaggac acatgataat gacctcaatg attgttagaa gaaaagagcc
     1021 tcattacaaa atcgaaaaat gaatggttgg gtacaagttt ccaaaacatg gtaaagtgga
     1081 ctttgcgtat gagacgtaaa tagaaaaaaa cacttgttat atgttttcta gaattattgt
     1141 tgtctcttta tggttggatg atgcaaaata gtaatttcgg ttagttgctg taaaacacca
     1201 cgagacaaat agatatggat atttattaaa tcaggaaaaa cgtaactctc ggctactgga
     1261 tggttcagtc acccaacgat tactggggag agaaaacagg gcaaaagcaa agcttaaagg
     1321 aatccgattg tcattcggca atgtgcagcg aaactaaaaa ccggataatg gacctgttaa
     1381 tcgaaacatt gaagatatat aaaggaagag gaatcctggc atatcatcaa ttgaataagt
     1441 tgaattaatt atttcaatct cattctcact ttctgactta tagtcgcttt gttaaatcat
     1501 gtctactaac aagatcactt tcctcacaaa ctgggaggcc actccttacc atttgcccat
     1561 ctttcttgct caaactcgcg gatactatga gcgtgaaggg attgaggttg ctattctcga
     1621 gcctaccaac ccttccgacg ttacagcatt gattggttct ggtaaggttg acatgggatt
     1681 aaaggccatg atccatactt tagctgctaa ggctcgcgga taccctgtca ccagttttgg
     1741 atctttgtta aatgagcctt tcactggctt aattactttg aagggtaatg gcatcaacga
     1801 cttcaaggac attaaaggaa agcgtattgg ctacgttggt gagtttggaa agatccaact
     1861 cgatgacttg tgcagcaagt tcggtttgtc tccttctgat tatactgcta ttcgctgtgg
     1921 tatgaacatt gcccctgcca tcatcaatgg tgaaatcgat ggcggcattg gcattgaatg
     1981 catgcaacaa gtcgagcttg agcgctggtg cgtctcccaa ggccgcccaa ggtctgatgt
     2041 ccaaatgttg cgtattgatc gattagccaa cttaggttgc tgctgtttct gtaccatttt
     2101 gtatattgca catgatgaat tcattgctaa acatcccgac aagatcaagg ccttcttacg
     2161 tgctatccat tctgctactt tggatatgct taaagatcct gtccaaacct acaaggagta
     2221 cattcacttc aagcgtgaaa tgggatccga acttcatcgg gaacaatttg aacgttgctt
     2281 tgcatatttc tcacatgaca tctctaacgt ccccagagat tggaacaagg ttaccaatta
     2341 ttccaagcgt ttgggcatca tcccccaaga ttttgagccc aactgtacta acggttactt
     2401 gacctgggaa cttgaccccg atgagaagga tcccatgggc aaacaagaag ccattgccga
     2461 gatccaagat gaaattaagc aaaagggagg tgtcttcagc ggcaactcac ttcgttatgt
     2521 cgagcctgcc aacctttaaa aggaatgtct cccttgccag tactgctagg gtttttcttt
     2581 caaactatgg aagcccattc aagctgcata ttacgatttt gtttttcgct tttagaaagt
     2641 ggtttagatg agataataga aaaattcttg atctccgaca acgagtactt ttattttttt
     2701 tgctaatcac tttactcaat attagctcga aatcgtagaa acgtagacgg gtgcgggata
     2761 ccgagtggtg tagttaagaa tttttataaa ccacgtggcc caaaaatatg aacccaaaac
     2821 gtttatacat gagtatactt taagaaggct ataccccttc gtgttagatg tagttttagc
     2881 tacccaaccc gagtctatga gcttgacttc agatgtagaa ggcattaaat cgttttgaat
     2941 attaattaaa aaacgatgaa aattaaatat ttaaaagcaa tcatacgctg aaaatttagt
     3001 gctgtggcta atccttcaac atggaaatgc cataaaagtg actttgacaa aaaaaaaagt
     3061 atatacaggt agtaaactca tctacttcat tgactttgtt tacagcatgt ggaaggagga
     3121 atatttattg ctaaatcgta gtttaacatt caataagtaa tactattgaa attcgacaag
     3181 attggccgca tggatgaaaa agaggcattt tgctttggga gaattagttc aaattagaac
     3241 tgaaaaaaaa aactttacga ggcaaaaatg tcggattgag atcgtaaaag ttcgctcgtc
     3301 gtcttttgct ttgtgattgt tttcatggat acatcttgct ggatatttaa attttagtac
     3361 tatgtataag atattctata aatgttttat cacccaaacc tgttagcgcc ttcttaattc
     3421 tattcaatct ggcttttgct ctgagactac ttcttggact ttcactactt gttagttata
     3481 cggaatttgt gtaattagaa gtgaaataat cctttctatt agtaatgcaa acaaaaatca
     3541 attggaaagc aaatttacac atacttgctg tatcgccttc gactatcttt tcattgcata
     3601 ccatgatttt agacgtttat acttaagcaa ttaaaaggtt ttgattcaat cataaacata
     3661 attatccttg ataaaaaaag aattatacac attgttctct ttatttgact tcgaactgtt
     3721 taacatcgaa acggtcagat gatacaccca ttcctccaat gtaatccctg gcttcttggg
     3781 caagctt
//
LOCUS       CP7CPL       1470 bp ds-DNA             PHG       19-JUL-1990
DEFINITION  Bacteriophage Cp-7 muramidase (cpl7) gene.
ACCESSION   M34779
KEYWORDS    muramidase.
SOURCE      Bacteriophage Cp-7 [from S.pneumoniae] DNA.
  ORGANISM  Bacteriophage Cp-7
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 1470)
  AUTHORS   Garcia,P., Garcia,J.L., Garcia,E., Sanchez-Puelles,J.M. and
            Lopez,R.
  TITLE     Modular organization of the lytic enzymes of Streptococcus
            pneumoniae and its bacteriophages
  JOURNAL   Gene 86, 81-88 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        286     1314     muramidase
BASE COUNT      483 a    270 c    318 g    399 t
ORIGIN      
        1 cagctggaca ggcttaaaag gagttatcaa acataccctt acattcattt tttactactt
       61 tgtagcggta ttcttgacct atattcacgc tatggcagtc ggtcagattt tgctggttat
      121 cattaactta tactatgctt tgtcaatcat ggaaaatctt gctgttatgg gtgtatttat
      181 tcccaagttt atgacggcaa gggtgcaaga agagttacag aaatacacag cacaactaga
      241 cgcagggaaa gacctgctag aagaatttaa aggagaaaag aaataatggt taagaaaaat
      301 gatttatttg tagacgttgc aagccatcaa ggctacgaca tttcaggaat tttagaagaa
      361 gcagggacaa caaacacaat tattaaagtg tcagaaagta caagctattt aaacccttgc
      421 ttgtctgctc aagtgagcca gtcaaatcct atcgggtttt atcattttgc ttgctttggt
      481 ggaaatgaag aagaagcaga agcagaagca cgctatttcc ttgataacgt gcctacacaa
      541 gttaaatacc ttgtactaga ttatgaagac catgcaagcg caagcgtaca aagaaacact
      601 accgcgtgct tacgctttat gcaaatgatc gcagaagctg gatatacacc tatttattat
      661 agttacaaac cgtttacgct tgataatgtg gactatcagc agattttagc acagttccct
      721 aattctctat ggattgcagg ctatggctta aatgatggta cagctaactt tgaatacttt
      781 ccaagcatgg acggtatcag atggtggcaa tattctagta acccgtttga caagaatatt
      841 gtactgttag atgatgagaa agaagataat ataaacaatg aaaacactct aaaaagcctt
      901 accacagtag ccaacgaggt cattcaggga ctttggggca acggtcaaga acgttatgac
      961 agtttagcga atcgagggta tgacccccaa gcggttcaag acaaagtgaa tgaaatctta
     1021 aacgctagag aaattgcaga ccttaccaca gtagccaacg aggtcattca gggactttgg
     1081 ggcaacggtc aagaacgtta tgacagttta gcgaatcgag ggtatgaccc ccaagcggtt
     1141 caagacaaag tgaatgaaat cttaaacgct agagaaattg cagaccttac cacagtagcc
     1201 aacgaggtca ttcagggact ttggggcaac ggtcaagaac gttatgacag tttagcgaat
     1261 cgagggtatg acccccaagc ggttcaagac aaagtgaatg aattactttc ataacaagta
     1321 aaagctagta gaaattttct actagctatt tttatattct gctatgattt tataggcgtc
     1381 ctcatctggg ttatccagag caatggagca aatggcagac aggacagctg ttcatctgat
     1441 tgtatttctg taaatagtga ttttctagct
//
LOCUS       CP9CPL       1253 bp ds-DNA             PHG       19-JUL-1990
DEFINITION  Bacteriophage Cp-9 muramidase (cpl9) gene.
ACCESSION   M34780
KEYWORDS    muramidase.
SOURCE      Bacteriophage Cp-9 [from S.pneumoniae] DNA.
  ORGANISM  Bacteriophage Cp-9
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 1253)
  AUTHORS   Garcia,P., Garcia,J.L., Garcia,E., Sanchez-Puelles,J.M. and
            Lopez,R.
  TITLE     Modular organization of the lytic enzymes of Streptococcus
            pneumoniae and its bacteriophages
  JOURNAL   Gene 86, 81-88 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         52     1071     muramidase
BASE COUNT      403 a    213 c    294 g    343 t
ORIGIN      
        1 agtagacgca ggaaaagacc tgctagaaga atttaaagga gaaaagaaat aatggttaag
       61 aaaaatgatt tatttataga cgtatcaagc cacaacggtt acgatataac aggaatttta
      121 gagcagatgg gaacaacaaa cacgattgtt aaaatctcag aaagtacgac ctatttaaac
      181 ccttgcttgt ctgctcaagt ggaacagtct acccctattg gcttttatca cttcgcacgc
      241 tttggcggag acgtagcaga agctgaaaga gaagcgcagt ttttccttga caacgtgcct
      301 acacaagtta aataccttgt attggactat gaagacgacc caagcggaaa cgcacaagcc
      361 aacactaacg catgcttacg ctttatgcag atgattgcag acgctggata tacacctatt
      421 tattatagtt ataaaccttt cacgcttgat aatgtggact atcagcagat tttagcacag
      481 ttccctaatt ctctctggat tgcagggtat ggcttgaatg atggaaacgc tgattttgaa
      541 tattttccat ctatggacgg gataagatgg tggcagtatt ctagtaaccc gtttgacaag
      601 aatattgtac tgttagacga tgaagaagac gaaaagccaa agactgctgg aacgtggaaa
      661 caagacagta agggctggtg gttcagacgc aataacggta gtttccctta taataaatgg
      721 gaaaaaatcg ggggtgtgtg gtactacttc gatagtaaag gatattgctt aacgagcgaa
      781 tggctcaaag ataatgaaaa atggtactac ctcaaggaca acggcgctat ggtgactggt
      841 tgggtgctag tcgggtcaga gtggtattat atggacgatt caggtgcaat ggttactggt
      901 tgggtcaaat acaagaataa ctggtactat atgacaaatg aacgtggtaa catggtttct
      961 aatgaattta ttaaatctgg aaaaggctgg tatttcatga acacaaacgg agagcttgca
     1021 gacaatccaa gctttacaaa agaaccagac ggacttataa cggtagcata aaaagaaaag
     1081 ctagtagaaa ctttctacta gctgttttta tattctgcaa tgattttata agcgtcttcg
     1141 tctgggttgt ccagagcgat ggagcagatg gcagacagaa ccgctgttca tctgattgta
     1201 tttctgtagg tagtgatttt ctaggctgtt atgttgctga tgtgctttat acc
//
LOCUS       YSCTY31A     5510 bp ds-DNA             PLN       19-JUL-1990
DEFINITION  S.cerevisiae Ty3-1 retrotransposon integrase gene, complete cds,
            and Cys-tRNA gene.
ACCESSION   M34549
KEYWORDS    integrase; transfer RNA-Cys; transposable element; transposon.
SOURCE      S.cerevisiae (strain AB950) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 5510)
  AUTHORS   Hansen,L.J. and Sandmeyer,S.B.
  TITLE     Characterization of a transpositionally active Ty3 element and
            identification of the Ty3 integrase protein
  JOURNAL   J. Virol. 64, 2599-2607 (1990)
  STANDARD  unannotated staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.B.Sandmeyer, 24-MAY-1990.
FEATURES       from  to/span     description
    pept        536     1408     integrase
    tRNA        105       31 (c) Cys-tRNA
    mRNA        343  >  5510     integrase mRNA
    site        121      462     5' sigma element
    site       5132     5471     3' sigma element
    rpt         121      128     5' inverted terminal repeat
    rpt         455      462     3' inverted terminal repeat
    rpt        5132     5139     5' inverted terminal repeat
    rpt        5464     5471     3' inverted terminal repeat
    rpt         116      120     5' insertion target sequence
    rpt         463      467     3' insertion target sequence
    rpt        5127     5131     5' insertion target sequence
    rpt        5472     5476     3' insertion target sequence
BASE COUNT     1955 a   1306 c    919 g   1330 t
ORIGIN      
        1 aactttcatg gaaggaccac ctagttaata aaaagctcgc actcaggatc gaactaagga
       61 ccaacagatt tgcaatctgc tgcgctacca ctgcgccata cgagcttgat tttctgaaag
      121 tgttgtatct caaaatgaga tatgtcagta tgacaatacg tcaccctgaa cgttcataaa
      181 acacatatga aacaacctta taacaaaacg aacaacatga gacaaaaccc gaccttccct
      241 agctgaacta cccaaagtat aaatgcctga acaattagtt tagatccgag attccgcgct
      301 tccaccactt agtatgattc atattttata taatatataa gataagtaac attccgtgaa
      361 ttaatctgat aaactgtttt gacaactggt tacttcccta agactgttta tattaggatt
      421 gtcaagacac tccggtatta ctcgagcccg taatacaaca cctggtagcg ttaaaggtta
      481 ctaattgttc aaacgaacca tcgaaaagcc gaacctagct acaccacacc ccagtatgag
      541 ctttatggat caaatcccag gaggaggaaa ttatccaaaa ctcccagtag aatgccttcc
      601 taacttcccg atccaaccat ctttgacctt cagaggtaga aatgactcgc ataaactgaa
      661 aaactttatc tccgaaataa tgttaaacat gtctatgata tcttggccga atgatgccag
      721 tcgtattgtg tactgcagaa gacatttatt aaaccccgct gctcagtggg ctaatgactt
      781 tgtacaagaa caaggtatac ttgaaataac attcgacaca ttcatacaag gattatatca
      841 gcatttctat aagccaccag atatcaataa aatctttaat gcaatcacgc aactttccga
      901 agctaaactt ggtattgagc gtctcaacca acgattcaga aagatttggg acagaatgcc
      961 accagacttc atgaccgaaa aagctgccat aatgacatat actaggctat tgacaaagga
     1021 aacctataat attgtcagaa tgcacaaacc agagacatta aaagacgcca tggaagaggc
     1081 ttaccagaca actgcactaa ctgaaagatt cttcccagga ttcgaacttg atgctgatgg
     1141 agacactatc atcggtgcca caacccactt acaagaagaa tacgactctg actatgattc
     1201 agaagataat ctgacccaga atggatacgt ccataccgta aggacaagaa gatcttacaa
     1261 taaaccaatg tcaaatcatc gaaacaggag aaataacaac ccatctagag aagaatgtat
     1321 aaaaaatcgg ctatgcttct attgtaagaa agagggacat cgcctgaacg aatgtagagc
     1381 acgtaaggcg agttctaacc gatcttgaac tcgaatcaaa agaccaacaa actcctttta
     1441 tcaaaacctt accaattgta cactatatcg ccatccccga gatggacaat accgccgaaa
     1501 aaaccataaa aatacaaaac acgaaagtaa aaaccctgtt tgacagtgga tcacccacgt
     1561 catttatccg aagagatatt gtagaacttc tcaaatacga aatctacgag acccctccac
     1621 tccgttttag aggattcgta gccaccaaat ccgccgttac atccgaagca gtcaccattg
     1681 acctcaaaat caatgacctg catataactt tagccgcgta catactggat aacatggact
     1741 accaattgtt aattggaaat ccaatcttac gccgctaccc gaaaatcctg cacacagtac
     1801 tgaataccag agagagcccc gactccttaa agcccaagac ttatcgctcc gaaaccgtta
     1861 ataacgttag aacctactcc gctggtaatc gtggtaaccc cagaaacata aaactgtctt
     1921 ttgcccccac cattctcgaa gcaactgacc cgaaatccgc tggtaatcgt ggtgactcca
     1981 gaaccaaaac cctgtctctt gcaaccacta ctcctgcagc aattgacccg cttacgaccc
     2041 ttgataaccc aggtagtact caaagtacat ttgcgcaatt cccgatacct gaagaagcga
     2101 gcatcctaga agaggatgga aaatactcca acgttgtctc aaccattcag agtgtagaac
     2161 ctaatgctac tgatcacagc aataaggaca ccttttgcac tttgccagtt tggttacaac
     2221 agaagtatag agagatcata cgtaatgatc tcccaccaag acctgccgac attaataaca
     2281 tccccgtaaa acatgatatt gaaattaaac ctggcgcaag actacctcga ctacagccat
     2341 accatgttac agaaaagaac gaacaagaaa tcaacaaaat agttcaaaaa ctgctcgata
     2401 acaagttcat tgttccctca aagtcgcctt gcagctcccc tgtagtcctc gtcccgaaga
     2461 aagacggtac cttccgactc tgcgtcgatt accgcaccct gaacaaagct accatctccg
     2521 acccattccc attacccaga atcgacaacc tattgagccg tattggaaat gcccagatat
     2581 ttaccacgct agatttgcat agtggttacc accagatccc gatggaaccc aaagaccgct
     2641 acaaaaccgc ctttgtcaca ccatccggta agtatgaata taccgtcatg ccatttggct
     2701 tagtcaatgc acctagtaca ttcgcaagat acatggctga tacatttaga gacctgagat
     2761 tcgtcaatgt ttaccttgat gatatattaa tattctccga atctccagaa gaacattgga
     2821 aacatttaga cacggtacta gaaagattaa agaacgagaa cctcattgtt aagaagaaaa
     2881 aatgtaaatt tgcatctgaa gaaactgagt ttttaggcta tagtattgga atccagaaaa
     2941 tagctccact acagcacaaa tgtgcagcaa tccgagactt tccgacgcct aaaacagtaa
     3001 aacaagcaca gagattttta ggaatgatta attactacag acgattcatt ccaaattgct
     3061 ccaagattgc acagccaatc caactgttta tttgtgacaa aagtcaatgg acagaaaaac
     3121 aagacaaggc aattgataaa ctaaaagacg ccttgtgtaa ctcccccgtc ctagtaccat
     3181 tcaacaacaa agcaaactac cgacttacaa cagacgcctc aaaagacggc attggtgctg
     3241 ttctagaaga agtcgacaac aagaacaaac ttgttggtgt cgtcggttac ttctctaaat
     3301 ccttagagag tgcccagaaa aactatcctg ctggcgaatt agaactactt ggaattatca
     3361 aagcactcca ccacttccga tatatgcttc acggaaagca tttcacgtta agaacagacc
     3421 acattagttt gttatcatta caaaacaaga acgaacccgc acgacgcgtg caacgctggt
     3481 tagatgacct agccacatat gacttcacct tagaatacct agctggaccc aagaacgttg
     3541 tcgcagatgc catatcccgt gccgtatata ctataacccc cgaaacatcc cgacctatcg
     3601 acacagaaag ctggaaatct tactacaaat cagacccatt atgtagtgct gtcttaattc
     3661 atatgaaaga attgacacaa cacaacgtca cacctgaaga tatgtcagcc ttccgtagtt
     3721 accagaagaa actcgaacta tcagagacct tccgaaagaa ttattcccta gaagacgaaa
     3781 tgatctatta ccaagaccga ctagtagtac caataaaaca acagaacgca gttatgagac
     3841 tatatcatga ccatacctta tttggaggac attttggtgt aacagtgacc cttgcgaaaa
     3901 tcagcccaat ttactattgg ccaaaattac aacattcgat catacaatac atcaggacct
     3961 gcgtacaatg tcaactaata aaatcacacc gaccacgctt acatggacta ttacaaccac
     4021 tccctatagc agaaggaaga tggcttgata tatcaatgga ttttgtgaca ggattacccc
     4081 cgacatcaaa taacttgaat atgatcctcg tcgtagttga tcgtttttcg aaacgcgctc
     4141 acttcatagc tacaaggaaa accttagacg caacacaact aatagatcta ctctttcgat
     4201 acattttttc atatcatggt tttcccagga caataaccag tgatagagat gtccgtatga
     4261 ccgccgacaa atatcaagaa ctcacgaaaa gactaggaat aaaatcgaca atgtcttccg
     4321 cgaaccaccc ccaaacagat ggacaatccg aacgaacgat acagacatta aacaggttac
     4381 taagagccta tgcttcaacc aatattcaga attggcatgt atatttacca caaatcgaat
     4441 ttgtttacaa ttctacacct actagaacac ttggaaaatc accatttgaa attgatttag
     4501 gatatttacc gaatacccct gctattaagt cagatgacga agtcaacgca agaagtttta
     4561 ctgccgtaga acttgccaaa cacctcaaag cccttaccat ccaaacgaag gaacagctag
     4621 aacacgctca aatcgaaatg gaaactaata acaatcaaag acgtaaaccc ttattgttaa
     4681 acataggaga tcacgtatta gtgcatagag atgcatactt caagaaaggt gcttatatga
     4741 aagtacaaca aatatacgtc ggaccatttc gagttgtcaa gaaaataaac gataacgcct
     4801 acgaactaga tttaaactct cacaagaaaa agcacagagt tattaatgta caattcctga
     4861 aaaagtttgt ataccgtcca gacgcgtacc caaagaataa accaatcagc tccactgaaa
     4921 gaattaagag agcacacgaa gttactgcac tcataggaat agatactaca cacaaaactt
     4981 acttatgtca catgcaagat gtagacccaa cactttcagt agaatactca gaagctgaat
     5041 tttgccaaat tcccgaaaga acacgaagat caatattagc caactttaga caactctacg
     5101 aaacacaaga caaccctgag agagaggaag atgttgtatc tcaaaatgag atatgtcagt
     5161 atgacaatac gtcaccctga acgttcataa aacacatatg aaacaacctt ataacaaaac
     5221 gaacaacatg agacaaaacc cgaccttccc tagctgaact acccaaagta taaatgcctg
     5281 aacaattagt ttagatccga gattccgcgc ttccaccact tagtatgatt catattttat
     5341 ataatatata agataagtaa cattccgtga attaatctga taaactgttt tgacaactgg
     5401 ttacttccct aagactgttt atattaggat tgtcaagaca ctccggtatt actcgagccc
     5461 gtaatacaac agaaagttcc attttggatg ctctatttat gggaatatga
//

GenBank-Updates@genbank.bio.net (07/20/90)

LOCUS       SYNM13MP18   7249 bp ds-DNA             SYN       20-JUL-1990
DEFINITION  Cloning vector M13mp18 DNA.
ACCESSION   M11454
KEYWORDS    .
SOURCE      Synthetic DNA.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 7249)
  AUTHORS   Yanisch-Perron,C., Vieira,J. and Messing,J.
  TITLE     Improved M13 phage cloning vectors and host strains: Nucleotide
            sequences of the M13mp18 and pUC19 vectors
  JOURNAL   Gene 33, 103-119 (1985)
  STANDARD  simple staff_entry
BASE COUNT     1768 a   1538 c   1533 g   2410 t
ORIGIN      
        1 aatgctacta ctattagtag aattgatgcc accttttcag ctcgcgcccc aaatgaaaat
       61 atagctaaac aggttattga ccatttgcga aatgtatcta atggtcaaac taaatctact
      121 cgttcgcaga attgggaatc aactgttaca tggaatgaaa cttccagaca ccgtacttta
      181 gttgcatatt taaaacatgt tgagctacag caccagattc agcaattaag ctctaagcca
      241 tccgcaaaaa tgacctctta tcaaaaggag caattaaagg tactctctaa tcctgacctg
      301 ttggagtttg cttccggtct ggttcgcttt gaagctcgaa ttaaaacgcg atatttgaag
      361 tctttcgggc ttcctcttaa tctttttgat gcaatccgct ttgcttctga ctataatagt
      421 cagggtaaag acctgatttt tgatttatgg tcattctcgt tttctgaact gtttaaagca
      481 tttgaggggg attcaatgaa tatttatgac gattccgcag tattggacgc tatccagtct
      541 aaacatttta ctattacccc ctctggcaaa acttcttttg caaaagcctc tcgctatttt
      601 ggtttttatc gtcgtctggt aaacgagggt tatgatagtg ttgctcttac tatgcctcgt
      661 aattcctttt ggcgttatgt atctgcatta gttgaatgtg gtattcctaa atctcaactg
      721 atgaatcttt ctacctgtaa taatgttgtt ccgttagttc gttttattaa cgtagatttt
      781 tcttcccaac gtcctgactg gtataatgag ccagttctta aaatcgcata aggtaattca
      841 caatgattaa agttgaaatt aaaccatctc aagcccaatt tactactcgt tctggtgttc
      901 tcgtcagggc aagccttatt cactgaatga gcagctttgt tacgttgatt tgggtaatga
      961 atatccggtt cttgtcaaga ttactcttga tgaaggtcag ccagcctatg cgcctggtct
     1021 gtacaccgtt catctgtcct ctttcaaagt tggtcagttc ggttccctta tgattgaccg
     1081 tctgcgcctc gttccggcta agtaacatgg agcaggtcgc ggatttcgac acaatttatc
     1141 aggcgatgat acaaatctcc gttgtacttt gtttcgcgct tggtataatc gctgggggtc
     1201 aaagatgagt gttttagtgt attctttcgc ctctttcgtt ttaggttggt gccttcgtag
     1261 tggcattacg tattttaccc gtttaatgga aacttcctca tgaaaaagtc tttagtcctc
     1321 aaagcctctg tagccgttgc taccctcgtt ccgatgctgt ctttcgctgc tgagggtgac
     1381 gatcccgcaa aagcggcctt taactccctg caagcctcag cgaccgaata tatcggttat
     1441 gcgtgggcga tggttgttgt cattgtcggc gcaactatcg gtatcaagct gtttaagaaa
     1501 ttcacctcga aagcaagctg ataaaccgat acaattaaag gctccttttg gagccttttt
     1561 ttttggagat tttcaacgtg aaaaaattat tattcgcaat tcctttagtt gttcctttct
     1621 attctcactc cgctgaaact gttgaaagtt gtttagcaaa accccataca gaaaattcat
     1681 ttactaacgt ctggaaagac gacaaaactt tagatcgtta cgctaactat gagggttgtc
     1741 tgtggaatgc tacaggcgtt gtagtttgta ctggtgacga aactcagtgt tacggtacat
     1801 gggttcctat tgggcttgct atccctgaaa atgagggtgg tggctctgag ggtggcggtt
     1861 ctgagggtgg cggttctgag ggtggcggta ctaaacctcc tgagtacggt gatacaccta
     1921 ttccgggcta tacttatatc aaccctctcg acggcactta tccgcctggt actgagcaaa
     1981 accccgctaa tcctaatcct tctcttgagg agtctcagcc tcttaatact ttcatgtttc
     2041 agaataatag gttccgaaat aggcaggggg cattaactgt ttatacgggc actgttactc
     2101 aaggcactga ccccgttaaa acttattacc agtacactcc tgtatcatca aaagccatgt
     2161 atgacgctta ctggaacggt aaattcagag actgcgcttt ccattctggc tttaatgaag
     2221 atccattcgt ttgtgaatat caaggccaat cgtctgacct gcctcaacct cctgtcaatg
     2281 ctggcggcgg ctctggtggt ggttctggtg gcggctctga gggtggtggc tctgagggtg
     2341 gcggttctga gggtggcggc tctgagggag gcggttccgg tggtggctct ggttccggtg
     2401 attttgatta tgaaaagatg gcaaacgcta ataagggggc tatgaccgaa aatgccgatg
     2461 aaaacgcgct acagtctgac gctaaaggca aacttgattc tgtcgctact gattacggtg
     2521 ctgctatcga tggtttcatt ggtgacgttt ccggccttgc taatggtaat ggtgctactg
     2581 gtgattttgc tggctctaat tcccaaatgg ctcaagtcgg tgacggtgat aattcacctt
     2641 taatgaataa tttccgtcaa tatttacctt ccctccctca atcggttgaa tgtcgccctt
     2701 ttgtctttag cgctggtaaa ccatatgaat tttctattga ttgtgacaaa ataaacttat
     2761 tccgtggtgt ctttgcgttt cttttatatg ttgccacctt tatgtatgta ttttctacgt
     2821 ttgctaacat actgcgtaat aaggagtctt aatcatgcca gttcttttgg gtattccgtt
     2881 attattgcgt ttcctcggtt tccttctggt aactttgttc ggctatctgc ttacttttct
     2941 taaaaagggc ttcggtaaga tagctattgc tatttcattg tttcttgctc ttattattgg
     3001 gcttaactca attcttgtgg gttatctctc tgatattagc gctcaattac cctctgactt
     3061 tgttcagggt gttcagttaa ttctcccgtc taatgcgctt ccctgttttt atgttattct
     3121 ctctgtaaag gctgctattt tcatttttga cgttaaacaa aaaatcgttt cttatttgga
     3181 ttgggataaa taatatggct gtttattttg taactggcaa attaggctct ggaaagacgc
     3241 tcgttagcgt tggtaagatt caggataaaa ttgtagctgg gtgcaaaata gcaactaatc
     3301 ttgatttaag gcttcaaaac ctcccgcaag tcgggaggtt cgctaaaacg cctcgcgttc
     3361 ttagaatacc ggataagcct tctatatctg atttgcttgc tattgggcgc ggtaatgatt
     3421 cctacgatga aaataaaaac ggcttgcttg ttctcgatga gtgcggtact tggtttaata
     3481 cccgttcttg gaatgataag gaaagacagc cgattattga ttggtttcta catgctcgta
     3541 aattaggatg ggatattatt tttcttgttc aggacttatc tattgttgat aaacaggcgc
     3601 gttctgcatt agctgaacat gttgtttatt gtcgtcgtct ggacagaatt actttacctt
     3661 ttgtcggtac tttatattct cttattactg gctcgaaaat gcctctgcct aaattacatg
     3721 ttggcgttgt taaatatggc gattctcaat taagccctac tgttgagcgt tggctttata
     3781 ctggtaagaa tttgtataac gcatatgata ctaaacaggc tttttctagt aattatgatt
     3841 ccggtgttta ttcttattta acgccttatt tatcacacgg tcggtatttc aaaccattaa
     3901 atttaggtca gaagatgaaa ttaactaaaa tatatttgaa aaagttttct cgcgttcttt
     3961 gtcttgcgat tggatttgca tcagcattta catatagtta tataacccaa cctaagccgg
     4021 aggttaaaaa ggtagtctct cagacctatg attttgataa attcactatt gactcttctc
     4081 agcgtcttaa tctaagctat cgctatgttt tcaaggattc taagggaaaa ttaattaata
     4141 gcgacgattt acagaagcaa ggttattcac tcacatatat tgatttatgt actgtttcca
     4201 ttaaaaaagg taattcaaat gaaattgtta aatgtaatta attttgtttt cttgatgttt
     4261 gtttcatcat cttcttttgc tcaggtaatt gaaatgaata attcgcctct gcgcgatttt
     4321 gtaacttggt attcaaagca atcaggcgaa tccgttattg tttctcccga tgtaaaaggt
     4381 actgttactg tatattcatc tgacgttaaa cctgaaaatc tacgcaattt ctttatttct
     4441 gttttacgtg ctaataattt tgatatggtt ggttcaattc cttccataat tcagaagtat
     4501 aatccaaaca atcaggatta tattgatgaa ttgccatcat ctgataatca ggaatatgat
     4561 gataattccg ctccttctgg tggtttcttt gttccgcaaa atgataatgt tactcaaact
     4621 tttaaaatta ataacgttcg ggcaaaggat ttaatacgag ttgtcgaatt gtttgtaaag
     4681 tctaatactt ctaaatcctc aaatgtatta tctattgacg gctctaatct attagttgtt
     4741 agtgcaccta aagatatttt agataacctt cctcaattcc tttctactgt tgatttgcca
     4801 actgaccaga tattgattga gggtttgata tttgaggttc agcaaggtga tgctttagat
     4861 ttttcatttg ctgctggctc tcagcgtggc actgttgcag gcggtgttaa tactgaccgc
     4921 ctcacctctg ttttatcttc tgctggtggt tcgttcggta tttttaatgg cgatgtttta
     4981 gggctatcag ttcgcgcatt aaagactaat agccattcaa aaatattgtc tgtgccacgt
     5041 attcttacgc tttcaggtca gaagggttct atctctgttg gccagaatgt cccttttatt
     5101 actggtcgtg tgactggtga atctgccaat gtaaataatc catttcagac gattgagcgt
     5161 caaaatgtag gtatttccat gagcgttttt cctgttgcaa tggctggcgg taatattgtt
     5221 ctggatatta ccagcaaggc cgatagtttg agttcttcta ctcaggcaag tgatgttatt
     5281 actaatcaaa gaagtattgc tacaacggtt aatttgcgtg atggacagac tcttttactc
     5341 ggtggcctca ctgattataa aaacacttct caagattctg gcgtaccgtt cctgtctaaa
     5401 atccctttaa tcggcctcct gtttagctcc cgctctgatt ccaacgagga aagcacgtta
     5461 tacgtgctcg tcaaagcaac catagtacgc gccctgtagc ggcgcattaa gcgcggcggg
     5521 tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt
     5581 cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg
     5641 ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga
     5701 tttgggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac
     5761 gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc
     5821 tatctcgggc tattcttttg atttataagg gattttgccg atttcggaac caccatcaaa
     5881 caggattttc gcctgctggg gcaaaccagc gtggaccgct tgctgcaact ctctcagggc
     5941 caggcggtga agggcaatca gctgttgccc gtctcgctgg tgaaaagaaa aaccaccctg
     6001 gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca
     6061 cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct
     6121 cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat
     6181 tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcgagct
     6241 cggtacccgg ggatcctcta gagtcgacct gcaggcatgc aagcttggca ctggccgtcg
     6301 ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac
     6361 atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac
     6421 agttgcgcag cctgaatggc gaatggcgct ttgcctggtt tccggcacca gaagcggtgc
     6481 cggaaagctg gctggagtgc gatcttcctg aggccgatac ggtcgtcgtc ccctcaaact
     6541 ggcagatgca cggttacgat gcgcccatct acaccaacgt aacctatccc attacggtca
     6601 atccgccgtt tgttcccacg gagaatccga cgggttgtta ctcgctcaca tttaatgttg
     6661 atgaaagctg gctacaggaa ggccagacgc gaattatttt tgatggcgtt cctattggtt
     6721 aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat taacgtttac
     6781 aatttaaata tttgcttata caatcttcct gtttttgggg cttttctgat tatcaaccgg
     6841 ggtacatatg attgacatgc tagttttacg attaccgttc atcgattctc ttgtttgctc
     6901 cagactctca ggcaatgacc tgatagcctt tgtagatctc tcaaaaatag ctaccctctc
     6961 cggcattaat ttatcagcta gaacggttga atatcatatt gatggtgatt tgactgtctc
     7021 cggcctttct cacccttttg aatctttacc tacacattac tcaggcattg catttaaaat
     7081 atatgagggt tctaaaaatt tttatccttg cgttgaaata aaggcttctc ccgcaaaagt
     7141 attacagggt cataatgttt ttggtacaac cgatttagct ttatgctctg aggctttatt
     7201 gcttaatttt gctaattctt tgccttgcct gtatgattta ttggatgtt
//
LOCUS       RATCDOL      1458 bp ss-mRNA            ROD       20-JUL-1990
DEFINITION  Rat cysteine dioxygenase mRNA, complete cds.
ACCESSION   M35266
KEYWORDS    cysteine dioxygenase.
SOURCE      Rat liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1458)
  AUTHORS   Hosokawa,Y., Matsumoto,A., Oka,J., Itakura,H. and Yamaguchi,K.
  TITLE     Isolation and characterization of a cDNA for rat liver cysteine
            dioxygenase
  JOURNAL   Biochem. Biophys. Res. Commun. 168, 473-478 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        190      792     cysteine dioxygenase (EC 1.13.11.20)
BASE COUNT      396 a    325 c    343 g    394 t
ORIGIN      
        1 gagggtttag gctggtctcc ggtgacctcc tagtcctaaa tcttgatacc cttgcaagag
       61 ctttgagcgt gtggggtccc gggcgttcgg ggtcccgggt gtgtgcggtt tgtatagcct
      121 gaagccgggg tcctccgcgc tcgcgtcctc cgcagctgga ctgaagagac gcgtcccagc
      181 cctgcgggga tggaacggac cgagctgctg aagccccgga ccctggccga cctcatccga
      241 atcttgcatg agctcttcgc cggggacgaa gtcaatgtgg aggaggtgca ggctgtgctg
      301 gaagcctacg agagcaatcc tgccgagtgg gctttgtatg ccaaattcga tcaatacagg
      361 tatacccgaa accttgtgga tcaaggaaat gggaagttta atctgatgat tctgtgctgg
      421 ggtgaagggc atggcagcag tattcacgat cacacggact cccactgctt tttgaagctg
      481 ctgcaaggaa atctaaagga gacattgttt gactggcctg acaagaaatc caacgagatg
      541 atcaagaagt ctgaaagaac tttgagggaa aatcagtgtg cctacattaa tgattctatt
      601 ggcttacatc gagtagagaa cgtcagccac acagagcctg ctgtgagcct tcacttgtac
      661 agtccacctt tcgatacatg ccatgccttt gaccaacgaa cagggcataa aaacaaagtc
      721 accatgacat tccacagcaa atttggaatc agaactccat ttacaacttc aggttcactg
      781 gagaacaact aagacctgcc aagcctttca aagttttgct tctgggtcgt tggaatgttt
      841 taccttggat aagagaggcc acccatcatt tgctgtccag ttatacattt taataagtcc
      901 atgctcagtg tgtatactaa ggaagcaaac catcccctga gctatgcagg agaaaaatcc
      961 cactaaagaa aaagtcactt gatttttaat agccaaatca ccttgctccc agttcttctg
     1021 tcttctaact ccatggaaat tctattggga gttctcagtg gggttttttt tcaaccttag
     1081 gaaagcactt ctggtctctg aactctaata atcaataagt aaaaatgaag aaaccacaag
     1141 ctatcacatg tctgttttca tacctggaag tctaagtgtg gaaatcttta atttactttg
     1201 tatgttctta atgtttgaca agaatttttt taaatcttgg ttttcagttt tttcaaccct
     1261 gtttgacaaa ttcctatgct gtggagacta gggatgcaga tagcagtttg gtgtttggta
     1321 gtgaacagca gtggggccag aaatgtgcat gtatccagac ctcctgcaaa taaaaactga
     1381 aactcatgtg taatgtgtgc caccacctta agctgccacc aaaattgcca aacgacttta
     1441 ataaaactgg atttgaga
//
LOCUS       HUMMYOHP     6024 bp ss-mRNA            PRI       20-JUL-1990
DEFINITION  Human perinatal myosin heavy chain mRNA, complete cds.
ACCESSION   M36769 M35250 Y00821
KEYWORDS    myosin; myosin heavy chain.
SOURCE      human fetal muscle, cDNA to mRNA, clone pSMHCP.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 6024)
  AUTHORS   Leinwand,L.A.
  TITLE     ;
  JOURNAL   Unpublished (1989) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 2651 to 6024)
  AUTHORS   Feghali,R. and Leinwand,L.A.
  TITLE     Molecular genetic characterization of a developmentally regulated
            human perinatal myosin heavy chain
  JOURNAL   J. Cell Biol. 108, 1791-1797 (1989)
  STANDARD  simple automatic
REFERENCE   3  (bases 1 to 2650)
  AUTHORS   Karsch-Mizrachi,I., Feghali,R., Shows,T.B. and Leinwand,L.A.
  TITLE     Generation of a full-length human perinatal myosin heavy chain cDNA
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Author address: L.A.Leinwand Albert Einstein College of Medicine
            1300 Morris Park Avenue Bronx, N.Y. 10461, USA.
            
            Submitted (12-APR-1989) on tape to the EMBL data library. Draft
            entry and computer-readable sequence for [3] kindly submitted by
            I.Karsch-Mizrachi, 18-JUN-1990.
FEATURES       from  to/span     description
    pept         74     5887     myosin heavy chain
BASE COUNT     1961 a   1300 c   1586 g   1177 t
ORIGIN      
        1 gtggaacact tctgaacctg catttttatc tggaactcca gaagcagaat cctttgctaa
       61 ataaatcgca gccatgagtg cgagctcaga cgctgagatg gctgtttttg gcgaagctgc
      121 tccctacctt cgaaaatcag aaaaggagcg gattgaggcc caaaacaagc cgtttgatgc
      181 taaaacatct gtctttgtgg cggagcccaa ggaatcctat gtgaagagca ctatacaaag
      241 caaagaagga gggaaagtaa ccgtaaagac tgaaggtgga gcaactctaa ctgtcaggga
      301 agaccaagtc ttccctatga accctccgaa atatgacaaa attgaggaca tggccatgat
      361 gactcatcta cacgagcctg gagtgctgta caacctcaaa gagcgctatg cagcctggat
      421 gatctacacc tactcaggcc tcttctgtgt caccgtcaac ccctacaagt ggctgccggt
      481 gtacaagccc gaggtggtgg ctgcctacag aggcaaaaag cgccaggagg ccccgcccca
      541 catcttctcc atctctgaca atgcctatca gttcatgttg actgatcgag agaatcagtc
      601 catcctgatc accggagaat ctggtgccgg aaagactgtg aacaccaagc gtgtcatcca
      661 atactttgca acaattgcag ttactggaga gaagaagaag gatgaatctg gcaaaatgca
      721 ggggactctg gaagatcaaa tcatcagcgc caatccccta ctggaggcct ttggcaatgc
      781 caaaaccgtg aggaatgaca actcctctcg ctttggtaaa ttcattagaa tccactttgg
      841 tactacaggg aagctggcat ctgctgatat agaaacatat cttttagaaa agtccagagt
      901 tactttccag ctaaaggcgg aaagaagcta ccatattttt tatcagatca cttccaataa
      961 gaagccagat ctaattgaaa tgctcctgat caccaccaac ccatatgact atgccttcgt
     1021 cagtcagggg gagatcacag ttcccagtat tgatgaccaa gaagagttga tggccactga
     1081 tagtgccatt gacatcctgg gcttcactcc tgaagagaaa gtgtccatct ataaactcac
     1141 aggggctgtg atgcattatg ggaacatgaa attcaagcaa aagcagcgtg aggagcaagc
     1201 tgagccagat ggcacagaag tcgctgacaa ggcagcctat ctccagagtc tgaactctgc
     1261 agacctactc aaagccctct gctaccctag ggtcaaggtt ggcaatgagt atgtcaccaa
     1321 aggccagact gtgcagcagg tgtacaatgc cgtgggtgct ctggccaaag ccgtctacga
     1381 gaagatgttc ctgtggatgg tcacccgcat caaccagcag ctggacacca agcagcccag
     1441 gcagtacttc atcggggtct tggacattgc tggctttgaa atctttgatt ttaacagcct
     1501 ggagcagctg tgcatcaact tcaccaacga gaaactgcaa cagtttttca accaccacat
     1561 gtttgtgcta gagcaggagg agtacaagaa ggaaggcatc gagtggacgt tcattgactt
     1621 tgggatggac ctggctgcct gcattgagct cattgagaag ccactgggca tcttctccat
     1681 cctggaagag gagtgcatgt tccctaaggc cacggacacc tccttcaaga acaagctgta
     1741 tgaccagcac ctgggcaagt ctgccaactt ccagaagccc aaggtggtca aaggcaaggc
     1801 tgaggcccac ttctctctga ttcactatgc tggcactgtg gactacaaca ttactggctg
     1861 gctggacaaa aataaggacc ccctgaatga tactgtggtt gggctgtacc agaagtctgc
     1921 aatgaagact ctagccagtc tcttttccac gtatgctagt gctgaagcag atagcagcgc
     1981 gaagaaaggt gctaagaaaa agggctcttc tttccagact gtgtctgccc ttttcaggga
     2041 aaatttaaat aaattgatga cgaatctgag gagcacacac cctcacttcg tacggtgtat
     2101 cattcccaat gaaaccaaaa ctcctggggc aatggaacat gaacttgtgt tgcaccagct
     2161 gaggtgtaat ggtgtgctgg aaggcatccg catctgtagg aaaggattcc caagcagaat
     2221 cttatatggt gatttcaaac aaagatacaa ggttttaaat gcaagtgcta ttccagaggg
     2281 acagttcatt gacagcaaga aggcttctga gaaacttctt gcatctattg atattgatca
     2341 tactcaatat aaatttggac ataccaaggt tttcttcaaa gctggacttc tgggtcttct
     2401 ggaagaaatg agagatgaaa aattagccca aattataaca agaacacaag ctgtctgtag
     2461 gggattccta atgagggtag aatatcagaa gatgttgcaa aggagagaag cacttttctg
     2521 catccagtat aatgtccgtg ccttcatgaa cgtcaagcac tggccctgga tgaaactctt
     2581 tttcaagatt aagcccctcc tcaagagtgc agagaccgag aaagagatgg ccaccatgaa
     2641 ggaagaattc cagaaaacca aagatgaact cgccaagtca gaggcaaaac ggaaggagct
     2701 agaggaaaaa atggtcactc tcttaaaaga gaaaaatgac ctgcaactcc aggttcaatc
     2761 tgaagcagat agcttggctg atgcagagga aaggtgtgag caactgatta aaaacaaaat
     2821 ccaacttgag gccaaaatca aagaggtgac tgaaagagct gaggaggagg aagagatcaa
     2881 tgctgagctg acagccaaga agagaaaact ggaggatgaa tgttcagaac tcaagaaaga
     2941 cattgatgac cttgagctga cactggccaa ggttgagaag cagaaacatg ccacggagaa
     3001 caaggtgaaa aatcttacag aagagatggc aggcctggat gaaaccattg caaaactgtc
     3061 caaggagaag aaggctctcc aagagaccca ccagcagacc ctggatgacc tgcaggcaga
     3121 ggaggacaaa gtcaacatcc tgaccaaagc taaaaccaag ctagaacagc aagtggatga
     3181 tcttgaaggg tctctggaac aagaaaagaa gcttcgaatg gatctagaaa gagcaaagcg
     3241 gaaactggag ggtgacctca aattggccca agaatccaca atggatatgg aaaatgacaa
     3301 acagcaactt gatgaaaagc ttgaaaagaa agaatttgaa atcagcaatt tgataagcaa
     3361 aattgaagat gagcaagctg tagaaattca actacagaag aagatcaaag agttgcaggc
     3421 ccgcattgag gagctggggg aagaaatcga ggcagagagg gcgtcccgag ccaaagcgga
     3481 gaagcagcgc tctgacctct cccgggaact ggaggagatc agcgagaggc tggaagaagc
     3541 cggtggggca acttctgctc aggtggaatt gaacaagaag cgggaggctg agtttcagaa
     3601 actgcgcagg gacctggagg aggccaccct gcagcatgaa gctatggtgg ctgctcttcg
     3661 gaagaagcac gcagacagta tggctgagct tggggagcag attgacaact tgcagcgggt
     3721 caaacagaag ctggagaagg agaagagtga gctgaagatg gagactgatg acctcagcag
     3781 taacgcagag gccatttcca aagccaaggg ccaccttgaa aagatgtgcc gctctctaga
     3841 agatcaagtg agtgggctta agaccaagga agaggagcag cagcggctga tcaatgacct
     3901 cacagcacag agagcgcgcc tgcagacaga agcgggtgaa tattctcgac aattagatga
     3961 gcaagatgct ttagtctctc agctttcaag gagcaagcaa gcatctactc agcagattga
     4021 agagctgaaa catcaactag aggaagaaac taaagccaag aacgccctgg cacacgccct
     4081 gcagtcctcc cgccatgact gcgacctgct gcgggaacag tatgaggaag agcaggaagg
     4141 caaagctgag ctgcagaggg cgctgtccaa ggccaacagt gaggttgccc agtggagaac
     4201 caaatacgag acggatgcca tccagcgcac agaggagctg gaggaggcca agaaaaagtt
     4261 ggcccagcgc ctgcaagaag ctgaggaaca tgtagaagct gtgaacgcca aatgtgcttc
     4321 ccttgagaag acgaagcagc ggctccagaa tgaagttgaa gacctcatgc ttgatgtgga
     4381 aaggtctaat gcagcctgtg cagcccttga taagaagcaa aggaactttg acaaggtcct
     4441 atcagaatgg aagcagaagt atgaggaaac tcaggctgaa cttgaggcct cccagaagga
     4501 gtcacgttct cttagcactg agctgttcaa ggtgaagaat gtctatgagg aatccctgga
     4561 tcaactcgaa acgctaagaa gagcacataa gaacttgcaa caggagattt ctgacctcac
     4621 tgagcagatt gcagagggag gaaagcaaat tcatgaattg gagaaaataa agaagcaagt
     4681 agaacaagag aaatgtgaaa ttcaggctgc tttagaggaa gcagaggcat ctcttgaaca
     4741 tgaagaagga aagattctgc gtatccagct tgagttaaac caagtcaagt ctgaagttga
     4801 tagaaaaatc gcagaaaagg atgaggaaat tgaccagctg aagagaaacc acactagagt
     4861 cgtggagaca atgcagagca cgctggatgc agagattaga agcagaaatg atgctctgag
     4921 agtcaagaag aaaatggaag gagatctgaa tgaaatggaa atccagctga accatgccaa
     4981 tcgcttagct gcagagagtt taaggaacta caggaacacc caaggaatcc tgaaggaaac
     5041 ccagctccac ctggatgatg ctctccgggg ccaggaggac ctcaaggaac agctggcaat
     5101 tgtggagcgc agagccaacc tgctgcaggc tgagatcgag gagctgtggg ccactctgga
     5161 acagacagag agaagcagga aaatcgccga acaggagctc ctggatgcca gtgagcgtgt
     5221 ccagctcctc cacacccaga ataccagtct cattaacacc aagaagaaat tagaaaatga
     5281 cgtttcccaa ctccaaagtg aagtggaaga agtaatccaa gaatcacgca atgcagaaga
     5341 gaaagccaag aaggccatca ctgatgctgc catgatggct gaggagctga agaaggaaca
     5401 ggacaccagc gcccacctgg agcggatgaa gaagaacctg gagcagacgg tgaaggacct
     5461 gcagcatcgt ctagatgagg ccgagcagct ggcgctgaag ggtgggaaga agcagatcca
     5521 gaaactggag gccagggtac gtgagcttga aggagaggtt gaaaatgaac agaaacgtaa
     5581 tgcagaggct gttaaaggtt tacggaaaca tgaccgacga gtaaaagaac tcacctacca
     5641 gactgaagaa gatcgcaaga atgttctcag gctgcaggac ttggtagata aattacaggc
     5701 gaaggtgaaa tcatacaaga gacaagctga ggaggctgag gaacaatcca atgctaatct
     5761 atctaaattc cgcaaactcc agcatgagct ggaggaggcc gaggaacggg ctgacattgc
     5821 tgagtcccag gtcaacaaat tgcgagtgaa gagccgagag gttcacacaa aaatcagtgc
     5881 agagtaaaca cacctgcctg atgctatcaa gaggctgaag aaaggcacaa aatgtgctat
     5941 ttttggtcac ttgctttatg acgtttattt tcctgttaaa gctgaataaa taaaaactac
     6001 agtaaatgta tacattaaaa aaaa
//
LOCUS       PT4MRH       1266 bp ds-DNA             PHG       20-JUL-1990
DEFINITION  Bacteriophage T4 rpoH modulating protein (mrh) gene, complete cds.
ACCESSION   M30001
KEYWORDS    Mrh protein; rpoH modulating protein.
SOURCE      Bacteriophage T4 (strain GT7) DNA, clone pMF28,h37.
  ORGANISM  Bacteriophage T4
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 1266)
  AUTHORS   Frazier,M.W. and Mosig,G.
  TITLE     The bacteriophage T4 gene mrh whose product inhibits late T4 gene
            expression in an E.coli rpoH (sigma-32) mutant
  JOURNAL   Gene 88, 7-14 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.W.Frazier, 20-NOV-1989.
FEATURES       from  to/span     description
    pept        782     1132     rpoH modulating protein
    ORF          96      302     mrh ORF soc-1
    site         39       68     early promoter 14.95
BASE COUNT      444 a    167 c    234 g    421 t
ORIGIN      
        1 ctcaaggact ccttcgggag tcctttttca tttaaatggt ttactttcca aaatgagtat
       61 ggtataatag aaattatctt atagaggaga gtactatgtt aaatcgttgg attaaaccaa
      121 atgaagattt ggatattatc atttcacgac atgtaatgaa gaaatatgaa ctacaaccat
      181 ggtctacaga agttgttgtg cattcattta tgatgtacgc agatggttct gtcgaattta
      241 atgtagagat tcgatatgat tatggtgaga agcaagtcga attcaaaaga ggctttttgt
      301 aatgtttatc tttaattggt ttaaaagttt ctttacggat tttttctcta caactctggg
      361 gaaggtgtag ttcctatttc aaatgactac cttcctttaa ctgtagttga atatgtttat
      421 atgggagatg gaacagtaga agcagttact atgacttatg aagaagccca agaatattat
      481 aaaaatcctt ggcgctggtc aacacctact acatcatcta acacacagaa tacacagtct
      541 agttctgatt catatgatac taatgttcct gttcatgtat gggcaggtga ttcatgtgga
      601 agttcttgtg attctagttg ttcatctaca tcttgtgatt gaggaaaatt atggaagcaa
      661 ttttgtttga aatgtctatc aatagtatta gtatggcgtt tgtcaaggat gttccaatta
      721 ctgtagcagt aatgattgat aaaagacaat aatatgtatc tagtagaaga ttttatttca
      781 aatgccaatt ccagaagatg ttgaaataaa acttaaaaag atcggaagga ttgaaactgt
      841 acaaaattct ccatttatgg caattcaagc atttactaaa tctaactata ttgatgttcg
      901 tgaagcatat tataataata aaccgttatc cttctattca tatgattcaa tatatgattg
      961 gaaaatagat aaaggaaata aatttataat tacggacgaa agtgcgctat catattttat
     1021 tacctcttta tggaataatt taaatccaaa tttgctaaaa attcataaat ttgacgatgc
     1081 tcctactatt gttttaggta aaacgaatga aagttctgaa gaagatgctt gaatggttca
     1141 atagaccaaa ctcaatgtat attgatgacg gttgggttga acaagcaaat aaagaaatca
     1201 gaatgaatca gaagaagaat ggatgaaatc aatgattatg ctgagaaaga aaagaaacta
     1261 gaacgc
//
LOCUS       BOVPER       1295 bp ss-mRNA            MAM       20-JUL-1990
DEFINITION  Bovine peripherin mRNA, complete cds.
ACCESSION   J02884
KEYWORDS    peripherin; photoreceptor protein.
SOURCE      Bovine retina, cDNA to mRNA, clones lambda.[5,4,8,17].
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1295)
  AUTHORS   Connell,G.J. and Molday,R.S.
  TITLE     Molecular cloning, primary structure and orientation of the
            vertebrate photoreceptor protein peripherin in the rod outer
            segment disk membrane
  JOURNAL   Biochemistry 29, 4691-4698 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Connell, 20-OCT-1989.
FEATURES       from  to/span     description
    pept    <     1     1023     peripherin
BASE COUNT      272 a    368 c    379 g    276 t
ORIGIN      
        1 aaatttgacc agaagaagcg ggtcaagttg gcccaagggc tctggctcat gaactggttc
       61 tccgtgttgg ctggtatcat catcttcggc ttagggctgt tcctgaagat tgaactccgg
      121 aagagaagcg atgtgatgaa caattctgag agccattttg tgcccaattc cttgatcggg
      181 gtgggggtgc tgtcctgtgt cttcaattct ctggctggca agatctgtta cgacgccctg
      241 gaccctgcca agtacgccaa gtggaagccc tggctgaagc cgtacctggc cgtgtgtgtc
      301 ctcttcaacg tggtcctctt cctggtggcc ctctgctgct tcctcctgcg gggctcgctg
      361 gagagtacgc tggcccacgg actcaagaac ggcatgaaat tctatcggga cacggacacc
      421 ccaggccggt gtttcatgaa gaagaccatc gacatgctgc agatcgagtt caagtgctgc
      481 ggcaacaacg gctttcggga ctggtttgag attcagtgga tcagcaaccg ctatctggat
      541 ttttcctcca aagaagtcaa agatcgcatc aagagcaatg tggacgggcg gtacctggtg
      601 gacggtgtcc ccttcagctg ctgcaacccc aactcaccgc ggccctgcat ccagtaccag
      661 ctcaccaaca actctgcgca ctacagctac gatcaccaga cggaggagct caacctgtgg
      721 ctgcgtggct gcagggccgc cctgctgagc tattacagca acctcatgaa tactacaggc
      781 gctgtgacgc tcctcgtttg gctctttgag gtgaccatca ctgttgggct acgctacctg
      841 cacacggcgc tggaaggcat ggccaacccc gaagaccctg agtgcgagag tgagggctgg
      901 cttctggaga agagcgtgcc ggagacctgg aaggcctttc tggagagtgt gaagaagctg
      961 ggcaagggca accaggtgga agccgagggc gaggacgcag gccaggcccc ggcggcaggc
     1021 tgacggccct gcggccccct cccctctgca cactgaaaag tagtggactc caggaacttc
     1081 ggataccccc cggatcccat cagaatctcc caaagagggc ggccgtctca cagaggctct
     1141 tctggatgtg ggatttaata ttcagggccc ctaaaaactt ctgtggaatg aatgtctcac
     1201 tgtgagtgca tgagtcctgc tcagccccac cgccctggat gcaccttcac ggtgggctca
     1261 tggagggcct ctcagcaggt gactctgtgg tcaca
//
LOCUS       HAMRPS14     5300 bp ds-DNA             ROD       20-JUL-1990
DEFINITION  C.griseus ribosomal protein S14 (RPS14) gene, complete cds.
ACCESSION   M35008
KEYWORDS    ribosomal protein S14.
SOURCE      C.griseus (haplotype EmtB wild type) ovary transformed fibroblast
            cell line CHO DNA.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 5300)
  AUTHORS   Diaz,J.-J., Rhoads,D. and Roufa,D.J.
  TITLE     Genetic analysis of a vital mammalian housekeeping locus using CHO
            cells that express a transfected mutant allele
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.J.Roufa, 08-JUN-1990.
            
                Author address: D.J.Roufa
                Kansas State University
                Division of Biology
                Ackert Hall
                Manhattan, Kansas 66506
FEATURES       from  to/span     description
    pept       1977     2125     ribosomal protein S14, exon 2 (first expressed
                                 exon)
               2576     2737     ribosomal protein S14, exon 3
               4074     4150     ribosomal protein S14, exon 4
               4724     4791     ribosomal protein S14, exon 5
    pre-msg     179     4841     ribosomal protein S14 mRNA and introns
    IVS         220     1974     ribosomal protein S14 intron A
    IVS        2126     2575     ribosomal protein S14 intron B
    IVS        2738     4073     ribosomal protein S14 intron C
    IVS        4151     4723     ribosomal protein S14 intron D
    signal     4819     4824     poly-A signal
BASE COUNT     1315 a   1121 c   1415 g   1449 t
ORIGIN      
        1 agatctggtc acagcgacac gcagcacccg gagtccttaa taagccatct tggtcgccgg
       61 caggcttagc ctccttggaa accgcaatag aggcttcttc cgggagccgg gattccctcc
      121 gtgacctcag aggtacactt ccggaatacg gaagtgaccc gtctgcccgc cccttccttc
      181 tctctctttc cggtggagaa gtctggagac gacgttcagg tagggaggcc cggcggccca
      241 caatcgtgcg gcatcctctg gccaggggcg tctgcgggct tggggtcgcg gtttgtgcgc
      301 ggtaaccgag ccaggtgcgt gatggtgacg ggcggctgtc cgctcgccgc gatccacccg
      361 gcctggggct gcgccgtgtc cgggaggcgg ccttctggcc tgagatcgga aggtctgagg
      421 tggaagtgcg gggcctgcct gggccttgcc gtcacgccac agtcgtcgcc cgagcgttcg
      481 gggagcgtgt ccctagtcgc gggtgggccg tcaccggcac cccattagat tcttaggggc
      541 cgtcgacgac ctgcacctaa cctcatgccc ctcatgtgga aaatacgagc agcgttcgta
      601 tctgcccctt ctgtaaaccg aagcgtaaaa tgggatgtca gaaaacctgg tcacgcagga
      661 ttagagtgct tctaacccat ggtggttacc cctccgagtc ggacacattg cctgttgggg
      721 tgccccctag tagagtagga ctgatgccat atgtttacat ttctctgaat agccctctcg
      781 tctccactga ggacggcaaa gctacactga acagatgtca tgtcttatgg aatcgtgttt
      841 gttagcctaa tttgagggta tgagcaggtg actacatcct ggacctgccc ttgtgctttt
      901 ggcaaataag tcccaaatag tgtgtgaggg aggcactaca gagatcccag gtgtggttga
      961 aacttttaag gcctaggagg aactttgtgg agtaggagca tcttggttct aggaggagta
     1021 taagacaggc tttggtgcag tattgtcctg gacagactta gcctcgggtt ggtcaaaggc
     1081 atgcctgcag ctttggcaaa aaaaaaaaaa gcctctttgc ttgtgtattt tccctgggtc
     1141 ctgtgactgt acttacatta ttgactaggc aggaaggggt tacatgtcag gtgtatttgc
     1201 agttcataac aatagcaaaa ttagagttat gaagtagtag caatgacttt ttgtgtgggg
     1261 ggcaccacaa tatgagaatt gtactaaagg gtcacagcgt taggttgaga accaaagggt
     1321 agactcactt gatgctgaac actgaatttt aggtatggat atctttacca ttaaacttcc
     1381 tggaaggcat accctgacca tcctgggact tgctacgtag accataacta cgatatcctc
     1441 ctgcctgtga ctcccaggtg ctggggctaa agttatacac aagcacacat tgctgtggca
     1501 gtgcttttta atgtttatta ctagtaaagg gtgaaagatt tggttctaaa cagtgagagg
     1561 ctgtcagatt aagaattgtg tgtggacagg gagtccgtta atctgcagtt ggcttcagtt
     1621 gtgtcatgtt aatcaaataa acttcccagc ttaaggagag ggagagggga gttgtgaatg
     1681 gggctttgcc ccaacacctg agttcagcat cctctatttt ggataactac ttgtgaaagt
     1741 ttgaggtgca tattgttcat tgctggaatc ttaggctcat cctgaaaatg cttggagctt
     1801 ttgtttagaa attgtctcct ctcttggcgc ctccctttta gttacaactc agtaattccc
     1861 caagtacaaa tatgttggat attagagagc caaaggagct tgggttttcc tttagtaggc
     1921 taaagtattt ggggatgttt tgtctgttaa catggaggtt ctccttccac tcagaaatgg
     1981 cacctcgcaa gggaaaggaa aagaaggaag aacaggtcat cagcctcgga cctcaagtgg
     2041 ctgaaggaga gaatgtattt ggtgtctgcc acatttttgc atctttcaac gacacctttg
     2101 tccatgttac tgatctttct ggcaagtgag tacttgggga catgaagaca taaaaccaag
     2161 cctcacaggg actgaggtgg gagaaaaagg cataggagtt tatgggattc cctctaaaga
     2221 attactgatt ttcttatttt cccataaatt ataagagaaa acagttgaat gtagctgagg
     2281 attcatccac agaaaatcac cgcaaatagg aagtggaagc agaggatcag aaactcaatt
     2341 atcctcagct acacgcacat gtaaaatata tgcattatgg aagtgattat ggggataaaa
     2401 tgcaaaacta attgttgtat taacagggag tttgtctaga tagggtcctt ggacccctgt
     2461 aattatacac aactcctgtc ttgttcagaa ccaggatgtg tgcttgttgg ggttcagcaa
     2521 gtgtttcctg tgtccttgtt ggaaaatgaa ggcatggatc tgcttttgtc tctagggaaa
     2581 ctatctgccg ggtgactggt ggaatgaagg taaaggctga cagagatgag tcttctcctt
     2641 atgcagccat gttggctgcc caggatgttg cccagaggtg caaggagctg ggcatcactg
     2701 ccctgcatat caaactccga gctacgggag gaaacaggta tggaagtcta gggctggaca
     2761 tttgtaagtt cactgggctt gggggtggat ctgagatttg agtctctgct gtgtctgaga
     2821 gcaagccgtt ttaggaatgt atgtatgttc ttcgtcaggc atggtagctg atacctgtaa
     2881 tcccagcact ccggaaccaa tataggagga ggatctaaag tttggtgcta acctgggcta
     2941 cacagtgagc ctcaaaaata agatagtttg ttcctttatg tttgtccctg tagagtgctg
     3001 cttttatttt ttttttttta aagcatacat gagactgact cttccccagc accaggtaga
     3061 ctgggtgtgg tagtatacac ctgttgtctc agaaagtgga ggtgaggaga gctagagttt
     3121 caagttggtt attggttgca tattgagtag gtggccagct tgagctgaac agtgatgctt
     3181 aaatgatgaa ctgggggtgc atagtccaga tagagacaaa acttcaagag aagcatcttg
     3241 gggtgagaaa gaagttgggc tttacaggga tgattggtgt aagatatgat ctaagaagca
     3301 gggatggagc ctctggggtc ttaagtttta caaaacagag caagggcttg ggcatgaaaa
     3361 ctagaatagg tggctactga gtggaaagat actgcatagg aagccctaag gaagttacat
     3421 actggatgta ggagctggga agtaggcatc acagcaaggg gaggacagcc acaggccagg
     3481 tgactatccc ctcccccccc cacacacaca catgtacact tttaaaggat ttgtaggcat
     3541 ggtggctcac acctttaatc ccagcatctc ttggggggcg tccccccaca aaagtgggaa
     3601 aaattacatg cgtatgttgc ctgtgtgcat gcatgcgtgc attatcggtt ccctgtggag
     3661 gacagaagag gaccatgaat ctcttggaac tggtgttgct gagccacctc tccaggcctc
     3721 agttagcagt tttgaaaggg tttggttttg ttttgttttt tcgagacagg gtttcttctg
     3781 tgtagctttg gagcctatcc tggcactggc tttggagacc aggctggcct caaactcaca
     3841 gagatccgac tgcctcccga gtgctgggat taaaggtgtg caccaacgca ggatttggtt
     3901 ttgttgtgag attgtataat atatgaaaac cgtagcagcc tacaaagtct agaattttgt
     3961 gtaatcccct cacagtgtgc agatctttaa ctatagcccc accaccattc agaagtgaat
     4021 taaaaccatg tgaacattcc cactgacttg accaacaatc cttcattttt taggaccaag
     4081 acccctggac ctggagccca gtcagccctc agagctcttg ctcgttcagg gatgaagatt
     4141 gggcggattg gtaagtgacc cttctagcta acattgggtt tattttgaag cattgacccc
     4201 ttccaaacta tgtgatgtgc atattggttc aagcttgtct ggtgactggt gcagttaact
     4261 tttgttaacc tgtaaaagac attgagagcg ttgatttctg taaatgcccc ttaactgtta
     4321 tattttccca tttgcaattg ggaagactta aaaaacactt gtgaatctgt cctagcagcc
     4381 tttttccctg ctagctaaat gttaagggta tgcccttggg acagtccaag tgaataattg
     4441 cacacacctg gcctaaagag caaacagcca caggtgctca gtgtaggggc tctgagttag
     4501 aaacttgaag gagtccttcc cctctgatgt ttgtgtgcct tctcagagcc tgtgttgtcc
     4561 tcgtccaatg tgggttgaga tgaggaaatg acaggtgagg tgggcccttg tttattgacc
     4621 actggtgata catttgggga agaataaagt ctgctagtta gtctaatggc tagggaggct
     4681 gtgtttatgt agctatgggc aatgacattt tttcttcttg cagaggatgt cacccccatc
     4741 ccctctgaca gcacccgaag gaagggtggt cgtcgtggtc gccgtctgtg aacaggactt
     4801 ctcaaattat tttctgttaa taaattgctt tgtataagct attttggttc tgatgtttgt
     4861 ttgttttgag gcaggatctc tctccatgtc tatgtagaag agtctggcct ccaactcaga
     4921 tccacatacc tgtctctgcc tcccaagtac tggggataaa ggaatgcacc accacacccc
     4981 ggtgacctta ggggccaccc agggcaacac caaggcagtg ttttgagggg atgttaagat
     5041 tgcatgtagg agcttctaat gtaggttggg ggttggcaaa cctgggtttg caggctggct
     5101 ttctgagaat gggaacctga gagaggagat gctgaggctg ccactctggc attaatttgc
     5161 cagccagtat tttcaaagtg gaactgttag tagcccatgc tgtcatggga cagctaaaca
     5221 ccacacacac acacacccca agcagggtag tagagatcac aagttggaag ccagtttaat
     5281 atggtctact tggggaattc
//
LOCUS       GPIFCGR      1494 bp ss-mRNA            ROD       20-JUL-1990
DEFINITION  Guinea pig Fc-gamma-1/gamma-2 receptor mRNA, complete cds.
ACCESSION   M35272
KEYWORDS    Fc-gamma receptor; Fc-gamma-1/gamma-2 receptor.
SOURCE      Guinea pig peritoneal macrophage, cDNA to mRNA, clone D-3.
  ORGANISM  Cavia sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Hystricomorpha; Caviidae; Caviinae.
REFERENCE   1  (bases 1 to 1494)
  AUTHORS   Tominaga,M., Sakata,A., Ohmura,T., Yamashita,T., Koyama,J. and
            Onoue,K.
  TITLE     The structure and expression of the guinea pig Fc receptor for IgG1
            and IgG2 (Fc-gamma-1/gamma-2-R)
  JOURNAL   Biochem. Biophys. Res. Commun. 168, 683-689 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        100      972     Fc-gamma-1/gamma-2 receptor precursor
    sigp        100      147     Fc-gamma-1/gamma-2 receptor signal peptide
    matp        148      969     Fc-gamma-1/gamma-2 receptor
    mRNA    <     1     1494     Fc-gamma-1/gamma-2 receptor mRNA
BASE COUNT      401 a    369 c    348 g    376 t
ORIGIN      
        1 cagcttgctg cagaagtgat ggcgatccct tcgttcttac ctgtccttgg caccaagagt
       61 caccgagctg actataagcc cttgcagact ttgagccaca tgctgctgtg gataactgtg
      121 ctattcctgg ctcctgttgc tgggacctct gcagaccccc cgaaggctgt ggtgagactg
      181 gagcccccat ggatccaggt gctccgggga gaccgcgtga ctctgacctg cgagggtgcc
      241 cccagccctg ggaaccactc cacccagtgg ctccacaacg ggagactcat ccctacccag
      301 gtcctgccca gctaccggtt cactgccaag ggcaatgaca gcggagagta caggtgccag
      361 gtcctgccca gctaccggtt cactgccaag ggcaatgaca gcggagagta caggtgccag
      421 gcgggcggga ccagtctcag cgaccctgtg cgtctggacg tgatttctga ctggctggtg
      481 ctccagactt ctcaactgat tttccaggag ggggacgtca tcgtgctgcg gtgccacagc
      541 tggaataact ggcctttggc caaggtcaca ttctaccaca atggggtagc caagaaatat
      601 ttctctatca gtaaaaattt ctccatccca caagcaaacc acagtcacag tggtgcttac
      661 aactgcacgg gattaatagg aaggacatct cacacatcac cgcctgtgac catcactgtc
      721 caagggccca agtcaagcga ctcttcaatg gtggtgataa ttgtggctgc agtcattggg
      781 attgctacag cggccattgt tgttgctgta gtagctatca tctgcctcaa gaaaaagcag
      841 cctccagcca atctctctga tcctgaggag gtcgctaaat ctgaggttga aaatacaatc
      901 acctattcac ttctcaagca cccggaagct caggatgatg acacagagca tgactatcag
      961 aaccacattt aatctccatt atctggccct gggatttggg ggagaaaaat caagaagtga
     1021 agatctgcta tctccaggcc taaggttccc ttggagaggt cgagaggatg ctgaagttca
     1081 aagaaggagc aggatttttc cagagtcctg tatgtgagtc ctaaagttct ttggcctgac
     1141 actaacagaa aatatgaact ctgaaggctg gctgattctg tgcctcagca cttccctaca
     1201 tcagggctgt tatacagccc cacagccaac aaaatgataa aattaatatt gctaagagat
     1261 tttaacaaca tgtgacatgc ctacattatg ggtaacatga gaaaaattac ataagtatat
     1321 atgatttcag aagtgataaa atcaactaac atctaccaac atattaaaaa tgattgtttc
     1381 agggtgatag aattatcagt ggtttttgtt ctttcttatt ttcctacaaa tctataagtt
     1441 tattttccta taaatcctat aaatcatgta ctgtatttgt aataaaatat tatg
//
LOCUS       NPOCAP       2104 bp ds-DNA             VRL       20-JUL-1990
DEFINITION  Orgyia pseudotsugata polyhedrosis virus capsid protein (P39)
            gene, complete cds.
ACCESSION   M21042
KEYWORDS    capsid protein; structural protein.
SOURCE      Orgyia pseudotsugata polyhedrosis virus DNA, clone cosmid #58.
  ORGANISM  Orgyia pseudotsugata polyhedrosis virus
            Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; 
            Nuclear polyhedrosis viruses.
REFERENCE   1  (bases 1 to 2104)
  AUTHORS   Blissard,G.W., Quant-Russell,R.L., Rohrmann,G.F. and Beaudreau,G.S.
  TITLE     Nucleotide sequence, transcriptional mapping, and temporal
            expression of the gene encoding p39, a major structural protein of
            the multicapsid nuclear polyhedrosis virus of Orgyia pseudotsugata
  JOURNAL   Virology 168, 354-362 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by G.W.Blissard, 03-NOV-1988.
            
            The p39 protein is found in both the PDV and BV virion phenotypes;
            P39 mRNAs are approximately 2600 nt in length, are expressed late
            (24-48 hpi), and have two major transcription initiation sites.  At
            each major transcription start site, two transcripts are initiated
            within the ATAAG consensus.  Transcripts initiate at positions 2
            and 3 of the ATAAG consensus.
FEATURES       from  to/span     description
    pept        124     1179     capsid protein
    mRNA         27  >  1179     capsid protein mRNA (alt.)
    mRNA         28  >  1179     capsid protein mRNA (alt.)
    mRNA         75  >  1179     capsid protein mRNA (alt.)
    mRNA         76  >  1176     capsid protein mRNA (alt.)
BASE COUNT      495 a    606 c    586 g    417 t
ORIGIN      Map position 57-58.
        1 ccggcgcagc gcaccgtttc tttatataag aatctaaaat tagatacaag aggtcttggc
       61 taaaattgat tgtataagag atttcttgct ctatcaaaac gtcggccccc atcgccgcag
      121 aaaatggcgc tcgtgtcacc cggggtgtcg tcgcgacggt cgaccaacca ctgcatcttt
      181 ggcgcgattg aaccgttcga ctcgtgcgtc acgtacaggt cgccatgctc gtcggacgcg
      241 tcggtggacg acggctggtt catctgcgac taccacctga agttgcgctt caaaatgtcc
      301 aaaatggtgc tgcccattta cgacgaagac gacaaccagt acaagcgcac catagcgcgg
      361 catttggtgg ggcacaaaga aaggggcgta aagcgcattc tggtgcccac cagagccaac
      421 tacatgaccg tgttcaattt gccgggcatg atgctggccg agcagctgat ttttcatttg
      481 atctacgaca accgtttgga agtcaatcgc atttgcgcct cgttgaaaaa caacgaaaac
      541 ttcattgaca acacgtacag cgttgtggag agcgtgtaca gcgcgacgcg caacattctg
      601 tcgctcacag acccgcaagc gtactgctcg cgcgtggcca acgacgacgt gcgctttttc
      661 gacgcgaacg tagtcgacaa caactaccag gcgggcaacg gcgacaccgt gtttaacaac
      721 atgcccgggt ttttgcgcaa cctaatcagg cgcgccgtgg cgcccgaaac gctgcagatc
      781 gacagcgaag acctgcgcct gcgaaactgc aacacctgtg tgattaacaa caccggcttg
      841 gtagcaaccg tgacaaacac cgagctgtac aacccggtgc gcagctcgga catcattaaa
      901 acggggccca atcgcttgca gattagaaac gttttgaagt ttgagggcga cacacgcgcg
      961 ttggagcgca ccttgggccg ctacgaggaa taccccatgt acgtgccctt gtttttgggc
     1021 taccagctgg tcaacctgca aaacgacatt ttgcgcgcca acaacttttt acccgcgccg
     1081 tttggcgtgc cgcaggccgt aaacaacctg gaggcccaag cgcccgcagc gcccgcaccc
     1141 gcagcgcccg cgcccgcgcc cgcagcgccc gttgtttaat tccgttgcca tggacacggt
     1201 gaggctgcag tgccacaaat ttgctgctcg gtgggcgaaa tcaaaaatta ttttttgcag
     1261 cccgtcgacg caatcaccat tttgcccatt gtggagctgc acacgtgccg gcaccaattg
     1321 tgcgtcatgt gcgtgcgcaa aattgcgcag cgcggccgag acaagcgcgt cgagtgcccc
     1381 atgtgccgcc gcaaaaacgc gcactttaac gtgtacagcg tgaaccgtaa ttcggtggat
     1441 gtgttgcgct gcagcgtggc cgacgtgcgc gagcacgggc gctttggcgg attggcggac
     1501 gccgcgtcgc tggctcgtgg gctgtttgag cccagcttgc ttgaagccga gcccgcgccg
     1561 gacaactcgt ttggcccaaa cgagttacaa cttgtgctaa aacgtctaaa ggcgcaaatt
     1621 gaggcgcaaa caagaactaa ttatgatttg caactgcaag caactgcttt ggagcgaaca
     1681 atcgaagaag caaacgaccg cttgggtaaa agccgcggcg actacagcga cgcgtgcaag
     1741 ctgatggacg agcttcgcgg cgacaggctg cgcgcggagc gcgcggtcaa ggcgctcgcc
     1801 gacgcgcacg cccagtgggc ggacaagaac gccaaaatgc ggcgcgagaa cgacaggttg
     1861 acaaatgaaa acattggttt gattcgggac aacaatttat ttaaacaaaa cactgcacgc
     1921 aaacgtaaaa tcgctccgta accgcctttg tttgcgcgtc aaagtccacc tcgtagacgt
     1981 gtcgctcgtt tacgctgccg cagcggtccc agttgtccgt cgtgcaccac gccgcattgt
     2041 gtcggcggca caaatacagc gcttcgttga attgcgccgg cggcagattg aagcggccga
     2101 gctc
//
LOCUS       RATPPTA       981 bp ss-mRNA            ROD       20-JUL-1990
DEFINITION  Rat alpha-tachykinin mRNA, complete cds.
ACCESSION   M34184
KEYWORDS    alpha-tachykinin; substance P.
SOURCE      Rat (Sprague-Dawley) rostral portion of the caudate putamen
            complex, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 981)
  AUTHORS   Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D.
  TITLE     Three rat preprotachykinin mRNAs encode the neuropeptides substance
            P and neurokinin A
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Krause, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        100      438     preprotachykinin
    sigp        100      156     tachykinin signal peptide (156 could be 171)
    matp        271      303     substance P
    mRNA    <     1      981     tachykinin mRNA
BASE COUNT      271 a    203 c    234 g    273 t
ORIGIN      
        1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggagcgccca gcaagtgcgc
       61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg
      121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat
      181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gccggagccc
      241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta
      301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt
      361 tatgggcatg gtcagatctc tcacaaaatg gcttatgaaa gaagcgcaat gcagaactac
      421 gaaagaaggc gtaaataaac cctgtaacgc actatctatt catctccatc tgtgtccgcg
      481 agcagtgagc ggtaaaataa aaatgtgcgc tatgaggaat gattatttat ttaatatcaa
      541 atgttgttat gagtgaaaaa ctcaaaaaag tgtttatttt ttcatattgt gccaataagc
      601 attgtaattc taatgtggtg acctcctcag acagaagtag aaattagttg taacttcagc
      661 aaagcacagt gttgatggag ttgtacaagt ttgccagcga tgcaagtctc caaagacaga
      721 aaggctgctg tgaggcagtg caggcggctg ctgctggagg cagagaaact cctgtgtgtc
      781 ttgcgcttcc cttggttgct tttatcctaa tgatgtactg agagtttggt atctgactct
      841 atttgtatcc tagcagcatg tttcctgtgt tgtgactata tagagatgtt tttaaaagtt
      901 tcaatgtact tctctggtct tcagtcattg tatgatgtgt tgtgatagct accattttaa
      961 ataaaagaat gtatcttcag g
//
LOCUS       RATPPTB      1035 bp ss-mRNA            ROD       20-JUL-1990
DEFINITION  Rat beta-tachykinin mRNA, complete cds.
ACCESSION   M15191
KEYWORDS    neurokinin A; substance P; tachykinin.
SOURCE      Rat (Sprague-Dawley) rostral portion of the caudate putamen
            complex, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1035)
  AUTHORS   Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D.
  TITLE     Three rat preprotachykinin mRNAs encode the neuropeptides substance
            P and neurokinin A
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        100      492     beta-preprotachykinin
    sigp        100      156     beta tachykinin signal peptide (156 could be
                                 171)
    matp        271      303     substance P
    matp        391      420     neurokinin A
    mRNA    <     1     1035     beta-preprotachykinin mRNA
BASE COUNT      289 a    212 c    245 g    289 t
ORIGIN      
        1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc
       61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg
      121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat
      181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gcccgagccc
      241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta
      301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt
      361 tatgggcatg gtcagatctc tcacaaaagg cataaaacag attcctttgt tggactaatg
      421 ggcaaaagag ctttaaattc tgtggcttat gaaagaagcg caatgcagaa ctacgaaaga
      481 aggcgtaaat aaaccctgta acgcactatc tattcatctc catctgtgtc cgcgagcagt
      541 gagcggtaaa ataaaaatgt gcgctatgag gaatgattat ttatttaata tcaaatgttg
      601 ttatgagtga aaaactcaaa aaagtgttta ttttttcata ttgtgccaat aagcattgta
      661 attctaatgt ggtgacctcc tcagacagaa gtagaaatta gttgtaactt cagcaaagca
      721 cagtgttgat ggagttgtac aagtttgcca gcgatgcaag tctccaaaga cagaaaggct
      781 gctgtgaggc agtgcaggcg gctgctgctg gaggcagaga aactcctgtg tgtcttgcgc
      841 ttcccttggt tgcttttatc ctaatgatgt actgagagtt tggtatctga ctctatttgt
      901 atcctagcag catgtttcct gtgttgtgac tatatagaga tgtttttaaa agtttcaatg
      961 tacttctctg gtcttcagtc attgtatgat gtgttgtgat agctaccatt ttaaataaaa
     1021 gaatgtatct tcagg
//
LOCUS       RATPPTG       990 bp ss-mRNA            ROD       20-JUL-1990
DEFINITION  Rat gamma-tachykinin mRNA, complete cds.
ACCESSION   M34183
KEYWORDS    gamma-tachykinin; neurokinin; substance P.
SOURCE      Rat (Sprague-Dawley) rostral portion of the caudate putamen
            complex, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 990)
  AUTHORS   Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D.
  TITLE     Three rat preprotachykinin mRNAs encode the neuropeptides substance
            P and neurokinin A
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Krause, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        100      447     preprotachykinin
    sigp        100      156     tachykinin signal peptide (156 could be 171)
    matp        271      303     substance P
    matp        346      375     neurokinin
    mRNA    <     1      990     tachykinin mRNA
BASE COUNT      275 a    202 c    238 g    275 t
ORIGIN      
        1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc
       61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg
      121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat
      181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gccggagccc
      241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta
      301 atgggcaaac gggatgctgg gcatggtcag atctctcaca aaaggcataa aacagattcc
      361 tttgttggac taatgggcaa aagagcttta aattctgtgg cttatgaaag aagcgcaatg
      421 cagaactacg aaagaaggcg taaataaacc ctgtaacgca ctatctattc atctccatct
      481 gtgtccgcga gcagtgagcg gtaaaataaa aatgtgcgct atgaggaatg attatttatt
      541 taatatcaaa tgttgttatg agtgaaaaac tcaaaaaagt gtttattttt tcatattgtg
      601 ccaataagca ttgtaattct aatgtggtga cctcctcaga cagaagtaga aattagttgt
      661 aacttcagca aagcacagtg ttgatggagt tgtacaagtt tgccagcgat gcaagtctcc
      721 aaagacagaa aggctgctgt gaggcagtgc aggcggctgc tgctggaggc agagaaactc
      781 ctgtgtgtct tgcgcttccc ttggttgctt ttatcctaat gatgtactga gagtttggta
      841 tctgactcta tttgtatcct agcagcatgt ttcctgtgtt gtgactatat agagatgttt
      901 ttaaaagttt caatgtactt ctctggtctt cagtcattgt atgatgtgtt gtgatagcta
      961 ccattttaaa taaaagaatg tatcttcagg
//

GenBank-Updates@genbank.bio.net (07/21/90)

LOCUS       BSUGLPKD     4259 bp ds-DNA             BCT       21-JUL-1990
DEFINITION  B.subtilis glycerol kinase (glpK) and glycerol-3-phosphate
            dehydrogenase (glpD) genes, complete cds.
ACCESSION   M34393
KEYWORDS    glycerol kinase; glycerol-3- phosphate dehydrogenase.
SOURCE      B.subtilis (starin W168) DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 4259)
  AUTHORS   Holmberg,C., Beijer,L., Rutberg,B. and Rutberg,L.
  TITLE     Glycerol catabolism in Bacillus subtilis: Nucleotide sequence of
            the genes encoding glycerol kinase (glpK) and glycerol-3- phosphate
            dehydrogenase (glpD)
  JOURNAL   Unpublished (1990) University of Lund, Dept. of Microbiology,
            Solvegatan 21, S-223 62 Lund , SWEDEN
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.M.E.Holmberg, 22-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      679     ORF 1 (AA at 2)
    pept        698     2188     glycerol kinase (glpK) (EC 2.7.1.30)
    pept       2329     3996     glycerol-3-phosphate dehydrogenase (glpD) (EC
                                 1.1.99.5)
    pept       4128  >  4259     ORF 4
    binding     682      690     ribosome binding site
    binding    2315     2321     ribosome binding site
    binding    4116     4121     ribosome binding site
    signal     2243     2303     transcription termination signal
    signal     4000     4033     transcription termination signal
    signal     4067     4072     -35 region
    signal     4090     4095     -10 region
BASE COUNT     1269 a    853 c   1092 g   1045 t
ORIGIN      
        1 ccatggcgga tacgcggttg gcggcatcag cggtgcccat ttgaatccgg cgctaacgat
       61 agcgcttgca tttgtaggag attttccttg gaaagaggtt ccggtttata ttgcggcgca
      121 aatgatcgga gcgatcatcg gtgcggtgat tatttatctg cattacctcc cgcactggaa
      181 gtcaacggat gatcccgctg ccaagctggg tgttttctca acagggccca gcattccgca
      241 tacatttgca aacgttttaa gcgaagtgat tgggacattt gtccttgtac ttggaatctt
      301 ggccataggt gcaaatcaat ttacagaagg acttaatcct ttaatcgtcg gtttcctcat
      361 tgtagcaatc ggtatttctc tgggaggcac caccggctat gctatcaatc ctgcacgtga
      421 cttaggtccg cggatcgccc acgcttttct tccgattccg gggaagggct catcaaactg
      481 gaaatacgca tgggttccgg tagtcggccc gattttaggc ggatcattcg gcggtgtatt
      541 ttacaacgct gcatttaaag ggcacatcac aagcagcttc tggattgtaa gcgttatatt
      601 ggttgtggta ttgttaggac tctatgttta tacgaaatca cattctgcta aaacattatc
      661 aaattctaaa tatatttaat caaaggggag acatcttatg gaaacgtaca ttttatcctt
      721 agatcagggg acgacaagtt caagagcgat tctgtttaat aaagaaggca aaattgtcca
      781 ctctgctcaa aaggaattta cacaatactt cccgcatcct ggctgggttg agcataatgc
      841 caatgaaatt tggggctctg tcctcgcggt tatcgcctca gtcatctctg aatcaggaat
      901 cagcgcttct caaattgccg gcatcggcat cacgaaccag cgcgagacga cggttgtgtg
      961 ggataaagat acaggaagtc ctgtctataa tgcaatcgtt tggcagtcca gacagacgtc
     1021 cggcatttgt gaggaacttc gtgaaaaagg atataatgat aaattcagag aaaaaacagg
     1081 gcttttaatc gatccttact tctccggcac gaaggtgaag tggattttag acaatgtgga
     1141 aggcgcaaga gaaaaagcgg aaaaaggcga gctgctgttt ggaacgattg atacgtggct
     1201 catttggaaa atgtcaggcg gaaaagcgca tgtgaccgat tactccaatg cctcaagaac
     1261 actgatgttt aatatttacg atttaaaatg ggacgatcaa ctgctcgaca ttctaggcgt
     1321 accgaaatcc atgctccctg aagtgaagcc gtcctctcat gtgtatgcgg agactgttga
     1381 ttatcacttc ttcggaaaaa atatcccgat tgctggagcg gcaggcgacc agcagtccgc
     1441 attgttcggc caggcatgct ttgaagaagg catggggaaa aacacttacg gcacaggatg
     1501 tttcatgctg atgaataccg gggaaaaagc aattaagtcc gaacatgggc ttttgacaac
     1561 aatcgcttgg ggcattgacg gaaaagtgaa ctatgcgtta gaagggagca tttttgtcgc
     1621 aggctctgcc atccagtggc ttagagacgg tttgagaatg ttccaggatt catcgctaag
     1681 cgaatcttat gcagaaaaag tggattcaac tgacggcgtg tatgttgttc cagcatttgt
     1741 cggactggga acgccttact gggacagcga tgtgcgcggt tcggttttcg gcctgacaag
     1801 agggacaaca aaagagcact ttatccgtgc gacactggag tcattggctt atcagaccaa
     1861 agatgtgctt gacgcaatgg aagcagattc aaacatttca ttaaagacgc tccgtgtaga
     1921 cggaggagct gtaaaaaaca atttcctaat gcagttccaa ggagacctgt tgaatgttcc
     1981 tgtggagcgc ccggaaatta atgaaacgac tgcacttggc gcggcttatt tggcgggtat
     2041 cgctgtggga ttctggaagg accgttctga aatcgcgaac cagtggaatc tggataaacg
     2101 gtttgagcct gaattggaag aagaaaaacg aaatgagctg tataaaggct ggcaaaaagc
     2161 cgtgaaagca gctatggctt ttaaataaag taatactatg gtataatggt tacaagttaa
     2221 taagaacggt cctgagatga ggagagacca cagcaccaaa gtgtaagcat gcactttggc
     2281 tgttgtggtc tctttttcta tttaccgtga caacaaggag gaaacgtaat gatgaatcat
     2341 caattttcaa gtcttgaaag agatcgcatg ctgacagaca tgacgaaaaa aacatatgac
     2401 ctatttatta tcggaggagg aattacagga gccggaacag ctcttgacgc ggcatcaagg
     2461 ggaatgaagg tcgcactaag cgaaatgcag gactttgcgg cgggaacatc aagccgatca
     2521 acgaagctgg tacatggcgg cttgcgctat ttaaaacaat ttgaagtgaa aatggtcgct
     2581 gaggtaggaa aagagcgggc gattgtgtat gaaaacggcc cgcatgtcac tacaccggaa
     2641 tggatgctgc ttccgtttca taaaggcggc acattcggtt catttacaac atctattggg
     2701 ttaagggttt atgacttcct tgcaggtgtg aaaaagtcag aacgaagaag catgctttca
     2761 gcaaaagaaa cgctgcaaaa agagcctttg gtgaaaaaag acggcttaaa aggcggcggg
     2821 tactatgtgg aataccgcac tgacgatgcg agactgacca tcgaagtcat gaaggaagcg
     2881 gttaaattcg gggcagagcc tgtgaattac tccaaagtga aggagcttct ttacgaaaaa
     2941 ggcaaagccg tcggcgtatt aattgaagat gtgctgacaa agaaagaata taaagtgtat
     3001 gcgaaaaaaa ttgtcaatgc tacaggccct tgggtcgatc agctcagaga aaaagaccat
     3061 tcgaaaaacg gaaagcattt gcagcataca aaaggcattc accttgtatt tgaccagtct
     3121 gtctttccgc tgaaacaggc tgtatatttc gatacacctg atggccggat ggtatttgcg
     3181 attcctcgtg aaggcaaaac atacgtggga acaacagaca ctgtttacaa agaggcgctg
     3241 gagcatccgc ggatgacaac ggaggatcgt gattatgtca tcaaatcaat caattacatg
     3301 ttcccggaac tgaatatcac tgcgaatgac atcgaatcca gctgggcggg gctgcgtccg
     3361 ctgattcatg aagaaggcaa agatccttct gaaatttcac ggaaagacga gatttggaca
     3421 tctgactcag gcctgatcac cattgccggc ggaaaactga ctggatacag aaaaatggcg
     3481 gacgacatcg ttgatcttgt ccgtgatcgc ttaaaagaag agggcgaaaa ggattttgga
     3541 ccatgtaaaa cgaaaaacat gccaatctct ggcgggcacg tcggcggttc gaaaaatctt
     3601 atgtccttcg ttaccgcgaa aacaaaagaa ggaattgcag ccggtttatc agaaaaagac
     3661 gcaaaacagc ttgcgatcag atacggctct aacgtagatc gcgtctttga ccgggtagaa
     3721 gcgctgaaag atgaggccgc gaaacgcaac attccggttc atattcttgc tgaggcagaa
     3781 tacagtatag aagaagagat gactgcaacc cctgctgact tctttgtccg cagaacggga
     3841 cgtttatttt ttgatatcaa ttgggtaaga acatataaag atgccgttat tgattttatg
     3901 agcgagcgat tccaatggga tgagcaggcg aaaaacaaac atacagaaaa cctcaacaag
     3961 cttttacacg atgcggtcgt accgcttgag caataaatca taacgggctg tctgcagccc
     4021 gttatttctt tttacgtgcc gaaaggggga gatctcaggt tatcaagtga cagaggcacc
     4081 cgcttcatat aaaataatgg cataagctga tacataggag gacgaatatg acttggagaa
     4141 agagctatga acgctggaaa cagacagaac atttagatct ggaattaaaa gagcgcctta
     4201 ttgaattaga gggagatgaa caggcccttg aggactgttt ctataaagac cttgaattc
//
LOCUS       CLCMTPLAS     379 bp ds-DNA             PLN       21-JUL-1990
DEFINITION  C.purpurea mitochondrial plasmid terminal 5' inverted repeat (TIR).
ACCESSION   M35177
KEYWORDS    .
SOURCE      C.purpurea (strain K) mitochondrial plasmid pClK1 DNA.
  ORGANISM  Claviceps purpurea
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Clavicipitales; Clavicipitaceae.
REFERENCE   1  (bases 1 to 379)
  AUTHORS   Duevell,A., Hessberg-Stutzke,H., Oeser,B.,
            Rogmann-Backwinkel,P. and Tudzynski,P.
  TITLE     Structural and functional analysis of mitochondrial plasmids in
            Claviceps purpurea
  JOURNAL   Mol. Gen. Genet. 214, 128-134 (1988)
  STANDARD  simple staff_review
BASE COUNT       97 a     76 c     64 g    142 t
ORIGIN      
        1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt
       61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact
      121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa
      181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac
      241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta
      301 aactttccgc taaaacccgc acgtcttcgt ctgcatggta tgggcgttag attctattct
      361 acaatgtctt ccgccaaaa
//
LOCUS       CLCMTPLASX    379 bp ds-DNA             PLN       21-JUL-1990
DEFINITION  C.purpurea mitochondrial plasmid 3' terminal inverted repeat (TIR).
ACCESSION   M35178
KEYWORDS    .
SOURCE      C.purpurea (strain K) mitochondrial plasmid pClK1 DNA.
  ORGANISM  Claviceps purpurea
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Clavicipitales; Clavicipitaceae.
REFERENCE   1  (bases 1 to 379)
  AUTHORS   Duevell,A., Hessberg-Stutzke,H., Oeser,B.,
            Rogmann-Backwinkel,P. and Tudzynski,P.
  TITLE     Structural and functional analysis of mitochondrial plasmids in
            Claviceps purpurea
  JOURNAL   Mol. Gen. Genet. 214, 128-134 (1988)
  STANDARD  simple staff_review
BASE COUNT      105 a     75 c     62 g    137 t
ORIGIN      
        1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt
       61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact
      121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa
      181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac
      241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta
      301 aactttccgc taaaacccgc acgtcttact ttagctcaac aaagaaagcg tagtgaactt
      361 aaaaaagcac gtcttgctc
//
LOCUS       DDISP60       328 bp ds-DNA             INV       21-JUL-1990
DEFINITION  D.discoideum spore coat 60 (sp60) gene, 5' flank.
ACCESSION   M34546
KEYWORDS    spore coat 60.
SOURCE      D.discoideum (strain Ax4) DNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 328)
  AUTHORS   Fosnaugh,K., Loomis,K. and Loomis,W.F.
  TITLE     Coordinate regulation of the spore coat genes in Dictyosteiium
            discoideum
  JOURNAL   Dev. Genet. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable  sequence for [1] kindly
            submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT      168 a     15 c     11 g    134 t
ORIGIN      
        1 aaaacataga acaaatttaa ataattattt tttattaaat tgtattattt ttttttatta
       61 tttatttatt ttttattttc aaaaaaaaaa aaaaaaagtt aacagacaaa aagtataatc
      121 tatttaatta ttcaaaaaaa aaaatattaa atcattgtag tattttgttc atattcaaat
      181 taaatatcaa tacaattaaa aaaagttaaa atgtaattat taatatatat atatatttat
      241 caatatataa tttaatataa aaaaaaaaaa gaaatttaat aaacataaat atttataaaa
      301 tcataaattt aaaataaata attttttt
//
LOCUS       DDISP70       563 bp ds-DNA             INV       21-JUL-1990
DEFINITION  D.discoideum spore coat 70 (sp70) gene, 5' flank.
ACCESSION   M34547
KEYWORDS    spore coat 70.
SOURCE      D.discoideum (strain Ax4) DNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 563)
  AUTHORS   Fosnaugh,K., Loomis,K. and Loomis,W.F.
  TITLE     Coordinate regulation of the spore coat genes in Dictyosteiium
            discoideum
  JOURNAL   Dev. Genet. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT      207 a     73 c     47 g    236 t
ORIGIN      
        1 gaagtgtgag ttgcgcagtg tgttagtgtg ggtttaatga gagtgttttt tttttttttt
       61 tttttttttt tgaacgtaca tatttatttg tttttattaa tatttcataa atgaatttgt
      121 aaatataaaa ataaattgtt tttttttatt ttattttttt gattatgaca ataatttcaa
      181 taaaatgtaa aatatgataa aaataatttt aattattgaa acacacacac tcactcactt
      241 tttcccacta gtattacact ttccaatgaa aaataaatta atattgaaaa agaaggaaat
      301 ggaaaaaaat gaaaaataaa aataattaaa taaacaataa tattacaaac acactatttc
      361 tttctcaccc acacaaccaa tatactaaca cttctcatac acccaccaac acacctacac
      421 ccagttcttc acattcatac acaaagttta atattacact ttaaaaaaca ttcatattaa
      481 tgatttttat gatttattta tttattaatt tatttaattg tgtattttga ttattgctct
      541 ctacttttta tatatttttt ttc
//
LOCUS       DDISP96       682 bp ds-DNA             INV       21-JUL-1990
DEFINITION  D.discoideum spore coat 96 (sp96) gene, 5' flank.
ACCESSION   M34548
KEYWORDS    spore coat 96.
SOURCE      D.discoideum (strain Ax4) DNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 682)
  AUTHORS   Fosnaugh,K., Loomis,K. and Loomis,W.F.
  TITLE     Coordinate regulation of the spore coat genes in Dictyosteiium
            discoideum
  JOURNAL   Dev. Genet. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT      321 a     37 c     63 g    261 t
ORIGIN      
        1 tgaaaaaaaa aaaattttgt gtgtgatagt ttgtcaaaaa tcatttgaaa ttaaaatcat
       61 attataatat atattaaaaa aatgttgtat ctttaatttt ttttcatact agtgtgttaa
      121 atgaattaaa gtgtggtaat gaataaatga aaaaaaaaaa aaaattaaag tgagagagtg
      181 tgtaaaagtg gtaatgaatg tgtgtttgag tgtgttaaaa atgtgtttac aaaatataat
      241 aatttaaatt cattcatatt caactatgga gaaagcgtgt tttcaaacac aataaatgtg
      301 taatgaaaaa tacaattaaa taattattta attttttgta tgattaaaaa aaaaagagta
      361 aattagtagt gccaccctaa aacaataata ataataatta aataatatta atatttattt
      421 aattttttta tttaattttt tttcaaaaca attcataata taaatttatt cttttttttt
      481 tttttttttt tttttaaaaa tattcaactc agtaataata ataataaaaa taataacaat
      541 aataaggaat tataatcaaa taataatata gctataatct ataattacta tagtaaaatt
      601 acgtatacat atataaataa ttattattat cattgttttt aaaaataaat aaataaataa
      661 taaaaaaaaa aaaaaaaaaa aa
//
LOCUS       GLAA2GIA     1870 bp ds-DNA             INV       21-JUL-1990
DEFINITION  Giardia lamblia alpha-2 giardin gene, complete cds.
ACCESSION   M34550
KEYWORDS    alpha-2 giardin.
SOURCE      G.lamblia (strain Portland-1) trophozoite DNA, clone pG2.
  ORGANISM  Giardia lamblia
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE   1  (bases 1 to 1213)
  AUTHORS   Alonso,R.A. and Peattie,D.A.
  TITLE     Genomic organization of alpha giardin genes in Giardia lamblia
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1214 to 1870)
  AUTHORS   Alonso,R.A. and Peattie,D.A.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.A.Alonso, 25-MAY-1990.
            
                            Author address: R.A.Alonso
                            Dept of Tropical Public Health
                            Harvard School of Public Health
                            665 Huntington Avenue, Boston, MA 02115
FEATURES       from  to/span     description
    pept        215     1105     alpha-2 giardin
BASE COUNT      466 a    468 c    507 g    429 t
ORIGIN      
        1 atactgcaga tacacctcat ggtgtacatg ctggcggcaa gtatcgtata aactctatgg
       61 agacatgggg cagaccttgt ctcgctgctt tgacgagcga atggccccta ttttgcgcac
      121 gctatggcgc gtcacagatg cataggataa gcgaactcat gatggaaatt caaattacct
      181 taaaatatat tttctgagcg cttacattta gaaaatgccg aagctatccc agatcgtcgc
      241 tgacatgaag caggccatcg atgccaagga cgaggcccag atcgcgttca tagcgtcgga
      301 atactctgca gatgcgcgtc agaggattgc gcaggggtac agggaccaat acggcaagga
      361 gctccccgac gacatcaaga aggcgctcaa gggcggctcc gaggagagcc tcctgatgga
      421 tctcttcagc gatcgtcacg aggtccgcgc ccagcacatc cgcgacgcgc tttccggcaa
      481 gaacgaccac atggccttct tcgacacagt cattctgtgc acccccgaag actggcacga
      541 gaccgtcgcc gcctacacga ggatgttcaa gaagcccctc gtcgaggact tcatgaagga
      601 tgtcggccgc aaggagaatt ggtgtctttt catggagaag tggatggctc acgagcgcac
      661 tagccgtgaa ggcagccctg acgaggaagc agagaaactg aacaaggcgt tcagtgagag
      721 cgaccacgac tacatttcca gcttcatggc tggggtgcct cccgaagaat acaagtccat
      781 caacaccagc ttcaagagcc ttacaggaaa gggcatcgac caggccttcg ccaccatcta
      841 tacggggact gactactact ccctgtactg cgcccacttt gctcttcttg gcatgcacaa
      901 gcttgccgcc tacctggtca actgcgcctg caacgataag ggcgacgaga agcgcatgag
      961 gaggatcacg ggcatgatgg tcgacaagtg cttggctgca aagtatgcct acaagaccta
     1021 cgggagtatg aaagccgacg tcgaaaggtg ctttgataag cgcatggctc cgatcctctg
     1081 taccctctgg cgcctaaggg agtgagcgcc tttactgcgg ggtttccttt cgctagtgaa
     1141 ttgcctagcg ggtgtttact gccttctgtt tctttttgcc ctgtcctggc ggtccaagat
     1201 gctatcctgc gatgagaggt ctttcgagaa cggatgcgag aagtctggga ttgctacatg
     1261 ccgtccacct gactgtgagg gtgagtagga ctggagtagc ggcaaggagt gggagctcat
     1321 gttgtccata gagctgtatg cctgatcaat ctgcccattt cctgcttgtc ctctgcgggt
     1381 tttttgtttc agatctgagc tgcctaaccg agaggggccg tactctatgc ttagttctct
     1441 attaaccaac tgaacagcct ttgctcgctt gcttgccagt gtatatttat atctatattt
     1501 caagaacaga gagaggaccg tgatgaagag gagcaggatg agtgtcgcaa cggaccagaa
     1561 cacgacaata agggctatag acatcttaga atgataacca acaaagctta tgaactcctc
     1621 tccactgaaa gaagaggaaa gataccgagt ctcgtaaaac tgaatggtag tcctatcgat
     1681 ggtgtgcccc cagatagggt ctactatcgg ctggttggta agattgcttg cagcggtatc
     1741 gacagctgcc acaaccacta agtagtcaag gctcgtgctt gtacgaagga aggtcaacgg
     1801 gaatgcgaag catggaccag gtgttatttg cttgctcgat aaactcagga gtatatattt
     1861 cagtattgtc
//
LOCUS       KAEHUTC      1625 bp ds-DNA             BCT       21-JUL-1990
DEFINITION  K.aerogenes histidine utilization repressor C (hutC) gene,
            complete cds.
ACCESSION   M34604
KEYWORDS    histidine utilization repressor C.
SOURCE      K.aerogenes (strain W-70) DNA.
  ORGANISM  Klebsiella aerogenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1625)
  AUTHORS   Schwacha,A. and Bender,R.A.
  TITLE     Nucleotide sequence of the gene encoding the repressor for
            histidine utilization genes of Klebsiella aerogenes
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.A.Bender, 29-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      669     histidine utilization repressor G (AA at 1)
    pept        731     1456     histidine utilization repressor C (hutC)
    pept       1599  >  1625     histidine utilization repressor U
    signal      667      686     hutC promoter
    signal     1522     1493     Pc promoter
    signal     1531     1557     Puh promoter
    binding    1475     1495     CRP binding site
    site         34       61     helix-turn-helix
BASE COUNT      311 a    476 c    508 g    330 t
ORIGIN      
        1 gtcgacggcg agcagttaga agccgcgcat caggcgctgc gcgaggcggt ggccgactgt
       61 cagcgggcgg gtaaacgcac gctggtcctg ggcggcggcc acgagacggc cttcggccac
      121 ggcgctgggg tactggatgc cttcccaggc gaaaaagtgg gcattattaa tctcgatgcg
      181 catctggatc tgcgctttgc cgactgcgcc agctccggga cgccgttccg ccagttggcg
      241 ctggagtgtg acgcgcagca gcgcggtttt cactatacct gcatcggggt tagccgggcg
      301 gcgaacaccc aggcgctatg ggatgaagcg gcgcgccgcc aggtggctat cgttgaagat
      361 ctggaggtgc tgacagcctt cgaaactcgc gtgttgccgg agcttgagcg caatatcgcg
      421 caattcgacc gcctgtatct gactatcgat ctcgacgtgc tgccggcgcg agaaatgccg
      481 gcggtgtcgg ccccggcggc gctgggcgtg ccgctgggca cgctgctgcg catcgttgag
      541 ccgctgtgcc gcagcggtaa gctgcaggcg gtggatctgg tggagtttaa cccgctgttt
      601 gacattgacg gtcagggcgc tcgcgcggcg gcccgtgtgg catggcaaat cgcccattgg
      661 tggcgctagc gatccgcgct attattttcg tttcgccgct ccgcttatta cgcataagga
      721 aagccaggcc atgtttgcac aacaacctcg ttccgcgccc gcgcctttct atgaaaaggt
      781 gaagcaggcg atcagcgaaa agatccacag cggcgtctgg cgaccgcatg accgtatccc
      841 gtcggaagct gagctggtgg cccagttcgg cttcagccgg atgaccatca atcgcgcgct
      901 gcgcgagctg acggacgaag gcctgctggt caggttgcaa ggggtgggaa cctttgtcgc
      961 cgagccgaaa gggcagtcgg cgctgttcga ggtgcgcagt attgccgcag aaattgtcgc
     1021 ccgtcaccat cagcaccgct gtgaggtact cctgcttgag gagacccggg cggatcatat
     1081 tcaggcgacg gcgctcagcg tcccggaagg cacccgcatc tttcactcgc tgatggtgca
     1141 ttacgaaaac gaggtgccgg tgcaaatcga agatcgctgc gtcaatgccg cagtggtacc
     1201 ggactatctg catcaggact acaccgccac cacgccccat gattacctgt cgctgatcgc
     1261 gccgttaacc gaaggtgaac atattgttga agcggtgcag gccacggcgg aagagtgcgc
     1321 gctgctgcat attcacgccc acgatccgtg cctgctgatc cgtcgtcgca cctggtcgac
     1381 aacccacatc gtctcccacg cgcgcctgct gttccccggt agccgatatc gtctgcaggg
     1441 gcgctttggc tcctgatcca cgctggcgtc agcaaaacgt gattgctgac gcaatataac
     1501 aaaattgtat catttctgtt aaatcctggc ttgcgcatgc ttgtatagac aagtatatgt
     1561 atctacgtaa acaacgttta ttgtcaggag aaaccccgat gtcgcaaagc aaatatcgcc
     1621 agctg
//
LOCUS       MUSMHH2KBM     66 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse major histocompatibility complex antigen H-2Kbm29 mRNA.
ACCESSION   M34542
KEYWORDS    major histocompatibility complex antigen.
SOURCE      Mouse ((C3H.CAS3(R4)XB6)F2, mutant #6414(bm29)), liver, cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Horton,R.M., Loveland,B.E., Parwani,A., Pease,L.R. and
            Fischer Lindahl,K.
  TITLE     Immunological characterization and molecular analysis of the
            H-2Kbm29 mutant
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Weiss,E., Golden,L., Zakut,R., Mellor,A., Fahrner,K., Kvist,R. and
            Flavell,R.A.
  TITLE     The DNA sequence of the H-2Kb gene: Evidence for gene conversion as
            a mechanism for the generation of polymorphism in
            histocompatibility antigens
  JOURNAL   EMBO J. 2, 453-462 (1983)
  STANDARD  full automatic
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.M.Horton, 24-MAY-1990.
            
            Mutation in codon 89 changes bp 43 and 44 to "gc", otherwise this
            sequence is identical to reference 2.
FEATURES       from  to/span     description
    pept    <     1  >    66     MHC H-2Kbm29 (AA at 3)
BASE COUNT       14 a     22 c     19 g     11 t
ORIGIN      
        1 cgagtggacc tgaggaccct gctcggctac tacaaccaga gcgcgggcgg ctctcacact
       61 attcag
//
LOCUS       YSCKGD2      2112 bp ds-DNA             PLN       21-JUL-1990
DEFINITION  S.cerevisiae dihydrolipoyl transsuccinylase (KGD2) gene,
            complete cds.
ACCESSION   M34531
KEYWORDS    alphaketogluterate dehydrogenase complex;
            dihydrolipoyl transsuccinylase.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2112)
  AUTHORS   Repetto,B. and Tzagoloff,A.
  TITLE     Structure and regulation of KGD2, the structural gene for yeast
            dihydrolipoyl transsuccinylase
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Tzagoloff, 29-MAY-1990.
FEATURES       from  to/span     description
    pept        560     1987     dihydrolipoyl transsuccinylase
    mRNA        306  >  1987     dihydrolipoyl transsuccinylase mRNA
    mRNA        451  >  1987     dihydrolipoyl transsuccinylase mRNA (alt.)
    signal      196      200     CAAT box
    signal      228      232     CAAT box
BASE COUNT      649 a    435 c    425 g    603 t
ORIGIN      
        1 tcgaacgtct cgtggaaagc cttgaccgtt gcgtctttac tcgtctctat tcaaccctgt
       61 ggtcctatta gaaccgatct ttgttccatc gtcattacta cttcttgtct cttctctctt
      121 ggggaataat aaacaaaaca attccagcca cccgcatctc cgtcaaagac gatattaaca
      181 accctttttt tgcagccaat caaaactaag gtcctgcccg gagtgagcca ataaagcggc
      241 cacaaattac gcaggaatct atggttccta aagcttattc ttattctaca ttgacaccat
      301 tctactacag ggctcggatc cacacttggt ctatcagcaa atgcaaatcc tttttccctc
      361 aactaggcct acttttcata tataatctct ttttgagctt caggggagga aatatctaac
      421 gtaaaaaaaa gaaaaatccg aaataaaaaa ttaggcagct atctttaacg gtatatatca
      481 gtgaatagat atattgttat aaacttcact accacatttg ttacaaccaa agacacaact
      541 tcagataatt atttaaacaa tgctttccag agcgacgcgt actgcagctg ccaaatcctt
      601 agtaaaatct aaagtggcta gaaatgttat ggctgcttct ttcgtcaaga gacatgcttc
      661 tacaagtttg ttcaaacaag ctaacaaggt cgaatcctta ggttcaatat atttatccgg
      721 caagaaaatt tcagttgcgg cgaatccgtt ctccataact agcaatcgtt ttaaatctac
      781 ctctattgaa gttcctccga tggcagagtc cctgactgaa ggctctttaa aggaatatac
      841 taaaaacgtt ggtgatttta ttaaggagga cgagctgttg gccactattg agaccgataa
      901 aattgatatt gaggtcaatt cgccagtatc aggtactgtt acgaagctaa atttcaaacc
      961 agaggacact gtcactgttg gtgaggagtt agctcaggtc gagcctggtg aagcacctgc
     1021 tgagggttct ggagaatcta agccagagcc taccgaacaa gcggagcatc gcaaggtgtc
     1081 gccgcaaggg aaaactcaag tgaggaaacg gcttcaaaga aagaagctgc tccaaagaaa
     1141 gaagccgctc caaagaaaga agttacagaa ccaaaaaagg actgatcaac caaagaagac
     1201 cgtctctaag gcgcaggaac ccccagtagc ctctaactct ttcacaccat ttccacgtac
     1261 agaaaccagg gtcaaaatga accgtatgag attgaggatt gccgaaagat taaaagagtc
     1321 tcaaaacact gctgcttcct taaccacatt caacgaagtt gacatgtcag ctttgatgga
     1381 aatgaggaaa ctgtataaag atgagattat taagaagacc ggtactaaat tcggattcat
     1441 gggtcttttc tccaaagcat gtaccttggc cgccaaggat attccagccg tcaatggtgc
     1501 cattgaaggt gaccagattg tttatcgtga ttacacagat atttctgttg ctgtggccac
     1561 tccaaagggt ttggttaccc ccgtcgttcg taatgcagag tcattgagtg ttttagatat
     1621 tgagaacgaa attgttcgct tgagtcataa agcgcgtgat ggcaaattaa ccctagaaga
     1681 tatgacgggt ggtactttca ccatatctaa tggtggtgtt tttggttcat tatacggtac
     1741 tcctatcatc aattcaccac aaacagccgt cctaggcttg catggtgtca aagagagacc
     1801 tgtcactgtt aatggacaaa ttgtctcaag accaatgatg tacttggctt tgacttatga
     1861 tcatagattg ctagatggtg agaagctgtt atccttcttg aagactgtta aagagttgat
     1921 tgaagaccct agaaaatgtt gttatggtga tttgaaattt gcagcccata ccaatctgat
     1981 ttcatgatta ttcttgtccg ctattactgt gtgaatctct tctatttaag tatgatattt
     2041 aaagatatgc tatgatgata aatggaacta tgaattaatg ttttcttttt ataatgagat
     2101 atggaaaagc gc
//
LOCUS       PHUDLTN       781 bp ss-mRNA            VRT       21-JUL-1990
DEFINITION  P.bicolor [D-ala-2]deltorphin mRNA, complete cds.
ACCESSION   M34560
KEYWORDS    [D-ala-2]deltorphin.
SOURCE      P.bicolor bicolor skin, cDNA to mRNA, clone AD2.
  ORGANISM  Phyllomedusa bicolor
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Neobatrachia; Bufonoidea; Phyllomedusidae.
REFERENCE   1  (bases 1 to 781)
  AUTHORS   Richter,K., Egger,R., Negri,L., Corsi,R., Sevrini,C. and Kreil,G.
  TITLE     cDNA encoding [D-ala-2]deltorphin precursors from skin of
            Phyllomedusa bicolor also contain genetic information for three
            dermorphin-related opoid peptides
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4836-4839 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Kreil, 18-MAY-1990.
FEATURES       from  to/span     description
    pept         30      713     pre pro[D-ala-2]deltorphin
    matp        348      368     [D-ala-2]deltorphin II
    matp        456      476     [D-ala-2]deltorphin I
    matp        564      584     [D-ala-2]deltorphin I
    matp        672      692     [D-ala-2]deltorphin I
    mRNA    <     1      781     [D-ala-2]deltorphin mRNA
BASE COUNT      323 a     69 c    233 g    156 t
ORIGIN      
        1 tcagtacttc ctgaataaca agacccaaca tgtctttctt gaagaaatcg cttctgcttg
       61 tacttttcct tggattagtg tcccattccg tttgtaaaga agagaaaaga gagactgaag
      121 aggagaatga aaatgaggaa gaaaatcatg aagtgggaag tgagatgaag agatatgcgt
      181 tctggtatcc gaatagagac actgaagaga agaatgaaaa tgaggaagaa aatcaggaag
      241 agggaagtga gatgaagaga tatgcgttcg gctatccgaa aagagagcct gaagaggaaa
      301 atgagaatga ggaagaaaat catgaagagg gaagtgagat gaagagatat gcgtttgaag
      361 ttgtgggagg agaagctaag aaaatgaaaa gagaacctga agaggaaaat gagaatgagg
      421 aagaaaatca tgaagaggga agtgagatga agagatatgc gtttgacgtt gtgggaggag
      481 aagctaagaa aatgaaaaga gagcctgaag aggaaaatga gaatgaggaa gaaaatcatg
      541 aagagggaag tgagatgaag agatatgcgt ttgacgttgt gggaggagaa gctaagaaaa
      601 tgaaaagaga gcctgaagag gaaaatgaga atgaggaaga aaatcatgaa gagggaagtg
      661 agatgaagag atatgcgttt gacgttgtgg gaggagaagc taagaaaatg taatatttca
      721 taacttaaag gagcacaatt atcagttata tgccaaacat atattaaatg atagataact
      781 t
//
LOCUS       HSBGPG3A     1829 bp ds-DNA             VRL       21-JUL-1990
DEFINITION  Bovine herpesvirus type 1 glycoprotein gIII gene, complete cds.
ACCESSION   M27491
KEYWORDS    glycoprotein.
SOURCE      Bovine herpesvirus type 1 DNA, clones pDR[7,123] and p7-123.
  ORGANISM  Bovine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae.
REFERENCE   1  (bases 1 to 1829)
  AUTHORS   Fitzpatrick,D.R., Babiuk,L.A. and Zamb,T.J.
  TITLE     Nucleotide sequence of bovine herpesvirus type 1 glycoprotein gIII,
            a structural model for gIII as a new member of the immunoglobulin
            superfamily, and implications for the homologous glycoproteins of
            other herpesviruses
  JOURNAL   Virology 173, 46-57 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.J/Zamb, 31-AUG-1989.
FEATURES       from  to/span     description
    pept        160     1725     glycoprotein gIII precursor
    sigp        178      222     put. glycoprotein gIII signal peptide
    matp        223     1723     glycoprotein gIII
    signal       25       28     CAAT box
    signal       61       64     TATA box
BASE COUNT      247 a    737 c    597 g    248 t
ORIGIN      97 bp upstream of BamHI site; between map units 0.122 and 0.135.
        1 cgcgcctgca gccgcgcgtg tgctcaatcc cggaccacga aagcacaaaa cggacgccct
       61 taaaaatgta gcccgcgccg cggtcgcggc catcttggat ccacccgcgc gcacgaccgc
      121 cgagagaccg ccagcccgag acctcgccgc gcgtccgcca tgggcccgct ggggcgagcg
      181 tggctgatcg cagctatttt cgcctgggcg ctcctgtctg cccggcgggg gctcgccgag
      241 gaggcggaag cctcgccctc gcctccgccc tccccgtgcc caaccgagac ggaaagctcc
      301 gctgggacca ccggcgcaac gccccccacg cccaacagcc ccgacgctac gccagaggac
      361 agcacgcccg gtgctactac gcccgtgggg acgccggagc cgccgtccgt gtccgagcac
      421 gacccgcccg ttaccaacag cacgccgccg cccgccccgc ccgaggacgg gcgacccggc
      481 ggcgctggca acgcgagccg cgatgggcga cctagcggcg gggggcggcc tcgccccccg
      541 cggccgagca aagccccgcc gaaggagcgc aagtggatgc tctgcgagcg cgaggccgtg
      601 gccgcctcgt acgccgagcc gctgtacgtg cactgcggcg tggccgacaa cgccactggc
      661 ggtgcgcgcc tggagctctg gtttcagcgc gtgggcaggt tccgctccac gcgcggcgac
      721 gacgaggccg tgcgcaaccc ctttccgcgg gccccgcccg tgctgctgtt cgtagcccag
      781 aacggctcga tcgcgtaccg tagcgcggag ctgggcgaca actatatttt cccttcgccc
      841 gccgaccccc gcaacttgcc cctgaccgtg cgctccctga cggccgccac cgagggcgtg
      901 tacacttggc gccgcgacat gggcaccaag tcacagcgca aggtcgtgac cgtcacgacg
      961 caccgcgcgc ccgctgtttc cgtcgaaccc cagccagcgc tagaaggcgc cggctacgcg
     1021 gccgtgtgcc gcgccgccga gtactacccg ccgcgctcca cgcgcctgca ctggttccgc
     1081 aacggctacc ccgtggaggc tcggcacgcg cgcgacgtct ttacggtcga cgactccggg
     1141 ctcttttcgc gcacgtccgt ccttacgctc gaggacgcga cgccaaccgc ccacccgccc
     1201 aacctgcgct gcgacgtctc ctggttccag agcgctaaca tggagcgccg cttttacgcg
     1261 gctggcacgc cggccgttta ccgcccgccc gagctgcgcg tgtacttcga gggcggcgag
     1321 gccgtctgcg aggcgcgctg cgtccccgag gggcgcgtct ccctgcgctg gacggtgcgc
     1381 gacggcatcg ccccgtcgcg cactgagcag accggcgtct gcgccgagcg gcccgggctg
     1441 gtaaacctgc gcggcgtgcg cctgctttct acaaccgacg ggcccgtcga ctacacctgc
     1501 accgccactg gctacccggc accgctgccc gagttctccg cgaccgccac gtacgacgcc
     1561 tcgcccggcc taatcggaag ccccgtcctc gtcagcgtcg tggccgtcgc ctgcggtctc
     1621 ggcgccgtgg ggctcctgct ggtggcggcc tcgtgcctgc ggcgcaaggc ccgggtaatc
     1681 caacccggtc ttactcgcgc tcgcgccctc ggctccgcgc cctagacgac cggcacggcc
     1741 tggaggcgct ggcggctgcc ggtgccgctc acaccgcgcg ccacaaccgc gacgtgtggc
     1801 agcgcttttc ccgcgtctgc gaggccggc
//
LOCUS       DOGCHAMC      933 bp ss-mRNA            MAM       21-JUL-1990
DEFINITION  Dog mast cell chymase mRNA, complete cds.
ACCESSION   J02904
KEYWORDS    chymase.
SOURCE      Dog skin mastocytoma cell line BR, cDNA to mRNA.
  ORGANISM  Canis sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 933)
  AUTHORS   Caughey,G.H., Raymond,W.W. and Vanderslice,P.
  TITLE     Dog mast cell chymase: Molecular cloning and characterization
  JOURNAL   Biochemistry 29, 5166-5171 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.W.Raymond, 17-MAY-1990.
FEATURES       from  to/span     description
    pept         10      759     mast cell chymase precursor
    sigp         10       72     mast cell chymase signal peptide
    matp         73      756     mast cell chymase
    mRNA    <     1      933     mast cell chymase mRNA
    signal      916      921     polyA signal
BASE COUNT      233 a    272 c    229 g    199 t
ORIGIN      
        1 cctgagaaga tgcattgtct tcctctcacc ctgctgctcc ttctcctatg ttccagagca
       61 gaagctgagg agatcatcgg gggcacagag tccaagccac actcccgccc ttacatggcc
      121 cacctggaaa tcctcactct ccggaatcac ctggcgtctt gtggtggttt cctgataagg
      181 cggaacttcg tactgacagc tgctcactgt gcaggaaggt tcataatggt caccctggga
      241 gcccataaca tacaaaagaa agaagacacg tggcagaagc ttgaggtcat aaaacaattc
      301 cctcacccaa aatatgatga ccttactctt cgccatgaca tcatgctact gaagttgaag
      361 gagaaggcca acctgaccct ggccgtgggg accctccccc tctcacccca gttcaacttc
      421 gtcccacctg ggagaatgtg ccgggtggct ggctggggaa aaagacaagt caatggatca
      481 ggctctgaca ctctgcaaga ggtgaagctg aggctcatgg acccccaggc ctgtagacac
      541 tacatggctt ttgaccacaa cctccagctg tgtgttggca atcccaggaa gacaaaatct
      601 gcatttaagg gagattcagg gggccctctt ctgtgtgctg gggtagccca gggaattgtg
      661 tcctatgggc agaatgatgc aaagccccct gctgtcttca cccgaatctc ccactaccgg
      721 ccctggatca ataaggttct gaagcagaat aaagcctgaa tggagcctgg gcaagcctga
      781 ggggaaatct ggaaccagac ctgagcaggc tctctgccac ttactctgga gctgcctcta
      841 gtctctactg aggccccact acatccctca gacccaagag ggctcctcca ggtcacagag
      901 tcctcagtaa acctcaataa agatccagct tcc
//
LOCUS       MUSIGHACQ     340 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION   M34732
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H210-56, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 340)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   340     Ig H-chain VDJ-region (AA at 1)
    recomb      291      292     V-region end/D-region start
    recomb      298      299     D-region end/J-region start
BASE COUNT       80 a     79 c     93 g     88 t
ORIGIN      
        1 gtggagtctg gaggaggctt ggtacagcct gggggttctc tgagactctc ctgtgcaact
       61 tctgggttca ccttcactga ttactacatg agctgggtcc gccagcctcc aggaaaggca
      121 cttgagtggt tgggttttat tagaaacaaa gctaatggtt acacaacaga gtacagtgca
      181 tctgtgaagg gtcggttcac catctccaga gataattccc aaagcatcct ctatcttcaa
      241 atgaacaccc tgagagctga ggacagtgcc acttattact gtgcaagaga tagcggcggg
      301 tttgcttact ggggccaagg gactctggtc actgtctctg
//
LOCUS       MUSIGHACR     286 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION   M34733
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-17, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 286)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   286     Ig H-chain VDJ-region (AA at 1)
    recomb      234      235     V-region end/D-region start
    recomb      238      239     D-region end/J-region start
BASE COUNT       74 a     70 c     68 g     74 t
ORIGIN      
        1 acttctgggt tcaccttcac tgattactac atgagctggg tccgccagcc tccaggaaag
       61 gcacttgagt ggttgggttt tattagaaac aaagctaatg gttacacaac agagtacagt
      121 gcatctgtga agggtcggtt caccatctcc agagataatt cccaaagcat cctctatctt
      181 caaatgaaca ccctgagagc tgaggacagt gccacttatt actgtgcaag agatcaacgc
      241 ctggtttgct tactggggcc aagggactct ggtcactgtc tctgca
//
LOCUS       MUSIGHACS     354 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION   M34734
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H36-12, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   354     Ig H-chain VDJ-region (AA at 1)
    recomb      303      304     V-region end/D-region start
    recomb      311      312     D-region end/J-region start
BASE COUNT       85 a     79 c     93 g     90 t      7 others
ORIGIN      
        1 gaggtgaagt tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagactc
       61 tcctgtgcaa cctctggttt taccttcacg aattactaca tgnnnnnnnt ccgccagcct
      121 ccaggaaagg cacttgagtg gttgggtgtt attagaaaca aagctaatgg tgacacaaca
      181 gagtataatg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc
      241 ctctatcttc aaatgaacac cctgagagct gaggacagtg ccacttatta ctgtgcaaga
      301 gatagcggtc cgtttgctta ctggggccaa gggactctgg tcactgtctc tgca
//
LOCUS       MUSIGHACT     312 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION   M34735
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-3, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig H-chain VDJ-region (AA at 2)
    recomb      262      263     V-region end/D-region start
    recomb      268      269     D-region end/J-region start
BASE COUNT       95 a     69 c     73 g     75 t
ORIGIN      
        1 gcctaaaggg tcattgaaac tctcatgtgc agcctctgga ttcaccttca ataccaatgc
       61 catgaactgg gtccgccagg ctccaggaaa gggtttggaa tgggttgctc gcataagaag
      121 taaaagtaat aattatgcaa catattatgc cgattcagtg aaagacaggt tcaccatctc
      181 cagagatgat tcacaaagca tgctctatct gcaaatgaac aacttgaaaa ctgaggacac
      241 agccatgtat tactgtgtga gagcggcgta ctatgctatg gactactggg gtcaaggaac
      301 ctcagtcacc gt
//
LOCUS       MUSIGHACU     353 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION   M34736
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H230-7, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 353)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   353     Ig H-chain VDJ-region (AA at 1)
    recomb      294      295     V-region end/D-region start
    recomb      308      309     D-region end/J-region start
BASE COUNT       87 a     83 c    100 g     83 t
ORIGIN      
        1 gaagtgcagc tggtggagtc tgggggaggc ttagtgaagc ctggagggtc cctgaaactc
       61 tcctgtgcag cctctggatt cactttcagt gactattaca tgtattgggt tcgccagact
      121 ccggaaaaga ggctggagtg ggtcgcaacc attagtgatg gtggtagtta cacctactat
      181 ccagacagtg tgaaggggcg attcaccatc tccagagaca atgccaagaa caacctgtac
      241 ctgcaaatga gcagtctgaa gtctgaggac acagccatgt attactgtgc aaggtacggg
      301 acccttattg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctc
//
LOCUS       MUSIGHACV     351 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION   M34737
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H35-D1, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 351)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   351     Ig H-chain VDJ-region (AA at 1)
    recomb      294      295     V-region end/D-region start
    recomb      300      301     D-region end/J-region start
BASE COUNT       89 a     82 c     99 g     81 t
ORIGIN      
        1 gaagtgcagc tggtggagtc tgggggaggc ttagtgaagc ctggagggtc cctgaaactc
       61 tcctgtgcag cctctggatt cactttcagt gactattaca tgtattgggt tcgccagact
      121 ccggaaaaga ggctggagtg ggtcgcaacc attagtgatg gtggtagtta cacctactat
      181 ccagacagtg tgaaggggcg attcaccatc tccagagaca atgccaagaa caacctgtac
      241 ctgcaaatga gcagtctgaa gtctgaggac acagccatgt attactgtgc aaggtattac
      301 tacggtagga ttgactactg gggccaaggc accactctca cagtctccgc a
//
LOCUS       MUSIGHACW     348 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA VDJ-region, partial cds.
ACCESSION   M34738
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma L11-1A1, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   348     Ig H-chain VDJ-region (AA at 1)
    recomb      291      292     V-region end/D-region start
    recomb      309      310     D-region end/J-region start
BASE COUNT       84 a     97 c     80 g     87 t
ORIGIN      
        1 gatgtgcagc ttcaggagtc gggacctggc ctggtgaaac cttctcagtc tctgtccctc
       61 acctgcactg tcactggcta ctcaatcacc agtgattatg cctggaactg gatccggcag
      121 tttccaggaa acaagctgga gtggatgggc tacataaact acagtggtta cactacctac
      181 aacccatctc tcaaaagtcg aatctctatc actcgagaca catccaagga ccagttcttc
      241 ctgcagttga attctgtgac tactgaggac acagccacat attactgtgc attactacgg
      301 ctacgtgcgg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS       MUSIGKACN     341 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34739
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H144-17B7,
            cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 341)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   341     Ig kappa-chain VJ-region (AA at 1)
    recomb      302      303     V-region end/J-region start
BASE COUNT       92 a     83 c     87 g     79 t
ORIGIN      
        1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
       61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
      121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
      181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
      241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
      301 ccgctcacgt tcggtgctgg gaccaagctg gagctgaaac g
//
LOCUS       MUSIGKACO     341 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34740
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H230-7, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 341)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   341     Ig kappa-chain VJ-region (AA at 1)
    recomb      302      303     V-region end/J-region start
BASE COUNT       92 a     82 c     87 g     79 t      1 others
ORIGIN      
        1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
       61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
      121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
      181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
      241 atnagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
      301 ccgctcacgt tcggtgctgg gaccaagctg gagctgaaac g
//
LOCUS       MUSIGKACP     338 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34741
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H210-56, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 338)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   338     Ig kappa-chain VJ-region (AA at 1)
    recomb      302      303     V-region end/J-region start
BASE COUNT       91 a     82 c     86 g     79 t
ORIGIN      
        1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
       61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
      121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
      181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
      241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
      301 ccgctcacgt tcggtgctgg gaccaagctg gagctgaa
//
LOCUS       MUSIGKACQ     341 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34742
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-17, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 341)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   341     Ig kappa-chain VJ-region (AA at 1)
    recomb      303      304     V-region end/J-region start
BASE COUNT       92 a     83 c     86 g     80 t
ORIGIN      
        1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
       61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
      121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
      181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
      241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
      301 cctctcacgt tcggtgctgg gaccaagctg gagctgaaac g
//
LOCUS       MUSIGKACR     341 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34743
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-3, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 341)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   341     Ig kappa-chain VJ-region (AA at 1)
    recomb      304      305     V-region end/J-region start
BASE COUNT       96 a     81 c     84 g     80 t
ORIGIN      
        1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
       61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
      121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
      181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
      241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat
      301 cctctcacgt tcggctcggg gacaaagttg gaaataaaac g
//
LOCUS       MUSIGKACS     335 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34744
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H35-D1, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 335)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   335     Ig kappa-chain VJ-region (AA at 1)
    recomb      302      303     V-region end/J-region start
BASE COUNT       89 a     82 c     84 g     79 t      1 others
ORIGIN      
        1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact
       61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc
      121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg
      181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
      241 atcagcagtg tgcaggctga agacctngca gtttattact gtcagaatga ttatagttat
      301 ccgctcacgt tcggtgctgg gaccaagctg gagct
//
LOCUS       MUSIGKACT     333 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34745
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma L11-1A1, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 333)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   333     Ig kappa-chain VJ-region (AA at 1)
    recomb      296      297     V-region end/J-region start
BASE COUNT       90 a     80 c     86 g     77 t
ORIGIN      
        1 gtgatgacac agtctccatc ctccctgact gtgacagcag gagagaaggt cactatgagc
       61 tgcaagtcca gtcagagtct gttaaacagt ggaaatcaaa agaagtactt gacctggtac
      121 cagcagaaac cagggcagcc tcctaaactg ttgatctact gggcatccac tagggaatct
      181 ggggtccctg atcgcttcac aggcagtgga tctggaacag atttcactct caccatcagc
      241 agtgtgcagg ctgaagacct ggcagtttat tactgtcaga atgattatag ttatccgctc
      301 acgttcggtg ctgggaccaa gctggagctg aaa
//
LOCUS       MUSIGKACU     339 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34746
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma IC5-2A6, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   339     Ig kappa-chain VJ-region (AA at 1)
    recomb      303      304     V-region end/J-region start
BASE COUNT       92 a     80 c     84 g     83 t
ORIGIN      
        1 gacattgtga tgacacagtc tccatcttcc ctgactgtga cagcaggaga gaaggtcact
       61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa cttcttgacc
      121 tggtaccagc agaaaccagg acagtctcct aaactgttga tctactgggc atccactagg
      181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
      241 atcagcagtg tgcaggctga agacgtggca gtttattact gtcagaatga ttatacttat
      301 cctctcacgt tcggtgctgg gaccaagctg gagctgaaa
//
LOCUS       MUSIGKACV     318 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34747
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H36-8, cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   318     Ig kappa-chain VJ-region (AA at 2)
    recomb      283      284     V-region end/J-region start
BASE COUNT       85 a     79 c     77 g     77 t
ORIGIN      
        1 tccatcctcc ctgactgtga cagcaggaga gaaggtcact atgagttgca agtccagtca
       61 gagtctctta aacagtggaa ctcaaaagaa ctacttgacc tggtaccagc agaaaccagg
      121 gcagcctcct aaactattga tctactgggc atccactagg gaatctgggg tccctgatcg
      181 cttcacaggc agtggatctg gaacagattt cactctcacc atcatcagtg tgcagactga
      241 agacctggca gtttatttct gtcagaatga ttatagttat ccgctcacgt tcggtgctgg
      301 gaccaagctg gagctgaa
//
LOCUS       MUSIGKACW     246 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34748
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H36-12, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 246)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   246     Ig kappa-chain VJ-region (AA at 2)
    recomb      211      212     V-region end/J-region start
BASE COUNT       64 a     64 c     59 g     59 t
ORIGIN      
        1 ccgtggaaat caaaagaact acttgacctg gtaccagcag aaaccagggc agcctcctaa
       61 actcttaatc tactgggcat ccactaggga atctggggtc cctgatcgct tcacaggcag
      121 tggatctgga acagatttca ctctcaccat cagcagtgtg cgggctgaag acctggcagt
      181 ttattactgt cagaatgatt atcattatcc gctcactttc ggtgctggga ccaagctgga
      241 cctgaa
//
LOCUS       MUSIGKACX     176 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds.
ACCESSION   M34749
KEYWORDS    immunoglobulin light chain; joining exon; kappa-chain;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) adult spleen anti-HA hybridoma H37-41, cDNA
            to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 176)
  AUTHORS   Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and
            Weigert,M.G.
  TITLE     V region gene usage and somatic mutation in the primary and
            secondary responses to influenza virus hemagglutinin
  JOURNAL   J. Immunol. 144, 2795-2801 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   176     Ig kappa-chain VJ-region (AA at 1)
    recomb      138      139     V-region end/J-region start
BASE COUNT       41 a     45 c     47 g     43 t
ORIGIN      
        1 tgggcatcca ctagggaatc tggggtccct gatcgcttca caggcagtgg atctggaaca
       61 gatttcactc tcaccatcac caatgtgcag gctgaagacc tggcagttta ttactgtcag
      121 aatgattaca cttatccgct cacgttcggt gctgggacca agctggagct gagacg
//
LOCUS       MUSMHH2A      876 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse MHC class I H-2K-kml mRNA (H-2K-kml haplotype), partial cds.
ACCESSION   M34932
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Mouse (strain CBA.M523) lymphoma cell line RDM-4, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 876)
  AUTHORS   Martinko,J.M., Solheim,J.C. and Geliebter,J.
  TITLE     The H-2K-km1 mutation: A single nucleotide substitution is
            responsible for multiple functional differences in a class I MHC
            molecule
  JOURNAL   Mol. Immunol. 25, 267-274 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   876     MHC H-2K-kml mRNA (AA at 1)
BASE COUNT      180 a    255 c    286 g    155 t
ORIGIN      
        1 ggcccacatt cgctgaggta tttccacacc gccgtgtccc ggcccggcct cgggaagccc
       61 cggttcatct ctgtcggcta cgtggacgac acgcagttcg tgcgcttcga cagcgacgcg
      121 gagaatccga ggtatgagcc gcgggtgcgg tggatggagc aggtggagcc cgagtattgg
      181 gagcggaaca cgcagatcgc caagggcaat gagcagattt tccgagtgaa cctgaggacc
      241 gcgctgcgct actacaacca gagcgcgggc ggctctcaca cgttccaacg gatgtacggc
      301 tgtgaggtgg ggtcggactg gcgcctcctc cgcgggtacg agcagtacgc atacgacggc
      361 tgcgattaca tcgccctgaa cgaagacctg aaaacgtgga cggcggccga catggcggcg
      421 ctgatcacca aacacaagtg ggagcaggct ggtgctgcag agagagaccg ggcctacctg
      481 gagggcacgt gcgtggagtg gctccgcaga tacctgcagc tcgggaacgc gacgctgccg
      541 cgcacagatt ccccaaaggc ccatgtgacc cgtcacagca gacctgaaga taaagtcacc
      601 ctgaggtgct gggccctggg cttctaccct gctgacatca ccctgacctg gcagttgaat
      661 ggggaggagc tgacccagga catggagctt gtggagacca ggcctgcagg ggatggaacc
      721 ttccagaagt gggcatctgt ggtggtgcct cttgggaagg agcagtatta cacatgccat
      781 gtgtaccatc aggggctgcc tgagcccctc accctgagat gggagcctcc tccatccact
      841 gtctccaaca cggtaatcat tgctgttctg gttgtc
//
LOCUS       PEALEGAB      137 bp ss-mRNA            PLN       21-JUL-1990
DEFINITION  P.sativum legumin mRNA, 3' flank.
ACCESSION   M34908
KEYWORDS    legumin; storage protein.
SOURCE      P.sativum, cDNA to mRNA, clones pDUB[1,3].
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 137)
  AUTHORS   Lycett,G.W., Delauney,A.J. and Croy,R.R.D.
  TITLE     Are plant genes different?
  JOURNAL   FEBS Lett. 153, 43-46 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA    <     1      137     legumin mRNA
BASE COUNT       53 a     20 c     24 g     40 t
ORIGIN      
        1 tagatttcgc accaaatcaa tgaaagtaat gaataagaaa actaaggctt agatgccttt
       61 gttacttgtg taaaataact cgagtcatgt acctttttgc ggaaacagaa taaataaaag
      121 gtaaaatttc agtgctc
//
LOCUS       PEAVICA        81 bp ss-mRNA            PLN       21-JUL-1990
DEFINITION  P.sativum vicilin mRNA, 3' flank.
ACCESSION   M34909
KEYWORDS    storage protein; vicilin.
SOURCE      P.sativum, cDNA to mRNA, clone pDUB2.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 81)
  AUTHORS   Lycett,G.W., Delauney,A.J. and Croy,R.R.D.
  TITLE     Are plant genes different?
  JOURNAL   FEBS Lett. 153, 43-46 (1983)
  STANDARD  simple staff_review
BASE COUNT       32 a     11 c     13 g     25 t
ORIGIN      
        1 taatgagaga tcaaatattt tgcatgtatg ctataaagaa ctatagctca taatgagcaa
       61 ggaataaaac atcgttctct t
//
LOCUS       ECOALKPH       52 bp ds-DNA             SYN       21-JUL-1990
DEFINITION  Synthetic E.coli alkaline phosphatase gene, partial cds.
ACCESSION   M34498
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli (strain AW1043) DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 52)
  AUTHORS   Chou,M.M. and Kendall,D.A.
  TITLE     Polymeric sequences reveal a functional interrelationship between
            hydrophobicity and length of signal peptides
  JOURNAL   J. Biol. Chem. 265, 2873-2880 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >    52     synthetic alkaline phosphatase precursor (AA
                                 at 1)
    sigp    <     1  >    52     synthetic alkaline phosphatase precursor
                                 signal peptide
BASE COUNT        9 a     15 c     10 g     18 t
ORIGIN      
        1 tcgactctgc ttctgctgct cttgctgtta ctgttaaccc ctgtgacaaa ag
//
LOCUS       ECOALKPHX      52 bp ds-DNA             SYN       21-JUL-1990
DEFINITION  Synthetic E.coli alkaline phosphatase gene, partial cds.
ACCESSION   M34499
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli (strain AW1043) DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 52)
  AUTHORS   Chou,M.M. and Kendall,D.A.
  TITLE     Polymeric sequences reveal a functional interrelationship between
            hydrophobicity and length of signal peptides
  JOURNAL   J. Biol. Chem. 265, 2873-2880 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >    52     synthetic alkaline phosphatase precursor (AA
                                 at 1)
    sigp    <     1  >    52     synthetic alkaline phosphatase precursor
                                 signal peptide
BASE COUNT        8 a     10 c     17 g     17 t
ORIGIN      
        1 tcgactgtcg tggttgtcgt agttgtggtg gtcgttaccc ctgtgacaaa ag
//
LOCUS       ECOALKPHXX     52 bp ds-DNA             SYN       21-JUL-1990
DEFINITION  Synthetic E.coli alkaline phosphatase gene, partial cds.
ACCESSION   M34500
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli (strain AW1043) DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 52)
  AUTHORS   Chou,M.M. and Kendall,D.A.
  TITLE     Polymeric sequences reveal a functional interrelationship between
            hydrophobicity and length of signal peptides
  JOURNAL   J. Biol. Chem. 265, 2873-2880 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >    52     synthetic alkaline phosphatase precursor (AA
                                 at 1)
    sigp    <     1  >    52     synthetic alkaline phosphatase precursor
                                 signal peptide
BASE COUNT       10 a     18 c     17 g      7 t
ORIGIN      
        1 tcgactgcag cagctgcagc tgcggctgcc gcggcgaccc ctgtgacaaa ag
//
LOCUS       ECOALKPHY      52 bp ds-DNA             SYN       21-JUL-1990
DEFINITION  Synthetic E.coli alkaline phosphatase gene, partial cds.
ACCESSION   M34501
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli (strain AW1043) DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 52)
  AUTHORS   Chou,M.M. and Kendall,D.A.
  TITLE     Polymeric sequences reveal a functional interrelationship between
            hydrophobicity and length of signal peptides
  JOURNAL   J. Biol. Chem. 265, 2873-2880 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >    52     synthetic alkaline phosphatase precursor (AA
                                 at 1)
    sigp    <     1  >    52     synthetic alkaline phosphatase precursor
                                 signal peptide
BASE COUNT        8 a     11 c     25 g      8 t
ORIGIN      
        1 tcgactggcg gtgggggcgg cggaggtggt ggcggtaccc ctgtgacaaa ag
//
LOCUS       CPCURA5A     2029 bp ds-DNA             BCT       21-JUL-1990
DEFINITION  C.neoformans orotidine monophosphate pyrophosphorylase (URA5) gene,
            complete cds.
ACCESSION   M34606
KEYWORDS    orotidine monophosphate pyrophosphorylase.
SOURCE      C.neoformans (strain B-3501) asexual phase cell line B-3501 DNA,
            clone pURA5g2.
  ORGANISM  Cryptococcus neoformans
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina; Filobasidiales.
REFERENCE   1  (bases 1 to 2029)
  AUTHORS   Edman,J.C. and Kwon-Chung,K.J.
  TITLE     Isolation of the URA5 gene from Cryptococcus neoformans var
            neoformans and its use as a selective marker for transformation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Edman, 29-MAY-1990.
            
              Author address: J.C.Edman
              University of California San Francisco
              Hormone Research Institute
              San Francisco, CA 94143-0534
              email: edman@socrates.ucsf.edu
            
FEATURES       from  to/span     description
    pept        714      814     orotidine monophosphate pyrophosphorylase
                                 (URA5), exon 1 (E.C. 2.4.2.10)
                867     1095     orotidine monophosphate pyrophosphorylase,
                                 exon 2
               1145     1492     orotidine monophosphate pyrophosphorylase,
                                 exon 3
    pre-msg <   714     1527     URA5 mRNA and introns
    IVS         815      866     URA5 intron A
    IVS        1096     1144     URA5 intron B
BASE COUNT      512 a    515 c    479 g    523 t
ORIGIN      
        1 gatcttgggg atggtattga agacgatcca gaaaaagtct ctccatccct cccacgttcg
       61 ctgttcggtt tctttgcaaa gagcgaagtt gctcgacctt tgacagaccc taaaatctgg
      121 tcgtattttg cgttcaaaga tcgcttactg cgaccgcgat tttttagagg tggatttttt
      181 gttgcgagac tagcgcatcg gataggcatg ggtgatatac gtttgcgttc cttatgaggg
      241 gtatcatcag tggcaaatta ccgactagat gaagaagacg acctacaccc cgctaaccgg
      301 tgtccgatcg acatgatcca cggtgtccaa agcgaagaag gacatgatga ttggagttcg
      361 cttacctttg gtataaacca gtgttgtttc tgaagaggtg aggaagaaaa tgatctaggg
      421 caataacaga aatcgtttga agaagaagta aaagaaaaga cacacgtcag acaagcatca
      481 tcatcattca ttttgaatac tcatcacggc accttcttcc ttgcatgcgc attattcatg
      541 atgagatgct cgcagccgcc ggatttcggc tgcaatcaag aacgatatcg cgtacggcga
      601 tgtccgcctc cactccatct tccaacaacc accacagtgc ccgataacac tacgacgaaa
      661 attgttcttt tttggatctt gtcgacaacg gcgaccaaaa caagaacttg atcatgtcct
      721 cccaagccct cgactccgcc aaagttgcct tcatcgaggc tgccatcgaa catggcgtgc
      781 ttcttttcgg caactttacc ttgaagtccg gccggtgagc catattgcag cgcttcacag
      841 tccaatcgaa tctgacatgt gttcagccaa tccccttact tcttcaatgc cggtctcctt
      901 tactcttcat cgcttctctc aactaccgct caggcttacg ccaaggtact ttcctcttct
      961 aggattcctg actttgacgt cctcttcggc ccagcttaca agggtatctc cttggctgct
     1021 gtctccgctg taagccttta tcagcaaacc ggcaaagata tcggctactg ctacaacagg
     1081 aaggagaaga aggacgtgag tctgtcctaa ccagtgcgac agcgatgagc tcataagcca
     1141 gtagcacggt gagggcggta ctatggtcgg tgcgcctctc aagggacgaa tcgtcatcat
     1201 cgacgatgtt ctcacctctg gcaaggccat ccgtgaagct attgacattc tcaaggcctc
     1261 ccctgaagcg aagcttgtcg gaattgtcca gcttgtcgac agacaagaga aaggccagag
     1321 cggtagcggc aagagtaccg tacaggaggt tgaggaagag ttcggtgtgc ctgtcgagcc
     1381 tattattggt ttggacgaca ttgtgaagta cttagaaagc tccggcaagt gggaaaagga
     1441 gctgcaagag gtcaggaagt acagggcgga gtacggtgtt cagaggtctt aagggttttc
     1501 ttcttaaatg cacgggttta ggtctagcta atcaagttcc gacatattac aagtttgtaa
     1561 gcttgtatca aaggaactta agtacaggca ggcgtcctga ggcgacaaag gaagctgtaa
     1621 tatgattgtt ggctgtcaat cttcatcgta tctactttgt caatactgac ttcaatgacc
     1681 caataataca attttattag tgttgaccca gaatggttag caggaaactc cccttctctt
     1741 cctctcaatc ccaatcatac ttcatatctc ctgctccccc catttccgtc ttcctcgatg
     1801 actccctggt cccatccctc ccacctcctg gaggcaagct ggagcacctg gacctaatgg
     1861 gtcgtcgccc caaattgcct ccacctctta agacaatcat cgtccaatca aactctgaac
     1921 tatcttccaa gccaatggcc ggatctggac acaagatgat gtcgaagccg cttgatgtgc
     1981 cagggttggt ccgtggcctg gagacgtaag agcgggtagt actgggatc
//
LOCUS       ECOCHEA      2190 bp ds-DNA             BCT       21-JUL-1990
DEFINITION  E.coli cheA protein (complete cds), motB protein (3' end),
            and cheW protein (5' end) genes.
ACCESSION   M34669
KEYWORDS    cheA gene; cheW gene; motB gene.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2190)
  AUTHORS   Kofoid,E.C. and Parkinson,J.S.
  TITLE     Tandem translational starts in the cheA locus of E.coli
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.C.Kofoid, 01-JUN-1990.
            
              Author address: E.C.Kofoid
              University of Utah
              Dept of Biology
              Salt Lake City, UT 84112
              email: kofoid@bioscience.utah.edu
FEATURES       from  to/span     description
    pept    <     1      104     MotB protein
    pept        109     2073     CheA protein (alt.)
    pept        400     2073     CheA protein (alt.)
    pept       2094  >  2190     CheW protein
BASE COUNT      575 a    546 c    609 g    460 t
ORIGIN      42 min on K12 map.
        1 atgaaaacgc cgaaagccag aatgagccag taagcgccct ggaaaaacct gaggttgcac
       61 cacaggtcag tgttcccaca atgccatcag ccgaaccgag gtgacagcgt gagcatggat
      121 ataagcgatt tttatcagac attttttgat gaagcggacg aactgttggc tgacatggag
      181 cagcatttgc tggttttgca gccggaagcg ccagatgccg aacaattgaa tgccatcttt
      241 cgggctgccc actcgatcaa aggaggggca ggaacttttg gcttcagcgt tttgcaggaa
      301 accacgcatc tgatggaaaa cctgctcgat gaagccagac gaggtgagat gcaactcaac
      361 accgacatta tcaatctgtt tttggaaacg aaggacatca tgcaagaaca gctcgacgct
      421 tataaacagt cgcaagagcc ggatgccgcc agcttcgatt atatctgcca ggccttgcgt
      481 caactggcat tagaagcgaa aggcgaaacg ccatccgcag tgacccgatt aagtgtggtt
      541 gccaaaagtg aaccgcaaga tgagcagagt cgcagtcagt cgccgcgacg aattatcctt
      601 tcgccgctga aggccgggga agtcgacctg ctggaagaag aactgggaca tctgacaacg
      661 ttaactgacg tggtgaaagg ggcggattcg ctctcggcaa tattaccggg cgacatcgcc
      721 gaagatgaca tcacagcggt actctgtttt gtgattgaag ccgatcagat tacctttgaa
      781 acagtagaag tctcgccaaa aatatccacc ccaccagtgc ttaaactggc agccgaacaa
      841 gcgccaaccg gccgcgtgga gcgggaaaaa acgacgcgca gcaatgaatc caccagcatc
      901 cgtgtagcgg tagaaaaggt tgatcaatta attaacctcg tcggcgagct ggttatcacc
      961 cagtccatgc ttgcccagcg ttccagcgaa ctggacccgg ttaatcatgg tgatttgata
     1021 accagcatgg ggcagttaca acgtaacgcc cgtgatttgc aggaatcagt gatgtcgatt
     1081 cgcatgatgc cgatggaata tgtttttagt cgctatcccc ggctggtgcg tgatctggcg
     1141 ggaaaactcg gcaagcaggt agaactgacg ctggtgggca gttctactga actcgacaaa
     1201 agcctgatag aacgcattat cgacccgctg acccacctgg tacgcaatag cctcgatcac
     1261 ggtattgaac tgccagaaaa acggctcgcc gcaggtaaaa acagcgtcgg aaatttaatt
     1321 ctgtctgccg aacatcaggg cggcaacatt tgcattgaag tgaccgacga tggggcgggg
     1381 ctaaaccgtg agcgaattct ggcaaaagcg gcctcgcaag gtttgactgt cagcgaaaac
     1441 atgagcgacg acgaagtcgc gatgctgata tttgcacctg gcttctccac ggcagagcag
     1501 gtcaccgacg tctccgggcg cggcgtcggc atggacgtcg ttaaacgtaa tatccagaag
     1561 atgggcggtc atgtcgaaat ccagtcgaag cagggtactg gcactacgat ccgcatttta
     1621 ctgccgctga cgctggccat cctcgacggc atgtccgtac gcgttgcgga tgaagttttc
     1681 attctgccgc tgaatgctgt tatggaatca ctgcaacccc gtgaagccga tctccatcca
     1741 ctggccggcg gcgagcgggt gctggaagtg cggggtgaat atctgcccat cgtcgaactg
     1801 tggaaagtgt tcaacgtcgc gggcgcgaaa accgaagcca cccagggaat tgtggtgatc
     1861 ttacaaagtg gcggtcgccg ctacgccttg ctggtggatc aattaattgg tcaacaccag
     1921 gttgtggtta aaaaccttga aagtaactat cgcaaagtcc ccggcatttc tgctgcgacc
     1981 attcttggcg acggcagcgt ggcactgatt gttgatgtct ccgccttgca ggcgataaac
     2041 cgcgaacaac gtatggcgaa caccgccgcc tgaatgagta aaaaggtaac aatatgaccg
     2101 gtatgacgaa tgtaacaaag ctggccagcg agccgtcacg gcaggaattt ctggtattta
     2161 cccttggtga tgaagagtac ggtattgatg
//
LOCUS       HAMHSP60A    2726 bp ds-DNA             ROD       21-JUL-1990
DEFINITION  C.griseus chaperonin (HSP60) non-functional pseudogene.
ACCESSION   M34562
KEYWORDS    chaperonin; pseudogene.
SOURCE      C.griseus cell line CHO DNA, clone GC-1.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 2726)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
    IVS        1738     1824     HSP60 intron A
BASE COUNT      915 a    513 c    579 g    719 t
ORIGIN      
        1 catggtgatt tttctttaga tattctgggc ctcctgaaac tgacaagatt ctgtaaagca
       61 aaagacataa tcaacaagat aagacagcag tgcaaacaat gggaaaagtt cttcaccaac
      121 cccatatcta gccaagggca aatttctaaa acatacaaag aactgatgaa agcagtcaac
      181 aaaacaccaa ataatcgccc tcactctccg ccaccagcct gctttgccag atgagaccag
      241 tgtcccgggc gctggctcct catgtcactc gggcctatgc caatgatgta aaatgtgggg
      301 caaatccttg atccttaaag cttcaagttg tagacctttt agccgatgct gtagctgtta
      361 caatggggcc aaagggaaga acagtgatta ttgaacagag ttgaggaagt cccaaagtaa
      421 caaaagatgg ggtcattttt acaaaggcaa ttgatctaaa ggatacatac aaataatatc
      481 agagctaaac ttgttcaagt tgttgccaat aactcaaatg aagaggctgg gatggcacta
      541 ccactgctac tgttctggca agttctattg ccaacgaagg ctttgagaag atcagcaaag
      601 ggtctaatcc agtagaaatc tggagaagtg tgatgttggc tgttgatgct gtaattgctg
      661 aactaacaaa acaatctaaa catgtgacaa cccatgaaga aatttctcag gttgctacaa
      721 tttctgcaaa tggacacata gacattggaa acatcatatc tgatgcaatg aaacatgttg
      781 gagggaaaga cgtcatcaca ctgaaggata gaaataccct gaatgatgaa ttagaaatta
      841 ttggaggcat gaagtttgaa agaagatgta tttctcagac atcaataagt caaaaatgtg
      901 aattccaaga tgcctatgtt ctattgagtg aaaagaaaat ttctattgtc gagcccatta
      961 ctgctcatag gaaacaaatg gtgatatttg ctgaagatgt tgatggagaa gctctaagca
     1021 cactggtttt gaacaggcta aaagctggtc ttcagatcgt agcagttaaa gctccaggtt
     1081 tttggggaca ataggaagaa taagcttaaa gatacggcta ttgctactga tggtgagcaa
     1141 tttggagaac aggggctaaa actaaatatt gatgatggtc aagttcatga tttaggaaaa
     1201 tttggagagg tcattgtccc caaagatgat gccatgcttt tgaaaggaac tgatgactaa
     1261 actcaaattg agaaaccaat tcaagaaatc actaagcagc tagacatcac aactagtgaa
     1321 tatgaaaagg aaaagctaaa cgaaacactt gctaaacttt cagatggtgc agctgtgttt
     1381 aaggttggag gaacaaggga tgttgaagtg aaggagaaga aagacagagt tacaaatgct
     1441 ctcaatgcta caagagcagc tgttgaaaag ggcattgttt taggacttag ctgtgctctg
     1501 cctctgtgca tcccagcctt ggattcctta aagccttcaa atgaagatca gaaaataggt
     1561 atagaaatta ttaaaagagc actcaaaatt cctgcagtga caattgctga gaatgcaggt
     1621 gttgaataat ctttgagagt tgagaaaatt ctgcagggtt cctagggagt tggttattag
     1681 gctatgcccg caaattttgt gaacatggta gaaaaaggaa tcattaatcc aacaaaggta
     1741 aatgagtcag caattcttaa acattggatg gcttgttcta tggtatgtgt tttcattaat
     1801 cattttataa acattttttt gtaggttgta agaactgctt tagtagatgc ctccttgcta
     1861 acttcagcag aagctgtagt gacagaaatt ccaaaagaag agaaggacct tggaatgcgc
     1921 acacatgcct ttaatcctag cacttgcgag gcagaggcag gcagatatct gtgagttcga
     1981 ggccagcctg gactccggag tgagtgccag gatagtctcc aaagctatcc aaagagaaac
     2041 cctgtcttga aaaacaaaca aacaaaaaag acatttgtac attcctgaaa aaaacccaaa
     2101 taatccaatt aaactagggg tcagcctaat aaagataccc gaataggttt ttacatctta
     2161 agagaaaaat ctactaataa atgacttgga gcaggtcctt cacagacaat atttagtgag
     2221 agtaggtagc ctcatgtggg ttagtagttc cagttgcttc aaaatgccct caccttttgt
     2281 cataggctca tgtggaggat ctactccatt ggatggtgat attaaaagta gcacatgggt
     2341 acatgcctta agatattaag atccccaagt aaaaactgca tctcagatgc tctctgtttc
     2401 ttgggccatg atatgggcag tgagatttct gtcacacttg ccaggaaaca cttatgatct
     2461 gtctgtcaca atctcttata ttgtcaagta cttaaccaaa cacaatcacc catgacctct
     2521 catatcctcc tggacttaga caggaggcac aagtctactc aggtataaat atttttactt
     2581 ctgatgcccc acagtaacct acaatggctt cagtccagta cacaggtaag taaacccacc
     2641 aatataccat tccagctgat tgataacctc ttagatacat cccatatgct gcagttccca
     2701 aatggagaca acccaaagtt tcttga
//
LOCUS       HAMHSP70A    2243 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Chinese hamster heat shock protein (cognate form of hsp70) mRNA,
            complete cds.
ACCESSION   M34561
KEYWORDS    heat shock protein; stress protein.
SOURCE      Chinese hamster cell line CHO, cDNA to mRNA, clone RS 23.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 2243)
  AUTHORS   Ahmad,S., Ahuja,R., Venner,T.J. and Gupta,R.S.
  TITLE     Identification of a protein altered in mutants resistant to
            microtubule inhibitors with the constitutive form of the major heat
            shock protein (hsp70)
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN% GUPTAR@,SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
    pept         50     1990     heat shock protein (hsp70)
    mRNA    <     1     2243     hsp70 mRNA
    signal     2220     2225     poly-A signal
BASE COUNT      654 a    486 c    551 g    552 t
ORIGIN      
        1 gttttgtgtg gtctcgtcgc cagcacagcc gggcctacag caagcaacca tgtctaaggg
       61 acctgcagtt ggtattgatc ttggcaccac ctactcctgt gtgggtgtct tccagcatgg
      121 aaaggtggag ataattgcca atgaccaggg taaccgaacc acgccaagct atgttgcttt
      181 tactgacaca gaacgattaa ttggggatgc cgcgaagaat caggttgcaa tgaaccccac
      241 caacacagtt tttgatgcca aacgtctgat cggacgtagg tttgatgatg ctgttgttca
      301 gtctgatatg aagcactggc ccttcatggt ggtgaacgat gcaggcaggc ccaaggtcca
      361 agtagaatac aaaggagagg caaaaagttt ctacccagag gaggtgtcct ctatggttct
      421 gaccaagatg aaggaaattg cagaagccta tctcgggaag actgttacca atgctgtggt
      481 cacagtgcca gcctacttca atgattctca gcgacaggct acaaaagatg ctggaactat
      541 tgctggcctg aatgtcctgc gaattatcaa tgaaccaact gctgctgcta ttgcttatgg
      601 gctagataag aaggttggag ctgaaaggaa tgtgctcatt tttgacttgg gaggtggtac
      661 ttttgatgtg tctatcctca ctattgagga tggaattttt gaagtcaaat caacagccgg
      721 agacacccac ttgggtggag aagactttga taacagaatg gtcaaccatt tcattgctga
      781 gtttaagcga aacgacaaga aggacatcag cgagaacaag agagctgtca ggcgtctgcg
      841 tactgcctgt gaacgggcaa agcgcaccct ctcctccagc acccaggcca gtattgagat
      901 tgattctctc tatgagggaa ttgatttcta tacctccatt acccgtgctc gattcgaaga
      961 attgaatgct gacctgttcc gaggcactct ggaccctgta gagaaggccc ttcgagatgc
     1021 taaactagac aagtcacaga tccatgacat tgtcttggtg ggtggttcca ccagaatccc
     1081 caagattcag aagcttctgc aagacttctt caatggaaaa gaactgaata agagcattaa
     1141 ccccgatgaa gctgttgctt atggtgcagc tgtccaggcg gccattctat ctggagacaa
     1201 gtctgagaat gttcaggatt tgttgctctt ggatgtcact cctctttccc ttggcattga
     1261 aactgctggt ggagtcatga ctgtcctcat caagcgcaat accaccatcc ctaccaagca
     1321 gacgcagact ttcactacct attctgacaa tcagcctggt gtgctcattc aggtctatga
     1381 aggcgaaagg gccatgacca aggataacaa cctgcttgga aagtttgaac ttacaggcat
     1441 ccctccagca ccccgtggtg ttcctcagat tgaagttact tttgacattg atgccaatgg
     1501 catcctcaat gtctctgctg tagataagag cacaggcaag gagaacaaga tcactatcac
     1561 caacgacaag ggccgcttga gcaaggagga tattgaacgc atggtccagg aagctgagaa
     1621 gtacaaagct gaagatgaga agcagagaga taaggtttcc tccaagaatt cactggagtc
     1681 ctatgctttc aacatgaaag caacagttga agacgagaaa cttcaaggca agatcaatga
     1741 tgaggacaaa cagaagattc tagacaagtg taatgaaatc atcagttggc tggataagaa
     1801 tcagactgca gagaaggaag aatttgaaca ccagcagaaa gaactggaga aagtctgcaa
     1861 ccccatcatt accaagctgt accagagtgc tggtggcatg cctggaggaa tgcccggggg
     1921 cttccctggt ggtggagctc ctccatctgg tggtgcttct tcagggccta ccattgaaga
     1981 ggtggattaa gtcagtccaa gtagaggtgt agcattgttc cacagggaaa catttgaagg
     2041 acccaaattt gcaaagtttc atggcagttt ttaaatttaa actgttaaca ataaattact
     2101 gggcattctc aatacttgaa catggaacat gtacacagga aataaaacat tgctctttat
     2161 aagcactgta tcctaagtgg aaaactcaat gtctaatatg gtggaaaatg gaatgtctaa
     2221 ataaaactat ttaaattggc acc
//
LOCUS       HAMTCP1A     1845 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Chinese hamster T-complex protein 1 (TCP-1) mRNA, complete cds.
ACCESSION   M34665
KEYWORDS    T-complex protein 1.
SOURCE      Chinese hamster cell line CHO, cDNA to mRNA, clone pRG 501.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 1845)
  AUTHORS   Ahmad,S. and Gupta,R.S.
  TITLE     Cloning of a Chinese hamster ovary cell protein homologous to the
            mouse T-complex protein TCP-1: Structural similarity to the
            ubiquitous "Chaperonin" family of heat shock proteins
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 01-JUN-1990.
            
              Author address: R.S.Gupta
              McMaster University
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR@,SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
    pept         89     1759     T-complex protein 1
    signal     1818     1823     poly-A signal
BASE COUNT      530 a    368 c    472 g    475 t
ORIGIN      
        1 gttccccgct gtggtggccg cagtccggcc tctgagcgga gatcccggcc gtccgcggtg
       61 ttagcgaggc cagccaggtt ccctaacgat ggagggccct ttgtccgtgt tcggggaccg
      121 cagcactgga gaggcgatcc gctcccagaa tgttatggct gcagcttcca ttgccaacat
      181 tgtgaaaagt tctctggggc cagttggctt ggataaaatg ttggtggatg acattggtga
      241 tgtaaccatt accaacgatg gggccaccat cctgaagtta ctggaggtag aacaccctgc
      301 agctaaagtt ctttgtgagc tggctgactt gcaagacaaa gaagtgggag atggaactac
      361 ctctgtggta attattgcag cggagcttct gaaaaatgca gatgaactag tcaaacagaa
      421 aattcatcca acatcagtta ttagtggcta tcggcttgcc tgcaaggaag cagtgcgcta
      481 tatcagtgag aacctaatta ttaacacaga cgaacttgga agagactgtc tgattaatgc
      541 cgctaagaca tccatgtctt ccaaaattat tggaataaat ggtgatttct ttgctaatat
      601 ggtggtagat gctgtacttg ctgtcaaata cacagatctg agaggccagc ctcgatatcc
      661 agtcaattct gttaatattc tgaaagccca tggaagaagt caggtagaaa gcatgctgat
      721 caatggctat gcactgaact gtgtggtggg atctcagggc atgcccaaga ggatagttaa
      781 tgcaaaaatt gcatgtcttg acttcagcct gcagaaaaca aaaatgaagc ttggtgtgca
      841 ggtggttatt acagatcctg agaaattgga ccaaattaga cagagagaat cagacatcac
      901 caaggagaga attgaaaaga ttctggcgac tggggccaat gttattctga ccactggtgg
      961 gatagatgat atgtgtctga agtactttgt ggaggctggt gccatggctg ttagacgagt
     1021 tctaaaaagg gatctcaagc gtatcgctaa agcttctgga gcaagtatcc tgtccacact
     1081 ggccaatttg gaaggcgaag aaacttttga agctaccatg ctggggcaag cagaagaagt
     1141 ggtccaggag agaatttgtg atgatgagct gatcttaatc aaaaatacca aagctcgtac
     1201 atctgcatca atcatcttac gaggagcaaa tgatttcatg tgtgatgaaa tggagcgatc
     1261 tttacatgat gctctttgtg tggtgaagag agttttggag tcaaaatctg tagtcccagg
     1321 aggaggtgct gtagaagccg ccctgtccat ataccttgaa aactatgcaa ccagtatggg
     1381 gtctcgggaa cagcttgcta ttgcagagtt tgcaaggtct cttcttgtta ttcctaatac
     1441 actggcagtg aatgctgccc aagactccac tgacctggtt gccaagttac gagcttttca
     1501 caatgaggct caagttaacc cagaacgtaa aaacctgaag tggattggtc ttgatttgat
     1561 caatggaaaa ccacgagaca acaagcaagc aggggtgttt gaaccaacca tagttaaagt
     1621 gaagagcctg aagtttgcaa cagaagctgc aatcactatt cttcgaatcg atgatctgat
     1681 aaaattacac ccggaaacta aagacgataa acatggaagt tatgaaaatg ctgttcactc
     1741 tggagccctt gatgactgat tggatttccc ttttatttat agcagtgtca ggtgcaatgt
     1801 agccttgggt atctcacatt aaagtacagt gagctgtcca cctga
//
LOCUS       HUMHP601A    1941 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human chaperonin (HSP60) non-functional pseudogene 1.
ACCESSION   M34660
KEYWORDS    chaperonin; pseudogene.
SOURCE      Human fetal liver DNA, clone PS2-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 111 to 1941)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1941)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
BASE COUNT      624 a    329 c    446 g    541 t      1 others
ORIGIN      
        1 atatgaaata aagcctttat gtgctctacc taaattaata aatttattta ttttttttng
       61 tatatttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct
      121 gcctcgccat gcgacatgtc ctgccagcac cccacagaaa tgcttccatt ccattaccca
      181 cagtctttca ccagatgaga ccagtgtcca ggctactggc tcctcacctc acttgaagtg
      241 atggtaaaga tgtaaaattt ggtgctgatg tccgagcctt aatgcttaaa agtgtagacc
      301 tttttgtcga tgtagccatt acaatggggc caaagggaag aacagtaatt attgaacata
      361 gctgggcaag tcccaaagta acaaaagatg gtgtgactga tgcaaagtca attgacttaa
      421 aagataaata taaaagcatt ggagctaaac ttgtccaaga tgttgccaat aacacagatg
      481 aagagactgg ggatggcact atcacttgct gttgtactgg tatgctctat ttccaaataa
      541 ggtttccaga aggttagcaa aggtgctaat ccagtggaaa tcaggagagg tgtatgttag
      601 ctgatgctgt aattgctgaa cttaaaaagc agtctaaacc tgtgaccaaa cctgaagaaa
      661 ttgcacaggt tgctacaatt tctgcaaatt gagacaaaga aattggcaac atcatctctg
      721 atgcaatgaa aaaagttgga agaaagggca tcatcacagt aaaggatgga aaaacactga
      781 ctgtagaatt agaaattatt gaaggcatga aatttgatcg aggatatatt tctccatact
      841 ttattaatac atcaaaaggt gagaaatgtg aattccagga tgcctatgtt ctgttgcatg
      901 aaaagcaaaa tttctacgtg ctccacgtcc attgtaactg ctcttgcaaa ttgccaatgc
      961 ttactgtaag cctttggtca taattgctgg agatattgat ggagaagctc taactacact
     1021 catcctgaat aggctaaagg ttggtcttca ggttgtggca gtcaaggctc cagggtttgg
     1081 tgacaataga aagaaccagc ttaaagatac ggttattgct actggtggta cagtgtttgg
     1141 agaagagggc tcacactaaa tcttgaagac gttcagcctc gtgacgtagg agagttggag
     1201 aggtactgtg atcaagatat gccatgctct taaaagcaaa aggtaacaag tctcaaatta
     1261 aaaatgtgtt caagaaatca ttgaccagtc agatgtcaca actatgtaat tgaaaaggaa
     1321 aaagtgagtg gagaaacttt cagatggagt agctgtgctg aaggttggtg gcaagtgatg
     1381 ttgaagttaa tgaagagaaa gacagagtta taggtgcact taatgctaca agagctgctg
     1441 ttgaagaagg cattgtttag ggagggggtt gtgccctgct tcgatgcatt ccagccttgg
     1501 actcattcac tccagctaat gaagataaaa taattggtat agaaattatt aaaagaacac
     1561 tcaaaattcc agcaatgaca attcgtaaga atgcaggtgt tgatgcattt ttgatagttg
     1621 agaaaattat gtaaagttcc tcagaagttg gttatgatac tatgttagga gatgtcgtga
     1681 atatggtgga aaaagacatt attgacccaa caaaggttgt gagaactgct tcattggatg
     1741 ctgctggcat gccctctata ttaactacag cagctgttgt agtcacagaa attcctaaag
     1801 aagggaacag ccctggaatg ggtgcatgtg gaatgggagg tggcctgttc taattcctag
     1861 aatagtgctt tacctttatt aatgaattgt gataggaacc caaggcagtg ttcctcacca
     1921 ataacttcag agaagtcagt t
//
LOCUS       HUMHP602A    1920 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human chaperonin (HSP60) non-functional pseudogene 2.
ACCESSION   M34662
KEYWORDS    chaperonin; pseudogene.
SOURCE      Human fetal liver DNA, clone PS2-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 117 to 1920)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1920)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
BASE COUNT      619 a    324 c    431 g    544 t      2 others
ORIGIN      
        1 tggattatgn aaatnaaagc ctttatgtgc tctgcctaaa ttaatatttt attttttttt
       61 gatacttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct
      121 gcctcacctc accatgcaca tgtcctgcca ccaccccaca gaaatgcttc acttacccac
      181 agtctttcgc cagatgagac cagtgtccag ggtactggct cctcaccctc acttgaagga
      241 tggtaaagat gtaaaatttg gtgctgatgt ctgcgcctta atgcttcaaa gtgtagactt
      301 ttgtcagtaa ttattgaaca tagctgggaa gtcccaaagt aacaaaagat ggtctgactg
      361 atgcaaagtc aattgactta aaggataaat ataaaagcat tggagctaaa cttctccaag
      421 atgttgccta aaacacaaca aatggagaga ctggggatgg cactatcact gctgctgtac
      481 tggtatgctc tatttccaaa taagccttcc agaaggttag caaaggtgct aatccagtgg
      541 aaatcatcag agagtgtgat gttagctgtt gatgctgtaa ttgctgaact taaaaagcag
      601 tctaaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga
      661 cacaaagaaa attggtaaca tcatctctga tgcaatgaaa aagtttggaa gaaagggcat
      721 catcacagta aaggatggaa aaacactgac tgatgaatta gaaattattg aaggcatgaa
      781 aaatttgatc gaggatatat ttctccatac tttcttaata catcaaaagg tgagaaatgt
      841 gaattccagg atgcctatgt tctactgagt gaaaagaaca tttctggtgt ccagtccatt
      901 gtacctgctc ttgaaattgc cagtgcttac cttaagcctt tggtcacaat tgctgaagat
      961 attgatgaag aaacgttaag tacgctcatc ttgaataggc taaaagttgg tcttcaggtt
     1021 gtagcagcca aagtttcagg ttttggtgac aacggagcca gcttaaagat atggctattg
     1081 ctactggtag tgcagtgttt ggagaagagg ggttaaccct aatcttgaag atgctcagcc
     1141 tcatgactca ggaaaatttg gagaggtcat tgtgccaaag tagtgctatg tatttaaaag
     1201 gaaaagatga caaagcttaa attgaaaaat gtattcaaga aatcattaag tagttagata
     1261 tcataactag tggatatgaa aagaaaaaac tgaatgaata tatggcaaaa ctttcagatg
     1321 gagtagccgt gttgatggtt ggttggtgag acaagtgatg ttcaggtgaa agacaagaaa
     1381 gacagattta ccgatgtcct taatgctaca agagctgcta ttgaagaagg cattgttttg
     1441 ggagggggtt gtgccctgct tcgagcattc cactcattca ctccagctaa tgaagataaa
     1501 aataatggca tagaaattat taaaagaaca ctcaaaattc cagcaatgac tattgctaag
     1561 aatgcaggtg ttgatttttg atagttgaga aaattatgca aagttcctca gaagttggtt
     1621 atgatactat gttaggagat gtcgtgaata tggtggaaaa agacattatt gacccaacaa
     1681 aggttgtgag aactgcttta ttggatgctg cttgcatggc tctctattaa ctacagcagc
     1741 tgttgtagtc acaaaaattc ctaaagaagg gaacagccct ggaatgggtg caatgtgtgg
     1801 aatgggaggt ggcctgttct aattcctaga atagtgcttt acctttatta atgaattgtg
     1861 atagggagcc caaggcgtgt tcctcaccaa taacttcaga gaagtcagtt ggagaaaatg
//
LOCUS       HUMHP603A    1912 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human chaperonin (HSP60) non-functional pseudogene 3.
ACCESSION   M34661
KEYWORDS    chaperonin; pseudogene.
SOURCE      Human fetal liver DNA, clone PS5-4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 99 to 1912)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1912)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
BASE COUNT      614 a    319 c    449 g    527 t      3 others
ORIGIN      
        1 ctttatgtgc tcttanctaa attaataatt tatttanntt tttttgtata tttgctatgt
       61 cacctttaga aagtaagagg cagctctcac ttgctgctga tgacctgcct cgccatgcac
      121 atgtctgcca gcaccccaca gaaatgcttc cattacccac agtctttcac cagatgagac
      181 cagtgtccag ggtactggct cctcacctca cttgaagtga tggtaaagat gtaaaatttc
      241 ctgctgatgt ccgagcctta atgcttcaaa gtgtagactt ttttgtcgat gctgtagcca
      301 ttacaatggg gccaaaggga agaacagtaa ttattgaaca tagctgggga agtcccaaag
      361 taacaaaaga tggtgtaact gatgcaaagt caattgactt aaaggataaa tataaaagca
      421 ttggagctaa acttgtccaa gatgttgcca ataacacaga tgaagagact gggggatggc
      481 actatcactg ctgctgtact ggtatgctct atattcaaat aaggcttcca gaaggttagc
      541 aaaggtgcta atccagtgga aatcaagaga ggtgtgatgt tagctgttga tgctgtaatt
      601 gctgaactta aaaagcagtg taaacctgtg accaaacctg aaattgcaca ggttgctaca
      661 atttctgcaa atggagacaa agaaattggt aacatctctg atgcaatgaa aaagtttgga
      721 agaaagggca tcacagtaaa ggatggaaaa acactgactg atgaattaga aattattgaa
      781 ggcatgaaaa tttgatcgag gatatatttc tccatacttt attaatacat caaaaggtga
      841 gaaatgtgaa ttccaggatg cctatgttct gttgcatgaa aagaaaattt ctagtgtcca
      901 gtccattgta actgctcttg aaattgccaa tgcttactgt aagcctttgg tcataattgc
      961 tggagacatt gatggagaag ctctaactac actcatcctg aataggctaa aggttggtct
     1021 tcaggttgtg gcagtcaaag ctccagggtt tgggtacaat agaaagaacc agcttaaaga
     1081 tacggtcatt gctactggtg gtacagtgtt tggagaagag ggctgacact aaatcttgaa
     1141 gacgttcagc ctcgtgacgt aggagaagtt ggagaggtca ctgtgaacaa agattatgcc
     1201 atgctcttaa aaggaaaagg taacaagtct caaattgaaa aatgtgttca agaaatcatt
     1261 gaccagtcag atgtcacaac tagtgaatac gaaaaggaaa aagtgagtgg agaaactttc
     1321 agatggagta gctgtgctga aggttgttgg tgggacaagt gatgttgaag tgaatgaaga
     1381 gaaagacaga gttacaggtg cacttaatgc tacaagagct gctgttgaag aaggcattgt
     1441 ttagggaggg ggttgtgccc tgcttcgatg cattccagcc ttggactcat tcactccagc
     1501 taatgaagat aaataattgg tatagaaatt attaaaagaa cactcaaaat tccagcaatg
     1561 acaattgcta agaatgcagc tgttgatggc attttgatag ttgagaaaat tatgtaaagt
     1621 gcctcagaag ttggttatga tactatgtta ggagatgtcg tgaatatggt gaaaaagaca
     1681 ttattgaccc aacaaaggtt gtgagaactg cttcattgga tgctgctggc atggcctctc
     1741 tattaactac agcagctgtt gtagtcacag aaattcctaa agaagagaac agccctggaa
     1801 taggtgcaat gtgtggaatg ggaggtggcc tgttcaaatt cctagaatag tgctttacct
     1861 ttattaatga attgtgatag gaacccaagg cagtgttcct caccaataac tt
//
LOCUS       HUMHP604A    2019 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human chaperonin (HSP60) non-functional pseudogene 4.
ACCESSION   M34663
KEYWORDS    chaperonin; pseudogene.
SOURCE      Human fetal liver DNA, clone PS5-4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 123 to 2019)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2019)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
BASE COUNT      657 a    327 c    479 g    556 t
ORIGIN      
        1 agagaaatgt aatatgaaat aaagctttat gtgctctacc taaattaata atttatttat
       61 ttttttttgt atatttgcta tgtcaccttt agaaagtaag aggcagctct cacttgctga
      121 ttgatgacct gatgatgacc tgcctcgcca tgcacatgtc ctgccagcac cccacagaaa
      181 tgcttccatt acccacagtc tttcaccaga tgagaccagt gtccaggcta ctggctactg
      241 gctcctcacc tcacttgaag tgatggtaaa gatgtaaaat ttggtgctga tgtccgagcc
      301 ttaatgcttc aaagtgtata gcttttgtcg agctgttagc cattacaaat gggccaaagg
      361 gaagaacagt agattattga acatagctgg ggaagtccca aagtaacaaa agagtggtgt
      421 gactgatgca aaggcaattg acttaaagga taaatataaa agcattggag ttaaacttgt
      481 ccaagatgtt gccaataaca cagatgaaga agactgggga tggcactatc actgctgctg
      541 tactggtatg ctctatttcc aaataaggct tccagaaggt tagcaaaggt gctaatccgt
      601 ggaaatcagg agaggtgtga tgttagctgt tgatgctgta attgctgaac ttaaaaagca
      661 gtctaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga
      721 gacaaagaaa ttggtaacat catctctgat gcaatgaaaa actttggaag aaagggcatc
      781 atcacagtaa aggatggaaa aacactgact gatgaattag aaattattga aggcatgaaa
      841 atttgatcga ggatatattt ctccatactt tattaataca tcaaaaggtg agaaatgtga
      901 attccaggat gcctatgttc tgttgcatga aaagaaaatt tctagtgtcc agtccattgt
      961 aactgctctt gaaattgcca atgcttagtg taagcctttg gtcatatatt gctggagaca
     1021 ttgatggaga agctctaact acactcatcc tgaataagct aaaggttggt cttcaggttg
     1081 tggcagtcaa agctaaagct ccagggtttg gtgacaatag aaagaaccag cttaaagata
     1141 cggttattgc tactggtggt acagtgtttg gagaagaggg ctgacactaa atcttgaaga
     1201 cgttcagcct cgtgacgtag gagaagttgg agaggtcact gtgatcaaga ttatgccatg
     1261 ctcttaaaag gaaaaggtaa caagtctcaa attgaaaaat gtgtgcaaga aatcgttgac
     1321 cagtcagatg tcacaactat gtaatacgaa aaggaaaaag tgagtgaaga aactttcaga
     1381 tggagtagct gtgctgaagg ttggtgggac aagtgatgtt gaagtgaatg aagagaaaga
     1441 cagagttata ggtgcactta atgctacaag agctgctgtt gaagaaggca ttgtttaggg
     1501 agggggttgt gccctgcttc gatgcattcc agccttggac tcattcactc cagctaatga
     1561 agataaaata attggtatag aaattattaa aagaacactc aaaattccag caatgacaat
     1621 tgctaagaat gcaggtgttg atgcattttt gatagttgag aaaattatgt aaagttcctc
     1681 agaagttggt tatgatacta tgttaggaga tgtcgtgaat atggtggaaa aagacattat
     1741 tgacccaaca aaggttgtga gaactgcttc attggatgct gctgcatggc ctctctatta
     1801 ttaactacag cagctgttgt agtcacagaa attcctaaag aagggaacag ccctggaatg
     1861 ggtggaatgg gtgcaatgat atgtggaatg ggaggtggcc tgttctaatt cctagaatag
     1921 tgctttacct ttattaatga attgtgatag gaaacccaag gcagtgttcc tcaccaataa
     1981 cttcagagaa gtcagttgga gaaaatgaag aaaaaggct
//
LOCUS       HUMHSP60A    2202 bp ss-mRNA            PRI       21-JUL-1990
DEFINITION  Human chaperonin (HSP60) mRNA, complete cds.
ACCESSION   M34664
KEYWORDS    chaperonin.
SOURCE      Human placenta cDNA to mRNA, and DNA, clone PGEM-10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2202)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
    pept         25     1746     chaperonin (HSP60)
BASE COUNT      699 a    371 c    538 g    594 t
ORIGIN      
        1 cacgcttgcc gccgccccgc agaaatgctt cggttaccca cagtctttcg ccagatgaga
       61 ccggtgtcca gggtactggc tcctcatctc actcgggctt atgccaaaga tgtaaaattt
      121 ggtgcagatg cccgagcctt aatgcttcaa ggtgtagacc ttttagccga tgctgtggcc
      181 gttacaatgg ggccaaaggg aagaacagtg attattgagc agggttgggg aagtcccaaa
      241 gtaacaaaag atggtgtgac tgttgcaaag tcaattgact taaaagataa atacaagaac
      301 attggagcta aacttgttca agatgttgcc aataacacaa atgaagaagc tggggatggc
      361 actaccactg ctactgtact ggcacgctct atagccaagg aaggcttcga gaagattagc
      421 aaaggtgcta atccagtgga aatcaggaga ggtgtgatgt tagctgttga tgctgtaatt
      481 gctgaactta aaaagcagtc taaacctgtg accacccctg aagaaattgc acaggttgct
      541 acgatttctg caaacggaga caaagaaatt ggcaatatca tctctgatgc aatgaaaaaa
      601 gttggaagaa agggtgtcat cacagtaaag gatggaaaaa cactgaatga tgaattagaa
      661 attattgaag gcatgaagtt tgatcgaggc tatatttctc catactttat taatacatca
      721 aaaggtcaga aatgtgaatt ccaggatgcc tatgttctgt tgagtgaaaa gaaaatttct
      781 agtatccagt ccattgtacc tgctcttgaa attgccaatg ctcaccgtaa gcctttggtc
      841 ataatcgctg aagatgttga tggagaagct ctaagtacac tcgtcttgaa taggctaaag
      901 gttggtcttc aggttgtggc agtcaaggct ccagggtttg gtgacaatag aaagaaccag
      961 cttaaagata tggctattgc tactggtggt gcagtgtttg gagaagaggg attgaccctg
     1021 aatcttgaag acgttcagcc tcatgactta ggaaaagttg gagaggtcat tgtgaccaaa
     1081 gacgatgcca tgctcttaaa aggaaaaggt gacaaggctc aaattgaaaa acgtattcaa
     1141 gaaatcattg agcagttaga tgtcacaact agtgaatatg aaaaggaaaa actgaatgaa
     1201 cggcttgcaa aactttcaga tggagtggct gtgctgaagg ttggtgggac aagtgatgtt
     1261 gaagtgaatg aaaagaaaga cagagttaca gatgccctta atgctacaag agctgctgtt
     1321 gaagaaggca ttgttttggg agggggttgt gccctccttc gatgcattcc agccttggac
     1381 tcattgactc cagctaatga agatcaaaaa attggtatag aaattattaa aagaacactc
     1441 aaaattccag caatgaccat tgctaagaat gcaggtgttg aaggatcttt gatagttgag
     1501 aaaattatgc aaagttcctc agaagttggt tatgatgcta tggctggaga ttttgtgaat
     1561 atggtggaaa aaggaatcat tgacccaaca aaggttgtga gaactgcttt attggatgct
     1621 gctggtgtgg cctctctgtt aactacagca gaagttgtag tcacagaaat tcctaaagaa
     1681 gagaaggacc ctggaatggg tgcaatgggt ggaatgggag gtggtatggg aggtggcatg
     1741 ttctaactcc tagactagtg ctttaccttt attaatgaac tgtgacagga agcccaaggc
     1801 agtgttcctc accaataact tcagagaagt cagttggaga aaatgaagaa aaaggctggc
     1861 tgaaaatcac tataaccatc agttactggt ttcagttgac aaaatatata atggtttact
     1921 gctgtcattg tccatgccta cagataattt attttgtatt tttgaataaa aaacatttgt
     1981 acattcctga tactgggtac aagagccatg taccagtgta ctgctttcaa cttaaatcac
     2041 tgaggcattt ttactactat tctgttaaaa tcaggatttt agtgcttgcc accaccagat
     2101 gagaagttaa gcagcctttc tgtggagagt gagaataatt gtgtacaaag tagagaagta
     2161 tccaattatg tgacaacctt tgtgtaataa aaatttgttt aa
//
LOCUS       MUSGCANF     4049 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse guanylate cyclase/atrial natriuretic factor mRNA,
            complete cds.
ACCESSION   J05504
KEYWORDS    atrial natriuretic factor; guanylate cyclase.
SOURCE      Mouse (strain C57B1/6J) Leydig cell line MA-10, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4049)
  AUTHORS   Pandey,K.N. and Singh,S.
  TITLE     Molecular cloning and expression of murine guanylate cyclase/atrial
            natriuretic factor receptor cDNA
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.N.Pandey, 31-MAY-1990.
FEATURES       from  to/span     description
    pept        348     3521     guanylate cyclase/atrial natriuretic factor
    signal     4029     4034     poly-A signal
BASE COUNT      879 a   1164 c   1140 g    866 t
ORIGIN      
        1 cagaaaccct cccaaactcc tatagccaca cacacctttc ccggccaaga tccaaacaaa
       61 cctctacttt cctcttccct aggagccaga ctcccttcgg gtgctgcgct cgctctcacc
      121 tgctctaaag cacctccgct ctcggacgct cccaattccg ccctcctgct cgacggcggg
      181 acagtcgcag cctcggcagg cagcttgctc tcgccgctgc ggcttcaacc cagccccctc
      241 cctcgctacg gctgggcgct cttgactccc gaccctcgcc tctgagcccg aggacggcga
      301 tcagaccatg gtgacagcgc tgctccgtcg ctgcgctcgc tgaggccatg ccgcgttccc
      361 gacgcgtccg tccgcgccta agggcgctgc tgctgctacc gccgctgctg ctgctccgaa
      421 gcggccacgc gagcgacctg accgtggctg tggtgctgcc cgtgaccaac acctcgtacc
      481 cgtggtcctg ggcgcgtgta gggccggcgg tggaactggc tctcgggagg gtgaaggctc
      541 ggccggactt gctgccgggt tggacggtcc gtatggtgct gggcagcagc gagaacgcgg
      601 cgggcgtctg ctccgacacc gctgcaccgc tggccgcggt ggatctcaag tgggagcaca
      661 gccccgccgt gttcctgggc cccggctgcg tatactctgc tgccccggtg gaccgcttca
      721 ccgcgcactg gcggttgccg ctgctgacgg ctggcgcccc ggctctgggc atcggggtga
      781 aggatgagta cgcgttaacc acccgcacag gacccagcca tgtcaagctg ggcgacttcg
      841 tgacggcgct gcatcgacgg ctgggctggg agcaccaggc gcttgtgctc tatgcagatc
      901 ggctgggcga cgaccggccg tgcttcttca tagtggaggg gctgtacatg cgggtgcgtg
      961 agcgactcaa catcacagta aatcaccagg agttcgtcga gggcgacccg gaccactaca
     1021 ccaagctact gcggaccgtg cagcgcaagg gcagagttat ctacatctgc agttctccgg
     1081 atgccttcag gaatctgatg cttttggccc tggatgctgg cctgactggg gaggactatg
     1141 ttttcttcca cctggatgtg tttgggcaaa gccttcaggg tgctcagggc cctgttccag
     1201 agaagccctg ggaaagagac gatgggcagg ataggagagc ccgccagcgc tttcaggctg
     1261 caaaaattat tacttacaaa gaacccgata atcctgagta cttggaattc ctgaagcagc
     1321 taaaactctt ggctgacaag aaattcaact tcaccatgga ggatggcctg aaaaatatca
     1381 tcccagcatc cttccatgac gggctcctgc tctatgtcca ggcagtgaca gagactctgg
     1441 cacagggggg cactgtcact gatggagaga acatcactca gcggatgtgg aaccgaagct
     1501 tccaaggtgt gacaggatac ctgaaaattg atagaaatgg agatcgggac actgattctc
     1561 ctctctggga tatggacccc gagacaggtg ccttcagggt tgtcctgaac tttaatggta
     1621 cttcccagga gctgatggct gtgtcagaac acagattata ctggcctctg ggatacccac
     1681 ctcctgacat ccctaaatgt ggctttgaca atgaggaccc agcctgcaac caagaccact
     1741 tttccacact ggaggttctg gctttggtgg gcagcctctc tctggttagc tttctgatcg
     1801 tgtctttctt catatacagg aagatgcagc tggaaaagga gctggtctca gagttgtggc
     1861 gggtgcgctg ggaggacttg cagcccagca gcctggagag gcaccttcgg agcgctggca
     1921 gtcggctgac cctgagtggg cgaggctcca attatggctc cctgctaacc acggagggcc
     1981 agttccaagt ctttgccaag acagcatact ataagggcaa cctcgtggct gtgaaacgtg
     2041 tgaaccggaa acgcattgag ttgacacgaa aagtcctgtt tgaacttaaa catatgcggg
     2101 atgtgcagaa tgagcaattg accagatttg tgggagcttg taccgaccct cccaacatct
     2161 gtatcctcac agagtactgt ccccgtggaa gcctacagga cattctagag aatgagagta
     2221 ttaccctgga ctggatgttt cggtactcac tcaccaatga cattgtcaag ggaatgctct
     2281 ttctacacaa cggggccatt tgttcccatg ggaacctcaa gtcatccaac tgcgtggtag
     2341 atggacgttt tgtgttaaag atcacagact atgggctcga gagcttcaga gacccggagc
     2401 cagagcaagg acacaccctc tttgccaaaa aactgtggac tgcacctgag ctcctgcgaa
     2461 tggcttcccc acctgcccgt ggctcccaag ctggggatgt ctacagtttt ggtatcatcc
     2521 ttcaggaaat tgccctaaga agtggggtct tctatgtgga aggtttggac ctcagcccaa
     2581 aagagatcat tgagcgtgtg actcggggtg agcagccccc attccgacct tccatggatc
     2641 tgcagagcca cctggaggaa ctggggcagc tgatgcagag gtgctgggca gaggatcctc
     2701 aggagcggcc accctttcaa cagatccgcc tggcgctgcg caagttcaac aaggagaaca
     2761 gcagcaacat cctggacaac ctgctgtcac gcatggaaca gtacgccaac aacctggagg
     2821 aactggtaga ggagagaaca cagccttatc tggaggagaa gcgcaaagct gaggccctgc
     2881 tttaccagat tctgcctcac tctgtggctg agcagctgaa gagaggcgag acagtccagg
     2941 ctgaggcatt tgatagtgtt actatctatt tcagtgatat cgtgggcttt acagctcttt
     3001 cagcagagag cacacccatg caggtggtca ccctgctcaa tgatctgtac acctgttttg
     3061 atgctgtcat agacaacttt gatgtgtaca aggtagagac cattggtgat gcttacatgg
     3121 tggtatcagg gctcccagtg aggaatggac agctccatgc ccgagaggta gcccgaatgg
     3181 cacttgcact gctcgatgct gtacgctcct tccgcatcgg ccataggccc caggaacagc
     3241 tgcgcttgcg cattggaatt cacacaggtc ctgtgtgtgc tggtgtggta gggctaaaga
     3301 tgccccgata ctgcctcttt ggagacacag tcaacacagc ttcaagaatg gagtctaatg
     3361 gggaagccct caggatccac ttgtcttcgg agaccaaggc tgtgctggaa gagttcgatg
     3421 gtttcgagct ggagctccga ggggatgtgg aaatgaaggg caaaggcaag gttcgttcct
     3481 attggctcct cggggaccgg ggatgcagct ctcgagcctg acctactgcc ctgctattcc
     3541 ttgtcacctc ccctccctat cccagcaatg acacgggtct ccaacttccc cctctcccac
     3601 agcagctcag ccactgtgga aagattaggg acctaaccag cgcagtcatc agatgtgacc
     3661 tctgagagag gatggagatg gtggggactg gagggggact cctaagttta tagggctgac
     3721 tgaaataccc agtcactccc gtagcacatg ccccgccccc cccccgcccc cccactcagc
     3781 tgcctagcag acagtgattc cttctgccgc cctcaactta gctccactgt gagttagagg
     3841 gagggaaatt gccacctgaa ggaaagagaa aagagattct cggggtttgc aggaggcagg
     3901 cagtcctgtg tcacaaatac tcccctcact cccagtccac cacctgcccc accgacttcc
     3961 cttcccacac agtgcactga ggagaagaga ggcatggggt tgccttgctt ctcctatgag
     4021 caaaacccat taaagtcttt attcctgtg
//
LOCUS       MUSMHH2DR    1415 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse MHC class I H-2Dr protein mRNA, complete cds.
ACCESSION   M34961
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Mouse (strain B10.RIII) spleen, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1415)
  AUTHORS   Zeling,C. and Pease,L.R.
  TITLE     Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Zeling, 29-MAY-1990.
            
              Author address: C.Zeling
              Mayo Clinic
              323 Guggenheim
              200 1st St. SW
              Rochester, MN 55905
              email: Pease@mayo.edu
            
FEATURES       from  to/span     description
    pept          1     1089     MHC H-2Dr protein precursor
    sigp          1       72     MHC H-2Dr protein signal peptide
    matp         73     1086     MHC H-2Dr protein
BASE COUNT      298 a    398 c    442 g    277 t
ORIGIN      
        1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact
       61 cagacccgcg cgggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc
      121 ctcggggagc cccggtacat agaagtcggc tacgtggacg acacggagtt cgtgcgcttc
      181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag
      241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagat tttccgagtg
      301 aacctgagga ccctgctcag ctactacaac cagagcaagg gcggctctca cacgatccag
      361 tggatgttcg gctgtgacgt ggagtcggac gggcgcctcc tccgcgggta ccagcagtac
      421 gcatacgacg gctgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg
      481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg ctggtgctgc agagagaaag
      541 agggcctacc tagagggcac gtgcgtggag tggctcctca gacacctgaa gaacgggaac
      601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaaa
      661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc
      721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca
      781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctctcgggaa ggagcagaat
      841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct
      901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg
      961 gccattattg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa
     1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt
     1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc
     1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc
     1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc
     1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga
     1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag
     1381 aataagaatt tgaatgtggg tggctggaga gatgg
//
LOCUS       MUSMHH2DS    1415 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  M.musculus MHC class I H-2Ds protein mRNA, complete cds.
ACCESSION   M34962
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      M.musculus (strain B10.RIII) spleen, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1415)
  AUTHORS   Zeling,C. and Pease,L.R.
  TITLE     Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Zeling, 29-MAY-1990.
            
              Author address: C.Zeling
              Mayo Clinic
              323 Guggenheim
              200 1st St. SW
              Rochester, MN 55905
              email: Pease@mayo.edu
            
FEATURES       from  to/span     description
    pept          1     1089     MHC H-2Dr protein precursor
    sigp          1       72     MHC H-2Dr protein signal peptide
    matp         73     1086     MHC H-2Dr protein
BASE COUNT      292 a    401 c    444 g    278 t
ORIGIN      
        1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact
       61 cagacccgcg ctggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc
      121 ctcggggagc cccggtacat agaagtcggc tacgtggaca acacggagtt cgtgcgcttc
      181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag
      241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagag tttccgagtg
      301 aacctgagga ccgcgctccg ctactacaac cagagcaagg gcggctctca cacggtccag
      361 gtgatgtctg gctgtgacgt ggggtcggac gggcgcctcc tccgcgggta ccagcagcac
      421 gcatacgacg gccgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg
      481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg gtggtgatgc agagcattac
      541 aaggcctacc tagagggcac gtgcgtggag tggctcctca gatacctgga gctcgggaac
      601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaga
      661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc
      721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca
      781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctcttgggaa ggagcagaat
      841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct
      901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg
      961 gccattactg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa
     1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt
     1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc
     1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc
     1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc
     1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga
     1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag
     1381 aataagaatt tgaatgtggg tggctggaga gatgg
//
LOCUS       HUMCETP      1787 bp ss-mRNA            PRI       21-JUL-1990
DEFINITION  Human cholesteryl ester transfer protein mRNA, complete cds.
ACCESSION   M30185
KEYWORDS    cholesteryl ester transfer protein; transfer protein.
SOURCE      Human adult liver, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1787)
  AUTHORS   Drayna,D., Jarnagin,A.S., McLean,J., Henzel,W., Kohr,W.,
            Fielding,C. and Lawn,R.
  TITLE     Cloning and sequencing of human cholesteryl ester transfer protein
            cDNA
  JOURNAL   Nature 327, 632-634 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        131     1612     cholesteryl ester transfer protein precursor
                                 /nomgen="CETP" /map="16q21"
                                 /hgml_locus_uid="LF0130L"
    sigp        131      181     cholesteryl ester transfer protein signal
                                 peptide
    matp        182     1609     cholesteryl ester transfer protein
    mRNA    <     1     1787     CETP mRNA
BASE COUNT      397 a    531 c    456 g    403 t
ORIGIN      
        1 gtgaatctct ggggccagga agaccctgct gcccggaaga gcctcatgtt ccgtgggggc
       61 tgggcggaca tacatatacg ggctccaggc tgaacggctc gggccactta cacaccactg
      121 cctgataacc atgctggctg ccacagtcct gaccctggcc ctgctgggca atgcccatgc
      181 ctgctccaaa ggcacctcgc acgaggcagg catcgtgtgc cgcatcacca agcctgccct
      241 cctggtgttg aaccacgaga ctgccaaggt gatccagacc gccttccagc gagccagcta
      301 cccagatatc acgggcgaga aggccatgat gctccttggc caagtcaagt atgggttgca
      361 caacatccag atcagccact tgtccatcgc cagcagccag gtggagctgg tggaagccaa
      421 gtccattgat gtctccattc agaacgtgtc tgtggtcttc aaggggaccc tgaagtatgg
      481 ctacaccact gcctggtggc tgggtattga tcagtccatt gacttcgaga tcgactctgc
      541 cattgacctc cagatcaaca cacagctgac ctgtgactct ggtagagtgc ggaccgatgc
      601 ccctgactgc tacctgtctt tccataagct gctcctgcat ctccaagggg agcgagagcc
      661 tgggtggatc aagcagctgt tcacaaattt catctccttc accctgaagc tggtcctgaa
      721 gggacagatc tgcaaagaga tcaacgtcat ctctaacatc atggccgatt ttgtccagac
      781 aagggctgcc agcatccttt cagatggaga cattggggtg gacatttccc tgacaggtga
      841 tcccgtcatc acagcctcct acctggagtc ccatcacaag ggtcatttca tctacaagaa
      901 tgtctcagag gacctccccc tccccacctt ctcgcccaca ctgctggggg actcccgcat
      961 gctgtacttc tggttctctg agcgagtctt ccactcgctg gccaaggtag ctttccagga
     1021 tggccgcctc atgctcagcc tgatgggaga cgagttcaag gcagtgctgg agacctgggg
     1081 cttcaacacc aaccaggaaa tcttccaaga ggttgtcggc ggcttcccca gccaggccca
     1141 agtcaccgtc cactgcctca agatgcccaa gatctcctgc caaaacaagg gagtcgtggt
     1201 caattcttca gtgatggtga aattcctctt tccacgccca gaccagcaac attctgtagc
     1261 ttacacattt gaagaggata tcgtgactac cgtccaggcc tcctattcta agaaaaagct
     1321 cttcttaagc ctcttggatt tccagattac accaaagact gtttccaact tgactgagag
     1381 cagctccgag tccatccaga gcttcctgca gtcaatgatc accgctgtgg gcatccctga
     1441 ggtcatgtct cggctcgagg tagtgtttac agccctcatg aacagcaaag gcgtgagcct
     1501 cttcgacatc atcaaccctg agattatcac tcgagatggc ttcctgctgc tgcagatgga
     1561 ctttggcttc cctgagcacc tgctggtgga tttcctccag agcttgagct agaagtctcc
     1621 aaggaggtcg ggatggggct tgtagcagaa ggcaagcacc aggctcacag ctggaaccct
     1681 ggtgtctcct ccagcgtggt ggaagttggg ttaggagtac ggagatggag attggctccc
     1741 aactcctccc tatcctaaag gcccactggc attaaagtgc tgtatcc
//
LOCUS       HUMGHRA04     289 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human growth hormone receptor gene, exon 5.
ACCESSION   M28461 M26401
KEYWORDS    growth hormone binding-protein; growth hormone receptor;
            transmembrane protein.
SEGMENT     4 of 10
SOURCE      Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 289)
  AUTHORS   Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
            Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
  TITLE     Characterization of the human growth hormone receptor gene and the
            demonstration of a partial gene deletion in two patients with
            Laron-type dwarfism
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by W.I.Wood, 02-AUG-1989.
FEATURES       from  to/span     description
    pept    +    59  +   231     growth hormone receptor, exon 5 /nomgen="GHR"
                                 /map="5p13-p12" /hgml_locus_uid="LA0219G"
    pre-msg <     1  >   289     growth hormone receptor mRNA and introns
    IVS     <     1  >    58     GHR Intron D
    IVS         232  >   289     GHR Intron E
BASE COUNT       90 a     45 c     55 g     99 t
ORIGIN      About 6 kb after segment 3.
        1 acttaagcta caacatgatt tttggaacaa ttaatctttt tttaaccctt cattttagga
       61 acactcaaga atggactcaa gaatggaaag aatgccctga ttatgtttct gctggggaaa
      121 acagctgtta ctttaattca tcgtttacct ccatctggat accttattgt atcaagctaa
      181 ctagcaatgg tggtacagtg gatgaaaagt gtttctctgt tgatgaaata ggtaaatcac
      241 aggtttttgt ttcatttgac atagttttag actaaataaa tggggaagc
//
LOCUS       HUMGHRA06     362 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human growth hormone receptor gene, exon 7.
ACCESSION   M28463 M26401
KEYWORDS    growth hormone binding-protein; growth hormone receptor;
            transmembrane protein.
SEGMENT     6 of 10
SOURCE      Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 362)
  AUTHORS   Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
            Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
  TITLE     Characterization of the human growth hormone receptor gene and the
            demonstration of a partial gene deletion in two patients with
            Laron-type dwarfism
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by W.I.Wood, 02-AUG-1989.
FEATURES       from  to/span     description
    pept    +    75  +   240     growth hormone receptor, exon 7 /nomgen="GHR"
                                 /map="5p13-p12" /hgml_locus_uid="LA0219G"
    pre-msg <     1  >   362     growth hormone receptor mRNA and introns
    IVS     <     1       74     GHR Intron F
    IVS         241  >   362     GHR Intron G
BASE COUNT      117 a     58 c     76 g    111 t
ORIGIN      Undetermined number of bp after segment 5.
        1 atacctgtag tgttcattgg cattgagttg ttgactcttt ggccaatatg gcgtttatat
       61 ttttgtcttg aaagatggac cctatattga caacatcagt tccagtgtac tcattgaaag
      121 tggataagga atatgaagtg cgtgtgagat ccaaacaacg aaactctgga aattatggcg
      181 agttcagtga ggtgctctat gtaacacttc ctcagatgag ccaatttaca tgtgaagaag
      241 gtaaaagaaa taaaagatta aaatagtagc taacctggct tttgtcaata taacagttga
      301 ttcacccctg cactggtagt gtgttgtcca aatcaaaata tattaacatc agatatcagg
      361 at
//
LOCUS       HUMGHRA07     212 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human growth hormone receptor gene, exon 8.
ACCESSION   M28464 M26401
KEYWORDS    growth hormone binding-protein; growth hormone receptor;
            transmembrane protein.
SEGMENT     7 of 10
SOURCE      Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
            Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
  TITLE     Characterization of the human growth hormone receptor gene and the
            demonstration of a partial gene deletion in two patients with
            Laron-type dwarfism
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by W.I.Wood, 02-AUG-1989.
FEATURES       from  to/span     description
    pept    +    59  +   149     growth hormone receptor, exon 8 /nomgen="GHR"
                                 /map="5p13-p12" /hgml_locus_uid="LA0219G"
    pre-msg <     1  >   212     growth hormone receptor mRNA and introns
    IVS     <     1       58     GHR Intron G
    IVS         150  >   212     GHR Intron H
BASE COUNT       53 a     33 c     43 g     83 t
ORIGIN      Undetermined number of bp after segment 6.
        1 gaaactgtgc ttcaactagt cgtaattctg aaagcgaaat attcttgtgt gtttgcagat
       61 ttctactttc catggctctt aattattatc tttggaatat ttgggctaac agtgatgcta
      121 tttgtattct tattttctaa acagcaaagg taggatgtag gaaggtagta ttctttggta
      181 ccttctgtac cagttgtgtt agaccttgcc at
//
LOCUS       HUMGHRA08     198 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human growth hormone receptor gene, exon 9.
ACCESSION   M28465 M26401
KEYWORDS    growth hormone binding-protein; growth hormone receptor;
            transmembrane protein.
SEGMENT     8 of 10
SOURCE      Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 198)
  AUTHORS   Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
            Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
  TITLE     Characterization of the human growth hormone receptor gene and the
            demonstration of a partial gene deletion in two patients with
            Laron-type dwarfism
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by W.I.Wood, 02-AUG-1989.
FEATURES       from  to/span     description
    pept    +    63  +   132     growth hormone receptor, exon 9 /nomgen="GHR"
                                 /map="5p13-p12" /hgml_locus_uid="LA0219G"
    pre-msg <     1  >   198     growth hormone receptor mRNA and introns
    IVS     <     1       62     GHR Intron H
    IVS         133  >   198     GHR Intron I
BASE COUNT       68 a     35 c     30 g     65 t
ORIGIN      About 4 kb after segment 7.
        1 gctataattg agaatatgta gcttttaaga tgtcaaaacc aaaattttta tatgttttca
       61 aggattaaaa tgctgattct gcccccagtt ccagttccaa agattaaagg aatcgatcca
      121 gatctcctca aggtaactaa taattttatc taaagttgta gctagtacta attaacacct
      181 gaagactcct gtcatatg
//
LOCUS       HUMGHRA09    1881 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human growth hormone receptor gene, exon 10.
ACCESSION   M28466 M26401
KEYWORDS    growth hormone binding-protein; growth hormone receptor;
            transmembrane protein.
SEGMENT     9 of 10
SOURCE      Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1881)
  AUTHORS   Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R.,
            Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I.
  TITLE     Characterization of the human growth hormone receptor gene and the
            demonstration of a partial gene deletion in two patients with
            Laron-type dwarfism
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by W.I.Wood, 02-AUG-1989.
FEATURES       from  to/span     description
    pept    +    64     1035     growth hormone receptor, exon 10 /nomgen="GHR"
                                 /map="5p13-p12" /hgml_locus_uid="LA0219G"
    pre-msg <     1  >  1881     growth hormone receptor mRNA and introns
    IVS     <     1       63     GHR Intron I
BASE COUNT      607 a    366 c    362 g    546 t
ORIGIN      About 0.5 kb after segment 8.
        1 gctaattcat ttaattatta tgagtttctt ttcatagatc ttcattttct ttctattttc
       61 taggaaggaa aattagagga ggtgaacaca atcttagcca ttcatgatag ctataaaccc
      121 gaattccaca gtgatgactc ttgggttgaa tttattgagc tagatattga tgagccagat
      181 gaaaagactg aggaatcaga cacagacaga cttctaagca gtgaccatga gaaatcacat
      241 agtaacctag gggtgaagga tggcgactct ggacgtacca gctgttgtga acctgacatt
      301 ctggagactg atttcaatgc caatgacata catgagggta cctcagaggt tgctcagcca
      361 cagaggttaa aaggggaagc agatctctta tgccttgacc agaagaatca aaataactca
      421 ccttatcatg atgcttgccc tgctactcag cagcccagtg ttatccaagc agagaaaaac
      481 aaaccacaac cacttcctac tgaaggagct gagtcaactc accaagctgc ccatattcag
      541 ctaagcaatc caagttcact gtcaaacatc gacttttatg cccaggtgag cgacattaca
      601 ccagcaggta gtgtggtcct ttccccgggc caaaagaata aggcagggat gtcccaatgt
      661 gacatgcacc cggaaatggt ctcactctgc caagaaaact tccttatgga caatgcctac
      721 ttctgtgagg cagatgccaa aaagtgcctc cctgtggctc ctcacatcaa ggttgaatca
      781 cacatacagc caagcttaaa ccaagaggac atttacatca ccacagaaag ccttaccact
      841 gctgctggga ggcctgggac aggagaacat gttccaggtt ctgagatgcc tgtcccagac
      901 tatacctcca ttcatatagt acagtcccca cagggcctca tactcaatgc gactgccttg
      961 cccttgcctg acaaagagtt tctctcatca tgtggctatg tgagcacaga ccaactgaac
     1021 aaaatcatgc cttagccttt ctttggtttc ccaagagcta cgtatttaat agcaaagaat
     1081 tgactggggc aataacgttt aagccaaaac aatgtttaaa ccttttttgg gggagtgaca
     1141 ggatggggta tggattctaa aatgcctttt cccaaaatgt tgaaatatga tgttaaaaaa
     1201 ataagaagaa tgcttaatca gatagatatt cctattgtgc aatgtaaata ttttaaagaa
     1261 ttgtgtcaga ctgtttagta gcagtgattg tcttaatatt gtgggtgtta atttttgata
     1321 ctaagcattg aatgactatg tttttaatgt atagtaaatc acgctttttg aaaaagcgaa
     1381 aaaatcaggt ggcttttgcg gttcaggaaa attgaatgca aaccatagca caggctaatt
     1441 ttttgttgtt tcttaaataa gaaacttttt tatttaaaaa actaaaaact agaggtgaga
     1501 aatttaaact ataagcaaga aggcaaaaat agtttggata tgtaaaacat ttattttgac
     1561 ataaagttga taaagatatt ttttaataat ttagacttca agcatggcta ttttatatta
     1621 cactacacac tgtgtactgc agttggtatg acccctctaa ggagtgtagc aactacagtc
     1681 taaagctggt ttaatgtttt ggccaatgca cctaaagaaa aacaaactcg ttttttacaa
     1741 agccctttta tacctcccca gactccttca acaattctaa aatgattgta gtaatctgca
     1801 ttattggaat ataattgttt tatctgaatt tttaaacaag tatttgttaa tttagaaaac
     1861 tttaaagcgt ttgcacagat c
//
LOCUS       HUMMHDQAR1    426 bp ds-DNA             PRI       21-JUL-1990
DEFINITION  Human MHC class II HLA-DQ-alpha chain gene, exon 1.
ACCESSION   M29613
KEYWORDS    cell surface protein; class II gene; integral membrane protein;
            major histocompatibility complex.
SEGMENT     1 of 2
SOURCE      Human (haplotype HLA-D/DR) adult male DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 426)
  AUTHORS   Jonsson,A.-K., Hyldig-Nielsen,J.-J., Servenius,B., Larhammar,D.,
            Andersson,G., Joergensen,F., Peterson,P.A. and Rask,L.
  TITLE     Class II genes of the human major histocompatibility complex:
            Comparisons of the DQ and DX alpha and beta genes
  JOURNAL   J. Biol. Chem. 262, 8767-8777 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        173  +   254     MHC HLA-DQ-alpha chain precursor, exon 1
                                 /nomgen="HLA-DQA1" /map="6p21.3"
                                 /hgml_locus_uid="LY0151Z"
    sigp        173      241     MHC HLA-DQ-alpha chain signal peptide
    matp        242  +   254     MHC HLA-DQ-alpha chain
    IVS         255  >   426     MHC HLA-DQ-alpha, intron A
BASE COUNT      120 a     90 c    108 g    108 t
ORIGIN      
        1 atacaaactc ttcagctagt aactgagatg tcaccatggg ggatttttct aattggccaa
       61 aacctgacct ggcagggttt ggtttgggtg tcttcagatt tccttgtctt gaggccctca
      121 caattgctct acagctcaga acagcaactg ctgaggctgc cttgggaaga ggatgatcct
      181 aaacaaagct ctgatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg
      241 tgaagacatt gtgggtgagt gcatgagtga ggaatgttct ctggagctga aaaacagtaa
      301 attgaaggaa aagagagaaa gcgatttgca gagaaattgt agagattcca agaccccttc
      361 agtattaaga gaattaaaaa ttatagctgt tcctccttca ggaaaccaga gccccaacca
      421 ctcttt
//
LOCUS       SOYP34A      1287 bp ss-mRNA            PLN       21-JUL-1990
DEFINITION  Soybean oil bodies-associated protein P34 mRNA, complete cds.
ACCESSION   J05560
KEYWORDS    oil bodies-associated protein.
SOURCE      Soybean (strain Century) seed, cDNA to mRNA, clone P34/1350.
  ORGANISM  Glycine max
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 1287)
  AUTHORS   Kalinski,A.J., Weiseman,J.M., Matthews,B.F. and Herman,E.M.
  TITLE     Molecular cloning of a protein associated with soybean oil bodies
            which is similar to thiol proteases of the papain family
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.M.Herman, 18-MAY-1990.
FEATURES       from  to/span     description
    pept          3     1142     oil bodies-associated protein P34 precursor
    sigp          3      368     oil bodies-associated protein P34 signal
                                 peptide
    matp        369     1139     protein P34
    mRNA    <     1     1287     P34 protein mRNA
BASE COUNT      407 a    240 c    297 g    343 t
ORIGIN      
        1 ttatgggttt ccttgtgttg cttcttttct ccctcttagg tctctcttct agttccagca
       61 tatcaactca tcgttccata ttggaccttg acctaaccaa gtttaccaca cagaaacagg
      121 tgtcttcact gttccaacta tggaagagtg agcatggacg tgtctaccat aaccacgaag
      181 aagaggcaaa gagacttgag attttcaaga ataactcgaa ctatatcagg gacatgaatg
      241 caaacagaaa atcaccccat tctcatcgtt taggattgaa caagtttgct gacatcactc
      301 ctcaagagtt cagcaaaaag tacttgcaag ctcccaagga tgtgtcgcag caaatcaaaa
      361 tggccaacaa gaaaatgaag aaggaacaat attcttgtga ccatccacct gcatcatggg
      421 attggaggaa aaaaggtgtc atcacccaag taaagtacca agggggctgt ggaaggggtt
      481 gggcgttttc tgccacggga gccatagaac cagcacatgc aatagcaaca ggagaccttg
      541 ttagcctttc tgaacaagaa ctcgtagact gtgtggaaga aagcgaaggt tgttacaatg
      601 gatggcacta tcaatcgttc gaatgggttt tagaacatgg tggtattgcc actgatgatg
      661 attatcctta cagagctaaa gagggtagat gcaaagccaa taagatacaa gacaaggtta
      721 caattgacgg atatgaaact gtaataatgt cagatgagag tacagaatca gagacagagc
      781 aagcgttctt aagcgccatc cttgagcaac caattagtgt ctcaattgat gcaaaagatt
      841 ttcatttata caccggggga atttatgatg gagaaaactg tacaagtccg tatgggatta
      901 atcactttgt tttacttgtg ggttatggtt cagcggatgg tgtagattac tggatagcga
      961 aaaattcatg gggagaagat tggggagaag atggttacat ttggatccaa agaaacacgg
     1021 gtaatttatt aggagtgtgt gggatgaatt atttcgcttc atacccaacc aaagaggaat
     1081 cagaaacact ggtgtctgct cgcgttaaag gtcatcgaag agttgatcac tctcctcttt
     1141 gaagccgtaa aggttcaata caacgagtgc ttgttttctt agggacaagc attgtactta
     1201 tgtatgattc tgtgtaacca tgagtctcca cgttgtacta atgtgaaggg caaaaataaa
     1261 acacacaaca agttcgtttt tctcaat
//
LOCUS       XELRPS22A     531 bp ss-mRNA            VRT       21-JUL-1990
DEFINITION  X.laevis ribosomal protein S22, 40S subunit mRNA, complete cds.
ACCESSION   M34706
KEYWORDS    nucleolar protein; ribosomal protein; ribosomal protein S22.
SOURCE      X.laevis stage 17 embryo, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 531)
  AUTHORS   Keiper,B.D. and Wormington,M.
  TITLE     Nucleotide sequence and 40S subunit assembly of Xenopus laevis
            ribosomal protein S22
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Wormington, 18-MAY-1990.
            
               Author Address [1]: M.Wormington
                                   University of Virginia
                                   Department of Biology
                                   Gilmer Hall
                                   Charlottesville, VA 22901
FEATURES       from  to/span     description
    pept        131      490     ribosomal protein S22, 40S subunit
    mRNA    <     1      531     ribosomal protein S22 mRNA
BASE COUNT      158 a    117 c    124 g    132 t
ORIGIN      
        1 tgtttaataa aagaaagtaa tagtaacttc aacctctaca cgtggctgtt cctgactcac
       61 cgctgttcgc tcggcaagca tcggatcgca ccgtcggacg aataagtcgg tcagggaagt
      121 gataagaaaa atggctttta aagacccagg caaagcccct gtagaccagg aagtggccat
      181 ccatcgtatc aggattactt taacaagtcg taatgtgaag tctctggaaa aagtgtgtgc
      241 tgatctgatc cgtggtgcca aagagaagaa cctgaaggtt aagggcccag tccgtatgcc
      301 taccaagact cttcgtatca caaccagaaa aacaccttgc ggtgagggtt ccaaaacctg
      361 ggatcgtttc cagatgcgca tccacaagcg cctgattgac ctgcacagtc cttccgagat
      421 tgttaagcag atcacttcca tcagtatcga acctggtgta gaagttgaag ttactattgc
      481 tgatgcataa atgacacttc tgtttaataa aagaaagtaa tgtggattgt t
//
LOCUS       HUMSSBLA1      81 bp ss-mRNA            PRI       21-JUL-1990
DEFINITION  Human SS-B/La protein mRNA, partial cds.
ACCESSION   M35261
KEYWORDS    SS-B/La antigen; SS-B/La protein; autoantigen;
            small nuclear ribonucleoprotein.
SEGMENT     1 of 3
SOURCE      Human fibroblast, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 81)
  AUTHORS   Kohsaka,H., Yamamoto,K., Fujii,H., Miura,H., Miyasaka,N.,
            Nishioka,K. and Miyamoto,T.
  TITLE     Fine epitope mapping the human SS-B/La protein: Identification of a
            distinct autoepitope homologous to a viral gag polyprotein
  JOURNAL   J. Clin. Invest. 85, 1566-1574 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >    81     SS-B/La protein (AA at 1)
BASE COUNT       36 a     14 c     15 g     16 t
ORIGIN      
        1 gaaatcagtg aagataaaac taaaatcaga aggtctccaa gcaaacccct acctgaagtg
       61 actgatgagt ataaaaatga t
//
LOCUS       HUMSSBLA2     155 bp ss-mRNA            PRI       21-JUL-1990
DEFINITION  Human SS-B/La protein mRNA, partial cds.
ACCESSION   M35263
KEYWORDS    SS-B/La antigen; SS-B/La protein; autoantigen;
            small nuclear ribonucleoprotein.
SEGMENT     2 of 3
SOURCE      Human fibroblast, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 155)
  AUTHORS   Kohsaka,H., Yamamoto,K., Fujii,H., Miura,H., Miyasaka,N.,
            Nishioka,K. and Miyamoto,T.
  TITLE     Fine epitope mapping the human SS-B/La protein: Identification of a
            distinct autoepitope homologous to a viral gag polyprotein
  JOURNAL   J. Clin. Invest. 85, 1566-1574 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   155     SS-B/La protein (AA at 3)
BASE COUNT       81 a     19 c     30 g     25 t
ORIGIN      About 200 base pairs after segment 1.
        1 agaagtacaa agaaacagac ctgctaatac ttttcaagga cgattacttt gccaaaaaaa
       61 atgaagaaag aaaacaaaat aaagtggaag ctaaattaag agctaaacag gagcaagaag
      121 caaaacaaaa gttagaagaa gatgctgaaa tgaaa
//
LOCUS       HUMSSBLA3     192 bp ss-mRNA            PRI       21-JUL-1990
DEFINITION  Human SS-B/La protein mRNA, partial cds.
ACCESSION   M35262
KEYWORDS    SS-B/La antigen; SS-B/La protein; autoantigen;
            small nuclear ribonucleoprotein.
SEGMENT     3 of 3
SOURCE      Human fibroblast, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 192)
  AUTHORS   Kohsaka,H., Yamamoto,K., Fujii,H., Miura,H., Miyasaka,N.,
            Nishioka,K. and Miyamoto,T.
  TITLE     Fine epitope mapping the human SS-B/La protein: Identification of a
            distinct autoepitope homologous to a viral gag polyprotein
  JOURNAL   J. Clin. Invest. 85, 1566-1574 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   192     SS-B/La protein (AA at 1)
BASE COUNT       90 a     23 c     50 g     29 t
ORIGIN      About 170 base pairs after segment 2.
        1 gccaaggaag cattgggtaa agccaaagat gcaaataatg gtaacctaca attaaggaac
       61 aaagaagtga cttgggaagt actagaagga gaggtggaaa aagaagcact gaagaaaata
      121 atagaagacc aacaagaatc cctaaacaaa tggaagtcaa aaggtcgtag atttaaagga
      181 aaaggaaagg gt
//
LOCUS       RATPPTA1     1757 bp ds-DNA             ROD       21-JUL-1990
DEFINITION  Rat tachykinin (PPT) gene, exons 1 and 2.
ACCESSION   M34159
KEYWORDS    alternative splicing; neurokinin A; neuropeptide K;
            neuropeptide gamma; substance P; tachykinin.
SEGMENT     1 of 4
SOURCE      Rat (strain Sprague-Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1757)
  AUTHORS   Carter,M.S. and Krause,J.E.
  TITLE     Structure, expression and some regulatory mechanisms of the rat
            preprotachykinin gene encoding substance P, neurokinin A,
            neuropeptide K, and neuropeptide gamma
  JOURNAL   J. Neurosci. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Krause, 10-MAY-1990.
FEATURES       from  to/span     description
    pept       1400  +  1522     beta-preprotachykinin (PPT), exon 2 (first
                                 expressed exon)
    sigp       1400     1456     beta-tachykinin signal peptide (1456 could be
                                 1471)
    pep$       1400  +  1522     gamma-preprotachykinin (PPT), exon 2 (first
                                 expressed exon)
    sigp       1400     1456     gamma-tachykinin signal peptide (1456 could be
                                 1471)
    pep$       1400  +  1522     alpha-preprotachykinin (PPT), exon 2 (first
                                 expressed exon)
    sigp       1400     1456     alpha-tachykinin signal peptide (1456 could be
                                 1471)
    pre-msg     866  >  1757     PPT mRNA and introns
    IVS         966     1390     PPT intron A
    IVS        1523  >  1757     PPT intron B
BASE COUNT      411 a    443 c    470 g    433 t
ORIGIN      
        1 ctgcagagct ccaaaggtaa gcatccagcc tttctagtcc cccaacaagg ctaaagggga
       61 gagaggcaca attatcctct tcccacccct tctgccttca gggtgtgcct gggaagaagc
      121 tgtaggggaa caaaagatgc cttagaatgg ctgatgggta agttctacat gagaaaggag
      181 gtttaaattc ctctttcccc taaatgtaaa acaaacctgc cttcatcctc tgaagcggga
      241 gaccggaaac acttttgcag tgctagagaa atgagaatat tctgactgat ttggtgggga
      301 ggggggttgg gggggtgtgt tccagcccta gatataacac ctcataaacc ttaagacaca
      361 taaagtagaa atgaaaggaa aaccccgctt gcttcatccc tctgaagtgc ttgctggtgt
      421 cttagtatta ttcacaaggt tttgctgctc aagttatttg gctgtcctca aagcgcaata
      481 ttccctgatg cctcttgaga gaaaagttcc ctaagtccga agcatgagtc acttcgctca
      541 gttttgatga gtaatctcag gtgtcactga accttgttcg gaagaagagg ggaggggggc
      601 gtcagatttg cagacggaag aaaacaggtc tctctggatt ggatggcgag acctcgactt
      661 ccctaaaatt gcgtcatttc gaacccaatt tggtccagat gttatggact ccgacgggtt
      721 accgtctcgg aaactctatc acgcaagcaa aaggcgaggg ggcggctaat taaatattga
      781 gcagaaagtc gcgtggggag agtgtcacgt ggctctccag gctcatcacg cctgagataa
      841 ataaggcgaa gcaggagcag ggactagagc gcactcggac cagctccact ccagcaccgc
      901 ggcggaggag agcgaggagc gcccagcaag tgcgcacctg cggagcatca ccgggtccga
      961 ccgcagtgag taccactccc gcctgccatc tgccgctgcc tggggcccgt cgtgccttgg
     1021 tcagaggtgg cgtccgcccg gggtttcacc tgcagcagcc acagacccgg agagaaactg
     1081 tgcaaatacc aacacctctc tttgtctgcc ttcaggcttc ggagtgtggg tcagtgggta
     1141 gggggctggg acgttgagag gcaaagagag gaggacttga ggctctttgg cacgtcagta
     1201 gccttcttaa agggtttgga gaaatgttcg acctggtggg tttttgtttt tgttttaatc
     1261 ttggtttagc gagacctctt ctttcttcct tgtggcattg aaattcagct gctctaacgt
     1321 cttgggtggc cccccgccta catatgtata acgattcttt ctctccttct ctgtctccta
     1381 ctctctccag aaatccaaca tgaaaatcct cgtggcggtg gcggtctttt ttctcgtttc
     1441 cactcaactg tttgcagagg aaatcggtgc caacgatgat ctaaattatt ggtccgactg
     1501 gtccgacagt gaccaaatca aggtgagacc aaccgggcgt gggagctgtt gccgtcccct
     1561 tgtggtagca ccctgactag cctagctgga gcctgagagg ctcattggat tgtaagggcg
     1621 tcatacccgg acaaagcaca aggcttattg cccagaacta agtttctccc tgagggagga
     1681 aaatgtcctg ggttcagtgt ctccccaaca gaaaaggagt tcttctttcc ttcctccctc
     1741 cctccctccc tccctcc
//
LOCUS       RATPPTA2      875 bp ds-DNA             ROD       21-JUL-1990
DEFINITION  Rat tachykinin (PPT) gene, exons 3 and 4.
ACCESSION   M34160
KEYWORDS    neurokinin A; neuropeptide K; neuropeptide gamma; substance P;
            tachykinin.
SEGMENT     2 of 4
SOURCE      Rat (strain Sprague-Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 875)
  AUTHORS   Carter,M.S. and Krause,J.E.
  TITLE     Structure, expression and some regulatory mechanisms of the rat
            preprotachykinin gene encoding substance P, neurokinin A,
            neuropeptide K, and neuropeptide gamma
  JOURNAL   J. Neurosci. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Krause, 10-MAY-1990.
FEATURES       from  to/span     description
    pept    +   187      283     beta-preprotachykinin (PPT), exon 3
                722  +   766     beta-preprotachykinin, exon 4
    matp        235      267     substance P
    pep$    +   187  +   283     gamma-preprotachykinin (PPT), exon 3
    matp        235      267     substance P
    pep$    +   187      283     alpha-preprotachykinin (PPT), exon 3
                722  +   766     alpha-preprotachykinin, exon 4
    matp        235      267     substance P
    pre-msg <     1  >   875     PPT mRNA and introns
    IVS     <     1      186     PPT intron B
    IVS         284      721     PPT intron C
    IVS         767  >   875     PPT intron D
BASE COUNT      213 a    191 c    219 g    252 t
ORIGIN      
        1 ccctgataga ctttggggga gaaggaaatc tcaggcgggt ccgggttggg gaaagtgctg
       61 catagattct agtatggccc ggtctcatgc tagcggctgc ttctgtggct ctgaggaaag
      121 ctcaggcaca ccgtgaatgt ggaccttgtt cacttaattt gtctactcgc tggtttgtcc
      181 ctccaggagg caatgccgga gccctttgag catcttcttc agagaatcgc ccgaagaccc
      241 aagcctcagc agttctttgg attaatgggc aaacgggatg ctggtgagat gagcaatagt
      301 cactaggtct atcaggctac ccggtctgca tgcctgctcc ttcctggagc acccaagtgt
      361 ctcttgtacc gcgagtataa ttttacatgc ctaaagactt ttgattccct gaagtcttcg
      421 atccactaac acttgcacta gctcaaagtg gaggggaggg gggatccatt tcccttgctt
      481 cacagcacca aggtgaccca aattggaaac taaccttagc taaacactcc cctgactaaa
      541 gatcccaatt acaagggtcc tcaggtatgc taatggtcca ggctcaggtg aaagatgttt
      601 tcttggattg ggttagattt ggctatacct cgctacagtg ataggaattt tcaattgcct
      661 tgagttgatg gcacagtcag tggagcttgt gaagagttaa tgtcaattca tctcttgtca
      721 gattcctcaa ttgaaaaaca agtggccctg ttaaaggctc tttatggtaa acatccctat
      781 aatcttattt cctatatgat gatcatgtaa aagtatttga tggtataatc agacttgcag
      841 tgtgcctata gagagtgtgt gtgtgtgtgt gtgtg
//
LOCUS       RATPPTA3     2503 bp ds-DNA             ROD       21-JUL-1990
DEFINITION  Rat tachykinin (PPT) gene, exons 5 and 6.
ACCESSION   M34161
KEYWORDS    neurokinin A; neuropeptide K; neuropeptide gamma; substance P;
            tachykinin.
SEGMENT     3 of 4
SOURCE      Rat (strain Sprague-Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2503)
  AUTHORS   Carter,M.S. and Krause,J.E.
  TITLE     Structure, expression and some regulatory mechanisms of the rat
            preprotachykinin gene encoding substance P, neurokinin A,
            neuropeptide K, and neuropeptide gamma
  JOURNAL   J. Neurosci. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Krause, 10-MAY-1990.
FEATURES       from  to/span     description
    pept    +   140      163     beta-preprotachykinin (PPT), exon 5
               1493  +  1546     beta-preprotachykinin, exon 6
    matp       1495     1524     neurokinin
    pep$    +   140      163     gamma-preprotachykinin (PPT), exon 5
               1493  +  1546     gamma-preprotachykinin, exon 6
    matp       1495     1524     neurokinin
    pep$    +   140  +   163     alpha-preprotachykinin (PPT), exon 5
    pre-msg <     1  >  2503     PPT mRNA and introns
    IVS     <     1      139     PPT intron D
    IVS         164     1492     PPT intron E
    IVS        1547  >  2053     PPT intron F
BASE COUNT      861 a    454 c    428 g    760 t
ORIGIN      
        1 catctacttt agtcctcaga acagcacata aataagcgtt aaatatttcg aaaatcaact
       61 ccttaagttc tctggaaaag ataaaatagc tttaaaaaag aaacccatgt atctgctaac
      121 tggctctctt ttcttctagg gcatggtcag atctctcaca aaagtaagtt cacaatcatt
      181 tgacatttat caaatgctaa tgttaatgat agtgaatatt gcatcctgtg ttaccttttg
      241 cagcacaaca tagagcccaa tggttttgat tgagataatg tagggaaaga aagacagaat
      301 gtgatttgct gaatgatcac tgactagact ctttatgtag acagattacg taccaaatgc
      361 acattacatt gagagaaaag atacactata gtttttagtt tatattctct ctagcgtaat
      421 cacatgagca ctttccagcc tgtccagaag acgagagaga gagagagaga gagagagaga
      481 gagagagaga gagagaacac ttatatacac cttctaattc atattcatga actacctttt
      541 ataatggcag gaaggggatt aagtagactg ggtgaatatt tgagtagaaa atgcactcga
      601 aatgtgatag agatacaggt ttattttccc ccttgatccc ataattgaat gggaacctgg
      661 gagagaagag gtgcagattg cagaactgga gggtccttag agctatttcc tagttaattg
      721 gatccagagg agacaggaag tcccttgggc ccctggaaat gttgcctgct tctatgaggg
      781 tcatgaattg gactaaccct ggcagggtac accccaaacc attaacactg tggtctcaac
      841 taacacaatg gaaactaaga aaaatcctcc ccgttgcagt atgataataa ccttccccaa
      901 agccttcatc tttggggaaa aaaaatgaca tttactaagg ggttaaattt taaagaaatc
      961 ttatacagca aaaaaagtaa taaaatgaat gtatcttatt aatgttaaaa aacaaattgc
     1021 tttcaggaac agtattttta agtctaaatc tactaaatgt gaccccttag aatattccaa
     1081 tactaaggat ctgacttaaa gttcccttgt tgagaaatgc aacatatttg gtaataaaga
     1141 cataaacgtg gatgtctttt cccacacaaa aacatataat taaagtcccc caaaattcaa
     1201 agctactcca gtggcaacca tatttaaaac ttcataatag ataaaacata atacagataa
     1261 aagattctgc taatgtctag ataaagaact gagaaaagta ctcacaatat gttgttgcag
     1321 tgtggtttgc aaggcttata taagtctttg tgggttcacc agtgggtgac atagaaacat
     1381 tcatttattt attctacagg catatattca gaagcttgtt aactttgtac tagttatgag
     1441 tttatttctt caaaaacata catactaaaa tacccctaaa tgtattttcc agggcataaa
     1501 acagattcct ttgttggact aatgggcaaa agagctttaa attctggtat gtataaaatc
     1561 atgtctgaaa atatgtaaat caacttctgt aatttttacg gaaagtaaca attttcaaca
     1621 tactaaaaaa aaaaaaagga gccaatagac ttaacatgag ccatttctgt agacaaattt
     1681 tcataggagc gtttgtagcc aagagtgggc agaaagctga ttaagctgat accaccacaa
     1741 tgtaagtata tttctctcaa agtgccattc tgtccagcat ctgccccact tagtcactac
     1801 atatcctgaa tctttggtct agtctgacac tctttttcac agatattctt aataaaatgt
     1861 cctggaccct tagagagaga tgagagcttt tttacttaca aactgaagtg atgtggtact
     1921 ttcatcagac taaccattta gtgattttct actagaaaaa taaatagata atgacaagca
     1981 aaatgtttct aaaattactg tctctctctg aagttatgga agctcaaaat ccctgtctta
     2041 atgggtggca aagaggctga gtgtagtggt gtccattcta acgccttcaa accattcttc
     2101 ctgttccaga tttagaccta catttaaaag taccacacgc cccaaaccca ttttaaacct
     2161 aagcgggatt ttccttggcc cagctgcata atctctcatt tatcctgctt gaactttcta
     2221 cataaatgcc acagcatatc tctaaagccc ctcactgttc acctcattta atcaattata
     2281 ttcaatgtta aataatccca ttattgtttt catgaaaatt ggctatcagt gaaaatgggt
     2341 taaaatgata aacacctgat tagaaattaa ccagcatctg tttatgatat ttagttcaca
     2401 gaaaatttat aagatgtgct ggcttaattt ttaaatatct gactcatccc caaaataggt
     2461 cactctatag gtatcttaat caatgtagac atattttata ttt
//
LOCUS       RATPPTA4      779 bp ds-DNA             ROD       21-JUL-1990
DEFINITION  Rat tachykinin (PPT) gene, exon 7.
ACCESSION   M34162
KEYWORDS    neurokinin A; neuropeptide K; neuropeptide gamma; substance P;
            tachykinin.
SEGMENT     4 of 4
SOURCE      Rat (strain Sprague-Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 779)
  AUTHORS   Carter,M.S. and Krause,J.E.
  TITLE     Structure, expression and some regulatory mechanisms of the rat
            preprotachykinin gene encoding substance P, neurokinin A,
            neuropeptide K, and neuropeptide gamma
  JOURNAL   J. Neurosci. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Krause, 10-MAY-1990.
FEATURES       from  to/span     description
    pept    +     7       56     beta-preprotachykinin (PPT), exon 7
    pep$    +     7       56     gamma-preprotachykinin (PPT), exon 7
    pep$    +     7       56     alpha-preprotachykinin (PPT), exon 7
    pre-msg <     1      701     PPT mRNA and introns
    IVS     <     1        6     PPT intron F
BASE COUNT      228 a    150 c    159 g    242 t
ORIGIN      
        1 ctgcagtggc ttatgaaaga agcgcaatgc agaactacga aagaaggcgt aaataaaccc
       61 tgtaacgcac tatctattca tctccatctg tgtccgcgag cagtgagcgg taaaataaaa
      121 atgtgcgcta tgaggaatga ttatttattt aatatcaaat gttgttatga gtgaaaaact
      181 caaaaaagtg tttatttttt catattgtgc caataagcat tgtaattcta atgtggtgac
      241 ctcctcagac agaagtagaa attagttgta acttcagcaa agcacagtgt tgatggagtt
      301 gtacaagttt gccagcgatg caagtctcca aagacagaaa ggctgctgtg aggcagtgca
      361 ggcggctgct gctggaggca gagaaactcc tgtgtgtctt gcgcttccct tggttgcttt
      421 tatcctaatg atgtactgag agtttggtat ctgactctat ttgtatccta gcagcatgtt
      481 tcctgtgttg tgactatata gagatgtttt taaaagtttc aatgtacttc tctggtcttc
      541 agtcattgta tgatgtgttg tgatagctac cattttaaat aaaagaatgt atcttcagga
      601 attttgtcac attctttcat taatatttat gctaaccaca ccatctctat taggatgagt
      661 gtgatcaagc tgtttgggta aacacttcta tctctctcag gcacagtgca aattgagtgt
      721 gcgcgcaaca cacacacaca cacacacaca cacacacaca tacctccaca cacattttt
//
LOCUS       MXAFRZE      2493 bp ds-DNA             BCT       21-JUL-1990
DEFINITION  M.xanthus gliding motility regulatory protein (frzE) gene, complete
            cds.
ACCESSION   M35192
KEYWORDS    gliding motility regulatory protein.
SOURCE      M.xanthus (strain DZF1) DNA.
  ORGANISM  Myxococcus xanthus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; 
            Myxococcaceae.
REFERENCE   1  (bases 1 to 2493)
  AUTHORS   McCleary,W.R. and Zusman,D.R.
  TITLE     FrzE of Myxococcus xanthus is homologous to both CheA and CheY of
            Salmonella typhimurium
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.R.McCleary, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         64     2397     gliding motility regulatory protein (frzE)
BASE COUNT      375 a    848 c    926 g    344 t
ORIGIN      
        1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg
       61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc
      121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag
      181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt
      241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag
      301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc
      361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc
      421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg
      481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc
      541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc
      601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc
      661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc
      721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc
      781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg
      841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg
      901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc
      961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc
     1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag
     1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg
     1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg
     1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc
     1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc
     1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg
     1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc
     1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag
     1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc
     1561 accatcaccc tgcgcctgcc gcagtcgctg gcgttgatga aggtgctgct ggtgcgcctg
     1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg
     1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg
     1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc
     1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac
     1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc
     1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc
     1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc
     2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag
     2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag
     2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg
     2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg
     2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc
     2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga
     2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg
     2461 caaggggctg cgtgcgctcg cggcccgggg cct
//
LOCUS       RATPCBB       463 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Rat PCB-binding protein mRNA, complete cds..
ACCESSION   J05536
KEYWORDS    PCB binding protein; binding protein.
SOURCE      Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone
            pPCB6.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 463)
  AUTHORS   Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J.,
            Gillner,M., Gustafsson,J.-A. and Lund,J.
  TITLE     Cloning, structure, and expression of a rat binding protein for
            polychlorinated biphenyls: Homology to the hormonally regulated
            progesterone-binding protein uteroglobin
  JOURNAL   J. Biol. Chem. 265, 1-4 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            J.Lund, 15-JUN-1990.
FEATURES       from  to/span     description
    pept         58      348     PCB binding protein precursor
    sigp         58      114     PCB binding protein signal peptide
    matp        115      345     PCB binding protein
BASE COUNT      117 a    156 c     84 g    106 t
ORIGIN      
        1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg
       61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg
      121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat
      181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag
      241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg
      301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat
      361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta
      421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc
//
LOCUS       YSBCPR       2990 bp ds-DNA             PLN       21-JUL-1990
DEFINITION  C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds.
ACCESSION   M35199
KEYWORDS    NADPH-cytochrome P450 reductase.
SOURCE      C.tropicalis (ATCC 750) DNA, clone pTS1.
  ORGANISM  Candida tropicalis
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina.
REFERENCE   1  (bases 1 to 2990)
  AUTHORS   Sutter,T.R., Sanglard,D. and Loper,J.C.
  TITLE     Isolation and characterization of the alkane-inducible NADPH-
            cytochrome P450 oxidoreductase gene from Candida tropicalis:
            Identification of invariant residues within similar amino acid
            sequences of divergent flavoproteins
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            T.R.Sutter, 15-JUN-1990.
            
            Author name and address
               T.R.Sutter
               CIIT, P.O. Box 12137
               Research Triangle Park, NC 27709
FEATURES       from  to/span     description
    pept        554     2596     NADPH-cytochrome P450 reductase
    signal      198      204     TATA box
BASE COUNT      988 a    442 c    558 g   1002 t
ORIGIN      
        1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa
       61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg
      121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag
      181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag
      241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag
      301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa
      361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata
      421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta
      481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc
      541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt
      601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt
      661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa
      721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa
      781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc
      841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt
      901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac
      961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac
     1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg
     1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt
     1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa
     1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga
     1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt
     1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga
     1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa
     1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat
     1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt gatactgtta ttgaattgaa
     1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat
     1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg
     1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga
     1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc
     1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt
     1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt
     1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac
     1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca
     2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag
     2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg
     2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg
     2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt
     2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa
     2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa
     2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc
     2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga
     2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca
     2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta
     2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga
     2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat
     2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc
     2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta
     2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt
     2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta
//
LOCUS       HUMCYPIIF    1825 bp ss-mRNA            PRI       21-JUL-1990
DEFINITION  Human cytochrome P450IIF1 protein (CYP2F) mRNA, complete cds.
ACCESSION   J02906
KEYWORDS    cytochrome P450.
SOURCE      Human lung, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1825)
  AUTHORS   Nhamburo,P.T., Kimura,S., McBride,O.W., Kozak,C.A.,
            Gelboin,H.V. and Gonzalez,F.J.
  TITLE     The human CYP2F gene subfamily: Identification of a cDNA encoding a
            new cytochrome P450, cDNA-directed expression and chromosome
            mapping
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.T.Nhamburo, 18-MAY-1990.
FEATURES       from  to/span     description
    pept         56     1531     cytochrome P450IIF1 /nomgen="CYP2F"
                                 /map="19q13.1-q13.2" /hgml_locus_uid="LV0170W"
    mRNA    <     1     1813     CYP2F mRNA
BASE COUNT      356 a    596 c    470 g    403 t
ORIGIN      
        1 gcaggctcag cgcatcccag ccagtgtctc ctgcagctca gcagctgcct tcaccatgga
       61 cagcataagc acagccatct tactcctgct cctggctctc gtctgtctgc tcctgaccct
      121 aagctcaaga gataagggaa agctgcctcc gggacccaga cccctctcaa tcctgggaaa
      181 cctgctgctg ctttgctccc aagacatgct gacttctctc actaagctga gcaaggagta
      241 tggctccatg tacacagtgc acctgggacc caggcgggtg gtggtcctca gcgggtacca
      301 agctgtgaag gaggccctgg tggaccaggg agaggagttt agtggccgcg gtgactaccc
      361 tgcctttttc aactttacca agggcaatgg catcgccttc tccagtgggg atcgatggaa
      421 ggtcctgaga cagttctcta tccagattct acggaatttc gggatgggga agagaagcat
      481 tgaggagcga atcctagagg agggcagctt cctgctggcg gacgtgcgga aaactgaagg
      541 cgagcccttt gaccccacgt ttgtgctgag tcgctcagtg tccaacatta tctgttccgt
      601 gctcttcggc agccgcttcg actatgatga tgagcgtctg ctcaccatta tccgccttat
      661 caatgacaac ttccaaatca tgagcagccc ctggggcgag ttgtacgaca tcctagaccc
      721 cagattcccg agcctcctgg actgggtgcc tgggccgcac caacgcatct tccagaactt
      781 caagtgcctg agagacctca tcgcccacag cgtccacgac caccaggcct cgtctccccg
      841 ggacttcatc cagtgcttcc tcaccaagat ggcagaggag aaggaggacc cactgagcca
      901 cttccacatg gataccctgc tgatgaccac acataacctg ctctttggcg gcaccaagac
      961 ggtgagcacc acgctgcacc acgccttcct ggcactcatg aagtacccaa aagttcaagc
     1021 ccgcgtgcag gaggagatcg acctcgtggt gggacgcgcg cggctgccgg cgctgaagga
     1081 ccgcgcggcc atgccttaca cagacgcggt gatccacgag gtgcagcgct ttgcagacat
     1141 catccccatg aacttgccgc accgcgtcac tagggacacg gcctttcgcg gcttcctgat
     1201 acccaagggc accgatgtca tcaccctcct taacaccgtc cactacgacc ccagccagtt
     1261 cctgacgccc caggagttca accccgagca ttttttggat gccaatcagt ccttcaagaa
     1321 gagtccagcc ttcatgccct tctcagctgg gcgccgtctg tgcctgggag agctgctggc
     1381 gcgcatggag ctctttctgt acctcaccgc catcctgcag agcttttcgc tgcagccgct
     1441 gggtgcgccc gaggacatcg acctgacccc actcagctca ggtcttggca atttgccgcg
     1501 gcctttccag ctgtgcctgc gcccgcgcta acgccccggc ccttccagat tcgcctgtga
     1561 gcgatgaggc ccacccatgt gggttgctac gtccccttct tggtccacag tctgccctca
     1621 tccctctggc agtcacgctg tcttccctgc atgctgtgcc tgccgcgtgc ccttccccca
     1681 tccctccaat ctgtgccccg tctgcagggc agaggcagat gtggcatgtc tttttgtacc
     1741 cacagagctt gttctatggc acgccctttt ctaggctttt tgtatcattt cttagtacat
     1801 tgtaatagat tcaaaccagt cttgg
//
LOCUS       MUSADA3A      891 bp ds-DNA             ROD       21-JUL-1990
DEFINITION  Mouse adenosine deaminase (ada) gene, 3' end.
ACCESSION   J05523
KEYWORDS    adenosine deaminase.
SOURCE      Mouse fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 891)
  AUTHORS   Maa,M.-C., Chinsky,J.M., Ramamurthy,V., Martin,B.D. and
            Kellems,R.E.
  TITLE     Identification of transcription stop sites at the 5' and 3' ends of
            the murine adenosine deaminase gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by V.Ramamurthy, 18-MAY-1990.
BASE COUNT      241 a    207 c    262 g    181 t
ORIGIN      
        1 atagctggcc accttgcctt gattctatca aaggcattcc agtcccagca tcaagcacat
       61 tctgtcctta tgcctccctt tcagacatca gtcatggtgg ctagcaagca tcatgtactg
      121 ggaggaccca aatacacaaa gactctcagt aaggcaaacg gtgttgtaaa ggcctacatg
      181 aggggactag tgatggctaa ggatgaaggc aggcagcaaa caaagctgat gagagctgag
      241 cagagcacag gagccttgaa gagccccttg gcactgtgcc cggtgtgcca gctgggtcca
      301 ctgctcattc ggttggacag tgagcagccg ttgagtaggt acccgtcacg tgccaaacac
      361 tgttagtcac agagggcatg agacagcctg tgctggaaca gctcagttca tagggccatg
      421 gagatgggga gaaaggggcg cctctgtcag agacaagctg tggtctggga aggccttagc
      481 actaaaagca ccacaatgag aagcaaccgc cagaagcagg gcccgcaggc ctttgttcca
      541 gctgcaaaga gaaaggaaaa agtggggaat aagagttggg gctgcggagg gggtggggag
      601 cattgtgcag gttccgtact tgaacagaaa gcagggacca acacaaggaa ggccaaaggg
      661 ctcagcctgg gaaactttgt gaggctatgg aggtaagggg cagggtattc tcccgaggca
      721 tagagactgt cataccaatg atggtcattg gctgggcgag cctaagaaga ctccagtttc
      781 ccttgacctc agttaaatct attacatgag aaactgggtc cctgtcccca acattgtccc
      841 tgttccgtgt tcctaagtca tttctgggca ataaggactg agaactctag a
//
LOCUS       YSCG6PD      3782 bp ds-DNA             PLN       21-JUL-1990
DEFINITION  S.cerevisiae glucose-6-phosphate dehydrogenase (ZWF1) gene and ORF,
            complete cds.
ACCESSION   M34709
KEYWORDS    glucose-6-phosphate dehydrogenase.
SOURCE      S.cerevisiae (strain S288C) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3782)
  AUTHORS   Nogae,I. and Johnston,M.
  TITLE     Isolation and characterization of the ZWF1 gene of Saccharomyces
            cerevisiae, encoding glucose-6-phosphate dehydrogenase
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Johnston, 18-MAY-1990.
FEATURES       from  to/span     description
    pept       1585     3102     glucose-6-phosphate dehydrogenase (ZWF1) (EC
                                 1.1.1.49)
    pept        246     1037     pot. ORF
    mRNA       1472  >  3102     ZWF1 mRNA
BASE COUNT     1066 a    899 c    887 g    930 t
ORIGIN      
        1 atcgattagc aaaaccgtag agaagatcat tgcgcataag aaacaaaaag aaaataccgg
       61 tgcggaccgc aagcctctat tatctgctgt gtgtccggga tttcttatat acacggaaaa
      121 gaccaaaccg cagttggtgc ctatgctgct caatgtcaag tcgcctcagc aaattacggg
      181 ctcgttgatc agagctacgt tcgagagtct ctatcgcgcg tgagtccttt taccatctgt
      241 cgctgatgcc atgttttgac aagaaactag aagcgtcaag gccagaatcg cttgacgatg
      301 ggattgactg tgtcattacg ccacgtgaaa tagtgacaat gttacaagag ttgaatttag
      361 acttcaagtc gtttcttact gaggacacaa gtttatatgg cgctatcacc gccgggtggg
      421 acccgcgagt ccactgggcg tcgaatctgg gaggcacttg tggtggctat gcctaccagt
      481 acgtaacggc tgtccaacga ctacacccag gaagtcagat gatagttctg gagggtagaa
      541 acagcgacat tgtcgagtac cgattgctgc acgatgatcg cataatagcc gccgccagcg
      601 agctttccgg gtttagaaac atccaaaatc tggtgcgtaa actgacctcg ggctccgggt
      661 cagagcgcaa gaggaacatc agctctgcgg aagagagacg aacaggtcca aaggcgaact
      721 cacgcgagat ggcgccagcc gccacggccg atccctacca ctcggactac atcgaggtaa
      781 acgcgtgtcc gggtgcatgc atgaacggtg gtggcctact caatggcgag caaaacagtc
      841 tcaagcggaa gcagttggtt cagacactga ataagcgtca cggagaggag ctagcaatgg
      901 tagatcctct aaccctggga cccaagctag aagaggcggc accgcccgct ttcgctagag
      961 tacgtcttcg cgcccgtcaa gcaggccgtc gaaaaggatc tcgtctctgt tgggagcacc
     1021 tggtaagtaa ggtgtagttt tgcacccgtg tacataagcg tgaaatcacc acaaactgtg
     1081 tgtatcaagt acatagtgac atttaaataa tagcaagaac aacaataata gtagcgctac
     1141 tggaagcacc acgtaatagt ggaaaagaac tggaaaaacc gctataagat gcatactccg
     1201 gcggtcttac gcggagatac aagcttccaa cggtgctaaa agcccggttt cggctcggcc
     1261 ggaggaggaa gagagacgaa aaaaaaaaaa atgactaaaa aaaaaatgga atattattaa
     1321 tgtgggattt ttggctcaag gtgtggtggc cccttttcta agggtggcga attcttcaat
     1381 gtacggaaaa ctcgccaagg ctatcccata tataagcaaa ctgtgggttc atctatatac
     1441 cgacacataa cacctaaagt ggcttcctcc tgcccctctc tcccttttct ccactcaccc
     1501 ctccttctcc cccttccccc tctccaattg gctgtataga cagaaagagt aaatccaata
     1561 gaatagaaaa ccacataagg caagatgagt gaaggccccg tcaaattcga aaaaaatacc
     1621 gtcatatctg tctttggtgc gtcaggtgat ctggcaaaga agaagacttt tcccgcctta
     1681 tttgggcttt tcagagaagg ttaccttgat ccatctacca agatcttcgg ttatgcccgg
     1741 tccaaattgt ccatggagga ggacctgaag tcccgtgtcc taccccactt gaaaaaacct
     1801 cacggtgaag ccgatgactc taaggtcgaa cagttcttca agatggtcag ctacatttcg
     1861 ggaaattacg acacagatga aggcttcgac gaattaagaa cgcagatcga gaaattcgag
     1921 aaaagtgcca acgtcgatgt cccacaccgt ctcttctatc tggccttgcc gccaagcgtt
     1981 tttttgacgg tggccaagca gatcaagagt cgtgtgtacg cagagaatgg catcacccgt
     2041 gtaatcgtag agaaaccttt cggccacgac ctggcctctg ccagggagct gcaaaaaaac
     2101 ctgggggccc tctttaaaga agaagagttg tacagaattg accattactt gggtaaagag
     2161 ttggtcaaga atcttttagt cttgaggttc ggtaaccagt ttttgaatgc ctcgtggaat
     2221 agagacaaca ttcaaagcgt tcagatttcg tttaaagaga ggttcggcac cgaaggccgt
     2281 ggcggctatt tcgactctat aggcataatc agagacgtga tgcagaacca tctgttacaa
     2341 atcatgactc tcttgactat ggaaagaccg gtgtcttttg acccggaatc tattcgtgac
     2401 gaaaaggtta aggttctaaa ggccgtggcc cccatcgaca cggacgacgt cctcttgggc
     2461 cagtacggta aatctgagga cgggtctaag cccgcctacg tggatgatga cactgtagac
     2521 aaggactcta aatgtgtcac ttttgcagca atgactttca acatcgaaaa cgagcgttgg
     2581 gagggcgtcc ccatcatgat gcgtgccggt aaggctttga atgagtccaa ggtggagatc
     2641 agactgcagt acaaagcggt cgcatcgggt gtcttcaaag acattccaaa taacgaactg
     2701 gtcatcagag tgcagcccga tgccgctgtg tacctaaagt ttaatgctaa gacccctggt
     2761 ctgtcaaatg ctacccaagt cacagatctg aatctaactt acgcaagcag gtaccaagac
     2821 ttttggattc cagaggctta cgaggtgttg ataagagacg ccctactggg tgaccattcc
     2881 aactttgtca gagatgacga attggatatc agttggggca tattcacccc attactcaag
     2941 cacatagagc gtccggacgg tccaacaccg gaaatttacc cctacggatc aagaggtcca
     3001 aagggattga aggaatatat gcaaaaacac aagtatgtta tgcccgaaaa gcacccttac
     3061 gcttggcccg tgactaagcc agaagatacg aaggataatt agaaaaatgc aagcacattc
     3121 atttatcggc taagtcactg aaattttttt tttcgagtga ttttatcttg caagctcact
     3181 ctctttgttc ttttctcctc tataatggca tctccccccc accaactcct ctctacttgc
     3241 gtgtgtattt gtttgtatac atgtgtaaat atatatacat aaaagaatgt cgtctcatgt
     3301 attttttaac ttttagccgc ggtcagtgac attttggctt tcccaccatt ccacgtctga
     3361 aaaaaaaaaa aaaaaaagag ataaaatatg aattgaatat atatcaaaaa tgtctgcaaa
     3421 aatttttacg aatcagtccg attggacttg tacttatcat tgatttgctc gttatgcgtc
     3481 ttatccactt gattagctat tccctgtaaa gtcttagaca ctgcctcagt ggccccaatt
     3541 aacggccgaa taattgccac aggtgcagct ctagcaacat aaactgctgc ggcgctcggc
     3601 cctcccggtt tatcatcctt catctgtccc tttgccctcc ataccgcatc gtacgcaata
     3661 tgcatatgct tttctaatga actatacgcc tccttcaaac ccgtgggcag gtcgaggggc
     3721 tgatctgcgt acagactcac tattgtgggt tcgttgtgat ctcccggtga agacatatcg
     3781 at
//
LOCUS       HLTLYS        642 bp ss-mRNA            INV       21-JUL-1990
DEFINITION  H.rufescens sperm lysin mRNA, complete cds.
ACCESSION   M34388
KEYWORDS    sperm lysin.
SOURCE      H.rufescens testis, cDNA to mRNA, clones r13 and r14.
  ORGANISM  Haliotis rufescens
            Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; 
            Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae.
REFERENCE   1  (bases 1 to 642)
  AUTHORS   Vacquier,V.D., Carner,K.R. and Stout,C.D.
  TITLE     Species specific sequences of abalone sperm lysin: The sperm
            protein that dissolves a hole in the egg envelope
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by V.D.Vacquier, 22-MAY-1990.
FEATURES       from  to/span     description
    pept         16      480     sperm lysin precursor
    sigp         16       69     sperm lysin signal peptide
    matp         70      477     sperm lysin
    mRNA    <     1      642     sperm lysin mRNA
    signal      625      630     poly-A signal
BASE COUNT      202 a    124 c    151 g    165 t
ORIGIN      
        1 cgaacagatt acaagatgaa gctgttggtg ctttgcattt ttgcgatgat ggcgacattg
       61 gcaatgtcac gcagctggca ttatgttgaa cccaagtttt taaacaaggc ttttgaagtt
      121 gcactgaagg tacagatcat tgctggattc gatagagggc tggtcaaatg gcttagagtt
      181 catggcagaa cattgtctac cgttcagaag aaggcgctgt acttcgtcaa tagacgttac
      241 atgcagaccc actgggcgaa ttacatgctc tggataaata agaagatcga tgcactgggc
      301 agaactccag ttgttggtga ctacacacgg ttaggtgccg aaattggaag acgtattgat
      361 atggcttact tttacgactt cctgaaagat aagaacatga tcccaaaata tctgccatac
      421 atggaagaaa ttaatagaat gcgcccggca gatgtcccag ttaaatacat gggcaagtga
      481 aacccatgaa gaagacagct tcatgcaccg tgcatgtcaa cagatacaga aaacatcgtc
      541 aaacacgggg agctatgaca ccatcctaac atgggttggc tatttgttgc acgattttta
      601 aaacggaaag attgtacatt ccagaataaa tagatttact ac
//
LOCUS       HLTLYS2       650 bp ss-mRNA            INV       21-JUL-1990
DEFINITION  H.corrugata sperm lysin mRNA, complete cds.
ACCESSION   M34389
KEYWORDS    sperm lysin.
SOURCE      H.corrugata testis, cDNA to mRNA, clones p1 and p3.
  ORGANISM  Haliotis corrugata
            Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; 
            Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae.
REFERENCE   1  (bases 1 to 650)
  AUTHORS   Vacquier,V.D., Carner,K.R. and Stout,C.D.
  TITLE     Species specific sequences of abalone sperm lysin: The sperm
            protein that dissolves a hole in the egg envelope
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by V.D.Vacquier, 22-MAY-1990.
FEATURES       from  to/span     description
    pept         16      483     sperm lysin precursor
    sigp         16       69     sperm lysin signal peptide
    matp         70      480     sperm lysin
    mRNA          1      650     sperm lysin mRNA
    signal      635      640     poly-A signal
BASE COUNT      194 a    134 c    162 g    160 t
ORIGIN      
        1 ggaacagatt acaagatgaa gctgttggtg ctttgccttt ttgcgatgat ggcgacattg
       61 gctgtgtcgc gccaccgctt ccgttttatt ccacataagt atatacgcaa ggagtttgaa
      121 gttgcactga aggtagagat cattgccgga ttcgatagaa cgctggtcaa atggcttaga
      181 gttcatggcg gacgtttgag tacagttcag aagaaggcgc tgtacttcgt caatagacgt
      241 tacatgcaga cccactggca gaattacatg ctctggatag tgaggaagac cgatgcactg
      301 ggcagacctc cagttgttgc tgactacagt cggttaggtg ccgaaattgg aagacgtatt
      361 gatatggcat atttttacaa cttcctgaac gggaggaaca tgatcccaaa atatctgcca
      421 tacatggagg aaattaatag aatgcgcccg gcagatgtcc cagttgcaaa cagaggcaag
      481 tgaaaccaac gaagaagaaa gctgcatgca ccctgcatgt caacagatac agaaaacatc
      541 gtccgtcaaa cacgaggagc tgtgacagca tccatcccaa catgggttgg ctatttgttg
      601 cacgatttta acacagaaag attgtacatt ccagaataaa tagatttact
//
LOCUS       HUMROSSA     1843 bp ss-mRNA            PRI       21-JUL-1990
DEFINITION  Human 52 kD ribonucleoprotein Ro/SSA, complete cds.
ACCESSION   M34551
KEYWORDS    ribonucleoprotein.
SOURCE      Human thymocyte, cDNA to mRNA, clone FI18.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1843)
  AUTHORS   Itoh,K., Itoh,Y. and Frank,M.B.
  TITLE     Heterogeneity in human Ro/SSA ribonucleoproteins: The 52 and 60 kd
            Ro/SSA proteins are encoded by separate genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by M.B.Frank, 18-MAY-1990.
            
                             Author address: M.B.Frank
                             Oklahoma Medical Res Foundation
                             Arthritis and Immunology program
                             825 WE 13th St, Oklahoma City, OK 73104
FEATURES       from  to/span     description
    pept         39     1466     52 kD Ro/SSA ribonucleoprotein
    signal     1827     1832     poly-A signal
BASE COUNT      474 a    458 c    488 g    423 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcgggc acactgctgt ttaacggcac acttgacaat ggcttcagca gcacgcttga
       61 caatgatgtg ggaggaggtc acatgcccta tctgcctgga ccccttcgtg gagcctgtga
      121 gcatcgagtg tggccacagc ttctgccagg aatgcatctc tcaggttggg aaaggtgggg
      181 gcagcgtctg tcctgtgtgc cggcagcgct ttctgctcaa gaatctccgg cccaatcgac
      241 agctagccaa catggtgaac aaccttaaag aaatcagcca ggaggccaga gagggcacac
      301 agggggaacg gtgtgcagtg catggagaga gacttcacct gttctgtgag aaagatggga
      361 aggccctttg ctgggtatgt gcccagtctc ggaaacaccg tgaccacgcc atggtccctc
      421 ttgaggaggc tgcacaggag taccaggaga agctccaggt ggcattaggg gaactgagaa
      481 gaaagcagga gttggctgag aagttggaag tggaaattgc aataaagaga gcagactgga
      541 agaaaacagt ggaaacacag aaatctagga ttcacgcaga gtttgtgcag caaaaaaact
      601 tcctggttga agaagaacag aggcagctgc aggagctgga gaaggatgag agggagcagc
      661 tgagaatcct gggggagaaa gaggccaagc tggcccagca gagccaggcc ctacaggagc
      721 tcatctcaga gctagatcga aggtgccaca gctcagcact ggaactgctg caggaggtga
      781 taattgtcct ggaaaggagt gagtcctgga acctgaagga cctggatatt acctctccag
      841 aactcaggag tgtgtgccat gtgccagggc tgaagaagat gctgaggaca tgtgcagtcc
      901 acatcactct ggatccagac acagccaatc cgtggctgat actttcagaa gatcggagac
      961 aagtgaggct tggagacacc cagcagagca tacctggaaa tgaagagaga tttgatagtt
     1021 atcctatggt cctgggtgcc cagcactttc actctggaaa acattactgg gaggtagatg
     1081 tgacaggaaa ggaggcctgg gacctgggtg tctgcagaga ctctgtgcgc aggaaggggc
     1141 actttttgct tagttccaag agtggcttct ggacaatttg gttgtggaac aaacaaaaat
     1201 atgaggctgg cacctacccc cagactcccc tccaccttca ggtgcctcca tgccaagttg
     1261 ggattttcct ggactatgag gctggcatgg tctccttcta caacatcact gaccatggct
     1321 ccctcatcta ctccttctct gaatgtgcct ttacaggacc tctgcggccc ttcttcagtc
     1381 ctggtttcaa tgatggagga aaaaacacag cccctctaac cctctgtcca ctgaatattg
     1441 gatcacaagg atccactgac tattgatggc tttctctgga cactgccact ctccccattg
     1501 gcaccgcttc tcagccacaa accctgcctc ttttccccat gaactctgaa ccacctttgt
     1561 ctctgcagag gcatccggat cccagcaagc gagctttagc agggaagtca cttcaccatc
     1621 aacattcctg ccccagatgg ctttgtgatt ccctccagtg aagcagcctc cttatatttg
     1681 gcccaaactc atcttgatca accaaaaaca tgtttctgcc ttctttatgg gacttaagtt
     1741 ttttttttct cctctccatc tctaggatgt cgtctttggt gagatctcta ttatatcttg
     1801 tatggtttgc aaaagggctt cctaaaaata aaaacccgaa ttc
//
LOCUS       TRBBS12      4004 bp ds-DNA             INV       21-JUL-1990
DEFINITION  T.brucei expression site-associated protein (BS1.1 and BS1.2)
            genes, 5' end and complete cds.
ACCESSION   M34520
KEYWORDS    expression site-associated protein.
SOURCE      T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA,
            clone BS1.1-2.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 4004)
  AUTHORS   Hobbs,M.R. and Boothroyd,J.C.
  TITLE     An expression-site-associated gene family of trypanosomes is
            expressed in vivo and shows homology to a VSG gene
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Hobbs, 22-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      129     expression site-associated protein (BS1.1)
    pept       1318     2151     expression site-associated protein (BS1.2)
    site       3083     3604     RIME sequence
BASE COUNT     1181 a    650 c   1052 g   1121 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcaact atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg
       61 gcagcagtac atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga
      121 gtgctgtgaa ggaaggatgc gacagaacgt gcgctgctta gtgtgaaaga ttatggtaat
      181 ggagggttgt gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg
      241 ggcaaaagta acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa
      301 ggaaagatgt gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg
      361 agatgtaagg ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa
      421 tttagttaca acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt
      481 ctactgttta atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc
      541 gaatttcact tctccagaat ctttatttta ggtgcgctat agatgatagt ccaaagagaa
      601 ggaattacaa ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga
      661 ttcttttcta tgtcacttgt gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg
      721 aaatggggcg tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt
      781 atgatgcata gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta
      841 ggtggaagcg gctaatgatt gctagcagca ccaacagagg aagttaattg ttgacgtagc
      901 cgtttgtgaa tgagatagaa ggttattggg tggaacatta tcttggaaag agtttttctt
      961 ccactaaaaa tgtgtagagt ggtggtttcg agaaaatggt ttagatattg aggaagaaac
     1021 aaggatttga ggtgcaagtt gatgcgctgt atgatttgtc ttgccgttgc tgttacttat
     1081 tgttgagttg gtatttgagg aaaatgtttt ttcagtgtgc gttgttatat acggctgtga
     1141 gttggtgatg tttatttgaa tgatagcttg ataggataag atgaaataat actaagcatg
     1201 tattcttctg gctgttcgcc gaaatggtaa tctatgttac ccatagcacg aaaaaagttt
     1261 ttttactcag taactgaagc ggaaagtttt aaaaagcata atttaaaaaa gttgataatg
     1321 atgaagtttt ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa
     1381 aataaaagga atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgaactat
     1441 cttaaaggaa tagcgcacag agtaaacagc gagagtgccg tggttacgaa aaaactatca
     1501 gatttgaaaa tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct
     1561 ggcgagcagg attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt
     1621 acattccagc aggagttgga ggaaatgagg aagccatccg cgttagcagc agctgcagct
     1681 gggatagcag ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg
     1741 tcatcacagt tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa
     1801 gaatgttttg atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta
     1861 aaagagtcgg cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat
     1921 agcttgggtg cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga
     1981 acagatactg acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt
     2041 ggtggagtga tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa
     2101 tacgtgatgc tacagagtac ggtgcagtac ggtggaccga ggatccagta aagtgagcat
     2161 atttaaggat gtcatccgcc tgtttgcgcg cttccaagaa gcaaaaaatg aagtgatgaa
     2221 taaaataaaa actactgtgg atgaattggc gaaatgcata gggcagaaag aggttgagct
     2281 cactgatgat cagctttacg aggaatttat ctgggagacc atacacagat tggagctgtc
     2341 aaagagagtg agtgaacaac tctctttagg tgaagaagag gagacgatcc tgaaatctaa
     2401 ctatactgca gaaccggtcc gtggactttc acggtagcgg ggtccaacgc ggcagcagta
     2461 catttgagtg tttctaccgc tgcactttgt ttttcagttt tattgttggg agtgctgtga
     2521 aggaaggatg cgacagaagc tgcgctgctt agtgtgaaag attatggtaa tggagggttg
     2581 tgaaagattg ggggaacaaa aacctatttc ttttatttgg gggaacaaat gggcaaaagt
     2641 aacgtaagtt tccagtggga gtggtatgtg tgtgtgtatg gggctggcta aggaaagatg
     2701 tgagttcggc atgtggtatg tacaagctac gaaaacgtgt gaaacaaaac gagatgtaag
     2761 gggaaaatgt aacaaccaac tatgttaaat ttcaggagac tatttttcaa atttagttac
     2821 aacaaagtaa atgtcaaata atgccaacta taaaggaaaa ctagactaat tctactgttt
     2881 aatgtgacga gaattgtggc gtgaaaccta gtaactgctt atcatggcgt cgaatttcac
     2941 ttctccagaa tctttatttt agtgcgctat agatgatagt ccaaagagaa ggaattacaa
     3001 ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga ttcttttcta
     3061 tgtctcttgt gaaggtcgat ctttttgcta ccccctggcg atgccggcca cctcaacgtg
     3121 gtgccagggt ccagtgaccc cgtatcattg ggggaagcca agagccagca gcgttccttt
     3181 catggggaac actgctgtag ctccggctac gcatcataca gcacagggat cagcaggcgt
     3241 cttgctggga caccgttttt catttgtcgg tccctgggca cgtgccagcg tcgatcagca
     3301 gtatcatccg cgctaagatg ctgctgtccg gtgatgtgga cctccaaaaa aggattgcca
     3361 ttttggcatc ttttggagag tccagggtgg gaggcttctc gccccatctg ctgtattccg
     3421 ttcaactgcg gacgtacaac aaaaaattat agagggtgtg ttaggatgaa tgaaaaaggg
     3481 agactctgcc acagtcgcca gaccgatagc actctcaggg ctctacggtg ttggctgatg
     3541 gccgcgcaga ggggggaaac tctcacgaag gcacgaagaa aattcaaaaa aaaatttttg
     3601 ctaccacagg ctgtaaggat aacgctggag atggagccga ctgtggtttt tcttgtgaat
     3661 gcgtatcttc gtaaccggtt gttaagaagg gatcggacga aggaatttgt ctgagtatga
     3721 ggtaatgtcg aggttttctt tcattggatg tagtaataat ctccaaattt gttggaacat
     3781 ataatattat tgcaataaat gcttgaagag tggaattatt gatggattga cttgagtatc
     3841 atgattccaa tgaacactct tataatcctt cgaatagcaa acgtatgaac aaattaacgt
     3901 agcgtcatgt agacatggac atttttgtaa tgatacgact gctattgcac atctgaaaat
     3961 ttctatgacg gaccagtgcc gttaatcacc gtacaattga attc
//
LOCUS       TRBBS134     2396 bp ds-DNA             INV       21-JUL-1990
DEFINITION  T.brucei expression site-associated protein (BS1.3 and BS1.4)
            genes, 3' and 5' ends.
ACCESSION   M34521
KEYWORDS    expression site-associated protein.
SOURCE      T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA,
            clone BS1.3-4.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 2396)
  AUTHORS   Hobbs,M.R. and Boothroyd,J.C.
  TITLE     An expression-site-associated gene family of trypanosomes is
            expressed in vivo and shows homology to a VSG gene
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Hobbs, 22-MAY-1990.
FEATURES       from  to/span     description
    pept       1836  >  2396     expression site-associated protein (BS1.4)
    ORF     <     1     1206     expression site-associated protein (BS1.3)
BASE COUNT      744 a    357 c    642 g    653 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttgcaa ctccatgaag tgttatcatc catttccttc ggtagcttgg gtgtgaaaaa
       61 tattcgagga ggaaatggga gggatggatg caatttggtg agaacagaca ctaacggtat
      121 attaaatggg ggttcaccaa cacggcacaa cttaacgtgg ggtggtggag tgatgaattt
      181 tggatcctat caaaacggaa gtatgtacgt tgagggtggt gaatacggtg atgctacaga
      241 gtacggtgcg gtacggtgga ctgaagatcc cagtaaagtg agcatatttg aggatgtcat
      301 tcgcctgttt gcgcgcttcc aagaagcaaa aaatgcagtg atgaggagaa taaaaactac
      361 tgtggatgaa ttgacgaaat gtatagggca gaaggaggct gaactcacta atgatcagat
      421 ttacgaggaa tttatctggg agaccataca cagattggag ctgtcaaaga gagtgagtga
      481 acaaccctct ttaggtgaag aagaggagac aatcctgaaa tctaactata ctgcagaacc
      541 tgtccgtgga cctttcacgg gagcggggtc caacacagta gcagtacatt tgagtgtttc
      601 taccgctgca ctctgttgtt tagctttatt gttgggagtg ctgtgaggga aggatgcgac
      661 agaaactgcg ctgcttagcg tgaaagatta tggtaatgga gggttgggaa agattggggg
      721 aacaaaaacc tatttctttt atttggggga acaaatgggc aaaagtaacg tgagtttcca
      781 gcgggagtgg tgtgtgtgtg catggggctg actaaggaaa gatgtaagtt cggaatgtgg
      841 tatgtacaag ctacgaaaac gtgtgaaaca aaacgatatg taaggggaaa atgtaacaac
      901 caactatgtt aaatttcaag agactatttt tcaaatttag ttacaacaaa gtaaatgtca
      961 aataatgcca actataaagg aaaactagac taattctact gtttaatgtg acgagaattg
     1021 tggcgtgaaa cctagtaacg accacgtcac gatgtggaat ttcacttctc cagaatcttt
     1081 gttttagtgc gctatagatg atagtccaaa gagaaggaat tgcagttatt ataacccatc
     1141 caaagataga tcaaagactc ctcttctccg agtgattctt ttctatgtca cttgtggaag
     1201 gtcgatcttg gaacaatatc gaacgtgtga agcctgaaat gaggcgttat tattgcttct
     1261 ctttttggtg gttgttagag ttatataccg ctatggtatg atgcacagag gtaacgctaa
     1321 aaataagtga caaggcgttg tagtagggcg tatggtagtt ggaagcggct aatgattgct
     1381 ggcctcacca acagaggaag ttaattgttg acgtagccgt ttgtgaatga gatagaaggt
     1441 tatcgggtgg aacattatct tggaaagagt ttttcatcac ctaaaaatgt gtagagtggt
     1501 ggtttggaga aaatggttta gatattgagg aggaaacaag gatttgaggt gcaagttgat
     1561 gcgctgtaag atttgtatta ccgttggtgt tacttattgt tgagtgagta tttgagggaa
     1621 atgttatggg aggtgcttcg ttgttacata cggctctgag tcagtggtgt tcatttgaat
     1681 gaaatcttga taggataaga tgaaataata ctaagcatgt attcttctgt gctgccgccg
     1741 aaatggtaat ctatcttacc tacagcacga aaaatttatt ttttttactc agtaacaaag
     1801 cggaaagttt taaaagcata atttaaaaaa gtgatatgat gaggttttgg tttgtgttgt
     1861 tggccctttt gggaaaaaaa acacatgcat attatgaaaa tgaaaggaat gcattaaacg
     1921 caaccgccgc taataaagtt tgtgggctat cgaactatct taaaggagta gcgcacagag
     1981 taaacagcga gagtgctgtg gttacggaaa aactatcaga tttgaaaatg agaagcatcc
     2041 agttgcagct atcagtaatg cggaacagag tgccttctgg cgagaaggat tgtaaagaca
     2101 taaggacact cttgaaaaca gtattgagga atgagtttac attccagcag gagttggagg
     2161 aaatgaggaa cgcatccgcg ttagcagcag ctgcagctgg gttggcagct ggaagactgg
     2221 aagagtggat ttttgtattt gcacaggcag cagacaggtc atcacagttt tgcataagtg
     2281 tggggaagca tatcgcggct gagcacggca acttgcaaga atgttttgat ggaacaattg
     2341 gacctgaaac cctttacaaa attgaggatt cacgtgtaaa agagtcggcg cagaaa
//
LOCUS       TRBBS15       714 bp ss-mRNA            INV       21-JUL-1990
DEFINITION  T.brucei expression site-associated protein (BS1.5) gene, partial
            cds.
ACCESSION   M34522
KEYWORDS    expression site-associated protein.
SOURCE      T.brucei (strain 427) bloodstream stage isolate MiTat 1.4, cDNA to
            mRNA, clone BS1.5.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 714)
  AUTHORS   Hobbs,M.R. and Boothroyd,J.C.
  TITLE     An expression-site-associated gene family of trypanosomes is
            expressed in vivo and shows homology to a VSG gene
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Hobbs, 22-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1  >   714     expression site-associated protein (BS1.5) (AA
                                 at 1)
BASE COUNT      226 a    116 c    203 g    169 t
ORIGIN      
        1 agcgagagtg ccgtggttac ggaaaaacta tcagatttga aaatgagaag catccagttg
       61 cagttatcag taatgcggaa cagagtgcct tctggcgaga aggattgtaa agacatcagg
      121 acactcttga aaacagtatt gaggaatgag tttacattcc agcaggagtt ggaggaaatg
      181 aggaacgcat ccgcgttagc agcagctgca gctgggttgg cagctggaag actggaagag
      241 tggatttttg tatttgcaca ggcagcagac aggtcatcac agttttgcat aagtgtgggg
      301 aaaactatcc caccggaaca aaacaacttg caagaatgtt ttgatggaac aattggacct
      361 gaaacccttt acaaaattga ggattcacgt gtaaaagagt ctgcaaagaa aagcttgcaa
      421 ctccatgaag cgttatcatc catttccttc aatagtttgg gtgcggaaag tatccgagga
      481 ggaaatggga aggatggatg taatttggtg agaacagaca ctgacggtat attaaatggg
      541 ggttcaccaa cacggcacaa tttaacgtgg ggtggtggag tgatgaattt tggatcctat
      601 caaaacggaa gtatgtacgt tgagggggga gaatacggtg atgctacaga gtacggtgca
      661 gtacggtgga ctgaagatcc cagtaaagta agcatattta aggatgtcat tcgc
//
LOCUS       HUMCALCAT    1308 bp ss-mRNA            PRI       21-JUL-1990
DEFINITION  Human calcineurin catalytic subunit mRNA, complete cds.
ACCESSION   J05480
KEYWORDS    calcineurin.
SOURCE      Human adult cerebral cortex and 7-year-old cerebellum, cDNA to
            mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1308)
  AUTHORS   Kincaid,R.L., Giri,P.R., Higuchi,S., Tamura,J., Dixon,S.C.,
            Marietta,C.A., Amorese,D.A. and Martin,B.M.
  TITLE     Cloning and characterization of molecular isoforms of the catalytic
            subunit of calcineurin using nonisotopic methods
  JOURNAL   J. Biol. Chem. 265, 11312-11319 (1990)
  STANDARD  full staff_review
COMMENT     Authorin submission for [1] kindly submitted by R.Kincaid,
            08-MAY-1990, for release after publication.
            
            Two cDNA isoforms were found that either contained (class 1) or
            lacked (class 2) base pairs 535 to 564, but otherwise appeared
            identical; this region presumably represents an alternatively
            spliced exon.
FEATURES       from  to/span     description
    pept    <     1      762     calcineurin catalytic subunit
    signal      921      926     poly-A signal
    signal      927      932     poly-A signal
BASE COUNT      401 a    251 c    283 g    373 t
ORIGIN      
        1 ttacagcaca ataacttgtt atctatactc cgagcccacg aagcccaaga tgcagggtac
       61 cgcatgtaca ggaaaagcca aacaacaggc ttcccttctc taattacaat tttttcagca
      121 ccaaattact tagatgtata caataacaaa gctgcagtat tgaagtatga gaacaatgtt
      181 atgaatatca ggcaattcaa ctgttctcct catccatact ggctcccaaa tttcatggat
      241 gtttttactt ggtcccttcc atttgttggg gaaaaagtga ctgagatgct ggtaaatgtc
      301 ctcaacatct gctcagatga tgaactaggg tcagaagaag atggatttga tggtgcaaca
      361 gctgcagccc ggaaagaggt gataaggaac aagatccgag caataggcaa aatggccaga
      421 gtgttctcag tgctcagaga agagagtgag agtgtgctga cgctgaaagg cttgacccca
      481 actggcatgc tccccagcgg agtactttct ggagggaagc aaaccctgca aagcgctact
      541 gttgaggcta tcgaggctga tgaagctatc aaaggatttt caccacaaca taagatcact
      601 agcttcgagg aagctaaggg cttagaccga attaatgaga ggatgccgcc tcgcagagat
      661 gccatgccct ctgacgccaa ccttaactcc atcaacaagg ctctcacctc agagactaac
      721 ggcacggaca gcaatggcag taatagcagc aatattcagt gaccacttac tgttcacttt
      781 tttttttttt tttttttttt tttttgagct gcggggcatg atgggattgc tgcatatcag
      841 cagttggatg ttcttgcctc tgacagtagc ttatttgctc tgggggccag gaattggatt
      901 cagtttacac tatcattaaa aataaaaata aaaaaagggg gagagagata ataaactata
      961 ttttggtggg gctggtgatt aaacacctct tttgggtatg ccttttaaaa atgcttatag
     1021 agaaaaaaaa ttttaaaaag aaagctaatg ctagatatac tgcaatgtta ggggaatgaa
     1081 catgttttcc tactgaattg gggacttcta gataggttaa ggttttattc tgttactgga
     1141 catcaaaact ttgtctaatt tcttactcta ttgtacgttt acagtcgcag cactaaaatg
     1201 gatgtcatca aacattttta acatgatgta caaactaagg actatttatt gataatgttt
     1261 tgctactctt gtcagacaat ggctataaac tgaattaggc agtcttaa
//
LOCUS       MUSCALCAT    2194 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Mouse calcineurin catalytic subunit mRNA, complete cds.
ACCESSION   J05479
KEYWORDS    calcineurin.
SOURCE      Mouse (strain NIH Swiss) 18-day-old adult brain, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2194)
  AUTHORS   Kincaid,R.L., Giri,P.R., Higuchi,S., Tamura,J., Dixon,S.C.,
            Marietta,C.A., Amorese,D.A. and Martin,B.M.
  TITLE     Cloning and characterization of molecular isoforms of the catalytic
            subunit of calcineurin using nonisotopic methods
  JOURNAL   J. Biol. Chem. 265, 11312-11319 (1990)
  STANDARD  full staff_review
COMMENT     Authorin submission for [1] kindly submitted by R.Kincaid,
            08-MAY-1990, for release after publication.
            
            Two cDNA isoforms were found that either contained (class 1) or
            lacked (class 2) base pairs 1414 to 1443, but otherwise appeared
            identical; this region presumably represents an alternatively
            spliced exon.
FEATURES       from  to/span     description
    pept         76     1641     calcineurin catalytic subunit
    mRNA    <     1     2194     calcineurin catalytic subunit mRNA
    signal     1786     1791     poly-A signal
    signal     1792     1797     poly-A signal
BASE COUNT      624 a    488 c    529 g    553 t
ORIGIN      
        1 gcgccggtgc ggtcggggtg tgcagtcgga cgggacgagc agcgcgtcgc tgtccccccc
       61 tcccggtggc tggagatgtc cgagcccaag gcgattgatc ccaagttgtc gacgaccgac
      121 agggtggtga aagccgttcc atttccacca agtcaccggc tgacagcaaa ggaagtgttt
      181 gataatgatg ggaaacctcg tgtggatatc ttaaaagcac atctcatgaa ggagggcagg
      241 ctggaagaaa gtgttgcatt gagaataata acagagggtg cttcgattct ccgacaggaa
      301 aaaaacttgc tggatatcga cgcaccagtc acagtttgtg gggacatcca tggacaattc
      361 tttgacttga tgaagctctt tgaagtggga ggatctcctg ccaacactcg ctacctcttc
      421 ttaggggact atgttgacag agggtacttc agtatcgaat gtgtgctgta tttgtgggcc
      481 ttgaaaattc tttaccccaa aacactgttt ttacttcgcg gaaaccatga atgtaggcac
      541 ctcacagagt atttcacgtt taaacaagaa tgtaaaataa agtattcaga acgcgtttat
      601 gacgcctgta tggatgcctt cgactgcctt cccctggctg cgctaatgaa ccagcagttc
      661 ctgtgtgtac acggtggttt gtctccagag attaacactc tagatgacat cagaaaatta
      721 gaccgattca aagaaccacc tgcttatggg cccatgtgtg acatcctatg gtcagacccc
      781 ctggaggact ttggaaatga gaagactcag gaacatttca ctcacaacac agtcagaggc
      841 tgttcgtact tctacagtta cccagctgtg tgtgacttcc tgcagcacaa taatttgttg
      901 tccatactcc gcgcccacga agcccaggat gcagggtacc gcatgtacag gaaaagccaa
      961 acaacaggct tcccgtctct aattacaatc ttctcggcac caaattactt agatgtgtac
     1021 aataacaaag ctgcagtgtt gaagtacgag aacaatgtga tgaacatcag gcagttcaac
     1081 tgctccccgc atccgtactg gctcccaaat ttcatggatg ttttcacctg gtcgctgcca
     1141 tttgttgggg agaaagtgac tgagatgctg gtcaatgttc tcaacatctg ctccgacgat
     1201 gaactggggt cagaagaaga tggatttgac ggagccacgg ccgcagcccg gaaggaagtc
     1261 atcagaaaca agatccgagc aataggcaaa atggccagag tgttctcagt tctcagagaa
     1321 gagagtgaga gtgtcctgac actgaagggc ctgaccccaa ctggcatgct ccccagcgga
     1381 gtgctctctg gcgggaaaca gactctgcaa agcgctactg ttgaggctat tgaggctgat
     1441 gaagccatca aaggattttc accacaacat aagatcacta gcttcgagga ggccaagggc
     1501 ttagaccgaa ttaacgagag gatgccacct cgcagagacg ccatgccctc tgacgccaac
     1561 cttaactcca tcaacaaggc tctcgcctca gagactaacg gcacggacag caatggcagt
     1621 aatagcagca atatccagtg accacttcct gttcactttt tttttttttt gagctgcagg
     1681 gcatgatggg attgctgcat ctcagcagtt ggatgttctt gcctctgaag gtagcttgtt
     1741 tgctctgggg gccaggaatt ggattcagtt tacactatca tgaaaaataa aaataaaaaa
     1801 agagggagag agataataaa ctatattttg gtgagggtgg tgattaaaca cctcttttgg
     1861 gtatgccttt aaaaaatgct tctagggcaa aaaagtttta aaaagaaagc taatgctagc
     1921 tatactgcaa tgttagggga atgaacgcgt tttcctactg cactggggac ttttagatag
     1981 gttaatgaaa ggcctttatt ctgttactgg acacgaaaac tttgtctaat ttcttatact
     2041 ctattgtacc tttacagtcg cagcactaaa atggaagaca tcaaacattt ttaacagaaa
     2101 aaaaaaaaga tgtaaaaact aactaaggac tatttattaa tgatgttttg ctactcctgt
     2161 cagacaatgg ctataaactg aattaggcag tctt
//
LOCUS       RATGGLUT     1060 bp ds-DNA             ROD       21-JUL-1990
DEFINITION  Rat gamma-glutamyltransferase gene, 5' promoter region.
ACCESSION   J05515
KEYWORDS    gamma-glutamyltransferase.
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1060)
  AUTHORS   Rajagopalan,S., Park,J.H., Patel,P., Lebovitz,R.M. and
            Lieberman,M.W.
  TITLE     Cloning and analysis of the rat gamma glutamyltransferase gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Rajagopalan, 25-MAY-1990.
BASE COUNT      235 a    277 c    275 g    273 t
ORIGIN      
        1 cagctgcctt ctggaggacc aaactgttca ggggaaggac aggaagaaat gagcctgtgc
       61 cttcaggtca gagtcatgcc tagatctggg cgggagagct acaagggata ctgaccagga
      121 gatagggtgt tgtcccctcc cccctggggt ttggtatcct cctctgcctt aagagttgca
      181 aatcgacttt cccacataac aggcaccaaa tccagttagg accaacccca ccttccaatc
      241 caggggagag gaatgtcagc aatgcgtggg cgtgtccttc taatgtgttt tccttgagtg
      301 ttgtatgtgg accatctgca tgctcggtac ccagaggcca tcaggtctct tggaacagga
      361 attgttgatg tgaaatgcca tgtggttgct gggataggaa ctcaggactc cggaagaacc
      421 ttctcttctc cagtccccct ctgttgtttt tttttttttt ttttttgaga tacgatctca
      481 cactgtagca caggctaatc cagaactcac taggtaggtc agactgggct caaatcacag
      541 cgattctgct tctgcttcct gagtgccagg gtttgcaggt gttagctatc atgcccagtc
      601 ttaacatttc acacacgcca gtccaagtta ttaaaaaaca acccggcagt tgagggcagg
      661 gccctcaagt cccacaactg gtgcgtgcgt accaagtcca atgcgggaaa ggcctggacc
      721 cttgaaccct ttgggcggtt cacttgttag ctcttactac caaatcctgg gcttacacat
      781 gaatgccagc ccctccctgc ccagttctgt gacccccttc cccgggcagc tcttgggaga
      841 agtcatgcat acatggaggc ggtgccagcc tctttgactc cagagttcag cgggagacag
      901 agggagctca tcacatcagg caccccagaa gagttctggg cctgcttcac gtttaacttt
      961 gtgattttca ggagtaccag cctgctctaa cggtttcagg gaagattggc tgtgggtttc
     1021 cgcagagtgt gggggagttc ctgcttatcc atacagctga
//
LOCUS       YSCRGI        130 bp ds-DNA             PLN       21-JUL-1990
DEFINITION  Yeast 35S ribosomal RNA gene, 5' end.
ACCESSION   M35269
KEYWORDS    35S ribosomal RNA.
SOURCE      Yeast DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 130)
  AUTHORS   Riggs,D.L. and Nomura,M.
  TITLE     Specific transcription of Saccharomyces cerevisiae 35 S rDNA by RNA
            polymerase I in vitro
  JOURNAL   J. Biol. Chem. 265, 7596-7603 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA        101  >   130     35S ribosomal RNA
BASE COUNT       48 a      9 c     43 g     30 t
ORIGIN      
        1 agagggcaaa agaaaataaa agtaagattt tagtttgtaa tgggaggggg ggtttagtca
       61 tggagtacaa gtgtgaggaa aagtagttgg gaggtacttc atgcgaaagc agttgaagac
      121 aagttcgaaa
//
LOCUS       ACNRRDA      1486 bp ss-rRNA            RNA       21-JUL-1990
DEFINITION  A.lignieresii 16S ribosomal RNA.
ACCESSION   M35017
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      A.lignieresii (strain ATCC 19393) rRNA.
  ORGANISM  Actinobacillus lignieresii
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 1486)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1486     16S ribosomal RNA
BASE COUNT      384 a    303 c    448 g    321 t     30 others
ORIGIN      
        1 attnannant ttnatcangg ctcagattna ncgctggcgg caggcttaac acatgcaagt
       61 cgaacggtaa cgggaagggn ncttncnttc tttgccgacg agtggcggac gggtgagtaa
      121 tgcttgggaa tntggcttat ggagggggat aactacggga aactgtagct aataccgcgt
      181 aatatcttag gattaaaggg tgggactttc gggccacctg ccataagatg agcccaagtg
      241 ggattaggta gttggttagg taaaggctga ccaagccgac gatctctagc tngtctgaga
      301 ggatgaccag ccacactnga actgagacac ggtccagact cctacgggag gcagcagtng
      361 ggaatattgc acaatggggg gaaccctnat gcagccatgc cgcgtgaatg aagaaggcct
      421 tcgggttgta aagttctttc ggtagcgagg aaggtatcaa atttnataga tttngtaatt
      481 gacgttaact acagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag
      541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggtt nattaagtga
      601 gatgtgaaag ccccgggctt aacctgggna ttgcatttca tactggtcaa ctagagtact
      661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc
      721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcgaaagc gtggggagca
      781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct
      841 ntnagtctgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag
      901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg
      961 atgnaacgcg aagaacctta cctactcttg acatccatgg aatcttgtag agatacgaga
     1021 gtgccttcgg gaaccatgag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat
     1081 gttgggttna gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attnggtcgg
     1141 gaactcnaag gagactgccg gtgatnaacc ggaggaaggt ggggatgacg tcaagtcatc
     1201 atggccctta cgagtagggc tacacacgtg ctacaatggc gtatacagag ggaagcaaga
     1261 tggcgacatg gagcaaatct cacaaagtac gtctaagtcc ggattggagt ctgcaactcg
     1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc
     1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc
     1441 ttaaccgcaa ggggggcgtt taccacggta tgattcatga ctnggg
//
LOCUS       BNORRDA      1532 bp ss-rRNA            RNA       21-JUL-1990
DEFINITION  D.nodosus 16S ribosomal RNA.
ACCESSION   M35016
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      D.nodosus (strain 198A; ATCC 27521) rRNA.
  ORGANISM  Bacteroides nodosus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Anaerobic gram-negative straight, curved and helical rods; 
            Bacteroidaceae.
REFERENCE   1  (bases 1 to 1532)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1532     16S ribosomal RNA
BASE COUNT      410 a    330 c    465 g    324 t      3 others
ORIGIN      
        1 aactgaagag tttgattctg gctcagattg aacgctggcg gcatgcttaa cacatgcaag
       61 tcgaacgggg ttatgtagct tgctatgtaa cctagtggcg gacgggtgag taatatatag
      121 gaatctgcct tatggtgggg gacaacgtat ggaaacgtac gctaataccg cataagattg
      181 aagaatgaaa gcgggggctc gaaagacctc gcgccgtaag atgagcctat atcggattag
      241 ctagttggtg gggtaagagc ctaccaaggc gacgatccgt agctggtttg agagaatgat
      301 cagccacatc gggactgaga cacggcccna actcctacgg gaggcagcag tggggaatat
      361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt
      421 gtaaagcact tttattagtg aagaacggtg catggttaat acccatgcaa ttgacattag
      481 ctaaggaaaa agcaccggct nactccgtgc cagcagccgc ggtaatacgg agggtgcaag
      541 cgttattcgg aatgactggg cgtaaagcgc acgcaggtgg ttttataagt caggtgtgaa
      601 atccctgggc tcaacctagg aattgcattt gaaaccgtaa gactagagta tgtcagagga
      661 aggcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga
      721 aggcagcctt ctggggcaat actgacactc atgtgcgaaa gcgtgggtag caaacaggat
      781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggtgttgggt agaaatactc
      841 ggtaccgaag ctaacgcaat aagttgaccg cctggggagt acggccgcaa ggttgaaact
      901 caaagaaatt gacggggacc cgcacaagcg gtggagcatg tggtttaatt cgatgcaacg
      961 cgaagaacct taccaggctt tgacatccag agaattctac agagatgtgg gagtgccttc
     1021 gggaactctg agacaggtgt tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt
     1081 aagtcccgca acgagcgcaa cccttatcct tacttgccag cacgtaaagg tgggaactat
     1141 aaggagactg ccggtgacaa accggaggaa ggtggggatg acgtcaagtc atcatggccc
     1201 ttacagcctg ggctacacac gtgctacaat gggcggtaca gacggtagcc aacccgcgag
     1261 ggggagccaa tctgagaaaa ccgttcgtag tccggattgc agtctgcaac tcgactgcat
     1321 gaagtcggaa tcgctagtaa tcgcgaatca gcacgtcgcg gtgaatacgt tcccgggtct
     1381 tgtacacact gcccgtcaca ccatgggagt ttgttgcacc agaagtaggt agcttaaccg
     1441 taaggagggc gcttaccacg gtgtggccga tgactggggt gaagtcgtaa caaggtaacc
     1501 gtaggggaac ctgcggttgg atcacctcct tt
//
LOCUS       CAJSAPA      3974 bp ds-DNA             BCT       21-JUL-1990
DEFINITION  C.fetus surface array protein (sapA) gene, complete cds.
ACCESSION   J05577
KEYWORDS    surface array protein.
SOURCE      C.fetus fetus DNA.
  ORGANISM  Campylobacter fetus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 1 to 3974)
  AUTHORS   Blaser,M.J. and Gotschlich,E.C.
  TITLE     Surface array protein of Campylobacter fetus: Cloning and gene
            structure
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.J.Blaser, 01-JUN-1990.
FEATURES       from  to/span     description
    pept         24     2825     surface array protein (sapA)
    binding      12       15     ribosomal binding site
    signal     2846     2870     transcription termination signal
BASE COUNT     1293 a    641 c    785 g   1255 t
ORIGIN      
        1 ctattttatt aaggagtcct taaatgttaa acaaaacaga tgtttcaatg ctttatatca
       61 ctattatggg tatggcaagt gagggtgatg gtaataagta ttggttagat tatgccaata
      121 ataatagttt aggagtttca agtttagcta atattatgct tgatagtcca ggggcggcta
      181 aattctttgg tgattctctt ttagcaggta atgaaaaaga ttttgttact aagatatata
      241 gtatagcttt aggtaatact agtgatgttg atggcattaa ttattggact aaggcaataa
      301 ctggcggtgg agaatttact gatagtaagg gtaatgttat tagtgttgct agtttaagca
      361 agggtgattt aataggtgct atgattaact ctatggttaa tggcggtagt gctgagtcta
      421 aggctatatt tgaggctaag gcagctgcta gtgattactt tgccgatgct actttggtaa
      481 gggatattag tggattagat gagggtacta cttctaagtt aattagcgag attaatagtg
      541 ctagtgatct tgataaggtt aagagtgaga ttgatgcttt gaagagtgag ctacctaatc
      601 cgggtagtac ttatgatctt acagagggta atgataattt aaagggtact gatttagacg
      661 atacttttaa tgggactaca tatgtaggta atggtactaa taagagtact cttagtgcat
      721 ttgataagac tagatggtcg gtgcttggga gagatacgtt gaatgcgata tttactgcaa
      781 taacacgcgc tgcgctacta actgatcaag ctgaattgat aatcactaaa aggcgtacaa
      841 acgtagaaaa tatcaatata atttcagatc tagaaacaag tggcgatttc gttttcaacg
      901 gttatgaaaa agtaggattt aacgtacttg gcgatatagt tagctttgct accgacgcat
      961 ctaaaagcgt aaatgtagaa acaacaggaa cgataactgc tttcaccgca gccggaacag
     1021 gcaaagtcga tgttgtcgcc ggtaaaatct ctgcccttac ggccgattcg cgaacaagcg
     1081 taaatttaac tgctacaaac gacactatca cattaaccag tgcaaacgct gctactagtg
     1141 tgaatttaaa acagcggcag gccaaagacg ctacaataac atccgcaatg cagcaaaaat
     1201 ataacaatag acgcaacagg attgcaacta taacttcagc tacggctgta gagaatttga
     1261 cagttaaaca tgcaactaac gtagcgctaa atggtggcat ggataaactt gcaacagtta
     1321 ctcttgacaa tgctgcttta actgctgcaa tagatataaa atctgcaagc acactaaatt
     1381 taataaattc aagtgttaac ggaccaaaac atctctacag cagcaaaaga cgttactgta
     1441 aatttaagcg agctgctgct aaggttaaat taaacacaac tgctgctacg gatcaaactg
     1501 tgactcttaa agctaatgcc acagacaata gtcttgaatt tgatagcgca actgcaaaaa
     1561 ctacgtcagt cactgctagt ggtagtggaa aaactttggt tatcaaaggt gctgaagtag
     1621 aaactctagt taatatagac actactgcgt tcaatgcgct gcagagtgtt agcttcggca
     1681 aaactggtca aggcggcaaa tttagcgtaa aaacaggtac aggcgatgat aaaatagagt
     1741 ttgtaggcac aactttaact gaaggttcag taatcgatgc gccgggtaat gatactatag
     1801 ctatgaaatc agctgctttg actagtgcaa atttcactat gatcaaaaac atagagaatg
     1861 ttgctattag cgacgcggtg gctacggctg accttagctc ttctgcgttt aaaaactcag
     1921 tcatcataac tactaaagag gctgccgata cgactttaac tataaataaa gaccaagtaa
     1981 ttaattttac tgcggcagat gcaggctcag ttaagcttat tacagtgaaa ctaaatgatg
     2041 taacagcgct aatgattgtt gttaaaatag tattagatgc tgccgctaaa gatactaata
     2101 tagctttggg cacagcagca gccgataaag ctcttgtaat tgatacgggt atagaaactc
     2161 ttaatatcac ttcgttagtt aaagctacaa gccctgaaac tacagcaaat accgtaaatg
     2221 ctaaattgac agatgtaact tctatcatta tagatgggat gcaaattact ttaggtcatg
     2281 ccggcactgc tggaactgac tatagtaaag tttctatgat agatgcaagt gctcttaaag
     2341 ctggtcttac atttgatgca agtgctataa cattaggagc taatgctact ataaaaggtg
     2401 gctctggagc tgatagtata actgttaaag gcggaaatat cgttgtagat ttagttgctg
     2461 gcggagacga tactataact cttaaaaaag gtgctgagaa aacagatatt acaaccgtta
     2521 ataactttaa tgctggcgat aagatagata tagctgatgc caaaaatggt actttcacat
     2581 ttaataaaat tacaatgaat agcgatgcta atttagatga ttatatcact aaagcagtag
     2641 caggagacgg atcaacaaac tctgctgtaa gctactcaca taatggatat acttatgtag
     2701 tagttgatgg aacagctgga gctactttca ctaaagctac tgatacaatt atcaaacttt
     2761 caggtacgct agatcttaaa cttagcggag ataatgttgt agttgatgat ggaagcgtaa
     2821 tttaagcaat agtaattgct tgataaaaaa gccttaattt ttaaggcttt taatttgtct
     2881 aaatttatct tagcttaggc gtatttattt gaaatctcgt cagacggatc gtcttaaaat
     2941 aaggttttat agagacaaag tatattttag accaaacctt aataaatagt agatagtttt
     3001 ttgctaagag gagctaggag aactagctcc ttgaatattt ttaaatttat ttgagttgta
     3061 ttttaaattc aacatatctg ttttttgatg ttttgatcga aatttgtcta ttgtatccat
     3121 cctaaaaatt tcagctcttc aactcttttt gtgtgaaatt gttctaataa ggttgataaa
     3181 ttttcatatt tgtcacttgg aacaagaaaa ctctcgtctc cgactctttt caccaatttt
     3241 tgtttataaa actttaaatt taaaggtttt ataagatcta tatatgaact aaactcatct
     3301 ttatagctat tagctatact aaaagcgatc tgcttgaact ctttgaaatc aacgtttaaa
     3361 attttacaaa gcggcttaaa atcgtttagc ctaatgtttt gacgcttacc gtttataggt
     3421 aaatagctct caaatccttt tgccgtacta taaaatcccg ctgctcatca cgtgcataca
     3481 taggcgcatc agcactatat ctccgcttgt tataagagaa agatttttca gcatgcagat
     3541 cttccgtatt ttatgataac gctataaaca tatatcttca aaagctcctt cttttcagta
     3601 tctgatagta gctctttgct tatgcgctta aaaagctttt cgctggttgt gtcatattta
     3661 tttatactag tagtcctaga tacgaggcaa actcgacgac gttataatga tgattttcta
     3721 gtttatcaaa tcttttaatt acgtagtgat actcgtcgtc atctcctgcc actacagcac
     3781 ttcacggaac tctgattttc aacgtatttt tagcaaaact caaatgaaga tgctcattta
     3841 tcagcgaggt gcggaaaata acttttgaac tggttggatt tgatctgatt ttcgaatatg
     3901 gttttaaaat atagtttgaa gcagcgtctt gatggattat ctttcgttca aaatcgatat
     3961 ttactaattt tttg
//
LOCUS       CDBRRDA      1470 bp ss-rRNA            RNA       21-JUL-1990
DEFINITION  C.hominis 16S ribosomal RNA.
ACCESSION   M35014
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      C.hominis (strain ATCC 16826) rRNA.
  ORGANISM  Cardiobacterium hominis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods.
REFERENCE   1  (bases 1 to 1470)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1470     16S ribosomal RNA
BASE COUNT      373 a    331 c    466 g    296 t      4 others
ORIGIN      
        1 actgaagagt ttgattctgg ctcagattga acgctggcgg catgcttaac acatgcaagt
       61 cgaacggaaa cgatggagct tgctccaggc gtcgagtggc gaacgggtga gtaacgcatg
      121 ggaatctgcc ttttgctggg ggataacgta gggaaactta cgctaatacc gcataacacc
      181 taagggtgaa agcgggggac cgaaagcctc gcggcaagag atgagcccat gttggattag
      241 ctagttggtg gggtaaaggc ctaccaaggc gacgatccat agctggtctg agaggatgat
      301 cagccacact gggactgaga cacggcccag actccnacgg gaggcagcag tggggaatat
      361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt
      421 gtaaagcact ttcagtaggg aggaaaggtg cgtagttaat acctgcgcaa ttgacgttac
      481 ctacagaaga agcaccggct aactccgtgc cagcagccgc ggtaatacgg agggtgcgag
      541 cgttattcgg aattactggg cgtaaagcgc acgcaggcgg ttgcccaagt cagatgtgaa
      601 agccccgggc ttaacctggg aactgcattt gaaactgggc gactagagta tgaaagagga
      661 aagcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga
      721 aggcagcttt ctgggtcgat actgacgctc atgtgcgaaa gcgtggggag caaacaggat
      781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggcgtcgggt tgttaaagac
      841 tcggtgcngg agctaacgca ttaagttgac cgcctgggga gtacggccgc aaggttgaaa
      901 ctcaaagaaa ttgacgggga cccgcncaag cggtggagca tgtggtttaa ttcgatgcaa
      961 cgcgaagaac cttaccaggc cttgacatcc taggaacttg gcagagatgc cttggtgcct
     1021 tcgggaacct agagacaggt gttgcatggc tgtcgtcagc tcgtgtcgtg agatgttggg
     1081 ttaagtcccg caacgagcgc aacccttatc cttacttgcc agcactttgg gtgggaactg
     1141 taaggagact gccggtgaca aaccggagga aggtggggat gacgtcaagt catcatggcc
     1201 cttacggcct gggctacaca cgtgctacaa tggtcggtac agacggtagc gaagccgcga
     1261 ggtggagcca atctgagaaa gccgatcgta gtccggattg cagtctgcaa ctcgactgca
     1321 tgaagtcgga atcgctagta atcgcgaatc agcatgtcgc ggtgaatacg ttcccgggtc
     1381 ttgtacacac tgcccgtcac accatgggag tttgttgcac cagaagcagg tagcttaact
     1441 taggagggcg cttgccacgg tgtggccgat
//
LOCUS       GLARGBA       763 bp ds-DNA             INV       21-JUL-1990
DEFINITION  G.lamblia 5.8S rRNA gene.
ACCESSION   M35013
KEYWORDS    5.8S ribosomal RNA; large subunit rRNA; small subunit rRNA.
SOURCE      G.lamblia DNA, clones B4 and C4.
  ORGANISM  Giardia lamblia
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE   1  (bases 1 to 763)
  AUTHORS   Edlind,T.D., Sharetzsky,C. and Cha,M.E.
  TITLE     Ribosomal RNA of the primitive eukaryote Giardia lamblia: Large
            subunit domain I and potential processing signals
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.D.Edlind, 04-JUN-1990.
FEATURES       from  to/span     description
    rRNA        177      321     5.8S ribosomal RNA
    rRNA    <     1      146     small subunit ribosomal RNA
    rRNA        372  >   763     large subunit ribosomal RNA
BASE COUNT      121 a    293 c    293 g     56 t
ORIGIN      
        1 gtacacaccg cccgtcgctc ctaccgactg ggcgcggcgg cgagcgcccc ggacgcgcga
       61 agggccgcga gcccccgcgc ctggaggaag gagaagtcgt aacaaggtat ccgtaggtga
      121 acctgcggat ggatccctcg cgcgccccgc gcgtcgcccc cgcggcccgg tcggcccccg
      181 aacgccccgc cggcggatgc ctcggcccgg gcggcgacga agagcgcggc ggagcgcgag
      241 acgcggtgcg gacccgcccg ccccgagaag caccgaccct cgaacgcagc gcgccccggc
      301 gccgccgcct cggcgcccgc cgcgtgccgc gcgccgcgcc gcagagagcg ccccggggcg
      361 gtcccgccgg gctgcgcggc ccgaggcggc gggggcgacg ggcggaactt aagcatatca
      421 gtacgccccg gaggagaaac caaccgggat tccccgtagc ggcgagcgac gcgggaggag
      481 cccgccccga aggcgcgctg tggggcgcag gcgcaggccc gccgcgaggg ggcccgaggg
      541 ccccgcccga gagggtgcaa gccccgtacg gcggccggcg cctgcgcgcg agtagcgctg
      601 cttgagcgtg cagcgcggaa gggaggcgcg gcccttccaa ggctaaatac gccccgggac
      661 cgatagcgga ccaagtagcg cgagcgaacg gtgaaaagga cgcctgcggc cgctcaaaag
      721 acctgaaccg ccgccgcgcc gccggccccg tctcgaaaca cgg
//
LOCUS       HEIRRDA      1472 bp ss-rRNA            RNA       21-JUL-1990
DEFINITION  H.influenzae 16S ribosomal RNA.
ACCESSION   M35019
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      H.influenzae (strain ATCC 33391) rRNA.
  ORGANISM  Haemophilus influenzae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 1472)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1472     16S ribosomal RNA
BASE COUNT      376 a    300 c    459 g    311 t     26 others
ORIGIN      
        1 attgaagagt ttgatcatgg ctcagattga acgctggcgg caggcttaac acatgcaagt
       61 cgaacggtag caggagaaag cttgctttct tgctgacgag tggcggacgg gtgagtaatg
      121 cttgggaatc tggcttatgg agggggataa cgacgggaaa ctgtcgctaa taccgcgtat
      181 tatcggaaga tgaaagtgcg ggactgagag gccgcatgcc ataggangag cccaagtggg
      241 attaggtagt tggtggggta aatgcctacc aagcctncga tctctagctg gtctgagang
      301 atgaccagcc acactggaac tgagacacgg tccagactcc tacgggaggc agcagtgggg
      361 aatattgcgc natgggggga accctgacgc agccatgccg cgtgaatgaa gaaggccttc
      421 gggttgtaaa gttctttcgg tattgaggaa ggttgatgtg ttaatagcac atcaaattga
      481 cgttaaatac agaagaagca ccggctnact ccgtgccagc agccgcggta atacggaggg
      541 tgcgagcgtt aatcggaata actgggcgta aagggcacgc aggcggttat ttaagtgagg
      601 tgtgaaagcc ccgggcttaa cctgggnatt gcatttcaga ctgggtaact agagtacttt
      661 agggaggggt agaattccac gtgtagcggt gaaatgcgta gagatgtgga ggaataccga
      721 aggcgaaggc agccccttgg gaatgtactg acgctcatgt gcgaaagcgt ggggagcaaa
      781 caggattaga taccctggta gtccacgctg taaacgctgt cgatttnggg gttggggttn
      841 aactctggca cccgtagcta acgtgataaa tcgaccgcct ggggagtacg gccgcaaggt
      901 taaaactcaa atgaattgac gggggccngc ncaannggtg gagcatgtgg tttaattcga
      961 tgnaacgcga agaaccttac ctactcttga catcctaaga agagctcaga gatgagcttg
     1021 tgccttcggg aacttagaga caggtgctgc atggctgtcg tcagctcgtg ttgtgaaatg
     1081 ttgggttnag tcccgcaacg agcgcaaccc ttatcctttg ttgccagcga cttggtcggg
     1141 aactcaaagg agactgccag tgataaactg gaggaaggtn gggatgacgt caagtcatca
     1201 tggcccttac gagtagggct acacacgtgc tacaatggcg tatacagagg gaagcgaagc
     1261 tgcgaggtgg agcgaatctc ataaagtacg tctaagtccg gattggagtc tgcaactcga
     1321 ctccatgaag tcggaatcgc tagtaatcgc gaatcagaat gtcgcggtga atacgttccc
     1381 gggccttgta cacaccgccc gtcacaccnt gggagtgggt tgtaccagaa gtagatagct
     1441 naacctnnnn gagggcgtnn ncnacggtan ga
//
LOCUS       HUMIBSUB     3110 bp ss-mRNA            PRI       21-JUL-1990
DEFINITION  Human integrin beta-5 subunit mRNA, complete cds.
ACCESSION   M35011
KEYWORDS    integrin.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3110)
  AUTHORS   Suzuki,S., Huang Z,-S. and Tanihara,H.
  TITLE     Cloning of a new integrin beta subunit exhibiting high homology
            with integrin beta-3 subunit
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Suzuki, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         30     2420     integrin beta-5 subunit
BASE COUNT      726 a    809 c    887 g    688 t
ORIGIN      
        1 cgcgccgccg ctgagggagg cgccccacca tgccgcgggc cccggcgccg ctgtacgcct
       61 gcctcctggg gctctgcgcg ctcctgcccc ggctcgcagg tctcaacata tgcactagtg
      121 gaagtgccac ctcatgtgaa gaatgtctgc taatccaccc aaaatgtgcc tggtgctcca
      181 aagaggactt cggaagccca cggtccatca cctctcggtg tgatctgagg gcaaaccttg
      241 tcaaaaatgg ctgtggaggt gagatagaga gcccagccag cagcttccat gtcctgagga
      301 gcctgcccct cagcagcaag ggttcgggct ctgcaggctg ggacgtcatt cagatgacac
      361 cacaggagat tgccgtgaac ctccggcccg gtgacaagac caccttccag ctacaggttc
      421 gccaggtgga ggactatcct gtggacctgt actacctgat ggacctctcc ctgtccatga
      481 aggatgactt ggacaatatc cggagcctgg gcaccaaact cgcggaggag atgaggaagc
      541 tcaccagcaa cttccggttg ggatttgggt cttttgttga taaggacatc tctcctttct
      601 cctacgcggc accgaggtac cagaccaatc cgtgcattgg ttacaagttg tttccaaatt
      661 gcgtcccctc ctttgggttc cgccatctgc tgcctctcac agacagagtg gacagcttca
      721 atgaggaagt tcggaaacag agggtgtccc ggaaccgaga tgcccctgag gggggctttg
      781 atgcagtact ccaggcagcc gtctgcaagg agaagattgg ctggcgaaag gatgcactgc
      841 atttgctggt gttcacaaca gatgatgtgc cccacatcgc attggatgga aaattgggag
      901 gcctggtgca gccacacgat ggccagtgcc acctgaacga ggccaacgag tacactgcat
      961 ccaaccagat ggactatcca tcccttgcct tgcttggaga gaaattggca gagaacaaca
     1021 tcaacctcat ctttgcagtg acaaaaaacc attatatgct gtacaagaat tttacagccc
     1081 tgatacctgg aacaacggtg gagattttag atggagactc caaaaatatt attcaactga
     1141 ttattaatgc atacaatagt atccggtcta aagtggagtt gtcagtctgg gatcagcctg
     1201 aggatcttaa tctcttcttt actgctacct gccaagatgg ggtatcctat cctggtcaga
     1261 ggaagtgtga gggtctgaag attggggaca cggcatcttt tgaagtatca ttggaggccc
     1321 gaagctgtcc cagcagacac acggagcatg tgtttgccct gcggccggtg ggattccggg
     1381 acagcctgga ggtgggggtc acctacaact gcacgtgcgg ctgcagcgtg gggctggaac
     1441 ccaacagcgc caggtgcaac gggagcggga cctatgtctg cggcctgtgt gagtgcagcc
     1501 ccggctacct gggcaccagg tgcgagtgcc aggatgggga gaaccagagc gtgtaccaga
     1561 acctgtgccg ggaggcagag ggcaagccac tgtgcagcgg gcgtggggac tgcagctgca
     1621 accagtgctc ctgcttcgag agcgagtttg gcaagatcta tgggcctttc tgtgagtgcg
     1681 acaacttctc ctgtgccagg aacaagggag tcctctgctc aggccatggc gagtgtcact
     1741 gcggggaatg caagtgccat gcaggttaca tcggggacaa ctgtaactgc tcgacagaca
     1801 tcagcacatg ccggggcaga gatggccaga tctgcagcga gcgtgggcac tgtctctgtg
     1861 ggcagtgcca atgcacggag ccgggggcct ttggggagat gtgtgagaag tgccccacct
     1921 gcccggatgc atgcagcacc aagagagatt gcgtcgagtg cctgctgctc cactctggga
     1981 aacctgacaa ccagacctgc cacagcctat gcagggatga ggtgatcaca tgggtggaca
     2041 ccatcgtgaa agatgaccag gaggctgtgc tatgtttcta caaaaccgcc aaggactgcg
     2101 tcatgatgtt cacctatgtg gagctcccca gtgggaagtc caacctgacc gtcctcaggg
     2161 agccagagtg tggaaacacc cccaacgcca tgaccatcct cctggctgtg gtcggtagca
     2221 tcctccttgt tgggcttgca ctcctggcta tctggaagct gcttgtcacc atccacgacc
     2281 ggagggagtt tgcaaagttt cagagcgagc gatccagggc ccgctatgaa atggcttcaa
     2341 atccattata cagaaagcct atctccacgc acactgtgga cttcaccttc aacaaatcct
     2401 acaatggcac tgtggactga tgtttccttc tccgaggggc tggagcgggg atctgatgaa
     2461 aaggatcaga ctgaaacgcc ttgcacggct gctcggcttg atcacagctc cctaggtagg
     2521 caccacagag aagaccttct agtgagcctg ggccaggagc ccacagtgcc tgtacaggaa
     2581 ggtgcctggc catgtcacct ggctgctagg ccagagccat gccaggctgc gtccctccga
     2641 gcttgggata aagcaagggg accttggcgc tctcagcttt ccctgccaca tccagcttgt
     2701 tgtcccaatg aaatactgag atgctgggct gtctctccct tccaggaatg ctgggccccc
     2761 agcctggcca gacaagaaga ctgtcaggaa gggtcggagt ctgtaaaacc agcatacagt
     2821 ttggcttttt tcacattgat catttttata tgaaataaaa agatcctgca tttatggtgt
     2881 agttctgagt cctgagactt ttctgcgtga tggctatgcc ttgcacacag gtgttggtga
     2941 tggggctgtt gagatgcctg ttgaaggtac atcgtttgca aatgtgagtt tcctctcctg
     3001 tccgtgtttg tttagtactt ttataatgaa aagaaacaag attgtttggg attggaagta
     3061 aagattaaaa ccaaaagaat ttgtgtttgt ctgataaaaa aaaaaaaaaa
//
LOCUS       KINRRDAA     1474 bp ss-rRNA            RNA       21-JUL-1990
DEFINITION  K.indologenes 16S ribosomal RNA.
ACCESSION   M35015
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      K.indologenes (strain ATCC 25869) rRNA.
  ORGANISM  Kingella indologenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1474)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1474     16S ribosomal RNA
BASE COUNT      375 a    322 c    451 g    315 t     11 others
ORIGIN      
        1 ttgaactgaa gagtttgatt ctggctcaga ttgaacgctg gcggcatgct taacacatgc
       61 aagtcgaacg agggaagcag cttgctgcng acctngtggc ggacgggtga gtaacgtata
      121 ggaatctacc ttggggtggg ggataacgta tggaaacgta cgctaatacc acataacact
      181 tacgagtcaa agcgggggat cttcggacct cgcgccctta gatgagccta tattggatta
      241 gctagttggt rgggtaaagg cctaccaagg cgacgatcca tagctggttt gagaggatga
      301 tcagccacat cgggactgag acacggcccn nactcctacg ggaggcagca gtggggaata
      361 ttggncnatg ggggcaaccc tgatccagca atgccgcgtg tgtgaagaag gccttngggt
      421 tgtaaagcac tttcgttagt gaagaaaggt ttatggttaa tacctgtgaa tttgacatta
      481 gctaaagaag aagcaccggc taactccgtg ccagcagccg cggtaatacg gagggtgcna
      541 gcgttattcg gaatgactgg gcgtaaagcg cacgcaggcg gttatttaag tcagatgtga
      601 aagccccggg cttaacctgg gnattgcatt tgaaactggg taactagagt atgtgagagg
      661 aaggcggaat ttccagtgta gcagtgaaat gcgtagagat tggaaggaac accgatggcg
      721 aaggcagcct tctggggcaa tactgacgct catgtgcgaa agcgtgggta gcaaacagga
      781 ttagataccc tggtagtcca cgccctaaac gatgtcaact aggtgttggg cacttaagtg
      841 ctcggtaccg cagctaacgc attaagttga ccgcctgggg agtacggccg caaggttgaa
      901 actcaaagaa attgacgggg acccgcacaa gcggtggagc atgtggttta attcgatgca
      961 acgcgaagaa ccttaccagg ccttgacatc cagagaagtt accagagatg gtttcgtgcc
     1021 ttcgggaact ctgagacagg tgttgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg
     1081 gttaagtccc gcaacgagcg caacccttat ccttacttgc cagcacttcg ggtgggaact
     1141 ataaggagac tgccggtgac aaaccggagg aaggtgggga tgacgtcaag tcatcatggc
     1201 ccttacggcc tgggctacac acgtgctaca atggtcggta cagacggttg ccaacccgcg
     1261 agggggagct aatctgagaa agccgatcgt agtccggatt gcactctgca actcgagtgc
     1321 atgaagtcgg aatcgctagt aatcgcgaat cagaacgtcg cggtgaatac gttcccgggt
     1381 cttgtacaca ctgcccgtca caccatggga gtgggttgca ccagaagtag ctagcttatc
     1441 agggcggtta ccacggtgtg attcangact aggg
//
LOCUS       MUSTNFBG     3219 bp ds-DNA             ROD       21-JUL-1990
DEFINITION  Mouse tumor necrosis factor-beta (lymphotoxin) gene.
ACCESSION   Y00137
KEYWORDS    lymphotoxin; signal peptide; tumor necrosis factor.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3219)
  AUTHORS   Gray,P.W., Chen,E., Tang,W.-L. and Ruddle,N.
  TITLE     The murine tumor necrosis factor-beta (lymphotoxin) gene sequence
  JOURNAL   Nucleic Acids Res. 15, 3937-3937 (1987)
  STANDARD  simple automatic
REFERENCE   2  (sites for [1])
  AUTHORS   Fashena,S.J., Tang,W.-L., Sarr,T. and Ruddle,N.H.
  TITLE     The murine lymphotoxin gene promoter: Characterization and negative
            regulation
  JOURNAL   J. Immunol. 145, 177-183 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept       1632     1727     lymphotoxin precursor, exon 2 (first expressed
                                 exon)
               1811     1910     lymphotoxin precursor, exon 3
               2135     2547     lymphotoxin precursor, exon 4
    sigp       1632     1727     lymphotoxin signal peptide
               1811     1813     lymphotoxin signal peptide
    matp       1814     1910     lymphotoxin
               2135     2547     lymphotoxin
    pre-msg     685     3121     lymphotoxin mRNA and introns (minor alt.)
    pre-msg     857     3121     lymphotoxin mRNA and introns (minor alt.)
    pre-msg     865     3121     lymphotoxin mRNA and introns (minor alt.)
    pre-msg    1114     3121     lymphotoxin mRNA and introns (minor alt.)
    pre-msg    1179     3121     lymphotoxin mRNA and introns (major alt.)
    IVS        1286     1622     put. intron I
    IVS        1728     1810     intron II
    IVS        1911     2134     intron III
    signal     1153     1158     put. TATA-box
BASE COUNT      724 a    971 c    744 g    780 t
ORIGIN      
        1 tgaaagctcc ctctgtacag agcattggaa gcctggggtg tacatttggg gttacatgat
       61 cttggggttc taagagaata cccccaaatc atcttccaga cctggaacat tctaggacag
      121 ggttctcaac cttcctaact ccatgaccct ttaatacagt tcctcatgtt gtggtgaccc
      181 caaccataca attattttcg ttgctatttc ataactgtaa tttcgctgct attatgaaca
      241 taatgtaaat atttgtttta aatagaggtt tgccaaagag accttgccac aggttgagac
      301 tgccgctcca gagagtaagg gacacattaa aattgttaca caccagatcc cccaaatttg
      361 gggagagggc actgtaatgg aacttcttga cattaaactg gcagataaac tggcagaaaa
      421 aaaaaaaaaa aagctgggca gtggtggcac acacctttaa tcccagcact tgggaggcag
      481 aggcaggcgg atttctgagt tctaggccag cctggtcgac agagtgagtt tcaggacagc
      541 cagggctaca cagagaaacc ctgtctcgaa aaaagcaaaa aaaaaaaaaa aaaactggca
      601 gatgaccaga aaatacagat atattggaat aactgtgact tgaaccccca aagacaagag
      661 aggaaatagg cctgaagggg cggcaggcat gtcaagcatc cagagccctg ggttcgaacc
      721 tgaaaaaaca aaggtgccgc taaccacatg tggcttcgga gccctccaga catgaccatg
      781 atcgacagag agggaaatgt gcagagaagc ctgtgagcag tcaagggtgc agaagtgata
      841 taaaccatca ctcttcaggg aaccaggctt ccagtcacag cccagctgta ccctctccac
      901 gaattgctcg gccgttcact ggaactcctg ggcctgaccc agctccctgc tagtccctgc
      961 ggcccacagt tccccggacc cgactccctt tcccagaacg cagtagtcta agcccttagc
     1021 ctgcggttct ctcctaggcc ccagcctttc ctgccttcga ctgaaacagc agcatcttct
     1081 aagcctgggg cttccccaag ccccagcccc gacctagaac ccgcccgctg cctgccacac
     1141 tgccgcttcc tctataaagg gacccgacgc cagcgcccag gaccccgcac agcaggtgag
     1201 cctctcctac cctgtctcct tgggcttacc ctggtatcag gcatccctca ggatccccag
     1261 ccttaatggg tctggtcctc ctgtcgtggc tttgattttt ggtctgttcc tgtggcggcc
     1321 ttatcagtct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct
     1381 ctctctctct ctctttctct ctctctgcct ctgttagcca ttgtctgttt ctatggtgga
     1441 gctttcctct tcccctctgt ctctccttat ccctgctcac ttcagggttc ccctgcctgt
     1501 ccccttttct gtctgtcgcc ctgtctctca gggtggctgt ctcagctggg aggtaaggtc
     1561 tgtcttcctc tgtgtgcccc gcctccgcta cacacacaca ctctctctct ctctctcagc
     1621 aggttctcca catgacactg ctcggccgtc tccacctctt gagggtgctt ggcacccctc
     1681 ctgtcttcct cctggggctg ctgctggccc tgcctctagg ggcccaggtg aggcagcaag
     1741 agattggggg tgctggggtg gcctagctaa ctcagagtcc tagagtcctc tccactctct
     1801 tctgtcccag ggactctctg gtgtccgctt ctccgctgcc aggacagccc atccactccc
     1861 tcagaagcac ttgacccatg gcatcctgaa acctgctgct caccttgttg gtaaacttct
     1921 gcctccagag gagaggtcca gtccctgcct tttgtcctac ttgcccaggg gcccaggcga
     1981 tcttcccatc tccccacacc aacttttctt accctaaggg caggcacccc actcccaatc
     2041 tccctaccaa ccatcccact tgtccagtgc ctgctcctca gggatgggga cctctgatct
     2101 tgatagcccc ccaatgtctt gtgcctcttc ccagggtacc ccagcaagca gaactcactg
     2161 ctctggagag caagcacgga tcgtgccttt ctccgacatg gcttctcttt gagcaacaac
     2221 tccctcctga tccccaccag tggcctctac tttgtctact cccaggtggt tttctctgga
     2281 gaaagctgct cccccagggc cattcccact cccatctacc tggcacacga ggtccagctc
     2341 ttttcctccc aatacccctt ccatgtgcct ctcctcagtg cgcagaagtc tgtgtatccg
     2401 ggacttcaag gaccgtgggt gcgctcaatg taccaggggg ctgtgttcct gctcagtaag
     2461 ggagaccagc tgtccaccca caccgacggc atctcccatc tacacttcag ccccagcagt
     2521 gtattctttg gagcctttgc actgtagatt ctaaagaaac ccaagaattg gattccaggc
     2581 ctccatcctg accgttgttt caagggtcac atccccacag tctccagcct tccccactaa
     2641 aataacctgg agctctcacg ggagtctgag acacttcagg ggactacatc ttccccaggg
     2701 ccactccaga tgctcagggg acgactcaag cctacctaga agttcctgca cagagcaggg
     2761 tttttgtggg tctaggtcgg acagagacct ggacatgaag gagggacaga catgggagag
     2821 gtggctggga acaggggaag gttgactatt tatggagaga aaagttaagt tatttattta
     2881 tagagaatag aaagagggga aaaatagaaa gccgtcagat gacaactagg tcccagacac
     2941 aaaggtgtct cacctcagac aggacccatc taagagagag atggcgagag aattagatgt
     3001 gggtgaccaa ggggttctag aagaaagcac gaagctctaa aagccagcca ctgcttggct
     3061 agacatccac agggaccccc tgcaccatct gtgaaaccca ataaacctct tttctctgag
     3121 attctgtctg cttgtgtctg tcttgcgttg ggggagaaac ttcctggtct ctttaaggag
     3181 tggagcaggg gacagaggcc tcagttggcc atgggatcc
//
LOCUS       NGORRDAA     1478 bp ss-rRNA            RNA       21-JUL-1990
DEFINITION  N.denitrificans 16S ribosomal RNA.
ACCESSION   M35020
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      N.denitrificans (strain ATCC 14686) rRNA.
  ORGANISM  Neisseria denitrificans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1478)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1478     16S ribosomal RNA
BASE COUNT      380 a    331 c    448 g    308 t     11 others
ORIGIN      
        1 aacataagag tttgatcctg gctcagattg aacgctggcg gcatgcttta cacatgcaag
       61 tcgaacggca gcggagaagt gcttgcactt ctgccggcga gtggcgaacg ggtgagtaac
      121 atattggaac gtaccgagta atgggggata actaatcgaa agattagcta ataccgcata
      181 cgtcctgagg gagaaagcgg gggatcttcg gacctcgcgt tattcgagcg gccaatatct
      241 gattagctag ttggtggggt aaaggcctac caaggcgacg atcagtagcg ggtctgagag
      301 gatgatccgc cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg
      361 gaattttgga cgatgggcgc aagcctgatc cagccatgcc gcgtgtatga agaaggcctt
      421 cgggttgtaa agtacttttg tnngggaaga aaagggattt gctaataccg aatcctcatg
      481 acggtacctn aagaataagc accggctaac tacgtgccag nanccgcggn natacgtagg
      541 gtgcgagcgt taatcggaat tactgggcgt aaagcgagcg cagacggtta cttaagcagg
      601 atgtgaaatc cccgagctca acttgggaac tgcgttctga actgggtagc tagagtatgt
      661 cngagggggg tagaattcca cgtgtagcag tgaaatgcgt agagatgtgg aggaataccg
      721 atggcgaagg cagccccctg ggataatact gacgttcatg ctcgaaagcg tgggtagcaa
      781 acaggattag ataccctggt agtccacgcc ctaaacgatg tcgattagct gttggggtac
      841 ttgatacctt agtagcgtag ctaacgcgtg aaattgaccg cctggggagt acggtcgcaa
      901 gattaaaact caaaggaatt gacggggacc cgcacaagcg gtggatgatg tggattaatt
      961 cgatgcaacg cgaagaacct tacctggtct tgacatgtac ggaatctccc agagacggga
     1021 gngtgccttc gggagccgta acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag
     1081 atgttgggtt aagtcccgca acgagcgcaa cccttgtcat tagttgccat cattcagttg
     1141 ggcactctaa tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtcaagtcct
     1201 catggccctt atgaccaggg cttcacacgt catacaatgg tcggtacaga gggtagccaa
     1261 gccgcgaggt ggagccaatc tcacaaaacc gatcgtagtc cggattgcac tctgcaactc
     1321 gagtgcatga agtcggaatc gctagtaatc gcaggtcagc atactgcggt gaatacgttc
     1381 ccgggtcttg tacacaccgc ccgtcacacc ntgggagtgg gggataccag aagtaggtag
     1441 ggtaaccgca aggagcccnc ttaccacggt atacttca
//
LOCUS       PASRRDA      1481 bp ss-rRNA            RNA       21-JUL-1990
DEFINITION  P.multocida 16S ribosomal RNA.
ACCESSION   M35018
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      P.multocida (strain NCTC 10322) rRNA.
  ORGANISM  Pasteurella multocida
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 1481)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1481     16S ribosomal RNA
BASE COUNT      374 a    305 c    459 g    321 t     22 others
ORIGIN      
        1 attgnagagt ttgancangg ctcagattga ncgctggcgg caggcttaac acatgcaagt
       61 cgaacggtag caggaagaaa gcttnctttc tttgctgacg agtggcggac gggtgagtaa
      121 tgcttgggna tctggcttat ggagggggat aactgtggga aactgcagct aataccgcgt
      181 attctcttag gaggaaaggg tgggaccttn gggccacctg ccataagatg agcccaagtg
      241 ggattaggta gttggtgggg taaaggccta ccaagcctnc gatctctagc tngtctgaga
      301 ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg
      361 ggaatattgc gcaatggggg gaaccctgac gcagccatgc cgcgtgaatg aagaaggcct
      421 tcgggttgta aagttctttc ggtnatgagg aagggatgtt nttaaataga tagcatcatt
      481 gacgttaatt ncagaagaag caccggctna ctccgtgcca gcagccgcgg taatacggag
      541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggac ttttaagtga
      601 gatgtgaaat ccccgagctt aacttgggaa ctgcatttca gactgggagt ctagagtnct
      661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc
      721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcnaaagc gtggggagca
      781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct
      841 ntntgcttgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag
      901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg
      961 atgnaacgcg aagaacctta cctactcttg acatcctaag aagagctcag agatgagctt
     1021 gtgccttcgg gaacttagag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat
     1081 gttgggttaa gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attcggtcgg
     1141 gaactcaaag gagactgcca gtgannaact ggaggaaggt ggggatgacg tcaagtcatc
     1201 atggccctta cgagtagggc tacacacgtg ctacaatggt gcatacagag ggcagcgaga
     1261 gtgcgagctt gagcgaatct cagaaagtgc atctaagtcc ggattggagt ctgcaactcg
     1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc
     1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc
     1441 ttaaccntcg ggggggcgtt taccacggta tgattcatga c
//
LOCUS       RATSUBB      1124 bp ss-mRNA            ROD       21-JUL-1990
DEFINITION  Rat F-0-ATPase subunit b mRNA, complete cds.
ACCESSION   M35052
KEYWORDS    F-0-ATPase subunit b.
SOURCE      Rat (strain Wistar) liver, cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1124)
  AUTHORS   Tsurumi,C., Yoshihara,Y., Osaka,F., Yamada,F., Tani,I., Higuti,T.,
            Shimizu,M., Oeda,K., Ohkawa,H., Toda,H., Kakuno,T., Sakiyama,F.,
            Kumatori,A., Tanaka,K. and Ichihara,A.
  TITLE     cDNA cloning and sequencing for the import precursor of subunit b
            in H+-ATP synthase from rat mitochondria
  JOURNAL   Biochem. Biophys. Res. Commun. 169, 136-142 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Higuti, 04-JUN-1990.
FEATURES       from  to/span     description
    pept         60      830     F-0-ATPase subunit b precursor
    sigp         60      185     F-0-ATPase subunit b signal peptide
    matp        186      827     F-0-ATPase subunit b
    signal     1106     1111     poly-A signal
BASE COUNT      323 a    241 c    266 g    294 t
ORIGIN      
        1 gagtgtctcg ggtcggggtc acagggacat cggtaccgct gtctggatct tcgcagacaa
       61 tgctgtcccg ggtggtactt tctgccgccg ccacagcggc cccgtgtctg aagaacgcgg
      121 ccgtcctggg tccaggggtt ttacaggcaa caagggtctt tcacacagga cagcctcgcc
      181 ttgcccctct accacctctt cctgaatatg gaggaaaagt acgtcttggg ctgattcctg
      241 aggaattttt ccagttcctt taccctaaga ctggtgtaac aggaccctac gtgcttggaa
      301 ctgggcttag cttgtacttt ctatccaaag aaatatatgt gattacccca gagaccttct
      361 ctaccatatc agtagtaggg ttgattgtct atgtgattaa gaaatatggc gcctctattg
      421 gagaatttat tgacaaactt aatgaggaaa aaattgctca actagaagaa ataaagcagt
      481 caagcatgaa gcaaatccag gatgcaatca acagggagaa ggcacagcag gcactggttc
      541 agaagcgcca ctacctcttc gatgttcaga ggaataacat tgccctggcc ttggaggtca
      601 cttaccggga acgactacat aaagcatata aggaggtaaa gaatcgcctg gactaccata
      661 tttctgtaca ggacatgatg cgtcgcaagg agggagagca catgataaac tgggtggaga
      721 agcatgtgat acagagcatt tctgcacagc aggaaaagga gaccattgcc aagtgcatcg
      781 gagatctaaa gatgcttgca aagaaggctc aagctcagcc aattatgtga atgtgtctat
      841 ctcagttgtg atagccagag agggttaact taaatgagaa ctactctact tgaagaactc
      901 tttctgtatt gctgtctaat gaaattgtgg tgtacctttc ctaagaaatg atcaaggtcc
      961 atttagtggc ttaaccctat ttctgatcct tgctctgtat tcggagttgt ctgatcacat
     1021 ttgagtaagc aatttgcagc aactcgctgc ccaacagaaa ttaccaagtt atagtttaaa
     1081 cttgtaatta gttaaaacat cttacaataa aatgtttgaa acag
//

GenBank-Updates@genbank.bio.net (07/22/90)

LOCUS       BORFLAG       684 bp ds-DNA             BCT       22-JUL-1990
DEFINITION  B.burgdorferei flagellin gene, 5' end.
ACCESSION   M34710
KEYWORDS    flagellin.
SOURCE      B.burgdorferei (strain B31; isolate ATCC #35210) DNA.
  ORGANISM  Borrelia burgdorferei
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; 
            Spirochaetales; Spirochaetaceae.
REFERENCE   1  (bases 201 to 684)
  AUTHORS   Picken,R.N.
  TITLE     Nucleotide sequence of the flagellin gene of Borrelia hermsii and
            identification of oligonucleotides for the specific amplification
            and detection of Borrelia burgdorferi flagellin sequence by the
            polymerase chain reaction
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 200)
  AUTHORS   Picken,R.N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by R.N.Picken, 18-MAY-1990.
            
              Author address:R.N.Picken:
              Baxter Healthcare Corp., Pandex Division
              909 Orchard Street
              Mundelein, IL 60060
FEATURES       from  to/span     description
    pept        493  >   684     flagellin
    binding     480      486     ribosome binding site
BASE COUNT      249 a     74 c    128 g    233 t
ORIGIN      
        1 tctagatatt taagacttga tgagaaaaag tttgatgaat caattcgtaa taatattgat
       61 aatgttagag agcttttttt atatgatctt aatggtgaca gagtgtatga taatggaatt
      121 gctaaaatgc taggagattg tctgtcgcct cttgtggctt ccggaggagt tatttataat
      181 aaaataaaga attacgactt gaaaattttt aatcaaaaaa ataaagttga agattataaa
      241 aagaagtacg aagatagaga gagaaaagtg gaaggtgaac ttaatacctt ggattttacc
      301 gttaagcgca tgaaagatca agaaaataca ttaaaggctt ttgattttaa tcaaagaaat
      361 aaataataat aattattttt aatgctattg ctatttgcgt ttcttttttt ttaatttttg
      421 tgctattctt tttaacaggc aaaaggattt gccaaagtca gaaatttaaa ttttatcatg
      481 gaggaatgat atatgattat caatcataat acatcagcta ttaatgcttc aagaaataat
      541 ggcattaacg ctgctaatct tagtaaaact caagaaaagc tttctagtgg gtacagaatt
      601 aatcgagctt ctgatgatgc tgctggcatg ggagtttctg gtaagattaa tgctcaaata
      661 agaggtttgt cacaagcttc taga
//
LOCUS       BORGFLAGA     620 bp ds-DNA             BCT       22-JUL-1990
DEFINITION  B.burgdorferei DNA.
ACCESSION   M34711
KEYWORDS    .
SOURCE      B.burgdorferei (strain B31; isolate ATCC #35210) DNA.
  ORGANISM  Borrelia burgdorferei
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; 
            Spirochaetales; Spirochaetaceae.
REFERENCE   1  (bases 1 to 620)
  AUTHORS   Picken,R.N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.N.Picken, 18-MAY-1990.
            
              Author address:R.N.Picken:
              Baxter Healthcare Corp., Pandex Division
              909 Orchard Street
              Mundelein, IL 60060
BASE COUNT      220 a    113 c    125 g    162 t
ORIGIN      
        1 tctagaaata cttcaaaggc tattaatttt attcagacaa cagaagggaa tttaaatgaa
       61 gtagaaaaag tcttagtaag aatgaaggaa ttggcagttc aatcaggtaa cggcacatat
      121 tcagatgcag acagaggttc tatacaaatt gaaatagagc aacttacaga cgaaattaat
      181 agaattgctg atcaagctca atataaccaa atgcacatgt tatcaaacaa atctgcttct
      241 caaaatgtaa gaacagctga agagcttgga atgcagcctg caaaaattaa cacaccagca
      301 tcactttcag ggtctcaagc gtcttggact ttaagagttc atgttggagc aaaccaagat
      361 gaacctattg ctgtaaatat ttatgcagct aatgttgcaa atcttttctc tggtgaggga
      421 gctcaaactg ctcaggctgc accggttcaa gagggtgttc aacaggaagg agctcaacag
      481 ccagcacctg ctacagcacc ttctcaaggc ggagttaatt ctcctgttaa tgttacaact
      541 acagttgatg ctaatacatc acttgctaaa attgaaaatg ctattagaat gataagtgat
      601 caaagggcaa atttaggtgc
//
LOCUS       SSOBETA      2346 bp ds-DNA             BCT       22-JUL-1990
DEFINITION  S.solfataricus beta-D-galactosidase (lacS) gene, complete cds.
ACCESSION   M34696
KEYWORDS    beta-D-galactosidase.
SOURCE      S.solfataricus (strain MT-4) DNA, clone C1.
  ORGANISM  Sulfolobus solfataricus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 2346)
  AUTHORS   Cubellis,M.V., Rozzo,C., Montecucchi,P. and Rossi,M.
  TITLE     Isolation and sequencing of a new beta-galactosidase
            archaebacterial gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.V.Cubellis, 18-MAY-1990.
            
               Author address:M.V.Cubellis
               Dipt. Chimica Organica e Biologica
               Universita di Napoli
               Via Mezzocannone, 16
               80134 Napoli, Italia
FEATURES       from  to/span     description
    pept        229     1698     beta-D-galactosidase (lacS) (EC 3.2.1.23)
BASE COUNT      770 a    402 c    487 g    687 t
ORIGIN      
        1 aaggagaaac ttggcagttt ataacttgac agtaggttgt ggagtgatga ctggatcaat
       61 actaggagga gtagcatata attacgttac acaattttat aacccaatat attcaataga
      121 ccttatgctt atcctatcct ctattctaag attctcggta tctcccctat tcttgaccat
      181 aaaagatact cgctcaaagc ttaaataata ttaatcataa ataaagtcat gtactcattt
      241 ccaaatagct ttaggtttgg ttggtcccag gccggatttc aatcagaaat gggaacacca
      301 gggtcagaag atccaaatac tgactggtat aaatgggttc atgatccaga aaacatggca
      361 gcgggattag taagtggaga tctaccagaa aatgggccag gctactgggg aaactataag
      421 acatttcacg ataatgcaca aaaaatggga ttaaaaatag ctagactaaa tgtggaatgg
      481 tctaggatat ttcctaatcc attaccaagg ccacaaaact ttgatgaatc aaaacaagat
      541 gtgacagagg ttgagataaa cgaaaacgag ttaaagagac ttgacgagta cgctaataaa
      601 gacgcattaa accattacag ggaaatattc aaggatctta aaagtagagg actttacttt
      661 atactaaaca tgtatcattg gccattacct ctatggttac acgacccaat aagagtaaga
      721 agaggagatt ttactggacc aagtggttgg ctaagtacta gaacagttta cgaattcgct
      781 agattctcag cttatatagc ttggaaattc gatgatctag tggatgagta ctcaacaatg
      841 aatgaaccta acgttgttgg aggtttagga tacgttggtg ttaagtccgg ttttccccca
      901 ggatacctaa gctttgaact ttcccgtagg catatgtata acatcattca agctcacgca
      961 agagcgtatg atgggataaa gagtgtttct aaaaaaccag ttggaattat ttacgctaat
     1021 agctcattcc agccgttaac ggataaagat atggaagcgg tagagatggc tgaaaatgat
     1081 aatagatggt ggttctttga tgctataata agaggtgaga tcaccagagg aaacgagaag
     1141 attgtaagag atgacctaaa gggtagattg gattggattg gagttaatta ttacactagg
     1201 actgttgtga agaggactga aaagggatac gttagcttag gaggttacgg tcacggatgt
     1261 gagaggaatt ctgtaagttt agcgggatta ccaaccagcg acttcggctg ggagttcttc
     1321 ccagaaggtt tatatgacgt tttgacgaaa tactggaata gatatcatct ctatatgtac
     1381 gttactgaaa atggtattgc ggatgatgcc gattatcaaa ggccctatta tttagtatct
     1441 cacgtttatc aagttcatag agcaataaat agtggtgcag atgttagagg gtatttacat
     1501 tggtctctag ctgataatta cgaatgggct tcaggattct ctatgaggtt tggtctgtta
     1561 aaggtcgatt acaacactaa gagactatac tggagaccct cagcactagt atatagggaa
     1621 atcgccacaa atggcgcaat aactgatgaa atagagcact taaatagcgt acctccagta
     1681 aagccattaa ggcactaaac tttctcaagt ctcactatac caaatgagtt ttcttttaat
     1741 cttattctaa tctcattttc attagattgc aatactttca taccttctat attatttatt
     1801 ttgtaccttt tgggatctac acttaatgtt agcctaattg gaaagtcatt tagatttaat
     1861 actgttacca gtccatccct tttaattatt aatgaaaata agaagggata agtagcgata
     1921 gcccttattc cgatatggtc tccaacaata tcccttatta tctgccttgc aacactaggg
     1981 tagaactctg aaatcagata tggtaggtaa gttgtaagtg ataggacgta aactttagag
     2041 ttagagtaag tgttctgaaa gactactggg tgcaattcga caccgttata ggcgtaaagg
     2101 attggcgtag ctccgtttaa tgaaaatata ggtcctacag ggaaattggc ttgcctcttg
     2161 taatatgacc aatagaacgt tttcccatcc ctggttaacg cattgacact aacactatcg
     2221 taaatcaagt taccgacacc aagaattttc agtgcagtat cccccaagac ttcaataagc
     2281 tttttagctg cacttgctgt aaacattaag ttaactcccc tattaagtaa atccacaata
     2341 tctaga
//

GenBank-Updates@genbank.bio.net (07/24/90)

LOCUS       HUMHPRTB    56736 bp ds-DNA             PRI       24-JUL-1990
DEFINITION  Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete
            cds.
ACCESSION   M26434
KEYWORDS    Alu repetitive element; LINE repetitive element;
            hypoxanthine phosphoribosyltransferase.
SOURCE      Human cell line 4X,Y (GM1202), fetal liver, and beta thalassemia
            patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 56736)
  AUTHORS   Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J.,
            Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W.
  TITLE     Automated DNA sequencing of the human HPRT locus
  JOURNAL   Genomics 6, 593-608 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (sites; mutations causing gout)
  AUTHORS   Argos,P., Hanei,M., Wilson,J.M. and Kelley,W.N.
  TITLE     A possible nucleotide-binding domain in the tertiary fold of
            phosphoribosyltransferase
  JOURNAL   J. Biol. Chem. 258, 6450-6457 (1983)
  STANDARD  full staff_entry
REFERENCE   3  (sites; mutations causing gout)
  AUTHORS   Cariello,N.F., Scott,J.K., Kat,A.G., Thilly,W.G. and Keohavong,P.
  TITLE     Resolution of a missense mutant in human genomic DNA by denaturing
            gradient gel electrophoresis and direct sequencing using in vitro
            DNA amplification: HPRT-Munich
  JOURNAL   Am. J. Hum. Genet. 42, 726-734 (1988)
  STANDARD  full staff_entry
REFERENCE   4  (sites; mutations causing gout)
  AUTHORS   Davidson,B.L., Chin,S.J., Wilson,J.M., Kelley,W.N. and Palella,T.D.
  TITLE     Hypoxanthine-guanine phosphoribosyltransferase. Genetic evidence
            for identical mutations in two partially deficient subjects
  JOURNAL   J. Clin. Invest. 82, 2164-2167 (1988)
  STANDARD  full staff_entry
REFERENCE   5  (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Davidson,B.L., Palella,T.D. and Kelly,W.N.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase: A single
            nucleotide substitution in cDNA clones isolated from a patient with
            Lesch-Nyhan syndrome (HPRT-Midland)
  JOURNAL   Gene 68, 85-91 (1988)
  STANDARD  full staff_entry
REFERENCE   6  (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D.
  TITLE     Genetic basis of hypoxanthine guanine phosphoribosyltransferase
            deficiency in a patient with Lesch-Nyhan syndrome (HPRT-Flint)
  JOURNAL   Gene 63, 331-336 (1988)
  STANDARD  full staff_entry
REFERENCE   7  (sites; mutations causing Lesch-Nyhan syndrome and gout)
  AUTHORS   Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase deficiency.
            The molecular defect in a patient with gout (HPRT-Ashville)
  JOURNAL   J. Biol. Chem. 264, 520-525 (1989)
  STANDARD  full staff_entry
REFERENCE   8  (sites; mutations causing gout)
  AUTHORS   Davidson,B.L., Tarle,S.A., Palella,T.D. and Kelley,W.N.
  TITLE     Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
            deficiency in ten subjects determined by direct sequencing of
            amplified transcripts
  JOURNAL   J. Clin. Invest. 84, 342-346 (1989)
  STANDARD  full staff_entry
REFERENCE   9  (sites; mutations causing gout)
  AUTHORS   Dush,M.K., Sikela,J.M., Khan,S.A., Tischfield,J.A. and
            Stambrook,P.J.
  TITLE     Nucleotide sequence and organization of the mouse adenine
            phosphoribosyltransferase gene: Presence of a coding region common
            to animal and bacterial phosphoribosyltransferases that has a
            variable intron/exon arrangement
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 2731-2735 (1985)
  STANDARD  full staff_entry
REFERENCE   10 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Fujimori,S., Davidson,B.L., Kelley,W.N. and Palella,T.D.
  TITLE     Identification of a single nucleotide change in the
            hypoxanthine-guanine phosphoribosyltransferase gene (HPRT-Yale)
            responsible for Lesch-Nyhan syndrome
  JOURNAL   J. Clin. Invest. 83, 11-13 (1989)
  STANDARD  full staff_entry
REFERENCE   11 (sites; mutations causing gout)
  AUTHORS   Fujimori,S., Hidaka,Y., Davidson,B.L., Palella,T.D. and Kelley,W.N.
  TITLE     Identification of a single nucleotide change in a mutant gene for
            hypoxanthene-guanine phosphoribosyltransferase (HPRT-Ann Arbor)
  JOURNAL   Hum. Genet. 79, 39-43 (1988)
  STANDARD  full staff_entry
REFERENCE   12 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Gibbs,R.A., Nguyen,P.N., Edwards,A., Civitello,A.B. and Caskey,C.T.
  TITLE     Multiplex DNA deletion detection and exon sequencing of the
            hypoxanthine phosphoribosyltransferase gene in Lesch-Nyhan families
  JOURNAL   Genomics 7, 235-244 (1990)
  STANDARD  full staff_entry
REFERENCE   13 (sites; mutations causing Lesch-Nyhan syndrome and gout)
  AUTHORS   Gibbs,R.A., Nguyen,P.N., McBride,L.J., Koepf,S.M. and Caskey,C.T.
  TITLE     Identification of mutations leading to the Lesch-Nyhan syndrome by
            automated direct DNA sequencing of in vitro amplified cDNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1919-1923 (1989)
  STANDARD  full staff_entry
REFERENCE   14 (sites; mutations causing gout)
  AUTHORS   Hershey,H.V. and Taylor,M.W.
  TITLE     Nucleotide sequence and deduced amino acid sequence of Escherichia
            coli adenine phosphoribosyltransferase and comparison with other
            analogous enzymes
  JOURNAL   Gene 43, 287-293 (1986)
  STANDARD  full staff_entry
REFERENCE   15 (sites; conserved domain)
  AUTHORS   King,A. and Melton,D.W.
  TITLE     Characterisation of cDNA clones for hypoxanthine-guanine
            phosphoribosyltransferase from the human malarial parasite,
            Plasmodium falciparum: Comparisons to the mammalian gene and
            protein
  JOURNAL   Nucleic Acids Res. 15, 10469-10481 (1987)
  STANDARD  full staff_entry
REFERENCE   16 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Ogasawara,N., Stout,J.T., Goto,H., Sonta,S.I., Matsumoto,A. and
            Caskey,C.T.
  TITLE     Molecular analysis of a female Lesch-Nyhan patient
  JOURNAL   J. Clin. Invest. 84, 1024-1027 (1989)
  STANDARD  full staff_entry
REFERENCE   17 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Stout,J.T. and Caskey,C.T.
  TITLE     HPRT: Gene structure, expression, and mutation
  JOURNAL   Annu. Rev. Genet. 19, 127-148 (1985)
  STANDARD  full staff_entry
REFERENCE   18 (sites;
  AUTHORS   Wilson,J.M. and Kelley,W.N.
  TITLE     Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
            deficiency in a patient with the Lesch-Nyhan syndrome
  JOURNAL   J. Clin. Invest. 71, 1331-1335 (1983)
  STANDARD  full staff_entry
REFERENCE   19 (sites; mutations causing Lesch-Nyhan syndrome and gout)
  AUTHORS   Wilson,J.M. and Kelley,W.N.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase. Structural
            alteration in a dysfunctional enzyme variant (HPRT-Munich) isolated
            from a patient with gout
  JOURNAL   J. Biol. Chem. 259, 27-30 (1984)
  STANDARD  full staff_entry
REFERENCE   20 (sites; mutations causing gout)
  AUTHORS   Wilson,J.M., Kobayashi,R., Fox,I.H. and Kelley,W.N.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase. Molecular
            abnormality in a mutant form of the enzyme (HPRT-Toronto)
  JOURNAL   J. Biol. Chem. 258, 6458-6460 (1983)
  STANDARD  full staff_entry
REFERENCE   21 (sites; mutations causing gout)
  AUTHORS   Wilson,J.M., Tarr,G.E. and Kelley,W.N.
  TITLE     Human hypoxanthine (guanine) phosphoribosyltransferase: An amino
            acid substitution in a mutant form of the enzyme isolated from a
            patient with gout
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 870-873 (1983)
  STANDARD  full staff_entry
REFERENCE   22 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Yang,T.P., Patel,P.I., Chinault,A.C., Stout,J.T., Jackson,L.G.,
            Hildebrand,B.M. and Caskey,C.T.
  TITLE     Molecular evidence for new mutation at the hprt locus in
            Lesch-Nyhan patients
  JOURNAL   Nature 310, 412-414 (1984)
  STANDARD  full staff_entry
REFERENCE   23 (sites; mutant reversion)
  AUTHORS   Yang,T.P., Stout,J.T., Konecki,D.S., Patel,P.I., Alford,R.L. and
            Caskey,C.T.
  TITLE     Spontaneous reversion of novel Lesch-Nyhan mutation by HPRT gene
            rearrangement
  JOURNAL   Somat. Cell Mol. Genet. 14, 293-303 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Edwards, 26-JUL-1989.
            
               Mutant    Description
            
               RJK 1780  missing intron 1 (partial and exon 2 [12]
               RJK  849  missing intron 3 (partial) and exons 4-9 (no mRNA)
            [22]
               RJK  984  missing intron 5 (partial) and exons 6-9 (no mRNA)
            [17]
                         [22]
               GM  3467  missing intron 8 (partial) and exon 9 (no mRNA) [12]
                         [22]
               RJK  853  complete gene deletion [12] [22] [16]
               GM  2227  inversion of exons 6-9 (no mRNA) [23]
               GM  1662  and GM 6804 duplication of exons 2 and 3 and elongated
                         mRNA [22] [23]
               Connersville missing intron 7 (partial and exon 8 [7]
FEATURES       from  to/span     description
    pept       1677     1703     hypoxanthine phosphoribosyltransferase, exon 1
                                 /nomgen="HPRT" /map="Xq26"
                                 /hgml_locus_uid="LE0031K"
              14780    14886     hypoxanthine phosphoribosyltransferase, exon 2
              16603    16786     hypoxanthine phosphoribosyltransferase, exon 3
              27891    27956     hypoxanthine phosphoribosyltransferase, exon 4
              31617    31634     hypoxanthine phosphoribosyltransferase, exon 5
              34937    35019     hypoxanthine phosphoribosyltransferase, exon 6
              39815    39861     hypoxanthine phosphoribosyltransferase, exon 7
              40033    40109     hypoxanthine phosphoribosyltransferase, exon 8
              41454    41501     hypoxanthine phosphoribosyltransferase, exon 9
    IVS        1704    14779     HPRT intron A
    IVS       14887    16602     HPRT intron B
    IVS       16787    27890     HPRT intron C
    IVS       27957    31616     HPRT intron D
    IVS       31635    34936     HPRT intron E
    IVS       35020    39814     HPRT intron F
    IVS       39862    40032     HPRT intron G
    IVS       40110    41453     HPRT intron H
    rpt         258       41 (c) Alu repeat copy A
    rpt        1012      694 (c) Alu repeat copy B
    rpt        3532     3342 (c) Alu repeat copy C
    rpt        3982     3343 (c) Alu repeat copy D
    rpt        6232     5937 (c) Alu repeat copy E
    rpt        6524     6288 (c) Alu repeat copy F
    rpt        7504     7807     Alu repeat copy G
    rpt        8713     8408 (c) Alu repeat copy H
    rpt        9477     9169 (c) Alu repeat copy I
    rpt        9699    10142     Alu repeat copy J
    rpt       10407    10293 (c) Alu repeat copy K
    rpt       12154    12310     Alu repeat copy L
    rpt       13302    12998 (c) Alu repeat copy M
    rpt       13638    13314 (c) Alu repeat copy N
    rpt       14238    13923 (c) Alu repeat copy O
    rpt       14610    14239 (c) Alu repeat copy P
    rpt       16090    15843 (c) Alu repeat copy Q
    rpt       16518    16293 (c) Alu repeat copy R
    rpt       18783    18439 (c) Alu repeat copy S
    rpt       18962    19084     Alu repeat copy T
    rpt       19397    19104 (c) Alu repeat copy U
    rpt       20064    19778 (c) Alu repeat copy V
    rpt       20486    20636     Alu repeat copy W
    rpt       20990    20696 (c) Alu repeat copy X
    rpt       21293    20996 (c) Alu repeat copy Y
    rpt       22171    21878 (c) Alu repeat copy Z
    rpt       23772    23460 (c) Alu repeat copy AA
    rpt       25310    25426     Alu repeat copy AB
    rpt       26388    26279 (c) Alu repeat copy AC
    rpt       29863    29560 (c) Alu repeat copy AD
    rpt       33618    33148 (c) Alu repeat copy AE
    rpt       36575    36645     Alu repeat copy AF
    rpt       38613    39039     Alu repeat copy AG
    rpt       39091    39388     Alu repeat copy AH
    rpt       40622    40925     Alu repeat copy AI
    rpt       40952    41262     Alu repeat copy AJ
    rpt       42640    42936     Alu repeat copy AK
    rpt       44776    44486 (c) Alu repeat copy AL
    rpt       46627    46305 (c) Alu repeat copy AM
    rpt       47520    47763     Alu repeat copy AN
    rpt       49257    49551     Alu repeat copy AO
    rpt       49705    49993     Alu repeat copy AP
    rpt       50002    50312     Alu repeat copy AQ
    rpt       51066    50774 (c) Alu repeat copy AR
    rpt       52276    51978 (c) Alu repeat copy AS
    rpt       52407    52674     Alu repeat copy AT
    rpt       54653    54551 (c) Alu repeat copy AU
    rpt       55465    55164 (c) Alu repeat copy AV
    rpt       56523    56689     Alu repeat copy AW
    rpt       11312    11254 (c) LINE repeat copy A (partial)
    rpt       18284    18242 (c) LINE repeat copy B (partial)
    rpt       27091    26843 (c) LINE repeat copy C (partial)
    rpt       37558    37300 (c) LINE repeat copy D (partial)
    rpt       39451    39427 (c) LINE repeat copy E (partial)
    rpt       51317    51100 (c) LINE repeat copy F (partial)
    rpt         261      321     short interspersed repeat copy A
    rpt        1013     1054     short interspersed repeat copy B
    rpt        2902     2869 (c) short interspersed repeat copy C
    rpt        3078     3107     short interspersed repeat copy D
    rpt        4228     4254     short interspersed repeat copy E
    rpt        4463     4494     short interspersed repeat copy F
    rpt       21582    21635     short interspersed repeat copy G
    rpt       28430    28394 (c) short interspersed repeat copy H
    rpt       28923    28981     short interspersed repeat copy I
    rpt       35562    35604     short interspersed repeat copy J
    rpt       44042    44108     short interspersed repeat copy K
    rpt       46628    46688     short interspersed repeat copy L
    rpt       51715    51687 (c) short interspersed repeat copy M
    unsure    17479    17479     polymorphism or cloning error
    unsure    24903    24903     polymorphism or cloning error
    mut        1664     1678     ggccggctccgttat in wt; gt in gout RJK 951 [13]
    mut       14778    14778     a in wt; t in Lesch-Nyhan RJK 1760, no exon 2
                                 [12]
    mut       14808    14809     tt in wt; ttt in Lesch-Nyhan Chicago [7]
    mut       14874    14874     t in wt; c in Lesch-Nyhan Detroit Leu->Pro [7]
    mut       14877    14879     tta in wt; ta in Lesch-Nyhan RJK 1939 [12]
    mut       14886    14886     g in wt; a in Lesch-Nyhan RJK 2163 Arg->Lys
                                 [12]
    mut       16619    16619     c in wt; g in gout Toronto Arg->Gly [20]
    mut       16623    16664     atgtgatgaaggagatgggaggccatcacattgtagccctct in
                                 wt; at in Lesch-Nyhan RJK 2108 [12]
    mut       16675    16676     ag in wt; agg in Lesch-Nyhan RJK 866 [13]
    mut       16677    16677     g in wt; a in Lesch-Nyhan New Haven Gly->Glu
                                 [7]
    mut       16679    16679     g in wt; c in Lesch-Nyhan Yale Gly->Arg [10]
    mut       16690    16690     c in wt; a in Lesch-Nyhan Flint RJK 892
                                 Phe->Leu [5] [13]
    mut       16707    16707     a in wt; t in gout Arlingen Asp->Val [7]
    mut       16756    16759     tgta in wt; ta in Lesch-Nyhan RJK 1332 [13]
    mut       16780    16780     c in wt; a in gout Munich Ser->Arg [3] [19]
    mut       16784    16787     tgtg in wt; tg in Lesch-Nyhan RJK 1747 [12]
    mut       27897    27897     c in wt; t in Lesch-Nyhan RJK 1930 Gln->TAG
                                 [12]
    mut       27911    27911     c in wt; t in gout London Ser->Leu [2] [4]
                                 [12] [14]
    mut       31621    31621     t in wt; a in Lesch-Nyhan Midland RJK 896
                                 Val->Asp [6] [13]
    mut       31622    31624     ctt in wt; ct in Lesch-Nyhan RJK 2019 [12]
    mut       31624    31624     t in wt; c in Lesch-Nyhan RJK 1784 Leu->Ser
                                 [13]
    mut       31628    31628     t in wt; g in gout Ann Arbor Ile->Met [11]
    mut       34962    34970     tgcag in wt; agcaaagcaa in Lesch-Nyhan RJK
                                 1210 Met->Lys [13]
    mut       35015    35015     g in wt; t in gout Milwaukee RJK 949 Ala->Ser
                                 [7] [13]
    mut       39837    39837     c in wt; t in Lesch-Nyhan RJK 974 Arg->TGA
                                 [13]
    mut       39842    39846     tgttg in wt; tg in Lesch-Nyhan RJK 1894 [12]
    mut       39858    39858     g in wt; t in Lesch-Nyhan RJK 2185 Asp->Tyr
                                 [12]
    mut       39866    39866     g in wt; a in Lesch-Nyhan RJK 1934. no exon 7
                                 [12]
    mut       40034    40038     tgttg in wt; tg in Lesch-Nyhan Michigan RJK
                                 855 [7] [13]
    mut       40080    40080     g in wt; a in Lesch-Nyhan Kingston RJK 2188
                                 Asp->Asn [12] [19]
    mut       40095    40095     t in wt; g in Lesch-Nyhan New Briton RJK 950
                                 Phe->Val [7] [13]
    mut       40102    40102     a in wt; g in gout Ashville Asp->Gly [8]
    mut       40114    40114     g in wt; a in Lesch-Nyhan RJK 888, GM 7092, no
                                 exon 8 [12]
    mut       41486    41509     aaaatacaaagcctaagatgagag in wt; ag in
                                 Lesch-Nyhan Evansville RJK 894 [7] [12]
    mut       41450    41452     ata in wt; ttt in Lesch-Nyhan RJK 906, GM
                                 1899, downstream cryptic splice site used
    mut       41454    41454     c in wt; g in Lesch-Nyhan RJK 1874, RJK 2079,
                                 His->Asp [12] [13]
    mut       41461    41461     g in wt; a in Lesch-Nyhan RJK 1727 Cys->Tyr
                                 [13]
BASE COUNT    15689 a  11281 c  11599 g  18167 t
ORIGIN      1 bp upstream of EcoRI site; chromosome Xq26.
        1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt
       61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc
      121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc
      181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt
      241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga
      301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc
      361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag
      421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc
      481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt
      541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct
      601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga
      661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt
      721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat
      781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga
      841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag
      901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct
      961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc
     1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag
     1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac
     1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc
     1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg
     1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt
     1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg
     1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg
     1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg
     1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc
     1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc
     1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg
     1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg
     1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg
     1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc
     1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga
     1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca
     1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc
     2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat
     2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg
     2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag
     2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg
     2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct
     2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt
     2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg
     2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca
     2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg
     2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt
     2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg
     2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa
     2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt
     2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg
     2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc
     2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc
     3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt
     3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt
     3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct
     3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat
     3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt
     3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag
     3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc
     3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc
     3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta
     3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac
     3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc
     3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt
     3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc
     3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc
     3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc
     3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag
     3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc
     4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca
     4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc
     4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc
     4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa
     4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca
     4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga
     4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac
     4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg
     4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt
     4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac
     4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg
     4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag
     4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg
     4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca
     4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg
     4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat
     4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg
     5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc
     5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta
     5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc
     5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact
     5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct
     5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg
     5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg
     5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc
     5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa
     5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat
     5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa
     5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg
     5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc
     5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa
     5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt
     5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt
     6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg
     6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga
     6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg
     6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc
     6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt
     6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc
     6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg
     6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt
     6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga
     6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac
     6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct
     6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag
     6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc
     6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt
     6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg
     6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc
     6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca
     7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc
     7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga
     7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt
     7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc
     7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg
     7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag
     7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag
     7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca
     7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt
     7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac
     7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga
     7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt
     7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa
     7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct
     7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact
     7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt
     7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc
     8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac
     8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct
     8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg
     8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat
     8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag
     8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat
     8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg
     8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg
     8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc
     8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac
     8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg
     8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc
     8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat
     8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga
     8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta
     8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg
     9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt
     9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag
     9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt
     9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc
     9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag
     9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt
     9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca
     9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc
     9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc
     9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat
     9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg
     9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc
     9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga
     9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca
     9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat
     9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta
     9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc
    10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc
    10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa
    10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa
    10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat
    10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt
    10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct
    10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt
    10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga
    10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca
    10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct
    10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg
    10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg
    10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa
    10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta
    10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa
    10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca
    10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct
    11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa
    11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc
    11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat
    11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc
    11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag
    11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag
    11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga
    11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg
    11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg
    11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt
    11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac
    11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa
    11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa
    11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa
    11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt
    11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt
    12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag
    12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat
    12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat
    12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc
    12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt
    12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt
    12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct
    12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat
    12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa
    12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag
    12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt
    12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac
    12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca
    12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct
    12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt
    12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc
    12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta
    13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc
    13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg
    13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc
    13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct
    13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt
    13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg
    13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat
    13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca
    13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga
    13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt
    13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac
    13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta
    13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg
    13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc
    13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat
    13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg
    13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc
    14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt
    14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga
    14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt
    14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc
    14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg
    14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc
    14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc
    14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca
    14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg
    14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata
    14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg
    14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac
    14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg
    14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat
    14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac
    14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa
    15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt
    15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg
    15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg
    15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg
    15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt
    15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat
    15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt
    15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata
    15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat
    15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag
    15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg
    15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt
    15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga
    15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc
    15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca
    15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag
    15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt
    16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag
    16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt
    16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa
    16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa
    16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt
    16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact
    16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac
    16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac
    16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa
    16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg
    16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg
    16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag
    16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa
    16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa
    16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta
    16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta
    16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg
    17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt
    17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt
    17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt
    17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc
    17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg
    17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc
    17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt
    17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa
    17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc
    17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat
    17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc
    17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat
    17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact
    17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa
    17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt
    17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa
    18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat
    18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat
    18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt
    18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat
    18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc
    18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg
    18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt
    18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt
    18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca
    18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc
    18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt
    18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg
    18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg
    18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat
    18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa
    18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat
    18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg
    19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa
    19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag
    19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga
    19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc
    19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa
    19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg
    19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc
    19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa
    19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg
    19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa
    19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa
    19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg
    19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag
    19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga
    19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag
    19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt
    19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct
    20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt
    20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac
    20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct
    20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga
    20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat
    20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg
    20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat
    20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt
    20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc
    20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag
    20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt
    20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca
    20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta
    20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg
    20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc
    20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt
    21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc
    21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg
    21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc
    21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc
    21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa
    21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat
    21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa
    21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat
    21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat
    21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt
    21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt
    21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta
    21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt
    21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt
    21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct
    21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct
    21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat
    22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc
    22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg
    22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca
    22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag
    22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc
    22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat
    22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg
    22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct
    22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca
    22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt
    22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc
    22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta
    22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct
    22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta
    22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct
    22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa
    22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa
    23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag
    23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc
    23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa
    23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc
    23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac
    23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga
    23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt
    23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc
    23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc
    23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat
    23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc
    23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc
    23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc
    23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga
    23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga
    23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa
    24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg
    24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat
    24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat
    24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct
    24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt
    24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga
    24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg
    24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt
    24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc
    24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt
    24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc
    24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg
    24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat
    24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa
    24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt
    24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg
    24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc
    25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag
    25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc
    25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact
    25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta
    25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg
    25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc
    25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag
    25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc
    25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca
    25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag
    25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag
    25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt
    25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag
    25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat
    25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa
    25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata
    25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg
    26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt
    26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat
    26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt
    26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag
    26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc
    26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact
    26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg
    26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat
    26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag
    26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt
    26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt
    26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata
    26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta
    26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat
    26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg
    26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata
    27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata
    27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg
    27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc
    27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat
    27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag
    27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga
    27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt
    27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac
    27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa
    27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg
    27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt
    27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat
    27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct
    27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata
    27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt
    27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat
    27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct
    28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg
    28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca
    28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat
    28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa
    28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca
    28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc
    28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg
    28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa
    28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt
    28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt
    28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt
    28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat
    28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa
    28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac
    28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac
    28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca
    28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct
    29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac
    29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa
    29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc
    29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga
    29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt
    29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca
    29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca
    29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt
    29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt
    29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca
    29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg
    29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt
    29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct
    29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg
    29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag
    29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct
    30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga
    30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact
    30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt
    30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa
    30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt
    30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg
    30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc
    30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc
    30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag
    30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga
    30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta
    30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata
    30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct
    30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata
    30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa
    30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca
    30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga
    31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg
    31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac
    31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat
    31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga
    31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg
    31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt
    31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt
    31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa
    31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat
    31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg
    31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac
    31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat
    31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg
    31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt
    31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt
    31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac
    31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta
    32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa
    32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag
    32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta
    32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt
    32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt
    32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact
    32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc
    32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat
    32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa
    32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta
    32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca
    32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg
    32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc
    32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt
    32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt
    32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt
    33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc
    33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac
    33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta
    33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac
    33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta
    33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta
    33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata
    33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata
    33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc
    33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat
    33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa
    33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct
    33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc
    33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt
    33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga
    33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct
    33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag
    34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc
    34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat
    34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac
    34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt
    34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga
    34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt
    34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc
    34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt
    34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca
    34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct
    34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac
    34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa
    34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc
    34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc
    34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg
    34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt
    34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac
    35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt
    35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat
    35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta
    35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag
    35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga
    35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc
    35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc
    35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt
    35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag
    35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat
    35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta
    35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat
    35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta
    35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg
    35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt
    35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa
    36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt
    36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag
    36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat
    36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga
    36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa
    36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt
    36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga
    36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct
    36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg
    36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg
    36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga
    36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag
    36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga
    36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta
    36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt
    36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag
    36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta
    37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa
    37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc
    37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc
    37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg
    37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc
    37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg
    37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac
    37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag
    37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg
    37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc
    37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt
    37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc
    37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt
    37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc
    37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc
    37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag
    37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa
    38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat
    38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc
    38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat
    38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga
    38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata
    38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa
    38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag
    38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt
    38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg
    38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc
    38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg
    38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa
    38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg
    38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa
    38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt
    38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg
    39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa
    39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt
    39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg
    39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc
    39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga
    39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga
    39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc
    39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca
    39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac
    39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa
    39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc
    39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg
    39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct
    39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa
    39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct
    39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga
    39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat
    40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg
    40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt
    40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa
    40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct
    40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag
    40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag
    40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc
    40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact
    40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg
    40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg
    40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag
    40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac
    40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag
    40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg
    40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca
    40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca
    40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct
    41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg
    41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc
    41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac
    41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg
    41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct
    41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc
    41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt
    41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag
    41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg
    41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta
    41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac
    41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc
    41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt
    41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata
    41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga
    41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg
    42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt
    42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc
    42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt
    42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt
    42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta
    42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg
    42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct
    42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt
    42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta
    42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag
    42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta
    42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca
    42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa
    42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg
    42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag
    42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac
    42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa
    43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt
    43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc
    43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct
    43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat
    43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat
    43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc
    43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt
    43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc
    43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt
    43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt
    43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc
    43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca
    43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag
    43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg
    43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa
    43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg
    43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt
    44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca
    44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc
    44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta
    44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact
    44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc
    44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca
    44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg
    44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg
    44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc
    44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac
    44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg
    44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta
    44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca
    44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc
    44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact
    44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt
    45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga
    45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt
    45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg
    45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc
    45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta
    45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg
    45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc
    45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac
    45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct
    45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca
    45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa
    45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa
    45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa
    45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat
    45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg
    45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt
    45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt
    46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca
    46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc
    46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca
    46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg
    46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt
    46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag
    46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct
    46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt
    46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc
    46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc
    46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc
    46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa
    46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc
    46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat
    46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg
    46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc
    46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac
    47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac
    47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca
    47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg
    47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca
    47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc
    47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga
    47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc
    47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa
    47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact
    47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag
    47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg
    47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc
    47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa
    47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga
    47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg
    47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga
    48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc
    48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct
    48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc
    48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg
    48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg
    48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt
    48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg
    48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa
    48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat
    48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc
    48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg
    48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga
    48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat
    48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg
    48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc
    48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt
    48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt
    49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg
    49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag
    49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc
    49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct
    49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat
    49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa
    49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc
    49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc
    49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc
    49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc
    49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta
    49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg
    49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa
    49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc
    49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag
    49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct
    49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt
    50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat
    50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc
    50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt
    50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct
    50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga
    50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca
    50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt
    50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca
    50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt
    50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg
    50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca
    50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa
    50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga
    50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct
    50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt
    50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac
    51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca
    51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca
    51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct
    51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca
    51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac
    51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc
    51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac
    51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa
    51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc
    51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct
    51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc
    51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc
    51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga
    51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt
    51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac
    51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg
    51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa
    52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga
    52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag
    52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa
    52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg
    52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat
    52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg
    52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca
    52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa
    52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag
    52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca
    52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt
    52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta
    52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa
    52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa
    52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac
    52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc
    52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca
    53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata
    53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt
    53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag
    53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc
    53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact
    53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc
    53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat
    53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag
    53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca
    53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag
    53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc
    53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc
    53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc
    53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt
    53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta
    53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct
    54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg
    54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac
    54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt
    54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg
    54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc
    54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc
    54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg
    54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga
    54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag
    54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc
    54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta
    54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta
    54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg
    54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta
    54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga
    54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa
    54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca
    55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc
    55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca
    55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc
    55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc
    55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc
    55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta
    55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg
    55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg
    55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt
    55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt
    55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg
    55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat
    55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac
    55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt
    55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa
    55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa
    55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta
    56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg
    56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag
    56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca
    56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt
    56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca
    56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt
    56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct
    56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg
    56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg
    56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa
    56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc
    56701 gccgcccctg gtgatggacc agcggggctt cgacga
//
LOCUS       CHTMOMPCT    1407 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  C.trachomatis major outer membrane protein gene, complete cds.
ACCESSION   J03813
KEYWORDS    major outer membrane protein.
SOURCE      C.trachomatis (serotype A) DNA.
  ORGANISM  Chlamydia trachomatis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE   1  (sites; amino acid sequences)
  AUTHORS   Baehr,W., Zhang,Y.-X., Joseph,T., Su,H., Nano,F.E.,
            Everett,K.D.E. and Caldwell,H.D.
  TITLE     Mapping antigenic domains expressed by Chlamydia trachomatis major
            outer membrane protein genes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 4000-4004 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1407)
  AUTHORS   Baehr,W.
  JOURNAL   Unpublished (1988) Rocky Mountain laboratories, Hamilton, MT 59840
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly provided
            by W.Baehr, 04-MAY-1988
FEATURES       from  to/span     description
    pept        169     1359     major outer membrane protein
BASE COUNT      403 a    283 c    311 g    410 t
ORIGIN      1 bp upstream of XhoI site.
        1 ctcgagcatt gaacgacatg ttctcgatta aggctgcttt tacttgcaag acattcctca
       61 ggccattaat tgctacagga catcttgtct ggctttaact aggacacagt gccgccagaa
      121 aaagatagcg agcacaaaga gagctaatta tacaatttag aggtaagaat gaaaaaactc
      181 ttgaaatcgg tattagtatt tgccgctttg agttctgctt cctccttgca agctctgcct
      241 gtggggaatc ctgctgaacc aagccttatg atcgacggaa ttctgtggga aggtttcggc
      301 ggagatcctt gcgatccttg caccacttgg tgtgacgcta tcagcatgcg tatgggttac
      361 tatggtgact ttgttttcga ccgtgttttg aaaacagatg tgaataaaga atttcagatg
      421 ggagcggcgc ctactaccag cgatgtagca ggcttagaaa aggatccagt agcaaatgtt
      481 gctcgcccaa atcccgctta tggcaaacac atgcaagatg ctgaaatgtt tacgaacgct
      541 gcttacatgg cattaaatat ctgggatcgt tttgatgtat tttgtacatt gggagcaact
      601 accggttatt taaaaggaaa ctccgcttcc ttcaacttag ttggattatt cggaacaaaa
      661 acacaatctt ctggctttga tacagcgaat attgttccta acactgcttt gaatcaagct
      721 gtggttgagc tttatacaga cactaccttt gcttggagcg taggtgctcg tgcagctctc
      781 tgggaatgtg ggtgtgcaac gttaggagct tctttccaat atgctcaatc taaacctaaa
      841 gtagaagagt tgaatgttct ttgtaatgca tccgaattta ctattaataa gccgaaagga
      901 tatgttgggg cggaatttcc acttgatatt accgcaggaa cagaagctgc gacaggaact
      961 aaggatgcct ctattgacta ccatgagtgg caagcaagtt tagccctttc ttacagatta
     1021 aatatgttca ctccttacat tggagttaaa tggtctagag taagttttga tgccgacacg
     1081 atccgtatcg ctcagcctaa attggctaaa ccagtcttgg ataccactac tctaaacccg
     1141 accatcgctg gtaaaggaac tgtggtctct tccgcagaaa acgaactggc tgatacaatg
     1201 caaatcgttt ccttgcagtt gaacaagatg aaatctagaa aatcttgcgg tattgcagta
     1261 ggaacaactg ttgtagatgc agataaatac gcagttacaa ttgagactcg cttgatcgat
     1321 gagagagcag ctcacgtaaa tgcacaattc cgcttctaat attctaataa ttgtataatt
     1381 atattagaat cttttcttag taagctt
//
LOCUS       DROD1P       1734 bp ss-mRNA            INV       24-JUL-1990
DEFINITION  D.melanogaster chromosomal protein D1 mRNA, complete cds.
ACCESSION   J04725
KEYWORDS    chromosomal protein.
SOURCE      D.melanogaster (strain Oregon R) embryo, cDNA to mRNA, (library of
            T.-S.Hsieh), clones D1-[9,9.14,15].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1734)
  AUTHORS   Ashley,C.T., Pendleton,C.G., Jennings,W.W., Saxena,A. and
            Glover,C.V.C.
  TITLE     Isolation and sequencing of cDNA clones encoding Drosophila
            chromosomal protein D1: A repeating motif in proteins which
            recognize AT DNA
  JOURNAL   J. Biol. Chem. 264, 8394-8401 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by C.V.C.Glover, 29-MAR-1989.
FEATURES       from  to/span     description
    pept        228     1295     chromosomal protein D1
    mRNA    <     1     1734     D1 mRNA
    signal     1599     1604     polyadenylation signal
    signal     1717     1722     polyadenylation signal
BASE COUNT      558 a    410 c    462 g    304 t
ORIGIN      Map position 85D1-2; polytene band.
        1 acatcactac tttgaactgt actagcgtgc gcgtatttta caattacaaa ttgagaaggc
       61 aaacttgatt ttcggcaaac aaaaaggtga agaagaagag cggagccaag cgaaaagagt
      121 gagcgtgtgt tcgtgagaga gagcatttgt gcgcgtgtgt gtgtgagggg acgcgaagaa
      181 acaaggattt tttttattcg cttgctaatt gaatcccgta cagagaaatg gaggaagttg
      241 cggtaaagaa gcgcggccga ccgtcaaagg catccgtcgg tggtaaatct tcaacagcgg
      301 cggttgcagc aatctcgcca ggcatcaaaa agcgcggtcg tcccgccaag aacaagggct
      361 ccagtggcgg tggtggacag cggggtcgcc caccgaaggc gtcaaaaatc caaaatgacg
      421 aagatcccga agacgagggg gaggaggacg gcgatggcga tggctccggc gctgagcttg
      481 caaacaactc atcaccctcg ccgacgaagg gcaggggacg acccaagagc agtggtggcg
      541 ccggatcagg atccggggat tcggtgaaga cgcccggttc cgccaagaag cgcaaggccg
      601 gcaggcccaa gaagcaccag cccagcgata gcgagaatga agacgatcag gatgaggacg
      661 acgacggcaa cagcagcatc gaagaacgtc gcccagtggg ccgaccgtcg gctggatccg
      721 tcaacctgaa catatcgcgc acgggacgcg gcctgggaag acccaagaag cgggctgtcg
      781 agtccaatgg cgatggggag ccacaagtgc ccaagaaacg cggtcgtcca ccacagaaca
      841 agtcgggtag cggtggcagt actggctacg tacccaccgg tcgtccgcgc ggtcgtccaa
      901 aggccaatgc tgcacccgtc gagaagcacg aggacaatga cgacgatcag gacgatgaga
      961 actccgggga ggaggaacac agctcgcctg agaagacggt agtggcgccg aagaaacgtg
     1021 gacgtccctc tcttgccgct ggcaaggtgt ccaaagagga aacaacaaag ccccgtagtc
     1081 gcccagcgaa aaatattgat gacgatgcag atgatgctga ttccgccgat cagggccaac
     1141 acaactccaa gaaggagtcg aacgacgagg accgtgccgt ggatgggaca ccaaccaaag
     1201 gagatggcct caaatggaac tccgatggcg aaaatgatgc caacgacggt tatgtttcgg
     1261 ataactacaa cgattccgaa tcggtagctg cctaaaacat acaatttata aacatttcca
     1321 cacacccctc cccaaaaatc aacacaaaaa aaaaaagaat gttgtaaaag caaaaaaaaa
     1381 aaaaacgaaa aaaaatctgc aagtaagaga agaaaacaca aaaatggata ggacctactt
     1441 ggactttaga gaaccttccc ctcctaaaac aacacacttt gtctacatat atattccgat
     1501 ttattgctga gaaatcggac aaaccgttta caactgcact gcactgtaat gacgacacat
     1561 agctcggaaa acgatttcaa cttggattga aattaccaaa taaataacac ataaataaca
     1621 tatttgggga agatttcaag ttccgaagta gaactgaact ttattgtaac aacaaatttg
     1681 atttgattgc agaaatttca atcgtataca agtttaaata aacctactat acct
//
LOCUS       ECOESTA2A     549 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  E.coli estA2 gene encoding heat-stable enterotoxin A2 (STa2)
            protein, complete cds.
ACCESSION   M18345
KEYWORDS    STa2 protein; estA2 gene; heat-stable enterotoxin A2.
SOURCE      E.coli (strain CRL 25090; isolated from human in Bangladesh) HB101
            pGK22 cell line DNA, clones 2,12,21, and 27.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 549)
  AUTHORS   Guzman-Verduzio,L.M. and Kupersztoch,Y.M.
  TITLE     Rectification of two Escherichia coli heat-stable enterotoxin
            allele sequences and lack of biological effect of the change of the
            carboxyterminal tyrosine for histidine
  JOURNAL   Infect. Immun. 57, 645-648 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.M.Kupersztoch, 08-NOV-1988.
FEATURES       from  to/span     description
    pept        149      367     heat-stable enterotoxin A2 precursor
    sigp        149      205     heat-stable enterotoxin A2 signal peptide
    matp        308      364     heat-stable enterotoxin A2
    mRNA        132      549     heat-stable enterotoxin A2 mRNA
BASE COUNT      182 a     93 c    115 g    159 t
ORIGIN      Unreported.
        1 gaattccccg ccctaaaaca taatattatt atgctcttcg tagcggagag tatagtatga
       61 tacacatcac aaaaaaaaaa taaaaaagtt tgcgcaatcg ttctgatttt gatttaaata
      121 ttcgtggacg acgtgtttcg gaggtaatat gaaaaaatca atattattta tttttctttc
      181 tgtattatct ttttcacctt tcgctcagga tgctaaacca gcagggtctt caaaagaaaa
      241 aattacacta gaatcgaaaa aatgtaacat tgtaaaaaaa aataatgaaa gtagtcctga
      301 aagcatgaat agtagcaatt actgctgtga attgtgttgt aatcctgctt gtaccgggtg
      361 ctattaataa tataaaggga actaaacagt tccctttata tttgtgtgcg ccgtggctgg
      421 cgctgttctt caactgtgga ggctgaagaa cgactaagag gtgaaagtcc tccacacacc
      481 cggtgagggg aagtgttagc ggaaggcaag gtgatcctac ccacgtaata tggacacagg
      541 ggaattcgt
//
LOCUS       ECOESTA3A     403 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  E.coli estA3 gene encoding heat-stable enterotoxin A3 (STa3)
            protein, complete cds.
ACCESSION   M18346
KEYWORDS    STa3 protein; estA3 gene; heat-stable enterotoxin A3.
SOURCE      E.coli (strain 153837-2; isolated from human in Bangladesh) HB101
            pYK117 cell line DNA, clones 5 and 7.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 403)
  AUTHORS   Guzman-Verduzio,L.M. and Kupersztoch,Y.M.
  TITLE     Rectification of two Escherichia coli heat-stable enterotoxin
            allele sequences and lack of biological effect of the change of the
            carboxyterminal tyrosine for histidine
  JOURNAL   Infect. Immun. 57, 645-658 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.M.Kupersztoch, 08-NOV-1988.
FEATURES       from  to/span     description
    pept        147      365     heat-stable enterotoxin A3 precursor
    sigp        147      203     heat-stable enterotoxin A3 signal peptide
    matp        204      362     heat-stable enterotoxin A3
    mRNA        130  >   403     heat-stable enterotoxin A3 mRNA
BASE COUNT      139 a     61 c     72 g    131 t
ORIGIN      Unreported.
        1 ttttcggtcg ccgaaaaaga taatattact atgctcttcg tagcggagag tatagtatga
       61 tgttcatcac aaaaaaaata aaaaagtttg cgcaaccgtt ctgattttga ttcaaatgtt
      121 cgtggatgcc atgttccgga ggtaatatga agaaatcaat attatttatt tttctttctg
      181 tattgtcttt ttcacctttc gctcaggatg ctaaaccagt agagtcttca aaagaaaaaa
      241 tcacactaga atcaaaaaaa tgtaacattg caaaaaaaag taataaaagt ggtcctgaaa
      301 gcatgaatag tagcaattac tgctgtgaat tgtgttgtaa tcctgcttgt accgggtgct
      361 attaataata taaagggaac taaacagttc cctttatatt tgt
//
LOCUS       HIV2ISY      9636 bp ss-RNA             VRL       24-JUL-1990
DEFINITION  Human immunodeficiency virus type 2, isolate SBLISY, complete
            genome.
ACCESSION   J04498
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 2 (HIV-2) proviral DNA, isolate
            SBLISY.
  ORGANISM  Human immunodeficiency virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9636)
  AUTHORS   Franchini,G., Fargnoli,K.A., Giombini,F., Jagodzinski,L.,
            De Rossi,A., Bosch,M., Biberfeld,G., Fenyo,A.M., Albert,J.,
            Gallo,R.C. and Wong-Staal,F.
  TITLE     Molecular and biological characterization of a replication
            competent human immunodeficiency type 2 (HIV-2) proviral clone
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 2433-2437 (1988)
  STANDARD  full staff_review
COMMENT     Clean copy of sequence [1] kindly provided by G. Franchini,
            30-JAN-1989.
FEATURES       from  to/span     description
    pept        547     2109     gag polyprotein
    pept    <  1827     4934     pol polyprotein (NH2 terminus uncertain; AA at
                                 1827)
    pept       4864     5511     vif protein
    pept       5339     5677     vpx protein
    pept       5677     5994     vpr protein
    pept       5840     6135     tat, exon 2 (first expressed exon)
               8281     8377     tat, exon 3 (AA at 8282)
    pept       6066     6135     rev, exon 2 (first expressed exon)
               8281     8717     rev, exon 3 (AA at 8283)
    pept       6145     8685     envelope polyprotein
    pept       8519     9289     nef protein
    pre-msg       1     9636     genomic mRNA
    pre-msg       1     9636     tat, rev, nef subgenomic mRNA
    IVS        6136     8280     tat cds intron 2
    IVS        6136     8280     rev cds intron 2
    IVS        6136     8280     tat, rev, nef subgenomic mRNA intron 2
    LTR     <     1      300     5' LTR
    LTR        8904  >  9636     3' LTR
    rpt           1      174     R repeat 5' copy
    rpt        9457     9636     R repeat 3' copy
    binding     304      320     primer (Lys-tRNA) binding site
    signal     9609     9614     mRNA polyadenylation signal
BASE COUNT     3291 a   1940 c   2396 g   2009 t
ORIGIN      
        1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
       61 gtagagcctg ggtgttccct gctggactct caccagtgct tggccggcgc tgggcagacg
      121 gctccacgct tgcttgctta aaagacctct taataaagct gccagttaga agcaagttaa
      181 gtgtgtgttc ccatctctcc tagtcgccgc ctggtcattc ggtgttctcc tgagtaacaa
      241 gaccctggtc tgttaggacc cttcttgctt tgggaaaccg aggcaggaaa atccctagca
      301 ggttggcgcc cgaacaggga cttgaggaag actgagaagc cttggaacac ggctgagtga
      361 aggcagtaag ggcggcagga acaaaccacg acggagtgct cctagaaagg cgcaggccaa
      421 ggtaccaaag gcggcgtgtg gagcgggagt caagaggcct ccgggtgaag gtaagtacct
      481 acaccaaaaa ctgtagccgg aaaaggcttg ttatcctacc tttagacagg tagaagattg
      541 tgggagatgg gcgcgaaaaa ctccgtcttg agagggaaaa aggcagatga attagaaaaa
      601 attaggttac ggcccggcgg gaagaaaaaa tacagattaa aacatattgt gtgggcagcg
      661 aatgaattgg acagattcgg attaacagag agcctgttgg agtcaaaaga aggttgccaa
      721 aaaattattt cagttttaga accattagta ccaacagggt cagaaaattt aaaaagcctt
      781 tataatacta cctgcgtcat ttggtgcttg cacgcagaag agaaagtgaa agatactgaa
      841 gaagcaaaaa gaatagtagg gagacatcta gtggcagaaa cagaaactgc agagaaaatg
      901 ccaaatataa gtagaccaac agcaccacct agcgggaaag ggggaaactt ccccgtgcaa
      961 caaataggcg gcaactatgt ccatctgccg ctgagtcccc gaaccctaaa tgcttgggta
     1021 aagttagtag aggagaaaaa gttcggggca gaagtagtgc cgggatttca ggcactctca
     1081 gaaggctgca cgccctatga tattaatcaa atgcttaatt gtgtgggcga ccatcaagca
     1141 gcgatgcaaa taatcagaga aattattaat gaagaagcag cagactggga tgtacaacat
     1201 ccaataccag gccccttacc agcggggcag ctcagagatc cacgaggatc tgacatagca
     1261 gggacaacaa gcacagtaga ggaacagatc gaatggatgt ataggcaaga aaatcctgta
     1321 ccagtaggaa acatctatag gagatggatc cagataggac tgcagaagtg tgtcagaatg
     1381 tacaatccaa ccaacattct agacataaaa caaggaccaa aagagtcgtt ccaaagctat
     1441 gtggatagat tctacaaaag cttaagggca gaacagacag atgcagcagt gaagaattgg
     1501 atgacccaga cgctgctagt gcaatcgaac ccagactgta agttagtact aaagggacta
     1561 gggatgaatc ctaccttaga agagatgcta accgcctgtc aagggatagg tggaccaggc
     1621 cagaaggcca gactaatggc agaagcctta aaagaggcca tgcgaccagc ccctatccca
     1681 tttgcagcag cccaacagaa aagggcaatt aagtgttgga attgtggaaa ggaagggcac
     1741 tcggcaagac aatgccgagc gcctagaaga cagggctgct ggaaatgtgg caagtcagga
     1801 cacatcatgg caaactgccc agatagacag gctggttttt tagggcttgg accatgggga
     1861 aagaagcccc gcaacttccc cgtggtccca agttcgcagg ggctaacacc aacagcaccc
     1921 ccaatggatc cagcagtgga cctactggag aagtacatgc agcaagggag aaaacagaga
     1981 gagcagagac aaagaccata caaagaagtg acagaggact tgctgcatct cgagcaagga
     2041 gagacaccac acagagagac gacagaggac ttgctgcacc tcaattctct ctttggaaac
     2101 gaccagtagt cacagcatac attgaggatc agccagtaga agttttacta gacacagggg
     2161 ctgacgactc aatagtagca ggaatagagt tagggagcaa ttatagtcca aaaatagtag
     2221 ggggaatagg aggattcata aataccaaag aatataaaga tgtagaaata agagtgctaa
     2281 ataaaaaggt aagagccacc ataatgacag gtgatacccc aatcaacatt tttggcagaa
     2341 atatcctgac agccttgggc atgtcattaa atttaccagt cgccaaaata gaaccagtaa
     2401 aagtaacatt aaagccagga aaagatgggc caaaacaaag acaatggccc ttaacaagag
     2461 aaaaaataga agcactaaga gaaatctgtg aaaaaatgga aagagaaggt cagctagaag
     2521 aagcgcctcc aactaatccc tataataccc ctacatttgc aattaagaaa aaggacaaaa
     2581 acaaatggag gatgctgata gattttagag aactaaacaa ggtaactcaa gatttcacag
     2641 aggttcagtt aggaattcca cacccagcag gattagccaa gaaaagaaga attactgtgt
     2701 tagatgtagg agatgcctac ttttccatac ccctatatga ggattttaga cagtatactg
     2761 catttactct gccatcagta aacaatgcag aaccaggaaa aagatatata tacaaagtct
     2821 taccacaggg atggaagggg tcaccagcaa tttttcaata cacaatgagg caagtcttag
     2881 aaccattcag aaaagcaaac ccagatgtca ttatcgttca gtacatggat gatatcttaa
     2941 tagctagtga caggacagat ttggaacatg acaaagtagt cctacagcta aaggaacttc
     3001 taaatggcct gggattttcc accccagacg aaaagttcca aaaggaccct ccataccaat
     3061 ggatgggcta tgaactgtgg ccaaccaaat ggaaattgca aaaaatacaa ttgccccaaa
     3121 aggaagtatg gacagttaat gacatccaga aactagtggg tgtcctaaac tgggcggcac
     3181 aaatctaccc aggaataaag accaaacact tatgtaagct aattagagga aagatgacac
     3241 ccacggaaga agtacagtgg acagaattag cagaagcaga gctggaggaa aacaaaatta
     3301 tcttaagcca ggaacaggag ggacactatt accaagagga aaaagagtta gaagcaacag
     3361 ttcaaaagga tcaagacaat cagtggacat ataaagtaca ccagggagaa aaaattctaa
     3421 aagtaggaaa atatgcaaag ataaaaaata cccataccaa cggggtcagg ttgttggcac
     3481 aggtagttca gaaaatagga aaagaagcac tagtcatttg gggacgaata ccaaaatttc
     3541 acctaccagt agaaagagag acctgggaac agtggtggga taactattgg caagtgacat
     3601 ggatcccaga ctgggacttc gtatccaccc caccgttggt caggttagca tttaacctgg
     3661 taaaagatcc tataccaggc gcagagacct tctacacgga tggatcttgc aataggcaat
     3721 caaaagaggg aaaagcagga tatataacag atagaggaaa agacaaagta aggatattag
     3781 agcaaactac caaccaacaa gcagaattag aagcctttgc aatggcagta acagactcag
     3841 gtccaaaagt caatattgta gtagactcac agtatgtaat gggaatagta acaggccaac
     3901 cggctgaatc agagagtaga atagtaaata aaattataga agagatgata aaaaaggaag
     3961 caatctatgt tgcatgggtc ccggcccaca aaggcatagg aggaaatcaa gaaattgacc
     4021 acttagtaag tcagggcatc agacaagtat tattcctaga gagaatagag cccgctcagg
     4081 aagaacatgg aaaatatcat agcaatgtaa aagaactagc ccataagttt ggattaccca
     4141 acctggtggc aagacaaata gtaaacacat gtgcccagtg ccaacaaaaa ggggaagcta
     4201 tacatgggca agtaaatgca gaactaggca cctggcaaat ggactgcaca cacttagaag
     4261 gaaaaatcat tatagtagca gtacatgttg caagtggatt tatagaagca gaagtcatcc
     4321 cacaggaatc aggaaggcaa acagcactct tcctattaaa actggccagt aggtggccaa
     4381 taacacactt gcacacagat aatggtgcca acttcacttc acaggaggta aagatggtag
     4441 catggtgggt aggcatagaa caatcctttg gagtacctta caatccacaa agccagggag
     4501 tagtagaagc aatgaatcac cacctgaaaa atcagataga aagaattaga gagcaggcaa
     4561 atacaatgga aacaatagta ctaatggcag ttcattgcat gaattttaaa agaaggggag
     4621 gaatagggga tatgacccca gtagaaagac tagtcaatat gatcaccaca gaacaagaaa
     4681 tacaattcct ccaagcaaaa aattcaaaat taaaaaattt tcgggtctat ttcagagaag
     4741 gcagaaatca actgtggcaa ggacctgggg agctactgtg gaaaggggac ggagcagtca
     4801 tagtcaaggt agggacagat ataaaagtaa taccaagaag aaaggccaag atcatcagag
     4861 actatggacc aaggcaagag atggatagcg gttcccacct ggagggtgcc agggaggatg
     4921 gagaaatggc atagccttat caagtatcta aaatacagaa caaaagatct agaacaggtg
     4981 cgctatgttc cccaccataa ggtggggtgg gcatggtgga cttgcagcag ggtaatattc
     5041 ccattaaaag gaaacagtca tctagagata caggcatatt ggaacctaac accagaaaaa
     5101 ggatggctct cctcttattc agtaagaatg acttggtact cagaagggtt ctggacagat
     5161 gttaccccag actgtgcaga caccctaata cacagcactt atttctcttg ctttacggca
     5221 ggtgaagtaa gaagagccat caggggagaa aagtcattgt cctgctgcaa ttatccccaa
     5281 gcccataagt ccaaggtacc gtcactccaa tttctggcct tagtagtagt acagcaaaat
     5341 gacaaacccc agagagacaa taccaccagg aaacagtggc gaagaaacta tcgaagaggc
     5401 cttcgattgg ctagacagga cggtagaagc cataaacaga gaggcagtga accacctgcc
     5461 cagggagctt attttccagg tgtggcaaag gtcctggaga tactggcatg atgagcaagg
     5521 gatgtcacga agctacacaa agtatagata tttgtgctta atgcagaaag ctgtgttcat
     5581 gcatttcaag aaagggtgca cttgccgggg ggaaggacat gggccaggag ggtggagatc
     5641 aggacctccc cctcctcctc ccccaggttt agtctaatga ctgaagcacc agcagagttt
     5701 cccccggagg atgggacccc accgagggaa ccaggggatg agtgggtaat agaaattctg
     5761 agagaaataa aggaagaagc tttaaagcat tttgaccctc gcttgctaac tgctcttggc
     5821 tactatatct atactagaca tggagacacc cttgaaggcg ccagagagct cattagggtc
     5881 ctacaacgag ccctcttcac gcacttcaga gcaggatgtg gccactcaag aattggccaa
     5941 ccaaggggaa gaaatcctct ctcagctata ccgaccccta gaaacatgca ataacaaatg
     6001 cttttgtaag gggtgctgct tccattgcca gctgtgtttt ttaaacaagg ggctcgggat
     6061 atgttatgac cgaaagggca gacgaagaag gagtccgaag aaaactaagg ctcattcgtc
     6121 tcctgcatca gacaagtgag tacaatgagt ggtaaaattc agctgcttgt tgcctttctg
     6181 ctaactagtg cttgcttaat atattgcacc aaatatgtga ctgttttcta tggagtaccc
     6241 gtgtggaaaa atgcatccat tcccctcttt tgtgcaacta aaaatagaga tacttgggga
     6301 accatacagt gcttgccaga caatgatgat tatcaagaga tacctttgaa tgtaacagag
     6361 gcttttgacg catgggataa tatagtaaca gaacaagcag tagaagatgt ctggaatcta
     6421 tttgagacat caataaaacc atgtgtcaaa ttaacgcctt tatgtgtaac aatgaactgt
     6481 aacgcaagta cagagagcgc agttgcaact acaagcccat ctggacctga tatgataaat
     6541 gatactgatc catgcataca attgaacaat tgctcaggac tgagggagga agacatggtc
     6601 gagtgtcagt tcaatatgac aggactagag ttagataaga aaaaacagta tagtgaaacc
     6661 tggtactcaa aagatgtggt ttgtgaatca gataacagca cagaccgaaa aagatgttac
     6721 atgaaccatt gcaacacatc agtcatcaca gagtcatgtg acaagcacta ttgggatgct
     6781 atgagattta gatactgtgc accaccgggt tttgtcttgc taaggtgcaa tgataccaat
     6841 tactcaggct ttgagcccaa ttgctctaaa gtagtagctt ctacatgtac aagaatgatg
     6901 gaaacgcaac cttctacttg gcttggcttt aatggcacta gggcagaaaa tagaacatat
     6961 atctattggc atggtaggga taacagaact attatcagct taaacaaata ttataatctc
     7021 accatacttt gtaggagacc agaaaataaa acagttgtac caataacact catgtcaggc
     7081 cgcagatttc actcccagaa gatcatcaat aaaaaaccca ggcaagcatg gtgccggttc
     7141 aaaggcgagt ggagggaagc catgcaggag gtgaaacaaa cccttgtaaa acatcccagg
     7201 tataaaggaa ccaatgacac aaataaaatt aactttacag caccagaaaa agactcagac
     7261 ccagaagtag catatatgtg gactaactgc agaggagaat tcctctattg caacatgact
     7321 tggttcctta attgggtaga aaacaagacg ggtcaacagc ataactatgt gccgtgccat
     7381 atagagcaaa taattaatac ctggcataag gtagggaaaa atgtatattt gcctcctagg
     7441 gaaggagagt tgtcctgcga atcaacagtg accagtatca ttgctaacat tgatgttgat
     7501 ggagataacc ggacaaatat tacctttagt gcagaggtgg cagaactata ccgattggaa
     7561 ttgggggatt ataaattagt agaagtaaca ccaattggct tcgcccctac agcagaaaaa
     7621 agatactcct ctgctccagg gagacataag agaggtgtgc ttgtgctagg gttcctaggt
     7681 tttctcacga cagcaggtgc tgcaatgggg gcggcgtctc tgacgctgtc ggctcagtct
     7741 cggactttat tccgtgggat agtgcagcaa cagcaacagc tgttggacgt ggtcaagaga
     7801 caacaagaaa tgttgcgact gaccgtctgg ggaactaaaa acctccaggc aagagtcact
     7861 gctattgaga agtacctagc agaccaggcg cgactaaatt catggggatg tgcgtttaga
     7921 caagtctgcc acactactgt accatgggta aatgacacct taacacctga gtggaacaac
     7981 atgacatggc aagaatggga acacaaaatc cgcttcctag aggcaaatat cagtgagagt
     8041 ttagaacagg cacaaatcca gcaagaaaag aatatgtatg agctgcaaaa gctaaatagc
     8101 tgggatgttt ttggcaattg gtttgactta acctcctgga tcaagtatat tcaatatgga
     8161 gtcatgatag tagtaggaat agtagctctc agaatagtaa tatatgtagt acaaatgcta
     8221 agtagactta gaaagggcta taggcctgtt ttctcttccc cccccggtta tattcaacag
     8281 atccatatcc acaaggactg ggaacagcca gacagagaag aaacagaaga agacgttggg
     8341 aacgacgttg gaagcagatc ctggccttgg ccgatagaat atatacattt cctgatccgc
     8401 ctgctgatcc gcctcttgac cagactatac aacagctgca gggacttact atccagactc
     8461 tacctgatcc tccaaccact cagagactgg ctgagactca aggcagccta cctgcagtat
     8521 gggtgcgagt ggatccaaga agcgttccag gccctcgcga gggttacaag agagactctt
     8581 acgagcgcgg ggaggagctt gtggggggct ctgggacgaa tcggaagggg gatactcgca
     8641 gttccacgaa ggatcaggca gggagcagaa attgccctcc tgtgagggac agagatatca
     8701 gcaaggagac tttatgaata ccccatggag aaccccagca acagaaaagg aaaaagaatc
     8761 gtacaggcaa caaaatatgg atgatgtaga ttcagatgat gatgacctag taggggtctc
     8821 tgacacatca agagtaccat tgagagcaat gacatataga atggcagtag acatgtcaga
     8881 tttaataaaa gataaggggg gactggaagg gatgtattac agtgagagaa gacatagaat
     8941 cctagacata tacttagaaa aggaagaagg gataattcca gattggcaga actatactca
     9001 tgggctagga gtaaggtacc caatgttctt tgggtggcta tggaagctag taccagtaac
     9061 tgtcccacaa gaaggggagg acactgagac tctctgctta atgcactcag cacaagtaag
     9121 cagatttgat gacccgcatg gggagacact agtctggaag tttgacccca tgctggctca
     9181 tgagtacacg acctttattc tatacccaga ggaatttggg cacaagtcag gaatggaaga
     9241 agatgactgg aaggcaaaac tgaaagcaag agggatacca tttagttaaa aacaggaaca
     9301 accatacttg gtcaggacag gaagtagcta ctgaaaacag ctgagactgc agggactttc
     9361 cagaaggggc tgtaaccagg ggagggacat gggaggagct ggtggggaac gccctcatac
     9421 tttctgtata aatgtacccg ctgctcgcat tgtattcagt cgctctgcgg agaggctggc
     9481 agattgagcc ctgggaggtt ctctccagca ctagcaggta gagcctgggt gttccctgct
     9541 ggactctcac cagtgcttgg ccggcgctgg gcagacggct ccacgcttgc ttgcttaaaa
     9601 gacctcttaa taaagctgcc agttagaagc aagtta
//
LOCUS       MUSIGHAAL     411 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig productively rearranged H-chain V-region VDJ mRNA from
            hybridoma HY2-2C8.
ACCESSION   M22438
KEYWORDS    immunoglobulin; immunoglobulin heavy chain; joining exon;
            variable region.
SOURCE      Mouse (strain SM/J) hybridoma HY2-2C8 cell, cDNA to mRNA, clone 1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 411)
  AUTHORS   Hardy,R.R., Carmack,C.E., Shinton,S.A., Riblet,R.J. and Hayakawa,K.
  TITLE     A single V-H gene is utilized predominantly in anti-BrMRBC
            hybridomas derived from purified Ly-1 B cells: Definition of the
            V-H11 family
  JOURNAL   J. Immunol. 142, 3643-3651 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            C.E.Carmack 03-FEB-1989.
FEATURES       from  to/span     description
    pept          1  /   411     Ig heavy-chain (V-J-region) precursor
    sigp          1       57     Ig heavy-chain (V-J-region) signal peptide
    matp         58  /   411     Ig heavy-chain (V-J-region)
    recomb      147      148     framework 1 end/CDR 1 start
    recomb      162      163     CDR 1 end/framework 2 start
    recomb      204      205     framework 2 end/CDR 2 start
    recomb      255      256     CDR 2 end/framework 3 start
    recomb      351      352     framework 3 end/CDR 3 or D-region start
    recomb      360      361     CDR 3 or D-region end/J-region start
BASE COUNT      100 a     86 c    118 g    107 t
ORIGIN      Chromosome 12.
        1 atggagtggg aactgagctt aattttcatt tttgctcttt taaaagatgt ccagtgtgaa
       61 gtgcagctgt tggagactgg aggaggcttg gtgcaacctg gggggtcacg gggactctct
      121 tgtgaaggct cagggtttac ttttagtggc ttctggatga gctgggttcg acagacacct
      181 gggaagaccc tggagtggat tggagacatt aattctgatg gcagtgcaat aaactacgca
      241 ccatccataa aggatcgatt cactatcttc agagacaatg acaagagcac cctgtacctg
      301 cagatgagca atgtgcgatc tgaggacaca gccacgtatt tctgtatgag atatgggccc
      361 tactggtact tcgatgtctg gggcacaggg accacggtca ccgtctcctc a
//
LOCUS       MUSIGHAAM     411 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig productively rearranged H-chain V-region VDJ mRNA from
            hybridoma HY1-1H2.
ACCESSION   M22439
KEYWORDS    immunoglobulin; immunoglobulin heavy chain; joining exon;
            variable region.
SOURCE      Mouse (strain SM/J) hybridoma HY1-1H2 cell, cDNA to mRNA, clone 1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 411)
  AUTHORS   Hardy,R.R., Carmack,C.E., Shinton,S.A., Riblet,R.J. and Hayakawa,K.
  TITLE     A single V-H gene is utilized predominantly in anti-BrMRBC
            hybridomas derived from purified Ly-1 B cells: Definition of the
            V-H11 family
  JOURNAL   J. Immunol. 142, 3643-3651 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            C.E.Carmack 03-FEB-1989.
FEATURES       from  to/span     description
    pept          1  /   411     Ig heavy-chain (V-J-region) precursor
    sigp          1       54     Ig heavy-chain (V-J-region) signal peptide
    matp         55  /   411     Ig heavy-chain (V-J-region)
    recomb      141      142     framework 1 end/CDR 1 start
    recomb      159      160     CDR 1 end/framework 2 start
    recomb      204      205     framework 2 end/CDR 2 start
    recomb      252      253     CDR 2 end/framework 3 start
    recomb      348      349     framework 3 end/CDR 3 or D-region start
    recomb      360      361     CDR 3 or D-region end/J-region start
BASE COUNT       94 a    115 c     96 g    106 t
ORIGIN      Chromosome 12.
        1 atgagagtgc tgggattttt gtgcctggtg acagtccttc ctggtagcct gtctcagatg
       61 cagcttcagg agtcaggacc tggcctggtg aaaccctcac agtcactctt cctcgcctgc
      121 tctattactg gtttccccat caccagtggt tactactgga tctggatccg tcagtcacct
      181 gggaaacccc tagaatggat ggggtacatc actcatagtg gggaaacttt ctacaaccca
      241 tccctccaga gccccatctc cattactaga gaaacatcca agaaccagtt ctttctgcaa
      301 ttgaactctg tgaccacaga ggacacagcc atgtattact gtgcaggaga ctatgatggt
      361 tactggtact tcgatgtctg gggcacaggg accacggtca ccgtctcctc a
//
LOCUS       RATCKBR      4360 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Rat creatine kinase gene, complete cds.
ACCESSION   M18668 M26669
KEYWORDS    creatine kinase.
SOURCE      Rat (Sprague Dawley) brain DNA, clone lambda ckb-2.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4360)
  AUTHORS   Benfield,P.A., Graf,D., Korolkoff,P.N., Hobson,G. and Pearson,M.L.
  TITLE     Isolation of four rat creatine kinase genes and identification of
            multiple potential promoter sequences within the rat brain creatine
            kinase promoter region
  JOURNAL   Gene 63, 227-243 (1988)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.A.Benfield, 23-MAR-1988.
FEATURES       from  to/span     description
    pept       1276     1468     creatine kinase, exon 2 (first expressed exon)
               1656     1810     creatine kinase, exon 3
               1888     2020     creatine kinase, exon 4
               2197     2368     creatine kinase, exon 5
               2941     3064     creatine kinase, exon 6
               3174     3363     creatine kinase, exon 7
               3443     3621     creatine kinase, exon 8
    pre-msg     941     3818     creatine kinase mRNA and introns
    IVS        1015     1254     creatine kinase intron A
    IVS        1469     1655     creatine kinase intron B
    IVS        1811     1887     creatine kinase intron C
    IVS        2021     2196     creatine kinase intron D
    IVS        2369     2940     creatine kinase intron E
    IVS        3065     3173     creatine kinase intron F
    IVS        3364     3442     creatine kinase intron G
BASE COUNT      801 a   1314 c   1278 g    967 t
ORIGIN      
        1 aagtactgct aaacctactt tccagagaag atgaaactgc tggcagctgg aggcagggtg
       61 tgaacgcctg tgttcacaca ttccaattca cagccggcac tggctgtggt caaattccag
      121 gtttccccgt acctgcgtgt gtgggacacg gggtgggggt caaagagact ctttcaatct
      181 ggtggatcta ttaatagtta tttttgactg gctgcttcct ccacccccag tttgccctgc
      241 cgtaggtact gtctttgtag gtggggtggt agaaaccggt ccctcaggaa agaacctggg
      301 gatttgaaga ttcaaaaaca gtctctagga gctcagtgtc ttaatttttc agggtccggg
      361 tccaagggtc ttggctaggt tccttagggc ccgcccaagg tcagaacacc tggtcttccg
      421 gcggacgagc cagatgcgct ctgtgtggac caagaacacc caggagatgc ccgcaggcac
      481 cttgaggaag gtcagctccc ctgtaccgtt ttgtaccgcc tggatgcagt ggatggaagc
      541 cgtaccccat gctaaaaacc tggtatctct tgccatgacc ccctcaacac tgctgtcggc
      601 ctgacaaggc aggaagcagg gaaggcagga acctcaagtg ctaactggct ttccagactc
      661 agtctctcgc cccaggcatg tgtgtgaagc aggtctaggt atctacggtc ctgggatttc
      721 gtcttcgaga tcctgagcga gcgccacggg gttgaccaaa ctgcgcagcg gggtcgagac
      781 ttggggacca tggggtgggc cgctgggggt gtcgagggtt gttgcctcgg acaaagcggc
      841 ggcaccaccc caaagcgcgg gccaatggaa tgaatgggct ataaatagcc gccaatggga
      901 ggccggcgac gcgcccctta agagctcagg gagcagcgag cggccgtcgt tcttctgcgt
      961 gcgcccggag ctgcaagcac aagcagtctg cgttcctgct ccgtcggaat cccggtgagc
     1021 gggtcggagg gtgaggggct agtcctgagg gcgtacgcgg ttgggtgcat cacgcgcagc
     1081 tcgttgcccg gagcgcgtcc ggaccgccgg ggaacttggg atgcgccgga ctcaaaagat
     1141 gcagacctcg ctgaccttgg tgacgtccga ggaaagcttc gggggtccgg attggtctgc
     1201 agggtctctg gagaccggtg tgtaaagctc ctctgacccc gctcttcccc gcagcctgcc
     1261 gccgccgccg ccgccatgcc cttctccaac agccacaaca cgcagaagct gcgcttcccg
     1321 gccgaggatg aattccctga tctgagcagc cacaacaacc atatggccaa ggtgctgacc
     1381 cccgagctgt acgcggagct ccgtgccaag tgcacgccga gcggctttac gttggacgac
     1441 gccatccaga ctggcgtaga caatccgggt atgcacaccc ttgtagcgtc aggctttcgt
     1501 ctccccaaca agccccccgg gcaaggatcc cactgctctt ccctgaacct cggcggggtg
     1561 gggtctcctg gtcctctctg cgcttagcca taagggcctc agtgccttgc tctagcttgg
     1621 cgctgacgtc actgtcgccg cgccccatcc tccaggccac ccgtacatca tgacagtggg
     1681 tgcagtggcg ggcgacgagg agagttacga cgtattcaag gaccttttcg accccatcat
     1741 cgaggaccgg cacggcggct accagcccag tgatgagcac aagactgacc tcaacccaga
     1801 caacctgcag gtgcggggcg tcagggtccg ggcgtgctgg ggagaggggg tctcggcgct
     1861 cactccggcc actgcttgca tccctagggc ggcgatgacc tggaccccaa ctacgtgctg
     1921 agctcgcggg tgcgcacagg ccgaagcatc cgcggcttct gcctcccccc tcactgcagc
     1981 cgtggggagc gccgcgccat cgagaagctg gcagtagaag gtagggtccg gtatcagcca
     2041 ccagagctgc tgcgttcttg tctgggcgcg cagcgttccc ccagggtctg agggtggaag
     2101 acccaactaa gacagcgggg acactgccgc gggcgggtgg cgagctttct agctcgccca
     2161 cccacccacc ggctccttga actctgtact ccgcagccct gtccagccta gatggcgacc
     2221 tgtctggcag gtactatgcg ctcaagagca tgaccgaggc ggacgagcag cagctcattg
     2281 acgaccactt cctcttcgac aagcctgtgt cgcctctgct gctggcctcc gcgatggccc
     2341 gcgactggcc ggatgctcgc ggcatttggt acgagccctc ttccctcaca gtccccggaa
     2401 ggtggggcac ggcctgaatt cgtagattgt gcgttaccac actgtgcgca ggtgaccttg
     2461 gttctgcgga tgactgaggt ccctgcgccg cgggaggccg cagtctccag ggattcaagg
     2521 gtggtgacca gtttctttgg cgtctgttct ccgccctcct cctgggagcc ggcgcttctt
     2581 gttttctctc ctccttctca cccccttatt ccgccgggat tctgccaggt gccagtgacg
     2641 caaaagcctc cgcacccgtc cggcagggct cctacccctg cagactgcgc gggcgagcgt
     2701 gcttcctctg gtgggctgct ctggctgctc cagccccttg cgggccacgc agcacatgac
     2761 tggtgattga ggtgccctgg tctccctgcc ttcgagtgtt gctgggtggg cgtcttagag
     2821 acagcccaca ctcctgcact tggaaagcgc tttggaggtc acccacctag ggcgtcctgc
     2881 tgggtgtgac ctaactaagg ctgtagactt tgactttgtc tgcccgaccc cctcctccag
     2941 gcacaatgac aataagacgt tcctggtgtg gatcaacgag gaggaccacc tgcgggttat
     3001 ctccatgcag aaagggggca acatgaagga agttttcacg cgattctgca ctggcctcac
     3061 tcaggtctgg cctggtgcta aggagggagg gggtcggggg gatttgagtc cacatttccc
     3121 agggtgcttg gggttccagc tacattttcc cttaactttc ccttcacctt cagattgaaa
     3181 ctctcttcaa gtctaagaac tatgagttca tgtggaaccc tcacctgggc tacatcctca
     3241 cgtgcccatc caacctgggc actgggcttc gggcaggcgt gcacatcaag ctgccccacc
     3301 tgggaaagca cgagaagttc tcggaggtgc tcaagcgact gcggcttcag aagcgaggca
     3361 caggtgaggg gcagacgact caggtgaagc agcaaacgcg gctttcccca gccctgctga
     3421 cctgcctgtc tcccatcccc aggtggtgtg gacaccgctg ctgtgggtgg agtttttgat
     3481 gtctccaacg ctgaccgcct gggcttctcg gaggtggagc tggtgcagat ggtggtggac
     3541 ggagtgaagc tactcattga gatggagcag cggcttgagc agggtcagcc cattgacgac
     3601 ctcatgcctg cccagaagtg aagcctggcc ctagccacca ccaggctgcc gcttcctaac
     3661 ttattacccg ggcagtgccc gccatgcatc cttgatgttt gccgcctggc gtgagccctt
     3721 agcctcgctg tagagacttc tgtcgccctg ggtagagttt attttttctg atggctaagc
     3781 tgttgcagac actgaaataa attagggttt ggcctgccct atgtctgagt gttgcttctc
     3841 ctttcttaga gacagtcgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg
     3901 tgtgtgtgtc catccgtcct tcctggcctg tcacttcacc tagcagatga aacatgagtc
     3961 ttaagataca gggcagaggg agggagactt gagtgcagcc tgaacatcta aggacatttg
     4021 tacttgcagg gtggagccta gtatttcctt agctccagcc agctaaaggg agtcagcaac
     4081 ttgacatctt ccccagtttc tttctgccgc attcctggca gcctgaatat ctaaagttct
     4141 atctcttcac ttccttcccc tccaggcttc aacacttact ccagttctta aataattctc
     4201 tccctacttc ctccagcttc caaccccagg ttgagtgcct cctcctgatc cctcagccca
     4261 gctctccagc cccatccaga caagagcttt tcagaaccac ccgtccctct ctcttgcagc
     4321 ttcctctctg gggccctcct ccttgggagg agacaagctt
//
LOCUS       RATTHY        797 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Rat prothymosin-alpha mRNA, complete cds.
ACCESSION   M20035
KEYWORDS    prothymosin.
SOURCE      Rat (strain Charles River CD, 5 week old) spleen, cDNA to mRNA,
            clone pRSpro-187.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 797)
  AUTHORS   Frangou-Lazaridis,M., Clinton,M., Goodall,G.J. and Horecker,B.L.
  TITLE     Prothymosin alpha and parathymosin: Amino acid sequences deduced
            from the cloned rat spleen cDNAs
  JOURNAL   Arch. Biochem. Biophys. 263, 305-310 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed copy of sequence for [1] kindly provided by
            B.L.Horecker, 26-APR-1988.
FEATURES       from  to/span     description
    pept        174      512     prothymosin precursor
    matp        177      509     prothymosin
BASE COUNT      248 a    187 c    219 g    143 t
ORIGIN      Unreported.
        1 gccatctttg catttgttcc cgggtcgtgc tccgcgctca catgcaagcc aacctatcgg
       61 ccgacccacc ggtcatcctc caaacgcgga cttccggcta gcttatctcg gccaagagtc
      121 ctcgaaactc gacatattct tacgcgtaga ccagaccacc ggcgtgcccc accatgtcag
      181 acgcggcagt ggacaccagc tccgagatca ccaccaagga cttgaaggag aagaaggaag
      241 ttgtggagga ggcagagaat ggaagagacg cacctgccaa tgggaacgct caaaatgagg
      301 aaaatgggga gcaggaggct gacaatgagg tagatgaaga agaggaagaa ggtggggagg
      361 aagaggagga ggaggaagaa ggtgatggtg aggaagaaga tggagatgaa gatgaggaag
      421 ctgaggctcc tacgggcaag cgggtagctg aggatgatga ggatgatgat gttgagacca
      481 agaagcagaa gaagactgat gaggatgact agacagcaaa aggaaaagct aaccttacgc
      541 accgtgacct attcaccctc cacttcccgt ctcagaattt aaacgtggtc accttcgagt
      601 agagaagcag gcccccgccg cccactccag gtgcacccac acatgacatg cgctcttcac
      661 acccacaaaa cacaacatga ttgcacatgg aggagaaaga acagaactcc aggcctcttt
      721 ttttcttaaa aatctttaaa ggaattgttg attttattaa ctaagcaaat tgaggcagca
      781 ttaataaatt cgacaca
//
LOCUS       RATTHYP       378 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Rat spleen parathymosin-alpha mRNA, 3' end.
ACCESSION   M20616
KEYWORDS    parathymosin.
SOURCE      Rat (strain Charles River CD, 5 week old) spleen, cDNA to mRNA,
            clone pRSpara-5187.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 378)
  AUTHORS   Frangou-Lazaridis,M., Clinton,M., Goodall,G.J. and Horecker,B.L.
  TITLE     Prothymosin-alpha and parathymosin: Amino acid sequences deduced
            from the cloned rat spleen cDNAs-1
  JOURNAL   Arch. Biochem. Biophys. 263, 305-310 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed copy of sequence for [1] kindly provided by
            B.L.Horecker, 26-APR-1988.
FEATURES       from  to/span     description
    pept    <     1      245     parathymosin precursor (AA at 3)
BASE COUNT      111 a     69 c    144 g     54 t
ORIGIN      Unreported.
        1 acaaggtgga ggagaaggct ggccggaaag aacggaagaa agaagtagtg gaggaggagg
       61 agaatggagc tgaggaggag gaagaagaaa ctgctgagga tggagaggat gatgatgaag
      121 gagacgaaga agatgaggag gaagaggagg aggaggatga aggccccgtg cggaagagaa
      181 ctgctgaaga ggaggatgaa gcggatccca agaggcagaa gacagaaaac ggggcgtcgg
      241 cttgagcccc tgcccgtggc ttggggatgg gaggcccctc aggtctggag gtgggcagga
      301 acacacaatc agccccttct ctgctcctgc tctgcctgcc agagcttgac tgcttgacag
      361 cttattcatt tcagcatc
//
LOCUS       STYNUSAINF    228 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  S.typhimurium nusA-infB operon promoter region.
ACCESSION   M28868
KEYWORDS    .
SOURCE      S.typhimurium (strain LT2) DNA, clone 40.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 228)
  AUTHORS   Granston,A.E., Tompson,D.L. and Friedman,D.I.
  TITLE     Identification of a second promoter for the metY-nusA-infB operon
            of Escherichia coli
  JOURNAL   J. Bacteriol. 172, 2336-2342 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.E.Granston, 27-SEP-1989.
FEATURES       from  to/span     description
    signal      159      164     -35 region promoter P1
    signal      183      188     -10 region promoter P1
BASE COUNT       69 a     39 c     54 g     66 t
ORIGIN      
        1 aaaataaatg agttttcatt taatcatctt ttagcagaga caggaagcgt ttagtgaagg
       61 tttactgaaa taacgttgtg acatgcgtgt ataagccgga gaggtgggat gagtataacg
      121 aagagtgata ttcattcatc tgcatattgg gaatagggtt gacataactc atgccaggtc
      181 actatactgt gcgcagattt tacgtcccgt cctcggtacc aaatccca
//
LOCUS       SYNAA113M    9186 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Sequencing vector pAA113M DNA.
ACCESSION   M20189
KEYWORDS    cloning vector; synthetic DNA.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 9186)
  AUTHORS   Ahmed,A.
  TITLE     A vector for sequencing long (40-kb) DNA fragments
  JOURNAL   Gene 75, 315-321 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            provided by A.Ahmed, 28-SEP-1988.  pAA113M  is a cosmid vector
            specially developed to sequence long (40kb) DNA fragments.  It
            makes use of the cos site to clone long segments, IS1 to subdivide
            the cloned fragment by deletions, galKT to select such deletions,
            and some unique restriction sites to further shorten the fragment
            for supercoil or M13 sequencing.  It contains a normal plasmid ori
            of replication and also the M13 ori of replication to obtain DNA in
            the double-stranded or single- stranded form.
FEATURES       from  to/span     description
    pept       3787     4977     tet protein
    ORF        7433     8293     amp protein
BASE COUNT     2051 a   2446 c   2453 g   2236 t
ORIGIN      Unreported.
        1 caattactgc aatgccctcg taattaagtg aatttacaat atcgtcctgt tcggagggaa
       61 gaacgcggga tgttcattct tcatcacttt taattgatgt atatgctctc ttttctgacg
      121 ttagtctccg acggcaggct tcaatgaccc aggctgagaa attcccggac cctttttgct
      181 caagagcgat gttaatttgt tcaatcattt ggttaggaaa gcggatgttg cgggttgttg
      241 ttctgcgggt tctgttcttc gttgacatga ggttgccccg tattcagtgt cgctgatttg
      301 tattgtctga agttgttttt acgttaagtt gatgcagatc aattaatacg atacctgcgt
      361 cataattgat tatttgacgt ggtttgatgg cctccacgca cgttgtgata tgtagatgat
      421 aatcattatc actttacggg tcctttccgg tgatccgaca ggttacgggg cggcgacctc
      481 gcgggttttc gctatttatg aaaattttcc ggtttaaggc gtttccgttc ttcttcgtca
      541 taacttaatg tttttattta aaataccctc tgaaaagaaa ggaaacgaca ggtgctgaaa
      601 gcgaggcttt ttggcctctg tcgtttcctt tctctgtttt tgtccgtgga atgaacaatg
      661 gaagtcaaca aaaagcagct ggctgacatt ttcggtgcga gtatccgtac cattcagaac
      721 tggcaggaac agggaatgcc cgttctgcga ggcggtggca agggtaatga ggtgctttat
      781 gactctgccg ccgtcataaa atggtatgcc gaaagggatg ctgaaattga gaacgaaaag
      841 ctgcgccggg aggttgaaga actgcggcag gccagcgagg cagatcaaca gtcggtacgg
      901 ctgaccatcg ggtgccagtg cgggagtttc gttcagcact gtcctgctcc ttgtgatggt
      961 ttacaaacgt aaaaagtctc tttaatacct gtttttgctt catattgttc agcgacacgt
     1021 tgctgtacgg caggcaccag ctcttccggg atcagcgcga cgatacagcc gccaaatccg
     1081 ccgccggtca tgcgtacgcc acctttgtcg ccaatcacag ctttgacgat ttctaccaga
     1141 gtgtcaattt gcggcacggt gatttcgaaa tcatcgcgca tagaggcatg agactccgcc
     1201 atcaactcgc ccatacgttt caggtcgcct tgctccagcg cgctggcagc ttcaacggtg
     1261 cgggcgtttt cagtcagtat atgacgcacg cgttttgcca cgatcgggtc cagttcatgc
     1321 gcaacagcgt tgaactcttc aatggtgaca tcacgcaggg ctggctgctg gaagaaacgc
     1381 gcaccggttt cgcactgttc acgacgggtg ttgtattcgc tgccaaccag ggtacgtttg
     1441 aagttactgt tgatgatgac gacagccaca cctttgggca tggaaactgc tttggtcccc
     1501 agtgagcggc aatcgatcag caaggcatga tctttcttgc cgagcgcgga aattagctga
     1561 tccatgatcc cgcagttaca gcctacaaac tggttttctg cttcctgacc gttaagcgcg
     1621 atttgtgcgc cgtccagcgg cagatgataa agctgctgca atacggttcc gaccgcgact
     1681 tccagtgaag cggaagaact taacccggca ccctgcggca cattgccgtg atcaaccatg
     1741 tccacgccgc cgaagctgtt gttacgcagt tgcagatgtt tcaccacgcc acgaacgtag
     1801 ttagcccatt gatagttttc atgtgcgaca atgggcgcat cgagggaaaa ctcgtcgagc
     1861 tgattttcat aatcggctgc catcacgcga actttacggt catcgcgtgg tgcacaactg
     1921 atcacggttt gataatcaat cgcgcagggc agaacgaaac cgtcgttgta gtcggtgtgt
     1981 tcaccaatca aattcacgcg gccaggcgcc tgaatggtgt gagtggcagg gtagccaaat
     2041 gcgttggcaa acagagattg tgttttttct ttcagactca tttcttacac tccggattcg
     2101 cgaaaatgga tatcgctgac tgcgcgcaaa cgctctgctg cctgttctgc ggtcaggtct
     2161 cgctgggtct ctgccagcat ttcataacca accataaatt tacgtacggt ggcggacggc
     2221 agcagaggcg gataaaagtg cgcgtgcagc tgccagtgtt gattctcttc gccattaaat
     2281 ggcgcgccgt gccagcccat agagtagggg aaggagcact ggaagaggtt gtcataacga
     2341 ctggtcagct ttttcaacgc cagcgccaga tcgctgcgct gggcgtcggt caaatcggtg
     2401 atccgtaaaa cgtgggcttt gggcagcagt agcgtttcga acggccaggc agcccagtaa
     2461 ggcacgacgg ctaaccagtg ttcggtttcg acaacggtac ggctaccgtc tgccagctcg
     2521 cgctgaacat aatccaccag cattggtgat ttctgttcgg caaaatattc tttttgcagg
     2581 cggtcttcgc gctcagcttc gttaggcagg aagctatttg cccaaatctg accgtgcgga
     2641 tgcgggttag agcagcccat cgccgcgcct ttgttttcaa aaacctgcac ccatgggtac
     2701 gttttcccca gttctgcggt ttgctcctgc caggttttga cgatttccgt caatgctgca
     2761 acgctgagct ctggcagcgt tttactgtga tccggtgaaa agcagatcac ccggctggtg
     2821 ccgcgcgcgc tctggcaacg catcagcgga tcgtgacttt ctggcgcatc tggcgtgtca
     2881 gacatcaaag ccgcaaagtc attagtgaaa acgtaagtcc cggtgtaatc ggggttttta
     2941 tcgcctgtca cccgcacatt acctgcgcag aggaagcaat ctggatcgtg cgcaggtaac
     3001 acctgtttgg ctggcgtttc ctgcgccccc tgccagggcg ttagccgggt gcggtgaaac
     3061 cagaatccat tgcccggtga gcggttgtag cggcgatgtg gatgatcaac gggattaaat
     3121 tgcgtcatgg tcgttcctta atcgggatat ccctgtggat ggcgtgactg ccagtgccag
     3181 gtgtcctgcg ccatttcatc gagtgtgcgc gttacgcagt tcagttcacg gtcggctttg
     3241 ctggcgtccg ccagtaggcc ggaaggtcgc cctcgcgagc ggtgcaaaat gataattaac
     3301 cggtttgccg cacgtttgct gaaggcatta accacgtcca gcacgctgtt gcctacgcca
     3361 gcgccgaggt tgtagatgtg tacgcctggc ttgttcgcca gtttttccat cgccacgacg
     3421 tgaccgtccg ccagatccat tacgtggatg taatcgcgta cgccagtacc atcttcggtc
     3481 ggataatcgt taccaaaaat cgccagcgag tcgcgacggc ctacagcaac ctgggcgatg
     3541 tatggcatca ggttattcgg aatgccttgc ggatcttcgc ccatatcgcc cgacggatgc
     3601 gcgccaaccg ggttgaagta gcgcacgagg gcaatgctcc agtccggctg ggctttttgc
     3661 agatcggtga ggatctgttc caccatcagc ttgcttttgc cgtaagggct ttgcggtgtg
     3721 ccggtcggga agctataatg cggtagttta tcacagttaa attgctaacg cagtcaggca
     3781 ccgtgtatga aatctaacaa tgcgctcatc gtcatcctcg gcaccgtcac cctggatgct
     3841 gtaggcatag gcttggttat gccggtactg ccgggcctct tgcgggatat cgtccattcc
     3901 gacagcatcg ccagtcacta tggcgtgctg ctagcgctat atgcgttgat gcaatttcta
     3961 tgcgcacccg ttctcggagc actgtccgac cgctttggcc gccgcccagt cctgctcgct
     4021 tcgctacttg gagccactat cgactacgcg atcatggcga ccacacccgt cctgtggatc
     4081 ctctacgccg gacgcatcgt ggccggcatc accggcgcca caggtgcggt tgctggcgcc
     4141 tatatcgccg acatcaccga tggggaagat cgggctcgcc acttcgggct catgagcgct
     4201 tgtttcggcg tgggtatggt ggcaggcccc gtggccgggg gactgttggg cgccatctcc
     4261 ttgcatgcac cattccttgc ggcggcggtg ctcaacggcc tcaacctact actgggctgc
     4321 ttcctaatgc aggagtcgca taagggagag cgtcgaccga tgcccttgag agccttcaac
     4381 ccagtcagct ccttccggtg ggcgcggggc atgactatcg tcgccgcact tatgactgtc
     4441 ttctttatca tgcaactcgt aggacaggtg ccggcagcgc tctgggtcat tttcggcgag
     4501 gaccgctttc gctggagcgc gacgatgatc ggcctgtcgc ttgcggtatt cggaatcttg
     4561 cacgccctcg ctcaagcctt cgtcactggt cccgccacca aacgtttcgg cgagaagcag
     4621 gccattatcg ccggcatggc ggccgacgcg ctgggctacg tcttgctggc gttcgcgacg
     4681 cgaggctgga tggccttccc cattatgatt cttctcgctt ccggcggcat cgggatgccc
     4741 gcgttgcagg ccatgctgtc caggcaggta gatgacgacc atcagggaca gcttcaagga
     4801 tcgctcgcgg ctcttaccag cctaacttcg atcattggac cgctgatcgt cacggcgatt
     4861 tatgccgcct cggcgagcac atggaacggg ttggcatgga ttgtaggcgc cgccctatac
     4921 cttgtctgcc tccccgcgtt gcgtcgcggt gcatggagcc gggccacctc gacctgaatg
     4981 gaagccggcg gcacctcgct aacggattca ccactccaag aattggagcc aatcaattct
     5041 tgcggagaac tgtgaatgcg caaaccaacc cttggcagaa catatccatc gcgtccgcca
     5101 tctccagcag ccgcacgcgg cgcatctcgg gcagcgttgg gtcctggcca cgggtgcgca
     5161 tgatcgtgct cctgtcgttg aggacccggc taggctggcg gggttgcctt actggttagc
     5221 agaatgaatc accgatacgc gagcgaacgt gaagcgactg ctgctgcaaa acgtctgcga
     5281 cctgagcaac aacatgaatg gtcttcggtt tccgtgtttc gtaaagtctg gaaacgcgga
     5341 agtcagcgcc ctgcaccatt atgttccgga tctgcatcgc aggatgctgc tggctaccct
     5401 gtggaacacc tacatctgta ttaacgaagc gctggcattg accctgagtg atttttctct
     5461 ggtcccgccg catccatacc gccagttgtt taccctcaca acgttccagt aaccgggcat
     5521 gttcatcatc agtaacccgt atcgtgagca tcctctctcg tttcatcggt atcattaccc
     5581 ccatgaacag aaattccccc ttacacggag gcatcaagtg accaaacagg aaaaaaccgc
     5641 ccttaacatg gcccgcttta tcagaagcca gacattaacg cttctggaga aactcaacga
     5701 gctggacgcg gatgaacagg cagacatctg tgaatcgctt cacgaccacg ctgatgagct
     5761 ttaccgcagc tgcctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct
     5821 cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg
     5881 cgcgtcagcg ggtgttggcg ggtgtcgggg cgcagccatg acccagtcac gtagcgatag
     5941 cggagtgtat actggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat
     6001 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcgctcttcc
     6061 gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct
     6121 cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg
     6181 tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc
     6241 cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga
     6301 aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct
     6361 cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg
     6421 gcgctttctc aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag
     6481 ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat
     6541 cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac
     6601 aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac
     6661 tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc
     6721 ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt
     6781 tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc
     6841 ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg
     6901 agattatcaa aaaggatctt cacctagatc cttttaaatt gtaaacgtta atattttgtt
     6961 aaaattcgcg ttaaattttt gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg
     7021 caaaatccct tataaatcaa aagaatagcc cgagataggg ttgagtgttg ttccagtttg
     7081 gaacaagagt ccactattaa agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta
     7141 tcagggcgat ggcccactac gtgaaccatc acccaaatca agttttttgg ggtcgaggtg
     7201 ccgtaaagca ctaaatcgga accctaaagg gagcccccga tttagagctt gacggggaaa
     7261 gccggcgaac gtggcgagaa aggaagggaa gaaagcgaaa ggagcgggcg ctagggcgct
     7321 ggcaagtgta gcggtcacgc tgcgcgtaac caccacaccc gccgcgctta atgcgccgct
     7381 acagggcgcg taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat
     7441 gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct
     7501 gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg
     7561 caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag
     7621 ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta
     7681 attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg
     7741 ccattgctgc aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg
     7801 gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct
     7861 ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta
     7921 tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg
     7981 gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc
     8041 cggcgtcaac acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg
     8101 gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga
     8161 tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg
     8221 ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat
     8281 gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc
     8341 tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca
     8401 catttccccg aaaagtgcgg taatgactcc aacttattga tagtgtttta tgttcagata
     8461 atgcccgatg actttgtcat gcagctccac cgattttgag aacgacagcg acttccgtcc
     8521 cagccgtgcc aggtgctgcc tcagattcag gttatgccgc tcaattcgct gcgtatatcg
     8581 cttgctgatt acgtgcagct ttcccttcag gcgggattca tacagcggcc agccatccgt
     8641 catccatatc accacgtcaa agggtgacag caggctcata agacgcccca gcgtcgccat
     8701 agtgcgttca ccgaatacgt gcgcaacaac cgtcttccgg agactgtcat acgcgtaaaa
     8761 cagccagcgc tggcgcgatt tagccccgac atagccccac tgttcgtcca tttccgcgca
     8821 gacgatgacg tcactgcccg gctgtatgcg cgaggttacc gactgcggcc tgagtttttt
     8881 aagtgacgta aaatcgtgtt gaggccaacg cccataatgc gggctgttgc ccggcatcca
     8941 acgccattca tggccatatc aatgattttc tggtgcgtac cgggttgaga agcggtgtaa
     9001 gtgaactgca gttgccatgt tttacggcag tgagagcaga gatagcgctg atgtccggcg
     9061 gtgcttttgc cgttacgcac caccccgtca gtagctgaac aggagggaca gctgatagaa
     9121 acagaagcca ctggagcacc tcaaaaacac catcatacac taaatcagta agttggcagc
     9181 atcacc
//
LOCUS       SYNHUMUBA     246 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Synthetic human ubiquitin gene (Thr14->Cys), complete cds.
ACCESSION   M21581
KEYWORDS    ubiquitin.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (sites)
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  TITLE     Ubiquitin function studied by disulfide engineering
  JOURNAL   J. Biol. Chem. 264, 1887-1893 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 246; for [1])
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  JOURNAL   Unpublished (1989) See comment for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by D.Ecker, 01-DEC-1988.
            
            Author address [1] Smith Kline and French Laboratories P.O. Box
            1539 King of Prussia, PA 19046
FEATURES       from  to/span     description
    pept         10      240     synthetic ubiquitin
BASE COUNT       81 a     51 c     54 g     60 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcatta tgcagatctt cgtcaagacg ttaaccggta aaaccatatg tctagaagtt
       61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
      121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
      181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga
      241 ggtacc
//
LOCUS       SYNHUMUBB     246 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Synthetic human ubiquitin gene (Phe4->Cys), complete cds.
ACCESSION   M22214
KEYWORDS    ubiquitin.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (sites)
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  TITLE     Ubiquitin function studied by disulfide engineering
  JOURNAL   J. Biol. Chem. 264, 1887-1893 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 246; for [1])
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  JOURNAL   Unpublished (1989) See comment for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by D.Ecker, 01-DEC-1988.
            
            Author address [1] Smith Kline and French Laboratories P.O. Box
            1539 King of Prussia, PA 19046
FEATURES       from  to/span     description
    pept         10      240     synthetic ubiquitin
BASE COUNT       82 a     51 c     54 g     59 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccataac tctagaagtt
       61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
      121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
      181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga
      241 ggtacc
//
LOCUS       SYNHUMUBC     246 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Synthetic human ubiquitin gene (Phe4->Cys, Thr14->Cys), complete
            cds.
ACCESSION   M22215
KEYWORDS    ubiquitin.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (sites)
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  TITLE     Ubiquitin function studied by disulfide engineering
  JOURNAL   J. Biol. Chem. 264, 1887-1893 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 246; for [1])
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  JOURNAL   Unpublished (1989) See comment for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by D.Ecker, 01-DEC-1988.
            
            Author address [1] Smith Kline and French Laboratories P.O. Box
            1539 King of Prussia, PA 19046
FEATURES       from  to/span     description
    pept         10      240     synthetic ubiquitin
BASE COUNT       81 a     50 c     55 g     60 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccatatg tctagaagtt
       61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
      121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
      181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga
      241 ggtacc
//
LOCUS       SYNHUMUBD     246 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Synthetic human ubiquitin gene (Phe4->Cys, Thr14->Cys, Thr66->Cys),
            complete cds.
ACCESSION   M22216
KEYWORDS    ubiquitin.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (sites)
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  TITLE     Ubiquitin function studied by disulfide engineering
  JOURNAL   J. Biol. Chem. 264, 1887-1893 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 246; for [1])
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  JOURNAL   Unpublished (1989) See comment for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by D.Ecker, 01-DEC-1988.
            
            Author address [1] Smith Kline and French Laboratories P.O. Box
            1539 King of Prussia, PA 19046
FEATURES       from  to/span     description
    pept         10      240     synthetic ubiquitin
BASE COUNT       80 a     48 c     56 g     62 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccatatg tctagaagtt
       61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
      121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
      181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga
      241 ggtacc
//
LOCUS       SYNHUMUBE     246 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Synthetic human ubiquitin gene (Phe4->Cys, Thr66->Cys), complete
            cds.
ACCESSION   M22217
KEYWORDS    ubiquitin.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (sites)
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  TITLE     Ubiquitin function studied by disulfide engineering
  JOURNAL   J. Biol. Chem. 264, 1887-1893 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 246; for [1])
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  JOURNAL   Unpublished (1989) See comment for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by D.Ecker, 01-DEC-1988.
            
            Author address [1] Smith Kline and French Laboratories P.O. Box
            1539 King of Prussia, PA 19046
FEATURES       from  to/span     description
    pept         10      240     synthetic ubiquitin
BASE COUNT       81 a     49 c     55 g     61 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccataac tctagaagtt
       61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
      121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
      181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga
      241 ggtacc
//
LOCUS       SYNHUMUBF     246 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Synthetic human ubiquitin gene (Thr66->Cys), complete cds.
ACCESSION   M22218
KEYWORDS    ubiquitin.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (sites)
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  TITLE     Ubiquitin function studied by disulfide engineering
  JOURNAL   J. Biol. Chem. 264, 1887-1893 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 246; for [1])
  AUTHORS   Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A.,
            Dixon,J.S., Weber,P.L. and Crooke,S.T.
  JOURNAL   Unpublished (1989) See comment for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by D.Ecker, 01-DEC-1988.
            
            Author address [1] Smith Kline and French Laboratories P.O. Box
            1539 King of Prussia, PA 19046
FEATURES       from  to/span     description
    pept         10      240     synthetic ubiquitin
BASE COUNT       81 a     50 c     54 g     61 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcatta tgcagatctt cgtcaagacg ttaaccggta aaaccataac tctagaagtt
       61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca
      121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct
      181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga
      241 ggtacc
//
LOCUS       XELC3        1087 bp ss-mRNA            VRT       24-JUL-1990
DEFINITION  X.laevis complement component 3 (C3) mRNA, 3' end.
ACCESSION   M21790 J04493
KEYWORDS    complement component 3; serum protein.
SOURCE      X.laevis (adult) liver, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1087)
  AUTHORS   Grossberger,D., Marcuz,A., Du Pasquier,L. and Lambris,J.D.
  TITLE     Conservation of structural and functional domains in complement
            component C3 of Xenopus laevis and mammals
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1323-1327 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            D.Grossberger, 14-DEC-1988.
FEATURES       from  to/span     description
    pept    <     1      969     complement component 3 (AA at 1)
BASE COUNT      327 a    211 c    258 g    291 t
ORIGIN      Unreported.
        1 gggagggtgg tgaccgtgta ccagtctctt gtcatagaga aagaaagaaa gtgccttaac
       61 tttgaactgt cttgtgaatg taaaggaagt acaacttgca agacctcccg aagtgcaaag
      121 gcaacagtgt ctatagaagc ctgtgcaagg catctcaaga acgttgatgc caccatgtcc
      181 attattgata tttccatgat gactggcttt tctcctgata ccgattccct ggataggcta
      241 atgaagggag tggacaaata catctctaaa tatgaagtca acaaaggagc aaatgacaag
      301 ggaacactta ttctctactt ggacaaagtc tcccacatag acgaagaatg tgtgaagttt
      361 tatgctcatc aatactttga agtgggtttc atccagccag cttctgtaac tgtgtatgac
      421 tattatactc cagataatcg ttgcactaaa ttttaccatg tggaagaagg cagtgccttg
      481 ctggcaggga tttgccaagg ggatatatgc cgatgtgcag aagagaactg tttcatgcag
      541 cagcaaattg agggtaaaat cactgctgac atgagagtca acatggcttg gctcccggag
      601 tggaattttg tgtacaaggc tactctcacg gagttgcagc ccagcgacaa ctatgacaac
      661 tatgtaatga caattaagaa ggtcattaag caaggcacag atgaggatcc tgaggacaag
      721 acacgcaatt ttatcagcca tatcaaatgc cgaaaagctt taaatatgca gctgaaccga
      781 gattatctga tttggggggt aactggtgac ctctggcgcc acgatggata ttcctacatc
      841 attgggaagg acacatggat ggagtggtgg cccaatgaga gggagtgtca acaacgtgag
      901 aaccaggatc tctgcgatga ctttgagaca gtttctgaca acctggagat tgttggttgt
      961 cccaactgag gaacctaaat tcagaccaaa atgtttccat cttcccaagc tcatctgtga
     1021 tgtttaattt tgtaaatgtt gttcagtgtt ttatatacag ttccaataaa gcatttattt
     1081 gcttaac
//
LOCUS       YSCKEX2A     4106 bp ds-DNA             PLN       24-JUL-1990
DEFINITION  S.cerevisiae prohormone processing enzyme (KEX2) gene, complete
            cds.
ACCESSION   M24201
KEYWORDS    Ca-dependent serine protease; furin-related protein;
            processing enzyme; protease; subtilism-related protease.
SOURCE      S.cerevisiae (strain DBY939) DNA, clone YEp24-pJ2B.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 4106)
  AUTHORS   Fuller,R.S., Brake,A. and Thorner,J.
  TITLE     Yeast prohormone processing enzyme (KEX2 gene product) is a Ca2+
            -dependent serine protease
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1434-1438 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Brake, 09-MAY-1989
FEATURES       from  to/span     description
    pept       1329     3773     prohormone processing enzyme (KEX2)
BASE COUNT     1327 a    803 c    827 g   1149 t
ORIGIN      Chromosome 14, left arm.
        1 gaattctctg ttgactacta aactgagaga atttgccgag actctaagaa cagctttgaa
       61 agagcgttct gccgatgatt ccataattgt cactctgaga gagcaaatgc aaagagaaat
      121 cttcaggttg atgtcgttgt tcatggacat acctccagtg caaccaaacg agcaattcac
      181 ttgggaatac gttgacaaag acaagaaaat ccacactatc aaatcgactc cgttagaatt
      241 tgcctccaaa tacgcaaaat tggacccttc cacgccagtc tcattgatca atgatccaag
      301 acaccatatg gtaaattaat taagatcgat cgtttaggaa acgtccttgg cggagatgcc
      361 gtgatttact taaatgttga caatgaaaca ctatctaaat tggttgttaa gagattacaa
      421 aataacaaag ctgtcttttt tggatctcac actccaaagt tcatggacaa gaaaactggt
      481 gtcatggata ttgaattgtg gaactatcct gccatggcta taatttacct cagcaaaagg
      541 catccggtat tagataccat gaaagtttga tgactcatgc tatgttggat cactggctgc
      601 cacgtcgatg aaacgtctaa attaccactt cgctaccgtc tgaaaattcc tggggtaaag
      661 actccggtaa agacggatta tacgtgatga ctcaaaagta cttcgaggag tactgctttc
      721 aaattgtggt cgatatcaat gaattgccaa aagagctggc ttcaaaattc acctcaggta
      781 aggaagagcc gattgtcttg cccatctgga cccaatggtg ctttggccaa ataaatagtt
      841 tcagcagctc tgatgtagat acacgtatct cgacatgttt tatttttact atacatacat
      901 aaaagaaata aaaaatgata acgtgtatat tattattcat ataatcaatg agggtcattt
      961 tctgaaacgc aaaaaacggt aaatggaaaa aaaataaaga tagaaaaaga aaacaaacaa
     1021 aggaaaggtt agcatattaa ataactgagc tgatacttca acagcatcgc tgaagagaac
     1081 agtattgaaa ccgaaacatt ttctaaaggc aaacaaggta ctccatattt gctggacgtg
     1141 ttctttctct cgtttcatat gcataattct gtcataagcc tgttcttttt cctggcttaa
     1201 acatcccgtt ttgtaaaaga gaaatctatt ccacatattt cattcattcg gctaccatac
     1261 taaggataaa ctaatcccgt tgttttttgg cctcgtcaca taattataaa ctactaaccc
     1321 attatcagat gaaagtgagg aaatatatta ctttatgctt ttggtgggcc ttttcaacat
     1381 ccgctcttgt atcatcacaa caaattccat tgaaggacca tacgtcacga cagtattttg
     1441 ctgtagaaag caatgaaaca ttatcccgct tggaggaaat gcatccaaat tggaaatatg
     1501 aacatgatgt tcgagggcta ccaaaccatt atgttttttc aaaagagttg ctaaaattgg
     1561 gcaaaagatc atcattagaa gagttacagg gggataacaa cgaccacata ttatctgtcc
     1621 atgatttatt cccgcgtaac gacctattta agagactacc ggtgcctgct ccaccaatgg
     1681 actcaagctt gttaccggta aaagaagctg aggataaact cagcataaat gatccgcttt
     1741 ttgagaggca gtggcacttg gtcaatccaa gttttcctgg cagtgatata aatgttcttg
     1801 atctgtggta caataatatt acaggcgcag gggtcgtggc tgccattgtt gatgatggcc
     1861 ttgactacga aaatgaagac ttgaaggata atttttgcgc tgaaggttct tgggatttca
     1921 acgacaatac caatttacct aaaccaagat tatctgatga ctaccatggt acgagatgtg
     1981 caggtgaaat agctgccaaa aaaggtaaca atttttgcgg tgtcggggta ggttacaacg
     2041 ctaaaatctc aggcataaga atcttatccg gtgatatcac tacggaagat gaagctgcgt
     2101 ccttgattta tggtctagac gtaaacgata tatattcatg ctcatggggt cccgctgatg
     2161 acggaagaca tttacaaggc cctagtgacc tggtgaaaaa ggctttagta aaaggtgtta
     2221 ctgagggaag agattccaaa ggagcgattt acgtttttgc cagtggaaat ggtggaactc
     2281 gtggtgataa ttgcaattac gacggctata ctaattccat atattctatt actattgggg
     2341 ctattgatca caaagatcta catcctcctt attccgaagg ttgttccgcc gtcatggcag
     2401 tcacgtattc ttcaggttca ggcgaatata ttcattcgag tgatatcaac ggcagatgca
     2461 gtaatagcca cggtggaacg tctgcggctg ctccattagc tgccggtgtt tacactttgt
     2521 tactagaagc caacccaaac ctaacttgga gagacgtaca gtatttatca atcttgtctg
     2581 cggtagggtt agaaaagaac gctgacggag attggagaga tagcgccatg gggaagaaat
     2641 actctcatcg ctatggcttt ggtaaaatcg atgcccataa gttaattgaa atgtccaaga
     2701 cctgggagaa tgttaacgca caaacctggt tttacctgcc aacattgtat gtttcccagt
     2761 ccacaaactc cacggaagag acattagaat ccgtcataac catatcagaa aaaagtcttc
     2821 aagatgctaa cttcaagaga attgagcacg tcacggtaac tgtagatatt gatacagaaa
     2881 ttaggggaac tacgactgtc gatttaatat caccagcggg gataatttca aaccttggcg
     2941 ttgtaagacc aagagatgtt tcatcagagg gattcaaaga ctggacattc atgtctgtag
     3001 cacattgggg tgagaacggc gtaggtgatt ggaaaatcaa ggttaagaca acagaaaatg
     3061 gacacaggat tgacttccac agttggaggc tgaagctctt tggggaatcc attgattcat
     3121 ctaaaacaga aactttcgtc tttggaaacg ataaagagga ggttgaacca gctgctacag
     3181 aaagtaccgt atcacaatat tctgccagtt caacttctat ttccatcagc gctacttcta
     3241 catcttctat ctcaattggt gtggaaacgt cggccattcc ccaaacgact actgcgagta
     3301 ccgatcctga ttctgatcca aacactccta aaaaactttc ctctcctagg caagccatgc
     3361 attatttttt aacaatattt ttgattggcg ccacattttt ggtgttatac ttcatgtttt
     3421 ttatgaaatc aaggagaagg atcagaaggt caagagcgga aacgtatgaa ttcgatatca
     3481 ttgatacaga ctctgagtac gattctactt tggacaatgg aacttccgga attactgagc
     3541 ccgaagaggt tgaggacttc gattttgatt tgtccgatga agaccatctt gcaagtttgt
     3601 cttcatcaga aaacggtgat gctgaacata caattgatag tgtactaaca aacgaaaatc
     3661 catttagtga ccctataaag caaaagttcc caaatgacgc caacgcagaa tctgcttcca
     3721 ataaattaca agaattacag cctgatgttc ctccatcttc cggacgatcg tgattcgata
     3781 tgtacagaaa gcttcaaatt acaaaatagc atttttttct tatagattat aatactctct
     3841 catacgtata cgtatatgtg tatatgatat ataaacaaac attaatatcc tattccttcc
     3901 gtttgaaatc cctatgatgt actttgcatt gtttgcaccc gcgaataaaa tgaaaactcc
     3961 gaaccgatat atcaagcaca taaaagggga gggtccaatt aatgcatatt taagaccaca
     4021 gctgaataac tttaaaacgg cagacaaaac aaaaaatagg tcgaataaac cttacctgcc
     4081 tagaaggaat gacagcagct aataag
//
LOCUS       YSCTASPT     2641 bp ds-DNA             PLN       24-JUL-1990
DEFINITION  S.cerevisiae mitochondrial aspartyl-tRNA synthetase (MSD1) gene,
            complete cds.
ACCESSION   M26020
KEYWORDS    aspartyl-tRNA synthetase.
SOURCE      S.cerevisiae (strain D273-10B) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2641)
  AUTHORS   Tzagoloff,A.
  JOURNAL   Unpublished (1989) Columbia U. Dept. Biol. Sci. NY NY 10027
  STANDARD  full staff_review
REFERENCE   2  (sites; for [1])
  AUTHORS   Gampel,A. and Tzagoloff,A.
  TITLE     Homology of aspartyl-and lysyl-tRNA synthetases
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 6023-6027 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Tzagoloff, 06-JUL-1989.
FEATURES       from  to/span     description
    pept        601     2577     aspartyl-tRNA synthetase
BASE COUNT      878 a    466 c    551 g    746 t
ORIGIN      
        1 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaacctg taacaatagc aataccccaa
       61 atacctaatg tagttccagc aagcaagcta aaaagtaaag caacaacata actcacccct
      121 gcatctgcag cttttgcccg ggcagcctgc tctgcctgtg ttttctttaa ttgagcagta
      181 gaccatttag cagttgcatg aatagctgca gcgtcacatc ggataataat gatggcagcc
      241 attgtagaag tgccttttgc atttctagtc tctttctcgg tctagctagt tttactacat
      301 cgcgaagata gaatcttaga tcacactgcc aatcttgtgc aagctgtggc aaaatcgacc
      361 ttctccatgc gtccatggta aatgaaatgt tttgctttct ttttcgtgct tgcaaagttt
      421 ggtagatgaa atttcagaaa gctactgccg taccaaagta caataataag gtgaaaatgg
      481 gagtaggaag taatagatgg cattgaatgc ctcatttatt cattattcat gttttcttaa
      541 cattttgatt cagttgagtg catgcttttt tgaaaagact aaacattaca agttttaatc
      601 atgttggccc gttccagggt gtgcttgcag acaatcacta gacggttggc agactttcca
      661 gaagccaatg ctattaagaa aaaatttctc tttaggaagg acacctcgac catcaaacag
      721 ttaaaaggac tgtctagcgg ccagaaaata gtactcaatg gatggataga gcagaagcca
      781 aaaagagttg ggaaaaattt gatcttcgga cttttaaggg actctaacgg tgacattatc
      841 cagttggttg ataacaaatc gttgttgaaa ggctttactt tagaggatgt ggttcaagcc
      901 gtaggtatac tctctttgaa gaggaagcta tcaaatgagg acgcagatga gtacgaagtg
      961 caattggagg atattactgt gttaaatgca tctaataaaa aaccagcgca aatgcaggat
     1021 tttaaattgt cagccatata cccacctgag ttccgctatt tgcaattgag aaatcccaaa
     1081 tatcaagatt ttttaaagaa gaggtcatct atctctaagg aaataagaaa ctccttcaac
     1141 aattttgatt ttacggaggt cgagacccca atgttattta aagctacccc agaaggcgca
     1201 agagagtttc tggttccaac aaggacaaag agatccgatg gtaaaccatc gttttatgct
     1261 ctagatcaga gccctcaaca atacaagcaa ctcttaatgg ctagtggtgt caacaaatat
     1321 tatcaaatgg caaggtgctt tagagatgaa gatttaagag cagacaggca gcctgagttt
     1381 acacaggttg atatggaaat ggcctttgct aattctgaag atgtcatgaa aatcatagaa
     1441 aagacagttt ctggggtatg gagtaaattt tccaaaaaac gaggattatt gactttagac
     1501 agtaagggta cattagtgcc tgcgaaaaag gaaaacggca cagtatctat ctttcgtatg
     1561 acctacgaac aagccatgac ctcatatggt attgacaagc cagatttgag agcgccagat
     1621 ttgaagatta tcaatttagg cgagttcaat gcctttagtc atttgaacaa aaaatttccc
     1681 gtttttgaag taattattct aagaagtgcc ttttcaaata tggaagagta caaagaacga
     1741 tggtcgtttc tgacaaataa cagtaattac aattatagag ttccaatagt gctaccaatt
     1801 gaaaatgacg aacaagctaa ttcaaattgg tttgagaatt ttcatgcaat tgccacgttt
     1861 gaaaacccac atctaataac caaatttctg aaactgaaaa aaggtgacat tgtatgcggt
     1921 tgtacgagag agccaaacca ttccattttc gagaatccta ctcccctggg aagattgaga
     1981 cagttggtgc tacaaagtga gcatgggaaa aatatctatc atgctgtcaa taaggatgtt
     2041 gcctcatgga ttgtggattt cccgttattt tctcccgtta taattgaaga taagtctggt
     2101 aaaaaagaaa agcttgcata tccggagtac gaaaaggata gactatgttc cacgcatcat
     2161 ccttttacta tggtgaagct taaagactac gaaaaattag aaaagacgcc ggaaaagtgc
     2221 ttgggtcggc attatgacct cgtagttaac ggtgtggaac ttggtggtgg ctcaacaaga
     2281 attcacgatc caagattaca agactatatt ttcgaagata tcctcaaaat agataatgca
     2341 tatgaactat ttggccattt actgaatgct tttgatatgg gaacaccgcc acacgctgga
     2401 tttgctattg gttttgatcg tatgtgcgct atgatctgtg aaactgagag tataagggac
     2461 gtaatcgcct tcccaaaaag tattaccggg gctgatttgg ttgtcaagag tccaagtgtg
     2521 atacctgaaa gcattctgga accttacaat atcaagtata gtaattcaaa aaaatgagca
     2581 gaagttgatg ttacctagat gtgagagaat cttagtgtgg ctgcagtatg taggtttttc
     2641 t
//
LOCUS       YSCTRMSD     2604 bp ds-DNA             PLN       24-JUL-1990
DEFINITION  S.cerevisiae MSD gene encoding mitochondrial aspartyl-tRNA
            synthetase, complete cds.
ACCESSION   M24418
KEYWORDS    aspartyl-tRNA synthetase.
SOURCE      S.cerevisiae (strain D273-10B) DNA, clone pG94/T1.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2604)
  AUTHORS   Gampel,A. and Tzagoloff,A.
  TITLE     Homology of aspartyl- and lysyl-tRNA synthetases
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 6023-6027 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry of sequence [1] kindly submitted by A.Tzagoloff,
            01-MAY-1989.
FEATURES       from  to/span     description
    pept        565     2541     mitochondrial aspartyl-tRNA synthetase
BASE COUNT      842 a    466 c    551 g    745 t
ORIGIN      
        1 cctgtaacaa tagcaatacc ccaaatacct aatgtagttc cagcaagcaa gctaaaaagt
       61 aaagcaacaa cataactcac ccctgcatct gcagcttttg cccgggcagc ctgctctgcc
      121 tgtgttttct ttaattgagc agtagaccat ttagcagttg catgaatagc tgcagcgtca
      181 catcggataa taatgatggc agccattgta gaagtgcctt ttgcatttct agtctctttc
      241 tcggtctagc tagttttact acatcgcgaa gatagaatct tagatcacac tgccaatctt
      301 gtgcaagctg tggcaaaatc gaccttctcc atgcgtccat ggtaaatgaa atgttttgct
      361 ttctttttcg tgcttgcaaa gtttggtaga tgaaatttca gaaagctact gccgtaccaa
      421 agtacaataa taaggtgaaa atgggagtag gaagtaatag atggcattga atgcctcatt
      481 tattcattat tcatgttttc ttaacatttt gattcagttg agtgcatgct tttttgaaaa
      541 gactaaacat tacaagtttt aatcatgttg gcccgttcca gggtgtgctt gcagacaatc
      601 actagacggt tggcagactt tccagaagcc aatgctatta agaaaaaatt tctctttagg
      661 aaggacacct cgaccatcaa acagttaaaa ggactgtcta gcggccagaa aatagtactc
      721 aatggatgga tagagcagaa gccaaaaaga gttgggaaaa atttgatctt cggactttta
      781 agggactcta acggtgacat tatccagttg gttgataaca aatcgttgtt gaaaggcttt
      841 actttagagg atgtggttca agccgtaggt atactctctt tgaagaggaa gctatcaaat
      901 gaggacgcag atgagtacga agtgcaattg gaggatatta ctgtgttaaa tgcatctaat
      961 aaaaaaccag cgcaaatgca ggattttaaa ttgtcagcca tatacccacc tgagttccgc
     1021 tatttgcaat tgagaaatcc caaatatcaa gattttttaa agaagaggtc atctatctct
     1081 aaggaaataa gaaactcctt caacaatttt gattttacgg aggtcgagac cccaatgtta
     1141 tttaaagcta ccccagaagg cgcaagagag tttctggttc caacaaggac aaagagatcc
     1201 gatggtaaac catcgtttta tgctctagat cagagccctc aacaatacaa gcaactctta
     1261 atggctagtg gtgtcaacaa atattatcaa atggcaaggt gctttagaga tgaagattta
     1321 agagcagaca ggcagcctga gtttacacag gttgatatgg aaatggcctt tgctaattct
     1381 gaagatgtca tgaaaatcat agaaaagaca gtttctgggg tatggagtaa attttccaaa
     1441 aaacgaggat tattgacttt agacagtaag ggtacattag tgcctgcgaa aaaggaaaac
     1501 ggcacagtat ctatctttcg tatgacctac gaacaagcca tgacctcata tggtattgac
     1561 aagccagatt tgagagcgcc agatttgaag attatcaatt taggcgagtt caatgccttt
     1621 agtcatttga acaaaaaatt tcccgttttt gaagtaatta ttctaagaag tgccttttca
     1681 aatatggaag agtacaaaga acgatggtcg tttctgacaa ataacagtaa ttacaattat
     1741 agagttccaa tagtgctacc aattgaaaat gacgaacaag ctaattcaaa ttggtttgag
     1801 aattttcatg caattgccac gtttgaaaac ccacatctaa taaccaaatt tctgaaactg
     1861 aaaaaaggtg acattgtatg cggttgtacg agagagccaa accattccat tttcgagaat
     1921 cctactcccc tgggaagatt gagacagttg gtgctacaaa gtgagcatgg gaaaaatatc
     1981 tatcatgctg tcaataagga tgttgcctca tggattgtgg atttcccgtt attttctccc
     2041 gttataattg aagataagtc tggtaaaaaa gaaaagcttg catatccgga gtacgaaaag
     2101 gatagactat gttccacgca tcatcctttt actatggtga agcttaaaga ctacgaaaaa
     2161 ttagaaaaga cgccggaaaa gtgcttgggt cggcattatg acctcgtagt taacggtgtg
     2221 gaacttggtg gtggctcaac aagaattcac gatccaagat tacaagacta tattttcgaa
     2281 gatatcctca aaatagataa tgcatatgaa ctatttggcc atttactgaa tgcttttgat
     2341 atgggaacac cgccacacgc tggatttgct attggttttg atcgtatgtg cgctatgatc
     2401 tgtgaaactg agagtataag ggacgtaatc gccttcccaa aaagtattac cggggctgat
     2461 ttggttgtca agagtccaag tgtgatacct gaaagcattc tggaacctta caatatcaag
     2521 tatagtaatt caaaaaaatg agcagaagtt gatgttacct agatgtgaga gaatcttagt
     2581 gtggctgcag tatgtaggtt tttc
//
LOCUS       MUSCD14A     1873 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse monocyte/granulocyte cell surface glycoprotein homologous
            (CD14) gene, complete cds.
ACCESSION   M34510
KEYWORDS    CD14 receptor protein;
            monocyte/granulocyte cell surface glycoprotein.
SOURCE      Mouse (strain BALB/c) adult liver macrophage cell line J774A.1 DNA,
            clone pMCD14/5, and cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1873)
  AUTHORS   Ferrero,E., Hsieh,C.-L., Francke,U. and Goyert,S.M.
  TITLE     CD14 is a member of the family of leucine-rich proteins and is
            encoded by a gene syntenic with multiple receptor genes
  JOURNAL   J. Immunol. 145, 331-336 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by E.M.Ferrero, 25-MAY-1990.
FEATURES       from  to/span     description
    pept        173      175     murine CD14 protein (MCD14), exon 1
                273     1370     murine CD14 protein, exon 2
    sigp        173      175     murine CD14 protein signal peptide
                273      314     murine CD14 protein signal peptide
    matp        315     1367     murine CD14 protein
    pre-msg      62     1602     murine CD14 mRNA and introns
    IVS         176      272     murine CD14 protein intron A
    signal     1433     1438     poly-A signal
    signal     1583     1588     poly-A signal
BASE COUNT      427 a    500 c    478 g    468 t
ORIGIN      Chromosome 18.
        1 aggaagggac agggtgacac cccaggatta cataaattta caggggctgc cgaattggtc
       61 gaacaagccc gtggaacctg gaagccagag aacaccatcg ctgtaaagga aagaaactga
      121 agcttttctc ggagcctatc tgggctgctc aaactttcag aatctaccga ccatggtgag
      181 tcagacagac tgtcttgggg tggaactgga gccaacctga ggaatctcag ggtcctggca
      241 ggagtctccc tgtcccccta ctttctcctc aggagcgtgt gcttggcttg ttgctgttgc
      301 ttctggtgca cgcctctccc gccccaccag agccctgcga gctagacgag gaaagttgct
      361 cctgcaactt ctcagatccg aagccagatt ggtccagcgc tttcaattgt ttgggggcgg
      421 cagatgtgga attgtacggc ggcggccgca gcctggaata ccttctaaag cgtgtggaca
      481 cggaagcaga tctggggcag ttcactgata ttatcaagtc tctgtcctta aagcggctta
      541 cggtgcgggc cgcgcggatt cctagtcgga ttctattcgg agccctgcgt gtgctcggga
      601 tttccggcct ccaggaactg actcttgaaa atctcgaggt aaccggcacc gcgccgccac
      661 cgcttctgga agccaccgga cccgatctca acatcttgaa cctccgcaac gtgtcgtggg
      721 caacaaggga tgcctggctc gcagaactgc agcagtggct aaagcctgga ctcaaggtac
      781 tgagtattgc ccaagcacac tcactcaact tttcctgcga acaggtccgc gtcttccctg
      841 ccctctccac cttagacctg tctgacaatc ctgaattggg cgagagagga ctgatctcag
      901 ccctctgtcc cctcaagttc ccgaccctcc aagttttagc gctgcgtaac gcggggatgg
      961 agacgcccag cggcgtgtgc tctgcgctgg ccgcagcaag ggtacagctg caaggactag
     1021 accttagtca caattcactg cgggatgctg caggcgctcc gagttgtgac tggcccagtc
     1081 agctaaactc gctcaatctg tctttcactg ggctgaagca ggtacctaaa gggctgccag
     1141 ccaagctcag cgtgctggat ctcagttaca acaggctgga taggaaccct agcccagatg
     1201 agctgcccca agtggggaac ctgtcactta aaggaaatcc ctttttggac tctgaatccc
     1261 actcggagaa gtttaactct ggcgtagtca ccgccggagc tccatcatcc caagcagtgg
     1321 ccttgtcagg aactctggct ttgctcctag gagatcgcct ctttgtttaa ggaacatttg
     1381 catcctcctg gtttctgagg gtcctcgtca acgaatcctc tgctttaaat ttattaaaat
     1441 cttaatccac gatgtaagga aagaaaggca gtcaagatgg ttcagtgggt aaaagccagc
     1501 aaacttgacc cctgatttta accctcagga tccacacgga aggggaaaac tcactcctga
     1561 aagttgtcca tctgtgctca caaataaata ttttttaaaa taacaatgtg tttgttggtt
     1621 ttgtttttgt ttgggttttg ttgtggtttt gtttgttttg ttttgttttt gagacagtct
     1681 ggctatgtat ccttggctgg cctcaaactc ataaagatca agatcggcct gcctctacct
     1741 ccaaatgctc tggttaaagg gatgtgcctc catgcccagt tgaagtcatc ctgaaccacg
     1801 agtccaggcc actcactctt tactaagatc tttactaagt atctgccata ggtcaggctc
     1861 agtgctaaat gag
//
LOCUS       YSTAKYAMT     563 bp ss-mRNA            PLN       24-JUL-1990
DEFINITION  Yeast sp. mitochondrial adenylate kinase (AKY2) gene, 5' end.
ACCESSION   M35511
KEYWORDS    adenylate kinase.
SOURCE      Yeast sp. (wild type strain D273-10B, ATCC 25657), cDNA to mRNA.
  ORGANISM  Yeast sp.
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE   1  (bases 1 to 563)
  AUTHORS   Oechsner,U., Magdolen,V., Zoglowek,C., Haecker,U. and Bandlow,W.
  TITLE     Yeast adenylate kinase is transcribed constitutively from a
            promoter in the short intergenic region to the histone H2A-1 gene
  JOURNAL   FEBS Lett. 242, 187-193 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        561  >   563     adenylate kinase (AKY2)
BASE COUNT      153 a    100 c     98 g    212 t
ORIGIN      
        1 taagatcggt tctggtattt taaagaaggc ggaaggaact aaactgcttc tccattgtaa
       61 ctatattaag tgtctattag acctttctgt ttttcctttt tatttttcta tatataattt
      121 attgggtata taattaacgg taacatatgt catgcatgat ataaattttt tgttgtgtct
      181 tcctgttttc tctgttgctc tgtttaggtt cattgggcac tgttgcttac ttgaacatga
      241 ttgagtagca aacacattct gtgctttgtt cactttgata gtgtgacgaa attttcacaa
      301 ccaaaggaaa aaagctcacg attgcgcgat cctgtcacgg agaactgtcc gcagcagccc
      361 gcggcttctg tccttattcg cccatttttt tttgattttc gactttttca ctctggctag
      421 ttttattacg catatttgta tatatatata cgcataaatt tctgaaatgg attacctggt
      481 ttatcagaaa cgagagcttt aataaagaaa cattaacgtt tctctggtaa agtcaccaca
      541 cagcatcaaa tataacagta atg
//
LOCUS       MLMPOLA      1680 bp ss-RNA             VRL       24-JUL-1990
DEFINITION  Moloney murine leukemia virus pol gene, partial cds.
ACCESSION   M32803
KEYWORDS    .
SOURCE      Moloney murine leukemia virus RNA, clone 3-1R.
  ORGANISM  Moloney murine leukemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine leukemia viruses.
REFERENCE   1  (bases 1 to 1680)
  AUTHORS   Martinelli,S.C. and Goff,S.P.
  TITLE     Rapid reversion of a deletion mutation in Moloney murine leukemia
            virus by recombination with a closely related endogenous provirus
  JOURNAL   Virology 174, 135-144 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      684     pol protein (AA at 1)
                688  >  1680     pol protein
    site        685      687     pol protein in-frame stop codon
BASE COUNT      469 a    467 c    435 g    309 t
ORIGIN      
        1 gcttttcccc tcgagcgccc agactgggat tacaccaccc aggcaggtag gaaccaccta
       61 gtccactatc gccagttgct cctagcgggt ctccaaaacg cgggcagaag ccccaccaat
      121 ttggccaagg taaaagggat aacacaggga cctaatgagt ctccctcagc ctttttagag
      181 agactcaagg aggcctatcg caggtacact ccttatgacc ctgaggaccc agggcaagaa
      241 accaatgtgt ctatgtcatt catctggcag tctgccccgg atatcgggcg aaagttaggg
      301 cggttagaag atttaaagag caagacctta ggagacttag ttagggaggc tgaaaagatc
      361 tttaataaac gtgaaacccc ggaagaaaga gaggaacgta tcaggagaga aacagaggaa
      421 aaagaagaac gccgtaggac agtggatgag cagaaagaga aagaaaggga ccgcagaaga
      481 catagagaga tgagcaagct cttggccact gtagttattg gtcaggaaca ggatagacag
      541 gagggagagc ggaagaggcc ccaacttgat aaggatcaat gcgcctactg caaagaaaag
      601 gggcactggg ctaaagactg tcccaagaag ccacgagggc cccgaggacc caggccccag
      661 acctccctcc tgaccttagg tgactaggga ggtcagggtc aggacccccc ccctgaaccc
      721 aggataaccc tcaaagtcgg ggggcaaccc gtcaccttcc tggtagatac tggggcccaa
      781 cactccgtgc tgacccaaaa tcctggaccc ctaagtgaca agtctgcctg ggtccaaggg
      841 gctactggag gaaagcggta tcgctggacc acggatcgca aagtgcatct agctaccggt
      901 aaggtcaccc actctttcct ccatgtacca gactgcccct atcctctgct aggaagagat
      961 ttgctgacta aactaaaagc ccaaatccac tttgagggat caggagctca ggttgtggga
     1021 ccaatgggac agcccctgca agtgctgacc ctaaacatag aagatgagta tcggctacat
     1081 gagacctcaa aagagccgga tgtttctcta gggttcacat ggctttctga ttttcctcag
     1141 gcctgggcgg aatccggggg catgggactg gcagttcgcc aagctcctct gatcatacct
     1201 ctgaaggcaa cctctacccc cgtgtccata aaacaatacc ccatgtcaca agaagccaga
     1261 ctggggatca agccccacat acagagactg ttggaccagg gaatactggt accctgccag
     1321 tccccctgga acacgcccct gctacccgtt aagaaaccag ggactaatga ttacaggcct
     1381 gtccaagatc tgagagaagt caacaagcgg gtggaagaca tccaccccac cgtgcccaac
     1441 ccttacaacc tcttgagcgg gctcccaccg tcccaccagt ggtacactgt gcttgactta
     1501 aaggatgcct ttttctgcct gagactccac cccaccagtc agcctctctt cgcctttgag
     1561 tggagagacc cagagatggg aatctcagga caattaacct ggaccagact cccacagggt
     1621 ttcaaaaaca gtcccaccct gtttgatgag gcactgcaca gagacctagc agacttccgg
//
LOCUS       RHIISRA      1350 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  Plasmid RP4 of Rhizobium transposon Tn1 transposable element ISR1
            open reading frames A, A1, A2 and A3.
ACCESSION   X06616
KEYWORDS    transposable element.
SOURCE      Plasmid RP4 transposon Tn 1 of Rhizobium sp. DNA.
  ORGANISM  Rhizobium sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 1350)
  AUTHORS   Priefer,U.B., Kalinowski,J., Rueger,B., Heumann,W. and Puehler,A.
  TITLE     ISR1, a transposable DNA sequence resident in Rhizobium class IV
            strains, shows structural characteristics of classical insertion
            elements
  JOURNAL   Plasmid 21, 120-128 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    ORF          94      360     ORF A3
    ORF         433      714     ORF A1
    ORF         787     1272     ORF A2
    ORF         900       64 (c) ORF A
BASE COUNT      258 a    415 c    470 g    207 t
ORIGIN      
        1 acggaaattg ctcatcagct cagtattgcc tgacgtgacc ccctgaaact cctccaggaa
       61 tagctagagt ccgcccaacg aaggacggac agaatgaagc gatcgaggtt cacggaagag
      121 cagatcatcg ggatcctgcg ggagcaggag gccggtgtag cgacggcgga ggtgtgccga
      181 cgccacgggg tcagctcggc gaccttttac aaatggaagg ccaagtttgg cgggctcgac
      241 gtgtcggagg ctcggcggct caaggcgctc gaggatgaga acgcccggct gaagcggatg
      301 ctggcggacg cgatgctgga caacgtggcc ctgaaggacc tgctgggaaa aaagtggtga
      361 cgcccgccgg ctatcgcgag gcggctggcc atctgcaggc cgcctacgag atgagcgaaa
      421 ggcgggcgtg ccgtgttctg ggcgtcgatc ggacgagcgt gcgctatcag gcgacgcgcc
      481 cggacgacgc gctctgcgcg accggctgaa ggccctggcc aggaacgtcg acggttcggc
      541 tatcgtcgcc tgcacgtgct gctgcggcgc gagggccatg cggtcaacag gaaacgggtc
      601 cagcggatct atcgcgacga gcaactgacg gtgcgccggc gcgcgccgca agcgagcgat
      661 gggcacgcgg cgaccgtgac ctgccgctgg cgcccaacca gcgctggtcg ctgacttcgt
      721 cgcagaccag ctgacggacg ggcggcgctt ccgcatcctg acggtgatcg ataactgtac
      781 gcgcgagtgc ttggcgctgg tggccgacac ctcgctgtcg ggcgcgcggg tcgtgcggga
      841 gttggacgcc gtcatccggc agcgagggcg gcccgacacc atcgtcagcg acaacggcac
      901 ggagtacacc tcgaacgcgg tcctggcctg ggcgacgaca ccggcgtcgg ctggcactac
      961 atcgcgccgg gcaagcccca gcagaacggc ttcaacgaaa gcttcaacgg gcggctgcgt
     1021 gacgagctgc tgaacgagac gctgttccgc tcgctgccgc cgctcgcgcc gtgcggaggc
     1081 gtggcgacgc gattacaacg agcagcgtcc gcactcgaag ctcgggtggc tgacgccgca
     1141 ggcttatgca gaagcgctca ccgcacacaa cggccggcct gctgcgtggt tgatggctgc
     1201 gcagaccggc ctcttgccaa cccagccgat cacagctcag atcaacctag gactctcgtt
     1261 atggctggat gagaaacggg ggtcacgtca tgcccgctcc acggtttata aaattcttga
     1321 agacgaaagg gcctcgtgat acgcctattt
//
LOCUS       HAMACAPT     1837 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA,
            partial cds.
ACCESSION   J05590 M22755
KEYWORDS    N-acetylglucosamine-1-phosphate transferase; glycosyl transferase;
            tunicamycin-resistance.
SOURCE      Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone
            TRG-[4,22].
  ORGANISM  Cricetulus longicaudatus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 889 to 960)
  AUTHORS   Lehrman,M.A., Zhu,X. and Khounlo,S.
  TITLE     Amplification and molecular cloning of the hamster tunicamycin-
            sensitive N-acetylglucosamine-1-phosphate transferase gene: The
            hamster and yeast enzymes share a common peptide sequence
  JOURNAL   J. Biol. Chem. 263, 19796-19803 (1988)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1837)
  AUTHORS   Zhu,X. and Lehrman,M.A.
  TITLE     Cloning, sequence and expression of a cDNA encoding hamster
            UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate
            transferase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and sequence for [1] kindly submitted by M. Lehrman,
            04-APR-1989.  Draft entry and computer-readable sequence for [1]
            kindly submitted by M.Lehrman, 12-JUN-1990.
FEATURES       from  to/span     description
    pept        148     1374     N-acetylglucosamine-1-phosphate transferase
                                 (GPT)
    binding      67       79     dolichol binding site
    binding     222      234     dolichol binding site
BASE COUNT      363 a    515 c    418 g    541 t
ORIGIN      
        1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg
       61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg
      121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg
      181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc
      241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg
      301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc
      361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc
      421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg
      481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct
      541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta
      601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc
      661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat
      721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta
      781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt
      841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga
      901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc
      961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct
     1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg
     1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc
     1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag
     1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt
     1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc
     1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc
     1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca
     1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt
     1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt
     1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc
     1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat
     1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag
     1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc
     1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt
//
LOCUS       HS4NA3ABC   10089 bp ds-DNA             VRL       24-JUL-1990
DEFINITION  Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C
            genes, complete cds.
ACCESSION   M34440
KEYWORDS    nuclear antigen.
SOURCE      Epstein-Barr virus type 2 (isolate AG876) DNA.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 10089)
  AUTHORS   Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E.,
            Rickinson,A. and Kieff,E.
  TITLE     Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their
            EBNA-3A, EBNA-3B and EBNA-3C genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.T.Sample, 23-MAY-1990.
               Kieff Laboratory
               Harvard Medical School
               75 Francis Street
               Boston, MA 02115
FEATURES       from  to/span     description
    pept        189      524     nuclear antigen EBNA-3A, exon 1
                610     3051     nuclear antigen EBNA-3A, exon 2
    pept       3237     3593     nuclear antigen EBNA-3B, exon 1
               3672     6155     nuclear antigen EBNA-3B, exon 2
    pept       6277     6636     nuclear antigen EBNA-3C, exon 1
               6711     9560     nuclear antigen EBNA-3C, exon 2
    IVS         525      609     EBNA-3A intron
    IVS        3594     3671     EBNA-3B intron
    IVS        6637     6710     EBNA-3C intron
BASE COUNT     2495 a   2837 c   2619 g   2138 t
ORIGIN      
        1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct
       61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag
      121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg
      181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag
      241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca
      301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc
      361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac
      421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc
      481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg
      541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt
      601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat
      661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg
      721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg
      781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact
      841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg
      901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg
      961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc
     1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct
     1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac
     1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat
     1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg
     1261 tacgggcaag acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca
     1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg
     1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg
     1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct
     1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa
     1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac
     1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc
     1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc
     1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg
     1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg
     1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa
     1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca
     1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca
     2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag
     2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc
     2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac
     2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg
     2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa
     2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg
     2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg
     2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc
     2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg
     2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg
     2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa
     2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg
     2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct
     2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat
     2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt
     2941 ggccagaatg tcacagacca tgaacctaga agggtggtgg tgtcagctat tgttcacatg
     3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg
     3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat
     3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc
     3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga
     3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg
     3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac
     3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag
     3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca
     3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc
     3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag
     3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc
     3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga
     3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac
     3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct
     3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata
     3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa
     3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag
     4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt
     4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga
     4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata
     4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag
     4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct
     4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg
     4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa
     4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg
     4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca
     4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag
     4621 ccagaacaga gcaaccaaga gccaagcctg attcccctgc ccccacagtg gtccttcggc
     4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg
     4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca
     4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca
     4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga
     4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt
     4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac
     5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc
     5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac
     5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct
     5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc
     5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc
     5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc
     5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc
     5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc
     5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc
     5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag
     5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta
     5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg
     5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac
     5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac
     5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc
     5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct
     6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt
     6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt
     6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg
     6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta
     6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg
     6301 gactctatac agtcacctga caatgcgcgg ggagatgatg tacagaatac tggtgagcat
     6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac
     6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc
     6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta
     6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca
     6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc
     6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc
     6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct
     6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg
     6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat
     6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca
     6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg
     7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag
     7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga
     7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc
     7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt
     7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta
     7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc
     7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg
     7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag
     7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc
     7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac
     7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag
     7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg
     7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag
     7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt
     7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga
     7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta
     7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca agtaagccac
     8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc
     8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc
     8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc
     8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc
     8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc
     8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg
     8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac
     8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg
     8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg
     8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata
     8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc
     8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact
     8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc
     8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt
     8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag
     8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc
     9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac
     9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc
     9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc
     9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac
     9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg
     9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag
     9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa
     9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg
     9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca
     9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga
     9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta
     9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca
     9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg
     9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg
     9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg
     9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag
     9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc
    10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt
    10081 ggaggatcc
//
LOCUS       HUMPAPB      3061 bp ss-mRNA            PRI       24-JUL-1990
DEFINITION  Human prostatic acid phosphatase mRNA, complete cds.
ACCESSION   M34840
KEYWORDS    glycoprotein; prostatic acid phosphatase; secretory protein.
SOURCE      Human prostate, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1199)
  AUTHORS   Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
            Moore,D.L.
  TITLE     Covalent structure, disulfide bonding and identification of
            reactive surface and active site residues of human prostatic acid
            phosphatase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3061)
  AUTHORS   Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
            Moore,D.L.
  TITLE     Covalent structure, disulfide bonding and identification of
            reactive surface and active site residues of human prostatic acid
            phosphatase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.L.Van Etten, 08-JUN-1990.
               Chemistry Department
               Purdue University
               West Lafayette, IN 47907
FEATURES       from  to/span     description
    pept         15     1175     prostatic acid phosphatase precursor (EC
                                 3.1.3.2) /hgml_locus_uid="LA0168F"
                                 /nomgen="ACPP" /map="3q21-qter"
    sigp         15      110     prostatic acid phosphatase signal peptide
    matp        111     1172     prostatic acid phosphatase
BASE COUNT      917 a    642 c    681 g    821 t
ORIGIN      Chromosome 3q21-qter.
        1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc
       61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga
      121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg
      181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg
      241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt
      301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg
      361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc
      421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc
      481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg
      541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag
      601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg
      661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga
      721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat
      781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc
      841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc
      901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga
      961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc
     1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt
     1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa
     1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag
     1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc
     1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca
     1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt
     1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa
     1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt
     1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt
     1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga
     1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc
     1681 aaggagaggc aaagaaagga gatacagtgg agacatctgg aaagttttct ccactggaaa
     1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt
     1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat
     1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata
     1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga
     1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag
     2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag
     2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt
     2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta
     2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca
     2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa
     2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc
     2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac
     2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa
     2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat
     2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac
     2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc
     2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc
     2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat
     2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc
     2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat
     2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta
     3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa
     3061 a
//
LOCUS       LDV3ECPA     1064 bp ss-RNA             VRL       24-JUL-1990
DEFINITION  Lactate dehydrogenase-elevating virus capsid protein gene, complete
            cds.
ACCESSION   M34856
KEYWORDS    viral capsid protein.
SOURCE      Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones
            A16, B24, B63, B90, B103, C44, dt4.
  ORGANISM  Lactate dehydrogenase-elevating virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Togaviridae.
REFERENCE   1  (bases 1 to 1064)
  AUTHORS   Godeny,E.K., Speicher,D.W. and Brinton,M.A.
  TITLE     Map location of lactate dehydrogenase-elevating virus (LDV) capsid
            protein (Vp1) gene
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.K.Godeny, 06-JUN-1990.
FEATURES       from  to/span     description
    pept        637      984     capsid protein
    pept        135      650     unidentified ORF
BASE COUNT      267 a    223 c    265 g    309 t
ORIGIN      3' terminus region.
        1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc
       61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc
      121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt
      181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg
      241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt
      301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg
      361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc
      421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg
      481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta
      541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga
      601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc
      661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat
      721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc
      781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga
      841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt
      901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat
      961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg
     1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt
//
LOCUS       MBIFRHADGB   3903 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase
            (frhADGB) gene, complete cds.
ACCESSION   J02914
KEYWORDS    8-hydroxy-5-deazaflavin-reducing hydrogenase.
SOURCE      M.thermoautotrophicum (strain delta-H) DNA.
  ORGANISM  Methanobacterium thermoautotrophicum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanobacteriaceae.
REFERENCE   1  (bases 1 to 3903)
  AUTHORS   Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T.
  TITLE     Cloning, sequence determination, and expression of the genes
            encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin
            reducing hydrogenase from Methanobacterium thermoautotrophicum
            delta-H
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.Alex, 25-MAY-1990.
FEATURES       from  to/span     description
    pept        298     1515     F420 hydrogenase alpha-subunit (frhA)
    pept       1524     2000     F420 hydrogenase delta-subunit (frhD)
    pept       2114     2824     F420 hydrogenase gamma-subuint (frhG)
    pept       2837     3682     F420 hydrogenase beta-subunit (frhB)
    binding     284      293     ribosomal binding site (put.)
    binding    1514     1520     ribosomal binding site (put.)
    binding    2100     2106     ribosomal binding site (put.)
    binding    2823     2832     ribosomal binding site (put.)
BASE COUNT     1201 a    878 c    965 g    859 t
ORIGIN      
        1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta
       61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac
      121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt
      181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt
      241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg
      301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg
      361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg
      421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg
      481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc
      541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac
      601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac
      661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt
      721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac
      781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa
      841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc
      901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa
      961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata
     1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca
     1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt
     1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg
     1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta
     1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag
     1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca
     1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc
     1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa
     1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg
     1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt
     1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg actggcggtc
     1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg
     1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg
     1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa
     1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa
     1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg
     1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc
     2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga
     2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag
     2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact
     2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga
     2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga
     2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac
     2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt
     2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc
     2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc
     2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt
     2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct
     2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg
     2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct
     2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac
     2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata
     2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg
     3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa
     3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga
     3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta
     3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat
     3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat
     3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga
     3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg
     3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg
     3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga
     3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga
     3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat
     3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt
     3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa
     3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt
     3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat
     3901 cga
//
LOCUS       MUSLRCNA     1751 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse loricrin gene, complete cds.
ACCESSION   M34398
KEYWORDS    cell envelope protein; loricrin.
SOURCE      Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1751)
  AUTHORS   Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D.,
            Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M.,
            Yuspa,S.H. and Roop,D.R.
  TITLE     Identification of a major keratinocyte cell envelope protein,
            loricrin
  JOURNAL   Cell 61, 1103-1112 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Rothnagel, 22-MAY-1990.
FEATURES       from  to/span     description
    pept         71     1516     loricrin
    signal     1740     1745     polyA signal
BASE COUNT      223 a    493 c    706 g    329 t
ORIGIN      
        1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc
       61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg
      121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg
      181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg
      241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg
      301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg
      361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg
      421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg
      481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg
      541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg
      601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg
      661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc
      721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg
      781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg
      841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg
      901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg
      961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg
     1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg
     1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc
     1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg
     1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag
     1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg
     1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg
     1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg
     1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac
     1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc
     1561 cgtgggcgcc gatgggctta actttctcat gaatttgcct gaggtttcca aacccttcac
     1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct
     1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca
     1741 ataaatttgc t
//
LOCUS       PFAGIPA      1906 bp ds-DNA             INV       24-JUL-1990
DEFINITION  P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete
            cds.
ACCESSION   J05544
KEYWORDS    glucosephosphate isomerase.
SOURCE      P.falciparum (strain 7G8) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 1906)
  AUTHORS   Kaslow,D.C. and Hill,S.
  TITLE     Cloning metabolic pathway genes by complementation in E.coli:
            Isolation and expression of Plasmodium falciparum glucosephosphate
            isomerase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.C.Kaslow, 04-JUN-1990.
FEATURES       from  to/span     description
    pept         28     1803     glucosephosphate isomerase (EC 5.3.1.9)
BASE COUNT      782 a    174 c    289 g    661 t
ORIGIN      Chromosome 14.
        1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat
       61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat
      121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct
      181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa
      241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc
      301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata
      361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat
      421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg
      481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa
      541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat
      601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat
      661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct
      721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt
      781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg
      841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta
      901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga
      961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat
     1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat
     1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca
     1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa
     1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat
     1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa
     1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg
     1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa
     1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt
     1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta
     1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt
     1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat
     1681 gacacaagaa atcaaaagaa atcagataat acctataatt ttaatgaatc tacaaaaaat
     1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag
     1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt
     1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt
//
LOCUS       RATGADA      3198 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Rat glutamic acid decarboxylase (GAD) mRNA, complete cds.
ACCESSION   M34445
KEYWORDS    glutamic acid decarboxylase.
SOURCE      Rat brain and hypothalamus, cDNA to mRNA, (libraries of
            D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and
            pCR22HP.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3198)
  AUTHORS   Wyborski,R.J., Bond,R.W. and Gottlieb,D.I.
  TITLE     Characterization of a cDNA coding for rat glutamic acid
            decarboxylase
  JOURNAL   Mol. Brain Res. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.I.Gottlieb, 23-MAY-1990.
FEATURES       from  to/span     description
    pept        184     1965     glutamic acid decarboxylase
    mRNA    <     1     3198     glutamic acid decarboxylase mRNA
    signal     3174     3179     polyA signal
BASE COUNT      869 a    742 c    767 g    820 t
ORIGIN      
        1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca
       61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc
      121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag
      181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat
      241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc
      301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag
      361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa
      421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct
      481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg
      541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac
      601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac
      661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt
      721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta
      781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc
      841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca
      901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc
      961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc
     1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg
     1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt
     1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat
     1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca
     1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg
     1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc
     1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa
     1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag
     1501 cagtatgacg tctcctacga tacaggggac aaggcgattc agtgtggccg ccatgtggac
     1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc
     1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt
     1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa
     1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc
     1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg
     1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc
     1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc
     1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca
     2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg
     2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa
     2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat
     2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc
     2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca
     2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag
     2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta
     2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca
     2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac
     2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag
     2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt
     2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta
     2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt
     2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt
     2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc
     2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa
     3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta
     3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt
     3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa
     3181 taaaaaatat tctccatg
//
LOCUS       RATTSHRA     5270 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Rat thyroid stimulating hormone receptor mRNA, complete cds.
ACCESSION   M34842
KEYWORDS    TSH receptor; thyroid stimulating hormone receptor;
            transmembrane protein.
SOURCE      Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to
            mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5270)
  AUTHORS   Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C.,
            McBride,O.W. and Kohn,L.D.
  TITLE     Cloning, chromosomal assignment and regulation of the rat
            thyrotropin receptor by thyrotropin, agents which increase cAMP
            levels and thyroid autoantibodies
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.D.Kohn, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         55     2349     thyroid stimulating hormone receptor precursor
    sigp         55      117     thyroid stimulating hormone receptor signal
                                 peptide
    matp        118     2346     thyroid stimulating hormone receptor
    signal     5256     5261     poly-A signal
BASE COUNT     1415 a   1343 c   1176 g   1336 t
ORIGIN      
        1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg
       61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga
      121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag
      181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact
      241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat
      301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa
      361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta
      421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct
      481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct
      541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg
      601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg
      661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga
      721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct
      781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag
      841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac
      901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat
      961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc
     1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag
     1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag
     1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac
     1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca
     1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg
     1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc
     1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc
     1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac
     1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt
     1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc
     1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg
     1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc
     1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac
     1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg
     1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt
     1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc
     1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg
     2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc
     2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag
     2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa
     2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt
     2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca
     2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa
     2401 catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct
     2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc
     2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac
     2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg
     2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag
     2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga
     2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat
     2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc
     2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat
     2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt
     3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta
     3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat
     3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac
     3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga
     3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt
     3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa
     3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct
     3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt
     3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt
     3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata
     3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt
     3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac
     3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat
     3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga
     3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat
     3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag
     3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc
     4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc
     4081 tgtgaagtag ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg
     4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa
     4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga
     4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag
     4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc
     4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg
     4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc
     4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct
     4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg
     4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca
     4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa
     4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta
     4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga
     4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg
     4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc
     4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg
     5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca
     5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt
     5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg
     5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg
//
LOCUS       STMBLIPA      840 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  S.clavuligerus beta-lactamase inhibitory protein gene, complete
            cds.
ACCESSION   M34538
KEYWORDS    beta-lactamase inhibitory protein.
SOURCE      S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP.
  ORGANISM  Streptomyces clavuligerus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 840)
  AUTHORS   Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E.
  TITLE     Isolation and characterization of a beta-lactamase inhibitory
            protein from Streptomyces clavuligerus and cloning and analysis of
            corresponding gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.E.Jensen, 25-MAY-1990.
               Dept. of Microbiology
               University of Alberta
               Edmonton, Alberta Canada T6G 2E9
FEATURES       from  to/span     description
    pept         67      672     beta-lactamase inhibitory protein precursor
                                 (bliP)
    sigp         67      174     beta-lactamase inhibitory protein signal
                                 peptide
    matp        175      669     beta-lactamase inhibitory protein
    binding      52       63     ribosomal binding site
    signal      686      724     transcription termination signal
BASE COUNT      140 a    248 c    320 g    132 t
ORIGIN      
        1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga
       61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc
      121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg
      181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc
      241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc
      301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc
      361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg
      421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg
      481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg
      541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc
      601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg
      661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc
      721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg
      781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc
//
LOCUS       TRBPARPA      763 bp ss-mRNA            INV       24-JUL-1990
DEFINITION  T.brucei procyclin (pPS-1) mRNA, complete cds.
ACCESSION   M34790
KEYWORDS    procyclic acidic repetitive protein; procyclin.
SOURCE      T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone
            pPS-1.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 763)
  AUTHORS   Dorn,P.L., Aman,R.A. and Boothroyd,J.C.
  TITLE     Inhibition of protein synthesis results in super-induction of
            procyclin RNA
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Boothroyd, 04-JUN-1990.
               Stanford University School of Medicine
               Dept of Microbiology and Immunology
               Sherman Fairchild Science Building
               Stanford, CA 94305-5402 USA
FEATURES       from  to/span     description
    pept         29      466     procyclin
    mRNA    <     1      763     procyclin mRNA
    rpt         215      393     12 bp degenerate repeats
BASE COUNT      192 a    215 c    156 g    200 t
ORIGIN      
        1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct
       61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc
      121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg
      181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga
      241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga
      301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga
      361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc
      421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt
      481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc
      541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt
      601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc
      661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag
      721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt
//
LOCUS       W10PPIBCA    4761 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes,
            complete cds, and ippD gene, 3' end.
ACCESSION   M34849
KEYWORDS    antigen.
SOURCE      Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA,
            clones HB101/pHS[4108,5357].
  ORGANISM  Plasmid pWR100
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 4761)
  AUTHORS   Baudry,B., Kaczorek,M. and Sansonetti,P.J.
  TITLE     Nucleotide sequence of the invasion plasmid antigen B and C genes
            (ipaB and ipaC) of Shigella flexneri
  JOURNAL   Microb. Pathog. 4, 345-357 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 538 and 3987 to 4761)
  AUTHORS   Baudry,B., Kaczorek,M. and Sansonetti,P.J.
  JOURNAL   Unpublished (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Baudry, 05-JUN-1990.
               Center for Vaccine Development
               University of Maryland at Baltimore
               10 South Pine Street
               Baltimore, MD 21201
FEATURES       from  to/span     description
    pept        606     1073     polypeptide I (ippI)
    pept       1079     2821     polypeptide B (ipaB)
    pept       2841     3932     polypeptide C (ipaC)
    pept       3983  >  4761     ipaD protein
    pept    <     1      549     unidentified ORF
    binding     595      600     ippI ribosomal binding site
    binding    1065     1070     ipaB ribosomal binding site
    binding    2829     2834     ipaC ribosomal binding site
    binding    3971     3977     ipaD ribosomal binding site
    signal      556      561     ippI -10 region
    signal     2723     2728     ipaC -35 region
    signal     2745     2750     ipaC -10 region
BASE COUNT     1769 a    952 c    805 g   1235 t
ORIGIN      
        1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat
       61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat
      121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct
      181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca
      241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat
      301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc
      361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg
      421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca
      481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat
      541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag
      601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa
      661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca
      721 tttattcata tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct
      781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta
      841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat
      901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag
      961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat
     1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat
     1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga
     1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac
     1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa
     1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct
     1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa
     1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga
     1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat
     1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat
     1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga
     1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca
     1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat
     1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa
     1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg
     1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc
     1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga
     1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt
     2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt
     2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca
     2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc
     2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc
     2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac
     2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac
     2401 cctcacagac cttataccaa agtttctcaa gaatttttct tctcaactgg acgatttaat
     2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc
     2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc
     2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa
     2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga
     2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc
     2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg
     2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata
     2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc
     2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat
     3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg
     3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag
     3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag
     3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga
     3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc
     3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa
     3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc
     3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa
     3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc
     3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt
     3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc
     3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca
     3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca
     3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc
     3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag
     3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg
     3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc
     4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa
     4081 aacaacgacc agttctcatc ctgtaagttc ccttactatg ctcaacgaca cccttcataa
     4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac
     4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc
     4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt
     4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga
     4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata
     4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag
     4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa
     4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc
     4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac
     4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat
     4741 agacaatatg ttaaaaagct t
//
LOCUS       YEPOMPHA      824 bp ss-mRNA            BCT       24-JUL-1990
DEFINITION  Y.enterocolitica cationic outer membrane protein (ompH) gene,
            complete cds.
ACCESSION   M34854
KEYWORDS    cationic outer membrane protein;
            enterobacterial outer membrane protein;
            lipopolysaccharide binding protein; secretory protein;
            structural membrane protein.
SOURCE      Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 824)
  AUTHORS   Hirvas,L., Koski,P. and Vaara,M.
  TITLE     The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and
            comparison to known ompH sequences
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.H.Hirvas, 06-JUN-1990.
               University of Helsinki
FEATURES       from  to/span     description
    pept        324      818     ompH protein precursor (gtg start codon)
    sigp        324      392     ompH protein signal peptide
    matp        393      815     ompH protein
    binding     308      317     ompH ribosomal binding site
    signal      120      125     ompH -35 region
    signal      143      148     ompH -10 region
BASE COUNT      257 a    147 c    198 g    221 t      1 others
ORIGIN      
        1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg
       61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg
      121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt
      181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt
      241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa
      301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag
      361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa
      421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag
      481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc
      541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga
      601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc
      661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta
      721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta
      781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac
//
LOCUS       YSCEXG1A     2056 bp ds-DNA             PLN       24-JUL-1990
DEFINITION  S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase
            (EXG1) gene, complete cds.
ACCESSION   M34341
KEYWORDS    1,3-glucan glucanohydrolase; exo-1,3-B-glucanase.
SOURCE      S.cerevisiae (strain AB320) DNA, clone YEp13.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2056)
  AUTHORS   Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A.,
            Nebreda,A.R., Mendez,E. and del Rey,F.
  TITLE     Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1,
            in the yeast Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.del Rey, 17-MAY-1990.
               Instituto de Microbiologia bioquimica
               Facultad de Biologia
               CSIC/Universidad de Salamanca
               37008 Salamanca, SPAIN
FEATURES       from  to/span     description
    pept        363     1709     exo-1,3-B-glucanase/1,3-B-D-glucan
                                 glucanohydrolase precursor (EXG1, EC 3.2.1.58)
    sigp        363      482     exo-1,3-B-glucanase/1,3-B-D-glucan
                                 glucanohydrolase signal peptide
    matp        483     1706     exo-1,3-B-glucanase/1,3-B-D-glucan
                                 glucanohydrolase
    mRNA        265  >  1709     EXG1 mRNA
    signal      153      159     TATA box
BASE COUNT      610 a    395 c    396 g    655 t
ORIGIN      Chromosome XII-right arm.
        1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg
       61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac
      121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa
      181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat
      241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt
      301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta
      361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta
      421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa
      481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg
      541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg
      601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg
      661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg
      721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc
      781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca
      841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt
      901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca
      961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat
     1021 acttggacac tgttattggt atcgaattga ttaatgagcc attgggtcct gttctagaca
     1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca
     1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact
     1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct
     1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg
     1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga
     1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg
     1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt
     1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct
     1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg
     1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc
     1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat
     1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt
     1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa
     1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta
     1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat
     1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt
     2041 gagcttttcc gccggc
//
LOCUS       HUMPSG6A     1409 bp ss-mRNA            PRI       24-JUL-1990
DEFINITION  Human pregnancy-specific glycoprotein-6 (PSG6) mRNA,
            complete cds.
ACCESSION   M33666
KEYWORDS    pregnancy-specific glycoprotein.
SOURCE      Human fetal liver first trimester, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1409)
  AUTHORS   Zimmermann,W., Weiss,M. and Thompson,J.A.
  TITLE     cDNA cloning demonstrates the expression of pregnancy-specific
            glycoprotein genes, a subgroup of the carcinoembryonic antigen gene
            family, in fetal liver
  JOURNAL   Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         53     1360     pregnancy-specific glycoprotein-6
BASE COUNT      395 a    403 c    294 g    317 t
ORIGIN      
        1 gggcgggcct aggctcatct ccacagggga gaacacacag acagcagaga ccatgggacc
       61 cctctcagcc cctccctgca ctcagcacat cacctggaag gggctcctgc tcacagcatc
      121 acttttaaac ttctggaacc tgcccaccac tgcccaagta ataattgaag ccaagccacc
      181 caaagtttcc gaggggaagg atgttcttct acttgtccac aatttgcccc agaatcttac
      241 tggctacatc tggtacaaag ggcaaatgac ggacctctac cattacatta catcatatgt
      301 agtacacggt caaattatat atgggcctgc ctacagtgga cgagaaacag tatattccaa
      361 tgcatccctg ctgatccaga atgtcacaca ggaggatgca ggatcctaca ccttacacat
      421 cataaagcga ggcgatggga ctggaggagt aactggatat ttcactgtca ccttatactc
      481 ggagactccc aagccctcca tctccagcag caacttaaac cccagggagg tcatggaggc
      541 tgtgcgctta atctgtgatc ctgagactcc ggatgcaagc tacctgtggt tgctgaatgg
      601 tcagaacctc cctatgactc acaggttgca gctgtccaaa accaacagga ccctctatct
      661 atttggtgtc acaaagtata ttgcaggacc ctatgaatgt gaaatacgga acccagtgag
      721 tgccagccgc agtgacccag tcaccctgaa tctcctcccg aagctgccca tgccttacat
      781 caccatcaac aacttaaacc ccagggagaa gaaggatgtg ttagccttca cctgtgaacc
      841 taagagtcgg aactacacct acatttggtg gctaaatggt cagagcctcc cggtcagtcc
      901 gagggtaaag cgacccattg aaaacaggat actcattcta cccagtgtca cgagaaatga
      961 aacaggaccc tatcaatgtg aaatacggga ccgatatggt ggcatccgca gtaacccagt
     1021 caccctgaat gtcctctatg gtccagacct ccccagaatt tacccttcat tcacctatta
     1081 ccgttcagga gaaaacctcg acttgtcctg ctttgcggac tctaacccac cggcagagta
     1141 ttcttggaca attaatggga agtttcagct atcaggacaa aagctcttta tcccccaaat
     1201 tactacaaat catagcgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga
     1261 aatctccaaa tccatgatag tcaaagtctc tgagacagca tctccccagg ttacctatgc
     1321 tggtccaaac acctggtttc aagaaatcct tctgctgtga cctcccaaag tgctaggatt
     1381 aaaacatgac ccaccatgaa acccgccca
//
LOCUS       HUMPSGA      1930 bp ss-mRNA            PRI       24-JUL-1990
DEFINITION  Human pregnancy-specific glycoprotein-1a (PSG-1a) mRNA,
            complete cds.
ACCESSION   M33664
KEYWORDS    pregnancy-specific glycoprotein.
SOURCE      Human fetal liver first trimester, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1930)
  AUTHORS   Zimmermann,W., Weiss,M. and Thompson,J.A.
  TITLE     cDNA cloning demonstrates the expression of pregnancy-specific
            glycoprotein genes, a subgroup of the carcinoembryonic antigen gene
            family, in fetal liver
  JOURNAL   Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1     1260     pregnancy-specific glycoprotein-1a
BASE COUNT      560 a    520 c    390 g    460 t
ORIGIN      
        1 atgggaaccc tctcagcccc tccctgcaca cagcgcatca aatggaaggg gctcctgctc
       61 acagcatcac ttttaaactt ctggaacctg cccaccactg cccaagtcac gattgaagcc
      121 cagccaccca aagtttccga ggggaaggat gttcttctac ttgtccacaa tttgccccag
      181 aatcttaccg gctacatctg gtacaaaggg caaatgaggg acctctacca ttacattaca
      241 tcatatgtag tagacggtga aataattata tatgggcctg catatagtgg acgagaaaca
      301 gcatattcca atgcatccct gctgatccag aatgtcaccc gggaggacgc aggatcctac
      361 accttacaca tcataaaggg agatgatggg actagaggag taactggacg tttcaccttc
      421 accttacacc tggagactcc taagccctcc atctccagca gcaacttaaa tcccagggag
      481 accatggagg ctgtgagctt aacctgtgac cctgagactc cagacgcaag ctacctgtgg
      541 tggatgaatg gtcagagcct ccctatgact cacagcttga agctgtccga aaccaacagg
      601 accctctttc tattgggtgt cacaaagtat actgcaggac cctatgaatg tgaaatacgg
      661 aacccagtga gtgccagccg cagtgaccca gtcaccctga atctcctccc gaagctgccc
      721 aagccctaca tcaccatcaa caacttaaac cccagggaga ataaggatgt cttaaacttc
      781 acctgtgaac ctaagagtga gaactacacc tacatttggt ggctaaatgg tcagagcctc
      841 ccggtcagtc ccagggtaaa gcgacccatt gaaaacagga tcctcattct acccagtgtc
      901 acgagaaatg aaacaggacc ctatcaatgt gaaatacggg accgatatgg tggcatccgc
      961 agtgacccag tcaccctgaa tgtcctctat ggtccagacc tccccagaat ttacccttca
     1021 ttcacctatt accgttcagg agaagtcctc tacttgtcct gttctgcgga ctctaaccca
     1081 ccggcacagt attcttggac aattaatgaa aagtttcagc taccaggaca aaagctcttt
     1141 atccgccata ttactacaaa gcatagcggg ctctatgttt gctctgttcg taactcagcc
     1201 actggcaagg aaagctccaa atccatgaca gtcgaagtct ctgactggac agttccctga
     1261 attctactag tacctccaat tccattttct cccatggaat cactaagagc aagacccact
     1321 ctgttccaga agccctataa gctggaggtg gacaactcaa tgtaaatttc atgggaaaac
     1381 ccttgtacct gaagcatgag ccactcagaa ctcaccaaaa tattcgacac cataacaaca
     1441 gatgctcaaa ctgtaaacca ggacaacaag tggatgactt cacactgtgg acagtttttc
     1501 ccaagatgtc agaagaagac tccccatcat gatgaggctc tcccccctct taactgtcct
     1561 tgctcatgcc tgcctctttc acttggcagg ataatgcagt cattagaatt tcacatgtag
     1621 tagcttctga gagtaacaac agagtgtcag atatgtcatc tcaacctcaa acttttatgt
     1681 aacatctcag ggggaaatgt ggctctctcc accttgcata cagggctccc aatagaaatc
     1741 aacacagaga tattgcctgt gtgttttcag agaagatggt ttgtatgaag acgtaggaaa
     1801 gctgaaatta taatagagtc ccctttaaat ccacattgtg tggatggctc ttgccgtttc
     1861 ctaagagata cattgtaaaa cgtgacagta agacattcta gcagaataaa acatgtacca
     1921 cctttgcccc
//
LOCUS       HUMPSGC      1307 bp ss-mRNA            PRI       24-JUL-1990
DEFINITION  Human pregnancy-specific glycoprotein-4 (PSG4) mRNA, 3' end.
ACCESSION   M33665
KEYWORDS    pregnancy-specific glycoprotein.
SOURCE      Human fetal liver first trimester, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1307)
  AUTHORS   Zimmermann,W., Weiss,M. and Thompson,J.A.
  TITLE     cDNA cloning demonstrates the expression of pregnancy-specific
            glycoprotein genes, a subgroup of the carcinoembryonic antigen gene
            family, in fetal liver
  JOURNAL   Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      639     pregnancy-specific glycoprotein-4 (AA at 1)
BASE COUNT      394 a    339 c    249 g    325 t
ORIGIN      
        1 gcggggtata ttgcaggacc ctatgaatgt gaaatacgga acccagtgag tgccagccgc
       61 agtgacccag tcaccctgaa tctcctccca aagctgtcca agccctacat cacaatcaac
      121 aacttaaacc ccagagagaa taaggatgtc ttaaccttca cctgtgaacc taagagtgag
      181 aactacacct acatttggtg gctaaatggt cagagcctcc ctgtcagtcc cagggtaaag
      241 cgacccattg aaaacaggat cctcattcta cccaatgtca cgagaaatga aacaggacct
      301 tatcaatgtg aaatacggga ccgatatggt ggcatccgca gtgacccagt caccctgaat
      361 gtcctctatg gtccagacct ccccagcatt tacccttcat tcacctatta ccgttcagga
      421 gaaaacctct acttgtcctg cttcgccgag tctaacccac gggcacaata ttcttggaca
      481 attaatggga agtttcagct atcaggacaa aagctctcta tcccccaaat aactacaaag
      541 catagtgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga aagctccaaa
      601 tccatcacag tcaaagtctc tgactggata ttaccctgaa ttctactagt tcctccaatt
      661 ccattttctc ccatggaatc acgaagagca agacccactc tgttccagaa gccctataag
      721 ctggaggtgg acaactcgat gtaaatttca tgggaaaacc cttgtacctg acatgtgagc
      781 cactcagaac tcaccaaaat gttcgacacc ataacaacag ctactcaaac tgtaaaccag
      841 gataagaagt tgatgacttc acactgtgga cagtttttcc aaagatgtca gaacaagact
      901 ccccatcatg ataaggctcc cacccctctt aactgtcctt gctcatgcct gcctctttca
      961 cttggcagga taatgcagtc attagaattt cacatgtagt agcttctgag ggtaacaaca
     1021 gagtgtcaga tatgtcatct caacctcaaa cttttacgta acatctcagg cgaaatgtgg
     1081 ctctctccat cttgcataca gggctcccaa tagaaatgaa cacagagata ttgcctgtgt
     1141 gtttgcagag aagatggttt ctataaagag taggaaagct gaaattatag tagagtctcc
     1201 tttaaatgca cattgtgtgg atggctctca ccatttccta agagatacag tgtaaaacgt
     1261 gacagtaata ctgattctag cagaataaaa catgtaccac atttccc
//
LOCUS       HUMPSGD      1699 bp ss-mRNA            PRI       24-JUL-1990
DEFINITION  Human pregnancy-specific glycoprotein-1d (PSG-1d) mRNA,
            complete cds.
ACCESSION   M33663
KEYWORDS    pregnancy-specific glycoprotein.
SOURCE      Human fetal liver first trimester, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1699)
  AUTHORS   Zimmermann,W., Weiss,M. and Thompson,J.A.
  TITLE     cDNA cloning demonstrates the expression of pregnancy-specific
            glycoprotein genes, a subgroup of the carcinoembryonic antigen gene
            family, in fetal liver
  JOURNAL   Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         64     1344     pregnancy-specific glycoprotein-1d
BASE COUNT      493 a    442 c    343 g    421 t
ORIGIN      
        1 ctcagagagt ttctggatcc taggcttatc tccacagagg agaacacaca agcagcagag
       61 accatgggaa ccctctcagc ccctccctgc acacagcgca tcaaatggaa ggggctcctg
      121 ctcacagcat cacttttaaa cttctggaac ctgcccacca ctgcccaagt cacgattgaa
      181 gcccagccac ccaaagtttc cgaggggaag gatgttcttc tacttgtcca caatttgccc
      241 cagaatctta ccggctacat ctggtacaaa gggcaaatga gggacctcta ccattacatt
      301 acatcatatg tagtagacgg tgaaataatt atatatgggc ctgcatatag tggacgagaa
      361 acagcatatt ccaatgcatc cctgctgatc cagaatgtca cccgggagga cgcaggatcc
      421 tacaccttac acatcataaa gggagatgat gggactagag gagtaactgg acgtttcacc
      481 ttcaccttac acctggagac tcctaagccc tccatctcca gcagcaactt aaatcccagg
      541 gagaccatgg aggctgtgag cttaacctgt gaccctgaga ctccagacgc aagctacctg
      601 tggtggatga atggtcagag cctccctatg actcacagct tgaagctgtc cgaaaccaac
      661 aggaccctct ttctattggg tgtcacaaag tatactgcag gaccctatga atgtgaaata
      721 cggaacccag tgagtgccag ccgcagtgac ccagtcaccc tgaatctcct cccgaagctg
      781 cccaagccct acatcaccat caacaactta aaccccaggg agaataagga tgtcttaaac
      841 ttcacctgtg aacctaagag tgagaactac acctacattt ggtggctaaa tggtcagagc
      901 ctcccggtca gtcccagggt aaagcgaccc attgaaaaca ggatcctcat tctacccagt
      961 gtcacgagaa atgaaacagg accctatcaa tgtgaaatac gggaccgata tggtggcatc
     1021 cgcagtgacc cagtcaccct gaatgtcctc tatggtccag acctccccag aatttaccct
     1081 tcattcacct attaccgttc aggagaagtc ctctacttgt cctgttctgc ggactctaac
     1141 ccaccggcac agtattcttg gacaattaat gaaaagtttc agctaccagg acaaaagctc
     1201 tttatccgcc atattactac aaagcatagc gggctctatg tttgctctgt tcgtaactca
     1261 gccactggca aggaaagctc caaatccatg acagtcgaag tctctggtaa gtggatccca
     1321 gcatcgttgg caatagggtt ttaggtggag tctatctggc attcagagaa gagtcaggaa
     1381 aacaattgta ttcccagcct gtgtccctag ggcacaagca aatcccaaat tctcctcctg
     1441 aaccctccaa atttgtctaa gaacttcgaa aactttaaca aacaggctga tatcttcata
     1501 atattcccag cctagaccaa gcaggaagaa cattgatttc attgaaataa ttgataataa
     1561 tgaagataat gtttttatga tttttatttg aaaatttgct aattctttaa atggtttgtt
     1621 ttctacattg atggaatttt tctcttttaa tctatctaca gcttatagca gttcaataaa
     1681 ctatacttct gggaaccgt
//
LOCUS       RHMDCTAB      764 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  R.meliloti dctA and dctB genes, 5' end.
ACCESSION   M33555
KEYWORDS    regulatory protein.
SOURCE      R.meliloti DNA.
  ORGANISM  Rhizobium meliloti
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 764)
  AUTHORS   Wang,Y.-P., Birkenhead,K., Boesten,B., Manian,S. and O'Gara,F.
  TITLE     Genetic analysis and regulation of the Rhizobium meliloti genes
            controlling C-4-dicarboxylic acid transport
  JOURNAL   Gene 85, 135-144 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        726  >   764     dctA protein
BASE COUNT      149 a    239 c    252 g    124 t
ORIGIN      
        1 cccgggccgc ttgctgaccg tgcccatggc gaaatgttcg gccatgccgt cggcggacgg
       61 cgagccggaa ataatcgcgg aaggcatagt cgttgccgac gaagctcgtc ggctcctgcc
      121 agttgctggc cgcgacggcg acgccgctgc ggtcgatcag ataaatgacc gcggcttcgg
      181 cgcttgtcgc cagggcctcg agcttacggt tgatgcggtc gagcgacggc ctgtccggcg
      241 aaagcaatgc gccacgaatg gctgcgtcgt cggcgaggac aagcggcagg gcgcgctgcc
      301 gttccacgac tgctcgaaga agcgaggctt tcaggctggc gtcgatccgg ctctgaccgg
      361 caaggccggc gagcgcctgc gaccggccgt agtctcgcgc aagaagaagg ccggctgcca
      421 gaaggaccag cgcgactgcc gcgaaaacga gccaggaccg tcgggcccgg ctgcgaacgc
      481 atgcgatcgc tcgcttctgc aggaagtttg accatgcgaa cattgtgcat gttttcgccc
      541 aggacgccag cacttctgtg cggaaatccg cacatatcca cgaacggcaa gcgagcaccg
      601 ctcccaaaaa tgtcatggat tcaatcgcat aggccttccg cggcaaactg gcacgcatgt
      661 tgctgacaag ctccacaagg cagccactgc tgtcgatctt cggaaagcgg cccgggaggc
      721 ccggcatgtt gccggactgg gcctgccacg tggaggatat catg
//
LOCUS       MNKSVAA       589 bp ds-DNA             PRI       24-JUL-1990
DEFINITION  Monkey sp. DNA fragment found also in SV40 genome.
ACCESSION   M33902
KEYWORDS    .
SOURCE      Monkey sp. DNA.
  ORGANISM  Monkey sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; 
            Cercopithecidae.
REFERENCE   1  (bases 1 to 589)
  AUTHORS   Szymanski,P. and Woodworth,M.
  TITLE     A 69-base-pair monkey DNA sequence enhances simian virus 40
            replication and transcription through multiple motifs
  JOURNAL   J. Virol. 64, 1360-1365 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    binding     221      230     AP-4 binding site
    binding     243      249     AP-1 binding site
BASE COUNT      162 a    161 c    118 g    148 t
ORIGIN      
        1 aagcttcgta gaatctcaga aagtcataaa gctacaaacg agttcacaaa caagtgatgc
       61 atagaccaca cccagcccac agatgagttc tgtgtggctc acacttattt tttaaaacag
      121 agtttgctgc caacacttca aaacccaatg attttgcata aaaatccaga tttattagaa
      181 aatctgtgga cttttctgcc tggcaacaac catctggaac tcagcagcgg ttgcccctga
      241 gatgagtcac agttcccacc ccactcaccc atgtcaccta tatggcttgg aaggcatgtg
      301 agtttgcaac ctgctcaaag ccacacccca gaaacggcct gctaaaatgg gattccagaa
      361 tccacttgca cctttcgtct ttctcagtct cccctgactc aacacagcac actggcattg
      421 ttttttttcc tctcagaaaa ctgagcatgc acttttgaga atcagagaga aatccgcctc
      481 tcaaatgtca gaaaacagct gatgctgcct cttacttggg ggcctcttga cttcccacgt
      541 gtcttactgc cacttgagag ggaggggtgc aggcaggcag gctccctgg
//
LOCUS       MNKSVAB        69 bp ds-DNA             PRI       24-JUL-1990
DEFINITION  Monkey sp. DNA fragment also found in SV40 genome.
ACCESSION   M33903
KEYWORDS    .
SOURCE      Monkey sp. DNA.
  ORGANISM  Monkey sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; 
            Cercopithecidae.
REFERENCE   1  (bases 1 to 69)
  AUTHORS   Szymanski,P. and Woodworth,M.
  TITLE     A 69-base-pair monkey DNA sequence enhances simian virus 40
            replication and transcription through multiple motifs
  JOURNAL   J. Virol. 64, 1360-1365 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    binding       1        7     AP-1 binding site
    binding      41       50     AP-4 binding site
    binding      63       69     AP-1 binding site
BASE COUNT       16 a     21 c     15 g     17 t
ORIGIN      
        1 tgactcatca cttttctgcc tggcaacaac catctggaac tcagcagcgg ttgcccctga
       61 gatgagtca
//
LOCUS       RSSFBPPRK    2099 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  R.sphaeroides fructose 1,6-bisphosphatase (fbpB) and
            phosphoribulokinase (prkB) genes, complete cds.
ACCESSION   J02922
KEYWORDS    1,6-bisphosphatase; phosphoribulokinase.
SOURCE      R.sphaeroides (strain HR) DNA.
  ORGANISM  Rhodobacter sphaeroides
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 2099)
  AUTHORS   Gibson,J.L., Chen,J.-H., Tower,P.A. and Tabita,F.R.
  TITLE     The form II fructose 1,6-bisphosphatase and phosphoribulokinase
            genes form part of a large operon in Rhodobacter sphaeroides:
            Primary structure and insertional mutagenesis analysis
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.L.Gibson, 15-JUN-1990.
FEATURES       from  to/span     description
    pept         44     1039     fructose 1,6-bisphosphatase (fbpB)
    pept       1043     1921     phosphoribulokinase (prkB)
BASE COUNT      361 a    694 c    720 g    324 t
ORIGIN      
        1 ggatcctgac gcaacgtccg ccgcgacaga ggcaggagga gccatggcca tcgagctgga
       61 ggacctgggg ctgagccccg atgtggcgga cgtgatgcag cgtctggcgc gcgtgggggc
      121 aggcatcgcc cgcatcatct cgcgcaacgg gctcgagcgc gatctgggcg cgggcgtcgg
      181 caccaatgcc ggaggagacg ggcagaaggc gctcgacgtg atcgcggacg acgcgttccg
      241 cgcggcgctc gaaggctctg cggtggctta ttacgcctcc gaggagcagg acgaagtggt
      301 gacgctgggc gagggaagcc tcgcgctcgc catcgacccg ctggacggct cgtccaacat
      361 cgatgtgaac gtgtcgatcg ggacgatctt ctcgatcttc ccggcggcgg ctggccccga
      421 ggccagcttc ctgcgcccgg gcaccgagca gattgccggc ggctacatca tctacgggcc
      481 gcaatgcgcg ctggtctgca gcttcgggca gggcgtgcag cactgggtgc tcgacctcga
      541 tgcgggcatc ttccggcgga tgcccgacat ccgcccgctg ccggccgaga cgtccgagtt
      601 tgcgatcaac gcctcgaact accgccactg gccgcagccg atccgcgcct tcgtcgacga
      661 tctggtcgcc ggggccgagg ggccgcgcgg caagaacttc aacatgcgct ggatcgcctc
      721 gctggtggcc gagacgcacc gcatcctgat gcggggcggg gtgtttctct atcccggcga
      781 cgagcgcaag ggctacgagc ggggccggct gcgccatgtc tacgaatgcg cgcccatcgc
      841 cttcctgatc gcgaatgtcg gggggggcgc caccgacggc tgcgccgaca tcctgaccgc
      901 gctgcccgac cggctgcacg cccgcacccc cttcgtcttc ggctgcgcga gcaaggtcgc
      961 ccgcgtcgcc gcctatcacg atctggcctg cgaagagacg tccgctctct tcggcagccg
     1021 gggcctgttc cggagttaaa gagtgtcgaa gaaatatccc atcatttccg tggtcggctc
     1081 gtccggcgcg ggcacctcga cggtcaagaa cacgttcgag cagatcttcc gccgcgaggg
     1141 ggtcaagtcc gtctcgatcg agggcgacgc cttccaccgc ttcaaccggg ccgacatgaa
     1201 ggccgaactc gagcggcgct atgcggcggg cgatgcgacc ttctcgcatt tctcctacga
     1261 ggcgaacgaa ctgaaggagc tggagcgcgt cttccgcgaa tatggcgaga cggggcgcgg
     1321 ccgcacccgc acctatgtcc atgacgatgc cgaagccgcc cggacgggcg tggcccccgg
     1381 caatttcacc caatgggcgc cgttcgagga caacagcgac ctgcttttct acgaggggct
     1441 gcacggctgc gtggtcaatg acgaggtgaa cctcgtccgc catgccgatc tgaagctcgg
     1501 cgtggcgccg gtcatcaacc ttgaatggat ccagaagatc caccgcgacc gggcgcagcg
     1561 cggctatacg accgaagccg tcaccgacgt gatcctgcgc cggatgtatg cctacgtcca
     1621 ctgtatcgtc ccgcaattct ccgagacgga catcaacttc cagcgcgtgc cggtggtgga
     1681 cacctcgaac ccgttcatcg cgcgctggat ccccacgccg gacgagagcc tgatcgtgat
     1741 ccggttcaag aacccgcgcg ggatcgactg cccctatctc acctcgatga tcgcgggctc
     1801 gtggatgagc cgggcgaatt ccatcgtggt gccgggcaac aagcaggatc tggcgatgca
     1861 gctgatcctg acgccgctca tcgagcggat ggtgcgcgag gcgcgccgcg cgcgggcctg
     1921 aggcgcgaca gacagacgga gagggggccg cggatcgagc cgcggcccgg caaccgggac
     1981 gcacagcgaa ggatgagagc catgaaggac attggagccg cgcaggagac gcggatggcg
     2041 aacgcatccg ggccctcgcg atggatgccg tgaagaaggc caagtcgggc catcccggg
//
LOCUS       YSCACH1A     2466 bp ss-mRNA            PLN       24-JUL-1990
DEFINITION  S.cerevisiae acetyl-CoA hydrolase (ACH1) mRNA, complete cds.
ACCESSION   M31036
KEYWORDS    acetyl-CoA hydrolase.
SOURCE      S.cerevisiae (strain T3A), cDNA to mRNA, clones pBA[4,5].
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2466)
  AUTHORS   Lee,F.-J.S., Lin,L.-W. and Smith,J.A.
  TITLE     A glucose-repressible gene encodes acetyl-CoA hydrolase from
            Saccharomyces cerevisiae
  JOURNAL   J. Biol. Chem. 265, 7413-7418 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.-W.Lin, 27-DEC-1989.
FEATURES       from  to/span     description
    pept        614     2194     acetyl-CoA hydrolase (ACH1) (EC 3.1.2.1)
BASE COUNT      730 a    529 c    558 g    649 t
ORIGIN      Chromosome II.
        1 gttagccatc attatcatta aaatatcaac ccgaagaaca atgtatacat atacatatac
       61 gtacacatat acatatgtac atatgacata cgtattagcc gctgaggacg cggacgtata
      121 aaaggacaat acttatatgg agctaagggg agcagttacg caactccgtg atcgcgcgcc
      181 acgggccgtc ggcggctgtt aattgaagaa aaaaaaaatg aagaaccaca aggggtgatc
      241 catataggtg actagcatca tcccctgcga cgcgcggccc gccgggcaaa ggcgggcaat
      301 gcgcgctgct gattggcctc gaggacaacg ccctcaacca catccgcaac agccaatccc
      361 atcggagcgt caaactacca aagtagtgat tgtatggatc accactgtat tgtggacggt
      421 aagcgcttgc tggagcaaat gtgtaatcaa gttgctgtgt atatatagac gttagatgtg
      481 ttctacccct tcttttgtct tgtgcccacc gggcttacat tagcacacaa agcagcaaga
      541 gaccgtctta ctagacaata gcggcaaaac aaacaacaca tttctttttt tctttttcac
      601 atattgcact aaaatgacaa tttctaattt gttaaagcag agagttaggt atgctcccta
      661 tctgaaaaaa gttaaggaag ctcacgagct tattccattg ttcaagaatg gtcagtacct
      721 tgggtggtcc ggttttacag gagtgggtac tcccaaggca gtgccggagg cactgataga
      781 tcacgtggag aagaacaatt tacaagggaa gttgagattc aacctttttg ttggagcttc
      841 tgctggtcca gaggaaaacc gttgggctga acacgacatg atcattaaga gagcccctca
      901 tcaagtaggg aaacccattg caaaggcaat taaccagggt agaattgagt tctttgataa
      961 acatctgtcc atgttccctc aggatctgac atacgggttc tacaccaggg aaagaaaaga
     1021 caacaaaatc cttgattata ctataatcga ggcaacggcc attaaagagg acgggtctat
     1081 cgtcccaggt ccctctgtcg gtggttctcc agaattcatt acagtcagtg ataaagtgat
     1141 tattgaggtt aacacggcta cgccttcgtt cgagggtatt cacgatatag acatgcccgt
     1201 gaacccacct ttcaggaaac catacccata tctgaaagtg gacgacaagt gtggtgttga
     1261 ctccatcccg gttgatcctg aaaaggttgt tgcgattgtg gagtccacca tgagggacca
     1321 ggtcccacca aatacgccct ctgacgacat gtccagggct attgcaggtc atttggtcga
     1381 gtttttcaga aacgaggtaa aacatggtag gctacctgaa aacctgctgc ctttacaaag
     1441 tggtataggt aacattgcta acgctgtcat tgaagggctt gctggcgccc aattcaagca
     1501 cttgactgta tggacggaag tgctgcagga ctcgttattg gatcttttcg agaacggatc
     1561 tttggactac tccactgcta cttccgtgag attgactgaa aagggtttcg acagagcctt
     1621 tgcaaactgg gaaaatttca aacacagatt gtgtttgaga tctcaagttg tctcgaacaa
     1681 tccggaaatg atccgtagat tccctgtcat cgccatgaat accccagtag aagttgacat
     1741 ttacgcgcac gccaattcta caaatgtgaa tggttcccgt atgttgaacg ggttgggtgg
     1801 atctgctgat ttcttgagaa atgcaaagtt gtccatcatg catgccccct ctgcaagacc
     1861 aactaaagta gaccctaccg gtatctctac cattgttcct atggcctctc atgtagatca
     1921 aactgagcat gacctggaca tcttggtcac tgaccaaggt ttggcggatc taagaggtct
     1981 atcgcctaag gaaagagccc gtgaaatcat caacaagtgt gctcatcccg attatcaagc
     2041 tttgttgacc gattacttgg acagagcaga gcattacgct aaaaagcaca attgcttgca
     2101 tgaaccacac atgctaaaga atgctttcaa gttccacacc aacttagctg aaaagggtac
     2161 aatgaaggtc gacagctggg aaccagttga ctagtgtttg tgcgcaaacc gagagatgag
     2221 tatttaacaa aaaaaagaaa ggaaatgata tgattatgat tttatgttta taaagctttt
     2281 atccaatgcg ttgttttttc ttgcatattt ataccttttg cgctcatgga gggagttaat
     2341 caatacgcat gacgtctagt taattcacag gtagtactgt atatttatat gtttacacaa
     2401 taattatgta ttaagtagtg attagtaaaa aaaactaaga ggttgaaagt catcaaccct
     2461 tatatt
//
LOCUS       YSCYEF3B     3969 bp ds-DNA             PLN       24-JUL-1990
DEFINITION  S.cerevisiae elongation factor 3 (EF-3) gene, complete cds.
ACCESSION   J05583
KEYWORDS    elongation factor.
SOURCE      S.cerevisiae DNA, clone YEpEF3.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3969)
  AUTHORS   Sandbaken,M.G., Lupisella,J.A., DiDomenico,B. and Chakraburtty,K.
  TITLE     Protein synthesis in yeast: Structural and functional analysis of
            the gene encoding elongation factor three (EF-3)
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Chakraburtty, 15-JUN-1990.
FEATURES       from  to/span     description
    pept        489     3623     elongation factor 3
    signal      202      212     regulatory UAS-RPG
    binding    1875     1898     nucleotide binding consensus GX4GK(S/T)
    binding    2589     2612     nucleotide binding consensus GX4GK(S/T)
    binding     463      470     nucleotide binding consensus GX4GK(S/T)
    binding     701      708     nucleotide binding consensus GX4GK(S/T)
    binding    2117     2213     nucleotide binding consensus
    binding    3176     3272     nucleotide binding consensus
    binding     544      575     nucleotide binding consensus
    binding     897      928     nucleotide binding consensus
BASE COUNT     1203 a    758 c    809 g   1199 t
ORIGIN      
        1 caacttacaa tcattgttcg ccccttccat acttactgcc actcgcaaaa gggcccaacc
       61 agggcaatta cgtatcaaaa aatcatgaca ggctgggtaa taaatattcg tgaagaaaga
      121 agaaattaaa aaaagaaacg aagaagcaaa aaaaagaaaa gactccgttt aatcactttc
      181 aaccgcggtt tatccggccc cacccatgca taaccctaaa ttattagatc acttagcacg
      241 tgaaaaagaa acgtttttaa tgtttttttt ttttttcttt ttcttttttt gcgttggtga
      301 aaattttttc gcttcctcga gtataattat ctcatctcat ctttcatata agataagaag
      361 ttttataaaa accttttgca tcaaaatttt gttgaataac tctttttctt acgctctctt
      421 tctttcctta attgttttct aaagaaccgt gtatttttct agttcgaatc catcgataac
      481 attaaaagat gtctgattcc cagcaatcca ttaaggttct agaagaacta ttccagaagt
      541 tatctgttgc cactgctgac aacagacacg aaatcgcttc tgaagtcgct tctttcttga
      601 atggtaacat cattgaacat gatgttccag aacacttctt cggtgaattg gccaagggta
      661 tcaaggacaa gaagaccgct gctaacgcca tgcaagctgt tgctcacatt gctaaccaat
      721 ctaacttgtc tccatctgtt gaaccataca tcgtccaatt ggttccagct atctgtacca
      781 acgcaggtaa caaggacaaa gaaattcaat ctgttgcttc cgaaactttg atttccatcg
      841 ttaacgctgt taacccagtt gccatcaaag ctttgttgcc acatttgact aacgctattg
      901 tagaaactaa caaatggcaa gaaaagattg ctattttggc agctttctct gctatggtcg
      961 atgctgctaa ggatcaagtt gccctaagaa tgccagaatt gattccagtc ttgtctgaaa
     1021 ccatgtggga caccaagaag gaagtcaagg ctgctgctac tgccgccatg accaaggcta
     1081 ccgaaactgt tgacaacaag gatattgaac gtttcattcc aagtttgatt caatgtattg
     1141 ctgacccaac tgaagttcca gaaaccgttc atttgctagg tgctactact ttcgttgctg
     1201 aagttactcc agctactttg tccatcatgg tcccattgtt gtccagaggt ttgaacgaaa
     1261 gagaaaccgg tatcaagcgt aagtctgctg ttattattga caacatgtgt aagttggtcg
     1321 aagacccaca agttattgct cctttcttgg gtaaattgtt gccaggtttg aagagtaact
     1381 ttgctaccat tgctgaccca gaagccagag aagttacttt gagagctttg aagactttga
     1441 gaagagttgg taacgttggt gaagacgatg ctattccaga actttctcac gctggtgacg
     1501 tttctactac tttgcaagtc gttaacgaat tgttgaagga cgaaaccgtt gctccaagat
     1561 ttaagattgt cgtcgagtac attgccgcca ttggtgctga tttgatcgat gaaagaatca
     1621 ttgaccaaca agcttggttc acccacatca ccccatacat gactatcttc ttgcacgaaa
     1681 agaaggccaa ggacatcttg gacgaattca gaaagagagc tgtcgacaac attccagttg
     1741 gtccaaactt cgacgacgaa gaagacgaag gtgaagactt atgtaactgt gaattttctt
     1801 tggcttatgg tgctaaaatc ttgttgaaca agacccaatt aagattgaag agagccagaa
     1861 gatatggtat ctgtggtcca aacggttgtg gtaagtccac tttaatgaga gctattgcca
     1921 acggtcaagt tgatggtttc ccaacccaag aagaatgtag aaccgtctac gtcgaacacg
     1981 acattgatgg tactcactct gacacttccg tcttggattt cgttttcgaa tctggtgttg
     2041 gtactaaaga agctatcaag gacaaattga ttgaattcgg tttcaccgat gaaatgattg
     2101 ctatgccaat ctctgcttta tctggtggtt ggaagatgaa gttggctcta gctagagctg
     2161 tgttgagaaa tgctgatatc ttgttgttag atgaaccaac taaccatttg gataccgtca
     2221 acgttgcttg gttagttaac tacttgaaca cctgtggtat cacttctatc actatttctc
     2281 acgactccgt tttcttagat aacgtctgtg aatatattat taactacgaa ggtttgaagt
     2341 tgagaaagta caagggtaac tttaccgaat tcgttaagaa gtgtccagct gctaaggctt
     2401 acgaagaatt atccaacact gatttggaat tcaagttccc agaaccaggt tacttggaag
     2461 gtgttaagac taagcaaaag gctattgtca aggttaccaa catggaattc caatatccag
     2521 gtacctctaa gccacaaatc actgacatta acttccaatg ttctttgtct tccagaattg
     2581 ctgtcattgg tccaaatggt gctggtaagt ctactttgat taacgtcttg actggtgaac
     2641 tattaccaac ctctggtgaa gtctacaccc acgaaaattg tcgtatcgct tacattaagc
     2701 aacacgcttt tgctcatatc gaatctcatt tggacaagac tccatctgaa tatatccaat
     2761 ggagattcca aaccggtgaa gatagagaaa ccatggacag agctaacaga caaatcaacg
     2821 aaaacgatgc tgaagctatg aacaagatct tcaagattga aggtacccct agaagaattg
     2881 ccggtatcca ctccagaaga aagttcaaga acacttacga atatgaatgt tctttcttat
     2941 tgggtgaaaa cattggtatg aaatctgaaa gatgggttcc aatgatgtcc gtcgacaacg
     3001 cttggattcc aagaggtgaa ttggttgaat ctcactctaa gatggttgct gaagttgata
     3061 tgaaggaagc tttggcttct ggtcaattcc gtccattaac cagaaaagaa attgaagaac
     3121 attgttccat gttgggtttg gacccagaaa ttgtttctca ctccagaatt agaggtttgt
     3181 ctggtggtca aaaggttaag ttggtcttag ctgccggtac atggcaaaga cctcacttga
     3241 ttgtcttaga tgaacctacc aactatctgg acagagattc tttgggtgct ttgtctaagg
     3301 ctttgaagga atttgaaggt ggtgttatta tcattactca ctctgctgaa ttcacaaaga
     3361 acttgactga agaagtctgg gccgtcaagg acggtagaat gactccatct ggtcacaact
     3421 gggttagtgg tcaaggtgct ggtccaagaa tcgaaaagaa ggaagacgaa gaagataaat
     3481 tcgatgctat gggtaacaag attgccggtg gtaagaagaa gaagaagttg tcttctgcgg
     3541 aattgagaaa gaagaagaag gaaagaatga agaagaagaa ggaattgggt gatgcttacg
     3601 tttcttctga cgaagaattc taatcttttt gatcactgct ttcacagttt tctttaagat
     3661 ttttattgat caataattta tgtatatttt aatttctatg tttttgtaat attgtttatt
     3721 ttggtaaaat atagacgcaa cttccttatt ataaagaaag gcattattta aaagaaaaag
     3781 cgttccatta gtcagacatc tttttttttt tcatacattc ttaagcctca ggcaaattga
     3841 gcattgcctc ataccctttt cggtaagagg gtaacgaaaa tatttttttg gaagaataaa
     3901 aataggtgac ggatcataga ctaggaagct ttaaaacatg attgagcgta atattatatt
     3961 ccttctaga
//
LOCUS       HUMKSAA      1504 bp ss-mRNA            PRI       24-JUL-1990
DEFINITION  Human adenocarcinoma-associated antigen (KSA) mRNA, complete cds.
ACCESSION   M32325
KEYWORDS    adenocarcinoma-associated antigen.
SOURCE      Human cell line UCLA-P3, cDNA to mRNA, clone AG[1,1338,933].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1504)
  AUTHORS   Strnad,J., Hamilton,A.E., Beavers,L.S., Gamboa,G.C., Apelgren,L.D.,
            Taber,L.D., Sportsman,J.R., Bumol,T.F., Sharp,J.D. and Gadski,R.A.
  TITLE     Molecular cloning and characterization of a human
            adenocarcinoma/epithelial cell surface antigen complementary DNA
  JOURNAL   Cancer Res. 49, 314-317 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.D. Sharp, 22-EB-1990.
FEATURES       from  to/span     description
    pept        155     1099     adenocarcinoma-associated antigen precursor
                                 (KSA)
    sigp        155      223     adenocarcinoma-associated antigen signal
                                 peptide
    matp        398     1096     adenocarcinoma-associated antigen
    mRNA    <     1     1504     adenocarcinoma-associated antigen mRNA
    site       1486     1491     polyadenylation site
BASE COUNT      442 a    302 c    356 g    404 t
ORIGIN      
        1 gagcgagcac cttcgacgcg gtccggggac cccctcgtcg ctgtcctccc gacgcggacc
       61 cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt gtcccactcc cggcgcacgc
      121 cctcccgcgc ccctcttctc ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt
      181 cgggcttctg cttgccgcgg cgacggcgac ttttgccgca gctcaggaag aatgtgtctg
      241 tgaaaactac aagctggccg taaactgctt tgtgaataat aatcgtcaat gccagtgtac
      301 ttcagttggt gcacaaaata ctgtcatttg ctcaaagctg gctgccaaat gtttggtgat
      361 gaaggcagaa atgaatggct caaaacttgg gagaagagca aaacctgaag gggccctcca
      421 gaacaatgat gggctttatg atcctgactg cgatgagagc gggctcttta aggccaagca
      481 gtgcaacggc acctccacgt gctggtgtgt gaacactgct ggggtcagaa gaacagacaa
      541 ggacactgaa ataacctgct ctgagcgagt gagaacctac tggatcatca ttgaactaaa
      601 acacaaagca agagaaaaac cttatgatag taaaagtttg cggactgcac ttcagaagga
      661 gatcacaacg cgttatcaac tggatccaaa atttatcacg agtattttgt atgagaataa
      721 tgttatcact attgatctgg ttcaaaattc ttctcaaaaa actcagaatg atgtggacat
      781 agctgatgtg gcttattatt ttgaaaaaga tgttaaaggt gaatccttgt ttcattctaa
      841 gaaaatggac ctgacagtaa atggggaaca actggatctg gatcctggtc aaactttaat
      901 ttattatgtt gatgaaaaag cacctgaatt ctcaatgcag ggtctaaaag ctggtgttat
      961 tgctgttatt gtggttgtgg tgatggcagt tgttgctgga attgttgtgc tggttatttc
     1021 cagaaagaag agaatggcaa agtatgagaa ggctgagata aaggagatgg gtgagatgca
     1081 tagggaactc aatgcataac tatataattt gaagattata gaagaaggga aatagcaaat
     1141 ggacacaaat tacaaatgtg tgtgcgtggg acgaagacat ctttgaaggt catgagtttg
     1201 ttagtttaac atcatatatt tgtaatagtg aaacctgtac tcaaaatata agcagcttga
     1261 aactggcttt accaatcttg aaatttgacc acaagtgtct tatatatgca gatctaatgt
     1321 aaaatccaga acttggactc catcgttaaa attatttatg tgtaacattc aaatgtgtgc
     1381 attaaatatg cttccacagt aaaatctgaa aaactgattt gtgattgaaa gctgcctttc
     1441 tatttacttg agtcttgtac atacatactt ttttatgagc tatgaaataa aacattttaa
     1501 actg
//
LOCUS       HAMHITAO1    1581 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Hamster G-o protein alpha subunit 1 (GNAO I) mRNA, complete cds.
ACCESSION   M33661 J05476
KEYWORDS    G-o protein alpha subunit.
SOURCE      Hamster insulinoma cell line HIT T-125 M2.2.2, cDNA to mRNA.
  ORGANISM  Cricetulus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 1581)
  AUTHORS   Hsu,W.H., Rudolph,U., Sanford,J., Bertrand,P., Olate,J., Nelson,C.,
            Moss,L.G., Boyd,A.E.III., Codina,J. and Birnbaumer,L.
  TITLE     Molecular cloning of a novel splice variant of the alpha subunit of
            the mammalian G-o protein
  JOURNAL   J. Biol. Chem. 265, 11220-11226 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Birnbaumer, 10-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        458     1522     G-o protein alpha subunit 1
BASE COUNT      357 a    472 c    405 g    347 t
ORIGIN      
        1 gcctccgcct cctcctcctc cgcctccggc agccgcggca gaaggaccca ccctgccccc
       61 caccccaccc tccgccggct ccggctgcgg atccagcctc tactctcatt ttatttattt
      121 tgggtcgtgc actagcctca gtgcctgcag cccgcgcctc ccgggcccgc gggcacctcc
      181 tccctcggct ccggaggccc agaccccggc caccctcacc tcgacaccct cagaccccgg
      241 ccagtcgccg ctagtctccg ccgctggaat cttgatagag gctgtcattt tttgggggtt
      301 ctggcctttt cgacaatttt gttcccagcc aaggagagga tatcgtgatt ttctcccctt
      361 tgagcccagg ctctgctctg tgggggggtg gggggcgcgc cgacccgggg agtcgtgcca
      421 gccgagtcgt gcgggctgtg gcagggaagg ggccaccatg ggatgtactc tgagcgcaga
      481 ggagagagcc gccctcgagc ggagcaaggc gattgagaaa aacctcaaag aagatggcat
      541 cagcgccgcc aaagatgtga aattactcct gctgggggct ggagaatcag gaaaaagcac
      601 cattgtgaag cagatgaaga tcatccatga agatggcttc tctggagagg acgtaaagca
      661 gtacaagcct gttgtctata gcaacaccat ccagtctctg gcagccatcg tccgggccat
      721 ggatactttg ggtgtggagt atggtgacaa ggagagaaag gcggactcca agatggtgtg
      781 tgatgtggtg agccgtatgg aagacactga acccttctct gcagagctgc tttctgccat
      841 gatgcgactc tggggtgact cagggatcca ggagtgcttc aaccgatctc gggagtatca
      901 gctcaatgat tctgccaaat actacctgga cagcctggat cggattggag ccgctgacta
      961 ccagcccact gagcaggaca tcctccgaac cagggtcaaa accaccggca tcgtagaaac
     1021 ccacttcaca ttcaagaacc tccacttcag gctgtttgat gttgggggcc agcgatctga
     1081 acgcaagaag tggatccact gctttgagga tgtcacggcc atcatcttct gtgtcgccct
     1141 cagcggctat gaccaggtgc tccacgagga cgaaaccacg aaccgcatgc acgagtctct
     1201 catgctcttc gactccatct gtaacaacaa gtttttcatc gatacctcca tcattctctt
     1261 cctcaacaag aaagacctct ttggcgagaa gattaagaag tcacccttga ccatctgctt
     1321 ccctgaatac ccaggctcca acacctatga agatgcagct gcctacatcc aaacacagtt
     1381 tgaaagcaaa aaccgctcac ctaacaaaga aatctattgt cacatgactt gtgccacaga
     1441 cacgaataat atccaggtgg tattcgacgc tgtcaccgac atcatcattg ccaacaatct
     1501 ccggggctgc ggcttgtact gacctcttgt cctgtatagc aacctatttg actgcttcat
     1561 ggactctttg ctgttgatgt t
//
LOCUS       HAMHITAO2    1678 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Hamster G-o protein alpha subunit 2 mRNA, complete cds.
ACCESSION   M33662 J05476
KEYWORDS    G-o protein alpha subunit.
SOURCE      Hamster insulinoma cell line HIT T-125 M2.2.2, cDNA to mRNA.
  ORGANISM  Cricetulus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 1678)
  AUTHORS   Hsu,W.H., Rudolph,U., Sanford,J., Bertrand,P., Olate,J., Nelson,C.,
            Moss,L.G., Boyd,A.E.III., Codina,J. and Birnbaumer,L.
  TITLE     Molecular cloning of a novel splice variant of the alpha subunit of
            the mammalian G-o protein
  JOURNAL   J. Biol. Chem. 265, 11220-11226 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Birnbaumer, 10-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        458     1522     G-o protein alpha subunit 2
BASE COUNT      374 a    514 c    448 g    342 t
ORIGIN      
        1 gcctccgcct cctcctcctc cgcctccggc agccgcggca gaaggaccca ccctgccccc
       61 caccccaccc tccgccggct ccggctgcgg atccagcctc tactctcatt ttatttattt
      121 tgggtcgtgc actagcctca gtgcctgcag cccgcgcctc ccgggcccgc gggcacctcc
      181 tccctcggct ccggaggccc agaccccggc caccctcacc tcgacaccct cagaccccgg
      241 ccagtcgccg ctagtctccg ccgctggaat cttgatagag gctgtcattt tttgggggtt
      301 ctggcctttt cgacaatttt gttcccagcc aaggagagga tatcgtgatt ttctcccctt
      361 tgagcccagg ctctgctctg tgggggggtg gggggcgcgc cgacccgggg agtcgtgcca
      421 gccgagtcgt gcgggctgtg gcagggaagg ggccaccatg ggatgtactc tgagcgcaga
      481 ggagagagcc gccctcgagc ggagcaaggc gattgagaaa aacctcaaag aagatggcat
      541 cagcgccgcc aaagatgtga aattactcct gctgggggct ggagaatcag gaaaaagcac
      601 cattgtgaag cagatgaaga tcatccatga agatggcttc tctggagagg acgtaaagca
      661 gtacaagcct gttgtctata gcaacaccat ccagtctctg gcagccatcg tccgggccat
      721 ggatactttg ggtgtggagt atggtgacaa ggagagaaag gcggactcca agatggtgtg
      781 tgatgtggtg agccgtatgg aagacactga acccttctct gcagagctgc tttctgccat
      841 gatgcgactc tggggtgact cagggatcca ggagtgcttc aaccgatctc gggagtatca
      901 gctcaatgat tctgccaaat actacctgga cagcctggat cggattggag ccgctgacta
      961 ccagcccact gagcaggaca tcctccgaac cagggtcaaa accaccggca tcgtagaaac
     1021 ccacttcaca ttcaagaacc tccacttcag gctgtttgat gttgggggcc agcgatctga
     1081 acgcaagaag tggatccact gctttgagga tgtcacggcc atcatcttct gtgtcgccct
     1141 cagcggctat gaccaggtgc tccacgagga cgaaaccacg aaccgaatgc atgagtccct
     1201 gaagcttttt gacagcatct gcaacaacaa gtggttcaca gacacgtcta tcatcctgtt
     1261 tctcaacaag aaggacatat ttgaggagaa gatcaccagg tccccactga ccatctgctt
     1321 tcctgaatac acaggcccca gtgccttcac agaagctgtg gctcacatcc aggggcagta
     1381 tgagagtaag aacaaatcag ctcacaagga aatctacacc catttcacct gtgccaccga
     1441 caccaacaac atccagtttg tctttgatgc tgtgacggac gtcatcatcg ccaaaaatct
     1501 gcggggctgt ggactctact gagccctggc ctcctgccca gcctgctgct cacttctccc
     1561 ctggacccag agctctgcta ccactcgatg ctctgtgcac tgagggaaaa cccagaggct
     1621 ggccctgggg gaggaggaag caccctctga gcatccccac cctaccccac tccagagg
//
LOCUS       HUMPEPD1      610 bp ds-DNA             PRI       24-JUL-1990
DEFINITION  Human peptidase D gene, exon 1.
ACCESSION   M36549 J05513
KEYWORDS    peptidase D; prolidase.
SEGMENT     1 of 2
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 610)
  AUTHORS   Tanoue,A., Endo,F. and Matsuda,I.
  TITLE     Structural organization of the gene for human prolidase (peptidase
            D) and demonstration of a partial gene deletion in a patient with
            prolidase deficiency
  JOURNAL   J. Biol. Chem. 265, 11306-11311 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            F.Endo, 23-MAY-1990.
FEATURES       from  to/span     description
    pept        372  /   388     peptidase D, exon 1 (EC 3.4.13.9)
                                 /hgml_locus_uid="LF0121K" /nomgen="PEPD"
                                 /map="19q12-q13.2"
    pre-msg     241  >   610     PEPD mRNA and introns
    IVS         389  >   610     PEPD intron A
BASE COUNT       96 a    225 c    185 g    104 t
ORIGIN      
        1 gtggtgtacg tgtacgtaag atcggggact gaacggagaa tcaagatgtc ctctgcttca
       61 ggacctccct cccaccacga aaaggctggc agatgcagat gctgcccgct aacccgacct
      121 actgtctgca gccggcagcc cgctcattta tcctggccga gctcttgggc ccccaatctc
      181 caggagcccc gccccatcgg ctccgccctt ccccaggtac cgccccttcc cgcccgtgga
      241 atttggctct ggaggaggtg ggtcaaggac tccatgcccc gccccctgcc ccgcccttac
      301 cccgccccat gcattggcac cggaggggct catggacgcg cgcacttcag tgacgccggt
      361 gccgggcgaa catggcggcg gccaccgggt gagtgctgcg cctccctcgc tctttcccgc
      421 tggcgtggag aggggcatgg gtgtcccggc cggaccgcgg gtgagagacc gtcaccgggt
      481 cgcgtgaatg ccgcgggtag gcgcctcttc ccgttggccc ggaagtccca gcgccggtca
      541 ggcgagcctg cacgcgtccg agtcaaaggt caacccgagc aggacccgac tacctctcct
      601 aacctctcca
//
LOCUS       HUMPEPD2      530 bp ds-DNA             PRI       24-JUL-1990
DEFINITION  Human peptidase D gene, 3' end.
ACCESSION   M35496 J05513
KEYWORDS    peptidase D; prolidase.
SEGMENT     2 of 2
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 530)
  AUTHORS   Tanoue,A., Endo,F. and Matsuda,I.
  TITLE     Structural organization of the gene for human prolidase (peptidase
            D) and demonstration of a partial gene deletion in a patient with
            prolidase deficiency
  JOURNAL   J. Biol. Chem. 265, 11306-11311 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            F.Endo, 23-MAY-1990.
FEATURES       from  to/span     description
    pept    /     1        3     peptidase D, exon 15 (EC 3.4.13.9)
                                 /hgml_locus_uid="LF0121K" /nomgen="PEPD"
                                 /map="19q12-q13.2"
    pre-msg <     1      394     PEPD mRNA and introns
    signal      373      378     poly-A signal
BASE COUNT      133 a    132 c    114 g    151 t
ORIGIN      About 110 kb after segment 1.
        1 tagagccagc cagaaatccc agcgcacctg ggggcctggc cttgcaacct cttttcgtga
       61 tgggcagcct gctggtcagc actccagtag cgagagacgg cacccagaat cagatcccag
      121 cttcggcatt tgatcagacc aaacagtgct gtttcccggg gaggaaacac ttttttaatt
      181 acccttttgc aggcaccacc tttaatctgt tttatacctt gcttattaaa tgagcgactt
      241 aaaatgattg aaaataatgc tgttctttag tagcaactaa aatgtgtctt gctgtcattt
      301 atattccttt tcccaggaaa gaagcatttc tgatactttc tgtcaaaaat caatatgcag
      361 aatggcattt gcaataaaag gtttcctaaa atggtccttt gtgcgtcggc ttcccctttc
      421 attcctttgc agatgagcat ttgggacccc gtccgacccc tgaaaagatc ggctcacctg
      481 gtgtcagtat cagtcattcc aggaagatgg ctcgtggcag cacaggccct
//
LOCUS       SYNPMAMA     4354 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Artificial cloning vector pMAMPF.
ACCESSION   M33637
KEYWORDS    .
SOURCE      Synthetic DNA from E.coli cell line WK6-lambd-pMAMPF3, from pMA58.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 4354)
  AUTHORS   Szardenings,M. and Collins,J.
  TITLE     A plasmid optimized for protein design projects: pMAMPF
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Collins, 09-APR-1990.
FEATURES       from  to/span     description
    signal     4121     4183     OmpA signal sequence
    site       4181     4186     NaeI cloning site
    site       4184     4308     Stuffer (pBR325:5900 to 29)
BASE COUNT     1152 a   1054 c   1012 g   1136 t
ORIGIN      
        1 aattcacctc gaaagcaagc tgataaaccg atacaattaa aggctccttt tggagccttt
       61 ttttttggag attttcaacg tgaaaaaatt attattcgca attccaagct aattcacctc
      121 gaaagcaagc tgataaaccg atacaattaa aggctccttt tggagccttt ttttttggag
      181 attttcaacg tgaaaaaatt attattcgca attccaagct ctgcctcgcg cgtttcggtg
      241 atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag
      301 cggatgcaga tcacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc
      361 gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt
      421 cctttctcgc cacgttcgcc agctttcccc gtcaagctct aaatcggggg ctccctttag
      481 ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt
      541 cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt
      601 tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt
      661 cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt
      721 aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt tacaatttga tctgcgctcg
      781 gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca
      841 gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac
      901 cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac
      961 aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg
     1021 tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac
     1081 ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat
     1141 ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag
     1201 cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac
     1261 ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt
     1321 gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt
     1381 atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc
     1441 aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga
     1501 aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac
     1561 gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc
     1621 cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct
     1681 gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca
     1741 tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct
     1801 ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca
     1861 ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc
     1921 atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg
     1981 cgcaacgttg ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct
     2041 tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa
     2101 aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta
     2161 tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc
     2221 ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg
     2281 agttgctctt gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa
     2341 gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg
     2401 agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc
     2461 accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg
     2521 gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcagacag
     2581 ttttattgtt catgatgata tatttttatc ttgtgcaatg taacatcaga gattttgaga
     2641 cacaacgtgg ctttgttgaa taaatcgaac ttttgctgag ttgactcccc gcgcgcgatg
     2701 ggtcgaattt gctttcgaaa aaaaagcccg ctcattaggc gggctaaaaa aaagcccgct
     2761 cattaggcgg gctcgaattt ctgccattca tccgcttatt atcacttatt caggcgtagc
     2821 aaccaggcgt ttaagggcac caataactgc cttaaaaaaa ttacgccccg ccctgccact
     2881 catcgcagta ctgttgtaat tcattaagca ttctgccgac atggaagcca tcacagacgg
     2941 catgatgaac ctgaatcgcc agcggcatca gcaccttgtc gccttgcgta taatatttgc
     3001 ccatagtgaa aacgggggcg aagaagttgt ccatattcgc cacgtttaaa tcaaaactgg
     3061 tgaaactcac ccagggattg gctgagacga aaaacatatt ctcaataaac cctttaggga
     3121 aataggccag gttttcaccg taacacgcca catcttgcga atatatgtgt agaaactgcc
     3181 ggaaatcgtc gtggtattca ctccagagcg atgaaaacgt ttcagtttgc tcatggaaaa
     3241 cggtgtaaca agggtgaaca ctatcccata tcaccagctc accgtctttc attgccatac
     3301 gaaattccgg atgagcattc atcaggcggg caagaatgtg aataaaggcc ggataaaact
     3361 tgtgcttatt tttctttacg gtctttaaaa aggccgtaat atccagctga acggtctggt
     3421 tataggtaca ttgagcaact gactgaaatg cctcaaaatg ttctttacga tgccattggg
     3481 atatatcaac ggtggtatat ccagtgattt ttttctccat tttagcttcc ttagctcctg
     3541 aaaatctcga taactcaaaa aatacgcccg gtagtgatct tatttcatta tggtgaaagt
     3601 tggaacctct tacgtgccga tcaacgtctc attttcgcca aaagttggcc cagggcttcc
     3661 cggtatcaac agggacacca ggatttattt attctgcgaa gtgatcttcc gtcacaggta
     3721 tttattcgaa gacgaaaggg catcgcgcgc ggggaattgg ccacgatgcg tccggcgtag
     3781 aggatctctc acctaccaaa caatgccccc ctgcaaaaaa taaattcata taaaaaacat
     3841 atagataacc atctgcggtg ataaattatc tctggcggtg ttgacataaa taccactggc
     3901 ggtgatactg agcacatcag caggacgcac tgaccaccat gaaggtgacg ctcttaaaat
     3961 taagccctga agaagggcag cattcaaagc agaaggcttt ggggtgtgtg atacgaaacg
     4021 aagcattgga attctacaac ttgcttggat tcctacaaag aagcagcaat tttcagtgtc
     4081 agaagtcgac caaggaggtc tagataacga ggcgcaaaaa atgaaaaaga cagctatcgc
     4141 gattgcagtg gcactggctg gtttcgctac cgtagcgcag gccggcgata atggcctgct
     4201 tctcgccgaa acgtttggtg gcgggaccag tgacgaaggc ttgagcgagg gcgtgcaaga
     4261 ttccgaatac cgcaagcgac aggccatgtt tgacagctta tcatcgataa gcttggatcc
     4321 gtcgacctgc agccaagctt ggtctagagg tcga
//
LOCUS       DSBRR1610    1376 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.autotrophicum 16S ribosomal rRNA.
ACCESSION   M34409
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.autotrophicum (strain DSM 3382) rRNA.
  ORGANISM  Desulfobacterium autotrophicum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 1376)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1376     16S rRNA
BASE COUNT      340 a    287 c    399 g    293 t     57 others
ORIGIN      
        1 aactggagag tttgattctg gctcagaatg aacgctggcg gcgtgcttaa cacatgcaag
       61 tcgaacgaga aagggattgc ttgcaatcct gagtagagtg gcgcacgggt gagtaacacg
      121 tggataatct acctttgaat ccgggataac tgttcgaaag aatcgctaat accggatgaa
      181 gtcagtnttt ccttggaagg attgatgaaa gacagcctct tcttgaaagc tnttgtttga
      241 agatgagtcc gcgcaccatt agttngttgg tggggtaacg gcctaccaag acnttnatgg
      301 ttagctggtc tgagaggatg atcagccacn ctggaactga cacacggtcc ngactcctnc
      361 gggaggcagc agtgaggaat tttgcgcaat gggngcnacc ctnacgcagc aacgccgcgt
      421 gagtgaagaa ggcctntggg tcgtaaagct ctgtcaatgg ggaagaagtt rcttcatatt
      481 aatrgtatgt ngtattgacg gtacccatnn nngaagcacc ggcnnactcc gtgccagcan
      541 nnncggtaat anggagngtg cnagcgtnat tcggaatnat tnggngtnaa gggcacgcag
      601 gcggtnttgc cagtcagatg tgaaagcccg gggctcaacc ctggaagtgc atttgaaact
      661 acaaggcttg agtacggtng aggaaagggg nattcctngt gtagaggtga aattcgtaga
      721 tatcaggagg aacaccggtg gcgaaggcgc cnntctggac csnnactgac gctnaggtnc
      781 naaggcgtgg gnagcgaaca ggattagata ccctggtagt ccacgcagta aacgttgtat
      841 actaggtgta gcgggntatt gacccntgct gtgccggagc taacgcatta agtataccgc
      901 ctgggaagta cggtcgcaag actaaaactc aaagaaattg acgggggccc gcacaagcgg
      961 tggagcatgt ggtttaattc ganncaacgc gcagaacctt acccagactt gacatcctga
     1021 gaatctgtta gagatattag agtgccttcg ggaactcaga gacaggtgct gcatggctgt
     1081 cgtcagctcg tgtcgtgaga tgttcggtta agtccggcaa cgagcgcaac ccctatcttc
     1141 agttgccagc acatcatggt gggaactctg aagagactgc cccggttaac ggggaggaag
     1201 gnggggatga cgtcaagtcc tcatggccct tatgtctggg gctacacacg tgntacaatg
     1261 gtatgtacna cgggcagcga acctgcnnag gtgngcgaat ccctnaaagc atatcccagt
     1321 ccggattgga gtctgcaact cgactccatg aagttggaat cgctagtaat cgtgga
//
LOCUS       DSBRR1677    1375 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.niacini 16S ribosomal rRNA.
ACCESSION   M34406
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.niacini (strain DSM 2650) rRNA.
  ORGANISM  Desulfobacterium niacini
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 1375)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1375     16S rRNA
BASE COUNT      292 a    252 c    340 g    256 t    235 others
ORIGIN      
        1 taactggaga gtttgattct ggctcagaat gaacgctggc ggcgtgctta acacatgcaa
       61 gtcgaacgag aaagtgcctg cttgcaggaa cgagtagagt ggcgcacggg tgagtaacac
      121 gtggataatc taccttcaaa tcgggaataa ccattcgaaa ggatggctaa tgccggataa
      181 agtcagatrt tctttggagt atttgatgaa agggtgcctc tgattgaaag catctgtttg
      241 aagatgagtc cgcgcaccat tagtttgttg gtggggtaac ggcctaccaa gactncgatg
      301 gtnagctngt ctgagaggat gatcagccac actggaactg acacacggtc crgrctccta
      361 cgggaggcag cagtgaggaa ttttgcgcaa tgggggcaac cctnacgcag caacgcngcg
      421 tgagtgaaga aggccnnngg gtcgtaaagc tntgtccatg gggaagaaat tatttagtgt
      481 tratrgcatn ntttattgac nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttcctgg tgtagaggtg aaattcgtag
      721 atatcaggag gaacaccggt ggcgaaggcg ccngtctgga ccgatactga cgctsaggtg
      781 cgaaggcgtg ggnagcgaac gggattagat rccccggtag tccacgcagt aaacgttgta
      841 tgctaggtgt agcgggtatt gcccntgctn cgccgcagtt aacacattaa gcataccgcc
      901 tnnnnngtac ggtcgcaaga ctaaaactca aagaaattga cgggggcccg cacaagcggt
      961 ggagcatgtg gtttrattcg acnnaacgcg aagaacctta cccagacttg acatcttgag
     1021 aatcctttcg aaagttagga gtgccttcgg nngctcaaag acagnngctg catggctgtc
     1081 gtcagctcgt gtcgtgagat gtttggttaa gtccagcaac gagcgcaacc cttgtcttca
     1141 gttgccagcg cgtaatggcg gaactctgaa gagactgccc cggtnaacgg ggaggaaggt
     1201 ggggatgacg tcaagtcctc atggcccnna tgtctgggct acrcrcgtgc tacaatggta
     1261 tgtacaacgg gcagcaangc agcgatgcca agcaaatccc ttaaagcata tcccrgtccg
     1321 gattggagtc tgcractcga ctccatgaag ttggaatcgc tagtaatcgt ggatc
//
LOCUS       DSBRR169     1383 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.vacuolatum 16S ribosomal rRNA.
ACCESSION   M34408
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.vacuolatum (strain DSM 3385) rRNA.
  ORGANISM  Desulfobacterium vacuolatum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 1383)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1383     16S rRNA
BASE COUNT      345 a    295 c    411 g    294 t     38 others
ORIGIN      
        1 ttctggctca gratnaacgc tggcggngtg crnaacacat gcaagtcgaa cgrgraagtt
       61 sctgcttgca ggaacgagga gagtggcgct cgggngagta actcgtggat aatctacctt
      121 caaatcggga ataaccattc gaaaggatgg ctaatgccgg ataaagtcag atrttctttg
      181 grrtntttga tgaaagggag cctctgnata taagctnttg tttgaagatg agtccgcgca
      241 ccattagttt gttggtgggg taacggccta ccaagactgt gatggtnagc tggtctgaga
      301 ggatgatcag ccacactgga actgacacac ggtccagact cctacgggag gcagcagtga
      361 ggaattttgc gcaatggggg aaaccctgac gcagcaacgc cgcgtgagtg aagaaggccc
      421 ttgggtcgta aagctctgtc acagggaaga aattattatg ctttaataga gtgtrgtatt
      481 gacggtaccc tgtgaagaag caccggctaa ctccgtgcca gcagccgcgg taatacgggg
      541 ggtgcaagcg ttattcggaa ttattnggcg taaagggcac gcaggcggtc ctgtcagtca
      601 gatgtgaaag ccggggctca accccggacg tgcatttgaa acagcaggnc ttgagtacgg
      661 nngaggaaag ggnnnttcct ggtgtagagg tgaaattcgt agatatcagg aggaacaccg
      721 gtggcgaagg cgccnntctr gaccgatact gacgctnagg tacnaaggcg tgggtagcga
      781 acgggattag ataccccggt agtccacgca gtaaacgttg tatgctaggt gtagcrgggt
      841 attgacccct gctgtgccgc agttaacaca ttaagcatac cgcctgggga gtacggtcgc
      901 aagactnaaa ctnaaagaaa ttgacggggg cccgcacaag cggtggagca tgtggtttaa
      961 ttcgacgnaa cgcgaagaac cttacccaga cttgacatct tgagaatcct ntagaaattt
     1021 aggagtgcct tcgggagctc aaagacaggt gctccatggc tgtcgtcagc tcgtgtcgtg
     1081 agatgtttgg ttaagtccag caacgagcgc aacccttgtc ttcagttgcc agcacataat
     1141 ggtgggaact ctgaagagac tgccccggtt aacggggagg aaggtggggn tgacgtcaag
     1201 tcctcatggc cctnatgtct ggggctacac acgtgctaca atggtatgta caacgggcrg
     1261 caaggcagcg atgccaagca aatcccttaa agcatatccc agtccggatt ggagtctgcg
     1321 actcgactcc atgaagttgg aatcgctagt aatcgtggat cagcatgccn cggtgaatac
     1381 gtt
//
LOCUS       DSFRR166     1372 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.multivorans 16S ribosomal rRNA.
ACCESSION   M34405
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.multivorans (ATCC 33890) rRNA.
  ORGANISM  Desulfococcus multivorans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1372)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1372     16S rRNA
BASE COUNT      329 a    295 c    429 g    291 t     28 others
ORIGIN      
        1 taattgacga gtttgatcct ggctcagaat gaacgctggc ggcgtgctta acacatgcaa
       61 gtcgcacgag aacgcttctg cttgcagggg ctagtaagtg gcgcacgggt gagtaacgcg
      121 tgggtaatct acccttggat tcgggataac ccttcgaaag gagtgctaat accggatgaa
      181 gtcatttagg ctgtggttta gatgatgaaa ggtggcctct ccgaggaagc taccgtttgg
      241 ggatgagccc gcgtaccatt agcttgttgg tggggtaacg gcctaccaag gcgacgatgg
      301 ttagctngtc tgagaggatg atcagccaca ctggaactga cacacggtcn ngnctcctac
      361 gggaggcagc agtgaggaat tttgcnnnnn gggggnaacc ntnacgcagc aacgccgcgt
      421 gagtgaagaa ggccttcggg tcgtaaagct ctgtcagtgg gaagaacctt cgtatgttga
      481 atatgcatac ggactgacgg taccacagaa ggaagcaccg gctaactccg tgccagcagc
      541 cgcggtaata cggagggtgc aagcgttatt cggaattatt gggcgtaaag agcgcgtagg
      601 cggctttgca agtcaggtgt gaaatccggg gctnaacccc ggaagagcat ttgatactgt
      661 ggagcttgag tatgggagag gnnagtggna ttcctggtgt agcggtgaaa tgcgtagata
      721 tcaggaggaa caccggtggc gaaggcggyt tcctggacca atactgacgc tgaggcgcga
      781 aggcgtgggg agcaaacagg attagatacc ctngtagtcc acgcagtaaa cgttgatcac
      841 taggtgtagc gggtattgac ccctgctgtg ccggagttaa cgcattaagt gatccgcctg
      901 gggagtacga tcgcaagatt aaaactcaaa ggaattgacg ggggcccgca caagcggtgg
      961 agcatgtggt ttaattcgan nnaacgcgaa gaaccttacc tggatttgac atccgcggrr
     1021 ttcttatgaa aatagggagt gccttcgggg agccgcgaga caggtgctgc atggctgtcg
     1081 tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ctrtcttcag
     1141 ttaccatcat taagttgggg actctgaaga tactgccccg gttaacggga ggaaggtggg
     1201 gatgacgtca agtcctcatg gcctttatgt ccagggctac acacgtgcta caatgggctg
     1261 tacnnagggn agcgatctcg cgagagcaag ccaatcccaa aaagcagtcc ttagttcggr
     1321 ttggagtctg caactcgact ccatgaagct ggaatcgcta gtaatcgcgg at
//
LOCUS       DSLRR165     1381 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.pigra 16S ribosomal rRNA.
ACCESSION   M34404
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.pigra (ATCC 29098) rRNA.
  ORGANISM  Desulfomonas pigra
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1381)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1381     16S rRNA
BASE COUNT      322 a    294 c    388 g    250 t    127 others
ORIGIN      
        1 gaactggaga gtttgattct ggctcagatt gaacgctggc ggcgtgctta acacatgcaa
       61 gtcgtacgcg aaangggact tcggtcccga gtaaagtggc gcgacgggtg agtaacgacg
      121 tggataatct gcctctatga tgggnataac rgttggaaac gactgctaat accgaatacg
      181 ctcatgatga ncttgtggaa aggtggcctn tgcttgcaag ctatcgcata gagatgagtc
      241 cgcgtcccat tagctagttg gtggggtaac ggcctaccaa ggcaacgatg ggtagccgat
      301 ctgagaggat gatcggccac actggaactg aaacacggtc cagactccta cgggaggcag
      361 cagtggggaa tattgcgcaa tgggcgaaag cctgacgcag cgacgccgcg tgagggatga
      421 aggtcttcgg atcgtaaacc tctgtcagaa gggaagaaac tagggtgctc taatcatcat
      481 cctaactgac ggtaccttca aaggaagcac cggctaactc cgtgccagca gccgcggtaa
      541 tacggagggt gcaagcgtta atcggaatca ctgggcgtaa agcgcacgta ggctnttatg
      601 taagtcaggg gtgaaagccc acggctnaac cgtggaactg cccttgatac tgcacggctn
      661 gaatnnnnnn nnnnggtncg gnattccagg tgtaggagtg aaatccgtag atatctggag
      721 gaacatcagt ggcgaaggcg gcnacctnga cctngaccgn nattgacgct naggtgcnna
      781 agcgtgggga gcaaacagga ttagataccc tngtngtcca cgccgtaaac gatggatgct
      841 agatgtcggg atgtannnct cggtgtcgta gtnaacgcgt naagcatccg gcctggggag
      901 tacggnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      961 nnnnnnnnnn nnnnnnnaac gcgaagaacc ttacctaggt ttgacatctg gggaaccctc
     1021 ccgaarrtga ggrrtrrcct tcggggagcc ccaagacagg tgctgcatgg ctgtcgtcag
     1081 ctnnnnnnnt gagatgttgg gttaagtccc gcaacgagcg caacccctat gcatagttgc
     1141 cagcaagtaa agttgggcac tctatgcaga ctgnncgggt tracagggga ggaaggtggg
     1201 gacgacgtca agtcatnatg gcccttacgc ctagggctac anacgtacta cnatggcacg
     1261 cacaaagggc agcgataccg tgagatggag ccaatcccaa aaaacgtgtc ccagtccgga
     1321 ttgcagtctg caactcgact gcatgaagtc ggaatcgcta gtaattcgag gtcagcatnc
     1381 c
//
LOCUS       DSORR1618    1402 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  Desulfotomaculum orientis 16S ribosomal rRNA.
ACCESSION   M34417
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.orientis (strain Singapore I, NCIMB 8382) rRNA.
  ORGANISM  Desulfotomaculum orientis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1402)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1402     16S rRNA
BASE COUNT      301 a    253 c    386 g    255 t    207 others
ORIGIN      
        1 ttttttggag agtttgrtcc tggctcagga ngaacgctgg cggcgtgcct aacacatgca
       61 agtcgaacgg agaattgaat aagcttgctt agacaattst tagtggcgga cgggtgagta
      121 acgcgtgggt aacctgccca taaakccggg acaacccttg gaaacgaggg ctaataccgg
      181 atatcttaga tyytggcatc aggayttaag gaaagatggc ctctgaacat gctatcgatt
      241 atggatggac ccgcgtctga ttagctagtt ggtggggtaa aggcctacca aggcgacgat
      301 cagtagccgg cctgagaggg tgaacggcca cactgggact gagacacggc cnngactcct
      361 acgggaggca gcagtgggga atcttccgca atggacgaaa gtctgacgga gcaacgcngc
      421 gtgtatgatg aaggtcttcg gattgtaaag tactgtcagt gtggggacgg tgctntttga
      481 aaatattgag gagacatgac ggtacccaag gaggaagccc cggctaannn nnnnnnnnnn
      541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncggaatta ctgggcgtta agcgcgcgtt
      601 ggcggatgct taagtcckgt gtgaaagatc agggctcaac cctnagagtg cattcggtaa
      661 tgggtattgn nnnnnnnnga gaggaaagtg gaattccacg tgtagcggtg aaatgcgtag
      721 atatgtggag gaacaccagc ggcgaaggcg actttctgga ctgtaactga cgctgaggcg
      781 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgag
      841 tgctaggtgt agagggtatc nncacnttct gtgccgcagt taacgcgata agcactccgc
      901 ctggggagta cggccgcaag gttnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      961 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1021 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngctg tgagacaggt ggtgcatggt
     1081 tgtcgtcagc ttgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccctgta
     1141 tttagttgct acgagtaagg ttgagcnctc tagatagact gccggtgaca aaccggagga
     1201 aggtggggat gacgtcaaat catnatgccc nttatgtcct gggctacaca cgtgctacaa
     1261 tggccggtac agacggaagc gaagccgcga ggtgaagcca atccgagaaa gccggtctcr
     1321 gttcggattg caggctgcaa ctcgcctgca tgaagtcgga atcgctagta atcgcaggtc
     1381 agcatactgc ggtgaatacg tt
//
LOCUS       DSORR1619    1368 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.ruminis 16S ribosomal rRNA.
ACCESSION   M34418
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.ruminis (strain DL NCIMB 8452) rRNA.
  ORGANISM  Desulfotomaculum ruminis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1368)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1368     16S rRNA
BASE COUNT      341 a    284 c    420 g    267 t     56 others
ORIGIN      
        1 ttatggagag tttrgtcctg gctcaggacg aacgctggcg gcgtgcttaa cacatgcaag
       61 tcgaacggag tttaaaatga aacctngtga tttttraact tagtggcgga cgggtgagta
      121 acgcgtggat aacctggcgg atagaccggg ataacagttg gaaacgactg ctaataccgg
      181 atacgctcct aggaacgcat gttctaagga ggaaagattt atcgctaacc gatggatccg
      241 cgtcccatta gctagttgga ggtgtaacgg acccctaagg cgacgatggg tagccggcct
      301 gagagggcga acggccacac tggaactgag acacggcctn gactcctacg ggaggcagca
      361 gtggggaata ttccgcaatg ggcgaaagcc tgagggagca acgcngcgtg agggaagaag
      421 gctttcgggt tgtaaacctc tgtcttaagg gaagaacaaa atgacggtac cttgagagga
      481 agccccggct annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnag cgttgtccgg
      541 aattactggg cgtaaagggc gcgtaggtgg tgcattaagt tagaggtgaa agtgccgggc
      601 nnaaccccgt gaggcctctg atactggtgg acttgagtgc aggagagggg agcagaattc
      661 ccagtgtagc ggtgaaatgc gtagatattg ggaggaatac cagtggcgaa ggcggcnntc
      721 tngactgtaa ctgacactga ggcgcgaaac cgtggggagc gaacaggatt agataccctg
      781 gtagtccacg ccgtaaacga tgagtgctag gtgttggggg tatcgaccac ccccagtgcc
      841 gtagttaaca caataagcac tccgcctggg gagtacggtc gcaagactga aactcaaagg
      901 aattgacggg ggcccgcaca agcggtggag tatgtggttt aattcgacgn aacgcgaaga
      961 accttaccag ggcttgacat ccgtctnaaa agtataggga tatactcttg ccttcggggn
     1021 nacagagaga caggtggtgc atngttgtcg tcagctcgtg tcgtgagatg ttgggttaag
     1081 tcccgcaacg agcgcaaccc ctracattcg ttgccagcga gtaatgtcgg gaactcgaat
     1141 gtgactgccg ttgacaaaac ggaggaaggt ggggatgacg tcaaatcatc atgcccctta
     1201 tgtcctgggc tacrcgcgta ctacnatggc cggtacagac ggaagcgaag ctgtgaagtg
     1261 aagcaaatct gagaaagccg gtcccagttc ggattgtagt ctgcaactcg actacatggg
     1321 gtcggaatcg ctagtaatcg caggtcagca tactgcggtn aatacgtt
//
LOCUS       DSPRR1613    1390 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.hydrogenophilus 16S ribosomal rRNA.
ACCESSION   M34412
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.hydrogenophilus (DSM 3380) rRNA.
  ORGANISM  Desulfobacter hydrogenophilus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 1390)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1390     16S rRNA
BASE COUNT      354 a    297 c    416 g    292 t     31 others
ORIGIN      
        1 actgagagtt tggacctggc tcagaatgaa cgctggcggc gtgcttaaca catgcaagtc
       61 gaacgagaaa argattgctt gcaatcctga gtagagtggc gcrcgggtga gtaacrcgta
      121 gataatctgc cttcaagcct ggaataacta ttcgaaaggg tagctaatac cggataaagt
      181 cgatttacac nagtrgattg atgaaagatt gcctcttctt gaaagcaatt gtttggggat
      241 gagtctgcgg accattagct tgttggtggg gtaaaggcct accaaggcag cgatggttag
      301 ctggtctgag aggatgatca gccacactgg aactggaaca cggtccagac tcctacggga
      361 ggcagcagtg aggaattttg cgcaatgggg gcaaccctga cgcagcaacg ccgcgtgagt
      421 gaagaaggcc tttgggtcgt aaagctctgt caactgggaa gaaattagct tctattaata
      481 gtagttgcta ttgacggtac cagtngacga agcgccggcn aactcngtgc cagcagcngc
      541 ggtaacacgg ggggcgcnag cgtnattcgg aattattggg cgtaaagggc gcgcaggcgg
      601 tnttgtccgt caggtgtgaa agctcggggc tcaaccccgg aagtgcactt gaaacagcaa
      661 gacttgaata cgggagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc
      721 aggaggaaca ccgatggcga aggcatctct ctrgaccgat attgacgctn aggcgcnaag
      781 gcgtgggtag cgaacgggat tagatacccc ggtagtccac gcagtaaacg ttgtacactc
      841 ggtgtngcgg atattnaaat ctgctgtgcc taagctaacg cattaagtgt accgcctggn
      901 ragtacggtc gcaagactaa aactcaaagg aattgacggg ggcccgcaca agcggtggag
      961 catgtggttt aattcgacgn aacgcgaaga accttacctg ggtttgacat cctgtgaata
     1021 ttgtgtaatt gcaatagtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtca
     1081 gctcgtntcg tgagatgttt ggttaagtcc agcaacgagc gcaaccctta tcgtcagttg
     1141 ccagcacntr atggtgggaa ctctggcgag actgccccrg nnnacgggra ggaaggtggg
     1201 gatgacgtca agtcctcatg gcccttatac ccagggctac acacgtgcta caatggtagg
     1261 tacaaagggc agcgactctg crragggaag cgaatcccaa aagcctatct cagtccggat
     1321 tggggtctgc aactcgaccc catgaagttg gaatcgctag taatcgcgga tcagcatgcc
     1381 gcggtnaata
//
LOCUS       DSPRR1614    1396 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.curvatus 16S ribosomal rRNA.
ACCESSION   M34413
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.curvatus (DSM 3379) rRNA.
  ORGANISM  Desulfobacter curvatus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 1396)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1396     16S rRNA
BASE COUNT      339 a    292 c    432 g    285 t     48 others
ORIGIN      
        1 actggagagt ttgatcctgg ctcagaatga acgctggcgg cgtgctnaac acatgcaagt
       61 cgaacgagaa agggattgct tgcaatcccg agtagagtgg cgcacgggtg agnnacacgt
      121 rgatratctg ccttcaagcc tgggataact rttcgaaagg gtagctaata ccggataaag
      181 tcggnncacn nnagtggatt gatgaaagat tgcctcttct tgaaagcaat tgtttgggga
      241 tgagtttgcg taccattagc ttgttggtgg ggtaacggcc taccaaggca acgatggtna
      301 gctggtctga gaggatgaca ggccacactg gaactgagac acggtcgact cgtgaacggg
      361 aggcagcagt gaggaatttt gcgcaatggg ggaaaccctg acgcagcgac gccgcgtgag
      421 tgaagaaggc ctttgggtcg taaagctctg tcgactggga agaagttcca atgttttaat
      481 agaaggtggt attgagggta ccaggggagg aacgaccggc nnactccgtg ccagcagcag
      541 cggtaacacn nngggggcna gcgttattcg gaanattggg cntaaagggc gcgcaggcgg
      601 tcttgtccgt caggtgtgaa agctcggggc tcaaccccgg aagtgcactt gaaacagcaa
      661 gacttgaata ccgtagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc
      721 aggaggaaca ccgatggcga aggcatctct ctggaccgat attgacgctn aggcgnnaag
      781 gcgtgggtag cgaacgggat tagatacccc ggtagtccac gcagtnaacg ttgtacnctc
      841 ggtgtngcgg atatnnaaat ctgctgtgcc naagctaacg cattaagtgt accgcctggg
      901 gagtacggtc gcaagactaa aactcnaagg aattgacggg ggcccgcaca gtcggtggag
      961 catgtggttt aattcgacgn aacgcgaaga accttacctg ggtttgacat cctgtgaata
     1021 yygrgtaatn gnrrtngtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtca
     1081 gctcgtgtcg tgagatgttt ggttaagtcc agcaacgagc gcaaccctta tcgttagttg
     1141 ccagcacgtt atgtgggaac tctaacgaga ctgcccgggt nnaccgggag gaaggcgggg
     1201 atgacgtcaa gtcctcatgg cccttatatc cagggctacg cgcgtgctac natggtaggt
     1261 acaaagggca gcgactttgc rraaagaagc gaatgcccaa aagcctatcg cngtccggat
     1321 tggggtctgc nactcgaccc catgaagttg gaatcgctag taatcgcgga tcngcatgcc
     1381 gcggtnaata tgttcc
//
LOCUS       DSPRR1615    1373 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  Desulfobacter latus 16S ribosomal rRNA.
ACCESSION   M34414
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.latus (DSM 3381) rRNA.
  ORGANISM  Desulfobacter latus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 1373)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1373     16S rRNA
BASE COUNT      322 a    268 c    393 g    267 t    123 others
ORIGIN      
        1 ctggagagtt tgatcctggc tcagaatgaa cgctggcggc gtgcttaaca catgcaagtc
       61 gaacgagaaa gggattgctt gcaatcctga gtagagtggc gcacgggtga gtaacacgta
      121 gataatctgc cttcaagcct gggataacta ttcgaaaggg tagctaatac cggataaagt
      181 cgattyacay nagtnnattg atgaaagatt gcctcttctt gaaagcaatt gtttggggat
      241 gagtttgcgt accattagct agttggtggg gtnaaggcct accaaggcnn cgatggnnag
      301 ctggtctgag aggatgatca gccacactgg aactggaaca cggtccngnc tcctacggga
      361 ggcagcagtg aggaattttg cgcratgggg gcaacccnna cgcagcaatg ccgcgtgagt
      421 gaagaaggcc tttgggtcgt aaagctctgt caacaaggaa gaaattagga attattaata
      481 gtngtttcta ttgacggtac ttgtngagga agcgccggct nactccgtgc cagcagccgc
      541 ggtaacacgg ggggcgcnag cgttattcgg aattattggg cgtaaagggc gcgcaggcgg
      601 tnttgtccgt naggtgtgaa agcccggggc tcaaccccgg aagagcactt gaaacagcaa
      661 gacttgaata cgggagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc
      721 aggaggaaca ccgatggcga aggcatctct ctngaccgrt attgacgctn aggcgnnaag
      781 gcgtggggag cgaacgggat tagatacccc ggtagtccac gcagtaaacg ttgtacgctc
      841 ggtgtngcgg atattnaaat ctgctgtgcc naagctaacg cattaagtgt accgcctggg
      901 nagtacggtc gcaagactaa aactcraagg aattgrcggg ggcccgcrca agcggtggag
      961 catgtggttt aattcgatnn aacgcgaaga accttacctg ggtntgacat cctgtgaata
     1021 tcccgtnatt gggatngtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtcn
     1081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1141 nnnnnnnnnn nnnntgggaa ctctggcgag actgccccgg tnnacgggga ggaaggtggg
     1201 gatgacgtca agtcctsrtg gccctnatat ccagggctac acacgtgcta caatggtagg
     1261 tacaaagggc agcgactttg crrraagaag cgaatcccaa aangcctntc ccagtccgaa
     1321 ttggggtctg caactcgacc ccatgaagtt ggaatcgcta gtaatcgcgg atc
//
LOCUS       DSPRR1616    1397 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  Desulfobacter sp. Vulgaris 16S ribosomal rRNA.
ACCESSION   M34415
KEYWORDS    ribosomal 16S rRNA.
SOURCE      Desulfobacter sp. (strain 3ac10 DSM 2035) rRNA.
  ORGANISM  Desulfobacter sp.
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 1397)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1388     16S rRNA
BASE COUNT      363 a    290 c    404 g    296 t     44 others
ORIGIN      
        1 actggagagt ttgrtcctgg ctcagaatga acgctggcgg cgtgcttaac acatgcaagt
       61 cgaacgagaa agggattgct tgcratcctg agtagagtgg cgcacgggtg agtaacgcgt
      121 agataatctg ccttcaagcc tgggataact attcgaaagg gtagctaata ccggataaag
      181 tcgattcaca taagtaaatt gatgaaagat tgcctcttct tgaaagcaat tgtttgggga
      241 tgagtttgcg taccattagc ttgttggtgg ggtaaaggcc taccaaggca acgatggtna
      301 gctggtctga gaggatgatc agccacactg gaactggaac acggtccaga ctcctacggg
      361 aggcagcagt gaggaatttt gcrgcrratg ggggcaaccc tgacgcagca acgccgcgtg
      421 agtgaagaag gcctttgggt cgtaaagctc tgtcaacaag gaagaaatta caattattta
      481 atagataatt gtattgacgg tacttgtaga ggaagcgccg gcnaactcnn nnnnnnnnnn
      541 nnnggtaaca cggggggcgc aagcgttaat tggaattatt gggcgtaaag ggcgcgcagg
      601 cggtcttgtc cgtcaggtgt gaaagctcgg ggctcaaccc cggaagtgca cttgaaacag
      661 caagacttga atacgggaga ggagagagga attcctggtg tagaggtgaa attcgtagat
      721 atcaggagga acaccgatgg cgaaggcatc tctctggacc gatattgacg ctnaggcgct
      781 aaggcgtggg tagcgaacgg gattagatac cccggtagtc cacgcagtaa acgttgtaca
      841 ctcggtgtgg cggatattta aatctgctgt gcccaagcta acgcattaag tgtaccgcct
      901 gggaagtacg gtcgcaagac taaaactcaa aggaattgac gggggcccgc acaagcggtg
      961 gagcatgtgg tttaattcga cgnaacgcga agaaccttac ctgggtttga catcctgtga
     1021 atatcccgta attgggatag tgccttcggg agcacagaga caggtgctgc atggctgtcg
     1081 tcagctcgtg tngtgagatg tttggttaag tccagcaacg agcgcaaccc ttacgtcagt
     1141 tgccagcaca taatggtggg aactctggcg aactgcccgc tanannnnnn nnnnnnnnnn
     1201 nntgacgtca agtcctcatg gcccttatat ccagggctac acacgtgcta caatggtagg
     1261 tacaaagggc accgactctg caaagggaag cgaaatccca aaagcctatc tcagtccgga
     1321 ttggggtctg caactcgacc ccatgaagtt ggaatcgcta gtaatcgcgg atcagcatgc
     1381 cgcggagaat atgttcc
//
LOCUS       DSPRR1617    1472 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  Desulfobacter sp. 16S ribosomal rRNA.
ACCESSION   M34416
KEYWORDS    ribosomal 16S rRNA.
SOURCE      Desulfobacter sp. (strain 4ac11 DSM 2057) rRNA.
  ORGANISM  Desulfobacter sp.
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 1472)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1472     16S rRNA
BASE COUNT      368 a    312 c    442 g    313 t     37 others
ORIGIN      
        1 aactggagag tttgrtcctg gctcagaatg aacgctggcg gcgtgcttaa cacatgcaag
       61 tcgaacgaga aagggattgc ttgcaatcct gagtagagtg gcgcrcgggt gagtaacrcg
      121 tagataatct gccttcaagc ctgggataac tatccgaaag gatagctaat accggataaa
      181 gtcgatttac acaagtagac cgatgaaaga ttgcctcttc ttgaaagcaa ttgtttgggg
      241 atgagtttgc gtaccattag ctaggttggt ggggtaaagg cctaccaagg cngngatggt
      301 nagctggtct gagaggatga tcagtcacac tggaactgga acgcggtcca gnctcctacg
      361 ggaggcagca gtgaggaatt ttgcgcaatg ggggcaaccc tgacgcagcg acgccgcgtg
      421 agtgaagaag gccnntgggt cgtaaagctc tgtcwacwag gaagaaatta ggatctatta
      481 atagtgggtt ctattgacgg tacttttgga ggaagcgccg gctnactckk tgccagcagc
      541 ngcggtaaca cggggggcgc nagcgttatt cggaattatt gggcgtnaag ggngcgcagg
      601 cggtcttgtc tgttaggtgt gaaagcgggg gctcaacccc ggaagtgcac ttgaaacagc
      661 aagacttgaa tacgggagag gagagaggaa ttcctggtgt agaggtgaaa ttcgtagata
      721 tcaggaggaa caccgatggc gaaggcatct ctctggaccg atattgacgc tnaggcgnna
      781 aggcgtgggt agcgaacggg attagatacc ccggtagtcc acgcagtnaa cgttgtacac
      841 tcggtgtagc ggatattnaa atctgctgtg ccnaagctaa cgcattaagt gtaccgcctn
      901 gnnngtacgg tcgcaagact aaaactcaaa ggaattgacg ggggcccgca caagcggtgg
      961 agcatgtggt ttaattcgac nnaacgcgaa gaaccttacc tgggtttgac atcctgtgaa
     1021 tatcccgtaa ttgggrtrgt gccttcggga gcacagagac aggtgctgca tggctgtcgt
     1081 cagctcgtgt cgtgagatgt ttggttaagt ccagcaacga gcgcaaccct tatcgtcagt
     1141 tgccagcacg taatggtggg aactctggcg agactgcccc ggtcaacggg gaggaaggtg
     1201 gggatgacgt caagtcctca tggcccttat atccagggct acacacgtgc tacaatggta
     1261 ggtacaaagg gcagcgactt tgcaaaaagg agcgaatccc naaaagccta tctcagtccg
     1321 aattggggtc tgcaactnga ccgnatgaag ttggaatcgc tagtaatcgc ggatcagcat
     1381 gccgcggtna atatgttccc gggccttgta cacaccgccc gtcacaccat ggaagttgat
     1441 tatacccgac gtcrctgggc taactattta ga
//
LOCUS       DSURR1611    1371 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.propionicus 16S ribosomal rRNA.
ACCESSION   M34410
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.propionicus (ATCC 33891) rRNA.
  ORGANISM  Desulfobulbus propionicus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1371)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1371     16S rRNA
BASE COUNT      333 a    271 c    397 g    276 t     94 others
ORIGIN      
        1 naactggaga gtttnatcct ggctcagaac gaacgctggc ggcgtgctta acacatgcaa
       61 gtcgaacgcg aaagggactt cggtcctgag taaagtggcg cacgggtgag taacgcgtag
      121 ataacctnnc ttcatgtctg gaataatacg ccgaaagggg trctaatacc ggatattntn
      181 gctnttcata agttatgcaa gcraaaggtg gcctctgcat atgctactgc rtgaagaggg
      241 gtctgcgtac cattagctag taggtggggt aatggcctac ctaggcgacg atggtnagcg
      301 ggtctgagag gatgatccgc nacactggca ctggaacacg ggcnngnctc ctacgggagg
      361 cagcagtgag gaatattgcg caatgggggn aacccnnacg cagcgacgcn gcgtgagtga
      421 ggaaggcctt cgggtcgtaa agctctgtca agaggaaaga agtgtacaat ggctnatacc
      481 tnttgtnttt gacggtacct ctnaaggaag caccggctna ctccgtncca gcagcnncgg
      541 taatacggag ggtgcaagcg tnnttcggaa tcactnggng tnaagggcgc gtaggcggnt
      601 tggtaagtca gatgtgaaag cccacggctn aactgtggaa gtgcatttga aactgtcagg
      661 cntgagtacc agaggggaaa gtggaattcc cggtgtagag gtnaaattcg tagatatcgg
      721 gaggaatacc ggtggcgaag gcgactttct ggctngatac tnacgctnag gcgcnaaagc
      781 gtggggagca aacaggatta gataccctng tagtccacgc tgtaaacgat gtgaactaga
      841 tgcagggggt gtnnatccnn nctgtgtngc agctaacgca ttnagttnnn nnnnnnnnna
      901 gtacggnngc aagattaaaa ctcaaaggaa ttgacggggn ncngcacaag cggtggagta
      961 tgtggtttaa ttcgatnnaa cgcgaagaac ctnacctngt nttgacatcc cggnaatctn
     1021 ttggaaacaa gagagtgctt ccgnnagnat ctggagacag nnnctgcatn gctntcgtna
     1081 nnnnntgttg tgagatgttg ggttaagtcc cnnaacgagc gcaacccttg cctttagttg
     1141 ccagcagttc ggctggrcac tctaaaggga ctgccggtgt taaaccggag gaaggtgggg
     1201 atgacgtcaa gtcctnatgg ccnntatgac cagggctaca cacgtcctac aatggccgat
     1261 acaaagggca gcgacactgc gaggtggagc taatcccata aaatcggnct cagtccggat
     1321 tggagtctgc aactcgactc catgaagttg gaatcgctag taatcgngga t
//
LOCUS       DSURR1612    1365 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  Desulfobulbus sp. 16S ribosomal rRNA.
ACCESSION   M34411
KEYWORDS    ribosomal 16S rRNA.
SOURCE      Desulfobulbus sp. (strain 3pr10, DSM 2058) rRNA.
  ORGANISM  Desulfobulbus sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1365)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1365     16S rRNA
BASE COUNT      299 a    267 c    389 g    267 t    143 others
ORIGIN      
        1 gaagagtttn attctggctc agaacgaacg ctggcggcgt gcntnncaca tgcaagtcga
       61 accgnnnnnn nnnnnnnntc ctagtagtgg cgcacgggtg agtracgcgt agatracctg
      121 ncctnatgtc tgggatrccc caccgaaagg tgggctaata ccggatgtnc tnntntttca
      181 tnagcgtaat aaagaaaagg tggcctctga tattaagctn ctgcrtgagr aggggtctgc
      241 gtaccattag ctagttggtg gggtnagngc ctaccaaggc atcgatggtn agcgggtctg
      301 agngnatgat ccgtcacact ggcactgaaa cncgggccng rctccnacgg gaggcngcng
      361 tgaggaatnt tgcgcnatgg gggcnaccct nacgcagcga cgccgcgtga gtgagnaagg
      421 ctttcgggtc gtaaagctct gtnnaaggga agaaatgcat aatnattnat ncttnttatg
      481 tttgacggta cctttaaagg aagcaccggc taactccgtg ccagnagccg cggtaatacg
      541 gagggtgcaa gcgttgtccg gaattactgg gcgtnaaggg cacgcaggcg gcctgataag
      601 tcagatgtga aagcccacgg ctnnaccgtg gaagtgcatt tgaaactatt aggnnngagt
      661 atcagagggg aaagtggaat tcctggtgta gaggtgaaat tcgtagatat caggaggaat
      721 rccagtggcg aaggcgactt tctggctgaa tactgacgct gaggtgcgaa agcgtgggga
      781 gsaaacagga ttagataccc tggtagtcca cgccgtaaac gatgtcaact aggtgtaggg
      841 ggggttnatc csstctgtgc cgcagctrac gcattragtt gacngcctgg ggagtacggt
      901 cgcaagatta aaactcnaag naattgrcgg gggcccgcrc nagcggtgga gtatgtggtt
      961 taattcgatg nnacgcgaag aaccttacct ggtcttgaca tcccaagaat cttctagaaa
     1021 tagaagagtg cttttcggag aacttggtga caggtgctgc atggctgtcg tcagcnnnnn
     1081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ttgccagtag
     1141 ttcggctgag cactctaaag ggactgccgg tgtnnaaccg gaggaaggtg gggatgacgt
     1201 caagtcctca tggcsnntat gaccagggct acacacgtac tacnatggcc ggtacnaagg
     1261 gcagcgacag ggcgrcctna agcgaatccc gtraagccgg nctcngtccg gattggagtc
     1321 tgcnactcga ctccatgaag tcggaatcgc tagtaatcgt ggatc
//
LOCUS       DSVRR168     1527 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.variabilis 16S ribosomal rRNA.
ACCESSION   M34407
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.variabilis (strain DSM 2060) rRNA.
  ORGANISM  Desulfosarcina variabilis
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Sulfolobales.
REFERENCE   1  (bases 1 to 1527)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1527     16S rRNA
BASE COUNT      380 a    338 c    458 g    328 t     23 others
ORIGIN      
        1 ttaattggag agtttgatcc tggctcagaa tgaacgctgg cggcgtgctt aacacatgca
       61 agtcgtacga gaacgctcta gcttgctaga gcaagtaaag tggcgcacgg gtgagtaacg
      121 cgtgggtaac ctaccttcga attggggata acgttgcgaa agcgacgcta ataccgaata
      181 atatccaaat atcttcggat ttttggatca agggtggcct cttcttgaaa gctactgttt
      241 gaagatgggc ccgcgtacca ttagcttgtt ggtggggtaa tggcctacca aggcaacgat
      301 ggttagctgg tctgagagga tgatcagcca cactggaact gacacacggt ccagactcct
      361 acgggaggca gcagtgagga attttgcgca atgggcgaaa gcctgacgca gcaacgccgc
      421 gtgagtgatg aaggctttcg ggtcgtaaag ctctgtcaag tgggaagaac ctatataatg
      481 ttaatatcat tatatactga cggtaccact gaaggaagca ccggctaact ccgtgccagc
      541 agccgcggta atacgggggg tgcaagcgtt attcggaatc attgggcgta aagagcacgt
      601 aggcggcctg gtaagtcaga tgtgaaagcc cggggcttaa ccccggaagt gcatttgata
      661 ctgtcaggct tgagtatggg agagggaagt ggaattcctg gtgtagaggt gaaattcgta
      721 gatatcagga ggaacaccgg tggcgaaggc gacttcctgg accaatactg acgctgaggt
      781 gcgaaggcgt gggtagcaaa caggattaga taccctggta gtccacgcag taaacgttga
      841 tcactaggtg tagcrgggta ttgacccctg ctgtgccgca gttaacgcat taagtgatcc
      901 gcctggggag tacggtcgca agattaaaac tcaaaggaat tgacgggggc ccgcacaagc
      961 ggtggagcat gtggtttaat tcgannnaac gcgaagaacc ttacctggat ttgacatccc
     1021 gggaatccta tcgaaagact ggagtgccct tcggggagcc cggwgacagg tgctgcatgg
     1081 ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caacccttgt
     1141 ctttagttac catcatttag ttggggactc taaagatact gccccggtca acggggagga
     1201 aggtggggat gacgtcaagt cctcatggcc tttatgtcca gggctacaca cgtgctacaa
     1261 tgggcggtac aaagggtagc gaacccgcga gggtaagcca atcccaaaaa gccgtcccaa
     1321 gttcggattg gagtctgcaa ctcgactcca tgaagctgga atcgctagta atcgcggatc
     1381 agcatgccgc ggtgaatacg ttcccgggcn ttgtacacac cgcccgtcac accatgagag
     1441 ttggttgtac ccgaagtcgt cgggctaacc cttcggggat gcangcgcct aaggtatggc
     1501 tgannnnnnn nnnnnnnnng taacaag
//
LOCUS       DVURR16      1388 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.vulgaris 16S ribosomal rRNA.
ACCESSION   M34399
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.vulgaris (strain Hildenborough DSM 644) rRNA.
  ORGANISM  Desulfovibrio vulgaris
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1388)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1388     16S rRNA
BASE COUNT      306 a    315 c    418 g    248 t    101 others
ORIGIN      
        1 taactggaga gtttgattct ggctcagatt gaacgctggc ggcgtgctta acacatgcaa
       61 gtcgngcgtg aaaggacttc ggyccgagta aagcggcgcr cgggtgagta acgcgtggat
      121 gatctaccca tgagttggga ataacggctg gaaacggtcg ctaataccga atacgctccg
      181 atttcracnt cgggggaaag gtggcctctg cttgcaagct actgctcatg gatgagtccg
      241 cgtcccatta gctagttggt ggggtaacgg cccacnaagg cgacratggg nagccggtct
      301 gagaggatga ccggccacac tgggactgga acacggccna gactcctacg ggaggcagca
      361 gtggggaata ttgcgcaatg ggcgaaagcc tnacgcagcg acgccgcgtr agggatgaag
      421 gtcctcggat cgtnaacctc tgtcnggagg gaagaaccgc cacggtgctn atcagccgtn
      481 gtctgacggt acctccagag gaagcaccgg ctaactccgt gccagcagcc gcggtaatac
      541 ggagggtgca agcgttaatc ggaatcactg ggcgtaaagc gcacgtaggc tgcttggtaa
      601 gtcaggggtg aaagccggnn nctcaaccgc ggaattgcct ttgatactgc caagcnagag
      661 tccgggagag ggtagtggaa ttccaggtgt aggagtgaaa tcmgtagaga tctggaggaa
      721 catcagtggc gaaggcgact acctggaccg gnactgacgc tnaggtgcnn aagcgtgggg
      781 agcaaacagg attagatacc ctngtagtcc acgccgtaaa cgatggatgc taggtrtcgg
      841 ggccttgarr cttcggtgcc gtagttaacg cgttaagcaa tccgcctngg gagtacggtc
      901 gcnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      961 nnnnnnnnnn aacgcgaaga accttaccta ggtttgacat ccggaagacc ttcccgaaaa
     1021 ggaaggntgc cttcggggaa ttccgagaca ggtgctgcat ggctgtcgtc agctcgtgcc
     1081 gtgaggtgtt gggttaagtc ccgcaacgag cgcaacccct attgccagtt gctaccaggt
     1141 aatgctgggc tctctggtga gactgccccg gttaacgggg aggaaggtgg ggacgacgtc
     1201 aagtcatgat ggcccttacg cctagggcta cacacgtact acaatggcgc atacaaaggg
     1261 cagcgatacc rcraggtgga gccaatccca aaaagtgcgt cccagtccgg attgcagtct
     1321 gcaactcgac tgcatgaagt tggaatcgct agtaattcga gatcagcatg cncgggtgaa
     1381 tgcgttcc
//
LOCUS       DVURR161     1370 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.gigas 16S ribosomal rRNA.
ACCESSION   M34400
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.gigas (ATCC 19364) rRNA.
  ORGANISM  Desulfovibrio gigas
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1370)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1370     16S rRNA
BASE COUNT      310 a    303 c    393 g    243 t    121 others
ORIGIN      
        1 taactggaga gtttgattct ggctcagatt gaacgctrgc ggcgtgctta acacatgcaa
       61 gtcgagcgag anngggtctt tcgggacctg agtacagcgg cgcacgggtg agtaacgcgt
      121 ggatgatcta cctcattgrc cgggataacg gtgggaaacc gctgctaata ctggatacgc
      181 ttcatggaaa catgagggaa aggtggcctc tgatttaagc trtcacaatg agatgagtcc
      241 gcgtctcatt agcttgttgg tagggtaatg gcctaccaag gctacgatga gtagcggccc
      301 tgagagggtg gccrgccaca ctggaactga aacacggtcc agactcctac gggaggcagc
      361 artggggaat attgcgcaat gggcgaaagc ctnacgcagc gacgccgcgt gagggaagaa
      421 ggccttcggg tcgtaaacct ctgtctggag gaaagaaacc cataggtgtg aatagggctt
      481 atggctgacg gtacctccga aggaagcacc ggctaactcc gtgccagcag ccgcggtaat
      541 acggagggtg caagcgttaa tcggaatcac tgggcgtaaa gcgcacgtag gctgtcttgc
      601 aagtcaggtg tgaaagccct cggctcaacc ggggaactgc acttnaaact gcatgacttg
      661 agtacaggag aggatggcag aattccgggt gtaggggtna aatccgtaga tatccggagg
      721 aataccagtg gcgaaggcgg ccatctggac tgtcactgac gctgaggtgc rnaagcgtgg
      781 ggagcaaaca ggattagata ccctngtagt ccacgctgta aacgatggac gctnggtrtc
      841 grggggtcgc ccctcggcgc cgaactaacg cgttnannnn nnnnnnnnnn nnnnnnnnnn
      901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      961 nnnnnnnnnn aacgcgaaga accttaccta ggcttgacat cccgggaacc ctcccgaaaa
     1021 ggaggnntgc tcttcggaga gcccggagac aggtgctgca tggctrtcgt cagctcgtgc
     1081 cgtgaggtgt tgggttaagt cccgcaacga gcgcaacccc tgtccatagt tgccnnnngg
     1141 taaagctngn cactctatgg agactgcctt ggcaacaagg aggaaggtgg ggatgacgtc
     1201 aagtcatcat ggcccttacg cctagggcta cacacgtact acaatggcga gtacaaaggg
     1261 ttgcgaaacc gcgaggttga gccaatccca aaaagctmgt cccagtccgg attgcagtct
     1321 gcaactcgac tgcatgaagt cggaatcgct agtaatcccg gatcagcatg
//
LOCUS       DVURR162     1400 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.salexigens 16S ribosomal rRNA.
ACCESSION   M34401
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.salexigens (ATCC 14822) rRNA.
  ORGANISM  Desulfovibrio salexigens
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1400)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1400     16S rRNA
BASE COUNT      320 a    306 c    393 g    270 t    111 others
ORIGIN      
        1 actggagagt ttgattctgg ctcagattga acgctggtgg cgtgcttaac acatgcaagt
       61 cgtgcgagaa cagttccttc gggaacctag tagagcggcg cacgggtgag taacgcgtgg
      121 ataatctacc cagaagactg ggataacagt tggaaacgac tgctaatacc ggatacgttt
      181 catatttaac tnnatnagag aaaggtggcc tctntttcaa gctatcactt ttggatgagt
      241 ccgcgtttca ttagctagtt ggtaaggtaa cggcttacca aggcgacgat gaatagctgg
      301 tctgagagga tgaccagcca cactgggact ggaacacggc cnagactcct acgggaggca
      361 gcagtgggnn atattgcgca atgggggaaa ccctgacgca gcgacgccat gtgagggacg
      421 aaggctttcg ggtcgtaaac ctctgtcrrg rrggaagaaa ctgtttgarg ctaataccct
      481 ntttcacttg acggtaccct cagaggaagc accggctaac tccgtgccag cagccgcggt
      541 aatacggagg gtgcgagcgt taatcggaat cactgggcgt aaagcgcgcg taggtggcgc
      601 gataagtcag gcgtgaaagc cctcggctca accggggaat tgcrcttgat actgtcgtgc
      661 ttgagtctcg gagagggtgg cggaattcca ggtgtaggag tgaaatccgt agatatctgg
      721 aggaacacca gtggcgaagg cggccacctg gacgagtact gacgctgagg tgcgaaagcg
      781 tggggagcaa acaggattag ataccctngt agtccacgcc gtaaacgatg gatgctagat
      841 gtcgggcctt aaccaggttc ggtgtcgaag ttaacgcgat aagcatccnn nnnnnnnnnn
      901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      961 nnnnnnnnnn nnnnnnaacg cgaagnacct tacctggact tgacatcctg agaatcctct
     1021 agaaatagag gagtgccttc gggnnattca gtgacaggtn ctgcatggct ntcgtcagct
     1081 cgtgccgtga ggtgttgggt taagtcccgc aacgagcgca acccctattg ctagttgcca
     1141 tcacataatg gtgggcactc tagtgagact gcccgggtca accgggagga aggtgggnac
     1201 gacgtcaagt catcatggcc cttacgtcca gggctacaca cgtactacaa tggtggatac
     1261 aaagggttgc caagccgcga ggccgagcca atcccaaaaa gtccrtccca gtccggatcg
     1321 cagtctgcaa ctcgactgtg tgaagttgga atcgctagta atcccggatc agcatgccgg
     1381 ggtnaatacg ttcccgggcc
//
LOCUS       DVURR163     1395 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.sapovorans 16S ribosomal rRNA.
ACCESSION   M34402
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.sapovorans (strain 1pa3 ATCC 33892) rRNA.
  ORGANISM  Desulfovibrio sapovorans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1395)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1395     16S rRNA
BASE COUNT      334 a    291 c    427 g    286 t     57 others
ORIGIN      
        1 gaacggagag tttgatcctg gctcagaatg aacgctggcg gcgtgctnaa cacatgcaag
       61 tcgaacgaga aagttccttc gggggataag taaagtggcg cacgggtgag taatgcgtgg
      121 ataatctacc rttgaattcg gaataacatc gcgaaagcgg tgctaatacc ggatratgtt
      181 gtrttttctt cggggaatgc aaccaaagaa tgcctcttca tgaaagcatt tgtttgagga
      241 tgagtccgcg tcccattagc tagttggcgg ggtaacggcc caccaaggca gtgatgggna
      301 gctngtctga gaggatgatc agccacactg ggactgacac acggccnnga ctcctacggg
      361 aggcagcagt gaggaatttt gcgcaatggg ggcaaccctg acgcagcaac gccgcgtgag
      421 tgaagaaggc ccttgggtcg taaagctctg tcnatgggga agaagttgtg tggttcaaac
      481 agggccatgc attgacggta cccatnaagg aagcaccggc tnactccgtg ccagcagccg
      541 cggtnatacg gagggtgcaa gcgttattcg gaattattgg gcgtnaaggg ngcgtaggcg
      601 gcctgtnaag tcatctgtga aatttcgggg ctnaaccccg gagctgcatg tgatactggc
      661 aggnttgagt atggcagagg aaagcggaat tcctggtgta gcggtgaaat gcgtagatat
      721 caggaggaac accrgtggcg aaggcggctn tctngaccna tactgacgct gatgcgcgaa
      781 ggcgtgggta gcaaacagga ttagataccc tngtagtcca cgcagtaaac gttgatcact
      841 aggtgttgcg ggtattgacc cctgagtgcn gaagctaacg cattaagtga tcngcctggg
      901 gagtacgatc gcaagattaa aactcaaagg aattgacggg ggnccgcana agcggtngag
      961 crtgtggttt aattcgannn aacgcgcann accttacctg ggcttgaaat ctgtggaact
     1021 tgtatgaaag tatggggnnn cttcggggaa ccgcaagaca ggtgctgcat ggctgtngtc
     1081 agctcgtgtc gtgagatgtt gggtnaagtc ccgcaacgag cgcaaccctc gtcttcagtt
     1141 gccagcacgt aaaggtggga actctgaaga tactgccccg gtnnncgggg nggaaggtgg
     1201 ggntgacgtc aagtcctnat ngcnnntntg cccagggctn cacacgtgct ncaatggtat
     1261 atacaaaggg nnncgatncc gcgaggtgga nccaatccca tnaagtatgc cacagttcgg
     1321 attggagtct gcaactcgac tccatgaagt tggaatcgct agtaatcgcg gatcagcatg
     1381 ccgcggtgaa tacgt
//
LOCUS       DVURR164     1395 bp ss-rRNA            BCT       24-JUL-1990
DEFINITION  D.baarsi 16S ribosomal rRNA.
ACCESSION   M34403
KEYWORDS    ribosomal 16S rRNA.
SOURCE      D.baarsi (strain 2st14 DSM 2075) rRNA.
  ORGANISM  Desulfovibrio baarsi
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1395)
  AUTHORS   Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A.
  TITLE     Natural relationships among sulfate-reducing eubacteria
  JOURNAL   J. Bacteriol. 171, 6689-6695 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.A.Stahl, 22-MAY-1990.
FEATURES       from  to/span     description
    rRNA    <     1     1395     16S rRNA
BASE COUNT      334 a    291 c    427 g    286 t     57 others
ORIGIN      
        1 gaacggagag tttgatcctg gctcagaatg aacgctggcg gcgtgctnaa cacatgcaag
       61 tcgaacgaga aagttccttc gggggataag taaagtggcg cacgggtgag taatgcgtgg
      121 ataatctacc rttgaattcg gaataacatc gcgaaagcgg tgctaatacc ggatratgtt
      181 gtrttttctt cggggaatgc aaccaaagaa tgcctcttca tgaaagcatt tgtttgagga
      241 tgagtccgcg tcccattagc tagttggcgg ggtaacggcc caccaaggca gtgatgggna
      301 gctngtctga gaggatgatc agccacactg ggactgacac acggccnnga ctcctacggg
      361 aggcagcagt gaggaatttt gcgcaatggg ggcaaccctg acgcagcaac gccgcgtgag
      421 tgaagaaggc ccttgggtcg taaagctctg tcnatgggga agaagttgtg tggttcaaac
      481 agggccatgc attgacggta cccatnaagg aagcaccggc tnactccgtg ccagcagccg
      541 cggtnatacg gagggtgcaa gcgttattcg gaattattgg gcgtnaaggg ngcgtaggcg
      601 gcctgtnaag tcatctgtga aatttcgggg ctnaaccccg gagctgcatg tgatactggc
      661 aggnttgagt atggcagagg aaagcggaat tcctggtgta gcggtgaaat gcgtagatat
      721 caggaggaac accrgtggcg aaggcggctn tctngaccna tactgacgct gatgcgcgaa
      781 ggcgtgggta gcaaacagga ttagataccc tngtagtcca cgcagtaaac gttgatcact
      841 aggtgttgcg ggtattgacc cctgagtgcn gaagctaacg cattaagtga tcngcctggg
      901 gagtacgatc gcaagattaa aactcaaagg aattgacggg ggnccgcana agcggtngag
      961 crtgtggttt aattcgannn aacgcgcann accttacctg ggcttgaaat ctgtggaact
     1021 tgtatgaaag tatggggnnn cttcggggaa ccgcaagaca ggtgctgcat ggctgtngtc
     1081 agctcgtgtc gtgagatgtt gggtnaagtc ccgcaacgag cgcaaccctc gtcttcagtt
     1141 gccagcacgt aaaggtggga actctgaaga tactgccccg gtnnncgggg nggaaggtgg
     1201 ggntgacgtc aagtcctnat ngcnnntntg cccagggctn cacacgtgct ncaatggtat
     1261 atacaaaggg nnncgatncc gcgaggtgga nccaatccca tnaagtatgc cacagttcgg
     1321 attggagtct gcaactcgac tccatgaagt tggaatcgct agtaatcgcg gatcagcatg
     1381 ccgcggtgaa tacgt
//
LOCUS       DRORNP70K    3569 bp ds-DNA             INV       24-JUL-1990
DEFINITION  D.melanogaster U1 70K small nuclear ribonucleoprotein gene,
            complete cds.
ACCESSION   M31162
KEYWORDS    ribonucleoprotein; small nuclear ribonucleoprotein.
SOURCE      D.melanogaster (strain Oregon R) DNA, clones lambda D331.[1,2]; and
            adult head, cDNA to mRNA, clones pBS9.[1,3] and lambda R3-2.1.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 3569)
  AUTHORS   Mancebo,R., Lo,P.C.H. and Mount,S.M.
  TITLE     Structure and expression of the Drosophila melanogaster gene for
            the U1 small nuclear ribonucleoprotein particle 70K protein
  JOURNAL   Mol. Cell. Biol. 10, 2492-2502 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Mancebo, 09-JAN-1990.
FEATURES       from  to/span     description
    pept        614     1960     U1 70K small nuclear ribonucleoprotein, exon 2
                                 (first expressed exon)
    pre-msg <     1     2259     snRNP mRNA and intron (alt.)
    pre-msg <     1     3367     snRNP mRNA and intron (alt.)
    IVS         488      566     U1 70K snRNP, intron A
    site        103      182     RNA recognition motif
BASE COUNT      972 a    839 c    919 g    839 t
ORIGIN      Chromosome II; map position 27D1,2.
        1 gcatgaagga aaatattcta caaaaaactt caattttata aaattcattt aaaatacaaa
       61 attgtacgta aacttaacgt aaccgttact cagttatgga atgtgtgagc gagatggtga
      121 agcagcagca agtgatgtag caaattgcaa ttgaacgcga gtgggaaaag gggcaactat
      181 aaaaccgaga aacttgcttt tagcatggat tcgaacccct tatttatagt actctggatg
      241 tccgagacac acacctattt gtggtattta tattttataa cgtaagtagt atatttaatt
      301 actaatcagt atttcatgcg gaattcttcc gcttaattca tagaccgcgc gggggtcaca
      361 cttgctactc aagccaggcg aaaaactaaa gaaaatcggg aaaatacttg gtctgcaccg
      421 aattatattg ctggtactta aacgaagtac cctagattta ttcttgccaa gcggatggct
      481 gtttaaggtg agttgcgcca gcgcttacta tcccttgtgg agtaaacaaa ctccaaccta
      541 acctcaaact gaccgttttt ttgcagacga ggaacttcag gaaaaggtaa aacaaaacaa
      601 aaaagcccac aaaatgaccc aatatctgcc gccgaatctg ctggcgctgt tcgcggcacg
      661 ggagcccatc ccgttcatgc cgccggtgga caagctgccg cacgagaaga agtctcgcgg
      721 ctacctggga gtggccaagt tcatggccga tttcgaggat cccaaggaca cgccgctgcc
      781 gaaaacggtg gaaacgcgtc aggagcggct ggagcgacgc cggcgcgaga aggccgagca
      841 ggtggcctac aagctggagc gtgagatagc gctgtgggac cccacagaga tcaaaaatgc
      901 cacggaggac ccgtttcgca cgctgttcat tgcacgcatc aactacgaca cgtccgagtc
      961 gaagctgcgg cgtgagttcg agttctacgg gcccatcaag aagatcgtcc tgatccacga
     1021 ccaggaatca ggtaaaccca agggctacgc cttcatcgag tacgagcacg agcgggacat
     1081 gcatgccgcc tacaagcacg ccgatggtaa gaagatcgac agcaagcgcg tcctggtgga
     1141 cgtggagcgg gctcgcacgg tcaagggctg gctgcctcga cgcctgggcg gcggtctggg
     1201 tggaacgcgc cgcggcggca acgatgtcaa cattaagcac tccggccgcg aggacaacga
     1261 gagggaacgc gagcgctacc ggctggagcg ggagcgtgag gatcgcgagg gtcctggacg
     1321 cggcggcggc tccaatggcc tggatgcccg gcccggacgc ggtttcggtg cggaacgtcg
     1381 acgttcccgc tccagggaac gccgcgaccg tgaacgagat cgcggacggg gcgctgtggc
     1441 tagcagcggt cgctcgcgca gccgttctcg cgagcgcaga aaacgacgag cgggcagccg
     1501 ggagcggtac gacgagttcg accgccgcga tcggcgggac agggagcgcg agcgtgatcg
     1561 cgatcgcgag cgtgagaaga aaaagaagcg ctccaagtct cgcgaacgcg aatcctccag
     1621 ggagcgtcgc gaacggaagc gagagagaag ggaccgtgaa cgcggcaccg gatccggcgg
     1681 cgatgtcaag gagcgcaagc ccgatttccg tgatatggat gtcatcaaga tcaaggagga
     1741 gcccgtcgac gatggctatc ccacatttga ctaccagaac gcgaccatca agcgtgagat
     1801 cgacgatgag gatgaggaga agtaccggcc gccgcctgcg catcacaata tgttcagtgt
     1861 gccgccgccg cccattttgg ggcgtggaaa tgccagcacg aatcccaatc ccgacaatgg
     1921 ccagcagagc tccggcgacc cgagttggtg gcgtcagtag agtcttggag cgatgtgtca
     1981 aggttaatat ttctagaagt cagacgtctg tgggtcgcaa ctatttattc attcgactcc
     2041 gaggcacccc aatcctggag agcctaaccg tagctatcca ttttacacag aaaattttaa
     2101 ggtaccactt ggaggagggc caagtagaac aagattatag aaaacccgac cgctaaacgc
     2161 agaatccgct aatgtgtgcg tagcttaaat cacttaaatt tataagtaac tcttaacaaa
     2221 tgaatatgaa aacagtaagt aaaataaagc tagccctcat gtgtttgttt ccccaccttt
     2281 ggtaaggggg ttaaagggaa tacggagagt caggagctgg aacgctttcg gtggcgcata
     2341 caccgtacta tatggttact ccatcccatg gtggttcctg ggattttcta actcacctaa
     2401 cataataagc tgaacaatac aaacccttgc actaactcgt gccttttatt ttctctgttt
     2461 ttttgcagtt ttcaatcaat tgaaaatctg actctgacta gtgtgaaagc aaaagcataa
     2521 gtatttaatc aaacaaacag taatccaaaa acggaaatta gttccgccag tattcgtagc
     2581 ccatgcccaa gtctaaattc caagcccaca tcaggtaatt tggtctacgc acaaacctca
     2641 ctaatccatg cgtctaccgt tctaggacag ctctagaatc aagacagcta ccgcaatact
     2701 tttccaatct cctccgctct gggttgcctg tgttgtgtgg tggtgtggcg tgtaagttga
     2761 ttccgggcta agaaattttg taaaccaaaa ccttttccgt aagtttgccc cggtaagatt
     2821 acgatatcct cgcctaaccg ccgtggatcg gatggatgag tgagttagtg taagggagct
     2881 ttcctctgtt tggtacacat tgcgaactgc tccgatggcc tgtggcaatc tactccattc
     2941 atccattatg tctgtaacca atttaccatt tcgatctttt catgtacgtt gagctgattg
     3001 ttatcgtact gaagacgaat ccgcggcgga aactcgcata gaacaaacag aaaactgcgc
     3061 aaaggtttgc ttttggtaac tggtaacgat tggttgtggt tggtcaggtc aggggtcttt
     3121 cgagctggtt agcctccact ttggattgcg gacggtgaca aaattctggg ggtcttttat
     3181 ttttgtattg tattgtttgg cagacaggat ccatgcaatt gccttttcgt accgtgcgac
     3241 atacgggaaa caaactatcg tgtagcaggg catttttcct attacaccat tattaagaaa
     3301 agcggacaga gatcaatgtc acatttagat ttgatcaaat aaagaaaaga attcctcctt
     3361 acacgaaaaa gtacttgttt catttctaag tatttatcaa gatgacttga attttttaca
     3421 attttcgctt gttaaactgg tctatatgtc ctggataaga tttgcagtgg cgtccttgga
     3481 atctctaaaa atgtatacat ttgttgcaaa taaagcattg tgaaatctat ggacattaaa
     3541 tgcgacctct attggaaaac attcacata
//
LOCUS       FSBAFP05A     946 bp ds-DNA             VRT       24-JUL-1990
DEFINITION  Macrozoarces americanus antifreeze protein OP5A gene, complete cds.
ACCESSION   J03924
KEYWORDS    antifreeze protein.
SOURCE      Ocean pout adult testis DNA, clone lambda-OP5.
  ORGANISM  Macrozoarces americanus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; 
            Actinopterygii; Gadiformes; Zoarcoidei; Zoarcidae.
REFERENCE   1  (bases 1 to 946)
  AUTHORS   Hew,C.L., Wang,N.-C., Joshi,S., Fletcher,G.L., Scott,G.K.,
            Hayes,P.H., Buettner,B. and Davies,P.L.
  TITLE     Multiple genes provide the basis for antifreeze protein diversity
            and dosage in the ocean pout, Macrozoarces americanus
  JOURNAL   J. Biol. Chem. 263, 12049-12055 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.H.Hayes, 13-JUN-1988.
FEATURES       from  to/span     description
    pept        270      338     antifreeze protein, exon 1
                521      715     antifreeze protein, exon 2
    IVS         339      520     antifreeze protein, intron A
BASE COUNT      283 a    209 c    186 g    268 t
ORIGIN      211 bp upstream of Sau3AI site.
        1 aagcttgtga tagtttggac aaaaacaagt tatactttac ttataagaat ataaaatttc
       61 cattgcaatt ggcataagga ggtgtgacac agtgacctac tttcaggcca ataggaaacg
      121 ggatatgccg gttaagtcct cccacatact gtatattaga tgcagcacat ggacctgtcc
      181 tgtcagaagt ctcagctaca gctttcactt cgatctccga taattaatta attaattaat
      241 tattaattaa ttaagtctca gccacagcca tgaagtccgt tattttaacc ggtttgcttt
      301 tcgtcctcct ttgtgtcgac cacatgacag ccagccaggt gagatattct tgctccacta
      361 aaaaatattc aaaaatgtga gctacagtaa aattcaacag tgttctgttt agaaagacag
      421 agaacctttt aagtaaacat ttttagaatt ttctttttca actgtgccat gagaacaata
      481 ataacgtctg accttttatc ttccattctt caacggtcag tccgtggtgg ccacccagct
      541 gatccccata aatactgccc tgactccggt gatgatggag gggaaggtga ccaacccaat
      601 aggcatcccg ttcgcagaga tgtcccaaat agtggggaag caagtgaaca cgccagtggc
      661 taagggccaa accctcatgc caaacatggt gaaaacgtac gccgcgggaa agtagttctg
      721 agggtgccaa ggagcttctt cccaaaacca aaagaagaaa tgccccctct cacaattaac
      781 cctgtttttg tcacaaaccc aagtctgtta actgaacatg tcaaaacctg tggagactgt
      841 tgagatttga tgttctgaaa agataaagcc tataaataaa atgttgccca aatttcctgc
      901 ctgatgtttt tctttgtcgt tgctacatgg ctttgctgct cggatc
//
LOCUS       HUMCRPX        78 bp ss-mRNA            PRI       24-JUL-1990
DEFINITION  Human C-reactive protein (CRP) mRNA, 5' end.
ACCESSION   M35163
KEYWORDS    C-reactive protein.
SOURCE      Human adult liver, cDNA to mRNA, clone pCRP5.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 78)
  AUTHORS   Tucci,A., Goldberger,G., Whitehead,A.S., Kay,R.M., Woods,D.E. and
            Colten,H.R.
  TITLE     Biosynthesis and postsynthetic processing of human C-reactive
            protein
  JOURNAL   J. Immunol. 131, 2416-2419 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1  >    78     C-reactive protein precursor
                                 /hgml_locus_uid="LR0093W" /nomgen="CRP"
                                 /map="1q21-q23"
    sigp          1       54     C-reactive protein signal peptide
    matp         55  >    78     C-reactive protein
BASE COUNT       15 a     17 c     22 g     24 t      0 others
ORIGIN      
        1 atggagaagc tgttgtgttt cttggtcttg accagcctct ctcatgcttt tggccagaca
       61 gacatgtcga ggaaggct
//
LOCUS       PT3TAUF       117 bp ds-DNA             PHG       24-JUL-1990
DEFINITION  Bacteriophage T3 early terminator (T3Te) region.
ACCESSION   M35171
KEYWORDS    early terminator region.
SOURCE      Bacteriophage T3 DNA.
  ORGANISM  Bacteriophage T3
            Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE   1  (bases 1 to 117)
  AUTHORS   Briat,J.-F., Bollag,G., Kearney,C.A., Molineux,I. and
            Chamberlin,M.J.
  TITLE     Tau factor from Escherichia coli mediates accurate and efficient
            termination of transcription at the bacteriophage T3 early
            termination site in vitro
  JOURNAL   J. Mol. Biol. 198, 43-49 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    signal       85      106     transcription termination signal
BASE COUNT       28 a     37 c     28 g     24 t
ORIGIN      
        1 cggacggctc cctgcgacac cctagcttcg agaagttccg aggaactgag gacaaccctc
       61 aagagaaaat gtaaccaact cactggctca ccttcacggg tgggcctttc ttcgttc
//
LOCUS       VIBRHP       1489 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  V.harveyi Arg-tRNA, two Pro-tRNA and two His-tRNA genes.
ACCESSION   M29762 M31832
KEYWORDS    transfer RNA-Arg; transfer RNA-His; transfer RNA-Pro.
SOURCE      V.harveyi (strain ATCC 14126) DNA.
  ORGANISM  Vibrio harveyi
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 1489)
  AUTHORS   Giroux,S. and Cedergren,R.
  TITLE     Evolution of a tRNA operon in gamma purple bacteria
  JOURNAL   J. Bacteriol. 171, 6446-6454 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.J.Cedergren, 06-NOV-1989.
FEATURES       from  to/span     description
    tRNA         60      136     Arg-tRNA
    tRNA        171      247     His-tRNA
    tRNA        324      400     Pro-tRNA
    tRNA        450      523     His-tRNA
    tRNA        588      664     Pro-tRNA
    anticdn      94       96     Arg-tRNA anticodon ccg
    anticdn     205      207     His-tRNA anticodon gtg
    anticdn     358      360     Pro-tRNA anticodon tgg
    anticdn     481      483     His-tRNA anticodon gtg
    anticdn     617      619     Pro-tRNA anticodon tgg
    signal       11       16     -35 region
    signal       34       39     -10 region
BASE COUNT      351 a    331 c    360 g    447 t
ORIGIN      
        1 aaaaaagtac tagacggcat agtgtgatat ccgtattatt cctctccgcc gatgggcatg
       61 cgcccgtagc tcagttggat agagcgttgg cctccggagc caaaggtcga aggttcgaat
      121 cctttcgggc gtgccattcc ggaatacaaa ttaaaggcac gaacaatagt ggtggctata
      181 gctcagttgg tagagccctg gattgtgatt ccggtggtcg cgagttcgaa tctcgttagc
      241 caccccatta tttcggtaac tcattaaatt agtgaagttt ccagttttga ttcaaatatc
      301 ccaagagtcg aaacaaaatt cgtcggtgaa tagcgcagct tggtagcgca tctggtttgg
      361 gaccagaggg tcgggggttc gaatccctct tcaccgacca ctatttagtt tatggctata
      421 atagagcggt aaacggcaaa attgatggtg gctatagctc agttggtaga gccctggatt
      481 gtgattccgg tggtcgcgag ttcgaatctc gttagccacc ccattaattt aggtgacttc
      541 ttagttgaag tatcctgtct gatttgagag aatcagcaaa tacagttcgg tgaatagcgc
      601 agcttggtag cgcatctggt ttgggaccag agggtcgggg gttcgaatcc ctcttcaccg
      661 accactatta gaaagcctgc tttttagcag gctttcgtcg ttttagagct tatgatctcg
      721 cttggactag atgagtcggc aatatttcaa tggatttgaa cccatatggg gttcgcctga
      781 agttagaata tggctcttca tcgaccaata tttaaagcct cagtcgaaag actagggctt
      841 ttttgtattt ctcaccctaa aaaatctgca ttccatttca atatcagtaa acatgtacag
      901 ctctggaagc ttttgaccac ctatgatcta aatgcgattc gagctgataa aagaaacccc
      961 agcacgttgg ctggggtttt gtgtttcttg aattaagcgc tttactgagc aagcgttttt
     1021 aggatttcat tcagctcaac cgtctctcgc acttgatagt tgggtttgat atcggttgta
     1081 tttccgtacc tgttgcgttt acccacacgc tcagaatacc catattgttg gcacccgcaa
     1141 tgtctgctgg tagagagtcg ccaatatgaa tcacctcttc tggctttgca tccacaggtt
     1201 gagtgctttg cggaaaatgt tggctgctgg cttctcttct ggctcttccc caccgacgat
     1261 aatgtgatca acccattcgc tcatttgtgt tgctttgagt tttgggtgct gtgagaagat
     1321 tggaccgtta gtgatcacca ctaatttgta gtgttgacgt agctcggtca gcatctcttt
     1381 tacgccaggg aagaagtcaa aagcccccat gcgagccgag tcaaaatagc tctgcgcctg
     1441 ttgagccttt tctgtatcaa tctctatact ttgctctgcc aagatggcc
//
LOCUS       YSCPOX1      2915 bp ds-DNA             PLN       24-JUL-1990
DEFINITION  Yeast (S.cerevisiae) acyl-coenzyme A oxidase (POX1) gene, complete
            cds.
ACCESSION   M27515
KEYWORDS    acyl-coenzyme A oxidase.
SOURCE      Yeast (S. cerevisiae) DNA, clone PAD17.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2915)
  AUTHORS   Dmochowska,A., Dignard,D., Maleszka,R. and Thomas,D.Y.
  TITLE     Structure and transcriptional control of the Saccharomyces
            cerevisiae POX1 gene encoding acyl-coenzyme A oxidase
  JOURNAL   Gene 88, 247-252 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by D.Dignard, 01-SEP-1989.
FEATURES       from  to/span     description
    pept        455     2701     acyl-coenzyme A oxidase
    mRNA        364     2843     POX1 mRNA (minor alt.)
    mRNA        365     2843     POX1 mRNA (minor alt.)
    mRNA        369     2843     POX1 mRNA (minor alt.)
    mRNA        386     2843     POX1 mRNA (major alt.)
    mRNA        400     2843     POX1 mRNA (major alt.)
    mRNA        406     2843     POX1 mRNA (minor alt.)
    mRNA        413     2843     POX1 mRNA (minor alt.)
BASE COUNT      895 a    532 c    600 g    888 t
ORIGIN      1 bp upstream of BglII site; chromosome VII.
        1 agatctcgac caaaaaaagg gggtagggta ataaaattaa ccctattatt ttttaacttt
       61 aaaacctata atgtgctaat attttattat aaacctcctt tttttgcgtt caaaccctga
      121 cacattttaa gccctatatt tacggtatta gttgattaaa ctccgaagcg aaaggaattc
      181 ggtcattagc ggctaatagc cgttggggta aatcacctac aagcaagtac acaagagaac
      241 gttggcgttg ttaagtcaaa gcactaatac attggggctt taagagtgtt tataaaggtc
      301 taacctgtaa aaattattta aacaacttga acaggcctta aagttttcct cattccgctc
      361 atcatcacta atattgctct ccgtttttga atacacactt gacactaata agtatcacag
      421 aaaaaaagaa aatataataa attagtattg cgatatgacg agacgtacta ctattaatcc
      481 cgattcggtg gttctgaatc ctcaaaaatt tatccagaaa gaaagggcgg attcgaaaat
      541 caaagttgac caagttaaca catttttaga gtcatccccg gagaggagaa ctctgacgca
      601 cgccttaata gaccaaatag tgaatgatcc tatattgaaa actgatacgg actattacga
      661 tgctaaaaaa atgcaagaga gagaaattac tgccaaaaaa atagctaggc ttgctagtta
      721 tatggagcac gatatcaaaa cagtgcgcaa acactttcgc gacactgacc tgatgaaaga
      781 gttgcaagca aatgatccag acaaagcttc gcctttaaca aacaaagacc tttttatatt
      841 cgataagaga ttgtcacttg tagcaaatat tgatcctcaa ttgggtacgc gcgtgggtgt
      901 acacttgggg ctatttggta attgtatcaa gggcaatggt actgatgagc aaatccggta
      961 ttggttgcag gagagaggtg ccactttgat gaaaggtata tatggctgtt ttgcaatgac
     1021 tgagttagga catggttcca atgttgccca gctgcagact agggctgtgt acgataagca
     1081 aaatgatact tttgtaattg atacacctga tctaactgcc accaaatggt ggattggtgg
     1141 ggctgcccat tctgccacgc acgctgccgt gtacgccaga ttgatcgttg aaggtaaaga
     1201 ctacggtgta aaaacattcg ttgttcctct gagagaccct tcgactttcc aactgttagc
     1261 tggtgtttcc ataggggata ttggagcgaa gatgggtcgt gacggtattg ataatggctg
     1321 gatccagttc agaaacgtag ttatccctag agaatttatg ctaagtagat ttaccaaagt
     1381 tgtccgttct ccagatggtt cagtcaccgt caaaactgag ccacaattgg atcaaatttc
     1441 tggttatagt gcattgttaa gtggtagagt taacatggtc atggattcat ttaggtttgg
     1501 ctccaaattt gctactattg ctgtacgtta cgcggttggt cgtcagcaat tcgcacctag
     1561 aaagggattg tctgaaacac aattaatcga ctatcccctt caccaatatc gtgttttacc
     1621 acaattgtgt gttccatatt tggtgtcacc tgtagctttt aagttaatgg acaactatta
     1681 ttccactttg gacgagttat acaacgcttc ctcatctgca tacaaagctg ctctggttac
     1741 cgtgagtaaa aagttgaaga atttatttat tgatagcgcc acgttgaaag ccaccaatac
     1801 ttggttaatt gctacactga ttgatgagtt gagacagact tgcggaggac atgggtattc
     1861 acagtataac ggatttggta aaggctatga cgactgggtg gttcagtgca catgggaggg
     1921 tgataataat gttttatctt taacttcagc aaaatcaata ttgaaaaaat ttatcgattc
     1981 agccacaaag ggtagatttg acaacacact ggatgtggac tcattctctt acttaaaacc
     2041 tcagtacata ggatctgtgg tttctggaga aataaagagt ggtttaaagg agttgggtga
     2101 ttatactgaa atttggtcta tcaccttaat caaattactg gcacatattg gtactttagt
     2161 tgaaaaatca agaagtattg atagcgtttc taagctttta gtcttagtat ccaaatttca
     2221 tgccttgcgc tgcatgttga aaacctatta cgacaagtta aactctcgtg attcacatat
     2281 ttccgatgaa attacaaagg aatctatgtg gaatgtttat aagttatttt ccttgtattt
     2341 tattgacaag cattccggag aattccaaca attcaagatc ttcactcctg atcagatctc
     2401 taaagttgtg cagccacaac tattggctct tttgccaatt gtgaggaaag actgtatagg
     2461 tctgacagac tcctttgaat tacctgacgc gatgttaaat tctcctatag gttactttga
     2521 tggcgatatc tatcacaatt acttcaatga agtttgccgc aataatccag tggaggcaga
     2581 tggggcaggg aagccttctt atcatgcgct gttgagcagc atgctcggta gaggtttcga
     2641 atttgaccaa aagttaggtg gtgcagctaa tgcggaaatt ttatcgaaaa taaacaagtg
     2701 agtagaggtt tcctgttttc cttcgaaccc tctgttttgc gacttttgtt tcaattcaac
     2761 tagtgtcgcc aagttttaac aaaaagttac aaaatcctag tgagaggcca tcttatgtgc
     2821 ataacggtac tctctatcta tttacatatc taatactatt cacataacta tgacgaatca
     2881 atgacatgac tacatttacc aatgtatagt agtaa
//
LOCUS       YSPUGU1       507 bp ds-DNA             PLN       24-JUL-1990
DEFINITION  S.pombe U1 small nuclear RNA gene (snu1), complete cds.
ACCESSION   M29062
KEYWORDS    U1 small nuclear RNA; small nuclear RNA.
SOURCE      S.pombe (sub-species lindner; strain h-972) DNA, clone Bb.
  ORGANISM  Schizosaccharomyces pombe
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 507)
  AUTHORS   Porter,G., Brennwald,P. and Wise,J.A.
  TITLE     U1 small nuclear RNA from Schizosaccharomyces pombe has unique and
            conserved features and is encoded by an essential single-copy gene
  JOURNAL   Mol. Cell. Biol. 10, 2874-2881 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.L.Porter, 13-OCT-1989.
FEATURES       from  to/span     description
    uRNA        240      387     U1 small nuclear RNA
BASE COUNT      148 a     82 c     89 g    188 t
ORIGIN      3 bp upstream of HincII site.
        1 gttaactatt catggcttga aagcaacgaa atcatttcct attatgaaat tttggagatt
       61 tatataatta caaatcgctg aacgatttca ctaatcagca aaatttaacg atttaaatat
      121 cgtttttgtt aaataattat actgtacttc tttattgcat gtaagaatgt gactactttc
      181 gtgactgctt cgttccgcgt cgttactata aatatgggtg cttacactta cggttcttta
      241 cttacctggc atgagtttct gcagcacaag aattgtggag actcagttat ttgtcttggc
      301 attgcactga gccctgacga ataactgtgg actggctaag gtcagctccg gatgcatcat
      361 ttttgagttc gtccctcatt tggggcaatt ttactaatta aggcgatatg gttgccttaa
      421 atagtaaagt tatccgaaat ggattatgta aagtaacatt ttctttcatc atatataatt
      481 gtaaattatt gtatcataat taatatt
//
LOCUS       MUSIGHAAZ     149 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            C16-11D11, partial cds.
ACCESSION   M33399
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) adult Id+ hybridoma C16-11D11, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 149)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   149     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       87       88     V-region end/D-region start
    recomb      102      103     D-region end/JH4-region start
BASE COUNT       39 a     45 c     35 g     30 t
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
       61 gacactgccg tctattactg tgctagagct ggtaagggag gctatgctat ggactactgg
      121 ggtcaaggaa cctcagcacc gtctcctca
//
LOCUS       MUSIGHABB     136 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            C20-7D6, partial cds.
ACCESSION   M33401
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) adult Id+ hybridoma C20-7D6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 136)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   135     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       87       88     V-region end/D-region start
    recomb      105      106     D-region end/JH4-region start
BASE COUNT       36 a     37 c     32 g     26 t      5 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gncatctgag
       61 gacactncng tctattactg tgctagaggg nataggtacg acggtnctat ggactactgg
      121 ggtcaaggaa cctcag
//
LOCUS       MUSIGHABC     131 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            29-1G5, partial cds.
ACCESSION   M33402
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 131)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   131     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       84       85     V-region end/D-region start
    recomb       87       88     D-region end/JH4-region start
BASE COUNT       34 a     40 c     27 g     28 t      2 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
       61 gacactgcng tctattactg tgctccgtac tatgctatgg actactgggg tcaaggaacc
      121 tcagtnaccg t
//
LOCUS       MUSIGHABD     123 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            23-9D10, partial cds.
ACCESSION   M33403
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 23-9D10, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 123)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   123     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       84       85     V-region end/D-region start
    recomb       87       88     D-region end/JH4-region start
BASE COUNT       28 a     37 c     29 g     27 t      2 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
       61 gacactgcng tctattactg tgctggggct tactggggcc aagggactct ggtnactgtc
      121 tct
//
LOCUS       MUSIGHABE     129 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            41-4F5, partial cds.
ACCESSION   M33404
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 41-4F5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 129)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   129     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       84       85     V-region end/D-region start
    recomb       90       91     D-region end/JH4-region start
BASE COUNT       34 a     42 c     24 g     25 t      4 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag
       61 gacactgcng tctattactg tgctaactgg gactactggg gnnaaggcac cactctcaca
      121 gtctcctca
//
LOCUS       MUSIGHABF     132 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            49-1G5, partial cds.
ACCESSION   M33405
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1G5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 132)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   132     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       87       88     V-region end/D-region start
    recomb       90       91     D-region end/JH4-region start
BASE COUNT       34 a     44 c     25 g     27 t      2 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
       61 gacacngcng tctattactg tgctaggtac tactttgact actggggcca aggcaccact
      121 ctcacagtct cc
//
LOCUS       MUSIGHABG     141 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            23-10E7, partial cds.
ACCESSION   M33406
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 23-10E7, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 141)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   141     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       87       88     V-region end/D-region start
    recomb       99      100     D-region end/JH4-region start
BASE COUNT       39 a     43 c     30 g     27 t      2 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct aacatctgag
       61 gacactgcng tctattactg tgctagaagg ggnaactacg ctatggacta ctggggtcaa
      121 ggaacctcag tcaccgtctc c
//
LOCUS       MUSIGHABH     151 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            44-2F5, partial cds.
ACCESSION   M33407
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 44-2F5, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 151)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   151     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       84       85     V-region end/D-region start
    recomb      105      106     D-region end/JH4-region start
BASE COUNT       39 a     48 c     27 g     34 t      3 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag
       61 gacactgcng tctattactg tgcnctcatc tactatggta acacctatgc tatggactac
      121 tggggtcaag gaacctcagt caccgtctcc t
//
LOCUS       MUSIGHABI     160 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            25-5E10, partial cds.
ACCESSION   M33408
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 25-5E10, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 160)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   160     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       87       88     V-region end/D-region start
    recomb      117      118     D-region end/JH4-region start
BASE COUNT       44 a     44 c     36 g     33 t      3 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag
       61 gacactgcng tctattactg tgctagaaaa gactattact acggtagtag cgggggagct
      121 atggactact ggggtcaagg aacctcagtn accgtctcct
//
LOCUS       MUSIGHABJ     125 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            49-1E7, partial cds.
ACCESSION   M33409
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1E7, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 125)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   125     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       87       88     V-region end/D-region start
    recomb       90       91     D-region end/JH4-region start
BASE COUNT       32 a     42 c     26 g     24 t      1 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag
       61 gacactgcng tctattactg tgctagctgg gactactggg gcaaggcacc actctcacag
      121 tctcc
//
LOCUS       MUSIGHABK      91 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            50-2G3, partial cds.
ACCESSION   M33410
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 50-2G3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 91)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >    91     Ig H-chain V-D-JH4-region (AA at 3)
    recomb       29       30     V-region end/D-region start
    recomb       47       48     D-region end/JH4-region start
BASE COUNT       21 a     18 c     23 g     24 t      5 others
ORIGIN      Chromosome 12.
        1 gagacactnc ngtctattac tgtgctagag gtactacggt agtagtctat gctatggact
       61 actggggtna aggaacctna gtnaccgtct c
//
LOCUS       MUSIGHABL     107 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            50-6D8, partial cds.
ACCESSION   M33411
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 50-6D8, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 107)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   107     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       48       49     V-region end/D-region start
    recomb       57       58     D-region end/JH4-region start
BASE COUNT       26 a     28 c     24 g     28 t      1 others
ORIGIN      Chromosome 12.
        1 ctcagcagcc taacatctga ggacactgcn gtctattact gtgctagtag gtacgattac
       61 tatgctatgg actactgggg tcaaggaacc tcagtcaccg tctctca
//
LOCUS       MUSIGHADE     404 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain gene V-region, Id+ hybridoma
            C16-15F6, partial cds.
ACCESSION   M33390
KEYWORDS    immunoglobulin heavy chain; processed gene; variable region.
SOURCE      Mouse (strain Balb.b) adult Id+ hybridoma C16-15F6 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 404)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       20     Ig H-chain V-region, exon 1 (AA at 2)
                100  >   404     Ig H-chain V-region, exon 2
    IVS          21       99     Ig H-chain V-region intron A
BASE COUNT      105 a    100 c    106 g     91 t      2 others
ORIGIN      Chromosome 12.
        1 cctgatggca gtggttacag gtaaggggct ccnaagtccc aaacttgagg gtccataaac
       61 tctgtgacag tggcaatcac tttgcctttc tttctacagg ggtcaattca gaggttcagc
      121 tgcagcagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg tcctgcacag
      181 cttctggctt caacattaaa gacacctata tgcactgggt gaagcagagg cctgaacagg
      241 gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat gacccgaagt
      301 tccagggnaa ggccactata acagcagaca catcctccaa cacagcctac ctgcagctca
      361 gcagcctgac atctgaggac actgccgtct attactgtgc taga
//
LOCUS       MUSIGHADF     141 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            J7-11H3, partial cds.
ACCESSION   M33400
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) adult Id+ hybridoma J7-11H3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 141)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   141     Ig H-chain V-D-JH4-region (AA at 1)
    recomb       87       88     V-region end/D-region start
    recomb       96       97     D-region end/JH4-region start
BASE COUNT       36 a     43 c     28 g     29 t      5 others
ORIGIN      Chromosome 12.
        1 ataacagcag acacatcctc caacacagcc tacctgcagc tnagcagcct gacatctgag
       61 gacactgcng tctattactg tgctaggagt agctacgact actttgacta ctggggnnna
      121 ggcaccactc tcacagtctc c
//
LOCUS       MUSIGHZD      234 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma
            C16-15F6, partial cds.
ACCESSION   M17008 M33398
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) adult Id+ hybridoma C16-15F6, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 234)
  AUTHORS   Busto,P., Gerstein,R., Dupre,L., Giorgetti,C.A., Selsing,E. and
            Press,J.L.
  TITLE     Molecular analysis of heavy and light chains used by primary and
            secondary anti-(T,G)-A--L antibodies produced by normal and Xid
            mice
  JOURNAL   J. Immunol. 139, 608-618 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 82 to 234)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   234     Ig H-chain V-D-JH4-region (AA at 1)
    recomb      168      169     V-region end/D-region start
    recomb      186      187     D-region end/JH4-region start
BASE COUNT       65 a     61 c     58 g     47 t      3 others
ORIGIN      Chromosome 12.
        1 cagggccnnn agtggattgg aaggattgat cctgcgaatg gtaatactaa atatgacccg
       61 aagttccagg gcaaggccac tataacagca gacacatcct ccaacacagc ctacctgcag
      121 ctcagcagcc tgacatctga ggacactgcc gtctattact gtgctagaag agggggcaat
      181 ggtaactatg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctca
//
LOCUS       MUSIGHZL      291 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse Ig germline H-chain gene V-region, Id+ hybridoma
            29-1G5, partial cds.
ACCESSION   M33391
KEYWORDS    germline; immunoglobulin heavy chain; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   291     Ig H-chain V-region (AA at 1)
BASE COUNT       77 a     69 c     70 g     57 t     18 others
ORIGIN      Chromosome 12.
        1 gaggntcagc tncagnagtc tggggnagag ctngtnaagc caggngcctc agtcaagttg
       61 tcctgcacag cttctnnnnn naacattaaa gacacctata tgcactgggt naagcagagg
      121 cctnaacagg gcctngagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
      181 gacccgaagt tccagggnaa ggccactata acagcagaca catcctccaa cacagcctac
      241 ctgcagctca gcagcctgac atctgaggac actgcngtct attactgtgc t
//
LOCUS       MUSIGHZM      291 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse Ig germline H-chain gene V-region, Id+ hybridoma
            23-9D10, partial cds.
ACCESSION   M33392
KEYWORDS    germline; immunoglobulin heavy chain; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 23-9D10 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   291     Ig H-chain V-region (AA at 1)
BASE COUNT       77 a     71 c     75 g     60 t      8 others
ORIGIN      Chromosome 12.
        1 gaggttcagc tgnagnagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg
       61 tcctgcacag cttctgnntt caacattaaa gacacctata tgcactgggt gaagcagagg
      121 cctnaacagg gcctngagtg gattggaagg atngatcctg cgaatggtaa tactaaatat
      181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac
      241 ctgcagctca gcagcctgac atctgaggac actgcngtct attactgtgc t
//
LOCUS       MUSIGHZN      291 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse Ig germline H-chain gene V-region, Id+ hybridoma
            41-4F5, partial cds.
ACCESSION   M33393
KEYWORDS    germline; immunoglobulin heavy chain; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 41-4F5 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   291     Ig H-chain V-region (AA at 1)
BASE COUNT       77 a     71 c     76 g     60 t      7 others
ORIGIN      Chromosome 12.
        1 gaggntcagc tgcagnagtc tggggnagag cttgtgaagc caggggcctc agtcaagttg
       61 tcctgcacag cttctggntt caacattaaa gacacctata tgcactgggt naagcagagg
      121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
      181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac
      241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc t
//
LOCUS       MUSIGHZO      294 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse Ig germline H-chain gene V1A-J2-region, Id+ hybridoma
            49-1G5, partial cds.
ACCESSION   M33394
KEYWORDS    germline; immunoglobulin heavy chain; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1G5 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 294)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   294     Ig H-chain V-region (AA at 1)
BASE COUNT       78 a     69 c     76 g     59 t     12 others
ORIGIN      Chromosome 12.
        1 gaggttcagc tgcagcagtc tggggcagan cttgtnaagc caggggcctc agtcaagttg
       61 tcctgcacag cttctggnnt naacattaaa gacacctata tgcactgggt gaagcagagg
      121 cctnaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
      181 gacnngaagt tcnagngcaa ggccactata acagcagaca catcctccaa cacagcctac
      241 ctgcagctca gcagcctgac atctgaggac acngcngtct attactgtgc tagg
//
LOCUS       MUSIGHZP      294 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse Ig germline H-chain gene V-region, Id+ hybridoma
            23-10E7, partial cds.
ACCESSION   M33395
KEYWORDS    germline; immunoglobulin heavy chain; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 23-10E7 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 294)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   294     Ig H-chain V-region (AA at 1)
BASE COUNT       80 a     72 c     78 g     60 t      4 others
ORIGIN      Chromosome 12.
        1 gaggttcagc tgcagnagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg
       61 tcctgcacag cttctggnnt caacattaaa gacacctata tgcactgggt gaagcagagg
      121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
      181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac
      241 ctgcagctca gcagcctaac atctgaggac actgcngtct attactgtgc taga
//
LOCUS       MUSIGHZQ      291 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse Ig germline H-chain gene V-region, Id+ hybridoma
            44-2F5, partial cds.
ACCESSION   M33396
KEYWORDS    germline; immunoglobulin heavy chain; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 44-2F5 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 291)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   291     Ig H-chain V-region (AA at 1)
BASE COUNT       77 a     72 c     76 g     59 t      7 others
ORIGIN      Chromosome 12.
        1 gaggttcagc tgcagcagtc tggggnagag cttgtgaagc caggggcctc agtcaagttg
       61 tcctgcacag cttctggnnt caacattaaa gacacctata tgcactgggt gaagcagagg
      121 cctnaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
      181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac
      241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc n
//
LOCUS       MUSIGHZT      294 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse Ig germline H-chain gene V-region, Id+ hybridoma
            25-5E10, partial cds.
ACCESSION   M33397
KEYWORDS    germline; immunoglobulin heavy chain; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 25-5E10 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 294)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   294     Ig H-chain V-region (AA at 1)
BASE COUNT       79 a     73 c     78 g     60 t      4 others
ORIGIN      Chromosome 12.
        1 gaggttcagc tgcagcagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg
       61 tcctgcacag cttctggcnt caacattaaa gacacctata tgcactgggt gaagcagagg
      121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat
      181 gacccgaagt tcnagggcaa ggccactata acagcagaca catcctccaa cacagcctac
      241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc taga
//
LOCUS       MUSIGKABO     339 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain gene V1A-J2-region, Id+ hybridoma
            29-1G5, partial cds.
ACCESSION   M33389
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 339)
  AUTHORS   Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E.,
            Zhukovsky,E. and Press,J.L.
  TITLE     Neonatal and adult primary B cells use the same germ-line V-H and
            V-kappa genes in their (T,G)-A-L-specific repertoire
  JOURNAL   J. Immunol. 144, 583-592 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   339     Ig kappa-chain V1A-J2-region (AA at 1)
    recomb      300      301     V1A-region end/J2-region start
BASE COUNT       88 a     84 c     83 g     82 t      2 others
ORIGIN      Chromosome 6.
        1 gatgttgtga tgacccaaac tccnctctcc ctgcctgtca gtcttggaga tcaagcctcc
       61 atctcttgca gatctagtca gagccttgta cacagtaatg gaaacaccta tttacattgg
      121 tacctgcaga agccaggcca gtctccaaag ctcctgatct acaaagtttc caaccgattt
      181 tctggggtcc cagacaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc
      241 agcagagngg aggctgagga tctgggagtt tatttctgct ctcaaagtac acatgttccg
      301 tacacgttcg gaggggggac caagctggaa ataaaacgg
//
LOCUS       M36836       2651 bp   -                UNA       24-JUL-1990
DEFINITION  Figure 1. Nucleotide sequence of the characterized locust mtDNA
            segment.
ACCESSION   M36836
REFERENCE   1  (bases 1 to 2651)
  AUTHORS   Haucke,H.-R. and Gellissen,G.
  TITLE     different mitochondrial gene orders amino insects: exchanged trna
            gene positions in the coii/coiii region between an orthopteran and
            a dipteran species
  JOURNAL   Curr. Genet. 14, 471-476 (1988)
  STANDARD  unannotated staff_entry
BASE COUNT     1034 a    419 c    303 g    895 t
ORIGIN      
        1 cacagatact cagaactacc actaattaat ttatagattc taatatggca gattaatgca
       61 ttagatttaa gctctaaaaa taaagttttg accttttatt agaaaaatat ggcaacatga
      121 tcaaacttat cattacaaga tggagcttca ccattaatag aacaattatc attctttcat
      181 gatcatacaa taattgatct attattaatc acaataattg taggatattc gcttagatat
      241 atattattaa caaaatatac aaaccgaaat atacttcatg gtcatttaat tgaaactatc
      301 tgaacagcac taccagcaat tacattaatc tttattgcat taccatcatt acgattacta
      361 tatttacttg atgattcatc agatgctata attacaatta aaacaattgg acgacaatga
      421 tactgaagtt atgaatattc agatttcatt aatgtagaat ttgatacata tataactcct
      481 gaaaatgaat taaatacaga tgaattccga ctcttagaag ttgataatcg aacaacatta
      541 cctataaata cagaagtacg agtattaact agagcatctg atgtactcca ctcatgagca
      601 gtaccagctt tagtattaaa aattgacgca acacccggac gactaaacca aggtatattt
      661 ataattaacc gcccaggtct atttttcggt caatgctcag aaatctgtgg agctaatcat
      721 agatttatac caattgtaat tgaaagaaca tcaattaaac ttttcattaa atgattatct
      781 aacataatat aaggagttag ttaaaatata acattagaat gtcaatctaa aataactaaa
      841 aatagtacac cttgaaccat cagatgactg aaagtaagta atggtctctt aaaccaaaaa
      901 atagtaaatt aacgtttact tctgatgagg taaattaaaa acttaatccc tcaaatatca
      961 ccaataatat gattttcact attcattata ttctcaataa caataatatt atttaatcaa
     1021 ttaaacttct tctcttataa accaaataaa attataagaa gtaataataa aattaaaaaa
     1081 aaaaacatta attgaatatg ataacaaatc tattctcaac attcgaccca tcaactaatt
     1141 tatttaattt atcattaaat tgaactagaa cattccttgg actattatta attccatcaa
     1201 tattttgatt aataccttct cgaattaata ttttatgaaa taaaataaat ctaaacttac
     1261 ataatgaatt taaaacatta ttaggaaaaa actcatttca aggatcaaca ctaattctaa
     1321 tttcaatttt tatcataata ttatttaaca attttatagg attattccca tacattttta
     1381 caagaactag tcatataaca ttaacattct caattgcatt accaatatga ataagattta
     1441 tattatttgg atgaattaat aatactaatc atatatttac acatcttgtc ccacaaggaa
     1501 caccaaatgc acttatatca tttatggttc taattgaaac aattagaaat gttattcggc
     1561 cagggaccct agctgtacga ttagctgcaa atatgattgc aggacatctt ttattaacat
     1621 tattaggaaa tacaggaccg tcactaacaa caagaattat attatttcta attattggac
     1681 aaatattact tttaattctt gaatcagccg tagctataat tcaagcatac gtattctcaa
     1741 ttctaagaac attatattca agagaagttt actaaactta tgttaacaaa taacaacaat
     1801 cacccatttc atatagtaga ttatagacca tgaccattaa ctggagcaat cggagcaata
     1861 attctaacct caggaataac taaatgattt catacattca atataaattt attaataatt
     1921 ggaataactg tcattgtatt aacaataatt caatggtgac gagatgtagt tcgagaaggt
     1981 acatttcaag gtctacacac taaattagta tcaaaaggac tgcgatgagg aataatttta
     2041 tttattgcat cagaagtttt atttttcgcc tcattcttct gagcattctt taatagtaga
     2101 ttagcaccaa caattgaact aggaataaaa tgacctccaa taggaattca accatttaat
     2161 ccaatccaaa ttccactact taatactgca attttactag cgtcaggagt tacaattaca
     2221 tgagcacatc acagaatcat agaatgtaat cattcacaag cattacaagg attattcttt
     2281 acagtaatat taggatttta ttttacttta ttacaaatat acgaatattg agaagcacct
     2341 tttaccattg cagacgcagt atatggatct acattcttcg tagcaacagg attccatggg
     2401 ctacatgtaa tcattggtac aacattctta ttaacatgcc taattcgaca tataataaac
     2461 caattctcat ctaatcacca ctttggattt gaagctgcag catgatactg acactttgtt
     2521 gatgtagtat gattattttt atatttatca atttactgat gaggtagata attaattttt
     2581 ctagtataat tagtacattt gacttccaat caaaagattg aaatatcaag aaaaataatt
     2641 ataatattaa t
//
LOCUS       ECOFPP       1950 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  E.coli dipeptide transport protein (fpp) gene, complete cds.
ACCESSION   M35045
KEYWORDS    fpp gene; periplasmic protein.
SOURCE      E.coli (strain K12; isolate E1222) DNA, clone lambda-10.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1950)
  AUTHORS   Olson,E.R., Dunyak,D.S., Jurss,L.M. and Poorman,R.A.
  TITLE     Identification and characterization of fpp, an E.coli gene involved
            in dipeptide transport
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.R.Olson, 04-JUN-1990.
            
               Author address: E.R.Olson
               Molecular Biology Research, 7242-209-7
               The UpJohn Company
               Kalamazoo, MI 49001
FEATURES       from  to/span     description
    pept        147     1754     dipeptide transport protein (fpp) precursor
    sigp        147      230     dipeptide transport protein signal peptide
    matp        231     1751     dipeptide transport protein
    site       1792     1888     REP sequence
BASE COUNT      548 a    499 c    491 g    412 t
ORIGIN      79.5 min on the K12 map.
        1 ggatccgcac tgttacactg atgttaatta gtacggcatc cccacctcat aacgttgacc
       61 cgaccgggca aaaaacaaaa aaggtcaggc agcgacaacc cactgcaaag ggttaaaaca
      121 acaaacatca caattggagc agaataatgc gtatttcctt gaaaaagtca gggatgctga
      181 agcttggtct cagcctggtg gctatgaccg tcgcagcaag tgttcaggct aaaactctgg
      241 tttattgctc agaaggatct ccggaagggt ttaacccgca gctgtttacc tccggcacca
      301 cctatgacgc ctcttccgtc ccgctttata accgtctggt tgaatttaaa atcggcacca
      361 ccgaagtgat cccgggcctc gctgaaaagt gggaagtcag cgaagacggt aaaacctata
      421 ccttccatct gcgtaaaggt gtgaagtggc acgacaataa agaattcaaa ccgacgcgtg
      481 aactgaacgc cgatgatgtg gtgttctcgt tcgatcgtca gaaaaacgcg caaaacccgt
      541 accataaagt ttctggcggc agctacgaat acttcgaagg catgggcttg ccagagctga
      601 tcagtgaagt gaaaaaggtg gacgacaaca ccgttcagtt tgtgctgact cgcccggaag
      661 cgccgttcct cgctgacctg gcaatggact tcgcctctat tctgtcaaaa gaatatgctg
      721 atgcgatgat gaaagccggt acaccggaaa aactggacct caacccaatc ggaaccggtc
      781 cgttccagtt acagcagtat caaaaagatt cccgtatccg ctacaaagcg tttgatggct
      841 actggggcac caaaccgcag atcgatacgc tggttttctc tattacccct gacgcttccg
      901 tgcgttacgc gaaattgcag aagaatgaat gccaggtgat gccgtacccg aacccggcag
      961 atatcgctcg catgaagcag gataaatcca tcaatctgat ggaaatgccg gggctgaacg
     1021 tcggttatct ctcgtataac gtgcagaaaa aaccactcga tgacgtgaaa gttcgccagg
     1081 ctctgaccta cgcggtgaac aaagacgcga tcatcaaagc ggtttatcag ggcgcgggcg
     1141 tatcagcgaa aaacctgatc ccgccaacca tgtggggcta taacgacgac gttcaggact
     1201 acacctacga tcctgaaaaa gcgaaagcct tgctgaaaga agcgggtctg gaaaaaggtt
     1261 tctccatcga cctgtgggcg atgccggtac aacgtccgta taacccgaac gctcgccgca
     1321 tggcggagat gattcaggca gactgggcga aagtcggcgt gcaggccaaa attgtcacct
     1381 acgaatgggg tgagtacctc aagcgtgcga aagatggcga gcaccagacg gtaatgatgg
     1441 gctggactgg cgataacggg gatccggata acttcttcgc caccctgttc agctgcgccg
     1501 cctctgaaca aggctccaac tactcaaaat ggtgctacaa accgtttgaa gatctgattc
     1561 aaccggcgcg tgctaccgac gaccacaata aacgcgttga actgtacaaa caagcgcagg
     1621 tggtgatgca cgatcaggct ccggcactga tcatcgctca ctccaccgtg tttgaaccgg
     1681 tacgtaaaga agttaaaggc tatgtggttg atccattagg caaacatcac ttcgaaaacg
     1741 tctctatcga ataattaaaa gccatacaag actgatggca aaggcaaaaa tgcctgatgc
     1801 gctccgctta tcaggcctac gaaaattctg caatgtattg aatttgcacg attttgtagg
     1861 ccggataagg cgttaacgcg catccggcat aaacaaagcg cactttgtca acaatctgta
     1921 tacccggtgg cgctgtgcct ctgcgtgcag
//
LOCUS       ECOSHL       1820 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  E.coli shl gene, complete cds, and acetolactate synthase II (ilvH)
            gene, 3' end.
ACCESSION   M35034
KEYWORDS    acetolactate synthase.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1820)
  AUTHORS   Leclerc,G., Noel,G. and Drapeau,G.
  TITLE     Molecular cloning, nucleotide sequence and expression of shl
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Drapeau, 01-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1      105     acetolactate synthase (ilvH) (AA at 1)
    pept        117      203     leader protein
    pept        285     1289     shl protein
    signal       66       73     -35 region
    signal       89       95     -10 region
BASE COUNT      446 a    433 c    463 g    478 t
ORIGIN      1 bp upstream HindIII site.
        1 aagcttgatg catttttagc atcgattcgc gatgtggcga aaattgtgga ggttgctcgc
       61 tctggtgtgg tcggactttc gcgcggcgat aaaataatgc gttgagaatg atctcaatgc
      121 gcaatttaca gcccaacatg tcacgttggg ctttttttgc gaaatcagtg ggaacctgga
      181 ataaaagcag ttgccgcagt taattttctg cgcttagatg ttaatgaatt taacccatac
      241 cagtacaatg gctatggttt ttacatttta cgcaaggggc aattgtgaaa ctggatgaaa
      301 tcgctcggct ggcgggagtg tcgcggacca ctgcaagcta tgttattaac ggcaaagcga
      361 agcaataccg tgtgagcgac aaaaccgttg aaaaagtcat ggctgtggtg cgtgagcaca
      421 attaccaccc gaacgccgtg gcagctgggc ttcgtgctgg acgcacacgt tctattggtc
      481 ttgtgatccc cgatctggag aacaccagct atacccgcat cgctaactat cttgaacgcc
      541 aggcgcggca acggggttat caactgctga ttgcctgctc agaagatcag ccagacaacg
      601 aaatgcggtg cattgagcac cttttacagc gtcaggttga tgccattatt gtttcgacgt
      661 cgttgcctcc tgagcatcct ttttatcaac gctgggctaa cgacccgttc ccgattgtcg
      721 cgctggaccg cgccctcgat cgtgaacact tcaccagcgt ggttggtgcc gatcaggatg
      781 atgccgaaat gctggcggaa gagttacgta agtttcccgc cgagacggtg ctttatcttg
      841 gtgcgctacc ggagctttct gtcagcttcc tgcgtgaaca aggtttccgt actgcctgga
      901 aagatgatcc gcgcgaagtg catttcctgt atgccaacag ctatgagcgg gaggcggctg
      961 cccagttatt cgaaaaatgg ctggaaacgc atccgatgcc gcaggcgctg ttcacaacgt
     1021 cgtttgcgtt gttgcaagga gtgatggatg tcacgctgcg tcgcgacggc aaactgcctt
     1081 ctgacctggc aattgccacc tttggcgata acgaactgct cgacttctta cagtgtccgg
     1141 tgctggcagt ggctcaacgt caccgcgatg tcgcagagcg tgtgctggag attgtcctgg
     1201 caagcctgga cgaaccgcgt aagccaaaac ctggtttaac gcgcattaaa cgtaatctct
     1261 atcgccgcgg cgtgctcagc cgtagctaag ccgcgaacaa aaatacgcgc caggtgaatt
     1321 tccctctggc gcgtagagta cgggactgga catcaatatg cttaaagtaa ataagactat
     1381 tcctgactat tattgataaa tgcttttaaa cccgcccgtt aattaactca ccagctgaaa
     1441 ttcacaataa ttaagtgata tcgacagcgc gtttttgcat tattttgtta catgcggcga
     1501 tgaattgccg atttaacaaa cacttttctt tgcttttgcg caaacccgct ggcatcaagc
     1561 gccacacaga cgtaacaagg actgttaacc ggggaagata tgtcctaaaa tgccgctcgc
     1621 gtcgcaaact gacactttat atttgctgtg gaaaatagtg agtcatttta aaacggtgat
     1681 gacgatgagg gattttttct tacagctatt cataacgtta atttgcttcg cacgttggac
     1741 gtaaaataaa caacgctgat attagccgta aacatcgggt tttttacctc ggtatgcctt
     1801 gtgactggct tgacaagctt
//
LOCUS       HUMATCT4A    1742 bp ds-DNA             PRI       24-JUL-1990
DEFINITION  Human T4 surface glycoprotein CD4 gene, complete cds.
ACCESSION   M35160
KEYWORDS    glycoprotein; immunoglobulin supergene family; recognition antigen;
            surface antigen.
SOURCE      Human (haplotype OKT4-negative) DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1742)
  AUTHORS   Hodge,T.W., Sasso,D.R., Steve,J. and McDougal,S.
  TITLE     Humans with OKT4-epitope deficiency have a single base change in
            the CD4 gene resulting in substitutions of TRP-240 for ARG-240
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.W.Hodge, 12-JUN-1990.
            
            A cytosine to thymidine transition at nucleotide position 868
            results in substitution of TRP-240 for ARG-240.
            
               Author address: T.W.Hodge;
               A-25 Bldg 1  Rm 1226
               Centers for Disease Control
               1600 Clifton Rd.
               Atlanta, GA 30333
FEATURES       from  to/span     description
    pept         76     1452     T4 surface glycoprotein precursor
    sigp         76      150     T4 surface glycoprotein signal peptide
    matp        151     1449     T4 surface glycoprotein mature peptide
    mut         792      792     g in OKT4 epitope deficiency; c in wt
BASE COUNT      405 a    491 c    466 g    380 t
ORIGIN      
        1 caagcccaga gccctgccat ttctgtgggc tcaggtccct actgctcagc cccttcctcc
       61 ctcggcaagg ccacaatgaa ccggggagtc ccttttaggc acttgcttct ggtgctgcaa
      121 ctggcgctcc tcccagcagc cactcaggga aagaaagtgg tgctgggcaa aaaaggggat
      181 acagtggaac tgacctgtac agcttcccag aagaagagca tacaattcca ctggaaaaac
      241 tccaaccaga taaagattct gggaaatcag ggctccttct taactaaagg tccatccaag
      301 ctgaatgatc gcgctgactc aagaagaagc ctttgggacc aaggaaactt ccccctgatc
      361 atcaagaatc ttaagataga agactcagat acttacatct gtgaagtgga ggaccagaag
      421 gaggaggtgc aattgctagt gttcggattg actgccaact ctgacaccca cctgcttcag
      481 gggcagagcc tgaccctgac cttggagagc ccccctggta gtagcccctc agtgcaatgt
      541 aggagtccaa ggggtaaaaa catacagggg gggaagaccc tctccgtgtc tcagctggag
      601 ctccaggata gtggcacctg gacatgcact gtcttgcaga accagaagaa ggtggagttc
      661 aaaatagaca tcgtggtgct agctttccag aaggcctcca gcatagtcta taagaaagag
      721 ggggaacagg tggagttctc cttcccactc gcctttacag ttgaaaagct gacgggcagt
      781 ggcgagctgt ggtggcaggc ggagagggct tcctcctcca agtcttggat cacctttgac
      841 ctgaagaaca aggaagtgtc tgtaaaatgg gttacccagg accctaagct ccagatgggc
      901 aagaagctcc cgctccacct caccctgccc caggccttgc ctcagtatgc tggctctgga
      961 aacctcaccc tggcccttga agcgaaaaca ggaaagttgc atcaggaagt gaacctggtg
     1021 gtgatgagag ccactcagct ccagaaaaat ttgacctgtg aggtgtgggg acccacctcc
     1081 cctaagctga tgctgagctt gaaactggag aacaaggagg caaaggtctc gaagcgggag
     1141 aaggcggtgt gggtgctgaa ccctgaggcg gggatgtggc agtgtctgct gagtgactcg
     1201 ggacaggtcc tgctggaatc caacatcaag gttctgccca catggtccac cccggtgcag
     1261 ccaatggccc tgattgtgct ggggggcgtc gccggcctcc tgcttttcat tgggctaggc
     1321 atcttcttct gtgtcaggtg ccggcaccga aggcgccaag cagagcggat gtctcagatc
     1381 aagagactcc tcagtgagaa gaagacctgc cagtgccctc accggtttca gaagacatgt
     1441 agccccattt gaggcacgag gccaggcaga tcccacttgc agcctcccca ggtgtctgcc
     1501 ccgcgtttcc tgcctgcgga ccagatgaat gtagcagatc ccacgctctg gcctcctgtt
     1561 cgtcctccct acaatttgcc attgtttctc ctgggttagg ccccggcttc actggttgag
     1621 tgttgctctc tagtttccag aggcttaatc acaccgtcct ccacgccatt tccttttcct
     1681 tcaagcctag cccttctctc attatttctc tctgaccctc tccccactgc tcatttggat
     1741 cc
//
LOCUS       HUMFUSED      555 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Human interleukin-1beta/growth hormone fusion protein gene,
            complete cds.
ACCESSION   M35049
KEYWORDS    growth hormone; interleukin-1-beta.
SOURCE      Human synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 555)
  AUTHORS   Pecceu,R., Dousset,P., Shire,D., Cavrois,E., Marchese,E.,
            Ferrara,P., Kaghad,M., Dumont,X. and Lupker,J.
  TITLE     Human interleukin-1-beta fused to the human growth hormone signal
            peptide is N-glycosylated and secreted by Chinese hamster ovary
            cells
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Pecceu, 08-JUN-1990.
            
               Author address: F.Pecceu
               SANOFI ELF BIO-RECHERCHES
               LABEGE INNOPOLE B.P. 137
               31328 LABEGE CEDEX
               FRANCE
FEATURES       from  to/span     description
    pept         11      550     human growth hormone/interleukin--1beta fusion
                                 protein
    sigp         11       88     human growth hormone signal peptide
    matp         89      547     interleukin-1-beta
BASE COUNT      154 a    142 c    135 g    124 t
ORIGIN      
        1 agcttccgcc atggctacag gctcccggac gtccctgctc ctggcttttg gcctgctctg
       61 cctgccctgg cttcaagagg gcagtgccgc acctgtaaga tctctgaact gcacgctccg
      121 ggactcacag caaaagagct tggtgatgtc tggtccatat gaactgaaag ctctccacct
      181 gcagggacag gatatggagc aacaagtggt gttctccatg tcctttgtcc aaggagaaga
      241 aagtaatgac aaaatacctg tggcgttggg cctcaaggaa aagaatctgt acttaagctg
      301 cgtgttgaaa gatgataagc ccactctaca gctggagagt gtagacccca aaaattaccc
      361 aaagaagaag atggaaaagc gatttgtctt caacaagata gaaatcaata acaagctgga
      421 atttgagtct gcccagttcc ccaactggta catcagcacc tctcaagcag aaaacatgcc
      481 cgtcttcctg ggtggtacca aaggcggcca ggatataact gacttcacca tgcaatttgt
      541 gagctcctag gatcc
//
LOCUS       HUMGLYA      1200 bp ds-DNA             PRI       24-JUL-1990
DEFINITION  Human Miltenberger V-like glycophorin gene, exons 3 and 4.
ACCESSION   J05563
KEYWORDS    glycophorin A; transmembrane protein.
SOURCE      Human leukocyte DNA, clone Mi51.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1200)
  AUTHORS   Kudo,S., Chagnovich,D., Rearden,A., Mattei,M.-G. and Fukuda,M.
  TITLE     Molecular analysis of a hybrid gene encoding human glycophorin
            variant Miltenberger V-like molecule
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Kudo and M.Fukuda, 04-JUN-1990.
FEATURES       from  to/span     description
    pept    /    81      176     Miltenberger V-like glycophorin, exon 3 (AA at
                                 81)
               1110  /  1148     Miltenberger V-like glycophorin, exon 4
    IVS     <     1       80     Miltenberger V-like glycophorin intron B
    IVS         177     1109     Miltenberger V-like glycophorin intron C
    site       1079     1116     crossing-over site
BASE COUNT      387 a    236 c    212 g    365 t
ORIGIN      
        1 agatttgtct ttcatgatac gctctatgtc cacgcagtca cctcattctt gacccctttc
       61 tcaacttctc ttatatgcag atacgcacaa acgggacaca tatgcagcca ctcctagagc
      121 tcatgaagtt tcagaaattt ctgttagaac tgtttaccct ccagaagagg aaaccggtat
      181 gttcttagtt ttaaatagtt gctctggagt cattgttgtg attgaactct gtttacacga
      241 gctgtaactc atgacagttc tcaagctttc gtgacagaaa acccatctct tttactccaa
      301 agcccatata gcacccacaa ctattaactg tgaccaagaa agagaaggca agccccaatt
      361 aacctttgta cgtaaagcct aaagaatgaa aaaatatacc tgaatcctca atcatcaaac
      421 agcatagtat atactaagta atttgtaata attaaactct agaaaattgt gtggcttcgg
      481 tagtaagaga gcttcatgac ataaaatggc aagtggagac agagacaaaa gtaggatgtg
      541 gactgagagg gaaggttagc acaggtggaa cagtaaggca accatactat caattgctac
      601 tgacatagaa tccagagaga ctattggcaa aagctcaaat gagacacagt aacagtttag
      661 attcagacag tggctgtggc ataaatcaga aaattgatag ccgcatgatc cctctttgca
      721 tgggactggc atctgtgtgg agtaatggtt ccatatgcct cctttcttct ccttattttt
      781 aaatttttta aaaatgcatt gcttcttgtg gaagtcaata agtgattctt ccaatacttt
      841 ctcattcctt ccccctcagt tatgagacaa tttgcttatt tctcatccat gaatacttgt
      901 tgggtcatta aaagtagata ctgaaattac taatggtacg actgacatat tacctcataa
      961 atgttactag ctagatgttg aaagttgacc aacaactctc aaaatatgat taagaaaagg
     1021 aaacccacag aacagtttga ttccaaaatg atttttttct ttgcacatgc cttacttatt
     1081 tggacttaca ttgaaatttt gctttatagg agaaatggga caacttgtcc atcgtttcac
     1141 tgtaccaggt atgttaatat ttgacaaaga ataaaagtca ttccatttta aactatccat
//
LOCUS       HUMMHDRB     1172 bp ds-DNA             PRI       24-JUL-1990
DEFINITION  Human MHC class II HLA-DR-beta DR2 gene, complete cds.
ACCESSION   M35159
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human (black with haplotype DR2) DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1172)
  AUTHORS   Demopulos,J.T., Hodge,T.W., Wooten,V. and Acton,R.T.
  TITLE     A novel DRB1 allele in DR2-positive American blacks
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.W.Hodge, 12-JUN-1990.
            
            A thymidine to cytosine transition at nucleotide position 88
            results in an amino acid change from tyrosine to histidine at
            residue 30.
            
               Author address: T.W.Hodge;
               A-25 Bldg 1  Rm 1226
               Centers for Disease Control
               1600 Clifton Rd.
               Atlanta, GA 30333
FEATURES       from  to/span     description
    pept          6      806     MHC HLA-DR-beta chain precursor
                                 /hgml_locus_uid="LV0063D" /nomgen="HLA-DRB1"
                                 /map="6p21.3"
    sigp          6       92     MHC HLA-DR-beta chain signal peptide
    matp         93      803     MHC-HLA-DR-beta chain
BASE COUNT      272 a    315 c    324 g    261 t
ORIGIN      
        1 ccagcatggt gtgtctgaag ctccctggag gttcctacat ggcaaagctg acagtgacac
       61 tgatggtgct gagctcccca ctggcttcgg ctggggacac ccgaccacgt ttcttgcagc
      121 aggataagta tgagtgtcat ttcttcaacg ggacggagcg ggtgcggttc ctgcacagag
      181 acatctataa ccaagaggag gacttgcgct tcgacagcga cgtgggggag taccgggcgg
      241 tgacggagct ggggcggcct gacgctgagt actggaacag ccagaaggac ttcctggaag
      301 acaggcgcgc cgcggtggac acctactgca gacacaacta cggggttggt gagagcttca
      361 cagtgcagcg gcgagttgag cctaaggtga ctgtgtatcc tgcaaggacc cagaccctgc
      421 agcaccacaa cctcctggtc tgctctgtga atggtttcta tccaggcagc attgaagtca
      481 ggtggttccg gaacagccag gaagagaagg ctggggtggt gtccacaggc ctgattcaga
      541 atggagactg gaccttccag accctggtga tgctggaaac agttcctcga agtggagagg
      601 tttacacctg ccaagtggag cacccaagcg tgacgagccc tctcacagtg gaatggagag
      661 cacagtctga atctgcacag agcaagatgc tgagtggagt cgggggcttt gtgctgggcc
      721 tgctcttcct tggggccggg ctattcatct acttcaagaa tcagaaaggg cactctggac
      781 ttcacccaac aggactcgtg agctgaagtg cagatgacca cattcaaggg ggaaccttct
      841 gccccagctt tgcatgatga aaagctttcc tgcttggctc ttattcttcc acaagagagg
      901 actttctcag gccctggttg ctaccggttc agcaactctg cagaaaatgt ccatccttgt
      961 ggcttcctca gctcctgccc cttggcctga agtcccagca ttgatggcag tgcctcatct
     1021 tcaactttag tgctcccctt tacctaaccc tacggcctcc catgcatctg tactccccct
     1081 gtgtgccaca aatgcactac gttattaaat ttttctgaag cccagagtta aaaatcatct
     1141 gtccacctgg ctccaaagac aaaaaataaa aa
//
LOCUS       MUM3TER        55 bp ss-RNA             VRL       24-JUL-1990
DEFINITION  Mumps virus genome 3' terminus mRNA.
ACCESSION   M35284
KEYWORDS    .
SOURCE      Mumps virus (strain SBL-1), cDNA to mRNA, passed in Vero cells.
  ORGANISM  Mumps virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Paramyxoviridae; Paramyxovirus.
REFERENCE   1  (bases 1 to 55)
  AUTHORS   Elango,N., Varsanyi,T.M., Koevamees,J. and Norrby,E.
  TITLE     Molecular cloning and characterization of six genes, determination
            of gene order and intergenic sequences and leader sequence of mumps
            virus
  JOURNAL   J. Gen. Virol. 69, 2893-2900 (1988)
  STANDARD  simple staff_review
BASE COUNT        9 a     17 c      4 g     24 t      1 others
ORIGIN      
        1 nggttcccct ctttcttcta ccctataacc atcttgttta tcacattctt tgtca
//
LOCUS       MUSIGCR      2210 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Murine Ig germline membrane-bound gamma 2a-chain gene C region,
            exon 2.
ACCESSION   M35032
KEYWORDS    constant region; gamma-immunoglobulin; germline;
            immunoglobulin heavy-chain.
SOURCE      Murine (strain BALB/c) myeloma cell line RPC 5 DNA, clone
            CH.M.IgG2a-11.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2210)
  AUTHORS   Hall,B. and Milcarek,C.
  TITLE     Sequence and polyadenylation site determination of the murine
            immunoglobulin gamma 2a membrane 3' UT region
  JOURNAL   Mol. Immunol. 26, 819-826 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.L.Hall, 08-JUN-1990.
FEATURES       from  to/span     description
    pept    /    37      120     IgG-2a chain, exon 2
    pre-msg <     1     1682     IgG-2a chain mRNA (alt.)
    pre-msg <     1     1759     IgG-2a chain mRNA (alt.)
    IVS     <     1       36     IgG-2a chain intron A (no splice consensus)
    signal     1506     1511     major Poly-A signal
    signal     1585     1590     minor Poly-A signal
    rpt        1270     1305     (GA)17 repeats
    rpt        1395     1440     (CT)23 repeats
    site       1536     1543     McLauchlan consensus sequence
    site       1552     1559     McLauchlan consensus sequence
    site       1625     1633     McLauchlan consensus sequence
    site       1536     1540     Berget consensus sequence
BASE COUNT      596 a    543 c    557 g    514 t
ORIGIN      
        1 aagagaagcg ccaagagcac agcgccgact cctaccgtaa agtggatctt ctcctctgtg
       61 gtggagctga agcagacgat ctcccctgac tacagaaaca tgattgggca gggagcctag
      121 gccacttcct ctgggatcag aagagcttcc taggccctgc agaagcccat ccatcctact
      181 gtgcagccta acagggaggc cacactctag ccctatgact ctctgatcag aactcccatg
      241 gtctcctctt tggaggacca cgtgcagtgc aggctttgcc cagacctaaa cacttccaca
      301 gcagtcgcca gatatctaac tactccggac cagaagaacc atctccttcc aaaccagcac
      361 tagggatctg agatctcaga atgtttgcct aagaagagct ggaaatccag gcttcctgtg
      421 ttctgctaca aggacatcag cctggatttg acctggacca cacattttca tctaaatgag
      481 ttttccacaa aggacacgtt tcagatcctt gaatgagacc tctacatgga agaccagagt
      541 cactataccc aaaggtcact ctgtatcctt gcaccagcta tactggacag cttccttcct
      601 ggtacttcag tgaccctggc tgaggaaagg atctgtgacc tcaactgttt ggagagcctc
      661 tggaagatgt agtcttctct tcctgctacc accaacatgc tggatctcag atgcagaatc
      721 caatccacag acaccactga ccacacaacc tgaagacaag gccattgcca cctccacaga
      781 gatgccatcc acactctgtg gagaaataag gagtgctttg tgcagcctct gcaaagctct
      841 ggcagggatt agagtataca cactgagtac tgactaggtg accaggcaga aaaacctcca
      901 ggagaaggaa caatggggga gagatgtgaa cagatagtta gaaaaagcat ggtgtcacag
      961 gtctgctctg tggactgatt tccagattgg accacctaca gcagaaacca tcggttgcag
     1021 tggcaatcta ggaggaccaa cctggaatag gagggctgct gtggtcaatg gagagtagac
     1081 ctgtatctat ttctccactg cctcttatga ccaataagaa gccagagtct ccagacagaa
     1141 agaaagaaag aaagaaagaa agaaagaaag aaagaaagag agagagagag agagagagag
     1201 agagaggaag gaaggaagga aggaaggaag gaaggaagga aggaggagga ggaggaggag
     1261 gaggaggagg agagagagag agagagagag agagagagag agagagcacc agcttttctg
     1321 tgactggaag gaaatgctta gagagcttgg atctttaaag cttctttttt ctagagacca
     1381 tgaatgtctt tgttctctct ctctctctct ctctctctct ctctctctct ctctctctct
     1441 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gcgtgcatgc acgctattgt tttggcattt
     1501 gaaacaataa aacattcttt taatattctg tatctcatgg ttccccttct gtgtggatca
     1561 gccctaacac ccaggaacag ggacaataaa cagaccacag ccatgtacag ccttctacct
     1621 cccttctggt tctgacctcc cagaggtccc tcagtgggcc cctcacagct gggtttcttc
     1681 cctggcagtg ccaccaagag ctcaggcacc tctgagctgg aggctgtcct gatgccatag
     1741 gcaggctatg gagcagagat gatgaccacg gtgactcagg tgagccaggc aaagcctccc
     1801 atgccagaag agaagcgtgt ggtactcact ggcctcgggc tgctacggat tcagcaaaga
     1861 gcatggatcc agctacctcc atagtggatg cttattgtct caagctgtgt tcagagaaat
     1921 ttcagatcag gtattaatag ggtgggttct ccttacctgg taggtgcctt ctcccatggc
     1981 acctcacagg ctctccatct gtgtgtctgg gtcctgatct cttctcataa gtacacagac
     2041 aggctggaat ggtacaacct ggcctcatta taacttacca gttatgatcc tatctgcaag
     2101 gcaggctcag tctgaggtct ggcggcttaa atgacaattg gtgtgtcttc tagaacagaa
     2161 tttagtccac attagcgcgc ctgtggacca cagtatatct ctctggcctc
//
LOCUS       YSCARF2      1012 bp ds-DNA             PLN       24-JUL-1990
DEFINITION  S.cerevisiae ADP-ribosylation factor 2 (ARF2) gene, complete cds.
ACCESSION   M35158
KEYWORDS    ADP-ribosylation factor 2.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1012)
  AUTHORS   Stearns,T., Hoyt,M.A., Botstein,D. and Kahn,R.A.
  TITLE     ADP-ribosylation factor is an essential protein in yeast and is
            encoded by two genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.A.Kahn, 05-JUN-1990.
            
               Author address: R.A.Kahn
               Lab. of Biol. Chem., DCT
               Bldg. 37; Room 5D-02
               National Cancer Institute
               Bethesda, MD  20892
FEATURES       from  to/span     description
    pept        266      811     ADP-ribosylation factor 2 (ARF2)
    binding      24       30     consensus GTP-binding site
    binding      67       71     consensus GTP-binding site
    binding     126      129     consensus GTP-binding site
BASE COUNT      301 a    180 c    222 g    309 t
ORIGIN      
        1 gcctgcaggt acgccccttt atttgatcag gaagccgtat tgattatcta atagggccta
       61 gttatcctaa ttgtggggag tcgagcagta cggctctgat gtttttcgaa cgaagataag
      121 gagttgacat acaaagtcaa cagaagttct tcttgttagc gtctctgtgc tcaatatctc
      181 tctttttttc tttaagtagt aattactaac atcagccaac caatagagat aaaaaaaaaa
      241 ggaattaaga tttcatagag aaaagatggg tctatacgct tctaagttat tcagcaatct
      301 ttttggcaac aaagagatgc gtatacttat ggttggtcta gatggtgccg gtaagaccac
      361 cgttttgtac aagttgaagt tgggcgaagt tatcactacc attccaacca ttggtttcaa
      421 cgttgagact gtccaatata agaacatttc cttcactgtc tgggacgtcg gtggacaaga
      481 caggattaga tctttatgga gacactacta cagaaacacc gaaggtgtta tttttgtcat
      541 cgattccaac gatagatcgc gtattggtga agccagagaa gtcatgcaga gaatgctgaa
      601 tgaagatgaa ttgagaaatg ctgtctggtt agtcttcgct aacaaacaag atttgccaga
      661 agccatgtct gctgctgaaa tcaccgaaaa attaggttta cattctatta gaaaccgtcc
      721 atggtttatc cagtctactt gtgcaacctc gggtgaaggt ctgtacgaag gtctggagtg
      781 gttaagcaac aacttgaaga atcaatccta atctaaatct gtatagaacg tttagtcatg
      841 cgcaccttgt gtgttttgtt tctagattgt tttattttta tgattgttga agatataaac
      901 cactgtatag ttgtataaga taggataatg atggtgcact gaaaataaac ttactagctc
      961 tttaatattg caacggcttg taacgggcga ctctagagga tccccgggta cc
//
LOCUS       CODCPRRKA      94 bp ss-RNA             RNA       24-JUL-1990
DEFINITION  Codium fragile chloroplast 4.5S RNA, complete cds.
ACCESSION   M35276 M15192
KEYWORDS    4.5S RNA.
SOURCE      C.fragile chloroplast RNA.
  ORGANISM  Chloroplast Codium fragile
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Brypsidales; Codiaceae; Codium fragile.
REFERENCE   1  (bases 1 to 94)
  AUTHORS   Francis,M.A., Balint,R.F. and Dudock,B.S.
  TITLE     A novel variety of 4.5 S RNA from Codium fragile chloroplasts
  JOURNAL   J. Biol. Chem. 262, 1848-1854 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    RNA           1       94     4.5 S RNA
BASE COUNT       35 a     11 c     14 g     34 t
ORIGIN      
        1 aagtcctagt tgctataaat tcttaaatca aattatgtca gatttttaat aaaaagcagc
       61 atttgtattt gaaaattgtt taggaactag gcac
//
LOCUS       HAMAPBRBD    2339 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Hamster apolipoprotein (apoB) gene, partial cds
            (LDL receptor-binding domain).
ACCESSION   M35187
KEYWORDS    apolipoprotein B.
SOURCE      Hamster DNA.
  ORGANISM  Mesocricetus auratus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 2339)
  AUTHORS   Smith,T.J., Hautamaa,D. and Maeda,N.
  TITLE     Sequence of the putative low-density lipoprotein receptor-binding
            regions of apolipoprotein B in mouse and hamster
  JOURNAL   Gene 87, 309-310 (1990)
  STANDARD  simple staff_review
COMMENT     Phone call to T.J.Smith on 26-JUN-1990 made sequence clarifications
            on line 4 and line 17 of printed sequence.  The hamster sequence
            should be 1 bp to the left on these lines.
FEATURES       from  to/span     description
    pept    <     1  >  2339     apolipoprotein (apoB) (LDL receptor-binding
                                 domain) (AA at 1)
BASE COUNT      725 a    519 c    450 g    645 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccagc ttcctcgcct ctcacacaca attgagatac ctgcttttgg cagacttcat
       61 ggaatcctga aaatccagtc tcccctcttt atattagatg caaatgccaa catacagaat
      121 gtaactactt tagagaacaa agcagagatt gtggcctcca tcgctgctac aggagagtcc
      181 gaaattgaag ctctcaattt tgattttcaa gcacaagctc aattcttgga gctaaaccct
      241 aatcctctga tcctgaagga atccatgaac ttctccagca agcatgcgag aatggagcat
      301 gagggtgaga tactattttc tggaaagttc attgagggaa aattggacac ggtcgcaagt
      361 ttacagacag agaaaaatat ggtggagttt aataatggta tgattgtcaa gataaacaat
      421 ccaatcatcc ttgacagtca cacaaagtat tttcacaagt tgagtatccc caggctggac
      481 ttctccagta aggcttcctt taacaatgaa atcaagatgc tattagaagc tggacatgta
      541 gcatggactt cttcagggac tgggtcatgg aattgggcct gtcccaactt ctcagatgag
      601 ggcacacatt cgtccaaaat tagcttcact gtagaaggac ccattgcttt ttttggcttg
      661 tctaataaca tcaacggcaa acacctgagg gttatccaga aattggctta tgaatctggc
      721 ttcctcaact attccatgtt ggaagttgag tcaaaagttg aatctcagca tgtgggttcc
      781 agcattctaa ctggcaaggg aacggtactg ctcagggagg caaaggcaga aatgactggc
      841 gagcacaatg ctgacttgaa tggaaaagtt attgggactt tgaaaaactc tctttccttt
      901 tcagcacaac catttatgat tactgcatcc acaaataatg atgggaattt gaaagttagt
      961 tttccactaa agttgactgg gaaaatagac ttcctgaata actatgcact atttttgagt
     1021 cctcatgccc agcaagcaag ctggcaagtg agtgctaggt tcaatcagta caaatataat
     1081 caaaattttt ctgctataaa caatgaacat aacatagaag cccatgtagg aatgaatgga
     1141 gatgccaacc tggatttctt aaccatacct ctaacaattc ctgaagtgaa actaccttac
     1201 atagggctca cgactccctt gctgaaggat ttctccatat gggaagaaac aggcttgaaa
     1261 gaatttttga agacaacaaa gcaatcgttt gatttaagtg taaaagctca atataaaaag
     1321 aacagagaca ggcattccat tgcgattcct ctgaatgggt tttatgagtt tattctcaac
     1381 aatgtcgact ccgggatagg gaagattggg aaagtcagag acagcgcatt agactatctt
     1441 atttcatcct ataatgaagc aaaaaacaag tttgaaaatt cccttattca gccctccagg
     1501 acctttcaaa agcgtggata cactatccca tttgtcaaca ttgaagtgac tccattcact
     1561 gtagagacac tggcctccag ccatgtgatc ccaaaagcaa taaatacccc cagtgttcac
     1621 attctgggcc ctaatgtcat tgtgccttca tacaggttag tgctgccctc cctggagctg
     1681 ccagtccttc gtgtccccag gaatctactc aagttttccc tcccagattt caaggaattg
     1741 agaacaattg acaatattta tattccagct cttggcaatt ttacctatga tttttccttt
     1801 aaatcaagtg tcatcacgct gaataccaac gttggacttt ataaccggtc agacatcgtt
     1861 gctcatttcc tttcttcctc ttcatttgtc acggatgccc tgcagtacaa attagagggt
     1921 acttcacgtc tgactcggaa aagaggattg aagctagcca cagccgactc tctcactaac
     1981 aaatttgtaa agggcaatca tgatagcacc tttagcttaa ccaagaaaaa catggaagca
     2041 tcagtgaaaa caactgcaaa cctccatgct cccattttaa caatgaactt caagcaggaa
     2101 cttaatggaa atgccaagtc aaagcccatt gtctcatcat ccattgaact aaactatgac
     2161 ttcaattcct caaagctgta ctctactgct aaaggaggtg ttgaccacaa gtttagctta
     2221 gaaagtctca cttcctactt ttccattgag tcatccacca aaggaaatat caagggatct
     2281 gtcctttccc aggaatattc aggaagtgtt gccagtgagg ccaacacata cctgaattc
//
LOCUS       MUSAPBRBD    2354 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Mouse apolipoprotein (apoB) gene, partial cds (LDL receptor-binding
            domain).
ACCESSION   M35186
KEYWORDS    apolipoprotein B.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2354)
  AUTHORS   Smith,T.J., Hautamaa,D. and Maeda,N.
  TITLE     Sequence of the putative low-density lipoprotein receptor-binding
            regions of apolipoprotein B in mouse and hamster
  JOURNAL   Gene 87, 309-310 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >  2354     apolipoprotein (apoB) (LDL receptor-binding
                                 domain) (AA at 1)
BASE COUNT      757 a    525 c    423 g    649 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccaac ttcctcacct ctcacataca attgaaatac ctgcttttgg caaactgcat
       61 agcatcctta agatccaatc tcctctcttt atattagatg ctaatgccaa catacagaat
      121 gtaacaactt cagggaacaa agcagagatt gtggcttctg tcactgctaa aggagagtcc
      181 caatttgaag ctctcaattt tgattttcaa gcacaagctc aattcctgga gttaaatcct
      241 catcctccag tcctgaagga atccatgaac ttctccagta agcatgtgag aatggagcat
      301 gagggtgaga tagtatttga tggaaaggcc attgagggga aatcagacac agtcgcaagt
      361 ttacacacag agaaaaatga agtagagttt aataatggta tgactgtcaa agtaaacaat
      421 cagctcaccc ttgacagtca cacaaagtac ttccacaagt tgagtgttcc taggctggac
      481 ttctccagta aggcttctct taataatgaa atcaagacac tattagaagc tggacatgtg
      541 gcattgacat cttcagggac agggtcatgg aactgggcct gtcccaactt ctcggatgaa
      601 ggcatacatt cgtcccaaat tagctttact gtggatggtc ccattgcttt tgttggacta
      661 tccaataaca taaatggcaa acacttacgg gtcatccaaa aactgactta tgaatctggc
      721 ttcctcaact attctaagtt tgaagttgag tcaaaagttg aatctcagca cgtgggctcc
      781 agcattctaa cagccaatgg tcgggcactg ctcaaggacg caaaggcaga aatgactggt
      841 gagcacaatg ccaacttaaa tggaaaagtt attggaactt tgaaaaattc tctcttcttt
      901 tcagcacaac catttgagat tactgcatcc acaaataatg aaggaaattt gaaagtgggt
      961 tttccactaa agctgactgg gaaaatagac ttcctgaata actatgcatt gtttctgagt
     1021 ccccgtgccc aacaagcaag ctggcaagcg agtaccagat tcaatcagta caaatacaat
     1081 caaaactttt ctgctataaa caatgaacac aacatagaag ccagtatagg aatgaatgga
     1141 gatgccaacc tggatttctt aaacatacct ttaacaattc ctgaaattaa cttgccttac
     1201 acggagttca aaactccctt actgaaggat ttctccatat gggaagaaac aggcttgaaa
     1261 gaatttttga agacaacaaa gcaatcattt gatttgagtg taaaggctca atataaaaag
     1321 aacagtgaca agcattccat tgttgtccct ctgggtatgt tttatgaatt tattctcaac
     1381 aatgtcaatt cgtgggacag aaaatttgag aaagtcagaa acaatgcttt acattttctt
     1441 accacctcct ataatgaagc aaaaattaag gttgataagt acaaaactga aaattccctt
     1501 aatcagccct ctgggacctt tcaaaatcat ggctacacta tcccagttgt caacattgaa
     1561 gtatctccat ttgctgtaga gacactggct tccaggcatg tgatccccac agcaataagc
     1621 accccaagtg tcacaatccc tggtcctaac atcatggtgc cttcatacaa gttagtgctg
     1681 ccacccctgg agttgccagt tttccatggt cctgggaatc tattcaagtt tttcctccca
     1741 gatttcaagg gattcaacac tattgacaat atttatattc cagccatggg caactttacc
     1801 tatgactttt cttttaaatc aagtgtcatc acactgaata ccaatgctgg actttataac
     1861 caatcagata tcgttgccca tttcctttct tcctcttcat ttgtcactga cgccctgcag
     1921 tacaaattag agggaacatc acgtctgatg cgaaaaaggg gattgaaact agccacagct
     1981 gtctctctaa ctaacaaatt tgtaaagggc agtcatgaca gcaccattag tttaaccaag
     2041 aaaaacatgg aagcatcagt gagaacaact gccaacctcc atgctcccat attctcaatg
     2101 aacttcaagc aggaacttaa tggaaatacc aagtcaaaac ccactgtttc atcatccatt
     2161 gaactaaact atgacttcaa ttcctcaaag ctgcactcta ctgcaacagg aggcattgat
     2221 cacaagttca gcttagaaag tctcacttcc tacttttcca ttgagtcatt caccaaagga
     2281 aatatcaaga gttccttcct ttctcaggaa tattcaggaa gtgttgccaa tgaagccaat
     2341 gtatatctga attc
//
LOCUS       DDINDK        518 bp ss-mRNA            INV       24-JUL-1990
DEFINITION  D.discoideum nucleoside diphosphate kinase Gip17 mRNA, complete
            cds.
ACCESSION   J05457
KEYWORDS    nucleoside diphosphate kinase.
SOURCE      D.discoideum (strain AX3) 3 hour starvation, cDNA to mRNA, clones
            G[821,951].
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 518)
  AUTHORS   Lacombe,M.L., Wallet,V., Troll,H. and Veron,M.
  TITLE     Functional cloning of a nucleoside diphosphate kinase from
            Dictyostelium discoideum
  JOURNAL   J. Biol. Chem. 265, 10012-10018 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.L.Lacombe, 16-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept         19      486     nucleoside diphosphate kinase Gip17 (EC
                                 2.7.4.6)
    mRNA    <     1      518     Gip17 mRNA
    signal      485      490     poly-A signal
BASE COUNT      164 a     97 c     97 g    160 t
ORIGIN      
        1 taacctaata agattaaaat gtccacaaat aaagtaaaca aagaaagaac tttccttgct
       61 gttaaaccag acggtgttgc tcgtggttta gttggtgaaa tcatcgccag atacgaaaag
      121 aaaggtttcg ttttagttgg tttaaaacaa ttagttccaa ccaaagactt agctgaatct
      181 cactatgctg aacacaaaga aagaccattc ttcggtggtt tagtctcatt cattacctct
      241 ggtccagtcg ttgctatggt cttcgaaggt aaaggtgttg ttgcctctgc ccgtttaatg
      301 atcggtgtta ccaacccatt agcctcagcc ccaggttcaa ttcgtggtga tttcggtgtt
      361 gatgttggta gaaacatcat ccacggttct gattcagttg aatctgccaa cagagaaatt
      421 gctttatggt tcaaaccaga agaattatta actgaagtta aaccaaaccc aaatttatac
      481 gaataaataa tttagctaaa ttctaaatca tttaatgt
//
LOCUS       DDINDKA       184 bp ss-mRNA            INV       24-JUL-1990
DEFINITION  D.discoideum nucleoside diphosphate kinase Gip17 mRNA, 5' end.
ACCESSION   M36679
KEYWORDS    nucleoside diphosphate kinase.
SOURCE      D.discoideum (strain AX3) 3 hour starvation, cDNA to mRNA, clone
            G821.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 184)
  AUTHORS   Lacombe,M.L., Wallet,V., Troll,H. and Veron,M.
  TITLE     Functional cloning of a nucleoside diphosphate kinase from
            Dictyostelium discoideum
  JOURNAL   J. Biol. Chem. 265, 10012-10018 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept        137  >   184     nucleoside diphosphate kinase Gip17 (EC
                                 2.7.4.6)
BASE COUNT       77 a     21 c     14 g     72 t
ORIGIN      
        1 atttttgatt ttctttattc ttatttgtaa ttgtatataa tagtaaaaaa aaaaaattct
       61 cagatttata atttattatt atttttcaaa tttcttcaat tctttgtatt aaatattact
      121 caaacataaa ataaaaatgg cctcaaccaa aagaagatta aacagagaag aatcaatgtc
      181 caca
//

GenBank-Updates@genbank.bio.net (07/25/90)

LOCUS       DOGRAB2A      656 bp ss-mRNA            MAM       25-JUL-1990
DEFINITION  C.familiaris GTP-binding protein (rab2) mRNA, complete cds.
ACCESSION   M35521
KEYWORDS    GTP-binding protein.
SOURCE      C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to
            mRNA, clone II.
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 656)
  AUTHORS   Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M.
  TITLE     Localization of low-molecular weight GTP binding proteins to
            exocytic and endocytic compartments
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Chavrier, 22-JUN-1990.
            
            
            Base-pairs 508 to 564 form a synthetic peptide used to raise
            antibodies.
FEATURES       from  to/span     description
    pept          7      645     GTP-binding protein (rab2)
BASE COUNT      209 a    124 c    158 g    165 t
ORIGIN      
        1 gcggccatgg cgtacgctta tctcttcaag tacatcatca tcggcgacac aggtgttggt
       61 aaatcatgct tattgctaca gtttacagac aagaggtttc agccagtgca tgacctgact
      121 atcggtgtag agtttggtgc tcgaatgata actattgatg ggaaacagat aaaacttcag
      181 atatgggata cggcagggca agagtccttt cgttccatca caaggtcata ttacagaggt
      241 gcagcagggg ctttactagt gtatgatatt acaaggagag atacattcaa ccacttgaca
      301 acctggttag aagatgcccg ccagcattcc aattccaaca tggtcattat gcttattgga
      361 aataaaagtg atttagaatc aagaagagaa gtaaaaaaag aagaaggtga agcttttgca
      421 cgagaacatg gacttatctt catggaaact tctgctaaga ctgcttccaa tgtagaagag
      481 gcatttatta atacagcaaa agaaatttat gagaaaatcc aagaaggagt ctttgacatt
      541 aataatgagg caaacggcat taaaattggc cctcagcacg ctgctactaa tgccacacac
      601 gcgggcaatc agggaggaca gcaggccggg ggaggctgct gttgagtccg tttttt
//
LOCUS       DOGRAB5A      796 bp ss-mRNA            MAM       25-JUL-1990
DEFINITION  C.familiaris GTP-binding protein (rab5) mRNA, complete cds.
ACCESSION   M35520
KEYWORDS    GTP-binding protein.
SOURCE      C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to
            mRNA, clone II.
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 796)
  AUTHORS   Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M.
  TITLE     Localization of low-molecular weight GTP binding proteins to
            exocytic and endocytic compartments
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Chavrier, 22-JUN-1990.
            
            Base-pairs 664 to 711 form a synthetic peptide used to raise
            antibodies.
FEATURES       from  to/span     description
    pept        121      768     GTP-binding protein (rab5)
BASE COUNT      267 a    163 c    174 g    192 t
ORIGIN      
        1 ccgcggctcc tcgtgctgcg gcctcaggtt tctgtatatc cagaaagaaa aaatttgaca
       61 ccttgcatcc tggaagttca tttaagagac tgaaattagg gacttctttc aaatttggac
      121 atggctaatc gaggagcaac aagacccaac gggccaaata ctggaaataa aatatgccag
      181 ttcaaactag tacttctggg agagtctgct gttggcaaat caagcctagt gcttcgtttt
      241 gtgaagggcc aatttcatga atttcaagag agtaccatag gggctgcttt tctaacccaa
      301 actgtgtgtc ttgatgatac aacagtaaag tttgaaatat gggatacagc tggtcaagaa
      361 cgataccata gcttagcacc aatgtactac agaggagcac aagcagccat agttgtatat
      421 gatatcacaa atgaggagtc ctttgccaga gccaaaaact gggttaaaga acttcagagg
      481 caagccagtc ctaacattgt aatagcttta tcaggaaaca aggctgatct tgcaaataaa
      541 agagctgtcg atttccagga agcacagtcc tatgcagatg acaacagttt attattcatg
      601 gagacatcag ctaaaacatc gatgaacgta aatgaaatat tcatggcaat agctaaaaag
      661 ttgccaaaga acgaaccaca gaatccagga gcaaattctg ccagaggaag aggagtagac
      721 cttactgaac ccacgcagcc aaccaggagt cagtgttgta gtaactaaac ctccagtttg
      781 aacttcctgg aatatc
//
LOCUS       DOGRAB7A      811 bp ss-mRNA            MAM       25-JUL-1990
DEFINITION  C.familiaris GTP-binding protein (rab7) mRNA, complete cds.
ACCESSION   M35522
KEYWORDS    GTP-binding protein.
SOURCE      C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to
            mRNA, clone II.
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 811)
  AUTHORS   Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M.
  TITLE     Localization of low-molecular weight GTP binding proteins to
            exocytic and endocytic compartments
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Chavrier, 22-JUN-1990.
            
            
            Base-pairs 542 to 592 form a synthetic peptide used to raise
            antibodies.
FEATURES       from  to/span     description
    pept         20      643     GTP-binding protein (rab7)
BASE COUNT      251 a    206 c    194 g    160 t
ORIGIN      
        1 gagcggctgc gtttgaagga tgacctctag gaagaaagtg ttgctgaagg ttatcatcct
       61 gggagattct ggagttggta agacatcact catgaaccag tatgtgaaca agaaattcag
      121 taatcagtac aaagctacaa taggagcaga ctttctgaca aaggaggtga tggtggatga
      181 cagactagtt acaatgcaga tctgggacac agcaggccag gaacggttcc agtcccttgg
      241 tgtggccttc tacagaggtg cagactgctg cgttctggta tttgacgtta ctgcccccaa
      301 cacattcaaa accctcgata gctggagaga tgagtttctc atccaggcca gtccccggga
      361 tcctgaaaac ttccctttcg ttgtgttggg aaacaagatt gacctcgaaa acagacaagt
      421 ggccacaaag cgggcacagg cctggtgcta cagcaaaaac aacattccct acttcgagac
      481 cagtgccaag gaggccatca atgtggagca ggcgttccag acgattgcaa ggaatgcact
      541 taaacaggaa acagaggtgg agctgtacaa tgaattccct gaacccatca aactggacaa
      601 gaacgaccgg gccaagacct cagcggaaag ctgcagttgc tgaaggggca gtgagagcag
      661 agcacagagt ccttcacaaa caaagaacac acttaggcct tccaacacga gcccccttct
      721 tctcttccaa acaaaacata aagtcatctc tcgaatccag ctgccaaaag accctaccaa
      781 acacttcacc ctgacacaca catacacaca c
//
LOCUS       HUMU7AA       649 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human U7 small nuclear RNA pseudogene, fragment 32sm.
ACCESSION   M35537
KEYWORDS    U7 small nuclear RNA; pseudogene.
SOURCE      Human liver DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 649)
  AUTHORS   Soldati,D. and Schimperli,D.
  TITLE     Structures of four human pseudogenes for U7 small nuclear RNA
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.Schimperli, 22-JUN-1990.
FEATURES       from  to/span     description
    uRNA.ps     293      352     pseudo-U7 uRNA
BASE COUNT      217 a    107 c    115 g    199 t     11 others
ORIGIN      
        1 attatggcag agtacatgta acatatagtt tgctattcna actgattttt gacaaagata
       61 caacagcana tcaatggagg aacaatagcn tttttaacaa atggtgttgg cacaactgga
      121 caactgtaag nnaaagaaaa tgaanttcaa tctanatctc anaccgtatt aaaaaaaact
      181 caaagtgggc cacagactta gatataaaat gtaaaactat aacactttta gaaaanatat
      241 aggagaanat ctatgggatt tagggcaaaa gcatgattca aaaaaggaaa gtcagtgtta
      301 cagccctttt agaatttgtc tagcaggttt tctggttttc cagaaaacct ccacataaaa
      361 aggaaaaaga aaaaaaggaa aaagtaataa attagtatga attgagcatt ttaatgattc
      421 tattttattg cctttgttgg cttattaaat ataactctct gttttgttat tttagtggtt
      481 gctttaggtt ttatagtaat acatctttaa cctgttacag tccaccttct ttttgtttgt
      541 ttgttttgga agcagggtct cactctgtca ccaaggctag agtgcagtgg cactatcacg
      601 gctcactgca acctcaacct cccaggctcc agngttcctc ctgctgcag
//
LOCUS       HUMU7AB       521 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human U7 small nuclear RNA pseudogene, fragment 32BG.
ACCESSION   M35538
KEYWORDS    U7 small nuclear RNA; pseudogene.
SOURCE      Human liver DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 521)
  AUTHORS   Soldati,D. and Schimperli,D.
  TITLE     Structures of four human pseudogenes for U7 small nuclear RNA
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.Schimperli, 22-JUN-1990.
FEATURES       from  to/span     description
    uRNA.ps     295      344     pseudo-U7 uRNA
BASE COUNT      186 a    102 c     89 g    144 t
ORIGIN      
        1 tttcttcttt ttccacctct tgtctattca ggccctcagt gaattggatc atgctcaccc
       61 acatcagggc aggcaatcta cttattgagt tcactgattc aaatgataac ctcacctgga
      121 aaaatcctca cagacccaga aataatgttt aatctaagca cccatggcca gtcaagttga
      181 gacataaaat tagccatcac agtacaggca tacctgggaa atgacgcagg ttcagttcca
      241 gaccatcaca ataaagcaaa tattgcaata aagtgagtca caaaaagaaa aagtcagtgt
      301 tacagctttt agaatttgtc tagcaggttt tctggaaaac cttcacaaaa aaaggagaaa
      361 gagtgcatat aaaatgctta tgttgatacc atactgtagt ctattaagtg tgcaatagca
      421 ttatgtctat aaaacaatgt acatacttta aaaatatttt attgttaaaa catgctatca
      481 cagagacaca aagtgagcac atgctgttgg aaaaatggta c
//
LOCUS       HUMU7AC       513 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human U7 small nuclear RNA pseudogene.
ACCESSION   M35539
KEYWORDS    U7 small nuclear RNA; pseudogene.
SOURCE      Human liver DNA, clone 25H.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 513)
  AUTHORS   Soldati,D. and Schimperli,D.
  TITLE     Structures of four human pseudogenes for U7 small nuclear RNA
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.Schimperli, 22-JUN-1990.
FEATURES       from  to/span     description
    uRNA.ps     204      264     pseudo-U7 uRNA
BASE COUNT      127 a     83 c     81 g    222 t
ORIGIN      
        1 aattgtctgt ctttcatatt tttgtcattc tcgtgagtgt gaagtggtat ctcattgtgg
       61 ttttgatttg catttcccta atgactaatg gtgttgaata tcttttcata tgcttataag
      121 ccatttatat gtctttggag aaattctttt caaatctctt gctcatttta aaattaggtt
      181 gtcattttat tacggagttg cattagtgtt acagctcttt tagaatttgt ctagcaggtt
      241 ttctgatttt tacccggaac ccctccccag ccaaaagtaa aagaaaaaaa aagctgcaat
      301 agttctttat atagtttaga tacaaggccc ttatcagata tttgattttc aaatattgtc
      361 tcccattctg tgagttgttt tttcactctc ttgatggtgt catatgaagc acaaattttt
      421 ttttttattt tgataatgtc ccatttatct atgtattttt tcttttcatt tgtgcttttg
      481 gtgtcgtacc taagaaactg ctgcttaact caa
//
LOCUS       HUMU7AD       418 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human U7 small nuclear RNA pseudogene, fragment 36h.
ACCESSION   M35540
KEYWORDS    U7 small nuclear RNA; pseudogene.
SOURCE      Human liver DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 418)
  AUTHORS   Soldati,D. and Schimperli,D.
  TITLE     Structures of four human pseudogenes for U7 small nuclear RNA
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.Schimperli, 22-JUN-1990.
FEATURES       from  to/span     description
    uRNA.ps     229      286     pseudo-U7 uRNA
BASE COUNT      128 a     58 c    106 g    122 t      4 others
ORIGIN      
        1 agaggcacat gtcaagatga agctctggtg aagaattgat caaaaatagt ggcggagtga
       61 gatggagatt taaatccaag ggctgattta tgaaggcttc aaagattttt tttttttaaa
      121 gaaagaacat agattagttg tttctgaggg ctggagggga cagagataga ggcggcgacg
      181 gaaggatcct tcaggtttct tcttgaggtg attaaacgtt ctgaaatcgc gtgttacagc
      241 tcttttggaa tttgtctagc aggttttctg gttttcactg caaaacccca cagtnnnaaa
      301 acagaaaaaa aaawttatcc taaaattggg ctgtggtaat ggttgcgcat atgctgtgaa
      361 taggcttcca aatattgaaa tgtccacttc aaacgagtga actgtatggt atgtgaat
//
LOCUS       SCMPMYA1     3156 bp ss-mRNA            INV       25-JUL-1990
DEFINITION  S.mansoni paramyosin mRNA, complete cds.
ACCESSION   M35499
KEYWORDS    paramyosin.
SEGMENT     1 of 2
SOURCE      S.mansoni (strain Puerto Rican) adult worm, cDNA to mRNA, clones
            Pmy[1,8,11,15].
  ORGANISM  Schistosoma mansoni
            Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda; 
            Digenea; Strigeidida; Strigeata; Schistosomatoidea; 
            Schistosomatidae.
REFERENCE   1  (bases 1 to 3156)
  AUTHORS   Laclette,J.P., Landa,A., Arcos,L., Willms,K., Davis,A.E. and
            Shoemaker,C.B.
  TITLE     Paramyosin is the Schistosoma mansoni (trematoda) homologue of
            antigen B from Taenia solium (cestoda)
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.P.Laclette, 22-JUN-1990.
            
               Author address: J.P.Laclette
               Department of Tropical Public Health
               Harvard School of Public Health
               665 Huntington Avenue
               Boston, MA 02115
               Email: zehm%hscvax%harvunxwxw.edu
FEATURES       from  to/span     description
    pept         47     2647     paramyosin
    mRNA    <     1  >  3156     paramyosin mRNA
BASE COUNT     1279 a    435 c    516 g    923 t      3 others
ORIGIN      
        1 tctttcacta atattaaaaa gaaaaattta aaaaaaaaga ggaaaaatga tgaatcatga
       61 tacagaatct catgtgaaaa tatcaagaac tatttatcga ggagtatcac caagtacaac
      121 aagacttgag agtcgagtac gggaattaga agatcttttg gatttagaac gtgatgcaag
      181 agttcgagct gaacgacatg ctgctgattt aggttttcaa gtggatgcat tatcagaacg
      241 tttagatgaa gctggaggtt ctacaacaca aactcaagaa ttattaaaac gtcgtgaaat
      301 ggaaatcaat aaactacgta aagatttaga aaatgctaat gcatcacttg aactagctga
      361 aacatcaatg agacgtcgac atcaaacagc attgaatgaa ttagctttgg aagttgaaaa
      421 tttacaaaaa caaaaaggaa aggctgaaaa agacaaaagt catttgatta tggaagtgga
      481 taatgttcta ggacaattag atggtgcatt aaaagctaag caatcagctg aatcaaaatt
      541 agaaggatta gatagtcaat taaatcgttt aaaatcatta accgacgatt tacaaagaca
      601 attaactgaa ttaaataatg ctaaatcaag attaacatca gaaaattttg aattattaca
      661 tataaatcaa gattatgaag cacaaatatt aaattattct aaagctaaat catcacttga
      721 aagtcaagta gatgatttaa aaagatcatt agatgatgaa gctaaaaatc gttttaatct
      781 tcaagctcaa cttacatcac ttcaaatgga ttatgataat ttacaagcta aatatgatga
      841 agaaagtgaa gaagctagta atttacgtag tcaagtatct aaatttaacg ctgatattgc
      901 tgcattaaaa tcgaaatttg aacgtgaact tatgagtaaa acagaagaat tcgaagaaat
      961 gaagaggaaa ttcactatga gaattaccga acttgaagat actgctgaaa gagaacgatt
     1021 aaaagcggta tcattagaaa aacttaaaac aaaattaaca ttagaaatta aagatttaca
     1081 atctgaaata gaaagtcttt cattagaaaa tagtgaatta attcgtcgtg ctaaagctgc
     1141 tgaatcatta gcttctgatt tacaacgtcg tgttgatgaa ttaacaattg aagtgaatac
     1201 attaacatca caaaatagtc aattagaaag tgaaaatcta cgtttaaaaa gtttagttaa
     1261 tgatttaacg gataaaaata atttattaga acgtgaaaat cgtcaaatga atgatcaagt
     1321 caaagaatta aaaagttcac ttcgtgatgc taatcgtcgt cttactgatt tagaagcatt
     1381 aagatcgcaa ttagaggctg aaagagataa tcttgcatca gctttacatg atgctgaaga
     1441 agcattacat gatatggatc aaaagtatca agcatcacaa gctgcattaa atcatttgaa
     1501 atctgaaatg gaacaaaggc ttagagaaag agatgaagaa ttagaaagtt taagaaaaag
     1561 tactactaga acaattgaag aattaactgt tacaataact gaaatggaag ttaaatataa
     1621 atcagaatta tcacgtttaa aaaaacgtta tgaatcaaat attgctgatt tagaaattca
     1681 acttgataca gctaataaag ctaatgcaaa tcttatgaaa gagaataaaa atttatcaca
     1741 acgtgttaaa gatttagaaa catttttaga tgaagaacgt cgtcttcgtg aagcagctga
     1801 aaataattta caaattactg aacataaacg tttacaatta gcaaatgaaa ttgaagaaat
     1861 acgtagtaca ttagaaaatt tagaacgttt acgtaaacat gctgaaacag aacttgaaga
     1921 agctcaatca cgtgttagtg aattaactat tcaagttaat acattaacta atgataaacg
     1981 tcgtcttgaa ggtgatattg gtgtaatgca ggctgatatg gatgatgcta ttaatgctaa
     2041 acaagcttct gaagatcgag caattagatt aaataatgaa gtattacgtt tagctgatga
     2101 attacgtcaa gaacaaggaa attataaaca tgctgaagca ttaagaaaac aattagaaat
     2161 tgaaatacgt gaaattacag ttaaattaga agaagctgaa gcatctgcta cacgtgaagg
     2221 tcgtcgtatg gtacaaaaat tacaggctcg tgtacgtgaa cttgaatcag aattcgatgg
     2281 tgaatcaaga agatgtaaag atgcattagc tcaagcacgt aaatttgaac gtcaatataa
     2341 agaattacaa acacaagctg aagatgatcg tcgtatggta ttagaacttc aagatttatt
     2401 agataaaact caaatgaaaa tgaaagccta taaacgtcaa ttggaagaaa tggaagaagt
     2461 atctcaaatt acaatgaata aatatcgtaa agcccaacaa caaattgaag aagctgaaca
     2521 tcgtgcagat atggctgaac gtacagtcac tgtacgtcgt gttggtccag gtggacgtgc
     2581 tgtttctgta gcacgtgaat tatctgtcac atcaaataga ggaatgagag caacaagtat
     2641 gatgtaaagc acttaaataa taataataat agtgatacta tacacatata caaacgccta
     2701 tatctttctt tctctctttg tttcgttttc ctcatcttcg ctttttttta gtcatgatat
     2761 tcatctaaat gaggaaatta tcaataatga cctattatta ttcaatgtgc tttactttac
     2821 ttcccaccct aaatctcctc ggtatatcgt ttcccttttt ttttcttttt ttttctaaaa
     2881 acaaaaaatt ctaaaagtga aagacgaaaa aaaaaaannn cagaaatttg tttcctcctc
     2941 tcatattttc tctttgttct ttttattcat ttcatttatt gtattattaa tattgctatt
     3001 attattattg ttattactac ctaaccgatg gtttcaacga cagcaatctc ccatatttct
     3061 acacacacac acacacaaca cacacaacac acaaaagtat ctgtgcaatc gtaatagata
     3121 atctttattt attgattaaa aaaaaaaaaa aaaaaa
//
LOCUS       SCMPMYA2      217 bp ss-mRNA            INV       25-JUL-1990
DEFINITION  S.mansoni paramyosin mRNA, 3' flank.
ACCESSION   M36871
KEYWORDS    paramyosin.
SEGMENT     2 of 2
SOURCE      S.mansoni (strain Puerto Rican) adult worm, cDNA to mRNA, clones
            Pmy[1,8,11,15].
  ORGANISM  Schistosoma mansoni
            Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda; 
            Digenea; Strigeidida; Strigeata; Schistosomatoidea; 
            Schistosomatidae.
REFERENCE   1  (bases 1 to 217)
  AUTHORS   Laclette,J.P., Landa,A., Arcos,L., Willms,K., Davis,A.E. and
            Shoemaker,C.B.
  TITLE     Paramyosin is the Schistosoma mansoni (trematoda) homologue of
            antigen B from Taenia solium (cestoda)
  JOURNAL   Unpublished (1990) Harvard 665 Huntington Avenue, Boston, MA 02115
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.P.Laclette, 22-JUN-1990.
            
               Author address: J.P.Laclette
               Department of Tropical Public Health
               Harvard School of Public Health
               665 Huntington Avenue
               Boston, MA 02115
               Email: zehm%hscvax%harvunxwxw.edu
FEATURES       from  to/span     description
    mRNA    <     1      217     paramyosin mRNA
BASE COUNT       66 a     46 c     17 g     88 t
ORIGIN      About 1 kb after segment 1.
        1 cagaaatttg tttcctcctc tcatattttc tctttgttct ttttattcat ttcatttatt
       61 gtattattaa tattgctatt attattattg ttattactac ctaaccgatg gtttcaacga
      121 cagcaatctc ccatatttct acacacacac acacacaaca cacacaacac acaaaagtat
      181 ctgtgcaatc gtaatagata atctttattt attgatt
//
LOCUS       ECAPNL        420 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  E.carotovora pectin lyase (PNL) gene, 5' end.
ACCESSION   M35271
KEYWORDS    pectin lyase.
SOURCE      E.carotovora DNA, clone pTN2159.
  ORGANISM  Erwinia carotovora
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 420)
  AUTHORS   Nishida,T., Suzuki,T., Ito,K., Kamio,Y. and Izaki,K.
  TITLE     Cloning and expression of pectin lyase gene from Erwinia carotovora
            in Escherichia coli
  JOURNAL   Biochem. Biophys. Res. Commun. 168, 801-808 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        284  >   420     pectin lyase (EC 4.2.2.10)
BASE COUNT      127 a     66 c     94 g    133 t
ORIGIN      
        1 cctatcagtc tgatgaagtt gaacaggctg cgaaccgtat ttttaatggc ggcgggtaaa
       61 aggctggtga tgataatcgt agcgctgcca ttttactaaa agatggcggc gtattaattg
      121 ggtattgaat tattcgcaag gttgtttttt tattaaactc gattaataag cgtaatgaaa
      181 tcctttctat acaattttta attgtcggag gcgtattatt tagtctcaat taaataatac
      241 gctggaagac attattattc actcattgta aaaaggaaaa cttatggctt atccaacaac
      301 aaatcttact gggcttattg gttttgcaaa agcggcaaaa gttaccggag gaacgggcgg
      361 taaagtcgtt acggtaaatt ctttggccga ttttaaatca gcggtgacgg ttccgcaaaa
//
LOCUS       ECOUXEX       318 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  E.coli uxaCT-exuT intercistronic region.
ACCESSION   M35280
KEYWORDS    catabolite receptor protein.
SOURCE      E.coli (strain K-12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Blanco,C. and Mata-Gilsinger,M.
  TITLE     Identification of cyclic AMP-CRP binding sites in the
            intercistronic regulatory uxaCA-exuT region of Escherichia coli
  JOURNAL   FEMS Microbiol. Lett. 33, 205-209 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site         46       71     catabolite receptor protein binding site 1
    site        165      193     catabolite receptor protein binding site 2
BASE COUNT       98 a     62 c     70 g     88 t
ORIGIN      
        1 gtcgacttat gatttgcgac ggcagaaaga taacttgtca tacaacttta aaaggtgaga
       61 gccatcacaa atgtgggaat atttgtaggg acattacctg acgacagcaa ggccagtact
      121 ggcgcggcct gcagcgagat ttaccacttt gagagtaatt tttttaacta cgtttattga
      181 tctaactcac gaaaatatct tcggactctg gaaattggtg tgataacttt gtcagcatcg
      241 caccataagc aagctagctc actcgttcga agaggaagac gaaaataact ccgtttatga
      301 ctgaagatta tcctgtta
//
LOCUS       HUMSYNIFA     144 bp ds-DNA             SYN       25-JUL-1990
DEFINITION  Human synthetic interferon alpha-2 gene, 3' end.
ACCESSION   M35281
KEYWORDS    interferon.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 144)
  AUTHORS   Rossi,J.J., Kierzek,R., Huang,T., Walker,P.A. and Itakura,K.
  TITLE     An alternate method for synthesis of double-stranded DNA segments
  JOURNAL   J. Biol. Chem. 257, 9226-9229 (1982)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <    13      135     interferon alpha-2
BASE COUNT       37 a     32 c     32 g     43 t
ORIGIN      
        1 caagaattca tgatcactct gtacctgaag gaaaagaaat actctccgtg tgcttgggaa
       61 gttgtacgtg ctgaaatcat gcgttctttc tccctgtcta ctaaccttca ggagtctctg
      121 cgttctaaag aatagctgca gtgg
//
LOCUS       RATMAL5      1104 bp ds-DNA             ROD       25-JUL-1990
DEFINITION  Rat malic enzyme (ME) gene, 5' end.
ACCESSION   M35258 M21619
KEYWORDS    malic enzyme.
SOURCE      Rat (Sprague-Dawley, female) liver, clone lambda-g-ME-29.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1104)
  AUTHORS   Morioka,H., Tennyson,G.E. and Nikodem,V.M.
  TITLE     Structural and functional analysis of the rat malic enzyme gene
            promoter
  JOURNAL   Mol. Cell. Biol. 8, 3542-3545 (1988)
  STANDARD  simple staff_review
REFERENCE   2  (bases 427 to 925; revises [1])
  AUTHORS   Petty,K.J., Desvergne,B., Mitsuhashi,T. and Nikodem,V.M.
  TITLE     Identification of a thyroid hormone response element in the malic
            enzyme gene
  JOURNAL   J. Biol. Chem. 265, 7395-7400 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        918  >  1104     malic enzyme (EC 1.1.1.40)
    mRNA        883  >  1104     malic enzyme mRNA
    rpt         814      823     direct repeat
    rpt         827      836     direct repeat
BASE COUNT      220 a    376 c    290 g    218 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcgcat agcccagaag ctatagctgt actgatgggc tcaagtaaaa taattagaaa
       61 ttatttctca ggtatctagg caatatttaa cccccaaatt gttccgcagt gtctagatga
      121 acaccataga atttggccgt gcgacttaac tgaaaagaaa gggctttgtt gtctgaaggc
      181 tgcttggctg tattgttttg ttttaatcag acatccttgg gagacatagg atttatttct
      241 ccagtccttg gatcttcaag tataaatatc aataatacaa ccactgggtt tcagtactgg
      301 aagacctgtt attctgaccc tctgtcatca gagaagaaac catacatcat cttgcaaaaa
      361 ttaacatctt ggtttccaga acgctcagga aaattgttct taagctcaat aggactggcc
      421 actggacctg tgccctctaa cacctttttc ttaccacgtt cgaacacaat tccctcagat
      481 actattcaga aacaggcgag gagtcgcccg ccctatcgcc cagtgccatc gaggcctggg
      541 cattctgggt caaagttgat cccctcctgc atcaggcccc tggggcatgg ctggcatcca
      601 ggacgttggg gttaggggag gacagtggac gagcggagga agcgaggcgg cccgcccctc
      661 acccgtcggt gcccaggtcg cacgctcggc gctcaccagc ttggccggcg ccccgccccc
      721 gcctcctcgc acggcggctc ggccgatgcc gccgtgactc agcgcttctc gcgggccgcc
      781 cgcgcggccg cggctaggcc gggctcctcc cgcctcgcca ccccctctcg ccacccacgc
      841 ccgcccccgg ccgcggggcc ttccgtcgca cggccgccgc ccgccgcact cccgtccgcc
      901 ccgccacggt gctggccatg gatccccgag ccccccgccg ccgacacacc caccagcgcg
      961 gctacctgct gacgcgggac ccgcatctca acaaggtgag ccccgccccg agagccgccc
     1021 tgggcccgcc gctgggctcg ggcacccgcg tcccaccgag gggacggtcc cacccgggag
     1081 gccactgcgg agccggcgcc aacg
//
LOCUS       RATSPA       1595 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Rat serine pyruvate aminotransferase mRNA, complete cds.
ACCESSION   M35270 X06357
KEYWORDS    serine pyruvate aminotransferase.
SOURCE      Rat (strain Wistar) liver, clones pRspt910,321].
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 63 to 1595)
  AUTHORS   Oda,T., Miyajima,H., Suzuki,Y. and Ichiyama,A.
  TITLE     Nucleotide sequence of the cDNA encoding the precursor for
            mitochondrial serine:pyruvate aminotransferase of rat liver
  JOURNAL   Eur. J. Biochem. 168, 537-542 (1987)
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 198)
  AUTHORS   Oda,T., Funai,T. and Ichiyama,A.
  TITLE     Generation from a single gene of two mRNAs that encode the
            mitochondrial and peroxisomal serine:pyruvate aminotransferase of
            rat liver
  JOURNAL   J. Biol. Chem. 265, 7513-7519 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        109     1353     peroxisomal serine:pyruvate aminotransferase
                                 precursor (EC 2.6.1.51; pSPT)
    sigp        109      118     serine:pyruvate aminotransferase signal
                                 peptide
    matp        119     1350     serine:pyruvate aminotransferase
    pept        175     1353     mitochondrial serine:pyruvate aminotransferase
                                 (mSPT)
    mRNA         61  >  1520     pSPT mRNA (alt.)
    mRNA         62  >  1520     pSPT mRNA (alt.)
    mRNA        127  >  1520     mSPT mRNA (alt.)
    mRNA        129  >  1520     mSPT mRNA (alt.)
    mRNA        130  >  1520     mSPT mRNA (alt.)
    signal     1515     1520     polyA signal
BASE COUNT      376 a    437 c    455 g    327 t
ORIGIN      
        1 aggacaaaca tcgatcaggg tcaaattgac aataaaaggg ctggagcaag caacagggac
       61 tcaccaacca ggcctcgcct ctgagttcag cccagagcta gctgggaaat gttccggatg
      121 ttggccaagg ccagtgtgac gctgggctcc agggcagcaa gttgggtacg gaacatgggc
      181 tcgcaccagc tgctggtgcc acccccagag gccctgagca agcccctgtc aattcctaag
      241 aggctcctgt tgggtccggg accctccaac ctggctcctc gtgtgctagc agctggaagt
      301 ctgaggatga ttggccacat gcaaaaagag atgtttcaga tcatggatga gatcaagcag
      361 ggcatccagt atgtgttcca gaccaggaac cccctcacac tggttgtcag cggctcagga
      421 cattgtgcca tggagactgc cctgttcaac ctcctggagc ctggggactc ctttcttgtg
      481 ggaaccaatg gcatctgggg gatacgggct gcagagatcg ctgagcggat tggagcccgt
      541 gtgcaccaga tgatcaagaa gcctggagaa cattacacac tgcaggaggt ggaggagggc
      601 ctggctcagc ataaaccagt gttgctgttc ctgacccacg gggagtcatc cactggtgtg
      661 ctgcagcccc tggatggttt cggggagctc tgccacaggt atcagtgcct actcctggtg
      721 gactcggtgg catcattggg cggagtccct atctacatgg accaacaagg catcgacatc
      781 ttgtactctg gctctcagaa ggtcctgaat gccccaccag ggatctccct catctccttc
      841 aacgacaagg ccaaatccaa agtctactcc cggaagacaa agccagtctc cttctacaca
      901 gacatcactt atttgtccaa gttgtggggc tgtgagggca agaccagagt aattcatcat
      961 acgttgcctg tcatcagctt atactgcctg agggagagcc tagcactcat ttcagagcag
     1021 ggcctggaga attcctggcg gcgtcacagg gaggctacag cacatctgca caagtgcctg
     1081 cgggagttgg gcttaaagtt ctttgtgaag gacccggaaa tccggctacc tacaatcacc
     1141 accgtgaccg tgcctgccgg ctacaactgg agggacatcg tcagctacgt gctggaccac
     1201 ttcaacattg aaatctctgg tggtcttggg ccctctgagg ataaggtgct gcggattggc
     1261 ctcctgggct acaacgccac cacagagaat gcggaccgtg tagcggaggc cctgagggag
     1321 gccctgcaac attgtcctaa gaataaattg tgagcatcgt ctcaccagac tgtgccctcc
     1381 tggaggggct gggaatatag caggaacgag aagactgtgc aagccctcca gccagcaaag
     1441 gctgccgatg taaccaggcg ggaagggtca gggcgaagct gcccctctcc ccacagatgg
     1501 agccctgtgg tcacatgatg ctaatcacct tccgatgaag ctgcattctg caggccactg
     1561 gacttcggga atattcaata aagtacttgc cagac
//
LOCUS       YSCCOX9A      180 bp ds-DNA             PLN       25-JUL-1990
DEFINITION  S.cerevisiae cytochrome c oxidase subunit VIIa (COX9) gene,
            complete cds.
ACCESSION   M35260
KEYWORDS    cytochrome c oxidase.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Duhl,D.M., Powell,T. and Poyton,R.O.
  TITLE     Mitochondrial import of cytochrome c oxidase subunit VIIa in
            Saccharomyces cerevisiae: Identification of sequences required for
            mitochondrial localization in vivo
  JOURNAL   J. Biol. Chem. 265, 7273-7277 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1      180     cytochrome c oxidase subunit VIIa
BASE COUNT       53 a     35 c     51 g     41 t
ORIGIN      
        1 atgactattg ctccaattac tggtacgatc aagagaagag tcatcatgga catcgtcctc
       61 gggttctccc tcgggggtgt catggcctct tactggtggt ggggattcca catggataag
      121 attaacaaga gagagaagtt ctacgcagag ctagctgaga ggaaaaagca agagaactga
//
LOCUS       DROTNCOPIA    276 bp ds-DNA             INV       25-JUL-1990
DEFINITION  D.melanogaster transposable element copia DNA in omega-aLTR1.
ACCESSION   M35053
KEYWORDS    copia transposon; transposable element.
SOURCE      D.melanogaster (strain w-a-1A) DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 276)
  AUTHORS   Zachar,Z., Davison,D., Garza,D. and Bingham,P.M.
  TITLE     A detailed developmental and structural study of the
            transcriptional effects of insertion of the copia transposon into
            the white locus of Drosophila melanogaster
  JOURNAL   Genetics 111, 495-515 (1985)
  STANDARD  simple staff_entry
BASE COUNT      100 a     42 c     34 g    100 t
ORIGIN      
        1 tgttggaata tactattcaa cctacaaaaa taacgttaaa caacactact ttatatttga
       61 tatgaatggc cacacctttt atgccataaa acatattgta agagaatacc actcttttta
      121 ttccttcttt ccttcttgta cgttttttgc tgtgagtagg tcgtggtgct ggtgttgcag
      181 ttgaaataac ttaaaatata aatcataaaa ctcaaacata aacttgacta tttatttatt
      241 tattaagaaa ggaaatataa attataaatt acaaca
//
LOCUS       ECOMETBJI      82 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  E.coli metB-metJ intercistronic DNA region.
ACCESSION   M34899
KEYWORDS    .
SOURCE      E.coli (strain K12) DNA, clone pAA110.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 82)
  AUTHORS   Smith,A.A., Greene,R.C., Kirby,T.W. and Hindenach,B.R.
  TITLE     Isolation and characterization of the product of the methionine-
            regulatory gene metJ of Escherichia coli K-12
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6104-6108 (1985)
  STANDARD  simple staff_entry
BASE COUNT       25 a     13 c     15 g     29 t
ORIGIN      
        1 tataatttta acggctattt gggatttgct catctatacg caaagaagtt tagatgtcca
       61 gatgtattga cgtccattaa ca
//
LOCUS       MNICPRRKA     103 bp ds-DNA             RNA       25-JUL-1990
DEFINITION  M.rugicum 4.5S ribosomal RNA.
ACCESSION   M35056
KEYWORDS    4.5S ribosomal RNA.
SOURCE      M.rugicum chloroplast DNA.
  ORGANISM  Chloroplast Mnium rugicum
            Eukaryota; Plantae; Embryobionta; Bryophyta; Bryopsida; Bryidae; 
            Bryales; Mniaceae; Mnium rugicum.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Troitsky,A.V., Bobrova,V.K., Ponomarev,A.G. and Antonov,A.S.
  TITLE     The nucleotide sequence of chloroplast 4.5 S rRNA from Mnium
            rugicum (Bryophyta): Mosses also posses this type of RNA
  JOURNAL   FEBS Lett. 176, 105-109 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      103     4.5S ribosomal RNA
BASE COUNT       33 a     20 c     28 g     22 t
ORIGIN      
        1 taaggtgacg gcaagactag ccgtttatca tcacgatagg tgccaagtgg aagtgcagta
       61 atgtatgcag ctgaggcatc ctaacagacc gagagattta aac
//
LOCUS       MUSCABLA      125 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse tyrosine kinase (c-abl) mRNA, 3' terminus.
ACCESSION   M34905
KEYWORDS    tyrosine kinase.
SOURCE      Mouse (strain NIH Swiss) testis, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 125)
  AUTHORS   Meijer,D., Hermans,A., von Lindern,M., van Agthoven,T.,
            de Klein,A., Mackenbach,P., Grootegoed,A., Talarico,D.,
            Valle,G.D. and Grosveld,G.
  TITLE     Molecular characterization of the testis specific c-abl mRNA in
            mouse
  JOURNAL   EMBO J. 6, 4041-4048 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    mRNA    <     1       44     tyrosine kinase (c-abl) mRNA (alt.)
    mRNA    <     1      125     tyrosine kinase (c-abl) mRNA (alt.)
BASE COUNT       26 a     35 c     26 g     38 t
ORIGIN      
        1 gcttactgta cctgcacctt tgatgcttac aaactgtccc cgagagcctg tgctcactgt
       61 gttttcattg gaaggaagct gcttactgta cctgcacctt tgatgcttac aaactgtccc
      121 cgaga
//
LOCUS       SOPMPDNA      111 bp ds-DNA             SYN       25-JUL-1990
DEFINITION  Synthetic ovalbumin pre-message selfprimer DNA.
ACCESSION   M35058
KEYWORDS    ovalbumin.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Oyama,F., Kikuchi,R. and Uchida,T.
  TITLE     A synthetic, partial pre-mRNA for ovalbumin primes its own
            complementary DNA with reverse transcriptase
  JOURNAL   J. Biochem. 104, 403-408 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site          3        3     cDNA start with primer
    site         36       36     cDNA start without primer
    site         69       84     primer-independent cDNA
BASE COUNT       44 a     15 c     20 g     32 t
ORIGIN      
        1 atcctggaag tttatcaaag cgaacaacct gtaattgaaa ataatagtag ctgaaataat
       61 ggttatgaca aaaagaagtt atgcaatcca gtttcaagat ttctagctag t
//
LOCUS       XELRRAA       121 bp ss-RNA             RNA       25-JUL-1990
DEFINITION  X.laevis 5S RNA.
ACCESSION   M35055
KEYWORDS    5S ribosomal RNA.
SOURCE      X.laevis kidney ribosomal RNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Brownlee,G.G., Cartwright,E., McShane,T. and Williamson,R.
  TITLE     The nucleotide sequence of somatic 5 S RNA from Xenopus laevis
  JOURNAL   FEBS Lett. 25, 8-12 (1972)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1      121     5S ribosomal RNA
BASE COUNT       24 a     34 c     38 g     25 t
ORIGIN      
        1 gcctacggcc acaccaccct gaaagtgccc gatctcgtct gatctcggaa gccaagcagg
       61 gtcgggcctg gttagtactt ggatgggaga ccgcctggga ataccaggtg tcgtaggctt
      121 t
//
LOCUS       YSCTRR2        76 bp ss-tRNA            RNA       25-JUL-1990
DEFINITION  Yeast (S.cerevisiae, Brewer's) Arg-tRNA-II.
ACCESSION   K00157 M34900
KEYWORDS    transfer RNA; transfer RNA-Arg.
SOURCE      Yeast (Saccharomyces cerevisiae, Brewer's) tRNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Weissenbach,J., Martin,R. and Dirheimer,G.
  TITLE     Nucleotide sequence of tRNA-Arg-II from Brewer's yeast
  JOURNAL   FEBS Lett. 28, 353-355 (1972)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 76)
  AUTHORS   Weissenbach,J., Martin,R. and Dirheimer,G.
  TITLE     The primary structure of Arg-tRNA-II from brewer's yeast: Partial
            digestion with ribonuclease T-1 and derivation of the complete
            sequence
  JOURNAL   Eur. J. Biochem. 56, 527-532 (1975)
  STANDARD  full staff_review
COMMENT     Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their
            entry 0130 in Nucleic Acids Res. 11, r1-r54 (1983).
FEATURES       from  to/span     description
    tRNA          1       76     Arg-tRNA-II (NAR: 0130)
    anticdn      34       36     Arg-tRNA-II anticodon gcg
    modified      1        1     f = pseudouridine
    modified      9        9     m1g = 1-methylguanosine
    modified     10       10     m2g = 2-methylguanosine
    modified     16       16     d = dihydrouridine
    modified     19       19     d = dihydrouridine
    modified     26       26     m22g = 2,2-dimethylguanosine
    modified     27       27     f = pseudouridine
    modified     34       34     i = inosine
    modified     47       47     d = dihydrouridine
    modified     49       49     m5c = 5-methylcytidine
    modified     54       54     t = 5-methyluridine
    modified     55       55     f = pseudouridine
    modified     58       58     m1a = 1-methyladenosine
BASE COUNT       15 a     22 c     24 g     15 t
ORIGIN      5' end of mature tRNA.
        1 ttcctcgtgg cccaatggtc acggcgtctg gctgcgaacc agaagattcc aggttcaagt
       61 cctggcgggg aagcca
//
LOCUS       YSCTRT1A       76 bp ss-tRNA            RNA       25-JUL-1990
DEFINITION  Yeast (S.cerevisiae, brewer's) Thr-tRNA-1a.
ACCESSION   K00278 M34898
KEYWORDS    transfer RNA; transfer RNA-Thr.
SOURCE      Yeast (Saccharomyces cerevisiae, brewer's) tRNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Weissenbach,J., Kirarly,I. and Dirheimer,G.
  TITLE     The nucleotide sequences of two threonine tRNAs from Brewer's yeast
  JOURNAL   FEBS Lett. 71, 6-8 (1976)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 76)
  AUTHORS   Weissenbach,J., Kiraly,I. and Dirheimer,G.
  TITLE     Structure primaire des Thr-tRNA-1a-et-b de levure de biere
  JOURNAL   Biochimie 59, 381-391 (1977)
  STANDARD  full staff_review
COMMENT     Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their
            entry 1760 in Nucleic Acids Res. 11, r1-r54 (1983). Brewer's yeast
            Thr-tRNA-1 is 50% Thr-tRNA-1a and 50% Thr-tRNA-1b [1].
FEATURES       from  to/span     description
    tRNA          1       76     Thr-tRNA-1a (NAR: 1760)
    anticdn      34       36     Thr-tRNA-1a anticodon ggt
    modified     10       10     m2g
    modified     16       16     d
    modified     17       17     d
    modified     20       20     d
    modified     26       26     m22g
    modified     32       32     m3c
    modified     34       34     i
    modified     37       37     t6a
    modified     39       39     f
    modified     47       47     d
    modified     48       48     m5c
    modified     54       54     t
    modified     55       55     f
    modified     58       58     m1a
BASE COUNT       20 a     17 c     21 g     18 t
ORIGIN      5' end of mature tRNA.
        1 gcttctatgg ccaagttggt aaggcgccac actggtaatg tggagatcat cggttcaaat
       61 ccgattggaa gcacca
//
LOCUS       YSCTRT1B       76 bp ss-tRNA            RNA       25-JUL-1990
DEFINITION  Yeast (S.cerevisiae, brewer's) Thr-tRNA-1b.
ACCESSION   K00279 M34898
KEYWORDS    transfer RNA; transfer RNA-Thr.
SOURCE      Yeast (Saccharomyces cerevisiae, brewer's) tRNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Weissenbach,J., Kirarly,I. and Dirheimer,G.
  TITLE     The nucleotide sequences of two threonine tRNAs from Brewer's yeast
  JOURNAL   FEBS Lett. 71, 6-8 (1976)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 76)
  AUTHORS   Weissenbach,J., Kiraly,I. and Dirheimer,G.
  TITLE     Structure primaire des Thr-tRNA-1a-et-b de levure de biere
  JOURNAL   Biochimie 59, 381-391 (1977)
  STANDARD  full staff_review
COMMENT     Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their
            entry 1760 in Nucleic Acids Res. 11, r1-r54 (1983). Brewer's yeast
            Thr-tRNA-1 is 50% Thr-tRNA-1b and 50% Thr-tRNA-1a [1].
FEATURES       from  to/span     description
    tRNA          1       76     Thr-tRNA-1b (NAR: 1760)
    modified     10       10     m2g
    modified     16       16     d
    modified     17       17     d
    modified     20       20     d
    modified     26       26     m22g
    modified     32       32     m3c
    modified     34       34     i
    anticdn      34       36     Thr-tRNA-1b anticodon ggt
    modified     37       37     t6a
    modified     39       39     f
    modified     47       47     d
    modified     48       48     m5c
    modified     54       54     t
    modified     55       55     f
    modified     58       58     m1a
BASE COUNT       19 a     18 c     22 g     17 t
ORIGIN      5' end of mature tRNA.
        1 gcttctatgg ccaagttggt aaggcgccac actggtaatg tggagatcgt cggttcaaat
       61 ccgactggaa gcacca
//
LOCUS       BSTGLGBA     2735 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  B.stearothermophilus branching enzyme (glgB) gene, complete cds.
ACCESSION   M35089
KEYWORDS    branching enzyme.
SOURCE      B.stearothermophilus (strain 1503-4R, variant 4) DNA, clone pKVS1.
  ORGANISM  Bacillus stearothermophilus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2735)
  AUTHORS   Kiel,J.A.K.W., Boels,J.M., Beldman,G. and Venema,G.
  TITLE     Molecular cloning and nucleotide sequence of the branching enzyme
            gene (glgB) from Bacillus stearothermophilus, expression in E.coli
            and B.subtilis
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.K.W.Kiel, 12-JUN-1990.
               Dept of Genetics
               Center of Biological Sciences
               Kerklaan 30, NL 9751 NN
               Haren, THE NETHERLANDS
FEATURES       from  to/span     description
    pept        522     2441     branching enzyme (glgB) (EC 2.4.1.18)
    pept        325  <     1 (c) unidentified ORF2
    binding     337      330 (c) ORF2 ribosomal binding site (put.)
    binding     504      516     glgB ribosomal binding site (put.)
    signal      370      365 (c) ORF2 -10 region (put.)
    signal      394      389 (c) ORF2 -35 region (put.)
    signal      446      458     glgB -35 region (put.)
    signal      469      479     glgB -10 region (put.)
BASE COUNT      835 a    492 c    634 g    774 t
ORIGIN      
        1 gaattccaat ggaaataatg gctaacgtaa ggccgtttaa aaaggacgta ataatttcaa
       61 agcgcaaata accgaatgta aatcgatgat ttggcggacg catggcaaga taaagagcga
      121 tcatgctaag cccaagcgcc aatacgtcag atgccatatg ggcagagtcg gaaagcaaag
      181 ctaaggaatt ggataatagc cccccaacaa tttccacaat cgtaaaaaac aatgttaaaa
      241 cgagagtgat ccaaagcgtt tttttcgatt gattttgcgt ttttacatga ggaagatggt
      301 gataatcgta ttgaattggt gacatgacac acctcttatt tagaattatt tttaatttat
      361 atacattata atatagtttt ttataattgt gcaaaaaaat tttttgttta tttatcgaaa
      421 aatgtaaaaa aaatacaatt tttttatcaa ggaatttatg gaatcgctgt ggaatataag
      481 taacaacggt aagaaacttt aaggaaagga tgcgatacag attgatcgcc gtcggtccca
      541 ctgatttaga aatctattta tttcatgaag gcagcttata taaaagttat gaattgtttg
      601 gtgcacatgt gataaagaaa aatggcatgg tcggaacccg gttttgtgta tgggcacccc
      661 atgcgcggga agtgcgatta gtcggcagtt ttaatgaatg gaacggaact aattttaacc
      721 ttatgaaagt aagtaatcaa ggcgtatgga tgatttttat tcctgaaaac ttagaagggc
      781 atttatataa atacgaaatt acgacgaacg atgggaatgt tctgttaaaa tcggatccat
      841 acgcgtttta ctccgagttg cgtccccata ctgcttccat tgtctacaac ataaaaggat
      901 atcaatggaa tgaccagaca tggcgacgga agaaacagcg aaagcgaatt tatgaccagc
      961 ctttgttcat ttatgaactt cactttggtt cgtggaaaaa gaaagaggac ggcagttttt
     1021 atacatatca agagatggca gaggagctaa tcccttatgt tctcgaacat gggtttactc
     1081 atattgagct gctcccactc gtcgagcatc cgttcgatcg ttcttgggga tatcagggaa
     1141 taggttatta ttcagcaaca agccgctacg gaacaccgca tgatttgatg tattttattg
     1201 accgctgtca ccaagctgga ataggcgtca ttctcgattg ggttcctggc cacttttgta
     1261 aagattccca tgggttatat atgtttgatg gcgcaccggc atatgaatat gccaacatgc
     1321 aagaccggga aaattacgta tggggaacgg caaactttga ccttggcaag ccggaagtcc
     1381 gcagcttttt gatttccaat gcgttatttt ggatggaata tttccatgtg gacgggtttc
     1441 gtgtagatgc tgttgccaat atgttatatt ggccaaacag cgacgtacta tacaaaaata
     1501 cgtatgccgt ggagttcttg caaaaattaa atgaaacggt attcgcctat gatccgaaca
     1561 tattaatgat tgccgaagat tcgacagact ggccgcgcgt cactgctcca acatacgacg
     1621 gaggattagg atttaactat aaatggaaca tgggatggat gaacgatatt ttaacttata
     1681 tggaaacgcc gcctgaacat cgaaaatacg tgcacaataa agtaacattt tccctcttgt
     1741 atgcgtattc ggaaaatttc attttacctt tttcccatga cgaggtcgta catggaaaaa
     1801 aatcgctgtt aagtaaaatg ccggggacat atgaggaaaa gtttgcgcaa ttaaggttgc
     1861 tgtatggata tttgttgacg catcctggta agaaattatt gtttatgggc ggcgaatttg
     1921 gccagtttga tgaatggaaa gatttagagc agctggattg gatgcttttt gattttgata
     1981 tgcatcggaa tatgaatatg tatgtgaaag aattgttgaa atgttataag cgctataaac
     2041 cgctttatga gttagaccac tctccagatg gattcgagtg gattgatgtt cataacgccg
     2101 aacaaagtat tttctcattc attcgcagag gaaaaaaaga ggatgatttg cttattgttg
     2161 tgtgtaattt cacaaataaa gtataccacg gttataaagt tggtgttccg ttatttacaa
     2221 gatatcggga agtaatcaat agcgatgcaa tccaattcgg cggctttggg aatatcaatc
     2281 caaaaccgat tgcggcgatg gaagggccgt ttcacggaaa gccatatcat attcagatga
     2341 cgatcccgcc gtttggcatt tctattttaa gaccagtaaa aaaaggtagc gtcaaaagtt
     2401 ttatgaaaac tccacatccg ccatcccatg gagcatcgta aggcatcctt ggagccggat
     2461 tcgcccttga ccaacacccg ccaaaggtgt gaaagggacg tcaagggcga cggggacaaa
     2521 aaagagggca taggaaagcc gcccttgccc ttaccgaatt ttacctttga cgaggttcgg
     2581 ttggtcaagg gttcgcttcg ccgaatccgg ctgttcttct gatccatggg ctccggcgga
     2641 caaaaaagtt aggctgcctc ttgttggagg aaatcttgag ccatggcgat cagcttcgtc
     2701 caccgggccg gcatatgggg cagatcggcg agctc
//
LOCUS       HUMETMAGA    3343 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human secreted epithelial tumor mucin antigen (H23Ag) gene,
            complete cds.
ACCESSION   M35093
KEYWORDS    cell surface antigen; tumor mucin antigen.
SOURCE      Human breast tumor cell line MCF7 DNA, clone lambda-gtWES.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3343)
  AUTHORS   Tsarfaty,I., Hareuveni,M., Horev,J., Zaretsky,J., Weiss,M.,
            Jeltsch,J.M., Garnier,J.M., Lathe,R., Keydar,I. and Wreschner,D.H.
  TITLE     Isolation and characterization of an expressed hypervariable gene
            coding for a breast cancer associated antigen
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.Tsarfaty, 12-JUN-1990.
FEATURES       from  to/span     description
    pept        785      842     secreted epithelial tumor mucin antigen
                                 precursor, exon 1 (H23Ag)
               1342     2207     secreted epithelial tumor mucin antigen
                                 precursor, exon 2 (H23Ag)
    sigp        785      805     secreted epithelial tumor mucin antigen signal
                                 peptide
    matp        806      842     secreted epithelial tumor mucin antigen
               1342     2207     secreted epithelial tumor mucin antigen
    pre-msg     777  >   842     H23Ag mRNA and introns
    IVS         843     1341     H23Ag intron A
    signal      384      397     H23Ag ERE
    signal      633      644     H23Ag CACCT motifs
    signal      689      692     H23Ag TATA box
    site       1063     1090     put. enhancer
    rpt        1670     1729     repeat unit
BASE COUNT      679 a    986 c    981 g    697 t
ORIGIN      Chromosome 1q21-q24.
        1 gagctcctgg ccagtggtgg agagtggcaa ggaaggaccc tagggttcat cggagcccag
       61 gtttactccc ttaagtggaa atttcttccc ccactcccct ccttggcttt ctccaaggag
      121 ggaaccccag gctgctggaa agtccggctg gggcggggac tgtgggtttc agggtagaac
      181 tgcgtgtgga acgggacagg gagcggttag aagggtgggg ctattccggg aagtggtggt
      241 ggggggaggg agcccaaaac tagcacctag tccactcatt atccagccct cttatttctc
      301 ggccgcctct gcttcagtgg acccggggag ggcggggaag tggagtggga gacctagggg
      361 tgggcttccc gaccttgctg tacaggacct cgacctagct ggctttgttc cccatcccca
      421 gttagttgtt gccctgaggc taaaactaga gcccaggggc cccaagttcc agactgcccc
      481 tcccccctcc cccggagcca gggagtggtt ggtgaaaggg ggaggccagc tggagaagaa
      541 acgggtagtc aggggttgca gcattagagc ccttgtagcc ctagcccagg aatggttgga
      601 gagagaagag tagagtaggg aggggggttt gtcacctgtc acctgctcgg ctgtgcctag
      661 ggcgggcggg ggggagtggg gggaccggta taaagcggta ggcgcctgtg cccgctccac
      721 ctctcaagca gccagcgcct gcctgaatct gttctgcccc ctccccaccc atttcaccac
      781 caccatgaca ccgggcaccc agtctccttt cttcctgctg ctgctcctca cagtgcttac
      841 aggtgagggg cacgaggtgg ggagtgggct gccctgctta ggtggtcttc gtggtctttc
      901 tgtgggtttt gctccctggc agatggcacc agaagttaag gtaagaattg cagacagagg
      961 ctgccctgtc tgtgccagaa ggagggagag gctaaggaca ggctgagaag agttgccccc
     1021 aaccctgaga gtgggtacca ggggcaagca aatgtcctgt agagaagtct agggggaaga
     1081 gagtagggag agggaaggct taagagggga agaaatgcag gggccatgag ccaaggccta
     1141 tgggcagaga gaaggaggct gctgcaggaa ggaggcggcc aacccagggg ttactgaggc
     1201 tgcccactcc ccagtcctcc tggtattatt tctctggtgg ccaggcttat attttcttct
     1261 tgctcttatt tttccttcat aaagacccaa ccctatgact ttaacttctt acagctacca
     1321 cagcccctgg gcccgcaaca gttgttacag gttctggtca tgcaagctct accccaggtg
     1381 gagaaaagga gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg
     1441 ctgtgagtat gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca
     1501 ctcagggaca ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca
     1561 cctggggaca ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc
     1621 cgccagccca cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc
     1681 ccccagccca gggtgtcacc tcggccccgg agaccaggcc gcccccgggc tccaccgccc
     1741 ccccagccca tggtgtcacc tcggcgccgg acaacaggcc cgccttggcg tccaccgccc
     1801 ctccagtcca caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg
     1861 tgcacaacgg cacctctgcc agggctacca caaccccagc cagcaagagc actccattct
     1921 caattcccag ccaccactct gatactccta ccacccttgc cagccatagc accaagactg
     1981 atgccagtag cactcaccat agcacggtac ctcctctcac ctcctccaat cacagcactt
     2041 ctccccagtt gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc
     2101 agtttaattc ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca
     2161 tttctgaaat ggtgagtatc ggcctttcct tccccatgct cccctgaagc agccatcaga
     2221 actgtccaca ccctttgcat caagcctgag tcctttccct ctcaccccag tttttgcaga
     2281 tttataaaca agggggtttt ctgggcctct ccaatattaa gttcaggtac agttctgggt
     2341 gtggacccag tgtggtggtt ggaggggtgg gtggtggtca tgagccgtag ggagggactg
     2401 gtgcacttaa ggttggggga agagtgctga gccagagctg ggacccgtgg ctgaagtgcc
     2461 catttccctg tgaccaggcc aggatctgtg gtggtacaat tgactctggc cttccgagaa
     2521 ggtaccatca atgtccacga cgtggagaca cagttcaatc agtataaaac ggaagcagcc
     2581 tctcgatata acctgacgat ctcaagacgt cagcggtgag gctacttccc tgctgcagcc
     2641 agcaccatgc cggggcccct ctccttccag tgtctgggtc cccgctcttt ccttagtgct
     2701 ggcagcggga ggggcgcctc ctctgggaga ctgccctgac cactgctttt ccttttagtg
     2761 agtgatgtgc catttccttt ctctgaccag tctggggctg gggtgccagg ctggggcatc
     2821 gcgctgctgg tgctggtctg tgttctggtt gcgctggcca ttgtctatct cattgccttg
     2881 gtgagtgcag tccctggccc tgatcagagc cccccggtag aaggcactcc atggcctgcc
     2941 ataacctcct atctccccag gctgtctgtc agtgccgccg aaagaactac gggcagctgg
     3001 acatctttcc agcccgggat acctaccatc ctatgagcga gtaccccacc taccacaccc
     3061 atgggcgcta tgtgccccta gcagtaccga tcgtagcccc tatgagaagg tgagattggg
     3121 ccccacaggc aggggaagca gagggtttgg ctgggcaagg attctgaagg gggtacttgg
     3181 aaaacccaaa gagcttggaa gaggtgagaa gtggcgtgaa gtgagcaggg gagggctggc
     3241 aaggatgagg ggcagaggtc agaggagttt tgggggacag gcctgggagg agactatgga
     3301 agaaaggggc ccctcaaaag ggagtgcccc actgccagaa ttc
//
LOCUS       MPMVPIA      1155 bp ds-DNA             VRL       25-JUL-1990
DEFINITION  Mouse polyomavirus major structural protein (VP1) gene, complete
            cds.
ACCESSION   M34958
KEYWORDS    major structural protein.
SOURCE      Mouse polyomavirus (strain RA) DNA.
  ORGANISM  Mouse polyomavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Polyomaviruses.
REFERENCE   1  (bases 1 to 1155)
  AUTHORS   Freund,R., Garcea,R.L., Sahli,R. and Benjamin,T.L.
  TITLE     A specific amino acid substitution in polyoma virus VP1 correlates
            with plaque size and hemagglutination behavior
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Freund, 08-JUN-1990.
            
               Author address: R.Freund
               Bldg C2 RM 129A
               Dept of Pathology
               Harvard Medical School
               200 Longwood Avenue
               Boston, MA 02115
FEATURES       from  to/span     description
    pept          1     1155     VPI protein (VPI)
BASE COUNT      367 a    274 c    285 g    229 t
ORIGIN      
        1 atggccccca aaagaaaaag cggcgtctct aaatgcgaga caaaatgtac aaaggcctgt
       61 ccaagacccg cacccgttcc caaactgctt attaaagggg gtatggaggt gctggacctt
      121 gtgacagggc cagacagtgt gacagaaata gaagcttttc tgaaccccag aatggggcag
      181 ccacccaccc ctgaaagcct aacagaggga gggcaatact atggttggag cagagggatt
      241 aatttggcta catcagatac agaggattcc ccaggaaata atacacttcc cacatggagt
      301 atggcaaagc tccagcttcc catgctcaat gaggacctca cctgtgacac cctacaaatg
      361 tgggaggcag tctcagtgaa aaccgaggtg gtgggctctg gctcactgtt agatgtgcat
      421 gggttcaaca aacccacaga tacagtaaac acaaaaggaa tttccactcc agtggaaggc
      481 agccaatatc atgtgtttgc tgtgggcggg gaaccgcttg acctccaggg acttgtgaca
      541 gatgccagaa caaaatacaa ggaagaaggg gtagtaacaa tcaaaacaat cacaaagaag
      601 gacatggtca acaaagacca agtcctgaat ccaattagca aggccaagct ggataaggac
      661 ggaatgtatc cagttgaaat ctggcatcca gatccagcaa aaaatgagaa cacaaggtac
      721 tttggcaatt acactggagg cacaacaact ccacccgtcc tgcagttcac aaacaccctg
      781 acaactgtgc tcctagatga aaatggagtt gggcccctct gtaaaggaga gggcctatac
      841 ctctcctgtg tagatataat gggctggaga gttacaagaa actatgatgt ccatcactgg
      901 agagggcttc ccagatattt caaaatcacc ctgagaaaaa gatgggtcaa aaatccctat
      961 cccatggcct ccctcataag ttcccttttc aacaacatgc tcccccaagt gcagggccaa
     1021 cccatggaag gggagaacac ccaggtagag gaggttagag tgtatgatgg gactgaacct
     1081 gtaccggggg accctgatat gacgcgctat gttgaccgct ttggaaaaac aaagactgta
     1141 tttcctggaa attaa
//
LOCUS       MYCP115A     3082 bp ss-mRNA            BCT       25-JUL-1990
DEFINITION  M.hyorhinis 115 kDa protein (p115) gene, complete cds.
ACCESSION   M34956
KEYWORDS    .
SOURCE      M.hyorhinis (strain GDL) DNA, clone MhrG27.
  ORGANISM  Mycoplasma hyorhinis
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 3082)
  AUTHORS   Notarnicola,S.M., McIntoch,M.A. and Wise,K.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.S.Wise, 08-JUN-1990.
               University of Missouri-Columbia
               Dept of Mol Microbiol and Immunol
               School of Medicine-M653
               Columbia, MO 65212
FEATURES       from  to/span     description
    pept         70     3009     115 kDa protein
BASE COUNT     1324 a    410 c    443 g    905 t
ORIGIN      
        1 gaattctttt ttaataattt ttttacttta aaattctagt taaaactcta caaaaaaaca
       61 aggacaaata tgttaaagct tattaaaatt gaaatcgaag gttttaaatc gttcgccgat
      121 ccgatcagca taaatttcga tggttctgtt gtaggaatag ttggaccaaa tggttcagga
      181 aaatctaata ttaatgacgc aattagatga gtattaggtg aacaatcagc aaaacaactt
      241 cgtggactaa atatggatga tgttatcttt gcaggttcca aaactgtcaa acctcaagaa
      301 aaagcaatgg taaaattaac cttcaaaaat gaagatgcaa ttgaagaaac gaaacaaatt
      361 tttactattt ctcgtttact taaaagaggt caaggaacta atgaatattt ttacaatgat
      421 caacctgtta gatataaaga tattaaaaat ttagctgttg aatctggaat ttctaaatct
      481 tcacttgcaa ttatttccca aggtactata tctgaaattg cagaagcaac gcctgaacaa
      541 agaaaagcag ttattgaaga agctgctgga acttcaaaat acaaattaga caaagaagaa
      601 gcacaaaaga aacttattag aacaaatgat gcaattgata aattacaagg tgcaatcaaa
      661 gagttagaac gtcaagtaaa ctcgcttgat aaacaagctt ctaaagcaaa aatttattta
      721 gaaaaaagta aagctcttga atcagttgaa gtaggtttaa ttgttaatga tctaaacttt
      781 ttcaatgaaa aattaaataa tttaaatact tcactattag aagtagaaca acaaagaaat
      841 gatcttgaac tcaacattca aacttatgaa tccagtattt cacaaactgt tcattttaaa
      901 acagaagttg aatcttcaat ccaagaaatt acttcaaaat tagacaattt aaaaaacgca
      961 ctttccgaaa tcaaccttca agaagctaga attgaagaac gtagaaaatt aattatcagt
     1021 ggtgaaattg tagttgatca aaaaacaaaa attgaagaaa ttaaaaaaca agttgaatca
     1081 ctcaaaatac aaataaatgc ttcaaaacaa agagaaattg aactagacca acaacttaca
     1141 agactaaatg caaaagctaa ttctttaaaa ttgcaagaaa atgatattaa taaagaaatt
     1201 ggtgtattac ttgaaaaaaa atcagctgct gcagcaaata ttaatatatt aaaacaacaa
     1261 tttgaaaata aaagttttct ttctaaagga attaaaacta ttaaagataa ctcattttta
     1321 tttgatggtt acattggatt agcttctgaa ttatttaaag tagaatccga atttagttta
     1381 gcaattgaaa ctgttttagg tgctgcttta aatcaaatag taatgaaaac atctgaagat
     1441 gtacttcaag ctattgactt tttaaagaaa aatctttcag gtaaagcaac ttttattcct
     1501 ttaacatcta ttaaagaaag agaagtaaga gaagatcatt tacttgtttt aaaaggacaa
     1561 aaaggatttt taggtgttgc aaaagaacta attgaatttg atactcaatt taacaaactc
     1621 tttggatttt tacttggaaa catcttagtg gttgataatg tagacaatgc aaatagaata
     1681 gctaaaatat tagatcataa atacactata gtttctttag aaggtgattt attcagacca
     1741 ggcggaacca ttactggagg ttcaaaacta gaaagaactt ctattttaaa ttacgatatc
     1801 aaaataaaag aacacacaaa tacacttaaa tttgctgaag atcaaattca tgatttaaaa
     1861 attaaacagc aaacaatata taacgaaatt gaaacagtca attcaacaat ccaacaagta
     1921 aaaattgaag ctaattcaat aaattcaaaa cttaatatct taaacgaaga attaaataac
     1981 ttaaaactaa acgcaagcga aattttcaaa gaacaacaag aagaccaaga gagtttaaat
     2041 ttaagttttg attctgaaaa attgaacata gaaaaacaaa tttctactct aacaattgaa
     2101 ttaaattcta aaaaagatcg actaacaaat ttaattagtg agcaaggaaa aggagaaacc
     2161 aagaaacaag aattagatgc caaactaaga aaattaaaca ctcaacactc agatagtatc
     2221 actgaacaaa acagagcaaa attcttggta gagcaaaatc aaaaaagact ttctgagcac
     2281 tacaaattaa ctttagaagc tgctagtgaa caatattctt tagatttaga cattgaacaa
     2341 gcaagacatt ttgttgatag ccttaaaaaa gagttaaaag aattaggaaa cgttaattta
     2401 gaagcaatta ctgaatttga agaagtaaat caacgttacc aagagaaaaa acaatacatc
     2461 gaagaactaa ccactgctaa atccaaaatt gaagaagcaa tttctgattt agataaaatt
     2521 attatcaata aaacaacaga aattgttaac ttagtaaata atgaatttaa tatggtattt
     2581 caaaaaatgt ttggtggtgg aaaagcagaa attcacttca cagacaaaaa tgatatttta
     2641 aattctggtg ttgaaatatc tgcacaacca cctggtaaaa caattaaaaa cttacgactt
     2701 ttttcaggtg gagaaaaagc tattattgca atttcacttc tttttgctat tttaaaagca
     2761 agaccaattc cattgtgtat tttagacgaa gttgaagctg cacttgatga atctaatgtt
     2821 attcgttatg tagaattttt aaaattacta aaagaaaata ctcaattctt aattattact
     2881 caccgttcag gaacaatgtc aagagtagat cagttacttg gagttactat gcaaaaacgt
     2941 ggagttactt ccattttctc agttgaacta agcaaagcaa aagagatgct aaaagacgaa
     3001 ttaaaataat acaaataaaa ataaaaaaaa cagaagtttg aagtgaggtg ataccctttt
     3061 cttgaaaaaa ttttttgagt gt
//
LOCUS       PPHVLCRA      314 bp ds-DNA             VRL       25-JUL-1990
DEFINITION  Human papillomavirus type 6 long control region DNA.
ACCESSION   M35091
KEYWORDS    .
SOURCE      Human papillomavirus type 6 (patient specimen X020) DNA.
  ORGANISM  Human papillomavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Papillomavirus.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Hrisomalos,T.F., Boggs,D.L. and Fife,K.H.
  TITLE     The human papillomavirus type 6 long control region and human
            cellular DNA contain related sequences
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.H.Fife, 12-JUN-1990.
            
               AUTHOR address: K.H.Fife
               Emerson Hall 435
               Indiana University School of Medicine
               545 Barnhill Dr.
               Indianapolis, IN 46202-5124
FEATURES       from  to/span     description
    pept    <     1       21     L1 open reading frame (AA at 1)
    signal      231      236     polyA signal
    site         81      175     insert (as compared to prototype sequence)
    site        245      259     insert (as compared to prototype sequence)
BASE COUNT       75 a     29 c     72 g    138 t
ORIGIN      Mapped between nucleotides 7271 to 7476.
        1 cgcgccaaaa ccaaaaggta atatatgtgt atatgtactg ttatatatat gtgtgtatgt
       61 actgttatgt atatgtgttt atgtactgtt atatgtatgt gtgttgtata tatgtgtgta
      121 tatatgtgta tgtgtgtata tgtatatgta tgtgttgtgt atatatatgt gtgtgtgtgt
      181 tatgtgtgta atgtaattta tttgtgtaat gtgtatgtgt gtttatgtgc aataaacaat
      241 taactacatt attgtatatc ttgttacacc ctgtgactca gtggctgttg cacgcgtttt
      301 ggtttgcacg cgcc
//
LOCUS       PPHVLCRB      300 bp ds-DNA             VRL       25-JUL-1990
DEFINITION  Human papillomavirus type 6 long control region DNA.
ACCESSION   M35092
KEYWORDS    .
SOURCE      Human papillomavirus type 6 (patient specimen X019) DNA.
  ORGANISM  Human papillomavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Papillomavirus.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Hrisomalos,T.F., Boggs,D.L. and Fife,K.H.
  TITLE     The human papillomavirus type 6 long control region and human
            cellular DNA contain related sequences
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.H.Fife, 12-JUN-1990.
               Emerson Hall 435
               Indiana Univ School of Medicine
               545 Barnhill Dr.
               Indianapolis, IN 46202-5124
FEATURES       from  to/span     description
    pept    <     1       21     L1 open reading frame (AA at 1)
    signal      231      236     polyA signal
    site         81      175     insert (as compared to prototype sequence)
BASE COUNT       68 a     29 c     72 g    131 t
ORIGIN      Mapped between nucleotides 7271 to 7476.
        1 cgcgccaaaa ctaaaaggta atatatgtgt atatgtactg ttatatatat gtgtgtatgt
       61 actgttatgt atatgtgtgt atgtactgtt atatgtatgt gtgttgtata tatgtgtgta
      121 tatatgtgta tgtgtgtata tgtatatgta tgtgttgtgt atatatatgt gtgtgtgtgt
      181 tctgtgtgta atgtaattta tttgtgtaat gtgtatgtgt gtttatgtgc aataaacaat
      241 tacctcttgt tacaccctgt gactcagtgg ctgttgcacg cgttttggtt tgcacgcgcc
//
LOCUS       TRHTCSA      1010 bp ss-mRNA            PLN       25-JUL-1990
DEFINITION  T.kirilowii trichosanthin (TCS) mRNA, complete cds.
ACCESSION   M34858
KEYWORDS    ribosome inactivating protein; trichosanthin.
SOURCE      T.kirilowii maximowicz, cDNA to mRNA.
  ORGANISM  Trichosanthes kirilowii
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Violales; Cucurbitaceae.
REFERENCE   1  (bases 1 to 1010)
  AUTHORS   Shaw,P.-C., Yung,M.-H., Zhu,R.-H., Ho,W.K.-K., Ng,T.-B. and
            Yeung,H.-W.
  TITLE     Molecular cloning of trichosanthin cDNA and its expression in
            Escherichia coli
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.-C.Shaw, 06-JUN-1990.
            
               Author address: P.-C.Shaw
               Department of Biochemistry
               Chinese University of Hong Kong
               Shatin, NT, HONG KONG
FEATURES       from  to/span     description
    pept         10      879     trichosanthin precursor
    sigp         10       78     trichosanthin signal peptide
    matp         79      819     trichosanthin
    variant     196      196     t in wild type; a in allele
    variant     197      197     c in wild type; g in allele
    variant     468      468     t in wild type; a in allele
BASE COUNT      290 a    218 c    205 g    297 t
ORIGIN      
        1 gtcaaaaaga tgatcagatt cttagtcctc tctttgctaa ttctcaccct cttcctaaca
       61 actcctgctg tggagggcga tgttagcttc cgtttatcag gtgcaacaag cagttcctat
      121 ggagttttca tttcaaatct gagaaaagct cttccaaatg aaaggaaact gtacgatatc
      181 cctctgttac gttcctctct tccaggttct caacgctacg cattgatcca tctcacaaat
      241 tacgccgatg aaaccatttc agtggccata gacgtaacga acgtctatat tatgggatat
      301 cgcgctggcg atacatccta ttttttcaac gaggcttctg caacagaagc tgcaaaatat
      361 gtattcaaag acgctatgcg aaaagttacg cttccatatt ctggcaatta cgaaaggctt
      421 caaactgctg caggcaaaat aagggaaaat attccgcttg gactccctgc tttggacagt
      481 gccattacca ctttgtttta ctacaacgcc aattctgctg cgtcggcact tatggtactc
      541 attcagtcga cgtctgaggc tgcgaggtat aaatttattg agcaacaaat tgggaagcgt
      601 gttgacaaaa ccttcctacc aagtttagca attataagtt tggaaaatag ttggtctgct
      661 ctctccaagc aaattcagat agcgagtact aataatggac agtttgaaag tcctgttgtg
      721 cttataaatg ctcaaaacca acgagtcacg ataaccaatg ttgatgctgg agttgtaacc
      781 tccaacatcg cgttgctgct gaatagaaac aatatggcag ccatggatga cgatgttcct
      841 atgacacaga gctttggatg tggaagttat gctatttagt gtaacttcaa gctacgtacg
      901 agtacaaact cccacttgaa gaatctatta tcgtttgaga gtttaatcta cttgtagaaa
      961 taataaagca tgttcgtgtg accgacctac gtggatgctc tgtatgtgtg
//
LOCUS       CIBABI       1989 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  Plasmid ColIB abortive infection protein (abi) gene, complete cds.
ACCESSION   J03314
KEYWORDS    abortive infection protein.
SOURCE      Plasmid ColIB DNA, clone pTP64, isolated from E.coli K-12 strain
            W3110.
  ORGANISM  Plasmid Colicin Ib
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1989)
  AUTHORS   Gupta,S.K. and McCorquodale,D.J.
  TITLE     Nucleotide sequence of a DNA fragment that contains the Abi gene of
            the ColIb plasmid
  JOURNAL   Plasmid 20, 194-206 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.J.McCorquodale, 21-NOV-1988.
FEATURES       from  to/span     description
    pept       1306     1575     abortive infection protein
    binding    1028     1050     LexA binding site
    binding    1100     1121     LexA binding site
    binding    1215     1234     LexA binding site
    binding    1232     1252     LexA binding site
BASE COUNT      476 a    533 c    531 g    449 t
ORIGIN      5bp upstream of PstI site.
        1 ctgcaggtcc gtgccgacca ggtgcttaag gggtggaaaa atatcccgcg cgggatctcc
       61 ctgaccttct ccctgtttgc cgagatcgcc ggccgggaca gggaaaccat cgaccaggcc
      121 tggaaaaata tcttctactc gcaactgagg gaaaaaaaac accgctttta ccaaagatat
      181 cgaggccatc cgcgccctga aaaaactgcc tgccctcacc ggcgacagct ggcgcgggat
      241 ggcatcacgg tgcgtatcta ccgcccggaa aattacgccc gcggcgatgg cggcttacac
      301 tgagcctccc ggaaaattac gccacccaga tgtggaacat cccgttcccg gagcttgaat
      361 accgcctctt taccgccgat ccgggctaca gcgccctgat cagcgccgaa cccgacaggt
      421 gggacaaggc cttccgtttt gtggacgggg tgtgcgagct tcacctttac accaacggtg
      481 tggaaggaag atcacaatcc caccccgctc ggggatgtcg ctcaggcgct gatcaacgtg
      541 gtggaagaaa acctgctgta acggacccgg atgctgcggg cacaactgca tcatcaggag
      601 gatgcaatga aaggacgaca gagccgctat gttaccggcg gagagagttt cgcggagatt
      661 gcccgtctcc cttcaggggc ggtggtgagg ctctgtctga acaccggtct tgaggatgcg
      721 ctgcgggagg cctccaaatc gctcaagtca gccttcaccc gttccgggcg aaaatgccgg
      781 ctgtcagcgg gtacggcgca ggggccgttt accggacgcc ggcaggcgtg gccacacatc
      841 tcttcgtctc ggtactctga gggggcaggg ggcaaaaaaa gtaaaaatgt attcgccagg
      901 ttgcccggag gtgaaggaaa atagacatac agcagaacga cggatagcac tttttgctaa
      961 atggacatca gtattactat gctatagttg ctttaatgga taagtgcgcc ttgacaaagg
     1021 cggtgatttc tgttaacatt actctcatag tattgttccg tcccgctcca ccccaacaag
     1081 atccgtttat ttcccgccag actggttatc accattcagg cccggatttt tttggatttt
     1141 tttccgggga gcccccggac gagcttaaaa tcggtatgac aaacaggagg atgcgaatga
     1201 acacatcata acagagctga aagataaaac attctgtacg gcattaacag cgttcacgtg
     1261 tgtgaggcgc cgggtgcctt ttgacttaaa aacgaggtta ttgagatgac caaaatcaag
     1321 acagttactt ttgtaaatac ttacccggga gggtctatga aaaacttgtt agacaccgag
     1381 ggaacggttc tattcccatt ccagactgaa atccatttta tttggacgat tttctccacc
     1441 gttaaacgcc tggttatcgg aaccagggac catatttgcc agaagcaata ctggagcgcc
     1501 tgtctctgta ttttgcttct tatggcctat gtgggtctct gtgctgcggt ggtctggttt
     1561 gtagtgccct gctgaaggcc tttatagtgt cgaaatttgc ggtttcggca ctatgggtca
     1621 cgccagtaaa gcgcggacta ctctggggta tcggtaaagt ggttaccgcc acttgccgaa
     1681 gatttactct gctaaagtaa gtagccgcaa cgctacacga actgatggtg aatgtcaaca
     1741 gatactcacc atctccttac ggcggtggtc cctgtgacca ctggcctttc gcgtgggtgc
     1801 aacacggcaa aactcctctg tacaacaggc tcccgccgtc attttccggc acaggtgagg
     1861 ccggaattcg gactaaaacg taaaccgcgg gccagtccgg tagcgttcac tatcggccag
     1921 cattctctca accagagaga aatccttttc accgcagaac acgtacgtct ccgcgaactc
     1981 cacctgcag
//
LOCUS       HUMET3       2223 bp ss-mRNA            PRI       25-JUL-1990
DEFINITION  Human endothelin 3 (EDN3) mRNA, complete cds.
ACCESSION   J05081
KEYWORDS    endothelin.
SOURCE      Human adult hypothalamus, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2223)
  AUTHORS   Bloch,K.D., Eddy,R.L., Shows,T.B. and Quertermous,T.
  TITLE     cDNA cloning and chromosomal assignment of the gene encoding
            endothelin 3
  JOURNAL   J. Biol. Chem. 264, 18156-18161 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by K.D.Bloch, 06-OCT-1989.
FEATURES       from  to/span     description
    pept        194      910     endothelin 3 precursor
                                 /hgml_locus_uid="LU0066V" /map="unassigned"
                                 /nomgen="EDN1"
    sigp        194      268     endothelin 3 signal peptide (put.)
    matp        484      544     endothelin 3
    matp        670      712     endothelin-like protein
BASE COUNT      575 a    535 c    583 g    530 t
ORIGIN      
        1 cgggtagcgc gctctgaaag tttatgaccg ccgcagccaa ctcctggccg gagctggaga
       61 cgcagcgagc gatcggccgg cctcgaaccc ccacagctgg agggcgaggc cagctgtacc
      121 cggccccagt gccctttcgc ggccacaagc ggccgtcctc ctggtccggt gctccggcgc
      181 ctgatctagg ttcatggagc cggggctgtg gctccttttc gggctcacag tgacctccgc
      241 cgcaggattc gtgccttgct cccagtctgg ggatgctggc aggcgcggcg tgtcccaggc
      301 ccccactgca gccagatctg agggggactg tgaagagact gtggctggcc ctggcgagga
      361 gactgtggct ggccctggcg aggggactgt ggccccgaca gcactgcagg gtccaagccc
      421 tggaagccct gggcaggagc aggcggccga gggggcccct gagcaccacc gatccaggcg
      481 ctgcacgtgc ttcacctaca aggacaagga gtgtgtctac tattgccacc tggacatcat
      541 ttggatcaac actcccgaac agacggtgcc ctatggactg tccaactaca gaggaagctt
      601 ccggggcaag aggtctgcgg ggccacttcc agggaatctg cagctctcac atcggccaca
      661 cttgcgctgc gcttgtgtgg ggagatatga caaggcctgc ctgcactttt gcacccaaac
      721 tctggacgtc agcagtaatt caaggacggc agaaaaaaca gacaaagaag aggaagggaa
      781 ggttgaagtc aaggaccaac aaagcaagca ggctttagac ctccaccatc caaagctcat
      841 gcccggcagt ggactcgccc tcgctccatc tacctgcccc cgctgcctct ttcaggaagg
      901 agccccttag gaggacaggc ctgcagctcc aatttcatgc aggaaattgg ttttggagag
      961 ttttggcaag ttggaaagcc acttactggc ttttgacatg acttctcttg gagaataagt
     1021 ggactccaag ctaactcttt gcaaatgtaa acacatgtcc atcttgttaa taaatgcaaa
     1081 atgcccgtgc agcagaagca tgcgactttc atatccttgc ctagaatagg ctgcatggtg
     1141 tatgtcagtg agggccacga ggcgtcggct ttagacacag atcatagctc tacaggagtt
     1201 tatgaatttg aagcttatgg gattttggca gagaaatttt cagctgtgct tgatacccac
     1261 caaaagaatg tatctcgaaa gaatgaagga agaagaaaaa aggatccttg atgtttgtga
     1321 caagaaaatg agaaagttag tatctgcaat acagagcttg ttcctgttca gtgactgacc
     1381 ctctgtattc tgtatagaca ccaggccgat acacagtgga gttcccaggc cttgtttgca
     1441 ggaagccgac tgtaaagaca gccccagctc aaggctatta ggttgaatat ttgctttcat
     1501 gagtaaatgt ggatctttgg ggaatggctt caaaataagt cacgaacaca aattctttgt
     1561 aaattatgta aattcctgtt tatataaatt ggcaacaact tataccgtct gacagttcaa
     1621 aatctctttc agctgcgctc ttcccaccga gccgagctta ctgtgagtgt ggagatgtta
     1681 tcccaccatg taaagtcgcc tgcgcagggg agggctgccc atctccccaa cccagtcaca
     1741 gagagatagg aaacggcatt tgagtgggtg tccagggccc cgtagagaga catttaagat
     1801 ggtgtatgac agagcattgg ccttgaccaa atgttaaatc ctctgtgtgt atttcataag
     1861 ttattacagg tataaaagtg atgacctatc atgaggaaat gaaagtggct gatttgctgg
     1921 taggattttg tacagtttag agaagcgatt atttattgtg aaactgttct ccactccaac
     1981 tcctttatgt ggatctgttc aaagtagtca ctgtatatac gtatagagag gtagataggt
     2041 aggtagattt taaattgcat tctgaataca aactcatact ccttagagct tgaattacat
     2101 ttttaaaatg catatgtgct gtttggcacc gtggcaagat ggtatcagag agaaacccat
     2161 caattgctca aatactcaga aagtactgtc aaaagcctaa taaaaaacct aaagtttgct
     2221 ctg
//
LOCUS       HUMSATAA      293 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human alpha satellite DNA, clone pC1.8.
ACCESSION   M26918 J04744
KEYWORDS    alpha satellite DNA; satellite DNA.
SOURCE      Human (cell line HHW423) DNA, clone pC1.8.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 293)
  AUTHORS   Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A.
  TITLE     A human alphoid DNA clone from the EcoRI dimeric family: Genomic
            and internal organization and chromosomal assignment
  JOURNAL   Genomics 5, 822-828 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by A.Baldini, 08-AUG-1989.
FEATURES       from  to/span     description
    rpt           1      293     alpha-satellite
BASE COUNT       84 a     55 c     63 g     91 t
ORIGIN      Chromosomes 1, 5, and 19; centromere.
        1 gatcctttac acagagcaga cttgaaacac tctttttgtg gaatttgcag tggagatttc
       61 aagcgctttg aggccaatgg cagaaaagga aatacttcga tataaaaact agacagaatc
      121 attctcagaa actgctctgc gatgtgtcgg ttcaactctc agagtttaac ttttcttttc
      181 attcagcagt ttggaaacac tctgtttgta aagtctgcaa cgtggatatt tgaccactta
      241 gaggccttcg ttggaaacgg gtttttttcc tgtaaggcta gacagaagaa ttc
//
LOCUS       HUMSATAB      344 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human alpha satellite DNA, clone pC1.8.
ACCESSION   M26919 J04744
KEYWORDS    alpha satellite DNA; satellite DNA.
SOURCE      Human (cell line HHW423) DNA, pC1.8.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 344)
  AUTHORS   Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A.
  TITLE     A human alphoid DNA clone from the EcoRI dimeric family: Genomic
            and internal organization and chromosomal assignment
  JOURNAL   Genomics 5, 822-828 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by A.Baldini, 08-AUG-1989.
FEATURES       from  to/span     description
    rpt           1      344     alpha-satellite
BASE COUNT       98 a     67 c     68 g    111 t
ORIGIN      Chromosomes 1, 5 and 9; centromere.
        1 gaattcccag tagcttcctt gtgttgtgaa cattcaactc acagagttga acgttccctt
       61 agacagagca gatttgaaca ctctttttgt gcaattggca agtggagatt tcaagcgctt
      121 taaggtcaat ggcagaaaag gaaatatctt cgtttcaaaa ctagacagaa tcattcccac
      181 aaactgcgtt gtgatgtgtt cattcaactc acacagttta acctttcttt tcatagagca
      241 gttaggaaac agtctgtttg taaattctct aagtggatat tctgacatct tgtggccttc
      301 gttggaaacg ggatttcttc atattctgct agacagaaga attc
//
LOCUS       HUMSATAC     1049 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human alpha satellite DNA, clone pC1.8.
ACCESSION   M26920 J04744
KEYWORDS    alpha satellite DNA; satellite DNA.
SOURCE      Human (cell line HHW423) DNA, clone pC1.8.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1049)
  AUTHORS   Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A.
  TITLE     A human alphoid DNA clone from the EcoRI dimeric family: Genomic
            and internal organization and chromosomal assignment
  JOURNAL   Genomics 5, 822-828 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by A.Baldini 08-AUG-1989.
FEATURES       from  to/span     description
    rpt           1     1049     alpha-satellite
BASE COUNT      295 a    198 c    221 g    335 t
ORIGIN      Chromosomes 1, 5 and 19; centromere.
        1 aaattttctt ttcatacagc agagtttgga aacactctgt ttgtaaagtc tgcacgtgga
       61 taagttgtcc acttagaggc attcgttgga aacgggtttt tttcatgtaa ggctacacag
      121 aagaattccc agtaacttcc ttgtgttgtg tgtatcaact caaagagttg aacgatcctt
      181 tacacagagc agacttctaa cactcttttt gtggaatttg caagtggaga tttcagccgc
      241 tttgaagtca aaggtagaaa aggaaatatc ttcctataaa aactagacag aatgattctc
      301 agaaactcct ttgtgatgtg tgcgttcaac tcacagagtt taacctttct tttcatagag
      361 cagttaggaa acactctgtt tgtaaagtct gcaagtggat attcagacct ctttgaggcc
      421 ttcgtggaac gggttttcat ataaggctag gcagagaatt cccagtaact tccttgtgtt
      481 gtgtgtgtca actcacagag ttgactttca tttacacaga gcagacttga aacactcttt
      541 ttgtaattgc aagtggagat ttcaagcgct ttgagcaagg ccgaaaagga aatatcttcg
      601 tataaaaact agacagaatc attctcagaa actgctctgc gatgtgtgcg ttcaactctc
      661 agagtttaac ttttcttttc atcagcagtt tggaaacact ctgtttgtaa agtctgcacg
      721 tggatatttt gaccacttag aggccttcgt tggaaacggg tttttttcct gtaaggctag
      781 acagaagaat tccctgtagc ttccttgtgt tgtgtacatt caacgcacag agttgaacgt
      841 tcccttagac agagcagatt tgaaacactc tttttgtgca attggcaagt ggagatttca
      901 ggcgctttaa ggtcaatggc agaaaaggaa atatcttcgt ttcaaaacta gacagaatca
      961 ttcccacaaa ctgcgtggtg atgtgttcgt tcaactcaca gagtttaacc tttcctttca
     1021 tagagcagtt aggaaacagt ctgtttttt
//
LOCUS       PVYCPA       1122 bp ss-RNA             VRL       25-JUL-1990
DEFINITION  Potato virus Y coat protein gene, 3' end.
ACCESSION   M22470
KEYWORDS    coat protein.
SOURCE      Potato virus Y (necrotic strain; isolate New Zealand; N-PVY),
            passed in Nicotiana tabacum cv. Burley 21, cDNA to viral RNA, clone
            PVYN 27.
  ORGANISM  Potato virus Y
            Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; 
            Potyvirus.
REFERENCE   1  (bases 1 to 1122)
  AUTHORS   Hay,J.M., Fellowes,A.P. and Timmerman,G.M.
  TITLE     Nucleotide sequence of the coat protein gene of a necrotic strain
            of potato virus Y from New Zealand
  JOURNAL   Arch. Virol. 107, 111-122 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by J.M.Hay, 09-FEB-1989.
FEATURES       from  to/span     description
    pept    <     1      796     coat protein (AA at 2)
BASE COUNT      351 a    196 c    263 g    312 t
ORIGIN      6 bp upstream of TaqI site.
        1 cacaatcgat gcaggaggaa gcactaaaaa ggatgcaaaa caagagcaag gtagcattca
       61 accaaatttc aacaaggaaa aggaaaagga cgtgaatgtt ggaacatctg gaactcatac
      121 tgtgccacga attaaagcta tcacgtccaa aatgagaatg cccaagagta aaggtgcaat
      181 tgcattaaat ttggaacact tactcgagta tgctccacag caaattgaca tctcaaatac
      241 tcgagcaact caatcacagt ttgatacgtg gtatgaagca gtacaacttg catacgacat
      301 aggagaaact gaaatgccaa ctgtgatgaa tgggcttatg gtttggtgca ttgaaaatgg
      361 aacctcgcca aacatcaacg gagtttgggt tatgatggat ggagatgaac aagtcgaata
      421 cccactaaaa ccaatcgttg agaatgcaaa accaacactt aggcaaatca tggcacattt
      481 ctcagatgtt gcagaagcgt atatagaaat gcgcaacaaa aaggaaccat atatgccacg
      541 atatggttta gttcgtaatc tgcgcgatgg aagtttggct cgctatgctt ttgactttta
      601 tgaagttaca tcacggacac cagtgagggc tagagaggca cacattcaaa tgaaggccgc
      661 agctttaaaa tcagctcaat ctcgactttt cggattggat ggtggcatta gtacacaaga
      721 ggaaaacaca gagaggcaca ccaccgagga tgtttctcca agtatgcata ctctacttgg
      781 agtgaagaac atgtgattgt agtgtctttc cggacgatat atagatattt atgtttgcag
      841 taagtatttt ggcttttcct gtactacttt tatcgaaatt aataatcgtt tgaatattac
      901 tggcagatag gggtggtata gcgattccgt cgttgtagtg accttagctg tcgtttctgt
      961 attattatgt ttgtataaaa gtgccgggtt gttgttgttg tggctgatct atcgattagt
     1021 tgatgttgcg atttgtcgta gcagtgacta tgtctggatt tagttagttg ggtgatgctg
     1081 tgattctgtc atagcagtga ctgtaaactt caatcaggag ac
//
LOCUS       SRAAFPG      2420 bp ds-DNA             VRT       25-JUL-1990
DEFINITION  Sea raven (H.americanus) antifreeze protein type II gene, complete
            cds.
ACCESSION   J05100
KEYWORDS    antifreeze protein.
SOURCE      Sea raven (adult) testes DNA, clone lambda SR7.
  ORGANISM  Hemitripterus americanus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; 
            Actinopterygii; Scorpaeniformes; Cottoidei; Cottidae.
REFERENCE   1  (bases 1 to 2420)
  AUTHORS   Hayes,P.H., Scott,G.K., Ng,N.F.L., Hew,C.L. and Davies,P.L.
  TITLE     Cystine-rich type II antifreeze protein precursor is initiated from
            the third AUG codon of its mRNA
  JOURNAL   J. Biol. Chem. 264, 18761-18767 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.L.Davies, 19-OCT-1989.
FEATURES       from  to/span     description
    pept        434      494     antifreeze protein, exon 2 (first expressed
                                 exon)
               1246     1382     antifreeze protein, exon 3
               1488     1604     antifreeze protein, exon 4
               1697     1805     antifreeze protein, exon 5
               2045     2112     antifreeze protein, exon 6
    pre-msg     226     2112     antifreeze protein mRNA and introns
    IVS         284      389     antifreeze protein intron A
    IVS         495     1245     antifreeze protein intron B
    IVS        1383     1487     antifreeze protein intron C
    IVS        1605     1696     antifreeze protein intron D
    IVS        1806     2044     antifreeze protein intron E
    rpt          74       94     repeat copy A
    rpt          95      114     repeat copy B
    rpt         115      135     repeat copy C
    signal       74      135     antifreeze protein regulatory sequence (put.)
    site        161      164     antifreeze protein CAAT box
    site        195      198     antifreeze protein TATA box
    signal     2368     2373     antifreeze protein polyA signal
BASE COUNT      684 a    475 c    496 g    765 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttcaga aattcactcc tttttctaat attaacttta aagccacagt gtgcgatttg
       61 gagccctttg atttgttgtt ttcaaagttc aaactgttgt ttcaaaattc aaactgttgt
      121 tttcaaagtt caaactgatg ccagtgtcca taataaaaat caatgtatga ataatattgt
      181 gaaatgtaat tgactatata agagctggtc tttctctagt tcagcacatg aatgcagagg
      241 caacaggctg acactgaaac aagagaagat atttctacag caggtttgct ctcagcctct
      301 tcttcgtcct gccgagcccc acaggcactg tgctgccctg ctgtctttgt aattcattgc
      361 aactcttgtg tttttctctt ctgatgcagg gctatcaatc atcttcatcg tctgcaccat
      421 ctctaccacg aggatgctga ctgtgtctct actggtttgt gccatgatgg ctctgactca
      481 agctaatgat gacagtgagt ctcagtctta cattctgtgt gtaggatact atactgtctg
      541 taaatatatt caattgtaga cctattaaga tgctgtgaat attaatatta ggtaatattt
      601 agtttattta tatatgtata tatatttgac agtaataaca aaaaactagg atagattgca
      661 atccgacttt ttgttatctt tattgttaac aatattaaag acataattcc atagaattat
      721 ataatttaca tagaaacagc aaatacaact gtcagagaaa gacttgacag ctaaagcagg
      781 agagatcaag tgtagaaggg agatttgatc tcgtctcaac tgaagctaga actgaatgta
      841 ctaacttatt tttggtgaaa caaccgaata attaattcat ttttccccca caaaactaaa
      901 cgagacgcag accaagctaa gtgtgtgcta acagtaatca gcattcgttt agcaaagtat
      961 tagtaactgc catcacagct tttgactcta gtggaattca tgaaatttgg cagaacaaag
     1021 gagacctgtg cacatctgat tccaatgaga atacaatgtg cttcacagaa aagcacttca
     1081 ccaatcctgt acacattcat aaagccacag aaaaaaagag agctgattaa tcgtcgttcc
     1141 ctctgctctg acaataaaag gattataaac tccagatttc tgataaacag actcggtggc
     1201 ttacctgtga tcagacatgt tacccactct tctgtttgtc ctcagaaata ctcaaaggca
     1261 cggctacaga ggctggaccg gtctctcaga gagccggacc aaactgtccc gctggttggc
     1321 aacctcttgg tgaccgctgt atctattatg agacaacagc gatgacttgg gctctggctg
     1381 aggtagtcag gatatgatta tgattcagat tgcttctaaa ctggtctggt ggtattgcct
     1441 tacatgctcg gttaattgag catgagcttg actcatttcc actgcagaca aactgtatga
     1501 aattgggtgg acaccttgca tccatccaca gccaggagga gcatagtttc attcagacct
     1561 tgaatgctgg tgttgtatgg atcggaggct ccgcttgcct ccaggtaaaa cattgcatta
     1621 caatggtggc agaaagaaag gatttttatt acatgctatc ttactatacg tatattcttt
     1681 cctttctgtt ttctaggcag gtgcttggac ctggtctgat ggtacaccta tgaattttcg
     1741 ttcctggtgt tctaccaaac ctgatgatgt actggccgcg tgctgtatgc agatgactgc
     1801 tgcaggtaaa tcacaacaca ttagagcata gtattaaatg actgaaggca gtagtgttgt
     1861 ttagtacatt tggttcatct tgagatcaat actctcagaa tttcactttt gaatcacttt
     1921 tgttcttcag ttcatgtgta gctttggcct cgttatccgt gtctttgtct gtctagtgat
     1981 gaagacagtt tcaggttagg ttggtatggc gctgactcac ttcttgtgtt tttgatgttt
     2041 acagctgacc aatgctggga tgacttgcct tgtccggcgt cccacaaatc agtctgcgcc
     2101 atgacattct aagctaacac agaggccatc catcacacaa acactttagt gggtgtttga
     2161 ttgtgtgtgt tcgcatactc atctgtgttc gtgtcaacag cctcatgctg aacctgaagg
     2221 ttcaaaatct catatgacat ctttaattct ttgctattgt tggagctgcc tgaaaggatg
     2281 agacgacaag agctggaaag catctgaggg attttaggaa gaaagtgaat ggttatgaaa
     2341 atgatggtct ttttatgtat tatgtcaaat taaaaggctg acacgttgaa acaaactctt
     2401 ctgtgagttt ggcagaattc
//
LOCUS       YSCTFIIDA    2439 bp ds-DNA             PLN       25-JUL-1990
DEFINITION  Yeast (S.cerevisiae) TATA-binding protein (TFIID) gene, complete
            cds.
ACCESSION   M27135
KEYWORDS    DNA binding protein; TATA-binding protein; transcription factor.
SOURCE      Yeast (S.cerevisiae, strain S288C) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2439)
  AUTHORS   Hahn,S., Buratowski,S., Sharp,P.A. and Guarente,L.
  TITLE     Isolation of the gene encoding the yeast TATA binding protein
            TFIID: A gene identical to the SPT15 suppressor of Ty element
            insertions
  JOURNAL   Cell 58, 1173-1181 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Hahn, 10-AUG-1989.
FEATURES       from  to/span     description
    pept       1237     1959     TATA-binding protein
BASE COUNT      737 a    481 c    472 g    749 t
ORIGIN      1 bp upstream of EcoRI site; chromosome 5 right arm after TRP1.
        1 gaattcgttc aagtggtccg taatattccc gtctttacaa agctggatta ccatctctaa
       61 tgccaacttc catgcatata gctcaggccc caccgtgtgc agctccgtgc ttcgcagctc
      121 ctgcagagca tcctcgggga ttgggaacct ctcatttagc aagtaattca cataacacag
      181 atttagaaac catttccatt gtgacttttc ccgacattgc gagagtagcc catgaaaact
      241 cgtcttcacc ctgcggtgct gtttcagctt aatgcaaagc atcacgccga catactggaa
      301 tacggatgcc caattttgat acaactcatc ctgcaaattt accatgtact ggactaattc
      361 attgcaattt cttagtgcaa tcttatagtg gaacttactg tctctcataa gtggcaagtc
      421 atgtaacagc agaaactcgc aacgcatgat ctcttctacc aaatctgtgt cgctctggtg
      481 cgtttgtaac cgttctttca aactggaaat gtaaagctct gctaggtcaa aattatacgt
      541 ctcctgtatc aataactcca ccatctcaaa cgtgacctta ctatcctcca gaactgaaag
      601 cgtacatttc gttttcaata gctgaaacat ctggatagac atgttcatga ggccataata
      661 ctgcttcaac ccttcctcag aaccgatttt attcgcaatt gatatgcatg gtctctgtat
      721 tcctgtgcta agtggtatac ttgtgaaata ctaagtttgt cgccaagatt ttccatgaat
      781 ttgtacttct ttcgaaatcg ttcaatttct accaatactg attcccctct gatagctgag
      841 atgtcgggat tccctttgct gatagatcta actcatctct ttacgtattt taattgtgaa
      901 gccgtaaata gttatcttcc aagtttctct tacgcgagct ttttgggaaa agaaaaaaat
      961 ttgaagatct acatataaaa catggcttca aaggattact aatgactttt tttaccttga
     1021 taggtattct tgatggtaag agtaaacaag ggacgtgaaa attacagtag ttactgtttt
     1081 ttttggacta taagatcggg ggaaagataa cacataagaa ataaaacgac tactagttag
     1141 actgctctgc ggaagaagca aggaagtaaa ggctgcattt tatttttctt ttctagtcca
     1201 acataaacag gtgtatcaag agaaactttt ttaattatgg ccgatgagga acgtttaaag
     1261 gagtttaaag aggcaaacaa gatagtgttt gatccaaata ccagacaagt atgggaaaac
     1321 cagaatcgag atggtacaaa accagcaact actttccaga gtgaagagga cataaaaaga
     1381 gctgccccag aatctgaaaa agacacctcc gccacatcag gtattgttcc aacactacaa
     1441 aacattgtgg caactgtgac tttggggtgc aggttagatc tgaaaacagt tgcgctacat
     1501 gcccgtaatg cagaatataa ccccaagcgt tttgctgctg tcatcatgcg tattagagag
     1561 ccaaaaacta cagctttaat ttttgcctca gggaaaatgg ttgttaccgg tgcaaaaagt
     1621 gaggatgact caaagctggc cagtagaaaa tatgcaagaa ttatccaaaa aatcgggttt
     1681 gctgctaaat tcacagactt caaaatacaa aatattgtcg gttcgtgtga cgttaaattc
     1741 cctatacgtc tagaagggtt agcattcagt catggtactt tctcctccta tgagccagaa
     1801 ttgtttcctg gtttgatcta tagaatggtg aagccgaaaa ttgtgttgtt aatttttgtt
     1861 tcaggaaaga ttgttcttac tggtgcaaag caaagggaag aaatttacca agcttttgaa
     1921 gctatatacc ctgtgctaag tgaatttaga aaaatgtgat ggggaaggag tagacgaaaa
     1981 gaaaaaaagg ttttctattt gttccatttt ctcaattatt aatggtcctc aaagaaataa
     2041 aagaaaagga agaagaagta attgtaatat caaacggttt tttatagtat attcttctta
     2101 ttctatattt atatatcaat gttttataat aagatgttta ttcatagcat atctggtgga
     2161 tcgtctctat taagcgccag cgaggtgttt gcctctgcat ttttcagcaa agcaagctcc
     2221 ctttccagct tgaatctatg ttcacgctca tccgacaatt ctttttcata ctttctttgt
     2281 gtactcgtaa gcactttttt aaactcactt gtcattattg aaagtgaacg tgatccagaa
     2341 ccgcttgtgg ggcttcctac agaggaaggt gaacttggat cccaagtcac tggcgaactc
     2401 gctggtgatg acatgccgaa attatgtctg cttgaattc
//
LOCUS       ECOPUTC       730 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  E.coli putC region encoding proline uptake protein (putP) and
            proline oxidase (putA) genes, 5'ends.
ACCESSION   M35174
KEYWORDS    putA protein; putC region; putP protein.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 730)
  AUTHORS   Nakao,T., Yamato,I. and Anraku,Y.
  TITLE     Nucleotide sequence of putC, the regulatory region for the put
            regulon of Escherichia coli K 12
  JOURNAL   Mol. Gen. Genet. 210, 364-368 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        129  <     1 (c) proline uptake protein (putP)
    pept        549  >   730     proline oxidase (putA)
    pept        209      544     ORF
    mRNA        266  <     1 (c) putP mRNA (alt.)
    mRNA        255  <     1 (c) putP mRNA (alt.)
    mRNA        249  <     1 (c) putP mRNA (alt.)
    mRNA        224  <     1 (c) putP mRNA (alt.)
    mRNA        142  <     1 (c) putP mRNA (alt.)
    mRNA        506  >   730     putA mRNA
BASE COUNT      205 a    173 c    144 g    208 t
ORIGIN      
        1 cccaagacta cgaccgccca gaatatagtc gtcaaagttt ttcgttgatc gccaggcgat
       61 aaacccaatc aatatcatgc caaagatata gacacaaaat gtcaccaaca tcggtgtgct
      121 aatagccatc taaagtctcc aaaaaattat tatcggcaat gtcgaaactt gccgttatat
      181 ctgccaccgg aacggggtaa cagagtttat gttttaccag ggcgaccgta tcctgccgga
      241 agcgctggtt attcacaatc gatttaacac accatttaca ttaaatttta gtgctcagcg
      301 acactatttt tcatcaggtt gcactctctc acattttttg cggttgcacc tttcaaaaat
      361 gttaactgcc gcagagaaaa agtctgagtt atttttttcc ctgtcatatc gatttctttt
      421 attaacattt cattcatttt taagcttgct acgcatgtca catttaacat ggttgcacaa
      481 agttgcaaca tcatggatat ttcacgataa cgttaagttg cacctttctg aacaacagga
      541 gtaatggcat gggaaccacc accatggggg ttaagctgga cgacgcgacg cgtgagcgta
      601 ttaagttcgc cgcgacacgt atcgatcgca caccacactg gttaattaag caggcgattt
      661 tttcttatgc tcgaacaact ggaaaacagc gatactctgc cggagctacc tgcgctgctt
      721 tctggcgcgg
//
LOCUS       FIBGLUC      1426 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene,
            complete cds.
ACCESSION   M33676 M33311
KEYWORDS    1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase.
SOURCE      F.succinogenes (strain S85) DNA, clone PJI5.
  ORGANISM  Fibrobacter succinogenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1426)
  AUTHORS   Teather,R.M. and Erfle,J.D.
  TITLE     DNA sequence of a Fibrobacter succinogenes mixed linkage
            beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene
  JOURNAL   J. Bacteriol. 172, 3837-3841 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.M.Teather, 11-APR-1990.
FEATURES       from  to/span     description
    pept        145     1194     1,3-1,4-beta-D-glucan 4-glucanohydrolase
                                 precursor (EC 3.2.1.73)
    sigp        145      225     1,3-1,4-beta-D-glucan 4-glucanohydrolase
                                 signal peptide
    matp        226     1191     1,3-1,4-beta-D-glucan 4-glucanohydrolase
    binding     132      137     ribosome binding site
    signal       62       66     -35 region
    signal       85       90     -10 region
BASE COUNT      371 a    346 c    335 g    374 t
ORIGIN      
        1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt
       61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt
      121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc
      181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt
      241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg
      301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc
      361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc
      421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct
      481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac
      541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac
      601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt
      661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat
      721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat
      781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt
      841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc
      901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa
      961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct
     1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac
     1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt
     1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg
     1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt
     1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa
     1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt
     1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa
//
LOCUS       BFRRCRRA       89 bp ss-RNA             PHG       25-JUL-1990
DEFINITION  Bacteriophage fr coat protein replicase cistron (R region) RNA.
ACCESSION   M35063
KEYWORDS    coat protein.
SOURCE      Bacteriophage fr RNA.
  ORGANISM  Bacteriophage fr
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Leviviridae.
REFERENCE   1  (bases 1 to 89)
  AUTHORS   Cielens,I.E., Jansone,I.V., Gribanov,V.A., Vishnevskii,Y.I.,
            Berzin,V.M. and Gren,E.J.
  TITLE     Regulator region of phage fr replicase cistron: II. Isolation and
            structure of specific fr RNA fragments
  JOURNAL   Mol. Biol. 16, 886-892 (1982)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         55  >    89     coat protein (R region)
    pept    <     1       20     undefined ORF (AA at 3)
BASE COUNT       34 a     23 c     13 g     19 t
ORIGIN      
        1 ccaactcggg aatctactaa gaaacccgtg ccattccaac aatgaggaat acccatgtca
       61 aaatcaacaa agaagttcaa ctctttatg
//
LOCUS       CHKAGLBB       71 bp ss-mRNA            VRT       25-JUL-1990
DEFINITION  Chicken alpha-globin gene, partial cds.
ACCESSION   M35068
KEYWORDS    alpha-globin.
SOURCE      Chicken (strain white Leghorn) 2-3 week old, cDNA to mRNA, clone
            pHb1003.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 71)
  AUTHORS   Cummings,I.W., Liu,A.Y. and Salser,W.A.
  TITLE     Identification of a new chicken alpha-globin structural gene by
            complementary DNA cloning
  JOURNAL   Nature 276, 418-419 (1978)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >    71     alpha-globin (AA at 1)
BASE COUNT       17 a     22 c     18 g     14 t
ORIGIN      
        1 aagaaggtag tggctgcctt gatcgaggct gccaaccaca ttgatgacat cgccggcacc
       61 ctctccaagc t
//
LOCUS       ECOTGLPA      141 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  E.coli suppressor tRNA-Leu (leuX) precursor gene.
ACCESSION   M35064
KEYWORDS    leuX gene; suppressor transfer RNA-Leu.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 141)
  AUTHORS   Nomura,T. and Ishihama,A.
  TITLE     A novel function of RNase P from Escherichia coli: Processing of a
            suppressor tRNA precursor
  JOURNAL   EMBO J. 7, 3539-3545 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    tRNA         23      114     Leu-tRNA
    anticdn      57       59     Leu-tRNA anticodon caa
    site         35       36     self-cleavage site
BASE COUNT       38 a     34 c     33 g     36 t
ORIGIN      
        1 gttttccgca tacctcttca gtgccgaagt ggcgaaatcg gtagacgcag ttgattcaaa
       61 atcaaccgta gaaatacgtg ccggttcgag tccggccttc ggcaccaaaa gtatgtaaat
      121 agacctcaac tgaggtcttt t
//
LOCUS       HUMFBPC        66 bp ss-mRNA            PRI       25-JUL-1990
DEFINITION  Human folate binding protein mRNA, partial cds.
ACCESSION   M35069
KEYWORDS    folate binding protein.
SOURCE      Human epidermoid carcinoma cell line KB, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Sadasivan,E. and Rothenberg,S.P.
  TITLE     Molecular cloning of the complementary DNA for a human folate
            binding protein
  JOURNAL   Proc. Soc. Exp. Biol. Med. 189, 240-244 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >    66     folate binding protein (AA at 1)
BASE COUNT       21 a     17 c     17 g     11 t
ORIGIN      
        1 acaaggattg catgggccag gactgagctt ctcaatgtct gcatgaacgc caagcaccac
       61 aaggaa
//
LOCUS       HUMMETONA      90 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human met oncogene, middle exon.
ACCESSION   M35073
KEYWORDS    met oncogene; tyrosine kinase.
SOURCE      Human cell line MNNG-HOS DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 90)
  AUTHORS   Dean,M., Park,M., Le Beau,M.M., Robins,T.S., Diaz,M.O.,
            Rowley,J.D., Blair,D.G. and Vande Woude,G.F.
  TITLE     The human met oncogene is related to the tyrosine kinase oncogenes
  JOURNAL   Nature 318, 385-388 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /    22  /    90     met oncogene (AA at 24)
                                 /hgml_locus_uid="LN0032R" /nomgen="MET"
                                 /map="7q31"
    IVS     <     1       21     met oncogene intron
BASE COUNT       28 a     16 c     21 g     25 t
ORIGIN      Chromosome 7q31.
        1 ttggctttgg tcttcaagta gccaaagcga tgaaatatct tgcaagcaaa aagtttgtcc
       61 acagagactt ggctgcaaga aactgtatgt
//
LOCUS       HUMMETONB     375 bp ss-mRNA            PRI       25-JUL-1990
DEFINITION  Human met oncogene mRNA, 3' end.
ACCESSION   M35074
KEYWORDS    met oncogene; tyrosine kinase.
SOURCE      Human cell line MNNG-HOS, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 375)
  AUTHORS   Dean,M., Park,M., Le Beau,M.M., Robins,T.S., Diaz,M.O.,
            Rowley,J.D., Blair,D.G. and Vande Woude,G.F.
  TITLE     The human met oncogene is related to the tyrosine kinase oncogenes
  JOURNAL   Nature 318, 385-388 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      375     met oncogene /hgml_locus_uid="LN0032R"
                                 /nomgen="MET" /map="7q31"
BASE COUNT       94 a     97 c     83 g    101 t
ORIGIN      Chromosome 7q31.
        1 tggtcctttg gcgtcgtcct ctgggagctg atgacaagag gagccccacc ttatcctgac
       61 gtaaacacct ttgatataac tgtttacttg ttgcaaggga gaagactcct acaacccgaa
      121 tactgcccag accccttata tgaagtaatg ctaaaatgct ggcaccctaa agccgaaatg
      181 cgcccatcct tttctgaact ggtgtcccgg atatcagcga tcttctctac tttcattggg
      241 gagcactatg tccatgtgaa cgctacttat gtgaacgtaa aatgtgtcgc tccgtatcct
      301 tctctgttgt catcagaaga taacgctgat gatgaggtgg acacacgacc agcctccttc
      361 tgggagacat catag
//
LOCUS       MS23ENDA      105 bp ss-RNA             PHG       25-JUL-1990
DEFINITION  Bacteriophage MS2 3' terminal fragment.
ACCESSION   M35059
KEYWORDS    .
SOURCE      Bacteriophage MS2 RNA.
  ORGANISM  Bacteriophage MS2
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Leviviridae.
REFERENCE   1  (bases 1 to 105)
  AUTHORS   Contreras,R., Vandenberghe,A., Jou,W.M., De Wachter,R. and Fiers,W.
  TITLE     Studies on the Bacteriophage MS2 nucleotide sequence of a 3'
            terminal fragment (n=104)
  JOURNAL   FEBS Lett. 18, 141-144 (1971)
  STANDARD  simple staff_entry
BASE COUNT       21 a     34 c     30 g     20 t
ORIGIN      
        1 gctccaccga aaggtgggcg ggcttcggcc cagggacccc tccctaaaga gaggacccgg
       61 gattctcccg atttggtaac tagctgcttg gctagttacc accca
//
LOCUS       PEAPCATE     1004 bp ds-DNA             SYN       25-JUL-1990
DEFINITION  Chimaeric gene with P.sativum ribulose 1,5-bisphosphate carboxylase
            5' flank/A.tumefaciens chloramphenicol acetyltransferase gene, 5'
            end.
ACCESSION   M35072
KEYWORDS    .
SOURCE      Recombined Pisum sativum and Agribacterium tumefaciens DNA inserted
            in decapitated tobacco seedlings.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 1004)
  AUTHORS   Herrera-Estrella,L., Van den Broeck,G., Maenhaut,R.,
            Van Montagu,M., Schell,J., Timko,M. and Cashmore,A.
  TITLE     Light-inducible and chloroplast-associated expression of a
            chimaeric gene introduced into Nicotiana tabacum using a Ti plasmid
            vector
  JOURNAL   Nature 310, 115-120 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept       1002  >  1004     chloramphenicol acetyltransferase (CAT)
    recomb      965      966     P.sativum DNA end/CAT DNA start
    signal      943      946     TATA box
BASE COUNT      309 a    176 c    146 g    373 t
ORIGIN      
        1 gaattcaaca ttggctatta ctggttttac aaagtcagac taaggagcat gtccaaccac
       61 tataaggtct ataataggat ttaccttttt ccttagaagc actttaatca actagaaatc
      121 aaagaagcaa aatgtagtgt ctagatcttc atcagaagta aagtatagag ctttagcaaa
      181 cacatcatgt gagacacgat ggtttctata cttgcttcag gatctctgca tttcccatac
      241 ctcgttcatg acaattgcaa accaacctcg tacatttgat gcccataatt tctgaaaacc
      301 aagttgcata cctcttcacc aaaactcttc atcttggtct cttctcctct tttgttcaca
      361 aactaggaat tattaacttt cattctaatt tataggggct gctacaactt aatatatttt
      421 taattatttt tattctctta atttcctttt tttctatttg tttgtcaggt agttgagata
      481 tttgggctaa tctattagag atagtttctc taacaaactt gtaactttgg gtctatatta
      541 gctaatgatt catcttatat tttttcaaat gaatcattaa taaaactttc ctcttttatt
      601 taattttttc aattcagttt catcatcaaa gcaaatgttt ccctgccatc tgtttgtcaa
      661 cactaacatc taatgtactt atctcattag tttaattatt gtttgatcat gtttaatcct
      721 tctagtgttg ttagtttttt cagttagctt aatgggcatc ttacacgtgg cattatccta
      781 ttggtggcaa atgataaggt taggacacac aacttttcaa tcttgtgtgg ttaatatggc
      841 tgcaaagttt atcatttcac aatctaacaa gattggtact aggcagtagc taattaccac
      901 aatattaaga ccataatatt ggaaatagat aaataaaaac attatatata gcaagtttta
      961 gcagaagctt ggcgagattt tcaggagcta aggaagctaa aatg
//
LOCUS       TRFMTTGVA     149 bp ds-DNA             ORG       25-JUL-1990
DEFINITION  C.oncopelti mitochondrion Val-tRNA gene.
ACCESSION   M35071
KEYWORDS    transfer RNA-Val.
SOURCE      C.oncopelti mitochondrial DNA, clone pCo150.
  ORGANISM  Mitochondrion Crithidia oncopelti
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Crithidia oncopelti.
REFERENCE   1  (bases 1 to 149)
  AUTHORS   Entelis,N.S., Maslov,D.A., Bol'shakova,E.V. and Zaitseva,G.N.
  TITLE     Primary structure of an unusual valine tRNA gene from mitochondria
            of Crithidia oncopelti
  JOURNAL   Dokl. Biochem. 297, 435-438 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    tRNA         18       89     Val-tRNA
    anticdn      45       47     Val-tRNA anticodon tac
BASE COUNT       44 a     39 c     16 g     50 t
ORIGIN      
        1 gatctaaaat ccctgttaga cacttgtttt tgcaaacgta taattacgtt ttctacacca
       61 aaacccttta aatccctgtt aggaccccat ttcttcaaat gtataatcac gttttctgcg
      121 tccaaacccc ttaaaaccca gatttcgat
//
LOCUS       YSCTRV2A       75 bp ss-tRNA            RNA       25-JUL-1990
DEFINITION  Yeast (S.cerevisiae, Baker's) Val-tRNA-2a.
ACCESSION   M35070 K01066
KEYWORDS    transfer RNA-Val.
SOURCE      Yeast (S.cerevisiae, Baker's) tRNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Aksel'rod,V.D., Kryukov,V.M., Isaenko,S.N. and Baev,A.A.
  TITLE     Nucleotide sequence in Val-tRNA-2a from Baker's yeast
  JOURNAL   FEBS Lett. 45, 333-336 (1974)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 75)
  AUTHORS   Aksel'rod,V.D., Kryukov,V.M., Isaenko,S.N.. and Baev,A.A.
  TITLE     Primary structure of Val-tRNA-2a from Baker's yeast
  JOURNAL   Mol. Biol. 9, 42-48 (1975)
  STANDARD  simple staff_entry
COMMENT     Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their
            entry 2050 in Nucleic Acids Res. 11, r1-r54 (1983). [1] compared
            given sequence with that of baker's yeast Val-tRNA-1.
FEATURES       from  to/span     description
    tRNA          1       75     Val-tRNA-2a (NAR: 2050)
    anticdn      35       37     Val-tRNA-2a anticodon tac
    modified     10       10     m2g
    modified     16       16     d
    modified     19       19     d
    modified     20       20     d
    modified     27       27     m22g
    modified     28       28     p
    modified     33       33     p
    modified     35       35     unidentified uridine derivative
    modified     46       46     d
    modified     48       48     m5c
    modified     53       53     t
    modified     54       54     p
    modified     57       57     m1a
BASE COUNT       16 a     22 c     20 g     17 t
ORIGIN      5' end of mature tRNA.
        1 ggtccaatgg tccagtggtt caagacgtcg cctttacacg gcgaatcccg agttcgaacc
       61 tcggttggat cacca
//
LOCUS       YSCTRW         75 bp ss-tRNA            RNA       25-JUL-1990
DEFINITION  Yeast (S.cerevisiae) Trp-tRNA-cca.
ACCESSION   M35060 X02698
KEYWORDS    transfer RNA-Trp.
SOURCE      Yeast tRNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Keith,G., Roy,A., Ebel,J.P. and Dirheimer,G.
  TITLE     The nucleotide sequences of two tryptophane-tRNAs from Brewer's
            yeast
  JOURNAL   FEBS Lett. 17, 306-308 (1971)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 75)
  AUTHORS   Keith,G., Roy,A., Ebel,J.-P. and Dirheimer,G.
  TITLE     The primary structure of tryptophan transfer ribonucleic acid from
            Brewer's yeast: II. Partial digestion with pancreatic ribonuclease
            and derivation of complete sequence
  JOURNAL   Biochimie 54, 1417-1426 (1972)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    tRNA          1       75     transfer RNA-Trp
    anticdn      33       35     Trp-tRNA anticodon cca
    modified      9        9     1-methylguanosine
    modified     10       10     2-methylguanosine
    modified     16       16     dihydrouridine
    modified     17       17     2'-O-methylguanosine
    modified     19       19     dihydrouridine
    modified     25       25     pseudouridine
    modified     26       26     pseudouridine
    modified     27       27     pseudouridine
    modified     31       31     2'O-methylcytidine
    modified     33       33     2'O-methylcytidine
    modified     38       38     pseudouridine
    modified     45       45     7-methylguanosine
    modified     47       47     dihydrouridine
    modified     53       53     5-methyluridine (ribosylthymine)
    modified     54       54     pseudouridine
    modified     57       57     1-methyladenosine
    modified     64       64     pot. pseudouridine
BASE COUNT       17 a     18 c     20 g     20 t
ORIGIN      
        1 gaagcggtgg ctcaatggta gagctttcga ctccaaatcg aagggttgca ggttcaattc
       61 ctgtccgttt cacca
//
LOCUS       YSUTRAI        76 bp ss-tRNA            RNA       25-JUL-1990
DEFINITION  Yeast (T.utilis) Ala-tRNA-I.
ACCESSION   M35061 K00143
KEYWORDS    transfer RNA-Ala.
SOURCE      Yeast (T.utilis) tRNA.
  ORGANISM  Candida utilis
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Takemura,S., Ogawa,K. and Nakazawa,K.
  TITLE     Nucleotide sequence of alanine tRNA I from Torulopsis utilis
  JOURNAL   FEBS Lett. 25, 29-32 (1972)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 76)
  AUTHORS   Takemura,S. and Ogawa,K.
  TITLE     The primary structure of alanine transfer ribonucleic acid 1 from
            Torulopsis utilis: II. Partial digestion with ribonuclease T-1 and
            derivation of the complete sequence
  JOURNAL   J. Biochem. 74, 323-333 (1973)
  STANDARD  full staff_review
COMMENT     Contributed on tape April 1983 by M.Sprinzl and D.H.Gauss; from
            their entry 0020 in Nucleic Acids Res. 11, r1-r54 (1983). [1]: The
            cloverleaf model for the secondary structure was compared with that
            of Saccharomyces Ala-tRNA, especially with respect to the
            aminoacyl-tRNA synthetase recognition sites.
FEATURES       from  to/span     description
    tRNA          1       76     Ala-tRNA-I (NAR: 0020)
    anticdn      34       36     Ala-tRNA-I anticodon ggc
    modified      9        9     m1g = 1-methylguanosine
    modified     16       16     d = dihydrouridine
    modified     17       17     d = dihydrouridine
    modified     20       20     d = dihydrouridine
    modified     26       26     m22g = 2,2-dimethylguanosine
    modified     27       27     f = pseudouridine
    modified     34       34     i = inosine
    modified     37       37     m1i = 1-methylinosine
    modified     38       38     f = pseudouridine
    modified     47       47     d = dihydrouridine
    modified     54       54     t = 5-methyluridine
    modified     55       55     f = pseudouridine
    modified     58       58     m1a = 1-methyladenosine
BASE COUNT        9 a     21 c     28 g     18 t
ORIGIN      5' end of mature tRNA
        1 gggcgtgtgg cgtagttggt agcgcgttcg cttggcgtgc gaaaggtctc cggttcgact
       61 ccggactcgt ccacca
//
LOCUS       MUSPTKA       211 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD15.
ACCESSION   M33421
KEYWORDS    protein-tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 211)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   211     protein-tyrosine kinase (AA at 3) (EC
                                 2.7.1.112)
BASE COUNT       47 a     55 c     64 g     45 t
ORIGIN      
        1 ggatccacag ggacctggct gctcggaact gcctggtgac agagaagaat gtcctgaaga
       61 tcagcgactt tgggatgtcc cgcgaagaag ctgatgggat ctatgccgcc tgcagcggcc
      121 tcagacaagt ccctgttaag tggactgccc ctgaggccct taactacgga cgctactcct
      181 cagagagtga tgtgtggagc tttggaattc c
//
LOCUS       MUSPTKB       211 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD16.
ACCESSION   M33422
KEYWORDS    protein-tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD16.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 211)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   211     protein-tyrosine kinase (AA at 3) (EC
                                 2.7.1.112)
BASE COUNT       50 a     50 c     59 g     52 t
ORIGIN      
        1 ggatccacag agaccttgct gctaggaact gcatggatgc cgaagatttc acagtaaaaa
       61 ttggagattt cggtatgaca cgagacatct acgagacgga ctactaccgg aaaggcggga
      121 aggggttgct gcctgtgcgc tggatgtctc tcgagtccct caaggatggt gtcttcacta
      181 ctcattctga cgtctggtcc ttcggaattc c
//
LOCUS       MUSPTKC       214 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD17.
ACCESSION   M33423 M22448 J04523
KEYWORDS    protein-tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD17.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 214)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
REFERENCE   2  (sites)
  AUTHORS   Wilks,A.F.
  TITLE     Two putative protein-tyrosine kinases identified by application of
            the polymerase chain reaction
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1603-1607 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [2] kindly submitted by
            A.Wilks, 08-FEB-1989, for release after publication.
FEATURES       from  to/span     description
    pept    <     1  >   214     protein-tyrosine kinase (AA at 3) (EC
                                 2.7.1.112)
BASE COUNT       69 a     42 c     58 g     45 t
ORIGIN      
        1 ggatccacag ggacctggca acaaggaaca tattggtgga aaatgagaac agggttaaaa
       61 taggagactt cggattaacc aaagtcttgc cgcaggacaa agaatactac aaagtaaagg
      121 agccagggga aagaccgata ttctggtacg cacctgaatc cttgacggag agcaagtttt
      181 ctgtggcctc agatgtctgg tcctttggaa ttcc
//
LOCUS       MUSPTKD       217 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD19.
ACCESSION   M33424
KEYWORDS    protein-tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD19.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 217)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   217     protein-tyrosine kinase (AA at 3) (EC
                                 2.7.1.112)
BASE COUNT       64 a     44 c     57 g     52 t
ORIGIN      
        1 ggatccacag agacttagct gcaagaaact gcatgttgga tgaaaaattc actgtcaagg
       61 ttgctgattt cggtcttgcc agagacatgt acgataaaga gtactatagt gtccacaaca
      121 agacgggtgc caagctacca gtgaagtgga tggctttaga gagtctgcaa aggcagaagt
      181 tcaccaccac gtcagatgtg tggtcctttg gaattcc
//
LOCUS       MUSPTKE       214 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD22.
ACCESSION   M33425 M22447 J04523
KEYWORDS    protein-tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD22.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 214)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
REFERENCE   2  (sites)
  AUTHORS   Wilks,A.F.
  TITLE     Two putative protein-tyrosine kinases identified by application of
            the polymerase chain reaction
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1603-1607 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [2] kindly submitted by
            A.Wilks, 08-FEB-1989, for release after publication.
FEATURES       from  to/span     description
    pept    <     1  >   214     protein-tyrosine kinase (AA at 3) (EC
                                 2.7.1.112)
BASE COUNT       59 a     46 c     55 g     54 t
ORIGIN      
        1 ggatccaccg ggacttagca gcaagaaatg tccttgttga gagtgagcat caagtgaaga
       61 tcggagactt tggtttaacc aaagcaattg aaaccgataa ggagtactac acagtcaagg
      121 acgaccggga cagcccagtg ttctggtacg ctccggagtg tttaatccag tgtaaatttt
      181 atatcgcctc tgacgtctgg tcctttggaa ttcc
//
LOCUS       MUSPTKF       208 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD175.
ACCESSION   M33426
KEYWORDS    protein-tyrosine kinase.
SOURCE      Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD175.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 208)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   208     protein-tyrosine kinase (AA at 3) (EC
                                 2.7.1.112)
BASE COUNT       49 a     53 c     55 g     51 t
ORIGIN      
        1 ggatccaccg tgatctgcga gctgctaacg tcctggtctc tgagtcactc atgtgcaaga
       61 ttgcagactt tggcctcgcg agagtcatcg aagataacga gtacacagca agggaaggtg
      121 cgaagttccc tatcaagtgg acagctccag aggcgttcaa cttcggctgc ttcactatca
      181 aatctgacgt gtggtccttt ggaattcc
//
LOCUS       MUSPTKG       208 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone W3.13.
ACCESSION   M33427
KEYWORDS    protein-tyrosine kinase.
SOURCE      Mouse haemopoietic cell line WEH1-3B D+, cDNA to mRNA, clone W3.13.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 208)
  AUTHORS   Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J.
  TITLE     The application of the polymerase chain reaction to cloning members
            of the protein tyrosine kinase family
  JOURNAL   Gene 85, 67-74 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   208     protein-tyrosine kinase (AA at 3) (EC
                                 2.7.1.112)
BASE COUNT       53 a     44 c     56 g     55 t
ORIGIN      
        1 ggatccacag agacctggct gccagaaatt gtctagtgaa tgaagcagga gttgtcaaag
       61 tatctgattt tggaatggcc aggtacgttc tggatgatca gtacacaagt tcttctggcg
      121 ccaagttccc tgtgaagtgg tgtcccccag aagagtttaa ttacagccgc tttagcagca
      181 agtcagacgt gtggtcctat ggaattcc
//
LOCUS       RATCROS1A    7839 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Rat lung-derived c-ros-1 proto-oncogene mRNA, complete cds.
ACCESSION   M35104
KEYWORDS    c-ros-1 proto-oncogene; tyrosine kinase.
SOURCE      Rat (strain Fischer) lung, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 7839)
  AUTHORS   Matsushime,H. and Shibuya,M.
  TITLE     Tissue-specific expression of rat c-ros-1 gene and partial
            structural similarity of its predicted products with sev protein of
            Drosophila melanogaster
  JOURNAL   J. Virol. 64, 2117-2125 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        402     7355     c-ros-1 tyrosine kinase (put.)
    mRNA    <     1     7839     c-ros-1 mRNA
BASE COUNT     2159 a   1760 c   1887 g   2033 t
ORIGIN      
        1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag
       61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg
      121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag
      181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc
      241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc
      301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac
      361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc
      421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg
      481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca
      541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta
      601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg
      661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca
      721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac
      781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac
      841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc
      901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc
      961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc
     1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac
     1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa
     1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca
     1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta
     1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa
     1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt
     1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt
     1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg
     1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt
     1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct
     1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt
     1681 cctacaatgg ggacaccaaa gctgtgcgta ttgtggagag tggcacatta aaggacttcg
     1741 cagtaaagcc gcagtccaag cgaatcattt acttcaatgg caccatgcaa gtcttcatgt
     1801 cgacatttct ggatggctcg gcattccaca gggttctgcc gtgggtcccc cttgcggatg
     1861 tgaagagctt tgcttgtgaa aacaatgact tcctcatcac agatggcaag gccattttcc
     1921 aacaggactc tctgtctttc aatgagttca tcgtgggatg tgacctgagt cacatagaag
     1981 aatttgggtt tggtaacttg gtcatctttg gctcctccgt ccagtcgtac cctctgccag
     2041 gccatccaca ggaggtctcg gtgctgtttg gttctcgaga ggcccttatt cagtggaagc
     2101 ctccgattct cgccatagga gccagtcctt ccgcctggca gaactggact tatgaggtca
     2161 aagtttcctc ccaggacatt ctggaaacca ctcaagtttt cttgaacata agcaggactg
     2221 tgctgaatgt acccaagctg caaagttcta caaagtacat ggtgtctgtg cgagcaagtt
     2281 ctcctaaagg cccaggccca tggtcagaac cctcagtggg tactaccttg gtaccagcca
     2341 ctgagccacc gttcatcatg gctgtgaaag aagatgggct ttggagcaaa ccactcagta
     2401 gttttggccc aggagagttc ctatcctctg acgtaggaaa cgtgtcagat atggattggt
     2461 ataacaacag cctctactac agtgacacaa aaggcaatgt gtatgtgcgg cctctgaatg
     2521 ggatggatat ctcggagaat taccacatat ccagcattgc aggagcttgt gccttggcct
     2581 ttgaatggct gggtcacttt ctctactggg ctgggaagac atatgtgatt caaaggcagt
     2641 ctgtgttaac gggacacaca gacattgtga ctcacgtgaa gctgttggtg aatgacatgg
     2701 ccgtggatcc agttggtggc tatctgtact ggacgacgct ctactcggtt gaaagcacca
     2761 gactcaatgg agaaagttct cttgtactac aggctcagcc ctggctctct ggaaaaaagg
     2821 ttattgctct aacattagac ctcagcgatg ggctcctgta ctggctggtg caggacaatc
     2881 agtgtattca cctgtacacg gctgttctcc ggggatggag tggtgcggat gctaccatca
     2941 ccgagtttgc agcctggagt acttctgaaa tttcccagaa tgcactgatg tactacagcg
     3001 gtagactctt ctggatcaat ggctttagga tcatcacagc acaggaaata ggtcagagaa
     3061 ccagcgtgtc tgtttctgag ccagggaaat tcaatcagtt tacgatcata cagacatccc
     3121 tcaagcctct gccagggaac ttttcctcta ctcccacggt tatcccagat tctgttcagg
     3181 agtcctcatt tcgaattgaa ggacacactt caagtttccg aatcctgtgg aatgagcccc
     3241 ctgcggtgga ctggggcata gttttctaca gtgtggaatt tagtgctcat tctaagttcc
     3301 tggctattga acaacagtct ttacctgttt ttactgtgga aggactggag ccctatgcct
     3361 tatttaatct ttctgtcact ccttatacct attggggaaa aggtcaaaaa acatctctat
     3421 catttcgagc gcctgaatca gttccgtcag caccagagaa ccccagaata tttatattgt
     3481 cacttggaag atacaccagg aagaatgaag tcgtggtaga gtttaggtgg aataaaccta
     3541 agcatgaaaa tggagtgcta accaaatctg aaatcttcta ccacatatct aaacaaagtg
     3601 gcacaaataa atcaacggaa gactgggtat ctgtcagcgt tacaccgccg gtgatgtctt
     3661 ttcaacttga agccatgagt cctgggtata ttgtttcctt ccaggttcga gtcttcacct
     3721 ccaaagggcc aggaccattt tctgatatag tgatgtctaa aacatcagaa atcaagccat
     3781 gtccatatct catatctctt cttggcaata agattgagtt cttagacatg gaccaaaatc
     3841 aagttgtgtg gacattttcc ctggagggag ccgtcagcac agtggggtac acagcggatg
     3901 atgaaatggg gtatttcgct caaggagatg cactcttcct tctgaatttg cacaatcatt
     3961 ccagctccaa gcttttccag gacgtgctgg cttctgacat tgcggttatt gctgttgact
     4021 ggatcgcaag gcacctctac tttgctctga aagcatcgca agatggaaca cagatattcg
     4081 atgttgacct tgaacacaag gtgaaatccc ccagggaggt gaagatttgc aaaagccata
     4141 cagcaataat ttctttctct atgtatcccc tcttaagtcg cctgtattgg acagaagttt
     4201 cagatctggg ctaccagatg ttctactgca atattagcag tcacaccttg catcacgttc
     4261 tacaacccaa ggcctcaaac cagcatggaa ggagacagtg ttcttgtaat gtgacagaat
     4321 ccgagttaag tggggcaatg actgtggaca cgtctgatcc agacagacct tggatatact
     4381 ttaccaaaca gcaagagatc tgggccatgg atctggaagg atgtcagtgt tggaaagtca
     4441 tcatggtacc tgctacccct ggaaaaagaa tcattagttt aacagtggat ggggagttta
     4501 tatattggat cacaacaatg aaggacgaca cagaaattta tcaagcaaag aagggaagtg
     4561 gggccatcct ctcccaggtg aaggccccca ggagtaagca tatcttggct tacagttcag
     4621 ctctgcaacc ttttccagat aaagcatatc tgtctgtagc ttccaatatg gtagaagcaa
     4681 gtatattgaa tgccaccaac accagcctca ttctcaagtt acctccagtc aagacaaacc
     4741 tcacgtggca tggaattacc actcccacgt caacatacct ggtttactat atggaggcta
     4801 atagggcaaa cagctctgac aggaaacaca acatgttgga atcacaggag aatgtagccc
     4861 ggattgaagg tctgcagcca ttttcaacat acgtgattca gatagctgtg aagaactatt
     4921 attctgatcc tttagaacat ctctctctgg gaaaagagat tcaaggaaaa actaaaagtg
     4981 gagtgcccgg ggcagtttgt catatcaatg caactgtgct gtcggacacc agtcttcttg
     5041 tattctggac agaatcgcat aaaccaaacg gacccaaaga gttagtccgc tatcagttgg
     5101 ttatgtcata cctggctccg attcctgaga ctcctctaag acaggacgaa tttccaagcg
     5161 ccaggctttc tctacttgtc actaaactct ctggtggaca acaatatgtg ctgaagatcc
     5221 ttgcctgcca ctcagaggaa atgtggtgta ctgagagtca tcctgtcagt gtcaacatgt
     5281 ttgacacacc ggagaaacct tctgccttgg ttccagagaa cactagtctg ctgttggatt
     5341 ggaaggctcc gtctaacgct aacctcacca gattttggtt tgaactccag aagtggaagt
     5401 atagtgagtt ttaccatgtc aaggcttcat gcagccaagg tccagtttat gtctgtaaca
     5461 tcgcaaatct gcagccttac actccttata acatccgagt ggtggtggtc tatacgacag
     5521 gagaaaatag ctcctcgatt cccgagagct tcaagacaaa agctggagtc ccaagcaaac
     5581 cagggattcc taagttacta gaagggagta aaaattcaat ccagtgggaa aaagccgaag
     5641 ataacgggaa cagattgatg tactacaccc tggaggtcag aaaaagcatt tcaaatgact
     5701 cacgggacca gagtttaagg tggacggcgg tgtttaatgg gtcctgcagt agcatttgca
     5761 catggaggtc aaaaaaccta aaaggaactt tccagttcag agcagtagcg tcaaatgcta
     5821 ttggatttgg agaatacagt gaaatcagtg aagatattac attagtggaa gatggttttt
     5881 ggataacaga aacaagtttt atacttacta tcatagttgg gatatttctg gttgccacag
     5941 tcccactgac ctttgtctgg catagaagct tgaaaaacca caaagctacc aaggaaggcc
     6001 tctcagttct caacgacaat gaccaagagt tggctgagct tcgaggactg gcggctggag
     6061 tgggcctggc caatgcctgc tatgcagtac atactcttcc aacccaagag gagattgaaa
     6121 gtcttcccgc cttccctcgg gagaagctga gcctgcgcct tctgttggga agtggagctt
     6181 ttggagaagt gtacgagggc acagctgtag acatcctagg acggggaagt ggagaaatca
     6241 aggtggccgt gaagaccctg aagaaaggtt cgacagacca ggagaagatc gagttcctga
     6301 aggaggcaca cctgatgagc aagtttaatc accccaacat tctgaagcag ctgggagtct
     6361 gtctgctgag tgaaccccag tacattatcc tggaactgat ggaaggggga gaccttctaa
     6421 gctatctgcg caaagcccga gggacaacgt tgtctggccc tttactcaca ttggctgacc
     6481 tggtagagct gtgtgtagat atttcaaaag gctgcgtcta cttggagcag atgcacttca
     6541 ttcacaggga tctggcagct cggaattgcc ttgtgtctgt gaaagactat accagtcctc
     6601 gggtagtcaa gatcggtgac tttggtttgg caagggaaat ctataagcat gattattata
     6661 gaaagagagg ggaaggcctg cttcctgtcc ggtggatggc tcctgaaaac ttgatggatg
     6721 gaatcttcac ttcccagtct gatgtatggt cttttggaat tttggtttgg gagattttaa
     6781 ctcttggtca tcaaccttat ccagcgcatt ccaaccttga tgttttaaac tatgtgcaag
     6841 caggagggag actggagcca ccgagaaact gtcctgatga tctgtggaat ttaatgttcc
     6901 gatgttgggc ccaagaacct gaccaaagac ccactttcta taacattcaa gaccagcttc
     6961 agttattcag aaatgtttcc ttaaacaatg tttctcactg tggacaagca gctcctgctg
     7021 gtggagtcat caacaaaggc tttgaaggtg aagacaatga aatggccact ttgaattcag
     7081 atgacacgat gccagttgcc ttgatggaaa ccaggaacca agaaggatta aattatatgg
     7141 tacttgccac aaagtgtagc caaagtgagg atcgttatga gggtcctcta ggctctaagg
     7201 aatctgggtt gcatgatctg aagaaagacg agaggcaacc agcagacaaa gatttctgcc
     7261 agcaaccaca ggtggcttat ggctctcctg gccactctga aggcctgaac tatgcctgtc
     7321 ttgctcacag tggacatgga gatgtgtctg aataatagta tctcatagga aacatagcac
     7381 tgagatgaac actgtattaa gttaaaaaga agaaaggtgg ggtggcagtc tagactctga
     7441 actgacacag ccaagttcca aagttctgat cttggttcca agagccatta tgtttcattc
     7501 agcattctct ttaccagtga cgtaaccttc agtggattat cagaggaacc tgtgtgtgtg
     7561 cggaaatccc aggacaaatc ctaagtctgg gaagaaaaca tcactgtctc tctcctctga
     7621 agccctttac ttcagagcat tgcctgccct ggcaatctta ctaggttcat gcaaggatgt
     7681 gagtggggga ggggccggag tctgctgagg accacctgaa ctacagatta ccttaagagg
     7741 atgcaggaaa caattactca cacaggagga agcagcctgt ggaccatgag gaatcatctg
     7801 gcacgctatt attccaataa aatattccct ttaatcatc
//
LOCUS       RATCROS1B    8010 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Rat lung-derived L01 c-ros-1 proto-oncogene mRNA, complete cds.
ACCESSION   M35105
KEYWORDS    c-ros-1 proto-oncogene; tyrosine kinase.
SOURCE      Rat (strain Fischer) lung, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 8010)
  AUTHORS   Matsushime,H. and Shibuya,M.
  TITLE     Tissue-specific expression of rat c-ros-1 gene and partial
            structural similarity of its predicted products with sev protein of
            Drosophila melanogaster
  JOURNAL   J. Virol. 64, 2117-2125 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        402     5966     c-ros-1 unknown protein
    mRNA    <     1     8010     c-ros-1 mRNA
BASE COUNT     2197 a   1812 c   1930 g   2071 t
ORIGIN      
        1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag
       61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg
      121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag
      181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc
      241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc
      301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac
      361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc
      421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg
      481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca
      541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta
      601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg
      661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca
      721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac
      781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac
      841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc
      901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc
      961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc
     1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac
     1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa
     1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca
     1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta
     1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa
     1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt
     1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt
     1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg
     1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt
     1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct
     1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt
     1681 cctacaatgg ggacaccaaa gctgtgcgta ttgtggagag tggcacatta aaggacttcg
     1741 cagtaaagcc gcagtccaag cgaatcattt acttcaatgg caccatgcaa gtcttcatgt
     1801 cgacatttct ggatggctcg gcattccaca gggttctgcc gtgggtcccc cttgcggatg
     1861 tgaagagctt tgcttgtgaa aacaatgact tcctcatcac agatggcaag gccattttcc
     1921 aacaggactc tctgtctttc aatgagttca tcgtgggatg tgacctgagt cacatagaag
     1981 aatttgggtt tggtaacttg gtcatctttg gctcctccgt ccagtcgtac cctctgccag
     2041 gccatccaca ggaggtctcg gtgctgtttg gttctcgaga ggcccttatt cagtggaagc
     2101 ctccgattct cgccatagga gccagtcctt ccgcctggca gaactggact tatgaggtca
     2161 aagtttcctc ccaggacatt ctggaaacca ctcaagtttt cttgaacata agcaggactg
     2221 tgctgaatgt acccaagctg caaagttcta caaagtacat ggtgtctgtg cgagcaagtt
     2281 ctcctaaagg cccaggccca tggtcagaac cctcagtggg tactaccttg gtaccagcca
     2341 ctgagccacc gttcatcatg gctgtgaaag aagatgggct ttggagcaaa ccactcagta
     2401 gttttggccc aggagagttc ctatcctctg acgtaggaaa cgtgtcagat atggattggt
     2461 ataacaacag cctctactac agtgacacaa aaggcaatgt gtatgtgcgg cctctgaatg
     2521 ggatggatat ctcggagaat taccacatat ccagcattgc aggagcttgt gccttggcct
     2581 ttgaatggct gggtcacttt ctctactggg ctgggaagac atatgtgatt caaaggcagt
     2641 ctgtgttaac gggacacaca gacattgtga ctcacgtgaa gctgttggtg aatgacatgg
     2701 ccgtggatcc agttggtggc tatctgtact ggacgacgct ctactcggtt gaaagcacca
     2761 gactcaatgg agaaagttct cttgtactac aggctcagcc ctggctctct ggaaaaaagg
     2821 ttattgctct aacattagac ctcagcgatg ggctcctgta ctggctggtg caggacaatc
     2881 agtgtattca cctgtacacg gctgttctcc ggggatggag tggtgcggat gctaccatca
     2941 ccgagtttgc agcctggagt acttctgaaa tttcccagaa tgcactgatg tactacagcg
     3001 gtagactctt ctggatcaat ggctttagga tcatcacagc acaggaaata ggtcagagaa
     3061 ccagcgtgtc tgtttctgag ccagggaaat tcaatcagtt tacgatcata cagacatccc
     3121 tcaagcctct gccagggaac ttttcctcta ctcccacggt tatcccagat tctgttcagg
     3181 agtcctcatt tcgaattgaa ggacacactt caagtttccg aatcctgtgg aatgagcccc
     3241 ctgcggtgga ctggggcata gttttctaca gtgtggaatt tagtgctcat tctaagttcc
     3301 tggctattga acaacagtct ttacctgttt ttactgtgga aggactggag ccctatgcct
     3361 tatttaatct ttctgtcact ccttatacct attggggaaa aggtcaaaaa acatctctat
     3421 catttcgagc gcctgaatca gttccgtcag caccagagaa ccccagaata tttatattgt
     3481 cacttggaag atacaccagg aagaatgaag tcgtggtaga gtttaggtgg aataaaccta
     3541 agcatgaaaa tggagtgcta accaaatctg aaatcttcta ccacatatct aaacaaagtg
     3601 gcacaaataa atcaacggaa gactgggtat ctgtcagcgt tacaccgccg gtgatgtctt
     3661 ttcaacttga agccatgagt cctgggtata ttgtttcctt ccaggttcga gtcttcacct
     3721 ccaaagggcc aggaccattt tctgatatag tgatgtctaa aacatcagaa atcaagccat
     3781 gtccatatct catatctctt cttggcaata agattgagtt cttagacatg gaccaaaatc
     3841 aagttgtgtg gacattttcc ctggagggag ccgtcagcac agtggggtac acagcggatg
     3901 atgaaatggg gtatttcgct caaggagatg cactcttcct tctgaatttg cacaatcatt
     3961 ccagctccaa gcttttccag gacgtgctgg cttctgacat tgcggttatt gctgttgact
     4021 ggatcgcaag gcacctctac tttgctctga aagcatcgca agatggaaca cagatattcg
     4081 atgttgacct tgaacacaag gtgaaatccc ccagggaggt gaagatttgc aaaagccata
     4141 cagcaataat ttctttctct atgtatcccc tcttaagtcg cctgtattgg acagaagttt
     4201 cagatctggg ctaccagatg ttctactgca atattagcag tcacaccttg catcacgttc
     4261 tacaacccaa ggcctcaaac cagcatggaa ggagacagtg ttcttgtaat gtgacagaat
     4321 ccgagttaag tggggcaatg actgtggaca cgtctgatcc agacagacct tggatatact
     4381 ttaccaaaca gcaagagatc tgggccatgg atctggaagg atgtcagtgt tggaaagtca
     4441 tcatggtacc tgctacccct ggaaaaagaa tcattagttt aacagtggat ggggagttta
     4501 tatattggat cacaacaatg aaggacgaca cagaaattta tcaagcaaag aagggaagtg
     4561 gggccatcct ctcccaggtg aaggccccca ggagtaagca tatcttggct tacagttcag
     4621 ctctgcaacc ttttccagat aaagcatatc tgtctgtagc ttccaatatg gtagaagcaa
     4681 gtatattgaa tgccaccaac accagcctca ttctcaagtt acctccagtc aagacaaacc
     4741 tcacgtggca tggaattacc actcccacgt caacatacct ggtttactat atggaggcta
     4801 atagggcaaa cagctctgac aggaaacaca acatgttgga atcacaggag aatgtagccc
     4861 ggattgaagg tctgcagcca ttttcaacat acgtgattca gatagctgtg aagaactatt
     4921 attctgatcc tttagaacat ctctctctgg gaaaagagat tcaaggaaaa actaaaagtg
     4981 gagtgcccgg ggcagtttgt catatcaatg caactgtgct gtcggacacc agtcttcttg
     5041 tattctggac agaatcgcat aaaccaaacg gacccaaaga gttagtccgc tatcagttgg
     5101 ttatgtcata cctggctccg attcctgaga ctcctctaag acaggacgaa tttccaagcg
     5161 ccaggctttc tctacttgtc actaaactct ctggtggaca acaatatgtg ctgaagatcc
     5221 ttgcctgcca ctcagaggaa atgtggtgta ctgagagtca tcctgtcagt gtcaacatgt
     5281 ttgacacacc ggagaaacct tctgccttgg ttccagagaa cactagtctg ctgttggatt
     5341 ggaaggctcc gtctaacgct aacctcacca gattttggtt tgaactccag aagtggaagt
     5401 atagtgagtt ttaccatgtc aaggcttcat gcagccaagg tccagtttat gtctgtaaca
     5461 tcgcaaatct gcagccttac actccttata acatccgagt ggtggtggtc tatacgacag
     5521 gagaaaatag ctcctcgatt cccgagagct tcaagacaaa agctggagtc ccaagcaaac
     5581 cagggattcc taagttacta gaagggagta aaaattcaat ccagtgggaa aaagccgaag
     5641 ataacgggaa cagattgatg tactacaccc tggaggtcag aaaaagcatt tcaaatgact
     5701 cacgggacca gagtttaagg tggacggcgg tgtttaatgg gtcctgcagt agcatttgca
     5761 catggaggtc aaaaaaccta aaaggaactt tccagttcag agcagtagcg tcaaatgcta
     5821 ttggatttgg agaatacagt gaaatcagtg aagatattac attagtggaa gatggttttt
     5881 ggataacaga aacaagtttt atacttacta tcatagttgg gatatttctg gttgccacag
     5941 tcccactgac ctttgcctgt cactgaagct ggggctcaca gatcagctag gccggctggc
     6001 caacagatcc ccgagatctg cctgcctctg acctctacct ccaacactgg ggctacagat
     6061 gtgtgctaca ttctcagtat ttaactgggt gctgaggaac caagcacagg tcctcatgct
     6121 cgtaagtctg gcatagaagc ttgaaaaacc acaaagctac caaggaaggc ctctcagttc
     6181 tcaacgacaa tgaccaagag ttggctgagc ttcgaggact ggcggctgga gtgggcctgg
     6241 ccaatgcctg ctatgcagta catactcttc caacccaaga ggagattgaa agtcttcccg
     6301 ccttccctcg ggagaagctg agcctgcgcc ttctgttggg aagtggagct tttggagaag
     6361 tgtacgaggg cacagctgta gacatcctag gacggggaag tggagaaatc aaggtggccg
     6421 tgaagaccct gaagaaaggt tcgacagacc aggagaagat cgagttcctg aaggaggcac
     6481 acctgatgag caagtttaat caccccaaca ttctgaagca gctgggagtc tgtctgctga
     6541 gtgaacccca gtacattatc ctggaactga tggaaggggg agaccttcta agctatctgc
     6601 gcaaagcccg agggacaacg ttgtctggcc ctttactcac attggctgac ctggtagagc
     6661 tgtgtgtaga tatttcaaaa ggctgcgtct acttggagca gatgcacttc attcacaggg
     6721 atctggcagc tcggaattgc cttgtgtctg tgaaagacta taccagtcct cgggtagtca
     6781 agatcggtga ctttggtttg gcaagggaaa tctataagca tgattattat agaaagagag
     6841 gggaaggcct gcttcctgtc cggtggatgg ctcctgaaaa cttgatggat ggaatcttca
     6901 cttcccagtc tgatgtatgg tcttttggaa ttttggtttg ggagatttta actcttggtc
     6961 atcaacctta tccagcgcat tccaaccttg atgttttaaa ctatgtgcaa gcaggaggga
     7021 gactggagcc accgagaaac tgtcctgatg atctgtggaa tttaatgttc cgatgttggg
     7081 cccaagaacc tgaccaaaga cccactttct ataacattca agaccagctt cagttattca
     7141 gaaatgtttc cttaaacaat gtttctcact gtggacaagc agctcctgct ggtggagtca
     7201 tcaacaaagg ctttgaaggt gaagacaatg aaatggccac tttgaattca gatgacacga
     7261 tgccagttgc cttgatggaa accaggaacc aagaaggatt aaattatatg gtacttgcca
     7321 caaagtgtag ccaaagtgag gatcgttatg agggtcctct aggctctaag gaatctgggt
     7381 tgcatgatct gaagaaagac gagaggcaac cagcagacaa agatttctgc cagcaaccac
     7441 aggtggctta tggctctcct ggccactctg aaggcctgaa ctatgcctgt cttgctcaca
     7501 gtggacatgg agatgtgtct gaataatagt atctcatagg aaacatagca ctgagatgaa
     7561 cactgtatta agttaaaaag aagaaaggtg gggtggcagt ctagactctg aactgacaca
     7621 gccaagttcc aaagttctga tcttggttcc aagagccatt atgtttcatt cagcattctc
     7681 tttaccagtg acgtaacctt cagtggatta tcagaggaac ctgtgtgtgt gcggaaatcc
     7741 caggacaaat cctaagtctg ggaagaaaac atcactgtct ctctcctctg aagcccttta
     7801 cttcagagca ttgcctgccc tggcaatctt actaggttca tgcaaggatg tgagtggggg
     7861 aggggccgga gtctgctgag gaccacctga actacagatt accttaagag gatgcaggaa
     7921 acaattactc acacaggagg aagcagcctg tggaccatga ggaatcatct ggcacgctat
     7981 tattccaata aaatattccc tttaatcatc
//
LOCUS       RATCROS1C    7902 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Rat heart-derived c-ros-1 proto-oncogene mRNA, complete cds.
ACCESSION   M35106
KEYWORDS    c-ros-1 proto-oncogene; tyrosine kinase.
SOURCE      Rat (strain Fischer) heart, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 7902)
  AUTHORS   Matsushime,H. and Shibuya,M.
  TITLE     Tissue-specific expression of rat c-ros-1 gene and partial
            structural similarity of its predicted products with sev protein of
            Drosophila melanogaster
  JOURNAL   J. Virol. 64, 2117-2125 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        402     7418     c-ros-1 tyrosine kinase (put.)
    mRNA    <     1     7902     c-ros-1 mRNA
BASE COUNT     2171 a   1775 c   1899 g   2057 t
ORIGIN      
        1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag
       61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg
      121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag
      181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc
      241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc
      301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac
      361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc
      421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg
      481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca
      541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta
      601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg
      661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca
      721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac
      781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac
      841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc
      901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc
      961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc
     1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac
     1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa
     1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca
     1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta
     1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa
     1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt
     1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt
     1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg
     1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt
     1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct
     1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt
     1681 cctacaatgg gtatgtcttt tatctcctaa gagacggcat ttatagagtc catcttcctt
     1741 tgccgtctgt cagggacacc aaagctgtgc gtattgtgga gagtggcaca ttaaaggact
     1801 tcgcagtaaa gccgcagtcc aagcgaatca tttacttcaa tggcaccatg caagtcttca
     1861 tgtcgacatt tctggatggc tcggcattcc acagggttct gccgtgggtc ccccttgcgg
     1921 atgtgaagag ctttgcttgt gaaaacaatg acttcctcat cacagatggc aaggccattt
     1981 tccaacagga ctctctgtct ttcaatgagt tcatcgtggg atgtgacctg agtcacatag
     2041 aagaatttgg gtttggtaac ttggtcatct ttggctcctc cgtccagtcg taccctctgc
     2101 caggccatcc acaggaggtc tcggtgctgt ttggttctcg agaggccctt attcagtgga
     2161 agcctccgat tctcgccata ggagccagtc cttccgcctg gcagaactgg acttatgagg
     2221 tcaaagtttc ctcccaggac attctggaaa ccactcaagt tttcttgaac ataagcagga
     2281 ctgtgctgaa tgtacccaag ctgcaaagtt ctacaaagta catggtgtct gtgcgagcaa
     2341 gttctcctaa aggcccaggc ccatggtcag aaccctcagt gggtactacc ttggtaccag
     2401 ccactgagcc accgttcatc atggctgtga aagaagatgg gctttggagc aaaccactca
     2461 gtagttttgg cccaggagag ttcctatcct ctgacgtagg aaacgtgtca gatatggatt
     2521 ggtataacaa cagcctctac tacagtgaca caaaaggcaa tgtgtatgtg cggcctctga
     2581 atgggatgga tatctcggag aattaccaca tatccagcat tgcaggagct tgtgccttgg
     2641 cctttgaatg gctgggtcac tttctctact gggctgggaa gacatatgtg attcaaaggc
     2701 agtctgtgtt aacgggacac acagacattg tgactcacgt gaagctgttg gtgaatgaca
     2761 tggccgtgga tccagttggt ggctatctgt actggacgac gctctactcg gttgaaagca
     2821 ccagactcaa tggagaaagt tctcttgtac tacaggctca gccctggctc tctggaaaaa
     2881 aggttattgc tctaacatta gacctcagcg atgggctcct gtactggctg gtgcaggaca
     2941 atcagtgtat tcacctgtac acggctgttc tccggggatg gagtggtgcg gatgctacca
     3001 tcaccgagtt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactaca
     3061 gcggtagact cttctggatc aatggcttta ggatcatcac agcacaggaa ataggtcaga
     3121 gaaccagcgt gtctgtttct gagccaggga aattcaatca gtttacgatc atacagacat
     3181 ccctcaagcc tctgccaggg aacttttcct ctactcccac ggttatccca gattctgttc
     3241 aggagtcctc atttcgaatt gaaggacaca cttcaagttt ccgaatcctg tggaatgagc
     3301 cccctgcggt ggactggggc atagttttct acagtgtgga atttagtgct cattctaagt
     3361 tcctggctat tgaacaacag tctttacctg tttttactgt ggaaggactg gagccctatg
     3421 ccttatttaa tctttctgtc actccttata cctattgggg aaaaggtcaa aaaacatctc
     3481 tatcatttcg agcgcctgaa tcagttccgt cagcaccaga gaaccccaga atatttatat
     3541 tgtcacttgg aagatacacc aggaagaatg aagtcgtggt agagtttagg tggaataaac
     3601 ctaagcatga aaatggagtg ctaaccaaat ctgaaatctt ctaccacata tctaaacaaa
     3661 gtggcacaaa taaatcaacg gaagactggg tatctgtcag cgttacaccg ccggtgatgt
     3721 cttttcaact tgaagccatg agtcctgggt atattgtttc cttccaggtt cgagtcttca
     3781 cctccaaagg gccaggacca ttttctgata tagtgatgtc taaaacatca gaaatcaagc
     3841 catgtccata tctcatatct cttcttggca ataagattga gttcttagac atggaccaaa
     3901 atcaagttgt gtggacattt tccctggagg gagccgtcag cacagtgggg tacacagcgg
     3961 atgatgaaat ggggtatttc gctcaaggag atgcactctt ccttctgaat ttgcacaatc
     4021 attccagctc caagcttttc caggacgtgc tggcttctga cattgcggtt attgctgttg
     4081 actggatcgc aaggcacctc tactttgctc tgaaagcatc gcaagatgga acacagatat
     4141 tcgatgttga ccttgaacac aaggtgaaat cccccaggga ggtgaagatt tgcaaaagcc
     4201 atacagcaat aatttctttc tctatgtatc ccctcttaag tcgcctgtat tggacagaag
     4261 tttcagatct gggctaccag atgttctact gcaatattag cagtcacacc ttgcatcacg
     4321 ttctacaacc caaggcctca aaccagcatg gaaggagaca gtgttcttgt aatgtgacag
     4381 aatccgagtt aagtggggca atgactgtgg acacgtctga tccagacaga ccttggatat
     4441 actttaccaa acagcaagag atctgggcca tggatctgga aggatgtcag tgttggaaag
     4501 tcatcatggt acctgctacc cctggaaaaa gaatcattag tttaacagtg gatggggagt
     4561 ttatatattg gatcacaaca atgaaggacg acacagaaat ttatcaagca aagaagggaa
     4621 gtggggccat cctctcccag gtgaaggccc ccaggagtaa gcatatcttg gcttacagtt
     4681 cagctctgca accttttcca gataaagcat atctgtctgt agcttccaat atggtagaag
     4741 caagtatatt gaatgccacc aacaccagcc tcattctcaa gttacctcca gtcaagacaa
     4801 acctcacgtg gcatggaatt accactccca cgtcaacata cctggtttac tatatggagg
     4861 ctaatagggc aaacagctct gacaggaaac acaacatgtt ggaatcacag gagaatgtag
     4921 cccggattga aggtctgcag ccattttcaa catacgtgat tcagatagct gtgaagaact
     4981 attattctga tcctttagaa catctctctc tgggaaaaga gattcaagga aaaactaaaa
     5041 gtggagtgcc cggggcagtt tgtcatatca atgcaactgt gctgtcggac accagtcttc
     5101 ttgtattctg gacagaatcg cataaaccaa acggacccaa agagttagtc cgctatcagt
     5161 tggttatgtc atacctggct ccgattcctg agactcctct aagacaggac gaatttccaa
     5221 gcgccaggct ttctctactt gtcactaaac tctctggtgg acaacaatat gtgctgaaga
     5281 tccttgcctg ccactcagag gaaatgtggt gtactgagag tcatcctgtc agtgtcaaca
     5341 tgtttgacac accggagaaa ccttctgcct tggttccaga gaacactagt ctgctgttgg
     5401 attggaaggc tccgtctaac gctaacctca ccagattttg gtttgaactc cagaagtgga
     5461 agtatagtga gttttaccat gtcaaggctt catgcagcca aggtccagtt tatgtctgta
     5521 acatcgcaaa tctgcagcct tacactcctt ataacatccg agtggtggtg gtctatacga
     5581 caggagaaaa tagctcctcg attcccgaga gcttcaagac aaaagctgga gtcccaagca
     5641 aaccagggat tcctaagtta ctagaaggga gtaaaaattc aatccagtgg gaaaaagccg
     5701 aagataacgg gaacagattg atgtactaca ccctggaggt cagaaaaagc atttcaaatg
     5761 actcacggga ccagagttta aggtggacgg cggtgtttaa tgggtcctgc agtagcattt
     5821 gcacatggag gtcaaaaaac ctaaaaggaa ctttccagtt cagagcagta gcgtcaaatg
     5881 ctattggatt tggagaatac agtgaaatca gtgaagatat tacattagtg gaagatggtt
     5941 tttggataac agaaacaagt tttatactta ctatcatagt tgggatattt ctggttgcca
     6001 cagtcccact gacctttgtc tggcatagaa gcttgaaaaa ccacaaagct accaaggaag
     6061 gcctctcagt tctcaacgac aatgaccaag agttggctga gcttcgagga ctggcggctg
     6121 gagtgggcct ggccaatgcc tgctatgcag tacatactct tccaacccaa gaggagattg
     6181 aaagtcttcc cgccttccct cgggagaagc tgagcctgcg ccttctgttg ggaagtggag
     6241 cttttggaga agtgtacgag ggcacagctg tagacatcct aggacgggga agtggagaaa
     6301 tcaaggtggc cgtgaagacc ctgaagaaag gttcgacaga ccaggagaag atcgagttcc
     6361 tgaaggaggc acacctgatg agcaagttta atcaccccaa cattctgaag cagctgggag
     6421 tctgtctgct gagtgaaccc cagtacatta tcctggaact gatggaaggg ggagaccttc
     6481 taagctatct gcgcaaagcc cgagggacaa cgttgtctgg ccctttactc acattggctg
     6541 acctggtaga gctgtgtgta gatatttcaa aaggctgcgt ctacttggag cagatgcact
     6601 tcattcacag ggatctggca gctcggaatt gccttgtgtc tgtgaaagac tataccagtc
     6661 ctcgggtagt caagatcggt gactttggtt tggcaaggga aatctataag catgattatt
     6721 atagaaagag aggggaaggc ctgcttcctg tccggtggat ggctcctgaa aacttgatgg
     6781 atggaatctt cacttcccag tctgatgtat ggtcttttgg aattttggtt tgggagattt
     6841 taactcttgg tcatcaacct tatccagcgc attccaacct tgatgtttta aactatgtgc
     6901 aagcaggagg gagactggag ccaccgagaa actgtcctga tgatctgtgg aatttaatgt
     6961 tccgatgttg ggcccaagaa cctgaccaaa gacccacttt ctataacatt caagaccagc
     7021 ttcagttatt cagaaatgtt tccttaaaca atgtttctca ctgtggacaa gcagctcctg
     7081 ctggtggagt catcaacaaa ggctttgaag gtgaagacaa tgaaatggcc actttgaatt
     7141 cagatgacac gatgccagtt gccttgatgg aaaccaggaa ccaagaagga ttaaattata
     7201 tggtacttgc cacaaagtgt agccaaagtg aggatcgtta tgagggtcct ctaggctcta
     7261 aggaatctgg gttgcatgat ctgaagaaag acgagaggca accagcagac aaagatttct
     7321 gccagcaacc acaggtggct tatggctctc ctggccactc tgaaggcctg aactatgcct
     7381 gtcttgctca cagtggacat ggagatgtgt ctgaataata gtatctcata ggaaacatag
     7441 cactgagatg aacactgtat taagttaaaa agaagaaagg tggggtggca gtctagactc
     7501 tgaactgaca cagccaagtt ccaaagttct gatcttggtt ccaagagcca ttatgtttca
     7561 ttcagcattc tctttaccag tgacgtaacc ttcagtggat tatcagagga acctgtgtgt
     7621 gtgcggaaat cccaggacaa atcctaagtc tgggaagaaa acatcactgt ctctctcctc
     7681 tgaagccctt tacttcagag cattgcctgc cctggcaatc ttactaggtt catgcaagga
     7741 tgtgagtggg ggaggggccg gagtctgctg aggaccacct gaactacaga ttaccttaag
     7801 aggatgcagg aaacaattac tcacacagga ggaagcagcc tgtggaccat gaggaatcat
     7861 ctggcacgct attattccaa taaaatattc cctttaatca tc
//
LOCUS       HUMFVIIIM      65 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Human mutant coagulation factor VIII exon 13 duplication region.
ACCESSION   M34731
KEYWORDS    coagulation factor VIII.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 65)
  AUTHORS   Murru,S., Casula,L., Pecorara,M., Mori,P., Cao,A. and Pirastu,M.
  TITLE     Illegitimate recombination produced a duplication within the FVIII
            gene in a patient with mild hemophilia A
  JOURNAL   Genomics 7, 115-118 (1990)
  STANDARD  simple staff_review
COMMENT     As a result of illegitimate recombination of two misaligned
            chromosomes, exon 13 of the factor VIII is duplicated in its
            entirety.  The exon undergoes normal splicing and its incorporation
            into the mRNA generates an unstable protein.
FEATURES       from  to/span     description
    recomb       25       26     chromosome DNA end; misaligned chromosome DNA
                                 start
BASE COUNT       26 a      3 c     11 g     25 t
ORIGIN      
        1 aagttttagg ggtacatgtg cacaattagt ttgaaataat ttaattagtt tgaaataatt
       61 taaaa
//
LOCUS       EUBBAIA3     2596 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  Eubacterium sp. baiA3 protein gene, complete cds.
ACCESSION   M34658
KEYWORDS    .
SOURCE      Eubacterium sp. (strain VPI 12708) DNA.
  ORGANISM  Eubacterium sp.
            Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE   1  (bases 135 to 2242)
  AUTHORS   Gopal-Srivastava,R., Mallonee,D.H., White,W.B. and Hylemon,P.B.
  TITLE     Multiple copies of a bile acid-inducible gene in Eubacterium sp.
            strain VPI 12708
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 134; 2243 to 2596)
  AUTHORS   Gopal-Srivastava,R., Mallonee,D.H., White,W.B. and Hylemon,P.B.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by D.H.Mallonee, 24-MAY-1990.
FEATURES       from  to/span     description
    pept       1165     1914     baiA3 protein gene
BASE COUNT      778 a    521 c    633 g    664 t
ORIGIN      
        1 tccctgtgct ttcttctgca gttcataaaa tccgccgcca caaatccaag aatccacaat
       61 agactcagaa gcaaggcgta ttccagcgca tccattggga tattatacaa atagaatagc
      121 aaggcaaata tggccatgat cccggcgtac atgcctattc ccctgatatg atccctgata
      181 tatcttcctg tcaatctcat gcctgcacca tatatcctat tccttttttc gttacgatcc
      241 atttgcattt atcctgtctg atgcggatta tgcatcgtat accgctggcc ttacagagga
      301 taacaaagag gatatggttt tctttaacgt gaaggatgtg atggatactt atccattcgc
      361 caaagaactg gaagaagaat atatcgcgca tgccacagat atctcggacc attattttct
      421 ttatgatgcc cgcgaagaag aacttgcaaa aaaagcaggg gaaccctaca catattcagg
      481 cagggtaggg atgacggcgg acaatccgga acttcttcag gactggaaat atgcgcctgc
      541 cttcaaagtt cttacaaaag gggaggttat gcagatgatt gcggtattcg tgatgcttag
      601 cgcctacatt gcgataattg ccctggcggc aatcggggtt atgacttatg taagaagcgt
      661 taccattgct gtcgataaca ggcagctgtt cgaggatatg aagaagctgg gggccagccg
      721 ggattatgag acgcgggtgg taaaagtaca gcttcgcaag atcttcttat atcccggtat
      781 cgcaggatgc gggatatccc tggtctttac ggtcctgatg ctctttttta acaatatgcg
      841 cctggaaatt gaagaaatca ggctgatcgg aatcgagagc attatgattg gggcatccgc
      901 catcttcctg tacgtactgt accggatctc ttttcggaag atgagaagca tgctggatct
      961 atagggaaac aaaatagtga tagtgtttgc aaactttttg tccatggact gcttatattt
     1021 tgcaattaaa aaagaacttt acaagttgta agatgccgtg tgattttcca atgtcgcgtc
     1081 ctgtaaaatg ttaaagttgt atcaatcgat acgatacttt ggcagatatg ataagccaaa
     1141 ggaaaagaaa ggaaggaaaa gttcatgaaa cttgtacagg acaaaattac aattatcaca
     1201 ggcggaaccc gtggaatcgg attcgcagca gcaaaactct ttattgagaa tggagcaaaa
     1261 gtctccatat ttggcgagac ccaggaagag gtagacacag cgctggctca gttaaaggaa
     1321 ctctatccgg aggaagaggt attaggattc gctccagacc ttacatcaag agatgctgtt
     1381 atggcagcag ttggaacggt tgcacagaag tacggaagac tggatgtcat gatcaacaac
     1441 gcaggcatta caatgaattc tgtattctcc agggtatcag aagaggattt caaaaatata
     1501 atggacatca atgttaacgg cgtattcaat ggcgcatggt ctgcttatca gtgcatgaaa
     1561 gatgcaaagc agggcgttat catcaatacg gcatctgtaa ccggaatcta tggttcctta
     1621 tcaggaatcg gatatcctac cagcaaggcg ggcgtaatcg gcctgactca tggtcttgga
     1681 agagagatta tccgtaagaa catccgtgta gttggcgttg cacctggcgt tgtagataca
     1741 gatatgacga aggggcttcc accggagatc ctggaggact acttgaagac actgccaatg
     1801 aagagaatgc ttaagccgga agagatcgcg aatgtatatc tgttccttgc atccgacctg
     1861 gctagcggca tcacggctac gacgatcagc gtagatgggg cttacaggcc atagaaaaga
     1921 catactgcta ttaattccat agttcatact ccaagaacag gcaggcaaga ggcatttgcg
     1981 ttttagcgcg gatgcccggg cctgcctgat ttaattcagc tggtatatca tgaaattcag
     2041 atatgcggcg aacaggcacc atataaggta ggggatctgt agataggcgg caacaggact
     2101 tatcttgtga aactgatata tcatcagggc tatgaggatg ataagcacga gaagccataa
     2161 aaatgcaaag aggtacatgg aaaagccgaa aaagaatatg ctccagagga agttgaagaa
     2221 cagctggata aaatatagtc gaagcgcctt attcttttca ggagtttcgg attcatagat
     2281 tatataagaa gatatcccca ttaatatata taatatggtc cagacgatgg gaaataggaa
     2341 ggacggagga ctaagaggcg gcttattcaa tgccaaatag gccgccgaat tgccgcttaa
     2401 gagagcagac aaggatcctg ccgcaagagg aataaggata aaaataatga gagcgctttt
     2461 gtttttgatg ttcatatata ccggctccag gcatgacttt caatattata tgaaaaatct
     2521 ccgggaaata tgaacggtat ctccggcttt acttgccgct ctttgacttg cccgccgtct
     2581 ctttgagcag ttccag
//
LOCUS       ECOTRAU      1080 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  E.coli F plasmid transfer operon: traU gene, complete cds;
            traW gene, 3' end; and trbC gene, 5' end.
ACCESSION   M34695
KEYWORDS    periplasmic protein; transfer operon.
SOURCE      E.coli F Plasmid (strain K12; isolate Flac plasmid FLO) DNA, clones
            pKI[182;282;175].
  ORGANISM  Plasmid F
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1080)
  AUTHORS   Moore,D., Maneewannakul,K., Maneewannakul,S., Wu,J.H.,
            Ippen-Ihler,K.A. and Bradley,D.E.
  TITLE     Characterization of the F plasmid conjugative transfer gene traU
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.A.Ippen-Ihler, 25-MAY-1990.
FEATURES       from  to/span     description
    pept         49     1041     traU protein precursor
    sigp          1       66     traU protein signal peptide
    matp         67     1038     traU protein
    pept    <     1       52     traW protein
    pept       1050  >  1080     trbC protein
BASE COUNT      242 a    281 c    311 g    246 t
ORIGIN      Map position 77.9-78.9 units on the genome.
        1 cgatcgcttc ctgaaggtgg aatttattcc ggcagaggag ggcagaaaat gaagcgaagg
       61 ctgtggctgc tgatgttatt ccttttcgcc ggtcatgtcc ctgcggcgtc tgcggattct
      121 gcctgtgagg ggcgttttgt aaacccgatc acagatatct gctggagctg tattttcccg
      181 ctctcgctgg gcagtatcaa agtcagtcag ggcaaggtcc ccgacacggc gaacccgtcg
      241 atgcccattc agatttgtcc ggcaccgccg ccgctgttca ggcgtatcgg gctggccatt
      301 ggttactggg agccgatggc gttgacggac gtcacccggt caccgggatg catggtgaac
      361 ctgggcttca gcctgccggc ttttggtaaa acggcacagg gaacggcgaa aaaggatgag
      421 aagcaggtaa atggggcgtt ctatcacgtt cactggtaca aatacccgct gacgtactgg
      481 ctgaacatca tcacatcgct gggctgtctg gaaggtggtg acatggatat cgcttatctt
      541 tctgaaatcg accccacctg gacggacagc agcctgacca ccattctcaa tccggaagct
      601 gtcatctttg ccaatccgat agcacaggga gcctgcgcag cagatgcgat tgccagcgcc
      661 tttaatatgc ctctcgatgt tctgttctgg tgtgccggtt cgcagggaag tatgtacccg
      721 ttcaatggct gggtgagtaa tgagtccagt ccgttgcagt cctccctgct ggtcagtgaa
      781 cgcatggcgt tcaagctgca ccgtcagggc atgattatgg aaaccatcgg gaaaaataac
      841 gccgtctgta atgaatatcc gtccccaatc ctgcccaaag aacgctggcg ttaccagatg
      901 gtgaatatgt atccggacag cgggcagtgc cacccgttcg ggcgcagcgt gacccgctgg
      961 gaaaccggga aaaatccgcc caacacaaag aaaaacttcg gctacctgat gtggcgtaaa
     1021 cgtaactgtg tcttcctgtg aggtgaatga tgaagctgag tatgaaatct ctggcagcac
//
LOCUS       MUSSMRNAA      74 bp ss-RNA             RNA       25-JUL-1990
DEFINITION  Mouse brain-specific small RNA, clone pABr-4.
ACCESSION   M35067
KEYWORDS    small RNA.
SOURCE      Mouse 17-day fetus, cDNA to RNA, clone pABr-4.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 74)
  AUTHORS   Anzai,K., Kobayashi,S., Suehiro,Y. and Goto,S.
  TITLE     Conservation of the ID sequence and its expression as small RNA in
            rodent brains: Analysis with cDNA for mouse brain-specific small
            RNA
  JOURNAL   Mol. Brain Res. 2, 43-49 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    RNA     <     1  >    74     brain-specific small RNA
BASE COUNT       12 a     16 c     28 g     18 t
ORIGIN      
        1 ggggttgggg atttagctca gtggtagagc gcttgcctag caagcaaggc cctgggttcg
       61 gtcctaagct ctgg
//
LOCUS       MUSSMRNAB      74 bp ss-RNA             RNA       25-JUL-1990
DEFINITION  Mouse brain-specific small RNA, clone pABr-9.
ACCESSION   M36619
KEYWORDS    small RNA.
SOURCE      Mouse 17-day fetus, cDNA to RNA, clone pABr-9.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 74)
  AUTHORS   Anzai,K., Kobayashi,S., Suehiro,Y. and Goto,S.
  TITLE     Conservation of the ID sequence and its expression as small RNA in
            rodent brains: Analysis with cDNA for mouse brain-specific small
            RNA
  JOURNAL   Mol. Brain Res. 2, 43-49 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    RNA     <     1  >    74     brain-specific small RNA
BASE COUNT       11 a     15 c     28 g     20 t
ORIGIN      
        1 ggggttgggg atttagctta gtggtagagc ttgcctagca agcgcaaggc cctgggttcg
       61 gtccttagct ctgg
//
LOCUS       BOVPRLB      1214 bp ds-DNA             MAM       25-JUL-1990
DEFINITION  Bovine prolactin gene, exon 5.
ACCESSION   M34535
KEYWORDS    prolactin.
SOURCE      Bovine pituitary DNA, and cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 628)
  AUTHORS   Carroll,S.M., Narayan,P. and Rottman,F.M.
  TITLE     N-6-methyladenosine resides in an intron-specific region of bovine
            prolactin pre-mRNA
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 629 to 1214)
  AUTHORS   Carroll,S.M., Narayan,P. and Rottman,F.M.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Narayan, 16-MAY-1990.
            
            The cDNA sequence which corresponds to this gene is found in J.B.C.
            257: 678-681 (1982), accession number M25007.
            
               Author address: P.Narayan
               Dept. of Molecular Biology and Microbiology
               School of Medicine
               Case Western University
               Cleveland, OH 44106
FEATURES       from  to/span     description
    pept    /   629      820     prolactin, exon 5 (AA at 629)
    pre-msg <     1      971     prolactin mRNA and intron
    IVS     <     1      628     prolactin intron D
BASE COUNT      400 a    229 c    197 g    388 t
ORIGIN      
        1 gtgagcttca tgaaagcttc cttgctattt tcatgaatga gagaggtgat ttctgtaatg
       61 aggaatgagt tttgaactat ctcactgtac aagaacacaa ttcaggcctt ctttttctag
      121 accggtgtta cataaagcaa gaacctgttc attcatagtg atagattcta ttgtaagtga
      181 attagaattc caccagcaat ttttcacaga ggtatagtct ttcttgaatt gtacagttac
      241 accaaaatct tgcctcttcc tgggtacaga tggctgaaat attttcaagg ataagagaat
      301 tagagaatac aatttgcaag ataaatgttt tcttcaaaat atcccaagat atcctctact
      361 gaaattcagc ttgtattctt tctctattct cctcaaacca caggatgaga atgagaagaa
      421 agaaaagaga agatcaaaac caaatacttg agttctgctt tagtttttat taataaatta
      481 ctaacatata tctgatacac tggctccaaa atccaagtgt agagactttc atgtatcttc
      541 cctaattttt aatttgataa atagaaagaa caaagatgag ctaatactac taaaactcat
      601 aataactcat tatcttttgg atgtttaggt tattcctgga gccaaagaga ctgagcccta
      661 ccctgtgtgg tcaggactcc cgtccctgca aactaaggat gaagatgcac gttattctgc
      721 tttttataac ctgctccact gcctgcgcag ggattcaagc aagattgaca cttaccttaa
      781 gctcctgaat tgcagaatca tctacaacaa caactgctaa gcccacattc catcctatcc
      841 atttctgaga tggttcttaa tgatccattc cctggcaaac ttctctgagc tttatagctt
      901 tgtaatgcat gcttggctct aatgggtttc atcttaaata aaaacagact ctgtagcgat
      961 gtcaaaatct aagactgcaa ttttgtcaat gtttcttatc ttcatttaat agacaatcaa
     1021 atgaaaatcc ttccttatga ttgagagaaa gaacttctga ttaaaatttg tcacaaatag
     1081 cagaaactga cattacaaag accgttaata acttacttta gaatcacagc aaattattct
     1141 ggggtcaagt tattagaatt aaaaattaga taaacattca ttgtgttggt catgctacca
     1201 agaagactga attc
//
LOCUS       RHMCYA       1560 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  R.meliloti adenylate cyclase (cyaA) gene, complete cds.
ACCESSION   M35096
KEYWORDS    adenylate cyclase.
SOURCE      R.meliloti DNA.
  ORGANISM  Rhizobium meliloti
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 1560)
  AUTHORS   Beuve,A., Boesten,B., Crasnier,M., Danchin,A. and O'Gara,F.
  TITLE     Rhizobium meliloti adenylate cyclase is related to eucaryotic
            adenylate and guanylate cyclases
  JOURNAL   J. Bacteriol. 172, 2614-2621 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        885     1466     adenylate cyclase (cyaA)
    binding     871      879     ribosome binding site
BASE COUNT      292 a    512 c    487 g    269 t
ORIGIN      
        1 ggatcctgtt cctggacgcg agcggcctgc agtttgccga acgtcacgct gcctccaacg
       61 gcttcgatcc gaggacgcgg ccctggtacc gcgcggccgt caacggcaag gcgccggtgg
      121 ccatcggtcc ctatgagatg gccaccacag gcaatctcgg gatgaccata tcgcaagcgc
      181 accgcggcaa cccccaaatc gtcatcggcg ccgatgtcgt tctcgatacg atcacggatt
      241 ttctgtcccg cgagcggctg accgacgact cggtttcctt cgtgctcgat gcggtgggac
      301 gaccgatcat ccactccgac tccaccatga tgcggcgcat catggcatcg aagggccggg
      361 accggccggt ggccacgccg caggaggatg gactgatcga gagcatccgg cgcaacccgc
      421 caccggccgg aaaggcaact ctcgtcgaag tcggaaaccg cacctatctc gtcacggtgg
      481 cgccgctcga atcggcattg cttctgtccg ggcaccgggt ggtcgtcgcc gcccctctcg
      541 acgagctgct ggcggccgca aacgagacgc tcgttcaggg acttgccgtc tcgggcgccg
      601 tggtggtggt cgccgttctc ctggccctcg tgcttgcgca tctgatcacg aagtcgctca
      661 accagctcac cgacagcgcc aaccgcctgc aggacctgga tttcgccact cctatcgacg
      721 tttcgtcgca tgtggcggaa atctcgacgc tcaacggcgc aatgaacagg gctcgcgacg
      781 cgatcttcac cttcgcgctc tatgttccga aggagctggt gcgcaagggc atcgaatccg
      841 gccatttcgg cggccgcgcc gcatggcggc aggaggtgac ggcgatgttc accgacatct
      901 acgacttcac caccatcagc gagggccggt cgccggaaga agtggtcgcg atgctctcgg
      961 agtatttcga cctgttcagc gaggtcgtcg ccgcccacga cggaaccatc atccaattcc
     1021 atggagactc ggtctttgcc atgtggaacg cgccggtcgc cgataccagg catgccgagc
     1081 atgcctgtcg atgcgcactc gcggtcgagg agaggctcga ggccttcaat tctgcgcaac
     1141 gcgccagcgg attgccggag ttccgcaccc gcttcggcat ccacaccgga acggccgtcg
     1201 tcggcagcgt cggcgccaag gaacggctgc aatatacggc gatgggcgac acggtgaacg
     1261 tcgcctcgcg gctcgagggc atgaacaagg attacggcac gagcgttctt gcaagcggcg
     1321 cggtggtcgc ccaatgcaaa gacatggtga agttccgccc gctcggcacc gccaaggcaa
     1381 agggccgttc gacggcgctc gacatttacg aagtcgtggg cgtcgtccgc gcggtgaaca
     1441 ctaccgaagc cggaacggcc gcctgaggaa aggcagatgc cgcggcgaac ggcggccccg
     1501 ctgaattcgc ttcgaaactc tgaaagcaaa aaagcccgga aacccgggct ttttttgact
//
LOCUS       ECOCYSD       492 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  E.coli sulfate adenylate transferase (cysD) gene, 5' end.
ACCESSION   M35098
KEYWORDS    sulfate adenylate transferase.
SOURCE      E.coli (strain K-12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 492)
  AUTHORS   Malo,M.S. and Loughlin,R.E.
  TITLE     Promoter elements and regulation of expression of the cysD gene of
            Escherichia coli K-12
  JOURNAL   Gene 87, 127-131 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        412  >   492     sulfate adenylate transferase (cysD)
    mRNA        373  >   492     cysD mRNA
BASE COUNT      141 a    110 c    118 g    123 t
ORIGIN      
        1 ctgcaggagt tccggtcatg cgtcccggaa agaaagtagc aatatgtcgt gcctgagtat
       61 tagcaaaatc gccaggttta ggtgacgagg cgtgtacggg gagaataaag catacgccga
      121 gcgccagggc agcggtacgg tggcgcaatg cggaaaacat agtgagtcct taaataccat
      181 gcaaattttt ttaccgccat agtatgaaac tgccgctgcg ctaaaacaat ttcaaatctt
      241 cctaaacgcc cgaaatccgg tgccttaagc actttttgat attagctttg ccaaatcgtt
      301 attccgttaa ggaactactc attctaattg gtaatttcat tcgttctctt acgctcccta
      361 tagtcgaaac atctgatggc aagaaaatag cggtattgca aaggaacggt tatggatcaa
      421 atacgactta ctcacctgcg gcaactggag gcggaaagca tccacattat tcgcgaggtg
      481 gcggcagaat tc
//
LOCUS       CHPCOX41A     956 bp ds-DNA             PRI       25-JUL-1990
DEFINITION  Chimpanzee cytochrome c oxidase subunit IV (COX4P1) processed
            pseudogene, complete cds.
ACCESSION   M34599
KEYWORDS    cytochrome c oxidase subunit IV; pseudogene.
SOURCE      Chimpanzee DNA, clone lambda-Ch1.
  ORGANISM  Pan troglodytes
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae.
REFERENCE   1  (bases 1 to 956)
  AUTHORS   Lomax,M.I., Welch,M.D., Darras,B.T., Francke,U. and Grossman,L.I.
  TITLE     Novel use of a chimpanzee pseudogene for chromosomal mapping of
            human cytochrome c oxidase subunit IV
  JOURNAL   Gene 86, 209-216 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        216      665     cytochrome c oxidase subunit IV (COXIV)
                                 pseudogene (E.C. 1.9.3.1)
    signal      881      888     poly-A signal
BASE COUNT      268 a    222 c    277 g    189 t
ORIGIN      Chromosome 14q21-qter.
        1 ggtacctcca atcccagcta ctcgggaggc tgaggcagga gaatcacttg aactcgggag
       61 gcggaggttg cagtgagctg agatcacgcc tctgcgctac agcctgggca acaagagcaa
      121 aactccgtct cggaaaagaa aaaaacaaaa aagaactact ggggtcgcgg gacaccgggc
      181 atagagggcg gcggtggtgg ggcagctgcg gcagaatgtt ggctaccagg gtagttagcc
      241 tagttggcaa gcgagcaatt tccaccttgg tgtctgtacg agcacacgga aatgttgtga
      301 agagcgatga ctatgcgctc ccagcttatg tggatcgacg tgactatccc gtacccgatg
      361 tggcccatgt caagcacctg tctgccagac agaaagcctt gaagaagaag gagaaggcct
      421 cctggagcaa ccgctccacg gatgggaaag tcgagttgta tcacattcag ttcaaggaga
      481 gctttgctga gatgaacagg ggcgtgaacg agtggaagat ggttgtgggc gctgccatgt
      541 tcttccttgg cttcacggcg ttcattatca tctgggagaa gcgctgtgtg tacggcccca
      601 tcccgcacac ctttgacaaa gagtgggtgc ccatgcagac caagaggatg ctggacatga
      661 ggtgaacccc tgcagggctt cgccagccaa gtgggactat gacaagaacg agtggaagaa
      721 gtgaacccct gcagggcttc gccagccaag tgggactatg acaagaacga gtggaagaag
      781 tgagagatgc tgtcctgctt ttgagccttg ctctgtcacc tccatactat aactccatgc
      841 ctatttactg gaaacctgtt atgccaaaca gtaccactgc taataaatga ccagtttacc
      901 tgaaagaaaa aaaaaaaaag aactactgaa gtgaaagaaa aatctggaga aagtac
//
LOCUS       CHTMOMPA      682 bp ss-mRNA            BCT       25-JUL-1990
DEFINITION  C.trachomatis outer membrane protein (ompl) gene, 5' end.
ACCESSION   M35099
KEYWORDS    outer membrane protein.
SOURCE      C.trachomatis (serovar L2/434/Bu), cDNA to mRNA.
  ORGANISM  Chlamydia trachomatis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE   1  (bases 1 to 682)
  AUTHORS   Kaul,R., Duncan,M.J.J., Guest,J. and Wenman,W.M.
  TITLE     Expression of the Chlamydia trachomatis major outer membrane
            protein-encoding gene in Escherichia coli: Role of the 3' end in
            mRNA stability
  JOURNAL   Gene 87, 97-103 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        568  >   682     outer membrane protein (ompl) precursor
    sigp        568      633     outer membrane protein (ompl) signal peptide
    matp        634  >   682     outer membrane protein (ompl)
    mRNA          1  >   682     ompl mRNA
BASE COUNT      218 a    144 c    111 g    209 t
ORIGIN      
        1 aaaaacactt tctttgtagt aataaaaacg atttctatca aaacaaattc ttagattttc
       61 ttacaaaaat ctcctctttt cttttagcca aacccccatc ttcgagctat tccaaacaca
      121 aaaatcttag gttttggaaa ttaacaactc ataaaaattg aactgttttg taattaactc
      181 aaaaccctct cattctcaac aatcaacata ttgccaacat ggcttttgct ctcggtttca
      241 cagcgatttt tttcgcaaaa accaagaaca taaaacataa aaagatatac aaaaatggct
      301 ctctgcttta tcgctaaatc aggaggcgct taagggcttc ttcctgggac gaacgttttt
      361 cttatcaact ttacgagaat aagaaaattt tgttatggtc tcgagcattg aacgacatgt
      421 tctcgattaa ggctgctttt acttgcaaga cattcctcag gccattaatt gctacaggac
      481 atcttgtctg gctttaacta ggacgcagtg ccgccagaaa aagatagcga gcacaaagag
      541 agctaattat acaatttaga ggtaagaatg aaaaaactct tgaaatcggt attagtgttt
      601 gccgctttga gttctgcttc ctccttgcaa gctctgcctg tggggaatcc tgctgaacca
      661 agccttatga tcgacggaat tc
//
LOCUS       ECOK99FIM     740 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  E.coli K99 fimbrial subunit gene, complete cds.
ACCESSION   M35282
KEYWORDS    K99 fimbrial subunit.
SOURCE      E.coli (strain K-12 C600) DNA, clones 1, 2, 3, 4 and 5.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 740)
  AUTHORS   Roosendaal,B., Gaastra,W. and de Graaf,F.K.
  TITLE     The nucleotide sequence of the gene encoding the K99 subunit of
            enterotoxigenic Escherichia coli
  JOURNAL   FEMS Microbiol. Lett. 22, 253-258 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         70      615     K99 fimbrial subunit precursor
    sigp         70      135     K99 fimbrial subunit signal peptide
    matp        136      612     K99 fimbrial subunit
    pept        648  >   740     ORF1
BASE COUNT      235 a    133 c    146 g    226 t
ORIGIN      
        1 tagggaatgg ctatgttttc tggtgattcc acggaactaa aaaataatat cgaacaatgg
       61 agaatctaga tgaaaaaaac actgctagct attatcttag gtggtatggc ttttgcgact
      121 accaatgctt ctgcgaatac aggtactatt aacttcaatg gcaaaataac gagtgctact
      181 tgtacaattg accctgaggt caatggtaat cgtacatcaa ctatagatct tgggcaggct
      241 gctattagtg gtcatggcac tgtagtggat tttaaactaa aaccagcgcc cggcagtaat
      301 gactgcctag cgaaaacaaa tgctcgtatt gactggtctg gttctatgaa cagtttaggt
      361 tttaataata cagcttcagg aaatactgct gctaaaggat accatatgac tttgcgcgca
      421 acaaacgttg gaaatgggtc tggtggtgct aatattaata cttcattcac tacggctgaa
      481 tacactcaca cttctgcaat tcagtcattt aactattcag cccagctgaa aaaagatgac
      541 cgcgctccgt ctaatggtgg atataaagct ggcgtattta ctacttcagc atccttctta
      601 gtcacttata tgtaatattt aaagtatttt acattgcggg catatctatg attgcccgca
      661 atattactga tggatattat atgaatagaa aaaaacatca gattttaaaa attttattgt
      721 tgtgtctaat aagcagtaaa
//
LOCUS       ECORRDAA       72 bp ss-rRNA            RNA       25-JUL-1990
DEFINITION  E.coli 16S rRNA fragment.
ACCESSION   M35308
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      E.coli (MRE 600) ribosomal RNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 72)
  AUTHORS   Ehresmann,C., Fellner,P. and Ebel,J.P.
  TITLE     Nucleotide sequences of sections of 16S ribosomal RNA
  JOURNAL   Nature 227, 1321-1323 (1970)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >    72     16S rRNA
BASE COUNT       17 a     16 c     20 g     19 t
ORIGIN      
        1 ggcttggttt gcaagtgtca gatactgtta agcatctgaa atccccgggc taaccctggg
       61 aactgatgac tg
//
LOCUS       ECORRDAB      174 bp ss-rRNA            RNA       25-JUL-1990
DEFINITION  E.coli 16S rRNA fragment.
ACCESSION   M35309
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      E.coli (MRE 600) ribosomal RNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 174)
  AUTHORS   Ehresmann,C., Fellner,P. and Ebel,J.P.
  TITLE     Nucleotide sequences of sections of 16S ribosomal RNA
  JOURNAL   Nature 227, 1321-1323 (1970)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >   158     16S rRNA
BASE COUNT       42 a     40 c     55 g     37 t
ORIGIN      
        1 ggcatgaaga cacactgcta actccgaata cgcacaagcc cgtaatggag cgacggtggg
       61 ccttgttccc gtgccccgat gtggggtgga ggtgactgtg ggttgtgata ttcggggagg
      121 caaaagaagt agcgagtcta accttgctta ccactttgcc taatacggga aacg
//
LOCUS       HPTRRA        117 bp ss-rRNA            RNA       25-JUL-1990
DEFINITION  H.aurantiacus 5S rRNA gene.
ACCESSION   M35310
KEYWORDS    5S ribosomal RNA; ribosomal RNA.
SOURCE      H.aurantiacus (strain Sengas Wie 2) ribosomal RNA.
  ORGANISM  Herpetosiphon aurantiacus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; 
            Cytophagaceae.
REFERENCE   1  (bases 1 to 117)
  AUTHORS   Van den Eynde,H., Stackebrandt,E. and De Wachter,R.
  TITLE     The structure of the 5S ribosomal RNA of a member of the phylum of
            green non-sulfur bacteria and relatives
  JOURNAL   FEBS Lett. 213, 301-303 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      117     5S rRNA
BASE COUNT       22 a     37 c     40 g     18 t
ORIGIN      
        1 tccggtggca atgtcggagg ggtcccaccc gttcccatcc cgaacacgga agttaagccc
       61 tccagagccg atggtactcc gcggggaacc gcgcgggaga gtaggtcgct gccggat
//
LOCUS       HUMCOX4AA     634 bp ss-mRNA            PRI       25-JUL-1990
DEFINITION  Human cytochrome c oxidase subunit IV (COX4) mRNA, complete cds.
ACCESSION   M34600
KEYWORDS    cytochrome c oxidase; cytochrome c oxidase subunit IV.
SOURCE      Human liver, cDNA to mRNA, clones pCOX4.-[111 and 4.2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 634)
  AUTHORS   Lomax,M.I., Welch,M.D., Darras,B.T., Francke,U. and Grossman,L.I.
  TITLE     Novel use of a chimpanzee pseudogene for chromosomal mapping of
            human cytochrome c oxidase subunit IV
  JOURNAL   Gene 86, 209-216 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1      510     cytochrome c oxidase subunit IV (COX4)
                                 /hgml_locus_uid="LS0022W" /nomgen="COX4L2"
                                 /map="16q22-q24"
    mRNA    <     1      634     COX4 mRNA
BASE COUNT      156 a    157 c    180 g    141 t
ORIGIN      
        1 atgttggcta ccagggtatt tagcctagtt ggcaagcgag caatttccac ctctgtgtgt
       61 gtacgagctc atgaaagtgt tgtgaagagc gaagactttt cgctcccagc ttatatggat
      121 cggcgtgacc accccttgcc ggaggtggcc catgtcaagc acctgtctgc cagccagaag
      181 gcactgaagg agaaggagaa ggcctcctgg agcagcctct ccatggatga gaaagtcgag
      241 ttgtatcgca ttaagttcaa ggagagcttt gctgagatga acaggggctc gaacgagtgg
      301 aagacggttg tgggcggtgc catgttcttc atcggtttca ccgcgctcgt tatcatgtgg
      361 cagaagcact atgtgtacgg ccccctcccg caaagctttg acaaagagtg ggtggccaag
      421 cagaccaaga ggatgctgga catgaaggtg aaccccatcc agggcttagc ctccaagtgg
      481 gactacgaaa agaacgagtg gaagaagtga gagatgctgc ctgcgcctgc acctgcgcct
      541 ggctctgtca ccgccatgca actccatgcc tatttactgg aaacctgtta tgccaaacag
      601 ttgtaccact gctaataaat gaccagttta cctg
//
LOCUS       MRGRBMII     2574 bp ds-DNA             VRT       25-JUL-1990
DEFINITION  M.serrator retropseudogene-like repetitive element I (RBMI).
ACCESSION   M35143
KEYWORDS    repetitive sequence.
SOURCE      M.serrator blood DNA.
  ORGANISM  Mergus serrator
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Anseriformes; Anatidae.
REFERENCE   1  (bases 1 to 2574)
  AUTHORS   McHugh,K.P., Madsen,C.S. and de Kloet,S.R.
  TITLE     A highly repeated retropseudogene-like sequence in DNA of the
            redbreasted merganser (Mergus serrator)
  JOURNAL   Gene 87, 193-197 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt           1     2574     retropseudogene-like repetitive element
    pept       1403      909 (c) ORF1
    pept       2118     1426 (c) ORF2
BASE COUNT      691 a    600 c    484 g    790 t      9 others
ORIGIN      
        1 gaattcctca aacacgctgc ggctgcttac ctttaataca cccgttgcat gcgatggagc
       61 tgtatttctt gcttttncct gcactggaag gcttcccttc cttgtcaggt tgtttactgc
      121 cctcactctt ctgcattgct cacatgaaga gccatctgga ggatgggttt cttccttctt
      181 ctcccgggtt atcttctgga aacgaggacc taagtattcc aaggagcctt tcactttcct
      241 ggtgtttctc cttttttttc tttttcttct cctttttctt ctttttctta tgcttgtgat
      301 tggcattgtc aaagtggagc gcacagaaac acaaatcgtg aagtctgaaa gaaacatgca
      361 agttaaaaag agaaaaaaag atgtggcact tgttgcctat atgaaacttt atttttttta
      421 ccacaggtga tgatttgcag catgtcagct attttgtggt gctttgtgca cacgcaactt
      481 acttacttta gatgcagcaa acttaagccc tcagattgaa ggaccatagg ctggtttgta
      541 cacagatcat taaccatggt tagctctgga atacgtgcaa gcagaaaaaa acttttaacc
      601 taatccggaa tggtgtacag atgtgattcg aactatgtgg tctaacgcta gtgctctgac
      661 acaattcagc aatagctttc ctatcttcac tgaacaccta cacacagacc cagccagctg
      721 atgctatcta aataacttag aaactaccag aaaaaaaaaa aaaaaaaaaa gaagaaaaaa
      781 cgagaataaa aaaaaaaagt agaaaaaaaa aaaaaaagga agacatgaga agcacccaga
      841 aatgaattag gataaaaaat tcggagtatg ctggaatcct tgcttacttg gaatccttct
      901 ctgcatgttt aatccttaga cttctttttt cttctagaac ttgttgatat ttttgcattt
      961 ttttcaccac ctaaaagctc cttttctatc tttctgtctt tcctttctat ttcactttca
     1021 ctaccttctg cacgggtata ttttcttttt ctgtttcttt ctgtttcatt tttctggcga
     1081 cagttctcca aatgagctga cacgggtgga agcgcatgtc tttcacgaga atgtcttctg
     1141 gaatgttgct gatgtaccga gcaacgatgc aagtctgctc ggggtgtgct aaagcgacgt
     1201 acatcttcct ctctcaagag ggaactgtga ggccatccgc ttttgtaatg ataactctta
     1261 tgtgacctgc tgtagtaagt tgcagtcgat ttgtcaaagg ctgcatcgcc gtgagacaac
     1321 tttctctctc tactgtctcc tgtcgcatga ggtgaatagt aatcattgta atagctacat
     1381 ctttcccatc tccgagcntt catcctcgat agtatctntc tctgctcaac ttctttgccc
     1441 tttggatcgg taatatctat tgctacctcg ttctgatctt cctccgcttg ccagatctgt
     1501 actttgaata tttgacngct cttctgccat tctcagggct gtttctttca nnnnggaaag
     1561 atctgcacct gcttcccccc cagtgctcct gcttgtgacg cttttgctca acaacttcca
     1621 cgctctgaga acacctcctc ttgctggaag gacctgcttt ttgactctcc ttctcttcag
     1681 taggagcatg ttcctcttgc tttggtaatg ctctttgtca gtgtttttag tctcgncttg
     1741 tatcttggca tctctctgta atagctgagg aggaaaggtt tttagagcta cattcagtgt
     1801 cagacttgag agaggaagct tgccgcaatt tctcaccagg ctcagaagac tctttgccgg
     1861 acaaaacgtt ttcttttgaa atgaggtcac gttcttttca tcttcttgct ttctccttat
     1921 ctccaccgtc atcacattgg tactgtgcga ggtattcatc attccagtag attttcgagg
     1981 gtccgcaaca ctgcacaaaa taaaagcaca tttctcagtt ctgctgaagg acgtgaatat
     2041 taagaggaaa accttccaaa agtcgaacaa acaaacaaaa acctccggac tacaggaaca
     2101 ctctccaaga tatgccattt agaaacctct cctgtcatta ggacaccttc ttcagctcca
     2161 cagaaagggg ttttgccctc ttgcttctga agccattgca ctaaaaagca aacgcagtgc
     2221 tgtctccctc cacatgctgc tctgaataag agccagaata ttcaaaacca ctctctttgt
     2281 tctcccacat agccgaaaaa acaccggttg aaacagagtt ttctacctct cgcccaacaa
     2341 tttacattca catagcctat gactgaaaaa ataaaaggcg gggctgagga ggaacagcca
     2401 gtgttggaaa tgaaaagaag cagcccgttc cttcatagtc ttaagcctat gctactagga
     2461 aaacaaaaca aaacaaaaca aaacaagagg agaggagaac aacagcggga aattttcctg
     2521 ttctccaggt gttaaattgc aaagcctcct ctggaggatc acagctgtga attc
//
LOCUS       MUSLAMB      2789 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse lamin B mRNA, complete cds.
ACCESSION   M35153
KEYWORDS    lamin B.
SOURCE      Mouse liver, cDNA to mRNA, clone FML11-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2789)
  AUTHORS   Hoeger,T.H., Krohne,G. and Franke,W.W.
  TITLE     Amino acid sequence and molecular characterization of murine lamin
            B as deduced from cDNA clones
  JOURNAL   Eur J Cell Biol 47, 283-290 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        256     2019     lamin B
    mRNA    <     1     2789     lamin B mRNA
    site       2768     2773     poly-A signal
BASE COUNT      696 a    657 c    812 g    624 t
ORIGIN      
        1 aataatctta agctcttaca aagagctgcg ggcgggagac tcgcgtccgg cgcacagccg
       61 tctgcgtctc ccggctgccc tggcctcttc ccgcgcgcgc gtgcagtgtg cgtgtacact
      121 cacaaagggc gtctggcggg cgatccgcgg ccctcccgct tcgctctttg tgcggtagcc
      181 ccgccgccac cgccagccca ggtccgctcg atcctcaccg gcctgtggtt tgtaccttcg
      241 gtcccgccgc ccgccatggc gaccgcgacc cccgtgcagc agcagcgggc gggcagccgc
      301 gccagcgccc ccgccacgcc gctcagcccc acgcgcctgt cgcgcctgca ggagaaagag
      361 gagctgcggg agctcaacga ccgcctggct gtgtacatcg ataaggtccg cagcctggag
      421 acggagaaca gcgcgctgca gctgcaggtg accgagcggg aggaggtgcg cggccgcgag
      481 ctcaccggcc tcaaggctct ctacgagacc gagctggccg acgcacgccg cgctctggac
      541 gacacggccc gcgagcgcgc caagcttcag atcgagctgg gcaagttcaa ggccgagcac
      601 gaccagctgc tgctcaatta tgccaagaag gaatctgatc tcagtggagc ccagatcaag
      661 cttcgagagt atgaggcggc actaaactct aaggatgcgg cgctggcaac tgccctaggg
      721 gacaaaaaga gtttagaggg agacttggag gatctgaaag atcagattgc ccagctagaa
      781 gcatccttat ctgccgccaa aaagcagtta gcagatgaaa ctttacttaa agtggatttg
      841 gagaatcgct gtcagagcct tactgaggac ttggagtttc gtaaaaatat gtatgaagag
      901 gagatcaatg agacaaggag gaagcatgag acccgcttgg tggaagtgga ctctgggcgt
      961 cagattgagt atgagtacaa gctggctcaa gccctgcatg agatgcggga gcagcacgac
     1021 gcgcaggtga ggctgtacaa ggaagagctg gagcagacct accacgccaa gcttgagaat
     1081 gccagactct cctcagagat gaacacttcc actgtcaaca gtgcccggga agagctgatg
     1141 gagagccgga tgaggatcga gagcctctcc tcacagctct ctaacctgca gaaagagtct
     1201 agagcgtgtt tggaaaggat ccaggaattg gaggacatgc ttgctaagga gagagacaac
     1261 tcgcgccgca tgctgtctga cagagagaga gagatggcgg agatcaggga ccagatgcag
     1321 cagcagctga gtgattatga gcagctgctg gacgtgaagc tggccctgga catggagatc
     1381 agcgcctaca ggaagctcct ggaaggcgaa gaagagcggt taaagctctc tccaagccct
     1441 tcttcccggg tgaccgtgtc cagagcgtcc tccagtcgca gtgtgcgcac caccagagga
     1501 aagcggaaga gagttgatgt ggaggagtcg gaggcgagca gcagtgttag catttcccac
     1561 tctgcctcag ccacggggaa cgtgtgcatt gaagagatag atgttgatgg gaagtttatt
     1621 cgcttgaaga acacttctga gcaggatcaa ccaatgggag gctgggagat gatcagaaaa
     1681 attggagaca catcagtcag ttacaaatat acctcaagat atgtgctgaa ggctggccag
     1741 actgtcacag tgtgggctgc aaatgctggc gtcacagcca gccctccaac tgacctcatc
     1801 tggaagaacc agaactcttg gggtactggt gaagatgtga aggttatgct taagaattct
     1861 cagggagagg aggttgctca gagaagctct gtcttcaaga ccaccatacc cgaggaggag
     1921 gaggaggagg aggagcccat cggagtggct gtggaggagg agcgtttcca ccagcaggga
     1981 gccccaagag catggaataa aagctgtgcc attatgtgaa cttatcaaga catggtcgat
     2041 cttcctcaag ctagaagcat ggagtcctgt atacagtgca gagccttctc agaagcacat
     2101 gatatttttg tatttccttt atgtgaattt ttaagctgcg aatctgatgg ccttaatttc
     2161 ctttttgaca ctgaaagttt tgtcaaaaga aatcctatcc atacacgttg taagatgtga
     2221 attattgaca ctgagctaac tgtactgttt ggaaaggggc cctcaagttt ttggcatttt
     2281 ttctttcctt tttgtatgtg tgtatgtaat tttttttttt taagttcttt taagagggga
     2341 caaggagggt aagaaaacca ctgcgtgtcc gggcattaat tgaagcttgc tctccctaga
     2401 tgggcggtct gctctcggtc cttctctgct ctctataaaa tggtgctgtc ggggagggag
     2461 gggggaagtt tttcaatata tgaacttttg tatggaattt tttgtaataa gtgatcaggt
     2521 tacaattttt ttaaatagaa aagagaagaa aaacgttgta agaacggaat attaatctag
     2581 tcacccatgt acgcactctg gatggaggtt ctacagagct gttgattggt caactacttc
     2641 tcttacattg ttgactcatg aggggagcgg gcaggcgggt gagggtgggg gaaggctttc
     2701 tcttcaaatt cgctagttga gtttttaaga tagtgtacat gcttacattt cttatccgac
     2761 attaacaaat aaaacgctgt tttcctatt
//
LOCUS       MUSMSTA1     1651 bp ds-DNA             ROD       25-JUL-1990
DEFINITION  Mouse metastatic cell protein (mts1) gene, exons 1 and 2.
ACCESSION   M36578 M35147
KEYWORDS    mts1 protein.
SEGMENT     1 of 2
SOURCE      Mouse metastatic cell line DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1651)
  AUTHORS   Tulchinsky,E.M., Grigorian,M.S., Ebralidze,A.K., Milshina,N.I. and
            Lukanidin,E.M.
  TITLE     Structure of gene mts1, transcribed in metastatic mouse tumor cells
  JOURNAL   Gene 87, 219-223 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1498  /  1638     mts1 protein, exon 2 (first expressed exon)
    pre-msg     255  >  1076     mts1 mRNA and introns
    IVS         293     1482     mts1 intron A
    IVS        1639  >  1651     mts1 intron B
    signal      225      231     TATA box
BASE COUNT      380 a    358 c    460 g    437 t     16 others
ORIGIN      
        1 ttctggctga gctgtggctg cttggtggtg tccaccccat ccaagcctct gccgtgccca
       61 ctggagctca ctcactactt gattgtgcct gctggggagg gagcaggaag cctagatccc
      121 agactgggct ggtcgagggt gctatgacat ttactacatc aaccaacagc aagagcacag
      181 tatccatgtt cccccatcct ctgcatgggc agggcctagc agggtataaa taggtcagat
      241 tgttgggctc tccccaaacc tctctattca gcacttcctc tctcttggtc tggtgagttg
      301 tgttggtctg atagcactgc tagcggcatt agaggctgag gctagggtag aagaaagggg
      361 ggctgctgtg ggggaacaga tgtctttaat aaatccagat gagagattct gatgtggagg
      421 ttcatgtatg tgtgtgtgtg tgtgtgtttt cacgagaatg aaaaccaaaa aaaaaaaaaa
      481 aaaaaaaaaa agtgtataaa tggctacatc tgagctcccg aaggttttga gatactgagg
      541 ctggcttgca tgttgctata gtgtatattg gtggtgcttg ggagtcactg tcatgcatag
      601 gatgctgact cgtgttgctg ggtaatacaa gacagtgtgt ggacactcgg gtacaggaag
      661 caaagcgaag gcatcagtag gcctttttgt tttacagtat ttaaattaca gtttttattt
      721 gtgtgtatga gcgtatgggt tgggctggag caaatgccaa ggcgacattg tgggagccaa
      781 aggacaattt gtgtgggagt caactcgttc cttctagcat gtgggctgtg gggatcaaac
      841 tcaggccttg gagcttggtg gcaagcacct ctacccattg agctatctct ccagcaccct
      901 cctgcagnnn nnnnnnnnnn nntttgtagt gtcttgtttt taattgccct atgaacatat
      961 agcacctagg ccaagaaagc ctagcttccc caccctctcc tcttgcatcc ctacctctgc
     1021 cacttcatct tactcctatt aggcagctgg ggtttttcca cttttttttt gtctgcctct
     1081 gggcaggcag ccagcagccg cgcccaacgc tgggagggag aagaatgggc caggcctgtg
     1141 cttgtggttg agctgtggga gtgagtaagc tgatggaaaa ctgctgttgt tgaggccata
     1201 gctgagaggc acagaaaggt gctggcatag gtctccagag tttgaggggt agctttgcag
     1261 gtttcagagc ccagagcaca tgtgaccttc ttgccaccaa tgggtcccat tcctctgatc
     1321 cccnaggggg tgaggtccat ctcttagaga gttgtgggat agagcactta aaatgggaac
     1381 agaatgagtg tgatttgggt catgctcagc aacacatatc cagttctcaa cacactgttg
     1441 gcgtgggttg gagaatgtta cttttgtgtc tcctgccctt aggtctcaac ggttaccatg
     1501 gcaagaccct tggaggaggc cttggatgta attgtgtcca ccttccacaa atactcaggc
     1561 aaagagggtg acaagttcaa gctgaacaag acagagctca aggagctact gaccagggag
     1621 ctgcctagct tcctgggggt aagtgggtcc t
//
LOCUS       MUSMSTA2      545 bp ds-DNA             ROD       25-JUL-1990
DEFINITION  Mouse mts1 protein gene, exon 2.
ACCESSION   M36579 M35147
KEYWORDS    mts1 protein.
SEGMENT     2 of 2
SOURCE      Mouse metastatic cell line NIH3T3 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 545)
  AUTHORS   Tulchinsky,E.M., Grigorian,M.S., Ebralidze,A.K., Milshina,N.I. and
            Lukanidin,E.M.
  TITLE     Structure of gene mts1, transcribed in metastatic mouse tumor cells
  JOURNAL   Gene 87, 219-223 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    14      178     mts1 protein, exon 2
    pre-msg <     1      315     mts1 mRNA and introns
    IVS     <     1       13     mts1 intron B
    signal      297      302     poly-A signal
BASE COUNT      126 a     97 c    170 g    152 t
ORIGIN      
        1 cttcaacggc cagaaaagga cagatgaagc tgcattccag aaggtgatga gcaacttgga
       61 cagcaacagg gacaatgaag ttgacttcca ggagtactgt gtcttcctgt cctgcattgc
      121 catgatgtgc aatgaattct ttgagggctg cccagataag gagccccgga agaagtgaag
      181 actcctcaga tgaagtgttg gggtgtagtt tgccagtggg ggatcttccc tgttggctgt
      241 gagcatagtg ccttactctg gcttcttcgc acatgtgcac agtgctgagc aaattcaata
      301 aaaggttttg aaactattag ctgttgtctg agagactgga gctatgggct gagggctgtg
      361 gtagagactg ctggaagttg acctgagctt tgtggggcca aactaaaaaa aggtcgggga
      421 gggggtgggt ggcttatttt gagtacattg caagtatgta tttgtgtgtg tcggcttagt
      481 catgcgtgca tgtgtgcgtg cgtgtgtgtt tgtgtgtgtt tacgtgctcc tatatagcaa
      541 ccgag
//
LOCUS       MUSNFH       3959 bp ss-mRNA            ROD       25-JUL-1990
DEFINITION  Mouse neurofilament component (NF-H) mRNA, complete cds.
ACCESSION   M35131
KEYWORDS    neurofilament protein.
SOURCE      Mouse (strain Swiss-Webster) brain, cDNA to mRNA, clones pMuH1,
            pMuH5, and DNA, clone lambda-5A.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3959)
  AUTHORS   Shneidman,P.S., Carden,M.J., Lees,J.F. and Lazzarini,R.A.
  TITLE     The structure of the largest murine neurofilament protein (NF-H) as
            revealed by cDNA and genomic sequences
  JOURNAL   Mol. Brain Res. 4, 217-231 (1988)
  STANDARD  simple staff_review
COMMENT     Nucleotides 1-955 are derived from genomic DNA.
FEATURES       from  to/span     description
    pept        154     3372     neurofilament component (NF-H)
    signal       59       65     TATA box
    signal     3936     3942     NF-H mRNA
BASE COUNT     1104 a   1122 c   1197 g    536 t
ORIGIN      
        1 ggggccgcgg gggaggaggt ggagcccact gccgaggggc cggaccgggc caccgcgata
       61 taaaagagcc ggagtcccag agctgccgca gtgctgcctg ccccgtccca gccccgcact
      121 cccgctccgc tggcggccgc acctgctccg gccatgatga gcttcggcag cgccgatgcg
      181 ctgctgggcg ccccgttcgc gccgctgcac ggaggcggca gcctgcacta ctcgctgagc
      241 cgcaaggcag gcccgggcgg cacgcgctcc gcggccggct cctccagcgg cttccactcg
      301 tgggcgcgga cgtccgtgag ctccgtgtcc gcctcaccca gccgcttccg cggcgccgcc
      361 tcgagcaccg actcgctaga caccctaagc aacggcccag agggctgcgt ggtggcggcg
      421 gtggcggcgc gcagcgagaa ggagcagctg caggctctga acgaccgctt cgcgggctac
      481 atcgacaagg tgaggcagct cgaggcgcac aaccgcagcc tggagggcga ggcggcggcg
      541 ctgcggcagc aacaagccgg ccgcgccgcc atgggcgagc tgtacgagcg cgaggtgcgc
      601 gagatgcgcg gcgccgtgct gcgcctcggg gcggcgcgcg ggcagctgcg cctggagcag
      661 gagcacctgc tggaggacat cgctcacgtc cgccagcggc tggacgagga ggcccggcag
      721 cgtgaggagg cggaggcggc ggcgcgcgcc ctggcgcgct tcgcgcagga ggcggaagcg
      781 gcgcgcgtgg agctgcagaa gaaggcgcag gcgctgcagg aggagtgcgg ctacctgcgg
      841 cgccaccacc aggaggaggt gggcgagctg ctcggtcaga tccagggctg cggggccgcg
      901 caggcgcagg ctcaggccga ggctcgcgac gccctcaagt gcgacgtgac gtcggcgctg
      961 cgggagatcc gcgcgcagct cgaaggccac gcggtgcaga gcacgctgca gtccgaggag
     1021 tggttccgag tgaggttgga ccgactctca gaggcagcca aagtgaacac agatgctatg
     1081 cgctcggccc aagaggagat aactgagtac cggcggcagc tgcaagccag gaccacagag
     1141 ttggaggccc tgaaaagcac caaggagtca ctggagaggc agcgctctga gctagaggac
     1201 cgtcatcagg cagacattgc ctcctaccag gacgctattc agcagctgga cagtgagctg
     1261 agaaacacca agtgggagat ggctgcacag ctccgagagt accaggacct gctcaacgtc
     1321 aagatggccc tggacattga gattgccgct tacagaaagc tcctggaagg cgaagagtgt
     1381 cggattggct ttggtccgag tcccttctct cttactgaag gactcccaaa aattccctcc
     1441 atatccacgc acataaaagt caaaagcgaa gagatgataa aggtagtaga gaaatccgag
     1501 aaggaaactg tgattgtaga aggacagaca gaagagatcc gggtgacgga aggagtgaca
     1561 gaagaggagg acaaagaggc ccaaggtcag gaaggagaag aagcagaaga gggagaagaa
     1621 aaagaagaag aggaaggagc agcagctaca tctccccctg cagaagaggc tgcatctcca
     1681 gaaaaagaaa ccaagtctcg tgtgaaagaa gaggccaagt ccccaggtga ggccaagtcc
     1741 ccaggtgagg ccaagtcccc aggtgaggcc aagtccccag ctgaggccaa gtccccaggt
     1801 gaggccaagt ccccacgtga ggccaagtcc ccaggtgagg ccaagtctcc agctgagccc
     1861 aagtctccag ctgagcccaa gtctccagct gaggccaagt caccagctga gcccaagtct
     1921 ccagctacag tgaagtctcc aggtgaggcc aagtcaccat ctgaggccaa atctccagct
     1981 gaagccaaat ctccagctga ggccaaatct ccagctgagg ccaaatctcc agctgaggcc
     2041 aagtcaccag ctgaagccaa gtcaccagct gaagccaaat ctccagctac agtgaagtct
     2101 ccaggtgagg ccaagtcacc atctgaggcc aaatctccag ctgaagccaa atctccagct
     2161 gaggccaaat ctccagctga ggccaaatct ccagctgagg tcaagtcacc aggtgaggcc
     2221 aagtctccag ctgagcccaa gtcaccagct gaggccaaat ctccagctgc agtgaagtca
     2281 ccagctgagg ccaagtctcc agctgcagtc aagtccccag gtgaggccaa gtccccaggt
     2341 gaggccaagt caccagctga ggccaaatct ccagctgagg ccaagtcacc aattgaggta
     2401 aaatctccag agaaggccaa gacccccgtc aaggaaggag caaaatctcc agctgaggcc
     2461 aagtctcctg agaaggccaa gtcccccgtg aaggaagata tcaagccccc agctgaggcg
     2521 aaatcccctg agaaggccaa gagccccatg aaggaaggag caaagcctcc tgagaaggcc
     2581 aagcctctag atgtgaagtc tccggaagcc cagactccag tacaggagga agcgaacgac
     2641 cccacagaca tcagaccccc tgagcaggtg aaaagtcctg ccaaggagaa ggccaagtcc
     2701 cctgagaagg aagaagccaa gacttctgaa aaggtggctc ccaagaagga agaggtgaag
     2761 tcccctgtga aggaggaggt aaaagccaaa gaacccccaa agaaggtaga agaagagaag
     2821 acactgccta caccaaagac agaggcgaag gagagtaaga aagacgaagc tcccaaggag
     2881 gccccgaagc ccaaggtgga ggagaagaag gaaactccca cggaaaagcc caaggactct
     2941 acagcagaag ccaagaagga agaggctgga gagaagaaga aagccgtggc ctcagaggag
     3001 gagactcctg ccaagttggg tgtgaaggaa gaagctaaac ccaaagagaa gacagagaca
     3061 accaagacag aagcagaaga caccaaggcc aaagaaccta gcaaacccac agagacggaa
     3121 aagccaaaga aagaggagat gccagcggca ccagagaaga aagacaccaa ggaggagaag
     3181 accacagagt ccaggaagcc tgaggagaag cccaaaatgg aggccaaggt caaggaggat
     3241 gacaagagcc tttccaaaga gcctagcaaa cccaagacag aaaaggctga aaaatcctct
     3301 agcacagacc agaaagaaag ccagccccca gagaagacca cagaggacaa ggccaccaag
     3361 ggagagaagt aagagaacaa gagaaacacc cagaatagcc aaagaaactc aggacggtcc
     3421 cagtactcag gggtcggcgt aataaatttt atttcttcct ttccctccgt aagaagaaac
     3481 actgcttaga tggtgggcct gccctcacca aacaggaatt tctattaaga ttaagttagc
     3541 aagagaagat aaccctgagc cttgtccccc acgccgaaaa ccctccccag gtgatggaca
     3601 attatgatag cttcttgtag ccgaacgtga tgtatgctga acgctacgcg taaaacacgc
     3661 gtctaaaaac tgccccctcc tttccaagta agtgcattta tttcctgtat gtccaactga
     3721 cagatgaccg caataatgaa tgagcagtta gaaacgcatt atgcttgaaa tgttgtaacc
     3781 tattcctgaa tgccttcttg ttttccaaag gagtggtcag gcccttgccc agtacacgct
     3841 cctggaagag ctgcagcagg tgaggcaggg cgctggccac tgaaccacgc cagggtgtac
     3901 tctccactga agtccacttt caattgcttc catgcaataa aaccaagtgc ttctgaaat
//
LOCUS       MUSRGCA       350 bp ds-DNA             ROD       25-JUL-1990
DEFINITION  Mouse 18S rRNA gene.
ACCESSION   M35283
KEYWORDS    18S ribosomal RNA; processing factor; ribosomal RNA.
SOURCE      Mouse (strain S100) ribosomal DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 350)
  AUTHORS   Mishima,Y., Katayama,M. and Ogata,K.
  TITLE     Identification of a protein factor and the nucleotide sequence
            required for processing of mouse precursor rRNA
  JOURNAL   J. Biochem. 104, 515-520 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA        325  >   350     18S rRNA
    site        220      220     processing site
BASE COUNT       29 a    128 c    117 g     76 t
ORIGIN      
        1 tcgacgttcc ggctctcccg atgccgaggg gttcgggatt tgtgccgggg acggagggga
       61 gagcgggtaa gagaggtgtc ggagagctgt cccggggcga cgctcgggtt ggctttgccg
      121 cgtgcgtgtg ctcgcggcgg gttttgtcgg accccgacgg ggtcggtccg gccgcatgca
      181 ctctcccgtt ccgcgcgagc gccgcccggc tcacccccgg tttgtcctcc cgcgaggctc
      241 tccgccgccg cctcctcctc ctctctcgcg ctctctgttc cgcctggtcc tgtcccaccc
      301 ccgacggctt cgctcgcgct tccttacctg gttgatcctg ccagtagcat
//
LOCUS       MYCRDNAA      190 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  M.hyorhinis A-repeat sequence DNA.
ACCESSION   M35303
KEYWORDS    A-repeat.
SOURCE      M.hyorhinis DNA, clone pG102.1.
  ORGANISM  Mycoplasma hyorhinis
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 190)
  AUTHORS   Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
  TITLE     Reiterated DNA sequences defining genomic diversity within the
            species Mycoplasma hyorhinis
  JOURNAL   Mol. Microbiol. 2, 665-672 (1988)
  STANDARD  simple staff_review
BASE COUNT       94 a     15 c     31 g     50 t
ORIGIN      
        1 gaattcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt
       61 atttttgaag aaaataaagc aagatatgaa aaaaggagaa tcaaagctga acttaataat
      121 agaggctata aaattggact taaaaaagtt cacagattat tggaaaaatt caatcttaaa
      181 gcaatttgtt
//
LOCUS       MYCRDNAB      190 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  M.hyorhinis A-repeat sequence DNA.
ACCESSION   M35304
KEYWORDS    A-repeat.
SOURCE      M.hyorhinis DNA, clone pG102.3.
  ORGANISM  Mycoplasma hyorhinis
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 190)
  AUTHORS   Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
  TITLE     Reiterated DNA sequences defining genomic diversity within the
            species Mycoplasma hyorhinis
  JOURNAL   Mol. Microbiol. 2, 665-672 (1988)
  STANDARD  simple staff_review
BASE COUNT       87 a     16 c     38 g     49 t
ORIGIN      
        1 gaattcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt
       61 atttttgaag aaaataaagc aagatatgaa aaaaggagaa tcaaagctga acttaataat
      121 agaggctata aaattggact tagatagggt tgagtgttgt tccagtttgg acaagaagtc
      181 cactattaaa
//
LOCUS       MYCRDNAC      191 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  M.hyorhinis A-repeat sequence DNA.
ACCESSION   M35305
KEYWORDS    A-repeat.
SOURCE      M.hyorhinis DNA, clone pG101.
  ORGANISM  Mycoplasma hyorhinis
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 191)
  AUTHORS   Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
  TITLE     Reiterated DNA sequences defining genomic diversity within the
            species Mycoplasma hyorhinis
  JOURNAL   Mol. Microbiol. 2, 665-672 (1988)
  STANDARD  simple staff_review
BASE COUNT       97 a     18 c     29 g     47 t
ORIGIN      
        1 gaactcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt
       61 atttttgaaa gaaaataaag caagatatga aaaaaagaga atcaaagctg aactcaataa
      121 tagaggctat aaaattggac ttaaaaaagt tcacagatta ttgaaaaaat tcaatctcaa
      181 agcaatttgt t
//
LOCUS       MYCRDNAD      191 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  M.hyopneumoniae A-repeat sequence DNA.
ACCESSION   M35306
KEYWORDS    A-repeat.
SOURCE      M.hyopneumoniae DNA, clone pJ125.
  ORGANISM  Mycoplasma hyopneumoniae
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 191)
  AUTHORS   Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
  TITLE     Reiterated DNA sequences defining genomic diversity within the
            species Mycoplasma hyorhinis
  JOURNAL   Mol. Microbiol. 2, 665-672 (1988)
  STANDARD  simple staff_review
BASE COUNT       94 a     17 c     31 g     49 t
ORIGIN      
        1 gaactcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt
       61 atttttgaaa gaaaataaag caagatatgc aaaaaagaga ataaaagctg atcttaataa
      121 tagaggctat aaaattggac ttaaaaaagt tcgcagatta ttggaaaaat tcaatctcaa
      181 agcaatttgt t
//
LOCUS       MYCRDNAE      210 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  M.hyorhinis B-repeat sequence DNA.
ACCESSION   M35307
KEYWORDS    B-repeat.
SOURCE      M.hyorhinis DNA, clones pG102.[1,3].
  ORGANISM  Mycoplasma hyorhinis
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 210)
  AUTHORS   Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A.
  TITLE     Reiterated DNA sequences defining genomic diversity within the
            species Mycoplasma hyorhinis
  JOURNAL   Mol. Microbiol. 2, 665-672 (1988)
  STANDARD  simple staff_review
BASE COUNT       84 a     29 c     28 g     69 t
ORIGIN      
        1 gaattcttta aatttagtag aaatcaaaaa aactcaacaa ggcaactgag ttcgttataa
       61 aaaagtttat caatatgcta aattcgatgc aagaactaaa caatttatct tagttgaaaa
      121 aggcgttcct tttactaata tgattattgc taatcaaaac aatctacatt tgaatatttt
      181 gactgacagg ttctaaagaa tgcagcattt
//
LOCUS       STRLACZ       209 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  S.bovis lactose catabolic protein (lacZ) gene, 5' end.
ACCESSION   M35285
KEYWORDS    catabolic protein.
SOURCE      S.bovis (strain H/3) DNA.
  ORGANISM  Streptococcus bovis
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Streptococcaceae.
REFERENCE   1  (bases 1 to 209)
  AUTHORS   Gilbert,H.J. and Hall,J.
  TITLE     Molecular cloning of Streptococcus bovis lactose catabolic genes
  JOURNAL   J. Gen. Microbiol. 133, 2285-2293 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        184  >   209     lactose catabolic protein (lacZ)
BASE COUNT       59 a     55 c     43 g     52 t
ORIGIN      
        1 tcgattagcc cttggaccct gctagtcttg acctgcctag gtttcccagg tcaagttccc
       61 agttaccgac tacccgtaaa tcgatactac gccattgtta gatcggatct gaacccgtaa
      121 ctttatagtt gggtatcgtg agcagatcac aatatcccac aataaaagga ggataacatc
      181 caaatgatca cggacacagt ggccatcga
//
LOCUS       STYSSCA      1551 bp ds-DNA             BCT       25-JUL-1990
DEFINITION  S.typhimurium Ssc protein (ssc) gene, complete cds.
ACCESSION   M35193
KEYWORDS    Ssc protein.
SOURCE      S.typhimurium (strain SH5014, isolate LT2) DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1551)
  AUTHORS   Hirvas,L., Koski,P. and Vaara,M.
  TITLE     Characterization of a new protein encoding region between ompH and
            lipid A biosynthesis genes of Salmonella typhimurium
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.H.Hirvas, 15-JUN-1990.
            
              Author address: L.H.Hirvas
              University of Helsinki
              Dept of Bacteriology and Immunology
              Haartmaninkatu 3
              00290 Helsinki
              FINLAND
            
FEATURES       from  to/span     description
    pept         19     1044     Ssc protein
BASE COUNT      362 a    354 c    438 g    397 t
ORIGIN      
        1 aaacaggtta aataagtaat gccttcaatt cgactggctg acttagcaga acagttggat
       61 gcagaattac acggtgatgg cgatatcgtc atcaccggcg ttgcgtccat gcaatctgca
      121 acaacaggcc acattacgtt tatggtgaat cctaagtacc gtgaacactt aggtttatgc
      181 caggcttctg cggttgtcat gacgcaggac gatcttcctt ttgctaagag tgcggcgctg
      241 gtagttaaaa atccctacct gacctacgcg cgcatggcgc aaattttaga tactacgccg
      301 cagcccgcgc agaatatcgc gccaagcgcc gtgattgatg cgacggcaac gctgggtagc
      361 aatgtttcag tcggcgcgaa tgcggtgatt gaatctggcg tacaactggg cgataacgtg
      421 gttatcggcg caggctgttt cgtcggaaaa aatagcaaaa tcggggcggg ttcacgcttg
      481 tgggcgaacg taacgattta ccacgacatt cagatcggtg agaattgcct gatccagtcc
      541 agtacggtga tcggcgcgga cggttttggc tacgctaacg atcgtggcaa ctgggtgaag
      601 atcccacaac tgggccgggt cattattggc gatcgtgtcg agatcggcgc ttgtaccacc
      661 attgaccgtg gcgcgttgga tgatactgtt attggcaatg gcgtgattat tgataatcag
      721 tgccagattg cacataacgt cgtgattggc gacaatacgg cagttgccgg tggcgtcatt
      781 atggcgggta gcctgaagat tggccgttac tgcatgattg gcggcgccag cgtgatcaat
      841 gggcatatgg aaatatgcga caaagtcacg gtaactggca tgggtatggt gatgcgtccc
      901 atcacggaac cgggcgtcta ctcctcaggc attccgctgc aacccaacaa agtatggcgt
      961 aaaactgctg cactggtgat gaacattgat gatatgagca agcgtctcaa agcgattgag
     1021 cgcaaggtta atcaacaaga ctaacgttcc gccttgtagt tgccattctt ttccggcctg
     1081 tcacattcat acgattgcgg caggccgtgt tattattgcc tttttgtata tttggacagg
     1141 aagagtattt tgactactaa cactcatact ctgcagattg aagagatttt agagcttctg
     1201 ccgcaccgtt ttccgttttt actggtcgat cgcgtgctgg actttgaaga aggtcgtttt
     1261 ctgcgtgcgg tgaaaaatgt ctccgtcaac gagccgtttt tccaggggca tttcccgggc
     1321 aaaccgattt tgccaggcgt gctgattctg gaagcgatgg cgcaggcaac cggtattctg
     1381 gcgtttaaaa gcgttggtaa actggaacct ggcgaactgt attatttcgc gggtattgat
     1441 gaagcgcgct ttaagcgtcc ggtggtgcca ggcgatcaga tgatcatgga agtcactttc
     1501 gagaaaacgc gccgtggcct gacccgcttt aaaggggttg cgctggtcga c
//
LOCUS       TOBRUBPA      979 bp ds-DNA             PLN       25-JUL-1990
DEFINITION  Tobacco ribulose-1,5-bisphosphate carboxylase small subunit gene,
            exons 1 and 2.
ACCESSION   M32419
KEYWORDS    ribulose-1,5-bisphosphate carboxylase.
SOURCE      Tobacco DNA, clone TSSU3-8.
  ORGANISM  Nicotiana tabacum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 979)
  AUTHORS   O'Neal,J.K., Pokalsky,A.R., Kiehne,K.L. and Shewmaker,C.K.
  TITLE     Isolation of tobacco SSU genes: Characterization of a
            transcription- ally active pseudogene
  JOURNAL   Nucleic Acids Res. 15, 8661-8676 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        584      760     ribulose-1,5-bisphosphate carboxylase small
                                 subunit precursor, exon 1
                854  >   979     ribulose-1,5-bisphosphate carboxylase small
                                 subunit precursor, exon 2
    sigp        584      757     ribulose-1,5-bisphosphate carboxylase small
                                 subunit signal peptide
    matp        758      760     ribulose-1,5-bisphosphate carboxylase small
                                 subunit
                854  >   979     ribulose-1,5-bisphosphate carboxylase small
                                 subunit
    IVS         761      853     ribulose-1,5-bisphosphate carboxylase small
                                 subunit intron A
BASE COUNT      320 a    175 c    185 g    299 t
ORIGIN      
        1 ttaattatgt ctttgtttgc ttctcatgtg ataaagaatc gaagccttga tgaacataat
       61 ttgcatttga gtagtgaata gctgctttca caaagagtac tctagctatt aagtttagtt
      121 tgaatatttt gaaacacaaa aatatatgta tacatacaaa aacaaatacc gcaatagtcc
      181 aagcaaaagg gactttaaaa aaaaaaacca acctcaatta cacattcata tcctcttcct
      241 accccatcta ggatgagata agattactga ggttgtttac acgtggcacc tccattgtgg
      301 tgaattaaat gatcaatggc ttagctcaaa atataatttt ccaacctttc atgtgtggat
      361 attaagtttt gtgtagtgaa tcaagaacca cataatccaa tggttagctt tactccaaga
      421 tgagggggtt gttgattttt gtccgttaga tatgggaaat atgtaaaacc ttatcattat
      481 atatagagtg gtgggcaact atgcaatgac catcttggaa gtttaaagga aaaaaaagga
      541 aagggagaaa gagaaatctt tctgtcttaa agtgtaatta acaatggctt cctcagttct
      601 ttcctctgca gcagttgcca cccgcagcaa tgttgctcaa gctaacatgg ttgcaccttt
      661 cactggcctt aagtcagctg cctcattccc tgtttcaagg aagcaaaacc ttgacatcac
      721 ttccattgcc agcaacggcg gaagagtgca atgcatgcag gtaatttata tacaatgaca
      781 gtgcaaaaaa ttttgataca attaatgcat cttaacatgt catagctaaa aattctattt
      841 tggtggaata taggtgtggc caccaattaa caagaagaag tacgagactc tctcatacct
      901 tcctgatttg agccaggagc aattgcttag tgaagttgag taccttttga aaaatggatg
      961 ggttccttgc ttggaattc
//
LOCUS       TOBRUBPB     1337 bp ds-DNA             PLN       25-JUL-1990
DEFINITION  Tobacco ribulose-1,5-bisphosphate carboxylase small subunit
            pseudogene, complete cds.
ACCESSION   M32420
KEYWORDS    pseudogene; ribulose-1,5-bisphosphate carboxylase.
SOURCE      Tobacco DNA, clone TSSU3-2.
  ORGANISM  Nicotiana tabacum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 1337)
  AUTHORS   O'Neal,J.K., Pokalsky,A.R., Kiehne,K.L. and Shewmaker,C.K.
  TITLE     Isolation of tobacco SSU genes: Characterization of a
            transcription- ally active pseudogene
  JOURNAL   Nucleic Acids Res. 15, 8661-8676 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps     272      451     ribulose-1,5-bisphosphate carboxylase small
                                 subunit, exon 1
                620      754     ribulose-1,5-bisphosphate carboxylase small
                                 subunit, exon 2
                912     1100     ribulose-1,5-bisphosphate carboxylase small
                                 subunit, exon 3
    IVS         452      619     ribulose-1,5-bisphosphate carboxylase small
                                 subunit intron A
    IVS         755      911     ribulose-1,5-bisphosphate carboxylase small
                                 subunit intron B
BASE COUNT      379 a    249 c    256 g    453 t
ORIGIN      
        1 gttttagaaa atatttccca ttcacaaatt aagtttggga actttgagat aaggacgact
       61 gagtgtaatc aatgtcaggg gttcaaattt atgtgcccgt caatttttca atccacggct
      121 acgattcctc taagatgagg tcattgcttg cttgtgtccg ttagatgaga aaaagacgtg
      181 aaaccttatc actatatata gcactcatca cacccttgaa agcaaaggtc aagggaagca
      241 atagctttaa gctaaacaat tactttcaac aatggcttcg tctgtgattt cctcagccgc
      301 tgccgttgcc accggcgcta atgcggctca agccagtatg gttgcacctt tcactggcct
      361 caaatccgcc tactccttcc ctgtttccag aaaacaaaac cttgacatta cttccattgc
      421 tagcaatggt ggaagagttt aatgcatgca ggtttgtagc atatattatt gtagttagct
      481 tatataaact gatagagtaa agaaatttta cgttatatat tgatatattt taacctggta
      541 atttgattta tttttcatat tattaatccc acttttttat tgtacttatg aagtttattt
      601 taattcttta tatatatagg tgtggccacc aattaacaag aagaagtacg agacactctc
      661 ataccttcct gatttgagcg aggagcaatt gcttagggaa gttgaatacc ttttgaaaaa
      721 tggatgggtt ccttgcttgg aattcgagac tgaggtcaaa catctattct aaatcatgct
      781 actattatca agcataacta acatgaataa ctcaatccta actagtttgg gattagacat
      841 atatagttga ttaagtgaaa gaggagtatt atctcatgtt aatgttttgt ttatcttgtg
      901 gatatgcgca gcacggattc gtctaccgtg agaataacaa gtcaccaggt tactacgatg
      961 gaagggccac tcaggtcttg gctgaggtcg aggaggcaaa gaaggcttac ccacaagcct
     1021 ggatcagaat cattggattc gacaacgtcc gtcaagtgca atgcatcagt ttcatcgcct
     1081 acaagcccgc aggctactaa aatctccatt tttaagacaa cttaccgtat gtattcaggg
     1141 gaagtttgtt tgaattctcc ttgtgttttt ccccggagaa actgttttgg ttttcctttg
     1201 ttttaattcc ttctttctat tcggtgtata tttttgaatt ccaatcaagt ttatgagaac
     1261 taataatgtc atttgtttct ttcgtaattt gctttgtggt gtacatcggt tttaattatc
     1321 cgagtaatat ctgcttt
//
LOCUS       ZYMCPA       1374 bp ss-RNA             VRL       25-JUL-1990
DEFINITION  Zucchini yellow mosaic virus coat protein (cp) mRNA, 3' end.
ACCESSION   M35095
KEYWORDS    capsid protein; coat protein.
SOURCE      Zucchini yellow mosaic virus, cDNA to viral RNA, clone ZYKS-22cp.
  ORGANISM  Zucchini yellow mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; 
            Potyvirus.
REFERENCE   1  (bases 1 to 1374)
  AUTHORS   Gal-On,A., Antignus,Y., Rosner,A. and Raccah,B.
  TITLE     Nucleotide sequence of the zucchini yellow mosaic virus capsid-
            encoding gene and its expression in Escherichia coli
  JOURNAL   Gene 87, 273-277 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1     1164     coat protein (cp) (AA at 1)
    mRNA    <     1     1374     cp mRNA
BASE COUNT      423 a    279 c    342 g    330 t
ORIGIN      
        1 tcgacgaagg agagattgtt tccaatttta gagtgggata gaagcaaaga aattatgcac
       61 cgaacagagg ctatttgcgc tgcgatgatt gaggcatggg gacacaccga gcttttacaa
      121 gagatcagaa agttttatct atggttcgtt gaaaaggaag aagtgcgaga attagccgcc
      181 ctcggaaaag ctccatacat agctgagaca gcacttcgta agctatacac tgacaaggga
      241 gcggatacaa gtgaactggc acgttatcta caagccctcc accaagacat cttctttgaa
      301 caaggagaca ctgtaatgct ccaatcaggc actcagccaa ctgtggcaga cactggagcc
      361 acaaagaaag acaaagaaga tgacaaaggg aaaaacaagg atgttacagg ctccggctca
      421 agtgagaaaa cagtggcagc tgtcacgaag gacaaggatg taaatgctgg ttctcatggg
      481 aaaattgtgc cgcgtctttc gaagataaca aagaagatgt cactgccacg cgtgaaagga
      541 aatgtgatac tcgacattga tcacttgctg gagtataagc cggatcaaat tgagttatac
      601 aacacacgag cgtctcatca gcaattcgcc tcttggttca accaagttaa aacagaatat
      661 gatctgaatg agcaacagat gggagttgta atgaatggtt tcatggtttg gtgcatcgaa
      721 aatggcacgt cacccgacat taacggagta tgggttatga tggacggtaa tgagcaggtt
      781 gaatatcctt tgaaaccaat agttgaaaat gcaaagccaa cgctgcgaca aataatgcat
      841 cacttttcag atgcagcgga ggcatatata gagatgagaa atgcagaggc accatacatg
      901 ccgaggtatg gtttgcttcg aaacttacgg gataggagtt tggcacgata tgctttcgac
      961 ttctacgaag tcaattccaa aactccggaa agagcccgcg aagctgttgc gcagatgaaa
     1021 gcagcagccc ttagcaatgt ttcttcaagg ttgtttggcc ttgatggaaa tgttgccacc
     1081 actagcgaag acactgaacg gcacactgca cgtgatgtta ataggaacat gcacaccttg
     1141 ctaggtgtga atacaatgca gtaaagggta ggtcgcctac ctaggttatc gtttcgctcc
     1201 gacgtaattc taatatttac cgctttatgt gatgtcttta catttctaga gtgggcctcc
     1261 cacctttaaa gcgtaaagtt tatgttagtt gtccaggagt gccgtagtcc tgtcggaagc
     1321 tttagtgtga gcctctcacg aataagctcg agattagact ccgtttgcaa gcct
//

GenBank-Updates@genbank.bio.net (07/26/90)

LOCUS       ATUNPSS      6425 bp ds-DNA             SYN       26-JUL-1990
DEFINITION  A.tumefaciens neomycin phosphotransferase and
            streptomycin/spectinomycin adenyltransferase, complete cds.
ACCESSION   M35007
KEYWORDS    neomycin phosphotransferase;
            streptomycin/spectinomycin adenyltransferase.
SOURCE      N.tabacum T-DNA inserts in A.tumefaciens DNA.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6425)
  AUTHORS   Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and
            Depicker,A.
  TITLE     Cloning and sequence analysis of truncated T-DNA inserts from
            Nicotiana tabacum
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.R.Gheysen, 01-JUN-1990.
FEATURES       from  to/span     description
    pept       2782     1985 (c) neomycin phosphotransferase
    pept       4983     4042 (c) streptomycin/spectinomycin adenyltransferase
    recomb       24       25     T-DNA end/plant DNA start
    recomb       40       41     plant DNA end/T-DNA start
    recomb     1094     1095     plant DNA end/T-DNA start
    recomb     2786     2787     T-DNA end/plant DNA start
    recomb     3044     3045     T-DNA end/plant DNA start
    recomb     3354     3355     plant DNA end/T-DNA start
    recomb     5585     5586     T-DNA end/plant DNA start
    recomb     6389     6390     T-DNA end/plant DNA start
    site          1       24     left T-DNA border
    site        372      395     24 bp border-like sequence
    site       1569     1592     24 bp border-like sequence
    site       1669     1692     24 bp border-like sequence
    site       1779     1756 (c) 24 bp border-like sequence
    site       2128     2105 (c) 24 bp border-like sequence
    site       2449     2472     24 bp border-like sequence
    site       2485     2462 (c) 24 bp border-like sequence
    site       3660     3683     24 bp border-like sequence
    site       3875     3898     24 bp border-like sequence
    site       4359     4336 (c) 24 bp border-like sequence
    site       5868     5891     24 bp border-like sequence
BASE COUNT     1509 a   1754 c   1710 g   1452 t
ORIGIN      
        1 cggcaggata tattcaattg taaatggctt catgtccggg aaatctacat ggatcagcaa
       61 tgagtatgat ggtcaatatg gagaaaaaga aagagtaatt accaattttt tttcaattca
      121 aaaatgtaga tgtccgcagc gttattataa aatgaaagta cattttgata aaacgacaaa
      181 ttacgatccg tcgtatttat aggcgaaagc aataaacaaa ttattctaat tcggaaatct
      241 ttatttcgac gtgtctacat tcacgtccaa atgggggctt agatgagaaa cttcacgatc
      301 gatgccttga tttcgccatt cccagatacc catttcatct tcagattggt ctgagattat
      361 gcgaaaatat acactcatat acataaatac tgacagtttg agctaccaat tcagtgtagc
      421 ccattacctc acataattca ctcaaatgct aggcagtctg tcaactcggc gtcaatttgt
      481 cggccactat acgatagttg cgcaaatttt caaagtcctg gcctaacatc acacctctgt
      541 cggcggcggg tcccatttgt gataaatcca ccatcacaat agatagtcta atggacgaaa
      601 aaggcgaata tttcgatgct gagattcgac gcaattaatt cgagaaaaat cccgtgattg
      661 atgctgttga gttaccaata atatgggcag cgaaggccat ttaattataa gatcctgcaa
      721 gcctcgtcgt cctggccgga ccacgctatc tgtgcaaggt ccccggcccc ggacgcgcgc
      781 tccatgagca gagcgcccgc cgccgaggcg aagagtcggg cggcgccctg cccgtcccac
      841 caggtcaaca ggcggtaacc ggcctcttca tcgggaatgc gcgcgacctt cagcatcgcc
      901 ggcatgtccc cctggcggac gggaagtatc cagctcgacc aaagcggcca tcgtgcctcc
      961 ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc
     1021 gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa
     1081 ccaactcgcg aggggatcga gcccctgctg agcctcgaca tgttgtcgca aaattcgccc
     1141 tggacccgcc caacgatttg tcgtcactgt caaggtttga cctgcacttc atttggggcc
     1201 cacatacacc aaaaaaatgc tgcataattc tcggggcagc aagtcggtta cccggccgcc
     1261 gtgctggacc gggttgaatg gtgcccgtaa ctttcggtag agcggacggc caatactcaa
     1321 cttcaaggaa tctcacccat gcgcgccggc ggggaaccgg agttcccttc agtgaacgtt
     1381 attagttcgc cgctcggtgt gtcgtagata ctagcccctg gggccttttg aaatttgaat
     1441 aagatttatg taatcagtct tttaggtttg accggttctg ccgctttttt taaaattgga
     1501 tttgtaataa taaaacgcaa ttgtttgtta ttgtggcgct ctatcataga tgtcgctata
     1561 aacctattca gcacaatata ttgttttcat tttaatattg tacatataag tagtagggta
     1621 caatcagtaa attgaacgga gaatattatt cataaaaata cgatagtaac gggtgatata
     1681 ttcattagaa tgaaccgaaa ccggcggtaa ggatctgagc tacacatgct caggtttttt
     1741 acaacgtgca caacagaatt gaaagcaaat atcatgcgat cataggcgtc tcgcatatct
     1801 cattaaagca gggggtgggc gaagaactcc agcatgagat ccccgcgctg gaggatcatc
     1861 cagccggcgt cccggaaaac gattccgaag cccaaccttt catagaaggc ggcggtggaa
     1921 tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc
     1981 ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg
     2041 cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat
     2101 cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg ccacagtcga
     2161 tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca tcgccatggg
     2221 tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac agttcggctg
     2281 gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg gcttccatcc
     2341 gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat
     2401 caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg gcaggagcaa
     2461 ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag tcccttcccg
     2521 cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc agccacgata
     2581 gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc ttgacaaaaa
     2641 gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct
     2701 gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa cctgcgtgca
     2761 atccatcttg ttcaatccac atgatcagat ctctaggcgc gtgggtgcgg acgtagtcag
     2821 cgccattgcc gatcgcgtga agttccgccg caaggccgct ggacccagat cctttacagg
     2881 aaggccaacg gtggcgccca agaaggattt ccgcgacacc gagaccaata gcggaagccc
     2941 caacgccgac ttcagctttt gaaggttcga cagcacgtgc agcgatgttt ccggtgcggg
     3001 gctcaagaaa aatcccatcc ccggatcgag gatgagccgg tcggcagcga ccccgctccg
     3061 tcgcaaggcg gaaacccgcg cctcgaagaa ccgcacaatc tcgtcgagcg cgtcttcggg
     3121 tcgaaggtga ccggtgcggg tggcgatgcc atcccctgcg ctgagtgcat aaccaccagc
     3181 ctgcagtccg cctcagcaat atcgggatag agcgcagggt caggaaatcc ttggatatcg
     3241 ttcaggtagc ccacgccgcg cttgagcgct agcgcgggtt tccggttgga agctgtcgat
     3301 tgaaacacgg tgcatctgat cggacagggc gtctaagagc ggcgcaatac gtctgatctc
     3361 atcggccggc gatacaggcc tcgcgtccgg atggctggcg gccggtccga catccacgac
     3421 gtctgatccg actcgcagca tttcgaccgc cgcggtgaca gcgttggtgg ggtctagcag
     3481 tacgtcaatc gaagaaggag tcctcggtga gattcagaat gccgaacacc gtcaccatgg
     3541 cgtcggcctc cgcagcgact tccacgatgg ggatcgggcg agcaaaaagg cagcaattat
     3601 gagccccata cctacaaagc cccacgcatc aagcttttga ccctgaagca actaggcaat
     3661 ggctgtaatt atgacgacgc cgagtcccga accagactgc ataagcaaca accgacagaa
     3721 tggatttcga aaccagagaa agaaaataaa tgcgatgcca taaccgatta tgaacaacgg
     3781 cggaaggggc aagcttagta aatgcctcgc tagattttaa tgcggatgtt gcgattactt
     3841 cgccaactat tgcgataaca agaaaaagcc agcctttcat gatatatctc ccaatttgtg
     3901 tagggcttat tatgcacgct taaaaataat aaaagcagac ttgacctgat agtttggctg
     3961 tgagcaatta tgtgcttagt gcatctaatc gcttgagtta acgccggcga agcggcgtcg
     4021 gcttgaacga attgttagac attatttgcc gactaccttg gtgatctcgc ctttcacgta
     4081 gtggacaaat tcttccaact gatctgcgcg cgaggccaag cgatcttctt cttgtccaag
     4141 ataagcctgt ctagcttcaa gtatgacggg ctgatactgg gccggcaggc gctccattgc
     4201 ccagtcggca gcgacatcct tcggcgcgat tttgccggtt actgcgctgt accaaatgcg
     4261 ggacaacgta agcactacat ttcgctcatc gccagcccag tcgggcggcg agttccatag
     4321 cgttaaggtt tcatttagcg cctcaaatag atcctgttca ggaaccggat caaagagttc
     4381 ctccgccgct ggacctacca aggcaacgct atgttctctt gcttttgtca gcaagatagc
     4441 cagatcaatg tcgatcgtgg ctggctcgaa gatacctgca agaatgtcat tgcgctgcca
     4501 ttctccaaat tgcagttcgc gcttagctgg ataacgccac ggaatgatgt cgtcgtgcac
     4561 aacaatggtg acttctacag cgcggagaat ctcgctctct ccaggggaag ccgaagtttc
     4621 caaaaggtcg ttgatcaaag ctcgccgcgt tgtttcatca agccttacgg tcaccgtaac
     4681 cagcaaatca atatcactgt gtggcttcag gccgccatcc actgcggagc cgtacaaatg
     4741 tacggccagc aacgtcggtt cgagatggcg ctcgatgacg ccaactacct ctgatagttg
     4801 agtcgatact tcggcgatca ccgcttccct catgatgttt aactttgttt tagggcgact
     4861 gccctgctgc gtaacatcgt tgctgctcca taacatcaaa catcgaccca cggcgtaacg
     4921 cgcttgctgc ttggatgccc gaggcataga ctgtacccca aaaaaacagt cataacaagc
     4981 catgaaaacc gccactgcgc cgttaccacc gctgcgttcg gtcaaggttc tggaccagtt
     5041 gcgtgaggcc atacgctact tgcattacag cttacgaacc gaacaggctt atgtccactg
     5101 ggttcgtgcc ttcatccgtt tccacggtgt gcgtcacccg gcaaccttgg gcagcagcga
     5161 agtcgaggca tttctgtcct ggctggcgaa cgagcgcaag gtttcggtct ccacgcatcg
     5221 tcaggcattg gcggccttgc tgttcttcta cggcaagtgc tgtgcacgga tctgccctgg
     5281 cttcaggaga tcggaagacc tcggccgtcc gggcgcttgc cggtggtgct gaccccggat
     5341 gaagtggttc gcatcctcgg ttttctggaa ggcgagcatc gtttgttcgc ccagcttctg
     5401 tatggaacgg gcatgcggat cagtgagggt ttgcaactgc gggtcaagga ctggatttcg
     5461 atcacggcac gatcatcgtg cgggagggca agggctccaa ggatcgggcc ttgatgttac
     5521 ccgagagctt ggcacccagc ctgcgcgagc agctgtctcg tgcacgggca tggtggctga
     5581 aggactaggc cgagggccgc agcggcgttg cgcttcccga cgcccttgag cggaagtatc
     5641 cgcgcgccgg gcattcctgg ccgtggttct gggtttttgc gcagcacacg cattcgaccg
     5701 atccacggag cggtgtcgtg cgtcgccatc acatgtatga ccagaccttt cagcgcgcct
     5761 tcaaacgtgc cgtagaacaa gcaggcatca cgaagcccgc cacaccgcac accctccgcc
     5821 actcgttcgc gacggccttg ctccgcagcg gttacgacat tcgaaccgtg caggatctgc
     5881 tcggccattc cgacgtctct acgacgatga tttacacgca tgtgctgaaa gttggcggtg
     5941 ccggagtgcg ctcaccgctt gatgcctgcc gcccctcact gtgagaggta gggcagcgca
     6001 agtcaatcct agcggattca ctacccctgc gcgaaggcca tcggtgccgc atcgaacggc
     6061 cggttgcgga aagtcctccc tgcgtccgct gatggccggc agcagcccgt cgttgaagga
     6121 tccctgaaag cgacgttgga tgttaacatc tacaaattgc cttttcttac gaccatgtac
     6181 gtaagcgctt acgtttttgg tggacccttg aggaaactgg tagctgttgt gggcctgtgg
     6241 tctcaagatg gatcattaat ttccaccttc acctacgatg gggggcatcg caccggtgag
     6301 taatattgta cggctaagag cgaatttggc ctgtagacct caattgcgag ctttctaatt
     6361 tcaaactatt cgggcctaac ttttggtgtg atgatgctga ctggcaggat atataccgtt
     6421 gtaat
//
LOCUS       TOBNPTII      200 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  N.tabacum nptII gene, complete cds.
ACCESSION   M34757
KEYWORDS    nptII protein.
SOURCE      N.tabacum (strain SR1) DNA.
  ORGANISM  Nicotiana tabacum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 200)
  AUTHORS   Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and
            Depicker,A.
  TITLE     Cloning and sequence analysis of truncated T-DNA inserts from
            Nicotiana tabacum
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.R.Gheysen, 01-JUN-1990.
FEATURES       from  to/span     description
    pept        198  >   200     nptII protein
    mRNA        133  >   200     nptII protein mRNA (5' end +/- 2 bp)
    recomb      193      194     T-DNA end/plant DNA start
    signal       25       30     CAAT box
    signal       47       55     CAAT box
    signal      101      107     TATA box
BASE COUNT       64 a     53 c     32 g     51 t
ORIGIN      
        1 caagcctcgc tagtcaaaag tgtaccaaac aacgctttac agcaagaacg gaaatgcgcg
       61 tgacgctcgc ggtgacgcca tttcgccttt tcagaaatgg ataaatagcc ttgcttccta
      121 ttatatcttc ccaaattacc aatacattac actagcatct gaatttcata accaatctcg
      181 atacaccaaa tcggatcatg
//
LOCUS       BOVANDRE     2461 bp ss-mRNA            MAM       26-JUL-1990
DEFINITION  Cow alpha-1C-adrenergic receptor mRNA, complete cds.
ACCESSION   J05426
KEYWORDS    alpha-1C-adrenergic receptor.
SOURCE      Cow adult brain cortex, cDNA to mRNA, clone B12.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 2461)
  AUTHORS   Schwinn,D.A., Lomasney,J.W., Lorenz,W., Szklut,P.J.,
            Fremeau,R.T.Jr., Yang-Feng,T.L., Caron,M.G., Lefkowitz,R.J. and
            Cotecchia,S.
  TITLE     Molecular cloning and expression of the cDNA for a novel
            alpha-1-adrenergic receptor subtype
  JOURNAL   J. Biol. Chem. 265, 8183-8189 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by D.Schwinn, 29-MAY-1990.
FEATURES       from  to/span     description
    pept         97     1497     alpha-1C-adrenergic receptor
BASE COUNT      551 a    667 c    647 g    596 t
ORIGIN      
        1 tgactccccg ctccctcgct cccctcctcc tcacccgccg aggggtggcc ctcaagagcc
       61 ggactttgcc ggccccggcc ccggggggct gggaccatgg tgtttctctc cggaaatgcc
      121 tccgacagct ccaactgcac ccacccgccg ccaccggtga acatttccaa ggccattctg
      181 ctcggggtga tcttgggggg cctcatcctt ttcggggtac tggggaacat cctcgtgatc
      241 ctttccgtgg cctgccaccg gcacctgcac tcggtcacac actactacat cgtcaacctg
      301 gcggtggccg accttctcct cacttccacg gtgctgccct tctccgctat cttcgagatc
      361 ttgggctact gggccttcgg cagggtcttc tgcaatgtct gggcggcggt ggacgtcctg
      421 tgctgcacgg cttccatcat gggactctgc atcatctcca tcgaccgcta catcggcgtg
      481 agctatcctc tgcgctaccc caccatcgtc acccagaaga ggggcctcat ggccctgctc
      541 tgcgtctggg cgctctcttt ggtcatctcc atcgggcccc tcttcggctg gaggcagccg
      601 gccccggagg acgagaccat ctgccagatc aacgaggagc cgggctacgt gctcttctcg
      661 gctctgggct ccttctacgt gccgctgacc atcatcctgg tcatgtactg ccgggtctac
      721 gtcgtggcca agagggagag ccggggcctc aagtcgggcc ttaagaccga caagtcagac
      781 tcggagcagg tgacgctccg catccatcgc aaaaacgccc aggtaggagg cagcggggtg
      841 accagcgcca agaacaagac gcacttctcc gtgagactgc tcaaattttc ccgcgagaag
      901 aaagcggcca aaacgctggg catcgtggtc ggctgcttcg tcctctgctg gctgcctttt
      961 ttcttagtga tgcccattgg gtctttcttt cctgatttca ggccctcaga aaccgttttt
     1021 aaaatagcat tttggctcgg ttacctaaac agctgcatca accccattat atacccatgc
     1081 tccagtcaag agtttaaaaa ggcctttcag aatgtcttga gaatccagtg tctgcgacga
     1141 aagcagtcct ccaaacacac cctgggctac acgctgcacg cacccagcca cgtcctggag
     1201 ggacagcaca aggacctggt tcgcattccg gtgggatctg cagagacctt ctataagatc
     1261 tccaagacgg atggggtctg tgaatggaaa attttctctt ccctaccccg cggatctgcc
     1321 aggatggcgg tggccagaga cccatcagcc tgcaccactg cccgggtgag aagtaaaagc
     1381 tttttgcaag tgtgctgttg cctggggccc tcgaccccca gtcatggaga gaatcatcag
     1441 attccgacca ttaagatcca caccatctcc ctcagtgaaa atggggagga agtctaaagg
     1501 acaggaaagg tcagaaggat gggagggtga tcttaggtac ccactctcca cttccttctg
     1561 ggaaggccag ttcacgttcc gtggatgctg agacacagcc agtaaaccag ggaccatctg
     1621 ggaatgggct ggggaggaga gctgactctg gggcagaggt agggcttaga gacgagagag
     1681 gatgtcctac caccatccag ttcactatga tgagaaacag catttccttg aggctaatgc
     1741 tctctgggtc attctctgag cctgctttct acgcctgtcc ctttcaacga caaacaccat
     1801 gggaaacaga atttcataca caatccaaaa gacgataaat ataggattat gatttcatca
     1861 tgaatatttt gagcatgcac tctaagtttg gagctatttc ttgatggagt gaggggattt
     1921 tatttccagg ctaaacttgc tgaaagccac gttggatttt tatggagaga aggcctggag
     1981 aggaagagcc ttaagatggt ggccaatatc cagacgcatt atttttagag caagttttac
     2041 agtccaccct ttctcagttt gggtgaaact tgacagtgag attttattta ccttttgctg
     2101 ctgcttgaca ggatactgct cccaattccc taaggatgag ggtgaggggt actcattatg
     2161 ccaatggtca tctgcacttg ggtatagaga gtgttgaaag aaccagttgg gaaaaggatg
     2221 gcttttcctg gtggaagaca gtaaggatga gagtcagttc ttcaaattct atggacagaa
     2281 ttccattaag tggttccaag atcaggtgga ggaaggcttc ttgtgtaaca tatttaaaga
     2341 tcaagagttt ggggtggggt gggtgctact ttcaagctaa gatagaggct gcaaaattac
     2401 tccacagcct tttcaacatg gcatagaaag gcttttcttg gcaaatcact taccttttcc
     2461 a
//
LOCUS       CHKANCC2A    1229 bp ss-mRNA            VRT       26-JUL-1990
DEFINITION  Chicken anchorin CII mRNA, 3' end.
ACCESSION   M30971 J03194
KEYWORDS    anchorin; collagen-binding protein.
SOURCE      Chicken cartilage (sternum) and bone, cDNA to mRNA, clones
            A[1,4,6,7,14,15,22,23].
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 1229)
  AUTHORS   Fernandez,M.P., Selmin,O., Martin,G.R., Yamada,Y., Pfaeffle,M.,
            Deutzmann,R., Mollenhauer,J. and von der Mark,K.
  TITLE     The structure of anchorin CII, a collagen binding protein isolated
            from chondrocyte membrane
  JOURNAL   J. Biol. Chem. 263, 5921-5925 (1988)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 373 to 504)
  AUTHORS   Fernandez,M.P., Selmin,O., Martin,G.R., Yamada,Y., Pfaeffle,M.,
            Deutzmann,R., Mollenhauer,J. and von der Mark,K.
  TITLE     The structure of anchorin CII, a collagen binding protein isolated
            from chondrocyte membrane
  JOURNAL   J. Biol. Chem. 265, 8344-8344 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         19      984     anchorin CII
BASE COUNT      353 a    230 c    310 g    336 t
ORIGIN      28 bp upstream of AccI site.
        1 cccggcgaac cggggaagat ggcgaagtat acaagaggca ccgtgacagc attctctcct
       61 tttgatgcca gagctgatgc agaagccctt cgcaaggcca tgaagggaat ggggactgat
      121 gaagagacaa ttctgaagat ccttaccagc agaaataatg ctcaacgtca agaaattgca
      181 tctgctttta aaacactgtt tggcagggat cttgtggatg acctgaaatc agaacttact
      241 ggcaagtttg aaacactgat ggtatctttg atgagaccag cacgtatttt tgatgcgcat
      301 gcactgaagc atgcaatcaa gggagcagga accaatgaga aagtgttgac tgaaattctt
      361 gcctccagaa cacctgctga agtgcagaat attaaacagg tttatatgca agagtatgag
      421 gccaacttgg aggataagat cacaggagag acatcaggcc attttcagag actgctggtg
      481 gtcctgctgc aggcaaatag agatcctgat ggcagagttg acgaggctct tgttgagaag
      541 gatgctcagg tcttgtttag agctggggag ctaaaatggg gaacagatga agaaacattc
      601 atcaccatct tgggaactcg aagtgtttct catttgagga gggtgtttga caaatacatg
      661 actatttctg gctttcaaat tgaagaaacc attgaccgtg aaacctctgg tgatttggag
      721 aagttgcttt tggcagttgt gaagtgcatc cgaagtgtgc ctgcttattt tgctgaaact
      781 ttgtattatt ctatgaaagg ggctggcact gatgatgata ccctgatcag agtcatggtt
      841 tcaagaagtg aaatcgacct gttggatatt agacatgaat tcagaaagaa ttttgcgaaa
      901 tcgttgtatc agatgattca gaaagataca tctggggact acaggaaggc actcctgctc
      961 ctctgtggtg gagatgatga gtaatggtgg cagcgacgtg aaggatttct tgtaatccag
     1021 ctttgcagcc cttcagttag catgcctagc taagattttg catcttaatg ctttatggct
     1081 gttcgaattt atattcatat cacacttatt aaacacaaac atgttactac tagctgataa
     1141 acagtccctc ctcctcagac gtcctgactc tgggaatttc agtgccttct gagtgtatgc
     1201 aaagtctctc atggagtaga gtagtatcg
//
LOCUS       ECOHLYCA      633 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  E.coli hly plasmid hemolysin (hlyC) gene, complete cds.
ACCESSION   M35668
KEYWORDS    hemolysin.
SOURCE      E.coli hly plasmid pHly152 DNA, clone pANN202-419.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 633)
  AUTHORS   Goebel,W., Hacker,J., Knapp,S., Then,I., Wagner,W., Hughes,C. and
            Juarez,A.
  TITLE     Structure, function, and regulation of the plasmid-encoded
            hemolysin determinant of Escherichia coli
  JOURNAL   Basic Life Sci. 30, 791-805 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        121      633     hemolysin (hlyC)
BASE COUNT      219 a     99 c    115 g    200 t
ORIGIN      
        1 tagtcacgca ataaaacgtt ctttaatatt aatgcagtta tgacattaaa ggcaagaaac
       61 ataaaggcat atttttgcca caatatttaa tcatataatt taagttgtag tgagtttatt
      121 atgaatataa acaaaccatt agagattctt gggcatgtat cctggctatg ggccagttct
      181 ccactacaca gaaactggcc agtatctttg tttgcaataa atgtattacc cgcaatacag
      241 gctaaccaat atgttttatt aacccgggat gattaccctg tcgcgtattg tagttgggct
      301 aatttaagtt tagaaaatga aattaaatat cttaatgatg ttacctcatt agttgcagaa
      361 gactggactt caggtgatcg taaatggttc attgactgga ttgctccttt cggggataac
      421 ggtgccctgt acaaatatat gcgaaaaaaa ttccctgatg aactattcag agccatcagg
      481 gtggatccca aaactcatgt tggtaaagta tcagaatttc atggaggtaa aattgataaa
      541 cagttagcga ataaaatttt taaacaatat caccacgagt taataactga agtaaaaaga
      601 aagtcagatt ttaatttttc attaactggt taa
//
LOCUS       ECOTRMX4       77 bp ss-tRNA            RNA       26-JUL-1990
DEFINITION  E.coli f-Met-tRNA.
ACCESSION   M35184
KEYWORDS    transfer RNA-f-Met.
SOURCE      E.coli tRNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 77)
  AUTHORS   Dube,S.K., Marcker,K.A., Clark,B.F.C. and Cory,S.
  TITLE     Nucleotide sequence of N-formyl-methionyl-transfer RNA
  JOURNAL   Nature 218, 232-233 (1968)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       77     f-Met-tRNA
    anticdn      35       37     f-Met-tRNA anticodon cat
    modified     21       21     d
    modified     33       33     2'Ome
    modified     47       47     m7g
    modified     56       56     p
BASE COUNT       14 a     26 c     25 g     12 t
ORIGIN      
        1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggtcg tcggttcaaa
       61 tccggccccc gcaacca
//
LOCUS       HECDA8       1435 bp ss-rRNA            BCT       26-JUL-1990
DEFINITION  H.mustelae 16S ribosomal RNA.
ACCESSION   M35048
KEYWORDS    16S ribosomal RNA.
SOURCE      H.mustelae (strain ATCC 43772) ribosomal RNA.
  ORGANISM  Helicobacter mustelae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 9 to 1435)
  AUTHORS   Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and
            Ferrero,R.
  TITLE     The phylogeny of Helicobacter felis sp. nov., a spiral-shaped
            bacterium isolated from the gastric mucosa of the cat, Helicobacter
            mustelae, and related bacteria
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1435)
  AUTHORS   Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and
            Ferrero,R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by B.J.Paster, 06-JUN-1990.
            
               Author address:B.J.Paster
               Forsyth Dental Center
               140 Fenway
               Boston, MA 02115
FEATURES       from  to/span     description
    rRNA          1  >  1435     16S ribosomal RNA
BASE COUNT      375 a    301 c    412 g    326 t     21 others
ORIGIN      
        1 attatggaga gtttnatcct ggctcagagt gaacgctggc ggcgtgccta atacatgcaa
       61 gtcgaacgat gaagcttcta gcttgctaga agtggattag tggcgcacgg gtgagtaacg
      121 cataggttat gtgccccata gtctgggata gccactggaa acggtgatta atactggata
      181 ctcctacggg ggnaaagntn ttcgctatgg gatcagccta tgtcctatca gcttgttggt
      241 gaggtaatgg ctcacnnagg ctatgacggg tatccggcct nagagggtga tcggacacac
      301 tggaactgag acacggtcca gactcctacg ggaggcagca gtagggaata ttgctcaatg
      361 ggcgaaagcc tgaagcagca acgccgcgtg gaggatgaag gttttaggat tgtaaactcc
      421 ttttctaaga gaagataatg acggtatctt aggaataagc accggcnnac tccgtgccag
      481 cagccgcggn antacggagg gtgcnagcgt tactcggaat cactgggcgt naagagcgcg
      541 taggcggagt aataagtcag atgtgaaatc ctgtagctta actacagaac tgcatttgaa
      601 actgttattc tagagtgtgg gagaggtagg tggaattctt ggtgtagggg tnaaatccgt
      661 agagatcaag aggaatactc attgcgaagg cgacctactg gaacattact gacgctgatg
      721 cgcgaaagcg tggggagcaa acaggattag ataccctggt agtccacgcc ctaaacgatg
      781 aatgctagtt gttggggtgc ttgtcactcc agtaatgcag ttaacacatt aagcattccg
      841 cctggggagt acggtcgcaa gattaaaact caaaggaata gacggggacc cgcacaagcg
      901 gtggagcatg tggtttaatt cgannntacg cgaagaacct tacctaggct tgacattgat
      961 agaatctgct agaaatagcg gagtgtctag tttactagac cttgaaaaca ggtgctgcac
     1021 ggctgtcgtc agctcgtgtc gtgagatgtt gggttaagtc ccgcaacgag cgcaaccctc
     1081 gttcttagtt gctagcagtt cggctgagca ctctaagaag actgccttcg tnaggaggag
     1141 gaaggtgagg acgacgttaa gtcatcatgg cccttacgcc tagggctaca cacgtgctac
     1201 aatggggtgc acaaagagac gcaataccgc gaggtggagc aaatctcaaa aacatctctc
     1261 agttcggatt gtagtctgca actcgactac atgaagctgg aatcgctagt aatcgtgaat
     1321 cagccatgtc acggtgaata cgttcccggg tcttgtactc accgnccgtc acaccatggg
     1381 agttgtattc gccttaagcc gggatgctaa attggctacc gtccanggcg gatnc
//
LOCUS       HECRDA       1446 bp ss-rRNA            BCT       26-JUL-1990
DEFINITION  H.felis 16S ribosomal RNA.
ACCESSION   M35047
KEYWORDS    16S ribosomal RNA.
SOURCE      H.felis (ATCC 49179) ribosomal RNA.
  ORGANISM  Helicobacter felis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 9 to 1446)
  AUTHORS   Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and
            Ferrero,R.
  TITLE     The phylogeny of Helicobacter felis sp. nov., a spiral-shaped
            bacterium isolated from the gastric mucosa of the cat, Helicobacter
            mustelae, and related bacteria
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1446)
  AUTHORS   Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and
            Ferrero,R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by B.J.Paster, 06-JUN-1990.
            
               Author address:B.J.Paster
               Forsyth Dental Center
               140 Fenway
               Boston, MA 02115
FEATURES       from  to/span     description
    rRNA          1  >  1446     16S ribosomal RNA
BASE COUNT      354 a    299 c    402 g    310 t     81 others
ORIGIN      
        1 tttatggaga gtttgatcct ggctcagagt gaacgctggc ggcgtgccta atacatgcaa
       61 gtcgaacgat gaagcctagc ttgctaggcg gattagtggc gcacgggtga gtaacgcata
      121 gatgacatgc cctttagttt gggatagcca ctagaaatgg tgattaatac caaatactac
      181 ctacggggga aagatttatc gctaaaggat tggtctatgt cctatcagct tgttggtgag
      241 gtaaaggctc acnnaggcta tgacgggtat ccggcctgag agggtgaacg gacacactgg
      301 aactgagaca cggtccagac tccnncggga ggcagcagta gggaatattg ctcaatgggc
      361 gcaagcctga agcagcaacg ccgcgtggag gatgaaggtt ttaggattgt aaactccttt
      421 tgtcagagaa gataatgacg gtatctgacg aataagcacc ggctanctcc gtgccagcag
      481 ccgcggtaat acggagggtg cnagcgttac tcggaatcnc tgggcgtaaa gagtgcgtag
      541 gcggggttgt aagtcagatg tgaaatccta tggcttaacc atagaactgc atttgaaact
      601 acaactctgg agtgtgggag aggtaggtgg aattcttggt gtaggggtaa aatccgtaga
      661 gatcaagagg aatactcatt gcgaaggcga cctgctggaa caatactgac gctgattgcn
      721 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccct aaacgatgga
      781 tgctagttgt tggggggctt tgtcctccca gtaatgcagc taacgcctta agcatcccgc
      841 ctggggagta cggtcgcaag annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc gaagaacctt acctaggctt gacattgaan
      961 gaatctgcta gaaatatgtg agtgtctagc ttgctagacc ctgaaaacag gtgctgcacg
     1021 gctgtcgtca gctcgtgtcg tgagatgttg ggttaagtcc cgcaacgagc gcaaccctct
     1081 ttcttagttg ctaacaggta gtgctgagct ctctaagaat actgcctgcg taagcaggag
     1141 gaaggtgagg acgacgtcaa gtcatcatgg cccttacgcc tagggctaca cacgtgctac
     1201 aatggggtgc acaaagagat gcaatgccgc gaggttgagc caatcttaaa aacnnctctc
     1261 agttcggatt gcaggctgca actcgcctgc atgaagctgg aatcgctagt aatcgcaaat
     1321 cagccatgtt gcggtgaata cgttcccggg tcttgtactc accgnncgtc acaccatggg
     1381 agttgtgttt gccttaagtc aggatgctaa ggtagctact gcccacggca cacacagcga
     1441 ctgggg
//
LOCUS       HUMHPBS       821 bp ss-mRNA            PRI       26-JUL-1990
DEFINITION  Human peripheral benzodiazepine receptor (hpbs) mRNA, complete cds.
ACCESSION   M36035
KEYWORDS    peripheral benzodiazepine receptor.
SOURCE      Human hystiocytic lymphoma monocyte-like cell line U937, cDNA to
            mRNA, clone p-hPBS11.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 821)
  AUTHORS   Riond,J., Mattei,M.G., Kaghad,M., Dumont,X., Guillemot,J.C.,
            Le Fur,G., Caput,D. and Ferrara,P.
  TITLE     Molecular cloning and chromosomal localization of a human
            peripheral-type benzodiazepine receptor
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Riond, 27-JUN-1990.
            
               Author address:J.Riond
               SANOFI ELF BIO RECHERCHES
               BP137  31328 LABEGE CEDEX
               FRANCE
FEATURES       from  to/span     description
    pept         62      571     peripheral benzodiazepine receptor
    mRNA    <     1      811     peripheral benzodiazepine receptor mRNA
    site        800      805     polyadenylation site
BASE COUNT      118 a    271 c    260 g    171 t      1 others
ORIGIN      Chromosome 22, map position q13.3.
        1 agtgcccttc ccggagcgtg ccctcgccgc tgagctcccc tgaacagcag ctgcagcagc
       61 catggccccg ccctgggtgc ccgccatggg cttcacgctg gcgcccagcc tggggtgctt
      121 cgtgggctcc cgctttgtcc acggcgaggg tctccgctgg tacgccggcc tgcagaagcc
      181 ctcgtggcac ccgccccact gggtgctggg ccctgtctgg ggcacgctct actcagccat
      241 ggggtacggc tcctacctgg tctggaaaga gctgggaggc ttcacagaga aggctgtggt
      301 tcccctgggc ctctacactg ggcagctggc cctgaactgg gcatggcccc ccatcttctt
      361 tggtgcccga caaatgggct gggccttggt ggatctcctg ctggtcagtg gggcggcggc
      421 ngccactacc gtggcctggt accaggtgag cccgctggcc gcccgcctgc tctaccccta
      481 cctggcctgg ctggccttcg cgaccacact caactactgc gtatggcggg acaaccatgg
      541 ctggcatggg ggacggcggc tgccagagtg agtgcccggc ccaccaggga ctgcagctgc
      601 accagcaggt gccatcacgc ttgtgatgtg gtggccgtca cgctttcatg accactgggc
      661 ctgctagtct gtcagggcct tggcccaggg gtcagcagag cttcagaggt tgccccacct
      721 gagcccccac ccgggagcag tgtcctgtgc tttctgcatg cttagagcat gttcttggaa
      781 catggaattt tataagctga ataaagtttt tgacttcctt t
//
LOCUS       XELAAA        121 bp ss-rRNA            VRT       26-JUL-1990
DEFINITION  X.laevis 5S ribosomal RNA.
ACCESSION   M35175
KEYWORDS    5S ribosomal RNA.
SOURCE      X.laevis somatic cell ribosomal RNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Wegnez,M. and Denis,H.
  TITLE     Expression des genes ribosomiques 5 S chez le Xenope
  JOURNAL   Arch. Int. Physiol. Biochim. 81, 211-213 (1973)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      121     5S ribosomal RNA
BASE COUNT       24 a     34 c     38 g     25 t
ORIGIN      
        1 gcctacggcc acaccaccct gaaagtgccc gatctcgtct gatctcggaa gccaagcagg
       61 gtcgggcctg gttagtactt ggatgggaga ccgcctggga ataccaggtg tcgtaggctt
      121 t
//
LOCUS       XELAAB        121 bp ss-rRNA            VRT       26-JUL-1990
DEFINITION  X.laevis 5S ribosomal RNA.
ACCESSION   M35176
KEYWORDS    5S ribosomal RNA.
SOURCE      X.laevis oocyte ribosomal RNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Wegnez,M. and Denis,H.
  TITLE     Expression des genes ribosomiques 5 S chez le Xenope
  JOURNAL   Arch. Int. Physiol. Biochim. 81, 211-213 (1973)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      121     5S ribosomal RNA
BASE COUNT       25 a     33 c     37 g     26 t
ORIGIN      
        1 gcctacggcc acaccaccct gaaagtgcct gatctcgtct gatctcagaa gcgatacagg
       61 gtcgggcctg gttagtactc ggatgggaga ccgcctggga ataccaggtg tcgtaggctt
      121 t
//
LOCUS       ECORR50L1A    165 bp ds-DNA             RNA       26-JUL-1990
DEFINITION  E.coli 50S rRNA protein L1-associated RNA.
ACCESSION   M24864
KEYWORDS    50S ribosomal RNA.
SOURCE      E.coli 50S ribosomal RNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 165)
  AUTHORS   Branlant,C., Krol,A., Sriwidada,J. and Brimacombe,R.
  TITLE     RNA sequences associated with proteins L1, L9, and L5, L18, L25, in
            ribonucleoprotein fragments isolated from the 50-S subunit of
            Escherichia coli ribosomes
  JOURNAL   Eur. J. Biochem. 70, 483-492 (1976)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    modified     13       13     7-methyluridine
    unsure       61       61     u could be a
    unsure      141      141     c could be g
BASE COUNT       36 a     35 c     47 g     47 t
ORIGIN      
        1 taacctttac tatggcgaca ctgaacattg agccttgatg tgtaggatag gtgggagctt
       61 tgaagtggac gtgccagtct gcatggagcc gaccttgaaa taccctttac aatgtttgat
      121 gttctaacgt ggacccgctt cgggttgcat cgcggacagt gtctg
//
LOCUS       BMOSP1       1512 bp ds-DNA             INV       26-JUL-1990
DEFINITION  Silkworm (B.mori) storage protein 2 (SP2) gene, exon 1.
ACCESSION   M24371 J04829
KEYWORDS    arylphorin-type storage protein; storage protein;
            storage protein 2.
SEGMENT     1 of 2
SOURCE      Silkworm (strain Tokai x Asahi; 5th larval instar) DNA and cDNA to
            mRNA.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 983)
  AUTHORS   Fujii Tomino,S.
  TITLE     Structure of the gene for the arylphorin-type storage protein, Sp 2
            of Bombyx mori
  JOURNAL   J. Biol. Chem. 264, 11020-11025 (1989)
  STANDARD  simple staff_review
REFERENCE   2  (bases 984 to 1512)
  AUTHORS   Fujii,T., Sakurai,H., Izumi,S. and Tomino,S.
  JOURNAL   Unpublished (1989) 2-1-1 Fukazawa, Setagaya-ku, Tokyo 158, Japan
  STANDARD  simple staff_review
COMMENT     Draft entry and sequence for [1],[2] kindly submitted by S.Tomino,
            28-APR-1989.
FEATURES       from  to/span     description
    pept        796  +   883     storage protein SP2, exon 1
    pre-msg     771  >  1512     SP2 mRNA and intron
    IVS         884  >  1512     SP2 intron A
    binding     436      441     glucocorticoid-receptor binding site
    site        447      457     SV40 enhancer core
    conflict    199      199     c in [1]; g in [2]
BASE COUNT      475 a    266 c    247 g    524 t
ORIGIN      Unreported.
        1 aagcttttta aaaaaagaac tttatttaat tttaataatt aaaacatttg aaattaacaa
       61 ttgaaattaa ttggcgcaag tgtcaccggg agcgcggtta gaattgaact gcgtgatcta
      121 tcggtaacct aactaagctg cattacgtcg tgcaccttac attgcacatt tatgtacatt
      181 aaaaatatat aacagtaccc aataaaaaag cattatttcg tcttgtaaca gtcggttgaa
      241 aaattgaaag taattaacga catgcttaga gtttcgatcg tagtaaaagc tacgttttgt
      301 ctatcatatt agaaagatat agtaacttct tttgtctctc tttattcttt aaattttact
      361 taatcaggtg aatagctttt actactttac tcaatgtttt catcatactc ctggctaagt
      421 cttcgctagc ccgcctgtcc tagtaagccg tggaaaggct ccgggacacc agcaaacctt
      481 caatcataaa aaaaattgct ttcatcatgt tttcgtttac agttttacaa atatttcata
      541 attttccatt cctttttttt gaattatata ataataacaa gaaaaaaact ttatatctat
      601 ttgtttatca tcatcgttga aatttatatt cagtaattca aattatgaga ccggtgaaaa
      661 ggtcagtaga ttacgttgat aatgaaagca taacacttgt tgctaatgag tgcatgtttc
      721 gggagaagat aaagtgtggg tataaatatt cgaaaacgga ttgcagaagc acagtttgct
      781 tctaggctgg aaaccatgaa gtctgtcttg attctggctg ggcttgtagc cgtcgcgctc
      841 agcagtgcag taccaaaacc gagcaccata aagtcaaaaa atggtaagcg ttaaatagta
      901 gtgctctatt ttaatacgct tttattatta ttattattat tattaattct ttatttcagt
      961 tttgtttttt aaaaccataa cattttgtta gtagtaatta cttatatcta tgttagtgac
     1021 ttaaaaaatc taacacataa ctctcattat atatatacat tttataccat tacatttttt
     1081 attttatttt tttctccttc caagtgccta ctgcaaaggc tattgatcag cagtccctcg
     1141 atcttgctcg atatgattct caaaagactg ttgccactgt cacgaactcg acgcaataac
     1201 gatgcacttc tcttccgcat tattgcaaag aagtcatcgg tgtgagatgt cgcaaacatt
     1261 gtggatgcac tacaaaagcg cggcagtgac aacatcatcc taaacgcatt attatattga
     1321 acgcgtaggg cattgtaagc tctccgcgtg tatgtggtcc acagactact ggcgtaaaaa
     1381 ttctggcaat aagctttaaa aattgtaatt tgacatacta tcgcaaccag taaatctgcg
     1441 ggccagcata ttgcatctta ctatcaatta ttattattat tttttttatt gcttagatgt
     1501 gtggacgagc tc
//
LOCUS       BMOSP2       3876 bp ds-DNA             INV       26-JUL-1990
DEFINITION  Silkworm (B.mori) storage protein 2 (SP2) gene, exons 2,3,4, and 5.
ACCESSION   M24370 J04829
KEYWORDS    arylphorin-type storage protein; storage protein;
            storage protein 2.
SEGMENT     2 of 2
SOURCE      Silkworm (strain Tokai x Asahi; 5th larval instar) DNA and cDNA to
            mRNA.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1288 to 3876)
  AUTHORS   Fujii,T., Sakurai,H., Izumi,S. and Tomino,S.
  TITLE     Structure of the gene for the arylphorin-type storage protein, Sp 2
            of Bombyx mori
  JOURNAL   J. Biol. Chem. 264, 11020-11025 (1989)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 1287)
  AUTHORS   Tomino,S.
  JOURNAL   Unpublished (1989) 2-1-1 Fukazawa, Setagaya-ku, Tokyo 158, Japan
  STANDARD  simple staff_review
COMMENT     Draft entry and sequence for [1],[2] kindly submitted by S.Tomino,
            28-APR-1989.
FEATURES       from  to/span     description
    pept    +  1388     1521     storage protein SP2, exon 2
               1614     2444     storage protein SP2, exon 3
               2526     2692     storage protein SP2, exon 4
               2850     3744     storage protein SP2, exon 5
    pre-msg <     1     3846     SP2 mRNA and introns
    IVS     <     1     1387     SP2 intron A
    IVS        1522     1613     SP2 intron B
    IVS        2445     2525     SP2 intron C
    IVS        2693     2849     SP2 intron D
BASE COUNT     1204 a    734 c    696 g   1242 t
ORIGIN      Unknown number of bp after segment 1.
        1 ttgctagccc ttcttcttta tgttttggag aaggttctca attcaaaatg tacgttttca
       61 ttatagcctt attacgaaag cttatacgaa cgttatatct ttaactatgc atacagccgt
      121 ctattgaatc attgttgtta taaattgttt tacaattgct ataggctcac atctctttct
      181 gaggcgtgat ttagaaaagg atgcacgatg cgtgatccaa tttggaattt gatagctcgg
      241 cctcatctcc tgcctcatag caaggccgat tttgtgaggc ctcctatcta aactaaaaag
      301 aacaaaaccg cacttacccc gcagcggccg actaggttgc actgttgcta taccatcatt
      361 tgtatgttgg tatattatta ccgctgtaat gtataggtac attaccgcca gtattgcata
      421 tgttgcacga tgaacatgtt caatatatgt aaaatttaca atttaaatac gtcaccgttt
      481 caacacaaaa ctatttgcaa atggattcat cattcatcat ctaaactcgt cgtggcctaa
      541 aggataagac gtccggtgca ttcgtgttga gcgatgcacc ggtgctcgaa tcccaagcgg
      601 gtaccaattt ttctaatgga atacgtactc aacaaatgtt catgattgac ttccacggta
      661 aaggaataac atctatacta atattataaa gaggaaagat ttgtttgttt gtttgtttcg
      721 aataggctcc gaaactactg gaccgatttg aaaaattctt tttccattag aagccaacat
      781 tgtccctgat gaacataggc tacatttttt aatttttttt tttttttttg tttcatgtgt
      841 gttttaatgt ttccgaagcg aagcgagggc gggtcgctag tcgtgtaata aaaatcaaag
      901 ccgcaaaaat tataatttgc gtaattacta gtggtaggac ctcttgtgac gcaagggtag
      961 gtacttgaga ccttagaatt tatatctcaa ggtgggtggt gcatatacgt tgtaaatgtc
     1021 tatggggtct agtaaccgct taacaccagg tggactagtt cagccaccta agcaataaaa
     1081 ataaaaatca tcaaaataga aaatcaacca ttgtaggttt ataccgtatt gactaagtaa
     1141 taaagaaaag caggtttttt ttacaaacaa ccaaattatg taataaaagt aaatatagta
     1201 agctatgaac gaccgattag tggtaacata tcggcgctga aagttcctaa tgtgctttga
     1261 tgccaatatt tatctcagaa ttgaagttat tcaatacttt ccagataatg atgacatcta
     1321 agtgatatcg cttattcgta aatacttctt tataaaatat ttacatatat ttttttactt
     1381 tattcagtgg atgccgtatt tgttgaaaag caaaagaaaa ttctgtcctt cttccaagat
     1441 gtgagccaac taaacactga tgatgaatat tataaaattg gcaaagacta tgatatcgaa
     1501 atgaatatgg acaactacac tgtaagtact aataattaat atcaatttaa atttaacgtg
     1561 aatttgtttg tttctttctt tctttattga aaaccatgtt tcatatttaa cagaacaaga
     1621 aagctgttga agaatttctg aagatgtaca ggactggttt tatgcctaag aatttagagt
     1681 tctccgtttt ttatgacaag atgagggatg aagctattgc tctattggat ttattctatt
     1741 acgctaagga ctttgaaacg ttctacaaga gtgcctgttt tgcgcgtgtg catctcaatc
     1801 aaggtcaatt cttgtatgcc ttctacatcg ctgttatcca gcgccctgat tgccacggtt
     1861 tcgttgttcc tgctccgtat gaagtatacc ctaaaatgtt tatgaatatg gaagtgctgc
     1921 aaaaaattta cgtaacaaag atgcaacatg gcctcattaa tcctgaagcc gcagctaagt
     1981 atggcattca caaggaaaac gactacttcg tttacaaagc caattattct aacgccgttt
     2041 tatacaataa tgaagaacaa aggctgacat acttcactga ggatattggc atgaacgctt
     2101 actactacta cttccactct catttaccgt tctggtggac atcagaaaaa tacggagccc
     2161 ttaaagagcg tcgtggagag gtttacttct acttctacca gcaattattg gctcgttact
     2221 actttgagcg tcttaccaat ggacttggta agattcccga attctcatgg tactctccga
     2281 taaagactgg atactatcca ttgatgctaa ctaagtttac acccttcgca caaagacctg
     2341 actactacaa cttgcacacc gaagaaaact atgaaagagt aagattcctt gacacttatg
     2401 agaagacatt cgttcagttc ctccaaaagg accactttga agccgtaagt tcgaacatta
     2461 agtgtctaat cttattggtt tatttctaaa aatgtatgaa tttaatagat tttttcattt
     2521 tatagttcgg acaaaaaatt gattttcacg acccgaaagc cattaacttc gtcggcaact
     2581 actggcaaga taatgcagat ctgtatggag aagaagtcac aaaagattac caacgttctt
     2641 acgaagtatt tgcgcgccgt gtgctaggtg ctgcgcctat gccattcgac aagtacgttt
     2701 aaaaaatatt ttcaaaactt aatttttact aagcaatgac gacaactctt ttctacgtta
     2761 tatccaagtc aaccgtaatc cggatttgtc tttgtacgtt tgcaaaaaaa ttaatagtaa
     2821 tacatagttt cttcatgcta ctttttcagg tacactttca tgcctagtgc aatggacttt
     2881 taccagactt ctcttcgtga tcctgctttc tatcagctct acaacagaat tgtggaatac
     2941 atcgttgagt tcaagcaata cttgaagcct tacactcaag acaaacttta ctttgatggt
     3001 gtcaagataa ctgatgttaa agtcgacaaa ttgacaacat tcttcgagaa ctttgaattc
     3061 gacgccagca acagcgtgta ctttagtaag gaggagatta agaacaatca cgtccatgag
     3121 ttaaggtgcg ccacacgatt gaaccacagc cccttcaacg ttaacattga ggttgattct
     3181 aatgtcgcca gtgacgctgt tgtcaaaatg ttgctggccc ccaaatacga tgacaacgga
     3241 atacctctca cattagagga caactggatg aaattcttcg agttggactg gttcacaact
     3301 aaactcaccg ctggtcagaa caagattatc cgcaattcga atgaatttgt catatttaaa
     3361 gaagactccg tgccaatgac tgaaattatg aagatgctcg acgaaggaaa agtacctttt
     3421 gatatgtcgg aagagttctg ttacatgcct aaaagactca tgctgcctag aggtactgaa
     3481 ggtggattcc cattccagct ctttgttttc gtctatccat tcgacaacaa aggcaaggac
     3541 ttggctcctt tcgaatcttt tgttcttgac aataacctct tggcttccct ctggatcgcc
     3601 ccgttgttga tgcattattc aaggttccta acatgtattt caaggatatt ttcatttacc
     3661 acgagggtga acggttccct tacaaattca atcttccttc gtatgacaca catgataatg
     3721 ttgttccaaa aaattaaatt ttaataaact gatgaatttt gcatccgtaa tatccaaaga
     3781 aaatgtaaaa actttaagta gaactgttat gatttagaaa aaataaaatc aagtaggtaa
     3841 aattataatt atgtattttt attgcatgca ttttta
//
LOCUS       HUMG6PA      1464 bp ss-mRNA            PRI       26-JUL-1990
DEFINITION  Human glucose-6-phosphate dehydrogenase, complete cds.
ACCESSION   M24470 M27958
KEYWORDS    glucose-6-phosphate dehydrogenase.
SOURCE      Human, cDNA to mRNA, clone NG6PD 1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1464)
  AUTHORS   Kanno,H., Huang,I.-Y., Kan,Y.W. and Yoshida,A.
  TITLE     Two structural genes on different chromosomes are required for
            encoding the major subunit of human red cell glucose-6-phosphate
            dehydrogenase
  JOURNAL   Cell 58, 595-606 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and sequence for [1] kindly submitted by A.Yoshida,
            02-MAY-1989.
FEATURES       from  to/span     description
    pept         72     1109     glucose-6-phosphate dehydrogenase
                                 /nomgen="G6PD" /map="Xq28"
                                 /hgml_locus_uid="LH0033J"
    mRNA    <     1     1464     glucose-6-phosphate dehydrogenase mRNA
BASE COUNT      331 a    404 c    389 g    340 t
ORIGIN      
        1 ctccccgcgc cgccccgcgc aggcgccccc gccccgccgt cgccgccgcc gcagccagga
       61 gccgctgcac catgccccgc atagatgcgg acctcaagct cgacttcaag gacgtcctgc
      121 tccgacctaa gcggagcagc ctcaagagcc gagccgaggt ggatcttgaa cgcaccttca
      181 cgtttcgaaa ttcaaagcag acctactcag ggattcccat catcgtggcc aacatggaca
      241 ctgtgggcac gtttgagatg gcagccgtga tgtcacagca ctccatgttt acagcaattc
      301 ataagcatta ctccctggat gactggaagc tctttgccac aaatcaccca gaatgcctgc
      361 agaatgtagc cgtgagttca ggcagtgggc agaatgatct ggaaaagatg accagcatcc
      421 tggaagctgt gccacaggtt aagtttattt gcctggatgt ggccaatggg tattcagaac
      481 attttgtgga attcgtgaaa cttgtccgtg ccaaatttcc tgaacacacc attatggcag
      541 ggaacgtggt gacaggagaa atggtagaag agcttattct ttccggagca gatatcatca
      601 aagtgggagt tggaccaggt tctgtgtgca ccacccgcac caagacggga gtggggtacc
      661 cccagctgag tgccgtcatt gagtgtgccg actctgccca cggcctgaag ggccacatca
      721 tctctgatgg aggctgtacg tgtccagggg atgtcgccaa agcctttgga actggagcag
      781 attttgtcat gctgggagga atgttttcgg gtcatacgga gtgtgctgga gaagtgattg
      841 agaggaacgg acggaagctc aagctcttct acgggatgag ctctgacacc gccatgaaca
      901 agcacgcagg aggagttgct gagtacagag cctctgaggg taagactgtg gaagttcctt
      961 acaaaggaga tgtggaaaac actatcctgg atattctcgg gggactgagg tccacgtgca
     1021 cctacgtggg ggccgccaaa ctcaaggagc tcagcaggag ggcaacattc atccgggtga
     1081 cccagcagca caacaccgtg ttcagctaac cctggggaca aagcagcgtc tggctcgatg
     1141 gaagcgtcca aacctgcttt tcccatctcc ccccaagtct gttccgtcag agcttctggc
     1201 tgctcctgaa tggtggaatg cctgtgtcct ctcttctgtc tcctgccgcc tggaggcttc
     1261 ggggctctcc cgcctgcctt ctcggggccc agacgcaagg caccgattgg gccaacatca
     1321 gagccctgct gcccagaact cataacctca ttgttcaaac caacacttgc acctttctct
     1381 ttttctcttt ctctctccct ttctttgttt ttctttcttt tttaaaagaa gatggtttca
     1441 gctttaatat aatgctatta tctt
//
LOCUS       MUSGT1A      2544 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Mouse glucose transporter 1 mRNA, complete cds.
ACCESSION   M23384 J04557
KEYWORDS    glucose transporter 1.
SOURCE      Mouse adipocyte cell line 3T3-L1, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (sites)
  AUTHORS   Kaestner,K.H., Christy,R.J., McLenithan,J.C., Braiterman,L.T.,
            Cornelius,P., Pekala,P.H. and Lane,M.D.
  TITLE     Sequence, tissue distribution, and differential expression of mRNA
            for a putative insulin-responsive glucose transporter in mouse
            3T3-L1 adipocytes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 3150-3154 (1989)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 2544; for [1])
  AUTHORS   Kaestner,K.H., Christy,R.J., McLenithan,J.C., Braiterman,L.T.,
            Cornelius,P., Pekala,P.H. and Lane,M.D.
  JOURNAL   Unpublished (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly provided
            by M.Lane, 28-MAR-1989.
FEATURES       from  to/span     description
    pept        190     1668     glucose transporter 1
BASE COUNT      514 a    719 c    679 g    632 t
ORIGIN      
        1 ttggtcctat aaaaaggcag ctccgcgcgc tctcccccaa gagcagaggc ttgcttgtag
       61 agtgacgatc tgagctacgg ggtcttaagt gcgtcagggc gtggaggtct ggcgggagac
      121 gcatagttac agcgcgtccg ttctccgtct cgcagccggc acagctagag cttcgagcgc
      181 agcgcggcca tggatcccag cagcaagaag gtgacgggcc gcctcatgtt ggctgtggga
      241 ggagcagtgc tcggatcact gcagttcggc tataacactg gtgtcatcaa cgccccccag
      301 aaggttattg aggagttcta caatcaaaca tggaaccacc gcatcggaga gcccatccca
      361 tccaccacac tcaccacgct ttggtctctc tccgtggcca tcttctctgt cgggggcatg
      421 attggttcct tctctgtcgg cctctttgtt aatcgctttg gcaggcggaa ctccatgctg
      481 atgatgaacc tgttggcctt tgtggctgct gtgcttatgg gcttctccaa actgggcaag
      541 tcctttgaga tgctgatcct gggccgcttc atcatcggtg tgtactgcgg cctgactact
      601 ggctttgtgc ccatgtatgt gggagaggtg tcacctacag ctctacgtgg agccctaggc
      661 acactgcacc agctgggaat cgtcgttggc atccttattg cccaggtgtt tggcttagac
      721 tccatcatgg gcaatgcaga cttgtggcct ctgctgctca gtgtcgtctt cgtcccagcc
      781 ctgctacagt gtatcctgtt gcccttctgc cccgagagcc cccgcttcct gctcatcaat
      841 cgtaacgagg agaaccgggc caagagtgtg ctgaagaagc ttcgagggac agccgatgtg
      901 acccgagacc tgcaggagat gaaagaagag ggtcggcaga tgatgcggga gaagaaggtc
      961 accatcttgg agctgttccg ctcacccgcc taccgccagc ccatcctcat cgctgtggtg
     1021 ctgcagctgt cccagcagct gtcgggtatc aatgctgtgt tctactactc aacgagcatc
     1081 ttcgagaagg caggtgtgca gcagcctgtg tacgccacca tcggctccgg tatcgtcaac
     1141 acggccttca ctgtggtgtc gctgtttgtt gtagagcgag ctggacgacg gaccctgcac
     1201 ctcattggcc tggctggcat ggcaggctgt gctgtgctca tgaccatcgc cctggccttg
     1261 ctggaacggc tgccttggat gtcctatctg agcatcgtgg ccatctttgg ctttgtggcc
     1321 ttctttgaag taggccctgg tcctattcca tggttcattg tggccgagct gttcagccag
     1381 gggccccgtc ctgctcgtat tgctgtggct ggcttctcca actggacctc aaacttcatt
     1441 gtgggcatgt gcttccagta tgtggagcaa ctgtgcggcc cctacgtctt catcatcttc
     1501 acggtgctcc tcgtgctctt cttcatcttc acctacttca aagtccctga gaccaaaggc
     1561 cgaaccttcg atgagatcgc ttccggcttc cggcaggggg gtgccagcca aagtgacaag
     1621 acacccgagg agctcttcca ccctctgggg gcggactccc aagtgtgagg agccccacac
     1681 ccagcccggc ctgctccctg cagcccaagg atctctctgg agcacaggca gctagatgag
     1741 acctcttccg aaccgacaga tctcgggcaa gccgggcctg ggcgcctttc ctcagccagc
     1801 agtgaagtcc aggaggatat tcaggacttt gatggctcca gaatttttaa tgaaagcaag
     1861 actgctgctc agatctattc agataagcag caggttttat aattttttta ttactgattt
     1921 tgttattttt tttttttatc agccactctc ctatctccac actgtagtct tcaccttgat
     1981 tggcccagtg cctgagggtg gggaccacgc cctgtccaga cacttgcctt ctttgccaag
     2041 ctaatctgta gggctggacc tatggccaag gacacactaa taccgaactc tgagctagga
     2101 ggctttacgc tggaggcggt agctgccacc cacttccgca ggcctggacc tcggcaccat
     2161 aggggtccgg actccatttt aggattcgcc cattcctgtc tcttcctacc caaccactca
     2221 attaatcttt ccttgcctga gaccagttgg aagcactgga gtgcagggag gagagggaag
     2281 ggccaggctg ggctgccagg ttctagtctc ctgtgcactg agggccacac aaacaccatg
     2341 agaaggacct cggaggctga gaacttaact gctgaagaca cggacactcc tgccctgctg
     2401 tgtatagatg gaagatattt atatactggt tgtcaatatt aaatacagac actaagttat
     2461 agtatatctg gacaaaccca cttgtaaata caccaacaaa ctcctgtaac tttacctaag
     2521 cagatataaa tggctggttt ttag
//
LOCUS       MUSMS6HM      321 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  M.musculus Ms6-hm locus, repeat elements.
ACCESSION   J04743
KEYWORDS    dispersed repetitive element; minisatellite sequence.
SOURCE      M.musculus (strain C57BL/6J) DNA, clone pMm3-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Kelly,R., Bulfield,G., Collick,A., Gibbs,M. and Jeffreys,A.J.
  TITLE     Characterization of a highly unstable mouse minisatellite locus:
            Evidence for somatic mutation during early development
  JOURNAL   Genomics 5, 844-856 (1989)
  STANDARD  full staff_entry
COMMENT     Printed sequence for [1] kindly submitted by R.Kelly, 08-AUG-1989.
FEATURES       from  to/span     description
    rpt          37      168     MT dispersed repetitive element
    rpt         168      263     tandem repeated element
    rpt         263  >   321     MT dispersed repetitive element
BASE COUNT       84 a     66 c    109 g     62 t
ORIGIN      
        1 gatccccagt gatgtaaacc agactatatg gctaactgtt ttagttagag tttctagttg
       61 ctgtgaccaa caccatgacc aaaaagcaag ttggggagga aaggatttat ttgacttaca
      121 cttccatata actgttcatc atcaaaagaa atcaggacag aaacccgggg gcagggcagg
      181 gcagggcagg gcagggcagg gcagggcagg gcagggcagg gcagggcagg gcagggcagg
      241 gcagggcagg gcagggcagg gcagggctga tgtagcgtca ctgaggagtc ctgcttccta
      301 ctttgcttcc atgggtggat c
//
LOCUS       RABCYP4A6    1790 bp ss-mRNA            MAM       26-JUL-1990
DEFINITION  Rabbit cytochrome P450IVA6 (CYP4A6) mRNA, complete cds.
ACCESSION   M28656
KEYWORDS    cytochrome P450; lauric acid omega-hydroxylase.
SOURCE      Rabbit (strain New Zealand White, adult) kidney, cDNA to mRNA,
            clone KdA6.
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 1790)
  AUTHORS   Johnson,E.F., Walker,D.L., Griffin,K.J., Clark,J.E., Okita,R.T.,
            Muerhoff,A.S. and Masters,B.S.
  TITLE     Cloning and expression of three rabbit kidney cDNAs encoding lauric
            acid omega-hydroxylases
  JOURNAL   Biochemistry 29, 873-879 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.F.Johnson, 04-OCT-1989.
FEATURES       from  to/span     description
    pept         14     1546     lauric acid omega-hydroxylase
BASE COUNT      341 a    610 c    479 g    360 t
ORIGIN      
        1 gggccgctgc accatgagcg tgtctgcact gaaccccacc cggctcccgg gcagcctctc
       61 cgggctcctc caagtggcgg gcctgctggg cctgctcctg ctgctgctca aggcagctca
      121 gctctacctg caccgccagt ggctgctcag agccctccag cagttcccgt gcccaccctt
      181 ccactggctc ctggggcaca gccgagagtt ccaaaatggc catgagttac aagtgatgct
      241 gaaatgggtg gagaaattcc caagtgcttg tcctcgctgg ctatggggga gcagagccca
      301 cctcctgatc tatgaccctg actacatgaa ggtgattctg gggagatcag acccaaaagc
      361 tcaaggttcc tacagattcc tggctccctg gattgggtat ggtttgctcc tgctgaatgg
      421 gcagacgtgg ttccagcacc ggcgcatgct caccccagcc ttccactacg acatcctgaa
      481 gccctacgtg gggctcatgg cggactccgt ccaaatcatg ctggacaaat gggagcagct
      541 ggtcagccag gactcctccc tggaggtctt ccaagacatc tccctgatga ccctggacac
      601 catcatgaag tgtgccttca gccaccaggg cagcgtccag ttggacagga attcccagtc
      661 ctacatccag gctgttgggg acctgaacaa cctgttcttt tcccgagtga ggaacgtctt
      721 tcatcagagt gacaccatct acaggctgag ccctgaaggc cgcttgtccc accgtgcctg
      781 ccagctcgcc cacgagcaca cagaccgagt gatccagcag aggaaggctc agctgcagca
      841 ggagggggag ctggagaagg tcaggaggaa gaggcgcttg gacttcctgg acgtcctcct
      901 ctttgccaag atggagaacg ggagcagcct gtccgaccag gacctccgcg ccgaggtgga
      961 cacgttcatg ttcgagggcc acgacaccac ggccagcggc atctcctgga tcttctatgc
     1021 cctggccacg caccccgagc atcagcaccg gtgccgcgag gagatccagg gcctcctggg
     1081 ggacggagcc tccatcacct gggagcacct ggaccagatg ccctacacca ccatgtgcat
     1141 caaggaggcg ctgagactct acccaccagt gccaggtgtc ggcagacagc tcagctcacc
     1201 tgtcaccttc cctgatggac gctccctccc caagggtgtc atagtcacgc tctccatcta
     1261 cgcccttcac cacaacccga aggtgtggcc aaacccagag gtgtttgacc ctttcccgtt
     1321 cgcaccgggt tctgctcgcc acagccacgc tttcctgccc ttctcaggag gaccacggaa
     1381 ctgcatcggg aagcaatttg ccatgaatga gctgaaggtg gccgtggccc tgaccctcgt
     1441 gcgcttcgag ctgctgccag atcccaaaag agtcccggac caaaaaccac gtcttgtgct
     1501 gaagtccagc aacgggatcc acctgcgtct gaggaagctc cgctaaccct ggtggggaca
     1561 agagcaggct ctggggcctt ctgccaggcg tcctggcttc ctgtcacctg cccatgcccc
     1621 ctgcctgtct gcccacatcc tgctttctat ccaccagcac ttcttccacc tgtctgcctt
     1681 gctgcctctt ggcctccagg ctgtctgtcc tctcgcacct tcctctgggc cactgacctg
     1741 tctgtctact gtccgcttcc tgccagcatc tctgaccgtg cacctaaccc
//
LOCUS       RABCYP4A7    1694 bp ss-mRNA            MAM       26-JUL-1990
DEFINITION  Rabbit cytochrome P450IVA7 (CYP4A7) mRNA, complete cds.
ACCESSION   M28657
KEYWORDS    cytochrome P450; lauric acid omega-hydroxylase.
SOURCE      Rabbit (strain New Zealand White, adult) kidney, cDNA to mRNA,
            clone KdB18.
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 1694)
  AUTHORS   Johnson,E.F., Walker,D.L., Griffin,K.J., Clark,J.E., Okita,R.T.,
            Muerhoff,A.S. and Masters,B.S.
  TITLE     Cloning and expression of three rabbit kidney cDNAs encoding lauric
            acid omega-hydroxylases
  JOURNAL   Biochemistry 29, 873-879 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.F.Johnson, 04-OCT-1989.
FEATURES       from  to/span     description
    pept         25     1560     lauric acid omega-hydroxylase
BASE COUNT      330 a    564 c    463 g    337 t
ORIGIN      
        1 ggcagatcca gaagctgctg caccatgagc gtgtctgcgc tgagctccac ccggctcccg
       61 ggcagcttct ccgggttcct ccaagcggcg gccctgctgg gcctactcct gctgctgctc
      121 aaggcagctc agctctacct gcgccgccag tggctgctca gagccctcca gcagttcccg
      181 tgcccaccct cccactggct cctggggcac agccgagagt ttccaataga ctcggagctg
      241 cagcaggtgc tgaagcgagt ggagaaattc ccaagcgcct gtcctcgctg gctgtggggg
      301 agtgagctgt ttctcatttg ctacgaccct gactacatga agacgattct ggggcgatca
      361 gacccaaagg ctcgtgtttc ctacagcttc ctggctccct ggattgggta tggcttgctg
      421 cttttggaag ggcagacgtg gttccagcac cggcgcatgc tcaccccagc cttccactac
      481 gacatcctga agccctacgt ggggctcatg gtggactccg tccaagtgat gctggacaaa
      541 ctggagaagc tcgcccgcaa ggacgcgcct ctggagatat acgaacacgt ctccctgatg
      601 accctggaaa ccatcatgaa gtgcgccttc agccaccagg gcagcgtcca gctggaaagc
      661 aggacctcca aatcctacat ccaggctgtc agggagctca gcgacttggc attgcagcgg
      721 gtgaggaacg tctttcacca gagcgacttc ctctacaggc tgagccctga gggccgcttg
      781 tcccaccgtg cctgccagct cgcccacgag cacacagacc gagtgatcca gcagaggaag
      841 gctcagctgc agcaggaggg ggagctggag aaggtcagga ggaagaggcg cttggacttc
      901 ctggacgtcc tcctctttgc caagatggag aacgggagca gcctgtccga ccaggacctc
      961 cgcgccgagg tggacacgtt catgttcgag ggccacgaca ccacggccag cggcatctcc
     1021 tggatcttct atgccctggc cacgcacccc gagcatcagc accggtgccg cgaggagatc
     1081 cagggtctcc tgggggacgg agcctccatc acctgggagc acctggacaa gatgccctac
     1141 accaccatgt gcatcaagga ggcgctgaga ctctacccac cggtgccagg tgtcggcagc
     1201 aagctcagct cacctgtcac cttccctgat ggacgctccc tccccaaggg catcataatc
     1261 acactctcca tctatggcct gcatcacaac ccgaaggtgt ggccaaaccc agaggtgttt
     1321 gacccttccc gcttcgcacc gggttctgct cgccacagcc acgctttcct gcccttctca
     1381 ggaggatcga ggaactgcat cgggaaacaa tttgccatga acgagctgaa ggtggccgtg
     1441 gccctgaccc tcgtgcgctt cgagctgctg ccggatccca ccagagtccc catccccata
     1501 acaagacttg tgctgaagtc taagaatggg attcacctac gtctcaggaa gctccactaa
     1561 ccctgctgga aacaagaatg gtctgccagg cgtcctctct tcctgtcacc tgcccgtgtc
     1621 ccgcactctg tctgtatctt gctttctctc tacctacctg cccttcttcc acctgcctcc
     1681 gattcggcct tttg
//
LOCUS       HUMRGIT      1095 bp ds-DNA             PRI       26-JUL-1990
DEFINITION  Human rRNA gene internal transcribed spacer 1 (ITS1).
ACCESSION   M36624
KEYWORDS    internal transcribed spacer.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1095)
  AUTHORS   Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and
            Schmickel,R.D.
  TITLE     Ribosomal RNA gene sequences and hominoid phylogeny
  JOURNAL   Mol. Biol. Evol. 7, 203-219 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
BASE COUNT       83 a    429 c    441 g    139 t      3 others
ORIGIN      
        1 acggagcccg gagggcgagg cccgcggcgg cgccgccgcc gccgcgcgct tccctccgca
       61 cacccacccc cccaccgcga cgcggcgcgt gcgcgggcgg ggcccgcgtg cccgttcgtt
      121 cgctcgctcg ttcgttcgcc gcccggcccc gccgccgcga gagccgagaa ctcgggaggg
      181 agacgggggg gagagagaga gagagagaga gagagagaga gagagagaga gaaagaaggg
      241 cgtgtcgttg gtgtgcgcgt gtcgtggggc cggcgggcgg cggggagcgg tccccggccg
      301 cggccccgac grcgtgggtg tcggcgggcg cgggggcggt tctcggcggc gtcgcggcgg
      361 gtctgggggg gtctcggtgc cctcctcccc gccggggccc gtcgtccggc cccgccgcgc
      421 cggctccccg tcttcggggc cggccggatt cccgtcgcct ccgccgcgcc gctccgcgcc
      481 gccgggcacg gccccgctcg ctctccccgg ccttcccgct agggcgtctc gagggtcggg
      541 ggccggacgc cggtcccctc ccccgcctcc tcgtccgccc ccccgccgtc caggtaccta
      601 gcgcgttccg gcgcggaggt ttaaagaccc cttgggggga tcgcccgtcc gcccgtgggt
      661 cgggggcggt ggtgggcccg cgggggagtc ccgtcgggag gggcccggcc cctcccgcgc
      721 ctccaccgcg gactccgctc cccggccggg gccgcgccgc cgccgmcgcc gcggcggccg
      781 tcgggtgggg gctttacccg gcggccgtcg cgcgcctgcc gcgcgtgtgg cgtgcgcccc
      841 gcgccgtggg ggcgggaacc cccgggcgcc tgtggggtgg tgtccgcgct cgcccccgcg
      901 tgggcggcgc gcgcctcccc gtggtgtgaa accttccgac ccctctccgg agtccggtcc
      961 cgtttgctgt ctcgtctggc cggcctgagg caaccccctc tcctcttggg cggggggggs
     1021 ggggggacgt gccgcgccag gaagggcctc ctcccggtgc gtcgtcggga gcgccctcgc
     1081 caaatcgacc tcgta
//
LOCUS       MHV1NP       1670 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Mouse hepatitis virus nucleocapsid (N-MHV1) RNA, complete cds.
ACCESSION   M35253
KEYWORDS    N protein; RNA binding viral structural protein;
            nucleocapsid protein.
SOURCE      Mouse hepatitis virus (strain 1), cDNA to viral RNA.
  ORGANISM  Mouse hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 1670)
  AUTHORS   Parker,M.M. and Masters,P.S.
  TITLE     Sequence comparison of the N genes of five strains of the
            coronavirus mouse hepatitis virus suggests a three domain structure
            for the nucleocapsid protein
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.S.Masters, 19-JUN-1990.
FEATURES       from  to/span     description
    pept          1     1368     hepatitis virus nucleocapsid (N-MHV1) ORF 1
    pept         65      688     hepatitis virus nucleocapsid (N-MHV1) ORF 2
BASE COUNT      497 a    364 c    426 g    383 t
ORIGIN      
        1 atgtcttttg ttcctgggca agaaaatgcc ggtagcagaa gctcctctgt aaaccgcgct
       61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat
      121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caattccggg
      181 agtgtggttc cccattactc ttggttttcg ggcattaccc aatttcagaa gggaaaagag
      241 tttcagtttg cacaaggaca gggagtgcct attgccaacg gaatcccagc ttcagagcaa
      301 aagggatatt ggtatagaca caaccgacgg tcttttaaaa cacctgatgg ccagcagaag
      361 cagctactgc ccagatggta tttttactat cttggaacag ggccccatgc tggcgcagag
      421 tatggcgacg atatcgacgg agttgtctgg gtcgcaagcc aacaggccga cactaagacc
      481 actgccgata ttgttgaaag ggacccaagt agccatgagg ctattcctac taggtttgcg
      541 cccggtacgg tattgcctca aggtttttat gttgaaggct caggaaggtc tgcacctgct
      601 agtcgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagctcc
      661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt
      721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacaaa gcaaagcgcc
      781 aaagaagtca ggcagaaaat tttaaacaag cctcgtcaaa agaggactcc aaacaagcag
      841 tgccctgtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa
      901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggccccaaca
      961 cctagtgcct tcttctttgg atctaaatta gaattggtca aaaagaactc tggtggtgct
     1021 gatgacccca ccaaagatgt gtatgaattg cagtattcag gtgcaattag atttgatagt
     1081 actctcccag gatttgagac tatcatgaaa gtgttgaatg agaatttgga tgcctaccag
     1141 gatcaagctg gtggtgcaga tgtagtgagc ccaaagcccc aaagaaagag agggacaaaa
     1201 caaaaggctc tgaaaggtga agtagataat gtaagcgttg caaagcccaa aagctctgtg
     1261 cagcgaaatg taagtagaga attaacccct gaggatcgta gtctgttggc tcagatcctt
     1321 gatgatggcg ttgtgcctga tgggttagaa gatgactcta atgtgtaaag agaatgaatc
     1381 ctatgtcggc actcggtggt aacccctcgc gagaaagtcg ggataggaca ctctctatca
     1441 gaatggatgt cttgctgtca taacagatag agaaggttgt ggcagaccct gtatcaatta
     1501 gttgaaagag attgcaaaat agagaatgtg tgagagaagt tagcaaggtc ctacgtctaa
     1561 ccataagaac ggcgataggc gcccccctgg gaagagctca catcagggta ctattcctgc
     1621 aatgccctag taaatgaatg aagttgatca tggccaattg gaagaatcac
//
LOCUS       MHV3NP       1666 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Mouse hepatitis virus nucleocapsid (N-MHV3) RNA, complete cds.
ACCESSION   M35254
KEYWORDS    N protein; RNA binding viral structural protein;
            nucleocapsid protein.
SOURCE      Mouse hepatitis virus (strain 3), cDNA to viral RNA.
  ORGANISM  Mouse hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 1666)
  AUTHORS   Parker,M.M. and Masters,P.S.
  TITLE     Sequence comparison of the N genes of five strains of the
            coronavirus mouse hepatitis virus suggests a three domain structure
            for the nucleocapsid protein
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.S.Masters, 19-JUN-1990.
FEATURES       from  to/span     description
    pept          1     1365     hepatitis virus nucleocapsid (N-MHV3) ORF 1
    pept         65      688     hepatitis virus nucleocapsid (N-MHV3) ORF 2
BASE COUNT      494 a    358 c    432 g    382 t
ORIGIN      
        1 atgtcttttg ttcctgggca agaaaatgcc ggtggcagaa gctcctctgg aaaccgcgct
       61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat
      121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caactccggg
      181 agtgtggttc cccattactc ctggttttct ggcattaccc agttccaaaa gggaaaggag
      241 tttcagtttg cagaaggaca aggagtgcct attgccaatg gaatccccgc ttcagagcaa
      301 aagggatatt ggtatagaca caaccgccgt tcttttaaaa cacctgatgg gcagcagaag
      361 caattactgc ccagatggta tttttactat cttggcacag ggccccatgc tggagccagt
      421 tatggagaca gcattgaagg agtcttctgg gttgcaaaca gccaagcgga caccaatacc
      481 cgctctgata ttgtcgaaag ggacccaagc agtcatgagg ctattcctac taggtttgcg
      541 cccggcacgg tattgcctca gggcttttat gttgaaggct ctggaaggtc tgcacctgct
      601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagttcc
      661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt
      721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacgaa gcaaagtgcc
      781 aaagaagtca ggcagaaaat tttaaacaag cctcgccaaa agaggactcc aaacaagcag
      841 tgcccagtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa
      901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggctccaaca
      961 gttggtgcct tcttctttgg atctaaatta gaattggtca aaaagaattc tggtggtgct
     1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt
     1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag
     1141 aaggatggtg gtgcagatgt ggtgagccca aagccccaaa gaaaagggcg tagacaggct
     1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag
     1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatccttgat
     1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta
     1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa
     1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt
     1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca
     1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcctgcaatg
     1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcgc
//
LOCUS       MHVA59NP     1666 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Mouse hepatitis virus nucleocapsid (N-MHVA59) RNA, complete cds.
ACCESSION   M35256
KEYWORDS    N protein; RNA binding viral structural protein;
            nucleocapsid protein.
SOURCE      Mouse hepatitis virus (strain A59), cDNA to viral RNA.
  ORGANISM  Mouse hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 1666)
  AUTHORS   Parker,M.M. and Masters,P.S.
  TITLE     Sequence comparison of the N genes of five strains of the
            coronavirus mouse hepatitis virus suggests a three domain structure
            for the nucleocapsid protein
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.S.Masters, 19-JUN-1990.
FEATURES       from  to/span     description
    pept          1     1365     hepatitis virus nucleocapsid (N-MHVA59) ORF 1
    pept         65      688     hepatitis virus nucleocapsid (N-MHVA59) ORF 2
BASE COUNT      497 a    355 c    433 g    381 t
ORIGIN      
        1 atgtcttttg ttcctgggca agaaaatgcc ggtagcagaa gctcctctgg aagccgctct
       61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgcgc tggaaataat
      121 ggaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caattccggg
      181 agtgtggttc cccattactc ttggttttcg ggcattaccc aattccagaa gggaaaagag
      241 tttcagtttg tacaaggaca gggagtgcct attgccaatg gaatcccagc ttcagagcaa
      301 aagggatatt ggtatagaca caaccgacgt tcttttaaaa cacctgatgg ccagcagaag
      361 cagctactgc ccagatggta tttttactat ctcggaacag ggccccatgc tggcgcagag
      421 tatggcgacg atatcgaagg agttgtctgg gtcgcaagcc aacaggccga cactaagacc
      481 actgccgata ttgttgaaag ggacccaagt agccatgagg ctattcctac taggtttgcg
      541 cccggtacgg ttttgcctca gggtttttat gttgaaggct caggaaggtc tgcacctgct
      601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagctcc
      661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt
      721 gttttggcta agctcggtaa agatgccggt cagcccaagc aagtaacaaa gcaaagtgcc
      781 aaagaagtca ggcagaaaat tttaaacaag cctcgtcaaa agaggactcc aaacaagcag
      841 tgcccagtgc agcaatgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa
      901 atgcttaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggccccaaca
      961 gctggtgcct tcttctttgg atctaaatta gaattggtca aaaagaactc tggtggtgct
     1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt
     1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag
     1141 aaggatggtg gtgcagatgt agtgagccca aagccccaaa gaaaagggcg tagacaggct
     1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag
     1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatcctagat
     1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta
     1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa
     1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt
     1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca
     1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcttgcaatg
     1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcac
//
LOCUS       MHVSHV       1666 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Mouse hepatitis virus nucleocapsid (N-MHVS) RNA, complete cds.
ACCESSION   M35255
KEYWORDS    N protein; RNA binding viral structural protein;
            nucleocapsid protein.
SOURCE      Mouse hepatitis virus (strain S), cDNA to viral RNA.
  ORGANISM  Mouse hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 1666)
  AUTHORS   Parker,M.M. and Masters,P.S.
  TITLE     Sequence comparison of the N genes of five strains of the
            coronavirus mouse hepatitis virus suggests a three domain structure
            for the nucleocapsid protein
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.S.Masters, 19-JUN-1990.
FEATURES       from  to/span     description
    pept          1     1365     hepatitis virus nucleocapsid (N-MHVS) ORF 1
    pept         65      688     hepatitis virus nucleocapsid (N-MHVS) ORF 2
BASE COUNT      494 a    357 c    430 g    385 t
ORIGIN      
        1 atgtcttttg ttcctgggca agaaaatgcc ggtggcagaa gctcctctgt aaaccgcgct
       61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat
      121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caactccggg
      181 agtgtggttc cccattactc ctggttttct ggcattaccc agttccaaaa gggaaaggag
      241 tttcagtttg cagaaggaca aggagtgcct attgccaatg gaatccccgc ttcagagcaa
      301 aagggatatt ggtatagaca caaccgccgt tcttttaaaa cacctgatgg gcagcagaag
      361 caattactgc ccagatggta tttttactat cttggcacag ggccccatgc tggagccagt
      421 tatggagaca gcattgaagg tgtcttctgg gttgcaaaca gccaagcgga caccaatacc
      481 cgctctgata ttgtcgaaag ggacccaagc agtcatgagg ctattcctac taggtttgcg
      541 cccggcacgg tattgcctca gggcttttat gttgaaggct ctggaaggtc tgcacctgct
      601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagttcc
      661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt
      721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacgaa gcaaagtgcc
      781 aaagaagtca ggcagaaaat tttaaacaag cctcgccaaa agaggactcc aaacaagcag
      841 tgcccagtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa
      901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggctccaaca
      961 gttggtgcct tcttctttgg atctaaatta gaattggtca aaaagaattc tggtggtgct
     1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt
     1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag
     1141 aaggatggtg gtgcagatgt ggtgagccca aagccccaaa gaaaagggcg tagacaggct
     1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag
     1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatccttgat
     1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta
     1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa
     1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt
     1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca
     1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcttgcaatg
     1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcac
//
LOCUS       MXAFRZGF     2999 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  M.xanthus frzG and frzF genes, complete cds.
ACCESSION   M35200
KEYWORDS    FrzF protein; FrzG protein; methylesterase; methyltransferase.
SOURCE      M.xanthus (strain DZF1) DNA.
  ORGANISM  Myxococcus xanthus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; 
            Myxococcaceae.
REFERENCE   1  (bases 1 to 2999)
  AUTHORS   McCleary,W.R., McBride,M.J. and Zusman,D.R.
  TITLE     Developmental sensory transduction in Myxococcus xanthus involves
            methylation and demethylation of FrzCD
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.R.McCleary, 15-JUN-1990.
FEATURES       from  to/span     description
    pept        120     1124     FrzG protein
    pept    /  1117     2899     FrzF protein (AA at 2)
BASE COUNT      383 a    990 c   1134 g    492 t
ORIGIN      
        1 ggatgccggc gcggacgcgt acctcgtcaa gggcgagctg ggcgtggagg ttctcgcgca
       61 ggccatcgac cggctgacct gaggagccag gcttgggcgg tggcgcggta gtcgcaggaa
      121 tggcgtttcg ggtgctcatg gtgggcaagg ggctgcgtgc gctcgcggcc cggggcctgt
      181 tcgatgggga atccctggtg cccgtggggc cggcggaggt ggacttcgcc ggcgccctgg
      241 tggccgtgca gcggcacttc ccggacgtgg tgctggtgga cctgagcgcg ctggacgcgc
      301 tgcccgccat cgagcacgtc atggtggagc ggcccgtgcc ggtgctggcg ttgcaccccg
      361 gcgtgttgtc cggccaggag gccttccagg cgatggtggc gggcgcgctg gacgtgctgg
      421 agcgtccggc gaaccccggg cccgagttct ggacgcacgt gtcgcgcaag ctggtgctgc
      481 tggcgcaggt gaaggcggtg cggcaggtgc agacgcggcc gccaccgcaa gcggcgcgtg
      541 aggcgaagcc gcctcctccg tatccgctgg tggccatcgc cgcgtccctg ggtggcccca
      601 aggcggtggc gcaggtgctg cggatgattc cgcgcgcctt cccggcgccc atcgcctact
      661 gccagcacat cagcgacggt ttcacggaag ggctggcgca ctggttgtcc aatgaaacgg
      721 cgctgcgcgt gctggaggcc gagcatgacg tgctcatggc gccgggcacg gtgtacatcg
      781 ctccgtcggg cagtcacctc ttggtccgac ccgagggcag gttggagctg gacgcgggcc
      841 ccgcgcttcg cggtttccgg ccgtcctgtg acatgctgct gacttcagcg ggtgagtcgt
      901 tcggcccgcg ctgcatcggg gtcatcctga cgggcatggg gcgcgacggg gcgcgagggt
      961 tgaaggagat tcgagagcgc ggcggtcgga ccattgccca ggacgaagcg tcgagcgtcg
     1021 tctggggcat gccgcgcgag gcggtgttga tgggcgcggc gcacgaggtg ctgccactga
     1081 gccggattgg cgcggcgctg atgcagtggg tggatgtgtg ttgacggcga gccagaaagt
     1141 cttgcaacaa ctcgcggcgc tgctgctgga gcgcgcgggg ctgaaaatca cgccggatgg
     1201 cttccacagc ctccgactgg cgctgtccgc gcggatgccc gtgctggggc tggaagagcc
     1261 cgagcactac atccagcgac tgacgggcgc cggtggcgaa gaggagctgc gctcgctgtt
     1321 gccgctggtg acggtggggc acacggagtt cttccgcgac gcgaagcagt tccgcgcgct
     1381 ggagaagagc gtgctgccgg acctggtgtc ccgttcgcgg cgcgagatgc gcaaggtgtc
     1441 catctggtcc gcgggctgcg cgacggggga ggagccctac agcctggcca tggtgctggc
     1501 ggagctgggc gcgctgtcgc tggaggtgga cctgtgggcc accgacctca acctggccgc
     1561 ggtggaggcc gcgaagcagg ggcgcttcac ctcgcggcgg gccatcagca tcaaccaggc
     1621 gcggctgacg cgcttcttca agcccgtgga agagggctat gaggcgctgc ccgcgctgcg
     1681 tgagtacatc cgcttcgatg gacagaacct ggcggttccc gtcttcgaca aggtggccct
     1741 gtcgtcgctg gacctcatcc tctgccgcaa cgtcatcatc tacttcgacc tgcccaccat
     1801 ccgcgggttg atggaccgct tcctcgccgc gctgcggccg ggcgggctgt tgttcctggg
     1861 gtactcggag agcctcttca aggtctacga ccgcttcgag atgatcgaag tcgatggggc
     1921 gttcgtgtac cgccgcccgc tgaacgacaa gagcatgcgg gcgccgccgc tgcgcatcac
     1981 cccgtatcct ggcgagcccg atgtcgccgc gcgcaggccc gtgcctgcgg acgcgttcac
     2041 cgcggacctg cgcaagcgga tgctgcccga ggacgtcccg ttgacgacgc ggctgcccgc
     2101 ggtgtcagcg tcgtcggtgg cggcgcctgg ctcgcccagc gtgacgctgc cggcgctggg
     2161 ggcctcttcg agtccgcgtt ccgtggtgcc ggggcggctg cccgcggtgt cgcctcactc
     2221 gccgctgccg gccatcgccg cgcgctcgcg tgtcaccgcg gagttgccca cggtgggaag
     2281 cgtggactcc gcccgtccgc gcatcaccac cgagctgccg gccgtggcca ccacgccgcg
     2341 cgcgcccacc gtggaggtgc ccgcctggcc cacgctgctg cctccggcgg agcggctggc
     2401 catggcggtg cggaagatgg cgcaggggga tttctcggcg gccatcgctg gcgtgcagcg
     2461 gctgctcgcg gacgagccca gtgacttgga tgggctgctg acgctgggca acctgttctc
     2521 gctcaccggc cgcatccccg aggcgcgcga ggccttcgcg caggccattc agcgcgagcc
     2581 gctgtgcgtg gaggcgcggg tgttcggcgg ggtcgccgcg ttgcaagcag gggagttgag
     2641 cgaggcgcgc tccgagctga gcaaggccct gttcctggag cccacgctgg ccattggcca
     2701 ctacctgctg gcgcaggtgc acgagcgcac gcaggaccat gaggcggccc gccgcagcta
     2761 ccgcaacgcc attgcccagc ttcgcttccc gcagcgtccc ctcgcggggc actacccgga
     2821 gatgccggac tcggcggatg ccatctctcg cgcggcgcgt tacgccctgg ccgcgctgga
     2881 ggagcagccc ctgcgctgag gcaggggccg cgtcccaggc ttcacgtcag tccaggctgc
     2941 tcttcacctg gtccaggctc ttgctcgggt cgagcacgga gccgaacttc ttctgcagg
//
LOCUS       ECOSFIM       762 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  E.coli S-fimbrial protein (sfaA) gene, complete cds.
ACCESSION   M35273
KEYWORDS    S-fimbrial protein.
SOURCE      E.coli (strain 536) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 762)
  AUTHORS   Schmoll,T., Hacker,J. and Goebel,W.
  TITLE     Nucleotide sequence of the sfaA gene coding for the S-fimbrial
            protein subunit of Escherichia coli
  JOURNAL   FEMS Microbiol. Lett. 41, 229-235 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        166      708     S-fimbrial protein precursor
    sigp        166      237     S-fimbrial protein signal peptide
    matp        238      705     S-fimbrial protein
BASE COUNT      218 a    140 c    172 g    232 t
ORIGIN      
        1 gaaaatatta tcggagataa tgtcataaat gctgcctgag tgtatttctc acattgcatt
       61 tatgaagttc tcctgaaaaa agattcccgt cgttcgggat attgattgtg tctgttgtga
      121 tgacagatac ggtgtgcgta gttcaattaa aaacaggaat taaatatgaa gttaaaattc
      181 atctccatgg ctgtattttc agccctgacc ttgggtgttg cgacaaatgc gtctgctgtc
      241 accacggtta atggtggtac agttcatttt aagggggaag ttgttgatgc tgcatgtgct
      301 gtaaacacta attcagcaaa tcaaacgttt tctgggcaag ttcgttcagc taagttggcg
      361 aatgatggag agaagagttc ccctgttgga tttagtattg aacttaatga ctgtagttct
      421 gcaactgccg ggcatgcatc aattatcttt gcaggaaatg ttattgctac acacaatgat
      481 gtgctgtctc tacagaatag tgctgcaggt agtgcaacaa atgtaggtat tcagatattg
      541 gatcatacag gtactgcagt tcaatttgac ggagtgactg catctacaca atttacatta
      601 acagatggca ccaataaaat tcctttccag gcagtttatt atgcaacagg taagtcaacg
      661 cctggtattg ccaacgccga cgccaccttt aaagttcagt accagtaata tcagaacagt
      721 gtaacgatat atacccggcc aggagggctg tttttatcat gc
//
LOCUS       ECOSRNB       655 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  F plasmid (from E.coli) stable RNA degradation promoter (srnB)
            gene, complete cds.
ACCESSION   M35279
KEYWORDS    .
SOURCE      F plasmid (from E.coli) DNA.
  ORGANISM  Plasmid F
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 655)
  AUTHORS   Akimoto,S., Ono,K., Ono,T. and Ohnishi,Y.
  TITLE     Nucleotide sequence of the F plasmid gene srnB that promotes
            degradation of stable RNA in Escherichia coli
  JOURNAL   FEMS Microbiol. Lett. 33, 241-245 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        251      457     stable RNA degradation promoter (srnB)
    signal      127      132     -35 region
    signal      150      155     -10 region
    signal      466      485     transcription termination signal (put.)
    binding     242      245     ribosome binding site
BASE COUNT      163 a    156 c    173 g    163 t
ORIGIN      
        1 aattcccatt ctggaccagc gggagcatac gaacaataat ttacggtttc gcgctatagc
       61 tggctcaagt taggttggac cctgaatctc cagacaacca atatctgatc gcgccagtgg
      121 tggcagttat taagcaacag ggaatgtggt attatcgcgg cgggtgtctg agcctttctg
      181 gttcaggcaa gacgcaggta ccagaaatgc gaagacccca cttgttaatc cattaactcg
      241 tgaggtctgc atgaagtacc ttaacactac tgattgtagc ctcttccttg cagagaggtc
      301 aaagtttatg acgaaatatg cccttatcgg gttgctcgcc gtgtgcgcta cggtgttgtg
      361 tttttcactg atattcaggg aacggttatg tgagctgaat attcacaggg gaaatacagt
      421 ggtgcaggta actctggcct acgaagcacg gaagtaagct gccgggcggg gacggaagtc
      481 cccgctttcc ggaagtgtga ggtatttcag gggcagacac ccgacatgcc agaaacagcc
      541 ggtcccgccc ggggccggca cccaggttca ggcatttcct gcttttcagt catttcatta
      601 tcaaaatcac attaaacggt cgtaatcaga catgatttgt gcgccaacac agatc
//
LOCUS       HUMTIMP2     1062 bp ss-mRNA            PRI       26-JUL-1990
DEFINITION  Human metalloproteinase-2 inhibitor (TIMP-2) mRNA, complete cds.
ACCESSION   J05593
KEYWORDS    metalloproteinase-2 inhibitor.
SOURCE      Human melanoma cell line A2058, cDNA to mRNA, clone pT2-M01.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1062)
  AUTHORS   Stetler-Stevenson,W.G., Brown,P.D., Onisto,M., Levy,A.T. and
            Liotta,L.A.
  TITLE     Tissue inhibitor of metalloproteinases-2 (TIMP-2) mRNA expression
            in tumor cell lines and human tumor tissues
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by W.G.Stetler-Stevenson, 19-JUN-1990.
FEATURES       from  to/span     description
    pept        271      933     metalloproteinase-2 inhibitor precursor
    sigp        271      348     metalloproteinase-2 inhibitor signal peptide
    matp        349      930     metalloproteinase-2 inhibitor
BASE COUNT      222 a    370 c    316 g    154 t
ORIGIN      
        1 ggggccgccg agagccgcag cgccgctcgc ccgccgcccc ccaccccgcc gccccgcccg
       61 gcgaattgcg ccccgcgccc tcccctcgcg cccccgagac aaagaggaga gaaagtttgc
      121 gcggccgagc gggcaggtga ggagggtgag ccgcgcggag gggcccgcct cggccccggc
      181 tcagcccccg cccgcgcccc cagcccgccg ccgcgagcag cgcccggacc ccccagcggc
      241 ggccccgccc gcccagcccc ccggcccgcc atgggcgccg cggcccgcac cctgcggctg
      301 gcgctcggcc tcctgctgct ggcgacgctg cttcgcccgg ccgacgcctg cagctgctcc
      361 ccggtgcacc cgcaacaggc gttttgcaat gcagatgtag tgatcagggc caaagcggtc
      421 agtgagaagg aagtggactc tggaaacgac atttatggca accctatcaa gaggatccag
      481 tatgagatca agcagataaa gatgttcaaa gggcctgaga aggatataga gtttatctac
      541 acggccccct cctcggcagt gtgtggggtc tcgctggacg ttggaggaaa gaaggaatat
      601 ctcattgcag gaaaggccga gggggacggc aagatgcaca tcaccctctg tgacttcatc
      661 gtgccctggg acaccctgag caccacccag aagaagagcc tgaaccacag gtaccagatg
      721 ggctgcgagt gcaagatcac gcgctgcccc atgatcccgt gctacatctc ctccccggac
      781 gagtgcctct ggatggactg ggtcacagag aagaacatca acgggcacca ggccaagttc
      841 ttcgcctgca tcaagagaag tgacggctcc tgtgcgtggt accgcggcgc ggcgcccccc
      901 aagcaggagt ttctcgacat cgaggaccca taagcaggcc tccaacgccc ctgtggccaa
      961 ctgcaaaaaa agcctccaag ggtttcgact ggtccagctc tgacatccct tcctggaaac
     1021 agcatgaata aaacactcat cccatgggtc caaattaata tg
//
LOCUS       ALREV1        717 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Rous sarcoma defective endogenous virus ev-1 locus gag polyprotein
            RNA, 5' end.
ACCESSION   M30517
KEYWORDS    gag polyprotein.
SOURCE      Rous sarcoma defective endogenous virus (strain Prague C), cDNA to
            viral RNA, clone pGD27.
  ORGANISM  Rous sarcoma virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Avian sarcoma viruses.
REFERENCE   1  (bases 1 to 717)
  AUTHORS   Vogt,V.M., Pepinsky,R.B. and Southard,L.E.
  TITLE     Primary structure of p19 species of avian sarcoma and leukemia
            viruses
  JOURNAL   J. Virol. 56, 31-39 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept          1  >   717     gag polyprotein
    matp          1      465     p19 protein
    matp        466      531     p2 protein
    matp        532      717     pp10 protein
BASE COUNT      161 a    173 c    246 g    137 t
ORIGIN      
        1 atggaagccg tcataaaggt gatttcgtcc gcgtgtaaaa cctattgcgg gaaaacctct
       61 ccttctaaga aggaaatagg ggccatgttg tccctgttac aaaaggaagg gttgcttatg
      121 tctccctcag acttatattc cccggggtcc tgggatccca ttaccgcggc gctctcccag
      181 cgggcaatgg tacttgggaa atcgggagag ttaaaaacct ggggattggt tttgggggca
      241 ttgaaggcgg ctcgagagga acaggttaca tctgagcaag caaagttttg gttgggatta
      301 gggggaggga gggtctctcc cccaggtccg gagtgcatcg agaaaccagc aacggagcgg
      361 cgaatcgaca aaggggagga agtgggagaa acaactgcgc agcgagatgc gaagatggcg
      421 ccggagaaaa tggccacacc taaaaccgtt ggcacatcct gctatcagtg cggaacagct
      481 actggctgta attgcgccac agcctcggcc cctcctcctc cttatgtggg gagtggtttg
      541 tatccttccc tggcgggggt gggagagcag cagggccagg ggggtgacac accttggggg
      601 gcggaacagc caagggcgga gccagggcac gcgggtctgg cccctgggcc ggccctgact
      661 gactgggcaa ggatcaggga ggagcttgcg agtactggtc cgcccgtggt ggccatg
//
LOCUS       ALREV2        564 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Rous sarcoma endogenous virus ev-2 locus gag polyprotein RNA,
            partial cds.
ACCESSION   M30518
KEYWORDS    gag polyprotein.
SOURCE      Rous sarcoma endogenous virus (strain Prague C), cDNA to viral RNA,
            clone pAS2.
  ORGANISM  Rous sarcoma virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Avian sarcoma viruses.
REFERENCE   1  (bases 1 to 564)
  AUTHORS   Vogt,V.M., Pepinsky,R.B. and Southard,L.E.
  TITLE     Primary structure of p19 species of avian sarcoma and leukemia
            viruses
  JOURNAL   J. Virol. 56, 31-39 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    <     1  >   564     gag polyprotein (AA at 1)
    matp    <     1      312     p19 protein
    matp        313      378     p2 protein
    matp        379      564     pp10 protein
BASE COUNT      123 a    138 c    207 g     96 t
ORIGIN      
        1 gatcccatta ccgcggcgct ctcccagcgg gcaatggtac ttgggaaatc gggagagtta
       61 aaaacctggg gattggtttt gggggcattg aaggcggctc gagaggaaca ggttacatct
      121 gagcaagcaa agttttggtt gggattaggg ggagggaggg tctctccccc aggtccggag
      181 tgcatcgaga aaccagcaac ggagcggcga atcgacaaag gggaggaagt gggagaaaca
      241 actgtgcagc gagatgcgaa gatggcgccg gaggaaacgg ccacacctaa aaccgttggc
      301 acatcctgct atcattgcgg aacagctatt ggctgtaatt gcgccacagc ctcggcccct
      361 cctcctcctt atgtggggag tggtttgtat ccttccctgg cgggggtggg agagcagcag
      421 ggccaggggg gtgacacacc tcggggggcg gaacagccaa gggcggagcc agggcacgcg
      481 ggtctggccc ctgggccggc cctgactgac tgggcaagga tcagggagga gcttgcgagt
      541 acaggtccgc ccgtggtggc catg
//
LOCUS       HAMCHO1      1953 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  C.griseus intracisternal A-particle retrovirus like sequences.
ACCESSION   M34949
KEYWORDS    p27 protein; pseudogene.
SOURCE      C.griseus adult ovary, cDNA to mRNA, clone CHIAP.SW2.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 1953)
  AUTHORS   Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H.,
            Nguyen,T.P. and Wurm,F.M.
  TITLE     Presence and transcription of intracisternal A-particle-related
            sequences in CHO cells
  JOURNAL   J. Virol. 64, 2021-2032 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps       2      277     IAP p27 homologue
                277      564     IAP p27 homologue
    mRNA    <     1     1953     p27 (pot.) mRNA
BASE COUNT      485 a    473 c    509 g    486 t
ORIGIN      
        1 ctttactctt acacaattgg atagacttgc cctaaatgcc ttgacgccat ctgactggca
       61 gatggtcaca aaagctgcgc ttgtcagcat gggccaatac atggagtgga aagcactctg
      121 gcatgaggcc gcccaagagc aggccagagc taacgcgacg gccttaactc ctgagcaaca
      181 actatggaca ttcgacctgt taacgggcca gggtcgtttt gcagctgatc aaacaaatta
      241 tcattggggc gcttatccac aaatcgacaa cgcggcatta gggcctgaaa ggtgctctcc
      301 aagaaaggag gggttgacaa tcagcttact aaaatcattc aaggaaccca ggagactttc
      361 tccgattttg tagcaaggat gacagaggca gcgggatgga tctttggcga tcctgagcag
      421 gccgcacctc ttgttgagca acttatcttt gaacaggcct cccaagaatg tcgcgcagct
      481 atagccccga gaaaaaacaa aggattacaa gattggctta gggtctgtag agaacttggg
      541 ggacccctta ctaatgcagg gttagctact gccatcctac agtctcaaaa gcgccccctt
      601 aaggggccag ataaaagaac ttgctttaga tgtggaacaa ttggacatat tatggcagat
      661 ggcccaacta ggctgtgagc agaagctccc cggcctatat gtcacctcca tccaatatga
      721 aaattttacc aaagcagcta atttgtctaa aagcctttct cagttcatgt tacagaattg
      781 gacctccaaa tttgagcaaa cgcttcggga gttgagagcc gctattatcc agattaactc
      841 cacgcgcctt gacctgtcct tgacggaggg attgtcatca tggatcgctt cgactgtctc
      901 ctattttaag gaatgggtgg gggtgggatt gtttggtgca gccgtttgct gcggattggt
      961 gttgcttcta tggctggtct gtaggctcag ggctcaaact aagagagaca aggtggttat
     1021 cgcccaagcg cttgtagctt tggaacaagg ggcttccact gacatttggt taacaatact
     1081 taagcaatag gcgctggcca gacagctctt gcacacccgg agcctaggct cattgcacag
     1141 ggtagagtgt ctggcttgag cagcccatga gggaatgtgg agcaaggcat cgcacagaag
     1201 agttgcccag tatgcaggct tctctgggag gcatgttgtc ctgcataagg gttgcctgcc
     1261 ctagtctccc tttcccagaa aacggcagag gacaggtcga gagcgcttcg ggtcaagcta
     1321 acagcctaat ggcgactctc gtacacagtc ttaatgtttg attgggaagg tacaacctct
     1381 gcctctatcc ctcaacatat gggtgaccta tttgcttgta aaaatatgta agccttatca
     1441 ttaattaata aaaaagggga gatgtaggga gccgtccctg cattctctat tacaagatgg
     1501 cgcctgcatc cggcaggcac cgaatggtaa acaagttaat gcgcaggtgc tgggtaactt
     1561 tccatccctt ggtctctgcc tctcccgtgg cgtcatatgg tccgatgagc tgcagccagt
     1621 cagggggtga cacgtccgag gcggtggttg ccagcctata taagggatgg gtttttggga
     1681 gttcggggtc tctgctctgt aagcttatgc tctccctctc aagatgcatt aaagctttac
     1741 tacagaagga tcctgaatgt cctgcgtcat tcttgctggc gagacggtag cgcgggacag
     1801 atggtgacag ccggtgcaga aagtgtcaac ctcagcttcc ttctccagga agacttcagc
     1861 ctgggactgc tcctctacag agccccctac caagattatc taacctgcct gccttcttgt
     1921 tgagctgtgt gtaataaact cattgagttt ccc
//
LOCUS       HAMCHO2      1570 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  C.griseus intracisternal A-particle retrovirus like sequences.
ACCESSION   M34950
KEYWORDS    p27 protein; pseudogene.
SOURCE      C.griseus adult ovary, cDNA to mRNA, clone CHIAP.LY6.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 1570)
  AUTHORS   Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H.,
            Nguyen,T.P. and Wurm,F.M.
  TITLE     Presence and transcription of intracisternal A-particle-related
            sequences in CHO cells
  JOURNAL   J. Virol. 64, 2021-2032 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept.ps      26      694     IAP p27 homologue
BASE COUNT      418 a    368 c    406 g    378 t
ORIGIN      
        1 aaaaagaaag ctgggcctcg cttttcccat ctttgagggc attgagggag agtgtatgca
       61 tgcacccatg gagtataatc agataaaaga attggcagaa tcagtcagga aatatggagt
      121 cacagccaac tttactctta cacaattgga tagacttgcc ctaaatgcct tgacgccatc
      181 tgactggcag atggtcacaa aagctgcgct tgtcagcatg ggccaataca tggagtggaa
      241 agcactctgg catgaggccg cccaagagca ggccagagct aacgcgacgg ccttaactcc
      301 tgagcaacaa ctatggacat tcgacctgtt aacgggccag ggtcgttttg cagctgatca
      361 aacaaattat cattggggcg cttatccaca aatcgacaac gcggccatta gggcctgaaa
      421 ggtgctctcc aagaaaggag gggttgacaa tcagcttact aaaatcattc aaggaaccca
      481 ggagactttc tccgattttg tagcaaggat gacagaggca gcgggatgga tctttggcga
      541 tcctgagcag gccgcacctc ttgttgagca acttatcttt gaacaggcct cccaagaatg
      601 tcgcgcagct atagccccga gaaaaaacaa aggattacaa gattggctta gggtctgtag
      661 agaacttggg ggacccctta ctaatgcagg gttagctact gccatcctac agtctcaaaa
      721 gcgccccctt aaggggccag ataaaagaac ttgctttaga tgtggaacaa ttggacatat
      781 tatggcagat ggcccaacta ggctgtgagc agaagctccc cggcctatat gtcacctcca
      841 tccaatatga aaattttacc aaagcagcta atttgtctaa aagcctttct cagttcatgt
      901 tacagaattg gacctccaaa tttgagcaaa cgcttcggga gttgagagcc gctattatcc
      961 agattaactc cacgcgcctt gacctgtcct tgacggaggg attgtcatca tggatcgctt
     1021 cagctgtctc ctattttaag gaatgggtgg gggtgggatt gtttggtgca gccgtttgct
     1081 gcggattggt gttgcttcta tggctggtct gtaggctcag ggctcaaact aagagagaca
     1141 aggtggttat cgcccaagcg cttgtagctt tggaacaagg ggcttccact gacatttggt
     1201 taacaatact taagcaatag gccgctggcc agacagctct tgcacacccg gagcctaggc
     1261 tcattgcaca gggtagagtg tctggcttga gcagcccatg agggatgtgg agcaaggcat
     1321 cgcacagaag agttgcccag tatgcaggct tctctgggag gcatgttgtc ctgcataagg
     1381 gttgcctgcc ctagtctccc tttcccagaa aaacggcaga ggacaggtcg agagcgcttc
     1441 gggtcaagct aacagcctaa tggcgactct cgtacacagt cttaatgttt gattgggaag
     1501 gtacaacctc tgcctctatc cctcaacata tgggtgacct atttgcttgt aaaaatatga
     1561 agccttatca
//
LOCUS       HAMCHO3      2186 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  C.griseus intracisternal A-particle retrovirus like sequences.
ACCESSION   M34951
KEYWORDS    protease; pseudogene.
SOURCE      C.griseus adult ovary, cDNA to mRNA, clone CHIAP.YL[7,9].
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 2186)
  AUTHORS   Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H.,
            Nguyen,T.P. and Wurm,F.M.
  TITLE     Presence and transcription of intracisternal A-particle-related
            sequences in CHO cells
  JOURNAL   J. Virol. 64, 2021-2032 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps     553     1281     IAP protease homologue
BASE COUNT      649 a    431 c    518 g    588 t
ORIGIN      
        1 gcaataactc catataaggg taaaggcctt gaagtctgga tgaaagtctg tagggagtta
       61 gggggtccgc tgactaatgc tggactagca gctgctgtgt tgcaattaac taagaaaggt
      121 ggaggttcag gagcttgctt taaatgcggc aagcaagggc atttgaaaaa gcaatgcccc
      181 gagggaggaa acactaaagt caataaactt tgctccgcgc cctaagcaac ctggcttatg
      241 tcctagatgt agaaaaggaa atcattgggc taaggattgt agatcagtaa aagacatcag
      301 tggacagcct cttgttcagg ggtatggagg agcccgttca aaaaacggac gacggggccc
      361 acgaccccag ggcccacaaa tatatggggc catggaggat cagaaccagg agcagagtcc
      421 cgaaacctgg ccctctcttc gtcatccgag ggaccgagga gagccactac aggctccgcg
      481 gggctggact tacgctccac caccagactc gtattaactc ccagaatggg ggtccagctt
      541 gttgacaccg attttaaggg accccttgag cctggcacag taggtttgct tataggaaga
      601 tcatctgcag cattgaaagg tttacgagta catcctggag ttatagatcc tgattacatg
      661 ggtgtagtaa agatcatggt agaatctcct agagggatta cggccatttc tcctggagac
      721 aggatagcac agttactgct tttgccaagc ttgcatgaca agtttccagc acaagccaga
      781 gagagaggag agggaaactt tggctccact ggatcaaact taactttcct agctttagac
      841 cttgatcaac gtccaaccct tgagttaata gtgaatggta agaaaatctt aggcttacta
      901 gattctggag ctgataagag catcatagcc actaaagatt ggccctctgg ctggcctata
      961 caggtttctt ctcaaagttt acaaggttta ggctatgcta aggctcctga tatgagtgct
     1021 agacaattgc cttggaaaga tcaggaaggg cattcaggga ccatgcaacc ttatgtgtta
     1081 gacttaccaa tttcattatg gggaagagat ttgttaaagg atatgggttt taaactcaca
     1141 aatgaatact cagaaacatc tcaaggtatc atgaaacgaa tgggatacag tcccaggcca
     1201 ggcctcggga aacatctgca gggtcgtacc agtcctatta attccacaat tgagaccaaa
     1261 gaatctaggt ctgggttttt cctagggcca ctgaggaggt attcctatta cctggaaaac
     1321 agaggagccg gtatgggttc ctcagtggcc actttcctct gagaaactgg aagctgctaa
     1381 gactctagtg cgggagcagc tggatctggg gcatataaaa tcctctgtat ctccatggaa
     1441 tactcctatt tttgtcatta agaaaaaatc tggtaaatgg agactgcttc acgatcttag
     1501 agctattaat caacagatgc aaattatggg ccctgtacaa cgtggtcttc cacttttaac
     1561 ttctttacct gcatcatggc ctatcatctc tatagatatt aaagattgct tcttttccat
     1621 acctttgtgt gccaaggatt cagggcgttt tgcgtttacg ctgccctctt gtaatcatga
     1681 acaacctgat ttaaggtatg aatgggatag tgttggccac aggggatggc caatagtcct
     1741 actatgtgtc agttgtttgt agcagaagca attgctcctt ttgagagtgg actttcccaa
     1801 agattagatg tgttcattat atggatgata ttttattggc tgccaaagat gataaaacgc
     1861 ttaataaggc atatacaaaa ttggtaaaat tgcttgagat gcataattta gtcatagcct
     1921 cagaaaaggt acaaaaggac actgttgtta actatctagg ggctaagatt ctccctcata
     1981 caattattcc acaaaagata gagattagaa aagataattt aaaaactctt aatgattttc
     2041 aaaagttgtt gggagatata aattggataa gatgttattt aaaattacca aattatgagt
     2101 tgaagccatt gtataatatt ctcaatggtg attcagcatt agattcacct aggcagttaa
     2161 ctgctgaagc cagagaagct ttaaag
//
LOCUS       HUMCHRM      2098 bp ds-DNA             PRI       26-JUL-1990
DEFINITION  Human muscarinic acetylcholine receptor gene, complete cds.
ACCESSION   M35128 Y00508
KEYWORDS    muscarinic acetylcholine receptor.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2098)
  AUTHORS   Allard,W.J., Sigal,I.S. and Dixon,R.A.F.
  TITLE     Sequence of the gene encoding the human M1 muscarinic acetylcholine
            receptor
  JOURNAL   Nucleic Acids Res. 15, 10604-10604 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        449     1831     muscarinic acetylcholine receptor
BASE COUNT      458 a    662 c    570 g    408 t
ORIGIN      
        1 agtatagctt ataagtggat gaatgcttga gaagttgcag attatacaaa gtagttccca
       61 actcctgcaa cccagtatgt aagatagaat tgtagttaat ttcccagtaa gaaaatgagc
      121 ctgagtctga aaggtaaaac tgaatgaagt attcaaaccc tggatcccaa agccactcca
      181 cgctgctggc aaatccactt atggctggga aagtgccact gcataaatga ccatgagtgg
      241 gcaccggtaa gggagggtga tgctatctgg tctgaagctc tgaagggcaa gaattacatc
      301 ccatgcatct tccaataagg tctatcagaa atgtccagtg gcccaaccaa agcccatgtc
      361 ctctctttta ggtgatgact ttcccctgag gaagccctgt agcgtgcctg gaggaagggg
      421 tctccaaccc cagccccacc tagccaccat gaacacttca gccccacctg ctgtcagccc
      481 caacatcacc gtcctggcac caggaaaggg gccctggcaa gtggccttca ttgggatcac
      541 cacgggcctc ctgtcgctag ccacagtgac aggcaacctg ctggtactca tctccttcaa
      601 ggtcaacacg gagctcaaga cagtcaataa ctacttcctg ctgagcctgg cctgtgctga
      661 cctcatcatc ggtaccttct ccatgaacct ctataccacg tacctgctca tgggccactg
      721 ggctctgggc acgctggctt gtgacctctg gctggccctg gactatgtgg ccagcaatgc
      781 ctccgtcatg aatctgctgc tcatcagctt tgaccgctac ttctccgtga ctcggcccct
      841 gagctaccgt gccaagcgca caccccgccg ggcagctctg atgatcggcc tggcctggct
      901 ggtttccttt gtgctctggg ccccagccat cctcttctgg cagtacctgg taggggagcg
      961 gacagtgcta gctgggcagt gctacatcca gttcctctcc cagcccatca tcacctttgg
     1021 cacagccatg gctgccttct acctccctgt cacagtcatg tgcacgctct actggcgcat
     1081 ctaccgggag acagagaacc gagcacggga gctggcagcc cttcagggct ccgagacgcc
     1141 aggcaaaggg ggtggcagca gcagcagctc agagaggtct cagccagggg ctgagggctc
     1201 accagagact cctccaggcc gctgctgccg ctgctgccgg gcccccaggc tgctgcaggc
     1261 ctacagctgg aaggaagaag aggaagagga cgaaggctcc atggagtccc tcacatcctc
     1321 agagggagag gagcctggct ccgaagtggt gatcaagatg ccaatggtgg accccgaggc
     1381 acaggccccc accaagcagc ccccacggag ctccccaaat acagtcaaga ggccgactaa
     1441 gaaagggcgt gatcgagctg gcaagggcca gaagccccgt ggaaaggagc agctggccaa
     1501 gcggaagacc ttctcgctgg tcaaggagaa gaaggcggct cggaccctga gtgccatcct
     1561 cctggccttc atcctcacct ggacaccgta caacatcatg gtgctggtgt ccacgttctg
     1621 caaggactgt gttcccgaga ccctgtggga gctgggctac tggctgtgct acgtcaacag
     1681 caccatcaac cccatgtgct acgcactctg caacaaagcc ttccgggaca cctttcgcct
     1741 gctgctgctt tgccgctggg acaagagacg ctggcgcaag atccccaagc gccctggctc
     1801 cgtgcaccgc actccctccc gccaatgctg atagtcccct ctcctgcatc cctccacccc
     1861 agtccccggg aaaaggccgg tcggaagagg gcaggggctg catcctcagc cccagggccc
     1921 tgctcaggcc tcacctggct tcccaggacc ctgggtcacc ttcctgggca gcccagagag
     1981 acctgccaac tttccagact tcgctattcc caggcaggga gggaaacccg gggaactggt
     2041 ttttctgttc cctgctgggt gggaatgcgc tcttcacagg aagaaggccc gggaggag
//
LOCUS       MVOTRPBA     2874 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  M.voltae tryptophan synthase operon (trp) genes, complete cds.
ACCESSION   M35130
KEYWORDS    tryptophan synthase.
SOURCE      M.voltae (PS DSM 1537) DNA.
  ORGANISM  Methanococcus voltae
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanococcales; Methanococcaceae.
REFERENCE   1  (bases 1 to 2874)
  AUTHORS   Sibold,L. and Henriquet,M.
  TITLE     Cloning of the trp genes from the archaebacterium Methanococcus
            voltae: Nucleotide sequence of the trpBA genes
  JOURNAL   Mol. Gen. Genet. 214, 439-450 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      206     tryptophan synthase F (AA at 3)
    pept        304     1533     tryptophan synthase B
    pept       1571     2425     tryptophan synthase A
    pept       2460     2600     ORF 46
    pept    >  2874     2666 (c) ORF 68 (AA at 2872)
BASE COUNT     1046 a    381 c    569 g    878 t
ORIGIN      
        1 gggttgcgga aactcatgac catagagtta gcgaaattat ctccaaaaaa tttgatgtcg
       61 tacttgcagg cggtataact tttgaaaacg tgagaaaaat tgtaaattcc gtaaaacccg
      121 ttggaattga tgtttctagt ggcgttgagt taaacaacag aaaaaacgaa ttattaataa
      181 aaaagatttg tcataatttg atttaattag aattaattag aattaatcga attttaacta
      241 attaaaaatt ataggttatt aaattatgac taaatacagt atatgtaaaa ttaaggtgaa
      301 attatgaaat gtaatacaaa atgtgacaaa aatggatatt ttggggaatt tgggggtcaa
      361 tatatacctg aagttttaaa accggctgtt gaagagctta aagaagccta taaagagtta
      421 aaagatgacg aagactttca aaatgagctt gcatactatt taaaacatta tgcaggacgt
      481 gaaactcccc tatattatgc aaaaaacttg actgaaaaac ttggtggtgc caaaatctac
      541 ctaaaaagag aggacttatt gcatggtggt gcccataaaa ccaataacac tattggtcaa
      601 gcacttcttg ctaaaaaaat gggtaaaaca agaataattg ctgagacggg tgcgggtcaa
      661 catggtgttg gcacgtctat ggcaggagca ctttttggtc tcgaaacaga gatttttatg
      721 ggtagggtag atacagaacg acaacaacct aacgtagcac gtatgaaatt attgggtgca
      781 aaagttacgc cagtcgatac aggttccaaa gttttaaaag acgctgtaaa tgaagctatg
      841 agaaattgga ctgctacttt tgaaaatact cactatttac ttggcactgt gatgggtcca
      901 cacccattcc caactatggt gagagatttt cagtcagtaa ttgggaaaga agttaaaaaa
      961 caaataatgg agcaggaaga aagacttcct gattatttag ttgcctgtat tggagggggt
     1021 agcaatgcaa tgggtttatt tcatccattt ttaagtaata atatcagtac tggcaatgat
     1081 gatgccaaaa atgttaaaat gataggaata gaggctgcag gtaaggggct taacactagc
     1141 cttcacggtg catccataac taaaggtgaa aaaggggtac ttcacggtat gctttcgtat
     1201 ttcttacaag acgaggatgg acaaatagaa gaagcttata gtatttctgc cggattggat
     1261 tacccaggga taggtccaga gcatgcttat ttacataacc ttgggcgtgt gcagtatgct
     1321 tcagcaactg ataaacaggc cttaaaagca tttatggaac ttacgagaac cgaaggaatt
     1381 atcccggctc tagaatcgtc tcacgcgatt gcttatgcca ttgaaaatgc aggaaatatg
     1441 gataaggacg atataatggt aataaacctt tcaggacgtg gggataaaga tttaaacaca
     1501 gtaataaatg cagtacataa attgggttgt taaaattaat taaaattaat taaaatatcg
     1561 aggaatttaa atgaaaaact tagaaaattt agaaaaagat ttgaaaaatg acttaaaaaa
     1621 agatttgaaa aaagaaaaac caattttagt tagtttttta gtatcagggg acccaaatat
     1681 tgaagctaca ctaaaattta tgaatgcact agacgaatat tgcggagtta tagaactagg
     1741 tataccattt agtgacccga tagcagatgg ttcaactatt caagaggcaa atgtacgttc
     1801 cttatcaaat ggttataaaa tacatcaatc ttttgacgta ttacgggaat ttaggaaatt
     1861 ttcagatacg ccagttgtac ttatgacgta ttacaatcca atatataaaa gaggtattga
     1921 aaattttgta attcaagcaa aagaagcagg ggcaaatggg cttataattg tagatttacc
     1981 cctagatgaa gcagaacagt atagggcaat atgtaaaaag catgatatgg gaacagtatt
     2041 ccttgtagcc ccaaatacac ctgatgagag gttgatgtat tctgatgagg ctagtacact
     2101 gtttttatac gtaatttcga catttggtat tactggagct aggggttcat ttgaaaagat
     2161 gacttttgaa tttatagctc gtgcaaaaaa tctttgcgat aaaaataagc tgtatgtagg
     2221 ttttggaatt tcaaacggtg aacatgctga aaaaataatt gaaaatggtg ctgacggtgt
     2281 tattgtaggg agtgcttttg tagatattat taaggaatac ggggattcta atgaaactat
     2341 ttataaatta aaagaattag ctcgggaatt aagcgaaggg attcataaag gttatgttaa
     2401 atacaatgaa aagaataaat attaaataat ataatttatt ttaaattttg ggtggagata
     2461 tgaatttaaa agataatata ctttataaat caatcaaatg gttttttgcg gttaaatcgg
     2521 agaaacctaa aaattacgat actgaagtaa aacctatatt gtatgagcaa gagcgacgtg
     2581 gtagacgccg tatattataa taaattctaa tttaaaaaaa taaaaaaaga aattatatta
     2641 ttgtagtatt taattaatta ttcatttaat tcttttttga attcaaaaag tttttggcaa
     2701 tgtccattat attttcagat attatatatt ttgaattttc ttttaaaacg ctatttgcaa
     2761 cgtcaagtga tttataaact tgtgcttcac ctttaaagta catttgtgcc gcttctgcaa
     2821 ctgtttttat tgctttagcc tgcccctcag cttcaattct aatactttct gcag
//
LOCUS       MYCSD1XX      425 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  M.pneumoniae SDC1 repetitive sequence.
ACCESSION   M35024
KEYWORDS    .
SOURCE      M.pneumoniae (strain M-129) DNA.
  ORGANISM  Mycoplasma pneumoniae
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 425)
  AUTHORS   Colman,S.D., Hu,P.-c. and Bott,K.F.
  TITLE     Prevalence of novel repeat sequence in and around the P1 operon in
            the genome of Mycoplasma pneumoniae
  JOURNAL   Gene 87, 91-96 (1990)
  STANDARD  simple staff_review
BASE COUNT      100 a    118 c    120 g     87 t
ORIGIN      
        1 aattcgaatt tgaaggccca aggcctcacc caacccgcct acctcatcgc cggtcttgac
       61 gttgtggccg accacctcgt ctttgcggcc tttaaagcgg gcgcggtggg gtatgatatg
      121 acgactgatt cgagcgcttc gacctacaac caagcactcg cctggtcgac cacggccggg
      181 ttggacagtg atggggggta caaggccttg gtggaaaaca cggccgggct caacggcccg
      241 attaatggct tgtttaccct gctcgacacc tttgcgtatg tgacccccgt gagtgggatg
      301 aaagggggga gtcagaataa tgaagaagtg caaacgactt acccggtcaa gtccgaccaa
      361 aaggccaccg ccaaaattgc ctccttaatt aatgccagcc cactcaacag ttatggggat
      421 gatgg
//
LOCUS       MYCSDC1       425 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  M.pneumoniae SDC1 repetitive sequence.
ACCESSION   M35022
KEYWORDS    .
SOURCE      M.pneumoniae (strain M-129) DNA, clone MP135.
  ORGANISM  Mycoplasma pneumoniae
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 425)
  AUTHORS   Colman,S.D., Hu,P.-c. and Bott,K.F.
  TITLE     Prevalence of novel repeat sequences in and around the P1 operon in
            the genome of Mycoplasma pneumoniae
  JOURNAL   Gene 87, 91-96 (1990)
  STANDARD  simple staff_review
BASE COUNT      102 a    110 c    123 g     90 t
ORIGIN      
        1 aattcgaatt tgaaggctca aggcctcacc caacccgcct acctcatcgc cggtcttgac
       61 gttgtggccg accacctcgt ctttgcggcc tttaaagcgg gcgcggtggg gtatgatatg
      121 agcacggaaa acagtgctgc caccaaagac caagcactcg cctggtcgac cacggccggg
      181 ttggacagtg ctggggggta caaggccttg gtggaaaaca cggccgggct caacggtccg
      241 attaatggct tgtttaccct gctcgacagc tttgcctatg tgaccccggt gagtggcatg
      301 aaagggggta gtcagaataa cgaagaagtg cagaccaagt atcccgttaa ggatgatagt
      361 aaggcttccg ccaaaattgc gtccttaatt aatgccagcc cactcaacag ttatggggat
      421 gatgg
//
LOCUS       MYCSDC1X      425 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  M.pneumoniae SDC1 repetitive sequence.
ACCESSION   M35023
KEYWORDS    .
SOURCE      M.pneumoniae (strain M-129) DNA, clone MP46..
  ORGANISM  Mycoplasma pneumoniae
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 425)
  AUTHORS   Colman,S.D., Hu,P.-c. and Bott,K.F.
  TITLE     Prevalence of novel repeat sequences in and around the P1 operon in
            the genome of Mycoplasma pneumoniae
  JOURNAL   Gene 87, 91-96 (1990)
  STANDARD  simple staff_review
BASE COUNT      105 a    114 c    117 g     89 t
ORIGIN      
        1 aattcgaatt tgaagaccca aggcctcacc caacccgcct acctcatcgc cggtcttgac
       61 gttgtggccg accacctcgt ctttgcggca tttaaagcgg gcgcggtggg gtatgatatg
      121 acgactgatt cgaacgcttc gacctacaac caagcactcg tctggtcgac cacggccggg
      181 ttggacagtg atggggggac aaggctttgg tagaaaacac aggccgggct caacggcccg
      241 attaatggtt tgtttaccct gctcgacacc tttgcgtatg tgacccccgt gagtgggatg
      301 aaagggggga gtcagaataa tgaagaagtg caaacgactt acccggtcaa gtccgaccaa
      361 aaggccaccg ccaaaattgc ctccttaatt aatgccagcc cactcaacag ttatggggat
      421 gatgg
//
LOCUS       MZECAT1      2065 bp ss-mRNA            PLN       26-JUL-1990
DEFINITION  Z.mays catalase isozyme 1 (CAT-1) mRNA, complete cds.
ACCESSION   M33104
KEYWORDS    catalase isozyme 1.
SOURCE      Z.mays seedling, cDNA to mRNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 2065)
  AUTHORS   Redinbaugh,M.G., Wadsworth,G.J. and Scandalios,J.G.
  TITLE     Characterization of catalase transcripts and their differential
            expression in maize
  JOURNAL   Biochim. Biophys. Acta 951, 104-116 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        169     1647     catalase isozyme 1 (EC 1.11.1.6)
    mRNA          1     2065     catalase isozyme 1 mRNA
BASE COUNT      502 a    558 c    487 g    518 t
ORIGIN      
        1 gaaaaaaaag gggaaatcgg cttcctactc cccgtcctta tcgccagccg aaccgacatg
       61 ttttctcccc ccttctcgcc ttctccttct ccccctagtc tagaggcgtt tgctccccaa
      121 ctccttcggc ccgtccgccc gcccactcga ctgatcccac cggcagccat ggatccatac
      181 aagcaccgcc cgtctagtgg gagcaactcc agcttctgga ccaccaactc cggcgccccc
      241 gtctggaaca acaactctgc cctcaccgtc ggacagcgag gtccaatcct ccttgaggat
      301 tatcatctaa tcgaaaagct tgctcagttc gacagagaac gtatccctga acgtgttgtg
      361 catgcacggg gagccagtgc caagggtttc tttgaggtca ctcatgatgt ctctcacctt
      421 acatgtgctg attttctccg tgctcctggg gtccagacac ctgttattgt ccgtttctct
      481 acagttgtgc atgagcgtgg aagccctgag accttgaggg atccacgtgg ttttgctgtc
      541 aagttctaca ccagagaggg taactttgac ctcgtgggta acaacatgcc tgtgtttttc
      601 atacgagatg ggatgaaatt ccctgacatg gtccacgctt tcaagccgaa tccaaagacc
      661 aatttgcagg agaactggag aatagtagat ttcttctctc accacccaga gagcctacac
      721 atgttcacct tcctctttga cgatgttggc atcccactca actacaggca catggagggc
      781 tttggtgtca atacctactc cttgatcaac agggatggaa agcctcacct tgtgaaattc
      841 cattggaagc ctacttgtgg tgtgaaatgc ttgctcgaca atgaagctgt gactgttgga
      901 ggcacctgcc acagccatgc gacgaaggat ctatatgatt ccatcgcagc tgggaattac
      961 cctgaatgga agctctacat ccagactatt gatcttgacc atgaggataa gtttgacttt
     1021 gacccgctcg atgtcaccaa gacctggcct gaggatatca tcccgctgca gcccgttgga
     1081 cggatggtcc tgaacaagaa cgtcgacaac ttctttgcag agaatgaaca gattgctttc
     1141 tgcccagcga ttagtgttcc tgcaattcac tattctgatg ataagctgct ccagacgaga
     1201 atcttctcct atgctgatac ccagaggcac cgccttggtc caaactatct gatgcttcct
     1261 gtgaatgcac caaaatgtgc ccaccacaat aaccaccatg atgggttcat gaacttcatg
     1321 cacagggacg aagaggtgaa ctacttccct tcgaggtttg atcccgcccg tcacgcggag
     1381 aaggtcccca ttcctccccg tgttctaaca cgctgtcgtg agaagtgcat cattcagaag
     1441 gagaacaact tcaagcaggc tggcgagaga tatcgttcct tcgaccctgc aaggcaagac
     1501 cggttcatcc agcgatgggt tgacgcactg acacaccctc gcgtgaccca tgaacaccgt
     1561 accatttgga tctcctactg gtcccagtgc gacgccgctc ttggccagaa gctgccttct
     1621 aggctgaacc tgaagccgag catgtaagga tcgacgagga agaaagcagg caccggtggc
     1681 caaggatgca acgcaacatg gagcgtgtga tgtttacacc aatataattg aataaacagg
     1741 ggatgtgcgc gttgtcgtac ttatgctgat gctgatggtc ggtggtcgat tatatatact
     1801 ggaacttctg gtgtatgctc ttctcttctg gggagacgta atctaacgaa gaagaatgtg
     1861 tgtcattgtg gcctgtgcta caaaccctgc tgtatgggcc tgtctataag aaaacacgga
     1921 tggagttgtg acgttatgtt ctgacagttt atttactaat gagcacatac tttgatctaa
     1981 ctagaacgaa gagaagttca cggaactgtc ggacacatgc agcaaggatc ctcattataa
     2041 tacgaatcac tcttcgtttg cattc
//
LOCUS       MZECAT3      1790 bp ss-mRNA            PLN       26-JUL-1990
DEFINITION  Z.mays catalase isozyme 3 (CAT-3) mRNA, complete cds.
ACCESSION   M33103
KEYWORDS    catalase isozyme 3.
SOURCE      Z.mays seedling, cDNA to mRNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1790)
  AUTHORS   Redinbaugh,M.G., Wadsworth,G.J. and Scandalios,J.G.
  TITLE     Characterization of catalase transcripts and their differential
            expression in maize
  JOURNAL   Biochim. Biophys. Acta 951, 104-116 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         22     1509     catalase isozyme 3 (EC 1.11.1.6)
    mRNA          1     1790     catalase isozyme 3 mRNA
BASE COUNT      386 a    550 c    550 g    304 t
ORIGIN      
        1 cgtgggtagc tagctaggtg aatgacaatg gatcctacca agttccgtcc gtccagcagc
       61 cacgacacga cggtgacgac gacgaacgct ggcgctcctg tgtggaacga caacgaggcg
      121 ctgactgtgg ggcctcgcgg tcccatcctg ctggaggact accacctgat cgagaaggtg
      181 gcgcacttcg accgcgagcg catcccggag agggtggtgc acgcgcgtgg cgcgtccgcc
      241 aagggcttct tcgagtcgac ccacgacgtg acgtcgctga cgtgcgccga cttcctgcgc
      301 gcgcccggcg tgcggacgcc cgtgatcgtg cgcttctcgc aggtgatccc agagccgggg
      361 tccggacgga cgatccgaga cgcgcgcggg ttcgccgtga agttctacac ccgcgagggc
      421 aactgggacc tgctgggcaa caacttcccc gtcttcttca tccgcgacgg catcaagttc
      481 cccgacgtga tccacgcgtt caagcccaac ccgcggtcgc acgtgcagga gtactggcgg
      541 gtgttcgact tcctgtcgca cctccccgag agcctgcaca ccttcttctt cctcttcgac
      601 cacgtgggcg tgccgtccga ctaccgccac atggaagggt tcggcgtgaa cacgtacacg
      661 ttcgtgagcg cggcggggaa ggcgcagtac gtgaagttcc actggaagcc gacgtgcggc
      721 gagcggtcca tcctgacgga cgaggaggcg cgcgtcgggg gacggaacca cagccacacg
      781 caggacctgt acgactccat cgcggcggag gggagcttcc cggagtggac gctgtacgtg
      841 caggtgatgg acccggcaca gcaggagcag tacgacttcg acccgctgga cgacaccaag
      901 acgtggccgg aggacctgtt gccgctccgc cccgtgggga ggctggtgct ggacaggaac
      961 gtggacaact tcttgaacga gaacgagcag ctggcgttcg ggccggggct ggtggtgcca
     1021 gggatctact actcggacga caagatgctg cagtgccggg tgttcgccta cgccgacacg
     1081 cagcgctaca ggctgggtcc caactacctg atgctgcccg tcaacgcgcc gcgctgcggc
     1141 acccacaaca accactacga cggcgccatg aacttgatgc accgcgacga ggaggtggac
     1201 tactacccgt ccaggcacgc gcgccgctgc ggcagggcgg cgcccacgcc actgccgccc
     1261 aggccggtcg cggggaggag ggagaaggca accatacgca agcccaacga cttcaagcag
     1321 ccaggggaga ggtaccgctc ctgggacgcc gaccgacagg accgattcgt gaaggcgatt
     1381 cgccgactcg ctcggacacc caaacgtcag ccagagctca ggtccatctg gatagacctc
     1441 ctcgccaagg tcgacgcgtc gctggggatg aagattgcca cccggctcaa catgaaggca
     1501 aacatgtgat gcttgtgctg aatagaataa taatgaagac gcatgcatgt cgtcgccagg
     1561 aacaagagaa ataataacaa gaccaccacg catgggcata ctccatatat atatgtatag
     1621 cccgtgcccg tgtccgcctt tgtaccaata caagccaaga ctagtggatg tattattatt
     1681 attattattg cgctatcaca tacatgtacc cctgctacct gaagatggat attgtatcca
     1741 gttatcaaat taagacacct gcagcaaaaa aactatatat gttgcataag
//
LOCUS       BRVRNASA      197 bp ss-mRNA            VRL       26-JUL-1990
DEFINITION  Berne virus ORF5 mRNA, 5'end.
ACCESSION   M33503 M33501
KEYWORDS    core protein.
SOURCE      Berne virus (strain P138/72) viral RNA.
  ORGANISM  Berne virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Toroviridae.
REFERENCE   1  (bases 1 to 197)
  AUTHORS   Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M.
  TITLE     A 3'-coterminal nested set of independently transcribed mRNAs is
            generated during Berne virus replication
  JOURNAL   J. Virol. 64, 331-338 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        137  >   197     ORF5
    mRNA        113  >   197     RNA5
BASE COUNT       50 a     27 c     38 g     82 t
ORIGIN      
        1 ttatttcttc ttcctacttt gtggctactt gggttttgtt ggtggtggtt attattttag
       61 tatttataat tataagtttt tgtattagta attaagtagg ttagtgagag acactatctt
      121 tagagaaaga gccaagatga attctatgct taatccaaat gctgtgccat ttcaaccatc
      181 acctcaggtt gttgcat
//
LOCUS       BRVRNASB      179 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Berne virus ORF3 mRNA, 5' end.
ACCESSION   M33502
KEYWORDS    core protein.
SOURCE      Berne virus (strain P138/72) viral RNA.
  ORGANISM  Berne virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Toroviridae.
REFERENCE   1  (bases 1 to 179)
  AUTHORS   Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M.
  TITLE     A 3'-coterminal nested set of independently transcribed mRNAs is
            generated during Berne virus replication
  JOURNAL   J. Virol. 64, 331-338 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        153  >   179     ORF3
BASE COUNT       52 a     17 c     34 g     76 t
ORIGIN      
        1 ttataatctt cttcctactt ggattacatg gcttacttta ggttttagtt tgtttagtat
       61 agtaataagt ggtattaata ttattttgtt ttttgaaatg aatggtaagg tgaagaaaag
      121 ttagtcactt tctttagaag aaggttgcca aaatgtttga gaccaattat tggccattt
//
LOCUS       CHKGLOBA     1204 bp ds-DNA             VRT       26-JUL-1990
DEFINITION  Chicken pie-alpha-globin gene, fragment H3/H4.
ACCESSION   M30485
KEYWORDS    pie-alpha-globin.
SOURCE      Chicken AEV transformed erythroblast DNA, fragment H3/H4.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 1204)
  AUTHORS   Broders,F., Zahraoui,A. and Scherrer,K.
  TITLE     The chicken alpha-globin gene domain is transcribed into a
            17-kilobase polycistronic RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA         91  >  1204     pie-alpha-globin mRNA fragment H3/H4 (put.)
BASE COUNT      282 a    252 c    263 g    407 t
ORIGIN      
        1 ggatctatct agttgctgca gtcgtttgta tgaaggttgg atccatcctg ttttgtactg
       61 gatgactgcc ttcaattcac tggcaatcta ggatcaaatg tgtcctagag aacattcaat
      121 atcgcttttt ttctaagctg ttgcaagcca gaatggttac ttttgagctg atctcggtgg
      181 agcagttgag ttgttgtaag ttatttctta atggctccag aaaattacat catttaggtg
      241 ctataactct ccatttccat cttgtatgcg taattgcatt tcttgaatac ttcagacatt
      301 aatttcccgt cctacctgca ggttactggt gtgtattggc tatacagatt acttttccac
      361 agatgtaacc ctaggtcttt tgaatataga tcccatctat tgtctgctta gagaccccga
      421 taaccctccc gataaatcag agtccatgtt ttttgacagt atatcggtgt gaacatctgg
      481 attttagtgc aatatgctag tagcaatctg agtccccgtt tctaagacag agtcatttag
      541 tccgagaatg gctgtttaag actccaaatg gcagtcttga gtcttttagt gactgtactc
      601 gttcctctac tgagggcagt cttgagtgtt ttagtgactg taccctgtct cttaacttga
      661 ccggtctgat agatcttaaa tgacagtcgt ggccgcaatt tcaaatggaa gagctaggag
      721 tctcaggaac cgtcgccctt gtttactctt atgtttaccc gttaagccgt catgaaaagg
      781 atttttctgt agagaacggt tatatgagtt gtattccatc tagggtcacg gcccctagac
      841 caaccaacga cgagtcgatt tgttgtctgg cactttctgt gacttcaagt tttgtggctt
      901 tctctattaa ctttccccac aacgtaactg tctaacttag atgttggcgc gagaactaca
      961 gtctgaggga cttgtcaaga gctggcacac tcgcctttat gttaaagtgt gtcctttgtc
     1021 gatactggta ctaatgctta agctcgagcg ggcccctaga ccaacgacga gtcgatttgt
     1081 tgtctggctc tttctgtgac ttcaagtttt gtggctttct ctattaactt tcccacaacg
     1141 taactgtcta attagatgtt ggcgcgagaa tacagtctga gggattgtca agagtggact
     1201 ggtt
//
LOCUS       CHKGLOBB      582 bp ds-DNA             VRT       26-JUL-1990
DEFINITION  Chicken pie-alpha-globin gene, fragment H10.
ACCESSION   M30486
KEYWORDS    pie-alpha-globin.
SOURCE      Chicken AEV transformed erythroblast DNA, fragment H10.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 582)
  AUTHORS   Broders,F., Zahraoui,A. and Scherrer,K.
  TITLE     The chicken alpha-globin gene domain is transcribed into a
            17-kilobase polycistronic RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        241  >   582     pie-alpha-globin fragment mRNA H10
BASE COUNT      171 a    128 c    108 g    175 t
ORIGIN      
        1 tccaaaaaac ttactctgct tgtaaatgtc gtctcctttt tcggagacaa aaacttgata
       61 ccttcttgcc ttgtccgaag tcactttatc ggttatagga cccaagtttt gggccttgct
      121 agaaggatac aattccctat gaccgccgta ttttggggta ctcgcattcg cccgacatcg
      181 agtggacctc ctttttttct cttgtcgttc gtagaggtta tcgaggtccc cccatatata
      241 ataaccctat cgtgagttta gacttcctac aaaaacttct gtcgtttaat gttttcgtac
      301 cgtcacggtg actgtccagt aatcaaagtt gtcactgtct aaaaagattc gacaacttcg
      361 tcttaccaat gcgaaaactc gactagagac actcgtcaac tcacacattc aataaagaat
      421 taccgaggtc ttttaatgta gtgaaatcac gatattgaga ggtaaaggta gaaacatacg
      481 cattaaccta aagaacttat gaagtctgta attaaaggac cacaagcaat acgaaagaca
      541 atgtatttct tctaacgtcg gataagtatt aggatggacg tc
//
LOCUS       ECOPHOAA      600 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  E.coli alkaline phosphatase (phoA) gene, 5' end.
ACCESSION   M33536
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli (strain K-12) cell line BW7710 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 600)
  AUTHORS   Agrawal,D.K. and Wanner,B.L.
  TITLE     A phoA structural gene mutation that conditionally affects
            formation of the enzyme bacterial alkaline phosphatase
  JOURNAL   J. Bacteriol. 172, 3180-3190 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.K.Agrawal, 03-APR-1990.
            
            The phoA503 mutation does not interfere with export of active
            enzyme but does interfere with assembly.
FEATURES       from  to/span     description
    pept        283  >   600     alkaline phosphatase precursor (phoA) (EC
                                 3.1.3.1)
    sigp        283      345     alkaline phosphatase signal peptide
    matp        346  >   600     alkaline phosphatase
    variant     413      413     c in wild type; t in phoA503 mutation
BASE COUNT      159 a    130 c    151 g    160 t
ORIGIN      Map position 8.7 minutes; 1 bp upstream of HindIII site.
        1 aagctttgga gattatcgtc actgcaatgc ttcgcaatat ggcgcaaaat gaccaacagc
       61 ggttgattga tcaggtagag ggggcgctgt acgaggtaaa gcccgatgcc agcattcctg
      121 acgacgatac ggagctgctg cgcgattacg taaagaagtt attgaagcat cctcgtcagt
      181 aaaaagttaa tcttttcaac agctgtcata aagttgtcac ggccgagact tatagtcgct
      241 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt
      301 gcactggcac tcttaccgtt actgtttacc cctgtgacaa aagcccggac accagaaatg
      361 cctgttctgg aaaaccgggc tgctcagggc gatattactg cacccggcgg tgctcgccgt
      421 ttaacgggtg atcagactgc cgctctgcgt gattctctta gcgataaacc tgcaaaaaat
      481 attattttgc tgattggcga tgggatgggg gactcggaaa ttactgccgc acgtaattat
      541 gccgaaggtg cgggcggctt ttttaaaggt atagatgcct taccgcttac cgggcaatac
//
LOCUS       GCOEARA      1771 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  G.tikvahiae McLachlan 18S ribosomal RNA gene.
ACCESSION   M33640
KEYWORDS    18S ribosomal RNA.
SOURCE      G.tikvahiae McLachlan (isolate Pomquet Harbour-Nova Scotia) DNA.
  ORGANISM  Gracilaria tikvahiae McLachlan
            Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; 
            Florideophycideae; Gigartinales; Gracilariaceae.
REFERENCE   1  (bases 1 to 1771)
  AUTHORS   Liu,Q.-Y., Bird,C.J., Rice,E.L., Murphy,C.A. and Ragan,M.A.
  TITLE     Nucleotide sequence of the 18S ribosomal RNA gene from the red alga
            Gracilaria tikvahiae mclachlan
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ragan 08-APR-1990.  Atlantic Research Lab, National Research
            Council of Canada, 1411 Oxford Street, Halifax, Nova Scotia CANADA
            B3H 3Z1
FEATURES       from  to/span     description
    rRNA          1     1771     18S ribosomal RNA
BASE COUNT      445 a    371 c    501 g    454 t
ORIGIN      
        1 ccacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc
       61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct
      121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccata
      181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt
      241 tttttctgat cgcactcgtt gcgacgcacc gttcaaattt ctgacctatc aactttggat
      301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg
      361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc
      421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta
      481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc
      541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc
      601 tcgtagtcgg attttggcgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg
      661 cctttgtgga tgggagtcag gtggtgcttc actggatcgc ttggctgccg ccaccgttta
      721 ctgtgaaaaa attagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata
      781 atagaatagg acccggtcct attttgttgg tttgtttgaa tcgggtaatg attaagaggg
      841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg
      901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga
      961 tcgaagacga tcagataccg tcgtagtctt tac
tataaac gatgaggact ggagatcgga
     1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg
     1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg
     1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg
     1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt
     1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag
     1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc
     1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc
     1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat
     1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa
     1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct
     1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat
     1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt
     1741 ttccgtaggt gaacctgcag aaggatcaag c
//
LOCUS       HS6MCP       4440 bp ds-DNA             VRL       26-JUL-1990
DEFINITION  Human herpesvirus type 6 major capsid protein (MCP) gene, complete
            cds.
ACCESSION   M33515
KEYWORDS    major capsid protein.
SOURCE      Human herpesvirus type 6 DNA.
  ORGANISM  Human herpesvirus type 6
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 4440)
  AUTHORS   Littler,E., Lawrence,G., Liu,M.-Y., Barrell,B.G. and Arrand,J.R.
  TITLE     Identification, cloning, and expression of the major capsid protein
            gene of human herpesvirus 6
  JOURNAL   J. Virol. 64, 714-722 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        235     4272     major capsid protein (MCP)
BASE COUNT     1422 a   1169 c    785 g   1064 t
ORIGIN      
        1 tatcgtgaac gatatttggc ccggacgttt gaaaaatttt ctctatgatt gactcgatct
       61 tttccagaac tacaggcatg gatcgcgcta aacgagtttc ctcgtcgcga gacacttcag
      121 cggtcagatc acacgaatct ataaaaactg gaatcgaccg tgcacaagtg gaaccaaaac
      181 atgaattaac tattaaagtt tcacaattac cggtgtgctg cataacgccg aaacatggaa
      241 aattggcagg cgaccgaaat tttacctaag atcgaagcac ctctaaatat tttcaatgac
      301 attaaaacat acacagccga acaacttttt gacaatttgc gaatttattt cggtgacgat
      361 ccgagccgtt acaacatcag ttttgaagcc ttactcggaa tctactgcaa caaaatagaa
      421 tggattaact ttttcaccac gccgatcgcc gttgcagcga acgtaatccg cttcaatgat
      481 gtgagtcgaa tgaccctcgg gaaggttctc ttctttattc aattacctag agtcgctaca
      541 ggaaacgacg taactgcttc aaaagaaacc accatcatgg tagccaaaca ctcagaaaaa
      601 caccccataa acatatcgtt cgatttgagc gctgcctgtc tggaacatct ggaaaacaca
      661 tttaaaaaca cagtcatcga tcagatttta aacatcaatg cgttacatac agtcttaaga
      721 tctttaaaga attcagccga ttcgctcgag cgaggtttga ttcacgcatt catgcaaacc
      781 ttattgagaa aatctccccc gcaatttatc gtcctgacca tgaatgagaa caaagtacat
      841 aataaacaag ctctgagccg agtacagcgc agcaacatgt ttcagagcct gaagaacaga
      901 ttgttaacgt cattattttt tttgaacagg aataataata tttcatatat ctatagaatt
      961 ctaaacgaca tgatggaatc ggtcacggaa agcattctaa atgatacgaa caactacact
     1021 tccaaagaaa acgtccccct agatggtgtt ttattaggac cgatcggctc tatccaaaaa
     1081 ctcaccagca tactctccca gtacatctcc acacaagtcg tctccgcccc aatctcatat
     1141 ggtcacttta ttatgggcaa agaaaacgca gtgactgcga ttgcataccg tgcaatcatg
     1201 gccgatttta ctcaattcac cgtgaacgcc gggacagaac aacaagacac taacaacaaa
     1261 tcagaaatct tcgacaaaag ccgcgcgtac gccgacctaa agctgaacac gttgaaattg
     1321 ggagataaat tagtcgcatt cgaccaccta cacaaagttt acaaaaacac agacgtcaac
     1381 gatccgctag aacagagctt acaactaaca ttctttttcc ctttgggtat ctacataccg
     1441 agcgagaccg gtttcagtac aatggaaaca cgtgtgaaat taaacgacac catggaaaac
     1501 aacctaccca ccagcgtttt tttccacaat aaagaccaag tcgtgcagcg aattgatttt
     1561 gccgacatat taccgtcggt ttgccatccc attgtccacg actcgaccat cgtcgaacga
     1621 ctcatgaaaa gcgaaccatt gcctaccggc caccgctttt cccaactatg tcaactaaaa
     1681 attacccgag aaaacccagc caggatctta cagaccttat acaacttata cgaaagtcga
     1741 caagaagtac ccaaaaacac caacgtctta aaaaacgaat taaacattga agatttttac
     1801 aaaccggaca atccaacact gccgaccgaa agacacccct tcttcgatct cacgtatatc
     1861 cagaaaaacc gagccacaga agtactctgc acaccaagaa taatgatagg caacatacct
     1921 ttaccgttag ctccagtctc tttccacgaa gcccgtacaa atcaaatact ggaacatgca
     1981 aagacgaact gccaaaagta cgacttcacc ctcaaaattg tcaccgaaag cttgacgagt
     2041 ggctcgtacc cagaattggc ttacgttatc gagaccttag tgcatggaaa caagcatgct
     2101 tttatgatcc taaaacaagt aattagccag tgtatttctt attggtttaa catgaaacat
     2161 atacttcttt tttgcaacag cttcgagatg atcatgctaa tctctaacca catgggcgac
     2221 gaactgatcc cgggagcagc tttcgctcac tacagaaatc ttgtgtcgct aattcgccta
     2281 gtgaagagaa caatctctat ctccaacctc aacgagcaac tttgcggcga acctctggtg
     2341 aatttcgcca acgcgttgtt cgacggacgt ctgttctgcc cgttcgtcca taccatgccc
     2401 agaaacgaca cgaatgcaaa aataacagcg gatgatacac cactgacaca gaacaccgta
     2461 agagttagaa attacgaaat atccgatgtg caaagaatga atctaataga ttcaagcgtc
     2521 gtctttaccg acaatgacag accatcgaac gaaaccacca tcctgagcga gatattttac
     2581 ttctgcgtac tcccggcact atcaaataac aaggcctgtg gcgctggcgt caacgtaaag
     2641 gaactagttc tagacttatt ctacacggaa ccgttcatca gtccagatga ttatttccag
     2701 gagaatccga ttaccagcga cgttctaatg tctctgatcc gagaaggtat gggccctggc
     2761 tacaccgtag ccaacacatc ctgtatcgca aaacagttgt ttaaatcgct aatctacatt
     2821 aatgaaaata cgaaaatatt ggaagtggaa gtctccttag atcccgcgca gcgacacggc
     2881 aactccgttc attttcaatc actacaacac attctataca acgggctttg cctgatctca
     2941 ccgatcacca ccctaagacg gtactatcaa ccaatcccat ttcatcgatt cttctccgac
     3001 ccgggaatct gcggcaccat gaatgctgat atccaagttt tcctaaatac atttcctcac
     3061 tgtcaaagaa acgacggcgg ttttcctctc ccgcccccat tagcattaga attttataat
     3121 tggcaacgaa caccgttttc cgtgtactca gccttctgcc ccaattccct gttgagcatt
     3181 atgacgcttg ccgccatgca ctcaaaattg tctcccgttg ccatagcgat ccaaagcaaa
     3241 aacaaaatcc atccgggctt tgcggccaca ctagtccgga cggataattt cgacgtcgag
     3301 tgcctattat acagttccag agcagccaca tctataattt tagacgatcc cacggtcacc
     3361 gcggaagcta aagatatcgc aaccacttac aacttcaccc agcacctaag ttttgtagat
     3421 atgggcttag gttttagctc taccaccgcc actgccaatc ttaagcgaat taaatcagat
     3481 atggggagca agatacaaaa ccttttctcc gccttcccga tacacgcgtt taccaacgcg
     3541 gacataaata cgtggattcg acatcacgtc gggatagaaa aacctaatcc ctccgagagc
     3601 gaagcactaa acatcataac gttcggcgga attaacaaaa acccaccctc catactactg
     3661 catggtcaac aagctatctg cgaagttata ctgaccccgg ttacgacaaa cattaacttt
     3721 ttcaaatcgc cccacaaccc aagaggcagg gaatcatgta tgatgggaac ggacccgcac
     3781 aacgaagagg cggctagaaa agcattgtac gaccacaccc aaacagacag cgatacattc
     3841 gccgcaacca caaacccttg ggcatctcta ccaggctcct taggcgatat tctatacaac
     3901 acggcacaca gagaacaact atgttacaac cccaagacat acagtcccaa cgctcaattt
     3961 tttaccgaat ctgacatctt aaaaacaaac aagatgatgt acaaagtgat aagcgaatac
     4021 tgcatgaaat cgaactcgtg tttaaacagc gatagcgaaa tacaatactc gtgctctgag
     4081 ggcacggata gcttcgtaag cagaccatgc cagttcttac aaaacgctct gcctcttcac
     4141 tgttcatcca accaagctct attagagagt cggtctaaaa ccggcaatac gcagatcagc
     4201 gaaacccatt attgtaatta cgccatagga gaaaccatac ctttccaact cattatcgaa
     4261 tcatccatat aaaatggaaa ccgtctactg cactttcgat cacaaactgt cactttccga
     4321 tatcagcacc ctatgcaagc tcatgaacat cgtcataccg atcccagctc accaccatct
     4381 aataggtagc ggcaatttag gtctttatcc catcgtctcc tccaacaaag attacgtcca
//
LOCUS       HUMSEXREPB    916 bp ds-DNA             PRI       26-JUL-1990
DEFINITION  Human sex chromosome repeat, clone pDP330.
ACCESSION   M33524
KEYWORDS    sex chromosome repeat.
SOURCE      Human cell line OXENII DNA, clone pDP320.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 916)
  AUTHORS   Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and
            Page,D.C.
  TITLE     Human sex-chromosome-specific repeats within a region of
            pseudoautosomal/Yq homology
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.M.C.Fisher, 03-APR-1990.
FEATURES       from  to/span     description
    site          1      388     low copy flanking sequence
    rpt         389      916     sex chromosome repeat
BASE COUNT      206 a    228 c    229 g    253 t
ORIGIN      Chromosome Yp.
        1 gaattcaggc ctcagtgtat gtctgtaaca caacagacag ggtctgcagg ggtcgaagta
       61 ttttgtcatc aaagaggaag gaatgatcat tcatcataaa aggcaagaca tctttggtgc
      121 aaggaaaact caagaaaaat accgcagacc atgcaatgag gcactggtcg atggagtgtt
      181 gtaaacccgt cttcccagag tggcatgcac atggatccct cagcacatgg gtgacacaca
      241 gactatgctt cagcaggtct gtctgggccc aagacacatt gtttctcatc agctcccagg
      301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct
      361 tcccagagtt gttacaaaga aatcccaaag actgggcccc ttaaacaaca accttgattc
      421 tcacagtcct tgaggctaga agtctgagat caagctatgg ccagggctgg ttcctcctga
      481 ggcctctctc cttgggttgt agatgctgtc ttctccctgt gtcctcacag ggttgtccct
      541 ctgtgtgtgt ctgtgtcctc atctcctctt cttatgaggt gtcttagtcc atttcaggct
      601 gctgtcacag catgccgtag actgggtggc ttatcagcaa cagacattga ttctcccaca
      661 gtcctggaag ctggacgtct gagatcaggg tatgggcagg gctgcttcct cctgaggcct
      721 ctgtcctggg cttgtagatg ctgtcttctc catgtgtccc catgtggtca tccctctgtg
      781 ggtgtgtctg tttcctcatc tgctcttcta atgagatgtc ttagtccatt gcaggctgct
      841 atcacagaat accataggct gggtggctta taaaccacag agttttattc ttccacagtc
      901 ctggaggctg gaattc
//
LOCUS       HUMSEXRPA     918 bp ds-DNA             PRI       26-JUL-1990
DEFINITION  Human sex chromosome repeat, clone pDP316.
ACCESSION   M33523
KEYWORDS    sex chromosome repeat.
SOURCE      Human cell line OXENII DNA, clone pDP316.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 918)
  AUTHORS   Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and
            Page,D.C.
  TITLE     Human sex-chromosome-specific repeats within a region of
            pseudoautosomal/Yq homology
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.M.C.Fisher, 03-APR-1990.
FEATURES       from  to/span     description
    site          1      388     low copy flanking sequence
    rpt         389      918     sex chromosome repeat
BASE COUNT      207 a    242 c    226 g    243 t
ORIGIN      Chromosome Yp.
        1 gaattcaggc ctcagtgtct gtctgtaacc caacagacgg tgtctgcaga gatcgaagta
       61 ttttgtcgtc gaagaggaag gaatgatcat tcatcacaaa aagcaagaca tctttggtgc
      121 aaggaaaact cgaggaaaat accgcagacc atgcaatgag gcactggttg acggtgtgtt
      181 ataaacccgt cttcccagag tggcatgcac acggatccct caggacatgg gtgacacaca
      241 gactatgctt cagcaggtct gtctgggccc aagacacagt gtttctcatc agctcccagg
      301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct
      361 tcccagagtt gttacaatgc aatcccaaag actgggcagc ttaaacaaca accttgattc
      421 tcccacagtc ctggaagctg gaagtctgag atcaaggtgt gggcagggcg gttcctcctg
      481 agtcctctct cctgggcttg tagatgccgt cttctccctg agtccccacg tggtcatccc
      541 tctgtgtgcg tctgtgtcct catctcctct tcttatgagg tgtcttagtc catttcaggc
      601 tgctgtcaca gcataccata gactgggtgg cttataagca acagacattg attctcccac
      661 agccctggag gctggacgtc ttgagatcag gatatgggca aggctgtttc ctcctgaggc
      721 ctctgtcctg ggcttgtaga caccatcttc tccctgtgtc cccacgtggt catccctcta
      781 tgtgcatgtc tgtgtcctca tctgctcttc ttatgagatg tcttagtcca ttgcaggctg
      841 ctatcacaga ataccatagg ctgggtggct tacaaaccac agacttttat tctcccacag
      901 tcctggaggc tggaattc
//
LOCUS       IRICAP       2461 bp ds-DNA             VRL       26-JUL-1990
DEFINITION  Iridescent virus type 1 capsid protein gene, complete cds.
ACCESSION   M33542
KEYWORDS    capsid protein.
SOURCE      Iridescent virus type 1 DNA.
  ORGANISM  Iridescent virus type 1
            Viridae; ds-DNA nonenveloped viruses; Iridoviridae.
REFERENCE   1  (bases 1 to 2461)
  AUTHORS   Tajbakhsh,S., Lee,P.E., Watson,D.C. and Seligy,V.L.
  TITLE     Molecular cloning, characterization, and expression of the Tipula
            iridescent virus capsid gene
  JOURNAL   J. Virol. 64, 125-136 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        601     1995     capsid protein
    mRNA        587  >  2461     capsid protein mRNA ( 5' end +/- 5 bp)
BASE COUNT      717 a    462 c    443 g    839 t
ORIGIN      
        1 gaaggtgttg aaagatctac tgaaataggc ttcattagca tttttatttt gtccacaaat
       61 tcattatttt taataggctg ttcttcacct ttattcgcat attcaaagta atcgattaaa
      121 tttttttgaa tatggacgat atcatccatg aacataaacc aaacttcata atatatagta
      181 tggagtaacg ggttaattaa accattgatt ccttttaatt gttttggatt aatgaggttt
      241 aaatcatcat aaattttttc tatttttttt aaattttttc gagcaatttt taaatttgat
      301 ttaaccaaac aaacttcctc tactttaatt gttacggttg gtacttttaa accattaatt
      361 ttatttttag aggaagaaca acgctttatt aaagcgttgg aatccattaa tcgcttgttt
      421 tatcataggt tattttttaa ctataaaaaa ataactaaat tactacagtt accaatatgt
      481 cggcattagt tctccttcat attttcgtat tttataccct taaatttaac ctaatcaatt
      541 tctacattta tttttgggtt caaaattttt agccgaaata ttgctactaa taaattaaac
      601 atgtctatgt cctcatcgaa tataacctca gggtttatcg atatcgccac ttttgacgaa
      661 atcgaaaaat atatgtatgg cggcccaaca gcaacagcat actttgttag agaaattaga
      721 aagtcgactt ggttcactca agtaccagtt ccactatcta gaaatactgg taatgcggct
      781 tttggacaag aatggtcggt atctatatca cgtgctggag attatttgtt gcagacctgg
      841 ttacgagtca atatcccacc agttactctt agtggtctac ttggtaacac ttactcttta
      901 agatggacca aaaatttaat gcataacttg attcgtgaag ccaccattac ctttaatgat
      961 ttggttgcag ctcgatttga taactatcat ttggatttct ggtctgcttt caccgtacct
     1021 gccagcaaac gcaatgggta tgataacatg attggtaatg tctcttcttt aattaatcca
     1081 gttgctccgg gtggtacttt gggtagcgta ggtggtatta accttaatct tccacttcca
     1141 tttttcttct ctcgagatac tggtgtagca ctaccaacag ctgctctacc ttacaatgag
     1201 atgcaaatca actttaattt cagagattgg catgagcttt tgattttgac taacagtgct
     1261 ctagtaccac cagcaagtcc atatgttcca attgttgtag gtactcatat ttcagctgct
     1321 ccagttttag gaccagttca agtatgggct aactatgcca tcgtctccaa cgaagaacgt
     1381 cgtagaatgg gttgtgccat tcgagacatt ttgattgaac aggttcaaac ggcaccacgt
     1441 caaaattatg tacctttgac caatgctagt ccaacatttg atattcgttt ctctcatgca
     1501 atcaaagcat tattctttgc tgtacgaaat aaaacatctg cagcagaatg gtcaaattat
     1561 gctacttctt ctccagttgt tactggtgca acggttaact acgaaccaac aggttctttt
     1621 gaccctattg ccaatacaac attgatttat gagaacacta atcgtttggg tgccatggga
     1681 tcagattact tctctttgat taatccattc tatcatgctc caactattcc atcattcatt
     1741 ggatatcatt tgtactcata ttctcttcac ttttatgact tggatccgat gggttctacc
     1801 aattacggta aactcactaa tgtgtctgtt gtaccccaag ctagtccggc agcaattgcg
     1861 gcagcaggag gtactggtgg tcaagcaggt tcagattacc ctcaaaatta tgaatttgtc
     1921 atattagctg tcaataataa tattgtcaga atatcaggtg gagaaacacc acaaaattac
     1981 atagcagttt gttaaggtaa tttgtaacgc tccacaacag gcggaagtgg tctcgtgaga
     2041 gaccgatatt gaggttttat caaccttaat ttgaatcatg aattaacatg atactttggt
     2101 accgtctagt cggcttatat gtcgggctaa tggtcttttt tgatcatcaa gtggctataa
     2161 gtggtacgtc gacgacagtc gacacctagt ggtttaataa aggtttttta cccaaattaa
     2221 actggaacag gcaaggttga tgaaaacggt caaaattcag atagtctcgg gggctatttt
     2281 ggacaagacc gtcggtgcag ctaatgcgta agcatcagtg atatcgctat cgactgggtc
     2341 atcaatcggt tgtcctatct gactttttaa agtctcagga tggctcaatg tacagtcagc
     2401 ccgcagtaag gtgtattccg agctgtcttt gaggataaaa gtaaacttga aaaagaagct
     2461 t
//
LOCUS       MUSIGHAAR     363 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-J region, partial cds.
ACCESSION   M33679
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 363)
  AUTHORS   Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J.
  TITLE     Clustered heavy chain somatic mutations shared by anti p
            azophenylarsonate antibodies confer enhanced affinity and ablate
            the cross-reactive idiotype
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by B.Parhami-Seren, 11-APR-1990.  Massachusetts General Hospital,
            Jackson 1402, Blossom Street Receiving, Boston, MA 02114
FEATURES       from  to/span     description
    pept    <     1  >   363     Ig heavy chain V-D-J region (AA at 1)
BASE COUNT       98 a     83 c     89 g     93 t
ORIGIN      
        1 gaggttcagc ttcagcagtc tggagctgag ttgatgaggc ctgggtcctc agtgacgatg
       61 tcctgcaagg cttccggata tgcaatcaca agctacggtt taaactgggt gaaacagagg
      121 cctggacagg gcctggaatg ggttggatat attcatcctg gaaaaggtta tattcactac
      181 aatgaaaaat tcaagggcaa gaccacactg actgtagaca aatcctccaa tacagcctac
      241 atgcaggtca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcgttt
      301 tttgacattt acatgtatta ctttgactac tggggccagg gcaccactct cacagtctcc
      361 tca
//
LOCUS       MUSIGKABF     324 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Mouse Ig rearranged L-chain mRNA V-J region, partial cds.
ACCESSION   M33678
KEYWORDS    immunoglobulin light chain; joining exon; processed gene;
            variable region.
SOURCE      Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J.
  TITLE     Clustered heavy chain somatic mutations shared by anti p
            azophenylarsonate antibodies confer enhanced affinity and ablate
            the cross-reactive idiotype
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by B.Parhami-Seren, 11-APR-1990.   Massachusetts General Hospital,
            Jackson 1402, Blossom Street Receiving, Boston, MA 02114
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig light-chain V-J region (AA at 1)
BASE COUNT       96 a     77 c     73 g     77 t      1 others
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagntgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaaaat taaagtcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagtga cctggagcat
      241 gaagacattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
      301 ggcaccaagt tggaaatcaa acgg
//
LOCUS       MUSTCVYAN    2567 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Mouse T cell receptor rearranged beta-chain gene, V-2 region, 5'
            end.
ACCESSION   M33500
KEYWORDS    T cell receptor; beta-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c) DNA, hybridoma B.1.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2567)
  AUTHORS   Ratanavongsiri,J., Igarashi,S., Mangal,S., Kilgannon,P., Fu,A. and
            Fotedar,A.
  TITLE     Transcription of the T cell receptor beta-chain gene is controlled
            by multiple regulatory elements
  JOURNAL   J. Immunol. 144, 1111-1119 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       2544  >  2567     T cell receptor beta-chain V-2 region
                                 precursor
    sigp       2544  >  2567     T cell receptor beta-chain signal peptide
    mRNA       2478  >  2567     T cell receptor beta-chain mRNA
BASE COUNT      708 a    560 c    583 g    716 t
ORIGIN      
        1 ctaaagttct tggctactgt tgtgtgcact ttgagtaatg attaagatgc attgggacag
       61 ggggtggaga aatgtcccaa ggaggtagcc atgacctcca acactggtcc tgtggaggcc
      121 ccgaggagct agctagccat ctgatctgga aacaagaggc ttaacctggc tcagtactga
      181 aagctggtca agataagagg gggcaggcag atacctggag gcactgacct tgggaggcag
      241 gaaggttagc aagggagata actggagtgt gagagacatt ctgatcccaa tcttgttaga
      301 ggattaggct gaagagggtt cagtgtgaag ctcagtaaac tgagaagggc ctaggtttcc
      361 ttctcctgga gtctgcttgg ctggacagag cacactgtcc ttagaaaagc aacagagctc
      421 tcctggagga gctaggagcc actgacttca gacccaggga atatcttctc taccctcttc
      481 cttctggctc ttaaggaggc tcacagggag cttatttagc tttttaagga gatttataga
      541 ggctggagga acttgttttt tcaaaagtaa atgctctaga aaaatgaagg ttgaaggtgt
      601 tatcaaactt gtgggtcaaa gctaaatgaa aaaaaaaatc aaaagaagga catgtctatt
      661 cccaacataa gcagaagact tttattataa atatggtggg agaccatagt cagagacaga
      721 gacagctggg aaaggccagc atgaacttga ccctgagcct ggacatctga ggacttgggg
      781 gagcaggtgg gaagaaagaa gagagaaaag agagaagagg ggagaccagg agagtaaaga
      841 gtagacaaaa ggacagcata gcaaaaatag ctggatttat aggggaaggt agctggggaa
      901 aaggcagccc atcccctggg ctggagaagt ttagattaga gggtctgtat tctggccata
      961 tcatatacta ggtaggacta aggaatgctg agtgaagctg gcatccaggt ccacaatgac
     1021 atgttaaata agaacttcag ttagccattt gctttgggat tgaggcataa taaacgccag
     1081 taccccaagc cagctctgtc cacttgtcct cagtaagtga acttaaacag ccaaaccagt
     1141 aatctaaata actaactaac taactaacta aatcaatcaa tcaatcaatc aataaaagta
     1201 gaaaagattt tttcagtgta aacacattgg taacatggaa aaagatccag agatccagta
     1261 aactccctgt gtcagtcttg gggacctgca ggcaagatgg aagtttagag ggccaaggat
     1321 aagcaatcta gctcaaagta tggtcctgcc ctgcattgac ccattgccta ggcttgttaa
     1381 agctgtgtga aatctctttc caggagatac attcccactc tcgctggtgc ctttcctttc
     1441 ttccatgttt tcctggggaa atttctcttt ctttggggtc acttttatca atagcctgct
     1501 gttcagattg aaagactgtc tctttagaat gtctttattt ctgccaggtc agttatagaa
     1561 agtggcatgt tttcctttat tcaggacaaa actcccattt tgattttctg cttgcattcc
     1621 tggagtcaga cagatgagta ttcactgcat acagcctcgt ataaccctgc aaccacctcc
     1681 acatgttcac ttaaatggag acattttact ctcttgcaag agcttgaaac tcaaactcag
     1741 atctgtgaaa ctataaatcc agtttccttc catccctgct cctggagtga tgaccctgag
     1801 actaattatc aataaatgcc tagagcataa gctccagcta gttctctgac ttgctctcaa
     1861 cttattatgc cttttattct aacccagctt tagctacatg gctggtttcc tctccttgtc
     1921 ttcttacttc agtctcctca gcattacagc tcgaatctct gttctatttc tcaagttcct
     1981 ctacctgctg gattatgtcc ttttcctcag tgttccaggc aatctctact tttattctat
     2041 cttgagtgac tagttacttc tgctcagctc ccatgattct gacctcctgt gttttgcagg
     2101 caaatcttcc atgccctctc ctactatttc ccagaattct ctctattcct gctggatgtc
     2161 ccacctactt cctgcatcag ctcattggcc ataagctttt ttattgacag gtgatactta
     2221 acacatatca cttccaggaa tatctgttca ccactgagaa gatgcagggg cccagtcact
     2281 gcactcagtt ctgtagtgag tgtacaatgt gcatgagtgt ggatgagaga gcattgctca
     2341 gaccacagga aagggtgcaa accttcagtt tgaggttttc actttagagg aaagcttagt
     2401 cagtttcctg aggaagtcac accctttgga acctcagccc caagacttaa gtttctcgtt
     2461 accaccttac tggtttggat tctcttctct tgcctgatgc cctgcatgcc ccacagagat
     2521 agagagaacc tgaggtctca gagatgtggc agttttgcat tctgtgc
//
LOCUS       R751TRA       578 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  Plasmid R751 traJ and traK genes, 5'end.
ACCESSION   M25422
KEYWORDS    inverted repeat; transfer origin region.
SOURCE      Plasmid R751 (strain HB101, Inc P-beta) DNA.
  ORGANISM  Plasmid R751
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 578)
  AUTHORS   Lanka,E. and Euerste,J.P.
  TITLE     Conjugative transfer of promiscuous IncP plasmids: Interaction of
            plasmid-encoded products with the transfer origin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            E.Lanka, 17-JUL-1989.
FEATURES       from  to/span     description
    pept         48  <     1 (c) traJ protein
    pept        403  >   578     traK protein
    signal      243      211     promoter PL
    signal      266      294     promoter PR
    rpt          49       64     inverted repeat
    rpt         118      157     inverted repeat
    rpt         296      331     inverted repeat
BASE COUNT      141 a    168 c    163 g    106 t
ORIGIN      
        1 cggccgtgtt ccttttcgtc gttctccatg cctcgcctcg tctctcatgc cggcggtagc
       61 cggctgcctc gcagagcagg atgacccgtt gagcgccccc ggcgcgaata agggacagtg
      121 aagatagata accggctcgc cggttagcta acttcacaca tcctgcccgc cttacggcgt
      181 taataacacc aaggaaagtc tacaccagcc attacgattt atccgcaact atcgcgctat
      241 caggccgcaa aagcagcaac ggatatagcg aaacccgcca caatggccca taatgccgct
      301 atcgaagcgt gccaatgcac gccgatagcg gactttttgc gtttccgtag cgccgcttag
      361 tagcgttaca tttgcgatga gaggattaga tggacgaaca cgatgccaaa gacctacccc
      421 gaagagctgg ctgaatgggt gaagggacgg gaagccaaga agccgcgcca ggacaagcac
      481 gtggtcgcgt tcctggccgt caagagcgac gttcaagcgg cgctcgatgc gggctatgcg
      541 atgaaaacga tctgggagca catgaaggaa accggccg
//
LOCUS       RP4TRAB       571 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  Plasmid RP4 traJ and traK genes, 5' end.
ACCESSION   M25423
KEYWORDS    inerted repeat; transfer origin region.
SOURCE      Plasmid RP4 (strain HB101, IncP-alpha) DNA.
  ORGANISM  Plasmid RP4
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 571)
  AUTHORS   Lanka,E. and Euerste,J.P.
  TITLE     Conjugative transfer of promiscuous IncP plasmids: Interaction of
            plasmid-encoded products with the transfer origin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            E.Lanka, 17-JUL-1989.
FEATURES       from  to/span     description
    pept         26  <     1 (c) traJ protein
    pept        394  >   571     traK protein
    rpt          48       63     inverted repeat
    rpt         118      157     inverted repeat
    signal      219      192     promoter PL
    rpt         281      318     inverted repeat
    signal      246      272     inverted repeat
BASE COUNT      135 a    148 c    181 g    107 t
ORIGIN      
        1 ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc ggcggtagcc
       61 ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata agggacagtg
      121 aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg ctgacgccgt
      181 tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata tcgtgcgaaa
      241 aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta tgcagcggaa
      301 aagcgctgct tccctgctgt tttgtggaat atctaccgac tggaaacagg caaatgcagg
      361 aaattactga actgagggga caggcgagag acgatgccaa agagctacac cgacgagctg
      421 gccgagtggg ttgaatcccg cgcggccaag aagcgccggc gtgatgaggc tgcggttgcg
      481 ttcctggcgg tgagggcgga tgtcgaggcg gcgttagcgt ccggctatgc gctcgtcacc
      541 atttgggagc acatgcggga aacggggaag g
//
LOCUS       STAREPEBR    2389 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  S.aureus ethidium resistance (ebr) and replication protein (repA)
            genes, complete cds.
ACCESSION   M33479
KEYWORDS    ethidium resistance protein; replication protein.
SOURCE      S.aureus plasmid DNA.
  ORGANISM  Staphylococcus aureus
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Micrococcaceae.
REFERENCE   1  (bases 1 to 2389)
  AUTHORS   Liao,J., C,-H., Moghazeh,S.L. and Projan,S.J.
  TITLE     Genetic mapping and nucleotide sequence of pWBG32, an ethidium
            bromide resistance plasmid naturally occurring in Staphylococcus
            aureus
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.J.Projan, 30-MAR-1990. Public Health Res Inst, 455 First
            Avenue, RM 1166, New York, NY 10016
FEATURES       from  to/span     description
    pept       1153     1476     ethidium resistance protein (ebr)
BASE COUNT      796 a    403 c    290 g    900 t
ORIGIN      
        1 ggtcaatatc tttaagataa tctaaatcgc cattttttaa tttatttctt gcgtctttaa
       61 ataatccaga ataaacaaga atttgtttcc ctttaagaga tttataaaat gcgtcgaaca
      121 ctttctgatt aattaaatag tcactatcct taccagaata tttagccatt tcatataatt
      181 ctttattgct attttgctta attttttgaa catgaacttg cgtaatttca gaaattcctg
      241 ttacatctcg ccataaattt aaccattctt tttgactaat ataagctttt gtatctttaa
      301 aatatgattt attaacggcc atcaaaacat gaaaatgcgg attataatca tcacgctttg
      361 agttatacgt tatctctaat tttcttacat aacctttagt gatcgcattt acttttttgc
      421 gtttaaacat cttttgaaag gcatgattat aattcttaat ttcactttct aaatgctcat
      481 ctgtaacgtt tggtgtcgta agtgtcaaaa agataaattg cttatcttct tcttgcttaa
      541 tatattgcat cattaacgat aatcctaatg catcttttct tgctttacgc cacgcacata
      601 ccggacaaaa tcgattctta caaggattcg atttatataa tttctttttt tcaaattttt
      661 tatccgtcac aaaagacaaa aatgtattac aatttttaac caaatccatt tgatctcccc
      721 gatatgacgt tcaataaaat ttttaaatac ttgatttctt tgctttttct cagtatactt
      781 ttccatacga taatacacaa aaacaactta gttttctcaa aaactatgca taaaaaagtt
      841 gcttttttct ccttttcttt ttttttcgtt tggattagac acctaaaacg atacaatagt
      901 atgctagaaa aagcaacttt ttttgtgctt caaaccagtt ataccaatga attgaaaggg
      961 ttatacatcg ccgggaatag ttacccttat tatcaagaca agaagaaact cgttttcaac
     1021 tcgtttcaaa aacctttcaa aaaccatcaa tccacaaaaa taccacgcga atgacactca
     1081 aaatacaaga ctacaattaa aaaatactta gaataaaatt aaataaaata cgaaaattaa
     1141 aaggagttaa aaatgcctta tatttattta ataatagcca taagtactga agttattgga
     1201 agtgcatttc ttaaatcttc agaaggcttt tcaaaattta taccatcctt aggaacaata
     1261 atttcatttg gaatttgttt ctatttttta agtaaaacaa tgcaacacct accactaaat
     1321 ataacttatg caacttgggc gggactaggt ttagtcttaa caaccgtagt ctcaataatt
     1381 attttcaaag aacaaataaa tctaataact atagtatcta tagttttaat catagtcggc
     1441 gtagtttcgt taaacatttt cggaacatcg cattaattgc tttattccaa ttgctttatt
     1501 gacgttgagc ctcggaaccc ttaacaatcc caaaacttgt cgaatggtcg gcttaatagc
     1561 tcacgctatg ccgacattcg tctgcaagtt tagttaaggg ttcttctcaa catcaataaa
     1621 ttttctcggc ataaatgcca tgctataata gatacacgtc ttctcttagc gtttcatagt
     1681 attatcctcg tttattatac ttataattat aggggaaggc ttagagctat cattttgata
     1741 gctctttatt tttgttcaaa catttattca aaatcagaat gcctttattt tttaatttta
     1801 aggggtattt tgaagaatta agggttattt atatagtttt atacctaaaa acttatatcg
     1861 gctcttaaaa cgcaaataag agccgaataa aaataattgc ttttcacaaa caaaaatttg
     1921 agcaaaacca gtgttgaatt ttttagacac tgcccatcta catgcaaatt taaaaattgg
     1981 cataaaaaat gggcaaccat gctggttgaa cgctatagtt cctgcagggg caaaaaagca
     2041 taaaaaaacg ctagctttga tgagctaacg ttagttataa aattcagtaa tatgcttttg
     2101 taattcaata gattctcttt cttttttagc ttgtcttttt ttaaaacctt ctgaatttct
     2161 agaagcctta tatatatcca ttattttttt ataatcaatg tcgtaaccat atttttgtaa
     2221 ctcttctaca aaaaacttat cgcaatttaa tatcattttt cttcctcgat ttcgtttatc
     2281 atttgatgat ttattttttc tttttcttgt tcagttaaat cataaatttc acttgctaag
     2341 tattcttttt gattccaaat ataaaaaatt tgataaatat attcagtcg
//
LOCUS       XANAVR       2100 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  X.campestris avirulence protein (avrBs1) gene, complete cds.
ACCESSION   M32142 J03672
KEYWORDS    avirulence protein.
SOURCE      X.campestris (strain E3, race 2, pv. vesicatoria) DNA.
  ORGANISM  Xanthomonas campestris
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 2100)
  AUTHORS   Ronald,P.C. and Staskawicz,B.J.
  TITLE     The avirulence gene avrBs-1 from Xanthomonas campestris pv.
            vesicatoria encodes a 50-kD protein
  JOURNAL   Mol. Plant Microb. Interact. 1, 191-198 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            P.Ronald, 15-FEB-1990.
FEATURES       from  to/span     description
    pept        308      622     ORF1
    pept        713     2050     ORF2
BASE COUNT      656 a    423 c    505 g    516 t
ORIGIN      
        1 ccattgtcgg cggttatccg ggtacttggc gtacaccaaa caactggggc aatgctggca
       61 aatcacgtga cgaagccttg gcagacgagc aacagaggat tcaagcgctt aaatcgcaag
      121 agacggtaca tatcttccat cgcaaagatg tcaagagcga acccgcaacc cacgcggggc
      181 gacgttaagt aagccactga tttttagcga agaagagctt gtgagagctg cgggcgccaa
      241 atatgtacgt ttgacagtga cagatcatct ttcaccacgg gcggacgata ttgatgcgtt
      301 tattgcaatg gagcgggaga tggcccatga tgagagactg catgtacatt gtggtatggg
      361 cctaggccgt acgacaatat ttattgtcat gcatgacata ctaagaaatg ctgcaatgtt
      421 atcgtttgat gatatcatcg aacggcaacg taaatttaat ccagggcgaa gcttggataa
      481 taataaagac gtttctgaca aggggcgctc agaatttcgt aatgaacggt cagagttcct
      541 tcctctattc tacgagtacg ccaagcaaaa tccaaagggc cagccattgt tatggtccga
      601 atggctcgac cacaatgcat aaatcgcaag tacattttcg gctatgacgg acttgtgctc
      661 gatgcgctgg cggctttctc gataaatatc aattaatata aatatcgaac taatgtccga
      721 catgaaagtt aatttctctt caaaaataat agattcaaca cccagtgaag aggaggtcgc
      781 cactcagcaa gatagttata cgaaatctgg actggtggcg ccatcgctcg attcacaagc
      841 cttgaaaaaa gcacctagaa aaagagtaat aaaagaaaat atagctgctt tgcacacctc
      901 atcgttagag cgagttcatc aaaagaaggt attagttcag aatttagcgc agttgcagag
      961 agggttggct aagataaatg gtagagtcga actcgaagag ctaattgatg gattttcagt
     1021 caaggaattg ctaataaaaa gaaatccaaa gattgctgaa gagtatggag aaggaaatcc
     1081 tttaatgatt cgatctctaa gattttcaaa cccccaagag gtgactagta agcttggggc
     1141 ggaaggaaaa acgccagcca aaagagaggt tgatacgatt tgcaataaat ccacgctgca
     1201 tgacattgtc atgacgcccg cctcccttgt aaaaaaggaa gtgcggatga acctgatatc
     1261 tgaagtccca agggcgaagg ataaacaaaa atacagaggt cttccttcag tcgtatatgg
     1321 ccaaagcagc cgccgtagtg aatcagacta tctaacgtct cgaaatggtt tcggcgacgt
     1381 gcactctttg aaatccaata acgcatttaa ttccgactac gaaaaaatat gtgggtcgct
     1441 tagccatgcc gaaaagttgg ggttaattga aaggaatctt actcccttta taaggcatga
     1501 tccagataga atctccaccg actttgttca ctctattgaa gaattggctg aacaccagat
     1561 gctattgcaa tcaagaaaac ctgccagtgc tttgcggcat aatgaatatt gcaccaagct
     1621 tgaactgtgg gatgctaaag ctatagcagt tggtgaatct cgtgccttgg cggtcgctac
     1681 cctgattgaa tttaatttgg agatgttgtc gatagcacaa gagatagatg atgatgggca
     1741 caagagtaaa atggtcgccg attttatcga gcgccaacta tcatggcttg gcccacaaac
     1801 cgcacttgac agcaagtcaa cgcttgaaag ggtttcagcg gtgaccatac aagaaaggga
     1861 atttatcgct aatgagatta gccgatcgtt gcgtcaaggt gtttcacttt gcacttacga
     1921 taaagatgaa gcaggaagtc atatccgtga aatgagtttg ttggatttta gggttgaaga
     1981 aatcatagag gggataagta tttttatttc ctccaagctt ttacatgtta caaatgcagg
     2041 agaagcgtaa gagaagaagt atccgccaca atcgtgcgac ggaccgacgt cctaacgccc
//
LOCUS       YSCSCD25     5055 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  S.cerevisiae SCD25 gene, complete cds.
ACCESSION   M26647 M31771
KEYWORDS    Ras protein; SCD25 gene; cell division cycle.
SOURCE      S.cerevisiae (strain OL136) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 2129 to 5055)
  AUTHORS   Boy-Marcotte,E., Damak,F., Camonis,J., Garreau,H. and Jacquet,M.
  TITLE     The C-terminal part of a gene partially homologous to CDC25 gene
            suppresses the CDC25-5 mutation in Saccharomyces cerevisiae
  JOURNAL   Gene 77, 21-30 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3880)
  AUTHORS   Damak,F., Boy-Marcotte,E., Le-Roscouet,D., Guilbaud,R. and
            Jacquet,M.
  TITLE     SCD25, a CDC25 like gene, which contains a RAS activating domain is
            a dispensable gene of Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by E.Boy-Marcotte, 02-AUG-1989, for [2] by F.Damak,
            01-FEB-1990.  Laboratoire IGD, Groupe des laboratoires de biologie
            cellulaire, Centre universitaire d'Orsay, 91405-Orsay Cedex
FEATURES       from  to/span     description
    pept        128     3880     SCD25 protein
    pept       4319  >  5055     ORF X
BASE COUNT     1638 a    973 c    900 g   1544 t
ORIGIN      
        1 ctgcaggctc gcaaaattta aggttccctt ctacaatagt agtcaaaatt gcttttttgc
       61 atataacaaa gtgaaaaaaa aaaatatgag agacatatct aaaagacata tataatctgc
      121 caccataatg agttgcactg cgtcatatgc cggcatgaca actccggtga aagataagga
      181 aggccacggg attccatgct tacaacctat cgatgtagtg gaatgtacct atcaatattt
      241 tacaaaatca cggaataaac tgtctttaag ggtaggcgat ttgatttacg tactcactaa
      301 aggttctaat ggctggtggg atggtgttct tatcagacac agcgctaata ataataataa
      361 taattcgttg atactagaca gaggttggtt ccccccttct tttacacggt ccattctaaa
      421 cgaactacac ggggtgcctg acatcggtaa tgaattggaa atatttcaag cgggtcttaa
      481 tcttaaactg gaattatcaa gcaacccagt gatcttatca ttggaagact ttttagactg
      541 ctgtcgcgat attgaattca aggaacaact ggcttggtca cctactcccg tccacgaaag
      601 gaaaggctgc tgtgagctgc tgtactataa ccaggattta gatgtttatt gtcgcacgtt
      661 accatattta ccacaaaatc aagttgaaac cgtgaacgac tattcgtctt ttcctgcaat
      721 atcgaagatt gctggtaaaa agatgcctat aacgtcaagc cccgatctgt tctatctcaa
      781 tgattgtgat gtcgtctatt ggtatgacct cactcgctta gtgtgtcatt atgttaattt
      841 aacagagcgc gacctattgg caaatgaacg ggaaaagttt ctaacttcct tggatttatt
      901 aacagctcaa ataacctatg tttatatgct tttcaggaat ctccgtttag ttgaagatag
      961 tttcaaaaaa accctcaaaa aactaattta caccttgtct aggttttcaa taaatgcaaa
     1021 tatttggttt cattccacat cgtttgaaga aagagaagcc atagcctccc agaaggatcc
     1081 agaaagaaga tcccctcttc tacagtcaat cctaggaacc ttccaaaaat ttcattttct
     1141 actgcgtcta ctacatttcc tctcaaatcc taacgaactt acaatactgc ctcaattgac
     1201 tcctcgattt ttcaaggatt ctttcaatac aatttcatgg aataacccgt ttttgcgtac
     1261 agtcttcaac cagcatatgt ccatgacctt accgagacag atgattaaag ccgttgctgg
     1321 cgcttcagga attgttgcgg aaaatattga tgaaattcca gcttccaaac agggcacttt
     1381 catctcgtca gaaacgtctc accattcacc atcagccccg tttcaaagaa ggagaagagg
     1441 taccattttc tctaatgtgt caggaagttc cgatgagtct gacaccatat ggtccaaaag
     1501 gaaaaaacca tacccgctaa atgaagaaac tctaagcctt gtaagggcca ggaagaagca
     1561 gcttgatggt aaactaaaac aaatgatcaa aagtgctaat gaatatctca gtaacacggc
     1621 taatttcaaa atgttgaatt ttgaaatgaa cttcaaaacc tacgaagaag taagcggaac
     1681 aattcctata attgatattc tggaaaacct agatttaact atttttctaa acttgagaga
     1741 gttgggagat gagaatagag tttttgacga agatgtcttt gacgaagatg tcgctattgg
     1801 tgatgaagat aaagagtttt tgaaacactc tttatcatcc ctatcgtata tcttatccga
     1861 ctattttaat atgaagcaat attttcatga attgtcgccc acgcatttga cattagagga
     1921 tcctttcgtt ttctcgccaa tgcaaaacga cttgcctacc ggttattatg aaccaatgaa
     1981 accttcatcc ttgaatttag ataatgccaa ggataagaag aatgggagcc aaaatactga
     2041 tatccaagag gaggaagatg aatatgagcc agacccggat agtcttattc tcttccacaa
     2101 cctcatcaat caagattctg atttcaatga tctaaagttt tttaatctcg cccacgtttt
     2161 taaaaaatcc tgtgatgact attttgatgt gcttaaacta gccattgagt tcgtgaatca
     2221 attaattcta gaaagagaga atttgttaaa ttatgctgct agaatgatga aaaacaatat
     2281 cacggaattg ctattgcgcg gggaagaagg ctatgggtcc tatgacggcg gtgaaactga
     2341 aaaaagtgac acgaatgctg tttatgcaga ttcagatact aaagacaatg acgaatggcg
     2401 tgacagccaa gtcaaattac cgaggtattt gcagcgcgag tatgacagtg aactgatttg
     2461 gggctctaac aataggatta aaggtggttc taaacacgca ctgatctctt acttgacaga
     2521 taatgaaaag aaggacctat ttttcaatat tactttttta atcactttca gaagcatctt
     2581 tactacaacg gagtttttaa gctacttgat ctcgcaatat aatttggatc caccagagga
     2641 tttgtgcttt gaagaataca atgaatgggt gacgaaaaag cttataccgg ttaaatgtag
     2701 ggtggttgag attatgacaa cctttttcaa gcaatattgg ttcccgggct atgatgagcc
     2761 cgatcttgcg accctaaatc tggattattt tgcgcaagta gcaatcaagg aaaatataac
     2821 aggatctgtg gaattactaa aggaggtcaa tcagaagttt aaactaggta atatacaaga
     2881 agcgactgca ccaatgaaaa cgttagatca acagatctgc caggaccatt actcgggcac
     2941 tttatactct accacggaat ccattttggc cgtcgatcca gttttatttg ccactcaatt
     3001 aacgatacta gagcatgaaa tttattgtga gataaccact tttgattgtt tgcaaaaaat
     3061 ttggaagaac aagtatacaa aatcgtatgg ggcttcaccg ggtttgaacg agtttatcag
     3121 ttttgccaat aaactgacaa atttcatatc ctactctgtt gtaaaggagg ctgataaaag
     3181 taagcgcgcc aagctactct ctcattttat ttttatcgca gaatattgta ggaaattcaa
     3241 taacttttct tccatgactg acatcatttc agcattatat tcttcaccaa tttatcgttt
     3301 agagaaaacc tggcaggcag ttattcctca aacgagagat ctattgcagt cactgaacaa
     3361 gttgatggat cccaagaaaa atttcataaa ttacagaaac gagctgaagt ctttacatag
     3421 cgctccctgc gtaccgtttt tcggcgttta tttatctgat ctaaccttta ctgattccgg
     3481 aaatccggat tatcttgtct tggaacatgg tttaaagggt gtccatgatg agaagaaata
     3541 tataaacttc aacaaaagga gcagacttgt tgatatctta caagagatca tatatttcaa
     3601 gaaaacacat tatgatttca ctaaagatcg gacggtaatt gaatgtatat caaattcatt
     3661 ggaaaacatc ccccatattg agaaacaata ccaattatca ttaattattg aaccaaaacc
     3721 aagaaagaaa gtcgttccga attccaattc gaataataaa tcacaagaaa aatccaggga
     3781 tgaccaaacc gatgaaggaa aaacatccac taagaaagac agatttccaa aatttcaatt
     3841 acataagaca aagaaaaaag ctcccaaggt ttctaagtaa cggcgccgta tgttcgattt
     3901 ccttctctcg gtggattaat tattttgttt gttttctcct gttatattat ttattgatca
     3961 ctatagtaaa ctatgtccgt catcaagccc gacggctgct atcccacaat gttgatcgta
     4021 ttgtttgcct agtttattat atatttgctt atttatagca taccataata tttaaatgcc
     4081 ctcaaatttt tggccgtagc gacatcgcga taattccaat tccctttaaa aaattgcgcc
     4141 tgagtataag ttaattcagc cagttctcca aattaaaatc gcatactcct gaacctatca
     4201 acagattgtc ctcgcatact tttctatacc aaggtctctt ctgaacatat attagcagtg
     4261 gttaatttta aagagatcat aaagaaaatt ttgtctaaaa aagattaata taaagacaat
     4321 gtcttcacta gaagtggtag atgggtgccc ctatggatac cgaccatatc cagatagtgg
     4381 cacaaatgca ttaaatccat gttttatatc agtaatatcc gcctggcaag ccgtcttttt
     4441 cctattgatt ggtagctatc aattgtggaa actttataag aacaataaag taccacccag
     4501 atttaagaac tttcctacat taccaagtaa aatcaacagt cgacatctaa cgcatttgac
     4561 caatgtttgc tttcagtcca cgcttataat ttgtgaactg gccttggtat cccaatctag
     4621 cgatagggtt tatccattta tactaaagaa ggctctgtac ttgaatctcc ttttcaattt
     4681 gggtatttct ctccctactc aatacttagc ttattttaaa agtacatttt caatgggcaa
     4741 ccagcttttc tattacatgt ttcaaattct tctacagctc ttcttgatat tgcagaggta
     4801 ctatcatggt tctagtaacg aaaggcttac tgttattagc ggacaaactg ctatgatttt
     4861 agaagtgctc cttcttttca attctgtggc aatttttatt tatgatctat gcatttttga
     4921 gccaattaac gaattatctg aatactacaa gaaaaatggg tggtatcccc ccgttcatgt
     4981 actatcctat attacattta tctggatgaa caaactgatt gtggaaactt accgtaacaa
     5041 gaaaatcaaa gatct
//
LOCUS       ADBMLPA       101 bp ds-DNA             VRL       26-JUL-1990
DEFINITION  Mastadenovirus 2 R1, R2 and R3 binding sites.
ACCESSION   M33540
KEYWORDS    .
SOURCE      Mastadenovirus 2 viral DNA.
  ORGANISM  Mastadenovirus 2
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 101)
  AUTHORS   Leong,K., Lee,W. and Berk,A.J.
  TITLE     High-level transcription from the adenovirus major late promoter
            requires downstream binding sites for late-phase-specific factors
  JOURNAL   J. Virol. 64, 51-60 (1990)
  STANDARD  simple staff_review
COMMENT     Sequence-specific binding proteins are induced during the late
            phase of infection. These proteins interact with three regions in
            the first intron of the major late promoter (MLP).
BASE COUNT       24 a     25 c     26 g     26 t
ORIGIN      
        1 ccagctgttg gggtgagtac tccctctcaa aagcgggcat gacttctgcg ctaagattgt
       61 cagtttccaa aaacgaggag gatttgatat tcacctggcc c
//
LOCUS       LB3HDCBA      804 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  Lactobacillus 30a histidine decarboxylase-B (hdcB) gene,
            complete cds.
ACCESSION   X13099
KEYWORDS    histidine decarboxylase.
SOURCE      Lactobacillus 30a DNA.
  ORGANISM  Lactobacillus 30a
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 804)
  AUTHORS   Copeland,W.C., Domena,J.D. and Robertus,J.D.
  TITLE     The molecular cloning, sequence and expression of the hdcB gene
            from Lactobacillus 30a
  JOURNAL   Gene 85, 259-265 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         85      609     histidine decarboxylase-B (hdcB)
BASE COUNT      277 a    140 c    152 g    235 t
ORIGIN      
        1 actaatccac aggacatagt ttgaggaaga gatggtgttt actacctctt cctttaatat
       61 tttgtaagtt aaggattgat tgcaatgagc aacagtaact accaagttag tttagaacga
      121 attaaaaaag ttgtccctga agaactctta accaatgcat tgttagcagc tattgacaat
      181 tctggtgaaa ggatgtcaca aataatagtc gataaaaaag ataacggcaa cgactattac
      241 ctcaccatcc atagattctt cgtttatagc aacgaagaat tcaccgcttt tgataaagaa
      301 gatgttgcag atgtcgaatt cgttaatggt acgccagatg gtgaagtaat cattacttta
      361 aaggacggca aagtgttgca cccgtctcac atttgttacg gccgagcttt tgactttatc
      421 caagatgtca agccaaaagt aattacaatg gcgggatatg acagcacaat tcgaggcgaa
      481 tttccacaat tattagatcc agatcatgcg gaagagattg atcgattacg tcgctggatg
      541 caagatggaa atattagcca ttacgaatac gatgatgcaa atccagctta tccaaaagca
      601 ggaaaataaa aaaacatatt gacatatcat cagatatagg ttatgttaca atcaagcatc
      661 ttaataggta atgcgcaatt tatatctttg aatatagttc cattatttat ttataaatag
      721 ttactccgaa aaggactacg tacctactat acttttaaat aaatatattt cgtgatgggg
      781 agcgttatta ccccggctgt cgac
//
LOCUS       LBPREPA      3547 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  L.plantarum repA, repB and repC genes, complete cds.
ACCESSION   M33531
KEYWORDS    rep protein.
SOURCE      L.plantarum DNA.
  ORGANISM  Lactobacillus plantarum
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 3547)
  AUTHORS   Bates,E.E.M. and Gilbert,H.J.
  TITLE     Characterization of a cryptic plasmid from Lactobacillus plantarum
  JOURNAL   Gene 85, 253-258 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       2191     2349     repA protein
    pept       2406     3062     repB protein
    pept        570     1655     repC protein
BASE COUNT     1189 a    589 c    758 g   1011 t
ORIGIN      
        1 gatatctggt taactttgat cacattagtg atcaaattca tttctttagc cccatcaaac
       61 gatcagtttg ctttatgaaa gtgaccgctt gatggggctt tttcgtttac cttttgtcaa
      121 aggtaaggtg tgacgggctt gactttgggt ggcgttgtgc ggaagcgcaa tcgacacgat
      181 tttgactttg aggggagtta agaggggaag cgtagcgccc cttcttacaa gtgtaaagtg
      241 tggacaagag agcgtagcga tattgtctac actttacccc aattgtcatg cgactttaaa
      301 tagaattatt gattaataaa agccccctga caaaagtcga agggggactt ttattttagt
      361 ttgaggtttg catacctact taaaaaagta gggcagcaaa acgtcaaaca ggtatcagct
      421 aatcatccga tagggtgcgc tgatacggtc ctcaaaagag agccgacaga gccgtctgca
      481 agacccctcg gcggaggccc acctttacga agtaagatat agtgggttat actttacttg
      541 gaagataact ccgaaatgag gtgcatacaa tgagttttgc agtggctaga atgacgaaat
      601 taaaagctga taatttagtc ggcattggca atcatgacca acggaaaacg actaatcaca
      661 gcaacgaaga tattgatgtt tcccgctctc acctgaatta tgatttagtg gctgggcgca
      721 ctgataactt taaaacggat attgaagcct atatcaacga aaacaaagcg agtaagcggg
      781 cagttcgcaa agacgctgtt ttagtcaatg agtggattat aaccagtgac aaagactttt
      841 ttgagcaatt agacgaagcc gaaacccgta aatattttga aacagccaaa caatattttg
      901 cagataacta tggtgacgaa aatattcgct atgcagttgt tcatatggac gagaagaccc
      961 ctcacatgca tatgggcatt gtgccctttg atgatgataa aaagctctca gctaagcgta
     1021 tattcaatcg tgaagcctta cagcacattc aagaggaatt accacagtac ctcaaagaaa
     1081 atggctttga tgttcaacgt ggtaacaaaa ataaagagcg taagaattta tcagtacccg
     1141 aatacaaagc tatgcgggaa gaattgaaaa aaatagagac cgaaaaacaa gagacacaag
     1201 caaagcttgc agatacaaaa aaacagcttg atgagatcaa accacgggat accaagaaaa
     1261 ttgctagtaa acccaccttg atgaataaaa ataaagtcac ggttgataaa tctgatctcg
     1321 ctgatttgga acaaagggcg gtgactagcg acgcttataa ctttgaaaaa attcatctgg
     1381 aagtaggaaa tcatagttta cgtaatgatt tgagtgaagc caagggccgc aactatgaac
     1441 tgagaaaaga aaatgagcga ttgcaaaaac tagtaggaac gcttcaaggc attatacgaa
     1501 atgttgatga gtttctacac aaaaaactag gtattaattt acctgaaaag tggctagagc
     1561 gtgcaggact aaaagaaccg tctaaaaaag cccctgaaag ctcacaggaa ctcgacagac
     1621 ataaatctga tgaattaggc ggtccacatc tttaaatcgc ttatacgagc ttaaaatggc
     1681 gtttaagagc ttaatttacc atctcgctag attgaacgta gttaactttg tgtccgtcaa
     1741 cggtaaatcg acgtaggcgt tttatagccg ctgggctatt agacgcccta ggaggcttta
     1801 aggagttgat agactagcgg ataaaacact tttgcacatg caaagaaaag cacccctgct
     1861 ttttttgcct gccccacggc gagtgcgggg tgagtttagc gggtgctccc gtcatttatg
     1921 gggtcaagct gacacagctt gcgggtttgg gcagagccca tattttggtt tggtttgagt
     1981 gggataaaaa aattgggcga aaaacatggg ggtactacga caccccccca tgtgtccatt
     2041 gtccattaaa cagaacactt ttttcaagaa accttttagg ttaggggttt tcgggggggt
     2101 ttgagatttt ataaaaaatg ttgtatttct aacgtatgta taatataatg atggaataga
     2161 gataaaaata gtaagaaaga aggttttttg atggttgaag ttgaaaagaa aaaaattact
     2221 ttgtctatac ctgttgaaac taatggaaag ctggaagaat tggcccagaa atatggcatg
     2281 actaaatctg gattggttaa ttttttggtt aatcaggttg cagaagctgg aactatttat
     2341 aggcaataaa aaaagcgccc tgtgcatagg acgcaatcta aaagtctgtg aggtaattat
     2401 aacatatgaa aagtgaatct aaaatcgatt ggacggtacc tcgtccaaat aaaaatccca
     2461 aaacaaaaca gccttataaa cgtggtcgta attggggtat tgttgtttat cctgaaagtc
     2521 ttcctgaaaa ttggaaagat attatcaggc aagagcctat tgctgtcagt cccttacatg
     2581 ataaagatgt taaccctgat ggagaaaaga aaaaatctca ctatcatctt gttttgaact
     2641 ataaagggaa caaatctttt gaacaaattg atgaaattgc taggtcttta agggcgcctg
     2701 ctcctcaaag aattagtagt ttaactggcg ctgttaggta cttgacacat atggataatc
     2761 ctgaaaaata tcagtatgat aatgctgata ttgagacctt tggaggcttt gatttagaga
     2821 gttgcttagc tctttctact ggcgataagc gccaagcctt acgtgacatg ttggctttta
     2881 tttctgaaaa tgaaattatg catttaaaag actttgcaga ttattgcatg tctgaggaag
     2941 caccagctgg ctggttcgaa cttctaactg aaaggaatac gctttttatt aaagaatata
     3001 tcaagtcaaa ttggcagaaa caacagtatg ctagtaaaaa catcaataaa atgtcggatt
     3061 aaaattttat tgatgttgtt gctatattat tagtgaaagg atggtttact ttatgccaac
     3121 aagaaaaaat attttagatg atattcaaga acatattgac aatgaagaac gtgttttggt
     3181 tactaattca agcaaaatta actagcacca cgcgtataga gtgatttaaa ataactaaca
     3241 tcgtttttat ttgaatttag aagggaagag atttttatta aaaatatagg ttttaactca
     3301 aattatttta aaacctggta tttttggcta ggcatattaa cggtagtggg attaatcggt
     3361 gatcccattt tacactatca ttcttcaact agtccgtggt tacaaatact tattgctatt
     3421 ttattatttg tagcagcatt taccaaaaaa ataaataata actgacttaa atcgcaattc
     3481 actctaaact tttaacaaat ttgttatcat aattgggtaa ggtgtttgca agttaagtat
     3541 ttttccc
//
LOCUS       RATUD2A01    1088 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 1.
ACCESSION   M35202 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     1 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1088)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP-glucuronosyltransferase, UDPGTr-2, gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pept        349  +  1072     UDP glucuronosyltransferase-2 (UDPGTr-2)
    pre-msg     315  >  1088     UDP glucuronosyltransferase-2 mRNA and intron
    IVS        1073  >  1088     UDP glucuronosyltransferase-2 intron A
    signal      288      293     CAT box
BASE COUNT      329 a    194 c    216 g    349 t
ORIGIN      
        1 ctgcagtcaa cggatcttca ctgctatgta agaacattta agaaataaga gctttcatct
       61 gtgattttta catgactcta acacgttata atcaacagat gatgtttgca catgagaagt
      121 gattcaattt tggctgaata gaatcaggga caaaaaagac aaataaactc tgttaacctt
      181 gagctcatgt tccatgcttg tatttacaca tggcgtaaca tcattgcact catctaatcg
      241 gtgatggttt aaaagttata tattaatttc ttgggtgact gaactttcat aaaaaacatg
      301 aatatctaca atgaacgaca gatatcaaaa gcattccatt tctgcaagat gtctatgaaa
      361 cagacttcag tgtttctgtt gatacagctc atatgctact ttagacctgg agcctgtgga
      421 aaagtgctag tgtggcccac agaatacagc cactggatta atataaagat aattctgaat
      481 gaacttgccc agagaggtca tgaagtcacg gttcttgtat cttcggcttc cattctcatt
      541 gagcctacca aggaatcttc tattaatttt gagatttact ctgtaccttt gagtaaaagt
      601 gatcttgaat atagttttgc aaaatggata gatgaatgga cacgtgattt tgaaacactc
      661 tcgatttgga catattattc aaaaatgcaa aaagtcttca atgaatattc tgatgtcgtt
      721 gaaaatttat gcaaagcact catttggaac aagagtctta tgaaaaaact ccaaggatct
      781 caatttgatg tcattctcgc agatgctgtg ggtccctgtg gtgagctgct agcagaactg
      841 cttaagacac ctttagtgta cagtctccgc ttctgtcctg gatacagatg tgaaaagttc
      901 agtgggggac ttccactgcc tccttcctat gtgcctgttg ttctttcaga attaagtgac
      961 cgcatgacat ttgtggaaag agtgaagaat atgttgcaga tgctgtattt tgacttttgg
     1021 tttcaaccat ttaaagagaa gtcctggagt cagttttaca gtgatgttct aggtaaactg
     1081 tgcctttc
//
LOCUS       RATUD2A02     373 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron A.
ACCESSION   M35078 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     2 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 373)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >   373     UDP glucuronosyltransferase-2 mRNA and intron
    IVS     <     1  >   373     UDP glucuronosyltransferase-2 intron A
BASE COUNT      137 a     75 c     56 g    105 t
ORIGIN      About 0.1 kb after segment 1.
        1 aaaatgctat agagtaactg agcagaacac tccaaaaatt actatccatg taaactgaga
       61 caaagatttc tcttagtaat cactagatct actctaagtt tgtcttagta aaagaaactc
      121 caagtttctc gaatgcttta atgactgtag atgcgaacac taaagagtca ttatatacca
      181 ccacaactat ctgtgtagca cagaaggaaa catgttccct tatacaaatt actcacttgc
      241 aaatgatgaa aaaactccaa ggagctaagt ttgatgttat cacctagaat atcacgacag
      301 gttttctcac aattaaatca tatcactaga accagaaaca gtcaaggcat cttagtttct
      361 tcgagttcag ctg
//
LOCUS       RATUD2A03     380 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron A.
ACCESSION   M35079 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     3 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 380)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >   380     UDP glucuronosyltransferase-2 mRNA and intron
    IVS     <     1  >   380     UDP glucuronosyltransferase-2 intron A
BASE COUNT      154 a     49 c     55 g    122 t
ORIGIN      About 0.1 kb after segment 2.
        1 tcaaataaaa tagtacctaa attaatagga gaaagaattt aaaggttaac tatttgtgga
       61 aatatccagg tgtaactttg acatatacaa ctaagttagt attacttgtc tcttctaata
      121 ggcacagcac agtagtgata aaaagaaact tagtcataaa ctgcagatta tcacagtgca
      181 tttcaagaat cagaaatcaa aagaatagct actaaaatgt ataaagtaga tgaaatattc
      241 tacaaaagtt gatttttcta aggcattttc aagctttttt gcaaggaaca aatgttccaa
      301 attcattggt gtaactttag aaaacatgta attgacaaca ttgatattat gttatacatt
      361 atatcataat caaatgactt
//
LOCUS       RATUD2A04    1435 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 2.
ACCESSION   M35080 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     4 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1435)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >  1435     UDP glucuronosyltransferase-2 mRNA and intron
    pept    +    51  +   199     UDP glucuronosyltransferase-2, exon 2
    IVS     <     1       50     UDP glucuronosyltransferase-2 intron A
    IVS         200  >  1435     UDP glucuronosyltransferase-2 intron B
BASE COUNT      375 a    301 c    268 g    485 t      6 others
ORIGIN      About 0.2 kb after segment 3.
        1 ccacaaaacc tcttttcacc attgagtatt tttatctgtt ttggatgcag gtagacccac
       61 aacattaact gagatgatgg ggaaggcaga tatatggctc attcgaacct tctgggactt
      121 ggaatttcca cacccattct tacctaattt tgactttgtt ggaggactac attgcaaacc
      181 agccaaacca ctgcctaggg taacattgga ttgttttcct tgataaactg ttcgttcctt
      241 tatcattctt tatttgtttt tacaaagagg atagtttatt ttaattatta atatttatct
      301 ttaatctttt tttacagtcc agtaattatc cccttctgga ccaccctcgt tccatcctcc
      361 tcctcccttg ctccaagagt atgtatgcca ggagcctcct gcgatggaga ggatagtgtc
      421 aggggtgcag gagggaacaa agtaagactc tggtgtggct ttaaagctga cggtctcctg
      481 acattctaac tctctacctg ttcagaaaca ctgatgataa cttctagaaa atcatacaaa
      541 ctttcttgct ctttctcatg ataaaaggct gctggcttgg gaatcagtac ctgtaactta
      601 acaacagagg attgagcaat gtggccttgg tcctatatag taggaactgt gtggctctaa
      661 ctttcagcct gctagtcaga anngcagaag ggatctttcc acatgatgtc tcctccttct
      721 tcttcttgta gtcctcctct actctcctgg attctcaact gggatcagac gccctgccct
      781 cttctcttct gcccagctga tcgattcttt attaactaat caaggatgat ctaaattatt
      841 ttatacataa cattgagacc agtgatgctt gactgtgcca aattttggac tgcaaccaga
      901 tatctgggca taaaaattag cacatgaata cacagtgtaa aaaaaaaacc gtcccctaac
      961 actcacctat tgttttctgc atgtgggtga gtctacatgt gtctgatggg aggcctgtgc
     1021 atgtttcttt ttacaactag gtcccttnnc tg
gtatataa gtttcattac taggaagtgt
     1081 tagcatttaa tggtaatttt gttagatgga tgggattgtg aatttaaaac ttgccttgaa
     1141 gtagattttg agtgacatag cacattttta aattttattt tgtgtttttt taaagaggac
     1201 atctctctat agcttanntg tccttaacct catagcagtc cttctgcctc agtctcccat
     1261 gtgctgagat tagaccagtc ttaatacctc ttctgaaaca tgatgtgtaa tatcagtgat
     1321 ggagatctta ctgtgcacag ctttagatca tgatgtttag cagattgtaa cttccattca
     1381 tgagaagaaa ctgcacaaac catctcattc ctgtcttact ttattgattg gaagc
//
LOCUS       RATUD2A05     769 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron B.
ACCESSION   M35081 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     5 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 769)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >   769     UDP glucuronosyltransferase-2 mRNA and intron
    IVS     <     1  >   769     UDP glucuronosyltransferase-2 intron B
BASE COUNT      293 a    116 c    117 g    243 t
ORIGIN      About 0.1 kb after segment 4.
        1 aaatgctact tcatttgatc ttgaaggtgt gtgagctgtc attatttaat tggtacggta
       61 tttctttcaa ataaacaatt aaaatagtgt tcttttcttt aaaaaaataa agaaaaaaga
      121 gatcataaag aaaaaaagaa gttgcagaaa gaaaagggga caccttgaaa agtgattata
      181 gcacttatta ctaagttgta aaaggtttcc tatgaaaact atctaagaag ataagtagaa
      241 aagtcctaat gagggaaagg aaaaaaaaat tcttctcctt ctcatcattt tgtcctcagt
      301 acttacacat cttttcagaa tacatgacca caagttaaaa gtcataacaa aaaattaaat
      361 aataaattta agtagaagtt tacaagaaaa aaatgcttac atgcatatcc attaggagta
      421 atttctggct aaacaccatt cacatggctc cacaggttca tagaaggttg aaaaccataa
      481 ttaaaattat tagtgaagtt ttgtattgat gaacccagtc catattttat cttctgtctt
      541 agcacctata ataaatttta gttccctttt tacgaccttt agttaagtgt tttacaacct
      601 cttggattgt gctctgagaa gaagaaagtc tggttgctat ctaagaacaa ttaactggtg
      661 acacatagga gactgataca gttctcattg cacttttcac tatcagaaaa ggaactaaaa
      721 taattccact ataaaagagc ttaataatca ctgatatact tagatctct
//
LOCUS       RATUD2A06     359 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 3.
ACCESSION   M35082 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     6 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 359)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >   359     UDP glucuronosyltransferase-2 mRNA and intron
    pept    +   175  +   306     UDP glucuronosyltransferase-2, exon 3
    IVS     <     1      174     UDP glucuronosyltransferase-2 intron B
    IVS         307  >   359     UDP glucuronosyltransferase-2 intron C
BASE COUNT      114 a     69 c     75 g    101 t
ORIGIN      About 4.0 kb after segment 5.
        1 gtagtatagt acaaatgcac acttaatgaa cactgggtac cgaggcaatg gatacactgg
       61 tctcccaaaa taattccagg aattacataa tttcctctgg taagtttgtc tcggtagttg
      121 agacaatgct tcccatgcaa ccattcatct gtgatgtcat aaccatcttc ataggaaatg
      181 gaagaatttg ttcagagctc tggagaacat ggtgtagtgg tgttttctct gggatcaatg
      241 gttaaaaacc tgactgaaga aaaagccaat gtagttgctt ctgctcttgc ccaaattcca
      301 cagaaggtaa gataaaatgt ccacagagat ggcaaatgta ttataagtca tctgaaccc
//
LOCUS       RATUD2A07     609 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exons 4 and 5.
ACCESSION   M35083 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     7 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 609)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +    69      156     UDP glucuronosyltransferase-2, exon 4
                292  +   511     UDP glucuronosyltransferase-2, exon 5
    pre-msg <     1  >   609     UDP glucuronosyltransferase-2 mRNA and intron
    IVS     <     1       68     UDP glucuronosyltransferase-2 intron C
    IVS         157      291     UDP glucuronosyltransferase-2 intron D
    IVS         512  >   609     UDP glucuronosyltransferase-2 intron E
BASE COUNT      170 a    119 c    115 g    205 t
ORIGIN      About 0.6 kb after segment 6.
        1 ccaggaacaa attttaccaa agccttggaa tttctgtaat taaataaggc attgtctgtg
       61 tgtaacaggt tgtatggaga tttgatggta agaaaccaga taccttagga tctaacactc
      121 ggctgtacaa gtggatcccc cagaatgacc ttcttggtaa ggcaaagttt aactacaagt
      181 ttgtggctat agtaacacac tttcttgaga atagcacact tctgagtctt catattttcc
      241 tctcttaaat attattcggt caataattat gtcaacttct tctcattgca ggtcatccaa
      301 aaaccaaagc ttttgtagct catggtggaa caaatggcat ctatgaggca atctaccatg
      361 gcattcctat tgttggtatt cccttgtttg cagatcaacc ggataacatt aatcacatgg
      421 tagccaaagg agctgctgtt agagttgact tcagcatact gtcaactaca ggccttctca
      481 ctgccttgaa gattgtcatg aatgaccctt cgtgagtctg tttgtttgtt gaagttgttt
      541 tttccaagga aggctgtttc tttttctttt ttgaaacata atttttacta tataactaca
      601 agagctgcc
//
LOCUS       RATUD2A08     316 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron E.
ACCESSION   M35084 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     8 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 316)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >   316     UDP glucuronosyltransferase-2 mRNA and intron
    IVS     <     1  >   316     UDP glucuronosyltransferase-2 intron E
BASE COUNT      112 a     39 c     42 g    123 t
ORIGIN      About 0.1 kb after segment 7.
        1 ctcatagata tttgcttgct tcagcctcct gggtgctggg attagaaata tctgaattta
       61 tatttgctgt gaataactat tattttaaaa atattgacag attcagatga tcatcagatt
      121 gattttatcc tatttgaagg agggagaata atttcgaaaa attatgtttt tgcatatctg
      181 aaatatgtgc ttttttaaca ataaagttac tctaaatttc taattgaatc aattagacat
      241 gattattctc aaactattct atataaagaa ataatattac aaatatttat ctattataac
      301 aaaggacaca ttttct
//
LOCUS       RATUD2A09     487 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron E.
ACCESSION   M35085 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     9 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 487)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pre-msg <     1  >   487     UDP glucuronosyltransferase-2 mRNA and intron
    IVS     <     1  >   487     UDP glucuronosyltransferase-2 intron E
BASE COUNT      196 a     89 c     80 g    122 t
ORIGIN      About 0.05 kb after segment 8.
        1 gaataagaga cagtattaaa ttcatacaaa tacctggaga acactattgt aatttcaagg
       61 tttgctagaa gacaaatgta cctaatgaga aggtcctgag tcaaaaataa ctggagaaag
      121 tgctgttcgt tcctacatac acagtcttct agtccaggaa cagaattaaa ttgttttcat
      181 tgtggtgaat tcttgtggaa ctgttgtaca aagaagagtc ataaacaaca aagtgttttt
      241 agaagaagaa cctagttata aacagataca taggagagga aaaaaaacta gagaggagat
      301 atcgaacatg acatatgacc tggaaaaagt tctatggcta cttcccttct tggtcttata
      361 tcatgagtta catgttacac aaaaacacac acacacaaac aaacacacac aaacatacac
      421 acacacaaac atacacacac acaaacatac aaacacatac acacaagttt gtgtgtctta
      481 ctagttt
//
LOCUS       RATUD2A10     895 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 6.
ACCESSION   M35086 J05482
KEYWORDS    UDP glucuronosyltransferase-2.
SEGMENT     10 of 10
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pUDPGTr-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 895)
  AUTHORS   Mackenzie,P.I. and Rodbourn,L.
  TITLE     Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene
            and characterization of its promoter
  JOURNAL   J. Biol. Chem. 265, 11328-11332 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +   212      488     UDP glucuronosyltransferase-2, exon 6
    pre-msg <     1      836     UDP glucuronosyltransferase-2 mRNA and intron
    IVS     <     1      211     UDP glucuronosyltransferase-2 intron E
    signal      808      812     poly-A signal
BASE COUNT      248 a    206 c    163 g    278 t
ORIGIN      About 0.15 kb after segment 9.
        1 gttcataatt accctgtgct aaacaagact gtttcactgt ctttcctgtc actcaactct
       61 cctctgccac cacctgaaac aaaacacttg agtgggaagt atacatgatt tattttaagt
      121 tgcttgtgag acttttccct aaaacaacaa atgttgttaa gtcatcaaat tgcctcctct
      181 ttaatcttag ttgtatacat tgtcccttca gctataagga gaatgccatg agattatcca
      241 gaatccacca tgatcagcca gtgaagcccc tggaccgagc cgtcttctgg atcgagtatg
      301 tcatgcgtca caaaggagcc aagcacctcc gctcaactct gcatgacctt agctggttcc
      361 agtaccactc tctggatgtc attgggttcc tattgctctg tgtggtaggt gtggtattca
      421 tcatcacaaa attctgcctc ttttgttgcc gtaagactgc taacatggga aagaagaaga
      481 aagagtagca tcataaaggc tgaagcagag ccctgagaga tgagcctctg ccagctgctt
      541 ccagcaggaa cctgttgtca tgccagtgcc ttccctctaa aagaagacag cgttgggacc
      601 tcattgaaca tggctccaat gaattcacta tgttctgaag acatgcaaga tttcatgcca
      661 aatatatatt cagtgctaaa aaaacaaaat cctgtgttca gtttagaatg ttttgatgta
      721 gctgagaagc tttgcccaac aacaataact gaagctactg tagttcataa agttcacatg
      781 gctttatagc ctttgcaaaa catatctata aatcaattac tttttgaaaa tacccagcct
      841 gctttgtctt catttagtag actatttttc tctccttctt tcttttttct tcttt
//
LOCUS       RATUDPA      1858 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-5 (UDPGTr-5) mRNA, complete cds.
ACCESSION   M33746 J05440
KEYWORDS    UDP glucuronosyltransferase-5.
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            UDPGTr-5.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1858)
  AUTHORS   Mackenzie,P.I.
  TITLE     The cDNA sequence and expression of a variant 17B-hydroxysteroid
            UDP glucuronosyltransferase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pept         43     1635     UDP glucuronosyltransferase-5 precursor (EC
                                 2.4.1.17)
    sigp         43      111     UDP glucuronosyltransferase-5 signal peptide
                                 (put.)
    matp        112     1632     UDP glucuronosyltransferase-5
    mRNA    <     1     1858     UDP glucuronosyltransferase-5 mRNA
BASE COUNT      559 a    365 c    387 g    547 t
ORIGIN      
        1 agaaaggaac acagtgtgaa cagaaggatt ttgattttca aaatgccagg aaaatggatt
       61 tttgctctgc tcttgctgca gataagtttc tgcctcagat ctgcgaaatg tgggaaggtg
      121 ttggtgtggc cgatggaatt cagtcactgg atgaatataa aaacaatact ggatgaactt
      181 gtacagaggg gccatgaagt cactgttctg aaaccttcag cttactatgt tcttgatcca
      241 aaaaaatcgc cagaccttaa gtttgaaact tttcctacat ctgtcagtaa agatgaactg
      301 gaaaaatatt tcataaaact tgctgatgcg tggacttatg agttgcaaag agatacatgt
      361 ttgtcttttt ctcctttact acaaaatatg atggatgaat tttctgatta ttatctaagt
      421 gtttgtaaag acgccgtttc aaacaagcag ctcatggcaa aactacagga atccaagttt
      481 gatgttcttt tgtcagatcc tgtggctgcc tgtggggagc tgatagccga agtgctccac
      541 attccttttc tgtacagtct tcgtgcctct ccaggccata aaattgaaaa gtccagtgga
      601 agatttatac tacctccctc ttatgtgcct gtaattttgt caggattggg tggccaaatg
      661 acattcatag acagggttaa aaatatgata tgtatgcttt attttgactt ttggttccat
      721 atgtttaatg ccaagaattg ggatccattt tatactgaga ttttgggaag gcccaccacc
      781 ttagctgaga caatgggcaa agcagaaatg tggctcatta gatcctactg ggatttggag
      841 tttccccacc caacattacc aaatgttgac tacattggag gactccaatg caaacctgct
      901 aaacccttgc ccaaggatat agaagacttt gtccagagct ctggagagca tggtgtggtg
      961 gtgttttctc tggggtcaat ggtcagcagc atgacagaag aaaaggccaa cgcaattgca
     1021 tgggcccttg cccagattcc acaaaaggtt ctttggaaat ttgatggcaa aatcccagca
     1081 actttaggac ccaataccag agtctacaag tggcttcccc agaatgacct ccttggtcat
     1141 ccaaaaacca aagcctttgt aactcatggt ggagccaatg gtgtctatga ggccatctat
     1201 catggaatcc ctatgattgg cattcctatg tttggagaac aacatgataa cattgcccac
     1261 atggtggcca aaggagcagc tgttacactg aatatcagga caatgtcaaa gtcagatttg
     1321 ttcaatgcac ttaaggaagt aataaacaat cctttctata aaaaaaatgc tatgtggctg
     1381 tcaaccattc accatgacca acctatgaaa cccctggaca aggctatctt ctggattgag
     1441 tatgtcatgc gccacaaaag agccaagcac ctgagaccac ttggacataa ccttccctgg
     1501 taccagtacc actctctgga tgtgattgga ttcctgctag cctgtttggc agtcattgca
     1561 gcccttgctg taaaatgctt cttgttcatt taccgattct ttgcaaagaa gcaaaagaaa
     1621 atgaagaatg agtagagctc gttgacaatg cactacagga atgaaattta agcctcattc
     1681 taatttatga atcactttct taacacttcc tgattttttt ttgtggaggc agatcatcat
     1741 tgtaagaaga catatagctc tgtgaatatt gatatgttat caaaatttta aaatcactta
     1801 atgtaaaaaa gttgcattgt agaaaaattg aggaaaataa agtttacttg atagtctt
//
LOCUS       RATUDPB      2216 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Rat UDP glucuronosyltransferase-21 (UDPGTr-21) mRNA, 3' end.
ACCESSION   M33747 J05440
KEYWORDS    UDP glucuronosyltransferase-21.
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            UDPGTr-21.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2216)
  AUTHORS   Mackenzie,P.I.
  TITLE     The cDNA sequence and expression of a variant 17B-hydroxysteroid
            UDP glucuronosyltransferase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.I.Mackenzie, 13-APR-1990.
FEATURES       from  to/span     description
    pept    <     1      463     UDP glucuronosyltransferase-21 (AA at 2) (EC
                                 2.4.1.17)
    mRNA    <     1     2216     UDP glucuronosyltransferase-21 nRNA
BASE COUNT      738 a    386 c    386 g    706 t
ORIGIN      
        1 agccaatggt gtctatgagg ccatctatca tggaatccct atgattggca ttcctatgtt
       61 gggagaacaa catgataaca ttgcccacat ggtggccaaa ggagcagctg ttacactgaa
      121 tatcaggaca atgtcaaagt cagatttgtt caatgcactt aaggaagtaa taaacaatcc
      181 tttctataaa aaaaatgcta cgtggctgtc aaccattcac catgaccaac ctatgaaacc
      241 cctggacaag gctatcttct ggattgagta tgtcatgcgc cacaaaagag ccaagcacct
      301 gagaccactt ggacataacc ttccctggta ccagtaccac tctctggatg tgattggatt
      361 cctgctagcc tgtttggcag tcattgcagc ccttgctgta aaatgcttct tgttcattta
      421 ccgattcttt gcaaagaagc aaaagaaaat gaagaatgag tagagctcgt tgacaatgca
      481 ctacaggaat gaaatttaag cctcattcta atttatgaat cactttctta acatttcctg
      541 attttttttt gtggaggcag atcatcattg taagaagaca tatagctctg tgaatattga
      601 tatgttatca aaattttaaa atcacttaat gtaaaaaagt tgcattgtag aaaaattgag
      661 gaaaataaag tttacttgat agtcttaaaa atcacagtat taaccttaca atatttgaat
      721 attgtccatt gacctctttc tctgagactg aatctgtagc tttcatacaa ataagtagct
      781 aacttgtata ctataaatat ggacatataa atagtttttt ctgtaatagt cttaattatt
      841 tgtagtcggg gataaagtgt ggtttggttt ggatattcat ttcaaagggt aggaatctgt
      901 tggctatttt gttcctgtaa caaaatgtgc tgaccaaaag catctccagg gaaaagcaga
      961 gcagtttatt ttgagttgtg cttacagatc ctgagaacgc aggatagata ggaaggcagg
     1021 gcagcagtca gccagatgac aaaactctct cattacatct taaccacaca tagaaagcac
     1081 aaagagtgag caaaaagtgt gactatggtg tgaactttca aagcttgctc cagtgatata
     1141 tttcctccaa aaagatttaa cccctttaaa taatattcct gtacccctgg agttgggagt
     1201 ttagctcagt ggtagagcat ttgcctacca aacacaaggc tctgtgttca gtcctcagct
     1261 ccgggggaaa aaaaaaagaa agattccata acctcaaaca gcattacaaa ttttggaaaa
     1321 tgtgctaaaa ttcatcagcc tatctgaaac attttacatt gaatccataa caggaaataa
     1381 acctgtttct taattcttat tttttagcat accattctaa tactccaagt tctaacacag
     1441 cacttgtacc tcttcaatgt aatttaacta tgatcatgag gcataatgtt cattggaaat
     1501 gaagcatatg aacaggaaac aaataaaagt cctaactaaa gtaaacttag ctttgagatt
     1561 ggctattaca agtctggttg taattccact aatgctgcca tatgctgtga ggaatgttat
     1621 aaaagagcta tgtaactatt atgacagttg tagcttttag cattgaaata catagatatt
     1681 aatataaaag taagtgtata atatgatgct taaatgtgta acctaatatt ttagaataaa
     1741 tttaattagt ggaaacattc tagacaggaa cagtaaatat atccaacatc attattcttt
     1801 gatttaaaaa atgcaatttg gaggttcttc cctgcaaaag actatctctt tcccactctt
     1861 aacattactt aggtgcttat tacagtttta tgttgagttg gggaaagggt aaaattgacc
     1921 cctttccata ttagcatgaa tattggtatc atctttattg agatcttgtt taggaaccca
     1981 ttatgagact tcaggagtat aactttcata atgtttgtaa tagatgcaac tttacagcag
     2041 acaacttgat ccttctggcc tcttaaatct ttccatcctc tattatgtaa tgttttgttg
     2101 atagttactt cagtatttga cacaagattc aataatttta tgcctatggg ttccatcaaa
     2161 catcatgact ctatatatat gtaaatccaa aataagaaat aaaaaatagt gtatct
//
LOCUS       BCEHEMOL      280 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  B.cereus hemolysin gene, partial cds.
ACCESSION   M35411
KEYWORDS    hemolysin.
SOURCE      B.cereus DNA.
  ORGANISM  Bacillus cereus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 280)
  AUTHORS   Gilmore,M.S., Gilmore,K.S. and Goebel,W.
  TITLE     A new strategy for ordered DNA sequencing based on a novel method
            for the rapid purification of near-milligram quantities of a cloned
            restriction fragment
  JOURNAL   Gene Anal. Tech. 2, 108-114 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   280     hemolysin (AA at 2)
BASE COUNT       99 a     43 c     51 g     87 t
ORIGIN      
        1 gaattctcat ttatggattg taaaccgtgc aattgatatt atgtctcgta atacaacact
       61 tgtaaaacaa gatcgagttg cactattaaa tgaatggcgt actgagttag agaacggtat
      121 ttatgctgct gactatgaaa atccttatta tgataatagc acatttgctt cacatttcta
      181 tgaccctgac aatgggaaaa cttatattcc gtatgcaaag caggcaaagg aaactggagc
      241 taaatatttt aaattagctg gtgagtctta caaaaataaa
//
LOCUS       BPEFHAA       164 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  B.pertussis filamentous hemagglutinin antigen gene, partial cds.
ACCESSION   M35274
KEYWORDS    filamentous hemagglutinin antigen.
SOURCE      B.pertussis DNA, clone lambda-FHA15.
  ORGANISM  Bordetella pertussis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Alcaligenaceae.
REFERENCE   1  (bases 1 to 164)
  AUTHORS   Mattei,D., Pichot,F., Bellalou,J., Mercereau-Puijalon,O. and
            Ullmann,A.
  TITLE     Molecular cloning of a coding sequence of Bordetella pertussis
            filamentous hemagglutinin gene
  JOURNAL   FEMS Microbiol. Lett. 37, 73-77 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   164     filamentous hemagglutinin antigen (AA at 1)
BASE COUNT       38 a     45 c     60 g     21 t
ORIGIN      
        1 gaattccaca tgcacctgga tgcgccgcgc atcgagaaca ccgcgaaact gacgcggcga
       61 ggtgcaacgc aaaggcgtgc aggacgtcgg gggaggcgag cacggccgct ggacgtatcg
      121 gctatgtcaa ctactggttg cgcgcgcatg gaagaaggcg ggca
//
LOCUS       BPEFHAB       165 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  B.pertussis filamentous hemagglutinin antigen gene, partial cds.
ACCESSION   M35275
KEYWORDS    filamentous hemagglutinin antigen.
SOURCE      B.pertussis DNA, clone lambda-FHA15.
  ORGANISM  Bordetella pertussis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Alcaligenaceae.
REFERENCE   1  (bases 1 to 165)
  AUTHORS   Mattei,D., Pichot,F., Bellalou,J., Mercereau-Puijalon,O. and
            Ullmann,A.
  TITLE     Molecular cloning of a coding sequence of Bordetella pertussis
            filamentous hemagglutinin gene
  JOURNAL   FEMS Microbiol. Lett. 37, 73-77 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   165     filamentous hemagglutinin antigen (AA at 1)
BASE COUNT       30 a     54 c     55 g     26 t
ORIGIN      
        1 gaattcggac cagcctggcc cgagcgctgc atgccgcgcg ggaaggccca cacagttggt
       61 cccgacactg ccactttccg agtcccatcg caacgggcgg tgatccactc gtcgttggcg
      121 cgtgatagac agcgcgtgca tgcgagagcg catgcagcag gctgg
//
LOCUS       CRECYCA       662 bp ss-mRNA            PLN       26-JUL-1990
DEFINITION  C.reinhardtii mitochondrial apocytochrome c (cyc) mRNA, complete
            cds.
ACCESSION   M35173
KEYWORDS    apocytochrome c; cytochrome c apoprotein.
SOURCE      C.reinhardtii, cDNA to mRNA, clone C321.
  ORGANISM  Chlamydomonas reinhardtii
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Volvocales; Chlamydomonadaceae.
REFERENCE   1  (bases 1 to 662)
  AUTHORS   Amati,B.B., Goldschmidt-Clermont,M., Wallace,C.J.A. and
            Rochaix,J.-D.
  TITLE     cDNA and deduced amino acid sequences of cytochrome c from
            Chlamydomonas reinhardtii: Unexpected functional and phylogenetic
            implications
  JOURNAL   J. Mol. Evol. 28, 151-160 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         42      380     apocytochrome c (cyc)
BASE COUNT      147 a    176 c    200 g    139 t
ORIGIN      
        1 ccgaaccaaa acctttcctg tgacccttct atctgcttaa aatgtcgacc ttcgctgagg
       61 cccccgctgg cgaccttgct cgcggcgaga agattttcaa gaccaagtgc gcgcaatgcc
      121 acgttgctga gaagggcggc ggccacaagc agggccccaa cctgggcggt ctgttcggcc
      181 gtgtctcggg cactgctgcc ggcttcgcat actcgaaggc gaacaaggag gctgccgtga
      241 cctggggcga gagcactctc tacgagtacc tgctgaaccc caagaagtac atgcctggca
      301 acaagatggt gttcgctggc ctgaagaagc ccgaggagcg cgccgatctg attgcctacc
      361 tgaagcaggc gactgcttaa actgcgcgcg gcttagcaag cggcttcatt cattaggcag
      421 aagcgggtct caagagcggg atagggttgc atctgggcgc ggcgtgtgtt cgcttcagaa
      481 cgtcccacca gatgcaacag gcggatgtgt tacgagtgtc gagtgtgtac tgatgatggt
      541 gtgcatgtgt aacggcgaca tacggatgga atagacatat cgtcttgaag actgtctcat
      601 aggcagagac atctgctcac aggcaactta ttatgtctgc catgggcggt cgtaaagaat
      661 tc
//
LOCUS       ECOABC       1993 bp ds-DNA             SYN       26-JUL-1990
DEFINITION  Synthetic plasmid (for E.coli) DNA.
ACCESSION   M34519
KEYWORDS    b-galactosidase; b-lactamase; bla gene; lacZ gene; promoter.
SOURCE      Synthetic DNA.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 1993)
  AUTHORS   Hayden,M.A., Shallcross,M.A., Stotland,E. and Mandecki,W.
  TITLE     A totally synthetic plasmid for general cloning, gene expression
            and mutagenesis in Escherichia coli
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Mandecki, 22-MAY-1990.
            
               Author address:W.Mandecki
               Abbott Laboratories
               Corporate Molecular Biology D93D
               Abbott Park, IL 60064
FEATURES       from  to/span     description
    pept        119      301     b-galactosidase
    pept        438     1298     b-lactamase
    site         81      327     lacZ fragment mRNA
    site        136      137     SmaI site for introduction of multicloning
    signal      301      336     trpA terminator
    signal      337      402     bla promoter P3
    mRNA        403     1329     bla mRNA
    signal     1299     1342     phage fd terminator
    signal     1343     1408     RNAII promoter
    mRNA       1409     1962     RNAII mRNA
    signal     1415     1440     RNAI terminator
    signal     1518     1555     RNAI promoter
    mRNA       1422     1517     RNAI mRNA
    site       1961     1962     RNaseH cleavage site
BASE COUNT      524 a    475 c    500 g    494 t
ORIGIN      
        1 gaattgatta atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctg
       61 ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat
      121 gactatgatt acgcccgggc ttgccgtcgt tttacagcga cgagactggg aaaatcctgg
      181 cgttacccaa cttaatcgcc ttgccgcaca cccccctttc gccagttggc gtaatagcga
      241 agaagcccgc accgaccgcc cttcccaaca gttgcgtagt ctgaatggcg aatggcgtta
      301 aactagtagc ccgcctaatg agcgggcttt tttttaattc ccctatttgt ttatttttct
      361 aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat
      421 attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg
      481 cggcattttg ccttcctgtt tttgctcacc cagaaacgct cgtgaaagta aaagacgcag
      541 aggaccaatt gggggcacga gtgggataca tagaactgga cttgaatagc ggtaaaatcc
      601 ttgagagttt tcgccctgaa gagcgttttc caatgatgag cactttcaaa gttctgctat
      661 gtggagcagt attatcccgt gtagatgcgg ggcaagagca actcggacga cgaatacact
      721 attcgcagaa tgacttggtt gaatactccc cagtgacaga aaagcacctt acggacggaa
      781 tgacggtaag agaattatgt agtgccgcca taacgatgag tgataacact gcggcgaact
      841 tacttctgac aaccatcggt ggaccgaagg aattaaccgc ttttttgcac aatatgggag
      901 accatgtaac tcgccttgac cgttgggaac cagaactgaa tgaagccata ccaaacgacg
      961 agcgagacac cacaatgcct gcggcaatgg caacaacatt acgcaaacta ttaactggcg
     1021 aactacttac tctggcttca cggcaacaat taatagactg gcttgaagcg gataaagttg
     1081 caggaccact actgcgttcg gcacttcctg ctggctggtt tattgctgat aaatctgggg
     1141 caggagagcg tggttcacgg ggtatcattg ccgcacttgg accagatggt aagccttccc
     1201 gtatcgtagt tatctacacg acgggtagtc aggcaactat ggacgaacga aatagacaga
     1261 ttgctgaaat aggggcttca ctgattaagc attggtaaac cgatacaatt aaaggctcct
     1321 tttggagcct ttttttttgg acggaccgag tagaaaagat caaaggatct tcttgagatc
     1381 ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg
     1441 tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag
     1501 cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact
     1561 ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg
     1621 gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc
     1681 ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg
     1741 aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggacaaagg
     1801 cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag
     1861 ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc
     1921 gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct
     1981 ttttacggtt cct
//
LOCUS       HS1PROM       591 bp ds-DNA             VRL       26-JUL-1990
DEFINITION  Herpes simplex virus type 1 joint promoter.
ACCESSION   M34532
KEYWORDS    promoter.
SOURCE      Herpes simplex virus type 1 (strain KOS) DNA, clone pRAB6.
  ORGANISM  Herpes simplex virus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 591)
  AUTHORS   Bohenzky,R.A., Papavassiliou,A.P., Gelman,I.H. and Silverstein,S.
  TITLE     Identification of novel transcripts mapping to the joint region of
            Herpes simplex virus type 1
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Silverstein, 23-MAY-1990.
            
               Author address:S.Silverstein
               Dept. of Microbiology
               Columbia University
               701 W. 168th ST.
               N.Y.,  N.Y. 10032
FEATURES       from  to/span     description
    binding     134      139     CTF binding site
    binding     290      295     Sp1 binding site
    binding     508      513     Sp1 binding site
    binding     387      392     Sp1 binding site
    binding     480      485     Sp1 binding site
    binding     492      497     Sp1 binding site
    site        298      306     Oct1/a-TIF site
    site        327      330     a4 enhancer
    site        366      372     E4TF1 site
    signal      528      533     TATA box
BASE COUNT      103 a    193 c    204 g     91 t
ORIGIN      Map position 0.794-0.798.
        1 gcatgcccct cccgccgacg caacaggggc ttggcctgcg tcggtgcccc ggggcttccc
       61 gccttcccga agaaactcat taccataccc ggaaccccag gggaccaatg cgggttcatt
      121 gagcgacccg cgggccaatg cgcgaggggc cgtgtgttcc gccaaaaaag caattaacat
      181 aacccggaac cccaggggag tggttacgcg cggcgcggga ggcggggaat accggggttg
      241 cccattaagg gccgcgggaa ttgccggaag cgggtaatgt cggccggggc cgcccattaa
      301 tgagtttcta attaccatac cgggaagcgg aacaaggcct ctgcaagttt ttaattacca
      361 taccgggaag tgggcgcccg cccagtgggc gggagttacc gcccagtggg ccggcccgac
      421 gactcggcgg acgctggttg gccgggcccc gccgcgctgg cggccgccga ttggccagtc
      481 ccgccctccg agggcggccc gcctcggggg cgggccggct ccaagcgtat atatgcgcgg
      541 ctcctgccat cgtctctccg gagagcggct tggtgcggac ctgcagccaa g
//
LOCUS       MZEHETRO      184 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  Corn heterochromatin repetitive DNA.
ACCESSION   M35408
KEYWORDS    .
SOURCE      Corn knob heterochromatin DNA, clone pZm4.25.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 184)
  AUTHORS   Peacock,W.J., Dennis,E.S., Rhoades,M.M. and Pryor,A.J.
  TITLE     Highly repeated DNA sequence limited to knob heterochromatin in
            maize
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 4490-4494 (1981)
  STANDARD  simple staff_review
BASE COUNT       58 a     41 c     42 g     43 t
ORIGIN      
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcgtctggtc gatgatcaat
      181 ggcc
//
LOCUS       RATRSB1       170 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Rat B1 repetitive sequence.
ACCESSION   M35409
KEYWORDS    B1 repetitive sequence.
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 170)
  AUTHORS   Blin,N., Weber,T. and Alonso,A.
  TITLE     Cross-reaction of snRNA and an Alu I-like sequence from rat with
            DNAs from different eucaryotic species
  JOURNAL   Nucleic Acids Res. 11, 1375-1388 (1983)
  STANDARD  simple staff_entry
BASE COUNT       57 a     29 c     44 g     40 t
ORIGIN      
        1 aaaaaaaagc aaatgacagc tgtgtgtggt ttcatatgtg tttaatccag cactcaggag
       61 gcagaggtaa atggatctct gtgagttcga gtccagtctg gctacaaagc aagttctaga
      121 gcagccaggg ctgttacaca gagaaactct gtcttggaag ataaaaaaga
//
LOCUS       SHFINV        261 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  Plasmid pINV (from S.flexneri) RepA gene, 5' end.
ACCESSION   M35403
KEYWORDS    .
SOURCE      Plasmid pWR110 (from S.flexneri 5) DNA.
  ORGANISM  Shigella flexneri
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 261)
  AUTHORS   Silva,R.M., Saadi,S. and Maas,W.K.
  TITLE     A basic replicon of virulence-associated plasmids of Shigella spp.
            and enteroinvasive Escherichia coli is homologous with a basic
            replicon in plasmids of IncF groups
  JOURNAL   Infect. Immun. 56, 836-842 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        256  >   261     repA protein
    mRNA        171       82 (c) inc mRNA
BASE COUNT       70 a     54 c     68 g     69 t
ORIGIN      
        1 gatcgtttaa ggaattttat ggctggccac gccttaaggt ggcagggaac tggttctgat
       61 gtggatgtac aggagccaga aaagcaaaaa ccccgataat cttctttaac tttggcgagt
      121 cagaaagatt accggggccc acttaaaccg tatagccaac aatcaagcta tgcggggagt
      181 atagttatat gcccggaaaa gttcaagact tctttctgtg ctcgctcctt ctgcgcattg
      241 taagtgcagg atggtgtgac t
//
LOCUS       YSYPSKLA      598 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  S.kluyveri plasmid pSKL left-end inverted terminal repeat.
ACCESSION   M35319
KEYWORDS    .
SOURCE      S.kluyveri plasmid pSKL DNA.
  ORGANISM  Saccharomyces kluyveri
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 598)
  AUTHORS   Kitada,K. and Hishinuma,F.
  TITLE     A new linear DNA plasmid isolated from the yeast Saccharomyces
            kluyveri
  JOURNAL   Mol. Gen. Genet. 206, 377-381 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
BASE COUNT      247 a     18 c    206 g    127 t
ORIGIN      
        1 aaaaggtata gatatagata tattttttat gggtttggaa gggggaagtg gaagaatgta
       61 tcgtgtaaaa aaagagcaaa aaaaaaatta gatgagagaa ggggaaaaga ggggagtgta
      121 tcatgtgaaa aaacgcgtca aaatgaagag aagggaaaaa ggggagagtg tatcgtgggg
      181 aaagtgaatt ttgaagaaga gaaggggaaa agaggggagt gtatcgtcta agaagggggt
      241 attataagag aaggggatat tggtagagtg tattgaatgt ggcttagcaa aaatagaaaa
      301 agggtaaaaa atgggggata aaaaaaagaa aaaaacggta ttaaggggag aaggggaaaa
      361 gggtagagtg tatcgtgcaa aaagtgagtt caaaatgaag agaaggggaa aagggtagag
      421 tgtatcgtgg gggaaagtga gtttaaatga agagaagggg aaaagggtag agtgtatcgt
      481 gggggaaagt gagtttaaat gaagagaagg gaaaaagggg gagtgtatcg tataaaaagt
      541 gaatatattt tatttgatgg gattaagtat tgaaaatgga aatggatgat aggttgtt
//
LOCUS       YSYPSKLB      117 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  S.kluyveri plasmid pSKL right-end DNA.
ACCESSION   M35320
KEYWORDS    .
SOURCE      S.kluyveri plasmid pSKL DNA.
  ORGANISM  Saccharomyces kluyveri
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 117)
  AUTHORS   Kitada,K. and Hishinuma,F.
  TITLE     A new linear DNA plasmid isolated from the yeast Saccharomyces
            kluyveri
  JOURNAL   Mol. Gen. Genet. 206, 377-381 (1987)
  STANDARD  simple staff_review
BASE COUNT       35 a      5 c     18 g     59 t
ORIGIN      
        1 caaaaagtga gattaggggg agaatatatt tattatgtta aatataaggt agttttttta
       61 taatttattt aatttatttt gtttgtattt tagcttcttt aattagtctg tattctt
//
LOCUS       XELTRH       1442 bp ss-mRNA            VRT       26-JUL-1990
DEFINITION  X.laevis thyrotropin releasing hormone (TRH) mRNA, complete cds.
ACCESSION   M34699 K00931 J05514
KEYWORDS    thyrotropin releasing hormone.
SOURCE      X.laevis skin, cDNA to mRNA, clone L4 and 8/136.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 478)
  AUTHORS   Richter,K., Kawashima,E., Egger,R. and Kreil,G.
  TITLE     Biosynthesis of thyrotropin releasing hormone in the skin of
            Xenopus laevis: Partial sequence of the precursor deduced from
            cloned cDNA
  JOURNAL   EMBO J. 3, 617-621 (1984)
  STANDARD  full staff_review
REFERENCE   2  (bases 15 to 1442)
  AUTHORS   Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
  TITLE     Two precursors of thyrotropin releasing hormone from skin of
            Xenopus laevis: Each contains seven copies of end-product
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by G.Kreil, 18-MAY-1990.
FEATURES       from  to/span     description
    pept        110      793     thyrotropin releasing hormone precursor
    matp        332      340     thyrotropin releasing hormone copy 1
    matp        374      382     thyrotropin releasing hormone copy 2
    matp        428      436     thyrotropin releasing hormone copy 3
    matp        470      478     thyrotropin releasing hormone copy 4
    mRNA    <     1     1442     TRH mRNA
    conflict    139      139     t in [2]; c in [1]
    conflict    214      216     tct in [2]; ctc in [1]
    conflict    319      319     g in [2]; t in [1]
BASE COUNT      460 a    286 c    334 g    362 t
ORIGIN      
        1 agcacagagc agcacaagga cacactctgc atattgtgct gccggacaag gaggtgacag
       61 ccagtcaggc tgagacaaag gaacttccag acctctgaca gcaggaaaga tggtgtctgt
      121 ctggtggttg ctgcttcttg gtacaaccgt atctcacatg gtgcacacac aagagcagcc
      181 tttactggag gaggacacag caccattaga tgatctggat gttcttgaga aagccaaagg
      241 tatcctgatc cgcagtatcc tggagggatt tcaagaaggg caacaaaaca atagagatct
      301 accagatgca atggaaatga tatctaagcg ccagcaccca gggaaacgat tccaggagga
      361 gatagaaaag agacaacacc ctggaaagag ggatctggaa gatctgaatc tagagctttc
      421 caaaaggcaa caccccggaa gaagatttgt ggatgatgta gagaagaggc aacatccagg
      481 aaagagagaa gagggtgact ggagtaggag gtatctgaca gatgactcac gttatttgga
      541 cctcctttct gatgtttcca ggagacagca cccaggcaaa agagttccag ccccattgtt
      601 tacaaaacgt caacacccag gtaagagagt gacagaagaa gagggtgata ctgaatttga
      661 aaactcgaag gaagtgggga agcgccagca tccaggaaag agatatgacc cttgtgaagg
      721 ccctaatgcc tacaactgta actcaggaaa cattctaccg gattctgtag aagaattgag
      781 ttttgggctt taagctgccc agccccttta ttagttccat ctgaccctaa atgattccca
      841 atgaacacaa ctttctataa ttgttaaata acattgtatt aagtatcata catttctgga
      901 aagcaagcag ctcttagaac acttcttcgc tttaaaaggc acctggggca taagagtatt
      961 aagcttcaga cagtaacctg cccaccacag ggagggattc aacaatcaca attggctgag
     1021 tgttcctttc ccttgtttgg cagtgagatc agataataaa tataagatgg ccaggaaagt
     1081 ggactctttc ttttctgaaa atttgcaagt aacaccaaaa tataataatt tgcacactca
     1141 gtagtattaa cgtgaagatc tcaagaaggt tataaattct tggtgatctg ctcaaagcat
     1201 ttaattcata gttgcttcca tggtttgatg gggaatgcac attctaaatt gcttattgct
     1261 aattagcgct tgccacacag ttctggtggt agatcttgat gaggcatatt caataaaagt
     1321 agagcccata gtaaaatttg tgccccgtca gctttaagga tcctctgtaa gcaatatgtg
     1381 ttgtgagggc cacttgtttc taaagtaata ttttcatttt aataaatatg tctactcaaa
     1441 tg
//
LOCUS       XELTRHA      2955 bp ss-mRNA            VRT       26-JUL-1990
DEFINITION  X.laevis thyrotropin releasing hormone mRNA.
ACCESSION   M34698 J05514
KEYWORDS    thyrotropin releasing hormone.
SOURCE      X.laevis, cDNA to mRNA, clone C6.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2955)
  AUTHORS   Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
  TITLE     Two precursors of thyrotropin releasing hormone from skin of
            Xenopus laevis: Each contains seven copies of end-product
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Kreil, 18-MAY-1990.
FEATURES       from  to/span     description
    pept        157      831     thyrotropin releasing hormone
BASE COUNT      927 a    597 c    604 g    827 t
ORIGIN      
        1 catgcagttt attagatata cagtacaatg aagtcagtta tgagaaatag caattgcagc
       61 acaaggacac actctgcata ttgtgctgcc ggacaaggag gtgacagcca gtcaggctga
      121 gacaaaggaa cttccagacc tctgacagca ggaaagatgg tgtctgtctg gtggttgctg
      181 cttcttggta caaccgtatc tcacatggtg cacacacaag agcagccttt actggaggag
      241 gacacagcac cattagatga tctggatgtt cttgagaaag ccaaaggtat cctgatccgc
      301 agtatcctgg agggatttca agaagggcaa caaaacaata gagatctacc agatgcaatg
      361 gaaatgatat ctaagcgcca gcacccaggg aaacgattcc aggaggagat agaaaagaga
      421 caacaccctg gaaagaggga tctggaagat ctgaatctag agctttccaa aaggcaacac
      481 cccggaagaa gatttgtgga tgatgtagag aagaggcaac atccaggaaa gagagaagag
      541 ggtgactgga gtaggaggta tctgacagat gactcacgtt atttggacct cctttctgat
      601 gtttccagga gacagcaccc aggcaaaaga gttccagccc cattgtttac aaaacgtcaa
      661 cacccaggta agagagtgac agaagaagag ggtgatactg aatttgaaaa ctcgaaggaa
      721 gtggggaagc gccagcatcc aggaaagaga tatgaccctt gtgaaggccc taatgcctac
      781 aactgtaact caggaaacat tctaccggaa gaattgagtt ttgggcttta agctgcccag
      841 cccctttatt agttccatct gaccctaaat gattcccaat gaacacaact ttctataatt
      901 gttaaataac attgtattaa gtatcataca tttctggaaa gcaagcagct cttagaacac
      961 ttcttcgctt taaaaggcac ctggggcata agagtattaa gcttcagaca gtaacctgcc
     1021 caccacaggg agggattcaa caatcacaat tggctgagtg ttcctttccc ttgtttggca
     1081 gtgagatcag ataaataaat ataagatggc caggaaagtg gactctttct tttctgaaaa
     1141 tttgcaagta acaccaaaat ataataattt tgcactctgc agtgtattaa cgtgaagatc
     1201 tcaagaaggt tataaattag gttataaatt cttggtgatc tgctcaaagc atttaattca
     1261 tagttgcttc catggtttga tggggaatgc acattctaaa ttgcttattg ctaattagcg
     1321 cttgccacac agttctggtg gtagatcttg atgaggcata ttcaataaaa gtagagccca
     1381 tagtaaaatt tgtgccccgt cagctttaag gatcctctgt aagcaatatg tgttgtgagg
     1441 gccacttgtt tctaaagtaa tattttcatt ttaataaata tgtctactca aatgacaaaa
     1501 acattcatta tttcactaca ttatactcct tcccacagca attatgtacc tatgaatcct
     1561 gatagaagac tgcagttttc ctcttatatc ctccatgttg gattcaccat aagtcaccaa
     1621 aatatatcta tagggaagca cactatacac aatagcagtg acccccatcc agtggcttgt
     1681 gggcaacaag ctactcacca acccccttgg ctgttgctcc cagtggccct aaagtaaggt
     1741 gcataaaaaa accagatgaa cttgtcaaaa agagcctccc ttagactgcc ttgttccaca
     1801 tagaggctac catatagcca atcacagccc ttatttggca cccccgggaa cttttttcat
     1861 gcttgagttg ctccccaaat ctttttacag ttgaatatgt ctcatggcta aaaaaacgtg
     1921 aggaccccgg cgtaatatag tataatatac acacactcac tttggaaaac tctatggaga
     1981 tcaataagca cttttgggtt aaactatttt tttgatacaa tttgagcact ttatatatgg
     2041 attttaaaga tattccgctt tagtagtctg tggtgcgctg ccccataaat atattggtga
     2101 attattcacc acctactctt aacaattctg ctcaattcat ctagatgtta acataataca
     2161 tcaccagtat cacaatggca gcgggaagca aagacattct gtagtgtcct gagaccagct
     2221 aaagcctaga ggtggaccat aaataatgtc tattgcaggg tcagtacaaa caaaaacacc
     2281 aaggctgctt tatacaaggc atatctaatt tgcaggtatt ttgctgaact attactccac
     2341 acacaaagct tgagggacac agactaataa tctgctgaag gtttgcagga tggacagttg
     2401 gacactgctt tgcttcaact ttattctagg cttgtgctct gatgtatgca gcgtcaaata
     2461 ccagctgttg tttgactaca actcccagaa gcctcagcat actgagggtg gtatgcttga
     2521 atgcttgaat gcttgaatac cgaaggctgt ctgtcctcca acacctcccg ttgatctccc
     2581 gctccagctc ttattgtcat tccattgtat attttgtttt taaatgtata aagaaataaa
     2641 aaaaaagtat gatatattca cccttcttct tctgagtata aaaagattta aatgaatgtg
     2701 aaaataatat ttttatagac aacaatcttt gtgcagtgtt ggtaaataca tgtttattct
     2761 gtatatagct attttaatat gcatactgaa agaatatata tatataataa gaagcatgaa
     2821 catctcattg cctgggtatg aaacaataaa gattgcatct gataatgaag caaattcgct
     2881 ctgtggcgca gtattatgtt gacctgatga tgaagttagg tctggtgcgc ttctcaatgt
     2941 tcgtggcgct ggccc
//
LOCUS       MUSIGCS       302 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Mouse Ig heavy-chain gene enhancer region.
ACCESSION   M35179
KEYWORDS    constant region; germline; immunoglobulin heavy-chain.
SOURCE      Mouse (strain BXXB:SB/Le) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 302)
  AUTHORS   Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and Dixon,F.J.
  TITLE     Molecular aspects of murine systemic lupus erythematosus
  JOURNAL   Springer Semin. Immunopathol. 9, 121-142 (1986)
  STANDARD  simple staff_review
BASE COUNT       90 a     59 c     68 g     85 t
ORIGIN      
        1 ctgcagcagc tggcaggaag caggtcatgt ggcaaggcta tttggggaag ggaaaataaa
       61 accactaggt aaacttgtag ctgtggtttg aagaagtggt tttgaaacac tctgtccagc
      121 cccaccaaac cgaaagtcca ggctgagcaa aacaccacct gggtaatttg catttctaaa
      181 ataagttgag gattcagccg aaactggaga ggtcctcttt taacttattg agttcaacct
      241 tttaatttta gcttgagtag ttctagtttc cccaaactta agtttatcga cttctaaaat
      301 gt
//
LOCUS       MUSIGCT       313 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Mouse Ig heavy-chain gene enhancer region.
ACCESSION   M35180
KEYWORDS    constant region; germline; immunoglobulin heavy-chain.
SOURCE      Mouse (lupus erythematosus strain MRL/I) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 313)
  AUTHORS   Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and Dixon,F.J.
  TITLE     Molecular aspects of murine systemic lupus erythematosus
  JOURNAL   Springer Semin. Immunopathol. 9, 121-142 (1986)
  STANDARD  simple staff_review
BASE COUNT       94 a     59 c     69 g     91 t
ORIGIN      
        1 ctgcagcagc tggcaggaag caggtcatgt ggcaaggcta tttggggaag ggaaaataaa
       61 accactaggt aaacttgtag ctgtggtttg aagaagtggt tttgaaacac tctgtccagc
      121 cccaccaaac cgaaagtcta ggctgagcaa aacaccacct gggtaatttg catttctaaa
      181 ataagttgag gattcagccg aaactggaga ggtcctcttt taacttattg agttcaacct
      241 tttaatttta gcttgagtag ttctagtttc cccaaactta agtttatcga cttctaaaat
      301 gtatttagaa ttc
//
LOCUS       MUSTCBYBB     459 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Mouse T-cell receptor C beta-1/2 recombinant chain, exon 1.
ACCESSION   M35181
KEYWORDS    T-cell receptor beta chain; constant region; germline.
SOURCE      Mouse (strain NZW) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 459)
  AUTHORS   Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and Dixon,F.J.
  TITLE     Molecular aspects of murine systemic lupus erythematosus
  JOURNAL   Springer Semin. Immunopathol. 9, 121-142 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pre-msg <     1  >   459     TCR C-beta-1/2 recombinant chain, exon 1
    IVS         436  >   459     TCR C-beta-1/2 intron A (no splice consensus)
BASE COUNT      117 a    124 c    126 g     92 t
ORIGIN      
        1 ttacaagatc aaggcagatc cagatagctc tcagaccatt cgtactctct ttactttcca
       61 gaggatctga gaaatgtgac tccacccaag gtctccttgt ttgagccatc aaaagcagag
      121 attgcaaaca aacaaaaggc taccctcgtg tgcttggcca ggggcttctt ccctgaccac
      181 gtggagctga gctggtgggt gaatggcagg gaggtccaca gtggggtcag cacggaccct
      241 caggcctaca aggagagcaa ttatagctac tgcctgagca gccggctgag ggtctctgct
      301 accttctggc acaatcctcg aaaccacttc cgctgccaag tgcagttcca tgggctttca
      361 gaggaggaca agtggccaga gggctcaccc aaacctgtca cacagaacat cagtgcagag
      421 gcctggggcc gagcaggtaa gtgcggacct catgaggaa
//
LOCUS       HAMSCARPB     537 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Hamster alpha-crystallin B chain mRNA, 5' end.
ACCESSION   J03849
KEYWORDS    alpha-crystallin B chain.
SOURCE      Hamster scrapie infected brain, cDNA to mRNA.
  ORGANISM  Mesocricetus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 537)
  AUTHORS   Duguid,J.R., Rohwer,R.G. and Seed,B.
  TITLE     Isolation of cDNAs of scrapie-modulated RNAs by subtractive
            hybridization of a cDNA library
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            J.Duguid, 25-OCT-1990.
FEATURES       from  to/span     description
    pept         21  >   537     alpha-crystallin B chain
BASE COUNT      113 a    177 c    127 g    120 t
ORIGIN      
        1 catacattca cctagccacc atggacatcg ccatccacca cccctggatc cgccgtccct
       61 ttttcccttt ccactccccc agccgcctct ttgaccagtt cttcggagag cacctgttgg
      121 agtctgacct cttctcaact gccacttctc tgagtccctt ctacctgcgg ccaccttcct
      181 tccttcgggc acccagctgg attgacactg gactctcaga gatgcggatg gagaaggaca
      241 gattctccgt caacctggat gtgaagcact tctccccgga agagctgaaa gtcaaggtgc
      301 tgggggacgt ggttgaagtg catggcaagc acgaagagcg ccaggacgaa cacggcttca
      361 tctctaggga gttccatagg aagtaccgga tcccagctga tgtggatcct ctgaccatta
      421 cttcatccct gtcatctgac ggcgtcctca ctgtgaatgg accaaggaaa caggcctctg
      481 gccccgagcg taccattccc atcacccgtg aagagaagcc tgctgtcact gcagccc
//
LOCUS       HAMSCRAP      282 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Hamster glial fibrillary acidic protein mRNA, partial cds.
ACCESSION   J03847
KEYWORDS    glial fibrillary acidic protein.
SOURCE      Hamster scrapie infected brain, cDNA to mRNA.
  ORGANISM  Mesocricetus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 282)
  AUTHORS   Duguid,J.R., Rohwer,R.G. and Seed,B.
  TITLE     Isolation of cDNAs of scrapie-modulated RNAs by subtractive
            hybridization of a cDNA library
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            J.Duguid, 25-OCT-1990.
FEATURES       from  to/span     description
    pept    <     1  >   282     glial fibrillary acidic protein (AA at 1)
BASE COUNT       90 a     69 c     76 g     47 t
ORIGIN      
        1 gagggccaaa gcctcaagga ggagatggct cgccacctgc aggagtatca agatctactc
       61 aatgtcaagc tagccctgga catcgagatt gccacctata ggaaattgct agaaggcgag
      121 gaaaaccgca tcaccatccc tgtacaaact ttctccaacc tgcaaatccg agaaaccagc
      181 ctggacacca agtccgtgtc agaaggacac ctcaagagga acatcgtggt aaagacagtg
      241 gagatgaggg atggtgaggt cattaaggag tccaagcagg ag
//
LOCUS       HAMSCRAPA     327 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Hamster metallothionein II mRNA, complete cds.
ACCESSION   J03848
KEYWORDS    metallothionein II.
SOURCE      Hamster scrapie infected brain, cDNA to mRNA.
  ORGANISM  Mesocricetus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 327)
  AUTHORS   Duguid,J.R., Rohwer,R.G. and Seed,B.
  TITLE     Isolation of cDNAs of scrapie-modulated RNAs by subtractive
            hybridization of a cDNA library
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            J.Duguid, 25-OCT-1990.
FEATURES       from  to/span     description
    pept         62      247     metallothionein II
BASE COUNT       65 a    100 c     81 g     81 t
ORIGIN      
        1 cactcaagtt tcgacttttc ctcggtcctc agccggtctt caaccgccgc cttcactcgc
       61 catggacccc aactgctcct gtgccacaga tggatcctgc tcctgctctg ggtcttgcaa
      121 atgcaaagag tgcaaatgca ccacgtgcaa gaaaagctgc tgctcctgct gcccggtggg
      181 ctgtgcgaag tgctcccagg gctgcgtctg caaagaggct tcggagaagt gcagctgctg
      241 cgcctgaagc ggattcccct cagctgtctg taaatagagc aatgtgtaga aacgtattgg
      301 tttttttaca accccgtcct attctcc
//
LOCUS       ASOTAAG1     2935 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G1) gene, complete cds.
ACCESSION   M33218
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 2935)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        607      772     Taka-amylase A (Taa-G1) precursor, exon 1
                828      868     Taka-amylase A precursor, exon 2
                955     1070     Taka-amylase A precursor, exon 3
               1140     1248     Taka-amylase A precursor, exon 4
               1317     1545     Taka-amylase A precursor, exon 5
               1603     1765     Taka-amylase A precursor, exon 6
               1830     1976     Taka-amylase A precursor, exon 7
               2041     2281     Taka-amylase A precursor, exon 8
               2360     2647     Taka-amylase A precursor, exon 9
    sigp        607      669     Taka-amylase A signal peptide
    matp        670      772     Taka-amylase A
                828      868     Taka-amylase A
                955     1070     Taka-amylase A
               1140     1248     Taka-amylase A
               1317     1545     Taka-amylase A
               1603     1765     Taka-amylase A
               1830     1976     Taka-amylase A
               2041     2281     Taka-amylase A
               2360     2644     Taka-amylase A
    pre-msg     543  >  2789     Taa-G1 mRNA and introns
    IVS         773      827     Taa-G1 intron A (no splice consensus)
    IVS         869      954     Taa-G1 intron B
    IVS        1071     1139     Taa-G1 intron C
    IVS        1249     1316     Taa-G1 intron D
    IVS        1546     1602     Taa-G1 intron E
    IVS        1766     1829     Taa-G1 intron F
    IVS        1977     2040     Taa-G1 intron G
    IVS        2282     2359     Taa-G1 intron H
    signal     2784     2789     poly-A signal
BASE COUNT      818 a    752 c    657 g    708 t
ORIGIN      
        1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact
       61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca
      121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca
      181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag
      241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat
      301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat
      361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt
      421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc
      481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg
      541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag
      601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct
      661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat
      721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt
      781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg
      841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg
      901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac
      961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc
     1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat
     1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga
     1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt
     1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct
     1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct
     1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact
     1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct
     1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga
     1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc
     1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg
     1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt
     1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca
     1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt
     1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca
     1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca
     1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa
     1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag
     2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg
     2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa
     2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc
     2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa
     2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga
     2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg
     2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga
     2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt
     2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg
     2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag
     2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag
     2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga
     2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca
     2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc
     2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg
//
LOCUS       ASOTAAG2A1    197 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G2) gene, 5' end.
ACCESSION   M33220
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 197)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        195  >   197     Taka-amylase A (Taa-G2) precursor
    pre-msg     131  >   197     Taa-G2 mRNA and introns
    signal        2       11     CAAT box
    signal       95      100     TATA box
BASE COUNT       59 a     52 c     36 g     50 t
ORIGIN      
        1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa
       61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg
      121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc
      181 cacagaaggc atttatg
//
LOCUS       ASOTAAG2A2    198 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G2) gene, 3' end.
ACCESSION   M33222
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 198)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        9     Taka-amylase A (Taa-G2) (AA at 1)
    pre-msg <     1      151     Taa-G2 mRNA and introns (alt.)
    pre-msg <     1      156     Taa-G2 mRNA and introns (alt.)
BASE COUNT       45 a     48 c     53 g     52 t
ORIGIN      About 2.1 kb after segment 1.
        1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
       61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
      121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc
      181 gtcctccaga ggcgagga
//
LOCUS       ASOTTAM1      191 bp ss-mRNA            PLN       26-JUL-1990
DEFINITION  A.oryzae Taka-amylase A (Taa) mRNA, 3' end.
ACCESSION   M33219
KEYWORDS    Taka-amylase A.
SEGMENT     1 of 2
SOURCE      A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 191)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        189  >   191     Taka-amylase A (Taa)
    mRNA        125  >   191     Taa mRNA
BASE COUNT       56 a     52 c     32 g     51 t
ORIGIN      
        1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca
       61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt
      121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga
      181 aggcatttat g
//
LOCUS       ASOTTAM2      156 bp ss-mRNA            PLN       26-JUL-1990
DEFINITION  A.oryzae Taka-amylase A (Taa) mRNA, 5' end.
ACCESSION   M33221
KEYWORDS    Taka-amylase A.
SEGMENT     2 of 2
SOURCE      A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        9     Taka-amylase A (Taa) (AA at 1)
    mRNA    <     1      151     Taa mRNA (alt.)
    mRNA    <     1      156     Taa mRNA (alt.)
BASE COUNT       40 a     35 c     42 g     39 t
ORIGIN      About 1.5 kb after segment 1.
        1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
       61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
      121 ggcaccgggc cccgactgag agcccgtctc gaatcc
//
LOCUS       CIPNADGAPD   1354 bp ss-mRNA            PLN       26-JUL-1990
DEFINITION  M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH)
            mRNA, complete cds.
ACCESSION   J05223
KEYWORDS    D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase;
            glyceraldehyde-3-phosphate dehydrogenase.
SOURCE      M.crystallinum, cDNA to mRNA.
  ORGANISM  Mesembryanthemum crystallinum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Aizoaceae.
REFERENCE   1  (bases 1 to 1354)
  AUTHORS   Ostrem,J.A., Vernon,D.M. and Bohnert,H.J.
  TITLE     Increased expression of a gene coding for NAD:glyceraldehyde-3-
            phosphate dehydrogenase during the transition from C-3
            photosynthesis to crassulacean acid metabolism in Mesembryanthemum
            crystallinum
  JOURNAL   J. Biol. Chem. 265, 3497-3502 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         55     1068     glyceraldehyde-3-phosphate dehydrogenase (EC
                                 1.2.1.12)
BASE COUNT      315 a    308 c    345 g    386 t
ORIGIN      
        1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct
       61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc
      121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac
      181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag
      241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg
      301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga
      361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt
      421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac
      481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg
      541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc
      601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg
      661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt
      721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg
      781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct
      841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat
      901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga
      961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac
     1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga
     1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc
     1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt
     1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta
     1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc
     1321 agcatttatg aaacattgct gtgagctttt gagt
//
LOCUS       ECOPRIA      2658 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  E.coli primosomal protein n' (priA) gene, complete cds, and cytR
            gene, 5' end.
ACCESSION   M33293
KEYWORDS    cytR gene; priA gene; primosomal protein.
SOURCE      E.coli (strain W3110) DNA, clone pEL042.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2658)
  AUTHORS   Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A.
  TITLE     The priA gene encoding the primosomal replicative n' protein of
            Escherichia coli
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4620-4624 (1990)
  STANDARD  full automatic
COMMENT     Authorin sequence for [1] kindly submitted by G.C.Allen Jr.,
            26-MAR-1990.
FEATURES       from  to/span     description
    pept         64     2262     primosomal protein n'
    pept       2418  >  2658     cytR gene product
    signal       27       32     -10 region
    signal        5       10     -35 region
    binding      53       59     ribosome binding site
BASE COUNT      578 a    738 c    756 g    586 t
ORIGIN      
        1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat
       61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg
      121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag
      181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag
      241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga
      301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc
      361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt
      421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa
      481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa
      541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa
      601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg
      661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc
      721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa
      781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg
      841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc
      901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt
      961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt
     1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac
     1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc
     1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt
     1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg
     1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac
     1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt
     1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac
     1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc
     1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg
     1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg
     1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca
     1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg
     1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc
     1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac
     1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa
     1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg
     1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg
     2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg
     2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg
     2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt
     2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg
     2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg
     2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg
     2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg
     2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata
     2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac
     2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc
     2641 cggatatctg cgatcccc
//
LOCUS       EWCTELRNA     657 bp ds-DNA             INV       26-JUL-1990
DEFINITION  E.crassus telomerase RNA component gene, complete cds.
ACCESSION   M33461
KEYWORDS    telomerase RNA.
SOURCE      E.crassus DNA.
  ORGANISM  Euplotes crassus
            Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; 
            Spirotricha; Heterotrichida; Clevelandellina.
REFERENCE   1  (bases 1 to 657)
  AUTHORS   Shippen-Lentz,D. and Blackburn,E.H.
  TITLE     Functional evidence for an RNA template in telomerase
  JOURNAL   Science 247, 546-552 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    RNA         152      342     telomerase RNA component
    site        186      197     functional telomeric template
BASE COUNT      202 a    122 c    111 g    222 t
ORIGIN      
        1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga
       61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt
      121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg
      181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg
      241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt
      301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg
      361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt
      421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat
      481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga
      541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac
      601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt
//
LOCUS       HS4DWXJ       160 bp ds-DNA             VRL       26-JUL-1990
DEFINITION  Epstein-Barr virus defective WZhet junction.
ACCESSION   M33474
KEYWORDS    .
SOURCE      Epstein-Barr virus (strain HR-1, clinical sample 9) DNA.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 160)
  AUTHORS   Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
  TITLE     Defective viral DNA in Epstein-Barr virus-associated oral hairy
            leukoplakia
  JOURNAL   J. Virol. 64, 397-400 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb       87       90     WZhet junction
BASE COUNT       41 a     45 c     41 g     33 t
ORIGIN      
        1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc
       61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact
      121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac
//
LOCUS       HS4WXJ        181 bp ds-DNA             VRL       26-JUL-1990
DEFINITION  Epstein-Barr virus WZhet junction, HR-1 clone 5.
ACCESSION   M33473
KEYWORDS    .
SOURCE      Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 181)
  AUTHORS   Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W.
  TITLE     Defective viral DNA in Epstein-Barr virus-associated oral hairy
            leukoplakia
  JOURNAL   J. Virol. 64, 397-400 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb      108      111     WZhet junction
BASE COUNT       50 a     42 c     46 g     43 t
ORIGIN      
        1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt
       61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca
      121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga
      181 c
//
LOCUS       HUMREGA      4251 bp ds-DNA             PRI       26-JUL-1990
DEFINITION  Human regenerating protein (reg) gene, complete cds.
ACCESSION   J05412
KEYWORDS    pancreatic stone protein; pancreatic thread protein;
            regenerating protein.
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4251)
  AUTHORS   Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
  TITLE     Complete nucleotide sequence of the human reg gene and its
            expression in normal and tumoral tissues: The reg protein,
            pancreatic stone protein, and pancreatic thread protein are one and
            the same product of the gene
  JOURNAL   J. Biol. Chem. 265, 7432-7439 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            H.Okamoto, 23-FEB-1990.
FEATURES       from  to/span     description
    pept       1571     1634     regenerating protein (reg), exon 2 (first
                                 expressed exon)
               2270     2388     regenerating protein, exon 3
               2696     2833     regenerating protein, exon 4
               3549     3660     regenerating protein, exon 5
               3856     3923     regenerating protein, exon 6
    pre-msg    1196     4116     reg mRNA and introns
    IVS        1224     1524     reg intron A
    IVS        1635     2269     reg intron B
    IVS        2389     2695     reg intron C
    IVS        2834     3548     reg intron D
    IVS        3661     3855     reg intron E
    signal     1169     1174     TATA box
BASE COUNT     1161 a    927 c    869 g   1294 t
ORIGIN      
        1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc
       61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat
      121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt
      181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc
      241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt
      301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc
      361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc
      421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat
      481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac
      541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg
      601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg
      661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata
      721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag
      781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc
      841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga
      901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt
      961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag
     1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc
     1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa
     1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa
     1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat
     1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta
     1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc
     1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag
     1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac
     1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt
     1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct
     1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact
     1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt
     1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat
     1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg
     1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt
     1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa
     1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac
     2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga
     2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat
     2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa
     2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc
     2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc
     2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag
     2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca
     2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc
     2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac
     2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca
     2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta
     2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt
     2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca
     2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta
     2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag
     2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata
     3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt
     3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca
     3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct
     3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa
     3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga
     3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt
     3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa
     3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc
     3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc
     3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg
     3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag
     3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg
     3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt
     3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt
     3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct
     3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc
     3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca
     4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt
     4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc
     4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct
     4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c
//
LOCUS       HUMREGRELA   1524 bp ds-DNA             PRI       26-JUL-1990
DEFINITION  Human reg-related sequence, complete cds.
ACCESSION   J05413
KEYWORDS    pancreatic stone protein; pancreatic thread protein;
            regenerating protein.
SOURCE      Human esophageal mucosa DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1524)
  AUTHORS   Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H.
  TITLE     Complete nucleotide sequence of the human reg gene and its
            expression in normal and tumoral tissues: The reg protein,
            pancreatic stone protein, and pancreatic thread protein are one and
            the same product of the gene
  JOURNAL   J. Biol. Chem. 265, 7432-7439 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            H.Okamoto, 23-FEB-1990.
BASE COUNT      382 a    368 c    345 g    429 t
ORIGIN      
        1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt
       61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt
      121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg
      181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat
      241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg
      301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga
      361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata
      421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat
      481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag
      541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa
      601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc
      661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac
      721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta
      781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca
      841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt
      901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat
      961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct
     1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc
     1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga
     1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc
     1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct
     1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct
     1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc
     1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg
     1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc
     1501 tataaatcat ataatgtaaa ggat
//
LOCUS       MDPCGA       4801 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Aleutian mink disease parvovirus complete genome.
ACCESSION   M20036
KEYWORDS    complete genome.
SOURCE      Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4.
  ORGANISM  Aleutian mink disease parvovirus
            Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE   1  (bases 1 to 4592)
  AUTHORS   Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
            Wolfinbarger,J.B.
  TITLE     Nucleotide sequence and genomic organization of Aleutian mink
            disease parvovirus (ADV): Sequence comparisons between a
            nonpathogenic and pathogenic strain of ADV
  JOURNAL   J. Virol. 62, 2903-2915 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 4593 to 4801)
  AUTHORS   Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W.,
            Perryman,S. and Wolfinbarger,J.B.
  TITLE     Nucleotide sequence of the 5'-terminal palindrome of Aleutian mink
            disease parvovirus (ADV) and construction of an infectious
            molecular clone
  JOURNAL   J. Virol. 64, 3551-3556 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by M.E.Bloom, 18-MAR-1990.
            
            Sequence reported below is (+) strand.
FEATURES       from  to/span     description
    ORF         116     1978     Left ORF
    ORF        1983     2207     Middle ORF2
    ORF        1993     2211     Middle ORF1
    ORF        2241     4399     Right ORF
BASE COUNT     1740 a    912 c    943 g   1206 t
ORIGIN      
        1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt
       61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt
      121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag
      181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact
      241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt
      301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact
      361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc
      421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac
      481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt
      541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta
      601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag
      661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa
      721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc
      781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga
      841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta
      901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt
      961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga
     1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg
     1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa
     1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt
     1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga
     1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt
     1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga
     1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa
     1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg
     1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta
     1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat
     1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac
     1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg
     1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa
     1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa
     1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca
     1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt
     1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca
     2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa
     2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg
     2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata
     2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta
     2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc
     2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag
     2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg
     2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg
     2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt
     2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact
     2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag
     2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata
     2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta
     2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa
     2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt
     2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg
     3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc
     3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct
     3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta
     3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt
     3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg
     3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa
     3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg
     3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt
     3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac
     3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga
     3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat
     3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac
     3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt
     3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc
     3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg
     3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg
     3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac
     4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg
     4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa
     4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag
     4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag
     4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg
     4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact
     4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc
     4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta
     4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac
     4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt
     4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt
     4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct
     4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg
     4801 a
//
LOCUS       MDPUPS       3454 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial
            sequence.
ACCESSION   M32981
KEYWORDS    .
SOURCE      Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA.
  ORGANISM  Aleutian mink disease parvovirus
            Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus.
REFERENCE   1  (sites)
  AUTHORS   Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
            Wolfinbarger,J.B.
  TITLE     Nucleotide sequence and genomic organization of Aleutian mink
            disease parvovirus (ADV): Sequence comparisons between a
            nonpathogenic and pathogenic strain of ADV
  JOURNAL   J. Virol. 62, 2903-2915 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3454, for [1])
  AUTHORS   Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and
            Wolfinbarger,J.B.
  JOURNAL   Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by M.E.Bloom 18-MAR-1990.
            
            Sequence reported below is (+) strand.
BASE COUNT     1268 a    673 c    698 g    815 t
ORIGIN      
        1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc
       61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac
      121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac
      181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta
      241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa
      301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga
      361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat
      421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt
      481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa
      541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta
      601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat
      661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag
      721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga
      781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat
      841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc
      901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact
      961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt
     1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac
     1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac
     1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa
     1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta
     1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag
     1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac
     1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg
     1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa
     1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac
     1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca
     1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga
     1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc
     1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg
     1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta
     1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt
     1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga
     1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag
     2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag
     2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac
     2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt
     2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga
     2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc
     2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg
     2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat
     2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta
     2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg
     2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa
     2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg
     2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta
     2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc
     2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa
     2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt
     2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca
     3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg
     3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct
     3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga
     3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt
     3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc
     3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt
     3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa
     3421 aacctatggt tactttactt ggagaggcaa gctt
//
LOCUS       MUSAA2DEL     300 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion
            fragment.
ACCESSION   M33468
KEYWORDS    deletion mutant.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
  TITLE     Molecular analysis of two mouse dilute locus deletion mutations:
            Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
            lethal Aa2 alleles
  JOURNAL   Mol. Cell. Biol. 10, 501-509 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb      130      133     deletion breakpoint
BASE COUNT       82 a     45 c     76 g     97 t
ORIGIN      
        1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact
       61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt
      121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt
      181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata
      241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa
//
LOCUS       MUSSL20JA     300 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion
            fragment.
ACCESSION   M33467
KEYWORDS    deletion mutant.
SOURCE      Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A.
  TITLE     Molecular analysis of two mouse dilute locus deletion mutations:
            Spontaneous dilute lethal-20J and radiation-induced dilute prenatal
            lethal Aa2 alleles
  JOURNAL   Mol. Cell. Biol. 10, 501-509 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept.ps /    42      122     dilute gene, exon 3
                179  >   300     dilute gene, exon 5
    recomb      129      130     deletion breakpoint
BASE COUNT       86 a     51 c     61 g    102 t
ORIGIN      
        1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac
       61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac
      121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta
      181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta
      241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc
//
LOCUS       PRVVP4       2359 bp ds-RNA             VRL       26-JUL-1990
DEFINITION  Porcine rotavirus capsid protein VP4 gene, complete cds.
ACCESSION   M33516
KEYWORDS    capsid protein VP4.
SOURCE      Porcine rotavirus (Gottfried strain; serotype 4) DNA.
  ORGANISM  Porcine rotavirus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 2359)
  AUTHORS   Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K.
  TITLE     Similarity of the outer capsid protein VP4 of the Gottfried strain
            of porcine rotavirus to that of asymptomatic human rotavirus
            strains
  JOURNAL   J. Virol. 64, 414-418 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         10     2337     capsid protein VP4
BASE COUNT      812 a    406 c    442 g    699 t
ORIGIN      
        1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa
       61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg
      121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat
      181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca
      241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac
      301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga
      361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg
      421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta
      481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat
      541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa
      601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag
      661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta
      721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag
      781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt
      841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca
      901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt
      961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc
     1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac
     1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg
     1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt
     1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc
     1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca
     1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga
     1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac
     1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg
     1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata
     1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt
     1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct
     1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc
     1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata
     1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga
     1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct
     1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat
     1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat
     2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc
     2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata
     2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc
     2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa
     2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag
     2341 ctctatagag gatgtgacc
//
LOCUS       RATSTAA      1000 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds.
ACCESSION   M33329
KEYWORDS    hydroxysteroid sulfotransferase a.
SOURCE      Rat (strain Sprague-Dawley) female liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1000)
  AUTHORS   Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K.,
            Yamazoe,Y. and Kato,R.
  TITLE     cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a
            strong homology in amino acid sequence with the senescence marker
            protein SMP-2 in rat livers
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         38      892     hydroxysteroid sulfotransferase a (STa)
    mRNA    <     1     1000     STa mRNA
    signal      977      982     poly-A signal
BASE COUNT      299 a    187 c    211 g    303 t
ORIGIN      
        1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga
       61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa
      121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg
      181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt
      241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa
      301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct
      361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc
      421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta
      481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc
      541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa
      601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga
      661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc
      721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg
      781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt
      841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa
      901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa
      961 gtgaatgtgg tcattgaata aattctattc tggcattgtg
//
LOCUS       SMFPOLENV    3534 bp ss-RNA             VRL       26-JUL-1990
DEFINITION  Simian foamy virus type 1 polymerase (pol) gene, 3' end; and
            envelope (env) gene, complete cds.
ACCESSION   M33561
KEYWORDS    envelope protein; polymerase.
SOURCE      Simian foamy virus type 1, cDNA to viral RNA.
  ORGANISM  Simian foamy virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Spumavirinae.
REFERENCE   1  (bases 1 to 3534)
  AUTHORS   Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A.
  TITLE     Relationship of the env genes and the endonuclease domain of the
            pol genes of simian foamy virus type 1 and human foamy virus
  JOURNAL   J. Virol. 64, 406-410 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      582     polymerase (AA at 1)
    pept        524     3481     envelope protein
BASE COUNT     1164 a    648 c    674 g   1048 t
ORIGIN      
        1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac
       61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta
      121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat
      181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta
      241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc
      301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg
      361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc
      421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt
      481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg
      541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac
      601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag
      661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg
      721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat
      781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt
      841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt
      901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt
      961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt
     1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt
     1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt
     1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca
     1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac
     1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca
     1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa
     1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa
     1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga
     1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg
     1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc
     1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt
     1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca
     1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa
     1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg
     1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt
     1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa
     1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc
     2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac
     2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa
     2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag
     2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg
     2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt
     2341 atatttactc cgggatcatg tggtaaccct gatggaagct gcccttcatg atgtttcgat
     2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat
     2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt
     2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta
     2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga
     2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt
     2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa
     2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt
     2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg
     2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg
     2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt
     3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc
     3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt
     3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa
     3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg
     3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc
     3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg
     3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact
     3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg
     3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat
//
LOCUS       MUSC5DPROA   5401 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Mouse complement component C5D (pro-C5D) mRNA, complete cds.
ACCESSION   M35526 J05234
KEYWORDS    complement component C5D.
SOURCE      Mouse (strain B10.D2/oSnJ) liver, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5401)
  AUTHORS   Wetsel,R.A., Fleischer,D.T. and Haviland,D.L.
  TITLE     Deficiency of the murine fifth complement component (C5): A 2- base
            pair gene deletion in a 5'-exon
  JOURNAL   J. Biol. Chem. 265, 2435-2440 (1990)
  STANDARD  full staff_review
COMMENT     This coding sequence is translated into a truncated protein of the
            fifth complement component C5, for the C5S sequence see accession #
            M35525.
FEATURES       from  to/span     description
    pept         14      664     complement component C5D
BASE COUNT     1636 a   1212 c   1221 g   1332 t
ORIGIN      Chromosome 2.
        1 gccgctacca gccatgggtc tttggggaat actttgtctt ttaattttcc tggacaaaac
       61 ttggggacag gaacaaacct acgtcatttc agcacccaaa atcctccggg tcggctcgtc
      121 tgaaaatgtg gtaattcaag tccatggcta cactgaagca tttgatgcaa ctctttctct
      181 aaaaagctat cctgacaaaa aagtcacctt ctcttcaggc tatgttaatt tgtccccgga
      241 aaacaaattc caaaacgcgg cactgttgac actacagccc aatcaagttc ctagagaaga
      301 aagcccagtc tctcacgtgt atctggaagt tgtgtcaaaa cacttttcaa aatcaaagaa
      361 aataccaatt acctataaca atggaattct cttcatccat acagacaaac ctgtttacac
      421 gccggaccag tcagtaaaga tcagagtcta ttctctgggt gacgacttga agccagccaa
      481 acgggagact gtcttaactt tcatagaccc cgaaggatca gaagttgaca ttgtagaaga
      541 aaatgattac accggaatta tctcttttcc tgacttcaag attccatcta atcccaagta
      601 tggtgtttgg acaattaaag ctaactataa gaaggatttt acaacaactg gaactgcact
      661 ttgaaattaa agaatatgtc ttgccacgat tctctgtttc aatagaacta gaaagaacct
      721 tcattggcta taaaaacttt aagaactttg aaatcactgt gaaagcaaga tatttttata
      781 ataaagtggt acctgatgct gaagtgtatg ccttttttgg attgagagag gacataaaag
      841 atgaggagaa gcagatgatg cacaaagcca cacaagccgc aaagttggtt gacggagttg
      901 ctcagatctc ttttgattct gaaacagcag ttaaagagct gtcctacaac agtctagaag
      961 acttaaacaa caagtacctt tatattgcag taacagtcac agaatcttca ggtggatttt
     1021 cagaagaggc agaaatccct ggagtcaaat atgtcctctc tccctacaca ctgaatttgg
     1081 tcgctactcc tcttttcgtg aagcccggga ttccattttc catcaaggca caggttaaag
     1141 attcactcga gcaggcggta ggaggggtcc cagtaactct gatggcacaa acagtcgatg
     1201 tgaatcaaga gacatctgac ttggaaacaa agaggagcat cactcacgac actgatggag
     1261 tagctgtgtt tgtgctgaac ctcccatcaa acgtgacggt gctaaagttt gagatcagaa
     1321 ctgatgaccc agaacttccc gaagaaaatc aagccagcaa agagtacgaa gcagttgcgt
     1381 actcgtctct cagccaaagt tacatttaca tcgcttggac tgaaaactac aagcccatgc
     1441 ttgtgggaga atacctgaat attatggtta cccccaagag cccatatatc gacaaaataa
     1501 ctcactataa ttacttgatt ttatccaaag gcaaaattgt acagtacggc acaagagaga
     1561 aacttttctc ctcaacttat caaaatataa atattccagt gacacagaac atggttcctt
     1621 cagcacgact cctggtctat tacatagtca caggggagca aacagcagaa ttagtggctg
     1681 acgcagtctg gataaatatt gaggagaagt gtggcaacca gctccaggtc catctgtctc
     1741 cagatgaata tgtgtattct ccaggccaaa ctgtgtccct tgacatggtg actgaagcag
     1801 actcatgggt agcactatca gcagtggaca gagctgtgta taaagtccag ggaaacgcca
     1861 aaagggccat gcaaagagtc tttcgagctt tggatgaaaa gagtgacctg ggctgtgggg
     1921 caggtggtgg ccatgacaat gcagatgtat tccatctagc tgggctcacc ttcctcacca
     1981 acgcaaacgc agatgactcc cattatcgtg atgactcttg taaagaaatt ctcaggtcaa
     2041 agagaaatct gcatctccta aggcagaaaa tagaagaaca agctgctaag tacaaacata
     2101 gtgtgctaaa gaaatgctgc tatgacggag cccgagtgaa cttctatgaa acctgtgagg
     2161 agcgagtggc ccgggttacc ataggccctc tctgcatcag ggccttcaac gagtgctgta
     2221 ctattgcgaa caagatccga aaagaaagcc cccataaacc tgtccaactg ggaaggatcc
     2281 acattaagac cctgttacca gtgatgaagg cagatatccg aagctacttt ccagagagct
     2341 ggctatggga aattcatcgc gttcccaaaa gaaaacagct gcaggtcacg ctgcctgact
     2401 cactaacgac ttgggaaatt caaggcattg gcatttcaga caatggtata tgtgttgctg
     2461 atacactcaa ggcaaaggtg ttcaaagaag tcttcctgga gatgaacata ccatattctg
     2521 ttgtgcgagg agaacagatc caattgaaag gaactgttta caactatatg acctcaggga
     2581 caaagttctg tgttaaaatg tctgctgtgg agggaatctg cacttcggga agctcagctg
     2641 ctagccttca cacctccagg ccctccagat gtgtgttcca gaggatagag ggctcgtcca
     2701 gtcacttggt gaccttcacc ctgcttcctc tggaaattgg ccttcactcc ataaacttct
     2761 cactagagac ctcatttggg aaagacatct tagtaaagac attacgggta gtgccagaag
     2821 gagtcaagag ggaaagctat gccggcgtga ttctggaccc taagggaatt cgtggtattg
     2881 ttaacagacg aaaggaattc ccatacagga tcccattaga tttggtcccc aagaccaaag
     2941 ttgaaaggat tttgagtgtc aaaggactgc ttgtagggga gttcttgtcc acggttctga
     3001 gtaaggaagg catcgacatc ctaacccacc tccccaaggg cagtgcagag gcagagctca
     3061 tgagcatagc tccggtgttc tatgttttcc actacctgga agcaggaaac cattggaata
     3121 ttttctatcc tgatacactg agtaaaagac agagcctgga gaaaaaaata aaacaagggg
     3181 tggtgagcgt catgtcctac agaaacgctg actattccta cagcatgtgg aagggggcga
     3241 gcgctagtac ctggctgaca gcttttgctc tgagagtgct tggacaggtg gccaagtatg
     3301 taaaacagga tgaaaactca atttgtaact ctttgctatg gctggttgag aagtgtcagc
     3361 tggaaaacgg ctctttcaag gaaaattccc aatatctacc aataaaatta cagggtactt
     3421 tgcctgctga agcccaagag aaaactttgt atcttacagc cttttctgtg attggaatta
     3481 gaaaggcagt tgacatatgc cccaccatga aaatccacac agcgctagat aaagccgact
     3541 ccttcctgct tgaaaacacc ctgccatcca agagcacctt cacactggcc attgtagcct
     3601 atgctctttc cctaggagac agaacccacc cgaggtttcg tctaattgtg tcggccctga
     3661 ggaaggaagc ttttgttaaa ggtgatccgc ccatttaccg ttactggaga gataccctca
     3721 aacgtccaga cagctctgtg cccagcagcg gcacagcagg tatggttgaa accacagcct
     3781 atgctttgct cgccagcctg aaactgaagg atatgaatta cgccaacccc atcatcaagt
     3841 ggctatctga agagcagagg tatggaggcg gcttttattc cacccaggat acgattaatg
     3901 ccatcgaggg cctgacagaa tattcactcc tgttaaaaca aattcatttg gatatggaca
     3961 tcaatgtcgc ctacaaacac gaaggtgact tccacaagta taaggtgaca gagaagcatt
     4021 tcctggggag gccagtggag gtatctctca atgatgacct tgttgtcagc acaggctaca
     4081 gcagtggctt ggccacagta tatgtaaaaa ctgtggttca caaaattagt gtctctgagg
     4141 aattttgcag cttttacttg aaaattgata cccaagatat tgaagcatcc agccacttca
     4201 ggctcagtga ctctggattc aagcgcataa tagcatgtgc cagctacaag cccagcaagg
     4261 aggagtcaac atccgggtcc tcccatgcag taatggatat atcactgccg actggaatcg
     4321 gagcaaacga ggaagattta cgggctcttg tggaaggagt ggatcaacta ctaactgatt
     4381 accagatcaa agatggccat gtcattctgc aactgaattc gatcccctcc agagatttcc
     4441 tctgtgtccg gttccggata tttgaacttt tccaagttgg gtttctgaat cctgctacct
     4501 tcacggtgta cgagtatcac agaccagata agcagtgcac catgatttat agcatttctg
     4561 acaccaggct tcagaaagtc tgtgaaggag cagcttgcac atgtgtggaa gctgactgtg
     4621 cgcaactgca ggcagaagtg gacctagcca tctctgcaga ctccagaaaa gagaaagcct
     4681 gtaaaccaga gactgcatat gcttataaag tcaggatcac atcagccact gaagaaaatg
     4741 tttttgtcaa gtacactgcg actcttctgg tcacttacaa aacaggggaa gctgctgatg
     4801 agaattcgga ggtcaccttc attaaaaaga tgagctgtac caatgccaac ctggtgaaag
     4861 ggaagcagta tttaatcatg ggcaaagagg ttctgcagat caaacacaat ttcagtttca
     4921 agtatatata ccctctagat tcctccacct ggattgaata ttggcccaca gacacaacgt
     4981 gtccatcctg tcaagcattt gtagagaatt tgaataactt tgctgaagac ctctttttaa
     5041 acagctgtga atgaaaagtt ctgctgcacg aagattcctc ctgcggcggg gggatttctc
     5101 ctcctctggc ttggaaacct agcctagaat cagatacact ttctttagag taaagcacaa
     5161 gctgatgagt tacgactttg tgaaatggat agccttgagg ggaggcgaaa acaggtcccc
     5221 caaggctatc agacgtcagt gccaatagac tgaaacaagt ctgtaaagtt agcagtcagg
     5281 ggtgttggtt ggggccggaa gaagagaccc actgaaactg tagcccctta tcaaaacata
     5341 tccttgcttg aaagaaaaat accaaggaca gaaaatgcca taaaatcttg actttgcact
     5401 c
//
LOCUS       MUSC5PRO     5403 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Mouse complement component C5S (pro-C5) mRNA, complete cds.
ACCESSION   M35525 M15079 J05234
KEYWORDS    clotting factor; complement component C5; complement protein.
SOURCE      Mouse (strain B10.D2/nSnJ) liver, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 141 to 5403)
  AUTHORS   Wetsel,R.A., Ogata,R.T. and Tack,B.F.
  TITLE     Primary structure of the fifth component of murine complement
  JOURNAL   Biochemistry 26, 737-743 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 5403)
  AUTHORS   Wetsel,R.A., Fleischer,D.T. and Haviland,D.L.
  TITLE     Deficiency of the murine fifth complement component (C5): A 2- base
            pair gene deletion in a 5'-exon
  JOURNAL   J. Biol. Chem. 265, 2435-2440 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Noack, 02-APR-1987; for [2] by R.A.Wetsel, 01-DEC-1989.
            
            For the C5D sequence see accession # M35526.
FEATURES       from  to/span     description
    pept         14     5056     complement component C5S precursor
    sigp         14       67     complement component C5S signal peptide
    matp         68     2035     complement component pro-C5S beta-chain
    matp       2048     5053     complement component pro-C5S alpha-chain
    mut         659      660     2 bp deletion in C5D
BASE COUNT     1640 a   1212 c   1219 g   1332 t
ORIGIN      Chromosome 2.
        1 gccgctacca gccatgggtc tttggggaat actttgtctt ttaattttcc tggacaaaac
       61 ttggggacag gaacaaacct acgtcatttc agcacccaaa atcctccggg tcggctcgtc
      121 tgaaaatgtg gtaattcaag tccatggcta cactgaagca tttgatgcaa ctctttctct
      181 aaaaagctat cctgacaaaa aagtcacctt ctcttcaggc tatgttaatt tgtccccgga
      241 aaacaaattc caaaacgcgg cactgttgac actacagccc aatcaagttc ctagagaaga
      301 aagcccagtc tctcacgtgt atctggaagt tgtgtcaaaa cacttttcaa aatcaaagaa
      361 aataccaatt acctataaca atggaattct cttcatccat acagacaaac ctgtttacac
      421 gccggaccag tcagtaaaga tcagagtcta ttctctgggt gacgacttga agccagccaa
      481 acgggagact gtcttaactt tcatagaccc cgaaggatca gaagttgaca ttgtagaaga
      541 aaatgattac accggaatta tctcttttcc tgacttcaag attccatcta atcccaagta
      601 tggtgtttgg acaattaaag ctaactataa gaaggatttt acaacaactg gaactgcata
      661 ctttgaaatt aaagaatatg tcttgccacg attctctgtt tcaatagaac tagaaagaac
      721 cttcattggc tataaaaact ttaagaactt tgaaatcact gtgaaagcaa gatattttta
      781 taataaagtg gtacctgatg ctgaagtgta tgcctttttt ggattgagag aggacataaa
      841 agatgaggag aagcagatga tgcacaaagc cacacaagcc gcaaagttgg ttgacggagt
      901 tgctcagatc tcttttgatt ctgaaacagc agttaaagag ctgtcctaca acagtctaga
      961 agacttaaac aacaagtacc tttatattgc agtaacagtc acagaatctt caggtggatt
     1021 ttcagaagag gcagaaatcc ctggagtcaa atatgtcctc tctccctaca cactgaattt
     1081 ggtcgctact cctcttttcg tgaagcccgg gattccattt tccatcaagg cacaggttaa
     1141 agattcactc gagcaggcgg taggaggggt cccagtaact ctgatggcac aaacagtcga
     1201 tgtgaatcaa gagacatctg acttggaaac aaagaggagc atcactcatg acactgatgg
     1261 agtagctgtg tttgtgctga acctcccatc aaatgtgacg gtgctaaagt ttgagatcag
     1321 aactgatgac ccagaacttc ccgaagaaaa tcaagccagc aaagagtacg aagcagttgc
     1381 gtactcgtct ctcagccaaa gttacattta catcgcttgg actgaaaact acaagcccat
     1441 gcttgtggga gaatacctga atattatggt tacccccaag agcccatata tcgacaaaat
     1501 aactcactat aattacttga ttttatccaa aggcaaaatt gtacagtacg gcacaagaga
     1561 gaaacttttc tcctcaactt atcaaaatat aaatattcca gtgacacaga acatggttcc
     1621 ttcagcacga ctcctggtct attacatagt cacaggggag caaacagcag aattagtggc
     1681 tgacgcagtc tggataaata ttgaggagaa gtgtggcaac cagctccagg tccatctgtc
     1741 tccagatgaa tatgtgtatt ctccaggcca aactgtgtcc cttgacatgg tgactgaagc
     1801 agactcatgg gtagcactat cagcagtgga cagagctgtg tataaagtcc agggaaacgc
     1861 caaaagggcc atgcaaagag tctttcaagc tttggatgaa aagagtgacc tgggctgtgg
     1921 ggcaggtggt ggccatgaca atgcagatgt attccatcta gctgggctca ccttcctcac
     1981 caacgcaaac gcagatgact cccattatcg tgatgactct tgtaaagaaa ttctcaggtc
     2041 aaagagaaac ctgcatctcc taaggcagaa aatagaagaa caagctgcta agtacaaaca
     2101 tagtgtgcca aagaaatgct gctatgacgg agcccgagtg aacttctacg aaacctgtga
     2161 ggagcgagtg gcccgggtta ccataggccc tctctgcatc agggccttca acgagtgctg
     2221 tactattgcg aacaagatcc gaaaagaaag cccccataaa cctgtccaac tgggaaggat
     2281 ccacattaag accctgttac cagtgatgaa ggcagatatc cgaagctact ttccagagag
     2341 ctggctatgg gaaattcatc gcgttcccaa aagaaaacag ctgcaggtca cgctgcctga
     2401 ctcactaacg acttgggaaa ttcaaggcat tggcatttca gacaatggta tatgtgttgc
     2461 tgatacactc aaggcaaagg tgttcaaaga agtcttcctg gagatgaaca taccatattc
     2521 tgttgtgcga ggagaacaga tccaattgaa aggaactgtt tacaactata tgacctcagg
     2581 gacaaagttc tgtgttaaaa tgtctgctgt ggaggggatc tgcacttcag gaagctcagc
     2641 tgctagcctt cacacctcca ggccctccag atgtgtgttc cagaggatag agggctcgtc
     2701 cagtcacttg gtgaccttca ccctgcttcc tctggaaatt ggccttcact ccataaactt
     2761 ctcactagag acctcatttg ggaaagacat cttagtaaag acattacggg tagtgccaga
     2821 aggagtcaag agggaaagct atgccggcgt gattctggac cctaagggaa ttcgtggtat
     2881 tgttaacaga cgaaaggaat tcccatacag gatcccatta gatttggtcc ccaagaccaa
     2941 agttgaaagg attttgagtg tcaaaggact gcttgtaggg gagttcttgt ccacggttct
     3001 gagtaaggaa ggcatcaaca tcctaaccca cctccccaag ggcagtgcag aggcagagct
     3061 catgagcata gctccggtgt tctatgtttt ccactacctg gaagcaggaa accattggaa
     3121 tattttctat cctgatacac tgagtaaaag acagagcctg gagaaaaaaa taaaacaagg
     3181 ggtggtgagc gtcatgtcct acagaaacgc tgactattcc tacagcatgt ggaagggggc
     3241 gagcgctagt acctggctga cagcttttgc tctgagagtg cttggacagg tggccaagta
     3301 tgtaaaacag gatgaaaact caatttgtaa ctctttgcta tggctggttg agaagtgtca
     3361 gctggaaaac ggctctttca aggaaaattc ccaatatcta ccaataaaat tacagggtac
     3421 tttgcctgct gaagcccaag agaaaacttt gtatcttaca gccttttctg tgattggaat
     3481 tagaaaggca gttgacatat gccccaccat gaaaatccac acagcgctag ataaagccga
     3541 ctccttcctg cttgaaaaca ccctgccatc caagagcacc ttcacactgg ccattgtagc
     3601 ctatgctctt tccctaggag acagaaccca cccgaggttt cgtctaattg tgtcggccct
     3661 gaggaaggaa gcttttgtta aaggtgatcc gcccatttac cgttactgga gagataccct
     3721 caaacgtcca gacagctctg tgcccagcag cggcacagca ggtatggttg aaaccacagc
     3781 ctatgctttg ctcgccagcc tgaaactgaa ggatatgaat tacgccaacc ccatcatcaa
     3841 gtggctatct gaagagcaga ggtatggagg cggcttttat tccacccagg atacgattaa
     3901 tgccatcgag ggcctgacag aatattcact cctgttaaaa caaattcatt tggatatgga
     3961 catcaatgtc gcctacaaac acgaaggtga cttccacaag tataaggtga cagagaagca
     4021 tttcctgggg aggccagtgg aggtatctct caatgatgac cttgttgtca gcacaggcta
     4081 cagcagtggc ttggccacag tatatgtaaa aactgtggtt cacaaaatta gtgtctctga
     4141 ggaattttgc agcttttact tgaaaattga tacccaagat attgaagcat ccagccactt
     4201 caggctcagt gactctggat tcaagcgcat aatagcatgt gccagctaca agcccagcaa
     4261 ggaggagtca acatccgggt cctcccatgc agtaatggat atatcactgc cgactggaat
     4321 cggagcaaac gaggaagatt tacgggctct tgtggaagga gtggatcaac tactaactga
     4381 ttaccagatc aaagatggcc atgtcattct gcaactgaat tcgatcccct ccagagattt
     4441 cctctgtgtc cggttccgga tatttgaact tttccaagtt gggtttctga atcctgctac
     4501 cttcacggtg tacgagtatc acagaccaga taagcagtgc accatgattt atagcatttc
     4561 tgacaccagg cttcagaaag tctgtgaagg agcagcttgc acatgtgtgg aagctgactg
     4621 tgcgcaactg caggcagaag tagacctagc catctctgca gactccagaa aagagaaagc
     4681 ctgtaaacca gagactgcat atgcttataa agtcaggatc acatcagcca ctgaagaaaa
     4741 tgtttttgtc aagtacactg cgactcttct ggtcacttac aaaacagggg aagctgctga
     4801 tgagaattcg gaggtcacct tcattaaaaa gatgagctgt accaatgcca acctggtgaa
     4861 agggaagcag tatttaatca tgggcaaaga ggttctgcag atcaaacaca atttcagttt
     4921 caagtatata taccctctag attcctccac ctggattgaa tattggccca cagacacaac
     4981 gtgtccatcc tgtcaagcat ttgtagagaa tttgaataac tttgctgaag acctcttttt
     5041 aaacagctgt gaatgaaaag ttctgctgca cgaagattcc tcctgcggcg gggggattgc
     5101 tcctcctctg gcttggaaac ctagcctaga atcagataca ctttctttag agtaaagcac
     5161 aagctgatga gttacgactt tgtgaaatgg atagccttga ggggaggcga aaacaggtcc
     5221 cccaaggcta tcagatgtca gtgccaatag actgaaacaa gtctgtaaag ttagcagtca
     5281 ggggtgttgg ttggggccgg aagaagagac ccactgaaac tgtagcccct tatcaaaaca
     5341 tatccttgct tgaaagaaaa ataccaagga cagaaaatgc cataaaatct tgactttgca
     5401 ctc
//
LOCUS       HUMENN       1592 bp ss-mRNA            PRI       26-JUL-1990
DEFINITION  Human endonexin II mRNA, complete cds.
ACCESSION   J03745
KEYWORDS    Ca2+ -dependent phospholipid binding protein; endonexin.
SOURCE      Human placenta, cDNA to mRNA, (library of Clonetech Laboratories
            Inc.).
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1592)
  AUTHORS   Kaplan,R., Jaye,M., Burgess,W.H., Schlaepfer,D.D. and Haigler,H.T.
  TITLE     Cloning and expression of cDNA for human endonexin II, a Ca2+ and
            phospholipid binding protein
  JOURNAL   J. Biol. Chem. 263, 8037-8043 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by H.T.Haigler, 06-APR-1988
FEATURES       from  to/span     description
    pept        160     1122     endonexin II /hgml_locus_uid="LS0217S"
                                 /nomgen="ENX2" /map="4q28-q32"
    mRNA    <     1     1592     endonexin II mRNA
BASE COUNT      434 a    337 c    366 g    455 t
ORIGIN      284 bp upstream of HincII site.
        1 ttggatcagt ctaggtgcag ctgccggatc cttcagcgtc tgcatctcgg cgtcgcccgc
       61 gtaccgtcgc ccggctctcc gccgctctcc cggggtttcg gggcacttgg gtcccacagt
      121 ctggtcctgc ttcaccttcc cctgacctga gtagtcgcca tggcacaggt tctcagaggc
      181 actgtgactg acttccctgg atttgatgag cgggctgatg cagaaactct tcggaaggct
      241 atgaaaggct tgggcacaga tgaggagagc atcctgactc tgttgacatc ccgaagtaat
      301 gctcagcgcc aggaaatctc tgcagctttt aagactctgt ttggcaggga tcttctggat
      361 gacctgaaat cagaactaac tggaaaattt gaaaaattaa ttgtggctct gatgaaaccc
      421 tctcggcttt atgatgctta tgaactgaaa catgccttga agggagctgg aacaaatgaa
      481 aaagtactga cagaaattat tgcttcaagg acacctgaag aactgagagc catcaaacaa
      541 gtttatgaag aagaatatgg ctcaagcctg gaagatgacg tggtggggga cacttcaggg
      601 tactaccagc ggatgttggt ggttctcctt caggctaaca gagaccctga tgctggaatt
      661 gatgaagctc aagttgaaca agatgctcag gctttatttc aggctggaga acttaaatgg
      721 gggacagatg aagaaaagtt tatcaccatc tttggaacac gaagtgtgtc tcatttgaga
      781 aaggtgtttg acaagtacat gactatatca ggatttcaaa ttgaggaaac cattgaccgc
      841 gagacttctg gcaatttaga gcaactactc cttgctgttg tgaaatctat tcgaagtata
      901 cctgcctacc ttgcagagac cctctattat gctatgaagg gagctgggac agatgatcat
      961 accctcatca gagtcatggt ttccaggagt gagattgatc tgtttaacat caggaaggag
     1021 tttaggaaga attttgccac ctctctttat tccatgatta agggagatac atctggggac
     1081 tataagaaag ctcttctgct gctctgtgga gaagatgact aacgtgtcac ggggaagagc
     1141 tccctgctgt gtgcctgcac caccccactg ccttccttca gcacctttag ctgcatttgt
     1201 atgccagtgc ttaacacatt gccttattca tactagcatg ctcatgacca acacatacac
     1261 gtcatagaat gaaaatagtg gtgcttcttt ctgatctcta gtggagatct ctttgactgc
     1321 tgtagtacta aagtgtactt aatgttacta agtttaatgc ctggccattt tccatttata
     1381 tatatttttt aagaggctag agtgctttta gcctttttta aaaactccat ttatattaca
     1441 tttgtaacca tgatacttta atcagaagct tagccttgaa attgtgaact cttggaaatg
     1501 ttattagtga agttcgcaac taaactaaac ctgtaaaatt atgatgattg tattcaaaag
     1561 attaatgaaa aataaacatt tctgtccccc tg
//
LOCUS       CPAFPRFA     1277 bp ds-DNA             ORG       26-JUL-1990
DEFINITION  C.paradoxa cyanelle ferredoxin (petF) and ribosomal protein S10
            (rps10; rpsJ) genes, complete cds, and elongation factor Tu (tufA)
            gene, 5' end.
ACCESSION   M35206
KEYWORDS    elongation factor Tu; ferredoxin; ribosomal protein S10.
SOURCE      C.paradoxa (isolate UTEX LB555) cyanelle DNA, clone pCpcGP1.3.
  ORGANISM  Cyanelle Cyanophora paradoxa
            Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; 
            Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa.
REFERENCE   1  (bases 1 to 1277)
  AUTHORS   Bryant,D.A., Schluchter,W.M. and Stirewalt,V.L.
  TITLE     Ferredoxin and ribosomal protein S10 are encoded on the cyanelle
            genome of Cyanophora paradoxa
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.A.Bryant, 14-JUN-1990.
            
              Author address: D.A.Bryant
              s-101 Frear Bldg.
              Dept. of Mol. and Cell Biol.
              Pennsylvania State University
              University Park, PA 16802
              email: DAB14@PSUVM
            
FEATURES       from  to/span     description
    pept        338      637     ferredoxin (petF)
    pept       1027      710 (c) ribosomal protein S10 (rps10; rpsJ)
    pept    >  1277     1114 (c) elongation factor Tu (tufA; AA at 1275)
    binding     313      318     ribosome binding site
    rpt         643      694     inverted repeat
    rpt        1065     1097     inverted repeat
BASE COUNT      477 a    170 c    163 g    467 t
ORIGIN      
        1 agatcttatc taagatatgt aaataaataa aaatatatat ctatatttat agtatatatt
       61 aatttttttt aaaaatcgat actaaattta aattttcctt ttttttcttt ataaaaattt
      121 aattttaaat agaaaaaatt aagtttttcg aaaaaagcaa ttaaaacata ttaaaaaaaa
      181 attaataaac atggtaaact ttaaatataa atttataatt aactgaaaaa ataataaaaa
      241 taaatttata tatatatata ttttagatta aaataattta aattaaatta ttaaaagttc
      301 taccttgtaa ctataattat ttaggagata gtattttatg gcagtatata aagttcgtct
      361 tatttgtgaa gaacaaggtt tagataccac tattgaatgt ccagatgatg agtacattct
      421 tgatgcagca gaagaacaag gtattgattt accatactcc tgtcgtgcag gtgcatgttc
      481 tacttgtgca ggtaaagtgg tagaaggaac tgtagatcaa tctgatcaat ctttcttaga
      541 tgacgctcaa ttagcagctg gttatgtatt aacttgtgta gcatacccat cttctgactg
      601 tacagttaaa actcaccaag aagaatctct ttactaaaaa ataaaaaatc taaataataa
      661 aatagaaatc tctattttat tatttagatt ttcttaattc aaaaaaaaac taaagtttaa
      721 cttccacatc aacacctgct ggtaaatcta aacgagttaa agtatcaatt gttttggaag
      781 atggtaaata taaatcaatt attctgcgat gaactctaat ttcgaaatgt tctcgtgaat
      841 ctttatctac atgtggggaa cgtaaaacgc aataaatttt cttttttgtt ggtaaaggaa
      901 taggtcctac tgcggtagca tcagttcgtt ttgcagcttc aataatttgt tcacatgagt
      961 tttctaataa tgaagagtca taagaacgta gttgaatacg aatttttaat tgttgattac
     1021 tggccataat ttttaatttt taatttttat tttttaaatt aaaaagagag aaataaatac
     1081 attttctatt tctctctaaa atttagattt taattatttt aaaatcttag aaactacacc
     1141 tgcaccaatt gtacgaccac cttcacgaat cgcgaaacgc ataccttgtt caatcgcaat
     1201 tggatgtact aaacttactg tcattttaat acgatctcct ggcataacca tttctgcatt
     1261 actaccatca tctgcag
//
LOCUS       MUSIGHZSA     333 bp ds-DNA             ROD       26-JUL-1990
DEFINITION  Mouse Ig germline H-chain gene, D region.
ACCESSION   M35332
KEYWORDS    diversity exon; germline; immunoglobulin heavy chain.
SOURCE      Mouse liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 333)
  AUTHORS   Landolfi,N.F., Capra,J.D. and Tucker,P.W.
  TITLE     Germ-line sequence of the D-H segment employed in Ars-A antibodies:
            Implications for the generation of junctional diversity
  JOURNAL   J. Immunol. 137, 362-365 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /   107  /   129     Ig heavy chain D region (AA at 107; 107 could
                                 be 109)
    iDNA    <     1      106     V-D intervening DNA
    iDNA        130  >   333     D-J intervening DNA
BASE COUNT       95 a     98 c     62 g     78 t
ORIGIN      
        1 tgacaactga aactcaaccg tgctgcctgg cccccaatgc tctctacacc tgcaaaacca
       61 gagaccatac tggccagtgc tttttgtgaa gggatctact actgtgttta ttactatggt
      121 ggtagctacc acagtgctat atccatcagc aaaaacccat tgtgcccagc agactcttga
      181 gctcgaaaaa ctgagtctag aaaagctggc atcacggggt ttatatcccg agtcttgacc
      241 actgacccat taatactatc caacacagag ctctccgtct gcccacaaag aaatccaacc
      301 accctaaagt cagatcctct agagtcgacc tgc
//
LOCUS       WHTREPTA      295 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD1.
ACCESSION   M35329
KEYWORDS    repetitive DNA.
SOURCE      T.monococcum aegilopoides leaf DNA, clone pTbUCD1.
  ORGANISM  Triticum monococcum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 295)
  AUTHORS   Dvorak,J., McGuire,P.E. and Cassidy,B.
  TITLE     Apparent sources of the A genomes of wheats inferred from
            polymorphism in abundance and restriction fragment length of
            repeated nucleotide sequences
  JOURNAL   Genome 30, 680-689 (1988)
  STANDARD  simple staff_review
BASE COUNT       72 a     58 c     66 g     99 t
ORIGIN      
        1 tccagacttg ggtaacaggg tgtgccttag aatcccagtt gatagtgggc agtcctgaca
       61 gaagatagtg cactgagcca aacttgaatg tgtcaagtgc ttcattcgga atctccttgt
      121 acatgttgaa catagagttg tggtccatct ttttcttggc ataaatgtcc aagtcatctg
      181 cttgctcctc tggggcattg atcattataa gtaatagtct tttcttcaac tttaataggt
      241 gcagctactt ttacttctat gggaggatga tatttaaacc acttctcctt gggga
//
LOCUS       WHTREPTB      273 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD2.
ACCESSION   M35330
KEYWORDS    repetitive DNA.
SOURCE      T.monococcum aegilopoides leaf DNA, clone pTbUCD2.
  ORGANISM  Triticum monococcum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 273)
  AUTHORS   Dvorak,J., McGuire,P.E. and Cassidy,B.
  TITLE     Apparent sources of the A genomes of wheats inferred from
            polymorphism in abundance and restriction fragment length of
            repeated nucleotide sequences
  JOURNAL   Genome 30, 680-689 (1988)
  STANDARD  simple staff_review
BASE COUNT       58 a     44 c     74 g     97 t
ORIGIN      
        1 ctggccatgg agggcctatg tagatagaca ggcttcgaga agcttctttc tttctagtgt
       61 ctgtactcag accggttgct tccgcatgtg cttgtatgag tgtatgactt gagtgtcggg
      121 tcatgtgacc cctatctgta tgaacatgtt atgtatggct ctctagagcc tttaaataaa
      181 gtacttgagt tgtagagtat tgttgtgatg ccatgttgta tgtactcata tcgggcatat
      241 tgtgtgtatg attgaaatgc ttggtatgag tgg
//
LOCUS       WHTREPTC      229 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD3.
ACCESSION   M35331
KEYWORDS    repetitive DNA.
SOURCE      T.monococcum aegilopoides leaf DNA, clone pTbUCD3.
  ORGANISM  Triticum monococcum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 229)
  AUTHORS   Dvorak,J., McGuire,P.E. and Cassidy,B.
  TITLE     Apparent sources of the A genomes of wheats inferred from
            polymorphism in abundance and restriction fragment length of
            repeated nucleotide sequences
  JOURNAL   Genome 30, 680-689 (1988)
  STANDARD  simple staff_review
BASE COUNT       85 a     55 c     37 g     52 t
ORIGIN      
        1 caaattagct actccagtat gtaaaaacct gtttgtccaa cacttagcag atttcactct
       61 tgatagatca ctagcaatag ctcccgcaaa atcgcaaaag agttcatgat ctgcccaaaa
      121 caacaactat gcaaaagttg agctcgattg agtcaaccta gggtgctcca acataacaag
      181 taaagacatg gatggattaa gcacaacaag catgacaaac cactcttac
//
LOCUS       RATMTXXX      169 bp ds-DNA             ORG       26-JUL-1990
DEFINITION  Rat mitochondrial HindIII fragment.
ACCESSION   M35251
KEYWORDS    .
SOURCE      Rat mitochondrial DNA.
  ORGANISM  Mitochondrion Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; 
            Rattus norvegicus.
REFERENCE   1  (bases 1 to 169)
  AUTHORS   Brown,G.G., Castora,F.J., Frantz,S.C. and Simpson,M.V.
  TITLE     Mitochondrial DNA polymorphism: Evolutionary studies on the genus
            Rattus
  JOURNAL   Ann. N.Y. Acad. Sci. 361, 135-153 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    ORF     <     1  >   169     ORF
    allele       56       56     a in type A; g in type B
    allele       80       80     a in type A; g in type B
    allele      122      122     t in type A; g in type B
BASE COUNT       44 a     24 c     43 g     58 t
ORIGIN      
        1 agcttgctaa tagtcatcat gttgctatca atggaaagat tatttgtaat cctcgagcta
       61 taattatagt tcggctgtga attcgttcgt agttggtgtt tgctaggcag aataagagtg
      121 atgaggttaa gccgtgggcg attattagta ttgtagctcc catgaagct
//
LOCUS       MUSCRABP      868 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Mouse cellular retinoic acid-binding protein (CRABP-II) mRNA,
            complete cds.
ACCESSION   M35523
KEYWORDS    cellular retinoic acid-binding protein.
SOURCE      Mouse 12.5 day old embryo, cDNA to mRNA, clone lambda-mE2.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 868)
  AUTHORS   Giguere,V., Lyn,S., Yip,P., Siu,C.-H. and Amin,S.
  TITLE     Molecular cloning of a novel cellular retinoic acid-binding protein
            expressed during mouse embryogenesis and in adult skin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by V.Giguere, 22-JUN-1990.
FEATURES       from  to/span     description
    pept        116      532     cellular retinoic acid-binding protein
BASE COUNT      226 a    218 c    220 g    204 t
ORIGIN      Chromosome 2.
        1 gaattccggg gaggatctgt tctgcaaagg agacagcaaa gtatctttag cctaaaggac
       61 tcagcgtcca gtgttctagt tgaagatcta aagagaaagc caccttgctg ccactatgcc
      121 taacttttct ggcaactgga agatcatccg atcggaaaac tttgaggaaa tgctaaaagc
      181 tctgggggtg aacatgatga tgaggaagat cgctgtggct gcagcctcca agccagcagt
      241 cgagatcaaa caggagaatg acactttcta catcaaaacc tccaccactg tgcgaaccac
      301 ggagattaac ttcaagatcg gggaggaatt tgaggagcag accgtggatg ggagaccctg
      361 taagagtttg gtgaaatggg agagtggaaa caaaatggtg tgcgagcaga ggcttctgaa
      421 gggggagggc cccaagacct cctggagccg agaactgacc aatgatggag agctgatcct
      481 gacaatgaca gcagatgacg ttgtgtgcac cagggtctac gtccgagagt gagtgcctac
      541 gggtccaaga actgcctgag acgacttctg tgcccgctac aggacacaaa cctccctccc
      601 acgtccatct tacaaactag ctctcccctt actcctgagg gttactgctt cctccaaggc
      661 cttttgttct ttgccttctc tacgccagag aggggcagaa gctcagaacc ctcccaccgc
      721 catttgcccc tcccaggtca gcagtcccag ctccatacca gggtccttcc tggaagagac
      781 tgtctctctg gcctctactc cttatccttg tagtctgtgt gatttagaat atttattggt
      841 taattttatt aaaatgtttc cggaattc
//
LOCUS       BTHCRYIA     4320 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  B.thuringiensis delta-endotoxin gene, complete cds.
ACCESSION   M35524
KEYWORDS    delta-endotoxin.
SOURCE      B.thuringiensis kenyae (strain HD588) DNA.
  ORGANISM  Bacillus thuringiensis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 4320)
  AUTHORS   Von Tersch,M.A., Loidl,R.H., Jany,C.S. and Johnson,T.B.
  TITLE     Insecticidal toxin genes from Bacillus thuringiensis variety
            kenyae: Cloning characterization and comparative studies
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by M.A.Von Tersch, 19-JUN-1990.
            
               Author address: M.A.Von Tersch
               Ecogen Inc.
               2005 Cabot Blvd. West
               Loanghorne PA 19047
FEATURES       from  to/span     description
    pept        239     3772     delta-endotoxin
    binding     228      232     ribosome binding site
BASE COUNT     1392 a    724 c    909 g   1295 t
ORIGIN      
        1 gttaacggaa tacaaaccct taatgcattg gttaaacatt gtaaagtcta aagcatggat
       61 aatgggcgag aagtaagtag attgttaaca ccctgggtca aaaattgata tttagtaaaa
      121 ttagttgcac tttgtgcatt ttttcataag atgagtcata tgttttaaat tgtagtaatg
      181 aaaaacagta ttatatcata atgaattggt atcttaataa aagagatgga ggtaacttat
      241 ggataacaat ccgaacatca atgaatgcat tccttataat tgtttaagta accctgaagt
      301 agaagtatta ggtggagaaa gaatagaaac tggttacacc ccaatcgata tttccttgtc
      361 gctaacgcaa tttcttttga gtgaatttgt tcccggtgct ggatttgtgt taggactagt
      421 tgatataata tggggaattt ttggtccctc tcaatgggac gcatttcttg tacaaattga
      481 acagttaatt aaccaaagaa tagaagaatt cgctaggaac caagccattt ctagattaga
      541 aggactaagc aatctttatc aaatttacgc agaatctttt agagagtggg aagcagatcc
      601 tactaatcca gcattaagag aagagatgcg tattcaattc aatgacatga acagtgccct
      661 tacaaccgct attcctcttt tggcagttca aaattatcaa gttcctcttt tatcagtata
      721 tgttcaagct gcaaatttac atttatcagt tttgagagat gtttcagtgt ttggacaaag
      781 gtggggattt gatgccgcga ctatcaatag tcgttataat gatttaacta ggcttattgg
      841 caactataca gatcatgctg tacgctggta caatacggga ttagaacgtg tatggggacc
      901 ggattctaga gattgggtaa ggtataatca atttagaaga gaattaacac taactgtatt
      961 agatatcgtt gctctgttcc cgaattatga tagtagaaga tatccaattc gaacagtttc
     1021 ccaattaaca agagaaattt atacaaaccc agtattagaa aattttgatg gtagttttcg
     1081 aggctcggct cagggcatag aaagaagtat taggagtcca catttgatgg atatacttaa
     1141 cagtataacc atctatacgg atgctcatag gggttattat tattggtcag ggcatcaaat
     1201 aatggcttct cctgtcggtt tttcggggcc agaattcacg tttccgctat atggaaccat
     1261 gggaaatgca gctccacaac aacgtattgt tgctcaacta ggtcagggcg tgtatagaac
     1321 attatcctct actttttata gaagaccttt taatataggg ataaataatc aacaactatc
     1381 tgttcttgac gggacagaat ttgcttatgg aacctcctca aatttgccat ccgctgtata
     1441 cagaaaaagc ggaacggtag attcgctgga tgaaatacca ccacagaata acaacgtgcc
     1501 acctaggcaa ggatttagtc atcgattaag ccatgtttca atgtttcgtt caggctctag
     1561 tagtagtgta agtataataa gagctcctat gttctcttgg atacatcgta gtgctgaatt
     1621 taataatata attgcatcgg atagtattac tcaaatccct gcagtgaagg gaaactttct
     1681 ttttaatggt tctgtaattt caggaccagg atttactggt ggggacttag ttagattaaa
     1741 tagtagtgga aataacattc agaatagagg gtatattgaa gttccaattc acttcccatc
     1801 gacatctacc agatatcgag ttcgtgtacg gtatgcttct gtaaccccga ttcacctcaa
     1861 cgttaattgg ggtaattcat ccattttttc caatacagta ccagctacag ctacgtcatt
     1921 agataatcta caatcaagtg attttggtta ttttgaaagt gccaatgctt ttacatcttc
     1981 attaggtaat atagtaggtg ttagaaattt tagtgggact gcaggagtga taatagacag
     2041 atttgaattt attccagtta ctgcaacact cgaggctgaa tataatctgg aaagagcgca
     2101 gaaggcggtg aatgcgctgt ttacgtctac aaaccaacta gggctaaaaa caaatgtaac
     2161 ggattatcat attgatcaag tgtccaattt agttacgtgt ttatcggatg aattttgtct
     2221 ggatgaaaag cgagaattgt ccgagaaagt caaacatgcg aagcgactca gtgatgaacg
     2281 caatttactc caagattcaa atttcaaaga cattaatagg caaccagaac gtgggtgggg
     2341 cggaagtaca gggattacca tccaaggagg ggatgacgta tttaaagaaa attacgtcac
     2401 actatcaggt acctttgatg agtgctatcc aacatatttg tatcaaaaaa tcgatgaatc
     2461 aaaattaaaa gcctttaccc gttatcaatt aagagggtat atcgaagata gtcaagactt
     2521 agaaatctat ttaattcgct acaatgcaaa acatgaaaca gtaaatgtgc caggtacggg
     2581 ttccttatgg ccgctttcag cccaaagtcc aatcggaaag tgtggagagc cgaatcgatt
     2641 cgcgccacac cttgaatgga atcctgactt agattgttcg tgtagggatg gagaaaagtg
     2701 tgcccatcat tcgcatcatt tctccttaga cattgatgta ggatgtacag acttaaatga
     2761 ggacctaggt gtatgggtga tctttaagat taagacgcaa gatgggcacg caagactagg
     2821 gaatctagag tttctcgaag agaaaccatt agtaggagaa gcgctagctc gtgtgaaaag
     2881 agcggagaaa aaatggagag acaaacgtga aaaattggaa tgggaaacaa atatcgttta
     2941 taaagaggca aaagaatctg tagatgcttt atttgtaaac tctcaatatg atcaattaca
     3001 agcggatacg aatattgcca tgattcatgc ggcagataaa cgtgttcata gcattcgaga
     3061 agcttatctg cctgagctgt ctgtgattcc gggtgtcaat gcggctattt ttgaagaatt
     3121 agaagggcgt attttcactg cattctccct atatgatgcg agaaatgtca ttaaaaatgg
     3181 tgattttaat aatggcttat cctgctggaa cgtgaaaggg catgtagatg tagaagaaca
     3241 aaacaaccaa cgttcggtcc ttgttgttcc ggaatgggaa gcagaagtgt cacaagaagt
     3301 tcgtgtctgt ccgggtcgtg gctatatcct tcgtgtcaca gcgtacaagg agggatatgg
     3361 agaaggttgc gtaaccattc atgagatcga gaacaataca gacgaactga agtttagcaa
     3421 ctgcgtagaa gaggaaatct atccaaataa cacggtaacg tgtaatgatt atactgtaaa
     3481 tcaagaagaa tacggaggtg cgtacacttc tcgtaatcga ggatataacg aagctccttc
     3541 cgtaccagct gattatgcgt cagtctatga agaaaaatcg tatacagatg gacgaagaga
     3601 gaatccttgt gaatttaaca gagggtatag ggattacacg ccactaccag ttggttatgt
     3661 gacaaaagaa ttagaatact tcccagaaac cgataaggta tggattgaga ttggagaaac
     3721 ggaaggaaca tttatcgtgg acagcgtgga attactcctt atggaggaat agtctcatgc
     3781 aaactcaggt ttaaatatcg ttttcaaatc aattgtccaa gagcagcatt acaaatagat
     3841 aagtaatttg ttgtaatgaa aaacggacat cacctccatt gaaacggagt gatgtccgtt
     3901 ttactatgtt attttctagt aatacatatg tatagagcaa cttaatcaag cagagatatt
     3961 ttcacctatc gatgaaaata tctctgcttt ttcttttttt atttggtata tgctttactt
     4021 gtaatcgaaa ataaagcact aatagggtgt ttttgcccat cccttcggga aatcaagact
     4081 aaaatgaaaa ataaacagaa aatataaggc tcttactttg tggatatgac cacaaagtaa
     4141 gagccttatt tcattaaatt tgttcataca tttttccttg tagtcttttg ttttcatcct
     4201 ttaatcgcct attctcgtac tctacttcct tgattcgatc ccgtaataat tgaatcattg
     4261 catctttatt ttcatcactc attttccgtt tttcgaattt tggagataca gctcgttgct
//
LOCUS       HUMHBLOD     3373 bp ss-mRNA            PRI       26-JUL-1990
DEFINITION  Human GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase
            mRNA, complete cds.
ACCESSION   M35531
KEYWORDS    GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase.
SOURCE      Human epidermal carcinoma cell line A431, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3373)
  AUTHORS   Larsen,R.D., Ernst,L.K., Nair,R.P. and Lowe,J.B.
  TITLE     Molecular cloning, sequence and expression of a human GDP-L-fucose:
            Beta-D-galactoside 2-alpha-l-fucosyltransferase cDNA that can be
            from the H blood group antigen
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  ull staff_review staff_entry
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by J.B.Lowe, 22-JUN-1990.
FEATURES       from  to/span     description
    pept        104     1201     GDP-L-fucose:beta-D-galactoside
                                 2-alpha-L-fucosyltransferase
    site       1744     2385     Alu sequence homologue
BASE COUNT      687 a    925 c    905 g    856 t
ORIGIN      
        1 gcctggcgtt ccaggggcgg ccggatgtgg cctgcctttg cggagggtgc gctccggcca
       61 cgaaaagcgg actgtggatc tgccacctgc aagcagctcg gccatgtggc tccggagcca
      121 tcgtcagctc tgcctggcct tcctgctagt ctgtgtcctc tctgtaatct tcttcctcca
      181 tatccatcaa gacagctttc cacatggcct aggcctgtcg atcctgtgtc cagaccgccg
      241 cctggtgaca cccccagtgg ccatcttctg cctgccgggt actgcgatgg gccccaacgc
      301 ctcctcttcc tgtccccagc accctgcttc cctctccggc acctggactg tctaccccaa
      361 tggccggttt ggtaatcaga tgggacagta tgccacgctg ctggctctgg cccagctcaa
      421 cggccgccgg gcctttatcc tgcctgccat gcatgccgcc ctggccccgg tattccgcat
      481 caccctgccc gtgctggccc cagaagtgga cagccgcacg ccgtggcggg agctgcagct
      541 tcacgactgg atgtcggagg agtacgcgga cttgagagat cctttcctga agctctctgg
      601 cttcccctgc tcttggactt tcttccacca tctccgggaa cagatccgca gagagttcac
      661 cctgcacgac caccttcggg aagaggcgca gagtgtgctg ggtcagctcc gcctgggccg
      721 cacaggggac cgcccgcgca cctttgtcgg cgtccacgtg cgccgtgggg actatctgca
      781 ggttatgcct cagcgctgga agggtgtggt gggcgacagc gcctacctcc ggcaggccat
      841 ggactggttc cgggcacggc acgaagcccc cgttttcgtg gtcaccagca acggcatgga
      901 gtggtgtaaa gaaaacatcg acacctccca gggcgatgtg acgtttgctg gcgatggaca
      961 ggaggctaca ccgtggaaag actttgccct gctcacacag tgcaaccaca ccattatgac
     1021 cattggcacc ttcggcttct gggctgccta cctggctggc ggagacactg tctacctggc
     1081 caacttcacc ctgccagact ctgagttcct gaagatcttt aagccggagg cggccttcct
     1141 gcccgagtgg gtgggcatta atgcagactt gtctccactc tggacattgg ctaagccttg
     1201 agagccaggg agactttctg aagtagcctg atctttctag agccagcagt acgtggcttc
     1261 agaggcctgg catcttctgg agaagcttgt ggtgttcctg aagcaaatgg gtgcccgtat
     1321 ccagagtgat tctagttggg agagttggag agaaggggga cgtttctgga actgtctgaa
     1381 tattctagaa ctagcaaaac atcttttcct gatggctggc aggcagttct agaagccaca
     1441 gtgcccacct gctcttccca gcccatatct acagtacttc cagatggctg cccccaggaa
     1501 tggggaactc tccctctggt ctactctaga agaggggtta cttctcccct gggtcctcca
     1561 aagactgaag gagcatatga ttgctccaga gcaagcattc accaagtccc cttctgtgtt
     1621 tctggagtga ttctagaggg agacttgttc tagagaggac caggtttgat gcctgtgaag
     1681 aaccctgcag ggcccttatg gacaggatgg ggttctggaa atccagataa ctaaggtgaa
     1741 gaatcttttt agtttttttt tttttttttt ggagacaggg tctcgctctg ttgcccaggc
     1801 tggagtgcag tggcgtgatc ttggctcact gcaacttccg cctcctgtgt tcaagcgatt
     1861 ctcctgtctc agcctcctga gtagatggga ctacaggcac aggccattat gcctggctaa
     1921 tttttgtatt tttagtagag acagggtttc accatgttgg ccgggatggt ctcgatctcc
     1981 tgaccttgtc atccacctgt cttggcctcc caaagtgctg ggattactgg catgagccac
     2041 tgtgcccagc ccggatattt ttttttaatt atttatttat ttatttattt attgagacgg
     2101 agtcttgctc tgtagcccag gccagagtgc agtggcgcga tctcagctca ctgcaagctc
     2161 tgcctcccgg gttcatgcca ttctgcctca gcctcctgag tagctgggac tacaggcgcc
     2221 cgccaccacg cccggctaat tttttttgta tttttagtag agacggggtt tcatcgtgtt
     2281 aaccaggatg gtctcgatct cctgacctcg tgatctgccc acctcggcct cccacagtgc
     2341 tgggattacc ggcgtgagcc accatgcctg gcccggataa ttttttttaa tttttgtaga
     2401 gacgaggtct tgtgatattg cccaggctgt tcttcaactc ctgggctcaa gcagtcctcc
     2461 caccttggcc tcccagaatg ctgggtttat agatgtgagc cagcacaccg ggccaagtga
     2521 agaatctaat gaatgtgcaa cctaattgta gcatctaatg aatgttccac cattgctgga
     2581 aaaattgaga tggaaaacaa accatctcta gttggccagc gtcttgctct gttcacagtc
     2641 tctggaaaag ctggggtagt tggtgagcag agcgggactc tgtccaacaa gccccacagc
     2701 ccctcaaaga cttttttttg tttgttttga gcagacaggc taaaatgtga acgtggggtg
     2761 agggatcact gccaaaatgg tacagcttct ggagcagaac tttccaggga tccagggaca
     2821 ctttttttta aagctcataa actgccaaga gctccatata ttgggtgtga gttcaggttg
     2881 cctctcacaa tgaaggaagt tggtctttgt ctgcaggtgg gctgctgagg gtctgggatc
     2941 tgttttctgg aagtgtgcag gtataaacac accctctgtg cttgtgacaa actggcaggt
     3001 accgtgctca ttgctaacca ctgtctgtcc ctgaactccc agaaccacta catctggctt
     3061 tgggcaggtc tgagataaaa cgatctaaag gtaggcagac cctggaccca gcctcagatc
     3121 caggcaggag cacgaggtct ggccaaggtg gacggggttg tcgagatctc aggagcccct
     3181 tgctgttttt tggagggtga aagaagaaac cttaaacata gtcagctctg atcacatccc
     3241 ctgtctactc atccagaccc catgcctgta ggcttatcag ggagttacag ttacaattgt
     3301 tacagtactg ttcccaactc agctgccacg ggtgagagag caggaggtat gaattaaaag
     3361 tctacagcac taa
//
LOCUS       MUSCRABPA     868 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Mouse cellular retinoic acid-binding protein (CRABP-II) mRNA,
            complete cds.
ACCESSION   M35523
KEYWORDS    cellular retinoic acid-binding protein.
SOURCE      Mouse 12.5 day old embryo, cDNA to mRNA, clone lambda-mE2.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 868)
  AUTHORS   Giguere,V., Lyn,S., Yip,P., Siu,C.-H. and Amin,S.
  TITLE     Molecular cloning of a novel cellular retinoic acid-binding protein
            expressed during mouse embryogenesis and in adult skin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by V.Giguere, 22-JUN-1990.
FEATURES       from  to/span     description
    pept        116      532     cellular retinoic acid-binding protein
BASE COUNT      226 a    218 c    220 g    204 t
ORIGIN      Chromosome 2.
        1 gaattccggg gaggatctgt tctgcaaagg agacagcaaa gtatctttag cctaaaggac
       61 tcagcgtcca gtgttctagt tgaagatcta aagagaaagc caccttgctg ccactatgcc
      121 taacttttct ggcaactgga agatcatccg atcggaaaac tttgaggaaa tgctaaaagc
      181 tctgggggtg aacatgatga tgaggaagat cgctgtggct gcagcctcca agccagcagt
      241 cgagatcaaa caggagaatg acactttcta catcaaaacc tccaccactg tgcgaaccac
      301 ggagattaac ttcaagatcg gggaggaatt tgaggagcag accgtggatg ggagaccctg
      361 taagagtttg gtgaaatggg agagtggaaa caaaatggtg tgcgagcaga ggcttctgaa
      421 gggggagggc cccaagacct cctggagccg agaactgacc aatgatggag agctgatcct
      481 gacaatgaca gcagatgacg ttgtgtgcac cagggtctac gtccgagagt gagtgcctac
      541 gggtccaaga actgcctgag acgacttctg tgcccgctac aggacacaaa cctccctccc
      601 acgtccatct tacaaactag ctctcccctt actcctgagg gttactgctt cctccaaggc
      661 cttttgttct ttgccttctc tacgccagag aggggcagaa gctcagaacc ctcccaccgc
      721 catttgcccc tcccaggtca gcagtcccag ctccatacca gggtccttcc tggaagagac
      781 tgtctctctg gcctctactc cttatccttg tagtctgtgt gatttagaat atttattggt
      841 taattttatt aaaatgtttc cggaattc
//
LOCUS       YSCGLN3      3021 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  S.cerevisiae nitrogen regulatory protein (GLN3) gene, complete cds.
ACCESSION   M35267
KEYWORDS    nitrogen regulatory protein.
SOURCE      S.cerevisiae (strain S288C) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3021)
  AUTHORS   Minehart,P.L. and Magasanik,B.
  TITLE     Sequence and expression of GLN3, a positive nitrogen regulatory
            gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by P.Minehart, 19-JUN-1990.
            
                Author address: P.Minehart
                MIT, 56-428 MIT
                77 Mass Ave.
                Cambridge, MA 02139
FEATURES       from  to/span     description
    pept        730     2922     GLN3 protein
    signal      509      514     TATA box
    site          1      140     acidic activation region
BASE COUNT     1020 a    704 c    530 g    767 t
ORIGIN      Chromosome VL, map position 43cm distal to GCN4.
        1 gacgtcaact ccatagaagt gacttttccg ccaaagaaga ggacctcgcc ataagcaatg
       61 agaatgatcg tcagattctt gaaaattgtg tagatgggca cggcaaggta ttgtaagctc
      121 tttgacgacg tataaatcat caatacgagc agcaaagaaa ttggaaacca gttttttaca
      181 tctgtcctgt tcaaagatca aaaattagca acgcctacaa ttcgtaggat acatagcgtc
      241 acagtgcaca ccagtgattg tacaaacaac atcacaaagt tcatgttaaa gttgtccagg
      301 ttaaccacga atttgttcgt tactgtcatc aaaatcgagg acgcgcagta agataagatt
      361 gaagccggcc cagagttggc cactgattcc gtccattcat gcttatgctt gctcataatt
      421 accacacctt cttgatctct ttacagcttt tcaaccttcc attcttgtac tctatctcta
      481 cctggccctt taaacattct taatatgata tattcacatt ttttgctcta ttacccggcg
      541 gacaggttcc cgaaagaaag tgacatggca atgctgagag agtggaaaga gtcatcttgc
      601 aagacagaga aagatgttca agagtggtaa gctaatgtca gcgcagtagc ccatcccaca
      661 ataacagagt gtgtaagaaa gagagacgag agagagcaca gggccccctt ttcccccacc
      721 aacaaacaaa tgcaagacga ccccgaaaat tcgaagctgt acgacctgct gaatagtcat
      781 ctggacgtgc atggtcgaag taatgaagag ccgagacaaa ctggtgacag taggagccag
      841 agtagtggca acaccggtga aaacgaggag gatatagcat ttgccagtgg attaaacggc
      901 ggcacattcg actcaatgct ggaggcactg cccgatgatt tatattttac ggacttcgtg
      961 tctcctttta cagcagctgc cacgaccagc gtgactacta agacggtcaa ggacaccaca
     1021 ccagctacca atcatatgga tgatgatatt gcgatgtttg attcacttgc cacaactcag
     1081 cccatcgaca tagccgcatc caaccaacaa aatggtgaaa ttgcacaact ttgggacttt
     1141 aacgtggacc aattcaacat gacgcccagc aactcgagcg gttcagctac tattagtgct
     1201 cctaacagct ttacttccga cataccgcaa tacaaccacg gttccctcgg caacagcgtc
     1261 tccaaatcct cactgttccc gtataattcc agcacgtcca acagcaacat caaccagcca
     1321 tctatcaata acaactcaaa tactaatgcg cagtcccacc attccttcaa catctacaaa
     1381 ctacaaaaca acaactcatc ttcatccgct atgaacatta ccaataataa taatagcaac
     1441 aatagtaata tccagcatcc ttttctgaag aagagcgatt cgataggatt atcttcatcc
     1501 aacacaacaa attctgtaag aaaaaactca cttatcaagc caatgtcgtc cacgtccctg
     1561 gccaatttca aaagagctgc ctcagtatct tccagtatat ccaatatgga accatcagga
     1621 caaaataaaa aacctctgat acaatgtttc aattgtaaaa ctttcaagac accgctttgg
     1681 aggagaagcc cagaggggaa tactctttgc aatgcctgcg gtcttttcca gaaattacat
     1741 ggtaccatga ggccattatc cttaaaatcg gacgttatca aaaagaggat ttcaaagaag
     1801 agagccaaac aaacggaccc aaacattgca caaaatactc caagtgcacc tgcaactgcc
     1861 tcaacttcag taaccactac aaatgctaaa cccatacgat cgaggaaaaa atcactacaa
     1921 caaaactctt tatctagagt gatacctgaa gaaatcatta gagacaacat cggtaatact
     1981 aataatatcc ttaatgtaaa taggggaggc tataacttca actcagtccc ctccccggtc
     2041 ctcatgaaca gccaatcgta taatagtagt aacgcaaatt ttaatggagc aagcaatgca
     2101 aatttgaatt ctaataactt aatgcgtcac aattcgaaca ctgttactgg taattttaga
     2161 aggtcttcaa gacgaagtag tacttcatcg aacacctcaa gttccagtaa atcttcatcc
     2221 agatctgttg ttccgatatt accaaaacct tcacctaata gcgctaattc acagcagttc
     2281 aacatgaaca tgaacctaat gaacacaaca aataatgtaa gtgcaggaaa tagtgtcgca
     2341 tcctcaccaa gaattatatc gtccgcaaac tttaactcaa atagtcctct acagcagaat
     2401 ctattatcaa attctttcca acgtcaagga atgaatatac caagaagaaa gatgtcgcgc
     2461 aatgcatcgt actcctcatc gtttatggct gcgtctttgc aacaactgca cgaacagcaa
     2521 caagtggacg tgaattccaa cacaaacacg aattcgaata gacagaattg gaattcaagc
     2581 aatagcgttt caacaaattc aagatcatca aattttgtct ctcaaaagcc aaattttgat
     2641 atttttaata ctcctgtaga ttcaccgagt gtctcaagac cttcttcaag aaaatcacat
     2701 acctcattgt tatcacaaca attgcagaac tcggagtcga attcgtttat ctcaaatcac
     2761 aaatttaaca atagattatc aagtgactct acttcaccta taaaatatga agcagatgtg
     2821 agtgcaggcg gaaagatcag tgaggataat tccacaaaag gatcttctaa agaaagttca
     2881 gcaattgctg acgaattgga ttggttaaaa tttggtatat gaccgcgtat tatcattatc
     2941 attattctta ttatgttaat aattactgaa cggttgcatt gatagatttt cattacctct
     3001 gaccacaatc ctgagcattg g
//
LOCUS       BLYHISH3PA    505 bp ss-mRNA            PLN       26-JUL-1990
DEFINITION  Barley histone H3 mRNA, 3' end.
ACCESSION   M34928
KEYWORDS    histone H3 protein.
SOURCE      Barley (strain Nudinka) seed scutella 2 days after germination,
            cDNA to mRNA.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 505)
  AUTHORS   Chojecki,J.
  TITLE     Identification and characterization of a cDNA clone for histone H3
            in barley
  JOURNAL   Carlsberg Res. Commun. 51, 211-217 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      243     histone H3 protein
    mRNA    <     1      505     histone H3 mRNA
BASE COUNT       95 a    138 c    146 g    126 t
ORIGIN      
        1 aagagcaccg agctgctgat ccgcaagctc ccgttccagc gcctggtgag ggagatcgcg
       61 caggacttca agaccgacct caggttccag tcccacgccg tgctggccct ccaggaggcc
      121 gccgaggcgt acctcgtcgg gctgttcgag gacaccaacc tgtgcgccat ccacgccaag
      181 cgcgtcacca tcatgcccaa ggacatccag ctcgcccgcc gcatccgcgg ggagcgcgcc
      241 taagccaccc agagcgctgc attcgggagc gatgacaccg ttcgccagca ttagtgtagt
      301 tgattggctt tccttgtcca gatatgcgtc ttgtggttcg ttgtagaaac cctggttggt
      361 tggttcccgt agttacagag acttttctgc ttaagtggtt ttggtttgcg gtgttgcaaa
      421 ccgatgctta ctgtgatgca aattgttggt taatgtagtg ttgattgaca attatcgatg
      481 gatgaacttg tggtgttgcg tagtt
//
LOCUS       BMOFIBA       324 bp ss-mRNA            INV       26-JUL-1990
DEFINITION  B.mori silk fibroin mRNA, partial cds.
ACCESSION   M35378
KEYWORDS    fibroin.
SOURCE      B.mori (Kinryu x Showa) posterior silk gland, cDNA to mRNA.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Mita,K., Ichimura,S., Zama,M. and James,T.C.
  TITLE     Specific codon usage pattern and its implications on the secondary
            structure of silk fibroin mRNA
  JOURNAL   J. Mol. Biol. 203, 917-925 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   324     silk fibroin (AA at 1)
BASE COUNT       35 a     60 c    144 g     85 t
ORIGIN      
        1 ggatacggag caggagctgg aagcggagct gcctctggtg ccggtgccgg ttcaggtgct
       61 ggtgctggtt caggagctgg tgctggttca ggtgctggtg ctggttcagg tgctggtgct
      121 ggttcaggtg ctggtgctgg ttcaggagct ggtgctggtt caggtgctgg tgctggttca
      181 ggagctggtg ctggatacgg agcaggagct ggcgttggat acggagcagg agctgggagc
      241 ggagctgcct ctggtgctgg tgctggttca ggtgctggtg ctggttcagg tgctggtgct
      301 ggttcaggtg ctggtgctgg ttca
//
LOCUS       DROMETA       338 bp ss-mRNA            INV       26-JUL-1990
DEFINITION  D.melanogaster metallothionein (MT) mRNA, complete cds.
ACCESSION   M35390
KEYWORDS    metallothionein.
SOURCE      D.melanogaster larva, cDNA to mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 338)
  AUTHORS   Maroni,G., Lastowski-Perry,D., Otto,E. and Watson,D.
  TITLE     Effects of heavy metals on Drosophila larvae and a metallothionein
            cDNA
  JOURNAL   Environ. Health Perspect. 65, 107-116 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        124      246     metallothionein
    mRNA    <     1      338     metallothionein mRNA
    signal      308      313     polyA signal
BASE COUNT      101 a     88 c     77 g     72 t
ORIGIN      
        1 gatcagttgt ggtcagcagc aaaatcaagt gaatcatctc agtgcaacta aaggcctaaa
       61 tagcccatac ctaccttttt tgtaaacaag tgaacaagtt cgaggaaata caactcaatc
      121 aagatgcctt gcccatgcgg aagcggatgc aaatgcgcca gccaggccac caagggatcc
      181 tgcaactgcg gatctgactg caagtgcggc ggcgacaaga aatccgcctg cggctgctcc
      241 gagtgagctt tcccccaaaa aagatctgga gtagaggcgc tgcatcttgt ctctctacac
      301 accctgcaat aaatgtccaa ttaaagtaat tgatgcct
//
LOCUS       HUMVPREBA     503 bp ds-DNA             PRI       26-JUL-1990
DEFINITION  Human pre-B lymphocyte VpreB gene, 5' end.
ACCESSION   M34927
KEYWORDS    .
SOURCE      Human myeloid cell line U937 DNA, clone pHVPB-6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 503)
  AUTHORS   Bauer,S.R., Kudo,A. and Melchers,F.
  TITLE     Structure and pre-B lymphocyte restricted expression of the VpreB
            gene in humans and conservation of its structure in other mammalian
            species
  JOURNAL   EMBO J. 7, 111-116 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1       46     VpreB protein precursor, exon 1
                133  >   503     VpreB protein precursor, exon 2
    sigp          1       46     VpreB protein signal peptide
                133      143     VpreB protein signal peptide
    matp        144  >   503     VpreB protein
    IVS          47      132     VpreB intron A
BASE COUNT      104 a    160 c    140 g     99 t
ORIGIN      
        1 atgtcctggg ctcctgtcct gctcatgcac tttgtctact gcacaggtga gggaaccccc
       61 agatcccaaa gactcctgcc ccttccttca tcctgccctg cccccacggg ccacatgcat
      121 ctgtgtcacc aggttgtggt cctcagccgg tgctacatca gccgccggcc atgtcctcgg
      181 cccttggaac cacaatccgc ctcacctgca ccctgaggaa cgaccatgac atcggtgtgt
      241 acagcgtcta ctggtaccag cagaggccgg gccaccctcc caggttcctg ctgagatatt
      301 tctcacaatc agacaagagc cagggccccc aggtcccccc tcgcttctct ggatccaaag
      361 atgtggccag gaacaggggg tatttgagca tctctgagct gcagcctgag gacgaggcta
      421 tgtattactg tgctatgggg gcccgcagct cggagaagga ggagagggag agggagtggg
      481 aggaagaaat ggaacccact gca
//
LOCUS       MUSNGF       1176 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Mouse nerve growth factor (NGF) precursor mRNA, complete cds.
ACCESSION   M35075 J00608
KEYWORDS    nerve growth factor.
SOURCE      Mouse male submaxillary gland, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1176)
  AUTHORS   Scott,J., Selby,M., Urdea,M., Quiroga,M., Bell,G.I. and Rutter,W.J.
  TITLE     Isolation and nucleotide sequence of a cDNA encoding the precursor
            of mouse nerve growth factor
  JOURNAL   Nature 302, 538-540 (1983)
  STANDARD  simple staff_review
REFERENCE   2  (bases 3 to 226)
  AUTHORS   Edwards,R.H., Selby,M.J. and Rutter,W.J.
  TITLE     Differential RNA splicing predicts two distinct nerve growth factor
            precursors
  JOURNAL   Nature 319, 784-787 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         96     1019     nerve growth factor precursor
    sigp         96      656     nerve growth factor signal peptide
    matp        657     1010     nerve growth factor
    mRNA          1     1176     NGF mRNA
BASE COUNT      283 a    330 c    295 g    268 t
ORIGIN      
        1 gagcgcctgg agccggaggg gagcgcatcg agtgactttg gagctggcct tatatttgga
       61 tctcccgggc agctttttgg aaactcctag tgaacatgct gtgcctcaag ccagtgaaat
      121 taggctccct ggaggtggga cacgggcagc atggtggagt tttggcctgt ggtcgtgcag
      181 tccagggggc tggatggcat gctggaccca agctcacctc agtgtctggg cccaataaag
      241 gttttgccaa ggacgcagct ttctatactg gccgcagtga ggtgcatagc gtaatgtcca
      301 tgttgttcta cactctgatc actgcgtttt tgatcggcgt acaggcagaa ccgtacacag
      361 atagcaatgt cccagaagga gactctgtcc ctgaagccca ctggactaaa cttcagcatt
      421 cccttgacac agccctccgc agagcccgca gtgcccctac tgcaccaata gctgcccgag
      481 tgacagggca gacccgcaac atcactgtag accccagact gtttaagaaa cggagactcc
      541 actcaccccg tgtgctgttc agcacccagc ctccacccac ctcttcagac actctggatc
      601 tagacttcca ggcccatggt acaatccctt tcaacaggac tcaccggagc aagcgctcat
      661 ccacccaccc agtcttccac atgggggagt tctcagtgtg tgacagtgtc agtgtgtggg
      721 ttggagataa gaccacagcc acagacatca agggcaagga ggtgacagtg ctggccgagg
      781 tgaacattaa caacagtgta ttcagacagt acttttttga gaccaagtgc cgagcctcca
      841 atcctgttga gagtgggtgc cggggcatcg actccaaaca ctggaactca tactgcacca
      901 cgactcacac cttcgtcaag gcgttgacaa cagatgagaa gcaggctgcc tggaggttca
      961 tccggataga cacagcctgt gtgtgtgtgc tcagcaggaa ggctacaaga agaggctgac
     1021 ttgcctgcag cccccttccc cacctgcccc ctccacactc tcttgggccc ctccctacct
     1081 cagcctgtaa attattttaa attataagga ctgcatgata atttatcgtt tatacaattt
     1141 taaagacatt atttattaaa ttttcaaagc atcctg
//
LOCUS       RATXDHA      4162 bp ss-mRNA            ROD       26-JUL-1990
DEFINITION  Rat xanthine dehydrogenase mRNA, complete cds.
ACCESSION   J05579
KEYWORDS    xanthine dehydrogenase.
SOURCE      Rat (strain Wistar) liver, cDNA to mRNA, clones
            lambda-RXD[7,32,42,51].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4162)
  AUTHORS   Amaya,Y., Yamazaki K,-i., Sato,M., Noda,K., Nishino,T. and
            Nishino,T.
  TITLE     Proteolytic conversion of xanthine dehydrogenase from the NAD-
            dependent type to the oxygen-dependent type: Amino acid sequence of
            rat liver xanthine dehydrogenase and identification of the cleavage
            sites of the enzyme protein during irreversible conversion by
            trypsin
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            Y.Amaya, 22-JUN-1990.
FEATURES       from  to/span     description
    pept         27     3986     xanthine dehydrogenase (EC 1.1.1.204)
    signal     4125     4130     polyA signal (put.)
    signal     4146     4151     polyA signal (put.)
BASE COUNT     1055 a   1046 c   1121 g    940 t
ORIGIN      
        1 agagctcagt gactccagca gccacgatga ctgcggatga gttggtcttc tttgtgaatg
       61 gcaaaaaggt ggtggagaaa aatgcggacc ctgaaacaac acttctggtc tacctgagaa
      121 gaaagttggg gctatgtggg accaagcttg gctgtggaga aggtggctgt ggggcatgca
      181 ccgtgatgat ctccaagtat gaccgtcttc agaacaagat tgttcatttt tctgtcaatg
      241 cctgcttggc tcccatctgc tccttgcacc atgttgctgt gaccaccgtg gaaggcatag
      301 gaaacaccca gaagctgcat cctgtacagg agagaattgc cagaagccat ggttcccagt
      361 gtgggttctg cactcctggc attgtcatga gtatgtacac actgctccgg aaccagcctg
      421 agcctactgt tgaggagatc gagaatgcct tccaaggaaa cctctgtcgc tgtacaggct
      481 acagacccat cctccaggga ttccggacct ttgccaagga tggtgggtgc tgtggaggga
      541 gtggaaacaa cccaaactgc tgtatgaacc agacgaaaga ccaaacggtt tctctctcac
      601 cttctttatt caacccagag gatttcaaac ctttagatcc cacgcaagag cccatcttcc
      661 ccccagagtt gctgaggctg aaagacactc cccagaagaa gctgcgtttt gaaggggaac
      721 gtgtgacctg gatccaggct tcaactatgg aggagctgct tgacctgaaa gctcagcacc
      781 ctgatgccaa gctggtggtg ggaaacacag agataggcat tgaaatgaaa tttaagaata
      841 tgctatttcc tctgatcgtc tgcccagcct ggatccctga actgaattca gtggtgcatg
      901 ggcctgaggg aatctccttc ggagcttctt gcccccttag cttggtggaa agtgtcctgg
      961 cggaggagat tgctaaactt ccagagcaaa agacagaggt gttcagaggc gtgatggagc
     1021 agctgcgctg gtttgccggc aagcaggtca agtccgtggc gtccatcgga gggaacatca
     1081 tcactgccag ccccatctct gacctcaacc ctgtgttcat ggccagtgga gccaagctga
     1141 ctctggtgtc tagaggtacc aggagaactg ttcggatgga tcataccttc ttccctggct
     1201 acagaaagac tctgctcaga ccagaggaga tattgctgtc catcgagatc ccctatagca
     1261 aggagggaga gtttttctca gccttcaagc aggcctccag gagggaagat gacattgcca
     1321 aggtgactag tggcatgaga gtcctgttca aaccggggac cattgaagtg caggaactgt
     1381 ccctttgctt cggagggatg gccgacagaa ctatctcagc cctcaagacc actccgaagc
     1441 agctatcgaa gtcctggaat gaggagctgc agctggcccc cgatgcccct ggtggtatgg
     1501 tggaattccg gcgcaccctc accctcagct tcttcttcaa gttctacctg acagtgctcc
     1561 agaagctggg cagagcggac cttgaggata tgtgtggtaa actggacccc acctttgcca
     1621 gtgccaccct gctctttcag aaggaccctc cagctaatgt ccagcttttc caagaggtgc
     1681 caaaggatca gtctgaggag gacatggtgg gccggcccct gcctcacctg gcggcaaaca
     1741 tgcaggcatc gggagaggcc gtgtactgtg atgacattcc ccgctatgag aatgagctct
     1801 ctctcaggct ggtcaccagc acccgggcgc atgctaaaat cacgtccatc gacacttcag
     1861 aagccaagaa ggtgccaggg tttgtttgct tcctcaccgc agaggatgtc cctaatagta
     1921 atgcaaccgg ccttttcaat gatgaaactg tctttgcgaa ggatgaggtt acttgtgttg
     1981 ggcacatcat tggtgctgtg gtcgctgaca ccccagaaca cgcacagaga gctgcgagag
     2041 gggtgaaaat cacctatgaa gatcttccag ccattatcac aatccaggat gctataaaca
     2101 acaactcctt ttatggctct gagataaaaa ttgagaaagg agatctcaag aaaggctttt
     2161 cagaagctga caatgttgtc tcaggagagt tgtatatcgg tggccaggag cacttctacc
     2221 tggagaccaa ctgcaccatt gccgtgccaa aaggcgaggc aggcgagatg gagctgttcg
     2281 tgagcacaca gaacaccatg aaaacccaga gctttgttgc aaaaatgttg ggcgttccgg
     2341 acaacagaat cgtagtccga gtgaagagga tgggtggagg ctttggaggg aaggagaccc
     2401 ggagcactgt ggtgtccaca gcactggcct tggctgcaca caagactggc cggcccgtac
     2461 gttgcatgtt ggaccgagat gaggacatgc tgataactgg tggcagacat cccttcctgg
     2521 ctaaatacaa ggttggcttc atgaagactg ggactgtagt ggctctcgag gtggctcact
     2581 tcagcaatgg tggtaacact gaggatctct ctcggagtat aatggaacga gctttgttcc
     2641 acatggataa cgcctataag atccccaaca ttcgaggcac tgggaggatt tgcaagacta
     2701 atctgccctc caacacagcc ttcagaggtt ttgggggtcc tcaggggatg ctaatcgcag
     2761 aatactggat gagcgaggtc gccataacct gtgggctgcc tgcagaggag gtacggagga
     2821 aaaacatgta caaagaaggg gacctgactc acttcaacca gaagctggag gggttcacct
     2881 tgcccaggtg ctgggatgaa tgcatcgcca gctctcagta tcttgctcgc aagagggaag
     2941 tggagaaatt caacagggag aattgttgga aaaagagagg gctgtgtata atcccaacta
     3001 agtttggaat aagctttaca cttccttttc tgaaccaggg aggcgctctg gttcacgtgt
     3061 acactgatgg ttcggtgctg ttgacccatg gagggactga gatgggccaa ggccttcaca
     3121 ccaagatggt tcaggtggcc agcagagctc tgaaaatccc cacctccaag attcatataa
     3181 gtgagacaag cactaacacc gtccccaaca cttctcccac agctgcctct gccagtgctg
     3241 acctcaatgg acagggtgtt tatgaagcat gccagaccat actgaaaagg ctggaacctt
     3301 tcaagaagaa gaaacccacc ggcccctggg aggcatgggt gatggacgcc tatacgagcg
     3361 cagtgagttt gtccgcaact ggattttata agacacccaa ccttggctac agctttgaga
     3421 caaactccgg aaatcccttc cactatttca gttatggggt ggcttgctct gaagtagaaa
     3481 ttgactgctt aacaggggat cataagaatc tccgtacgga tatcgtcatg gatgttggtt
     3541 ccagcttgaa tcctgccatt gatattggac aagtagaggg ggcatttgtc cagggccttg
     3601 gtctcttcac tatggaggag ctgcactact cccctgaggg gagcctgcat actcgtggcc
     3661 ccagtaccta caaaatccct gcatttggta gcatccccat tgagttcaga gtatccctac
     3721 tccgggactg ccccaacaag agggccatct atgcatccaa ggctgttggg gagccacctc
     3781 ttttcctggc ttcctctatc ttctttgcca tcaaagatgc cattcgtgca gctcgagctc
     3841 agcacggaga taacgcaaaa caacttttcc agctagacag ccctgccact ccggagaaga
     3901 tccgaaacgc ctgtgtggac cagttcacca ccctgtgtgt cactggagta ccagaaaact
     3961 gtaaatcctg gtctgtgagg atctgaagag aaggtctcca ccattggttt gtaccgcacc
     4021 aggattcctt ggagccacaa gcacatcctg tagtatccag atttccgcat gccgcgtggg
     4081 actcagcagg atgacatttt caggaagatg gacattttga cccaaataag agctgcaaac
     4141 aaaccaataa gcaaatgggg ag
//
LOCUS       RICHISH2AA    321 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  Rice histone H2A gene, 5' end.
ACCESSION   M35379
KEYWORDS    histone.
SOURCE      Rice DNA, clone pIR22.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Thomas,G. and Padayatty,J.D.
  TITLE     Restriction map and partial sequence of a rice DNA fragment
            carrying histone genes H2A, H2B and H4
  JOURNAL   Indian J Biochem Biophys 21, 1-6 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        260  >   321     histone H2A protein
    mRNA        186  >   321     histone H2A mRNA
    signal       36       40     CAAT box
    signal       74       77     GATCC motif
    signal      138      145     TATA box
BASE COUNT       74 a     70 c     60 g     73 t     44 others
ORIGIN      
        1 caaaggacnt gttcccgctg atgtgagcaa ttgtcacaat gccctcccaa acngttttca
       61 gatngtngat gtggatcnnn antttnttgc gnntnnanac ctggctctcg ttttttcgca
      121 angtcccgaa cnnnnngtat aaatagcgtg tggacccgta ncgtgagaac tcgtgatctn
      181 atttcatctg gaacgactcn nggaatnttc cgaaaannnn nnnnnnnnng ccgaaagcct
      241 tttggaactt ttcnnccaaa tgcacaccaa aggcctcngg aagnnttttc ancgcaaaaa
      301 gatatcaccc gcagggatca c
//
LOCUS       TEYMT14SRR    169 bp ds-DNA             ORG       26-JUL-1990
DEFINITION  T.pyriformis mitochondrial 14S rRNA.
ACCESSION   M35376
KEYWORDS    14S ribosomal RNA.
SOURCE      T.pyriformis (strain ST) linear mitochondrial DNA.
  ORGANISM  Mitochondrion Tetrahymena pyriformis
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae; 
            Tetrahymena pyriformis.
REFERENCE   1  (bases 1 to 169)
  AUTHORS   Suyama,Y., Fukuhara,H. and Sor,F.
  TITLE     A fine restriction map of the linear mitochondrial DNA of
            Tetrahyemena pyriformis: Genome size, map locations of rRNA and
            tRNA genes, terminal inversion repeat, and restriction site
            polymorphism
  JOURNAL   Curr. Genet. 9, 479-493 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >   169     14S rRNA
    site         31      144     conserved U5 region
BASE COUNT       54 a     26 c     37 g     52 t
ORIGIN      
        1 gaattcagaa tagctaacgc aaagtattct gcttggggag tattatcgca agattaaaac
       61 ttaactgaat tggcgggaat ttgttcgaac ggtggaacat gtggtttaat gcgataatcc
      121 acgcaaaatc ttaccaacgt tttaggcttt atctgataat atggttaac
//
LOCUS       YSCPET122    2862 bp ds-DNA             PLN       26-JUL-1990
DEFINITION  Yeast PET122 encoded protein gene, complete cds.
ACCESSION   X07558
KEYWORDS    PET122 encoded protein.
SOURCE      Yeast (S.cerevisiae, strain AB320) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2862)
  AUTHORS   Ohmen,J.D., Burke,K.A. and McEwen,J.E.
  TITLE     Divergent overlapping transcripts at the PET122 locus in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 3027-3035 (1990)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 953 to 2862)
  AUTHORS   Ohmen,J.D., Kloeckener-Gruissem,B. and McEwen,J.E.
  TITLE     Molecular cloning and nucleotide sequence of the nuclear PET122
            gene required for expression of the mitochondrial COX3 gene in
            S.cerevisiae
  JOURNAL   Nucleic Acids Res. 16, 10783-10862 (1988)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.D.Ohmen, 11-JUN-1990.
            
            Draft entry and computer-readable sequence for [1] submitted to
            EMBL by J.D.Ohmen, 09-JUN-1989.
            
            EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE        746   1096       similarity to E.coli alanyl
                                            tRNA-synthetase (AA 116-232) [1]
            
            Author address: McEwan J.E.
                            Department of Microbiology
                            College of Letters and Science
                            University of California
                            405 Hilgard Avenue
                            Los Angeles, CA 90024-1489
FEATURES       from  to/span     description
    pept       1139  <     1 (c) ORF3
    pept       1354     2118     PET122 protein
    pept       2171  >  2862     ORF2
    mRNA       1144  <     1 (c) ORF3 mRNA
    mRNA       1152  <     1 (c) ORF3 mRNA
    mRNA       1157  <     1 (c) ORF3 mRNA
    mRNA       1159  <     1 (c) ORF3 mRNA
    mRNA       1870  <     1 (c) ORF3 mRNA
    mRNA       1875  <     1 (c) ORF3 mRNA
    mRNA       1882  <     1 (c) ORF3 mRNA
    mRNA       1883  <     1 (c) ORF3 mRNA
    mRNA       1887  <     1 (c) ORF3 mRNA
    mRNA       1895  <     1 (c) ORF3 mRNA
    mRNA       1907  <     1 (c) ORF3 mRNA
    mRNA       1343  >  2119     PET122 mRNA
    mRNA       1348  >  2119     PET122 mRNA
    mRNA       1354  >  2119     PET122 mRNA
    mRNA       2140  >  2862     ORF2 mRNA
    mRNA       2143  >  2862     ORF2 mRNA
    mRNA       2147  >  2862     ORF2 mRNA
    mRNA       2150  >  2862     ORF2 mRNA
BASE COUNT      748 a    652 c    795 g    667 t
ORIGIN      
        1 aagctttctt gtaacttctt ctcattatct tgcatcaatt gccttctttc cgcctgatct
       61 cttgcctttt gaatgttatg ttttaatgat tggaagatgc ccatgttctc tgtgggggaa
      121 gcgccagcga taggagtcct tggtttagct acttctgtta tcttcagttt cgaacgaacc
      181 catttgtttc tcaaaatcat tgtctgtagg acggagaagg caccattaaa ggcaaagtag
      241 aggaccacag cggacgataa gttcattgtg gccggtatag aaatgatcgg tagaatagtg
      301 aaaagacgct tcatgggaga actgaattgt tgagcaccag tctcaccccc cagccttgta
      361 aatgagatga acacagcggc agtgattact tgcaaaccta agtaagggtc tgcttgagtc
      421 aagtctgtaa accaagcgac accttgatta gcgaacccat ctactgggta gttagccatg
      481 tgtctcaatg cgttgaaaaa cccaagggcg attggaattt gtagcatggg tgcggccagc
      541 catctgttct taatgccgtg cgaggagagc agttttttcc tttgcatggc gactagctga
      601 ccttgttgca aatctgtagt ggacattagc ttattattca aggcgtccag ctcgggcttg
      661 atatgggaat ttctagcaac agtatcagag gacttgacat agaggggaaa catcaggcat
      721 cgaatgagga tggtggtggc cgcgatagtt ccccaccaag gcaacccaga gtaaacatga
      781 acggcctcca agacgtgttg gataatgtcc gagggccagt accaggtttg ggccaggcca
      841 atgctattta agtaccctat atgggaggac aactcgccca ctgtttgggt cgtgttagcg
      901 ataaggtccg aagtagaagc ggaaagagaa ggagctgaag aggttaattc atcgatggaa
      961 ggcaactggg tttggatttc cgagacatcg ttggcatttg ggcccgtcga attaaatctt
     1021 ttggcctgaa aagagatcca tgacggatgg ggccggggca atactatggt tcgagcggtg
     1081 gccagtctgg aagaggcagc aaaccttgac gtgacgagtc gagaggtgag tttgaacatc
     1141 gtcggggagg ttattctgtg gctccgcttg tacgtgaaca gatacgtata gagggcgagc
     1201 cactggttaa atttttcatg gctcggatta cttccgtact gctggctaaa atcgaaatct
     1261 cggcctgctg agagtgtttt gagcaatcaa gggaacatct gaacgtggaa gagcagacga
     1321 ggcattagct cgaacataag aacggaacac gtcatgttga ctatcacgaa aagactggtg
     1381 accaccgatg tgcggtcgcg aatactgtta agcagtttaa acgggaaaat gtccgatgca
     1441 ctggcgctgc tgcgtcagca gcagcagacc agcgtggatg tggagctgct gcacacgatg
     1501 ctagcgcgag ccgctgcgct tgcccatgcc gacactatag catacatgtg gtatcagcat
     1561 gtgatgccac gccggttgcc agtagagggc cgcctgctat gtgaaatggc tggcgtagca
     1621 ttgtaccagg acaggctctt cttacccgcg cagttcctcc agcactacca ggcgatgaat
     1681 cgcgatcgtc gcaccagccc agaagatgaa ctgattgagt atgagcttag acggattaaa
     1741 gtcgaagcgt ttgcgcgtgg cacaatgcac tccacggcgc tcagggaaaa gtggaaggta
     1801 ttcttgcagg agatggatac gctaccaggg cagccgccat taaggctgcg cgacttcccg
     1861 caaatgacca aggctatggg catagcattg atgcagcaag atgagcaagc agctgccctg
     1921 gcgttgtttg gacgacagcc cctagtgata aagaacgaat ggtcactacc gctactactg
     1981 gctggtgtcc tttggcatgt tcccggccca gcgcaggcgc gacgtgtgct ggcggagttc
     2041 cgtcaaagtt atcgcgggct gccgctgctg gatgccgaac tagtgataaa gagaagagga
     2101 tttgaaatca acacataaat ctgggtggag catcgctgta acaaggaaca acgcgtgcta
     2161 gcaagcggta atgaaataca aggaaatcaa tttcttcaag ggccatccga gctcgaggtt
     2221 gctgcctcga gaagcagtaa ttcaagcgac tgcggctata ttggggcccg agaccaggga
     2281 gtacgataac gacccctata acaggcatcc gctgacgtac ggttcggacg aaggtgccct
     2341 gtgggtgcga gagcagattt gtacgtttct gaatgatcag ctgtttaagt tcgaaaatgg
     2401 ggctcggagc aggacacggg cagactattt gaatctgaat agcggcgctt cgtatggcat
     2461 gctgaacatc cttctgcaaa caaccttgcc acataacggg tataccaggc aggcgttcat
     2521 catcacgcca acatatttct tgatcaacaa ttgcttcaca gatgcgggat tcaaggggaa
     2581 aatgaccgcc atcaacgagc agggccacga ctcgattgat ttcgagtcgt tgatttctgc
     2641 ccttgagcag cacgaggcgg agccgcagcc ccatagtacc acagagatga ttcaggggcc
     2701 aaagttgacc aagaaggtct acaggtacgt tatgtactgc atcccgacgt ttgcaaaccc
     2761 atcgggaaac acatactcgc ttgagaccag acgcagactt atcgacatcg ctcggaagta
     2821 cgacatgctg ataatcactg atgacgtgta cgatattcta ga
//
LOCUS       ECO987P       954 bp ds-DNA             BCT       26-JUL-1990
DEFINITION  E.coli fimbriae 987P subunit gene, complete cds.
ACCESSION   M35257
KEYWORDS    fimbriae.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 954)
  AUTHORS   De Graaf,F.K. and Klaasen,P.
  TITLE     Nucleotide sequence of the gene encoding the 987P fimbrial subunit
            of Escherichia coli
  JOURNAL   FEMS Microbiol. Lett. 42, 253-258 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        259      843     fimbriae 987P subunit precursor
    sigp        259      328     fimbriae 987P subunit signal peptide
    matp        329      840     fimbriae 987P subunit
BASE COUNT      309 a    170 c    180 g    295 t
ORIGIN      
        1 aaatttagaa aagtgcatta tgcttatcac tagataagaa aataaaacac gaaatatagc
       61 gagccatata gcctgttgtg tttgtaatag ataaaaaaca cgcaattgat tatttatgta
      121 tctttttgtt tgtatttttt tattaaaaaa agcacacaat tactgcgtgc atcgaaatga
      181 gttgaagtgg atgcatatat gcatgaaatg cttttaactt gaaagtctta atgtttctat
      241 taattaagat aaggtaatat gagaatgaaa aaatccgcat taacattagc agtgctttcc
      301 tctctgttca gtggttactc gctcgcagcg cccgctgaaa acaacaccag ccaggcaaat
      361 ttagacttta ctggtaaagt tactgccagt ctatgccaag tggatacttc taatctgtcg
      421 caaaccatag atcttggaga gttgtctact tctgctctta aagctactgg caaggggcct
      481 gccaagtcat ttgcagttaa tcttatcaac tgcgatacaa cattgaattc tattaaatac
      541 actattgctg gtaataataa tacaggaagt gatactaaat atttagttcc agcctccaat
      601 gatactagtg catcaggagt tggcgtatac attcaggaca acaacgccca ggctgtggaa
      661 attggtactg aaaaaactgt acctgtggta tcaaatggcg gattagctct ttcagaccaa
      721 agtattccac tgcaagcata catcggaacc accacaggga atcctgatac aaacggtgga
      781 gttacggccg gtactgtcac tgctagtgca gtaatgacta ttcgttcagc aggtacaccg
      841 taattagata acaattttta tacaacaaaa caggaaggat tttgaactaa tccttcctgt
      901 tattggagat tgaaatgtct aagtttgtaa tatttcttgt gtttttgttt atat
//

GenBank-Updates@genbank.bio.net (07/27/90)

LOCUS       HUMHPRTB    56736 bp ds-DNA             PRI       27-JUL-1990
DEFINITION  Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete
            cds.
ACCESSION   M26434
KEYWORDS    Alu repetitive element; LINE repetitive element;
            hypoxanthine phosphoribosyltransferase.
SOURCE      Human cell line 4X,Y (GM1202), fetal liver, and beta thalassemia
            patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 56736)
  AUTHORS   Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J.,
            Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W.
  TITLE     Automated DNA sequencing of the human HPRT locus
  JOURNAL   Genomics 6, 593-608 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (sites; mutations causing gout)
  AUTHORS   Argos,P., Hanei,M., Wilson,J.M. and Kelley,W.N.
  TITLE     A possible nucleotide-binding domain in the tertiary fold of
            phosphoribosyltransferase
  JOURNAL   J. Biol. Chem. 258, 6450-6457 (1983)
  STANDARD  full staff_entry
REFERENCE   3  (sites; mutations causing gout)
  AUTHORS   Cariello,N.F., Scott,J.K., Kat,A.G., Thilly,W.G. and Keohavong,P.
  TITLE     Resolution of a missense mutant in human genomic DNA by denaturing
            gradient gel electrophoresis and direct sequencing using in vitro
            DNA amplification: HPRT-Munich
  JOURNAL   Am. J. Hum. Genet. 42, 726-734 (1988)
  STANDARD  full staff_entry
REFERENCE   4  (sites; mutations causing gout)
  AUTHORS   Davidson,B.L., Chin,S.J., Wilson,J.M., Kelley,W.N. and Palella,T.D.
  TITLE     Hypoxanthine-guanine phosphoribosyltransferase. Genetic evidence
            for identical mutations in two partially deficient subjects
  JOURNAL   J. Clin. Invest. 82, 2164-2167 (1988)
  STANDARD  full staff_entry
REFERENCE   5  (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Davidson,B.L., Palella,T.D. and Kelly,W.N.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase: A single
            nucleotide substitution in cDNA clones isolated from a patient with
            Lesch-Nyhan syndrome (HPRT-Midland)
  JOURNAL   Gene 68, 85-91 (1988)
  STANDARD  full staff_entry
REFERENCE   6  (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D.
  TITLE     Genetic basis of hypoxanthine guanine phosphoribosyltransferase
            deficiency in a patient with Lesch-Nyhan syndrome (HPRT-Flint)
  JOURNAL   Gene 63, 331-336 (1988)
  STANDARD  full staff_entry
REFERENCE   7  (sites; mutations causing Lesch-Nyhan syndrome and gout)
  AUTHORS   Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase deficiency.
            The molecular defect in a patient with gout (HPRT-Ashville)
  JOURNAL   J. Biol. Chem. 264, 520-525 (1989)
  STANDARD  full staff_entry
REFERENCE   8  (sites; mutations causing gout)
  AUTHORS   Davidson,B.L., Tarle,S.A., Palella,T.D. and Kelley,W.N.
  TITLE     Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
            deficiency in ten subjects determined by direct sequencing of
            amplified transcripts
  JOURNAL   J. Clin. Invest. 84, 342-346 (1989)
  STANDARD  full staff_entry
REFERENCE   9  (sites; mutations causing gout)
  AUTHORS   Dush,M.K., Sikela,J.M., Khan,S.A., Tischfield,J.A. and
            Stambrook,P.J.
  TITLE     Nucleotide sequence and organization of the mouse adenine
            phosphoribosyltransferase gene: Presence of a coding region common
            to animal and bacterial phosphoribosyltransferases that has a
            variable intron/exon arrangement
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 2731-2735 (1985)
  STANDARD  full staff_entry
REFERENCE   10 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Fujimori,S., Davidson,B.L., Kelley,W.N. and Palella,T.D.
  TITLE     Identification of a single nucleotide change in the
            hypoxanthine-guanine phosphoribosyltransferase gene (HPRT-Yale)
            responsible for Lesch-Nyhan syndrome
  JOURNAL   J. Clin. Invest. 83, 11-13 (1989)
  STANDARD  full staff_entry
REFERENCE   11 (sites; mutations causing gout)
  AUTHORS   Fujimori,S., Hidaka,Y., Davidson,B.L., Palella,T.D. and Kelley,W.N.
  TITLE     Identification of a single nucleotide change in a mutant gene for
            hypoxanthene-guanine phosphoribosyltransferase (HPRT-Ann Arbor)
  JOURNAL   Hum. Genet. 79, 39-43 (1988)
  STANDARD  full staff_entry
REFERENCE   12 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Gibbs,R.A., Nguyen,P.N., Edwards,A., Civitello,A.B. and Caskey,C.T.
  TITLE     Multiplex DNA deletion detection and exon sequencing of the
            hypoxanthine phosphoribosyltransferase gene in Lesch-Nyhan families
  JOURNAL   Genomics 7, 235-244 (1990)
  STANDARD  full staff_entry
REFERENCE   13 (sites; mutations causing Lesch-Nyhan syndrome and gout)
  AUTHORS   Gibbs,R.A., Nguyen,P.N., McBride,L.J., Koepf,S.M. and Caskey,C.T.
  TITLE     Identification of mutations leading to the Lesch-Nyhan syndrome by
            automated direct DNA sequencing of in vitro amplified cDNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1919-1923 (1989)
  STANDARD  full staff_entry
REFERENCE   14 (sites; mutations causing gout)
  AUTHORS   Hershey,H.V. and Taylor,M.W.
  TITLE     Nucleotide sequence and deduced amino acid sequence of Escherichia
            coli adenine phosphoribosyltransferase and comparison with other
            analogous enzymes
  JOURNAL   Gene 43, 287-293 (1986)
  STANDARD  full staff_entry
REFERENCE   15 (sites; conserved domain)
  AUTHORS   King,A. and Melton,D.W.
  TITLE     Characterisation of cDNA clones for hypoxanthine-guanine
            phosphoribosyltransferase from the human malarial parasite,
            Plasmodium falciparum: Comparisons to the mammalian gene and
            protein
  JOURNAL   Nucleic Acids Res. 15, 10469-10481 (1987)
  STANDARD  full staff_entry
REFERENCE   16 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Ogasawara,N., Stout,J.T., Goto,H., Sonta,S.I., Matsumoto,A. and
            Caskey,C.T.
  TITLE     Molecular analysis of a female Lesch-Nyhan patient
  JOURNAL   J. Clin. Invest. 84, 1024-1027 (1989)
  STANDARD  full staff_entry
REFERENCE   17 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Stout,J.T. and Caskey,C.T.
  TITLE     HPRT: Gene structure, expression, and mutation
  JOURNAL   Annu. Rev. Genet. 19, 127-148 (1985)
  STANDARD  full staff_entry
REFERENCE   18 (sites; Lesch-Nyhan mutation)
  AUTHORS   Wilson,J.M. and Kelley,W.N.
  TITLE     Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
            deficiency in a patient with the Lesch-Nyhan syndrome
  JOURNAL   J. Clin. Invest. 71, 1331-1335 (1983)
  STANDARD  full staff_entry
REFERENCE   19 (sites; mutations causing Lesch-Nyhan syndrome and gout)
  AUTHORS   Wilson,J.M. and Kelley,W.N.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase. Structural
            alteration in a dysfunctional enzyme variant (HPRT-Munich) isolated
            from a patient with gout
  JOURNAL   J. Biol. Chem. 259, 27-30 (1984)
  STANDARD  full staff_entry
REFERENCE   20 (sites; mutations causing gout)
  AUTHORS   Wilson,J.M., Kobayashi,R., Fox,I.H. and Kelley,W.N.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase. Molecular
            abnormality in a mutant form of the enzyme (HPRT-Toronto)
  JOURNAL   J. Biol. Chem. 258, 6458-6460 (1983)
  STANDARD  full staff_entry
REFERENCE   21 (sites; mutations causing gout)
  AUTHORS   Wilson,J.M., Tarr,G.E. and Kelley,W.N.
  TITLE     Human hypoxanthine (guanine) phosphoribosyltransferase: An amino
            acid substitution in a mutant form of the enzyme isolated from a
            patient with gout
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 870-873 (1983)
  STANDARD  full staff_entry
REFERENCE   22 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Yang,T.P., Patel,P.I., Chinault,A.C., Stout,J.T., Jackson,L.G.,
            Hildebrand,B.M. and Caskey,C.T.
  TITLE     Molecular evidence for new mutation at the hprt locus in
            Lesch-Nyhan patients
  JOURNAL   Nature 310, 412-414 (1984)
  STANDARD  full staff_entry
REFERENCE   23 (sites; mutant reversion)
  AUTHORS   Yang,T.P., Stout,J.T., Konecki,D.S., Patel,P.I., Alford,R.L. and
            Caskey,C.T.
  TITLE     Spontaneous reversion of novel Lesch-Nyhan mutation by HPRT gene
            rearrangement
  JOURNAL   Somat. Cell Mol. Genet. 14, 293-303 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Edwards, 26-JUL-1989.
            
               Mutant    Description
            
               RJK 1780  missing intron 1 (partial and exon 2 [12]
               RJK  849  missing intron 3 (partial) and exons 4-9 (no mRNA)
            [22]
               RJK  984  missing intron 5 (partial) and exons 6-9 (no mRNA)
            [17]
                         [22]
               GM  3467  missing intron 8 (partial) and exon 9 (no mRNA) [12]
                         [22]
               RJK  853  complete gene deletion [12] [22] [16]
               GM  2227  inversion of exons 6-9 (no mRNA) [23]
               GM  1662  and GM 6804 duplication of exons 2 and 3 and elongated
                         mRNA [22] [23]
               Connersville missing intron 7 (partial and exon 8 [7]
FEATURES       from  to/span     description
    pept       1677     1703     hypoxanthine phosphoribosyltransferase, exon 1
                                 /nomgen="HPRT" /map="Xq26"
                                 /hgml_locus_uid="LE0031K"
              14780    14886     hypoxanthine phosphoribosyltransferase, exon 2
              16603    16786     hypoxanthine phosphoribosyltransferase, exon 3
              27891    27956     hypoxanthine phosphoribosyltransferase, exon 4
              31617    31634     hypoxanthine phosphoribosyltransferase, exon 5
              34937    35019     hypoxanthine phosphoribosyltransferase, exon 6
              39815    39861     hypoxanthine phosphoribosyltransferase, exon 7
              40033    40109     hypoxanthine phosphoribosyltransferase, exon 8
              41454    41501     hypoxanthine phosphoribosyltransferase, exon 9
    IVS        1704    14779     HPRT intron A
    IVS       14887    16602     HPRT intron B
    IVS       16787    27890     HPRT intron C
    IVS       27957    31616     HPRT intron D
    IVS       31635    34936     HPRT intron E
    IVS       35020    39814     HPRT intron F
    IVS       39862    40032     HPRT intron G
    IVS       40110    41453     HPRT intron H
    rpt         258       41 (c) Alu repeat copy A
    rpt        1012      694 (c) Alu repeat copy B
    rpt        3532     3342 (c) Alu repeat copy C
    rpt        3982     3343 (c) Alu repeat copy D
    rpt        6232     5937 (c) Alu repeat copy E
    rpt        6524     6288 (c) Alu repeat copy F
    rpt        7504     7807     Alu repeat copy G
    rpt        8713     8408 (c) Alu repeat copy H
    rpt        9477     9169 (c) Alu repeat copy I
    rpt        9699    10142     Alu repeat copy J
    rpt       10407    10293 (c) Alu repeat copy K
    rpt       12154    12310     Alu repeat copy L
    rpt       13302    12998 (c) Alu repeat copy M
    rpt       13638    13314 (c) Alu repeat copy N
    rpt       14238    13923 (c) Alu repeat copy O
    rpt       14610    14239 (c) Alu repeat copy P
    rpt       16090    15843 (c) Alu repeat copy Q
    rpt       16518    16293 (c) Alu repeat copy R
    rpt       18783    18439 (c) Alu repeat copy S
    rpt       18962    19084     Alu repeat copy T
    rpt       19397    19104 (c) Alu repeat copy U
    rpt       20064    19778 (c) Alu repeat copy V
    rpt       20486    20636     Alu repeat copy W
    rpt       20990    20696 (c) Alu repeat copy X
    rpt       21293    20996 (c) Alu repeat copy Y
    rpt       22171    21878 (c) Alu repeat copy Z
    rpt       23772    23460 (c) Alu repeat copy AA
    rpt       25310    25426     Alu repeat copy AB
    rpt       26388    26279 (c) Alu repeat copy AC
    rpt       29863    29560 (c) Alu repeat copy AD
    rpt       33618    33148 (c) Alu repeat copy AE
    rpt       36575    36645     Alu repeat copy AF
    rpt       38613    39039     Alu repeat copy AG
    rpt       39091    39388     Alu repeat copy AH
    rpt       40622    40925     Alu repeat copy AI
    rpt       40952    41262     Alu repeat copy AJ
    rpt       42640    42936     Alu repeat copy AK
    rpt       44776    44486 (c) Alu repeat copy AL
    rpt       46627    46305 (c) Alu repeat copy AM
    rpt       47520    47763     Alu repeat copy AN
    rpt       49257    49551     Alu repeat copy AO
    rpt       49705    49993     Alu repeat copy AP
    rpt       50002    50312     Alu repeat copy AQ
    rpt       51066    50774 (c) Alu repeat copy AR
    rpt       52276    51978 (c) Alu repeat copy AS
    rpt       52407    52674     Alu repeat copy AT
    rpt       54653    54551 (c) Alu repeat copy AU
    rpt       55465    55164 (c) Alu repeat copy AV
    rpt       56523    56689     Alu repeat copy AW
    rpt       11312    11254 (c) LINE repeat copy A (partial)
    rpt       18284    18242 (c) LINE repeat copy B (partial)
    rpt       27091    26843 (c) LINE repeat copy C (partial)
    rpt       37558    37300 (c) LINE repeat copy D (partial)
    rpt       39451    39427 (c) LINE repeat copy E (partial)
    rpt       51317    51100 (c) LINE repeat copy F (partial)
    rpt         261      321     short interspersed repeat copy A
    rpt        1013     1054     short interspersed repeat copy B
    rpt        2902     2869 (c) short interspersed repeat copy C
    rpt        3078     3107     short interspersed repeat copy D
    rpt        4228     4254     short interspersed repeat copy E
    rpt        4463     4494     short interspersed repeat copy F
    rpt       21582    21635     short interspersed repeat copy G
    rpt       28430    28394 (c) short interspersed repeat copy H
    rpt       28923    28981     short interspersed repeat copy I
    rpt       35562    35604     short interspersed repeat copy J
    rpt       44042    44108     short interspersed repeat copy K
    rpt       46628    46688     short interspersed repeat copy L
    rpt       51715    51687 (c) short interspersed repeat copy M
    unsure    17479    17479     polymorphism or cloning error
    unsure    24903    24903     polymorphism or cloning error
    mut        1664     1678     ggccggctccgttat in wt; gt in gout RJK 951 [13]
    mut       14778    14778     a in wt; t in Lesch-Nyhan RJK 1760, no exon 2
                                 [12]
    mut       14808    14809     tt in wt; ttt in Lesch-Nyhan Chicago [7]
    mut       14874    14874     t in wt; c in Lesch-Nyhan Detroit Leu->Pro [7]
    mut       14877    14879     tta in wt; ta in Lesch-Nyhan RJK 1939 [12]
    mut       14886    14886     g in wt; a in Lesch-Nyhan RJK 2163 Arg->Lys
                                 [12]
    mut       16619    16619     c in wt; g in gout Toronto Arg->Gly [20]
    mut       16623    16664     atgtgatgaaggagatgggaggccatcacattgtagccctct in
                                 wt; at in Lesch-Nyhan RJK 2108 [12]
    mut       16675    16676     ag in wt; agg in Lesch-Nyhan RJK 866 [13]
    mut       16677    16677     g in wt; a in Lesch-Nyhan New Haven Gly->Glu
                                 [7]
    mut       16679    16679     g in wt; c in Lesch-Nyhan Yale Gly->Arg [10]
    mut       16690    16690     c in wt; a in Lesch-Nyhan Flint RJK 892
                                 Phe->Leu [5] [13]
    mut       16707    16707     a in wt; t in gout Arlingen Asp->Val [7]
    mut       16756    16759     tgta in wt; ta in Lesch-Nyhan RJK 1332 [13]
    mut       16780    16780     c in wt; a in gout Munich Ser->Arg [3] [19]
    mut       16784    16787     tgtg in wt; tg in Lesch-Nyhan RJK 1747 [12]
    mut       27897    27897     c in wt; t in Lesch-Nyhan RJK 1930 Gln->TAG
                                 [12]
    mut       27911    27911     c in wt; t in gout London Ser->Leu [2] [4]
                                 [12] [14]
    mut       31621    31621     t in wt; a in Lesch-Nyhan Midland RJK 896
                                 Val->Asp [6] [13]
    mut       31622    31624     ctt in wt; ct in Lesch-Nyhan RJK 2019 [12]
    mut       31624    31624     t in wt; c in Lesch-Nyhan RJK 1784 Leu->Ser
                                 [13]
    mut       31628    31628     t in wt; g in gout Ann Arbor Ile->Met [11]
    mut       34962    34970     tgcag in wt; agcaaagcaa in Lesch-Nyhan RJK
                                 1210 Met->Lys [13]
    mut       35015    35015     g in wt; t in gout Milwaukee RJK 949 Ala->Ser
                                 [7] [13]
    mut       39837    39837     c in wt; t in Lesch-Nyhan RJK 974 Arg->TGA
                                 [13]
    mut       39842    39846     tgttg in wt; tg in Lesch-Nyhan RJK 1894 [12]
    mut       39858    39858     g in wt; t in Lesch-Nyhan RJK 2185 Asp->Tyr
                                 [12]
    mut       39866    39866     g in wt; a in Lesch-Nyhan RJK 1934. no exon 7
                                 [12]
    mut       40034    40038     tgttg in wt; tg in Lesch-Nyhan Michigan RJK
                                 855 [7] [13]
    mut       40080    40080     g in wt; a in Lesch-Nyhan Kingston RJK 2188
                                 Asp->Asn [12] [19]
    mut       40095    40095     t in wt; g in Lesch-Nyhan New Briton RJK 950
                                 Phe->Val [7] [13]
    mut       40102    40102     a in wt; g in gout Ashville Asp->Gly [8]
    mut       40114    40114     g in wt; a in Lesch-Nyhan RJK 888, GM 7092, no
                                 exon 8 [12]
    mut       41486    41509     aaaatacaaagcctaagatgagag in wt; ag in
                                 Lesch-Nyhan Evansville RJK 894 [7] [12]
    mut       41450    41452     ata in wt; ttt in Lesch-Nyhan RJK 906, GM
                                 1899, downstream cryptic splice site used
    mut       41454    41454     c in wt; g in Lesch-Nyhan RJK 1874, RJK 2079,
                                 His->Asp [12] [13]
    mut       41461    41461     g in wt; a in Lesch-Nyhan RJK 1727 Cys->Tyr
                                 [13]
BASE COUNT    15689 a  11281 c  11599 g  18167 t
ORIGIN      1 bp upstream of EcoRI site; chromosome Xq26.
        1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt
       61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc
      121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc
      181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt
      241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga
      301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc
      361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag
      421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc
      481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt
      541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct
      601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga
      661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt
      721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat
      781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga
      841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag
      901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct
      961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc
     1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag
     1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac
     1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc
     1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg
     1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt
     1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg
     1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg
     1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg
     1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc
     1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc
     1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg
     1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg
     1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg
     1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc
     1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga
     1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca
     1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc
     2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat
     2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg
     2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag
     2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg
     2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct
     2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt
     2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg
     2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca
     2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg
     2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt
     2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg
     2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa
     2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt
     2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg
     2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc
     2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc
     3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt
     3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt
     3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct
     3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat
     3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt
     3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag
     3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc
     3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc
     3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta
     3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac
     3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc
     3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt
     3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc
     3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc
     3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc
     3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag
     3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc
     4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca
     4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc
     4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc
     4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa
     4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca
     4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga
     4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac
     4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg
     4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt
     4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac
     4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg
     4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag
     4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg
     4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca
     4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg
     4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat
     4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg
     5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc
     5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta
     5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc
     5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact
     5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct
     5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg
     5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg
     5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc
     5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa
     5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat
     5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa
     5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg
     5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc
     5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa
     5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt
     5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt
     6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg
     6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga
     6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg
     6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc
     6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt
     6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc
     6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg
     6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt
     6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga
     6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac
     6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct
     6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag
     6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc
     6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt
     6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg
     6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc
     6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca
     7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc
     7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga
     7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt
     7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc
     7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg
     7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag
     7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag
     7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca
     7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt
     7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac
     7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga
     7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt
     7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa
     7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct
     7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact
     7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt
     7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc
     8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac
     8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct
     8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg
     8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat
     8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag
     8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat
     8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg
     8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg
     8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc
     8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac
     8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg
     8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc
     8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat
     8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga
     8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta
     8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg
     9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt
     9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag
     9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt
     9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc
     9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag
     9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt
     9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca
     9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc
     9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc
     9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat
     9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg
     9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc
     9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga
     9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca
     9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat
     9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta
     9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc
    10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc
    10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa
    10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa
    10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat
    10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt
    10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct
    10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt
    10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga
    10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca
    10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct
    10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg
    10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg
    10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa
    10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta
    10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa
    10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca
    10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct
    11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa
    11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc
    11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat
    11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc
    11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag
    11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag
    11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga
    11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg
    11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg
    11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt
    11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac
    11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa
    11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa
    11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa
    11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt
    11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt
    12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag
    12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat
    12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat
    12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc
    12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt
    12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt
    12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct
    12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat
    12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa
    12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag
    12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt
    12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac
    12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca
    12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct
    12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt
    12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc
    12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta
    13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc
    13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg
    13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc
    13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct
    13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt
    13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg
    13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat
    13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca
    13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga
    13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt
    13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac
    13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta
    13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg
    13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc
    13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat
    13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg
    13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc
    14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt
    14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga
    14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt
    14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc
    14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg
    14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc
    14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc
    14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca
    14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg
    14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata
    14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg
    14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac
    14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg
    14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat
    14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac
    14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa
    15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt
    15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg
    15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg
    15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg
    15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt
    15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat
    15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt
    15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata
    15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat
    15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag
    15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg
    15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt
    15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga
    15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc
    15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca
    15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag
    15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt
    16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag
    16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt
    16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa
    16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa
    16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt
    16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact
    16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac
    16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac
    16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa
    16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg
    16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg
    16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag
    16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa
    16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa
    16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta
    16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta
    16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg
    17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt
    17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt
    17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt
    17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc
    17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg
    17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc
    17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt
    17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa
    17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc
    17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat
    17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc
    17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat
    17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact
    17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa
    17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt
    17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa
    18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat
    18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat
    18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt
    18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat
    18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc
    18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg
    18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt
    18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt
    18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca
    18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc
    18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt
    18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg
    18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg
    18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat
    18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa
    18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat
    18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg
    19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa
    19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag
    19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga
    19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc
    19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa
    19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg
    19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc
    19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa
    19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg
    19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa
    19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa
    19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg
    19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag
    19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga
    19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag
    19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt
    19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct
    20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt
    20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac
    20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct
    20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga
    20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat
    20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg
    20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat
    20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt
    20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc
    20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag
    20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt
    20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca
    20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta
    20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg
    20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc
    20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt
    21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc
    21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg
    21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc
    21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc
    21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa
    21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat
    21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa
    21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat
    21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat
    21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt
    21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt
    21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta
    21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt
    21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt
    21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct
    21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct
    21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat
    22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc
    22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg
    22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca
    22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag
    22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc
    22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat
    22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg
    22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct
    22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca
    22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt
    22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc
    22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta
    22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct
    22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta
    22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct
    22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa
    22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa
    23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag
    23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc
    23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa
    23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc
    23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac
    23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga
    23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt
    23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc
    23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc
    23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat
    23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc
    23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc
    23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc
    23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga
    23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga
    23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa
    24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg
    24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat
    24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat
    24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct
    24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt
    24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga
    24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg
    24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt
    24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc
    24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt
    24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc
    24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg
    24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat
    24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa
    24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt
    24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg
    24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc
    25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag
    25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc
    25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact
    25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta
    25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg
    25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc
    25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag
    25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc
    25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca
    25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag
    25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag
    25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt
    25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag
    25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat
    25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa
    25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata
    25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg
    26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt
    26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat
    26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt
    26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag
    26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc
    26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact
    26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg
    26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat
    26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag
    26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt
    26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt
    26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata
    26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta
    26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat
    26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg
    26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata
    27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata
    27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg
    27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc
    27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat
    27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag
    27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga
    27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt
    27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac
    27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa
    27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg
    27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt
    27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat
    27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct
    27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata
    27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt
    27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat
    27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct
    28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg
    28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca
    28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat
    28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa
    28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca
    28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc
    28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg
    28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa
    28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt
    28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt
    28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt
    28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat
    28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa
    28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac
    28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac
    28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca
    28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct
    29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac
    29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa
    29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc
    29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga
    29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt
    29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca
    29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca
    29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt
    29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt
    29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca
    29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg
    29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt
    29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct
    29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg
    29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag
    29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct
    30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga
    30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact
    30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt
    30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa
    30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt
    30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg
    30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc
    30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc
    30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag
    30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga
    30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta
    30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata
    30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct
    30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata
    30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa
    30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca
    30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga
    31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg
    31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac
    31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat
    31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga
    31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg
    31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt
    31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt
    31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa
    31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat
    31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg
    31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac
    31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat
    31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg
    31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt
    31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt
    31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac
    31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta
    32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa
    32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag
    32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta
    32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt
    32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt
    32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact
    32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc
    32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat
    32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa
    32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta
    32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca
    32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg
    32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc
    32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt
    32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt
    32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt
    33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc
    33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac
    33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta
    33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac
    33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta
    33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta
    33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata
    33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata
    33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc
    33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat
    33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa
    33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct
    33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc
    33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt
    33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga
    33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct
    33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag
    34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc
    34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat
    34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac
    34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt
    34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga
    34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt
    34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc
    34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt
    34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca
    34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct
    34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac
    34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa
    34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc
    34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc
    34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg
    34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt
    34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac
    35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt
    35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat
    35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta
    35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag
    35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga
    35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc
    35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc
    35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt
    35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag
    35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat
    35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta
    35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat
    35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta
    35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg
    35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt
    35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa
    36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt
    36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag
    36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat
    36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga
    36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa
    36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt
    36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga
    36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct
    36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg
    36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg
    36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga
    36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag
    36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga
    36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta
    36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt
    36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag
    36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta
    37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa
    37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc
    37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc
    37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg
    37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc
    37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg
    37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac
    37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag
    37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg
    37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc
    37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt
    37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc
    37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt
    37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc
    37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc
    37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag
    37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa
    38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat
    38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc
    38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat
    38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga
    38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata
    38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa
    38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag
    38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt
    38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg
    38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc
    38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg
    38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa
    38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg
    38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa
    38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt
    38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg
    39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa
    39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt
    39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg
    39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc
    39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga
    39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga
    39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc
    39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca
    39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac
    39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa
    39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc
    39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg
    39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct
    39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa
    39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct
    39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga
    39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat
    40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg
    40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt
    40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa
    40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct
    40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag
    40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag
    40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc
    40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact
    40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg
    40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg
    40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag
    40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac
    40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag
    40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg
    40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca
    40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca
    40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct
    41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg
    41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc
    41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac
    41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg
    41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct
    41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc
    41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt
    41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag
    41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg
    41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta
    41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac
    41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc
    41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt
    41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata
    41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga
    41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg
    42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt
    42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc
    42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt
    42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt
    42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta
    42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg
    42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct
    42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt
    42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta
    42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag
    42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta
    42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca
    42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa
    42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg
    42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag
    42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac
    42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa
    43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt
    43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc
    43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct
    43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat
    43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat
    43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc
    43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt
    43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc
    43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt
    43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt
    43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc
    43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca
    43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag
    43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg
    43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa
    43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg
    43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt
    44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca
    44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc
    44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta
    44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact
    44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc
    44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca
    44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg
    44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg
    44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc
    44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac
    44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg
    44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta
    44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca
    44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc
    44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact
    44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt
    45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga
    45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt
    45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg
    45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc
    45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta
    45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg
    45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc
    45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac
    45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct
    45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca
    45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa
    45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa
    45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa
    45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat
    45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg
    45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt
    45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt
    46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca
    46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc
    46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca
    46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg
    46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt
    46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag
    46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct
    46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt
    46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc
    46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc
    46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc
    46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa
    46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc
    46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat
    46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg
    46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc
    46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac
    47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac
    47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca
    47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg
    47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca
    47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc
    47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga
    47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc
    47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa
    47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact
    47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag
    47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg
    47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc
    47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa
    47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga
    47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg
    47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga
    48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc
    48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct
    48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc
    48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg
    48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg
    48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt
    48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg
    48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa
    48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat
    48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc
    48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg
    48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga
    48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat
    48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg
    48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc
    48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt
    48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt
    49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg
    49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag
    49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc
    49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct
    49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat
    49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa
    49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc
    49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc
    49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc
    49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc
    49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta
    49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg
    49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa
    49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc
    49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag
    49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct
    49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt
    50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat
    50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc
    50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt
    50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct
    50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga
    50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca
    50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt
    50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca
    50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt
    50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg
    50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca
    50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa
    50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga
    50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct
    50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt
    50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac
    51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca
    51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca
    51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct
    51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca
    51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac
    51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc
    51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac
    51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa
    51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc
    51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct
    51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc
    51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc
    51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga
    51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt
    51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac
    51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg
    51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa
    52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga
    52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag
    52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa
    52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg
    52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat
    52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg
    52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca
    52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa
    52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag
    52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca
    52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt
    52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta
    52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa
    52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa
    52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac
    52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc
    52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca
    53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata
    53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt
    53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag
    53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc
    53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact
    53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc
    53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat
    53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag
    53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca
    53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag
    53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc
    53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc
    53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc
    53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt
    53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta
    53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct
    54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg
    54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac
    54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt
    54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg
    54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc
    54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc
    54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg
    54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga
    54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag
    54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc
    54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta
    54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta
    54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg
    54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta
    54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga
    54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa
    54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca
    55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc
    55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca
    55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc
    55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc
    55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc
    55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta
    55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg
    55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg
    55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt
    55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt
    55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg
    55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat
    55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac
    55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt
    55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa
    55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa
    55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta
    56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg
    56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag
    56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca
    56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt
    56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca
    56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt
    56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct
    56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg
    56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg
    56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa
    56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc
    56701 gccgcccctg gtgatggacc agcggggctt cgacga
//
LOCUS       VSVGLYPA     1657 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 84-PN-H) glycoprotein gene,
            complete cds.
ACCESSION   M35207
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 84-PN-H, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      482 a    364 c    364 g    447 t
ORIGIN      
        1 aacagagatc aatctgtttc ctcgacacca tgaagtgcct tttgtactta gctctgttat
       61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggtaattgga
      121 aaaatgttcc ttccagttac cattactgcc catcaagctc agatttgaat tggcataatg
      181 gcctaatagg cacagcctta caggtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac
      301 cgaaatatat aacacattcc atccggtcct tcacaccatc cgtggagcaa tgcaaggaaa
      361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactccc catcatgtac
      481 ttgtcgatga gtacacggga gaatgggttg attcccaatt catcaacggg aaatgcagca
      541 atgacatctg ccctactgtg cataactcca caacctggca ttccgattac aaagttaaag
      601 ggctgtgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggggagc
      661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgct tacgaaactg
      721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg
      781 tttggttcga aatggcagac aaagatctgt ttgctgcagc aaaattccca gaatgtccag
      841 aagggtcaag catatctgct ccatctcaga catcagtgga cgtgagtctc atccaggatg
      901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc
      961 ttcccatctc tccggtagac ctaagctatc ttgctcctaa aaatccagga accggcccag
     1021 ctttcacaat catcaatggt actctgaaat attttgagac caggtacatc agagtcgata
     1081 ttgctgctcc aattctccca agaatggttg gaacaatcag tggaactacc acagaaagag
     1141 aactgtggga agactgggca ccatatgaag atgtggaaat cgggcctaat ggagttctga
     1201 aaaccagttc aggatataaa tttcctttat acatgattgg acatggcatg ctggactctg
     1261 atctccatct cagctcgaag gctcaagtgt ttgaacaccc tcacattcag gatgctgcat
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttgtcc aaaaatccaa
     1381 tcgagctcgt agaagggtgg ttcagcggtt ggaagagctc cattgcttct tttttcttta
     1441 tcataggttt gatcattgga ttattcttgg tcctccgagt tggtatctat ctttgcatca
     1501 agttaaaaca gaccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaaccaga atcggacacg acagagtaat catatttaaa ccaaatcgac ttgtgatatc
     1621 atgctccaag aggctttaat tatgtttcaa ttttatg
//
LOCUS       VSVGLYPB     1657 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 85-CR-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35208
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-CR-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      487 a    358 c    366 g    446 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagtcttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPC     1657 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 87-CR-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35209
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-CR-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      489 a    358 c    364 g    446 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPD     1657 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 84-CR-B) glycoprotein gene,
            complete cds.
ACCESSION   M35210
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 84-CR-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      488 a    357 c    365 g    447 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 gattgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPE     1665 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 87-OA-B) glycoprotein gene,
            complete cds.
ACCESSION   M35211
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-OA-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      476 a    364 c    358 g    467 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
       61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
      301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
      541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
      601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
     1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
     1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcgtct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta
     1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
     1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
     1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS       VSVGLYPF     1665 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 87-VC-B) glycoprotein gene,
            complete cds.
ACCESSION   M35212
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-VC-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      476 a    362 c    358 g    469 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
       61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagccttg caagttaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
      301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
      541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
      601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
     1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
     1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcgtct tagctcaaag gctcaggtat ttgaacatcc tcacattcaa gacgctgctt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta
     1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
     1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
     1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS       VSVGLYPG     1657 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 84-PN-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35213
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 84-PN-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      488 a    356 c    366 g    447 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttctggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcggg tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccga
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctatgcatca
     1501 acttacggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgtttgaa ttgtatg
//
LOCUS       VSVGLYPH     1665 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain Mudd-Summers) glycoprotein gene,
            complete cds.
ACCESSION   M35214
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain Mudd-Summers, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      477 a    366 c    353 g    469 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacca tgaagtgcct tttgtactta gcttttttat
       61 tcatcggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttagtagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtacggac
      301 cgaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gctgaagcag cgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca
      541 atgacatatg ccccactgtc cataactcca caacctggca ttccgactat aaggtcaaag
      601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct aggaaagaag ggcacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg
     1021 tctttaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg
     1141 tactgtggga tgactgggct ccatatgaag acgtgggaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt ttgaacatcc tcacattcaa gacgctgctt
     1321 cgcagcttcc tgatggtgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagtttgt agaaggttgg ttcagtagtt ggaagagctc tattgcctct tttttcttta
     1441 ccatagggtt aatcattgga ctattcttgg ttctccgagt tggtatttat ctttgcatta
     1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 cgtaactcaa atcctgcaca acagattctt catgtttgaa ccaaatcaac ttgtgatatc
     1621 atgctcaaag aggccttaat tatattttaa tttttaattt ttatg
//
LOCUS       VSVGLYPI     1665 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 86-DF-P) glycoprotein gene,
            complete cds.
ACCESSION   M35215
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 86-DF-P, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      477 a    363 c    357 g    468 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
       61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
      301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
      541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
      601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
     1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
     1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctct tttttcttta
     1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
     1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
     1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS       VSVGLYPJ     1788 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 82-HD-B) glycoprotein gene,
            complete cds.
ACCESSION   M35216
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 82-HD-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1788)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      508 a    362 c    344 g    574 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat
     1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg
//
LOCUS       VSVGLYPK     1788 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 83-HD-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35217
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 83-HD-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1788)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      508 a    362 c    344 g    574 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat
     1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg
//
LOCUS       VSVGLYPL     1815 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 83-HD-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35218
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 83-HD-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1815)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      512 a    358 c    346 g    599 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gattgctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatt agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcactat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggcctaatt catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taattttatt tttttatttt tattttttat tttttatttt ttatttttta
     1741 ttttttattt tttattttta ttttttatct ttttattttt tattttttat ttttattttt
     1801 atttttattt ttatg
//
LOCUS       VSVGLYPM     1665 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain San Juan 56-NM-B) glycoprotein
            gene, complete cds.
ACCESSION   M35219
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain San Juan 56-NM-B, Indiana
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      484 a    364 c    355 g    462 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
       61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttctaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagccata caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
      301 cgaagtatat aacacagtcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gccgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 tggttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca
      541 attacatatg ccccactgtc cataactcta caacctggca ttctgactat aaggtcaaag
      601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct gggaaaggag ggcacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gaggcaaggc ctgcaaaatg caatactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtcta attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttccaatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg
     1021 ctttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat acatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcaa gacgctgctt
     1321 cgcaacttcc tgatgatgag agtttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtagtt ggaaaagctc tattgcctct tttttcttta
     1441 tcatagggtt aatcattgga ctattcttgg ttctccgagt tggtatccat ctttgcatta
     1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgga ccaaatcaac ttgtgatacc
     1621 atgctcaaag aggcctcaat tatatttgag tttttaattt ttatg
//
LOCUS       VSVGLYPN     1812 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 85-ES-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35220
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-ES-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1812)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      512 a    362 c    347 g    591 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt
     1741 tttatttttt attttttatt tttatttttt atctttttat ttcttatttt tatttttatt
     1801 tttattttta tg
//
LOCUS       VSVGLYPO     1861 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 86-ES-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35221
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 86-ES-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1861)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      520 a    363 c    347 g    631 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
     1741 ttttattttt tattttttta ttttttattt tttatttttt atttttattt tttatttttt
     1801 tattttttta ttttttattt ttatttttat ttttattttt atttttattt ttatttttat
     1861 g
//
LOCUS       VSVGLYPP     1802 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain Mudd-Summers 84-GM-B)
            glycoprotein gene, complete cds.
ACCESSION   M35222
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain Mudd-Summers 84-GM-B, Indiana
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1802)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      506 a    362 c    348 g    586 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
       61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta
     1741 ttttttattt ttatttttta tctctttatt ttttattttt atttttattt ttatttttta
     1801 tg
//
LOCUS       VSVGLYPQ     1868 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 87-ES-B) glycoprotein gene,
            complete cds.
ACCESSION   M35223
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-ES-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1868)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      520 a    362 c    347 g    639 t
ORIGIN      
        1 aacagagatc aatctgtttt cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat tacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct catcatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catatttcaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
     1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt
     1801 atttttttat ttttttattt tttattttta tttttatttt tatttttatt tttattttta
     1861 tttttatg
//
LOCUS       VSVGLYPR     1860 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 86-ES-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35224
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 86-ES-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1860)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      520 a    362 c    347 g    631 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcta caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
     1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt
     1801 atttttttat tttttatttt tatttttatt tttattttta tttttatttt tatttttatg
//
LOCUS       VSVGLYPS     1657 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 69-PN-L) glycoprotein gene,
            complete cds.
ACCESSION   M35225
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 69-PN-L, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      487 a    358 c    364 g    448 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcattggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattattgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgccacgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 ctaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca agtgactcct catcatgtat
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggagaac
      661 tctcatccct agggaaggaa ggcacgggat tcagaagcaa ttactttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgac aaagatctat ttgctgcagc aaaattcccg gagtgtcccg
      841 aaggatcgag tatctccgct ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt ggagcaggtc
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctaaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccacct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg ttcagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttgttcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca ataccacaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPT     1657 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 87-CR-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35226
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-CR-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      490 a    358 c    363 g    446 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg aatcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPU     1657 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 85-CR-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35227
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-CR-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      486 a    357 c    367 g    447 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttgga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPV     1796 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 86-GM-B) glycoprotein gene,
            complete cds.
ACCESSION   M35228
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 86-GM-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1796)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      503 a    362 c    349 g    582 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
       61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttcccattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat agtcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta
     1741 ttttttattt tttattttta ttttttatct ttttattttt atttttattt tttatg
//
LOCUS       VSVGLYPW     1800 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 85-GM-B) glycoprotein gene,
            complete cds.
ACCESSION   M35229
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-GM-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1800)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      506 a    362 c    348 g    584 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
       61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttta ttatttttta ttttttattt
     1741 ttatttttta tttttatttt ttcatctttt tattttttat ttttattttt attttttatg
//
LOCUS       VSVGLYPX     1652 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 59-PN-L) glycoprotein gene,
            complete cds.
ACCESSION   M35230
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 59-PN-L, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1652)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      487 a    344 c    367 g    453 t      1 others
ORIGIN      
        1 aacagagatc aatctgtctt cctagcatca tgaaatgnat tttgtacttc gcttttctcc
       61 tggtaggagt gaactgtaaa tttacaatag ttttcccaca taatcaaaaa gggaactgga
      121 aaaatgtgcc atccagttac cattattgtc catctagttc cgatcttaat tggcacaatg
      181 gcctgattgg tacagcatta caagtcaaga tgcccaaaag tcacaaagca attcaggcag
      241 acggctggat gtgtcatgca tccaaatggg tcaccacatg tgacttcagg tggtacggac
      301 ccaaatatat tacgcattct attcggtcat tcacaccctc tgtggaacag tgcagagaaa
      361 gcattgaaca gaccaaacaa ggaatctgga taaatcctgg ctttcctcct cagagttgtg
      421 gatatgcaac tgtgacggac gcggaagcag taattgtcca ggtcacccca catcatgtgc
      481 tcgtagatga gtacacaggc gaatgggtag attcccaatt tttgaacgga aagggcaaag
      541 acgacatatg ccccacagtc cataattcga ccacatggca ttcagactac aaagtcaaag
      601 ggctttgtga tgctaaccta atctcaatgg acatcacttt cttctcagaa gatggtgagc
      661 tgtcctcctt gggcaaagat gggactggat tcagaagcaa ttactttgct tatgaaaatg
      721 gagataaggc ttgccgcatg cagtattgca aacattgggg agtcagactc ccttctggag
      781 tctggttcga gatggcagac aaagatttgt ttgcaggagc caagttcccc gagtgtccag
      841 acggaaccag tatctctgct ccttctcaga catccgtaga tgtaagctta attcaggacg
      901 tcgaaagaat cttggactat tccttgtgtc aagagacttg gagtaaaatc aaggcaggcc
      961 ttccgatttc tccagttgat ctaagctatc tcgctccaaa aaatccaggg acaggtcccg
     1021 ctttcaccat catcaatggg accttgaagt actttgagac cagatacatc agagtggaca
     1081 ttgctgctcc aatcataccc agaatggtgg gaactatcag tggaacatct acagaaagag
     1141 aattgtggga tgactggtac tcttatgaag atgtggagat tgggcccaat ggagtattga
     1201 aaacaacctc aggatataaa tttcctcttt atatgattgg acacggaatg ctggactctg
     1261 atcttcacct gacatcgaag gcccaggttt ttgaacatcc ccacattcag gacgctgcat
     1321 cacagcttcc tgacggcgag actttatttt ttggtgatac cgggctctca aaaaacccca
     1381 ttgaattagt agagggatgg ttcagtgggt ggaaaagctc tattgcttcg ttcttctttg
     1441 tcattggatt gatcattgga ttatatttgg ttttaagggt gggaatatat ctttgtctta
     1501 aactaaagca tacaaaaaag agaaaaattt acacagacat tgagatgaac agacttggaa
     1561 aataaccaaa attgtgtgat ggatttgctt ttttgccaaa gaagcctgtg atatccttct
     1621 cagagaagct ttaatctttt attgacaata tg
//
LOCUS       VSVGLYPY     1657 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 84-PN-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35231
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 84-PN-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      489 a    367 c    357 g    444 t
ORIGIN      
        1 aacagagatc catctgtttc ctcgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggaaattgga
      121 aaaatgttcc ttccagttac cattactgcc cgtcaagttc agatttgaac tggcataacg
      181 gcctaatagg cacagcccta caggtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac
      301 cgaaatatat aacacattcc atccggtcct tcacaccatc tgtggagcaa tgcaaggaaa
      361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtaactccc catcatgtac
      481 ttgtcgatga gtacacggga gagtgggtag attcccaatt catcaacggg aaatgcagca
      541 atgacatatg ccctactgtc cataactcca caacctggca ttccgactac aaagtcaaag
      601 ggctgtgtga ttctaacctc atctccatgg acatcacctt cttctcagaa gatggggagc
      661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgca tacgaaacag
      721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg
      781 tctggtttga aatggccgac aaagatctgt ttgctgcagc aaaattccca gaatgtccag
      841 aagggtcaag catctctgct ccatctcaga catcagtgga cgtgagtctt attcaggacg
      901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc
      961 ttcccatctc tccggtagat ctaagctatc ttgctccaaa aaacccagga accgggccag
     1021 cctttacaat aatcaatggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattctccca agaatggtcg gaacaatcag tggaactacc accgagagag
     1141 aattgtggga cgactgggca ccatatgaag atgttgaaat cggaccaaat ggagttctga
     1201 aaaccagttc aggatataaa ttccctttat atatgattgg acatggcatg ctggactctg
     1261 atctccatct cagctcaaag gctcaggtat ttgaacaccc tcacattcag gatgctgcat
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaatccaa
     1381 tcgagctcgt agaagggtgg ttcagtggtt ggaaaagctc cattgcttct tttttcttta
     1441 tcataggttt gatcattgga ttattcttgg ttctccgagt tggtatctat ctctgcatca
     1501 agttaaaaca taccaaaaaa agaaaaatct atacagacat tgagatgaat cgactcggaa
     1561 agtaactgga atcagacacg acagaataat catatttaaa ccaaatcgac ttgtgatatc
     1621 atgctctaag aggctttaat tatgttttaa ttttatg
//
LOCUS       VSVGLYPZ     1812 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Vesicular stomatitis virus (strain 85-ES-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35232
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-ES-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1812)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      513 a    360 c    346 g    593 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacttggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat aaagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt
     1741 tttatttttt attttttatt tttatttttt atctttttat tttttatttt tatttttatt
     1801 tttattttta tg
//
LOCUS       CAVXDHB      8062 bp ds-DNA             INV       27-JUL-1990
DEFINITION  C.vicina xanthine dehydrogenase (Xdh) gene allele 1, exons 2-4.
ACCESSION   M30316
KEYWORDS    xanthine dehydrogenase.
SOURCE      C.vicina third instar larvae fat body DNA (library of Schenkel et
            al.), clone Lcv1.
  ORGANISM  Calliphora vicina
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Muscoidea; Calliphoridae.
REFERENCE   1  (bases 1 to 8062)
  AUTHORS   Houde,M., Tiveron,M.-C. and Bregegere,F.
  TITLE     Divergence of the nucleotide sequences encoding xanthine
            dehydrogenase in Calliphora vicina and Drosophila melanogaster
  JOURNAL   Gene 85, 391-402 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Bregegere, 01-DEC-1989.
FEATURES       from  to/span     description
    pept    /  2538     6299     xanthine dehydrogenase, exon 2 (AA at 2538)
                                 (EC 1.1.1.204)
               6367     6534     xanthine dehydrogenase, exon 3
               7233     7283     xanthine dehydrogenase, exon 4
    pre-msg <     1     7400     xanthine dehydrogenase mRNA and introns
    IVS     <     1     2537     xanthine dehydrogenase intron A
    IVS        6300     6366     xanthine dehydrogenase intron B
    IVS        6535     7232     xanthine dehydrogenase intron C
    signal     7370     7375     poly-A signal
    signal     7391     7396     poly-A signal
BASE COUNT     2516 a   1358 c   1473 g   2715 t
ORIGIN      
        1 aagcttcaac ctgtaaaata aatttcaata aaatgttata aaatcttata ttttggtaaa
       61 aattccattt gaataaaaca aatctagacg aaattataca tacaatcaaa tttatgcact
      121 tcagatgttg aaaacagatt ggaaaattac ttattatacg tttcttagtg ttcgatttat
      181 ttcgcttagc aataattgca aaatatttga ttgatttgta accataataa catcgttttt
      241 aaatatgagt acttacttgt cttatcaact ttgttgttta tatatcaata tatatgtata
      301 tttcattaaa aaacaataac aattttaaaa tggcaacttt ttgtgtaacc ttaatgaaaa
      361 aaaaaaaata acacataaaa catcttatca atgactgctg tgagaccccc ctgtgacgta
      421 cagataataa caatcttaaa actgttatac tgcacttttg taggactaat gaattagtaa
      481 acaaataaat aactctgtat aaatatagaa cattttatta tccacatatt agttaaattt
      541 gtcttggagg tctaatagct agtttctggg ataataaaat ctacattctt cgtttaaacc
      601 tagattaagt aaactacaaa ttttatttct cactaatggc gtaaagagta tatgtaatct
      661 aggtttaact gagcattatt gacgagttaa acctatgaaa aaaattattc caattgattt
      721 tgtttcttca gattccattt gatttcgaca gttcccaaaa tattttataa ttctgaattt
      781 ccaaatttaa tttggttttt ggtaaccccc tgtttatact tgacaaattt tgatcataat
      841 aattaatgac gtttgttgcc aagacaaagt tgtcctgcaa tagcactaac aattgtgtca
      901 taacgaagca ataatactaa taaatgaaga cattacctga aatttttttt atcttgacaa
      961 caattttgac ttttatcatg ataaaaattt atcaggtata gataattatt taaaaatatc
     1021 tgaaaaacaa ataattacaa actaataaca attcgacata taagacattc acacgatgaa
     1081 ttttcagtca tgcaattgta gcaattctct tttgaatttt cttttactcc acccctcttg
     1141 ccaacaaact cataaaatca cttagagtga catgacaaaa caagacatcg taaaaattcc
     1201 ttattaaaaa aaaacaagtt agagagctat attcggcagt gccgaatctt atataccctt
     1261 caccaaatta tacttcaaaa taaaaatttt aaatattttt aagtaaacaa attttttttt
     1321 ccaaagttgt ttttttaagt ttttaggaaa tttttttttt cgaattgtta ttttaaaatt
     1381 tttaaacagc tgttacgaac aaaatcaact atgtattgtg aatgaaaagt tcatgagaat
     1441 attacgatag caattttccc tttgagggaa cataaatttc acgtattatt gccgataggg
     1501 tctattgact agcatgtatt ttgttgttgc aagagttagg ttttggacaa cagacttagg
     1561 tttttgactc gtctcgtctt ttattacaca gttttatttt atatttatta ttacagaagt
     1621 ttaaaagttt ccattgtttt gccaatgtta agcattatat ccacatatac atgtattttt
     1681 gttaattatt ccctatattt taatagaaaa cctggtgaaa ggtcatattt tttaaaactg
     1741 cacagagaaa atgattcggt tgcagacata gacattttcg gttctatcaa cagaaaatca
     1801 gttgataaag aagaatttcc gttgaagcaa ccaaactttt gttacctctt ctaaaatgtt
     1861 gtagccataa ctgtaaaatt cggtcactaa gaatcattcg attggcaaca aattcggttg
     1921 ctatcacgaa tctgttttct ctgtgtgaaa gtatggaata ttgttttaaa aaagtttcca
     1981 ttttttcgct agtaggtctg ggttaattaa taactcgcgg cttcgtataa cgattcctgt
     2041 tctacgtcgt ttacaccaac cgcatgtatt taatttgaac aaatcctggt ggagaccata
     2101 aaaacccttc gatactggtg gagtatcatt tcaaactcaa atcgtaaatt ggtagagaac
     2161 catataaaac tcgacggatc aactggagca gtatcattat cactcttgat tttaataaca
     2221 aggtttaagc attctcactt ctatgaagac atatgtaggt catttaggct tacttaaata
     2281 gacatgggat aacctggaaa atttgtgaga cgttatactc aaccaacaaa ctttaaaaca
     2341 attttattgt cactctctgt ctaaagcatc gtaatctata catcacttta cttcacaaat
     2401 gttacaatac tctccgtaat taatcttaaa tatcatcaat tacacatttt gtttaagtaa
     2461 tttaaaaata ttaagttctg cattcttaaa atacagcttt gattcaagaa atataactat
     2521 ttgtcttatt ttcctaggta attgatacca atcccgatcc agaatgtacc cttctaacat
     2581 atttacgtga aaaactaaga ttatgtggca caaaattggg ctgtggcgag ggtggttgtg
     2641 gtgcctgcac cgtcatgatt tctcgcattg atacgctaac gaatcggatt aaacacatag
     2701 cggtgaatgc ctgcctgact ccggtatgtg ccatgcatgg tagtgccgtg actaccgtgg
     2761 aaggtattgg cagtacgcga actcgattgc atccagtaca agagcgtctg gctaaagccc
     2821 acggcagtca gtgtggtttt tgtacacccg gtattgtcat gtctatgtat gcactgttga
     2881 gaaatttatc ccagccctcg atgaaagatt tggaaattgc tttccaggga aatttgtgca
     2941 gatgcacggg ctatagaccc attttagagg gttataaaac atttactaaa gaattcggct
     3001 gtgctatggg tgataaatgt tgtaaggtga atggcaataa atgtggagag ggtatggaaa
     3061 atggtggtga catggtcgac gataaactat ttgagaaaag tgaatttgta cctttcgatc
     3121 cctcacaaga acccatattt cctccggagc tgcagctgaa taaagattgg gactctcaaa
     3181 cgttagtgta taagggcgaa cgtgctactt ggtatagacc tggcaattta gaagatttac
     3241 tgaagattaa ggcccaattt ccggaagcca agttggttgt gggcaacact gaaataggcg
     3301 tggaggttaa atttaaacat tttctatatc cagtcttggt aaatcccaca aaagttaagg
     3361 aaatgattga tgtccaggaa ctcgaggata gtatttattt tggtgcttcc gttagtttaa
     3421 tggatataga taggatttta agatccagca tagaaaagct acccgaacat caaacaagat
     3481 ttttccaatg tgccgtgaat atgttgcatt attttgctgg taaacaaatt cgtaatgtgg
     3541 cgtcattggg tggcaatatt atgacgggta gtccaatatc cgatatgaat ccggtgttga
     3601 tggcgggtgc tgttaagttg aaagtggcca agtatgttga gggtcaaata aaatacagag
     3661 aagtttgcat ggcatctggc ttctttacgg gttatcgcaa gaatgttata gaacccacag
     3721 aaattttggt gggtctgtat ttccccaaga ctttggaaca tcaatatgtg gtggctttca
     3781 agcaggccaa acgtagagat gatgatattg ctattgtaaa tgcagccata aatgtattca
     3841 ttgatcccag gtccataact gtggataagg tgtatatggc ttttggtggt atggctccta
     3901 ccactgtcct ggccacacgc actgccgaca taatggttaa acaacaatgg aataaggtgc
     3961 ttatggagcg tgtggtggaa aatctgtgtg cagagttgcc tttggctccc tcagctcctg
     4021 gtggcatgat tgcctatcgt cgttccctgg tggtgagctt atttttcaag gcctacttaa
     4081 caataaccca acaattaatt aaatctggca ttttgcccca agattcttta ccccaagaag
     4141 aactaagcgg ttctgatgtc ttccatactc ctgccttaaa aagtgctcaa ctgtttgaaa
     4201 aagtttcaaa caaacaatcg gaatgtgatc ctattggcag accaaagata catgcctccg
     4261 cattgaagca ggctaccgga gaagccatct actgcgatga tatgcctcgc atggaaaatg
     4321 aattatattt ggcgctggtg ttaagtacaa aagctcatgc gaaaatccta agcattgatg
     4381 cttcggaagc tctggcaatg cccggtgttc atgctttctt ctccagcaaa gacataaccc
     4441 aacatgaaaa tgaagttggt ccagtgtttc atgatgaaga ggttttcgcc tcagatatgg
     4501 tttactgtca aggtcaagtt attggagcta tagcagccga taatccaaac tttagctcaa
     4561 agacggctcg taaggttaca atcgaatatg aagatataaa gccagttatt ataactattg
     4621 aacaggctat cgagcataaa tcctacttcc ctgattatcc ccgttttacg gaaataggtg
     4681 atgtagagaa ggctttcagt gaggccgatc atgtctatga aggttcttgt cgcatgggtg
     4741 gccaggaaca tttctatttg gaaacccatg cctctttagc cgtgccccga gattcagatg
     4801 aaattgaaat cttctgttca acacaacatc cctctgaggt acaaaaacta gtggcccatg
     4861 tattgtcaac ttcagcacat cgagtagtgt gtcgtgccaa acgtttggga ggaggttttg
     4921 gtggcaaaga gtctcgtgct atagctgttg ccttgcccgt ggctttggcc tgccatcgtc
     4981 ttcgtagacc aatacgttgt atgctcgatc gtgatgaaga tatgatgatc accggtactc
     5041 gtcatccctt tttatttaaa tacaaaattg cttttacctc tgaaggtcgt ttaaccggtt
     5101 gctatattga gtgctacaat aatgccggtt ggtctatgga tctttccttc tccgtattgg
     5161 aaagagccat gtttcatttt gaaaattgct ataaaatacc caacattaag gtgggtggtt
     5221 gggtttgtaa aacaaactta ccatcgaaca cggcattccg tggctttggt ggtcctcagg
     5281 gcatgtttgc gggagagcat ataatacgtg atgtggcccg catattgggc aaggactatt
     5341 tggaaattat gaaacaaaat ttctataaag aaggcgatat aacgcactat caacaaaagt
     5401 tggataactt tcccattgag aaatgtttct acgattgcct gcaacagtcg aactattacc
     5461 agaaaaggaa agaaattgaa gagtttaatc gtaaccaccg ttggcgcaag cgtggcatat
     5521 cattagttcc cacaaaatat ggcatagcct ttggtgtttc acatctcaat caagctggtg
     5581 ctttaataaa tatttatgct gatggctcgg tgctgctttc gcatggtggt gtggaaattg
     5641 gccaaggttt acataccaaa atgatccagt gttgtgctcg tgctttacaa atacccatag
     5701 aattcattca catttccgag actgccactg ataaagtgcc aaatacatca cctacggctg
     5761 ctagttcggg ttccgatcta aacggcatgg ccgttctgga tgcttgtgaa aagttaaata
     5821 aacgtctggc tccgattaag gaggccaatc cgaatggctc ttggacagaa tggattaata
     5881 aagcctattt cgaacgtgtc agcctgtcgg ctaccggctt ctatagaatg cctgatatcg
     5941 gctatgatcc tgttcaaaat ccaaacgctc taatgtacaa ctattttacc aatggtgttg
     6001 gttcgtcgat agttgaaatc gattgtctaa ctggagatca tcaggtccta agcacagata
     6061 ttgttatgga tattggttcg agtttgaatc cggccattga tattggtcaa attgagggag
     6121 ccttcatgca aggctacggc ctattcacat tggaagaaat gatttactcg cctcagggtg
     6181 tactctattc tcgcggcccc ggcatgtaca aattacctgg ttttgctgat ataccgggag
     6241 aatttaatgt gacaattttg acaggagctg ctaatccccg ggcggtttac tcctccaagg
     6301 taagagataa tttgtgtttt aattgaagtt gtatgtttta ttatattgtg tttatgttta
     6361 tttcaggctg ttggcgaacc tcctttgttt attggctgtt ctgtgttctt tgccattaaa
     6421 gaagccatta cgtcggctcg actaatgaat ggcttgagcg aggactttaa attagaatca
     6481 ccagctactt cggctcgtat acgtatggct tgtcaggatg agtttacgaa tttggtaaga
     6541 aaatattaac aatttgaaat ggctcttatt aaatctttat atattgtttt aatgaagatc
     6601 tagttagttt tctataggaa aatttcaatt gattcggcat tatattaaaa acgttggata
     6661 caactagttt tctttaggaa atttttggat tcaacttgtt tttattagaa aattatttga
     6721 ttcgattaga ttttagactt cttttatatg aaaatttgca ttcgactagt tttttacagc
     6781 aagtattctt ttttataccc ttcaccttcg ggagcagggg atatataagt ttgtcattcc
     6841 gtttgtaatt tccacaatat aattttccga ccctataaag tcgattaagc catgtccgtc
     6901 tgtctatctg tcggcttgcc tgtctgttga aattttctga atacatttga ttaattaatt
     6961 cgacatgttt tctgtggcaa acttgagatt cgactaggtt tatttcaaaa acttttttat
     7021 tcaattagtt ttcattataa aatttaccat ccaattgttt ttattagaaa atttccgttt
     7081 aattataagt tttcattaaa aactattcga ttcgagtagt tgtctgaaga aactgtttgt
     7141 tttcctaaag aaaaccttcg attagactag ttttgattaa aaaacctttc gattcgaaca
     7201 tttttaatgc caaatttttt ttatttttac agattgagca gccccctgct ggaagttatg
     7261 tgccatggaa tattgtacct taaaattaat atttttttta atttattagc aaaaatttag
     7321 ctagaaataa acgcaataga catttttttt gtatacaaat aaactgtgaa attaaatgaa
     7381 atatttagaa aattaaatat aatactcgta tttgcattta ttacaatgaa atataaaact
     7441 gattaaatgg aaaatgaaat gatgtgcaat taaatgttaa tggcagagtc tgtattaaca
     7501 gaattgacct caattaaatg atcattactt tgttcattta tcacattggc attatcatct
     7561 ccggacaaag tttcatacgg ttgattgtgg ctgaaaagga aatggaaaat tttaacagta
     7621 tttcttcaca ccttaaagta tgctttaaaa tgtagataat aaataatttt attaaaagca
     7681 ctgggagaaa acgtttaact gttaactact atctacttta ccttggtttt ttttcaattt
     7741 aaacaatttt tatttgtttt ttttttaatt caacacatca actactttta tttgtttaat
     7801 ttataaaatt ttttcataca aaatttgagc atgcatgcga tacttgatga cttttgtcaa
     7861 cttttggcgt tgaattgtat ttaacgatat ttgagttttg aacattcgaa cagctacaaa
     7921 cactttttct aaatagtttg ctaaatttag ggcaatgaca ggtaaatgaa ggacgtcttc
     7981 taacgctaaa gtaagtcaaa tttcaagtaa ttaaaatatt aatacaaaaa aaaaatttga
     8041 gtgtggttta aaagtaagcg ct
//
LOCUS       CAVXDHC      3346 bp ds-DNA             INV       27-JUL-1990
DEFINITION  C.vicina xanthine dehydrogenase (Xdh) gene allele 2, exon 1.
ACCESSION   M30488
KEYWORDS    xanthine dehydrogenase.
SOURCE      C.vicina third instar larvae fat body DNA (library of Schenkel et
            al.), clone Lcv2.
  ORGANISM  Calliphora vicina
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Muscoidea; Calliphoridae.
REFERENCE   1  (bases 1 to 3346)
  AUTHORS   Houde,M., Theron,M.-C. and Bregegere,F.
  TITLE     Divergence of the nucleotide sequences encoding xanthine
            dehydrogenase in Calliphora vicina and Drosophila melanogaster
  JOURNAL   Gene 85, 391-402 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Bregegere, 01-DEC-1989.
FEATURES       from  to/span     description
    pept       1529  /  1609     xanthine dehydrogenase, exon 1 (AA at 1529)
                                 (EC 1.1.1.204)
    pre-msg    1305  >  3346     xanthine dehydrogenase mRNA and introns
    IVS     <  1610     3346     xanthine dehydrogenase intron A
BASE COUNT     1213 a    444 c    419 g   1270 t
ORIGIN      
        1 aactgttcga ataatttgaa attgccgatt ttcaaataac aaataaaccg attaatttgt
       61 gtcaattaac cgattaaccg aaattccttt tttttgcgct ttaaaatttt tttttgtatt
      121 tatttttccg tttcaattca aatacaaatt aaaatttgat attttttgaa catccaataa
      181 acatgattaa tgagtatgta taacaactga catgttttga aactttgttt gtatttacat
      241 gtatagacta aacttttttg aattgagtct tttatcataa ctaaatgaat ctattaaatt
      301 aatccaaatc taaaacaatc caaaaaagaa tattctttat catgtttttg atttctccaa
      361 catataaaat cagtgagaga gttttttcca agaaaagttt tattaaatct gaagaaaaaa
      421 atcgtttaaa ttatattctt tttataaaag attatttcag aagatctctt gctgcaacaa
      481 cgtaataatt aaaaaaaagt cgtttccaaa aaatgtcttt gaatatttta tgacatttta
      541 ctatttctta tataaatttt caacaaatca tgcgatttca gaaatataaa tagtagatgt
      601 taatatcttt ctaatctgta tttaacccaa atttttactt atgaaatata cgagaaaaca
      661 tgaattttaa ttttgatgtt tgcaataata aaaaacactc tcacacaaaa aataattgac
      721 ttttttgaaa actatatgaa agattataga tcagcatatt ttgtattact cagttcataa
      781 aacacggatt ttgagttatg acgttgttgc agcaaataca aaaagattac aaaactgatc
      841 tcaaatacct tatttgctgt tttttatgtt tttgtacaca aaattcgttg ttgtattttc
      901 aatttaaaat gaaaatagaa attattcggt taatcgaata attttaaatt aaccgattat
      961 taaccgaata aatctaaacc tcgattaatt atttgctcga ttaaccgatt aaaccagaga
     1021 cccgattaat tgaataccct acccccatct ttttatgcaa tttttgccct gtaaatgtca
     1081 atgcgacaac actaccacta ctgtatattt cttattgtag cgccctcatg agtattaaga
     1141 catttactct cgccctcgat tttatttgta ttattctctg gttatcatca cacttgttga
     1201 aaatttatac acaaatatct tagcgactct ctaccgtagc cagccggttg tacaattaat
     1261 tattaatacc agtataaata aaatttttat attcaaataa ataaataatt tgtattcgac
     1321 tctcagttct ccagtttgca acgtgtgatc attgtgtcga gacgtatcgt ttaaaagtgc
     1381 ttataatata cgtacaatat ttcatattgc gtttattatc gattgtttga ttaaaaaaat
     1441 tgaattgaaa cgaaaataaa gacatcattg caattagtta aataaaaatt gataaaacaa
     1501 gtggaattca gacagctaaa cgtttaatat gacgcaagaa cataatgcag cagtcctaga
     1561 tttaaatcca acattttcta cattgatatt ttttgttaat ggcaaaaagg taattaatgt
     1621 aatagtattt aaaaataatt tatgtatgaa aacagttagt ataatatttg gttaaaacat
     1681 aactatacat tgtgattgag ggggggttta gggaacaaat tttattaaca cacatattta
     1741 aatacacgaa aattttataa ataaaagata tttgttgatt atttattgag tcaattccct
     1801 aatatttgtt gtaatcgtac tcgcattgtt attgtctgtc atacataaaa acttctgtct
     1861 gtctgtctca acttctcatc aaacgagtaa tcatcatgaa cacttatacc tggtttgacg
     1921 gcataaataa taaaatagag ttgcacagtg aggtcgatcg taaaacatgc taataaatct
     1981 gaatcttctg acctatttcg attacaaaat gatgtataaa aaactaactt tgaaaaaata
     2041 aattttagta taatttgtat gcaattaaaa tctgctcccc tgagcttgtt ttaggacggg
     2101 tgattaatgt ttaaatgtca gattaatatt ctaattaata tgtttattat aatttgtaaa
     2161 aaaaatttaa gaactagatg tgacacttag aatttttttt tttttaaatc actattttct
     2221 tcctttgcct gaagtagggt acatatcgat tttaaaatct tatcgaaaat gaaaaaatat
     2281 tacaattaag ctacattttt gtcaaaatgc atccattttt atatccttca ccatgagtgg
     2341 cataagtctg tcattccgat tgtaaaatct acatttttca ttttcgaccc cacaaagaaa
     2401 atatattcta gatcgttata catcaatata tcgggaattc ttccggctcg gaaactattt
     2461 aaaaccgaaa aaatcggctg agatataagg aataaaccgg gacaacctcg actcttggcc
     2521 tatttttgat ttatattttt ttgtcataaa ttaaaaaaag aattaaaaat aaaatttgga
     2581 aaaaacttct ttaaacaaaa ttaaaaaaca agttagagag atacatatat tcggccgtgc
     2641 atcttatata cccttcacca aattatactt caaaataaaa attttaaata gttttaggta
     2701 aacaaaattt attttttttc caaagtagtt tttttttaat tttttggaaa aaaaaatttt
     2761 cgaattgttt tttaaaattt taattttttt tttgtttttt taatatttag cgaaaaaaaa
     2821 tttttttgtg aaaaaaaaat taacaaaaaa ctatttacat gtcattataa attttttgaa
     2881 ataagatttt gacattctcg taaaatcagc ctaaggtgtt ccaacatttt gattaaataa
     2941 tatctagttg atttttttgt tgttgttgct tttgtctatc attcaggttc agttgtaaac
     3001 attaacgtgt tttgtttgcc ttatcacttg tttgcatttt tagaaaatgt gttttaatta
     3061 aattagttta gaaaagtttt ttaaatggaa attattttaa gagcattgtc tggacattta
     3121 tagatatata ttattgttaa ttgtattagt taaattttga ataaattgtg gaaaaatatc
     3181 agttattttt tggttaaatt aaggaattag tacataaaaa cgaaagatag ctgagaattt
     3241 ccatattagc gaatgcttct tggcttccga gtaaaaactc gcccgttagc gaattccatc
     3301 atagcgttgt caatgattct tacagaaaca tcaagtatta aatgtt
//
LOCUS       RATCGM1A     1104 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Rat carcinoembryonic antigen (rnCGM1) gene, partial cds.
ACCESSION   M22226 J04626
KEYWORDS    carcinoembryonic antigen.
SOURCE      Rat (strain Wistar) adult male DNA, clone lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1104)
  AUTHORS   Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
            Zimmermann,W.
  TITLE     Identification of a carcinoembryonic antigen gene family in the
            rat: Analysis of the N-termianl domains reveals
            immunoglobulin-like, hypervariable regions
  JOURNAL   J. Biol. Chem. 264, 6906-6912 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            W.Zimmermann 12-JAN-1989.
FEATURES       from  to/span     description
    pept    /   723  /  1082     carcinoembryonic antigen rnCGM1, exon x (AA at
                                 725)
    pre-msg <     1  >  1104     carcinoembryonic antigen rnCGM1, mRNA and
                                 introns
    IVS     <     1      722     carcinoembryonic antigen rnCGM1, intron y
    IVS        1083  >  1104     carcinoembryonic antigen rnCGM1, intron y+1
BASE COUNT      314 a    262 c    232 g    296 t
ORIGIN      Unreported.
        1 gaattcactc ctcagctctc acagcataga tggacataca gactcctgaa ggctcttctc
       61 ttccctccac actggtgtgt gtcacgtacc tgtagtgtgc acactgggac atgtaccttc
      121 ccaaaccctc acgaacaata cagaaatatt aaattacact tgaatataat tatttttatg
      181 tgctataaac atggaaatta tgtagacaaa cccagagata tcttttcttc cttccttcct
      241 tccttcttcc ttccttcctt ccttcctctt tttccatact agtttctgag attttttgag
      301 gaactgaacc ttccaaaaag accataccaa tccctgtcct caaaaagcct tttttattct
      361 aatggactgg aaatcattgt atccagagga gaaagtcaat gatttagtgg aaccataaat
      421 agaacagaaa acattcagga agtgaggatt gtatggagga ggaaaaagag gaggaggagg
      481 aggaagagga ggaggaggag gaggaggacc gagagccggt tctccactca ccagacactt
      541 tatggaaaga gtgatatggg gacacctgag tagaggattc cacagagagg aaatgacacc
      601 ctttgaggtt ctgagggcat ggaggtcatg ctgctcacct ccattaaggg tgcatcctac
      661 ctacaggctg agggatgctc acacctgctc aggattgtca acttttctct cttcccttct
      721 agcctccctc ttaacctgct ggctcctgcc caccactgcc caagtctcca ttgaatcctt
      781 accaccccag gtggttgaag gagaaaatgt tcttctacgt gttgacaatt tgccagagaa
      841 tctcatagcc tttgtctggt acaaagggct gacaaacatg agcctcggag ttgcactgta
      901 ttcactaacc tataacgtaa ctgtgacggg acctgtgcac agtggtagag agacattgta
      961 cagcaatggg tccctgtgga tccaaaatgt cacccagaag gacacaggat tctacaccct
     1021 acgaaccata agtaatcatg gagaaattgt atcaaataca tccctgcacc ttcatgtgta
     1081 ctgtaagtaa ttctttgtga attc
//
LOCUS       RATCGM1B      654 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Rat carcinoembryonic antigen rnCGM2 gene, partial cds.
ACCESSION   M22227 J04626
KEYWORDS    carcinoembryonic antigen.
SOURCE      Rat (strain Wistar) adult male DNA, clone lambda-rnCGM2-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 654)
  AUTHORS   Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
            Zimmermann,W.
  TITLE     Identification of a carcinoembryonic antigen gene family in the
            rat: Analysis of the N-termianl domains reveals
            immunoglobulin-like, hypervariable regions
  JOURNAL   J. Biol. Chem. 264, 6906-6912 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            W.Zimmermann 12-JAN-1989.
FEATURES       from  to/span     description
    pept    /   249  /   608     carcinoembryonic antigen rnCGM2, exon x (AA at
                                 251)
    pre-msg <     1  >   654     carcinoembryonic antigen rnCGM2, mRNA and
                                 introns
    IVS     <     1      248     carcinoembryonic antigen rnCGM2, intron y
    IVS         609  >   654     carcinoembryonic antigen rnCGM2, intron y+1
BASE COUNT      168 a    169 c    171 g    146 t
ORIGIN      Unreported.
        1 aatgtaattc ttgttggaga gtgagtgggg agccatgcag acacgggagg agagagaccc
       61 gtacaaaagg tcactccagc ttcgggggac tgggaacata gatgatgaag tttccctgca
      121 ccaatgagag cgacgccctc accccacacc tcggcagaag atgaacacac ctacctgttc
      181 cggacttggg cctcctctca gcgatcacta agcttctgac actgatggag ttttttcctt
      241 ctccctagct tccttcttaa cctgctggaa tgcacccgcc gctgccgagc tcactattga
      301 attagtgcca cccatggttg ctgaaggcgg aaactccgtt ttgtttgtgc atgaaatgcc
      361 attgaatgtc caggcgtttt actggtacaa acagagagat ccgacgaaga gctatgaagt
      421 cgcgcggtac ttaacaccca ccaacgaaag ttcgaagatg cctcagcaca gcggccggaa
      481 aaccgtattc tacagtggat ccctgctgat cagaaacgtc acccaggccg acagtggagt
      541 ctacacctta ctaacattta acacagaaat gcaaagcgaa ttaacacatg tgcatctgga
      601 agtacgcggt aggtggttgc gggatctctg ggtgctaggg gtcggggtga gctc
//
LOCUS       RATCGM1C     1185 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Rat carcinoembryonic antigen rnCGM3 gene, partial cds.
ACCESSION   M22228 J04626
KEYWORDS    carcinoembryonic antigen.
SOURCE      Rat (strain Wistar) adult male DNA, clone lambda-rnCGM3-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1185)
  AUTHORS   Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
            Zimmermann,W.
  TITLE     Identification of a carcinoembryonic antigen gene family in the
            rat: Analysis of the N-terminal domains reveals
            immunoglobulin-like, hypervariable regions
  JOURNAL   J. Biol. Chem. 264, 6906-6912 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            W.Zimmermann 12-JAN-1989.
FEATURES       from  to/span     description
    pept    /   729  /  1088     carcinoembryonic antigen rnCGM3, exon x (AA at
                                 731)
    pre-msg <     1  >  1185     carcinoembryonic antigan rnCGM3, mRNA and
                                 introns
    IVS     <     1      728     carcinoembryonic antigen rnCGM3, intron y
    IVS        1089  >  1185     carcinoembryonic antigen rnCGM3, intron y+1
BASE COUNT      312 a    279 c    277 g    317 t
ORIGIN      Unreported.
        1 ctgcagtgtg cacagcaaga cattgtgctt tcccgaaccc acacgaacac actgaattat
       61 taaatcacac ttgaatatat tgatttccct ttgctctgag cctgggcact atgtagataa
      121 gtccatggaa atattaatct ttccttcctt ccttccttcc ttccttcctt ccttccttcc
      181 ttccttcctt tcttccttct tctagttctt ttcacgtttt cccttttctt tttctctcca
      241 atttgtttct aatctatttt caggaactga accttccaaa aagatgattc cagtccctgt
      301 cctcacaaag cccttttctt gtggactgga agtcagagta tccagagaaa ggcaatggtt
      361 taatggaacc tcaaacagaa cagaaaacaa ttctgagagt gagcattgca tgaggaagag
      421 gaggaacggg aagaggagga ggaagaggag gaggaagagg aggaggaaga ggaatgggaa
      481 gaggaggagg aagaggagga agaggaggaa gaggtcagac agctgcttca cctctcacca
      541 gacactctat gggaagaatg atatggggac acctgagtag aggattcctg gagaggaaat
      601 gacagctttt gagtctttga gggcatggag gtcatgctgc tcacctccat taagggtgca
      661 tcctacctac aggctgaggg atgctcacac ctgctcagga tcggtgactt ttttctcttc
      721 ccttctagcc tcacttttaa cctgctggct cctgcccacc actgcccacg tcaccctcaa
      781 gtcctcaccg ccccaggtgg ttgaaggaga aaacgttctt ctaagtgctg acaatctgcc
      841 agagaacatt atagctttcg cctggtacaa aggggagacc gacatgaacc gtggaattgc
      901 actgtattca ctgaggtata ctgtaagttt gacggggcct gtgcacagtg gtcgagagac
      961 attgtacagc gacgggtccc tgtggatcaa aaatgtcacc caggaggaca caggatttta
     1021 tacctttcga atcataaata atcatggaaa aattcaatca aatacaaccc tgttccttca
     1081 cgtgaaatgt aagtaactct ttgtgaactg tgggttttgg gtggtgtcct tccactagac
     1141 acatagaagt atcaggccag ggctgtgtct cccttccccc tgcag
//
LOCUS       RATCGM1D      748 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Rat carcinoembryonic antigen rnCGM4 gene, partial cds.
ACCESSION   M22229
KEYWORDS    carcinoembryonic antigen.
SOURCE      Rat (strain Wistar) adult male DNA, clone lambda-rnCGM4-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 748)
  AUTHORS   Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
            Zimmermann,W.
  TITLE     Identification of a carcinoembryonic antigen gene family in the
            rat: Analysis of the N-terminal domains reveals
            immunoglobulin-like, hypervariable regions
  JOURNAL   J. Biol. Chem. 264, 6906-6912 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            W.Zimmermann 12-JAN-1989.
FEATURES       from  to/span     description
    pept    /    90  /   434     carcinoembryonic antigen rnCGM4, exon x (AA at
                                 92)
    pre-msg <     1  >   748     carcinoembryonic antigen rnCGM4, mRNA and
                                 introns
    IVS     <     1       89     carcinoembryonic antigen rnCGM4, intron y
    IVS         435  >   748     carcinoembryonic antigen rnCGM4, intron y+1
BASE COUNT      197 a    158 c    183 g    210 t
ORIGIN      Unreported.
        1 gagctctggg aaggcagaag tgtgattttt taaaaaacca acagatttca cctgctcaat
       61 atcgatggtt gctctgtctt cccttttagc ctcccttcta acctgttggc tcctgactac
      121 tgcccaggtc aacattgaat cggtgccatt caatgtggtt gaaggggaaa acgtccttct
      181 tcttgtccac aatctgccag agaatctcat agcctttgcc tggtatagag ggctgaggaa
      241 aattggagta tacatactga acactgaagt aagtgtgacg gggccaatgt acagcggtag
      301 agagacagtg tacagcaatg gttccctgtg tatccgcaat gtcacccaga aggacacagg
      361 attctacact ctacgaacag tcaacacacg tggagaaact gtatcaacaa catccttgta
      421 cctctatgtg tacagtaagt gatactttgt gaactctggg tgttgtgtgg ggttcattcc
      481 gtagacacac acagaagagg caggcctacc taccctttgc attgtgtctc cttattgagg
      541 tgtgaacatt taactcaggc taaggagagt aatgccaatt gaatagaatc cttcttttga
      601 ctttaccttg tagtcagctg gatgtgtggt taactcagtg aaggacatca gcccttgtct
      661 agacttctgg ggttcttagc agtaatgtgt ccttgggaaa gaccttgagg gaaggagatt
      721 gggtttgaat gagatagcca taggatcc
//
LOCUS       RATCGM1E      750 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Rat carcinoembryonic antigen rnCGM5 gene, partial cds.
ACCESSION   M22230
KEYWORDS    carcinoembryonic antigen.
SOURCE      Rat (strain Wistar) adult male DNA, clone lambda-rnCGM5-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 750)
  AUTHORS   Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and
            Zimmermann,W.
  TITLE     Identification of a carcinoembryonic antigen gene family in the
            rat: Analysis of th N-terminal domains reveals immunoglobulin-like,
            hypervariable regions
  JOURNAL   J. Biol. Chem. 264, 6906-6912 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            W.Zimmermann 12-JAN-1989.
FEATURES       from  to/span     description
    pept    /   118  /   477     carcinoembryonic antigen rnCGM5, exon x (AA at
                                 120)
    pre-msg <     1  >   750     carcnioembryonic antigen rnCGM5, mRNA and
                                 introns
    IVS     <     1      117     carcinoembryonic antigen rnCGM5, intron y
    IVS         478  >   750     carcinoembryonic antigen rnCGM5, intron y+1
BASE COUNT      180 a    207 c    178 g    185 t
ORIGIN      Unreported.
        1 agcgtaggca ggagactcca cacctcagct gaccactgga cacagctgct cggactcagg
       61 caccatctta gccaaatact aaagtcctga tgttgacgga tctctcttcc cttctagcct
      121 ctcttttcat ctgtgggcgt ccttttaacc ctgccaagct cactattgaa tcagtgccgc
      181 ccagtgttgc tgaaggggga agcgttcttc tcctcgttca caatctccag gacgagcttc
      241 gagggttttt ctggtacaaa ggggcgtcta tgtctagcaa ccatgagata gcccgataca
      301 gaacagcaaa gaattcaagt gtgccaggcc ctgcccacag tggtagagag acggtgtaca
      361 gcaatggatc cctcctgctc cagaatgtca cccggaatga cactgggttc tacaccctac
      421 gcactctgaa aagacatcag aaaatggaat tggcacacgt gcaacttcag gtggacagta
      481 agtgattttc cgtgatcgtt cagtgctggg tgggtctttg acacacagga ctgtcacccc
      541 tggcatgtgg ctacctcctc tctgcctttt tatccccatg ttgtggttaa ccactatgtg
      601 caggacacat gtgatggaaa gaaatgccca tgggtcagac ttatcatctg actctcccct
      661 gtatcaagga cagtaactca accctaggtg ctagactctg cccagtcatc tggggcatct
      721 tgccatgcaa cgtgaggaaa ccatggatcc
//
LOCUS       RATGSTAA     1116 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Rat glutathione S-transferase mRNA, complete cds.
ACCESSION   M28241
KEYWORDS    glutathione S-transferase.
SOURCE      Rat liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1116)
  AUTHORS   Hsieh,J.-C., Liu,L.-F., Chen,W.-L. and Tam,M.F.
  TITLE     Expression of Yb1 glutathione S-transferase using a Baculovirus
            expression system
  JOURNAL   Biochem. Biophys. Res. Commun. 162, 1147-1154 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by J.-C.Hsieh, 21-SEP-1989.
FEATURES       from  to/span     description
    pept        103      753     glutathione S-transferase (EC 2.5.1.18)
BASE COUNT      268 a    334 c    246 g    268 t
ORIGIN      
        1 tttgtccggc ccacgtttct ctggtaatct gtataaatcg caacacacag gtcaattctg
       61 ctgaagccaa attgagaaga ccacagcgcc agaaccatgc ctatgatact gggatactgg
      121 aacgtccgcg ggctgacaca cccgatccgc ctgctcctgg aatacacaga ctcaagctat
      181 gaggagaaga gatacgccat gggcgacgct cccgactatg acagaagcca gtggctgaat
      241 gagaagttca aactgggcct ggacttcccc aatctgccct acttaattga tggatcgcgc
      301 aagattaccc agagcaatgc cataatgcgc taccttgccc gcaagcacca cctgtgtgga
      361 gagacagagg aggagcggat tcgtgcagac attgtggaga accaggtcat ggacaaccgc
      421 atgcagctca tcatgctttg ttacaacccc gactttgaga agcagaagcc agagttcttg
      481 aagaccatcc ctgagaagat gaagctctac tctgagttcc tgggcaagcg accatggttt
      541 gcaggggaca aggtcaccta tgtggatttc cttgcttatg acattcttga ccagtaccac
      601 atttttgagc ccaagtgcct ggacgccttc ccaaacctga aggacttcct ggcccgcttc
      661 gagggcctga agaagatctc tgcctacatg aagagcagcc gctacctctc aacacctata
      721 ttttcgaagt tggcccaatg gagtaacaag taggcccttg ctacactggc actcacagag
      781 aggacctgtc cacattggat cctgcaggca ccctggcctt ctgcactgtg gttctctctc
      841 cttcctgctc ccttctccag ctttgctagc cccatctcct caacctcacc ccagtcatgc
      901 ccacatagtc ttcattctcc ccactttctt tcatagtggt ccccttcttt attgacacct
      961 taacacaacc tcacagtcct tttctgtgat ttgaggtctg ccctgaactc agtctcccta
     1021 gacttacccc aaatgtaaca ctgtctcagt gccagcctgt tcctggtggg ggagctgccc
     1081 caggcctgtc tcatctttaa taaagcctga aacacc
//
LOCUS       MUSTCGXI      354 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse T-cell receptor gamma-chain mRNA V4-J1 region, 3' end.
ACCESSION   M30171
KEYWORDS    T-cell receptor beta-chain; joining exon; variable region.
SOURCE      Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Heilig,J.S. and Tonegawa,S.
  TITLE     Diversity of murine gamma genes and expression in fetal and adult T
            lymphocytes
  JOURNAL   Nature 322, 836-840 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      342     T-cell receptor beta-chain V4-J1 region (AA at
                                 1)
    recomb      293      294     V-region DNA end/J-region DNA start
BASE COUNT      118 a     72 c     69 g     95 t
ORIGIN      
        1 aagttggagc aacctgaaat atcaatttcc agaccaagag atgagactgc acaaatatcc
       61 tgtaaagttt tcatcgaaag ctttaggagt gtaaccatac actggtaccg gcagaaacca
      121 aaccaaggtt tagagtttct attatatgtc cttgcaaccc ctacccatgt tttcttagat
      181 aaggagtaca agaaaatgga ggcaagtaaa aatcctagtg cttctacatc gatattgaca
      241 atatattcct tggaggaaga agacgaagct atctactact gttcctacgg ccaagtagct
      301 caggttttca caaggtattt gcagaaggaa ctaagctcat agtaattccc tctg
//
LOCUS       MUSTCGXJ      353 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse T-cell receptor beta-chain mRNA V2-J2 region, 3' end.
ACCESSION   M30172
KEYWORDS    T-cell receptor beta-chain; joining exon; variable region.
SOURCE      Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT12.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 353)
  AUTHORS   Heilig,J.S. and Tonegawa,S.
  TITLE     Diversity of murine gamma genes and expression in fetal and adult T
            lymphocytes
  JOURNAL   Nature 322, 836-840 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      297     T-cell receptor beta-chain V2-J2 region (AA at
                                 1)
    recomb      294      295     V-region DNA end/J-region DNA start
BASE COUNT      131 a     71 c     65 g     86 t
ORIGIN      
        1 cagctggagc aaactgaatt atcggtcacc agagagacag atgagaatgt gcaaatatcc
       61 tgtatagttt atcttccata tttctccaac acagctatac attggtaccg gcaaaaaaca
      121 aatcaacagt ttgagtatct aatatatgtc gcaacaaact acaatcaacg acccttagga
      181 gggaagcaca aaaaaattga agcaagtaaa gattttaaaa gttctacctc aaccttggaa
      241 ataaattact tgaagaaaga agatgaagcc acctactact gtgcagtctg gatatagctc
      301 gggctttcac aaggtatttg cagaaggaac aaagctcata gtaattccct ccg
//
LOCUS       MUSTCGXK       76 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse T-cell receptor beta-chain mRNA V5-J1 region, 3' end.
ACCESSION   M30173
KEYWORDS    T-cell receptor beta-chain; joining exon; variable region.
SOURCE      Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT6.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Heilig,J.S. and Tonegawa,S.
  TITLE     Diversity of murine gamma genes and expression in fetal and adult T
            lymphocytes
  JOURNAL   Nature 322, 836-840 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       21     T-cell receptor beta-chain V5-J1 region (AA at
                                 1)
    recomb       18       19     V-region DNA end/J-region DNA start
BASE COUNT       19 a     18 c     16 g     23 t
ORIGIN      
        1 acgtactact gtgcctgcta gctcaggttt tcacaaggta tttgcagaag gaactagctc
       61 atagtaattc cctctg
//
LOCUS       MUSTCGXL      926 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse T-cell receptor beta-chain mRNA C1 region, 3' end.
ACCESSION   M30174
KEYWORDS    T-cell receptor beta-chain; constant region.
SOURCE      Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 926)
  AUTHORS   Heilig,J.S. and Tonegawa,S.
  TITLE     Diversity of murine gamma genes and expression in fetal and adult T
            lymphocytes
  JOURNAL   Nature 322, 836-840 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      518     T-cell receptor beta-chain C1 region (AA at 3)
BASE COUNT      267 a    231 c    169 g    259 t
ORIGIN      
        1 acaaaagact tgatgcagac atttccccca agcccactat tttccttcct tctgttgctg
       61 aaacaaatct ccataagact gggacatacc tttgtctcct tgaaaagttc tttccagatg
      121 ttataagggt gtattggaaa gaaaaggatg gcaatactat cctggactcc caggaagggg
      181 atacgctgaa gactaacgac acatacatga agtttagctg gcttacagtg cccgaaaggg
      241 caatggggaa agagcacaga tgtattgtca aacatgagaa caacaaagga ggagcagatc
      301 aagagatttt ctttccttca ataaagaaag ttgctgtgag taccaagcct acaacttgct
      361 ggcaagataa aaatgatgtg ctgcagctcc agttcacgat cacctctgcc tactacacct
      421 accttctcct gctcctcaag agtgtgatct acttggccat catcagcttc tctctgctta
      481 gaagaacatc tgtctgtggc aatgagaaga aatcctaaag aaacgagtgg tggtacagca
      541 agtcagctgg atttcatcct cactgccata aaggtgcctt aacgaggtcc agctgccttc
      601 acttgttggc tttcacttct ataaagtccc tcactcatgt tgcataaaca ttttctgaac
      661 gtttgtgtgc aatttctgca acttctctct aactcacatc acctcattcc tgattccatc
      721 cactgcagaa gtcccctccc ccagaagtct gaagcatcaa aattctagta cccatagcct
      781 acagctttta cccatagccc ttgttgctct tgaaccagct aactccatga agacccccac
      841 tctagacaac ctgaagagcc ttctttactg tttgattctg tgccttgtat ttcacattat
      901 ggacaatata ttttttaaaa attaaa
//
LOCUS       MUSTCGXM      913 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse T-cell receptor beta-chain mRNA C2 region, 3' end.
ACCESSION   M30175
KEYWORDS    T-cell receptor beta-chain; constant region.
SOURCE      Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT12.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 913)
  AUTHORS   Heilig,J.S. and Tonegawa,S.
  TITLE     Diversity of murine gamma genes and expression in fetal and adult T
            lymphocytes
  JOURNAL   Nature 322, 836-840 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      503     T-cell receptor beta-chain C2 region (AA at 3)
BASE COUNT      263 a    223 c    168 g    259 t
ORIGIN      
        1 acaaaaggct tgatgcagac atttccccca agcccactat ttttcttcct tctgttgctg
       61 aaacaaatct ccataagact gggacatacc tttgtctcct tgaaaagttc tttcccgatg
      121 tcataagggt gtattggaaa gaaaagaatg gcaatactat cctggactcc caggaagggg
      181 atacgctgaa gactaagggc acatacatga agtttagctg gcttactgtg cccgaaaggg
      241 caatggggaa agagcacagt tgtattgtca aacatgagaa caacaaagga ggagcagatc
      301 aagagatttt cttcccttca ataaagaaag ttgctacaac ttgctggcaa gataaaaatg
      361 atgtgctgca gttccagttc acgagcacct ctgcctacta cacctacctc ctcctgctcc
      421 tcaagagtgt gatctacttg gccatcatca gcttctctct gcttagaaga acatctgtct
      481 gtggcaatga gaagaagtcc taaagaaacg agtggtggta cagcaagtca gctggatttc
      541 atcctcactg ccataaaggt gccttaacgg ggaaccagat gccttctctt gttggctttc
      601 acttctataa agtccctcac tcatgttgca taaacatttt ctgaactttt gtatgcaatt
      661 tcagcaactt tttttaaact gaactcacct tcttcctgat tccatccact ccagaagtcc
      721 cctcccccag aagcctgaaa cattaaaatt ctagtaccca tagcctacag cttttaccca
      781 tggcccttgt tgctcttgaa ccagctaact ccatgaagac ccccactcta gacaacctga
      841 agagccttct ttactgtttg attctgtgcc ttgtatttca gattattgac aataaatttt
      901 ttaaaaaatt aaa
//
LOCUS       DVUHYDAB     3180 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  D.fructosovorans periplasmic hydrogenase (hydA and hydB) gene,
            complete cds.
ACCESSION   M35333
KEYWORDS    periplasmic hydrogenase.
SOURCE      D.fructosovarans (strain DSM 3604) DNA, clone DHS-alpha-[pHH7].
  ORGANISM  Desulfovibrio fructosovorans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 3180)
  AUTHORS   Rousset,M., Dermoun,Z., Matchikian,C.E. and Belaich,J.P.
  TITLE     Cloning and sequencing of the locus encoding the large and small
            subunit genes of the periplasmic [Ni,Fe] hydrogenase Desulfovibrio
            fructosovorans
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Rousset, 15-JUN-1990.
FEATURES       from  to/span     description
    pept        301     1242     periplasmic hydrogenase-alpha subunit
                                 precursor (hydA) (EC 1.18.99,1)
    sigp        301      450     periplasmic hydrogenase-alpha subunit signal
                                 peptide
    matp        452     1239     periplasmic hydrogenase
    pept       1314     3008     periplasmic hydrogenase-beta subunit (hydB)
    mRNA        129     3060     periplasmic hydrogenase mRNA
    signal       97      102     -35 region
    signal      120      125     -10 region
    binding     289      293     ribosome binding site for hydA
    binding    1303     1307     ribosome binding site for hydB
    signal     3025     3060     termination signal
BASE COUNT      660 a   1150 c    866 g    504 t
ORIGIN      
        1 cgtgctgcat acgatttctt tattcatatc ctttacttcc ccggcctcat caccccggta
       61 acacccttga atcacgtgaa aaatttgcca aaccccattg acgttcaaac aaactcgtga
      121 ttaactgcaa aagggaaatc gaacccggcg acgcaggaac gcacgcgtcc gtcacataga
      181 ggcggcggcg gccggttcgc ctgcgacggc cgcgtgccga acgggtcaac ggatgctacg
      241 tggcctgggg atcatgggac gccccggacc atgcctctcg taaaacaaag gaggacgttt
      301 atgaactttt ccgtgggtct tggcaggatg aatgcggaaa aacggcttgt gcaaaacggc
      361 gtctcccgcc gcgacttcat gaaattttgc gccaccgtgg ccgcggccat gggcatgggc
      421 ccggcgttcg cgcccaaggt cgccgaagca ttgacggcca aacaccgtcc gtcggtggtc
      481 tggctgcaca acgccgagtg caccggctgc accgaagcgg cgatccggac gatcaaacct
      541 tatatagacg cgctcattct cgacaccatc tccctggatt accaggagac catcatggcc
      601 gcggccggcg agacgtccga ggcggccctg caccaggccc tcgaaggcaa ggacggctac
      661 tacctcgtgg tcgagggcgg cctgcccacc atcgacggcg gccagtgggg catggttgcc
      721 ggccatccca tgatcgagac caccaagaag gccgcggcca aggccaaggg catcatctgc
      781 atccggcacc tgcctcacgg cggcgtccag aaggccaaac ccaatcccag ccaggccaag
      841 ggcgtgtccg aagccctcgg cgtcaagacc atcaacatcc ccggctgccc gcccaacccc
      901 atcaacttcg tgggcgccgt ggtccatgtc ctgaccaagg gcatcccgga tctcgacgag
      961 aacggccgtc cgaagctctt ctacggcgag ctggtccacg acaactgtcc gcgcctgccc
     1021 cacttcgagg cctccgaatt cgcgccctcc ttcgattccg aagaggccaa gaaaggcttc
     1081 tgcctctacg aactcggctg caagggcccc gttacctaca acaactgccc caaggtgctg
     1141 ttcaaccagg tcaactggcc cgtccaggcc ggccacccct gcctcggctg cagcgagccg
     1201 gacttctggg acaccatgac gccgttctac gagcagggct aaccccctcc tttgtagcgg
     1261 ccagtacccg gacaccttca acagcaccga acgtctcgtg acggaggaag catatggctg
     1321 agagcaaacc cacgccacaa tccaccttca ccggccccat cgtggtcgac cccattaccc
     1381 ggatcgaagg tcacttgcgg atcatggtcg aggtggaaaa cggtaaggtc aaggacgcct
     1441 ggagctcctc gcagctcttc cgcggcctgg aaatcatcct caaaggccgc gatccccgcg
     1501 acgcccagca cttcacccag cgcgcctgcg gcgtgtgcac gtacgtccac gccctggcct
     1561 ccagccgctg cgtcgatgac gccgtcaagg tcagcatccc ggccaacgcc cgcatgatgc
     1621 gcaacctggt catggcctcc cagtatctcc atgaccacct cgtccacttc tatcacctgc
     1681 acgccctcga ctgggtcgac gtgaccgccg ccctcaaggc cgatcccaac aaggccgcca
     1741 aactggcagc ctccatcgac acggctcgga ccggcaactc ggaaaaggcc ctcaaggcag
     1801 tccaggacaa gctgaaagcc ttcgtcgagt ccggacagct cggcatcttc accaacgcct
     1861 acttcctcgg cggccacaaa gcctactacc tgccgcccga ggtcaacctc atcgccaccg
     1921 cccactacct ggaagccctg cacatgcagg tcaaggcggc cagcgccatg gccatcctcg
     1981 gcggcaagaa cccccacacc cagttcaccg tcgtgggcgg ctgctccaac taccagggcc
     2041 tgaccaagga cccgctggcc aactacctgg ccctgagcaa ggaagtctgc cagttcgtca
     2101 acgagtgcta catccctgac ctgctggccg tggccggctt ctacaaggac tggggcggca
     2161 tcggcggcac cagcaactac ctggccttcg gcgagttcgc caccgacgac agctcccccg
     2221 agaaacacct ggccacctcg cagtttccgt ccggcgtcat caccggccgc gacctcggca
     2281 aggtggataa cgtggacctc ggcgccatct acgaagacgt caagtactcc tggtacgccc
     2341 ccggcggcga cggcaagcac ccctacgact gcgtcaccga tcccaagtac accaagctcg
     2401 acgacaagga ccactactcc tggatgaagg ccccccgcta caagggcaag gccatggaag
     2461 tcggtccctt ggcccgcacc ttcatcgcct acgccaaggg gcagcccgac ttcaaaaagg
     2521 tcgtggacat ggtcctcggc aaactctccg tcccggccac ggccctgcat tcgaccctcg
     2581 gacgcaccgc cgcccgcggc atcgagaccg ccatcgtctg cgccaacatg gagaagtgga
     2641 tcaaggaaat ggccgacagc ggcgccaagg acaacaccct gtgcgccaag tgggagatgc
     2701 ccgaggagtc caagggcgtc ggcctggccg atgctccccg cggctccctg tcccactgga
     2761 tccgcatcaa gggcaagaag atcgacaact tccagctggt tgtcccctcg acctggaacc
     2821 tcggtccccg ggggccccag ggcgacaaga gcccggtgga agaggccctt atcggcacgc
     2881 ccatcgccga tcccaaacgc ccggtcgaaa tcctgcgcac ggtccacgcc ttcgacccct
     2941 gcatcgcctg cggcgtgcac gtcatcgagc ccgagaccaa cgagatcctc aagttcaagg
     3001 tttgctaaga gcggcaaccc tctaatacga aggacccgct tcgggccggg ttcttcgttt
     3061 ctcaaggagg tcgcatgtcc gataccccgc gcaaaatcct catcctcggc ctcggcaaca
     3121 tcctctacac cgacgagggc gtcggcgtgc gggccgtgga gcgccttctc gaaacccacg
//
LOCUS       MTFRECA      1349 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  M.flagellatum recA gene, complete cds.
ACCESSION   M35325
KEYWORDS    co-protease; recombinase.
SOURCE      M.flagellatum (strain MFK1) DNA.
  ORGANISM  Methylobacillus flagellatum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Methylococcaceae.
REFERENCE   1  (bases 1 to 1349)
  AUTHORS   Gomelsky,M., Bolstin,A. and Tsygankov,Y.D.
  TITLE     The nucleotide sequence and regulation of recA gene from
            Methylobacillus flagellatum
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.V.Gomelsky, 19-JUN-1990.
            
               Author address:M.V.Gomelsky
               Institute of Genetics and Section of Industrial Microorganisms
               1st Dorozhry pr-d,1
               Moscow 113545 U.S.S.R.
FEATURES       from  to/span     description
    pept        293     1327     RecA protein
    site          1      292     LexA repressor binding site
BASE COUNT      320 a    413 c    386 g    230 t
ORIGIN      
        1 aggagatggc cctgggcgcg ctggcctaca gcaccgccga cattagtgct tcctacacgg
       61 gcatcgctgg gccgatggcg gaagcgagca aaaaccggtc ggtaccgtat gctttggctg
      121 ggcccatcgc gatggccgga tcatcaccca aacccaccac ttcctgggcg agagacgccg
      181 tgcgcagaca gtctgtcaaa accgcattaa ccggcttgtt gcagcttact ttacccactg
      241 atttatgaaa taattgcagg ctttagcagg caactcgaaa aggtattcac tgatggatga
      301 aaacagaagc aaagcgctag ccgccgccct ctcgcaaatc gagaaacaat ttggcaaagg
      361 ctctatcatg cgcatgggcg acaccgatgt cgctgctgac atccaggcag tttctaccgg
      421 ctccctgggc ctggacatcg ccctgggtat tggcggcttg ccgcgcggcc gcatcgtcga
      481 gatttacggc ccggaatcct ccggcaagac cacgctgaca ttgtctgtga tcgcgcagat
      541 gcaaaagctg ggcggcacgg cagcattcat cgacgcagaa cacgcgctcg acccggtcta
      601 cgcgcagaag ctcggtgtca atgtttccga tctgctcatc tcccagcccg acaccggcga
      661 acaggcgctc gagatcgccg acatgctggt gcgttccggc tcggtagatg tcgtggtggt
      721 cgactccgtc gctgcgctga cgcccaaggc cgaaatcgaa ggcgaaatgg gcgactcgca
      781 catgggcctg caggcccgcc tgatgtcgca ggcactgcgc aagctcaccg ccaacatcaa
      841 gcgcaccaac accttggtca tcttcatcaa ccagatccgc atgaagatcg gcctgatgtt
      901 cggcaacccc gagaccacca ccggcggcaa tgcactcaag ttctacgcct ccgtgcgcct
      961 ggacatccgc cgcaccggcg cgatcaagaa gggggacgag gtcacgggat cagaaacccg
     1021 ggtcaaggta gtgaagaaca acgtggcccc tccgttcaag ctagcggaat tcgacatcct
     1081 ctatggcgaa ggtatctcgc gcgagggcga aatcatcgaa ctcggcgtca accttaaatt
     1141 gatcgagaag gccggtgcct ggtacagcta caagggcgag aagatcggcc agggcaagga
     1201 caacgcgcgc gagttcctca gggagcaccc ggaaatcgcc aatgagatcg acgccaagat
     1261 tcgcgagcat tccaatctgg ccaacgccgc gatgacgacg gcaccggatg aagaaagcga
     1321 cgaataagcc taccaccatg aatgcgaga
//
LOCUS       CHKIGLAO      329 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION   M34804
KEYWORDS    immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
            processed gene; variable region.
SOURCE      Chicken (inbred strain B19) 3 month old bursa DNA, clone 2.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 329)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    34  >   329     Ig lambda-chain V1-J region, exon N precursor
                                 (AA at 36)
    sigp    /    34       50     Ig lambda-chain V1-J region, exon N signal
                                 peptide (AA at 36)
    matp         51  >   329     Ig lambda-chain V1-J region, exon N
    IVS     <     1       33     Ig lambda-chain V1-J region intron X
BASE COUNT       65 a    107 c     82 g     75 t
ORIGIN      
        1 ccgttttctc ccctctttcc tcctcctctc caggttccct ggtgcaggca gcactgactc
       61 agccggcctc ggtgtcagca aacctgggag aaaccgtcga gatcacctgc tccgggggta
      121 gtggcagcta cggctggttc cagcagaagt ctcctggcag tgcccctgtc actctgatct
      181 atgctaacac caacagaccc tcagacatcc cttcacgatt ctccggttcc aaatccggct
      241 ccacgggcac attaaccatc actggggtcc aagccgagga cgaggctgtc tatttctgtg
      301 ggagcaggga cagcaactat gttggtata
//
LOCUS       CHKIGLAP      332 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION   M34805
KEYWORDS    immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
            processed gene; variable region.
SOURCE      Chicken (inbred strain B19) 3 month old bursa DNA, clone 36.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 332)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    34  >   332     Ig lambda-chain V1-J region, exon N precursor
                                 (AA at 36)
    sigp    /    34       50     Ig lambda-chain V1-J region, exon N signal
                                 peptide (AA at 36)
    matp         51  >   332     Ig lambda-chain V1-J region, exon N
    IVS     <     1       33     Ig lambda-chain V1-J region intron X
BASE COUNT       70 a    110 c     79 g     73 t
ORIGIN      
        1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc
       61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccgggagta
      121 gtggcagcta ctatggctgg taccagcaga aggcacctgg cagtgcccct gtcactgtga
      181 tctatgacaa caccaacaga ccctcggaca tcccttcacg attctccggt tccaaatccg
      241 gctccacggc cacattaacc atcactgggg tccaagccga cgacgaggct gtctatttct
      301 gtgggagtac agacagcagc tatattggta ta
//
LOCUS       CHKIGLAQ      335 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION   M34806
KEYWORDS    immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
            processed gene; variable region.
SOURCE      Chicken (inbred strain B19) 3 month old bursa DNA, clone 24.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 335)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    34  >   335     Ig lambda-chain V1-J region, exon N precursor
                                 (AA at 36)
    sigp    /    34       50     Ig lambda-chain V1-J region, exon N signal
                                 peptide (AA at 36)
    matp         51  >   335     Ig lambda-chain V1-J region, exon N
    IVS     <     1       33     Ig lambda-chain V1-J region intron X
BASE COUNT       71 a    110 c     77 g     77 t
ORIGIN      
        1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc
       61 agccggcctc ggtgtcagca aacctgggag aaaccgtcaa gatcacctgc tcagggagta
      121 gcaatactta ttactattat ggctggttcc agcagaagtc tcctggcagt gcccctgtca
      181 ctgtgatcta taacaacgac aagagaccct cggacatccc ttcacgattc tccggtgccc
      241 tatccggctc cacagccaca ttaaccatca ctggggtcca agccgacgac gaggctgtct
      301 atttctgtgg gagcaacgac agcagtactg ctgca
//
LOCUS       CHKIGLAR      330 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION   M34807
KEYWORDS    immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
            processed gene; variable region.
SOURCE      Chicken (inbred strain B19) 3 month old spleen DNA, clone 1.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 330)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    29  >   330     Ig lambda-chain V1-J region, exon N precursor
                                 (AA at 31)
    sigp    /    29       45     Ig lambda-chain V1-J region, exon N signal
                                 peptide (AA at 31)
    matp         46  >   330     Ig lambda-chain V1-J region, exon N
    IVS     <     1       28     Ig lambda-chain V1-J region intron X
BASE COUNT       71 a    101 c     80 g     78 t
ORIGIN      
        1 ccccattttt tctcctctcc ttctccaggt tccctggtgc aggcagcgct gactcagccg
       61 gcctcggtgt cagcaaacct gggagaaacc gtcgagatca cctgctccgg gggtagcagc
      121 gactatggct ggtaccagca gaagtcacct ggcagtgccc ctctcactgt gatctataat
      181 ggcaataaca gaccctcaaa catcccttca cgattctccg gttccctatc cggctccaca
      241 aacacattaa ccatcactgg ggtccaagtc gaggacgagg ctgtctattt ctgtgggagt
      301 gcagacagca gtgctggtta tgctgatata
//
LOCUS       CHKIGLAS      332 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION   M34808
KEYWORDS    immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
            processed gene; variable region.
SOURCE      Chicken (inbred strain B19) 3 month old spleen DNA, clone 126.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 332)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    34  >   332     Ig lambda-chain V1-J region, exon N precursor
                                 (AA at 36)
    sigp    /    34       50     Ig lambda-chain V1-J region, exon N signal
                                 peptide (AA at 36)
    matp         51  >   332     Ig lambda-chain V1-J region, exon N
    IVS     <     1       33     Ig lambda-chain V1-J region intron X
BASE COUNT       70 a    106 c     79 g     77 t
ORIGIN      
        1 ccgttttctc ccctctctcc tctccctctc caggttccct ggtgcaggca gcgctgactc
       61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccgggggtt
      121 acagcgacta tggctggttc cagcagaagt ctcctggcag tgcccctgtc actctgatct
      181 atgctaacac caacagaccc tcgaacatcc cttcacgatt ctccggttcc aaatctggct
      241 ccacaaacac attaaccatc actggggtcc aagccgagga cgaggctgtc tattactgtg
      301 ggagctggga tagcagtggt aatagtggta ta
//
LOCUS       CHKIGLAT      108 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged lambda-chain gene V1-J region, partial cds.
ACCESSION   M34809
KEYWORDS    immunoglobulin light-chain; joining exon; lambda-immunoglobulin;
            processed gene; variable region.
SOURCE      Chicken (inbred strain B19) 3 month old spleen DNA, clone 34.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 108)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   108     Ig lambda-chain V1-J region, exon N (AA at 1)
BASE COUNT       22 a     27 c     35 g     24 t
ORIGIN      
        1 ggatccggct ccacgggcac attaaccatc actggggtcc gagccgagga cgaggctgtc
       61 tattactgtg gtggctacga cggcagcagt gcttatgctg gtgatata
//
LOCUS       CHKIGLAU      529 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged light-chain gene J region, partial cds.
ACCESSION   M34811
KEYWORDS    immunoglobulin light-chain; joining exon; processed gene.
SOURCE      Chicken (inbred strain B19) 3 month old bursa DNA, clone 2.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 529)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  /    37     Ig light-chain J region, exon N (AA at 3)
    IVS          38  >   529     IgL intron X
BASE COUNT       94 a    123 c    155 g    157 t
ORIGIN      
        1 ggtatatttg gggccgggac aaccttgacc gtcctaggtg agtcgctgac ctcgtctcgg
       61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt
      121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
      181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
      241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
      301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
      361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
      421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
      481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS       CHKIGLAV      529 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged light-chain gene J region, partial cds.
ACCESSION   M34812
KEYWORDS    immunoglobulin light-chain; joining exon; processed gene.
SOURCE      Chicken (inbred strain B19) 3 month old spleen DNA, clone 1.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 529)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  /    37     Ig light-chain J region, exon N (AA at 3)
    IVS          38  >   529     IgL intron X
BASE COUNT       95 a    124 c    154 g    156 t
ORIGIN      
        1 gatatatttg gggccgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg
       61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt
      121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
      181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
      241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
      301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
      361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
      421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
      481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS       CHKIGLAW      529 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged light-chain gene J region, partial cds.
ACCESSION   M34813
KEYWORDS    immunoglobulin light-chain; joining exon; processed gene.
SOURCE      Chicken (inbred strain B19) 3 month old spleen DNA, clone 126.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 529)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  /    37     Ig light-chain J region, exon N (AA at 3)
    IVS          38  >   529     IgL intron X
BASE COUNT       94 a    123 c    155 g    155 t      2 others
ORIGIN      
        1 ggtanntttg gggacgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg
       61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt
      121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
      181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
      241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
      301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
      361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
      421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
      481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS       CHKIGLAX      529 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig rearranged light-chain gene J region, partial cds.
ACCESSION   M34814
KEYWORDS    immunoglobulin light-chain; joining exon; processed gene.
SOURCE      Chicken (inbred strain B19) 3 month old spleen DNA, clone 34.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 529)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  /    37     Ig light-chain J region, exon N (AA at 3)
    IVS          38  >   529     IgL intron X
BASE COUNT       97 a    126 c    152 g    154 t
ORIGIN      
        1 gatatattag gggccgggac aaccctgacc gtcctaggtg agtcactgac ctcgtctcgg
       61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat tcttggtgat ttgggggttt
      121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
      181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
      241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
      301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
      361 tttgtcagga aattacttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
      421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
      481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS       CHKIGLJA      529 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig germline light-chain gene J region, partial cds.
ACCESSION   M34810
KEYWORDS    germline; immunoglobulin light-chain; joining exon.
SOURCE      Chicken (inbred strain B19) 3 month old bursa DNA, clone B19-Eco.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 529)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  /    37     Ig light-chain J region, exon N (AA at 3)
    IVS          38  >   529     IgL intron X
BASE COUNT       94 a    124 c    155 g    156 t
ORIGIN      
        1 ggtatatttg gggccgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg
       61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt
      121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg
      181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga
      241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt
      301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc
      361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac
      421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca
      481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct
//
LOCUS       CHKIGLVE      335 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken Ig germline lambda-chain gene V1-J region, partial cds.
ACCESSION   M34803
KEYWORDS    germline; immunoglobulin light-chain; joining exon;
            lambda-immunoglobulin; variable region.
SOURCE      Chicken (inbred strain B19) 3 month old bursa DNA, clone B19/g1.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 335)
  AUTHORS   Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I.
  TITLE     Somatic diversification of chicken immunoglobulin light chains by
            point mutations
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    34  >   335     Ig lambda-chain V1-J region, exon N precursor
                                 (AA at 36)
    sigp    /    34       50     Ig lambda-chain V1-J region, exon N signal
                                 peptide (AA at 36)
    matp         51  >   335     Ig lambda-chain V1-J region, exon N
    IVS     <     1       33     Ig lambda-chain V1-J region intron X
BASE COUNT       70 a    111 c     82 g     72 t
ORIGIN      
        1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc
       61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccggggata
      121 ggagctacta tggctggtac cagcagaagg cacctggcag tgcccctgtc actgtgatct
      181 atgctaacac caacagaccc tcggacatcc cttcacgatt ctccggttcc aaatccggct
      241 ccacagccac attaaccatc actggggtcc aagccgacga cgaggctgtc tattactgtg
      301 ggagtgcaga cagcagcagt actgctggtg gtata
//
LOCUS       CH1L1CG      7502 bp ds-DNA   circular  BCT       27-JUL-1990
DEFINITION  Plasmid pCHL1 (from C.trachomatis) virulence proteins pGP1-D,
            pGP2-D, pGP3-D, pGP4-D, pGP5-D, pGP6-D, pGP7-D, and pGP8-D,
            complete cds.
ACCESSION   J03321
KEYWORDS    complete genome; virulence protein.
SOURCE      Plasmid pCHL1 (from Chlamydia trachomatis trachoma biovar, serotype
            D) DNA.
  ORGANISM  Plasmid pCHL1
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 7502)
  AUTHORS   Comanducci,M., Ricci,S., Cevenini,R. and Ratti,G.
  TITLE     Diversity of the Chlamydia trachomatis common plasmid in biovars
            with different pathogenicity
  JOURNAL   Plasmid 23, 149-154 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Ratti, 28-MAR-1990.
FEATURES       from  to/span     description
    pept       1480      488 (c) pGP8-D
    pept       1579     2934     pGP1-D
    pept       2928     3992     pGP2-D
    pept       4054     4848     pGP3-D
    pept       4918     5226     pGP4-D
    pept       5317     6048     pGP5-D (gtg start codon)
    pept       6045     6788     pGP6-D (gtg start codon)
    pept       7022     7502     pGP7-D
                  1      437     pGP7-D
    rpt        6857     6945     four tandem 22bp repeats
BASE COUNT     2460 a   1285 c   1433 g   2324 t
ORIGIN      
        1 ggatccgtaa gttagacgaa attttgtctt tgcgcacaga cgatctattt tttgcatcca
       61 atcagatttc ctttcgcatt aaaaaaagac agaataaaga aaccaaaatt ctaatcacat
      121 ttcctatcag cttaatggaa gagttgcaaa aatacacttg tgggagaaat gggagagtat
      181 ttgtttctaa aatagggatt cctgtaacaa caagtcaggt tgcgcataat tttaggcttg
      241 cagagttcca tagtgctatg aaaataaaaa ttactcccag agtacttcgt gcaagcgctt
      301 tgattcattt aaagcaaata ggattaaaag atgaggaaat catgcgtatt tcctgtcttt
      361 catcgagaca aagtgtgtgt tcttattgtt ctggggaaga ggtaattcct ctagtacaaa
      421 cacccacaat attgtgatat aattaaaatt atattcatat tctgttgcca gaaaaaacac
      481 ctttaggcta tattagagcc atcttctttg aagcgttgtc ttctcgagaa gatttatcgt
      541 acgcaaatat catctttgcg gttgcgtgtc ctgtgacctt cattatgtcg gagtctgagc
      601 accctaggcg tttgtactcc gtcacagcgg ttgctcgaag cacgtgcggg gttattttaa
      661 aagggattgc agcttgtagt cctgcttgag agaacgtgcg ggcgatttgc cttaacccca
      721 ccatttttcc ggagcgagtt acgaagacaa aacctcttcg ttgaccgatg tactcttgta
      781 gaaagtgcat aaacttctga ggataagtta taataatcct cttttctgtc tgacggttct
      841 taagctggga gaaagaaatg gtagcttgtt ggaaacaaat ctgactaatc tccaagctta
      901 agacttcaga ggagcgttta cctccttgga gcattgtctg ggcgatcaac caatcccggg
      961 cattgatttt ttttagctct tttaggaagg atgctgtttg caaactgttc atcgcatccg
     1021 tttttactat ttccctggtt ttaaaaaatg ttcgactatt ttcttgttta gaaggttgcg
     1081 ctatagcgac tattccttga gtcatcctgt ttaggaatct tgttaaggaa atatagcttg
     1141 ctgctcgaac ttgtttagta ccttcggtcc aagaagtctt ggcagaggaa acttttttaa
     1201 tcgcatctag gattagatta tgatttaaaa gggaaaactc ttgcagattc atatccaagg
     1261 acaatagacc aatcttttct aaagacaaaa aagatcctcg atatgatcta caagtatgtt
     1321 tgttgagtga tgcggtccaa tgcataataa cttcgaataa ggagaagctt ttcatgcgtt
     1381 tccaatagga ttcttggcga atttttaaaa cttcctgata agacttttca ctatattcta
     1441 acgacatttc ttgctgcaaa gataaaatcc ctttacccat gaaatccctc gtgatataac
     1501 ctatccgtaa aatgtcctga ttagtgaaat aatcaggttg ttaacaggat agcacgctcg
     1561 gtattttttt atataaacat gaaaactcgt tccgaaatag aaaatcgcat gcaagatatc
     1621 gagtatgcgt tgttaggtaa agctctgata tttgaagact ctactgagta tattctgagg
     1681 cagcttgcta attatgagtt taagtgttct catcataaaa acatattcat agtatttaaa
     1741 cacttaaaag acaatggatt acctataact gtagactcgg cttgggaaga gcttttgcgg
     1801 cgtcgtatca aagatatgga caaatcgtat ctcgggttaa tgttgcatga tgctttatca
     1861 aatgacaagc ttagatccgt ttctcatacg gttttcctcg atgatttgag cgtgtgtagc
     1921 gctgaagaaa atttgagtaa tttcattttc cgctcgttta atgagtacaa tgaaaatcca
     1981 ttgcgtagat ctccgtttct attgcttgag cgtataaagg gaaggcttga tagtgctata
     2041 gcaaagactt tttctattcg cagcgctaga ggccggtcta tttatgatat attctcacag
     2101 tcagaaattg gagtgctggc tcgtataaaa aaaagacgag tagcgttctc tgagaatcaa
     2161 aattctttct ttgatggctt cccaacagga tacaaggata ttgatgataa aggagttatc
     2221 ttagctaaag gtaatttcgt gattatagca gctagaccat ctatagggaa aacagcttta
     2281 gctatagaca tggcgataaa tcttgcggtt actcaacagc gtagagttgg tttcctatct
     2341 ctagaaatga gcgcaggtca aattgttgag cggattattg ctaatttaac aggaatatct
     2401 ggtgaaaaat tacaaagagg ggatctctct aaagaagaat tattccgagt agaagaagct
     2461 ggagaaacgg ttagagaatc acatttttat atctgcagtg atagtcagta taagcttaac
     2521 ttaatcgcga atcagatccg gttgctgaga aaagaagatc gagtagacgt aatatttatc
     2581 gattacttgc agttgatcaa ctcatcggtt ggagaaaatc gtcaaaatga aatagcagat
     2641 atatctagaa ccttaagagg tttagcctca gagctaaaca ttcctatagt ttgtttatcc
     2701 caactatcta gaaaagttga ggatagagca aataaagttc ccatgctttc agatttgcga
     2761 gacagcggtc aaatagagca agacgcagat gtgattttgt ttatcaatag gaaggaatcg
     2821 tcttctaatt gtgagataac tgttgggaaa aatagacatg gatcggtttt ctcttcggta
     2881 ttacatttcg atccaaaaat tagtaaattc tccgctatta aaaaagtatg gtaaattata
     2941 gtaactgcca cttcatcaaa agtcctatcc accttgaaaa tcagaagttt ggaagaagac
     3001 ctggtcaatc tattaagata tctcccaaat tggctcaaaa tgggatggta gaagttatag
     3061 gtcttgattt tctttcatct cattaccatg cattagcagc tatccaaaga ttactgaccg
     3121 caacgaatta caaggggaac acaaaagggg ttgttttatc cagagaatca aatagttttc
     3181 aatttgaagg atggatacca agaatccgtt ttacaaaaac tgaattctta gaggcttatg
     3241 gagttaagcg gtataaaaca tccagaaata agtatgagtt tagtggaaaa gaagctgaaa
     3301 ctgctttaga agccttatac catttaggac atcaaccgtt tttaatagtg gcaactagaa
     3361 ctcgatggac taatggaaca caaatagtag accgttacca aactctttct ccgatcatta
     3421 ggatttacga aggatgggaa ggtttaactg acgaagaaaa tatagatata gacttaacac
     3481 cttttaattc accacctaca cggaaacata aagggttcgt tgtagagcca tgtcctatct
     3541 tggtagatca aatagaatcc tactttgtaa tcaagcctgc aaatgtatac caagaaataa
     3601 aaatgcgttt cccaaatgca tcaaagtatg cttacacatt tatcgactgg gtgattacag
     3661 cagctgcgaa aaagagacga aaattaacta aggataattc ttggccagaa aacttgttat
     3721 taaacgttaa cgttaaaagt cttgcatata ttttaaggat gaatcggtac atctgtacaa
     3781 ggaactggaa aaaaatcgag ttagctatcg ataaatgtat agaaatcgcc attcagcttg
     3841 gctggttatc tagaagaaaa cgcattgaat ttctggattc ttctaaactc tctaaaaaag
     3901 aaattctata tctaaataaa gagcgctttg aagaaataac taagaaatct aaagaacaaa
     3961 tggaacaatt agaacaagaa tctattaatt aatagcaagc ttgaaactaa aaacctaatt
     4021 tatttaaagc tcaaaataaa aaagagtttt aaaatgggaa attctggttt ttatttgtat
     4081 aacactgaaa actgcgtctt tgctgataat atcaaagttg ggcaaatgac agagccgctc
     4141 aaggaccagc aaataatcct tgggacaaca tcaacacctg tcgcagccaa aatgacagct
     4201 tctgatggaa tatctttaac agtctccaat aattcatcaa ccaatgcttc tattacaatt
     4261 ggtttggatg cggaaaaagc ttaccagctt attctagaaa agttgggaga tcaaattctt
     4321 gatggaattg ctgatactat tgttgatagt acagtccaag atattttaga caaaatcaaa
     4381 acagaccctt ctctaggttt gttgaaagct tttaacaact ttccaatcac taataaaatt
     4441 caatgcaacg ggttattcac tcccagtaac attgaaactt tattaggagg aactgaaata
     4501 ggaaaattca cagtcacacc caaaagctct gggagcatgt tcttagtctc agcagatatt
     4561 attgcatcaa gaatggaagg cggcgttgtt ctagctttgg tacgagaagg tgattctaag
     4621 ccctgcgcga ttagttatgg atactcatca ggcattccta atttatgtag tctaagaacc
     4681 agtattacta atacaggatt gactccgaca acgtattcat tacgtgtagg cggtttagaa
     4741 agcggtgtgg tatgggttaa tgccctttct aatggcaatg atattttagg aataacaaat
     4801 acttctaatg tatctttttt agaggtaata cctcaaacaa acgcttaaac aatttttatt
     4861 ggatttttct tataggtttt atatttagag aaaacagttc gaattacggg gtttgttatg
     4921 caaaataaaa gaaaagtgag ggacgatttt attaaaattg ttaaagatgt gaaaaaagat
     4981 ttccccgaat tagacctaaa aatacgagta aacaaggaaa aagtaacttt cttaaattct
     5041 cccttagaac tctaccataa aagtgtctca ctaattctag gactgcttca acaaatagaa
     5101 aactctttag gattattccc agactctcct gttcttgaaa aattagagga taacagttta
     5161 aagctaaaaa aggctttgat tatgcttatc ttgtctagaa aagacatgtt ttccaaggct
     5221 gaatagacaa cttactctaa cgttggagtt gatttgcaca ccttagtttt ttgctctttt
     5281 aagggaggaa ctggaaaaac aacactttct ctaaacgtgg gatgcaactt ggcccaattt
     5341 ttagggaaaa aagtgttact tgctgaccta gacccgcaat ccaatttatc ttctggattg
     5401 ggggctagtg tcagaagtga ccaaaaaggc ttgcacgaca tagtatacac atcaaacgat
     5461 ttaaaatcaa tcatttgcga aacaaaaaaa gatagtgtgg acctaattcc tgcatcattt
     5521 tcatccgaac agtttagaga attggatatt catagaggac ctagtaacaa cttaaagtta
     5581 tttctgaatg agtactgcgc tcctttttat gacatctgca taatagacac tccacctagc
     5641 ctaggagggt taacgaaaga agcttttgtt gcaggagaca aattaattgc ttgtttaact
     5701 ccagaacctt tttctattct agggttacaa aagatacgtg aattcttaag ttcggtcgga
     5761 aaacctgaag aagaacacat tcttggaata gctttgtctt tttgggatga tcgtaactcg
     5821 actaaccaaa tgtatataga cattatcgag tctatttaca aaaacaagct tttttcaaca
     5881 aaaattcgtc gagatatttc tctcagccgt tctcttctta aagaagattc tgtagctaat
     5941 gtctatccaa attctagggc cgcagaagat attctgaagt taacgcatga aatagcaaat
     6001 attttgcata tcgaatatga acgagattac tctcagagga caacgtgaac aaactaaaaa
     6061 aagaagcgga tgtctttttt aaaaaaaatc aaactgccgc ttctctagat tttaagaaga
     6121 cgcttccctc cattgaacta ttctcagcaa ctttgaattc tgaggaaagt cagagtttgg
     6181 atcgattatt tttatcagag tcccaaaact attcggatga agaattttat caagaagaca
     6241 tcctagcggt aaaactgctt actggtcaga taaaatccat acagaagcaa cacgtacttc
     6301 ttttaggaga aaaaatctat aatgctagaa aaatcctgag taaggatcac ttctcctcaa
     6361 caactttttc atcttggata gagttagttt ttagaactaa gtcttctgct tacaatgctc
     6421 ttgcatatta cgagcttttt ataaacctcc ccaaccaaac tctacaaaaa gagtttcaat
     6481 cgatccccta taaatccgca tatattttgg ccgctagaaa aggcgattta aaaaccaagg
     6541 tcgatgtgat agggaaagta tgtggaatgt cgaactcatc ggcgataagg gtgttggatc
     6601 aatttcttcc ttcatctaga aacaaagacg ttagagaaac gatagataag tctgattcag
     6661 agaagaatcg ccaattatct gatttcttaa tagagatact tcgcatcatg tgttccggag
     6721 tttctttgtc ctcctataac gaaaatcttc tacaacagct ttttgaactt tttaagcaaa
     6781 agagctgatc ctccgtcagc tcatatatat atatctatta tatatatata tttagggatt
     6841 tgatttcacg agagagattt gcaactcttg gtggtagact ttgcaactct tggtggtaga
     6901 ctttgcaact cttggtggta gactttgcaa ctcttggtgg tagacttggt cataatggac
     6961 ttttgttaaa aaatttatta aaatcttaga gctccgattt tgaatagctt tggttaagaa
     7021 aatgggctcg atggctttcc ataaaagtag attgttttta acttttgggg acgcgtcgga
     7081 aatttggtta tctactttat cttatctaac tagaaaaaat tatgcgtctg ggattaactt
     7141 tcttgtttct ttagagattc tggatttatc ggaaaccttg ataaaggcta tttctcttga
     7201 ccacagcgaa tctttgttta aaatcaagtc tctagatgtt tttaatggaa aagttgtttc
     7261 agaggcatct aaacaggcta gagcggcatg ctacatatct ttcacaaagt ttttgtatag
     7321 attgaccaag ggatatatta aacccgctat tccattgaaa gattttggaa acactacatt
     7381 ttttaaaatc cgagacaaaa tcaaaacaga atcgatttct aagcaggaat ggacagtttt
     7441 ttttgaagcg ctccggatag tgaattatag agactattta atcggtaaat tgattgtaca
     7501 ag
//
LOCUS       MUSMIP1X      638 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse macrophage inflammatory protein 1-beta (MIP-1) mRNA,
            complete cds.
ACCESSION   M35590
KEYWORDS    macrophage inflammatory protein 1-beta.
SOURCE      Mouse macrophage cell line RAW 264.7, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 638)
  AUTHORS   Sherry,B., Tekamp-Olson,P., Gallegos,C., Bauer,D., Davatelis,G.,
            Wolpe,S.D., Masiarz,F., Coit,D. and Cerami,A.
  TITLE     Resolution of the two components of macrophage inflammatory protein
            1, and cloning and characterization of one of those components,
            macrophage inflammatory protein 1-beta
  JOURNAL   J. Exp. Med. 168, 2251-2259 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         63      341     macrophage inflammatory protein 1-beta (MIP-1)
                                 precursor
    sigp         63      131     macrophage inflammatory protein 1-beta signal
                                 peptide
    matp        132      338     macrophage inflammatory protein 1-beta
    mRNA    <     1      638     macrophage inflammatory protein 1 mRNA
BASE COUNT      125 a    175 c    139 g    199 t
ORIGIN      
        1 gcttctgaag cttctgggcc ctgcagtccc agctctgtgc aaacctaacc ccgagcaaca
       61 ccatgaagct ctgcgtgtct gccctctctc tcctcttgct cgtggctgcc ttctgtgctc
      121 cagggttctc agcaccaatg ggctctgacc ctcccacttc ctgctgtttc tcttacacct
      181 cccggcagct tcacagaagc tttgtgatgg attactatga gaccagcagt ctttgctcca
      241 agccagctgt ggtattcctg accaaaagag gcagacagat ctgtcctaac cccagtcagc
      301 cctgggtcac tgagtacatg agtcacttgg agttgaactg agcagctcca gcggcagggc
      361 aggaggagcc acttcaggag aggcctcctc agccctgatg cttctcactg agaagcgtcc
      421 ttgctcctca cgttcagatt tcctgcccct cttcttaatt taaatctctg tgtagacttt
      481 gttttgtttt tttgggggag tattatttct attatttatg ttttagttat aggacgcgtc
      541 tctcccatgg agatggtcca ccattgctgt ttctctgcta ttgtggatat gactgtgaaa
      601 ttgatttcat gcattttcat aataaatctt tctttaag
//
LOCUS       DENAAA        699 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Dengue virus type 2 RNA.
ACCESSION   M36806
KEYWORDS    .
SOURCE      Dengue virus type 2, cDNA to viral RNA.
  ORGANISM  Dengue virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Flaviviridae; Flavivirus (arbovirus group B).
REFERENCE   1  (bases 1 to 699)
  AUTHORS   Hahn,C.S., Hahn,Y.S., Rice,C.M., Lee,E., Dalgarno,L.,
            Strauss,E.G. and Strauss,J.H.
  TITLE     Conserved elements in the 3' untranslated region of flavivirus RNAs
            and potential cyclization sequences
  JOURNAL   J. Mol. Biol. 198, 33-41 (1987)
  STANDARD  simple staff_review
BASE COUNT      170 a    160 c    183 g    186 t
ORIGIN      
        1 cctccatcag agcatggctg tcttagggct actcttctgc ctagtgacat tcccaagctg
       61 tgagtgtttc agggtttcag aagagggact aggacatgtc agataggaga tgtgtgactt
      121 atgttgatgt tgcttgtccc caggtgtcct atcccaggtg cagctgaagc agtcaggacc
      181 tggcctggtg cagccctcac agagcctgtc catcacctgc acagtctctg gtttctcatt
      241 aactacctat ggtgtaaact gggttcgcca gtctccagga aagggtctgg agtggctggg
      301 agtgatatgg agtggtggaa gcacagagta taatgcagct ttcatatcca gactgaacat
      361 caccaaggac aattccaaga ccaagttttc tttaaaatga acagtctgca atctaatgac
      421 acagccatat attactgtgc cagaagggac tacgttagta ggtacgaccc gtttgcttac
      481 tggggccaag ggactctggt cactgtctct gcaggtgagt cctaacttct cccattctag
      541 atgcatcttg gggggattct gagccttcag gaccaagatt ctctgcaaac gaatcaagat
      601 tcaacccctt tgtaccaaaa ttgagacagg ggtctgggct agggactctc tgcctgctgg
      661 tctgtggtgg cattagaaat gaagtatgat gaaggatct
//
LOCUS       MVEAAA        558 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Murray Valley encephalitis virus RNA.
ACCESSION   M35172
KEYWORDS    repetitive DNA.
SOURCE      Murray Valley encephalitis virus, cDNA to viral RNA.
  ORGANISM  Murray Valley encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Flaviviridae; Flavivirus (arbovirus group B).
REFERENCE   1  (bases 1 to 558)
  AUTHORS   Hahn,C.S., Hahn,Y.S., Rice,C.M., Lee,E., Dalgarno,L.,
            Strauss,E.G. and Strauss,J.H.
  TITLE     Conserved elements in the 3' untranslated region of flavivirus RNAs
            and potential cyclization sequences
  JOURNAL   J. Mol. Biol. 198, 33-41 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    ORF     <     1       30     open reading frame
    rpt         184      211     repeat
    rpt         333      359     repeat
BASE COUNT      170 a    122 c    162 g    104 t
ORIGIN      
        1 actcatgtga gtgaagatag ggtcttgtaa ataacattga tagaaaattt tgtaaatatt
       61 taatgtaata tagtataggt aaaatttttt gaaattaagt aaaattaagt agcaagactt
      121 gatagtcagg ccagccggtt aggctgcacc cgaaggttgg tagacggtgc tgcctgcgac
      181 caaccccagg aggactgggt taccaaagct gattctccac ggttggaaag cctcccagaa
      241 ccgtctcgga agaggagtcc ctgccaacaa tggagatgaa gcccgtgtca gagccgaaag
      301 cgccacttcg ccgaggagtg caatctgtga ggccccagga ggactgggta aacaaagccg
      361 taaggccccc gcagcccggg ccgggaggag gtgatgcaaa ccccggcgaa ggactagagg
      421 ttagaggaga ccctgcggaa gaaatgagtg gcccaagctc gccgaagctg taaggcgggt
      481 ggacggacta gaggttagag gagaccccac tctcaaaagc atcaaacaac agcatattga
      541 cacctgggaa aagactag
//
LOCUS       ECOSTII       806 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  E.coli heat-stable toxin II (STII) gene, complete cds.
ACCESSION   M35586
KEYWORDS    heat-stable toxin II.
SOURCE      E.coli (strain K12/711) DNA, pCHL6.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 806)
  AUTHORS   Lee,C.H., Moseley,S.L., Moon,H.W., Whipp,S.C., Gyles,C.L. and So,M.
  TITLE     Characterization of the gene encoding heat-stable toxin II and
            preliminary molecular epidemiological studies of enterotoxigenic
            Escherichia coli heat-stable toxin II producers
  JOURNAL   Infect. Immun. 42, 264-268 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        189       40 (c) ORF 1
    pept        443      658     heat-stable toxin II (put.)
BASE COUNT      287 a    141 c    135 g    243 t
ORIGIN      
        1 aagcttagta ttgttgacat gaacagcaaa tcgacttaat caaaaaacta tcagagaacg
       61 ctgcattatt gattttagga ctttcaatgc aaatatcatc gaggaacgta catcattgcc
      121 cacagataca acggatagtt gtatgaaagc ataaagcaaa atcaagattg caaacacaga
      181 agggggcatt cctccccctc tcttttgcac ttctttccta tatttttaca agaaaagaaa
      241 atgattttat atgttgaata ataaaacaat caatatgtaa atacctacaa cgggtgattg
      301 acactacact cattaactat actgcaagta gcattaaaaa tcttaataaa ggagagcttc
      361 gtcacatttt tttgacttga ctcatataaa ggcccactgg tataagtttt attgcttata
      421 gcaataaggt tgaggtgatt ttatgaaaaa gaatatcgca tttcttcttg catctatgtt
      481 cgttttttct attgctacaa atgcctatgc atctacacaa tcaaataaaa aagatctgtg
      541 tgaacattat agacaaatag ccaaggaaag ttgtaaaaaa ggttttttag gggttagaga
      601 tggtactgct ggagcatgct ttggcgccca aataatggtt gcagcaaaag gatgctaata
      661 tatttatcaa tagcattcag caccatatac acaaaaataa tttttcataa aaagaactct
      721 ataaaataaa tattttttgt gacaatgtcc taacgcaaga cggacattgt ccatttctca
      781 ctgcaggctg cggctgagcg ataccc
//
LOCUS       TRNTETE       830 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  Transposon Tn10 (from E.coli) tetracycline resistance E (tetE)
            gene, complete cds.
ACCESSION   M34933
KEYWORDS    tetracycline resistance.
SOURCE      Transposon Tn10 (from E.coli) DNA, clone pSL1503.
  ORGANISM  Transposon Tn10
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 830)
  AUTHORS   Tovar,K., Ernst,A. and Hillen,W.
  TITLE     Identification and nucleotide sequence of the class E tet
            regulatory elements and operator and inducer binding of the encoded
            purified Tet repressor
  JOURNAL   Mol. Gen. Genet. 215, 76-80 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        152      787     tetracycline resistance determinant E
BASE COUNT      244 a    179 c    187 g    220 t
ORIGIN      
        1 agttaacgtt ctctatcgat gatagggttt gaaaaataac tctatcagtg atagattgtc
       61 aacagcaagt atcaattgca agagatagct actatcccaa actttttatt gagatagtca
      121 ctatctaaca gttgtccaaa aggagatagt gatggcacga ctaagcttgg acgacgtaat
      181 ttcaatggcg ctcaccctgc tggacagcga agggctagag ggcttgacta cgcgtaagct
      241 ggcgcagtcc ctaaaaattg agcaaccgac tctgtattgg cacctgcgca acaagcagac
      301 tcttatgaac atgctttcag aggcaatact ggcgaagcat cacacccgtt cagcaccgtt
      361 accgactgag agttggcagc agtttctcca ggaaaatgct ctgagtttcc gtaaagcatt
      421 actggtccat cgtgatggag cccgattgca tatagggacc tctcctacgc ccccccagtt
      481 tgaacaagca gaggcgcaac tacgctgtct atgcgatgca gggttttcgg tcgaggaggc
      541 tcttttcatt ctgcaatcta tcagccattt tacgttgggt gcagtattag aggagcaagc
      601 aacaaaccag atagaaaata atcatgtgat agacgctgca ccaccattat tacaagaggc
      661 atttaatatt caggcgagaa cctctgctga aatggccttc catttcgggc tgaaatcatt
      721 aatatttgga ttttctgcac agttagatga aaaaaagcat acacccattg aggatggtaa
      781 taaatgatgc tatctatgtg tcaactctaa tttatagtta tggatagtgt
//
LOCUS       CLONIFC      1191 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  C.pasteurianum nitrogenase C (nifC) gene, complete cds.
ACCESSION   M34365
KEYWORDS    nitrogen-fixation protein; nitrogenase C.
SOURCE      C.pasteurianum (strain W5) DNA.
  ORGANISM  Clostridium pasteurianum
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1191)
  AUTHORS   Wang,S.-Z., Chen,J.-S. and Johnson,J.L.
  TITLE     A nitrogen-fixation gene (nifC) in Clostridium pasteurianum with
            sequence similarity to chlJ of Escherichia coli
  JOURNAL   Biochem. Biophys. Res. Commun. 169, 1122-1128 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-Z.Wang, 18-MAY-1990.
FEATURES       from  to/span     description
    pept        331     1191     nitrogenase C (nifC)
BASE COUNT      403 a    132 c    178 g    478 t
ORIGIN      
        1 tttagtacat ctgaaatctg taattaagga gataataagt caaagtttaa tatatttgat
       61 ttgttatttc cttaattttt tataaataac tgctatagta taactgtata aaatatggta
      121 taatgtcgta gttgataata attatatttt tttatatatc attctgtgcc tataataatg
      181 tatgaactaa tatttattga tacaataaat tataaaatta tataagaata tatagaatat
      241 tatattgttt tatataagtt atagtatttt atattaaaca acttaattct aaatattttt
      301 ttatgtattt aacaggaggt aatataatac atggaaaata ataaaaaaat attagagagt
      361 tcaaaaaaac tttcttccta tggagatgga gagagcaggt tttcttttct tgaaaaaata
      421 ttagctcctt tatttttagc tctaacggct atatattttg taatgcttat atttccaata
      481 atttcaatga ttcggtattc aggtggttct cacattattc aaacccttta tgatcaagat
      541 aatataaaga ctataatttt aagctttgta acttcactta tagcacttat atttactttt
      601 atcataggaa ctccaacagc tttttgtatc aattttgtta gaaataaagt tttatctaaa
      661 atattagata tatttgtaga aattcctgta gttttgcctc cagcagtagc tggtattgct
      721 ttgctactag cttttggtaa aaatggtgta gtgggtaatt ttttatctaa tcatgggata
      781 aatgttatat ttacctctac agctgtaata attgcacaat tttttgtatc ttctgctctt
      841 tatgtaagag ttcttagaga ttctgttaaa tcagtaccta ttgaactttt tgaagtgagt
      901 tatgtgctag gggcaggaaa aattgaaaca ataattaaaa ttatgattcc aatgcttaaa
      961 aagtcaatcg tgtctggact tattttggct tggataagat ctcttggaga gtttggagcc
     1021 actttgatgt ttgcaggaaa cattatagga aaaacaagaa ccataccact tcaaatctat
     1081 acttatatgc aggatgatat taaaatggct actgcttttg ctacgatttt gtatataatg
     1141 acttttgtat tgttattatt agttagactt agtattaggg atgatgatta g
//
LOCUS       LACISTRAN     808 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  L.lactis insertion sequence IS946 transposase gene, complete cds.
ACCESSION   M33868
KEYWORDS    transposase.
SOURCE      L.lactis DNA.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 808)
  AUTHORS   Romero,D.A. and Klaenhammer,T.R.
  TITLE     Characterization of gram-positive insertion sequence IS946, an
            Iso-ISS1 element, isolated from the conjugative Lactococcal plasmid
            pTR2030
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.A.Romero, 26-APR-1990.
            
                        Author address: D.A.Romero
                        North Carolina State University
                        Dept. of Food Science
                        Box 7624, Raleigh, NC 27695
FEATURES       from  to/span     description
    pept         74      754     transposase (put.)
    rpt           1       18     inverted repeat
    rpt         791      808     inverted repeat
    signal        6       35     promoter region
    binding      62       65     ribosomal binding site
BASE COUNT      281 a    137 c    158 g    232 t
ORIGIN      
        1 ggttctgttg caaagttttc tgataagtct attttagtgt aaaatgaata aaaatgacag
       61 cgaggatata tcaatgaact attttaaagg taaacaattt caaaaagatg tgattattgt
      121 cgctgttggt tactacctgc gttacaatct aagctatcgt gaagttcagg aattgttata
      181 tgatcgtgga ataaatgttt gtcatactac gatttatcgt tgggtgcaag aatacagtaa
      241 agtcctctat catctctgga aaaagaaaaa tagacagtcc ttctattcgt ggaaaatgga
      301 cgaaacctat atcaaaatca aaggtcgttg gcattatctc tatcgtgcaa ttgatgcgga
      361 tggcttaacc ttagatatct ggttacgaaa gaaacgggat acgcaagcag cctatgcttt
      421 cttaaaacga ctccataaac agtttggtca accaagagta attgtcacgg ataaagcgcc
      481 ctctattggt tctgcattta gaaagttaca gagtaacggt ttatatacta agacagagca
      541 tcgaaccgtg aagtatctca ataacctcat tgagcaagac catcgaccaa tcaaacgacg
      601 caataaattt tatcgaagtc tacgaactgc ctcaaccacg attaagggca tggaaacaat
      661 tcgaggaata tacaaaaaga accgaagaaa tggaacgctc ttcggatttt cggtatctac
      721 tgagattaag gtcttaatgg gaatattagc ttaagaacaa gaaggattat aaaccttgta
      781 tttgattttt aaactttgca acagaacc
//
LOCUS       MUSMB1       5629 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Murine B cell 1 (mb-1) gene, complete cds.
ACCESSION   M31773
KEYWORDS    murine B cell 1.
SOURCE      Mouse (strain Balb/c) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5629)
  AUTHORS   Kashiwamura,S.-I., Koyama,T., Matsuo,T., Steinmetz,M.,
            Kimoto,M. and Sakaguchi,N.
  TITLE     Structure of the murine mb-1 gene encoding a putative
            sIgM-associated molecule
  JOURNAL   J. Immunol. 145, 337-343 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Sakaguchi, 01-FEB-1990.
FEATURES       from  to/span     description
    pept        277      343     B cell 1, exon 1
               1823     2116     B cell 1, exon 2
               2308     2426     B cell 1, exon 3
               4450     4518     B cell 1, exon 4
               4646     4759     B cell 1, exon 5
    pre-msg     268     5072     B cell 1 mRNA and introns
    IVS         344     1822     mb-1 intron A
    IVS        2117     2307     mb-1 intron B
    IVS        2427     4449     mb-1 intron C
    IVS        4519     4645     mb-1 intron D
BASE COUNT     1332 a   1515 c   1400 g   1382 t
ORIGIN      
        1 gatcctttct cagggatcca tggtgatgaa ccaggccttg aaccaccctc tccccgaccc
       61 cacgcactag agagagactc aagggaattg tggccaccca ggtgcagggc agttcctctc
      121 cactcagagc ccacacatat ggcaaataaa gggccaggag taagggccac tccagcccat
      181 ctccggcacg gctgaacagg aagtgaggcg gagtttggag tggaatctgg tacctcaagc
      241 tacggccacc caaactaacc gactcactgg gagacgatgc cagggggtct agaagccctc
      301 agagccctgc ctctcctcct cttcttgtca tacgcctgtt tgggtaatgt ggccaaaggg
      361 aaggtgcggg tggggcgggg gtgttgcggc tgacaggaga ggtgcagagt ggagtcagaa
      421 atggccaggc agggtgatgc ataacgtggc atctcttaaa gtaacgccag aactctagga
      481 aactgaggag ggaggaacat gcatttgagg ccagctggcg tacataatga cttctatcta
      541 ggccaacctc gcaagttact cattaaaact ctctcaaaaa gaaaaggagt gtcgggaggg
      601 aatcagggaa agcctctctc agctagctca gacagctgag taagtccact ttaggaccaa
      661 cactaccttg cccacgacac acgaccttag ctcacagtca gaatgagaac tgagtaagtg
      721 tgactgaaca cacaccagtg agtgtagcaa tggtgtcggt gagggaacgg gttttattgt
      781 tgttgttgtt gttgttgttg ttttttcctc tgccttacat ctggaggcag gcttgccatg
      841 tgaccacata gacccaggtt tgcagtggag gtcttacgag actggggccc tggtctctgt
      901 gtcttaagcc tgtgtgacag gaactggctg gggctctttc ccctcccttt ctttcttcct
      961 cacttccgac cttcttcttt tctgtcatgt ccacaaaagg gacctacaag ccagtgtgga
     1021 gctggggagg ctgagaacag ggtggggggt ggggggggcg ggtcaagata aatacaagag
     1081 cagacaccat agccaccagt ccaccccgtc tcttagcctc ctccgatgcc ccagacagaa
     1141 ctcagtccca gtgcctaggt taatactggt ttctctgttt tgtttttcga gacagggatt
     1201 ctgtgtgcaa ctggaacgta ctctgttaga ccagcaggcc tcaaactccc agagatctgc
     1261 ctgacttctc tctcctgagt gctgggatta aaggcatgta ccagtactgc ctggcagtcg
     1321 ccacatttaa attaaattac agagatgaga aacagcctag gacaaaaaaa caaaaaaaca
     1381 aacaaacaaa caaacaaaaa aaacaaaaac aaaaaaaaaa acaaggccaa gcgcagtaaa
     1441 ggagagattc aggaaggtgg gaattgggct ctactatctt ttccaatagc ctggttactg
     1501 ctgagctgag ttttaccgct gcatctaagg ttctgcccca ccttcacccc ctccccgtgg
     1561 ggttcctaag agttgtaagg aatggggaac aaatgggatg gttttcatga aagaatccct
     1621 ggggtttttg gggaacatca aaatacaaag gggaaaggga agaaagaggc catagtgtgc
     1681 ccccccccaa gtctcccatc atctttttct agtacattac cattggtacc ggctccaatt
     1741 ctgatgtgaa ggggtctggg ctagagcaat catctccatt ctccaggcta tgacccatct
     1801 gtctcctctc ctctctccac aggtcccgga tgccaggccc tgcgggtaga agggggtcca
     1861 ccatccctga cggtgaactt gggcgaggag gcccgcctca cctgtgaaaa caatggcagg
     1921 aaccctaata tcacatggtg gttcagcctt cagtctaaca tcacatggcc cccagtgcca
     1981 ctgggtcctg gccagggtac cacaggccag ctgttcttcc ccgaagtaaa caagaacaca
     2041 ggggcttgta ctgggtgcca agtgatagaa aacaacatat taaaacgctc ctgtggtact
     2101 tacctccgcg tgcgcagtga gtagggaggg cgctggcctc ttgcgttccc tgctccctct
     2161 ttcttccaaa acattaggag cagacgtagc tcctccctcc tggacctgcc acccagccac
     2221 agagactggt tgcgagggct ccctgacctc gcaagggtct aggcgagaag aagggacaca
     2281 gaatgctgag cagcacctgt cttcaagatc cagtccctag gcccttcctg gacatggggg
     2341 aaggtaccaa gaaccgcatc atcacagcag aagggatcat cttgctgttc tgtgcagtgg
     2401 tgccagggac gctgctgcta ttcagggtga gtctccccca agtgacaccc ttcaagtcat
     2461 cttggcttac ctgggcctgg actgtgactc ccagtgtcca gctggtcctc tgaattcgag
     2521 ctcgcccgga gatcttgtac tgatgccaac tgatagaaaa caacatatta aaacgctcct
     2581 tgtacttacc atcccgtgcg cagtgagtag gaggcgcctg gcctccttgc tttccctgct
     2641 ccctctttct tcaaaacatt agagcaagct agctcctcct cctgacctcg acagcacaga
     2701 gatggtggtt caggcgcgtc cgaccctcgc aagggtctgg gctgccagaa gaagggacac
     2761 cagaatgctg agcagcaccc tgtcttcaca gatccagtcc ctaggccctt cctggacatg
     2821 ggggaaggta ccaagaaccg catcatcaca gcagaaggga tcatcttgct gttctgtgca
     2881 gtggtgccag ggacgctgct gctattcagg gtgagtctcc cccaagtgac acccctcaag
     2941 tcatcttggc ttccctgggc ctggactgtg actcccagtg tccagctggt ccttgggtct
     3001 tccgaattcc aaggtttgtc cttccaattt gggaggctgg gaataaccac caaaaggatt
     3061 ccaggggggt cagctgtccc cacggccctc atagaaaccc tctcacttct ggccgccaac
     3121 gggagtccct gcaaatgcag ccaagacaag ggaagaaggt gtgtcctccc agagcccctg
     3181 cgtctcctta ggccagaaga caacattttt caccacattg gggcttcatt attgtacgcc
     3241 tggctggcct tgaactcaca gagatctgcc tatttctgcc tcccaagtgc tgctgggatt
     3301 aaagggccac catgcctgcc cttcttcctt tctttaactt tttcttttcc ccaccaccta
     3361 accccccaag actgggtctc tctgtgtagc cctggtgtcc tagatataag gctggctcag
     3421 actcacagag atccacctgc ctctgcctcc tgagtgctgg gattacagtt gtgcacccct
     3481 atcgccctgc actttattta tgtcttagtg tgtacgtggg catgttgagt gtggaggtcg
     3541 aagaacaact ttcaggagtt gattctttct ccactgtgga ttcgcacaca aattcatatc
     3601 ctcaggtttg caggcataca ttaccagttg agctatcatc tgacttcttt gcctgagcta
     3661 tcatctgact tctttgcttt catttttatt ttagtttatt ttattttatt tttttaaagc
     3721 tagatattag tctagctagc ctcaaacttg ctacatcaca tggctggcct gattcctctt
     3781 taccttccaa gcactgaaat tacagggcag tggcaccacc agttttattt tgtttgtttt
     3841 aatttttgtt ttgtgttttg ttttgttttt gatggggagg gttagacagc attttgctat
     3901 gtaattcagg ctgggctaga actcactcta tagatcatat tggtctggaa ttctcctgcc
     3961 tcagtttgcc aagcactgga attacagatc tccagcacca ctcattccct gctcgccgtt
     4021 ttgcttgtca ttgaggagca gtagagagtc agcatggtgg tgcacagctt tgaccctagc
     4081 agcactcagg aggcagaggc aggcggatct ctcgggagcc aaggccagcc tggtccacgt
     4141 actgagttcc caggtagttc ttgctagata gagatcctca gagcatcagt ggagtggcag
     4201 ggaggtgcag cagttccact tgagacaggg agattgctta aagtttgagg cccataaaaa
     4261 ggaaagagag gagaggggca gagactagaa tcttgacaag atacctgatg gtgggaatct
     4321 agaccttgat ggtggggagc caagtatccc tcactctttc tgataaagca cttgctagta
     4381 ggagacagtc taaaaaccct tcacctccct ccttgctaaa cgtcgcttct acttcatccc
     4441 cttctgtaga aacggtggca aaatgagaag tttggggtgg acatgccaga tgactatgaa
     4501 gatgaaaatc tctatgaggt aagcactgtt tgcagcaggc gtcgggtgca tggggtgtgg
     4561 tgtctgcctt gagctctcac cctggagttg actggactgg ggctccagct gtcagggtac
     4621 tgatgtctcc tctctgaccc tgcagggcct gaaccttgat gactgttcta tgtatgagga
     4681 catctccagg ggactccagg gcacctacca ggatgtgggc aacctccaca ttggagatgc
     4741 ccagctggaa aagccatgac tgacatgtcc cacccttccc tgcctgccat atgtctgact
     4801 ccagcatccc tgcctcccca ctcttcccct ggagtccccc cttcccagtg cttgctccag
     4861 ccactcgagt gcgctaccct tgcctccagg ctgccccacc ctattctccc acctaatctt
     4921 gtccttcccc aagtccagct ccctctctag tgggtaatga gcccttaatc gctgcctcta
     4981 ggggagctga ttacagcatc ctcgttagtg tcaccccccc tcctcccaga tctgtcatgg
     5041 ccacttaagt gataataaat ccttcccaaa gcagccttgg caggagtcat ggatctcatg
     5101 tagactgctc ctcaacactg ccagggaacc aggagctaga aaacccaagt cccctccccc
     5161 ctcagaccca ggattctagg ttcaagttct cacctgacca cttctagatc tacctgacaa
     5221 ctcttcgaac ggtacagata gactctgcgt atgtcttcga cttcccctca acccctcccc
     5281 ccgaaaataa cggtcaaaga gctctcttgg ctaatcccca ccccaattgt aaatgtctgg
     5341 tttatctctt atgggcctca ccctcctctg tcccaaaaca aggggaacca gcccggtgtt
     5401 aacaaatttt tacctatatt taaactctga gaacacaaga tattattttc tggagtgggg
     5461 cctaattcct catgggttca cttgcataag gcagatttgg agagtggctg tcaaaaaccg
     5521 gggggagggg gatgacagag agagctagag tggtgtgaca gtcattggga tgggaagagc
     5581 agactcaggt tgttaaaaga caggaggcca agaggagact cagtgggga
//
LOCUS       FRNTUL4      1233 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  F.tularensis 17 kDa major membrane protein (TUL4) and 13-kDa
            protein genes, complete cds.
ACCESSION   M32059
KEYWORDS    13-kDa protein; 17 kDa major membrane protein.
SOURCE      F.tularensis (strain live vaccine) DNA.
  ORGANISM  Francisella tularensis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria.
REFERENCE   1  (bases 1 to 1233)
  AUTHORS   Sjoestedt,A., Sandstroem,G., Taernvik,A. and Jaurin,B.
  TITLE     Nucleotide sequence and T cell epitopes of a membrane protein of
            Francisella tularensis
  JOURNAL   J. Immunol. 145, 311-317 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A.Sjoestedt, 12-FEB-1990.
FEATURES       from  to/span     description
    pept         99      470     13-kDa protein
    pept        551     1000     17 kDa major membrane protein (TUL4) precursor
    sigp        551      607     17 kDa major membrane protein (TUL4) signal
                                 peptide
    matp        608      997     17 kDa major membrane protein (TUL4)
    binding      85       97     ribosome binding site
    binding     539      548     ribosome binding site
BASE COUNT      440 a    171 c    204 g    418 t
ORIGIN      
        1 gaattcatag tcgcctcaaa atatgtttag atatgttgtt aataagttat aatcttatat
       61 atagttaact atgttattat aatcaaggtt tttttgctat gatgacttta caagaagaaa
      121 aaatacaagc gccagtattt tttaaagaat atgtcaaagg taggtttata cttaatatag
      181 gtgagtataa tcatccattg attttatcag caactcaagt gcttgagtat caagataaaa
      241 ttgatgatat ccaaagtatc aaaaaaagtc atcttgatct tatcttagcg actaatcctg
      301 aaataatact cataggtacc ggtgaaaaac aacttttgcc tccacttgag ataattaatc
      361 aaatcgcaaa agctggtaaa agtgttgatt ttatggcgag tgatactgct tgtaagacat
      421 ataatttgct tgttaatgaa aatcgtaatg ttagctgtat catcatttaa taaactgctg
      481 tttattttat tttaattaat gttataatcg atttgagtat atgtgaatat ttaaaaatag
      541 gagtatctat atgaaaaaaa taattaagct tagtctttta tctttatcaa tcgcaggttt
      601 agcgagctgt tctactctag ggttaggtgg ctctgatgat gcaaaagctt cagctaaaga
      661 tactgctgct gctcagacag ctactactga gcaagctgct gctgtatcta agccaactgc
      721 aaaagtaagt ttaaataaac ttggtcagga taaaataaaa gcaactgtat atacagcata
      781 caataataac ccacaaggaa gtgtaagatt acaatggcag gctccagaag gttctaagtg
      841 ccatgataca agcttcccaa ttactaagta tgctgagaag aacgataaaa cttgggcaac
      901 tgtaacagtt aagcaaggta ataacttctg tagcggtaag tggacagcta atgtagttta
      961 tgacaaagaa gtaatcgctt ctgattcaat aaatatttaa tttcttcaaa tatttataat
     1021 tttctagttt ttgcttttct caaaaaaaaa taacggtgtg attaattttg cttagctaga
     1081 ttagttgtac ttttaacttt ttactaaaaa gatttgctat aattgcttaa atataacaaa
     1141 taaaaaagta atagtttaat gaaaaacatt atcttgttat taatgatatt ctcgatgtta
     1201 gttgtactta gccatgtgca ccaaccacag ctg
//
LOCUS       TIPREGU     29802 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  Plasmid Ti (from A.tumefaciens) virulence regulon (vir) genes,
            complete cds.
ACCESSION   J03320
KEYWORDS    virulence regulon.
SOURCE      Plasmid Ti (strain C58) (from A.tumefaciens) DNA.
  ORGANISM  Plasmid Ti
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 29802)
  AUTHORS   Rogowsky,P.M., Powell,B.S., Shirasu,K., Lin,T.-S., Morel,P.,
            Zyprian,E.M., Steck,T.R. and Kado,C.I.
  TITLE     Molecular characterization of the vir regulon of Agrobacterium
            tumefaciens: Complete nucleotide sequence and gene organization of
            the 28.63-kbp regulon cloned as a single unit
  JOURNAL   Plasmid 23, 85-106 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Kado, 02-MAR-1990.
FEATURES       from  to/span     description
    pept       1983     4484     virA protein
    pept       5029     5766     virB1 protein
    pept       5766     6131     virB2 protein
    pept       6131     6457     virB3 protein
    pept       6457     8823     virB4 protein
    pept       8840     9502     virB5 protein
    pept       9603    10487     virB6 protein
    pept      10523    10690     virB7 protein
    pept      10677    11390     virB8 protein
    pept      11387    12268     virB9 protein
    pept      12265    13398     virB10 protein
    pept      13439    14473     virB11 protein
    pept      14728    15453     virG protein (ttg start codon)
    pept      17174    17617     virD1 protein
    pept      17651    18994     virD2 protein
    pept      19009    21030     virD3 protein
    pept      21104    23101     virD4 protein
    pept      23098    25734     ORF 5
    pept      26802    27053     virE1 protein
    pept      27169    27360     virE2
    pept      27374    29044     virE3 29044
    site       1983    29044     vir regulon region
BASE COUNT     7085 a   7819 c   8213 g   6685 t
ORIGIN      
        1 taccgacgat atgtgcagtc catcagcagt ataatgatta caggagacca aaaattggtt
       61 tcaaatgtaa ccgaataagc ctgctgtacg taatggcaat cggtcgatat cagtcggcac
      121 agccactcgt cttgctgaga tgcaatgtcg acgtgtaacc aaaatgactt tgatggaagt
      181 cgactttgaa gcctgaccga caccggctaa tcgaaatctg caactgctgg aactcggaag
      241 cggccaaatc cctcgaggac cgagcgccgg ccgaggtgct tcgaaagaaa tcacttactc
      301 aaataagatg tgcgaatagc tttggcgcgc cgcatgtcgc gcttggaggt gacgccgacc
      361 gatgctgcgc cgccgatggc catccaacac gcaggtcaaa tgatgaagcg gcgaatcacc
      421 gaattcgcgt cagcgtgacc gggcaaacag gctgtcctgc gccatcttct ggccactgag
      481 gaaaatcgcg ctcctgcgag agcgcatgtt caagatattc attagctatg gcctctatga
      541 gctctacgtg ccgccctgca tcaatatttg gcaactggct gattgcgaga tcgtgtttgc
      601 gcgcaaaacg aattgcgcag cgatatccgt cgatatcttc caaaatcggt cgagcggcag
      661 ggtagttcca gagttccgcc aactcctcca acaacgaggg gcgatcttcc cggatggcaa
      721 acatttccgc aacgcgttgc ttggctcggg tgagaaaggc gtccgaatcc ccaagacgta
      781 gccgcttgac atgccattga aaacccgatc tccaaaacgg acttttagcc atgcaattga
      841 gaagcgaaat cgacccgccc tcgagaataa gaccctcttc ggacttccgc caatccactt
      901 cgaatatgag tcgacgatgg gcactctcag cgtcaaggat gccctcggtg agggggcggg
      961 aatccaaata tattctccgc gttgattgca attccgattc caaaggtctt ccgctacctg
     1021 tcgcgatttg aggacagcat tgcacacgat caagggcaac caccggccac ccggtttctt
     1081 gtgcgatttg gatcgccatg tccgttttgc cgctgcaagt cggtccgtag atgagatgga
     1141 gtaacatgtg cgcctcatat aaatttaatg ggatatccca tctgttccaa accagatttt
     1201 tcaattcgga ttatgatgtt ttactgcatg atttttacca gctttacgca atctgctgaa
     1261 ctaatatggc gctacatttg taatcgtaca tttttgacgt tgccggctca ttcggccatg
     1321 agagtttcca tcccgttgtc ctcctgaaca attgaggagt ttggccccga cacctgctta
     1381 tggtgaatgc agcggacccc gttaacgcaa cgtcgaaaag cgataggcga tttgagtagg
     1441 aagccacatc ggccgcgcgg tatgctccgt cagcacctga cagttcgtgt tgcagctttg
     1501 cttggtcacc tgaaacgaaa aatgccgctg aagaatattt aacggatatg cacacgaaaa
     1561 agccggttgg gtctcatatg ccgctgacgt cctggcttcg tttcgaagcc gacagcctct
     1621 aaggtggtac gaacacagac ctaatatccg ctagtgtccg cggcagccgc cacggcagtt
     1681 ctcgctgcgc atagcatagc aacttgccct gtcgacgcat gatcagccat attgttaatg
     1741 tcgacgattt catttttctc ggtgcaaacg gcgacatcct gaccatatcg atcctaccag
     1801 ccaccgcgcg gagtcaaaga acacctcggc aaccggaaac atccattgga atgccgaggc
     1861 tgtgtttcat ttgaaacaca ctgagtcgac gtttgttgct tcaaacccat ttacaaaccc
     1921 tactgtgcgg cctaagggcc aacggggtgg gactgagcgc tggtacgagg acgtaagtgc
     1981 ggatgaatgg aaggtattca ccgtctcggc aagatttcaa gacaggcgcc aagccttggt
     2041 ctatcctggc cttggtagtt gctgcaatga tttttgcctt gatggcgatt acgtcttggc
     2101 aggacaatga aaccaatcgg gcgatcctga cccaattgcg agctattaac atcgacagtg
     2161 cttcgctgca gcgggatgta ctctccgcgg aagcgggtgt ggtggcgaac taccggccca
     2221 ttatctccag gttgggagct ttgcggaaga acctggaaaa tttgaagcga ctatttaaac
     2281 aatctcatct tgtgatcggc aatgatttct ctcaactgct cgacaagcta aaggtgtctg
     2341 tggatacgac cgacgcggcc gttgcagcct tcggagcgca aaacgtgctc ctgcaagatt
     2401 cgcttgccag cttcactcgc gcgcttagta ttcttcccaa aatgtcgtcg acggatcaga
     2461 cggtcgaaaa ttcgaacgaa ttgggcagcc tgatgctgcg gtttgtgcgt cagccaagcc
     2521 cagcactctc gttggagatc agtcacgaac tcgacatgct ccaaaaagct agcggtgggg
     2581 ctgaagttcc tatccgtata cttgcacgcg agggtcgcgt catcttgtcg attttgcccc
     2641 gtgtgaacga tgccgtaaac atgattcaga cctccgacac cgctgaaatt gccgaaagat
     2701 tggagcgcaa gtgtttggag gcctatagct tgcaaagcgt gagggagcag cgggcacgga
     2761 tcttcttggg ttccgtttcg gtgggccttt gcatctacat catctcactg gtctataggc
     2821 tgcgtcggaa aacggcttgg ttaacgcggc gtttggatta cgaagaggta atcaaagaga
     2881 ttggggtttg tttcgagggg ggaggggcca cagcgtcgtc cctcaattcg tccgcgcaag
     2941 ctgccgttgg aattattcaa cgcttcttta atgcggaatc gtgtgcacta gcattggtgg
     3001 accatggtga caggtgggct gtcgaaagtt tcgctgcgaa gctgcctgag cccgtctggg
     3061 aggacctcgc gctacgcgag atggtttctc ttgccagagc ggatgagcgt gcgtcagtat
     3121 tccgcatcat gtcgacgcga aaggtcagct gccttcctcc ggagactccg ggcgtttcta
     3181 tgctgctggc acacaaatct acggatcaac tgatagcgat ttgttccctc ggttaccagg
     3241 gctatcgtct gaaatcttgt ccaggcgaag ttcagcttct tgaactcgcc accgcctgcc
     3301 tctgccacta tatcgatgtc cggcgtaagc agaccgaatg cgattttctg gagaggcgat
     3361 tagagcatgc ggaacgtctt caggcagttg gtacacttgc tggtggaata gcgcatgagt
     3421 tcaacaacat tttgggagca atcctcgggt acgctgaaat ggcgcaaaac atgctgcgtc
     3481 ggtcatctgt cacccgaaga cacattgacc aaattatttc gtcgggtgac agagccaggc
     3541 tcattatcga tcagatcttg actctaagtc gaaaactaga acgcgtgaca aagccgttca
     3601 gtgtctccga actcgtaatg gaaattgctc ccttattgcg tgttgcattg cagcgcaaca
     3661 tcgagctgaa gttcaagttt gacgacaaga agagcgtagt cgaaggaagc ccgctagagg
     3721 ttcagcagat gctgatgaat ctttgcaaga acgcttccca ggcgtttacc gccgatggtc
     3781 aaatcgacat catcgttagc cgaatttttg tatctcgaca gaaagtactg gcgcatggtg
     3841 ttatgccagc tggcgactat gttcttcttt ctgtcagcga cgacggtgaa ggcattgccg
     3901 aaactgtgct cccccacatt tttgagcctt tctttacaac acgctcttgc agcggcggca
     3961 cgggtctagg tcttgctgcc gtgcacggtc atgtcagcgc gcttgcagga tatattgacg
     4021 ttacttcagc cgtagggcga gggacgcgct tcgacattta tctacctcct tcctcgaaga
     4081 agcccgtcag cccggacgcg ttttttgggc cctgtaaaac accgcgtggc aacggagaaa
     4141 ttgtggcatt gattgagcca gatcctgtct tgcgggaggt gtacgaagac aagatcgccg
     4201 ctctgggcta tgagccggtg ggctttaaga catgtgcaga cctttgcaat tggatatcaa
     4261 aaggcaagca agccgatctg gttctagttg accaatcgtc tcttcccgag aatcagagtg
     4321 ctactgcttt gcacgcagcc ttcaagacgg cgtccatcat cattggagga agtgatctta
     4381 aaatgtcact ttccagcgat gacatgacgt ccgcgctttt tctgcccaaa cctatatcct
     4441 ccaggacgat ggcctacgcg attcgtacca agatcaaagc ctagagttga ggatgttttt
     4501 caggagactc gaccagggta atcagcacgt ctgttcggcg cgcctatagc tatggtttct
     4561 cgcaatcgct ttgggcagtc ggttgtggcg gtgcgtcttc tgagacccga aggttggcga
     4621 ttatctgctg gaggtcttcg ataccgtctt cgaagaacaa gacatatgtt cgctgagctc
     4681 ggacatagga tagtcggcat aggcggatac cctcgccacg caaatggcgc gatcgacacc
     4741 ctgatcggag cctgcgaagt gccacagttt attggaacca tcccccgttc attccgccct
     4801 ggaatttcac ttcgacggca ttctagatcc tcgccgttgt cttagagagc catctttcgt
     4861 cggcgcaccc ttaagtgcat ggaaagccgt tttcgcttca aatgaaatcg aaaagaagaa
     4921 aacaaaaatc ctagagtaac cgaccctccc gataatcgtg aacatcagat cacagcattt
     4981 cttccgaccg aagtggctgt gttggttatg agcttgggga gattgggaat gttgaaggca
     5041 acagggccgc tgtcgattat cttactggcc tccacgtgca cgtcgagtgg tgctgcccca
     5101 ctttcatttg ctgagttcaa taattttgca cgcgaatgcg ctccatccgt tgctccatct
     5161 acgcttgcag cgatcgctca ggtcgaaagt cgctttgatc cgcttgctgt gcatgacaat
     5221 accaccggcg aaacgcttca ctggcagaac caggctcaag caacgcaagt cgtgatggac
     5281 ggtctcgaag cacggcattc gctggatgtt gggctcatgc agatcaattc ccggaatttt
     5341 tccgtgctcg gtctgacacc tgacggagcc cttcaacctt gcacgtcatt atctgtcgcc
     5401 gcaaacttgc ttgggagccg ctacgctggc ggcaacacgg ctgacgacga gcaattgtcg
     5461 cttcgtcggg caatctccgc ctataacacc ggtgatttca cgcacggctt cgcgaacggc
     5521 tacgtgcgaa aagttgaaac ggccgctcaa cagctcgtcc ccccgttaac cgcgcgtcca
     5581 aaagatgatc gtgagaagcc gggatccgag gaaacatggg atgtttgggg ggcatataag
     5641 cgtcgttccc cggagggcgg agctggcggg tcatccggtc cgcctccgcc gccagacgag
     5701 gacaaccgca aatccgaaga cgacgatcaa ctcttgttcg acttaaatca aggaggtccg
     5761 caataatgcg atgctttgaa agataccgtg tacatctgaa tcgcctctcg ctctcgaacg
     5821 cggtgatgcg catggtatcc ggctatgcgc cgagcgtggt cggtgcaatg gggtggagca
     5881 ttttctcctc tgggccggcc gcggcccaat ctgcaggtgg cggcactgac ccagccacaa
     5941 tggttaacaa catatgcacg tttatccttg gtccgttcgg ccagtcactc gctgttcttg
     6001 gcatcgtggc catcggaatc tcctggatgt tcggtcgcgc ttcactcggt ctcgttgccg
     6061 gcgtcgtcgg cggcattgtc atcatgtttg gagccagctt cctgggcaaa acgctgactg
     6121 gaggtggcta atgaatgatc gtctggaaga agcaaccctt tacttggcgg cgacacggcc
     6181 cgcattgttt cttggcgtgc cgctgacgtt ggcggggcta ctcgtgatgt ttgccggctt
     6241 tgtcatcgtc atcgttcaga acccgctgta cgaagtcgtt ctcgtgccct tgtggttcgg
     6301 agcgcggctt gtggtggaac gagactataa cgcggccagc gtcgttctac tttttttgca
     6361 gacggcggga aggagcgttg atggtctgat ttggggcggc gcaagcgtta gcccaaatcc
     6421 aatcaaggtt cccgcgcgag ggagaggaat ggcgtaatgc tcggagcaag tggcacaacc
     6481 gaaaggtctg gcgaggtcta tctaccctac gtcgggcacg tcagcgacca tattgtcctt
     6541 ctagaagatg gatcgatcat gacgatggcg cacgtaagtg gcatggcctt cgaactcgaa
     6601 gatgccgaaa tgcgcaatcg acgttgccgt gcatttaata cgctcttgcg caatatcgct
     6661 gatgatcatg tgtcaatata tgctcacctc gtacgtcatg acgatgtgcc gccgtcaccc
     6721 gcgcgacatt tccgcagcgc tttttccgcc agtctgagcg aagcttttga ggagcgcgtt
     6781 ctctccggca aactccttcg caatgaccac ttccttacgc tgatcgtgtc tccccgggcc
     6841 gcgcttggca aagtgaggag aaggttcacc aaacgctaca gacaaaaaga aaacgatctc
     6901 acagctcaaa ccaggaacct ggaagatctc tggcatcttg tcgctggcgc tctcgaagcg
     6961 tacggcctgc gtcgtcttgg tattcgtgag aagcaagatg tgctttttac ggaggttgga
     7021 gaagctctgc ggctgataat gactggtcga ttcacgccgg ttcccgtcgt tagcggttcg
     7081 ctcggcgcct cgatctatac cgaccgagtt atttgcggca agcggggact cgagatccga
     7141 acaccaaaag atagttatgt gggatctatt tactcgtttc gcgaataccc cgcaacgacg
     7201 ggaccgggta tgctcaacgt gctactgtct ctcgattttc cgcttgttct gacgcagagc
     7261 ttctcgtttc tgactcgctc gcaagcccac tcgaagctca gcctcaagtc cagccaaatg
     7321 ttgagttctg gcgacaaagc cgtcacccaa atcagcaagt tatccgaggc ggaggacgca
     7381 ctagcgagca acgaattcgt attgggggcg catcatgtga gtctttgcat atatgcaaat
     7441 gatctcaata atcttgcaga tagaggtgcc cgcgcccgga cgcgattggc ggatgcggga
     7501 gctgttgttg tccaagaggg catcggcatg gaggcggctt attggtcgca gctgccaggc
     7561 aactataagt ggcgcacgcg tccgggagcg atcacatcgc gcaacttcgc tggtttagtc
     7621 tcattcgaga attttcccga gggatccggc tcaggtcact ggggcaacgc gattgcgcgc
     7681 tttcgtacca atggtggaac ccctttcgac tacatcccgc acgagcacga tgtcggcatg
     7741 acggcgatat tcggtcccat cgggaggggt aaaacgacgc tcatgacctt tatcctcgct
     7801 atgctcgagc agagcatggt cgaccgcgcg ggtgcggttg tcctcttcga caaggaccgc
     7861 ggcagtgagc tgctggttcg cgccaccggg ggaacatatt tggcgctccg tagaggagga
     7921 agcggattgg cgccattgcg tggcctggaa aatacagcgg cttcacatga ttttctgcgc
     7981 gaatggatcg tggcgctcat tgagagcgat ggccgtggag gaatatcccc ccaggaaaat
     8041 cgccgtctgg tgcggggtat ccatcggcag ctctcgtttg atccccacat gcgctcaatc
     8101 gcggggttac gcgaattttt gttgcatgga cccgccgaag gggcgggagc gagactccaa
     8161 cgctggtgcc gtggcaatgc actaggctgg gcgttcgacg gcgagctcga cgaagtaaag
     8221 ttggatcctt cgattactgg tttcgacatg acgcatcttc tcgaatatga ggaagtatgc
     8281 gctgccgcgc gagcatatct tctgcaccgc attggagcca tggttgacgg ccgtcggttt
     8341 gtgatgagtt gcgatgagtt tcgcgcctat ttgctaaatc ctaaatttgc ggcggtcgtc
     8401 gacaagttcc tgcttactgt ccgcaaaaac aatgggatgc tgatactggc aacgcagcaa
     8461 cctgagcatg tcctggaatc gcagctaggc gccagtctcg tcgcgcaatg tatgacgaag
     8521 attttctatc cttcacccac ggcagatcga tcggcttaca tcgatggact gaaatgtact
     8581 gaaaaggaat ttcaggcgat ccgtgaagac atggcagtag gtagccggaa gtttctgctt
     8641 aaacgagaaa gcggaagcgt cgtctgcgaa ttcgatctgc gggaaatgcg cgaatatgtc
     8701 gccgtacttt cggggcgcgc caacacggtg cgcttcgcgg atcagcttcg caaagtacag
     8761 ggggacaacc catcagcctg gctcagcgaa tttatggctc gttaccacga ggcaaaagat
     8821 tgatcaaagg tggggaacta tgaagatcat gcaacttgtt gctgcggcca tggccgtcag
     8881 ccttctttcg gtcgggcccg cgcgggcgca gttcgttgtc agcgatccgg cgacggaagc
     8941 tgagacgctg gcgacggcgc tcgagactgc ggcaaatctc gaacagacca taacgatggt
     9001 ggcgatgtta acctcggctt atggcgtcac cggcctacta acttcgctca accaaaaaaa
     9061 tcagtatccc tcgaccaggg acttggacac ggaaatgttt tcgccgcgaa tgccaatgtc
     9121 gaccacggca cgtgcgatca ccaccgatac agatcgcgcc gtagttggtg gcgacgctga
     9181 agcggatctg ttgcgatcgc agatcaccgg ttccgcaaat agcgcgggca ttgcggctga
     9241 caacctggag acgatggaca aacgcttaac agcgaatgcc gagacctcga cacagctttc
     9301 tcgctctcgc aatatcatgc aggcaaccgt taccaacggt ttgcttctca agcagatcca
     9361 tgacgcaatg attcaaaatg tacaggcgac cagcttgtta acgatgacca ccgcgcaggc
     9421 tggccttcat gaggcggaag aggcggccgc tcaacgtaag gagcatcaaa agactgcggt
     9481 catctttggg gccgtcccct gagactgggc gacttattcg tgccgcccat ctgcacattt
     9541 caggatgcga gttcattgta tccgacattc tgcgacaagc cagtcaaggt caggtccaaa
     9601 cgatgaattt cacgatcccg gcgccgttta cggccattca tacgatcttc gatctagcct
     9661 ttacgacaag cctggacaca atgcttggga cgatccaaga ggcggtgagc gcgccattgg
     9721 tcgcctgcgt cactctttgg attatcgttc agggtattct ggtcatgcgt ggcgaaatcg
     9781 acacgcgcgg cggtatcact cgggtgatca cggtcaccgt cgttgttgcc cttgtcgtcg
     9841 ggcaggccaa ctaccacgac tatgtggttt cagtctttga agagacgatt ccaaacttta
     9901 ttcagcagtt tagtggcagc ggcctgcctc tgcagaccat tcccgctcag ctcgatacaa
     9961 tgttcgccct aacccaggct gcatttcaga gaattgcatc tgaaatcggc ccgatgaatg
    10021 accaggacat ccttgctttc cagggggctc agtgggtctt ttacggcacg ctctggtctg
    10081 ccttcggaat ctacgacgcc gtcggaattc tcacgaaagt tcttctggcg atcggccctt
    10141 tgatcctcac gggatatatc tttgatcgca cgcgcgacat cgcggcaaag tggatcgggc
    10201 aacttatcac ctatggtctc ctacttctcc tgctaaacct cgtggcgaca atagtcatcc
    10261 taaccgaagc gactgcactc acgcttatgc ttggtgtaat cacccttgcc ggtacgaccg
    10321 cggccaagat cattggtctt tacgaactcg acatgttctt tctgactggt gacgcactca
    10381 ttgtcgcttt gccggcaatc gccaacatcg gaggcagtta ttggagcggc gcgacccaat
    10441 ctgccaacag cttgtaccgt cgcttcgctc aggtcgatcg tcgttaggtc ccgcaaaaaa
    10501 attcacgtcg gtggagaact ctatgaaata ttgcctgctg tgcctggctc tcgctttaag
    10561 cggctgccag acaaacgata aattggcgag ctgcaaaggc ccaatattcc cgctgaatgt
    10621 ggggcgatgg cagcctacgc cgtcagatct tcagctcagc aacgtaggtg gtcgccatga
    10681 aggggtctga atacgccttg ctagtagcgc gggaaaccct ggctgagcac tacaaggaag
    10741 tggaagcttt ccaaactgcg cgtgcgaaat cagcgcggcg cctctccaag gtcattgcag
    10801 ctgtcgcaac catcgcggtt ttggggaatg ttgcgcaagc cttcacaatt gccaccatgg
    10861 tgccgctgat caggcttgtg ccggtatatc tttggatacg gccggatggc accgttgaca
    10921 gcgaggtgtc cgtctcccga ttgcctgcaa ctcaagagga ggccgtcgtt aacgcctcac
    10981 tgtgggagta tgttcggctg cgcgagagct atgatgccga taccgcccag tatgcctatg
    11041 acctggtctc gaacttcacg cgcccaatgg tgcgccaaaa ttatcagcaa ttcttcaatt
    11101 atcccaatcc aacttcgcct caagtcatcc tcggcaaaca cggcaggcta gaggtcgaac
    11161 acatcgcttc gaatgatgtt actccgggtg tgcagcaaat tcgctataag cgaaccctca
    11221 tcgttgacgg caaaatgccg atggcgagca cttggactgc tacggttcgt tatgaaaagg
    11281 tgaccagctt gcccggcaga ttgagactga ccaacccggg aggcttggtt gtcacctcct
    11341 accagacatc ggaagatacc gtttcgaacg caggccacag cgaaccatga cgaaaaaagc
    11401 atttctcact ctggcatgtt tactttttgc ggcgattggc gcgagggctg aagacacgcc
    11461 aacggcgggc agacttgatc cgcgcatgcg ttatctcgct tacaatcccg atcaagtggt
    11521 gcgcctttca acggcggttg gagccacttt ggttgttact ttcggggcta acgaaacggt
    11581 gacagctgtt gccgtttcca atagcaaaga tctcgcggcc cttccacgcg gaaattatct
    11641 tttcttcaag gctagcaagg ttctcccacc ccagccagtg gtcgtgctaa ctgcgagtga
    11701 cgccggtatg cgacgctacg ttttcagcat ctcttccaag acgctgccgc acctcgataa
    11761 agagcaggcc gatctctact atagcgtaca attcgcttac cctgccgatg acgcagcggc
    11821 tccgcagaag gcggcacaag agaaggctgt tgcagaccgt atacgtgcgg aagcgcaata
    11881 tcaacagaga gcagagggtt tattggagca gcctgccacg accgttggtg ccgaggacaa
    11941 gaattggcac tatgtcgctc agggcgatcg ttcgctgttg ccgctcgaag tcttcgatga
    12001 tggatttacg acggtatttc acttcccagg taatgtacgc ataccctcca tctacacgat
    12061 aaatccggat ggaaaggaag ctgtcgctaa ctattcagtc aaagggagct atgtcgagat
    12121 ttcttcggtt tcccgtggtt ggcgtctgag ggatggccac acggtattat gcatttggaa
    12181 taccgcctac gatcccgtcg gccgcaggcc ggagacgggc actgtgaggc ccgatgtgaa
    12241 gcgcgtccta aaggaggtga gaggatgaac gacgacaatc agcaatcggc gcatgatgtc
    12301 gatgcgtcgg ggtccctggt ctccgacaca catcaccggc gcctttcggg ggctcaaaag
    12361 ttgatcgtag gaggtgtagt tctcgcgcta tcacttagcc tcatttggct tggcgggcgt
    12421 gaaaagaagg aaaacgggga cgcacccccg tcaaccatga tcgccacgaa caccaagcca
    12481 tttcatccgg ctccgattga cgttacactt gatcctccgg ctgcccagga agctgttcag
    12541 ccgactgctc ctccgccagc acgaagtgag ccggaacggc atgagccgcg gccggaagaa
    12601 acaccgattt ttgcgtacac cagtggtgat caagggacca gcaagcgcgt tcaacaaggc
    12661 gaaacggacc gaagacgcga aggcaatggg gaagacagtc ctttgccgaa ggtcgaagtg
    12721 tccgccgaga atgatctctc gatacgcatg aagcccaccg agctgcagcc caccagggct
    12781 acgctcttgc ctcatcccga cttcatggtg acggagggga cgattattcc atgtatcttg
    12841 caaacggcaa tcgacaccag tctggcaggc tatgtaaaat gcgtgttacc ctgggatgtt
    12901 cgtggaacaa cgaacaacgt tgtgcttctt gatcgcggca ccaccgttgt tggcgagatc
    12961 cagcgcggtt tgcaacaggg agatgcgcgt gtttttgtgc tctgggatcg ggcggagaca
    13021 cccgaccatg ccatgatttc gcttgcgtca ccaagcgctg acgaactcgg tcgctcggga
    13081 ttgccgggca ccgtcgacaa tcacttctgg cagcgcttta gcggggccat gctcttgagt
    13141 gtcgtccaag gtccgttcca ggcagcgagc acctacgctg gcagctcggg tggagggacg
    13201 agcttcaaca gcgtccagaa taacggtgaa caaacggcag acacagccct caaggccacg
    13261 atcaacatac cgccaaccct gaagaagaat cagggcgaca cggtctccat ttttgtcgca
    13321 cgggatctcg atttctcagg catataccag cttcgtatgg ctggtcgcgc ggcgcggggg
    13381 cgggatcgcc gtccataacg aattcaactt atcacttaca gatggagata cgatacaaat
    13441 ggaggtggat ccgcaattac gaatccttct caagccgatt ttggaatggc tcgatgaccc
    13501 gcggaccgaa gaagttgcga taaatcgacc tggggaggca tttgtgcgcc aggccggcgc
    13561 cttcctcaag ttccctttgc ctgtctccta tgacgatctc gaagatatcg ctattttagc
    13621 aggcgcgctg agaaaacagg acgttggacc acgcaaccca ctttgcgcaa ctgaacttcc
    13681 agacggcgag cggctgcaga tctgtttgcc gccgacggta ccatcgggca ccgtcagctt
    13741 gacgattcga cggccaagtt cccgtgtttc tagtctcaaa gaagtctcgt cccgttacga
    13801 tgctccgagg tggaatcagt ggaaggaacg aaaaaaacgg catgctcagc atgatgaagc
    13861 tatccttcgg tactatgaca acggggatct ggaggcgttt ctgcacgcat gtgtcgttgg
    13921 tcggttgacg atgctgcttt gcggacccac cgggagtggc aagacaacga tgagcaagac
    13981 cttgatcaac gctatcccgc cgcaggaaag gctgattacc atcgaagata cgctcgaact
    14041 cgtcattcca cacgagaacc acgtaaggct gctttattct aagaatgggg ctgggctggg
    14101 cgcagttacc gctgagcacc tgctacaggc tagcctgcgc atgcgaccgg accgaatact
    14161 gctcggcgag atacgcgacg atgccgcgtg ggcttatctg agtgaagtcg tctcagggca
    14221 tccgggatcg atttccacaa tacatggtgc caatcccgtc caaggtttca aaaagctatt
    14281 ttcgctcgtg aaaagcagcg ctcagggggc tagcttggaa gatcgcaccc tgattgacat
    14341 gctcgcaacc gcagttgatg tcatcgtacc cttccgtgcc cacggtgaca tttacgaggt
    14401 gggcgaaatc tggctcgctg ccgatgcgcg tcggcgcggt gagacaatag gcgatcttct
    14461 taaccagcag tagttgtgat ccatgtttct aaatgccgca tggcgcgttg tagaattacg
    14521 tttgtagcaa tgctcagcaa tctttgtcat caaacggaga catctagttt gcatttctgt
    14581 cgtgcgcggt ttggtcgaaa tcttgccgaa atgcccgtgt agtgagagaa aattaaagag
    14641 tggagtctag caaatacaac ctttacgtgt ataaattctg ttgagctgca aatggctggc
    14701 caggatccta gattgagagg tgaaccgttg aaacacgttc ttgtcatcga tgacgatgtc
    14761 gctatgcggc atcttatagt cgagtatctt acgatccatg cctttaaggt gactgcggta
    14821 gccgacagca agcagttcaa tcgtgtactc tgctccgaga cggtcgatgt cgtggtcgtc
    14881 gatcttaatt tgggtcgcga agatgggctt gaaattgttc gtagtctggc cacgaagtcc
    14941 gatgttccaa tcataattat tagcggcgct cgcctcgaag aggcggacaa agttattgcg
    15001 ctcgagttgg gagcaaccga ttttattgcc aagccttttg ggacgcggga atttctggcg
    15061 cgcatccgtg ttgcgttacg cgtgcggccc agtgtcgcgc gaaccaaaga tcgacgctca
    15121 tttagtttcg ctgactggac acttaatctc aggcgacgcc gcttgatttc ggaagagggc
    15181 agtgaggtga aactcacggc aggtgagttt aatctcctgg ttgctttcct ggagaagccg
    15241 cgcgacgtcc tatcccggga gcagcttctg atcgccagtc gggtacgcga ggaggaggtg
    15301 tatgacagaa gtattgatgt cctcattttg cggctgcgcc ggaagcttga gggggatccg
    15361 acgacccctc agttgatcaa gactgcaaga ggtgctggct atttctttga cgctgacgtg
    15421 gatgtttcgt acgggggtgt gatggcggcc tgaggtagag gtgcatttcg cctttagcaa
    15481 tctgttccca acgtgagcag attgctatgc ggcttggcaa agctgccttt cctcggtact
    15541 atccgaaaaa ctcagcactg cggagtgatt ggatgggtcc tatcttttga gagatcagct
    15601 gttcgttgcc ttctcccgag caaagaaaca tgcaagcgct gcggtagcca gcttgtggcc
    15661 gaaagcccgg gcggtctcca atcccaatgg atcaaagtga ttgcgagcgg cttctattag
    15721 cgagaccggg aacatgcggg aggtctgaac gatgattgat ttttcgaaag ctgtgtgagg
    15781 gatcggataa ctcttcggag ccgcacgaaa cgatccatcc gccagcatgt tttcaaaatc
    15841 gccaagcgca cggcgcaaga tcatttgtag cgacttggaa ggactgtatt gcaggatcag
    15901 gttgtcatat atcttcgata cttcaggcgc gggcgggcgc gctgaaagga agacctggat
    15961 cttttctggc gctgtcgtcg aactcaaagc atccacggtc agcatcgatt gctgatcaca
    16021 gctgtgacaa cgcttggcgg tggctggggc aggtcgtcga tcttcctcgt cgagattttc
    16081 aggcggctgc ggcagggtcg agttttgggt ggcaacaggc aaagaaggat ggacgatttc
    16141 gggtcgagcg gcggcaagcc gcctggcctc cccgacagac aaagcgggtt tgcgaattcc
    16201 catcttcacc cctccaaggc ttcgctaacc aatttggaga tagtgacgag ttcctccatg
    16261 gcgattctga gattccgttc gaggaggcgc attgtcggat cggttctcat attcagcaat
    16321 gtgagatgca acatgccacg ttccttcatc gcggcaaatg cgtctctctc gtgcatggga
    16381 gactgtacaa ctggaaggct tgcgagcatg tccgacatcg cgcgctgcga tgtggtcaat
    16441 cgaccaaccg gcacgcgttg gcgcaatacg gctgtcggaa ttgccaagtt ctcgctcagc
    16501 agcagttcaa tgacatagcg gtaggtcgac aatgcttcat cgatatcgag cggagttaac
    16561 atggtcggga tcagaagcag gtttgagctg gcaatgatcg tgttgttgag ttcgctcgaa
    16621 ccaccatgcg tatcggccag cgcataatca aatccctgga gttcggcgtc ctcataggcc
    16681 gcctcaagga gtgccatttc ctcggcggcg tagacttcgc agaaggagcc ccaggtattg
    16741 ctgcgaaggg cgttttcttt ccatcgcgtt agtggtcggt tttcatcagc gtcgaagaga
    16801 gccaatcgtt tgccgtcact tgcaaaggcg gcgcaaagac ccatgagtgc cgtggttttg
    16861 ccggctcctc ccttgaagga gcaaaatgtc agaagtttca tgtccttatc ctgtcgattt
    16921 tgtgaagcgg aagtgtgtct gtacttttat ttgtgtgtat gattttgcga taattcataa
    16981 gtaatgtagt aattacctga ttttatattt caattttatt gtaatataat ttcaattgta
    17041 ataatataaa aataaatatc ccttatgtgt tcttgatttc gttttgtata tggctagatt
    17101 cccatctgcc acgacgagga aatgctacgg cggggcaagt tccaagatct ttccgtcttc
    17161 tatggaggaa gctatgtcgc aaggcagtag gcccacctca agtgacattg ccgtcaacca
    17221 gcgcgaatgc gtgaaggttg aaggcttcaa ggtcgtcagt acccgattaa gatcggccga
    17281 atatgagagt ttttctcatc aggcacgctt gctgggcctc tccgacagca tggccatacg
    17341 ggttgcggtg cgccgcattg gtggctttct tgaaatcgac gcagagactc gtcataggat
    17401 ggaggccata ctacaatcca taggaacact ctcaagcaac attgccgcgc tgctatctgc
    17461 ctatgccgaa aatccgacaa tggatttgga ggctttgcga gctgaacgta tcgccttcgg
    17521 taaatctttc gctgacctcg acggcttgct ccgttccatt ttgtccgtat cacggcggcg
    17581 gatcgacggt tgctcgctgc tgaaagacgc cttgtagcac tgacgtagca cttggcgggg
    17641 aacatattcg atgcccgatc gagctcaagt tatcattcgc attatgccgg gaggtggcac
    17701 caagaccctt caacaaatta tcaatcagtt ggagtatcta tcccggaagg gcaggctgga
    17761 gctgcagcgt tcagcccgac atctcgatat tcccctgcca ccggatcaaa tccacgaact
    17821 tgcccgaagc tgggttcaag agactggaac ttatgacgaa agtcagccag acgaggaaag
    17881 gcaacaggag ttgaccaccc atattattgt aagcttcccc gccggtacaa gccaggtagc
    17941 ggcttatgcg gcgagccggg agtgggcagc cgagatgttt gggtcaggcg cagggggggg
    18001 ccgatacaac tatcttacgg ccttccacat cgatcgcgac cacccacatc tgcatgtcgt
    18061 cgtcaatcgg cgcgaacttt taggacacgg ctggctgaag atatctcggc gccatcccca
    18121 actgaattac gacgccctgc gcataaagat ggccgagatt tcacttcgtc atggcattgc
    18181 cctcgatgcg agccgacgag cagaacgtgg catcaccgag cggccgatca cttatgccca
    18241 atatcggcgc cttgagcggg agcaggctcg ccaaatccgt ttcgaagacg cggatttgga
    18301 acagtcgtcg ccgcaaggag atcatccaga attcagccaa cctttcgata catccccatt
    18361 tgaagcatcc gcgggcggac cggaggacat gcctcggccc aacaatcggc agaatgagtc
    18421 gcaagttcat ctccaggagc cagctggtgt cagcaacgaa gccggtgtcc ttgtgcgggt
    18481 tgcattggag acggagcgcc ttgctcaacc attcgtttcc gaaaccattc tcgcggacga
    18541 catagggagc ggctcttcgc gtgttgccga gggccgtgtg gagagcgcaa accgcactcc
    18601 cgatattcct cgcgcagcaa ctgaagctgc cacgcacacg acacacgacc ggcagcggcg
    18661 tgcaaagcgt cctcatgatg acgacggagg gccgagtgga gcaaaacgtg tgacattgga
    18721 aggcatcgcg gttgggcccc aggcgaacgc cggcgaacag gatggcagta gtggcccctt
    18781 agtacggcaa gctggaacgt ctcggccatc tccaccgacg gccacgacgc gggccagcac
    18841 cgcaaccgat tcattgtctg ctacagccca cctccagcaa cggagaggtg tcctttcaaa
    18901 gcgtccgcgt gaagatgatg atggagaacc gagtgaacgc aaacgcgaga gagatgagcg
    18961 cagcaaggac gggcgtgggg gaaataggag atagagaagg accgagtgat ggcaaatgaa
    19021 gagttcacca gacactatgc gtggcccgtt cctgtggctt cgaatgatga agggcgtggg
    19081 accgcgcgga tccccatcca ggcacaatca atcgttgctg gagaagacgg tcgggacact
    19141 tcggtcccaa cggctttgtc gcgaccgcca attgaagata tgccgcacgg cgtccaagaa
    19201 acatcggcga gtggcggacg actgggagcg gcccgtctgc gggattccgt aatcccgcca
    19261 ggaatatccg aagcccgcac ggacctatcc gcaattttgc ggaaaaaaag cggttctttc
    19321 cgcaccggta tgcagtatct gcgtgggctt gaacgggaaa attttgataa acaagacagg
    19381 gaagccagtg cgttgccaga tttaagtgca aggggcataa agcgaccgcg cgaaattgag
    19441 tatcccggca atgcaagcgg attaaccata aagaggcaag acggcttagg catagagatc
    19501 aatactatct cggcatcttc gcccgtgaac cgggccgcgc attcgtcgaa ctggcaaggc
    19561 gcgccggaac cgggcgtgta caatgttcag ccatcggcag atagagcaca gaactctgcg
    19621 caggaaagtt cgacgtttcc cgatggcact tctgtttccg ctctctattc aggcccactc
    19681 gcagaatggt tcgagaggga tactggcagc gagacgacca ggaattcagg caacaccatt
    19741 tcgtcaccgc tccgagggct ggaggagttc ggcgattccg cggacagccg gtatcttggg
    19801 cgcgaagctc agagtctttc agttaccgta acaacgccta attcgaatgc tgaggcaagt
    19861 tctcatagcg cacacactga aactctcgac gatgtcagca gtgaccgctc gagcgaacag
    19921 gggaggggtc cccttggcgc cgcaatcctt ggatcccatc atgacctttc accgcgtgcg
    19981 cagaagttat cacaaacaaa ccgcgattcc cctgaactga ccgacgccga cctggcaaaa
    20041 gtcgatgcgg tgttcgaatc tctctccaag gggcccccgg gggcggagag cgccgcaccg
    20101 gactttcgtg agcgcggacc aggcagcgct ttccagaaag aaggcgtgtc cgacagggcg
    20161 aacggggtgc ccactaactg ggaggtacct tttggtcgcg gtggcgggca ttcaccgcaa
    20221 gcgctgaggt cctcaggagt ggagctcgat gacttccctg atttcaccga ggccgaactg
    20281 gcaaaaatcg acgcactagt cgaatctcac tccaacagat ctttatcagt gcgaaatata
    20341 gtaccagatt tgcgcggagc gggagcagac aacgtcttcc ggaaagaagg cgttgtcgaa
    20401 cgcgcggaga agatgccgat cgatagtgtg agtcttacca gactgaatgg cgagcgttca
    20461 cgttcaccga agacatcgca agcaagcctt gaggatttcc ccgatctaac cgatgccgat
    20521 ctggcccata tcgaagagtc ggagcggatc gcaagaactg ctgttgaaaa ggggaagcaa
    20581 aaaatctcga ccgaagccga tacgcggttt gatttgggca actcttctgc accccgggtt
    20641 agccctcgat cggttacccc gttggttccc aatgccaatc agccgattac atcgtggttc
    20701 tacgaggcac aaaagacttg tgacaaattg gtagagaaca cttacgtgaa gccggccgtt
    20761 gacagttcta gggcgcgaaa tgacgtcgaa aacaccgctg ccaggcttgg tgaccctgct
    20821 cctgctttgg ggcatgacaa tcttggtaga actcgcgcgt taacgcccgt tcgcgacgtg
    20881 atgtctagac ccagcgcgga ccgacaactg gctagtcatg cggcagaaca ttcggcaata
    20941 gacgacattt ggaaacggga tgaccgtgat cgcaggacac acccttatag aggattggat
    21001 tcgcgttcac gtgaaggttg cgggcggtaa aggctgtgtt gttcgcagca acacccaaat
    21061 caaaagccca atttgcgatc gatcgaagaa tctcggtgaa gtgatgaatt cgagcaagac
    21121 tacgccccag cgtttagctg tcagcatcgt atgttcgctg gcagccggtt tttgcgcggc
    21181 aagtctctat gtaacatttc gccatggttt caatggcgaa gcgatgatga cgtttagcgt
    21241 cttcgccttt tggtacgaga ccccgctcta tatgggtcat gcgacccccg tcttctattg
    21301 cggtttagcc attgtcgtct cgacgtctat tgttgtgctg ttaagccaac ttatcatatc
    21361 gtttcgcaat cacgagcatc atggcacggc tcgttgggcg ggatttggcg aaatgcggca
    21421 cgccggttac ctgcagcgct ataatcgtat caaggggcca atctttggca agacgtgtgg
    21481 ccctcgttgg ttcggcagct atctgaccaa tggcgaacag cctcacagtc ttgttgtcgc
    21541 gccaacgcgc gccggcaaag gcgttggcgt cgttattccg acgctcctaa ccttcaaggg
    21601 ttcggtgata gccctcgacg tcaaaggcga gctttttgag ctgacttcca gagcacgcaa
    21661 agcgggccgc gacgccgttt tcaagttctc gcctctggat ccagagcgac ggactcattg
    21721 ttacaatccc gtcctcgata ttgcagcttt accgcccgag cggcagttta ccgagacgcg
    21781 ccgcctcgcc gcaaacctca tcacggccaa aggcaaggga gcggaaggct ttatcgacgg
    21841 cgcgcgggac cttttcgttg cgggcatcct tacctgtatt gaccgcggta cgccaacaat
    21901 tggtgcggtc tacgacttat ttgctcaacc tggagagaag tacaaacttt ttgcgcacct
    21961 cgcggaagaa agccgaaata aagaggctca gcgcattttc gacaatatgg cgggtaacga
    22021 cacgaaaatt ttgacttctt atacatcagt gctcggcgac ggcggactta acctgtgggc
    22081 tgatccattg gttaaagcag cgacaagtcg atcagatttt tccgtctatg atctgcgtcg
    22141 gaagaggacc tgcgtctatc tttgcgtcag tcccaacgac ctcgaggtcg tggcgccatt
    22201 gatgcgcctc ctttttcagc aggtcgtgtc aattctgcag cgatcactgc caggtaaaga
    22261 tgagcggcat gaagttttgt ttctccttga cgaatttaaa cacctgggga agcttgaggc
    22321 aatcgagacc gcgattacaa ccatcgccgg ttacaaaggc cgcttcatgt ttattattca
    22381 gagtctctcg gccttgacgg gcatctacga tgacgcgggc aaacaaaact ttctcagtaa
    22441 cactggcgtg caagtattta tggccacggc cgacgacgaa acaccgacct atatctcaaa
    22501 agctatcggc gactatacgt tcaaggcgcg ttcgacctca tacagtcaag cccgaatgtt
    22561 cgatcataac atccagattt ccgatcaggg ggcaccgctt ttgcgccccg aacaagtgcg
    22621 cttgcttgac gataacaatg aaatcgtcct tattaaaggg catccgcctc tcaaactacg
    22681 aaaggtgcga tattattccg atcgtatgct gaggcgcctt ttcgaatgcc aaattggcgc
    22741 cctccctgag cccgcatctt tgatgctttc ggaaggtgtc catcgggatg ggcaagacct
    22801 cagtcaacaa gccgccgtca cggaggcagc tagtactcga tcgataccta ataatatgga
    22861 agccgctaca ccgcaaaaca gtgaaatgga tgacgagcag gacagcctcc caactggaat
    22921 tgacgtcccc cagggcctta ttgaaagcga tgaggtgaaa gaagacgccg gtggcgtggt
    22981 gccagatttc ggtgtttcag ctgaaatggc tccagctatg attgcacaac agcagctgct
    23041 ggagcagatc attgcgcttc agcaacgata tggacccgcg tcctcgcact cggtgaaatg
    23101 acttcaagct gcctttcaca ttggaatcat tcgatgcggg gcgtcgtcag atatccggtt
    23161 gtattcgaat gcggactaga accccgaggt tgccgattaa cagatatcgc catgagacct
    23221 tcaggaaacc cgaacgtcga tcttagcggt tcgactgcat cgcttgccga agttcccgcc
    23281 ggagctaccc ctgtccttaa tctaatcgag cccaggaacc gtccggctga cgactcgctt
    23341 gagggccaaa ccgatcgcgg cgagcatcca tctgcatcat ttgactatga tggcatgaag
    23401 cttggcgccg cggagcgtga agcatacgag aactggtgtc catcgaaccg gcctacatgg
    23461 aaagatctgg tactcagggc gcgccttgat gcaatcgaca gttccgcttg gctccccgat
    23521 ttgggcgagg agtcgccttt gatcttcaga tatgaaggga ttccgctggg tgagggggaa
    23581 cggcaagcct acaaagaatg gcaagaggag gctcagccca catgggaaga cctcgttgtc
    23641 aacgcacgaa tggcggtacc tgatccttgt gctgacgttg cagacgagca caatcccctc
    23701 aaagaaggcg aggagtttcg gtctgaagcg tcgaaacgca agcggaaaaa accgatcgac
    23761 caggacgaga attctcctac atcgttttac tatgacggga tgaggctcgg agaacccgag
    23821 cgcgaggcat atgataactg gggcaacgcg gagccgccca cgtggaaaga cctggtactt
    23881 aaggcgcgcc ttgatgcaat tgacagctcc gcctggctct ttgcttcaga agggtcttcc
    23941 tcgacttttg agtatgaggg aattccactg ggtgaggggg aacggcaagc ctacaaagaa
    24001 tggcaagagg acgctcagcc cacgtgggag gacctcgtca ttaatgcacg catggcagaa
    24061 ctcgaccatc cttcttggat tacagacgag cacaattccc ttgaagaaaa cttagagttt
    24121 cggcccgatg caagacaggc cagcctgaag gactcgaccg accagcggaa gagttcttcc
    24181 gcgtcattta tctatgatgg aatgaagctc ggggaacccg agactgctgc atacgagaac
    24241 tggagcaaac cggaacgacc gtcatgggaa gccctcatcc tagatgggcg ccaggcttcc
    24301 atagcaagct cttcggtttc gaattcgtta cttgcaaaga catcctcgcc agtctttcta
    24361 tacgagggaa tgtcgctagg ggatgcggaa cgtcaatcgt atccaaggtg gaggcagcct
    24421 gcccaaccgc gatggcaaaa tcttgtggtg aacgcgcgcc ttgcggatct tgatccgttg
    24481 gcctggattc ccgatgaaca tgatccgttt gcggaagccg aggcgcttag ccccacttcg
    24541 caatcgagtg gcgccaacaa gtccaaccgc gctttgggtc aatcagattc aggccggccc
    24601 gccttcgcat atctggcagc acaagaggcg agtcacctgc aatcaccggc atgctcacaa
    24661 ttggaaacga ggcgtgcatt aaatttcggg tcgcctggac cggatgcaaa tccaacggaa
    24721 agcatcgcca aatgtaatcg tttggatggc gtcagtaaaa ttaagcgatt aggcaccaaa
    24781 ggccgccggg cagtaaacgc gactattcac ggcggcaagt tcggtgcgca aggactgttg
    24841 tccgaagatt gcggacaagc ggctgagccc tctccatcgg aacaaactac tcggccgcga
    24901 accgataata ttggtaccta tgcaagccgg aaaaacgaac gagctcggct ggctacagag
    24961 accgggaaat acgaatcgga gcacattttc ggatttaagg tcgtccacga tactttgcgg
    25021 gcgaccaaag aaggccgtcg tctcgaaagg ccaatgccgg catacctcga atacaaggag
    25081 cttcatcggc aacacgtagg gacaggaaga ggacggactg gactggtcgg gcgcggatgg
    25141 ccagatgatg cgagctatcg cgcagatcaa agggcaaccc tgtcggaccc tgttgcgtcc
    25201 gcggaaggtg caacagcttc aaatgggtac cagttgaacc agctgggcta cgcgcaccag
    25261 ctcgccaccc atggtctcca aagtgaaacg cccgatgggg ttaccatgcc aattcaagtt
    25321 gcgacaatta gctacaacta tacagtgagt cgtgatccgg tcctttcacc acccagcaaa
    25381 gaacaagcgc ccccattgct acacctgggc cttcgtgggc aaacggaagc tgtgcttgcc
    25441 cgcgagaccg cattgaccgg aaaatggcca actctcgagc gtgagtgtca agtgtatcag
    25501 cggtttttgg ccctgtacga cgtaaagaag gatctggacg ccaaacaact tggtgtacgg
    25561 cagaaaaagc aggcactcgt tgcggcgttg aaccggaccg ccggcttaat tggcgcgtca
    25621 cctttggaag cccaatcgtc gaccgctgag gttgaatata ctaccgacga gcccgatgaa
    25681 cgacgggttt atgatccgcg cgatcgaggc agagacaaag cgttcaatcg ctgacaaagt
    25741 cgcctagaca atgattgtta cggacttcgc aatcgatttc ttgacagcag tgccattcaa
    25801 cttttgatcc accatcggaa aataatcggc ggagaaacgc acgacagctt ctattgccaa
    25861 gccccgtcga tcgatctcat ggcatattcc cgttatggtc gatttttcga ggggccgcgc
    25921 aacctctctc agcaaagatg aagttctcgc gggtccgcca tgggtctcca ggaaaatcct
    25981 gggcgtcacg gtcggtaaag aattgaagac tttaagcaaa cgcgaacttg acgttcaaag
    26041 attacgcgcc gcggaacgcg ccctcgataa acaagtctgg cagaatcctc ccgttaaccc
    26101 ctcccgtcga aacgaggaga tgccaatagt catcttcgaa acgaacgcga tgagcggcta
    26161 gagtcggaga tgacgatcgc gcatcaagcc gcttccgtct ccgttggcga ggatcgccaa
    26221 gctcttgatg ccgctcgatc gaccttgaat gaactgcaca actcgccatc gtcggatgat
    26281 agggcaaact tgagtttgcc gagagccgaa atttgcgacc gtacaacgta tccgccgtct
    26341 ccatcccatt gaatacggat atcctgagca ttaggtccgt gccaatacgt ctccggcaag
    26401 agctgcagat ttggcatggc ggatgatgag ggatttgctg ggcgctttga gattgtcgag
    26461 gcgcgtcgag gaatactatc cccctcgaca agcgaagagt ctgatccgtg gggtcgaggc
    26521 ctgcagaaca ccgtaaacgc taacggcggg atcccgaatg accagtcatt gtggcattcc
    26581 gaaagaatta cattcacacg gcaccggtta ctcgatcact atctcctgcg gccttattcg
    26641 ataaagtcgt gctaactccg tgacttggca agaattacat atgaaacgcg atattcgttt
    26701 aatgacggag gctcggatgg tagattcacc gaaggctcaa ttctattagg agtcttgatt
    26761 gtgtccgaag agtgactgcc gctgcagcag cctagaaagc aatgcacggt gatgatgttg
    26821 atcggacggc tttcgtcccg cgcaggctgc cggctgaagc cgcttacgtc gatagtcaag
    26881 ttgacctggc ggcaaaccga caacctcact gtgtgacttg ttttgagcga cttttggttc
    26941 aagagaaaga tgaaaaccgt cgacgattgt gggcttccgg caaagcatgc gccacgtctg
    27001 aagagagttg tgcaggaggc cctacactcg caccgggcca ttcggggcca tgaagatgga
    27061 gcaaacctta atttgcaccg ataattttcc gttcgatgga tttgggctgg gcaggatcgg
    27121 agcggaaggc actaggaaaa tcgtaatttc tgtaggagaa acaatgatat ggtgatcatc
    27181 aagctaaatg cgaataaaaa catgcctgtc ttggcggttg agaagccgca agaaattcac
    27241 aaagaggagt tgagcgacca tcaccagtca aatggcttta cgagtttgga tctcgaaatg
    27301 attgaactgg agaattttgt ccttcactgc ccgctccccg aagaaaacct agccggctaa
    27361 aaggagttag acgatggatc cgaaggccga aggcaatggt gaaaatatca ctgagaccgc
    27421 agcaggcaat gtcgaaactt ctgatttcgt gaatttgaag cgccagaaga gggagggcgt
    27481 aaattccacc gggatgtccg aaattgatat gacgggtagc caagaaactc ccgaacacaa
    27541 catgcacgga agcccgactc acacggatga tctcggcccg cggttggatg cggacatgct
    27601 cgattctcag tcaagtcatg tttctagcag cgctcaaggc aatcggtctg aggttgaaaa
    27661 tgagctatcc aacttattcg cgaagatggc tttaccaggc catgatcggc gtaccgacga
    27721 gtatattctt gtgcggcaaa ccggacaaga caagttcgca ggtactacta aatgtaacct
    27781 cgatcatctg cccaccaagg cggaattcaa tgcgagctgc cggctctata gggatggagt
    27841 cggcaactac tatcccccgc ccctcgcatt cgagaggatc gatatcccgg agcaattggc
    27901 tgcacaattg cataacctgg agccaagaga acagagtaaa cagtgttttc agtacaagtt
    27961 ggaagtctgg aatcgcgctc acgcagagat gggcatcact ggcaccgaca tcttctatca
    28021 aacagacaag aatattaagc tcgaccgaaa ttataaattg aggcctgagg atagatatat
    28081 acaaacagag aaatacgggc gcagagaaat tcaaaaacgc tatgagcacc agtttcaagc
    28141 tggttcactg ctgccggata tcttaatcaa gaccccgcaa aatgatatac atttctcgta
    28201 caggtttgcg ggcgaccgtt acgctaacaa gcgatttgag gaattcgaac gcgcaatcaa
    28261 aactaaatac ggtagcgata ccgagatcaa gctcaaatcc aaatctggga ttatgcatga
    28321 ctccaaatat ttggaatcat gggagcgggg cagtgcggat atccgtttcg cagagttcgc
    28381 cggcgagaat cgagctcaca acaagcagtt tccggctgcg actgtgaata tgggaaggca
    28441 gccagatggc cagggaggga tgactcgcga tcgccatgta agcgttgact acctattgca
    28501 aaacctaccc aactcccctt ggacgcaagc cttgaaagag ggaaagttgt gggatcgagt
    28561 tcaggtcctt gctcgcgacg gaaaccgtta catgtcacct tcaagactgg aatattccga
    28621 ccccgaacac tttacccaac tgatggatca agttggtctg cccgtgtcga tgggtcggca
    28681 aagtcatgcg aatagtgtca agtttgagca gtttgacaga caggcagcgg ttattgttgc
    28741 ggatggcccg aacttacgtg aggttccaga tttgtccccg gaaaagttgc aacaactgtc
    28801 tcaaaaagat gtcctgatag cggatcgcaa tgaaaagggg caaagaaccg gcacttacac
    28861 taatgttgtg gaatatgagc gcctgatgat gaaattaccg agcgacgcag cgcagcttct
    28921 cgctgaaccg tccgatagat attcacgtgc ttttgtccgg ccggagccag cattgccccc
    28981 catcagtgac agccggcgga cttatgaaag ccgaccgcgc ggcccaaccg taaacagtct
    29041 gtagatttcc tgataccgcg tcagtgacga agccctttag ttcaaaatat gcatttgaaa
    29101 ggattcaaca atggtcgaca cgacgaagaa gagtgtcgcg aagtcgctta cggctgacat
    29161 gcgccgttct gctaagcggc tttctaagca aatgcgtaaa gcctcgctta ctgaagagga
    29221 ggcaacaagg aatctagccc ggctcgaaac gccggaccag aagcgaaaat atgtcgccga
    29281 tatgcagata atcgacaagc tggaagacgg ctttcgaggc gaaataagct ataaaatgct
    29341 gggaaataaa cagcttcggg tcgacagccc aaaagaatta acgcgcgagc atggtataat
    29401 aagaaaaaca agaaaggttc tgaagcgtaa cgcagagact ggcaatgttt acttgggtct
    29461 ccacgaaaag aagacctgga ggagcgttag cagccatctt tatgccgagg acggtacact
    29521 tcgcgcgaag catgtgaaat acaaagacgg acgctttgaa gaaaaatggg aacgagacga
    29581 aaatggcctg ctgttccgca cgcagtttgt caaccgaaat cggctatttc aacctatttc
    29641 cgagaaggtc agcacaccct accggagcgg accggaaaac cggctctttc gtgaactaac
    29701 ccgtcgaaaa ggttccaagc aggaaacttt tgagcgggac gaaaaaggca acctcgagct
    29761 catcggcagc aaacgtctcg gcttttccaa gaattcgacg aa
//
LOCUS       HUMPCOLLA2    700 bp ds-DNA             PRI       27-JUL-1990
DEFINITION  Human procollagen alpha2(I) chain gene, exon 6.
ACCESSION   M35391
KEYWORDS    procollagen alpha2(I).
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 700)
  AUTHORS   Weil,D., D'Alessio,M., Ramirez,F. and Eyre,D.
  TITLE     Structural and functional characterization of a splicing mutation
            in the pro-alpha2(I) collagen gene of an Ehlers-Danlos type VII
            patient
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by F.Ramirez, 19-JUN-1990.
            
FEATURES       from  to/span     description
    pept    /     6  /    59     procollagen alpha2(I) chain, exon 6 (AA at 6)
                                 /hgml_locus_uid="LP0002V" /nomgen="COL1A2"
                                 /map="7q21.3-q22.1"
    IVS     <     1        5     procollagen alpha2(I) chain intron E
    IVS          60  >   700     procollagen alpha2(I) chain intron F
BASE COUNT      235 a    120 c    101 g    244 t
ORIGIN      
        1 tctagaactt tgctgctcag tatgatggaa aaggagttgg acttggccct ggaccaatgg
       61 tatgcttatc tgtttatctt agccaaaaaa attgctaaat aaatcattca ttttatgtca
      121 cattttacca cgccatttat ttagctacct aagttaacac tcaatactta gattatataa
      181 aaaacaactc tttttgtttt caaatttatg aaaacataag ttaaggagtt cacttttctt
      241 tacaaaagaa agattaattg atcttttatg attatatgat ctttttgatt atatgatcct
      301 cattaagata gatcatatac ttatgtccga agaaataatc tttcggcaat atgccaaatc
      361 aattcgggaa atcaatttaa tttcaaaagt aatccatctg attaattttt catatcttca
      421 atcggtagat ttagcaacac tagaggcata gaactattta ttaagttctc tgaacttgtt
      481 ggaaaggatc aacaagttct atctagtcca gctaactcat ttttaaaatg ggagagttta
      541 agcccttttc tcaaagtcat ccaggtaact aatgacataa ctagaactag atgccaggca
      601 agatgtctaa tatttgctta catcatggtt tatgtaccta gtccttgaat aaaccactca
      661 tttagtcaac agatattaat cagatgcctt caatgggccc
//
LOCUS       MUSC4P       1739 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Mouse complement C4 5' flank.
ACCESSION   D90052
KEYWORDS    C4; complement C4; promoter.
SOURCE      Mouse (strain B10.BR) genomic DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1739)
  AUTHORS   Nakayame K Pattanakitsakul,S.-N., Yokoyama,S., Kimura,H.,
            Nosaka,M. and Takahashi,M.
  TITLE     Post-transcriptional regulation of the complement C4 gene in the
            low C4-producing strain of mouse
  JOURNAL   Immunogenetics 31, 361-367 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Masaru
            Nonaka Cancer Research Institute, Kanazawa University 13-1
            Takaramachi Kanazawa 920 Japan Phone:  0762-62-8151
FEATURES       from  to/span     description
    site       1691     1691     transcription initiation site
BASE COUNT      376 a    464 c    476 g    423 t
ORIGIN      chromosome 17.
        1 gtcaacagtt attgcacatg cttaatgtaa acctgtagcc tctacatgct tgtgtgcctg
       61 cccagacagt atatgacacc cacacccaca cccacaccca cacccacccc accccacttc
      121 acacccaggc agcagtagtg gacccaacag ggtaagtggg tccctgcttg ggttgcctgt
      181 gtttctggaa gctcaaggtc catgcatcag cagtagggga agggagggtt ctcagcccag
      241 gctgcatcat catggccgcc taccacctgc aggcacctgg tgaacgctgg ggtcctcact
      301 gtccgagatg ctggaagctg gtggctggct gtgcctggag ctgggagatt catcaagtgc
      361 tttgttaaag gtatggtgcc tgaggagcaa cacccatggg gggtctttag gtctccaaat
      421 ttgactcact tcttcctttt cctggacctt ctcctccagg gcgccaggct gtactgagca
      481 tggtgtggaa ggccaagtac tgggagcttg ccctgtcaga gctcctgggc cgcagggccc
      541 ctttggcagt gcggctaggt cttgcctacc atgtgcacga cctcattgga gcccagctgg
      601 tggactggtg agtctagtcc tgagacttgg gaaacgtgca gggacccagg gttggtgtcc
      661 ccctaagtgt tgcttctgtc atcagtgaag ccaagattct ctcattcccg gactcgggag
      721 aagactggac taaattctgg atttgcacac tttgggtcta ggcagctggc tccagtcttg
      781 tctttctgtt acacagtgtc cccaccactt ctggaaccct ccttcgcctg ccagatacat
      841 gaggagccaa tgcttggatt ctgcaactca ctgagtgagg ttcctggaag tgccacccca
      901 gggtggctga gcaagtcacc gcagtgggtg ccaggctcta ctgctgcaag ctgggcttct
      961 acctgagccg ggctgtgggc attgcagctc ttgcttctgt gcgtgtggag tcaggagccg
     1021 tgccaagggg atgagaaggt gggattgcta gagacactgg agcagggagt agaaaactct
     1081 gcccttcacg tcaggctgaa attgccaaat aaaatacttg tgcctgtact tctttctgtg
     1141 tcctttgttc agtgtggtgt atgtacggct gtgtgaagcc caacttgggg ctgagcattt
     1201 catcattgtt tagagagaga acaaggctta ctcaggccaa tttccctgtc tggggtgaga
     1261 gtgtacctca tcttctgagc tataggcttt ctgttagagc cataccaagc agaaggctgc
     1321 cttgaatatg aggccatcga ctggggaacc cattctagtt ttgttaggac ctttacctgt
     1381 cttatctatg tgtcatgttg gagctgaata cacgcacaca tggacaagca catgtattcg
     1441 tacacacaca cacacacacg cacacgcaca caaacacaca cacaagggga agtccctggt
     1501 actgggaata aacaagcacc agatcaattt tggggccaag atttctacag agaccctggt
     1561 gtttgggttg cctccccagc tctgggccta gggccagttc tgtttcttgg ccatcacgtg
     1621 gtttcccggc tcatggggca aagggagggg caaggtccag atcaaactct gccccaaccc
     1681 taggttgatc agaagggagc agacagtcag accagacagg tctgaccttt cctggatcc
//
LOCUS       BSURPOF      3078 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB),
            complete cds.
ACCESSION   M34995
KEYWORDS    RNA polymerase; minor sigma-37 factor.
SOURCE      B.subtilis (strains 168 and Marburg), clones
            lambda-[21,gt11,1,11,21,31].
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 368 to 2297)
  AUTHORS   Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W.
  TITLE     Gene encoding the 37,000-dalton minor sigma factor of Bacillus
            subtilis RNA polymerase: Isolation, nucleotide sequence,
            chromosomal locus, and cryptic function
  JOURNAL   J. Bacteriol. 169, 771-778 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3078)
  AUTHORS   Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W.
  TITLE     Similar organization of the sigB and spoIIA operons encoding
            alternate sigma factors of Bacillus subtilis RNA polymerase
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.W.Price, 18-FEB-1987.  Draft entry and computer-readable
            sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990.
FEATURES       from  to/span     description
    pept        200      529     ORF V
    pept        526     1008     ORF W
    pept        968     1762     37 kd minor sigma factor (rpoF, sigB; ttg
                                 start codon)
    pept       1762     2361     ORF X
    mRNA        131     2368     sigB operon mRNA
    signal      131      168     sigB operon promoter
    signal     2368     2401     sigB transcription termination signal
    binding     188      194     ORF V ribosomal binding site
    binding     511      519     ORF W ribosomal binding site
    binding     954      961     sigB ribosomal binding site
    binding    1750     1753     ORF X ribosomal binding site
BASE COUNT      999 a    575 c    755 g    749 t
ORIGIN      40 degrees on B.subtilis map.
        1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt
       61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt
      121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg
      181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat
      241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct
      301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat
      361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg
      421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt
      481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct
      541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg
      601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc
      661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg
      721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa
      781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac
      841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat
      901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg
      961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga
     1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg
     1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca
     1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc
     1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg
     1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc
     1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga
     1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg
     1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac
     1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca
     1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct
     1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat
     1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga
     1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta
     1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct
     1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac
     1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc
     1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg
     2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt
     2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc
     2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata
     2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta
     2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat
     2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt
     2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg
     2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac
     2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga
     2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac
     2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca
     2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag
     2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa
     2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt
     2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt
     2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc
     3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg
     3061 caaactgctg cgaagctt
//
LOCUS       HUMERPR      1624 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human erythropoietin receptor mRNA, complete cds.
ACCESSION   M34986
KEYWORDS    erythropoietin receptor.
SOURCE      Human cell line OCI-M1, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1624)
  AUTHORS   Ehrenman,K. and St John,T.
  TITLE     Sequence of the human erythropoietin gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.St.John, 07-JUN-1990.
            
              Author address: T.St.John
              Hutchinson Cancer Res Center
              1124 Columbia St.
              Seattle, WA
              Email: TSTJOHN@FHCRCVAX
            
FEATURES       from  to/span     description
    pept          1     1527     erythropoietin receptor precursor
    sigp          1       72     erythropoietin receptor signal peptide
    matp         73     1524     erythropoietin receptor
BASE COUNT      276 a    537 c    498 g    313 t
ORIGIN      
        1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct
       61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc
      121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg
      181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc
      241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt
      301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta
      361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat
      421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc
      481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac
      541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag
      601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc
      661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg
      721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg
      781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag
      841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac
      901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc
      961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg
     1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc
     1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg
     1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc
     1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct
     1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca
     1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct
     1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta
     1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg
     1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata
     1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc
     1621 agga
//
LOCUS       HUMGFP40H    4379 bp ds-DNA             PRI       27-JUL-1990
DEFINITION  Human P40 T-cell and mast cell growth factor (hP40) gene, complete
            cds.
ACCESSION   M30135
KEYWORDS    T-cell growth factor; cytokine; mast cell growth factor;
            megakeryoblastic leukemia cell growth factor.
SOURCE      Human (cell line CESS) DNA, clone lambda-H40.3a1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4379)
  AUTHORS   Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
            Van Snick,J.
  TITLE     Human P40/Il9: Expression in activated CD4+ cells, genomic
            organization, and comparison with the mouse gene
  JOURNAL   J. Immunol. 144, 4235-4241 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Van Snick, 12-DEC-1989.
FEATURES       from  to/span     description
    pept        796      909     P40 T-cell and mast cell growth factor
                                 precursor, exon 1
               1022     1057     P40 T-cell and mast cell growth factor
                                 precursor, exon 2
               1140     1172     P40 T-cell and mast cell growth factor
                                 precursor, exon 3
               2461     2592     P40 T-cell and mast cell growth factor
                                 precursor, exon 4
               4096     4215     P40 T-cell and mast cell growth factor
                                 precursor, exon 5
    sigp        796      849     P40 T-cell and mast cell growth factor signal
                                 peptide
    matp        850      909     P40 T-cell and mast cell growth factor
               1022     1057     P40 T-cell and mast cell growth factor
               1140     1172     P40 T-cell and mast cell growth factor
               2461     2592     P40 T-cell and mast cell growth factor
               4096     4212     P40 T-cell and mast cell growth factor
    IVS         910     1021     hP40 intron A
    IVS        1058     1139     hP40 intron B
    IVS        1173     2460     hP40 intron C
    IVS        2593     4095     hP40 intron D
    signal      742      747     TATA box
    signal     4354     4359     polyA signal
    rpt        2965     3265     Alu repeat
    binding     370      376     AP-2 binding site
    binding     624      629     IRF-1 binding site
    binding     627      633     AP-1 binding site
BASE COUNT     1208 a    921 c    904 g   1346 t
ORIGIN      
        1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt
       61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc
      121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca
      181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct
      241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact
      301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc
      361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag
      421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa
      481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt
      541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt
      601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa
      661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt
      721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc
      781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc
      841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac
      901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat
      961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta
     1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac
     1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg
     1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg
     1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat
     1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt
     1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt
     1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag
     1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa
     1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc
     1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg
     1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc
     1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc
     1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat
     1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac
     1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc
     1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg
     1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt
     2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc
     2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga
     2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc
     2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa
     2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa
     2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt
     2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag
     2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg
     2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac
     2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt
     2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc
     2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc
     2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct
     2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag
     2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa
     2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg
     3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg
     3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat
     3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca
     3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc
     3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt
     3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac
     3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg
     3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc
     3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt
     3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca
     3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca
     3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact
     3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga
     3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt
     3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc
     3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc
     3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg
     4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa
     4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac
     4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg
     4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc
     4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga
     4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa
//
LOCUS       MEFDNABP     1710 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  M.fervidus DNA binding protein HMfB gene, complete cds.
ACCESSION   M34778
KEYWORDS    DNA binding protein HMfB.
SOURCE      M.fervidus DNA.
  ORGANISM  Methanothermus fervidus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanothermaceae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
  TITLE     DNA binding protein HMf, isolated from the hypothermophilic
            archaeal Methanothermus fervidus is most closely related to
            histones
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1710)
  AUTHORS   Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Sandman, 01-JUN-1990.
            
               Author address: K.Sandman
               Department of Microbiology
               Ohio State University
               484 W. 12th Ave.
               Columbus, Ohio 43210
               E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU
FEATURES       from  to/span     description
    pept        578      787     DNA binding protein HMfB
    pept       1662      805 (c) ORF285
    signal      450      457     promoter
    signal      523      530     promoter
    signal      790      814     transcription termination signal
BASE COUNT      619 a    263 c    265 g    561 t      2 others
ORIGIN      1 bp upstream of HindIII site.
        1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa
       61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag
      121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag
      181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg
      241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat
      301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat
      361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa
      421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt
      481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt
      541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat
      601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct
      661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag
      721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa
      781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat
      841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc
      901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt
      961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat
     1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa
     1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg
     1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata
     1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc
     1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc
     1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc
     1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat
     1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc
     1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt
     1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt
     1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa
     1681 gttttgtttt acccattgct tctaaatatt
//
LOCUS       MUSINT1A     5607 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds.
ACCESSION   K02593 M34750
KEYWORDS    Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene.
SOURCE      Mouse (BALB/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1098 to 5606)
  AUTHORS   Van Ooyen,A. and Nusse,R.
  TITLE     Structure and nucleotide sequence of the putative mammary oncogene
            int-1; Proviral insertions leave the protein-encoding domain intact
  JOURNAL   Cell 39, 233-240 (1984)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 5607)
  AUTHORS   Nusse,R., Theunisseen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A.,
            Otte,A., Schuuring,E. and Van Ooyen,A.
  TITLE     The promoter of the Wnt-1 (int-1) oncogene and its mechanism of
            activation by insertion of proviral DNA of the mouse mammary tumor
            virus
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [2] kindly submitted
            by R.Nusse, 31-MAY-1990.
FEATURES       from  to/span     description
    pept       1779     1882     Wnt-1 protein (exon 1)
               2452     2705     Wnt-1 protein (exon 2)
               3279     3544     Wnt-1 protein (exon 3)
               4002     4490     Wnt-1 protein (exon 4)
    IVS        1883     2451     Wnt-1 cds intron A
    IVS        2706     3278     Wnt-1 cds intron B
    IVS        3545     4001     Wnt-1 cds intron C
    site       1133     1134     MMTV insertion site in tumor 17 [1]
    site       1394     1395     MMTV insertion site in tumor 35 [1]
    site       1631     1632     MMTV insertion site in tumor 102 [1]
    site       4495     4496     MMTV insertion site in tumor 53 [1]
    mRNA       1428     5398     Wnt-1 mRNA (alt.)
    mRNA       1595     5398     Wnt-1 mRNA (alt.)
    signal     1562     1566     TATA box
    signal     1573     1577     TATA box
BASE COUNT     1188 a   1681 c   1519 g   1219 t
ORIGIN      Chromosome 15.
        1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc
       61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac
      121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac
      181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa
      241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg
      301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg
      361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac
      421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct
      481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact
      541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac
      601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag
      661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg
      721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga
      781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca
      841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact
      901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc
      961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca
     1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg
     1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga
     1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg
     1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg
     1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc
     1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc
     1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg
     1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt
     1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa
     1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag
     1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact
     1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt
     1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc
     1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg
     1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg
     1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag
     1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct
     2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc
     2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct
     2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt
     2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg
     2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg
     2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg
     2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt
     2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga
     2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat
     2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg
     2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa
     2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc
     2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag
     2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg
     2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg
     2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc
     3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag
     3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag
     3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact
     3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac
     3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat
     3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc
     3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg
     3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc
     3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg
     3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg
     3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag
     3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag
     3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt
     3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca
     3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac
     3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt
     3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc
     4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc
     4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc
     4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg
     4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat
     4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt
     4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc
     4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg
     4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc
     4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg
     4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc
     4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga
     4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc
     4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac
     4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac
     4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc
     4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa
     4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc
     5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc
     5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac
     5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga
     5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac
     5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct
     5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat
     5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca
     5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga
     5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat
     5581 gaccctaaat gagaccaact agggatc
//
LOCUS       MUSP40M      3808 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete
            cds.
ACCESSION   M30136
KEYWORDS    T-cell growth factor; cytokine; mast cell growth factor.
SOURCE      Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210
            DNA, clone lambda-S40.1a.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3808)
  AUTHORS   Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
            Van Snick,J.
  TITLE     Human P40/Il9: Expression in activated CD4+ cells, genomic
            organization, and comparison with the mouse gene
  JOURNAL   J. Immunol. 144, 4235-4241 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Van Snick, 21-NOV-1989.
FEATURES       from  to/span     description
    pept        644      757     P40 T-cell and mast cell growth factor
                                 precursor, exon 1
                846      881     P40 T-cell and mast cell growth factor
                                 precursor, exon 2
                965      997     P40 T-cell and mast cell growth factor
                                 precursor, exon 3
               2184     2315     P40 T-cell and mast cell growth factor
                                 precursor, exon 4
               3393     3512     P40 T-cell and mast cell growth factor
                                 precursor, exon 5
    sigp        644      697     P40 T-cell and mast cell growth factor signal
                                 peptide
    matp        698      757     P40 T-cell and mast cell growth factor
                846      881     P40 T-cell and mast cell growth factor
                965      997     P40 T-cell and mast cell growth factor
               2184     2315     P40 T-cell and mast cell growth factor
               3393     3509     P40 T-cell and mast cell growth factor
    IVS         758      845     mP40 intron A
    IVS         882      964     mP40 intron B
    IVS         998     2183     mP40 intron C
    IVS        2316     3392     mP40 intron D
    signal      589      594     TATA box
    binding     227      233     AP-2 binding site
    binding     475      480     IRF-1 binding site
    binding     478      484     AP-1 binding site
BASE COUNT     1012 a    903 c    869 g   1024 t
ORIGIN      Chromosome 13.
        1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca
       61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt
      121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag
      181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc
      241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga
      301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt
      361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt
      421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga
      481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg
      541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta
      601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat
      661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg
      721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga
      781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac
      841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag
      901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt
      961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc
     1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct
     1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat
     1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga
     1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa
     1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac
     1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac
     1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc
     1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca
     1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta
     1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac
     1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc
     1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct
     1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta
     1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat
     1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg
     1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc
     1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg
     2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa
     2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt
     2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt
     2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa
     2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca
     2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc
     2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag
     2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg
     2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct
     2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag
     2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg
     2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag
     2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat
     2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc
     2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct
     2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc
     3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga
     3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg
     3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga
     3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa
     3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt
     3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc
     3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc
     3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga
     3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg
     3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag
     3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga
     3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac
     3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat
     3781 aggtaggatg gagcaaagac ttggtaat
//
LOCUS       PIGMHDQAA    1105 bp ss-mRNA            MAM       27-JUL-1990
DEFINITION  Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete
            cds.
ACCESSION   M29938
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone
            ZQ30-1.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1105)
  AUTHORS   Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
            LeGuern,C.
  TITLE     Class II genes of miniature swine: III. Characterization of an
            expressed pig class II gene homologous to HLA-DQA
  JOURNAL   Immunogenetics 31, 52-56 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept         13      777     MHC SLA-DQ alpha chain precursor
    sigp         13       81     MHC SLA-DQ alpha chain signal peptide
    matp         82      774     MHC SLA-DQ alpha chain
    mRNA    <     1     1105     MHC SLA-DQ mRNA
BASE COUNT      254 a    290 c    255 g    306 t
ORIGIN      
        1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc
       61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta
      121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag
      181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa
      241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg
      301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact
      361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac
      421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt
      481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc
      541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat
      601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag
      661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc
      721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc
      781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa
      841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc
      901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg
      961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc
     1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
     1081 aaacaataaa tcacttcatg aggtt
//
LOCUS       PIGMHDQAB    1174 bp ss-mRNA            MAM       27-JUL-1990
DEFINITION  Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete
            cds.
ACCESSION   M29939
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1174)
  AUTHORS   Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
            LeGuern,C.
  TITLE     Class II genes of miniature swine: III. Characterization of an
            expressed pig class II gene homologous to HLA-DQA
  JOURNAL   Immunogenetics 31, 52-56 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept         69      836     MHC SLA-DQ alpha chain precursor
    sigp         69      137     MHC SLA-DQ alpha chain signal peptide
    matp        138      833     MHC SLA-DQ alpha chain
BASE COUNT      273 a    313 c    266 g    322 t
ORIGIN      
        1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg
       61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga
      121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg
      181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat
      241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta
      301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca
      361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt
      421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca
      481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt
      541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca
      601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata
      661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga
      721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct
      781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca
      841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac
      901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct
      961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga
     1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc
     1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
     1141 aaacaataaa tcacttcatg aggtttttgt actg
//
LOCUS       PIGMHDQBA     212 bp ds-DNA             MAM       27-JUL-1990
DEFINITION  Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end.
ACCESSION   M29944
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Pig (haplotype c/c) adult spleen DNA, clone D26.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F.,
            Gustafsson,K. and LeGuern,C.
  TITLE     Class II genes of miniature swine: II. Molecular identification and
            characterization of B (beta) genes from SLA-c haplotype
  JOURNAL   Immunogenetics 31, 1-6 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept    <     1       12     MHC SLA-DO beta chain
BASE COUNT       38 a     58 c     52 g     64 t
ORIGIN      
        1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg
       61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc
      121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct
      181 cttacccttc tgcctttctt cctcctactt gt
//
LOCUS       PP2LEND       266 bp ds-DNA             PHG       27-JUL-1990
DEFINITION  Bacteriophage P2 left end DNA.
ACCESSION   M34756
KEYWORDS    .
SOURCE      Bacteriophage P2 DNA.
  ORGANISM  Bacteriophage P2
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 266)
  AUTHORS   Ziermann,R. and Calendar,R.
  TITLE     Characterization of the cos sites of Bacteriophages P2 and P4
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Ziermann, 05-JUN-1990.
            
              Author address: R.Ziermann
              University of California
              Dept. Mol. Cell Biology
              Berkeley, CA 94720
              Email: calendar@garnet.berkeley.edu
            
FEATURES       from  to/span     description
    site          1       19     cohesive end of phage DNA
    site         20      266     left end of phage DNA
BASE COUNT       41 a     58 c     99 g     68 t
ORIGIN      Map position 0.0-0.8%.
        1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct
       61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt
      121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg
      181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc
      241 gagccagccg tttatctcgc ggatcc
//
LOCUS       RCABCHH       314 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  R.capsulata bacteriochlorophyll biosynthesis protein (bchH and
            F108) genes, 3' end, and 5' end respectively.
ACCESSION   M34843
KEYWORDS    .
SOURCE      R.capsulata DNA.
  ORGANISM  Rhodobacter capsulatus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Yang,Z. and Bauer,C.E.
  TITLE     Rhodobacter capsulatus genes involved in early steps of the
            bacteriochlorophyll biosynthetic pathway
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.E.Bauer, 04-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1       79     bacteriochlorophyll biosynthesis protein
                                 (bchH) (AA at 2)
    pept        108  >   314     bacteriochlorophyll biosynthesis protein
                                 (F108)
BASE COUNT       59 a     87 c    119 g     49 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga
       61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg
      121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg
      181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg
      241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca
      301 agcgcgtgct gcag
//
LOCUS       TREANTAT      999 bp ds-DNA             ORG       27-JUL-1990
DEFINITION  T.evansi AnTat 3/3 kinetoplast DNA minicircle.
ACCESSION   M34848
KEYWORDS    .
SOURCE      T.evansi AnTat 3/3 kinetoplast minicircle DNA.
  ORGANISM  Kinetoplast Trypanosoma evansi
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Trypanosoma evansi.
REFERENCE   1  (bases 1 to 999)
  AUTHORS   Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
            Steinert,M. and Hamers,R.
  TITLE     Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Bajyana Songa, 05-JUN-1990.
FEATURES       from  to/span     description
    site        367      516     kinetoplast minicircle common region (ORI)
BASE COUNT      391 a     78 c    203 g    327 t
ORIGIN      
        1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
       61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
      121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
      181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
      241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
      301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
      361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
      421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
      481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
      541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
      601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
      661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
      721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
      781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
      841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
      901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
      961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS       VIBSCRRE     1524 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  V.alginolyticus Scr repressor protein gene (scrR), complete cds,
            and sucrose transport (scrA) protein, 5' end.
ACCESSION   M35009
KEYWORDS    Scr repressor protein; sucrose transport protein.
SOURCE      V.alginolyticus DNA.
  ORGANISM  Vibrio alginolyticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 1524)
  AUTHORS   Blatch,G.L. and Woods,D.R.
  TITLE     Nucleotide sequence and analysis of the Vibrio alginolyticus Scr
            repressor-encoding gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.L.Blatch, 10-JUN-1990.
            
              Author address: G.L.Blatch
              University of Cape Town
              Dept. Microbiology
              Private Bag
              Rondebosch
              7700 SOUTH AFRICA
              Email: bltgre01.uctvax@f4.n494.z5.fidonet.org
            
FEATURES       from  to/span     description
    pept         36  <     1 (c) sucrose transport protein (scrA)
    pept        271      555     Scr repressor protein (scrR)
    pept        791     1090     ORF5
    binding     264      267     scrR ribosomal binding site
    binding     780      785     ORF5 ribosomal binding site
    signal      178      183     scrA -10 region
    signal      203      208     scrA -35 region
    signal      190      195     scrR -10 region
    signal      162      167     scrR -10 region
    signal      762      767     ORF5 -35 region
    signal      742      747     ORF5 -35 region
    site         78       91     repressor operator site (put.)
    site        229      242     repressor operator site (put.)
    site         12       71     ScrR helix-turn-helix motif
BASE COUNT      466 a    344 c    321 g    393 t
ORIGIN      
        1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc
       61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg
      121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt
      181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc
      241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg
      301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa
      361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc
      421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg
      481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt
      541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga
      601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt
      661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac
      721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag
      781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt
      841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga
      901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg
      961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga
     1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta
     1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc
     1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt
     1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt
     1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat
     1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct
     1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca
     1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa
     1501 gccaattgtg atattgaaga agct
//
LOCUS       XELNRP1A     2431 bp ss-mRNA            VRT       27-JUL-1990
DEFINITION  X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA,
            complete cds.
ACCESSION   M34894
KEYWORDS    RNA-binding protein; nervous system-specific protein.
SOURCE      X.laevis stage 24, cDNA to mRNA, clones 24-39.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2431)
  AUTHORS   Richter,K., Good,P.J. and Dawid,I.B.
  TITLE     A developmentally regulated, nervous system-specific gene in
            Xenopus encodes a putative RNA-binding protein
  JOURNAL   New Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.Good, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         74     1117     pot. RNA-binding protein (nrp-1B)
    mRNA    <     1     2431     nrp-1B mRNA
    signal     2407     2418     poly-A signal
    site         22       27     RNP2 consensus sequence
    site         58       68     RNP1 consensus sequence
    site        111      116     RNP2 consensus sequence
    site        147      157     RNP1 consensus sequence
    site        921      941     alternative exon (put.)
BASE COUNT      663 a    522 c    543 g    703 t
ORIGIN      
        1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt
       61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc
      121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact
      181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt
      241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa
      301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt
      361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt
      421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga
      481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac
      541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa
      601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt
      661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat
      721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc
      781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc
      841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc
      901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag
      961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag
     1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc
     1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct
     1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct
     1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga
     1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat
     1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg
     1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat
     1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta
     1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa
     1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat
     1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac
     1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg
     1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc
     1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat
     1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc
     1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga
     1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct
     2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg
     2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc
     2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat
     2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg
     2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca
     2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata
     2401 gttcaaaata aactatgaat tttattcatt t
//
LOCUS       XELNRP1B     2324 bp ss-mRNA            VRT       27-JUL-1990
DEFINITION  X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA,
            complete cds.
ACCESSION   M34895
KEYWORDS    RNA-binding protein; nervous system-specific protein.
SOURCE      X.laevis stage 24, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2324)
  AUTHORS   Richter,K., Good,P.J. and Dawid,I.B.
  TITLE     A developmentally regulated, nervous system-specific gene in
            Xenopus encodes a putative RNA-binding protein
  JOURNAL   New Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.Good, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         29     1051     pot. RNA-binding protein (nrp-1B)
    mRNA    <     1     2324     nrp-1B mRNA
    signal     2306     2311     poly-A signal
    site         22       27     RNP2 consensus sequence
    site         58       68     RNP1 consensus sequence
    site        111      116     RNP2 consensus sequence
    site        147      157     RNP1 consensus sequence
BASE COUNT      647 a    467 c    520 g    690 t
ORIGIN      
        1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc
       61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac
      121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt
      181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca
      241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga
      301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat
      361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca
      421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg
      481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt
      541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc
      601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat
      661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag
      721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac
      781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg
      841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc
      901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat
      961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat
     1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg
     1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct
     1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat
     1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta
     1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa
     1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa
     1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct
     1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa
     1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc
     1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact
     1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg
     1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt
     1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct
     1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag
     1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag
     1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt
     1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta
     2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac
     2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt
     2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat
     2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa
     2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc
//
LOCUS       YSJSACDHY    2569 bp ds-DNA             PLN       27-JUL-1990
DEFINITION  Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds.
ACCESSION   M34929
KEYWORDS    saccharopine dehydrogenase.
SOURCE      Y.lipolytica (strain W29) DNA.
  ORGANISM  Yarrowia lipolytica
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2569)
  AUTHORS   Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and
            Gaillardin,C.
  TITLE     Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia
            lipolytica
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.M.Fournier, 06-JUN-1990.
            
            ORF1 is probably not translated, since a lacZ fusion/expression
            experiment did not produce results.
FEATURES       from  to/span     description
    pept        649     1929     ORF1
    pept       1764      655 (c) saccharopine dehydrogenase
    mRNA        582     2060     ORF1 mRNA (alt.)
    mRNA        598     2060     ORF1 mRNA (alt.)
    mRNA        599     2060     ORF1 mRNA (alt.)
    mRNA        602     2060     ORF1 mRNA (alt.)
    mRNA        606     2060     ORF1 mRNA (alt.)
    mRNA        609     2060     ORF1 mRNA (alt.)
    mRNA       1807      639 (c) ORF2 mRNA (alt.)
    mRNA       1809      639 (c) ORF2 mRNA (alt.)
    mRNA       1811      639 (c) ORF2 mRNA (alt.)
    mRNA       1812      639 (c) ORF2 mRNA (alt.)
    mRNA       1814      639 (c) ORF2 mRNA (alt.)
    signal      521      528     TATA box
    signal      565      575     TATA box
BASE COUNT      668 a    573 c    666 g    662 t
ORIGIN      
        1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag
       61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt
      121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca
      181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa
      241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag
      301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc
      361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc
      421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt
      481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca
      541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt
      601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc
      661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt
      721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc
      781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc
      841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac
      901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt
      961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca
     1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt
     1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc
     1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt
     1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata
     1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc
     1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc
     1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct
     1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg
     1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg
     1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc
     1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt
     1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag
     1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg
     1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag
     1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg
     1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga
     1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa
     2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca
     2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg
     2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc
     2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct
     2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag
     2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga
     2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat
     2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat
     2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc
//
LOCUS       HUMMHDQ1A    1515 bp ds-DNA             PRI       27-JUL-1990
DEFINITION  Human MHC class II HLA-DQ-alpha gene, regulatory region.
ACCESSION   M33765
KEYWORDS    cell surface antigen; cell surface glycoprotein; class II gene;
            major histocompatibility complex.
SOURCE      Human (DR4, DR4) DNA, clone H11A.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1515)
  AUTHORS   Ombra,M.N., Del Pozzo,G., Perfetto,C., Maffe,A. and Guardiola,J.
  TITLE     Effect of the AIR-I locus on the activation of an enhancerless
            HLA-DQA1 promoter
  JOURNAL   Immunogenetics 31, 368-376 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Guardiola, 18-APR-1990.
FEATURES       from  to/span     description
    pre-msg     655  >  1515     HLA-DQA1 mRNA and introns
                                 /hgml_locus_uid="LY0151Z" /nomgen="HLA-DQA1"
                                 /map="6p21.3"
    IVS         785  >  1515     HLA-DQA1 intron A
BASE COUNT      441 a    322 c    340 g    412 t
ORIGIN      Chromosome 6p21.3.
        1 atataactga tgacctcatc tatccatgag gctgctcacg aaatgccctc ccctggccag
       61 gcacggtggc tcatgcctgt aatccaagca ctttgggagg ctgaggcagg cagatcatga
      121 ggtcaggagt tcaagaccag cgtggccaac atagtgaaac tctgtctcta ctaaaaatac
      181 agaattagcc gggcatggtg gcagcgctgc agtcaccagc tactcgggag ggtgaggcag
      241 gagaatcgct tgaacctggg aggcaaaggt tgcagtgagc cgagaccatg ccattgtacc
      301 tcagcctggg tgacagagtg agactacgtc tcgaaaagaa aagaaaaaga aaagaaaaag
      361 aaaaaaaaag aaaaatgccc atccctcttg cgaatggcag acatgcacac accagagaag
      421 attccaattt agtgtcctcc ctctgttcat agaacaattc ctcaagtcca ctctgagtag
      481 aggctgcatc acaacaaggg gattgccctg tctccttcca gggctcttaa tacaaactct
      541 tcagctagta actgagatgt caccatgggg gatttttcta attggccaaa acctgacctg
      601 gcagggtttg gtttgggtgt cttcagattt ccttgtcttg aggccctcac aattgctcta
      661 cagctcagaa cagcaactgc tgaggctgcc ttgggaagag gatgatccta aacaaagctc
      721 tggatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg tgaagacatt
      781 gtgggtgagt gcatgagtga ggaaatgttc tctggagctg aaaaacagta aattgaagga
      841 aaagagagaa agcgatttgc agagaaattg tagagatttc ctaagaccct ttcactatta
      901 agagaaatta aaaattatag ctgttcctcc ttcaggaaac cagagaccca atccactctt
      961 tttgttatgt atgcttttgt gttcactaag gatgctattc tgtttatatt atattcagtg
     1021 acaacagcct ggaggtctct atgtcgttcc gtcatgattg cctcaaaaat tagtgaagtt
     1081 tccatcagtg gaataatttt ttattattaa aaatgtatga agtgtcattc tcaaatttcc
     1141 ctgaacaact tttgaagctt ttcgtatgtc tcctgtagta gatcttgggg tcgttccatc
     1201 aattatatac tctatagata ttaaaaaagt tgcccgtttc tttctctcag acttactcac
     1261 atttccacat gggaactggc acaggtgggg agtgggtaaa ggagtccagc aggctgaatg
     1321 ccttcaacaa tcattttacc acatggtcct cacttactct cagctgcctc atatgtgtca
     1381 cctcacaaaa taatcaaaat aaaaatgggc atgtagctaa gctttgtaaa tagtgaaaac
     1441 atggaatgtc aattgttttt acatatttct attacaggta tagcttcaca tttttcttta
     1501 gcaaaataag ggatc
//
LOCUS       BSUAMYEA      727 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  B.subtilis alpha-amylase (amyE) gene, 5' end.
ACCESSION   M35517
KEYWORDS    alpha-amylase.
SOURCE      B.subtilis (strain 2633) DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 727)
  AUTHORS   Emori,M., Tojo,T. and Maruo,B.
  TITLE     Molecular cloning and expression of an alpha-amylase gene from an
            alpha-amylase extrahyper producing Bacullis subtilis
  JOURNAL   Agric. Biol. Chem. 52, 399-406 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        599  >   727     alpha-amylase structural protein (amyE)
BASE COUNT      207 a    125 c    179 g    216 t
ORIGIN      
        1 gatcatccgc ggcattatgt ttgaatttcc gtttaaagaa tgggctgcaa gcttgttcat
       61 cattatctta tattactgca tcagggctgc ggcatccgga atgctcatgc cgagaataga
      121 caccaaagaa gaactgcaaa aacgggtgaa gcagcagcga atagaatcag ttgcggtcgc
      181 ctttgcggta gtggtgctta cgatgtacga cagggggatt ccccatacat tcttcgcttg
      241 gctgaaaatg attcttcttt ttatcgtctg cggcggcgtt ctgtttctgc ttcggtatgt
      301 gattgtgaag ctggcttaca gaagagcggt aaaagaagaa ataaaaaaga aatcatcttt
      361 tttgtttgga aagcgaggga agcgtacaca gtctcgggca gtttttttat aggaacattg
      421 atttgtattc actctgccaa gttgttttga tagagtgatt gtgataattt aaaatgtaag
      481 tgttaacaaa attctccagt cttcacatca gtttgaaagg aggaagcgga agaatgaagt
      541 aagagggatt tttgactccg aagtaagtct tcaaaaaatc aaataaggag tgtcaagaat
      601 gtttgcaaaa cgattcaaaa cctctttact gccgttattc gctggatttt tattgctgtt
      661 ttatttggtt ctggcaggac cggcggctgc gagtgctgaa acggcgaaca aatcgaatga
      721 gcttaca
//
LOCUS       CHKCKMB       274 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken b-creatine kinase gene, 3' end.
ACCESSION   M35381
KEYWORDS    B-creatine kinase.
SOURCE      Chicken brain DNA, clone CK-B.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 274)
  AUTHORS   Kwiatkowski,R.W., Ehrismann,R., Schweinfest,C.W. and Dottin,R.P.
  TITLE     Accumulation of creatine kinase mRNA during myogenesis: Molecular
            cloning of a b-creatine kinase cDNA
  JOURNAL   Dev. Biol. 112, 84-88 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       51     M-creatine kinase (AA at 1)
                 97      243     M-creatine kinase
BASE COUNT       60 a     81 c     75 g     58 t
ORIGIN      
        1 ctgcagaaac gaggcacagg tggtgtggac acagctgcag ccgaagcctc ttgaggagcc
       61 cccccccccc cccccccccc tgcagtcgga ggggtgtttg atgtatccaa tgctgatcgt
      121 cttggcttct ctgaggtaga gctggtgcag atggtggtag atggtgtgaa gctgctcatt
      181 gaaatggaaa aacgccttga gaaaggccag tccattgatg acctcatgcc agctcagaaa
      241 taaagcactt tattctcccc ccgcccccct gcag
//
LOCUS       CHKCMYBA1     172 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken c-myb gene, exon 1.
ACCESSION   M35506
KEYWORDS    c-myb protein.
SEGMENT     1 of 4
SOURCE      Chicken hematopoietic cell DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 172)
  AUTHORS   Soret,J., Vellard,M., Martinerie,C. and Perbal,B.
  TITLE     Organization of 5'-proximal c-myb exons in chicken DNA:
            Implications for c-myb tissue-specific transcription
  JOURNAL   FEBS Lett. 232, 227-234 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        140  +   162     thymic c-myb protein, exon 1
    IVS         163  >   172     thymic c-myb protein intron A
BASE COUNT       16 a     73 c     62 g     21 t
ORIGIN      
        1 aacctgtttc ctcctcctcc tcccgcggct cctcggcact tcgctgcgcg gcggcggagc
       61 gcctgggcgc agcccggcag cagggcgcgg cggcgggcgc gggctccccc gtcctcttcc
      121 cccagcggcc gccgcgagga tggcccggag accccggcac aggtaacggg gc
//
LOCUS       CHKCMYBA2     290 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken c-myb gene, exon 2.
ACCESSION   M35507
KEYWORDS    c-myb protein.
SEGMENT     2 of 4
SOURCE      Chicken hematopoietic cell DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 290)
  AUTHORS   Soret,J., Vellard,M., Martinerie,C. and Perbal,B.
  TITLE     Organization of 5'-proximal c-myb exons in chicken DNA:
            Implications for c-myb tissue-specific transcription
  JOURNAL   FEBS Lett. 232, 227-234 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +   163  +   280     thymic c-myb protein, exon 2
    pep$        200  +   280     yolk sac c-myb protein, exon 1
    IVS     <     1      162     thymic c-myb protein intron A
    IVS         281  >   290     thymic c-myb protein intron B
    IVS         281  >   290     yolk sac c-myb protein intron A
BASE COUNT       78 a     59 c     71 g     82 t
ORIGIN      
        1 aatactagaa caacagaatg cagaacaaaa tcttgttgtg caagttttaa aacttttatc
       61 tgtgtaacct ttgtaaagat tgtgtgtttt ctgtgtgctg tgtaattaag tagctccagt
      121 cctcacgttc cgctctgtgt ctcgctgtgt ttcctgctgc agcatataca gcagcgatga
      181 cgatgaagaa gatgttgaga tgtacgacca cgattacgac ggcctgcttc ctaaggctgg
      241 gaaacgtcac ctagggaaaa ccaggtggac ccgtgaagag gtaattgccg
//
LOCUS       CHKCMYBA3      92 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken c-myb gene, exon 3.
ACCESSION   M35508
KEYWORDS    c-myb protein.
SEGMENT     3 of 4
SOURCE      Chicken hematopoietic cell DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 92)
  AUTHORS   Soret,J., Vellard,M., Martinerie,C. and Perbal,B.
  TITLE     Organization of 5'-proximal c-myb exons in chicken DNA:
            Implications for c-myb tissue-specific transcription
  JOURNAL   FEBS Lett. 232, 227-234 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +    82     thymic c-myb protein, exon 3
    pep$    +    11  +    82     yolk sac c-myb protein, exon 2
    IVS     <     1       10     thymic c-myb protein intron B
    IVS          83  >    92     thymic c-myb protein intron C
    IVS     <     1       10     yolk sac c-myb protein intron A
    IVS          83  >    92     yolk sac c-myb protein intron B
BASE COUNT       32 a     16 c     24 g     20 t
ORIGIN      
        1 aatgatccag gatgagaaac tgaagaaact tgtggaacag aatggcacag aagactggaa
       61 agtcattgcc agtttccttc ctgtaggtag ac
//
LOCUS       CHKCMYBA4      34 bp ds-DNA             VRT       27-JUL-1990
DEFINITION  Chicken c-myb gene, exon 4.
ACCESSION   M35509
KEYWORDS    c-myb protein.
SEGMENT     4 of 4
SOURCE      Chicken hematopoietic cell DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 34)
  AUTHORS   Soret,J., Vellard,M., Martinerie,C. and Perbal,B.
  TITLE     Organization of 5'-proximal c-myb exons in chicken DNA:
            Implications for c-myb tissue-specific transcription
  JOURNAL   FEBS Lett. 232, 227-234 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  >    34     thymic c-myb protein, exon 4
    pep$    +    11  >    34     yolk sac c-myb protein, exon 3
    IVS     <     1       10     thymic c-myb protein intron C
    IVS     <     1       10     yolk sac c-myb protein intron B
BASE COUNT       10 a      6 c      8 g     10 t
ORIGIN      
        1 atttcattag aatcggacag atgttcagtg ccag
//
LOCUS       MUSFOS       3673 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Mouse c-fos gene; cellular homolog to viral oncogene.
ACCESSION   J00370
KEYWORDS    fos oncogene; oncogene.
SOURCE      Mouse DNA, clones pc-fos(mouse)-1 and lambda-c-fos(mouse)-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3548)
  AUTHORS   Van Beveren,C., van Straaten,F., Curran,T., Mueller,R. and
            Verma,I.M.
  TITLE     Analysis of FBJ-MuSV provirus and c-fos (mouse) gene reveals that
            viral and cellular fos gene products have different carboxy termini
  JOURNAL   Cell 32, 1241-1255 (1983)
  STANDARD  full staff_review
REFERENCE   2  (bases 2799 to 3673)
  AUTHORS   Meijlink,F., Curran,T., Miller,A.D. and Verma,I.M.
  TITLE     Removal of a 67-base-pair sequence in the noncoding region of
            protooncogene fos converts it to a transforming gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 4987-4991 (1985)
  STANDARD  simple staff_review
COMMENT     A TATA box was found at bp 101-106 and a poly-A signal at
            3493-3498.  [1] also reported the proviral FBJ-MuSV sequence, which
            is found in the viral section of the data base.
FEATURES       from  to/span     description
    pept        284      424     c-fos protein exon 1
               1179     1430     c-fos protein exon 2
               1836     1943     c-fos protein exon 3
               2061     2702     c-fos protein exon 4
    pre-msg     133  >  2702     c-fos protein mRNA
    IVS         425     1178     c-fos protein intron A
    IVS        1431     1835     c-fos protein intron B
    IVS        1944     2060     c-fos protein intron C
    variant    2856     2856     g in [1]; c in [2]
    variant    3135     3136     gt in [1]; gtt in [2]
BASE COUNT      817 a    987 c    974 g    895 t
ORIGIN      997 bp upstream of EcoRI site.
        1 gagttgacga cagagcgccc gcagagggcc ttggggcgcg cttccccccc cttccagttc
       61 cgcccagtga cgtaggaagt ccatccattc acagcgcttc tataaaggcg ccagctgagg
      121 cgcctactac tccaaccgcg actgcagcga gcaactgaga agactggata gagccggcgg
      181 ttccgcgaac gagcagtgac cgcgctccca cccagctctg ctctgcagct cccaccagtg
      241 tctacccctg gaccccttgc cgggctttcc ccaaacttcg accatgatgt tctcgggttt
      301 caacgccgac tacgaggcgt catcctcccg ctgcagtagc gcctccccgg ccggggacag
      361 cctttcctac taccattccc cagccgactc cttctccagc atgggctctc ctgtcaacac
      421 acaggtgagt ttggctttgt gtagccgcca ggtccgcgct gagggtcgcc gtggaggaga
      481 cactggggtg tgactcgcag gggcgggggg gtcttccttt ttcgctctgg agggagactg
      541 gcgcggtcag agcagcctta gcctgggaac ccaggacttg tctgagcgcg tgcacacttg
      601 tcatagtaag acttagtgac cccttcccgc gcggcaggtt tattctgagt ggcctgcctg
      661 cattcttctc tcggccgact tgtttctgag atcagccggg gccaacaagt ctcgagcaaa
      721 gagtcgctaa ctagagtttg ggaggcggca aaccgcggca atcccccctc ccggggcagc
      781 ctggagcagg gaggagggag gagggaggag ggtgctgcgg gcgggtgtgt aaggcagttt
      841 cattgataaa aagcgagttc attctggaga ctccggagca gcgcctgcgt cagcgcagac
      901 gtcagggata tttataacaa accccctttc gagcgagtga tgccgaaggg ataacgggaa
      961 cgcagcagta ggatggagga gaaaggctgc gctgcggaat tcaagggagg atattgggag
     1021 agcttttatc tccgatgagg tgcatacagg aagacataag cagtctctga ccggaatgct
     1081 tctctctccc tgcttcatgc gacactaggg ccacttgctc cacctgtgtc tggaacctcc
     1141 tcgctcacct ccgctttcct ctttttgttt tgtttcagga cttttgcgca gatctgtccg
     1201 tctctagtgc caactttatc cccacggtga cagccatctc caccagccca gacctgcagt
     1261 ggctggtgca gcccactctg gtctcctccg tggccccatc gcagaccaga gcgccccatc
     1321 cttacggact ccccacccag tctgctgggg cttacgccag agcgggaatg gtgaagaccg
     1381 tgtcaggagg cagagcgcag agcatcggca gaaggggcaa agtagagcag gtgagcagcg
     1441 attctggacc tttgtgggct gggggggggg gggggggcgg agactgacgc acagaccaca
     1501 caacagagaa gggacgctac tgactgcact tcctgaccag gagctgtggc tgctagccct
     1561 ttccctccct tgtcagattt tgacagttgg acccaagaca aactctagac agtttccctg
     1621 acagcttcct acttcattct ctagccgggg agcttctttg ttcccctgct aaagatctca
     1681 ctttaaatgc aaatcacact ctgcctgcca actgcaggtt agaaaaactg cttcaccgag
     1741 aggtgcgggt gctgtaggag ccagtttcac tggggtgact gaatggaggt gacactagac
     1801 aaccttaact gaatgttggt ccttttcttc tatagctatc tcctgaagag gaagagaaac
     1861 ggagaatccg aagggaacgg aataagatgg ctgcagccaa gtgccggaat cggaggaggg
     1921 agctgacaga tacactccaa gcggtaggtt gaaccagctg ctgctcctga aactttatta
     1981 aagttggagc ttgggactat gggcgcaggg tccttgagca tgcccgtgtc ttatgctttc
     2041 ttatatctct ccctatgcag gagacagatc aacttgaaga tgagaagtct gcgttgcaga
     2101 ctgagattgc caatctgctg aaagagaagg aaaaactgga gtttattttg gcagcccacc
     2161 gacctgcctg caagatcccc gatgaccttg gcttcccaga ggagatgtct gtggcctccc
     2221 tggatttgac tggaggtctg cctgaggctt ccaccccaga gtctgaggag gccttcaccc
     2281 tgccccttct caacgaccct gagcccaagc catccttgga gccagtcaag agcatcagca
     2341 acgtggagct gaaggcagaa ccctttgatg acttcttgtt tccggcatca tctaggccca
     2401 gtggctcaga gacctcccgc tctgtgccag atgtggacct gtccggttcc ttctatgcag
     2461 cagactggga gcctctgcac agcaattcct tggggatggg gcccatggtc acagagctgg
     2521 agcccctgtg tactcccgtg gtcacctgta ctccgggctg cactacttac acgtcttcct
     2581 ttgtcttcac ctaccctgaa gctgactcct tcccaagctg tgccgctgcc caccgaaagg
     2641 gcagcagcag caacgagccc tcctccgact ccctgagctc acccacgctg ctggccctgt
     2701 gagcagtcag agaaggcaag gcagccggca tccagacgtg ccactgcccg agctggtgca
     2761 ttacagagag gagaaacacg tcttccctcg aaggttcccg tcgacctagg gaggacctta
     2821 cctgttcgtg aaacacacca ggctgtgggc ctcaaggact tgcaagcatc cacatctggc
     2881 ctccagtcct cacctcttcc agagatgtag caaaaacaaa acaaaacaaa acaaaaaacc
     2941 gcatggagtg tgttgttcct agtgacacct gagagctggt agttagtaga gcatgtgagt
     3001 caaggcctgg tctgtgtctc ttttctcttt ctccttagtt ttctcatagc actaactaat
     3061 ctgttgggtt cattattgga attaacctgg tgctggattg tatctagtgc agctgatttt
     3121 aacaatacct actgtgttcc tggcaatagc gtgttccaat tagaaacgac caatattaaa
     3181 ctaagaaaag ataggacttt attttccagt agatagaaat caatagctat atccatgtac
     3241 tgtagtcctt cagcgtcaat gttcattgtc atgttactga tcatgcattg tcgaggtggt
     3301 ctgaatgttc tgacattaac agttttccat gaaaacgttt ttattgtgtt ttcaatttat
     3361 ttattaagat ggattctcag atatttatat ttttatttta tttttttcta ccctgaggtc
     3421 tttcgacatg tggaaagtga atttgaatga aaaattttaa gcattgtttg cttattgttc
     3481 caggacattg tcaataaaag catttaagtt gaatgcgacc accttcttgc tctctttatt
     3541 ctcagttttg tatggtttca ggaaggcctc tgaggagacc agtttgtcaa gatgggtggg
     3601 tcctggaggg gaacacgccc tctgtcccct tgtcactcag aggacacgta gttcagggta
     3661 ttttacagat gtg
//
LOCUS       MUSIGHACX     312 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW.
ACCESSION   M34880
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone C10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig kappa-chain V-J-region (AA at 1)
    recomb      303      304     V-region end/J-region start
BASE COUNT       83 a     77 c     72 g     80 t
ORIGIN      Chromosome 6.
        1 gacattgtga tgacacagtc tccatcctcc ctggctatgt cagtaggaca gaaggtcact
       61 atgaggtgca agtccagtca gagcctttta aatagtagca gtcaaaagaa ctatttggcc
      121 tggtaccagc agaaaccagg acagtctcct aaacttctgg tatactttgc atccactagg
      181 gaatctgggg tccctgatcg cttcataggc agtggatctg ggacagattt cactcttacc
      241 atcagcagtg tgcaggctga agacctggca gattacttct gtcagcaaca ttatagcact
      301 ccgtacacgt tc
//
LOCUS       MUSIGHACY     312 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW.
ACCESSION   M34883
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig kappa-chain V-J-region (AA at 1)
    recomb      303      304     V-region end/J-region start
BASE COUNT       80 a     78 c     68 g     83 t      3 others
ORIGIN      Chromosome 6.
        1 gatgttttga tgacccaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc
       61 atctcttgca gatctagtca gaacattgta catactnnna atggaaacac ctatttagaa
      121 tggtacctgc agaaaccagg ccagtctcca aagctcctga tctacaaagt ttccaaccga
      181 ttttctgggg tcccagacag gttcagtggc agtggatcag ggacagattt cacactcaag
      241 atcagcagag tggaggctga ggatctggga gtttattact gctttcaagg ttcacatgtt
      301 cctccgacgt tc
//
LOCUS       MUSIGHACZ     312 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW.
ACCESSION   M34884
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM4.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig kappa-chain V-J-region (AA at 1)
    recomb      303      304     V-region end/J-region start
BASE COUNT       78 a     76 c     72 g     83 t      3 others
ORIGIN      Chromosome 6.
        1 gatgttttga tgacccaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc
       61 atctcttgca gattcagtca gagcattgta catactnnna atggaaacac ctatttagaa
      121 tggtacctgc agaaaccagg ccagtctcca aagctcctga tctacagagt ttccaaccga
      181 ttttctgggg tcccagacag gttcagtggc agtggatcag ggacagattt cacactcaag
      241 atcagcagag tggaggctga ggatctggga gtttattact gctttcaagg ttcacatgtt
      301 ccgtggacgt tc
//
LOCUS       MUSIGHADA     312 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW.
ACCESSION   M34885
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone CR1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig kappa-chain V-J-region (AA at 1)
    recomb      303      304     V-region end/J-region start
BASE COUNT       69 a     75 c     78 g     87 t      3 others
ORIGIN      Chromosome 6.
        1 gatattgtga tgactcaggc tgcaccctct gtacctgtca ctcctggaga gtcagtatcc
       61 atctcctgca ggtctagtaa gagtctcctg catagtnnna atggcaacac ttacttgtat
      121 tggttcctgc agaggccagg ccagtctcct cagctcctga tatatcggat gtccaacctt
      181 gcctcaggcg tcccagacag gttcagtggc agtgggtcag gaactgcttt cacactgaga
      241 atcagtagag tggaggctga ggatgtgggt gtttattact gtatgcaaca tctagaatat
      301 cctttgacgt tc
//
LOCUS       MUSIGHADB     312 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma A/J.
ACCESSION   M34886
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse hybridoma A/J, cDNA to mRNA, clone AF1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig kappa-chain V-J-region (AA at 1)
    recomb      303      304     V-region end/J-region start
BASE COUNT       86 a     73 c     62 g     73 t     18 others
ORIGIN      Chromosome 6.
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gggcaagtca gnnnnnnnnn nnnnnnnnng acattagcaa ttatttaaac
      121 tggtatcagc agaaaccaga tggaactgtt aaactcctga tctactacac atcaagatta
      181 cactcaggag tcccatcaag gttcagtggc agtgggtctg gaacagatta ttctctcacc
      241 attagcaacc tggagcaaga agatattgcc acttactttt gccaacaggg taatacgctt
      301 ccgtggacgt tc
//
LOCUS       MUSIGHADC     312 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma A/J.
ACCESSION   M34887
KEYWORDS    immunoglobulin light chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse hybridoma A/J, cDNA to mRNA, clone AF2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig kappa-chain V-J-region (AA at 1)
    recomb      303      304     V-region end/J-region start
BASE COUNT       87 a     79 c     70 g     76 t
ORIGIN      Chromosome 6.
        1 gacattgtga tgacacagtc tccatcctcc ctgactgtca cagcaggaga gaaggtcact
       61 atgagttgca agtccagtca gagtctgtta aacagtggaa atcaaaacaa ctacttgacc
      121 tggtaccagc agaaaccagg gcagcctcct aaactgttaa tctactgggc atccactaga
      181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc
      241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatcgttat
      301 ccgctcacat tc
//
LOCUS       MUSIGHADG     327 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW.
ACCESSION   M34888
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone C10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 327)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   327     Ig H-chain V-D-J-region (AA at 1)
    recomb      294      295     V-region end/D-region start
    recomb      297      298     D-region end/J-region start
BASE COUNT       87 a     78 c     87 g     75 t      0 others
ORIGIN      Chromosome 12.
        1 gaggtccagc tgcaacagtc tggacctgag ctggtgaagc ctggggattc agtgaagatg
       61 tcctgcaagg cttctggcta cacattcact gactcctaca tggactgggt gaagcagagc
      121 cctggaaaga gccttgagtg gattggatat atttatccta acaatggtgg tactaactac
      181 aatcagaagt tcaagggcaa ggccacattg actgtagaca agtcctccag cacagcctac
      241 atggaactcc acagcctgac atctgaggac tctgcagtct attactgtgc aagatggact
      301 tactggggcc aagggactct ggtcact
//
LOCUS       MUSIGHADH     345 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW.
ACCESSION   M34889
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 345)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   345     Ig H-chain V-D-J-region (AA at 1)
    recomb      293      294     V-region end/D-region start
    recomb      309      310     D-region end/J-region start
BASE COUNT       81 a     93 c     86 g     85 t      0 others
ORIGIN      Chromosome 12.
        1 cagcctgggg ctgagcttgt gaagcctggg acttctgtga agttgtcctg caggccttct
       61 ggcttcacct acaccgactt ctggattcac tgggtgaaac agaggcctgg acaaggcctt
      121 gagtggattg gagagattaa tcctagcaat ggtggtacta gctacaatga gaagttcaag
      181 accaaggcca cactgactgc agacaaatcc tccagcacag tccttacatg caactcaacg
      241 agcctgacat ctgacgactc tgcggtctat ttctgtacaa gaccccctct ctcctatggt
      301 aacttcgcct ggtttactta ctggggccaa gggactctgg tcact
//
LOCUS       MUSIGHADI     264 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW.
ACCESSION   M34890
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM4.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   264     Ig H-chain V-D-J-region (AA at 1)
    recomb      223      224     V-region end/D-region start
    recomb      230      231     D-region end/J-region start
BASE COUNT       70 a     68 c     67 g     59 t      0 others
ORIGIN      Chromosome 12.
        1 ttcactacct actggatgca ctgggtgaag cagaggcctg gacaaggcct tgagtggatc
       61 ggagagattg atccttctga tacttatact taccacaatc aaaacttcaa gggcaaggcc
      121 acattgactg tagacaaatc ctccagcaca gcctacatgc aactcagcag cctgacatct
      181 gaggactctg cggtctatta ctgtgcaaga aggtcgatgg gccactacgg gcttgcatac
      241 tggggccaag ggactctggt cact
//
LOCUS       MUSIGHADJ     329 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW.
ACCESSION   M34891
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone CR1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 329)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   329     Ig H-chain V-D-J-region (AA at 1)
    recomb      273      274     V-region end/D-region start
    recomb      294      295     D-region end/J-region start
BASE COUNT       80 a     84 c     82 g     83 t
ORIGIN      Chromosome 12.
        1 ttagtgaagc ttggagggtc cctgcaactc tcctgtgcag cctctggatt cactttcagt
       61 agctattaca tgtcttgggt tcgccagact cctgagaaga ggctggagtg ggtcgcaacc
      121 attagtaatg gtggtgttag gcacctacta tccagacagt gtgaaggacc gcttcacatc
      181 tccagagaca atgccaggaa caccctctac ctgcgaatga gcagtctgaa ttctgaggac
      241 acagccgtgt attactgtgc aagagatgga tcttatggta actaccacgt accttacttt
      301 gactactggg gccaaggcac cactctcac
//
LOCUS       MUSIGHADK     271 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J-region hybridoma A/J.
ACCESSION   M34892
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone AF1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 271)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   271     Ig H-chain V-D-J-region (AA at 2)
    recomb      226      227     V-region end/D-region start
    recomb      238      239     D-region end/J-region start
BASE COUNT       69 a     63 c     73 g     66 t      0 others
ORIGIN      Chromosome 12.
        1 attcactttc agtagctatg ccatgtcttg ggttcgccag actccagaga agaggctgga
       61 gtgggtcgca tccattaata gtggtggtag tacctactat ccagacagtg tgaagggccg
      121 attcaccatc tccagagata atggcaggaa caccctgtac ctgcaaatga gcagtctgag
      181 gtctgaggac acggccatgt attattgtgg aagaggcagt tacccctatt actatggtct
      241 ggactactgg ggtcaaggaa cctcagtcac c
//
LOCUS       MUSIGHADL     348 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig H-chain mRNA V-D-J-region hybridoma A/J.
ACCESSION   M34893
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse hybridoma C3H.SW, cDNA to mRNA, clone AF2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Andria,M.L., Levy,S. and Benjamini,E.
  TITLE     Diverse V-H and V-L genes are used to produce antibodies against a
            defined protein epitope
  JOURNAL   J. Immunol. 144, 2614-2619 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   348     Ig H-chain V-D-J-region (AA at 1)
    recomb      303      304     V-region end/D-region start
    recomb      315      316     D-region end/J-region start
BASE COUNT       86 a     80 c     99 g     83 t      0 others
ORIGIN      Chromosome 12.
        1 gaagtgaagc tggtggagtc tgggggaggc ttagtgacgc ctggagggtc cctgaaactc
       61 tcctgtgcag cctctggatt cactttcagt agctatgcca tgtcttgggt tcgccagact
      121 ccagagaaga ggctggaatg ggtcgcatcc attaataatg gtggtaatac ctactatcca
      181 gacagtgtga agggccgatt caccatctcc agagataatg gcaggaacac cctgtacctg
      241 caaatgagca gtctgaggtc tgaggacacg gccatgtatt actgtggaag aggcagttat
      301 ccctattact atggtctaga ctactggggt caaggaacct cagtcacc
//
LOCUS       NEUMTNDCO     839 bp ds-DNA             ORG       27-JUL-1990
DEFINITION  N.crassa mitochondrial pseudo-NADH dehydrogenase (ND5) gene, 3'
            end, and cytochrome oxidase subunit 2 (co2) gene, 5' end.
ACCESSION   M35513
KEYWORDS    NADH dehydrogenase; cytochrome oxidase subunit 2; pseudogene.
SOURCE      N.crassa (strain RL3-8A) stopper mutant ER-3 mitochondrial DNA.
  ORGANISM  Mitochondrion Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa.
REFERENCE   1  (bases 1 to 839)
  AUTHORS   Almasan,A. and Mishra,N.C.
  TITLE     Molecular characterization of the mitochondrial DNA of a new
            stopper mutant ER-3 of Neurospora crassa
  JOURNAL   Genetics 120, 935-945 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps >     1      270     pseudo-NADH dehydrogenase (ND5; AA at 1)
    pept        645  >   839     cytochrome oxidase subunit 2 (co2)
BASE COUNT      313 a    124 c    119 g    283 t
ORIGIN      
        1 tgtatgaaat atttagctga aatattagga ggtaaaaaac atttattaaa aagttacgaa
       61 ggttacaatg taacagttaa taccactaaa ttatctccta ttgtacaata ttttaatctt
      121 tatcctttaa aaactaaaaa gtatattaca tattttaact gaataaaaat atataagtta
      181 gtaatagata aaaaacataa tgatcctgaa aatttattat tgattacgaa atataaaaat
      241 aacattaata aatctgatta taataaataa atcaactttc ggcggacccc gccccttcgg
      301 ggcgggggcc cgtttttagt aataactaaa aattaaacac agctacataa atgaagatat
      361 agtccgatca gttaagtaat taactgcgta ttccacccct cccttccccc gccccgaagg
      421 cgcccccgaa aaaacaaagg acgctagcgt agctaacctt atgagcagca ggctataaaa
      481 acccattagg cacagggtat acttcgttag tggataagaa ttagcctaag ccttaatata
      541 caatacagtt taatgttata tttaaattct atatttatga aaagatataa tatatttatt
      601 attatggttt aatattcaat atagaatatt ctatttcttt ttttatggga ttattattta
      661 ataatttaat tatgaatttt gatgctccaa gcccatgagg tatctatttt caagatagtg
      721 ctactcctca gatggaaggt ttagttgaat tacatgacaa tattatgtac tatctggtgg
      781 taatactatt tgttgtagga tgaatcttat tatctataat aagaaattat atcagtaca
//
LOCUS       PUMCHSA1       90 bp ds-DNA             PLN       27-JUL-1990
DEFINITION  P.crispum chalcone synthase gene, 5' flank.
ACCESSION   M35515
KEYWORDS    chalcone synthasse.
SEGMENT     1 of 2
SOURCE      P.crispum (strain Mooskrause) DNA, clone PcCHS-1.
  ORGANISM  Petroselinum crispum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Apiales; Apiaceae.
REFERENCE   1  (bases 1 to 90)
  AUTHORS   Herrmann,A., Schulz,W. and Hahlbrock,K.
  TITLE     Two alleles of the single-copy chalcone synthase gene in parsley
            differ by a transposon-like element
  JOURNAL   Mol. Gen. Genet. 212, 93-98 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt          36       43     5' insertion target sequence
    rpt          44       58     5' inverted terminal repeat
    trns         44  >    90     transposon-like sequence
BASE COUNT       29 a     14 c     23 g     24 t
ORIGIN      
        1 tgaaaatgca tgttacaaac gttccctaat aatatagtag aagtagggtg taaatgagtc
       61 gatacgctca cgaacagttc ggtgttcggg
//
LOCUS       PUMCHSA2      732 bp ds-DNA             PLN       27-JUL-1990
DEFINITION  P.crispum chalcone synthase gene, exon 1.
ACCESSION   M35516
KEYWORDS    chalcone synthasse.
SEGMENT     2 of 2
SOURCE      P.crispum (strain Mooskrause) DNA, clone PcCHS-1.
  ORGANISM  Petroselinum crispum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Apiales; Apiaceae.
REFERENCE   1  (bases 1 to 732)
  AUTHORS   Herrmann,A., Schulz,W. and Hahlbrock,K.
  TITLE     Two alleles of the single-copy chalcone synthase gene in parsley
            differ by a transposon-like element
  JOURNAL   Mol. Gen. Genet. 212, 93-98 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        725  >   732     chalcone synthase, exon 1
    mRNA        591  >   732     chalcone synthase mRNA (alt.)
    mRNA        639  >   732     chalcone synthase mRNA (alt.)
    trns    <     1       53     transposon-like sequence
    rpt          38       53     3' inverted terminal repeat
    rpt          54       61     3' insertion target sequence
BASE COUNT      245 a    139 c    126 g    222 t
ORIGIN      About 810 bp after segment 1.
        1 ctgaaacgtg aacacttaag aattcggctc ggttcggctc atttacagcc ctaagtagaa
       61 gtgatccata gcattttata ataactaaca ttacatatat attacccttt ttctacattc
      121 gtgtagagag aaaatgactc aataattcaa ttagtattat agaatttctt gttggtaact
      181 cttaatttta tttattaata gaataacgga aacgaaacga gatgggattg gtagatggga
      241 aacagaaata ggcacgtcac ggggtgtatg tatttattaa aaacaatggc atataggtaa
      301 gttgaggata acaattaaat cgtggtttca ttacaatagt tctgaaagaa gaggtagaat
      361 cgattgtcct tcacttgatg tatctactca ctttggatgt acgtggaggc ctaaatcccc
      421 accatggttg caagtagcag ttacatccac ataactcctc ccttattcca cgtggccatc
      481 cggtggccgt ccctccaacc taacctccct tgaatttcta tcagtaaatt ttcaaccctc
      541 tctcttctct gtcgtcacta tataatttcc ctacctacat ctctcttgat atctctttct
      601 ctctataaat atttaacaag gagaaaggaa aaataacaac acaaatcaga cagatcagat
      661 catcccttta cataaagata aggggtaagg gggggtttat agagagattt ataaaaaaaa
      721 aaaaatggca aa
//
LOCUS       R11ORIA       589 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  Plasmid R1162 origin of replication region DNA.
ACCESSION   M35512
KEYWORDS    origin of replication.
SOURCE      Plasmid R1162 DNA.
  ORGANISM  Plasmid R1162
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 589)
  AUTHORS   Meyer,R.J., Lin,L.-S., Kim,K. and Brasch,M.A.
  TITLE     Broad host-range plasmid R1162: Replication, incompatibility, and
            copy-number control
  JOURNAL   Basic Life Sci. 30, 173-188 (1985)
  STANDARD  simple staff_review
BASE COUNT      114 a    156 c    199 g    120 t
ORIGIN      
        1 ccgggctgaa tgatcgaccg agacaggccc tgcggggctg cacacgcgcc cccacccttc
       61 gggtaggggg aaaggccgct aaagcggcta aaagcgctcc agcgtatttc tgcggggttt
      121 ggtgtggggt ttagcgggct ttgcccgcct ttccccctgc cgcgcagcgg tggggcggtg
      181 tgtagcctag cgcagcgaat agaccagcta tccggcctct gggcatattg ggcagggcag
      241 cagcgcccca cagggcgtga ctaaccgcgc ctagtggatt attcttagat aatcatggat
      301 ggatttttcc aacaccccgc cagcccccgc ccctgctggg tttgcaggtt tgggggcgtg
      361 acagttattg caggggttcg tgacagttat tgcagggggg cgtgacagtt attgcagggg
      421 ttcgtgacag ttagtacggg atgacgggca ctggctggca atgtctagca acggcaggca
      481 tgtcggctga cggtaaaaca actttccgct aagcgataga ctgtatgtga aacacagtat
      541 tgcaaggacg cggaacatgc ctcatgtggc ggccaggacg gccagccgg
//
LOCUS       STRSPEC       936 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  S.pyogenes streptococcal pyrogenic exotoxin type C (speC) gene,
            complete cds.
ACCESSION   M35514
KEYWORDS    pyrogenic exotoxin.
SOURCE      S.pyogenes DNA.
  ORGANISM  Streptococcus pyogenes
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Streptococcaceae.
REFERENCE   1  (bases 1 to 936)
  AUTHORS   Goshorn,S.C. and Schlievert,P.M.
  TITLE     Nucleotide sequence of Streptococcal pyrogenic exotoxin type C
  JOURNAL   Infect. Immun. 56, 2518-2520 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        154      861     streptococcal pyrogenic exotoxin type C (speC)
                                 precursor
    sigp        154      234     streptococcal pyrogenic exotoxin type C signal
                                 peptide (put.)
    matp        235      858     streptococcal pyrogenic exotoxin type C
BASE COUNT      364 a    144 c    127 g    301 t
ORIGIN      
        1 caaccttgac tatttaaatg gaactgccac tcctaaaaac taaaatataa atacatttat
       61 aaaatttcta aataaacaga aatctgattt ttaactactt actgctattt catgtattct
      121 cgtacgagta atacatttaa ttaaggagaa aaaatgaaaa agattaacat catcaaaata
      181 gttttcataa ttacagtcat actgatttct acttatttca cctatcatca aagtgactct
      241 aagaaagaca tttcgaatgt taaaagtgat ttactttatg catacactat aactccttat
      301 gattataaag attgcagggt aaatttttca acgacacaca cattaaacat tgatactcaa
      361 aaatatagag ggaaagacta ttatattagt tccgaaatgt cttatgaggc ctctcaaaaa
      421 tttaaacgag atgatcatgt agatgttttt ggattatttt atattcttaa ttctcacacc
      481 ggtgagtaca tctatggagg aattacgcct gctcaaaata ataaagtaaa tcataaatta
      541 ttgggaaatc tatttatttc gggagaatct caacagaact taaataacaa gattattcta
      601 gaaaaggata tcgtaacttt ccaggaaatt gactttaaaa tcagaaaata ccttatggat
      661 aattataaaa tttatgacgc tacttctcct tatgtaagcg gcagaatcga aattggcaca
      721 aaagatggga aacatgagca aatagactta tttgactcac caaatgaagg gactagatca
      781 gatatttttg caaaatataa agataataga attatcaata tgaagaactt tagtcatttc
      841 gatatttatc ttgaaaaata attcatcata cacaaaaaac cgcccagaat aatctgagcg
      901 gttttgtctt atctcggagc tttacctcct aattta
//
LOCUS       HUMTROPI      816 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human slow-twitch skeletal troponin I (TNN1) mRNA, complete cds.
ACCESSION   J04760
KEYWORDS    slow-twitch skeletal troponin I; troponin I.
SOURCE      Human adult slow-twitch skeletal muscle, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 816)
  AUTHORS   Wade,R., Eddy,R., Shows,T.B. and Kedes,L.
  TITLE     cDNA sequence, tissue-specific expression, and chromosomal mapping
            of the human slow-twitch skeletal muscle isoform of troponin I
  JOURNAL   Genomics 7, 346-357 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Wade, 15-MAR-1990.
FEATURES       from  to/span     description
    pept         77      640     slow-twitch skeletal troponin I (TNN1)
    mRNA    <     1      816     TNN1 mRNA
BASE COUNT      180 a    247 c    244 g    145 t
ORIGIN      Chromosome 1q12-qter.
        1 tagtctgcag tctacggcga ggcacaggcc agcccagctc cacgaggact gaacaaggtg
       61 ctgtctcact gccaccatgc cggaagtcga gagaaaaccc aagatcactg cctcccgcaa
      121 actcttgctg aagagcctga tgctggccaa ggccaaggaa tgctgggagc aggagcacga
      181 ggagcgcgag gctgagaagg tgcgctacct ggcagagcgc atccccacgc tgcagacccg
      241 tggcctgtcc ctcagtgccc tgcaggacct gtgccgggag ctgcacgcca aggtggaggt
      301 ggtggatgag gagcgatacg acattgaggc caaatgcctc cacaacacca gggagattaa
      361 ggacctgaag ctgaaggtga tggacctccg tgggaagttc aagcgcccgc ccctgcgtcg
      421 agtccgtgtc tcggctgacg ccatgctccg ggccctgctg ggctccaagc acaaggtgtc
      481 catggatctg cgggccaacc tcaagtctgt gaagaaggaa gacacagaga aggagcggcc
      541 tgtggaggtg ggtgactgga ggaagaacgt ggaggccatg tctggcatgg aaggccggaa
      601 gaagatgttt gatgccgcca atgctccgac ctcacaatag aggccagctt gctgtgctgc
      661 gctctgagct cctgcttcat gcttcttctc caacccagct cactcacctc tctgcctgtg
      721 tctggagcat cccttcccac ctctccccca cttcttccct ccagcctgca atgccctcct
      781 ctggaactgg gattaaacag atacccaaga ggcagg
//
LOCUS       PPB4TSP       349 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  Bovine papillomavirus major transcript, 5' flank.
ACCESSION   M35265
KEYWORDS    major transcript.
SOURCE      Bovine papillomavirus (type 4) DNA.
  ORGANISM  Bovine papillomavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Papillomavirus.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Stamps,A.C. and Campo,M.S.
  TITLE     Mapping of two novel transcripts of Bovine papillomavirus type 4
  JOURNAL   J. Gen. Virol. 69, 3033-3045 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pre-msg     320  >   349     BPV-4 major transcript
BASE COUNT      109 a     54 c     88 g     98 t
ORIGIN      
        1 ctttgcagat gaattagaac aggaggaaca gtaaatagaa tattgctgtg cttttacatt
       61 tgagaaggat tgtagtgtgg gcatttaaaa gctgaccttc cagtcttaat tgcagtaggc
      121 gcctaagagg gtggtggtgg tataagttca agtttttcaa gaaaatgaag gggcagaacg
      181 tgacattaca ggacattgca atagaattag aggatacaat tagtccaatt aacttgcatt
      241 gtgaagagga gattgaaact gaggaggtgg acacccctaa cccttttgca ataacagcaa
      301 cttgttatgc ttgcgagcaa gtccttcgtt tagctgttgt aacgtcaac
//
LOCUS       PPBBPV4X       69 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  Bovine papillomavirus X protein gene, partial cds.
ACCESSION   M35264
KEYWORDS    .
SOURCE      Bovine papillomavirus (type 4) DNA.
  ORGANISM  Bovine papillomavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Papillomavirus.
REFERENCE   1  (bases 1 to 69)
  AUTHORS   Stamps,A.C. and Campo,M.S.
  TITLE     Mapping of two novel transcripts of Bovine papillomavirus type 4
  JOURNAL   J. Gen. Virol. 69, 3033-3045 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >    69     X protein (AA at 3)
BASE COUNT       18 a     27 c     19 g      5 t
ORIGIN      
        1 acaacgcggg acccaaaccg gggaccacgc ccgaggacgt agcagaccgt ccccccgatc
       61 ttccagaga
//
LOCUS       BLIPGECR      314 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  B.licheniformis regulatory extracellular protease gene, complete
            cds.
ACCESSION   M35503
KEYWORDS    extracellular protease.
SOURCE      B.licheniformis DNA, clone pAN2.
  ORGANISM  Bacillus licheniformis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Watanabe,K., Sato,N., Asano,K., Hatanaka,Y., Okada,J.,
            Murata,K. and Kimura,A.
  TITLE     Nucleotide sequence of the gene increasing the extracellular
            proteolytic activities of Bacillus licheniformis; Comparison with
            similar phenotypic genes from other Bacillus sp
  JOURNAL   Agric. Biol. Chem. 51, 2807-2809 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         27      149     regulatory extracellular protease
    binding      12       16     ribosomal binding site (put.)
    binding      20       23     ribosomal binding site (put.)
BASE COUNT      118 a     53 c     55 g     88 t
ORIGIN      
        1 gatcttccaa aaggagtgtg gaaccggtgg aaaagcaaca aattgaagaa ttaaaacaac
       61 tgctttggcg gctagagaat gaaatcagag aaacaaagga ctccttgcgc aagattaaca
      121 aaagcattga tctgttaata caagtttaaa gggatttggt ttctttttca ccgaaaagtt
      181 gcatataaat tcatgcgtta caccgcaaac tttaaataga ccataacacc cattttacac
      241 aacttgcgat tagttctcaa atcctaatga aacataaaat ttttgtttgt ttttcaagga
      301 gtcaaataaa gatc
//
LOCUS       DOGPLBA       832 bp ss-mRNA            MAM       27-JUL-1990
DEFINITION  Dog cardiac phospholamban mRNA, complete cds.
ACCESSION   M35393
KEYWORDS    phospholamban.
SOURCE      Dog heart, cDNA to mRNA, clone pPLB1.
  ORGANISM  Canis sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 832)
  AUTHORS   Uyeda,A., Kitano,K., Fujii,J., Kadoma,M., Tada,M. and Tanaka,S.
  TITLE     Characterization of recombinant cDNA clones for canine cardiac
            phospholamban
  JOURNAL   Nucleic Acids Symp Ser 17, 121-124 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        181      339     cardiac phospholamban (EC 2.7..1.37)
BASE COUNT      277 a    161 c    125 g    269 t
ORIGIN      
        1 agaaaacttt ctaactaaac accgataaga cttcatacaa ctcacaatac tttatattgt
       61 aatcatcaca agagccaagg ctacctaaaa gaagagagtg gttgagctca catttggccg
      121 ccagcttttt acctttctct tcaccattta aaacttgaga cttcctgctt tcctggggtc
      181 atggataaag tccaatacct cactcgctct gctattagaa gagcttcaac cattgaaatg
      241 cctcaacaag cacgtcaaaa tcttcagaac ctatttataa atttctgtct cattttaata
      301 tgtctcttgt tgatctgcat cattgtgatg cttctctgaa gttctgctgc aatctccagt
      361 gatgcaactt gtcaccatca acttaatatc tgccatccca tgaagagggg aaaataatac
      421 tatataacag accacttcta agtagaagat tttacttgtg aaaaggtcaa gattcagaac
      481 aaaagaaatt attaacaaat gtcttcatct gtgggatttt gtaaacatga aaagagcttt
      541 attttcaaaa attaacttca aaatgactat aggtgcgcat aatgtaattg ctgaattcct
      601 caacaaagct tgtaaaagtt tctatgccaa attttttctg agggtaaagt aggagtttag
      661 ttttaaaact gctctgctaa ccagttcact tcacatataa agcattagct tcactatttg
      721 agctaaatat ttatattgta ctgtaaatgc ctatgtaatg tttattaaga tttttcaagt
      781 ctccgctaag tacgaaaata atcatccaaa tgaagtcatc atttgaaata gc
//
LOCUS       DVUFDXA       707 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  D.salexigens flavodoxin gene, complete cds.
ACCESSION   M35475
KEYWORDS    flavodoxin.
SOURCE      D.salexigens (ATCC 14822) DNA.
  ORGANISM  Desulfovibrio salexigens
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 707)
  AUTHORS   Helms,L.R., Krey,G.D. and Swenson,R.P.
  TITLE     Identification, sequence determination, and expression of the
            flavodoxin gene form Desulfovibrio salexigens
  JOURNAL   Biochem. Biophys. Res. Commun. 168, 809-817 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        238      678     flavodoxin
    binding     227      232     ribosome binding site (put.)
    signal      129      135     flavodoxin -35 region
    signal      152      157     flavodoxin -10 region
BASE COUNT      223 a    152 c    149 g    183 t
ORIGIN      
        1 agactaacaa taaaaatttt gttttctcac aaacaacatt ttgaacgcgc atatatcggt
       61 acgtcttaag gagtgcataa caccactcag gcgcggcgta aacactcaat taccgattgg
      121 ctgacatttt cacattttct taatattttt ataaattgat attgactttg aatttcactc
      181 tcactacaaa ctaacacaac gttatcaaac cgattacaga tttataagga ggacactatg
      241 tccaaatcac tgatcgttta cggctctact accggaaata ctgaaacagc cgccgaatac
      301 gtggctgaag cgtttgaaaa caaagaaatt gatgtggaac ttaaaaatgt tactgatgtc
      361 agtgttgccg atctcggcaa cggatacgac atcgtgctat tcggctgctc tacctggggc
      421 gaagaagaaa ttgaattgca ggacgacttc atccccctct acgattccct cgaaaacgca
      481 gacctgaagg gcaagaaagt atctgtcttc ggatgcggtg actccgatta cacttatttt
      541 tgcggtgcag tagatgccat cgaagaaaaa ctcgaaaaaa tgggggccgt cgtcataggt
      601 gacagcctca agatcgatgg cgacccggaa cgcgatgaga ttgtaagctg gggttcagga
      661 atagcggaca aaatttaaat taatacagct cagctatcct tacgaag
//
LOCUS       HUMCKRASA     450 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human PR310 c-K-ras protein mRNA, 5' end.
ACCESSION   M35504
KEYWORDS    c-K-ras protein; oncogene.
SOURCE      Human (patient PR310) lung carcinoma, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 450)
  AUTHORS   Yamamoto,F., Nakano,H., Neville,C. and Perucho,M.
  TITLE     Structure and mechanisms of activation of c-K-ras oncogenes in
            human lung cancer
  JOURNAL   Prog. Med. Virol. 32, 101-114 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >   450     PR310 c-K-ras oncogene
BASE COUNT      155 a     71 c    106 g    118 t
ORIGIN      
        1 atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg
       61 atacagctaa ttgacaatca ttttgtggac gaatatgatc caacaataga ggattcctac
      121 aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt
      181 catgaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt
      241 gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt
      301 aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg
      361 ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct
      421 tttattcaaa catcagcaaa gacaagacag
//
LOCUS       HUMCKRASB     450 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human PR371 c-K-ras oncogene mRNA, 5' end.
ACCESSION   M35505
KEYWORDS    c-K-ras oncogene.
SOURCE      Human (patient PR371) lung carcinoma, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 450)
  AUTHORS   Yamamoto,F., Nakano,H., Neville,C. and Perucho,M.
  TITLE     Structure and mechanisms of activation of c-K-ras oncogenes in
            human lung cancer
  JOURNAL   Prog. Med. Virol. 32, 101-114 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >   450     PR371 c-K-ras oncogene
BASE COUNT      156 a     71 c    105 g    118 t
ORIGIN      
        1 atgactgaat ataaacttgt ggtagttgga gcttgtggcg taggcaagag tgccttgacg
       61 atacagctaa ttgacaatca ttttgtggac gaatatgatc caacaataga ggattcctac
      121 aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt
      181 caagaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt
      241 gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt
      301 aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg
      361 ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct
      421 tttattcaaa catcagcaaa gacaagacag
//
LOCUS       HUMTCRAER     237 bp ds-DNA             PRI       27-JUL-1990
DEFINITION  Human T cell receptor-alpha enhancer region DNA.
ACCESSION   M35477
KEYWORDS    T-cell receptor.
SOURCE      Human T-lymphocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 237)
  AUTHORS   Winto,A. and Baltimore,D.
  TITLE     Developmental regulation of the TCR-alpha-delta locus
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 87-92 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      28       60     NF-alpha-1 protein binding site
    binding     128      139     NF-alpha-2 protein binding site
    binding     159      172     NF-alpha-3 protein binding site
    binding     175      189     NF-alpha-5 protein binding site
    binding     192      203     NF-alpha-4 protein binding site
    mut         180      187     gcagaagc in NF-alpha-5; tactccta in
                                 Mut-alpha-5
    mut         192      197     tcctct in NF-alpha-4; gaagag in Mut-alpha-4
BASE COUNT       73 a     58 c     57 g     49 t
ORIGIN      
        1 agacagctgc accctgaaat gggtaagctg gtcagatagt gaatcaatag ccagaagtag
       61 aacaggaaat ggaaaaagtt tcccacttcc ctccaggtgt ttgggtctga acagcctccc
      121 acttccatga cgtcacggct gctgacatgg gcaaacaggt ccccctttga agctctcccg
      181 cagaagccac atcctctgga aagaggagtt aaaaatacag agttagagat aagatct
//
LOCUS       HUMTGIGMU     352 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human Ig mu-chain transgenic mRNA, partial cds.
ACCESSION   M35478
KEYWORDS    immunoglobulin heavy-chain; mu-immunoglobulin; transgenic gene.
SOURCE      Human leukemic patient mononuclear cell line, cDNA to mRNA, clones
            pTMMG1-[4,5,12].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 352)
  AUTHORS   Shimizu,A., Kinashi,T., Nussenzweig,M.C., Mizuta,T.-R.,
            Leder,P. and Honjo,T.
  TITLE     Molecular mechanism for immunoglobulin double-isotype expression
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 175-181 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   352     mu-immunoglobulin transgene protein (AA at 2)
                                 /hgml_locus_uid="LW0029F" /nomgen="IGHM"
                                 /map="14q32.33"
    recomb      226      227     V region end/D region start
    recomb      274      275     D region end/J6 region start
    recomb      332      333     J6 region end/C1 region start
BASE COUNT       84 a     99 c    104 g     65 t
ORIGIN      
        1 ggcttctgga ggcaccttca gcagctatgc tatcagctgg gtgcgacagg cccctggaca
       61 agggcttgag tggatgggag ggatcatccc tatctttggt acagcaaact acgcacagaa
      121 gttccagggc agagtcacga ttaccgcgga cgaatccacg agcacagcct acatggagct
      181 gagcagcctg agatctgagg acacggccgt gtattactgt gcgaaaaccg ggatcctggg
      241 gcggtatagc agtggctggt acccgaactc ggactactac tactacggta tggacgtctg
      301 gggccaaggg accacggtca ccgtctcctc agccaaaacg acacccccat ct
//
LOCUS       MUSIGHADD     411 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Ig mu-chain mRNA VDJ region, 5' end.
ACCESSION   M35502
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse hybridoma CP8 B3-D3, cDNA to mRNA, clone pCP12.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 411)
  AUTHORS   Reininger,L., Kaushik,A., Izui,S. and Jaton,J.-C.
  TITLE     A member of a new V-H gene family encodes anti-bromelinized mouse
            red blood cell autoantibodies
  JOURNAL   Eur. J. Immunol. 18, 1521-1526 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >   411     Ig mu-chain VDJ region precursor
    sigp          1       57     Ig mu-chain signal peptide
    matp         58  >   411     Ig mu-chain
    recomb      351      352     V region end/D region start
    recomb      360      361     D region end/J1 region start
BASE COUNT      101 a     84 c    118 g    108 t
ORIGIN      
        1 atggagtggg aactgagctt aattttcatt tttgctcttt taaaagatgt ccagtgtgaa
       61 gtgcagctgt tggagactgg aggaggcttg gtgcaacctg gggggtcacg gggactctct
      121 tgtgaaggct cagggtttac ttttagtggc ttctggatga gctgggttcg acagacacct
      181 gggaagaccc tggagtggat tggagacatt aattctgatg gcagtgcaat aaactacgca
      241 ccatccataa aggatcgatt cactatcttc agagacaatg acaagagcac cctgtacctg
      301 cagatgagca atgtgcgatc tgaggacaca gccacgtatt tctgtatgag atatggtaac
      361 tactggtact tcgatgtctg gggcgcaggg accacggtca ccgtctcctc a
//
LOCUS       PRW1707PPS     66 bp ds-DNA             SYN       27-JUL-1990
DEFINITION  Plasmid pRW1707 DNA with polypurine-polypyrimidine sequences.
ACCESSION   M35455
KEYWORDS    .
SOURCE      Synthetic DNA, clone pRW1707.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Shimizu,M., Hanvey,J.C. and Wells,R.D.
  TITLE     Multiple non-B-DNA conformations of polypurine-polypyrimidine
            sequences in plasmids
  JOURNAL   Biochemistry 29, 4704-4713 (1990)
  STANDARD  simple staff_entry
BASE COUNT       36 a      5 c     21 g      4 t
ORIGIN      
        1 ggatccgaag aagaagaaga agaagaagaa gaattcgaag aagaagaaga agaagaagaa
       61 ggatcc
//
LOCUS       PRW1718PPS     72 bp ds-DNA             SYN       27-JUL-1990
DEFINITION  Plasmid pRW1718 DNA with polypurine-polypyrimidine sequences.
ACCESSION   M35460
KEYWORDS    .
SOURCE      Synthetic DNA, clone pRW1718.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 72)
  AUTHORS   Shimizu,M., Hanvey,J.C. and Wells,R.D.
  TITLE     Multiple non-B-DNA conformations of polypurine-polypyrimidine
            sequences in plasmids
  JOURNAL   Biochemistry 29, 4704-4713 (1990)
  STANDARD  simple staff_entry
BASE COUNT       38 a      6 c     22 g      6 t
ORIGIN      
        1 ggatccgaag aagaagaaga agaagaagaa gaattaattc gcgaagaaga agaagaagaa
       61 gaagaaggat cc
//
LOCUS       PRW1724PPS     66 bp ds-DNA             SYN       27-JUL-1990
DEFINITION  Plasmid pRW1724 DNA with polypurine-polypyrimidine sequences.
ACCESSION   M35458
KEYWORDS    .
SOURCE      Synthetic DNA, clone pRW1724.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Shimizu,M., Hanvey,J.C. and Wells,R.D.
  TITLE     Multiple non-B-DNA conformations of polypurine-polypyrimidine
            sequences in plasmids
  JOURNAL   Biochemistry 29, 4704-4713 (1990)
  STANDARD  simple staff_entry
BASE COUNT       19 a      6 c     38 g      3 t
ORIGIN      
        1 ggatccggag gaggaggagg aggaggagga ggatccggag gaggaggagg aggaggagga
       61 ggatcc
//
LOCUS       PRW1725PPS     66 bp ds-DNA             SYN       27-JUL-1990
DEFINITION  Plasmid pRW1725 DNA with polypurine-polypyrimidine sequences.
ACCESSION   M35459
KEYWORDS    .
SOURCE      Synthetic DNA, clone pRW1725.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Shimizu,M., Hanvey,J.C. and Wells,R.D.
  TITLE     Multiple non-B-DNA conformations of polypurine-polypyrimidine
            sequences in plasmids
  JOURNAL   Biochemistry 29, 4704-4713 (1990)
  STANDARD  simple staff_entry
BASE COUNT       27 a      6 c     30 g      3 t
ORIGIN      
        1 ggatccggag gaggaggagg aggaggagga ggatccgaag aagaagaaga agaagaagaa
       61 ggatcc
//
LOCUS       PT1INC3AA     330 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  Plasmid pT181 (from S.aureus) inc3A region DNA with repC gene,
            5' end.
ACCESSION   M35476
KEYWORDS    .
SOURCE      Plasmid pT181 (from Staphylococcus aureus) DNA.
  ORGANISM  Plasmid pT181
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 330)
  AUTHORS   Highlander,S.K. and Novick,R.P.
  TITLE     Mutational and physiological analyses of plasmid pT181 functions
            expressing incompatibility
  JOURNAL   Plasmid 23, 1-15 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         11  <     1 (c) RepC protein
    mRNA        191  <     1 (c) RepC mRNA III
    mRNA        291  <     1 (c) RepC mRNA IV
    RNA         106      190     RNA I
    RNA         106      250     RNA II
    binding      27       22 (c) RepC ribosomal binding site (put.)
    site        151      176     inverted repeat terminator structure
    site        218      249     inverted repeat terminator structure
BASE COUNT      113 a     51 c     37 g    129 t
ORIGIN      
        1 tttttataca taaaaaatcg actccttaat ctcaatttcg tttaaggaat cgctcaccca
       61 aatatatatc ttgatgtata tttaaatatc gtttaatatc taaatataca agattataaa
      121 aacaactcag tgtttttttc tttgaatgat gtcgttcaca aactttggtc agggcgtgag
      181 cgactccttt ttattttgtt attaatataa cactatcaaa agatttggtc taatcagatc
      241 aagtcttttt ttatttaagc atttgtatta tctggtaaac agttaaaact actaaaacac
      301 caagtacata cttacttgtt ataaaattct
//
LOCUS       RICRGC1       145 bp ds-DNA             PLN       27-JUL-1990
DEFINITION  Rice 17S rRNA gene.
ACCESSION   M35382 M35383
KEYWORDS    17S ribosomal RNA.
SEGMENT     1 of 4
SOURCE      Rice (variety IR20) 48 hr-germinated embryo DNA.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 145)
  AUTHORS   Hariharan,N. and Padayatty,J.D.
  TITLE     Ribosomal RNA gene repeat units in rice embryos
  JOURNAL   Indian J Biochem Biophys 24, 157-164 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1  >   145     17S rRNA
    variant     105      106     cg in IR20; cgcggccg in Mangetsumochi
    variant     112      113     gt in IR20; gcgt in Mangetsumochi
BASE COUNT       30 a     40 c     43 g     32 t
ORIGIN      
        1 gatccattgg agggcaagtc tggtgccagc agccgcggta attccagctc caatagcgta
       61 tatttaagtt gttgcagtta aaaagctcgt agttggacct tgggcgggcc ggtccgcctc
      121 acggcaggca ccgacctgct cgacc
//
LOCUS       RICRGC2       260 bp ds-DNA             PLN       27-JUL-1990
DEFINITION  Rice 5.8S rRNA gene.
ACCESSION   M35384
KEYWORDS    5.8S ribosomal RNA.
SEGMENT     2 of 4
SOURCE      Rice (variety IR20) 48 hr-germinated embryo DNA.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 260)
  AUTHORS   Hariharan,N. and Padayatty,J.D.
  TITLE     Ribosomal RNA gene repeat units in rice embryos
  JOURNAL   Indian J Biochem Biophys 24, 157-164 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA         85      247     5.8S rRNA
BASE COUNT       61 a     85 c     76 g     38 t
ORIGIN      About 1.0 kbp after segment 1.
        1 cagaacccac ggcgccgacg gcgtcaagga acacagcgag gccctggccg tccggcggcg
       61 cgcgcgatac cacgagttaa atccacacga ctctcggcaa cggatatctc ggctctcgca
      121 tcgatgaaga acgtagcgaa atgcgatacc tggtgtgaat tgcagaatcc cgtgaaccat
      181 cgagtctttg aacgcaagtt gcgcccgagg ccatccggcc gagggcacgc ctgcctgggc
      241 gtcacgccaa aagacgctcc
//
LOCUS       RICRGC3        91 bp ds-DNA             PLN       27-JUL-1990
DEFINITION  Rice 25S rRNA gene.
ACCESSION   M35385
KEYWORDS    25S ribosomal RNA.
SEGMENT     3 of 4
SOURCE      Rice (variety IR20) 48 hr-germinated embryo DNA.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 91)
  AUTHORS   Hariharan,N. and Padayatty,J.D.
  TITLE     Ribosomal RNA gene repeat units in rice embryos
  JOURNAL   Indian J Biochem Biophys 24, 157-164 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >    91     25S rRNA
BASE COUNT       30 a     12 c     29 g     20 t
ORIGIN      About 1.6 kbp after segment 2.
        1 gatcttggtg gtagtagcaa atattcaaat gagaacttga agggcgaaga ggagaaaggt
       61 tccatgtgaa cggcacttgc acatgggtaa g
//
LOCUS       RICRGC4       161 bp ds-DNA             PLN       27-JUL-1990
DEFINITION  Rice 25S rRNA gene.
ACCESSION   M35386
KEYWORDS    25S ribosomal RNA.
SEGMENT     4 of 4
SOURCE      Rice (variety IR20) 48 hr-germinated embryo DNA.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 161)
  AUTHORS   Hariharan,N. and Padayatty,J.D.
  TITLE     Ribosomal RNA gene repeat units in rice embryos
  JOURNAL   Indian J Biochem Biophys 24, 157-164 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1      161     25S rRNA
    variant      24       25     cc in IR20; ccgc in Mangetsumochi
    variant      63       68     gccctt in IR20; tccacg in Mangetsumochi
    variant     128      128     c in IR20; g in Mangetsumochi
BASE COUNT       36 a     52 c     48 g     25 t
ORIGIN      About 250 bp after segment 3.
        1 cgcacgtcgc gcggtgtccg gtgcccccgg cggcccttga aaatccggag gaccgagtac
       61 cggcccttcc cggtcgtact cataaccgca tcaggtctcc aaggtgaacg acctctggcc
      121 aatggaacaa tgtaggcaag ggaagtcggc aaaacggatc c
//
LOCUS       STRRRA       5100 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  Plasmid pAM-beta-1 (from S.faecalis) replication region DNA.
ACCESSION   X17092
KEYWORDS    .
SOURCE      plasmid pAM-beta-1 (from Streptococcus faecalis) DNA.
  ORGANISM  Plasmid pAM-beta-1
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 5100)
  AUTHORS   Swinfield,T.-J., Oultram,J.D., Thompson,D.E., Brehm,J.K. and
            Minton,N.P.
  TITLE     Physical characterization of the replication region of the
            Streptococcus faecalis plasmid pAM-beta-1
  JOURNAL   Gene 87, 79-90 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      476     ORFA (AA at 3)
    pept        505      873     ORFB
    pept        933     1799     ORFC
    pept       2121     2405     ORFF
    pept       2772     3071     ORFD
    pept       3114     4604     ORFE
    pept       4954     5091     ORFG
    binding     488      496     ORFB ribosomal binding site (put.)
    binding     919      927     ORFC ribosomal binding site (put.)
    binding    2759     2765     ORFD ribosomal binding site (put.)
    binding    3100     3106     ORFE ribosomal binding site (put.)
    binding    4942     4948     ORFG ribosomal binding site (put.)
BASE COUNT     1953 a    806 c    944 g   1397 t
ORIGIN      
        1 gaattcattt gttcatatct attttgactc tttttattgt attattgata ggaataagtg
       61 ttattcaaca agtacaactt caatcagtca aaaaagagag cgaattattg gaagaacaaa
      121 ttgaacgtgt caaagaaacg gatatttctc aatcgaaaat agatactttt ggaagatact
      181 ttttgactta ctatttttca caggaaaaaa atcaagaaaa ttatcagtca agtttaagaa
      241 catatgtatc cgaaaaggta gatatatccg attggaaagc gttagggaaa acgcttaaga
      301 gtgtcaatta ctatggtagt gaacaaacaa aaaaagggta tagcgttgaa taccttctaa
      361 atgtttcagt agacaatcgt agtaaaatgc aaaaaatcac ttttgaggtt gaaccaacta
      421 aaaacggctt tcttgtgaca actcaaccaa aacttacaga cttctctttt aactgaaata
      481 cgctaaaaag agagtgctta aacaatggga aaaattaatt taaatcaaat ttacacagca
      541 aaagaaatga gtgaaaggat aggtaagaat agaaattacc tatcccaagc ttatcgtaac
      601 aacaaacatg aaatacttaa aaattttaat tatcgaaaaa ttggtggaac aattattttc
      661 tctgataatc ccaataatga tttatcgcaa ttgattacag caaaagaagc aagccagttg
      721 ttaggaaaga atgatgaata ctttgcccac atttataaac ggtttcctca tagactagag
      781 ggaatcgatc acatttatac aggtaaaaca ttattcttaa cgaaagaatc gctagaagtt
      841 tttaaaaaaa agatgaacaa aaatgtcaga taaagcatat ttatgatata ctatttatgt
      901 acacaaatat taatttttaa ggagagaaga aaatgaaaaa aattgccctt ttaacacttt
      961 ttagtacagc gctattggta tcagccccac ttgttagttt tgcggacgaa gcaacaggta
     1021 gttcaggtgt caatgttcta gctgatgatg tagttgatcc agtagaacca accgatccaa
     1081 ctactccggt tgatccagtt gatccagttg atccagttga tccagttgat ccagttgatc
     1141 cagttgatcc agtcgatcca gttgatccag tcgatccaac agagccaacg gaaccaacag
     1201 agccaacgga accaacagag ccaacggaac caacagagcc aacagagcca acggaaccaa
     1261 cggaaccaac ggaaccaacg gaaccaacag agccaacgga accaacggaa ccaacagagc
     1321 caacggaacc aacagagcca acggaaccaa cggaaccaac agagccaaca gagccaacgg
     1381 aaccaacgga accaacagag ccaacggaac caacagagcc aacagagcca acagagccaa
     1441 cggaaccaac ggagccaacg gaaccaacgg aaccaagcaa gccgacagag ccaacagaac
     1501 caagtaaacc agtagacaca aacccaatcg aaaacccagt taatacagat acaggcgttg
     1561 tgattgttgc agttgaagat agtaaaccaa tcattcaatt agcagacgga acaacaaaaa
     1621 aagttgaagc taaagaaatt ggcgcagacg ttcaaaaaga cggaacagta acggttaaag
     1681 gttcagacgg aaaaatgaaa gttttaccaa aaacaggaga aacagaaaat attgcattgt
     1741 ctgtcctagg ttcattaatg gtattaggat cagcattcat ttttaaaaag agaatctaat
     1801 taagtatttt taaaagcatg gtcgcaagtt tcactagcag ccatgctttt tttgaatctt
     1861 tgtaggacgt tttaaagaga attacccact ttaaaattaa acatgcgctt agaatcgctt
     1921 taggaaacac gatccagtcc aataatcgtc gataaaaact tttgaaaaag gttggtgaaa
     1981 ttacctactt ttggaataat cacaaatcac aagtgattaa tcacaaatca caagtgatta
     2041 atcacttgtt tattaagata ttaaaagcta taatttaaat aaagcgtgaa ttttattaca
     2101 caaaaagagg ggggagaaac ttggaactag catttagaga aagcttaaaa aagatgagag
     2161 gtaccaaatc aaaagaaaaa ttctcccaag aattagaaat gagtagatca aattattcac
     2221 gaatagaatc aggaaaatca gatccaacca taaaaacact agaacaaatt gcaaagttaa
     2281 ctaactcaac gctagtagtg gatttaatcc caaatgagcc aacagaacca gaaccagaaa
     2341 cagaatcaga acaagtaaca ttggatttag aaatggaaga agaaaaaagc aatgacttcg
     2401 tgtgaataat gcacgaaatc gttgcttatt tttttttaaa agcggtatac tagatataac
     2461 gaaacaacga actgaataga aacgaaaaaa gagccatgac acatttataa aatgtttgac
     2521 gacattttat aaatgcatag cccgataaga ttgccaaacc aacgcttatc agttagtcag
     2581 atgaactctt ccctcgtaag aagttattta attaactttg tttgaagacg gtatataacc
     2641 gtactatcat tatataggga aatcagagag ttttcaagta tctaagctac tgaatttaag
     2701 aattgttaag caatcaatcg gaaatcgttt gattgctttt tttgtattca tttatagaag
     2761 gtggagtttg tatgaatcat gatgaatgta aaacttatat aaaaaatagt ttattggaga
     2821 taagaaaatt agcaaatatc tatacactag aaacgtttaa gaaagagtta gaaaagagaa
     2881 atatctactt agaaacaaaa tcagataagt atttttcttc ggagggggaa gattatatat
     2941 ataagttaat agaaaataac aaaataattt attcgattag tggaaaaaaa ttgacttata
     3001 aaggaaaaaa atctttttca aaacatgcaa tattgaaaca gttgaatgaa aaagcaaacc
     3061 aagttaatta aacaacctat tttataggat ttataggaaa ggagaacagc tgaatgaata
     3121 tcccttttgt tgtagaaact gtgcttcatg acggcttgtt aaagtacaaa tttaaaaata
     3181 gtaaaattcg ctcaatcact accaagccag gtaaaagcaa aggggctatt tttgcgtatc
     3241 gctcaaaatc aagcatgatt ggcggtcgtg gtgttgttct gacttccgag gaagcgattc
     3301 aagaaaatca agatacattt acacattgga cacccaacgt ttatcgttat ggaacgtatg
     3361 cagacgaaaa ccgttcatac acgaaaggac attctgaaaa caatttaaga caaatcaata
     3421 ccttctttat tgattttgat attcacacgg caaaagaaac tatttcagca agcgatattt
     3481 taacaaccgc tattgattta ggttttatgc ctactatgat tatcaaatct gataaaggtt
     3541 atcaagcata ttttgtttta gaaacgccag tctatgtgac ttcaaaatca gaatttaaat
     3601 ctgtcaaagc agccaaaata atttcgcaaa atatccgaga atattttgga aagtctttgc
     3661 cagttgatct aacgtgtaat cattttggta ttgctcgcat accaagaacg gacaatgtag
     3721 aattttttga tcctaattac cgttattctt tcaaagaatg gcaagattgg tctttcaaac
     3781 aaacagataa taagggcttt actcgttcaa gtctaacggt tttaagcggt acagaaggca
     3841 aaaaacaagt agatgaaccc tggtttaatc tcttattgca cgaaacgaaa ttttcaggag
     3901 aaaagggttt aatagggcgt aataacgtca tgtttaccct ctctttagcc tactttagtt
     3961 caggctattc aatcgaaacg tgcgaatata atatgtttga gtttaataat cgattagatc
     4021 aacccttaga agaaaaagaa gtaatcaaaa ttgttagaag tgcctattca gaaaactatc
     4081 aaggggctaa tagggaatac attaccattc tttgcaaagc ttgggtatca agtgatttaa
     4141 ccagtaaaga tttatttgtc cgtcaagggt ggtttaaatt caagaaaaaa agaagcgaac
     4201 gtcaacgtgt tcatttgtca gaatggaaag aagatttaat ggcttatatt agcgaaaaaa
     4261 gcgatgtata caagccttat ttagtgacga ccaaaaaaga gattagagaa gtgctaggca
     4321 ttcctgaacg gacattagat aaattgctga aggtactgaa ggcgaatcag gaaattttct
     4381 ttaagattaa accaggaaga aatggtggca ttcaacttgc tagtgttaaa tcattgttgc
     4441 tatcgatcat taaagtaaaa aaagaagaaa aagaaagcta tataaaggcg ctgacaaatt
     4501 cttttgactt agagcataca ttcattcaag agactttaaa caagctagca gaacgcccta
     4561 aaacggacac acaactcgat ttgtttagct atgatacagg ctgaaaataa aacccgcact
     4621 atgccattac atttatatct atgatacgtg tttgtttttt ctttgctgtt tagcgaatga
     4681 ttagcagaaa tatacagagt aagattttaa ttaattatta gggggagaag gagagagtag
     4741 cccgaaaact tttagttggc ttggactgaa cgaagtgagg gaaaggctac taaaacgtcg
     4801 aggggcagtg agagcgaagc gaacacttga ttttttaatt ttctatcttt tataggtcat
     4861 tagagtatac ttatttgtcc tataaactat ttagcagcat aatagattta ttgaataggt
     4921 catttaagtt gagcatatta gaggaggaaa atcttggaga aatatttgaa gaacccgatt
     4981 acatggattg gattagttct tgtggttacg tggtttttaa ctaaaagtag tgaatttttg
     5041 atttttggtg tgtgtgtctt gttgttagta tttgctagtc aaagtgatta aatagaattc
//
LOCUS       SUSISDNA      241 bp ds-DNA             INV       27-JUL-1990
DEFINITION  Sea urchin integration site DNA.
ACCESSION   M34912
KEYWORDS    .
SOURCE      Sea urchin DNA, clone 1.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; 
            Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE   1  (bases 1 to 241)
  AUTHORS   Flytzanis,C.N., McMahon,A.P., Hough-Evans,B.R., Katula,K.S.,
            Britten,R.J. and Davidson,E.H.
  TITLE     Persistence and integration of cloned DNA in postembryonic sea
            urchins
  JOURNAL   Dev. Biol. 108, 431-442 (1985)
  STANDARD  simple staff_entry
BASE COUNT       70 a     42 c     33 g     96 t
ORIGIN      
        1 gaaaagtaac gatattcgtt tttataaaag gtcattttca cgtagagatg aatttacctg
       61 gtactaataa ctgatgcaga gtcgttttca aacagtgtca aatgcaaacc tttttgtctg
      121 actggctctt ttttaaattt gtattaatgc cataacactc actttaattt gcgacatggc
      181 tctttccatg tcactttttt attcttcaca tgataaatat tctaaaacta tctgttcttt
      241 t
//
LOCUS       MSQD7AA      3819 bp ds-DNA             INV       27-JUL-1990
DEFINITION  A.aegypti D7 gene, exons 1-5.
ACCESSION   M33156
KEYWORDS    .
SOURCE      A.aigypti (strain Rockefeller) female salivary gland DNA, clone
            AEGD7.
  ORGANISM  Aedes aegypti
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae.
REFERENCE   1  (bases 1 to 3819)
  AUTHORS   James,A.A., Blackmer,K., Marinotti,O., Ghosn,C. and Racioppi,J.
  TITLE     Molecular analysis of a gene expressed in the female salivary
            glands of the vector mosquito, Aedes aegypti
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.A.James, 22-MAR-1990.
            
              Author address:A.A.James:
              U of C, Irvine, Dept Biochem
              Irvine, CA 92717
FEATURES       from  to/span     description
    pept       2377     2421     D7 protein, exon 1
               2481     2696     D7 protein, exon 2
               2757     2975     D7 protein, exon 3
               3038     3232     D7 protein, exon 4
               3288     3578     D7 protein, exon 5
    pre-msg    2350  >  3819     D7 protein mRNA and introns
    IVS        2422     2480     D7 protein intron A
    IVS        2697     2756     D7 protein intron B
    IVS        2976     3037     D7 protein intron C
    IVS        3233     3287     D7 protein intron D
    signal     2321     2326     TATA box
    signal     3634     3639     poly-A signal
BASE COUNT     1202 a    721 c    744 g   1152 t
ORIGIN      
        1 gaattcaagg aacctttaaa tgaatctctt gaaatatccc tggaaacatt actcgtgaag
       61 tcctggagaa ttttcagtta taaatctggg aagaatcctt gcatgaatcc ctggggtaat
      121 ttatgaagaa atcctttgag gtattcttgc aagaatctct tgacgttgaa tgcatttggt
      181 ttggcttgaa atattttaag gttttggtga aattagaata cttggagaaa tctgtgataa
      241 tattttgaat atattcttga aataatttta gaaagagtta ccgcattaat tggaactgaa
      301 atcgctggcg gaatttttgt gaaaatcttc aagaaatttc aaaatgtctc tgaaaaaacc
      361 ctgaaggaaa ttactaaagg ataatttgac agattttttt tattattgac tctttattat
      421 ggagaaattc agcccttagc tagttcatct ccttgacaga attactgaaa aacgactctg
      481 ttttttgtaa tctactaatt cagagtccaa cgctcgggtt ttgaatttcc catatttcct
      541 acacaaatcc atgaaagaat aatagtcgag aactgtcaaa aagttatgaa aaattattga
      601 aaacggaatc gagactagca ctgtaaaaga gtcatctatt gtcatttcag ttacaagggc
      661 caatcaattg acacaaacac aagcttcacc aactcttcca gaaattttaa agcatctact
      721 gatattggct tagataattt atacgtattc ctcctagatt tggcccagtt gttagttaca
      781 tgaaaggcca aatgttaaag gttaaatatc cgcagagtcc attgtcaatc agtgaagtat
      841 tttaagccag catgaaaggg tattatcaac attttttcca aaaccctccg ggttcattta
      901 tgctgagaag gaaaaaccga agtgattttt ttgcgctctc gtgctctgca taataacctt
      961 accgtccttc aattaaactt tccgtcaaag tcttcatagt ccaagcaata aaaaaaaaga
     1021 tgaaaatttc ttcaataagg tatcaccaaa gacacatacc ataccgaggg accaaatgca
     1081 gtactagaag tggtaccaaa tcagagcccg agtgggacgg acctggtatc actcgacata
     1141 ttcactgttt actgatgtcg tacaaacagc tattcgtcgt aattcttcgt ctgccacttc
     1201 aaatgaaaac ctgctaggtg aaacggacga acaatccaaa ctttccttca gcaaatcttg
     1261 tcctacccat cagaaagtgt tatttatgag gactcctgag cggtactgca atgctctcat
     1321 atgagcgctg gacggcactg acgttcacct tccggatcct cgtggagtgc ttgctgtttc
     1381 gacgccatct tcgattgaac tgtcagttat tactctcaac agaagcagta agcttcacct
     1441 taaggctaag taacccgtca ttcgttttgg caacaatgat gacttttcag cttgcatttc
     1501 aaaatgataa aacgcagtct tgatagttta tactgacttg aaaaagtatc actgtacgca
     1561 tttacatgca taaagtatgc tgatactttt tcagctgtgt cagtgcaaaa gcaactgatt
     1621 ttctttgatt cgaaatcatg agatgaatta gcaacaacca tcaacgacgc gtacaaattt
     1681 taatgacggc ctacttcgcc ttaacagcct tgtcgtgatg acgatccttg cagttgcgac
     1741 aggaatactt cgacaaatcc tagctagatt tcccgatctg aagcagtttg ttaaaaaaaa
     1801 atattttcag ttgagaaatg attttcctca cctcagagag ttaccggatc taaccgtatt
     1861 aaaggttgtc aatcaggata tgcgcatcca gtaccgattc cttggataac gatggcaaat
     1921 cgattaaaac ttgtgcctac gcaatttaat agttctagtt gtctctaagc aatctatcag
     1981 gtcttttggt tctcgcgaaa tctgattaat tcatcgacat taccgtcgag agcctggagc
     2041 ttgatttgcg acaaacatat tgatcgattg cttgaataca taagcttttg ctaagcctat
     2101 taatccaata atagatctct tcagagacga atggaagcac accatagtac tttataatat
     2161 tgaacatttg ttagtactgt gcacgtgtgc acacattcat gtacatatgg atgatttgat
     2221 cagctttact caattctttc ttcacactct gctggttgtt tatatttgcg taaatctatt
     2281 atctgttatc agctgactgc taaatcacat tcaacttgca tataaaacgc attaggcctg
     2341 tctaacattc atcccgatca tacttacaat actagaatga agctacttct attactcgca
     2401 atttttacaa ccttttccgt ggtaagatgt tcctagtgca tccaactctt catttcttaa
     2461 caaatactcc accttttaag gttgcctcaa tgggaccatt tgatccggag gagatgctgt
     2521 tcatctttac gcgctgcatg gaagacaatt tggaagatgg agcgaatcga cttccaatgt
     2581 tagcgaaatg gaaagaatgg attaacgaac cggtagacag ccccgcaact cagtgtttcg
     2641 gcaaatgcgt cctggtaaga acaggtctgt acgatccggt agcccaaaag ttcgatgtaa
     2701 gttggtggtt gatatccggg tggaattgga taatttcatc gcttcaattg ttgtaggcgt
     2761 cggtgatcca ggagcaattt aaggcttatc cgtccttggg ggaaaagagc aaagttgaag
     2821 catatgctaa cgcagttaaa cagttgcctt ccacaaataa cgactgtgcc gctgttttca
     2881 aagcgtacga tcctgttcat aaggcgcata aggacaccag caagaacttg ttccatggaa
     2941 acaaggagtt gaccaagggc ctctatgaga agttggtaag tgtttccgtg ttattcgatt
     3001 gaaacaattg tgtgtatgaa catattaatt gcttcaggga aaagacattc gccagaagaa
     3061 gcaatcctac ttcgagtttt gcgagaacaa gtactaccca gctgggtcag ataagcgtca
     3121 gcaactttgt cagataaggc aatacactgt cttagatgat gcgctgttca aggagcacac
     3181 tgattgcgtg atgaagggta ttcgctacat tacgaaggat aatcaactgg atgtaagaaa
     3241 ataataaata tcatgcaaac atcatagtaa tcgacatttc ctttcaggtg gaagaggtga
     3301 agcgggactt caagctagtg aataaagata cgaaggcact cgaaaaggtt ttgaatgact
     3361 gtaagtctaa ggaaccaagc aacgcaaaag agaaatcatg gcactactac aaatgtttgg
     3421 tggaatcttc ggttaaggat gatttcaaag aggcttttga ctatcgtgag gtacggtcac
     3481 agatttatgc cttcaatttg cccaagaaac aggcttacag caaaccagca gtgcaatctc
     3541 aagtgatgga aatcgacggt aaacagtgtc cacaatagaa tagtgcattg aaaatgtctt
     3601 ccaattcaat ggtttcatcg aagtagaaag ttaaataaag atataataat catcacgcag
     3661 aggaactttg ttttaactta tattgtttat ttggtaattg ttgtagttat ttttaaatta
     3721 ttttaatttg ggaagccgaa aaaccgttaa taggaaatta aagtagtttt tgaatcattg
     3781 ttattataaa atatcgcata gttctgtact gaagaattc
//
LOCUS       MSQD7AB      1040 bp ss-mRNA            INV       27-JUL-1990
DEFINITION  A.aegypti D7 mRNA, complete cds.
ACCESSION   M33157
KEYWORDS    .
SOURCE      A.aigypti (strain Rockefeller) female salivary gland, cDNA to mRNA,
            clone AEGD7.
  ORGANISM  Aedes aegypti
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae.
REFERENCE   1  (bases 1 to 1040)
  AUTHORS   James,A.A., Blackmer,K., Marinotti,O., Ghosn,C. and Racioppi,J.
  TITLE     Molecular analysis of a gene expressed in the female salivary
            glands of the vector mosquito, Aedes aegypti
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.A.James, 22-MAR-1990.
            
              Author address:A.A.James
              U of C, Irvine, Dept Biochem
              Irvine, CA 92717
FEATURES       from  to/span     description
    pept         13      978     D7 protein
    signal     1016     1021     poly-A signal
BASE COUNT      335 a    208 c    251 g    246 t
ORIGIN      
        1 gaattcccta gaatgaagct gcctctatta ctcgcaatag ttacaacctt ttccgtggtt
       61 gcttcaacgg gaccatttga tccggaggag atgctgttca cctttacgcg ctgcatggaa
      121 gacaatttgg aagatggacc gaatcgactt ccaatgttag cgaaatggaa agaatggatt
      181 aacgaaccgg tagacagccc cgcaactcag tgtttcggca aatgcgtcct ggtaagaaca
      241 ggtctgtacg atccggtagc ccaaaagttc gatgcgtcgg tgatccagga gcaatttaag
      301 gcttatccgt ccttggggga aaagagcaaa gttgaagcat atgctaacgc agttcaacag
      361 ttgccttcca caaataacga ctgtgccgct gttttcaaag cgtacgatcc tgttcataag
      421 gcgcataagg acaccagcaa gaacttgttc catggaaaca aggagttgac caagggcctc
      481 tatgagaagt tgggaaaaga cattcgccag aagaagaaat cctacttcga gttttgcgag
      541 aacaagtact acccagctgg atcagataag cgccagcaac tttgtaagat aaggcaatac
      601 actgtcttag atgatgcgct gttcaaggag cacactgatt gcgtgatgaa gggtattcgc
      661 tacataacga agaataatga actggatgct gaagaggtga aacgggactt caagctagtg
      721 aataaagata cgaaggcact tgaaaaggtt ttgaatgact gtaagtctaa ggaaccaagc
      781 aacgcaaaag agaaatcatg gcactactac aaatgtttgg tggaatcttc ggttaaggat
      841 gatttcaaag aggcttttga ctatcgtgag gtacggtcac agatttatgc cttcaatttg
      901 cccaagaagc aggcttacag caaaccagca gtgcaatctc aagtgatgga aatcgacggt
      961 aaacagtgtc cacaatagaa tagtgcattg aaaatgtctt ccaattcaat ggttaaataa
     1021 agaaataatg atcatcacgc
//
LOCUS       ADGLS        1477 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  Adenovirus type 7 (simian) left end of genome: 0.0-4.2 map units.
ACCESSION   K03325
KEYWORDS    alternate splicing; terminal repeat.
SOURCE      Adenovirus type 7 (simian) DNA.
  ORGANISM  Mastadenovirus s7
            Unclassified.
REFERENCE   1  (bases 1 to 1477)
  AUTHORS   Kimelman,D., Miller,J.S., Porter,D. and Roberts,B.E.
  TITLE     E1a regions of the human adenoviruses and of the highly oncogenic
            simian adenovirus 7 are closely related
  JOURNAL   J. Virol. 53, 399-409 (1985)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by D.Kimelman (15-AUG-1985).
            
            The mRNA transcripts (mRNAs I and II) were determined by mRNA-DNA
            hybridization and S1 nuclease digestion.  The peptide coding
            regions were deduced by analogy to other adenoviruses and following
            the most likely mode of translation.  The intron of the mRNA III
            transcribed from the E1a of other adenoviruses has not yet been
            determined by S1 mapping.
            
            Enhancer sequences are located at positions 165-173 and 265-270.  A
            TATA box is located at position 408-413.  A polyadenylation signal
            is located at posityion 1447-1452.
FEATURES       from  to/span     description
    pept        486      837     E1a 28 K protein from mRNA I, exon 1
               1178     1407     E1a 28 K protein from mRNA I, exon 2
    pept        486     1056     E1a 24 K protein from mRNA II, exon 1
               1178     1407     E1a 24 K protein from mRNA II, exon 2
    pre-msg     439     1477     E1a mRNA
    IVS         838     1177     E1a 24K pept intron A
    IVS        1057     1177     E1a 28K pept intron A
BASE COUNT      331 a    334 c    442 g    370 t
ORIGIN      5' end of the l strand of the genome.
        1 tctatataat ataccttatt tgggaacggt gccaatatgc taatgaggtg ggcggagttt
       61 ggtgacgtat gcggaaatgg gcggagttag gggcggggtt tggcggtagg cgtggctggg
      121 ggagtgtccg ggcgtgggaa cggaagtgac gtagggggcg cgccggaggt gacgtcgtgt
      181 ggggagtttt aaaccggaag caaggtattt taaacgcttg caagcgcaat tttgtcggtt
      241 ttggcgcgaa aactgataaa aagcggaagt tcggttaatc attaattttt acgataggga
      301 ggaatattta ccgagggccg gtgaactttg agcggtgacg cggtggtttc gttacgtggc
      361 accaccacgc gactgctcaa agtccccgtt tattgtctag gtgagggtat ttaaaccggc
      421 tcagaacgtc aagaggccac tcttgagtgc ccgcgagaag agctttctcc tctttcgctg
      481 cgaaaatgag acacttggcg ttggaaatga tttctgaact gctggattta ggactggata
      541 ccattgatgg ctggctgcac accgaatttc ggccggtacc ggcgggggtg agtcataaca
      601 tgtcgctgca cgaaatgtac gacctggacg ttaccggcca ggaggatgag aacgaagagg
      661 cggtagatgg tgttttttcc gatgcgatgc tcctggccgc ggaggaggga atagaaatgc
      721 ctaatcttta ttctccggga cctctggttg ggggaggtga aatgcctgaa cttcagcctg
      781 aggaggaaga tcttttctgc tacgaagatg gcttccctcc cagtgactct gaggaaggtg
      841 agcattcgca ggtggagaca gaacgtaaaa tggcggaggc ggcggcagca ggtgcggcgg
      901 cggccgcgcg gcgggagcaa gatgactttc gcttagactg tcctagcgta cctggccatg
      961 gctgtagctc ctgtgactac catcgcaaaa ctagcggctg tcctgaaatt ctgtgctcgc
     1021 tgtgctatct gagggctaac agcatgttta tttatagtaa gtaaattttt tctactaact
     1081 ttctcgttgt gtgtttgctc gctcgctcgc tcgcaactgc tggggtgctt gctgttggga
     1141 ctgagcttac aggtattttc tctgtaattt tccataggtc cagtttctga ctctgagcca
     1201 gacgagcccg actccacaac agctgattca aatcatggca gcccgccaac ccttcgctgc
     1261 accccaccca gggacttgcc gcgacctgtg ccagtgaagg cctctcctgg caagcgccca
     1321 gcggtgaaca gcttgcatga cctcatagag gaggttgaac aaacagtacc tttggacctg
     1381 tccctaaagc gctctaggag caattagggt tataaaaccc ctccccttcc ccttaagtta
     1441 tcaggaaata aaaagattaa ctggattctt tgtgcct
//
LOCUS       ADGRS         116 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  adenovirus type 7 (simian) right end of genome.
ACCESSION   V00035 J01991
KEYWORDS    terminal repeat.
SOURCE      adenovirus type 7 (simian) dna.
  ORGANISM  Mastadenovirus s7
            Unclassified.
REFERENCE   1  (bases 1 to 116)
  AUTHORS   Broker,T.R.
  TITLE     terminal nucleotide sequences and restriction endonuclease cleavage
            maps of simian and murine adenoviruses
  JOURNAL   (in) Tooze,J. (Ed.);
            DNA Tumor Viruses, Second Edition Revised: 1037-1046;
            Cold Spring Harbor Laboratory, Cold Spring Harbor (1981).
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 116)
  AUTHORS   Tolun,A., Alestroem,P. and Pettersson,U.
  TITLE     sequence of inverted terminal repetitions from different
            adenoviruses: demonstration of conserved sequences and homology
            between sa7 termini and sv40 dna
  JOURNAL   Cell 17, 705-713 (1979)
  STANDARD  simple staff_review
COMMENT     L-strand is given, from 5' to 3'.
BASE COUNT       25 a     15 c     46 g     30 t
ORIGIN      
        1 atcaataata taccttattt gggaacggtg ccaatatgct aatgaggtgg gcggagtttg
       61 gtgacgtatg cggaaatggg cggagttagg ggcggggttt ggcggtaggc gtggct
//
LOCUS       ADGSA7PE1    2338 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  Simian adenovirus SA7P, leftmost 2338 bp.
ACCESSION   X01027
KEYWORDS    E1A protein; E1b protein; inverted terminal repeat;
            overlapping genes; transforming gene.
SOURCE      Simian adenovirus SA 7P.
  ORGANISM  Mastadenovirus s7
            Unclassified.
REFERENCE   1  (bases 1 to 2338; enum. 1 to 2338)
  AUTHORS   Dekker,B.M.M., Konings,D.A.M., Denisova,T.S., Gibadulin,R.A. and
            Van Ormondt,H.
  TITLE     The nucleotide sequence of the leftmost XhoI fragment (6%) of
            simian adenovirus SA7P
  JOURNAL   J. Gen. Virol. 65, 1699-1708 (1984)
  STANDARD  simple automatic
COMMENT     **map: leftmost XhoI fragment (6%);
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE          1    183       inverted terminal repeat (ITR)
                                            of SA7P
               PRM         409    416       TATA E1A region
               CAP         440    440       cap site E1A RNAs
               MSG         440    838       "12S" E1A RNA
               MSG         440    838       1st exon "12S" E1A RNA
               IVS         839   1175       intron "12S" E1A RNA
               MSG        1176  >1445       2nd exon "12S" E1A RNA
            
               MSG         440   1057       1st exon "13S" E1A RNA
               IVS        1058   1175       intron "13S" E1A RNA
               MSG        1176  >1445       2nd exon "13S" E1A RNA
            
               SITE       1445   1450       AATAAA E1A
               PRM        1508   1514       TATA E1B
               CAP        1538   1538       cap E1B RNA's
            
            This virus, an independent isolate from Moscow, USSR, is highly
            related to simian adenovirus 7 (SA7).  The E1A sequences are 95%
            homologous to those of SA7 see $<K03322>$ J. VVrol. 5:399-409(1985)
FEATURES       from  to/span     description
    pept        487      838     E1A 21K protein (part 1)
               1176  /  1401     E1A 21K protein (part 2)
    pept        487     1057     E1A 29K protein (part 1)
               1176  /  1401     E1A 29K protein (part 2)
    pept       1569     2111     pot. 21K E1B protein
    pept       1874  >  2338     pot. 55K E1B protein
BASE COUNT      514 a    535 c    725 g    564 t
ORIGIN      
        1 catcatcaat aatatacctt atttgggaac ggtgccaata tgctaatgag gtgggcggag
       61 tttggtgacg tatgcggaag tgggcggagc aaggggcggg gcgagaggcg gggcttttgg
      121 taggcgtggc cgggcgtggg aacggaagtg acgtcggggg cgcgccggac gtgacgtgtt
      181 ttgaggcgtt ttaaaccgga agcaaggtat tttaaacgct tgcagcgcaa ttttgccggt
      241 tttggcgcga aaactgataa aaagcggaag ttcggttaat cattaatttt tacgataggg
      301 aggaatattt accgagggcc ggtgaacttt gagcgatgac gcggtggttt cgttacgtgg
      361 caccaccacg cgactgctca aagtccccgt ttattgtcta ggtgagggta tttaaaccgg
      421 ctcagaccgt caagaggcca ctcttgagtg cccgcgagaa gagctttctc ctctttcgct
      481 gcgaaaatga gacacttggc gttggagatg atgtctgaac tgctggattt aggactggat
      541 accatcgata gctggctgca caccgaattc gcgccggtac cgacgggggt gagtcataac
      601 atgtcgctgc acgaaatgta cgacctggac gttaccggcc aggaggatga gaacgaagag
      661 gcggtagacg gtgttttttc cgatgcgatg ctcctggccg ctgaagaggg agtagaaatg
      721 cctagtcttt attctccggg acctctggtt gggggaggtg agatgcctga gctacagcct
      781 gaggaggtag atcttttctg ctacgaagat ggcttccctc ccagtgactc tgaggaaggt
      841 gagcattcgc aggtggagac ggaacgtaaa atggcggagg tggcggcggc aggtgcggcg
      901 gcggccgtcc gcggggagca agatgacttt cgcttagact gtcctagcgt acctggccat
      961 ggctgtagct cctgtgacta ccatcgcaaa aatagcggct gtcctgaaat tctgtgctcg
     1021 ctgtgctatc tgagggctaa cagcatgttt atttatagta agtgaatttt tctactaact
     1081 ttctcgctgt gtgtttgctc gctcgttcgc taactgctgg ggtgcttggt gttgggactg
     1141 agcttacagg tatttttctc tgtaattttc cacaggtcca gtttctgact ctgagccaga
     1201 cgagcctgac tccacaacag ctgattcaaa tcatggcagc ccgccaaccc ttcgctgcac
     1261 cccacccagg gacttgccgc gacctgtgcc agtgaaggcc tctcatggca agcgcccagc
     1321 ggtgaacagc ttgcatgacc tcatagagga ggttgaacaa acagtacctt tggacctgtc
     1381 cctaaagcgc tctaggagca attagggtca taaaacccct ccccttcccc ttaagttata
     1441 aggaaataaa aagattaact ggattctttg tgcctgcttt gtttgttcgt gggcggtcct
     1501 tggggattat aaaaggggtg agtcaggtta taagggtact taggcaacgc tcaggtaaca
     1561 tcgcctccat ggatctccga acggcgcttc agacttttga gagcacccgc cgcttgctgg
     1621 agctctgttc caatagaacc tcttttttgt ggaggtggtt atttggaact ccgctcagcc
     1681 ggctggttag gcaggtgaaa ttagaatacg agaaggattt tgaaagaatt ttagatcagt
     1741 gtcccggggt gtttgagtcc ctggagctgg gctatcataa ggtttttgag gagaagattg
     1801 taaaggagtt ggatttttct tctcccggtc gggcggtcgc ggctgtggcc tttgcttcct
     1861 acctgctgga tagatggaac acccggaccc acctgtcccc ggggtaccag atggattaca
     1921 tcagcctgaa cctgtggaag ttttggttgc gccggcgggt ttacaattac tcgcgggggc
     1981 tgcctcagct agggccggcg gcgccgctgg cgaggcaggg gtcgcagcag gaggagcagc
     2041 agcagcggca ggaggaggag caggtgcagg aggagatgag gtccggcctg gaccctccaa
     2101 cggagaactg atggcgcagg tggccggcgc ggcggatgcc agcgggccgc cttccaagcg
     2161 ccctaagaat gaggcagagc agatgcagga gggtttaact cagttgaccc tgagtctcat
     2221 taacaagtat cggcctgaga ccgtgtactg gaatgaactg gagcaggaat ttcagcatgg
     2281 ggagatgcac ttgcagttcc actttagctt tgagcaatta aaaactcact ggctcgag
//
LOCUS       ADGVARNA      850 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  Simian adenovirus type 7 (SA7) virus-associated (VA) RNA gene.
ACCESSION   M14082
KEYWORDS    virus-associated RNA.
SOURCE      Simian adenovirus type 7 DNA.
  ORGANISM  Mastadenovirus s7
            Unclassified.
REFERENCE   1  (bases 1 to 850)
  AUTHORS   Larsson,S., Svensson,C. and Akusjaervi,G.
  TITLE     Characterization of a low-molecular-weight virus-associated (VA)
            RNA encoded by simian adenovirus type 7 which functionally can
            substitute for adenovirus type 5 VA RNA-1
  JOURNAL   J. Virol. 60, 635-644 (1986)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by S.Larsson, 15-JAN-1987.
            
            Simian adenovirus type 7 VA RNA stimulates protein synthesis in a
            transient expression assay and can functionally substitute for VA
            RNA-I during lytic growth of human Ad5.  The SA7 genome encodes
            only one VA RNA species.
FEATURES       from  to/span     description
    RNA         181      352     virus-associated RNA (5' and 3' ends +/-2bp)
    ORF         105  <     1 (c) ORF A cds
    ORF         366  >   850     ORF B cds
BASE COUNT      140 a    260 c    338 g    112 t
ORIGIN      150 bp upstream of XbaI site; 30 mu on viral map.
        1 gtagatgtag cgcgacatcc aggtgaggcc ggcggcggtg gtggcggcgc gggtgaactc
       61 gcggacgcgg ttccagaggt tgcgcagggg ggcgaagcgt tgcatggtgg tcacgctctg
      121 gccggtgaga cgggcgcagt cctgcacgct ctagacggaa acagagagag gcgtcaccga
      181 ctcctctccg tagcttgggg gttaggtcgc aagggtgcgg tggcggggaa ccccggttcg
      241 aaaccggccg gatccgccac tcccgacgcg gcggccccgc gtccacgacc ccaccagagg
      301 ccgagaccca gccgcgacgc gcgcacccca aatacggagg ggagtctttt ggtgcttttt
      361 tgtagatgca tcccgtgctg cggcagatgc gccccacgcg gggggcctcc tcgcgggaag
      421 aagagctgga ggcgcaggag gaggcgcgcg gtggcgccgc gggccccgcc gagctggagc
      481 tggaggaggg cgaaggcttg gcgcggctgg gcgccggagt gccggagcgc cacccgcggg
      541 tgcaactggc tcgcgaccag cgccaggcct acgtgccgcc gcagaacctg tttagggacc
      601 ggagcgggga ggaggcggag gaaatgcgtg actgccgttt tcgggccggg cgggagctgc
      661 gggcgggctt ggagcgcgag cggctgctgc ggcccgagga ctttgagccc gaggagcgca
      721 acggtatcag tccggcgcgc gcgcacgtct cggccgccaa cctggtgacg gcctacgagc
      781 agacggtgaa cgaggagcgc aactttcaaa agagcttcaa caaccacgtg cgcaccctcg
      841 tggccccggg
//
LOCUS       ADZITRL       150 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  adenovirus 31 inverted terminal repeat & repl ori left end.
ACCESSION   J01958
KEYWORDS    origin of replication; terminal repeat.
SOURCE      ad31 (adenovirus 31).
  ORGANISM  Mastadenovirus 31
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 150)
  AUTHORS   Stillman,B.W., Topp,W.C. and Engler,J.A.
  TITLE     conserved sequences at the origin of adenovirus dna replication
  JOURNAL   J. Virol. 44, 530-537 (1982)
  STANDARD  simple staff_review
BASE COUNT       38 a     29 c     48 g     35 t
ORIGIN      
        1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag
       61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt
      121 gtgggagccg ggcgcgccgg atgtgacgcg
//
LOCUS       ADZITRR       150 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  adenovirus 31 inverted terminal repeat & repl ori right end.
ACCESSION   J01959
KEYWORDS    origin of replication; terminal repeat.
SOURCE      ad31 (adenovirus 31).
  ORGANISM  Mastadenovirus 31
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 150)
  AUTHORS   Stillman,B.W., Topp,W.C. and Engler,J.A.
  TITLE     conserved sequences at the origin of adenovirus dna replication
  JOURNAL   J. Virol. 44, 530-537 (1982)
  STANDARD  simple staff_review
BASE COUNT       40 a     28 c     47 g     35 t
ORIGIN      
        1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag
       61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt
      121 gtgggagccg ggcgcgccgg atgtgacgaa
//
LOCUS       HSKGAORA     3726 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  Herpesvirus of turkey glycoprotein A (gA) gene and ORF of unknown
            function.
ACCESSION   M33303
KEYWORDS    glycoprotein A.
SOURCE      Herpesvirus of turkey DNA.
  ORGANISM  Gallid herpesvirus type 2
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 3726)
  AUTHORS   Kato,A., Sato,I., Ihara,T., Ueda,S., Ishihama,A. and Hirai,K.
  TITLE     Homologies between herpesvirus of turkey and Marek's disease virus
            type-1 DNAs within two co-linearly arranged open reading frames,
            one encoding glycoprotein A
  JOURNAL   Gene 84, 399-405 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        207     1676     glycoprotein A
    pept       2118     2756     ORF2
BASE COUNT     1016 a    884 c    815 g   1011 t
ORIGIN      
        1 tcgtgcgcat cgcccatggc tggcggaacg ctctttcgaa ccgtgaataa aactttgtat
       61 ctactaaaca ataactttgt gttttattga gcggtcgaaa acaatgagga gctgcaattt
      121 aaagctaacc gcatacgccg ggcgggtaaa gaccatttta taccatatta cgcatctatc
      181 gaaacttgtt cgagaaccgc aagtatatgg tttccaacat gcgcgttcta cgcgtactgc
      241 gcctgacggg atgggtgggc atatttctag ttctgtcttt acagcaaacc tcttgtgccg
      301 gattgcccca taacgtcgat acccatcata tcctaacttt caacccttct cccatttcgg
      361 ccgatggcgt tcctttgtca gaggtgccca attcgcctac gaccgaatta tctacaactg
      421 tcgccaccaa gacagctgta ccgacgactg aaagcactag ttcctccgaa gcgcaccgca
      481 actcttctca caaaatacct gatataatct gcgaccgaga agaagtattc gtattcctta
      541 acaatacagg aagaattttg tgtgacctta tagtcgaccc cccttcagac gatgaatggt
      601 ccaacttcgc tcttgacgtc acgttcaatc caatcgaata ccacgccaac gaaaagaatg
      661 tagaggttgc ccgagtggcc ggtctatacg gagtaccggg gtcggattat gcatacccta
      721 ggaaatcgga attaatatcc tccattcgac gggatcccca gggttctttc tggactagtc
      781 ctacaccccg tggaaataaa tatttcatat ggattaataa aacaatgcac accatgggcg
      841 tggaagttag aaatgtcgac tacaaagaca acggctactt tcaagtgata ctgcgtgata
      901 gatttaatcg cccattggta gaaaaacata tttacatgcg tgtgtgccaa cgacccgcat
      961 ccgtggatgt attggcccct ccagttctca gcggagaaaa ctacaaagca tcttgcatcg
     1021 ttagacattt ttatcccccg ggatctgtct acgtatcttg gagacgtaac ggaaacattg
     1081 ccacaccccg caaggaccgt gacgggagtt tttggtggtt cgaatctggc cgcggggcca
     1141 cactagtatc cacaataacc ctcggaaact ctggactcga atctcctcca aaggtttcct
     1201 gcttggtagc gtggaggcaa ggcgatatga taagcacatc gaatgctaca gctgtaccga
     1261 cggtatatta tcacccccgt atctctctgg catttaaaga tgggtatgca atatgtacta
     1321 tagaatgtgt tccctctggg attactgtga ggtggttagt tcatgatgaa ccccagccta
     1381 acacaactta tgatactgtg gttacaggtc tctgcaggac catcgatcgt tatagaaatc
     1441 tcgccagtcg gattccagtc caggacaact gggcgaaaac gaagtatacg tgcagactaa
     1501 ttggatatcc gttcgacgtg gatagatttc aaaattccga atattatgat gcaacgccgt
     1561 cggcaagagg aatgccgatg attgtaacaa ttacggccgt tctaggactg gccttgtttt
     1621 taggtattgg tatcattatc acagccctat gcttttacct accggggcgg aattaagatt
     1681 aaccatcgta tgtgatataa aaattattaa gtgttataac cgatcgcatt cttctgtttc
     1741 gattcacaat aaataaaatg gtattgtaat cagcaccatc gcattgtttc gtagatgact
     1801 catgttcagt ccgcgtgatg tcaaaaatac gtatttttgg tatcacgcag cggccaaaat
     1861 gcccattatg ttatttttac tccaaacgcg gtatttaaaa catcgggacg tacatcatgt
     1921 ggcgcacgtt aatcgtatac ggtgccgcta cattaaaaat cgcaagtctc cgaatatcaa
     1981 gctcacggcc aaaacgtcgg taataatctt acgcatcgaa tgtgatacgg ataccgtaca
     2041 atcgctgagt agatttccta tatagttact cagtagtgat acacaatcac aaaatcgctg
     2101 gggtatatca tataagaatg atgtcgccca cccctgaaga tgatcgcgat ctcgttgtgg
     2161 ttcgtggacg tctccgaatg atggatagcg gcacggaaac agatagagag caacgacatc
     2221 cacgtacgac ttggcgatcg atctgttgtg ggtgtacgat aggaatggta tttaccatat
     2281 tcgttctcgt agcggcagta ttgttgggat cactattcac tgtttcatac atggccatgg
     2341 aatcgggaac atgtcccgat gaatggattg gtttgggtta tagttgcatg cgcgtggccg
     2401 ggaaaaatgc aactgatctt gaggcgttgg atacatgtgc tcggcataac agcaaactta
     2461 ttgacttcgc aaacgccaaa gttctggttg aagctatcgc cccattcggt gtgccaaatg
     2521 cagcatatgg ggaagtcttc cggttaaggg acagcaaaac cacgtgtata cgacctacca
     2581 tgggaggacc cgtgtcggca gactgtcctg taacatgtac cgttatatgt cagcgaccca
     2641 ggcctctaag taccatgtct tccatcatta gagatgcccg cgtgtatctt catttagaac
     2701 gacgcgatta ttatgaagtc tacgcctctg tcctctctaa tgcgatgagt aaataaaaac
     2761 gcacctctaa cggttactgt gttttattta tccaatcaca ccatagacat tataaacata
     2821 atatggatct ttatttcata taatgtatta tacagaccgc gcatcgcgtg taaaatatta
     2881 ttcgctggac actttcagcc tacgtaaaat atgttctacc ttggacactg gtatatctcc
     2941 acgcattgtt ttttcatgaa agtatttggt tgtcggagta gacaccctgc cagattccga
     3001 aactggaggg tcattgctcg ccaaagaagg ccccccagac ccatcggtct tacgaattgc
     3061 accagtcaga ccgtttccga tgttcgcaca ttgagaactc gaccccgccc ccgcaaattc
     3121 agtcaaacca ggaaccgtat cgccacagcc acattcgaca cgatcgccta attgatgtgt
     3181 gggcgtactt cccgacttgt tcgagttttt atcttgcttc cgaatagatt tcataagccg
     3241 tgcattaaca tcgcttattc gctttggcag taattttatc cccagtagag ccagggcgtt
     3301 cttagacagt ttaacatcat cggttgcagc gcggccgaac caagtattta tgctgcactc
     3361 gagagccccc catgacacgg gcaacatgct cggaaacaaa ttcccaagct gatgttcaaa
     3421 ttggtcttgc tgcaacaggg cctcttttaa tatagtattt tccgtattac cgtccaacca
     3481 tatcacatat cccctctgag ttaaattaat taggtactga caatgatggt tgagaagtga
     3541 aaccaacgaa accactgcta gtatggcacg tgagttccga tcaccatcaa agtccaccag
     3601 gcgggtttgc gaccataata cgctggtttg tagtaagggg gcgatgacat tgcctataat
     3661 tcgacaaagc aatgatgtcg aactccattt gtctgatggt aaggaggaaa tttcttccat
     3721 gaattc
//
LOCUS       HSKTK        2039 bp ds-DNA             VRL       27-JUL-1990
DEFINITION  Herpesvirus of turkey thymidine kinase gene, complete cds.
ACCESSION   M26659
KEYWORDS    thymidine kinase.
SOURCE      Herpesvirus of turkey (strain FC126) DNA, passed in chicken embryo
            fibroblasts.
  ORGANISM  Gallid herpesvirus type 2
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 2039)
  AUTHORS   Martin,S.L., Aparisio,D.I. and Bandyopadhyay,P.K.
  TITLE     Genetic and biochemical characterization of the thymidine kinase
            gene from herpesvirus of turkeys
  JOURNAL   J. Virol. 63, 2847-2852 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.L.Martin, 03-AUG-1989.
FEATURES       from  to/span     description
    pept        608     1540     thymidine kinase (EC 2.7.1.21)
BASE COUNT      570 a    458 c    437 g    574 t
ORIGIN      649 bp upstream of XbaI site.
        1 atcgatgaaa gtctgtacat atagccttct tttttaatgg tgtcgaacgt ataactacaa
       61 gcccgcgtct cgcgacaaac actaaaacgg ggcaaatgat agtatgatca gatcccctgg
      121 gggccatgcg ctcgatcaac gttttagact ccagcaactg cttagtccct gttgcgcgtt
      181 gctcattctt gctggccgtc ttgaggttgc ggtaaaatct gcaagttttc agttcgatta
      241 tgacacagac attgttacag ccttccacca ataacccgcc agataaggtg aacatgcaaa
      301 tgcaatctgg ccttctgcgc cccaggtcta cttcaaaggc caatcgaaac gccgacgccc
      361 ctttaagtag ttcagaaggc aggatcttgc ttaatatctt ctgtaatgtt ctaccaactg
      421 tattccccgc cgacgactgg tcaaagtttt cagcagcaag ctttttgtaa aaacgattat
      481 gaccacggac acccgctttt agcaatcctg ccataaggtg gtttcgccgc gtgcttgcct
      541 cgaagacaat tgccagctaa tccagcatta ccatatttcc ttggcttgca tttggatctg
      601 cgcgtcgatg gcattgccga gaagaccgcc cacgttaacg cgagtttatc tagacggacc
      661 gtttggtata ggcaaaacgt ctatactaaa cgctatgccc gaccacacgc ccgatggggc
      721 tcctatattg aaagtgtacg aaccaatgaa atattggaga tgccagtcta ccgatttggt
      781 ggtagctgcc aacgaaacgc cagaacgtag gcgtggtgga gctttatcac gattccaatc
      841 tgacatgatc atggcatcta tacaagccag atttgccgat ccatatttgc tttttcacga
      901 acggttatca tctaaatgta gaggaaaaat agaaatatgc gatactccag caattatatt
      961 aatgctggat aggcaccctg tggcggcgat attatgtttc ccaatcactc gctatttact
     1021 tggagaatat tctttggaaa tgttgattag ctctataata agacttccgt tggaatcccc
     1081 cggatgcaac ctgacagtca caatccttcc cgacgaaaag gaacacgtta ataggatttg
     1141 ttcaagagat agaccgggtg aaacggcaga tagaaatatg ctcagaacac tcaatgccgt
     1201 atacgcatct ttggtggaca cggttaaata cgcaaatcta acatgccctt acgagaaaga
     1261 aagctgggaa atggaatggt tgggacttcc ctggtttgaa gagtcattac ttgaagaatt
     1321 catctcgcgt ccccgccctg ttatttgttc gagaactcga atgccgctgg accgaactct
     1381 cctggccatt tttaaacgga aagagctgtg tagcgaaaat ggggagctgt taactcagta
     1441 ttcttggata ttgtggggat tactgactaa actacacacc attaatgtcg aattatttga
     1501 cattagcggt atgtcacgtc gagaatgcgc cacgctataa tgcatactat gccggagaga
     1561 ttgtctactc tcgctagctg gaatgattta tgcgagcttg aagatgatgt aatttcctat
     1621 aataagggaa tgtgtaacga ggttggagcg tctcgataat tcttcttaat ctgctggtat
     1681 tggttactgc cataacttaa tattggtcca tgctagaata gtcatacgct acgatctgtt
     1741 gctatatatg actatcgcca aactgttaaa ccgcgaagaa tatatttcat ataaacctaa
     1801 gggcccctca gtctgatttt ttgtgaaaac gtgtatacat gaagttttac tgcataatcc
     1861 gtttcatgat catagcgaat ctttattcat cttaccaaat atcgcttcca ggcacatatc
     1921 catcgcaaat attgcttgac atgaagaact cgccgctcgt acgctttaat atatcgacgc
     1981 gtgattataa agacgagaca ctctggatac ggaaaaattc gacatttgtt tatatcgat
//
LOCUS       LACHSP       3095 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  Plasmid pTR2030 (from L.lactis) aborts phage infection (hsp) gene,
            complete cds.
ACCESSION   M30192
KEYWORDS    .
SOURCE      Plasmid pTR2030 (from Lactococcus lactis lactis (strain ME2)) DNA.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 3095)
  AUTHORS   Hill,C., Miller,L.A. and Klaenhammer,T.R.
  TITLE     Nucleotide sequence and distribution of the pTR2030 resistance
            determinant (hsp) which aborts Bacteriophage infection in
            Lactococci
  JOURNAL   Appl. Environ. Microbiol. 56, 2255-2258 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Hill, 29-NOV-1989.
FEATURES       from  to/span     description
    pept       1105     2991     hsp gene (aborts phage infection)
BASE COUNT     1129 a    439 c    518 g   1009 t
ORIGIN      
        1 ataagagtca gaatgtgacc gtcataatga ttacgaggat agctgcaacg cttggtttcg
       61 agttaattgt cgtgcaacga ttgacggaga attgaaaaat ttcaaagttc acgatgttga
      121 gacttacgat aaaaagaaaa atagatttca tcggagattg tctgatgcct tggtcccaat
      181 aatatcttca gaagatgttg aatttgaagc cgagcaattt ttgaaattgt atttcccagt
      241 agctatggaa attcctcagc gtattgaccc gcttttgata gctgaaaaaa tgggacttac
      301 ggttgagtac catgaaattt cagaagatgg gaacattttt ggtcaaattt attttcatga
      361 tgctcttctt gacgggaaag aaatcaaggc aaaaactatc ctgattgatc ctcgtgtcat
      421 cgaaagccgt gggattggtg gacttaataa cacaataatg cacgagtgtg tccattggca
      481 taaacatagg cttgcatttg aattggttag gcttttccaa ccagagcttt cgaatattac
      541 cacaactaaa gaagagttcg atgggctcat tgagaaaaat atgactccaa cagattggct
      601 ggagattcaa gctagaaaga ttacgcctaa aatcttgatg cctaagaaga tgtttaaaca
      661 agaagttgaa acgtttatgc gtcctgatgg tggaagtgga atagtggacc agctcctcat
      721 aatcgaggga actatttcag agcttgctag ttttttcacc gtatcaaaat tatctgcaaa
      781 aattcggatg gttgagctgg gatatgagat tgcgataggt gcttcaaact atgttgatgg
      841 gcatcctgta cccccacact cttggaagca gggagctgta tctgctaatc agacttattc
      901 aattggtttt gttgatgcaa ctatagagac attaaagaac ccaagattac tcgtcgcaat
      961 aaaaaaagaa cttaacctca tatttcatag ggattgaagt tcttgttttt tatttttggt
     1021 agctaaaatg gtatccaatt aataattagc tattttattt tggtaaaata gaataaatag
     1081 atttttagaa cagggagtag gtaaatgata actttgcaac accaagattg ggaacgtgca
     1141 gtcaatatga ttaaaaacat tccaccaagt gccaaaaata aatactttca aacttttcct
     1201 ttctttttgt taagtgaaac cagctgggaa gagttacttt ctgaaaattt tttctactca
     1261 tatataaagt caggagagtt tctgacttat caagaaaacc tttcattcta cgataggaca
     1321 attcaaaaga gccatggagc ataccgacaa actcgaattg tatctccaat catctatata
     1381 tttttaatag caattgcttc tcaagtggaa agaatatatg ttgaaaagcg tactaatgat
     1441 atgtcagtat atttttcagg ttcttttgaa aaagaaaaaa acactgcaca ttataaacaa
     1501 agttataaca cttatatgac tgaacttaat gcctgtcaag aagagtttga ttattatttt
     1561 caaacagatt tttcaacatt ttttcatcta gtagatacag acaacttatt caataaaata
     1621 gaccgattag atccgaaaag tgcattagtc tactcttccc taataaaaat gattggacaa
     1681 ggaagaatgc ccatagttga tggaaattca ggtctttctt ttttaaatac tgtagtttat
     1741 ctagatgact ttgataaaga aattattgat tccttaaaaa cgatagttga aatcgaaagt
     1801 tttaaactag ttcgatatgt agatgactta catattttca taaaatgtgc aaataaagat
     1861 ttagattttt taaattataa agtatataat ttgttatgcg aaaaagcaac gaaacatcat
     1921 cttgagataa attcttcaaa aacaaaatca tttactccca caagtgaact aagcactaaa
     1981 atgaatacag atttgtataa tttctttgtg tataacgaag atgttgattt tgagcaatat
     2041 ttttcaaaaa ataccttaat agaatttcta gataaattaa ataatatgtc tgttaatgct
     2101 gatttttcag agtacgaaaa agaagtcctg tatacattgg aaaaccctga aatagtatct
     2161 gacggaagtt atatattaaa cgctattgtt tacaataaaa gtacgtggtc acaggattat
     2221 gatataaaaa ataaaattag tttattagta aatagtaatt atagaaaact aagatattct
     2281 gcaaaagcac tcataactct tgtcttgaat acaagagatg gtgatatcat aaaaggacta
     2341 cttaataatt tatttactac attcaaaaat ggtacaaatg atattattga tgaaattata
     2401 ttaattgaat acttggtaca aagaaagttt aatcataaag atttaatgac tatcttgaaa
     2461 gctgacgacc atggtattaa ggaatatatt aaggcatatc aaacttctga tttcataaaa
     2521 agtctagaaa aaaataaagt tattttttat actaatcaga aagaagttta tcctttaata
     2581 agtaaggata aaatactcaa tttcatatat tttagagcta agtattttga atccttagac
     2641 ctagttcttg aatcctttgc atattataaa aactattttg ataggtttgt tgctcacgct
     2701 atgttctgta caggaattga ctctggaaga aaacctaatt ataagctata ttatacagaa
     2761 ggaaagttaa tagatgggtt aaaacaacta aattttttat cttctgatga gattacaaaa
     2821 ataattaatg aagcacataa aattagaaat agtaaccccg taagtcactc tagtgctggc
     2881 ttacttcaaa acgaagattt tagtagatat agagttaagt caagtcttaa tgacttgaaa
     2941 attatcatag aacaactttc aacattgcta caaaataaaa acagattata aaatattatt
     3001 atctgtttta tttttttcaa aagaatcaaa gtttattaga tttttcaact gtcctgttat
     3061 ctcctcatcc gcatttggat aaagatgact ataaa
//
LOCUS       PH15RER      2016 bp ds-DNA             PHG       27-JUL-1990
DEFINITION  Bacteriophage phi-15 right early region containing genes 17,
            16.5-16.9, complete cds.
ACCESSION   M28830
KEYWORDS    DNA replication protein.
SOURCE      Bacteriophage phi-15 DNA.
  ORGANISM  Bacteriophage phi-15
            Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE   1  (bases 1 to 2016)
  AUTHORS   Benes,V., Arnold,L., Smrt,J. and Paces,V.
  TITLE     Nucleotide sequence of the right early region of Bacillus phage
            phi-15 and comparison with related phages: Reorganization of gene
            17 during evolution
  JOURNAL   Gene 75, 341-347 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        203      664     gene 17 peptide
    pept        666      992     gene 16.9 peptide (put.)
    pept        989     1306     gene 16.8 peptide (put.)
    pept       1308     1700     gene 16.7 peptide (put.)
    pept       1697     1861     gene 16.6 peptide (put.)
    pept       1858     1971     gene 16.5 peptide (put.)
BASE COUNT      744 a    374 c    382 g    516 t
ORIGIN      
        1 aaagtagggt acagcgacaa cataccccat ttccccattg accgactatc ttcgacaaca
       61 atctaacaac taaaacacaa ctatatacct atactaaata acaacatcaa tttgtcgaaa
      121 agggtagaca aactactgtt taacatggta tactataata gaagtaagga agtaagacaa
      181 ccaatcatag gaggaattac acatgaacaa ctaccaatta actattaacg aggtaatcga
      241 gatcatcaac cgcaatacag acattaataa gttagtcgct aaaaaggaca acttatatcc
      301 aacagatttg tatgacttag acaaacaaca attaatcgct atcattctca acagcgattt
      361 cgcattatct agcatcaaaa gagcattatt agaggtaact gtcgaagagc taagagaaca
      421 ggacaatgat gacgatttag atgaaataga taacgagtta tacgagggcg cagaggcggc
      481 tgatgttcca cgtgaaacaa tagtgaaagt attcgaggca gacaaatcaa tagtaacgtt
      541 caacggtgag aaactaaaac attatgttaa tgttgatgac aatagctcat ctgttgatga
      601 agtaaagaag atcgccaagg aaatatcaga acatgacttt aacgaaaaca aggagaacaa
      661 gtaacatgag cgtacaactt aatgcattta cattcatctt agaacgccgt ggatggcgca
      721 tggtatgcta cgaacaatta acaacgaacg ggacacgcat attacatttc tatctcaaag
      781 ataacccaac gttctttgct acatactcat cacaattctt atcagacacc aaaatgataa
      841 gacgctttgc atcatggagc gggcaattac ttgaaggttc aaactctgtg ttctggacta
      901 acataacacc atttgagcca attgatgaag aaacagcaga agacatcaag aatctcgaca
      961 aagtcgttga aggaatgaat tttacgttat gattgacatt attgtaaaag aggacaagcg
     1021 cttaataact gttcaaacac cagagggaga cgaagtattt tacactttgt ctttctcaga
     1081 cgagcataag ttactaaaac gatcaagtgc cagactaaga aacaacatat atgccattgg
     1141 tgtagcaaat atcagatggg tgttagttga catggataac atgatactta gtgagtatat
     1201 gcatcacgtt gacatcctaa aagacataga caggaaaatg agacaattag ggtatatagt
     1261 catttcagaa tggcaacacg caaataaaaa agggacgagg agataatatg gaagctattt
     1321 tgatgatcgg tgtaatcaca ttgtgcgtta tattcctttt atcaggacgc aataataaaa
     1381 agaaacaaga aataagggag ctagaagatt accttgaaga cctcaatcaa agaatcgtgc
     1441 aacgaacaca gatacttagt gagcttaacg aagtcatcac aaacagaagt gttgataagt
     1501 cagtcaatat gtcagcttgt gaaatcgccg tccttgacct ctatgagcag tcaaatatcc
     1561 gcattcctag tgatatcatc gaagatatgg ttaatcaacg tttacaatca gaacaagatg
     1621 tgttaaacta tatagagaca cagcggacat actggaaact ggagaatcag aaaaaactat
     1681 atcgggggtc attgaaatga agttgctcac tcacacgtgt cattactgta gcttctcttt
     1741 cttcactcga aagtttgatg tatttggtgc aataaccaag aaagatactc ctgttgtttt
     1801 ctgccccgca tgtgggaatc aatccctttc agtatcacac attgaggagg agatcatatg
     1861 aatcagaaag aattccaagc cgttttagac tggatgttgt cacacactat tatacaattc
     1921 cacgaataca actatatgct acaaaagagc ctaccgttcc tcagacgata ggcttttctt
     1981 ttgattgttt cacgtggaac atcactgtat acgcat
//
LOCUS       STLENV       4122 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Simian T-cell leukemia virus, pol-env-pX-3' LTR region.
ACCESSION   M11373 M11374
KEYWORDS    env gene; pol gene.
SOURCE      Simian T-cell leukaemia virus genomic RNA.
  ORGANISM  Simian T-cell lymphotropic virus type I
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 4122)
  AUTHORS   Watanabe,T., Seiki,M., Tsujimoto,H., Miyoshi,I., Hayami,M. and
            Yoshida,M.
  TITLE     Sequence homology of the simian retrovirus genome with human T-cell
            leukemia virus type I
  JOURNAL   Virology 144, 59-65 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        311     1777     env protein
BASE COUNT      905 a   1456 c    741 g   1020 t
ORIGIN      
        1 catttaaacg tattaaccca ttgccacaaa acccgatggc aacttcacca ctccctccgt
       61 ctgcaaccta tcccagagac acactccccc atcagtaaac aaacccattg gtattatttc
      121 aagcttcctg gtcttaatag ccgccagtgg aaaggaccac aggaagctct ccaagaagcc
      181 gccggcgctg ctctcattcc ggtaagcgct aattctgccc agtggatccc gtggaggctc
      241 ctcaagcaag ctgcatgccc aagacccgtc gggggccccg ccgatcccaa agaaaaagac
      301 caccaacacc atgggtaagt ttcttaccac cttactttta ttcctccaac tccgcccccc
      361 cattgtctgt gatcacagtc ccagctgctg cactctcaca gtcggagtct cctcatatca
      421 ttctaaaccc tgcaatcccg cccaaccagt ctgctcatgg accctcgacc tgctagctct
      481 ttcagcagac caagccctac aacccccctg ccctaatctg gtgggttact ccaactacca
      541 tgctacctat tccctatatc tattccctca ttggattaaa aagccaaacc gaaatggcgg
      601 aggctattat tcggcctctt attcagaccc ttgttcccta aagtgccctt acctaggatg
      661 ccaatcatgg acctgtccct acacaggagc cgtctccagc ccctattgga aatatcagca
      721 agatgtcaat ttcactcaag aagtctcacg cctcagcctt aatctccact tttcaaaatg
      781 cggtttctcc ttctcccttc tggtcgatgc actaggatac gaccccatct ggttccttaa
      841 caccgaacct agccaactgc cccccactgc ccttcctcta ctcccccatt ccaacttgga
      901 ccacatcctc gagccttcca taccatggaa gtcaaaactc ctaactcttg tccaattaac
      961 tctacaaagc actaactaca cttgtattgt ctgcatagat cgtgccagct tatctacttg
     1021 gcacgtctta tactctccca acatctctgt tccatcctcc ccctctactc ccctccttta
     1081 cccatcgtta gcgcttccag ccccccatct gacattacca ttcaactgga ctcactactt
     1141 tgaccctcag attcaagcca tagtctcctc cccctgtcac aactccctca tcttgccccc
     1201 cttttctctg tcacctgttc ccgccccagg gtcccgttcc cgccgggcag taccggtggc
     1261 ggtctggctc gtctctgccc tggccatggg aaccggagtt gctggcggga ttaccggctc
     1321 catgtccctc gcctcaggaa aaagtctcct acatgaagta gacaaagata tttcccaatt
     1381 gacccaagca attgtcaaaa accacaaaaa tctactcaag attgcacaat atgctgccca
     1441 aaacagacga ggccttgatc tcctgttttg ggagcaagga ggactatgta aagcactgca
     1501 agaacaatgt tgtttcctaa atattactaa ttcccatgtc tcaatacttc aagaaagacc
     1561 cccccttgaa aatcgagttc taactggttg gggcctcaac tgggaccttg gcctctcgca
     1621 atgggcccga gaggccttac aaactggaat cacccttgtc gcattactcc ttctggtcat
     1681 ccttgtaggg ccatgcatcc ttcgccaact acgacaactc ccctcgcgtg tcagataccc
     1741 ccactactct cttataaacc ccgagtcctc cctataaacc tagcagttac tgcgaccaca
     1801 ttgcttgcag cctcccctgc caatagttaa cctcccccat caaatcctcc ttttcctgca
     1861 acaacctctt ccgttcggcc tccaaggagt ctacctcact ctccagctgt ctagcataac
     1921 catcaactcc caactccttt attttttctt tcctagcatc acgctgtctc gccctcttag
     1981 ccccttgttc tgccagtctc cgcttgcgct caagccgctt ctcctcctcc cgcctcctgc
     2041 gccgagcctt cttctcttcc ttccttttca ggaactcggc aatctgttcc tcctcctctt
     2101 tctcccgctc tctttttcgc ttcctcttct cctcagcccg tcgctgccga tcacgatgcg
     2161 tccccccgcg aggtggcgtc tcctccctta gagggacccg ttgctgctgg ccattacatt
     2221 cctcttcgag ggatagtaaa ccgtcaagca cagactcctc ctccgtcctt tgtttgttta
     2281 agtcctcttc tagggataat agtccgtcca ccaagtcttc caccagcagg tcctccgggc
     2341 agggaacagg taaacatcga aacggcccta cacatacaaa attaatcata cttattatca
     2401 gcccacttcc caggttttgg acagagcctt ctttatggat acccagtcta cgtatttgga
     2461 aactgtgtgc aaggcgactg gtgccccatc tctgggggac tatgctcggc ccgcctgcac
     2521 cgtcacgccc tactggccac ctgtccagag catcagatca cctgggaccc caccgatgaa
     2581 cgcgttatcg gctcagctct gcagttcctt atccctcgac tcccctccct ccccacccaa
     2641 agaacctcta agaccctcaa ggtccttacc ccgccggcta ctcatacaac ccccaacatt
     2701 ccaccctcct tcttccaggc cgtacgaaaa tactccccct tccgaaacgg atacatggag
     2761 cccacccttg ggcaacaact cccaaccctg tcctttcccg accccggcct ccggccccaa
     2821 aacttgtata ccctctgggg aaactccgtt gtctgcatgt acctctacca actttccccc
     2881 cccatcacct ggcccctcct gccccatgta attttctgcc acccaggcca acttggggcc
     2941 tttctcacca acgttcccta caaacgaatg gaagaacttc tctataaaat tttccttaat
     3001 acaggggcca caatcattct acccgaggac tgcttgccca ccaccctatt ccgacccact
     3061 agggcacccg cctcgctgac agccaggcaa aatggcctcc ttccgttcca gtcaaccctt
     3121 actacgccag gccttatttg gacatttacc gatggcacgc ccatggtttc cgggccctgc
     3181 cctagggacg gtcagccatc tttggtgttg cagtcctcct cctttatatt tcacaaattt
     3241 caaaccaagg cctaccatcc ctctttccta ctctcacatg gcctcataca gtactcgtcc
     3301 tttcataact tacatctcct ctttgaagag tacaccaata ttcccatttc tttacttttt
     3361 aacaaagaag aggcaaatga tactgaccat gagccccaaa tgctccccgg gggcctaaaa
     3421 cctcccaatg gaaaacattt ccgcgaaaca gatgtctgaa aaaggtcagg gcccagacta
     3481 gggctctgac gtctcccccc ggagggcagc tcaacacaag gccctgacgt gtccccctgg
     3541 ggacaaatca tgagcccaga cctccgggaa gccaccgaga accacccatt tcctccccat
     3601 gtttgtcaag ccgtccccag gcgttgacga caacccctcg cctcaaaaac ttttcatggc
     3661 acgcatacag ctcaataaaa caacaggagt ctataaaagc gtggagacag ttcaggaggg
     3721 ggctagcttc tgctctacac gcgcccgccg ccctacctga ggccgccatc cacaccggtt
     3781 gagttgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgaccg ccgtctgggt
     3841 aagttcggag ctcgggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag
     3901 attcggccgg ctctccacgc tttgcctgac cctgcttgct taactccaca tctttgtttc
     3961 gttttctgtt ccgcgccgct gcagatcgaa agttccaccc ctttcccttt cattcacgtc
     4021 tgactgccgg cttggcccac ggccaagcac cggcatcctc actggctcgg agccagtagc
     4081 agcccatttt acaactctct ccagagagaa gtttagtaca ca
//
LOCUS       HUMIGHDA      368 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human Ig H-chain mRNA V-D-J region, partial cds.
ACCESSION   M33858
KEYWORDS    diversity region; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region;
            variable region V-H4.
SOURCE      Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA,
            clone C.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 368)
  AUTHORS   Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J.
  TITLE     Intraclonal diversity in the V-H genes expressed by CD5- chronic
            lymphocytic leukemia-producing pathologic IgM rheumatoid factor
  JOURNAL   J. Immunol. 144, 1526-1530 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   368     Ig H-chain V4-D-J region (AA at 1)
BASE COUNT       73 a    110 c    112 g     73 t
ORIGIN      
        1 caggtgcatc tacagcagtg gggcgcagga ctgttgaagc gttcggagac cctgtccctc
       61 acctgcgctg tctatggtgg gcccttcagt ggttactatt ggagctgggt ccgccagccc
      121 ccagggaagg gaccggagtg gattggggaa atcaatcata gtggacgcac cacctacaac
      181 ccgtccctca cgagtcgagt caccatgtca ctggacacgt ccaagaacca gttctccctg
      241 aagctgacct ctgtgaccgc cgcggacacg gctgtttatt actgtgcgag aggcccttgc
      301 gaggcatact gtactgatga tgccccgcaa ggcggtgcgc ggtacttcca gcactggggc
      361 cagggcac
//
LOCUS       HUMIGHDB      260 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human Ig H-chain mRNA V-D-J region, partial cds.
ACCESSION   M33859
KEYWORDS    diversity region; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region;
            variable region V-H4.
SOURCE      Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA,
            clone A.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 260)
  AUTHORS   Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J.
  TITLE     Intraclonal diversity in the V-H genes expressed by CD5- chronic
            lymphocytic leukemia-producing pathologic IgM rheumatoid factor
  JOURNAL   J. Immunol. 144, 1526-1530 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   260     Ig H-chain V4-D-J region (AA at 1)
BASE COUNT       57 a     79 c     75 g     49 t
ORIGIN      
        1 atccgccagc ccccagggaa gggactggag tggattgggg aaatcaatca tagtggacgc
       61 accacctaca acccgtccct cacgagtcga gtcaccatat cagtggacac gtccaagaac
      121 cagttctccc tgaagttgac ctctgtgacc gccgcggaca cggctgttta tttctgtgcg
      181 agaggccctt gcgaggcata ctgtactgat gatgccccgc aaggcggtgc gcggtacttc
      241 cagcactggg gccagggcac
//
LOCUS       HUMIGHDC      260 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human Ig H-chain mRNA V-D-J region, partial cds.
ACCESSION   M33860
KEYWORDS    diversity region; immunoglobulin heavy chain; joining exon;
            mu-immunoglobulin; processed gene; variable region;
            variable region V-H4.
SOURCE      Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA,
            clone B.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 260)
  AUTHORS   Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J.
  TITLE     Intraclonal diversity in the V-H genes expressed by CD5- chronic
            lymphocytic leukemia-producing pathologic IgM rheumatoid factor
  JOURNAL   J. Immunol. 144, 1526-1530 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   260     Ig H-chain V4-D-J region (AA at 1)
BASE COUNT       58 a     82 c     74 g     46 t
ORIGIN      
        1 atccgccagc ccccagggaa gggactggag tggattgggg aaatcaatca tagtggacgc
       61 accacctaca acccgtccct cacgagtcga gtcaccatat cagtggacac gtccaagaac
      121 cagttctccc tgaagctgac ctctgcgacc gccgcggaca cggctgttta ttactgtgcg
      181 agaggccctt gcgaggcata ctgtactgat gatgccccgc aagccggtgc gcggtacttc
      241 cagcactggg gccagggcac
//
LOCUS       HUMHLADQA    1468 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human MHC class II HLA-DQA1 mRNA, complete cds.
ACCESSION   M33906
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human lymphoblastoid cell cell line SPL, cDNA to mRNA, clone
            pII-alpha-7.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1468)
  AUTHORS   Jonsson,A.-K., Andersson,L. and Rask,L.
  TITLE     Complete sequences of DQA1 and DQB1 cDNA clones corresponding to
            the DQw4 specificity
  JOURNAL   Immunogenetics 30, 232-234 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         44      811     MHC HLA-DQA1-alpha protein
                                 /hgml_locus_uid="LY0151Z" /nomgen="HLA-DQA1"
                                 /map="6p21.3"
    signal     1119     1124     poly-A signal (pot.)
BASE COUNT      356 a    381 c    309 g    422 t
ORIGIN      
        1 cacagctcag agcagcaact gctgaggctg ccttgggaag aggatgatcc taaacaaagc
       61 tctgctgctg ggggcccttg ccctgaccac cgtgatgagc ccctgtggag gtgaagacat
      121 tgtggctgac catgttgcct cttatggtgt aaacttgtac cagtcttacg gtccctctgg
      181 ccagtacacc catgaatttg atggagacga gcagttctac gtggacctgg ggaggaagga
      241 gactgtctgg tgtttgcctg ttctcagaca atttagattt gacccgcaat ttgcactgac
      301 aaacatcgct gtgacaaaac acaacttgaa catcctgatt aaacgctcca actctactgc
      361 tgctaccaat gaggttcctg aggtcacagt gttttccaag tctcccgtga cgctgggtca
      421 gcccaacacc ctcatctgtc ttgtggacaa catctttcct cctgtggtca acatcacatg
      481 gctgagcaat gggcactcag tcacagaagg tgtttctgag accagcttcc tctccaagag
      541 tgatcattcc ttcttcaaga tcagttacct caccttcctc ccttctgctg atgagattta
      601 tgactgcaag gtggagcact ggggcctgga cgagcctctt ctgaaacact gggagcctga
      661 gattccagcc cctatgtcag agctcacaga gactgtggtc tgcgccctgg gattgtctgt
      721 gggcctcgtg ggcattgtgg tgggcactgt cttcatcatc atccgaggcc tgcgttcagt
      781 tggtgcttcc agacaccaag ggcccttgtg aatcccatcc tggaagggaa ggtgcatcgc
      841 catctacagg agcagaagag tggacttgct acatgaccta gcattatttt ctggccccat
      901 ttatcatatc ccttttctcc tccaaatgtt tctcctctca cctcttctgt gggacttaaa
      961 ttgctatatc tgctcagagc tcacaaatgc ctttgaatta tttccctgac ttcctgattt
     1021 ttttcttttc tcaagtgtta cctactaaga gatgcctgga gtaagccacc cagctaccta
     1081 attcctcagt aacctccatc tataatctcc atggaagcaa caaattccct ttatgagata
     1141 tatgtcaaat ttttccatct ttcatccagg gctgactgaa accgtggcta agaattggga
     1201 gactctcttg tttcaagcca atttaacatc atttaccaga tcatttgtca tgtccagtaa
     1261 cacagaagca accaactaca gtatagcctg ataacatgtt gatttcttag ctgacgttaa
     1321 tatttctttc ttccttgtgt tcccaccctt ggcattgcca cccacccctc aattaaggca
     1381 acaatgaagt taatggatac cctctgcctt tggctcagaa atgttatagc aaaaatttta
     1441 aaataaaaaa gtaagtctgt attaattg
//
LOCUS       HUMHLADQB     896 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human MHC class II HLA-DQB1 mRNA, complete cds.
ACCESSION   M33907
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human lymphoblastoid cell cell line SPL, cDNA to mRNA, clone
            pII-beta-10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 896)
  AUTHORS   Jonsson,A.-K., Andersson,L. and Rask,L.
  TITLE     Complete sequences of DQA1 and DQB1 cDNA clones corresponding to
            the DQw4 specificity
  JOURNAL   Immunogenetics 30, 232-234 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         46      831     MHC HLA-DQB1-beta protein
                                 /hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1"
                                 /map="6p21.3"
BASE COUNT      185 a    252 c    263 g    196 t
ORIGIN      
        1 gtccaagctg tgttgactac cactactttt cccttcgtct caattatgtc ttggaagaag
       61 gctttgcgga tccctggagg ccttcgggta gcaactgtga ccttgatgct ggcgatgctg
      121 agcaccccgg tggctgaggg cagagactct cccgaggatt tcgtgttcca gtttaagggc
      181 atgtgctact tcaccaacgg gaccgagcgc gtgcggggtg tgaccagata catctataac
      241 cgagaggagt acgcgcgctt cgacagcgac gtgggggtgt atcgggcggt gacgccgctg
      301 gggcggcttg acgccgagta ctggaatagc cagaaggaca tcctggagga ggaccgggcg
      361 tcggtggaca ccgtatgcag acacaactac cagttggagc tccgcacgac cttgcagcgg
      421 cgagtggagc ccacagtgac catctcccca tccaggacag aggccctcaa ccaccacaac
      481 ctgctggtct gctcagtgac agatttctat ccagcccaga tcaaagtccg gtggtttcgg
      541 aatgaccagg aggagacaac tggcgttgtg tccacccccc ttattaggaa cggtgactgg
      601 accttccaga tcctggtgat gctggaaatg actccccagc gtggagacgt ctacacctgc
      661 cacgtggagc accccagcct ccagaacccc atcatcgtgg agtggcgggc tcagtctgaa
      721 tctgcccaga gcaagatgct gagtggcatt ggaggcttcg tgctggggct gatcttcctc
      781 gggctgggcc ttattatcca tcacaggagt cagaaagggc tcctgcactg actcctgaga
      841 ctattttaac tgggattggt tatcactttt ctgtaacgcc tgcttgtccc tgccca
//
LOCUS       HAMRPS14     5300 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  C.griseus ribosomal protein S14 (RPS14) gene, complete cds.
ACCESSION   M35008
KEYWORDS    ribosomal protein S14.
SOURCE      C.griseus (haplotype EmtB wild type) ovary transformed fibroblast
            cell line CHO DNA.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (base 1 to 5300)
  AUTHORS   Rhoads,D. and Roufa,D.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (sites for [1])
  AUTHORS   Diaz,J.-J., Rhoads,D. and Roufa,D.J.
  TITLE     Genetic analysis of a vital mammalian housekeeping locus using CHO
            cells that express a transfected mutant allele
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by D.J.Roufa, 08-JUN-1990.
            
                Author address: D.J.Roufa
                Kansas State University
                Division of Biology
                Ackert Hall
                Manhattan, Kansas 66506
FEATURES       from  to/span     description
    pept       1977     2125     ribosomal protein S14, exon 2 (first expressed
                                 exon)
               2576     2737     ribosomal protein S14, exon 3
               4074     4150     ribosomal protein S14, exon 4
               4724     4791     ribosomal protein S14, exon 5
    pre-msg     183     4841     ribosomal protein S14 mRNA and introns (5' end
                                 +/- 4 bp)
    IVS         220     1974     ribosomal protein S14 intron A
    IVS        2126     2575     ribosomal protein S14 intron B
    IVS        2738     4073     ribosomal protein S14 intron C
    IVS        4151     4723     ribosomal protein S14 intron D
    signal     4819     4824     poly-A signal
    mut        4783     4783     c in wt; g in mutant Emr-75 [ref. 2] null
                                 Arg-150 -> Gly with ThaI rflp.
BASE COUNT     1315 a   1121 c   1415 g   1449 t
ORIGIN      
        1 agatctggtc acagcgacac gcagcacccg gagtccttaa taagccatct tggtcgccgg
       61 caggcttagc ctccttggaa accgcaatag aggcttcttc cgggagccgg gattccctcc
      121 gtgacctcag aggtacactt ccggaatacg gaagtgaccc gtctgcccgc cccttccttc
      181 tctctctttc cggtggagaa gtctggagac gacgttcagg tagggaggcc cggcggccca
      241 caatcgtgcg gcatcctctg gccaggggcg tctgcgggct tggggtcgcg gtttgtgcgc
      301 ggtaaccgag ccaggtgcgt gatggtgacg ggcggctgtc cgctcgccgc gatccacccg
      361 gcctggggct gcgccgtgtc cgggaggcgg ccttctggcc tgagatcgga aggtctgagg
      421 tggaagtgcg gggcctgcct gggccttgcc gtcacgccac agtcgtcgcc cgagcgttcg
      481 gggagcgtgt ccctagtcgc gggtgggccg tcaccggcac cccattagat tcttaggggc
      541 cgtcgacgac ctgcacctaa cctcatgccc ctcatgtgga aaatacgagc agcgttcgta
      601 tctgcccctt ctgtaaaccg aagcgtaaaa tgggatgtca gaaaacctgg tcacgcagga
      661 ttagagtgct tctaacccat ggtggttacc cctccgagtc ggacacattg cctgttgggg
      721 tgccccctag tagagtagga ctgatgccat atgtttacat ttctctgaat agccctctcg
      781 tctccactga ggacggcaaa gctacactga acagatgtca tgtcttatgg aatcgtgttt
      841 gttagcctaa tttgagggta tgagcaggtg actacatcct ggacctgccc ttgtgctttt
      901 ggcaaataag tcccaaatag tgtgtgaggg aggcactaca gagatcccag gtgtggttga
      961 aacttttaag gcctaggagg aactttgtgg agtaggagca tcttggttct aggaggagta
     1021 taagacaggc tttggtgcag tattgtcctg gacagactta gcctcgggtt ggtcaaaggc
     1081 atgcctgcag ctttggcaaa aaaaaaaaaa gcctctttgc ttgtgtattt tccctgggtc
     1141 ctgtgactgt acttacatta ttgactaggc aggaaggggt tacatgtcag gtgtatttgc
     1201 agttcataac aatagcaaaa ttagagttat gaagtagtag caatgacttt ttgtgtgggg
     1261 ggcaccacaa tatgagaatt gtactaaagg gtcacagcgt taggttgaga accaaagggt
     1321 agactcactt gatgctgaac actgaatttt aggtatggat atctttacca ttaaacttcc
     1381 tggaaggcat accctgacca tcctgggact tgctacgtag accataacta cgatatcctc
     1441 ctgcctgtga ctcccaggtg ctggggctaa agttatacac aagcacacat tgctgtggca
     1501 gtgcttttta atgtttatta ctagtaaagg gtgaaagatt tggttctaaa cagtgagagg
     1561 ctgtcagatt aagaattgtg tgtggacagg gagtccgtta atctgcagtt ggcttcagtt
     1621 gtgtcatgtt aatcaaataa acttcccagc ttaaggagag ggagagggga gttgtgaatg
     1681 gggctttgcc ccaacacctg agttcagcat cctctatttt ggataactac ttgtgaaagt
     1741 ttgaggtgca tattgttcat tgctggaatc ttaggctcat cctgaaaatg cttggagctt
     1801 ttgtttagaa attgtctcct ctcttggcgc ctccctttta gttacaactc agtaattccc
     1861 caagtacaaa tatgttggat attagagagc caaaggagct tgggttttcc tttagtaggc
     1921 taaagtattt ggggatgttt tgtctgttaa catggaggtt ctccttccac tcagaaatgg
     1981 cacctcgcaa gggaaaggaa aagaaggaag aacaggtcat cagcctcgga cctcaagtgg
     2041 ctgaaggaga gaatgtattt ggtgtctgcc acatttttgc atctttcaac gacacctttg
     2101 tccatgttac tgatctttct ggcaagtgag tacttgggga catgaagaca taaaaccaag
     2161 cctcacaggg actgaggtgg gagaaaaagg cataggagtt tatgggattc cctctaaaga
     2221 attactgatt ttcttatttt cccataaatt ataagagaaa acagttgaat gtagctgagg
     2281 attcatccac agaaaatcac cgcaaatagg aagtggaagc agaggatcag aaactcaatt
     2341 atcctcagct acacgcacat gtaaaatata tgcattatgg aagtgattat ggggataaaa
     2401 tgcaaaacta attgttgtat taacagggag tttgtctaga tagggtcctt ggacccctgt
     2461 aattatacac aactcctgtc ttgttcagaa ccaggatgtg tgcttgttgg ggttcagcaa
     2521 gtgtttcctg tgtccttgtt ggaaaatgaa ggcatggatc tgcttttgtc tctagggaaa
     2581 ctatctgccg ggtgactggt ggaatgaagg taaaggctga cagagatgag tcttctcctt
     2641 atgcagccat gttggctgcc caggatgttg cccagaggtg caaggagctg ggcatcactg
     2701 ccctgcatat caaactccga gctacgggag gaaacaggta tggaagtcta gggctggaca
     2761 tttgtaagtt cactgggctt gggggtggat ctgagatttg agtctctgct gtgtctgaga
     2821 gcaagccgtt ttaggaatgt atgtatgttc ttcgtcaggc atggtagctg atacctgtaa
     2881 tcccagcact ccggaaccaa tataggagga ggatctaaag tttggtgcta acctgggcta
     2941 cacagtgagc ctcaaaaata agatagtttg ttcctttatg tttgtccctg tagagtgctg
     3001 cttttatttt ttttttttta aagcatacat gagactgact cttccccagc accaggtaga
     3061 ctgggtgtgg tagtatacac ctgttgtctc agaaagtgga ggtgaggaga gctagagttt
     3121 caagttggtt attggttgca tattgagtag gtggccagct tgagctgaac agtgatgctt
     3181 aaatgatgaa ctgggggtgc atagtccaga tagagacaaa acttcaagag aagcatcttg
     3241 gggtgagaaa gaagttgggc tttacaggga tgattggtgt aagatatgat ctaagaagca
     3301 gggatggagc ctctggggtc ttaagtttta caaaacagag caagggcttg ggcatgaaaa
     3361 ctagaatagg tggctactga gtggaaagat actgcatagg aagccctaag gaagttacat
     3421 actggatgta ggagctggga agtaggcatc acagcaaggg gaggacagcc acaggccagg
     3481 tgactatccc ctcccccccc cacacacaca catgtacact tttaaaggat ttgtaggcat
     3541 ggtggctcac acctttaatc ccagcatctc ttggggggcg tccccccaca aaagtgggaa
     3601 aaattacatg cgtatgttgc ctgtgtgcat gcatgcgtgc attatcggtt ccctgtggag
     3661 gacagaagag gaccatgaat ctcttggaac tggtgttgct gagccacctc tccaggcctc
     3721 agttagcagt tttgaaaggg tttggttttg ttttgttttt tcgagacagg gtttcttctg
     3781 tgtagctttg gagcctatcc tggcactggc tttggagacc aggctggcct caaactcaca
     3841 gagatccgac tgcctcccga gtgctgggat taaaggtgtg caccaacgca ggatttggtt
     3901 ttgttgtgag attgtataat atatgaaaac cgtagcagcc tacaaagtct agaattttgt
     3961 gtaatcccct cacagtgtgc agatctttaa ctatagcccc accaccattc agaagtgaat
     4021 taaaaccatg tgaacattcc cactgacttg accaacaatc cttcattttt taggaccaag
     4081 acccctggac ctggagccca gtcagccctc agagctcttg ctcgttcagg gatgaagatt
     4141 gggcggattg gtaagtgacc cttctagcta acattgggtt tattttgaag cattgacccc
     4201 ttccaaacta tgtgatgtgc atattggttc aagcttgtct ggtgactggt gcagttaact
     4261 tttgttaacc tgtaaaagac attgagagcg ttgatttctg taaatgcccc ttaactgtta
     4321 tattttccca tttgcaattg ggaagactta aaaaacactt gtgaatctgt cctagcagcc
     4381 tttttccctg ctagctaaat gttaagggta tgcccttggg acagtccaag tgaataattg
     4441 cacacacctg gcctaaagag caaacagcca caggtgctca gtgtaggggc tctgagttag
     4501 aaacttgaag gagtccttcc cctctgatgt ttgtgtgcct tctcagagcc tgtgttgtcc
     4561 tcgtccaatg tgggttgaga tgaggaaatg acaggtgagg tgggcccttg tttattgacc
     4621 actggtgata catttgggga agaataaagt ctgctagtta gtctaatggc tagggaggct
     4681 gtgtttatgt agctatgggc aatgacattt tttcttcttg cagaggatgt cacccccatc
     4741 ccctctgaca gcacccgaag gaagggtggt cgtcgtggtc gccgtctgtg aacaggactt
     4801 ctcaaattat tttctgttaa taaattgctt tgtataagct attttggttc tgatgtttgt
     4861 ttgttttgag gcaggatctc tctccatgtc tatgtagaag agtctggcct ccaactcaga
     4921 tccacatacc tgtctctgcc tcccaagtac tggggataaa ggaatgcacc accacacccc
     4981 ggtgacctta ggggccaccc agggcaacac caaggcagtg ttttgagggg atgttaagat
     5041 tgcatgtagg agcttctaat gtaggttggg ggttggcaaa cctgggtttg caggctggct
     5101 ttctgagaat gggaacctga gagaggagat gctgaggctg ccactctggc attaatttgc
     5161 cagccagtat tttcaaagtg gaactgttag tagcccatgc tgtcatggga cagctaaaca
     5221 ccacacacac acacacccca agcagggtag tagagatcac aagttggaag ccagtttaat
     5281 atggtctact tggggaattc
//
LOCUS       AFATFDCD     7956 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  A.eutrophus plasmid pJP4 chlorocatechol oxidative (tfdCDEF) operon,
            complete cds; and 2,4-dichlorophenol hydroxylase (tfdB) gene,
            complete cds.
ACCESSION   M35097 X07754
KEYWORDS    2,4-dichlorophenol hydroxylase; chlorocatechol 1,2-dioxygenase;
            chloromuconate cycloisomerase; tdfF gene; tfdE gene.
SOURCE      A.eutrophus (strain JMP134) plasmid pJP4 DNA.
  ORGANISM  Plasmid pJP4
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1598; enum. 1 to 1598)
  AUTHORS   Perkins,E.J.
  JOURNAL   Unpublished (1988) Dept of Biochem,Univ of Wash.,Seattle,WA 98195
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 339)
  AUTHORS   Perkins,E.J., Bolton,G.W., Gordon,M.P. and Lurquin,P.F.
  TITLE     Partial nucleotide sequence of the chlorocatechol degradative
            operon tfdCDEF of pJP4 and similarity to promoters of the
            chlorinated aromatic degradative operons tfdA and clcABD
  JOURNAL   Nucleic Acids Res. 16, 7200 (1988)
  STANDARD  simple automatic
REFERENCE   3  (bases 1 to 7956)
  AUTHORS   Perkins,E.J., Gordon,M.P., Caceres,O. and Lurquin,P.F.
  TITLE     Organization and sequence analysis of the 2,4-dichlorophenol
            hydroxylase and dichlorocatechol oxidative operons of plasmid pJP4
  JOURNAL   J. Bacteriol. 172, 2351-2359 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        337     1104     chlorocatechol 1,2-dioxygenase (tfdC)
    pept       1101     2213     chloromuconate cycloisomerase (tfdD)
    pept       2288     2992     tfdE product
    pept       2989     4053     tfdF product
    pept       4398     6194     2,4-dichlorophenol hydroxylase (tfdB)
    conflict    681      682     gg in [3]; cc in [1]
BASE COUNT     1716 a   2209 c   2403 g   1628 t
ORIGIN      
        1 aagctttgcc ccaatatcgc gttccaaggc ctggatctga cgagtaagcg gtggctgcga
       61 tatgtgcatc ctctgtgccg ctgttccgaa acctcccgcc tccgcgaccg cgacgaagta
      121 tttcaactgt cttatttcca tgttccgtcc cgttgcgctg cctgttacaa ggcgattgtg
      181 agcaagcacc tgcggtctcg caactaggga aagacatacc gaaaccgtat ggtctaggcg
      241 gagaaaaggt attggacggt atgggattgc cgtctctatt atgactgtgc ggggcaattg
      301 tcgcccacgt gttcatgttt tcatgacgga ggcaaagtga acaaaagagt caaggatgtt
      361 gtcgatgcga tcgtcgcggc ggtacagcgg gtgctcgacc agaaggaggt tactgaggcg
      421 gaataccgaa ctgcggttca ttacctcatg caagtagccg aacagcgtga aaccgctctt
      481 ctttgcgatg tttttttcaa tagcaccgtg gctgcgacga aggctcgtat tagcgaaggt
      541 tcaacgcctg ctatcgaggg accctactat cgcgacgacg cccctctggt cgatgaccgg
      601 ctcaagactt acgacacgga cgaccacaag cctttgctta tccagggaac ggtcaaagcg
      661 gtcgacggga gcgttgtcga ggacgtgacg attgatgtct ggcattcgac gcccgatggg
      721 aagtacagcg gtttccatga cgacatcccg actgattttt atcgagggaa gctcagggtg
      781 ggcaccgatg gcagcttccg cgtgcgcaca acgatgccgg tgccgtatca gatcccggat
      841 cagggtccca cgggcgcatt gctcgaaacc atgggtggtc actcgtggcg tcccgctcat
      901 gtacatttca aggtgaaggc gccgggatat gaaacgttga ccacgcagta ctacttcgaa
      961 ggtggagatt ggatcacgga cgactgttgc aacggcgtcc agtctagtct gattactccc
     1021 gatattgtgg aagagggtgt ccgcctgatg aacatcaact tcgttatcga gcccgcgcgt
     1081 gcgcaggcgg gggcaaaccc gtgaagattg acgcgattga agcagtgatc gtggatgtgc
     1141 cgaccaagcg gccgatccag atgtcgatca ctaccgtgca ccagcagagc tacgttatcg
     1201 tccgggtgta ttcggagggg ctcgttggtg tcggcgaggg tggaagcgtt ggtggtcccg
     1261 tctggagcgc agagtgtgcg gagacgatca agatcatcgt ggaacggtat ctcgcgcccc
     1321 acctcctcgg aactgatgcg ttcaacgttt caggtgcact gcaaaccatg gcgcgtgccg
     1381 tcaccggaaa cgcctctgca aaggctgcgg tcgagatggc gttactggat ctcaaagctc
     1441 gagcgttagg cgtatcgatc gccgagttac ttggcgggcc gttgcgcagt gcgattccga
     1501 ttgcctggac attggcgagc ggagatacga aacgcgatct cgattctgcc gtcgagatga
     1561 ttgaaagacg acgacacaat cgcttcaaag tcaagcttgg cttccggtcg ccccaagacg
     1621 atctcatcca tatggaggct ttgtcaaata gtctcggatc gaaggcctac cttcgcgttg
     1681 acgtgaatca ggcttgggac gagcaagtgg cgtccgtcta cattcctgaa ctggaggcgc
     1741 ttggcgtgga actcatcgaa cagccggtcg gccgcgaaaa tacgcaagcg ttgaggcggc
     1801 tctccgacaa caaccgcgtg gccatcatgg ccgatgagag cctgagtacg ttggcctcgg
     1861 cattcgatct cgcgcgcgac cgaagtgtgg atgtcttttc gctgaagctt tgcaacatgg
     1921 gaggggtctc ggcgacgcaa aagatagcag cggtcgcgga agcaagcggg attgcatcgt
     1981 atggcggaac aatgcttgac tcgacgatcg gcacatcggt tgcacttcag ctctattcta
     2041 cggttccatc gcttccgttc ggttgcgaac tgatcggtcc cttcgtgttg gcagacacgc
     2101 tgagccacga gccactcgag atccgggatt acgaactgca ggttcccact ggcgtaggtc
     2161 acggcatgac gcttgacgag gacaaggtgc gccaatacgc acgcgtcagc taggcaaaac
     2221 gcgcagcgca ggccgcggct gtggatcgcc tcgaatgcaa ctgcgcatgt gtaactggag
     2281 actaaatatg ttatcagacg gcgttgagat cacgtcgcgc tcgggtggtc gctttggtgc
     2341 ctacctcgga aagccgacga cggactccgc acccatcgtc gtgatcgcgc aggagatttt
     2401 cgggatcacg cctttcataa gagaaaccgt ggaatggctc gttggtgcgg gctttggttg
     2461 cgtgtgtccg gatctgtact ggagacaggc gccgaatatc gagcttgatg caaacgtacc
     2521 atcggaacgg gaacaggcgc ttgcgttgtt tcgcgacttt gacatggagg caggagtcaa
     2581 tgacctttca tgcgccattg aatacgctcg tgcgctccct ttctcgaacg gtcgtgtcgc
     2641 tgtagtcggg tactgcctgg gaggtgcgct ggcgtttgat gtggcggcgc gatcattggc
     2701 cgattgctca atcggctatt acggcgtggg acttgaaaaa aaggtctcac tggtgccagc
     2761 tattacacga ccggcgatgt tccacatggg taccaaagat cactatgtca cagaagaggc
     2821 ccgtagcatt ctcgaagagc atttcggtcg aaacaaaaat ctgagtctgc actggtatcc
     2881 agtggggcat tcatttgcac ggtcatccag cccaaatttc gatcaggcgg cgacaactgt
     2941 ggccaatgct cggacgctcg aactgcttgc aatgttgaag gacccttcat gaagaagttc
     3001 acgcttgact acctgagccc gagggtcgtc ttcggggcgg gcactgcttc tgcattgcca
     3061 gatgaaatag gacgccttgg cgcacgccgg cccttggtat taagcagccc ggaacaacgc
     3121 gagttagcga aggatatcgt ccgtccgata ggtgacaggg tagctggata tttcgatggc
     3181 gcgacgatgc atgttcccgt cgacgtcatc cagaaagccg agcgggcttt taacgatact
     3241 gacgccgact caatcatcgc gatcggggga ggatcgacca ccggactcgc aaaaatcctt
     3301 tcgatgaacc ttgacgtccc aagtctggtt ataccaacga cctatgccgg tagtgaaatg
     3361 actaccattt ggggtgtcac ggaaggcgga atgaagagga ccggccgcga ccccaaggtg
     3421 ctaccgaaga ccgtgattta tgatccattg ctcacggtcg atttgccgct tgctatctcg
     3481 gtgacgagcg ccttgaatgc gatcgctcac gccgcagaag gtctgtactc ggccgacctc
     3541 aatcccgttc tcgagaccat gtgtaagcag ggcatatgcg ccttgttcga tgcaatcccg
     3601 cgcctggtgg caaagccgac tgacgccgaa gcgcgtacgg atgccctttt tggggcatgg
     3661 atgtgtggca ctgcactgtg ccacttgggc atggggctac atcacaaact ctgccacacg
     3721 cttgggggaa cccttaatct tccccacgcg gagacacatg caatcgtact accacacgca
     3781 ctggcataca atctgccgta cgccgcgcca gctgagcgac tgcttcagga agtcgccggc
     3841 agtagtgacg tcccgagcgc gctatatgat ctcgccagaa atgctggagc accactcagt
     3901 ctcgccgaaa tcggtatgcg gcctgaagat attccgaggg tacgcgacct cgcgctaagg
     3961 gaccaatatc cgaatccgcg tccgctggaa tcggacgcat tggaaacatt gttagtcaat
     4021 gcgtttcgtg ggcgaagacc ggatttcaaa taatgtgacc tgcactccgc gtttagtacg
     4081 gtagcggtga agagcccgtt ccaaagtcga catcgggtct tcaagctaaa gcgaagcaca
     4141 tgaatttgca tccactgttc cttgtggagc atcgccagcc ttcgcgcgaa aaaggccgac
     4201 tgtactggag gtggctcgga ttttctttgc gcgaagtggc tcatttttac tttgcgcgca
     4261 acaaaggctc ttcagcctta ctattcatcg cgcgttaaag acggtgaagt ttaaaaaatc
     4321 gacttggccg aacgcgagat gaacaatccc tgtttcttct cacaaagaga gagacataac
     4381 aattgacgga ggaagacatg gcattgacga tcgaaaccga cgtgctggtc gttggcaccg
     4441 ggccggcggg cgcttcggcg ggggcactgc ttgcgcgata cggtgtcagg acgatgctca
     4501 ttaacaagta caactggacg gctccgacgc cgcgtgccca catcaccaac caacgcacaa
     4561 tggagatcct gcgtgatctc ggtctcgaag ctgaagcccg cctctacgcc gcgccgaatg
     4621 acctaatggg cgagaacacc atctgcgcct cgttggccgg cgaggagttc ggtcggattc
     4681 gcacctgggg gacggatgtt cgccggcgcg ccgattacga cgaatgttcg ccgacctcca
     4741 tgtgcgacct gccacagaac tatctcgagc cgattctcgt caagtcggcc gcactcgacg
     4801 gctgcaaggt gcgcttcgac acagagtatc ttggacacga acaggatgct gacggcgtgt
     4861 cgtcgcgctt gcgcgatcgt ctgaatggcg aagagtttac cgtccggtcg aaatatctga
     4921 tcggcgccga cggcgccaac agccgcgtcg tgtcagacct ggacctgccg ctcgaaggta
     4981 ccatgggcaa gtccggttcc atcaacttgc tgttcgaggc tgatctcgac cgctacgtcg
     5041 cgcatcgtcc atcggtcctt tactgggtca tccagcccgg ctccgatatc ggcggtctcg
     5101 gcatcggcgt ggtgcgcatg gtccggccgt ggaacaaatg gcttgcgatc tgggggtacg
     5161 acgtcgagca gggcccgccg gagatctcgg aatccttcgc ccgccgcatc gtccacaatc
     5221 tgatcggcga tgacagcgtg ccgctgaaaa ttgagggcat ctcgacctgg acggtcaacg
     5281 acatgtacgc gacccggttg cagcaaggac gggtcttctg cgccggcgac gcggtccacc
     5341 ggcatccgcc gaccaacgga ctcggctcaa acacctccat ccaggacagc ttcaatctcg
     5401 cctggaagat cgccatggtg ctgaacggca cggccgacga gagcctgctg gacacctaca
     5461 cgatcgagcg ggcgccgatc gcgaagcagg tggtctgcag ggcgaacaag tcgcttgagg
     5521 atttcccgcc gatcgcgatg gctctggggt tgccgcaggc caagagcgcc gacgagatga
     5581 agagcaacat ggctcgccgc aaggaacctg ggcccgaggc gcaagcgcag cgcacgcggt
     5641 tacgcgaggc catcgccggg accaattacg tttacaacgc ccacggcgtc gaaatgaatc
     5701 agcgttacga cagtcccgcg atcgtggcag acaattcgcc ggacgaagtc ttccgcgacg
     5761 tcgagctcta tcaccaggcg tcgacgcggc ctggagcgcc catgccgcat gtctgggtct
     5821 atgcgagtgg tgatggacac cgtatctcaa caaaggatct ctgcgggaag ggcaacttca
     5881 ctctctttac cgggattggc ggcgcagcct ggcaggatgc cgcggcggcc gtgagccggc
     5941 aactcggggt ggcagtgacg gtgcgcatca ttgggcctgg ccaggcctac gaggatcatt
     6001 acggcgactt cgcgagaatt tccgagatca tcgacaccgg cgccatcctg gtgcggccgg
     6061 atttccacgt cgcctatcgg gcgacgtcct tgccggccga tgcggcaggc gacctcgtgt
     6121 ccgccatgag gcggatcctc ggacgccagt cagagaggtc gagcgcgctg cgcgttacgt
     6181 ctcgagctat ataaggagtg gcgtcgcgat ctattcgagg tgagacggaa gcgctcacct
     6241 cgggttgcgg gaggggaagc gtgatcttgc gcagcgtgag atggcccgtg gcgacgcggg
     6301 aaggttgtcg cgcgattaag gcaaggcggt gcgcaacgac gtgacgtgtt ttctggaccg
     6361 gacgtaactg cagatcatct aatttgggaa gttcgtaccg gtgtcgaccg ccttgctgat
     6421 gatgctatta agcacgtgag cagacgacgg gccggtgcag ccaacgttga aaaaagagcg
     6481 cttgattgca atgtcgacga cacgatctac attactgtca tttgaacgac cgcttgctcg
     6541 tgcggattca accggtggat gcaacagttt gctgaaatcg ttcggcgggc gttccgtggt
     6601 tcagggtttt cctcggtcgc ccatccagcc gcctcgccac ggtattgagt agccaattgg
     6661 tacaccgaaa tgtaggtgcc gttggggatg tctcgatagc ccgttcgcat tttcattcga
     6721 cccgcgctgc caaggatgct gggggttcag taactgcagg ctgaactgcg ccaggtgaac
     6781 caaacgctga tcgtcaagca aaatgaaata acccagttga acagagaaag cgcgcggctc
     6841 gtcgccgaag ccggcgtcgc gacaaggcat ctacgcgagc tgcaagcgca cagcaagtag
     6901 ttacagaata cgttgaacca gtcgctcgcc gaccacgcac gcaccaaggc cgagcacgat
     6961 gcgctgctaa caccgcgagg actgaggcga ctgaactgac gcggctgcgc gaagcacgtg
     7021 aatcgctgac cgccgatctg acgaagctca cagcgcagtt cgacgagcag cagttgctcg
     7081 gggactatcg gatgcgtctg ggcctcacgg acgcggcggg atgaaagcgc cagattccgc
     7141 atcaaaagga tggggatcta cgccggaacc ccagaaaatt ccgtcgttcc acattgtgaa
     7201 cattaatcgg cttattccag ccgaagaatg ttggagcgac gtccggtgag tgacatgacg
     7261 gtctcggtca cacggaccat caggaccgtt acgcggggcc cagttcgatt acattgtggg
     7321 acaggtcgag acaatcttcg cagtcccggc aaacccagtg cagcatcggc ccgaccggat
     7381 ctggatggtc gctgaacgtt gtcagcgccg ggtcgtacta gcatcggacg tgctcgagcg
     7441 cagtatccgc tcgagattag actcgatgcg gtccaggctg cattcaccga tgccggactt
     7501 gaacaccgtc acaccgcctg acgggttgac cgcgaagaag aaaaacgcct tcgtgtcgac
     7561 catgtgcgac aggacttgtc ctggcgatga agttgctcgg attgatccag cgcattgaag
     7621 atgcggtagc cgtaaaactc taagccgaag tgtacgacct cgaagtcgtc ggtgccgcgg
     7681 atcgagtggc caaagttgaa accgctgcgt gttacgtcgt tcggaatgtc atccaccggc
     7741 cgcagtgcca ggcaagcata gtgcgtctgg ccctcggcca cgagtcccgt cgcaaagcag
     7801 ggcgcgcgct cggccagcaa ggcctcgaca aaattccggt tcatgcggag cgttggcagc
     7861 atcgggcatc cttcatcgtg aatcaaatca acatttgccg acctggctgt gcggtgcgtc
     7921 gacgtcgaac atcaaaggct atacatcgtc gagctc
//
LOCUS       ECOCLPAA     3380 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  E.coli ATP-dependent C1p protease (clpA) gene, complete cds.
ACCESSION   M31045
KEYWORDS    C1p protease; clpA gene.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3380)
  AUTHORS   Gottesman,S., Clark,W.P. and Maurizi,M.R.
  TITLE     The ATP-dependent Clp protease of Escherichia coli: Sequence of
            ClpA and identification of a Clp-specific substrate
  JOURNAL   J. Biol. Chem. 265, 7886-7893 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Gottesman, 03-JAN-1990.
FEATURES       from  to/span     description
    pept       1000     3276     ClpA protein
    signal      779      784     -35 region
    signal      805      810     -10 region
    binding     990      994     ribosome binding site
    binding     214      221     ATP binding site
    binding     495      502     ATP binding site
BASE COUNT      877 a    759 c    943 g    801 t
ORIGIN      1 bp upstream of BamHI site; 19 min. on K12 map.
        1 ggatccagta gatgctctgg cacgagagta atttttgaga taaaaatcga gccgatcggg
       61 cggcattttg cctttaggat gtacacaatg agacagaaga gctatgcgac tgccgcttct
      121 acttcgacgg gcacaataac actggcgtga ttgccttttg gcccctggtg gacatcaaac
      181 tgaacggatt gtccagcttt tagcgttctg taaccatcca tctgaatggt ggaataatga
      241 gcgaaaatat cttcgccgcc gccttcaggg cagatgaaac caaacccttt ggcattgttg
      301 aaccacttaa cagtaccctt ttccatgctt cgacatcctt cgcaaatctt atacaagtaa
      361 gatggaataa accggggtca gagagggggc tgttcaaaac ctcgccaact ctagaaatac
      421 aatttagaga attagggcga gccgtcaagc atttgacagg ggacaagggg caggtatgaa
      481 tcaaaaattt gaagcagtta acgctattga caggaatgtg acagatgtcg ctgatgccaa
      541 cgatagatga tagttatcta tcatgtggag tagattggtc aggcaaataa gctcttgtca
      601 gcggcagggc gttctgccga taaccgtaac cgaagatgat aactgacaat gggtaaaacg
      661 aacgactggc tggactttga tcaactggcg gaagaaaaag ttcgcgacgc gctaaaaccg
      721 ccatctatgt ataaagtgat attagtcaat gatgattaca ctccgatgga gtttgttatt
      781 gacgtgttac aaaaattctt ttcttatgat gtagaacgtg caacgcaatt gatgctcgct
      841 gttcactacc aggggaaggc catttgcgga gtctttaccg ccgaggttgc agaaaccaaa
      901 gtggcgatgg tgaacaagta cgcgagggag aatgagcatc cattgctgtg tacgctagaa
      961 aaagcctgaa tgcaggcata aaaattgggg gaggtgccta tgctcaatca agaactggaa
     1021 ctcagtttaa atatggcttt cgccagagcg cgcgagcacc gtcatgagtt tatgaccgtc
     1081 gagcacttgt tactggcgct gctcagtaac ccatctgccc gggaggcgct ggaagcgtgt
     1141 tctgtggatt tggttgcgct ccgtcaggaa ctggaagcct ttattgaaca aaccacaccc
     1201 gttctgcctg ccagtgaaga ggagcgcgac acacagccga cgctgagttt tcagcgtgta
     1261 ctgcaacgtg cggtcttcca tgtccagtcc tccggtcgca atgaggttac cggtgcaaac
     1321 gttctggtcg ctatctttag cgaacaggag tcgcaggcgg catatctgtt gcgtaaacac
     1381 gaagtcagcc gtctcgatgt ggtgaacttt atctctcatg gcacgcgtaa agacgagccg
     1441 acacagtctt ctgatcctgg cagccagcca aacagcgaag aacaagctgg tggggaggaa
     1501 cgtatggaga atttcacgac gaacctgaat cagcttgcgc gcgtgggcgg aatcgaccca
     1561 ctgattggtc gtgagaagga gctggagcgt gctattcagg ttctctgccg tcgccgtaaa
     1621 aacaacccgc tgctggtggg ggaatctggt gtcggtaaaa ccgcgattgc ggaaggtctt
     1681 gcctggcgaa ttgttcaggg cgatgtgccg gaagtgatgg ctgactgtac gatttactct
     1741 ctcgatatcg gttctctgtt agcgggcaca aaatatcgcg gcgactttga aaaacgtttt
     1801 aaagcgttgc tcaagcagct ggagcaggac actaacagca tcctgtttat tgatgagatc
     1861 cacaccatta tcggtgcggg tgcagcgtct ggtggtcagg tcgatgcggc taacctaatc
     1921 aaaccgttgc tctccagcgg taaaattcgt gtaattggtt cgacaaccta tcaggagttc
     1981 agcaacattt tcgagaaaga ccgtgctctg gcgcgtcgct tccagaaaat tgatattact
     2041 gaaccgtcga tcgaagaaac tgttcaaatc atcaatggcc tgaaaccgaa gtatgaaggc
     2101 caccacgacg tgcgttatac cgcaaaagcg gtgcgtgcgg cggtagagct ggcggtgaaa
     2161 tacattaacg atcgtcatct gccggataaa gccattgatg ttatcgacga agcgggcgct
     2221 cgcgcacgcg tgatgccggt aagcaaacgc aagaaaaccg ttaatgtggc ggatattgag
     2281 tccgtggtgg cccgtattgc acgcattcca gagaagagtg tttctcagag tgatcgtgat
     2341 accctgaaaa acctcggcga tcgcttgaaa atgctggtct tcggtcagga taaagccatt
     2401 gaggcgctga ctgaagccat taagatggcg cgtgcaggtt taggtcacga acataaaccg
     2461 gttggttcgt tcctgtttgc cggccctacc ggggtcggga aaacagaggt gacggtacag
     2521 ctttcgaaag ctttgggcat tgagcttctg cgctttgata tgtccgagta tatggaacgc
     2581 cataccgtca gccgtgttat tggtgcgcct ccgggatacg ttggttttga tcagggcggt
     2641 ttgctgactg atgcggtcat caagcatcca catgcggtgc tgctgctgga cgaaatcgag
     2701 aaagcgcacc cggacgtgtt caatattctg ttgcaggtga tggataacgg tacgctgacc
     2761 gataacaacg gacgcaaagc agacttccgt aacgtggtgc tggtgatgac caccaacgcc
     2821 ggggtacggg aaactgagcg caaatccatt ggtcttatcc accaggataa cagcaccgat
     2881 gcgatggaag agatcaagaa gatctttaca ccggaattcc gtaaccgtct cgacaacatt
     2941 atctggtttg atcatctgtc aaccgacgtg atccatcagg tggtggataa attcatcgtc
     3001 gagttgcagg ttcagctgga tcagaaaggt gtttctctgg aagtgagcca ggaagcgcgt
     3061 aactggctgg ccgagaaagg ttacgaccgg gcaatgggcg ctcgtccgat ggcgcgtgtc
     3121 atccaggaca acctgaaaaa accgctcgcc aacgaactgc tgtttggttc gctggtggac
     3181 ggcggtcagg tcaccgtcgc gctggataaa gagaaaaatg agctgactta cggattccag
     3241 agtgcacaaa agcacaaggc ggaagcagcg cattaatctg attgttaggt aggttggtca
     3301 agtccgtaat ctcgaaagag gttacggact ttttgtttat ggggtggagg agttcagacc
     3361 ctttttttat gatgatggta
//
LOCUS       HUMCSAE      1372 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human epithelial glycoprotein (EGP) mRNA, complete cds.
ACCESSION   M32306
KEYWORDS    cell surface antigen; cell surface glycoprotein;
            epithelial glycoprotein; transmembrane protein;
            tumor-associated antigen.
SOURCE      Human colon carcinoma cell line HT-29, cDNA to mRNA, (library
            HT-29/cdm8).
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1372)
  AUTHORS   Simon,B., Podolsky,D.K., Moldenhauer,G., Isselbacher,K.J.,
            Gattoni-Celli,S. and Brand,S.J.
  TITLE     Epithelial glycoprotein is a member of a family of epithelial cell
            surface antigens homologous to nidogen, a matrix adhesion protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2755-2759 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            B.Simon, 23-FEB-1990.
FEATURES       from  to/span     description
    pept         22      966     epithelial glycoprotein (EGP) precursor
    sigp         22       84     epithelial glycoprotein signal peptide
    matp         85      963     epithelial glycoprotein
    mRNA    <     1     1372     EGP mRNA
BASE COUNT      436 a    233 c    317 g    386 t
ORIGIN      
        1 gctctaaagg cgcgcgcagg catggcgccc ccgcaggtcc tcgcgttcgg gcttctgctt
       61 gccgcggcga cggcgacttt tgccgcagct caggaagaat gtgtctgtga aaactacaag
      121 ctggccgtaa actgctttgt gaataataat cgtcaatgcc agtgtacttc agttggtgca
      181 caaaatactg tcatttgctc aaagctggct gccaaatgtt tggtgatgaa ggcagaaatg
      241 aatggctcaa aacttgggag aagagcaaaa cctgaagggg ccctccagaa caatgatggg
      301 ctttatgatc ctgactgcga tgagagcggg ctctttaagg ccaagcagtg caacggcacc
      361 tccatgtgct ggtgtgtgaa cactgctggg gtcagaagaa cagacaagga cactgaaata
      421 acctgctctg agcgagtgag aacctactgg atcatcattg aactaaaaca caaagcaaga
      481 gaaaaacctt atgatagtaa aagtttgcgg actgcacttc agaaggagat cacaacgcgt
      541 tatcaactgg atccaaaatt tatcacgagt attttgtatg agaataatgt tatcactatt
      601 gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc tgatgtggct
      661 tattattttg aaaaagatgt taaaggtgaa tccttgtttc attctaagaa aatggacctg
      721 acagtaaatg gggaacaact ggatctggat cctggtcaaa ctttaattta ttatgttgat
      781 gaaaaagcac ctgaattctc aatgcagggt ctaaaagctg gtgttattgc tgttattgtg
      841 gttgtggtga tagcagttgt tgctggaatt gttgtgctgg ttatttccag aaagaagaga
      901 atggcaaagt atgagaaggc tgagataaag gagatgggtg agatgcatag ggaactcaat
      961 gcataactat ataatttgaa gattatagaa gaagggaaat agcaaatgga cacaaattac
     1021 aaatgtgtgt gcgtgggacg aagacatctt tgaaggtcat gagtttgtta gtttaacatc
     1081 atatatttgt aatagtaaag cctgtactca aaatataagc agcttgaaac tggctttacc
     1141 aatcttgaaa tttgaccaca agtgtcttat atatgcagat ctaatgtaaa atccagaact
     1201 tggactccat cgttaaaatt atttatgtgt aacattcaaa tgtgtgcatt aaatatgctt
     1261 ccacagtaaa atctgaaaaa ctgatttgtg attgaaagct gcctttctat ttacttgagt
     1321 cttgtacata catacttttt tatgagctat gaaataaaac cattttaaac tg
//
LOCUS       HUMMHCAD     1093 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human MHC HLA protein, allele B7, complete cds.
ACCESSION   M32317
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human blood B-cell cell line JY cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1093)
  AUTHORS   Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
  TITLE     Rapid cloning of HLA-A,B cDNA by using the polymerase chain
            reaction: Frequency and nature of errors produced in amplification
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence [1] kindly submitted by E.Warren,
            22-FEB-1990.
FEATURES       from  to/span     description
    pept          1     1089     HLA protein allele B7
    sigp          1       72     HLA protein signal peptide
    matp         73     1086     HLA protein mature peptide
BASE COUNT      221 a    335 c    364 g    173 t
ORIGIN      
        1 atgctggtca tggcgccccg aaccgtcctc ctgctgctct cggcggccct ggccctgacc
       61 gagacctggg ccggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc
      121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc
      181 gacagcgacg ccgcgagtcc gagagaggag ccgcgggcgc cgtggataga gcaggagggg
      241 ccggagtatt gggaccggaa cacacagatc tacaaggccc aggcacagac tgaccgagag
      301 agcctgcgga acctgcgcgg ctactacaac cagagcgagg ccgggtctca caccctccag
      361 agcatgtacg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtac
      421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcctg gaccgccgcg
      481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagcgg
      541 agagcctacc tggagggcga gtgcgtggag tggctccgca gatacctgga gaacgggaag
      601 gacaagctgg agcgcgctga ccccccaaag acacacgtga cccaccaccc catctctgac
      661 catgaggcca ccctgaggtg ctgggccctg ggtttctacc ctgcggagat cacactgacc
      721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca
      781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
      841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagccg
      901 tcttcccagt ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
      961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagttc aggtggaaaa
     1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc
     1081 acagcttgaa aag
//
LOCUS       HUMMHCAE     1089 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human MHC HLA protein, allele B57, complete cds.
ACCESSION   M32318
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human blood B-cell cell line WIN and cell line MOC cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1089)
  AUTHORS   Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
  TITLE     Rapid cloning of HLA-A,B cDNA by using the polymerase chain
            reaction: Frequency and nature of errors produced in amplification
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence [1] kindly submitted by E.Warren,
            22-FEB-1990.
FEATURES       from  to/span     description
    pept          1     1089     HLA protein allele B57
    sigp          1       72     HLA protein signal peptide
    matp         73     1086     HLA protein mature peptide
BASE COUNT      217 a    331 c    366 g    175 t
ORIGIN      
        1 atgcgggtca cggcaccccg aaccgtcctc ctgctgctct ggggggcagt ggccctgacc
       61 gagacctggg ccggctccca ctccatgagg tatttctaca ccgccatgtc ccggcccggc
      121 cgcggggagc cccgcttcat cgcagtgggc tacgtggacg acacccagtt cgtgaggttc
      181 gacagcgacg ccgcgagtcc gaggatggcg ccccgggcgc catggataga gcaggagggg
      241 ccggagtatt gggacgggga gacacggaac atgaaggcct ccgcgcagac ttaccgagag
      301 aacctgcgga tcgcgctccg ctactacaac cagagcgagg ccgggtctca catcatccag
      361 gtgatgtatg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtcc
      421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg
      481 gacacggcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgtggc ggagcagctg
      541 agagcctacc tggagggcct gtgcgtggag tggctccgca gatacctgga gaacgggaag
      601 gagacgctgc agcgcgcgga ccccccaaag acacatgtga cccaccaccc catctctgac
      661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc
      721 tggcagcggg atggcgagga ccaaactcag gacaccgagc ttgtggagac cagaccagca
      781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
      841 tacacatgcc atgtacagca tgaggggctg ccaaagcccc tcaccctgag atgggagcca
      901 tcttcccaat ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
      961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagctc aggtggaaaa
     1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc
     1081 acagcttga
//
LOCUS       HUMMHCAF     1093 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human MHC HLA protein, allele B51, complete cds.
ACCESSION   M32319
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human blood B-cell cell line BM92 cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1093)
  AUTHORS   Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
  TITLE     Rapid cloning of HLA-A,B cDNA by using the polymerase chain
            reaction: Frequency and nature of errors produced in amplification
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence [1] kindly submitted by E.Warren,
            22-FEB-1990.
FEATURES       from  to/span     description
    pept          1     1089     HLA protein allele B51
    sigp          1       72     HLA protein signal peptide
    matp         73     1086     HLA protein mature peptide
BASE COUNT      228 a    334 c    358 g    173 t
ORIGIN      
        1 atgcgggtca cggcgccccg aaccgtcctc ctgctgctct ggggggcagt ggccctgacc
       61 gagacctggg ccggctccca ctccatgagg tatttctaca ccgccatgtc ccggcccggc
      121 cgcggggagc cccgcttcat tgcagtgggc tacgtggacg acacccagtt cgtgaggttc
      181 gacagcgacg ccgcgagtcc gaggacggag ccccgggcgc catggataga gcaggagggg
      241 ccggagtatt gggaccggaa cacacagatc ttcaagacca acacacagac ttaccgagag
      301 aacctgcgga tcgcgctccg ctactacaac cagagcgagg ccgggtctca cacttggcag
      361 acgatgtatg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca taaccagtac
      421 gcctacgacg gcaaagatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg
      481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagctg
      541 agagcctacc tggagggcct gtgcgtggag tggctccgca gacacctgga gaacgggaag
      601 gagacgctgc agcgcgcgga ccccccaaag acacacgtga cccaccaccc cgtctctgac
      661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc
      721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca
      781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
      841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagcca
      901 tcttcccagt ccaccatccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
      961 gtggtcatcg gagctgtggt cgctactgtg atgtgtagga ggaagagctc aggtggaaaa
     1021 ggagggagct actctcaggc tgcgtccagc gacagtgccc agggctctga tgtgtctctc
     1081 acagcttgaa aag
//
LOCUS       HUMMHCAG     1093 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human MHC HLA protein, allele B37, complete cds.
ACCESSION   M32320
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human blood B-cell cell line KASO and cell line MG cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1093)
  AUTHORS   Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
  TITLE     Rapid cloning of HLA-A,B cDNA by using the polymerase chain
            reaction: Frequency and nature of errors produced in amplification
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence [1] kindly submitted by E.Warren,
            22-FEB-1990.
FEATURES       from  to/span     description
    pept          1     1089     HLA protein allele B37
    sigp          1       72     HLA protein signal peptide
    matp         73     1086     HLA protein mature peptide
BASE COUNT      224 a    337 c    358 g    174 t
ORIGIN      
        1 atgcgggtca cggcgccccg aaccctcctc ctgctgctct ggggggcagt ggccctgacc
       61 gagacctggg ctggctccca ctccatgagg tatttccaca cctccgtgtc ccggcccggc
      121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc
      181 gacagcgacg ccgcgagtcc gaggacggag ccccgggcgc cgtggataga gcaggagggg
      241 ccggagtatt gggaccggga gacacagatc tccaagacca acacacagac ttaccgagag
      301 gacctgcgga ccctgctccg ctactacaac cagagcgagg ccgggtctca caccatccag
      361 aggatgtctg gctgcgacgt ggggccggac gggcgcctcc tccgcgggta taaccagttc
      421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg
      481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgtggc ggagcaggac
      541 agagcctacc tggagggcac gtgcgtggag tggctccgca gatacctgga gaacgggaag
      601 gagacgctgc agcgcgcgga ccccccaaag acacatgtga cccaccaccc catctctgac
      661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc
      721 tggcagcggg atggcgagga ccaaactcag gacaccgagc ttgtggagac cagaccagca
      781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
      841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagcca
      901 tcttcccagt ccaccatccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
      961 gtggtcatcg gagctgtggt cgctactgtg atgtgtagga ggaagagctc aggtggaaaa
     1021 ggagggagct actctcaggc tgcgtccagc gacagtgccc agggctctga tgtgtctctc
     1081 acagcttgaa aag
//
LOCUS       HUMMHCAH     1098 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human MHC HLA protein, allele A25, complete cds.
ACCESSION   M32321
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human blood B-cell cell line BM92 cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1098)
  AUTHORS   Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
  TITLE     Rapid cloning of HLA-A,B cDNA by using the polymerase chain
            reaction: Frequency and nature of errors produced in amplification
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence [1] kindly submitted by E.Warren,
            22-FEB-1990.
FEATURES       from  to/span     description
    pept          1     1098     HLA protein allele A25
    sigp          1       72     HLA protein signal peptide
    matp         73     1095     HLA protein mature peptide
BASE COUNT      216 a    329 c    372 g    181 t
ORIGIN      
        1 atggccgtca tggcgccccg aaccctcgtc ctgctactct cgggggccct ggccctgacc
       61 cagacctggg cgggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc
      121 cgcggggagc cccgcttcat cgccgtgggc tacgtggacg acacgcagtt cgtgcggttc
      181 gacagcgacg ccgcgagcca gaggatggag ccgcgggcgc cgtggataga gcaggagggg
      241 ccggagtatt gggaccggaa cacacggaat gtgaaggccc actcacagac tgaccgagag
      301 agcctgcgga tcgcgctccg ctactacaac cagagcgagg acggttctca caccatccag
      361 aggatgtatg gctgcgacgt ggggccggac gggcgcttcc tccgcgggta ccagcaggac
      421 gcttacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcttg gaccgcggcg
      481 gacatggcgg ctcagatcac ccagcgcaag tgggagacgg cccatgaggc ggagcagtgg
      541 agagcctacc tggagggccg gtgcgtggag tggctccgca gatacctgga gaacgggaag
      601 gagacgctgc agcgcacgga cgcccccaag acgcatatga ctcaccacgc tgtctctgac
      661 catgaggcca ccctgaggtg ctgggccctg agcttctacc ctgcggagat cacactgacc
      721 tggcagcggg atggggagga ccagacccag gacacggagc tcgtggagac caggcctgca
      781 ggggatggga ccttccagaa gtgggcgtct gtggtggtgc cttctggaca ggagcagaga
      841 tacacctgcc atgtgcagca tgagggtctg cccaagcccc tcaccctgag atgggagccg
      901 tcttcccagc ccaccatccc catcgtgggc atcattgctg gcctggttct ctttggagct
      961 gtgatcgctg gagctgtggt cgctgctgtg atgtggagga ggaagagctc agatagaaaa
     1021 ggagggagct actctcaggc tgcaagcagt gacagtgccc agggctctga tatgtctctc
     1081 acagcttgta aagtgtga
//
LOCUS       HUMMHCAI     1098 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human MHC HLA protein, allele A2, complete cds.
ACCESSION   M32322
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human blood B-cell cell line JY cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1098)
  AUTHORS   Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P.
  TITLE     Rapid cloning of HLA-A,B cDNA by using the polymerase chain
            reaction: Frequency and nature of errors produced in amplification
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence [1] kindly submitted by E.Warren,
            22-FEB-1990.
FEATURES       from  to/span     description
    pept          1     1098     HLA protein allele A2
    sigp          1       72     HLA protein signal peptide
    matp         73     1095     HLA protein mature peptide
BASE COUNT      220 a    323 c    368 g    187 t
ORIGIN      
        1 atggccgtca tggcgccccg aaccctcgtc ctgctactct cgggggctct ggccctgacc
       61 cagacctggg cgggctctca ctccatgagg tatttcttca catccgtgtc ccggcccggc
      121 cgcggggagc cccgcttcat cgcagtgggc tacgtggacg acacgcagtt cgtgcggttc
      181 gacagcgacg ccgcgagcca gaggatggag ccgcgggcgc cgtggataga gcaggagggt
      241 ccggagtatt gggacgggga gacacggaaa gtgaaggccc actcacagac tcaccgagtg
      301 gacctgggga ccctgcgcgg ctactacaac cagagcgagg ccggttctca caccgtccag
      361 aggatgtatg gctgcgacgt ggggtcggac tggcgcttcc tccgcgggta ccaccagtac
      421 gcctacgacg gcaaggatta catcgccctg aaagaggacc tgcgctcttg gaccgcggcg
      481 gacatggcag ctcagaccac caagcacaag tgggaggcgg cccatgtggc ggagcagttg
      541 agagcctacc tggagggcac gtgcgtggag tggctccgca gatacctgga gaacgggaag
      601 gagacgctgc agcgcacgga cgcccccaaa acgcatatga ctcaccacgc tgtctctgac
      661 catgaagcca ccctgaggtg ctgggccctg agcttctacc ctgcggagat cacactgacc
      721 tggcagcggg atggggagga ccagacccag gacacggagc tcgtggagac caggcctgca
      781 ggggatggaa ccttccagaa gtgggcggct gtggtggtgc cttctggaca ggagcagaga
      841 tacacctgcc atgtgcagca tgagggtttg cccaagcccc tcaccctgag atgggagccg
      901 tcttcccagc ccaccatccc catcgtgggc atcattgctg gcctggttct ctttggagct
      961 gtgatcactg gagctgtggt cgctgctgtg atgtggagga ggaagagctc agatagaaaa
     1021 ggagggagct actctcaggc tgcaagcagt gacagtgccc agggctctga tgtgtctctc
     1081 acagcttgta aagtgtga
//
LOCUS       HUMNOXF      2206 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human neutrophil oxidase factor (p67-phox) mRNA, complete cds.
ACCESSION   M32011
KEYWORDS    neutrophil oxidase factor.
SOURCE      Human promyelocytic leukemia myeloid cell line HL60, cDNA to mRNA,
            clone 10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2206)
  AUTHORS   Leto,T.L., Lomax,K.J., Volpp,B.D., Nunoi,H., Sechler,J.M.G.,
            Nauseef,W.M., Clark,R.A., Gallin,J.I. and Malech,H.L.
  TITLE     Cloning of a 67kD neutrophil oxidase factor with similarity to a
            noncatalytic region of P60-c-src
  JOURNAL   Science 248, 727-730 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            H.L.Malech, 08-FEB-1990.
FEATURES       from  to/span     description
    pept         68     1648     neutrophil oxidase factor (p67-phox)
    mRNA    <     1     2206     p67-phox
    signal     2030     2035     poly-A signal
    signal     2181     2186     poly-A signal
BASE COUNT      624 a    499 c    573 g    510 t
ORIGIN      
        1 ctagtctttc agccttcagg ctgtttttgg cttgaagctc tcttggcctc ctagtttcta
       61 cctaatcatg tccctggtgg aggccatcag cctctggaat gaaggggtgc tggcagcgga
      121 caagaaggac tggaagggag ccctggatgc cttcagtgcc gtccaggacc cccactcccg
      181 gatttgcttc aacattggct gcatgtacac tatcctgaag aacatgactg aagcagagaa
      241 ggcctttacc agaagcatta accgagacaa gcacttggca gtggcttact tccaacgagg
      301 gatgctctac taccagacag agaaatatga tttggctatc aaagacctta aagaagcctt
      361 gattcagctt cgagggaacc agctgataga ctataagatc ctggggctcc agttcaagct
      421 gtttgcctgt gaggtgttat ataacattgc tttcatgtat gccaagaagg aggaatggaa
      481 aaaagctgaa gaacagttag cattggccac gagcatgaag tctgagccca gacattccaa
      541 aatcgacaag gcgatggagt gtgtctggaa gcagaagcta tatgagccag tggtgatccc
      601 tgtgggcaag ctgtttcgac caaatgagag acaagtggct cagctggcca agaaggatta
      661 cctaggcaag gcgacggtcg tggcatctgt ggtggatcaa gacagtttct ctgggtttgc
      721 ccctctgcaa ccacaggcag ctgagcctcc acccagaccg aaaaccccag agatcttcag
      781 ggctctggaa ggggaggctc accgtgtgct atttgggttt gtgcctgaga caaaagaaga
      841 gctccaggtc atgccaggga acattgtctt tgtcttgaag aagggcaatg ataactgggc
      901 cacggtcatg ttcaacgggc agaaggggct tgttccctgc aactaccttg aaccagttga
      961 gttgcggatc caccctcagc agcagcccca ggaggaaagc tctccgcagt ccgacatccc
     1021 agctcctcct agttccaaag cccctggaaa accccagctg tcaccaggcc agaaacaaaa
     1081 agaagagcct aaggaagtga agctcagtgt tcccatgccc tacacactca aggtgcacta
     1141 caagtacacg gtagtcatga agactcagcc cgggctcccc tacagccagg tccgggacat
     1201 ggtgtctaag aaactggagc tccggctgga acacactaag ctgagctatc ggcctcggga
     1261 cagcaatgag ctggtgcccc tttcagaaga cagcatgaag gatgcctggg gccaggtgaa
     1321 aaactactgc ctgactctgt ggtgtgagaa cacagtgggt gaccaaggct ttccagatga
     1381 acccaaggaa agtgaaaaag ctgatgctaa taaccagaca acagaacctc agcttaagaa
     1441 aggcagccaa gtggaggcac tcttcagtta tgaggctacc caaccagagg acctggagtt
     1501 tcaggaaggg gatataatcc tggtgttatc aaaggtgaat gaagaatggc tggaagggga
     1561 gtgcaaaggg aaggtgggca ttttccccaa agtttttgtt gaagactgcg caactacaga
     1621 tttggaaagc actcggagag aagtctagga tgtttcacaa actacaaagc tgaagaaaat
     1681 gaagccctat tacttgtttg taagatttag cacccttctg ctgtatactg tactgagaca
     1741 ttacagtttg gaagtgttaa ctatttattc cctgttaaaa tttaacctac tagacaatga
     1801 tgtgagtacc caggatgatt tcctggggca cagtgggtga ggagatgggg acaggtgaat
     1861 ggaggagtta ggggagagga aaagtggatg gaagtgtctg gaaagggcac gagagagtct
     1921 tccaggtact gatcctgttt cttgctctga gtgctagcta gccagctgtg ttcacactgt
     1981 aaacattcat caagctgtac atttggtgca cttttctgtg tcataccaca ataaaaaaaa
     2041 acctatcatc atcttacaaa aacaagacac ccaagtccag gcccaaggag taagtacaaa
     2101 tattcctgtt tctgaaccat tactgtaatt ggctcttaag gcttgaagta accttatagg
     2161 ttactcataa ggcatataca aataaacttg tttgttttct tttttc
//
LOCUS       HUMPTPBX     3247 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human phosphotyrosyl-protein phosphatase (PTP-1B) mRNA, complete
            cds.
ACCESSION   M31724
KEYWORDS    phosphotyrosyl-protein phosphatase.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3247)
  AUTHORS   Chernoff,J., Schievella,A.R., Jost,C.A., Erikson,R.L. and Neel,B.G.
  TITLE     Cloning of a cDNA for a major human protein-tyrosine-phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2735-2739 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Chernoff,J., 13-MAR-1990.
FEATURES       from  to/span     description
    pept         91     1398     phosphotyrosyl-protein phosphatase (EC
                                 3.1.3.48)
    mRNA    <     1     3247     phosphotyrosyl-protein phosphatase mRNA
BASE COUNT      836 a    834 c    806 g    771 t
ORIGIN      
        1 gggcgggcct cggggctaag agcgcgacgc ctagagcggc agacggcgca gtgggccgag
       61 aaggaggcgc agcagccgcc ctggcccgtc atggagatgg aaaaggagtt cgagcagatc
      121 gacaagtccg ggagctgggc ggccatttac caggatatcc gacatgaagc cagtgacttc
      181 ccatgtagag tggccaagct tcctaagaac aaaaaccgaa ataggtacag agacgtcagt
      241 ccctttgacc atagtcggat taaactacat caagaagata atgactatat caacgctagt
      301 ttgataaaaa tggaagaagc ccaaaggagt tacattctta cccagggccc tttgcctaac
      361 acatgcggtc acttttggga gatggtgtgg gagcagaaaa gcaggggtgt cgtcatgctc
      421 aacagagtga tggagaaagg ttcgttaaaa tgcgcacaat actggccaca aaaagaagaa
      481 aaagagatga tctttgaaga cacaaatttg aaattaacat tgatctctga agatatcaag
      541 tcatattata cagtgcgaca gctagaattg gaaaacctta caacccaaga aactcgagag
      601 atcttacatt tccactatac cacatggcct gactttggag tccctgaatc accagcctca
      661 ttcttgaact ttcttttcaa agtccgagag tcagggtcac tcagcccgga gcacgggccc
      721 gttgtggtgc actgcagtgc aggcatcggc aggtctggaa ccttctgtct ggctgatacc
      781 tgcctcctgc tgatggacaa gaggaaagac ccttcttccg ttgatatcaa gaaagtgctg
      841 ttagaaatga ggaagtttcg gatggggttg atccagacag ccgaccagct gcgcttctcc
      901 tacctggctg tgatcgaagg tgccaaattc atcatggggg actcttccgt gcaggatcag
      961 tggaaggagc tttcccacga ggacctggag cccccacccg agcatatccc cccacctccc
     1021 cggccaccca aacgaatcct ggagccacac aatgggaaat gcagggagtt cttcccaaat
     1081 caccagtggg tgaaggaaga gacccaggag gataaagact gccccatcaa ggaagaaaaa
     1141 ggaagcccct taaatgccgc accctacggc atcgaaagca tgagtcaaga cactgaagtt
     1201 agaagtcggg tcgtgggggg aagtcttcga ggtgcccagg ctgcctcccc agccaaaggg
     1261 gagccgtcac tgcccgagaa ggacgaggac catgcactga gttactggaa gcccttcctg
     1321 gtcaacatgt gcgtggctac ggtcctcacg gccggcgctt acctctgcta caggttcctg
     1381 ttcaacagca acacatagcc tgaccctcct ccactccacc tccacccact gtccgcctct
     1441 gcccgcagag cccacgcccg actagcaggc atgccgcggt aggtaagggc cgccggaccg
     1501 cgtagagagc cgggccccgg acggacgttg gttctgcact aaaacccatc ttccccggat
     1561 gtgtgtctca cccctcatcc ttttactttt tgccccttcc actttgagta ccaaatccac
     1621 aagccatttt ttgaggagag tgaaagagag taccatgctg gcggcgcaga gggaaggggc
     1681 ctacacccgt cttggggctc gccccaccca gggctccctc ctggagcatc ccaggcggcg
     1741 cacgccaaca gcccccccct tgaatctgca gggagcaact ctccactcca tatttattta
     1801 aacaattttt tccccaaagg catccatagt gcactagcat tttcttgaac caataatgta
     1861 ttaaaatttt ttgatgtcag ccttgcatca agggctttat caaaaagtac aataataaat
     1921 cctcaggtag tactgggaat ggaaggcttt gccatgggcc tgctgcgtca gaccagtact
     1981 gggaaggagg acggttgtaa gcagttgtta tttagtgata ttgtgggtaa cgtgagaaga
     2041 tagaacaatg ctataatata taatgaacac gtgggtattt aataagaaac atgatgtgag
     2101 attactttgt cccgcttatt ctcctccctg ttatctgcta gatctagttc tcaatcactg
     2161 ctcccccgtg tgtattagaa tgcatgtaag gtcttcttgt gtcctgatga aaaatatgtg
     2221 cttgaaatga gaaactttga tctctgctta ctaatgtgcc ccatgtccaa gtccaacctg
     2281 cctgtgcatg acctgatcat tacatggctg tggttcctaa gcctgttgct gaagtcattg
     2341 tcgctcagca atagggtgca gttttccagg aataggcatt tgctaattcc tggcatgaca
     2401 ctctagtgac ttcctggtga ggcccagcct gtcctggtac agcagggtct tgctgtaact
     2461 cagacattcc aagggtatgg gaagccatat tcacacctca cgctctggac atgatttagg
     2521 gaagcaggga caccccccgc cccccacctt tgggatcagc ctccgccatt ccaagtcaac
     2581 actcttcttg agcagaccgt gatttggaag agaggcacct gctggaaacc acacttcttg
     2641 aaacagcctg ggtgacggtc ctttaggcag cctgccgccg tctctgtccc ggttcacctt
     2701 gccgagagag gcgcgtctgc cccaccctca aaccctgtgg ggcctgatgg tgctcacgac
     2761 tcttcctgca aagggaactg aagacctcca cattaagtgg ctttttaaca tgaaaaacac
     2821 ggcagctgta gctcccgagc tactctcttg ccagcatttt cacattttgc ctttctcgtg
     2881 gtagaagcca gtacagagaa attctgtggt gggaacattc gaggtgtcac cctgcagagc
     2941 tatggtgagg tgtggataag gcttaggtgc caggctgtaa gcattctgag ctggcttgtt
     3001 gtttttaagt cctgtatatg tatgtagtag tttgggtgtg tatatatagt agcatttcaa
     3061 aatggacgta ctggtttaac ctcctatcct tggagagcag ctggctctcc accttgttac
     3121 acattatgtt agagaggtag cgagctgctc tgctatatgc cttaagccaa tatttactca
     3181 tcaggtcatt attttttaca atggccatgg aataaaccat ttttacaaaa ataaaaacaa
     3241 aaaaagc
//
LOCUS       HUMUDPGTA    1855 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human 3,4-catechol estrogen UDP-glucuronosyltransferase mRNA,
            complete cds.
ACCESSION   J05428
KEYWORDS    3,4-catechol estrogen UDP-glucuronosyltransferase.
SOURCE      Human liver, cDNA to mRNA, clone 63-11.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1855)
  AUTHORS   Ritter,J.K., Sheen,Y.Y. and Owens,I.S.
  TITLE     Cloning and expression of human liver UDP-glucuronosyltransferase
            in COS-1 cells: 3,4-catechol estrogens and estriol as primary
            substrates
  JOURNAL   J. Biol. Chem. 265, 7900-7906 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.S.Owens, 22-FEB-1990.
FEATURES       from  to/span     description
    pept         15     1604     UDP-glucuronosyltransferase (EC 2.4.1.17)
BASE COUNT      571 a    337 c    394 g    553 t
ORIGIN      
        1 tgcattgcac caggatgtct gtgaaatgga cttcagtaat tttgctaata caactgagct
       61 tttgctttag ctctgggaat tgtggaaagg tgctggtgtg ggcagcagaa tacagccatt
      121 ggatgaatat aaagacaatc ctggatgagc ttattcagag aggtcatgag gtgactgtac
      181 tggcatcttc agcttccatt ctttttgatc ccaacaactc atccgctctt aaaattgaaa
      241 tttatcccac atctttaact aaaactgagt tggagaattt catcatgcaa cagattaaga
      301 gatggtcaga ccttccaaaa gatacatttt ggttatattt ttcacaagta caggaaatca
      361 tgtcaatatt tggtgacata actagaaagt tctgtaaaga tgtagtttca aataagaaat
      421 ttatgaaaaa agtacaagag tcaagatttg acgtcatttt tgcagatgct atttttccct
      481 gtagtgagct gctggctgag ctatttaaca taccctttgt gtacagtctc agcttctctc
      541 ctggctacac ttttgaaaag catagtggag gatttatttt ccctccttcc tacgtacctg
      601 ttgttatgtc agaattaact gatcaaatga ctttcatgga gagggtaaaa aatatgatct
      661 atgtgcttta ctttgacttt tggttcgaaa tatttgacat gaagaagtgg gatcagtttt
      721 atagtgaagt tctaggaaga cccactacgt tatctgagac aatggggaaa gctgacgtat
      781 ggcttattcg aaactcctgg aattttcagt ttcctcatcc actcttacca aatgttgatt
      841 ttgttggagg actccactgc aaacctgcca aacccctgcc taaggaaatg gaagactttg
      901 tacagagctc tggagaaaat ggtgttgtgg tgttttctct ggggtcaatg gtcagtaaca
      961 tgacagaaga aagggccaac gtaattgcat cagccctggc ccagatccca caaaaggttc
     1021 tgtggagatt tgatgggaat aaaccagata ccttaggtct caatactcgg ctgtataagt
     1081 ggatacccca gaatgacctt ctaggtcatc caaagaccag agcttttata actcatggtg
     1141 gagccaatgg catctacgag gcaatctacc atgggatccc tatggtgggg attccattgt
     1201 ttgccgatca acctgataac attgctcaca tgaaggccag gggagcagct gttagagtgg
     1261 acttcaacac aatgtcgagt acagacttgc tgaatgcatt gaagagagta attaatgatc
     1321 cttcatataa agagaatgtt atgaaattat caagaattca acatgatcaa ccagtgaagc
     1381 ccctggatcg agcagtcttc tggattgaat ttgtcatgcg ccacaaagga gctaaacacc
     1441 ttcgggttgc agcccacgac ctcacctggt tccagtacca ctctttggat gtgattgggt
     1501 tcctgctggt ctgtgtggca actgtgatat ttatcgtcac aaaatgttgt ctgttttgtt
     1561 tctggaagtt tgctagaaaa gcaaagaagg gaaaaaatga ttagttatat ctgagatttg
     1621 aagctggaaa acctgatagg tgagactact tcagtttatt ccagcaagaa agattgtgat
     1681 gcaagatttc tttcttcctg agacaaaaaa aaaaaaaaga aaaaaaaatc ttttcaaaat
     1741 ttactttgtc aaataaaaat ttgtttttca gagatttacc acccagttca tggttagaaa
     1801 tattttgtgg caatgaagaa aacactacgg aaaataaaaa ataagataaa gcctt
//
LOCUS       RATGFGDVA     630 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Rat glioma-derived vascular endothelial cell growth factor mRNA,
            complete cds.
ACCESSION   M32167
KEYWORDS    endothelial cell growth factor protein.
SOURCE      Rat cell line GS-9L, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 630)
  AUTHORS   Conn,G., Bayne,M.L., Soderman,D.D., Kwok,P.W., Sullivan,K.A.,
            Palisi,T.M., Hope,D.A. and Thomas,K.A.
  TITLE     Amino acid and cDNA sequences of a vascular endothelial cell
            mitogen that is homologous to platelet-derived growth factor
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2628-2632 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.L.Bayne, 16-FEB-1900.
FEATURES       from  to/span     description
    pept          5      577     endothelial cell growth factor protein
                                 precursor
    sigp          5       82     endothelial cell growth factor protein signal
                                 peptide
    matp         83      574     endothelial cell growth factor protein
BASE COUNT      169 a    163 c    172 g    126 t
ORIGIN      
        1 aaccatgaac tttctgctct cttgggtgca ctggaccctg gctttactgc tgtacctcca
       61 ccatgccaag tggtcccagg ctgcacccac gacagaaggg gagcagaaag cccatgaagt
      121 ggtgaagttc atggacgtct accagcgcag ctattgccgt ccgattgaga ccctggtgga
      181 catcttccag gagtaccccg atgagataga gtatatcttc aagccgtcct gtgtgcccct
      241 aatgcggtgt gcgggctgct gcaatgatga agccctggag tgcgtgccca cgtcggagag
      301 caacgtcact atgcagatca tgcggatcaa acctcaccaa agccagcaca taggagagat
      361 gagcttcctg cagcatagca gatgtgaatg cagaccaaag aaagatagaa caaagccaga
      421 aaatcactgt gagccttgtt cagagcggag aaagcatttg tttgtccaag atccgcagac
      481 gtgtaaatgt tcctgcaaaa acacagactc gcgttgcaag gcgaggcagc ttgagttaaa
      541 cgaacgtact tgcagatgtg acaagccaag gcggtgagcc aggctgcagg aaggagcctc
      601 ctcagggttt cgggaactag acctctcacc
//
LOCUS       RATYWKII     1821 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Rat sperm membrane protein (YWK-II) mRNA, 3' end.
ACCESSION   M31322
KEYWORDS    sperm membrane protein.
SOURCE      Rat testis cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1821)
  AUTHORS   Yan,Y.C., Bai,Y., Wang,L.F. and Miao,S.Koide.S.S.
  TITLE     Characterization of cDNA encoding a rat sperm membrane protein
            related to A4 amyloid protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2405-2408 (1990)
  STANDARD  full  staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.S.Koide 17-JAN-1990.
FEATURES       from  to/span     description
    pept    <     1      576     sperm membrane protein (AA at 1)
    mRNA    <     1     1821     YWK-II mRNA
BASE COUNT      447 a    427 c    416 g    531 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcgtta cctcctccat ctcagagaac cctgtggatg tccgggtgag ctctgaggaa
       61 agtgaggaga tcccgccgtt ccaccctttc catcccttcc catccttgtc tgagaacgaa
      121 gacactcagc cggagttgta ccacccaatg aaaaaaggat ctggaatggc agagcaagat
      181 gggggcctga ttggtgcaga agaaaaggtg atcaacagta agaataaaat ggatgaaaat
      241 atggtcattg acgagactct ggatgttaag gaaatgattt tcaatgctga gagagttggg
      301 ggtctggagg aagagccgga ctctgtgggg cctctacggg aggacttcag tttgagcagc
      361 agtgccctta ttggcttgct ggtcatcgcg gtggccattg ctacagtcat cgtcatcagc
      421 ctggtgatgc tgaggaagag gcaatacggc accatcagcc atggcattgt ggaggttcat
      481 ccaatgctca ccccagaaga gcgtcacttg aacaagatgc agaaccacgg ctatgaaaac
      541 ccaacctaca aatacctgga gcagatgcag atttaaggac agcagcgtgt gcgacaccct
      601 ggctgaggct gctgcaggtg ggctggaaga gcctcagcgt ttgtgcttga ctgctgacca
      661 ccagcggtgc cagaggcctc atcctacatc ctgctctcct ggattgttaa gactataaag
      721 tactactgta ggattgcaat ttccattctt ttaaatgggt ttaaaagatg ttaatataac
      781 aatatatgat atataaacct taagtgaaaa aaagatctat tgcagatatc tgatggatgt
      841 agttttcttt ttttaaatta gaaatgccac ttctattgta ttgtctcaca catgctctat
      901 ataaatggaa aatgttgatt tttcaatgat agactatata cacaggctgt tcccgttatg
      961 taagtctgtt ctttaggctc gtttgctggg ctggttttgt cgtcatttgt tttaatgtat
     1021 aaaggcagta ttcccctttt caggttgctg agaaatgtaa gtggaactga agtacattgt
     1081 atgcagttac tgactgtttt aggcatagtc tccttggaag cctagagctt ccagtgccgg
     1141 gtgtccagtg cctgtcacca aagcaagggc taagtcacct tgagctagct ggatgcaaac
     1201 tagatccact gtgctttcct tcaaatccag ttcttccaca gcaaccagcc catagttgtt
     1261 ctgtgttctt ccacagctgt ttacggtagc ctcctagcca ctctcctcag caagtgcatc
     1321 caagagtgca ccaccccctt ctttggacgt ctccgtccca tgcactgacc ctctgcttgc
     1381 cttcgtacct cacttcctcc accgctcttc agcccctttg atgtcccctc agagaatacc
     1441 gatatacaca tggctaagga cccaggagac ttcacgggag gcctcattag gtgaaaggac
     1501 gatgttctgg gctgtacatg aaattggatc tgtagacact gtgtttcctt cactgacttg
     1561 taatgtcacg cagctggagt tgatgccaca acccttagtg ctttgttgct gttttgtttt
     1621 tcagggttct ggtaacctgc tactgttttt gttttggttt tggtttggtt ttttttgtat
     1681 ttttctgtga tttccctccc cttccccccc atgcctcttc ccactatgca cagatggaaa
     1741 ctttacctac aaactccttc gtatgatctg tggagaatgt acagaactta ttacatcaat
     1801 aaaacacttt aacttccccc g
//
LOCUS       RHBNODVW     4227 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  B.japonicum nodulation protein (nodV, nodW) genes, complete cds.
ACCESSION   M31765
KEYWORDS    nodulation protein; regulatory protein.
SOURCE      B.japonicum (strain USDA3I1b110spc4) DNA, clone pRJ141.
  ORGANISM  Bradyrhizobium japonicum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 4227)
  AUTHORS   Goettfert,M., Grob,P. and Hennecke,H.
  TITLE     Proposed regulatory pathway encoded by the nodV and nodW genes,
            determinants of host specificity in Bradyrhizobium japonicum
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2680-2684 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Goettfert, 01-FEB-1990.
FEATURES       from  to/span     description
    pept        648     3317     nodulation protein (nodV)
    pept       3314     3997     nodulation protein (nodW; gtg start)
BASE COUNT      919 a   1147 c   1228 g    933 t
ORIGIN      
        1 ggatcctgcg gacgcagctt gtcgttctcg ttacgatgcg cgctgcgtta gcagagcatg
       61 gtgggtgtga tcagcgtgaa tggcaccgac aaagccgtgg cgagtcatac gaggctgccc
      121 tgcgttctgc gacgccgctc gcttccagcg gaaccagttt ttcgcagcta aagcgcgata
      181 agattggaac gaatcatcat cgcgcgtagg actcgcgcga ctccggcttc tgtgtcggca
      241 ggcgaacttg cgctttcttc gctttagcgg ctgagtcggg gagacgagtc attgggcagc
      301 ttttttctga tccttgccgg tccggaggcg cgccttcgaa cctatcggct gaactgaaac
      361 cactgtcctc attccgcggc ctaatggggc gcattggccg taggctcgag cgatgcgcaa
      421 aagcggacgt ttcagcttcg actaaagtcg cgtcagcgac acctactgaa ggtttaggac
      481 aaatatacgt atgtttataa tgcgtttttt ccccgcgcgt gctgagttcg gcagccgatg
      541 atcgatcttg cacacgctat catgcggacc caagtgttca agattgcctc atttttgcct
      601 aaaatgctat agctctcccg ccggataaca ggccggacgg tctcaaaatg tataggatca
      661 gcagtttcaa acgttcagcc cctcagttga ccttgggcag catagcgcta gccgcggtta
      721 cactgacttg cgtgtacttt caggcgcatt tcgccgccgc ggcgttcgcc tatttgttag
      781 tagtcttact attttcgttg atgggcagct tcattgcttc atcagcgctt tgcatcgtcg
      841 caatcgctgc tctcgcatac tactttgcgc cgccggcgtt tagtttacga atcgatgatc
      901 cccgcgatgt tcctgtggtt gttgcatttc ttattgtctc tgttgtcgga acgtacctga
      961 ttggaaaact ccgccaggaa agagaggctg cacgtgtggc tgcggccaag cttcagcgga
     1021 gtgcctcgga tttggaggat cgtgaaaaac ggtggcgcgc aattttcgag cacaatccag
     1081 ccatgtactt catggtcgat gaggccggca ttgtcctcaa cgtcaatacg ttgggcgcga
     1141 cacaactggg atttgcttgt gcagaactat tgggccaatc cgtgctcgac gtatttctgg
     1201 aggaggatcg cgcattcgtc cgcaaatgca ttcagacgtg tcttgaggat gttggacaat
     1261 cgcgcacttg ggacgtccgg aaagtcagga aggacggttc tgtattgtgg gtgcgtgaaa
     1321 acgccaaggc catgctttgg gccggcgacc gccccgtcat cctcatggcg tgcgaagata
     1381 ttacggagcg caagcagacc gagcttgcgc tgcagcggag cgaagcgcat ttggctcacg
     1441 cgcaggagtt gagtcataca ggcagcttca gctggaacgc ctctaccggc gaggccttct
     1501 ggtctaagga gacatttcgg attttccaaa tcgatcttca gacgacaccg gcgccacaac
     1561 tcgtcattga gcgcacgcac ccagatgata gggcttctgt caaagagatt atcgatgaag
     1621 cgatgcgaga cctgagggat ttcgagcacg agtaccggct gctgctacct gacggctccg
     1681 tgaagcacat ccatgcgcag gcacgagtca cgcgaaccgc ctctggtgaa attgagtttg
     1741 ttggggcagc caccgatatt acggcagcta ggcgagcaga acagcagttg cgccgaagcg
     1801 aggcctatct ggccgaggct cagcatctca ctcacacagg cagctggtcc tgggacgtcc
     1861 acacacgaga tttcgtttat cgctccgctg aggtcgaccg cctgtttggc tttaacccac
     1921 aagagccggt ttcgctagag actattcgat cgcgcatcca tccggaagac ttgccagggt
     1981 tgcaggaggt gcagcgtcag gcgattgacc aggagcacga acggttcgaa tatgatttcc
     2041 gtgttattct gccagatggc gggataaggc gcatacactc cgttgcacac gttgtcgtcg
     2101 gcagcgatgg taatgtcagc gagctgatcg gaacacatat ggatgttacc gagcaacacg
     2161 cagctaggga acgcttggaa aacacgcttg tagcgctgcg cgagagcgaa cagcgctttc
     2221 gcgactatgc cgagactgct tccgactggc tctgggaaac cgggccagat catcgggtca
     2281 ctcacttgtc cgagcacacc agcgctgcgg gaattttggc gacagggtta acgggcctgc
     2341 ttcgctggga catcgcgtgc gacatggaag aagaaccgga gaagtggcgt cagcatcggg
     2401 cgacgttgca ggcacacctg ccgttccggg atctcatcta ccgtaccgta aataggatgg
     2461 gatctccgat ctacgtccgc actagtggca agcccttctt cgacggaaac ggaaattttt
     2521 tgggctatcg cggcgtcagc actgacatca ccgctaccat tcgcgctgat caggccgaac
     2581 aagagctgcg aaaggcacag gcggagcttg cacatgtgac gcgtgtaacg accttgggag
     2641 aaatgacaac ttctatcgcc cacgagataa cccaaccact cgccgctatc ctcagcaacg
     2701 ccgatgcgtg cctcggctgg atggctcgcg atgttcccaa tcttgcagcc gcgcgctctt
     2761 cagtcgaatg gatcatagaa gatgcaatcc gggcaagcga ggtgatccgt agtattcgcg
     2821 cactcgcgaa aaagggcgag atcgagatgg tgccgctcga cattaatcag gtggttaggg
     2881 acgtcagcgc gctggtaaca cgagagctgg tgagccacca agtgacgttg cgaagcgagt
     2941 tggcgtctgc gctgcctagg gtcctcggtg atcgaattca gctacaacaa gtgatcatca
     3001 atctggtgat gaacggaatc gaggccatgg acgcagttac agaccggccg cgtgaacttc
     3061 tgattcaatc atctacggac gatctggggt acgtgcagct ttccgtgacc gattgcggcg
     3121 tcgggatcgc cgagaatgac gcggaccgcg tcttggaccc cttcttcacc accaaatcga
     3181 gcggcctagg aatgggcctt tcgatctgcc ggtcgatcgt ggaagttcac ggaggacgaa
     3241 tttcagtggt tcagaaaaat ggaccgggcg cgacgttcca gtttgccctt ccgctgcata
     3301 aggaggccat ctcgtgacag gacgatttga ctggagaggc caaggcggac ataccgaggc
     3361 ttcgacgaag gcaatcgtct ttgtcgtcga ggatgacatc tctatgcgtc gctcgcttac
     3421 gaaccttttt cgatcggtag gcttggaggt cgtggcgttc ggatcggccc gtgaaatgct
     3481 gcagagcaca atgccggacg tcacaagctg tctagttctt gatgtccggc tgccgggcct
     3541 gagcggcctt gactaccaga ccgagctcgc caggttgaac atacacatcc caatcatctt
     3601 cattaccggc catggcgaca ttcccatgac cgtcagggcc atgaagggag gcgcggtcga
     3661 tttcctcagc aagcccttcc gcgatcagga actgcttgat gccgtcgttg cggcgaccga
     3721 acgcgatcgc aaaagacgag aagctcagcg aaccgtggcg aacctgaaat ctctatttga
     3781 gaccctaagc ccgcgagaac aggcagtgat gaaactggtc gcgacggggc tgatgaacaa
     3841 gcaggtagcc gccgaacttg ggctcgccga gatcaccgtc aagatctacc ggggacacgt
     3901 aatgaaaaag atgcgtgccc gctcgctggc tgacttgatc agaatgagcg agacgctcgg
     3961 aattagcgcc aatcacactg aacaaaccca agtatgattt tacaattcca tcacttaagc
     4021 ccactttcgc gaaagtggct gacggtttgg cagccgctgt actgcgtcgg gagggctcat
     4081 cttgtccacg cctttgattt ccgtcgttga cgacgacccc tcggtccgtg cggcgacaga
     4141 aaaccttttg aaatcgcgtg gctacgtcgt gcagatattt gcctcggccg aggcgctcct
     4201 gcggtcgccg cggttgaacg agatatc
//
LOCUS       ANAPETF1A    1682 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  A.variabilis ferredoxin (petF1) gene, complete cds.
ACCESSION   M35126
KEYWORDS    ferredoxin.
SOURCE      A.variabilis (ATCC 29413; PCC 7937) DNA, clone pVA1.
  ORGANISM  Anabaena variabilis
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Nostocaceae.
REFERENCE   1  (bases 1 to 1682)
  AUTHORS   Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M.,
            Van Arkel,G. and Weisbeek,P.
  TITLE     Genes encoding ferredoxins from Anabaena sp. PCC 7937 and
            Synechococcus sp. PCC 7942: Structure and regulation
  JOURNAL   Photosyn. Res. 18, 179-204 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        794     1093     ferredoxin (petF1)
    binding     782      785     ribosomal binding site (put.)
BASE COUNT      480 a    353 c    322 g    527 t
ORIGIN      
        1 aagcttgatg taagccacct aaaagttgaa tgtgttcgac gtggagtaat ttttgctgtg
       61 ctttggctaa ggcaatcagt tcttctgctt cagttaaatc tacagataaa ggatattcta
      121 caattacgtg tttgccggca gttagagcag cacggcgatc gcaccatgat cacgattgat
      181 ggtacaaata accactaaat ctatatcttc tcgttctact aactgttgcc agccagttat
      241 cgcttcagcc tggtaatctt gagccaaagc ctgggttcgc tccagtgtac tacccgctat
      301 ggcgactagg tgcgatcgct tatcctccaa aaaggcttct gcccgcagtt ttgctgcata
      361 ccctgtccaa ctatgccaat acgtattgtt gcttgtgcca aagctacctc cgactcctat
      421 ataatcctca ccagtcctgc taccaagcac agtaataaat ttttattttt caaaaaaagc
      481 taggttaacg cagtaccagc ctaactgaag gttatccctg ttccagaact tttttatcat
      541 attcttgatt caagccataa atttttctaa taactgtggg tttaacgtct ttaaagtata
      601 aaaaaaattt attgttatct gctaactaaa tttcattact aatcgtggtg gatttcaagt
      661 tatatacttt gatttttccc gtagtatcag aattgaacta aatttaattc tgtggctaat
      721 cccctgagaa tagccgctaa gttctgcttt agcataactt atactgccga ttacaaaaga
      781 gaggattacg gaaatggcaa cttttaaagt tacattgatc aacgaagcag aaggaaccag
      841 caacacaatt gacgttcctg atgatgagta tattttagac gctgccgaag aacagggtta
      901 tgacctaccc ttttcctgtc gtgcaggtgc ttgctccacc tgcgccggta aactagtatc
      961 cggtactgtt gaccagtctg accaatcatt cttggatgac gatcaaatcg aagctggata
     1021 tgtattgacc tgtgttgctt atccaacctc tgatgtaacc atccaaaccc acaaagaaga
     1081 agacctctac taagagtcaa gttagcttta aaaatagagg ctagaggcta gcagctacag
     1141 actagtccct agccttttta ttttttttca ttagggaaca gtctattggg actgggtgta
     1201 agggtgtaag ggtatagggg tgtaatagtt tcaaacattt atactctttt caacccttga
     1261 tatttcgttt tcatccgtaa gtcccacaaa aaataaatta tccaaaattg atggtttggt
     1321 agggtgcgtc aatagaaatc atttctgagt gtatttaggc tctatcgcac tgacggacac
     1381 tacattttgg atattttttg atctgaaagt cccttaaaat ttgatttctt cattcaggct
     1441 gtgaagttgt gtaccaggat aataaaattg gagaattttg ctgttagacc aacctagttt
     1501 agctaaagtt tgagcgccag tttgacttaa gcccactcca tgtcctaaac caccaccaat
     1561 aaaagcgtat cccacaactc ttctttgcct ttgttgagag gtagtatgta aaaaagcgtg
     1621 cttcttggcg cagcaaaggc actacgcact tcatctttgt gtagggtaaa aatgcccata
     1681 tc
//
LOCUS       ANIPETF1A    1074 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  A.nidulans ferredoxin (petF1) gene, complete cds.
ACCESSION   M35127
KEYWORDS    ferredoxin.
SOURCE      A.nidulans (strain R2, PCC7942) DNA, clone pA612.
  ORGANISM  Anacystis nidulans
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria.
REFERENCE   1  (bases 1 to 1074)
  AUTHORS   Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M.,
            Van Arkel,G. and Weisbeek,P.
  TITLE     Genes encoding ferredoxins from Anabaena sp. PCC 7937 and
            Synechococcus sp. PCC 7942: Structure and regulation
  JOURNAL   Photosyn. Res. 18, 179-204 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        504      803     ferredoxin (petF1)
    binding     493      496     ribosomal binding site (put.)
BASE COUNT      234 a    298 c    281 g    261 t
ORIGIN      
        1 ctgcagccgc tgcatgcgca gggcgttcga tttgagggct ttttgtggaa catcttggcg
       61 cacatcatcg aagctctaac accgaggctg tctgagttgg ctagtcctgg cagctgggca
      121 atctttagcg gcttgctaac cagtcaagcc gacactgtca gcgtcacttt ggaagagtac
      181 ttttgggtga tccgcgatcg cgccagtcag ggagattggt gtcgtttggt cgcggatttt
      241 cggccggaac gataaatctc actaatgctt agcttagagg gcttactggg agcgggccga
      301 gtttgagccg tgattacccc tacgaacttt ccggccacgc tccattgctt agacataaaa
      361 ttcccttatg tctagactgg cgattgatag catttctcgc ggcgcagttc gccctttggc
      421 aacccatagt atcaatggga aaggtacggg caggctgtca atcgatgagc tctgccaccc
      481 caaaagcgat agaggacacg ctcatggcaa cctacaaggt tacgctcgtc aatgctgccg
      541 aaggcttgaa caccacgatc gacgtggctg acgataccta catcttggac gccgctgaag
      601 agcaaggcat tgacctgcct tactcctgcc gtgctggtgc ttgctcgacc tgtgctggca
      661 aagtcgtctc tggtaccgtc gaccaatcgg atcaatcctt cttggatgac gaccaaattg
      721 cagcaggctt tgtcctgacc tgcgtcgcct atccgacctc cgatgtgacg atcgaaaccc
      781 acaaagaaga agacctctac taagtcttgc ttcgactgct tcaatcctta gaattcaaat
      841 caaattgcgg cttccaaatt gggagtcgct tttttatcgc cgtagtcagc agcgatcgcc
      901 tcactaccgt ccagaaacag cctgcgatcg ccctgactta acgcccctcc cctagcaatc
      961 ctgactcggt agagttcagg ccaaggactc aaggttgagg cgatcgttca gcaacgtctt
     1021 gctctaggcg tccggcgtga ctgttgtggg gaagaggcga tcgccttcct gcag
//
LOCUS       ASAASPPA     1764 bp ds-DNA             PLN       27-JUL-1990
DEFINITION  A.awamori aspergillopepsin A (pepA) gene, complete cds.
ACCESSION   M34454
KEYWORDS    aspartic proteinase; aspergillopepsin.
SOURCE      A.awamori (NRRL3112 mutant strain UVK143f) DNA.
  ORGANISM  Aspergillus awamori
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 1764)
  AUTHORS   Berka,R.M., Ward,M., Wilson,L.J., Hayenga,K.J., Kodama,K.H.,
            Carlomagno,L.P. and Thompson,S.A.
  TITLE     Molecular cloning and deletion of the gene encoding
            aspergillopepsin A from Aspergillus awamori
  JOURNAL   Gene 86, 153-162 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        177      496     aspergillopepsin (pepA), exon 1
                548      825     aspergillopepsin, exon 2
                878     1126     aspergillopepsin, exon 3
               1186     1523     aspergillopepsin, exon 4
    sigp        177      308     aspergillopepsin signal peptide
    matp        455      496     aspergillopepsin
                548      825     aspergillopepsin
                878     1126     aspergillopepsin
               1186     1523     aspergillopepsin
    pre-msg     124     1764     pepA mRNA and introns (put.)
    pre-msg     125     1764     pepA mRNA and introns (put.)
    pre-msg     127     1764     pepA mRNA and introns (put.)
    IVS         497      547     pepA intron A
    IVS         826      877     pepA intron B
    IVS        1127     1185     pepA intron C
    signal       54       58     TATA box
BASE COUNT      387 a    515 c    445 g    417 t
ORIGIN      
        1 gtcgacttgg atgatggaga gttgatcgag gtcaatgagg agaggcttgc aagtataaga
       61 agagactgct cgaccagcag aatggatctt cttgttcatc aaccaagagt ccaaggcttc
      121 tttgtctggt tctatctctt ctccgaactc tcttgcttga cattctcgtg gtcaaaatgg
      181 tcgtcttcag caaaaccgct gccctcgttc tgggtctgtc ctccgccgtc tctgcggcgc
      241 cggctcctac tcgcaagggc ttcaccatca accagattgc ccggcctgcc aacaagaccc
      301 gcaccatcaa cctgccaggc atgtacgccc gttccctggc caagtttggc ggtacggtgc
      361 cccagagcgt gaaggaggct gccagcaagg gtagtgccgt gaccacgccc cagaacaatg
      421 acgaggagta cctgactccc gtcactgtcg gaaagtccac cctccatctg gactttgaca
      481 ccggatctgc agatctgtaa gcttccctgc tcgggtgttc gggcaaatcg tgactaacct
      541 ggactagctg ggtcttctcg gacgagctcc cttcctcgga gcagaccggt cacgatctgt
      601 acacgcctag ctccagcgcg accaagctga gcggctacac ttgggacatc tcctacggtg
      661 acggcagctc ggccagcgga gacgtgtacc gggatactgt cactgtcggc ggtgtcacca
      721 ccaacaagca ggctgttgaa gcagccagca agatcagctc cgagttcgtt cagaacacgg
      781 ccaatgacgg ccttttggga ctggccttta gctccatcaa cactggtgag tcaatcctac
      841 atcagccggg ttgacctacc tgctgaccga tagacagtcc agccaaaggc gcagacaacc
      901 ttcttcgaca ccgtcaagtc ccagctggac tctccccttt tcgccgtgca gctgaagcac
      961 gacgcccccg gtgtttacga ctttggctac atcgatgact ccaagtacac cggttctatc
     1021 acctacacgg atgccgatag ctcccagggt tactggggct tcagcaccga cggctacagt
     1081 atcggtgacg gcagctccag ctccagcggc ttcagcgcca ttgctggtaa gaaccgcctt
     1141 catttaacac acaacttgtc cacctcttta ctaactagtg tatagacacc ggtaccaccc
     1201 tcatcctcct cgatgacgaa atcgtctccg cctactacga gcaggtttct ggcgcctcag
     1261 gagagacgga agccggtggc tacgttttct cttgctcgac caacccccct gacttcactg
     1321 tcgtgattgg cgactacaag gccgttgttc cgggcaagta catcaactac gctcccatct
     1381 cgactggcag ctccacctgc tttggcggta tccagagcaa cagcggtctg ggactgtcca
     1441 tcctgggtga tgttttcttg aagagccagt acgtggtctt caactctgag ggccctaagc
     1501 tgggattcgc cgctcaggct tagattatcc actgaagtgg agtctatgat ctgctgattg
     1561 atccctcgac gatgaactac atgtggaaat gcatagcaga cgagggtgat ggtgatgatg
     1621 ttgatttgat gatgacccgt acatacttga tgaagctcgg tacatatgca atgtgactgt
     1681 atctatgtga tgatatatgt atcatctcat aacttttggc tatgagtgca ggataacact
     1741 gaaccagtag tagtactttc cact
//
LOCUS       BSUPSIF2A    2758 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  B.subtilis protein synthesis initiation factor 2 (infB) gene,
            complete cds.
ACCESSION   M34836
KEYWORDS    protein synthesis initiation factor 2.
SOURCE      B.subtilis (strain RS410) DNA, clones lambda-JET[1,2],pUK, and
            pJET1.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2758)
  AUTHORS   Shazand,K., Tucker,J., Chiang,R., Stansmore,K.,
            Sperling-Petersen,H.U., Grunberg-Manago,M., Rabinowitz,J.C. and
            Leighton,T.
  TITLE     Isolation and molecular genetic characterization of the Bacillus
            subtilis gene (infB) encoding protein synthesis initiation factor 2
  JOURNAL   J. Bacteriol. 172, 2675-2687 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        381     2531     protein synthesis initiation factor 2 (infB)
    binding     369      375     ribosomal binding site (put.)
BASE COUNT      917 a    535 c    662 g    644 t
ORIGIN      
        1 atcgatgacc agattttcga tgaattgctg gaactggcgg aaaaggtgaa aaaataaaat
       61 gtctggaatg gaatggtttc ccttgctggg tctggccaat cgagctcgta aggtcgtgtc
      121 aggcgaagac ttggtaataa aagaaatcag gaatgcgcgt gcaaagcttg tcctgcttac
      181 agaggatgca tcatctaaca cagcaaaaaa agtaactgac aagtgcaatt attataaagt
      241 cccttataaa aaagtcgaga gtcgcgcggt tcttggacgc tctatcggta aagaagcccg
      301 tgtcgttgtc gccgtcactg accaaggttt tgcgaataag ctgatcagct tgctcgatta
      361 atatttttgg gggtgaacga atggctaaaa tgagagtata cgaatatgca aaagcgttaa
      421 atgtttcaag taaggaaatt ttgaccgcac tgaagaacat ggatttagaa gtgaataatc
      481 acatggccat gcttgaagaa aaggccatta aaaagctaga tgccaaatat aaaaaaggcg
      541 gcgcacgtgc taaatctcaa aagccagcag aaacgaacaa aaacaaacag ccacaagggg
      601 ttaatcagca atcagctgga aatcaaccaa ataaaattcg agacggaaag aagaatgacg
      661 tgcagaataa tcaatttaac aaaaacaaga agaataacaa caacaaaaaa aataaacgca
      721 acaacaacaa taataaaaac caacatcagc aaaagcctgt aaagccgaaa aaagagcttc
      781 ctgagaaaat tacattctct ggcactttaa cagttggcgc acttgctgaa gagcttggca
      841 aagagccttc agaactcatt aaaaagctga tgcttcttgg cgtaatggca accattaacc
      901 aagagcttga taaagacaca atcgaactca ttgcatcaga atatggtgtt gaaacagaag
      961 aggtcattgt gcttgaagaa acagagctgg aaaaatacga agagcctgat aatgaagagg
     1021 atcttgaaat tcgtcctcct gtcgtgacaa tcatgggcca cgttgaccat gggaaaacaa
     1081 cgcttcttga cagcatccgt aaaacaaagg ttgttgaagg ggaagcaggc ggaatcactc
     1141 agcatatcgg cgcttatcaa attgaagaga acggcaagaa aatcacgttc ttggatacac
     1201 cgggccacgc cgcatttaca acaatgcgcg cacgcggtgc agaagtaact gatattacga
     1261 ttcttgtcgt agctgccgat gacggtgtca tgccgcaaac agttgaagcg attaaccatg
     1321 caaaagcagc agaggttcca atcatcgttg ctgtgaataa aatagataaa gaatccgcaa
     1381 accctgaccg tgtaatgcaa gaactgacgg aatacggact tgtccctgaa gcttggggcg
     1441 gagaaaccat ttttgtgccg ctttccgctt taactggaaa aggcattgat gagctcgttg
     1501 aaatgatttt gcttgtcagt gaggtagagg aactgaaagc gaatccgaat cgtcaggcaa
     1561 aaggaacggt tattgaagct gaactcgata aaggcagagg gtcagtcgcg acgttgctcg
     1621 tacagactgg aacactgcat gtcggtgatc cgatcgtagt cggcaataca tttggccgtg
     1681 tccgtgcaat ggtcaacgac attggccgcc gtgtgaaaac tgccggcccg tcaactccgg
     1741 ttgaaattac cggtttgaat gatgtccctc aagcgggaga ccaattcctt gtctttaagg
     1801 atgaaaaaac agctcgttct gtcggtgaag cccgtgcttc gaaacagctt gaagagcagc
     1861 gcagcgataa agcgaagctc agtcttgatg atttatttga gcaaattaag caaggtgatg
     1921 taaaagacat caacctcatc gtaaaagctg acgttcaagg atctgctgaa gctttaacgg
     1981 ctgcacttca aaaaattgaa gtagaaggcg ttaaagtgaa aatcatccat acaggcgttg
     2041 gtgcgattac tgaatcagac attatcttgg catctgcttc caatgcaatt gttatcgggt
     2101 ttaatgtgag accggacgga aatgctaaga gtacggctga agctgaaaat gtagatattc
     2161 gacttcaccg tatcatttac aaagtaatcg acgagattga agctgccatg aaaggtatgc
     2221 ttgatcctga atatgaagaa aaagtaattg gtcaagtaga agtacgccaa acattcaaag
     2281 tatctaaaat cggtacaatt gccggcggat atgttactga aggaaccatt acgcgcgaca
     2341 gcggcctccg tttaattcgt gacggcgtcg tcatctttga aggcgaagta gatgttctga
     2401 aacgctttaa agacgatgtg aaagaagttt cacaaggcta tgaatgtggt attacaatta
     2461 agaaatacaa tgacattcgt gaaggtgaca tccttgaagc gtttgtcatg caagaaattg
     2521 aaagaacgtg atcggatttg cggagtgtga atgcatcatt tatgatgcag gatcgctaaa
     2581 agaaaagcgt gccgttctga agcggatttt aaccagggtt caaaacaagt tcaatgtttc
     2641 gatttcggag attggctatc aggacacctg gcaaagaacc agcttcggaa tcgccgctgt
     2701 ttcttcctct cgcgttcaaa cagaaaaaga actgcagcgc gtcctagcgt ttatcgat
//
LOCUS       CIBRCRA      1845 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  Plasmid Colicin Ib-P9 replication control region replicon initiator
            protein (repZ) gene, complete cds.
ACCESSION   M34837
KEYWORDS    replicon initiator protein.
SOURCE      Plasmid Colicin Ib-P9 (from E.coli) DNA, clone pCH10-delta-1.
  ORGANISM  Plasmid Colicin Ib-P9
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1845)
  AUTHORS   Hama,C., Takizawa,T., Moriwaki,H., Urasaki,Y. and Mizobuchi,K.
  TITLE     Organization of the replication control region of plasmid Co1Ib-P9
  JOURNAL   J. Bacteriol. 172, 1983-1991 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        455     1486     repZ protein
    mRNA        157     1762     repZ protein mRNA (alt.)
    mRNA        158     1762     repZ protein mRNA (alt.)
    mRNA        363  <     1 (c) inc RNA
    binding     442      448     ribosomal binding site (put.)
    signal      377      372 (c) inc -10 region
    signal      400      395 (c) inc -35 region
    mut         353      353     c in wild type; t in inc3
    mut         374      374     t in wild type; c in inc2
    mut         400      400     a in wild type; c in inc1
BASE COUNT      505 a    467 c    490 g    383 t
ORIGIN      
        1 gatctaaagc agaaaaatct gcttttaaaa atagatttta tttttttgac gcaggtcaag
       61 attaacttat tagaggtatc agtgaggagg cactggaaga gaagagatcg ttgtaatgct
      121 tttcaaatta acgtaaagcg ggtatatttc ggctgttatt agctgtgcag agggtggcac
      181 tctgtggagc aaagcggcga aagccggacg gcagaatgcg ccataaggca ttcaggagag
      241 atggcatgta cgggcagtaa gtcagaagac tgaagatgtt ccggaagcca taaaaggaaa
      301 acccccacta tctttcttac gaacttggcg gaaagacgaa agatagtggg ggcctcacag
      361 aatacgggta aagtataatg aaaccgtacc agagattcaa ccctgtgcag tgtataaata
      421 cacggcacaa tcgctccgcc ataagcgaca gcttgtggca ggtctgaaga atactccata
      481 taacgcagta cactggagtc agttagcacc cgaagagcag atccgtttct gggaagacta
      541 tgaagcggga agggcgacca ctttcctggt tgaaccggaa aggaagcgca cgaagcgtcg
      601 tcgcggtgag cactccacca aacccaaatg cgaaaatccg tcctggtatc gtcctgagcg
      661 ctataaagcg ctgaaagggc agctcgggca cgcctataac cgtctggtga aaaaggaccc
      721 ggtgaccggc gagcagagcc tgcgcatgcg catgtcgcgg catccttttt atgtacagaa
      781 acggacgttc gtcggtcgca aatacgcttt ccggccggaa aaacaacgtc tcctcgatgc
      841 catctggccg gttctggtca gcttcagtga tgccggcaca cacaccgtgg ggatgagcgt
      901 cactcggctg gctgaagaaa tcagcccgaa agacagcgag ggacatgtta ttccggaact
      961 ggaagtgacg gtctcccgtc tttcccgtct gctggccgaa caggtgcgct ttggtgtgct
     1021 gggtgtgtca gaggaaacaa tgtgggaccg tgaacatcgt cagcgtctgc cacgttacgt
     1081 ctggataaca ccggcaggct ggcagatgct gggcgtcgac atggtaaaac ttcacgaaca
     1141 gcagcagaaa cgactgcgtg aaagtgaaat ccgccagcag ctcattcggg aaggtgttct
     1201 gcgtgaggat gaagatatct ccgttcatgc ggccagaaaa cgctggtatc tgcagcgcag
     1261 ccaggatgcc ctgaagaagc gccgagaaaa agccgcagcc agcaagcgtg ccaaccgcct
     1321 gaagaaatta cctgttgacc agcagattta tgagatggct gagtatctcc ggaagcgtct
     1381 gccgccggat gaagcctatt tctgttccga tgaccatctg aagcgactgg ccatcaggga
     1441 gttgcgtcag cttgaactga cgctggctgc cccgccaccg cactagacag catcattccc
     1501 tcagcactga atcatcgcca gcccctccgg ggctttcggc gcagattccg ctcagcccaa
     1561 aatccttagt agtcacctta aatcccctca gaggggcata tctgcccata aaaccacgca
     1621 tcagtcatca gaacgtggcc acgttgtttc agttatccac ataaatccgc aaacaaagaa
     1681 ttttaagaag ctgcaaacct gaaacagcaa acctgcaata tattcttaac cccatcattt
     1741 aatcccctgc gttgcttcgc cgcagggaaa gtctttatct ctgaaaccac tgtgaacaaa
     1801 tacaaaaagg ccttcgcttg cagcgggcaa ggccgcgccg gaatt
//
LOCUS       ECOHEMBA     1085 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  E.coli porphobilinogen synthase (hemB) gene, complete cds.
ACCESSION   M35121
KEYWORDS    5-aminolevulinic acid dehydratase; hemB gene;
            porphobilinogen synthase.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1085)
  AUTHORS   Echelard,Y., Dymetryszyn,J., Drolet,M. and Sasarman,A.
  TITLE     Nucleotide sequence of the hemB gene of Escherichia coli K12
  JOURNAL   Mol. Gen. Genet. 214, 503-508 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         80     1054     porphobilinogen synthase (hemB)
    signal       15       20     hemB -35 region
    signal       41       46     hemB -10 region
BASE COUNT      254 a    277 c    301 g    253 t
ORIGIN      
        1 tcgcgacaac tttcgtaaaa catccctacc ctgcttcagg tatactatgc ccctcgattc
       61 cacaaacatc aggcagacca tgacagactt aatccaacgc cctcgtcgcc tgcgcaaatc
      121 tcctgcgctg cgcgtatgtt tgaagagaca acacttagcc ttaacgacct ggtgttgccg
      181 atctttgttg aagaagaaat tgacggacta caaagccgtt gaagccatgc caggcgtgat
      241 gcgcattcca gagaaacatc tggcacgcga aattgaacgc atcgccaacg ccggtattcg
      301 ttccgtgatg acttttggca tctctcacca taccgatgaa accggcagcg atgcctggcg
      361 ggaagatgga ctggtggcgc gtatgtcgcg catctgcaag cagaccgtgc cagaaatgat
      421 cgttatgtca gacacctgct tctgtgaata cacttctcac ggtcactgcg gtgtgctgtg
      481 cgagcatggc gtcgacaacg acgcgactct ggaaaattta ggcaagcaag ccgtggttgc
      541 agctgctgca ggtgcagact tcatcgcccc ttccgccgcg atggacggcc aggtacaggc
      601 gattcgtcag gcgctggacg ctgcgggatt taaagatacg gcgattatgt cgtattcgac
      661 caagttcgcc tcctcctttt atggcccgtt ccgtgaagct gccggaagcg cattaaaagg
      721 cgaccgcaaa agctatcaga tgaacccaat gaaccctcgt gaggcgattc gtgaatcact
      781 gctggatgaa gcccagggcg cagactgcct gatggttaaa cctgctggag cgtacctcga
      841 catcgtgcgt gagctgcgtg aacgtactga attgccgatt ggcgcgtatc aggtgagcgg
      901 tgagtatgcg atgattaagt tcgccgcgct ggcgggtgct atagatgaag agaaagtcgt
      961 gctcgaaagc ttaggttcga ttaagcgtgc gggtgcggat ctgattttca gctactttgc
     1021 gctggatttg gctgagaaga agattctgcg ttaattttgt ttcctctcag aaagggggag
     1081 ttaac
//
LOCUS       ECOSTMA      1418 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  E.coli F factor (stm) gene, complete cds.
ACCESSION   M35124
KEYWORDS    F factor; stm gene.
SOURCE      E.coli DNA, clones pEG[157,159,175,176,220].
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1418)
  AUTHORS   Golub,E.I. and Panzer,H.A.
  TITLE     The F factor of Escherichia coli carries a locus of stable plasmid
            inheritance stm, similar to the parB locus of plasmid RI
  JOURNAL   Mol. Gen. Genet. 214, 353-357 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        830     1009     F factor (stm)
    signal     1013     1039     pot. transcription termination signal
    signal     1074     1105     pot. transcription termination signal
BASE COUNT      279 a    364 c    429 g    345 t      1 others
ORIGIN      
        1 atcgctgtct gacctgcgat ttttcatgcc gtccctgacc gcagaagaac tgcacggcaa
       61 ccgtcttcag tggctgtatg gcatcgatgt gctgattgaa acacagggcg aggtctgcct
      121 ccttcccctt cccggtgatg cagcagagcg gtcgtttccg tcggtgcggt ttcgtgtcag
      181 ggagcgcagt cgccataaat ccgcgctggt catgcagaag tacagccgcc agcaggcacg
      241 ggaggcagag cagaaagccc gtgcgtacca ggcgctggtc gctcaggctg aaatcgaact
      301 ggcattccat tcgccggaaa cagtcggaag ctggcatgca cgctggagtg accgggtggc
      361 agagcatgac ctggaaaccc tgttctggca gtggggcgag cgttttccgt cactggccgg
      421 aatggaacgc tggcagtggc aggatatgcc gttcgtggca ggttattgcg gaacgcagtc
      481 ttgcagcaag agaggccggt cacgctgtcc gtgagatgag cgtgatgtgc cgaacaaact
      541 ccggaggcag cgtgatgcgg gcaacaatca cacagattac ccgtaaacag cctgaatgag
      601 cgggttattt tcaggaaaag tatgtgtggt cagcgtgcag ggatatggct atgatgtgcc
      661 cggcgcttga ggctttctgc ctcatgacgt gaaggtggtt tgttgccgtg ttgtgtggca
      721 gaaggacaaa agccccatag ttaatttttc attaacccac gaggcccctg catgcttaga
      781 caacatcagg atagcctctt actgcgcttt cgacgaggag aagaaggcca tgaaactacc
      841 acgcagctct cttgtctggt gtgtgttgat cgtgtgtctc acactgttga tattcactta
      901 tctgacacga aaatcgctgt gcgagattcg ttacagagac ggatacaggg aggtngcggc
      961 cggatacagg gaggtggcgg ctttcatggc ttacgaatcc ggtaagtagc aacctggagg
     1021 cgggcgcagg cttttcaggg ctgatgctgg tctgactgca ctgatgcctt tataaagggg
     1081 ctgctggttc gccggtggcc cctttctcct tgctggcatt gtacgggcat gaacaaccag
     1141 actgccagga aggccgtttc cctctgggaa cggggaagaa cgtgcgcctg cggccgttct
     1201 tttcccccgc cttctctgtt tatgacctgt cagaaatcac atcaccttcc tttgattatt
     1261 ctctttttct ctgttctgac tctgattact gctgccctga ttcctgtctg ttttgttgtt
     1321 ctcctggcag tgtcttttcc gggcgtcctg gcggggcggc tggcgtgaac ggagccagcg
     1381 aagctgtctc tggcgggctt ccatccctga cgggaaac
//
LOCUS       HUMGPIIBA    3333 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human platelet glycoprotein IIb (GPIIb) mRNA, complete cds.
ACCESSION   M34480
KEYWORDS    platelet glycoprotein IIb.
SOURCE      Human megakaryocytes, cDNA to mRNA, clone IIb[3,4].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3333)
  AUTHORS   Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and
            Marguerie,G.
  TITLE     GPIIb and GPIIIa amino acid sequences deduced from human
            megakaryocyte cDNAs
  JOURNAL   Mol. Biol. Rep. 14, 27-33 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         33     3152     platelet glycoprotein IIb (GPIIb)
    mRNA          1     3333     GPIIb mRNA
BASE COUNT      626 a    998 c   1040 g    669 t
ORIGIN      
        1 attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag
       61 ccctctggct tctggagtgg gtgctgctgc tcttgggagc ttgtgctgcc cctccagcct
      121 gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt
      181 ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg
      241 ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga
      301 gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg
      361 taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca
      421 gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga
      481 ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc
      541 gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg
      601 attttagctg ggacaagcgt tactgtgaag cgggcttcag ctcggtggtc actcaggccg
      661 gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc
      721 cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc
      781 agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg
      841 tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca
      901 cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc
      961 ggctgcgcgc agagcagatg gcgtcgtatt ttgggcattc agtcgctgtc actgacgtca
     1021 acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggac agccgggcag
     1081 accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg
     1141 cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg
     1201 ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc
     1261 cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc
     1321 tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct
     1381 tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag
     1441 cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc
     1501 agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca
     1561 agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc
     1621 ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc
     1681 ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctggacctg gatctgggcg
     1741 gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc
     1801 gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg
     1861 gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg
     1921 tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga
     1981 cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca
     2041 acgagggcga gggggcctat gaagcagagc tggcggtgca cctgccccag ggcgcccact
     2101 acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg
     2161 agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga
     2221 taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct
     2281 tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg
     2341 acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc
     2401 tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca
     2461 aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc
     2521 tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac
     2581 agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact
     2641 gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca
     2701 gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa
     2761 gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc
     2821 agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc
     2881 tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg
     2941 tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct
     3001 tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc
     3061 tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggcaca
     3121 ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga
     3181 gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt
     3241 ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc
     3301 ccctcctgct gcctaataaa gagactgagc cct
//
LOCUS       HUMGPIIIAA   4487 bp ss-mRNA            PRI       27-JUL-1990
DEFINITION  Human platelet glycoprotein IIIa (GPIIIa) mRNA, complete cds.
ACCESSION   M35999
KEYWORDS    platelet glycoprotein IIIa.
SOURCE      Human megakaryocytes, cDNA to mRNA, clone IIIa[1,2,3].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4487)
  AUTHORS   Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and
            Marguerie,G.
  TITLE     GPIIb and GPIIIa amino acid sequences deduced from human
            megakaryocyte cDNAs
  JOURNAL   Mol. Biol. Rep. 14, 27-33 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         19     2385     plate glycoprotein IIIa (GPIIIa)
    mRNA    <     1     2603     GPIIIa mRNA
BASE COUNT     1046 a   1090 c   1186 g   1165 t
ORIGIN      
        1 ccgcgggagg cggacgagat gcgagcgcgg ccgcggcccc ggccgctctg ggtgactgtg
       61 ctggcgctgg gggcgctggc gggcgttggc gtaggagggc ccaacatctg taccacgcga
      121 ggtgtgagct cctgccagca gtgcctggct gtgagcccca tgtgtgcctg gtgctctgat
      181 gaggccctgc ctctgggctc acctcgctgt gacctgaagg agaatctgct gaaggataac
      241 tgtgccccag aatccatcga gttcccagtg agtgaggccc gagtactaga ggacaggccc
      301 ctcagcgaca agggctctgg agacagctcc caggtcactc aagtcagtcc ccagaggatt
      361 gcactccggc tccggccaga tgattcgaag aatttctcca tccaagtgcg gcaggtggag
      421 gattaccctg tggacatcta ctacttgatg gacctgtctt actccatgaa ggatgatctg
      481 tggagcatcc agaacctggg taccaagctg gccacccaga tgcgaaagct caccagtaac
      541 ctgcggattg gcttcggggc atttgtggac aagcctgtgt caccatacat gtatatctcc
      601 ccaccagagg ccctcgaaaa cccctgctat gatatgaaga ccacctgctt gcccatgttt
      661 ggctacaaac acgtgctgac gctaactgac caggtgaccc gcttcaatga ggaagtgaag
      721 aagcagagtg tgtcacggaa ccgagatgcc ccagagggtg gctttgatgc catcatgcag
      781 gctacagtct gtgatgaaaa gattggctgg aggaatgatg catcccactt gctggtgttt
      841 accactgatg ccaagactca tatagcattg gacggaaggc tggcaggcat tgtccagcct
      901 aatgacgggc agtgtcatgt tggtagtgac aatcattact ctgcctccac taccatggat
      961 tatccctctt tggggctgat gactgagaag ctatcccaga aaaacatcaa tttgatcttt
     1021 gcagtgactg aaaatgtagt caatctctat cagaactata gtgagctcat cccagggacc
     1081 acagttgggg ttctgtccat ggattccagc aatgtcctcc agctcattgt tgatgcttat
     1141 gggaaaatcc gttctaaagt cgagctggaa gtgcgtgacc tccctgaaga gttgtctcta
     1201 tccttcaatg ccacctgcct caacaatgag gtcatccctg gcctcaagtc ttgtatggga
     1261 ctcaagattg gagacacggt gagcttcagc attgaggcca aggtgcgagg ctgtccccag
     1321 gagaaggaga agtcctttac cataaagccc gtgggcttca aggacagcct gatcgtccag
     1381 gtcacctttg attgtgactg tgcctgccag gcccaagctg aacctaatag ccatcgctgc
     1441 aacaatggca atgggacctt tgagtgtggg gtatgccgtt gtgggcctgg ctggctggga
     1501 tcccagtgtg agtgctcaga ggaggactat cgcccttccc agcaggacga gtgcagcccc
     1561 cgggagggtc agcccgtctg cagccagcgg ggcgagtgcc tctgtggtca atgtgtctgc
     1621 cacagcagtg actttggcaa gatcacgggc aagtactgcg agtgtgacga cttctcctgt
     1681 gtccgctaca agggggagat gtgctcaggc catggccagt gcagctgtgg ggactgcctg
     1741 tgtgactccg actggaccgg ctactactgc aactgtacca cgcgtactga cacctgcatg
     1801 tccagcaatg ggctgctgtg cagcggccgc ggcaagtgtg aatgtggcag ctgtgtctgt
     1861 atccagccgg gctcctatgg ggacacctgt gagaagtgcc ccacctgccc agatgcctgc
     1921 acctttaaga aagaatgtgt ggagtgtaag aagtttgacc ggggagccct acatgacgaa
     1981 aatacctgca accgttactg ccgtgacgag attgagtcag tgaaagagct taaggacact
     2041 ggcaaggatg cagtgaattg tacctataag aatgaggatg actgtgtcgt cagattccag
     2101 tactatgaag attctagtgg aaagtccatc ctgtatgtgg tagaagagcc agagtgtccc
     2161 aagggccctg acatcctggt ggtcctgctc tcagtgatgg gggccattct gctcattggc
     2221 cttgccgccc tgctcatctg gaaactcctc atcaccatcc acgaccgaaa agaattcgct
     2281 aaatttgagg aagaacgcgc cagagcaaaa tgggacacag ccaacaaccc actgtataaa
     2341 gaggccacgt ctaccttcac caatatcacg taccggggca cttaatgata agcagtcatc
     2401 ctcagatcat tatcagcctg tgccaggatt gcaggagtcc ctgccatcat gtttacagag
     2461 gacagtattt gtggggaggg atttcggggc tcagagtggg gtaggttggg agaatgtcag
     2521 tatgtggaag tgtgggtctg tgtgtgtgta tgtgggggtc tgtgtgttta tgtgtgtgtg
     2581 ttgtgtgtgg gagtgtgtaa tttaaaattg tgatgtgtcc tgataagctg agctccttag
     2641 cctttgtccc agaatgcctc ctgcagggat tcttcctgct tagcttgagg gtgactatgg
     2701 agctgagcag gtgttcttca ttacctcagt gagaagccag ctttcctcat caggccattg
     2761 tccctgaaga gaagggcagg gctgaggcct ctcattccag aggaagggac accaagcctt
     2821 ggctctaccc tgagttcata aatttatggt tctcaggcct gactctcagc agctatggta
     2881 ggaactgctg gcttggcagc ccgggtcatc tgtacctctg cctcctttcc cctccctcag
     2941 gccgaaggag gagtcaggga gagctgaact attagagctg cctgtgcctt ttgccatccc
     3001 ctcaacccag ctatggttct ctcgcaaggg aagtccttgc aagctaattc tttgacctgt
     3061 tgggagtgag gatgtctggg ccactcaggg gtcattcatg gcctggggga tgtaccagca
     3121 tctcccagtt cataatcaca acccttcaaa gatttgcctt attggcagct ctactctgga
     3181 ggtttgttta gaagaagtgt gtcaccctta ggccagcacc atctctttac ctcctaattc
     3241 cacaccctca ctgctgtaga catttgctat gacctgggga tgtctctcat gaccaaatgc
     3301 ttttcctcaa agggagagag tgctattgta gagccagagg tctggcccta tgcttccggc
     3361 ctcctgtccc tcatccatag cacctccaca tacctggccc tgagccttgg tgtgctgtat
     3421 ccatccatgg ggctgattgt atttaccttc tacctcttgg ctgccttgtg aaggaattat
     3481 tcccatgagt tggctgggaa taagtgccag gatggaatga tgggtcagtt gtatcagcac
     3541 gtgtggcctg ttcttctatg ggttacaacc tcatttaact cagtctttaa tctgagaggc
     3601 cacagtgcaa ttttatttta tttttctcat gatgaggttt tcttaactta aaagaacatg
     3661 tatataaaca tgcttgcatt atatttgtaa atttatgtgt atggcaaaga aggagagcat
     3721 aggaaaccac acagacttgg gcagggtaca gacactccca cttggcatca ttcacagcaa
     3781 gtcactggcc agtggctgga tctgtgaggg gctctctcat gatagaaggc tatggggata
     3841 gatgtgtgga cacattggac ctttcctgag gaagagggac tgttcttttg tcccagaaaa
     3901 gcagtggctc cattggtgtt gacatacatc caacattaaa agccaccccc aaatgcccaa
     3961 gaaaaaaaga aagacttatc aacatttgtt ccatgagcag aaaactggag ctctggcctc
     4021 agtgttacag ctaaataatc tttaattaag gcaagtcact ttcttcttct taaagctgtt
     4081 tctagtttga gaaatgatgg gattttagca gccagtcttg aaggtctctt tcagtatcaa
     4141 cattctaaga tgctgggact tactgtgtca tcaaatgtgc ggttaagatt ctctgggata
     4201 ttgatactgt ttgtgttttt agttgggaga tctgagagac ctggctttgg caagagcaga
     4261 tgtcattcca tatcaccttt ctcaatgaaa gtctcattct atcctctctc caaacccgtt
     4321 ttccaacatt tgttaatagt tacgtctctc ctgatgtagc acttaagctt catttagtta
     4381 ttatttcttt cttcactttg cacacatttg catccacata ttagggaagg aataagtagc
     4441 tgcaaactat ctattcctgt attattgtgt taacattgag ataaacc
//
LOCUS       MUSCDX1A     1597 bp ss-mRNA            ROD       27-JUL-1990
DEFINITION  Mouse Cdx-1 homeobox protein mRNA, complete cds.
ACCESSION   M37163 M35125
KEYWORDS    Cdx-1 homeobox protein.
SOURCE      Mouse 8.5 day embryo, cDNA to mRNA, clone BH8.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1597)
  AUTHORS   Duprey,P., Chowdhury,K., Dressler,G.R., Balling,R., Simon,D.,
            Guenet,J.-L. and Gruss,P.
  TITLE     A mouse gene homologous to the Drosophila gene caudal is expressed
            in epithelial cells from the embryonic intestine
  JOURNAL   Genes Dev. 2, 1647-1654 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        158      736     Cdx-1 homeobox protein
    site        389      571     Cdx-1 homeobox
BASE COUNT      326 a    523 c    444 g    304 t
ORIGIN      
        1 gccgcgcccg ctgggccctc cgacctatgc gccccctggc ccggcgcccg cgcccgcagt
       61 accccgactt cgcgggttac acgcacgtgg agccggcgcc cgcgccctcc gacctgggct
      121 gcgcttccct gcgcccaagg acgactgggc agctgctatg gcccgggccc cacggcctca
      181 gcccgcagcc cggccccgct ggccttcggg ccctccggac tttagcccgg tgcccgcgcc
      241 tcccgggcct ggtcccggca tcctagcgca gtccctcggg gctccgggcg caccgtcctc
      301 gccaggagcg ccaaggcgga cgccctacga atggatgcgg cgcagcgtgg cggctgcagg
      361 cggctgtggc agcggtaaga cccgaaccaa ggacaagtca cgtgtggtct acacagacca
      421 ccaacgccta gagctggaaa aggagtttca ctacagccgg tacatcacta tccggcgcaa
      481 gtccgagctg gctgctaacc tggggctcac agagcggcag gtaaagatct ggttccagaa
      541 ccgccgggcc aaggagcgca aagtaaacaa gaagaaacag cagcagcagc agcccctgcc
      601 tcccacacag ctgcccctgc ccctggatgg cactcccaca ccatcagggc cacccctagg
      661 aagtctatgc cctactaatg ctggccttct gggcaccccc tccccagtgc ccgtcaagga
      721 ggagtttcta ccctagcccc ttgcagcctg gggtctaggg atctagggac ttgaatgctg
      781 ggcagctggc cttttcgggg cccaaggacg tgttctgagt gcagccctgc cttggccctt
      841 tggggtcact gtggacaaac tgcctaccta ggacaagtta gcttgccctc ttcctgcctt
      901 ccgttggctg ggctgtgtgg tgagcctgtt ggaccaagga gctcttccag ctcttgtgtc
      961 ctagggcaca cagggtgatg gggaagccca attagaagtc tcaatcacct ggacatttcc
     1021 caaagattca gaggctcaac gagctgtcaa gacaagattg aggcttcagc gtctcctcca
     1081 agggttccag agtgaggtgg gaggctggta tctggaccag actggctctg gagagtaggc
     1141 ggccattgaa acaaggtggt acaccacagt ctagactccc atgtgaaggg aagatgctat
     1201 ctgcccacct gtctcctctt ccagcctcac tttcacctgc cctagcaccc catgctctgg
     1261 ccctccccag cctggagcta caaagccatc aggactgaca tgaggtagag gctgggcaat
     1321 tggccactgc tcttgagaga gaactcgccc ccaccttagt gtccagggag gcctgcggat
     1381 ggagagaagg cactatgagt ctacctacct gcacatgccc accttcccac aggctgccca
     1441 tcacacccca tttagccccc acaccatggg ggactcgcag ctgtacaggg cttatgtagg
     1501 agtgtaatct cttggatgca gcttcagaaa taagttttcc cccttttaaa caatttataa
     1561 aaatcataca aggcattaaa gaatttttaa aaagttt
//
LOCUS       MUSHSP70B    3518 bp ds-DNA             ROD       27-JUL-1990
DEFINITION  Mouse heat shock protein 70.1 (hsp70.1) gene, complete cds.
ACCESSION   M35021
KEYWORDS    heat shock protein 70.1.
SOURCE      Mouse (strain AJ) kidney DNA, clone pM[1.2,2.3].
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3518)
  AUTHORS   Hunt,C. and Calderwood,S.
  TITLE     Characterization and sequence of a mouse hsp70 gene and its
            expression in mouse cell lines
  JOURNAL   Gene 87, 199-204 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        806     2734     heat shock protein 70.1 (hsp70.1)
    signal      507      512     inverted CCAAT box
    signal      547      552     TATA box
BASE COUNT      827 a    993 c   1014 g    684 t
ORIGIN      
        1 gatctcttct atttccctat tcaaacctaa aatgaagagg gagggggaga catggacaag
       61 caagcattcc acaggcgccc ctgcccaacg ctgtcactca aaccaggacc caatcacaga
      121 ctttttagcc aagccttatc ccgcctctct tgagaaactt tctgcgtccg ccatcctgta
      181 ggaaggattt gtacacttta aactccctcc ctggtctgag tcccacactc tcaccaccca
      241 gcaccttcag gagctgaccc ttaacagctt cacccacagg gaccccgaag ttgcgtcgcc
      301 tccgcaacag tgtcaatagc agcaccagca cttccccaca ccctccccct caggaatccg
      361 tactctctag cgaaccccag aaacctctgg agagttctgg acaagggcgg aacccacaac
      421 tccgattact caagggaggc ggggaagctc caccagacgc gaaactgctg gaagattcct
      481 ggccccaagg cctcctccgg ctcgctgatt ggcccagcgg agagtgggcg gggccggtga
      541 agactcctta aaggcgcagg gcggcgagca gggcaccaga cgctgacagc tactcagaat
      601 caaatctggt tccatccaga gacaagcgaa gacaagagaa gcagagcgag cggcgcgttc
      661 ccgatcctcg gccaggacca gccttcccca gagcatccac gccgcggagc gcaaccttcc
      721 caggagcatc cctgccgcgg agcgcaactt tccccggagc atccacgccg cggagcgcag
      781 ccttccagaa gcagagcgcg gcgccatggc caagaacacg gcgatcggca tcgacctggg
      841 caccacctac tcgtgcgtgg gcgtgttcca gcacggcaag gtggagatca tcgccaacga
      901 ccagggcaac cgcacgaccc ccagctacgt ggccttcacc gacaccgagc gcctcatcgg
      961 ggacgccgcc aagaaccagg tggcgctgaa cccgcagaac accgtgttcg acgcgaagcg
     1021 gctgatcggc cgcaagttcg gcgatgcggt ggtgcagtcc gacatgaagc actggccctt
     1081 ccaggtggtg aacgacggcg acaagcccaa ggtgcaggtg aactacaagg gcgagagccg
     1141 gtcgttcttc ccggaggaga tctcgtccat ggtgctgacg aagatgaagg agatcgctga
     1201 ggcgtacctg ggccacccgg tgaccaacgc ggtgatcacg gtgcccgcct acttcaacga
     1261 ctctcagcgg caggccacca aggacgcggg cgtgatcgcc ggtctaaacg tgctgcggat
     1321 catcaacgag cccacggcgg ccgccatcgc ctacgggctg gaccggaccg gcaagggcga
     1381 gcgcaacgtg ctcatcttcg acctgggggg cggcacgttc gacgtgtcca tcctgacgat
     1441 cgacgacggc atcttcgagg tgaaggccac ggcgggcgac acgcacctgg gaggggagga
     1501 cttcgacaac cggctggtga gccacttcgt ggaggagttc aagaggaagc acaagaagga
     1561 catcagccag aacaagcgcg cggtgcggcg gctgcgcacg gcgtgtgaga gggccaagag
     1621 gacgctgtcg tccagcaccc aggccagcct ggagatcgac tctctgttcg agggcatcga
     1681 cttctacaca tccatcacgc gggcgcggtt cgaagagctg tgctcggacc tgttccgcgg
     1741 cacgctggag cccgtggaga aggccctgcg cgacgccaag atggacaagg cgcagatcca
     1801 cgacctggtg ctggtgggcg gctcgacgcg catccccaag gtgcagaagc tgctgcagga
     1861 cttcttcaac gggcgcgacc tgaacaagag catcaacccg gacgaggcgg tggcctacgg
     1921 ggcggcggtg caggcggcca tcctgatggg ggacaagtcg gagaacgtgc aggacctgct
     1981 gctgctggac gtggcgccgc tgtcgctggg cctggagact gcgggcggcg tgatgacggc
     2041 gctcatcaag cgcaactcca ccatccccac caagcagacg cagaccttca ccacctactc
     2101 ggacaaccag cccggggtgc tgatccaggt gtacgagggc gagagggcca tgacgcgcga
     2161 caacaacctg ctggggcgct tcgagctgag cggcatcccg ccggcgccca ggggcgtgcc
     2221 gcagatcgag gtgaccttcg acatcgacgc caacggcatc ctgaacgtca cggccaccga
     2281 caagagcacc ggcaaggcca acaagatcac catcaccaac gacaagggcc gcctgagcaa
     2341 ggaggagatc gagcgcatgg tgcaggaggc cgagcgctac aaggccgagg acgaggtgca
     2401 gcgcgacagg gtggccgcca agaacgcgct cgagtcctat gccttcaaca tgaagagcgc
     2461 cgtggaggac gagggtctca agggcaagct cagcgaggct gacaagaaga aggtgctgga
     2521 caagtgccag gaggtcatct cctggctgga ctccaacacg ctggccgaca aggaggagtt
     2581 cgtgcacaag cgggaggagc tggagcgggt gtgcagcccc atcatcagtg ggctgtacca
     2641 gggtgcgggt gctcctgggg ctgggggctt cggggcccag gcgccgccga aaggagcctc
     2701 tggctcagga cccaccatcg aggaggtgga ttagaggcct ctgctggctc tcccggtgtg
     2761 gtctagaaaa cagactcttt gcacttgata gctgcttggg caccgattac tgtcaaggtt
     2821 atttaaagtc ttcttcatgg ttcagtttaa agttacagtc tttcttaagg taattgcgtt
     2881 gactgttaaa ttttgtatgc atatatatat atatatatat atatatatat atattcaaat
     2941 atattcaaag taatgttggg agcagcactg tgcactgtac caggggatta tgttttatag
     3001 ctaatgatgt gtaaagtcta aagatttttt tgtaattttt atatcagtgt tccagtagcc
     3061 tgggaagaca tatagtctag ctgcccagtt ccctggagat ggtcatctct aagacaaagt
     3121 gtcttaaaca aacgtcttgg cactgtgtac tacataactt tactcttttg tacttaaaac
     3181 tttatctgct tgtccatgtt aaggttttgt ggtataacca gtatgttctt tgcatttaat
     3241 ctaagtaggt taaagatggt gtatccttcc tgcatacatg tctacactgc caccctgtgt
     3301 acattttttt ctttgcatca ctacaaacta atgaaaaaaa cttttatgac ttaaatattc
     3361 aaaataaaag gttacaagta tattttgtct gtttgtatgt tggaagggct aatggattct
     3421 gggcttctgt ggatttctta agtttttttt aagatttatt attatatgtg aacacattgt
     3481 agctatcttc agacacacca gaaaagggca tcagatct
//
LOCUS       RDVS10       1319 bp ds-RNA             VRL       27-JUL-1990
DEFINITION  Rice dwarf virus segment S10 mRNA, encoding an ORF, complete cds.
ACCESSION   D00241
KEYWORDS    S10 RNA.
SOURCE      Rice dwarf virus, cDNA to viral RNA.
  ORGANISM  Rice dwarf virus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus.
REFERENCE   1  (bases 1 to 1319)
  AUTHORS   Omura,T., Minobe,Y. and Tsuchizaki,T.
  TITLE     Nucleotide sequence of segment S10 of the rice dwarf virus genome
  JOURNAL   J. Gen. Virol. 69, 227-231 (1988)
  STANDARD  simple automatic
COMMENT     Three contiguous termination codons were detected at the third
            triplet downstream from the first stop codon.
            
            Corresponds to DDBJ (rel3.0) entry <RDVS1O>(D00241) dated
            20-JUN-1988
FEATURES       from  to/span     description
    pept         27     1088     ORF
BASE COUNT      354 a    279 c    284 g    402 t
ORIGIN      5' end of the plus-sense strand of segment S10.
        1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc
       61 atcatgagct cctttgtgct cacgaaggac caagtattat ttccaagttt gatgcaatta
      121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag
      181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact
      241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg
      301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt
      361 ttatctttat ccctgagact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc
      421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt
      481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc
      541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag
      601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat
      661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac
      721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt
      781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc
      841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc
      901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca
      961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta
     1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca
     1081 gttcctgatc gcactgatga tagtgctttt catttatcaa aatattacgg tggggtcatc
     1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac
     1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccataggtg ggttagagtc
     1261 gtgtgcgcac gcacttagtt ctactaatgc gtgcccccac actgcgcagg gattctgat
//
LOCUS       RDVS10A      1320 bp ds-RNA             VRL       27-JUL-1990
DEFINITION  Rice dwarf virus segment 10 mRNA, encoding an ORF, complete cds.
ACCESSION   M35118
KEYWORDS    .
SOURCE      Rice dwarf virus, cDNA to viral RNA, clones pRD[545,112,367].
  ORGANISM  Rice dwarf virus
            Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus.
REFERENCE   1  (bases 1 to 1320)
  AUTHORS   Uyeda,I., Matsumura,T., Sano,T., Ohshima,K. and Shikata,E.
  TITLE     Nucleotide sequence of rice dwarf virus genome segment 10
  JOURNAL   Proc. Jpn. Acad. 63, 227-230 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         27     1088     ORF
BASE COUNT      357 a    280 c    282 g    401 t
ORIGIN      
        1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc
       61 atcatgagct cctttctgct cacgaaggac caagtattat ttccaagttt gatgcaatta
      121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag
      181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact
      241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg
      301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt
      361 ttatctttat ccctgaaact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc
      421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt
      481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc
      541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag
      601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat
      661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac
      721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt
      781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc
      841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc
      901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca
      961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta
     1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca
     1081 gttcctaatc gcactgatga tagtgcttat catttatcaa aatattacgg tggggtcatc
     1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac
     1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccatagggt gggttagagt
     1261 cgtgtgcgca cgccacttag ttctactaat gcgtgccccc acatgcgcag ggattctgat
//
LOCUS       SRVAGSM      3651 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Sapporo rat virus mRNA M segment, encoding an ORF, complete cds.
ACCESSION   M34882
KEYWORDS    .
SOURCE      Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero
            E6 cell line C1008.
  ORGANISM  Sapporo rat virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Bunyaviridae; Hantavirus.
REFERENCE   1  (bases 1 to 3651)
  AUTHORS   Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and
            Schmaljohn,C.S.
  TITLE     Coding properties of the S and the M genome segments of Sapporo rat
            virus: Comparison to other causative agents of hemorrhagic fever
            with renal syndrome
  JOURNAL   Virology 176, 114-125 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         47     3448     M segment ORF
BASE COUNT     1119 a    692 c    755 g   1085 t
ORIGIN      
        1 tagtagtaga ctccgcaaga aacagcagtt aaagaacaat aggatcatgt ggagtttgct
       61 attactggcc gctttagttg gccaaggctt tgcattaaaa aatgtatttg acatgagaat
      121 tcagttgccc cactcagtca actttgggga aacaagtgtg tcaggctata cagaatttcc
      181 cccactctca ttacaggagg cagaacagct agtgccagag agctcatgca acatggacaa
      241 ccaccagtca ctctcaacaa taaataaatt aaccaaggtc atatggcgga aaaaagcaaa
      301 tcaggaatca gcaaaccaga attcatttga agttgtggaa agtgaagtca gctttaaagg
      361 gttgtgtatg ttaaagcata gaatggttga agaatcatat agaaatagga gatcagtaat
      421 ctgttatgat ctagcctgta atagtacatt ctgtaaacca actgtttata tgattgttcc
      481 tatacatgct tgcaacatga tgaaaagctg tttgattggc cttggcccct acagaatcca
      541 ggttgtctat gaaaggacat actgcactac gggtatattg acagaaggaa aatgctttgt
      601 ccctgacaag gctgttgtca gtgcattgaa aagaggcatg tatgctatag caagcataga
      661 gacaatctgc ttttttattc atcagaaagg gaatacatat aagatagtga ctgccattac
      721 atcagcaatg ggctccaaat gtaataatac agatactaaa gttcaaggat attatatctg
      781 tattattggt ggaaactccg cccctgtata tgcccctgct ggtgaagact tcagagcaat
      841 ggaggttttt tctgggatta ttacatcacc acatggagaa gaccatgacc tacccggcga
      901 agaaatcgca acgtaccaga tttcagggca gatagaggca aaaatccctc atacagtgag
      961 ctccaaaaac ttaaaattga ctgcttttgc aggtattcca tcatactcat caactagtat
     1021 attggctgct tcagaagatg gtcgtttcat atttagtcct ggtttatttc ctaacctaaa
     1081 tcagtcagtc tgtgacaaca atgcactccc tttaatctgg aggggcctaa ttgatttaac
     1141 gggatactat gaggcagtcc acccttgcaa tgtgttctgt gtcttatcag gaccaggtgc
     1201 ttcatgtgag gccttttcag aaggaggtat tttcaatatt acttctccaa tgtgtctggt
     1261 gtctaagcaa aatagattta gagcagctga gcagcagatt agctttgtct gccaaagagt
     1321 tgatatggat attatagtgt actgtaatgg tcagaaaaaa acaatcctaa caaaaacatt
     1381 agttataggc caatgtattt atactattac aagtctcttt tcactgttac caggggttgc
     1441 ccattctatt gctattgagt tgtgtgttcc agggtttcat ggctgggcca cagctgcact
     1501 tttgattaca ttctgcttcg gctgggtatt gattcctgca tgtacattag ctattctttt
     1561 agtccttaag ttctttgcaa atatccttca tacaagcaat caagagaacc gattcaaagc
     1621 cattctacgg aaaataaagg aggagtttga aaaaacaaag ggttccatgg tttgtgagat
     1681 ctgtaagtat gagtgtgaaa cattaaagga attgaaggca cataacctat catgtgttca
     1741 aggagagtgc ccatattgct ttacccactg tgaaccgaca gaaactgcaa ttcaggcaca
     1801 ttacaaagtt tgtcaagcca cccaccgatt cagagaagat ttaaaaaaga ctgtaactcc
     1861 tcaaaatatt gggccaggct gttaccgaac actaaatctt tttaggtata aaagtaggtg
     1921 ttatattctg acaatgtgga ctcttcttct cattattgaa tccatcctct gggcagcaag
     1981 tgcagcagaa atcccccttg tccctctctg gacagataat gctcatggcg ttgggagtgt
     2041 tcctatgcat acggatcttg aattagactt ctctttgcca tccagttcta agtacacata
     2101 caaaagacat ctcacaaacc cagttaatga ccaacagagt gtctcattgc atatagaaat
     2161 tgaaagtcaa ggcattggtg ctgctgttca tcatcttgga cattggtatg atgcaagatt
     2221 gaatctaaaa acctcatttc attgttatgg tgcctgcaca aaatatcaat acccatggca
     2281 cactgcaaaa tgccattttg agaaagatta tgagtatgaa aatagctggg cttgcaaccc
     2341 cccagattgc ccaggggttg gtacaggttg tactgcttgt ggattatatc tagatcaatt
     2401 gaagccggta ggaacagcct ttaaaattat aagtgtaaga tacagtagaa aagtgtgcgt
     2461 gcagtttggt gaagaacacc tttgtaaaac aattgatatg aatgattgct ttgtgactag
     2521 gcatgccaaa atatgtataa ttgggactgt atctaagttt tctcaaggtg acactctact
     2581 atttctgggg cccatggaag gaggtggtat aatctttaaa cactggtgta catctacctg
     2641 tcactttgga gaccctggtg atgtcatggg tccaaaagat aaaccattta tttgccctga
     2701 attcccaggg caatttagga aaaaatgtaa ctttgccaca actccagttt gtgaatatga
     2761 tggaaacatt atatcaggct ataagaaagt acttgcaaca attgattctt tccaatcatt
     2821 taacacaagc aatatacact tcactgatga gagaattgaa tggagagacc ctgatggcat
     2881 gcttcgggat catattaata ttgttatttc taaagatatt gattttgaaa atttggctga
     2941 gaatccttgt aaagtagggc tccaggcagc aaacatagaa ggtgcctggg gttcaggtgt
     3001 cgggtttaca ctcacatgca aggtgtctct cacagaatgc ccaacatttc ttacatcaat
     3061 aaaggcctgt gacatggcaa tttgttatgg tgcagaaagt gtgacactct cacgaggaca
     3121 aaatactgtc aaaattaccg ggaaaggtgg ccatagtggt tcttcattca aatgctgtca
     3181 tgggaaagaa tgttcatcaa ctggcctcca agccagtgca ccacatctgg ataaggtaaa
     3241 tggtatctct gagttagaaa acgagaaagt ttatgatgac ggtgcacctg aatgtggcat
     3301 tacttgttgg tttaaaaaat caggtgaatg ggttatgggt ataatcaatg ggaactgggt
     3361 tgtcctaatt gtcttgtgtg tactgctgct cttttctctt atcctgttga gcatcttgtg
     3421 tcctgttaga aagcataaaa aatcataaat cccacctaac aatcttcaca tcatgtatcg
     3481 attttcaaac actttatcat ttagaactta acttggcact actatctgat aactgacttt
     3541 catttttatt tttatatgga ttaattacta aaaaaaatac tctcttctat ctcccaatct
     3601 tttattgatt caccggggtg ctgtcttgac atctggcggc gtctactact a
//
LOCUS       SRVAGSS      1769 bp ss-RNA             VRL       27-JUL-1990
DEFINITION  Sapporo rat virus mRNA S segment, encoding an ORF, complete cds.
ACCESSION   M34881
KEYWORDS    .
SOURCE      Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero
            E6 cell line C1008.
  ORGANISM  Sapporo rat virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Bunyaviridae; Hantavirus.
REFERENCE   1  (bases 1 to 1769)
  AUTHORS   Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and
            Schmaljohn,C.S.
  TITLE     Coding properties of the S and the M genome segments of Sapporo rat
            virus: Comparison to other causative agents of hemorrhagic fever
            with renal syndrome
  JOURNAL   Virology 176, 114-125 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         43     1332     S segment ORF
BASE COUNT      569 a    331 c    410 g    459 t
ORIGIN      
        1 tagtagtaga ctccctaaag agctactaca ctaacaagaa aaatggcaac tatggaagaa
       61 atccagagag aaatcagtgc tcacgagggg cagcttgtga tagcacgcca gaaggtcaag
      121 gatgcagaaa agcagtatga gaaggatcct gatgacttaa acaagagggc actgcatgat
      181 cgggagagtg tcgcagcttc aatacaatca aaaattgatg aattgaagcg ccaacttgcc
      241 gacagattgc agcagggaag aacatccggg caggaccggg atcctacagg ggtagagcca
      301 ggtgatcatc ttaaggaaag atcagcacta agctacggga atacactgga cctgaatagt
      361 cttgacattg atgaacctac aggacagaca gctgattggc tgaccataat tgtctatctg
      421 acatcattcg tggtcccgat catcttgaag gcactgtaca tgttaacaac acgaggtagg
      481 cagacttcaa aggacaacaa ggggatgagg atcagattca aggatgacag ctcatatgag
      541 gatgtcaatg gaatcagaaa gcccaaacat ctgtatgtgt caatgccaaa cgcccaatcc
      601 agcatgaagg ctgaagagat aacaccagga agattccgca ctgcagtatg tggactatat
      661 cctgcacaga taaaggcaag gaatatggta agccctgtca tgagtgtagt tgggttcttg
      721 gcactggcaa aagactggac atcgagaatt gaagaatggc tcggtgcacc ctgcaaattc
      781 atggcggagt ctcttattgc cgggagttta tctgggaatc ctgtgaatcg tgactatatc
      841 agacagagac aaggtgcact tgcagggatg gagccaaagg aatttcaagc cctcaggcaa
      901 cattcaaagg atgctggatg tacactagtt gaacatattg agtcaccatc atcaatatgg
      961 gtgtttgctg gggcccctga taggtgtcca ccaacatgct tgtttgtcgg agggatggct
     1021 gaattaggtg ccttcttttc tatacttcag gatatgagga acacaatcat ggcttcaaaa
     1081 actgtgggca cagctgatga aaagcttcga aagaaatcat cattctatca atcatacctc
     1141 agacgcacac aatcaatggg aatacaactg gaccagagga taattgttat gtttatggtt
     1201 gcctggggaa aggaggcagt ggacaacttt catctcggtg atgacatgga tccagagctt
     1261 cgtagcctgg ctcagatctt gattgaccag aaagtgaagg aaatctcaaa ccaggaacct
     1321 atgaaattat aagtacataa atatataatc aatactaact ataggttaag aaatactaat
     1381 cattagttaa taagaatata gatttattga ataatcatat taaataatta ggtaagttaa
     1441 ctagtattta gttaagttag ctaattgatt tatatgattg tcacaattaa atgtaatcat
     1501 aagcacaatc actgccatgt ataatcacgg gtatacgggt ggttttcata tggggaacag
     1561 ggtgggctta gggccaggtc accttaagtg accttttttt gtatatatgg atgtagattt
     1621 caattgatcg aatactaatc ctactgtcct cttttctttt cctttctcct tctttactaa
     1681 caacaacaaa ctacctcaca ccttaatata tactacttta ttaagttgtt aagttgtgtc
     1741 tttttgggga gtaagggagt ctactacta
//
LOCUS       TRNSTIIA     1315 bp ds-DNA             BCT       27-JUL-1990
DEFINITION  Transposon Tn4521 (from E.coli) heat-stable toxin II gene,
            complete cds.
ACCESSION   M35123
KEYWORDS    heat-stable toxin II; insertion sequence; transposase.
SOURCE      Transposon Tn4521 (from Escherichia coli) DNA.
  ORGANISM  Transposon Tn4521
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1315)
  AUTHORS   Hu,S.T. and Lee,C.H.
  TITLE     Characterization of the transposon carrying the STII gene of
            enterotoxigenic Escherichia coli
  JOURNAL   Mol. Gen. Genet. 214, 490-495 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        879      400 (c) heat-stable toxin II (STII)
    trns         88      607     insertion sequence IS2 (bases 17-536)
    trns        608      970     insertion sequence IS2 (bases 969-1327)
BASE COUNT      313 a    349 c    320 g    333 t
ORIGIN      
        1 gccgccggca gatggagctg gatattctaa aaaagacccg ggcatcagta tcagtcacct
       61 gaacaacaga gaaaaaacaa agcttgcttt ccagacacct gttatcactt aacccattac
      121 tggcccgctg ccgtagatat tcccgtggcg agcgataacc cagtgcacta tgcggatgcc
      181 attcgttata atgctcgaac gcctctgcaa ggttctttgc tgccgttaac ccgtctggtt
      241 tgggcatgat actgatgtaa tcacgcttta tcgttttcac gaagctctct gctatgccgt
      301 tactctccgg actccgcacc gccgtgttct tcggttcaag tcccaacatc cgggcgaact
      361 ggcgtgtttc attagcccgg tagcatgaac cattatccgt cagccattcc actggcgata
      421 tcggaagctc gctgccgaag gggcgtacca ccgctcccag catgacgtcc tgtactgttt
      481 cactgtcgaa gccgcccgta gtaaccgccc agtgcagtgc ctcacggtca cagcagtcca
      541 gcgcgaacgt gactcgcagt ttttctccgt tatcacagcg gaactcgaac ccgtcagagc
      601 accatcgttt tttttgccga gcaggcgctg gagttcttta atctgcttca tggcagcagc
      661 aagttcagag gcaggaacga cctgttctcc tgcggccaca gcagtaagac ttcccttctg
      721 gtattgcttg cgccagagaa ataactggct ggctgccgca ccgtgttgcc gggtaacaag
      781 ggagcccgtc attcccggtt caaagctctg ctgaacaata gcgatctttt cctgtgtggt
      841 acgccgtctg cgtttctccg gccctaagac atcactcatc tgctctccaa tgactagtct
      901 aaaaactagt attaagacta tcacttaaat aagtgatatt ggttgtctgg agattcaggg
      961 ggccagtcta cggcctatgt cggtctggtg tcaaaacagt caggctccgg agataaagtt
     1021 cgactactgg gaataagcaa gtgtggtgat acttatctcc gaacattatt tattcatgga
     1081 gcaagggcgg cagcattgct ggtcaaagaa ctagggccat ggatacacaa cttaagatac
     1141 aacgactgtc tggtgtttca gggggccagc tcaaacattt ggcaaagata tacgtcttct
     1201 ttacaatccg gaggtgttaa cggctctact tgtatcagct acgttctggt tatacgtaat
     1261 agcataagaa tggtcgtgcg catacacgct ctggaaagca gaactagctg gacga
//

GenBank-Updates@genbank.bio.net (07/28/90)

LOCUS       MUSMDR1A     4924 bp ss-mRNA            ROD       28-JUL-1990
DEFINITION  Mouse P-glycoprotein (mdr1a) mRNA, complete cds.
ACCESSION   M33581
KEYWORDS    P-glycoprotein.
SOURCE      Mouse (strain BALB/c/NIH) macrophage-like cell line
            J774.2-vinblastine resistant subline J7.V1-1, cDNA to mRNA, library
            pUC18-cDNA and pGEM-zf, clones pV1.PRC2, pV1.3, pV1.20, and pV1.10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4924)
  AUTHORS   Hsu,S., Cohen,D., Lothstein,L., Kirschner,L.S., Hartstein,M. and
            Horwitz,S.B.
  TITLE     Structural analysis of the mouse mdr1a (P-glycoprotein) promoter
            reveals the basis for differential transcript heterogeneity in
            multidrug-resistant J774.2 cells
  JOURNAL   Mol. Cell. Biol. 10, 3596-3606 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.S.Kirschner, 05-APR-1990. Albert Einstein College of Medicine,
            1300 Morris Park Ave, Bronx, NY 10461
FEATURES       from  to/span     description
    pept        137     3967     P-glycoprotein (mdr1a)
    signal     4315     4320     poly-A signal
    signal     4898     4903     poly-A signal
BASE COUNT     1450 a   1021 c   1210 g   1243 t
ORIGIN      Chromosome 5.
        1 acagtggaac agcggtttcc aggagctgct ggtcccatct tccaaggctc tgctcaactc
       61 agagccgctt cttccaaagt ctacatcttg gtggactttg cagaggaaac cgggagtaga
      121 gacacgtgag gccgtgatgg aacttgaaga ggaccttaag ggaagagcag acaagaactt
      181 ctcaaagatg ggcaaaaaga gtaaaaagga gaagaaagaa aagaaaccag cagtcagtgt
      241 gcttacaatg tttcgttatg caggttggct agacaggttg tacatgctgg tgggaactct
      301 ggctgctatt atccatggag tggcgctccc acttatgatg ctgatctttg gtgacatgac
      361 agatagcttt gcaagtgtag gaaacgtctc taaaaacagt actaatatga gtgaggccga
      421 taaaagagcc atgtttgcca aactggagga agaaatgacc acgtacgcct actattacac
      481 cgggattggt gctggtgtgc tcatagttgc ctacatccag gtttcatttt ggtgcctggc
      541 agctggaaga cagatacaca agatcaggca gaagtttttt catgctataa tgaatcagga
      601 gataggctgg tttgatgtgc atgacgttgg ggagctcaac acccggctca cagatgatgt
      661 ttccaaaatt aatgaaggaa ttggtgacaa aatcggaatg ttcttccagg caatggcaac
      721 attttttggt ggttttataa taggatttac ccgtggctgg aagctaaccc ttgtgatttt
      781 ggccatcagc cctgttcttg gactgtcagc tggtatttgg gcaaagatat tgtcttcatt
      841 tactgataag gaactccatg cttatgcaaa agctggagca gttgctgaag aagtcttagc
      901 agccatcaga actgtgattg cgtttggagg acaaaagaag gaacttgaaa ggtacaataa
      961 caacttggaa gaagctaaaa ggctggggat aaagaaagct atcacggcca acatctccat
     1021 gggtgcagct tttctcctta tctatgcatc atatgctctg gcattctggt atgggacttc
     1081 cttggtcatc tccaaagaat actctattgg acaagtgctc actgtcttct tttccgtgtt
     1141 aattggagca ttcagtgttg gacaggcatc tccaaatatt gaagccttcg ccaatgcacg
     1201 aggagcagct tatgaagtct tcaaaataat tgataataag cccagtatag acagcttctc
     1261 aaagagtggg cacaaaccag acaacataca aggaaatctg gaatttaaga atattcactt
     1321 cagttaccca tctcgaaaag aagttcagat cttgaagggc ctcaatctga aggtgaagag
     1381 cggacagacg gtggccctgg ttggcaacag tggctgtgga aaaagcacaa ctgtccagct
     1441 gatgcaaagg ctctacgacc ccctagatgg catggtcagt atcgacggac aggacatcag
     1501 aaccatcaat gtgaggtatc tgagggagat cattggtgtg gtgagtcagg aacctgtgct
     1561 gtttgccacc acgatcgccg agaacattcg ctatggccga gaagatgtca ccatggatga
     1621 gattgagaaa gctgtcaagg aagccaatgc ctatgacttc atcatgaaac tgccccacca
     1681 atttgacacc ctggttggtg agagaggggc gcacgtgagt gggggacaga aacagagaat
     1741 cgccattgcc cgggccctgg tccgcaatcc caagatcctt ttgttggacg aggccacctc
     1801 agccctggat acagaaagtg aagctgtggt tcaggccgca ctggataagg ctagagaagg
     1861 ccggaccacc attgtgatag ctcatcgctt gtctaccgtt cgtaatgctg acgtcattgc
     1921 tggttttgat ggtggtgtca ttgtggagca aggaaatcat gatgagctca tgagagaaaa
     1981 gggcatttac ttcaaacttg tcatgacaca gacagcagga aatgaaattg aattaggaaa
     2041 tgaagcttgt aaatctaagg atgaaattga taatttagac atgtcttcaa aagattcagg
     2101 atccagtcta ataagaagaa gatcaactcg caaaagcatc tgtggaccac atgaccaaga
     2161 caggaagctt agtaccaaag aggccctgga tgaagatgta cctccagctt ccttttggcg
     2221 gatcctgaag ttgaattcaa ctgaatggcc ttattttgtg gttggtatat tctgtgccat
     2281 aataaatgga ggcttacagc cagcattctc cgtaatattt tcaaaagttg taggggtttt
     2341 tacaaatggt ggcccccctg aaacccagcg gcagaacagc aacttgtttt ccttgttgtt
     2401 tctgatcctt gggatcattt ctttcattac attttttctt cagggcttca catttggcaa
     2461 agctggagag atcctcacca agcgactccg atacatggtt ttcaaatcca tgctgagaca
     2521 ggatgtgagc tggtttgatg accctaaaaa caccaccgga gcactgacca ccaggctcgc
     2581 caacgatgct gctcaagtga aaggggctac agggtctagg cttgctgtga ttttccagaa
     2641 catagcaaat cttgggacag gaatcatcat atccctaatc tatggctggc aactaacact
     2701 tttactctta gcaattgtac ccatcattgc gatagctgga gtggttgaaa tgaaaatgtt
     2761 gtctggacaa gcactgaaag ataagaagga actagaaggt tctggaaaga ttgctacgga
     2821 agcaattgaa aacttccgca ctgttgtctc tttgactcgg gagcagaagt ttgaaaccat
     2881 gtatgcccag agcttgcaga taccatacag aaatgcgatg aagaaagcac acgtgtttgg
     2941 gatcacgttc tccttcaccc aggccatgat gtatttttct tatgctgctt gtttccggtt
     3001 cggtgcctac ttggtgacac aacaactcat gacttttgaa aatgttctgt tagtattctc
     3061 agctattgtc tttggtgcca tggcagtggg gcaggtcagt tcattcgctc ctgactatgc
     3121 gaaagcaaca gtgtcagcat cccacatcat caggatcatt gagaaaaccc ccgagattga
     3181 cagctacagc acgcaaggcc taaagccgaa tatgttggaa ggaaatgtgc aatttagtgg
     3241 agtcgtgttc aactatccca cccgacccag catcccagtg cttcaggggc tgagccttga
     3301 ggtgaagaag ggccagacgc tggccctggt gggcagcagt ggctgcggga agagcacagt
     3361 ggtccagctg ctcgagcgct tctacgaccc catggctgga tcagtgtttc tagatggcaa
     3421 agaaataaag caactgaatg tccagtggct ccgagcacag ctgggcattg tgtcccaaga
     3481 gcccattctc tttgactgca gcatcgcaga gaacattgcc tacggagaca acagccgggt
     3541 cgtgtcttat gaggagattg tgagggcagc caaggaggcc aacatccacc agttcatcga
     3601 ctcgctacct gataaataca acaccagagt aggagacaaa ggcactcagc tgtcgggtgg
     3661 gcagaagcag cgcatcgcca tcgcacgcgc cctcgtcaga cagcctcaca ttttacttct
     3721 ggacgaagca acatcagctc tggatacaga aagtgaaaag gttgtccagg aagcgctgga
     3781 caaagccagg gaaggccgca cctgcattgt gatcgctcac cgcctgtcca ccatccagaa
     3841 cgcggacttg atcgtggtga ttcagaacgg caaggtcaag gagcacggca cccaccagca
     3901 gctgctggcg cagaagggca tctacttctc aatggtcagt gtgcaggctg gagcaaagcg
     3961 ctcatgaact gtgaccatgt aagatgttaa gtatttttat tgtttgtatt catatatggt
     4021 gtttaatcca agtcaaaagg aaaacactta ctaaaatagc cagttatcta ttttctgcca
     4081 cagtggaaag catttagttt ggtttagagt cttcagaggc tttgtaatta aaaaaacaaa
     4141 aatagataca gcatcaaatg gagattaatg ctttaaaatg cactataaaa tttataaaag
     4201 ggttaaaagt gaatgtttga taatatatac ttttatttat actttctcat ttgtaactat
     4261 aactgatttc tgcttaacaa attatgtatg tatcaaaaat tactgaaatg tttgtataaa
     4321 gtatatatag tgaaactgag cattcatatt tttgagttat tttgctcaaa tgcatgcgaa
     4381 attatatatt gtcccaactg ggatattgta cataatttta gcctttaaaa aacagtccat
     4441 tactgggggg agggggcatc actctatggg caaagtgtta ctcagacatg ggcacctgag
     4501 ttcagatccc taccacctaa gtaagcagac aaggtgtggt gtttttgtaa tgccagtgct
     4561 agaggcagaa aagacagatc ctgcaggctc agtggctggc caaacagcct agccaacata
     4621 gcgcgttcca ggttcagtga gaaaacttgt ctcaaaaatc agagggaaaa gcaaatgagg
     4681 tgtcagccat gtgcactcat gcaaatgcca tacatgcaga agtatgtgca cacacacgca
     4741 cacattaacc aacgactagc aaggaaaatg aaggtggata agaggggtgg gactgggaca
     4801 aaggagggta cctggatgaa tatgactgaa ggacgttatg tacacatatg aaaacgtcgt
     4861 actgaaactc actacaatgt atacttaata tattgctaat aaaatatttt taaaagaaaa
     4921 aaat
//
LOCUS       MUSMDRXX     2873 bp ds-DNA             ROD       28-JUL-1990
DEFINITION  Mouse P-glycoprotein (mdr1a) gene, exons 1 and 2.
ACCESSION   M33580
KEYWORDS    P-glycoprotein.
SOURCE      Mouse (strain BALB/c/NIH) macrophage-like cell line
            J774.2-vinblastine resistant subline J7.V1-1 DNA, clone pV1.1a.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2873)
  AUTHORS   Hsu,S., Cohen,D., Lothstein,L., Kirschner,L.S., Hartstein,M. and
            Horwitz,S.B.
  TITLE     Structural analysis of the mouse mdr1a (P-glycoprotein) promoter
            reveals the basis for differential transcript heterogeneity in
            multidrug-resistant J774.2 cells
  JOURNAL   Mol. Cell. Biol. 10, 3596-3606 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.S.Kirschner, 05-APR-1990.  Albert Einstein C., 1300 Morris Pk,
            Bronx, 10461.  E-mail: kirschne@aecom.yu.edu.
FEATURES       from  to/span     description
    pre-msg    1992  >  2873     P-glycoprotein mRNA and introns
    IVS        2120     2606     P-glycoprotein intron A
    IVS        2678  >  2873     P -glycoprotein intron B
    signal     1904     1912     CAAT box
    signal     1956     1963     TATA box
    site       1880     1887     SP-1 site
    site       1921     1927     SP-1 site
    site       1937     1944     SP-1 site
    site       1869     1875     AP-1 site
    rpt           1     1300     L1Md repetitive element
BASE COUNT      860 a    621 c    714 g    678 t
ORIGIN      Chromosome 5.
        1 gaattctcac ctgaggaata ccgaatccag agaaacacct gaaaaaatgt tcaacatcct
       61 taatcatcag ggaaatgcaa atcaaaacaa ccctgagatt ccacctcaca ccagtcagaa
      121 tggctaagat caaaaattca ggtgacagca gatgctggcg aggatgtgga gaaagaggaa
      181 cactcctcca ttgttggtgg gagtgcaggc ttgtacaacc actctggaaa tcagtctggc
      241 ggttcctcag aaaactggac atagtactct cggaggatcc agcaatacct ctcctgggca
      301 tatatccaga agatgcccca acaggtaaga aggacacatg ctccactatg ttcatagcag
      361 ccttatttat aatagccaga agctggaaag aacctagatg cccctcaaca gaggaatgga
      421 tacagaaaat gtggtacatc tacacaatgg agtactactc agctattaaa aagaatgaat
      481 ttatgaaatt cctagccaaa tggatggacc tggggggcat catcctgagt gaggtaacac
      541 attcacaaag aaactcacac aatatgtatt cactgataag tggatattag ccccaaacct
      601 aggataccca agatataaga tataatttgc taaacacatg aaactcaagg agaatgaaga
      661 ctgaagtgtg gacactatgc ccctccttag atttgggaac aaaacaccca tggaaggagt
      721 tacagagacg gagtttggag ctgagatgaa aggatggacc atgtagagac tgccatagcc
      781 agggatccac cccataatca gcatccaaac gctgacacca ttgcatacac tagcaagatt
      841 ttattgaaag gacgcagatg tagctgtctc ttgtgagact atgccggggc cagcaaacac
      901 agaagtggat gctcacagtc agctaatgga tggatcatag ggctcccaat ggaggagcta
      961 gagaaagtag ccaaggagct aaagggatct gcaaccctat aggtggaaca acattatgag
     1021 ctaaccagta ccccggagct cttgactcta gctgcatata tatcaaaaga tggcctagtc
     1081 ggccatcact ggaaagagag gcccattgga cttgcaaact ttatatgccc cagtacaggg
     1141 gaataccagg gccaaaaagg gggagtgggt gggcagggga gtgggggtgg gtggatatgg
     1201 gggacttttg gtatagcatt ggaaatgtaa atgagttaaa tacctaataa aaaatggaaa
     1261 aaaaaataaa ataaaaataa gatgaaactg gaaaaaaaaa gttatgttta ataattccaa
     1321 ttgaactgta agaatttcag atgccctgga aaaacatgga cattggttta gtacctaaaa
     1381 gttcaaaata ttatatattt ttaaatacca ttttacactg aaatactcca tttatatact
     1441 ggggactgtc ctctttctgg tttgctttgt tttgtttaat aaaagaaata aaccaatcta
     1501 cctgaggaac tgtgaactat attgaagaaa agcctgcacg ggggttctct taccttttca
     1561 agagtgcttc aaagaaggga aatttactga caggcaaggt ctgtacccat tgtttaattg
     1621 tctgttagat gttatgcata gaatacgtct tttaacttag ccaaatgcag aaggccaagt
     1681 gcactatcta caaacacata actctatata tagacatgtg catggccgtg tagagatgag
     1741 actctgcaag tgtgtctcta atgattcggg ggatatgagt ttgtctaatt gacctttgag
     1801 agggaaacca gactgcacat ttcatctaca aatccaacct gtttcgcaat ttctccagca
     1861 ataatacttg agtcaagctg ggccgggagc tggttaacct ccaggtcaaa ctcactggct
     1921 gggcgggact gcgcctgggc gtagattgag catgctaaat ttactctcct gtccacagaa
     1981 agcccaggca cagtggaaca gcggtttcca ggagctgctg gtcccatctt ccaaggctct
     2041 gctcaactca gagccgcttc ttccaaagtc tacatcttgg tggactttgc agaggaaacc
     2101 gggagtagag acacgtgagg taagcatttc ctaggaaggg tcgggtgttc cggataccag
     2161 agcctggtcc gggtgtcagc gtaatcgtga gtctgtgggg accaagtggc gacacaagag
     2221 tcgctccagg agcacccgca gcatcagctt tcaggacggt gttttccgcg ccaccctgtg
     2281 ctgtggatct cgctgcccag ctcgcagcca ggggtggtgg aggagcgcgc cagggcgagg
     2341 ggacccagca ggcgggtggc ggacctagag ccgagcaccc ggtccacgca ggtgacacag
     2401 cttcccggga ttccccagtg agttacctcc aggccctctc cggcagcatc agggcggggc
     2461 tcctcctcac cactgggctc tgcggggcag tgagctttgc ataaactctg gtcccgtgtt
     2521 tggctaatga actgtggttt ctccccaggt cgtgatggaa cttgaagagg accttaaggg
     2581 aagagcagac aagaacttct ccccaggtcg tgatggaact tgaagaggac cttaagggaa
     2641 gagcagacaa gaacttctca aagatgggca aaaagaggta gccagattgt ttcactttcg
     2701 tactttactt gtcttgtaca ttcgggcaat tagtttgtag cctccagcac tgtacttgat
     2761 tagtgggtgt tatttcagac ttcagaaatg taaaccagcc cttggaagga actcctcgct
     2821 tggagcagtc cttcaaatgt gtgtgacaga tcaatcaatg attctgtgaa ttc
//
LOCUS       HUMKSAA      1504 bp ss-mRNA            PRI       28-JUL-1990
DEFINITION  Human adenocarcinoma-associated antigen (KSA) mRNA, complete cds.
ACCESSION   M32325
KEYWORDS    adenocarcinoma-associated antigen.
SOURCE      Human cell line UCLA-P3, cDNA to mRNA, clone AG[1,1338,933].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1504)
  AUTHORS   Strnad,J., Hamilton,A.E., Beavers,L.S., Gamboa,G.C., Apelgren,L.D.,
            Taber,L.D., Sportsman,J.R., Bumol,T.F., Sharp,J.D. and Gadski,R.A.
  TITLE     Molecular cloning and characterization of a human
            adenocarcinoma/epithelial cell surface antigen complementary DNA
  JOURNAL   Cancer Res. 49, 314-317 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.D. Sharp, 22-FEB-1990.
            
            There are a few base differences between the sequence presented
            here and that which appears in entry M26481. The difference occurs
            mostly in the Poly-A signal.
FEATURES       from  to/span     description
    pept        155     1099     adenocarcinoma-associated antigen precursor
                                 (KSA)
    sigp        155      223     adenocarcinoma-associated antigen signal
                                 peptide
    matp        398     1096     adenocarcinoma-associated antigen
    mRNA    <     1     1504     adenocarcinoma-associated antigen mRNA
    site       1486     1491     polyadenylation site
BASE COUNT      442 a    302 c    356 g    404 t
ORIGIN      
        1 gagcgagcac cttcgacgcg gtccggggac cccctcgtcg ctgtcctccc gacgcggacc
       61 cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt gtcccactcc cggcgcacgc
      121 cctcccgcgc ccctcttctc ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt
      181 cgggcttctg cttgccgcgg cgacggcgac ttttgccgca gctcaggaag aatgtgtctg
      241 tgaaaactac aagctggccg taaactgctt tgtgaataat aatcgtcaat gccagtgtac
      301 ttcagttggt gcacaaaata ctgtcatttg ctcaaagctg gctgccaaat gtttggtgat
      361 gaaggcagaa atgaatggct caaaacttgg gagaagagca aaacctgaag gggccctcca
      421 gaacaatgat gggctttatg atcctgactg cgatgagagc gggctcttta aggccaagca
      481 gtgcaacggc acctccacgt gctggtgtgt gaacactgct ggggtcagaa gaacagacaa
      541 ggacactgaa ataacctgct ctgagcgagt gagaacctac tggatcatca ttgaactaaa
      601 acacaaagca agagaaaaac cttatgatag taaaagtttg cggactgcac ttcagaagga
      661 gatcacaacg cgttatcaac tggatccaaa atttatcacg agtattttgt atgagaataa
      721 tgttatcact attgatctgg ttcaaaattc ttctcaaaaa actcagaatg atgtggacat
      781 agctgatgtg gcttattatt ttgaaaaaga tgttaaaggt gaatccttgt ttcattctaa
      841 gaaaatggac ctgacagtaa atggggaaca actggatctg gatcctggtc aaactttaat
      901 ttattatgtt gatgaaaaag cacctgaatt ctcaatgcag ggtctaaaag ctggtgttat
      961 tgctgttatt gtggttgtgg tgatggcagt tgttgctgga attgttgtgc tggttatttc
     1021 cagaaagaag agaatggcaa agtatgagaa ggctgagata aaggagatgg gtgagatgca
     1081 tagggaactc aatgcataac tatataattt gaagattata gaagaaggga aatagcaaat
     1141 ggacacaaat tacaaatgtg tgtgcgtggg acgaagacat ctttgaaggt catgagtttg
     1201 ttagtttaac atcatatatt tgtaatagtg aaacctgtac tcaaaatata agcagcttga
     1261 aactggcttt accaatcttg aaatttgacc acaagtgtct tatatatgca gatctaatgt
     1321 aaaatccaga acttggactc catcgttaaa attatttatg tgtaacattc aaatgtgtgc
     1381 attaaatatg cttccacagt aaaatctgaa aaactgattt gtgattgaaa gctgcctttc
     1441 tatttacttg agtcttgtac atacatactt ttttatgagc tatgaaataa aacattttaa
     1501 actg
//
LOCUS       HUMMHDNDRW   1066 bp ss-mRNA            PRI       28-JUL-1990
DEFINITION  Human MHC class II DN alpha mRNA, complete cds.
ACCESSION   M26039 M27046
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human (haplotype DRw8,Dw8.2/DRw8,Dw8.2) cell line SPL, cDNA to
            mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1066)
  AUTHORS   Jonsson,A.-K. and Rask,L.
  TITLE     Human class II DNA and DOB genes display low sequence variability
  JOURNAL   Immunogenetics 29, 411-413 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by A.-K.Jonsson, 07-JUL-1989.
FEATURES       from  to/span     description
    pept         26      778     MHC DN alpha protein precursor
                                 /nomgen="HLA-DNA" /map="6p21.3"
                                 /hgml_locus_uid="LV0117X"
    sigp         26      106     MHC DN alpha protein signal peptide
    matp        107      775     MHC DN alpha protein
    mRNA    <     1     1066     MHCDNA-a mRNA
BASE COUNT      199 a    337 c    283 g    247 t
ORIGIN      Chromosome 6p21.3.
        1 catttgatta aagcaccaga gtgtaatggc cctcagagca gggctggtcc tggggttcca
       61 caccctgatg accctcctga gcccgcagga ggcaggggcc accaaggctg accacatggg
      121 ctcctacgga cccgccttct accagtctta cggcgcctcg ggccagttca cccatgaatt
      181 tgatgaggaa cagctgttct ctgtggacct gaagaaaagc gaggccgtgt ggcgtctgcc
      241 tgagtttggt gactttgccc gctttgaccc gcagggcggg ctggccggca tcgccgcaat
      301 caaagcccat ctggacatcc tggtggagcg ctccaaccgc agcagagcca tcaacgtgcc
      361 tccacgggtg accgtgctcc ccaagtctcg ggtggagctg ggccagccca acatcctcat
      421 ctgcatcgtg gacaacatct tcccccctgt gatcaatatc acctggctgc gcaacggcca
      481 aactgtcact gagggagtgg cccagaccag cttctattcc cagcctgacc atttgttccg
      541 caagttccac tacctgccct tcgtgccctc agccgaggac gtctatgact gccaggtgga
      601 gcactggggc ctggatgcgc cactcctcag gcattgggag ctccaggtgc ctattccacc
      661 accagatgcc atggagaccc tggtctgtgc cctgggcctg gccatcggcc tggtgggctt
      721 cctcgtgggc accgtcctca tcatcatggg cacatatgtg tccagtgtcc ccaggtaatg
      781 atccttctga gagaaatgac ttgtgggaga caccctgcag atcctcatgg gtttgtgaca
      841 gaccctgcgt gctcagtgcc ctttaagtgc atcccgctgt gctgactttg agtgggatca
      901 acatctgtcc tacgggtccc ctcttttttg gccccagtat tcatggcagg gtttgttgga
      961 cacctactag cttcccttcc cattcaacac acacacacat tcttgctcta cccaaagctc
     1021 tggctggcag cactaaatgc tttggtggtg tttgcactgt gtcctt
//
LOCUS       HUMMHDOBDR   1293 bp ss-mRNA            PRI       28-JUL-1990
DEFINITION  Human MHC class II DO beta mRNA, complete cds.
ACCESSION   M26040 M27047
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human (haplotype DRw8,Dw8.2/DRw8,Dw8.2) cell line SPL, cDNA to
            mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1293)
  AUTHORS   Jonsson,A.-K. and Rask,L.
  TITLE     Human class II DNA and DOB genes display low sequence variability
  JOURNAL   Immunogenetics 29, 411-413 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by A.-K.Jonsson, 07-JUL-1989.
FEATURES       from  to/span     description
    pept         36      857     MHC DO beta protein precursor
                                 /nomgen="HLA-DOB" /map="6p21.3"
                                 /hgml_locus_uid="LM0050S"
    sigp         36      125     MHC DO beta protein signal peptide
    matp        126      854     MHC DO beta protein
BASE COUNT      293 a    315 c    330 g    355 t
ORIGIN      Chromosome 6p21.3.
        1 cgattttact gtctcatttt tttcctttct ccagaatggg ttctgggtgg gtcccctggg
       61 tggtggctct gctagtgaat ctgacccgac tggattcctc catgactcaa ggcacagact
      121 ctccagaaga ttttgtgatt caggcaaagg ctgactgtta cttcaccaac gggacagaaa
      181 aggtgcagtt tgtggtcaga ttcatcttta acttggagga gtatgtacgt ttcgacagtg
      241 atgtggggat gtttgtggca ttgaccaagc tggggcagcc agatgctgag cagtggaaca
      301 gccggctgga tctcttggag aggagcagac aggccgtgga tggggtctgt agacacaact
      361 acaggctggg cgcacccttc actgtgggga gaaaagtgca accagaggtg acagtgtacc
      421 cagagaggac cccactcctg caccagcata atctgctgca ctgctctgtg acaggcttct
      481 atccagggga tatcaagatc aagtggttcc tgaatgggca ggaggagaga gctggggtca
      541 tgtccactgg ccctatcagg aatggagact ggacctttca gactgtggtg atgctagaaa
      601 tgactcctga acttggacat gtctacacct gccttgtcga tcactccagc ctgctgagcc
      661 ctgtttctgt ggagtggaga gctcagtctg aatattcttg gagaaagatg ctgagtggca
      721 ttgcagcctt cctacttggg ctaatcttcc ttctggtggg aatcgtcatc cagctaaggg
      781 ctcagaaagg atatgtgagg acgcagatgt ctggtaatga ggtctcaaga gctgttctgc
      841 tccctcagtc atgctaaggt cctcactgaa gcttctctct ctggagcctg aagtagtgat
      901 gagtagtctg ggccctgggt gaggtaaagg acattcatga ggtcaatgtt ctgggaataa
      961 ctctcttccc tgatccttgg aggagcccga actgattctg gagctctgtg ttctgagatc
     1021 atgcatctcc cacccatctg cccttctccc ttctacgtgt acatcattaa tccccattgc
     1081 caagggcatt gtccagaaac tcccctgaga ccttactcct tccagcccca aatcatttac
     1141 ttttctgtgg tccagcccta ctcctataag tcatgatctc caaagctttc tgtcttccaa
     1201 ctgcagtctc cacagtcttc agaagacaaa tgctcaggta gtcactgttt ccttttcact
     1261 gtttttaaaa accttttatt gtcaaataaa atg
//
LOCUS       TRPFLAA       966 bp ds-DNA             BCT       28-JUL-1990
DEFINITION  T.pallidum endoflagellar sheath protein (flaA) gene, 3' end.
ACCESSION   M26525
KEYWORDS    endoflagellar sheath protein.
SOURCE      T.pallidum (strain Nichols) DNA.
  ORGANISM  Treponema pallidum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; 
            Spirochaetales; Spirochaetaceae.
REFERENCE   1  (bases 1 to 966)
  AUTHORS   Isaacs,R.D., Hanke,J.H., Guzman-Verduzco,L.-M., Newport,G.,
            Agabian,N., Norgard,M.V., Lukehart,S.A. and Radolf,J.D.
  TITLE     Molecular cloning and DNA sequence analysis of the 37-kilodalton
            endoflagellar sheath protein of Treponema pallidum
  JOURNAL   Infect. Immun. 57, 3403-3411 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.D.Radolf, 26-JUL-1989.
FEATURES       from  to/span     description
    pept    <     1      966     endoflagellar sheath protein (AA at 1)
BASE COUNT      199 a    196 c    325 g    246 t
ORIGIN      Unreported
        1 aagctgaacg ctgatatcat ggcggataag agtggaggta tgacgcataa tcggcgtacc
       61 gttctggact atgcttctct ggcggatacc tcgtacactg acgagcagaa ggcattgatg
      121 agatcttctc ttgcggttgc acagtgggag gttgtgctga attcttccgc gcgtaatcct
      181 gtcgcccatg ctgcctctcg cgttattgag gctccggtaa gtgagggagc gaagagtttt
      241 gctggtgagc gtgtccttgg tgtgcgcgtg ttgttcccca cgtgggacag taacgcaaac
      301 gcaatgataa agccggcgtt cgtaattcct gcgtacgagg tgatggctca ggtggacgat
      361 cagggtaatg tacaggcccc cacagaggag gagaaggctt ctggaaaggg gcgttttgaa
      421 gatgggtacg gagtggtaaa gaatgtgggt gttcttaagt ccatcgcggt gaacacttac
      481 gggatgaatt atcctcatgg tttgtacgtg atgatgcggg atcaggatgg tgaggtgcat
      541 cgctacttca tggggtatct cctgttcgac tcctggaagg agttggtgtg gaacaatcct
      601 tcgtatatct ctgatgttcg gtcgcgggag gtgcgcttgt atcccgtgta tcccgcgtcg
      661 acgccccacg tcgtgtttga aggctttatg gttactaggg acgcggctca tgccggaggg
      721 gactatgttg gttatttcaa ggacgtcaag attatctatg ataaggcggt gctgagtacg
      781 gtgcgcgatt ttgcggacga ggacctgtgg ggtatccagg cgcggcgtga ggctgagcgt
      841 aagagagttg aggttgcgcg tttcgggcag cagcaggtgc tgcgttatat agagcaagag
      901 aagcttgcta cagaggttgg ttttacaccc tctgggggtg ctcagcggca ggaagagcag
      961 cagtag
//
LOCUS       DROMPP1      3376 bp ds-DNA             INV       28-JUL-1990
DEFINITION  D.melanogaster membrane protein (patched) gene complete cds.
ACCESSION   M28418
KEYWORDS    transmembrane protein.
SEGMENT     1 of 2
SOURCE      D.melanogaster (embryo), DNA and cDNA to mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 3376)
  AUTHORS   Hooper,J.E. and Scott,M.P.
  TITLE     The Drosophila patched gene encodes a putative membrane protein
            required for segmental patterning
  JOURNAL   Cell 59, 751-765 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by J.E.Hooper, 27-SEP-1989.
            
FEATURES       from  to/span     description
    pept       3161  +  3289     membrane protein exon 1
    pre-msg    2389  >  3376     MPR mRNA and intron
    IVS        3290  >  3376     MPR intron A
BASE COUNT     1059 a    731 c    764 g    822 t
ORIGIN      Chromosome 2, map position 44D-E.
        1 gatcctgaat tgagaaatat agattgaaac agaattcatt accatttaag caatcattat
       61 ttatgggggc gtaatgcgcc tccgagtagg caatgctttt cttgacattg ttactaagaa
      121 ttgtgaatga tatttgggcg tggatcaacg ccgattaaaa gctgcttttg cttccaggcg
      181 gccagagaag agatccaaac ttcaactcca gccataaaag caacaacatt tccgtctccc
      241 ccttgtagct ccccttcctc cggctcttcc actctccacg aaacggcaaa tgaagctctc
      301 aaagcgaact gtgcttcgct ggtggtccat tggcagctgc cgccacacag gcgctgcttt
      361 tgtgtgtgtg tgtaatatca atcttgctct ccctctcttt ttatctctct tgggaattgg
      421 agctgcatgc gaattgagcg acagcaaaac gaactgcaag tcattgagag gagagcaaaa
      481 actcgagcgc aagccaaaga tagcgcaatc tggggagagc gaaataaagc taaaatatgc
      541 atgttggaga aaaaatgccg cccatgtcgc caaaatgcgc cacacgcaga gtgagcgggc
      601 ggaggtggga gtaatggaaa gggcgatgag ggaacgatta gcttgaagag agagaacaac
      661 aaatgaatgt gctgcaacgt tagttcaggt gagcgcgtta gagagagagt tgttgttttt
      721 tgattgtaat agctcgcttg gtggtgggtc cacattcaca tctccctctc ccactctttc
      781 tccccgaaag agagagcggg agcgaagggg cacgagggga gcacgatgac tatgcagttg
      841 cattcaattt gaatttccat ggtgctgatg attcgagcgc caattttttc gaagagttct
      901 tatttgttta cttcgttgtt gttgcctcaa ttggaaaggg aaaatgtgga atgcggagaa
      961 acaccagaag caaatgcatt tccattcata aatccaaaga agttttaaag ataacatgtc
     1021 atttggctta agttcgtggt gcacaaaaaa gatcggtttg cggttgtcgc atgaaatgag
     1081 tttattccat tggtatatta ttattcagaa attaaaaaaa aacttgttta gtctattttt
     1141 tttttttaaa taaaaaaaaa aaattctttt ataagtcgat tttagagtaa atatttaaag
     1201 actacgtcta ataaacatat aatttgttct gtgttttaat ttgccggcaa aaacaaacct
     1261 acttgtgtgg tcctcgcaca ctcataaccc ctcgcatatt tgagattcat ggggcaagag
     1321 gctgcaaaaa caatggaaag ggaaaagcag aaacatcctg ccgctcataa tttagcatcg
     1381 gaacatgcaa aaacagacat catcgcatgg ggcagcagca acagccataa aaccaacacg
     1441 agcaatgtaa agctaacaaa tttgccaaca gttcgcggca cggctacaca cacacacatg
     1501 catgcgcagc ctgccacgca cgcgcttccc ccaaacaaat acacacacac acactgagac
     1561 gaaagctcca ttgggcagcg ctgccgacgc tgaaggccga catcggcaga gctgaacgtt
     1621 tgggtagggg accacccaca tcgcttggcg gtttcagttt aatgaaggca gaaacaaatt
     1681 tatttttggg tggtccacac tgcagcgaaa ataaactaca gtggcaacaa caaaccagca
     1741 gccaaggcac tttgggtggt ccatgcaaaa aaaaaacaaa ttacggcatg cgaataacaa
     1801 tagaaattag cgctctcgtg gcggagctat ttgggtatat tagagctaca tattttattt
     1861 gtttataaaa agtataaatg taaacaatga gttccaagca ttaagtccgt atgctcaaca
     1921 attacattat cattattatt atcacttaaa tatttacaaa ggatatttaa acagtaatag
     1981 atatatattt tatttcttaa tttctgttaa catatgtatt tacattggta gttattcttt
     2041 attttgcaac aagcattcat aaattttata taacaaactt ggtattttct cggaaaaact
     2101 cctgaatcac ccctcggtat tttgtgcgtt gagctatcgt taaagcagcc ctcgcagaga
     2161 gcgttctcaa accaaaatgg ccgcacacga aacaagagag cgagtgagag tagggagagc
     2221 gtctgtgttg tgtgttgagt gtcgcccacg cacacaggcg caaaacagtg cacacagacg
     2281 cccgctgggc aagagagagt gagagagaga aacagcggcg cgcgctcgcc taatgaagtt
     2341 gttggcctgg ctggcgtgcc gcatccacga gatacagata catctctcag actgcgtgcg
     2401 atcctcgaac gaaacggttg taagtgcgga gcgcgacgac ttgttattcg tatttccgac
     2461 tactggcact ctctgtgtgt ggtatactaa caagatagat atcacagaac tcgtggaaaa
     2521 gctaagatat tgtacctcac ggatgcgagg cgaagttcat ggattaaatg ccaggcaaca
     2581 acaaaagcca gccaaccagc cagtgtttgt gtgtgtgcgt cgccaagtgc aaagtaaagt
     2641 aaaggtaaaa gagcgaaagg cgagagagaa aaccgaatac gtgagtcgtc cgactgccgc
     2701 ttttccatgt gtaaaagatc tgtgaaaatt ctgtcaaatt cccctgagaa attgtgccca
     2761 agataaaacc cgaaaaccgc gttttaatcg tcgaaaaaac ccagcaaaag cgaagccagc
     2821 aatcacaaca aaacaacata acgagagctc agatacacag cgtgctcagt gagtgagcga
     2881 gagagcgcgg gagagagcgt ctcttgattt aaaatacaaa ataattaaaa ataaaaatgc
     2941 ggaatgcagt gcaaaatgca gccaaacaaa atacgagatt ccaataacaa ttaatcgaac
     3001 cgaaagtcca cgaacaatcc gcacactgtc tcccaagtct cagttctcag gacgcagacg
     3061 aacggcaggc actgtagaaa gaccgattcc gcagcacact cccatctgca catctccgcc
     3121 acgcgattcc gtccggaatc tggctataaa cataaccata atggaccgcg acagcctccc
     3181 acgcgttccg gacacacacg gcgatgtggt cgatgagaaa ttattctcgg atctttacat
     3241 acgcaccagc tgggtggacg cccaagtggc gctcgatcag atagataagg tgagtgccca
     3301 actacagtga actttcactg tgaaggatag ccatgtgttg aattcaataa tattcttgat
     3361 cgtattcgga ggatcc
//
LOCUS       DROMPP2      5665 bp ds-DNA             INV       28-JUL-1990
DEFINITION  D.melanogaster membrane protein (patched) gene, complete cds.
ACCESSION   M28999
KEYWORDS    transmembrane protein.
SEGMENT     2 of 2
SOURCE      D.melanogaster (embryo), DNA and cDNA to mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 5665)
  AUTHORS   Hooper,J.E. and Scott,M.P.
  TITLE     The Drosophila patched gene encodes a putative membrane protein
            required for segmental patterning
  JOURNAL   Cell 59, 751-765 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by J.E.Hooper, 27-SEP-1989.
            
            Mak called J.E.Hooper and requested copy of cds be sent showing
            introns and exons, 9-OCT-1989.  Copy was received and corrections
            made, 18-OCT-1989.
FEATURES       from  to/span     description
    pept    +    95      648     membrane protein exon 2
                986     2912     membrane protein exon 3
               3051     3258     membrane protein exon 4
               3322     3593     membrane protein exon 5
               3789     4559     membrane protein exon 6
    pre-msg <     1  >  5448     MPR mRNA and introns
    IVS     <     1       94     MPR intron A
    IVS         649      985     MPR intron B
    IVS        2913     3050     MPR intron C
    IVS        3259     3321     MPR intron D
    IVS        3594     3788     MPR intron E
BASE COUNT     1397 a   1537 c   1350 g   1381 t
ORIGIN      About 9.2kb after segment 1, Chromosome 2, map position 44D-E.
        1 aattttaatg cgtattttat ggcagtggag caaggcgggg gaatctaaaa aaaaaactaa
       61 acgctaaatt ccgtattttt gttgcatttt tcagggcaaa gcgcgtggca gccgcacggc
      121 gatctatctg cgatcagtat tccagtccca cctcgaaacc ctcggcagct ccgtgcaaaa
      181 gcacgcgggc aaggtgctat tcgtggctat cctggtgctg agcaccttct gcgtcggcct
      241 gaagagcgcc cagatccact ccaaggtgca ccagctgtgg atccaggagg gcggccggct
      301 ggaggcggaa ctggcctaca cacagaagac gatcggcgag gacgagtcgg ccacgcatca
      361 gctgctcatt cagacgaccc acgacccgaa cgcctccgtc ctgcatccgc aggcgctgct
      421 tgcccacctg gaggtcctgg tcaaggccac cgccgtcaag gtgcacctct acgacaccga
      481 atgggggctg cgcgacatgt gcaacatgcc gagcacgccc tccttcgagg gcatctacta
      541 catcgagcag atcctgcgcc acctcattcc gtgctcgatc atcacgccgc tggactgttt
      601 ctgggaggga agccagctgt tgggtccgga atcagcggtc gttataccgt aagtagttaa
      661 tatgtagtta atagccacat cttatagatt ctaaagtgaa cgtatccctt atgaccatat
      721 ccttttgcat gatctacttt aacccacagt acttctctat tcatattaag gaattaataa
      781 agtacttact ttgcgcttac ctttattaaa tacgatagct tatctttata aacttgctat
      841 caagtcgaaa gataaacgtg acaagagtat ctttgtactt atcccagttg cttaccatcg
      901 taaataatct tcttattaat aaatattcgt aaataaatat tcttaactca acaaatccat
      961 ctttattatt gttactcctc tacagaggcc tcaaccaacg actcctgtgg accaccctga
     1021 atcccgcctc tgtgatgcag tatatgaaac aaaagatgtc cgaggaaaag atcagcttcg
     1081 acttcgagac cgtggagcag tacatgaagc gtgcggccat tggcagtggc tacatggaga
     1141 agccctgcct gaacccactg aatcccaatt gcccggacac ggcaccgaac aagaacagca
     1201 cccagccgcc ggatgtggga gccatcctgt ccggaggctg ctacggttat gccgcgaagc
     1261 acatgcactg gccggaggag ctgattgtgg gcggacggaa gaggaaccgc agcggacact
     1321 tgaggaaggc ccaggccctg cagtcggtgg tgcagctgat gaccgagaag gaaatgtacg
     1381 accagtggca ggacaactac aaggtgcacc atcttggatg gacgcaggag aaggcagcgg
     1441 aggttttgaa cgcctggcag cgcaactttt cgcgggaggt ggaacagctg ctacgtaaac
     1501 agtcgagaat tgccaccaac tacgatatct acgtgttcag ctcggctgca ctggatgaca
     1561 tcctggccaa gttctcccat cccagcgcct tgtccattgt catcggcgtg gccgtcaccg
     1621 ttttgtatgc cttttgcacg ctcctccgct ggagggaccc cgtccgtggc cagagcagtg
     1681 tgggcgtggc cggagttctg ctcatgtgct tcagtaccgc cgccggattg ggattgtcag
     1741 ccctgctcgg tatcgttttc aatgcgctga ccgctgccta tgcggagagc aatcggcggg
     1801 agcagaccaa gctgattctc aagaacgcca gcacccaggt ggttccgttt ttggcccttg
     1861 gtctgggcgt cgatcacatc ttcatagtgg gaccgagcat cctgttcagt gcctgcagca
     1921 ccgcaggatc cttctttgcg gccgccttta ttccggtgcc ggctttgaag gtattctgtc
     1981 tgcaggctgc catcgtaatg tgctccaatt tggcagcggc tctattggtt tttccggcca
     2041 tgatttcgtt ggatctacgg agacgtaccg ccggcagggc ggacatcttc tgctgctgtt
     2101 ttccggtgtg gaaggaacag ccgaaggtgg cacctccggt gctgccgctg aacaacaaca
     2161 acgggcgcgg ggcccggcat ccgaagagct gcaacaacaa cagggtgccg ctgcccgccc
     2221 agaatcctct gctggaacag agggcagaca tccctgggag cagtcactca ctggcgtcct
     2281 tctccctggc aaccttcgcc tttcagcact acactccctt cctcatgcgc agctgggtga
     2341 agttcctgac cgttatgggt ttcctggcgg ccctcatatc cagcttgtat gcctccacgc
     2401 gccttcagga tggcctggac attattgatc tggtgcccaa ggacagcaac gagcacaagt
     2461 tcctggatgc tcaaactcgg ctctttggct tctacagcat gtatgcggtt acccagggca
     2521 actttgaata tcccacccag cagcagttgc tcagggacta ccatgattcc tttgtgcggg
     2581 tgccacatgt gatcaagaat gataacggtg gactgccgga cttctggctg ctgctcttca
     2641 gcgagtggct gggtaatctg caaaagatat tcgacgagga ataccgcgac ggacggctga
     2701 ccaaggagtg ctggttccca aacgccagca gcgatgccat cctggcctac aagctaatcg
     2761 tgcaaaccgg ccatgtggac aaccccgtgg acaaggaact ggtgctcacc aatcgcctgg
     2821 tcaacagcga tggcatcatc aaccaacgcg ccttctacaa ctatctgtcg gcatgggcca
     2881 ccaacgacgt cttcgcctac ggagcttctc aggtgggtct tcttattaaa ttaaattaaa
     2941 ttaaattaaa ttagatcgcc ttagttctcc tcatatgtac atacatatta taacttatcg
     3001 cactccaaag ttaaagatta ctaaatgtgt gtgtatcttt attcttacag ggcaaattgt
     3061 atccggaacc gcgccagtat tttcaccaac ccaacgagta cgatcttaag atacccaaga
     3121 gtctgccatt ggtctacgct cagatgccct tttacctcca cggactaaca gatacctcgc
     3181 agatcaagac cctgataggt catattcgcg acctgagcgt caagtacgag ggcttcggcc
     3241 tgcccaacta tccatcgggt gagtcggaaa tgagtacttc atacatgggg cccaactaac
     3301 agtcgattta tttatcgcca ggcattccct tcatcttctg ggagcagtac atgaccctgc
     3361 gctcctcact ggccatgatc ctggcctgcg tgctactcgc cgccctggtg ctggtctccc
     3421 tgctcctgct ctccgtttgg gccgccgttc tcgtgatcct cagcgttctg gcctcgctgg
     3481 cccagatctt tggggccatg actctgctgg gcatcaaact ctcggccatt ccggcagtca
     3541 tactcatcct cagcgtgggc atgatgctgt gcttcaatgt gctgatatca ctggtgagtc
     3601 ttcatttctg gctggaccat taagagcttc ggagtgagtc ttcatttctg gctggaccat
     3661 taagagcttc ggagtgagtc ttcatttctg gctggaccat taagagcttc ggattttcca
     3721 gagatatccc aagacttttc attggatcct cttcagcaca cattaattgc ttatctttcc
     3781 gattctaggg cttcatgaca tccgttggca accgacagcg ccgcgtccag ctgagcatgc
     3841 agatgtccct gggaccactt gtccacggca tgctgacctc cggagtggcc gtgttcatgc
     3901 tctccacgtc gccctttgag tttgtgatcc ggcacttctg ctggcttctg ctggtggtct
     3961 tatgcgttgg cgcctgcaac agccttttgg tgttccccat cctactgagc atggtgggac
     4021 cggaggcgga gctggtgccg ctggagcatc cagaccgcat atccacgccc tctccgctgc
     4081 ccgtgcgcag cagcaagaga tcgggcaaat cctatgtggt gcagggatcg cgatcctcgc
     4141 gaggcagctg ccagaagtcg catcaccacc accacaaaga ccttaatgat ccatcgctga
     4201 cgacgatcac cgaggagccg cagtcgtgga agtccagcaa ctcgtccatc cagatgccca
     4261 atgattggac ctaccagccg cgggaacagc gacccgcctc ctacgcggcc ccgccccccg
     4321 cctatcacaa ggccgccgcc cagcagcacc accagcatca gggcccgccc acaacgcccc
     4381 cgcctccctt cccgacggcc tatccgccgg agctgcagag catcgtggtg cagccggagg
     4441 tgacggtgga gacgacgcac tcggacagca acaccaccaa ggtgacggcc acggccaaca
     4501 tcaaggtgga gctggccatg cccggcaggg cggtgcgcag ctataacttt acgagttagc
     4561 actagcacta gttcctgtag ctattaggac gtatctttag actctagcct aagccgtaac
     4621 cctatttgta tctgtaaaat cgatttgtcc agcgggtctg ctgaggattt cgttctcatg
     4681 gattctcatg gattctcatg gatgcttaaa tggcatggta attggcaaaa tatcaatttt
     4741 tgtgtctcaa aaagatgcat tagcttatgg tttcaagata catttttaaa gagtccgcca
     4801 gatatttata taaaaaaaat ccaaaatcga cgtatccatg aaaattgaaa agctaagcag
     4861 acccgtatgt atgtatatgt gtatgcatgt tagttaattt cccgaagtcc ggtatttata
     4921 gcagctgcct tccgcgcccc ccttcccttg aaatgaacac ccttccagcc acgccccacc
     4981 gcccctctgc gtagcagctt tgtatgtatg tagtatgcta gcacctaagg aatacttaaa
     5041 cttagagata tttattgtaa cacacgcaaa acacacacaa tgtacttaca tataattcaa
     5101 tgcgagattc acccacacaa aaaggaaaca caacaaacta gtaattgtag ctcgtaattt
     5161 agtttaaata tgttacataa aacacaagga cttgaaccaa aatagtatcg cttaaacgga
     5221 aacgagagaa acgagaaaaa ataactatta cttaatcaac tacaagagag atatccctcc
     5281 tcccctaacc gtacttacaa ccaaaataaa acaagagtat aagcataaaa atggaaaacg
     5341 aagcgaggaa cgattgtaaa cgcggtcatt tatcctgtac atttgttgcc cgaagactga
     5401 ctgtcttttt tttaataaaa atatatatta tacagttttt taaaagcgaa attcatgact
     5461 tttttttaac agtgagcaga gaacaaaaga aacggaagtt ttcgctgtat caataaaaag
     5521 attccatttt tttaataaat tgtaaaaatc ctaaaaaaaa gaagactaca aaagtttaaa
     5581 tttttatacg ttattgataa acttttatac acgaaaatac ttgtacttag ctatgatcaa
     5641 ctccttggct taagtctcgg gtaag
//
LOCUS       BLYGEH       1250 bp ss-mRNA            PLN       28-JUL-1990
DEFINITION  Barley (1->3)-beta-glucan endohydrolase mRNA, complete cds.
ACCESSION   M23548 X15205
KEYWORDS    glucan endohydrolase.
SOURCE      Barley (2 days into germination) scutellum, cDNA to mRNA, clone
            lambda-3.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1250)
  AUTHORS   Hoej,P.B., Hartman,D.J., Morrice,N.A., Doan,D.N.P. and Fincher,G.B.
  TITLE     Purification of (1->3)-beta-glucan endohydrolase isoenzyme II from
            germinated barley and determination of its primary structure from a
            cDNA clone
  JOURNAL   Plant Mol. Biol. 13, 31-42 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by G.W.Fincher, 02-MAY-1989.
FEATURES       from  to/span     description
    pept         48     1052     glucan endohydrolase precursor
    sigp         48      131     glucan endohydrolase signal peptide
    matp        132     1049     glucan endohydrolase
    mRNA    <     1     1250     GEH mRNA
BASE COUNT      262 a    404 c    362 g    222 t
ORIGIN      
        1 ccagcattgc atagcatttg agcaccagat actccatgtg tgcagcaatg gctagaaaag
       61 atgttgcctc catgtttgca gctgctctct tcattggagc gttcgctgct gttcctacga
      121 gtgtgcagtc catcggcgtg tgctacggcg tgatcggcaa caacctcccc tcccggagcg
      181 acgtggtgca gctctacagg tccaagggca tcaacggcat gcgcatctac ttcgccgacg
      241 ggcaggccct ctcggcgctc cgcaactccg gcatcggcct catcctcgac atcggcaacg
      301 accagctcgc caacatcgcc gccagcacct ccaacgcggc gtcctgggtc cagaacaacg
      361 tgcggcccta ctaccctgcc gtgaacatca agtacatcgc cgccggcaac gaggtgcagg
      421 gcggcgccac gcagagcatc ctgccggcca tgcgcaacct caacgcggcc ctctccgcgg
      481 cggggctcgg cgccatcaag gtgtccacct ccatccggtt cgacgaggtg gccaactcct
      541 tcccgccctc cgccggcgtg ttcaagaacg cctacatgac ggacgtggcc cggctcctcg
      601 cgagcaccgg cgcgccgctg ctcgccaacg tctaccccta cttcgcgtac cgtgacaacc
      661 ccgggagcat cagcctgaac tacgcgacgt tccagccggg caccaccgtg cgtgaccaga
      721 acaacgggct gacctacacg tccctgttcg acgcgatggt ggacgccgtg tacgcggcgc
      781 tggagaaggc cggcgcgccg gcggtgaagg tggtggtgtc ggagagcggg tggccgtcgg
      841 cgggcgggtt tgcggcgtcg gccggcaatg cgcggacgta caaccagggg ctgatcaacc
      901 acgtcggcgg gggcacgccc aagaagcggg aggcgctgga gacgtacatc ttcgccatgt
      961 tcaacgagaa ccagaagacc ggggacgcca cggagaggag cttcgggctc ttcaacccgg
     1021 acaagtcgcc ggcatacaac atccagttct agtgtagcta cctagctcac atacctacat
     1081 ccccagccta aataaataag ctgctcgtac gtacgtaatg cggcatccaa gtgtaacgta
     1141 gacacgtaca ttcatccatg gaagagtgca accaagcatg cgttaacttc ctggtgatga
     1201 tacatcatca tggtatgaat aaaagatatg gaagatgtta tgaatttgtg
//
LOCUS       ECOPOLBDA    4666 bp ds-DNA             BCT       28-JUL-1990
DEFINITION  E.coli DNA polymerase (polB) gene, 5' flank.
ACCESSION   M35371
KEYWORDS    DNA polymerase; polB gene.
SOURCE      E.coli (strain W3110) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 4666)
  AUTHORS   Chen,H., Sun,Y., Stark,T., Beattie,W. and Moses,R.
  TITLE     Nucleotide sequence and deletion analysis of the polB gene of
            E.coli
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Chen, 20-JUN-1990.
            
              Author address: H.Chen
              Baylor College of Medicine
              Cell Biology and IMG
              One Baylor Plaza
              Houston, TX 77030
              email: hchen@vulture.bcm.tmc.edu
            
FEATURES       from  to/span     description
    signal     1209     1214     -35 region
    signal     1229     1234     -10 region
    binding    1250     1255     ribosome binding site
    site       1193     1208     lexA box
BASE COUNT     1100 a   1278 c   1257 g   1031 t
ORIGIN      
        1 aagcttgcat gcctgcaggt cgactctaga ggatcctcgc tggtggcgcg caccataccg
       61 tcttcagcca tgcactgaac ctcaacgata tgcgccaatt cgccgagatg cacgacattg
      121 aaatcacggt gattgataac gacacacgcc tgccagcgtt taaagacgcg ctgcgctgga
      181 acgaagtgta ttacgggttt cgtcgctaag tagccgcatc cggtatgtaa cgcctgatgc
      241 gacgctgacg cgtcttatct ggcctacacg ctgcgatttt gtaggccgga taagcaaagc
      301 gcatccggca ttcaacgcct gatgcgacgc tggcgcgtct tatcaggcct acgcgctgcg
      361 attttgtagg ccggataagc aaagcgcatc cggcattcaa cgcctgatgc gacgctggcg
      421 cgtcttatca ggcctacacg ctgcgatttt gtaggccgga taagcaaagc gcatccggca
      481 cgaaggagtc aacatgttag aagatctcaa acgccaggta ttagaagcca acctggcgct
      541 gccaaaacac aacctggtca cgctcacatg gggcaacgtc agcgccgttg atcgcgagcg
      601 cggcgtcttt gtgatcaaac cttccggcgt cgattacagc gtcatgaccg ctgacgatat
      661 ggtcgtggtt agcatcgaaa ccggtgaagt ggttgaaggt acgaaaaagc cctcctccga
      721 cacgccaact caccggctgc tctatcaggc attcccctcc attggcggca ttgtgcatac
      781 gcactcgcgc cacgccacca tctgggcgca ggcgggtcag tcgattccag caaccggcac
      841 cacccacgcc gactatttct acggcaccat tccctgcacc cgcaaaatga ccgacgcaga
      901 aatcaacggc gaatatgagt gggaaaccgg taacgtcatc gtagaaacct ttgaaaaaca
      961 gggtatcgat gcagcgcaaa tgcccggcgt tctggtccat tcccacggcc cgtttgcatg
     1021 gggcaaaaat gccgaagatg cggtgcataa cgccatcgtg ctggaagagg tcgcttatat
     1081 ggggatattc tgccgtcagt tagcgccgca gttaccggat atgcagcaaa cgctgctgga
     1141 taaacactat ctgcgtaagc atggcgcgaa ggcatattac gggcagtaat gactgtataa
     1201 aaccacagcc aatcaaacga aaccaggcta tactcaagcc tggttttttg atggattttc
     1261 agcgtggcgc aggcaggttt tatcttaacc cgacactggc gggacacccc gcaagggaca
     1321 gaagtctcct tctggctggc gacggacaac gggccgttgc aggttacgct tgcaccgcaa
     1381 gagtccgtgg cgtttattcc cgccgatcag gttccccgcg ctcagcatat tttgcagggt
     1441 gaacaaggct ttcgcctgac accgctggcg ttaaaggatt ttcaccgcca gccggtgtat
     1501 ggcctttact gtcgcgccca tcgccaattg atgaattacg aaaagcgcct gcgtgaaggt
     1561 ggcgttaccg tctacgaggc cgatgtgcgt ccgccagaac gctatctgat ggagcggttt
     1621 atcacctcac cggtgtgggt cgagggtgat atgcacaatg gcactatcgt taatgcccgt
     1681 ctgaaaccgc atcccgacta tcgtccgccg ctcaagtggg tttctataga tattgaaacc
     1741 acccgccacg gtgagctgta ctgcatcggc ctggaagcgt gcgggcagcg catcgtttat
     1801 atgctggggc cggagaatgg cgacgcctcc tcgcttgatt tcgaactgga atacgtcgcc
     1861 agccgcccgc agttgctgga aaaactcaac gcctggtttg ccaactacga tcctgatgtg
     1921 atcatcggtt ggaacgtggt gcagttcgat ctgcgaatgc tgcaaaaaca tgccgagcgt
     1981 taccgtcttc cgctgcgtct tgggcgcgat aatagcgagc tggagtggcg cgagcacggc
     2041 tttaaaaacg gcgtcttttt tgcccaggct aaaggtcggc taattatcga cggtatcgag
     2101 gcgctgaaat ccgcgttctg gaatttctct tcattctcgc tggaaactgt cgctcaggag
     2161 ctattaggcg aaggaaaatc tatcgataac ccgtgggatc gaatggacga aattgaccgc
     2221 cgtttcgccg aagataaacc tgcgctggca acttataacc tgaaagattg cgagctggtg
     2281 acgcagatct tccacaaaac tgaaatcatg ccatttttac tcgaacgggc aacggtgaac
     2341 ggcctgccgg tggaccgaca cggcggttcg gtggcggcat ttggtcatct ctattttccg
     2401 cgaatgcatc gcgctggtta tgtcgcgcct aatctcggcg aagtgccgcc gcacgccagc
     2461 cctggcggct acgtgatgga ttcacggcca gggctttatg attcagtgct ggtgctggac
     2521 tataaaagcc tgtacccgtc gatcatccgc acctttctga ttgatcccgt cgggctggtg
     2581 gaaggcatgg cgcagcctga tccagagcac agtaccgaag gttttctcga tgcctggttc
     2641 tcgcgagaaa aacattgcct gccggagatt gtgactaaca tctggcacgg gcgcgatgaa
     2701 gccaaacgcc agggtaacaa accgctgtcg caggcgctga aaatcatcat gaatgccttt
     2761 tatggcgtgc tcggcaccac cgcctgccgc ttcttcgatc cgcggctggc atcgtcgatc
     2821 accatgcgtg gtcatcagat catgcggcaa accaaagcgt tgattgaagc acagggctac
     2881 gacgttatct acggcgatac cgactcaacg tttgtctggc tgaaaggcgc acattcggaa
     2941 gaagaagcgg cgaaaatcgg tcgtgcactg gtgcagcacg ttaacgcctg gtgggcggaa
     3001 acgctgcaaa aacaacggct gaccagcgca ttagaactgg agtatgaaac ccatttctgc
     3061 cgttttctga tgccaaccat tcgcggagcc gataccggca gtaaaaagcg ttatgccgga
     3121 ctgattcagg agggcgacaa gcagcggatg gtgtttaaag ggctggaaac cgtgcgcacc
     3181 gactggacgc cgctggccca gcagtttcag caggagctat acctgcgcat cttccgcaac
     3241 gagccatatc aggaatatgt acgcgaaacc atcgacaaac tgatggcggg tgaactggat
     3301 gcgcgactgg tttaccgtaa acgccttcgc cgtccgctga gcgagtatca gcgtaatgtg
     3361 ccgcctcatg tacgcgccgc tcgccttgcc gatgaagaaa accaaaagcg tggtcgcccc
     3421 ttgcaatatc agaatcgcgg caccattaag tacgtatgga ccaccacagg cccggagccg
     3481 cctggactac caacgttcac cactggatta cgaacactat ctgacccgcc agctacaacc
     3541 cgtggcggag ggaatactcc cttttattga ggataatttt gctacactta tgaccgggca
     3601 acttgggcta ttttgagcaa aaaaaagagt tcgccagata ccattttgat gcgtgacgaa
     3661 tgctttgcca tccagtacca tagcgccctt tccattcctg gacctgaata acaccactac
     3721 ctcataagca cggtagcggg tggttattgc ctgcaattaa agatatagag ccgaacacat
     3781 atgcctttta cacttggtca acgctggatc agcgatacag aaagcgaatt gggacttgga
     3841 accgttgtcg cggtggatgc gcgaactgtc actttacttt tcccatctac tggtgaaaac
     3901 cgtctgtacg cacgcagtga ttcccccgtg acccgcgtga tgttcaaccc tggtgatacc
     3961 attaccagcc atgacggctg gcagatgcaa gtcgaagaag taaaagaaga aaatggcttg
     4021 ctgacctata tcggtactcg cctggatact gaagaggtcc ggcgtagccc tgcgtgaagt
     4081 tttccttgat agcaaactgg tgttcagcaa accgcaggca ccgtctgttt gccgggcaga
     4141 ttgaccgtat ggaccgcttt gcgctgcgtt atcgcgcgcg taaatattcc agcgaacagt
     4201 tccgtatgcc gtacagcggc ctgcgcggtc agcgtaccag cctgatccgc atcagctcaa
     4261 catcgctcat gatgttggtc gccgccacgc gccgcgcgtc ctgctggctg acgaagtggg
     4321 tttagggaaa accattgaag ccgggatgat cctgcatcag caactgctct ctggcgctgc
     4381 tgaacgtgtg ctaattatcg tcccggaaac cttacagcat cagtggctgg tagaaatgct
     4441 gcgccgtttc aacctgcgct ttgcgctatt tgatgatgag cgttatgccg aagctcagca
     4501 cgatgcttac aacccgtttg acaccgtgaa gcggcgcacg aaaaacgcga aagcgtttca
     4561 cgataaatgc gaaaacttta gctttcgcgc ttcaaatgaa acagatgtat taattactgc
     4621 tttttattca ttacatgggg atccccgggt accgagctcg aattcc
//
LOCUS       HUMBIGFII    1387 bp ss-mRNA            PRI       28-JUL-1990
DEFINITION  Human insulin-like growth factor binding protein 2 (IGFBP2) mRNA,
            complete cds.
ACCESSION   M35410
KEYWORDS    insulin-like growth factor binding protein 2.
SOURCE      Human 67-year old retina, cDNA to mRNA, clone AS200.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1387)
  AUTHORS   Agarwal,N., Hsieh,C.-L., Sills,D., Swaroop,M., Desai,B.,
            Francke,U. and Swaroop,A.
  TITLE     Sequence analysis, expression and chromosomal localization of a
            gene, isolated from a subtracted human retina cDNA library, that
            encodes an insulin-like growth factor binding protein (IGFBP2)
  JOURNAL   Exp. Eye Res. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Swaroop, 20-JUN-1990.
FEATURES       from  to/span     description
    pept         64     1050     insulin-like growth factor binding protein 2
                                 (IGFBP2) precursor /hgml_locus_uid="LN0220S"
                                 /nomgen="IBP1" /map="7p13-p12"
    sigp         64      180     insulin-like growth factor binding protein 2
                                 signal peptide
    matp        181     1047     insulin-like growth factor binding protein 2
    mRNA    <     1     1387     IGFBP2 mRNA
    signal     1362     1367     poly-A signal
    binding     175      197     ATP binding site
BASE COUNT      232 a    455 c    477 g    223 t
ORIGIN      
        1 gtgccacctg cccgcccgcc cgctcgctcg ctcgcccgcc gcgccgcgct gccgaccgcc
       61 agcatgctgc cgagagtggg ctgccccgcg ctgccgctgc cgccgccgcc gctgctgccg
      121 ctgctgccgc tgctgctgct gctactgggc gcgagtggcg gcggcggcgg ggcgcgcgcg
      181 gaggtgctgt tccgctgccc gccctgcaca cccgagcgcc tggccgcctg cgggcccccg
      241 ccggttgcgc cgcccgccgc ggtggccgca gtggccggag gcgcccgcat gccatgcgcg
      301 gagctcgtcc gggagccggg ctgcggctgc tgctcggtgt gcgcccggct ggagggcgag
      361 gcgtgcggcg tctacacccc gcgctgcggc caggggctgc gctgctatcc ccacccgggc
      421 tccgagctgc ccctgcaggc gctggtcatg ggcgagggca cttgtgagaa gcgccgggac
      481 gccgagtatg gcgccagccc ggagcaggtt gcagacaatg gcgatgacca ctcagaagga
      541 ggcctggtgg agaaccacgt ggacagcacc atgaacatgt tgggcggggg aggcagtgct
      601 ggccggaagc ccctcaagtc gggtatgaag gagctggccg tgttccggga gaaggtcact
      661 gagcagcacc ggcagatggg caagggtggc aagcatcacc ttggcctgga ggagcccaag
      721 aagctgcgac caccccctgc caggactccc tgccaacagg aactggacca ggtcctggag
      781 cggatctcca ccatgcgcct tccggatgag cggggccctc tggagcacct ctactccctg
      841 cacatcccca actgtgacaa gcatggcctg tacaacctca aacagtgcaa gatgtctctg
      901 aacgggcagc gtggggagtg ctggtgtgtg aaccccaaca ccgggaagct gatccaggga
      961 gcccccacca tccgggggga ccccgagtgt catctcttct acaatgagca gcaggaggct
     1021 cgcggggtgc acacccagcg gatgcagtag accgcagcca gccggtgcct ggcgcccctg
     1081 ccccccgccc ctctccaaac accggcagaa aacggagagt gcttgggtgg tgggtgctgg
     1141 aggattttcc agttctgaca cacgtattta tatatggaaa gagaccagca ccgagctcgg
     1201 cacctccccg gcctctctct tcccagctgc agatgccaca cctgctcctt cttgctttcc
     1261 ccgggggagg aagggggttg tggtcgggga gctggggtac aggtttgggg agggggaaga
     1321 gaaattttta tttttgaacc cctgtgtccc ttttgcataa gattaaagga aggaaaagta
     1381 aagtgtg
//
LOCUS       HUMLBPA      1431 bp ss-mRNA            PRI       28-JUL-1990
DEFINITION  Human lipopolysaccharide binding protein (LBP) mRNA,
            complete cds.
ACCESSION   M35533
KEYWORDS    lipopolysaccharide binding protein.
SOURCE      Human liver, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
            Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
  TITLE     Structure and function of lipopolysaccharide binding protein
  JOURNAL   Science (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1431)
  AUTHORS   Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
            Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.S.Tobias, 21-JUN-1990.
            
               Author address: P.S.Tobias
               Department of Immunology, IMM-12
               10466 N. Torrey Pines Rd
               La Jolla, CA 92037
FEATURES       from  to/span     description
    pept          1  >  1431     lipopolysaccharide binding protein (LBP)
                                 precursor
    sigp          1       75     lipopolysaccharide binding protein signal
                                 peptide
    matp         76     1431     lipopolysaccharide binding protein
BASE COUNT      319 a    417 c    359 g    336 t
ORIGIN      
        1 atgggggcct tggcaagagc cctgccgtcc atactgctgg cattgctgct tacgtccacc
       61 ccagaggctc tgggtgccaa ccccggcttg gtcgccagga tcaccgacaa gggactgcag
      121 tatgcggccc aggaggggct attggctctg cagagtgagc tgctcaggat cacgctgcct
      181 gacttcaccg gggacttgag gatcccccac gtcggccgtg ggcgctatga gttccacagc
      241 ctgaacatcc acagctgtga gctgcttcac tctgcgctga ggcctgtccc cggccagggc
      301 ctgagtctca gcatctccga ctcctccatc cgggtccagg gcaggtggaa ggtgcgcaag
      361 tcattcttca aactacaggg ctcctttgat gtcagtgtca agggcatcag catttcggtc
      421 aacctcctgt tgggcagcga gtcctccggg aggcccacag gttactgcct cagctgcagc
      481 agtgacatcg ctgacgtgga ggtggacatg tcgggagatt cggggtggct cttgaacctc
      541 ttccacaacc agattgagtc caagttccag aaagtactgg agagcaggat ttgcgaaatg
      601 atccagaaat cagtgtcctc cgatctacag ccttatctcc aaactctgcc agttacaaca
      661 gagattgaca gtttcgccga cattgattat agcttagtgg aagcccctcg ggcaacagcc
      721 cagatgctgg aggtgatgtt taagggtgaa atctttcatc gtaaccaccg ttctccagtt
      781 accctccttg ctgcagctga ggaacacaac aaaatggtct actttgccat ctcggattat
      841 gtcttcaaca cggccagcct ggtttatcat gaggaaggat atctgaactt ctccatcaca
      901 gatgacatga taccgcctga ctctaatatc cgactgacca ccaagtcctt ccgacccttc
      961 gtcccacggt tagccaggct ctaccccaac atgaacctgg aactccaggg atcagtgccc
     1021 tctgctccgc tcctgaactt cagccctggg aatctgtctg tggaccccta tatggagata
     1081 gatgcctttg tgctcctgcc cagctccagc aaggagcctg tcttccggct cagtgtggcc
     1141 actaatgtgt ccgccacctt gaccttcaat accagcaaga tcactgggtt cctgaagcca
     1201 ggaaaggtaa aagtggaact gaaagaatcc aaagttggac tattcaatgc agagctgttg
     1261 gaagcgctcc tcaactatta catccttaac accctctacc ccaagttcaa tgataagttg
     1321 gccgaaggct tcccccttcc tctgctgaag cgtgttcagc tctacgacct tgggctgcag
     1381 atccataagg acttcctgtt cttgggtgcc aatgtccaat acatgagagt t
//
LOCUS       HUMPEC12L    2344 bp ds-DNA             PRI       28-JUL-1990
DEFINITION  Human cell 12-lipoxygenase gene, complete cds.
ACCESSION   M35418
KEYWORDS    lipoxygenase.
SOURCE      Human platelet/erythroleukemia cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2344)
  AUTHORS   Funk,C.D., Furci,L. and FitzGerald,G.A.
  TITLE     Molecular cloning, primary structure and expression of the human
            platelet/erythroleukemia cell 12-lipoxygenase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.D.Funk, 20-JUN-1990.
FEATURES       from  to/span     description
    pept         42     2033     12-lipoxygenase
    signal     2324     2329     poly-A signal
    mRNA    <     1     2344     12-lipoxygenase mRNA
BASE COUNT      514 a    692 c    642 g    496 t
ORIGIN      
        1 ggaggacccg gctcccctcg cctaagctgc tggggggcgc catgggccgc taccgcatcc
       61 gcgtggccac cggggcctgg ctcttctccg ggtcgtacaa ccgcgtgcag ctttggctgg
      121 tcgggacgcg cggggaggcg gagctggagc tgcagctgcg gccggcgcgg ggcgaggagg
      181 aggagtttga tcatgacgtt gcagaggact tggggctcct gcagttcgtg aggctgcgca
      241 agcaccactg gctggtggac gacgcgtggt tctgcgaccg catcacggtg cagggccctg
      301 gagcctgcgc ggaggtggcc ttcccgtgct accgctgggt gcagggcgag gacatcctga
      361 gcctgcccga gggcaccgcc cgcctgccag gagacaatgc tttggacatg ttccagaagc
      421 atcgagagaa ggaactgaaa gacagacagc agatctactg ctgggccacc tggaaggaag
      481 ggttacccct gaccatcgct gcagaccgta aggatgatct acctccaaat atgagattcc
      541 atgaggagaa gaggctggac tttgaatgga cactgaaggc aggggctctg gagatggccc
      601 tcaaacgtgt ttacaccctc ctgagctcct ggaactgcct agaagacttt gatcagatct
      661 tctggggcca gaagagtgcc ctggctgaga aggttcgcca gtgctggcag gatgatgagt
      721 tgttcagcta ccagttcctc aatggtgcca accccatgct gttgagacgc tcgacctctc
      781 tgccctccag gctagtgctg ccctcgggga tggaagagct tcaggctcaa ctggagaaag
      841 aacttcagaa tggttccctg tttgaagctg acttcatcct tctggatgga attccagcca
      901 acgtgatccg aggagagaag caatacctgg ctgcccccct cgttatgctg aagatggagc
      961 ccaatgggaa gctgcagccc atggtcatcc agattcagcc tcccagcccc agctctccaa
     1021 ccccaacact gttcctgccc tcagaccccc cacttgcctg gctcctggca aagtcctggg
     1081 tccgaaattc agatttccaa ctgcacgaga tccagtatca cttgctgaac actcacctgg
     1141 tggctgaggt catcgctgtc gccaccatgc ggtgcctccc aggactgcac cccatcttca
     1201 agttcccgat cccccatatc cgctacacca tggaaatcaa cacccgggcc cggacccaac
     1261 tcatctcaga tggaggaatt tttgataagg cagtgagcac aggtggaggg ggccatgtac
     1321 agttgctccg tcgggcggca gctcagctga cctactgctc cctctgtcct cctgacgacc
     1381 tggctgaccg gggcctgctg ggactcccag gtgctctcta tgcccatgat gctttacggc
     1441 tctgggagat cattgccagg tatgtggagg ggatcgtcca cctcttctac caaagggatg
     1501 acatagtgaa gggggaccct gagctgcagg cctggtgtcg ggagatcacg gaggtggggc
     1561 tgtgccaggc ccaggaccga ggtttccctg tctccttcca gtcccagagt caactctgcc
     1621 atttcctcac catgtgcgtc ttcacgtgca ctgcccagca tgccgccatc aaccagggcc
     1681 agctggactg gtatgcctgg gtccctaatg ctccatgcac aatgcggatg cccccaccca
     1741 ccaccaagga agatgtgacg atggccacag tgatggggtc actacctgat gtccggcagg
     1801 cctgtcttca aatggccatc tcatggcatc tgagtcgccg ccagccagac atggtgcctc
     1861 tggggcacca caaagaaaaa tatttctcag gccccaagcc caaagctgtg ctaaaccaat
     1921 tccgaacaga tttggaaaag ctagaaaagg agattacagc ccggaatgag caacttgact
     1981 ggccctatga atatctgaag cccagctgca tagagaacag tgtcaccatc tgagccctag
     2041 agtgactcta cctgcaagat ttcacatcag ctttaggact gacatttcta tcttgaattt
     2101 catgctttcc taaagtctct gctgctaagg ctctatttcc tcccccagtt aaacccctac
     2161 attagtatcc cactagccca ggggagcagt aaactttctc tgcaaagact agatcctttt
     2221 ttacgctttg cagaccgcat agtcactgtc tcaactactc agctctcctg ctgcagcatg
     2281 aaggcagcca cagacaacat ggaaatgagt gtgactatgt tccaataaaa ctttatggac
     2341 actg
//
LOCUS       HUMRALBA     1327 bp ss-mRNA            PRI       28-JUL-1990
DEFINITION  Human GTP-binding protein (RALB) mRNA, complete cds.
ACCESSION   M35416
KEYWORDS    GTP-binding protein.
SOURCE      Human retina, cDNA to mRNA, clone AS181.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1327)
  AUTHORS   Hsieh,C.-L., Swaroop,A. and Francke,U.
  TITLE     Chromosomal localization and cDNA sequence of human RALB, a GTP
            binding protein
  JOURNAL   Somat. Cell Mol. Genet. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Swaroop, 20-JUN-1990.
FEATURES       from  to/span     description
    pept        171      791     GTP-binding protein (RALB)
    mRNA    <     1     1327     RALB mRNA
    signal     1303     1308     poly-A signal
BASE COUNT      368 a    273 c    373 g    313 t
ORIGIN      Chromosome cen-q13.
        1 gagcccggca gctcaatgac aaatcggtgg aggacggctg gggtccggcc ccgggagggc
       61 ccggggcgcg tttaagagct gcgggccggg tgcggacggc ggaggcggcg ggactggtcc
      121 ctgctcttca gtgggtcatc tgtgtgtcac agcctcagaa gaccagcgag atggctgcca
      181 acaagagtaa gggccagagc tccttggccc tccacaaggt gatcatggtt ggcagcggag
      241 gcgttggcaa gtcagccctg acgcttcagt tcatgtatga cgagtttgta gaagactatg
      301 aacctaccaa agctgacagt tatagaaaga aagtggttct tgatggggaa gaagttcaga
      361 tagatattct ggacaccgct gggcaagagg actacgcagc cattcgagat aactactttc
      421 ggagtgggga agggtttctt cttgtgttct caatcacaga acatgaatcc tttacagcaa
      481 ctgccgaatt cagggaacag attctccgtg tgaaggctga agaagataaa attccactgc
      541 tcgtcgtggg aaacaagtct gacctagagg agcggaggca ggtgcctgtg gaggaggcca
      601 ggagtaaagc cgaagagtgg ggcgtgcagt acgtggagac gtcagcgaag acccgggcca
      661 acgtggacaa ggtgttcttt gacctaatga gagaaatcag aacaaagaag atgtcagaaa
      721 acaaagacaa gaatggcaag aaaagcagca agaacaagaa aagttttaaa gaaagatgtt
      781 gcttactatg agtgtcaagg tgacggatga agccagctgc tcctaaggac acagggctgg
      841 gttggtaaag agaaggctat ggttgacttc ttgcttgtgc ttcccactct ccccgacttc
      901 attcactcaa acttctttaa atggggaaaa atatttgtga ctctgtggct ggcagaagaa
      961 ataagcccat gcaagtggaa gggctgcttt gtcaggaggt tgtggaattt ctttcttctc
     1021 cccttcttcc ctcccaaaag cttagctatg tataaagtgc cacagatagg aaacagctgt
     1081 taattacaaa gagaaagaat tgtcatagca tcttattttg ttcctagttt tataacatta
     1141 ccatccttcg ttttgaacta cagatgttgt agtgggtttt ggaggaggga gtggagtaag
     1201 atgccctccc acttttatca gtttagtagt agtactgaga aaaatccctt cagctctaag
     1261 aacactgaaa aatccaccga ttttttgggt aagcttcttg gcaataccct gtggatctga
     1321 aacagct
//
LOCUS       LACLACR      1332 bp ds-DNA             BCT       28-JUL-1990
DEFINITION  L.lactis lactose phosphotransferase system repressor (lacR) gene,
            complete cds.
ACCESSION   M35375
KEYWORDS    lactose phosphotransferase system repressor; lactose repressor.
SOURCE      L.lactis (strain MG1820) DNA.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 1332)
  AUTHORS   Van Rooijen,R.J. and Devos,W.M.
  TITLE     Molecular cloning, transcriptional analysis, and nucleotide
            sequence of LACR, a gene encoding the repressor of the lactose
            phosphotransferase system of Lactococcus lactis
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.J.Van Rooijen, 20-JUN-1990.
            
              Author address: R.J.Van Rooijen
              Netherlands Inst for Dairy Research
              Kernhemseweg 2
              P.O. Box 20
              6710 BA EDE
              THE NETHERLANDS
            
FEATURES       from  to/span     description
    pept        370     1155     lactose repressor (lacR; alt.)
    pept        388     1155     lactose repressor (lacR; alt.)
    mRNA         79     1245     lactose repressor mRNA
    signal     1215  >  1155     transcription termination signal
    binding     353      357     ribosomal binding site
    signal       45       51     -35 region
    signal       68       74     -10 region
BASE COUNT      469 a    207 c    198 g    458 t
ORIGIN      
        1 gatatcaaac attcaaacaa aacgcaacta tttttgttaa ttttttgttt ttttttattt
       61 gtttttttaa aaaatagata acaccgttaa attattgttc atttttgttc atttaatcca
      121 tcacaaaatg gacgtgaaat atctattcag gtattacaaa agtcttttac tttctataac
      181 ttactgatta agaggtccta ctttattttc gtcttataca aaatctgacc taagctaata
      241 tacgtcaatc ctctgttctt atttcatcat ctaacgtttg tttttgtttg aaattgtttg
      301 ttttaccttg aaaatattat cttttatgat acaattaaaa gagaattatc tttggaaaaa
      361 aattacttta tgaaagaaag tcttcatatg aacaaaaaac gacgattaga aaaaatttta
      421 gatatgttaa agattgatgg gaccataacc ataaaagaaa taatagatga actagatatt
      481 tccgatatga cagcccgtag agaccttgat gctctagaag ctgatggact tttaacacgt
      541 actcatggtg gtgcacaatt gctttcctct aaaaagccac ttgaaaagac acatatcgag
      601 aagaaaagtc taaatacaaa agaaaaaatt gacattgcta aaaaagcctg ctctttaatc
      661 aaagatggcg atactatttt tattggaccc ggaactacac ttgtacaact ggcattagaa
      721 ttgaaaggtc gtaaaggtta taaaattcgt gtcattacaa atagtctccc tgtgttcttg
      781 attctaaatg atagcgaaac cattgattta ttgcttcttg gcggtgaata tagagaaata
      841 actggagctt ttgtaggttc aatggcttcg acaaatttaa aagcaatgag atttgccaaa
      901 gcttttgttc gtgcaaatgc tgttacccat aattctattg ctacatatag tgacaaggaa
      961 ggtgtgattc aacaacttgc cctaaacaat gctgtagaaa aattcttatt agtagacagt
     1021 actaaattcg atcgatacga tttctttaac ttctacaatc tagatcaact cgataccatc
     1081 attacagata accagattag ccctcaacac ttagaggaat ttagccagta cactactatt
     1141 ttaaaagcgg actagaatta tgacttataa aaatattgga ctactcttaa ataattagac
     1201 ataaaaaaag caccgtatga atcaaacaat tctacggtgt ttttttgtta tttctaatgt
     1261 atggtttgtc gaaaatatgt acacattatt taactttcca aaaaattgga gttttcttga
     1321 taattggata tc
//
LOCUS       MARCMYCA     1391 bp ds-DNA             ROD       28-JUL-1990
DEFINITION  Woodchuck c-myc protein gene, exon 1.
ACCESSION   M35498
KEYWORDS    c-myc protein.
SOURCE      Woodchuck (Marmota monax) DNA.
  ORGANISM  Marmota monax
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; 
            Marmotini.
REFERENCE   1  (bases 1 to 1391)
  AUTHORS   Wei,Y., Hsu,T.Y., Tiollais,P., Buendia,M.A. and Etiemble,J.
  TITLE     Evolutionary conservation of target sequences for cis-acting
            regulation in c-myc exon 1 and its upstream sequences
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Buendia, 21-JUN-1990.
FEATURES       from  to/span     description
    pre-msg     818  >  1391     c-myc mRNA and introns
    IVS        1378  >  1391     c-myc intron 1
    site        159      220     P0 promoter
    site        790      794     TATA box
    site        964      968     TATA box
BASE COUNT      317 a    385 c    383 g    306 t
ORIGIN      
        1 ctcagcgatt agtgcgtctt gcgggaatag ccgcttccca cacccggccg ggtggaagtc
       61 tgagcctgct gggcaaaacg agcgatatct gctgttttgg cagcaaacta ggggattcat
      121 tctgggtggg aagtgcccaa tctagatagc tgtgcataca taatgcataa tgaattacac
      181 tcacacaacc tcaagaaatg taataggtat gtattcataa cactctccaa gtatatgtgg
      241 caaggcattg ctgcgttatt ttaattattc cagaaatcat tttcctccct acctcctctg
      301 tcatttatcc ctaacactcc atatactgaa tgcgcactca taaatattcc ttctgcccgc
      361 ctgtcttcat aagacttatt ttcaaaatgc tgctctttcc ccagccttag ggaggcgccc
      421 ggccgcccgg gacgtgcgtg cgcggccgtg ggtacatggt gtattctcag tgttgagggt
      481 gagggcagct gttccacctg tgttaattgg aacacgcagg acgagaatgc agtttgtcag
      541 agtactgcgc cagaggagca gcagagaaag ggaaaggatt taaacaggag caaaagaaaa
      601 tggtaggcgc gcgcagttaa ttcttgctgc gcccttatac tgtttacatc cgatagctgg
      661 agtgccgggc tgcggggctg agtctcctcc ccttccctca ctcggcagtg cccctcccag
      721 gttcccaaag ccgagggcgg ggagaaagaa aaaaaaaaga ttccgtggaa tccccgccca
      781 ccagcccttt ataatacgag ggtctgcgcg cccgaggacc cctgagctgc gcttctcgtg
      841 gccgccaaca tcgccgcgcc ccggcggccg ctcttggctc ccctcctgcc tagagaaggg
      901 cagggcttct tagaggcttg gcgggaaaaa gaagcgaggg ggagggatcg cgcgtaacag
      961 cagtataaaa gtcgttttcg gggctttatc tcactcgctg tagtaattcc agcgagagac
     1021 agagggagtg agcgggcgaa cccgtgaggt ggaagaaccg agcagagctg ctccgggcgt
     1081 cctgggaagg gaaacccgga gtgaaaggag acttagtctt ctgaccagcg cccccacccc
     1141 agccctcccg cggagcccct ccagggtccg caaccgcgaa actttgccct ttgctgcggg
     1201 cggacacttt gcactggaac ttaaaatacc cgatcgagga cgcgactctc cggagcgggg
     1261 aggctatact gcctatttgg ggacactttt ccccgccttt acccaggacc cgctcctctg
     1321 aaagcgctcc tggctgccgt ttgaaggctg gatttccttc gggtagttga aaacccggta
     1381 agcaccagat c
//
LOCUS       ONGOSTLE      214 bp ds-DNA             INV       28-JUL-1990
DEFINITION  O.volvulus recombinant antigen gene, 3' end.
ACCESSION   M35370
KEYWORDS    T-cell epitope; recombinant antigen.
SOURCE      O.volvulus DNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 214)
  AUTHORS   Colina,K.F., Perler,F.B., Matsumura,I., Meda,M. and Nutman,T.B.
  TITLE     The identification of an Onchocerca-specific recombinant antigen
            containing a T cell epitope
  JOURNAL   Unpublished (19900
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.B.Nutman, 19-JUN-1990.
            
              Author address: T.B.Nutman
              Inst. Lab. of Parasitic Diseases
              NIAID
              National Institutes of Health
              Bldg 4, Rm 126
              9000 Rockville Pike
              Bethesda, MD 20892
              email: tbn@helix.nih.gov
            
FEATURES       from  to/span     description
    pept    <     1       93     recombinant antigen (AA at 1)
    site         91       93     nematode splice junction
BASE COUNT       81 a     37 c     31 g     65 t
ORIGIN      
        1 gaattcagtg taagaagcag cagaacattt caatcattac gaagatatat atacaacatt
       61 tctttcttct tcattcttga gttgcatatg taaattcaaa aataattacg atttaatgaa
      121 ttgagcaagc ataacttttc ccagcaagta taacaaagtt ttgcgaggaa cgaactcaga
      181 aaactttcac ttatgtaaaa ttgcgcacaa gacc
//
LOCUS       PSEIAAL      2766 bp ds-DNA             BCT       28-JUL-1990
DEFINITION  P.syringae IAA-lysine synthetase (iaaL) gene, complete cds.
ACCESSION   M35373
KEYWORDS    IAA-lysine synthetase.
SOURCE      P.syringae savastanoi DNA.
  ORGANISM  Pseudomonas syringae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 2766)
  AUTHORS   Roberto,F., Klee,H., White,F., Nordeen,R. and Kosuge,T.
  TITLE     Expression and fine structure of the gene encoding IAA-lysine
            synthetase from Pseudomonas savastanoi
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.J.Klee, 20-JUN-1990.
FEATURES       from  to/span     description
    pept       1100     2287     IAA-lysine synthetase (iaaL)
    pept         95     1003     ORF1
BASE COUNT      637 a    760 c    754 g    615 t
ORIGIN      
        1 gaattccata gcgtgcgggg cttggaggag cgccgcggcc tgagtatctg tggctaaccc
       61 ttgcggcttc ggtgctggtc gctgtcgagc agctatgcgt gcagatcctg cgcagtgcag
      121 gcttcggcaa acaggcgatg tggaccctgc tgacggggac ggccgccgtt gccatcgcag
      181 atcccctgct tattgtggcg ttcgatctgg gcctggtggg tgccggcatc gctacctgat
      241 atcgagcctg gtatcggcct gtctggggtt ttactacgtt caccgagtcg cccatctgac
      301 ctgtcgggtc agcctgaaga acctgtcagg tgacatcaga aatatcgggc gaaccgcctt
      361 gccagcggtg attggcaacc tggcaactcc agtgggcatg gcctacgtaa tggctgcgat
      421 ggcgccgttc ggatctcagg cgctggcgac tatcggggtg atcgacaggg tcattcaggt
      481 tgctttttgt gtcgtgttcg ccttgcccgg tgcgctgatc ccgatactgg ggcaaaacct
      541 gggcgcaatg aacactgctc gcgtgtctca agccataaag atgacgtacg gattgttgat
      601 cggctacggc tcagtgacct cgctgttact cattctgctc gctgagccat tagccagctt
      661 gtttcatctc gccgctgaac gccaagtcgt gttcttcgcg ttctgccgat ggggcggcgc
      721 tctggacgct catcgggctg caattcattg ccacctcagt cttcctcagt atggggcgac
      781 cggcgtacgt cacactgttc ggctggttcc gcgccacctg ggaaccatgc cgttcgtgtg
      841 gtatggggca cataaatttg gcagcgtcgg ggtaatgctc gggcagttgc tgggtaacac
      901 catagtggcc ttttgtgcct gcgtggctcg cgcatctgct catgaaaaag atgttggaca
      961 tcgagatcca ttcaataggg aaccgatccc tccacaggag taactgataa tccacgtttt
     1021 gcccaccctt ggctgtcgtc aggtgggcag gatgtccagg atgtccagga aatcaaaaaa
     1081 cggactatag aggactcgca tgactgccta cgatatggaa aaggaatgga gtagaatttc
     1141 cattactgcc gctaaaatcc accagaacaa cgattttgaa ggattcactt atcaggactt
     1201 cagaacccac gtaccgatca tggacaaaga cggcttcgcg gcacagactg aacgctgtct
     1261 agagcgcaat gagcgaaact gcctgatcgg ctttaccagt ggcaccagcg gcaacatcaa
     1321 acgctgttat tactactacg actgcgaagt cgatgaagac agctccctct ccaacgtctt
     1381 ccgcagcaac ggctttattc tgcccggtga tcgctgcgcc aacctgttca cgatcaacct
     1441 gttttctgct ctgaacaaca cgattaccat gatggccggt aactgcggtg ctcacgtcgt
     1501 gtccgtaggt gacatcaccc tggtgaccaa gagccatttc gaagcgctta actcgatcaa
     1561 gctcaacgta ctgctcggcg tgccatccac tatcttgcag ttcatcaatg ccatgcaaca
     1621 taacggtgtg cacatcaata tcgagaaggt tgtcttcacc ggcgagagcc tgaaaacttt
     1681 ccagaagaaa atcatcaggc aagcctttgg cgaacaagtc tccatcgtcg gtgtgtatgg
     1741 cagttccgag ggcggcattc tcggtttcac caacagccct tgccacactg aatacgagtt
     1801 tctgtccgac aagtatttca tcgaaaaaga aggcgacagc atcctcatca cctcgctgac
     1861 ccgagaaaac tttacgccgc tgctgaggta tcgcctagga gacaccgcaa ccctttcgat
     1921 gaaaggcgac aagctctacc tgacagacat ccagcgggag gacatgagct tcaacttcat
     1981 gggcaacctc atcgggctgg gcatcattca gcaaacgatt aaacagacac tgggccgatc
     2041 gctggaaatc caggttcacc tgtcagtgac cgaagagcgc aaggaactgg tgaccgtttt
     2101 cgttcaggcc tctgaagtcg atgaagacga acgcgtcaga atcgaaacag ccatcgccga
     2161 tatccccgac atcaaagagg cgtatcagaa aaaccaaggc accgtgtcgg tcctgcgcaa
     2221 ggatgccaga gactacgcgg tctcggagcg aggcaaaatg ctctacatca tcgaccgccg
     2281 aaactgaatg gctgatgtga acgagtgagt agctgcaccg acggggcctt tggcggtgtc
     2341 ggtgcagttt tttagaggat tcggaagcgc cagaggtcag agtccacgaa actggaacga
     2401 actgggcagc ctgcggctgc aaattgtggg attttgaaat cggttatcat agccgaaatc
     2461 gagtcgatcc ctcctcagca caggcttaca catggcgtca gagaccaaaa aacgtaaacg
     2521 ggcgagccgg gcaaaagcca aggcaaagca gacccgtctc caacgcgccg ggcatactac
     2581 cttcgtgccc gataccgact tttccttcga tatcgatcct ttcggtgatg tcgatctttg
     2641 tagttgctgc cagacaacgt atctgaacga catgtttccc gacgcttctt gcgtaaggct
     2701 ttagatgaga gaagggccag gcggattcgc atcaccgccg tcattcacca cgatgaggag
     2761 ccgcct
//
LOCUS       RABLPBA      1446 bp ss-mRNA            MAM       28-JUL-1990
DEFINITION  Rabbit lipopolysaccharide binding protein (LBP) protein mRNA,
            complete cds.
ACCESSION   M35534
KEYWORDS    lipopolysaccharide binding protein.
SOURCE      Rabbit liver, cDNA to mRNA.
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (ases 1 to 1446ites; for [2]
  AUTHORS   Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
            Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
  TITLE     Structure and function of lipopolysaccharide binding protein
  JOURNAL   Science (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1446)
  AUTHORS   Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
            Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.S.Tobias, 21-JUN-1990.
            
               Author address: P.S.Tobias
               Department of Immunology, IMM-12
               10466 N. Torrey Pines Rd
               La Jolla, CA 92037
FEATURES       from  to/span     description
    pept          1  >  1446     lipopolysaccharide binding protein (LBP)
                                 precursor
    sigp          1       78     lipopolysaccharide binding protein signal
                                 peptide
    matp         79     1446     lipopolysaccharide binding protein
BASE COUNT      316 a    454 c    391 g    285 t
ORIGIN      
        1 atggggacct gggccagggc cctgctgggg tccaccctgc tgagcctgct gctcgcagct
       61 gccccgggag ctctgggcac caaccccggc ctcatcacca ggatcaccga caaaggcctg
      121 gagtacgcgg ccagggaggg gctgctggct ctgcagagaa agctcctgga agtcacgctg
      181 ccggattccg atggggactt caggatcaaa catttcgggc gtgcacagta caagttctac
      241 agtctgaaaa tccccagatt cgagctgctc cgtggcaccc tgaggcccct ccccggccag
      301 ggcctgagtc tcgacatctc cgacgcctac atccacgtgc ggggcagctg gaaggtgcgc
      361 aaggcgttcc tgagactgaa gaactccttt gacctgtatg tcaagggcct caccatttcc
      421 gtccacctcg tgttgggcag cgagtcctcc gggaggccca cggtcaccac ctccagctgc
      481 agcagcgaca tccagaacgt ggagttggac atagaggggg acctggagga gctgctgaac
      541 ctcctccaaa gccagatcga tgccaggctg cgcgaagtgc tggagagcaa gatttgcagg
      601 cagattgagg aagccgtgac ggcccacctg cagccttatc tacagacact gccagtcaca
      661 acgcagatcg acagctttgc cggcattgac tacagcttga tggaggcccc ccgggcaaca
      721 gctgggatgt tggatgtgat gtttaagggt gaaattttcc ctctggatca ccgcagccca
      781 gtggacttcc ttgctccagc catgaacctc cccgaggctc acagccgaat ggtctacttt
      841 tccatctccg attacgtctt caacaccgcc agcctggcct accacaagtc agggtactgg
      901 aacttctcca tcacagacgc catggttccg gccgacctca acatccggcg gaccaccaag
      961 tccttccgac ccttcgttcc cctgcttgcc aatctctacc ccaacatgaa cttggagctc
     1021 caagggacag tgaactcgga acaactggtg aacctcagca ccgagaatct gttagaggaa
     1081 cccgagatgg atattgaggc cttggtggtc ctgcccagct ctgccaggga gcctgtcttc
     1141 cggctgggtg tggccactaa tgtgtctgcc acactgacct tgaacaccag gaagatcact
     1201 gggttcctga agccgggaag gctacaggtg gaactgaaag aatccaaagt cggaggattc
     1261 aatgtggagc tgttggaagc tctcctcaac tactacattc tcaacaacct ctaccccaag
     1321 gtcaatgaga agttggccca ccgcttcccg ctccctctgc tgaggcacat tcagctctac
     1381 gacctgcttc tccagaccca cgagaacttc ctgctcgtgg gcgccaacat ccagtacagg
     1441 agagtt
//
LOCUS       RATUKATPA    1484 bp ss-mRNA            ROD       28-JUL-1990
DEFINITION  R.norvegicus gastric (H+,K+)-ATPase beta-subunit mRNA, complete
            cds.
ACCESSION   M35535
KEYWORDS    (H+,K+)-ATPase beta-subunit.
SOURCE      R.norvegicus gastric mucosa oxyntic cell, cDNA to mRNA, clone RG4.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1484)
  AUTHORS   Canfield,V.A., Okamoto,C.T., Chow,D., Dorfman,J., Gros,P.,
            Forte,J.G. and Levenson,R.
  TITLE     Cloning of the H,K-ATPase beta subunit: Tissue-specific expression,
            chromosomal assignment, and relationship to Na,K-ATPase beta
            subunits
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by V.A.Canfield, 21-JUN-1990.
            
              Author address: V.A.Canfield
              Yale University School of Medicine
              Dept. of Cell Biology
              P.O. Box 3333
              New Haven, CT 06510
              email: levenson@YALEMED
            
FEATURES       from  to/span     description
    pept        176     1060     gastric (H+,K+)-ATPase beta-subunit (E.C.
                                 3.6.1.3)
    mRNA          1     1484     gastric (H+,K+)-ATPase beta-subunit
BASE COUNT      363 a    457 c    362 g    302 t
ORIGIN      
        1 ctgacttctg ggacagtgga ggacagatag cacgcaagcc ccagccctcc cttatgttta
       61 tagaggcgat agcggagaac tgatagctgg ttctgatgcc tttggcctca cacagaggag
      121 actataagcc ccagaggacg ctccctgggc ccagtccagg caagcaggag aggacatggc
      181 agccctgcag gagaagaagt catgcagcca gcgcatggcc gaattccggc aatactgttg
      241 gaacccggac actgggcaga tgctgggccg caccccagcc cggtgggtgt ggatcagcct
      301 gtactatgca gctttctacg tggtcatgac tgggctcttt gccttgtgca tctatgtgct
      361 gatgcagacc attgatccct acacccccga ctaccaggac cagttaaagt caccgggggt
      421 aaccttgaga ccggatgtgt atggggaaag agggctgcag atttcctaca acatctctga
      481 aaacagctcc tgggctggcc tcacacacac cctccacagc ttcttagcgg gctacacccc
      541 agcatcccag caggacagca tcaactgttc gtctgaaaag tacttcttcc aggagacctt
      601 ttctgctccg aaccatacca agttctcctg caagttcacg gcggacatgc tacagaattg
      661 ctcaggcctg gtggacccca gtttcggctt tgaggaggga aagccctgct tcattattaa
      721 aatgaacagg attgtcaagt tcctgcccag caacaacacg gctccccgag tggactgcac
      781 cttccaggat gacccccaaa agccccggaa ggacattgaa cccctgcagg tccagtacta
      841 tccccccaat ggtaccttca gtctccacta cttcccctac tacggcaaga aagcacagcc
      901 ccactacagc aaccctctgg tggcggcaaa gttcctcaac gtccccaaaa acacgcaagt
      961 cctcatcgtg tgcaagatca tggcggacca cgtgaccttc gacaaccccc acgaccccta
     1021 tgaagggaag gtggagttca agctcacaat acagaagtaa ggagtaggcg tggctgtcca
     1081 ccccagagcc tggtggaccc tgagggacca ctcttcctga ctgacatcat cggctggcca
     1141 gcatgcacgg ccacttcatg gttcagagct gacaccactg cccatctgcc gacagcagga
     1201 agtgctcctt cccagcactc cctgagcacc accagctttg aactgaaacc cgacgtgcgc
     1261 acgcacgttt gcaatcccgt gcggttaaca caggaaccca gagtccggct accactaagg
     1321 gacaacccat ctgtagggca tttctatcct gtgaccattt gtctgtcctg cactttgata
     1381 tgaactatgg gtccacatca gtgtaacact ggtcaccccg gcctccagtt tgtgcttctg
     1441 gggccacagc ccctaggtca ttaaaacaaa ctatagtaaa gtta
//
LOCUS       YSCMYO2A     5675 bp ds-DNA             PLN       28-JUL-1990
DEFINITION  S.cerevisiae myosin-1 isoform (MYO2) gene, complete cds.
ACCESSION   M35532
KEYWORDS    myosin-1.
SOURCE      S.cerevisiae (strain GRF88) DNA, clone 10-2B.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 5675)
  AUTHORS   Johnston,G.C., Prendergast,J.A. and Singer,R.A.
  TITLE     The S.cerevisiae MYO2 gene encodes an essential myosin for
            vectorial transport of vesicles
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.C.Johnston, 21-JUN-1990.
            
              Author address: G.C.Johnston
              Dalhousie University
              7E Tupper Medical Bldg.
              Halifax, N.S.
              CANADA B3H 4H7
              email: JOHNSTON@AC.DAL.CA
            
FEATURES       from  to/span     description
    pept        581     5305     myosin 1 isoform (MYO2)
    binding     138      171     ATP-binding site
    binding     443      523     actin-binding site
    rpt         926      981     heptad repeat
    rpt        1010     1086     heptad repeat
BASE COUNT     1994 a    986 c   1110 g   1585 t
ORIGIN      Chromosome 15 right arm.
        1 gatcaataaa taaataggct cgaagacgcc tcagaactcc ggtcactggt ttgtcttgtt
       61 gatatacgat gtgccaagcg ccgtttctcg atgcttatct ggtttagttt acgctgttaa
      121 aaccaaaacc ccaacagatt ttcgacccta acgtatgtag ggctaaaata gatattgagt
      181 aggttacaat taattattgg caattgcacc tagtgacaca tttacgaaaa cgtagggcaa
      241 aaactattac ccgacccagg gctattttgt gattttttcc ttttttttgt ttatgatcgc
      301 gcttctcgaa aagccaaata tcagaaatcc caaacacgcc ttcatttgat acgattcgta
      361 gcctgcgttt cagagatcta tcaactttgc aaggccaatc agagaacaaa aaagtctcgc
      421 aaagtcattt cacttttctc gcttgaaatt attcgttcga tttctggctg cttgcttgtt
      481 ttttgttttc taaggtacta ttcgacacca ttccattgga cagcgatact tataccattg
      541 tacatatagg acataaaaac agcagatatt acagcgtata atgtcttttg aagtgggtac
      601 acgatgctgg tatccccata aagaattggg ctggattggg gcggaagtaa tcaaaaatga
      661 gttcaacgac ggcaagtacc acctggagtt acaattggaa gacgatgaaa tcgtgtccgt
      721 ggacacaaaa gacttgaata acgataagga ccaatctcta ccgcttctta gaaaccctcc
      781 cattttggaa gcaacggaag atttgacctc tttatcttac ttgaatgagc cagctgtttt
      841 acatgccatc aaacagcgct attctcaatt gaatatctac acatactcgg gtattgttct
      901 gattgctaca aacccttttg atcgtgtcga ccagctttat acacaagaca tgatccaagc
      961 atatgcggga aagcgcagag gtgaactgga acctcacttg tttgccattg ccgaagaagc
     1021 gtataggttg atgaaaaatg acaaacaaaa tcaaaccatt gtggtaagtg gtgaatctgg
     1081 tgctggaaaa acggtttctg ccaagtatat tatgcgttat tttgcttctg tagaagagga
     1141 aaattccgct actgtacaac atcaagtgga aatgtcggaa acagaacaaa agattctagc
     1201 tacaaaccct atcatggaag catttggtaa tgctaagact accagaaatg acaattcttc
     1261 cagatttggt aagtatctag aaattttatt cgataaggac acatctatta ttggagcaag
     1321 gatccgcaca tacttgttgg aacggtccag attagtttac cagccgccaa ttgagagaaa
     1381 ctaccacata ttttatcaat taatggctgg attaccagct caaaccaagg aggaattgca
     1441 tcttaccgat gcctcagatt acttctacat gaaccaaggc ggtgacacca agatcaacgg
     1501 tattgatgat gccaaagaat acaaaattac agtagatgca ttgacattag tcggaatcac
     1561 caaggaaact caacaccaaa tatttaagat cttggccgca cttctgcata tcggtaacat
     1621 agaaattaaa aaaactagaa atgatgcatc actatcagct gatgagccaa acctgaaact
     1681 ggcgtgcgaa ttgctgggaa ttgatgccta caactttgcc aaatgggtca ccaaaaagca
     1741 gatcattaca aggtcagaga aaattgtttc gaatctaaat tatagtcaag ctctggttgc
     1801 caaagattcc gtggctaagt ttatttattc cgcccttttc gattggcttg tggaaaatat
     1861 caacaccgtg ttatgcaacc cggctgtgaa cgaccaaatt agctcattta ttggtgttct
     1921 ggatatttat gggtttgaac attttgaaaa aaattcattt gaacaatttt gtattaacta
     1981 tgccaacgaa aaactacaac aagagttcaa ccaacatgtt ttcaaattag agcaagaaga
     2041 atacgttaaa gaagaaattg aatggtcttt tatagagttt aatgataatc aaccttgtat
     2101 tgatctgatt gaaaacaagt tgggtatttt atcactgctt gacgaagaaa gtaggttacc
     2161 tgctggttcc gacgaatctt ggacccaaaa actttatcaa actttggata aatctcctac
     2221 gaacaaagta ttttctaaac caagattcgg gcaaactaaa tttatcgtga gccattatgc
     2281 tctagatgtc gcttatgatg tggaaggatt tattgaaaaa aatagagaca ccgtatctga
     2341 cggacatttg gaagtgttga aggcttctac caacgagaca ctaataaata tcttagaggg
     2401 attagaaaaa gctgccaaaa aactggaaga agcgaaaaag cttgaattag agcaggctgg
     2461 cagtaaaaag ccaggtccga taagaacggt taacaggaaa cccactttag gttccatgtt
     2521 taagcaatct ttgattgaac taatgaatac catcaactca actaatgttc attatattcg
     2581 ttgtataaag cctaatgcag ataaagaagc ttggcaattt gataatttga tggtgttgtc
     2641 tcaactcaga gcctgtggtg ttttggaaac tattagaata tcttgtgctg ggtttccttc
     2701 taggtggact tttgaagaat ttgtattaag atattacatc ttgataccac atgagcagtg
     2761 ggacctaatc ttcaaaaaaa aggaaactac agaagaagat atcatatcag tggttaaaat
     2821 gatcctagat gctactgtaa aggacaaatc caagtaccag attggtaata caaaaatttt
     2881 cttcaaagca ggtatgcttg catatctgga aaaacttaga agcaataaga tgcataattc
     2941 aattgttatg atccagaaga aaattagagc taaatattac cgtaagcagt atttgcaaat
     3001 atctcaggcc atcaagtatt tgcagaacaa catcaaaggt ttcatcattc gtcaacgcgt
     3061 taatgatgaa atgaaagtta actgtgcaac tttattacag gccgcttaca ggggtcattc
     3121 catccgtgcc aatgtgttca gcgtattgag aacaattaca aatttgcaaa agaaaattag
     3181 aaaggaacta aaacaaagac aactgaaaca agaacatgaa tataatgctg cggtaactat
     3241 tcaaagtaaa gttaggacct ttgagccgag atcgagattt ttacgcacta aaaaagacac
     3301 tgttgttgtc caatctttga tcagaagaag agctgctcaa aggaaattga aacaattgaa
     3361 ggcagacgct aaatcagtta atcatctgaa agaagtgagc tataaattag agaataaagt
     3421 gattgaactg acgcagaatc tagcatccaa ggtcaaagaa aataaagaaa tgacagaaag
     3481 aattaaagaa ctacaggttc aagtggaaga aagtgccaag ttacaagaga cattagaaaa
     3541 tatgaaaaaa gagcacttaa tagatattga taatcagaaa tctaaggata tggaattaca
     3601 aaaaactatt gagaacaatt tgcaatccac tgaacaaact ctaaaggacg ctcaattaga
     3661 gttggaggac atggttaaac aacatgatga attgaaagaa gaatctaaaa agcaacttga
     3721 agaattagag caaacaaaga aaacattggt tgaataccag acattaaacg gagacttgca
     3781 aaacgaagtt aaatctttaa aggaagaaat tgctaggtta caaactgcca tgtcgctggg
     3841 caccgttact actagtgtac tacctcaaac accattaaag gatgtaatgg gaggcggtgc
     3901 ttcaaatttc aacaatatga tgcttgagaa ttccgactta tctcctaatg atttgaatct
     3961 aaagtctaga tctactccat cgtccggaaa caaccacatt gattcattga gtgtcgatcg
     4021 cgaaaatggt gtcaatgcta cacaaatcaa tgaagagtta tacaggttat tggaggacac
     4081 tgaaattttg aatcaagaaa tcacggaagg cctgttaaag ggattcgaag taccggatgc
     4141 tggtgtagct attcaactaa gtaaaagaga cgttgtttat ccggctagaa tactgattat
     4201 agttttaagt gaaatgtgga gatttgggct gaccaagcaa agtgaaagct ttcttgccca
     4261 agtattgact acaattcaaa aagttgtcac tcaattgaag ggtaacgatt taattccaag
     4321 cggtgtattc tggttagcaa acgttagaga gttatactca tttgtggtgt ttgctctaaa
     4381 ctctatttta accgaagaaa cgttcaaaaa cggcatgacc gatgaggagt ataaggagta
     4441 tgtttcattg gtcacagaac taaaggatga tttcgaagct ctaagttata atatatataa
     4501 catttggctg aagaaattgc agaagcaatt gcaaaaaaag gccatcaatg ctgtggtcat
     4561 ctccgaatca ttaccaggtt tcagcgcggg agaaaccagc gggtttttga acaaaatttt
     4621 tgctaacact gaagaatata caatggacga cattttgacc tttttcaaca gcatatactg
     4681 gtgcatgaaa tcttttcata ttgagaatga agtgttccat gctgtagtca caaccttatt
     4741 gaattatgtg gatgcaattt gttttaacga attaatcatg aaacgtaatt tcttgtcgtg
     4801 gaaaaggggt cttcaattga actacaacgt tactagatta gaggaatggt gcaagacgca
     4861 tggcttgaca gatggtactg agtgcttaca acatttgatt cagaccgcta agctactgca
     4921 agtccgtaag tatactatcg aagacattga tatcttaaga ggaatttgtt attcgctaac
     4981 acctgcacaa ttgcaaaaat tgatttcaca ataccaggtg gcagactatg agtctccaat
     5041 tccacaggaa atcttaagat acgttgctga tatagttaag aaagaagctg cgttatcttc
     5101 atcaggtaat gattctaagg gtcacgagca tagcagcagt atatttatca ctccagaaac
     5161 aggtccattt actgacccat tcagtttgat aaagacaaga aaatttgacc aagtagaagc
     5221 ctatatacca gcgtggttat ccttgccctc aactaagaga atagttgacc ttgttgccca
     5281 acaagtcgtt caagacggcc actaaaactg atggcgcgag aaacaaaatt gtacatgaat
     5341 gctaaaaaaa gaaatgacaa aaaaagagaa aaaaaaaaat gaaactacat agttaattaa
     5401 taatagaagt atttgtcaat agtatgataa tgaaatcgat attatggaag atattaaccg
     5461 cgcgccgtat tagtgtacac tatattaaac tacattttgc ttcttactga atttataaat
     5521 tatgattata ttattattac tattatgact actgtatata tttttagaat tagatcggga
     5581 accgatgagc gttagctgaa atggacgacg ataaggaacg ataattacca ctagtaaaat
     5641 aataacaact aagaataaac acattctcat tttta
//
LOCUS       PTUB256       205 bp ds-DNA             SYN       28-JUL-1990
DEFINITION  Synthetic pTUB256 alpha-amylase gene promoter region.
ACCESSION   M36663
KEYWORDS    alpha-amylase.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 205)
  AUTHORS   Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
  TITLE     Two tandemly located promoters, artificially constructed, are
            active in a Bacillus subtilis alpha-amylase secretion vector
  JOURNAL   J. Biochem. 99, 1181-1190 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        164  >   205     synthetic alpha-amylase
BASE COUNT       73 a     29 c     44 g     59 t
ORIGIN      
        1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaagcgtga acaaaattct
       61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga
      121 ctccgaagta agtcttcaaa aaatcaaata aggagtgtca agaatgtttg caaaacgatt
      181 caaaacctct ttactgccgt tattc
//
LOCUS       PTUB261       232 bp ds-DNA             SYN       28-JUL-1990
DEFINITION  Synthetic pTUB261 alpha-amylase gene promoter region.
ACCESSION   M36664
KEYWORDS    alpha-amylase.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 232)
  AUTHORS   Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
  TITLE     Two tandemly located promoters, artificially constructed, are
            active in a Bacillus subtilis alpha-amylase secretion vector
  JOURNAL   J. Biochem. 99, 1181-1190 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        191  >   232     synthetic alpha-amylase
BASE COUNT       82 a     28 c     52 g     70 t
ORIGIN      
        1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaagcgtga acaaaattct
       61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga
      121 ctcggggttg ttattatttt atcgatatgt aaaatataat ttctagaaga aaagaaggtg
      181 gagaggaaac atgatccaaa aacgattcaa aacctcttta ctgccgttat tc
//
LOCUS       PTUB263       232 bp ds-DNA             SYN       28-JUL-1990
DEFINITION  Synthetic pTUB263 alpha-amylase gene promoter region.
ACCESSION   M36665
KEYWORDS    alpha-amylase.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 232)
  AUTHORS   Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
  TITLE     Two tandemly located promoters, artificially constructed, are
            active in a Bacillus subtilis alpha-amylase secretion vector
  JOURNAL   J. Biochem. 99, 1181-1190 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        191  >   232     synthetic alpha-amylase
BASE COUNT       78 a     40 c     49 g     65 t
ORIGIN      
        1 aagcactccc gcgatcgcct atttggcttt tccccaaaat gtaagcgtga acaaaattct
       61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga
      121 ctcggggttg ttattatttt atcgatatgt aaaatataat ttctagaaga aaagaaggtg
      181 gagaggaaac atgatccaaa aacgattcaa aacctcttta ctgccgttat tc
//
LOCUS       PTUB265       214 bp ds-DNA             SYN       28-JUL-1990
DEFINITION  Synthetic pTUB265 alpha-amylase gene promoter region.
ACCESSION   M36666
KEYWORDS    alpha-amylase.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 214)
  AUTHORS   Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K.
  TITLE     Two tandemly located promoters, artificially constructed, are
            active in a Bacillus subtilis alpha-amylase secretion vector
  JOURNAL   J. Biochem. 99, 1181-1190 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        173  >   214     synthetic alpha-amylase
BASE COUNT       73 a     28 c     50 g     63 t
ORIGIN      
        1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaatcgtga acaaaattct
       61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga
      121 ctcggggttg ttattatttt atcgctagaa gaaaagaagg tggagaggaa acatgatcca
      181 aaaacgattc aaaacctctt tactgccgtt attc
//
LOCUS       RABMEPHA     1653 bp ss-mRNA            MAM       28-JUL-1990
DEFINITION  Rabbit microsomal epoxide hydrolase.
ACCESSION   M21496
KEYWORDS    microsomal epoxide hydrolase.
SOURCE      Rabbit (New Zealand White) adult liver cDNA to mRNA, clone pEH.
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 1653)
  AUTHORS   Hassett,C., Turnblom,S.M., DeAngeles,A. and Omiecinski,C.J.
  TITLE     Rabbit microsomal epoxide hydrolase: Isolation and characterization
            of the xenobiotic metabolizing enzyme cDNA
  JOURNAL   Arch. Biochem. Biophys. 271, 380-389 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable copy for sequence [1] kindly
            submitted by C.Hassett 12-JAN-1989.
FEATURES       from  to/span     description
    pept        148     1515     microsomal epoxide hydrolase (EC 3.3.2.3)
BASE COUNT      351 a    505 c    465 g    332 t
ORIGIN      
        1 cggcatccgc aaggacctgt acgccaacac ggtgctgtct cgcctctccc gcagctctgc
       61 agtgtcgccg tgcgcagagt tccacagctc tgcttcccaa gcaggtgagc agaggctgac
      121 aacacagcgc ccttgtggac aggagccatg ttgctggaac tccttctcgc ctcggtgctg
      181 ggcttcgtca tctactggtt cgtctctgga gacaaggagg agagtctgcc actggaggat
      241 gggtggtggg gcccggggtc gaggcccgta ggcctggagg acgagagcat ccggcccttc
      301 aaggtggaga cgtcggacga agagatcaac gacttacacc agaggatcga caggatccgc
      361 ttgaccccac ctttggagaa cagccgcttc cactacggct tcaactccaa ctacctgaag
      421 aagatcctct cctactggag gcacgaattc gactggaaga agcaagtgga gattctgaac
      481 tcataccctc acttcaagac caagatcgaa gggctggaca tccacttcat ccacgtgaag
      541 cccccgcagg tgccccctgg ccgcacccca aagcccttgc tgatggtgca tggctggccc
      601 ggctccttct tcgagttcta caaaatcatc ccgctgctga ctgaccccaa gagccacggc
      661 ctgagcgatg agcacatctt tgaagtcatc tgcccttcca ttccaggcta tggcttctca
      721 caggcatctt ccaagaaggg cttcaactcg gtgagcaccg ccaggatctt ctacaagctg
      781 atgctgcggc tgggcttcca ggagttctac atccagggcg gggactgggg ggccctggtc
      841 tgcacgaaca tggcccagct ggtgcccagc cacgtgaaag gtctgcactt gaacatggct
      901 ttgattttaa gaaatcacta cactctgacc ctcctgctgg gacggcgcat cgggggactt
      961 cttggctaca ctgagaggga catggagctg ctgtacccct tcaaggagaa ggtgttctac
     1021 agtctgatga gggagagcgg ctacatgcac atccgggcca ccaagcccga cactgtgggc
     1081 tgtgctctga atgactctcc tgtgggactg gctgcataca ttctagagaa attttccacc
     1141 tggaccaact cagaattccg agacctggag gacggaggcc tggagaggaa gttctccctg
     1201 caggacctgc tgaccaacat catgatctac tggaccactg gctccatcgt ctcctcccag
     1261 cgctactaca aggagaacct gggccagggc ttcatggccc acaagcatga gcggctgaag
     1321 gtccacgtgc ccacgggctt cgcagccttc ccgtgtgaga taatgcatgt gccagagaag
     1381 tgggtgagga ccaagtaccc gcagctcatc tcctactcct acatgccccg cgggggccac
     1441 ttcgccgcct tcgaggagcc ggagctgctg gcccgggaca tctgcaagtt cgtggggctg
     1501 gtggagcggc agtgatgctc ccagccttgc ctggggtgag gggtcggctt gcctcctccc
     1561 ctggcctgct ggaacccacc tcaggcctcc atactcactg tctcaccccc atggcgtggc
     1621 tgataaatga tttgactccc aaaaaaaaaa aaa
//
LOCUS       XELBETA      1138 bp ss-mRNA            VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A1 mRNA, complete cds.
ACCESSION   M35359
KEYWORDS    thyroid hormone receptor beta A1 protein.
SOURCE      X.laevis, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1138)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
FEATURES       from  to/span     description
    pept          1     1110     thyroid hormone receptor beta A1 protein
BASE COUNT      351 a    228 c    283 g    276 t
ORIGIN      
        1 atggaagggt atatacccag ctacttggat aaagatgagc tatgcgtggt gtgtggagac
       61 aaggctacag ggtatcatta tagatgtatc acctgcgagg gctgcaaggg cttttttaga
      121 agaactattc agaagaacct ccacccaagc tattcttgta aatatgaagg aaaatgtgtt
      181 atagacaaag taacaagaaa ccagtgccaa gaatgtcgct tcaaaaagtg catcgctgtt
      241 ggaatggcaa cagacttggt tttggatgac aacaaacgtt tggcaaaaag aaagctcata
      301 gaagaaaaca gagaaaaaag acggaaagat gagattcaga aatcacttgt tcagaaacct
      361 gaacccacac aagaagaatg ggagttgata caagttgtca ctgaagcaca tgtggccacc
      421 aacgcacagg gaagccactg gaaacagaaa agaaaatttt tgccagagga cattggacaa
      481 gctcccatag ttaatgcgcc cgagggtgga aaagtggact tagaagcctt cagccagttt
      541 acaaaaataa tcaccccagc aattacaaga gttgttgatt ttgccaaaaa gctacctatg
      601 ttttgtgagc tgccatgtga agaccagatc atccttctta aaggctgttg tatggagatc
      661 atgtcgctcc gagcagcagt gcgttatgac cccgaaagtg aaactctaac gttaaatggt
      721 gagatggcag tgacaagggg gcagctaaaa aatggaggac ttggagtggt ttcagatgcc
      781 atctttgact taggggtatc gctttcttca ttcagtcttg atgataccga agtcgccttg
      841 ttgcaggctg tgctgcttat gtcatcagat cggcctggtc ttgctagcgt ggagagaata
      901 gaaaagtgcc aggaaggttt cctcttggct tttgaacact acattaatta caggaaacat
      961 aacattgcac acttttggcc aaaactgctg atgaaagtca ccgacctccg catgattgga
     1021 gcgtgccacg ccagccggtt cctgcacatg aaggtggagt gccccactga actgtttccc
     1081 ccactgttct tggaagtgtt tgaggactag aacagactgt gcttctggat tctcagca
//
LOCUS       XELBETA1      259 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A gene, exon A.
ACCESSION   M35345
KEYWORDS    thyroid hormone receptor beta A protein.
SEGMENT     1 of 8
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 259)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            A unique procedure for translation determination reveals that
            alternate translation initiation occurs at exons G and H of the
            beta A thyroid hormone receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       63 a     55 c     88 g     53 t
ORIGIN      
        1 aaattgggat ctatcctggg agagaatgga aatagacgac agcgctttat cctgactgaa
       61 ctgaggcagg ggtaacgctg ggagtgactg gcatagcagg ggctgcgggg aggcacttca
      121 gtccgtgcca agtccaacat tgtagctagt gacgagaatc gtactacagt gcgggctctc
      181 actaagtgac gctcgaattc gggaagaacg acgcggcagc tgttgcatta tggtgcgtct
      241 gtaggtcgga gagccggcg
//
LOCUS       XELBETA2       97 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A gene, exon B.
ACCESSION   M35346
KEYWORDS    thyroid hormone receptor beta A protein.
SEGMENT     2 of 8
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 97)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            A unique procedure for translation determination reveals that
            alternate translation initiation occurs at exons G and H of the
            beta A thyroid hormone receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       14 a     32 c     24 g     27 t
ORIGIN      
        1 atttcaggac agcccagcgc cctggtgcac gatcagctgt agatctccct gtctgtgtcg
       61 ctgctgccgc tgctacttca gttcctctga ctgtcag
//
LOCUS       XELBETA3       44 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A gene, exon C.
ACCESSION   M35347
KEYWORDS    thyroid hormone receptor beta A protein.
SEGMENT     3 of 8
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 44)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            A unique procedure for translation determination reveals that
            alternate translation initiation occurs at exons G and H of the
            beta A thyroid hormone receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       17 a      5 c     13 g      9 t
ORIGIN      
        1 atgttgaaga ctgattgggg ttaagcaggc acatacaaga aaag
//
LOCUS       XELBETA4       79 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A gene, exon D.
ACCESSION   M35348
KEYWORDS    thyroid hormone receptor beta A protein.
SEGMENT     4 of 8
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 79)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            A unique procedure for translation determination reveals that
            alternate translation initiation occurs at exons G and H of the
            beta A thyroid hormone receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       32 a     12 c     21 g     14 t
ORIGIN      
        1 acagaagccg tgaaccaatg cagaattaca ggaaaggacg aggattgaaa catctgtaca
       61 tgagaaggaa tttctgaag
//
LOCUS       XELBETA5       72 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A gene, exon E.
ACCESSION   M35349
KEYWORDS    thyroid hormone receptor beta A protein.
SEGMENT     5 of 8
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 72)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            A unique procedure for translation determination reveals that
            alternate translation initiation occurs at exons G and H of the
            beta A thyroid hormone receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       19 a     15 c     20 g     18 t
ORIGIN      
        1 ttaaagttga agtatttctg gtcaggtgat ctctgaggca gcgcacaggc cctcacaaaa
       61 tggtggctca ag
//
LOCUS       XELBETA6       46 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A gene, exon F.
ACCESSION   M35350
KEYWORDS    thyroid hormone receptor beta A protein.
SEGMENT     6 of 8
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 46)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            A unique procedure for translation determination reveals that
            alternate translation initiation occurs at exons G and H of the
            beta A thyroid hormone receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       17 a     11 c      8 g     10 t
ORIGIN      
        1 gttcctctca agcccaggaa caaaaaccgg aaatttttca aatgag
//
LOCUS       XELBETA7       64 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A gene, exon G.
ACCESSION   M35351
KEYWORDS    thyroid hormone receptor beta A protein.
SEGMENT     7 of 8
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 64)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            A unique procedure for translation determination reveals that
            alternate translation initiation occurs at exons G and H of the
            beta A thyroid hormone receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
FEATURES       from  to/span     description
    pept         46  >    64     thyroid hormone receptor beta A protein, exon
                                 G (first expressed exon) (alt.)
BASE COUNT       20 a     13 c     14 g     17 t
ORIGIN      
        1 gctatatgtg attcttagaa gaatgagcgg accttccaat ccataatgcc aagcagtatg
       61 tcag
//
LOCUS       XELBETA8      191 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A gene, exon H.
ACCESSION   M35352
KEYWORDS    thyroid hormone receptor beta A protein.
SEGMENT     8 of 8
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 191)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            A unique procedure for translation determination reveals that
            alternate translation initiation occurs at exons G and H of the
            beta A thyroid hormone receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
FEATURES       from  to/span     description
    pept        185  >   191     thyroid hormone receptor beta A protein, exon
                                 H (first expressed exon) (alt.)
BASE COUNT       57 a     31 c     37 g     66 t
ORIGIN      
        1 gcagagtata tggtttagaa gaactaacac agaagttttt tgttggacac tactctccat
       61 aatgacaatg agatttccat tgtaacatcc taattgtaac cagtaatcag agatgctgct
      121 tggacagtgc ttacagcttt tttaaagaga ttttttattt ttgctttgca tcgaaccgtg
      181 tactatggaa g
//
LOCUS       XELBETAB     1150 bp ss-mRNA            VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta A5 mRNA, complete cds.
ACCESSION   M35360
KEYWORDS    thyroid hormone receptor beta A5 protein.
SOURCE      X.laevis, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1150)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
FEATURES       from  to/span     description
    pept          1     1122     thyroid hormone receptor beta A5 protein
BASE COUNT      354 a    232 c    285 g    279 t
ORIGIN      
        1 atgccaagca gtatgtcagg gtatataccc agctacttgg ataaagatga gctatgcgtg
       61 gtgtgtggag acaaggctac agggtatcat tatagatgta tcacctgcga gggctgcaag
      121 ggctttttta gaagaactat tcagaagaac ctccacccaa gctattcttg taaatatgaa
      181 ggaaaatgtg ttatagacaa agtaacaaga aaccagtgcc aagaatgtcg cttcaaaaag
      241 tgcatcgctg ttggaatggc aacagacttg gttttggatg acaacaaacg tttggcaaaa
      301 agaaagctca tagaagaaaa cagagaaaaa agacggaaag atgagattca gaaatcactt
      361 gttcagaaac ctgaacccac acaagaagaa tgggagttga tacaagttgt cactgaagca
      421 catgtggcca ccaacgcaca gggaagccac tggaaacaga aaagaaaatt tttgccagag
      481 gacattggac aagctcccat agttaatgcg cccgagggtg gaaaagtgga cttagaagcc
      541 ttcagccagt ttacaaaaat aatcacccca gcaattacaa gagttgttga ttttgccaaa
      601 aagctaccta tgttttgtga gctgccatgt gaagaccaga tcatccttct taaaggctgt
      661 tgtatggaga tcatgtcgct ccgagcagca gtgcgttatg accccgaaag tgaaactcta
      721 acgttaaatg gtgagatggc agtgacaagg gggcagctaa aaaatggagg acttggagtg
      781 gtttcagatg ccatctttga cttaggggta tcgctttctt cattcagtct tgatgatacc
      841 gaagtcgcct tgttgcaggc tgtgctgctt atgtcatcag atcggcctgg tcttgctagc
      901 gtggagagaa tagaaaagtg ccaggaaggt ttcctcttgg cttttgaaca ctacattaat
      961 tacaggaaac ataacattgc acacttttgg ccaaaactgc tgatgaaagt caccgacctc
     1021 cgcatgattg gagcgtgcca cgccagccgg ttcctgcaca tgaaggtgga gtgccccact
     1081 gaactgtttc ccccactgtt cttggaagtg tttgaggact agaacagact gtgcttctgg
     1141 attctcagca
//
LOCUS       XELBETAC     1132 bp ss-mRNA            VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta B1 mRNA, complete cds.
ACCESSION   M35361
KEYWORDS    thyroid hormone receptor beta B1 protein.
SOURCE      X.laevis, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1132)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
FEATURES       from  to/span     description
    pept          1     1122     thyroid hormone receptor beta B1 protein
BASE COUNT      352 a    233 c    284 g    263 t
ORIGIN      
        1 atgccaagca gtatgtcagg gtacataccc agctacttgg ataaagatga gctatgtgtg
       61 gtatgtggag acaaagctac agggtatcac tatagatgta tcacctgcga gggctgcaag
      121 ggctttttta gaagaactat tcagaagaac ctccacccaa gctattcctg taaatatgaa
      181 ggaaaatgtg ttatagacaa agtaacaagg aaccagtgcc aagaatgtcg cttcaaaaag
      241 tgcaaaactg ttggaatggc aacagacttg gttttggatg acagcaaacg tttggcgaaa
      301 agaaagctca tagaagaaaa cagagaaaaa agacggaaag acgagataca gaaatcaatt
      361 gttcagagac cggaaccaac acaagaagaa tgggagttga tacaagttgt cactgaagca
      421 catgtggcca ccaacgcaca gggaagccac tggaaacaga aaagaaaatt tttgccagag
      481 gacattggac aagctcccat agttaatgcg cctgaaggtg gaaaagtgga cttagaagcc
      541 ttcagccagt ttacaaaaat aatcacccca gcaattacaa gagtggttga ttttgccaaa
      601 aagctaccta tgttttgtga gctgccatgt gaagaccaga tcatccttct taaaggctgt
      661 tgtatggaga tcatgtccct ccgagcagcc gtgcggtatg accccgaaag tgaaactcta
      721 acgctgaatg gggagatggc agtgacaagg gggcagctaa aaaatggagg actcggtgtg
      781 gtctcagatg ccatctttga cttgggggtg tcgctttctt cattcagtct tgatgatacc
      841 gaagtcgcct tgttgcaggc tgtgctgctt atgtcatcag atcgtcctgg tctctctagt
      901 gtggagagaa tagaaaagtg ccaggaaggt ttcctcttgg cttttgaaca ctacattaat
      961 tacaggaaac acaacattgc acacttttgg ccaaaactgc tgatgaaagt caccgacctc
     1021 cgcatgatcg gagcatgcca cgccagccgg ttcctgcaca tgaaggtgga gtgccccact
     1081 gaactgtttc ccccactgtt cttggaagtg tttgaggact agaacagact gt
//
LOCUS       XELBETAD     1255 bp ss-mRNA            VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta B2 mRNA, complete cds.
ACCESSION   M35362
KEYWORDS    thyroid hormone receptor beta B2 protein.
SOURCE      X.laevis, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1255)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
FEATURES       from  to/span     description
    pept          1     1245     thyroid hormone receptor beta B2 protein
BASE COUNT      383 a    258 c    323 g    291 t
ORIGIN      
        1 atgccaagca gtatgtcagt tcggcttttc actgcatctg ccgcacaaag aaagaagata
       61 caggaagggg attgctgtgt ggtgctcgct ggaaaaaccc agggccggtt tatattgata
      121 ggagcagtgg cccgggtatc agggtacata cccagctact tggataaaga tgagctatgt
      181 gtggtatgtg gagacaaagc tacagggtat cactatagat gtatcacctg cgagggctgc
      241 aagggctttt ttagaagaac tattcagaag aacctccacc caagctattc ctgtaaatat
      301 gaaggaaaat gtgttataga caaagtaaca aggaaccagt gccaagaatg tcgcttcaaa
      361 aagtgcaaaa ctgttggaat ggcaacagac ttggttttgg atgacagcaa acgtttggcg
      421 aaaagaaagc tcatagaaga aaacagagaa aaaagacgga aagacgagat acagaaatca
      481 attgttcaga gaccggaacc aacacaagaa gaatgggagt tgatacaagt tgtcactgaa
      541 gcacatgtgg ccaccaacgc acagggaagc cactggaaac agaaaagaaa atttttgcca
      601 gaggacattg gacaagctcc catagttaat gcgcctgaag gtggaaaagt ggacttagaa
      661 gccttcagcc agtttacaaa aataatcacc ccagcaatta caagagtggt tgattttgcc
      721 aaaaagctac ctatgttttg tgagctgcca tgtgaagacc agatcatcct tcttaaaggc
      781 tgttgtatgg agatcatgtc cctccgagca gccgtgcggt atgaccccga aagtgaaact
      841 ctaacgctga atggggagat ggcagtgaca agggggcagc taaaaaatgg aggactcggt
      901 gtggtctcag atgccatctt tgacttgggg gtgtcgcttt cttcattcag tcttgatgat
      961 accgaagtcg ccttgttgca ggctgtgctg cttatgtcat cagatcgtcc tggtctctct
     1021 agtgtggaga gaatagaaaa gtgccaggaa ggtttcctct tggcttttga acactacatt
     1081 aattacagga aacacaacat tgcacacttt tggccaaaac tgctgatgaa agtcaccgac
     1141 ctccgcatga tcggagcatg ccacgccagc cggttcctgc acatgaaggt ggagtgcccc
     1201 actgaactgt ttcccccact gttcttggaa gtgtttgagg actagaacag actgt
//
LOCUS       XELBETB1      226 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta B gene, exon A.
ACCESSION   M35353
KEYWORDS    thyroid hormone receptor beta B protein.
SEGMENT     1 of 6
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 226)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            Exons D, E and H do not exist for the beta B thyroid hormone
            receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       57 a     50 c     73 g     46 t
ORIGIN      
        1 agcttcatta tcctgactga acacaagcag ggataacgct gggagtgact ggcatagcag
       61 gggctgcagg gaggcacttc ataatccgtg ccaaatccaa cgttgtagcg agtgacgaga
      121 atcgtagagt gcgcggaaca gtctcacgga cgctggggtt tgggaaggac gacgcggcag
      181 ctgttgcact acgttacgtc taactctata ggttggagag ctgacg
//
LOCUS       XELBETB2       65 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta B gene, exon B.
ACCESSION   M35354
KEYWORDS    thyroid hormone receptor beta B protein.
SEGMENT     2 of 6
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 65)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            Exons D, E and H do not exist for the beta B thyroid hormone
            receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT        7 a     19 c     16 g     23 t
ORIGIN      
        1 agctgtagat ctcctgtctg tgttgctgcc actgctgttg ctgctccagt tcctctgact
       61 gtcag
//
LOCUS       XELBETB3       50 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta B gene, exon C.
ACCESSION   M35355
KEYWORDS    thyroid hormone receptor beta B protein.
SEGMENT     3 of 6
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            Exons D, E and H do not exist for the beta B thyroid hormone
            receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       19 a      5 c     15 g     11 t
ORIGIN      
        1 atgttgaaga gtgattgggg ttaagcaggc acatactgta caagaaaaag
//
LOCUS       XELBETB4       67 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta B gene, exon F.
ACCESSION   M35356
KEYWORDS    thyroid hormone receptor beta B protein.
SEGMENT     4 of 6
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 67)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            Exons D, E and H do not exist for the beta B thyroid hormone
            receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       27 a     14 c     12 g     14 t
ORIGIN      
        1 ctacaggttt ccctcaagca ccaagaacga aaaccagaaa gaatttgcag agaatttttc
       61 aaatgag
//
LOCUS       XELBETB5       64 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta B gene, exon G.
ACCESSION   M35357
KEYWORDS    thyroid hormone receptor beta B protein.
SEGMENT     5 of 6
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 64)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            Exons D, E and H do not exist for the beta B thyroid hormone
            receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
FEATURES       from  to/span     description
    pept         46  >    64     thyroid hormone receptor beta B gene
BASE COUNT       21 a     12 c     14 g     17 t
ORIGIN      
        1 gttatatgtg atgcttagaa gaatgagcag accttccaat ccataatgcc aagcagtatg
       61 tcag
//
LOCUS       XELBETB6      123 bp ds-DNA             VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor beta B gene, exon H.
ACCESSION   M35358
KEYWORDS    thyroid hormone receptor beta B protein.
SEGMENT     6 of 6
SOURCE      X.laevis DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 123)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Yaoita, 15-JUN-1990.
            
            Exons D, E and H do not exist for the beta B thyroid hormone
            receptor protein.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
BASE COUNT       31 a     25 c     39 g     28 t
ORIGIN      
        1 ttcggctttt cactgcatct gccgcacaaa gaaagaagat acaggaaggg gattgctgtg
       61 tggtgctcgc tggaaaaacc cagggccggt ttatattgat aggagcagtg gcccgggtat
      121 cag
//
LOCUS       XELTHYA      1406 bp ss-mRNA            VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor alpha A mRNA, complete cds.
ACCESSION   M35343
KEYWORDS    thyroid hormone receptor protein.
SOURCE      X.laevis, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1406)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by Y.Yaoita, 15-JUN-1990.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
FEATURES       from  to/span     description
    pept        122     1378     thyroid hormone receptor protein
BASE COUNT      346 a    350 c    406 g    304 t
ORIGIN      
        1 gtcgacctgt gagaggcgtc cgcccgcctc catgtgaacg ctacgcccca tgatcctcgg
       61 ggagctgggg gcggagcccg ccttggtctc ttcggattgg ttctggatgg aattacgttg
      121 aatggaccag aatctcagcg ggctggactg cttgtcagag ccagatgaaa aaaggtggcc
      181 ggatgggaag cgaaaaagaa agaacagcca atgtatggga aaaagcggca tgtccggtga
      241 cagcttggtg tctctgccct ctgcagggta catccccagc tatctggaca aagatgagcc
      301 atgcgtggtg tgcagtgata aggccacggg gtaccactac cgctgtatca cttgcgaggg
      361 gtgtaagggt ttctttcgcc gcaccatcca gaagaacctg cacccctcct actcgtgcaa
      421 gtacgatggc tgctgcatta tcgacaagat cacccgaaat cagtgccagc tctgccgctt
      481 caagaaatgc attgccgttg gcatggcaat ggatcttgtc ctggatgatg gcaagcgggt
      541 agccaagcga aaactgattg aagagaatcg acagcggcgg cggaaggagg agatgatcaa
      601 gactctgcaa cagcgtcccg agccaagcag cgaggagtgg gagttgattc gcattgtaac
      661 agaagctcac aggagtacca atgctcaggg cagccactgg aaacagcgta ggaagtttct
      721 gccggaagat atcgggcagt ctcccatggc ttccatgccg gatggggata aagttgacct
      781 ggaagctttc agtgagttca ccaagataat caccccggca attaccagag tggtggactt
      841 tgccaagaag ctgcccatgt tctctgagct gacttgtgaa gaccagatca tcctgttgaa
      901 aggatgttgt atggagatca tgtctctccg tgctgctgta cgctacgatc cagacagcga
      961 gaccctaacg ctgagcggag agatggctgt gaaacgggag cagcttaaga acggaggtct
     1021 gggtgttgtc tctgatgcca tctttgacct cgggaggtcg cttgctgcgt ttaaccttga
     1081 cgatacggaa gtggcgctgc tgcaggctgt tttgctaatg tcatcagacc gaactggttt
     1141 aatctgcacg gacaagatag agaaatgtca agagacctac cttctcgcct ttgaacacta
     1201 catcaaccat cgcaaacaca acattcccca cttctggccc aaactcctaa tgaaggtgac
     1261 ggacctgcgc atgatagggg catgccatgc cagccgcttt ctgcacatga aggtcgagtg
     1321 ccccaccgag ctctttccac cgctcttcct tgaggtcttt gaggaccagg aagtttgagg
     1381 gacagtgcat gtcggtagag aggaaa
//
LOCUS       XELTHYB      1406 bp ss-mRNA            VRT       28-JUL-1990
DEFINITION  X.laevis thyroid hormone receptor alpha B mRNA, complete cds.
ACCESSION   M35344
KEYWORDS    thyroid hormone receptor protein.
SOURCE      X.laevis, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1406)
  AUTHORS   Yaoita,Y., Shi,Y.-B. and Brown,D.D.
  TITLE     The Xenopus laevis alpha and beta thyroid hormone receptors
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by Y.Yaoita, 15-JUN-1990.
            
              Author address:Yoshio Yaoita
              Carnegie Inst of Washington
              115 West University Parkway
              Baltimore, MD 21210
FEATURES       from  to/span     description
    pept        122     1378     thyroid hormone receptor protein
BASE COUNT      350 a    347 c    402 g    307 t
ORIGIN      
        1 gtcgacctgt gagaggcgcc cgcccgcctc catgtgaaag ccacgcccca tgagccttgg
       61 gcagctgggg gcggagccca ccttggtctc ttcggattgg ttctggatgg aattacgttg
      121 aatggaccag aatctcagcg ggctggactg cttgtcagag ccagatgaaa aaaggtggcc
      181 ggatgggaag cgaaaaagaa agaacagcca atgtatggga aaaagcggca tgtccggtga
      241 cagcttggtg tctctgcccc ctgcagggta catccccagc tatctggaca aagatgagcc
      301 atgcgtggtg tgcagtgata aggccacggg gtaccactac cgctgtatca cttgcgaggg
      361 gtgcaagggt ttcttccgcc gcaccatcca gaagaacctg cacccctcct attcttgcaa
      421 gtacgatggc tgctgcatta tcgacaaaat cacccgtaat cagtgccagc tctgccgctt
      481 caagaaatgc attgccgttg gcatggcaat ggatcttgac ctggatgata gcaagcgggt
      541 agccaagcga aaactgattg aagaggatcg agtgcggcgg cggaaggagg agatgatcaa
      601 gactctgcaa cagtgtcccg agccaagcag cgaggagtgg gagttgattc gcattgtaac
      661 agaagctcac aggagtacca atgcccaggg cagccattgg aaacagcgta ggaagtttct
      721 gccagaagac atcggacagt ctcctatggc ttccatgcca gatggggata aagttgacct
      781 ggaagctttc agtgagttca ccaaaataat caccccggca attaccagag tggtggactt
      841 tgcgaagaag ctgcccatgt tctctgagct gacttgtgaa gaccagatca tcctgttgaa
      901 aggatgttgt atggagatca tgtctcttcg tgctgctgtg cgctacgatc cagacagcga
      961 gaccctaacg ctgagcggcg agatggcggt gaaacgggag cagcttaaga acggaggtct
     1021 gggtgttgtc tctgatgcca tctttgacct tgggaggtcg cttgctgcgt tcaaccttga
     1081 tgatacggaa gtggcactgt tgcaggctgt tttgctaatg tcatcagacc gtactggttt
     1141 aatctgcaca gacaagatag agaaatgtca agagacctac cttctcgcct ttgaacacta
     1201 catcaaccat cgcaaacaca acattcccca cttctggccc aagctcctaa tgaaggtgac
     1261 ggacctgcgc atgatagggg catgccatgc cagctgcttt ctgcacatga aggtcgagtg
     1321 ccccaccgag ctctttccac cgctcttcct tgaggtcttt gaggaccagg aagtttgagg
     1381 gacagtgcat gtcggtagag aggaaa
//
LOCUS       RATSIMPA1     205 bp ds-DNA             ROD       28-JUL-1990
DEFINITION  Rat simple sequence DNA, clone 5.
ACCESSION   M36626
KEYWORDS    simple sequence DNA.
SEGMENT     1 of 2
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 205)
  AUTHORS   Ivanova,M.N., Frolova,E.I. and Georgiev,G.L.
  TITLE     Simple sequences of the rat genome detected by hybridization with
            adenovirus DNA
  JOURNAL   Dokl. Biochem. 276, 189-193 (1984)
  STANDARD  simple staff_entry
BASE COUNT       52 a     53 c     79 g     21 t
ORIGIN      
        1 cagctctgtc ctgttgtcgc ccttgggcag agttgtgcct cctgctcttc tttccctaag
       61 gaggggcagc agcagcagca gcaggaggag caggaggagc agcagcagga gcagcaggag
      121 cagcagcagc agcaggagga gcaggagcag cagcaggagc agcagcagca ggagcagcag
      181 cagcagcagg agcaggagga gcagc
//
LOCUS       RATSIMPA2     146 bp ds-DNA             ROD       28-JUL-1990
DEFINITION  Rat simple sequence DNA, clone 5.
ACCESSION   M32514
KEYWORDS    simple sequence DNA.
SEGMENT     2 of 2
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 146)
  AUTHORS   Ivanova,M.N., Frolova,E.I. and Georgiev,G.L.
  TITLE     Simple sequences of the rat genome detected by hybridization with
            adenovirus DNA
  JOURNAL   Dokl. Biochem. 276, 189-193 (1984)
  STANDARD  simple staff_entry
BASE COUNT       46 a     36 c     61 g      3 t
ORIGIN      About 500 base pairs after segment 1.
        1 agcagcagca gcaggagcag caggaggagc agcaggagca ggagcagcag gagcagcagc
       61 aggagcagga gcaggagcag caggagcagc aggagcagca gcaggagcag cagcagcagc
      121 agcagcagca gcggtgcagc tccatg
//
LOCUS       RATSIMPB      380 bp ds-DNA             ROD       28-JUL-1990
DEFINITION  Rat simple sequence DNA, clone 8.
ACCESSION   M32515
KEYWORDS    simple sequence DNA.
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 380)
  AUTHORS   Ivanova,M.N., Frolova,E.I. and Georgiev,G.L.
  TITLE     Simple sequences of the rat genome detected by hybridization with
            adenovirus DNA
  JOURNAL   Dokl. Biochem. 276, 189-193 (1984)
  STANDARD  simple staff_entry
BASE COUNT      131 a     96 c    138 g     15 t
ORIGIN      
        1 tgatcattgc tgcaatccca cagcaggagc agcagcagga gcagcagcag cagcaggagc
       61 aggagcagcc acaggaggag cagcaacaag aggcagcagc agcagcagga gcagcagcag
      121 caggaggagc agcaacagga gcagcagcaa caggagcagc agcaggaaca gaacaggagc
      181 agcagcagca ggaacaagga gtagcagcag cagcagcagg aacaggagaa gcagcagcag
      241 cagcagcagc aggagcagga gcagcaggag cagcagcagc agcagtagga gcagcagcag
      301 cagcaggagc agcagcagca gcagcaggag gagcagcagc agcagcagca cagcagcagg
      361 gtacttggtg atcccttgac
//
LOCUS       RATSIMPC      542 bp ds-DNA             ROD       28-JUL-1990
DEFINITION  Rat simple sequence DNA, clone 16.
ACCESSION   M36627
KEYWORDS    simple sequence DNA.
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 542)
  AUTHORS   Ivanova,M.N., Frolova,E.I. and Georgiev,G.L.
  TITLE     Simple sequences of the rat genome detected by hybridization with
            adenovirus DNA
  JOURNAL   Dokl. Biochem. 276, 189-193 (1984)
  STANDARD  simple staff_entry
BASE COUNT       76 a    255 c    142 g     69 t
ORIGIN      
        1 ggatccaccg cctgagtagc cgccgccaca gctagagccg cctccacctc caccgccgtc
       61 ggagtagccg cctccgcagc tggagccacc gccgccgccg ccgccggagt acttgccccc
      121 ttcggaccgc cgccgcgacc accgggctgc cgctccagag gagcctccgc agtaggagcc
      181 gccgcctcct gattcgtctc ctatagttgg agcctccgcc tccgtcggag tcgccgccgc
      241 cgccgtagcc ggagccgccg ccgccgccgc ccgcctccgg agtaccttga cgccgccgcc
      301 gccgccgccg gagtacttcg cccctccgga ccgccgccgc gaccagagaa ctgacgcccc
      361 ctccggagcc gcctccgccg ccgcagctgg aaccacctcc ataggaacca ccgcctccgc
      421 ctccgcctcc gcagccagag cctcctccag atgagccacc tccgcagctg ggagcctcca
      481 ccgctaccac caccgctata gtaaccgcca ccgccgcctc ctcctccacc agaggtcttt
      541 tc
//
LOCUS       RATPSTIAA    2382 bp ss-mRNA            ROD       28-JUL-1990
DEFINITION  Rat pancreatic secretory trypsin inhibitor-like protein (PSTI)
            mRNA, complete cds.
ACCESSION   M35299
KEYWORDS    monitor protein;
            pancreatic secretory trypsin inhibitor-like protein.
SOURCE      Rat (strain Wistar) adult pancreas, cDNA to mRNA, clone MP2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 427; 594 to 1693; 1954 to 2338)
  AUTHORS   Fukuoka,S.-I. and Scheele,G.
  TITLE     Rapid and selective cloning of monitor peptide, a novel
            CCK-releasing peptide, using minimal amino acid sequence and the
            polymerase chain reaction (PCR)
  JOURNAL   Pancreas 4, 1-7 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2382)
  AUTHORS   Fukuoka,S.-I. and Scheele,G.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.-I.Fukuoka, 19-JUN-1990.
FEATURES       from  to/span     description
    pept         10      249     pancreatic secretory trypsin inhibitor-like
                                 protein precursor
    sigp         10       63     pancreatic secretory trypsin inhibitor-like
                                 protein signal peptide
    matp         64      246     pancreatic secretory trypsin inhibitor-like
                                 protein
    signal     1665     1670     poly-A signal
    signal     2064     2069     poly-A signal
    signal     2303     2308     poly-A signal
BASE COUNT      770 a    423 c    517 g    672 t
ORIGIN      
        1 tctacaacca tgaaggtagc aattatcttt cttctcagtg ctttggccct gctcagttta
       61 gcaggtaacc ctccagctga ggtgaatgga aaaacgccta attgccctaa gcaaattatg
      121 ggatgtccca ggatttatga ccctgtgtgt gggactaacg gaattactta ccccagtgaa
      181 tgcagtctgt gctttgaaaa caggaaattc ggaacatcta tccacattca gaggagaggg
      241 acttgctgaa tgtcctgatt ttgaaatctt ttagggctac cataatgttt agcaagaagg
      301 tttgctgaat aaatgcatct gaacatattt tgttcttccc aaagcttttg ctcaaaggca
      361 tatatgagta tattgagaat agggatctga gaagaaaacc agagtagagc aagctttacc
      421 acttagttct tcatgctcat acttcaaaaa ttgcagatga tgacaacaca tagttgagca
      481 tgaacatgtg taatgaatag agtttgggtt aggatgaaga aggtagccta tctgtgcaca
      541 agaaagaagt agactgactt ggatctttct taggggagtt taccaaagga aagactgcct
      601 tgtatatcta cagtgtttca cttgtgagac accacaactc tgcagattta ctcttgttct
      661 gtgaggaaac ttagaagagt caaattgttt gactaatagt ccaacataca tgatgccagg
      721 gtgttctttt agatcaagct gacctcttcc ttcatccata tgagcactcc ttcttttaac
      781 cacaatcttc tcttgtggat catgccttga ctttcttcaa tgggaatcct agataatatt
      841 ccctactgta agatcttgca tgtctatatt cagtgataga atatagacgt gatataatag
      901 gatataacca aatgaattag aaacaaggaa atattctcaa aagggaaagt atcaacaact
      961 acttttaaaa aaggaatcat tttaagatcc tgagtttcta aagaaaatct tagtctaaga
     1021 tggaaagaga gtaaagagct aacacaggtg agtctgggca aggaacccta gtacagtggg
     1081 gttgggtcag cacctttgcc agaaataacc aagctattca gaaatacact aggaaaggag
     1141 agttgcctag taacccactt ctggtcatat tcagtattca tgccttgaac tgaactcttg
     1201 ctcctagagg atgctataac taacaaaccg agcaacttaa acagcctgac agctctcacc
     1261 aaataccttg ctatctcaag ttatggatgc aagatggctc ccagtgtcta tctgtgattc
     1321 tagaggacac ttgaagggca ccaacactta acaaattctg tgggggtaaa tttattttaa
     1381 tcactggatg ctggaagaca cacacagaga cacaaacaca caaagagaga cagagagaga
     1441 gaaagagaga gagagaggta gagagagaga gagagagaga gagggagaga gagggagaga
     1501 gagagtgttt tgggttttgt tgttgttgtt gttgttgatt tggaattata tcaagatata
     1561 agataatctc aaatgtatct ttagtagttc tgctccctgg acccatgaga agacaggaat
     1621 gaggattctg tgcatgtggt acttacattt caaaaggagt atctaataaa ctggaaactg
     1681 cttaaaagaa tgagactatc agcactgata agaatataaa gcttcaagct atgaagagtg
     1741 attcaaagaa ggaaaagaat tccctcagaa ctgggaggac cttttaaaaa attctgagtc
     1801 cccgtttcta aagtttcacc ttcctaactt catgtatttt ttaatagctc aaagagtcca
     1861 attactgctg ctcatatact catgagtgtg acaccatgca ctgttactgc caatatatga
     1921 aaggccatac ccctaaagaa aattgactta agaactcctt gtttagggtt gggtacttct
     1981 gtgaccctcc cacattcatg ctggaatgtt gactggcttc atttttataa ggcaaaagat
     2041 cttcccactc tcttctgaga gagaataaat cagttttgct caatggagtg attctgagta
     2101 tactaatcac gatcccagga caggccccat tctcacaagc agttagctaa cacaaataga
     2161 actccatatt ttatagcagt ttttatcttt tgttcttggt tttagttctt attttcaaga
     2221 cagagaaaaa cacatgaagt tggaagggta gaagtggggg ggggcgtggg tctgggagga
     2281 gttgggggat agagaaaaat ataataaaaa tatatgaaat tctcgagaat gaataaatgg
     2341 aattcgatat caagcttatc gataccgtcg acctcgaggg gg
//
LOCUS       RATPSTIBA     300 bp ss-mRNA            ROD       28-JUL-1990
DEFINITION  Rat pancreatic secretory trypsin inhibitor-like protein (PSTI)
            mRNA, 3' end.
ACCESSION   M35300
KEYWORDS    monitor protein;
            pancreatic secretory trypsin inhibitor-like protein.
SOURCE      Rat (strain Wistar) adult pancreas, cDNA to mRNA, clone MP3.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Fukuoka,S.-I. and Scheele,G.
  TITLE     Rapid and selective cloning of monitor peptide, a novel
            CCK-releasing peptide, using minimal amino acid sequence and the
            polymerase chain reaction (PCR)
  JOURNAL   Pancreas 4, 1-7 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.-I.Fukuoka, 19-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1      116     pancreatic secretory trypsin inhibitor-like
                                 protein precursor (AA at 3)
    matp    <     1      113     pancreatic secretory trypsin inhibitor-like
                                 protein
    signal      285      290     poly-A signal
BASE COUNT       98 a     57 c     65 g     80 t
ORIGIN      
        1 gtcccaggat ttatgaccct gtgtgtggga ctaacggaat tacttacccc agtgaatgca
       61 gtctgtgctt tgaaaacagg aaattcggaa catctatcca cattcagagg agatagagcg
      121 tctgcaaaaa cagatcgaac ggcataagaa gaagattaat acctaaagaa tagtgaggca
      181 ttgagtgcac acagtcagtc tctcacatag tggcagtatc attcccactc ttatagagat
      241 tgttttgaat gattgatgtt tgaccatgtg tgctactaac agataataaa ttatcaccag
//
LOCUS       SYNTRPA      3763 bp ds-DNA   circular  SYN       28-JUL-1990
DEFINITION  Cloning vector pATH3 propagated in E.coli.
ACCESSION   M33622
KEYWORDS    trpE' protein.
SOURCE      Synthetic DNA, clone pATH3.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 3763)
  AUTHORS   Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A.
  TITLE     High-expression vectors with multiple cloning sites for
            construction of trpE-fusion genes: pATH vectors
  JOURNAL   Meth. Enzymol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Hill, 06-APR-1990.
            
            Nucleotides 1-147 are provided as a personal communication from
            R.P.Gunsalus at the Dept. of Microbiology at UCLA.
            
            Construction of pATH3: 1.  PvuII-HindIII fragment from the 5' end
            of the trp operon (through nt 1999 of ECOTGP, which is in trpD cds)
            was ligated to the HindIII-PvuII fragment of pBR322 containing the
            bla (= Amp-resistance) gene and origin of replication, but not the
            rop gene, which encodes a negative regulator of ColE1 replication.
            In addition, the EcoRI site in the pBR322 backbone was eliminated.
            This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141
            (1984))
            
            2.  The BglII-HindIII fragment (nt 1392 of trpE to the end of the
            trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI
            fragment and an EcoRI-HindIII fragment, both from the MCS of
            M13mp12.  This plasmid is pATH1 (see GenBank acc M32985 for more
            details).
            
            3.  The SmaI-SmaI fragment from the MCS of pATH1 was deleted and
            the remaining plasmid religated.  This produced plasmid pATH2
            (GenBank acc M33624)
            
            4.  An EcoRI linker was inserted at the remaining SmaI site of
            pATH2 replacing the SmaI site and changing the reading frames of
            the other sites in the MCS.  This plasmid is pATH3.
FEATURES       from  to/span     description
    pept        423  >  1391     trpE' fusion protein
BASE COUNT      926 a    942 c    946 g    949 t
ORIGIN      
        1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg
       61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt
      121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg
      181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc
      241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt
      301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag
      361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa
      421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca
      481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat
      541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc
      601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc
      661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact
      721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct
      781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag
      841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag
      901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa
      961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc
     1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga
     1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc
     1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag
     1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg
     1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg
     1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc
     1381 agattgagat ccccccgaat tcggggggat cctctagagt cgacctgcag cccaagctta
     1441 tcgatgataa gctgtcaaac atgagaatta attcttgaag acgaaagggc ctcgtgatac
     1501 gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt
     1561 ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt
     1621 atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta
     1681 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg
     1741 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac
     1801 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg
     1861 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc
     1921 gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg
     1981 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat
     2041 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg
     2101 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg
     2161 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc
     2221 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt
     2281 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct
     2341 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc
     2401 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca
     2461 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct
     2521 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt
     2581 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga
     2641 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca
     2701 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac
     2761 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg
     2821 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag
     2881 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac
     2941 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt
     3001 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg
     3061 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc
     3121 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc
     3181 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc
     3241 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa
     3301 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt
     3361 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg
     3421 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag
     3481 agcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt
     3541 gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc
     3601 gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg
     3661 acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg
     3721 catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg agg
//
LOCUS       SYNTRPB      3771 bp ds-DNA   circular  SYN       28-JUL-1990
DEFINITION  Cloning vector pATH10, propagated in E.coli.
ACCESSION   M33623
KEYWORDS    beta-lactamase; trpE' protein.
SOURCE      Synthetic DNA, clone pATH10.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 3771)
  AUTHORS   Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A.
  TITLE     High-expression vectors with multiple cloning sites for
            construction of trpE-fusion genes path vectors
  JOURNAL   Meth. Enzymol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Hill 06-APR-1990.
            
            Nucleotides 1-147 are provided as a personal communication from
            R.P.Gunsalus at the Dept. of Microbiology at UCLA.
            
            Construction of pATH10: 1.  PvuII-HindIII fragment from the 5' end
            of the trp operon (through nt 1999 of ECOTGP, which is in the trpD
            cds) was ligated to the HindIII-PvuII fragment of pBR322 containing
            the bla (= Amp-resistance) gene and origin of replication, but not
            the rop gene, which encodes a negative regulator of ColE1
            replication.  In addition, the EcoRI site in the pBR322 backbone
            was eliminated.  This plasmid is pKRS101. (Spindler et al. M.
            Virol. 49, 132-141 (1984))
            
            2.  The BglII-HindIII fragment (nt 423 of trpE to the end of the
            trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI
            fragment and an EcoRI-HindIII fragment, both from the MCS of
            M13mp12.  This plasmid is pATH1 (see GenBank acc M32985 for more
            details).
            
            3.  The SmaI-SmaI fragment from the MCS of pATH1 was deleted and
            the remaining plasmid religated producing plasmid pATH2 (GenBank
            acc M33624).
            
            4.  An interim vector was constructed by inserting an EcoRI linker
            at the remaining SmaI site of pATH2.
            
            5.  The EcoRI-HindIII fragment of MCS in this interim vector was
            replaced with the EcoRI-HindIII fragment containing the MCS of
            M13mp12.
            
            6.  Tha AvaII-AvaII fragment that spanned the PstI site in the bla
            gene of this interim vector was replaced with the corresponding
            AvaII fragment from pUC8, eliminating this PstI site, making the
            PstI site in the MCS unique.  This is plasmid pATH10.
FEATURES       from  to/span     description
    pept        423     1472     trpE' protein
    pept       1688     2548     beta-lactamase
BASE COUNT      927 a    945 c    948 g    951 t
ORIGIN      
        1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg
       61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt
      121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg
      181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc
      241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt
      301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag
      361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa
      421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca
      481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat
      541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc
      601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc
      661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact
      721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct
      781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag
      841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag
      901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa
      961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc
     1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga
     1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc
     1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag
     1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg
     1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg
     1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc
     1381 agattgagat cccccggaat tcgagctcgc ccggggatcc tctagagtcg acctgcagcc
     1441 caagcttatc gatgataagc tgtcaaacat gagaattaat tcttgaagac gaaagggcct
     1501 cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg
     1561 tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc
     1621 aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag
     1681 gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg
     1741 ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt
     1801 gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt
     1861 tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt
     1921 attatcccgt gttgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa
     1981 tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag
     2041 agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac
     2101 aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac
     2161 tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac
     2221 cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac
     2281 tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact
     2341 tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg
     2401 tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt
     2461 tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat
     2521 aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta
     2581 gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa
     2641 tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga
     2701 aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac
     2761 aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt
     2821 tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc
     2881 gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat
     2941 cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag
     3001 acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc
     3061 cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc attgagaaag
     3121 cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac
     3181 aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg
     3241 gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct
     3301 atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc
     3361 tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga
     3421 gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga
     3481 agcggaagag cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg
     3541 catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agtatacact
     3601 ccgctatcgc tacgtgactg ggtcatggct gcgccccgac acccgccaac acccgctgac
     3661 gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc
     3721 gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag g
//
LOCUS       SYNTRPC      3753 bp ds-DNA             SYN       28-JUL-1990
DEFINITION  Cloning vector pATH2, propagated in E.coli.
ACCESSION   M33624
KEYWORDS    beta-lactamase; trpE' protein.
SOURCE      Synthetic DNA, clone pATH2.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 3753)
  AUTHORS   Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A.
  TITLE     High-expression vectors with multiple cloning sites for
            construction of trpe-fusion genes path vectors
  JOURNAL   Meth. Enzymol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Hill 06-APR-1990.
            
            Nucleotides 1-147 are provided as a personal communication from
            R.P.Gunsalus at the Dept. of Microbiology at UCLA.
            
            Construction of pATH2: 1.  PvuII-HindIII fragment from the 5' end
            of the trp operon (through nt 1999 of ECOTGP, which is in trpD cds)
            was ligated to the HindIII-PvuII fragment of pBR322 containing the
            bla (= Amp-resistance) gene and origin of replication, but not the
            rop gene, which encodes a negative regulator of ColE1 replication.
            In addition, the EcoRI site in the pBR322 backbone was eliminated.
            This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141
            (1984))
            
            2.  The BglII-HindIII fragment (nt 1392 of trpE to the end of the
            trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI
            fragment and an EcoRI-HindIII fragment, both from the MCS of
            M13mp12.  This plasmid is pATH1 (see GenBank acc M32985 for more
            details).
            
            3.  The SmaI-SmaI fragment from the MCS of pATH1 was deleted and
            the remaining plasmid religated.  This produced plasmid pATH2
FEATURES       from  to/span     description
    pept        423     1454     trpE' protein
    pept       1670     2530     beta-lactamase
BASE COUNT      924 a    939 c    943 g    947 t
ORIGIN      
        1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg
       61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt
      121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg
      181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc
      241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt
      301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag
      361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa
      421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca
      481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat
      541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc
      601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc
      661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact
      721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct
      781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag
      841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag
      901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa
      961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc
     1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga
     1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc
     1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag
     1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg
     1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg
     1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc
     1381 agattgagat ccccggggat cctctagagt cgacctgcag cccaagctta tcgatgataa
     1441 gctgtcaaac atgagaatta attcttgaag acgaaagggc ctcgtgatac gcctattttt
     1501 ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa
     1561 tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat
     1621 gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca
     1681 acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca
     1741 cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta
     1801 catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt
     1861 tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtgttgacgc
     1921 cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc
     1981 accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc
     2041 cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa
     2101 ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga
     2161 accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgcagcaat
     2221 ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca
     2281 attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc
     2341 ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat
     2401 tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag
     2461 tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa
     2521 gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca
     2581 tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc
     2641 ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc
     2701 ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc
     2761 agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt
     2821 cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt
     2881 caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc
     2941 tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa
     3001 ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac
     3061 ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc ttcccgaagg
     3121 gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga
     3181 gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact
     3241 tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa
     3301 cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc
     3361 gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg
     3421 ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcctgat
     3481 gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag
     3541 tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc gctacgtgac
     3601 tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg acgggcttgt
     3661 ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag
     3721 aggttttcac cgtcatcacc gaaacgcgcg agg
//
LOCUS       SYNTRPD      3772 bp ds-DNA             SYN       28-JUL-1990
DEFINITION  Cloning vector pATH11, propagated in E.coli.
ACCESSION   M33625
KEYWORDS    beta-lactamase; trpE' protein.
SOURCE      Synthetic DNA, clone pATH11.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 3772)
  AUTHORS   Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A.
  TITLE     High-expression vectors with multiple cloning sites for
            construction of trpe-fusion genes path vectors
  JOURNAL   Meth. Enzymol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.E.Hill 06-APR-1990.
            
            Nucleotides 1-144 are provided as a personal communication from
            R.P.Gunsalus at the Dept. of Microbiology at UCLA.
            
            Construction of pATH11: 1.  PvuII-HindIII fragment from the 5' end
            of the trp operon (through nt 1999 of ECOTGP, which is in the trpD
            cds) was ligated to the HindIII-PvuII fragment of pBR322 containing
            the bla (= Amp-resistance) gene and origin of replication, but not
            the rop gene, which encodes a negative regulator of ColE1
            replication.  In addition, the EcoRI site in the pBR322 backbone
            was eliminated.  This plasmid is pKRS101. (Spindler et al. M.
            Virol. 49, 132-141 (1984))
            
            2.  The BglII-HindIII fragment (nt 423 of trpE to the end of the
            trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI
            fragment and an EcoRI-HindIII fragment, both from the MCS of
            M13mp12.  This plasmid is pATH1 (see GenBank acc M32985 for more
            details).
            
            3.  The SmaI-SmaI fragment from the MCS of pATH1 was deleted and
            the remaining plasmid religated.  This produced plasmid pATH2
            (GenBank acc M33624).
            
            4.  An interim vector was constructed by inserting an EcoRI linker
            at the remaining SmaI site of pATH2.
            
            5.  The EcoRI-HindIII fragment of MCS in this interim vector was
            replaced with the EcoRI-HindIII fragment containing the MCS of
            M13mp12.
            
            6.  Tha AvaII-AvaII fragment that spanned the PstI site in the bla
            gene of this interim vector was replaced with the corresponding
            AvaII fragment from pUC8, eliminating this PstI site, making the
            PstI site in the MCS unique.  This is plasmid pATH11.
FEATURES       from  to/span     description
    pept        423     1487     trpE' protein
    pept       1689     2549     beta-lactamase
BASE COUNT      927 a    946 c    948 g    951 t
ORIGIN      
        1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg
       61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt
      121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg
      181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc
      241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt
      301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag
      361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa
      421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca
      481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat
      541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc
      601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc
      661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact
      721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct
      781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag
      841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag
      901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa
      961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc
     1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga
     1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc
     1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag
     1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg
     1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg
     1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc
     1381 agattgagat ccccccggaa ttcgagctcg cccggggatc ctctagagtc gacctgcagc
     1441 ccaagcttat cgatgataag ctgtcaaaca tgagaattaa ttcttgaaga cgaaagggcc
     1501 tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag
     1561 gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt
     1621 caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa
     1681 ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt
     1741 gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt
     1801 tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt
     1861 ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg
     1921 tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga
     1981 atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa
     2041 gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga
     2101 caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa
     2161 ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca
     2221 ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta
     2281 ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac
     2341 ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc
     2401 gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag
     2461 ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga
     2521 taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt
     2581 agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata
     2641 atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag
     2701 aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa
     2761 caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt
     2821 ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc
     2881 cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa
     2941 tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa
     3001 gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc
     3061 ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag cattgagaaa
     3121 gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa
     3181 caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg
     3241 ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc
     3301 tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg
     3361 ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg
     3421 agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg
     3481 aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc
     3541 gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagtatacac
     3601 tccgctatcg ctacgtgact gggtcatggc tgcgccccga cacccgccaa cacccgctga
     3661 cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc
     3721 cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gg
//
LOCUS       ECOTRMF        77 bp ss-tRNA            RNA       28-JUL-1990
DEFINITION  E. coli initiator Met-tRNA-f.
ACCESSION   K00305 M25117
KEYWORDS    transfer RNA; transfer RNA-Met.
SOURCE      E. coli (strain CA265) tRNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 77)
  AUTHORS   Dube,S.K. and Marcker,K.A.
  TITLE     The nucleotide sequence of N-formyl-methionyl-transfer RNA: Partial
            digestion with pancreatic and T-1 ribonuclease and derivation of
            the total primary structure
  JOURNAL   Eur. J. Biochem. 8, 256-262 (1969)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 77)
  AUTHORS   Uemura,H., Imai,M., Ohtsuka,E., Ikehara,M. and Soell,D.
  TITLE     E. coli initiator tRNA analogs with different nucleotides in the
            discriminator base position
  JOURNAL   Nucleic Acids Res. 10, 6531-6539 (1982)
  STANDARD  full staff_review
REFERENCE   3  (sites)
  AUTHORS   Dahlberg,J.E., Kintner,C. and Lund,E.
  TITLE     Specific binding of tRNA-Met-f to 23S rRNA of Escherichia coli
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75, 1071-1075 (1978)
  STANDARD  simple staff_entry
COMMENT     [1] Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss from
            their entry 1310 in Nucleic Acids Res. 11, r1-r54 (1983).
            
            [1] notes that there may be either another minor Met-tRNA-f or a
            modification of this sequence, because there is a small amount of
            an oligonucleotide which shows that base 47 is adenosine instead of
            m7g.
            
            [2] generated all possible substitutions at the fourth base up from
            the 3' end (position 74); all four variants accepted methionine in
            in-vitro aminoacylation reactions, implying that the "discriminator
            hypothesis" is incorrect.
FEATURES       from  to/span     description
    tRNA          1       77     Met-tRNA (NAR: 1310)
    modified      8        8     s4u
    modified     21       21     d
    modified     33       33     cm
    anticdn      35       37     Met-tRNA-f anticodon cat
    modified     47       47     m7g
    modified     55       55     t
    modified     56       56     f
BASE COUNT       14 a     26 c     25 g     12 t
ORIGIN      5' end of mature Met-tRNA-f.
        1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggtcg tcggttcaaa
       61 tccggccccc gcaacca
//
LOCUS       MCPRNA3A       80 bp ss-RNA             VRL       28-JUL-1990
DEFINITION  Cowpea mosaic virus M RNA 3' terminal sequence.
ACCESSION   M25438
KEYWORDS    .
SOURCE      Cowpea mosaic virus RNA.
  ORGANISM  Cowpea mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Comoviridae.
REFERENCE   1  (bases 1 to 80)
  AUTHORS   Davies,J.W., Stanley,J. and Van Kammen,A.
  TITLE     Sequence homology adjacent to the 3' terminal poly(A) of cowpea
            mosaic virus RNAs
  JOURNAL   Nucleic Acids Res. 7, 493-500 (1979)
  STANDARD  simple staff_entry
BASE COUNT       22 a     13 c     13 g     32 t
ORIGIN      
        1 tatgaattta atttcttttg tgagctcctg tttagcaggt cgtcccttca gcaaggacac
       61 aaaaagattt taattttatt
//
LOCUS       MCPRNA3B       80 bp ss-RNA             VRL       28-JUL-1990
DEFINITION  Cowpea mosaic virus B RNA 3' terminal sequence.
ACCESSION   M25439
KEYWORDS    .
SOURCE      Cowpea mosaic virus RNA.
  ORGANISM  Cowpea mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Comoviridae.
REFERENCE   1  (bases 1 to 80)
  AUTHORS   Davies,J.W., Stanley,J. and Van Kammen,A.
  TITLE     Sequence homology adjacent to the 3' terminal poly(A) of cowpea
            mosaic virus RNAs
  JOURNAL   Nucleic Acids Res. 7, 493-500 (1979)
  STANDARD  simple staff_entry
BASE COUNT       22 a     12 c     13 g     33 t
ORIGIN      
        1 taaataatgc ttatgttttt gtttgctcct gtttagcagg tcgttccttc agcaagaaca
       61 acaaaaatat gtgttttatt
//
LOCUS       PPCCGAAA     5306 bp ds-DNA             VRL       28-JUL-1990
DEFINITION  Hamster papovavirus complete genome.
ACCESSION   M26281
KEYWORDS    complete genome.
SOURCE      Hamster papovavirus DNA.
  ORGANISM  Hamster papovavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Papillomavirus.
REFERENCE   1  (bases 1 to 5306)
  AUTHORS   Delmas,V., Bastien,C., Scherneck,S. and Feunteun,J.
  TITLE     A new member of the polyomavirus family: The hamster papovavirus.
            Complete nucleotide sequence and transformation properties
  JOURNAL   EMBO J. 4, 1279-1286 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    ORF         192      839     early proteins polyprotein (T antigens)
    ORF        5083     4046 (c) VP2
    ORF        4711     4046 (c) VP2
    ORF        4045     2927 (c) VP1
BASE COUNT     1595 a   1124 c   1080 g   1507 t
ORIGIN      
        1 ccccttgcct ccttagctct caagtagaaa aggaagagag gcttttgggg ctttttggct
       61 ttaagcctca ttttatgagc aggaggagct tgttgcaact tgagaggcgt tttgaggctt
      121 ccaggcagag aatactcaca gaccccacac agtctagacg ctcagaagca tctctagctg
      181 caacaagcaa gatggataga attcttacta aagaagaaaa gcaagcctta ataagtttac
      241 tagatttgga gccacaatat tggggagact atggacgaat gcagaaatgc tacaagaaaa
      301 agtgtcttca actgcatcct gataaaggtg gcaatgaaga gctcatgcaa cagcttaata
      361 ccctgtggac caaactaaaa gatggtcttt acagagttag gctgttactt gggcctagtc
      421 aggtaagaag acttggaaaa gatcagtgga atttatcttt acagcaaaca ttttctggta
      481 cctactttag gaggctctgc agactcccca ttacctgcct aagaaacaag ggaattagta
      541 cctgcaattg catactttgt ttgctcagaa aacagcattt tctgctaaag aagtcctgga
      601 gagtaccttg cctggtgtta ggagaatgct actgcataga ctgctttgcc ttatggtttg
      661 gcctgccagt taccaatatg ctggttccat tatatgcaca atttcttgct ccaatacctg
      721 tggattggct tgatctgaat gttcatgagg tctacaatcc ggcctcaggt atgtatgaat
      781 atggggggct tatagttgta actgtacaag tttaaaatgt gcttttttca ggaccctaat
      841 gcttccacct ccaccagcag acccggagag ttctacaatc ctgacacagg aggatactgg
      901 tcctactctt atgggtcagc aggatactct gaccagcaga agaaatactg ggaagagttt
      961 ttctctaagt gggatgttaa tgaggacctc acctgccaag aagagttatc atcatcagaa
     1021 gatgaattca ccccctggca tcccaatccc cccccctccc ctgtttctat ttccagtgac
     1081 agctccagtt cctcctgtga cgaggaatac ccaagaaact caagcagaaa gagaaaacga
     1141 gtacatgcca atggctcccc aaatacacct atacagccaa ataagagagc ccacacacca
     1201 ggaggaggaa gaaccacaat acgaggagat accgatatac ctagaactcc tgccagagaa
     1261 tcccaatcaa catttggctc ttacttcaac agcacggagg agcttgagga ggaaatatca
     1321 caaacacaac agtcacatca taacacaacg ccaaagaaac cgcctccgac ggttagtcct
     1381 gatgattttc ctactatcct tagggggttt ctttctcacg ctattttttc taataaaacg
     1441 caaaatgcat ttataatcta cagtactaag gaaaaatgtg aagtacttta tgaacaaata
     1501 gacaaatata atccagacta taaaggtatc ttcattatga aacaaacaga agcatttgta
     1561 atgtttatga ctcctggaaa acatagagta gctgcagtta aaagttactg ttgtaaattt
     1621 tgtaccgtta gcttcctgct atgcaaagct gttacaaaac cgttagagtt gtataactgt
     1681 gtggctaaat gtgatgactt tcaaatttta aaagaaaata agcctggtct atatcatttt
     1741 gaattctgtg atgaaaaaaa agaggtgaag caaatagact ggaatttcct aacatctttt
     1801 gcagttgaaa atgagttaga tgatcctctt gtaattatgg gacattatct agaatttagt
     1861 cagtgtgaaa gctcttgcaa aaagtgtgca gaagctttac caaggatgaa agtccactgg
     1921 gctaaccaca gtcagcactt agagaatgct gagcttttct tacactgcaa acaacagaaa
     1981 agtatctgtc agcaagcagc agataatgtt ctggcaagga gaagattaaa ggtccttgaa
     2041 tcaacaagac aagaattgtt ggcagagaga ctgaacaaac tgttagacca attaaaagat
     2101 ttatctcctg tagataagca tttatatctt gctggagtag cctggtacca atgtatgttt
     2161 cctgattttg agatgatgtt attagatatt ttaaaattgt ttactgaaaa tgttccaaaa
     2221 aaaagaaatg tactttttag aggtcctgta aattcaggga aaactagcct tgctgcagct
     2281 atcatgaatc ttgtaggagg agttgccctc aatgttaatt gtcctgcaga taagctcaac
     2341 tttgaacttg gtgttgctat agataaattt gcagtagtct ttgaagatgt caaaggacaa
     2401 accggagata agagacacct acagtctgga cttggaatta ataaccttga taacctgaga
     2461 gattaccttg atggaagtgt aaaggttaat ttagaaaaga agcatgtaaa taagaggtcc
     2521 cagatatttc ctccttgtat tgttactgct aatgaatatt tttttcctca aacactctat
     2581 gccagattcc ataaagttta taactttgaa gtgaaggatt ttcttgccaa gagccttgag
     2641 gaaaacagtt acatggggag acatagagtc tgtcaaagtc cacttacaat gctgatagca
     2701 ttgctttgga atgtacccac tgaaaatttt gataagtctc tcaaagagaa ggtggaaaca
     2761 gaaaagaagg ttttgtctga tatgtgtaac tttactacat ttgcagaaat gtgtctcaat
     2821 attcagaggg gtgctgatcc ccttgaggca ttgtaattga ggaggaaaca ataattgatg
     2881 aataaagcat ttattagaag ctctgtgtac agtcattttt caagcattag tttgctggtt
     2941 ttgcaggggg tttagtatgc tgttggccat acttgtcaat gaacctattc acatctgggt
     3001 caccaggaac agcctctgta ccctcataaa tcctgacttc ttctacctga gcagcttctc
     3061 cttccatggg ctggccttca attgttggaa gcatattgtt gtacaaagaa gctagcaagc
     3121 ttgtaactgg gtaaggattt ttcacccatc tttttctcaa ggtcacatta aaatatctag
     3181 gcagccccct ccaatgccag cctgcactgt tgtattctat gtaccagccc ataacatctg
     3241 ctgcactgag ataaagccca tctcctttgc aaagaggccc aaccccattt tcatccagaa
     3301 gcacagtagt caaggtatta gtaaactgca tcactggtgg agtaccagta ccacctgtga
     3361 ggtacctacc atccttgtcc aattttgctt ttgcagtagg gtccagcacc tggtttgtgg
     3421 aagtcattgc tttgccagta acagttttga tactaacaat agctgcctca taatttgcat
     3481 tatagttctg cactaggcct tgcaaatcta atggttctcc tcccactgca aacatgtggt
     3541 aagttgtacc ctcaactggt ttggaaattc caatatcctt tgtctcactt ctggagccat
     3601 atccatgcac atttagaagg gatcccactc caacaacttc agtttttaca gatacagcct
     3661 cccacatttg aagggtatca caggtcaaat cttcattcag tgttggaagc tgtattttag
     3721 ccatactgta atatggcagt tgattagcct tcacttcatc agcagtaagg gagctattta
     3781 ctttaatact ctgggagaac ccataatact ggccatcagt tcctgtgcca ggcttgttct
     3841 gacccattct aggattaagg taggcctcaa tttgtgtgat actgtcttct cctgttacaa
     3901 gatcaagcac acccacacca ccccgcataa taagcttggg aacattagca ggctttggac
     3961 agggtttcca caggggtttg cacatctact ggaagcgccg ctttttcttt ttggggccat
     4021 actcaacctc atcaatgtat gtctgccaag taggactaat gtctccgtac aatcctagaa
     4081 ttaaaggaag catccaatca ggtgtcactc tttggtgggc tccaccagga gcaaaatacc
     4141 tcatgatatt tgcccctgat tcaaaccaac tagaagagtc ttcctgctgt tgacttcttc
     4201 tctggacatc aggtctccct agttcagctt ctaatacttg tctgctattg gcatcttcaa
     4261 tagaaggtct attactgtat tctaaagctc tttctatttg tcttctttga gctggattaa
     4321 ttcctggaag ttctgcatag tagttttgta ggccaccata tattctacta taggcctctc
     4381 taggtaaatt agtaacaacc catctactat tttccatcat tctggcaatg gcatctaaaa
     4441 attgatgggt ggtctgcaaa cttaagtccc tcacagcaga ttctacagct ccttgagttt
     4501 cccttctcaa agtatcccag atatactctc ctacagattg aaataatgag tggccccagc
     4561 catgaataac atccaaggca tgggtaaatg actgtacacc agggaataat atatcatagt
     4621 agtcagctgg tctcctggga ataagtgcca tgtttctatt cacaatcggt acttcgtgag
     4681 caagatagcc gtgtagactt cccaaagaga aggctgctga acctgcaaca gtttgaaaaa
     4741 taaatgctgt ctggactgat tctctcacaa actcagtcat tactgttgat gttaactcag
     4801 gggcagcttg cataaatata aacatgtctt cacttaggcc aattgaagac aaagctgtct
     4861 cagctcctaa aaacccctcc atagttatta atgaagtaac ttgggcatct atcgcggcaa
     4921 aggcttctcc actaagtatg gcctctactg aaattccagt aactgatgaa atttcggaga
     4981 ggtagctgat catctcaata atcactgaaa tggcagatcc catgttgact tacttgaaca
     5041 gtttgaaaat cttctgaact gtttcaggca ggtttttagg ccgaattcta aagaaacaga
     5101 aagcaaacac tcagcgccga agagcaggaa atggctgacc actgcacttg ggcgacacga
     5161 cacgcctagc gataaggaag tcaccatggc aacataaccg cagcactgct gttgtcacag
     5221 ttgcctagca aatgacagac tcagcaacca caggagagga aatgataggg ctagcatttt
     5281 ttcaaatgta aaccagaggc tagggg
//
LOCUS       RATGST2YB     500 bp ss-mRNA            ROD       28-JUL-1990
DEFINITION  Rat liver glutathione S-transferase Ya subunit mRNA.
ACCESSION   M26874
KEYWORDS    S-transferase; glutathione S-transferase; ligandin; transferase.
SOURCE      Rat liver cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 500)
  AUTHORS   Daniel,V., Sarid,S., Bar-Nun,S. and Litwack,G.
  TITLE     Rat ligandin mRNA molecular cloning and sequencing
  JOURNAL   Arch. Biochem. Biophys. 227, 266-271 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      402     glutathione S-transferase Ya subunit (AA at 1)
BASE COUNT      153 a    112 c    109 g    126 t
ORIGIN      
        1 gccctgattg acatgtattc agagggtatt ttagatctga ctgaaatgat tatccaattg
       61 gtaatatgtc ccccagacca aagagaagcc aagaccgcct tggcaaaaga caggaccaaa
      121 aaccggtact tgcctgcctt tgaaaaggtg ttgaagagcc atggccaaga ctaccttgta
      181 ggtaacaggc tgacccgggt agacatccac ctgctggaac ttctcctcta tgttgaagag
      241 tttgatgcca gccttctgac ctctttccct ctgctgaagg ccttcaagag cagaatcagc
      301 agcctcccca atgtgaagaa gttcctgcag cctggcagtc agagaaagct tcccgtggat
      361 gcaaaacaaa tcgaagaagc aaggaagatt ttcaagtttt agcggagctg cactatccaa
      421 tttctttatg ttttgcaaaa aatgagaagc aattgttgat cctaggtatt tttgaaataa
      481 taaacacgaa aaaatactct
//
LOCUS       CPARBCSL      528 bp ds-DNA             PLN       28-JUL-1990
DEFINITION  C.paradoxa ribulose-1,5-bisphosphate carboxylase/oxygenase
            large (rbcL) and small (rbcS) subunits, 3' end and complete cds.
ACCESSION   M35728
KEYWORDS    ribulose-1,5-bisphosphate carboxylase/oxygenase.
SOURCE      C.paradoxa DNA.
  ORGANISM  Cyanophora paradoxa
            Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae;
            Cryptomonadales; Kathablepharidaceae.
REFERENCE   1  (bases 1 to 528)
  AUTHORS   Starnes,S.M., Lambert,D.H., Maxwell,E.S., Stevens,S.E.Jr.,
            Porter,R.D. and Shively,J.M.
  TITLE     Cotranscription of the large and small subunit genes of ribulose-
            1,5-bisphosphate carboxylase/oxygenase in Cyanophora paradoxa
  JOURNAL   FEMS Microbiol. Lett. 28, 165-169 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       18     ribulose-1,5-bisphosphate
                                 carboxylase/oxygenase large subunit (rbcL)
    pept        124      444     ribulose-1,5-bisphosphate (AA at 1)
                                 carboxylase/oxygenase small subunit (rbcS)
BASE COUNT      189 a     85 c     60 g    194 t
ORIGIN      
        1 actattgata ctatctaata tcatttaatt tatttaatta tttagagttt aaaactctaa
       61 ataattaatc aaaatgatat tacttcaatc tatttttacc ttaaaattcg gaattataaa
      121 taaatgcaac ttagagtaga acgtaagttc gaaacttttt cttatttacc accattaaac
      181 gaccaacaga ttgcgcgtca attacaatac gcactttcca atggttatag cccagcaatc
      241 gaattcagtt ttacaggtaa agctgaagac ttagtatgga ctttatggaa attaccttta
      301 tttggtgcac aatctcctga agaagtactt agcgaaattc aagcttgtaa acaacagttc
      361 cctaatgctt acattcgtgt tgtagcattt gactctatca gacaagttca aactttaatg
      421 ttcttagttt acaaaccatt atagtttaat tgatatctac tctaattgat agatatcaat
      481 ttttaattaa tctacaaaac aaaattatct aattattatt aatacttt
//
LOCUS       HUMCFIX       873 bp ss-mRNA            PRI       28-JUL-1990
DEFINITION  Human coagulation factor IX mRNA, partial cds.
ACCESSION   M35672
KEYWORDS    coagulation factor IX; serine protease.
SOURCE      Human adult liver, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 873)
  AUTHORS   Jagadeeswaran,P., Lavelle,D.E., Kaul,R., Mohandas,T. and
            Warren,S.T.
  TITLE     Isolation and characteriztion of human factor IX cDNA:
            Identification of Taq I polymorphism and regional assignment
  JOURNAL   Somat. Cell Mol. Genet. 10, 465-473 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   873     coagulation factor IX (AA at 1)
BASE COUNT      279 a    146 c    205 g    243 t
ORIGIN      
        1 aacgccaaca aaattctgaa tcggccaaag aggtataatt caggtaaatt ggaagagttt
       61 gttcaaggga accttgagag agaatgtatg gaagaaaagt gtagttttga agaagcacga
      121 gaagtttttg aaaacactga aagaacaact gaattttgga agcagtatgt tgatggagat
      181 cagtgtgagt ccaatccatg tttaaatggc ggcagttgca aggatgacat taattcctat
      241 gaatgttggt gtccctttgg atttgaagga aagaactgtg aattagatgt aacatgtaac
      301 attaagaatg gcagatgcga gcagttttgt aaaaatagtg ctgataacaa ggtggtttgc
      361 tcctgtactg agggatatcg acttgcagaa aaccagaagt cctgtgaacc agcagtgcca
      421 tttccatgtg gaagagtttc tgtttcacaa acttctaagc tcacccgtgc tgagactgtt
      481 tttcctgatg tggactatgt aaattctact gaagctgaaa ccattttgga taacatcact
      541 caaagcaccc aatcatttaa tgacttcact cgggttgttg gtggagaaga tgccaaacca
      601 ggtcaattcc cttggcaggt tgttttgaat ggtaaagttg atgcattctg tggaggctct
      661 atcgttaatg aaaaatggat tgtaactgct gcccactgtg ttgaaactgg tgttaaaatt
      721 acagttgtcg caggtgaaca taatattgag gagacagaac atacagagca aaagcgaaat
      781 gtgattcgaa ttattcctca ccacaactac aatgcagcta ttaataagta caaccatgac
      841 attgcccttc tggaactgga cgaaccctta gtg
//
LOCUS       HUMMHDRBPV    292 bp ds-DNA             PRI       28-JUL-1990
DEFINITION  Human MHC class II HLA-DR-beta-I allele gene, partial cds.
ACCESSION   M35651
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human (Pemphigus vulgaris patient, haplotype DR4 Dw10) blood DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 292)
  AUTHORS   Scharf,S.J., Long,C.M. and Erlich,H.A.
  TITLE     Sequence analysis of the HLA-Dr-beta and HLA-DQ-beta loci from
            three Pemphigus vulgaris patients
  JOURNAL   Hum. Immunol. 22, 61-69 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    26  >   292     HLA-DR-beta, exon 2 (AA at 26)
BASE COUNT       64 a     74 c    101 g     53 t
ORIGIN      
        1 ccggatcctt cgtgtcccca gaccacgttt cttggagcag gttaaacatg agtgtcattt
       61 cttcaacggg acggagcggg tgcggttcct ggacagatac ttctatcacc aagaggagta
      121 cgtgcgcttc gacagcgacg tgggggagta ccgggcggtg acggagctgg ggcggcctga
      181 tgccgagtac tggaacagcc agaaggacat cctggaagac gagcgggccg cggtggacac
      241 ctactgcaga cacaactacg gggttgtgga gagcttcaca gtgcagcggc ga
//
LOCUS       MUSC3B        647 bp ss-mRNA            ROD       28-JUL-1990
DEFINITION  Mouse complement component 3 (C3) mRNA, partial cds.
ACCESSION   M35659
KEYWORDS    complement component 3.
SOURCE      Mouse liver, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 647)
  AUTHORS   Fey,G., Domdey,H., Wiebauer,K., Whitehead,A.S. and Odink,K.
  TITLE     Structure and expression of the C3 gene
  JOURNAL   Springer Semin. Immunopathol. 6, 119-147 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   647     complement 3 (AA at 1)
BASE COUNT      170 a    171 c    168 g    138 t
ORIGIN      
        1 atccccatgt attccatcat tactcccaat gtcctacggc tggagagcga agagaccatc
       61 gtactggagg cccacgatgc tcagggtgac atcccagtca cagtcactgt gcaagacttc
      121 ctaaagaggc aagtgctgac cagtgagaag acagtgttga caggagccag tggacatctg
      181 agaagcgtct ccatcaagat tccagccagt aaggaattca actcagataa ggaggggcac
      241 aagtacgtga cagtggtggc aaacttcggg gaaacggtgg tggagaaagc agtgatggta
      301 agcttccaga gtgggtacct cttcatccag acagaccaga ccatctacac ccccggctcc
      361 actgtcttat atcggatctt cactgtggac aacaacctac tgcccgtggg caagacagtc
      421 gtcatcctca ttgagacccc cgatggcatt cctgtcaaga gagacattct gtcttccaac
      481 aaccaacacg gcatcttgcc tttgtcttgg aacattcctg aactggtcaa catggggcag
      541 tggaagatcc gagcctttta cgaacatgcg ccgaagcaga tcttctccgc agagtttgag
      601 gtgaaggaat acgtgctgcc cagttttgag gtccgggtgg agcccac
//
LOCUS       P30LTA        777 bp ds-DNA             BCT       28-JUL-1990
DEFINITION  Plasmid P307 (from E.coli) heat-labile enterotoxin subunit A (LTA)
            gene, complete cds.
ACCESSION   M35581
KEYWORDS    enterotoxin.
SOURCE      Plasmid P307 (from Escherichia coli) DNA, clone pAT153.
  ORGANISM  Plasmid P307
            Unclassified.
REFERENCE   1  (bases 1 to 777)
  AUTHORS   Dykes,C.W., Halliday,I.J., Hobden,A.N., Read,M.J. and Harford,S.
  TITLE     A comparison of the nucleotide sequence of the A subunit of heat-
            labile enterotoxin and cholera toxin
  JOURNAL   FEMS Microbiol. Lett. 26, 171-174 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1      777     heat-labile enterotoxin subunit A (LTA)
BASE COUNT      255 a    136 c    164 g    222 t
ORIGIN      
        1 atgaaaaata taactttcat tttttttatt ttattagcat cgccattata tgcaaatggc
       61 gacagattat accgtgctga ctctagaccc ccagatgaaa taaaacgttc cggaggtctt
      121 atgcccagag ggcataatga gtacttcgat agaggaactc aaatgaatat taatctttat
      181 gatcacgcga gaggaacaca aaccggcttt gtcagatatg atgacggata tgtttccact
      241 tctcttagtt tgagaagtgc tcacttagca ggacagtcta tattatcagg atattccact
      301 tactatatat atgttatagc gacagcacca aatatgttta atgttaatga tgtattaggc
      361 gtatacagcc ctcacccata tgaacaggag gtttctgcgt taggtggaat accatattct
      421 cagatatatg gatggtatcg tgttaatttt ggtgtgattg atgaacgatt acatcgtaac
      481 agggaatata gagaccggta ttacagaaat ctgaatatag ctccggcaga ggatggttac
      541 agattagcag gtttcccacc ggatcaccaa gcttggagag aagaaccctg gattcatcat
      601 gcaccacaag gttgtggaaa ttcatcaaga acaatcacag gtgatacttg taatgaggag
      661 acccagaatc tgagcacaat atatctcagg gaatatcaat caaaagttaa gaggcagata
      721 ttttcagact atcagtcaga ggttgacata tataacagaa ttcgggatga attatga
//
LOCUS       PIGFSHB       929 bp ss-mRNA            MAM       28-JUL-1990
DEFINITION  Pig follicle stimulating hormone (FSH) beta-subunit mRNA,
            3" end.
ACCESSION   M35676
KEYWORDS    follicle stimulating hormone.
SOURCE      Pig anterior pituitary, cDNA to mRNA.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 929)
  AUTHORS   Kato,Y.
  TITLE     Cloning and DNA sequence analysis of the cDNA for the precursor of
            porcine follicle stimulating hormone (FSH) beta-subunit
  JOURNAL   Mol. Cell Endocrinol. 55, 107-112 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      348     follicle stimulating hormone beta-subunit (AA
                                 at 1)
BASE COUNT      256 a    230 c    207 g    236 t
ORIGIN      
        1 gccatctgct gcaatagctg tgagctgacc aacatcacca tcacagtgga gaaagaggag
       61 tgtaacttct gcataagcat caacaccacg tggtgtgctg gctattgcta cacccgggac
      121 ctggtataca aggacccagc caggcccaac atccagaaaa catgtacctt caaggagctg
      181 gtgtacgaga ccgtgaaagt acctggctgt gctcaccatg cagactccct gtatacgtat
      241 ccagtagcca ctgaatgtca ctgtggcaag tgtgacagtg acagtactga ctgcaccgtg
      301 agaggcctgg ggcccagcta ctgctccttc agtgaaatga aagaataaag agcagtggac
      361 atttcatgct tcctaccctt gtctgaagga ccaagacgtc caagaagttt gtgtgtacat
      421 gtgcccaggc tgcaaaccac tatgagagac cccactgatc cctgctgtcc tgtggaggag
      481 gagctccagg aatgcagagt gctagggcct cagtcccatc accactcaac cctgtatttt
      541 gggtctggtt ccataagttt tattcggtct ttttttttaa attactcaat gaattttatt
      601 acatttataa ttgtacaatg atcatcacaa cccaatttta taggatttcc atcccaaacc
      661 cccagcatag acccccatct cccaatctgt ctcatttgga aaccataagt ttttcaaagt
      721 ccgtgagtca gtatctactc agtcttatta ccttaaagac atgtgggtgt tttctgttta
      781 ataatcttag aaatcctctc aagacaggga tatggaccca gaggaaggaa atgggctaag
      841 aatgggtgaa aggactaaat gcagcattct cccactagac acagcagcct acaagagcag
      901 ggccagtctc tttgtcatga gtgtggccg
//

GenBank-Updates@genbank.bio.net (07/31/90)

LOCUS       CHKCYTA      4400 bp ds-DNA             VRT       31-JUL-1990
DEFINITION  Chicken cytotactin gene, 5' flank.
ACCESSION   M35369
KEYWORDS    cytotactin.
SOURCE      Chicken DNA, (library of Clontech), clone CTG[1,2].
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 4400)
  AUTHORS   Jones,F.S., Crossin,K.L., Cunningham,B.A. and Edelman,G.M.
  TITLE     Identification and characterization of the promoter for the
            cytotactin gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.S.Jones, 19-JUN-1990.
            
                Author address: F.S.Jones
                Laboratory of Developmental and Molecular Biology
                The Rockefeller University
                1230 York Avenue
                N.Y., N.Y. 10021
FEATURES       from  to/span     description
    signal     3305     3301 (c) GAGA box
    signal     3996     4001     TATA box
BASE COUNT     1156 a    937 c   1031 g   1276 t
ORIGIN      
        1 gtcgacctgc aggtcaacgg atctgagcat cacttgccaa cttgtaccct cagatgctgt
       61 gtctatatct gcttaggtca gacttccatt ggagtagggg atatatttct gcttttccta
      121 atatcaaaac aatgcccttc agcatgggtt gcaattatga gaatcaatcc agcctcaaag
      181 caatgtcatt gtcaatgctg tatgttgaat taaatgtagt catactgctg tttacactgg
      241 aaaaggatgg tctatgaata actatataaa atagtattcc catttaaata tatgcactga
      301 acaaatagtc aagagtttgc agtgtttttg aaaactcgta ttatacttga gagccattag
      361 ttcaggaaag gaggatatta gtacacgggc agcatgaatt gttgagaata tcagatactt
      421 aaagcaaaag tttagctatt tagatgctat ctgaaaaaga agttaagcta tcttcttgcg
      481 tattagcact gagtgattcc aaacagcaaa tgtagaggat cgaccctgtg atatgatgtc
      541 tctctttaaa acctactctt tcatctttct cttaacatta cataagataa actatagttg
      601 gtgggttctg tttctcaggc tacctattct gaatctggtg tggctgtttc tgctgtgttc
      661 ttgtgcgctg ctgttgagcc tggtactgct ctaacatccc tttttttgca agaaatgtgc
      721 ttccttccag ctacaggcac tgaatacttc ccctggctcc tgaaattgtc caagaatgtt
      781 tccctgcccc aaggcatgat catctgtcca tttgttttgt aagttatatt ccagtggctc
      841 agtgcattcc catctgcaca ctattgaaat cagcttcgct acgtgcatgg caagatgcat
      901 ctcacgctgt gctcatctct gtgccttcag ggcagctttt acactgtctg atatgccaat
      961 acaaaataaa ttattcctgg gactgggtag attaagtttt caagctgaac taagactaat
     1021 tacctgtgtt gcaacaattt gtcccaaggt aacgatgagt atgtagtgtg aaatccacta
     1081 ttttccagaa tgatgagttg ggatctcttt caggctttca tgctgttctt gatttcatgc
     1141 tatgtccaac tgttcctaga tttgtctgta atgattatcc ttacatgtgt gaattgcatc
     1201 tgcagtgtga gcttgtgtgt aaaaagacaa cgataaactc agctgcatat acaaaattta
     1261 atgctcgttt actttctcct tttcattaaa atagaaatca actgatattg acagtcatct
     1321 agtgaaagag aatggcacag ggcaaatgat tcagttgtaa tccttcatgt ttgacatcac
     1381 aacctaaagc tatcctgctg attaaaatac attttccctg ttctttcatg ttatcatttg
     1441 cttctgactc aatgtatcac tcagtgtttt ccacagagtg catttaaaga tgtgtgagcc
     1501 agaaaagata aaaccaggac aggcctggac tcttacctga ctcttattca ccgcatcacg
     1561 gcacgttgtt tgaagtcagg taagtgggtg gaggaagata ggagaaaaaa ttggtttaaa
     1621 ttttgattca atagctgctg gcagaagacc tggtgttagc aaacactaca aaaacaaatc
     1681 ctgcagattg cttcccgttg tttctgtctt gcttagactt gtggatgtgg acagagagat
     1741 ggagtgtttc tcaaatggat ctgaatggac ttctggagta agagaaagca catggtgcat
     1801 tctatgggga ctgtctgctt ctcagtcccc atcagatctt attgggaaaa tctcacattg
     1861 ggtttttgcc attgagttta attgaatttc acaggactgt atgactgggg gagaattaag
     1921 ctatgctggc accatgatca ccttgagaat attctgggga gaacagcaga caaacattga
     1981 gtatttattg ggggctgcaa acggtaacct tattgccttt ggaatcaatg gaaacgtcca
     2041 gaacaagttc tacttttaag aggtgttcca gaaccgtgga gatgtggcac tgcaggacgt
     2101 ggtcagtggg catggtggga tgggttgggg ttggacttgg gggatctggg agatcttttg
     2161 caagcctaat gattctgtga ttctaagtca gcagaaagaa tccaatgcct ttactgggga
     2221 gcagtcagca gttctgacac tctttccagt aggatcttgt ttctgctctt atgacgtgac
     2281 tgctatggaa cacagcgagc cagaaaatta aagcaaaaat cagaaatgag tgctcataaa
     2341 tggctgatag ggatgcatcc ctgattgtac tgaagcgttt gatctcggtg tttcctgaat
     2401 acttctgtag ttgctcggag aaatgggatc tgtttgttgt acaaatcatt tgtaaaaacg
     2461 tcaccaagtt ttgcttgctt ttttgtcagc acctgtttag aaatgattga gagttgtgct
     2521 gtgcattgaa gaagggaatg agctcagtaa tattgttaca catcaggaaa taaacacaag
     2581 ggtcactggt aaataggaaa tccccaccaa agaaaatctg acccgtgtta ccaaaagaac
     2641 tctgttttct gtaatgatga taataataat gattctaaaa tggccatcag aattttgggg
     2701 ccaggaattc cccatccctt cccgaaggtg gagaatgggc caagttttcc tgttagtgct
     2761 gtgcctgtga agcaggaaaa ggaaaaaatc ctagaagtga tgcgagtcca tggaggctgg
     2821 ggaaggccag ctgggaggag gcacactcag aaagggagaa tgctcagaag ggcctggaca
     2881 gtccctctta attcttgtga gcacccttgt tgctgaatag gaccccactg ctttcttact
     2941 gggcagtccc acagcctcct gaggtaggtg ggtaatgctt aaagggattt ggatcagtcc
     3001 tgcttctgaa atggtgaagc aatttcatcc cttctcctag aacacagaca tgacagagct
     3061 taactctcta agagcttaac cctgcagctc ccctccctct aactaccagc ccccccccac
     3121 ccccgaaaga aaggtagatt cagatagaag ggagctgact gctccctgag tgctaactgt
     3181 gatcacaatt ctttgccatg aggctgagat ttgagggtaa gcaggagacc ctccctctga
     3241 gcagcactgt gcattgcccg caggtccggg actgaccaag ggactcacgg aggggttgga
     3301 ggatctttcc aacacccaaa tgcacagtct ctctttaaaa agaaggaagc cctcctgtgc
     3361 gcttctgtgg gcttttatct ctctcttttt aagagaatca gcttaggatg ctgctcagcc
     3421 ctgctgtccc ccgcagctga gggagagcag aagcagcgtg ggatgcagag atcacaagcc
     3481 tggaaagacc gatgtgtgcc agatcagcag cttgtgtaca tccgagctgc tctgctcatc
     3541 cttgcagagc agtgccagcc ccactcgctg cccatccgcc ccagcacact ccgagggact
     3601 ttttggaatg gaaaccctgt gtttgtgatc cccctgattc cagtttgctg ctctttcgca
     3661 gtgttaaaat aaaggggaaa ggggtttaaa attcctgata atgcactctg ctgatgctgt
     3721 aaggaaaata ttcctctttc tgagtcattt gccctgagct gacttttttt tttttgtttg
     3781 aagggttcag gggtttcctt ctttagccat aaactgcatg caaattcttt gtgtgtgtgt
     3841 gtatgacagt gtgtgtctgc gagtgcgtgt gtgtttcttc agaagctaag ccaaagagag
     3901 aggctgttcc tttaaagcaa aactgaaaaa aaaaatgcct tcaggcatcc tattacagtg
     3961 gaatgagcca aggatcagga tttgcagctc agctatataa acccggcaga atctggctga
     4021 tctgaccagt gtgccgcact gtcaaaccct cctttcacac acgcgcgcac caaatgagac
     4081 ggcacaactt ctctgagttt tgacaggacg gcgaggaatc cgggagccga cagctggctg
     4141 cagtacctct gcttcgtgga ggtcgcccgt ggcaggatct gatccgtcag cccacacggt
     4201 aagcctcctt ccttcctctg ttcaaatatc aggagctctt ttgccaccca agtctgcgac
     4261 acggttccat tgcgaccgtg gcagctgtga ctttgttcag tcggagcggg caagggtttt
     4321 taggagggag tgtttctgct ttcgttcggc tgtgctccct gatggcataa gaacgtagga
     4381 aaggtttcag cggcatcgat
//
LOCUS       HUMIFNB       507 bp ds-DNA             SYN       31-JUL-1990
DEFINITION  Human fibroblast interferon (IFN-beta) gene, complete cds.
ACCESSION   M35591
KEYWORDS    interferon.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 507)
  AUTHORS   Nagase,Y., Nakamura,N., Tohyama,J., Watanabe,S., Ogino,H.,
            Horikoshi,K., Nii,A., Soma,M., Nobuhara,M., Suzuki,Y. and
            Mochida,E.
  TITLE     Chemical synthesis of a human fibroblast interferon gene and its
            expression in Escherichia coli
  JOURNAL   Nucleic Acids Symp Ser 12, 83-86 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          3      503     fibroblast interferon (IFN-beta)
BASE COUNT      140 a    126 c    106 g    135 t
ORIGIN      
        1 cgatgtcgta caacctgctt ggattcctgc aacgctctag caacttccaa tgtcagaaac
       61 tgctctggca gcttaatggt cgtctggaat actgcctgaa agaccgtatg aacttcgaca
      121 tcccggaaga gatcaagcaa ctccagcaat tccagaaaga agacgcagct ctgactatct
      181 acgaaatgct tcagaacatc ttcgcaatct ttcgtcagga cagctcctct actggttgga
      241 acgagactat cgttgagaac ctgctcgcta acgtatatca tcagatcaac cacctgaaga
      301 ctgttcttga agagaagctg gagaaagaag atttcactcg aggtaaactg atgtcttccc
      361 ttcatctgaa gcgttattac ggtcgtatcc tgcattacct gaaagctaag gaatactctc
      421 attgcgcttg gactatcgtt cgtgtagaaa tcctgcgtaa tttctacttt atcaaccgtc
      481 tgactgggta ccttcgcaac tgatagg
//
LOCUS       HUMMAC2       908 bp ss-mRNA            PRI       31-JUL-1990
DEFINITION  Human macrophage galactose-specific lectin (hMAC-2) mRNA, complete
            cds.
ACCESSION   M35368
KEYWORDS    macrophage galactose-specific lectin.
SOURCE      Human colon epithelial carcinoma cell line HT29, cDNA to mRNA,
            clone hMac2.2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 908)
  AUTHORS   Cherayil,B., Chaitovitz,S., Wong,C. and Pillai,S.
  TITLE     Molecular cloning of a macrophage galactose-specific lectin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.J.Cherayil, 22-JUN-1990.
FEATURES       from  to/span     description
    pept         23      769     macrophage galactose-specific lectin (hMAC-2)
    mRNA    <     1      908     macrophage galactose-specific lectin mRNA
    site        137      248     lectin domain
BASE COUNT      251 a    233 c    217 g    207 t
ORIGIN      
        1 ggagccagcc aacgagcgga aaatggcaga caatttttcg ctccatgatg cgttatctgg
       61 gtctggaaac ccaaaccctc aaggatggcc tggcgcatgg gggaaccagc ctgctgggca
      121 ggggctaccc aggggcttcc tatcctgggg cctaccccgg gcaggcaccc ccagggctta
      181 tcctggacag gcacctccag gcgcctaccc tggagcacct ggagcttatc ccggagcacc
      241 tgcacctgga gtctacccag ggccacccag cggccctggc tacccatctt ctggacagcc
      301 aagtgccccc ggagcctacc ctgccactgg cccctatggc gcccctgctg ggccactgat
      361 tgtgccttat aacctgcctt tgcctggggg agtggtgcct cgcatgctga taacaattct
      421 gggcacggtg aagcccaatg caaacagaat tgctttagat ttccaaagag ggaatgatgt
      481 tgccttccac tttaacccac gcttcaatga gaacaacagg agagtcattg tttgcaatac
      541 aaagctggat aataactggg gaagggaaga aagacagtcg gttttcccat ttgaaagtgg
      601 gaaaccattc aaaatacaag tactggttga acctgaccac ttcaaggttg cagtgaatga
      661 tgctcacttg ttgcagtaca atcatcgggt taaaaaactc aatgaaatca gcaaactggg
      721 aatttctggt gacatagacc tcaccagtgc ttcatatacc atgatataat ctgaaagggg
      781 cagattaaaa aaaaaaaaaa agaatctaaa ccttacatgt gtaaaggttt catgttcact
      841 gtgagtgaaa atttttacat tcatcaatat ccctcttgta agtcatctac ttaataaata
      901 ttacagag
//
LOCUS       MUSIGLCD      794 bp ds-DNA             ROD       31-JUL-1990
DEFINITION  Mouse Ig germline lambda-chain gene C5 region.
ACCESSION   M35582
KEYWORDS    constant region; germline; immunoglobulin light-chain;
            lambda-immunoglobulin.
SOURCE      Mouse (strain SPE) DNA.
  ORGANISM  Mus spretus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 794)
  AUTHORS   Mami,F., Cazenave,P.-A. and Kindt,T.J.
  TITLE     Conservation of the immunoglobulin C-lambda-5 gene in the Mus genus
  JOURNAL   EMBO J. 7, 117-122 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /   294      613     Ig lambda-chain C5 region
BASE COUNT      195 a    188 c    195 g    216 t
ORIGIN      
        1 ggttccagct tcaaggggct agagaattca gctggtctta gtctcagcgg gggaactgag
       61 attgcaaggg tctgggtctg ggtcatttta tctggaagag gaacatgttc taatggatgc
      121 taggctgtct gctctccaag ggactcaagt ggtcagagga gaagaaggaa gcatccctgg
      181 atggaagact gatgctgtag tgaatggcca cagagctcct gataagagaa ggacgcttcc
      241 ttatcacatg ggctctccta tgctaactct tatccccttc tctatctgcg caggtcagcc
      301 caagtctgat cccttggtca ctctgttcct gccttcctta aagaatcttc aggccaataa
      361 ggtcacactg gtgtgtttgg tgagcgaatt ctacccaggt actttggtgg tggactggaa
      421 ggtagatggg gtccctgtca ctcagggtgt agagacaacc caaccctcca aacagaccaa
      481 caacaaatac atggtcagca gctacctgac attgatatct gaccagtgga tgcctcacag
      541 tagatacagc tgccgggtca ctcatgaagg aaacactgtg gagaagagtg tgtcacctgc
      601 tgagtgttct tagaccacaa tcctccctga agccttaggg gcctggatct gaagtgccag
      661 aaaaagttgt tttttgtttt tttttttttt tccccattaa ccatctcact gtctttcctg
      721 tgcctaatac tcaataaata tcttaccacc aaccagagag tcttgtcgat ctcatcattt
      781 gcatgccagg gatg
//
LOCUS       PSELSP       2396 bp ds-DNA             BCT       31-JUL-1990
DEFINITION  P.fluorescens signal peptidase II (lsp) gene, complete cds and
            isoleucyl-tRNA synthetase (ileS) gene, 3' end.
ACCESSION   M35366
KEYWORDS    isoleucyl-tRNA synthetase; signal peptidase II.
SOURCE      P.fluorescens (strain NCIB 10586) DNA.
  ORGANISM  Pseudomonas fluorescens
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 2396)
  AUTHORS   Isaki,L., Beers,R. and Wu,H.
  TITLE     Nucleotide sequence of the Pseudomonas fluorescens signal peptidase
            II (lsp) and flanking genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Wu, 18-JUN-1990.
            
                Author address:H.Wu
                Uniformed Services University of the Health Sciences
                4301 Jones Bridge Rd.
                Bethesda, MD 20814
FEATURES       from  to/span     description
    pept    <     1      729     isoleucyl-tRNA synthetase (ileS) (AA at 1)
    pept        722     1234     signal peptidase II (lsp)
    pept       1227     1679     ORF149
    pept       1857  >  2396     ORF316
    rpt         768      810     inverted repeat
    rpt        1239     1268     inverted repeat
    rpt        1640     1674     inverted repeat
    rpt        1697     1889     inverted repeat
BASE COUNT      498 a    706 c    712 g    478 t      2 others
ORIGIN      
        1 ctgcagcgcg agttgcagga acactacggc gaataccgct tctggaacgt ctactccaag
       61 atccacaact tctgcgtgca ggagctgggt ggtttctacc tcgacatcat caaggaccgc
      121 cagtacacca ctggcgccaa cagcaaggcg cgccgttcgg cagaccgcgc tgtaccacat
      181 cagcgaaggc tggtgcgctg gatcgcaccg atcctggcat tcaccgctga cgaactgtgg
      241 gagtacctgc cgggcgagcg taacgagtcc gtcatgctca acacctggta cgaaggcctg
      301 accgaactgc cggctgactt cgaactgggc cgcgagtact gggagggcgt gatggccgtc
      361 aaggttgcgg tgaacaagga actggaagtc cagcgtgcgg ccaaggccgt gggtggcaac
      421 ctgcaagccg aagtcaccct gtttgccgaa gacggcctga ccgccgacct ggccaagctg
      481 agcaacgagc tgcgcttcgt gctgatcacc tctactgcga gcctggcgcc gtttacccag
      541 gctccggcag atgcggtggc taccgaagtg cctggcctta agctcaaagt ggtcaagtcg
      601 gccttcccca agtgcgctcg ttgctggcac tgccgtgaag acgtcggcgt gaaccctgag
      661 cacccggaaa tctgcggtcg ttgtgtcgac aacatcagcg gtgaaggcga ggttcgccac
      721 tatgcctaat gcagacagtc gtttcggacg tctgggctgg ctcgtactga gcctgctggt
      781 cctggtcatt gaccaggtca gcaaggctca cttcgagggc tccctggaaa tgttccagca
      841 aatcgtggtg atcccggatt atttcagctg gaccctggcc tacaacactg gcgccgcctt
      901 cagcttcctc gctgacggcg gtggctggca gcgctggctg tttgctgtga tcgccgtggt
      961 ggtaagtgcc gtactggtgg tgtggctcaa gcgcctgggc cgcgacgaca cctggctggc
     1021 catcgcgctg gccctagtgt tgggtggcgc gctgggcaac ctgtatgacc gcatcgccct
     1081 gggccatgtg atcgacttta tcctggtgca ttggcagaac cgccactact tcccggcgtt
     1141 caactttgcc gacagtgcta tcaccgtcgg cgcaatcatg ctggcgctgg atatgttcaa
     1201 gagcaagaaa accggagaaa ccgtcaatga ctgatcaggt attggctgag caacgcatcg
     1261 gccagaacac ggaagtcact ttgcatttcg cactgcgcct ggagaatggc gacacggtcg
     1321 acagcacgtt cgacaaggcc cccgcgacct tcaaggtcgg cgacggcaac ctgctgccgg
     1381 gtttcgaagc ggcactgttc ggtttcaagg ccggtgacaa gcgcaccctg caaatcctgc
     1441 cggaaaacgc ctttggccag cccaacccgc aaaacgtgca gatcatcccc cgttcgcagt
     1501 tccagaacat ggacctgtcg gaaggcttgc tggtgatctt caatgatgcg gcgaacactg
     1561 aattgcctgg cgtggtgaaa gcatttgatg acgcgcaagt gaccatcgac ttcaatcacc
     1621 cgttggccgg caagacgttg acctttgacg tcgagattat cgacgtcaaa gcgctgtaac
     1681 cgaccgtacg cggtctaaaa tgtgggaggg ggcttgctcc cgatagcggt atatcagcta
     1741 aatgtgcaga ctgatatacc tatatcggga gcaagccccc tcccaccttt gatctccatt
     1801 gtcggttgag tcagtgtcaa attgactcaa tatggctgca atacacgagg cacagcatgc
     1861 aaatcaaact cgccaacccc cgtggcttct gcgccggcgt ggaccgggcg atcgaaatcg
     1921 tcaatcgcgc cctggaagtc ttcgggccgc cgatttatgt gcgccatgaa gtcgtccata
     1981 acaaatttgt ggtcgaagac ttgcgtgcgc gcggggcgat ctttgtcgaa gaactcgatc
     2041 aggtgaaaga cgacgtgatc gtcatcttca gtgcccacgg tgtttcccag gctgtacgta
     2101 ccgaagcggc gggccgtggc ctgaaggtgt tcgatgccac ctgcccactg gtgaccaagg
     2161 tgcatatcga ggtggcgcgc tacagccgcg acggtcgtga gtgcatcctg atcggccacg
     2221 ccggtcaccc ggangtagaa ggcaccatgg gccaatacga cgccagcaac ggcggtgcta
     2281 tctacctggt cgaggacgag aaagacgtcg ccaacttgca ggtgcacaat ccggaacgcc
     2341 tggccttcgt gacccagacc accttgtcca tggacgacac cagccgngta atcgat
//
LOCUS       PSEXILES     1128 bp ds-DNA             BCT       31-JUL-1990
DEFINITION  P.fluorescens X and isoleucyl-tRNA synthetase (ileS) genes,
            complete cds.
ACCESSION   M35367
KEYWORDS    isoleucyl-tRNA synthetase.
SOURCE      P.fluorescens (strain NCIB 10586) DNA.
  ORGANISM  Pseudomonas fluorescens
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1128)
  AUTHORS   Isaki,L., Beers,R. and Wu,H.
  TITLE     Nucleotide sequence of the Pseudomonas fluorescens signal peptidase
            II (lsp) and flanking genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Wu, 18-JUN-1990.
            
                Author address:H.Wu
                Uniformed Services University of the Health Sciences
                4301 Jones Bridge Rd.
                Bethesda, MD 20814
FEATURES       from  to/span     description
    pept         90     1025     protein X
    pept       1040  >  1128     isoleucyl-tRNA synthetase (ileS)
    signal       56       61     TATA box
BASE COUNT      198 a    349 c    349 g    231 t      1 others
ORIGIN      
        1 tcgattcgac ccatttggcc tgcgctgttg cctgtcgtcc ggggctgggt gtggttataa
       61 tcgaccactt tatgagcaag aagcgcgtta tgcagctggt tcgaggtctc cacaacctgc
      121 gccccgagca tcggggctgc gtcgccacta ttggcaactt tgacggtgtt caccgtggcc
      181 accaggctat cctggcaagg ctgcgcgagc gtgcggtcga gttgggtgtg cccagctgcg
      241 tggtgatttt cgagccacaa ccgcgggagt tctttacccc ggaaacagcg ccggcccgct
      301 tggcgcttgc gcgacaagct gcaactgctg gcggagaggg cgtggaccgc gtcctctgcc
      361 tggctttcaa ccagcgtttg cgcagcctca gcgccgccga gttcgtcgac cgcattctgg
      421 tcgatggcct gggtgtacaa cacctggagg ttggtgacga cttccatttc ggttgcgatc
      481 gggtcgggga tttcgatttc ctgcaacatg ccggcgtcaa ccagggcttt accgttgaag
      541 ccgcccaaac cgtcgaactg gacggcctgc gtgtgagcag cacccaggtg cgtaacgcct
      601 tggctgccgc cgacttcgac ctggccgagc gtttgctcgg tcgcccgttc cgcattgccg
      661 ggcgggtact gcacggccag aagctggcgc gccaattggg cacgccaact gccaacgtgc
      721 aactcaagcg ccgccgagtg ccgctgactg gggtttacct ggtgagcgtc gacatcgacg
      781 gccaatcgtg gccgggagtc gccaatatag gcgtcaggcc cacggttgca ggtgatggca
      841 aggcccacct ggaagttcac cttttggatt ttgccggtga tttatacgac cggcgtttga
      901 cggtggtttt ccaccagaag ctgcgtgaag agcagcgttt cgcctccctg gaggcgttga
      961 aaacggcgat caatgcggat gtcgccgccg cccgtgcact agccgcacct agcgcccatc
     1021 gctaaccgaa gagccttaaa tgaccgacta taaagccacg ntaaaccttc cggacaccgc
     1081 cttcccaatg aaggccggcc tgccacagcg cgaaccgcag tcctgcag
//
LOCUS       BLY656        672 bp ss-mRNA            PLN       31-JUL-1990
DEFINITION  H.vulgare thaumatin II mRNA, complete cds.
ACCESSION   M35656
KEYWORDS    thaumatin II.
SOURCE      H.vulgare (cv. Bomi) endosperm, cDNA to mRNA, clone pUP-23.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 672)
  AUTHORS   Lazaro,A., Rodriguez-Palenzuela,P., Marana,C., Carbonero,P. and
            Garcia-Olmedo,F.
  TITLE     Signal peptide homology between the sweet protein thaumatin II and
            unrelated cereal alpha-amylase/trypsin inhibitors
  JOURNAL   FEBS Lett. 239, 147-150 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         39      482     thaumatin II precursor
    sigp         39      101     thaumatin II signal peptide
    matp        102      479     thaumatin II
BASE COUNT      155 a    175 c    209 g    133 t
ORIGIN      
        1 aagagattga accaacgacc aataaactag tatcaacaat ggcatccgac catcgtcgct
       61 tcgtcctctc cggcgccgtc ttgctctcgg tcctcgccgt cgccgccgcc accttggaga
      121 gcgtcaagga cgagtgccaa ctaggggtgg acttcccgca taacccgtta gccacctgcc
      181 acacctacgt gataaaacgg gtctgcggcc gcggtcccag ccggcccatg ctggtgaagg
      241 agcggtgctg ccgggagctg gcggccgtcc cggatcactg ccggtgcgag gcgctgcgca
      301 tcctcatgga cggggtgcgc acgccggagg gccgcgtggt tgagggacgg ctcggtgaca
      361 ggcgtgactg cccgagggag gagcagaggg cgttcgccgc cacgcttgtc acggcggcgg
      421 agtgcaacct atcgtccgtc caggcgccgg gagtacgctt ggtgctactg gcagatggat
      481 gacgatgcaa atgcgccaag gtaatgaagc ggagtactgt atacagaata aaagtactcg
      541 agtgaaaaca aactcataaa taaaccttgt gagatgtatg cgtatgatct atggtgtgga
      601 cagttaaatt gtggccgatt gatgaataaa aaaggttgga acaaattaaa ttgttgtggg
      661 ttcatatact at
//
LOCUS       SPIFERRI      632 bp ss-mRNA            PLN       31-JUL-1990
DEFINITION  Spinach ferredoxin I mRNA, complete cds.
ACCESSION   M35660
KEYWORDS    ferredoxin I.
SOURCE      Spinach (var. Monatol) seedlings, cDNA to mRNA, p6SocFd/1.
  ORGANISM  Spinacia oleracea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE   1  (bases 1 to 632)
  AUTHORS   Wedel,N., Bartling,D. and Herrmann,R.G.
  TITLE     Analysis of cDNA clones encoding the entire ferredoxin I precursor
            polypeptide from spinach
  JOURNAL   Botanica Acta 101, 295-300 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         17      460     ferredoxin I precursor
    sigp         17      166     ferredoxin I signal peptide
    matp        167      457     ferredoxin I
    mRNA    <     1      632     ferredoxin I mRNA
BASE COUNT      176 a    131 c    139 g    186 t
ORIGIN      
        1 ttacaaaatt gtaaaaatgg cagcaaccac cacaacaatg atgggcatgg ccaccacctt
       61 tgtcccaaaa ccccaagcac caccaatgat ggcggcgctt ccatccaaca ccggccgctc
      121 tttgttcgga ctcaagaccg gtagccgtgg cggaaggatg acaatggctg cctacaaggt
      181 aaccttggta acacccaccg gtaacgtaga gtttcaatgc ccagacgatg tttacatctt
      241 ggatgctgct gaagaagaag gcattgactt gccttactca tgcagagctg ggtcgtgctc
      301 ttcatgcgcc ggaaagctta agacaggtag tcttaaccaa gatgatcaga gttttttgga
      361 tgacgatcag atcgatgaag gatgggttct tacctgtgct gcttaccctg ttagtgatgt
      421 tactattgag acccacaagg aagaggagct tactgcctaa ttcatttttt tttatttttt
      481 attattttgt tctcatttga gagggaattg aaagactaaa aaagagtaaa tgcagcgagg
      541 agtttttact tcttcgatct gtggtttgta attgtgtatt atcatgttaa tcaattgctc
      601 ttataagctt aattactacg taatatatat at
//
LOCUS       GEM5SRRNA     108 bp ss-rRNA            RNA       31-JUL-1990
DEFINITION  G.obscuriglobus 5S ribosomal RNA.
ACCESSION   M35170
KEYWORDS    5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      G.obscuriglobus (strain UQM 2246) ribosomal RNA.
  ORGANISM  Gemmata obscuriglobus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE   1  (bases 1 to 108)
  AUTHORS   Bomar,D., Giovannoni,S. and Stackebrandt,E.
  TITLE     A unique type of eubacterial 5S rRNA in members of the order
            Planctomycetales
  JOURNAL   J. Mol. Evol. 27, 121-125 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1  >   108     5S rRNA
BASE COUNT       25 a     33 c     31 g     19 t
ORIGIN      
        1 ttccggtgac catacccaaa cggaaacacc cgttcccatt ccgaacacgg ccgtgaaagt
       61 ttgaggccga tggtagtgcg tccagcgcga gagtaggtat cgccggcc
//
LOCUS       ISO5SRRNA     111 bp ss-rRNA            RNA       31-JUL-1990
DEFINITION  I.pallida 5S ribosomal RNA.
ACCESSION   M35169
KEYWORDS    5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      I.pallida (strain IS 1B) ribosomal RNA.
  ORGANISM  Isosphaera pallida
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Bomar,D., Giovannoni,S. and Stackebrandt,E.
  TITLE     A unique type of eubacterial 5S rRNA in members of the order
            Planctomycetales
  JOURNAL   J. Mol. Evol. 27, 121-125 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1  >   111     5S rRNA
BASE COUNT       19 a     39 c     32 g     21 t
ORIGIN      
        1 ctccggtgac cataccgtcg gggtcctacc cgttcccatt ccgaacacgg ccgtcaagcc
       61 cgccgggcct atgatagtgc gtcagcgcga aagtcggtga tcgccggatc t
//
LOCUS       MUSINTZ1      271 bp ds-DNA             ROD       31-JUL-1990
DEFINITION  Mouse integration site for foreign DNA, 5' flank.
ACCESSION   M35630
KEYWORDS    integration site.
SEGMENT     1 of 3
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 271)
  AUTHORS   Makarova,I.V., Tarantul,V.Z. and Gazaryan,K.G.
  TITLE     Structural features of the integration site of foreign DNA in the
            genome of transgenic mice
  JOURNAL   Mol. Biol. 22, 1235-1242 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt           9       50     region of dyad symmetry
BASE COUNT      108 a     48 c     55 g     60 t
ORIGIN      
        1 gaaaaatcta cgttaataaa acgaactaac ggaacaacat tattacaggt agaaagattc
       61 atcagttgag atttaggaat accacatcaa ctaatgcaga tacataacgc caaaaggaat
      121 tacgaggcat agtaagagca acactatcat aaccctcgtt taccagacga cgataaaaac
      181 caaaatagcg agaggctttt gcaaagaagt tttgccagag ggggtaatag taaatgttta
      241 gactggatag cgtccaatat gcggtcgtca a
//
LOCUS       MUSINTZ2      300 bp ds-DNA             ROD       31-JUL-1990
DEFINITION  Mouse integration site for foreign DNA, 3' flank.
ACCESSION   M35631
KEYWORDS    integration site.
SEGMENT     2 of 3
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Makarova,I.V., Tarantul,V.Z. and Gazaryan,K.G.
  TITLE     Structural features of the integration site of foreign DNA in the
            genome of transgenic mice
  JOURNAL   Mol. Biol. 22, 1235-1242 (1988)
  STANDARD  simple staff_review
BASE COUNT       56 a     94 c     81 g     69 t
ORIGIN      About 1.5 kb after segment 1.
        1 ccagctggtg ctccatgtgc agccaggtcc tggccccact atgtccatcc cagctccagc
       61 ccctgcttgc tggcagcagc ctggcagccc caggcacttc tcccagcacg ctgccatgtc
      121 actggtcagg tcgggtgctg cctgcctgct gccagcaggc tgtccccggt gtacccactg
      181 tttccccact gctaggtcat ctgtgcactg atgcatgcag gtgattctcg gtaatatggc
      241 atacgtgttc gggaatgatc gtaatcaaaa aggcgagata agctggtcag agatatatag
//
LOCUS       MUSINTZ3       99 bp ds-DNA             ROD       31-JUL-1990
DEFINITION  Mouse repetitive sequence DNA.
ACCESSION   M35632
KEYWORDS    .
SEGMENT     3 of 3
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 99)
  AUTHORS   Makarova,I.V., Tarantul,V.Z. and Gazaryan,K.G.
  TITLE     Structural features of the integration site of foreign DNA in the
            genome of transgenic mice
  JOURNAL   Mol. Biol. 22, 1235-1242 (1988)
  STANDARD  simple staff_review
BASE COUNT       12 a     26 c     41 g     20 t
ORIGIN      About 1.5 kb after segment 3.
        1 agctgggaat cggggggggg ggggggggcg ggtggtggtt tggccaggga tttgggcacc
       61 ttcctaagcc ccaaccctaa tgccggtttc cttcccccg
//
LOCUS       NPOCAPA      3555 bp ds-DNA             VRL       31-JUL-1990
DEFINITION  O.pseudotsugata polyhedrosis virus capsid protein p87 gene,
            complete cds.
ACCESSION   M35445 D00514
KEYWORDS    capsid protein; structural protein.
SOURCE      O.pseudotsugata polyhedrosis virus DNA.
  ORGANISM  Orgyia pseudotsugata polyhedrosis virus
            Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; 
            Nuclear polyhedrosis viruses.
REFERENCE   1  (bases 1 to 3555)
  AUTHORS   Mueller,R., Pearson,M.N., Russell,R.L.Q. and Rohrmann,G.F.
  TITLE     A capsid-associated protein of the multicapsid nuclear polyhedrosis
            virus of Orgyia pseudotsugata: Genetic location, sequence,
            transcriptional mapping, and immunocytochemical characterization
  JOURNAL   Virology 176, 133-144 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        216     2090     capsid protein p87
    mRNA        182     2282     capsid protein p87 mRNA (alt.)
    mRNA        209     2282     capsid protein p87 mRNA (alt.)
    signal     2194     2199     polyA site
BASE COUNT      946 a   1003 c    880 g    726 t
ORIGIN      
        1 gaattctgta aaaaaggtta ggccctttat gtcaatcaaa ctgtgctggt caaagtactt
       61 ggccgtcaaa aaggttagcg agtcgatttc gcgttggtgc atttgcgctt caaagcgcac
      121 gtgttcaaag cggtgctcca gaccgtgcac cgcgttgaac ctcaaattgt acaacaattt
      181 gtaagtgtgc atgttgcatt cgatatagcc ttataatgga cgagcagcat tcattacgca
      241 ttgccgcgct ggctggcgag atcttgacgc gcgaccgcgc acaggtgaac acaatcatac
      301 acagccccga gcgcgcgctg ggccaaaaat tggacgccat tacggcgcta gtcgattcaa
      361 tgcagcccgg cacgccgcgc gacgctgctg tgaacgaagc cgcaaacgtt accgcgcaaa
      421 gcccaatgtc ggaaactcaa gacccgcagc gcgccaacga caacgtgtca gataccgttg
      481 caaatgaaaa cgcgcaaaac ttgttgttgg agggccaaga ccgcgttttg cgacatcgcg
      541 ttttgcaaat cgcggtaacg tttttgcagc gcaacaagcg agtaaaagcg aacgcgacca
      601 cgttggccca gattgaggaa gcgctccgaa actatgaaac tgcaaaaaac agcggcgcaa
      661 gcgacagtgt aatcgacggg tttcttgaac gtgccgaatc tttgtttaac accctaaaaa
      721 acatatcgct atccgagctg ttggaccgcg aaagcgcagt gtttgcggac acggaaagcg
      781 ctccgcggac gcaaaccgcg gacaactcgc cgccacctgt gagcgaacaa gactttgacc
      841 ggttggacat tagccagctc accgattaca tcgagaacaa ttacagggat cagtttgatt
      901 tcgacaaaca caacagtgtg gaagacgtgc gcaactttgc aaaaaatctt tggcgcaaca
      961 aaacgcgtgt cacgtccacg cctctgcagg aataccagat gccaccgcaa acgcccgcgc
     1021 ctctgcaaga ccaaatgcca ccgcaaacgc ccgcgtacgc gacgcccgcg cagcagccaa
     1081 gccagccaac gcccgcgcaa acgcccgcgc agcagccaag ccagccaacg cccgcgtacg
     1141 ttacgtctgc gcaaacgccc gcgcagcagc caagccagcc aacgcccgtt tccaattaca
     1201 gttgggaaag gcgcgtcgcg tcaatgtttg cgaatacaga tttaccacaa aacgtgccgc
     1261 tgcccgacag ctacgacacc tcgtccgtca ttgggcaaaa acggcgcaag cggcgcgcgc
     1321 cgccgctgcc gccgtactcg tcagacgaag aagacgccgc cccgccacgt tcgccaaaac
     1381 gcaaagagtc gctttcctcg tctgaagaag acgaatttga ctacgagcgc gaacaaaaac
     1441 gccggcgcga ggaggataaa aactttttgc gcctaaaggc gttggaactg tccaagtacg
     1501 ccggcgttaa cgagcgcatg gaaaagattg tgcgcgttac caaggcgatg caacaaacgt
     1561 acgactactg caattgcaaa aacacaatca gcggcacgcc cgccgcggcg tcgtttatta
     1621 acttgttgcg gcggctgaac acctacaacc tcagccacgt ggaaatgacg gtcaactttt
     1681 acgagctctt gtaccccctc acgctgtaca acgacgagtc caaccgcatt gttggctaca
     1741 tattcgcggc gaccaactat tttcaaaatt gcgccaaaaa ctttgggcgc atgcgcgccg
     1801 agtttaacga gcacggcccg tttgcgcaaa ttgattcgct cgtcatgttt gtcatcaaat
     1861 tcaacttttt gtgcgacttg caaacgtttt ttggaaaaat tgacggcttg ccaatgttgg
     1921 cccagcccaa catcaagacg cacaccgtgc tggtgatgcg cgacaagatc gtcaagctgg
     1981 cctttggcgc gctgcaatac gacaccagtc tgaaaacgga caatcggcgg gaccccaaac
     2041 acctgcaacg actcatccag ttgatgaacg ccgattttaa cataatgtaa acaaatgtgt
     2101 tagcaattta tttattacaa acagagttac gatattcggt gtacatggtt ttataatctt
     2161 tgtgatcaaa gttaaaacgt tttctataca aataataaag cttgttaaac agctgttttc
     2221 gttttttttt gtgtcgcaaa caaatttttt gtggcaatgg cgtagcacat ctagaaaatt
     2281 aaatttttcg ccgaccgcac actcttcatt aagctcgtta cggctggccg cctatgcgat
     2341 aactgcacgt gacacacagg gtgtagtggc cgcatggcgc cgccaccatg cgcttcggct
     2401 cgctcatgca aacaatgcac tcgtcgttgg cttcgtcggt aaacacgcgc tcaacagtaa
     2461 ttttacaaac ggtaacggcg tctggtttta tgtaataata gcatttggag cggcaaatta
     2521 aggaactgtc tttgggcgcc acggcaaaag tggcgccctc gcgccgcaac cgcttgggaa
     2581 aggtgcagtt ggcgtcgcgc cgataggcgg gatcgtctat acgcacgcgc agcctgccgt
     2641 tcacctttac acaaatcaac ttaacgtgcg ccattgcacg gtcgaatgta ggaccagtgt
     2701 caattattta agacaccgtt agcataaccg ctggcccaat ggacgccaaa tacaaggtgg
     2761 tcgacgtgga cacctttgcc cgccagctaa tcacggacaa gtgcagcgag ctgattgaaa
     2821 ccgaaaatct attgcctgca aacattttgc atgttgttaa acaagcaaga gacaaatact
     2881 ttgaagaccc ctcggtcaaa aactacgaat atgtcaaaaa cctgtttttg cgcacaaaat
     2941 acatggacga ctcaattgac tacaaaaatt ttaatcggcg cgtgttgctg attgtgttca
     3001 aatttgcgct caaccgaggc tccggctact ttccctcgta cagggagctc attgaagtgg
     3061 ccgtcaagcg cctcaacaaa atcaaccccg acctcaagag ctcgccgcgc gccatgttgc
     3121 agcactacaa cgagtgcctg gaaaacctgg acaacccggt gaccgacgag caccacctgt
     3181 taacgtttgg aaaagaagtg gccaccaaaa tgtttatcga ggcgttcgag tttagctacg
     3241 ccagcaacaa cgaaatcaac ttaactacca acaagcgcgg ctcggacctg ttcgacccga
     3301 taccgatgcc cgcgcccgcg cccgcgcctt cggcgtcttt gttggacaac gtcatgaacg
     3361 agcgcaagcg caagttgcag gcgtcggtca cgacaacgcc gcccaaacga tgcaaattgg
     3421 ccgaccggcc ggcgcagacc acacaagaca cgccgcgcgc gccgcaaccg gcgcctgtgc
     3481 gcgcgcagcg gcccctgttt acgctgtaga cgcgcgcgca gaggcgtcgg tgggcggcgc
     3541 gtaatttcgc tgcag
//
LOCUS       PLL5SRRNA     110 bp ss-rRNA            RNA       31-JUL-1990
DEFINITION  P.marina 5S ribosomal RNA.
ACCESSION   M35164
KEYWORDS    5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      P.marina (strain IFAM 1313) ribosomal RNA.
  ORGANISM  Pirella marina
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE   1  (bases 1 to 110)
  AUTHORS   Bomar,D., Giovannoni,S. and Stackebrandt,E.
  TITLE     A unique type of eubacterial 5S rRNA in members of the order
            Planctomycetales
  JOURNAL   J. Mol. Evol. 27, 121-125 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1  >   110     5S rRNA
BASE COUNT       30 a     29 c     25 g     26 t
ORIGIN      
        1 ttccggtgac catatcgaaa aggtcatacc tgttcccatt ccgaacacag ccgtcaagct
       61 tttcgagccg atgatagtac cacaagtgtg aaagtaggta tcgccggatc
//
LOCUS       PLLS5RRNAA    111 bp ss-rRNA            RNA       31-JUL-1990
DEFINITION  Pirella sp. 5S ribosomal RNA.
ACCESSION   M35165
KEYWORDS    5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      Pirella sp. (strain IFAM 1310) ribosomal RNA.
  ORGANISM  Pirella sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Bomar,D., Giovannoni,S. and Stackebrandt,E.
  TITLE     A unique type of eubacterial 5S rRNA in members of the order
            Planctomycetales
  JOURNAL   J. Mol. Evol. 27, 121-125 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1  >   111     5S rRNA
BASE COUNT       29 a     29 c     25 g     28 t
ORIGIN      
        1 ttccggcgat catatcttaa aggttatacc tgttcccatt ccgaacacag cagtcaagct
       61 ttaagagccg atgatagtgc ccaccagcgt gaaagtaggt cttgccggat c
//
LOCUS       PLLS5RRNAB    111 bp ss-rRNA            RNA       31-JUL-1990
DEFINITION  Pirella sp. 5S ribosomal RNA.
ACCESSION   M35166
KEYWORDS    5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      Pirella sp. (strain IFAM 1441) ribosomal RNA.
  ORGANISM  Pirella sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Bomar,D., Giovannoni,S. and Stackebrandt,E.
  TITLE     A unique type of eubacterial 5S rRNA in members of the order
            Planctomycetales
  JOURNAL   J. Mol. Evol. 27, 121-125 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1  >   111     5S rRNA
BASE COUNT       29 a     31 c     27 g     24 t
ORIGIN      
        1 ttccggtgac catatggttg tggaaacacc tgttcccttt ccgaacacag cagttaagca
       61 caaccagccg atgatagtgc ccaccagtgc gaaagtaggt atcgccggat c
//
LOCUS       PLT5SRRNAA    111 bp ss-rRNA            RNA       31-JUL-1990
DEFINITION  P.limnophilus 5S ribosomal RNA.
ACCESSION   M35167
KEYWORDS    5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      P.limnophilus (strain IFAM 1008) ribosomal RNA.
  ORGANISM  Planctomyces limnophilus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Bomar,D., Giovannoni,S. and Stackebrandt,E.
  TITLE     A unique type of eubacterial 5S rRNA in members of the order
            Planctomycetales
  JOURNAL   J. Mol. Evol. 27, 121-125 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1  >   111     5S rRNA
BASE COUNT       26 a     33 c     29 g     23 t
ORIGIN      
        1 ttccggtgac tttacgcgtg aggaaacact cgttcccatt ccgaacacga cagttaagct
       61 cccgcggccg atgatagtgc ccaccagcgt gaaagtaggt tatcgccgga c
//
LOCUS       PLT5SRRNAB    110 bp ss-rRNA            RNA       31-JUL-1990
DEFINITION  P.brasiliensis 5S ribosomal RNA.
ACCESSION   M35168
KEYWORDS    5S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      P.brasiliensis (strain IFAM 1448) ribosomal RNA.
  ORGANISM  Planctomyces brasiliensis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Planctomycetales.
REFERENCE   1  (bases 1 to 110)
  AUTHORS   Bomar,D., Giovannoni,S. and Stackebrandt,E.
  TITLE     A unique type of eubacterial 5S rRNA in members of the order
            Planctomycetales
  JOURNAL   J. Mol. Evol. 27, 121-125 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1  >   110     5S rRNA
BASE COUNT       28 a     26 c     29 g     27 t
ORIGIN      
        1 ttctggtgac tttacgtctg gggaaacact cgttcccatt ccgaacacga cagttaagcc
       61 ctgacggccg atgatagtac tgcaagtgtg aaagtaggtg attgccagac
//
LOCUS       MZEPPDK1F1   1272 bp ds-DNA             PLN       31-JUL-1990
DEFINITION  Z.mays pyruvate,orthophosphate dikinase (PPDK1) gene, 5' end.
ACCESSION   X14927
KEYWORDS    pyruvate,orthophosphate dikinase.
SEGMENT     1 of 2
SOURCE      Z.mays mays (strain B73) DNA, clone H2-lambda-21; and cDNA to mRNA,
            clone pPPDK2.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1272)
  AUTHORS   Glackin,C.A. and Grula,J.W.
  TITLE     Organ-specific transcripts of different size and abundance derive
            from the same pyruvate,orthophosphate dikinase gene in maize
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            C.A.Glackin, 31-MAR-1989, for release after publication.
FEATURES       from  to/span     description
    pept       1017  /  1241     pyruvate,orthophosphate dikinase precursor
                                 (PPDK1) (EC 2.7.9.1)
    sigp       1017     1241     pyruvate,orthophosphate dikinase signal
                                 peptide
    pre-msg     805  >  1272     PPDK1 mRNA and intron
    IVS        1242  >  1272     PPDK1 intron A
BASE COUNT      306 a    324 c    327 g    315 t
ORIGIN      
        1 agtactgaaa gcaatctaaa gacatggagg tggaaggcct gacgtagata gagaagatgc
       61 tcttagcttt cattgtcttt cttttgtagt catctgattt acctctctcg tttatacact
      121 ggttttttaa acactcctta acttttcaaa ttgtctcttt ctttacccta gactagataa
      181 ttttaatggt gattttgcta atgtggcgcc atgttagata gaggtaaaat gaactagtta
      241 aaagctcaga gtgataaatc aggctctcaa aaattcataa actgtttttt aaatatccaa
      301 atatttttac atggaaaata ataaaattta gtttagtatt aaaaaattca gttgaatata
      361 gttttgtctt caaaaattat gaaactgatc ttaattattt ttccttaaaa ccgtgctcta
      421 tctttgatgt ctagtttgag acgattatat aatttttttt gtgcttaact acgacgagct
      481 gaagtacgta gaaatactag tggagtcgtg ccgcgtgtgc ctgtagccac tcgtacgcta
      541 cagcccaagc gctagagccc aagaggccgg aggtggaagg cgtcgcggca ctatagccac
      601 tcgccgcaag agcccaagag gccggagctg aaggatgagg gtctgggtgt tcacgaattg
      661 cctggaggca ggaggctcgt cgtccggagc cacaggcgtg gagacgtccg ggataaggtg
      721 agcagccgct gcgatagggg cgcgtgtgaa ccccgtcgcg ccccacggat ggtataagaa
      781 taaaggcatt ccgcgtgcag gattcacccg ttcgcctctc accttttcgc tgtactcact
      841 cgccacacac accccctctc cagctccgtt ggagctccgg acagcagcag gcgcggggcg
      901 gtcacgtagt aagcagctct cggctccctc tccccttgct ccatatgatc gtgcaaccca
      961 tcgagctacg cgcgtggact gccttccctg ggtcggcgca ggaggggatc ggaaggatgg
     1021 cggcgtcggt ttccagggcc atctgcgttc agaagccggg ctcaaaatgc accagggaca
     1081 gggaagcgac ctccttcgcc cgccgatcgg tcgcagcgcc gaggcccccg cacgccaaag
     1141 ccgccggcgt catccgctcc gactccggcg cgggacgggg ccagcattgc tcgccgctga
     1201 gggccgtcgt tgacgccgcg ccgatacaga cgaccaaaaa ggtatccctt gcagctctta
     1261 gaaactgaat tc
//
LOCUS       MZEPPDK1F2    265 bp ds-DNA             PLN       31-JUL-1990
DEFINITION  Z.mays pyruvate,orthophosphate dikinase (PPDK1) gene, 3' end.
ACCESSION   X14929
KEYWORDS    pyruvate,orthophosphate dikinase.
SEGMENT     2 of 2
SOURCE      Z.mays mays (strain B73) DNA, clones H2-lambda-[21,13].
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 265)
  AUTHORS   Glackin,C.A. and Grula,J.W.
  TITLE     Organ-specific transcripts of different size and abundance derive
            from the same pyruvate,orthophosphate dikinase gene in maize
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            C.A.Glackin, 31-MAR-1989, for release after publication.
FEATURES       from  to/span     description
    pept    <     1       23     pyruvate,orthophosphate dikinase (PPDK1) (EC
                                 2.7.9.1) (AA at 3)
BASE COUNT       62 a     62 c     50 g     91 t
ORIGIN      Undetermined number of base pairs after segment 1.
        1 ctgcagctca ggtgcttgtc tgaggctgcc tcctcgttgg caaccggatt gcctgctgct
       61 ggtggatgtg gtgatcaaca gtattattac agagccatgc tatgtgaaca ttactagtag
      121 cagtgctcat aaaagctaca atcccatctc cctttttttt ccagtcatgt aaaacttcca
      181 aactgctcca tggttcaaaa ctctgttctt caatacatca tcaattatcg attatatatg
      241 tggcagaggt tttttctttt ctttc
//
LOCUS       MZEPPDK2F     322 bp ds-DNA             PLN       31-JUL-1990
DEFINITION  Z.mays pyruvate,orthophosphate dikinase (PPDK2) gene, 3' end.
ACCESSION   M32081
KEYWORDS    pyruvate,orthophosphate dikinase.
SOURCE      Z.mays mays (strain B73) DNA, clone H2-lambda-23.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 258)
  AUTHORS   Glackin,C.A. and Grula,J.W.
  TITLE     Organ-specific transcripts of different size and abundance derive
            from the same pyruvate,orthophosphate dikinase gene in maize
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 322)
  AUTHORS   Glackin,C.A. and Grula,J.W.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1],[2] kindly submitted
            by C.A.Glackin, 13-FEB-1990.
            
               Author address: C.A.Glackin
               Phytogen
               101 Waverly Dr.
               Pasadena, CA 91105
FEATURES       from  to/span     description
    pept    <     1       23     pyruvate,orthophosphate dikinase (PPDK2) (EC
                                 2.7.9.1) (AA at 3)
    mRNA    <     1      322     PPDK2 mRNA
BASE COUNT       75 a     74 c     74 g     99 t
ORIGIN      
        1 ctgcagctca ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc tgttggtgca
       61 tgctggtgat taataatact actatgacag agccatatgc tctgtgaaga gtattagtag
      121 cagcgctcat aaaagctaca gttccatcta ttttctcagc tatgtaaaac ttccaaactg
      181 ttcatgctta aaactgaggg ttttcgtggt gtgagatgtg catgtcgttg ttgaggccat
      241 tgctgcacat tccacctatt gaggccctcc tcaaattaag cctcgaacaa gctgatcatc
      301 ttttctgaga actctagact cg
//
LOCUS       MZEPPDKLR     246 bp ss-mRNA            PLN       31-JUL-1990
DEFINITION  Z.mays pyruvate,orthophosphate dikinase (PPDK1) mRNA, 3' end.
ACCESSION   X14928
KEYWORDS    pyruvate,orthophosphate dikinase.
SOURCE      Z.mays mays (strain B73) leaf and root, cDNA to mRNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 246)
  AUTHORS   Glackin,C.A. and Grula,J.W.
  TITLE     Organ-specific transcripts of different size and abundance derive
            from the same pyruvate,orthophosphate dikinase gene in maize
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            C.A.Glackin, 31-MAR-1989, for release after publication.
FEATURES       from  to/span     description
    pept    <     1       23     pyruvate,orthophosphate dikinase (PPDK1) (EC
                                 2.7.9.1) (AA at 3)
BASE COUNT       61 a     59 c     48 g     78 t
ORIGIN      
        1 ctgcagctca ggtgcttgtc tgaggctgcc tcctcgttgg caaccggatt gcctgctgct
       61 ggtggatgtg gtgatcaaca gtattattac agagccatgc tatgtgaaca ttactagtag
      121 cagtgctcat aaaagctaca atcccatctc cctttttttt ccagtcatgt aaaacttcca
      181 aactgctcca tggttcaaaa ctctgttctt caatacatca tcaattatcg attatatatg
      241 tggcag
//
LOCUS       BABAPOAI      860 bp ss-mRNA            PRI       31-JUL-1990
DEFINITION  Baboon apolipoprotein A-I (ApoA-I) mRNA, complete cds.
ACCESSION   M35634
KEYWORDS    apolipoprotein A-I.
SOURCE      Baboon liver, cDNA to mRNA.
  ORGANISM  Papio hamadryas
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; 
            Cercopithecidae; Cercopithecinae.
REFERENCE   1  (bases 1 to 860)
  AUTHORS   Hixson,J.E., Borenstein,S., Cox,L.A., Rainwater,D.L. and
            VandeBerg,J.L.
  TITLE     The baboon gene for apolipoprotein A-I: Characterization of a cDNA
            clone and identification of DNA polymorphisms for genetic studies
            of cholesterol metabolism
  JOURNAL   Gene 74, 483-490 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          3      806     apolipoprotein A-I (ApoA-I)
    mRNA    <     1      860     ApoA-I mRNA
BASE COUNT      191 a    265 c    286 g    118 t
ORIGIN      
        1 ggatgaaagc tacggtgctg accttggccg tgctcttcct gacggggagc caggctcggc
       61 atttctggca gcaagatgaa cccccacaga ccccctggga tcgagtgaag gacctggtca
      121 ctgtgtacgt ggaagcactc aaagacagcg gcaaagacta tgtgtcccag tttgaaggct
      181 ccgccttggg aaaacagcta aacctaaagc tcctggacaa ctgggacagc gtgacctcca
      241 ccgtcagcaa gctgcgcgaa cagctcggcc cggtgaccca ggagttctgg gataacctgg
      301 aaaaggagac agagggcctg aggcaggaga tgagcaagga tctggaggag gtgaaggcca
      361 aggtgcagcc ctacctggac gacttccaga agaagtggca ggaggagatg gagctctacc
      421 gccagaaggt ggagccgctg cgcgcggagc tccacgaggg cacgcgccag aagctgcacg
      481 agctgcacga gaaactgagc ccgctggggg aggaggtgcg cgaccgcgcg cgcgcccacg
      541 tggacgcgct gcgcacgcat ctggccccct acagcgatga gctgcgccag cgcctggccg
      601 cgcgccttga ggcgctcaag gagaacggcg gcgcccgcct ggcggagtac cacgccaagg
      661 ccagcgagca tctgagcacg ctcagcgaga aggccaagcc cgcgctcgag gacctccgcc
      721 agggcctgct gcccgtgctg gagagcttca aggtcagctt cctgagcgct ctcgaggagt
      781 acactaagaa gctcagcacc cagtgaggcg cccgccgccg ccccccttcc cggtgctcag
      841 aataaacgtt tccgaagtgg
//
LOCUS       DROARSDA      721 bp ds-DNA             INV       31-JUL-1990
DEFINITION  D.melanogaster autonomically replicating sequence (ARS).
ACCESSION   M35633
KEYWORDS    autonomically replicating sequence.
SOURCE      D.melanogaster (strain Dm665) DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 721)
  AUTHORS   Gragerov,A.G., Danilevskaya,O.N., Didichenko,S.A. and Kaverina,E.N.
  TITLE     Structure of ARS elements from Drosophila melanogaster telomeres
  JOURNAL   Genetika 24, 407-414 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site        574      584     core sequence
BASE COUNT      237 a    168 c    110 g    206 t
ORIGIN      
        1 cctcgctata tcaacgtctt cgtgccgcga tccaactgga cggagccatg ttcgggacca
       61 ccttcccgca catgttcttg acatgttagc tgccgaactt tagaccccag ccgctcctct
      121 ctgctatgca ggagtatatg taagccgagt atacatgggt acatactttc caatacgcca
      181 cctcaaaagt tacattgctg tacttttatt acttgcatgt ccttctgggg aataattttg
      241 atccagcatt cgctgcgagg tgtgggaaac caggtgggaa acccagtaaa acccaaccag
      301 atcaattgaa catattattt acataaattc agtttgcaaa caattttaac taaaatcaat
      361 attattcaac tttgtagaac ctcaaaatta tcaataaact taattaaatt taatttaatg
      421 atactttata tcacgacccc cagttcagtc tgtttcttca tttcaatcta agcatatatt
      481 gttttaattc attctgagag tccaagccca cttctcaatg taatgaatct tgcagatggt
      541 aacggcttag ttagtatatc agcaagttgt aaataaatat aaataaatat aacaatcaaa
      601 taacgcactt acctccttga ccacagccaa tccaacgcaa ccgaaaacag gagacgggcc
      661 ccgcaaacgc aaattaaaat tgccaacatt accgattata actacaaaaa actaacaagc
      721 t
//
LOCUS       DROARSDB      275 bp ds-DNA             INV       31-JUL-1990
DEFINITION  D.melanogaster autonomously replicating sequence (ARS).
ACCESSION   M36876
KEYWORDS    autonomously replicating sequence.
SOURCE      D.melanogaster (strain Dm665) deletion mutant pSL41 DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 275)
  AUTHORS   Gragerov,A.G., Danilevskaya,O.N., Didichenko,S.A. and Kaverina,E.N.
  TITLE     Structure of ARS elements from Drosophila melanogaster telomeres
  JOURNAL   Genetika 24, 407-414 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site        146      156     core sequence
BASE COUNT      118 a     56 c     34 g     67 t
ORIGIN      
        1 tttttgcata ataatttcaa ttgtaaaatg taaacaaacc acgcaatatg ataatgttac
       61 cagtccagat tactgtcaaa aacctagtac gaaaaaaata ctaactagga ccaaactatt
      121 atttaagctt tataatattt ataaaataat ataaaagctg gaattgccgc ttgaccacag
      181 ccaatccaac gcaaccgaaa acaggagacg ggccccgcaa acgcaaatta aaattgccaa
      241 cattaccgat tataactaca aaaaactaac aagct
//
LOCUS       DROARSDC      290 bp ds-DNA             INV       31-JUL-1990
DEFINITION  D.melanogaster autonomously replicating sequence (ARS).
ACCESSION   M36877
KEYWORDS    ARS element.
SOURCE      D.melanogaster (strain Dm665) deletion mutant pSL52 DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 290)
  AUTHORS   Gragerov,A.G., Danilevskaya,O.N., Didichenko,S.A. and Kaverina,E.N.
  TITLE     Structure of ARS elements from Drosophila melanogaster telomeres
  JOURNAL   Genetika 24, 407-414 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site         81       90     core sequence
BASE COUNT      103 a     58 c     52 g     77 t
ORIGIN      
        1 agattactgt caaaaaccta gtacgaaaaa aatactaact aggaccaaac tattatttaa
       61 gctttataat atttataaaa taatataaaa gctggaattg ccgctttgat gcgtttgagg
      121 tctttgtaaa gggcatcgag tcctcggacc attaagggtt gctgccctgt cgaaactctg
      181 acgtttgatt tgtttttggc ccaacgcaac cgaaaacagg agacgggccc cgcaaacgca
      241 aattaaaatt gccaacatta ccgattataa ctacaaaaaa ctaacaagct
//
LOCUS       GIBINVOL     1569 bp ds-DNA             PRI       31-JUL-1990
DEFINITION  H.lar involucrin gene, complete cds.
ACCESSION   M35447
KEYWORDS    involucrin.
SOURCE      H.lar keratinocyte DNA, clones 1 and 2.
  ORGANISM  Hylobates lar
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hylobatidae.
REFERENCE   1  (bases 1 to 1569)
  AUTHORS   Djian,P. and Green,H.
  TITLE     The involucrin gene of the gibbon: The middle region shared by the
            hominoids
  JOURNAL   Mol. Biol. Evol. 7, 220-227 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1     1569     involucrin
BASE COUNT      472 a    389 c    550 g    158 t
ORIGIN      
        1 atgtcccagc aacacacact gccagtgacc ctctcccctg ccctcagtca ggagctcctc
       61 aagactgttc ctcctccagt caatacccag caggagcaaa tgaaacagcc aactccactg
      121 cctcccccat gccagaaggt gcttggtgag ctcccagtgg aggtcccatc aaagcaagag
      181 gaaaagcaca tgactactgt aaaggggctg cctgagcaag aatgtgagca acagcagcag
      241 gagccacagg agcaggagct gcagcaacag cactgggaac agcatgagga acatcagaaa
      301 gcagaaaacc cagagcagca gcttaagcag gagaaagcac aaagggatca gcagctaaac
      361 gaacatctgg aagaagagaa gaagctctta gaccagcaac tgaatcaaga gctaatcaag
      421 agagatgagc aactgggaat aaagaaagag caactcttgg agctcacaga gcagcaggag
      481 gggcagttgg agcacctaga acagcaggag gggcagctgg agctcccaga gcagcaggag
      541 gggcagctgg agcacctgga gcagcaggag gggcagctga agcacctgga tcagcagggg
      601 aagcagccag agctcccaga gcagcaggtg gcacagctga agcacctgga gcagcaggag
      661 gggcagctga agcatctgga gcaccagaag ggggagctgc aggtcccaga ggagcaggtg
      721 gggcagctga agtacctgga acagcaggag gggcagctga agcacctgga tcagcaggag
      781 aagcagccag agctcccaga gcagcaggtg gggcagctga agcacctgga gcagcaggag
      841 gggcagctgg agcacatgga gcaccaggaa gggcagctgg ggctcccaga gcagcaggtg
      901 gggcagctga agcagctgga ggagcaggag ggacagccaa agcacctgga ggaggaggag
      961 gggcagctga agcacctggt gcagcaggag gggcagctgg agcatctggt gcagcaggag
     1021 aggcagctgg agcagcagga ggggaaggtg cagcacctgg agcagcaggt ggagcagctg
     1081 aagcacctag aggagcagga ggggcagctg aagcatctgg agcagcagca ggggcagctg
     1141 gaggtctcag agcagcaggt ggggcagcca aagcacctgg agcaggaggg gaagcagctg
     1201 gagctcccag agcagcaaga gggccagctg aagcacctgg agaagcagga ggcacagctg
     1261 gagctcccag agcagcaggt aggacagcca aagcacccgg agcaacagga gaagcagcta
     1321 gagcacccag agcagcagga gggacaacta aaacatctgg agcagcagga ggggcagctg
     1381 aaagacctgg agcagcagaa ggggcagctg gagcagcagc aggggcagct ggagcagcct
     1441 gtgtttgccc cagctccagg ccaggtccaa gacattcaac cagtcctgcc cacaaaggga
     1501 gaagcattgc ttcctgtaga gcagcagcag cagaagcagg aggtgcagtg gccacccaaa
     1561 cataaataa
//
LOCUS       HUMHEMBP      822 bp ss-mRNA            PRI       31-JUL-1990
DEFINITION  Human eosinophil major basic protein mRNA, complete cds.
ACCESSION   M35670
KEYWORDS    major basic protein.
SOURCE      Human eosinophil, cDNA to mRNA, clone MBP-lambda-9.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 822)
  AUTHORS   McGrogan,M., Simonsen,C., Scott,R., Giffith,J., Ellis,N.,
            Kennedy,J., Campanelli,D., Nathan,C. and Gabay,J.
  TITLE     Isolation of a complementary DNA clone encoding a precursor to
            human eosinophil major basic protein
  JOURNAL   J. Exp. Med. 168, 2295-2308 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         49      717     eosinophil major basic protein precursor
    sigp         49       94     eosinophil major basic protein signal peptide
    matp         95      714     eosinophil major basic protein
    mRNA    <     1      822     eosinophil major basic protein mRNA
BASE COUNT      173 a    219 c    236 g    194 t
ORIGIN      
        1 gggaagatct aaagacccag gaaggtctct gggtgggata aagccaagat gaaactcccc
       61 ttacttctgg ctcttctatt tggggcagtt tctgctcttc atctaaggtc tgagacttcc
      121 acctttgaga cccctttggg tgctaagacg ctgcctgagg atgaggagac accagagcag
      181 gagatggagg agaccccttg cagggagctg gaggaagagg aggagtgggg ctctggaagt
      241 gaagatgcct ccaagaaaga tggggctgtt gagtctatct cagtgccaga tatggtggac
      301 aaaaacctta cgtgtcctga ggaagaggac acagtaaaag tggtgggcat ccctgggtgc
      361 cagacctgcc gctacctcct ggtgagaagt cttcagacgt ttagtcaagc ttggtttact
      421 tgccggaggt gctacagggg caacctggtt tccatccaca acttcaatat taattatcga
      481 atccagtgtt ctgtcagcgc gctcaaccag ggtcaagtct ggattggagg caggatcaca
      541 ggctcgggtc gctgcagacg ctttcagtgg gttgacggca gccgctggaa ctttgcgtac
      601 tgggctgctc accagccctg gtcccgcggt ggtcactgcg tggccctgtg tacccgagga
      661 ggctactggc gtcgagccca ctgcctcaga agacttcctt tcatctgttc ctactgagct
      721 ggtcccagcc gacagttcag agctgccctc tcctgggcat ggcctcccct cctctgcttg
      781 ccatccctcc ctccacctcc ctgcaataaa atgggtttta ct
//
LOCUS       HUMIGHVX      529 bp ds-DNA             PRI       31-JUL-1990
DEFINITION  Human Ig germline H-chain gene V-III region subgroup, exons 1 and
            2.
ACCESSION   M35415
KEYWORDS    germline; immunoglobulin heavy chain; variable region;
            variable region subgroup VH-III.
SOURCE      Human fetal liver DNA, clone lambda-VH-26.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 529)
  AUTHORS   Matthyssens,G. and Rabbitts,T.H.
  TITLE     Structure and arrangement of human heavy chain variable region
            genes
  JOURNAL   immune system 1, 132-138 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         29       74     Ig heavy chain V-region precursor, exon 1
                                 /hgml_locus_uid="LK0080M" /nomgen="IGHV"
                                 /map="14q32.33"
                179  /   483     Ig H-chain V-region precursor, exon 2
    sigp         29       74     Ig H-chain signal peptide
                179      189     Ig H-chain signal peptide
    matp        190  /   483     Ig H-chain
    IVS          75      178     Ig H-chain V-region intron A
    iDNA        484  >   529     Ig iDNA
BASE COUNT      131 a    106 c    165 g    127 t
ORIGIN      
        1 gatcaggact gaacagagag aactcaccat ggagtttggg ctgagctggc tttttcttgt
       61 ggctatttta aaaggtaatt catggagaaa tagaaaaatt gagtgtggaa tggataagag
      121 tgagagaaac agtggatacg tgtggcagtt tctgaccagg gtttcttttt gtttgcaggt
      181 gtccagtgtg aggtgcagct gttggagtct gggggaggct tggtacagcc tggggggtcc
      241 ctgagactct cctgtgcagc ctctggattc acctttagca gctatgccat gagctgggtc
      301 cgccaggctc cagggaaggg gctggagtgg gtctcagcta ttagtggtag tggtggtagc
      361 acatactacg gagactccgt gaagggccgg ttcaccatct caagagacaa ttccaagaac
      421 acgctgtatc tgcaaatgaa cagcctgaga gccgaggaca cggccgtata ttactgtgcg
      481 aaagacacag tgagggaagt cattgtgagc ccagacacaa acctccctg
//
LOCUS       HUMPRAPP      256 bp ss-mRNA            PRI       31-JUL-1990
DEFINITION  Human amyloid beta precursor protein (ABPP) mRNA, 5' end.
ACCESSION   M35675
KEYWORDS    amyloid beta.
SOURCE      Human fetal muscle, cDNA to mRNA, clone lambda-HAP2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 256)
  AUTHORS   Mita,S., Sadlock,J., Herbert,J. and Schon,E.A.
  TITLE     A cDNA specifying the human amyloid beta precursor protein (ABPP)
            encodes a 95-kDa polypeptide
  JOURNAL   Nucleic Acids Res. 16, 9351-9351 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          8  >   256     amyloid beta precursor protein (ABPP)
                                 /hgml_locus_uid="LG0136J" /nomgen="APP"
                                 /map="21q21.2"
BASE COUNT       42 a     77 c     93 g     44 t
ORIGIN      
        1 ggaggatatg aagtccatct tcacttcaga gatctcctcc gtcttgatat ttgtcaaccc
       61 agaacctggt cgagtggtca gtttcctcgg cagcggtagg cgagagcacg cggaggagcg
      121 tgcgcggggg ccccgggaga cggcggcggt ggcggcgcgg gcagagcaag gacgcggcgg
      181 atcccactcg cacagcagcg cactcggtgc cccgcgcagg gtcgcgatgc tgcccggttt
      241 ggcactgctc ctgctg
//
LOCUS       MUSRP32A      901 bp ds-DNA             ROD       31-JUL-1990
DEFINITION  Mouse ribosomal protein L32' (L32') gene, complete cds.
ACCESSION   M35397
KEYWORDS    ribosomal protein L32'.
SOURCE      Mouse cell line L1210 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 901)
  AUTHORS   Jacks,C.M., Biltz,R.E. and Hackett,P.B.
  TITLE     Analysis of potential expression of highly related members of the
            ribosomal protein L32 gene family
  JOURNAL   Nucleic Acids Res. 16, 10751-10764 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        255      662     ribosomal protein L32'
    mRNA        200  >   695     L32' mRNA (alt.)
    mRNA        203  >   695     L32' mRNA (alt.)
    mRNA        204  >   695     L32' mRNA (alt.)
    mRNA        207  >   695     L32' mRNA (alt.)
    mRNA        246  >   695     L32' mRNA (alt.)
    mRNA        247  >   695     L32' mRNA (alt.)
BASE COUNT      290 a    203 c    210 g    198 t
ORIGIN      
        1 gtccaacaag tatttccttg gccaggctta gcgttcttcg agtttatctt ctagggggaa
       61 aaaatcaact agcaacatat aaaaaggtct tatttgtcca ggaaaaaaaa aggaagcaac
      121 attatagatt gactcttgca tagtatgtat tttggaatcc gagtaatgca acagtataat
      181 gtggttaaga acaaagactc tggctccctc ctcggcactg cctacgaggt ggctaccttc
      241 tgttatgcag catcatggct gccctccggc ctctggtgaa gcccaagatc gtcaaaaaga
      301 ggaccaaaaa gttcatcaga caccagtcag accgatatgt gaaaattaag tggaactgga
      361 ggaaacccag aggcatcgac aacagggtgc ggagaaggtt caagggccag atcctgatgc
      421 ccaacatcgg ttacaggagc aacaagaaaa ccaagcacac gctgtctagt ggcttccgca
      481 agttcctggt ccacaatatc aaggagccgg aggtgctgtt gatgtgcaac aaatcttacc
      541 gtgctgagat tgctcacaac gtgtcctcta agaaccgaaa agccattgta gaaagagcag
      601 cacagctggc catcagagtc accaatccca acaccaggct acacagcgaa gaaaatgaat
      661 agatggcttg tgtacatgtt ttgtgtttaa ataaaatcac aaaaactgga aaaaaaaaag
      721 aataaagact ctgaggctgg aagagggtgt caaatgcccc tggaactgga gttacagaca
      781 attgtgagcc accccccccc catgggagct ggtgatggaa cttgggtccc ctggaagaga
      841 ggcaagcact attaattatg gagccatttc ttcagccaaa catcccatat tttaatccga
      901 a
//
LOCUS       SPIB559AC     597 bp ds-DNA             ORG       31-JUL-1990
DEFINITION  S.oleracea chloroplast apocytochrome b-559 gene, complete cds.
ACCESSION   M35673
KEYWORDS    apocytochrome b-559.
SOURCE      S.oleracea chloroplast DNA.
  ORGANISM  Chloroplast Spinacia oleracea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae; 
            Spinacia oleracea.
REFERENCE   1  (bases 1 to 597)
  AUTHORS   Herrmann,R.G., Alt,J., Schiller,B., Widger,W.R. and Cramer,W.A.
  TITLE     Nucleotide sequence of the gene for apocytochrome b-559 on the
            spinach plastid chromosome: Implications for the structure of the
            membrane protein
  JOURNAL   FEBS Lett. 176, 239-244 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        152      403     apocytochrome b-559
    pept        412      531     ORF 39
BASE COUNT      173 a    120 c    117 g    187 t
ORIGIN      
        1 attcggtata ctctaaagac acctttggta ctatattggc aatcctacaa gtattttacg
       61 tttttacgtg gagtaaaaag taaatagaaa ttgactgatt tcatctttta cggatttgcc
      121 tttgactgta caagaatatg tggagctcag tatgtctgga agcacaggag aacgttcttt
      181 tgctgatatt attaccagta ttcgatactg ggttattcat agcattacta taccttccct
      241 attcattgcg ggttggttat tcgtcagcac aggtttagct tacgatgtgt ttggaagccc
      301 tcgtccaaac gaatatttca cagagagccg acaaggaatt ccattaataa ctggccgttt
      361 tgactctttg gaacaacttg atgaatttag tagatccttt taggaggccc aatgaccata
      421 gatcgaacct atccaatttt tacagtgcgc tggctggcta ttcacggact ggctgtacct
      481 accgtttctt ttttggggtc aatatccgca atgcagttca tccaacgata aacaaaaccg
      541 aatccgaatt atagagctac gacacaatca aacccgaacg aacaaaatgt tgaattg
//
LOCUS       SYNQBRNA      672 bp ss-RNA             SYN       31-JUL-1990
DEFINITION  Synthetic lambda-N protein RNA.
ACCESSION   M35652
KEYWORDS    lambda-N protein.
SOURCE      Synthetic RNA, clone pDM1001.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 672)
  AUTHORS   Mills,D.R.
  TITLE     Engineered recombinant messenger RNA can be replicated and
            expressed inside bacterial cells by an RNA bacteriophage replicase
  JOURNAL   J. Mol. Biol. 200, 489-500 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    RNA         157      516     lambda N-protein complementary RNA
BASE COUNT      122 a    190 c    183 g    177 t
ORIGIN      
        1 gggaggagag agggcaaagc agatccccct ctcactcgta agagtaattg tgataattta
       61 ttcacaatta ggcgccatat tggcgccctc ccccaggtta cgcctcgtgt agagacgcaa
      121 ccttcgtgcc ctggaagacc ccgtagggcc tcgaggccaa tatctaagta actagataag
      181 aggaatcgat tttcccttaa ttttctggcg tccactgcat gttatgccgc gttcgccagg
      241 cttgctgtac catgtgcgct gattcttgcg ctcaatacgt tgcaggttgc tttcaatctg
      301 tttgtggtat tcagccagca ctgtaaggtc tatcggattt agtgcgcttt ctactcgtga
      361 tttcggtttg cgattcagcg agagaatagg gcggttaact ggttttgcgc ttaccccaac
      421 caacagggga tttgctgctt tccattgagc ctgtttctct gcgcgacgtt cgcggcggcg
      481 tgtttgtgca tccatctgga ttctcctgtc agttagcctc gaggcgcttt cgcgctctcc
      541 caggtgacgc ctcgtgaaga ggcgcgacct tcgtgcgttt cggcgacgca cgagaaccgc
      601 cacgctgctt cgcagcgtgg ccccttcgcg cagcccgctg cgcgaggtga ccccccgaag
      661 ggggcttccc ca
//
LOCUS       SYNT4ENDV     424 bp ds-DNA             SYN       31-JUL-1990
DEFINITION  Synthetic T4 endonuclease V (T4endV) gene, complete cds.
ACCESSION   M35392
KEYWORDS    T4 endonuclease V.
SOURCE      Synthetic gene, expressed in E.coli.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 424)
  AUTHORS   Inaoka,T., Miura,K. and Ohtsuka,E.
  TITLE     Chemical synthesis of the T4 endonuclease V gene and its expression
            in Escherichia coli
  JOURNAL   Nucleic Acids Symp Ser 17, 105-108 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          5      421     T4 endonuclease V (T4endV)
BASE COUNT      120 a     90 c     86 g    128 t
ORIGIN      
        1 cgatatgacg cgtatcaacc ttactttagt atccgagtta gctgaccaac acttaatggc
       61 tgaataccgt gaattgccgc gtgtttttgg tgcagttcgt aagcacgtag caaacggtaa
      121 acgtgttcgt gacttcaaaa tcagtcctac ttttatcctt ggcgcaggtc atgttacatt
      181 cttctacgat aagctcgagt tcttacgcaa gcgtcaaatt gagcttatag ctgaatgttt
      241 gaaacgtggc ttcaatatca aggatactac agtccaggac atcagtgaca ttcctcaaga
      301 attccgtggt gattatattc cccatgaagc ttctattgct atatcacaag ctcgtttaga
      361 tgaaaaaatt gcacaacgtc ctacttggta caaatactac ggtaaggcga tttatgcatg
      421 atag
//
LOCUS       TOMACS       1819 bp ss-mRNA            PLN       31-JUL-1990
DEFINITION  Tomato 1-aminocyclopropane-1-carboxylate synthase mRNA, complete
            cds.
ACCESSION   M34289
KEYWORDS    1-aminocyclopropane-1-carboxylate synthase.
SOURCE      Tomato (strain Mill., cv. Orlando), fruit, cDNA to mRNA.
  ORGANISM  Lycopersicon esculentum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 1819)
  AUTHORS   Van Der Straeten,D., Van Wiemeersch,L., Goodman,H.M. and
            Van Montagu,M.
  TITLE     Cloning and sequence of two different cDNAs encoding
            1-aminocyclopropane-1-carboxylate synthase in tomato
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4859-4863 (1990)
  STANDARD  unannotated staff_entry
FEATURES       from  to/span     description
    pept        154     1611     1-aminocyclopropane-1-carboxylate synthase
    mRNA    <     1     1819     1-aminocyclopropane-1-carboxylate synthase
                                 mRNA
BASE COUNT      610 a    285 c    365 g    559 t
ORIGIN      
        1 ccaaacacat aatactttta atacaattag ttatttatta gaagtattta aagtaaagca
       61 cttgtgagtt gtgtacattt tattaatctt catcttctta attctcttca gtttttaatt
      121 tcttcacttc taaactcatt tagtaaaaaa aaaatgggat ttgagattgc aaagaccaac
      181 tcaatcttat caaaattggc tactaatgaa gagcatggcg aaaactcgcc atattttgat
      241 gggtggaaag catacgatag tgatcctttc caccctctaa aaaaccccaa cggagttatc
      301 caaatgggtc ttgctgaaaa tcagctttgt ttagacttga tagaagattg gattaagaga
      361 aacccaaaag gttcaatttg ttctgaagga atcaaatcat tcaaggccat tgccaacttt
      421 caagattatc atggcttgcc tgaattcaga aaagcgattg cgaaatttat ggagaaaaca
      481 agaggaggaa gagttagatt tgatccagaa agagttgtta tggctggtgg tgccactgga
      541 gctaatgaga caattatatt ttgtttggct gatcctggcg atgcattttt agtaccttca
      601 ccatactacc cagcatttaa cagagattta agatggagaa ctggagtaca acttattcca
      661 attcactgtg agagctccaa taatttcaaa attacttcaa aagcagtaaa agaagcatat
      721 gaaaatgcac aaaaatcaaa catcaaagta aaaggtttga ttttgaccaa tccatcaaat
      781 ccattgggca ccactttgga caaagacaca ctgaaaagtg tcttgagttt caccaaccaa
      841 cacaacatcc accttgtttg tgacgaaatc tacgcagcca ctgtctttga cacgcctcaa
      901 ttcgtcagta tagctgaaat cctcgatgaa caggaaatga cttactgcaa caaagattta
      961 gttcacatcg tctacagtct ttcaaaagac atggggttac caggatttag agtcggaatc
     1021 atatattctt ttaacgacga tgtcgttaat tgtgctagaa aaatgtcgag tttcggttta
     1081 gtatctacac aaacgcaata ttttttagcg gcaatgccat cggacgaaaa attcgtcgat
     1141 aattttctaa gagaaagcgc gatgaggtta ggtaaaaggc acaaacattt tactaatgga
     1201 cttgaagtag tgggaattaa atgcttgaaa aataatgcgg ggcttttttg ttggatggat
     1261 ttgcgtccac ttttaaggga atcgactttc gatagcgaaa tgtcgttatg gagagttatt
     1321 ataaacgatg ttaagcttaa cgtctcgctt ggatcttcgt ttgaatgtca agagccaggg
     1381 tggttccgag tttgttttgc aaatatggat gatggaacgg ttgatattgc gctcgcgagg
     1441 attcggaggt tcgtaggtgt tgagaaaagt ggagataaat cgagttcgat ggaaaagaag
     1501 caacaatgga agaagaataa tttgagactt agtttttcga aaagaatgta tgatgaaagt
     1561 gttttgtcac cactttcgtc acctattcct ccctcaccat tagttcgtta agacttaatt
     1621 aaaagggaag aatttaattt atgttttttt atattttgaa aaaaatttgt aagaataaga
     1681 ttataatagg aaaagaaaat aagtatgtag gatgaggagt attttcagaa atagttgtta
     1741 gcgtatgtat tgacaactgg tctatgtact tagacatcat aatttgtctt agctaattaa
     1801 tgaatgcaaa agtgaagtt
//
LOCUS       TTHRGG        873 bp ds-DNA             BCT       31-JUL-1990
DEFINITION  T.thermophilus 23S ribosomal RNA gene.
ACCESSION   M35674
KEYWORDS    23S ribosomal RNA.
SOURCE      T.thermophilus (strain HB8; ATCC 27634) DNA.
  ORGANISM  Thermus thermophilus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 873)
  AUTHORS   Hartmann,R.K., Vogel,D.W., Kroeger,B., Ulbrich,N. and Erdmann,V.A.
  TITLE     Studies on rDNA from the extreme thermophilic eubacterium Thermus
            thermophilus HB8
  JOURNAL   FEBS Lett. 218, 215-221 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      873     23S RNA gene
BASE COUNT      167 a    248 c    321 g    137 t
ORIGIN      
        1 ctggccgtga agatgcggcc tacccgtggc aggacgaaaa gaccccgtgg agctttactg
       61 cagcctggtg ttggctcttg gtcgcgcctg cgtaggatag gtgggagcct gtgaaccccc
      121 gcctccgggt gggggggagg cgccggtgaa ataccaccct ggcgcggctg ggggcctaac
      181 cctcggatgg ggggacagcg cttggcgggc agtttgactg gggcggtcgc ctcctaaaag
      241 gtaacggagg cgcccaaagg tcccctcagg cgggacggaa atccgccgga gagcgcaagg
      301 gtagaagggg gcctgactgc gaggcctgca agccgagcag gggcgaaagc cgggcctagt
      361 gaaccggtgg tcccgtgtgg aagggccatc gatcaacgga taaaagttac cccggggata
      421 acaggctgat ctcccccgag cgtccacagc ggcggggagg tttggcacct cgatgtcggc
      481 tcgtcgcatc ctggggctga agaaggtccc aagggttggg ctgttcgccc attaaagcgg
      541 cacgcgagct gggttcagaa cgtcgtgaga cagttcggtc tctatccgcc acgggcgcag
      601 gaggcttgag gggggctctt cctagtacga gaggaccgga agggacgcac ctctggtttc
      661 ccagctgtcc ctccaggggc ataagctggg tagccatgtg cggaagggat aaccgctgaa
      721 agcatctaag cgggaagccc gccccaagat gaggcctccc acggcgtcaa gccggtaagg
      781 acccgggaag accacccggt ggatgggccg ggggtgtaag cgccgcgagg cgttgagccg
      841 accggtccca atcgtccgag gtcttgaccc ctc
//
LOCUS       WHTCPRS2A     780 bp ds-DNA             ORG       31-JUL-1990
DEFINITION  T.aestivum chloroplast ribosomal protein S2 gene, complete cds.
ACCESSION   M35396
KEYWORDS    ribosomal protein S2.
SOURCE      T.aestivum (cv. Mardler) chloroplast DNA.
  ORGANISM  Chloroplast Triticum aestivum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Triticum aestivum.
REFERENCE   1  (bases 1 to 780)
  AUTHORS   Hoeglund,A.-S. and Gray,J.C.
  TITLE     Nucleotide sequence of the gene for ribosomal protein S2 in wheat
            chloroplast DNA
  JOURNAL   Nucleic Acids Res. 15, 10590-10590 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         46      756     ribosomal protein S2
BASE COUNT      266 a    118 c    164 g    232 t
ORIGIN      
        1 ccgtaatgga atggtaggat taaaaaaaaa ggtaatgtgg aaaaaatgac aagaagatat
       61 tggaacatta atttgaaaga gatgatagaa gcaggagttc attttggtca tggtattaag
      121 aaatggaatc ctaaaatggc cccttacatt tcggcaaagc gtaaaggtac tcatattata
      181 aatctcgcta gaacggctcg ttttttatca gaagcttgtg atttagtttt tgatgcagca
      241 agtcagggaa caagtttctt aattgttggt accaaaaaaa gagcaacaga tttagtagca
      301 tcagctgcaa taagggctcg ttgtcattat gttaataaaa agtggttcag tggtatgtta
      361 acgaattggt cgattacgaa aactagactt tctcaattta gagacttaag agcagaagaa
      421 aaaatgggaa aattccacca tctcccaaaa agagatgtgg caatcttgaa gagaaaatta
      481 tctaccttgc aaaggtatct cggcgggatc aaatatatga cgagattgcc agacattgtg
      541 atcgtccttg atcagcaaaa agagtatata gctcttcggg aatgtgccat tttggggatt
      601 cctactattt ctttagtcga tacaaattgt gacccggatc tcgcgaatat atcgattcca
      661 gccaacgatg acactatgac ttcaattcga ttgattctta acaaattagt attttcaatt
      721 tgtgagggcc gttctctcta tataagaaat cgttgattaa gaatatatag tgaattcttg
//
LOCUS       YSCVDACA      852 bp ss-mRNA            PLN       31-JUL-1990
DEFINITION  Yeast voltage-dependent anion-selective channel (VDAC) protein
            mRNA, complete cds.
ACCESSION   M34907
KEYWORDS    voltage-dependent anion-selective channel protein.
SOURCE      Yeast, cDNA to mRNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 852)
  AUTHORS   Forte,M., Guy,H.R. and Mannella,C.A.
  TITLE     Molecular genetics of the VDAC ion channel: Structural model and
            sequence analysis
  JOURNAL   J Bioenerg Biomembr 19, 341-350 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1      852     voltage-dependent anion-selective channel
                                 (VDAC) protein
BASE COUNT      224 a    213 c    172 g    243 t
ORIGIN      
        1 atgtctcctc cagtttacag cgatatctcc agaaatatca atgacctatt gaacaaggat
       61 ttctatcatg ctaccccagc tgcctttgat gtgcaaacaa caaccgccaa tggcattaag
      121 ttctcattga aggctaaaca gcctgtcaaa gacggtccac tgtctactaa cgtggaagca
      181 aagttgaatg acaagcaaac cggcttgggt ctaactcaag gctggtctaa cacaaacaac
      241 ttgcaaacca aattagagtt tgccaacttg acccctggtc taaagaacga attgatcact
      301 tctttgactc caggcgtcgc caagtccgcc gtcttaaaca ctacgttcac agaacctttc
      361 ttcaccgcaa gaggtgcctt tgacttgtgt ttgaagtcac caacatttgt tggtgactta
      421 actatggccc acgaaggtat tgttggtggc gcagagtttg gttacgatat cagcgccggt
      481 tccatttctc gttatgccat ggctttaagt tatttcgcca aagactactc cttgggcgct
      541 acattgaaca acgagcaaat aactaccgtt gacttcttcc aaaacgtcaa cgccttttta
      601 caggtcgcta aggctacaat gatgaactgc aaactaccta actccaatgt caacatcgaa
      661 ttcgccacta gatatttgcc tgatgcatct tcccaagtta aggctaaggt gtccgattcc
      721 ggtattgtca ctttggctta caagcaattg ttaagacctg gcgtcactct gggtgtcggt
      781 tcctctttcg atgctttgaa gttgtctgaa cctgttcaca agctaggttg gtctttgtcc
      841 ttcgacgctt ga
//
LOCUS       ATHHISH3A     411 bp ds-DNA             PLN       31-JUL-1990
DEFINITION  A.thaliana histone H3 gene, complete cds.
ACCESSION   M35387
KEYWORDS    histone H3.
SOURCE      A.thaliana (strain Strasbourg) adult leaf DNA.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 411)
  AUTHORS   Chaubet,N., Chaboute,M.-E., Philipps,G. and Gigot,C.
  TITLE     Histone genes in higher plants: Organization and expression
  JOURNAL   Dev. Genet. 8, 461-473 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1      411     histone H3
BASE COUNT      119 a    111 c    109 g     72 t
ORIGIN      
        1 atggctcgta ccaagcagac ggctaggaaa tcaaccggag gaaaggctcc aaggaagcag
       61 ctggcgacaa aagcggcgag gaaatctgct ccggccaccg gaggagtgaa gaagccacac
      121 agattcagac ccggaactgt tgctttaaga gaaatcagga agtaccagaa gagcactgag
      181 cttcttatcc gtaaactccc tttccagagg cttgtcagag agatcgctca ggacttcaaa
      241 accgatcttc gtttccagag cagtgccgtc gccgctcttc aggaagctgc tgaagcttac
      301 ctcgtcggac tcttcgaaga caccaatctc tgcgccattc acgccaagag ggtcacgatc
      361 atgcccaagg acatccagct cgcgagaaga atcagaggag aacgagctta g
//
LOCUS       BLYCHORDA     391 bp ss-mRNA            PLN       31-JUL-1990
DEFINITION  Barley C-hordein storage protein, 3' end.
ACCESSION   M35610
KEYWORDS    C-hordein; storage protein.
SOURCE      Barley 20-day old endosperm, cDNA to mRNA, clone pc hor1-3.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 391)
  AUTHORS   Rasmussen,S.K. and Brandt,A.
  TITLE     Nucleotide sequences of cDNA clones for C-hordein polypeptides
  JOURNAL   Carlsberg Res. Commun. 51, 371-379 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      219     C-hordein storage protein
    mRNA    <     1      391     C-hordein storage protein mRNA
    signal      310      315     polyA signal (put.)
    signal      365      370     polyA signal (put.)
BASE COUNT      134 a    120 c     57 g     80 t
ORIGIN      
        1 tttccccagc cccaggaacc atttccccag caaccacaac aaccattccc actgcaaccg
       61 caacaaccat tcccccaaca accacaacaa ccatttcccc agccccaaca accattccgc
      121 caacaagcag aactaataat tccccagcaa cctcaacaac cattccctct gcagccacac
      181 caaccatata cacaacaaac catctggagt atggtctagt catcaggggc ctatgaagcg
      241 acaagttgta atactaaatg ggtggatcat catcctttag tcaacggagt gtttaatgta
      301 atgatgataa ataaagtgat gtggaccatc atgtgtaacc ccggcctata ctagttcaaa
      361 tgagaataaa agaccaaaga aagttcttgt c
//
LOCUS       BLYCHORDB     255 bp ss-mRNA            PLN       31-JUL-1990
DEFINITION  Barley C-hordein storage protein, 3' end.
ACCESSION   M35611
KEYWORDS    C-hordein; storage protein.
SOURCE      Barley 20-day old endosperm, cDNA to mRNA, clone pc-919.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 255)
  AUTHORS   Rasmussen,S.K. and Brandt,A.
  TITLE     Nucleotide sequences of cDNA clones for C-hordein polypeptides
  JOURNAL   Carlsberg Res. Commun. 51, 371-379 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      207     C-hordein storage protein
BASE COUNT       97 a     96 c     24 g     38 t
ORIGIN      
        1 tacccacaac aaccacaacc atttccacaa cagcccatcc cacaacaacc acaaccatac
       61 ccacaacaac cacaaccatt ttcacaacag cccatcccac aacaaccaca accataccca
      121 caacaaccac aaccatttcc acaacaaccc atccctctgc agccacacca accatataca
      181 caacaaacca tctggagtat ggtctagtca tcaggggcct atgaagcgac aagttgtaat
      241 actaaatggg tggat
//
LOCUS       BOVFGFAA      668 bp ss-mRNA            MAM       31-JUL-1990
DEFINITION  Bovine acidic eye-derived fibroblast growth factor (EDGF II) mRNA,
            complete cds.
ACCESSION   M35608
KEYWORDS    acidic eye-derived fibroblast growth factor.
SOURCE      Bovine retina, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 668)
  AUTHORS   Alterio,J., Halley,C., Brou,C., Soussi,T., Courtois,Y. and
            Laurent,M.
  TITLE     Characterization of a bovine acidic FGF cDNA clone and its
            expression in brain and retina
  JOURNAL   FEBS Lett. 242, 41-46 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         91      558     acidic eye-derived fibroblast growth factor
BASE COUNT      173 a    184 c    170 g    141 t
ORIGIN      
        1 ggatcctctt tcccttctac tggagaggaa aagccctcag cctgcaagct gttcagcctt
       61 gaaacagcca caaccagcag ctgctgagcc atggctgaag gagaaaccac gaccttcacg
      121 gccctgactg agaagtttaa cctgcctcta ggcaattaca agaagcccaa gctcctctac
      181 tgcagcaacg ggggctactt cctgagaatc ctcccagatg gcacagtgga tgggacgaag
      241 gacaggagcg accagcacat tcagctgcag ctctgtgcgg aaagcatagg ggaggtgtat
      301 attaagagta cggagactgg ccagttcttg gccatggaca ccgacgggct tttgtacggc
      361 tcacagacac ccaatgagga atgtttgttc ctggaaaggt tggaggaaaa ccattacaac
      421 acctacatat ccaagaagca tgcagagaag cattggttcg ttggtctcaa gaagaacgga
      481 aggtctaaac tcggtcctcg gactcacttc ggccagaaag ccatcttgtt tctccccctg
      541 ccagtctcct ctgattaaag aaatctgttg tgggtgctga gccactccag aggaatctga
      601 aggggtcctc acctggctga ccccagattg taccctttac cattggccgt gctaacccct
      661 ggcccaca
//
LOCUS       CHKGHA        778 bp ss-mRNA            VRT       31-JUL-1990
DEFINITION  Chicken growth hormone mRNA, complete cds.
ACCESSION   M35609
KEYWORDS    growth hormone; somatotropin.
SOURCE      Chicken hypophysis, cDNA to mRNA, clone pcGH1.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 778)
  AUTHORS   Zhvirblis,G.S., Gorbulev,V.G., Rubtsov,P.M., Karapetyan,R.V.,
            Zhuravlev,I.V., Fisinin,V.I., Skryabin,K.G. and Baev,A.A.
  TITLE     Genetic engineering of peptide hormones: I. Cloning and primary
            structure of cDNA of chicken growth hormone
  JOURNAL   Mol. Biol. 21, 1324-1328 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         36      686     growth hormone
    mRNA    <     1      778     GH mRNA
BASE COUNT      178 a    257 c    196 g    147 t
ORIGIN      
        1 gttcaagcaa cacctgagca actctcccgg caggaatggc tccaggctcg tggttttctc
       61 ctctcctcat cgctgtggtc acgctgggac tgccgcagga agctgctgcc accttccctg
      121 ccatgcccct ctccaacctg tttgccaacg ctgtgctgag ggctcagcac ctccacctcc
      181 tggctgccga gacatataaa gagttcgaac gcacctatat tccggaggac cagaggtaca
      241 ccaacaaaaa ctcccaggct gcgttttgtt actcagaaac catcccagct cccacgggga
      301 aggatgacgc ccagcagaag tcagacatgg agctgcttcg gttttcactg gttctcatcc
      361 agtcctggct gactcccgtg caatacctaa gcaaggtgtt cacgaacaac ttggtttttg
      421 gcacctcaga cagagtgttt gagaaactaa aggacctgga agaagggatc caagccctga
      481 tgagggagct ggaggaccgc agcccgcggg gcccgcagct cctcagaccc acctacgaca
      541 agttcgacat ccacctgcgc aacgaggacg ccctgctgaa gaactacggc ctgctgtcct
      601 gcttcaagaa ggatctgcac aaggtggaga cctacctgaa ggtgatgaag tgccggcgct
      661 tcggagagag caactgcacc atctgaggcc ccgtgcctgc gccatggctg atggccctgt
      721 cccccccccc cccttcctcc ccgtcaccaa aaacacgagg aataaacccc acagcgcc
//
LOCUS       ECOTRYSU3A     85 bp ss-tRNA            RNA       31-JUL-1990
DEFINITION  E.coli su+III tRNA-Tyr suppressor.
ACCESSION   M35560
KEYWORDS    transfer tRNA-Tyr.
SOURCE      E.coli (strain MB100) RNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 85)
  AUTHORS   Abelson,J.N., Gefter,M.L., Barnett,L., Landy,A., Russell,R.L. and
            Smith,J.D.
  TITLE     Mutant tyrosine transfer ribonucleic acids
  JOURNAL   J. Mol. Biol. 47, 15-28 (1970)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    tRNA          1       85     suppressor Tyr-tRNA su-III
    anticdn      35       37     suppressor Tyr-tRNA su-III anticodon cta
    modified      8        8     s4u
    modified     17       17     gm
    modified     38       38     ms2i6a
    modified     40       40     p
    modified     54       54     p
    mut          15       15     g in wild-type; a in su-15
    mut          17       17     gm in wild-type; a in su-17
    mut          31       31     g in wild-type; a in su-12
BASE COUNT       19 a     28 c     22 g     16 t
ORIGIN      
        1 ggtggggttc ccgagcggcc aaagggagca gactctaaat ctgccgtcat cgacttcgaa
       61 ggttcgaatc cttcccccac cacca
//
LOCUS       HUMG6PDC      608 bp ss-mRNA            PRI       31-JUL-1990
DEFINITION  Human glucose-6-phosphate dehydrogenase (G6PD) mRNA, 3' end.
ACCESSION   M35604
KEYWORDS    glucose-6-phosphate dehydrogenase.
SOURCE      Human fibroblast, cDNA to mRNA, clone pGD6405.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 608)
  AUTHORS   Toniolo,D., Persico,M.G., Battistuzzi,G. and Luzzatto,L.
  TITLE     Partial purification and characterization of the messenger RNA for
            human glucose-6-phosphate dehydrogenase
  JOURNAL   Mol. Biol. Med. 2, 89-103 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      103     glucose-6-phosphate dehydrogenase (G6PD)
                                 /hgml_locus_uid="LH0033J" /nomgen="G6PD"
                                 /map="Xq28"
    mRNA    <     1      608     G6PD mRNA
    signal      587      592     polyA signal
BASE COUNT      143 a    143 c    183 g    138 t      1 others
ORIGIN      Chromosome Xq28.
        1 ggagaagaac cttggggtgc cagggtgggg gcagaagcat gggctgggtt ccggttcatc
       61 ctcctccacc ctgccgtgtg tgtgggcaca agaggacatc taaccacctg ctccttggac
      121 gaggccccca ggggtggtag aggctggaag gaagccacat caggaggacg ccactccggc
      181 ccttcaccct tgccaagtga gctgctcaca gtgtggtcag ggctgcgcgt gctggaggcc
      241 ctcctgcctg ggccttgtgg ggcaaatatt gggtccccag gctggaaaga tggacagagg
      301 cccaatgggt gaaggctttg aagagcacac agaagcccct ggccccccac gagagctgga
      361 gagccatgta tatggcttca aagccaccta cggcagggac acactcgtga gcatgtgtgg
      421 cctgcagttc aggtgataca tttaccagtg ttcttgtttg tgtggtgcca ggaaattgat
      481 tttggaaaaa gtgaaataac attaaaggtg aatgtgaggc ttctactttt atccaaaagg
      541 agctatatta gctaggctgt ttctgatayc caatcattgg tttaacaata aaggcaattt
      601 gtttaatc
//
LOCUS       MSGMPB57A     553 bp ds-DNA             BCT       31-JUL-1990
DEFINITION  M.bovis immunogenic protein MPB57 gene, complete cds.
ACCESSION   M35389
KEYWORDS    immunogenic protein MPB57.
SOURCE      M.bovis (strain BCG) DNA.
  ORGANISM  Mycobacterium bovis
            Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae.
REFERENCE   1  (bases 1 to 553)
  AUTHORS   Yamaguchi,R., Matsuo,K., Yamazaki,A., Nagai,S., Terasaka,K. and
            Yamada,T.
  TITLE     Immunogenic protein MPB57 from Mycobacterium bovis BCG: Molecular
            cloning, nucleotide sequence and expression
  JOURNAL   FEBS Lett. 240, 115-117 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        223      525     immunogenic protein MPB57
    binding     209      213     MPB57 ribosomal binding site
    signal       29       34     MPB57 -35 region
    signal       50       55     MPB57 -10 region
BASE COUNT      113 a    170 c    184 g     86 t
ORIGIN      
        1 ttggcaacca ggaagcaagg ggccgccctt gagtgctagc actctcatgt atagagtgct
       61 agatggcaat cggctaaccc ctgcgtcggc acccgcgacg acggcgcagg gcgcggacgt
      121 cacctggtaa ttcggacggt tcgggcacgc cccggaccga ccgccaactc cggtccgggc
      181 gagcgtcccg ggctctgatc caaatagtgg agggctccaa tcgtggcgaa ggtgaacatc
      241 aagccactcg aggacaagat tctcgtgcag gccaacgagg ccgagaccac gaccgcgtcc
      301 ggtctggtca ttcctgacac cgccaaggag aagccgcagg agggcaccgt cgttgccgtc
      361 ggccctggcc ggtgggacga ggacggcgag aagcggatcc cgctggacgt tgcggagggt
      421 gacaccgtca tctacagcaa gtacggcggc accgagatca agtacaacgg cgaggaatac
      481 ctgatcctgt cggcacgcga cgtcgttggc cgtcgtttca agtagtagag cgtgttcccc
      541 ggcgatcccg tcg
//
LOCUS       MUSMTMTAAA     78 bp ds-DNA             ORG       31-JUL-1990
DEFINITION  Mouse mitochondrial maternally transmitted antigen-alpha
            (Mta-alpha) mRNA, 5' end.
ACCESSION   M35494
KEYWORDS    maternally transmitted antigen.
SOURCE      Mouse mitochondrial DNA.
  ORGANISM  Mitochondrion Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; 
            Mus musculus.
REFERENCE   1  (bases 1 to 78)
  AUTHORS   Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and
            Yonekawa,H.
  TITLE     Molecular definition of a mitochondrially encoded mouse minor
            histocompatibility antigen
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >    78     maternally transmitted antigen-alpha
                                 (Mta-alpha) (gtg start codon)
BASE COUNT       22 a     23 c      8 g     25 t
ORIGIN      
        1 gtgttcttta ttaatatcct aacactcctc gtccccattc taatcgccat agccttccta
       61 acattagtag aacgcaaa
//
LOCUS       MUSMTMTABA     78 bp ds-DNA             ORG       31-JUL-1990
DEFINITION  Mouse mitochondrial maternally transmitted antigen-beta (Mta-beta)
            mRNA, 5' end.
ACCESSION   M36861
KEYWORDS    maternally transmitted antigen.
SOURCE      Mouse mitochondrial DNA.
  ORGANISM  Mitochondrion Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; 
            Mus musculus.
REFERENCE   1  (bases 1 to 78)
  AUTHORS   Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and
            Yonekawa,H.
  TITLE     Molecular definition of a mitochondrially encoded mouse minor
            histocompatibility antigen
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >    78     maternally transmitted antigen-beta (Mta-beta)
                                 (gtg start codon)
BASE COUNT       21 a     24 c      9 g     24 t
ORIGIN      
        1 gtgttcttta ttaatgccct aacactcctc gtccccattc taatcgctat agccttccta
       61 acactagtag aacgcaaa
//
LOCUS       MUSMTMTACA     78 bp ds-DNA             ORG       31-JUL-1990
DEFINITION  Mouse mitochondrial maternally transmitted antigen-gamma
            (Mta-gamma) mRNA, 5' end.
ACCESSION   M36862
KEYWORDS    maternally transmitted antigen.
SOURCE      Mouse mitochondrial DNA.
  ORGANISM  Mitochondrion Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; 
            Mus musculus.
REFERENCE   1  (bases 1 to 78)
  AUTHORS   Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and
            Yonekawa,H.
  TITLE     Molecular definition of a mitochondrially encoded mouse minor
            histocompatibility antigen
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >    78     maternally transmitted antigen-gamma
                                 (Mta-gamma) (gtg start codon)
BASE COUNT       21 a     23 c      9 g     25 t
ORIGIN      
        1 gtgttcttta ttaatgtcct aacactcctc gtccccattc taatcgctat agccttccta
       61 acactagtag aacgcaaa
//
LOCUS       MUSMTMTADA     78 bp ds-DNA             ORG       31-JUL-1990
DEFINITION  Mouse mitochondrial maternally transmitted antigen-delta
            (Mta-delta) mRNA, 5' end.
ACCESSION   M36863
KEYWORDS    maternally transmitted antigen.
SOURCE      Mouse mitochondrial DNA.
  ORGANISM  Mitochondrion Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; 
            Mus musculus.
REFERENCE   1  (bases 1 to 78)
  AUTHORS   Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and
            Yonekawa,H.
  TITLE     Molecular definition of a mitochondrially encoded mouse minor
            histocompatibility antigen
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >    78     maternally transmitted antigen-delta
                                 (Mta-delta) (gtg start codon)
BASE COUNT       22 a     23 c      8 g     25 t
ORIGIN      
        1 gtgttcttta ttaataccct aacactcctc gtccctattc taatcgctat agccttccta
       61 acactagtag aacgcaaa
//
LOCUS       MZEHISH3A     411 bp ds-DNA             PLN       31-JUL-1990
DEFINITION  Z.mays histone H3 gene, complete cds.
ACCESSION   M35388
KEYWORDS    histone H3.
SOURCE      Z.mays (L. cv. INRA 258) 5-6 day plantlet DNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 411)
  AUTHORS   Chaubet,N., Chaboute,M.-E., Philipps,G. and Gigot,C.
  TITLE     Histone genes in higher plants: Organization and expression
  JOURNAL   Dev. Genet. 8, 461-473 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1      411     histone H3
BASE COUNT       77 a    154 c    128 g     52 t
ORIGIN      
        1 atggcccgca cgaagcagac ggcgcgcaag tcgacgggcg gcaaggcgcc ccgcaagcag
       61 ctggccacca aggcggcgcg caagtcggcg ccggcaaccg gtggcgtgaa gaagcctcac
      121 cgcttccgcc ccggcaccgt cgcgctccgg gagattcgca agtaccagaa gagcacggag
      181 ctgctcatcc gcaagctgcc cttccagcgc ctcgtccgtg agatcgcgca ggatttcaag
      241 accgacctcc gcttccagtc ctccgctgtc gccgcgctgc aggaggccgc cgaggcctac
      301 ctcgtggggc tcttcgagga caccaacctc tgcgccatcc acgccaagcg cgtcaccatc
      361 atgcccaagg acatccagct cgcgcgccgc atcaggggcg agagggcttg a
//
LOCUS       RATFIBAA      511 bp ss-mRNA            ROD       31-JUL-1990
DEFINITION  Rat alpha-fibrinogen mRNA, 3' end.
ACCESSION   M35601
KEYWORDS    alpha-fibrinogen.
SOURCE      Rat (strain Wistar) male 16-hr regenerating liver, cDNA to mRNA,
            clone lambda-7A.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 511)
  AUTHORS   Sobczak,J., Lotti,A.-M., Taroux,P. and Duguet,M.
  TITLE     Molecular cloning of mRNA sequences transiently induced during rat
            liver regeneration
  JOURNAL   Exp. Cell Res. 169, 47-56 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      282     alpha-fibrinogen
BASE COUNT      145 a    124 c    100 g    141 t      1 others
ORIGIN      
        1 cctgaacttg gttcctttta tgacagccgc tttggttcac tcacaaagaa cttcaaagaa
       61 tttggcagta agacctctga ttctgacatc ttcacagaca tcgagaaccc tagctcccat
      121 gtacctgagt tttcttncag tagtaaaacc tcaactgtca ggaaacaagt aaccaagagc
      181 tataaaatgg cagatgaggc agcaagtgaa gctcaccaag aaggagacac tcgaaccacc
      241 aagaggggcc gagctcgcac aatgagaggt atccacgctt aactctggga agttgccctg
      301 accccctaga ctaagttaac catttctgca aagtgcttac caggcgcgct ctttcttaac
      361 ctcttctagt gctttggtgg aatctcattt tttttcatgc tagactgtac gttccttggg
      421 ggcagggact ttgccatgtg tctatttctc taattcccca atgcataaca gtgcagtcat
      481 ttctcaataa atatatttta aataaatgaa c
//
LOCUS       RATFIBBA      261 bp ss-mRNA            ROD       31-JUL-1990
DEFINITION  Rat beta-fibrinogen mRNA, 3' end.
ACCESSION   M35602
KEYWORDS    beta-fibrinogen.
SOURCE      Rat (strain Wistar) male 16-hr regenerating liver, cDNA to mRNA,
            clone lambda-7B.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 261)
  AUTHORS   Sobczak,J., Lotti,A.-M., Taroux,P. and Duguet,M.
  TITLE     Molecular cloning of mRNA sequences transiently induced during rat
            liver regeneration
  JOURNAL   Exp. Cell Res. 169, 47-56 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      168     beta-fibrinogen
BASE COUNT       73 a     49 c     61 g     78 t
ORIGIN      
        1 tgtcatgcag ccaatccaaa cggcagatac tactggggtg gacagtacac ctgggacatg
       61 gcaaagcatg gcacagatga tggtgtagta tggatgaatt ggaaggggtc atggtactca
      121 atgaggaaga tgagtatgaa gatcaggccc ttcttcccac agcaatagtc cccaatagcg
      181 tagatttttg ctcttctgta tgtgacaaca tttttgtaca ttatgttatt ggaattttct
      241 ttcatacatt atattcctac t
//
LOCUS       RATNKBA       727 bp ss-mRNA            ROD       31-JUL-1990
DEFINITION  Rat neurokinin B mRNA, complete cds.
ACCESSION   M35607
KEYWORDS    neurokinin B.
SOURCE      Rat cerebral cortex, cDNA to mRNA, clone cRNKp[1,6].
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 727)
  AUTHORS   Bonner,T.I., Affolter,H.-U., Young,A.C. and Young,W.S.III.
  TITLE     A cDNA encoding the precursor of the rat neuropeptide neurokinin B
  JOURNAL   Mol. Brain Res. 2, 243-249 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        115      465     neurokinin B precursor
    sigp        115      162     neurokinin B signal peptide
    matp        358      387     neurokinin B
    mRNA    <     1      727     neurokinin B mRNA
BASE COUNT      155 a    228 c    179 g    165 t
ORIGIN      
        1 agctgggact ggttcctgcc acctccccga gcccctgcac tcttgtctct ctctgtctcc
       61 acagcagcgt caccggctcc cggatcctgc cagccgtcca gctctacagg caccatgagg
      121 agcgccatgc tgttcgcggc tgtcctcgcc ctcagcttgg catggacctt cggggctgcg
      181 tgtgaggaac ctcaggagca gggtgggagg ctcagtaagg actctgatct ctctctgcta
      241 cctccacccc tgcttcggag actctacgac agccgctcca tctctctgga aggattgctg
      301 aaagtgctga gcaaggctag cgtgggaccg aaggagacat cacttccaca gaaacgtgac
      361 atgcacgact tctttgtggg acttatgggc aagaggaaca gccaaccaga cactcccgct
      421 gatgtggttg aagagaacac ccccagcttt ggcgtcctca aataggccag cagtgcagaa
      481 aagcactcca ctctcagacc ctggactgca tcataaagac agggttcctg tggcggtccc
      541 agtgcctgcg ctcctgcttc cctgcctgca aggtcctcct gttggctccc ttccctactc
      601 tgcacagatg ctgcatatga acagcctcta cccccatatc aattatggtt tctgtagtgt
      661 cctgcattaa aaataccatg tctcctcctc aacaataaag ggtttttaca atggagtgac
      721 tgaaaag
//
LOCUS       RABIGHCPS     577 bp ss-mRNA            MAM       31-JUL-1990
DEFINITION  Rabbit nonproductive VHa1 mRNA for Ig(mu) heavy chain constant
            region.
ACCESSION   X13699
KEYWORDS    constant region; immunoglobulin; immunoglobulin heavy chain;
            mu-immunoglobulin.
SOURCE      rabbit (Oryctolagus cuniculus).
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 577)
  AUTHORS   Mage,R.G.
  TITLE     ;
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 577)
  AUTHORS   Mage,R.G., Newman,B.A., Harindranath,N., Bernstein,K.E.,
            Becker,R.S. and Knight,K.L.
  TITLE     Evolutionary conservation of splice sites in sterile C-mu
            transcripts and of immunoglobulin heavy chain (IgH) enhancer
            sequences
  JOURNAL   Mol. Immunol. 26, 1007-1010 (1989)
  STANDARD  simple automatic
COMMENT     *source: strain=allotype-defined NIH VHal Ckb9 Rabbit No CW247-4;
            *source: library=pBR322; clone=pB3; The sequence overlaps with that
            reported by Bernstein et. al. in J. Immunol. 132:490-495(1984)
            J00666 (F-I haplotype).  The cDNA pB3 sequence starts at a position
            corresponding to pos.  70 in a germline DNA from JH-C-mu intron 80
            bp downstream of the Ig(mu) enhancer. See x13700.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE        549   >577       CH1 domain of Ig(mu)H chain
            
            [1] Author address Mage R.G., Lab of Immunology, NIAID , NIH,
            Building 10,11-N311, NIH, Bethesda, MD 20892.
            
            Submitted (05-DEC-1988) on tape to the EMBL data library
BASE COUNT      145 a    120 c    132 g    180 t
ORIGIN      
        1 tagaaagctc aagcttagct tatcaatttc taaattgtat tcatttttaa agtaagatat
       61 tttagaaatt aaacttacag atggtttatt aatgcgccca tgaaactttt taaaattact
      121 ttgttatttc ttcatacggt aattaacctc aaacttcaga gacctcaaag cattattttt
      181 gcaacgcctg cgaccgtcat ttttaagagt ctgttttctc tatgattaga agagacgttt
      241 tgaggcttca tttcagtatt tttcaagact gcttttcaag tattcatttt tgcaccagtt
      301 ctagctgagg tggacctaag tgtcctgagt tattgggtta aacgtgtgac ttggtgaatt
      361 ggagagccat cagacagtgc agagacagga gtcgggagct ctgcagaagc cgggccaccc
      421 acgccttgag gtttcccaag accgccgggt ctgggggtgc agccgtggct tctggttagt
      481 caggaattcc tgtttggagt ggacttggag gacccgcagg acgtgccagt gaccagccac
      541 tgcacatgtg agcctgtcat ctccaactct gtacccg
//
LOCUS       RABIGHENH    1257 bp ds-DNA             MAM       31-JUL-1990
DEFINITION  Rabbit IgH gene JH-C(mu) intron with enhancer region yielding a
            nonproductive C(mu) transcript.
ACCESSION   X13700
KEYWORDS    enhancer; germ line; immunoglobulin; immunoglobulin heavy chain;
            mu-immunoglobulin.
SOURCE      rabbit (Oryctolagus cuniculus).
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 1257)
  AUTHORS   Mage,R.G.
  TITLE     ;
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 1257)
  AUTHORS   Mage,R.G., Newman,B.A., Harindranath,N., Bernstein,K.E.,
            Becker,R.S. and Knight,K.L.
  TITLE     Evolutionary conservation of splice sites in sterile C-mu
            transcripts and of immunoglobulin heavy chain (IgH) enhancer
            sequences
  JOURNAL   Mol. Immunol. 26, 1007-1010 (1989)
  STANDARD  simple automatic
COMMENT     *source: library=cosmid; clone=COS 3.3; tissue=sperm; *source:
            strain=allotype-defined a3n80f71g75 rabbit No X314-6.  see x13699
            for corresponding transcript seq
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               IVS           1   1257       JH-C(mu) intron
               SITE        591    598       conserved enhancer octamer
               TRANSCR     670   1227       homolog. to pB3 cDNA (X13699)
               IVS        1228  >1257       intron (to form pB3 cDNA)
            
            [1] Author address Mage R.G., Lab of Immunology, NIAID , NIH\,
            Building 10,11-N311, NIH, Bethesda, MD 20892.
            
            Submitted (05-DEC-1988) on tape to the EMBL data library
BASE COUNT      328 a    248 c    320 g    361 t
ORIGIN      
        1 gttagaaaga gaacggttgt cagcagtaca aggaaggagg gccgtggagc cccaaaggcg
       61 ctgcctctga cagggaccta gcaaggcctg ggcagcccaa ccgtgctggg acggcgccca
      121 ccaggtgtgc agggacatgg tgtccgaagt ggcttctgag aaaagtggct gagaaaattg
      181 agaatgaaga ctttttgttc atgtatattc agctttttgt gtttttttaa tgtaaaggac
      241 ttttagatta ttattgttca gtatctgtct gggattggca caagacagca gagaaacagt
      301 ggagtggcac cagcaggccc caatacagga ctgagttttg acaaaaatta tttaagttta
      361 atttttaatt aattgagcaa tgccagatta gtgcatgagt gacagccaag atggccgcgc
      421 atccggccgg acacctgcag caggtggcag gaagcatgtc gcactgagag tctattttag
      481 gaagccaaaa agcaccctgt gtgaatttat agctcctgtt tttgaagtgg tggttttcaa
      541 gtggttttcc cggatccgaa agtgttgcag agcaaaacaa cacctgggta gtttgcatat
      601 ctaaaatagg acaggggact gccagggttc ctcttgtcac tatttgaatt tattttttca
      661 atctgaactt agcaacttct agtttgctca agcttagctt atcaatttct aaattgtatt
      721 catttttaaa gtaagatatt ttagaaatta aacttacaga tggtttatta atgcgcccat
      781 gaaacttttt aaaattactt tgttatttct tcatacggta attaacctca aacttcagag
      841 acctcaaagc attatttttg caacgcctgc gaccgtcatt tttaagagtc tattttctct
      901 atgattagaa gagacgtttt gaggcttcat ttcagtattt ttcaagactg cttttcaagt
      961 attcattttc acaccagttc tagctgaggt ggacctaagt gtcctgagtt attgggttaa
     1021 acgtgtgact tggtgaattg gagagccatc agacagtgca gagacaggag tcgggagctc
     1081 tgcagaagcc gggccaccca cgccttgagg tttcccagga ccgccgggtc tgggggtgca
     1141 gccgtggctt ctggttagtc aggaattcct gtttggagtg gacttggagg acccgcagga
     1201 cgtgccggtg accagccact gcacatggtg agaggggcag ctgctctgcc acagatc
//
LOCUS       ECOTN5011      60 bp ds-DNA             BCT       31-JUL-1990
DEFINITION  E.coli mercuric ion resistance transposon (Tn501), 5' end.
ACCESSION   M25453
KEYWORDS    transposon.
SEGMENT     1 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 60)
  AUTHORS   Brown,N.L., Choi,C.-L., Richmond,M.H. and Grinsted,J.
  TITLE     Nucleotide sequences at the ends of the mercury resistance
            transposon, Tn501
  JOURNAL   Nucleic Acids Res. 8, 1933-1945 (1980)
  STANDARD  simple staff_review
COMMENT     Transposon Tn501 was originally isolated in Pseudomonas aeruginosa.
FEATURES       from  to/span     description
    recomb       12       13     E.coli DNA end/Tn501 DNA start
BASE COUNT       21 a     11 c     17 g     11 t
ORIGIN      
        1 ctgaaactat gagggggaac cgcagaattc ggaaaaaatc gtacgctaag ctaacggtgt
//
LOCUS       ECOTN5012      60 bp ds-DNA             BCT       31-JUL-1990
DEFINITION  E.coli mercuric ion resistance transposon (Tn501), 3' end.
ACCESSION   M25454
KEYWORDS    transposon.
SEGMENT     2 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 60)
  AUTHORS   Brown,N.L., Choi,C.-L., Grinsted,J., Richmond,M.H. and
            Whitehead,P.R.
  TITLE     Nucleotide sequences at the ends of the mercury resistance
            transposon, Tn501
  JOURNAL   Nucleic Acids Res. 8, 1933-1945 (1980)
  STANDARD  simple staff_review
COMMENT     Transposon Tn501 was originally isolated in Pseudomonas aeruginosa.
FEATURES       from  to/span     description
    recomb       48       49     Tn501 DNA end/E.coli DNA start
BASE COUNT       15 a     17 c     13 g     15 t
ORIGIN      About 8.2 kb after segment 1
        1 cccggaaaac cttagcgtac gattttttcc gaattctgcg agccccccta tgagatgaga
//
LOCUS       COXCBBEA     1749 bp ds-DNA             BCT       31-JUL-1990
DEFINITION  Plasmid QpRS (from Coxiella burnetii) surface protein (cbbE') gene,
            complete cds.
ACCESSION   M29982
KEYWORDS    surface protein.
SOURCE      Plasmid QpRS (from Coxiella burnetti, strain Biotzere, isolate
            priscilla Q177) DNA.
  ORGANISM  Plasmid QpRS
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1749)
  AUTHORS   Minnick,M.F., Heinzen,R.A., Frazier,M.E. and Mallavia,L.P.
  TITLE     Characterization and expression of the cbbE' gene of Coxiella
            burnetii
  JOURNAL   J. Gen. Microbiol. 136, 1099-1107 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.A.Heinze, 20-NOV-1989.
FEATURES       from  to/span     description
    pept        196     1683     E' protein
    signal      157      162     cbbE' -35 region (pot.)
    signal      178      183     cbbE' -10 region (pot.)
    signal     1691     1713     rho-independent transcription terminator
                                 (pot.)
    binding     187      193     ribosome binding site (pot.)
BASE COUNT      579 a    335 c    343 g    492 t
ORIGIN      
        1 gtcgacgtct gcctttgccg ctatagttca ccttcaaata aacaacaata tttatttata
       61 aattttcgga gtccatcatg ccgctgagaa aggaaacttt taaagtaagt tacacattag
      121 gtttttttgt gaaaataaat cgactagctt agcttattta atcctttctt aaaattttat
      181 aataacggag agactatgcc taaaaaactc gtacccaaag actatgaata tatccatctg
      241 gatcttacca ccggtgaaat aaactttacg tcgtttaatt cgcttgaaga attgcaagcc
      301 tctttaaaag aaggtcagat ttttttccac aaaagtgtca tctttgaaga aaaaccagaa
      361 agtggggaaa tttactcccc taaactgata agccagatat atcgaaaaga acaggaactc
      421 tttgaaataa gggaaaaaag taagggacat ccactaccgg ttactaagaa actgcttaag
      481 agagggcagg gaacgatagt gtgttgcggt atttacacaa aagaactctt gaaaaacgta
      541 gcggaaaagg gacagtacga tacccaatgt gacgatctaa atttgggaat ttttcacgta
      601 cgcgcccata aacctttagg catcgcgcaa aggcttgtgc atcttccgct tcccgaggat
      661 gcttcttccg ctgcagtagc cactgaaaat ttattcggct taatacgatt tatactcgtc
      721 aatgatcccg ctaagaaaaa aatttactta cctatctctt gttttgcaat tgagaagcgt
      781 atagaacaag agcatataat tggatattct cagaaagata gcctggcgct ctctcagcga
      841 gcttattatg aatataagaa ggacggaacg cttatcgggc tagtcgcatt gatcggtgtt
      901 gacgtaaaga tagatggtaa gctaggtttt ttatatcatc cggtgtggcg tgagaaacaa
      961 tgggcattaa aattcaatga aaaaatgttt tattgtgctg tttcccgtgc agaaaaagaa
     1021 aaagtcttta agccgcctta ttatttggag ccgactgcga taattgtcga tgtaactgaa
     1081 acgcccgtta agcgcttaaa gaatacaagt gaggactatt tatggttgga ggtttcccaa
     1141 atttcggcaa aattctctct tttttgtgca caaaacaatt tgaaattgga gaaggctgat
     1201 tcaaaaaata aatctccttt cgtggcccta tcaatggaat cgattagtga attaacaggc
     1261 gaacagaaaa gagcttttgt caaaatcttg aatattcccg gaatcatttt ctcttcctca
     1321 accctagcta aagcgagact agagagcaaa cttcaatata ttggaccagc attaattgaa
     1381 gccgctgcag atggaaattt caccgatgtg gttgatataa taaatcgaat agaaccactc
     1441 tatgattaca aagagatttt aaaagaggca ttaaaaacac aacgtttggg aacgggcaat
     1501 actcctttac aggaggctat caaaggacag catacaagcc tagttaagta cttcagttcg
     1561 ctatccgctt cgttgaaagt cataaatcat aaaaatcatc aaggattaac agcactcaat
     1621 ttcgctacag ccattggatc atcccctgcg atcgtacaag agcttgaatg gtgctcccaa
     1681 taaagttagg aatggtccgc cgcgcaaatc attgctaata cgttatctga aaaggtgtca
     1741 tccccttcc
//
LOCUS       DROOPA        240 bp ds-DNA             INV       31-JUL-1990
DEFINITION  D.melanogaster neurogenic locus mastermind opa repeat sequence.
ACCESSION   M35653
KEYWORDS    mastermind locus.
SOURCE      D.melanogaster DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 240)
  AUTHORS   Yedvobnick,B., Smoller,D., Young,P. and Mills,D.
  TITLE     Molecular analysis of the neurogenic locus mastermind of Drosophila
            melanogaster
  JOURNAL   Genetics 118, 483-497 (1988)
  STANDARD  simple staff_review
BASE COUNT       68 a     81 c     67 g     24 t
ORIGIN      
        1 ccgccgcacg gcatgaatcc gcagcaacag cagcagcagc aacaacagca gcagcagcaa
       61 cagcaacagg cccagcagca acatggtcaa atgatgggac aaggacagcc gggtcgctat
      121 aacgactacg gcggcggctt tcccaatgac tttggcctgg gacccaatgg tccgcagcag
      181 cagcagcagc aggcgcagca acagcagccg cagcagcaac acctgccgcc gcagttccat
//
LOCUS       DRORS1         60 bp ds-DNA             INV       31-JUL-1990
DEFINITION  D.melanogaster neurogenic locus mastermind RS repeat sequence.
ACCESSION   M35654
KEYWORDS    mastermind locus.
SOURCE      D.melanogaster DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 60)
  AUTHORS   Yedvobnick,B., Smoller,D., Young,P. and Mills,D.
  TITLE     Molecular analysis of the neurogenic locus mastermind of Drosophila
            melanogaster
  JOURNAL   Genetics 118, 483-497 (1988)
  STANDARD  simple staff_review
BASE COUNT       29 a     29 c      1 g      1 t
ORIGIN      
        1 gcacacacac acacacacac acacacacac acacacacac acacacacac acacacacat
//
LOCUS       DRORS2         60 bp ds-DNA             INV       31-JUL-1990
DEFINITION  D.melanogaster neurogenic locus mastermind RS repeat sequence.
ACCESSION   M35655
KEYWORDS    mastermind locus.
SOURCE      D.melanogaster DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 60)
  AUTHORS   Yedvobnick,B., Smoller,D., Young,P. and Mills,D.
  TITLE     Molecular analysis of the neurogenic locus mastermind of Drosophila
            melanogaster
  JOURNAL   Genetics 118, 483-497 (1988)
  STANDARD  simple staff_review
BASE COUNT       20 a     27 c      7 g      6 t
ORIGIN      
        1 agttgccctg ctcacacaca cgcacacaca cacacacaca cacacacatg catgcacgcc
//
LOCUS       ECOCS3        612 bp ds-DNA             BCT       31-JUL-1990
DEFINITION  E.coli major fimbrial subunit of CS3 gene, complete cds.
ACCESSION   M35657
KEYWORDS    fimbrial subunit.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 612)
  AUTHORS   Boylan,M., Smyth,C.J. and Scott,J.R.
  TITLE     Nucleotide sequence of the gene encoding the major subunit of CS3
            fimbriae of enterotoxigenic Escherichia coli
  JOURNAL   Infect. Immun. 56, 3297-3300 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         82      588     fimbrial subunit precursor of CS3
    sigp         82      126     CS3 fimbrial subunit signal peptide
    matp        127      585     CS3 fimbrial subunit
BASE COUNT      197 a    114 c    110 g    191 t
ORIGIN      
        1 agcagtacag ttccaggtac gtatactgtt ggtcttaacg taaccagtaa tgttatttaa
       61 agtgaatgta tgagggattc gatgttaaaa ataaaatact tattaatagg tctttcactg
      121 tcagctatga gttcatactc actagctgca gcggggccca ctctaaccaa agaactggca
      181 ttaaatgtgc tttctcctgc agctctggat gcaacttggg ctcctcagga taatttaaca
      241 ttatccaata ctggcgtttc taatactttg gtgggtgttt tgactctttc aaataccagt
      301 attgatacag ttagcattgc gagtacaagt gtttctgata catctaagaa tggtacagta
      361 acttttgcac atgagacaaa taactctgct agctttgcca ccaccatttc aacagataat
      421 gccaacatta cgttggataa aaatgctgga aatacgattg ttaaaactac aaatgggagt
      481 cagttgccaa ctaatttacc acttaagttt attaccactg aaggtaacga acatttagtt
      541 tcaggtaatt accgtgcaaa tataacaatt acttcgacaa ttaaataatt atataataga
      601 cgtagccttc ga
//
LOCUS       HUMCRYAX1     189 bp ds-DNA             PRI       31-JUL-1990
DEFINITION  Human alpha-A-crystallin gene, exon 1.
ACCESSION   M35629
KEYWORDS    alpha-A-crystallin.
SEGMENT     1 of 2
SOURCE      Human spleen DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 189)
  AUTHORS   McDevitt,D.S., Hawkins,J.W., Jaworski,C.J. and Piatigorsky,J.
  TITLE     Isolation and partial characterization of the human alpha-A-
            crystallin gene
  JOURNAL   Exp. Eye Res. 43, 285-291 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1  /   189     alpha-A-crystallin, exon 1
BASE COUNT       27 a     72 c     45 g     45 t
ORIGIN      
        1 atggacgtga ccatccagca cccctggttc aagcggaccc tggggccctt ctaccccagc
       61 cggctgttcg accagttttt cggcgagggg ctttttgagt atgacctgct gcccttcctg
      121 tcgtccacca tcacccccta ctaccgccag tccctcttcc gcaccgtgct ggactccggc
      181 atctctgag
//
LOCUS       HUMCRYAX2      51 bp ds-DNA             PRI       31-JUL-1990
DEFINITION  Human alpha-A-crystallin gene, exon 3 (partial).
ACCESSION   M35628
KEYWORDS    alpha-A-crystallin.
SEGMENT     2 of 2
SOURCE      Human spleen DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 51)
  AUTHORS   McDevitt,D.S., Hawkins,J.W., Jaworski,C.J. and Piatigorsky,J.
  TITLE     Isolation and partial characterization of the human alpha-A-
            crystallin gene
  JOURNAL   Exp. Eye Res. 43, 285-291 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       27     alpha-A-crystallin, exon 3 (AA at 1)
BASE COUNT        7 a     23 c     11 g     10 t
ORIGIN      About 1.7 kb after segment 1
        1 aagcccacct cggctccctc gtcctaagca gcattgcctc ggctggctcc c
//
LOCUS       HUMTBP1      1341 bp ss-mRNA            PRI       31-JUL-1990
DEFINITION  Human tat binding protein-1 (tbp-1) mRNA, complete cds.
ACCESSION   M34079
KEYWORDS    Tat binding protein-1.
SOURCE      Human Jurkat T-cell line, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites for [2])
  AUTHORS   Nelbock,P., Dillion,P.J., Perkins,A. and Rosen,C.A.
  TITLE     A cDNA for a protein that interacts with the human immunodeficiency
            virus tat transactivator
  JOURNAL   Science 248, 1650-1653 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1341)
  AUTHORS   Nelbock,P., Dillion,P.J., Perkins,A. and Rosen,C.A.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by C.A.Rosen, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         63     1277     tat binding protein-1 (tbp-1)
BASE COUNT      350 a    337 c    412 g    242 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccggc gaccgtgtgg gatgaggccg agcaagatgg aattggggag gaggtgctca
       61 agatgtccac ggaggagatc atccagcgca cacggctgct ggacagtgag atcaagatca
      121 tgaagagtga agtgttgaga gtcacccatg agctccaagc catgaaggac aagataaaag
      181 agaacagtga gaaaatcaaa gtgaacaaga ccctgccgta ccttgtctcc aacgtcatcg
      241 agctcctgga tgttgatcct aatgaccaag aggaggatgg tgccaatatt gacctggact
      301 cccagaggaa gggcaagtgt gctgtgatca aaacctctac acgacagacg tacttccttc
      361 ctgtgattgg gttggtggat gctgaaaagc taaagccagg agacctggtg ggtgtgaaca
      421 aagactccta tctgatcctg gagacgctgc ccacagagta tgactcgcgg gtgaaggcca
      481 tggaggtaga cgagaggccc acggagcaat acagtgacat tgggggtttg gacaagcaga
      541 tccaggagct ggtggaggcc attgtcttgc caatgaacca caaggagaag tttgagaact
      601 tggggatcca acctccaaaa ggggtgctga tgtatgggcc cccagggacg gggaagaccc
      661 tcctggcccg ggcctgtgcc gcacagacta aggccacctt cctaaagctg gctggccccc
      721 agctggtgca gatgttcatt ggagatggtg ccaagctagt ccgggatgcc tttgccctgg
      781 ccaaggagaa agcgccctct atcatcttca ttgatgagtt ggatgccatc ggcaccaagc
      841 gctttgacag tgagaaggct ggggaccggg aggtgcagag gacaatgctg gagcttctga
      901 accagctgga tggcttccag cccaacaccc aagttaaggt aattgcagcc acaaacaggg
      961 tggacatcct ggaccccgcc ctcctccgct cgggccgcct tgaccgcaag atagagttcc
     1021 cgatgcccaa tgaggaggcc cgggccagaa tcatgcagat ccactcccga aagatgaatg
     1081 tcagtcctga cgtgaactac gaggagctgg cccgctgcac agatgacttc aatggggccc
     1141 agtgcaaggc tgtgtgtgtg gaggcgggca tgatcgcact ggccaggggt gccacggagc
     1201 tcacccacga ggactacatg gaaggcatcc tggaggtgca ggccaagaag aaagccaacc
     1261 tacaatacta cgcctaggca cacaggccag ccccagtctc acggctgaag tgcgcaataa
     1321 aagatggttt agggggaatt c
//
LOCUS       MUSMETMO      280 bp ds-DNA             SYN       31-JUL-1990
DEFINITION  MoMLV/mouse metallothionein promoter recombinant DNA.
ACCESSION   M35969
KEYWORDS    .
SOURCE      Mouse and mouse moloney leukemia virus DNA.
REFERENCE   1  (bases 1 to 280)
  AUTHORS   Mclvor,R.S.
  TITLE     Deletion in a recombinant retroviral vector resulting from a
            cryptic splice donor signal in the moloney leukemia virus encvelope
            gene
  JOURNAL   Virology 176, 652-655 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb      127      128     MoMLV DNA end/mouse metallothionein promoter
                                 DNA start
    recomb      241      242     metallothionein promoter region deletion
BASE COUNT       64 a     78 c     77 g     61 t
ORIGIN      
        1 gggtccagac tccttctact gtgcctattg gggctgtgag acaaccggta gagcttactg
       61 gaagccctcc tcatcatggg atttcatcac agtaaacaac aatctcacct ctgaccaggc
      121 tgtccagctt cgatccagag agagacctgg gcggagctgg tcgctgctag gaactccagg
      181 aaaggagaag ctgaggttac cacgctgcga atgggtttac ggagatagct ggctttccgg
      241 gctccagctt caccagatcc gtcgacctgc agccaagctt
//
LOCUS       PLYREGU       203 bp ds-DNA             VRL       31-JUL-1990
DEFINITION  Polyomavirus JC HindIII C fragment regulatory sequence.
ACCESSION   M34921
KEYWORDS    .
SOURCE      Polyomavirus JC DNA.
  ORGANISM  Polyomavirus JC
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Polyomaviruses.
REFERENCE   1  (bases 1 to 203)
  AUTHORS   Frisque,R.J.
  TITLE     Regulatory sequences and virus-cell interactions of JC virus
  JOURNAL   Prog. Clin. Biol. Res. 105, 41-59 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         99  <     1 (c) T- antigen
BASE COUNT       62 a     55 c     31 g     55 t
ORIGIN      
        1 aagctttctc atgacaggaa tgttccccca tgcagaccta tcaaggccta ataaatccat
       61 aagctccatg gattcctccc tattcagcac tttgtccatt ttagcttttt gcagcaaaaa
      121 attactgcaa aaaagggaaa aacaagggaa tttccctggc ctcctaaaaa gcctccacgc
      181 ccttactact tctgagtaag ctt
//
LOCUS       PLYREGUX      292 bp ds-DNA             VRL       31-JUL-1990
DEFINITION  Polyomavirus JC regulatory sequence.
ACCESSION   M34922
KEYWORDS    .
SOURCE      Polyomavirus JC DNA, clones Mad1-[TC,Br].
  ORGANISM  Polyomavirus JC
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Polyomaviruses.
REFERENCE   1  (bases 1 to 292)
  AUTHORS   Frisque,R.J.
  TITLE     Regulatory sequences and virus-cell interactions of JC virus
  JOURNAL   Prog. Clin. Biol. Res. 105, 41-59 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    binding       3       19     pot. T-antigen binding site
    rpt          20      117     tandem repeat copy A
    rpt         118      215     tandem repeat copy B
BASE COUNT       96 a     68 c     81 g     47 t
ORIGIN      0.67 - 0.73 mu.
        1 cggaggcggc ctcggcctcc tgtatatata aaaaaaaggg aagggatggc tgccagccaa
       61 gcatgagctc atacctaggg agccaaccag ctaacagcca gtaaacaaag cacaaggctg
      121 tatatataaa aaaaagggaa gggatggctg ccagccaagc atgagctcat acctagggag
      181 ccaaccagct aacagccagt aaacaaagca caaggggaag tggaaagcag ccaagggaac
      241 atgttttgcg agccagagct gttttggctt gtcaccagct ggccatggtt ct
//
LOCUS       PT7RNA        203 bp ds-DNA             PHG       31-JUL-1990
DEFINITION  Bacteriophage T7 DNA sequence.
ACCESSION   M35614
KEYWORDS    .
SOURCE      Bacteriophage T7 (strain delta-D111) DNA RNA.
  ORGANISM  Bacteriophage T7
            Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE   1  (bases 1 to 203)
  AUTHORS   Aivazashvili,V.A., Bibilashvili,R.S., Vartikyan,R.M. and
            Kutateladze,T.V.
  TITLE     Factors affecting the discontinuous in vitro elongation of RNA by
            RNA polymerase from Escherichia coli
  JOURNAL   Mol. Biol. 15, 510-522 (1981)
  STANDARD  simple staff_review
BASE COUNT       63 a     54 c     49 g     35 t      2 others
ORIGIN      
        1 catcgagagg gacacggcga atagccatcc caatccacac gtccaacggg gcaaccgtat
       61 gtacacctga tgggttcgca atgaacacna cgaatcgaac gccttaagcg tgaactccgc
      121 attaaccgca agattacnag ataggtcggc tatgacgaac gcactgatgg catagaggta
      181 atgcatgcat gcacacatat acg
//
LOCUS       PT7RNAX       209 bp ds-DNA             PHG       31-JUL-1990
DEFINITION  Bacteriophage T7 DNA sequence.
ACCESSION   M35615
KEYWORDS    .
SOURCE      Bacteriophage T7 DNA.
  ORGANISM  Bacteriophage T7
            Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE   1  (bases 1 to 209)
  AUTHORS   Aivazashvili,V.A., Bibilashvili,R.S., Vartikyan,R.M. and
            Kutateladze,T.V.
  TITLE     Factors affecting the discontinuous in vitro elongation of RNA by
            RNA polymerase from Escherichia coli
  JOURNAL   Mol. Biol. 15, 510-522 (1981)
  STANDARD  simple staff_review
BASE COUNT       71 a     52 c     54 g     32 t      0 others
ORIGIN      
        1 catcgagagg gacacggcga atagccatcc caatcgacac cggggtcaac cggataagta
       61 gacagcctga taagtcgcac gaaaaacagg tattgacaac atgaagtaac atgcagtaag
      121 atacaaatcg ctaggtaaca ctagcagcgt caaccgggcg cacagtgcct tctaggtgac
      181 ttaagcgcac cacggcacat aaggtgaag
//
LOCUS       SEQPGH        879 bp ss-mRNA            VRT       31-JUL-1990
DEFINITION  S.quinqueradiata growth hormone (GH) mRNA, complete cds.
ACCESSION   M35627
KEYWORDS    growth hormone.
SOURCE      S.quinqueradiata (pituitary glands), cDNA to mRNA.
  ORGANISM  Seriola quinqueradiata
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; 
            Actinopterygii; Perciformes; Percoidei; Carangidae.
REFERENCE   1  (bases 1 to 879)
  AUTHORS   Watahiki,M., Tanaka,M., Masuda,N., Yamakawa,M., Yoneda,Y. and
            Nakashima,K.
  TITLE     cDNA cloning and primary structure of yellow tail (Seriola
            quinqueradiata) pregrowth hormone
  JOURNAL   Gen. Comp. Endocrinol. 70, 401-406 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         97      711     growth hormone (GH) precursor
    sigp         97      147     growth hormone signal peptide (pot.) (other
                                 potential cleavage site on bp 150)
    matp        150      708     growth hormone (pot.) (other matp starts on bp
                                 153)
BASE COUNT      226 a    239 c    192 g    222 t
ORIGIN      
        1 acaacacact gaacaactga actcagctca gattaaacag aacctgaacc tgaagcagac
       61 cctgaaccag aacctgaact tgaaccctga tcagccatgg acagagttgt ccttctgctg
      121 tcagtcctgt ctctgggcgt ctcctctcag ccaatcacag acagccagca tctgttctcc
      181 atcgctgtca gcagaatcca aaacctccac ctgctcgctc agagactctt ctccaacttc
      241 gagagtactc tgcagacgga ggaccagcgt caactcaaca aaatcttcct acaggatttc
      301 tgtaactctg attacatcat cagtcccatt gacaagcatg agacacaacg cagctctgtt
      361 ctgaagctgt tatcgatctc ctatcgattg gtggagtctt gggagttctc cagtcgcttt
      421 ctgtctggag gttctgctct gaggaaccag atttcaccca gactgtctga actcaagaca
      481 ggaatccaac tgctgatcac agccaatcag gacggagcag agatgttctc tgacgtctcg
      541 gccctccagc tcgctccata tggaaacttc tatcagagtc tgggaggcga agagttgctg
      601 agacgaaact acgaattgct ggcctgtttc aagaaggaca tgcacaaggt ggagacgtac
      661 ctgacggtgg ctaaatgtcg gctctctcca gaagctaact gcaccctgta gccccgtctc
      721 tccacagtga agcctgtcgt tgatgatgta atcctgtatg ttctcaagct ccgcccctat
      781 gttagcatta gtgtttacct ctgtcaggtg ctgaagtcca aactgatgat gtcatagtga
      841 tgtcatacat tcagcttgtg aaataaagtg tgttaattc
//
LOCUS       VIBRR5S       118 bp ss-rRNA            RNA       31-JUL-1990
DEFINITION  V.marinus 5S ribosomal RNA.
ACCESSION   M35658
KEYWORDS    5S ribosomal RNA.
SOURCE      V.marinus (strain MP-1, ATCC 15381) RNA.
  ORGANISM  Vibrio marinus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   MacDonell,M.T. and Colwell,R.R.
  TITLE     The nucleotide sequence of 5S ribosomal RNA from Vibrio marinus
  JOURNAL   Microbiol. Sci. 1, 229-231 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      118     5S rRNA
BASE COUNT       27 a     25 c     34 g     32 t
ORIGIN      
        1 tgtttggtga ccatagtgtt gtggtaccac ctgactccat tccgaactca gtagtgaaac
       61 gtaataacgc cgatggtagt gtggggtttc cccatgtgag agtaggtatc gccaggca
//

GenBank-Updates@genbank.bio.net (08/01/90)

LOCUS       RATGGLUT     1060 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat gamma-glutamyltransferase gene, 5' promoter region.
ACCESSION   J05515
KEYWORDS    gamma-glutamyltransferase.
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1060)
  AUTHORS   Rajagopalan,S., Park,J.-H., Patel,P.D., Lebovitz,R.M. and
            Lieberman,M.W.
  TITLE     Cloning and analysis of the rat gamma-glutamyltransferase gene
  JOURNAL   J. Biol. Chem. 265, 11721-11725 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Rajagopalan, 25-MAY-1990.
FEATURES       from  to/span     description
    mRNA        834  >  1060     gamm-glutamyltransferase mRNA
BASE COUNT      235 a    277 c    275 g    273 t
ORIGIN      
        1 cagctgcctt ctggaggacc aaactgttca ggggaaggac aggaagaaat gagcctgtgc
       61 cttcaggtca gagtcatgcc tagatctggg cgggagagct acaagggata ctgaccagga
      121 gatagggtgt tgtcccctcc cccctggggt ttggtatcct cctctgcctt aagagttgca
      181 aatcgacttt cccacataac aggcaccaaa tccagttagg accaacccca ccttccaatc
      241 caggggagag gaatgtcagc aatgcgtggg cgtgtccttc taatgtgttt tccttgagtg
      301 ttgtatgtgg accatctgca tgctcggtac ccagaggcca tcaggtctct tggaacagga
      361 attgttgatg tgaaatgcca tgtggttgct gggataggaa ctcaggactc cggaagaacc
      421 ttctcttctc cagtccccct ctgttgtttt tttttttttt ttttttgaga tacgatctca
      481 cactgtagca caggctaatc cagaactcac taggtaggtc agactgggct caaatcacag
      541 cgattctgct tctgcttcct gagtgccagg gtttgcaggt gttagctatc atgcccagtc
      601 ttaacatttc acacacgcca gtccaagtta ttaaaaaaca acccggcagt tgagggcagg
      661 gccctcaagt cccacaactg gtgcgtgcgt accaagtcca atgcgggaaa ggcctggacc
      721 cttgaaccct ttgggcggtt cacttgttag ctcttactac caaatcctgg gcttacacat
      781 gaatgccagc ccctccctgc ccagttctgt gacccccttc cccgggcagc tcttgggaga
      841 agtcatgcat acatggaggc ggtgccagcc tctttgactc cagagttcag cgggagacag
      901 agggagctca tcacatcagg caccccagaa gagttctggg cctgcttcac gtttaacttt
      961 gtgattttca ggagtaccag cctgctctaa cggtttcagg gaagattggc tgtgggtttc
     1021 cgcagagtgt gggggagttc ctgcttatcc atacagctga
//
LOCUS       ACMGAG        167 bp ss-RNA             VRL       01-AUG-1990
DEFINITION  Avian myelocytomatosis retrovirus gag gene, partial cds.
ACCESSION   M35626
KEYWORDS    gag protein; oncogene.
SOURCE      Avian myelocytomatosis retrovirus (mutant MC29-10H) RNA.
  ORGANISM  Avian myelocytomatosis retrovirus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Avian myelocytomatosis viruses.
REFERENCE   1  (bases 1 to 167)
  AUTHORS   Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and
            Patschinsky,T.
  TITLE     Structure of mutant and wild-type MC29 v-myc alleles and
            biochemical properties of their protein products
  JOURNAL   Oncogene 1, 97-109 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   167     gag protein (AA at 1)
BASE COUNT       38 a     47 c     52 g     30 t
ORIGIN      
        1 ggggaggagc ttgcgagtac aggtccgccc gtggtggcca tgcctgtagt gattaacaca
       61 gagggacccg cctggacccc tctggagcca aaattgatca caagactggc tgatacggtc
      121 aggaccaagg gcttacgatc cccgattact atagcggcgg ccactcg
//
LOCUS       ACMVMYC       333 bp ss-RNA             VRL       01-AUG-1990
DEFINITION  Avian myelocytomatosis retrovirus v-myc gene, partial cds.
ACCESSION   M35624
KEYWORDS    oncogene; v-myc protein.
SOURCE      Avian myelocytomatosis retrovirus (mutant MC29-10A) RNA.
  ORGANISM  Avian myelocytomatosis retrovirus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Avian myelocytomatosis viruses.
REFERENCE   1  (bases 1 to 333)
  AUTHORS   Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and
            Patschinsky,T.
  TITLE     Structure of mutant and wild-type MC29 v-myc alleles and
            biochemical properties of their protein products
  JOURNAL   Oncogene 1, 97-109 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   333     v-myc protein (AA at 1)
BASE COUNT       72 a    130 c     90 g     41 t
ORIGIN      
        1 ggcctctacc tgcacgacct gggagccgcg gccgccgact gcatcgaccc ctcggtggtc
       61 ttcccctacc cgctcagcga gcgcgccccg cgggccgccc cgcccggcgc caaccccgcg
      121 gctctgctgg gggtcgacac gccgcccacg atccaccaac acaactacgc tgctcctccc
      181 tccaccaagg tggaataccc agccgccaag aggctaaagt tggacagtgg cagggtcctc
      241 aaacagatca gcaacaaccg aaaatgctcc agtccccgca cgttagactc agaggagaac
      301 gacaagaggc gaacgcacaa cgtcttggag cgc
//
LOCUS       ACMVMYCA      202 bp ss-RNA             VRL       01-AUG-1990
DEFINITION  Avian myelocytomatosis retrovirus v-myc gene, partial cds.
ACCESSION   M35625
KEYWORDS    oncogene; v-myc protein.
SOURCE      Avian myelocytomatosis retrovirus (mutants MC29-10C and 10H) RNA.
  ORGANISM  Avian myelocytomatosis retrovirus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Avian myelocytomatosis viruses.
REFERENCE   1  (bases 1 to 202)
  AUTHORS   Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and
            Patschinsky,T.
  TITLE     Structure of mutant and wild-type MC29 v-myc alleles and
            biochemical properties of their protein products
  JOURNAL   Oncogene 1, 97-109 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   202     v-myc protein (AA at 1)
BASE COUNT       34 a     77 c     62 g     29 t
ORIGIN      
        1 ggcctctacc tgcacgacct gggagccgcg gccgccgact gcatcgaccc ctcggtcgtc
       61 ttcccctacc cgctcagcga gcgcgccccg cgggccgccc cgcccgacga caagaggcga
      121 acgcacaacg tcttggagcg ccagcgaagg aatgagctga agctgcgttt ctttgccctg
      181 cgtgaccaga tacccgaggt gg
//
LOCUS       HUM3BHSD     1565 bp ss-mRNA            PRI       01-AUG-1990
DEFINITION  Human placental 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase
            mRNA, complete cds.
ACCESSION   M35493
KEYWORDS    3-beta-hydroxysteroid dehydrogenase/5-4-isomerase.
SOURCE      Human placenta, cDNA to mRNA, clone H3-beta-hp6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Lorence,M.C., Murry,B.A., Trant,J.M. and Mason,J.I.
  TITLE     Human 3-beta-hydroxysteroid dehydrogenase/delta-5->4isomerase from
            placenta: Expression in nonsteroidogenic cells of a protein that
            catalyzes the dehydrogenation/isomerization of C21 and C19 steroids
  JOURNAL   Endocrinology 126, 2493-2498 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         31     1152     3-beta-hydroxysteroid
                                 dehydrogenase/5-4-isomerase
    mRNA    <     1     1565     3-beta-hydroxysteroid
                                 dehydrogenase/5-4-isomerase
BASE COUNT      417 a    381 c    376 g    391 t
ORIGIN      
        1 gcggagtgat tcctgctact ttggatggcc atgacgggct ggagctgcct tgtgacagga
       61 gcaggagggt ttctgggaca gaggatcatc cgcctcttgg tgaaggagaa ggagctgaag
      121 gagatcaggg tcttggacaa ggccttcgga ccagaattga gagaggaatt ttctaaactc
      181 cagaacaaga ccaagctgac agtgctggaa ggagacattc tggatgagcc attcctgaag
      241 agagcctgcc aggacgtctc ggtcatcatc cacaccgcct gtatcattga tgtcttcggt
      301 gtcactcaca gagagtctat catgaatgtc aatgtgaaag gtacccagct cctgttagag
      361 gcctgtgtcc aagctagtgt gccagtcttc atctacacca gtagcataga ggtagccggg
      421 cccaactcct acaaggaaat catccagaat ggccatgaag aagagcctct ggaaaacaca
      481 tggcccgctc catacccaca cagcaaaaag cttgctgaga aggctgtact ggcggctaac
      541 gggtggaatc tgaaaaacgg cggcaccctg tacacttgtg ccttacgacc catgtatatc
      601 tatggggaag gaagccgatt cctttctgct agtataaacg aggccctgaa caacaatggg
      661 atcctgtcaa gtgttggaaa gttctccact gttaacccag tctatgttgg caatgtggcc
      721 tgggcccaca ttctggcctt gagggccctg caggacccca agaaggcccc aagcatccga
      781 ggacagttct actatatctc agatgacacg cctcaccaaa gctatgataa ccttaattac
      841 accctgagca aagagttcgg cctccgcctt gattccagat ggagctttcc tttatccctg
      901 atgtattgga ttggcttcct gctggaaata gtgagcttcc tactcaggcc aatttacacc
      961 tatcgaccgc ccttcaaccg ccacatagtc acattgtcaa atagcgtatt caccttctct
     1021 tataagaagg ctcagcgaga tctggcgtat aagccactct acagctggga ggaagccaag
     1081 cagaaaacgg tggagtgggt tggttccctt gtggaccggc acaaggagac cctgaagtcc
     1141 aagactcagt gatttaagga tgacagagat gtgcatgtgg gtattgttag gagatgtcat
     1201 caagctccac cctcctggcc tcatacagaa agtgacaagg gcacaagctc aggtcctgct
     1261 gcctcccttt catacaatgg ccaacttatt gtattcctca tgtcatcaaa acctgcgcag
     1321 tcattggccc aacaagaagg tttctgtcct aatcatatac cagaggaaag accatgtggt
     1381 ttgctgttac caaatctcag tagctgattc tgaacaattt agggactctt ttaacttgag
     1441 ggtcgttttg actactagag ctccatttct actcttaaat gagaaaggat ttcctttctt
     1501 tttaatcttc cattccttca catagtttga taaaaagatc aataaatgtt tgaatgttta
     1561 atgtg
//
LOCUS       HUMMHB7B     1089 bp ss-mRNA            PRI       01-AUG-1990
DEFINITION  Human class I HLA-B7 mRNA, complete cds.
ACCESSION   M35444
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1089)
  AUTHORS   Parham,P., Benjamin,R.J., Chen,B.P., Clayberger,C., Ennis,P.D.,
            Krensky,A.M., Lawlor,D.A., Littman,D.R., Norment,A.M., Orr,H.T.,
            Salter,R.D. and Zemmour,J.
  TITLE     Diversity of class I HLA molecules: Functional and evolutionary
            interactions with T cells
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 529-543 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1     1089     MHC HLA-B7 /hgml_locus_uid="LX0031C"
                                 /nomgen="HLA-A" /map="6p21.3"
BASE COUNT      218 a    335 c    363 g    173 t
ORIGIN      
        1 atgctggtca tggcgccccg aaccgtcctc ctgctgctct cggcggccct ggccctgacc
       61 gagacctggg ccggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc
      121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc
      181 gacagcgacg ccgcgagtcc gagagaggag ccgcgggcgc cgtggataga gcaggagggg
      241 ccggagtatt gggaccggaa cacacagatc tacaaggccc aggcacagac tgaccgagag
      301 agcctgcgga acctgcgcgg ctactacaac cagagcgagg ccgggtctca caccctccag
      361 agcatgtacg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtac
      421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcctg gaccgccgcg
      481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagcgg
      541 agagcctacc tggagggcga gtgcgtggag tggctccgca gatacctgga gaacgggaag
      601 gacaagctgg agcgcgctga ccccccaaag acacacgtga cccaccaccc catctctgac
      661 catgaggcca ccctgaggtg ctgggccctg ggtttctacc ctgcggagat cacactgacc
      721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca
      781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga
      841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagccg
      901 tcttcccagt ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt
      961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagttc aggtggaaaa
     1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc
     1081 acagcttga
//
LOCUS       MUSMUPE       872 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse major urinary protein mRNA, complete cds.
ACCESSION   M28649
KEYWORDS    major urinary protein.
SOURCE      Mouse liver, cDNA to mRNA, clones 8-1 and 13-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 872)
  AUTHORS   Bennett,A.L., Paulson,K.E., Miller,R.E. and Darnell,J.E.Jr.
  TITLE     Aquisition of antigens characteristic of adult pericentral
            hepatocytes by differentiating fetal hepatoblasts in vitro
  JOURNAL   J. Cell Biol. 105, 1073-1085 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         65      601     major urinary protein
    mRNA         43      872     major urinary protein
BASE COUNT      266 a    188 c    170 g    248 t
ORIGIN      
        1 gccacgatca caagaaagat gtggtcctga cagacagaca atcctattcc ctaccaaaat
       61 gaagatgctg ctgctgctgt gtttgggact gaccctagtc tgtgtccatg cagaagaagc
      121 tagttctacg ggaaggaact ttaatgtaga aaagattaat ggggaatggc atactattat
      181 cctggccttt gacaaaagag aaaagataga agataatggc aactttagac tttttctgga
      241 gcaaatccat gtcttggaga attccttagt tcttaaattc catactgtaa gagatgaaga
      301 gtgctcggaa ttatctatgg ttgctgacaa aacagaaaag gctggtgaat attctgtgac
      361 gtatgatgga ttcaatacat ttactatacc taagacagac tatgataact ttcttatggc
      421 tcatctcatt aacgaaaatg atggggaaac cttccagctg atggggctct atggccgaga
      481 accagatttg agttcagaca tcaaggaaag gtttgcacaa ctatgtgaga agcatggaat
      541 ccttagagaa aatatcattg acctatccaa tgccaatcgc tgcctccagg cccgagaatg
      601 aagaatggcc tgagcctcca gtgttgagtg gagacttctc accaggactc caccatcatc
      661 ccttcctatc catacagcat ccccagtata aattctgtga tctgcattcc atcctgtctc
      721 actgagaagt ccaattccag tctatccaca tgttacctag gatacctcat caagaatcaa
      781 agacttcttt aaatttttct ttgatatacc catgacaatt tttcatgaat ttcttcctct
      841 tcctgttcaa taaatgatta cccttgcact ta
//
LOCUS       RATMHREC     1552 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Rat MHC class I IgG Fc region receptor large subunit p51 (FcRn)
            mRNA, complete cds.
ACCESSION   M35495
KEYWORDS    IgG Fc region receptor large subunit p51; cell surface antigen;
            cell surface glycoprotein; class I gene; integral membrane protein;
            major histocompatibility complex.
SOURCE      Rat 11 day old epithelium, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1552)
  AUTHORS   Simister,N.E. and Mostov,K.E.
  TITLE     Cloning and expression of the neonatal rat intestinal FC receptor,
            a major histocompatibilty complex class I antigen homolog
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 571-580 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        205     1305     IgG Fc region receptor large subunit p51
                                 (FcRn) precursor
    sigp        205      270     IgG Fc region receptor large subunit p51
                                 signal peptide
    matp        271     1302     IgG Fc region receptor large subunit p51
    mRNA    <     1     1552     FcRn mRNA
BASE COUNT      312 a    420 c    443 g    377 t
ORIGIN      
        1 tcagttctgt aattaattaa ctaacgtgga tcaaatgaga aggtgaaagt tcacacagga
       61 gcactcctgt cgtcttggac tgggtctcca tcccaccatc cagtgccctg gtctacgaag
      121 agtccacagg gaccttgtga agaatcaaca aggcggggtc cagaggagtc acgtgtgcct
      181 tccactccgg gtcgccctgt caggatgggg atgtcccagc ccggggtcct cctcagcctc
      241 ttattggtcc tcctgcctca gacctgggga gcggagcccc gtctcccact gatgtatcat
      301 cttgcagctg tgtctgactt atcaacgggg cttccctctt tctgggccac gggctggctg
      361 ggtgctcagc aatatctgac ctacaacaac ctgcggcagg aggctgaccc ctgtggggcc
      421 tggatatggg aaaaccaggt gtcttggtat tgggagaagg agaccacgga tctgaaaagc
      481 aaagaacagc tcttcttgga ggccatcagg accctggaga accaaataaa tgggaccttc
      541 acactgcagg gcctgctggg ctgtgaactg gcccctgata attcttcatt gcccacggct
      601 gtgtttgccc tcaatggtga ggagttcatg cggttcaacc caagaacggg caactggagt
      661 ggggagtggc cggagacaga tatcgttggt aatctgtgga tgaagcaacc tgaggcggcc
      721 aggaaggaga gcgagttcct gctaacttct tgtcctgagc ggctgctagg ccacctggag
      781 aggggccgtc agaacctgga gtggaaggag ccgccatcta tgcgcctgaa ggcccgtcct
      841 ggcaactctg gctcctcagt actgacctgt gctgctttct ccttctaccc gccggagctc
      901 aagtttcgat tcctgcgcaa tgggctagcc tcaggctctg ggaattgcag cactggtccc
      961 aatggtgatg gatctttcca tgcatggtca ttgctagagg tcaaacgtgg agatgaacac
     1021 cattaccaat gtcaagtgga gcatgagggg ctggcccagc ctctcactgt ggacctagat
     1081 tcgcccgcca gatcttctgt gcctgtggtc ggaatcattc ttggtttatt gctggtggta
     1141 gtggccatcg cagggggtgt gctgctatgg aacaggatgc gaagtgggct gccagcccca
     1201 tggctttctc tcagtggtga tgactctggc gacctattgc ctggtgggaa cttgcccccg
     1261 gaggctgaac ctcaaggtgt aaatgccttt ccggccactt cctgatgcca acccaggccc
     1321 catacccatt gcagcctgtg gggctgtgtg acctcctgaa ctgtctctga gcctcccgag
     1381 ggagccctgg gctggatgtc ctcctcgtgg atcccttctt ttgtggcctg cttcagtttc
     1441 ccctcttaat gtcaatggct atttccatct ccacataaat ttgggcccaa atctgtgtgt
     1501 gcatcgttat tctcaggttt caggcagccg gaataaattg aacaagtttg ag
//
LOCUS       YSCATP10     2343 bp ds-DNA             PLN       01-AUG-1990
DEFINITION  S.cerevisiae ATP10 (essential for mitochondrial ATPase complex
            assembly) gene, complete cds.
ACCESSION   J05463
KEYWORDS    .
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2343)
  AUTHORS   Ackerman,S.H. and Tzagoloff,A.
  TITLE     ATP10, a yeast nuclear gene required for the assembly of the
            mitochondrial F1-Fo complex
  JOURNAL   J. Biol. Chem. 265, 9952-9959 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Tzagoloff, 19-JUN-1990.
            
            The gene sequence submitted codes for a protein that is essential
            for the biosynthesis of the F1-F0 ATPase complex of the
            mitochondrial inner membrane.
            
            Bases 1 to 977 are shown on the complementary strand as shown in
            Fig. 6.
FEATURES       from  to/span     description
    pept        629  <     1 (c) ORF
    pept        976      680 (c) ORF
    pept       1444     2238     ATP10 protein
BASE COUNT      692 a    447 c    490 g    714 t
ORIGIN      
        1 agatcttttg gctcaggtat aaattcgaac gtctcgattt cccttatcag tttatggaat
       61 ggcttaaacc aagatgaaga tttccctagc tctaattgaa cttgcaccat atatataact
      121 tttccaaaag agtaaaaata caaatccaga ttatcgattt tattaaattc ttgccaatga
      181 ctattgaacg taggtgggag tcgggcatta cttcttgtca caaacgctac tgtctttgcc
      241 gtatgatttt tcagacattc aggctttctg ggaaacttgt caaattgaaa gctataatta
      301 tatgaacctg gtttaacttt gaacggcttg gaggagccat caagagcatt ccatacatta
      361 tctggaggga aaactctctg ttcgaatttc attaatgtat gaaaggattt gttgtcttgg
      421 cccggcatca tcatgccatt ctgttgaaac atgtactctt gatcaatttt tgttaaggtc
      481 tctgagaatc cttttagaat gacggaaatt ttccttatag atagcgcttt tgttaactga
      541 agactaacta tccctgacat ttgatcatta gagctataaa actccccgtt gtacggtggg
      601 tttaaggata ttgaaatttt tggagccatg gtttgacaaa ctgtatggtt ctcaaccttc
      661 tctaatcaaa agcagaatct taaatataaa cactcacaga atatccgttg gtcaatgaag
      721 taattctcct ttgtactggc tgctttttct cctctagttt atgtaattct acttttggat
      781 gggtgcgact gcttttaatt gattgagtgg cggtgttaga agggctgtag agtcgaaggc
      841 ttgtttctct cttacgcacc tcttgtgaaa agggcgtgca ccttccccag gaccctctct
      901 caccctcaac ccgcattttg ctgagaattt tcaccaaggc cctaggtgat attagattcc
      961 acctgactaa ttgcattaca gccgacccaa ggcaatatca gtttaataaa atatcatgta
     1021 tctcaccctc ttcttggtat tagtaaagag acgcctgatc ttgtaacagt ggtgaagatt
     1081 gtactagagc agaatcaaga atttaaaagt gtaaggcagg cagaggcgat gtacataaac
     1141 ttcgaagtaa gaaatattta atagttctcg ccacatcact atgcagctat ataaaaacta
     1201 ctataaacgt ttgttttgtt ccttacgcac aatatccttg cctagaaatc gtttttgaaa
     1261 tttaaatttt tattaccatt tatttgattc gccttcagaa aaatatggaa gagtgcatat
     1321 ttaaaaagga ctatttcagc atatagtaaa agtcaggtta tttgtttatt tgcgatatca
     1381 gagtaactta aactaactat gcagggcact tttaaaaggt tttaccatcc cacgcttacg
     1441 cggatgtcct tcttggataa attcctcaag cctatgatgg caacggcttc cccaaaggaa
     1501 taccagatca aacaactggt caagccaata ggcttaacac aagcaccaag gaaaagcacc
     1561 aaatactccc aggggaactc tttgagggat atgtttgatt cggaaaagac aaaccacaga
     1621 gttaaagagt tggccgttga attcagcaaa tctggacttt atgacgtgca agtcttccaa
     1681 aagacaaagg ggaaattgtt tatagctcca gtttcatatt ggaaagaaga taaagctttg
     1741 ttttttcctc atttgatagg aacggcaatg gatggtacga aacaacagaa tatcgaggat
     1801 atgttaaggg gtaaaaccag tatagtgagg ttatttagta cagcatctgg cgataagttg
     1861 agtagttcat acttccaagg aatcgtagac gataacaaaa aaactgacta cttgactgaa
     1921 gctgatgcgc gtttaagttt aaatgacagt aacgtccaaa tcatcgaggt caatcttgta
     1981 gaaaacgctg tgaaaagtgc tctagtgaaa acgcttgctc gttgggccaa tcgcgttcca
     2041 tcctggcgcc agccatttta tttcgaatat tctagaggcc aatggccatt ttccgtcagg
     2101 gaagagctct tttgcaataa tgtcttttct ggatacgtct ttcttgtgga ccagcagtta
     2161 aaaattaggt gggcagcttg cggggaggct actccatctg aaaaggaagc attgtggaag
     2221 tttgccaaac gtctgtgaag ttgacgcttt gtgcggcggc caacaaggga tgggcggcta
     2281 tttggcgatc cacaggacgg gtgtggtcgc catgatcgcg tagtcgatag tggctccaag
     2341 tag
//
LOCUS       DDIDPYK1A    1090 bp ss-mRNA            INV       01-AUG-1990
DEFINITION  D.discoideum protein-tyrosine kinase-1 (DPYK1) mRNA, complete cds.
ACCESSION   M33785
KEYWORDS    protein-tyrosine kinase-1.
SOURCE      D.discoideum (strain AX-3) 4-hour, cDNA to mRNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 1090)
  AUTHORS   Tan,J.L. and Spudich,J.A.
  TITLE     Developmentally regulated protein-tyrosine kinase genes in
            Dictyostelium discoideum
  JOURNAL   Mol. Cell. Biol. 10, 3578-3583 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.L.Tan, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1     1014     protein-tyrosine kinase-1 (DPYK1; AA at 1)
BASE COUNT      371 a    198 c    186 g    335 t
ORIGIN      
        1 cgcccatttg gtggttggga aactcaatca tcattatcac atccaccatc acgtccacca
       61 ccacctccac caccaccacc acaactacca gttagatcag aatacgagat tgatttcaat
      121 gaattagaat ttggtcaaac cattggtaaa ggtttctttg gtgaagtaaa gagaggttat
      181 tggagagaga ctgatgttgc cataaaaatc atctatcgtg atcaattcaa aaccaaatca
      241 tcattggtta tgtttcaaaa tgaagttgga atactaagta aattaagaca tccaaatgta
      301 gttcaatttt tgggtgcatg tactgcagga ggtgaagatc atcattgtat agtaacagaa
      361 tggatgggtg gaggtagttt aagacagttc ttgactgatc atttcaattt actcgaacaa
      421 aatccacata ttcgtttgaa gttggctttg gatattgcaa aaggaatgaa ttatctacat
      481 ggttggactc cacccattct tcatcgtgac ttatcctcaa gaaacatttt attggatcac
      541 aacatcgatc caaagaatcc gttagtttcc tcaagacaag atattaaatg taagatctct
      601 gattttggtc taagtagatt aaagaaggaa caagcctctc aaatgactca atcggttggt
      661 tgtattccct acatggcacc agaggttttc aaaggcgata gtaatagtga aaagagtgat
      721 gtttactcct atggcatggt tttgtttgaa ctattaacct ctgatgaacc tcaacaagat
      781 atgaaaccaa tgaaaatggc tcacttggct gcttatgaat cttatcgtcc tccaattcca
      841 ttaactacct cttccaagtg gaaagaaatt ctaactcaat gttgggattc taatcctgat
      901 agtcgtccaa cctttaaaca aatcattgtt catctcaaag aaatggaaga tcaaggtgta
      961 tcttcttttg catctgtacc tgttcaaact attgatactg gtgtttatgc ttaatttttt
     1021 ttttataatt aaaaaaaaaa aaaacaaaac aaaaaaaaaa aataataata aatataatca
     1081 cttcaactcg
//
LOCUS       DDIDPYK2A    1291 bp ss-mRNA            INV       01-AUG-1990
DEFINITION  D.discoideum protein-tyrosine kinase-2 (DPYK2) mRNA, complete cds.
ACCESSION   M33784
KEYWORDS    protein-tyrosine kinase-2.
SOURCE      D.discoideum (strain AX-3) 4-hour, cDNA to mRNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 1291)
  AUTHORS   Tan,J.L. and Spudich,J.A.
  TITLE     Developmentally regulated protein-tyrosine kinase genes in
            Dictyostelium discoideum
  JOURNAL   Mol. Cell. Biol. 10, 3578-3583 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.L.Tan, 20-APR-1990.
FEATURES       from  to/span     description
    pept    <     1     1233     protein-tyrosine kinase-2 (DPYK2; AA at 1)
BASE COUNT      491 a    207 c    203 g    390 t
ORIGIN      
        1 cgattctaca atacaacaaa ctctactaaa gatatcacat ttttagtttg tgataatcct
       61 gattcaacta aagaaaagag taacgtttca aatacttcat caataatttc cgcttcaaat
      121 ttaaatagac atataacacc aaattctcat atgagaccta gaggtagatc aatttctgaa
      181 tctttaatta tgtcaccaat taataaagaa tctttaaatg atattcaaag agcaattgaa
      241 agtgaaaaaa taaagaaaac taaatttgaa gaattaaaat caatattggg cgaaagagaa
      301 tatataattg atataaatga tattcaattt atacaaaaag ttggagaagg tgcattcagt
      361 gaagtttggg aaggttggtg gaaaggtatt catgttgcca taaaaaagtt aaagattata
      421 ggagatgaag aacaattcaa agagagattc attagagagg ttcaaaattt gaaaaaagga
      481 aatcatcaaa acattgtcat gtttattggt gcatgttata aaccagcatg tatcataaca
      541 gagtatatgg caggtggtag tctttacaat atacttcata atccaaatag ttccactcca
      601 aaagttaaat attctttccc attggttttg aaaatggcaa ccgacatggc attgggctta
      661 ttacatcttc attccatcac cattgtgcat cgtgatttaa ccagtcaaaa cattctattg
      721 gatgaattgg gtaatataaa gatctctgat tttggtttat ctgctgaaaa gagtagagaa
      781 ggttcaatga caatgacaaa tggtggcatt tgcaatccaa gatggagacc acccgaattg
      841 acaaagaatt taggtcacta ctcggaaaag gttgatgtct attgtttctc tctagtagtt
      901 tgggaaattt taactggcga aattcctttc tctgatttag atggatctca acgatccgct
      961 caagtagctt atgctggttt aagaccacca ataccagagt attgcgatcc tgaattaaaa
     1021 ttactcttaa ctcaatgttg ggaggctgat ccaaatgata gacctccctt tacctatata
     1081 gtaaacaaat taaaagaaat ctcttggaat aatccaattg gtttcgtctc tgatcaattc
     1141 tatcaatata gcgaaccttc aactccaaga ttagcattat caaatcaatc ttcaaattca
     1201 agtagtattt ctttatcacc aactaaatta taaaaaaaaa aaaaaaaaaa aacaaatttc
     1261 aaacaccaaa caccaccact catcaaaatc g
//
LOCUS       HUMSPTB      6765 bp ss-mRNA            PRI       01-AUG-1990
DEFINITION  Human beta-spectrin (SPTB) mRNA, complete cds.
ACCESSION   J05500
KEYWORDS    beta-spectrin; spectrin.
SOURCE      Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and
            V252.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 6765)
  AUTHORS   Winkelmann,J.C., Chang,J.-G., Tse,W.T., Scarpa,A.L.,
            Marchesi,V.T. and Forget,B.G.
  TITLE     Full length sequence of the cDNA for human erythroid beta-spectrin
  JOURNAL   J. Biol. Chem. 265, 11827-11832 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Winkelmann, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         96     6509     beta-spectrin /nomgen="SPTB" /map="14"
                                 /hgml_locus_uid="LS0033T"
    mRNA    <     1     6765     SPTB mRNA
    signal     6716     6722     poly-A signal
BASE COUNT     1626 a   1822 c   2146 g   1171 t
ORIGIN      Chromosome 14q23-q24.
        1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag
       61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa
      121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg
      181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg
      241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg
      301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca
      361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc
      421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac
      481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc
      541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag
      601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag
      661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta
      721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca
      781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc
      841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct
      901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc
      961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg
     1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca
     1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca
     1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca
     1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag
     1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg
     1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc
     1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata
     1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga
     1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg
     1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc
     1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg
     1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg
     1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga
     1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg
     1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc
     1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg
     1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg
     2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc
     2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc
     2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt
     2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga
     2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc
     2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg
     2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc
     2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat
     2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgcaggcc ctgcgggagc
     2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg
     2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga
     2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca
     2761 ggttcgacat cctggaccag gagatgaaga ccttgatgac tcagattgat ggtgtgaacc
     2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc
     2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg
     2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt
     3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag
     3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg
     3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga
     3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat
     3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg
     3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca
     3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg
     3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc
     3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg
     3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg
     3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat
     3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga
     3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg
     3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg
     3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc
     3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct
     3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc
     4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc
     4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc
     4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa
     4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg
     4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcggtcagac gacccgggca
     4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga
     4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg
     4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa
     4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg
     4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta
     4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg
     4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg
     4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc
     4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt
     4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct
     4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc
     4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg
     5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg
     5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca
     5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa
     5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc
     5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca
     5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg
     5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc
     5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga
     5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca
     5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg
     5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg
     5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc
     5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct
     5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc
     5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca
     5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc
     6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt
     6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc
     6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg
     6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca
     6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg
     6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga
     6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg
     6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac
     6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg
     6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg
     6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag
     6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa
     6721 attttattgt ataaatatat caccatttac atgaggggaa acact
//
LOCUS       STYEUTBC     2526 bp ds-DNA             BCT       01-AUG-1990
DEFINITION  S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes,
            complete cds.
ACCESSION   J05518
KEYWORDS    ethanolamine ammonia-lyase.
SOURCE      S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2526)
  AUTHORS   Faust,L.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M.
  TITLE     Cloning, sequencing and expression of the genes encoding the
            alcohol-dependent ethanolamine ammonia-lyase of Salmonella
            typhimurium
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.M.Babior, 08-MAY-1990.
FEATURES       from  to/span     description
    pept        141     1499     ethanolamine ammonia-lyase (eutB)
    pept       1518     2378     ethanolamine ammonia-lyase (eutC)
    binding     130      133     ribosome binding site
    binding    1507     1510     ribosome binding site
BASE COUNT      563 a    687 c    779 g    497 t
ORIGIN      
        1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga
       61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt
      121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt
      181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg
      241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa
      301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc
      361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca
      421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc
      481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc
      541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc
      601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg
      661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc
      721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga
      781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc
      841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct
      901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt
      961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct
     1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg
     1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc
     1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga
     1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc
     1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc
     1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg
     1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg
     1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat
     1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg
     1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc
     1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg
     1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc
     1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg
     1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg
     1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc
     1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg
     1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga
     2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact
     2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca
     2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg
     2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc
     2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg
     2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga
     2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg
     2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt
     2521 gcctcc
//
LOCUS       XELPCNA      1018 bp ss-mRNA            VRT       01-AUG-1990
DEFINITION  X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete
            cds.
ACCESSION   M34080
KEYWORDS    nuclear protein; proliferating cell nuclear antigen.
SOURCE      X.laevis oocyte, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1018)
  AUTHORS   Leibovici,M., Gusse,M., Bravo,R. and Mechali,M.
  TITLE     Characterization and developmental expression of Xenopus
            proliferating cell nuclear antigen (PCNA)
  JOURNAL   Dev. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Leibovici, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         28      813     proliferating cell nuclear antigen (PCNA)
    mRNA    <     1     1018     PCNA mRNA
BASE COUNT      284 a    223 c    237 g    274 t
ORIGIN      
        1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc
       61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca
      121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact
      181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag
      241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg
      301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt
      361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa
      421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc
      481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca
      541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa
      601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat
      661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca
      721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg
      781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc
      841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa
      901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa
      961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac
//
LOCUS       YSCAAC2A     1333 bp ds-DNA             PLN       01-AUG-1990
DEFINITION  S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete
            cds.
ACCESSION   M34076 J05542
KEYWORDS    ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE      S.cerevisiae (strain W303-1B) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1333)
  AUTHORS   Kolarov,J., Kolarova,N. and Nelson,N.
  TITLE     A third ADP/ATP-translocator in yeast
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Nelson, 08-MAY-1990.
FEATURES       from  to/span     description
    pept        235     1158     ADP/ATP-translocator protein (AAC2)
BASE COUNT      388 a    209 c    301 g    435 t
ORIGIN      
        1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc
       61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg
      121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa
      181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt
      241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt
      301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat
      361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt
      421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat
      481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg
      541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct
      601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc
      661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg
      721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc
      781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct
      841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg
      901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga
      961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa
     1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg
     1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc
     1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc
     1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag
     1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc
     1321 agccactgtt gtt
//
LOCUS       YSCAAC3      1308 bp ds-DNA             PLN       01-AUG-1990
DEFINITION  S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete
            cds.
ACCESSION   M34075 J05542
KEYWORDS    ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE      S.cerevisiae (strain W303-1B) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1308)
  AUTHORS   Kolarov,J., Kolarova,N. and Nelson,N.
  TITLE     A third ADP/ATP-translocator in yeast
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Nelson, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         78     1034     ADP/ATP-translocator protein (AAC3)
BASE COUNT      353 a    228 c    263 g    464 t
ORIGIN      
        1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata
       61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc
      121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg
      181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag
      241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca
      301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta
      361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt
      421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg
      481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat
      541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg
      601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac
      661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga
      721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt
      781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa
      841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga
      901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag
      961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta
     1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc
     1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata
     1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa
     1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc
     1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt
//
LOCUS       HUMMTVA1      367 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 13) mitochondrial DNA sequences, 5' end.
ACCESSION   M28909
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 367)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      121 a    123 c     42 g     80 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
      121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
      181 agcaagtaca gcaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc
      241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
      301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat cacccccctc
      361 agatagg
//
LOCUS       HUMMTVA2      361 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 13) mitochondrial DNA sequences, 3' end.
ACCESSION   M28910
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 361)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      107 a    102 c     54 g     93 t      5 others
ORIGIN      
        1 tttggtattt tcgtctgggg ggtgtgcacg cgatagcatt gcgagacgct ggagccggag
       61 caccctatgt cgcagtatct gtctttgatt cctgccccat cctattattt atcgcaccta
      121 cgttcaatat tacaggcgaa catacnctac taaagtgtgt taattaatta atgcttgtag
      181 gacataataa taacaattaa atgtctgcac agccactttc cacacagaca tcataacaaa
      241 aaatttncca ccaaaccccc ccnnntcccc ccgcttctgg ccacagcact taaacacatc
      301 tctgccaaac cccaaaaaca aagaacccta acaccagcct aaccagattt caaattttat
      361 c
//
LOCUS       HUMMTVB1      367 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 14) mitochondrial DNA sequences, 5' end.
ACCESSION   M28911
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 367)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      121 a    120 c     43 g     82 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
      121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
      181 agcaagtaca gcaatcaacc ttcaactgtc acatatcaac cgtaactcca aagccacccc
      241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
      301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
      361 agatagg
//
LOCUS       HUMMTVB2      356 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 14) mitochondrial DNA sequences, 3' end.
ACCESSION   M28912
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 356)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      105 a    104 c     52 g     90 t      5 others
ORIGIN      
        1 ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc tggagccgga gcaccctatg
       61 tcgcagtatc tgtctttgat tcctgcccca tcccattatt tatcgcacct acgttcaata
      121 ttacaggcga acatacncta ctaaagtgtg ttaattaatt aatgcttgta ggacataata
      181 ataacaattn aatgtctgca cagccacttt ccacacagac atcataacaa aaaatttncc
      241 accaaacccc ccccnntccc cccgcttctg gccacagcac ttaaacacat ctctgccaaa
      301 ccccaaaaac aaagaaccct aacaccagcc taaccagatt tcaaatttta tctttt
//
LOCUS       HUMMTVC1      367 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 11) mitochondrial DNA sequences, 5' end.
ACCESSION   M28905
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 367)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      119 a    123 c     45 g     79 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
      121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
      181 agcaagtacg gcaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc
      241 tcacccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca
      301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
      361 agatagg
//
LOCUS       HUMMTVC2      371 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 11) mitochondrial DNA sequences, 3' end.
ACCESSION   M28906
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 371)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      109 a    107 c     55 g     94 t      6 others
ORIGIN      
        1 tctnccatgc atttggtatt ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc
       61 tggagccgga gcaccctatg tcgcagcacc tgtctttgat tcctgcccca ttccattatt
      121 tatcgcacct acgttcaata ttacaggcga acatacncta ctaaagtgtg ttaattaatt
      181 aatgcttgta ggacataata ataacaatta aatgtctgca cagccacttt ccacacagac
      241 atcataacaa aaaatttncc accaaacccc cccnnntccc cccgcttctg gccacagcac
      301 ttaaacacat ctctgccaaa ccccaaaaac aaagaaccct aacaccagcc taaccagatt
      361 tcaaatttta t
//
LOCUS       HUMMTVD1      368 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 12) mitochondrial DNA sequences, 5' end.
ACCESSION   M28907
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 368)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      119 a    122 c     46 g     80 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacggtac cataaatact
      121 tgaccacctg tagtacataa aaacccanac ccacatcaaa accctccccc catgcttaca
      181 agcaagcaca gcaatcaacc ttcaactgtc acacatcaac tgcaactcca aagccacccc
      241 tcacccacta ggatatcaac aaacctactc acccttaaca gtacatagca cataaagcca
      301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
      361 agataggg
//
LOCUS       HUMMTVD2      375 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 12) mitochondrial DNA sequences, 3' end.
ACCESSION   M28908
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 375)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      109 a    104 c     59 g     97 t      6 others
ORIGIN      
        1 ggaggctctn ccatgcattt ggtattttcg tctggggggt gtgcacgcga tagcattgcg
       61 agacgctgga gccggagcac cctatgtgca gtatctgtct ttgattcctg ccccattcca
      121 ttatttatcg cacctacgtt caatattaca ggcgagcata cnctattaaa gtgtattaat
      181 taattaatgc ttgtaggaca taataataac aattaaatgt ctgcacagcc actttccaca
      241 cagatcataa caaaaaattt nccaccaaac ccccccnnnt ccccccgctt ctggccacag
      301 cacttaaaca catctctgcc aaaccccaaa aacaaagaac cctaacacca gcctaaccag
      361 atttcaaatt ttatc
//
LOCUS       HUMMTVE1      367 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 1-4) mitochondrial DNA sequences, 5' end.
ACCESSION   M28893
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 367)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      120 a    121 c     44 g     81 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
      121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
      181 agcaagtaca gcaatcaacc ttcaactgtc acacattaac cgcaactcca aagccacccc
      241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
      301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
      361 agatagg
//
LOCUS       HUMMTVE2      362 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 1-4) mitochondrial DNA sequences, 3' end.
ACCESSION   M28894
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 362)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      108 a    102 c     54 g     93 t      5 others
ORIGIN      
        1 catttggtat tttcgtctgg ggggtgtgca cgcgatagca ttgcgagacg ctggagccgg
       61 agcaccctat gtcgcagtat ctgtctttga ttcctgcccc atcctattat ttatcgcacc
      121 tacgttcaat attacaggcg aacatacnct actaaagtgt gttaattaat taatgcttgt
      181 aggacataat aataacaatt aaatgtctgc acagccactt tccacacaga catcataaca
      241 aaaaatttnc caccaaaccc ccccnnntcc ccccgcttct ggccacagca cttaaacaca
      301 tctctgccaa accccaaaaa caaagaaccc taacaccagc ctaaccagat ttcaaatttt
      361 at
//
LOCUS       HUMMTVF1      369 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 7) mitochondrial DNA sequences, 5' end.
ACCESSION   M28899
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 369)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      120 a    122 c     47 g     79 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatc tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
      121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
      181 agcaagtaca gcaatcaacc ctcaactgtc atacatcaac cgcaactcca aagccactcc
      241 tcagccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca
      301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
      361 agatagggg
//
LOCUS       HUMMTVF2      371 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 7) mitochondrial DNA sequences, 3' end.
ACCESSION   M28900
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 371)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      109 a    107 c     55 g     94 t      6 others
ORIGIN      
        1 tctnccatgc atttggtatt ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc
       61 tggagccgga gcaccctatg tcgcagtatc tgtctttgat tcctgcccca tcccattatt
      121 tatcgcacct acgttcaata ttacaggcga acatacncta ccaaagtgtg ttaattaatt
      181 aatgcttgta ggacataata ataacaatta aatgtctgca cagccacttt ccacacagac
      241 atcataacaa aaaatttncc accaaacccc cccnnntccc cccgcttctg gccacagcac
      301 ttaaacacat ctctgccaaa ccccaaaaac aaagaaccct aacaccagcc taaccagatt
      361 tcaaatttta t
//
LOCUS       HUMMTVG1      340 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 5) mitochondrial DNA sequences, 5' end.
ACCESSION   M28895
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 340)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      114 a    111 c     39 g     75 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatc tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
      121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
      181 agcaagtaca gtaatcaacc ctcaactgtc atacatcaac cgcaactcca aagccacccc
      241 tcagccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca
      301 tttaccgtac atagcacatt acagtcaaat cccttctcgt
//
LOCUS       HUMMTVG2      349 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 5) mitochondrial DNA sequences, 3' end.
ACCESSION   M28896
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      106 a    103 c     52 g     83 t      5 others
ORIGIN      
        1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc
       61 tatgtcgcag tatctgtctt tgattcctgc cccatcccat tatttatcgc acctacgttc
      121 aatattacag gcgaacatac nctaccaaag tgtgttaatt aattaatgct tgtaggacat
      181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt
      241 tnccaccaaa cccccccnnn tccccccgct tctggccaca gcacttaaac acatctctgc
      301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaa
//
LOCUS       HUMMTVH1      348 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 8) mitochondrial DNA sequences, 5' end.
ACCESSION   M28901
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      115 a    115 c     38 g     79 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
      121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
      181 agcaagtaca gcaatcaacc ttcaactgtc acacattaac tgcaactcca aagccacccc
      241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
      301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatcc
//
LOCUS       HUMMTVH2      355 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 8) mitochondrial DNA sequences, 3' end.
ACCESSION   M28902
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 355)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      107 a    101 c     52 g     90 t      5 others
ORIGIN      
        1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc
       61 tatgtcgcag tatctgtctt tgattcctgc cccatcctat tatttatcgc acctacgttc
      121 aatattacag gcgaacatac nctactaaag tgtgttaatt aattaatgct tgtaggacat
      181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt
      241 tnccaccaaa cccccccnnn tccccccgct tctggccaca gcacttaaac acatctctgc
      301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaat tttat
//
LOCUS       HUMMTVI1      367 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 6) mitochondrial DNA sequences, 5' end.
ACCESSION   M28897
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 367)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      121 a    121 c     43 g     81 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
      121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
      181 agcaagtaca gtaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc
      241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca
      301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc
      361 agatagg
//
LOCUS       HUMMTVI2      358 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 6) mitochondrial DNA sequences, 3' end.
ACCESSION   M28898
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 358)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      106 a    102 c     54 g     91 t      5 others
ORIGIN      
        1 ttggtatttt cgtctggggg gtgtgcacgc gatagcattg cgagacgctg gagccggagc
       61 accctatgtc gcagtatctg tctttgattc ctgccccatc ccattattta tcgcacctac
      121 gttcaatatt acaggcgaac atacnctact aaagtgtgtt aattaattaa tgcttgtagg
      181 acataataat aacaattaaa tgtctgcaca gccactttcc acacagacat cataacaaaa
      241 aatttnccac caaacccccc cnnntccccc cgcttctggc cacagcactt aaacacatct
      301 ctgccaaacc ccaaaaacaa agaaccctaa caccagccta accagatttc aaattttt
//
LOCUS       HUMMTVJ1      365 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 9,10) mitochondrial DNA sequences, 5' end.
ACCESSION   M28903
KEYWORDS    mitochondrial DNA.
SEGMENT     1 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 365)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      120 a    122 c     43 g     79 t      1 others
ORIGIN      
        1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc
       61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact
      121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca
      181 agcaagtaca gcaatcaacc ttcaactgtc acaatcaacc gcaactccaa agccacccct
      241 cacccactag gataccaaca aacctaccca cccttaacag tacatagcac ataaagccat
      301 ttaccgtaca tagcacatta cagtcaaatc ccttctcgtc cccatggatg acccccctca
      361 gatag
//
LOCUS       HUMMTVJ2      355 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Human (!Kung 9,10) mitochondrial DNA sequences, 3' end.
ACCESSION   M28904
KEYWORDS    mitochondrial DNA.
SEGMENT     2 of 2
SOURCE      Human mitochondrial hair root DNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 355)
  AUTHORS   Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and
            Wilson,A.C.
  TITLE     Mitochondrial DNA sequences in single hairs from a southern African
            population
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Vigilant 06-OCT-1989.
BASE COUNT      107 a    102 c     52 g     90 t      4 others
ORIGIN      
        1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc
       61 tatgtcgcag tatctgtctt tgattcctgc cccatcccat tatttatcgc acctacgttc
      121 aatattacag gcgaacatac nctattaaag tgtgttaatt aattaatgct tgtaggacat
      181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt
      241 tnccaccaaa ccccccccnn tccccccgct tctggccaca gcacttaaac acatctctgc
      301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaat tttat
//
LOCUS       HUMLD78A     3176 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human cytokine LD78 alpha gene, complete cds.
ACCESSION   D90144
KEYWORDS    LD78; LD78 alpha; cytokine; inducible gene family;
            secreted peptide.
SOURCE      Human blood lymphocyte DNA , clone Lm LD-3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3176)
  AUTHORS   Nakao,M., Nomiyama,H. and Shimada,K.
  TITLE     Structures of human genes coding for cytokine LD78 and their
            expression
  JOURNAL   Mol. Cell. Biol. 10, 3646-3658 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hisayuki
            Nomiyama Department of Biochemistry Kumamoto University Medical
            School 2-2-1 Honjo, Kumamoto 860 Japan Phone:  096-344-2111 Fax:
            096-372-6140
FEATURES       from  to/span     description
    pept       1155     1227     cytokine LD78 alpha precursor, exon 1
               1916     2030     cytokine LD78 alpha precursor, exon 2
               2451     2541     cytokine LD78 alpha precursor, exon 3
    sigp       1155     1219     cytokine LD78 alpha signal peptide
    matp       1220     1227     cytokine LD78 alpha mature peptide
               1916     2030     cytokine LD78 alpha mature peptide
               2451     2538     cytokine LD78 alpha mature peptide
    pre-msg    1069     2957     cytokine LD78 alpha mRNA and introns
    IVS        1228     1915     cytokine LD78 alpha intron A
    IVS        2031     2450     cytokine LD78 alpha intron B
    signal     1041     1045     TATA box
BASE COUNT      833 a    741 c    752 g    850 t
ORIGIN      
        1 acccagggac ctatcacaca aatataagaa ctattcattc tttaaggcat gtatttccaa
       61 gcctttgtat ttttttccat gcttagggtt ggcaaggaat atatatatat ttgtacaaat
      121 atatatgtgt atatgtacaa atacatgtat atatagtaca aatatatata tatatttgta
      181 caattcttca gactttgtag aatttgtata atgtcgtatc ttgctttttt taaccactga
      241 tgttataagc atatttatgc cacttcattc attttagaga cttaataata aatgatctag
      301 tggataattt atcattccct gatggagaaa aatttagctt tgtttatttt agagttataa
      361 acgatgctgg gtcaggtatc tttatgtttg aagatggctc catatttggg ttgtttccac
      421 agaactcttt cctagaaatg ctttttctag gttaatggct acagatattt ctaggcacct
      481 gacatattga cacccacctc taaagtattt ttatgatcca caactagcgt ttaacacagc
      541 gccctagtca ctacatgact aataaataga caaatgactg aaacatgacc tcatgctttc
      601 tattcctcca gctttcattc agttctttgc ctctgggagg aggaagggtt gtgcagccct
      661 ccacagcatc agcccatcaa ccctatccct gtggttatag cagctgagga agcagaattg
      721 cagctctgtg ggaaggaatg gggctggaga gttcatgcac agaccagttc ttatgagaag
      781 ggactgacta agaatagcct tgggttgaca tatacccctc ttcacactca caggagaaac
      841 catttcccta tgaaactata acaagtcatg agttgagagc tgagagttag agaatagctc
      901 aaagatgcta ttcttggata tcctgagccc ctgtggtcac cagggaccct gagttgtgca
      961 acttagcatg acagcatcac tacgcttaaa aatttccctc ctcaccccca gattccattt
     1021 ccccatccgc cagggctgcc tataaagagg agagctggtt tcagacttca gaaggacacg
     1081 ggcagcagac agtggtcagt cctttcttgg ctctgctgac actcgagccc acattccgtc
     1141 acctgctcag aatcatgcag gtctccactg ctgcccttgc tgtcctcctc tgcaccatgg
     1201 ctctctgcaa ccagttctct gcatcacgtg agtctgagtt tcgttgtggg tatcaccact
     1261 ctctggccat ggttagacca catcaatctt ttcttgtggc ctaaaagccc ccaagagaaa
     1321 agagaacttc ttaaagggct gccaaacatc ttggtctttc tctttaagac ttttattttt
     1381 atctctagaa ggggtcttag ccccctagtc tccaggtatg agaatctagg caggggcagg
     1441 ggagttacag tcccttttac agatagaaaa acagggttcg aaacgaatca gttagcaaga
     1501 ggcagaatcc agggctgctt acttcccagt ggggtatgtt gttcactctc cagctcactc
     1561 taggtctccc aggagctctg tcccttggat gtcttatgag agatgtccaa ggcttctctt
     1621 gggttggggt atgacttctt gaaccagaca aaattccctg aagagaactg agataagaga
     1681 acagtccgtt caggtatctg gatcacacag agaaacagag aacccactat gaagagtcaa
     1741 ggagaaagaa ggatacagac agaaacaaag agacatttct cagcaaaaat gcccaaatgc
     1801 cttccagtca cttggtctga gcaagcctgc cttcctcaac tgctcgggga tcagaagctg
     1861 cctggccttt tcttctgagc tgtgactcgg gctcattctc ttcctttctc cacagttgct
     1921 gctgacacgc cgaccgcctg ctgcttcagc tacacctccc ggcagattcc acagaatttc
     1981 atagctgact actttgagac gagcagccag tgctccaagc ccggtgtcat gtaagtgcca
     2041 gtcttcctgc tcacctctat ggaggtaggg agggtcaggg ttggggcaga gacaggccag
     2101 aaggctatcc tggaaaggcc cagccttcag gagcctatcg gggatacagg acgcagggct
     2161 ccgaggtgtg acctgacttg gagctggagt gaggcatgtg ttacagagtc aggaagggct
     2221 gccccagccc agaggaaagg gacaggaaga aggaggcagc gggacactct gagggccacc
     2281 cctactgagt cactgagaga agctctctag acagagatag gcagggggcc cctgaaagag
     2341 gagcaagccc tgagctgccc aggacagaga gcagaatggt ggggccatgg tgggcccagg
     2401 attcccctgc tggattcccc agtgcttaac tcttcctccc ttctccacag cttcctaacc
     2461 aagcgaagcc ggcaggtctg tgctgacccc agtgaggagt gggtccagaa atatgtcagc
     2521 gacctggagc tgagtgcctg aggggtccag aagcttcgag gcccagcgac ctcggtgggc
     2581 ccagtgggga ggagcaggag cctgagcctt gggaacatgc gtgtgacctc cacagctacc
     2641 tcttctatgg actggttgtt gccaaacagc cacactgtgg gactcttctt aacttaaatt
     2701 ttaatttatt tatactattt agtttttgta atttattttc gatttcacag tgtgtttgtg
     2761 attgtttgct ctgagagttc ccctgtcccc tcccccttcc ctcacaccgc gtctggtgac
     2821 aaccgagtgg ctgtcatcag cctgtgtagg cagtcatggc accaaagcca ccagactgac
     2881 aaatgtgtat cggatgcttt tgttcagggc tgtgatcggc ctggggaaat aataaagatg
     2941 ctcttttaaa aggtaaacca gtattgagtt tggttttgtt tttctggcaa atcaaaatca
     3001 ctggttaaga ggaatcatag gcaaagatta ggaagaggtg aaatggaggg aaattgggag
     3061 agatggggag ggctaccaca gagttatcca ctttacaacg gagacacagt tctggaacat
     3121 tgaaactacg aatatgttat aactcaaatc ataacatgca tgctctagga gaattc
//
LOCUS       HUMLD78B     3112 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human cytokine LD78 beta gene.
ACCESSION   D90145
KEYWORDS    LD78; LD78 beta; cytokine; inducible gene family; secreted peptide.
SOURCE      Human placenta DNA, clone Lm LD-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3112)
  AUTHORS   Nakao,M., Nomiyama,H. and Shimada,K.
  TITLE     Structures of human genes coding for cytokine LD78 and their
            expression
  JOURNAL   Mol. Cell. Biol. 10, 3646-3658 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hisayuki
            Nomiyama Department of Biochemistry Kumamoto University Medical
            School 2-2-1 Honjo, Kumamoto 860 Japan Phone:  096-344-2111 Fax:
            096-372-6140
FEATURES       from  to/span     description
    pept       1192     1267     cytokine LD78 beta precursor, exon 1
               1953     2067     cytokine LD78 beta precursor, exon 2
               2488     2578     cytokine LD78 beta precursor, exon 3
    sigp       1192     1259     cytokine LD78 beta signal peptide
    matp       1260     1267     cytokine LD78 beta mature peptide
               1953     2067     cytokine LD78 beta mature peptide
               2488     2575     cytokine LD78 beta mature peptide
    pre-msg    1106     2995     cytokine LD78 beta mRNA and introns
    IVS        1268     1952     cytokine LD78 intron A
    IVS        2068     2487     cytokine LD78 intron B
    rpt         498      797     Alu repeat
    signal     1078     1082     TATA box
BASE COUNT      756 a    775 c    780 g    801 t
ORIGIN      
        1 ttagagactt aataataaag gatcttgtgg ataatttatc attccctgat agagaaaaat
       61 ttagctttgc ttattttaga gttataaatg atgctgggtc aggtatcttt atgtttgaag
      121 atggctccat atttgggttg tttccacaga actctttccc agaaatgctt tttctaggtt
      181 aatggctaca catatttcta ggcacctgac atactgacac ccacctctaa agtattttta
      241 tgatccacaa ctagcgttta acacagcgcc ccagtcactc cgagactaat aaatagacaa
      301 atgactgaaa cgtgacctca tgctttctat tcctccagct ttcattgagt tcctttcctc
      361 tgggaggact gggggttgtc tagccctcca cagcatcagc ccattgaccc tatccttgtg
      421 gttatagcag ctgaggaagc agaattacag ctctgtggga aggaatgggg ctggagagtt
      481 catgcataga ccaattcttt tttttttttt tttttgagat ggagtttcac ttttgttgcc
      541 caggctggag tgcaatggca tgatctcagc tcaccacagc ccccacctcc tgggttcaag
      601 cgattctcct gccctcagcc tcccgagtag ctgggattac aggcatgtgc caccacgcct
      661 gactactttt gtatttttag tagagatgga gtttctcttt cttggtcagg ttggtctcaa
      721 actcctgacc tcaggtgatc cgcagcctcg gcctcccaaa gtgttgggat tacaggtgtg
      781 agcgaccatg cctggctgca tagaccagtt cttatgagaa gggatcaact aagaatagcc
      841 ttgggttgac acacacccct cttcacactc acaggagaaa ccccatgaag ctagaaccag
      901 tcatgagttg agagctgaga gttagagagt agctcagaga tgctattctt ggatatcctg
      961 agcccctgtg gtcaccaggg accctgagtt gtgcaacact cagcatgaca gcatcactac
     1021 acttaaaaat ttccctcctc acccccagat tccatttccc catccgccag ggctgcctat
     1081 aaagaggaga gatggcttca gacatcagaa ggacgcaggc agcaaagagt agtcagtccc
     1141 ttcttggctc tgctgacact cgagcccaca ttccatcacc tgctcccaat catgcaggtc
     1201 tccactgctg cccttgccgt cctcctctgc accatggctc tctgcaacca ggtcctctct
     1261 gcaccacgtg agtccatgtt gttgttgtgg gtatcaccac tctctggcca tggttagacc
     1321 acatcagtct ttttttgcgg cctgagagcc ccgaagagaa aagaaggaag ttcttaaagc
     1381 gctgccaaac accttggtct ttttcttcac aacttttatt tttatctcta gaaggggtct
     1441 tagccctcct agtctccagg tatgagaatc taggcagggg caggggagtt acagtccctt
     1501 gtacagatag aaaaacaggg ttcaaaacga atcagtttgc aagaggcaga atccagggct
     1561 gcttacttcc cagtggggtc tgttgttcac tctccagctc accctaggtc tcccaggagc
     1621 cctgtccctt ggatgtctta tgagagatgt ccagggcttc tcttgggctg gggtatgact
     1681 tcttgaaccg acaaaattcc atgaagagag ctaagagaac agtccattca ggtatctgga
     1741 tcacatagag aaacagagaa cccactatga agagtcaagg ggaaagagga atatagacag
     1801 aaacaaagag acatttctct gcaaaacccc ccaaatgcct tgcagtcact tggtctgagc
     1861 aagcctgccc tcctcaacca ctcagggatc agaagctgcc tggccttttc ttctgagctg
     1921 tgactcgggc ttattctctc ctttctccgc agttgctgct gacacgccga ccgcctgctg
     1981 cttcagctac acctcccgac agattccaca gaatttcata gctgactact ttgagacgag
     2041 cagccagtgc tccaagccca gtgtcatgta agtgccagtc ttcctgctca cctctaggga
     2101 ggtagggagt gtcagggtgg gggcagaaac aggccagaag gccatcctgg aaaggcccag
     2161 ccttcaggag cctatcgggg atacaggacg cagggcactg aggtgtgacc tgacttgggg
     2221 ctggagtgag gtgggtgtta cagagtcagg aagggctgcc ccaggccaga ggaaaggaac
     2281 aggaagaagg aggcagcagg acactctgag ggcccccttg cctggagtca ctgagagaag
     2341 ctctctagac ggagataggc agggggcccc tgagagagga gcaggccttg agctgcccag
     2401 gacagagagc aggatgtcag gccatggtgg gcccaggatt ccccggctgg attccccagt
     2461 gcttaactct tcctcccttc tccacagctt cctaaccaag agaggccggc aggtctgtgc
     2521 tgaccccagt gaggagtggg tccagaaata cgtcagtgac ctggagctga gtgcctgagg
     2581 ggtccagaag cttcgaggcc cagcgacctc agtgggccca gtggggagga gcaggagcct
     2641 gagccttggg aacatgcgtg tgacctctac agctacctct tctatggact ggttattgcc
     2701 aaacagccac actgtgggac tcttcttaac ttaaatttta atttatttat actatttagt
     2761 ttttataatt tatttttgat ttcacagtgt gtttgtgatt gtttgctctg agagttcccc
     2821 ctgtcccctc caccttccct cacagtgtgt ctggtgacga ccgagtggct gtcatcggcc
     2881 tgtgtaggca gtcatggcac caaagccacc agactgacaa atgtgtatca gatgcttttg
     2941 ttcagggctg tgatcggcct ggggaaataa taaagatgtt cttttaaacg gtaaaccagt
     3001 attgagtttg gttttgtttt tctggcaaat caaaatcact agttaagagg aatcataggc
     3061 aaagattagg aagaggtgaa atggagggaa actgggagag atggggagcg ct
//
LOCUS       XELTRH       1442 bp ss-mRNA            VRT       01-AUG-1990
DEFINITION  X.laevis thyrotropin releasing hormone (TRH) mRNA, complete cds.
ACCESSION   M34699 K00931 J05514
KEYWORDS    thyrotropin releasing hormone.
SOURCE      X.laevis skin, cDNA to mRNA, clone L4 and 8/136.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 478)
  AUTHORS   Richter,K., Kawashima,E., Egger,R. and Kreil,G.
  TITLE     Biosynthesis of thyrotropin releasing hormone in the skin of
            Xenopus laevis: Partial sequence of the precursor deduced from
            cloned cDNA
  JOURNAL   EMBO J. 3, 617-621 (1984)
  STANDARD  full staff_review
REFERENCE   2  (bases 15 to 1442)
  AUTHORS   Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
  TITLE     Two precursors of thyrotropin releasing hormone from skin of
            Xenopus laevis: Each contains seven copies of the end product
  JOURNAL   J. Biol. Chem. 265, 11731-11733 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by G.Kreil, 18-MAY-1990.
FEATURES       from  to/span     description
    pept        110      793     thyrotropin releasing hormone precursor
    matp        332      340     thyrotropin releasing hormone copy 1
    matp        374      382     thyrotropin releasing hormone copy 2
    matp        428      436     thyrotropin releasing hormone copy 3
    matp        470      478     thyrotropin releasing hormone copy 4
    matp        566      574     thyrotropin releasing hormone copy 5
    matp        611      619     thyrotropin releasing hormone copy 6
    matp        686      694     thyrotropin releasing hormone copy 7
    mRNA    <     1     1442     TRH mRNA
    conflict    139      139     t in [2]; c in [1]
    conflict    214      216     tct in [2]; ctc in [1]
    conflict    319      319     g in [2]; t in [1]
BASE COUNT      460 a    286 c    334 g    362 t
ORIGIN      
        1 agcacagagc agcacaagga cacactctgc atattgtgct gccggacaag gaggtgacag
       61 ccagtcaggc tgagacaaag gaacttccag acctctgaca gcaggaaaga tggtgtctgt
      121 ctggtggttg ctgcttcttg gtacaaccgt atctcacatg gtgcacacac aagagcagcc
      181 tttactggag gaggacacag caccattaga tgatctggat gttcttgaga aagccaaagg
      241 tatcctgatc cgcagtatcc tggagggatt tcaagaaggg caacaaaaca atagagatct
      301 accagatgca atggaaatga tatctaagcg ccagcaccca gggaaacgat tccaggagga
      361 gatagaaaag agacaacacc ctggaaagag ggatctggaa gatctgaatc tagagctttc
      421 caaaaggcaa caccccggaa gaagatttgt ggatgatgta gagaagaggc aacatccagg
      481 aaagagagaa gagggtgact ggagtaggag gtatctgaca gatgactcac gttatttgga
      541 cctcctttct gatgtttcca ggagacagca cccaggcaaa agagttccag ccccattgtt
      601 tacaaaacgt caacacccag gtaagagagt gacagaagaa gagggtgata ctgaatttga
      661 aaactcgaag gaagtgggga agcgccagca tccaggaaag agatatgacc cttgtgaagg
      721 ccctaatgcc tacaactgta actcaggaaa cattctaccg gattctgtag aagaattgag
      781 ttttgggctt taagctgccc agccccttta ttagttccat ctgaccctaa atgattccca
      841 atgaacacaa ctttctataa ttgttaaata acattgtatt aagtatcata catttctgga
      901 aagcaagcag ctcttagaac acttcttcgc tttaaaaggc acctggggca taagagtatt
      961 aagcttcaga cagtaacctg cccaccacag ggagggattc aacaatcaca attggctgag
     1021 tgttcctttc ccttgtttgg cagtgagatc agataataaa tataagatgg ccaggaaagt
     1081 ggactctttc ttttctgaaa atttgcaagt aacaccaaaa tataataatt tgcacactca
     1141 gtagtattaa cgtgaagatc tcaagaaggt tataaattct tggtgatctg ctcaaagcat
     1201 ttaattcata gttgcttcca tggtttgatg gggaatgcac attctaaatt gcttattgct
     1261 aattagcgct tgccacacag ttctggtggt agatcttgat gaggcatatt caataaaagt
     1321 agagcccata gtaaaatttg tgccccgtca gctttaagga tcctctgtaa gcaatatgtg
     1381 ttgtgagggc cacttgtttc taaagtaata ttttcatttt aataaatatg tctactcaaa
     1441 tg
//
LOCUS       XELTRHA      2955 bp ss-mRNA            VRT       01-AUG-1990
DEFINITION  X.laevis thyrotropin releasing hormone mRNA, complete cds.
ACCESSION   M34698 J05514
KEYWORDS    thyrotropin releasing hormone.
SOURCE      X.laevis, cDNA to mRNA, clone C6.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (sites)
  AUTHORS   Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
  TITLE     Two precursors of thyrotropin releasing hormone from skin of
            Xenopus laevis: Each contains seven copies of the end product
  JOURNAL   J. Biol. Chem. 265, 11731-11733 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2955; for [1])
  AUTHORS   Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by G.Kreil, 18-MAY-1990.
FEATURES       from  to/span     description
    pept        157      831     thyrotropin releasing hormone
    matp        379      387     thyrotropin releasing hormone copy 1
    matp        421      429     thyrotropin releasing hormone copy 2
    matp        475      483     thyrotropin releasing hormone copy 3
    matp        517      525     thyrotropin releasing hormone copy 4
    matp        613      621     thyrotropin releasing hormone copy 5
    matp        658      666     thyrotropin releasing hormone copy 6
    matp        733      741     thyrotropin releasing hormone copy 7
BASE COUNT      927 a    597 c    604 g    827 t
ORIGIN      
        1 catgcagttt attagatata cagtacaatg aagtcagtta tgagaaatag caattgcagc
       61 acaaggacac actctgcata ttgtgctgcc ggacaaggag gtgacagcca gtcaggctga
      121 gacaaaggaa cttccagacc tctgacagca ggaaagatgg tgtctgtctg gtggttgctg
      181 cttcttggta caaccgtatc tcacatggtg cacacacaag agcagccttt actggaggag
      241 gacacagcac cattagatga tctggatgtt cttgagaaag ccaaaggtat cctgatccgc
      301 agtatcctgg agggatttca agaagggcaa caaaacaata gagatctacc agatgcaatg
      361 gaaatgatat ctaagcgcca gcacccaggg aaacgattcc aggaggagat agaaaagaga
      421 caacaccctg gaaagaggga tctggaagat ctgaatctag agctttccaa aaggcaacac
      481 cccggaagaa gatttgtgga tgatgtagag aagaggcaac atccaggaaa gagagaagag
      541 ggtgactgga gtaggaggta tctgacagat gactcacgtt atttggacct cctttctgat
      601 gtttccagga gacagcaccc aggcaaaaga gttccagccc cattgtttac aaaacgtcaa
      661 cacccaggta agagagtgac agaagaagag ggtgatactg aatttgaaaa ctcgaaggaa
      721 gtggggaagc gccagcatcc aggaaagaga tatgaccctt gtgaaggccc taatgcctac
      781 aactgtaact caggaaacat tctaccggaa gaattgagtt ttgggcttta agctgcccag
      841 cccctttatt agttccatct gaccctaaat gattcccaat gaacacaact ttctataatt
      901 gttaaataac attgtattaa gtatcataca tttctggaaa gcaagcagct cttagaacac
      961 ttcttcgctt taaaaggcac ctggggcata agagtattaa gcttcagaca gtaacctgcc
     1021 caccacaggg agggattcaa caatcacaat tggctgagtg ttcctttccc ttgtttggca
     1081 gtgagatcag ataaataaat ataagatggc caggaaagtg gactctttct tttctgaaaa
     1141 tttgcaagta acaccaaaat ataataattt tgcactctgc agtgtattaa cgtgaagatc
     1201 tcaagaaggt tataaattag gttataaatt cttggtgatc tgctcaaagc atttaattca
     1261 tagttgcttc catggtttga tggggaatgc acattctaaa ttgcttattg ctaattagcg
     1321 cttgccacac agttctggtg gtagatcttg atgaggcata ttcaataaaa gtagagccca
     1381 tagtaaaatt tgtgccccgt cagctttaag gatcctctgt aagcaatatg tgttgtgagg
     1441 gccacttgtt tctaaagtaa tattttcatt ttaataaata tgtctactca aatgacaaaa
     1501 acattcatta tttcactaca ttatactcct tcccacagca attatgtacc tatgaatcct
     1561 gatagaagac tgcagttttc ctcttatatc ctccatgttg gattcaccat aagtcaccaa
     1621 aatatatcta tagggaagca cactatacac aatagcagtg acccccatcc agtggcttgt
     1681 gggcaacaag ctactcacca acccccttgg ctgttgctcc cagtggccct aaagtaaggt
     1741 gcataaaaaa accagatgaa cttgtcaaaa agagcctccc ttagactgcc ttgttccaca
     1801 tagaggctac catatagcca atcacagccc ttatttggca cccccgggaa cttttttcat
     1861 gcttgagttg ctccccaaat ctttttacag ttgaatatgt ctcatggcta aaaaaacgtg
     1921 aggaccccgg cgtaatatag tataatatac acacactcac tttggaaaac tctatggaga
     1981 tcaataagca cttttgggtt aaactatttt tttgatacaa tttgagcact ttatatatgg
     2041 attttaaaga tattccgctt tagtagtctg tggtgcgctg ccccataaat atattggtga
     2101 attattcacc acctactctt aacaattctg ctcaattcat ctagatgtta acataataca
     2161 tcaccagtat cacaatggca gcgggaagca aagacattct gtagtgtcct gagaccagct
     2221 aaagcctaga ggtggaccat aaataatgtc tattgcaggg tcagtacaaa caaaaacacc
     2281 aaggctgctt tatacaaggc atatctaatt tgcaggtatt ttgctgaact attactccac
     2341 acacaaagct tgagggacac agactaataa tctgctgaag gtttgcagga tggacagttg
     2401 gacactgctt tgcttcaact ttattctagg cttgtgctct gatgtatgca gcgtcaaata
     2461 ccagctgttg tttgactaca actcccagaa gcctcagcat actgagggtg gtatgcttga
     2521 atgcttgaat gcttgaatac cgaaggctgt ctgtcctcca acacctcccg ttgatctccc
     2581 gctccagctc ttattgtcat tccattgtat attttgtttt taaatgtata aagaaataaa
     2641 aaaaaagtat gatatattca cccttcttct tctgagtata aaaagattta aatgaatgtg
     2701 aaaataatat ttttatagac aacaatcttt gtgcagtgtt ggtaaataca tgtttattct
     2761 gtatatagct attttaatat gcatactgaa agaatatata tatataataa gaagcatgaa
     2821 catctcattg cctgggtatg aaacaataaa gattgcatct gataatgaag caaattcgct
     2881 ctgtggcgca gtattatgtt gacctgatga tgaagttagg tctggtgcgc ttctcaatgt
     2941 tcgtggcgct ggccc
//
LOCUS       PVICSD       1107 bp ds-DNA             INV       01-AUG-1990
DEFINITION  P.vivax circumsporozoite protein gene, complete cds.
ACCESSION   M34697
KEYWORDS    circumsporozoite protein.
SOURCE      P.vivax (strain Thai; isolate NYU Thai) sporozoite DNA.
  ORGANISM  Plasmodium vivax
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 286 to 798)
  AUTHORS   Arnot,D.E., Stewart,M.J. and Barnwell,J.W.
  TITLE     Antigenic diversity in Thai Plasmodium vivax circumsporozoite
            proteins
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 285; 799 to 1107)
  AUTHORS   Arnot,D.E., Stewart,M.J. and Barnwell,J.W.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Arnot, 18-MAY-1990.
            
            The bases in reference [2] are identical to bases 140 to 442 and
            995 to 1294 of the sequence of the North Korean strain published
            in Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106; accession number
            M20671.
            
               Author address: D.E.Arnot
               Dept. of Genetics
               University of Edinburgh
               West Mains Rd., Edingburgh, EM93JM
               Scotland
FEATURES       from  to/span     description
    pept          1     1107     circumsporozoite protein
BASE COUNT      382 a    212 c    338 g    175 t
ORIGIN      
        1 atgaagaact tcattctctt ggctgtttct tccatcctgt tggtggactt gttccccacg
       61 cactgcgggc acaatgtaga tctgtccaag gccataaatt taaatggagt aaacttcaat
      121 aatgtagacg ccagttcact tggcgcggca cacgtaggac aaagtgctag ccgaggcaga
      181 ggacttggtg agaacccaga tgacgaggaa ggagatgcta aaaaaaaaaa ggatggaaag
      241 aaagcagaac caaaaaatcc acgtgaaaat aagctgaaac aaccaggaga cagagcagat
      301 ggacagccag caggagacag agcagatgga cagccagcag gtgatagagc agatggacaa
      361 ccagcaggtg atagagctgg acagccagca ggagatagag cagatggaca gccagcagga
      421 gacagagcag atggacagcc agcaggagac agagcagatg gacagccagc aggagacaga
      481 gcagatggac agccagcagg tgacagagct ggacaaccag caggtgatag agctggacag
      541 ccagcaggcg atagagcaga tggacagcca gcaggagata gagctggaca gccagcaggc
      601 gatagagcag atggacagcc agcaggagat agagctggac aaccagcagg agatagagca
      661 gatggacaac cagcaggaga tagagctgga cagccagcag gagatagagc tggacagcca
      721 gcaggagata gagctggaca gccagcagga gatagagctg gacagccagc aggaaatggt
      781 gcaggtggac aggcagcagg aggaaacgca ggaggacagg gacaaaataa tgaaggtgcg
      841 aatgccccaa atgaaaagtc tgtgaaagaa tacctagata aagttagagc taccgttggc
      901 accgaatgga ctccatgcag tgtaacctgt ggagtgggtg taagagtcag aagaagagtt
      961 aatgcagcta acaaaaaacc agaggatctt actttgaatg accttgagac tgatgtttgt
     1021 acaatggata agtgtgctgg catatttaac gttgtgagta attcattagg gctagtcata
     1081 ttgttagtcc tagcattatt caattaa
//
LOCUS       ATTRRA        119 bp ss-RNA             RNA       01-AUG-1990
DEFINITION  A.solani 5S rRNA.
ACCESSION   M35573
KEYWORDS    5S ribosomal RNA.
SOURCE      A.solani (strain CBS 277-32) 5S rRNA.
  ORGANISM  Atractiella solani
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Phragmobasidiomycetes; Heterobasidiomycetidae; Auriculariales; 
            Auriculariaceae.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Blanz,P.A. and Gottschalk,M.
  TITLE     Systematic position of Septobasidium, Graphiola and other
            basidiomycetes as deduced on the basis of their 5S ribosomal RNA
            nucleotide sequences
  JOURNAL   Syst. Appl. Microbiol. 8, 121-127 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >   119     5S rRNA
BASE COUNT       29 a     30 c     34 g     26 t
ORIGIN      
        1 aggtgcgacc ataccgtgtt gaaaattctg catcccgtcc gatctgcaaa gacaagcaac
       61 acagggccca gtcagtagtg cggtgggtga ccacgtgcga atactgtggt gttgcactt
//
LOCUS       CETRRA        118 bp ss-RNA             RNA       01-AUG-1990
DEFINITION  C.cornigerum 5S rRNA.
ACCESSION   M35577
KEYWORDS    5S ribosomal RNA.
SOURCE      C.cornigerum (strain FO 29225) 5S rRNA.
  ORGANISM  Ceratobasidium cornigerum
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Phragmobasidiomycetes; Metabasidiomycetidae; Metatremellales; 
            Ceratobasidiaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Blanz,P.A. and Gottschalk,M.
  TITLE     Systematic position of Septobasidium, Graphiola and other
            basidiomycetes as deduced on the basis of their 5S ribosomal RNA
            nucleotide sequences
  JOURNAL   Syst. Appl. Microbiol. 8, 121-127 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >   118     5S rRNA
BASE COUNT       23 a     35 c     37 g     23 t
ORIGIN      
        1 atccacggcc ataggacttc gaaagcaccg catcccgtcc gatctgcgca gttaaccgga
       61 gtgccgccta gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt
//
LOCUS       GRARRA        118 bp ss-RNA             RNA       01-AUG-1990
DEFINITION  G.phoenicis 5S rRNA.
ACCESSION   M35575
KEYWORDS    5S ribosomal RNA.
SOURCE      G.phoenicis (strain PB 4349) 5S rRNA.
  ORGANISM  Graphiola phoenicis
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; Teliomycetes; 
            Ustilaginales; Graphiolaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Blanz,P.A. and Gottschalk,M.
  TITLE     Systematic position of Septobasidium, Graphiola and other
            basidiomycetes as deduced on the basis of their 5S ribosomal RNA
            nucleotide sequences
  JOURNAL   Syst. Appl. Microbiol. 8, 121-127 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >   118     5S rRNA
BASE COUNT       26 a     33 c     36 g     23 t
ORIGIN      
        1 atctgcggcc atagaaccgt gaaaataccg catcccgtcc gatctgcgaa gtcaagcacg
       61 gtatcgccta gtcagtactg cggtggggga ccacgcggga atcctgggtg ctgcagtt
//
LOCUS       PLARRA        119 bp ss-RNA             RNA       01-AUG-1990
DEFINITION  P.peniophorae 5S rRNA.
ACCESSION   M35571
KEYWORDS    5S ribosomal RNA.
SOURCE      P.peniophorae (strain FO 22315) 5S rRNA.
  ORGANISM  Platygloea peniophorae
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Phragmobasidiomycetes; Heterobasidiomycetidae; Auriculariales; 
            Auriculariaceae.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Blanz,P.A. and Gottschalk,M.
  TITLE     Systematic position of Septobasidium, Graphiola and other
            basidiomycetes as deduced on the basis of their 5S ribosomal RNA
            nucleotide sequences
  JOURNAL   Syst. Appl. Microbiol. 8, 121-127 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >   119     5S rRNA
BASE COUNT       27 a     35 c     36 g     21 t
ORIGIN      
        1 atctgcggcc ataccgtgat gaacattccg cgtcccgtcc gatccgcgca gacaagcatc
       61 acaggggcca gagagtattg acgtgggtga ccagtcgaga acactgtgct gccgcaggt
//
LOCUS       PLERRA        119 bp ss-RNA             RNA       01-AUG-1990
DEFINITION  P.faginea 5S rRNA.
ACCESSION   M35574
KEYWORDS    5S ribosomal RNA.
SOURCE      P.faginea (strain FO 22315) 5S rRNA.
  ORGANISM  Phleogena faginea
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Phragmobasidiomycetes; Heterobasidiomycetidae; Eutremellales; 
            Phleogenaceae.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Blanz,P.A. and Gottschalk,M.
  TITLE     Systematic position of Septobasidium, Graphiola and other
            basidiomycetes as deduced on the basis of their 5S ribosomal RNA
            nucleotide sequences
  JOURNAL   Syst. Appl. Microbiol. 8, 121-127 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >   119     5S rRNA
BASE COUNT       28 a     30 c     34 g     27 t
ORIGIN      
        1 atgtgcgacc ataccaagct gaaaatactg catcccgtct gatctgcaca gtcaagcagc
       61 ttagggccca gtcagtagtg cggtggggga ccatgcgcga acattgtggt gttgcactt
//
LOCUS       SEPRRA        119 bp ss-RNA             RNA       01-AUG-1990
DEFINITION  S.carestianum 5S rRNA.
ACCESSION   M35572
KEYWORDS    5S ribosomal RNA.
SOURCE      S.carestianum (strain FO 25109) 5S rRNA.
  ORGANISM  Septobasidium carestianum
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Phragmobasidiomycetes; Heterobasidiomycetidae; Septobasidiales; 
            Septobasidiaceae.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Blanz,P.A. and Gottschalk,M.
  TITLE     Systematic position of Septobasidium, Graphiola and other
            basidiomycetes as deduced on the basis of their 5S ribosomal RNA
            nucleotide sequences
  JOURNAL   Syst. Appl. Microbiol. 8, 121-127 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >   119     5S rRNA
BASE COUNT       25 a     37 c     36 g     21 t
ORIGIN      
        1 atctggggcc ataccacagt gaacacaccg catcccgtcc gatctgcgca gttaaccact
       61 gtagggccga gtcagtagtg cggtggggga ccacgcgcga atactctggt gccccaggt
//
LOCUS       TULRRA        118 bp ss-RNA             RNA       01-AUG-1990
DEFINITION  T.violea 5S rRNA.
ACCESSION   M35576
KEYWORDS    5S ribosomal RNA.
SOURCE      T.violea (strain FO 29326) 5S rRNA.
  ORGANISM  Tulasnella violea
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Phragmobasidiomycetes; Heterobasidiomycetidae; Tremellales; 
            Tulasnellaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Blanz,P.A. and Gottschalk,M.
  TITLE     Systematic position of Septobasidium, Graphiola and other
            basidiomycetes as deduced on the basis of their 5S ribosomal RNA
            nucleotide sequences
  JOURNAL   Syst. Appl. Microbiol. 8, 121-127 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >   118     5S rRNA
BASE COUNT       30 a     29 c     31 g     28 t
ORIGIN      
        1 atcttcggcc ataggacaga gaaaataccg catcccgtcc gatctgcgca gtcaagctct
       61 gtaccgctta gttagtacca tagtggggga ccatatggga atcctgagtg ctgaagtt
//
LOCUS       UTHRRA        118 bp ss-RNA             RNA       01-AUG-1990
DEFINITION  U.fusisporum 5S rRNA.
ACCESSION   M35578
KEYWORDS    5S ribosomal RNA.
SOURCE      U.fusisporum (strain FO 25106) 5S rRNA.
  ORGANISM  Uthatobasidium fusisporum
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Phragmobasidiomycetes; Heterobasidiomycetidae; Tremellales; 
            Tulasnellaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Blanz,P.A. and Gottschalk,M.
  TITLE     Systematic position of Septobasidium, Graphiola and other
            basidiomycetes as deduced on the basis of their 5S ribosomal RNA
            nucleotide sequences
  JOURNAL   Syst. Appl. Microbiol. 8, 121-127 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1  >   118     5S rRNA
BASE COUNT       23 a     35 c     37 g     23 t
ORIGIN      
        1 atccacggcc ataggacttc gaaagcaccg catcccgtcc gatctgcgca gttaaccgga
       61 gtgccgccta gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt
//
LOCUS       C11CMIA      2149 bp ds-DNA             BCT       01-AUG-1990
DEFINITION  Plasmid pColBM-C1139 colicin lysis protein (cmi) gene, 5' end.
ACCESSION   M35683
KEYWORDS    colicin lysis protein.
SOURCE      Plasmid pColBM-C1139 DNA.
  ORGANISM  Plasmid pColBM-C1139
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 2149)
  AUTHORS   Thumm,G., Oelschlaeger,T. and Braun,V.
  TITLE     Plasmid pColBM-C1139 does not encode a colicin lysis protein but
            contains sequences highly homologous to the D protein (resolvase)
            and the oriV region of the miniF plasmid
  JOURNAL   Plasmid 20, 75-82 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1108     1890     ORF
    pept       1991  >  2149     colicin lysis protein (cmi)
BASE COUNT      512 a    510 c    578 g    549 t
ORIGIN      
        1 gaattcatct tttggccgtt tacgtctgtt ccgttatcct gatgatacga tgttctgcac
       61 gttctgccgg gaagatgcag atgattcgct taaaagtatt atgacccatc tctgggagct
      121 ggatgcagag atgacagatc ctgtcatagc tatgtttaat cacgtctgag tgccgtgagt
      181 gatttctgtc ttttatgcaa cagtgccaag atattgtaat caaaaaaaag cattaatgca
      241 ttttggacag taatctattt taattgatga catagaggca ttaatctttc tttttcttca
      301 ggaagatccg aaaactcctg gtcacggatc ttcctctccc ccacacaacg ccacctcctg
      361 taagcacaac atgtggtgcc ggattcagct gctgatgaca ctatatgttg tgtcatctcc
      421 ctgacctgtg atgcgtcgcg caggggcgga aaacagcgat atgatgattt cctcggcgtg
      481 gtacacttcc ggaaagttgt gatattccgg aaagtcggat ctgacggaaa cggctctccg
      541 gtaatttaac ggcgtggtta tatggatgct tgttatcatg gtgatgatga taacggcatg
      601 atgttatcag acggcgtgac ggtaagggca gtgatgatgg atgacgttat cgcatgaccg
      661 tccctgcccg gaaaagaaaa aaggagtcac ccatgttttt tattgagaat gaaggtcagg
      721 ctgtcgccgg aacggattac tggcagtctg tacaggcgca ggccggatat gtctacctca
      781 gctggaatgc cggcgcagcc aggctgcttg tcccggatgc ggcaaaacat ttactcaggg
      841 agatgcgggg ggctgagtac gtcatcatca gtaagggagc actgcatggc cgcgatgcgc
      901 tggaactggt atttgaagac ggcagcgatg cgccgtttgt gatccacatg ctgagtgagc
      961 agtgcgatcg cctgctcccc gaaaacaacc agggaggggg ttttgttgtc accgtctgga
     1021 cgcgtggcgg taaccagctc cgttatccgg gaaagtaccg ggttgtggaa aacctgcccg
     1081 acgtttcccc gtggagtgaa cactgatatg cagcacctgc cggcaccgat ccaccatgcc
     1141 cgggatgctg ttcagcttcc tgttgccatc gattatccgg cagcgctggc actccgccag
     1201 atgtcgatgg ttcatgatga actgcccaaa tacctgctgg cccctgaagt gagcgccctg
     1261 ctccattacg tcccggatct gcgccgcaag atgctgctgg ccacactgtg gaacaccggt
     1321 gcgcgcatta atgaagcact ggcgctgacg cggggggatt tttcgctcac gcctccgtat
     1381 ccgtttgtgc agctggccac tctgaagcag cggacagaaa aagccgccag gacggcagga
     1441 agaatgcccg ccggtcagca gactcaccgg ctggttccgc tctccgactc ctggtacgtc
     1501 agccagctgc agacgatggt agccacactg aaaatcccca tggaacggcg taataaacga
     1561 acaggcagga cagagaaagc gcggatctgg gaagtgacgg acagaacggt caggacctgg
     1621 attggggagg cggttgccgc cgctgccgct gatggtgtga cgttctctgt cccggtcacg
     1681 ccacatacgt tccgccattc ctatgcgatg cacatgctgt atgccggtat accgcttaag
     1741 gttctgcaga gtctgatggg gcataagtcc atcagctcaa cagaggtcta cacgaaggtg
     1801 tttgcactgg atgtggctgc acggcaccgg gtgcagtttt cgatgcctga gtccgatgcg
     1861 gtcacaatgc tgaaaaacag acatgcataa taagtcacaa ttatgaattg tgatttcttc
     1921 tataaaaaag agaccactgc aatatgtgat ctcttgtatt atttcataat tgttaaagcc
     1981 acttcacagt atgctcacat tgtacggata tattcgtaat gtttttttat atcgaatgaa
     2041 cgacagaagt tgtggagatt ttatgaaagt aattagcatg aaatttattt ttattttaac
     2101 gattattgct cttgctgctg tttttttctg gtctgaagat aaaggtccg
//
LOCUS       DOGPPPP       427 bp ss-mRNA            MAM       01-AUG-1990
DEFINITION  Canine pancreatic polypeptide mRNA, complete cds.
ACCESSION   M35596
KEYWORDS    pancreatic polypeptide.
SOURCE      Canine pancreas, cDNA to mRNA.
  ORGANISM  Canis lupus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 427)
  AUTHORS   Toothman,P. and Paquette,T.L.
  TITLE     Canine pancreatic polypeptide complementary deoxyribonucleic acid
            sequence: Pancreatic polypeptide and insulin messenger ribonucleic
            acid distribution in the lobes of the pancreas
  JOURNAL   Mol. Endocrinol. 1, 413-419 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         21      302     pancreatic polypeptide precursor
    sigp         21      107     pancreatic polypeptide signal peptide
    matp        108      215     pancreatic polypeptide
    matp        225      284     icosapeptide
    mRNA          1      427     pancreatic polypeptide mRNA
BASE COUNT       88 a    149 c    115 g     75 t
ORIGIN      
        1 tccgcccctt aggactcggg atgcctgccg cctgccgctg cctcttcctg ctgctcctgt
       61 cagcctgtgt ggctctgttg ctgcagccgc cactgggtac ccggggggcc ccgctggagc
      121 cagtgtatcc gggggacgat gccacaccag agcagatggc ccagtacgcg gctgagctcc
      181 gcagatacat caacatgctg accaggccca ggtatgggaa aagagacaga ggagaaatgc
      241 gggacatcct ggaatggggc tccccccatg cagccgcccc cagggagctg atggacgagt
      301 aatgccacct ccaagtaatg ccacctctgc ctctcaggcc aatgccagcc tacctctccc
      361 ctctgcaccc ctggccaaag cttgctccct gctctcacac acagactaaa taaagcaagt
      421 caaagtc
//
LOCUS       GVICG         296 bp ss-RNA   circular  VRL       01-AUG-1990
DEFINITION  Grapevine viroid grapevine isolate (SHV-g(GV)) complete genome.
ACCESSION   M35717
KEYWORDS    complete genome.
SOURCE      Grapevine viroid RNA.
  ORGANISM  Grapevine viroid
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 296)
  AUTHORS   Sano,T., Ohshima,K., Hataya,T., Uyeda,I., Shikata,E., Chou,T.-G.,
            Meshi,T. and Okada,Y.
  TITLE     A viroid resembling hop stunt viroid in grapevines from Europe, the
            United States and Japan
  JOURNAL   J. Gen. Virol. 67, 1673-1678 (1986)
  STANDARD  simple staff_review
BASE COUNT       60 a     87 c     80 g     69 t
ORIGIN      
        1 ctggggaatt ctcgagttgc cgcatcaggc aagcaaagaa aaaacaaggc agggaggtac
       61 ttacctgaga aaggagcccc ggggcaactc ttctcagaat ccagcgagag gcgtggagag
      121 agggccgcgg tgctctggag tagaggctct gcttcagaac accatcgatc gtcccttctt
      181 ctttaccttc ttctggctct tccgatgaga cgcgaccggt ggcatcacct ctcggttcgt
      241 cccaacctgc tttttgtcta tctgagcctc tgccgcggat cctctcttga gcccct
//
LOCUS       HUMTCAJK       94 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human T-cell receptor germline J-alpha RP DNA, partial cds.
ACCESSION   M35619
KEYWORDS    T-cell receptor alpha-chain; antigen receptor; germline;
            joining exon.
SOURCE      Human T-cell line RPMI 8402 DNA, clone lambda-R15.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 94)
  AUTHORS   Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
  TITLE     Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
            alpha locus and aberrant recombination between J-alpha segments
  JOURNAL   EMBO J. 7, 1661-1668 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    32  /    92     T-cell receptor germline J-alpha RP region (AA
                                 at 32) /hgml_locus_uid="LX0123X"
                                 /nomgen="TCRA" /map="14q11.2"
    IVS          93  >    94     TCR intron
    signal        2       10     nonamer recombination signal
    signal       23       29     heptamer recombination signal
BASE COUNT       25 a     23 c     22 g     24 t
ORIGIN      
        1 aggtttctgt tatgaagcat ctcacagtgt aaataccggc actgccagta aactcacctt
       61 tgggactgga acaagacttc aggtcacgct cggt
//
LOCUS       HUMTCAJM       80 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human T-cell receptor unproductively rearranged J-alpha AA/J-alpha
            AB DNA pseudogene, partial cds.
ACCESSION   M35621
KEYWORDS    T-cell receptor alpha-chain; antigen receptor; joining exon;
            processed gene; pseudogene.
SOURCE      Human cell line AT5-B1 tumor DNA, clone lambda-A30.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 80)
  AUTHORS   Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
  TITLE     Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
            alpha locus and aberrant recombination between J-alpha segments
  JOURNAL   EMBO J. 7, 1661-1668 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps /    30  >    78     T-cell receptor unproductively rearranged
                                 J-alpha AA/J-alpha AB region (AA at 30)
                                 /hgml_locus_uid="LX0123X" /nomgen="TCRA"
                                 /map="14q11.2"
    recomb       26       27     J-alpha AA end/J-alpha AB start
    signal        7       15     nonamer recombination signal
BASE COUNT       23 a     15 c     18 g     24 t
ORIGIN      
        1 tatgttggtt tatgtagaga cacatataga ccgacaagct catctttggg actgggacca
       61 gattacaagt ctttccaagt
//
LOCUS       HUMTCAZI      520 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human T-cell receptor productively rearranged V-alpha-J-alpha DNA,
            exons 1 and 2.
ACCESSION   M35617
KEYWORDS    T-cell receptor alpha-chain; antigen receptor; joining exon;
            processed gene; variable region.
SOURCE      Human T-cell line RPMI 8402 DNA, clone lambda-R10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 520)
  AUTHORS   Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
  TITLE     Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
            alpha locus and aberrant recombination between J-alpha segments
  JOURNAL   EMBO J. 7, 1661-1668 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         11       56     T-cell receptor V-alpha-J-alpha region, exon 1
                                 /hgml_locus_uid="LX0123X" /nomgen="TCRA"
                                 /map="14q11.2"
                166  /   513     T-cell receptor V-alpha-J-alpha region, exon 2
    IVS          57      165     T-cell receptor intron A
    IVS         514  >   520     T-cell receptor intron B
BASE COUNT      115 a    129 c    122 g    154 t
ORIGIN      
        1 ttgctcagcc atgctcctgg agcttatccc actgctgggg atacattttg tcctgagtga
       61 gtaaaaattt ctttatggtc tctagttcca caggttctga ctagaaatgc ttgcttttta
      121 tactgagtct gcactgcttt cactgatagt acgttgtttt tccaggaact gccagagccc
      181 agtcagtgac ccagcctgac atccacatca ctgtctctga aggagcctca ctggagttga
      241 gatgtaacta ttcctatggg gcaacacctt atctcttctg gtatgtccag tcccccggcc
      301 aaggcctcca gctgctcctg aagtactttt caggagacac tctggttcaa ggcattaaag
      361 gctttgaggc tgaatttaag aggagtcaat cttccttcaa cctgaggaaa ccctctgtgc
      421 attggagtga tgctgctgag tacttctgtg ctgtggttgg cactgccagt aaactcacct
      481 ttgggactgg aacaagactt caggtcacgc tcggtaggta
//
LOCUS       HUMTCAZJ      130 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human T-cell receptor unproductively rearranged J-alpha RX/J-alpha
            RP DNA, partial cds.
ACCESSION   M35618
KEYWORDS    T-cell receptor alpha-chain; antigen receptor; joining exon;
            processed gene.
SOURCE      Human T-cell line RPMI 8402 DNA, clone lambda-R15.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 130)
  AUTHORS   Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
  TITLE     Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
            alpha locus and aberrant recombination between J-alpha segments
  JOURNAL   EMBO J. 7, 1661-1668 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps /    71  /   127     T-cell receptor J-alpha RP region (AA at 71)
                                 /hgml_locus_uid="LX0123X" /nomgen="TCRA"
                                 /map="14q11.2"
    pept.ps /    68  /     9 (c) T-cell receptor J-alpha RX (AA at 68)
    recomb       69       70     J-alpha RX end/J-alpha RP start
BASE COUNT       29 a     42 c     22 g     37 t
ORIGIN      
        1 tttaaagata gcttcactct cacttgcgtc cccattccaa atgtaaattt cctgtttccc
       61 cccctccgtt accggcactg ccagtaaact cacctttggg actggaacaa gacttcaggt
      121 cacgctcggt
//
LOCUS       HUMTCAZL       97 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human T-cell receptor germline J-alpha AA DNA, partial cds.
ACCESSION   M35620
KEYWORDS    T-cell receptor alpha-chain; antigen receptor; germline;
            joining exon.
SOURCE      Human cell line AT5-B1 tumor DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 97)
  AUTHORS   Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
  TITLE     Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
            alpha locus and aberrant recombination between J-alpha segments
  JOURNAL   EMBO J. 7, 1661-1668 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    35  /    95     T-cell receptor germline J-alpha RP region (AA
                                 at 35) /hgml_locus_uid="LX0123X"
                                 /nomgen="TCRA" /map="14q11.2"
    IVS          96  >    97     TCR intron
    signal        7       15     nonamer recombination signal
    signal       28       34     heptamer recombination signal
BASE COUNT       31 a     19 c     21 g     26 t
ORIGIN      
        1 tatgttggtt tatgtagaga cacataacac tgtgactacc tcaggaacct acaaatacat
       61 ctttggaaca ggcaccaggc tgaaggtttt agcaagt
//
LOCUS       HUMTCAZN       89 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human T-cell receptor germline J-alpha AB DNA, partial cds.
ACCESSION   M35622
KEYWORDS    T-cell receptor alpha-chain; antigen receptor; germline;
            joining exon.
SOURCE      Human cell line AT5-B1 tumor DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 89)
  AUTHORS   Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H.
  TITLE     Complex rearrangements within the human J-delta-C-delta/J-alpha-C-
            alpha locus and aberrant recombination between J-alpha segments
  JOURNAL   EMBO J. 7, 1661-1668 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    30  >    87     T-cell receptor germline J-alpha RP region (AA
                                 at 30) /hgml_locus_uid="LX0123X"
                                 /nomgen="TCRA" /map="14q11.2"
    IVS          88  >    89     TCR intron
    signal        2       10     nonamer recombination signal
    signal       23       29     heptamer recombination signal
BASE COUNT       23 a     19 c     18 g     29 t
ORIGIN      
        1 aggtttttgt agatctcagt atcactgtgt cttataacac cgacaagctc atctttggga
       61 ctgggaccag attacaagtc tttccaagt
//
LOCUS       MUSBMTA       141 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse thyrotropin beta-subunit mRNA, 5' end.
ACCESSION   M35719
KEYWORDS    thyroid stimulating hormone; thyrotropin beta-subunit.
SOURCE      Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-4.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 141)
  AUTHORS   Wood,W.M., Gordon,D.F. and Ridgway,E.C.
  TITLE     Expression of the beta-subunit gene of Murine thyrotropin results
            in multiple messenger ribonucleic acid species which are generated
            by alternative exon splicing
  JOURNAL   Mol. Endocrinol. 1, 875-883 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        118  >   141     thyrotropin beta-subunit
BASE COUNT       40 a     30 c     39 g     32 t
ORIGIN      
        1 agcagtaact cactcatgca aagtaagatc ctgcagtagt gggtggagaa gactgagcgc
       61 atacgagtgg agagaaaaat attctgcttc agtcaagagc tggggttgtt caaaagcatg
      121 agtgctgccg tcctcctctc c
//
LOCUS       MUSBMTB        99 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse thyrotropin beta-subunit mRNA, 5' end.
ACCESSION   M35720
KEYWORDS    thyroid stimulating hormone; thyrotropin beta-subunit.
SOURCE      Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 99)
  AUTHORS   Wood,W.M., Gordon,D.F. and Ridgway,E.C.
  TITLE     Expression of the beta-subunit gene of Murine thyrotropin results
            in multiple messenger ribonucleic acid species which are generated
            by alternative exon splicing
  JOURNAL   Mol. Endocrinol. 1, 875-883 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         76  >    99     thyrotropin beta-subunit
BASE COUNT       26 a     24 c     29 g     20 t
ORIGIN      
        1 agcagtaact cactcatgca aagtaagatc ctgcagtagt gggtggagaa gagtgaccgc
       61 atacgagtgg agagcatgag tgctgccgtc ctcctctcc
//
LOCUS       MUSBMTC        93 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse thyrotropin beta-subunit mRNA, 5' end.
ACCESSION   M35721
KEYWORDS    thyroid stimulating hormone; thyrotropin beta-subunit.
SOURCE      Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 93)
  AUTHORS   Wood,W.M., Gordon,D.F. and Ridgway,E.C.
  TITLE     Expression of the beta-subunit gene of Murine thyrotropin results
            in multiple messenger ribonucleic acid species which are generated
            by alternative exon splicing
  JOURNAL   Mol. Endocrinol. 1, 875-883 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         70  >    93     thyrotropin beta-subunit
BASE COUNT       26 a     22 c     21 g     24 t
ORIGIN      
        1 agcagtaact cactcatgca aagtaagaaa aatattctgc ttcagtgaag agctggggtt
       61 gttcaaagca tgagtgctgc cgtcctcctc tcc
//
LOCUS       MUSBMTD        52 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse thyrotropin beta-subunit mRNA, 5' end.
ACCESSION   M35723
KEYWORDS    thyroid stimulating hormone; thyrotropin beta-subunit.
SOURCE      Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 52)
  AUTHORS   Wood,W.M., Gordon,D.F. and Ridgway,E.C.
  TITLE     Expression of the beta-subunit gene of Murine thyrotropin results
            in multiple messenger ribonucleic acid species which are generated
            by alternative exon splicing
  JOURNAL   Mol. Endocrinol. 1, 875-883 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         29  >    52     thyrotropin beta-subunit
BASE COUNT       13 a     17 c     10 g     12 t
ORIGIN      
        1 agcagtaact cactcatgca aagtaagcat gagtgctgcc gtcctcctct cc
//
LOCUS       MUSIGKCSU     444 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse Ig aberrantly rearranged kappa-chain mRNA V-J2-C-region,
            complete cds.
ACCESSION   M35669
KEYWORDS    constant region; immunoglobulin light chain; joining exon;
            kappa-immunoglobulin; variable region.
SOURCE      Mouse myeloma MOPC-21, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 444)
  AUTHORS   Carroll,W.L., Mendel,E. and Levy,S.
  TITLE     Hybridoma fusion cell lines contain an aberrant kappa transcript
  JOURNAL   Mol. Immunol. 25, 991-995 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         28      414     Ig kappa-chain V-J2-C-region precursor
    sigp         28       87     Ig kappa-chain V-J2-C-region signal peptide
    matp         88      411     Ig kappa-chain V-J2-C-region
    recomb      380      381     V-region end/J2-region start
    recomb      411      412     J2-region end/C-region start
BASE COUNT      108 a    122 c    111 g    103 t
ORIGIN      Chromosome 6.
        1 cagcatcctc tcttccagct ctcagagatg gagacagaca cactcctgtt atgggtactg
       61 ctgctctggg ttccaggttc cactggtgac attgtgctga cacagtctcc tgcttcctta
      121 gctgtatctc tggggcagag ggccaccatc tcatacaggg ccagcaaaag tgtcagtaca
      181 tctggctata gttatatgca ctggaaccaa cagaaaccag gacagccacc cagactcctc
      241 atctatcttg tatccaacct agaatctggg gtccctgcca ggttcagtgg cagtgggtct
      301 gggacagact tcaccctcaa catccatcct gtggaggagg aggatgctgc aacctattac
      361 tgtcagcaca ttagggagct tacacgttcg gaggggggac caagctggaa ataaaacggg
      421 ctgatgctgc accaactgta tcca
//
LOCUS       MUSLACPI      844 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse placental lactogen I (mPL-I) mRNA, complete cds.
ACCESSION   M35662
KEYWORDS    placental lactogen I.
SOURCE      Mouse (strain Swiss-Webster) day 10 placenta, cDNA to mRNA, clone
            1.5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 844)
  AUTHORS   Colosi,P., Talamantes,F. and Linzer,D.I.H.
  TITLE     Molecular cloning and expression of mouse placental lactogen I
            complementary deoxyribonucleic acid
  JOURNAL   Mol. Endocrinol. 1, 767-776 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         42      716     placental lactogen I (mPL-I) precursor
    sigp         42      131     placental lactogen I (mPL-I) signal peptide
    matp        132      713     placental lactogen I (mPL-I)
    mRNA    <     1      844     mPL-I mRNA
    signal      821      831     mPL-I poly-A signal
BASE COUNT      243 a    188 c    176 g    237 t
ORIGIN      
        1 ttcctcactt ggagcctaca ttgtggtgga tcttctcaga aatgcagctg actttgaatc
       61 tttcaggctc cgcaggaatg caattgttgc tgctggtgtc aagcctactc ctttgggaga
      121 atgtgtcctc caaaccaact gccatggtgc ccactgaaga cctgtatact cgtttggctg
      181 aactgctcca taatacattt atcttggccg cagatgtgta tagggaattt gatttggatt
      241 ttttcgataa aacttggata acagacagaa cacttcccct gtgtcatact gcttccatcc
      301 atactccaga gaatcgagag gaagtccacg aaactaaaac tgaagacctt ctgaaagcaa
      361 tgatcaatgt ttcaatttcc tggaaagaac ctctgaaaca cctggtgtct gcactgacgg
      421 ctctcccagg agcttctgag agtatgggga aaaaagctgc tgacattaag ggcagaaacc
      481 ttgtaattct ggagggactt cagacaatat acaacaggtc tcaggctaac attgaagaaa
      541 atgaaaattt tgactaccct gcttggtctg gactcgaaga actgcagtca cctaacgaag
      601 acactcatct ttttgccgtt tataatctat gccgctgcat taaaagggac atccataaga
      661 tagacagcta tatcaaagtc ttgaggtgcc gagttgtctt tcagaacgaa tgttgagtgc
      721 ccacccagcg aagccctgcc cacatggtct ttgttgaacc agacttgtaa tgctttcccc
      781 tcctcagtta tgatgagcta taatggaatt attgtcataa aataaaataa aattatttag
      841 attc
//
LOCUS       BLYGSA       1621 bp ss-mRNA            PLN       01-AUG-1990
DEFINITION  Barley glutamate 1-semialdehyde aminotransferase (GSA) mRNA,
            complete cds.
ACCESSION   M31545
KEYWORDS    glutamate 1-semialdehyde aminotransferase.
SOURCE      Barley (cv. Bonus) 5 day old dark grown seedling, cDNA to mRNA.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1621)
  AUTHORS   Grimm,B.
  TITLE     Primary structure of a key enzyme in plant tetrapyrrole synthesis:
            Glutamate 1-semialdehyde aminotransferase
  JOURNAL   Unpublished (1990) Carlsberg Laboratory, Dept. of Physiology, Gamle
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Grimm, 22-JAN-1990.
FEATURES       from  to/span     description
    pept         20     1429     glutamate 1-semialdehyde aminotransferase
                                 (GSA) precursor (EC 5.4.3.8)
    sigp         20      121     glutamate 1-semialdehyde aminotransferase
                                 signal peptide
    matp        122     1426     glutamate 1-semialdehyde aminotransferase
    signal     1598     1603     polyA signal
    mRNA    <     1     1621     GSA mRNA
BASE COUNT      362 a    363 c    459 g    437 t
ORIGIN      
        1 ggagaaggaa ggcagcatca tggccggagc agcagccgcc gtggcctccg gcatatcgat
       61 caggcctgta gccgcgccta agatctcgcg cgcgccccgc tctcggtcgg tggtgagggc
      121 ggccgtctcc atagacgaga aggcttacac ggttcagaaa tccgaggaga tcttcaacgc
      181 cgccaaggaa ttgatgcctg gtggtgttaa ttcaccagtc cgtgccttca aatcagtcgg
      241 cgggcagccc atagtttttg attctgtgaa gggctctcat atgtgggatg tcgatggaaa
      301 tgaatatatt gattatgttg gttcctgggg tcctgcaatc attggtcatg cagatgacaa
      361 ggtgaatgct gcacttattg aaactctgaa gaagggtact agctttggtg ctccatgtgc
      421 gttggagaat gtgttggctc aaatggtcat ctccgctgtg ccgagtatcg aaatggttcg
      481 ttttgtaaat tcaggaacag aagcttgcat gggagcactc cgccttgtgc gtgcattcac
      541 tgggagggaa aagattctca agtttgaagg ctgttaccat ggccatgcag attccttcct
      601 tgttaaagca ggcagtggtg ttgccaccct cggcctccca gactcccctg gagtgcctaa
      661 gggagccacc gttgggactc taacagcacc ttataatgat gctgatgcgg ttaaaaagct
      721 gtttgaggat aacaaagggg agattgctgc agtcttcctt gagccggttg ttggcaatgc
      781 tggcttcatt cctccgcagc ctgctttcct aaatgctctc cgtgaggtga ccaaacaaga
      841 cggcgcactt ctggtgtttg atgaagtgat gactcctttc cgtttagctt atggtggggc
      901 acaagagtac tttggaatca cccctgatgt gacaaccttg ggccaaatta ttggcggtgg
      961 tcttccggtt ggtgcttacg gtggacggaa ggatatcatg gagatggttg ctccagcagg
     1021 gccaatgtac caggcaggaa ccctcagtgg aaaccctcta gctatgactg ctggaatcca
     1081 cactctcaag cgtctgatgg agcctggcac ctatgaatac ttagacaagg tcactggtga
     1141 acttgtccgg ggcatattgg atgtgggcgc taaaacaggg cacgagatgt gtggaggaca
     1201 catcagaggc atgttcggat tcttcttcgc aggtggccca gtgcacaact ttgatgatgc
     1261 caagaagagt gacacagcga agtttgggag gttccaccgt ggaatgctgg gcgaaggcgt
     1321 gtatctggca ccatcccagt tcgaggcagg ttttacaagc ttggcacaca ccacccaaga
     1381 cattgagaaa accgtggagg ctgccgagaa ggttcttcga tggatataga tgatttggat
     1441 tgcaaacctt ttgaagcttt tccttctgtt gtattctgtt agtttgtacg tggctgaagt
     1501 ttagttttgt attgtatttt gttgtgcagc agcagtatct tgtctctagc ccatttttct
     1561 tcttctgagt tagcatttgg ggtgattttg tcttggcaat aaaactttgg ctacgacctc
     1621 c
//
LOCUS       MUSSVSIV      541 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse seminal vesicle secretory protein IV (SVS IV) mRNA, 3' end.
ACCESSION   M35732
KEYWORDS    seminal vesicle secretory protein IV.
SOURCE      Mouse adult seminal vesicle, cDNA to mRNA, clone p2A2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 541)
  AUTHORS   Chen,Y.H., Pentecostt,B.T., McLachlan,J.A. and Teng,C.T.
  TITLE     The androgen-dependent mouse seminal vesicle secretory protein IV:
            Characterization and complementary deoxyribonucleic acid cloning
  JOURNAL   Mol. Endocrinol. 1, 707-716 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      329     seminal vesicle secretory protein IV (SVS IV)
                                 precursor (AA at 3)
    sigp    <     1       50     seminal vesicle secretory protein IV (SVS IV)
                                 signal peptide
    matp         51      326     seminal vesicle secretory protein IV (SVS IV)
    mRNA    <     1      541     SVS IV mRNA
BASE COUNT      154 a    107 c    131 g    149 t
ORIGIN      
        1 gtttgttcct cttttctctg cttctccttc tggtgacagg agccattggg aagaaaacta
       61 aggaaaaatt cttgcagtcg gaagaaactg tcagagagag cttctcgacg ggaagcagag
      121 gccatatgtc aagaagttct gagccagagg tatttgttag gccacaggac tccatcggtg
      181 acgaagcttc tgaggaaatg agtagtagta gtagtagtag aagaagaagt aagattatct
      241 ctagcagttc tgatggttct aatatggaag gtgagagttc atattcaaag agaaagaaga
      301 gccggttttc tcaagatgca ctcgagtgat actgcattga ccagctgaac atctggacca
      361 atatgctgga gccatatcgc cagaacagag cccatgatgt cttcagcata cagctcccat
      421 gtggtctcag aggcagtccc tggatggcat ttacttccca tgcttgtttg tcttgaggtt
      481 cttaaaccta acatttactc tggagctttc tttccaataa agagataaca attgcatcat
      541 t
//
LOCUS       NEMRPT        677 bp ds-DNA             INV       01-AUG-1990
DEFINITION  A.lumbricoides BamHI repetitive DNA.
ACCESSION   M35399
KEYWORDS    BamHI repetitive sequence.
SOURCE      A.lumbricoides DNA, clone AL700-1.
  ORGANISM  Ascaris lumbricoides
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; 
            Ascaridida; Ascaridina; Ascaridoidea; Ascarididae.
REFERENCE   1  (bases 1 to 677)
  AUTHORS   Warren,T. and Pasternak,J.J.
  TITLE     A related moderately repetitive DNA family in the nematodes Ascaris
            lumbricoides and Panagrellus silusiae
  JOURNAL   Nucleic Acids Res. 16, 10833-10847 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt           1      677     BamHI repeat
BASE COUNT      186 a    158 c    161 g    172 t
ORIGIN      
        1 ggatccgagt aagtgtgcaa aaacagcatt atttatgtaa acgaagctca attacatttc
       61 taagtgcaat tacggctgta tcacgggttg gcaactccat attccacgga aatccaccca
      121 ttcaacgggt gcaattcccg tgagtatcgt aaaataggag agtgaaagct cagaatgcgg
      181 ctagaatgtg tcatcttgtt gccaaatcgg agatatgtat cgtgtgaatt gacatgtatc
      241 atgccaaggt aggtcggaaa ggccaaagaa aagcggaaac cagacggtcg gaaagtacag
      301 aactcgattc ttgcgattgt gcatcttcga gttctggtaa gtgtaaatgc gagtccggtg
      361 tctgatcgga tctgatcggc cagtgccgag gcttacacgt gactatcaca tagtctcact
      421 ctttcactct tcccttttcg cgatttccga ttcagtgcta acaactcgac gtagacaccc
      481 cactctttct cctgcgcatt cctatgccgg tcaccgattg ggtcgcaaaa tgccaaagga
      541 cagggcatgt aagcccgcat cttaattgtt aagattcacc gatgaatcgt caaaaatttt
      601 gcaaaagcta gtggaaaacg gggttttgag gcccgttcca ccggcaaacc gtcatcgtgc
      661 gccgatcaga tggatcc
//
LOCUS       PNGRPT        682 bp ds-DNA             INV       01-AUG-1990
DEFINITION  P.silusiae BamHI repetitive DNA.
ACCESSION   M35398
KEYWORDS    BamHI repetitive sequence.
SOURCE      P.silusiae DNA, clone PS700-1.
  ORGANISM  Panagrellus silusiae
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; 
            Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae.
REFERENCE   1  (bases 1 to 682)
  AUTHORS   Warren,T. and Pasternak,J.J.
  TITLE     A related moderately repetitive DNA family in the nematodes Ascaris
            lumbricoides and Panagrellus silusiae
  JOURNAL   Nucleic Acids Res. 16, 10833-10847 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt           1      682     BamHI repeat
BASE COUNT      201 a    154 c    155 g    172 t
ORIGIN      
        1 ggatccgcag cgaattgtgt aaaacagcat taattatgta aaagaagctc aattaacctt
       61 tctaagtgca attgaggctg tatcacgggt tggcaacctc gtattccacg gaaatccacc
      121 cattcaacgg gtgcgatttc gtgtttttcg taaaaatcgg attctgaagg ctagaatccg
      181 gccagaatgt gtcatcttgt tccaaatgag agttatttga catctgaatc acatttgaaa
      241 tgcaaagaca ggtcggaaag gccaaacaag agcgaaaacc cgcgggtcgc caaaagtacc
      301 agaactcgat tcttgcgatt tttcgcattt tcgagttctg gtaagtgcaa aaagtttcga
      361 tttcggatct gcatcggaat ctgattgccc acgtgccaga aggcttaaaa acgtgcacaa
      421 accacatggt taccctttac cttgttttcg aaatttaaca aaaagtgcaa aaaccgggta
      481 aaaacccatc tttggcctgc gcattgccaa tggcggtcat cgatgggtcg cgaagtgcca
      541 aagggaccaa ggtgtaagcc cgcatcatat ctgttaagat tcatcgatga atcggccaat
      601 attttgaaaa gctagtggaa aaacgcgttt tgacgcccgt ttccaccggc aaaccgtcat
      661 cgtgcgccga tcagacggat cc
//
LOCUS       TETTRGA        75 bp ss-tRNA            RNA       01-AUG-1990
DEFINITION  T.thermophila Gln-tRNA-UUG.
ACCESSION   M35400
KEYWORDS    glutamine tRNA.
SOURCE      T.thermophila tRNA.
  ORGANISM  Tetrahymena thermophila
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H.
  TITLE     Dramatic events in ciliate evolution: Alteration of UAA and UAG
            termination codons to glutamine codons due to anticodon mutations
            in two Tetrahymena tRNAs-Gln
  JOURNAL   EMBO J. 5, 1307-1311 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       75     Gln-tRNA
    modified      9        9     m1g
    modified     10       10     m2g
    modified     13       13     p
    modified     19       19     d
    modified     20       20     d
    modified     34       34     um
    anticdn      34       36     Gln-tRNA anticodon ttg
    modified     39       39     p
    modified     48       48     m5c
    modified     54       54     p
    modified     57       57     m1a
BASE COUNT       15 a     19 c     21 g     18 t      2 others
ORIGIN      
        1 ggttgtatgg tgtagcggaa agcaccgagg actttgaatc ctctgacctg ggttcgaatc
       61 ccagtacgac ctcca
//
LOCUS       TETTRGB        75 bp ss-tRNA            RNA       01-AUG-1990
DEFINITION  T.thermophila Gln-tRNA-CUA.
ACCESSION   M35401
KEYWORDS    transfer RNA-Gln.
SOURCE      T.thermophila tRNA.
  ORGANISM  Tetrahymena thermophila
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H.
  TITLE     Dramatic events in ciliate evolution: Alteration of UAA and UAG
            termination codons to glutamine codons due to anticodon mutations
            in two Tetrahymena tRNAs-Gln
  JOURNAL   EMBO J. 5, 1307-1311 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       75     Gln-tRNA
    modified     10       10     m2g
    modified     13       13     p
    modified     19       19     d
    modified     20       20     d
    anticdn      34       36     Gln-tRNA anticodon cta
    modified     37       37     t6a
    modified     39       39     p
    modified     48       48     m5c
    modified     54       54     p
    modified     57       57     m1a
BASE COUNT       19 a     18 c     18 g     19 t      1 others
ORIGIN      
        1 ggttctatag tatagcgcaa agtactgggg antctaaatc ccttgacctg ggttcgaatc
       61 ccagtaggac ctcca
//
LOCUS       TETTRGC        75 bp ss-tRNA            RNA       01-AUG-1990
DEFINITION  T.thermophila Gln-tRNA-UUA.
ACCESSION   M35402
KEYWORDS    transfer RNA-Gln.
SOURCE      T.thermophila tRNA.
  ORGANISM  Tetrahymena thermophila
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H.
  TITLE     Dramatic events in ciliate evolution: Alteration of UAA and UAG
            termination codons to glutamine codons due to anticodon mutations
            in two Tetrahymena tRNAs-Gln
  JOURNAL   EMBO J. 5, 1307-1311 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       75     Gln-tRNA
    modified     10       10     m2g
    modified     13       13     p
    modified     16       16     d
    modified     19       19     d
    modified     20       20     d
    modified     32       32     cm
    modified     34       34     um
    anticdn      34       36     Gln-tRNA anticodon tta
    modified     37       37     t6a
    modified     39       39     p
    modified     48       48     m5c
    modified     54       54     p
    modified     57       57     m1a
BASE COUNT       16 a     17 c     20 g     19 t      3 others
ORIGIN      
        1 ggttccatag tatagdggdd agtactgggg actttaaatc ccttgacctg ggttcgaatc
       61 ccagtgggac ctcca
//
LOCUS       BEGRR5S       120 bp ss-rRNA            RNA       01-AUG-1990
DEFINITION  B.alba 5S ribosomal RNA.
ACCESSION   M35565
KEYWORDS    5S ribosomal RNA.
SOURCE      B.alba (strain B18LD) rRNA.
  ORGANISM  Beggiatoa alba
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; 
            Beggiatoaceae.
REFERENCE   1  (bases 1 to 120)
  AUTHORS   Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
            Pace,N.R.
  TITLE     Phylogenetic analysis of certain sulfide-oxidizing and related
            morphologically conspicuous bacteria by 5S ribosomal ribonucleic
            acid sequences
  JOURNAL   Int. J. Syst. Bacteriol. 37, 116-122 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      120     5S ribosomal RNA
BASE COUNT       32 a     32 c     29 g     27 t
ORIGIN      
        1 ttcttggcga ccatagcaaa taggaaccac ccgaccccat cccgaactcg gtagtgaaac
       61 tgttctgcgc cgatgatagt gtggatactc tccatgtgaa agtaggttat cgccaagagc
//
LOCUS       ECOHEMC      2092 bp ds-DNA             BCT       01-AUG-1990
DEFINITION  E.coli porphobilinogen deaminase (hemC) and uroporphyrinogen III
            synthase (hemD) genes, complete cds.
ACCESSION   X04242 M35827
KEYWORDS    deaminase; hemC gene; hemD gene; porphobilinogen deaminase;
            uroporphyrinogen III synthase.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1957)
  AUTHORS   Thomas,S.D. and Jordan,P.M.
  TITLE     Nucleotide sequence of the hemC locus encoding porphobilinogen
            deaminase of Escherichia coli K12
  JOURNAL   Nucleic Acids Res. 14, 6215-6226 (1986)
  STANDARD  simple automatic
REFERENCE   2  (bases 1290 to 2092)
  AUTHORS   Jordan,P.M., Mgbeje,B.I.A., Thomas,S.D. and Alwan,A.F.
  TITLE     Nucleotide sequence for the hemD gene of Escherichia coli encoding
            uroporphyrinogen III synthase and initial evidence for a hem operon
  JOURNAL   Biochem. J. 249, 613-616 (1988)
  STANDARD  simple staff_review
COMMENT     Data kindly reviewed (11-SEP-1986) by P. Jordan
FEATURES       from  to/span     description
    pept        390     1331     porphobilinogen deaminase (hemC)
    pept       1328     2068     uroporphyrinogen III synthase (hemD)
    signal      330      335     put. -35 region
    rpt          63       67     inverted repeat A
    rpt          78       82     direct repeat 1
    rpt         349      853     inverted repat A'
    signal      354      359     put. -10 region
    rpt         356      360     direct repeat 1
    rpt         367      371     direct repeat 1
    binding     377      381     put. ribosome binding site
    signal     1508     1522     pot. transcription termination signal
BASE COUNT      495 a    540 c    566 g    491 t
ORIGIN      
        1 caagacgtat cgcctgattt gctacccgtc atgactgtga ttccgccaac atcaacggta
       61 acacgcggca ttcgggatat ttcgtatgtc aaaggtaacc gttaccactt ttcgcgcctg
      121 gtttttttag tttcacgacg aaaaaatggt ctaaaacgtg atcaatttaa caccttgctg
      181 attgaccgta aagaaagatg cgctacatac aagtgtagca ccgtttattc tctgtaaatt
      241 ccttattaca acggcgtgaa acgcctgtca ggatccactg ccagacctca ttttacggtt
      301 tgcgcaggcg tctacgtttc accacaacac tgacatcact ctggcaagga tgttaggatg
      361 gaccacggat gataatgacg gtaacaagca tgttagacaa tgttttaaga attgccacac
      421 gccaaagccc acttgcactc tggcaggcac actatgtcaa agacaagttg atggcgagcc
      481 atccgggcct ggtcgttgaa ctggtaccga tggtgacgcg cggcgatgtg attcttgata
      541 cgccgctggc gaaagtaggc ggaaaaggct tatttgtaaa agagctggaa gtcgcgctcc
      601 tcgaaaatcg cgccgatatc gccgtacact caatgaaaga tgtgccggtt gaattcccgc
      661 aaggtctggg actggtcact atttgtgagc gtgaagatcc tcgcgatgcc tttgtgtcca
      721 ataactatga cagtctggat gcgttaccgg caggcagtat cgtcgggacg tccagtttac
      781 gtcgccagtg ccaactggct gaacgccgtc cggatctgat tatccgctcc ctgcgcggca
      841 acgtcggcac tcgcctgagc aaactggata acggcgaata cgatgccatc attcttgccg
      901 tagccggact aaaacgttta ggtctggagt cacgtattcg cgccgcgttg ccacccgaga
      961 tttctcttcc ggcggtagga caaggtgcgg tgggtattga atgccgcctt gatgattcac
     1021 gcactcgcga gctgcttgcc gcgctgaatc accacgaaac tgcactgcgc gttaccgcag
     1081 aacgcgccat gaatacccgt ctcgaaggcg catgtcaggt gccaattggt agctacgccg
     1141 agcttattga tggcgaaatc tggctgcgtg ggctggtcgg cgcgccggac ggttcgcaga
     1201 ttattcgcgg tgaacgccgc ggtgcgccgc aagatgccga acaaatgggg atttcgctgg
     1261 cagaagagct actgaataac ggcgcgcgcg agatcctcgc tgaagtctat aacggagacg
     1321 ccccggcatg agtatccttg tcacccgccc gtctcccgct ggagaagagt tagtgagccg
     1381 tctgcgcaca ctggggcagg tggcctggca ttttccgctg attgagtttt ctccgggtca
     1441 acaattaccg caacttgctg atcaactggc agcgctgggg gagagcgatc tgttgtttgc
     1501 cctctcgcaa cacgcggttg cttttgccca atcacagctg catcagcaag atcgtaaatg
     1561 gccccgacta cctgattatt tcgccattgg acgcaccacc gcactggcac tacataccgt
     1621 aagtggacag aagattctct acccgcagga tcgggaaatc agcgaagtct tgctacaatt
     1681 acctgaatta caaaatattg cgggcaaacg tgcgctgata ttacgtggca atggtggtcg
     1741 tgagctaatt ggggataccc tgacggcgcg cggtgctgag gtcacttttt gtgaatgtta
     1801 tcaacgatgc gcaatccatt acgatggtgc agaagaagcg atgcgctggc aagcccgcga
     1861 ggtgacgatg gtcgttgtta ccagcggtga aatgttgcag caactctggt cactgatccc
     1921 acaatggtat cgtgagcact ggttactaca ctgtcgacta ttggtcgtca gtgagcgttt
     1981 ggcgaaactc gcccgggaac tgggctggca agacattaag gtcgccgata acgctgacaa
     2041 cgatgcgctt ttacgggcat tacaataact ctcataacag gaagccataa tg
//
LOCUS       LTTRR5S       117 bp ss-rRNA            RNA       01-AUG-1990
DEFINITION  L.discophora 5S ribosomal RNA.
ACCESSION   M35569
KEYWORDS    5S ribosomal RNA.
SOURCE      L.discophora (strain Stokes) rRNA.
  ORGANISM  Leptothrix discophora
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Budding and/or appendaged bacteria; Prosthecate bacteria.
REFERENCE   1  (bases 1 to 117)
  AUTHORS   Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
            Pace,N.R.
  TITLE     Phylogenetic analysis of certain sulfide-oxidizing and related
            morphologically conspicuous bacteria by 5S ribosomal ribonucleic
            acid sequences
  JOURNAL   Int. J. Syst. Bacteriol. 37, 116-122 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      117     5S ribosomal RNA
BASE COUNT       27 a     35 c     32 g     23 t
ORIGIN      
        1 atgcctgacg accatagcga ggtggtccca ctccttccca tcccgaacag gacagtgaaa
       61 cgcctcagcg ccgatgatag tgcgcattcg cgtgtgaaag taggtcatcg tcaggct
//
LOCUS       TBSACG       4776 bp ss-RNA             VRL       01-AUG-1990
DEFINITION  Tomato bushy stunt virus complete genome.
ACCESSION   M21958 M31019
KEYWORDS    capsid protein; coat protein; complete genome; p19 protein;
            p22 protein; p33 protein; p41 protein; p92 protein.
SOURCE      Tomato bushy stunt virus (strain cherry), cDNA to viral RNA.
  ORGANISM  Tomato bushy stunt virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Tombusvirus.
REFERENCE   1  (bases 2621 to 4776)
  AUTHORS   Hillman,B.I., Hearne,P., Rochon,D. and Morris,T.J.
  TITLE     Organization of tomato bushy stunt virus genome: Characterization
            of the coat protein gene and the 3' terminus
  JOURNAL   Virology 169, 42-50 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2620)
  AUTHORS   Hearne,P.Q., Knorr,D.A., Hillman,B.I. and Morris,T.J.
  TITLE     The complete genome structure and synthesis of infectious RNA from
            clones of tomato bushy stunt virus
  JOURNAL   Virology 177, 141-151 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.Q.Hearne, 16-DEC-1988.  Draft entry and computer-readable
            sequence for [2] kindly submitted by D.Knorr, 21-DEC-1989.
            
            The 5' terminal nucleotide was not determined.  However, in the
            infectious constructs, two 5' terminal "g" residues are added, one
            of which is removed during subsequent replication in host plants.
FEATURES       from  to/span     description
    pept        166     1056     p33 protein
    pept        166     2622     p92 protein (read-through of p33)
    pept       2652     3818     p41 capsid protein
    pept       3888     4406     p19 protein
    pept       3856     4425     p22 protein
    mRNA       2621     4776     2.2kb subgenomic mRNA
    mRNA       3841     4776     0.9kb subgenomic mRNA
BASE COUNT     1257 a    983 c   1315 g   1220 t      1 others
ORIGIN      
        1 naaattctcc aggatttctc gacctagttc gtttatctgg tgacttgcgc taccgttgct
       61 ttgcgtagag aatttctctc cataattatt atctttagtt gtggggtttg aaggttgggt
      121 ctacctttcg gggggataaa ttgtaacttc caacaaacaa gcgacatgga gaccatcaag
      181 agaatgattt ggcctaagaa agagattttt gtgggtgatt tcgcaaccgg agtgaatagg
      241 acagttccgg tgaacatctt tcaattggtg tgtcgtgtgg ttctgagata catgaggaca
      301 gggaaaatag agtgtgattc tgacagcatg actaagttta tagttgaatt actcaaaact
      361 gattgtgctg ccaaatggga atggttcatg aagagacggc agaggggtga ttacattgtc
      421 cctctatcta tagcctccat accaatcata ccgctgttga gttatgccac tagggtacgc
      481 gcagtctcag tcaaggcttt tggcaatgaa ctatcgttca atgtcagggt gcctagacca
      541 tctgtaccta agaaaggatt gctcctcaga ctggcggcag gtctagcgtt agctcctata
      601 tgcgcgctgg ccgtgtacgc taccctacct agggaaaaac tgtcggtatt taagctgaga
      661 actgaggcac gagcacacat ggaggatgag agagaagcga cagattgtct ggtggttgag
      721 ccggcaaggg aacttaaggg taaagatggt gaggatctcc tcactggtag tagattgact
      781 aaggtgatcg cgtccactgg gcgccctcgt cgaagacctt atgcggcaaa gatcgcacag
      841 gtggcgagag caaaggtggg ttaccttaag aacagtccag agaatagact aatctaccag
      901 agggtgatga tcgagatcat ggacaaagac tgcgtcaggt atgttgacag ggatgtcata
      961 ttgcctttgg ctattggatg ctgttttgtc tatccggatg gagtggagga gtcggcggca
     1021 ctatggggct cacaggagtc cctgggtgtc aaatagggag gcctagtacg tctacctggg
     1081 gttgtaacac agatcaatcg agatatccca tctgatgtgt tacttcctca ggaggtgcta
     1141 gaggttcgta caggacctcc caatgctaag gaccgtaata tatttatggt tgcaggttgc
     1201 ccatcacagg cacggttctt agtacataat cactgcctga aaaaccttaa aaggggtctt
     1261 gtggagagag tcttctgcgt agagagaaac gggaagctcg ctcgcactcc acaacctacc
     1321 aaaggagcct ttggacgtct ttccccgttc aggaaagcgg tttgtgagaa ggttggggta
     1381 gcccaccgac ttgggtatga tgggtttctg tcatactaca gcggtgcgaa actccgtact
     1441 tacacacgag ccgtggagag tctgcatatc acacctgtct ccgagaggga tagtcacttg
     1501 actaccttcg taaaagcaga gaagatatcg acgtctaagg gtgacccagc acctcgggtg
     1561 attcagcctc gaaacccgag gtacaatgtg gaacttggaa gatatctacg gcatatggaa
     1621 tccaagctga tgaaagctgt tgatggcgtg ttcggagaga cgacatgcat caaaggatac
     1681 acagctgatg aggtaggtgc aattttccgg gctaaatggg acaggtttga taagcctgtc
     1741 gccatagggc tcgatgcatc taggtttgat caacactgtt ccgttgaagc attgcaatat
     1801 gagcatagct tctacagggc catgtaccct ggcaacaagc tcttgggcaa gttgttggaa
     1861 tggcagctcc ataataaagg taaaggttat gttccagatg gaactataac ctatcgcaag
     1921 gagggctgtc gcatgagtgg ggatataaac acctcgttgg gcaactatct actgatgtgt
     1981 gcaatggtac atgggtacat gcgtcatctg gggattaatg agtttagtct ggcaaactgt
     2041 ggggatgatt gcgtcctaat tgtcgaacgc aggaatctta agcagataca gagaacttta
     2101 ccggagtatt tcctcaatct gggatatact atgaaggtgg agcaacctgt atttcaactg
     2161 gaagaggttg aattttgcca ggcacaccca gtacagtttc aaggcggttg gaagatggtt
     2221 cgaaacgtcc gtactgctat gagcaaggat gtgcactgtg tcaacaatat acgcgatttg
     2281 gcgacgagga gagcttggag taatgctcaa catcatgggg gtctagcgct tagtgctggt
     2341 attccagttg tggagacgtt ttactctagg tttaagcttt atgatgtacc tcgtaaacat
     2401 caacgtattg acacggtcac aaatgtgcac aagtggcgtg gatccggtgg gagttatgtt
     2461 gtgacccctg aatctagggc tagcttttgg gctgcctttg gactcacggg ggatgagcaa
     2521 ctggctctgg aggaccgtct ggaaagatgg gagatggatc tgtttggaga ggagggtgtt
     2581 gacgctcatg agcccagcat cctcgactcc gccgtagctt gaccaagaat acacacacgc
     2641 aggatagaca catggcaatg gtaaagagaa acaacaacac gggaatgatc ccggtgagta
     2701 caaagcaatt actggcattg ggtgcggccg ctggggccac agccttgcag ggatttgtca
     2761 agaataatgg gatggccatc gttgaggggg ctgtcgatct gactaaaaga gcgtacaaag
     2821 cagtgcggag aagaggaggt aagaaacagc agatgattaa tcatgtaggt ggtacaggtg
     2881 gtgctataat ggcgccggta gcagtgacta gacaacttgt cggtagtaag cctaagttta
     2941 ctggcaggac gtctggctct gtcacagtta cccaccgtga gtatctgtca caagtgaata
     3001 attccacggg tttccaagtt aatgggggaa ttgtcggcaa tttgttacag cttaacccgt
     3061 tgaatggtac attgttctct tggttgccag cgatagcatc caattttgat cagtacacat
     3121 tcaacagcgt tgtgctacat tatgtgcccc tatgttcaac tactgaggta gggagagtgg
     3181 ctatttactt tgataaggac tcagaagatc cagaacctgc tgatagagtt gagttggcga
     3241 attacagcgt gcttaaagag acagcccctt gggctgaagc gatgcttagg gtacccaccg
     3301 ataagattaa gagattttgt gatgacagtt ccacatctga tcacaaactt atcgacttgg
     3361 gtcaattggg cattgctaca tatggtggcg ctgggactaa tgctgtgggg gatatcttta
     3421 tctcgtacag tgttacgtta tatttccctc aacctacgaa cacactcctt agtaccagaa
     3481 ggctcgacct tgctggcgct cttgtcacag catctggccc tggatacctc ctggtgtcta
     3541 ggactgccac tgtattgaca atgacattcc gtgctacagg cacgtttgtc atatccggga
     3601 cgtatcggtg cctcacggca acaacgttag gcttggctgg cggagtgaat gtcaatagta
     3661 tcacagttgt agataacata ggtacagaca gtgcgttttt cataaattgt actgtctcta
     3721 acctaccatc tgtggtgaca ttcacatcta ccggtatcac atctgccaca gtacattgcg
     3781 tgcgcgcgac acgacagaat gatgtttctc taatttagtg tgtcctgcga ggggcctctt
     3841 gaacaagacc agttcatgga tactgaatac gaacaagtca ataaaccatg gaacgagcta
     3901 tacaaggaaa cgacgctagg gaacaagcta acagtgaacg ttgggatgga ggatcaggag
     3961 gtaccacttc tcccttcaaa cttcctgacg aaagtccgag ttggactgag tggcggctac
     4021 ataacgatga gacgaattcg aatcaagata atccccttgg tttcaaggaa agctggggtt
     4081 tcgggaaagt tgtatttaag agatatctca gatacgacag gacggaagct tcactgcaca
     4141 gagtccttgg atcttggacg ggagattcgg ttaactatgc agcatctcga tttttcggtt
     4201 tcgaccagat cggatgtacc tatagtattc ggtttcgagg agttagtatc accgtttctg
     4261 gagggtcgcg aactcttcag catctctgtg agatggcaat tcggtctaag caagaactgc
     4321 tacagcttgc cccaatcgaa gtggaaagta atgtatcaag aggatgccct gaaggtactg
     4381 agaccttcga aaaagaaagc gagtaagaca gactcttcag tctgagtttg tggagatgag
     4441 tgtaaatctg gcatagcata caggttactc ttgttgggtt ctggatgtta ggatgacgag
     4501 tcgactcggg ctccgcacta ggtttggtcg cctaggggat ggagatatgg aaagggtctc
     4561 gtgtggtatc agtcggtcga aagacgcgct tccaacatgg gcctatggtc ggataagtct
     4621 tagcaatacc agccagcatg aattggattc ctgtttacga aagttaggtg tcacttgtgg
     4681 aagcggaccc agacacggtt gatctcaccc ttcggggggc tatagagatc gctggaagca
     4741 ctaccggaca accggaacat tgcagaaatg cagccc
//
LOCUS       THTRR5S       122 bp ss-rRNA            RNA       01-AUG-1990
DEFINITION  T.nivea 5S ribosomal RNA.
ACCESSION   M35563
KEYWORDS    5S ribosomal RNA.
SOURCE      T.nivea (strain JP2) rRNA.
  ORGANISM  Thiothrix nivea
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; 
            Leucotrichaceae.
REFERENCE   1  (bases 1 to 122)
  AUTHORS   Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
            Pace,N.R.
  TITLE     Phylogenetic analysis of certain sulfide-oxidizing and related
            morphologically conspicuous bacteria by 5S ribosomal ribonucleic
            acid sequences
  JOURNAL   Int. J. Syst. Bacteriol. 37, 116-122 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      122     5S ribosomal RNA
BASE COUNT       27 a     35 c     36 g     24 t
ORIGIN      
        1 tttgcctggt gtccatagag cactggaacc acctgatccc atcccgaact cagaagtgaa
       61 acggtgcatc gccgatggta gtgtggggcc tccccatgtg agagtaggtc aacgccaggc
      121 gc
//
LOCUS       THVRR5S       123 bp ss-rRNA            RNA       01-AUG-1990
DEFINITION  Thiovulum sp. 5S ribosomal RNA.
ACCESSION   M35570
KEYWORDS    5S ribosomal RNA.
SOURCE      Thiovulum sp. rRNA.
  ORGANISM  Thiovulum sp.
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Colorless sulfur bacteria.
REFERENCE   1  (bases 1 to 123)
  AUTHORS   Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
            Pace,N.R.
  TITLE     Phylogenetic analysis of certain sulfide-oxidizing and related
            morphologically conspicuous bacteria by 5S ribosomal ribonucleic
            acid sequences
  JOURNAL   Int. J. Syst. Bacteriol. 37, 116-122 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      123     5S ribosomal RNA
BASE COUNT       30 a     28 c     30 g     35 t
ORIGIN      
        1 tttggttggt gattacagag aaaaggtcac actcagctcc atttcgaacc tgaaagttaa
       61 gcttttcttc gtcgataata ctgcccccta cgggggtggg acggtagatc gttgccaacc
      121 att
//
LOCUS       VITRR5S       118 bp ss-rRNA            RNA       01-AUG-1990
DEFINITION  V.beggiatoides 5S ribosomal RNA.
ACCESSION   M35566
KEYWORDS    5S ribosomal RNA.
SOURCE      V.beggiatoides (strain B23SS) rRNA.
  ORGANISM  Vitreoscilla beggiatoides
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; 
            Beggiatoaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
            Pace,N.R.
  TITLE     Phylogenetic analysis of certain sulfide-oxidizing and related
            morphologically conspicuous bacteria by 5S ribosomal ribonucleic
            acid sequences
  JOURNAL   Int. J. Syst. Bacteriol. 37, 116-122 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      118     5S ribosomal RNA
BASE COUNT       30 a     35 c     31 g     22 t
ORIGIN      
        1 cgcctgacga ccacagcgac tgtgaaccac ccgaccccat ctcgaactcg gtagtgaaac
       61 cagtcagcgc cgatgatagt gtggcatatg ccatgtgaaa gtaggtcatc gtcaggct
//
LOCUS       VITRR5SX      118 bp ss-rRNA            RNA       01-AUG-1990
DEFINITION  V.stercoraria 5S ribosomal RNA.
ACCESSION   M35567
KEYWORDS    5S ribosomal RNA.
SOURCE      V.stercoraria (strain VT1) rRNA.
  ORGANISM  Vitreoscilla stercoraria
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; 
            Beggiatoaceae.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
            Pace,N.R.
  TITLE     Phylogenetic analysis of certain sulfide-oxidizing and related
            morphologically conspicuous bacteria by 5S ribosomal ribonucleic
            acid sequences
  JOURNAL   Int. J. Syst. Bacteriol. 37, 116-122 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      118     5S ribosomal RNA
BASE COUNT       30 a     32 c     30 g     26 t
ORIGIN      
        1 tgtttgacga ccatagcgag ttggtcccac gccttcccat cccgaacagg accgtgaaac
       61 gacttagcgc cgatgatagt gtggattacc catgtgaaag taggtcatcg tcaaacgc
//
LOCUS       VITRR5SXX     116 bp ss-rRNA            RNA       01-AUG-1990
DEFINITION  V.filiformis 5S ribosomal RNA.
ACCESSION   M35568
KEYWORDS    5S ribosomal RNA.
SOURCE      V.filiformis (strain ATCC 15551) rRNA.
  ORGANISM  Vitreoscilla filiformis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; 
            Beggiatoaceae.
REFERENCE   1  (bases 1 to 116)
  AUTHORS   Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and
            Pace,N.R.
  TITLE     Phylogenetic analysis of certain sulfide-oxidizing and related
            morphologically conspicuous bacteria by 5S ribosomal ribonucleic
            acid sequences
  JOURNAL   Int. J. Syst. Bacteriol. 37, 116-122 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      116     5S ribosomal RNA
BASE COUNT       27 a     34 c     31 g     24 t
ORIGIN      
        1 gcctgatgac catagcaagg tggtcccact ccttcccatc ccgaacagga cagtgaaacg
       61 ccttagcgcc gatgatagtg cggttctccc gtgtgaaagt aggacatcgt caggct
//
LOCUS       PVICSA       1895 bp ds-DNA             INV       01-AUG-1990
DEFINITION  Plasmodium vivax circumsporozoite protein gene, complete cds.
ACCESSION   M11926 M20671 J04090
KEYWORDS    circumsporozoite protein.
SOURCE      P.vivax (strain Belem) DNA.
  ORGANISM  Plasmodium vivax
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 1529)
  AUTHORS   Arnot,D.E., Barnwell,J.W., Tam,J.P., Nussenzweig,V.,
            Nussenzweig,R.S. and Enea,V.
  TITLE     Circumsporozoite protein of Plasmodium vivax: Gene cloning and
            characterization of the immunodominant epitope
  JOURNAL   Science 230, 815-818 (1985)
  STANDARD  simple staff_review
REFERENCE   2  (bases 158 to 1294; revises [1])
  AUTHORS   Arnot,D.E., Barnwell,J.W. and Stewart,M.J.
  TITLE     Does biased gene conversion influence polymorphism in the
            circumsporozoite protein-encoding gene of Plasmodium vivax?
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106 (1988)
  STANDARD  full staff_entry
REFERENCE   3  (bases 1 to 157; 1295 to 1895; revises [1])
  AUTHORS   Arnot,D.E.
  JOURNAL   Unpublished (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2],[3] kindly
            submitted by D.E.Arnot, 14-SEP-1988.
FEATURES       from  to/span     description
    pept        158     1294     circumsporozoite protein
BASE COUNT      674 a    347 c    471 g    403 t
ORIGIN      
        1 ctgcataagg caaactcaca aacatccaaa aaaatataca tatatatatt tatatacacg
       61 tgtatatatt attaagcggc ttaagttaag caagcaaaac agccaaaggc ctacaagtgt
      121 aaacagcttc ctgcacacac gtatatacca gaacaagatg aagaacttca ttctcttggc
      181 tgtttcttcc atcctgttgg tggacttgtt ccccacgcac tgcgggcaca atgtagatct
      241 gtccaaggcc ataaatttaa atggagtaaa cttcaataat gtagacgcca gttcacttgg
      301 cgcggcacac gtaggacaaa gtgctagccg aggcagagga cttggtgaga acccagatga
      361 cgaggaagga gatgctaaaa aaaaaaagga tggaaagaaa gcagaaccaa aaaatccacg
      421 tgaaaataag ctgaaacaac caggagacag agcagatgga cagccagcag gagacagagc
      481 agatggacag ccagcaggtg atagagcaga tggacaacca gcaggagata gagcagctgg
      541 acaaccagca ggagatagag cagatggaca gccagcagga gacagagcag atggacagcc
      601 agcaggagac agagcagatg gacaaccagc aggagacaga gcagatggac aaccagcagg
      661 tgatagagca gctggacaac cagcaggtga tagagcagct ggacaaccag caggagatag
      721 agcagatgga cagccagcag gagatagagc agctggacag ccagcaggag atagagcaga
      781 tggacagcca gcaggagata gagcagctgg acagccagca ggagatagag cagatggaca
      841 gccagcagga gatagagcag ctggacagcc agcaggagat agagcagctg gacagccagc
      901 aggagataga gcagctggac agccagcagg agatagagca gctggacagc cagcaggaaa
      961 tggtgcaggt ggacaggcag caggaggaaa cgcaggagga ggacagggac aaaataatga
     1021 aggtgcgaat gccccaaatg aaaagtctgt gaaagaatac ctagataaag ttagagctac
     1081 cgttggcacc gaatggactc catgcagtgt aacctgtgga gtgggtgtaa gagtcagaag
     1141 aagagttaat gcagctaaca aaaaaccaga ggatcttact ttgaatgacc ttgagactga
     1201 tgtttgtaca atggataagt gtgctggcat atttaacgtt gtgagtaatt cattagggct
     1261 agtcatattg ttagtcctag cattattcaa ttaagtagct gacatccatt attttcggcg
     1321 tcctccacgg tgcatattaa gtgttttgtg ttttgtacat gcacataaat acttgcccgt
     1381 agggacatga tttttttccc tttcttatga atgttccctg ctgtttgcac gtaactgtat
     1441 gtacgtgcgc gtaaggcata gtaagtaaca cctcttacac attatgcgct tacgcacaat
     1501 cagttgtgca attctagaaa acacgatatg agtattttta aacacttatc gtccaaaaaa
     1561 acaaaaaaaa cagaaaaaac agaaaaaaca gaaaaaacaa aaaaaaacaa aaaaaaacaa
     1621 aaaaaaacaa aaaaaacaca tttatattaa cttttccttt ttgattgacc cttttttgac
     1681 gtatattttt tttttttttt cgtatgtatt atatatactg cttaacgtag agaacttaaa
     1741 ttttgagaat gtattttttt ttaacaagtt aaaaaaagaa ctggtatttt tgggaattca
     1801 aaaaatttgc aaattcaaaa gaggcgagtt aaaatttgcg ccgtggcaaa cggggtgcgt
     1861 gcgggagtcg tgcaaatgtg gcttatatcc ggggg
//
LOCUS       PVICSC       1375 bp ds-DNA             INV       01-AUG-1990
DEFINITION  Plasmodium vivax circumsporozoite protein gene, 3' end.
ACCESSION   M20670 J04090
KEYWORDS    circumsporozoite protein.
SOURCE      P.vivax (strain North Korean) DNA.
  ORGANISM  Plasmodium vivax
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 1105)
  AUTHORS   Arnot,D.E., Barnwell,J.W. and Stewart,M.J.
  TITLE     Does biased gene conversion influence polymorphism in the
            circumsporozoite protein-encoding gene of Plasmodium vivax?
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1106 to 1375)
  AUTHORS   Arnot,D.E.
  JOURNAL   Unpublished (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Arnot, 14-SEP-1988.
FEATURES       from  to/span     description
    pept    <     1     1105     circumsporozoite protein (AA at 2)
BASE COUNT      464 a    260 c    407 g    244 t
ORIGIN      Sau3AI site.
        1 agatctgtcc aaggccataa atttaaatgg agtaaacttc aataatgtag acgccagttc
       61 acttggcgcg gcacacgtag gacaaagtgc tagccgaggc agaggacttg gtgagaaccc
      121 agatgacgag gaaggagatg ctaaaaaaaa aaaggatgga aagaaagcag aaccaaaaaa
      181 tccacgtgaa aataagctga aacaaccagg agacagagca gatggacagc cagcaggaga
      241 cagagcagat ggacagccag caggagacag agcagatgga caggcagcag gaaatggtgc
      301 aggtggacag ccagcaggtg atagagcagc tggacaacca gcaggcgatg gagcagctgg
      361 acagccagca ggcgatagag cagatggaca gccagcagga gatagagcag ctggacagcc
      421 agcaggcgat agagcagatg gacagccagc aggagataga gcagctggac agccagcagg
      481 cgatagagca gatggacagc cagcaggaga tagagcagct ggacaggcag caggaaatgg
      541 tgcaggtgga caggcagcag gaaatggtgc aggtggacaa ccagcaggag atagagcagc
      601 tggacagcca gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca
      661 gccagcagga gatagagcag ctggacagcc agcaggagat agagcagctg gacaggcagc
      721 aggaaatggt gcaggtggac aggcagcagg aggaaatgcg gcaaacaaga aggcagaaga
      781 cgcaggagga aacgcaggag gaaacgcagg aggacaggga caaaataatg aaggtgcgaa
      841 tgccccaaat gaaaagtctg tgaaagaata cctagataaa gttagagcta ccgttggcac
      901 cgaatggact ccatgcagtg taacctgtgg agtgggtgta agagtcagaa gaagagttaa
      961 tgcagctaac aaaaaaccag aggatcttac tttgaatgac cttgagactg atgtttgtac
     1021 aatggataag tgtgctggca tatttaacgt tgtgagtaat tcattagggc tagtcatatt
     1081 gttagtccta gcattattca attaagtagc tgacatccat tattttcggc gtcctccacg
     1141 gtgcatatta agtgttttgt gttttgtaca tgcacataaa tacttgcccg tagggacatg
     1201 atttttttcc ctttcttatg aatgttccct gctgtttgca cgtaactgta tgtacgtgcg
     1261 cgtaaggcat agtaagtaac acctcttaca cattatgcgt tacgcacaat cagttgtgca
     1321 attctagaaa acacgatatg agtattttta aacacttatc gtgaccaaaa aaaca
//
LOCUS       ECOHSEST      360 bp ds-DNA             BCT       01-AUG-1990
DEFINITION  E.coli heat-stable enterotoxin gene, complete cds.
ACCESSION   M34916
KEYWORDS    heat-stable enterotoxin.
SOURCE      E.coli (strain 153837-2) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Moseley,S.L., Hardy,J.W., Huq,M.I., Echeverria,P. and Falkow,S.
  TITLE     Isolation and nucleotide sequence determination of a gene encoding
            a heat-stable enterotoxin of Escherichia coli
  JOURNAL   Infect. Immun. 39, 1167-1174 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         48      266     heat-stable enterotoxin
    signal      268      301     pot. transcription termination signal
BASE COUNT      115 a     54 c     65 g    126 t
ORIGIN      
        1 ttctggtttt gattcaaatg ttcgtggatg ccatgtccgg aggtaatatg aagaaatcaa
       61 tattatttat ttttctttct gtattgtctt tttcaccttt ccctcaggat gctaaaccag
      121 tagagtcttc aaaagaaaaa atcacactag aatcaaaaaa atgtaacatt gcaaaaaaaa
      181 gtaataaaag tggtcctgaa agcatgaata gtagcaatta ctgctgtgaa ttgtgttgta
      241 atcctgcttg taccgggtgc tattaataat ataaagggaa ctaaacagtt ccctttatat
      301 ttgttctgat tctgatgatg tctgtaacgt atgtacctgt tgctttgttg aataaatcga
//
LOCUS       HUMRENA1      826 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human renin gene, exon 1.
ACCESSION   M10030 M34914
KEYWORDS    aspartyl protease; renin.
SEGMENT     1 of 5
SOURCE      Human fetal liver DNA (library of Lawn et al.), clone lambda-III.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 826)
  AUTHORS   Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
            Morris,B.J. and Shine,J.
  TITLE     Primary structure of the human renin gene
  JOURNAL   DNA 3, 457-468 (1984)
  STANDARD  full staff_review
REFERENCE   2  (bases 276 to 583)
  AUTHORS   Shine,J., Hardman,J.A., Hort,Y.J., Tellam,J.T., Catanzaro,D.F.,
            Morris,B.J. and Baxter,J.D.
  TITLE     Structure of the human renin gene
  JOURNAL   Trans Assoc Am Physicians 97, 63-69 (1984)
  STANDARD  simple staff_review
COMMENT     There is only a single renin gene in the human haploid genome [1].
            It is comprised of 10 exons encoding 406 amino acids.  The first
            intron separates the 5' untranslated region and the signal peptide
            coding region from the remainder of the gene.  Exon 2 comprises
            most of the sequence coding for the pro portion of the enzyme.
            Precise boundaries were not indicated by in figure 2 of [1], but
            were taken from the text and from other human renin entries.
FEATURES       from  to/span     description
    pept        626  +   723     preprorenin /hgml_locus_uid="LW0050B"
                                 /nomgen="REN" /map="1q32"
    sigp        626      685     renin signal peptide
    pre-msg     584  >   826     renin mRNA [1]
    IVS         724  >   826     renin intron A
    site        520      521     ga in [1]; gagca in [2]
BASE COUNT      190 a    226 c    222 g    188 t
ORIGIN      Chromosome 1q32; 437 bp upstream of KpnI site.
        1 gatctaccca ccttggcctc ccaaagtgct gggacaggtg tgagccacca tgcctggccc
       61 ctctactctt ataattaaac cagctgttgc ttttcctgcc aagaaaccag tcatgaagat
      121 tcacccatgt tctagatggg aaaactgggc tgtagctggg agaggccagt cagggacaaa
      181 gccaaagtta atatagagaa tggagcttcc agggtatagg ggttgggtct gggctaggga
      241 gctggaaacc taggttttac gcttgtccca gttttgatgt tagccctgac agtgctgttt
      301 ctcatcagcc tctgcctgct ccaggggtca cagggccaag ccagatagag ggctgctagc
      361 gtcactggac acaagattgc tttcccacag ctgtccttcc tccagcccct ctgctcccca
      421 tccggaaacc tgggtaccct tcacccacct agctctgtcc cgcagtgaga tttattgctg
      481 actgccctgc catctacccc agggtaataa atcagggcag agcagaattg caatcacccc
      541 atgcatggag tgtataaaag gggaagggct aagggagcca cagaacctca gtggatctca
      601 gagagagccc cagactgagg gaagcatgga tggatggaga aggatgcctc gctggggact
      661 gctgctgctg ctctggggct cctgtacctt tggtctcccg acagacacca ccacctttaa
      721 acggtaattg gtaactcagg cagagaaggg gtgggcaggg gtgtaggttc ccaccttccc
      781 aacaccctgg cttttccaca tgcggtgtca ttcagtcctt acgatc
//
LOCUS       HUMRENA2      373 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human renin gene, exon 2.
ACCESSION   M10128
KEYWORDS    renin.
SEGMENT     2 of 5
SOURCE      Human fetal liver DNA, clones lambda-[III,V].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 373)
  AUTHORS   Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
            Morris,B.J. and Shine,J.
  TITLE     Primary structure of the human renin gene
  JOURNAL   DNA 3, 457-468 (1984)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    +   105  +   255     preprorenin, exon 2 /nomgen="REN" /map="1q32"
                                 /hgml_locus_uid="LW0050B"
    matp        205  +   255     renin
    pre-msg <     1  >   373     renin mRNA
    IVS     <     1      104     renin intron A
    IVS         256  >   373     renin intron B
BASE COUNT       79 a    107 c     96 g     91 t
ORIGIN      Chromosome 1q32; about 4.8 kb after segment 1.
        1 aacgttaaag gtggttgtac taaagagagg ggtttggcct cagggactca catgtggtgg
       61 aggtacagca cttttctatt tttgcttcct ccaccctggg ccaggatctt cctcaagaga
      121 atgccctcaa tccgagaaag cctgaaggaa cgaggtgtgg acatggccag gcttggtccc
      181 gagtggagcc aacccatgaa gaggctgaca cttggcaaca ccacctcctc cgtgatcctc
      241 accaactaca tggacgtgag tgcttggctc agcccctcgc tccctccctg tctcctttcc
      301 ctcatggacc tagggctttc tttgctgcaa gactcaccct ttccaagctg tgtttgacga
      361 aggcgctgag tag
//
LOCUS       HUMRENA3     2480 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human renin gene, exons 3, 4 and 5.
ACCESSION   M10150
KEYWORDS    renin.
SEGMENT     3 of 5
SOURCE      Human fetal liver DNA, clones lambda-[III,V].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2480)
  AUTHORS   Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
            Morris,B.J. and Shine,J.
  TITLE     Primary structure of the human renin gene
  JOURNAL   DNA 3, 457-468 (1984)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    +   140      263     preprorenin, exon 3 /nomgen="REN" /map="1q32"
                                 /hgml_locus_uid="LW0050B"
                879      997     preprorenin, exon 4
               1949  +  2145     preprorenin, exon 5
    matp    +   140      263     renin
                879      997     renin
               1949  +  2145     renin
    pre-msg <     1  >  2480     renin mRNA
    IVS     <     1      139     renin intron B
    IVS         264      878     renin intron C
    IVS         998     1948     renin intron D
    IVS        2146  >  2480     renin intron E
BASE COUNT      568 a    710 c    622 g    579 t      1 others
ORIGIN      Chromosome 1q32; about 0.4 kb after segment 2.
        1 ctgcaggaaa atggaaaccc cgacaggtat aggacctcgc ctggggcaag tctacacccg
       61 agagccaaga gtgaagccag gcaagacccc aagcccaagg tcccctgagc ccctccagcc
      121 ctctcttttt accccacaga cccagtacta tggcgagatt gggatcggga ccccacccca
      181 aaccttcaaa gtcgtctttg acactggttc gtccaatgtt tgggtgccct cctccaagtg
      241 cagccgtctc tacactgcct gtggtgagac ctaagaccca cagtgcctct cctccatccc
      301 cctgccctac tgtgcatgag caatcctgcc caacacccag ctcccatccc tcttgccacc
      361 aagggagtgg cttcctctct gcctctgtgc ccactgacat gtaggggaga ggggaagatg
      421 tctcccgttt ttctgataca gccaccaagg ttaaaaacaa aaaaaggtcc aagaacccct
      481 gagnacccag gaggaccagt tcccagtcgt cctgagattg agacaggact gaattctcaa
      541 acccatccca ggcactcgga actcttccat ccctagtctt aatcaacaac ctcttactag
      601 cacttactct gtgcctggca tacttctctg gtgttatcag tggttagtga ttactttaaa
      661 ttccttcatt taggacaaaa ttctcgatgt atgggacact taggagagcc caagaaaccc
      721 agtccttgat tgatgaagca catattccaa gccccctgac cctagggcca ctcatccctg
      781 cacctaagct aaccagccat acccacaatg caccctgcct ctgagtcccc ctgtctgggc
      841 cactcttgga caaacctgag cctctgtccc cctgccagtg tatcacaagc tcttcgatgc
      901 ttcggattcc tccagctaca agcacaatgg aacagaactc accctccgct attcaacagg
      961 gacagtcagt ggctttctca gccaggacat catcaccgta agttgggccg ccctaggtca
     1021 tctgccccgg accccttctg tccccaggcc tctcctgacc caccagggcc cacacctgcg
     1081 gggaggtaca ctgcagccca cttggagcct ggggagctga ggaacaccct actctgccac
     1141 atctggtgtt gaaagcagca gtacctatgg gggagcaagc ctgggctacg ggctcaccgt
     1201 tgggtggttt gtggatgttt ttgcatctaa cttgcatgta gggctgtcct gagccccgtg
     1261 gctgcagtca agtaactcgt cccaagttca ccagctctga ctggggctac taccctagac
     1321 tgaaatcctg ggtcagagtc aggctatttt agggtcaggc atagttttaa ggtcacatta
     1381 gttgactctg ggactcaggt caaggctctc ttttcttttc catgtggccc atgtctgacc
     1441 gtttcctcat cctggagttt ctcaggccct gctccatcag agttagggga ggggcacacg
     1501 tggcacctga gaggaaatca gggtgattcc tgcctccctt cctttttctg ttgaactctg
     1561 atataaagga ggaagaaggg caagcttgtc tgtgctaaag aaacccttcg cccatgataa
     1621 gggtggggcc aagacccagt cctgccaggc acgaaagtct ggccactggg gaggggagga
     1681 gctcttggac ttttcttttg cgcttggcag gaccaccctc tcagcctctg ctctccgatc
     1741 cctggtcaac tctagctctc tctgggctcc gcagcagaga tgtgtattgg cacagagtgt
     1801 gtgcgtgcag ggttgaggca atactcttac cccgatttct gtaccctgga gcatgtgtgc
     1861 ccctgggatc cctagtgtgg atgcccagac cagactccaa ccaaggaggg gcagtgggct
     1921 tggtctccta tggtccttcc tcccacaggt gggtggaatc acggtgacac agatgtttgg
     1981 agaggtcacg gagatgcccg ccttaccctt catgctggcc gagtttgatg gggttgtggg
     2041 catgggcttc attgaacagg ccattggcag ggtcacccct atcttcgaca acatcatctc
     2101 ccaaggggtg ctaaaagagg acgtcttctc tttctactac aacaggtggg gactgggact
     2161 ccaagggctg aggtgggggg caggagggga gaagagatgg ggagtggaag gagagtctgg
     2221 gccagaattg taaagtgttt gtaacttagg tgacagccaa tcaatatcta gagctgtact
     2281 agccaatatg gaaggcacta ttgcaaattt aaacttaact taaatacagc ttaagcatca
     2341 attaagcatt caactggctg gcctcttagt tgtactagcc acagctcaat gcctggcagc
     2401 cacggtggct agtaactaca gtctagtaca gtgcagatag agatatccag catgacagga
     2461 catctataga cagcgccact
//
LOCUS       HUMRENA4     3057 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human renin gene, exons 6, 7, 8, and 9.
ACCESSION   M10151
KEYWORDS    renin.
SEGMENT     4 of 5
SOURCE      Human fetal liver DNA, clones lambda-[III,V].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3057)
  AUTHORS   Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
            Morris,B.J. and Shine,J.
  TITLE     Primary structure of the human renin gene
  JOURNAL   DNA 3, 457-468 (1984)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept    +   998     1006     preprorenin, exon 6 /nomgen="REN" /map="1q32"
                                 /hgml_locus_uid="LW0050B"
               1572     1691     preprorenin, exon 7
               2061     2202     preprorenin, exon 8
               2466  +  2564     preprorenin, exon 9
    matp    +   998     1006     renin
               1572     1691     renin
               2061     2202     renin
               2466  +  2564     renin
    pre-msg <     1  >  3057     renin mRNA
    IVS     <     1      997     renin intron E
    IVS        1007     1571     renin intron F
    IVS        1692     2060     renin intron G
    IVS        2203     2465     renin intron H
    IVS        2565  >  3057     renin intron I
BASE COUNT      750 a    799 c    825 g    683 t
ORIGIN      Chromosome 1q32; about 0.6 kb after segment 3.
        1 aaaagaatag aggaggatca gagttcagag aaatctcaca gtaaaatgga gaggagtctc
       61 cggtttggtg atagaaagtg aggccttgag aaaaggccaa ttggcggctc tgcattcagg
      121 ggtggtcttt agaagaactg ttttagagga ggtgggggca aggccagatg gcaagaagtt
      181 aagaggtgga cgacgtgggt gtcaggaagt ggaggtcatg agatgtacgc tgccctggga
      241 cattcaacag ggaagggaat ggggggtggc gtgggggggt gagatccaga agcagaagag
      301 gaagggtggg tgtttttaaa tgctagagga tgctcgagtg atcgcctgta ggtggaggaa
      361 gaacccaata gaaagaaaga gattaaaaat gtggaaagaa gaggagctaa atgggggcac
      421 tggagtttag aggccttgaa agagatgagg aaccagcaga taggaagaag ccaggtttta
      481 cagaggagag ggctggcctc ttcttttatc ttgggatggg aaggagggaa catccagaga
      541 gatactgaag tgttgagaga caggcaggag ggaatttgtg ctagcatata cacatacgag
      601 ttccgaattt ataaaaacac aagtagtttg cagttgcaca aaataacata tgcacaccta
      661 cacacccatg cacacatgtg catgtgaatt ctggaaaaac acatcacaca cacaggcatg
      721 ccctggagac taggcctaca gtagtccctg agccaagtgc agtgaggagg aaaggaaggt
      781 gaggggaatc atctccagac ggggcaccag gagcctggct ccagtccccc acttgttcac
      841 tcatggactg ggtaacttca ggcaagtgac ttcgcctctt ggtgactcca ttgcctgaag
      901 ggcaaagaga gtacataaca cccaccctgc caaacagcag ggtgatgagg ctggcatgaa
      961 atgaagcttc ctttctgctg tctctctttc tctgcagaga ttccgagtaa ggagacaaaa
     1021 cccccacatg gctgtgacct tccagtattc cccgagcacc tgacctagaa ttacacacgc
     1081 caccggccca aaactcacat cagcaagtcc cagcctccgc tagatgccga agttctctgt
     1141 ctctccttcc tgctctctcc atgccacctg cccaccccat acccaatagc ctccccaggg
     1201 tcccctccca tgcacctgct caatcagcag caacccaaga gtgaggggtg tccatttgtg
     1261 tcttgttcac atccactcac tgtccttgta cctgctcctt ttctgtgacc tctctgggga
     1321 tgctttttgg gggaacagct ggactaccct ggaacaacct ctggttggtc ttggggaggg
     1381 gaagaaaggc agagaagcag tatgttctgc atgcttccca acgacagctc cgagcctggc
     1441 tgtctgtccc acattcctct gctctagagc cctctgtcct cccctcgacc cttgtgcaac
     1501 cttccccaat tgcctgagtt gctgggtcct ggaggttatg ggtttccaag agcttctgat
     1561 ctttccttta ggaattccca atcgctggga ggacagattg tgctgggagg cagcgacccc
     1621 cagcattacg aagggaattt ccactatatc aacctcatca agactggtgt ctggcagatt
     1681 caaatgaagg ggtcagaaat cctcagaccc tccccgggct ccaaaaaatg ctgccgtcac
     1741 tggggttggg gagggcgggc gcggactgca ttaccatcct gccctctttc caaatgcagc
     1801 cacttcttaa gcacagccac catttgctct ctgcctggct ctggtccagg ctggggcaga
     1861 gagaagggag gggcctgggc cggagtggtg gaggccgaga gtaccttccc tcctctactc
     1921 actgcctcaa cagccagcca gcgtggcgct ccacccaccc acccaccact caggaaggac
     1981 atgcagcctg gcgtgcccat cagccttctg tctgtctgtc tgtctgtctg tctctctgtc
     2041 tgactgtggc gctcccccag ggtgtctgtg gggtcatcca ccttgctctg tgaagacggc
     2101 tgcctggcat tggtagacac cggtgcatcc tacatctcag gttctaccag ctccatagag
     2161 aagctcatgg aggccttggg agccaagaag aggctgtttg atgtaagaag ccaaagaggg
     2221 aaggtgctgt gggtgtgggg agcggccacc tggtatcggc tcacaaatcc cccaggcaaa
     2281 tgaggccatc tcaggccttc gcttgttcac ctcacactct ccacacatgt ggctggtcac
     2341 ccatggggcg gggcactgtc cccagccctc tccagcagag agacccaggg ccaccagcgc
     2401 aggactcctt gtctgctgag acgtcgttcc atactcaaga aggctctctt tgccccccac
     2461 cccagtatgt cgtgaagtgt aacgagggcc ctacactccc cgacatctct ttccacctgg
     2521 gaggcaaaga atacacgctc accagcgcgg actatgtatt tcaggtgagg ttcgagtcgg
     2581 ccccctcggt ggcagggaga aaggctggac agagaccctc aagagtgaca gattacaatg
     2641 cacagatcat gttagaactg tagttctcaa acttggctgt gcatgtcacc tggagagctt
     2701 tggaaaaatc caggtacctg ggccacatcc catacctatt aaatcagaac ctctagaagt
     2761 gggacctggg gttcagtttc cccagatgat tccaatgtgt ggccatgttt gggcatcact
     2821 atgcctgttc cctcatctcc attttctcat caaatactcc caataatcct atgctcctat
     2881 attcttaccc tcttttcata atcaataggc ttagagaatt tgaataactt gtctaggatc
     2941 agaagctaag gcaaactgta agctcctgaa ggaagcacgt tgcctgatgc cctgtttgcc
     3001 tgggatctag cacaggggct aaacatagga atggtgcagt ccacgatggg gcaaaat
//
LOCUS       HUMRENA5      763 bp ds-DNA             PRI       01-AUG-1990
DEFINITION  Human renin gene, exon 10.
ACCESSION   M10152
KEYWORDS    renin.
SEGMENT     5 of 5
SOURCE      Human fetal liver DNA, clone lambda-V.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 763)
  AUTHORS   Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D.,
            Morris,B.J. and Shine,J.
  TITLE     Primary structure of the human renin gene
  JOURNAL   DNA 3, 457-468 (1984)
  STANDARD  full staff_review
COMMENT     A poly-adenylation signal is located at positions 357-362.
FEATURES       from  to/span     description
    pept    +    22      183     preprorenin, exon 10 /nomgen="REN" /map="1q32"
                                 /hgml_locus_uid="LW0050B"
    matp    +    22      180     renin
    pre-msg <     1  >   183     renin mRNA
    IVS     <     1       21     renin intron I
BASE COUNT      170 a    236 c    194 g    163 t
ORIGIN      Chromosome 1q32; about 0.6 kb after segment 4.
        1 aaaactctcc ccctctgcca ggaatcctac agtagtaaaa agctgtgcac actggccatc
       61 cacgccatgg atatcccgcc acccactgga cccacctggg ccctgggggc caccttcatc
      121 cgaaagttct acacagagtt tgatcggcgt aacaaccgca ttggcttcgc cttggcccgc
      181 tgaggccctc tgccacccag gcaggccctg ccttcagccc tggcccagag ctggaacact
      241 ctctgagatg cccctctgcc tgccttatgc cctcagatgg agacattgga tgtggagctc
      301 ctgctggatg cgtgccctga cccctcacag cccttccctg ctttgaggac aaagagaata
      361 aagacttcat gttcacagcc tgttgcatct gggttcacta gggtttagaa cagagggagg
      421 ggctgcgtga tcatgtgtgg acaggaatgt gacacagaca agctacacat tagcctaggc
      481 cacaggttct tgcgtgcagg gatgatgcca tccatctgcc atcaacggga ctcaggtgga
      541 gctgttacac aacctcaggt gggaagtctg aaaagagccg gaaccaagct ccctgctatc
      601 gactcaggga ccaaggcgta atgctgtggc gagtagactg gggtcagaaa gttgtcccag
      661 ctcacagaag ccagctctga gttcagactc tgctctgctg agctagtcag ccctgtctct
      721 tgtccctgca aaactcccct cacctgtcct tatccacctg cag
//
LOCUS       SYNT1RNAA     324 bp ds-DNA             SYN       01-AUG-1990
DEFINITION  Synthetic ribonuclease T1 gene, 3' end.
ACCESSION   M37098 M35733 M35736
KEYWORDS    ribonuclease T1.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Ikehara,M., Ohtsuka,E., Uesugi,S., Kikyodani,T., Aoyama,Y.,
            Tokunaga,T. and Fujimoto,K.
  TITLE     Synthesis and expression of RNase T1 gene
  JOURNAL   Nucleic Acids Symp Ser 15, 197-200 (1984)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 324)
  AUTHORS   Nishikawa,S., Morioka,H., Tokunaga,T., Aoyama,Y., Kikyotani,S.,
            Fujimoto,K., Yanase,K., Tanaka,T., Uesugi,S., Ohtsuka,E. and
            Ikehara,M.
  TITLE     Synthesis and expression of the native RNase T1 gene and several
            mutant genes
  JOURNAL   Nucleic Acids Symp Ser 16, 287-290 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      321     ribonuclease T1 precursor (AA at 1)
    sigp    <     1        6     ribonuclease T1 signal peptide
    matp          7      318     ribonuclease T1
BASE COUNT       73 a     98 c     75 g     78 t
ORIGIN      
        1 ttcatggctt gcgactacac ctgcggcagc aactgctact ctagctctga cgtttctacc
       61 gctcaggctg ctggctacca gctgcacgag gacggcgaaa ccgttggctc taactcttac
      121 ccgcacaaat acaacaacta tgagggcttc gactttagcg tttcttctcc gtactacgaa
      181 tggccgatcc tgtctagcgg cgacgtttac tccggtccag gtagcggtgc tgaccgtgta
      241 gtattcaacg aaaacaacca gctcgctggc gttatcaccc acaccggcgc ttctggcaac
      301 aactttgtag aatgcaccta atag
//
LOCUS       TIPCDREG      209 bp ds-DNA             BCT       01-AUG-1990
DEFINITION  Plasmid pTiC58 promoter-active fragment CD25 DNA.
ACCESSION   M35735
KEYWORDS    .
SOURCE      Plasmid pTiC58 DNA.
  ORGANISM  Plasmid pTiC58
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 209)
  AUTHORS   Tait,R.C. and Kado,C.I.
  TITLE     Regulation of the virC and virD promoters of pTiC58 by the ros
            chromosomal mutation of Agrobacterium tumefaciens
  JOURNAL   Mol. Microbiol. 2, 385-392 (1988)
  STANDARD  simple staff_review
BASE COUNT       60 a     37 c     41 g     71 t
ORIGIN      
        1 gtcgacccgg gatccgcggc gataattcat aagtaatgta gtaattacct gattttatat
       61 ttcaatttta ttgtaatata atttcaattg taataatata aaaataaata tcccttatgt
      121 gttcttgatt tcgttttgta tatggctaga ttcccatctg ccacgacgag gaaatgctac
      181 ggcggggcaa gttcagatcc cgggtcgac
//
LOCUS       FIBGLUC      1426 bp ds-DNA             BCT       01-AUG-1990
DEFINITION  F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene,
            complete cds.
ACCESSION   M33676 M33311
KEYWORDS    1,3-1,4-beta-D-glucan 4-glucanohydrolase;
            mised-linkage beta-glucanase.
SOURCE      F.succinogenes (strain S85) DNA, clone PJI5.
  ORGANISM  Fibrobacter succinogenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Sulfate- or sulfur-reducing dissimilatory bacteria.
REFERENCE   1  (bases 1 to 1426)
  AUTHORS   Teather,R.M. and Erfle,J.D.
  TITLE     DNA sequence of a Fibrobacter succinogenes mixed-linkage
            beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene
  JOURNAL   J. Bacteriol. 172, 3837-3841 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.M.Teather, 11-APR-1990.
FEATURES       from  to/span     description
    pept        145     1194     1,3-1,4-beta-D-glucan 4-glucanohydrolase
                                 precursor (EC 3.2.1.73)
    sigp        145      225     1,3-1,4-beta-D-glucan 4-glucanohydrolase
                                 signal peptide
    matp        226     1191     1,3-1,4-beta-D-glucan 4-glucanohydrolase
    binding     132      137     ribosome binding site
    signal       62       66     -35 region
    signal       85       90     -10 region
BASE COUNT      371 a    346 c    335 g    374 t
ORIGIN      
        1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt
       61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt
      121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc
      181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt
      241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg
      301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc
      361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc
      421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct
      481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac
      541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac
      601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt
      661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat
      721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat
      781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt
      841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc
      901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa
      961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct
     1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac
     1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt
     1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg
     1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt
     1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa
     1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt
     1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa
//
LOCUS       BOVRS157A     824 bp ss-mRNA            MAM       01-AUG-1990
DEFINITION  Bovine retina-specific 15.7 kDa protein mRNA, complete cds.
ACCESSION   M34915
KEYWORDS    .
SOURCE      Bovine retina, cDNA to mRNA, clone pCR18.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 824)
  AUTHORS   Nakagawa,Y., Kuo,C.-H., Ishii,K., Shiosaka,S., Tohyama,M. and
            Miki,N.
  TITLE     Cloning and characterization of a cDNA specific for bovine retina
  JOURNAL   Neurosci. Res. 3, 300-310 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        138      581     retina-specific 15.7 kDa protein
    mRNA    <     1      824     retina-specific mRNA
    signal      800      805     polyA signal
BASE COUNT      178 a    219 c    224 g    203 t
ORIGIN      
        1 tttagcctca gccgtgaccg gccccgtccc gcggcgccgg gagttcgtgt gaacgggtag
       61 gtgtaccgac ttcgcccgtc cgtgaatccc gtggtcgcaa aggcccgcgc ggcgggccgg
      121 gttctgccga taccttaatg ggctgtgcgc gaggagagcc tcaattgcaa gttggtcgag
      181 gagatcgcca cgctggtgca gagctggcct cactagttgc ggctagtgta ggacgttgta
      241 ctccgacatt ccgcaagccc ttccacacgg acagtcctag catccagggt cagtggcacc
      301 ccttcaccaa caaaccgaca gcactggggt gctcctcgag aggtccagaa tcctgccccg
      361 acccagcggc cagcacaatg aagaccaact ccatacccac agtttggact tttactccag
      421 cagagggtgg ttcctgctcc tggtttgctt cacgggagac agatgaagcc accaatgggg
      481 tacttcttgc ttgggataaa gaagagctgc ctgtctcttt tgatgtccac cgtgaggcag
      541 ggactgtgag tctcctcatt cttagccagt tgacatcctg aaaccctgag aatcttcaga
      601 gatttgactt ggtcttcatt tcttaaatcc aaatcaataa tagtgatctc aaatcaagtg
      661 agggctttca aggctggctt ctgaagaatt ccttttggcc tgtttctgta gccagtgacc
      721 aagagagtct gctgtgagct ggcattgggc taggccttgt atctatgtga tgtttgtgtg
      781 cagttagaaa actgaagtta ataaatttgc caaggtcaca cttg
//
LOCUS       CHKFRA2A1     360 bp ds-DNA             VRT       01-AUG-1990
DEFINITION  Chicken fra-2 oncogene gene, exon 1.
ACCESSION   D90104
KEYWORDS    fos-related gene; fra-2 gene; oncogene.
SEGMENT     1 of 4
SOURCE      Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4].
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Nishina,H., Sato,H., Suzuki,T. and Iba,H.
  TITLE     Isolation and characterization of fra-2, and additional member of
            the fos gene family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        241  +   339     fra-2 protein, exon 1
    pre-msg <     1  >   360     fra-2 mRNA and introns
    IVS         340  >   360     fra-2 intron A
BASE COUNT       39 a    123 c    113 g     85 t
ORIGIN      
        1 tgtttttttg gttgtttttt ttttttgtcg gctttccgct ttttcttttt ttcttttttt
       61 tccctttttc tatttttccc ccccttcttc ttctcccgct gcggactctc ccccggctgc
      121 gggaggcgcg aggcagagcc cgagaggtcg gcacggagca gggggcgggg agacggcgag
      181 ggagcggcgg ccgcggcgcg ggaaggcggg gacgcggctc ccccgggccg gcctcggacc
      241 atgtaccagg actatcccgg gagcttcgac acctcctcca gaggcagcag cggctccccg
      301 ggacaccccg agccctactc cgccggcgca gcccagcagg tagggccgcc tccgccccgt
//
LOCUS       CHKFRA2A2     297 bp ds-DNA             VRT       01-AUG-1990
DEFINITION  Chicken fra-2 oncogene gene, exon 2.
ACCESSION   D90105
KEYWORDS    fos-related gene; fra-2 gene; oncogene.
SEGMENT     2 of 4
SOURCE      Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4].
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 297)
  AUTHORS   Nishina,H., Sato,H., Suzuki,T. and Iba,H.
  TITLE     Isolation and characterization of fra-2, and additional member of
            the fos gene family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    22  +   276     fra-2 protein, exon 2
    pre-msg <     1  >   297     fra-2 mRNA and introns
    IVS     <     1       21     fra-2 intron A
    IVS         277  >   297     fra-2 intron B
BASE COUNT       68 a    103 c     74 g     52 t
ORIGIN      About 5 kbp after segment 1.
        1 ctcccccacc tttcctccta gaaattccga gtagatatgc caggatcagg cagtgctttt
       61 attcccacga tcaacgccat cacaaccagc caagacctgc agtggatggt gcagcccacc
      121 gtcatcacct ccatgtccag cccgtactct cgctcgcacc cctacagcca cccactgccg
      181 ccgctgtcct cggtggctgg acacacggcc cttcagcgac cgggcgtgat caaaaccatc
      241 ggcaccacag tgggacggag acgaagggat gagcaggtaa ctgtgtgagc aggagga
//
LOCUS       CHKFRA2A3     149 bp ds-DNA             VRT       01-AUG-1990
DEFINITION  Chicken fra-2 oncogene gene, exon 3.
ACCESSION   D90106
KEYWORDS    fos-related gene; fra-2 gene; oncogene.
SEGMENT     3 of 4
SOURCE      Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4].
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 149)
  AUTHORS   Nishina,H., Sato,H., Suzuki,T. and Iba,H.
  TITLE     Isolation and characterization of fra-2, and additional member of
            the fos gene family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    22  +   129     fra-2 protein, exon 3
    pre-msg <     1  >   149     fra-2 mRNA and introns
    IVS     <     1       21     fra-2 intron B
    IVS         130  >   149     fra-2 intron C
BASE COUNT       42 a     34 c     46 g     27 t
ORIGIN      About 4 kbp after segment 2.
        1 tttcttggca cttgcccata gctgtcgcct gaggaagaag agaagcgaag gatccggaga
       61 gagaggaaca agctggcagc tgctaaatgt cgtaacaggc gccgagagct aacagagaaa
      121 ctccaggcgg tacgtgctct gcatgcatt
//
LOCUS       CHKFRA2A4     744 bp ds-DNA             VRT       01-AUG-1990
DEFINITION  Chicken fra-2 oncogene gene, exon 4.
ACCESSION   D90107
KEYWORDS    fos-related gene; fra-2 gene; oncogene.
SEGMENT     4 of 4
SOURCE      Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4].
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 744)
  AUTHORS   Nishina,H., Sato,H., Suzuki,T. and Iba,H.
  TITLE     Isolation and characterization of fra-2, and additional member of
            the fos gene family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +    22      531     fra-2 protein, exon 4
    pre-msg <     1  >   744     fra-2 mRNA and introns
    IVS     <     1       21     fra-2 intron C
BASE COUNT      180 a    206 c    209 g    149 t
ORIGIN      About 2 kbp after segment 3.
        1 ttattccctt tttgtctgca ggaaactgag gtgctggagg aggaaaagtc agtgcttcaa
       61 aaagagattg ctgagctcca gaaggagaag gagaaactag agttcatgct ggttgctcac
      121 agccctgtgt gtaaaatcag ccctgaggaa cgtcggagcc caccaaccag cagcctccag
      181 agcgttcgga ctggagcgag cggagcagtg gtggtgaagc aggagcctgt ggaggaagag
      241 atcccatctt cctctttggt ccttgacaaa gctcagaggt ctgtcattaa gcccatcagc
      301 attgctggag gttattatgg ggaggaggca ctcaacactc ccatcgtggt gacctcgaca
      361 ccagccatca ctcctggttc ctccaacttg gtgttcacct accccaatgt cttggatcag
      421 gagtctcctc tctccccgtc cgagtcctgc tccaaagctc accggaggag cagcagcagc
      481 ggcgaccagt cctcggattc cttgaactct cccaccttgc tggcattgta atcccctgag
      541 gcccccccat tgccagtgtg ttacatcccc cgcccggctc catggggaga cccctccatg
      601 ggattagaga caggcacagg atcgttcaag cacaagggca gcaagaacaa gaatggggaa
      661 atgctgcagc tccaggaaag agagtgagga ccaatgccag ctccctggag gcaggaaatg
      721 gcaagggtgg gactgatgca ccag
//
LOCUS       ECOTGP       7335 bp ds-DNA             BCT       01-AUG-1990
DEFINITION  E.coli tryptophan operon: entire DNA sequence.
ACCESSION   J01714 M12471 M12472 M25593
KEYWORDS    anthranilate isomerase; anthranilate synthetase; attenuator;
            glutamine amidotransferase; isomerase; leader peptide;
            phosphoribosyl anthranilate synthetase; synthetase; transferase;
            trp operon; trpA gene; trpB gene; trpC gene; trpD gene; trpE gene;
            tryptophan synthetase.
SOURCE      Escherichia coli RNA and DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 5917 to 6133)
  AUTHORS   Platt,T. and Yanofsky,C.
  TITLE     An intercistronic region and ribosome-binding site in bacterial
            messenger RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 72, 2399-2403 (1975)
  STANDARD  full staff_review
REFERENCE   2  (bases 84 to 141)
  AUTHORS   Bennett,G.N., Schweingruber,M.E., Brown,K.D., Squires,C. and
            Yanofsky,C.
  TITLE     Nucleotide sequence of region preceding trp mRNA initiation site
            and its role in promoter and operator function
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 73, 2351-2355 (1976)
  STANDARD  full staff_review
REFERENCE   3  (bases 117 to 310)
  AUTHORS   Squires,C., Lee,F., Bertrand,K., Squires,C.L., Bronson,M.J. and
            Yanofsky,C.
  TITLE     Nucleotide sequence of the 5' end of tryptophan messenger RNA of
            Escherichia coli
  JOURNAL   J. Mol. Biol. 103, 351-381 (1976)
  STANDARD  full staff_review
REFERENCE   4  (bases 230 to 272)
  AUTHORS   Bertrand,K., Korn,L.J., Lee,F. and Yanofsky,C.
  TITLE     The attenuator of the tryptophan operon of Escherichia coli:
            heterogeneous 3'-OH termini in vivo and deletion mapping of
            functions
  JOURNAL   J. Mol. Biol. 117, 227-247 (1977)
  STANDARD  full staff_review
REFERENCE   5  (bases 230 to 272)
  AUTHORS   Stauffer,G.V., Zurawski,G. and Yanofsky,C.
  TITLE     Single base-pair alterations in the Escherichia coli trp operon
            leader region that relieve transcription termination at the trp
            attenuator
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75, 4833-4837 (1978)
  STANDARD  full staff_review
REFERENCE   6  (bases 6707 to 6863)
  AUTHORS   Wu,A.M. and Platt,T.
  TITLE     Transcription termination: nucleotide sequence at 3' end of
            tryptophan operon in Escherichia coli
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75, 5442-5446 (1978)
  STANDARD  full staff_review
REFERENCE   7  (bases 1 to 140)
  AUTHORS   Bennett,G.N., Schweingruber,M.E., Brown,K.D., Squires,C. and
            Yanofsky,C.
  TITLE     Nucleotide sequence of the promoter-operator region of the
            tryptophan operon of Escherichia coli
  JOURNAL   J. Mol. Biol. 121, 113-137 (1978)
  STANDARD  full staff_review
REFERENCE   8  (bases 2351 to 2503)
  AUTHORS   Miozzari,G.F. and Yanofsky,C.
  TITLE     Gene fusion during the evolution of the tryptophan operon in
            enterobacteriaceae
  JOURNAL   Nature 277, 486-489 (1979)
  STANDARD  full staff_review
REFERENCE   9  (bases 5932 to 6809)
  AUTHORS   Nichols,B.P. and Yanofsky,C.
  TITLE     Nucleotide sequences of trpA of Salmonella typhimurium
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 76, 5244-5248 (1979)
  STANDARD  full staff_review
REFERENCE   10 (bases 117 to 256)
  AUTHORS   Oxender,D.L., Zurawski,G. and Yanofsky,C.
  TITLE     Attenuation in the Escherichia coli tryptophan operon: role of RNA
            secondary structure involving the tryptophan codon region
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 76, 5524-5528 (1979)
  STANDARD  full staff_review
REFERENCE   11 (bases 3422 to 4824)
  AUTHORS   Christie,G.E. and Platt,T.
  TITLE     Gene structure in the tryptophan operon of Escherichia coli:
            nucleotide sequence of trpC and the flanking intercistronic regions
  JOURNAL   J. Mol. Biol. 142, 519-530 (1980)
  STANDARD  full staff_review
REFERENCE   12 (bases 230 to 296)
  AUTHORS   Farnham,P.J. and Platt,T.
  TITLE     A model for transcription termination suggested by studies on the
            trp attenuator in vitro using base analogs
  JOURNAL   Cell 20, 739-748 (1980)
  STANDARD  full staff_review
REFERENCE   13 (bases 4810 to 6003)
  AUTHORS   Crawford,I.P., Nichols,B.P. and Yanofsky,C.
  TITLE     Nucleotide sequence of the trpB gene in Escherichia coli and
            Salmonella typhimurium
  JOURNAL   J. Mol. Biol. 142, 489-502 (1980)
  STANDARD  full staff_review
REFERENCE   14 (bases 1761 to 2443)
  AUTHORS   Nichols,B.P., Miozzari,G.F., van Cleemput,M., Bennett,G.N. and
            Yanofsky,C.
  TITLE     Nucleotide sequences of the trpG regions of Escherichia coli,
            Shigella dysenteriae, Salmonella typhimurium and Serratia
            marcescens
  JOURNAL   J. Mol. Biol. 142, 503-517 (1980)
  STANDARD  full staff_review
REFERENCE   15 (bases 6707 to 7335)
  AUTHORS   Wu,A.M., Chapman,A.B., Platt,T., Guarente,L.P. and Beckwith,J.
  TITLE     Deletions of distal sequence affect termination of transcription at
            the end of the tryptophan operon in E. coli
  JOURNAL   Cell 19, 829-836 (1980)
  STANDARD  full staff_review
REFERENCE   16 (bases 279 to 1843)
  AUTHORS   Nichols,B.P., van Cleemput,M. and Yanofsky,C.
  TITLE     Nucleotide sequence of Escherichia coli trpE: anthranilate
            synthetase component I contains no tryptophan residues
  JOURNAL   J. Mol. Biol. 146, 45-54 (1981)
  STANDARD  full staff_review
REFERENCE   17 (bases 5932 to 6809)
  AUTHORS   Schneider,W.P., Nichols,B.P. and Yanofsky,C.
  TITLE     Procedure for production of hybrid genes and proteins and its use
            in assessing significance of amino acid differences in homologous
            tryptophan synthetase alpha polypeptides
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 2169-2173 (1981)
  STANDARD  full staff_review
REFERENCE   18 (bases 6807 to 6856; 7057 to 7119)
  AUTHORS   Wu,A.M., Christie,G.E. and Platt,T.
  TITLE     Tandem termination sites in the tryptophan operon of Escherichia
            coli
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 2913-2917 (1981)
  STANDARD  full staff_review
REFERENCE   19 (review; bases 77 to 6809; compiled)
  AUTHORS   Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E.,
            Horowitz,H., van Cleemput,M. and Wu,A.M.
  TITLE     The complete nucleotide sequence of the tryptophan operon of
            Escherichia coli
  JOURNAL   Nucleic Acids Res. 9, 6647-6668 (1981)
  STANDARD  full staff_review
REFERENCE   20 (bases 2504 to 3436)
  AUTHORS   Horowitz,H., Christie,G.E. and Platt,T.
  TITLE     Nucleotide sequence of the trpD gene, encoding anthranilate
            synthetase component II of Escherichia coli
  JOURNAL   J. Mol. Biol. 156, 245-256 (1982)
  STANDARD  full staff_review
REFERENCE   21 (bases 57 to 137)
  AUTHORS   Windass,J.D., Newton,C.R., De Maeyer-Guignard,J., Moore,V.E.,
            Markham,A.F. and Edge,M.D.
  TITLE     The construction of a synthetic Escherichia coli trp promoter and
            its use in the expression of a synthetic interferon gene
  JOURNAL   Nucleic Acids Res. 10, 6639-6657 (1982)
  STANDARD  full staff_review
REFERENCE   22 (sites; mutational analysis of the regulatory region)
  AUTHORS   Kolter,R. and Yanofsky,C.
  TITLE     Genetic analysis of the tryptophan operon regulatory region using
            site-directed mutagenesis
  JOURNAL   J. Mol. Biol. 175, 299-312 (1984)
  STANDARD  full staff_entry
REFERENCE   23 (bases 36 to 136)
  AUTHORS   Brown,K.D., Bennet,G.N., Lee,F., Schweingruber,M.E. and Yanofsky,C.
  TITLE     RNA polymerase interaction at the promoter-operator region of the
            tryptophan operon of Escherichia coli and Salmonella typhimurium
  JOURNAL   J. Mol. Biol. 121, 153-177 (1978)
  STANDARD  simple staff_entry
COMMENT        The tryptophan operon of E.coli consists of a repressor(trpR), a
            promoter(trpP), an operator(trpO), an attenuator which is part of a
            leader peptide region(trpL) and five structural genes:
            trpE(anthranilate synthetase), trpD(glutamine amido transferase and
            anthranilate 5-phosphoribosylpyrophosphate phosphoribosyl-
            transferase), trpC(phosphoribosyl anthranilate isomerase-indole
            glycerol phosphate synthetase), trpB(tryptophan synthetase beta)
            and trpA(tryptophan synthetase alpha).
            
               The promoter region covers approximately 40 bases upstream from
            the mRNA initiation site(75-116); the operator approximately 20
            bases upstream with two-fold axes of symmetry around 104-105 and
            109-110([2],[7],[20]). The attenuator region is the first 140
            nucleotides(117-256) of the mRNA leader, a G-C rich region with a
            two-fold axis of symmetry around base 240 and an A-T rich region
            with its axis about bases 259-260; it provides a second site for
            control of transcription ([4],[5],[10],[12]). Two mRNA termination
            regions are reported: trpT (bases 6807-6856) and trpT' (bases
            7057-7119), the first of which bears some similarity to the
            attenuator region ([18]). A chi site for recombination is localized
            between bases 2492 and 2501 and the trp-P2 promoter is located
            between bases 3240 and 3280 ([20]).
            
               The trpE gene is unusual in that it codes for no tryptophan
            residues([16]). The two enzymatic functions coded by trpG and trpD
            genes in S.marcescens are coded by the single trpD gene in E.coli
            and other enterobacteriaceae. This appears to have occurred via
            base changes at sites 2420 and 2438. The intercistronic regions for
            the structural genes show little superfluity: the trpE-trpD and
            trpB-trpA boundaries consist of 'tgatg'; the trpD-trpC boundary is
            'taaatgatg' and the trpC-trpB boundary is 'taaggaaaggaacaatg'. All
            the cistrons show a high degree of homology with their correlates
            among the enterobacteriaceae. Sequence discrepancies in early
            work([3]) are corrected in later work from the same
            laboratory([10],[19]). [17] also sequenced S.typhimurium trpA
            region. [19] compiles sequences from
            [7],[8],[9],[11],[13],[14],[16],[20].
FEATURES       from  to/span     description
    pept        143      187     trp operon leader peptide (putative)
    pept        279     1841     anthranilate synthetase component I
                                 /nomgen="trpE"
    pept       1841     3436     anthranilate synthetase component II:
                                 glutamine amidotransferase and phosphoribosyl
                                 anthranilate synthetase /nomgen="trpD"
    pept       3440     4798     anthranilate isomerase /nomgen="trpC"
    pept       4810     6003     tryptophan synthetase beta subunit
                                 /nomgen="trpB"
    pept       6003     6809     tryptophan synthetase alpha subunit
                                 /nomgen="trpA"
    mRNA        117      257     trp mRNA (alt.) [2],[3],[7],[10],[21]
    mRNA        117     6842     trp mRNA (alt.) [2],[3],[6],[7],[10],[18],[21]
                                 used
    revision   1787     1787     c in [16]; t in [14]
    revision   1793     1793     t in [16]; c in [14]
    conflict   3526     3530     gg in [19]; gaatg in [11]
    conflict   4289     4293     gc in [19]; gttgc in [11]
    conflict   5949     5949     c in [1]; a in [17]
BASE COUNT     1740 a   1926 c   1960 g   1705 t      4 others
ORIGIN      9 bp upstream from HhaI site [7].
        1 ctcaaggcgc actcccgttc tggataatgt tttttgcgcc gacatcataa cggttctggc
       61 aaatattctg aaatgagctg ttgacaatta atcatcgaac tagttaacta gtacgcaagt
      121 tcacgtaaaa agggtatcga caatgaaagc aattttcgta ctgaaaggtt ggtggcgcac
      181 ttcctgaaac gggcagtgta ttcaccatgc gtaaagcaat cagataccca gcccgcctaa
      241 tgagcgggct tttttttgaa caaaattaga gaataacaat gcaaacacaa aaaccgactc
      301 tcgaactgct aacctgcgaa ggcgcttatc gcgacaatcc caccgcgctt tttcaccagt
      361 tgtgtgggga tcgtccggca acgctgctgc tggaatccgc agatatcgac agcaaagatg
      421 atttaaaaag cctgctgctg gtagacagtg cgctgcgcat tacagcttta ggtgacactg
      481 tcacaatcca ggcactttcc ggcaacggcg aagccctcct ggcactactg gataacgccc
      541 tgcctgcggg tgtggaaagt gaacaatcac caaactgccg tgtgctgcgc ttcccccctg
      601 tcagtccact gctggatgaa gacgcccgct tatgctccct ttcggttttt gacgctttcc
      661 gtttattgca gaatctgttg aatgtaccga aggaagaacg agaagccatg ttcttcagcg
      721 gcctgttctc ttatgacctt gtggcgggat ttgaagattt accgcaactg tcagcggaaa
      781 ataactgccc tgatttctgt ttttatctcg ctgaaacgct gatggtgatt gaccatcaga
      841 aaaaaagcac ccgtattcag gccagcctgt ttgctccgaa tgaagaagaa aaacaacgtc
      901 tcactgctcg cctgaacgaa ctacgtcagc aactgaccga agccgcgccg ccgctgccag
      961 tggtttccgt gccgcatatg cgttgtgaat gtaatcagag cgatgaagag ttcggtggcg
     1021 tagtgcgttt gttgcaaaaa gcgattcgcg ctggagaaat tttccaggtg gtgccatctc
     1081 gccgtttctc tctgccctgc ccgtcaccgc tggcggccta ttacgtgctg aaaaagagta
     1141 atcccagccc gtacatgttt tttatgcagg ataatgattt caccctattt ggcgcgtcgc
     1201 cggaaagctc gctcaagtat gatgccacca gccgccagat tgagatctac ccgattgccg
     1261 gaacacgccc acgcggtcgt cgcgccgatg gttcactgga cagagatctc gacagccgta
     1321 ttgaactgga aatgcgtacc gatcataaag agctgtctga acatctgatg ctggttgatc
     1381 tcgcccgtaa tgatctggca cgcatttgca cccccggcag ccgctacgtc gccgatctca
     1441 ccaaagttga ccgttattcc tatgtgatgc acctcgtctc tcgcgtagtc ggcgaactgc
     1501 gtcacgatct tgacgccctg cacgcttatc gcgcctgtat gaatatgggg acgttaagcg
     1561 gtgcgccgaa agtacgcgct atgcagttaa ttgccgaggc ggaaggtcgt cgccgcggca
     1621 gctacggcgg cgcggtaggt tatttcaccg cgcatggcga tctcgacacc tgcattgtga
     1681 tccgctcggc gctggtggaa aacggtatcg ccaccgtgca agcgggtgct ggtgtagtcc
     1741 ttgattctgt tccgcagtcg gaagccgacg aaacccgtaa caaagcccgc gctgtactgc
     1801 gcgctattgc caccgcgcat catgcacagg agactttctg atggctgaca ttctgctgct
     1861 cgataatatc gactctttta cgtacaacct ggcagatcag ttgcgcagca atgggcataa
     1921 cgtggtgatt taccgcaacc atataccggc gcaaacctta attgaacgct tggcgaccat
     1981 gagtaatccg gtgctgatgc tttctcctgg ccccggtgtg ccgagcgaag ccggttgtat
     2041 gccggaactc ctcacccgct tgcgtggcaa gctgcccatt attggcattt gcctcggaca
     2101 tcaggcgatt gtcgaagctt acgggggcta tgtcggtcag gcgggcgaaa ttctccacgg
     2161 taaagcctcc agcattgaac atgacggtca ggcgatgttt gccggattaa caaacccgct
     2221 gccggtggcg cgttatcact cgctggttgg cagtaacatt ccggccggtt taaccatcaa
     2281 cgcccatttt aatggcatgg tgatggcagt acgtcacgat gcggatcgcg tttgtggatt
     2341 ccagttccat ccggaatcca ttctcaccac ccagggcgct cgcctgctgg aacaaacgct
     2401 ggcctgggcg cagcataaac tagagccagc caacacgctg caaccgattc tggaaaaact
     2461 gtatcaggcg cagacgctta gccaacaaga aagccaccag ctgttttcag cggtggtgcg
     2521 tggcgagctg aagccggaac aactggcggc ggcgctggtg agcatgaaaa ttcgcggtga
     2581 gcacccgaac gagatcgccg gggcagcaac cgcgctactg gaaaacgcag cgccgttccc
     2641 gcgcccggat tatctgtttg ctgatatcgt cggtactggc ggtgacggca gcaacagtat
     2701 caatatttct accgccagtg cgtttgtcgc cgcggcctgt gggctgaaag tggcgaaaca
     2761 cggcaaccgt agcgtctcca gtaaatctgg ttcgtccgat ctgctggcgg cgttcggtat
     2821 taatcttgat atgaacgccg ataaatcgcg ccaggcgctg gatgagttag gtgtatgttt
     2881 cctctttgcg ccgaagtatc acaccggatt ccgccacgcg atgccggttc gccagcaact
     2941 gaaaacccgc accctgttca atgtgctggg gccattgatt aacccggcgc atccgccgct
     3001 ggcgttaatt ggtgtttata gtccggaact ggtgctgccg attgccgaaa ccttgcgcgt
     3061 gctggggtat caacgcgcgg cggtggtgca cagcggcggg atggatgaag tttcattaca
     3121 cgcgccgaca atcgttgccg aactgcatga cggcgaaatt aaaagctatc agctcaccgc
     3181 agaagacttt ggcctgacac cctaccacca ggagcaactg gcaggcggaa caccggaaga
     3241 aaaccgtgac attttaacac gtttgttaca aggtaaaggc gacgccgccc atgaagcagc
     3301 cgtcgctgcg aacgtcgcca tgttaatgcg cctgcatggc catgaagatc tgcaagccaa
     3361 tgcgcaaacc gttcttgagg tactgcgcag tggttccgct tacgacagag tcaccgcact
     3421 ggcggcacga gggtaaatga tgcaaaccgt tttagcgaaa atcgtcgcag acaaggcgat
     3481 ttgggtagaa gcccgcaaac agcagcaacc gctggccagt tttcagaatg aggttcagcc
     3541 gagcacgcga catttttatg atgcgctaca gggtgcgcgc acggcgttta ttctggagtg
     3601 caagaaagcg tcgccgtcaa aaggcgtgat ccgtgatgat ttcgatccag cacgcattgc
     3661 cgccatttat aaacattacg cttcggcaat ttcggtgctg actgatgaga aatatttcag
     3721 gggtagcttt aatttcctcc ccatcgtcag ccaaatcgcc ccgcagccga ttttatgtaa
     3781 agacttcatt atcgaccctt accagatcta tctggcgcgc tattaccagg ccgatgcctg
     3841 cttattaatg ctttcagtac tggatgacga ccaatatcgc cagcttgccg ccgtcgctca
     3901 cagtctggag atgggggtgc tgaccgaagt cagtaatgaa gaggaacagg agcgcgccat
     3961 tgcattggga gcaaaggtcg ttggcatcaa caaccgcgat ctgcgtgatt tgtcgattga
     4021 tctcaaccgt acccgcgagc ttgcgccgaa actggggcac aacgtgacgg taatcagcga
     4081 atccggcatc aatacttacg ctcaggtgcg cgagttaagc cacttcgcta acggttttct
     4141 gattggttcg gcgttgatgg cccatgacga tttgcacgcc gccgtgcgcc gggtgttgct
     4201 gggtgagaat aaagtatgtg gcctgacgcg tgggcaagat gctaaagcag cttatgacgc
     4261 gggcgcgatt tacggtgggt tgatttttgt tgcgacatca ccgcgttgcg tcaacgttga
     4321 acaggcgcag gaagtgatgg ctgcggcacc gttgcagtat gttggcgtgt tccgcaatca
     4381 cgatattgcc gatgtggtgg acaaagctaa ggtgttatcg ctggtggcag tgcaactgca
     4441 tggtaatgaa gaacagctgt atatcgatac gctgcgtgaa gctctgccag cacatgttgc
     4501 catctggaaa gcattaagcg tcggtgaaac cctgcccgcc cgcgagtttc agcacgttga
     4561 taaatatgtt ttagacaacg gccagggtgg aagcgggcaa cgttttgact ggtcactatt
     4621 aaatggtcaa acgcttggca acgttctgct ggcggggggc ttaggcgcag ataactgcgt
     4681 ggaagcggca caaaccggct gcgccggact tgattttaat tctgctgtag agtcgcaacc
     4741 gggcatcaaa gacgcacgtc ttttggcctc ggttttccag acgctgcgcg catattaagg
     4801 aaaggaacaa tgacaacatt acttaacccc tattttggtg agtttggcgg catgtacgtg
     4861 ccacaaatcc tgatgcctgc tctgcgccag ctggaagaag cttttgtcag tgcgcaaaaa
     4921 gatcctgaat ttcaggctca gttcaacgac ctgctgaaaa actatgccgg gcgtccaacc
     4981 gcgctgacca aatgccagaa cattacagcc gggacgaaca ccacgctgta tctcaagcgt
     5041 gaagatttgc tgcacggcgg cgcgcataaa actaaccagg tgctggggca ggcgttgctg
     5101 gcgaagcgga tgggtaaaac cgaaatcatc gccgaaaccg gtgccggtca gcatggcgtg
     5161 gcgtcggccc tggccagcgc cctgctcggc ctgaaatgcc gtatttatat gggtgccaaa
     5221 gacgttgaac gccagtcgcc taacgttttt cgtatgcgct taatgggtgc ggaagtgatc
     5281 ccggtgcata gcggttccgc gacgctgaaa gatgcctgta acgaggcgct gcgcgactgg
     5341 tccggtagtt acgaaaccgc gcactatatg ctgggcaccg cagctggccc gcatccttat
     5401 ccgaccattg tgcgtgagtt tcagcggatg attggcgaag aaaccaaagc gcagattctg
     5461 gaaagagaag gtcgcctgcc ggatgccgtt atcgcctgtg ttggcggcgg ttcgaatgcc
     5521 atcggcatgt ttgctgattt catcaatgaa accaacgtcg gcctgattgg tgtggagcca
     5581 ggtggtcacg gtatcgaaac tggcgagcac ggcgcaccgc taaaacatgg tcgcgtgggt
     5641 atctatttcg gtatgaaagc gccgatgatg caaaccgaag acgggcagat tgaagaatct
     5701 tactccatct ccgccggact ggatttcccg tctgtcggcc cacaacacgc gtatcttaac
     5761 agcactggac gcgctgatta cgtgtctatt accgatgatg aagcccttga agccttcaaa
     5821 acgctgtgcc tgcacgaagg gatcatcccg gcgctggaat cctcccacgc cttggcccat
     5881 gcgttgaaaa tgatgcgcga aaacccggat aaagagcagc tactggtggt taacctttcc
     5941 ggtcgcggcg ataaagacat cttcaccgtt cacgatattt tgaaagcacg aggggaaatc
     6001 tgatggaacg ctacgaatct ctgtttgccc agttgaagga gcgcaaagaa ggcgcattcg
     6061 ttcctttcgt cacgctcggt gatccgggca ttgagcagtc attgaaaatt atcgatacgc
     6121 taattgaagc cggtgctgac gcgctggagt taggtatccc cttctccgac ccactggcgg
     6181 atggcccgac gattcaaaac gccactctgc gcgcctttgc ggcaggtgtg actccggcac
     6241 aatgttttga aatgctggca ctgattcgcc agaaacaccc gaccattccc attggcctgt
     6301 tgatgtatgc caatctggtg tttaacaaag gcattgatga gttttatgcc cagtgcgaaa
     6361 aagtcggcgt cgattcggtg ctggttgccg atgtgccagt tgaagagtcc gcgcccttcc
     6421 gccaggccgc gttgcgtcac aacgtcgcac ctatcttcat ctgcccgcca aatgccgatg
     6481 acgacctgct gcgccagata gcctcttacg gtcgtggtta cacctatttg ctgtcacgag
     6541 caggcgtgac cggcgcagaa aaccgcgccg cgttacccct caatcatctg gttgcgaagc
     6601 tgaaagagta caacgctgca cctccattgc agggatttgg tatttccgcc ccggatcagg
     6661 taaaagcagc gattgatgca ggagctgcgg gcgcgatttc tggttcggcc attgttaaaa
     6721 tcatcgagca acatattaat gagccagaga aaatgctggc ggcactgaaa gtttttgtac
     6781 aaccgatgaa agcggcgacg cgcagttaat cccacagccg ccagttccgc tggcggcatt
     6841 ttaactttct ttaatgaagc cggaaaaatc ctaaattcat ttaatattta tctttttacc
     6901 gtttcgctta ccccggtcga tcgtyractt acgtcatttt tccgcccaac agtaatataa
     6961 acaaacaaat taaacccgca acataacacc agtaaaatca ataattttct ctaagtcact
     7021 tattcctcag gtaattctta atatatccag aatgttcctc aaaatatatt ttccctctat
     7081 cttctcgttg cgcttaattt gactaattct cattagcgac taattttaat gagtgtcgac
     7141 acacaacact catattaatg aaacaatgca acgcaacggg agaaataaca tggccgaaca
     7201 tcgtggtggt tcaggaaatt tcgccgaaga ccgtgagaag gcatccgacg cagccgtaaa
     7261 ggcggtcagc atagcggcgg taattttaaa aatgatcgca acgcgcatct gaagcgggta
     7321 aaaaaggcgg tyrac
//
LOCUS       HUMGSTH       808 bp ss-mRNA            PRI       01-AUG-1990
DEFINITION  Human glutathione S-transferase (GST) a-subunit mRNA, complete
            cds.
ACCESSION   M14777
KEYWORDS    GSH S-transferase; glutathione S-transferase.
SOURCE      Human liver, cDNA to mRNA, clone pGTH1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 808)
  AUTHORS   Tu,C.-P.D. and Qian,B.
  TITLE     Human liver glutathione S-transferases: Complete primary sequence
            of an H-a subunit cDNA
  JOURNAL   Biochem. Biophys. Res. Commun. 141, 229-237 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         67      735     glutathione S-transferase (GST, EC 2.5.1.18)
                                 /hgml_locus_uid="LL0130R" /nomgen="GST2"
                                 /map="6p12.2"
    mRNA    <     1      808     GST mRNA
    signal      792      797     polyA signal
BASE COUNT      252 a    175 c    192 g    189 t
ORIGIN      Chromosome 6p12.2.
        1 agttgtcgag ccaggacggt gacagcgttt aacaaagctt agagaaacct ccaggagact
       61 gctatcatgg cagagaagcc caagctccac tacttcaatg cacggggcag aatggagtcc
      121 acccggtggc tcctggctgc agctggagta gagtttgaag agaaatttat aaaatctgca
      181 gaagatttgg acaagttaag aaatgatgga tatttgatgt tccagcaagt gccaatggtt
      241 gagattgatg ggatgaagct ggtgcagacc agagccattc tcaactacat tgccagcaaa
      301 tacaacctct atgggaaaga cataaaggag agagccctga ttgatatgta tatagaaggt
      361 atagcagatt tgggtgaaat gatcctcctt ctgcccgtat gtccacctga ggaaaaagat
      421 gccaagcttg ccttgatcaa ggagaaaata aaaaatcgct acttccctgc ctttgaaaaa
      481 gtcttaaaga gccatggaca agactacctt gttggcaaca agctgagccg ggctgacatt
      541 catctggtgg aacttctcta ctacgtcgag gagcttgact ccagtcttat ctccagcttc
      601 cctctgctga aggccctgaa aaccagaatc agcaacctgc ccacagtgaa gaagtttcta
      661 cagcctggca gcccaaggaa gcctcccatg gatgagaaat ctttagaaga agcaaggaag
      721 attttcaggt tttaataacg cagtcatgga ggccaagaac ttgcaatacc aatgttctaa
      781 agttttgcaa caataaagta ctttacct
//
LOCUS       MUSIGKACY     321 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse lysozyme-binding Ig kappa chain (HyHEL-10) V23-J2 region
            mRNA, partial cds.
ACCESSION   M35667
KEYWORDS    immunoglobulin light-chain; kappa-immunoglobulin; processed gene;
            variable region VK23.
SOURCE      Mouse hybridoma, cDNA to mRNA, clone 10K-106.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Mainhart,Smith-Gill-S.J., Lavoie,C., Feldman,T.B., Drohan,R.J. and
            Brooks,W.B.R.
  TITLE     A three-dimensional model of an anti-lysozyme antibody
  JOURNAL   J. Mol. Biol. 194, 713-724 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   321     lysozyme binding Ig kappa chain V23-J2 region
                                 (AA at 1)
    recomb      285      286     V23 region end/J2 region start
BASE COUNT       88 a     80 c     75 g     78 t
ORIGIN      
        1 gatattgtgc taactcagtc tccagccacc ctgtctgtga ctccaggaaa tagcgtcagt
       61 ctttcctgca gggccagcca aagtattggc aacaacctac actggtatca acaaaaatca
      121 catgagtctc caaggcttct catcaagtat gcttcccagt ccatctctgg gatcccctcc
      181 aggttcagtg gcagtggatc agggacagat ttcactctca gtatcaacag tgtggagact
      241 gaagattttg gaatgtattt ctgtcaacag agtaacagct ggccgtacac gttcggaggg
      301 gggaccaagc tggaaataaa a
//
LOCUS       MUSLTAGBSA    237 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Mouse SV40 transformed large T-antigen binding site DNA.
ACCESSION   M35500
KEYWORDS    large T antigen.
SOURCE      Mouse (strain BALB/c) SV40 transformed cell line SVA31E7 DNA, clone
            p27.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 237)
  AUTHORS   Lane,D.P., Simanis,V., Bartsch,R., Yewdell,J., Gannon,J. and
            Mole,S.
  TITLE     Cellular targets for SV40 large T-antigen
  JOURNAL   Proc. R. Soc. Lond., B, Biol. Sci. 226, 25-42 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    binding      84      118     large T-antigen binding site
BASE COUNT       67 a     65 c     60 g     45 t
ORIGIN      
        1 ggatccatcc cataatcagc ctctaaacgc tgacaccatt gcatacacta gcaagatttt
       61 gctgaaagaa ccctgatata gctgtctctt gtgaggctat gccggggcct agcaaacaca
      121 gaagtggatg ctcacagtca gctagtggat cacagggccc ccaatggagg agctagagaa
      181 agtacccaag gagctaaagg gatcctctac gccggacgca tcgtggccag tcaccgc
//
LOCUS       PEAIVSS       350 bp ds-DNA             PLN       01-AUG-1990
DEFINITION  Pea legumin J gene, exons 1 and 2 (partial).
ACCESSION   M26771
KEYWORDS    legumin.
SOURCE      Pea DNA, clone pSP65LegJi.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 350)
  AUTHORS   Brown,J.W.S., Feix,G. and Frendewey,D.
  TITLE     Accurate in vitro splicing of two pre-mRNA plant introns in a HeLa
            cell nuclear extract
  JOURNAL   EMBO J. 5, 2749-2758 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       48     legumin J, exon 1 (AA at 3)
                 50      144     legumin J, exon 2
    pre-msg <     1  >   350     legumin J mRNA and introns
    IVS          49      186     legumin intron
BASE COUNT      117 a     74 c     71 g     88 t
ORIGIN      
        1 gaatacacgg aattcgagct cgcccgggga tcccattcaa ccccaagagt aagtaatagt
       61 gtatccatac attacattat ctcttataaa ttgttcatac agcatgctca ttcgattata
      121 actttaaaag tttctaatgt ataatttgtt atactaaatc aatcacacgt aaatatgtgt
      181 atgcaggtat tttaccttgg tgggaaccca gaaacagagt tccccgaaac acaggaggaa
      241 caacaaggaa ggcatcggca aaagcatagt taccctgttg gacgtaggag tggacatcac
      301 caacaagaag aggaatggga tcctctagag tcgacctgca gcccaagctt
//
LOCUS       RATCGM1AA    3190 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) mRNA, complete
            cds.
ACCESSION   M32474 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SOURCE      R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation,
            cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3190)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analysis imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        122     2251     carcinoembryonic antigen-related protein
                                 precursor (CGM1)
    sigp        122      220     carcinoembryonic antigen-related protein
                                 signal peptide
    matp        221     2248     CGM1 protein
BASE COUNT      871 a    798 c    693 g    828 t
ORIGIN      
        1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta
       61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca
      121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct
      181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc
      241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga
      301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact
      361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt
      421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac
      481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt
      541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc
      601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga
      661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc
      721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac
      781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
      841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt
      901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc
      961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga
     1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc
     1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac
     1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta
     1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt
     1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc
     1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct
     1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta
     1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta
     1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact
     1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac
     1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt
     1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc
     1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag
     1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc
     1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa
     1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca
     1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg
     2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct
     2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag
     2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct
     2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga
     2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg
     2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca
     2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac
     2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc
     2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct
     2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt
     2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag
     2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa
     2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat
     2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct
     2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag
     2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct
     3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt
     3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag
     3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt
     3181 ttctccaaat
//
LOCUS       RATCGM1AC1   2238 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene, exons
            1 and 2.
ACCESSION   M32476 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     1 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2238)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene and analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept        704      767     carcinoembryonic antigen-related protein
                                 (CGM1) precursor, exon 1
               1770  +  2129     carcinoembryonic antigen-related protein, exon
                                 2
    sigp        704      767     carcinoembryonic antigen-related protein
                                 (CGM1) signal peptide
               1770     1804     carcinoembryonic antigen-related protein
                                 signal peptide
    matp       1805  +  2129     carcinoembryonic antigen-related protein
    pre-msg     512  >  2238     CGM1 mRNA and introns
    IVS         768     1769     CGM1 intron A
    IVS        2130  >  2238     CGM1 intron B
BASE COUNT      615 a    532 c    539 g    552 t
ORIGIN      
        1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata
       61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg
      121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca
      181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc
      241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg
      301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt
      361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct
      421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca
      481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg
      541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg
      601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga
      661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt
      721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta
      781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt
      841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg
      901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt
      961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg
     1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg
     1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt
     1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa
     1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc
     1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt
     1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc
     1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa
     1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta
     1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag
     1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag
     1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg
     1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg
     1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac
     1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct
     1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac
     1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc
     1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac
     2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc
     2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg
     2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc
     2221 tgcattgatc tacatgtt
//
LOCUS       RATCGM1AC2    539 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32477 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     2 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 539)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   539     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      126 a    127 c    138 g    148 t
ORIGIN      
        1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact
       61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt
      121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg
      181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct
      241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac
      301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt
      361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc
      421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg
      481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc
//
LOCUS       RATCGM1AC3    828 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron B.
ACCESSION   M32478 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     3 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 828)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   828     carcinoembryonic antigen-related protein
                                 intron B
BASE COUNT      193 a    255 c    190 g    190 t
ORIGIN      
        1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct
       61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc
      121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca
      181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa
      241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata
      301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt
      361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt
      421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg
      481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag
      541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc
      601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga
      661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac
      721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac
      781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc
//
LOCUS       RATCGM1AC4    642 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 3.
ACCESSION   M32479 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     4 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 642)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    61  +   420     carcinoembryonic antigen-related protein
                                 (CGM1), exon 3
    matp    +    61  +   420     carcinoembryonic antigen-related protein
    pre-msg <     1  >   642     CGM1 mRNA and introns
    IVS     <     1       60     CGM1 intron B
    IVS         421  >   642     CGM1 intron C
BASE COUNT      151 a    160 c    154 g    177 t
ORIGIN      
        1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag
       61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc
      121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc
      181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat
      241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt
      301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct
      361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca
      421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt
      481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg
      541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt
      601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg
//
LOCUS       RATCGM1AC5    616 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 4.
ACCESSION   M32480 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     5 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 616)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +    90  +   449     carcinoembryonic antigen-related protein
                                 (CGM1), exon 4
    matp    +    90  +   449     carcinoembryonic antigen-related protein
    pre-msg <     1  >   616     CGM1 mRNA and introns
    IVS     <     1       89     CGM1 intron C
    IVS         450      616     CGM1 intron D
BASE COUNT      152 a    153 c    147 g    164 t
ORIGIN      
        1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa
       61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac
      121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct
      181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat
      241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg
      301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt
      361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa
      421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg
      481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag
      541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca
      601 ttgtctgatt cccttc
//
LOCUS       RATCGM1AC6    654 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            intron 4.
ACCESSION   M32481 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     6 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 654)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   654     carcinoembryonic antigen-related protein
                                 intron D
BASE COUNT      185 a    146 c    155 g    168 t
ORIGIN      
        1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg
       61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat
      121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc
      181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac
      241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg
      301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga
      361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat
      421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt
      481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt
      541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca
      601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa
//
LOCUS       RATCGM1AC7    492 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 5.
ACCESSION   M32482 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     7 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 492)
  AUTHORS   Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   118  +   471     carcinoembryonic antigen-related protein
                                 (CGM1), exon 5
    matp    +   118  +   471     carcinoembryonic antigen-related protein
    pre-msg <     1  >   492     CGM1 mRNA and introns
    IVS     <     1      117     CGM1 intron D
    IVS         472      492     CGM1 intron E
BASE COUNT      134 a    134 c    105 g    119 t
ORIGIN      
        1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc
       61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct
      121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg
      181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt
      241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag
      301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg
      361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa
      421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac
      481 tctcaaggtc tc
//
LOCUS       RATCGM1AC8   1341 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM1) gene,
            exon 6.
ACCESSION   M32483 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SEGMENT     8 of 8
SOURCE      R.norvegicus (strain Sprague-Dawley) liver DNA, clone
            lambda-rnCGM1-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1341)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    +   640  /   987     carcinoembryonic antigen-related protein
                                 (CGM1), exon 6
    matp    +   640  /   987     carcinoembryonic antigen-related protein
    pre-msg <     1  >  1341     CGM1 mRNA and introns
    IVS     <     1      639     CGM1 intron E
    IVS         988     1341     CGM1 intron F
BASE COUNT      357 a    329 c    321 g    334 t
ORIGIN      
        1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt
       61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat
      121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga
      181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca
      241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa
      301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat
      361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag
      421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga
      481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg
      541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt
      601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca
      661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc
      721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca
      781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc
      841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca
      901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag
      961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt
     1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc
     1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa
     1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct
     1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg
     1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag
     1321 atccgttgcc tgacggtcga c
//
LOCUS       RATCGM4AA    4627 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Rat carcinoembryonic antigen-related protein (CGM4) gene, exons 2
            and 3.
ACCESSION   M32475 J05417
KEYWORDS    carcinoembryonic antigen-related protein.
SOURCE      Rat (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM415-1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (sites for [2])
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  TITLE     cDNA and gene analyses imply a novel structure for a rat
            carcinoembryonic antigen-related protein
  JOURNAL   J. Biol. Chem. 265, 7872-7879 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 4627)
  AUTHORS   Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            W.Zimmermann, 02-MAR-1990.
FEATURES       from  to/span     description
    pept    /   145      489     carcinoembryonic antigen-related protein
                                 (CGM4), exon 2 (AA at 147)
               3379  /  3738     carcinoembryonic antigen-related protein exon
                                 3
    pre-msg <     1  >  4627     CGM4 mRNA and introns
    IVS     <     1      144     CGM4 intron B
    IVS         490     3378     CGM4 intron C
    IVS        3739  >  4627     CGM4 intron D
BASE COUNT     1213 a   1165 c   1162 g   1087 t
ORIGIN      
        1 agatctgggt cgacctgcag gtcaacggat ctgggcctta gcaggagtgt gggcagagct
       61 ctgggaaggc agaagtgtga ttttttaaaa aaccaacaga tttcacctgc tcaatatcga
      121 tggttgctct gtcttccctt ttagcctccc ttctaacctg ttggctcctg actactgccc
      181 aggtcaacat tgaatcggtg ccattcaatg tggttgaagg ggaaaacgtc cttcttcttg
      241 tccacaatct gccagagaat ctcatagcct ttgcctggta tagagggctg aggaaaattg
      301 gagtatacat actgaacact gaagtaagtg tgacggggcc aatgtacagc ggtagagaga
      361 cagtgtacag caatggttcc ctgtgtatcc gcaatgtcac ccagaaggac acaggattct
      421 acactctacg aacagtcaac acacgtggag aaactgtatc aacaacatcc ttgtacctct
      481 atgtgtacag taagtgatac tttgtgaact ctgggtgttg tgtggggttc attccgtaga
      541 cacacacaga agaggcaggc ctacctaccc tttgcattgt gtctccttat tgaggtgtga
      601 acatttaact caggctaagg agagtaatgc caattgaata gaatccttct tttgacttta
      661 ccttgtagtc agctggatgt gtggttaact cagtgaagga catcagccct tgtctagact
      721 tctggggttc ttagcagtaa tgtgtccttg ggaaagacct tgagggaagg agattgggtt
      781 tgaatgagat agccatagga tcctcatgga agtgagaacc agaaagccct ggctccagac
      841 ctctgtcctg actcatctcc tgatggcccc gagaagcatt ttacaaaggc tggattctga
      901 catctgttgg cagggaacag tgcttttgag gagcaaatcc ttgtgccaca tacaatcacc
      961 tggtgcacgg ccatgagagc cacagttagg cgaggtctcc tggatctctc cagtgactca
     1021 tcagggagag aatagaaaga cagatgtccc ggccactaag ttaactgtta tgatggcctt
     1081 atgagacttc caggaaggtc atggttgcca ggaagaggga caaaggacac agatccccct
     1141 gacagttgct tgtcctttgg ggtccagctc atagaagtct gtccgcaggc aaatgacacc
     1201 aggctctgct gatgtggata gctccccaga tctgagctgc agttctccca gcgatcacga
     1261 gggccgcctc agggaaacac aattaacacc cagaagagta tttgtctaaa ccaggaactt
     1321 acctcctcct ctggctagct cccctgttcc tacagacatg ggggtcacac agccttctca
     1381 gacctaccag ctgcctcctt ttctgctgcc ttgctaggga attatgtgta gtggctgctt
     1441 tgtgtatttt ctttggaaaa gatagagtat cctaagggaa tcacccagac agaggttcaa
     1501 ggcatctctg aaaggccagg cagcacatgg cagagccacc tcacagctca ggacccagag
     1561 gaagtgtgcc caccatcttg aatccatgca tgggacgatg gagcccagag ctacgttcca
     1621 ggactcaggt cacctcccac acactcaaga agtgaggctc ctgacacagc tgctcctggg
     1681 ccccttttct ccctgagaat cctgactggt ggctgcagtg agaacacatc tgtcccctcc
     1741 cccactcgtc acacagctgg cccttgggat cctcacacac atctctgtct ccttcctcct
     1801 gagagcaaac tacctctttg acgggcactg agaacacagg gcagactggg tgcccagctg
     1861 gttctgggtc acccagggag tgcagaggct cactcactgg tgctgactga gccaggaaga
     1921 ggccagaaca gagggatgcc ccccgggtga gctgctgtct tcttagggca cagagatgct
     1981 cagaggtttg tttgtcactg tgagctctgt ggcatgagac agaaagagcc cagaggagag
     2041 gttaggtgtg taggactgag tgtgcacagg gcagagaaca gagttaccca cagcccacgg
     2101 gactctggga tatgatcctg tctggcggag gctgagctca gaggatcaga gaacttggga
     2161 gctgtattgg agcagatgtg ctacagactg aggacagatc tggccacaga gaccagggcg
     2221 gtgctctgta ccatctgcaa acaatgcccc acctgttggt gctcctgctc acagatgagg
     2281 agaccacatt ttacagtgtg tgagaggaga ggactcacct actgtctaaa gtctcttcaa
     2341 ggggacaggg actggagaag agtttcaggt ttgtagggct gaaaacacta aagtataggg
     2401 gctcatcatc atcatcatca ccaccgccat caccaccacc accaccacca ccaccaccac
     2461 caccaccacc accaccacca ccatcatcat catcatcatg aggctcttgg taaataagaa
     2521 gaagcagggg gaggaggaga ttattgtcaa cccacagttc accatcaatg agcccagtgt
     2581 tctgaagact gaggttctca gctgtgatgc cccaaataag aaaccaagct ggtgttgatc
     2641 agtgacatgg ctcagtggat ctgggtgttt gcttcatgtc tgacaacctg agaaccagtg
     2701 aacacaagtt gtccctgacc tccacctagg gacggcgttt tgcacccaac acagacacac
     2761 tgaggcatgc ccttgcacat gaactcatac accaatataa taagcaaatg cataaaaatt
     2821 atagcaaatg gaagcagtca acactgtatt cccaaacata ctaatttgtt aaataaatcc
     2881 atggccatgt attcattcat tcattcattc actcattcat ttactctcca agatatttga
     2941 gttttctttt gcagtctttt ttttttaaaa gataatataa gacaaatccc agttctcatt
     3001 attccctagc cctagactgg aagacgacca gtgaagaaag ctagaaggcg aatcagtcac
     3061 taaaggacaa gaaacaaaag agtcagagtg tgacggtcgg gaggcttcac cccaacaccc
     3121 atcgactgac actgagggtg agcagggatc tgaggacggt gaggcagggc catgttgaca
     3181 cctgaggaga gagcagcata gagaggaaat gacaagtgag gggcgcggag tgcatggagg
     3241 taatgcactg acctccacta gctagggcag ggagactccc acacctcagc tgaccactgg
     3301 acacagctgc tcggactcag gcaccatctt agccaaatac taaagtcctg atgttgacgg
     3361 atctctcttc ccttctagcc tctcttttca tctgtgggcg tccttttaac cctgccaagc
     3421 tcactattga atcagtgccg cccagtgttg ctgaaggggg aagcgttctt ctcctcgttc
     3481 acaatctcca ggacgagctt cgagggtttt tctggtacaa aggggcgtct atgtctagca
     3541 accatgagat agcccgatac agaacagcaa agaattcaag tgtgccaggc cctgcccaca
     3601 gtggtagaga gacggtgtac agcaatggat ccctcctgct ccagaatgtc acccggaatg
     3661 acactgggtt ctacacccta cgcactctga aaagacatca gaaaatggaa ttggcacacg
     3721 tgcaacttca ggtggacagt aagtgatttt ccgtgatcgt tcagtgctgg gtgggtcttt
     3781 gacacacagg actgtcaccc ctggcatgtg gctacctcct ctctgccttt ttatccccat
     3841 gttgtggtta accactatgt gcaggacaca tgtgatggaa agaaatgccc atgggtcaga
     3901 cttatcatct gactctcccc tgtatcaagg acagtaactc aaccctaggt gctagactct
     3961 gcccagtcat ctggggcatc ttgccatgca acgtgaggaa accatggatc ctcacagcgt
     4021 ggtgagcacc aggaagctct gatctcagtc gtttgtccca gacttgactg caaatgtctc
     4081 taggagcatt ttgtcaggag tgctgcttac tgcctctctc ctcacagcct gccatcctga
     4141 tcttatagta acccaggaca ctgagcccag gggtgaaaat gctcccagtt gggctgggct
     4201 ctcccacatc aatcaccaat taaaaatgta ctacaggtta gcccacaggt tattttggtg
     4261 gtggcatttt aaattgaggc ccttgtttca aaaaattcta gcttgtgtta agttgacata
     4321 aagccagcag cacgattcct gagccctccc caatacctat atctgccaag aagaccagac
     4381 tgttcccacc catcatccgg ccttagtcct gggtgctata ggctgggacg tgagaacatg
     4441 tggaatgtga agtctgagga tgaccgcagg tacaaaggag atgagaaagt cagagagtgt
     4501 gtatccaggg tgtgtagaga ccaaaggtca ggggaggcat catcccaaag cacagtgtgc
     4561 atgagtatgt gcaatgtctg aatgagggca gtgagggaca gccacggaga caccaaggac
     4621 agagctc
//
LOCUS       STMRGDA      2540 bp ds-DNA             BCT       01-AUG-1990
DEFINITION  S.coelicolor 16S rRNA gene and 23S rRNA, 5' end (rrnD) gene
            cluster.
ACCESSION   Y00411 M35377
KEYWORDS    16S ribosomal RNA; 23S ribosomal RNA.
SOURCE      S.coelicolor (strain 1147 A3(2)) DNA, clone RSC33.
  ORGANISM  Streptomyces coelicolor
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 705 to 2230)
  AUTHORS   Baylis,H.A. and Bibb,M.J.
  TITLE     The nucleotide sequence of a 16S rRNA gene from Streptomyces
            coelicolor A3(2)
  JOURNAL   Nucleic Acids Res. 15, 7176-7176 (1987)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 771 and 2196 to 2540)
  AUTHORS   Baylis,H.A. and Bibb,M.J.
  TITLE     Transcriptional analysis of the 16S rRNA gene of the rrnD gene set
            of Streptomyces coelicolor A3(2)
  JOURNAL   Mol. Microbiol. 2, 569-579 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      144     ORF (AA at 1)
    rRNA        536  >  2231     16S rRNA gene
    rRNA        537  >  2231     16S rRNA gene
    rRNA        704     2231     16S rRNA
    rRNA       2507  >  2540     pot. 23S rRNA
BASE COUNT      573 a    654 c    834 g    478 t      1 others
ORIGIN      
        1 tgggcccgca tcaccatcgg cgtcctcgcc gagctggcct tcctggccta cgtctacgtt
       61 ctgggcggcc gagccgtgcg cgacggcgag acgggtgacg tcgaggcagc cgaacgcagc
      121 gccacggtgc caacagccgc ctgatgtgca tccacccctg cgagctgcta gtgtcctctt
      181 cgttcccgca agagccgttg acacggagcg agcggggagg tagattcgaa cagttgcctg
      241 gagacgggtt caccccagag ggcaacagtg aacatctacc agcttctccg aatcaacgaa
      301 ttcgacgaag cactctcccg atgaatcgga aacgaaggcc ggtaagaccg gctcgaaagt
      361 tctgataaag tcggagccgc cggaaaggga aacgcgaaag cgggaacctg gaaagcgccg
      421 aggaaatcgg atcggaaaga tctgatagag tcggaaacgc aagaccgaag ggaagcgccc
      481 ggaggaaagc ccgagagggt gagtacaaag gaagcgtgcc gttccttgag aactcaacag
      541 cgtgccaaaa gtcaacgcca gatatgttga taccccgacc tgatcggatc tccgttcggg
      601 ttgaggttcc tttgaagtaa cacaacagcg aggacgctgt gaacggtcgg attattcctc
      661 cgactgttcc gctctcgtgg tgtcacccga ttacgggtat acattcacgg agagtttgat
      721 cctggctcag gacgaacgct ggcggcgtgc ttaacacatg caagtcgaac gatgaaccac
      781 ttcggtgggg attagtggcg aacgggtgag taacacgtgg gcaatctgcc cttcactctg
      841 ggacaagccc tggaaacggg gtctaatacc ggatactgac cctcgcaggc atctgcgagg
      901 ttcgaaagct ccggcggtga aggatgagcc cgcggcctat cagcttgttg gtgaggtaat
      961 ggctcaccaa ggcgacgacg ggtagccggc ctgagagggc gaccggccac actgggactg
     1021 agacacggcc cagactccta cgggaggcag cagtggggaa tgttgcacaa tgggcgaaag
     1081 cctgatgcag cgacgccgcg tgagggatga cggccttcgg gttgtaaacc tctttcagca
     1141 gggaagaagc gaaagtgacg gtacctgcag aagaagcgcc ggctaactac gtgccagcag
     1201 ccgcggtaat acgtagggcg caagcgttgt ccggaattat tgggcgtaaa gagctcgtag
     1261 gcggcttgtc acgtcggttg tgaaagcccg gggcttaacc ccgccactgc agtcgatacg
     1321 ggcaggctag agttcggtag gggagatcgg aattcctggt gtagcggtga aatgcgcaga
     1381 tatcaggagg aacaccggtg gcgaaggcgg atctctgggc cgatactgac gctgaggagc
     1441 gaaagngtgg ggagcgaaca ggattagata ccctggtagt ccacgccgta aacggtgggc
     1501 actaggtgtg ggcaacattc cacgttgtcc gtgccgcagc taacgcatta agtgccccgc
     1561 ctggggagta cggccgcaag gctaaaactc aaaggaattg acgggggccc gcacaagcgg
     1621 cggagcatgt ggcttaattc gacgcaacgc gaagaacctt accaaggctt gacatacacc
     1681 ggaaagcatc agagatggtg ccccccttgt ggtcggtgta caggtggtgc atggctgtcg
     1741 tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ttgtcccgtg
     1801 ttgccagcaa gccttcgggg tgttggggac tcacgggaga ccgccgggtc aactcggagg
     1861 aaggtgggga cgacgtcaag tcatcatgcc ccttatgtct tgggctgcac acgtgctaca
     1921 atggccggta caatgagctg cgataccgca aggtggagcg aatctcaaaa agccggtctc
     1981 agttcggatt ggggtctgca actcgacccc atgaagtcgg agtcgctagt aatcgcagat
     2041 cagcattgct gcggtgaata cgttcccggg ccttgtacac accgcccgtc acgtcacgaa
     2101 agtcggtaac acccgaagcc ggtggcccaa ccccttgtgg gagggagctg tcgaaggtgg
     2161 gactggcgat tgggacgaag tcgtaacaag gtagccgtac cggaaggtgc ggctggatca
     2221 cctcctttct aaggagcaca tagccgactg cagcgaaatg tcctgcacgg ttgctcatgg
     2281 gtggaacgtt gactactcgg cacggtcttc ttgatggatc actagtactg cttcggcgtg
     2341 gaacgtgact tcaaagaggg gttcgtgtcg ggcacgctgt tgggtatctg agggtacggc
     2401 cgtgaggtcg ccttcagttg ccggccccgg taaaaatccg cgtgagtggg ttgtgacggg
     2461 tggttggtcg ttgtttgaga actgcacagt ggacgcgagc atctgtggcc aagtttttaa
     2521 gggcgcacgg tggatgcctt
//
LOCUS       SUSCYIIAA     230 bp ds-DNA             INV       01-AUG-1990
DEFINITION  S.purpuratus cytoskeletal actin CyIIa gene, complete cds.
ACCESSION   M35321 M35322
KEYWORDS    cytoskeletal actin SpG11A.
SOURCE      S.purpuratus DNA, clone pSpG11A.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; 
            Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE   1  (bases 1 to 230)
  AUTHORS   Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M.
  TITLE     DNA sequence analysis and structural relationships among the
            cytoskeletal actin genes of the sea urchin Strongylocentrotus
            purpuratus
  JOURNAL   J. Mol. Evol. 28, 72-86 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        219  >   230     actin CyIIa
BASE COUNT       74 a     43 c     33 g     80 t
ORIGIN      
        1 ttcgaattgt cactcattct tcaaataaag attgtgagat cacgcgtttt ctgtacccta
       61 ccctacaaat acgtaggaca cctgggtatg tagtgaacct taaagtttat aaatgatgtt
      121 cttgtttgtc catcaattta accgggaaaa aaatttatct gtctaatatc attatctatt
      181 ttcacacttt tagatcaaac tagattaaac aaatcatcat gtgtgacgac
//
LOCUS       SUSCYIIBA    1972 bp ds-DNA             INV       01-AUG-1990
DEFINITION  S.purpuratus cytoskeletal actin CyIIb gene, complete cds.
ACCESSION   M35323
KEYWORDS    cytoskeletal actin CyIIb.
SOURCE      S.purpuratus DNA, clone pSpG11A.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; 
            Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE   1  (bases 1 to 1972)
  AUTHORS   Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M.
  TITLE     DNA sequence analysis and structural relationships among the
            cytoskeletal actin genes of the sea urchin Strongylocentrotus
            purpuratus
  JOURNAL   J. Mol. Evol. 28, 72-86 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        251      616     cytoskeletal actin CyIIb, exon 1
                841     1087     cytoskeletal actin CyIIb, exon 2
               1312     1829     cytoskeletal actin CyIIb, exon 2
    pre-msg       1     1972     CyIIb mRNA and introns
    IVS         617      840     CyIIb intron A
    IVS        1088     1311     CyIIb intron B
BASE COUNT      529 a    492 c    408 g    543 t
ORIGIN      
        1 tcggcagttc aagaccacgt gtgtttcccg gattggtaaa ctccttatca cgaactcctt
       61 atcagtaaaa cttacgagct ttgtacactt ttaatgactt ttcgattatt ctttcaagag
      121 attttccctg ccacaaaatt acttagttct tttatttctc attcctgtgc aattccaatt
      181 actagcattt tatttatgat ccatttttgt gtttttattt tagagtaaat aaaacgagaa
      241 atcaatcatc atgtgtgacg acgatgttgc cgctcttgtc atcgacaacg gatccggtat
      301 ggtgaaggcc ggattcgccg gagacgatgc cccaagggct gtcttcccat ccatcgttgg
      361 cagaccccgt caccagggtg tcatggtcgg catgggacag aaggacagct acgtcggaga
      421 cgaggcccag agcaagagag gtatcctcac cctgaagtac cccatcgagc acggtatcgt
      481 caccaactgg gacgatatgg agaagatctg gcatcacacc ttctacaacg aactccgtgt
      541 tgccccggag gagcaccccg tcctccttac cgaggctccc ctcaacccca aggccaacag
      601 ggaaaagatg acacaggtta gaaaaagcaa tatgcctatt attgaagtaa tcaaattctc
      661 aaaacaaata cattctcaca tttaaacatc ttaatttaag ctgtttatta atattaatat
      721 caagtgagtt tcgttgttga aataacagcg attgactaaa atgaacttgt atcaaacttg
      781 ttgtgattag tgaaatgaaa tcggtgatta acaattgttt tgttttcatg tcttctgcag
      841 atcatgttcg agaccttcaa ctcacccgcc atgtacgtcg ctatccaggc cgtgctttcc
      901 ctctacgcct ctggtcgtac cactggtatc gttttcgact ctggtgatgg tgtttcacac
      961 acagtgccca tctacgaggg ttatgccctt ccccacgcca tcctccgtct ggacttggct
     1021 ggacgtgatc tcacagacta cctgatgaag atccttaccg agcgtggcta ctctttcacc
     1081 accaccggta agatatcttt tttttacaat caaagagtga gtgaagctat cacctgcatc
     1141 ctgtgcttaa agaatattaa aaaaagagga gggaagatat tatatatgat taatgttcat
     1201 tttctttgga ctttgacaat aacattttgg ggggatagaa agtgaatgtt gcttttcgtt
     1261 atacattcgt aactaactaa tttcatcttg tttttttttt ctatcttgca gctgagcgtg
     1321 aaatcgttcg tgacatcaag gagaagctct gctacgttgc tcttgacttt gagcaagaga
     1381 tgcagactgc tgcctcatcc tcctccctcg agaagagcta cgagcttccc gacggacagg
     1441 tcatcaccat tggcaacgag cgattccgtg ccccagaggc cctcttccag ccagccttcc
     1501 ttggaatgga atccgctgga atccacgaga cctgctacaa cagcatcatg aagtgcgatg
     1561 ttgacatccg taaggatctg tacgccaaca ctgttctgtc tggaggctcc accatgttcc
     1621 caggaatcgc cgacaggatg cagaaggaga tcaccgccct tgccccacca accatgaaga
     1681 tcaagatcat tgctcctcca gaaaggaaat actccgtatg gatcggaggc tccatccttg
     1741 cctctctctc caccttccaa cagatgtgga tcagcaagca ggaatacgat gagtccggcc
     1801 catccatcgt ccacaggaag tgcttctaaa caactcgctt ttggtgaaca aactcttgaa
     1861 catcaatatc aaggaaacga ccatgatctc aaattgcaaa gtttaagtat gacaccattg
     1921 cgggcaatgc agccgaaaaa ctcgcgcttt ctcaaaactt ggaggactgc ag
//
LOCUS       SUSCYIIIBA   2918 bp ds-DNA             INV       01-AUG-1990
DEFINITION  S.purpuratus cytoskeletal actin CyIIIb gene, complete cds.
ACCESSION   M35324
KEYWORDS    cytoskeletal actin CyIIIb.
SOURCE      S.purpuratus DNA, clone pSpG11A.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; 
            Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE   1  (bases 1 to 2918)
  AUTHORS   Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M.
  TITLE     DNA sequence analysis and structural relationships among the
            cytoskeletal actin genes of the sea urchin Strongylocentrotus
            purpuratus
  JOURNAL   J. Mol. Evol. 28, 72-86 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        371      736     cytoskeletal actin CyIIIb, exon 1
               1634     1880     cytoskeletal actin CyIIIb, exon 2
               2247     2764     cytoskeletal actin CyIIIb, exon 2
    pre-msg       1     1972     CyIIIb mRNA and introns
    IVS         737     1633     CyIIIb intron A
    IVS        1881     2246     CyIIIb intron B
BASE COUNT      850 a    668 c    559 g    841 t
ORIGIN      
        1 acggttcggg catttaggga tagctttgat tttaagaatg ttaaaatgag aatgtcaaat
       61 agcctaacgc tggtctgtgc cagtaaacat gaatcaattc caaatgttga tatattaata
      121 gtcggggagt tcaaatagga caacatgttt cacggggata gaattatcag acataattat
      181 aatcccactt tgtcgtgaat tttgttggtt gtatgaaagt tttttagacc gtttgaaagg
      241 aaaacagacc tatgccaaat ccaccaccac gaattaacta gtctgcaaac aaagaaacta
      301 aaattaatat ttctctgggt atgtttttct catattcagg acaggaaaac gaaattcaat
      361 catcatgtgt atgtgtgacg atgatgttgc cgctcttgtc gtcgacaacg ggtccggaat
      421 ggtgaaggcc ggattcgccg gagacgatgc cccaagggct gtctttccat ccatcgttgg
      481 caggccccgt caccagggtg tcatggttgg tatgggacaa aaggacagct acgttggaga
      541 cgaagcacag agcaagagag gtatcctcac cctgaagtac cctattgagc acggtatcgt
      601 caccaactgg gacgatatgg agaagatctg gcatcacacc ttctacaacg agctccgtgt
      661 tgccccagag gagcaccccg tccttctgac agaggccccc ctcaacccta aggccaacag
      721 ggaaaagatg acacaggtaa ggatatagtg cggaattgca aaacattcct taaagatact
      781 atgtctcttt tgcacccaac atcagattct gtagaacttt gcaggaacta taattatgac
      841 ttgtcatgta tgtcctatct atgaaatcta aacattagca atgtcgtatt attcgaatta
      901 tgcaaggaaa cccgtttatc ttctagactt cactgtcaga cttactgaca tctatttttc
      961 tttattgtaa taacatacat acatttagct ttaacaggta catgagcatt tgtctacatc
     1021 aataacccac tatttgtgac ggccaaaatt aaactgattg aatatttgta cagcacaaaa
     1081 cgtacgacca atcggtgaaa gggtgtgaaa atgaaactat tacttaggtg atcgcaatta
     1141 cttaactcga ttcgataact aatggtaaca tgtagttatt ttcccactaa aagccctttt
     1201 taatcctttc gtttcgaagg aacttctaac ttagtttttt tccttcaaat gcagttggaa
     1261 tttaatcttt tcattgttgg cctgcaaatg ggacatacag tagtaccttt aactgcattt
     1321 tggcaggaat gaaatgaaca acggctacag atagcccacg tcaccaatag cctacataag
     1381 cgaagaaaac tagtcggata cccccacacg accgacatat cgctctccct gaccaatcta
     1441 aaatatcgtt tttctttttt aaagtccata aaatgctatg aaaacctttt cgtttcttta
     1501 ctgcagtgaa aataaaagct gatacggact acgagtacaa aatcgcgaac attcagataa
     1561 aaaagttgaa tttgcccagt ttataatccc tagagtttat tcttaattca aaaaaatatt
     1621 cttcttttgt tagatcatgt ttgagacctt caactcgccc gccatgtacg tcgccatcca
     1681 ggccgtgctt tccctctacg cctctggtcg taccactggt atcgttttcg actctggcga
     1741 cggtgtttca cacactgtac caatctatga gggttacgcc ctcccccacg ccatcatccg
     1801 tctggacttg gctggacgtg atcttaccga ttacctgatg aagatcctta ccgagcgtgg
     1861 ctactctttc accaccactg gtaagacatg atatggataa tagcaatagc taatgatgat
     1921 aattaaaata gggataattg ataatattag aatactaatg taaacagatg aatgtcttac
     1981 caaagggcag tctgtctcgg gttttgaatt caaaaacctc acatctcgtt atctttaagc
     2041 cgcagaccac aacacctgca tgttcatttt tttttttact gcttgttcaa atccttttga
     2101 caaagcgaat atctgattag atcgataata attaataaca aataccctct aagtcccgga
     2161 gtttcaacac atttccattg ttatcttcac attttacaat ttgtctgcaa ttgatatgtg
     2221 actgcatcca ttattatctc ttacagctga gcgtgaaatc gtccgtgaca taaaggagaa
     2281 gctctgctac gtagctcttg attttgagga ggagatgcaa actgctgcct catcctcctc
     2341 cctcgagaag agctacgagc ttcccgacgg acaggtcatc accatcggca acgagcgatt
     2401 tcgttgctca gagaccctct tacagccctc tttcattgga atggaatctg ctggaatcca
     2461 tgagacctgt tataacagca tcatgaagtg cgatgttgac atccgtaagg atctatacgc
     2521 caacaccgtt ctctccggag cttccaccat gttcccagga atcgctgaca ggatgcagaa
     2581 agagattgtc gcccttgccc caccaaccat gaagatcaag atcatcgctc ctcctgagag
     2641 gaaatactct gtatggatcg gaggctccat tcttgcctct ctctccacct tccaacagat
     2701 gtggatcagc aagcaggaat acgatgagtc tggtccatcc atcgtccaca ggaagtgctt
     2761 ctaaacaacc ttccaacaga tttggatcag caagcaggaa tacaatgagt ccggtccatc
     2821 catcgtccaa gggaagtgct tctaaacaac ttgattttct tctacttcta atgagcaacc
     2881 tgattttttt aattctgttt cactccatgt tgccacct
//
LOCUS       WHTIVSS       310 bp ds-DNA             PLN       01-AUG-1990
DEFINITION  Wheat amylase gene, exons 2 and 3 (partial).
ACCESSION   M26770
KEYWORDS    .
SOURCE      Wheat DNA, clone pSP64Amyi.
  ORGANISM  Triticum aestivum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 310)
  AUTHORS   Brown,J.W.S., Feix,G. and Frendewey,D.
  TITLE     Accurate in vitro splicing of two pre-mRNA plant introns in a HeLa
            cell nuclear extract
  JOURNAL   EMBO J. 5, 2749-2758 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      109     amylase, exon 2 (AA at 1)
                214  >   310     amylase, exon 3
    pre-msg <     1  >   310     amylase mRNA and intron
    IVS         110      213     amylase intron 2
BASE COUNT       79 a     91 c     76 g     64 t
ORIGIN      
        1 gaatacaagc ttgggctgca ggtcgacgca gaggctgtgg ccattcccct cggacaaggt
       61 catgcagggc tacgcctaca tcctcacaca cccgggcata ccatgcatcg taagtagtag
      121 cacactacac aacctcacca taacatttcg catcaaacgt accccacgat gtttgtgatc
      181 tgaacttaca actacttggt tttgcgcgcg cagttctacg accatgtgtt cgactggaaa
      241 ctgaagcagg agatcaccgc actggctacg gtcaggtcaa ggaacgggat ccccgggcga
      301 gctcgaattc
//
LOCUS       YSCMTARSA     384 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Yeast (S.cerevisiae) mitochondrial autonomously replicating
            sequence DNA.
ACCESSION   M35612
KEYWORDS    .
SOURCE      S.cerevisiae (strain 992) mitochondrial DNA, clone pYmit1021.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 384)
  AUTHORS   Mabuchi,T., Nishikawa,S. and Wakabayashi,K.
  TITLE     The nucleotide sequence of mitochondrial ARS in Saccharomyces
            cerevisiae
  JOURNAL   J. Gen. Appl. Microbiol. 30, 469-478 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site         46       56     consensus autonomously replicating sequence
    site        126      136     consensus autonomously replicating sequence
    site        245      255     consensus autonomously replicating sequence
    site        290      300     consensus autonomously replicating sequence
    site        148      156     ori/rep GC cluster A
    site        187      194     ori/rep GC cluster A
BASE COUNT      126 a     29 c     34 g    195 t
ORIGIN      
        1 ccgccgcggg cggacgccgg aggagaatta tatttttata taataattta tatttctata
       61 tatatatata tatattatat ataaatatta ttatatatat ttttatatat attataatta
      121 tattcattaa tattttatta tagtggtggg ggtcccaatt attattttca ataataattt
      181 atcatgggac ccggatatct tcttgttttt atttattatt ttttttaatt tattttaatt
      241 atttatttat aatttatatt atacaattta ttatttcgtt aataccttta tttatattat
      301 ataatatatt atattattat aatatattta ttgattatat taatacattt aactaatgtg
      361 tgctctatat ttattgaata gttt
//
LOCUS       YSCMTARSB     218 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Yeast (S.cerevisiae) mitochondrial Ser-tRNA, 3' end in and
            autonomously replicating sequence.
ACCESSION   M35613
KEYWORDS    transfer RNA-Ser.
SOURCE      S.cerevisiae (strain 992) mitochondrial DNA, clone
            pYmit1S2SC-delta-11.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 218)
  AUTHORS   Mabuchi,T., Nishikawa,S. and Wakabayashi,K.
  TITLE     The nucleotide sequence of mitochondrial ARS in Saccharomyces
            cerevisiae
  JOURNAL   J. Gen. Appl. Microbiol. 30, 469-478 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    tRNA    <     1       49     Ser-tRNA
    site        138      148     consensus autonomously replicating sequence
BASE COUNT       99 a     25 c     11 g     83 t
ORIGIN      
        1 ctatcattag tctttattgg ctacgtaggt tcaaatccta catcatccgt aataatacat
       61 atatataata ataattttaa tattattcct ataaaaataa aataaataaa taaataataa
      121 taattaatta attttaataa atataaaata tataaaataa taataataat aattattatt
      181 ttaataatat tatttatata atagtccggc ccgccccc
//
LOCUS       MUSMDRXX     2873 bp ds-DNA             ROD       01-AUG-1990
DEFINITION  Mouse P-glycoprotein (mdr1a) gene, exons 1 and 2.
ACCESSION   M33580
KEYWORDS    P-glycoprotein.
SOURCE      Mouse (strain BALB/c/NIH) macrophage-like cell line
            J774.2-vinblastine resistant subline J7.V1-1 DNA, clone pV1.1a.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2873)
  AUTHORS   Hsu,S.I.-H., Cohen,D., Kirschner,L.S., Lothstein,L.,
            Hartstein,M. and Horwitz,S.B.
  TITLE     Structural analysis of the mouse mdr1a (P-glycoprotein) promoter
            reveals the basis for differential transcript heterogeneity in
            multidrug-resistant J774.2 cells
  JOURNAL   Mol. Cell. Biol. 10, 3596-3606 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.S.Kirschner, 05-APR-1990.
FEATURES       from  to/span     description
    pept       2613  /  2677     P-glycoprotein, exon 2 (first expressed exon)
    pre-msg    1992  >  2873     P-glycoprotein mRNA and introns (alt.)
    pre-msg    1801  >  2873     P-glycoprotein mRNA and introns (alt.)
    IVS        2120     2606     P-glycoprotein intron A
    IVS        2678  >  2873     P -glycoprotein intron B
    signal     1904     1912     CAAT box
    signal     1956     1963     TATA box
    site       1880     1887     SP-1 site
    site       1921     1927     SP-1 site
    site       1937     1944     SP-1 site
    site       1869     1875     AP-1 site
    rpt           1     1300     L1Md repetitive element
BASE COUNT      860 a    621 c    714 g    678 t
ORIGIN      Chromosome 5.
        1 gaattctcac ctgaggaata ccgaatccag agaaacacct gaaaaaatgt tcaacatcct
       61 taatcatcag ggaaatgcaa atcaaaacaa ccctgagatt ccacctcaca ccagtcagaa
      121 tggctaagat caaaaattca ggtgacagca gatgctggcg aggatgtgga gaaagaggaa
      181 cactcctcca ttgttggtgg gagtgcaggc ttgtacaacc actctggaaa tcagtctggc
      241 ggttcctcag aaaactggac atagtactct cggaggatcc agcaatacct ctcctgggca
      301 tatatccaga agatgcccca acaggtaaga aggacacatg ctccactatg ttcatagcag
      361 ccttatttat aatagccaga agctggaaag aacctagatg cccctcaaca gaggaatgga
      421 tacagaaaat gtggtacatc tacacaatgg agtactactc agctattaaa aagaatgaat
      481 ttatgaaatt cctagccaaa tggatggacc tggggggcat catcctgagt gaggtaacac
      541 attcacaaag aaactcacac aatatgtatt cactgataag tggatattag ccccaaacct
      601 aggataccca agatataaga tataatttgc taaacacatg aaactcaagg agaatgaaga
      661 ctgaagtgtg gacactatgc ccctccttag atttgggaac aaaacaccca tggaaggagt
      721 tacagagacg gagtttggag ctgagatgaa aggatggacc atgtagagac tgccatagcc
      781 agggatccac cccataatca gcatccaaac gctgacacca ttgcatacac tagcaagatt
      841 ttattgaaag gacgcagatg tagctgtctc ttgtgagact atgccggggc cagcaaacac
      901 agaagtggat gctcacagtc agctaatgga tggatcatag ggctcccaat ggaggagcta
      961 gagaaagtag ccaaggagct aaagggatct gcaaccctat aggtggaaca acattatgag
     1021 ctaaccagta ccccggagct cttgactcta gctgcatata tatcaaaaga tggcctagtc
     1081 ggccatcact ggaaagagag gcccattgga cttgcaaact ttatatgccc cagtacaggg
     1141 gaataccagg gccaaaaagg gggagtgggt gggcagggga gtgggggtgg gtggatatgg
     1201 gggacttttg gtatagcatt ggaaatgtaa atgagttaaa tacctaataa aaaatggaaa
     1261 aaaaaataaa ataaaaataa gatgaaactg gaaaaaaaaa gttatgttta ataattccaa
     1321 ttgaactgta agaatttcag atgccctgga aaaacatgga cattggttta gtacctaaaa
     1381 gttcaaaata ttatatattt ttaaatacca ttttacactg aaatactcca tttatatact
     1441 ggggactgtc ctctttctgg tttgctttgt tttgtttaat aaaagaaata aaccaatcta
     1501 cctgaggaac tgtgaactat attgaagaaa agcctgcacg ggggttctct taccttttca
     1561 agagtgcttc aaagaaggga aatttactga caggcaaggt ctgtacccat tgtttaattg
     1621 tctgttagat gttatgcata gaatacgtct tttaacttag ccaaatgcag aaggccaagt
     1681 gcactatcta caaacacata actctatata tagacatgtg catggccgtg tagagatgag
     1741 actctgcaag tgtgtctcta atgattcggg ggatatgagt ttgtctaatt gacctttgag
     1801 agggaaacca gactgcacat ttcatctaca aatccaacct gtttcgcaat ttctccagca
     1861 ataatacttg agtcaagctg ggccgggagc tggttaacct ccaggtcaaa ctcactggct
     1921 gggcgggact gcgcctgggc gtagattgag catgctaaat ttactctcct gtccacagaa
     1981 agcccaggca cagtggaaca gcggtttcca ggagctgctg gtcccatctt ccaaggctct
     2041 gctcaactca gagccgcttc ttccaaagtc tacatcttgg tggactttgc agaggaaacc
     2101 gggagtagag acacgtgagg taagcatttc ctaggaaggg tcgggtgttc cggataccag
     2161 agcctggtcc gggtgtcagc gtaatcgtga gtctgtgggg accaagtggc gacacaagag
     2221 tcgctccagg agcacccgca gcatcagctt tcaggacggt gttttccgcg ccaccctgtg
     2281 ctgtggatct cgctgcccag ctcgcagcca ggggtggtgg aggagcgcgc cagggcgagg
     2341 ggacccagca ggcgggtggc ggacctagag ccgagcaccc ggtccacgca ggtgacacag
     2401 cttcccggga ttccccagtg agttacctcc aggccctctc cggcagcatc agggcggggc
     2461 tcctcctcac cactgggctc tgcggggcag tgagctttgc ataaactctg gtcccgtgtt
     2521 tggctaatga actgtggttt ctccccaggt cgtgatggaa cttgaagagg accttaaggg
     2581 aagagcagac aagaacttct ccccaggtcg tgatggaact tgaagaggac cttaagggaa
     2641 gagcagacaa gaacttctca aagatgggca aaaagaggta gccagattgt ttcactttcg
     2701 tactttactt gtcttgtaca ttcgggcaat tagtttgtag cctccagcac tgtacttgat
     2761 tagtgggtgt tatttcagac ttcagaaatg taaaccagcc cttggaagga actcctcgct
     2821 tggagcagtc cttcaaatgt gtgtgacaga tcaatcaatg attctgtgaa ttc
//
LOCUS       MUSMDR1A     4924 bp ss-mRNA            ROD       01-AUG-1990
DEFINITION  Mouse P-glycoprotein (mdr1a) mRNA, complete cds.
ACCESSION   M33581
KEYWORDS    P-glycoprotein.
SOURCE      Mouse (strain BALB/c/NIH) macrophage-like cell line
            J774.2-vinblastine resistant subline J7.V1-1, cDNA to mRNA, library
            pUC18-cDNA and pGEM-zf, clones pV1.PRC2, pV1.3, pV1.20, and pV1.10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4924)
  AUTHORS   Hsu,S.I.-H., Cohen,D., Kirschner,L.S., Lothstein,L.,
            Hartstein,M. and Horwitz,S.B.
  TITLE     Structural analysis of the mouse mdr1a (P-glycoprotein) promoter
            reveals the basis for differential transcript heterogeneity in
            multidrug-resistant J774.2 cells
  JOURNAL   Mol. Cell. Biol. 10, 3596-3606 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.S.Kirschner, 05-APR-1990. Albert Einstein College of Medicine,
            1300 Morris Park Ave, Bronx, NY 10461
FEATURES       from  to/span     description
    pept        137     3967     P-glycoprotein (mdr1a)
    mRNA    <     1     4924     P-glycoprotein mRNA
    signal     4315     4320     poly-A signal
    signal     4898     4903     poly-A signal
BASE COUNT     1450 a   1021 c   1210 g   1243 t
ORIGIN      Chromosome 5.
        1 acagtggaac agcggtttcc aggagctgct ggtcccatct tccaaggctc tgctcaactc
       61 agagccgctt cttccaaagt ctacatcttg gtggactttg cagaggaaac cgggagtaga
      121 gacacgtgag gccgtgatgg aacttgaaga ggaccttaag ggaagagcag acaagaactt
      181 ctcaaagatg ggcaaaaaga gtaaaaagga gaagaaagaa aagaaaccag cagtcagtgt
      241 gcttacaatg tttcgttatg caggttggct agacaggttg tacatgctgg tgggaactct
      301 ggctgctatt atccatggag tggcgctccc acttatgatg ctgatctttg gtgacatgac
      361 agatagcttt gcaagtgtag gaaacgtctc taaaaacagt actaatatga gtgaggccga
      421 taaaagagcc atgtttgcca aactggagga agaaatgacc acgtacgcct actattacac
      481 cgggattggt gctggtgtgc tcatagttgc ctacatccag gtttcatttt ggtgcctggc
      541 agctggaaga cagatacaca agatcaggca gaagtttttt catgctataa tgaatcagga
      601 gataggctgg tttgatgtgc atgacgttgg ggagctcaac acccggctca cagatgatgt
      661 ttccaaaatt aatgaaggaa ttggtgacaa aatcggaatg ttcttccagg caatggcaac
      721 attttttggt ggttttataa taggatttac ccgtggctgg aagctaaccc ttgtgatttt
      781 ggccatcagc cctgttcttg gactgtcagc tggtatttgg gcaaagatat tgtcttcatt
      841 tactgataag gaactccatg cttatgcaaa agctggagca gttgctgaag aagtcttagc
      901 agccatcaga actgtgattg cgtttggagg acaaaagaag gaacttgaaa ggtacaataa
      961 caacttggaa gaagctaaaa ggctggggat aaagaaagct atcacggcca acatctccat
     1021 gggtgcagct tttctcctta tctatgcatc atatgctctg gcattctggt atgggacttc
     1081 cttggtcatc tccaaagaat actctattgg acaagtgctc actgtcttct tttccgtgtt
     1141 aattggagca ttcagtgttg gacaggcatc tccaaatatt gaagccttcg ccaatgcacg
     1201 aggagcagct tatgaagtct tcaaaataat tgataataag cccagtatag acagcttctc
     1261 aaagagtggg cacaaaccag acaacataca aggaaatctg gaatttaaga atattcactt
     1321 cagttaccca tctcgaaaag aagttcagat cttgaagggc ctcaatctga aggtgaagag
     1381 cggacagacg gtggccctgg ttggcaacag tggctgtgga aaaagcacaa ctgtccagct
     1441 gatgcaaagg ctctacgacc ccctagatgg catggtcagt atcgacggac aggacatcag
     1501 aaccatcaat gtgaggtatc tgagggagat cattggtgtg gtgagtcagg aacctgtgct
     1561 gtttgccacc acgatcgccg agaacattcg ctatggccga gaagatgtca ccatggatga
     1621 gattgagaaa gctgtcaagg aagccaatgc ctatgacttc atcatgaaac tgccccacca
     1681 atttgacacc ctggttggtg agagaggggc gcacgtgagt gggggacaga aacagagaat
     1741 cgccattgcc cgggccctgg tccgcaatcc caagatcctt ttgttggacg aggccacctc
     1801 agccctggat acagaaagtg aagctgtggt tcaggccgca ctggataagg ctagagaagg
     1861 ccggaccacc attgtgatag ctcatcgctt gtctaccgtt cgtaatgctg acgtcattgc
     1921 tggttttgat ggtggtgtca ttgtggagca aggaaatcat gatgagctca tgagagaaaa
     1981 gggcatttac ttcaaacttg tcatgacaca gacagcagga aatgaaattg aattaggaaa
     2041 tgaagcttgt aaatctaagg atgaaattga taatttagac atgtcttcaa aagattcagg
     2101 atccagtcta ataagaagaa gatcaactcg caaaagcatc tgtggaccac atgaccaaga
     2161 caggaagctt agtaccaaag aggccctgga tgaagatgta cctccagctt ccttttggcg
     2221 gatcctgaag ttgaattcaa ctgaatggcc ttattttgtg gttggtatat tctgtgccat
     2281 aataaatgga ggcttacagc cagcattctc cgtaatattt tcaaaagttg taggggtttt
     2341 tacaaatggt ggcccccctg aaacccagcg gcagaacagc aacttgtttt ccttgttgtt
     2401 tctgatcctt gggatcattt ctttcattac attttttctt cagggcttca catttggcaa
     2461 agctggagag atcctcacca agcgactccg atacatggtt ttcaaatcca tgctgagaca
     2521 ggatgtgagc tggtttgatg accctaaaaa caccaccgga gcactgacca ccaggctcgc
     2581 caacgatgct gctcaagtga aaggggctac agggtctagg cttgctgtga ttttccagaa
     2641 catagcaaat cttgggacag gaatcatcat atccctaatc tatggctggc aactaacact
     2701 tttactctta gcaattgtac ccatcattgc gatagctgga gtggttgaaa tgaaaatgtt
     2761 gtctggacaa gcactgaaag ataagaagga actagaaggt tctggaaaga ttgctacgga
     2821 agcaattgaa aacttccgca ctgttgtctc tttgactcgg gagcagaagt ttgaaaccat
     2881 gtatgcccag agcttgcaga taccatacag aaatgcgatg aagaaagcac acgtgtttgg
     2941 gatcacgttc tccttcaccc aggccatgat gtatttttct tatgctgctt gtttccggtt
     3001 cggtgcctac ttggtgacac aacaactcat gacttttgaa aatgttctgt tagtattctc
     3061 agctattgtc tttggtgcca tggcagtggg gcaggtcagt tcattcgctc ctgactatgc
     3121 gaaagcaaca gtgtcagcat cccacatcat caggatcatt gagaaaaccc ccgagattga
     3181 cagctacagc acgcaaggcc taaagccgaa tatgttggaa ggaaatgtgc aatttagtgg
     3241 agtcgtgttc aactatccca cccgacccag catcccagtg cttcaggggc tgagccttga
     3301 ggtgaagaag ggccagacgc tggccctggt gggcagcagt ggctgcggga agagcacagt
     3361 ggtccagctg ctcgagcgct tctacgaccc catggctgga tcagtgtttc tagatggcaa
     3421 agaaataaag caactgaatg tccagtggct ccgagcacag ctgggcattg tgtcccaaga
     3481 gcccattctc tttgactgca gcatcgcaga gaacattgcc tacggagaca acagccgggt
     3541 cgtgtcttat gaggagattg tgagggcagc caaggaggcc aacatccacc agttcatcga
     3601 ctcgctacct gataaataca acaccagagt aggagacaaa ggcactcagc tgtcgggtgg
     3661 gcagaagcag cgcatcgcca tcgcacgcgc cctcgtcaga cagcctcaca ttttacttct
     3721 ggacgaagca acatcagctc tggatacaga aagtgaaaag gttgtccagg aagcgctgga
     3781 caaagccagg gaaggccgca cctgcattgt gatcgctcac cgcctgtcca ccatccagaa
     3841 cgcggacttg atcgtggtga ttcagaacgg caaggtcaag gagcacggca cccaccagca
     3901 gctgctggcg cagaagggca tctacttctc aatggtcagt gtgcaggctg gagcaaagcg
     3961 ctcatgaact gtgaccatgt aagatgttaa gtatttttat tgtttgtatt catatatggt
     4021 gtttaatcca agtcaaaagg aaaacactta ctaaaatagc cagttatcta ttttctgcca
     4081 cagtggaaag catttagttt ggtttagagt cttcagaggc tttgtaatta aaaaaacaaa
     4141 aatagataca gcatcaaatg gagattaatg ctttaaaatg cactataaaa tttataaaag
     4201 ggttaaaagt gaatgtttga taatatatac ttttatttat actttctcat ttgtaactat
     4261 aactgatttc tgcttaacaa attatgtatg tatcaaaaat tactgaaatg tttgtataaa
     4321 gtatatatag tgaaactgag cattcatatt tttgagttat tttgctcaaa tgcatgcgaa
     4381 attatatatt gtcccaactg ggatattgta cataatttta gcctttaaaa aacagtccat
     4441 tactgggggg agggggcatc actctatggg caaagtgtta ctcagacatg ggcacctgag
     4501 ttcagatccc taccacctaa gtaagcagac aaggtgtggt gtttttgtaa tgccagtgct
     4561 agaggcagaa aagacagatc ctgcaggctc agtggctggc caaacagcct agccaacata
     4621 gcgcgttcca ggttcagtga gaaaacttgt ctcaaaaatc agagggaaaa gcaaatgagg
     4681 tgtcagccat gtgcactcat gcaaatgcca tacatgcaga agtatgtgca cacacacgca
     4741 cacattaacc aacgactagc aaggaaaatg aaggtggata agaggggtgg gactgggaca
     4801 aaggagggta cctggatgaa tatgactgaa ggacgttatg tacacatatg aaaacgtcgt
     4861 actgaaactc actacaatgt atacttaata tattgctaat aaaatatttt taaaagaaaa
     4921 aaat
//
LOCUS       RICCPCTA     2526 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Rice chloroplast beta and epsilon subunit (atpB and atpE) genes,
            complete cds.
ACCESSION   M31464 Y00323
KEYWORDS    atpB protein; atpE protein.
SOURCE      Rice chloroplast DNA, clone Ct-3.
  ORGANISM  Chloroplast Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE   1  (bases 1 to 2526)
  AUTHORS   Moon,E., Kao,T.-h. and Wu,R.
  TITLE     Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters
            from rice
  JOURNAL   Nucleic Acids Res. 15, 4358-4359 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        398     1894     atpB protein
    pept       1891     2304     atpE protein
BASE COUNT      770 a    459 c    563 g    734 t
ORIGIN      
        1 cccccttttc ttattttgag tccaaatacc taaatactac gaaaattctc tgttgacagc
       61 aatctatgct tcacagtagt atatattttg tatatcgaag tcctagataa gaaagtagag
      121 taggcacaaa tcgtttacaa aaggcaaaat gtatatgaaa aaaagattga ttgaactttc
      181 cgacgggctc attccatgag taaacgattg aatgggattc gtttgggcaa cgaaatcaag
      241 tgctggtccc cttttctctc ttattgaatt aactaattca tttccttttg acttttggga
      301 tttttggata tttttttggt gttgatttgg cattattcaa caagaaaaaa atcaaaattt
      361 cgataaattc cttttttttg aaaattatgt gataattatg agaaccaatc ctactacttc
      421 tcgtcccggg gtttctacaa ttgaagaaaa aagtacaggg cgtatcgatc aaattattgg
      481 acccgtgctg gatgtcactt ttcccccggg caagttacct tatatttata atgctttggt
      541 agtcaagagt cgagacactg agggtaagca aattaatgta acttgtgagg tacaacaatt
      601 attaggaaat aatcgagtta gagctgtagc tatgagtgct acagatgggt tgatgagagg
      661 aatggaagtg attgacacgg gagctcctct cagtgttcct gtcggtggag ctactcttgg
      721 acgaattttc aacgttcttg gggagcctgt tgacaatttg ggtcctgtag atactagtgc
      781 aacattccct attcatagat ccgcgcccgc ctttatcgag ttagatacga aattatccat
      841 ctttgaaact ggtattaagg tggtcgatct tttagctcct tatcggcgtg gaggaaaaat
      901 cggactattt gggggagctg gagtaggtaa aacagtactc atcatggaat taatcaacaa
      961 tattgctaaa gctcacgggg gcgtatccgt atttggcgga gtaggggaac ggactcgtga
     1021 aggaaatgat ctttatatgg aaatgaagga atctggagta attaatgaaa aaaatcttga
     1081 ggaatcaaag gtagctctag tctatggcca aatgaatgaa ccgccaggag ctcgtatgag
     1141 agttggtttg actgccctaa ctatggcaga atatttccga gatgttatta agcaagacgt
     1201 gcttctattc atcgataata tctttcgttt tgttcaagca ggatcggagg tatctgcctt
     1261 attagggaga atgccctctg cagtgggtta tcaacctact cttagtacag aaatgggttc
     1321 tttgcaagaa agaattactt ctactaaaaa gggatctata acttcgatcc aagcggttta
     1381 tgtacctgcg gacgatttga ccgaccctgc tcctgctaca acatttgcac atttggatgc
     1441 tactaccgta ctttccagag gattagcttc caaagggatt tatcctgcag tagatccttt
     1501 agattcaacc tcaactatgt tacaacctcg gatcgttggc aacgaacatt atgaaactgc
     1561 gcaaagagtt aagcaaactt tacaacgtta caaagaactt caggacatta tcgcaattct
     1621 tgggttggat gaattatcgg aggaggatcg tttaactgta gcaagagcac gaaaaattga
     1681 gcgcttctta tcacaaccgt tttttgtggc agaagttttt accggttctc caggaaagta
     1741 tgttggtctt gcagaaacta ttaggggatt tcaactaatc ctttccggag aattagacgg
     1801 cctacccgaa caggcttttt atttggtggg taacatcgat gaagctagca cgaaagctat
     1861 aaacttagaa gaggagaaca acttgaagaa atgaaattaa atctttatgt actgactcct
     1921 aagcgaatta tttgggattg tgaagtgaaa gaaatcattt tatctactaa tagtggccaa
     1981 attggcgtat taccaaacca cgcccccatt aacacagctg tagatatggg tcccttgaga
     2041 atacgcctcc tcaacgatca atggttaacg gcggttctgt ggagcggttt tgccagaata
     2101 gttaataatg agatcatcat tttaggaaat gatgcggaac tgggtagtga cattgatccg
     2161 gaagaagctc aacaggcact tgaaatagcc gaagctaacg tgagtagagc tgagggtacg
     2221 aaagaattgg ttgaagcgaa ggtagctctc agacgagcta ggatacgagt cgaggctgtt
     2281 aattggattc ccccatctaa ttgaagacaa cccaacggtt tagttgatac aaagaaaaag
     2341 ggaagagggg tagaaaaaat tattagatag cgaagcgaag tagggccaat gctatctagt
     2401 aatttttcta cctacctacc tactattgga tttgaaccaa tgactcccgc cgtatgaaag
     2461 caatactcta accactgagt taagtaggca atttatcacc acaaaggaag accctttact
     2521 tcgatc
//
LOCUS       RICCPCTB     2524 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Rice mitochondrial beta and epsilon subunit (atpB and atpE)
            pseudogenes, complete cds.
ACCESSION   M31465 Y00323
KEYWORDS    pseudogene.
SOURCE      Rice chloroplast DNA, clone Ct-1.
  ORGANISM  Chloroplast Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE   1  (bases 1 to 2524)
  AUTHORS   Moon,E., Kao,T.-h. and Wu,R.
  TITLE     Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters
            from rice
  JOURNAL   Nucleic Acids Res. 15, 4358-4359 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps     398     1392     atpB pseudogene
    pept.ps    1389     2302     atpE pseudogene
BASE COUNT      769 a    458 c    564 g    733 t
ORIGIN      
        1 cccccttttc ttattttgag tccaaatacc taaatactac gaaaattctc tgttggcagc
       61 aatctatgct tcacagtagt atatattttg tatatcgaag tcctagataa gaaagtagag
      121 taggcacaaa tcgtttacaa aaggcaaaat gtatatgaaa aaaagattga ttgaactttc
      181 cgacgggctc attccatgag taaacgattg aatgggattc gtttgggcaa cgaaatcaag
      241 tgctggtccc cttttctctc ttattgaatt aactaattca tttccttttg acttttggga
      301 tttttggata tttttttggt gttgatttgg cattattcaa caagaaaaaa atcaaaattt
      361 cgataaattc cttttttttg aaaattatgt gataattatg agaaccaatc ctactacttc
      421 tcgtcccggg gtttctacaa ttgaagaaaa aagtacaggg cgtatcgatc aaattattgg
      481 acccgtgctg gatgtcactt ttcccccggg caagttacct tatatttata atgctttggt
      541 agtcaagagt cgagacactg agggtaagca aattaatgta acttgtgagg tacaacaatt
      601 attaggaaat aatcgagtta gagctgtagc tatgagtgct acagatgggt tgatgagagg
      661 aatggaagtg attgacacgg gagctcctct cagtgttcct gtcggtggag ctactcttgg
      721 acgaattttc aacgttcttg gggagcctgt tgacaatttg ggtcctgtag atactagtgc
      781 aacattccct attcatagat ccgcgcccgc ctttatcgag ttagatacga aattatccat
      841 ctttgaaact ggtattaagg tggtcgatct tttagctcct tatcggcgtg gaggaaaaat
      901 cggactattt gggggagctg gagtaggtaa aacagtactc atcatggaat taatcaacaa
      961 tattgctaaa gctcacgggg gcgtatccgt atttggcgga gtaggggaac ggactcgtga
     1021 aggaaatgat ctttatatgg aaatgaagga atctggagta attaatgaaa aaaatcttga
     1081 ggaatcaaag gtagctctag tctatggcca aatgaatgaa ccgccaggag ctcgtatgag
     1141 agttggtttg actgccctaa ctatggcaga atatttccga gatgttatta agcaagacgt
     1201 gctctattca tcgataatat ctttcgtttt gttcaagcag gatcggaggt atctgcctta
     1261 ttagggagaa tgccctctgc agtgggttat caacctactc ttagtacaga aatgggttct
     1321 ttgcaagaaa gaattacttc tactaaaaag ggatctataa cttcgatcca agcggtttat
     1381 gtacctgcgg acgatttgac cgaccctgct cctgctacaa catttgcaca tttggatgct
     1441 actaccgtac tttccagagg attagcttcc aaagggattt atctgcagta gatcctttag
     1501 attcaacctc aactatgtta caacctcgga tcgttggcaa cgaacattat gaaactgcgc
     1561 aaagagttaa gcaaacttta caacgttaca aagaacttca ggacattatc gcaattcttg
     1621 ggttggatga attatcggag gaggatcgtt taactgtagc aagagcacga aaaattgagc
     1681 gcttcttatc acaaccgttt tttgtggcag aagtttttac cggttctcca ggaaagtatg
     1741 ttggtcttgc agaaactatt aggggatttc aactaatcct ttccggagaa ttagacggcc
     1801 tacccgaaca ggctttttat ttggtgggta acatcgatga agctagcacg aaagctataa
     1861 acttagaaga ggagaacaac ttgaagaaat gaaattaaat ctttatgtac tgactcctaa
     1921 gcgaattatt tgggattgtg aagtgaaaga aatcatttta tctactaata gtggccaaat
     1981 tggcgtatta ccaaaccacg cccccattaa cacagctgta gatatgggtc ccttgagaat
     2041 acgcctcctc aacgatcaat ggttaacggc ggttctgtgg agcggttttg ccagaatagt
     2101 taataatgag atcatcattt taggaaatga tgcggaactg ggtagtgaca ttgatccgga
     2161 agaagctcaa caggcacttg aaatagccga agctaacgtg agtagagctg agggtacgaa
     2221 agaattggtt gaagcgaagg tagctctcag acgagctagg atacgagtcg aggctgttaa
     2281 ttggattccc ccatctaatt gaagacaacc caacggttta gttgatacaa agaaaaaggg
     2341 aagaggggta gaaaaaatta ttagatagcg aagcgaagta gggccaatgc tatctagtaa
     2401 tttttctacc tacctaccta ctattggatt tgaaccaatg actcccgccg tatgaaagca
     2461 atactctaac cactgagtta agtaggcaat ttatcaccac aaaggaagac cctttacttc
     2521 gatc
//
LOCUS       RICMTBEA     2281 bp ds-DNA             ORG       01-AUG-1990
DEFINITION  Rice mitochondrial beta and epsilon subunit (atpB and atpE)
            pseudogene, complete cds.
ACCESSION   M31466 Y00323
KEYWORDS    pseudogene.
SOURCE      Rice mitochondrion DNA, clone Mt-0.
  ORGANISM  Mitochondrion Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE   1  (bases 1 to 2281)
  AUTHORS   Moon,E., Kao,T.-h. and Wu,R.
  TITLE     Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters
            from rice
  JOURNAL   Nucleic Acids Res. 15, 4358-4359 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps     384     1657     atpB pseudogene
    pept.ps    1654     2067     atpE pseudogene
BASE COUNT      710 a    416 c    488 g    667 t
ORIGIN      
        1 cccctttctt attttgagtc caaataccta aatactatga aaattctctg ttgacagcaa
       61 tctatgcttc acagtagtat atattttgta tatcgaagtc ctagataaga aatggagtag
      121 gcacagatcc ttcacaaaag gcgaaatgta tatgaaaaaa agattgattg aactttccga
      181 cggactcatg gaatgagtaa acgattgaat gggattcgtt tgggcaacga aatcaagtgc
      241 tggtcccctt ttctctctta ttgaattaac taattcattt ccttttgact tttgttggat
      301 ttttggatat ttttttggtg ttgatttggc attattcaac aagataaaaa gaaaaatttc
      361 tataaattcc ttttttttta attatgagaa ccaatcctac tacttctcat cccggggttt
      421 ctacaattga agaaaaaagt acagggcgta tcgatcaaat tattggaccc gtgctggatg
      481 ccacttttcc cccgggcaag ttaccttata tttataacgc tttggtagtc gagacactga
      541 gggtaagcaa attaatgtga cttgtgaggt acaacaatta ttaggaaata atcgagttag
      601 aacgaaatta tccatctttg aaactggtat taaggtggtc gatcttttag ctccttatcg
      661 gcgtggagga aaaatcggac tatttggggg aactggagta ggtaaaacag tactcatcat
      721 ggaattaatc aacaatattg ctaaagctca tagaggcgta tccgtatttg gcggagtagg
      781 ggaacggact cgtgaaggaa atgatcttta tatggaaata aaggagtaat taatgaaaaa
      841 aatccttgag gaatcaaagg tagctctagt ctatggccaa atgaatgaac gccaggagct
      901 cgtatgagag ttggtttgac tgccctaact atggcagaat atttccgaga tgttattaag
      961 caagacgtgc ttctattcat cgataatatc tttcgttttg ttcaagcagg atcgggggta
     1021 tttgccttat tagggagaat gccctctgca gtgggttatc aacctactct tagtacagaa
     1081 atgggttctt tgcaagaaag aattacttct actaaaaagg gatctataac ttcgatccaa
     1141 gcggtttatg tacctgcgga cgatttgacc gaccctgctc ctgccacaac atttgcacat
     1201 ttggatgcta ctaccgtact ttccagagga ttagcttcca agggtattta tcctagatcc
     1261 tttagattca acctcaacta tgttacaacc tcggatcgtt ggcaacgaac attatgaaac
     1321 tgcgcaaaga gttaagcaaa ctttacaacg ttacaaagaa cttcaggaca ttatcgcaat
     1381 tcttgggttg gatgaattat cggaggagga tcgtttaact gtagcaagag cacgaaaaat
     1441 tgagcgcttc ctatcacaac cgttctttgt ggcagaagtt tttaccggtt ctccaggaaa
     1501 gtatgttggt cttgcagaaa caattcgggg atttcaacta atcctttccg gagaattaga
     1561 cggcctaccc gaacaggctt tttatttggt gggtaacatc gatgaagcta gcacgaaagc
     1621 tataaactta gaagaggaaa acaacttgaa gaaatgaaat taaatcttta tgtactgact
     1681 cctaagcgaa ttatttggga ttgtgaagtg aaagaaatca ttttttctac taatagtggc
     1741 caaattggcg tattaccaaa ccacgccccc attaacacag ctgtagatat gggtcccttg
     1801 agaatacgcc tcctcaacga tcaatggtta acggcggttc tgtggagcgg ttttgccaga
     1861 atagttaata atgagatcat cattttagga aatgatgcgg aactgggtag tgacattgat
     1921 ccggaagaag ctcaacaggc acttgaaata gccgaagcta acgtgagtag agctgagggt
     1981 acgaaagaat tggttgaagc gaacgtagct ctcagacgag ctgggatacg agtcgaggct
     2041 gttaattgga ttcccccatc taattgaaga caatccaacg gtttagttga tacaaagaaa
     2101 aagggtctaa aaagttatta gatagcgaag cgaagtaagt ccaatgctat ctagtaattt
     2161 ttctacctac ctacctacta ttggatttga accaatgact cccgccgtat gaaagcaata
     2221 ctctaaccac tgagttaagt aggcaattta tcaccacaaa ggaagaccct ttacttcgat
     2281 c
//

GenBank-Updates@genbank.bio.net (08/02/90)

LOCUS       ECOALPHOA     171 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33724
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 171)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Chang,C.N., Kuang,W.-J. and Chen,E.Y.
  TITLE     Nucleotide sequence of the alkaline phosphatase gene of Escherichia
            coli
  JOURNAL   Gene 44, 121-125 (1986)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    site          1      141     phoA coding sequence /note="upstream sequence
                                 of psiA44:Mu dI" /evidence=EXPERIMENTAL
                                 [1],[2]
    site        141      142     Mu dI insertion site /evidence=EXPERIMENTAL
                                 /note="psiA44:Mu dI insertion site" [1]
    trns        142  >   171     Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT       49 a     45 c     51 g     26 t
ORIGIN      
        1 caaagctccg ggcctcaccc aggcgctaaa taccaaagat ggcgcagtga tggtgatgag
       61 ttacgggaac tccgaagagg attcacaaga acataccggc agtcagttgc gtattgcggc
      121 gtatggcccg catgccgcca atgaagcggc gcacgaaaaa cgcgaaagcg t
//
LOCUS       ECOALPHOB     183 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33725
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 183)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Chang,C.N., Kuang,W.-J. and Chen,E.Y.
  TITLE     Nucleotide sequence of the alkaline phosphatase gene of Escherichia
            coli
  JOURNAL   Gene 44, 121-125 (1986)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    site          1      153     phoA coding sequence /note="upstream sequence
                                 of psiA10:Mu dI" /evidence=EXPERIMENTAL
                                 [1],[2]
    site        153      154     Mu dI insertion site /evidence=EXPERIMENTAL
                                 /note="psiA10::Mu dI insertion site" [1]
    trns        154  >   183     Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT       55 a     39 c     39 g     50 t
ORIGIN      
        1 ctgtcataaa gttgtcacgg ccgagactta tagtcgcttt gtttttattt tttaatgtat
       61 ttgtacatgg agaaaataaa gtgaaacaaa gcactattgc actggcactc ttaccgttac
      121 tgtttacccc tgtgacaaaa gcccggacac cagtgaagcg gcgcacgaaa aacgcgaaag
      181 cgt
//
LOCUS       ECOALPHOD     242 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33726
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 242)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Chang,C.N., Kuang,W.-J. and Chen,E.Y.
  TITLE     Nucleotide sequence of the alkaline phosphatase gene of Escherichia
            coli
  JOURNAL   Gene 44, 121-125 (1986)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    site          1      212     phoA coding sequence /note="upstream sequence
                                 of psiA40" /evidence=EXPERIMENTAL [1],[2]
    site        212      213     Mu dI insertion site /evidence=EXPERIMENTAL
                                 /note="psiA40:Mu dI insertion site" [1]
    trns        213  >   242     Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT       59 a     61 c     64 g     58 t
ORIGIN      
        1 attgcactgg cactcttacc gttactgttt acccctgtga caaaagcccg gacaccagaa
       61 atgcctgttc tggaaaaccg ggctgctcag ggcgatatta ctgcacccgg cggtgctcgc
      121 cgtttaacgg gtgatcagac tgccgctctg cgttattctc ttagcgataa acctgcaaaa
      181 aatattattt tgctgattgg cgatgggatg ggtgaagcgg cgcacgaaaa acgcgaaagc
      241 gt
//
LOCUS       ECOALPHOE      97 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33727
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 97)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Chang,C.N., Kuang,W.-J. and Chen,E.Y.
  TITLE     Nucleotide sequence of the alkaline phosphatase gene of Escherichia
            coli
  JOURNAL   Gene 44, 121-125 (1986)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       67     phoA coding sequence /note="upstream sequence
                                 of psiA5::Mu dI. psiA30 upstream region is
                                 identical" /evidence=EXPERIMENTAL [1],[2]
    site         67       68     Mu dI insertion site /evidence=EXPERIMENTAL
                                 /note="psiA5:Mu dI insertion site; psiA30
                                 upstream region is identical" [1]
    trns         68  >    97     Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT       34 a     13 c     22 g     28 t
ORIGIN      
        1 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt
       61 gcactggtga agcggcgcac gaaaaacgcg aaagcgt
//
LOCUS       ECOALPHOF      82 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33728
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 82)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T.
  TITLE     Molecular biology of carbon-phosphorus bond cleavage: Cloning and
            sequencing of the phn (psiD) genes involved in alkylphosphonate
            uptake and C-P lyase activity in Escherichia coli B
  JOURNAL   J. Biol. Chem. 265, 4461-4471 (1990)
  STANDARD  full staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       52     phn operon sequence /note="upstream sequence
                                 of psiD33; psiD41 is identical"
                                 /evidence=EXPERIMENTAL [1],[2]
    site         52       53     Mu dI insertion site /evidence=EXPERIMENTAL
                                 /note="psiD33:Mu dI insertion site; psiD41 is
                                 identical" [1]
    trns         53  >    82     Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT       28 a     19 c     22 g     13 t
ORIGIN      
        1 attgaggaaa acgaatgaac gctaagataa ttgcctcgct ggccttcacc agtgaagcgg
       61 cgcacgaaaa acgcgaaagc gt
//
LOCUS       ECOALPHOG      94 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33729
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 94)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Chang,C.N., Kuang,W.-J. and Chen,E.Y.
  TITLE     Nucleotide sequence of the alkaline phosphatase gene of Escherichia
            coli
  JOURNAL   Gene 44, 121-125 (1986)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       64     phoA coding sequence /note="upstream sequence
                                 of psiA4" /evidence=EXPERIMENTAL [1],[2]
    site         65       65     Mu dI insertion site /evidence=EXPERIMENTAL
                                 /note="psiA4:Mu dI insertion site" [1]
    trns         65  >    94     Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT       35 a     18 c     22 g     19 t
ORIGIN      
        1 aatgtatttg tacatggaga aaataaagtg aaacaaagca ctattgcact ggcactctta
       61 ccgttgaagc ggcgcacgaa aaacgcgaaa gcgt
//
LOCUS       ECOALPHOH     233 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33730
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 233)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T.
  TITLE     Molecular biology of carbon-phosphorus bond cleavage: Cloning and
            sequencing of the phn (psiD) genes involved in alkylphosphonate
            uptake and C-P lyase activity in Escherichia coli B
  JOURNAL   J. Biol. Chem. 265, 4461-4471 (1990)
  STANDARD  full staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    site          1      203     phn operon sequence /note="upstream sequence
                                 of psiD31" /evidence=EXPERIMENTAL [1],[2]
    site        203      204     Mu dI insertion site /evidence=EXPERIMENTAL
                                 /note="Mu dI insertion site into phnD gene
                                 (psiD31::Mu dI)" [1]
    trns        204  >   233     Mu dI-R /evidence=EXPERIMENTAL [1]
    variant     201      201     g in E.coli; a in E.coli B
    variant     174      174     c in E.coli; g in E.coli B
    variant     141      141     g in E.coli; a in E.coli B
    variant     135      135     g in E.coli; a in E.coli B
    variant      76       76     t in E.coli; c in E.coli B
    variant      51       51     g in E.coli; a in E.coli B
BASE COUNT       62 a     64 c     66 g     41 t
ORIGIN      
        1 tcgctggcct tcaccagcat gttcagcctc agcaccctgt taagcccggc gcacgccgaa
       61 gagcaggaaa aggcgttgaa tttcggcatt atttcaacgg aatcacagca aaacctgaaa
      121 ccgcaatgga cgccgttctt gcaggatatg gagaagaagc tgggcgtgaa ggtcaacgcc
      181 ttctttgccc cggactacgc gggtgaagcg gcgcacgaaa aacgcgaaag cgt
//
LOCUS       ECOALPHOJ      88 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33732
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Chang,C.N., Kuang,W.-J. and Chen,E.Y.
  TITLE     Nucleotide sequence of the alkaline phosphatase gene of Escherichia
            coli
  JOURNAL   Gene 44, 121-125 (1986)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       58     phoA coding sequence /note="upstream sequence
                                 of psiA12; upstream sequence of psiA13::Mu dI
                                 is identical" /evidence=EXPERIMENTAL [1],[2]
    site         58       59     Mu dI insertion site /evidence=EXPERIMENTAL
                                 /note="psiA12:Mu dI insertion site; psiA13::Mu
                                 dI is identical" [1]
    trns         59  >    88     Mu dI-R /evidence=EXPERIMENTAL [1]
BASE COUNT       33 a     13 c     21 g     21 t
ORIGIN      
        1 ttttttaatg tatttgtaca tggagaaaat aaagtgaaac aaagcactat tgcactggtg
       61 aagcggcgca cgaaaaacgc gaaagcgt
//
LOCUS       ECOALPHOK      66 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33733
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Chang,C.N., Kuang,W.-J. and Chen,E.Y.
  TITLE     Nucleotide sequence of the alkaline phosphatase gene of Escherichia
            coli
  JOURNAL   Gene 44, 121-125 (1986)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       36     psiF (orf-106) coding sequence /pseudo
                                 /evidence=EXPERIMENTAL [1],[2]
    trns         37  >    66     Mu dI-R
BASE COUNT       26 a     17 c     15 g      8 t
ORIGIN      
        1 cgcgaatgca ataatcaagc aacacaacaa tctctgtgaa gcggcgcacg aaaaacgcga
       61 aagcgt
//
LOCUS       ECOALPHOL      67 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33734
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 67)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       37     psiR43 coding sequence /pseudo
    trns         38  >    67     Mu dI-R
BASE COUNT       23 a     18 c     17 g      9 t
ORIGIN      
        1 ctctaattag gacaacagaa tcactcggta ccggacctga agcggcgcac gaaaaacgcg
       61 aaagcgt
//
LOCUS       ECOALPHOM     106 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33735
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 106)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Overduin,P., Boos,W. and Tommassen,J.
  TITLE     Nucleotide sequence of the ugp genes of e. coli k-12: homology to
            the maltose system
  JOURNAL   Mol. Microbiol. 2, 767-775 (1988)
  STANDARD  simple staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       76     ugpB coding sequence /pseudo
                                 /evidence=EXPERIMENTAL [1],[2]
    trns         77  >   106     Mu dI-R
BASE COUNT       41 a     26 c     22 g     17 t
ORIGIN      
        1 gcccaacgtt ttaacgccga aaacccggat tacaaaattg taccgaccta taaaggcaac
       61 tacgaacaga atttaatgaa gcggcgcacg aaaaacgcga aagcgt
//
LOCUS       ECOALPHON      67 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33736
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 67)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       37     psiH6 coding sequence /pseudo [1]
    trns         38  >    67     Mu dI-R
BASE COUNT       22 a     14 c     17 g     14 t
ORIGIN      
        1 ggcatcatga ctatcgtaag tttgatctat cacatagtga agcggcgcac gaaaaacgcg
       61 aaagcgt
//
LOCUS       ECOALPHOO      82 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33737
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 82)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Oliver,G., Gosset,G., Sanchez-Pescador,R., Lozoya,E., Ku,L.M.,
            Flores,N., Becerril,B., Valle,F. and Bolivar,F.
  TITLE     Determination of the nucleotide sequence for the glutamate synthase
            structural genes of Escherichia coli K-12
  JOURNAL   Gene 60, 1-11 (1987)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       52     gltD coding sequence /pseudo
                                 /evidence=EXPERIMENTAL [1],[2]
    trns         53  >    82     Mu dI-R
BASE COUNT       26 a     21 c     28 g      7 t
ORIGIN      
        1 gaacatgccg ggttcccgcc gcgaagtgaa aaacgcccgg gaagaaggcg tatgaagcgg
       61 cgcacgaaaa acgcgaaagc gt
//
LOCUS       ECOALPHOP     116 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33738
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 116)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       86     psiK20 coding sequence /pseudo [1]
    trns         87  >   116     Mu dI-R
BASE COUNT       31 a     26 c     34 g     25 t
ORIGIN      
        1 gcgactcaga gagatcagca agtcagcact accgaccgtg ggcagttttc aaaaggtcat
       61 cggtagtagt ttttgcgttc tgcgtttgaa gcggcgcacg aaaaacgcga aagcgt
//
LOCUS       ECOALPHOQ      70 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33739
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Cole,S.T., Eiglmeier,K., Ahmed,S., Honore,N., Elmes,L.,
            Anderson,W.F. and Weiner,J.H.
  TITLE     Nucleotide sequence and gene-polypeptide relationships of the
            glpABC operon encoding the anaerobic sn-glycerol-3-phosphate
            dehydrogenase of Escherichia coli K-12
  JOURNAL   J. Bacteriol. 170, 2448-2456 (1988)
  STANDARD  simple staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       40     glpB coding sequence /pseudo [1],[2]
    trns         41  >    70     Mu dI-R
BASE COUNT       19 a     20 c     21 g     10 t
ORIGIN      
        1 acaacgcgtg ctcgatctcg cttgccaggc gcaggcatta tgaagcggcg cacgaaaaac
       61 gcgaaagcgt
//
LOCUS       ECOALPHOR      78 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33740
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 78)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Oliver,G., Gosset,G., Sanchez-Pescador,R., Lozoya,E., Ku,L.M.,
            Flores,N., Becerril,B., Valle,F. and Bolivar,F.
  TITLE     Determination of the nucleotide sequence for the glutamate synthase
            structural genes of Escherichia coli K-12
  JOURNAL   Gene 60, 1-11 (1987)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       48     gltB coding sequence /pseudo
                                 /evidence=EXPERIMENTAL [1],[2]
    trns         49  >    78     Mu dI-R
BASE COUNT       20 a     21 c     24 g     13 t
ORIGIN      
        1 cgaactgatg gttatcgaca cccgcagtgg gcgtattctg cactcggctg aagcggcgca
       61 cgaaaaacgc gaaagcgt
//
LOCUS       ECOALPHOS      87 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33741
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 87)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       57     paiJ22 coding sequence /pseudo [1]
    trns         58  >    87     Mu dI-R
BASE COUNT       28 a     20 c     22 g     17 t
ORIGIN      
        1 agactatgag acggaatgaa gaactgttca tcacattctt tattccgagc cgctacgtga
       61 agcggcgcac gaaaaacgcg aaagcgt
//
LOCUS       ECOALPHOT      81 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33742
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 81)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Davis,E.O. and Henderson,P.J.F.
  TITLE     The cloning and DNA sequence of the gene xylE for xylose-proton
            symport in Escherichia coli K12
  JOURNAL   J. Biol. Chem. 262, 13928-13932 (1987)
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       51     psiE coding sequence /pseudo [1],[2]
    trns         52  >    81     Mu dI-R
BASE COUNT       25 a     19 c     24 g     13 t
ORIGIN      
        1 gtgcatctgg ctgatgtgct gttcgcgcca gaacaaacca gcaaatatga gtgaagcggc
       61 gcacgaaaaa cgcgaaagcg t
//
LOCUS       ECOALPHOU      69 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33743
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 69)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Oliver,G., Gosset,G., Sanchez-Pescador,R., Lozoya,E., Ku,L.M.,
            Flores,N., Becerril,B., Valle,F. and Bolivar,F.
  TITLE     Determination of the nucleotide sequence for the glutamate synthase
            structural genes of Escherichia coli K-12
  JOURNAL   Gene 60, 1-11 (1987)
  STANDARD  simple staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       39     gltB coding sequence /evidence=EXPERIMENTAL
                                 [1],[2]
    trns         40  >    69     Mu dI-R [1]
BASE COUNT       19 a     15 c     21 g     14 t
ORIGIN      
        1 ctttgacttt aactccatgc atatggagcc gtgggatggt gaagcggcgc acgaaaaacg
       61 cgaaagcgt
//
LOCUS       ECOALPHOV      94 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33744
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 94)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       64     psiH16 coding sequence /pseudo [1]
    trns         65  >    94     Mu dI-R [1]
BASE COUNT       19 a     26 c     29 g     20 t
ORIGIN      
        1 ccatcttagc agcgtctggt ccgggtcggc ttaggggctt cctttatgca gtactgcctg
       61 ccgatgaagc ggcgcacgaa aaacgcgaaa gcgt
//
LOCUS       ECOALPHOW      59 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Escherichia coli Mu dI insertion site.
ACCESSION   M33745
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 59)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by P.M.Steed
            13-APR-1990.
FEATURES       from  to/span     description
    site          1       29     psiJ49 coding sequence /pseudo [1]
    trns         30  >    59     Mu dI-R [1]
BASE COUNT       15 a     13 c     18 g     13 t
ORIGIN      
        1 ccgattgcgg ctgtttatgg gtatctatct gaagcggcgc acgaaaaacg cgaaagcgt
//
LOCUS       SYNFUSAAA     240 bp ds-DNA             SYN       02-AUG-1990
DEFINITION  Bacteriophage Mu fusion gene of the trp operon.
ACCESSION   M33723
KEYWORDS    .
SOURCE      Bacteriophage Mu and E.coli lacZ DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 240)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Allet,B.
  TITLE     Mu insertion duplicates a five base pair sequence at the host
            inserted site
  JOURNAL   Cell 16, 123-129 (1979)
  STANDARD  simple staff_entry
REFERENCE   3  (sites)
  AUTHORS   Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E.,
            Horowitz,H., van Cleemput,M. and Wu,A.M.
  TITLE     The complete nucleotide sequence of the tryptophan operon of
            Escherichia coli
  JOURNAL   Nucleic Acids Res. 9, 6647-6668 (1981)
  STANDARD  full staff_entry
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    signal        1      104     stem_loop, Mu dI-R [1]
    site        105      240     trp operon (partial) /note="trp operon
                                 sequence present in Mu dI is derived from the
                                 trpA-lacZ W209 protein fusion and contains
                                 last base of trpC, all of trpB and 187 bases
                                 of trpA" [1],[3]
    site          1       48     MuR derived sequence [1],[2]
    site        104       43 (c) MuR derived sequence [1],[2]
BASE COUNT       67 a     54 c     57 g     62 t
ORIGIN      
        1 tgaagcggcg cacgaaaaac gcgaaagcgt ttcacgataa atgcgaaacg taaagttttc
       61 gcatttatcg tgaaacgctt tcgcgttttt cgtgcgccgc ttcattaagg aaaggaacaa
      121 tgacaacatt acttaacccc tattttggtg agtttggcgg catgtacgtg ccacaaatcc
      181 tgatgcctgc tctgcgccag ctggaagaag cttttgtcag tgcgcaaaaa gatcctgaat
//
LOCUS       SYNFUSAAB      53 bp ds-DNA             SYN       02-AUG-1990
DEFINITION  Bacteriophage Mu fusion gene of the trp operon.
ACCESSION   M33731
KEYWORDS    .
SOURCE      Bacteriophage Mu and E.coli lacZ DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 53)
  AUTHORS   Metcalf,W.W., Steed,P.M. and Wanner,B.L.
  TITLE     Identification of phosphate-starvation-inducible genes in
            Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI)
            transcriptional fusions
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E.,
            Horowitz,H., van Cleemput,M. and Wu,A.M.
  TITLE     The complete nucleotide sequence of the tryptophan operon of
            Escherichia coli
  JOURNAL   Nucleic Acids Res. 9, 6647-6668 (1981)
  STANDARD  full staff_entry
REFERENCE   3  (sites)
  AUTHORS   Kalnins,A., Otto,K., Ruether,U. and Mueller-Hill,B.
  TITLE     Sequence of the lacZ gene of Escherichia coli
  JOURNAL   EMBO J. 2, 593-597 (1983)
  STANDARD  full staff_review
COMMENT     Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf
            13-APR-1990.
FEATURES       from  to/span     description
    site          1        9     lacZ coding sequence /note="lacZ sequence
                                 begins with third codon of LacZ protein"
                                 [1],[3]
    site         53       10 (c) trpA coding sequence
BASE COUNT       13 a     14 c     17 g      9 t
ORIGIN      
        1 atccgtaatc gccagtgggt cggagaaggg gatacctaac tccagcgcgt cag
//
LOCUS       ATH60CPNA     735 bp ss-mRNA            PLN       02-AUG-1990
DEFINITION  A.thaliana plastid 60-kDa chaperonin-60 alpha polypeptide
            (cpn-60 alpha) mRNA, partial cds.
ACCESSION   M35597
KEYWORDS    chaperonin-60.
SOURCE      A.thaliana (cv. Columbia), cDNA to mRNA, clone pTZ19R.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 735)
  AUTHORS   Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M.
  TITLE     Unique composition of plastid chaperonin-60: Alpha and beta
            polypeptide-encoding genes are highly divergent
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.M.Hemmingsen, 26-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1  >   735     60-kDa chaperonin-60 alpha-polypeptide (AA at
                                 1)
BASE COUNT      187 a    139 c    196 g    213 t
ORIGIN      
        1 aagcttgctg attgtgttgg tctcactctt ggccctagag ggaggaatgt tgtgttggat
       61 gaatttggaa gtcctaaggt tgtgaatgat ggagtcacca ttgctagagc tattgagtta
      121 cctaatgcca tggaaaatgc tggtgcagcg cttattcgtg aggttgcgag taagactaat
      181 gactctgctg gtgatgggac aaccactgcg tctatccttg ctcgggagat aatcaaacat
      241 ggattgttga gcgtcacttc tggtgcgaat cccgtttcac tcaagagggg aattgataag
      301 actgttcaag gtctgatcga agagcttcag aagaaagcta ggcccgtgaa aggtcgtgta
      361 gactacagag ctgtggcttc tatctctgct ggaaatgatg accttatagg gtcaatgatt
      421 gctgatgcca tcgacaaagt tggacctgat ggtgttttgt ccattgaatc ttcatcttct
      481 tttgagacca cggtcgaagt tgaagaaggg atggagattg acagaggtta catctcgcct
      541 cagtttgtta caaaccctga gaaactacta gctgagtttg agaatgctag ggtattgatc
      601 actgatcaga agatcactgc gatcaaagac atcatcccta ttttggagaa gaccactcag
      661 cttcgtgctc cgttgttgat tattgcagag gatgttactg gtgaagcctt agctaccctt
      721 gtcgtgaaca agctt
//
LOCUS       ATH60CPNB     559 bp ss-mRNA            PLN       02-AUG-1990
DEFINITION  A.thaliana plastid 60-kDa chaperonin-60 beta-polypeptide
            (cpn-60 beta) mRNA, partial cds.
ACCESSION   M35598
KEYWORDS    chaperonin-60.
SOURCE      A.thaliana (cv. Columbia), cDNA to mRNA, clone ptz19R.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 559)
  AUTHORS   Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M.
  TITLE     Unique composition of plastid chaperonin-60: Alpha and beta
            polypeptide-encoding genes are highly divergent
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.M.Hemmingsen, 26-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1  >   559     60-kDa chaperonin-60 beta-polypeptide (AA at
                                 1)
BASE COUNT      166 a     96 c    155 g    142 t
ORIGIN      
        1 aagcttgtga ggcaagcggc tgccaaaacc aatgacttgg ctggtgacgg tacaacaaca
       61 tctgttgttc ttgcacaagg ttttatcgct gagggtgtca aggtggtagc tgctggtgca
      121 aaccctgttt tgatcaccag aggtattgag aagacagcaa aggctttggt taccgagctc
      181 aagaaaatgt ctaaggaggt tgaagacagt gaacttgcag atgttgcagc tgttagtgcg
      241 ggtaacaacg acgaaattgg aaatatgatt gctgaagcaa tgagcaaagt gggcaggaaa
      301 ggtgtggtga cccttgagga gggtaaaagt gccgagaaca accttcacgt cgtagaagga
      361 atgcaatttg accgtggtta tatctctcct tactttgtga cagacagcga gaaaatgtcc
      421 gtcgagttcg acaattgcaa gttacttctc gttgacaaga aaattaccaa tgcaagggat
      481 cttgttggtg ttttggagga tgcaattaga ggcggatacc caatattaat tattgctgaa
      541 gacattgaac aagaagctt
//
LOCUS       BNA60CPNA    1783 bp ss-mRNA            PLN       02-AUG-1990
DEFINITION  B.napus plastid 60-kDa chaperonin-60 alpha-polypeptide
            (cpn-60 alpha) mRNA, partial cds.
ACCESSION   M35599
KEYWORDS    chaperonin-60 alpha-polypeptide.
SOURCE      B.napus (cv. Westar) plastid chaperonin-60, cDNA to mRNA.
  ORGANISM  Brassica napus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 1783)
  AUTHORS   Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M.
  TITLE     Unique composition of plastid chaperonin-60: Alpha and beta
            polypeptide-encoding genes are highly divergent
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.M.Hemmingsen, 26-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1     1638     60-kDa chaperonin-60 alpha-polypeptide
                                 precursor (AA at 1)
    sigp    <     1       21     60-kDa chaperonin-60 alpha-polypeptide signal
                                 peptide
    matp    <     1     1635     60-kDa chaperonin-60 alpha-polypeptide
BASE COUNT      460 a    395 c    486 g    442 t
ORIGIN      
        1 cgccgtttca gcgttagagc taatgtaaaa gagatctctt tcgaccagag ctcaagagct
       61 gctcttcaag ctggtattga caagcttgct gatgccgttg gtcttaccct tggccctaga
      121 ggaagaaatg ttgtgttgga tgagtttgga agtcccaagg tcgtgaatga tggagtcacc
      181 attgctaggg ccattgagtt acctgacgct atggagaatg ctggtgcagc acttatccgt
      241 gaggttgcta gtaagactaa tgactcagct ggtgacggca caaccactgc ctccgtcctt
      301 gctcgggaga taatcaaaca cggcttacta agcgtcactt ccggtgccaa tccagtctca
      361 ctcaagagag gaatcgataa gaccgttcaa gctttgatcg aagagctcga gaagagggct
      421 agacctgtca aaggcggtag tgacatcaaa gccgtggcta caatctctgc tggaaatgat
      481 gagcttgtgg gaacaatgat tgctgatgcc attgataaag ttggacctga tggtgttttg
      541 tccattgaat cttcttcctc ctttgagact accgtcgaag ttgaagaagg aatggagatt
      601 gatagaggtt acatctcacc acagtttgtt acaaaccctg agaagctcct agtcgagttc
      661 gagaacgcgc gcgtgctgat caccgatcag aagatcactg cgatcaaaga catcatcccg
      721 atcctggaga aaaccactca gctccgagct ccgttgctca tcatcgcgga ggatgtcacc
      781 ggcgaggccc tggcgaccct cgtcgtgaac aaactccgcg gcgtgctcaa cgtggtcgcc
      841 gtcaaggctc cagggtttgg tgaaagaaga aaagctatgc ttcaggacat tgccatcttg
      901 acggggccga gtacgaccct cgacatgggc cttctggtgg agaacacgac catagatcag
      961 ttggggatcg ctaggaaagt cactattagc aaagactcga ccacgcttat agctgatgcg
     1021 gcttccaagg agaccctgca agctcgtatc tcccagctta agaaggagct gtcagagact
     1081 gactccgtgt acgactcgga gaagctcgcc gagagaatgc ccaagctgcc aggcggtgtt
     1141 gccgttatta aagtcggagc ggcgactgag actgagctcg aggaccgtaa gcttcgtatc
     1201 gaggacgcga agaacgctac atttgctgca atcgaagaag ggatagttcc tggcggcggt
     1261 gccacgctgg tgcatctatc aactgtgatt cctgctatta aggagaagct tgaggatgct
     1321 gatgagcgtt tgggagccga catagtacag aaggctttgg tggcaaggtc gcttattgca
     1381 cagaacgctg gaataggagg agaagttgtt gtggagaaga ttatgttcag cgaatgggag
     1441 atagggtaca acgccatgac tgatacatat gagaatctgt tggaagctgg agtgattgat
     1501 ccagctaaag tgacgagatg tgcgcttcag aacgctgctt cggttgcggg gatggtgctg
     1561 accactcagg ccattgttgt cgacaaaccg aaacctaagg ctcctactgc tgctcctcct
     1621 cagggtctca tggtgtaatg aaatcactta tcatatggct atatttgttg tccccaatta
     1681 tatctgcaat atttttctct ggggaaatga gtagtttgtt taacggtaat gagaatatca
     1741 tctttttgat ttatgatttg caaaataaaa gtttatattc aat
//
LOCUS       BNA60CPNB    1982 bp ss-mRNA            PLN       02-AUG-1990
DEFINITION  B.napus plastid 60-kDa chaperonin-60 beta-polypeptide
            (cpn-60 beta) mRNA, partial cds.
ACCESSION   M35600
KEYWORDS    chaperonin-60 beta-polypeptide.
SOURCE      B.napus (cv. Westar) plastid chaperonin-60, cDNA to mRNA.
  ORGANISM  Brassica napus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 1982)
  AUTHORS   Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M.
  TITLE     Unique composition of plastid chaperonin-60: Alpha and beta
            polypeptide-encoding genes are highly divergent
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.M.Hemmingsen, 26-JUN-1990.
FEATURES       from  to/span     description
    pept         64     1830     60-kDa beta-polypeptide of plastid
                                 chaperonin-60 precursor
    sigp         64      228     60-kDa beta-polypeptide of plastid
                                 chaperonin-60 signal peptide
    matp        229     1827     60-kDa beta-polypeptide of plastid
                                 chaperonin-60
BASE COUNT      564 a    372 c    547 g    499 t
ORIGIN      
        1 cctgctcccc agtttcttat ccgtggtgct tctccgcctc ttcgcttcac ttctcttgaa
       61 aagatggcat caaccttcac tgcaacgtct tccattggct ccatggttgc tccaaatgcc
      121 cacaaaacgg ataagaagct tatgaacaag ttgtcttcaa gctctttcgg gaggaggcag
      181 aacgtgttcc ccaagctcag aagatccagt cctgcagttg tatgtgcagc caaggagttg
      241 catttcaaca aagacgggac taccatcagg aagcttcaag ctggtgtcaa caagcttgca
      301 gacctagttg gtgtgacact tggacctaaa gggcgaaacg ttgttcttga gagcaagtat
      361 ggatcaccaa gaattgtcaa tgatggtgtg actgttgcaa gggaggttga attggaagac
      421 cctgttgaga acattggtgc caaacttgtt aggcaagcag ctgccaagac caatgatctg
      481 gccggtgatg gtaccacaac atctgtggtt cttgcacaag gttttattgc tgagggtgtc
      541 aaggtggtgg ctgctggtgc aaaccctgta ttgatcacta gaggcattga gaagacagca
      601 aaggctttgg tagccgagct caagaaaatg tctaaggagg ttgaagacag tgagcttgca
      661 gatgtggcag ccgttagtgc cggtaacaat gcagaaattg gaagcatgat tgctgaagca
      721 atgagcagag tgggcaggaa gggtgtggtg acacttgagg agggtaaaag tgcagagaac
      781 gctctctacg tggtggaagg aatgcaattt gatcgaggtt atgtctcccc ttactttgtg
      841 acagacagcg agaaaatgtc agttgagttc gacaattgca agttgcttct tgttgacaag
      901 aaaattacca atgcaaggga tcttgttggt gttctggagg atgcaattag aggaggatac
      961 ccaattttaa taattgcgga agacattgag caggaggctt tagcgaccct tgttgttaac
     1021 aagcttagag gcacactgaa gattgcagct ctcaaagctc caggatttgg agagcgcaag
     1081 agccaatacc ttgatgatat tgccatcctc actggagcga ctgtgattcg tgaggaagtt
     1141 ggtctttcac ttgacaaagc tggaaaagag gttcttggac atgcagcaaa agttgtcctc
     1201 actaaggaga cttcgaccat tgtgggtgat gggagcacac aggacgcagt gcaaaagcgt
     1261 gttacacaaa ttaagaacct tattgagcaa gcagagcaag actatgagaa ggaaaaactg
     1321 aatgagagaa ttgcaaagct ctctggtgga gttgctgtga ttcaggtcgg agcacaaact
     1381 gagacagaac tcaaagagaa gaaactgaga gttgaagatg ctcttaatgc tacaaaggct
     1441 gctgttgagg aaggaattgt cgttggtggt ggttgtactc tgcttcgcct tgcttccaag
     1501 gtggatgcca ttaaagccgc ccttgacaat gatgaagaaa aggttggagc cgacatcgtg
     1561 aaaagagcac tgagttaccc tctgaaactg attgccaaga atgcaggagt caatggaagc
     1621 gtagttagcg agaaggtgct ttctaacgat aatgtgaagt tcggttacaa tgctgcaacc
     1681 ggcaagtacg aggatctaat ggctgcagga atcatcgatc caactaaggt tgtgagatgt
     1741 tgcttggaac atgcagcttc ggttgcaaag acattcttga tgtctgactg tgtggttgtt
     1801 gagatcaagg agcctgagcc agttccgtag gcaaccaatg gacaactcag gttatggata
     1861 ctgaggagat gggggaatga agtaactctg tcggaaattg attcctttta gcttttgtaa
     1921 tagatgagca agttgtaatg atttcgaaca aaagaatgtg ctgtgtactg aaattttgaa
     1981 cc
//
LOCUS       HS4RAJI     11835 bp ds-DNA             VRL       02-AUG-1990
DEFINITION  Epstein-Barr virus B95-8 deletion region.
ACCESSION   M35547
KEYWORDS    .
SOURCE      Epstein-Barr virus (strain Raji) DNA.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 11835)
  AUTHORS   Parker,B.D., Bankier,A., Satchwell,S., Barrell,B. and Farrell,P.J.
  TITLE     Sequence and transcription of Raji Epstein-Barr virus DNA spanning
            the B95-8 deletion region
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.Farrell, 25-JUN-1990.
FEATURES       from  to/span     description
    pept       3625      851 (c) LF3 gene product
    pept      10605     9316 (c) LF2 gene product
    pept    > 11835    10566 (c) LF1 gene product (AA at 11834)
    orgrpl     3554     4609     origin of replication
BASE COUNT     1948 a   3315 c   3869 g   2703 t
ORIGIN      
        1 ggccgctgtt cacctaaagt gacgcaaggt ctgtcagccg ccagggtccg tttaccaggc
       61 tttcaggtgt ggaatttaga tagagtgggt gtgtgctctt gtttaattac accaagatca
      121 ccaccctcta tccatatccc acaattgata aacctccgca tgtccaacca ccacgttgaa
      181 caggatgtgg caccctaaga ggacgcaggc atacaaggtt attacccagt ccttgtatgc
      241 ctggtgtccc cttagtggga cgcaggccta ggtagcatca tttacactaa aagcagtgac
      301 cttgttggta ctttaaggtt ggtccaatcc ataggctttt tttgtgaaaa cccggggatc
      361 ggactagcct tagagtaact caaggccaag catttcacac ctgcaaatgc accatgtaac
      421 cacagatcta aactgaaagt tgcagcttta gatggcaagg aaacttgggt ttcaggcata
      481 gaaagcctgg ctcactatag cagcccatgt ttgttccagg gtgggggaaa ggcacgtgcc
      541 cttagaaaac ttagctgcaa aaattctatt gtgttgggag agcctctata tctaaaggcc
      601 tttcctcaca atacaaatgt tactaacgtc tgccctctgg agacctgcta tgtggctaga
      661 cgtatggcct acccaagacg ttgggggtct cgggtaggcc atgattcttc caggcatagg
      721 ttacaaccag tcactgctat caagcctact cagttcccaa cgcagcacat accccccgcc
      781 tctcctgcca tgaggactta tggcagtgtt tactgttctg cttttactct tggaccaggc
      841 tgtcattcta tcagaataac aggggaagca aggccccctg cttcagcggg acacgtgttt
      901 ctagaatctc ggagccaata actacctgcc cctctaatct gtatgctgca tgaaaaacca
      961 catacacgtg atgtaagttt agccagttta ttgttacacc aatgccccga aagtctcccc
     1021 ctgtcccttt gggtctcagg acccagccct ggagctcggg ggcggccggg tggcccaccg
     1081 ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg
     1141 gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg
     1201 ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc
     1261 ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg
     1321 tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc
     1381 ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg
     1441 tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt
     1501 ccgctgcccc gctccggcgg ggggtggccg gctgcagccg ggtccggggt tccggccctg
     1561 gagctcgggg ggcggccggg tggcccaccg ggtccgctgg gtccgctgcc ccgctccggc
     1621 ggggggtggc cggctgcagc cgggtccggg gttccggccc tggagctcgg ggggcggccg
     1681 ggtggcccac cgggtccgct gggtccgctg ccccgctccg gcggggggtg gccggctgca
     1741 gccgggtccg gggttccggc cctggagctc ggggggcggc cgggtggccc accgggtccg
     1801 ctgggtccgc tgccccgctc cggcgggggg tggccggctg cagccgggtc cggggttccg
     1861 gccctggagc tcggggggcg gccgggtggc ccaccgggtc cgctgggtcc gctgccccgc
     1921 tccggcgggg ggtggccggc tgcagccggg tccggggttc cggccctgga gctcgggggg
     1981 cggccgggtg gcccaccggg tccgctgggt ccgctgcccc gctccggcgg ggggtggccg
     2041 gctgcagccg ggtccggggt tccggccctg gagctcgggg ggcggccggg tggcccaccg
     2101 ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg
     2161 gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg
     2221 ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc
     2281 ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg
     2341 tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc
     2401 ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg
     2461 tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt
     2521 ccgctgcccc gctccggcgg ggggtggccg gctgcagccg ggtccggggt tccggccctg
     2581 gagctcgggg ggcggccggg tggcccaccg ggtccgctgg gtccgctgcc ccgctccggc
     2641 ggggggtggc cggctgcagc cgggtccggg gttccggccc tggagctcgg ggggcggccg
     2701 ggtggcccac cgggtccgct gggtccgctg ccccgctccg gcggggggtg gccggctgca
     2761 gccgggtccg gggttccggc cctggagctc ggggggcggc cgggtggccc accgggtccg
     2821 ctgggtccgc tgccccgctc cggcgggggg tggccggctg cagccgggtc cggggttccg
     2881 gccctggagc tcggggggcg gccgggtggc ccaccgggtc cgctgggtcc gctgccccgc
     2941 tccggcgggg ggtggccggc tgcagccggg tccggggttc cggccctgga gctcgggggg
     3001 cggccgggtg gcccaccggg tccgctgggt ccgctgcccc gctccggcgg ggggtggccg
     3061 gctgcagccg ggtccggggt tccggccctg gagctcgggg ggcggccggg tggcccaccg
     3121 ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg
     3181 gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg
     3241 ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc
     3301 ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg
     3361 tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc
     3421 ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg
     3481 tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt
     3541 ccgctgcccc gctccggcgg ggatgggggt gcgctcccag gccggaccct ggtgccaggc
     3601 agggaccccg cgccacccgc ttcatggggg gggaggccgc cgcaaggacg ccgggccggc
     3661 tgggaggtgt gcaccccccg agcgtctgga cgacgctggc gagccgggcc agctcgcctt
     3721 cttttatcct ctttttgggg tctctgtgca ataccttaag gtttgctcag gagtgggggg
     3781 cttctcattg gttaattcag gtgtgtgatt ttagcccgtt gggttacatt aaggtgtgta
     3841 accaggtggg tggtacctgg aggtcattct attgggataa cgagaggagg aggggctaga
     3901 ggcccgcgag atttggggta ggcggagcct caggagggtc ccctccatag ggttgaacca
     3961 ggagggggag gatcgggctc cgccccgata tacctagtgg gtggagccta gaggtaggta
     4021 tccatagggt tccattatcc tggaggtatc ctaagctccg cccctatata ccaggtgggt
     4081 ggagctaggt aggattcagc taggttccta ctggggtacc cccctaccct accttaaggt
     4141 gcgccaccct tcctccttcc gttttaatgg tagaataacc tataggttat taacctagtg
     4201 gtggaatagg gtattgcagc tgggtatata cctataggta tatagaacct agaggaaggg
     4261 aaccctatag tgtaatccct ccccccccta cccccccctc ccttacggtt gcctgagccc
     4321 atcccccacc ccagcacccc ggggtgacgt ggcaccccgc gtgccttact gacttgtcac
     4381 ctttgcacat ttggtcagct gaccgatgct cgccacttcc tgggtcatga cctggcctgt
     4441 gccttgtccc atggacaatg tccctccagc gtggtggctg cctttgggat gcatcacttt
     4501 gagccactaa gcccccgttg ctcgccttgc ctgcctcacc atgacacact aagcccctgc
     4561 taatccatga gccccgcctt taggaagcac cacgtcccgg ggacggaagc tggattttgg
     4621 ccagtcttca attttgggga gtggttttgt gtgagccgga agttggcaat ggggtgaggg
     4681 tggcgctggt taagctgacg acctcccaag gtctctcacc ctgggtacac aggtggggcg
     4741 gcagcctcta actttggctg tggcctctat ttcctccctt tcctagccag ggccatgtgt
     4801 tcctgcatgt ctacttgcct cctgtggtgg cagagcttgg ccctgggccc aacccccgcc
     4861 ttgggagcct gtaggggcca acacccttgg tttgtttgtg ttcctgtttg ctggcaactt
     4921 actggcagcc gagcagattc taatgggcgc ccgccttctt tctctcttgt tttattaata
     4981 gaatctcagc caggacctat acctgagact tcaaagtctg gtcctgggtt ctgagacccc
     5041 caagatttgt catgcacacc tgcacacctg ttggtattgg gtttctattc ttgagtgtga
     5101 aagtttgtaa aaaaattcat aaaatgtcac taattcctct tacctgttta gggtattgtg
     5161 caattcttca gcctgcctat tttcaatttg cctaaggtgg caatttaaga tgtggttaat
     5221 taaccatttt cctgtctgac accactgcat gggcaaccgg gttccatggc acatttagag
     5281 ataaacatag atgtcttgtc ttgctcatgt gcagaggagg gggtgttggt gtgcaatata
     5341 gtttctggat tccaaattga gttgggggtg ctattttcac tatggaatta aattactgac
     5401 attagacagt ggacaccggg ctatatgtgg ggatgtctgt ggcttgtcat ttcctcttag
     5461 aaggtaatcc cccatcttaa cttcccttta aattgtgatg caagccctgg gttatttata
     5521 gaatgattat ctaggtttga tagtctgaag gctgggcaga gaatgtttgt aatttttatt
     5581 caccttcttt accccccacg agtatccagt tctagaagat ctcctgatat cccgggctgc
     5641 cattattccc ttgagtgtta tagcttcctc ttaacttaag caagagctcc aggatgttag
     5701 cttttttggt ggggctggtt gtcaggaaga ggttccagtg ttgtccttta tttttagatg
     5761 ttagctttgt gttaggttag tatgggctgg gtattcacta gtgaaggcaa ctaacacagt
     5821 tagacgtgct agttgtgccc actggtgttt atccggtccc aaatgtcacc acagaacaca
     5881 gggggctgga tttggcagca gcacttgtgc ttttgttgat ttttacccgt gtatcagagt
     5941 gggggatgct agccaattta gcttcccctc cccttaacag ggggtctcgc ggggtgccaa
     6001 ttgtcgcctg ccttcccccg cttccccttg ttaacttata gcatgatagg taggtcacct
     6061 aacgtggaag cctggtgggt gatccttcct cggtagggag cgcttagggc tgttgagctc
     6121 aacagcccca cctgggtaaa atgtatgttc taaagagtta cccaattata acaaaactgt
     6181 tgtagggtaa cgaagacctg atggaagtgg tattgttgcc gttgaaagac gggtgtcctg
     6241 gctcaagttc gcacttccta tacagtgtta aagccttgta tcggaagttt gggcttcgtc
     6301 ccagtgtact cgataatgtc gactgctgcg aaaggtttgg accgtcttcc agtaggtgtt
     6361 gggggtccca aatcacgagg ttaggcaggt gcacttggct ctttaggagg gacccttaag
     6421 ccagacaatg tagtgcccct tttttttgca aattggcctt attattaatt tcttgttaac
     6481 actaattctg ttctatgacc ctgtgttttt cagatgccgt tgaacgtgtc actgagctga
     6541 atttggacgc agctacttga cctttgcccc cgtgcctcca gcgctgataa gtgctgcgtc
     6601 cactttgtgt tacaggtggg ccaaacctcc agaatatcaa ttggtggggc cttggtgggc
     6661 tgcataaggc agtaggtttg aggtgaccta cttggaccat gtggatccag tgtcctgatc
     6721 ctggaccttg actatgaaac aattctaaaa aaatgcatca tagtccagtg tccagggaca
     6781 gtgcactcgg aagtctcatc atctccgttt gtgtgtttag tgtggccagt acggccaccc
     6841 ctgtgccacg ccctggcatg ctgctgacat ctggccgcca atttcagcgg gcccttttcc
     6901 cccttgttca ccccatagca agaagggtag gttacatggg tattttccca tcagcacctg
     6961 actggccggt gcaattagag gagagggcaa caacgcaagg ctgttgtttt atttgggtta
     7021 caagagctgc ggcggtcgat gggttcactg attacggttt cctagattgt acagatgaac
     7081 tagaactgtc acaatctatg gggtcgtaga cagtgtgctt accagacttc catggaagat
     7141 gtgaatttgc tgctagctat atgggtggtg ctatgggctc cctagggact catgtagtgg
     7201 ggctttgtga tagctaatga atgtggcagc tgttgtttgt actggaccct gaattggaaa
     7261 cagtaacttg gattctgtaa cacttcatgg gtcccgtagt gacaactatg ctgaatatct
     7321 tgaatatggg aggagggggg ctttgggttc cattgtgtgc cctttcctgg ccaacgtgag
     7381 ggtcctagtg ttatagggcg tggcagtttt cttgagggct aataacccgg gtgaggcggt
     7441 tgtcacaggt gctagaccct ggagttgaac cagtaccact cggttacaaa gtcatggtct
     7501 agtagttgtg accctgcaaa gctacgtggg gatgagcagc cagggacttt ggttggcaag
     7561 cagacaggcg gcgcattgga accccagagg agtgtcccgg ggccacctct ttggttctgt
     7621 acatattttg ttattgtaca taaccatgga gttggctgtg gtgcactcca tctggtaagg
     7681 gggctggtgc ggacgcctgt gtttagtcta tgccaatgtt tacctgcctt gggttactat
     7741 tccaaacgac cacacctttg aggacacctg gagccctgat cattctcggc ttttactgcc
     7801 acctggcttc tgttgggtca gacagtttgg tgcgctagtt gtgtgcttag cagcaacgca
     7861 caccaggctg actgccttag cagtgtggcc ctttattgtg gcatcctaag gagggattct
     7921 ggagtgcctt tcgcgtgaag catgccctga gacgtactcg agttaggact taatcgctcc
     7981 tgtgccgctg gatgagggag cgccaatttg tacatcctag ctctggccat agagttagcc
     8041 cacccttgtg tctccctttg gcctttgcgg tgccaatttc cggtggtttc ccttttccgc
     8101 ccgtttatcc aatagcatgt aagagaggtt gcctagattt ggcaactttg agggaacgtt
     8161 ccgtgtagct ggtgacctaa cacccgccca tcaccaccgg acagattctg aacttgtcct
     8221 gtggtgtttg gtgtggtttt ggggtacgca ggagtacgtt ggaatgcttt ggagccgaga
     8281 gggatgggcc cgcttgtgcg cttatgtgtt acacggtgcc aataaccggc ccggtgcggc
     8341 tgccccgtga cccgtgggcc ttaccttcct ggccatcggg ggaccctggt gctagggtcc
     8401 cttgtgttgc tttctgccat aggggggaaa gcatcgcctt cagaattggc tgctccgttg
     8461 gaacatttga ggcctactgt atccgtgtcc tgacaacatt ccccgcaaac atgacatggg
     8521 ttaatttaaa catgttttgt ttgcttggga atgctcttag ggcctggaag cttgtcattg
     8581 gattcatcgt ttcctgaact acaggcgtag ggcctattgt agcaggcatg tcttcattcc
     8641 tgcgtaccga atggcatgaa ggcacagcct gttaccattg gcaccttttt tccatgtaaa
     8701 cctccgtgat cctgggtcct ttggagactc aagtgtgaat ttgttttggt gttcggcgcc
     8761 agggtatctc gacgttggaa tgtcaactca acttgggcac ctcgataacc ggctcgtggc
     8821 tcgtacagac gattgtttgg ctctgtaact tgccagggac ggctgacgat gtgtttagtc
     8881 tgccacttgc atccggcgct ttggttactc gggagactaa tggggggtgt ggtatggcac
     8941 aggctggggg tgagtctggg gatgtccctg ggcgttgctg cagcccattc gccctctggg
     9001 gatgagatgt tcaggggtgg ccggtaccct acgctgccga tttacataat ataaattgta
     9061 aatgctgcag tagtagggat ctggacgcgc gacctgctac tcttcggaaa cgccaaccca
     9121 ggagcgtcgc ctctggcccc atactcccgc catgcgactg ctcgccccct cccaggcctc
     9181 cctggtgagc ccttgccgct ccccgcattc ctgctttcgg cgcccctgcg gatcccgatg
     9241 acagcaggcc tttccttccc ccgttaatga aaagaatgac agtgaggttg tgacagaagg
     9301 acagctttat tcagtttaca gagtgccctc ggaggctacg atattcccgt taaatgtctt
     9361 gttgattctc tcaaaggtgg ggagggagga gctctccaca acaatgttcc ctggcagcgt
     9421 gagcgcgcag ccctgccgtt ggatgtatct tctcatgatg gtgctgatag aggggtctcc
     9481 ggcgtagatg aaaaaggcct gggccatgct ctggccggtc acgatcgtta tggggttgtt
     9541 ggaaatgttc cggaccgtca gcttgagggt ctggcccggc ttccactcct gtgggtagac
     9601 gtagaagacc gggttggagg agtgggacac gacaacggcc gtaatcttgg agctcagggg
     9661 ggcctcgtag gtgttgttgt attccagctc cgtgatgaaa ttaggaggaa taatcacagg
     9721 ggagccaaag tagcggatgt ctgtggattc cccgtcccag cgccagtggc tcttagggta
     9781 ggggttgtaa cggaaggcaa taatcacatc atccaatagg gtcatgccca ccttgacgtt
     9841 cagcgggccc tctcgtttca ggtccggcgt gtccacggag actcggacgt agcccttacc
     9901 gcggcgtatg gcgtttaccg gacacacctt ccccgggaat gtgtgaatac gggcgtatga
     9961 ctttagaaat gggggcgtgt gctgcgccag caggtaaggc aggcactcgt cctggctggt
    10021 gacgggagag ccactgagga agatctgggg ctcgctggtg tttagcttgt ccccgctctg
    10081 ggtgcaggag cgtgtcagct gaatgtcgct ctgcccgggc agaatctgca ggtagaggta
    10141 ggggttcttg accaatctga tgggcacaat gtaccaggta aacttccctt tctctatgaa
    10201 caggctgcgc ggattcagga cgcttagcac gatgtcctgg tcagagtgca taacgaagaa
    10261 gggcttgagg aatacctcgt tgtcttccgc tccaaagaac aaaaacgcga ccgtaaagta
    10321 gcggctgccg taggtggtcg tgttgaagga gaaagaaggt aacttgaagc tgagtatctg
    10381 gcccaccgag gggcagggag gcagctcttg gcactgcgcg tccagctgca atacctgctt
    10441 gttggtgacg cggacgtatg aggggaagat ctcgtacttc cacacgcctc tcatgaacga
    10501 cgtgtctggt ttttcagtgg gccgcaggcg gcggaggctg ttcctgaacg acgagcgccg
    10561 ggacgctagt gctgcatggg ctcctccggg gtaagcttcg gccatggccg gagctcgtcg
    10621 acgggcaagg tgagagtcgg ggggcgggcg acggtgcggc cccaatacaa ctctccgctc
    10681 gttagctggt agaatatccg cccggcgtct aggttgtcac ttcgctcggc cggccagaag
    10741 agcgcaagtc caagtctggt gctggggccg atgtgcagcg gtttgtgccc gcagttgtag
    10801 actgtcattt ttatgggcga gtgggcggtc cacacgcgcg ggcgcagcac ccattggtcg
    10861 cacgccgcct cctggaatgt aaacccccag agagagggcg tgccgccctg gagatggccc
    10921 tgtgccatca catgtatttc ctccttgggt ggaacaacgg cgtcgtgctc cgggtggagg
    10981 gggaatagcg tccaggcatc tttcagggtc acgagaccgg ggtccatgct cagagaacag
    11041 ccctcccggg cggtgggcgg cccgggctcc agcagaacgt cgcagaccca gccctcctcg
    11101 gccctgtcca cctgtatgtc caggtgcacg gacccggagg ctgcgtctcg tgacatggcc
    11161 aggcctggtg ccagccgacc acgtcccgtg tcccagccga ggccgcgcca gagcagagcc
    11221 cgggactgac tcagggccac atcccctcgg cccgcggacg ccgcctcgcc agcccccggg
    11281 ccttcatggg cccgctttct acctctctcc ggcaccccag cctggtcagc cgcagaggaa
    11341 gcatgacctt ggggtgggac ggggcaggcg tgatcctggg cgcaatcttt gccgatcccc
    11401 acaccttcac tccttgttag gttgatagaa tgtcggtacc acgccacggg gggcgggccc
    11461 gcatagggaa aagccaggga gagcgatgtg ggcgaggatg ggctcaggcg gccccagaca
    11521 cgcaatttgc ccccctgggc ggccgcagcc tgcccctcgg cggcccgtgc cccagctccg
    11581 tcacgggggg cgcataggag gggtatatct aggatagccg cacctacaca aatgagacac
    11641 agacacaggt cgtgaggatt taggcaacgc aggcttgtct ttatagttac aaacatggga
    11701 gcgtgcacct ggaagatgca gctggggtag atctttacat ctttacaggg cgcagcggcc
    11761 gccagacact gaagggcaga gttcacggcg ggcacctccc agagggagcc caccagcccg
    11821 tacctggcca cggcc
//
LOCUS       HUMG25KA     1014 bp ss-mRNA            PRI       02-AUG-1990
DEFINITION  Human GTP-binding protein (G25K) mRNA, complete cds.
ACCESSION   M35543
KEYWORDS    G25K gene; GTP-binding protein G25K.
SOURCE      Human fetal brain, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1014)
  AUTHORS   Munemitsu,S., Innis,M.A., Clark,R., McCormick,F., Ullrich,A. and
            Polakis,P.
  TITLE     The molecular cloning and expression of G25K, the human homolog of
            the yeast cell cycle gene CDC42
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Munemitsu, 25-JUN-1990.
            
              Author address: S.Munemitsu
              Cetus Corporation
              1400 53rd Street
              Emeryville, CA 94608
            
FEATURES       from  to/span     description
    pept          4      579     GTP-binding protein G25K
BASE COUNT      286 a    204 c    210 g    314 t
ORIGIN      
        1 gcaatgcaga caattaagtg tgttgttgtg ggcgatggtg ctgttggtaa aacatgtctc
       61 ctgatatcct acacaacaaa caaatttcca tcggaatatg taccgactgt ttttgacaac
      121 tatgcagtca cagttatgat tggtggagaa ccatatactc ttggactttt tgatactgca
      181 gggcaagagg attatgacag attacgaccg ctgagttatc cacaaacaga tgtatttcta
      241 gtctgttttt cagtggtctc tccatcttca tttgaaaacg tgaaagaaaa gtgggtgcct
      301 gagataactc accactgtcc aaagactcct ttcttgcttg ttgggactca aattgatctc
      361 agagatgacc cctctactat tgagaaactt gccaagaaca aacagaagcc tatcactcca
      421 gagactgctg aaaagctggc ccgtgacctg aaggctgtca agtatgtgga gtgttctgca
      481 cttacacaga gaggtctgaa gaatgtgttt gatgaggcta tcctagctgc cctcgagcct
      541 ccggaaactc aacccaaaag gaagtgctgt atattctaaa ctgttttctc cttcccttct
      601 ttgctgctgc ttcctgtccc actactgtag aaagatcgtt taaaaacaaa ggaataaaac
      661 catcctgttt gaaagcctct gcgtcttttt actcaccacc ttagagcaac ctctgtatta
      721 gtttttgatc aagaattgca atatcatata aattttttgt gatcagtagt caagttggac
      781 ttgttttaac gttctgctgc ttgagttgcc tgatgctcag agctttttgg tttggattac
      841 tattgcaaag ggaacttggt ctggcttaga tgtcctcttg gagaaaataa caagagtttt
      901 aacacttcta gatcttagtt cagatggaga aagtaacaca aacatcattt tactcttatg
      961 atcaattgtt aattgtaatt gcatgacaaa ccttatggaa aaggggtgac ctgg
//
LOCUS       HUMMUCAB     1804 bp ss-mRNA            PRI       02-AUG-1990
DEFINITION  Human polymorphic epithelial mucin (PEM) mRNA, complete cds.
ACCESSION   J05581
KEYWORDS    polymorphic epithelial mucin.
SOURCE      Human adult mammary adenocarcinoma cell line BT20, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1804)
  AUTHORS   Gendler,S.J., Lancaster,C.A., Taylor-Papadimitriou,J., Duhig,T.,
            Peat,N., Burchell,J., Pemberton,L., Lalani,E.-N. and Wilson,D.
  TITLE     Molecular cloning and expression of the human tumour-associated
            polymorphic epithelial mucin, PEM
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.J.Gendler, 26-JUN-1990.
FEATURES       from  to/span     description
    pept         73     1500     polymorphic epithelial mucin (PEM) precursor
    sigp         73      135     polymorphic epithelial mucin signal peptide
    matp        136     1497     polymorphic epithelial mucin
    signal     1783     1788     poly-A signal
    mRNA    <     1     1804     PEM mRNA
BASE COUNT      361 a    622 c    434 g    387 t
ORIGIN      
        1 cgctccacct ctcaagcagc cagcgcctgc ctgaatctgt tctgccccct ccccacccat
       61 ttcaccacca ccatgacacc gggcacccag tctcctttct tcctgctgct gctcctcaca
      121 gtgcttacag ttgttacagg ttctggtcat gcaagctcta ccccaggtgg agaaaaggag
      181 acttcggcta cccagagaag ttcagtgccc agctctactg agaagaatgc tgtgagtatg
      241 accagcagcg tactctccag ccacagcccc ggttcaggct cctccaccac tcagggacag
      301 gatgtcactc tggccccggc cacggaacca gcttcaggtt cagctgccac ctggggacag
      361 gatgtcacct cggtcccagt caccaggcca gccctgggct ccaccacccc gccagcccac
      421 gatgtcacct cagccccgga caacaagcca gccccgggct ccaccgcccc cccagcccac
      481 ggtgtcacct cggccccgga caccaggccg gccccgggct ccaccgcccc cccagcccat
      541 ggtgtcacct cggccccgga caacaggccc gccttgggct ccaccgcccc tccagtccac
      601 aatgtcacct cggcctcagg ctctgcatca ggctcagctt ctactctggt gcacaacggc
      661 acctctgcca gggctaccac aaccccagcc agcaagagca ctccattctc aattcccagc
      721 caccactctg atactcctac cacccttgcc agccatagca ccaagactga tgccagtagc
      781 actcaccata gcacggtacc tcctctcacc tcctccaatc acagcacttc tccccagttg
      841 tctactgggg tctctttctt tttcctgtct tttcacattt caaacctcca gtttaattcc
      901 tctctggaag atcccagcac cgactactac caagagctgc agagagacat ttctgaaatg
      961 tttttgcaga tttataaaca agggggtttt ctgggcctct ccaatattaa gttcaggcca
     1021 ggatctgtgg tggtacaatt gactctggcc ttccgagaag gtaccatcaa tgtccacgac
     1081 gtggagacac agttcaatca gtataaaacg gaagcagcct ctcgatataa cctgacgatc
     1141 tcagacgtca gcgtgagtga tgtgccattt cctttctctg cccagtctgg ggctggggtg
     1201 ccaggctggg gcatcgcgct gctggtgctg gtctgtgttc tggttgcgct ggccattgtc
     1261 tatctcattg ccttggctgt ctgtcagtgc cgccgaaaga actacgggca gctggacatc
     1321 tttccagccc gggataccta ccatcctatg agcgagtacc ccacctacca cacccatggg
     1381 cgctatgtgc cccctagcag taccgatcgt agcccctatg agaaggtttc tgcaggtaat
     1441 ggtggcagca gcctctctta cacaaaccca gcagtggcag ccacttctgc caacttgtag
     1501 gggcacgtcg cccgctgagc tgagtggcca gccagtgcca ttccactcca ctcaggttct
     1561 tcagggccag agcccctgca ccctgtttgg gctggtgagc tgggagttca ggtgggctgc
     1621 tcacaccgtc cttcagaggc cccaccaatt tctcggacac ttctcagtgt gtggaagctc
     1681 atgtgggccc ctgaggctca tgcctgggaa gtgttgtggt gggggctccc aggaggactg
     1741 gcccagagag ccctgagata gcggggatcc tgaactggac tgaataaaac gtggtctccc
     1801 actg
//
LOCUS       RABGHKAB     1367 bp ss-mRNA            MAM       02-AUG-1990
DEFINITION  O.cuniculus gastric H+ - K+ ATPase beta subunit mRNA, complete cds.
ACCESSION   M35544
KEYWORDS    H+ - K+ ATPase beta subunit.
SOURCE      O.cuniculus (strain New Zealand White) adult stomach, cDNA to mRNA,
            clone RB-49.
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 1367)
  AUTHORS   Reuben,M.A., Lasater,L.S. and Sachs,G.
  TITLE     Characterization of a beta subunit of the gastric H,K ATPase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Reuben, 25-JUN-1990.
FEATURES       from  to/span     description
    pept         54      929     H+/K+ ATPase beta subunit
    mRNA          1     1367     H+/K+ ATPase beta subunit mRNA
BASE COUNT      293 a    450 c    376 g    248 t
ORIGIN      
        1 tgtccatcta aggggtgcag ggtgggggag cggcggcttc cacagcagac accatggccg
       61 ccttgcagga gaagaagtcg tgcagccagc gcatggagga gttccgccac tactgctgga
      121 acccggacac ggggcagatg ctgggccgca ccctgtcccg ctgggtgtgg atcagcctct
      181 actacgtggc cttctacgtg gtgatgaccg ggctgtttgc cctgtgcatc tacgtgctca
      241 tgcaaaccat cgacccctac acgcctgact accaggacca gctgaagtcg ccaggcgtaa
      301 ccttaaggcc agacgtgtat ggggagaagg gcctggaaat ccactacaac atctctgaca
      361 accgcacctg gaccagcctc acgcacaccc tgagaagctt cctggcaggt tactcacccg
      421 cagcccaggt ggacaacatc aactgcacct ccaagaccta cttcttccag gagagcttcg
      481 gggctcccaa ccacaccaag ttctcctgca agttcacggc ggacatgctg gagaactgct
      541 ccggcctcac cgaccccagc ttcggcttca aggaaggaaa gccctgcttc atcattaaga
      601 tgaacaggat tgtcaggttc ctgcccagca acagcacacc accccgcgtg gactgcacct
      661 tcctggacat gccccaccag gccctcacgc cgctgcaggt cgagtactac ccccccaacg
      721 gcaccttcag cctccactac ttcccttact acggaaagaa agcgcagccg cactacagca
      781 accctctggt ggccgccaag ctcctcaacg tgcccacgaa cacggaggtc gtggtcctct
      841 gcaagatcct ggccgatcac gtgaccttcg acaacccgca cgacccctac gaggggaaag
      901 tggagttcaa gctcaagatc cagaagtgag ctcgcggtcg tgggtctcac gggggtcccg
      961 ctgggccctg tggacgcgtt tctgaggccc gcaccagtgc ccccctggga gcgccaggca
     1021 agcagccgcg tgcacagcgc ttccattggc gtccgctgct tcccgtggtg tcccgtcccg
     1081 tgcaggcgca tctgtagtag gatggaaaat gccggctaac ccagccatgg caaggcggcg
     1141 accgcgagtg cccagagccc agaactgacc cgtcctttcc gtgcggggag caccttcttg
     1201 ctaccattgg tttttctgac gtcagtgtga gcggcttccc agatgacgca ggggctgtta
     1261 gtgtgctgag gtgcggtgac gtgagacgcc catggcaacc cccactcaca gcgcaatccc
     1321 cgtgttgtaa aaataaaacg tgttagttac tctaggtaaa acaaaaa
//
LOCUS       ECOTRRNA23    485 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  E.coli 23S small subunit ribosomal RNA, 5' end.
ACCESSION   M24881
KEYWORDS    23S ribosomal RNA; small subunit ribosomal RNA.
SOURCE      E.coli rRNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 485)
  AUTHORS   Branlant,C., Sri Widada,J., Krol,A. and Ebel,J.-P.
  TITLE     RNA sequences in ribonucleoprotein fragments of the complex formed
            from ribosomal 23-S RNA and ribosomal protein L24 of Escherichia
            coli
  JOURNAL   Eur. J. Biochem. 74, 155-170 (1977)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1  >   485     23S rRNA
BASE COUNT      143 a    105 c    151 g     86 t
ORIGIN      
        1 ggttaagcga ctaagccgga gcgtacacgg atggtgcaac gccctgggca gtcagagagc
       61 gtgcgatgaa ggacgctaat ctgataagcg atatggtaag gtcgaaccgt gttataaccg
      121 atttccggcg aatggggaaa cagtgtgttt cgacacacta tcattaactg aatcaatagg
      181 ttaatgaacc ggggggaact gaaacatcta agtaccccga aaaggggaaa taaccgagat
      241 tcccccagta gagagcgtag ggcccagagc ctgaacgaat cagtgtgtgg aagcgataca
      301 gggtgaaagg gcgtgtctgt tagcgccccg acagtacaca aaaatgcaca tgctgagctc
      361 gggacacgag tgatgagggg gggcgcgaaa agaaccccga aagggggcgt ctgaatatgt
      421 atccctgacc atcctccaag gctaaatacc cttggggggg gactgaccga tagtaccgaa
      481 gagtg
//
LOCUS       KPNNIFBQ     2178 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  K.pneumoniae nitrogen fixation (nifB and niQ) genes, complete cds.
ACCESSION   M15545
KEYWORDS    nitrogen fixation protein.
SOURCE      K.pneumoniae DNA.
  ORGANISM  Klebsiella pneumoniae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2178)
  AUTHORS   Buikema,W.J., Klingensmith,J.A., Gibbons,S.L. and Ausubel,F.M.
  TITLE     Conservation of structure and location of Rhizobium meliloti and
            Klebsiella pneumoniae nifB genes
  JOURNAL   J. Bacteriol. 169, 1120-1126 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        199     1605     nitrogen fixation B (nifB)
    pept       1605     2108     nitrogen fixation Q (nifQ)
BASE COUNT      402 a    670 c    692 g    414 t
ORIGIN      
        1 tcagattatg gatatcacca tgccgcgact gtgaagcctt atgtgagatt caggacattg
       61 tcgccagcgc ggcggaattg cgacaattca gggacgcggg ttgccggtta aaaagtctac
      121 ttttcatgcg gttgcgaaat taacctctgg tacagcattt gcagcaggaa ggtatcgccc
      181 aaccacgaag gtacgaccat gacttcctgc tcctcttttt ctggcggcaa agcctgccgc
      241 ccggcggatg acagcgcatt gacgccgctt gtggccgata aagctgccgc gcacccctgc
      301 tactctcgcc atgggcatca ccgtttcgcg cggatgcatc tgcccgtcgc gcccgcctgc
      361 aatttgcagt gcaactactg taatcgcaaa ttcgattgca gcaacgagtc ccgccccggg
      421 gtatcgtcaa cgctgctgac gcctgaacag gcggtcgtga aagtgcgtca ggtcgcgcag
      481 gcgatcccgc agctttcggt ggtgggcatc gccgggcccg gcgatccgct cgccaatatc
      541 gcccgcacct ttcgcaccct ggagctgatc cgcgaacagc tgccggacct gaaattatgc
      601 ctgtcgacca acggactgat gctgcctgac gcggtggacc gcctgctgga tgtcggcgtt
      661 gaccacgtca cggtcaccat taacaccctc gacgcggaga ttgccgcgca aatctacgcc
      721 tggctatggc tggacggcga acgctacagc gggcgcgaag gcggagagat cctgattgcc
      781 cgtcagcttg agggcgtacg caggctgacc gccaaaggcg tgctggtgaa aataaattcg
      841 gtgctgatcc ccggtatcaa cgatagcggc atggccgacg tgagccgcgc gctgcgggcc
      901 agcggcgcgt ttatccataa tattatgccg ctgatcgcca ggccggagca cggcacggtg
      961 tttcggctca acggccagcc ggagccggac gccgagacgc tcgccgccac ccgcagccgg
     1021 tgcggcgaag tgatgccgca gatgacccac tgccaccagt gtcgcgccga cgccattggg
     1081 atgctcggcg aagaccgcag ccagcagttt acccagcttc cggcgccaga gagtctcccg
     1141 gcctggctgc cgatcctcca ccagcgcgcg cagctgcacg ccagcattgc gacccgcggc
     1201 gaatctgaag ccgatgacgc ctgcctggtc gccgtggcgt caagccgcgg ggacgtcatt
     1261 gattgtcact ttggtcacgc cgaccggttc tacatttaca gcctctcggc cgccggtatg
     1321 gtgctggtca acgagcgctt tacgcccaaa tattgtcagg ggcgcgatga ctgcgagccg
     1381 caggataacg cagcccggtt tgcggcgatc ctcgaactgc tggcggacgt taaagccgta
     1441 ttctgcgtgc gtatcggcca tacgccgtgg caacagctgg aacaggaagg cattgaaccc
     1501 tgcgttgacg gcgcgtggcg gccggtctcc gaagtgctgc ccgcgtggtg gcaacagcgt
     1561 cgggggagct ggcctgccgc gttgccgcat aagggggtcg cctgatgccg ccgctcgact
     1621 ggttgcggcg cttatggctg ctgtaccacg cggggaaagg cagctttccg ctgcgcatgg
     1681 ggcttagccc gcgcgattgg caggcgctgc ggcggcgcct gggcgaggtg gaaacgccgc
     1741 tcgacggcga gacgctcacc cgtcgccgcc tgatggcgga gctcaacgcc acccgcgaag
     1801 aggagcgcca gcagctgggc gcctggctgg cgggctggat gcagcaggat gccgggccga
     1861 tggcgcagat tatcgccgag gtttcgctgg cgtttaacca tctctggcag gaatcttggt
     1921 ctggcatcgc cgccgaattg cgcctgctga tgagcgactg ctttccacag ctggtggtga
     1981 tgaacgaaca caatatgcgc tggaaaaagt tcttttatcg tcagcgctgt ttgctgcaac
     2041 agggggaagt tatctgccgt tcgccaagct gcgacgagtg ctgggaacgc agcgcctgtt
     2101 ttgagtagcc gtttcccgaa gggggcgctg caaacaaaaa agccggaggt ttccctccgg
     2161 cttttcacat catcaaat
//
LOCUS       RHMNIFB      1722 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  R.meliloti nitrogen fixation B (nifB) gene, complete cds.
ACCESSION   M15544
KEYWORDS    nitrogen fixation protein.
SOURCE      R.meliloti DNA.
  ORGANISM  Rhizobium meliloti
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 1722)
  AUTHORS   Buikema,W.J., Klingensmith,J.A., Gibbons,S.L. and Ausubel,F.M.
  TITLE     Conservation of structure and location of Rhizobium meliloti and
            Klebsiella pneumoniae nifB genes
  JOURNAL   J. Bacteriol. 169, 1120-1126 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        199     1671     nitrogen fixation B protein (nifB)
BASE COUNT      422 a    476 c    481 g    343 t
ORIGIN      
        1 gcgtcgcgag cgtcgcctct tttctccgtc ctttcgaaac acgaccggat gcaattcaac
       61 tttgcccttg tcggaagtcc gacaagcctg tgagcaaaga gacaggaccg aatccaaaga
      121 cgccaatcca ttgacgaaaa ggatttggca tagctgttgc tggttgaatt gcaaaatcac
      181 cctccactgg agttgttcat gtccacaccc atgattttgc gtgagagccg gaccagcact
      241 acattctctg accagttgct ggagaacgct aaatcggttg gctgctcacc cccatcgacg
      301 gcgccgggcg acatagatcc tggaacttgg gacaagatta agaatcaccc ctgtttttca
      361 gaggaggcgc atcactattt cgcgcgcatg cacgtggcgg tcgcgcctgc ctgcaacatc
      421 caatgtaact attgcaatcg caaatacgat tgcgccaacg aaagtcggcc cggtgttgcc
      481 tcggaaaagc tcacacctga ccaggcggtg cgaaaggtga ttgccgttgc caacgaagtg
      541 cctcagctgt cagtgcttgg catcgctggg cctggcgatg cctgttacga ctggaagaaa
      601 acaagggcga cgttcgaacg agtggctagg gaaattcccg acataagact ctgcatctcc
      661 acgaacgggc tctcgctgcc ggaccatgtc gatgagcttg ccgaaatgaa cgtcgatcac
      721 gtgacgatca ccatcaacat ggtcgatccg cgtgtcggcg taaagatcta cccctggatt
      781 tactatggtc agcgccgcca cactggtatc gacgctgcga gaatcctgca cgaacggcag
      841 atgttgggcc tggagatgct agccgaacgc ggcatcctca ccaaggtcaa ctcggtaatg
      901 atccccggcg tcaatgatga gcacctgatc gaagtcaaca aagttgtgaa aggaagaggc
      961 gcgttgctgc acaacgtaat gccgctaatt tcaaaccgca tacacgggac ctattacgga
     1021 ctgacagggc agcgcggccc ggaggccttc gaactgcagg cccttcagga ccgtctagaa
     1081 ggaaccaaac tgatgcgtca ttgtcgacat tgccgggccg atgccatagg cttgctcggc
     1141 gatgatcgtg gtcacgagtt cacgctcgct gaaatccccg acgagataac ctacgatgcc
     1201 agcaagcgac aggcctatcg ccagttggtc gcgcgcgaac gcggggacca cctagtggcc
     1261 aagaacgagg cgaacagaac ggtaatgtcg gtggaatatg gcggatcgct tctcattgcc
     1321 gtggcgacca aaggcggggg ccggatcaac gaacattttg gacacgcgaa agaatttcac
     1381 gtttataccg tctcccagag agggatcaag ctggcaggcc gccgcagggt tgagcagtat
     1441 tgcctcggcg gttggggcga ggtcgccacc ctcgatcaca tcgtcgttgc gcttgaagga
     1501 atagacatcc tgctctgcgt caaaatcgga gattacccaa ggaaacagct gacacaggcc
     1561 gggcttcgag cgacggaagc ttatggccat gactacatcg agagtgcgct cgaaagctct
     1621 acgccgccga gtttggctat cgaaccaccg gtaaagacgg cgacacgttg agctgccctc
     1681 cgactgaata aggagtttaa aatggccttc aagattattg ca
//
LOCUS       AKOMTCYTBA    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.boliviensis mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35691
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.boliviensis (MVZ catalog # 171607) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon boliviensis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon boliviensis.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      122 a    111 c     52 g    116 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta ctcaaaattg ttaatcactc attcattgac
       61 ctaccaactc catctaacat ctcatcctga tgaaattttg gatccctact aggcatatgc
      121 ctaataatcc aaattttaac aggcctattc ctagccatac actacacatc agacacaacc
      181 acagcattct cctcagtagc acatatctgc cgagatgtaa actacggatg attaatccgc
      241 tacttgcacg ctaatggagc ctccatattc tttatctgcc tcttcatcca cgtaggccga
      301 ggtatttact atggttccta taccctctca gaaacctgaa acattggcat catcttattc
      361 ctcacaacta tagcaacagc atttgtagga tatgtactcc c
//
LOCUS       AKOMTCYTBB    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.boliviensis mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35692
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.boliviensis (MVZ catalog # 171608) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon boliviensis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon boliviensis.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      122 a    112 c     52 g    115 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta ctcaaaattg ttaatcactc attcattgac
       61 ctaccaactc catctaacat ctcatcctga tgaaattttg gatccctact aggcatatgc
      121 ctaataatcc aaattttaac aggcctattc ctagccatac actacacatc agacacaacc
      181 acagcattct cctcagtagc acatatctgc cgagatgtaa actacggatg attaatccgc
      241 tacttgcacg ctaatggagc ctccatattc tttatctgcc tcttcatcca cgtaggccga
      301 ggtatttact atggttccta caccctctca gaaacctgaa acattggcat catcttattc
      361 ctcacaacta tagcaacagc atttgtagga tatgtactcc c
//
LOCUS       AKOMTCYTBC    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.puer mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35693
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.puer (MVZ catalog # 171612) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon puer
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon puer.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      122 a    112 c     52 g    115 t
ORIGIN      
        1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc atttattgac
       61 ctaccaactc catctaacat ctcatcctgg tgaaattttg gatccctact aggcatatgc
      121 ctagtaatcc aaattctaac aggcctattc ctagccatac actatacatc agacacaacc
      181 acagcattct cctcagtagc ccacatctgc cgagatgtaa attacggatg attaatccgc
      241 tatctacacg ccaatggagc ttctatattc tttatctgcc tcttcatcca tgtaggccga
      301 ggtatttact acggctccta tgtcctctca gaaacctgaa acatcggcat catcttattc
      361 ctcacaacta tagcaacagc attcgtagga tatgtactac c
//
LOCUS       AKOMTCYTBD    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.subfuscus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35694
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.subfuscus (MVZ catalog # 172969 and 172970) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon subfuscus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon subfuscus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      120 a    112 c     53 g    116 t
ORIGIN      
        1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac
       61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcatatgc
      121 ttagtgatcc aaattctaac aggcctgttc ctagccatac actatacatc cgacacaact
      181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc
      241 tacttacacg ctaatggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga
      301 ggtatttact acggctccta tgtcctctca gaaacttgaa acatcggcat catcttattc
      361 ctcacaacta tagcaacagc attcgtagga tatgtattac c
//
LOCUS       AKOMTCYTBE    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.subfuscus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35695
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.subfuscus (MVZ catalog # 174109 and 174110) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon subfuscus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon subfuscus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      120 a    113 c     53 g    115 t
ORIGIN      
        1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac
       61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcatatgc
      121 ttagtgatcc aaattctaac aggcctgttc ctagccatac actatacatc cgacacaact
      181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc
      241 tacttacacg ctaacggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga
      301 ggtatttact acggctccta tgtcctctca gaaacttgaa acatcggcat catcttattc
      361 ctcacaacta tagcaacagc attcgtagga tatgtattac c
//
LOCUS       AKOMTCYTBF    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.subfuscus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35696
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.subfuscus (MVZ catalog # 174229 and 174230) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon subfuscus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon subfuscus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      122 a    116 c     51 g    112 t
ORIGIN      
        1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac
       61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcacatgc
      121 ttagtaatcc aaattctaac aggcctattc ctagccatac actatacatc cgacacaact
      181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc
      241 tacttacacg ctaacggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga
      301 ggtatttact acggctccta cgtcctctca gaaacttgaa acatcggcat catcttattc
      361 ctcacaacta tagcaacagc attcgtagga tatgtactac c
//
LOCUS       AKOMTCYTBG    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.kofordi mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35697
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.kofordi (MVZ catalog # 171665 and 171666) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon kofordi
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon kofordi.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      122 a    116 c     51 g    112 t
ORIGIN      
        1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac
       61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcacatgc
      121 ttagtaatcc aaattctaac aggcctattc ctagccatac actatacatc cgacacaact
      181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc
      241 tacttacacg ctaacggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga
      301 ggtatttact acggctccta cgtcctctca gaaacttgaa acatcggcat catcttattc
      361 ctcacaacta tagcaacagc attcgtagga tatgtactac c
//
LOCUS       AKOMTCYTBH    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.juninensis mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35698
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.juninensis (MVZ catalog # 173038 and 173039) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon juninensis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon juninensis.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      117 a    115 c     57 g    112 t
ORIGIN      
        1 atgaaagtct tacgaaaaaa tcacccacta ctcaaaattg ttaatcactc atttatcgat
       61 ctaccaaccc catctaacat ctcatcctga tggaatttcg ggtccctact aggcacatgc
      121 ctagtaatcc aaattctaac aggcctattc ctagccatac actacacgtc agatacaacc
      181 acagcattct cctcagtagc ccacatctgc cgagatgtaa actacggatg attaatccgc
      241 tacttacacg ctaacggagc ctctatattc tttatctgcc tcttcatcca tgtaggccga
      301 gggatttact acggctccta catcctctca gaaacctgaa acattggcat catcttgttt
      361 ctcacaacta tagcaacagc atttgtaggg tatgtacttc c
//
LOCUS       AKOMTCYTBI    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.mollis mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35699
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.mollis (MVZ catalog # 173057 and 173058) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon mollis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon mollis.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      120 a    110 c     54 g    117 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta cttaaaatta ttaatcactc attcattgac
       61 ctaccaaccc catccaacat ctcatcttga tgaaactttg ggtcactgct cggtgtatgc
      121 ctaataatcc aaatcctaac tggcctattc ctagccatac actacacatc agacacaacc
      181 acagcattct cctcagtagc ccacatttgt cgagatgtaa actacggatg attaatccgc
      241 tatctacacg ctaacggagc ttccatattc tttatttgcc ttttcatcca tgtaggccga
      301 ggaatctatt acggctccta tgtcctctca gaaacctgaa acatcggtat catcctattc
      361 cttacaacta tagcaacagc attcgtagga tatgtattac c
//
LOCUS       AKOMTCYTBJ    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.torques mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35700
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.torques (MVZ catalog # 171720) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon torques
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon torques.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      117 a    112 c     58 g    114 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg taaatcactc attcattgac
       61 ttaccaaccc catccaacat ctcatcttga tgaaactttg ggtcactact cggtgtatgc
      121 ctaataatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
      181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc
      241 tatctacacg ctaacggagc ttccatattc tttatctgcc ttttcatcca tgtaggccga
      301 ggaatctatt acggctccta tgtcctctca gaaacctgaa acgtcggtat catcctattc
      361 ctcacaacta tagcaacagc attcgtaggg tacgtgttac c
//
LOCUS       AKOMTCYTBK    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.torques mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35701
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.torques (MVZ catalog # 171721) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon torques
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon torques.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      116 a    112 c     59 g    114 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg taaatcactc attcattgac
       61 ttaccaaccc catccaacat ctcatcttga tgaaactttg ggtcactact cggtgtatgc
      121 ctaataatcc aaattctgac cggcctgttc ctagccatac actacacatc agacacaacc
      181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc
      241 tatctacacg ctaacggagc ttccatattc tttatctgcc ttttcatcca tgtaggccga
      301 ggaatctatt acggctccta tgtcctctca gaaacctgaa acgtcggtat catcctattc
      361 ctcacaacta tagcaacagc attcgtaggg tacgtgttac c
//
LOCUS       AKOMTCYTBL    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.torques mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35702
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.torques (MVZ catalog # 174053 and 174054) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon torques
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon torques.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      118 a    112 c     58 g    113 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg taaatcactc atttattgac
       61 ttaccaaccc catccaacat ctcatcttga tggaactttg ggtcactact aggcgtatgc
      121 ctaataatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
      181 acagcattct cttcagtagc ccacatttgc cgagacgtaa actacggatg attaatccgc
      241 tatctacacg ctaacggagc ttccatattc tttatctgcc ttttcatcca tgtaggccga
      301 gggatctatt acggctccta tgccctctca gaaacctgaa acatcggtat catcttattc
      361 ctcacaacca tagcaacagc attcgtaggg tatgtattac c
//
LOCUS       AKOMTCYTBM    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35703
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.aerosus (MVZ catalog # 171679 and 1761680) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon aerosus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon aerosus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      115 a    107 c     59 g    120 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccacta cttaaaattg ttaatcactc atttattgac
       61 ctaccaaccc cacccaacat ttcatcctga tggaattttg ggtcactact tggtgtatgc
      121 ctaataatcc aaattctgac tggcctattc ctagccatac actacacatc agacacaacc
      181 acagcattct cttcagtagc ccatatttgc cgagatgtaa actacgggtg gctaatccgc
      241 tatctacatg ctaacggagc ctccatattc tttatctgcc tttttatcca cgtgggtcga
      301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat tatcttattc
      361 cttacaacta tagcaacagc attcgtaggg tatgtactac c
//
LOCUS       AKOMTCYTBN    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35704
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.aerosus (MVZ catalog # 172818 and 172819) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon aerosus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon aerosus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      110 a    114 c     60 g    117 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcattc atttattgac
       61 ctcccaaccc catctaacat ctcatcttga tggaatttcg ggtcactact cggtgtgtgc
      121 ctagtaatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
      181 acagcattct cctcagttgc ccacatttgc cgagatgtaa actacggatg attaattcgc
      241 taccttcacg ctaacggagc ctccatattc tttatctgcc ttttcatcca cgtaggccga
      301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat catcctgttc
      361 cttacaacta tagcaacagc attcgtaggg tatgttctac c
//
LOCUS       AKOMTCYTBO    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35705
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.aerosus (MVZ catalog # 172849) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon aerosus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon aerosus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      111 a    114 c     59 g    117 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcattc atttattgac
       61 ctcccaaccc catctaacat ctcatcttga tggaatttcg ggtcactact cggtgtgtgc
      121 ctagtaatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
      181 acagcattct cctcagttgc ccacatttgc cgagatgtaa actacggatg attaattcgc
      241 taccttcacg ctaacggagc ctccatattc tttatctgcc ttttcatcca cgtaggccga
      301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat catcctattc
      361 cttacaacta tagcaacagc attcgtaggg tatgttctac c
//
LOCUS       AKOMTCYTBP    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35706
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.aerosus (MVZ catalog # 172850) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon aerosus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon aerosus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      110 a    114 c     60 g    117 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcattc atttattgac
       61 ctcccaaccc catctaacat ctcatcttga tggaatttcg ggtcactact cggtgtgtgc
      121 ctagtaatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc
      181 acagcattct cctcagttgc ccacatttgc cgagatgtaa actacggatg attaattcgc
      241 taccttcacg ctaacggagc ctccatattc tttatctgcc ttttcatcca cgtaggccga
      301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat catcctgttc
      361 cttacaacta tagcaacagc attcgtaggg tatgttctac c
//
LOCUS       AKOMTCYTBQ    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35707
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.aerosus (MVZ catalog # 172870) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon aerosus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon aerosus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      115 a    108 c     58 g    120 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcactc atttattgac
       61 ctcccaaccc catccaacat ctcatcttga tggaattttg ggtcactact aggtgtatgc
      121 ctaataatcc aaattctgac tggactattc ctagccatac actacacatc agacacaacc
      181 acagcattct cttcagtcgc ccacatttgc cgagatgtaa attacgggtg attaatccgc
      241 taccttcacg ctaacggagc ctccatattc tttatctgcc tttttatcca cgtaggtcga
      301 ggaatctact acggctccta tgtcctttca gaaacctgaa acatcggtat catcctgttc
      361 cttacaacta tagcaacagc atttgtagga tatgtactac c
//
LOCUS       AKOMTCYTBR    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.aerosus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35708
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.aerosus (MVZ catalog # 172871) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon aerosus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon aerosus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      116 a    108 c     57 g    120 t
ORIGIN      
        1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcactc atttattgac
       61 ctcccaaccc catccaacat ctcatcttga tggaattttg ggtcactact aggtgtatgc
      121 ctaataatcc aaattctgac tggactattc ctagccatac actacacacc agacacaacc
      181 acagcattct cttcagtcgc ccacatttgc cgagatgtaa attacgggtg attaatccgc
      241 tatcttcacg ctaacggagc ctccatattc tttatctgcc tttttatcca cgtaggtcga
      301 ggaatctact acggctccta tgtcctttca gaaacctgaa acatcggtat catcctattc
      361 cttacaacta tagcaacagc atttgtagga tatgtactac c
//
LOCUS       AKOMTCYTBW    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.jelskii mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35716
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.jelskii (MVZ catalog # 173084) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon jelskii
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon jelskii.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      109 a    117 c     55 g    117 t      3 others
ORIGIN      
        1 atggcaatta tacgaaaaac ccacccctta cttaaaatta ttaaccactc attcattgat
       61 ctacctaccc cctgcaatat ttcttcttga tggaacttcg gctcactcct aggcatttgc
      121 ctaattattc aaattctaac tggccttttc ctagcaatac actacacatc cgacacagcc
      181 acagcattct cctccgttac ccacatctgc cgagatgtaa actacggctg actcatccgc
      241 taccttcatg ctaacggagc atcyatattt tttatttgyc tattyattca cgtaggacga
      301 ggaatctact atggctccta catgctctca gagacctgaa acatcggaat cattctactc
      361 ctggccacaa tagccacagc attcgtagga tacgttcttc c
//
LOCUS       AKOMTCYTBX    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.jelskii mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35715
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.jelskii (MVZ catalog # 173083) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Akodon jelskii
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon jelskii.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      107 a    118 c     57 g    119 t
ORIGIN      
        1 atggcaatta tacgaaaaac ccacccctta cttaaaatta ttaaccactc attcattgat
       61 ctacctaccc cctgcaatat ttcttcttga tggaacttcg gctcactcct aggcatttgc
      121 ctaattattc aaatcctaac tggccttttc ctagcaatac actacacatc cgacacagcc
      181 acagcattct cctccgttac ccacatctgc cgagatgtaa actacggctg actcatccgc
      241 taccttcatg ctaacggggc atctatattt tttatttgtc tatttattca cgtaggacgg
      301 ggaatctact atggctccta catgctctca gagacctgaa acatcggaat cattctactc
      361 ctggccacaa tagccacagc attcgtagga tacgttcttc c
//
LOCUS       AKOMTCYTBY    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.jelskii mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35714
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.jelskii (MVZ catalog # 173073 and 173074) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon jelskii
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon jelskii.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      103 a    121 c     60 g    117 t
ORIGIN      
        1 atggcaatta tacgaaagac ccacccctta cttaaaatca ttaaccactc gttcattgat
       61 ctacctaccc cctgcaatat ttcttcttga tggaacttcg gctcactcct aggcatttgc
      121 ctaattattc aaattctaac tggccttttc ctagccatac actacacatc cgacacagcc
      181 acagcattct cctccgtcac ccacatctgc cgagatgtaa attacggctg actcatccgc
      241 tatcttcatg ctaacggagc atccatattt tttatttgcc tattcattca cgtggggcga
      301 gggatctact atggctccta catgctctca gagacctgaa acatcggaat tgttctactc
      361 ctgaccacaa tagccacagc attcgtagga tacgttctcc c
//
LOCUS       AKOMTCYTBZ    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  A.andinus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35713
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      A.andinus (MVZ catalog # 174062 and 174063) liver mitochondrial
            DNA.
  ORGANISM  Mitochondrion Akodon andinus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Akodon andinus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      112 a    115 c     55 g    119 t
ORIGIN      
        1 atgacaatca tacgaaaaac ccacccttta cttaaaatca ttaatcactc atttattgac
       61 ctacccgccc cctccaacat ctcatcctga tggaacttcg gctcactcct tggcatttgc
      121 ttgataattc aaattttaac gggccttttc ctagccatac attatacgtc cgacacagct
      181 acagcattct cctccgtcac ccatatctgc cgagatgtaa attacggatg actaatccgc
      241 tacttacacg ccaatggagc atctatattc tttatctgca tatttattca tgtgggacga
      301 ggaatttact acggctccta catactctca gaaacctgaa acatcggaat tattctactc
      361 ctgaccacaa tagccacagc atttgtaggc tatgtccttc c
//
LOCUS       BOMMTCYTBA    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  B.amoenus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35711
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      B.amoenus (MVZ catalog # 172878) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Bolomys amoenus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Bolomys amoenus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      122 a    107 c     50 g    122 t
ORIGIN      
        1 atgaaaatta ttcgaaaaaa tcacccacta cttaaaatta ttaaccattc attcattgat
       61 cttccaaccc catctaacat ttcatcctga tgaaacttcg gatccttact aggcatctgc
      121 ctaataatcc aaatcctaac aggcctattt ctagcaatac actatacatc agatacaacc
      181 acagcattct cctcagtcgc ccatatctgc cgagacgtga actatggatg attaatccgc
      241 tacctacacg ccaacggagc ctccatattc ttcatctgtc ttttcatcca tgtaggccga
      301 ggaatttact atggctccta tgtcctatca gaaacctgaa atattggtat tatcctattt
      361 tttacaacta tagcaacagc attcgtagga tatgtccttc c
//
LOCUS       BOMMTCYTBB    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  B.amoenus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35712
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      B.amoenus (MVZ catalog # 172879) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Bolomys amoenus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Bolomys amoenus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      122 a    106 c     50 g    123 t
ORIGIN      
        1 atgaaaatta ttcgaaaaaa tcacccacta cttaaaatta ttaaccattc attcattgat
       61 cttccaaccc catctaacat ttcatcctga tgaaacttcg gatccttact aggcatctgc
      121 ctaataatcc aaatcctaac aggcctattt ctagcaatac actatacatc agatacaacc
      181 acagcattct cctcagtcgc ccatatctgc cgagacgtga actatggatg attaatccgc
      241 tacctacacg ccaacggagc ctccatattc ttcatctgtc ttttcattca tgtaggccga
      301 ggaatttact atggctccta tgtcctatca gaaacctgaa atattggtat tatcctattt
      361 tttacaacta tagcaacagc attcgtagga tatgtccttc c
//
LOCUS       MCXMTCYTBA    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  M.mimus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35709
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      M.mimus (MVZ catalog # 171745 and 171746) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Microxus mimus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Microxus mimus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      118 a    116 c     54 g    113 t
ORIGIN      
        1 atgaaaatcc tacgaaaaaa ccacccgcta cttaaaattg ttaaccactc atttattgac
       61 ctaccaaccc catctaacat ctcttcctga tgaaattttg gatccttact aggtgcatgc
      121 ctaataatcc aaattctaac cggcctattc ctagccatac actatacatc agacacaacc
      181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc
      241 tatctacacg ctaacggagc ctctatattc tttatctgcc tcttcattca cgtaggccga
      301 ggaatttact acggctccta tgccctttca gagacctgaa acattggcat catcctattc
      361 cttacaacta tagcaacagc attcgtaggg tacgtactcc c
//
LOCUS       MCXMTCYTBB    401 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  M.mimus mitochondrial cytochrome b gene, 5' end.
ACCESSION   M35710
KEYWORDS    cytochrome b; mitochondrial DNA.
SOURCE      M.mimus (MVZ catalog # 171752 and 171753) liver mitochondrial DNA.
  ORGANISM  Mitochondrion Microxus mimus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Hesperomyini; Microxus mimus.
REFERENCE   1  (bases 1 to 401)
  AUTHORS   Smith,M.F. and Patton,J.L.
  TITLE     Variation in mitochondrial cytochrome b sequence in natural
            populations of South American akodontine rodents (Muridae:
            Sigmodontinae)
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Smith, 28-JUN-1990.
FEATURES       from  to/span     description
    pept          1  >   401     cytochrome b
BASE COUNT      116 a    116 c     56 g    113 t
ORIGIN      
        1 atgaaaatcc tacgaaaaaa ccacccgcta cttaaaattg ttaaccactc atttattgac
       61 ctaccaaccc catctaacat ctcttcctga tgaaattttg gatccttact aggtgcatgc
      121 ctaataatcc aaattctaac cggcctattc ctagccatac actatacgtc agacacaacc
      181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc
      241 tatctgcacg ctaacggagc ctctatattc tttatctgcc tcttcattca cgtaggccga
      301 ggaatttact acggctccta tgccctttca gagacctgaa acattggcat catcctattc
      361 cttacaacta tagcaacagc attcgtaggg tacgtactcc c
//
LOCUS       CRCIGHA       482 bp ds-DNA             VRT       02-AUG-1990
DEFINITION  Caiman Ig germline heavy-chain gene V region, exons 1 and 2.
ACCESSION   M34918
KEYWORDS    germline; immunoglobulin heavy-chain; variable region.
SOURCE      Caiman DNA.
  ORGANISM  Caiman crocodylus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Reptilia; 
            Archosauria; Crocodylia; Crocodylidae; Alligatorinae.
REFERENCE   1  (bases 1 to 482)
  AUTHORS   Litman,G.W., Berger,L., Murphy,K., Litman,R., Podlaski,F.,
            Hinds,K., Jahn,C.L., Dingerkus,G. and Erickson,B.W.
  TITLE     Phylogenetic diversification of immunoglobulin V-H genes
  JOURNAL   Dev. Comp. Immunol. 8, 499-514 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1       46     Ig H-chain V region precursor, exon 1
                137  /   441     Ig H-chain V region precursor, exon 2
    sigp          1       46     Ig H-chain V region signal peptide
                137      147     Ig H-chain V region signal peptide
    matp        148  /   441     Ig H-chain V region, exon 2
    IVS          47      136     Ig H-chain V region intron A
    signal      444      450     heptamer recombination signal
    signal      474      482     nonamer recombination signal
BASE COUNT      104 a    139 c    131 g    108 t
ORIGIN      
        1 atgggacttg ggctccacct gctcgtcctg gccgcagctc tgcaaggtat gtcatcttcc
       61 cacaggtcgg ggagggaatc attgcactga tgctgcccta atactagatc ctcttggatc
      121 ctaccttgct tttcaggtgc ttggtcccag gtgcagctgg tggagtccgg aggagatgtg
      181 aggaaacctg gaaactcttt gcgcctctcc tgcaaagcct cggggttcac cttcggtggc
      241 tacggcatgt tctgggtccg ccaggctcct gggaaggggc tggactgggt ggctacaatt
      301 aatactgatg gatccagcca gtggtactcc ccggccgttc aggggaaatt caccatctcc
      361 agaggcaact cccagaacat gctgtacctg cagatgagca gcctcacacc tgaggacaca
      421 gccacgtatt actgcgccag agacacagtg actcaaaccc tattcacggc aatacaaaat
      481 cc
//
LOCUS       MUSIGKACZ     203 bp ds-DNA             ROD       02-AUG-1990
DEFINITION  Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION   M35640
KEYWORDS    immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) DNA, clone 1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 203)
  AUTHORS   Nishi,M., Kataoka,T. and Honjo,T.
  TITLE     Preferential rearrangement of the immunoglobulin kappa-chain
            joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     9  >   203     Ig kappa-chain V-J region (AA at 9)
    recomb      181      182     V-region end/J-region start
BASE COUNT       45 a     57 c     50 g     50 t      1 others
ORIGIN      
        1 ggtaccngca gaagtcagat gcctccccca aactatggat ttattacaca tccaacctgg
       61 ctcctggagt cccagctcgc ttcagtggca gtgggtctgg gaactcttat tctctcacaa
      121 tcagcagcat ggagggtgaa gatgctgcca cttattactg ccagcagttt actagttccc
      181 catggacgtt cggtggaggc acc
//
LOCUS       MUSIGKADA     204 bp ds-DNA             ROD       02-AUG-1990
DEFINITION  Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION   M35641
KEYWORDS    immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) DNA, clone 2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 204)
  AUTHORS   Nishi,M., Kataoka,T. and Honjo,T.
  TITLE     Preferential rearrangement of the immunoglobulin kappa-chain
            joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     9  >   204     Ig kappa-chain V-J region (AA at 9)
    recomb      181      182     V-region end/J-region start
BASE COUNT       45 a     58 c     50 g     50 t      1 others
ORIGIN      
        1 ggtaccngca gaagtcagat gcctccccca aactatggat ttattacaca tccaacctgg
       61 ctcctggagt cccagctcgc ttcagtggca gtgggtctgg gaactcttat tctctcacaa
      121 tcagcagcat ggagggtgaa gatgctgcca cttattactg ccagcagttt actagttccc
      181 catcggacgt tcggtggagg cacc
//
LOCUS       MUSIGKADB     203 bp ds-DNA             ROD       02-AUG-1990
DEFINITION  Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION   M35642
KEYWORDS    immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) DNA, clone 3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 203)
  AUTHORS   Nishi,M., Kataoka,T. and Honjo,T.
  TITLE     Preferential rearrangement of the immunoglobulin kappa-chain
            joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     9  >   203     Ig kappa-chain V-J region (AA at 9)
    recomb      181      182     V-region end/J-region start
BASE COUNT       42 a     59 c     53 g     48 t      1 others
ORIGIN      
        1 ggtaccngca gaagccagga tcctcgccca aaccctggat ttatgacaca tccaacctgg
       61 cttctggatt ccctgctcgc ttcagtggca gtgggtctgg gacctcttgc tctctcataa
      121 tcagcagcat ggaggctgaa gatgctgcca cttattactg ccatcagcgg agtagttacc
      181 catggacgtt cggtggaggc acc
//
LOCUS       MUSIGKADC     203 bp ds-DNA             ROD       02-AUG-1990
DEFINITION  Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION   M35643
KEYWORDS    immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) DNA, clone 4.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 203)
  AUTHORS   Nishi,M., Kataoka,T. and Honjo,T.
  TITLE     Preferential rearrangement of the immunoglobulin kappa-chain
            joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     9  >   203     Ig kappa-chain V-J region (AA at 9)
    recomb      181      182     V-region end/J-region start
BASE COUNT       47 a     54 c     52 g     50 t
ORIGIN      
        1 ggtaccaaca gaagccaggg cagtctccta aactgctgat atactatgca tccaatcgct
       61 acactggagt ccctgatcgc ttcactggca gtggatatgg gacggatttc actttcacca
      121 tcagcactgt gcaggctgaa gacctggcag tttatttctg ccagcaggat tatagctctc
      181 cgtggacgtt cggtggaggc acc
//
LOCUS       MUSIGKADD     205 bp ds-DNA             ROD       02-AUG-1990
DEFINITION  Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION   M35644
KEYWORDS    immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) DNA, clone 5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 205)
  AUTHORS   Nishi,M., Kataoka,T. and Honjo,T.
  TITLE     Preferential rearrangement of the immunoglobulin kappa-chain
            joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     9  >   205     Ig kappa-chain V-J region (AA at 9)
    recomb      181      182     V-region end/J-region start
BASE COUNT       50 a     55 c     52 g     48 t
ORIGIN      
        1 ggtacctgca caagcctggc cagtctccac agctcctcat ctatgggatt tccaacagat
       61 tttctggggt gccagaccgc ttcagtggca gtggttcagg gacagatttc acactcaaga
      121 tcagcacaat aaagcctgag gacttgggaa tgtattactg cttacaaggt acacattcag
      181 cctccggacg ttcggtggag gcacc
//
LOCUS       MUSIGKADE     203 bp ds-DNA             ROD       02-AUG-1990
DEFINITION  Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION   M35645
KEYWORDS    immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) DNA, clone 6.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 203)
  AUTHORS   Nishi,M., Kataoka,T. and Honjo,T.
  TITLE     Preferential rearrangement of the immunoglobulin kappa-chain
            joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     9  >   203     Ig kappa-chain V-J region (AA at 9)
BASE COUNT       46 a     51 c     53 g     50 t      3 others
ORIGIN      
        1 ggtaccnnnc gaaaccaggg cagtctccta aactgctgat ttactgggca tccactaggg
       61 aatctggggt ccctgatcgc ttcacaggca gtggatctgg gactcatttc actctcacca
      121 tcagcagtgt gaaggctgaa gacctggcag tttattactg tcagcaatat tatagctatc
      181 cgtggacgtt cggtggaggc acc
//
LOCUS       MUSIGKADF     205 bp ds-DNA             ROD       02-AUG-1990
DEFINITION  Mouse Ig rearranged kappa-chain gene V-J region, partial cds.
ACCESSION   M35646
KEYWORDS    immunoglobulin light-chain; joining exon; kappa-immunoglobulin;
            processed gene; variable region.
SOURCE      Mouse (strain BALB/c) DNA, clone 1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 205)
  AUTHORS   Nishi,M., Kataoka,T. and Honjo,T.
  TITLE     Preferential rearrangement of the immunoglobulin kappa-chain
            joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     9  >   205     Ig kappa-chain V-J region (AA at 9)
    recomb      181      182     V-region end/J-region start
BASE COUNT       46 a     60 c     51 g     42 t      6 others
ORIGIN      
        1 ggtaccnnnn nnaaacagga cagccaccca aactcctcat ctatcttagc atccaaccta
       61 gaatctgggg tccctgccag cttcagtggc agtgggtctg ggacaggctt caccctcaac
      121 atccatcctg tggaggagtg aggatgctgc aacctattac ttccagcaca gtagggagct
      181 tcctcggacg ttcggtggag gcacc
//
LOCUS       MUSPROTCP     912 bp ss-mRNA            ROD       02-AUG-1990
DEFINITION  Mouse proteoglycan core protein mRNA, complete cds.
ACCESSION   M34603
KEYWORDS    proteoglycan core protein.
SOURCE      Mouse (strain C57B1/6, haplotype H-2b) T-lymphocyte cell line
            EL4.E1, cDNA to mRNA, clone 154.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 912)
  AUTHORS   Elliott,J.F., Pohajdak,B., Talbot,D., Miller,C.L., Helgason,C.D.,
            Bleackley,R.C. and Paetkau,V.H.
  TITLE     Cloning and characterization of an inducible proteoglycan core
            protein mRNA from a mouse T-lymphocyte cell line
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by V.H.Paetkau, 29-MAY-1990.
FEATURES       from  to/span     description
    pept         43      501     proteoglycan core protein precursor
    sigp         43      117     proteoglycan core protein signal peptide
    matp        265      498     proteoglycan core protein
BASE COUNT      252 a    200 c    194 g    266 t
ORIGIN      
        1 gatttgctaa tccagaggct gagtggagcc gagctggtca ggatgcaggt tcccgtcggc
       61 agcaggcttg tcctggctct cgccttcgtc ctggtttggg gatcttcagt gcaaggttat
      121 cctgctcgga gagccaggta ccagtgggtc cgctgcaaac cgaatggctt ttttgctaac
      181 tgcatcgagg agaagggacc acagtttgac ctaatagatg aatccaataa catcggccct
      241 cccatgaata atcctgtttt gatggaagga ccctcaaaag atttcatctc caattatgat
      301 gactatgggt caggttcggg ctccggctct ggctccggct ctggctcggg ttccggctcc
      361 ggaagtggct tcctaggtga catggaatgg gaataccagc caacagatga aagcaatatt
      421 gtctatttca actataagcc ttttgacagg attctcactg agcaaaacca agaccaacca
      481 gaagacgatt ttattatatg aatgtgacgg tctctgtctc cccacctcca tgtggaacaa
      541 tgtattcagt atacttagtg taccacgttt aaatgaccag tctcaggata aagagtttta
      601 cagaaaattt aaaatgcctg gaaaagactc ttgaatcctg ttaccccttt cctcattaac
      661 tcgtaaggaa ttatgcttta atgctgttac ctatcttgtt gttctggaaa atgcctgcat
      721 ttatgtgtat tgaatcaaca tttaagaaat taacacacac ccccattatt atacaataac
      781 tttcaaagcc atactggttt tgaaaatttt aatttgatag caagttgatg aacatctttc
      841 atacctaaag tgttcaggac ccaactcgca ttgtggatta caatatattc ctttatgtga
      901 taaaagaaat cg
//
LOCUS       PVBENH        316 bp ds-DNA             VRL       02-AUG-1990
DEFINITION  Papovavirus BKV enhancer/regulatory region.
ACCESSION   M35737
KEYWORDS    .
SOURCE      Papovavirus BKV (strain BKV-P2, from prototype Gardner ATCCVR 837)
            DNA, passed in primary human embryonic kidney cells.
  ORGANISM  Papovavirus BKV
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Papillomavirus.
REFERENCE   1  (bases 1 to 316)
  AUTHORS   Berg,D.T., Walls,J.D. and Grinnell,B.W.
  TITLE     A variant enhancer/regulatory region from a cloned human prototype
            BK virus genome
  JOURNAL   Nucleic Acids Res. 16, 9057-9057 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt          92      191     tandem repeat copy A
    rpt         192      290     tandem repeat copy B
BASE COUNT      101 a     79 c     85 g     51 t
ORIGIN      
        1 cctcagaaaa agcctccaca cccttactac ttgagagaaa gggtggaggc agaggcggcc
       61 tcggcctctt atatattata aaaaaaaagg ccacagggag gagctgctta cccatggaat
      121 gcagccaaac catgaccgca ggaaggaaag tgcatgactg ggcagccagc cagtggcagt
      181 taataagcag ccagacagac atttgcttac ccatggaatg cagccaaacc atgacctcag
      241 gaaggaaagt gcatgactgg gcagccagcc agtggcagtt aataagcagc agccagacag
      301 acatgttttg cgagcc
//
LOCUS       HUMCREB      1026 bp ss-mRNA            PRI       02-AUG-1990
DEFINITION  Human active transcription factor CREB mRNA, complete cds.
ACCESSION   M34356
KEYWORDS    active transcription factor CREB; alternative splicing.
SOURCE      Human T-cell, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1026)
  AUTHORS   Berkowitz,L.A. and Gilman,M.Z.
  TITLE     Two distinct forms of active transcription factor CREB (cAMP
            response element binding protein)
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5258-5262 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Z.Gilman, 16-MAY-1990.
FEATURES       from  to/span     description
    pept          1     1026     active transcription factor CREB-B
    pept          1      261     active transcription factor CREB-A
                304     1026     active transcription factor CREB-A
BASE COUNT      332 a    249 c    233 g    212 t
ORIGIN      
        1 atgaccatgg aatctggagc cgagaaccag cagagtggag atgcagctgt aacagaagct
       61 gaaaaccaac aaatgacagt tcaagcccag ccacagattg ccacattagc ccaggtatct
      121 atgccagcag ctcatgcaac atcatctgct cccaccgtaa ctctagtgca gctgcccaat
      181 gggcagacag ttcaagtcca tggagtcatt caggcggccc agccatcagt tattcagtct
      241 ccacaagtcc aaacagttca gtcttcctgt aaggacttaa aaagactttt ctccggaaca
      301 cagatttcaa ctattgcaga aagtgaagat tcacaggagt cagtggatag tgtaactgat
      361 tcccaaaagc gaagggaaat tctttcaagg aggccttcct acaggaaaat tttgaatgac
      421 ttatcttctg atgcaccagg agtgccaagg attgaagaag agaagtctga agaggagact
      481 tcagcacctg ccatcaccac tgtaacggtg ccaactccaa tttaccaaac tagcagtgga
      541 cagtatattg ccattaccca gggaggagca atacagctgg ctaacaatgg taccgatggg
      601 gtacagggcc tgcaaacatt aaccatgacc aatgcagcag ccactcagcc gggtactacc
      661 attctacagt atgcacagac cactgatgga cagcagatct tagtgcccag caaccaagtt
      721 gttgttcaag ctgcctctgg agacgtacaa acataccaga ttcgcacagc acccactagc
      781 actattgccc ctggagttgt tatggcatcc tccccagcac ttcctacaca gcctgctgaa
      841 gaagcagcac gaaagagaga ggtccgtcta atgaagaaca gggaagcagc tcgagagtgt
      901 cgtagaaaga agaaagaata tgtgaaatgt ttagaaaaca gagtggcagt gcttgaaaat
      961 caaaacaaga cattgattga ggagctaaaa gcacttaagg acctttactg ccacaaatca
     1021 gattaa
//
LOCUS       ACNRRDA      1486 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  A.lignieresii 16S ribosomal RNA.
ACCESSION   M35017
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      A.lignieresii (strain ATCC 19393) rRNA.
  ORGANISM  Actinobacillus lignieresii
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 1486)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1486     16S ribosomal RNA
BASE COUNT      384 a    303 c    448 g    321 t     30 others
ORIGIN      
        1 attnannant ttnatcangg ctcagattna ncgctggcgg caggcttaac acatgcaagt
       61 cgaacggtaa cgggaagggn ncttncnttc tttgccgacg agtggcggac gggtgagtaa
      121 tgcttgggaa tntggcttat ggagggggat aactacggga aactgtagct aataccgcgt
      181 aatatcttag gattaaaggg tgggactttc gggccacctg ccataagatg agcccaagtg
      241 ggattaggta gttggttagg taaaggctga ccaagccgac gatctctagc tngtctgaga
      301 ggatgaccag ccacactnga actgagacac ggtccagact cctacgggag gcagcagtng
      361 ggaatattgc acaatggggg gaaccctnat gcagccatgc cgcgtgaatg aagaaggcct
      421 tcgggttgta aagttctttc ggtagcgagg aaggtatcaa atttnataga tttngtaatt
      481 gacgttaact acagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag
      541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggtt nattaagtga
      601 gatgtgaaag ccccgggctt aacctgggna ttgcatttca tactggtcaa ctagagtact
      661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc
      721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcgaaagc gtggggagca
      781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct
      841 ntnagtctgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag
      901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg
      961 atgnaacgcg aagaacctta cctactcttg acatccatgg aatcttgtag agatacgaga
     1021 gtgccttcgg gaaccatgag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat
     1081 gttgggttna gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attnggtcgg
     1141 gaactcnaag gagactgccg gtgatnaacc ggaggaaggt ggggatgacg tcaagtcatc
     1201 atggccctta cgagtagggc tacacacgtg ctacaatggc gtatacagag ggaagcaaga
     1261 tggcgacatg gagcaaatct cacaaagtac gtctaagtcc ggattggagt ctgcaactcg
     1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc
     1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc
     1441 ttaaccgcaa ggggggcgtt taccacggta tgattcatga ctnggg
//
LOCUS       BNORRDA      1532 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  D.nodosus 16S ribosomal RNA.
ACCESSION   M35016
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      D.nodosus (strain 198A; ATCC 27521) rRNA.
  ORGANISM  Bacteroides nodosus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Anaerobic gram-negative straight, curved and helical rods; 
            Bacteroidaceae.
REFERENCE   1  (bases 1 to 1532)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1532     16S ribosomal RNA
BASE COUNT      410 a    330 c    465 g    324 t      3 others
ORIGIN      
        1 aactgaagag tttgattctg gctcagattg aacgctggcg gcatgcttaa cacatgcaag
       61 tcgaacgggg ttatgtagct tgctatgtaa cctagtggcg gacgggtgag taatatatag
      121 gaatctgcct tatggtgggg gacaacgtat ggaaacgtac gctaataccg cataagattg
      181 aagaatgaaa gcgggggctc gaaagacctc gcgccgtaag atgagcctat atcggattag
      241 ctagttggtg gggtaagagc ctaccaaggc gacgatccgt agctggtttg agagaatgat
      301 cagccacatc gggactgaga cacggcccna actcctacgg gaggcagcag tggggaatat
      361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt
      421 gtaaagcact tttattagtg aagaacggtg catggttaat acccatgcaa ttgacattag
      481 ctaaggaaaa agcaccggct nactccgtgc cagcagccgc ggtaatacgg agggtgcaag
      541 cgttattcgg aatgactggg cgtaaagcgc acgcaggtgg ttttataagt caggtgtgaa
      601 atccctgggc tcaacctagg aattgcattt gaaaccgtaa gactagagta tgtcagagga
      661 aggcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga
      721 aggcagcctt ctggggcaat actgacactc atgtgcgaaa gcgtgggtag caaacaggat
      781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggtgttgggt agaaatactc
      841 ggtaccgaag ctaacgcaat aagttgaccg cctggggagt acggccgcaa ggttgaaact
      901 caaagaaatt gacggggacc cgcacaagcg gtggagcatg tggtttaatt cgatgcaacg
      961 cgaagaacct taccaggctt tgacatccag agaattctac agagatgtgg gagtgccttc
     1021 gggaactctg agacaggtgt tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt
     1081 aagtcccgca acgagcgcaa cccttatcct tacttgccag cacgtaaagg tgggaactat
     1141 aaggagactg ccggtgacaa accggaggaa ggtggggatg acgtcaagtc atcatggccc
     1201 ttacagcctg ggctacacac gtgctacaat gggcggtaca gacggtagcc aacccgcgag
     1261 ggggagccaa tctgagaaaa ccgttcgtag tccggattgc agtctgcaac tcgactgcat
     1321 gaagtcggaa tcgctagtaa tcgcgaatca gcacgtcgcg gtgaatacgt tcccgggtct
     1381 tgtacacact gcccgtcaca ccatgggagt ttgttgcacc agaagtaggt agcttaaccg
     1441 taaggagggc gcttaccacg gtgtggccga tgactggggt gaagtcgtaa caaggtaacc
     1501 gtaggggaac ctgcggttgg atcacctcct tt
//
LOCUS       CAJSAPA      3974 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  C.fetus surface array protein (sapA) gene, complete cds.
ACCESSION   J05577
KEYWORDS    surface array protein.
SOURCE      C.fetus fetus DNA.
  ORGANISM  Campylobacter fetus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 1 to 3974)
  AUTHORS   Blaser,M.J. and Gotschlich,E.C.
  TITLE     Surface array protein of Campylobacter fetus: Cloning and gene
            structure
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.J.Blaser, 01-JUN-1990.
FEATURES       from  to/span     description
    pept         24     2825     surface array protein (sapA)
    binding      12       15     ribosomal binding site
    signal     2846     2870     transcription termination signal
BASE COUNT     1293 a    641 c    785 g   1255 t
ORIGIN      
        1 ctattttatt aaggagtcct taaatgttaa acaaaacaga tgtttcaatg ctttatatca
       61 ctattatggg tatggcaagt gagggtgatg gtaataagta ttggttagat tatgccaata
      121 ataatagttt aggagtttca agtttagcta atattatgct tgatagtcca ggggcggcta
      181 aattctttgg tgattctctt ttagcaggta atgaaaaaga ttttgttact aagatatata
      241 gtatagcttt aggtaatact agtgatgttg atggcattaa ttattggact aaggcaataa
      301 ctggcggtgg agaatttact gatagtaagg gtaatgttat tagtgttgct agtttaagca
      361 agggtgattt aataggtgct atgattaact ctatggttaa tggcggtagt gctgagtcta
      421 aggctatatt tgaggctaag gcagctgcta gtgattactt tgccgatgct actttggtaa
      481 gggatattag tggattagat gagggtacta cttctaagtt aattagcgag attaatagtg
      541 ctagtgatct tgataaggtt aagagtgaga ttgatgcttt gaagagtgag ctacctaatc
      601 cgggtagtac ttatgatctt acagagggta atgataattt aaagggtact gatttagacg
      661 atacttttaa tgggactaca tatgtaggta atggtactaa taagagtact cttagtgcat
      721 ttgataagac tagatggtcg gtgcttggga gagatacgtt gaatgcgata tttactgcaa
      781 taacacgcgc tgcgctacta actgatcaag ctgaattgat aatcactaaa aggcgtacaa
      841 acgtagaaaa tatcaatata atttcagatc tagaaacaag tggcgatttc gttttcaacg
      901 gttatgaaaa agtaggattt aacgtacttg gcgatatagt tagctttgct accgacgcat
      961 ctaaaagcgt aaatgtagaa acaacaggaa cgataactgc tttcaccgca gccggaacag
     1021 gcaaagtcga tgttgtcgcc ggtaaaatct ctgcccttac ggccgattcg cgaacaagcg
     1081 taaatttaac tgctacaaac gacactatca cattaaccag tgcaaacgct gctactagtg
     1141 tgaatttaaa acagcggcag gccaaagacg ctacaataac atccgcaatg cagcaaaaat
     1201 ataacaatag acgcaacagg attgcaacta taacttcagc tacggctgta gagaatttga
     1261 cagttaaaca tgcaactaac gtagcgctaa atggtggcat ggataaactt gcaacagtta
     1321 ctcttgacaa tgctgcttta actgctgcaa tagatataaa atctgcaagc acactaaatt
     1381 taataaattc aagtgttaac ggaccaaaac atctctacag cagcaaaaga cgttactgta
     1441 aatttaagcg agctgctgct aaggttaaat taaacacaac tgctgctacg gatcaaactg
     1501 tgactcttaa agctaatgcc acagacaata gtcttgaatt tgatagcgca actgcaaaaa
     1561 ctacgtcagt cactgctagt ggtagtggaa aaactttggt tatcaaaggt gctgaagtag
     1621 aaactctagt taatatagac actactgcgt tcaatgcgct gcagagtgtt agcttcggca
     1681 aaactggtca aggcggcaaa tttagcgtaa aaacaggtac aggcgatgat aaaatagagt
     1741 ttgtaggcac aactttaact gaaggttcag taatcgatgc gccgggtaat gatactatag
     1801 ctatgaaatc agctgctttg actagtgcaa atttcactat gatcaaaaac atagagaatg
     1861 ttgctattag cgacgcggtg gctacggctg accttagctc ttctgcgttt aaaaactcag
     1921 tcatcataac tactaaagag gctgccgata cgactttaac tataaataaa gaccaagtaa
     1981 ttaattttac tgcggcagat gcaggctcag ttaagcttat tacagtgaaa ctaaatgatg
     2041 taacagcgct aatgattgtt gttaaaatag tattagatgc tgccgctaaa gatactaata
     2101 tagctttggg cacagcagca gccgataaag ctcttgtaat tgatacgggt atagaaactc
     2161 ttaatatcac ttcgttagtt aaagctacaa gccctgaaac tacagcaaat accgtaaatg
     2221 ctaaattgac agatgtaact tctatcatta tagatgggat gcaaattact ttaggtcatg
     2281 ccggcactgc tggaactgac tatagtaaag tttctatgat agatgcaagt gctcttaaag
     2341 ctggtcttac atttgatgca agtgctataa cattaggagc taatgctact ataaaaggtg
     2401 gctctggagc tgatagtata actgttaaag gcggaaatat cgttgtagat ttagttgctg
     2461 gcggagacga tactataact cttaaaaaag gtgctgagaa aacagatatt acaaccgtta
     2521 ataactttaa tgctggcgat aagatagata tagctgatgc caaaaatggt actttcacat
     2581 ttaataaaat tacaatgaat agcgatgcta atttagatga ttatatcact aaagcagtag
     2641 caggagacgg atcaacaaac tctgctgtaa gctactcaca taatggatat acttatgtag
     2701 tagttgatgg aacagctgga gctactttca ctaaagctac tgatacaatt atcaaacttt
     2761 caggtacgct agatcttaaa cttagcggag ataatgttgt agttgatgat ggaagcgtaa
     2821 tttaagcaat agtaattgct tgataaaaaa gccttaattt ttaaggcttt taatttgtct
     2881 aaatttatct tagcttaggc gtatttattt gaaatctcgt cagacggatc gtcttaaaat
     2941 aaggttttat agagacaaag tatattttag accaaacctt aataaatagt agatagtttt
     3001 ttgctaagag gagctaggag aactagctcc ttgaatattt ttaaatttat ttgagttgta
     3061 ttttaaattc aacatatctg ttttttgatg ttttgatcga aatttgtcta ttgtatccat
     3121 cctaaaaatt tcagctcttc aactcttttt gtgtgaaatt gttctaataa ggttgataaa
     3181 ttttcatatt tgtcacttgg aacaagaaaa ctctcgtctc cgactctttt caccaatttt
     3241 tgtttataaa actttaaatt taaaggtttt ataagatcta tatatgaact aaactcatct
     3301 ttatagctat tagctatact aaaagcgatc tgcttgaact ctttgaaatc aacgtttaaa
     3361 attttacaaa gcggcttaaa atcgtttagc ctaatgtttt gacgcttacc gtttataggt
     3421 aaatagctct caaatccttt tgccgtacta taaaatcccg ctgctcatca cgtgcataca
     3481 taggcgcatc agcactatat ctccgcttgt tataagagaa agatttttca gcatgcagat
     3541 cttccgtatt ttatgataac gctataaaca tatatcttca aaagctcctt cttttcagta
     3601 tctgatagta gctctttgct tatgcgctta aaaagctttt cgctggttgt gtcatattta
     3661 tttatactag tagtcctaga tacgaggcaa actcgacgac gttataatga tgattttcta
     3721 gtttatcaaa tcttttaatt acgtagtgat actcgtcgtc atctcctgcc actacagcac
     3781 ttcacggaac tctgattttc aacgtatttt tagcaaaact caaatgaaga tgctcattta
     3841 tcagcgaggt gcggaaaata acttttgaac tggttggatt tgatctgatt ttcgaatatg
     3901 gttttaaaat atagtttgaa gcagcgtctt gatggattat ctttcgttca aaatcgatat
     3961 ttactaattt tttg
//
LOCUS       CDBRRDA      1470 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  C.hominis 16S ribosomal RNA.
ACCESSION   M35014
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      C.hominis (strain ATCC 16826) rRNA.
  ORGANISM  Cardiobacterium hominis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods.
REFERENCE   1  (bases 1 to 1470)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1470     16S ribosomal RNA
BASE COUNT      373 a    331 c    466 g    296 t      4 others
ORIGIN      
        1 actgaagagt ttgattctgg ctcagattga acgctggcgg catgcttaac acatgcaagt
       61 cgaacggaaa cgatggagct tgctccaggc gtcgagtggc gaacgggtga gtaacgcatg
      121 ggaatctgcc ttttgctggg ggataacgta gggaaactta cgctaatacc gcataacacc
      181 taagggtgaa agcgggggac cgaaagcctc gcggcaagag atgagcccat gttggattag
      241 ctagttggtg gggtaaaggc ctaccaaggc gacgatccat agctggtctg agaggatgat
      301 cagccacact gggactgaga cacggcccag actccnacgg gaggcagcag tggggaatat
      361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt
      421 gtaaagcact ttcagtaggg aggaaaggtg cgtagttaat acctgcgcaa ttgacgttac
      481 ctacagaaga agcaccggct aactccgtgc cagcagccgc ggtaatacgg agggtgcgag
      541 cgttattcgg aattactggg cgtaaagcgc acgcaggcgg ttgcccaagt cagatgtgaa
      601 agccccgggc ttaacctggg aactgcattt gaaactgggc gactagagta tgaaagagga
      661 aagcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga
      721 aggcagcttt ctgggtcgat actgacgctc atgtgcgaaa gcgtggggag caaacaggat
      781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggcgtcgggt tgttaaagac
      841 tcggtgcngg agctaacgca ttaagttgac cgcctgggga gtacggccgc aaggttgaaa
      901 ctcaaagaaa ttgacgggga cccgcncaag cggtggagca tgtggtttaa ttcgatgcaa
      961 cgcgaagaac cttaccaggc cttgacatcc taggaacttg gcagagatgc cttggtgcct
     1021 tcgggaacct agagacaggt gttgcatggc tgtcgtcagc tcgtgtcgtg agatgttggg
     1081 ttaagtcccg caacgagcgc aacccttatc cttacttgcc agcactttgg gtgggaactg
     1141 taaggagact gccggtgaca aaccggagga aggtggggat gacgtcaagt catcatggcc
     1201 cttacggcct gggctacaca cgtgctacaa tggtcggtac agacggtagc gaagccgcga
     1261 ggtggagcca atctgagaaa gccgatcgta gtccggattg cagtctgcaa ctcgactgca
     1321 tgaagtcgga atcgctagta atcgcgaatc agcatgtcgc ggtgaatacg ttcccgggtc
     1381 ttgtacacac tgcccgtcac accatgggag tttgttgcac cagaagcagg tagcttaact
     1441 taggagggcg cttgccacgg tgtggccgat
//
LOCUS       GLARGBA       763 bp ds-DNA             INV       02-AUG-1990
DEFINITION  G.lamblia 5.8S rRNA gene.
ACCESSION   M35013
KEYWORDS    5.8S ribosomal RNA; large subunit rRNA; small subunit rRNA.
SOURCE      G.lamblia DNA, clones B4 and C4.
  ORGANISM  Giardia lamblia
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE   1  (bases 1 to 763)
  AUTHORS   Edlind,T.D., Sharetzsky,C. and Cha,M.E.
  TITLE     Ribosomal RNA of the primitive eukaryote Giardia lamblia: Large
            subunit domain I and potential processing signals
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.D.Edlind, 04-JUN-1990.
FEATURES       from  to/span     description
    rRNA        177      321     5.8S ribosomal RNA
    rRNA    <     1      146     small subunit ribosomal RNA
    rRNA        372  >   763     large subunit ribosomal RNA
BASE COUNT      121 a    293 c    293 g     56 t
ORIGIN      
        1 gtacacaccg cccgtcgctc ctaccgactg ggcgcggcgg cgagcgcccc ggacgcgcga
       61 agggccgcga gcccccgcgc ctggaggaag gagaagtcgt aacaaggtat ccgtaggtga
      121 acctgcggat ggatccctcg cgcgccccgc gcgtcgcccc cgcggcccgg tcggcccccg
      181 aacgccccgc cggcggatgc ctcggcccgg gcggcgacga agagcgcggc ggagcgcgag
      241 acgcggtgcg gacccgcccg ccccgagaag caccgaccct cgaacgcagc gcgccccggc
      301 gccgccgcct cggcgcccgc cgcgtgccgc gcgccgcgcc gcagagagcg ccccggggcg
      361 gtcccgccgg gctgcgcggc ccgaggcggc gggggcgacg ggcggaactt aagcatatca
      421 gtacgccccg gaggagaaac caaccgggat tccccgtagc ggcgagcgac gcgggaggag
      481 cccgccccga aggcgcgctg tggggcgcag gcgcaggccc gccgcgaggg ggcccgaggg
      541 ccccgcccga gagggtgcaa gccccgtacg gcggccggcg cctgcgcgcg agtagcgctg
      601 cttgagcgtg cagcgcggaa gggaggcgcg gcccttccaa ggctaaatac gccccgggac
      661 cgatagcgga ccaagtagcg cgagcgaacg gtgaaaagga cgcctgcggc cgctcaaaag
      721 acctgaaccg ccgccgcgcc gccggccccg tctcgaaaca cgg
//
LOCUS       HEIRRDA      1472 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  H.influenzae 16S ribosomal RNA.
ACCESSION   M35019
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      H.influenzae (strain ATCC 33391) rRNA.
  ORGANISM  Haemophilus influenzae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 1472)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1472     16S ribosomal RNA
BASE COUNT      376 a    300 c    459 g    311 t     26 others
ORIGIN      
        1 attgaagagt ttgatcatgg ctcagattga acgctggcgg caggcttaac acatgcaagt
       61 cgaacggtag caggagaaag cttgctttct tgctgacgag tggcggacgg gtgagtaatg
      121 cttgggaatc tggcttatgg agggggataa cgacgggaaa ctgtcgctaa taccgcgtat
      181 tatcggaaga tgaaagtgcg ggactgagag gccgcatgcc ataggangag cccaagtggg
      241 attaggtagt tggtggggta aatgcctacc aagcctncga tctctagctg gtctgagang
      301 atgaccagcc acactggaac tgagacacgg tccagactcc tacgggaggc agcagtgggg
      361 aatattgcgc natgggggga accctgacgc agccatgccg cgtgaatgaa gaaggccttc
      421 gggttgtaaa gttctttcgg tattgaggaa ggttgatgtg ttaatagcac atcaaattga
      481 cgttaaatac agaagaagca ccggctnact ccgtgccagc agccgcggta atacggaggg
      541 tgcgagcgtt aatcggaata actgggcgta aagggcacgc aggcggttat ttaagtgagg
      601 tgtgaaagcc ccgggcttaa cctgggnatt gcatttcaga ctgggtaact agagtacttt
      661 agggaggggt agaattccac gtgtagcggt gaaatgcgta gagatgtgga ggaataccga
      721 aggcgaaggc agccccttgg gaatgtactg acgctcatgt gcgaaagcgt ggggagcaaa
      781 caggattaga taccctggta gtccacgctg taaacgctgt cgatttnggg gttggggttn
      841 aactctggca cccgtagcta acgtgataaa tcgaccgcct ggggagtacg gccgcaaggt
      901 taaaactcaa atgaattgac gggggccngc ncaannggtg gagcatgtgg tttaattcga
      961 tgnaacgcga agaaccttac ctactcttga catcctaaga agagctcaga gatgagcttg
     1021 tgccttcggg aacttagaga caggtgctgc atggctgtcg tcagctcgtg ttgtgaaatg
     1081 ttgggttnag tcccgcaacg agcgcaaccc ttatcctttg ttgccagcga cttggtcggg
     1141 aactcaaagg agactgccag tgataaactg gaggaaggtn gggatgacgt caagtcatca
     1201 tggcccttac gagtagggct acacacgtgc tacaatggcg tatacagagg gaagcgaagc
     1261 tgcgaggtgg agcgaatctc ataaagtacg tctaagtccg gattggagtc tgcaactcga
     1321 ctccatgaag tcggaatcgc tagtaatcgc gaatcagaat gtcgcggtga atacgttccc
     1381 gggccttgta cacaccgccc gtcacaccnt gggagtgggt tgtaccagaa gtagatagct
     1441 naacctnnnn gagggcgtnn ncnacggtan ga
//
LOCUS       HUMIBSUB     3110 bp ss-mRNA            PRI       02-AUG-1990
DEFINITION  Human integrin beta-5 subunit mRNA, complete cds.
ACCESSION   M35011
KEYWORDS    integrin.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3110)
  AUTHORS   Suzuki,S., Huang Z,-S. and Tanihara,H.
  TITLE     Cloning of an integrin beta subunit exhibiting high homology with
            integrin beta-3 subunit
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5354-5358 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Suzuki, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         30     2420     integrin beta-5 subunit
BASE COUNT      726 a    809 c    887 g    688 t
ORIGIN      
        1 cgcgccgccg ctgagggagg cgccccacca tgccgcgggc cccggcgccg ctgtacgcct
       61 gcctcctggg gctctgcgcg ctcctgcccc ggctcgcagg tctcaacata tgcactagtg
      121 gaagtgccac ctcatgtgaa gaatgtctgc taatccaccc aaaatgtgcc tggtgctcca
      181 aagaggactt cggaagccca cggtccatca cctctcggtg tgatctgagg gcaaaccttg
      241 tcaaaaatgg ctgtggaggt gagatagaga gcccagccag cagcttccat gtcctgagga
      301 gcctgcccct cagcagcaag ggttcgggct ctgcaggctg ggacgtcatt cagatgacac
      361 cacaggagat tgccgtgaac ctccggcccg gtgacaagac caccttccag ctacaggttc
      421 gccaggtgga ggactatcct gtggacctgt actacctgat ggacctctcc ctgtccatga
      481 aggatgactt ggacaatatc cggagcctgg gcaccaaact cgcggaggag atgaggaagc
      541 tcaccagcaa cttccggttg ggatttgggt cttttgttga taaggacatc tctcctttct
      601 cctacgcggc accgaggtac cagaccaatc cgtgcattgg ttacaagttg tttccaaatt
      661 gcgtcccctc ctttgggttc cgccatctgc tgcctctcac agacagagtg gacagcttca
      721 atgaggaagt tcggaaacag agggtgtccc ggaaccgaga tgcccctgag gggggctttg
      781 atgcagtact ccaggcagcc gtctgcaagg agaagattgg ctggcgaaag gatgcactgc
      841 atttgctggt gttcacaaca gatgatgtgc cccacatcgc attggatgga aaattgggag
      901 gcctggtgca gccacacgat ggccagtgcc acctgaacga ggccaacgag tacactgcat
      961 ccaaccagat ggactatcca tcccttgcct tgcttggaga gaaattggca gagaacaaca
     1021 tcaacctcat ctttgcagtg acaaaaaacc attatatgct gtacaagaat tttacagccc
     1081 tgatacctgg aacaacggtg gagattttag atggagactc caaaaatatt attcaactga
     1141 ttattaatgc atacaatagt atccggtcta aagtggagtt gtcagtctgg gatcagcctg
     1201 aggatcttaa tctcttcttt actgctacct gccaagatgg ggtatcctat cctggtcaga
     1261 ggaagtgtga gggtctgaag attggggaca cggcatcttt tgaagtatca ttggaggccc
     1321 gaagctgtcc cagcagacac acggagcatg tgtttgccct gcggccggtg ggattccggg
     1381 acagcctgga ggtgggggtc acctacaact gcacgtgcgg ctgcagcgtg gggctggaac
     1441 ccaacagcgc caggtgcaac gggagcggga cctatgtctg cggcctgtgt gagtgcagcc
     1501 ccggctacct gggcaccagg tgcgagtgcc aggatgggga gaaccagagc gtgtaccaga
     1561 acctgtgccg ggaggcagag ggcaagccac tgtgcagcgg gcgtggggac tgcagctgca
     1621 accagtgctc ctgcttcgag agcgagtttg gcaagatcta tgggcctttc tgtgagtgcg
     1681 acaacttctc ctgtgccagg aacaagggag tcctctgctc aggccatggc gagtgtcact
     1741 gcggggaatg caagtgccat gcaggttaca tcggggacaa ctgtaactgc tcgacagaca
     1801 tcagcacatg ccggggcaga gatggccaga tctgcagcga gcgtgggcac tgtctctgtg
     1861 ggcagtgcca atgcacggag ccgggggcct ttggggagat gtgtgagaag tgccccacct
     1921 gcccggatgc atgcagcacc aagagagatt gcgtcgagtg cctgctgctc cactctggga
     1981 aacctgacaa ccagacctgc cacagcctat gcagggatga ggtgatcaca tgggtggaca
     2041 ccatcgtgaa agatgaccag gaggctgtgc tatgtttcta caaaaccgcc aaggactgcg
     2101 tcatgatgtt cacctatgtg gagctcccca gtgggaagtc caacctgacc gtcctcaggg
     2161 agccagagtg tggaaacacc cccaacgcca tgaccatcct cctggctgtg gtcggtagca
     2221 tcctccttgt tgggcttgca ctcctggcta tctggaagct gcttgtcacc atccacgacc
     2281 ggagggagtt tgcaaagttt cagagcgagc gatccagggc ccgctatgaa atggcttcaa
     2341 atccattata cagaaagcct atctccacgc acactgtgga cttcaccttc aacaaatcct
     2401 acaatggcac tgtggactga tgtttccttc tccgaggggc tggagcgggg atctgatgaa
     2461 aaggatcaga ctgaaacgcc ttgcacggct gctcggcttg atcacagctc cctaggtagg
     2521 caccacagag aagaccttct agtgagcctg ggccaggagc ccacagtgcc tgtacaggaa
     2581 ggtgcctggc catgtcacct ggctgctagg ccagagccat gccaggctgc gtccctccga
     2641 gcttgggata aagcaagggg accttggcgc tctcagcttt ccctgccaca tccagcttgt
     2701 tgtcccaatg aaatactgag atgctgggct gtctctccct tccaggaatg ctgggccccc
     2761 agcctggcca gacaagaaga ctgtcaggaa gggtcggagt ctgtaaaacc agcatacagt
     2821 ttggcttttt tcacattgat catttttata tgaaataaaa agatcctgca tttatggtgt
     2881 agttctgagt cctgagactt ttctgcgtga tggctatgcc ttgcacacag gtgttggtga
     2941 tggggctgtt gagatgcctg ttgaaggtac atcgtttgca aatgtgagtt tcctctcctg
     3001 tccgtgtttg tttagtactt ttataatgaa aagaaacaag attgtttggg attggaagta
     3061 aagattaaaa ccaaaagaat ttgtgtttgt ctgataaaaa aaaaaaaaaa
//
LOCUS       KINRRDAA     1474 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  K.indologenes 16S ribosomal RNA.
ACCESSION   M35015
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      K.indologenes (strain ATCC 25869) rRNA.
  ORGANISM  Kingella indologenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1474)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1474     16S ribosomal RNA
BASE COUNT      375 a    322 c    451 g    315 t     11 others
ORIGIN      
        1 ttgaactgaa gagtttgatt ctggctcaga ttgaacgctg gcggcatgct taacacatgc
       61 aagtcgaacg agggaagcag cttgctgcng acctngtggc ggacgggtga gtaacgtata
      121 ggaatctacc ttggggtggg ggataacgta tggaaacgta cgctaatacc acataacact
      181 tacgagtcaa agcgggggat cttcggacct cgcgccctta gatgagccta tattggatta
      241 gctagttggt rgggtaaagg cctaccaagg cgacgatcca tagctggttt gagaggatga
      301 tcagccacat cgggactgag acacggcccn nactcctacg ggaggcagca gtggggaata
      361 ttggncnatg ggggcaaccc tgatccagca atgccgcgtg tgtgaagaag gccttngggt
      421 tgtaaagcac tttcgttagt gaagaaaggt ttatggttaa tacctgtgaa tttgacatta
      481 gctaaagaag aagcaccggc taactccgtg ccagcagccg cggtaatacg gagggtgcna
      541 gcgttattcg gaatgactgg gcgtaaagcg cacgcaggcg gttatttaag tcagatgtga
      601 aagccccggg cttaacctgg gnattgcatt tgaaactggg taactagagt atgtgagagg
      661 aaggcggaat ttccagtgta gcagtgaaat gcgtagagat tggaaggaac accgatggcg
      721 aaggcagcct tctggggcaa tactgacgct catgtgcgaa agcgtgggta gcaaacagga
      781 ttagataccc tggtagtcca cgccctaaac gatgtcaact aggtgttggg cacttaagtg
      841 ctcggtaccg cagctaacgc attaagttga ccgcctgggg agtacggccg caaggttgaa
      901 actcaaagaa attgacgggg acccgcacaa gcggtggagc atgtggttta attcgatgca
      961 acgcgaagaa ccttaccagg ccttgacatc cagagaagtt accagagatg gtttcgtgcc
     1021 ttcgggaact ctgagacagg tgttgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg
     1081 gttaagtccc gcaacgagcg caacccttat ccttacttgc cagcacttcg ggtgggaact
     1141 ataaggagac tgccggtgac aaaccggagg aaggtgggga tgacgtcaag tcatcatggc
     1201 ccttacggcc tgggctacac acgtgctaca atggtcggta cagacggttg ccaacccgcg
     1261 agggggagct aatctgagaa agccgatcgt agtccggatt gcactctgca actcgagtgc
     1321 atgaagtcgg aatcgctagt aatcgcgaat cagaacgtcg cggtgaatac gttcccgggt
     1381 cttgtacaca ctgcccgtca caccatggga gtgggttgca ccagaagtag ctagcttatc
     1441 agggcggtta ccacggtgtg attcangact aggg
//
LOCUS       MUSTNFBG     3219 bp ds-DNA             ROD       02-AUG-1990
DEFINITION  Mouse tumor necrosis factor-beta (lymphotoxin) gene.
ACCESSION   Y00137
KEYWORDS    lymphotoxin; signal peptide; tumor necrosis factor.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3219)
  AUTHORS   Gray,P.W., Chen,E., Tang,W.-L. and Ruddle,N.
  TITLE     The murine tumor necrosis factor-beta (lymphotoxin) gene sequence
  JOURNAL   Nucleic Acids Res. 15, 3937-3937 (1987)
  STANDARD  simple automatic
REFERENCE   2  (sites for [1])
  AUTHORS   Fashena,S.J., Tang,W.-L., Sarr,T. and Ruddle,N.H.
  TITLE     The murine lymphotoxin gene promoter: Characterization and negative
            regulation
  JOURNAL   J. Immunol. 145, 177-183 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept       1632     1727     lymphotoxin precursor, exon 2 (first expressed
                                 exon)
               1811     1910     lymphotoxin precursor, exon 3
               2135     2547     lymphotoxin precursor, exon 4
    sigp       1632     1727     lymphotoxin signal peptide
               1811     1813     lymphotoxin signal peptide
    matp       1814     1910     lymphotoxin
               2135     2547     lymphotoxin
    pre-msg     685     3121     lymphotoxin mRNA and introns (minor alt.)
    pre-msg     857     3121     lymphotoxin mRNA and introns (minor alt.)
    pre-msg     865     3121     lymphotoxin mRNA and introns (minor alt.)
    pre-msg    1114     3121     lymphotoxin mRNA and introns (minor alt.)
    pre-msg    1179     3121     lymphotoxin mRNA and introns (major alt.)
    IVS        1286     1622     put. intron I
    IVS        1728     1810     intron II
    IVS        1911     2134     intron III
    signal     1153     1158     put. TATA-box
BASE COUNT      724 a    971 c    744 g    780 t
ORIGIN      
        1 tgaaagctcc ctctgtacag agcattggaa gcctggggtg tacatttggg gttacatgat
       61 cttggggttc taagagaata cccccaaatc atcttccaga cctggaacat tctaggacag
      121 ggttctcaac cttcctaact ccatgaccct ttaatacagt tcctcatgtt gtggtgaccc
      181 caaccataca attattttcg ttgctatttc ataactgtaa tttcgctgct attatgaaca
      241 taatgtaaat atttgtttta aatagaggtt tgccaaagag accttgccac aggttgagac
      301 tgccgctcca gagagtaagg gacacattaa aattgttaca caccagatcc cccaaatttg
      361 gggagagggc actgtaatgg aacttcttga cattaaactg gcagataaac tggcagaaaa
      421 aaaaaaaaaa aagctgggca gtggtggcac acacctttaa tcccagcact tgggaggcag
      481 aggcaggcgg atttctgagt tctaggccag cctggtcgac agagtgagtt tcaggacagc
      541 cagggctaca cagagaaacc ctgtctcgaa aaaagcaaaa aaaaaaaaaa aaaactggca
      601 gatgaccaga aaatacagat atattggaat aactgtgact tgaaccccca aagacaagag
      661 aggaaatagg cctgaagggg cggcaggcat gtcaagcatc cagagccctg ggttcgaacc
      721 tgaaaaaaca aaggtgccgc taaccacatg tggcttcgga gccctccaga catgaccatg
      781 atcgacagag agggaaatgt gcagagaagc ctgtgagcag tcaagggtgc agaagtgata
      841 taaaccatca ctcttcaggg aaccaggctt ccagtcacag cccagctgta ccctctccac
      901 gaattgctcg gccgttcact ggaactcctg ggcctgaccc agctccctgc tagtccctgc
      961 ggcccacagt tccccggacc cgactccctt tcccagaacg cagtagtcta agcccttagc
     1021 ctgcggttct ctcctaggcc ccagcctttc ctgccttcga ctgaaacagc agcatcttct
     1081 aagcctgggg cttccccaag ccccagcccc gacctagaac ccgcccgctg cctgccacac
     1141 tgccgcttcc tctataaagg gacccgacgc cagcgcccag gaccccgcac agcaggtgag
     1201 cctctcctac cctgtctcct tgggcttacc ctggtatcag gcatccctca ggatccccag
     1261 ccttaatggg tctggtcctc ctgtcgtggc tttgattttt ggtctgttcc tgtggcggcc
     1321 ttatcagtct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct
     1381 ctctctctct ctctttctct ctctctgcct ctgttagcca ttgtctgttt ctatggtgga
     1441 gctttcctct tcccctctgt ctctccttat ccctgctcac ttcagggttc ccctgcctgt
     1501 ccccttttct gtctgtcgcc ctgtctctca gggtggctgt ctcagctggg aggtaaggtc
     1561 tgtcttcctc tgtgtgcccc gcctccgcta cacacacaca ctctctctct ctctctcagc
     1621 aggttctcca catgacactg ctcggccgtc tccacctctt gagggtgctt ggcacccctc
     1681 ctgtcttcct cctggggctg ctgctggccc tgcctctagg ggcccaggtg aggcagcaag
     1741 agattggggg tgctggggtg gcctagctaa ctcagagtcc tagagtcctc tccactctct
     1801 tctgtcccag ggactctctg gtgtccgctt ctccgctgcc aggacagccc atccactccc
     1861 tcagaagcac ttgacccatg gcatcctgaa acctgctgct caccttgttg gtaaacttct
     1921 gcctccagag gagaggtcca gtccctgcct tttgtcctac ttgcccaggg gcccaggcga
     1981 tcttcccatc tccccacacc aacttttctt accctaaggg caggcacccc actcccaatc
     2041 tccctaccaa ccatcccact tgtccagtgc ctgctcctca gggatgggga cctctgatct
     2101 tgatagcccc ccaatgtctt gtgcctcttc ccagggtacc ccagcaagca gaactcactg
     2161 ctctggagag caagcacgga tcgtgccttt ctccgacatg gcttctcttt gagcaacaac
     2221 tccctcctga tccccaccag tggcctctac tttgtctact cccaggtggt tttctctgga
     2281 gaaagctgct cccccagggc cattcccact cccatctacc tggcacacga ggtccagctc
     2341 ttttcctccc aatacccctt ccatgtgcct ctcctcagtg cgcagaagtc tgtgtatccg
     2401 ggacttcaag gaccgtgggt gcgctcaatg taccaggggg ctgtgttcct gctcagtaag
     2461 ggagaccagc tgtccaccca caccgacggc atctcccatc tacacttcag ccccagcagt
     2521 gtattctttg gagcctttgc actgtagatt ctaaagaaac ccaagaattg gattccaggc
     2581 ctccatcctg accgttgttt caagggtcac atccccacag tctccagcct tccccactaa
     2641 aataacctgg agctctcacg ggagtctgag acacttcagg ggactacatc ttccccaggg
     2701 ccactccaga tgctcagggg acgactcaag cctacctaga agttcctgca cagagcaggg
     2761 tttttgtggg tctaggtcgg acagagacct ggacatgaag gagggacaga catgggagag
     2821 gtggctggga acaggggaag gttgactatt tatggagaga aaagttaagt tatttattta
     2881 tagagaatag aaagagggga aaaatagaaa gccgtcagat gacaactagg tcccagacac
     2941 aaaggtgtct cacctcagac aggacccatc taagagagag atggcgagag aattagatgt
     3001 gggtgaccaa ggggttctag aagaaagcac gaagctctaa aagccagcca ctgcttggct
     3061 agacatccac agggaccccc tgcaccatct gtgaaaccca ataaacctct tttctctgag
     3121 attctgtctg cttgtgtctg tcttgcgttg ggggagaaac ttcctggtct ctttaaggag
     3181 tggagcaggg gacagaggcc tcagttggcc atgggatcc
//
LOCUS       NGORRDAA     1478 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  N.denitrificans 16S ribosomal RNA.
ACCESSION   M35020
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      N.denitrificans (strain ATCC 14686) rRNA.
  ORGANISM  Neisseria denitrificans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1478)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1478     16S ribosomal RNA
BASE COUNT      380 a    331 c    448 g    308 t     11 others
ORIGIN      
        1 aacataagag tttgatcctg gctcagattg aacgctggcg gcatgcttta cacatgcaag
       61 tcgaacggca gcggagaagt gcttgcactt ctgccggcga gtggcgaacg ggtgagtaac
      121 atattggaac gtaccgagta atgggggata actaatcgaa agattagcta ataccgcata
      181 cgtcctgagg gagaaagcgg gggatcttcg gacctcgcgt tattcgagcg gccaatatct
      241 gattagctag ttggtggggt aaaggcctac caaggcgacg atcagtagcg ggtctgagag
      301 gatgatccgc cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg
      361 gaattttgga cgatgggcgc aagcctgatc cagccatgcc gcgtgtatga agaaggcctt
      421 cgggttgtaa agtacttttg tnngggaaga aaagggattt gctaataccg aatcctcatg
      481 acggtacctn aagaataagc accggctaac tacgtgccag nanccgcggn natacgtagg
      541 gtgcgagcgt taatcggaat tactgggcgt aaagcgagcg cagacggtta cttaagcagg
      601 atgtgaaatc cccgagctca acttgggaac tgcgttctga actgggtagc tagagtatgt
      661 cngagggggg tagaattcca cgtgtagcag tgaaatgcgt agagatgtgg aggaataccg
      721 atggcgaagg cagccccctg ggataatact gacgttcatg ctcgaaagcg tgggtagcaa
      781 acaggattag ataccctggt agtccacgcc ctaaacgatg tcgattagct gttggggtac
      841 ttgatacctt agtagcgtag ctaacgcgtg aaattgaccg cctggggagt acggtcgcaa
      901 gattaaaact caaaggaatt gacggggacc cgcacaagcg gtggatgatg tggattaatt
      961 cgatgcaacg cgaagaacct tacctggtct tgacatgtac ggaatctccc agagacggga
     1021 gngtgccttc gggagccgta acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag
     1081 atgttgggtt aagtcccgca acgagcgcaa cccttgtcat tagttgccat cattcagttg
     1141 ggcactctaa tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtcaagtcct
     1201 catggccctt atgaccaggg cttcacacgt catacaatgg tcggtacaga gggtagccaa
     1261 gccgcgaggt ggagccaatc tcacaaaacc gatcgtagtc cggattgcac tctgcaactc
     1321 gagtgcatga agtcggaatc gctagtaatc gcaggtcagc atactgcggt gaatacgttc
     1381 ccgggtcttg tacacaccgc ccgtcacacc ntgggagtgg gggataccag aagtaggtag
     1441 ggtaaccgca aggagcccnc ttaccacggt atacttca
//
LOCUS       PASRRDA      1481 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  P.multocida 16S ribosomal RNA.
ACCESSION   M35018
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      P.multocida (strain NCTC 10322) rRNA.
  ORGANISM  Pasteurella multocida
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 1481)
  AUTHORS   Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I.
  TITLE     Transfer of Kingella indologenes (Snell and Lapage 1976) to the
            genus Suttonella gen. nov. as Suttonella indologenes comb. nov.;
            transfer of Bacteroides nodosus (Beveridge 1941) to the genus
            Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and
            assignment of the genera Cardiobacterium, Dichelobacter and
            Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of
            Proteobacteria based upon 16S ribosomal ribonucleic acid sequence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.E.Dewhirst, 05-JUN-1990.
            
              Author address: F.E.Dewhirst
              Forsyth Dental Center
              140 Fenway
              Boston, MA 02115
            
FEATURES       from  to/span     description
    rRNA          1     1481     16S ribosomal RNA
BASE COUNT      374 a    305 c    459 g    321 t     22 others
ORIGIN      
        1 attgnagagt ttgancangg ctcagattga ncgctggcgg caggcttaac acatgcaagt
       61 cgaacggtag caggaagaaa gcttnctttc tttgctgacg agtggcggac gggtgagtaa
      121 tgcttgggna tctggcttat ggagggggat aactgtggga aactgcagct aataccgcgt
      181 attctcttag gaggaaaggg tgggaccttn gggccacctg ccataagatg agcccaagtg
      241 ggattaggta gttggtgggg taaaggccta ccaagcctnc gatctctagc tngtctgaga
      301 ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg
      361 ggaatattgc gcaatggggg gaaccctgac gcagccatgc cgcgtgaatg aagaaggcct
      421 tcgggttgta aagttctttc ggtnatgagg aagggatgtt nttaaataga tagcatcatt
      481 gacgttaatt ncagaagaag caccggctna ctccgtgcca gcagccgcgg taatacggag
      541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggac ttttaagtga
      601 gatgtgaaat ccccgagctt aacttgggaa ctgcatttca gactgggagt ctagagtnct
      661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc
      721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcnaaagc gtggggagca
      781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct
      841 ntntgcttgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag
      901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg
      961 atgnaacgcg aagaacctta cctactcttg acatcctaag aagagctcag agatgagctt
     1021 gtgccttcgg gaacttagag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat
     1081 gttgggttaa gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attcggtcgg
     1141 gaactcaaag gagactgcca gtgannaact ggaggaaggt ggggatgacg tcaagtcatc
     1201 atggccctta cgagtagggc tacacacgtg ctacaatggt gcatacagag ggcagcgaga
     1261 gtgcgagctt gagcgaatct cagaaagtgc atctaagtcc ggattggagt ctgcaactcg
     1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc
     1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc
     1441 ttaaccntcg ggggggcgtt taccacggta tgattcatga c
//
LOCUS       RATSUBB      1124 bp ss-mRNA            ROD       02-AUG-1990
DEFINITION  Rat F-0-ATPase subunit b mRNA, complete cds.
ACCESSION   M35052
KEYWORDS    F-0-ATPase subunit b.
SOURCE      Rat (strain Wistar) liver, cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1124)
  AUTHORS   Tsurumi,C., Yoshihara,Y., Osaka,F., Yamada,F., Tani,I., Higuti,T.,
            Shimizu,M., Oeda,K., Ohkawa,H., Toda,H., Kakuno,T., Sakiyama,F.,
            Kumatori,A., Tanaka,K. and Ichihara,A.
  TITLE     cDNA cloning and sequencing for the import precursor of subunit b
            in H+-ATP synthase from rat mitochondria
  JOURNAL   Biochem. Biophys. Res. Commun. 169, 136-142 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Higuti, 04-JUN-1990.
FEATURES       from  to/span     description
    pept         60      830     F-0-ATPase subunit b precursor
    sigp         60      185     F-0-ATPase subunit b signal peptide
    matp        186      827     F-0-ATPase subunit b
    signal     1106     1111     poly-A signal
BASE COUNT      323 a    241 c    266 g    294 t
ORIGIN      
        1 gagtgtctcg ggtcggggtc acagggacat cggtaccgct gtctggatct tcgcagacaa
       61 tgctgtcccg ggtggtactt tctgccgccg ccacagcggc cccgtgtctg aagaacgcgg
      121 ccgtcctggg tccaggggtt ttacaggcaa caagggtctt tcacacagga cagcctcgcc
      181 ttgcccctct accacctctt cctgaatatg gaggaaaagt acgtcttggg ctgattcctg
      241 aggaattttt ccagttcctt taccctaaga ctggtgtaac aggaccctac gtgcttggaa
      301 ctgggcttag cttgtacttt ctatccaaag aaatatatgt gattacccca gagaccttct
      361 ctaccatatc agtagtaggg ttgattgtct atgtgattaa gaaatatggc gcctctattg
      421 gagaatttat tgacaaactt aatgaggaaa aaattgctca actagaagaa ataaagcagt
      481 caagcatgaa gcaaatccag gatgcaatca acagggagaa ggcacagcag gcactggttc
      541 agaagcgcca ctacctcttc gatgttcaga ggaataacat tgccctggcc ttggaggtca
      601 cttaccggga acgactacat aaagcatata aggaggtaaa gaatcgcctg gactaccata
      661 tttctgtaca ggacatgatg cgtcgcaagg agggagagca catgataaac tgggtggaga
      721 agcatgtgat acagagcatt tctgcacagc aggaaaagga gaccattgcc aagtgcatcg
      781 gagatctaaa gatgcttgca aagaaggctc aagctcagcc aattatgtga atgtgtctat
      841 ctcagttgtg atagccagag agggttaact taaatgagaa ctactctact tgaagaactc
      901 tttctgtatt gctgtctaat gaaattgtgg tgtacctttc ctaagaaatg atcaaggtcc
      961 atttagtggc ttaaccctat ttctgatcct tgctctgtat tcggagttgt ctgatcacat
     1021 ttgagtaagc aatttgcagc aactcgctgc ccaacagaaa ttaccaagtt atagtttaaa
     1081 cttgtaatta gttaaaacat cttacaataa aatgtttgaa acag
//
LOCUS       RATHDNFNT    1020 bp ss-mRNA            ROD       02-AUG-1990
DEFINITION  Rat neurotrophin-3 (HDNF/NT-3) mRNA, complete cds.
ACCESSION   M34643
KEYWORDS    neurotrophic factor; neurotrophin-3.
SOURCE      Rat adult (strain Sprague-Dawley) hippocampus, cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1020)
  AUTHORS   Ernfors,P., Ibanez,C.F., Ebendal,T., Olson,L. and Persson,H.
  TITLE     Molecular cloning and neurotrophic activities of a protein with
            structural similarities to nerve growth factor: Developmental and
            topographical expression in the brain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5454-5458 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.P.Ernfors, 18-MAY-1990.
FEATURES       from  to/span     description
    pept         73      849     neurotrophin-3 (HDNF/NT-3) precursor
    sigp         73      120     neurotrophic factor/neurotrophin-3 (HDNF/NT-3)
                                 signal peptide
    matp        490      846     neurotrophic factor/neurotrophin-3
BASE COUNT      320 a    229 c    226 g    245 t
ORIGIN      
        1 gtcgacgtcc ctggaaatag tcatacggat gccatggtta cttctgccac gatcttacag
       61 gtgaacaagg tgatgtccat cttgttttat gtgatatttc ttgcttatct ccgtggcatc
      121 caaggcaaca acatggatca aaggagtttg ccagaagact ctctcaattc cctcattatc
      181 aagttgatcc aggcggatat cttgaaaaac aagctctcca agcagatggt agatgttaag
      241 gaaaattacc agagcaccct gcccaaagca gaggcaccca gagaaccaga gcagggagag
      301 gccaccaggt cagaattcca gccgatgatt gcaacagaca cagaactact acggcaacag
      361 agacgctaca attcaccccg ggtcctgctg agtgacagca cccctttgga gccccctccc
      421 ttatatctaa tggaagatta tgtgggcaac ccggtggtaa ccaatagaac atcaccacgg
      481 aggaaacgct atgcagagca taagagtcac cgaggagagt actcagtgtg tgacagtgag
      541 agcctgtggg tgaccgacaa gtcctcagcc attgacattc ggggacacca ggttacagtg
      601 ttgggagaga tcaaaaccgg caactctcct gtgaaacaat atttttatga aacgaggtgt
      661 aaagaagcca ggccagtcaa aaacggttgc agggggattg atgacaaaca ctggaactct
      721 cagtgcaaaa cgtcgcaaac ctacgtccga gcactgactt cagaaaacaa caaactcgta
      781 ggctggcgct ggatacgaat agacacttcc tgtgtgtgtg ccttgtcaag aaaaatcgga
      841 agaacatgaa ttggcatctg tccccacata taaattatta ctttaaatta tatgatatgc
      901 atgtagcata taaatgttta tattgttttt atatattata agttgacctt tatttattaa
      961 acttcagcaa cccttacagt atataagctt ttttcataat cgggctgctc aaaaaaaaaa
//
LOCUS       NEMG12       3620 bp ds-DNA             INV       02-AUG-1990
DEFINITION  A.lumbricoides trans-spliced mRNA encoded protein gene 12, complete
            cds.
ACCESSION   M33757
KEYWORDS    .
SOURCE      A.lumbricoides suum DNA, clone gene-12.
  ORGANISM  Ascaris lumbricoides
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; 
            Ascaridida; Ascaridina; Ascaridoidea; Ascarididae.
REFERENCE   1  (bases 1 to 3620)
  AUTHORS   Denker,J.A. and Nilsen,T.W.
  TITLE     Trans splicing of nematode pre-mRNA in in vitro
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 886 to 1149)
  AUTHORS   Hannon,G.J., Maroney,P.A., Denker,J.A. and Nilsen,T.W.
  TITLE     Trans splicing of nematode pre-messenger RNA in vitro
  JOURNAL   Cell 61, 1247-1255 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.W.Nilsen, 18-APR-1990.
            
               [1] Author address: T.W.Nilsen,
               Case Western Reserve University,
               Department of Molecular Biology and Microbiology
               School of Medicine, 2119
               Abbington Rd., Cleveland OH, 44106,
               Ph: (216) 368-3913, FAX: 216-368-3055.
            
FEATURES       from  to/span     description
    pept        995     1190     gene-12 encoded protein, exon 1
               1860     1945     gene-12 encoded protein, exon 2
               2280     2432     gene-12 encoded protein, exon 3
               2947     2997     gene-12 encoded protein, exon 4
    pre-msg <     1     3210     gene-12 mRNA (alt.)
    pre-msg <     1     3114     gene-12 mRNA (alt.)
    IVS        1191     1859     gene-12 intron A
    IVS        1946     2279     gene-12 intron B
    IVS        2433     2946     gene-12 intron C
    signal     3097     3102     poly-A signal
BASE COUNT      942 a    722 c    867 g   1080 t      9 others
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcgagc acactctggc ctctttcagt tgtttcatgg caaggcgcat acgattttct
       61 aaacgcgctt gagtttttgt atgtgcataa ctgtgccaac actgccgagg atattcgtgc
      121 gctttttgct tgttcttttc cacaaacagc aagtaatgtt gtaaaatacg ccctgtcata
      181 atcattggca tcttcctttg atgtcgacga actggtcacc gatggcttct ttactttgcc
      241 tctgctcatt cttacccaca ttgtccactt acctgttgac cgtgtttcgg caaccagtgc
      301 agtcagatgg tctctatacg cgacgcagca ctcgtcccag tcgtcttcaa gagcaacttt
      361 agcgaagatt ttgaatcccg cgggagagcc gtttcctgtt cgacggtgac agagctattg
      421 tagagggata tcatcgaaat ctaggtcctc ttgagtgcga atcctctgca aacaggtaac
      481 tctagactcc tggcaaaaaa tgagttgatt ctaccaattt tttagttagc agcagttggt
      541 tgggacgagc ttgcatgctt cgttcggttg aggaacttta ctgcagttcc gtcagccatg
      601 ttgcctttcg gattctgttt ttccgacatc atggctacag gtggcactga agacgacgtt
      661 atatttgaca caccttgcac gtcataatgt gtatccgcat tgtgctggtc ggatatcgct
      721 gcagcaatta ttgaggaggt atttggtcat ctgggttcat ttgaaacggc gtgcattatg
      781 ggaacaagtg gtacgcagtc gttcacgcgc tcctctgatt agttacagaa gtggatgcgt
      841 gtccggtctt ttccctcatc ttcttaaccc ttttctttta gaatgtattc aaatgcatgg
      901 ttttgtttgg tggttgtatc gaaaatttac gccatgagac actagtgcat tatttaatag
      961 tagtaaaaga gatgtttttt tagtatattg aaggatgtca tacgacttgg caacggctga
     1021 aagagcagcg tatgctcctt tcttcggcta tatgggagca gcttctgcac agatcttcac
     1081 agtgcttggg gctgcgtacg gcactgcgaa atcggccgtt ggaatctcat caatgggagt
     1141 gatgaggccg gagctgatca tgaaatccgt tattcctgtc atcatggccg gtcagtgttt
     1201 cccgttattc ccactttgat accactgttc agctttacgt ggtgtgatat cggagcatcg
     1261 cccagagcag catattactt tattttttga aagggagcat ggcaagaggt tgacgaaagg
     1321 caaggaatat ttactgcttc taacttactg aattcatcac tgccaattct ttttgtttta
     1381 tatctgactt tcttagagct gaagatgttg tttcaattaa ggtggttgtc gaaatgctgt
     1441 aatcgttacc tgttgctgtc tgaaatttgg cttgtacatc cattattctg agattctttt
     1501 gtataatttt ccactaactg ggatcaaaac ggacttttat aaatatttgt cggtaactaa
     1561 gagtgatttc catttagagt atggaaaaca aagtagggca tctcgcatat ctcatctctt
     1621 gcctctggac tgtcacatat tttctttcat ccccaaagaa aggtccggaa tacttggaga
     1681 tgatcgtcaa tgggtgatta ttgcctcaag gcatatgaga agtgtgcata aatggaattg
     1741 cttaatttgg tgctgtttga tggattctgt ttatgatttg acatattcgg acagatattc
     1801 gggtaacata aaagtatttt tattgacaca agctggtcat gaatttacgt gcgttgcagg
     1861 tattatcggc atttatgggc ttgttgttgc gatggtttta agagggaagg tgacttccgc
     1921 tagcgcagga tacaccctcg acaaagtaag accatttgtt gctctattct cgcaagccca
     1981 ccactgtttt cgctgttctt catctgcacg aattttcttt tggtgcataa tgtcttctgc
     2041 tgcatcatct attgcgactc aaagaaagga cttcagcttt tctaaccttt cctgcgtaaa
     2101 tcatggaaag tcggggggat tgggagggcg ataggtggag ggggagggag cggggaactg
     2161 caaagagaga agtgggacta agtaaacatg atgcggagcg agtcgaccta agattgccta
     2221 aaggtgattt tatattgagt agtgttttta gaggcggtca ttaacgaggt ccgtttcagg
     2281 gatttgccca cctggctgct ggccttacat gtggtctttg cggtctcggt gccggatacg
     2341 caatcggaat cgttggagat gcgggtgtca ggggaacagc acagcaaccg cgcttgttcg
     2401 tcggcatgat cctcattctt atcttctccg aggtactttt ccgctacttg gttaanctgt
     2461 tggaattggt cgatggaatc agcgcgtctt agaatgacta gactgattta aagtggccgg
     2521 atatacctca gtgggtttgg tgctcgtctg gctgagttta gagggaggaa gacttctccg
     2581 ctgttcctgg attcttattt gaacagtcta agagtttatt cccagaactt ccnnnnaaga
     2641 ctaggtagnn aaggggggag agtgggannt aggaaggccg cgaccctaga atgtacaaag
     2701 gaaggtcgat ccctcaatga cgattcgcac ttagaggagt gcacataaga cttacgagaa
     2761 tttacgatgg caccggaaaa aaggtagaag tcataaggtt gttgaccaga tggttactac
     2821 accagcattg tcgcagtcag tcccggaggt gctcatataa agagaaaaca ccagaacaaa
     2881 aataacaaca ttgagatgta gtagtcttca catatttctg tcgctggtga tttgaaaatc
     2941 gttcaggtac tcggtctata cggaatgatt gtcgcgctca ttcttggcac ctcatagacg
     3001 acgggcataa tcggtggctg tacaacaact gtgaaatggc gccgggaggg ttgccaataa
     3061 tgttccacta aatgttatac actactgctg catataaata aaagctgctc cgcgaaaatg
     3121 ctgagtttaa gtggctgcaa cacattttgg tgctgagatg tcgtcttaac tgcaagtttt
     3181 ttgacggaga caatgaagta ttttatgtgc aattattcat gtaacgtcga tgcatttaca
     3241 gtcatatgca aatgatctgc gtgaatgtcg cactgtgaac cacaaccaca gaaaaacgaa
     3301 gtggatctat catgactgca tttacgacta aaacagatcg gtgcacaaag caagtgaaac
     3361 ttagccaagg taaaattaac acagttcttt ctctcaaaca aacatgaagt aaacacatca
     3421 cagaaacatc caaactgcct cgaaagtggt tgtagcgcta taacagcaag agatacgaaa
     3481 agaataaaag aacccgcatt tatacgataa gcgtatttaa atgcgctttc acgcaacgat
     3541 gcaaaaggaa aggctgttgg ctattgtatt gccttcgaga aacaattgac ttagagcaat
     3601 gccatattgg cgtctgaacg
//
LOCUS       HUMPPPB1A    3215 bp ss-mRNA            PRI       02-AUG-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) mRNA, complete
            cds.
ACCESSION   M33689
KEYWORDS    protein phosphotyrosyl phosphatase.
SOURCE      Human placenta, cDNA to mRNA, (library of Clontech), clone
            lambda-16-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3215)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept         73     1380     protein phosphotyrosyl phosphatase 1B (EC
                                 3.1.3.48)
BASE COUNT      818 a    828 c    801 g    768 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 gcgcgacgcg gcctagagcg gcagacggcg cagtgggccg agaaggaggc gcagcagccg
       61 ccctggcccg tcatggagat ggaaaaggag ttcgagcaga tcgacaagtc cgggagctgg
      121 gcggccattt accaggatat ccgacatgaa gccagtgact tcccatgtag agtggccaag
      181 cttcctaaga acaaaaaccg aaataggtac agagacgtca gtccctttga ccatagtcgg
      241 attaaactac atcaagaaga taatgactat atcaacgcta gtttgataaa aatggaagaa
      301 gcccaaagga gttacattct tacccagggc cctttgccta acacatgcgg tcacttttgg
      361 gagatggtgt gggagcagaa aagcaggggt gtcgtcatgc tcaacagagt gatggagaaa
      421 ggttcgttaa aatgcgcaca atactggcca caaaaagaag aaaaagagat gatctttgaa
      481 gacacaaatt tgaaattaac attgatctct gaagatatca agtcatatta tacagtgcga
      541 cagctagaat tggaaaacct tacaacccaa gaaactcgag agatcttaca tttccactat
      601 accacatggc ctgactttgg agtccctgaa tcaccagcct cattcttgaa ctttcttttc
      661 aaagtccgag agtcagggtc actcagcccg gagcacgggc ccgttgtggt gcactgcagt
      721 gcaggcatcg gcaggtctgg aaccttctgt ctggctgata cctgcctctt gctgatggac
      781 aagaggaaag acccttcttc cgttgatatc aagaaagtgc tgttagaaat gaggaagttt
      841 cggatggggc tgatccagac agccgaccag ctgcgcttct cctacctggc tgtgatcgaa
      901 ggtgccaaat tcatcatggg ggactcttcc gtgcaggatc agtggaagga gctttcccac
      961 gaggacctgg agcccccacc cgagcatatc cccccacctc cccggccacc caaacgaatc
     1021 ctggagccac acaatgggaa atgcagggag ttcttcccaa atcaccagtg ggtgaaggaa
     1081 gagacccagg aggataaaga ctgccccatc aaggaagaaa aaggaagccc cttaaatgcc
     1141 gcaccctacg gcatcgaaag catgagtcaa gacactgaag ttagaagtcg ggtcgtgggg
     1201 ggaagtcttc gaggtgccca ggctgcctcc ccagccaaag gggagccgtc actgcccgag
     1261 aaggacgagg accatgcact gagttactgg aagcccttcc tggtcaacat gtgcgtggct
     1321 acggtcctca cggccggcgc ttacctctgc tacaggttcc tgttcaacag caacacatag
     1381 cctgaccctc ctccactcca cctccaccca ctgtccgcct ctgcccgcag agcccacgcc
     1441 cgactagcag gcatgccgcg gtaggtaagg gccgccggac cgcgtagaga gccgggcccc
     1501 ggacggacgt tggttctgca ctaaaaccca tcttccccgg atgtgtgtct cacccctcat
     1561 ccttttactt tttgcccctt ccactttgag taccaaatcc acaagccatt ttttgaggag
     1621 agtgaaagag agtaccatgc tggcggcgca gagggaaggg gcctacaccc gtcttggggc
     1681 tcgccccacc cagggctccc tcctggagca tcccaggcgg gcggcacgcc agacagcccc
     1741 ccccttgaat ctgcagggag caactctcca ctccatattt atttaaacaa ttttttcccc
     1801 aaaggcatcc atagtgcact agcattttct tgaaccaata atgtattaaa attttttgat
     1861 gtcagccttg catcaagggc tttatcaaaa agtacaataa taaatcctca ggtagtactg
     1921 ggaatggaag gctttgccat gggcctgctg cgtcagacca gtactgggaa ggaggacggt
     1981 tgtaagcagt tgttatttag tgatattgtg ggtaacgtga gaagatagaa caatgctata
     2041 atatataatg aacacgtggg tatttaataa gaaacatgat gtgagattac tttgtcccgc
     2101 ttattctgct ccctgttatc tgctagatct agttctcaat cactgctccc ccgtgtgtat
     2161 tagaatgcat gtaaggtctt cttgtgtcct gatgaaaaat atgtgcttga aatgagaaac
     2221 tttgatctct gcttactaat gtgccccatg tccaagtcca acctgcctgt gcatgacctg
     2281 atcattacat ggctgtggtt cctaagcctg ttgctgaagt cattgtcgct cagcaatagg
     2341 gtgcagtttt ccaggaatag gcatttgcct aattcctggc atgacactct agtgacttcc
     2401 tggtgaggcc cagcctgtcc tggtacagca gggtcttgct gtaactcaga cattccaagg
     2461 gtatgggaag ccatattcac acctcacgct ctggacatga tttagggaag cagggacacc
     2521 ccccgccccc cacctttggg atcagcctcc gccattccaa gtcgacactc ttcttgagca
     2581 gaccgtgatt tggaagagag gcacctgctg gaaaccacac ttcttgaaac agcctgggtg
     2641 acggtccttt aggcagcctg ccgccgtctc tgtcccggtt caccttgccg agagaggcgc
     2701 gtctgcccca ccctcaaacc ctgtggggcc tgatggtgct cacgactctt cctgcaaagg
     2761 gaactgaaga cctccacatt aagtggcttt ttaacatgaa aaacacggca gctgtagctc
     2821 ccgagctact ctcttgccag cattttcaca ttttgccttt ctcgtggtag aagccagtac
     2881 agagaaattc tgtggtggga acattcgagg tgtcaccctg cagagctatg gtgaggtgtg
     2941 gataaggctt aggtgccagg ctgtaagcat tctgagctgg cttgttgttt ttaagtcctg
     3001 tatatgtatg tagtagtttg ggtgtgtata tatagtagca tttcaaaatg gacgtactgg
     3061 tttaacctcc tatccttgga gagcagctgg ctctccacct tgttacacat tatgttagag
     3121 aggtagcgag ctgctctgct atgtccttaa gccaatattt actcatcagg tcattatttt
     3181 ttacaatggc catggaataa accattttta caaaa
//
LOCUS       HUMPPPB1A1    276 bp ds-DNA             PRI       02-AUG-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x.
ACCESSION   M33688
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     1 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 276)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    /    34  +   243     protein phosphotyrosyl phosphatase 1B, exon x
                                 (EC 3.1.3.48) (AA at 34)
    pre-msg <     1  >   276     PTP1B mRNA and introns
    IVS     <     1       33     PTP1B intron x-1
    IVS         244  >   276     PTP1B intron x
BASE COUNT       56 a     77 c     69 g     74 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 ctttagaatc tactagatga ttttctcttt cagacccaag aaactcgaga gatcttacat
       61 ttccactata ccacatggcc tgactttgga gtccctgaat caccagcctc attcttgaac
      121 tttcttttca aagtccgaga gtcagggtca ctcagcccgg agcacgggcc cgttgtggtg
      181 cactgcagtg caggcatcgg caggtctgga accttctgtc tggctgatac ctgcctcttg
      241 ctggtaagga ggcctcgcgg gtgccctggg gagctc
//
LOCUS       HUMPPPB1A2    453 bp ds-DNA             PRI       02-AUG-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+1.
ACCESSION   M33687
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     2 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 453)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +   236  +   397     protein phosphotyrosyl phosphatase 1B, exon
                                 x+1 (EC 3.1.3.48)
    pre-msg <     1  >   453     PTP1B mRNA and introns
    IVS     <     1      235     PTP1B intron x
    IVS         398  >   453     PTP1B intron x+1
BASE COUNT      104 a    118 c    111 g    120 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 ggggaggtcc cagactctta accagatctc ttgtgaatgc attgcctcag ggaggcacca
       61 agcctttcat gaggacctgt ccccctgacc cagacacctc ccacccagcc ccacctccaa
      121 cactagggat cacatttcag catgagattg ggaggggaca gacatctaac ggtgttatta
      181 acgttgccct tgagaattgg acctggctga cttatatctc ctctctggct ttcagatgga
      241 caagaggaaa gacccttctt ccgttgatat caagaaagtg ctgttagaaa tgaggaagtt
      301 tcggatgggg ctgatccaga cagccgacca gctgcgcttc tcctacctgg ctgtgatcga
      361 aggtgccaaa ttcatcatgg gggactcttc cgtgcaggtc agcattgcct ttgtttgaat
      421 ccaggtgtga ccattttaac ttttttgtct ttg
//
LOCUS       HUMPPPB1A3    426 bp ds-DNA             PRI       02-AUG-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+2.
ACCESSION   M33686
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     3 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 426)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +    46  +   269     protein phosphotyrosyl phosphatase 1B, exon
                                 x+2 (EC 3.1.3.48)
    pre-msg <     1  >   426     PTP1B mRNA and introns
    IVS     <     1       45     PTP1B intron x+1
    IVS         270  >   426     PTP1B intron x+2
BASE COUNT      115 a    122 c    109 g     80 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 gaagtgaaca ctaatagact tccttcctct tgctgctctt tcaaggatca gtggaaggag
       61 ctttcccacg aggacctgga gcccccaccc gagcatatcc ccccacctcc ccggccaccc
      121 aaacgaatcc tggagccaca caatgggaaa tgcagggagt tcttcccaaa tcaccagtgg
      181 gtgaaggaag agacccagga ggataaagac tgccccatca aggaagaaaa aggaagcccc
      241 ttaaatgccg caccctacgg catcgaaagg taatatattg ggtccagctt gttggggtga
      301 ggggaaatga cttctgttct agaaacacac gctggtactg aaaccctgtg atgcagcctc
      361 tgttggcaag cagcgcttcg catccttggg aacagggcgc tggaccaaca cccactccac
      421 tggtgg
//
LOCUS       HUMPPPB1A4    732 bp ds-DNA             PRI       02-AUG-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+3.
ACCESSION   M33685
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     4 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 732)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +   402  +   597     protein phosphotyrosyl phosphatase 1B, exon
                                 x+3 (EC 3.1.3.48)
    pre-msg <     1  >   732     PTP1B mRNA and introns
    IVS     <     1      401     PTP1B intron x+2
    IVS         598  >   732     PTP1B intron x+3
BASE COUNT      164 a    193 c    205 g    170 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 tctgtagctc taaagaatga gatctggtgt actgatgtgg ccagacattg caattgcagt
       61 acatgagaag gcaaatcata cagtagtgtg tacaccagtg agtcctccag ccagataaat
      121 cctcacagtg accagtcgcc caggcacctt gtgaacccta ccctgggtgt gggtgctatc
      181 tgaagtacct gggggagggg gtgacaagtg gacttcaggc tgatgtggcc ctggcctggc
      241 cctccctcca agcagagggg gctggcacgc tggaaggtta acatcatcca actctgtcta
      301 cacgtggctt gttttttcct agaattcctg ccacaatagc agcatccttg ccattcattt
      361 tctccaaagt gagtacccat ctctgccctc tgattcctca gcatgagtca agacactgaa
      421 gttagaagtc gggtcgtggg gggaagtctt cgaggtgccc aggctgcctc cccagccaaa
      481 ggggagccgt cactgcccga gaaggacgag gaccatgcac tgagttactg gaagcccttc
      541 ctggtcaaca tgtgcgtggc tacggtcctc acggccggcg cttacctctg ctacagggta
      601 tgtttccact gacagacgcg ctgggcagat gctcgtgtgc agagagcact ggccgctagc
      661 ccgatggtag gattcagttc tgtggtgcat ctgagccagt ctcagaagaa acagatcaag
      721 gttttaagtc tg
//
LOCUS       HUMPPPB1A5    365 bp ds-DNA             PRI       02-AUG-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+4.
ACCESSION   M33684
KEYWORDS    protein phosphotyrosyl phosphatase.
SEGMENT     5 of 5
SOURCE      Human DNA, (library of Clontech), clone lambda-10-2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 365)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene encoding
            protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +   266      289     protein phosphotyrosyl phosphatase 1B, exon
                                 x+4 (EC 3.1.3.48)
    pre-msg <     1  >   365     PTP1B mRNA and introns
    IVS     <     1      265     PTP1B intron x+3
BASE COUNT       80 a    101 c     88 g     96 t
ORIGIN      Chromosome 20q13.1-q13.2.
        1 tacctcctaa gacttttacg gttttaaata ttttacctct ttccaggtgg catctgagta
       61 catcagatgg ttttgcaaaa tgcaaacaat tttttccttg gggatgattt ttggggagag
      121 ggggctactg taaaaaataa aaccaaaacc ccctttgctc cctcggaggt tgaagttgcc
      181 ggggggtgtg gccggggtca tgcatgaggc gacagcactg caggtgcggg tctgggctca
      241 tctgaactgt ttggtttcat tccagttcct gttcaacagc aacacatagc ctgaccctcc
      301 tccactccac ctccacccac tgtccgcctc tgcccgcaga gcccacgccc gactagcagg
      361 catgc
//
LOCUS       YSCIRA2A     9859 bp ds-DNA             PLN       02-AUG-1990
DEFINITION  S.cerevisiae IRA2 protein gene, complete cds.
ACCESSION   M33779
KEYWORDS    IRA2 protein.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Unclassified.
REFERENCE   1  (bases 1 to 9859)
  AUTHORS   Tanaka,K., Nakafuku,N.M., Tamanoi,F., Kagiro,Y., Matsumoto,K. and
            Toh-i,A.
  TITLE     IRA2, a second gene of Saccharomyces cerevisiae that encodes a
            protein with a domain homologous to mammalian ras GTPase activating
            protein
  JOURNAL   Mol. Cell. Biol. 10, 4303-4313 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Tanaka, 19-APR-1990, for release after publication.
               University of Chicago
               Dept. of Biochem and Mol Biol
               920 E. 58th Street
               Chicago, IL 60637
FEATURES       from  to/span     description
    pept        231     9470     IRA2 protein (IRA2)
BASE COUNT     3163 a   1908 c   1735 g   3053 t
ORIGIN      
        1 gagctcctgg gaaacaaaag accgaaaaag cgaaaacaaa atcagaacaa ggcttaagta
       61 ctttttcacc aattgtagca aacatttaac cacattttag cacactagca tatagcattg
      121 tcctctgtta ttcgttttgc ttttctcctt tagtgttact tttcccccaa cgttacacca
      181 ttttttgata tcaactaaac tgtatacatt atctttcttc agggagaagc atgtcccagc
      241 ccactaagaa taagaagaaa gaacacggga ccgattccaa gtcatcccgc atgactcgga
      301 cgttggttaa tcatattctt tttgaaagaa ttctcccgat ccttccggtg gagtctaatc
      361 taagtaccta ttcggaagtg gaagagtatt cctcattcat ttcatgcaga tctgtgctca
      421 ttaacgttac cgtttcccga gatgcaaatg ctatggtgga aggcaccttg gagttgatag
      481 aatcgcttct tcaagggcac gaaatcattt cagataaggg tagcagtgac gttattgaat
      541 caatactgat tatactaaga ttgttaagtg atgcgctaga gtataattgg caaaatcaag
      601 aaagccttca ttacaacgac atttcgactc acgtagaaca tgaccaagaa cagaagtaca
      661 gaccaaagct taacagtatt ctgcccgact actcgtcgac tcattccaat ggcaacaaac
      721 actttttcca ccagagcaaa cctcaggcac tgataccgga actggcatcg aaattgcttg
      781 agagttgcgc gaagttgaag ttcaatacaa gaactttgca aattttacaa aatatgatca
      841 gtcatgttca tggaaacatt ctaacgactt tgagttcctc gattcttccc cgccacaaat
      901 cctatctgac aaggcacaac catccttctc attgtaaaat gattgactct actctaggcc
      961 atattctccg atttgtagcg gcttccaatc cgtccgagta ttttgaattt atcagaaaga
     1021 gtgtgcaagt gcccgtaaca cagacacaca cgcattcaca ctcccattca cactctttgc
     1081 catcttccgt ttataacagc atagtgcccc actttgatct tttcagcttc atctatttaa
     1141 gcaagcataa ttttaagaaa tacttggaac tcatcaaaaa cttatcggtg acgttaagga
     1201 aaacgattta tcattgccta cttttgcatt acagcgccaa agcaataatg ttttggataa
     1261 tggctaggcc tgcggaatat tatgaactct tcaacttatt aaaagataat aacaatgaac
     1321 actcgaaatc cttaaacacg ttaaaccata cacttttcga ggagatccat tcgactttta
     1381 atgtgaatag catgataacc accaatcaaa atgctcatca aggctcatct tccccttcgt
     1441 cctcctcgcc atcgtcacca cctagctcat catcatcgga taacaacaat caaaacataa
     1501 tagcaaaatc cttaagtcgt cagctttctc accaccagtc atacattcaa cagcagtctg
     1561 aaagaaaact acattcttca tggactacaa actctcaatc ctctacttca ctgtcatctt
     1621 caacgtctaa ttcaacaaca actgatttct ctactcacac tcaaccagga gaatatgacc
     1681 cttccttacc agatactccc acgatgtcta acatcactat tagtgcatct tcattattat
     1741 ctcaaactcc aactccaaca acacaattgc aacagcggtt gaactcagca gctgcagccg
     1801 ccgccgcagc tgcttcacca tcgaattcca ccccaactgg atacacagca gagcaacaaa
     1861 gtcgcgcttc atacgatgca cacaaaactg gccatactgg taaggattat gacgaacatt
     1921 ttttgtctgt cactcgtttg gataatgttt tggagttata cacgcacttt gatgatactg
     1981 aggtactacc acacacatcc gtactgaagt ttttaactac tttgacaatg ttcgatattg
     2041 acctttttaa tgaattaaac gctacatcat tcaaatatat tcctgattgt actatgcatc
     2101 gtccaaaaga aagaacaagt tctttcaata atactgcaca cgagacaggt tccgaaaaga
     2161 cttcgggtat aaaacatatt acacaaggct taaagaaatt aacttcttta ccttcctcaa
     2221 ccaaaaaaac tgtaaaattt gtgaagatgt tgctaagaaa tttaaatggg aatcaagctg
     2281 tatcagatgt tgccctctta gatacaatga gggccttact atcattcttt acaatgactt
     2341 ctgcggtctt tctcgtggat agaaacttac cctcagtact ttttgccaag agactcatcc
     2401 ccataatggg gacaaattta agcgtcggtc aagactggaa ttcaaaaata aataacagtt
     2461 tgatggtttg tttgaaaaaa aactccacca cgtttgttca attacaatta atattcttct
     2521 cttcagctat tcaattcgat catgaattat tactggcacg tctgagcatc gatacaatgg
     2581 ccaacaattt aaacatgcag aagctatgcc tttatactga aggattcagg atattcttcg
     2641 acataccaag taagaaggaa ttgcggaagg caattgcggt taaaatttct aaatttttca
     2701 aaacattatt ctccattata gcagatattc ttttacaaga atttccgtat tttgatgagc
     2761 aaatcaccga catagttgct tccattcttg acggtacaat tatcaatgag tatggtacga
     2821 agaaacattt caaggggagc tcaccctctt tatgttcgac aacccggtca agatcaggat
     2881 ctacatctca aagttcaatg acaccagttt ctccgctggg actggatact gatatatgtc
     2941 caatgaacac cctgtcttta gttggttcaa gtacttcaag aaattctgac aacgttaatt
     3001 cattaaacag ttcaccaaag aacttgtctt ctgatccata cttgtcacat cttgtggccc
     3061 caagagcgcg tcatgcttta ggtgggccat ctagtattat aaggaataaa ataccgacta
     3121 cattgacttc acctccagga acggaaaaat cttcaccagt acaacgtccg caaacggaaa
     3181 gcatcagtgc cacaccaatg gccataacaa attctactcc attatcgtcg gcagcattcg
     3241 gaattcgatc gcctttgcag aaaataagaa cgaggcgtta ttccgatgaa agtttaggta
     3301 aattcatgaa atcaacaaat aattacattc aagaacattt gataccaaaa gatttgaatg
     3361 aagcaactct tcaagatgct agaagaataa tgattaatat tttcagtatt tttaagagac
     3421 cgaatagtta cttcatcatt cctcacaata taaactcgaa tttacaatgg gtttcgcagg
     3481 attttagaaa tattatgaaa ccgattttcg tcgccatcgt aagtccggat gtagatttac
     3541 agaatactgc tcaatcattc atggatacct tattatcgaa tgttattact tatggtgaat
     3601 cagatgagaa tatcagtatt gaagggtatc atcttctttg cagttacact gtaacattat
     3661 ttgcaatggg ccttttcgat ttgaaaatta ataatgaaaa gcgtcaaatt ctcttggata
     3721 taactgtcaa gtttatgaag gttagatcac atttagcagg gatcgcggag gcctcacacc
     3781 acatggaata cataagtgat tctgaaaaac tcacctttcc gctgattatg gggactgttg
     3841 gtagggccct atttgtttca ttatactcta gtcaacaaaa aattgaaaag actttaaaga
     3901 ttgcttacac agagtatctt tctgcaatca attttcatga gaggaatatt gatgatgctg
     3961 ataaaacttg ggttcataat attgagtttg tagaagcgat gtgtcatgac aactacacaa
     4021 cttctggttc aattgctttc caaaggagga caagaaataa tattttacga tttgctacta
     4081 ttcctaacgc tatcttactt gattctatga ggatgatcta taagaagtgg catacttaca
     4141 cacacagtaa aagtttagaa aaacaagaac ggaacgactt cagaaatttc gcgggtattt
     4201 tagcctcttt gtcgggtatc ctattcatca ataaaaagat attgcaagaa atgtatccat
     4261 acctactcga caccgtttca gaattgaaaa aaaatataga ctcttttatc tcaaaacaat
     4321 gccaatggtt aaactatccg gatttattaa cgagagaaaa ttcaagagat attctaagtg
     4381 tagaactgca tcctttgtct tttaacttac tttttaataa tttgaggctc aagttaaaag
     4441 aacttgcttg ttcagactta tcaataccag aaaatgaaag ttcctatgtt ttattagaac
     4501 aaataatcaa aatgctgcgg acaatcctag gtcgtgatga tgacaattat gtaatgatgc
     4561 ttttttccac agagattgta gatcttattg atttattgac agatgaaata aaaaaaatac
     4621 cagcctattg tccaaaatat ctcaaggcaa ttattcaaat gaccaaaatg ttcagtgcct
     4681 tgcagcactc agaggttaat ttaggtgtca aaaatcattt tcacgttaaa aataaatggt
     4741 tgaggcaaat cactgattgg tttcaagtga gtattgcgag agagtacgat ttcgaaaact
     4801 tgtcaaaacc tctaaaagaa atggatttgg taaaaagaga catggatatt ctatacatag
     4861 atacggcaat cgaagcttca accgctattg cgtacctcac gagacatact ttcttagaga
     4921 ttccacctgc cgcgtcagat cccgaactat ctcgatctag gtctgtgata tttgggtttt
     4981 atttcaacat cttaatgaaa ggccttgaaa aaagtagtga tcgtgacaat tacccagtat
     5041 tcttgaggca caaaatgagt gtcctcaacg acaatgtaat actttcatta acaaatcttt
     5101 caaacaccaa tgttgatgcg agtttgcagt tcaccttacc gatgggctat tccggaaatc
     5161 gaaacattag gaatgcattt ttggaggtct tcattaatat cgttacgaac tatcggacat
     5221 acacggctaa aactgacctt ggaaaattag aggcagcaga caaatttttg cgatatacga
     5281 ttgaacatcc ccagctatcg tcctttggag cagcggtttg tcccgctagc gatattgatg
     5341 cttatgctgc tggcttaata aatgcatttg aaacgaggaa tgccacccac attgtagtgg
     5401 cacagttgat taaaaatgaa attgaaaaat cttccagacc tacggatatc cttagaagaa
     5461 atagctgtgc tacgagatca ttatctatgc tagccaggtc caagggtaac gaatatttga
     5521 ttcgcacttt gcaaccatta ctaaaaaaaa ttatccagaa cagagatttt tttgaaattg
     5581 agaaactaaa accggaagat tcagatgctg aacgtcaaat agagctcttc gttaaataca
     5641 tgaatgaatt attggaatcc atatccaact ccgtatctta ttttccccct cctttatttt
     5701 atatttgcca aaacatttat aaagttgcgt gtgaaaaatt tccggatcac gcaattatcg
     5761 ccgctgggtc tttcgtgttt ttacggtttt tttgtcctgc tttagtcagc cctgattctg
     5821 aaaatatcat agatatttct cacttgagcg aaaagcgtac cttcatcagc ttggctaaag
     5881 ttatccaaaa tattgccaat ggctcagaaa atttctccag atggccagct ttgtgttccc
     5941 aaaaggattt tcttaaggaa tgtagcgata gaattttcag attcctagct gaactttgta
     6001 gaacagatcg cacgatagac atccaagtga gaacagaccc aacgccaatt gcatttgact
     6061 atcaattcct tcattccttt gtttaccttt acggtcttga ggtgagaagg aatgtgctaa
     6121 atgaagcaaa acatgatgat ggtgacattg atggtgacga tttctataag accacatttt
     6181 tacttattga tgatgttctt ggccaattag gccaacctaa aatggaattt tccaatgaaa
     6241 taccaatata cataagagaa catatggacg actatccgga actgtatgag ttcatgaata
     6301 ggcacgcgtt cagaaacatt gagacttcaa cagcgtacag cccaagcgtt cacgagtcca
     6361 cctcaagtga aggcattcca attattacgt taacaatgtc aaatttctca gacagacatg
     6421 tggacattga tacagttgct tacaagttct tgcaaattta tgctcgaatc tggaccacca
     6481 aacactgttt aataatcgac tgtacagaat ttgacgaggg agggcttgat atgaggaaat
     6541 ttatttcttt ggttatggga ctattaccag aagttgcacc caaaaattgt ataggctgtt
     6601 actactttaa cgtaaacgag acatttatgg ataattatgg aaaatgtttg gacaaagaca
     6661 acgtatatgt ttcctcgaaa attcctcatt atttcattaa tagtaactct gatgaaggac
     6721 ttatgaaatc tgtgggtata actggacaag ggttgaaggt tctgcaagat attcgtgtct
     6781 ctctgcatga tatcacgctt tatgacgaaa aaagaaatag atttacgccg gtatcgttga
     6841 aaataggcga tatttacttt caagtcttgc atgaaactcc taggcaatat aaaataaggg
     6901 acatgggtac tttattcgac gtaaaattca atgatgtcta cgaaattagc cgaatatttg
     6961 aagtacatgt ttcgtcaata actggagtgg cagctgaatt tacagtaact tttcaggacg
     7021 agagaaggtt gatttttagt agtccgaaat accttgaaat tgtgaagatg ttctattacg
     7081 cacagatccg gttagaaagt gaatatgaaa tggataataa ttcgagtacc tcctccccaa
     7141 attcaaacaa caaggtcaaa cagcagaaag agagaacaat tctattgtgc cacctactgt
     7201 tagtatctct tattggtctg tttgatgaga gtaaaaaaat gaaaaacagt tcgtataacc
     7261 taatagctgc cactgaggcg tcatttggtt tgaactttgg ctcccatttt catcgctctc
     7321 ccgaggtgta cgtccccgaa gatactacaa catttttagg tgttattgga aagtctcttg
     7381 cagagtctaa tccagaactc acagcctata tgtttatcta tgttttggag gcattgaaga
     7441 acaacgtaat tcctcacgtt tacatccctc ataccatttg cggtttgtct tattggatcc
     7501 ctaatttata ccaacatgtg tatttggctg atgatgaaga aggccccgaa aacatatctc
     7561 acattttccg aattcttatc aggctctctg tgagagagac tgactttaaa gccgtataca
     7621 tgcaatatgt ttggttgcta cttttagatg atggccgctt aactgacatt atcgttgatg
     7681 aagttattaa tcatgcgtta gaaagagact ccgaaaaccg cgattggaag aaaacaatat
     7741 cgttactgac tgtcctaccc actactgagg ttgctaataa tattattcaa aaaatattgg
     7801 caaaaattag atcattttta ccgtcattga agttagaagc tatgacccaa agttggtctg
     7861 aactaacaat attagttaag ataagcatcc acgttttttt tgaaacttct ttgctggtac
     7921 agatgtactt accagagatc ctgtttatcg tatccttatt aattgatgtt ggtccaaggg
     7981 aactcagatc atcactacac cagctattaa tgaatgtatg ccattccttg gctattaact
     8041 cagctttacc acaagatcat agaaataatc tagatgaaat aagtgatata tttgcacatc
     8101 aaaaggtgaa gtttatgttt gggttcagcg aggacaaagg acgaatttta cagattttta
     8161 gcgcttcttc ttttgcaagc aagtttaata ttctggattt cttcatcaat aatatattat
     8221 tgctgatgga atattcttca acgtacgaag caaacgtgtg gaagacaaga tacaagaaat
     8281 atgtcttgga atctgtgttt acaagtaatt cttttctttc ggcacgttca atcatgattg
     8341 ttggtataat gggtaaatct tacataactg aagggttatg caaggctatg ttaattgaaa
     8401 ccatgaaagt tatcgccgaa ccaaagatta ctgacgagca tcttttctta gccatatctc
     8461 atatttttac ttattccaaa attgttgaag gtttggatcc caaccttgac ttaatgaagc
     8521 acttattttg gttttcaaca ctcttccttg aatcacgtca cccgataatt tttgagggtg
     8581 cccttctctt tgtgtcaaac tgtataaggc gcctatacat ggcccagttt gaaaatgaaa
     8641 gcgaaacatc attgataagt actttactta aggggagaaa gtttgctcat acctttttaa
     8701 gcaagataga gaatcttagt ggtattgttt ggaatgaaga taattttaca cacattctga
     8761 ttttcatcat taataaagga ctatccaatc ctttcattaa gagtacggct tttgatttct
     8821 tgaagatgat gtttagaaac tcctactttg agcatcaaat caatcagaaa tctgatcatt
     8881 atttgtgcta tatgttccta ttgtattttg ttttaaactg caatcaattt gaggaacttt
     8941 taggtgacgt tgattttgaa ggagaaatgg ttaacattga aaacaagaac accattccta
     9001 aaattttgtt agagtggttg agttcggata acgaaaatgc aaacattacc ctctatcaag
     9061 gtgcgatact gttcaaatgt tcagttacgg atgaaccaag tagatttagg tttgcgttga
     9121 ttattaggca tctattgaca aagaaaccca tttgtgcatt gcgtttttac agtgttattc
     9181 gtaacgaaat aagaaaaata tcagcatttg agcaaaattc ggattgtgtt ccacttgctt
     9241 tcgatatttt aaacttatta gtgacgcatt cagagtctaa ttcgttagaa aaacttcacg
     9301 aagaatccat tgaacgtcta accaaaagag gtttatcgat tgtgacttct tctggtatat
     9361 ttgcgaagaa ctccgacatg atgatacctt tagatgtaaa acctgaagat atctatgaac
     9421 gtaagagaat aatgacaatg attttatcaa ggatgtcatg ttctgcttag aggtgttaca
     9481 taaactaatg aaagaaatat caatatctat ctgtaagcat gaatgtacat atctcatgtt
     9541 agggttttct tatcgctaat ttttcgcaat ttgttacgtg ggttgctttt atacagctac
     9601 aatttttata tattctatcg tgtaatgaat ggctcagtaa attcaagcgc cacatagact
     9661 aatgtacata ccaatgcatt ttaattgtaa gaataaaagg ggccattcat ctaccgtctt
     9721 agttgaaagt gtttctgtga attttttcaa attccgtttt ttccttttta tataatagca
     9781 tggtggcacg agcatcttcg actgaagaat gctcaccttc ttgaatggaa atttttaaaa
     9841 cctccctggt taatttctt
//
LOCUS       HUMMHDQ3L     967 bp ds-DNA             PRI       02-AUG-1990
DEFINITION  Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment,
            long terminal repeat region.
ACCESSION   M33841
KEYWORDS    major histocompatibility complex.
SOURCE      Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 967)
  AUTHORS   Kambhu,S., Falldorf,P. and Lee,J.S.
  TITLE     Endogenous retroviral long terminal repeats (LTR) within the HLA-DQ
            locus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4927-4931 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.S.Lee, 25-APR-1990.
FEATURES       from  to/span     description
    rpt           1        7     inverted repeat A
    rpt         961      967     inverted repeat B
    rpt         560      810     R region
    signal      792      797     poly-A signal
    site         74       79     1/2 GRE
    site         80       87     enhancer core
    site        530      536     TATA box
BASE COUNT      254 a    235 c    222 g    256 t
ORIGIN      Chromosome 6p21.3.
        1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
       61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct
      121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa
      181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
      241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc
      301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt
      361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa
      421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa
      481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg
      541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg
      601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc
      661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg
      721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc
      781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga
      841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct
      901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac
      961 ccctaca
//
LOCUS       HUMMHDQ5L     960 bp ds-DNA             PRI       02-AUG-1990
DEFINITION  Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment,
            long terminal repeat region.
ACCESSION   M33842
KEYWORDS    major histocompatibility complex.
SOURCE      Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 960)
  AUTHORS   Kambhu,S., Falldorf,P. and Lee,J.S.
  TITLE     Endogenous retroviral long terminal repeats (LTR) within the HLA-DQ
            locus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4927-4931 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.S.Lee, 25-APR-1990.
FEATURES       from  to/span     description
    rpt           1        7     inverted repeat A
    rpt         954      960     inverted repeat B
    rpt         555      803     R region
    signal      785      790     poly-A signal
    site         74       79     1/2 GRE
    site         80       87     enhancer core
    site        525      531     TATA box
BASE COUNT      250 a    232 c    219 g    259 t
ORIGIN      Chromosome 6p21.3.
        1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat
       61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct
      121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa
      181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc
      241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag
      301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa
      361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt
      421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa
      481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt
      541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca
      601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac
      661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct
      721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat
      781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg
      841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa
      901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca
//
LOCUS       NEUAMTR      5928 bp ds-DNA             PLN       02-AUG-1990
DEFINITION  N.crassa mating type protein gene, complete cds.
ACCESSION   M33876
KEYWORDS    mating type protein.
SOURCE      N.crassa (strain 74-ORS-A) DNA.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 5928)
  AUTHORS   Glass,N.L., Grotelueschen,J. and Metzenberg,R.L.
  TITLE     The Neurospora crassa A mating-type region
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4912-4916 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.L.Glass, 26-APR-1990.
FEATURES       from  to/span     description
    pept       4121     4283     mating type protein, exon 1
               4343     5046     mating type protein, exon 2
    IVS        4284     4342     mating type protein intron A
    signal     3858     3906     GC signal
    signal     3945     3949     CAAT box
    signal     5319     5326     poly-A signal
BASE COUNT     1470 a   1459 c   1526 g   1473 t
ORIGIN      
        1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg
       61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga
      121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct
      181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct
      241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag
      301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa
      361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga
      421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa
      481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg
      541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg
      601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg
      661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc
      721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg
      781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc
      841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg
      901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc
      961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat
     1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa
     1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc
     1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg
     1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt
     1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa
     1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc
     1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca
     1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat
     1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac
     1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca
     1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta
     1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga
     1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt
     1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa
     1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac
     1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc
     1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc
     2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg
     2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga
     2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg
     2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca
     2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca
     2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc
     2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca
     2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat
     2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag
     2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc
     2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca
     2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg
     2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag
     2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg
     2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata
     2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg
     3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca
     3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca
     3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt
     3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt
     3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca
     3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag
     3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac
     3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta
     3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca
     3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac
     3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg
     3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca
     3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt
     3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt
     3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc
     3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa
     3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc
     4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt
     4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat
     4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc
     4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt
     4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat
     4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag
     4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat
     4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg
     4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg
     4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg
     4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag
     4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat
     4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt
     4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct
     4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc
     4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat
     4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg
     5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca
     5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc
     5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg
     5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg
     5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg
     5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg
     5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca
     5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa
     5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct
     5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg
     5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag
     5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg
     5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc
     5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc
     5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac
//
LOCUS       PFARGEB      2126 bp ds-DNA             INV       02-AUG-1990
DEFINITION  P.falciparum 18S ribosomal RNA not in asexual parasites.
ACCESSION   M19173
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.falciparum DNA, clone B.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 2126)
  AUTHORS   McCutchan,T.F., de la Cruz,V.F., Lal,A.A., Gunderson,J.H.,
            Elwood,H.J. and Sogin,M.L.
  TITLE     Primary sequences of two small subunit ribosomal RNA genes from
            Plasmodium falciparum
  JOURNAL   Mol. Biochem. Parasitol. 28, 63-68 (1988)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 2126; revises [1])
  AUTHORS   Sogin,M.L.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Computer-readable copy of sequence for [2] kindly submitted by
            M.L.Sogin, 01-AUG-1990.
               SOGIN@EVOL1.MBL.EDU
BASE COUNT      704 a    296 c    412 g    714 t
ORIGIN      
        1 aacctggttg atcttgccag tagtcatatg cttgtctcac agattaagcc atgcaagtga
       61 aagtacatat gtaagtaata tattgaaact gcgaacggct cattaaaaca gttataatct
      121 acttgatgtt tttaatataa ggataactac ggaaaatctg tagctaatac ttgtgaaaat
      181 accttttgat atatacatat gtatgtatca ttaaggtatg tatttgttag atataagaat
      241 aaaataaata taattgaatt ataacaaaga agaaacacat aaaatatgtg tattatcaat
      301 cgagtatctg acctatcagc ttttgatgtt agggtattga cctaacatgg ctatgacggg
      361 taacggggaa ttagagttcg attccggaga gggagcctga gaaatagcta ccacatctaa
      421 ggaaggcagc aggcgcgtaa attacccaat tctaaaaaag agaggtagtg acaagaaata
      481 acaatacaat atcgaaaaat gattttgtaa ttggaatgat aggaatttac aaggttccta
      541 gagaaacaat tggagggcaa gtctggtgcc agcagccgcg gtaattccag ctccaatagc
      601 atatattaaa attgttgcag ttaaaacgtt cgtagttgaa tattaaagaa tccgatgttt
      661 catttaaact ggtttgggaa aaccaaatat attatatatt ttgctttgtt caaaataagg
      721 ttttctaata aattatgttt ttatcagata tgacagaatc ttttttaaaa tctcttcaat
      781 atgcttttat tgcttttgag aggttttgtt actttgagta aaattaagtg ttcataacag
      841 acgggtagtc atgattgagt tcattgtgtt tgaatactac agcatggaat aacaaatatg
      901 aataagctaa ttattttttt tttcattatt ttttttgata ttcttattag cttagttacg
      961 attaatagga gtagctttgg gggcattcgt attcagatgt cagaggtgaa attctaagat
     1021 tttctggaga cggactactg cgaaagcatt tgcctaatct atttccatta atcaagaacg
     1081 aaagttaagg gagtgaagac gatcagatac cgtcgtaatc ttaaccataa actataccga
     1141 ctaggtgttg gatgaatata aaaaatatat aaatatgtag catttcttag ggaatgttga
     1201 ttttatatta gaattgcttc cttcagtacc ttatgagaaa tcaaagtctt tgggttctgg
     1261 ggcgagtatt cgcgcaagcg agaaagttaa aagaattgac ggaagggcac caccaggcgt
     1321 ggagcttgcg gcttaatttg actcaacacg ggaaaactca ctagtttaag acaagagtag
     1381 gattgacaga ttaatagctc tttcttgatt tcttggatgg tgatgcatgg ccgtttttag
     1441 ttcgtgaata tgatttgtct ggttaattcc gataacgaac gagatcttaa cctgctaatt
     1501 agcggtaagt acactatatt tttatttgaa attgaatata ggtaattata catgtttatt
     1561 cagtgttcaa attaggatat tttttttatt aaaatattct tttccctgtt ttactaataa
     1621 tttgtttttt tttactctat ttctctcttc ttttaagaat gtacttgctt gattaaataa
     1681 agcttcttag aggaacagtg tgtatctaac acaaggaagt ttaaggcaac aacaggtctg
     1741 tgatgtcctt agataaacta ggctgcacgc gtgctacaat gatatatata acaagttgtt
     1801 aaaaatgtac ttataaataa gtgtgtacaa tttttcctgt actgaaaagt ataggtaatc
     1861 tttatcagta tatatcgtaa ttgggataga ttattgcaat tattaatctt gaacgaggaa
     1921 gcccgtcgct cctaccgatt gaaagatatg ataaattgtt tggatatgaa ttaaaataat
     1981 agaagtcgta acaaggtttc cgtaggtgaa cctgcggaag gatcattaat atatgaaacg
     2041 ccgagctcga attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc
     2101 acaattccac acaacatacg agccgg
//
LOCUS       ALCRR45S      103 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  A.tuberosum chloroplast 4.5S ribosomal RNA.
ACCESSION   M35406
KEYWORDS    4.5S ribosomal RNA.
SOURCE      A.tuberosum chloroplast young leaf rRNA.
  ORGANISM  Chloroplast Allium tuberosum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Liliidae; Liliales; Liliaceae; Allium tuberosum.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L.
  TITLE     The nucleotide sequences of chloroplast 4.5 S rRNAs from four
            species of plants, celery (Apium graveoleus), barley (Hordeum
            vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina
            communis)
  JOURNAL   FEBS Lett. 200, 193-196 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      103     4.5S ribosomal RNA
BASE COUNT       29 a     18 c     31 g     25 t
ORIGIN      
        1 taaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtctagtgg aagtgcagtg
       61 atgtatgcag ctgaggcatc ctaatagacc gagagatttg aac
//
LOCUS       APIRR45S      103 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  A.graveoleus chloroplast 4.5S ribosomal RNA.
ACCESSION   M35404
KEYWORDS    4.5S ribosomal RNA.
SOURCE      A.graveoleus chloroplast young leaf rRNA.
  ORGANISM  Chloroplast Apium graveoleus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Apiales; Apiaceae; Apium graveoleus.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L.
  TITLE     The nucleotide sequences of chloroplast 4.5 S rRNAs from four
            species of plants, celery (Apium graveoleus), barley (Hordeum
            vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina
            communis)
  JOURNAL   FEBS Lett. 200, 193-196 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      103     4.5S ribosomal RNA
BASE COUNT       28 a     19 c     32 g     24 t
ORIGIN      
        1 gaaggtcacg gtgagacgag ccgtttatca ttacgatagg tgtctagtgg aagtgcagtg
       61 atgtatgcag ctgaggcatc ctaacagacc ggcagatttg aac
//
LOCUS       BLYRR45S       96 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  H.vulgare chloroplast 4.5S ribosomal RNA.
ACCESSION   M35405
KEYWORDS    4.5S ribosomal RNA.
SOURCE      H.vulgare chloroplast young leaf rRNA.
  ORGANISM  Chloroplast Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Hordeum vulgare.
REFERENCE   1  (bases 1 to 96)
  AUTHORS   Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L.
  TITLE     The nucleotide sequences of chloroplast 4.5 S rRNAs from four
            species of plants, celery (Apium graveoleus), barley (Hordeum
            vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina
            communis)
  JOURNAL   FEBS Lett. 200, 193-196 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1       96     4.5S ribosomal RNA
BASE COUNT       29 a     17 c     30 g     20 t
ORIGIN      
        1 taaggtagcg gcgagacgag ccgtttaaat aggtgtcaag tggaagtgca gtgatgtatg
       61 cagctgaggc atcctaacga accgaacgat ttgaac
//
LOCUS       CEVCG         371 bp ds-DNA   circular  VRL       02-AUG-1990
DEFINITION  Citrus exocortis viroid RNA sequence, complete genome.
ACCESSION   M34917
KEYWORDS    complete genome.
SOURCE      Citrus exocortis viroid (strain A) RNA.
  ORGANISM  Citrus exocortis viroid
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 371)
  AUTHORS   Visvader,J.E., Gould,A.R., Bruening,G.E. and Symons,R.H.
  TITLE     Citrus exocortis viroid: Nucleotide sequence and secondary
            structure of an Australian isolate
  JOURNAL   FEBS Lett. 137, 288-292 (1982)
  STANDARD  simple staff_review
BASE COUNT       72 a    112 c    112 g     75 t
ORIGIN      
        1 cgggatcttt cttgaggttc ctgtggtgct cacctgaccc tgcaggcagg aaaagaaaaa
       61 agaggcggcg ggggaagaag tccttcaggg atccccgggg aaacctggag gaagtcgagg
      121 tcggggggga cagctgcttc ggtcgccgcg gatcactggc gtccagcgga gaaacaggag
      181 ctcgtctcct tcctttcgct gctggctcca catccgatcg tcgctgaagc gcctcgcccc
      241 ctcgcccgga gcttctctct ggagactacc cggtggaaac aactgaagct tcaaccccaa
      301 accgcttttc ttgtatcttc actgctctcc gggcgagggt gaaagccctc ggaaccctag
      361 attgggtccc t
//
LOCUS       COMRR45S      103 bp ss-rRNA            RNA       02-AUG-1990
DEFINITION  C.communis chloroplast 4.5S ribosomal RNA.
ACCESSION   M35407
KEYWORDS    4.5S ribosomal RNA.
SOURCE      C.communis chloroplast young leaf rRNA.
  ORGANISM  Chloroplast Commelina communis
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Commelinales; Commelinaceae; Commelina communis.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L.
  TITLE     The nucleotide sequences of chloroplast 4.5 S rRNAs from four
            species of plants, celery (Apium graveoleus), barley (Hordeum
            vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina
            communis)
  JOURNAL   FEBS Lett. 200, 193-196 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      103     4.5S ribosomal RNA
BASE COUNT       31 a     18 c     31 g     23 t      0 others
ORIGIN      
        1 taaggtagcg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagta
       61 atgtatgcag ctgaggcatc ctaacagacc gagagatttg aac
//
LOCUS       HUMBKM        615 bp ds-DNA             PRI       02-AUG-1990
DEFINITION  Human banded krait minor satellite (Bkm) repeat.
ACCESSION   M35828
KEYWORDS    banded krait minor satellite repeat.
SOURCE      Human DNA, clone lambda-X13.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 615)
  AUTHORS   Erickson,R.P., Ross,C.E., Gorski,J.L., Stalvey,J.R.D. and
            Drumm,M.M.
  TITLE     Bkm sequences from the human X chromosome contain large clusters of
            GATA/GACA repeats
  JOURNAL   Ann. Hum. Genet. 52, 167-176 (1988)
  STANDARD  simple staff_review
BASE COUNT      258 a     90 c     87 g    180 t
ORIGIN      X chromosome.
        1 agctttggtt caatttttat ggtttttatg gaaggtacaa agttactaat gcaccacccc
       61 acagtaagca ccagccccac atggtttcat agagaaatat tagaaaatct tcaaagattg
      121 ggtagttgtg atgctacgaa aacttttcca aaacaaactc ctactgaagt tttatgaaat
      181 actaaaatat tgatacttta acctaataga tagtacaata aaacctacat ataattgcac
      241 ttatggatat agatgtacta attaaaatac tagcaaacat aacccaacac tgcattcaga
      301 tataacatac catgaccaaa agaacttcat tctattaatg taaaaaaaat cacattaata
      361 gaattaaggg gaaaacatat gattatctac agtgatgcta aatgaaacca acaatattta
      421 aaattattct gataagaccc tcaagaactt gaattgatgg agattcaact atatatatat
      481 aaatatatgt ttataaatat agatagacag atagatagat agacagatag atagatagat
      541 agatagatag atagatagat agatagatat agatatagat actttaatcc taaaggcaag
      601 atcttaccta ctgca
//
LOCUS       HUMBKMX       206 bp ds-DNA             PRI       02-AUG-1990
DEFINITION  Human banded krait minor satellite (Bkm) repeat.
ACCESSION   M35829
KEYWORDS    banded krait minor satellite repeat.
SOURCE      Human DNA, clone lambda-X81.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 206)
  AUTHORS   Erickson,R.P., Ross,C.E., Gorski,J.L., Stalvey,J.R.D. and
            Drumm,M.M.
  TITLE     Bkm sequences from the human X chromosome contain large clusters of
            GATA/GACA repeats
  JOURNAL   Ann. Hum. Genet. 52, 167-176 (1988)
  STANDARD  simple staff_review
BASE COUNT       93 a     25 c     39 g     49 t
ORIGIN      X chromosome.
        1 agtctacaga gctgcaagaa caccacatca taaatcttac gatttgaatg acatcgatag
       61 atagatagat agatagatag atagatagat agatagatag atagatagat agatagatag
      121 atatagatag atacatacat acatacatac acacatagaa atagagatat cctgatggaa
      181 agaataaacc atgagtctgg taatag
//
LOCUS       MENRR5S       128 bp ss-rRNA            BCT       02-AUG-1990
DEFINITION  M.tindarius 5S ribosomal RNA.
ACCESSION   M34910
KEYWORDS    5S ribosomal RNA.
SOURCE      M.tindarius (strain DSM 2095) rRNA.
  ORGANISM  Methanolobus tindarius
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanomicrobiales; Methanosarcinaceae.
REFERENCE   1  (bases 1 to 128)
  AUTHORS   Willekens,P., Stetter,K.O., Vandenberghe,A., Huysmans,E. and
            De Wachter,R.
  TITLE     The structure of 5S ribosomal RNA in the methanogenic
            archaebacteria Methanolobus tindarius and Methanococcus
            thermolithotrophicus
  JOURNAL   FEBS Lett. 204, 273-278 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      128     5S ribosomal RNA
BASE COUNT       32 a     33 c     34 g     29 t
ORIGIN      
        1 ggagtttggc ggccatagcg gcggggcaat tcctgtaccc atcccgaaca cagaagataa
       61 gcccgcctgc gttctatact gtactaaagt atgagagtct ttgggaaata tggaacgctg
      121 ccatctcc
//
LOCUS       MUSSPIX       980 bp ds-DNA             ROD       02-AUG-1990
DEFINITION  Mouse protease inhibitor gene, 5' flank.
ACCESSION   M35824
KEYWORDS    protease inhibitor gene.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 980)
  AUTHORS   Needham,M., Mills,J.S. and Parker,M.G.
  TITLE     Organization and upstream DNA sequence of the mouse protease
            inhibitor gene
  JOURNAL   Nucleic Acids Res. 16, 6229-6229 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        964  >   980     protease inhibitor mRNA
BASE COUNT      324 a    200 c    208 g    248 t
ORIGIN      
        1 taaacagaaa ttgtcttgtg ctgttaggaa agaacatcaa actcttttta ttaagtccct
       61 agttgtgtaa gagagcagta gccccatctg agtgttgctg ggaaatatgt gcctttattt
      121 ttctagagag aaaaacaaaa ctaagctaaa ctatacaaaa caaaaacaac ttacactttg
      181 tgataatggg agacattgct tcattggtca atactgttta ttatacaaac accctgggaa
      241 agataatctg tctgaaacaa ggtgacagag ttttttctac tgggcaagac aggcagaaac
      301 atgaaagaaa ctagagaccc agggtttggg tagatactca gtacacattg gttaataaag
      361 aatcaatcct ttggattatt tactaccatt attttaattt tccaatgcat aagttaggca
      421 ataggaaagt gaaccaattt cctagaggca tgcagctaat ctggtcagta cacacagccc
      481 cgggacacag gagacagaac tgaactctcc ttatcttgta ccccaccctg gcaccagcct
      541 atcaggactt aaggaataca cagcagataa agagacaacg gggggggggg gtgaaggacc
      601 aagaattgga gcagaagaat ggaagaaaag gaagaatgga aatcctgttt gagttctgca
      661 tgtctgccca tccattcaga actactgaag agacaaagga gacctttgac ttcccaaaca
      721 gagaagaaaa cctcacccca ggaaggcttt cctaaaagag gatgtcagga aatgcagcta
      781 atttgctttt cctggaagaa tttgggtcca aagttttctg cccttcttcc tacctaggtt
      841 acattttccc cagcctggag gccacacaag tcaatcaata accaaagata cctattatag
      901 ggcacagtgg gtggagcctg tattaccatc tgcctatatg accactcctc agtttctgaa
      961 gagaagcacc ctgtatagtt
//
LOCUS       MVTRR5S       120 bp ss-rRNA            BCT       02-AUG-1990
DEFINITION  M.thermolithotrophicus 5S ribosomal RNA.
ACCESSION   M34911
KEYWORDS    5S ribosomal RNA.
SOURCE      M.thermolithotrophicus (strain DSM 2095) rRNA.
  ORGANISM  Methanococcus thermolithotrophicus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanococcales; Methanococcaceae.
REFERENCE   1  (bases 1 to 120)
  AUTHORS   Willekens,P., Stetter,K.O., Vandenberghe,A., Huysmans,E. and
            De Wachter,R.
  TITLE     The structure of 5S ribosomal RNA in the methanogenic
            archaebacteria Methanolobus tindarius and Methanococcus
            thermolithotrophicus
  JOURNAL   FEBS Lett. 204, 273-278 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      120     5S ribosomal RNA
BASE COUNT       30 a     33 c     31 g     26 t
ORIGIN      
        1 tgatataacg gtcatagcgg aggagccaca tccgatcccg tcccgatctc ggaaattaag
       61 ccctccagcg ataccctaag tactgccatt tggtgggaac agggtgacgc tgttgatcac
//
LOCUS       SF6LYZ        996 bp ds-DNA             PHG       02-AUG-1990
DEFINITION  Bacteriophage SF6 fragment D lysozyme gene, complete cds.
ACCESSION   M35825
KEYWORDS    lysozyme.
SOURCE      Bacteriophage SF6 DNA.
  ORGANISM  Bacteriophage SF6
            Viridae; ds-DNA viruses.
REFERENCE   1  (bases 1 to 996)
  AUTHORS   Verma,M.
  TITLE     Molecular cloning and sequencing of lysozyme gene of bacteriophage
            SF6 of Bacillus subtilis
  JOURNAL   Curr. Microbiol. 13, 299-301 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         13      963     lysozyme
BASE COUNT      284 a    223 c    243 g    246 t
ORIGIN      
        1 ctgcagatcg taatggatcc tcgcctacgt gaagaagtag tacggctgat aatcgcatta
       61 acgagtgata atggagcatc actgtcaaaa cggcttcaat caagggtctc ggcgctcgag
      121 aagacgtctc aaatacactc tgatactatc ctccggatca cccagggact cgatgatgca
      181 aacaaacgaa tcatcgctct tgagcaaagt cgggatgact tggttgcatc agtcagtgat
      241 gctcaacttg caatctccag attggaaagc tctatcggag ccctccaaac agttgtcaat
      301 ggacttgatt cgagtgttac ccagttgggt gctcgagtgg gacaacttga gacaggactt
      361 gcagacgtac gcgttgatca cgacaatctc gttgcgagag tggatactgc agaacgtaac
      421 attggatcat tgaccactga gctatcaact ctgacgttac gagtaacatc catacaagcg
      481 gatttcgaat ctaggatatc cacgttagag cgcacggcgg tcactagcgc gggagctccc
      541 ctctcaatcc gtaataaccg tataaccatg ggattaaatg atggactcac gttgtcaggg
      601 aataatctcg ccatccgatt gccaggaaat acgggtctga atattcaaaa tggtggactt
      661 cagtttcgat ttaatactga tcaattccag atagttaata ataacttgac tctcaagacg
      721 actgtgtttg attctatcaa ctcaaggata ggcgcaactg agcaaagtta cgtggcgtcg
      781 gcagtgactc ccttgagatt aaacagtagc acgaaggtgc tggatatgct aatagacatg
      841 tcaacacttg aaattaattc tagtggacag ctaactgtta gatcgacatc cccgaatttg
      901 aggtatccga tagctgatgt tagcggcggt atcggaatga gtccaaatta taggtttagg
      961 tgaggatcag accaccccgc ggcactgggg ctgcag
//
LOCUS       SYESC6       1071 bp ss-RNA             VRL       02-AUG-1990
DEFINITION  Sonchus yellow net virus M1 protein gene, complete cds.
ACCESSION   M35689
KEYWORDS    M1 protein.
SOURCE      Sonchus yellow net virus RNA, clones pGL[9,32].
  ORGANISM  Sonchus yellow net virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Phytorhabdovirus.
REFERENCE   1  (bases 1 to 1071)
  AUTHORS   Hillman,B.I., Heaton,L.A., Hunter,B.G., Modrell,B. and Jackson,A.O.
  TITLE     Structure of the gene encoding the m1 protein of sonchus yellow net
            virus
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.I.Hillman, 28-JUN-1990.
FEATURES       from  to/span     description
    pept         72      932     M1 protein
    mRNA          1     1071     M1 mRNA
BASE COUNT      403 a    197 c    235 g    236 t
ORIGIN      
        1 aacttaaaat aataataata tcaaataatc aagcagacat aaaaacaggg ttatctctat
       61 tgtatttcag aatggcaggt atatacgcag tttcaatcaa gggccatgca tcggcgatat
      121 tcaaccgaca agagaaggaa atcagcacag gacgtgtttg ggaagttatg aagaagataa
      181 tgtccttaaa gcctactagg gtcattatgt catattctct ccttagatca gcgttggata
      241 aaagcaggca acttacccaa gaagaataca atattatgca attgatccta gacggctgtg
      301 tcaaaaccct tgagcctgtt gcagccagcg ggatatgcat agatgtgaac ctgggaaaat
      361 gcaccaagca cacaatccca tttggcatca caaataacga cgtaggtcac gtgtcagtag
      421 tgatgacatt accatttttg gaagaaggat gttataacat aggcgcatgt tttgatggaa
      481 gattgtccaa gtcaaggagc gacgcaagcc attatgctgt tgatgtaagt ctggaaattt
      541 acttaaagtc attatccaga gacgaagctg aagaacaaat atccaaggga acatcagtgt
      601 accctttcaa aattaatcat cctacatact ttgaagatga gactgataca tcagatggag
      661 aaagtcttag tggacgagca agctcagatg atgggccaga agacggaggc cacggacatg
      721 gtgataaaaa caatgagaaa aactctggga aggtggtaag gaaaaggaaa tcaaggaaag
      781 agatcgatgt aggaaggttc aagatggtta aggataatat cataaacacc agatccggac
      841 tgctaaaatc catgagagga acgggacatc ggaaacacag aacgcaggag atcacagaag
      901 gatacaacta cggagacaag gacgcagaat gaaccaaccc accaaaagca gaaaacttgc
      961 tactatataa taatatgaag agcccacaat aagatcacac aatgtcaaac agcacaactc
     1021 caaatccatt aggtaatacc tgtattattc cgtgcactta tataagaaaa a
//
LOCUS       BLYCPPSBEF    681 bp ds-DNA             ORG       02-AUG-1990
DEFINITION  Barley chloroplast cytochrome b-559 apoprotein (psbEF) genes,
            complete cds.
ACCESSION   M35616
KEYWORDS    cytochrome b-559 apoprotein; psbE gene; psbF gene.
SOURCE      Barley chloroplast DNA, clones pDS6 and pDS6-RBSII.
  ORGANISM  Chloroplast Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Hordeum vulgare.
REFERENCE   1  (bases 1 to 681)
  AUTHORS   Krupinska,K.
  TITLE     Characterization and in vitro expression of the cytochrome b-559
            genes of barley. II. in vitro transcription and translation
  JOURNAL   Carlsberg Res. Commun. 53, 233-246 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        139      390     cytochrome b-559 9.4 kDa protein (psbE)
    pept        401      520     cytochrome b-559 4.5 kDa protein (psbF)
    binding     129      132     psbE ribosomal binding site (put.)
    binding     390      394     psbF ribosomal binding site (put.)
BASE COUNT      202 a    131 c    123 g    225 t
ORIGIN      
        1 cgactcaaaa tacacctttg gtacaaaatt gacaatctca caaggatgaa atatcagtaa
       61 ttttctattt actggttgat cccatctttt acggaatcaa ttcctttttt gaatgtacaa
      121 aaattttggg agttcagcat gtctggaagc acgggagaac gttcttttgc tgatattatt
      181 accagtattc gatactgggt tattcatagc attactatac cttccctatt cattgcgggt
      241 tggttatttg tcagtacggg tttagcttat gacgtgtttg gaagtcctag gccaaacgag
      301 tatttcacgg aaagccgaca aggaattccg ttaataaccg accgttttga ttctttagaa
      361 caactcgatg aatttagtag atccttttag gaggccctca atgaccatag atcgaaccta
      421 tcctattttt acagtgcgat ggctggctat tcacggacta gctgtaccta ctgttttttt
      481 cttgggatca atatcagcaa tgcagttcat ccaacgataa accaaattcc aactatagaa
      541 ctatgacaca atcaaacccg aatgaacaaa atgttgaatt gaatcgtacc agtctatact
      601 ggggtttatt actcattttt gtacttgctg ttttattttc caattacttc ttcaattgag
      661 agaaagaaag agactaacaa g
//
LOCUS       BOVIL1BA      944 bp ds-DNA             MAM       02-AUG-1990
DEFINITION  Bovine interleukin-1 beta mRNA, complete cds.
ACCESSION   M35589
KEYWORDS    interleukin-1 beta.
SOURCE      Bovine alveolar macrophage, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 944)
  AUTHORS   Leong,S.R., Flaggs,G.M., Lawman,M. and Gray,P.W.
  TITLE     The nucleotide sequence for the cDNA of bovine interleukin-1 beta
  JOURNAL   Nucleic Acids Res. 16, 9054-9054 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         45      845     interleukin-1 beta precursor
    sigp         45      383     interleukin-1 beta signal peptide
    matp        384      842     interleukin-1 beta
BASE COUNT      270 a    234 c    228 g    212 t
ORIGIN      
        1 attctctcca gccaaccttc attgcccagg tttctgaaac agccatggca accgtacctg
       61 aacccatcaa cgaaatgatg gcttactaca gtgacgagaa tgagctgtta tttgaggctg
      121 atgaccctaa acagatgaag agctgcatcc aacacctgga cctcggttcc atgggagatg
      181 gaaacatcca gctgcagatt tctcaccagt tctacaacaa aagcttcagg caggtggtgt
      241 cggtcatcgt ggccatggag aagctgagga acagtgccta cgcacatgtc ttccatgatg
      301 atgacctgag gagcatcctt tcattcatct ttgaagaaga gcctgtcatc ttcgaaacgt
      361 cctccgacga gtttctgtgt gacgcacccg ttcagtcaat aaagtgcaaa ctccaggaca
      421 gagagcaaaa atccctggtg ctggctagcc catgtgtgct gaaggctctc cacctcctct
      481 cacaggaaat gaaccgagaa gtggtgttct gcatgagctt tgtgcaagga gaggaaagag
      541 acaacaagat tcctgtggcc ttgggtatca aggacaagaa tctatacctg tcttgtgtga
      601 aaaaaggtga tacgcccacc ctgcagctgg aggaagtaga ccccaaagtc taccccaaga
      661 ggaatatgga aaagcgcttt gtcttctaca agacagaaat caagaataca gttgaatttg
      721 agtctgtcct gtaccctaac tggtacatca gcacttctca aatcgaagaa aggcccgtct
      781 tcctgggaca ttttcgagct ggccaggata taactgactt cagaatggaa accctctctc
      841 cctaaagaaa gccataccca gggagtccac gtgggctgaa taaccccgag gactggcaga
      901 agggaaggga agaatgtagc tgcagcctga acttcactgt tgtc
//
LOCUS       ECOK88ADA     744 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  E.coli K88ad subunit protein gene, partial cds.
ACCESSION   M35637
KEYWORDS    K88ad subunit protein.
SOURCE      E.coli (K12 strain 5K) DNA, clones pFM205, pBad, pBac.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 744)
  AUTHORS   Gaastra,W., Klemm,P. and de Graaf,F.K.
  TITLE     The nucleotide sequence of the K88ad protein subunit of porcine
            enterotoxigenic Escherichia coli
  JOURNAL   FEMS Microbiol. Lett. 18, 177-183 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   744     K88ad subunit protein (AA at 1)
BASE COUNT      192 a    137 c    205 g    210 t
ORIGIN      
        1 aatggttcgg tcgatatcgg tggtagtatc actgcagatg attatcgtca gaaatgggaa
       61 tggaaagttg gtacaggtct taatggattt ggtagtgtat tgaatgacct gaccaatggt
      121 ggaaccgaac tgaccattac tgttactggt aataagccaa gtttgttagg ccgaaccaga
      181 gaggcatttg ctacgccagt ggttggtggt gtagatggaa ttcctcatat tgcatttact
      241 gactatgaag gagcttctgt agaactcaga aaccctgatg gtgaaattga aaaaggttta
      301 gcatattttg ttctgccgat gaaaaatgca gagggcacta aagttggttc agtgaaagtg
      361 aatgcatctt atgccggtgc gctcgggaga ggtggggtta cttctgcgga cggggagctg
      421 atgtcgcttt ttgccgaggg gtcgcacgct atcttttatg gtggtttgcc gacgaatgtt
      481 cagaattctg cactcccggg tgggagtgcc gcagcggcgc gcacagagtt gtttggaagt
      541 ctatcaaaaa atgatattct cggacagatt caaagagtaa acgcaaatat tacttctctt
      601 gttaacgtcc caggttcttt caatgaaaac atggcgtaca ctgatggatc cgttgtttcc
      661 gttgcctatg cactgggtat tgcaaaccgg cagactattg aggcaacttt taatcaggct
      721 gtaactacca gcactcagtg gagc
//
LOCUS       HUMTNFTRP    1379 bp ss-mRNA            SYN       02-AUG-1990
DEFINITION  Synthetic human tumor necrosis factor mRNA, complete cds.
ACCESSION   M35592
KEYWORDS    tumor necrosis factor.
SOURCE      Altered human leukemic B-cell line Ball-1, cDNA to mRNA, clone
            pM324-346.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 1379)
  AUTHORS   Nobuhara,M., Kanamori,T., Nagase,Y., Nii,A., Morishita,H.,
            Tohyama,J., Andoh,S. and Kurimoto,M.
  TITLE     The expression of human tumor necrosis factor in E.coli
  JOURNAL   Nucleic Acids Symp Ser 17, 131-134 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        178      879     synthetic tumor necrosis factor precursor
    sigp        178      399     synthetic tumor necrosis factor signal peptide
    matp        400      876     synthetic tumor necrosis factor
BASE COUNT      318 a    438 c    336 g    287 t
ORIGIN      
        1 agcagacgct ccctcagcaa ggacagcaga ggaccagcta agagggagag aagcaactac
       61 agaccccccc tgaaaacaaa aatcagacgc cacatcccct gacaagctgc caggcaggtt
      121 ctcttcctct cacatactga cccacggctc caccctctct cccctggaaa ggacaccatg
      181 agcactgaaa gcatgatccg ggacgtggag ctggccgagg aggcgctccc caagaagaca
      241 ggggggcccc agggctccag gcggtgcttg ttcctcagcc tcttctcctt cctgatcgtg
      301 gcaggcgcca ccacgctctt ctgcctgctg cactttggag tgatgggccc ccagagggaa
      361 gagttcccca gggacctctc tctaatcagc cctctggccc aggcagtcag atcatcttct
      421 cgaaccccga gtgacaagcc tgtagcccat gttgtagcaa accctcaagc tgaggggcag
      481 ctccagtggc tgaaccgccg ggccaatgcc ctcctggcca atggcgtgga gctgagagat
      541 aaccagctgg tggtgccatc agagggcctg tacctcatct actcccaggt cctcttcaag
      601 ggccaaggct gcccctccac ccatgtgctc ctcacccaca ccatcagccg catcgccgtc
      661 tcctaccaga ccaaggtcaa cctcctctct gccatcaaga gcccctgcca gagggagacc
      721 ccagaggggg ctgaggccaa gccctggtat gagcccatct atctgggagg ggtcttccag
      781 ctggagaagg gtgaccgact cagcgctgag atcaatcggc ccgactatct cgactttgcc
      841 gagtctgggc aggtctactt tgggatcatt gccctgtgag gaggacgaac atccaacctt
      901 cccaaacgcc tcccctgtcc caatcccttt attaccccct ccttcagaca ccctcaacct
      961 cttctggctc aaaaagagaa ttgggggctt agggtcggaa cccaagctta gaactttaag
     1021 caacaagacc accacttcga aacctgggat tcaggaatgt gtggcctgca cagtgaaggt
     1081 gctggcaacc actaagaatt caaactgggg cctccagaac tcactggggc ctacagcttt
     1141 gatccctgac atctggaatc tggagaccag ggagcctttg gttctgacca gaatgctgca
     1201 ggacttgaga agacctcacc tagaaattga cacaagtgga ccttaggcct tcctctctcc
     1261 agatgtttcc agacttcctt gagacacgga gcccagccct ccccatggag ccagctccct
     1321 ctatttatgt ttgcacttgt gattatttat tatttattta ttatttattt atttacaga
//
LOCUS       SHREF1BA      694 bp ss-mRNA            INV       02-AUG-1990
DEFINITION  A.salina elongation factor 1-beta mRNA, complete cds.
ACCESSION   M35636
KEYWORDS    elongation factor 1-beta.
SOURCE      A.salina cyst, cDNA to mRNA, clones B[4,27].
  ORGANISM  Artemia salina
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Branchiopoda; Sarsostraca; Anostraca; Artimiidae.
REFERENCE   1  (bases 1 to 694)
  AUTHORS   Maessen,G.D.F., Amons,R., Maassen,J.A. and Moeller,W.
  TITLE     Primary structure of elongation factor 1-beta from Artemia
  JOURNAL   FEBS Lett. 208, 77-83 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         26      649     elongation factor 1-beta
    signal      677      682     polyA signal
BASE COUNT      240 a    117 c    150 g    187 t
ORIGIN      
        1 ttttctgctg taaaaaaaga tcaaaatggc caatattgac ctaaaagccg aaaagggaca
       61 ggaacaacta aacgagcttt tggccaacaa gagttacctt caaggatatg aaccatccca
      121 agaggatgtt gctgccttca atcagctaaa taaagccccg tctgacaaat ttccttactt
      181 gcttcgatgg tacaagcaca ttagttcttt ttccgatgca gaaaagaaag gatttcccgg
      241 tattccaact tctgcatcca aagaagaaga tgatgatgtt gatctttttg gatcagatga
      301 ggaagatgaa gaagctgaaa aaatcaaggc agaaaggatg aaggcctata gtgataagaa
      361 gtcaaagaaa cctgcaattg ttgccaaatc ctctgtcatt ttggatataa agccatggga
      421 cgacgaaacg gatatggcag aaatggagaa actagttcga agtgtacaaa tggatggtct
      481 tgtctggggt gcagctaaac ttattcccct tgcatatgga attaagaagc tatcaatcat
      541 gtgtgttgta gaagatgata aagtatccat tgacgaactt caagaaaaaa tttctgaatt
      601 tgaagacttt gtccagtctg tggatattgc cgctttcaac aaagtttaat tcttaattaa
      661 ttacttgtga gaagggaata aatcagggca attg
//
LOCUS       YSCRO18SA     813 bp ds-DNA             PLN       02-AUG-1990
DEFINITION  Yeast (S.cerevisiae) ribosomal operon DNA with 18S rRNA, 5' end.
ACCESSION   M35588
KEYWORDS    18S ribosomal RNA; ribosomal operon.
SOURCE      Yeast (S.cerevisiae) DNA, clone pVIrA-3.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 813)
  AUTHORS   Skryabin,K.G., Zakhar'ev,V.M., Rubtsov,P.M. and Baev,A.A.
  TITLE     Nucleotide sequence of the presumed region of initiation of
            transcription of the yeast ribosomal operon
  JOURNAL   Dokl. Biochem. 247, 288-290 (1979)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA        653  >   813     18S rRNA
BASE COUNT      221 a    140 c    194 g    258 t
ORIGIN      
        1 gaattcgagt aggcttgtcg ttcgttatgt ttttgtaaat ggcctcgtca aacggtggag
       61 agagtcgcta ggtgatcgtc agatctgcct agtctctata cagcgtgttt aattgacatg
      121 ggttgatgcg tattgagaga tacaatttgg gaagaaattc ccagagtgtg tttcttttgc
      181 gtttaacctg aacagtctca tcgtgggcat cttgcgattc cattggtgag cagcgaagga
      241 tttggtggat tactagctaa tagcaatcta tttcaaagaa ttcaaacttg ggggaatgcc
      301 ttgttgaata gccggtcgca agacgtgatt cttgaagtgt aacctcctct caaatcagcg
      361 atatcaaacg taccattccg tgaacaccgg ggtatctgtt tggtggaacc tgattagagg
      421 aaactcaaag agtgctatgg tatggtgacg gagtgcgctg gtcaagagcg taagcttttg
      481 aacagagagc atttccggca gcagagagac ccgaaaaagc aatttttctg gaatttcagc
      541 tgtttccaaa ctcaataagt atcttctagc aagagggaat aggtgggaaa aaaaaaaaga
      601 gatttcggtt tctttctttt ttactgcttg ttgcttcttc ttttaagata gttatctggt
      661 tgatcctgcc agtagtcata tgcttgtctc aaagattaag ccatgcatgt ctaagtataa
      721 gcaatttata cagtgaaact gcgaatggct cattaaatca gttatcgttt atttgatagt
      781 tcctttacta catggtataa ctgtggtaat tct
//
LOCUS       YSJLEU2A      968 bp ds-DNA             PLN       02-AUG-1990
DEFINITION  Y.lipolytica beta-isopropylmalate dehydrogenase (LEU2) gene, 5'
            end.
ACCESSION   M35579
KEYWORDS    beta-isopropylmalate dehydrogenase.
SOURCE      Y.lipolytica (strain 21109-9) DNA, clone pINA92.
  ORGANISM  Yarrowia lipolytica
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 968)
  AUTHORS   Gaillardin,C. and Ribet,A.-M.
  TITLE     LEU2 directed expression of beta-galactosidase activity and
            phleomycin resistance in Yarrowia lipolytica
  JOURNAL   Curr. Genet. 11, 369-375 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        291  >   968     beta-isopropylmalate dehydrogenase (LEU2)
    signal      163      172     CAAT box
    signal      213      223     TATA box
BASE COUNT      215 a    303 c    242 g    208 t
ORIGIN      
        1 gaattccgtc gtcgcctgag tcatcattta tttaccagtt ggccacaaac ccttgacgat
       61 ctcgtatgtc ccctccgaca tactcccggc cggctgggta cgttcgatag cgctatcggc
      121 atcgacaagg tttgggtccc tagccgatac cgcactacct gagtcacaat cttcggaggt
      181 ttagtcttcc acatagcacg ggcaaaagtg cgtatatata caagagcgtt tgccagccac
      241 agattttcac tccacacacc acatcacaca tacaaccaca cacatccaca atggaacccg
      301 aaactaagaa gaccaagact gactccaaga agattgttct tctcggcggc gacttctgtg
      361 gccccgaggt gattgccgag gccgtcaagg tgctcaagtc tgttgctgag gcctccggca
      421 ccgagtttgt gttcgaggac cgactcattg gaggagctgc cattgagaag gagggcgagc
      481 ccatcaccga cgctactctc gacatctgcc gaaaggctga ctctattatg ctcggtgctg
      541 tcggaggcgc tgccaacacc gtatggacca ctcccgacgg acgaaccgac gtgcgacccg
      601 agcagggtct cctcaagctg cgaaaggacc tgaacctgta cgccaacctg cgaccctgcc
      661 agctgctgtc gcccaagctc gccgatctct cccccatccg aaacgttgag ggcaccgact
      721 tcatcattgt ccgagagctc gtcggaggta tctactttgg agagcgaaag gaggatgacg
      781 gatctggcgt cgcttccgac accgagacct actccgttcc tgaggttgag cgaattgccc
      841 gaatggccgc cttcctggcc cttcagcata acccccctct tcccgtgtgg tctcttgaca
      901 aggccaacgt gctggcctcc tctcgacttt ggcgaaagac tgttactcga gtcctcaagg
      961 acgaattc
//
LOCUS       YSTARSA1      131 bp ds-DNA             PLN       02-AUG-1990
DEFINITION  Yeast ARS site linked to HML locus, segment 1.
ACCESSION   M17505
KEYWORDS    autonomous replication; origin of replication.
SEGMENT     1 of 2
SOURCE      Yeast DNA.
  ORGANISM  Yeast sp.
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE   1  (bases 1 to 131)
  AUTHORS   Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J.,
            Nasmyth,K.A. and Hicks,J.B.
  TITLE     Localization and sequence analysis of yeast origins of DNA
            replication
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site         92      101     ARS consensus sequence
BASE COUNT       41 a     14 c     19 g     57 t
ORIGIN      Chromosome III.
        1 atggaatgta atttcttaag tattctatat gtacttaaaa cctattaata tatggatcaa
       61 cacagtatct tatgaatggg tttttgattt ttttatgttt ttttaaaaca ttaaagtttt
      121 cggcacggac t
//
LOCUS       YSTARSA2      146 bp ds-DNA             PLN       02-AUG-1990
DEFINITION  Yeast ARS site linked to HML locus, segment 2.
ACCESSION   M17506
KEYWORDS    autonomous replication; origin of replication.
SEGMENT     2 of 2
SOURCE      Yeast DNA.
  ORGANISM  Yeast sp.
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE   1  (bases 1 to 146)
  AUTHORS   Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J.,
            Nasmyth,K.A. and Hicks,J.B.
  TITLE     Localization and sequence analysis of yeast origins of DNA
            replication
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site        136      146     ARS consensus sequence
BASE COUNT       46 a     22 c     21 g     57 t
ORIGIN      Chromosome III; about 2.8 kb downstream of of segment 1.
        1 cttctatgtt acttcttcaa catgaaagcc cgaacgtttg caatgatatc ctctttttca
       61 cccgattatt ttcatttatt aagtaagtgg ggttctgggt gaaaaaatca aaacaaaaat
      121 atgaatttgt aatgattttt atattt
//
LOCUS       YSTARSB1      147 bp ds-DNA             PLN       02-AUG-1990
DEFINITION  Yeast ARS site linked to HMR locus, segment 1.
ACCESSION   M17507
KEYWORDS    autonomous replication; origin of replication.
SEGMENT     1 of 2
SOURCE      Yeast DNA.
  ORGANISM  Yeast sp.
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE   1  (bases 1 to 147)
  AUTHORS   Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J.,
            Nasmyth,K.A. and Hicks,J.B.
  TITLE     Localization and sequence analysis of yeast origins of DNA
            replication
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site        112      122     ARS consensus sequence
BASE COUNT       56 a     14 c      9 g     68 t
ORIGIN      Chromosome III.
        1 aatagtatca atatatatat atatatatat atttatttgt ttactttttc tatcagtgtt
       61 ttcaattttt tattaaacaa tgtttgattt tttaaatcgc aatttaatac ctaaatataa
      121 aaaatgttat tatattgcaa aaaccca
//
LOCUS       YSTARSB2       85 bp ds-DNA             PLN       02-AUG-1990
DEFINITION  Yeast ARS site linked to HMR locus, segment 2.
ACCESSION   M17508
KEYWORDS    autonomous replication; origin of replication.
SEGMENT     2 of 2
SOURCE      Yeast DNA.
  ORGANISM  Yeast sp.
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina.
REFERENCE   1  (bases 1 to 85)
  AUTHORS   Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J.,
            Nasmyth,K.A. and Hicks,J.B.
  TITLE     Localization and sequence analysis of yeast origins of DNA
            replication
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    site         63       73     ARS consensus sequence
BASE COUNT       33 a      7 c      5 g     40 t
ORIGIN      Chromosome III; about 1.9 kb downstream of segment 1.
        1 atttatcatg ttttggtatg ataatttaat ttttaaaaaa acaaatttaa ttgacctcat
       61 taattaatat ttattaatac cttta
//
LOCUS       HUMROS1      7375 bp ss-mRNA            PRI       02-AUG-1990
DEFINITION  Human transmembrane tyrosine-specific protein kinase (ROS1) mRNA,
            complete cds.
ACCESSION   M34353
KEYWORDS    oncogene; transmembrane tyrosine-specific protein kinase.
SOURCE      Human glioblastoma cell line SW-1088, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 7375)
  AUTHORS   Birchmeier,C., O'Neil,K., Riggs,M. and Wigler,M.
  TITLE     Characterization of ROS1 cDNA from a human glioblastoma cell line
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4799-4803 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.H.Wigler, 15-MAY-1990.
FEATURES       from  to/span     description
    pept        207     7250     transmembrane tyrosine-specific protein kinase
                                 precursor /hgml_locus_uid="LN0124W"
                                 /nomgen="ROS1" /map="6q21-q22"
    sigp        207      314     transmembrane tyrosine-specific protein kinase
                                 signal peptide
    matp        208     7247     transmembrane tyrosine-specific protein kinase
BASE COUNT     2203 a   1496 c   1605 g   2071 t
ORIGIN      
        1 ccgcattcaa gctttcaagc attcaaaggt ctaaatgaaa aaggctaagt attatttcaa
       61 aaggcaagta tatcctaata tagcaaaaca aacaaagcaa aatccatcag ctactcctcc
      121 aattgaagtg atgaagccca aataattcat atagcaaaat ggagaaaatt agaccggcca
      181 tctaaaaatc tgccattggt gaagtgatga agaacattta ctgtcttatt ccgaagcttg
      241 tcaattttgc aactcttggc tgcctatgga tttctgtggt gcagtgtaca gttttaaata
      301 gctgcctaaa gtcgtgtgta actaatctgg gccagcagct tgaccttggc acaccacata
      361 atctgagtga accgtgtatc caaggatgtc acttttggaa ctctgtagat cagaaaaact
      421 gtgctttaaa gtgtcgggag tcgtgtgagg ttggctgtag cagcgcggaa ggtgcatatg
      481 aagaggaagt actggaaaat gcagacctac caactgctcc ctttgcttct tccattggaa
      541 gccacaatat gacattacga tggaaatctg caaacttctc tggagtaaaa tacatcattc
      601 agtggaaata tgcacaactt ctgggaagct ggacttatac taagactgtg tccagaccgt
      661 cctatgtggt caagcccctg caccccttca ctgagtacat tttccgagtg gtttggatct
      721 tcacagcgca gctgcagctc tactcccctc caagtcccag ttacaggact catcctcatg
      781 gagttcctga aactgcacct ttgattagga atattgagag ctcaagtccc gacactgtgg
      841 aagtcagctg ggatccacct caattcccag gtggacctat tttgggttat aacttaaggc
      901 tgatcagcaa aaatcaaaaa ttagatgcag ggacacagag aaccagtttc cagttttact
      961 ccactttacc aaatactatc tacaggtttt ctattgcagc agtaaatgaa gttggtgagg
     1021 gtccagaagc agaatctagt attaccactt catcttcagc agttcaacaa gaggaacagt
     1081 ggctcttttt atccagaaaa acttctctaa gaaagagatc tttaaaacat ttagtagatg
     1141 aagcacattg ccttcggttg gatgctatat accataatat tacaggaata tctgttgatg
     1201 tccaccagca aattgtttat ttctctgaag gaactctcat atgggcgaag aaggctgcca
     1261 acatgtctga tgtatctgac ctgagaattt tttacagagg ttcaggatta atttcttcta
     1321 tctccataga ttggctttat caaagaatgt atttcatcat ggatgaactg gtatgtgtct
     1381 gtgatttaga gaactgctca aacatcgagg aaattactcc accctctatt agtgcacctc
     1441 aaaaaattgt ggctgattca tacaatgggt atgtctttta cctcctgaga gatggcattt
     1501 atagagcaga ccttcctgta ccatctggcc ggtgtgcaga agctgtgcgt attgtggaga
     1561 gttgcacgtt aaaggacttt gcaatcaagc cacaagccaa gcgaatcatt tacttcaatg
     1621 acactgccca agtcttcatg tcaacatttc tggatggctc tgcttcccat ctcatcctac
     1681 ctcgcatccc ctttgctgat gtgaaaagtt ttgcttgtga aaacaatgac tttcttgtca
     1741 cagatggcaa ggtcattttc caacaggatg ctttgtcttt taatgaattc atcgtgggat
     1801 gtgacctgag tcacatagaa gaatttgggt ttggtaactt ggtcatcttt ggctcatcct
     1861 cccagctgca ccctctgcca ggccgcccgc aggagctttc ggtgctgttt ggctctcacc
     1921 aggctcttgt tcaatggaag cctcctgccc ttgccatagg agccaatgtc atcctgatca
     1981 gtgatattat tgaactcttt gaattaggcc cttctgcctg gcagaactgg acctatgagg
     2041 tgaaagtatc cacccaagac cctcctgaag tcactcatat tttcttgaac ataagtggaa
     2101 ccatgctgaa tgtacctgag ctgcagagtg ctatgaaata caaggtttct gtgagagcaa
     2161 gttctccaaa gaggccaggc ccctggtcag agccctcagt gggtactacc ctggtgccag
     2221 ctagtgaacc accatttatc atggctgtga aagaagatgg gctttggagt aaaccattaa
     2281 atagctttgg cccaggagag ttcttatcct ctgatatagg aaatgtgtca gacatggatt
     2341 ggtataacaa cagcctctac tacagtgaca cgaaaggcga cgtttttgtg tggctgctga
     2401 atgggacgga tatctcagag aattatcacc tacccagcat tgcaggagca ggggctttag
     2461 cttttgagtg gctgggtcac tttctctact gggctggaaa gacatatgtg atacaaaggc
     2521 agtctgtgtt gacgggacac acagacattg ttacccacgt gaagctattg gtgaatgaca
     2581 tggtggtgga ttcagttggt ggatatctct actggaccac actctattca gtggaaagca
     2641 ccagactaaa tggggaaagt tcccttgtac tacagacaca gccttggttt tctgggaaaa
     2701 aggtaattgc tctaacttta gacctcagtg atgggctcct gtattggttg gttcaagaca
     2761 gtcaatgtat tcacctgtac acagctgttc ttcggggaca gagcactggg gataccacca
     2821 tcacagaatt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactata
     2881 gtggtcggct gttctggatc aatggcttta ggattatcac aactcaagaa ataggtcaga
     2941 aaaccagtgt ctctgttttg gaaccagcca gatttaatca gttcacaatt attcagacat
     3001 cccttaagcc cctgccaggg aacttttcct ttacccctaa ggttattcca gattctgttc
     3061 aagagtcttc atttaggatt gaaggaaatg cttcaagttt tcaaatcctg tggaatggtc
     3121 cccctgcggt agactggggt gtagttttct acagtgtaga atttagtgct cattctaagt
     3181 tcttggctag tgaacaacac tctttacctg tatttactgt ggaaggactg gaaccttatg
     3241 ccttatttaa tctttctgtc actccttata cctactgggg aaagggcccc aaaacatctc
     3301 tgtcacttcg agcacctgaa acagttccat cagcaccaga gaaccccaga atatttatat
     3361 taccaagtgg aaaatgctgc aacaagaatg aagttgtggt ggaatttagg tggaacaaac
     3421 ctaagcatga aaatggggtg ttaacaaaat ttgaaatttt ctacaatata tccaatcaaa
     3481 gtattacaaa caaaacatgt gaagactgga ttgctgtcaa tgtcactccc tcagtgatgt
     3541 cttttcaact tgaaggcatg agtcccagat gctttattgc cttccaggtt agggccttta
     3601 catctaaggg gccaggacca tatgctgacg ttgtaaagtc tacaacatca gaaatcaacc
     3661 catttcctca cctcataact cttcttggta acaagatagt ttttttagat atggatcaaa
     3721 atcaagttgt gtggacgttt tcagcagaaa gagttatcag tgccgtttgc tacacagctg
     3781 ataatgagat gggatattat gctgaagggg actcactctt tcttctgcac ttgcacaatc
     3841 gctctagctc tgagcttttc caagattcac tggtttttga tatcacagtt attacaattg
     3901 actggatttc aaggcacctc tactttgcac tgaaagaatc acaaaatgga atgcaagtat
     3961 ttgatgttga tcttgaacac aaggtgaaat atcccagaga ggtgaagatt cacaatagga
     4021 attcaacaat aatttctttt tctgtatatc ctcttttaag tcgcttgtat tggacagaag
     4081 tttccaattt tggctaccag atgttctact acagtattat cagtcacacc ttgcaccgaa
     4141 ttctgcaacc cacagctaca aaccaacaaa acaaaaggaa tcaatgttct tgtaatgtga
     4201 ctgaatttga gttaagtgga gcaatggcta ttgatacctc taacctagag aaaccattga
     4261 tatactttgc caaagcacaa gagatctggg caatggatct ggaaggctgt cagtgttgga
     4321 gagttatcac agtacctgct atgctcgcag gaaaaaccct tgttagctta actgtggatg
     4381 gagatcttat atactggatc atcacagcaa aggacagcac acagatttat caggcaaaga
     4441 aaggaaatgg ggccatcgtt tcccaggtga aggccctaag gagtaggcat atcttggctt
     4501 acagttcagt tatgcagcct tttccagata aagcgtttct gtctctagct tcagacactg
     4561 tggaaccaac tatacttaat gccactaaca ctagcctcac aatcagatta cctctggcca
     4621 agacaaacct cacatggtat ggcatcacca gccctactcc aacatacctg gtttattatg
     4681 cagaagttaa tgacaggaaa aacagctctg acttgaaata tagaattctg gaatttcagg
     4741 acagtatagc tcttattgaa gatttacaac cattttcaac atacatgata cagatagctg
     4801 taaaaaatta ttattcagat cctttggaac atttaccacc aggaaaagag atttggggaa
     4861 aaactaaaaa tggagtacca gaggcagtgc agctcattaa tacaactgtg cggtcagaca
     4921 ccagcctcat tatatcttgg agagaatctc acaagccaaa tggacctaaa gaatcagtcc
     4981 gttatcagtt ggcaatctca cacctggccc taattcctga aactcctcta agacaaagtg
     5041 aatttccaaa tggaaggctc actctccttg ttactagact gtctggtgga aatatttatg
     5101 tgttaaaggt tcttgcctgc cactctgagg aaatgtggtg tacagagagt catcctgtca
     5161 ctgtggaaat gtttaacaca ccagagaaac cttattcctt ggttccagag aacactagtt
     5221 tgcaatttaa ttggaaggct ccattgaatg ttaacctcat cagattttgg gttgagctac
     5281 agaagtggaa atacaatgag ttttaccatg ttaaaacttc atgcagccaa ggtcctgctt
     5341 atgtctgtaa tatcacaaat ctacaacctt atacttcata taatgtcaga gtagtggtgg
     5401 tttataagac gggagaaaat agcacctcac ttccagaaag ctttaagaca aaagctggag
     5461 tcccaaataa accaggcatt cccaaattac tagaagggag taaaaattca atacagtggg
     5521 agaaagctga agataatgga tgtagaatta catactatat ccttgagata agaaagagca
     5581 cttcaaataa tttacagaac cagaatttaa ggtggaagat gacatttaat ggatcctgca
     5641 gtagtgtttg cacatggaag tccaaaaacc tgaaaggaat atttcagttc agagtagtag
     5701 ctgcaaataa tctagggttt ggtgaatata gtggaatcag tgagaatatt atattagttg
     5761 gagatgattt ttggatacca gaaacaagtt tcatacttac tattatagtt ggaatatttc
     5821 tggttgttac aatcccactg acctttgtct ggcatagaag attaaagaat caaaaaagtg
     5881 ccaaggaagg ggtgacagtg cttataaacg aagacaaaga gttggctgag ctgcgaggtc
     5941 tggcagccgg agtaggcctg gctaatgcct gctatgcaat acatactctt ccaacccaag
     6001 aggagattga aaatcttcct gccttccctc gggaaaaact gactctgcgt ctcttgctgg
     6061 gaagtggagc ctttggagaa gtgtatgaag gaacagcagt ggacatctta ggagttggaa
     6121 gtggagaaat caaagtagca gtgaagactt tgaagaaggg ttccacagac caggagaaga
     6181 ttgaattcct gaaggaggca catctgatga gcaaatttaa tcatcccaac attctgaagc
     6241 agcttggagt ttgtctgctg aatgaacccc aatacattat cctggaactg atggagggag
     6301 gagaccttct tacttatttg cgtaaagccc ggatggcaac gttttatggt cctttactca
     6361 ccttggttga ccttgtagac ctgtgtgtag atatttcaaa aggctgtgtc tacttggaac
     6421 ggatgcattt cattcacagg gatctggcag ctcgaaattg ccttgtttcc gtgaaagact
     6481 ataccagtcc acggatagtg aagattggag actttggact cgccagagac atctataaaa
     6541 atgattacta tagaaagaga ggggaaggcc tgctcccagt tcggtggatg gctccagaaa
     6601 gtttgatgga tggaatcttc actactcaat ctgatgtatg gtcttttgga attctgattt
     6661 gggagatttt aactcttggt catcagcctt atccagctca ttccaacctt gatgtgttaa
     6721 actatgtgca aacaggaggg agactggagc caccaagaaa ttgtcctgat gatctgtgga
     6781 atttaatgac ccagtgctgg gctcaagaac ccgaccaaag acctactttt catagaattc
     6841 agaaccaact tcagttattc agaaattttt tcttaaatag catttatcag tgcagagatg
     6901 aagcaaacaa cagtggagtc ataaatgaaa gctttgaagg tgaagatggc gatgtgattt
     6961 gtttgaattc agatgacatt atgccagttg ttttaatgga aacgaagaac cgagaagggt
     7021 taaactatat ggtacttgct acagaatgtg gccaaggtga agaaaagtct gagggtcctc
     7081 taggctccca ggaatctgaa tcttgtggtc tgaggaaaga agagaaggaa ccacatgcag
     7141 acaaagattt ctgccaagaa aaacaagtgg cttactgccc ttctggcaag cctgaaggcc
     7201 tgaactatgc ctgtctcact cacagtggat atggagatgg gtctgattaa tagcgttgtt
     7261 tgggaaatag agagttgaga taaacactct cattcagtag ttactgaaag aaaactctgc
     7321 tagaatgata aatgtcatgg tggtctataa ctccaaataa acaatgcaac gttcc
//
LOCUS       MBOPIVML      969 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  M.lacunata pilin gene inverting protein (PivML) gene, complete cds.
ACCESSION   M34367
KEYWORDS    pilin gene inverting protein.
SOURCE      M.lacunata (ATCC 17956) DNA, clone pMxL1.
  ORGANISM  Moraxella lacunata
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 969)
  AUTHORS   Marrs,C.F., Rozsa,F.W., Hackel,M., Stevens,S.P. and Glasgow,A.C.
  TITLE     Identification, cloning and sequencing of Piv, a new gene involved
            in inverting the pilin gene of M.lacunata
  JOURNAL   J. Bacteriol. 172, 4370-4377 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.F.Marrs, 17-MAY-1990.
FEATURES       from  to/span     description
    pept          1      969     pilin gene inverting protein (PivML)
BASE COUNT      318 a    236 c    165 g    250 t
ORIGIN      
        1 atgtctaaaa cttacattgg gattgatatc gccaaaaaca catttgatgc ctgctttatt
       61 acccataaca cttggcaaaa ctgtacattc acaaataacc aacagggttt tatagagcta
      121 acattgtgga ttcaagcaca tcattataac acatctacct tacatttgat tattgaggca
      181 acgggggctt attgggaaaa attagctcat tgggcaatct ctcatcatca taaggtaagt
      241 attgtcaacc ccttatatat tcatgcatac gccaaaagcc taggcatacg caccaagaca
      301 gataagcagg atgccattct cttggcacgc tatggtgcaa aagaaaatcc acctttatgg
      361 cagcctaaat cagacaacga aataaaactg actgcccttc taaaacaacg tgaacaccac
      421 aaacggcaac tcatcaaaga acgcactcga caggaagctc tgtctattta tgtgaaatcg
      481 tatactgatg acaacattcg tcattggagt gacagcatca cacagctcga tcatcaaatt
      541 tggcaactca tcaattgcac gcccgagcta aattaccgtg ccagcctact tgccacaata
      601 cctggaattg gcaaaaagac actgccacat ctactggttg tcattggaga tggttcatca
      661 tttcaaagtg ccaaacatct agcaagctat gctgggcttg cccctcgtca tcatcaatca
      721 ggcattagta ttcacaagca gtcttccatt ggatttagtg gacaaaaaga actacgttca
      781 gcccttttta tgcccgctgt catagtcagc tttggtcgat atcccgcttt tcaaaagttc
      841 gtcaaacgca tggaacaaaa aggcaaaacc aaaaaacaaa ttattatcgc catcatgcgc
      901 aaattactca ccatcagcta tgccgttatt cgccaaaatc gcccttttga caaacgcatt
      961 catgaataa
//
LOCUS       STYMKAD       915 bp ds-DNA             BCT       02-AUG-1990
DEFINITION  Plasmid pEX102 mouse killing agent D (mkaD) gene, complete cds.
ACCESSION   M34355
KEYWORDS    mouse killing agent D.
SOURCE      Plasmid pEX102 (from Salmonella typhimurium, strain TML R66) DNA.
  ORGANISM  Plasmid pEX102
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 915)
  AUTHORS   Taira,S. and Rhen,M.
  TITLE     Nucleotide sequence of mkaD, a virulence associated gene of
            Salmonella typhimurium containing variable and constant region
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.P.Taira, 16-MAY-1990.
FEATURES       from  to/span     description
    pept        132      857     mouse killing agent D (mkaD)
BASE COUNT      273 a    181 c    213 g    248 t
ORIGIN      
        1 ggaccgaaat atcatttttt cgcctgataa aaaatgaggt tttctggata actaatcgtt
       61 ttattaaaaa aaaactgaga atttatatct aataatatgg cgatatatcc atatcgcaaa
      121 ggagatttcc catgcccata aataggccta atctaaatct aaacatccct cctttgaata
      181 ttgtagctgc ttatgatggg gcggaaatac catctacaaa taagcacctg aaaaataatt
      241 tcaactcctt gcacaaccaa atgcggaaga tgccggtatc ccactttaaa gaggcgctgg
      301 atgtgcctga ctattcaggg atgcgccaga gtggtttctt tgctatgagc caaggttttc
      361 agctgaataa ccatggttac gatgttttca tccatgctcg tcgagaatca cctcagtctc
      421 agggcaaatt tgccggtgac aagttccaca tcagtgtgct cagggatatg gtgccacaag
      481 catttcaagc gctgtccgga ttgctgtttt cagaggacag tccggtagat aagtggaaag
      541 tgaccgatat ggagaaggtc gttcaacaag cccgtgttag cctgggcgct cagttcacgt
      601 tgtatataaa accagaccag gaaaattcgc agtacagtgc gtcgtttctc cacaagacac
      661 ggcaatttat agagtgtctg gaatccagac tatccgaaaa tggggttatt tcaggacagt
      721 gtcctgagtc agacgttcat cctgaaaatt ggaaatatct cagttatcgt aatgaactac
      781 gaagtgggcg tgatggtggc gaaatgcaga gacaggcttt acgtgaggaa ccgttttatc
      841 gtttgatgac agagtaagta tgggtttggg gagcaacgga acagtaaacg ccgttaaaca
      901 gctattttaa atgct
//
LOCUS       TOBEXT       3994 bp ds-DNA             PLN       02-AUG-1990
DEFINITION  N.plumbaginifolia extensin (ext) gene, complete cds.
ACCESSION   M34371
KEYWORDS    cell wall protein; extensin.
SOURCE      N.plumbaginifolia leaf DNA.
  ORGANISM  Nicotiana plumbaginifolia
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 3994)
  AUTHORS   De Loose,M.R.R., Gheyson,G., Tire,C., Gielen,J., Villaroel,R.,
            Genetello,C., Van Montagu,M., Depicker,A. and Inze,D.
  TITLE     The extensin signal peptide allows secretion of heterologous
            proteins from protoplasts
  JOURNAL   Unpublished (1990)
  STANDARD  unannotated staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.R.De Loose, 18-MAY-1990.
            
               Author address: M.R.R.De Loose
               Laboratorium voor Genetika, Wetenschappen,
               K.L.Ledeganckstraat, Gent,
               Oost-Vlaanderen, BELGIUM
FEATURES       from  to/span     description
    pept       1590     2840     extensin precursor
    sigp       1590     1652     extensin signal peptide (could end at 1667)
    matp       1653     2837     extensin (could start at 1668)
    mRNA       1560     3342     extensin mRNA
    rpt        1740     1824     tandem 85 bp repeats
    rpt        1740     1758     19 bp repeats
    signal     1532     1537     TATA box
    signal     3323     3329     poly-A signal
BASE COUNT     1264 a    992 c    550 g   1188 t
ORIGIN      
        1 cgcccgggga tcctcctgga tgccattttt cctttgcttt tggatacctt ttcgcaactt
       61 tagtccattg cttgaacatg gttttttcta cctcttgtct gttcccatga tgataaacta
      121 tgatagctaa cattttcagt aatattggat tgaattagat atactttgaa ttattgagct
      181 acaaaattcg ttgaatattt tgccctatcc cccacccaaa agaaaataat tcataccttt
      241 atttggtaat aaaagtgggt cacatgcaca gtttttctcg tcttctctag attaaactct
      301 ttggaaatga cctactgaaa atactacaca taaaattccc cactcccctc caagttgggg
      361 gagggtgggg gttggtgggt tttttttccc cttgaatcca ttcagccggc ttatgcctcc
      421 cgaggtggtg ggaggtttaa tttgaaccct taccctatag ttaatatcga aatagataat
      481 acaaaggagg ggaacataaa accaaaacct ccgacaaaac atcaaagtgt aggagttcat
      541 gatatacgct tgattgaaag tgcacaatgg tctttgcata ctatcaaagt atcaattggt
      601 tcttgaatta tatctcttaa tataatgtat tgtgtttaat ttctctacta tcccctattt
      661 tataagacta aaagatcctg acatgcttct tgaacacatg tgaaggttag ttaactggtc
      721 agaagtacac aagaattaat ttgtacacct atgtttctca ccagaatgcc cataaattac
      781 atcaattctt caacatcaag agatcgaaaa acttaacaat tgttctaata gttgaactga
      841 gtcgtccgat ccatatccga ttcttcacta gaagcattaa tcatatagag caatttaact
      901 taatttacta tattggtaat catttacata gttaagttat aaaaccttgg agcgacacaa
      961 ggatttacac taatcatgat tgaatattaa ctttactcaa tttatcaatg tctgatacaa
     1021 gtgaattaat ttctctcttt gtgatttcgg tagaaaatta atgtcaagtt tcaaagtttt
     1081 tctttttgaa attagtcata catgtgaata gaacattaat ttaagttaaa gaatcatata
     1141 aaaaagttag ccgctttgat ttctgtaact gaaatcgtgc agagatgagg catccacact
     1201 tgtttttcaa agcttcagta ctatttaaaa acaaagacag taaaaaggag actttttctt
     1261 tgacattatt gcatcagaaa tagtatagct gccataatag tttattcctt tgcttagctt
     1321 gcagcctcta tcaaacaaaa aaatacacca gctcaagtca atttgagccg acaacatgac
     1381 aaaaccaaat caaatatgca tttttttttt tttttttttt tttttttttt tttttttttt
     1441 tttacacttt ggtaggtgtt aagtaatcta gtgagacttt taccttcatt tatgaaaatc
     1501 ttgaaaaggg taattgtcta attgaaagct atataaaggg gtcgaagtga agcttaagag
     1561 gacaacaact tttctcattt gtttcaaaga tgggaaaaat ggcttctcta tttgccacat
     1621 ttttagtggt tttagtgtca cttagcttag cttctgaaag ctcagcaaat tatcaatact
     1681 catctccacc accacctaag aagccatacc acccttcacc aacaccatat tatcctgcac
     1741 cagtttataa atctccacca ccaccaattc cggtttataa gtcaccacca ccacccaaga
     1801 agccatacta ccctccacac accccagttt acaaatctcc accaccacca actccagttt
     1861 acaagtcgcc accaccaccc aagaagccac actaccctcc acacacaccc gtttacaagt
     1921 ctccaccacc acctactcca gtttacaagt caccaccatc acccaagaag ccacactatc
     1981 ctccacacac tccggtttac aagtcgccac caccaccaac tcccgtttac aagtcgccac
     2041 caccacccaa gaagccacac tatccaccac acaccccggt ttacaagtcg ccaccaccac
     2101 caactcccgt ttacaagtcg ccaccaccac caaagaagcc acactatcct ccacacacac
     2161 cagtttacaa gtcaccacca ccacccaaga agccatacta tccaccacac accccagttt
     2221 acaagtcgcc accaccacca actccagttt acaagtcacc accaccacct aagaagccat
     2281 accacccttc accaacaccg tatcatcctt caccagttta caagtcccct ccaccaccaa
     2341 ctccagttta caagtcacca ccaccaccta agaagccata ccacccttca ccaacaccgt
     2401 atcatccatc accagtttac aagtcccctc caccaccaac tccagtttac aagtcaccac
     2461 caccacctaa gaagccatac cacccttcac caacaccgta tcatccatca ccagtttaca
     2521 agtcccctcc accaccaact ccagtttaca agtcaccacc accacctaag aagccatacc
     2581 acccttcacc aacaccgtat catcctgcac cagtttacaa gtccccacca ccaccaactc
     2641 cagtttacaa gtcaccaccg ccaccagtga agccatacca tccttcacca acaccatacc
     2701 atcctgcacc agtttataag tctccaccac caccaactcc tgtttacaag tcacctccac
     2761 caccaactcc agtctacaag tctccaccac cacaccaccc ctatgtttac gcttctcctc
     2821 ctcctcccta ccattactaa gaagtgacat cacaaagctg aaggtaatta aggcaaatta
     2881 attaaaacat tttcaaaatg atatgtcagg aatgcaaaga aaattgcata actttgttta
     2941 tgcaaaacat taatataata taaagaactg cgctctctgc ttattaacta tttttttagc
     3001 ttcatttatt agttactagc tgttcagaat tttcagtatc ttttgatatt acaaagaacc
     3061 taatcagaca atgtatattc ttatgcagga aaagcagaat gttgagctaa aagaaaggct
     3121 ttttccattt tcgagaggca acgagaagaa aagaagaaga aaaaatagag taaataataa
     3181 gccccacggg aggcgaagtt cttttgtagc ttcatgttgt ctaagctatt gatattgttt
     3241 gtaccctata ttttatttct ctgtctttgt gtatgttttg ttcaatttcg atctccttgc
     3301 aaaatgcaga gattatgagc tgaataaagt aagttaatat tattatacgt gttaattttc
     3361 tactcctctc tctagccagc cttttgtttt ctctttttct tatttcattt tctttatatc
     3421 aaaccatttt aggagagggc caaggagtga tccagtaaaa cacgaagatt agaagaaact
     3481 tcccttcttt ttttttatct gaaaaaaatt taaagtcgag atttatctct ttttgtaatg
     3541 gaatcatttc cacagttatg actattttca aattgatatc gccagaccta acaaattaaa
     3601 ttactccttc cgctcatctg gccgtacact tctgctccat ctggccgtac agttctgctc
     3661 catctggcag tgtacggaga atggataact taatattcag tatgaatttg gacataaaat
     3721 cttctaagat tgctagtgaa ttttgtgagc gttattgtgt taaaacaaaa gacagaaaat
     3781 tattagaaga attaaagaaa tatgaaaata gtatagaatc agaaatattt cgagtccaca
     3841 agttttttgt gtgttcttaa ggaatgtttt aacccctcac acgttgccaa ggtaatggat
     3901 taaatcctcc caagataaaa cggaataaac cttcctgcaa cagtggcaat acaaactgca
     3961 ggtcgactct agggatcccc gggcgagcga attc
//

GenBank-Updates@genbank.bio.net (08/03/90)

LOCUS       CLLRRE       1861 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  C.sapidus 18S rRNA.
ACCESSION   M34360
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      C.sapidus rRNA.
  ORGANISM  Callinectes sapidus
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; 
            Brachyrhyncha; Portunoidea; Portunidae.
REFERENCE   1  (bases 1 to 1861)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustaceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1860     18S ribosomal RNA
BASE COUNT      359 a    375 c    421 g    367 t    339 others
ORIGIN      
        1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct
       61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt
      121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta
      181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc
      241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc
      301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc
      361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac
      421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac
      481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat
      541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct
      601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg
      661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc
      721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata
      841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc
      901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc
      961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg
     1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt
     1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt
     1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn
     1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac
     1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg
     1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn
     1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct
     1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc
     1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga
     1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag
     1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc
     1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc
     1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn
     1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1861 n
//
LOCUS       HUMNCSRC      138 bp ds-DNA             PRI       03-AUG-1990
DEFINITION  Human membrane-associated tyrosine protein kinase (C-SRC) gene,
            exons 3, 4, NI, and NII, partial cds.
ACCESSION   M34469
KEYWORDS    membrane-associated tryosine protein kinase.
SOURCE      Human adult brain DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 138)
  AUTHORS   Pyper,J.M. and Bolen,J.B.
  TITLE     Identification of a novel neuronal C-SRC exon expressed in human
            brain
  JOURNAL   Mol. Cell. Biol. 10, 2035-2040 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       29     membrane-associated tyrosine protein kinase
                                 (C-SRC), exon 3 (AA at 1)
                 30       47     membrane-associated tyrosine protein kinase,
                                 exon NI
                 48       80     membrane-associated tyrosine protein kinase,
                                 exon NII
                 81  >   136     membrane-associated tyrosine protein kinase,
                                 exon 4
    variant     117      117     a or g
    variant     135      135     c or t
BASE COUNT       36 a     40 c     40 g     22 t
ORIGIN      
        1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc
       61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga
      121 cagacaggct acatcccc
//
LOCUS       MHVAPEPA     1000 bp ss-RNA             VRL       03-AUG-1990
DEFINITION  Murine coronavirus peplomer (S) protein gene.
ACCESSION   M34435
KEYWORDS    peplomer protein.
SOURCE      Murine (strain JHM-DL) RNA.
  ORGANISM  Murine hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 1000)
  AUTHORS   Banner,L.R., Keck,J.G. and Lai,M.M.C.
  TITLE     A clustering of RNA recombination sites adjacent to a hypervariable
            region of the peplomer gene of Murine coronavirus
  JOURNAL   Virology 175, 548-555 (1990)
  STANDARD  simple staff_review
BASE COUNT      258 a    192 c    227 g    317 t      6 others
ORIGIN      
        1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag
       61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa
      121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct
      181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa
      241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta
      301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt
      361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca
      421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac
      481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg
      541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga
      601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg
      661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt
      721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt
      781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa
      841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa
      901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag
      961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa
//
LOCUS       MHVAPEPB      843 bp ss-RNA             VRL       03-AUG-1990
DEFINITION  Murine coronavirus peplomer (S) protein gene.
ACCESSION   M34436
KEYWORDS    peplomer protein.
SOURCE      Murine (strain A59) RNA.
  ORGANISM  Murine hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 843)
  AUTHORS   Banner,L.R., Keck,J.G. and Lai,M.M.C.
  TITLE     A clustering of RNA recombination sites adjacent to a hypervariable
            region of the peplomer gene of Murine coronavirus
  JOURNAL   Virology 175, 548-555 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site        400      401     deletion site
BASE COUNT      227 a    153 c    183 g    280 t
ORIGIN      
        1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag
       61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa
      121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct
      181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa
      241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta
      301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt
      361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt
      421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc
      481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc
      541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat
      601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt
      661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc
      721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact
      781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc
      841 gaa
//
LOCUS       MHVSP        3780 bp ss-mRNA            VRL       03-AUG-1990
DEFINITION  Mouse hepatitis virus surface protein S mRNA, complete cds.
ACCESSION   X04797 M34437
KEYWORDS    glycoprotein; membrane glycoprotein; surface glycoprotein;
            surface projection glycoprotein.
SOURCE      Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones
            pJMS1010, pJS112, and pJS92.
  ORGANISM  Murine hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 3780)
  AUTHORS   Schmidt,I., Skinner,M. and Siddell,S.
  TITLE     Nucleotide Sequence of the Gene Encoding the Surface Projection
            Glycoprotein of Coronavirus MHV-JHM
  JOURNAL   J. Gen. Virol. 68, 47-56 (1987)
  STANDARD  simple automatic
COMMENT        EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE         24     30       put. region of homology preceeding
                                            mRNA 5' initiation  codons
            
               SITE        121    129       pot. N-glycosylation site
               SITE        208    216       pot. N-glycosylation site
               SITE        430    438       pot. N-glycosylation site
               SITE        604    612       pot. N-glycosylation site
               SITE       1099   1107       pot. N-glycosylation site
               SITE       1333   1341       pot. N-glycosylation site
               SITE       1636   1644       pot. N-glycosylation site
               SITE       1732   1740       pot. N-glycosylation site
               SITE       1756   1764       pot. N-glycosylation site
               SITE       1825   1833       pot. N-glycosylation site
               SITE       1900   1914       put. proteolytic cleavage site
               SITE       1972   1980       pot. N-glycosylation site
               SITE       2023   2031       pot. N-glycosylation site
               SITE       2293   2301       pot. N-glycosylation site
               SITE       2440   2448       pot. N-glycosylation site
               SITE       3139   3147       pot. N-glycosylation site
               SITE       3301   3309       pot. N-glycosylation site
               SITE       3331   3339       pot. N-glycosylation site
               SITE       3388   3396       pot. N-glycosylation site
               SITE       3436   3444       pot. N-glycosylation site
               SITE       3499   3507       pot. N-glycosylation site
               SITE       3553   3654       pot. transmembrane domain
               SITE       3622   3675       cysteine-rich region
               SITE       3655   3660       charge cluster
               SITE       3715   3723       pot. N-glycosylation site
FEATURES       from  to/span     description
    pept         31     3738     surface protein S precursor
    sigp         31       60     surface protein S signal peptide
    matp         61     3736     surface protein S
BASE COUNT     1030 a    718 c    800 g   1232 t
ORIGIN      
        1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc
       61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat
      121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact
      181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct
      241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg
      301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac
      361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt
      421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct
      481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat
      541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag
      601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt
      661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg
      721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct
      781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt
      841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt
      901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt
      961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa
     1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact
     1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg
     1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt
     1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt
     1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt
     1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat
     1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat
     1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt
     1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct
     1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc
     1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct
     1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt
     1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat
     1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc
     1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt
     1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc
     1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat
     2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt
     2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat
     2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct
     2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct
     2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac
     2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag
     2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa
     2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg
     2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct
     2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta
     2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac
     2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag
     2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg
     2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca
     2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc
     2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc
     3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata
     3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca
     3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg
     3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat
     3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg
     3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt
     3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc
     3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag
     3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa
     3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt
     3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa
     3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt
     3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt
//
LOCUS       MUSIGLAZ      713 bp ss-mRNA            ROD       03-AUG-1990
DEFINITION  Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds.
ACCESSION   M34598 M29013 J03562
KEYWORDS    constant region; immunoglobulin; immunoglobulin light chain;
            joining exon; lambda-immunoglobulin; processed gene;
            variable region.
SOURCE      Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone
            Y31.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 713)
  AUTHORS   Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A.
  TITLE     Structure of a third murine immunoglobulin lambda light chain
            variable region that is expressed in laboratory mice
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 2 to 450)
  AUTHORS   Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
  TITLE     Mouse V-lambda-x gene sequence generates no junctonal diversity and
            is conserved in mammalian species
  JOURNAL   J. Immunol. 144, 2816-2820 (1990)
  STANDARD  simple staff_review
REFERENCE   3  (bases 266 to 429)
  AUTHORS   Sanchez,P. and Cazenave,P.-A.
  TITLE     A new variable region in mouse immunoglobulin lambda light chains
  JOURNAL   J. Exp. Med. 166, 265-270 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         12  >   713     Ig lambda chain precursor V-x,J-2,C-2 region
    sigp         12       68     Ig lambda chain signal peptide
    matp         69  >   713     Ig lambda chain
    recomb      379      380     V-region end/J2-region start
    recomb      414      415     J2-region end/C2-region start
BASE COUNT      175 a    190 c    159 g    189 t
ORIGIN      
        1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt
       61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag
      121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc
      181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca
      241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta
      301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa
      361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc
      421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca
      481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga
      541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca
      601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca
      661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct
//
LOCUS       MUSIGLVD      681 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds.
ACCESSION   M34597
KEYWORDS    constant region; germline; immunoglobulin light chain;
            joining exon; lambda-immunoglobulin; variable region.
SOURCE      Mouse (strain Balb/c AnPt) liver DNA, clone 30X2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 681)
  AUTHORS   Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A.
  TITLE     Mouse V-lambda-x gene sequence generates no junctonal diversity and
            is conserved in mammalian species
  JOURNAL   J. Immunol. 144, 2816-2820 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         59      104     Ig lambda-chain precursor Vx-J2-C2 region,
                                 exon 1
                209      531     Ig lambda-chain precursor Vx-J2-C2 region,
                                 exon 2
    sigp         59      104     Ig lambda-chain signal peptide
                209      219     Ig lambda-chain signal peptide
    matp        220      528     Ig lambda-chain
    IVS         105      208     Ig lambda-chain Vx-J2-C2 region intron A
    recomb      530      531     Vx-region end/J2-region start
    recomb      565      566     J2-region end/C2-region start
BASE COUNT      179 a    156 c    136 g    210 t
ORIGIN      Chromosome 16.
        1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat
       61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt
      121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc
      181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca
      241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag
      301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg
      361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct
      421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa
      481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa
      541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta
      601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa
      661 tcctcttgga cttcgtaggg c
//
LOCUS       PAERRE       1877 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  P.kadiakensis 18S rRNA.
ACCESSION   M34359
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.kadiakensis rRNA.
  ORGANISM  Palaemonetes kadiakensis
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Caridea; 
            Palaemonoidea; Palaemonidae.
REFERENCE   1  (bases 1 to 1877)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustaceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1877     18S ribosomal RNA
BASE COUNT      339 a    321 c    399 g    313 t    505 others
ORIGIN      
        1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt
       61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt
      121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac
      181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg
      241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca
      301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct
      361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna
      421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga
      481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac
      541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn
      601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac
      841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna
      901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta
      961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta
     1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa
     1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt
     1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn
     1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc
     1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt
     1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn
     1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn
     1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg
     1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna
     1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg
     1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca
     1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact
     1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag
     1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn
     1861 nnnnnnnnnn nnnnnnn
//
LOCUS       PBESVBRA      584 bp ds-DNA             INV       03-AUG-1990
DEFINITION  P.berghei telomeric repeat region subfragment alpha DNA.
ACCESSION   M34601
KEYWORDS    .
SOURCE      P.berghei DNA, clone pTel.1.
  ORGANISM  Plasmodium berghei
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 584)
  AUTHORS   Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
  TITLE     Organization of subtelomeric repeats in Plasmodium berghei
  JOURNAL   Mol. Cell. Biol. 10, 2423-2427 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt         461      541     27 bp repeats
BASE COUNT      205 a     41 c     85 g    251 t      2 others
ORIGIN      
        1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac
       61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc
      121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata
      181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat
      241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat
      301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa
      361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt
      421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg
      481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg
      541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga
//
LOCUS       PBESVBRB      593 bp ds-DNA             INV       03-AUG-1990
DEFINITION  P.berghei telomeric repeat region subfragment a DNA.
ACCESSION   M34602
KEYWORDS    .
SOURCE      P.berghei DNA, clone pTel.1.
  ORGANISM  Plasmodium berghei
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 593)
  AUTHORS   Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C.
  TITLE     Organization of subtelomeric repeats in Plasmodium berghei
  JOURNAL   Mol. Cell. Biol. 10, 2423-2427 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt         484      510     27 bp repeat motif
BASE COUNT      209 a     40 c     97 g    247 t
ORIGIN      
        1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca
       61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac
      121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa
      181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt
      241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat
      301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat
      361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat
      421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg
      481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt
      541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg
//
LOCUS       PEURRE       1902 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  P.aztecus 18S rRNA.
ACCESSION   M34362
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.aztecus rRNA.
  ORGANISM  Penaeus aztecus
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea; 
            Penaeidae.
REFERENCE   1  (bases 1 to 1902)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustaceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1902     18S ribosomal RNA
BASE COUNT      340 a    369 c    427 g    332 t    434 others
ORIGIN      
        1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt
       61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt
      121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg
      181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc
      241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg
      301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc
      361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn
      421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag
      481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga
      541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga
      601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg
      661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc
      721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn
      781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn
      841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg
      901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg
      961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna
     1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn
     1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg
     1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa
     1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn
     1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg
     1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac
     1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta
     1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct
     1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg
     1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact
     1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg
     1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt
     1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn
//
LOCUS       POCRRE       1874 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  P.ascensionis 18S rRNA.
ACCESSION   M34358
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.ascensionis rRNA.
  ORGANISM  Procaris ascensionis
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Caridea; 
            Procaridoidea; Procarididae.
REFERENCE   1  (bases 1 to 1874)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustaceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1874     18S ribosomal RNA
BASE COUNT      298 a    291 c    331 g    266 t    688 others
ORIGIN      
        1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct
       61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt
      121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac
      181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt
      241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac
      301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn
      361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc
      481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa
      541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn
      601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt
      781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg
      841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat
      901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat
      961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga
     1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac
     1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt
     1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn
     1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca
     1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt
     1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta
     1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct
     1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag
     1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca
     1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn
     1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg
     1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg
     1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1861 nnnnnnnnnn nnnn
//
LOCUS       PRARRE       1869 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  P.leonensis 18S rRNA.
ACCESSION   M34363
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      P.leonensis rRNA.
  ORGANISM  Procambarus leonensis
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea; 
            Astacoidea; Cambaridae.
REFERENCE   1  (bases 1 to 1869)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustaceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1869     18S ribosomal RNA
BASE COUNT      409 a    412 c    459 g    419 t    170 others
ORIGIN      
        1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt
       61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt
      121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat
      181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt
      241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc
      301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca
      361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg
      421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt
      481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact
      541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt
      601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag
      661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc
      721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn
      781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa
      841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg
      901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac
      961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg
     1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg
     1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg
     1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn
     1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa
     1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta
     1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact
     1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg
     1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg
     1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac
     1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa
     1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn
     1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt
     1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa
     1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn
     1861 nnnnnnnnn
//
LOCUS       PVIC1RPTA     711 bp ds-DNA             INV       03-AUG-1990
DEFINITION  P.vivax circumsporozoite protein gene, partial cds.
ACCESSION   M28745 M25758
KEYWORDS    circumsporozoite protein.
SOURCE      P.vivax sporozoite (isolate VK247) sporozoite DNA.
  ORGANISM  Plasmodium vivax
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 711)
  AUTHORS   Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
            Waters,A.P. and Prasittisuk,C.
  TITLE     Circumsporozoite protein heterogeneity in the human malaria
            parasite Plasmodium vivax
  JOURNAL   Science 245, 973-976 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Lanar, 25-JUN-1989.
FEATURES       from  to/span     description
    pept    <     1  >   711     circumsporozoite protein (AA at 1)
    site          1       15     region 1
    site        697      711     region 2
    site        502      597     post repeat variable region
    site        598      696     post repeat constant region
BASE COUNT      261 a    151 c    226 g     73 t
ORIGIN      
        1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat
       61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca
      121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca
      181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga
      241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc
      301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa
      361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga
      421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat
      481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca
      541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga
      601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta
      661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c
//
LOCUS       PVIC1RPTB     657 bp ds-DNA             INV       03-AUG-1990
DEFINITION  P.vivax circumsporozoite protein gene, partial cds.
ACCESSION   M28746 M25759
KEYWORDS    circumsporozoite protein.
SOURCE      P.vivax sporozoite (isolate VK210) DNA.
  ORGANISM  Plasmodium vivax
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 657)
  AUTHORS   Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T.,
            Waters,A.P. and Prasittisuk,C.
  TITLE     Circumsporozoite protein heterogeneity in the human malaria
            parasite Plasmodium vivax
  JOURNAL   Science 245, 973-976 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Lanar, 25-JUN-1989.
FEATURES       from  to/span     description
    pept    <     1  >   657     circumsporozoite protein (AA at 1)
    site          1       15     region 1
    site        643      657     region 2
    rpt          16      477     27 bp repeats
    site        478      544     3' post repeat variable region
    site        545      642     3' post repeat constant region
BASE COUNT      229 a    135 c    224 g     69 t
ORIGIN      
        1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat
       61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa
      121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca
      181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat
      241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca
      301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga
      361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca
      421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga
      481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca
      541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa
      601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc
//
LOCUS       SHV2A         554 bp ss-RNA             VRL       03-AUG-1990
DEFINITION  Simian hepatitis A virus segment 2A-encoded protein mRNA,
            partial cds.
ACCESSION   M34085
KEYWORDS    .
SOURCE      Simian hepatitis A virus (strain PA21), cDNA to viral RNA.
  ORGANISM  Simian hepatitis A virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Picornaviridae.
REFERENCE   1  (bases 1 to 554)
  AUTHORS   Brown,E.A., Jansen,R.W. and Lemon,S.M.
  TITLE     Characterization of a Simian hepatitis A virus (HAV): Antigenic and
            genetic comparison with human HAV
  JOURNAL   Unpublished (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.A.Brown, 04-MAY-1990.
            
              Author address: E.A.Brown
              547 Burnett-Womack
              CB# 7030
              Chapel Hill, NC 27599
            
FEATURES       from  to/span     description
    pept    <     1  >   554     segment 2A-encoded protein (AA at 1)
BASE COUNT      184 a     71 c    138 g    161 t
ORIGIN      Segment 2A; map position 3108-3662.
        1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg
       61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag
      121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa
      181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg
      241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg
      301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg
      361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat
      421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt
      481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg
      541 tcaggaatac agga
//
LOCUS       SHVVP1CP     2373 bp ss-RNA             VRL       03-AUG-1990
DEFINITION  Simian hepatitis A virus capsid protein VP1 mRNA, partial cds.
ACCESSION   M34084
KEYWORDS    capsid protein VP1.
SOURCE      Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed
            in cwll line BS-C-1.
  ORGANISM  Simian hepatitis A virus
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Picornaviridae.
REFERENCE   1  (bases 1 to 2373)
  AUTHORS   Brown,E.A., Jansen,R.W. and Lemon,S.M.
  TITLE     Characterization of a Simian hepatitis A virus (HAV): Antigenic and
            genetic comparison with human HAV
  JOURNAL   J. Virol. 63, 4932-4937 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.A.Brown, 04-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1  >  2373     capsid protein VP1 (AA at 1)
BASE COUNT      661 a    447 c    491 g    774 t
ORIGIN      Segment P1; map position 735-3107.
        1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg
       61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact
      121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca
      181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga
      241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa
      301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg
      361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca
      421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct
      481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc
      541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat
      601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct
      661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca
      721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt
      781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat
      841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca
      901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag
      961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa
     1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg
     1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt
     1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc
     1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg
     1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag
     1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat
     1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct
     1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct
     1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag
     1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta
     1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca
     1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg
     1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt
     1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg
     1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg
     1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca
     1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg
     2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat
     2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc
     2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg
     2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg
     2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat
     2341 gatcctcgaa ctgaagagga tcgtaaattt gaa
//
LOCUS       STNRRE       1885 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  S.hispidus 18S rRNA.
ACCESSION   M34361
KEYWORDS    18S ribosomal RNA; ribosomal RNA.
SOURCE      S.hispidus rRNA.
  ORGANISM  Stenopus hispidus
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea; 
            Stenopodidae.
REFERENCE   1  (bases 1 to 1885)
  AUTHORS   Kim,W. and Abele,L.G.
  TITLE     Molecular phylogeny of selected decapod crustaceans based on 18S
            rRNA nucleotide sequences
  JOURNAL   J. Crust. Biol. 10, 1-13 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.G.Abele, 19-MAY-1990.
FEATURES       from  to/span     description
    rRNA          1     1885     18S ribosomal RNA
BASE COUNT      327 a    336 c    430 g    368 t    424 others
ORIGIN      
        1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt
       61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact
      121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn
      181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta
      241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag
      301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt
      361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga
      421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn
      481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag
      541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag
      601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn
      661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc
      721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn
      781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg
      841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg
      901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg
      961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt
     1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt
     1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg
     1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat
     1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg
     1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt
     1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga
     1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng
     1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt
     1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc
     1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg
     1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg
     1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn
     1861 nnnnnnnnnn nnnnnnnnnn nnnnn
//
LOCUS       SV4CG        5243 bp ds-DNA   circular  VRL       03-AUG-1990
DEFINITION  Simian Virus 40 complete genome.
ACCESSION   J02400 J02402 J02406 J02407 J02408 J02409 J02410 V01380 J02403
            J04139 M24914
KEYWORDS    T-antigen; agnoprotein; alternate splicing; antigen;
            auxiliary region; circular; complete genome; core region;
            early leader protein; enhancer; large T-antigen;
            major structural protein VP1; minor structural protein VP2;
            minor structural protein VP3; origin of replication; palindrome;
            small T-antigen; tandem repeat; topoisomerase.
SOURCE      Simian Virus 40 (SV40) DNA, cDNA, mRNA and RNA from wild type
            strains 777 and 776. (When the material is not simply DNA, such is
            indicated on the reference line.).
  ORGANISM  Rhesus macaque polyomavirus
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Polyomaviruses.
REFERENCE   1  (bases 2463 to 2640; mRNA)
  AUTHORS   Dhar,R., Zain,S., Weissman,S.M., Pan,J. and Subramanian,K.
  TITLE     Nucleotide sequences of RNA transcribed in infected cells and by
            Escherichia coli RNA polymerase from a segment of Simian Virus 40
            DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 71, 371-375 (1974)
  STANDARD  full staff_review
REFERENCE   2  (bases 2623 to 2699; mRNA)
  AUTHORS   Dhar,R., Weissman,S.M., Zain,B.S., Pan,J. and Lewis,A.M.Jr.
  TITLE     The nucleotide sequence preceding an RNA polymerase initiation site
            on SV40 DNA. part 2. the sequence of the early strand transcript
  JOURNAL   Nucleic Acids Res. 1, 595-613 (1974)
  STANDARD  full staff_review
REFERENCE   3  (bases 2623 to 2699; mRNA)
  AUTHORS   Dhar,R., Subramanian,K., Zain,B.S., Pan,J. and Weissman,S.M.
  TITLE     Nucleotide sequence about the 3' terminus of SV40 DNA transcripts
            and the region where DNA synthesis is initiated
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 39, 153-160 (1975)
  STANDARD  full staff_review
REFERENCE   4  (sites; cds start for VP1)
  AUTHORS   van de Voorde,A., Contreras,R., Rogiers,R. and Fiers,W.
  TITLE     The initiation region of the SV40 VP1 gene
  JOURNAL   Cell 9, 117-120 (1976)
  STANDARD  full staff_review
REFERENCE   5  (bases 470 to 524)
  AUTHORS   Ysebaert,M., Thys,F., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the restriction fragments Hind L and Hind M
            of SV40 DNA
  JOURNAL   Nucleic Acids Res. 3, 3409-3421 (1976)
  STANDARD  full staff_review
REFERENCE   6  (bases 32 to 236; direct tandem repeats)
  AUTHORS   Subramanian,K.N., Reddy,V.B. and Weissman,S.M.
  TITLE     Occurrence of reiterated sequences in an untranslated region of
            Simian Virus 40 DNA determined by nucleotide sequence analysis
  JOURNAL   Cell 10, 497-507 (1977)
  STANDARD  full staff_review
REFERENCE   7  (bases 4592 to 4661; mRNA and DNA)
  AUTHORS   Thimmappaya,B. and Weissman,S.M.
  TITLE     The early region of SV40 DNA may have more than one gene
  JOURNAL   Cell 11, 837-843 (1977)
  STANDARD  full staff_review
REFERENCE   8  (bases 1046 to 1620; overlap between VP1, VP2 and VP3 genes)
  AUTHORS   Contreras,R., Rogiers,R., van de Voorde,A. and Fiers,W.
  TITLE     Overlapping of the VP2-VP3 gene and the VP1 gene in the SV40 genome
  JOURNAL   Cell 12, 529-538 (1977)
  STANDARD  full staff_review
REFERENCE   9  (bases 1 to 164; 5092 to 5243; origin of replication)
  AUTHORS   Subramanian,K.N., Dhar,R. and Weissman,S.M.
  TITLE     Nucleotide sequence of a fragment of SV40 DNA that contains the
            origin of DNA replication and specifies the 5' ends of "early" and
            "late" viral RNA: III. construction of the total sequence of
            EcorII-G fragment of SV40 DNA
  JOURNAL   J. Biol. Chem. 252, 355-367 (1977)
  STANDARD  full staff_review
REFERENCE   10 (bases 1 to 585; 4857 to 5243)
  AUTHORS   Dhar,R., Subramanian,K.N., Pan,J. and Weissman,S.M.
  TITLE     Structure of a large segment of the genome of Simian Virus 40 that
            does not encode known proteins
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 74, 827-831 (1977)
  STANDARD  full staff_review
REFERENCE   11 (bases 3733 to 4007)
  AUTHORS   Volckaert,G., Contreras,R., Soeda,E., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of Simian Virus 40 Hind H restriction fragment
  JOURNAL   J. Mol. Biol. 110, 467-510 (1977)
  STANDARD  full staff_review
REFERENCE   12 (bases 1427 to 2593; VP1 gene)
  AUTHORS   Pan,J., Reddy,V.B., Thimmappaya,B. and Weissman,S.M.
  TITLE     Nucleotide sequence of the gene for the major structural protein of
            SV40 virus
  JOURNAL   Nucleic Acids Res. 4, 2539-2548 (1977)
  STANDARD  full staff_review
REFERENCE   13 (bases 323 to 565; mRNA)
  AUTHORS   Celma,M.L., Dhar,R., Pan,J. and Weissman,S.M.
  TITLE     Comparison of the nucleotide sequence of the messenger RNA for the
            major structural protein of SV40 with the DNA sequence encoding the
            amino acids of the protein
  JOURNAL   Nucleic Acids Res. 4, 2549-2559 (1977)
  STANDARD  full staff_review
REFERENCE   14 (bases 347 to 472)
  AUTHORS   van Heuverswyn,H., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the SV40 DNA restriction fragment Hind C-Hap
            2
  JOURNAL   Nucleic Acids Res. 4, 1015-1024 (1977)
  STANDARD  full staff_review
REFERENCE   15 (bases 1709 to 1786; RNA and DNA)
  AUTHORS   Contreras,R., Volckaert,G., Thys,F., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the restriction fragment Hind F-Eco RI2 of
            SV40 DNA
  JOURNAL   Nucleic Acids Res. 4, 1001-1014 (1977)
  STANDARD  full staff_review
REFERENCE   16 (bases 1 to 163; 181 to 5243)
  AUTHORS   Reddy,V.B., Thimmappaya,B., Dhar,R., Subramanian,K.N., Zain,S.,
            Pan,J., Ghosh,P.K., Celma,M.L. and Weissman,S.M.
  TITLE     The genome of Simian Virus 40
  JOURNAL   Science 200, 494-502 (1978)
  STANDARD  full staff_review
REFERENCE   17 (bases 1 to 163; 181 to 5243)
  AUTHORS   Fiers,W., Contreras,R., Haegeman,G., Rogiers,R., van de Voorde,A.,
            van Heuverswyn,H., van Herreweghe,J., Volckaert,G. and Ysebaert,M.
  TITLE     Complete nucleotide sequence of SV40 DNA
  JOURNAL   Nature 273, 113-120 (1978)
  STANDARD  full staff_review
REFERENCE   18 (bases 2633 to 2933; RNA and DNA)
  AUTHORS   Thimmappaya,B., Zain,B.S., Dhar,R. and Weissman,S.M.
  TITLE     Nucleotide sequence of DNA template for the 3' ends of SV40 mRNA:
            II. the sequence of the DNA fragment EcorII-F and a part of
            EcorII-H
  JOURNAL   J. Biol. Chem. 253, 1613-1618 (1978)
  STANDARD  full staff_review
REFERENCE   19 (bases 325 to 526; 1463 to 1540;
            sites-late 16S mRNA and RNA splicing)
  AUTHORS   Ghosh,P.K., Reddy,V.B., Swinscoe,J., Choudary,P.V., Lebowitz,P. and
            Weissman,S.M.
  TITLE     The 5'-terminal leader sequence of late 16 S mRNA from cells
            infected with Simian Virus 40
  JOURNAL   J. Biol. Chem. 253, 3643-3647 (1978)
  STANDARD  full staff_review
REFERENCE   20 (bases 322 to 536; 1450 to 1541;
            sites-late 16S mRNA and RNA splicing)
  AUTHORS   Haegeman,G. and Fiers,W.
  TITLE     Evidence for 'splicing' of SV40 16S mRNA
  JOURNAL   Nature 273, 70-73 (1978)
  STANDARD  full staff_review
REFERENCE   21 (bases 4568 to 5175; small t antigen gene)
  AUTHORS   Volckaert,G., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the Simian Virus 40 small-t gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75, 2160-2164 (1978)
  STANDARD  full staff_review
REFERENCE   22 (bases 5109 to 5243; 1 to 63; RNA)
  AUTHORS   Woodworth-Gutai,M. and Nathans,D.
  TITLE     Evolutionary variants of Simian Virus 40: nucleotide sequence of a
            conserved SV40 DNA segment containing the origin of viral DNA
            replication as an inverted repetition
  JOURNAL   J. Mol. Biol. 126, 259-274 (1978)
  STANDARD  full staff_review
REFERENCE   23 (bases 1 to 74; 5171 to 5243; T antigen binding sites)
  AUTHORS   Tjian,R.
  TITLE     The binding site on SV40 DNA for a T antigen-related protein
  JOURNAL   Cell 13, 165-179 (1978)
  STANDARD  full staff_review
REFERENCE   24 (sites; 5' and 3' termini of 19s late mRNAs)
  AUTHORS   Ghosh,P.K., Reddy,V.B., Swinscoe,J., Lebowitz,P. and Weissman,S.M.
  TITLE     Heterogeneity and 5'-terminal structures of the late RNAs of Simian
            Virus 40
  JOURNAL   J. Mol. Biol. 126, 813-846 (1978)
  STANDARD  full staff_review
REFERENCE   25 (bases 562 to 1620; VP2 and VP3 genes)
  AUTHORS   Reddy,V.B., Dhar,R. and Weissman,S.M.
  TITLE     Nucleotide sequence of the genes for the Simian Virus 40 proteins
            VP2 and VP3
  JOURNAL   J. Biol. Chem. 253, 621-630 (1978)
  STANDARD  full staff_review
REFERENCE   26 (bases 358 to 914; sites-5' termini of late mRNAs)
  AUTHORS   Dhar,R., Reddy,V.B. and Weissman,S.M.
  TITLE     Nucleotide sequence of the DNA encoding the 5'-terminal sequences
            of Simian Virus 40 late mRNA
  JOURNAL   J. Biol. Chem. 253, 612-620 (1978)
  STANDARD  full staff_review
REFERENCE   27 (bases 2300 to 2668; sites-mRNA 3' termini)
  AUTHORS   Zain,B.S., Thimmappaya,B., Dhar,R. and Weissman,S.M.
  TITLE     Nucleotide sequences of DNA encoding the 3' ends of SV40 mRNA I.
            the sequence of the DNA fragment HindII,III-G
  JOURNAL   J. Biol. Chem. 253, 1606-1612 (1978)
  STANDARD  full staff_review
REFERENCE   28 (bases 2669 to 2901)
  AUTHORS   van Heuverswyn,H., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the Simian-Virus-40 DNA region coding for
            the carboxyl-terminal part of the T antigen
  JOURNAL   Eur. J. Biochem. 86, 335-344 (1978)
  STANDARD  full staff_review
REFERENCE   29 (bases 264 to 526; 1463 to 1516; 2582 to 2602; 2663 to 2676;
            sites-cDNA, 16S leader segments)
  AUTHORS   Reddy,V.B., Ghosh,P.K., Lebowitz,P. and Weissman,S.M.
  TITLE     Gaps and duplicated sequences in the leaders of SV40 16S RNA
  JOURNAL   Nucleic Acids Res. 5, 4195-4213 (1978)
  STANDARD  full staff_review
REFERENCE   30 (sites; cap site for 16s and 19s late mRNAs)
  AUTHORS   Haegeman,G. and Fiers,W.
  TITLE     Localization of the 5' terminus of late SV40 mRNA
  JOURNAL   Nucleic Acids Res. 5, 2359-2371 (1978)
  STANDARD  full staff_review
REFERENCE   31 (sites; mutations in region of origin of replication)
  AUTHORS   Subramanian,K.N. and Shenk,T.
  TITLE     Definition of the boundaries of the origin of DNA replication in
            Simian Virus 40
  JOURNAL   Nucleic Acids Res. 5, 3635-3642 (1978)
  STANDARD  full staff_review
REFERENCE   32 (sites; splice sites in late mRNAs)
  AUTHORS   Lai,C.-J., Dhar,R. and Khoury,G.
  TITLE     Mapping the spliced and unspliced late lytic SV40 RNAs
  JOURNAL   Cell 14, 971-982 (1978)
  STANDARD  full staff_review
REFERENCE   33 (sites; early mRNAs)
  AUTHORS   May,E., Kress,M. and May,P.
  TITLE     Characterization of two SV40 early mRNAs and evidence for a nuclear
            "prespliced" RNA species
  JOURNAL   Nucleic Acids Res. 5, 3083-3099 (1978)
  STANDARD  full staff_review
REFERENCE   34 (sites; early mRNAs and RNA splicing)
  AUTHORS   Berk,A.J. and Sharp,P.A.
  TITLE     Spliced early mRNAs of Simian Virus 40
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75, 1274-1278 (1978)
  STANDARD  full staff_review
REFERENCE   35 (review; bases 1 to 163; 181 to 5243)
  AUTHORS   Lebowitz,P. and Weissman,S.M.
  TITLE     Organization and transcription of the Simian Virus 40 genome
  JOURNAL   Curr. Top. Microbiol. Immunol. 87, 43-172 (1979)
  STANDARD  full staff_review
REFERENCE   36 (bases 1 to 472; 5174 to 5243)
  AUTHORS   van Heuverswyn,H. and Fiers,W.
  TITLE     Nucleotide sequence of the Hind-C fragment of Simian Virus 40 DNA:
            comparison of the 5'-untranslated region of wild-type virus and of
            some deletion mutants
  JOURNAL   Eur. J. Biochem. 100, 51-60 (1979)
  STANDARD  full staff_review
REFERENCE   37 (bases 1276 to 1539; deletion mutant analysis)
  AUTHORS   Contreras,R., Cole,C., Berg,P. and Fiers,W.
  TITLE     Nucleotide sequence analysis of two Simian Virus 40 mutants with
            deletions in the late region of the genome
  JOURNAL   J. Virol. 29, 789-793 (1979)
  STANDARD  full staff_review
REFERENCE   38 (bases 325 to 562; 1403 to 1582; cDNA)
  AUTHORS   Bina-Stein,M., Thoren,M., Salzman,N. and Thompson,J.A.
  TITLE     Rapid sequence determination of late Simian Virus 40 16S mRNA
            leader by using inhibitors of reverse transcriptase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 76, 731-735 (1979)
  STANDARD  full staff_review
REFERENCE   39 (bases 2691 to 4571; 4638 to 5163;
            sites-5' termini of early mRNAs)
  AUTHORS   Reddy,V.B., Ghosh,P.K., Lebowitz,P., Piatak,M. and Weissman,S.M.
  TITLE     Simian Virus 40 early mRNA's: I. genomic localization of 3' and 5'
            termini and two major splices in mRNA from transformed and
            lytically infected cells
  JOURNAL   J. Virol. 30, 279-296 (1979)
  STANDARD  full staff_review
REFERENCE   40 (sites; splice sites for large T antigen mRNA)
  AUTHORS   Mark,D.F. and Berg,P.
  TITLE     A third splice site in SV40 early mRNA
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 44, 55-62 (1980)
  STANDARD  full staff_review
REFERENCE   41 (bases 4963 to 5179; deletion mutant analysis)
  AUTHORS   Pipas,J.M., Adler,S.P., Peden,K.W.C. and Nathans,D.
  TITLE     Deletion mutants of SV40 that affect the structure of viral tumor
            antigens
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 44, 285-291 (1980)
  STANDARD  full staff_review
REFERENCE   42 (bases 2667 to 2840; deletion mutant analysis)
  AUTHORS   van Heuverswyn,H., Cole,C., Berg,P. and Fiers,W.
  TITLE     Nucleotide sequence analysis of two Simian Virus 40 mutants with
            deletions in the region coding for the carboxyl terminus of the T
            antigen
  JOURNAL   J. Virol. 30, 936-941 (1979)
  STANDARD  full staff_review
REFERENCE   43 (bases 263 to 532; deletion mutant analysis)
  AUTHORS   Subramanian,K.N.
  TITLE     Segments of Simian Virus 40 DNA spanning most of the leader
            sequence of the major late viral messenger RNA are dispensable
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 76, 2556-2560 (1979)
  STANDARD  full staff_review
REFERENCE   44 (bases 4548 to 4923; deletion mutant analysis)
  AUTHORS   Volckaert,G., Feunteun,J., Crawford,L.V., Berg,P. and Fiers,W.
  TITLE     Nucleotide sequence deletions within the coding region for small-t
            antigen of Simian Virus 40
  JOURNAL   J. Virol. 30, 674-682 (1979)
  STANDARD  full staff_review
REFERENCE   45 (bases 250 to 304; 345 to 399; deletion mutant analysis)
  AUTHORS   Haegeman,G., van Heuverswyn,H., Gheysen,D. and Fiers,W.
  TITLE     Heterogeneity of the 5' terminus of late mRNA induced by a viable
            Simian Virus 40 deletion mutant
  JOURNAL   J. Virol. 31, 484-493 (1979)
  STANDARD  full staff_review
REFERENCE   46 (bases 4566 to 5225; cDNA)
  AUTHORS   Thompson,J.A., Radonovich,M.F. and Salzman,N.P.
  TITLE     Characterization of the 5'-terminal structure of Simian Virus 40
            early mRNA's
  JOURNAL   J. Virol. 31, 437-446 (1979)
  STANDARD  full staff_review
REFERENCE   47 (sites; deletion mutant analysis over bases 4553 to 5172)
  AUTHORS   Thimmappaya,B. and Shenk,T.
  TITLE     Nucleotide sequence analysis of viable deletion mutants lacking
            segments of the Simian Virus 40 genome coding for small t antigen
  JOURNAL   J. Virol. 30, 668-673 (1979)
  STANDARD  full staff_review
REFERENCE   48 (sites; deletion mutant analysis of major late cap site)
  AUTHORS   Haegeman,G., Iserentant,D., Gheysen,D. and Fiers,W.
  TITLE     Characterization of the major altered leader sequence of late mRNA
            induced by SV40 deletion mutant dl-1811
  JOURNAL   Nucleic Acids Res. 7, 1799-1814 (1979)
  STANDARD  full staff_review
REFERENCE   49 (sites; splice sites in early mRNAs)
  AUTHORS   Khoury,G., Gruss,P., Dhar,R. and Lai,C.-J.
  TITLE     Processing and expression of early SV40 mRNA: a role for RNA
            conformation in splicing
  JOURNAL   Cell 18, 85-92 (1979)
  STANDARD  full staff_review
REFERENCE   50 (review; bases 1 to 163; 181 to 5243)
  AUTHORS   Buchman,A.R., Burnett,L. and Berg,P.
  TITLE     Appendix A: The SV40 nucleotide sequence
  JOURNAL   (in) Tooze,J. (Eds.);
            DNA Tumor Viruses - Second Edition Revised: 799-841;
            Cold Spring Harbor Laboratory, Cold Spring Harbor (1981).
  STANDARD  full staff_review
REFERENCE   51 (bases 2663 to 2921; deletion mutant analysis)
  AUTHORS   Alwine,J.C. and Khoury,G.
  TITLE     Simian Virus 40-associated small RNA: mapping on the Simian Virus
            40 genome and characterization of its synthesis
  JOURNAL   J. Virol. 36, 701-708 (1980)
  STANDARD  full staff_review
REFERENCE   52 (bases 4007 to 4574)
  AUTHORS   Volckaert,G., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the Simian Virus 40 HindII+III restriction
            fragment A: (second part of the T antigen gene)
  JOURNAL   Eur. J. Biochem. 106, 169-177 (1980)
  STANDARD  full staff_review
REFERENCE   53 (bases 3476 to 3735; RNA and DNA)
  AUTHORS   van Herreweghe,J., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the Simian Virus 40 HindII+III restriction
            fragment I: (fourth part of the T antigen gene)
  JOURNAL   Eur. J. Biochem. 106, 179-192 (1980)
  STANDARD  full staff_review
REFERENCE   54 (bases 3146 to 3481)
  AUTHORS   van Herreweghe,J., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the Hind-I-proximal part of Simian Virus 40
            HindII+III restriction fragment B: (fifth part of the T antigen
            gene
  JOURNAL   Eur. J. Biochem. 106, 193-198 (1980)
  STANDARD  full staff_review
REFERENCE   55 (bases 2669 to 3251)
  AUTHORS   van Heuverswyn,H., van de Voorde,A., van Herreweghe,J.,
            Volckaert,G., de Winne,P. and Fiers,W.
  TITLE     Nucleotide sequence of Simian Virus 40 DNA: structure of the middle
            segment of the HindII+III restriction fragment B: (sixth part of
            the T antigen gene) and codon usage
  JOURNAL   Eur. J. Biochem. 106, 199-209 (1980)
  STANDARD  full staff_review
REFERENCE   56 (bases 4572 to 4917)
  AUTHORS   Seif,I., Khoury,G. and Dhar,R.
  TITLE     A rapid enzymatic DNA sequencing technique: determination of
            sequence alterations in early Simian Virus 40 temperature sensitive
            and deletion mutants
  JOURNAL   Nucleic Acids Res. 8, 2225-2240 (1980)
  STANDARD  full staff_review
REFERENCE   57 (bases 1 to 74; 5161 to 5243;
            deletion and point mutants over the replication origin)
  AUTHORS   DiMaio,D. and Nathans,D.
  TITLE     Cold-sensitive regulatory mutants of Simian Virus 40
  JOURNAL   J. Mol. Biol. 140, 129-142 (1980)
  STANDARD  full staff_review
REFERENCE   58 (bases 13 to 65; 5155 to 5227;
            deletion mutants over the early promoter)
  AUTHORS   Benoist,C. and Chambon,P.
  TITLE     Deletions covering the putative promoter region of early mRNAs of
            Simian Virus 40 do not abolish T-antigen expression
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77, 3865-3869 (1980)
  STANDARD  full staff_review
REFERENCE   59 (sites; T antigen binding sites)
  AUTHORS   Shalloway,D., Kleinberger,T. and Livingston,D.M.
  TITLE     Mapping of SV40 DNA replication origin region binding sites for the
            SV40 T antigen by protection against exonuclease III digestion
  JOURNAL   Cell 20, 411-422 (1980)
  STANDARD  full staff_review
REFERENCE   60 (bases 1 to 33; 212 to 236; 5161 to 5243;
            origin-defective deletion mutants)
  AUTHORS   Gluzman,Y., Sambrook,J.F. and Frisque,R.J.
  TITLE     Expression of early genes of origin-defective mutants of Simian
            Virus 40
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77, 3898-3902 (1980)
  STANDARD  full staff_review
REFERENCE   61 (bases 5213 to 5236; early mRNA initiation sites)
  AUTHORS   Haegeman,G. and Fiers,W.
  TITLE     Characterization of the 5'-terminal cap structures of early Simian
            Virus 40 mRNA
  JOURNAL   J. Virol. 35, 955-961 (1980)
  STANDARD  full staff_review
REFERENCE   62 (sites; early mRNA initiation sites and repression by T antigen)
  AUTHORS   Hansen,U., Tenen,D.G., Livingston,D.M. and Sharp,P.A.
  TITLE     T antigen repression of SV40 early transcription from two promoters
  JOURNAL   Cell 27, 603-612 (1981)
  STANDARD  full staff_review
REFERENCE   63 (sites; poly-A at 2662; deletion mutants over 2533-2775)
  AUTHORS   Fitzgerald,M. and Shenk,T.
  TITLE     The sequence 5'-AAUAAA-3' forms part of the recognition site for
            polyadenylation of late SV40 mRNAs
  JOURNAL   Cell 24, 251-260 (1981)
  STANDARD  full staff_review
REFERENCE   64 (bases 306 to 428; deletion mutant analysis)
  AUTHORS   Woodworth-Gutai,M.
  TITLE     Recombination in SV40-infected cells: viral DNA sequences at sites
            of circularization of transfecting linear DNA
  JOURNAL   Virology 109, 353-365 (1981)
  STANDARD  full staff_review
REFERENCE   65 (bases 101 to 178; 251 to 280; early promoter analysis)
  AUTHORS   Gruss,P., Dhar,R. and Khoury,G.
  TITLE     Simian Virus 40 tandem repeated sequences as an element of the
            early promoter
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 943-947 (1981)
  STANDARD  full staff_review
REFERENCE   66 (bases 110 to 178; 251 to 601; deletion mutant analysis)
  AUTHORS   Barkan,A. and Mertz,J.E.
  TITLE     DNA sequence analysis of Simian Virus 40 mutants with deletions
            mapping in the leader region of the late viral mRNA's: mutants with
            deletions similar in size and position exhibit varied phenotypes
  JOURNAL   J. Virol. 37, 730-737 (1981)
  STANDARD  full staff_review
REFERENCE   67 (bases 1 to 54; 5148 to 5243; cDNAs over this region)
  AUTHORS   Ghosh,P.K., Lebowitz,P., Frisque,R.J. and Gluzman,Y.
  TITLE     Identification of a promoter component involved in positioning the
            5' termini of Simian Virus 40 early mRNAs
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 100-104 (1981)
  STANDARD  full staff_review
REFERENCE   68 (bases 1 to 173; 182 to 280; 5216 to 5243;
            early promoter analysis)
  AUTHORS   Benoist,C. and Chambon,P.
  TITLE     In vivo sequence requirements of the SV40 early promoter region
  JOURNAL   Nature 290, 304-310 (1981)
  STANDARD  full staff_review
REFERENCE   69 (sites; promoter for early mRNAs)
  AUTHORS   Mathis,D.J. and Chambon,P.
  TITLE     The SV40 early region TATA box is required for accurate in vitro
            initiation of transcription
  JOURNAL   Nature 290, 310-315 (1981)
  STANDARD  full staff_review
REFERENCE   70 (bases 1 to 177; 5171 to 5243; autoregulation by T antigen)
  AUTHORS   Myers,R.M., Rio,D.C., Robbins,A.K. and Tjian,R.
  TITLE     SV40 gene expression is modulated by the cooperative binding of T
            antigen to DNA
  JOURNAL   Cell 25, 373-384 (1981)
  STANDARD  full staff_review
REFERENCE   71 (bases 1 to 102; 5148 to 5243;
            upstream shift of early mRNA initiation site)
  AUTHORS   Ghosh,P.K. and Lebowitz,P.
  TITLE     Simian Virus 40 early mRNA's contain multiple 5'termini upstream
            and downstream from a Hogness-Goldberg sequence; a shift in 5'
            termini during the lytic cycle is mediated by large T antigen
  JOURNAL   J. Virol. 40, 224-240 (1981)
  STANDARD  full staff_review
REFERENCE   72 (bases 2644 to 2923; deletion mutant analysis)
  AUTHORS   Feunteun,J., Carmichael,G., Nicolas,J.C. and Kress,M.
  TITLE     Mutant carrying deletions in the two Simian Virus 40 early genes
  JOURNAL   J. Virol. 40, 625-634 (1981)
  STANDARD  full staff_review
REFERENCE   73 (sites; cds for agnoprotein)
  AUTHORS   Jackson,V. and Chalkley,R.
  TITLE     Use of whole-cell fixation to visualize replicating and maturing
            Simian Virus 40: identification of new viral gene product
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 6081-6085 (1981)
  STANDARD  full staff_review
REFERENCE   74 (bases 1 to 32; 212 to 236; 5161 to 5243;
            T antigen binding to a deletion mutant)
  AUTHORS   McKay,R. and DiMaio,D.
  TITLE     Binding of an SV40 T antigen-related protein to the DNA of SV40
            regulatory mutants
  JOURNAL   Nature 289, 810-813 (1981)
  STANDARD  full staff_review
REFERENCE   75 (sites; transcriptional enhancement by the 72 bp repeats)
  AUTHORS   Moreau,V., Hen,R., Wasylyk,B., Everett,R., Gaub,M.P. and Chambon,P.
  TITLE     The SV40 72 base repair repeat has a striking effect on gene
            expression both in SV40 and other chimeric recombinants
  JOURNAL   Nucleic Acids Res. 9, 6047-6068 (1981)
  STANDARD  full staff_review
REFERENCE   76 (bases 325 to 622; 1463 to 1511; cDNAs;
            late mRNA synthesis by deletion mutants)
  AUTHORS   Piatak,M., Subramanian,K.N., Roy,P. and Weissman,S.M.
  TITLE     Late messenger RNA production by viable Simian Virus 40 mutants
            with deletions in the leader region
  JOURNAL   J. Mol. Biol. 153, 589-618 (1981)
  STANDARD  full staff_review
REFERENCE   77 (sites; agnoprotein coding sequence)
  AUTHORS   Jay,G., Nomura,S., Anderson,C.W. and Khoury,G.
  TITLE     Identification of the SV40 agnogene product: a DNA binding protein
  JOURNAL   Nature 291, 346-349 (1981)
  STANDARD  full staff_review
REFERENCE   78 (sites; mapping of the origin of replication)
  AUTHORS   Bergsma,D.J., Olive,D.M., Hartzell,S.W. and Subramanian,K.N.
  TITLE     Territorial limits and functional anatomy of the Simian Virus 40
            replication origin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 79, 381-385 (1982)
  STANDARD  full staff_review
REFERENCE   79 (sites; topoisomerase cleavage sites)
  AUTHORS   Edwards,K.A., Halligan,B.D., Davis,J.L., Nivera,N.L. and Liu,L.F.
  TITLE     Recognition sites of eukaryotic DNA topoisomerase I: DNA nucleotide
            sequencing analysis of topo I cleavage sites on SV40 DNA
  JOURNAL   Nucleic Acids Res. 10, 2565-2576 (1982)
  STANDARD  full staff_review
REFERENCE   80 (bases 2619 to 2920; deletion mutant analysis)
  AUTHORS   Polvino-Bodnar,M. and Cole,C.N.
  TITLE     Construction and characterization of viable deletion mutants of
            Simian Virus 40 lacking sequences near the 3' end of the early
            region
  JOURNAL   J. Virol. 43, 489-502 (1982)
  STANDARD  full staff_review
REFERENCE   81 (bases 255 to 2535; only joints of deletion mutants)
  AUTHORS   Johnson,A.D., Barkan,A. and Mertz,J.E.
  TITLE     Nucleotide sequence analysis of the recombinant joints in 16
            naturally arising deletion mutants of Simian Virus 40
  JOURNAL   Virology 123, 464-469 (1982)
  STANDARD  full staff_review
REFERENCE   82 (bases 260 to 422; RNA)
  AUTHORS   Hay,N., Skolnik-David,H. and Aloni,Y.
  TITLE     Attenuation in the control of SV40 gene expression
  JOURNAL   Cell 29, 183-193 (1982)
  STANDARD  full staff_review
REFERENCE   83 (sites; late mRNA initiation sites)
  AUTHORS   Alwine,J.C.
  TITLE     Evidence for Simian Virus 40 late transcriptional control: mixed
            infections of wild-type Simian Virus 40 and a late leader deletion
            mutant exhibit trans effects on late viral RNA synthesis
  JOURNAL   J. Virol. 42, 798-803 (1982)
  STANDARD  full staff_review
REFERENCE   84 (bases 284 to 326; late promoter analysis)
  AUTHORS   Brady,J., Radonovich,M., Vodkin,M., Natarajan,V., Thoren,M.,
            Das,G., Janik,J. and Salzman,N.P.
  TITLE     Site-specific base substitution and deletion mutations that enhance
            or suppress transcription of the SV40 major late RNA
  JOURNAL   Cell 31, 625-633 (1982)
  STANDARD  full staff_review
REFERENCE   85 (bases 1 to 16; 5227 to 5243;
            mutant analysis over origin of replication)
  AUTHORS   Bergsma,D.J., Olive,D.M., Hartzell,S.W., Byrne,B.J. and
            Subramanian,K.N.
  TITLE     Cyclization of linear chimeric plasmids in vivo by a novel
            end-to-end joining reaction or by intramolecular recombination: one
            of the products contains a 147-bp perfect palindrome stable in
            Escherichia coli
  JOURNAL   Gene 20, 157-167 (1982)
  STANDARD  full staff_review
REFERENCE   86 (bases 5164 to 5243; 1 to 48;
            T antigen binding to a deletion mutant)
  AUTHORS   Tenen,D.G., Haines,L.L. and Livingston,D.M.
  TITLE     Binding of an analog of the Simian Virus 40 T antigen to wild-type
            and mutant viral replication origins
  JOURNAL   J. Mol. Biol. 157, 473-492 (1982)
  STANDARD  full staff_review
REFERENCE   87 (sites; T antigen binding sites)
  AUTHORS   DiMaio,D. and Nathans,D.
  TITLE     Regulatory mutants of Simian Virus 40: effect of mutations at a T
            antigen binding site on DNA replication and expression of viral
            genes
  JOURNAL   J. Mol. Biol. 156, 531-548 (1982)
  STANDARD  full staff_review
REFERENCE   88 (sites; mapping of early region promoter)
  AUTHORS   Fromm,M. and Berg,P.
  TITLE     Deletion mapping of DNA regions required for SV40 early promoter
            function in vivo
  JOURNAL   J. Mol. Appl. Genet. 1, 457-481 (1982)
  STANDARD  full staff_review
REFERENCE   89 (sites; SV40 enhancement of beta-globin gene expression)
  AUTHORS   Banerji,J., Rusconi,S. and Schaffner,W.
  TITLE     Expression of a beta-globin gene is enhanced by remote SV40 DNA
            sequences
  JOURNAL   Cell 27, 299-308 (1981)
  STANDARD  full staff_review
REFERENCE   90 (bases 1 to 129; 5228 to 5243; mapping of the early promoter)
  AUTHORS   Byrne,B.J., Davis,M.S., Yamaguchi,J., Bergsma,D.J. and
            Subramanian,K.N.
  TITLE     Definition of the Simian Virus 40 early promoter region and
            demonstration of a host range bias in the enhancement effect of the
            Simian Virus 40 72-base-pair repeat
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 721-725 (1983)
  STANDARD  full staff_review
REFERENCE   91 (sites; agnogene mutants)
  AUTHORS   Mertz,J.E., Murphy,A. and Barkan,A.
  TITLE     Mutants deleted in the agnogene of Simian Virus 40 define a new
            complementation group
  JOURNAL   J. Virol. 45, 36-46 (1983)
  STANDARD  full staff_review
REFERENCE   92 (bases 335 to 394; 1706 to 1753; mutant analysis)
  AUTHORS   Margolskee,R.F. and Nathans,D.
  TITLE     Suppression of a VP1 mutant of Simian Virus 40 by missense
            mutations in serine codons of the viral agnogene
  JOURNAL   J. Virol. 48, 405-409 (1983)
  STANDARD  full staff_review
REFERENCE   93 (bases 3512 to 3565; 4154 to 4311;
            mutant analysis over these regions)
  AUTHORS   Pipas,J.M., Peden,K.W.C. and Nathans,D.
  TITLE     Mutational analysis of Simian Virus 40 T antigen: isolation and
            characterization of mutants with deletions in the T-antigen gene
  JOURNAL   Mol. Cell. Biol. 3, 203-213 (1983)
  STANDARD  full staff_review
REFERENCE   94 (bases 1709 to 1786; topoisomerase II)
  AUTHORS   Liu,L.F., Rowe,T.C., Yang,L., Tewey,K.M. and Chen,G.L.
  TITLE     Cleavage of DNA by mammalian DNA topoisomerase II
  JOURNAL   J. Biol. Chem. 258, 15365-15370 (1983)
  STANDARD  full staff_review
REFERENCE   95 (bases 2773 to 4406; only joints of deletion mutants)
  AUTHORS   Tornow,J. and Cole,C.N.
  TITLE     Intracistronic complementation in the Simian Virus 40 A gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 6312-6316 (1983)
  STANDARD  full staff_review
REFERENCE   96 (bases 21 to 118;
            transcription factor binding to early promoter)
  AUTHORS   Dynan,W.S. and Tjian,R.
  TITLE     The promoter-specific transcription factor Sp1 binds to upstream
            sequences in the SV40 early promoter
  JOURNAL   Cell 35, 79-87 (1983)
  STANDARD  full staff_review
REFERENCE   97 (sites; 7S-K RNA of transformed cells)
  AUTHORS   Sohn,U., Szyszko,J., Coombs,D. and Krause,M.
  TITLE     7S-K nuclear RNA from Simian Virus 40-transformed cells has
            sequence homology to the viral early promoter
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 7090-7094 (1983)
  STANDARD  full staff_review
REFERENCE   98 (bases 1 to 49; 5181 to 5243; T antigen binding analysis)
  AUTHORS   Tenen,D.G., Livingston,D.M., Wang,S.-S. and Martin,R.G.
  TITLE     Effect of a stem-loop structure within the SV40 replication origin
            upon SV40 T antigen binding to origin region sequences
  JOURNAL   Cell 34, 629-639 (1983)
  STANDARD  full staff_review
REFERENCE   99 (bases 253 to 482; cDNA; late mRNA initiation)
  AUTHORS   Piatak,M., Ghosh,P.K., Norkin,L.C. and Weissman,S.M.
  TITLE     Sequences locating the 5' ends of the major Simian Virus 40 late
            mRNA forms
  JOURNAL   J. Virol. 48, 503-520 (1983)
  STANDARD  full staff_review
REFERENCE   100(bases 1 to 14; 5210 to 5243; autoregulation by T antigen)
  AUTHORS   Rio,D.C. and Tjian,R.
  TITLE     Sv40 T antigen binding site mutations that affect autoregulation
  JOURNAL   Cell 32, 1227-1240 (1983)
  STANDARD  full staff_review
REFERENCE   101(bases 1 to 134; 5216 to 5243; early promoter analysis)
  AUTHORS   Everett,R.D., Baty,D. and Chambon,P.
  TITLE     The repeated GC-rich motifs upstream from the TATA box are
            important elements of the SV40 early promoter
  JOURNAL   Nucleic Acids Res. 11, 2447-2464 (1983)
  STANDARD  full staff_review
REFERENCE   102(bases 5165 to 5243)
  AUTHORS   Tenen,D.G., Taylor,T.S., Haines,L.L., Bradley,M.K., Martin,R.G. and
            Livingston,D.M.
  TITLE     Binding of simian virus 40 large T antigen from virus-infected
            monkey cells to wild-type and mutant viral replication origins
  JOURNAL   J. Mol. Biol. 168, 791-808 (1983)
  STANDARD  full staff_review
REFERENCE   103(bases 1 to 26; 5170 to 5243; T antigen binding sites)
  AUTHORS   Jones,K.A. and Tjian,R.
  TITLE     Essential contact residues within SV40 large T antigen binding
            sites I and II identified by alkylation-interference
  JOURNAL   Cell 36, 155-162 (1984)
  STANDARD  full staff_review
REFERENCE   104(sites; sites-5' termini of late mRNAs)
  AUTHORS   Lycan,D.E. and Danna,K.J.
  TITLE     S1 mapping of purified nascent transcripts of Simian Virus 40
  JOURNAL   Mol. Cell. Biol. 4, 625-633 (1984)
  STANDARD  full staff_review
REFERENCE   105(bases 1 to 33; 5213 to 5243; T antigen binding analysis)
  AUTHORS   Lewton,B.A., Delucia,A.L. and Tegtmeyer,P.
  TITLE     Binding of Simian Virus 40 A protein to DNA with deletions at the
            origin of replication
  JOURNAL   J. Virol. 49, 9-13 (1984)
  STANDARD  full staff_review
REFERENCE   106(sites; T antigen binding sites)
  AUTHORS   Pomerantz,B.J. and Hassell,J.A.
  TITLE     Polyomavirus and Simian Virus 40 large T antigens bind to common
            DNA sequences
  JOURNAL   J. Virol. 49, 925-937 (1984)
  STANDARD  full staff_review
REFERENCE   107(bases 1 to 118; 5186 to 5243; primase initiation sites)
  AUTHORS   Tseng,B.Y. and Ahlem,C.N.
  TITLE     Mouse primase initiation sites in the origin region of Simian Virus
            40
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 2342-2346 (1984)
  STANDARD  full staff_review
REFERENCE   108(bases 1 to 294; 297 to 379; transcriptional enhancer analysis)
  AUTHORS   Weber,F., de Villiers,J. and Schaffner,W.
  TITLE     An SV40 'enhancer trap' incorporates exogenous enhancers or
            generates enhancers from its own sequences
  JOURNAL   Cell 36, 983-992 (1984)
  STANDARD  full staff_review
REFERENCE   109(bases 1 to 33; 5213 to 5243;
            mutant analysis at the replication origin)
  AUTHORS   Cohen,G.L., Wright,P.J., Delucia,A.L., Lewton,B.A.,
            Anderson,M.E. and Tegtmeyer,P.
  TITLE     Critical spatial requirement within the origin of Simian Virus 40
            DNA replication
  JOURNAL   J. Virol. 51, 91-96 (1984)
  STANDARD  full staff_review
REFERENCE   110(sites; mRNA polyadenylation signal sequences)
  AUTHORS   Sadofsky,M. and Alwine,J.C.
  TITLE     Sequences on the 3' side of hexanucleotide AAUAAA affect efficiency
            of cleavage at the polyadenyation site
  JOURNAL   Mol. Cell. Biol. 4, 1460-1468 (1984)
  STANDARD  full staff_review
REFERENCE   111(bases 1582 to 1635; 2524 to 2652;
            chromatin structure at origin of replication; mutational analysis)
  AUTHORS   Innis,J.W. and Scott,W.A.
  TITLE     DNA replication and chromatin structure of Simian Virus 40
            insertion mutants
  JOURNAL   Mol. Cell. Biol. 4, 1499-1507 (1984)
  STANDARD  full staff_review
REFERENCE   112(sites; bidirectional promoter element)
  AUTHORS   Hartzell,S.W., Byrne,B.J. and Subramanian,K.N.
  TITLE     Mapping of the late promoter of Simian virus 40
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 23-27 (1984)
  STANDARD  full staff_review
REFERENCE   113(sites;
            late promoter sequence requirements in the presence of T antigen)
  AUTHORS   Hartzell,S.W., Byrne,B.J. and Subramanian,K.N.
  TITLE     The simian virus 40 minimal origin and the 72-base-pair repeat are
            required simultaneously for efficient induction of late gene
            expression with large tumor antigen
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 6335-6339 (1984)
  STANDARD  full staff_review
REFERENCE   114(bases 304 to 528)
  AUTHORS   Hay,N., Kessler,M. and Aloni,Y.
  TITLE     SV40 deletion mutant (d1861) with agnoprotein shortened by four
            amino acids
  JOURNAL   Virology 137, 160-170 (1984)
  STANDARD  full staff_review
REFERENCE   115(bases 1 to 366; 5230 to 5243)
  AUTHORS   Swimmer,C. and Shenk,T.
  TITLE     A viable simian virus 40 variant that carries a newly generated
            sequence reiteration in place of the normal duplicated enhancer
            element
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 6652-6656 (1984)
  STANDARD  full staff_review
REFERENCE   116(sites; Okazaki fragment sequences)
  AUTHORS   Hay,R.T., Hendrickson,E.A. and DePamphilis,M.L.
  TITLE     Sequence specificity for the initiation of RNA-primed simian virus
            40 DNA synthesis in vivo
  JOURNAL   J. Mol. Biol. 175, 131-157 (1984)
  STANDARD  full staff_review
REFERENCE   117(sites; tsA3900 point mutant sequence)
  AUTHORS   Hutchinson,N.I., Chang,L.-S., Pater,M.M., Bouck,N., Shenk,T.E. and
            Di Mayorca,G.
  TITLE     Characterization of a new simian virus 40 mutant, tsA3900, isolated
            from deletion mutant tsA1499
  JOURNAL   J. Virol. 53, 814-821 (1985)
  STANDARD  full staff_review
REFERENCE   118(sites; large t-antigen binding specificity)
  AUTHORS   Scheller,A. and Prives,C.
  TITLE     Simian virus 40 and polyomavirus large tumor antigens have
            different requirements for high-affinity sequence-specific DNA
            binding
  JOURNAL   J. Virol. 54, 532-545 (1985)
  STANDARD  full staff_review
REFERENCE   119(bases 1499 to 2590; temperature-sensitive mutants)
  AUTHORS   Ng,S.-C., Behm,M. and Bina,M.
  TITLE     DNA sequence alterations responsible for the synthesis of
            thermosensitive VP1 in temperature-sensitive BC mutants of simian
            virus 40
  JOURNAL   J. Virol. 54, 646-649 (1985)
  STANDARD  full staff_review
REFERENCE   120(sites; recombination in poly(GT)-sequences)
  AUTHORS   Stringer,J.R.
  TITLE     Recombination between poly[d(GT)-d(CA)] sequences in simian virus
            40-infected cultured cells
  JOURNAL   Mol. Cell. Biol. 5, 1247-1259 (1985)
  STANDARD  full staff_review
REFERENCE   121(sites; large T-antigen binding sites)
  AUTHORS   Barrera-Saldana,H., Takahashi,K., Vigneron,M., Wildeman,A.,
            Davidson,I. and Chambon,P.
  TITLE     All six GC-motifs of the SV40 early upstream element contribute to
            promoter activity in vivo and in vitro
  JOURNAL   EMBO J. 4, 3839-3849 (1985)
  STANDARD  full staff_review
REFERENCE   122(bases 4556 to 4657; 4907 to 4937)
  AUTHORS   Noble,J.C., Prives,C. and Manley,J.L.
  TITLE     In vitro splicing of simian virus 40 early pre mRNA
  JOURNAL   Nucleic Acids Res. 14, 1219-1235 (1986)
  STANDARD  full staff_review
REFERENCE   123(sites; SV40 early leader protein (SELP) boundaries)
  AUTHORS   Khalili,K., Brady,J. and Khoury,G.
  TITLE     Translational regulation of SV40 early mRNA defines a new viral
            protein
  JOURNAL   Cell 48, 639-645 (1987)
  STANDARD  full staff_review
REFERENCE   124(bases 1494 to 1708)
  AUTHORS   Rogiers,R., van de Voorde,A., Soeda,E. and Fiers,W.
  TITLE     Nucleotide sequence of the Simian virus 40 Hind-K restriction
            fragment
  JOURNAL   Eur. J. Biochem. 85, 205-224 (1978)
  STANDARD  full staff_entry
REFERENCE   125(bases 662 to 1051)
  AUTHORS   Ysebaert,M., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the Simian virus 40 HindII+III restriction
            fragment D and the total amino acid sequence of the late proteins
            VP2 and VP3
  JOURNAL   Eur. J. Biochem. 91, 431-439 (1978)
  STANDARD  full staff_entry
REFERENCE   126(bases 2060 to 2299)
  AUTHORS   van Heuverswyn,H., van de Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the Simian virus 40 HindII+III restriction
            fragment J and the total amino acid sequence of the major
            structural protein VP1
  JOURNAL   Eur. J. Biochem. 91, 415-430 (1978)
  STANDARD  full staff_entry
REFERENCE   127(bases 1 to 107; 5146 to 5243)
  AUTHORS   Hay,R.T. and DePamphilis,M.L.
  TITLE     Initiation of SV40 DNA replication in vivo: Location and structure
            of 5' ends of DNA synthesized in the ori region
  JOURNAL   Cell 28, 767-779 (1982)
  STANDARD  full staff_entry
REFERENCE   128(bases 80 to 350)
  AUTHORS   Contreras,R., Gheysen,D., Knowland,J., van de Voorde,A. and
            Fiers,W.
  TITLE     Evidence for the direct involvement of DNA replication origin in
            synthesis of late SV40 RNA
  JOURNAL   Nature 300, 500-505 (1982)
  STANDARD  full staff_entry
REFERENCE   129(sites; early SV40 transcription)
  AUTHORS   Lebowitz,P. and Ghosh,P.
  TITLE     Initiation and regulation of Simian virus 40 early transcription in
            vitro
  JOURNAL   J. Virol. 41, 449-461 (1982)
  STANDARD  full staff_entry
REFERENCE   130(sites; temperature sensitive mutants in the VP1 gene)
  AUTHORS   Behm,M., Lowman,H., Ng,S.-C. and Bina,M.
  TITLE     Analysis of temperature-sensitive mutations in the simian virus 40
            gene encoding virion protein 1
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 9421-9425 (1988)
  STANDARD  full staff_review
REFERENCE   131(sites; ends of "early" and "late" mRNA)
  AUTHORS   Subramanian,K.N., Ghosh,P.K., Dhar,R., Thimmappaya,B., Zain,S.B.,
            Pan,J. and Weissman,S.M.
  TITLE     The primary structure of regions of SV40 DNA encoding the ends of
            mRNA
  JOURNAL   Prog. Nucleic Acid Res. Mol. Biol. 19, 157-164 (1976)
  STANDARD  simple staff_review
REFERENCE   132(bases 1 to 164; 5092 to 5243)
  AUTHORS   Weissman,S.M., Subramanian,K.N., Dhar,R., Zain,B. and Pan,J.
  TITLE     SV40 DNA sequences as an example of the structure of genes
            functioning in animal cell nuclei
  JOURNAL   Med. Pediatr. Oncol. 2, 259-263 (1976)
  STANDARD  full staff_entry
REFERENCE   133(bases 1787 to 2059)
  AUTHORS   Contreras,R., Van De Voorde,A. and Fiers,W.
  TITLE     Nucleotide sequence of the restriction fragment Hind-F-EcoRI-1 of
            simian-virus-40 DNA (part of the VP1 gene)
  JOURNAL   Eur. J. Biochem. 86, 317-324 (1978)
  STANDARD  simple staff_review
COMMENT     Kindly reviewed by K. N. Subramanian.  Draft entry and computer-
            readable sequence for [130] kindly submitted by M.Bina,
            16-FEB-1989.
            
            The E or minus strand of SV40 strain 776 is shown as it is reported
            by [16] and [17] using the numbering system by [50], with exception
            of of the addition of 17 bases (between nucleotides 164 and 165) to
            their sequence [36]. Differences between the complete sequences as
            published by [16] and [17] and this sequence are limited to the 17
            bp change, the revision at 328 below, and the numbering. All
            references tend to agree upon an independent coordinate system,
            where map units run from 0.00 to 1.00 and the origin is defined by
            the sole EcoRI site, which is found at base 1782 below. In keeping
            with [50], map units for this entry have been calculated by
            
                                      SV + 3461
                            mu  =    -----------
                                        5243
            
            If the result is > 1, subtract 1 for the value. The origin of this
            sequence, then, is at 0.66. Identical units are used for Polyoma
            virus.
            
            DNA replication is bidirectional from an origin which has been
            narrowed by mutational analysis to bases 5193 to 34 [31]. Given
            that early mRNA transcription also arises from this region, the two
            processes will be, to some extent, simultaneously regulated. In
            addition to the 21-bp and 72-bp repeats concerned with early
            transcription (see below), the origin includes a 17-bp A + T-rich
            sequence; further demarcation of a "core" region and an "auxiliary"
            region makes the former comprise a 17-bp palindrome (5193-5209), a
            15-bp palindrome (5213-5227), a 27-bp palindrome (5230-13) and the
            A + T-rich region (15-31), and the latter the three 21-bp repeats
            [85]. Thus the "core" overlaps the T-antigen binding sites I and II
            and the "auxiliary region" (which is said to enhance replication
            efficiency) overlaps the T-antigen binding site III [85].
            
            Transcription of early mRNAs (18s and 19s) is leftward off the E,
            or minus, strand and transcription of late mRNAs (16s and 19s) is
            rightward off the L, or plus, strand. In the former case, the
            annotation will show the symbols "(c)" and "comp strand" for the
            transcripts and products, and for the regulatory elements which
            govern them. In some references, the E strand has been labeled
            "Late" to indicate that it has the polarity of late mRNA, and the L
            strand "Early" to indicate that it has the polarity of early mRNA.
            (Early mRNAs can be produced throughout the lytic cycle.)
            
            The characterization of the mRNAs as 18s and 19s early and 16s and
            19s late is a simplification: heterogeneity with regard to 5' and
            3' termini, splice sites, and abundance as a function of time in
            the lytic cycle and regulatory events, have all been observed [24]
            [35],[67],[71]. For the purposes of this entry, only the simplest
            major mRNA species have been entered in the Sites.
            
            The promoter for the early mRNAs is considered to include at least
            bases 5185 to 250 on the E strand: the T antigen binding sites (for
            which the minimal spans are given [62]), the "TATA" box at position
            21, the 21-bp repeats, and the 72-bp repeats are all known to play
            a role in early transcription. One of the effects of regulation at
            these sites is to shift the cap site to one of at least seven
            positions between bases 95 and 32 [71]. Bases 55 to 97 are also of
            interest because they permit base-pairing homology with the small
            nuclear RNA 7S-K [97] and because they bind the cell factor SP-1
            [96].
            
            For late promoter induction to occur at high efficiency two domains
            are required simultaneously: the minimal replication origin and the
            72 bp repeat [113]. The origin-proximal 22 bp portion of the 72-bp
            repeat is sufficient for induction, while the origin-distal portion
            is dispensable [113]. If T antigen is present, the 21 bp repeat is
            dispensable for induction of late promoter activity in vivo [113].
            The regulatory elements for early transcription have been annotated
            for the "comp strand". The late region induction signals are to be
            found in the same region as the early regulatory elements [113].
            The efficacy of the 72-bp repeat in promoting transcription has
            been demonstrated for both orientations [75].
            
            The SAS-RNA is a small viral associated RNA whose function is
            unknown. It appears to arise through processing from some of the
            longer early transcripts; mutants in the SAS-RNA coding region
            remain viable [51].
            
            Seven proteins -- the small t antigen, the large T antigen, the
            three structural proteins, the early leader protein [123], and the
            agnoprotein (which is thought to be involved with structural
            development [92])-- are known to be translated from the
            multiplicity of mRNAs. The existence of a middle T antigen, known
            to exist in Polyoma, has been suggested [40]. For a summary of
            other possible coding sequences, see [50].
            
            Missing data project [124-129], [131].
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               PRM          21     15 (C)   TATA-box [3]
               MSG         120   2674       major 16S RNA
FEATURES       from  to/span     description
    pept         16        1 (c) early leader protein (SELP; [123])
               5243     5188 (c) early leader protein (SELP; [123])
    pept        335      523     agnoprotein (lp-1, VCP)
    pept        562     1620     VP2 minor structural protein
    pept        916     1620     VP3 minor structural protein
    pept       1499     2593     VP1 major structural protein
    pept       5163     4918 (c) large T antigen exon 1
               4571     2691 (c) large T antigen exon 2
    pept       5163     4639 (c) small t antigen
    RNA        2907     2842 (c) SV40-associated small (SAS) RNA; 0.21 [51]
    pre-msg     264     2676     minor late 19s mRNA [29],[62],[71]
    pre-msg     325     2676     major late 19s mRNA [10],[19],[29],[30],[38]
    pre-msg    2587       31 (c) early mRNA (alt.) [3],[34],[39],[62],[71]
    pre-msg    2599       31 (c) early mRNA (alt.) [28],[62],[71]
    pre-msg    5225     2587 (c) early mRNA (alt.) [3],[34],[39],[46],[61]
    pre-msg    5230     2587 (c) early mRNA (alt.) [3],[34],[39],[68]
    pre-msg    5236     2587 (c) early mRNA (alt.) [3],[6],[34],[39],[61]
    pre-msg    5225     2599 (c) early mRNA (alt.) [28],[46],[61]
    pre-msg    5230     2599 (c) early mRNA (alt.) [28],[68]
    pre-msg    5236     2599 (c) early mRNA (alt.) [6],[28],[39],[61]
    IVS         295      434     late 19s intron [24],[29]
    IVS         527     1462     late 16s intron [19],[29],[32]
    IVS        4636     4572 (c) early 19s mRNA (small t antigen) intron
                                 [24],[29]
    IVS        4917     4572 (c) large T antigen (18s) intron [24],[29]
    rpt          60       40 (c) 21 bp non-tandem repeat I [90],[96],[97]
    rpt          82       62 (c) 21 bp tandem repeat II [90],[96],[97]
    rpt         103       83 (c) 21 bp tandem repeat III [90],[96],[97]
    rpt         178      107 (c) 72 bp tandem repeat enhancer sequence A
                                 [65],[68],[75],[90]
    rpt         250      179 (c) 72 bp tandem repeat enhancer sequence B
                                 [65],[68],[75],[90]
    binding    5231       13 (c) T antigen binding site II [23],[62],[74]
    binding      61       36 (c) T antigen binding site III 3' [23],[62],[74]
    binding    5209     5184 (c) T antigen binding site I [23],[62],[74]
    signal       27       21 (c) early mRNA promoter element; 0.66
                                 [58],[68],[69]
    orgrpl     5191       31     replication origin core region; 0.67 [85]
    orgrpl       32       83     replication origin auxiliary region; 0.67 [85]
    variant     164      182     gggactttccacacctggt in strain 776 [36]; gt in
                                 strain 777; 0.69 [16],[17]
    revision    328      331     gtgc in [36],[50]; gc in [16],[17]
    signal     2613     2608 (c) early mRNA polyadenyation signal on the comp
                                 strand; 0.16 [28],[110]
    signal     2642     2637 (c) early mRNA polyadenyation signal on the comp
                                 strand; 0.16 [28],[110]
    signal     2657     2662     late mRNA polyadenyation signal; 0.17
                                 [28],[63],[110]
    mut        1626     1626     g in wild-type; a in temperature-sensitive
                                 mutant tsC260
    mut        1667     1667     c in wild-type; a in temperature-sensitive
                                 mutant tsB228
    mut        1680     1680     c in wild-type; g in temperature-sensitive
                                 mutant tsB218
    mut        1718     1718     g in wild-type; a in temperature-sensitive
                                 mutant tsB[204,211,265]
    mut        1719     1719     c in wild-type; t in temperature-sensitive
                                 mutant tsB8
    mut        1756     1756     a in wild-type; c in temperature-sensitive
                                 mutant tsB8
    mut        1995     1995     g in wild-type; c in temperature-sensitive
                                 mutant tsB221
    mut        2003     2003     g in wild-type; a in temperature-sensitive
                                 mutant tsB201
    mut        2084     2084     c in wild-type; t in temperature-sensitive
                                 mutant tsBC223
    mut        2091     2091     c in wild-type; c in temperature-sensitive
                                 mutant tsB4
    mut        2141     2141     c in wild-type; t in temperature-sensitive
                                 mutant tsC219
    mut        2237     2237     c in wild-type; a in temperature-sensitive
                                 mutant tsC240
    mut        2262     2262     c in wild-type; t in temperature-sensitive
                                 mutant tsC260
    mut        2354     2354     c in wild-type; t in temperature-sensitive
                                 mutant tsBC[208,214,216,217,248,274]
    mut        2367     2367     a in wild-type; c in temperature-sensitive
                                 mutant tsBC11
BASE COUNT     1518 a   1100 c   1039 g   1586 t
ORIGIN      BglI site on the E (minus) strand of the genome; mu 0.66 [50]
        1 gcctcggcct ctgcataaat aaaaaaaatt agtcagccat ggggcggaga atgggcggaa
       61 ctgggcggag ttaggggcgg gatgggcgga gttaggggcg ggactatggt tgctgactaa
      121 ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt tccacacctg
      181 gttgctgact aattgagatg catgctttgc atacttctgc ctgctgggga gcctggggac
      241 tttccacacc ctaactgaca cacattccac agctggttct ttccgcctca gaaggtacct
      301 aaccaagttc ctctttcaga ggttatttca ggccatggtg ctgcgccggc tgtcacgcca
      361 ggcctccgtt aaggttcgta ggtcatggac tgaaagtaaa aaaacagctc aacgcctttt
      421 tgtgtttgtt ttagagcttt tgctgcaatt ttgtgaaggg gaagatactg ttgacgggaa
      481 acgcaaaaaa ccagaaaggt taactgaaaa accagaaagt taactggtaa gtttagtctt
      541 tttgtctttt atttcaggtc catgggtgct gctttaacac tgttggggga cctaattgct
      601 actgtgtctg aagctgctgc tgctactgga ttttcagtag ctgaaattgc tgctggagag
      661 gccgctgctg caattgaagt gcaacttgca tctgttgcta ctgttgaagg cctaacaacc
      721 tctgaggcaa ttgctgctat aggcctcact ccacaggcct atgctgtgat atctggggct
      781 cctgctgcta tagctggatt tgcagcttta ctgcaaactg tgactggtgt gagcgctgtt
      841 gctcaagtgg ggtatagatt ttttagtgac tgggatcaca aagtttctac tgttggttta
      901 tatcaacaac caggaatggc tgtagatttg tataggccag atgattacta tgatatttta
      961 tttcctggag tacaaacctt tgttcacagt gttcagtatc ttgaccccag acattggggt
     1021 ccaacacttt ttaatgccat ttctcaagct ttttggcgtg taatacaaaa tgacattcct
     1081 aggctcacct cacaggagct tgaaagaaga acccaaagat atttaaggga cagtttggca
     1141 aggtttttag aggaaactac ttggacagta attaatgctc ctgttaattg gtataactct
     1201 ttacaagatt actactctac tttgtctccc attaggccta caatggtgag acaagtagcc
     1261 aacagggaag ggttgcaaat atcatttggg cacacctatg ataatattga tgaagcagac
     1321 agtattcagc aagtaactga gaggtgggaa gctcaaagcc aaagtcctaa tgtgcagtca
     1381 ggtgaattta ttgaaaaatt tgaggctcct ggtggtgcaa atcaaagaac tgctcctcag
     1441 tggatgttgc ctttacttct aggcctgtac ggaagtgtta cttctgctct aaaagcttat
     1501 gaagatggcc ccaacaaaaa gaaaaggaag ttgtccaggg gcagctccca aaaaaccaaa
     1561 ggaaccagtg caagtgccaa agctcgtcat aaaaggagga atagaagttc taggagttaa
     1621 aactggagta gacagcttca ctgaggtgga gtgcttttta aatcctcaaa tgggcaatcc
     1681 tgatgaacat caaaaaggct taagtaaaag cttagcagct gaaaaacagt ttacagatga
     1741 ctctccagac aaagaacaac tgccttgcta cagtgtggct agaattcctt tgcctaattt
     1801 aaatgaggac ttaacctgtg gaaatatttt gatgtgggaa gctgttactg ttaaaactga
     1861 ggttattggg gtaactgcta tgttaaactt gcattcaggg acacaaaaaa ctcatgaaaa
     1921 tggtgctgga aaacccattc aagggtcaaa ttttcatttt tttgctgttg gtggggaacc
     1981 tttggagctg cagggtgtgt tagcaaacta caggaccaaa tatcctgctc aaactgtaac
     2041 cccaaaaaat gctacagttg acagtcagca gatgaacact gaccacaagg ctgttttgga
     2101 taaggataat gcttatccag tggagtgctg ggttcctgat ccaagtaaaa atgaaaacac
     2161 tagatatttt ggaacctaca caggtgggga aaatgtgcct cctgttttgc acattactaa
     2221 cacagcaacc acagtgcttc ttgatgagca gggtgttggg cccttgtgca aagctgacag
     2281 cttgtatgtt tctgctgttg acatttgtgg gctgtttacc aacacttctg gaacacagca
     2341 gtggaaggga cttcccagat attttaaaat tacccttaga aagcggtctg tgaaaaaccc
     2401 ctacccaatt tcctttttgt taagtgacct aattaacagg aggacacaga gggtggatgg
     2461 gcagcctatg attggaatgt cctctcaagt agaggaggtt agggtttatg aggacacaga
     2521 ggagcttcct ggggatccag acatgataag atacattgat gagtttggac aaaccacaac
     2581 tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt gatgctattg ctttatttgt
     2641 aaccattata agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca
     2701 ggttcagggg gaggtgtggg aggtttttta aagcaagtaa aacctctaca aatgtggtat
     2761 ggctgattat gatcatgaac agactgtgag gactgagggg cctgaaatga gccttgggac
     2821 tgtgaatcaa tgcctgtttc atgccctgag tcttccatgt tcttctcccc accatcttca
     2881 tttttatcag cattttcctg gctgtcttca tcatcatcat cactgtttct tagccaatct
     2941 aaaactccaa ttcccatagc cacattaaac ttcatttttt gatacactga caaactaaac
     3001 tctttgtcca atctctcttt ccactccaca attctgctct gaatactttg agcaaactca
     3061 gccacaggtc tgtaccaaat taacataaga agcaaagcaa tgccactttg aattattctc
     3121 ttttctaaca aaaactcact gcgttccagg caatgcttta aataatcttt gggcctaaaa
     3181 tctatttgtt ttacaaatct ggcctgcagt gttttaggca cactgtactc attcatggtg
     3241 actattccag ggggaaatat ttgagttctt ttatttaggt gtttcttttc taagtttacc
     3301 ttaacactgc catccaaata atcccttaaa ttgtccaggt tattaattcc ctgacctgaa
     3361 ggcaaatctc tggactcccc tccagtgccc tttacatcct caaaaactac taaaaactgg
     3421 tcaatagcta ctcctagctc aaagttcagc ctgtccaagg gcaaattaac atttaaagct
     3481 ttccccccac ataattcaag caaagcagct gctaatgtag ttttaccact atcaattggt
     3541 cctttaaaca gccagtatct ttttttagga atgttgtaca ccatgcattt taaaaagtca
     3601 tacaccactg aatccatttt gggcaacaaa cagtgtagcc aagcaactcc agccatccat
     3661 tcttctatgt cagcagagcc tgtagaacca aacattatat ccatcctatc caaaagatca
     3721 ttaaatctgt ttgttaacat ttgttctcta gttaattgta ggctatcaac ccgcttttta
     3781 gctaaaacag tatcaacagc ctgttggcat atggtttttt ggtttttgct gtcagcaaat
     3841 atagcagcat ttgcataatg cttttcatgg tacttatagt ggctgggctg ttctttttta
     3901 atacatttta aacacatttc aaaactgtac tgaaattcca agtacatccc aagcaataac
     3961 aacacatcat cacattttgt ttccattgca tactctgtta caagcttcca ggacacttgt
     4021 ttagtttcct ctgcttcttc tggattaaaa tcatgctcct ttaacccacc tggcaaactt
     4081 tcctcaataa cagaaaatgg atctctagtc aaggcactat acatcaaata ttccttatta
     4141 acccctttac aaattaaaaa gctaaaggta cacaattttt gagcatagtt attaatagca
     4201 gacactctat gcctgtgtgg agtaagaaaa aacagtatgt tatgattata actgttatgc
     4261 ctacttataa aggttacaga atatttttcc ataattttct tgtatagcag tgcagctttt
     4321 tcctttgtgg tgtaaatagc aaagcaagca agagttctat tactaaacac agcatgactc
     4381 aaaaaactta gcaattctga aggaaagtcc ttggggtctt ctacctttct cttctttttt
     4441 ggaggagtag aatgttgaga gtcagcagta gcctcatcat cactagatgg catttcttct
     4501 gagcaaaaca ggttttcctc attaaaggca ttccaccact gctcccattc atcagttcca
     4561 taggttggaa tctaaaatac acaaacaatt agaatcagta gtttaacaca ttatacactt
     4621 aaaaatttta tatttacctt agagctttaa atctctgtag gtagtttgtc caattatgtc
     4681 acaccacaga agtaaggttc cttcacaaag atcaagtcca aaccacattc taaagcaatc
     4741 gaagcagtag caatcaaccc acacaagtgg atctttcctg tataattttc tattttcatg
     4801 cttcatcctc agtaagcaca gcaagcatat gcagttagca gacattttct ttgcacactc
     4861 aggccattgt ttgcagtaca ttgcatcaac accaggattt aaggaagaag caaatacctc
     4921 agttgcatcc cagaagcctc caaagtcagg ttgatgagca tattttactc catcttccat
     4981 tttcttgtac agagtattca ttttcttcat tttttcttca tctcctcctt tatcaggatg
     5041 aaactccttg cattttttta aatatgcctt tctcatcaga ggaatattcc cccaggcact
     5101 cctttcaaga cctagaaggt ccattagctg caaagattcc tctctgttta aaactttatc
     5161 catctttgca aagctttttg caaaagccta ggcctccaaa aaagcctcct cactacttct
     5221 ggaatagctc agaggccgag gcg
//
LOCUS       BOVPROB1      257 bp ss-mRNA            MAM       03-AUG-1990
DEFINITION  Cattle prolactin mRNA, 5' end.
ACCESSION   M36873
KEYWORDS    prolactin.
SEGMENT     1 of 2
SOURCE      Cattle hypophysis, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 257)
  AUTHORS   Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and
            Baev,A.A.
  TITLE     Genetic engineering of peptide hormones: II. Possible polymorphism
            of preprolactin in cattle. Data of molecular cloning
  JOURNAL   Mol. Biol. 22, 117-121 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         51  >   257     prolactin precursor
    sigp         51      143     prolactin signal peptide
    matp        144  >   257     prolactin
BASE COUNT       51 a     76 c     66 g     64 t
ORIGIN      
        1 ataggacgag agcttcctgg tgaagtgtgt ttcttgaaat catcaccacc atggacagca
       61 aaggttcgtc gcagaaagca gggtcccgcc tgctcctgct gctggtggtg tcaaatctac
      121 tcttgtgcca gggtgtggtc tccacccccg tctgtcccaa tgggcctggc aactgccagg
      181 tatcccttcg agacctgttt gaccgggcag tcatggtgtc ccactacatc catgacctct
      241 cctcggaaat gttcaac
//
LOCUS       BOVPROB2      551 bp ss-mRNA            MAM       03-AUG-1990
DEFINITION  Cattle prolactin mRNA, 3' end.
ACCESSION   M36874
KEYWORDS    prolactin.
SEGMENT     2 of 2
SOURCE      Cattle hypophysis, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 551)
  AUTHORS   Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and
            Baev,A.A.
  TITLE     Genetic engineering of peptide hormones: II. Possible polymorphism
            of preprolactin in cattle. Data of molecular cloning
  JOURNAL   Mol. Biol. 22, 117-121 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      405     prolactin
BASE COUNT      153 a    135 c    119 g    144 t
ORIGIN      
        1 ccggaagata aagaacaagc ccaacagacc catcatgaag tccttatgag cttgattctt
       61 gggttgctgc gctcctggaa tgaccctctc tatcacctag tcaccgaggt gcggggtatg
      121 aaaggagccc cagatgctat cctatcgagg gccatagaga ttgaggaaga aaacaaacga
      181 cttctggaag gcatggagat gatatttggc caggttattc ctggagccaa agagactgag
      241 ccctaccctg tgtggtcagg actcccgtcc ctgcaaacta aggatgaaga tgcacgttat
      301 tctgcttttt ataacctgct ccactgcctg cgcagggatt caagcaagat tgacacttac
      361 cttaagctcc tgaattgcag aatcatctac aacaacaact gctaagccca cattccatcc
      421 tatccatttc tgagatggtt cttaatgatc cattccctgg caaacttctc tgagctttat
      481 agctttgtaa tgcatgcttg gctctaatgg gtttcatctt aaataaaaac agactctgta
      541 gcgatgtcaa a
//
LOCUS       CHKBABA1     4051 bp ds-DNA             VRT       03-AUG-1990
DEFINITION  G.gallus B-creatine kinase (B-CK) gene, exons 1-4.
ACCESSION   M33711 J05496
KEYWORDS    B-creatine kinase.
SEGMENT     1 of 4
SOURCE      G.gallus (strain White Leghorn) adult blood cell DNA, clones
            T[56,62].
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 4051)
  AUTHORS   Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C.
  TITLE     A unique chicken B-creatine kinase gene gives rise to two
            B-creatine kinase isoproteins with distinct N-termini by
            alternative splicing
  JOURNAL   J. Biol. Chem. 265, 11656-11666 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Wirz, 13-APR-1990.
FEATURES       from  to/span     description
    pept       1201     1393     B-creatine kinase, exon 2b (EC 2.7.3.2) (first
                                 expressed exon)
               2449     2603     B-creatine kinase, exon 3
               3464  +  3596     B-creatine kinase, exon 4
    pep$       1800     1977     B-creatine kinase, exon 2a
               2449     2603     B-creatine kinase, exon 3
               3464  +  3596     B-creatine kinase, exon 4
    pre-msg     411  >  4051     B-creatine kinase mRNA and introns
    IVS        1394     1789     B-creatine kinase intron A
    IVS        1978     2498     B-creatine kinase intron B (no splice
                                 consensus)
    IVS        2604     3463     B-creatine kinase intron C
    IVS        3597  >  4051     B-creatine kinase intron D
    signal      384      390     TATA box
    signal      251      256     GC box
    signal      310      315     GC box
    signal      328      332     CAT box
    signal      346      351     TATA box
    signal      358      362     GC box
    signal      384      389     TATA box
BASE COUNT      943 a    985 c   1128 g    995 t
ORIGIN      
        1 tttgcacacg accagaagag accttcagcg cgttacacaa taaacctatg ttcaaatctg
       61 aggtaaaagg agccagctag caaggatgga agcaaaaagc cgcttgtgcc cgccttggtg
      121 cctaacggct gccgtattcg ggctgccggt ggccgctggg acggatgtac gtgcacgtct
      181 gcgttcacga ggtcacgacg gctaccgctc caggctcgag gcggacgggc cacgaaccca
      241 aagccaccgg ccgcccaacc gccgcgcaaa atggcggcac gccttcactg gccaggagga
      301 gccgctgctc cgcccctcac ggctcgacca atggaatgaa tgggctataa ataggagcca
      361 atgaggcggc gggtcgcgct ctatataaaa ggggcgcggc ggctgactcg gggctccact
      421 ccggcttggg cagcgggtgg gtgcggagcg gccgctgagg tccttacgtt ggctgtggag
      481 ccctcggtaa gtcccggggc agccgagccg cctgcagctt aaagaaaaga agaaggaaac
      541 gatttaactt ttataagaaa aatgtatttt tgatggtgtg ggttggagcc ggtccggccg
      601 ggaaagcgtc ggggggagcg ggatgcggcg gggcggcgcg gggcaggtgc cgggtgcgta
      661 ccgggtgcgg ccgcagcttg agccggagcc ccgctctgaa agcagctcac cccggtattt
      721 agcagggtag ttacatacgg tcgccccagc cgcgtccccc gagggagccg tccgttgtgt
      781 aacggctcgc tgagcactgc ccactcgccc cggtccccgg ctcgcgctgc cggtaccggc
      841 accggccgcg gcgctcccca acggccgctc cgcggggctc cgccggcggc tgctccgccg
      901 cctccccgcg ccctggccga gccccggagc gtccggaccg ggatgctgaa ggatggggcg
      961 aggtttgcgg agctggagaa agtttccttt tttttttttc tcggtttttt tttctctttt
     1021 tttttttttt tttgggccct ccctccttgc ctggccttga ggaaggtgtt gtggcaacga
     1081 aggtgcggaa ggcacgggtg gaagcgaagc caaggtgacc ttgcttggcg ttggcgcctc
     1141 ccccgggccc ctggtgatag gcttccctct cttccctcag tgactgctgt agggacagcc
     1201 atgcccttct caaacagcca caacctcctg aagatgaagt actccgttga cgatgagtac
     1261 cccgacctga gcgtgcacaa caatcacatg gccaaggtgc tgaccctgga cctgtacaag
     1321 aagttgaggg acaggcagac ttccagtgga tttacgctgg atgatgtcat tcagaccggg
     1381 gttgacaacc caggtaacgg cggcaccgtt aggactgcag ggctctctgc tctccctcct
     1441 tcctggaagt gctccttgac agcaaagcaa accccgcacg ctccgaggga cggctgtgct
     1501 cctccgtgca atctgacaca gcaattccca ggcgctccca ttatgtaact gctctgctaa
     1561 atacttcaaa tcccaaccaa aaaaaaaaaa aaaaagcaaa cgctcaaatg tgttctcatt
     1621 tttatgcaac cggggggaaa aaaattctct gagctgtgtg ggcttacccg gctgttctga
     1681 gctcgtctgt tccaaagtgg ttatgtgatg ttctgtagca cttctatgaa agcaaggggg
     1741 agcggatgcc gtatacgtgt taacattaca gtatttattt ttactctagc aatatcaaaa
     1801 tggcccaact aaataatcag aggctgcctc ccgaggagga gtacccggac ctgagcaccc
     1861 acaacaacca catggccaaa gtgctaaccc tggatttgta caagaaactg agagacagag
     1921 tcacgcccag tggcttcacc ctggatgatg tcattcagac tggggttgat aatcccggta
     1981 aaatgcgctg agatattctg tgggagccag catgaagtac ccggtgcttt tcagtatggt
     2041 gacttaagct aagctcgtgg aggtggtgtt tatgagcaga ctatgaaaat gcggttctgt
     2101 ctgtagcaga gcgaataaag cgatgcatgc ttgctcacag caaagcatgc taaagtgtag
     2161 ttctatctac gggtttgcat tagctagtaa aattggatgt gctgtgcgaa attgccctaa
     2221 accaaagctt gacaagagtg tggcttaacg gcaactgctg tggtttccca tagcactgca
     2281 tttgactaac tggatgaaat gtaaaatgta gggtctccca actgctctta ggccagtgca
     2341 tttctaagtg agtagatctg ctgtgaagaa acttgcctgg ttagtgctgt gctggagtac
     2401 agctgagcat cacccaactt agctcttctg cctttgttac cactccaggc caccccttca
     2461 taatgacagt aggatgcgta gctggcgacg aagaatccta tgaagtgttt aaggaactct
     2521 ttgatccggt tattgaggac aggcacggtg gctacaaacc aactgatgag cacaagaccg
     2581 acctgaatgc tgataacctg caggtatacg tttttgttga gctactgata cccactgatc
     2641 tgaggcctgg ggaaagctgt gagcggctgc ctaaacggat tgtttgggct ctgtcctctc
     2701 tctctcatct ctcgctgccc gctgcagtct tacttaagag cctgcagcag taggttgtca
     2761 gtaactgggt agaaccaggc agctgacaag caggacttgc ttcatgtgct gctgcaatgt
     2821 aagtcatacc aaacacgttg agcacgtatt tctttgggct gtgctagttg tgtgaaacaa
     2881 cccagtccca gggctgttag aaatcttaac tcttcctact gtatgatgtt gtaggctaaa
     2941 aatcagccag tggtctcgtg ttatagaata gccattgtta cggctggaag tagaggtttc
     3001 tgcccatctt acagacttca aatgttattg cagtctctgg gaattccaac tgctatcctg
     3061 gttggcctgc cataggacat ctcaaaaatg agaacgcttt gacaataatg agtaactctc
     3121 atttgcagga gtaaacacct acctaagcaa ggatttgtct cgtgttgata cctctagatt
     3181 gcagagtgaa aggagtaaca gaagcctggg atttgggtct gagatgggtg taaaactgat
     3241 ggtatcccta tcttctagca catagttggt agtcgtagtc ttagcctcaa aacaaggagt
     3301 ttgatactct catacttgtt ggaaagctgg aaaaaatggt cctgaagtac atcgtattac
     3361 aggacaacta gtcagtctca aatatgaatg cacgtgacct gtgtgtattt agggtaagca
     3421 tcttagcaac aaggcagtac tgactaattt tctgatcttg tagggtggtg acgacttgga
     3481 tcctaattac gtgctgagct ctcgtgtgag aactggcagg agcatccgtg gattctgcct
     3541 tccccctcat tgcagcaggg gagagagacg ggctattgaa aagctctctg ttgaaggtaa
     3601 agtgttgaat caggcaatga ggggaaactg ctgacctcgt ctaggcaagg aggtgctctg
     3661 tgcagttgga gaatggaggt ggttggcaga agctgccttc ctcaaggctt aacttgagtg
     3721 tggaaccttc cctgaaaaag gcagcttgga ttatggttgc gtcacgttgc agtcggtact
     3781 cagagcagga acacatgtca gggaagttgg actgtattaa gttgaccttt gctgatcttg
     3841 gaagcttctt tctgtgtgtg gggaaatagg ttggttcatc acggctcaaa cctgagatgt
     3901 aataggtatg gagcaaccct agggcttgag tcatggttgc agtattttgt ggcttctttt
     3961 aattgacttt tattaatgtt cagcttgctt agaggtgtag tccagctctg agagagctct
     4021 agttctcagt ggaactcctc agctgcagtt c
//
LOCUS       CHKBABA2      444 bp ds-DNA             VRT       03-AUG-1990
DEFINITION  G.gallus B-creatine kinase (B-CK) gene, exon 5.
ACCESSION   M33712 J05496
KEYWORDS    B-creatine kinase.
SEGMENT     2 of 4
SOURCE      G.gallus (strain White Leghorn) adult blood cell DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 444)
  AUTHORS   Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C.
  TITLE     A unique chicken B-creatine kinase gene gives rise to two
            B-creatine kinase isoproteins with distinct N-termini by
            alternative splicing
  JOURNAL   J. Biol. Chem. 265, 11656-11666 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Wirz, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +    45  +   216     B-creatine kinase (B-CK), exon 5
    pep$    +    45  +   216     B-creatine kinase (B-CK), exon 5
    pre-msg       1  >   444     B-creatine kinase mRNA and introns
    IVS     <     1       44     B-creatine kinase intron E
    IVS         217  >   444     B-creatine kinase intron F
BASE COUNT      108 a     95 c    120 g    121 t
ORIGIN      
        1 tgtttatagt ctctgctccc atcctaaaag cacctctctt tcagctctgg gcagtctggg
       61 aggtgatctc aaggggaagt actatgccct aaggaacatg actgatgcag agcagcagca
      121 gctgattgat gatcacttct tgtttgacaa acctgtttct cctcttctgt tggcatctgg
      181 gatggcacga gattggcctg atgccagggg tatctggtga gtatatgccc agcagggtga
      241 gttgtgtgaa atccctgcaa caaaaagcct tgaaattttg gttgtggctg atataattgg
      301 acaaaacagc acgttgtgaa cttacgctgc tgaatctaat gaggaacttc tgcgtggtat
      361 cccctgggaa gcgatatctg aaagcagaac ttgccccaag tgctagtgtg cactgcattg
      421 ggctggtagg aagaagccag cttg
//
LOCUS       CHKBABA3     1586 bp ds-DNA             VRT       03-AUG-1990
DEFINITION  G.gallus B-creatine kinase (B-CK) gene, exons 6 and 7.
ACCESSION   M33713 J05496
KEYWORDS    B-creatine kinase.
SEGMENT     3 of 4
SOURCE      G.gallus (strain White Leghorn) adult blood cell DNA, clone T8.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 1586)
  AUTHORS   Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C.
  TITLE     A unique chicken B-creatine kinase gene gives rise to two
            B-creatine kinase isoproteins with distinct N-termini by
            alternative splicing
  JOURNAL   J. Biol. Chem. 265, 11656-11666 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Wirz, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +   308      431     B-creatine kinase, exon 6
               1170  +  1359     B-creatine kinase, exon 7
    pep$    +   308      431     B-creatine kinase, exon 6
               1170  +  1359     B-creatine kinase, exon 7
    pre-msg       1  >  1586     B-creatine kinase mRNA and introns
    IVS     <     1      307     B-creatine kinase intron G
    IVS         432     1171     B-creatine kinase intron H (no splice
                                 consensus)
    IVS        1360  >  1586     B-creatine kinase intron I
BASE COUNT      435 a    336 c    385 g    430 t
ORIGIN      
        1 tggactgaac ataaatgagg tgagcaatga ctgaactctt aagttttgat gctgttaaca
       61 ctggctaatg tcctgcaatt actttctgac atggatagat acaggccact tccaaatacc
      121 tcttgactag agtagggtaa aagtacaagc ttaattttgt tagtgcttag atggagaaca
      181 gggagataaa tttctttgga gaagcaatta aacgagcctc ttggaagcta cagcagtgag
      241 taacttggtg ctactgaatc tgtcttgaaa acgtcaagtg agacaacagg tctgtcttgt
      301 ctcctaggca caatgacaac aagaccttcc ttgtttggat caatgaggag gatcacctca
      361 gggttatttc catgcagaaa ggtggcaaca tgaaggaagt gtttactcgc ttctgtacag
      421 ggctaacaca ggtaaatcaa atactgcttg agctctgggt ctaaatatta atctggttgg
      481 agctgaccat acaggatctt ccaggctggg agcatgtaca caggaggtga cagccttctc
      541 ctttgttctg cagggtgctt ctttgtagtt tgggtttctg ggaacatcct gatgagatcc
      601 tctgctttga gggctctgaa atgcctgtac tggcaaagct ttatggctgt gaaatctgga
      661 ataaaacctc taatgtgaca cctttaccaa actgtactta aaggatctct gtggttctat
      721 aaaatgtatg tagagaccct gggtaggagg aaaaaattgc ttaaatgtga atgggtgaca
      781 ttggacagaa gtttgtaggc tgcatgcttt tctcagcttc tggtcttgat atcagcttgt
      841 gctttaccag ccccttccct aatcctaaca gatgcaccag tttgacatgt actcaatcaa
      901 gcttcccagt agatagcaac ttcagacttt acagtgtagc ttgaacttag gctgtctcgc
      961 tgctgggagc aagagtaagg ccttcaaaag cccctcattg gtgaccctct tctacaaatg
     1021 gagggggccc tatgtaactg acttaaataa tctgctcaaa ctgttgtggc tgttgctatg
     1081 caactgaagc tgtaaatgag cacttagtgg atgacctaca tcgttcaggc agagaacaat
     1141 ctctaaacat ccattctctc tatttataga tagaaactct cttcaagtcc aaaaactacg
     1201 agttcatgtg gaacccacac ttgggctaca tcctgacctg cccatccaac cttggaacgg
     1261 ggctccgtgc tggtgtgcac atcaagctac caaaccttgg gaagcatgag aagtttggag
     1321 aagtcctcaa gaggcttcgg ctgcagaaac gaggcacagg tgagaggtgg tgatacagtg
     1381 atgccccaag caggctgcct ctgaccagac agcagtgaga gcctggctgc agcgaagcta
     1441 atcttggttc atgctgacag tgcctgacag accatccaga agggaccctt ctgaagcgtg
     1501 gggggggaca accaagcatt aaaagtgtag cttagaagag gtggagatgc tggtactttg
     1561 cttaagcctt attatatcca gcttct
//
LOCUS       CHKBABA4      650 bp ds-DNA             VRT       03-AUG-1990
DEFINITION  G.gallus B-creatine kinase (B-CK) gene, exon 8.
ACCESSION   M33714 J05496
KEYWORDS    B-creatine kinase.
SEGMENT     4 of 4
SOURCE      G.gallus (strain White Leghorn) adult blood cell DNA, clone T8.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 650)
  AUTHORS   Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C.
  TITLE     A unique chicken B-creatine kinase gene gives rise to two
            B-creatine kinase isoproteins with distinct N-termini by
            alternative splicing
  JOURNAL   J. Biol. Chem. 265, 11656-11666 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Wirz, 13-APR-1990.
FEATURES       from  to/span     description
    pept    +    49      227     B-creatine kinase, exon 8
    pep$    +    49      227     B-creatine kinase, exon 8
    pre-msg       1  >   396     B-creatine kinase mRNA and introns
    IVS     <     1       48     B-creatine kinase intron J
    signal      391      396     poly-A signal
BASE COUNT      161 a    129 c    161 g    199 t
ORIGIN      
        1 aactggctca ggcagcggtc agcgtactaa ttaaatctct tcgtccaggt ggtgtggaca
       61 cagctgcagt cggaggggtg tttgatgtat ccaatgctga tcgtcttggc ttctctgagg
      121 tagagctggt gcagatggtg gtagatggtg tgaagctgct cattgaaatg gaaaaacgcc
      181 ttgagaaagg ccagtccatt gatgacctca tgccagctca gaaataaagc actttattct
      241 catgcttcct aacttattgg atgaataata aaatgtcact ccaatttcaa accccttggg
      301 tcagagccca cttagttaca ctgtagagaa gtcttccatc catctgtgtt agagtttatt
      361 ttttgatggc tgaaatgttg ttgaaaatga aataaactgt tgttttggcc tgacctgcct
      421 tagatgtgtt aactgagtgt ctttgaggtc taagtgactt gctgttcctg gtaagctgaa
      481 ctgattggtt ttaggtgcta gcacatcatc ctggctgtgc tgctgtagtc tgcttggtgg
      541 gcttagaaat gtgcttatag cagtgtcaaa agcaagcctc taatgtgatt gcagctgccg
      601 taaacctgag aactggtgct aaacatgagt cactttaaac ctgccctacc
//
LOCUS       HUMSATIII1    132 bp ds-DNA             PRI       03-AUG-1990
DEFINITION  Human satellite III DNA fragment, 5' end.
ACCESSION   M25431
KEYWORDS    satellite repetitive sequence.
SEGMENT     1 of 3
SOURCE      Human DNA, clone lambda-HS3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 132)
  AUTHORS   Cooke,H.J. and Hindley,J.
  TITLE     Cloning of human satellite III DNA: Different components are on
            different chromosomes
  JOURNAL   Nucleic Acids Res. 6, 3177-3197 (1979)
  STANDARD  simple staff_review
BASE COUNT       39 a     27 c     14 g     52 t
ORIGIN      
        1 aattcatttg aagacaattc cattcaatac caattgatga tggttatttt tgattccatt
       61 tgatgatgat tacattccat ttcatcataa ttccattcga ttccactcga gattccattc
      121 gattccattc aa
//
LOCUS       HUMSATIII2     75 bp ds-DNA             PRI       03-AUG-1990
DEFINITION  Human satellite III DNA fragment.
ACCESSION   M25432
KEYWORDS    satellite repetitive sequence.
SEGMENT     2 of 2
SOURCE      Human satellite DNA, clone lambda-HS3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Cooke,H.J. and Hindley,J.
  TITLE     Cloning of human satellite III DNA: Different components are on
            different chromosomes
  JOURNAL   Nucleic Acids Res. 6, 3177-3197 (1979)
  STANDARD  simple staff_review
BASE COUNT       19 a     15 c     11 g     30 t
ORIGIN      
        1 cgaatgaatg agtccatcca tttcaatttc atgataattc cattcgtttc aattcgatgg
       61 tgtttccatt cgatt
//
LOCUS       HUMSATIII3     93 bp ds-DNA             PRI       03-AUG-1990
DEFINITION  Human satellite III DNA fragment, 3' end.
ACCESSION   M25433
KEYWORDS    satellite repetitive sequence.
SEGMENT     3 of 3
SOURCE      Human satellite DNA, clone lambda-HS3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 93)
  AUTHORS   Cooke,H.J. and Hindley,J.
  TITLE     Cloning of human satellite III DNA: Different components are on
            different chromosomes
  JOURNAL   Nucleic Acids Res. 6, 3177-3197 (1979)
  STANDARD  simple staff_review
BASE COUNT       21 a     20 c     16 g     36 t
ORIGIN      
        1 ttcattcgat tcatttgatg atgattcatg cgcgattcat tagatgatga cccctttcat
       61 ttccattcaa tggaggattc cattcggttc cat
//
LOCUS       MS2AAR        218 bp ss-RNA             PHG       03-AUG-1990
DEFINITION  Bacteriophage MS2 A protein mRNA, 3' end, and coat protein mRNA,
            complete cds.
ACCESSION   M25187
KEYWORDS    A protein; coat protein.
SOURCE      Bacteriophage MS2 RNA.
  ORGANISM  Bacteriophage MS2
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Leviviridae.
REFERENCE   1  (bases 1 to 218)
  AUTHORS   Contreras,R., Ysebaert,M., Min Jou,W. and Fiers,W.
  TITLE     Bacteriophage MS2 RNA: Nucleotide sequence of the end of the A
            protein gene and the intercistronic region
  JOURNAL   Nature New Biol. 241, 99-101 (1973)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      137     A protein (AA at 3)
    pept        161  >   218     coat protein
BASE COUNT       54 a     58 c     52 g     54 t
ORIGIN      
        1 ctaaggccca aatctcagcc atgcatcggg gagtacaatc cgtatggcca acaactggcg
       61 cgtacgtaaa gtctcctttc tcgatggtcc ataccttaga tgcgttagca ttaatcaggc
      121 aacggctctc tagatagagc cctcaaccgg agtttgaagc atggcttcta actttactca
      181 gttcgttctc gtcgacaatg gcggaactgg cgacgtga
//
LOCUS       MUSCMDH1      567 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 1.
ACCESSION   M36076
KEYWORDS    cytosolic malate dehydrogenase.
SEGMENT     1 of 9
SOURCE      Mouse (strain C3H/He) male liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 567)
  AUTHORS   Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
  TITLE     Structural organization of the mouse cytosolic malate dehydrogenase
            gene: Comparison with that of the mouse mitochondrial malate
            dehydrogenase gene
  JOURNAL   J. Mol. Biol. 202, 355-364 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        540  +   542     cytosolic malate dehydrogenase, exon 1
                                 (cMDHase) (EC 1.1.1.37)
    pre-msg     458  >   567     cytosolic malate dehydrogenase mRNA and
                                 introns (alt.)
    pre-msg     459  >   567     cytosolic malate dehydrogenase mRNA and
                                 introns (alt.)
    IVS         543  >   567     cytosolic malate dehydrogenase intron A
BASE COUNT      109 a    156 c    169 g    133 t
ORIGIN      
        1 gttcttccgc aagcgtcaat tcctcccgcc tctgagagag tttttaaaag gtttgtttcc
       61 gggtcgagcg ggctttaagc aacggaaggt ctcttagccg cccttgttta gtcttgggga
      121 ggatagattc tcgtggagcg acgtgtgtgt cgctcagggg tcggtttctc ctccctcgag
      181 ttaacgcctc ctctcctgcc aattgctgag cgccatcagg caggcgcctc actcaaagca
      241 ccaaccctct gctcacagac gcgctccaat caccgaggct cagcccggga ctactttgca
      301 gcgaggcgcg taggaagaag gggtttgggg gaattgtagt ttagcactgt caggaatgca
      361 cgaaggtggg cgccagaggt cgcggaagaa ctacacttcc cagaaagggc cgtgtctcca
      421 gccgaagcct ttgcgggcca gccccggttc tctcccagag tctgttccgc tgtagaggtg
      481 acctgactgc tggagactgc cttttgcagg tgcagagatc ggccttgcag tttgcaataa
      541 tggtgaggtg ggctctggaa ctcacac
//
LOCUS       MUSCMDH2      149 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 2.
ACCESSION   M36077
KEYWORDS    cytosolic malate dehydrogenase.
SEGMENT     2 of 9
SOURCE      Mouse (strain C3H/He) male liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 149)
  AUTHORS   Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
  TITLE     Structural organization of the mouse cytosolic malate dehydrogenase
            gene: Comparison with that of the mouse mitochondrial malate
            dehydrogenase gene
  JOURNAL   J. Mol. Biol. 202, 355-364 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    27  +   125     cytosolic malate dehydrogenase, exon 2
                                 (cMDHase)
    pre-msg <     1  >   149     cytosolic malate dehydrogenase mRNA and
                                 introns
    IVS     <     1       26     cytosolic malate dehydrogenase intron A
    IVS         126  >   149     cytosolic malate dehydrogenase intron B
BASE COUNT       41 a     25 c     36 g     47 t
ORIGIN      About 4.8 kb after segment 1.
        1 taaactagtg gtctttgtca ttacagtctg aaccaatcag agtccttgtg actggagcag
       61 ctggtcaaat tgcatattca ctgttgtaca gtattggaaa tggatctgtc tttgggaaag
      121 accaggtagg ggcatgttct tataaatac
//
LOCUS       MUSCMDH3      147 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 3.
ACCESSION   M36078
KEYWORDS    cytosolic malate dehydrogenase.
SEGMENT     3 of 9
SOURCE      Mouse (strain C3H/He) male liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 147)
  AUTHORS   Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
  TITLE     Structural organization of the mouse cytosolic malate dehydrogenase
            gene: Comparison with that of the mouse mitochondrial malate
            dehydrogenase gene
  JOURNAL   J. Mol. Biol. 202, 355-364 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    26  +   122     cytosolic malate dehydrogenase exon 3
                                 (cMDHase)
    pre-msg <     1  >   147     cytosolic malate dehydrogenase mRNA and
                                 introns
    IVS     <     1       25     cytosolic malate dehydrogenase intron B
    IVS         123  >   147     cytosolic malate dehydrogenase intron C
BASE COUNT       27 a     39 c     40 g     41 t
ORIGIN      About 0.8 kb after segment 2.
        1 gcctgctgtc cttgctcttt ggcagcccat cattcttgtg ctgttggaca tcacccccat
       61 gatgggtgtt ctggacggtg tcctgatgga actgcaagac tgtgcccttc cccttctgca
      121 gggtgagttg gaagtcaaag aaaacag
//
LOCUS       MUSCMDH4      226 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 4.
ACCESSION   M36079
KEYWORDS    cytosolic malate dehydrogenase.
SEGMENT     4 of 9
SOURCE      Mouse (strain C3H/He) male liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 226)
  AUTHORS   Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
  TITLE     Structural organization of the mouse cytosolic malate dehydrogenase
            gene: Comparison with that of the mouse mitochondrial malate
            dehydrogenase gene
  JOURNAL   J. Mol. Biol. 202, 355-364 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    26  +   201     cytosolic malate dehydrogenase, exon 4
    pre-msg <     1  >   226     cytosolic malate dehydrogenase mRNA and
                                 introns
    IVS     <     1       25     cytosolic malate dehydrogenase intron C
    IVS         202  >   226     cytosolic malate dehydrogenase intron D
BASE COUNT       69 a     45 c     62 g     50 t
ORIGIN      About 1.6 kb after segment 3.
        1 tgtgtgttgt ttgccatgtc catagatgtc attgcaacgg acaaagaaga gattgccttc
       61 aaagacctgg atgtggctgt cctagtgggc tccatgccaa gaagggaagg catggagagg
      121 aaggacctac tgaaagccaa tgtgaaaatc ttcaaatccc agggcacagc cttggagaaa
      181 tacgccaaga aatcagttaa ggtgactcac acagatttca tggggt
//
LOCUS       MUSCMDH5      173 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 5.
ACCESSION   M36080
KEYWORDS    cytosolic malate dehydrogenase.
SEGMENT     5 of 9
SOURCE      Mouse (strain C3H/He) male liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 173)
  AUTHORS   Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
  TITLE     Structural organization of the mouse cytosolic malate dehydrogenase
            gene: Comparison with that of the mouse mitochondrial malate
            dehydrogenase gene
  JOURNAL   J. Mol. Biol. 202, 355-364 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    26  +   148     cytosolic malate dehydrogenase, exon 5
    pre-msg <     1  >   173     cytosolic malate dehydrogenase mRNA and
                                 introns
    IVS     <     1       25     cytosolic malate dehydrogenase intron D
    IVS         149  >   173     cytosolic malate dehydrogenase intron E
BASE COUNT       48 a     49 c     35 g     41 t
ORIGIN      About 0.9 kb after segment 4.
        1 tctgctctgt gcctccacca tctaggtcat tgttgtggga aacccagcca atacgaactg
       61 cctgacagcc tccaagtcag cgccatcgat ccccaaggag aatttcagtt gcctgactcg
      121 cttggaccac aaccgagcaa aatctcaagt aagaaaaatg gcttttagat tta
//
LOCUS       MUSCMDH6      227 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 6.
ACCESSION   M36081
KEYWORDS    cytosolic malate dehydrogenase.
SEGMENT     6 of 9
SOURCE      Mouse (strain C3H/He) male liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 227)
  AUTHORS   Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
  TITLE     Structural organization of the mouse cytosolic malate dehydrogenase
            gene: Comparison with that of the mouse mitochondrial malate
            dehydrogenase gene
  JOURNAL   J. Mol. Biol. 202, 355-364 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    26  +   202     cytosolic malate dehydrogenase, exon 6
    pre-msg <     1  >   227     cytosolic malate dehydrogenase mRNA and
                                 introns
    IVS     <     1       25     cytosolic malate dehydrogenase intron E
    IVS         203  >   227     cytosolic malate dehydrogenase intron F
BASE COUNT       71 a     40 c     60 g     56 t
ORIGIN      About 3.4 kb after segment 5.
        1 tgatatgatg ttttacatga actagattgc tcttaaactc ggtgtaaccg ctgatgatgt
       61 aaagaatgtc attatctggg gaaatcattc atcgacccag tatccagatg tcaatcatgc
      121 caaggtgaaa ctgcaaggaa aggaagtcgg tgtgtatgaa gccctgaaag acgacagctg
      181 gctgaaggga gagttcatca cggtaagaag gatgtgaacc ctctgag
//
LOCUS       MUSCMDH7      164 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 7.
ACCESSION   M36082
KEYWORDS    cytosolic malate dehydrogenase.
SEGMENT     7 of 9
SOURCE      Mouse (strain C3H/He) male liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 164)
  AUTHORS   Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
  TITLE     Structural organization of the mouse cytosolic malate dehydrogenase
            gene: Comparison with that of the mouse mitochondrial malate
            dehydrogenase gene
  JOURNAL   J. Mol. Biol. 202, 355-364 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    26  +   139     cytosolic malate dehydrogenase, exon 7
    pre-msg <     1  >   164     cytosolic malate dehydrogenase mRNA and
                                 introns
    IVS     <     1       25     cytosolic malate dehydrogenase intron F
    IVS         140  >   164     cytosolic malate dehydrogenase intron G
BASE COUNT       34 a     46 c     44 g     40 t
ORIGIN      About 0.7 kb after segment 6.
        1 actgtctctc tgttgtccca cccagactgt gcaacagcgt ggtgctgctg tcatcaaggc
       61 tcggaagctg tccagtgcaa tgtctgctgc gaaagccatc gcagaccaca tcagagacat
      121 ctggtttgga accccagagg tgagggttct catttgtact ggcc
//
LOCUS       MUSCMDH8      131 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 8.
ACCESSION   M36083
KEYWORDS    cytosolic malate dehydrogenase.
SEGMENT     8 of 9
SOURCE      Mouse (strain C3H/He) male liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 131)
  AUTHORS   Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
  TITLE     Structural organization of the mouse cytosolic malate dehydrogenase
            gene: Comparison with that of the mouse mitochondrial malate
            dehydrogenase gene
  JOURNAL   J. Mol. Biol. 202, 355-364 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    26  +   115     cytosolic malate dehydrogenase, exon 8
    pre-msg <     1  >   131     cytosolic malate dehydrogenase mRNA and
                                 introns
    IVS     <     1       25     cytosolic malate dehydrogenase intron G
    IVS         116  >   131     cytosolic malate dehydrogenase intron H
BASE COUNT       27 a     27 c     36 g     41 t
ORIGIN      About 0.3 kb after segment 7.
        1 gctatgataa tgtaaacttt ttcagggaga gttcgtgtcg atgggtgtta tctctgatgg
       61 caactcctat ggtgtccctg atgacctgct ctactcactc cctgtcgtga tcaaggtggg
      121 tacatggaga g
//
LOCUS       MUSCMDH9      528 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 9.
ACCESSION   M36084
KEYWORDS    cytosolic malate dehydrogenase.
SEGMENT     9 of 9
SOURCE      Mouse (strain C3H/He) male liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 528)
  AUTHORS   Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K.
  TITLE     Structural organization of the mouse cytosolic malate dehydrogenase
            gene: Comparison with that of the mouse mitochondrial malate
            dehydrogenase gene
  JOURNAL   J. Mol. Biol. 202, 355-364 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    26      151     cytosolic malate dehydrogenase, exon 9
                                 (cMDHase)
    pre-msg <     1  >   353     cytosolic malate dehydrogenase mRNA and
                                 introns
    IVS     <     1       25     cytosolic malate dehydrogenase intron H
    signal      329      334     poly-A signal
    signal      449      454     poly-A signal
BASE COUNT      175 a     98 c    100 g    155 t
ORIGIN      About 1.4 kb after segment 8.
        1 agctctcgcc cttgtcccct gacagaataa gacctggaag tttgttgaag gcctccccat
       61 taatgacttc tcccgtgaaa agatggacct gacagcaaag gagctgaccg aggaaaagga
      121 gaccgctttt gagtttctct cctctgcgtg actagacact cgttttgaca tcagcagaca
      181 gccgaaggct gaggaatcaa aatgtcgtct ttgagcctag taccaaacag taataatgct
      241 acattcaaat tgtgaacagc aaaatatttt aaatagtgtg tgctttatga tttgtgaaag
      301 tctatcatgt tgttagtgct gcaatctaaa taaaagtata ttcaagtgaa aatctctcag
      361 actctgtttc tactttatat ttagtatctt caggaaaaca agtttggccc aatagattat
      421 aattttactt ttttaattga ctaaaagaaa taaagatgga aaatattatg aagtaaagca
      481 ttagtctcta acataaacaa ggaagcccaa tcaatttcag agggatcc
//
LOCUS       MUSMHHLABA    158 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse MHC class I HLA-B7 gene, 5' flank.
ACCESSION   M35971
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Mouse L cell line DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 158)
  AUTHORS   Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I.,
            Probst,H. and Mueller,C.A.
  TITLE     Differential regulation of HLA class I genes by interferon
  JOURNAL   Immunogenetics 31, 245-252 (1990)
  STANDARD  simple staff_review
BASE COUNT       21 a     53 c     43 g     41 t
ORIGIN      
        1 ggtctcaggg tctcaggctc cgagggccgg gtctgcaatg gggaggcgca cgttggggat
       61 tccccactcc cctgagtttc acttcttctc ccaacttgtg tcgggtcctt cttccaggat
      121 actcgtgacg cgtccccact tcccactccc attgggta
//
LOCUS       MUSMHHLABB    160 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse MHC class I HLA-Bw64 gene, 5' flank.
ACCESSION   M35972
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Mouse L cell line DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 160)
  AUTHORS   Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I.,
            Probst,H. and Mueller,C.A.
  TITLE     Differential regulation of HLA class I genes by interferon
  JOURNAL   Immunogenetics 31, 245-252 (1990)
  STANDARD  simple staff_review
BASE COUNT       21 a     55 c     43 g     41 t
ORIGIN      
        1 ggctctcagg gtctcaggct ccgagggccg cgtctgcaat ggggaggcgc agcgttgggg
       61 attccccact cccctgagtt tcacttcttc tcccaacttg tgtcgggtcc ttcttccagg
      121 atactcgtga cgcatcccca cttcccactc ccattgggtg
//
LOCUS       MUSMHHLABC    159 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse MHC class I HLA-B38 gene, 5' flank.
ACCESSION   M35973
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Mouse L cell line DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 159)
  AUTHORS   Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I.,
            Probst,H. and Mueller,C.A.
  TITLE     Differential regulation of HLA class I genes by interferon
  JOURNAL   Immunogenetics 31, 245-252 (1990)
  STANDARD  simple staff_review
BASE COUNT       21 a     52 c     44 g     42 t
ORIGIN      
        1 ggctctcagg gtctcaggct cgagggccgc gtctgtaatg gggaggcgga gcgttgggga
       61 ttccccactc ccctgagttt cacttcttct cccaacttgt gtcgggtcct tcttccagga
      121 tactcgtgac gcatccccac ttcccactcc cattgggtg
//
LOCUS       MUSMHHLABD    159 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse MHC class I HLA-B39 gene, 5' flank.
ACCESSION   M35974
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Mouse L cell line DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 159)
  AUTHORS   Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I.,
            Probst,H. and Mueller,C.A.
  TITLE     Differential regulation of HLA class I genes by interferon
  JOURNAL   Immunogenetics 31, 245-252 (1990)
  STANDARD  simple staff_review
BASE COUNT       21 a     52 c     44 g     42 t
ORIGIN      
        1 ggctctcagg gtctcaggct cgagggccgc gtctgtaatg gggaggcgga gcgttgggga
       61 ttccccactc ccctgagttt cacttcttct cccaacttgt gtcgggtcct tcttccagga
      121 tactcgtgac gcatccccac ttcccactcc cattgggtg
//
LOCUS       MUSTCEAEA     396 bp ss-mRNA            ROD       03-AUG-1990
DEFINITION  Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION   M35486
KEYWORDS    T-cell receptor; constant region; diversity exon; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL23.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 396)
  AUTHORS   Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
            Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
  TITLE     Autoimmune disease and T-cell immunologic recognition
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   396     T-cell receptor alpha-chain precursor (AA at
                                 1)
    sigp    <     1       33     T-cell receptor alpha-chain signal peptide
    matp         34  >   396     T-cell receptor alpha-chain
    recomb      327      328     V-region end/J-region start
    recomb      385      386     J-region end/C-region start
BASE COUNT      113 a    101 c     95 g     87 t
ORIGIN      
        1 ctaggccttc acctagctgg ggtgagtggc cagcaggaga aacgtgacca gcagcaggtg
       61 agacaaagtc cccaatctct gacagtctgg gaaggagaga ccgcaattct gaactgcagt
      121 tatgagaaca gtgcttttga ctacttccca tggtaccagc agttccctgg ggaaggtccc
      181 gctctcctga tatccatact ttcagtgtcc gataaaaagg aagatggacg attcacaatc
      241 ttcttcaata aaagggagaa aaagctctcc ttgcacattg cagactctca gcctggagac
      301 tcagccacct acttctgtgc agcaagtgca aattctggga cttaccagag gtttggaact
      361 gggacaaaac tccaagtcgt tccaaacatc cagaac
//
LOCUS       MUSTCEAEB     384 bp ss-mRNA            ROD       03-AUG-1990
DEFINITION  Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION   M35487
KEYWORDS    T-cell receptor; constant region; diversity exon; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL51.1.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 384)
  AUTHORS   Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
            Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
  TITLE     Autoimmune disease and T-cell immunologic recognition
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   384     T-cell receptor alpha-chain precursor (AA at
                                 1)
    sigp    <     1       39     T-cell receptor alpha-chain signal peptide
    matp         40  >   384     T-cell receptor alpha-chain
    recomb      315      316     V-region end/J-region start
    recomb      373      374     J-region end/C-region start
BASE COUNT      111 a     96 c     93 g     84 t
ORIGIN      
        1 ctagctgggg tgagtggcca gcaggagaaa cgtgaccagc agcaggtgag acaaagtccc
       61 caatctctga cagtctggga aggagagacc gcaattctga actgcagtta tgagaacagt
      121 gcttttgact acttcccatg gtaccagcag ttccctgggg aaggtcccgc tctcctgata
      181 tccatacttt cagtgtccga taaaaaggaa gatggacgat tcacaatctt cttcaataaa
      241 agggagaaaa agctctcctt gcacattgca gactctcagc ctggagactc agccacctac
      301 ttctgtgcag caagtgcaaa ttctgggact taccagaggt ttggaactgg gacaaaactc
      361 caagtcgttc caaacatcca gaac
//
LOCUS       MUSTCEAEC     388 bp ss-mRNA            ROD       03-AUG-1990
DEFINITION  Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION   M35488
KEYWORDS    T-cell receptor; constant region; diversity exon; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL83.12.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 388)
  AUTHORS   Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
            Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
  TITLE     Autoimmune disease and T-cell immunologic recognition
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   388     T-cell receptor alpha-chain precursor (AA at
                                 2)
    sigp    <     1       43     T-cell receptor alpha-chain signal peptide
    matp         44  >   388     T-cell receptor alpha-chain
    recomb      319      320     V-region end/J-region start
    recomb      377      378     J-region end/C-region start
BASE COUNT      112 a     98 c     93 g     85 t
ORIGIN      
        1 tcacctagct ggggtgagtg gccagcagga gaaacgtgac cagcagcagg tgagacaaag
       61 tccccaatct ctgacagtct gggaaggaga gaccgcaatt ctgaactgca gttatgagaa
      121 cagtgctttt gactacttcc catggtacca gcagttccct ggggaaggtc ccgctctcct
      181 gatatccata ctttcagtgt ccgataaaaa ggaagatgga cgattcacaa tcttcttcaa
      241 taaaagggag aaaaagctct ccttgcacat tgcagactct cagcctggag actcagccac
      301 ctacttctgt gcagcaagtg caaattctgg gacttaccag aggtttggaa ctgggacaaa
      361 actccaagtc gttccaaaca tccagaac
//
LOCUS       MUSTCEAED     359 bp ss-mRNA            ROD       03-AUG-1990
DEFINITION  Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION   M35489
KEYWORDS    T-cell receptor; constant region; diversity exon; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL127.6.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 359)
  AUTHORS   Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
            Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
  TITLE     Autoimmune disease and T-cell immunologic recognition
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   359     T-cell receptor alpha-chain precursor (AA at
                                 3)
    sigp    <     1       14     T-cell receptor alpha-chain signal peptide
    matp         15  >   359     T-cell receptor alpha-chain
    recomb      290      291     V-region end/J-region start
    recomb      348      349     J-region end/C-region start
BASE COUNT      107 a     91 c     81 g     80 t
ORIGIN      
        1 agaaacgtga ccagcagcag gtgagacaaa gtccccaatc tctgacagtc tgggaaggag
       61 agaccgcaat tctgaactgc agttatgaga acagtgcttt tgactacttc ccatggtacc
      121 agcagttccc tggggaaggt cccgctctcc tgatatccat actttcagtg tccgataaaa
      181 aggaagatgg acgattcaca atcttcttca ataaaaggga gaaaaagctc tccttgcaca
      241 ttgcagactc tcagcctgga gactcagcca cctacttctg tgcagcaagt gcaaattctg
      301 ggacttacca gaggtttgga actgggacaa aactccaagt cgttccaaac atccagaac
//
LOCUS       MUSTCEAEE     396 bp ss-mRNA            ROD       03-AUG-1990
DEFINITION  Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION   M35490
KEYWORDS    T-cell receptor; constant region; diversity exon; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL172.10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 396)
  AUTHORS   Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
            Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
  TITLE     Autoimmune disease and T-cell immunologic recognition
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   396     T-cell receptor alpha-chain precursor (AA at
                                 1)
    sigp    <     1       51     T-cell receptor alpha-chain signal peptide
    matp         52  >   396     T-cell receptor alpha-chain
    recomb      327      328     V-region end/J-region start
    recomb      385      386     J-region end/C-region start
BASE COUNT      113 a    101 c     95 g     87 t
ORIGIN      
        1 ctaggccttc acctagctgg ggtgagtggc cagcaggaga aacgtgacca gcagcaggtg
       61 agacaaagtc cccaatctct gacagtctgg gaaggagaga ccgcaattct gaactgcagt
      121 tatgagaaca gtgcttttga ctacttccca tggtaccagc agttccctgg ggaaggtccc
      181 gctctcctga tatccatact ttcagtgtcc gataaaaagg aagatggacg attcacaatc
      241 ttcttcaata aaagggagaa aaagctctcc ttgcacattg cagactctca gcctggagac
      301 tcagccacct acttctgtgc agcaagtgca aattctggga cttaccagag gtttggaact
      361 gggacaaaac tccaagtcgt tccaaacatc cagaac
//
LOCUS       MUSTCEAEF     390 bp ss-mRNA            ROD       03-AUG-1990
DEFINITION  Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION   M35491
KEYWORDS    T-cell receptor; constant region; diversity exon; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL183.2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 390)
  AUTHORS   Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
            Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
  TITLE     Autoimmune disease and T-cell immunologic recognition
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   390     T-cell receptor alpha-chain precursor (AA at
                                 1)
    sigp    <     1       45     T-cell receptor alpha-chain signal peptide
    matp         46  >   390     T-cell receptor alpha-chain
    recomb      321      322     V-region end/J-region start
    recomb      379      380     J-region end/C-region start
BASE COUNT      112 a     99 c     93 g     86 t
ORIGIN      
        1 cttcacctag ctggggtgag tggccagcag gagaaacgtg accagcagca ggtgagacaa
       61 agtccccaat ctctgacagt ctgggaagga gagaccgcaa ttctgaactg cagttatgag
      121 aacagtgctt ttgactactt cccatggtac cagcagttcc ctggggaagg tcccgctctc
      181 ctgatatcca tactttcagt gtccgataaa aaggaagatg gacgattcac aatcttcttc
      241 aataaaaggg agaaaaagct ctccttgcac attgcagact ctcagcctgg agactcagcc
      301 acctacttct gtgcagcaag tgcaaattct gggacttacc agaggtttgg aactgggaca
      361 aaactccaag tcgttccaaa catccagaac
//
LOCUS       MUSTCEAEG     395 bp ss-mRNA            ROD       03-AUG-1990
DEFINITION  Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region.
ACCESSION   M35492
KEYWORDS    T-cell receptor; constant region; diversity exon; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL214.12.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 395)
  AUTHORS   Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W.,
            Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L.
  TITLE     Autoimmune disease and T-cell immunologic recognition
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   395     T-cell receptor alpha-chain precursor (AA at
                                 3)
    sigp    <     1       50     T-cell receptor alpha-chain signal peptide
    matp         51  >   395     T-cell receptor alpha-chain
    recomb      326      327     V-region end/J-region start
    recomb      384      385     J-region end/C-region start
BASE COUNT      108 a    101 c     98 g     88 t
ORIGIN      
        1 ctaggccttc acatgattgt gatgctcctc atattcacaa gggcccatgg agactcagtg
       61 actcagacgg aaggtcaagt ggccctctca gaagaggact ttcttacgat acactgcaac
      121 tactcagcct agggtaccca gctctgttct ggtacgagca gtatcccggg gaaggtccac
      181 agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggt tttgaagcta
      241 catatgataa agggaccacc tccttccact tgcggaaagc ctcagtgcaa gagtcagact
      301 cggctgtgta ctactgtgct ctgggtccaa attctgggac ttaccagagg tttggaactg
      361 ggacaaaact ccaagtcgtt ccaaacatcc agaac
//
LOCUS       RATIDA        198 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Rat ID-A element in the S-100 beta-subunit gene.
ACCESSION   M35983
KEYWORDS    .
SOURCE      Rat brain DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 198)
  AUTHORS   Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and
            Takahashi,Y.
  TITLE     ID sequences in the genes of three brain-specific proteins
  JOURNAL   Biochem. Int. 15, 809-816 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt           5       19     direct repeat
    rpt         184      198     direct repeat
BASE COUNT       94 a     26 c     58 g     20 t
ORIGIN      
        1 agctaaaaac gaactccgtg gggttgggga tttagctcag tggtagagcg cttgcctagc
       61 aagagcaagg ccctgggttc ggtccccagc tccgaaaaaa agaaaaaaag aaaaaagaaa
      121 aagaagaaga agaagaagaa gaagaagaag aagaagaaga agaagaagaa gaagaagaag
      181 aagaaaaaag aaacgcgt
//
LOCUS       RATIDB        141 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Rat ID-B element in the S-100 beta-subunit gene.
ACCESSION   M35984
KEYWORDS    .
SOURCE      Rat brain DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 141)
  AUTHORS   Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and
            Takahashi,Y.
  TITLE     ID sequences in the genes of three brain-specific proteins
  JOURNAL   Biochem. Int. 15, 809-816 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt           5       19     direct repeat
    rpt         121      135     direct repeat
BASE COUNT       52 a     26 c     43 g     20 t
ORIGIN      
        1 gtccaaagaa tggggcagag aggctgggga tttagctcag tggtagagcg cttacctagg
       61 aagcgacagg ccctgggttc ggtccccagc tccgaaagaa agaaccaaaa aaaaaaaaaa
      121 aaagaatggg gcagaataga t
//
LOCUS       RATIDC        141 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Rat ID-C element in the S-100 beta-subunit gene.
ACCESSION   M35985
KEYWORDS    .
SOURCE      Rat brain DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 141)
  AUTHORS   Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and
            Takahashi,Y.
  TITLE     ID sequences in the genes of three brain-specific proteins
  JOURNAL   Biochem. Int. 15, 809-816 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt           7       19     direct repeat
    rpt         113      125     direct repeat
BASE COUNT       39 a     32 c     48 g     22 t
ORIGIN      
        1 cccttagagc ccaagggcga gggttgggga tttagctcag tggtagagcg cttgcctggc
       61 aagtgcaagg ccctgggttc ggtccccagc tccggaaaaa aaaaaaaaaa aggagcccaa
      121 gggtgggtca aaggcagcta g
//
LOCUS       RSFGCA1        71 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  Plasmid RSF1010 transposon Tn951 5' inverted terminal repeat.
ACCESSION   M25019
KEYWORDS    inverted terminal repeat.
SEGMENT     1 of 3
SOURCE      Plasmid RSF1010 DNA.
  ORGANISM  Plasmid RSF1010
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 71)
  AUTHORS   Cornelis,G., Sommer,H. and Saedler,H.
  TITLE     Transposon Tn951 (TnLac) is defective and related to Tn3
  JOURNAL   Mol. Gen. Genet. 184, 241-248 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt          16       56     5' inverted terminal repeat
    rpt          11       15     5' insertion target sequence
BASE COUNT       23 a     13 c     19 g     16 t
ORIGIN      
        1 ttgtcaaact gactaggggt ctgacgctca gtggaacgaa aactcacgtt aagcaattgt
       61 ggctgagaaa a
//
LOCUS       RSFGCA2       109 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  Plasmid RSF1010 transposon Tn951 DNA fragment.
ACCESSION   M25020
KEYWORDS    inverted terminal repeat.
SEGMENT     2 of 3
SOURCE      Plasmid RSF1010 DNA.
  ORGANISM  Plasmid RSF1010
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 109)
  AUTHORS   Cornelis,G., Sommer,H. and Saedler,H.
  TITLE     Transposon Tn951 (TnLac) is defective and related to Tn3
  JOURNAL   Mol. Gen. Genet. 184, 241-248 (1981)
  STANDARD  simple staff_review
BASE COUNT       29 a     27 c     23 g     30 t
ORIGIN      
        1 gaatcggcat gagtgcaacg gcataagcct gcttaaaaaa caggcttaac gtgggatatt
       61 ttccgttttc caagcggacc cctatcaata tgctcggcca ttattcctt
//
LOCUS       RSFGCA3       105 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  Plasmid RSF1010 transposon Tn951 3' inverted terminal repeat.
ACCESSION   M25021
KEYWORDS    inverted terminal repeat.
SEGMENT     3 of 3
SOURCE      Plasmid RSF1010 DNA.
  ORGANISM  Plasmid RSF1010
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 105)
  AUTHORS   Cornelis,G., Sommer,H. and Saedler,H.
  TITLE     Transposon Tn951 (TnLac) is defective and related to Tn3
  JOURNAL   Mol. Gen. Genet. 184, 241-248 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt          50       90     3' inverted terminal repeat
    rpt          91       95     3' insertion target sequence
BASE COUNT       31 a     27 c     26 g     21 t
ORIGIN      
        1 accaaaggac atctgagacc attaaaagag gcgtcagagg tagaaaacgt tgcttaacgt
       61 gagttttcgt tccactgagc gtcagacccc gactacgtcc acgcg
//
LOCUS       SYNPROA       808 bp ss-mRNA            SYN       03-AUG-1990
DEFINITION  Manipulation-derived rearrangement of cattle prolactin mRNA.
ACCESSION   M35587
KEYWORDS    prolactin.
SOURCE      Synthetic, cDNA to mRNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 808)
  AUTHORS   Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and
            Baev,A.A.
  TITLE     Genetic engineering of peptide hormones: II. Possible polymorphism
            of preprolactin in cattle. Data of molecular cloning
  JOURNAL   Mol. Biol. 22, 117-121 (1988)
  STANDARD  simple staff_review
BASE COUNT      217 a    201 c    195 g    195 t
ORIGIN      
        1 ttgaacattt ccgaggagag gtcatggatg tagtgggaca ccatgactgc ccggtcaaac
       61 aggtctcgaa gggatacctg gcagttgcca ggcccattgg gacagacggg ggtggagacc
      121 acaccctggc acaagagtag atttgacacc accagcagca ggagcaggcg ggaccctgct
      181 ttctgcgacg aacctttgct gtccatggtg gtgatgattt caagaaacac acttcaccag
      241 gaagctctcg tcctatcccg gaagataaag aacaagccca acagacccat catgaagtcc
      301 ttatgagctt gattcttggg ttgctgcgct cctggaatga ccctctctat cacctagtca
      361 ccgaggtgcg gggtatgaaa ggagccccag atgctatcct atcgagggcc atagagattg
      421 aggaagaaaa caaacgactt ctggaaggca tggagatgat atttggccag gttattcctg
      481 gagccaaaga gactgagccc taccctgtgt ggtcaggact cccgtccctg caaactaagg
      541 atgaagatgc acgttattct gctttttata acctgctcca ctgcctgcgc agggattcaa
      601 gcaagattga cacttacctt aagctcctga attgcagaat catctacaac aacaactgct
      661 aagcccacat tccatcctat ccatttctga gatggttctt aatgatccat tccctggcaa
      721 acttctctga ggtttatagc tttgtaatgc atgcttggct ctaatgggtt tcatcttaaa
      781 taaaaacaga ctctgtagcg atgtcaaa
//
LOCUS       FLANSHB       856 bp ss-RNA             VRL       03-AUG-1990
DEFINITION  Influenza A/Houston/24269/85 (H3N2) nonstructural protein RNA (seg
            8).
ACCESSION   M17699
KEYWORDS    nonstructural protein.
SOURCE      Influenza A/Houston/24269/85 RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Buonagurio,D.A., Nakada,S., Parvin,J.D., Krystal,M., Palese,P. and
            Fitch,W.M.
  TITLE     Evolution of human influenza A viruses over 50 years: Rapid,
            uniform rate of change in NS gene
  JOURNAL   Science 232, 980-982 (1986)
  STANDARD  full staff_review
COMMENT     Computer-readable sequence for [1] kindly provided by P.Palese,
            12-OCT-1987.
FEATURES       from  to/span     description
BASE COUNT      280 a    173 c    198 g    205 t
ORIGIN      Unreported.
        1 caacactgtg tcaatcttcc aggtagattg ctttctttgg catgtccaca aacaagttgt
       61 agaccaagaa ctgagtgatg ccccattcct tgatcggctt cgccgagatc agaggtccct
      121 aaggggaaga ggcagcaccc tcggtctaga catcaaagca gccacccatg ttggaaagca
      181 gatagtagaa aagattctga aggaagaatc tgatgaggca cttaaaatga ccatggcctc
      241 cacacctgct tcgcgataca taactgacat gactattgag gaattgtcaa gaaactggtt
      301 catgctaatg cccaagcaga aagtggaagg acctctttgc atcagaatgg accaggcaat
      361 catggagaaa aacatcatgt tgaaagcgaa tttcagtgtg atctttgacc gactagagac
      421 cctagtatta ctaagggctt tcaccgaaga gggagcaatt gttggcgaaa tctcaccatt
      481 gccttctttt ccaggacata ctattgagga tgtcaaaaat gcaattgggg tcctcatcgg
      541 aggacttgaa tggaatgata acacagttcg agtctctaaa actctacaga gattcgcttg
      601 gggaagcagt aatgaggatg ggggacctcc acttactcca aaacagaaac ggaaaatggc
      661 gagaacagct aggtcaaaag tttgaagaga taagatggct gattgaagaa gtgagacaca
      721 gactaaaaac aactgagaat agttttgagc aaataacatt catgcaagcc ttacagctgc
      781 tgtttgaagt ggaacaggag ataagaactt tctcatttca gcttatttaa tgataaaaaa
      841 cacccttgtt tctact
//
LOCUS       VACSPI1A     3043 bp ds-DNA             VRL       03-AUG-1990
DEFINITION  Vaccinia virus serine protease inhibitor superfamily gene SPI-1.
ACCESSION   M24217
KEYWORDS    serine protease inhibitor.
SOURCE      Vaccinia virus DNA.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 3043)
  AUTHORS   Kotwal,G.J. and Moss,B.
  TITLE     Vaccinia Virus Encodes Two Proteins That Are Structurally Related
            to Members of the Plasma Serine Protease Inhibitor Superfamily
  JOURNAL   J. Virol. 63, 600-606 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence for [1] kindly
            submitted by B.Moss, 15-FEB-1990.
FEATURES       from  to/span     description
    pept    <     1      243     unspecified ORF
    pept        255      659     16 kDa ORF
    pept        927     1988     SPI-1 (40 kDa ORF)
    pept       2150     2722     21.5 ORF
BASE COUNT     1100 a    459 c    487 g    997 t
ORIGIN      
        1 tcacataatc tatttagaga tcgagtcatg cacgattata taagtaatac atatattgat
       61 cttgagtgtt tagatattat tagatcgttg gatggattcg atatcaatgg ttactttgaa
      121 ggacgtacac cacttcattg cgctatacaa cataacttca ctcagattgc taagtactta
      181 ttagatcgag gagctgatat agtcgtaccc aacacattga ttatacatca gtacatacag
      241 taaatagcat agatatggag gaggatacaa atatttcaaa taaagttata aggtacaaca
      301 ctgtcaataa tatatgggaa acattaccta acttctggac tggaactata aatccaggcg
      361 tggtctcgca taaagatgat atatatgttg tatgcgacat caaagatgaa aaaaatgtta
      421 aaacttgtat atttagatat aacacgaata cgtataacgg atgggaattg gtcacgacga
      481 cagaaagcag attatcagct ctgcatacta ttctttataa caataccata atgatgttac
      541 attgttatga atcgtatatg ttacaagata catttaatgt gtacactcgc gaatggaatc
      601 atatgtgtca tcaacattcg aatagttata tcatgtacaa tatactaccc atctactaaa
      661 tataatagaa taaaataaat gagtatgatc attttagata acgattgatt ttatcattac
      721 cgcttcattc ttatattctt tgcttacgga acctatattt agaaacatct actaacgatt
      781 ttttatgctt gcattattaa tggtatgtaa tatgattgat tgtgtacgca ataccaattt
      841 gttaagtatg aatacggggt acaaacataa actgaagttt aacattattt atttatgata
      901 tatatcgtta ttgtttggtc tataccatgg atatctttaa agaactaatc ttaaaacaca
      961 cggatgaaaa tgttttgatt tctccagttt ctattttatc tactttatct attctaaatc
     1021 atggagcagc tggttctaca gctgaacaac tatcaaaata tatagagaat atgaatgaga
     1081 atacacccga tgacaataat gacatggacg tagatattcc gtattgtgcg acactagcta
     1141 ccgcaaataa aatatacggt agcgatagta tcgagttcca cgcctccttc ctacaaaaaa
     1201 taaaagacga ttttcaaact gtaaacttta ataatgctaa ccaaacaaag gaactaatca
     1261 acgaatgggt taagacaatg acaaatggta aaattaattc cttattgact agtccgctat
     1321 ccattaatac tcgtatgaca gttgttagcg ccgtccattt taaagcaatg tggaaatatc
     1381 cattttctaa acatcttaca tatacagaca agttttatat ttctaagaat atagttacca
     1441 gtgttgatat gatggtgagc actgagaata acttgcaata tgtacatatt aatgaattat
     1501 tcggaggatt ctctattatc gatattccat acgagggaaa ctctagtatg gtaattatac
     1561 taccggacga catagaaggt atatataaca tagaaaaaaa tataacagat gaaaaattta
     1621 aaaaatggtg tggtatgtta tctactaaaa gtatagactt gtatatgcca aagtttaaag
     1681 tggaaatgac agaaccgtat aatctggtac cgattttaga aaatttagga cttactaata
     1741 tattcggata ttatgcagat tttagcaaga tgtgtaatga aactatcact gtagaaaaat
     1801 ttctacatac gacgtttata gatgttaatg aggagtatac agaagcatcg gccgttacag
     1861 gagtatttat gactaacttt tcgatggtat atcgtacgaa ggtctacata aaccatccat
     1921 tcatgtacat gattaaagac aacacaggac gtatactttt tatagggaaa tactgctatc
     1981 cgcaataaat ataaacaaat agacttttat cacgtttatc tatgtctaaa tattacaaat
     2041 agtaatagta taaactaaag ctgataatac ttaaaaaaat aataatatca tttacaatta
     2101 atagtataaa ctaaaaatta aacaaatcgt tattataagt aatatcaaaa tgatgatata
     2161 cggattaata gcgtgtctta tattcgtgac ttcatccatc gctagtccac tttatattcc
     2221 cgttattcca cccatttcgg aagataaatc gttcaatagt gtagaggtat tagtttcctt
     2281 gtttagagat gaccaaaaag actatacggt aacttctcag ttcaataact acactatcga
     2341 taccaaagac tggactatcg gcgtactatc cacacctgat ggtttggata taccattgac
     2401 taatataact tattggtcac ggtttactat aggtcgtgca ttgttcaaat cagagtctga
     2461 ggatattttc caaaagaaaa tgagtattct aggtgtttct atagaatgta agaagtcgtc
     2521 gacattactt acttttttga ccgtgcgtaa aatgactcga gtatttaata aatttccaga
     2581 tatggcttat tatcgaggag actgtttaaa agccgtttat gtaacaatga cttataaaaa
     2641 tactaaaact ggagagactg attacacgta cctctctaat ggggggttgc ctgcatacta
     2701 tcgtaatggg gtcgatggtt gattattgat tagtatattc cttattcttt ttattcacac
     2761 aaaaagaaca tttttataaa catgaaacca ctgtctaaat gtaattatga tcttgattta
     2821 tagatgaaga tcagccttta gaggatttta accagtatgt ttaatatgaa aaaaataaac
     2881 ataacatatt ttgagattaa gcgctattgt gcaagattat attagaatca aattaatctt
     2941 tcatacgaga aaaataacga catacgtcgt caacaaatta aactttttat ttattagtta
     3001 actagcttat agaacttgct cattgttatg tttctaaaac ggg
//
LOCUS       VACSPI2A     1475 bp ds-DNA             VRL       03-AUG-1990
DEFINITION  Vaccinia virus serine protease inhibitor superfamily gene SPI-2.
ACCESSION   M24218
KEYWORDS    serine protease inhibitor.
SOURCE      Vaccinia virus DNA.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 1475)
  AUTHORS   Kotwal,G.J. and Moss,B.
  TITLE     Vaccinia Virus Encodes Two Proteins That Are Structurally Related
            to Members of the Plasma Serine Protease Inhibitor Superfamily
  JOURNAL   J. Virol. 63, 600-606 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1475)
  AUTHORS   Kotwal,G.J. and Moss,B.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence for [2] kindly
            submitted by B.Moss, 15-FEB-1990.
FEATURES       from  to/span     description
    pept        295     1332     SPI-2
BASE COUNT      487 a    245 c    286 g    457 t
ORIGIN      
        1 tccatggaaa aacgaaagta gtataaaagt aataaaacaa aaaaaagaat ataaaaaatt
       61 tatagctact ttctttgagg actgttttcc tgaaggaaat gaacctctgg aattagttag
      121 atatatagaa ttagtataca cgttagatta ttctcaaact cctaattatg acagactacg
      181 taaactgttt atacaagatt gaaattatat tctttttttt atagagtgtg gtagtgttac
      241 ggatatttaa tattagacta tctctatcgc gctacacgac caatatcgat tactatggat
      301 atcttcaggg aaatcgcatc ttctatgaaa ggagagaatg tattcatttc tccagcgtca
      361 atctcgtcag tattgacaat actgtattat ggagctaatg gatccactgc tgaacagcta
      421 tcaaaatatg tagaaaagga ggagaacatg gataaggtta gcgctcaaaa tatctcattc
      481 aaatccataa ataaagtata tgggcgatat tctgccgtgt ttaaagattc ctttttgaga
      541 aaaattggcg ataagtttca aactgttgac ttcactgatt gtcgcactat agatgcaatc
      601 aacaagtgtg tagatatctt tactgagggg aaaatcaatc cactattgga tgaaccattg
      661 tctcctgata cctgtctcct agcaattagt gccgtatact ttaaagcaaa atggttgacg
      721 ccattcgaaa aggaatttac cagtgattat cccttttacg tatctccgac ggaaatggta
      781 gatgtaagta tgatgtctat gtacggcaag gcatttaatc acgcatctgt aaaggaatca
      841 ttcggcaact tttcaatcat agaactgcca tatgttggag atactagtat gatggtcatt
      901 cttccagaca agattgatgg attagaatcc atagaacaaa atctaacaga tacaaatttt
      961 aagaaatggt gtaactctct ggaagctacg tttatcgatg ttcacattcc caagtttaag
     1021 gtaacaggct cgtataatct ggtggatact ctagtaaagt caggactgac agaggtgttc
     1081 ggttcaactg gagattatag caatatgtgt aattcagatg tgagtgtcga cgctatgatc
     1141 cacaaaacgt atatagatgt caatgaagag tatacagaag cagctgcagc aacttgtgca
     1201 ctggtgtcag actgtgcatc aacaattaca aatgagttct gtgtagatca tccgttcatc
     1261 tatgtgatta ggcatgttga tggaaaaatt cttttcgttg gtagatattg ctctccgaca
     1321 actaattgtt aaccattttt tttaaaaaat agaaaaaaca tgtggtatta gtgcaggtcg
     1381 ttattcttcc aattgcaatt ggtaagatga cggccaactt tagtacccac gtcttttcac
     1441 cacagcactg tggatgtgac agactgacca gtatt
//
LOCUS       YSCARSX      1539 bp ds-DNA             PLN       03-AUG-1990
DEFINITION  Yeast (S.cerevisiae) chromosome III L terminal region DNA.
ACCESSION   M12922
KEYWORDS    .
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1539)
  AUTHORS   Button,L.L. and Astell,C.R.
  TITLE     The Saccharomyces cerevisiae chromosome III left telomere has a
            type X, but not a type Y', ARS region
  JOURNAL   Mol. Cell. Biol. 6, 1352-1356 (1986)
  STANDARD  simple staff_review
REFERENCE   2  (sites)
  AUTHORS   Longtine,M.S., Wilson,N.M., Petracek,M.E. and Berman,J.
  TITLE     A yeast telomere binding activity binds to two related telomere
            sequence motifs and is indistinguishable from RAP1
  JOURNAL   Curr. Genet. 16, 225-239 (1989)
  STANDARD  full staff_entry
FEATURES       from  to/span     description
    site        232      236     an unusual yeast telomere repeat motif (CCCCA)
BASE COUNT      514 a    551 c    130 g    344 t
ORIGIN      
        1 cccacacacc acacccacac cacacccaca caccacacac accacaccca cacacccaca
       61 ccacaccaca cccacaccac acccacacac ccacacccac acaccacacc cacacacacc
      121 acacccacac acaccacacc cacacaccac acccacacac acaccacacc cacacacacc
      181 acaccacacc cacaccacac ccacacccac acaccacacc acaccacacc cacaccacac
      241 accacaccca cacacaccac accacacaca ccacaccaca cccacacacc acacccacac
      301 acccacaccc acacacacca cacccacacc acacccacac ccacacaccc acaccctaac
      361 actaccctaa cactacccta ttctaaccct gattttacct gtctcccaac ttactctcca
      421 ttaccctacc tctccactcg ttaccctgtc tgattcaacc gtaccactcc caaccaccat
      481 ccatctctct acttactacc accaacccac cgtccaccat aaccgttacc ctccaactac
      541 ccgtatccaa ctccactacc gcttacccta ccatcgacca tgtcctactc actgtactgt
      601 tgttcaccca ccatattgaa acgtctacaa atgatcgtaa ataatacaca tatacttatc
      661 ctaccactct aatcccacta ccacatgcca tactcacctt cacttgtatt ctgatcggtc
      721 atacgcacac ggatgctaca gtatatacca tctcaaactt acctactttc atattcatca
      781 tcacccatct ctcaccatca gtaccaatgc actcgcatca ttatgcacgg cttgcctcag
      841 cggtctatac cctgtgccat ttacgcataa cgcccatcat tatccacatt ttaatatcta
      901 tatctcattc ggcgacacca aatattgtat aactgccctt aatacatacg ttataccact
      961 tttacaccat atactaacca ctcaatttat acacacttat gtcaatataa ccacaaaatc
     1021 accactaaaa tcacctaaac ataaaaatat tctactcttc aacaataata cataaacaca
     1081 ctcaattgcg tatctatacc accatgacgt cattaacgta aaagttcctt aatattgcaa
     1141 attgcttgaa cggatgccat ttcatgttga atgtggtaac ccaatagcat gatatgagta
     1201 atgctttagt attgtttcag agttgtttca gtaatgtttt agacaaggag aacatatagt
     1261 agcaaacctc taatccggta gtacttaaga aactacagtt tctatgtacg aaagcagtaa
     1321 ctatgtaatt attacattta catgacatat aggaaggtcc aataaactta ctacattatg
     1381 acctataagc tagatcgtaa ttcattacgt caacaggtta tgagccctag agcaatgctt
     1441 ctgagaacat aatcaatcta tctagcccca acaattataa acagtggctg tacggtatcg
     1501 agaccgctgc tgaatatgct aacgaatata tgaacgaat
//
LOCUS       VAC35K        840 bp ds-DNA             VRL       03-AUG-1990
DEFINITION  Vaccinia virus gene for 35K complement control protein.
ACCESSION   X13166
KEYWORDS    complement protein; regulatory protein; secretory protein.
SOURCE      Vaccinia virus DNA.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 840)
  AUTHORS   Kotwal,G.J. and Moss,B.
  TITLE     Vaccinia virus encodes a secretory polypeptide structurally related
            to complement control proteins
  JOURNAL   Nature 335, 176-178 (1988)
  STANDARD  simple automatic
COMMENT     *source: library=HindIII in M13; strain=wildtype.
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE         43     54       late transcription start site
            
               OPR         543    549       early transcription stop signal
FEATURES       from  to/span     description
    pept         52  >   840     35K preprotein
    sigp         52      108     signal peptide (AA -19 to -1)
    matp        109  >   840     mature 35K protein (AA 1-244)
BASE COUNT      257 a    152 c    179 g    252 t
ORIGIN      
        1 tttttattat ttgtacgatg tccaggataa catttttacg gataaataaa tatgaaggtg
       61 gagagcgtga cgttcctgac attgttggga ataggatgcg ttctatcatg ctgtactatt
      121 ccgtcacgac ccattaatat gaaatttaag aatagtgtgg agactgatgc taatgctaat
      181 tacaacatag gagacactat agaatatcta tgtctacctg gatacagaaa gcaaaaaatg
      241 ggacctatat atgctaaatg tacaggtact ggatggacac tctttaatca atgtattaaa
      301 cggagatgcc catcgcctcg agatatcgat aatggccaac ttgatattgg tggagtagac
      361 tttggctcta gtataacgta ctcttgtaat agcggatatc atttgatcgg tgaatctaaa
      421 tcgtattgtg aattaggatc tactggatct atggtatgga atcccgaggc acctatttgt
      481 gaatctgtta aatgccaatc ccctccatct atatccaacg gaagacataa cggatacgag
      541 gatttttata ccgatgggag cgttgtaact tatagttgca atagtggata ttcgttgatt
      601 ggtaactctg gtgtcctgtg ttcaggagga gaatggtccg atccacccac gtgtcagatt
      661 gttaaatgtc cacatcctac aatatcaaac ggatacttgt ctagcgggtt taaaagatca
      721 tactcataca acgacaatgt agactttaag tgcaagtacg gatataaact atctggttcc
      781 tcatcatcta cttgctctcc aggaaataca tggaagccgg aacttccaaa atgtgtacgc
//
LOCUS       BSUCISAB     2246 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  B.subtilis site-specific DNA recombinase (cisA) and RNA polymerase
            sigma-K subunit (cisB; spoIVCB) genes, complete cds.
ACCESSION   M29040
KEYWORDS    DNA recombinase; RNA polymerase sigma-K subunit.
SOURCE      B.subtilis (strain JH642) DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2246)
  AUTHORS   Sato,T., Samori,Y. and Kobayashi,Y.
  TITLE     The cisA cistron of Bacillus subtilis sporulation gene spoIVC
            encodes a protein homologous to a site-specific recombinase
  JOURNAL   J. Bacteriol. 172, 1092-1098 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Kobayashi, 14-OCT-1989.
FEATURES       from  to/span     description
    pept        134     1636     DNA recombinase (cisA) (gtg start codon)
    pept       2014     1544 (c) RNA polymerase sigma-K subunit (cisB)
    binding     119      127     ribosome binding site
    signal     1635     1660     pot. transcription termination signal
BASE COUNT      776 a    405 c    475 g    590 t
ORIGIN      3 bp upstream of PvuII site.
        1 cagctgttac atagcattac ccaaggggtg atgcatttta tgaaagtgat aatcatcgag
       61 ggaccgcaag ctgacaaatg cattaacgat tgctatcatt atttaataaa actttatagg
      121 aaggagattc agggtgatag caatatatgt aagggtatcg accgaggaac aagcgatcaa
      181 gggatcgagc atcgacagcc aaatcgaggc ctgtataaag aaagcaggga ctaaagatgt
      241 gctgaagtat gcagatgaag gattttcagg agagctttta gaacgtccgg ctttgaatcg
      301 cttgagggag gatgcaagca agggacttat aagtcaagtc atttgttacg atcctgaccg
      361 tctttctcgg aaattaatga atcagctaat cattgatgac gaattgcgaa agcgaaacat
      421 acctttgatt tttgtaaatg gtgaatacgc caattctcca gaaggtcaat tgtttttcgc
      481 aatgcgcggg gcaatctcag aatttgaaaa agccaaaatc aaagaacgga catcaagcgg
      541 ccgacttcaa aaaatgaaaa aaggcatgat cattaaagat tctaaactat atggctataa
      601 atttgttaaa gagaaaagaa ctcttgagat attagaagag gaagcaaaaa tcattcggat
      661 gatttttaac tatttcaccg atcataaaag cccttttttc ggcagagtaa atggtattgc
      721 tctacattta actcagatgg gggttaaaac aaaaaaaggc gccaaagtat ggcacaggca
      781 ggttgttcgg caaatattaa tgaactcttc ctataagggt gaacatagac agtataaata
      841 tgatacagag ggttcctatg tttcaaagca ggcagggaac aaatctataa ttaaaataag
      901 gcctgaagaa gaacaaatca ctgtgacaat tccagcaatt gttccagctg aacaatggga
      961 ttatgctcaa gaactcttag gtcaaagtaa aagaaaacac ttgagtatca gccctcacaa
     1021 ttacttgtta tcgggtttgg ttagatgcgg aaaatgcgga aataccatga cagggaagaa
     1081 aagaaaatca catggtaaag actactatgt atatacttgc cggaaaaatt attctggcgc
     1141 aaaggaccgc ggctgcggaa aagaaatgtc tgagaataaa ttgaaccggc atgtatgggg
     1201 tgaaattttt aaattcatca caaatcctca aaagtatgtt tcttttaaag aggctgaaca
     1261 atcaaatcac ctgtctgatg aattagaact tattgaaaaa gagatagaga aaacaaaaaa
     1321 aggccgcaag cgtcttttaa cgctaatcag cctaagcgat gacgatgatt tagacataga
     1381 tgaaatcaaa gcacaaatta ttgaactgca aaaaaagcaa aatcagctta ctgaaaagtg
     1441 taacagaatc cagtcaaaaa tgaaagtcct agatgatacg agctcaagtg aaaatgctct
     1501 aaaaagagcc atcgactatt ttcaatcaat cggtgcagat aacttaactc ttgaagataa
     1561 aaaaacaatt gttaacttta tcgtgaaaga agttaccatt gtggattctg acaccatata
     1621 tattgaaacg tattaaagag gggtgtatgc accccccttt tgtaattaca atctcatttt
     1681 caatacacct cgctgcatac gtcgccacct ttgtcccttt tccagcggaa tagctttcaa
     1741 ttcctttaat aagcccgatc gttccgatgg agattaagtc ctctgcatcc tcacctgtat
     1801 tttcgaactt tttcacaata tgggcgacca agcgaagatt atgctcaatc agcatgtttc
     1861 tggcatgttc atccccttta gccatgagct ctaagtattt tttttcttcg ctgcttgaga
     1921 gcggttgtgg aaaggcattg tttttcacgt aagatactaa aaagacaagc tctttaacaa
     1981 caaagccgag cgctgcgaaa acacctgtca ccatcgtcac ctccacaaaa gtatgtaggc
     2041 aaaagcctat atgtaaatgt atgtgaccgg gaggctgtct gtgtctgtac cgggaaattt
     2101 ctcgcggggg agtccggctt tggtaagcag catactaact tcaaatgtta aaaatattct
     2161 tcatcaagcg cccatacatt gaaatgaaca aaggtatggg gggatgggga tgaaaaaatg
     2221 gatggcaggc ctgtttcttg ctgcag
//
LOCUS       BSUSIG43     5127 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  Bacillus subtilis sigma 43 operon with P23-dnaE-rpoD genes
            (dnaE for DNA primase, rpoD for RNA polymerase).
ACCESSION   X03897
KEYWORDS    DNA primase; RNA polymerase; polymerase; primase;
            unidentified reading frame.
SOURCE      Bacillus subtilis.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 5127; enum. 1 to 5127)
  AUTHORS   Wang,L.-F. and Doi,R.H.
  TITLE     Nucleotide sequence and organization of Bacillus subtilis RNA
            polymerase major sigma (sigma 43) operon
  JOURNAL   Nucleic Acids Res. 14, 4293-4307 (1986)
  STANDARD  simple automatic
COMMENT     EMBL features not translated to GenBank features:
               key        from     to       description
            
               PRM         728    733       -35 region P1
               PRM         750    755       -10 region P1
               PRM         754    759       -35 region P2
               PRM         778    783       -10 region P2
               RBS         848    851       pot. rRNA binding site
            
               PRM         942    949       -35 region P3
               PRM         963    971       -10 region P3
               RBS        1462   1472       pot. rRNA binding site
            
               RBS        3462   3474       pot. rRNA binding site
            
               OPR        4609   4631       palindrome pot. transcription
                                            terminator
FEATURES       from  to/span     description
    pept        855     1445     P23 unidentified reading frame
    pept       1479     3290     dnaE
    pept       3489     4604     rpoD
BASE COUNT     1677 a    904 c   1177 g   1369 t
ORIGIN      
        1 gaattcttat gttgaagata taggcactat caatgaagtg atttcacttg caaaggcaga
       61 cggcggcatt atctgtttta cactcgtggt gccggaaatc agagaatatt tgatagccga
      121 agcggaaaaa gcaaatgttt tatattatga tattatcggc ccgttgattg ataaaatgga
      181 aacagcctac ggtttaacag cgaaatacga accggggcgg gtgcgccagc ttgatgaaga
      241 ttatttcaaa aaagtggagg ccatcgagtt tgcagttaaa tacgatgatg gacgtgatcc
      301 aagagggatt ttaaaagctg atatcgtttt gatcggcgtg tcaagaacgt ctaaaacacc
      361 gctgtctcaa tatctcgcac acaaacgcct gaaggttgcc aatgttccga ttgtaccgga
      421 ggttgatccg ccggaagaac tctttaacgt tgatccgaaa aaatgcatcg gtttaaagat
      481 tagccctgat aaactgaatc atatcagaaa agaacgttta aaatcactcg ggcttaatga
      541 taaagcgatt tatgcaaata tcaacagaat caaagaggaa ctcgagtatt tcgaaaagat
      601 tgtggatcgg atcggctgcc aggttgttga tgtttcaaat aaagcggttg aggaaacagc
      661 aaatattatc catcatctca aaacaaaaaa catataactc aggacgctct atcctgggtt
      721 tttggctgtg ccaaaaggga ataatgaaaa acaatagcat ctttgtgaag tttgtattat
      781 aataaaaaat tgtgataaaa tgattaattt taggtttaag gatcgtgtga tacgaataaa
      841 ctattatggg taagatgtca agaatttctc ccggaaattt ttcgacaaat tcatatacat
      901 ccacaataat aaaggatgtg cgattttgct tgcttttatg caggagttta atggagggat
      961 ggagaattac tcttcttaat gaacaagaaa agacgatttt tgtcgatgct gatgcttgtc
     1021 cggtaaaaga tgaaatttta caaacagcat ccgagtatga agttcaagtt ctttttgtcg
     1081 cttcatttga acattatcag ctttccagaa gcaatgaaga aaaatggaag tatgttgatc
     1141 ctcataaaga agctgctgat ttatatatcg caaatcacgt gaaaccggga gatattgttg
     1201 tgacgcagga catcggatta gcatctctgc tgttgaacag aaatgtctct gttatgtcgg
     1261 aaagaggtcg tctttacaag gaagacacga ttgattttgc cctagagggc cgtcattttt
     1321 ccggcaaaca aagaagaaaa ggcgtatatg ccaaagggcc taaaaaattg aataaagaag
     1381 atcgagaacg atttattaca ctgctgcaaa aaatcctgtc gaacgatgaa gggattttgc
     1441 actaaagcat cgaataatgt acgacggagt gttataagat gggaaatcgg ataccagatg
     1501 aaattgtgga tcaggtgcaa aagtcggcag atatcgttga agtcataggt gattatgttc
     1561 aattaaagaa gcaaggccga aactactttg gactctgtcc ttttcatgga gaaagcacac
     1621 cttcgttttc cgtatcgccc gacaaacaga tttttcattg ctttggctgc ggagcgggcg
     1681 gcaatgtttt ctctttttta aggcagatgg aaggctattc ttttgccgag tcggtttctc
     1741 accttgctga caaataccaa attgattttc cagatgatat aacagtccat tccggagccc
     1801 ggccagagtc ttctggagaa caaaaaatgg ctgaggcaca tgagctcctg aagaaatttt
     1861 accatcattt gttaataaat acaaaagaag gtcaagaggc actggattat ctgctttcta
     1921 ggggctttac gaaagagctg attaatgaat ttcagattgg ctatgctctt gattcttggg
     1981 actttatcac gaaattcctt gtaaagaggg gatttagtga ggcgcaaatg gaaaaagcgg
     2041 gtctcctgat cagacgcgaa gacggaagcg gatatttcga ccgcttcaga aaccgtgtca
     2101 tgtttccgat ccatgatcat cacggggctg ttgttgcttt ctcaggcagg gctcttggca
     2161 gccagcagcc taagtatatg aacagtcctg aaaccccgct ctttcataaa agcaaactgc
     2221 tttacaattt ttataaggcc cgccttcata tcagaaagca ggaaagagca gtcttatttg
     2281 aagggtttgc tgatgtctat acggccgtaa gctcggatgt aaaggaaagc atagccacga
     2341 tgggaacgtc tcttacagat gatcatgtca agatcctgag aagaaacgtc gaagaaatca
     2401 ttctttgcta tgactctgat aaagccggtt atgaagccac cttaaaagct tcggagcttc
     2461 tgcaaaaaaa aggctgcaaa gtcagagttg caatgattcc tgacggattg gaccctgatg
     2521 attacatcaa aaaattcggc ggggaaaaat ttaaaaacga cattattgac gcaagtgtca
     2581 ccgtaatggc gttcaaaatg caatatttcc gaaaaggaaa gaacctgtcc gatgaaggcg
     2641 accgcctagc ttacattaaa gacgtactga aagaaatcag cacgctttca gggtctctag
     2701 agcaggaagt ctatgtaaag cagcttgctt cagagttttc gctttcacag gagtctttaa
     2761 ctgagcagct gtctgttttc agcaagcaaa acaaacctgc tgacaatagc ggtgaaacta
     2821 aaacgcggcg agcgcatctg acgacaaaag caaggcaaaa acgtttgcgt ccggcgtatg
     2881 aaaatgcaga aaggctgtta ctcgctcaca tgcttcgaga tcggagcgtc atcaaaaaag
     2941 tgattgaccg ggtagggttt caatttaata ttgatgagca ccgggcatta gccgcttatc
     3001 tttatgcttt ttatgaagag ggagccgagc tgacgcctca gcatctgatg gccagggtga
     3061 cggatgatca tataagccag ctcttgtccg atatattaat gcttcaggtt aatcaagagc
     3121 ttagcgaagc cgagttatca gattatgtaa aaaaagtgtt gaatcaaaga aattggtcaa
     3181 tgataaaaga aaaagaggcg gaaagagccg aagcagaaag gcaaaaagat tttttaagag
     3241 ctgcttcttt ggctcaagaa atcgttacat tgaaccgatc tttaaaataa ctggagaact
     3301 gatgaggagc atttattggc aatgattcct tgcggaggag caaatagatc gcttaacctc
     3361 atcatgaatt gtcatttcat tattcgcaca ttgttaaagg cagttcacat agaaaacgcc
     3421 tgaatggacc gaataagaat cataccgctt atagaattcg ttgcaagctt tggaaggagg
     3481 gatccataat ggctgataaa caaacccacg agacagaatt aacattcgac caagtaaaag
     3541 agcaattaac agagtctggt aaaaaacgtg gcgttttgac atatgaagaa attgctgagc
     3601 gtatgtccag ctttgaaatt gaatcagacc aaatggatga gtattatgaa tttttaggtg
     3661 aacaaggtgt tgaattaatt agtgagaatg aagaaacaga agatcctaat attcagcagc
     3721 ttgccaaagc cgaagaagaa tttgacctta atgacctaag tgtaccgcct ggcgttaaaa
     3781 tcaatgaccc agttcgtatg tatttaaagg aaatcggtcg ggttaacctt ctttctgcaa
     3841 aagaagaaat cgcctacgct caaaagattg aagaaggtga cgaagaatct aaacgcagat
     3901 tggctgaagc gaacctgcgg cttgttgtca gtatcgcaaa acggtatgtc ggacgcggta
     3961 tgctgttcct tgatctgatc catgaaggaa acatgggcct gatgaaagcc gttgaaaaat
     4021 ttgattatcg caaaggttat aaattcagta cgtatgctac gtggtggatc agacaggcga
     4081 ttacacgcgc cattgccgat caggcgagaa cgatccggat tcccgttcat atggttgaaa
     4141 ccattaataa attaatccgt gtgcagcgtc aattactgca agacttaggc agagaaccaa
     4201 cacctgaaga aattgcggaa gatatggatt taacgcctga aaaagtacgc gaaatcttaa
     4261 agattgctca agagccggta tctctggaaa caccgatcgg tgaagaggat gactcgcacc
     4321 ttggtgattt cattgaagac caagaagcaa cttcaccttc tgaccacgcc gcatacgagc
     4381 tattgaaaga gcagctggaa gatgtgcttg atacgttaac tgatcgtgaa gaaaatgtat
     4441 tgcgtcttcg attcggtctt gatgacggcc gtacaagaac attagaagag gtcggcaaag
     4501 tatttggagt aacgagagag cgtattcgac aaatcgaagc caaagcgttg cggaaactaa
     4561 gacatcctag cagaagtaaa cgtttgaaag atttccttga ataagatgga acgggtcttg
     4621 aagatccgtt cttctttttt taaaaagata tatggataat atgcctttat tttactgaaa
     4681 aatgatgtca tttgcaaatg aacattgtgg tgaaaaattt caaaatctaa ttccatattt
     4741 tctattgtaa gcgtatacaa tacattatac aatagaataa aaaggatatt agagatttag
     4801 gcatgtttct atttcaattt tgtataaagt gtgaataaaa aacttttgta tagcaatcca
     4861 tttacttttt gtaaaaataa gttagaatta gaagtgttta cataggggga aggatcaaaa
     4921 agggggaagg gaaatgaaat ggaacccgct tattccattt ttgctgatcg ctgttttagg
     4981 aatgggtcta actttctttt tatcagtaaa aggacttgat gactctcggg aggttgcgag
     5041 cggaggagaa agcaaatctg ctgaaaagaa agatgcaaac gcttcaccag aagaaattta
     5101 caaggcaaat cgcatcgcat cgcatgc
//
LOCUS       CHKMHBFVB    4790 bp ds-DNA             VRT       03-AUG-1990
DEFINITION  Chicken MHC class I B-FIV-B12 alpha-chain gene, complete cds.
ACCESSION   M31012
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Chicken (strain CB; haplotype B12), clone cBF23.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 4790)
  AUTHORS   Kroemer,G., Zoorob,R. and Auffray,C.
  TITLE     Structure and expression of a chicken MHC class I gene
  JOURNAL   Immunogenetics 31, 405-409 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.Kroemer, 21-DEC-1990.
FEATURES       from  to/span     description
    pept       1405     1471     MHC B12 alpha-chain, exon 1
               1589     1852     MHC B12 alpha-chain, exon 2
               2081     2353     MHC B12 alpha-chain, exon 3
               2457     2729     MHC B12 alpha-chain, exon 4
               2803     2910     MHC B12 alpha-chain, exon 5
               3021     3053     MHC B12 alpha-chain, exon 6
               3213     3245     MHC B12 alpha-chain, exon 7 (put.)
               3400     3419     MHC B12 alpha-chain, exon 8
    IVS        1472     1588     MHC B12 alpha-chain intron A
    IVS        1853     2080     MHC B12 alpha-chain intron B
    IVS        2354     2456     MHC B12 alpha-chain intron C
    IVS        2730     2802     MHC B12 alpha-chain intron D
    IVS        2911     3020     MHC B12 alpha-chain intron E
    IVS        3054     3212     MHC B12 alpha-chain intron F
    IVS        3246     3399     MHC B12 alpha-chain intron G
    pre-msg    1347     3587     MHC B12 alpha-chain and introns (alt.)
    pre-msg    1379     3587     MHC B12 alpha-chain and introns (alt.)
    binding    1328     1337     Sp1 binding site
    site       1219     1228     interferon responsive element
    site       1263     1270     cAMP responsive element
    site       1238     1244     S box (B-L-beta)
    site       1247     1255     X box (B-L-beta)
    site       1288     1294     Y box (B-L-beta)
    site       1313     1317     CAAT box
BASE COUNT      812 a   1637 c   1460 g    881 t
ORIGIN      Microchromosome 16.
        1 acctataagg gacatgggaa cacagcagga ggtgtttggg ttctggggac atggggggaa
       61 ctctggggaa ggagatgtga ggtgacatgg gtgacatggc aggaggggac ccaagggtgc
      121 acacatccct gtgtgccaca ggggtgtccc caaccccctc accccagagg acatccccca
      181 tgtccccata tcccccggtg tccccatacc tcaatttgta ggagctggtt gaagttgttg
      241 tggggtgtga gggcaaatgg gactgggagg gtgcaggccc agccccacgg tccccctcag
      301 cccatgcagt caccgtcccg gtcaccggcc ccacagctgc cagcaacagc ccatgggggt
      361 ccccaaggcc acaggggcgg ggggccacca ggacccacct gagggaacac tgcttgagcc
      421 accagcccca taactgctcc ataagccctc ccagatccca taagcatcca gacacccctt
      481 aggtccatag gaatcccccc agctctggcc ccataacccc acccaaccac acagaaatct
      541 cccttagtcc ccccagactc catcacaccc caatcctata gcagccccca gccccaccag
      601 tgccccatag tccctcatag gcactgcaga acattcatag tcccatagct accccacaaa
      661 caccccatag ccccatactc agggtcctgg gagctggccc tcattaaggg agtgaggaag
      721 aggaggagga agagctgaag ccagtgaggc ccatggctcc ggcagtgtgg ggcagatggg
      781 agcagtggcg cagggacagc ggggttgggg tcaatcatta acttgggggg tggacattgg
      841 ggtccacctg cagaccctga gggcatcgac ctcagatgga ccacgtggaa tgtggggaag
      901 cggtgtgggg cggcgctgtg gggaaatggc actgatccca aaggaagccc tgggggacac
      961 cactgtcacc accccctccc ccccggacac gagcctctaa ccctatggat gtgacccccc
     1021 ccccaaacag ttccttgact ggaatggaga atgactcctc ctcttctcac ggcccccgtc
     1081 ctcccggcag agaccctcgg ccccctcccc cggggtaagg acgcggcccc gtgcgaacac
     1141 agcccgggtg cccccccgtg ctcgaagggc cgggggttcc cacaccacgc ccatcccctc
     1201 ccccgctccg cctttcgctt tcgcttcaca acctgaggga gcgcattctg cctggcgccc
     1261 gatgacgtca cataaaactc caactaccat tggcggagag gcgacggagg agccaatggg
     1321 ggcgcggggc ggggcggagg agtaggaaaa gctgaaggag ctgcgctggg tgcggcggac
     1381 ttgagagtgc agcggtgcga ggcgatgggg ccgtgcgggg cgctgggcct ggggctgctg
     1441 ctcgccgccg tgtgcggggc ggcggccccc ggtgagtgcg gccggaccgg gacccctccc
     1501 ccgcccgtaa ccccaccccg gggctgtgtc cgtgggatcc tgagacccac acccgcggct
     1561 cacggcccgc tgtgctccgt ccccgcagag ctccataccc tgcggtacat ccaaacggcg
     1621 atgacggatc ccggccccgg gcagccgtgg ttcgtgactg tggggtacgt ggacggggaa
     1681 ctcttcgtgc actacaacag caccgcgcgg aggtacgtgc cccgcaccga gtggatagcg
     1741 gccaaggcgg accagcagta ctgggatgga cagacgcaga tcggacaggg caatgagcag
     1801 attgaccgcg agaacctggg catactgcag cggcgctaca accagaccgg cggtgagcac
     1861 ggccggggcc gcggctccgt gggtgtggga tgggctccat gcgcagtgcc gcccacaccc
     1921 cccaggcctg gccctgcccg gcggcaccgt cccggggctg cccgtcacag ccccaccgcg
     1981 ctcggggtgc cgcgtcccgg ggggacccca acccatcccc gctgcagtgg gagccccgga
     2041 gccggagggg cccctcaccc cctgcccggc tgtgtttcag ggtctcacac ggtgcagtgg
     2101 atgtacggct gtgacatcct cgagggcggc cccatccggg ggtattatca gatggcctac
     2161 gatgggagag acttcactgc cttcgacaaa ggcacgatga cgttcactgc ggcagttcca
     2221 gaggcagttc ccaccaagag gaaatgggag gaagagagtg aacctgagag gtggaagaat
     2281 tacctggagg aaacctgcgt ggagtggctg cggagatacg tggaatacgg gaaggctgag
     2341 ctgggcagga gaggtgagcg gggtgggggg gggagcggct gcagtgtggg gctggacgtg
     2401 gggcgggggc tcagcgtggg gagctcagcc cggccctcac tgccgcccgc ccgcagagcg
     2461 gcccgaggtg cgagtgtggg ggaaggaggc cgacgggatc ctgaccttgt cctgccgcgc
     2521 tcacggcttc tacccgcggc ccatcgttgt cagctggctg aaggacggcg cggtgcgggg
     2581 ccaggacgcc cactcggggg gcatcgtgcc caacggcgac ggcacctacc acacctgggt
     2641 caccatcgat gcgcagccgg gggacgggga caagtaccag tgccgcgtgg agcacgccag
     2701 cctgccccag cccggcctct actcgtgggg tgagtgaggg gatgtggggc tggggggctg
     2761 cgggctgccc cttcccctgc tgatggcccc gctctccccc agagccgcca cagcccaacc
     2821 tggtgcccat cgtggcgggg gtggccgtcg ccattgtggc cattgccatc atggttggtg
     2881 ttggattcat catctacaga cgccatgcag gtaaaagcag aggggtgcag gcgggcagtg
     2941 gggggatctg ggtcccccct tgggagccct catgcctggc tgtgatgtga acctgtgctg
     3001 aagcatctct ctgtctgcag ggaagaaggg gaagggctac aacatcgcgc ccggtgagtg
     3061 atgagggcag cgctgtcccc cacctctgcc cagtgccagg gcggtcctgg ggtctgcact
     3121 ttctcccagg gtacccattc ctggtgcttg gggctgctcc acgccccata gggagcacag
     3181 ggctggatct cacagctgtt cctcccttat agacagggaa ggtggatcca gcagctcgag
     3241 cacaggtgcg tgtggggctg tgggttggga ggggtccgtg tgctctctgt ggtactgccc
     3301 agggctgggc tatgctgggg ctctgcgggg agacccccgg agcagagggt tgggatgtga
     3361 acctggcccc gtgggacatc atcccttctc atcccacagg gagcaacccc gccatctgag
     3421 tgctgtgctt cagcctgcaa ggagccaaca gtccacacca gcatttgggg tcagtgatgg
     3481 gcacagcccc atcctcttga cctctcacat ctcattctgc ttcctatgct gactgttatg
     3541 ctttgcctgc actgcttcct gtgaaataaa atgatgggcc attctgtggc tcagcttgcc
     3601 tgcagtctgc actgtgctgt ggttggggat ggggtggatg aggggaccgt gtcagtttgg
     3661 ctgctcaggg tgcagatgtg gccctgtgct gagtacccac tgccctcccc tcttctatct
     3721 gcctgctgct cactcccccc tgtgtacccc catcccttct cacctctcct ctgtgacccc
     3781 atgctggtgg ttgcttgctc cctgtcctgg cagaactctc attttccaat ggcatccctg
     3841 tttgttggga tgtggtctcc ttggtcctcc ccccagcagt cactgcacat atccacccca
     3901 cttccccccc aggttgctgt cccacagcac tcctatttcc ttctctgccc tccccccccg
     3961 cccatccagc tgcctctgca atcctcaccc acaaccttgc ccactccacc tccctcatcc
     4021 cgcccttccc ccagctgtcc tgtccctgct gggccccctc cccccacatt gtaccctaca
     4081 cccaaataaa tacgtttgtt ctgctgccct ccagccgtct tctggtttat ttcccccccg
     4141 atttgttgtt gtggcgtccg ctcttctcct ggggggaagg ggctctgggg gtccctcatt
     4201 ctccctgctc ttcttacagc accgggactc cccgcgctga gatcccaaca cacccgggta
     4261 caaacatgcg gcttttattc ccagttctgt gtcccacccc cggccctggt ggcactcagt
     4321 ggcactgcag tccatgcagt ggccgttgtg tgtcgtagag cagcggtacc cgagcgcgcc
     4381 ccggctcggc atccatgtgc ccacggcaca gctcttgtgg tcccttgtcg ctgccccggt
     4441 gtccccacct ccaccctcag tgtccccaac gtgcagtgtc cccatcccac atcccaccac
     4501 cccctgccct ccccatccat aactcccgct atccctctcc ctctgtcccc ctcccccggt
     4561 gctccctgtt gtccccagtc ccgcagaagg ctgccggggg cagcacctcg tgggggggtc
     4621 cctgctgcca cacctctccc ccgtccaaca ccactaccca ttgtaccagg gctgcccgcc
     4681 ccgtcaccat cagcactgca catccttacc ctctggctag ctggatccaa cttgtggaca
     4741 ggcttctgat gccctgtgta tgacaaggtt tctgatcctc gcgcctgcag
//
LOCUS       DOGMHDL9A    1499 bp ss-mRNA            MAM       03-AUG-1990
DEFINITION  Dog MHC class I DLA-A9/A9 alpha-chain mRNA, complete cds.
ACCESSION   M32283
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Dog (#C624; haplotype DLA-A9/A9 homozygote) peripheral blood
            leukocyte, cDNA to mRNA, clone pBT-I16.
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 1499)
  AUTHORS   Sarmiento,U.M. and Storb,R.
  TITLE     Nucleotide sequence of a dog class I cDNA clone
  JOURNAL   Immunogenetics 31, 400-404 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            U.M. Sarmiento, 22-FEB-1990.
FEATURES       from  to/span     description
    pept          1     1089     MHC DLA-A9/A9 alpha-chain precursor
    sigp          1       72     MHC DLA-A9/A9 alpha-chain signal peptide
    matp         73     1086     MHC DLA-A9/A9 alpha-chain
    mRNA          1     1499     MHC DLA-A9/A9 alpha-chain mRNA
BASE COUNT      296 a    425 c    489 g    289 t
ORIGIN      
        1 atggaggtgg tgatgccgcg agccctcctc gtgctgctgt cggcggccct ggccctgacc
       61 ccgacccggg cgggctccca ctccctgagg tatttctaca cctccgtgtc ccggcccggc
      121 gcgggggacc cccgcttcat cgccgtcggc tacgtggacg acacgcagtt cgtgcggttc
      181 gacagcgacg cggccactgg gaggatggag ccgcgggcgc cgtgggtgga gcaggagggg
      241 ccggagtatt gggaccggca gacgcggacc atcaaggaga ccgcacggac tttccgagtg
      301 gacctggaca ccctgcgcgg ctactacaac cagagcgagg ccgggtctca cacccgccag
      361 accatgtacg gctgtgacct ggggcccgac gggcgccttc tccgcgggta cagtcaggac
      421 gcctacgacg gcgccgatta catcgccctg aacgaggacc tgcgctcctg gaccgcggcg
      481 gacacggcgg cgcagatcac ccagcgcaag tgggaagcgg caggtgtagc agagctacaa
      541 tggaggaact acctggagac gacgtgcgtg gagtggctgc ggaggtacct ggagatgggg
      601 aaggagacgc tgctgcgcgc agacccccca agcacacgtg tgacccacca ccccgtctct
      661 gaccatgagg tcaccctgag gtgctgggcg ctgggcttct accctgcgga gatcaccctg
      721 acctggcagc gggatgggga ggaccagacc caggacacag aggttgtgga cacaaggcct
      781 gcaggagatg ggaccttcca gaagtgggcg gccgtggtgg tgccttctgg acaggagcag
      841 agatacacgt gccacgtcca gcatgagggg ctgccggagc ctatcacgcg gagatgggag
      901 ccttcccctc tgtccaccat tgtcatcgtc agcattgctg ctctggttct cctcgtggtc
      961 gctggggtga ttggagctgt gatctggagg aagcagcgct caggaggaaa aggaccaggc
     1021 tactctcatg ctgcacgtga tgacagtgcc cagggctctg atgtgtctct gacagctcct
     1081 agagtgtgag accagctgcc tgtgggactg acggatgcaa gatgtgttca catctcacgt
     1141 gatgacatca acaaccctgg cttgtctctg caaacagtgt caggatgtgc ctgtgtccct
     1201 aggagcataa tgtgaggagg tggggagatt ggcccaccct gcccaccatg acctgtccct
     1261 aatctgatgt gcgctctcct ctctgatgtg ctttcctgtc caggagaggc agggctggac
     1321 catctccatc cctgtctttg tttcatgttg agtactaatc tcttactatc cgattgaaaa
     1381 taagaatcca gatatgagtt tgtgtttcct gagtcttggg atgtggggct gatgaggtaa
     1441 taaaaggaga tttgtgaagt tgagagagca aataaatgga agccctgaga accttccag
//
LOCUS       DOGMHDRBA    1165 bp ss-mRNA            MAM       03-AUG-1990
DEFINITION  C.familiaris MHC class II DLA-DR-beta-1 gene, complete cds.
ACCESSION   M29611
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      C.familiaris (dog #C624, haplotype DLA-Dw3/Dw3) leukocyte, cDNA to
            mRNA, clone pBT-DR5.
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 1165)
  AUTHORS   Sarmiento,U.M. and Storb,R.
  TITLE     Nucleotide sequence of a dog DRB cDNA clone
  JOURNAL   Immunogenetics 31, 396-399 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by U.M.Sarmiento, 31-OCT-1989.
FEATURES       from  to/span     description
    pept         40      840     MHC DLA-DR-beta-1 precursor
    sigp         40      126     MHC DLA-DR-beta-1 signal peptide
    matp        127      837     MHC DLA-DR-beta-1
    mRNA          1     1165     MHC DLAB1 mRNA
BASE COUNT      237 a    313 c    319 g    296 t
ORIGIN      
        1 cgactgctcc cctgcaccct gtcctttctg ttttccagca tggtgtgtct gtgttttctt
       61 ggaggctcct ggatgacagc tctgatgctg atactgatgg tgctgaaccc tcccttcgct
      121 tgggccaggg acaccccacc acatttcttg gaggtggcaa agtccgagtg ctatttcacc
      181 aacgggacgg agcgggtgcg gttcgtggaa agatacatcc ataaccggga ggagttcgtg
      241 cgcttcgaca gcgacgtggg ggagttccgg gcggtcacgg agctcgggcg gcccgtcgct
      301 gagtcctgga acgggcagaa ggagatcttg gagcaggagc gggcaacggt ggacacctac
      361 tgcagacaca actacggggt gattgagagc ttcacggtgc agcggcgagt cgagcctaca
      421 gtgactgtgt atcctacgaa gactcagacc ttgcagcatc acaacctcct ggtctgctct
      481 gtgaatggtt tctatccagg ccacattgaa gtcaggtggc tccggaatgg ccaggaagag
      541 gaagctgggg ttgtgtccac aggcctaatc cgtaatggag actggacctt ccagatcctg
      601 gtgatgctgg agatagttcc tcagagcgga gaggtctaca cctgccaagt ggagcaccca
      661 agtttgacaa gccctgtcac cgtggaatgg agggcacagt ctgattctgc acagagcaag
      721 atgctgagtg gaatcggggg ctttgtcctg ggtctgctct tccttgcagt ggggctgttc
      781 atctacttca gaaatcagaa gggacactct ggacttcagc caacaggact cctgagctga
      841 agtggagatg gtgaccctca agaaaaaacc ttcctttcca gcttcttctc agagtccaaa
      901 ggtttccagc ttggctctta ttcttctaca aatacagcgc ctttctcagg atctggtttg
      961 ccctggcttc agtgaccctg cacagcttgt gttccccaat ggctccctct gccgtgtcct
     1021 tgacctggaa cccccagtgc tgactgcagt accttatctg cattctctcc ttcctcttcc
     1081 tatgcagccc ttatgtttcc cttgcatggg aactcacctt ctgctcatgt ttctttataa
     1141 acttttctca aataaatgtg gattg
//
LOCUS       LACTHYSYN     957 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  L.lactis thymidylate synthase (thyA) gene, complete cds.
ACCESSION   M33770
KEYWORDS    thymidylate synthase.
SOURCE      L.lactis lactis (strain 712) DNA.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 957)
  AUTHORS   Ross,P., O'Gara,F. and Condon,S.
  TITLE     Cloning and characterization of the thymidylate synthase gene from
            Lactococcus lactis subsp. lactis
  JOURNAL   Appl. Environ. Microbiol. 56, 2156-2163 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by P.R.Rose, 04-APR-1990.
FEATURES       from  to/span     description
    pept         87      926     thymidylate synthase (EC 2.1.1.45)
    binding      77       80     ribosome binding site
    site         28       33     -35 region
    site         51       56     TATA box
BASE COUNT      329 a    139 c    192 g    297 t
ORIGIN      
        1 tctgagaggt tattttggga aatactattg aaccatatcg aggtgtgtgg tataatgaag
       61 ggaattaaaa aagataggaa aatttcatga cttacgcaga tcaagttttt aaacaaaata
      121 tccaaaatat cctagataat ggtgtttttt cagaaaatgc aagaccaaag tataaggatg
      181 gtcaaatggc gaatagcaaa tatgtcactg gttcattcgt tacttatgat ttgcaaaagg
      241 gggagtttcc aattaccact ttgcgtccaa ttccaatcaa atctgctatt aaagaattga
      301 tgtggatata ccaagaccaa acaagtgaac tttctgttct cgaagagaag tatggagtca
      361 aatactgggg agaatgggga attggtgatg gtacgattgg gcaacgttat ggtgcaacag
      421 tcaaaaaata taatatcatt ggtaaattat tagaaggctt ggccaaaaat ccatggaatc
      481 gtcgtaatat catcaacctt tggcagtatg aagattttga ggaaacagaa ggtcttttac
      541 catgtgcttt ccaaacgatg tttgatgtcc gtcgagaaaa agatggtcag atttatttgg
      601 atgccacact gattcaacgt tcaaacgata tgcttgtagc ccaccatatc aatgcgatgc
      661 aatatgttgc tttgcaaatg atgattgcaa aacatttttc ttggaaagtt gggaaattct
      721 tttattttgt aaataattta catatttatg ataatcagtt tgagcaggca aatgaattaa
      781 tgaagcgaac agcttctgaa aaagaacctc gtttggtcct taatgttcct gatggtacaa
      841 actttttcga tattaaacct gaagattttg aacttgtgga ctatgagcca gtaaaacctc
      901 aattgaaatt tgatttagca atttaaatta atctataagt tactgacaaa actgtca
//
LOCUS       MUSLY48A      121 bp ss-mRNA            ROD       03-AUG-1990
DEFINITION  Mouse leukosialin (Ly48) mRNA, partial cds.
ACCESSION   M30693
KEYWORDS    leukosialin.
SOURCE      Mouse (strain C57BL/6J) adult thymus EL-4 tumor, cDNA to mRNA,
            clone ML18.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Baecher,C.M., Dorfman,K.S., Mattei,M.G. and Frelinger,J.G.
  TITLE     cDNA cloning and localization of the mouse leukosialin gene (Ly48)
            to chromosome 7
  JOURNAL   Immunogenetics 31, 307-314 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.G.Frelinger, 14-DEC-1989.
FEATURES       from  to/span     description
    pept    <     3  >   119     leukosialin (Ly48; AA at 3)
BASE COUNT       28 a     28 c     43 g     22 t
ORIGIN      Chromosome 7 put. 7F3/7F4 bands.
        1 gccagggctc tttagtacta gaagagctga agcctgggtc gggtcccaac ctgaaggggg
       61 aggaagagcc gcttgtgggc agtgaggatg aagctgtgga aaccccaact tctgacggtc
      121 c
//
LOCUS       MUSUBILP     2912 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse housekeeping DXS253E (P3) and DXS254E (GdX) genes, 3' end,
            and complete cds.
ACCESSION   J04761
KEYWORDS    housekeeping protein.
SOURCE      Mouse (strain Balb/c) liver DNA, clone lambda-M7P.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2912)
  AUTHORS   Filippi,M., Tribioli,C. and Toniolo,D.
  TITLE     Linkage and sequence conservation of the X linked genes DXS253E
            (P3) and DXS254E (GdX) in mouse and man
  JOURNAL   Genomics 7, 453-457 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by D.Toniolo, 10-APR-1990.
FEATURES       from  to/span     description
    pept    <     1      550     housekeeping protein DXS253E (P) (AA at 2)
    pept       1536     1583     housekeeping protein DXS254E (GdX), exon 1
               1740     1845     housekeeping protein DXS254E (GdX), exon 2
               2089     2297     housekeeping protein DXS254E (GdX), exon 3
               2413     2523     housekeeping protein DXS254E (GdX), exon 4
    pre-msg <     1      801     DXS253E mRNA
    pre-msg    1533  >  2912     DXS254E mRNA and introns
    IVS        1584     1739     DXS254E intron A
    IVS        1846     2088     DXS254E intron B
    IVS        2298     2412     DXS254E intron C
BASE COUNT      649 a    822 c    765 g    676 t
ORIGIN      X chromosome.
        1 cactggtttc ctaccactct catcagccat ctacagctac cttctcagca tccatgaaac
       61 actccacgtg cccatctcca agatattggg gaccctgctg tttatcgcca tccccatagc
      121 agcgggtgtg gtgatcaaat ctaagctccc caagttctct gaactactgt tacaggtcat
      181 caagcccttc agctttatac ttctcctggg tggcctgttc ctggcctacc acatgggggt
      241 cttcatccta gtgggagtca ggttacccat tgtactggtg ggtttcacag tgcctcttgt
      301 tggcctcttg gtgggctaca gcctggccat ctgcctgaag ctgccagtgg ctcagcgacg
      361 aacagtcagc attgaggtag gggtgcaaaa cagcctgcta gccttagcca tgctacagct
      421 gtctctgcgc cgccttcaag cagactatgc ctctcaggcc cctttcattg tggcactgag
      481 tggtacctcc gagatgttgg ctttggttat tggccagttc atctatagca tcttgtttcc
      541 tgttccctga gccctctgga tcaagtttta taacttccag cccccatact cctctcttcc
      601 cacagttctt gtgtaccgta ccaaaggcct ttagttctca tgcactatgc actcaagaca
      661 aatccaggct tattttttta ctccttttcg ccccagcttt cagtgccaaa gaggccatgc
      721 tgagttagat agttgggtat tgcccagaaa tatatttcaa taaaaaaaaa ccaaaaaaac
      781 aaaagcaagc tcgggtccct tccagtctgt cccttggagc ctgtggattg gggtaagaag
      841 agtgtgtctg ggcggatatg ggccagaaca gtgaaccagc ttggtgaccc tgcacagaac
      901 ccactttctc ctcctcactc tgttttgcac atggtagaca tgtacctttg agctgtatcc
      961 tcagatctac aaaggctttg ctctaacagc cctacctaac agggcagacc tctgcactat
     1021 aaggagtgaa gagagcacta cttccgagct tcagtcagga tatttttgca gccagactag
     1081 aaaggaagta gaaagccatt aagctatgac aggcaggact ggaccatttc tgtatctgag
     1141 agacaccacc acgtgtttgc attgacttgg tccaccagca gggtgggaat tagcacaggc
     1201 cgggcctgga gtgaggctga cattctggtc cttcctttca agccacagct cccagactaa
     1261 gcaagcctgg ccccacaggt acctcacttg gatcccagtg ggcggggcat ccccacgtcc
     1321 ccgcgccatt aagcccaggt ggtggactct acccaaggca cggccggggc agaagcgaag
     1381 gccgggaggc gggactgaag aatgcgcagg cgggagccac gagcggcgcg cccctttcac
     1441 ggcttccggc ggcggcgggc ggttccgggg cggcggcgcg cggcagggga ccgttggtgt
     1501 ttgcgttggc cgtagtggac tgggccgtgg acaccatgca gctgaccgtg aaggcgctcc
     1561 agggccggga atgtagccta caggtaaagt caccccgcag gctgctgcac tcactcctgc
     1621 cgctggggcc gagcgcgcgt gggggaggcc gggagagctg acctcaggag ggggtgcgag
     1681 ggcttcaccg cccagggaat tcgggctgag cccttgcctc tccacctcgg ccggatcagg
     1741 tggcggagga cgagctagtg tctacactga agcacctggt ctcggataag ctgaatgtcc
     1801 ctgtgcgcca gcaacgtctg ctgttcaagg gcaaggccct agcaggtaat cagggaaagg
     1861 agccatccag ggagctttac agaaagctgg gggacgggat gtacaggcca gtgtagtgga
     1921 tcggggcact ggctgccact ggacctcaga aaagtagctg agccatcacc accccaccac
     1981 cccatagcca gatgctgcag tgcttcactc tggctgtctg tggcaggatt gcttcccagc
     2041 cacctccaca tcagtgagga aggaaccaca ccaatctctc tttggcagat gaaaaacgac
     2101 tgtcagatta caacattggg cccaattcta agctcaacct agttgttaag cctttggaga
     2161 aggtgctact ggaagaaggg tctgcccaca gactggtcga ctccccagcc acccccatct
     2221 ggcagctgat ctccaaagtc ctggcccgtc acttcagtgt agcagatgcc agcagggtcc
     2281 tggaacaact acagagggtg agaagagtaa tgctggccct ccattcttaa tgtttctacc
     2341 cgtcccctac ctaatgcagc atacgccttc cccactggcc tctctgaata ccactcggtc
     2401 ttctctcctc aggattatga caggtccttg agccgcctaa cactggatga catcgaacgt
     2461 ttggccagcc gctttctaca ccctgaagtg actgaggcta tggaaaaagg gttctgcaaa
     2521 tagcattctg ggattgtggg gagaaatccc aggtcaggcc acagctgcat gttgcattaa
     2581 atgtgttctc atgtcgcagt ttggctcata gtgataataa tagctggtat gtatccagct
     2641 cttgctaggt gctaggtact caaagtacat ggcctgagtt ctcttctgcc tcccacagca
     2701 aagttgaggc tgcactgtag agacacctag caatggcttg ggacagacag ggtccaagtc
     2761 tagccagcct atagaagtgc ctctgtagga gaaagagtgt gctgttttca gagatgggtc
     2821 ttagtattcc catttagccc actcagcata caccctcaaa agactccctc aggaaaccac
     2881 tagttcctgt cacctgccct ctggtgtgtc cc
//
LOCUS       YEPLCREA     3316 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  Virulence plasmid pYVO3 (from Y.enterocolitica) LcrE protein mRNA,
            complete cds.
ACCESSION   M32097
KEYWORDS    .
SOURCE      Virulence plasmid pYV03 (from Yersinia enterocolitica, strain
            6471/76, serotype 0:3) DNA, clones pY[MS4146,MS4073,P6].
  ORGANISM  Plasmid pYV03
            Prokaryota; Bacteria.
REFERENCE   1  (bases 2 to 1081)
  AUTHORS   Viitanen,A.-M., Toivanen,P. and Skurnik,M.
  TITLE     The lcrE gene is part of an operon in the lcr region of Yersinia
            enterocolitica O:3
  JOURNAL   J. Bacteriol. 172, 3152-3162 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3316)
  AUTHORS   Viitanen,A.-M., Toivanen,P. and Skurnik,M.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by M.Skurnik, 14-FEB-1990.
FEATURES       from  to/span     description
    pept        649     1530     lcrE product
    ORF        1511     1789     ORF1
    ORF        1776     2147     ORF2 (gtg start codon)
    ORF        2509     2853     ORF4
    ORF        2840  >  3315     ORF5
    ORF         451      119 (c) ORF6
    ORF         173  <     1 (c) ORF7
    mRNA       1020  <     1 (c) lcrB locus mRNA
BASE COUNT      892 a    751 c    791 g    882 t
ORIGIN      
        1 aggatcccca aagaaagtgg tgtggtaata agttttcggc tcatcggagc tggggcatcc
       61 tgataaactg ggtaccaagc cgccggttca gggagatgcc ccccatcgaa aggctgccct
      121 aaaccatcca acacctgccc cagcagatgt tcacccaccc caacctgatg cattgtcctg
      181 tcgggctaac ttcagtatta gaagatatcc cgtacatttc accaagtgga ataagtaatg
      241 cttgatgttg ggcaaaacct atgacttcag cctgtaaaga caggctgttg tctgggttac
      301 gtaagtaaca taactcaccg atgcgcacac caggcactac cgcttttaat agcgttcctg
      361 tcacttgagt gacacgtcct ctaatttgga ttaggcggct acctacaatg ccatgacgaa
      421 tatgatgagg tatctgatct agtgagagca taaatccata atggttgaaa tattaaccac
      481 tattttagtg actaaaaacg ctaaaaaatt gtagcgggag ccgcgagttt ttagaaaaat
      541 agccaagcag cactaaaatt tctcggctga ttttggcatc gataagcaag aactattttt
      601 ataatcgcgg taattgcaat tataaaccgt tcatcttagg gagtagttat gacgacgctt
      661 cataacctat cttatggcaa taccccgctg cgtaatgagc atccagagat tgccagtagt
      721 cagatcgtaa atcagactct gggtcaattt cggggagaat ctgtgcagat agtcagcggc
      781 actctgcagt ctatagctga tatggcagaa gaggtaacat ttgtcttctc cgagcgtaag
      841 gagctctccc tcgacaaacg caaattaagt gacagccaag ctcgagttag cgacgttgag
      901 gagcaggtta atcaatacct tagcaaagtt ccagagttga aacaaaaaca gaatgtgagt
      961 gagctgctca gtctgttgag taacagcccc aatataagct tgtcccagtt aaaggcttat
     1021 ctggagggga aatcagaaga accgagtgag caattcaaaa tgctctgcgg cttgcgtgat
     1081 gccctgaaag ggcgccctga attagcacat cttttgcatt tggttgaaca agctctggtc
     1141 agcatggttg aagagcaaga agaagccatt gtattgggtg ccagaataac cccggaagcg
     1201 tacagagaat cccagtcggg tgttaatcca ctgcagccgc tccgtgatac ctaccgcgat
     1261 gcagtgatgg gttatcaagg gattaatgcg atctggagtg atttacaaaa acgttttccc
     1321 aatggggata tagactcggt gatattattc ctgcaaaagg cgcttagtgc agatctacaa
     1381 agtcaacaaa gcgggtctga acgggaaaaa ttagaaatag ttattagtga cttacagaag
     1441 ctaaaggagt ttcgtagcgt gagtgaccag gttaaaggat tttggcaact tttttcagag
     1501 ggtataacta atggcttacg acctttctga gtttatggga gatattgtcg cactggttga
     1561 caagcgctgg gcggggattc atgacattga acatcttgcc aacgcctttt cccttcctac
     1621 gcctgaaatc aaagtgcgtt tctatcaaga tttaaaaaga atgtttcgtc ttttccctct
     1681 gggggtattt agcgatgagg agcaacggca aaatttattg caaatgtgtc aaaatgcgat
     1741 cgatatggct attgagagtg aagaggaaga attgagtgag ttggattgaa cccatcattt
     1801 cccatttctg ccaggatctg ggagtgccaa catctagccc cctttcgcct cttattcaat
     1861 tagagatggc tcaatctggc acgctgcaac tggaacaaca tggtgcgaca ctgacactgt
     1921 ggttagcgcg ttctctcgcc tggcaccagt gcgaagatgc tatggtcaaa gcgctaacgc
     1981 tcacggcggc ccaaaagagt ggcgctttac cgctgcgagc ggggtggtta ggggaaaatc
     2041 aactggtgtt atttgtctcg cttgatgagc gttccttaac cttgcccctt ttacatcaag
     2101 ctttcgaaca gttactgcga ttgcagcaag aggtgcttgc gccgtgagtc gcataataac
     2161 tgccccccat attggcatcg aaaaactgtc ggcgattagc ctggaagagc tatcctgtgg
     2221 cttgcctgat cgttatgcct tgccgcctga tgggcatcca gtcgaaccac atttagagcg
     2281 cctttaccct acagcacaaa gcaagcgtag cctatgggac tttgcttctc ccggctatac
     2341 atttcatgga ttacatcgag ctcaagatta tcggcgcgaa ctggatacct tgcagtcact
     2401 gctaaccacc agtcagtcct cagagctaca agctgccgcg gcgctgctga aatgccaaca
     2461 agatgatgat cggttactgc aaataatcct taacctgttg cacaaagtat gaatattact
     2521 ttaaccaaac gacaacagga gttcttgctg ctcaacggtt ggttacaact acaatgtggc
     2581 catgcagagc gcgcatgtat tctattggac gccttgctga cgttaaatcc tgagcattta
     2641 gccggtcggc gttgccgatt agtcgcgcta cttaataata accagggaga acgtgccgaa
     2701 aaagaagcgc aatggctaat atcacatgac cctttacagg ctggaaattg gctctgcttg
     2761 agccgtgccc aacaactgaa cggcgatctt gataaggctc gccatgctta tcaacattat
     2821 ttggagttga aagatcataa tgaatcccca tgatcttgag tggctaaatc gtattggcga
     2881 gcgtaaagat atcatgctgg cagtgctgct gttagctgtg gtattcatga tggtcttacc
     2941 actccccccc ctcgtgttgg acattctgat tgctgttaac atgaccatct cagtggtgtt
     3001 gttaatgata gcgatctata tcaactctcc tttacaattt tcagctttcc cagcggtgct
     3061 actcgttacc acgttatttc gtctcgcact ttcagttagc accacccgca tgatcctgct
     3121 acaagctgat gcggggcaga ttgtttacac ctttggtaat ttcgtcgttg gcggtaacct
     3181 catcgtcggg attgtcatct tcctgatcat cactattgtg caatttttag tgataacgaa
     3241 aggctcagaa cgtgtagcag aagttagtgc cagattctct cttgatgcga tgccgggtaa
     3301 acagatgagt atcgat
//
LOCUS       FSOFARS1A     170 bp ds-DNA             PLN       03-AUG-1990
DEFINITION  F.oxysporum autonomous replicating sequence (ARS).
ACCESSION   M29451
KEYWORDS    autonomous replicating sequence.
SOURCE      F.oxysporum lycopersici (strain 73) ARS DNA, clone pFOLT4R4.
  ORGANISM  Fusarium oxysporum
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Hypocreales; Hypocreaceae.
REFERENCE   1  (bases 1 to 170)
  AUTHORS   Powell,W.A. and Kistler,H.C.
  TITLE     In vivo rearrangement of foreign DNA by Fusarium oxysporum produces
            linear self-replicating plasmids
  JOURNAL   J. Bacteriol. 172, 3163-3171 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.A.Powell, 27-OCT-1989.
FEATURES       from  to/span     description
    site         20       28     DNA homologous to yeast autonomously
                                 replicating consensus sequence
    site         26       31     telomere repeat unit
    site        130      136     ligation point of telomeres -> ApaI site
BASE COUNT       35 a     18 c     66 g     51 t
ORIGIN      116 bp upstream of ApaI site.
        1 catgccatcc gtaagatgct ttggtttagg gttagggtta gggttagggt tagggttagg
       61 gttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg
      121 gttagggtta gggccctaac cctaaaccaa agcatcttac ggatggcatg
//
LOCUS       ARURRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  A.gigantea ribosomal RNA.
ACCESSION   M37170 M35661
KEYWORDS    ribosomal RNA.
SOURCE      A.gigantea rRNA.
  ORGANISM  Arundinaria gigantea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       11 a     18 c     12 g      9 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagctacgt ggaggcatcc ctccgcagct
//
LOCUS       ASTRRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  A.sativa ribosomal RNA.
ACCESSION   M37171 M35661
KEYWORDS    ribosomal RNA.
SOURCE      A.sativa rRNA.
  ORGANISM  Avena sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       11 a     19 c     11 g      9 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS       BLYRRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  H.vulgare ribosomal RNA.
ACCESSION   M37172 M35661
KEYWORDS    ribosomal RNA.
SOURCE      H.vulgare ribosomal RNA.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       11 a     19 c     11 g      9 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagctacgt ggagccatcc ctccgcagct
//
LOCUS       MUSELF2PS    2183 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse elongation factor 2 processed pseudogene, complete cds.
ACCESSION   X15335 J04749
KEYWORDS    elongation factor 2; pseudogene.
SOURCE      Mouse cell line Ltk- DNA, clone cMER1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2183)
  AUTHORS   Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and
            Uchida,T.
  TITLE     Amplification of a long sequence that includes a processed
            pseudogene for elongation factor 2 in the mouse
  JOURNAL   Genomics 6, 80-88 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Koide, 31-AUG-1989.
FEATURES       from  to/span     description
    pept.ps     319     2038     pseudo-elongation factor 2
    rpt         319      330     repeat copy A
    rpt        2027     2038     repeat copy B
BASE COUNT      594 a    543 c    585 g    461 t
ORIGIN      
        1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca
       61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct
      121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaaatgta
      181 gtttgaaaaa gggagcttgc ctgcctatca ttttagtgcc tgtgaggtgg agatgtgagg
      241 atcagtaagt agttcaaggt tattcttggc tgtttagtga gtgttggcta tcctaggcta
      301 cataagtccc tgtttcaaga aaacaaacca ggccactgta ggcgcgcttc cctgttcacc
      361 tctgactctg agaatctgtc gccatccacc accatggtga acttcacagt agttcagatc
      421 cgtgccatca cggacaggaa agccaacatt cagaacatgt cagtcatctc catgtggacc
      481 acggcaagtc cacgctgaac gactcccttg tgtgcaaggc tggcatcatt gcctccgccg
      541 gagccgggga gacacgtttc actgacatgc gcaaagatga gcaggagcgc tgcatcacaa
      601 tcaagtccac cgccatctcc ctcttctacg agctctctaa gaacgacctg aaccgcatta
      661 agcagagcaa ggatggctcg ggcttcctca tcaacctcat cgactctccc ggccatgtgg
      721 acttctcttc agaggtgaca gctgccttgc gtgtcaccga tggagctctg gtggtggtgg
      781 actgtgtgtc tggcgtgtgt gtgcagacag aaattgtgct gcgccaggcc atcgctgagc
      841 gcatcaagcc cgtcctgatg atgaacaaga tggaccgggc ccctgctgga gctgcagctg
      901 gaacccgagg agctctacca gaccttccag cgcatcgtgg aaaacgtcaa cgtcatcatc
      961 tctaccgatg gcgagggtga gagtgggccc atgggcaata tcatgattga ccccgtcctg
     1021 ggtgccgtag gctttggttc tggcctgcat ggctgggcct tcaccctgaa gcagtttgcg
     1081 gagatgtacg tggccaagtt tgcagccaag ggcgagggcc agctgggtgc aactgagcgt
     1141 gccaagaaag tagagggcat gatgaagaag ctgtggggag accagtactt tgatctgccc
     1201 tcatggagcc tatctatctg gtggagatcc agtgtcctga gcaagtggtg ggtggcatct
     1261 atggtgtcct gaacaggaag cgtggccata tgtcccaggt ggccggtacc cccatgtttg
     1321 tggtcaaggc atacctgcct gtcaatgaat cctttggctt cactgctgat ctgcgatcca
     1381 acactagcag ccaggccttc ccccagtgcg agtttgacca ctggcagatc ctgcctgggg
     1441 atccttttgt caacagcagc cgccccagcc aagtggtagc tgagacctgc aagcacaagg
     1501 gcctgaaaga gggctggaca acttcctgga caaactgtag gcagcctgat actgccacat
     1561 gttgcacagt gcctgcccat cagaagacac cttgagactg tccccacagt gctcctctgg
     1621 aggctgttgg ggccaccctg acatcactca gcactcactt ggctaccaat tctatttatt
     1681 tcagaattac aaaatagcgg gaatctctct gcaggctgga ctggcaggcc gtggggtagg
     1741 cggacatggc tcttaacatt ttcagaggga aacacgcaga tgtccaaaag tctaaataaa
     1801 tgcattcaga ggtttatgag gtccatggcc aagtggagct cccccagagg gggaggtggg
     1861 gtaagtacct ccaggaaggc aggcagcctg ccttagactt gcaacccggc tgtgcgaatc
     1921 attggagtaa taaactacag tggttgatct gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1981 aaaaaagaaa agaaaagaaa agaaagaaag aaagaaagaa agaaaggaaa acaaaccaaa
     2041 agaaaaagag tttccatgat gtccaggatg atgcaggtgg gaagggctag ctcaggttag
     2101 aagggctagc tcaggtctct cttaccctgc tctaactcag cctgcttctg gaggagagac
     2161 aagtcaaaga gcagcaggag ctt
//
LOCUS       MUSELF2PSA   2199 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse elongation factor 2 processed pseudogene, complete cds.
ACCESSION   X15336 J04749
KEYWORDS    elongation factor 2; pseudogene.
SOURCE      Mouse cell line Ltk- DNA, clone cMER9.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2199)
  AUTHORS   Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and
            Uchida,T.
  TITLE     Amplification of a long sequence that includes a processed
            pseudogene for elongation factor 2 in the mouse
  JOURNAL   Genomics 6, 80-88 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Koide, 31-AUG-1989.
FEATURES       from  to/span     description
    pept.ps     324     2016     pseudo-elongation factor 2
    rpt         324      335     repeat copy A
    rpt        2005     2016     repeat copy B
BASE COUNT      602 a    546 c    590 g    461 t
ORIGIN      
        1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca
       61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct
      121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaacaaaa
      181 caaaacaaaa atgtagtttg aaaaagggag cttgcctgcc tatcatttta gtgcctgtga
      241 ggtggagatg tgaggatcag taagtagttc aaggttattc ttggctgttt agtgagtgtt
      301 ggctatccta ggctacataa gtccctgttt caagaaaaca aaccaggcca ctgtaggcgc
      361 gcttccctgt tcacctctga ctctgagaat ctgtcgccat ccaccaccat ggtgaacttc
      421 acagtagttc agatccgtgc catcacggac aggaaagcca acattcagaa catgtcagtc
      481 atctccatgt ggaccacggc aagtccacgc tgaacgactc ccttgtgtgc aaggctggca
      541 tcattgcctc cgccggagcc ggggagacac gtttcactga catgcgcaaa gatgagcagg
      601 agcgctgcat cacaatcaag tccaccgcca tctccctctt ctacgagctc tctaagaacg
      661 acctgaaccg cattaagcag agcaaggatg gctcgggctt cctcatcaac ctcatcgact
      721 ctcccggcca tgtggacttc tcttcagagg tgacagctgc cttgcgtgtc accgatggag
      781 ctctggtggt ggtggactgt gtgtctggcg tgtgtgtgca gacagaaatt gtgctgcgcc
      841 aggccatcgc tgagcgcatc aagcccgtcc tgatgatgaa caagatggac cgggcccctg
      901 ctggagctgc agctggaacc cgaggagctc taccagacct tccagcgcat cgtggaaaac
      961 gtcaacgtca tcatctctac cgatggcgag ggtgagagtg ggcccatggg caatatcatg
     1021 attgaccccg tcctgggtgc cgtaggcttt ggttctggcc tgcatggctg ggccttcacc
     1081 ctgaagcagt ttgcggagat gtacgtggcc aagtttgcag ccaagggcga gggccagctg
     1141 ggtgcaactg agcgtgccaa gaaagtagag ggcatgatga agaagctgtg gggagaccag
     1201 tactttgatc tgccctcatg gagcctatct atctggtgga gatccagtgt cctgagcaag
     1261 tggtgggtgg catctatggt gtcctgaaca ggaagcgtgg ccatatgtcc caggtggccg
     1321 gtacccccat gtttgtggtc aaggcatacc tgcctgtcaa tgaatccttt ggcttcactg
     1381 ctgatctgcg atccaacact agcagccagg ccttccccca gtgcgagttt gaccactggc
     1441 agatcctgcc tggggatcct tttgtcaaca gcagccgccc cagccaagtg gtagctgaga
     1501 cctgcaagca caagggcctg aaagagggct ggacaacttc ctggacaaac tgtaggcagc
     1561 ctgatactgc cacatgttgc acagtgcctg cccatcagaa gacaccttga gactgtcccc
     1621 acagtgctcc tctggaggct gttggggcca ccctgacatc actcagcact cacttggcta
     1681 ccaattctat ttatttcaga attacaaaat agcgggaatc tctctgcagg ctggactggc
     1741 aggccgtggg gtaggcggac atggctctta acattttcag agggaaacac gcagatgtcc
     1801 aaaagtctaa ataaatgcat tcagaggttt atgaggtcca tggccaagtg gagctccccc
     1861 agagggggag gtggggtaag tacctccagg aaggcaggca gcctgcctta gacttgcaac
     1921 ccggctgtgc gaatcattgg agtaataaac tacagtggtt gatctgaaaa aaaaaaaaaa
     1981 gaaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa
     2041 aagaaaacaa accaaaagaa aaagagtttc catgatgtcc aggatgatgc aggtgggaag
     2101 ggctagctca ggttagaagg gctagctcag gtctctctta ccctgctcta actcagcctg
     2161 cttctggagg agagacaagt caaagagcag caggagctt
//
LOCUS       MUSELF2XX    2161 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse elongation factor 2 related sequence.
ACCESSION   X15337 J04749
KEYWORDS    elongation factor 2.
SOURCE      Mouse cell line Ltk- DNA, clone MER7.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2161)
  AUTHORS   Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and
            Uchida,T.
  TITLE     Amplification of a long sequence that includes a processed
            pseudogene for elongation factor 2 in the mouse
  JOURNAL   Genomics 6, 80-88 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Koide, 31-AUG-1989.
FEATURES       from  to/span     description
    pept.ps     334     2054     elongation factor 2 related sequence
    rpt         334      345     tandem repeat
    rpt        2043     2054     tandem repeat
BASE COUNT      575 a    544 c    581 g    461 t
ORIGIN      
        1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca
       61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct
      121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaacaaaa
      181 atgtagtttg aaaaagggag cttgcctgcc tatcatttta gtgcctgtga ggtggagatg
      241 tgaggatcag taagtagttc aaggttattc ttggctgttt agtgagtgtt ggctatccta
      301 ggctacataa gtccctgttt caagaaaaca aaccaggcca ctgtaggcgc gcttccctgt
      361 tcacctctgg ctctgagaat ctgtcgccat ccaccaccat ggtgaacttc acagtagttc
      421 agatccgtgc catcacggac aggaaagcca acattcagaa catgtcagtc atctccatgt
      481 ggaccacggc aagtccacgc tgaacgactc ccttgtgtgc aaggctggca tcattgcctc
      541 cgccggagcc ggggagacac gtttcactga catgcgcaaa gatgagcagg agcgctgcat
      601 cacaatcaag tccaccgcca tctccctctt ctacgagctc tctaagaacg acctgaaccg
      661 cattaagcag agcaaggatg gctcgggctt cctcatcaac ctcatcgact ctcccggcca
      721 tgtggacttc tcttcagagg tgacagctgc cttgcgtgtc accgatggag ctctggtggt
      781 ggtggactgt gtgtctggcg tgtgtgtgca gacagaaatt gtgctgcgcc aggccatcgc
      841 tgagcacatc aagcccgtcc tgatgatgaa caagatggac cgggcccctg ctggagctgc
      901 agctggaacc cgaggagctc taccagacct tccagcgcat cgtggaaaac gtcaacgtca
      961 tcatctctac cgatggcgag ggtgagagtg ggcccatggg caatatcatg attgaccccg
     1021 tcctgggtac cgtaggcttt ggttctggcc tgcatggctg ggccttcacc ctgaagcagt
     1081 ttgcggagat gtacgtggcc aagtttgcag ccaagggcga gggccagctg ggtgcaactg
     1141 agcgtgccaa gaaagtagag ggcatgatga agaagctgtg gggagaccag tactttgatc
     1201 tgccctcatg gagcctatct atctggtgga gatccagtgt cctgagcaag tggtgggtgg
     1261 catctatggt gtcctgaaca ggaagcgtgg ccatatgtcc caggtggccg gtacccccat
     1321 gtttgtggtc aaggcatacc tgcctgtcaa tgaatccttt ggcttcactg ctgatctgcg
     1381 atccaacact agcagccagg ccttccccca gtgcgagttt gaccactggc agatcctgcc
     1441 tggggatcct tttgtcaaca gcagccgccc cagccaagtg gtagctgaga cctgcaagca
     1501 caagggcctg aaagagggct ggacaacttc ctggacaaac tgtaggcagc ctgatactgc
     1561 cacatgttgc acagtgcctg cccatcagaa gacaccttga gactgtcccc acagtgctcc
     1621 tctggaggct gttggggcca ccctgacatc actcagcact cacttggcta ccaattctat
     1681 ttatttcaga attacaaaat agcgggaatc tctctgcagg ctggactggc aggccgtggg
     1741 gtaggcggac atggctctta acattttcag agggaaacac gcagatgtcc aaaagtctaa
     1801 ataaatgcat tcagaggttt atgaggtcca tggccaagtg gagctccccc agagggggag
     1861 gtggggtaag tacctccagg aaggcaggca gcctgcctta gacttgcaac ccggctgtgc
     1921 gaatcattgg agtaataaac tacagtggtt gatctgaaaa aaaaaaaaaa aaaagaaaag
     1981 aaagaaagaa agaaagaaag aaaagaaaac aaaccaaaag aaaaagagtt tccatgatgt
     2041 ccaggatgat gcaggtggga agggctagct caggttagaa gggctagctc aggtctctct
     2101 taccctgctc taactcagcc tgcttctgga ggagagacaa gtcaaagagc agcaggagct
     2161 t
//
LOCUS       MUSTHBS1     4915 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse thrombospondin (THBS1) gene, exons 1-7.
ACCESSION   J05605
KEYWORDS    thrombospondin.
SEGMENT     1 of 2
SOURCE      Mouse (strain BALB/C) DNA, clone phi-42.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4915)
  AUTHORS   Bornstein,P., Alfi,D., Devarayalu,S., Framson,P. and Li,P.
  TITLE     Characterization of the mouse thrombospondin gene and evaluation of
            the role of the first intron in human gene expression
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Bornstein, 28-JUN-1990.
FEATURES       from  to/span     description
    pept       1222     1288     thrombospondin, exon 2 (first expressed exon)
               1580     2139     thrombospondin, exon 3
               2947     3022     thrombospondin, exon 4
               3267     3466     thrombospondin, exon 5
               3561     3683     thrombospondin, exon 6
               4439  +  4532     thrombospondin, exon 7
    pre-msg     484  >  4915     thrombospondin mRNA and introns
    IVS         667     1193     thrombospondin intron A
    IVS        1289     1579     thrombospondin intron B
    IVS        2140     2946     thrombospondin intron C
    IVS        3023     3266     thrombospondin intron D
    IVS        3467     3560     thrombospondin intron E
    IVS        3684     4438     thrombospondin intron F
    IVS        4533  >  4915     thrombospondin intron G
BASE COUNT     1188 a   1342 c   1100 g   1282 t      3 others
ORIGIN      Chromosome 2, region F.
        1 gggatccccc acccaccagg attgactcag gaggcagctt cctttaaggg gtccaacctt
       61 ttgccctctt ttatttttaa accagactgt ctcgatcccc aagcgtctct gtcatggaat
      121 tattcaagga gatgtgcttt aatgaaagcc tccctaaagg gtcttaggtg gtccccaaag
      181 aagcatcgcg tctggggact ctcatgcaga acgtctccag ttcacatggc gcaagatcct
      241 aagcgctaaa ggctgagtac gccaaggctg cgtgggcgga gacctatttt ttctgacaag
      301 ttccaggggc tcctgtgcgg gatcggagtc tcccccttca ctttcagccc gagagctgtg
      361 cgccaagcag cagggggcgg agaaatgaac ccgtccgtcc ccgtcccccg cccccgcccc
      421 cagaaccctc ccccctccct gcttcctgcc tctttaaaag gtcctgggct cctcagtcaa
      481 gccagccact gcctggagtc agccagcctc atcggacttc tgcaggcaat cgcgaagctg
      541 ctatccagtt ctgccacggt ctctcccggc gcaccggcag tctcagcgtc ttcaccggac
      601 tcagcgtcct tgtccttcac ttcacctttg ccacctctcc gggttactga gccccggtgc
      661 acacaggtaa acctccgcct ctgcggagca cctggccggc cacctcaaag ccccaggctg
      721 cagctgcaga tgatcagcgc ttctcctcat ctcctactct tccccttctg gttttcctaa
      781 ctcctttatc ctgccccgtc tatcccctgg aaagaacttt ccggagctgg ttctcttcag
      841 ccttcgggtt cacaaactat acctccaaga caccccaaga ttttctcccc agtagtctcc
      901 aggttacccc tgactgtgcc atccaaaata cattctgtaa tctgtctaaa ggggcggggg
      961 ggggagggac ggacccaagt cactttatac cctttatcct ggaactccaa caatccccct
     1021 gcctctggcg ctctttttat actggatacc gtttggaaag taaacccttc tttttcagca
     1081 catatggggc aaaagaagaa acacacacac acacacacac acacacacac acctctgccc
     1141 ttccaccttg cttcctggtc tctgactctg gcctcctgtg cttcttgcta caggctccgt
     1201 gttgggcaca aaggctccac catggagctc ctgcggggac taggtgtcct gttcctgttg
     1261 catatgtgtg gaagcaaccg cattccaggt gagtttgtga tatatcattt tcctccccct
     1321 tctcctctct ctctctctct ctttctctct ctctctctct gctgctgctc tctctctctc
     1381 tctctctctc tctctctctc tctctcgaag ctcctgtgtt tccttggagt ccttatctcc
     1441 tgataaggag gcagcttccc tcgctcagct ttccataagt gccattattg tcacgagttc
     1501 ccaccttggc tcctcacagc cagaacttta cccccttata cctagcctag aaagccgatt
     1561 gtcttctctc tccttacaga gtctggggga gataacggtg tgtttgacat ctttgaactc
     1621 attggaggtg cacgaagggg ccccggtcgc cgactggtga agggccaaga tctatccagc
     1681 cccgccttcc gcattgagaa tgccaacctg atccccgctg tgccggatga caagttccaa
     1741 gacctactgg acgctgtgtg ggccgacaaa ggcttcatct tcttggcttc cttgaggcag
     1801 atgaagaaga cccggggcac actcctggct gtggaacgga aagacaacac tggccagatc
     1861 ttcagtgtgg tctccaacgg caaagctggc accctggacc tgagcctgag cctgccaggg
     1921 aagcaacaag tggtgtcagt ggaggaagct ctcctggcca ctggccagtg gaagagcatc
     1981 acgctgtttg ttcaagagga ccgggctcaa ctctacatag actgtgataa gatggagagc
     2041 gcggagctgg atgtacccat ccagagcatc ttcaccaggg atctggccag cgttgccagg
     2101 ctccgagttg caaagggaga tgtcaatgac aattttcagg taaatcttct tacctgtcac
     2161 aagggctatg tggaattaat atcggtagac taaaaatgat tctcaaatgn taaagaggaa
     2221 cagacacccc cagaaggttg gttaaatcat attacagatg acccaaaact ccagggnttt
     2281 cagatttgat agggacccga gaaattttnt tcctaataag aaataagttg ctaatggtgc
     2341 caggacctgg gcacccatgc tttcaaagct atcaggagag ataagaaatg gttttcctat
     2401 tactacagcc tgtttcagaa tggggtactc cctgtgccct taggaaagta acttgtatca
     2461 aatgtaccta gcagttttaa tagggcatga aaaggatact ttgtcagtat ttgagtaaat
     2521 gggcagggtg ttctgggtgc tttcagtatg gatctaaggc aagacatgaa taacagtcca
     2581 gtcatgtgcc tctgggactc cttacttgcc ttaactttca gatatttttt tttcttacaa
     2641 gcgtggcttt gaacgcccac attaactctt tgtcactgga gagtatcatg tcctccagac
     2701 ttgggatcaa attccatttc ctgtgtaccc caagcagcct gaaaactgga ttctttgtaa
     2761 aatgctccag ttggcagaag acacaattaa gattttgaag ttgttcggca atttatgtct
     2821 ctgtgatagt ttggcaggat gatcttaagt tcttttaaac tttctgagcg ttcaccacaa
     2881 gcaaactcag actggccgcc ccacccccac gcccagcatt aaaccacgtc tgtcctctgc
     2941 ccacaggggg tgctgcagaa tgtgaggttt gtctttggaa ccaccccaga agacattctc
     3001 aggaacaaag gctgctccag ctgtgagtac ccctgtgttt tttaagattt ggagaattct
     3061 ggggaagtcc accgaaagca ccctgattaa tgtaagcagt agggtgcata cacaatatct
     3121 cagggggagc agagttggct tgcacccact ctgtcatgcc aaaggaaatg tcttttataa
     3181 cccagaaccc gaagggtagg ctggtaggat cagctgttga aaccttgaat cagctatcga
     3241 atacgttact tcttatttct tagcagctac caacgtcctt cttacccttg acaacaacgt
     3301 ggtgaacggt tccagccctg ctatccgcac caactacatc ggccacaaaa caaaggacct
     3361 ccaagctatc tgtggcctct cctgtgatga actatccagc atggtcctgg aactgaaggg
     3421 cctgcgcacc atcgtgacca ctctgcagga cagcatccga aaagtggtca gtggcctccc
     3481 ctcccatttg ccagctcttg gaaatcacca caggtggtcc cacgggagcc tggacactca
     3541 tgagccttct cccctttcag acggaagaga acagagagct ggtcagtgag ctgaagcggc
     3601 ctcccctctg ctttcacaat ggagtccagt acaagaacaa cgaggagtgg actgtagaca
     3661 gttgcacaga gtgtcactgc caggtaagag acactcacag actgcaataa gacttggaga
     3721 catatgtctc gggagatacg taaatgacta cacacatttt taaatgtgct ctggcctgct
     3781 accaaagtga ttttttttct ttaatatgta atgacatctg caaattcctc tttaaatatc
     3841 agaaggttta tcttaataat accccaggct gagcagctaa gagaacaaat ctatctaaac
     3901 gttggaatga cttacctgct ttagttcaaa gaccaagtat tgagtgtcag atacataaaa
     3961 cccgcttaaa gcattcggtt ttggccagcc ctaatttgta ttcagctctg gcaggcagat
     4021 tcttctaaca ctgttcggat ctctggctca ccgggaaaag tttaaatgca aatcactggt
     4081 tactgaattt gagacacaac ttcaattaag ttcccgtctc ccttcaatat gtgataaaaa
     4141 cccttatcta cctttgagat gcttactgat acctgcaatg tccgttgctt tcccttatgc
     4201 aaatgtgttg ttctgtaact tccacccctg actgagtagc agttaaaaag aactttccca
     4261 aagaatggag tgagaattag ccctggaact gtatacctct ggggactaca gatccatgag
     4321 acatccctcc tataaaagcc acccctcctg tgtcctggct tataatggca atgtcacaga
     4381 aagctcctct ctgcaatacg cttggtaaca tcgttcctct ccttctcttg ccttgcagaa
     4441 ctcggttacc atctgcaaaa aggtgtcctg tcccatcatg ccctgctcca acgccacagt
     4501 tcctgatggt gaatgctgcc cacggtgctg gcgtaagttc ctaaaatggt gtagccatcc
     4561 ctcagtggct agctcagtct gggggcatcc tgacagctgg gcacaacacc agagcccagc
     4621 ccctcccagc tccatcagac gttccagtca gatctgggtt atggcatagg caatggatga
     4681 tcttagcaga gcttatggat tctggtttta tgctataagg agagagatgg atacagcagt
     4741 agacttaagc tggttgcttt tagcaattga tctttgttga aaaaaatgat ctactggaga
     4801 acggacagaa actgtgtaag cattgggctg aggcttgaaa agccacctgt tttatctgca
     4861 gtgagaggag gcctaaggct tagcatctgt tcattcaggc agtaaagaaa tcagc
//
LOCUS       MUSTHBS12    1145 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse thrombospondin (THBS1) gene, exons 8 and 9.
ACCESSION   J05606
KEYWORDS    thrombospondin.
SEGMENT     2 of 2
SOURCE      Mouse (strain BALB/C) DNA, clone phi-42.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1145)
  AUTHORS   Bornstein,P., Alfi,D., Devarayalu,S., Framson,P. and Li,P.
  TITLE     Characterization of the mouse thrombospondin gene and evaluation of
            the role of the first intron in human gene expression
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by P.Bornstein, 28-JUN-1990.
FEATURES       from  to/span     description
    pept    +   297      470     thrombospondin, exon 8
                969  >  1145     thrombospondin, exon 9
    pre-msg <     1  >  1145     thrombospondin mRNA and introns
    IVS     <     1      296     thrombospondin intron G
    IVS         471      968     thrombospondin intron H
BASE COUNT      302 a    294 c    260 g    289 t
ORIGIN      820 bp after segment 1.
        1 tgaccgtagt gctagattgc tgtgctgtga gagagatgtg tgatgctcca ttatcattat
       61 cagtcagact cttgatagcc aagcgtctga gagtacataa atgtacacaa tctctagttg
      121 ttaatcatgc ttccattaaa ctaagagtat gaaaaaagaa ggactttttt taaaaaaaga
      181 aacctattgc tttaccacct ctggctataa agggccataa gcatataagt aaataaactg
      241 cctaagaaca gtttatctta gacatgagat gctaacgacc tcatttctcc ccacagccag
      301 cgactctgct gacgatggct ggtctccctg gtctgagtgg acctcctgct ctgccacatg
      361 tggcaatgga attcagcaac gtggtcgttc ctgtgacagc ctcaacaaca gatgcgaggg
      421 ctcttcggta cagacgagga cctgccacat tcaggagtgt gacaaaagat gtaagcatca
      481 tggctgccaa gggtggcaat ctcccttgcc cagtgcttcc ctcacaaaga gcagctccgg
      541 agcggggagc ctgagaatgc aggttctgag cctgcggctc tcagtctcaa tcacacctct
      601 aacctttgca ctcctgggct tgaccttcct tccctgaggc tcctcctcat ctgtaacgag
      661 agaacaataa ccattcctga taaagttact ttagccattt ctctagataa ttcatatgaa
      721 acaatcagaa tggatctggg ctcctgacca cgaacctata aatattaggt tgttgacagc
      781 aactcacaca cacagacaaa taccatcttc cctatagttt ctccactgaa gtgtggcagt
      841 agccagtttc tacaatacac tagttgaagc tgcttttggt gggctgctgg aatgattgcc
      901 tgctgagact tgctgtgtgg ccccctccaa ggcagcaccc attggtgaca atcaactttg
      961 cccttcagtt aaacaggatg gtggctggag tcactggtct ccatggtcgt cctgttctgt
     1021 gacctgtggt gacggtgtga tcacaaggat ccgtctctgc aactccccca gcccccagat
     1081 gaacgggaag ccctgtgaag gtgaagcccg ggagaccaaa gcctgcaaga aagacgcctg
     1141 cccaa
//
LOCUS       MZERRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  Z.mays ribosomal RNA.
ACCESSION   M37178 M35661
KEYWORDS    ribosomal RNA.
SOURCE      Z.mays rRNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       12 a     17 c     10 g     11 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagctatac ggagccatcc ctccgtagtt
//
LOCUS       PNURRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  P.nudum ribosomal RNA.
ACCESSION   M37179 M35661
KEYWORDS    ribosomal RNA.
SOURCE      P.nudum rRNA.
  ORGANISM  Psilotum nudum
            Eukaryota; Plantae; Thallobionta; Psilotophyta; Psilotopsida; 
            Psilotales; Psilotaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       11 a     16 c     12 g     11 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagttacgc gaaggatcct cttcgtggcc
//
LOCUS       RATRGK5      1883 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Rat voltage-gated potassium channel protein (RGK5) gene,
            complete cds.
ACCESSION   M30312
KEYWORDS    voltage-gated potassium channel protein.
SOURCE      Rat (Sprague Dawley) adult liver DNA, clone lambda RGK5.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1883)
  AUTHORS   Douglass,J., Osborne,P.B., Cai,Y.-C., Wilkinson,M.,
            Christie,M.J. and Adelman,J.P.
  TITLE     Characterization and functional expression of a rat genomic DNA
            clone encoding a lymphocyte potassium channel
  JOURNAL   J. Immunol. 144, 4841-4850 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.O.Douglass 01-DEC-1989.
FEATURES       from  to/span     description
    pept         35     1612     voltage-gated potassium channel protein
BASE COUNT      380 a    567 c    489 g    447 t
ORIGIN      
        1 ctgcagagcc aggctacgcg agctgccgcc agacatgacc gtggtgcccg gggaccacct
       61 gctggagcca gaagcggcgg gaggcggcgg cggggacccg cctcagggag gctgtgtcag
      121 tggcggcggc tgcgaccgct acgaaccgct gccgcccgcg ctgcccgccg cgggcgagca
      181 ggattgctgc ggggagcgcg tggtcatcaa catctccggg ctgcgcttcg agacacagct
      241 caagaccctc tgccagttcc ctgagacgct gctaggcgac cccaagcggc gcatgcgata
      301 cttcgacccg ctccgcaatg agtacttctt cgaccgcaac agacccagct tcgacgccat
      361 cctctactac taccagtccg ggggccgcat ccgccggccg gtcaacgtgc ccatcgacat
      421 cttctccgag gagatccgct tctaccaact gggtgaggag gctatggaga agttccgtga
      481 ggacgagggc ttcctgcggg aggaggagcg acccctgccc cgccgtgact tccagcgcca
      541 ggtgtggctg ctcttcgaat accccgagag ctcgcggccg gcccggggca ttgccatcgt
      601 gtcagtgctg gtcattctca tctccattgt catcttctgc ttggagacac tacccgagtt
      661 tcgcgacgag aaggactatc ccgcctctcc gtcgcaggac gtgtttgagg ctgccaacaa
      721 cagcacgtcg ggggcctcct ctggagcctc cagcttctcg gaccccttct tcgtagtgga
      781 gaccctgtgc atcatctggt tctcctttga gctgctggtg cgattctttg cttgccccag
      841 taaagccacc ttctccagaa atatcatgaa cctgatagac attgtagcca tcatccctta
      901 ttttattact ctgggcactg agctggctga gcgacagggt aatgggcagc aggctatgtc
      961 actggccatc ctgagggtca tccgcctagt aagggtcttc cgcatcttca agctctcccg
     1021 ccattctaag gggctgcaga tcctgggaca gacactgaag gcttccatgc gagagctggg
     1081 gctgctcatt ttcttccttt tcattggggt catccttttc tccagtgcag tctactttgc
     1141 tgaggcagac gacccttctt cgggttttaa cagtatcccg gatgccttct ggtgggccgt
     1201 ggtaaccatg acaactgtcg gttatggtga tatgcaccca gtgaccatag gaggcaagat
     1261 tgtgggctct ctttgtgcca tcgcaggtgt cttgaccatt gcattgccgg ttcctgtgat
     1321 tctttccaac ttcaattact tctaccaccg ggagacagaa ggggaagagc aagcccagta
     1381 catgcacgtg ggaagctgcc agcacctctc ctcttcagca gaggagctcc gaaaagcccg
     1441 gagtaactcc actctgagta agtcggagta tatggtgatc gaagaggggg gtatgaacca
     1501 cagcgccttc ccccagaccc ccttcaaaac gggcaactcc actgccactt gcaccaccaa
     1561 caataatccc aactcctgtg tcaacatcaa aaagatattc actgatgtct aatagatgat
     1621 acgattgcca ttctgtgccc agtattgtgt ggaacatgcc cccttggtct gtgtatgccc
     1681 ttgatttata catttccaga ccattcatca aggaaagtac atgaagaagt ggaaaagcac
     1741 acttcattct ccctctccct attgcttcat actgaaacag gtgcctggtt ttgcaagtgg
     1801 gctcattctc tcagctcttt ttctctctcc ctctctctct ccctgtttct taattttgtg
     1861 aacaacaaac ttacattaag ctt
//
LOCUS       RICRRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  O.sativa ribosomal RNA.
ACCESSION   M37173 M35661
KEYWORDS    ribosomal RNA.
SOURCE      O.sativa rRNA.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       11 a     19 c     11 g      9 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS       SAHRRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  S.officinarum ribosomal RNA.
ACCESSION   M37174 M35661
KEYWORDS    ribosomal RNA.
SOURCE      S.officinarum rRNA.
  ORGANISM  Saccharum officinarum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       11 a     19 c     11 g      9 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS       SRGRRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  S.bicolor ribosomal RNA.
ACCESSION   M37175 M35661
KEYWORDS    ribosomal RNA.
SOURCE      S.bicolor rRNA.
  ORGANISM  Sorghum bicolor
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       11 a     19 c     11 g      9 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS       TRDRRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  T.dactyloides ribosomal RNA.
ACCESSION   M37176 M35661
KEYWORDS    ribosomal RNA.
SOURCE      T.dactyloides rRNA.
  ORGANISM  Tripsacum dactyloides
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       11 a     17 c     11 g     11 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgtagtt
//
LOCUS       WHTRRUN        50 bp ss-rRNA            RNA       03-AUG-1990
DEFINITION  T.aestivum ribosomal RNA.
ACCESSION   M37177 M35661
KEYWORDS    ribosomal RNA.
SOURCE      T.aestivum rRNA.
  ORGANISM  Triticum aestivum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 50)
  AUTHORS   Hamby,R.K. and Zimmer,E.A.
  TITLE     Ribosomal RNA sequences for inferring phylogeny within the grass
            family (Poaceae)
  JOURNAL   Pl. Syst. Evol. 160, 29-37 (1988)
  STANDARD  simple staff_review
COMMENT     The ribosomal RNA sequences given are either 26S or 18S, but it is
            not specifically delineated in the article.
FEATURES       from  to/span     description
    rRNA    <     1  >    50     ribosomal RNA
BASE COUNT       11 a     19 c     11 g      9 t
ORIGIN      
        1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct
//
LOCUS       SUSFCK       3985 bp ss-mRNA            INV       03-AUG-1990
DEFINITION  S.purpuratus flagellar creatine kinase mRNA, complete cds.
ACCESSION   M33763
KEYWORDS    creatine kinase; flagellar creatine kinase.
SOURCE      S.purpuratus sperm, cDNA to mRNA, clone WD2.2.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; 
            Echinoidea; Echinacea; Echinoida; Strongylocentrotidae.
REFERENCE   1  (bases 1 to 3985)
  AUTHORS   Wothe,D.D., Charbonneau,H. and Shapiro,B.M.
  TITLE     The phosphocreatine shuttle of sea urchin sperm: Flagellar creatine
            kinase resulted from a gene triplication
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5203-5207 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.D.Wothe, 18-APR-1990.
FEATURES       from  to/span     description
    pept         95     3619     flagellar creatine kinase (E.C. 2.7.3.2)
    signal     3961     3966     poly-A signal
BASE COUNT     1184 a    923 c    910 g    968 t
ORIGIN      
        1 ccctaaactt caatcacact tctaatcaag aaaggaacaa atttgtatta attatttcat
       61 tttctacata tgcattgtaa tcagtgaccc agcaatggga tgcgcagcat cgtcacaaca
      121 aacgacggcc acaggtggtc aaccagctgc tggtgaaaag gcaaacccgg cccccgctaa
      181 taacaatcca aacgcagcaa acaaagcaga aacgacagga gcagctgaag aactcactaa
      241 agaatctgaa ccatttgtgg aaccggatcc caattaccct gacctaagca agcacaacaa
      301 ttacctggca gaatctttaa cgccttccat ctacaacaaa atctgtaacc ttcgcacact
      361 ttctggttac tctgtggatg gttgcatgca gaccggcgtg gacaatccag gacatccttt
      421 catcaagact gtcggattgg tggctgggga tgaagagtgt tacgatctct ttgctgacct
      481 ctttgatcca actattgaca aaagacacaa cggatatccc cgcaatgcaa agcacacaac
      541 tgatctcaac cctgatcatc tcaagggtgg tgatgacttt gatccaaagt atgtcctttc
      601 atgcagagtt cgtacaggcc gctgcattag agggtacggc ctcccaccac attgcacgag
      661 ggctgaacgt cgagacgttg agaaagtttg caaagacgca ctggcaactc tagatgggcc
      721 actcaaaggc acgtattacc cactaacggg gatgacagag gaaatgcagg acaaactaat
      781 tgcagatcat ttcctattcg ataagcccgt ttctcctttg cttatgtcag ctcgcatggc
      841 ccgtgattgg cctgatggac gtgggatctg gcacaatgcc gataaaaact tcctggtttg
      901 gataaatgag gaagatcaca ctcgggttat ttccatggaa acaagtggca acatgaagaa
      961 tgtctttaaa cgattctgta atggtctcaa caaagtcgag aatgctctca aagcaaaagg
     1021 gtatgaattc tcatggaatg agcatcttgg gtatgtgctc acttgtccat ccaaccttgg
     1081 tacaggtgta cgtgcaggag tccacatcaa gattcctctt ttcagcaagc acgctggatt
     1141 tgaatccatt ctgaagcatt atcgtctcca gaaacgtggc acaggaggcg tggatacggc
     1201 atccactgac ggcacgttcg acatttcaaa cctagatcgt cttggcactt ctgaagtaca
     1261 acaggttcaa tccgttgttg atggtgtcaa gaagctgatt gagttggaaa aagccttaga
     1321 aaagggatca gatatcagcg gccagattcc ccgggaccca gcaatagttc gtgcagaaca
     1381 agtgaaagaa ggatatccag acctatccaa acacaacaat catctggccc actgtctgac
     1441 atacgacatt tggaaaagtc tcaaggacaa aaagacgcca agtggcttta ctctggatgg
     1501 ctgcatccaa acgggagtca tgaaccctgg tcatcctcac atcatgaccg ttggaatggt
     1561 tgcaggagat gaggaatcat atgacgtctt cgctgatatc tttgacccgg tcattgatgc
     1621 acgacacgga ggttatccca aagatgccgt ccatgtcacc aacataaacc atgcagatct
     1681 taaaggagga gataacctgg atcccaaata tgttttgtct tgccgtgttc gcacaggacg
     1741 cagcatcatc gggtattctc tgcccccgca ttgcaccgtg gaagagagag ctgccgtaga
     1801 gacgatcacg atcggagctc ttgataaatt tgatggagac ttgcaaggaa aatactaccc
     1861 actggaagga atgtctgatg aaacacaaac acagctgata gatgaccatt tcctcttcga
     1921 taagcctgtt tctcctcttc ttaccgccgc aagaatgcat cgtgattggc cacagggaag
     1981 aggaatttgg cataacgaaa acaagaactt cttggtgtgg gtcaatgaag aagatcatat
     2041 ccgtgtcatc tctatggaga aggatgggaa tatgcgagcg gtcttcaagc gattctgtga
     2101 aggtttgcag aagttcgagc agatgatcaa aaaggatggc aaggagttca tgtggaacaa
     2161 acatctgggt tacgtactta cttgtccgtc taacctggga accggcctac gcgctggcgt
     2221 acatgtgaaa ctgcctctgc tgagcaagta tccgagattc gaccagatcc taagagccct
     2281 ccgacttcag aagagaggta caggtggagt agatactgcc tcaactgatg gaaccttcga
     2341 catctcgaac ctcgacaggc ttggatcatc agaagtacag caagttcagt ttgttgtcga
     2401 tggcgttgaa cttctcgtcc aaatggagaa gaaacttgaa aagggcgagg acatttttga
     2461 tattctacca cagcaatgcc gtcccaaacc accaataaag cctttcagtt atgactaccc
     2521 cgatttctct ctgcacaaca actggatgtc taaatgtatg acagaagaaa tctacaacaa
     2581 actttgcaac ttgaagacaa aaggtggagt caccctgaat gactgcattc aaactggcat
     2641 tgacaatcct ggacatccat acatcatgac tgttggtttg gtagctggag atgaagagtg
     2701 ctacgaggtt ttcgctcctc tcttcgaccc ggtgataagt gccaggcacg gcggctacgc
     2761 tcttgatgcc aaacacccta ccaacctcaa tgcagcagaa ctcaaaggcg gagatgattt
     2821 ggaccctgag ttcgtcctat cttgtcgtgt acgtactggt cgttgcatcc gcggcttggc
     2881 tctcccgcca tgttgcacca gagcagaacg tgcggaagta gagaagatta caacagaagc
     2941 attatctacc ctaagtggac ctctgaaggg aaagtactac ccactaaccg gtatgacaga
     3001 cgaagagcaa gagaagctca tcgaagacca cttcctcttt gacaagcccg tatccccgct
     3061 gttgctttgt gccaatatgg cacgtgactg gccccagggt cgtggcatat ggcacaatga
     3121 tgagaagaat ttccttgtct gggtcaacga ggaagatcac actcgagtca tctccatgga
     3181 gaagagcggg aacatgaagc gagtgtttga gaggttctgc gacggtctca aaaaggtcga
     3241 agactccatc aagtccaagg gttaccagtt catgtggaat gagcatcttg gttatgtact
     3301 gacatgtcca tccaacctcg ggacgggact gcgtgcaggc gtccatgtca aggttccttt
     3361 gttaagtcaa cagaaaatct ttgactctat ccttgatcac atgcgtctgc aaaagcgagg
     3421 aacaggaggt gtagacacag cctctactga cggcaccttt gatatctcca actcggatcg
     3481 catcggtttc tccgaagtcc atcttgtcca acaactcgta gacggtgtca agctcctcgt
     3541 caacctcgaa aaggctctca tgaagggaga agacatcaac agtctcctgc cagagaagct
     3601 aagggaagat tcctcataaa gccaggatcc acagcttgca gaatagtggc cgttgtcgcg
     3661 caagtgttat aaatgccacc tttagaagat gtagcctaca cattaaaaaa aaatacaatt
     3721 agaattgaaa tctataagga cgatttgaat ataccataaa gatttgccca tgattttgaa
     3781 tcaaaataac ttcatgctga acttgtaaat gaaaaagtaa tgttagaaaa ctctgtaaat
     3841 tgggaagata taaaatttca tggccgtttt tctttctgat gaagtacact aattatatta
     3901 atttctctta ttaatatgta tggaaacttt taattatgta aaaacaagga aacgaactga
     3961 aatacaattc aaactacttt tcatt
//
LOCUS       BSUSACP      3518 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  B.subtilis sacT gene, complete cds, enzyme II (sacP) gene (of the
            sucrose phosphotransferase system), complete cds, and sucrase gene,
            5' end.
ACCESSION   J03006 M33761
KEYWORDS    enzyme II; sucrase; sucrose transferase system.
SOURCE      B.subtilis (strain Marburg 168) DNA, clone pBSG8-10.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1987 to 3518)
  AUTHORS   Fouet,A., Arnaud,M., Klier,A. and Rapoport,G.
  TITLE     Bacillus subtilis sucrose-specific enzyme II of the
            phosphotransferase system: Expression in Escherichia coli and
            homology to enzymes II from enteric bacteria enzymes II from
            enteric bacteria
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 8773-8777 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2070)
  AUTHORS   Debarbouille,M., Arnaud,M., Fouet,A., Klier,A. and Rapoport,G.
  TITLE     The sacT gene regulating the sacPA operon in Bacillus subtilis
            shares strong homology with transcriptional antiterminators
  JOURNAL   J. Bacteriol. 172, 3966-3973 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed copy of sequence for [1] kindly provided by
            A.Fouet, 16-SEP-1987.  Draft entry and computer-readable sequence
            for [2] kindly submitted by M.Debarbouille 18-APR-1990.
FEATURES       from  to/span     description
    pept        136      897     sacT protein
    pept       2008     3390     enzyme II (sacP)
    pept       3387  >  3518     sucrase
BASE COUNT     1013 a    712 c    867 g    926 t
ORIGIN      
        1 ctttacatgg cggtcaaaag gaggtaaaat cccagtcaaa agcaaattgg cgggagagat
       61 aacctcttga aaatctataa agtattaaac aacaatgcgg ctttaataaa agaggatgat
      121 caggaaaaaa tcgtgatggg accgggaatc gcttttcaaa aaaagaaaaa tgatctcatc
      181 cctatgaata aagtggaaaa gatctttgtc gtgcgcgacg agaatgaaaa gttcaaacaa
      241 atcctgcaaa cactgccgga ggagcatata gaaatagctg aggatatcat cagctatgcg
      301 gaaggagagc tcgcggcgcc gctgagcgat cacattcata tcgcgctttc cgaccatttg
      361 tcctttgcga tcgaaaggat tcaaaatggg ctgcttgtgc agaataaatt gctgcatgag
      421 ataaaggcgc tctataaaaa agaatatgag atcggcctgt gggcgatcgg acatgtaaaa
      481 gagacattgg gcgtgtctct gcctgaagat gaagcgggtt atattgccct tcacatccat
      541 acggcgaaga tggatgcgga gagcatgtat tcagcgctga agcatacgac catgatcaaa
      601 gaaatgatag agaaaataaa acaatacttt aatcgaaagg tggatgaaaa cagcatttcc
      661 tatcaacgcc ttgtcacgca tttgcgatac gcggtcagcc ggttggaatc aaatgaagcg
      721 ctccatcgta tggacgagga gatgctttat ttcatccaaa aaaagtattc attcgcctat
      781 caatgtgcgc tggagctggc ggagtttttg aaaaatgaat atcaattaca tttgccggaa
      841 tccgaggccg gctatatcac gctgcatgtc cagcgtcttc aagatctctc ggaataaccg
      901 ctttgacttg cagggagtga tctctggaag tttttttatt gatcaggatt cagtgtgtag
      961 tacatccagc ccatcataac tattcccgcg gtcagattgc cgagcgtaac ggggattaag
     1021 tttctgactg ctcccatcag tgtcactgta tcagggtgct cgatcaaaag cgagatggcg
     1081 aatgtgcaca tattggcaat gctgtgttca aagccggaaa taaagaagca gaaaacgaaa
     1141 agcatcatgg taaaaagctt tgctccttcc cctttgagag acattggaat gaaaaaggcg
     1201 aggcacacaa gccaattgca cagcattcct ctgaaaaaca attcggaagc cggcggctcc
     1261 attttgtgct ctgccaaatg aatcaaaaag gaatgaacag aaggctcctc aaaaagtccc
     1321 gtcgcgctga tcaggatggc aaacagaatg gcgccgatta aattgccggc atagcttgac
     1381 atccacaagt atagggtgtc gcgccagctg atttttttcc ggagcgcggt ataggtgaaa
     1441 taaaaggtgt tgccggtaaa taaatctccg cctccgtaag caatcatcag aatagcggcc
     1501 ccgaaagtga cagcggctgc cggaaaggca aacggagaat cagccataaa gaaatagctg
     1561 cctgttttgc ttgcggctgt aatgccaaaa ccaataaata tactggccaa aatggaccga
     1621 agcacataac ggattttgct tgaagcgaat atgttttgtt ttttcaaagc atactgttca
     1681 accttttgta atgcttgagt ttccattgtc ttttcctgct ttctttcatg attgagcata
     1741 gtatggcacg atattgtgaa gtatttcaca attaattttt acgaattttt gtctacatgt
     1801 tatatcaaac tatttaacta aaaaaatagt tgacgaaaac gctatcatga tttatgatga
     1861 aagcgtattc ttaatctgaa taagcgggat tgtgactggt aaagcaggca agacctaaaa
     1921 tttgcgtaaa tgaaaaagga tcgctgtgtc ctttattcgt tggcgaattt taggtctttt
     1981 ttgtttaaat aaaaggggga gaaatagatg gattacaaag agactgcaaa acgcctcatt
     2041 gagcttctcg gagggaaaga aaatattatc agcgcggctc attgtgcaac aagactgcgt
     2101 ttagtgatga aagatgaatc aaagatagat caagcacaag tagaagagct tgatggcgtt
     2161 aaaggggctt tcagcagctc tggccagtac cagatcattt tcggaacagg ccttgtgaat
     2221 aaagtattcg atgccttttc taaagaagct gatatcgagc gtgaagaaca cgtcaatcat
     2281 caggatgcgg caaaagaaaa gctgaatccc gctgcgagat ttgcgaaaac gctttctaat
     2341 atttttgttc caatcatccc agctattgta gccagcggcc tattaatggg attgctgggt
     2401 atgatcaatg cgtttcattg gatgagcaag gattctgcgc ttttgcagct gcttgatatg
     2461 ttttcaagtg cagcattcat tttcttgccg attctaatcg gggtcagcgc ttcaaaagag
     2521 tttggcagca atccatactt gggagcggtc atcgggggaa tcatgattca tccgaatctt
     2581 ttaaatccat ggggattggc ggaagaacaa ctgattacat gcatcttttc ggatttgata
     2641 tcgctcttct cggctacagg gaactgttat ccctgtcctg cttgcggtgt atgtgatgag
     2701 caaggtagag aaatggacga gaaaagtggt tccacatgcg gtgatttgct tgttacacca
     2761 tttgttactg tgattgtcac cgggtttgtt gcttttatcg cgattggccc tctaggcaga
     2821 gcgcttggct ccggcattac ggttgcttta acttatgtat atgaccatgc cggttttgtc
     2881 gcaggtctga ttttcggggg cacgtattca ctcatcgtgc tgacaggcgt tcatcacagc
     2941 ttccatgcga ttgaggcagg gctaattgca gatataggca aaaactactt gctgccgatc
     3001 tggtcaatgg cgaatgtcgc acagggcggg gcaggtcttg ccgtcttctt tatggcgaaa
     3061 aaggcaaaga caaaagaaat cgcgcttccg gcagcatttt ccgcttttct cggcattact
     3121 gagcccgtca tattcggagt caatctgcgc taccgaaaac cgtttatcgc cgcgatgatc
     3181 ggaggcgcat tgggtggggc atatgtcgtc tttacacatg tagctgcaaa cgcttacggg
     3241 ttaacgggaa ttccgatgat cgccattgcg gctccgtttg gcttcagcaa cttgattcac
     3301 tatctgattg gcatggcgat tgcggcagtg tcagcgttca ttgcagcatt tgtcatgaag
     3361 attaatgagg atgaggagag gaaaaaatga cagcacatga ccaggagctt cgtcgccggg
     3421 cttatgaaga agtggagaaa aaagagccca tcgctaacag cgatccgcac cgccagcatt
     3481 ttcatatcat gccgccggtt gggctgctga atgacccg
//
LOCUS       BOVACTH      1084 bp ss-mRNA            MAM       03-AUG-1990
DEFINITION  Bovine corticotropin-beta-lipotropin mRNA, complete cds.
ACCESSION   M25587
KEYWORDS    adrenocorticotropic hormone; corticotropin-beta-lipotropin;
            lipotropin; melanocortin.
SOURCE      Bovine, cDNA to mRNA, clone pSNAC20.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1084)
  AUTHORS   Cohen,S.N., Chang,A.C.Y., Nakanishi,S., Inoue,A., Kita,T.,
            Nakamura,M. and Numa,S.
  TITLE     Studies of cloned DNA encoding the structure for the bovine
            corticotropin-beta-lipotropin precursor protein
  JOURNAL   Ann. N.Y. Acad. Sci. 343, 415-425 (1980)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        114      911     corticotropin-beta-lipoprotein precursor
    sigp        114      191     corticotropin-beta-lipoprotein signal peptide
    matp        342      377     gamma-melanocortin (gamma-MSH)
    matp        507      548     adrenocorticotropic hormone (ACTH)
    matp        630      908     beta-lipotropin
    matp        756      809     beta-melanocortin (beta-MSH)
    matp        816      908     beta-endorphin
    mRNA    <     1     1084     corticotropin-beta-lipoprotein mRNA
BASE COUNT      220 a    337 c    370 g    157 t
ORIGIN      
        1 aagagaacga agggaagaaa agtgaccgag acgccgctga acatcctcgc cccggcgcag
       61 cgggagccgc ccgagccagc ttccccgtga cagagcctca gcctgcctgg aagatgccga
      121 gactgtgcag cagtcgttcg gccgccctgc tgctggcctt gctgcttcag gcctccatgg
      181 aagtgcgtgg ttggtgcctg gagagcagcc agtgtcagga cctcaccacg gaaagtaacc
      241 tgctggcgtg catccgggcc tgcaagcccg acctctccgc cgagacgccg gtgttccccg
      301 gcaacggcga tgagcagccg ctgactgaga acccccggaa gtacgtcatg ggccatttcc
      361 gctgggaccg cttcggccgt cggaatggta gcagcagcag cggagttggg ggcgcggccc
      421 agaagcgcga ggaggaagtg gcggtgggcg aaggccccgg gccccgcggc gatgacgccg
      481 agacgggtcc gcgcgaggac aagcgttctt actccatgga acacttcccc tggggcaagc
      541 cggtgggcaa gaagcggcgc ccggtgaagg tgtaccccaa cggcgccgag gacgagtcgg
      601 cccaggcctt tcccctcgaa ttcaagaggg agctgaccgg ggagaggctc gagcaggcgc
      661 gcggccccga ggcccaggct gagagtgcgg ccgcccggcc tgagctggag tatggcctgg
      721 tggcggaggc ggaggctgag gcggccgaga agaaggactc ggggccctat aagatggaac
      781 acttccgctg gggcagcccg cccaaggaca agcgctacgg cgggttcatg acctccgaga
      841 agagccaaac gccccttgtc acgctgttca aaaacgccat catcaagaac gcccacaaga
      901 agggccagtg agggcgcagc gggcaggggc ctctctccgc ggaaagttga ccctgaaggc
      961 ctctcttctg ccctcctacc gcctcgcagc ctgggtgagg attcgcccag gcagtgatgg
     1021 cgccaggtat cccgactctt aaagctgtct gtagttaaga aataaaacct ttcaagtttc
     1081 acga
//
LOCUS       ECOK88ACF     861 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  E.coli K88ac fimbriae subunit gene, 5' end.
ACCESSION   M35954
KEYWORDS    K88ac fimbriae subunit; antigen.
SOURCE      E.coli (K12, strain MC1000) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 861)
  AUTHORS   Josephsen,J., Hansen,F., de Graaf,F.K. and Gaastra,W.
  TITLE     The nucleotide sequence of the protein subunit of the K88ac
            fimbriae of porcine enterotoxigenic Escherichia coli
  JOURNAL   FEMS Microbiol. Lett. 25, 301-306 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         13  >   861     K88ac fimbriae subunit precursor
    sigp         13       75     K88ac fimbriae subunit signal peptide
    matp         76  >   861     K88ac fimbriae subunit
BASE COUNT      227 a    154 c    230 g    250 t
ORIGIN      
        1 gggtttaatc ttatgaaaaa gactctgatt gcactggcaa ttgctgcatc tgctgcatct
       61 ggtatggcac atgcctggat gactggtgat ttcaatggtt cggtcgatat cggtggtagt
      121 atcactgcag atgattatcg tcagaaatgg gaatggaaag ttggtacagg tcttaatgga
      181 tttggtaatg tattgaatga cctgaccaat ggtggaacca aactgaccat tactgttact
      241 ggtaataagc caattttgtt aggccgaacc aaagaagcat ttgctacgcc agtaactggt
      301 ggtgtagatg gaattcctca tattgcattt actgactatg aaggagcttc tgtagtactc
      361 agaaaccctg atggtgaaac taataaaaaa ggtttagcat attttgttct gccgatgaaa
      421 aatgcagagg gcactaaagt tggttcagtg aaagtgaatg catcttatgc cggtgtgtta
      481 gggagaggtg gggttacttc tgcggacggg gagctgcttt cgctttttgc cgacgggttg
      541 agcgctatct tttatggtgg tttgccgagg ggttctgaat cgtcggctgg gagtgccgca
      601 gcggcgcgca caaagttgtt tggaagtcta tcaagaaatg atattctcgg acagattcaa
      661 agagtaaacg caaatattac ttctcttgtt gacgtcgcag gttcttacag ggaaaacatg
      721 gagtacactg atggaactgt tgtttctgct gcctatgcac tgggtattgc aaacggtcag
      781 actattgagg caacttttaa tcaggctgta actaccagca ctcagtggag cgctccgctg
      841 aacgtagcaa taacttatta c
//
LOCUS       LBARNII       400 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  L.acidophilus RNA polymerase (RNAI and RNAIIa) genes, 5' promoter
            region.
ACCESSION   M35739
KEYWORDS    RNA polymerase.
SOURCE      L.acidophilus DNA, clone pRNL5.
  ORGANISM  Lactobacillus acidophilus
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 400)
  AUTHORS   Natori,Y., Kano,Y. and Imamoto,F.
  TITLE     Characterization and promoter selectivity of Lactobacillus
            acidophilus RNA polymerase
  JOURNAL   Biochimie 70, 1765-1774 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        143  >   400     RNAI mRNA
    mRNA        209  <     1 (c) RNAIIa mRNA
BASE COUNT      149 a     69 c     67 g    115 t
ORIGIN      
        1 tgcgcggggt tcgagctacg gccaaaagaa ccccgcgttg gaacatctgg cacaaacttt
       61 ttctcatttt taactttttt ttaatttaat ttagacaaaa aaagaagcct taactggctt
      121 cgatctgtgg tataattttt ttaaagtaaa caaaaaaagt gctaactaaa acaacaacaa
      181 cagcactttt tcaatcaact tttcaacttc gaatttatta tagcataaaa ctagtaaaaa
      241 taaatacaaa aaaaaattgc ctgtattatc ttcgaagctg ggaagttgtc ccactatcgg
      301 aggttttttt tatgggaaaa atcgtaactt taccacaagg ataggctagt cagcctaagt
      361 ctaaaaagaa aaaaagcaag aagagttcag gcagtgcaag
//
LOCUS       LEIKPGRA       44 bp ss-RNA             RNA       03-AUG-1990
DEFINITION  L.tarentolae kinetoplast minicircle guide RNA (COIII gRNA-I).
ACCESSION   M33870
KEYWORDS    guide RNA.
SOURCE      L.tarentolae (strain UC) kinetoplast minicircle D-12-1, guide RNA.
  ORGANISM  Kinetoplast Leishmania tarentolae
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Leishmania tarentolae.
REFERENCE   1  (bases 1 to 44)
  AUTHORS   Sturm,N. and Simpson,L.
  TITLE     Kinetoplast DNA minicircles encode guide RNAs for editing of
            cytochrome oxidase subunit III mRNA
  JOURNAL   Cell 61, 879-884 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Simpson 13-APR-1990.
BASE COUNT       17 a      7 c      9 g     11 t
ORIGIN      
        1 ctatctttac gaggtaaaga cagagagatg aaaacactat tcgt
//
LOCUS       LEIKPGRB       46 bp ss-RNA             RNA       03-AUG-1990
DEFINITION  L.tarentolae mitochondrial minicircle guide RNA (4E-2 gRNA).
ACCESSION   M33871
KEYWORDS    guide RNA.
SOURCE      L.tarentolae mitochondrial minicircle Lt154, guide RNA.
  ORGANISM  Mitochondrion Leishmania tarentolae
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Leishmania tarentolae.
REFERENCE   1  (bases 1 to 46)
  AUTHORS   Sturm,N. and Simpson,L.
  TITLE     Kinetoplast DNA minicircles encode guide RNAs for editing of
            cytochrome oxidase subunit III mRNA
  JOURNAL   Cell 61, 879-884 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Simpson 13-APR-1990.
BASE COUNT       27 a      5 c      5 g      9 t
ORIGIN      
        1 ataaaacaca acaaaaaaca tagagaaatc atagagtgtt aaatat
//
LOCUS       MUSIFNRES     508 bp ds-DNA             ROD       03-AUG-1990
DEFINITION  Mouse MHC class II-associated invariant chain gene, exon 1.
ACCESSION   M35872
KEYWORDS    MHC class II-associated invariant chain.
SOURCE      Mouse DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 508)
  AUTHORS   Eades,A.-M., Litfin,M. and Rahmsdorf,H.J.
  TITLE     The IFN-gamma response of the murine invariant chain gene is
            mediated by a complex enhancer that includes several MHC class II
            consensus elements
  JOURNAL   J. Immunol. 144, 4399-4409 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        432  >   508     invariant chain protein, exon 1
    mRNA        348  >   508     invariant chain protein mRNA
    site         87      158     IFN-gamma response element
    signal      267      272     SP1 binding site
    signal      255      258     CAAT box
    signal      315      319     TATA box
BASE COUNT      133 a    131 c    140 g    104 t
ORIGIN      
        1 gggtgggggt gggtagacaa aggcactttc tgcttcaaaa ataattcctt ccctggggaa
       61 agtgcctgct tctagaaccg tccaaagagc ctgatgaatc cagaagtctg cctagaaaca
      121 agtgatgata gccctggcca gccaatggga tcatgcaggc ctttctacct gtttagggaa
      181 ctcccccttc atcctgccca gggaggcagc tttgagtgag tggggaattt ccagatttgt
      241 ggctttcagt tccacatcta ccatgtgggc ggagtgacct gctgtgggcg aatcagattc
      301 cttccagtat cagctttaag aggtgatctt ggggctcaag ggtcccagac acacagcagc
      361 agcagcagca gcagcagcag caacagcagc agcagcagca gcgcctgtgg gaaaaaatag
      421 aggctagagc catggatgac caacgcgacc tcatctctaa ccatgaacag ttgcccatac
      481 tgggcaaccg ccctagagag ccagaaag
//
LOCUS       SOYBCONGL     794 bp ss-mRNA            PLN       03-AUG-1990
DEFINITION  Soybean beta-conglycinin-alpha subunit mRNA, 3' end.
ACCESSION   M26128
KEYWORDS    beta-conglycinin.
SOURCE      Soybean, cDNA to mRNA, clone pB36.
  ORGANISM  Glycine max
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 794)
  AUTHORS   Coates,J.B., Medeiros,J.S., Thanh,V.H. and Nielsen,N.C.
  TITLE     Characterization of the subunits of beta-conglycinin
  JOURNAL   Arch. Biochem. Biophys. 243, 184-194 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      659     beta-conglycinin-alpha subunit (AA at 3)
    mRNA    <     1      794     beta-conglycinin-alpha subunit
BASE COUNT      253 a    153 c    177 g    211 t
ORIGIN      
        1 tgagcaaacg tgccaaatct agttcaagga aaaccatttc ttctgaagat aaacctttta
       61 acttgggaag ccgcgacccc atctactcca agaagcttgg caagttcttt gagatcaccc
      121 cagagaaaaa cccccagctt cgggacttgg atatcttcct cagtattgtg gatatgaacg
      181 agggagctct tcttctacca cacttcaatt caaaggcgat agtgatactg gtaattaatg
      241 aaggagatgc aaacattgaa cttgttggcc taaaagaaca acaacaggag cagcaacagg
      301 aagagcaacc tttggaagtg cggaaatata gagccgaatt gtctgaacaa gatatatttg
      361 taatcccagc aggttatcca gttgtggtca acgctacctc aaatctgaat ttctttgcta
      421 ttggtattaa tgccgagaac aaccagagga acttcctcgc aggttcgcaa gacaatgtga
      481 taagccagat acctagtcaa gtgcaggagc ttgcattccc tgggtctgca caagctgttg
      541 agaagctatt aaagaaccaa agagaatcct actttgtgga tgctcagcct aatgaaaaag
      601 aggagggtaa taagggaaga aagggtcctt tgtcttcaat tttgagggct ttttactgaa
      661 taagtatgta ctaaaatgta tgctgtaata gctcatagtg agcgaggaaa gtatcgggct
      721 atgtaactat gactagagct tcaactatga ataaataaat cgacagcata tgatgctttt
      781 gttttgtgtt cttc
//
LOCUS       SOYGLYB4      911 bp ss-mRNA            PLN       03-AUG-1990
DEFINITION  Soybean glycinin A3-B4 subunit mRNA, 3' end.
ACCESSION   M35671
KEYWORDS    glycinin; glycinin-B4 subunit.
SOURCE      Soybean (var. Early-10) seed, cDNA to mRNA, clones pSPG[204,673].
  ORGANISM  Glycine max
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 911)
  AUTHORS   Epishin,S.M., Kotenko,S.V., Zakharova,E.S., Kiselev,S.L. and
            Vinetskii,Y.P.
  TITLE     Cloning and structural analysis of cDNA encoding a portion of the
            A-3-B-4 glycinin subunit
  JOURNAL   Genetika 22, 810-815 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      723     glycinin precursor
    matp        202      720     glycinin-B4 subunit
    mRNA    <     1      911     glycinin mRNA
BASE COUNT      280 a    220 c    203 g    208 t
ORIGIN      
        1 caagaagacg aagacgaaga tgaagacgaa gaatatgaac aaactccctc ttatcctcca
       61 cgacgaccaa gccatggaaa gcatgaagat gacgaggacg aggacgaaga agaacatcaa
      121 cctcgtcctg atcaccctcc acagcgacca agcaggcccg aacaacaaga accacgtgga
      181 agaggatgtc agactagaaa tggggttgag gaaaatattt gcaccatgaa gcttcacgag
      241 aacattgctc gcccttcacg tgctgacttc tacaacccaa aagctggtcg cattagcacc
      301 ctcaacagtc tcaccctccc agccctccgc caattcggac tcagtgccca atatcttgtc
      361 ctctacagga atggaattta ctctccacat tggaacttga acgcgaacag tgtgatctat
      421 gtgactcgag ggaaaggaag agttagagtg gtgaactgcc aagggaatcc agtgttcgac
      481 ggtgacctaa ccaggggaca attgctactg gtgccgcaga actttgtggt cgctgaccaa
      541 gggggaaaac aaggattgga atacgtagtg ttcaagacac agcacaacgc cgtgagcagc
      601 tacattaagg atctgtttag ggcaatccct tcggaggttc tttccaattc ttacaacctt
      661 ggccagagtc aagtgcgtca gctcaagtat caaggaaact ccggcccttt gctcaaccca
      721 taaataacaa caagcatata tcaaggtgtg gtgaggccat cttatatgaa taatatgaaa
      781 atatattttg tgtaataata aaactatggc ctatgtattt accaccctcc gacccagcct
      841 atgttaatat ctgagtggcg ttgtaccttt gaatcgcctt aataaaatgt cagtcttcaa
      901 gttttgtctt t
//
LOCUS       TLARGN        133 bp ds-DNA             PLN       03-AUG-1990
DEFINITION  T.lanuginosus 26S ribosomal RNA gene, partial.
ACCESSION   M35561
KEYWORDS    26S ribosomal RNA.
SOURCE      T.lanuginosus DNA.
  ORGANISM  Thermomyces lanuginosus
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina.
REFERENCE   1  (bases 1 to 133)
  AUTHORS   Nazar,R.N. and Wong,W.M.
  TITLE     Is the 5S RNA a primitive ribosomal RNA sequence?
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 5608-5611 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >   133     23S rRNA
BASE COUNT       39 a     23 c     39 g     32 t
ORIGIN      
        1 gtagcaaata ttcaaatgag aactttgaag actgaagtgg ggaaaggttc cacgtcaaca
       61 gcagttggac gtgggttagt cgatcctaag agatggggaa gctccgtttc aaaggcctga
      121 ttttatgcag gcc
//
LOCUS       YSCRGNA       252 bp ds-DNA             PLN       03-AUG-1990
DEFINITION  S.cerevisiae 26S ribosomal RNA gene, partial.
ACCESSION   M35562
KEYWORDS    26S ribosomal RNA.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 252)
  AUTHORS   Nazar,R.N. and Wong,W.M.
  TITLE     Is the 5S RNA a primitive ribosomal RNA sequence?
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 5608-5611 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >   252     23S rRNA
BASE COUNT       62 a     60 c     75 g     55 t
ORIGIN      
        1 gtagcaaata ctcaaatgag aactttgagg actgaagtgg ggaaaggttc catgtgaaca
       61 gcagttggac atgggttagt cgatcctaag acacagggaa gttccgtttg aaaggcgccc
      121 tggtgcgccg ttgacatgca acagctggga ttcgctggtg gtcacccacc caactactaa
      181 ccagccggcg tgtggcttaa gtacggctga gcggacggga agccctgttt tccacaccct
      241 atggtcgcat gt
//
LOCUS       YSCSRP        413 bp ds-DNA             PLN       03-AUG-1990
DEFINITION  S.cerevisiae serine rich protein (srp) gene, 5' end.
ACCESSION   M35738
KEYWORDS    serine rich protein.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 413)
  AUTHORS   Marguet,D. and Lauquin,G.J.-M.
  TITLE     The yeast srp gene: Positive modulation by glucose of its
            transcriptional expression
  JOURNAL   Biochem. Biophys. Res. Commun. 138, 297-303 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        408  >   413     serine rich protein (srp)
    mRNA        347  >   413     SRP mRNA (alt.)
    mRNA        355  >   413     SRP mRNA (alt.)
    mRNA        404  >   413     SRP mRNA (alt.)
BASE COUNT      123 a     83 c     71 g    136 t
ORIGIN      
        1 gatccatctt tcttcgcgcg tttattttca gcaccatgtt ttaggttttt acagcatcgt
       61 ttaaggaacc caacaataca atagcgggaa gaatgcactt tctcgttcca taaagggtct
      121 ctttcaccta tacggttggt acagatttcc agtgaatgcc agtcagccac ggcattacgt
      181 cgtttgcttc tattttcttc gttttgaact gcgtttgtat gcaactgtcc cttgacagag
      241 aaaaaagtga agcaaaatga cagacaaaga aatcttttgt ataaaaggtc ggttgaatct
      301 tgttgttagc ttgaaatcag cttgcttttc tcctctaaat tacataaaaa accaagaaaa
      361 tatcagactt tttcattcgc tttcaacaag tactacaata attaaaaatg gct
//
LOCUS       AVIVNFDG     6557 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  A.vinelandii nitrogenase (vnfHDGK) genes, complete cds.
ACCESSION   M32371
KEYWORDS    nitrogenase 2; vanadium nitrogenase.
SOURCE      A.vinelandii (strain CA) DNA.
  ORGANISM  Azotobacter vinelandii
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Azotobacteraceae.
REFERENCE   1  (bases 1 to 6557)
  AUTHORS   Joerger,R.D., Loveless,T.M., Pau,R.N., Mitchenall,L.A.,
            Simon,B.H. and Bishop,P.E.
  TITLE     Nucleotide sequence and mutational analysis of the structural genes
            for nitrogenase 2 of Azotobacter vinelandii
  JOURNAL   J. Bacteriol. 172, 3400-3408 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.N.Pau, 28-FEB-1990.
FEATURES       from  to/span     description
    pept        238     1110     nitrogenase (vnfH)
    pept       2415     3839     nitrogenase (vnfD)
    pept       3839     4180     nitrogenase (vnfG)
    pept       4237     5664     nitrogenase (vnfK)
    pept       1227     1424     ferredoxin-like ORF
    rpt        1172     1192     inverted repeat
    rpt        4188     4219     inverted repeat
    rpt        5737     5760     inverted repeat
    binding    1214     1218     pot. ribosome binding site
    binding    2400     2406     pot. ribosome binding site
    binding    4222     4228     pot. ribosome binding site
    site       2311     2327     pot. NTRA-dependent promoter
BASE COUNT     1359 a   2270 c   1874 g   1054 t
ORIGIN      
        1 gatcgcttcc cggctgtacc tgcgggtacg tcgacggagc gcacagcatc ctggcctgga
       61 tttatggagt ccaataaacc tgcaaaaatt aaaataattc actaattaaa tatgtttttt
      121 tgattttata ttccaaaaaa taggcaatca tcgacttatc gatccttgtg gcaccgccct
      181 tgctcaactc tcgtcggcac aaatcaaaac gccaacgaat caacggaggt tcctaagatg
      241 gcattgcgtc agtgtgcaat ttacggcaag ggtggcatcg gcaagtccac caccacccag
      301 aacctggtcg ccgccctcgc cgaagccggc aagaaagtga tgatcgtcgg ttgtgacccg
      361 aaagccgact ccacccgcct gatcctgcac tccaaggccc agggcaccgt catggaaatg
      421 gccgcgtccg ccggctcggt cgaagacctg gagctggaag acgtgctgca gatcggcttc
      481 ggcggcgtca agtgcgtcga atccggtggc ccggagccgg gcgtcgcttg cgccggccgt
      541 ggcgtgatca ccgcgatcaa cttcctggaa gaagaaggcg cctacagcga cgacctggac
      601 ttcgtgttct atgacgtgct gggcgacgtg gtatgcggcg gcttcgccat gccgatccgc
      661 gagaacaagg cccaggaaat ctacatcgtc tgctccggcg agatgatggc catgtacgcc
      721 gccaacaaca tcgccaaggg catcgtgaaa tacgcccact ccggcagcgt gcgtctgggc
      781 ggcctgatct gcaacagccg caagaccgac cgcgaagacg agctgatcat ggccctggcc
      841 gcgaagatcg gcacccagat gatccacttc gtgccgcgcg acaacgtcgt gcaacacgcc
      901 gaaatccgcc gcatgaccgt gatcgaatac gatccgaagg ccggacaggc cgacgagtac
      961 cgtgccctgg ctcgcaaaat cgtcgacaac aagctgctgg tcatcccgaa cccggcctcc
     1021 atggaagaac tcgaagagct gctaatggaa ttcggcatca tggaagtcga agacgagtcc
     1081 gtcgtcggca aggccgccgc cgaaggctga ttcacccagc acagcgtttg cggaggagcg
     1141 tgcgccgcgg gctttcggaa tggcttctcg cggccggcgg cgcacgccgc cctcccttcg
     1201 aacaaccgac ctcaggagct gacaccatgg ccatggccat cgacggctac gaatgcactg
     1261 tctgcggcga ctgcgagccg gtctgcccga ccggttcgat cgtcttcagg gacgatcact
     1321 acgcgatcga agccgacagt tgcaacgaat gcaccgacgt gggcgagccg cgctgtctcg
     1381 gcgtctgccc cgtcgacttc tgcatccagc cgctcgatga ctgaacactg aacgactccg
     1441 caccccgttg ccggcggcag gacattccgc gccgtcctgc cgccggaccc agaacggcga
     1501 tcgcttttcc tcaggggcga tcgccgtttt acttttcccc gctccgctag cgcccgcgga
     1561 caacagccgt ccgtatccgc gccgttcgcc gtcctcgacc ggccacgatc gcggcacagc
     1621 tcgccaagct acagcccgct ccacaagctg accatcggca tccagacctt cgccaagatc
     1681 cgcgaagact tcttacgtcg acaagaccgg cttcgcccgg cgaccgatcg aacgggacac
     1741 actacttcct ctcccggcca tgccgcttcg gcaagagccc gttcctcgat accctggccg
     1801 aaccgttcgc cggcgacgaa ccttcgttcc gcaggctgca gatgcatgac cgatgggact
     1861 ggcgccggcg ccatccggca ttgcggatcg gcttcggcgg cgcgccgatc cgggacggcg
     1921 gataagctcg ccgagccgct gctcccaaag ccacgcccac ctccacgtcc cggcgcggaa
     1981 gtggcgatcc ccagccgcac gtccccatgg tttccgggtc tggaaccgtc ccccggactt
     2041 cccgtacgga cgcccggaag ccaccgcccc gacggccacg cacagccacg tacagacccc
     2101 ccgctctccg tacttggccg aacgaaaaac cttttacgta acccgccggc cgtccacccg
     2161 gttgtcccga acaggaaaaa aaagcccgga aaaggcttgc cccggcgctt ttccaaaacc
     2221 tcgaaaatgc gcacaaataa ttgattcgaa aggattaatc tgagacagcg gcgatgccgc
     2281 gaaaaattcc tgcaagcgct gcaaggacat atggcacgca tcctgccctt acctcttcgc
     2341 caaccggttt ttcggtcccg gtcgagtgcc gtcaggggac tcgatccacg catagagccg
     2401 aggagacttc aatcatgcca atggtattgc tggaatgtga caaggacata cccgagcgcc
     2461 agaaacacat ctatctgaag gcgcccaacg aggacacccg cgagttcctg ccgatcgcca
     2521 acgcggcgac catccccggc accctgtccg aacgcggctg cgccttctgc ggcgccaagc
     2581 tggtgatcgg cggtgtgctc aaagacacca tccagatgat tcacggcccg ctcggctgtg
     2641 cctacgacac ctggcacacc aagcgctacc cgaccgacaa cggccacttc aacatgaagt
     2701 acgtctggtc gaccgacatg aaggaaagcc atgtggtctt cggcggcgag aaacgcctcg
     2761 agaagagcat gcacgaagcc ttcgacgaaa tgcccgacat caagcggatg atcgtctaca
     2821 cgacctgccc gaccgcgctg atcggcgacg acatcaaggc cgtggccaag aaggtgatga
     2881 aggaccgtcc ggacgtggac gtcttcaccg tcgaatgccc cggcttctcc ggtgtgtccc
     2941 agtccaaggg ccaccacgtc ctgaacatcg gctggatcaa cgagaaagtc gagacgatgg
     3001 agaaggaaat caccagcgaa tacaccatga acttcatcgg tgacttcaat attcaaggtg
     3061 atacccaact gctgcaaacc tactgggacc gcctgggcat ccaggtcgtc gcccacttca
     3121 ccggcaacgg cacctacgac gacctgcgct gcatgcacca ggcccagctc aacgtggtga
     3181 actgcgcccg ttcctccggc tacatcgcca acgagctgaa gaagcgctac ggcatcccgc
     3241 gtctggacat cgactcctgg ggcttcaact acatggccga gggcatccgc aagatctgcg
     3301 ccttcttcgg catcgaggag aagggcgagg agctgatcgc cgaggaatac gccaagtgga
     3361 agccgaagct cgactggtac aaggagcgtc tgcaaggcaa gaaaatggcg atctggaccg
     3421 gcggcccgcg cctgtggcac tggaccaagt cggtcgagga cgacctgggc gtgcaagtgg
     3481 tggccatgtc ctccaagttc ggccatgagg aagacttcga gaaggtcatc gcccgcggca
     3541 aggaaggcac ctactacatc gacgacggca acgagctgga attcttcgag atcatcgacc
     3601 tggtcaagcc ggacgtgatc ttcaccggcc cgcgcgtcgg cgaactggtc aagaagctgc
     3661 acatccccta cgtcaacggc cacggctacc acaacggccc gtacatgggc ttcgaaggct
     3721 tcgtcaacct ggcccgcgac atgtacaacg ccgtgcacaa cccgctgcgc cacctggccg
     3781 ccgtggatat ccgcgacaag tcgcagacta ccccggtcat cgtgcggggg gccgcctgat
     3841 gagccagtcc catctcgacg atctgttcgc ctatgtcgag gagcgctgcc tgtggcagtt
     3901 cttctcgcgc acctgggacc gcgaggaaaa catcgagggc gtgctcaatc aggtcggccg
     3961 cctgctgacc ggccaggagc cgctgcgcgg caccccgcag gagcgcctgt tctacgccga
     4021 cgccctggcc atggccaacg atgtccgcga gcgtttcccc tgggcttcgc aggtcaacaa
     4081 ggaagagatc gagttcctgc tcgacggtct caagtcccgt ctggtcgacg tgaccatcac
     4141 ccgctcgacc aaccgcgaac tcaaccacca cctctactga gtcatcggcg cggagacggc
     4201 cggaacgccg tctgcgcgcg cgaaggagcc attttcatga gcaattgcga actgaccgtg
     4261 ctgaagccgg cagaagtcaa gctgagcccg cgtgaccgcg agggcatcat caacccgatg
     4321 tacgactgcc agccggccgg cgcccagtac gccggcatcg gcatcaagga ctgcatcccg
     4381 ctggtccacg gcggccaggg ctgcacgatg ttcgtccgcc tgctgttcgc ccagcacttc
     4441 aaggaaaact tcgacgtcgc ctccacctcg ctgcacgagg agtcggcggt gttcggcggc
     4501 gccaagcgcg tcgaggaagg cgtgctggtc ctcgcccgcc gctacccgaa cctgcgcgtc
     4561 atcccgatca tcaccacctg ctccaccgaa gtcatcggcg acgacatcga gggcagcatc
     4621 cgggtctgca accgggcact cgaagccgaa ttcccggatc gcaagatcta cctggcgccg
     4681 gtacacaccc cgagcttcaa gggcagccac gtcaccggct acgccgagtg cgtgaagtcg
     4741 gtgttcaaga ccatcaccga cgcgcacggc aagggccagc cgagcggcaa gctcaacgtg
     4801 ttcccgggct gggtcaaccc cggcgacgtg gtgctgctca agcgctactt caaggaaatg
     4861 gacgtcgaag ccaacatcta catggacacc gaggacttcg actcgccgat gctgccgaac
     4921 aagagcatcg agacccacgg ccgcaccacc gtcgaggaca tcgccgacag cgccaacgcc
     4981 ctggccaccc tgtccctggc ccgctacgag ggcaacacca ccggcgagtt gctgcagaag
     5041 accttcgcgg tgccgaatgc cctggtcaac accccctacg gcatcaagaa caccgacgac
     5101 atgctgcgca agatcgccga agtcaccggc aaggagatcc ccgagtcgct ggtccgcgag
     5161 cgcggcatcg ccctcgacgc cctggccgac ctggcgcaca tgttcttcgc caacaagaaa
     5221 gtggcgatct tcggccaccc ggacctggtg ctcggcctgg cccagttctg catggaagtc
     5281 gagctggaac ccgtactgct gctgatcggc gacgaccagg gcaacaagta caagaaggac
     5341 ccgcgcatcg aggagctgaa gaacaccgcg cacttcgaca tcgagatcgt ccacaacgcc
     5401 gacctctggg aactggagaa gcgcatcaac gccggcctcc agctcgacct gatcatgggt
     5461 cactcgaagg gccgctacgt cgccatcgag gccaacatcc cgatggtccg cgtcggcttc
     5521 ccgaccttcg accgcgccgg tctctaccgc aagccctcga tcggctacca gggcgccatg
     5581 gaactgggcg agatgatcgc caacgccatg ttcgcccaca tggaatacac ccgtaacaag
     5641 gagtggatcc tcaatacgtg gtgagttgag gtgccggagc ggtttccacg cactccggct
     5701 gtcgagccga ccgagatgac ggcatgcacg gatctcccct cgcccatgca tgggcgaggg
     5761 agttgcgagc cgtcacgccc gaggtcgttc cgacggcagg ccgatgcacc ggaaaaccgc
     5821 tcccgccccc ggcggctccc ggctccggcc ggggcctccg ccccgtattc cagatccggc
     5881 cgccatgccg cattcccggg agtgatgcca tgaaacagcg acaggaaatg gtcgcccact
     5941 accgcgcctg cttcggcgag ctgtgcgccc gaccggaaca ccgtcctatc gaaccctaca
     6001 cccgcccccg gcgcctgagc ttcgcggagc cggaaagcgc gaccgcccgt caggtgccgg
     6061 cgcgcctggt actggccctg accagtgcct acgccctgct cgccgactgg caggaatccc
     6121 gcgacccgtc gctggccgac ctgggcagtt ggcaacgcta cctcgccctg ccccgccgca
     6181 ccccggcgga aaagctgatg gccgagatct tccgcatcct gcgcgtgttc cgcgccgcgg
     6241 cgatccagca gaacggcgcc atcgaaatcc gcgacgacgg cctgatccgc gccagttgca
     6301 cctacaaccg ctgcgcgctg agcctgctga tcacccaggc cggcctcgaa ctgctcaccg
     6361 cctgcgtcgc ctgctacctc gaatccttcg accagcccta cagcgatgcc tacgaggaac
     6421 tgctgctcgg ccagtactac gccgacatcg tcgccgagat ccgctccttc gccgacgacg
     6481 accgggtgct gttccagttc cgccagaaag cctggttcaa ccgccatgtc cgcctggact
     6541 gcgacaaccc gcgcctg
//
LOCUS       ECOCYS       5755 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  E.coli thiosulfate binding protein (cysP), sulfate permease (cysT,
            cysW, cysA) and o-acetylserine (thiol)-lyase-B (cysM) genes,
            complete cds.
ACCESSION   M32101
KEYWORDS    cysA gene; cysM gene; cysP gene; cysT gene; cysW gene;
            o-acetylserine (thiol)-lyase-B; sulfate permease;
            thiosulfate binding protein.
SOURCE      E.coli K12 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1292 to 5755)
  AUTHORS   Sirko,A., Hryniewicz,M., Hulamicka,D. and Boeck,A.
  TITLE     Sulfate and thiosulfate transport in E.coli K12: Nucleotide
            sequence and expression of the cysTWAM gene cluster
  JOURNAL   J. Bacteriol. 172, 3351-3357 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1291)
  AUTHORS   Hryniewicz,M., Sirko,A., Palucha,A., Boeck,A. and Hulamicka,D.
  TITLE     Sulfate and thiosulfate transport in E.coli K12: Identification of
            a gene encoding a novel protein involved in thiosulfate binding
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            D.Halanicka, 14-FEB-1990.
FEATURES       from  to/span     description
    pept        559     1575     thiosulfate binding protein (cysP)
    pept       1575     2408     sulfate permease (cysT)
    pept       2408     3283     sulfate permease (cysW)
    pept       3273     4370     sulfate permease (cysA)
    pept       4505     5416     o-acetylserine (thiol)-lyase-B (cysM; gtg
                                 start codon; EC 4.2.99.8)
    signal      516      521     -10 region
    site       5737     5755     right end of mu
BASE COUNT     1259 a   1477 c   1682 g   1337 t
ORIGIN      52 min on K12 map.
        1 gttaacgcca tttgcccggg atacgtgcgc acaccaatgg cggaaagcat tgcccgccag
       61 tcgaacccgg aagatccaga gtcggtgctg actgaaatgg cgaaagcaat cccgatgcgt
      121 cgcctcgccg atccgctgga agtcggcgaa ctggcggcct tcctcgcatc ggatgaatcc
      181 agctatttaa ccggtacaca gaatgtgatt gatggcggca gcacactgcc ggagacggtt
      241 agcgtcggta tctgattcac ctctgtttcc tccctgcatt tgtggggagg atttcgtctt
      301 gaactaagtt caccaggcta ttttatttgt cattttggcc ccgggcagtg ctcgaaatcc
      361 tcacgtacta tgtgtacgct ccggtttctc cgcgctgttc gtgtccaaac tgactgcaac
      421 aattacgcct gttgaaccaa gttcttattc ccttttcaac ttccaaatca ccaaacggta
      481 tataaaaccg ttactccttt cacgtccgtt ataaatatga tggctattag aaagtcatta
      541 aatttataag ggtgcgcaat ggccgttaac ttactgaaaa agaactcact cgcgctggtc
      601 gcttctctgc tgctggcggg ccatgtacag gcaacggaac tgctgaacag ttcttatgac
      661 gtctcccgcg agctgtttgc cgccctgaat ccgccgtttg agcaacaatg ggcaaaagat
      721 aacggcggcg acaaactgac gataaaacaa tctcatgccg ggtcatcaaa acaggcgctg
      781 gcgattttac agggcttaaa agccgacgtt gtcacttata accaggtgac cgacgtacaa
      841 atcctgcacg ataaaggcaa gctgatcccg gccgactggc agtcgcgcct gccgaataat
      901 agctcgccgt tctactccac catgggcttc ctggtgcgta agggtaaccc gaagaatatc
      961 cacgattgga acgacctggt gcgctccgac gtgaagctga ttttcccgaa cccgaaaacg
     1021 tcgggtaacg cgcgttatac ctatctggcg gcatggggcg cagcggataa agctgacggt
     1081 ggtgacaaag gcaaaaccga acagtttatg acccagttcc tgaaaaacgt tgaagtgttc
     1141 gatactggcg gtcgtggcgc gaccaccact tttgccgagc gcggcctggg cgatgtgctg
     1201 attagcttcg aatcggaagt gaacaacatc cgtaaacagt atgaagcgca gggctttgaa
     1261 gtggtgattc cgaaaaccaa cattctggcg gaattcccgg tggcgtgggt tgataaaaac
     1321 gtgcaggcca acggtacgga aaaagccgcc aaagcctatc tgaactggct ctatagcccg
     1381 caggcgcaaa ccatcatcac cgactattac taccgcgtga ataacccgga ggtgatggac
     1441 aaactgaaag acaaattccc gcagaccgag ctgttccgcg tggaagacaa atttggctcc
     1501 tggccggaag tgatgaaaac ccacttcacc agcggcggcg agttagacaa gctgttagcg
     1561 gcggggcgta actgatgttt gctgtctcct ccagacgcgt gctgccgggc tttaccttaa
     1621 gcctcggcac cagtctgctg tttgtgtgcc tgattttgct gctgccgctc tccgcgctgg
     1681 tgatgcaact ggcccagatg agctgggcgc agtactggga ggtgatcacc aacccgcagg
     1741 tggtcgcggc ctacaaagta acgctgctgt cggcgtttgt ggcatcgatt tttaacggcg
     1801 ttttcggtct gctgatggcg tggatcctaa cccgctatcg cttcccaggc cgcacgctgc
     1861 ttgatgcgct gatggattta ccctttgcgc tgccaacggc tgtcgccggt ttaacgctgg
     1921 cctcgctctt ttccgtaaac ggtttttacg gtgaatggct ggcgaagttt gatatcaaag
     1981 tcacctatac atggctgggg attgcggtgg ctatggcctt taccagcatt ccgtttgtgg
     2041 tgcgtaccgt gcagccggtg ctggaagagt taggcccgga atatgaagaa gcggcggaaa
     2101 cgcttggtgc aacgcgctgg cagagtttct gcaaagtggt gctgccggag ctttctccgg
     2161 cgctggtggc gggcgtggcg ctgtcgttta cccgtagtct tggtgaattt ggcgcggtga
     2221 tttttatcgc cggaaatatc gcgtggaaga cggaagtgac gtcgctgatg atttttgtgc
     2281 gcttacagga gtttgattac ccggcagcga gcgcgattgc ttcggtgatc ctcgcggcat
     2341 ctctgctgct gctgttctca attaacactc tgcaaagtcg ctttggtcgg cgtgtggtag
     2401 gtcattaatg gcggaagtta cccaattgaa gcgttatgac gcgcgcccga ttaactgggg
     2461 caaatggttt ctgattggca tcgggatgct ggtttcggcg ttcatcctgc tggtgccgat
     2521 gatttacatc ttcgtgcagg cattcagcaa ggggctgatg ccggttttac agaatctggc
     2581 cgatccggac atgctgcacg ccatctggct gacggtgatg atcgcgctga ttgccgtacc
     2641 ggtaaacctg gtgttcggca ttctgctggc ctggctggtg acgcgcttta acttccctgg
     2701 acgccagtta ctgctgacgc tactggacat tccgtttgcc gtatcgccgg tggttgccgg
     2761 tctggtgtat ttgctgttct acggctctaa cggcccgctc ggcggttggc tcgacgagca
     2821 taacctgcaa attatgttct cctggccggg aatggtgctg gtcaccatct tcgtgacgtg
     2881 tccgtttgtg gtgcgcgaac tggtgccggt gatgttaagc cagggcagcc aggaagacga
     2941 agcggcgatt ttgcttggcg cgtccggctg gcagatgttc cgtcgcgtca cattaccgaa
     3001 catccgctgg gcgctgcttt atggcgtggt gttgaccaac gcccgcgcaa ttggcgagtt
     3061 tggcgcggtg tcggtggttt ccggctcgat tcgcggcgaa accctgtcgc tgccgttaca
     3121 gattgaattg ctggagcagg actacaacac cgtcggctcc tttaccgctg cggcgctgtt
     3181 aacgctgatg gcgattatca ccctgttttt aaaaagtatg ttgcagtggc gcctggagaa
     3241 tcaggaaaaa cgcgcacagc aggaggaaca tcatgagcat tgagattgcc aatattaaga
     3301 agtcgtttgg tcgcacccag gtgctgaacg atatctcact ggatattcct tcaggtcaga
     3361 tggtcgcgtt gctggggccg tccggttccg ggaaaaccac gctgctgcgc attatcgccg
     3421 ggctggagca tcaaaccagc gggcatattc gcttccacgg caccgacgtg agccgcctgc
     3481 acgcacgtga tcgtaaagtc ggtttcgtgt tccagcatta cgcgctgttc cgccatatga
     3541 cggtgttcga caatatcgct tttggcctga cggtgctgcc gcgtcgcgag cgcccgaatg
     3601 ccgcagccat caaagcgaaa gtgacaaaat tgctggaaat ggtccagctt gcccatctgg
     3661 cggatcgtta tccggcgcac gtttccggcg gccagaaaca gcgcgtggcg ctggcgcgcg
     3721 cgctggctgt ggaaccgcaa attctgctgc ttgatgaacc gtttggcgcg ctggatgcgc
     3781 aggtgcgtaa agagctgcgt cgctggctgc gtcaactcca tgaagaacta aaattcacca
     3841 gcgtttttgt gacccacgat caggaagaag cgaccgaagt agctgatcgt gtagttgtga
     3901 tgagccaggg caatattgaa caggctgacg cgccggatca ggtatggcgc gaaccggcga
     3961 cccgttttgt gctcgaattt atgggcgaag tgaaccgcct gcagggaacc attcgcggcg
     4021 ggcagttcca tgttggcgcg catcgctggc cgctgggcta cacacctgcg tatcaggggc
     4081 cggtggatct cttcctgcgc ccttgggaag tggatatcag ccgccgtacc agcctcgatt
     4141 cgccgctgcc ggtacaggta ctggaagcca gcccgaaagg tcactacacc caattagtgg
     4201 tgcagccgct ggggtggtac aacgaaccgc tgacggtcgt gatgcatggc gacgatgccc
     4261 cgcagcgtgg cgagcgttta ttcgttggtc tgcaacatgc gcggctgtat aacggcgacg
     4321 agcgtatcga aacccgcgat gaggaacttg ctctcgcaca aagcgcctga taggttgagt
     4381 gaatgttaaa cgcccggagg cgcttcccgc gagtccgggc tttttaatgg caaggtttgt
     4441 aacctgtaga cctgataaga cgcgcaagcg tcgcatcagg caacaccacg tatggataga
     4501 gatcgtgagt acattagaac aaacaatagg caatacgcct ctggtgaagt tgcagcgaat
     4561 ggggccggat aacggcagtg aagtgtggtt aaaactggaa ggcaataacc cggcaggttc
     4621 ggtgaaagat cgtgcggcac tttcgatgat cgtcgaggcg gaaaagcgcg gggaaattaa
     4681 accgggtgat gtcttaatcg aagccaccag tggtaacacc ggcattgcgc tggcaatgat
     4741 tgccgcgctg aaaggctatc gcatgaaatt gctgatgccc gacaacatga gccaggaacg
     4801 ccgtgcggcg atgcgtgctt atggtgcgga actgattctt gtcaccaaag agcagggcat
     4861 ggaaggtgcg cgcgatctgg cgctggagat ggcgaatcgt ggcgaaggaa agctgctcga
     4921 tcagttcaat aatcccgata acccttatgc gcattacacc accactgggc cggaaatctg
     4981 gcagcaaacc ggcgggcgca tcactcattt tgtctccagc atggggacga ccggcactat
     5041 caccggcgtc tcacgcttta tgcgcgaaca atccaaaccg gtgaccattg tcggcctgca
     5101 accggaagag ggcagcagca ttcccggcat tcgccgctgg cctacggaat atctgccggg
     5161 gattttcaac gcttctctgg tggatgaggt gctggatatt catcagcgcg atgcggaaaa
     5221 caccatgcgc gaactggcgg tgcgggaagg aatattctgt ggcgtcagct ccggcggcgc
     5281 ggttgccgga gcactgcggg tggcaaaagc taaccctgac gcggtggtgg tggcgatcat
     5341 ctgcgatcgt ggcgatcgct acctttctac cggggtgttt ggggaagagc attttagcca
     5401 gggggcgggg atttaaggat taatagcatc ggagactgat gacaaacgca aaactgcctg
     5461 atgcgctacg cttatcaggc ctacaaggtt tctgcaatat attgaattag cacgattttg
     5521 taggccggat aaggcgttta cgccgcatcc ggcataaaca aagcgcactt ttttaacagt
     5581 tgttgctgcc gacaaatgca gtatttaatt ttcgtgagga aacgccgtaa ggtcattgaa
     5641 gcggcgcacg aaaaaccgaa agcgtttcac gataaatgcg aaaactttac gtttcgcgct
     5701 tcaaatgaaa cagatgtatt aattactact ttttattcat tacatgggga tccag
//
LOCUS       TTHTRPBA     2586 bp ds-DNA             BCT       03-AUG-1990
DEFINITION  T.thermophilus tryptophan synthetase (trpAB) genes, complete cds.
ACCESSION   M32108
KEYWORDS    tryptophan synthetase.
SOURCE      T.thermophilus (strain HB27) DNA.
  ORGANISM  Thermus thermophilus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 2586)
  AUTHORS   Koyama,Y. and Furukawa,K.
  TITLE     Cloning and sequence analysis of tryptophan synthetase genes of an
            extreme thermophile, Thermus thermophilus HB27: Plasmid transfer
            from replica-plated E.coli recombinant colonies to competent
            T.thermophilus cells
  JOURNAL   J. Bacteriol. 172, 3490-3495 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            Y.Koyama, 14-FEB-1990.
FEATURES       from  to/span     description
    pept        353     1567     tryptophan synthetase B (EC 4.2.1.20)
    pept       1564     2379     tryptophan synthetase A (gtg start codon; EC
                                 4.2.1.20)
BASE COUNT      396 a    879 c    905 g    406 t
ORIGIN      1 bp upstream of BglII site.
        1 agatctcgtt gacgctcgcc gtgcggtggc ggaaccactg cctcgccacg aagatggggg
       61 ccttgacgtg aaacttgaag accaccatct caaaggggct cgtgtggcgg tggcgcatga
      121 ggtagtcaat gagggcggcg tcctcccgca ccgtcttcgt cccttccccg taggagaccc
      181 gggccgcctg gacgatggcg cggtcgtccc ccatctggtc caccaggcgg acgaagccct
      241 tgtccaagac cgggatcgtg agaggccctt ccatcccccc gagtttaccg ggaggcccct
      301 ccggggtagg atgggagttg tcttggcgcg aggcgccttt agggagcgaa gcatgctgac
      361 cctacccgac tttcccttgc ccgacgcgag ggggcggttc ggcccctacg gggggcggta
      421 cgtgcccgag accctgatcc ccgccctgga ggagttggag gccgcctacc gggaggccaa
      481 gaaggatccg gccttcctgg aggagctgga ccactacctc aggcagtttg ccggcaggcc
      541 cacgcccctt taccacgcca agcggctttc cgagtactgg ggtggggccc aggtcttcct
      601 caagcgggag gacctcctcc acaccggggc ccacaagatc aacaacaccc tgggccaagc
      661 cctcctggcg aggcgcatgg gcaagaggcg ggtcatcgcc gaaacggggg cggggcagca
      721 cggggtgagc gtggccacgg tggcggccct cttcggcctg gagtgcgtgg tctacatggg
      781 ggaggaggac gtcaggcgcc aggccttaaa cgtcttccgc atgaagctcc tcggggccga
      841 ggtccggccc gtggccgcgg ggagccgcac cctcaaggac gccaccaacg aggccatccg
      901 cgactggatc acgaacgtgc gcaccacctt ctacatcctg ggctccgtgg tgggccccca
      961 cccctacccc atgatggtcc gggacttcca gagcgtcatc ggggaggagg tgaagcggca
     1021 aagcctggag cttttcggcc gcctgcccga cgccctcatc gccgcggtgg ggggcggttc
     1081 caacgccatc ggcctcttcg cccccttcgc ctacctgcct gagggcaggc ccaagctcat
     1141 cggggtggag gccgccgggg aggggctttc cacgggcagg cacgccgcga gcatcggggc
     1201 ggggaagcgg ggggtgctgc acgggagcta catgtacctc ctctacgacc acgatgggca
     1261 gatcaccccg gcccactccg tctccgcagg gctggactac cccggggtgg ggccggagca
     1321 cagctactac gccgacgccg gggtggcgga gtacgccagc gtcacggacg aggaggccct
     1381 cgagggcttc aagctcctgg cgcggcttga gggcatcatc cccgccttgg agtcggccca
     1441 cgccatcgcc tacgccgcca aggtggtgcc ggagatggac aaggaccagg tggtggtcat
     1501 caacctctcc ggccgggggg acaaggacgt gaccgaggtg atgcgccttc tgggagggga
     1561 gctgtgacca ccctcgaggc cttcgccaag gcccggtccg agggccgggc cgccctcatc
     1621 ccttacctca ccgcgggctt ccccagccgg gagggctttt tgcaggcggt ggaggaggtc
     1681 ctcccctatg ccgaccttct ggagatcggc ctcccctact ccgaccccct gggggacggc
     1741 cccgtgatcc agcgggcaag cgaactggcc ctgaggaagg ggatgagcgt ccaaggcgct
     1801 ttggagctcg tccgcgaggt gcgcgccctc acggagaaac ccctcttcct catgacctac
     1861 ctgaaccccg tcctggcctg ggggccggag cgcttcttcg gcctcttcaa gcaggcgggg
     1921 gccacggggg tgatccttcc cgaccttccc cccgacgagg acccaggcct cgtgcgcctc
     1981 gcccaggaga tcggcctgga gacggtcttc cttctcgccc ccacctccac ggacgcccgc
     2041 atcgccaccg tggtccgcca cgccacgggc ttctcgtacg ccgtctccgt caccggggtc
     2101 acggggatgc gggaaaggct tcccgaggag gtcaaggacc tggtgcggcg catcaaggcc
     2161 aggaccgccc tccccgtggc cgtgggcttc ggggtttccg gcaaggccac ggccgcgcag
     2221 gcggcggtgg cggacggggt ggtggtggga agcgccttgg tccgggcctt ggaggagggg
     2281 aggtcccttg cgccccttct ccaggagatc cgccagggcc tccagcgcct cgaggccaac
     2341 cccggcctga aggaaagctc aaagaagccg ttgtcctgag ccttggaggg taaacccatg
     2401 cggcagtggc cttttttagt tctggcaggc ggcctcctcc tcggggcctg tgggctcctt
     2461 ccggggtttg gggggcctaa ggtctccgga gagttccagg gggactggtc cggcgtggcc
     2521 cagggcctgc gcctggccct ggtgggcctc acgacggagg ggcaggtgaa ctacgacaac
     2581 cagctg
//

GenBank-Updates@genbank.bio.net (08/04/90)

LOCUS       M25435        120 bp ds-DNA             BAD       04-AUG-1990
DEFINITION  Figure 6. Possible secondary structure model of 5S RNA.
ACCESSION   M25435
REFERENCE   1  (bases 1 to 120)
  AUTHORS   Rabin,D. and Crothers,D.M.
  TITLE     analysis of rna secondary structure by photochemical reversal of
            psoralen crosslinks
  JOURNAL   Nucleic Acids Res. 7, 689-703 (1979)
  STANDARD  unannotated staff_entry
BASE COUNT       23 a     36 c     41 g     20 t
ORIGIN      
        1 tgcctggcgg ccgtagcgcg gtggtcccac ctgaccccat gccgaactca gaagtgaaac
       61 gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac tgccaggcat
//
LOCUS       M25592         55 bp ds-DNA             BAD       04-AUG-1990
DEFINITION  Figure 4. The nucleotide sequence of the fragment is given
            at the top of the figure.
ACCESSION   M25592
REFERENCE   1  (bases 1 to 55)
  AUTHORS   Ogata,R.T. and Gilbert,W.
  TITLE     dna-binding site of lac repressor probed by dimethylsulfate
            methylation of lac operator
  JOURNAL   J. Mol. Biol. 132, 709-728 (1979)
  STANDARD  unannotated staff_entry
BASE COUNT       16 a      9 c     16 g     14 t
ORIGIN      
        1 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacag
//
LOCUS       M25594         75 bp ds-DNA             BAD       04-AUG-1990
DEFINITION  Figure 4. (a) the cloverleaf model for the base pairing of
            native tRNA-Trp.
ACCESSION   M25594
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Jones,C.R., Kearns,D.R. and Muench,K.H.
  TITLE     nuclear magnetic resonance of the base-pairing structure of the
            native and denatured conformers of escherichia coli transfer
            rna-trp
  JOURNAL   J. Mol. Biol. 103, 747-764 (1976)
  STANDARD  unannotated staff_entry
BASE COUNT       13 a     21 c     24 g     14 t      3 others
ORIGIN      
        1 aggggcgtag ttcaaddggd agagcaccgg tctccaaacc gggtgttggg agttcgagtc
       61 tctccgcccc tgcca
//
LOCUS       M25597         78 bp ds-DNA             BAD       04-AUG-1990
DEFINITION  Figure 1: (A) The cloverleaf structure of E. coli tRNA-Met-f1.
ACCESSION   M25597
REFERENCE   1  (bases 1 to 78)
  AUTHORS   Daniel,W.E.Jr. and Cohn,M.
  TITLE     changes in tertiary structure accompanying a single base change in
            transfer rna. protein magnetic resonance and aminoacylation studies
            of escherichia coli trna-met-f1 and trna-met-f3 and their
            spin-labeled (s4u8) derivatives
  JOURNAL   Biochemistry 15, 3917-3924 (1976)
  STANDARD  unannotated staff_entry
BASE COUNT       15 a     26 c     25 g     12 t
ORIGIN      
        1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggatc gtcggttcaa
       61 atccggcccc cgcaacca
//
LOCUS       M26841        459 bp ds-DNA             BAD       04-AUG-1990
DEFINITION  Figure 1. Nucleotide sequence coding for mature Il-1-beta and the
            corresponding amino acid sequence.
ACCESSION   M26841
REFERENCE   1  (bases 1 to 459)
  AUTHORS   Gronenborn,A.M., Clore,G.M., Schmeissner,U. and Wingfield,P.
  TITLE     a h-nmr study of human interleukin-1-beta
  JOURNAL   Eur. J. Biochem. 161, 37-43 (1986)
  STANDARD  unannotated staff_entry
BASE COUNT      143 a    109 c    106 g    101 t
ORIGIN      
        1 gcacctgtac gatcactgaa ctgcacgctc cgggactcgc agcaaaaaag cttggtgatg
       61 tctggtccat atgaactgaa agctctccac ctccagggac aggatatgga gcaacaagtg
      121 gtgttctcca tgtcctttgt acaaggagaa gaaagtaatg acaaaatacc tgtggccttg
      181 ggcctcaagg aaaagaatct gtacctgtcc tgcgtgttga aagatgataa gcccactcta
      241 cagctggaga gtgtagatcc caaaaattac caaaagaaga agatggaaaa gcgatttgtc
      301 ttcaacaaga tagaaatcaa taacaagctg gaatttgagt ctgcccagtt ccccaactgg
      361 tacatcagca cctctcaagc agaaaacatg cccgtcttcc tgggagggac caaaggcggc
      421 caggatataa ctgacttcac catgcaattt gtgtcttcc
//
LOCUS       M28421        142 bp ds-DNA             BAD       04-AUG-1990
DEFINITION  Figure 5. The unique Xho I site of phi X174 DNA.
ACCESSION   M28421
REFERENCE   1  (bases 1 to 142)
  AUTHORS   Lam,L.H. and Reynolds,R.J.
  TITLE     dna sequence dependence of closely opposed cyclobuty1 pyrimidine
            dimers induced by uv radiation
  JOURNAL   Mutat. Res. 178, 167-176 (1987)
  STANDARD  unannotated staff_entry
COMMENT     
BASE COUNT       43 a     27 c     28 g     44 t
ORIGIN      
        1 gctttatcaa gataattttt cgactcatca gaaatatccg aaagtgttaa cttctgcgtc
       61 atggaagcga taaaactctg caggttggat acgccaatca tttttatcga agcgcgcata
      121 aatttgagca gatttgtcgt ca
//
LOCUS       M28574        160 bp ds-DNA             BAD       04-AUG-1990
DEFINITION  Figure 4. Sequence of tyrT DNA.
ACCESSION   M28574
REFERENCE   1  (bases 1 to 160)
  AUTHORS   Drew,H.R. and Travers,A.A.
  TITLE     dna bending and its relation to nucleosome positioning
  JOURNAL   J. Mol. Biol. 186, 773-790 (1985)
  STANDARD  unannotated staff_entry
COMMENT     
BASE COUNT       40 a     42 c     38 g     40 t
ORIGIN      
        1 ccggttacct ttaatccgtt acggatgaaa attacgcaac cagttcattt ttctcaacgt
       61 aacactttac agcggcgcgt catttgatat gatgcgcccc gcttcccgat aagggagcag
      121 gccagtaaaa agcattaccc cgtggtgggg gttcccgaga
//
LOCUS       M28575        169 bp ds-DNA             BAD       04-AUG-1990
DEFINITION  Figure 4. Sequence of tyrT DNA.
ACCESSION   M28575
REFERENCE   1  (bases 1 to 169)
  AUTHORS   Drew,H.R. and Travers,A.A.
  TITLE     dna bending and its relation to nucleosome positioning
  JOURNAL   J. Mol. Biol. 186, 773-790 (1985)
  STANDARD  unannotated staff_entry
COMMENT     
BASE COUNT       42 a     46 c     40 g     41 t
ORIGIN      
        1 aattccggtt acctttaatc cgttacggat gaaaattacg caaccagttc atttttctca
       61 acgtaacact ttacagcggc gcgtcatttg atatgaagcg ccccgcttcc cgataaggga
      121 gcaggccagt aaaaagcatt accccgtggt gggggttccc gagccccgg
//
LOCUS       RCANIFR4     2583 bp ds-DNA             BCT       04-AUG-1990
DEFINITION  Rhodobacter capsulatus nitrogen fixation (nifR4 and nifA) genes,
            5' end.
ACCESSION   X12358
KEYWORDS    nifA gene; nifR4 gene; nitrogen fixation.
SOURCE      Rhodobacter capsulatus (strain SB1003) DNA, clone pRCN200.
  ORGANISM  Rhodobacter capsulatus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 2583)
  AUTHORS   Jones,R. and Haselkorn,R.
  TITLE     The DNA Sequence of the Rhodobacter capsulata ntrA, ntrB and ntrC
            gene analogues required for nitrogen fixation
  JOURNAL   Mol. Gen. Genet. (1988) In press
  STANDARD  simple automatic
COMMENT     See X07567 for overlapping sequence
            
              [1]  Author address Jones R.,
              University of Southern California,
              Mathematics Department DRB289
              University Park
              Los Angeles, CA 90089-1113 USA.  Submitted (15-JUL-1988) on tape
            to the EMBL data library by:
FEATURES       from  to/span     description
    pept        644     1924     nifR4 protein
    pept       2302  >  2583     nifA protein
BASE COUNT      406 a    906 c    835 g    436 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttgctt gcggggcccg ttacggttgg cacgcttctt gctggatcat ttctgaagcc
       61 tcaaccgatc aaaaaggggg atgcgatgaa ggatctcttc gacgaaagcc tgacgctgga
      121 caccgcgctg ccgcgcccca ggcaccgccc gccgccgcgc cctggcgccg gcaggccgcg
      181 gcgaaggccc ccgacaccgc cgccttcctt gcgaatttcg tcaggatcgg agagatcgcc
      241 gccccgaaac ccccggccgc ggcctgtttc cgcccgagga agaagccgcg gtcgtcgccg
      301 agctgatcgc cgagatgcgc ccgatgttcc agcgcgacgg cggcgacatc gagctgatcg
      361 gcctcaccgg cgccaccgtt caggtgcggc tttcgggcag ctgcgcgggc tgcatgatgt
      421 cggcgcgcac gctttccacg gtgcaacacc agctgatcga aaccctgggc cgccggtccg
      481 ggtcgtcccc gagatccggc actgagatga tcgcccgccg ccccgcccaa aaacacggca
      541 caacccaggg tggcgcggcg cggatttcgg cacctgaccc tttcttcacc tgcggcgccc
      601 tgccgcccca cgcaatgacc cgagtttgag ataggttgtc cccatggaac tcgcgcagac
      661 gctttcccaa agacagacga tgcagatggc ggggcagatg ctccattccc tcgccatcct
      721 tggcatgtcg tcgcaggatt tgtccgagca tctgaccgaa caggccacat cgaacccctt
      781 tctgacctac cgcgccccgc ccgccttcat cgcccgcggc ggcgaggatt tcgacgccgt
      841 gggcgcggtg gccgcgcaca agcccagcct gatggcccat gtggtcgatc agatcgagat
      901 ggccttcacc gagacccccg accggctgct ggcgctgcgc tttgccgagg cgctggagcc
      961 gagcggctgg ctgggccaga gcctcgacag catcgcgctg gcggcgggcg tgtcgctttc
     1021 gcgcgccgaa tcgatgctgg cggtcctgca gggcttcgag cctaccggcc ttttcgcccg
     1081 cgacctgtcg gattgcctga tcctgcaggc gcgcgaggcc gacatcctga cctgggaagt
     1141 cgaaacgctg atccgcaaca ttcggctgat cgcggaaaac cggctgtccg atctggccga
     1201 tctttgcgat tgcgacatcg gcgacatccc cgaaatcatc aagcagatcc ggcacttgaa
     1261 cccgaagccg gggcttgctt tcgaccatca gccgacgccc gttttcccgc ccgatctgat
     1321 cgccgtgcgc ggcgccgagg gctggacggt ggaactcaac cgcgccacct cgcccacgat
     1381 caccgtgcgc gaagatcgtt tcgccgatgg caccgccgat gcgaaggcgc gcgcggaacg
     1441 gcgaaggcgt ggccgaggcc cgggcgctgg cgaggcgctg gagcgacgcc gcgatacgct
     1501 gttgcgcacc gccgccgtgc tggtggcgcg gcaatcggcc ttcctcgaca aggggcccgc
     1561 ccatctggtg ccgctgacgc tggaggatgt ggcctcggaa ctgggcctgc atgcctcgac
     1621 gatttcgcgc gcggtctcgg ggcggatgat ccagacccag acccgggcgc tgccgctgcg
     1681 ggcgttcttt tcccgcgcgg tctcgaccca gggcggtggc gaggcggtgt cgcgcgacag
     1741 ccttgatttc gtgcagcgca cgtgggcggc gaagatccgc caaaacccgc tgtcggatga
     1801 tgcgatcgtg acgctcgccg aacgcgccgg gctgcggatc gcgcggcgga cggtggcgaa
     1861 ataccgctcg acgctgggcc ttgcctcctc ctacgagcgt cgccgcgccg ccgccgcgcg
     1921 ctgagtctgc gaaacgcgac ctctgccaga ccggaagcgc catttttttc ggcgctccgg
     1981 ccattcccgc ctcaaagttg atcttaacga agcctcaatc acccccaagt tgcgacatat
     2041 tggcacctga gactgtttgc gccgtgcatc cgggcgcaag cttgaccaag tctcgacatg
     2101 tccgccacgg cacccaaccg cgatcgaacg atccacgcag cgcagactgc ggcaccccat
     2161 gggagggggt gatcgtcgtt tgatcccccg gcgccgggca agcgagggag aacagtgatg
     2221 atcgacatcc gggaccggct cgtcccccaa ccgcagcaag acatcgctcg gcgcgggcca
     2281 cggccgaccg ccttgccttg gatgcgcttt acgagatcgc gaagaccttt gcggcggccg
     2341 gatccggtgg cgaggtgccg agatcttcaa cgtgctgtcg tctttcctcg atctgcgcca
     2401 tggggtgctg gcgttctggc cgagccgggc gagggcgcgg gggtgaaccc ctatgtcatc
     2461 gccgcgacgg cctttcagcg cagccccgag gcccccgccg cggatgtgct gcccgatgcc
     2521 gtggcgcgga tcgtctttcg ctccggcgtg cctttcgtca gctttgatct ggccgccgaa
     2581 ttc
//
LOCUS       ACTCPA        634 bp ds-DNA             PLN       04-AUG-1990
DEFINITION  A.deliciosa actinidin gene, 5' end.
ACCESSION   M35795
KEYWORDS    actinidin.
SOURCE      A.deliciosa (var. deliciosa, strain cv.Hayward) leaf DNA, clone
            pKIWI 37.
  ORGANISM  Actinidia deliciosa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Theales; Actinidiaceae.
REFERENCE   1  (bases 1 to 634)
  AUTHORS   Keeling,J., Maxwell,P. and Gardner,R.C.
  TITLE     Nucleotide sequence of the promoter region from kiwifruit actinidin
            genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Keeling, 05-JUL-1990.
            
              Author address: J.Keeling
              Centre for Gene Technology
              Dept of Cellular and Molecular Biology
              University of Auckland
              Private Bag
              Auckland
              NEW ZEALAND
              email: SYSTEM%CELVAX.AUKUNI.AC.NZ
            
FEATURES       from  to/span     description
    pept        595  >   634     actinidin
    mRNA        558  >   634     actinidin mRNA
    signal      507      514     TATA box
BASE COUNT      245 a    120 c     71 g    198 t
ORIGIN      
        1 caaaccgtag ttacaaggtg ggcctgaacc agtttgctga cctgactgat gaggagttcc
       61 gataaattag ctttcttttt ttgtcttcac ctattttctt tttcaaattt gatttacatt
      121 tgtacattta aaattatttt cgccaagaaa aaccgaataa tataaaaatt tgatcaaaca
      181 attaaaacaa ttcattaaag gcgaaaaata taaaaaccaa atcaactaaa aaatatcaac
      241 aaaaaaataa ctaatcaaaa ctgggcataa atagaaagga caatttaata attttgtata
      301 ataaaaacaa aaataaagta tttaaagttg tggttggata aggatttaaa gaagaaaaaa
      361 aattaaatct aaatcattga aatttaattt tatatttttt ttctcttttt tctactgaat
      421 ctgcagttcc aacagaacct ttaaaaaaaa ttgtgaaaat cattttttca aatgtcgtaa
      481 gacccccccc accccccacg caccctatat aaaggccact ctctccctcc acattcacac
      541 acctccaatc ccaatctttt tcttctaaaa ttcaaaaaac gagagagaac aaaaatgggt
      601 ttgcccaaat ccttcgtgtc aatgtctctc ctct
//
LOCUS       BOVAPOAI      933 bp ss-mRNA            MAM       04-AUG-1990
DEFINITION  B.taurus apolipoprotein A-I mRNA, complete cds.
ACCESSION   M35870
KEYWORDS    apolipoprotein A-I.
SOURCE      B.taurus liver, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 933)
  AUTHORS   O'hUigin,C., Chan,L. and Li,W.-H.
  TITLE     Cloning and sequencing of bovine apolipoprotein A-I cDNA and
            molecular evolution of apolipoproteins A-I and B-100
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C. O'hUigin, 27-JUN-1990.
FEATURES       from  to/span     description
    pept         73      870     apolipoprotein A-I precursor
    sigp         73      127     apolipoprotein A-I signal peptide
    matp        145      867     apolipoprotein A-I
    signal      914      919     poly-A signal
BASE COUNT      201 a    269 c    313 g    150 t
ORIGIN      
        1 cgccaattac aggcctcgaa agagtcctgt attgttattt ttcgtcacta cctccccggg
       61 tcgggattca agatgaaagc cgtggtgctg accttggctg tgctcttcct gacggggagc
      121 caggctcggc atttctggca gcaagatgac ccccagtcat cctgggatcg ggtgaaggat
      181 tttgccaccg tgtatgtgga agcaatcaag gatagtggca gagactatgt ggcccaattc
      241 gaagcctccg ctttgggaaa acagctcaac ctgaaactcc tggacaactg ggacaccctg
      301 gccagcacgt tgtccaaagt gcgtgaacag ctgggcccag tgacccagga gttctgggac
      361 aacctggaaa aggagaccgc gtcgctgagg caggagatgc acaaggacct ggaggaggtg
      421 aagcagaagg tgcagcccta cctggacgag ttccagaaga agtggcacga ggaggtggag
      481 atctaccgcc agaaggtggc gccgctgggc gaggagtttc gcgagggcgc gcgccagaag
      541 gtgcaggagc tgcaggacaa gctgagccct ctggcccagg agctccgcga ccgcgcgcgc
      601 gcccacgtgg agacgctgcg gcagcacgtg gcgccctaca gcgacgacct gcgccagcgg
      661 ctgaccgcgc gcctggaggc gctcaaggag ggcggcggca gcctggccga gtaccacgcc
      721 aaggccagcg agcagctgaa ggcgctgggc gagaaggcca agccggtgct ggaggacctc
      781 cgccagggcc tgctgcccgt gctggagagc ctcaaggtca gcatcctggc cgccatagac
      841 gaggcctcca agaagctgaa cgcccagtga gatgcctcgg gctgccccgc tggccatcac
      901 ttcggtttct tagaataaac atttccggag tgg
//
LOCUS       DOGINVOLA     864 bp ds-DNA             MAM       04-AUG-1990
DEFINITION  Dog involucrin gene, complete cds.
ACCESSION   M34442
KEYWORDS    involucrin.
SOURCE      Dog adult skin keratinocyte DNA.
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 864)
  AUTHORS   Tseng,H. and Green,H.
  TITLE     The involucrin genes of pig and dog: Comparison of their segments
            of repeats with those of prosimians and higher primates
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Tseng and H.Green, 23-MAY-1990.
FEATURES       from  to/span     description
    pept          1      858     involucrin
    rpt         232      513     repeat segment
BASE COUNT      286 a    226 c    249 g    103 t
ORIGIN      
        1 atgtcccagc aacacactct gccagtgacc ctgccccctg ccctcagtca ggggcccctc
       61 aagcctgagt ctcctcccat cgatactcaa caagagcagg tgaaacagcc gacttcacta
      121 cctgtcctgt gccaaaaaat gccctcttct ctcccaggga aggttccctt ggggcatggg
      181 gagaaacata caagtcttgt gaagggggaa cctgagcaac agtgtgagcc acaggagcag
      241 gaacagcagc agaaacaaca agagtcacag gagcagaaac tgcacctgga acagtgtctg
      301 gaacagcatc aggaacaaca agagtcacag gaccagaaac tgtacccgga gcagtgtctg
      361 gaacagcagc aggagcaaca agagtcacag gaccagaaac tgtacccgga gcagtgtctg
      421 gaacagcagc aggagcaaca agagtcacag gagaaggaac ttcatctgga acaggaacag
      481 cagaaggagg agctacagca gcaggaacag cagcagggaa aggaacagtg tgagaaacat
      541 caggaagcaa aaaatctgga gcagcagcta gagcagatag gagcacaaag gaagcagcag
      601 caaaaggaac agctggaaca ggaaaagaag ctcgtggacc agcacctgga tcaagagcca
      661 gcacagagaa ctgagcaacc ggaaaagaaa gaagagcagg tgctggagca gcaggggcag
      721 caggaggggc agctggagca gcctgccttt gtcccagctc aggtccaggt ccgagaaccc
      781 ctgaagggag aagtcctgcc cctcatagag cagcagcacc agaagcagga ggttcatgac
      841 cctccagaac accagtgagc accc
//
LOCUS       DRODMRA       360 bp ds-DNA             INV       04-AUG-1990
DEFINITION  D.melanogaster dispersed middle repetitive DNA sequence.
ACCESSION   M16263
KEYWORDS    dispersed middle repetitive DNA sequence.
SOURCE      D.melanogaster DNA, clone p112-0.4RA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Waring,G.L. and Pollack,J.C.
  TITLE     Cloning and characterization of a dispersed, multicopy, X
            chromosome sequence in Drosophila melanogaster
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987)
  STANDARD  simple staff_review
BASE COUNT      138 a     56 c     42 g    124 t
ORIGIN      X chromosome.
        1 ccaaatttaa atgcaaatcg atttggattt taataacaaa ctcaacgagg tatgacattc
       61 catatttggt ctattatttt caatgttatg atcaaaatac caaatatttt tttcacaaaa
      121 aaccaggaaa acgaaatttg gcaaaaaatt taatatttaa gtagagtttt ttgggtataa
      181 cttgactaaa aatggtcaca caactaaaat aattaccatt ttgtgctcct aattaccaat
      241 actaatcatc cccatcactt tttggcggac tttgaaaaat taatttttgc ccaaattttc
      301 gcattttttg taaggggtaa ctcatcaaaa tttttaaaaa atgccaaaaa aataaagttc
//
LOCUS       DRODMRB       354 bp ds-DNA             INV       04-AUG-1990
DEFINITION  D.melanogaster dispersed middle repetitive DNA sequence.
ACCESSION   M16264
KEYWORDS    dispersed middle repetitive DNA sequence.
SOURCE      D.melanogaster DNA, clone pX-12R.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Waring,G.L. and Pollack,J.C.
  TITLE     Cloning and characterization of a dispersed, multicopy, X
            chromosome sequence in Drosophila melanogaster
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987)
  STANDARD  simple staff_review
BASE COUNT      130 a     57 c     47 g    120 t
ORIGIN      X chromosome.
        1 ccaaatttga atgcaaatcg atttggattt aaaaaacaaa ctcaacgaag atgacattcc
       61 atatttgggc cattattttc aatgttttga tcaaaatacc gattattttc ttctcaaaaa
      121 attagggaaa cgagttccgg ccaaaaatgc atatttaaga ttggtttttt gggtatactt
      181 ggctaaaatg gtcagacagc aaaaaaatta ccatttattc tcctataaca aatactaacc
      241 acccctttca cttttaaacg gattttgtaa aattaatttt tggccaaatt ttcgcatttt
      301 ttttaagggg taccatcata aaaatttgca aaaaattgaa aaaatctaga attc
//
LOCUS       DRODMRC       352 bp ds-DNA             INV       04-AUG-1990
DEFINITION  D.melanogaster dispersed middle repetitive DNA sequence.
ACCESSION   M16265
KEYWORDS    dispersed middle repetitive DNA sequence.
SOURCE      D.melanogaster DNA, clone pX-30 P-0.4R.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 352)
  AUTHORS   Waring,G.L. and Pollack,J.C.
  TITLE     Cloning and characterization of a dispersed, multicopy, X
            chromosome sequence in Drosophila melanogaster
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987)
  STANDARD  simple staff_review
BASE COUNT      128 a     64 c     42 g    118 t
ORIGIN      X chromosome.
        1 ccaaacttaa atgcaaatcg attgggattc aaaaaacaaa ctcaacgagg tatgacattc
       61 catatttggg ccattatttc catgttttca tcaaaatacc gaatttttat cacaaaaaat
      121 ctgggaaact atttttggca aaaactcaat tttcaagttg gctcttttgg ctataactga
      181 ctaaaaatgg tcacagagca aaaattagta ccattttata ctccttataa ccaatactaa
      241 ccaccccttt cactttttaa ctgatttagt aaaatttaat tttggccaaa ttttcgcatt
      301 ttttgtaagg gtaacatcat aaatttgcaa aaaattgaaa aatcctagat tc
//
LOCUS       DROTMLPA     5546 bp ss-mRNA            INV       04-AUG-1990
DEFINITION  D.melanogaster serrate (Ser) mRNA, complete cds.
ACCESSION   M35759
KEYWORDS    Ser gene; serrate.
SOURCE      D.melanogaster pupa, cDNA to mRNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 5546)
  AUTHORS   Fleming,R.J., Scottgale,T.N., Diederich,R.J. and
            Artavanis-Tsakonas,S.
  TITLE     The gene serrate encodes a putative EGF-like transmembrane protein
            essential for proper ectodermal development in Drosophila
            melanogaster
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.J.Fleming, 28-JUN-1990.
            
              Author address: R.J.Fleming
              Yale University
              Dept of Biology
              KBT 1104
              New Haven, CT 06511
              email: WHAKRIA@YALEVM
            
FEATURES       from  to/span     description
    pept        443     4657     serrate protein precursor
    sigp        443      679     serrate protein signal peptide
    matp        680     4657     serrate protein
    mRNA    <     1     5546
BASE COUNT     1384 a   1446 c   1504 g   1212 t
ORIGIN      Chromosome 3 97F.
        1 ccgagtcgag cgccgtgctt cgagcggtga tgagcccctt ttctgtcaac gctaaagatc
       61 tacaaaacat cagcgcctat caagtggaag tgtcaagtgt gaacaaaaca aaaacgagag
      121 aagcacatac taaggtccat ataaataata aataataatt gtgtgtgata acaacattat
      181 ccaaacaaaa ccaaacaaaa cgaaggcaaa gtggagaaaa tgatacagca tccagagtac
      241 ggccagttat tcagctatcc agagcaagtg tagtgtggca aaatagaaac aaacaaaggc
      301 accaaaatct gcatacatgg gctaattaag gctgcccagc gaatttacat ttgtgtggtg
      361 ccaatccaga gtgaatccga aacaaactcc atctagatcg ccaaccagca tcacgctcgc
      421 aaacgccccc agaatgtaca aaatgtttag gaaacatttt cggcgaaaac cagctacgtc
      481 gtcgtcgttg gagtcaacaa tagaatcagc agacagcctg ggaatgtcca agaagacggc
      541 gacaaaaagg cagcgtccga ggcatcgggt acccaaaatc gcgaccctgc catcgacgat
      601 ccgcgattgt cgatcattaa agtctgcctg caacttaatt gctttaattt taatactgtt
      661 agtccataag atatccgcag ctggtaactt cgagctggaa atattagaaa tctcaaatac
      721 caacagccat ctactcaacg gctattgctg cggcatgcca gcggaactta gggccaccaa
      781 gacgataggc tgctcgccat gcacgacggc attccggctg tgcctgaagg agtaccagac
      841 cacggagcag ggtgccagca tatccacggg ctgttcgttt ggcaacgcca ccaccaagat
      901 actgggtggc tccagctttg tgctcagcga tccgggtgtg ggagccattg tgctgccctt
      961 tacgtttcgt tggacgaagt cgtttacgct gatactgcag gcgttggata tgtacaacac
     1021 atcctatcca gatgcggaga ggttaattga ggaaacatca tactcgggcg tgatactgcc
     1081 gtcgccggag tggaagacgc tggaccacat cgggcggaac gcgcggatca cctaccgtgt
     1141 ccgggtgcaa tgcgccgtta cctactacaa cacgacctgc acgaccttct gccgtccgcg
     1201 ggacgatcag ttcggtcact acgcctgcgg ctccgagggt cagaagctct gcctgaatgg
     1261 ctggcagggc gtcaactgcg aggaggccat atgcaaggcg ggctgcgacc ccgtccacgg
     1321 caagtgcgat cgtccggggg aatgcgaatg cagacccggc tggcgtggtc cattgtgcaa
     1381 cgagtgcatg gtctatcccg gctgcaagca tggttcctgc aacggcagcg cctggaaatg
     1441 cgtgtgcgac accaactggg gtggcatatt gtgcgatcaa gatttaaatt tctgcggcac
     1501 ccatgaaccc tgcaagcacg gcggcacctg cgaaaatacc gctccggaca aatatcggtg
     1561 cacatgcgcc gagggcctct cgggcgagca gtgcgagatc gtggagcacc catgtgccac
     1621 caggccatgc cgcaacggcg gcacatgcac actcaagacg agtaaccgaa ctcaagccca
     1681 agtgtatcgc acatcacatg gcaggagcaa catgggccgg ccggtaagac gcagcagttc
     1741 gatgcgcagc ctggatcacc tgcggccgga ggggcaggcg ctgaatggca gcagctcctc
     1801 gggattggtg tccctaggtt cgctgcagct gcagcagcaa ctggcccccg acttcacttg
     1861 cgactgcgca gccggatgga cgggaccgac atgcgaaata aatatcgacg agtgcgccgg
     1921 gggtccctgc gagcatggtg gcacttgcat cgatctaatc ggtggctttc gatgtgaatg
     1981 tccgccggag tggcatggcg atgtctgtca ggtggatgtg aacgagtgcg aggcgccgca
     2041 ttccgccgga atcgctgcga acgcattgct gaccaccaca gccaccgcga ttattggtag
     2101 taatctgagc agtactgctc ttctggccgc tctgaccagt gcagtggcat ccacatcctt
     2161 ggccatcgga ccctgcatca atgccaagga gtgtcgcaat cagccgggtt cctttgcctg
     2221 catctgcaag gagggctggg gcggagtgac ctgtgccgag aatctagatg actgtgtggg
     2281 tcagtgccgg aatggagcca cctgcattga tctggtcaac gactataggt gcgcctgtgc
     2341 ctctggattc acgggtcgcg attgcgagac ggacatagac gagtgcgcca cttccccgtg
     2401 ccgaaacgga ggcgaatgtg tggacatggt gggcaaattc aattgcattt gcccacttgg
     2461 ctactcgggt tctctgtgcg aggaggccaa ggagaactgc acaccgtcgc catgtttgga
     2521 gggtcactgc ctcaacacgc ccgaaggata ctactgccat tgtccaccgg atcgcgccgg
     2581 aaagcactgc gagcaactgc gtccgctctg ctcccagccg ccctgcaacg agggctgctt
     2641 cgccaatgtc agcctagcga cgtcagcgac aacgacgacg acaaccacca caacggcgac
     2701 aacgacaagg aagatggcca agccaagcgg attgccctgc agcggacacg gcagctgcga
     2761 gatgagcgac gtgggcacct tctgcaaatg ccatgtgggc cacaccggca ccttctgcga
     2821 gcacaatctc aacgaatgct cgccgaatcc ttgtcgaaat gggggaattt gccttgacgg
     2881 cgacggcgat tttacatgcg agtgcatgtc gggctggaca ggtaaacgct gctcggagcg
     2941 cgctacaggt tgttatgccg gtcagtgcca gaatggtggt acctgcatgc ctggagcccc
     3001 ggacaaggct ctgcagccgc attgccgctg tgcgccaggt tggactggtc tgttttgcgc
     3061 cgaggctatt gaccagtgtc gcgggcagcc gtgccacaat ggcggaacgt gcgagtcggg
     3121 agcgggctgg ttccgctgcg tctgcgctca gggattctct ggtccagact gccgcatcaa
     3181 tgtgaacgag tgctcgccac agccttgcca gggcggtgcc acctgcatcg acggaatcgg
     3241 tggatacagc tgcatctgcc caccaggaag gcatggattg cggtgtgaaa ttttgctctc
     3301 cgatcccaag tccgcctgcc agaacgcaag caacactatc tctccgtata cagctctaaa
     3361 ccgaagccaa aactggctgg atattgctct aaccggaaga acagaagacg atgagaactg
     3421 caatgcgtgt gtctgcgaaa acggcacctc tcggtgcacg aatctctggt gtggattgcc
     3481 caattgctat aaggtggatc cgctctccaa gtcctcgaat ctgtccggtg tttgcaaaca
     3541 gcacgaggtg tgcgttccgg cactgagtga gacatgcctg tcatcgcctt gtaatgttcg
     3601 tggagattgc cgggcactgg aaccatcgcg tcgggttgct ccaccccgac tgccagccaa
     3661 atctagctgc tggcccaatc aggccgtggt caacgagaac tgcgcccgac tcaccatcct
     3721 tttggccctg gagcgagtgg gcaagggagc ttcggtggag ggtctctgct ccctggtaag
     3781 ggtgctgctg gctgcccagt tgatcaagaa gccggcgagt acttttggcc aggatccggg
     3841 aatgcttatg gtgctctgcg atctcaaaac gggcaccaat gataccgttg aactaactgt
     3901 gtcgtccagt aaattaaatg atccccagct gccagtggcg gtgggtctgc tgggtgaact
     3961 cctgagctcc aggcagttga atggcatcca gcggcgcaag gaactggagc tgcagcatgc
     4021 aaaattggct gccctcacct ccattgtgga ggtcaagttg gaaacggccc gcgtggccga
     4081 tggatcgggt catagtctgc tgataggagt gctatgcggt gtctttatag tcctggtggg
     4141 attctcggtg ttcatcagtc tttactggaa acagcgtctg gcttatcgca ccagttcggg
     4201 aatgaactta actccctccc tggatgcact gcgtcacgag gaggagaagt cgaataatct
     4261 gcagaacgag gagaatctgc gaaggtatac aaatccgctg aagggcagca ccagttccct
     4321 aagagcggcc accggcatgg aactaagcct caatcccgct ccggaattag ccgcctcggc
     4381 ggcgagtagt tccgccttgc acagatcgca gccactattc ccgccatgcg atttcgagcg
     4441 tgagctggac tccagtacgg gcctgaagca ggcgcacaag cggagctcac agattctgct
     4501 gcacaaaacc caaaactcgg acatgcggaa gaacactgtg ggctcgctgg acagtccgcg
     4561 taaggacttt ggcaagcggt cgatcaactg caagtccatg ccaccctctt cgggcgacga
     4621 gggctccgat gtccttgcca ccactgtgat ggtttagccg tgatctcacc aaccaaccaa
     4681 tcaagaaacc aaccagccgc ccacagccag ctcaaagttc caattgccac agcacgggcg
     4741 ctatttccaa gtgcattagt agcgtaatta aaactaggat attgttaagg ataccaaggt
     4801 aggccacaac ggagtggctc tgttgaaaac gtaaagttct aaaaatccag gtctctcaga
     4861 caaagatgag gtacacaaat aaattggcta gttaatcaag catgttatgg ccacgggatg
     4921 ggcaaattta tttgtatacc tgatcttatc ttaatactaa accagttttc tactattttt
     4981 tttttgtgga tcaagcttaa aagttcagct aggcaggcgt tttccgcagt gccatgtcga
     5041 tgtggaagcc caaaatattt aggttagata gtgtaatttc gaactcttct cttcgctaag
     5101 caacatccta cacagtgtga tatttagtgt aacccaggcg cgcatttaca ttcaattaaa
     5161 gacaatgata tataaatata aacgaaatca actccttggc tagcacaagc tgtatgtata
     5221 tagttctcat ttaggatcgt cgcgctctat attgtgtata agctgtaaat actgtaaatt
     5281 agcagttacc gttattgtat tttgtctata gttagattgg tactattaaa ctaagaacca
     5341 gccgcaacgc gttagacttt aaaagttgtt tgcaattgta cgcaataata tagttttatg
     5401 ctcgtagtta ggtagctgtg taaccgggta agattcaaac gattttgtac tgtattatat
     5461 acctatctgt gtagtaatat ttatttatta tattaaattt gatctagacg caataaagta
     5521 atatcaataa agatagtaaa agacat
//
LOCUS       ECOLRRPA      495 bp ds-DNA             BCT       04-AUG-1990
DEFINITION  E.coli leucine-responsive-regulatory protein (Lrp) gene,
            complete cds.
ACCESSION   M35869
KEYWORDS    Lrp gene; leucine-responsive-regulatory protein.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 495)
  AUTHORS   Willins,D.A., Platko,J.V., Ryan,C. and Calvo,J.M.
  TITLE     Lrp, a small, basic global regulatory protein of E.coli
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.M.Calvo, 27-JUN-1990.
            
              Author address: J.M.Calvo
              Cornell University
              Dept of Biochemistry
              Room 451
              Biotechnology Bldg.
              Ithaca, NY 14853
              email:DEKY@CRNLVAX5
            
FEATURES       from  to/span     description
    pept          1      495     leucine-responsive-regulatory protein (Lrp)
BASE COUNT      127 a    110 c    128 g    130 t
ORIGIN      20 min. on E.coli map.
        1 atggtagata gcaagaagcg ccctggcaaa gatctcgacc gtatcgatcg taacattctt
       61 aatgagttgc aaaaggatgg gcgtatttct aacgtcgagc tttctaaacg tgtgggactt
      121 tccccaacgc cgtgccttga gcgtgtgcgt cggctggaaa gacaagggtt tattcagggc
      181 tatacggcgc tgcttaaccc ccattatctg gatgcatcac ttctggtatt cgttgagatt
      241 actctgaatc gtggcgcacc ggatgtgttt gaacaattca ataccgctgt acaaaaactt
      301 gaagaaattc aggagtgtca tttagtatcc ggtgatttcg actacctgtt gaaaacacgc
      361 gtgccggata tgtcagccta ccgtaagttg ctgggggaaa ccctgctgcg tctgcctggc
      421 gtcaatgaca cacggacata cgtcgttatg gaagaagtca agcagagtaa tcgtctggtt
      481 attaagacgc gctaa
//
LOCUS       HUMA1AT       143 bp ss-mRNA            PRI       04-AUG-1990
DEFINITION  Human alpha-1-antitrypsin (alpha-1-AT) mRNA, 3' end.
ACCESSION   M26123
KEYWORDS    alpha-1-antitrypsin.
SOURCE      Human adult liver, cDNA to mRNA, clone PJN1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 143)
  AUTHORS   Coutelle,C., Speer,A., Rogers,J., Kalsheker,N., Humphries,S. and
            Williamson,R.
  TITLE     Construction and partial characterization of a human liver cDNA
            library
  JOURNAL   Biomed. Biochim. Acta 44, 421-431 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      101     alpha-1-antitrypsin (alpha-1-AT; AA at 3)
                                 /hgml_locus_uid="LX0081X" /nomgen="PI"
                                 /map="14q32.1"
BASE COUNT       37 a     53 c     19 g     34 t
ORIGIN      
        1 ccgaggtcaa gttcaacaaa ccctttgtct tcttaatgat tgaccaaaat accaagtctc
       61 ccctcttcat gggaaaagtg gtgaatccca cccaaaaata actgcctctc gctcctcaac
      121 ccctcccctc catccctggc ccc
//
LOCUS       HUMGAPDHG    5378 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene,
            complete cds.
ACCESSION   J04038
KEYWORDS    glyceraldehyde-3-phosphate dehydrogenase.
SOURCE      Human DNA.
REFERENCE   1  (bases 1 to 5378)
  AUTHORS   Ercolani,L., Florence,B., Denaro,M. and Alexander,M.
  TITLE     Isolation and complete sequence of a functional human
            glyceraldehyde-3-phosphate dehydrogenase gene
  JOURNAL   J. Biol. Chem. 263, 15335-15341 (1988)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 650 to 1157)
  AUTHORS   Nasrin,N., Ercolani,L., Denaro,M., Kong,X.F., Kang,I. and
            Alexander,M.
  TITLE     An insulin response element in the glyceraldehyde-3-phosphate
            dehydrogenase gene binds a nuclear protein induced by insulin in
            cultured cells and by nutritional manipulations in vivo
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5273-5277 (1990)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    pept       1453     1481     glyceraldehyde-3-phosphate dehydrogenase, exon
                                 2 (first expressed exon) (EC 1.2.2.12)
               3116     3215     glyceraldehyde-3-phosphate dehydrogenase, exon
                                 3
               3306     3412     glyceraldehyde-3-phosphate dehydrogenase, exon
                                 4
               3542     3632     glyceraldehyde-3-phosphate dehydrogenase, exon
                                 5
               3723     3838     glyceraldehyde-3-phosphate dehydrogenase, exon
                                 6
               3931     4012     glyceraldehyde-3-phosphate dehydrogenase, exon
                                 7
               4206     4618     glyceraldehyde-3-phosphate dehydrogenase, exon
                                 8
               4723     4792     glyceraldehyde-3-phosphate dehydrogenase, exon
                                 9
    pre-msg    1138     4993     GAPDH mRNA and introns
    IVS        1190     1429     GAPDH intron A
    IVS        1482     3115     GAPDH intron B
    IVS        3216     3305     GAPDH intron C
    IVS        3413     3541     GAPDH intron D
    IVS        3633     3722     GAPDH intron E
    IVS        3839     3930     GAPDH intron F
    IVS        4013     4205     GAPDH intron G
    IVS        4619     4722     GAPDH intron H
    site        650      729     insulin response element A [2]
    site        729      867     insulin response element B [2]
    site       1049     1071     insulin response element A [2]
    signal      884      892     CAAT box
    signal     1108     1113     TATA box
    site       1447     1456     Kozak consensus sequence
    revision    877      879     ccc in [2]; cc in [1]
BASE COUNT      994 a   1642 c   1627 g   1115 t
ORIGIN      
        1 ggatcccctg ctgggagggg gcaggggacc tgttcccacc gtgtgcccaa gacctctttt
       61 cccacttttt ccctcttctt gactcaccct gccctcaata tcccccggcg cagcagtgaa
      121 agggagtccc tggctcctgg ctcgcctgca cgtcccaggg cggggaggga cttccgccct
      181 cacgtcccgc tcttcgcccc aggctggatg gaatgaaagg cacactgtct ctctccctag
      241 gcagcacagc ccacaggttt caggagtgcc tttgtgggag gcctctgggc ccccaccagc
      301 catcctgtcc tccgcctggg gccccagccc ggagagagcc gctggtgcac acagggccgg
      361 gattgtctgc cctaattatc aggtccaggc tacagggctg caggacatcg tgaccttccg
      421 tgcagaaacc tccccctccc cctcaagccg cctcccgagc ctccttcctc tccaggcccc
      481 cagtgcccag tgcccagtgc ccagcccagg cctcggtccc agagatgcca ggagccagga
      541 gatggggagg gggaagtggg ggctgggaag gaaccacggg cccccgcccg agcccatggg
      601 cccctcctag gcctttgcct gagcagaccg gtgtcactac cgcagagcct cgaggagaag
      661 ttccccaact ttcccgcctc tcagcctttg aaagaaagaa aggggagggg gcaggccgcg
      721 tgcagccgcg agcggtgctg ggctccggct ccaattcccc atctcagtcg ttcccaaagt
      781 cctcctgttt catccaagcg tgtaagggtc cccgtccttg actccctagt gtcctgctgc
      841 ccacagtcca gtcctgggaa ccagcaccga tcacctccca tcgggccaat ctcagtccct
      901 tccccctacg tcggggccca cacgctcggt gcgtgcccag ttgaaccagg cggctgcgga
      961 aaaaaaaaag cggggagaaa gtagggcccg gctactagcg gttttacggg cgcacgtagc
     1021 tcaggcctca agaccttggg ctgggactgg ctgagcctgg cgggaggcgg ggtccgagtc
     1081 accgcctgcc gccgcgcccc cggtttctat aaattgagcc cgcagcctcc cgcttcgctc
     1141 tctgctcctc ctgttcgaca gtcagccgca tcttcttttg cgtcgccagg tgaagacggg
     1201 cggagagaaa cccgggaggc tagggacggc ctgaaggcgg caggggcggg cgcaggccgg
     1261 atgtgttcgc gccgctgcgg ggtgggcccg ggcggcctcc gcattgcagg ggcgggcgga
     1321 ggacgtgatg cggcgcgggc tgggcatgga ggcctggtgg gggaggggag gggaggcgtg
     1381 tgtgtcggcc ggggccacta ggcgctcact gttctctccc tccgcgcagc cgagccacat
     1441 cgctcagaca ccatggggaa ggtgaaggtc ggagtcaacg ggtgagttcg cgggtggctg
     1501 gggggccctg ggctgcgacc gcccccgaac cgcgtctacg agccttgcgg gctccgggtc
     1561 tttgcagtcg tatgggggca gggtagctgt tccccgcaag gagagctcaa ggtcagcgct
     1621 cggacctggc ggagccccgc acccaggctg tggcgccctg tgcagctccg cccttgcggc
     1681 gccatctgcc cggagcctcc ttcccctagt ccccagaaac aggaggtccc tactcccgcc
     1741 cgagatcccg acccggaccc ctaggtgggg gacgctttct ttcctttcgc gctctgcggg
     1801 gtcacgtgtc gcagaggagc ccctccccca cggcctccgg caccgcaggc cccgggatgc
     1861 tagtgcgcag cgggtgcatc cctgtccgga tgctgcgcct gcggtagagc ggccgccatg
     1921 ttgcaaccgg gaaggaaatg aatgggcagc cgttaggaaa gcctgccggt gactaaccct
     1981 gcgctcctgc ctcgatgggt ggagtcgcgt gtggcgggga agtcaggtgg agcgaggcta
     2041 gctggcccga tttctcctcc gggtgatgct tttcctagat tattctctgg taaatcaaag
     2101 aagtgggttt atggaggtcc tcttgtgtcc cctccccgca gaggtgtggt ggctgtggca
     2161 tggtgccaag ccgggagaag ctgagtcatg ggtagttgga aaaggacatt tccaccgcaa
     2221 aatggcccct ctggtggtgg ccccttcctg cagcggctca cctcacggcc ccgcccttcc
     2281 cctgccagcc tagcgttgac ccgaccccaa aggccaggct gtaaatgtca ccgggaggat
     2341 tgggtgtctg ggcgcctcgg ggaacctgcc cttctcccca ttccgtcttc cggaaaccag
     2401 atctccaccg caccctggtc tgaggtctga ggttaaatat agctgctgac ctttctgtag
     2461 ctgggggcct gggctggggc tctctcccat cccttctccc cacacacatg cacttacctg
     2521 tgctcccact cctgatttct ggaaaagagc taggaaggac aggcaacttg gcaaatcaaa
     2581 gccctgggac tagggggtta aaatacagct tcccctcttc ccacccgccc cagtctctgt
     2641 cccttttgta ggagggactt agagaagggg tgggcttgcc ctgtccagtt aatttctgac
     2701 ctttactcct gccctttgag tttgatgatg ctgagtgtac aagcgttttc tccctaaagg
     2761 gtgcagctga gctaggcagc agcaagcatt cctggggtgg catagtgggg tggtgaatac
     2821 catgtacaaa gcttgtgccc agactgtggg tggcagtgcc cacatggccg cttctcctgg
     2881 aagggcttcg tatgactggg ggtgttgggc agccctggag ccttcagttg cagccatgcc
     2941 ttaagccagg ccagcctggc agggaagctc aagggagata aaattcaacc tcttgggccc
     3001 tcctgggggt aaggagatgc tgcattcgcc ctcttaatgg ggaggtggcc tagggctgct
     3061 cacatattct ggaggagcct cccctcctca tgccttcttg cctcttgtct cttagatttg
     3121 gtcgtattgg gcgcctggtc accagggctg cttttaactc tggtaaagtg gatattgttg
     3181 ccatcaatga ccccttcatt gacctcaact acatggtgag tgctacatgg tgagccccaa
     3241 agctggtgtg ggaggagcca cctggctgat gggcagcccc ttcataccct cacgtattcc
     3301 cccaggttta catgttccaa tatgattcca cccatggcaa attccatggc accgtcaagg
     3361 ctgagaacgg gaagcttgtc atcaatggaa atcccatcac catcttccag gagtgagtgg
     3421 aagacagaat ggaagaaatg tgctttgggg aggcaactag gatggtgtgg ctcccttggg
     3481 tatatggtaa ccttgtgtcc ctcaatatgg tcctgtcccc atctcccccc caccccggta
     3541 ggcgagatcc ctccaaaatc aagtggggcg atgctggcgc tgagtacgtc gtggagtcca
     3601 ctggcgtctt caccaccatg gagaaggctg gggtgagtgc aggagggccc gcgggagggg
     3661 aagctgactc agccctgcaa aggcaggacc cgggttcata actgtctgct tctctgctgt
     3721 aggctcattt gcagggggga gccaaaaggg tcatcatctc tgccccctct gctgatgccc
     3781 ccatgttcgt catgggtgtg aaccatgaga agtatgacaa cagcctcaag atcatcaggt
     3841 gaggaaggca gggcccgtgg agaagcggcc agcctggcac cctatggaca cgctcccctg
     3901 acttgcgccc cgctccctct ttctttgcag caatgcctcc tgcaccacca actgcttagc
     3961 acccctggcc aaggtcatcc atgacaactt tggtatcgtg gaaggactca tggtatgaga
     4021 gctggggaat gggactgagg ctcccacctt tctcatccaa gactggctcc tccctgctgg
     4081 ggctgcgtgc aaccctgggg ttgggggttc tggggactgg ctttcccata atttcctttc
     4141 aaggtgggga gggaggtaga ggggtgatgt ggggagtacg ctgcagggcc tcactccttt
     4201 tgcagaccac agtccatgcc atcactgcca cccagaagac tgtggatggc ccctccggga
     4261 aactgtggcg tgatggccgc ggggctctcc agaacatcat ccctgcctct actggcgctg
     4321 ccaaggctgt gggcaaggtc atccctgagc tgaacgggaa gctcactggc atggccttcc
     4381 gtgtccccac tgccaacgtg tcagtggtgg acctgacctg ccgtctagaa aaacctgcca
     4441 aatatgatga catcaagaag gtggtgaagc aggcgtcgga gggccccctc aagggcatcc
     4501 tgggctacac tgagcaccag gtggtctcct ctgacttcaa cagcgacacc cactcctcca
     4561 cctttgacgc tggggctggc attgccctca acgaccactt tgtcaagctc atttcctggt
     4621 atgtggctgg ggccagagac tggctcttaa aaagtgcagg gtctggcgcc ctctggtggc
     4681 tggctcagaa aaagggccct gacaactctt ttcatcttct aggtatgaca acgaatttgg
     4741 ctacagcaac agggtggtgg acctcatggc ccacatggcc tccaaggagt aagacccctg
     4801 gaccaccagc cccagcaaga gcacaagagg aagagagaga ccctcactgc tggggagtcc
     4861 ctgccacact cagtccccca ccacactgaa tctcccctcc tcacagttgc catgtagacc
     4921 ccttgaagag gggaggggcc tagggagccg caccttgtca tgtaccatca ataaagtacc
     4981 ctgtgctcaa ccagttactt gtcctgtctt attctagggt ctggggcaga ggggagggaa
     5041 gctgggcttg tgtcaaggtg agacattctt gctggggagg gacctggtat gttctcctca
     5101 gactgagggt agggcctcca aacagccttg cttgcttcga gaaccatttg cttcccgctc
     5161 agacgtcttg agtgctacag gaagctggca ccactacttc agagaacaag gccttttcct
     5221 ctcctcgctc cagtcctagg ctatctgctg ttggccaaac atggaagaag ctattctgtg
     5281 ggcagcccca gggaggctga caggtggagg aagtcagggc tcgcactggg ctctgacgct
     5341 gactggttag tggagctcag cctggagctg agctgcag
//
LOCUS       HUMVARNA      156 bp ss-mRNA            PRI       04-AUG-1990
DEFINITION  Human integrated adenovirus 2 VA RNA.
ACCESSION   M35961
KEYWORDS    .
SOURCE      Human adenovirus 2 infected KB cell line, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   Ohe,K. and Weissman,S.M.
  TITLE     Nucleotide sequence of an RNA from cells infected with adenovirus 2
  JOURNAL   Science 167, 879-881 (1969)
  STANDARD  simple staff_review
BASE COUNT       26 a     46 c     53 g     31 t
ORIGIN      
        1 gggcactctt ccgtggtctg gtggataaat tcgcaagggt atcatggcgt ggacgaccgg
       61 ggttcgaacc ccggatccgt gatccatgcg gttaccgtcc gccgcccgtg cgtcgaaccc
      121 aggtgtgcga cgtcagacaa cgggggagcg ctcctt
//
LOCUS       MUSGFAPA     2511 bp ss-mRNA            ROD       04-AUG-1990
DEFINITION  Mouse glial fibrillary acidic protein mRNA, 3' end.
ACCESSION   M25937
KEYWORDS    glial fibrillary acidic protein.
SOURCE      Mouse brain, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2511)
  AUTHORS   Cowan,N.J., Lewis,S.A., Balcarek,J.M., Krek,V. and Shelanski,M.
  TITLE     Structural implications of a cDNA clone encoding mouse glial
            fibrillary acidic protein
  JOURNAL   Ann. N.Y. Acad. Sci. 455, 575-582 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1     1212     glial fibrillary acidic protein (AA at 1)
BASE COUNT      625 a    690 c    697 g    499 t
ORIGIN      
        1 ctgggtacca tgccacgttt ctccttgtct cgaatgactc ctccactccc tgccagggtg
       61 gacttctccc tggccggggc gctcaatgct ggcttcaagg agacacgggc gagcgagcgt
      121 gcagagatga tggagctcaa tgaccgcttt gctagctaca tcgagaaggt ccgcttcctg
      181 gaacagcaaa acaaggcgct ggcagctgaa ctgaaccagc ttcgagccaa ggagcccacc
      241 aaactggctg atgtctacca ggcggagctt cgggagctgc ggctgcggct ggaccagctt
      301 acggccaaca gtgcccggct ggaggtggag agggacaact ttgcacagga cctcggcacc
      361 ctgaggcaga agctccaaga tgaaaccaac ctgaggctgg aggcagagaa caacctggct
      421 gcgtatagac aggaggcaca tgaagccacc ctggctcgtg tggatttgga gagaaaggtt
      481 gaatcgctgg aggaggagat ccagttctta aggaagatct atgaggagga agttcgagat
      541 ctccgggagc agctggccca acagcaggtc cacgtggaga tggatgtggc caagccagac
      601 ctcacagcgg ccctgagaga gattcgcact caatacgagg cagtggccac cagtaacatg
      661 caagagacag aggagtggta tcggtctaag tttgcagacc tcacagacgc tgcgtcccgc
      721 aacgcagagc tcctccgcca agccaaacac gaagctaacg actatcgccg ccaactgcag
      781 gccttgacct gcgatctgga gtccctccgc ggcacgaacg agtccctaga gcggcaaatg
      841 cgcgaacagg aagagcgcca tgcgcgggag tcggccagtt accaggaggc acttgctcgg
      901 ctggaggagg agggccaaag cctcaaggag gagatggccc gccacctgca ggagtaccag
      961 gatctactca acgttaagct agccctggac atcgagatcg ccacctacag gaaattgctg
     1021 gagggcgaag aaaaccgcat caccattcct gtacagactt tctccaacct ccagatccga
     1081 gaaaccagcc tggacaccaa atccgtgtca gaaggccacc tcaagaggaa catcgtggta
     1141 aagactgtgg agatgcggga tggtgaggtc attaaggact cgaagcagga gcacaaggac
     1201 gtggtgatgt gaggtgtgcc acctggtggc ccttgccatg cagtgtgagg gcccaaagct
     1261 tagcctcaaa taggcctgtt tgccaggctc agttcccacc cacaccagca cttcccttcc
     1321 ttccctggtt ttctgcctgt gtgctgccca aggcgtcaat caggtcataa gcttcataga
     1381 tggcatagat ggcatatacc cttcaccttc aactaacagg atactcaccc caaaggcgca
     1441 gtcaggaggg gagggaaccc cagctggtta gaattggaag ggaagaggaa agatgagcag
     1501 agtagagaga tttaacaaat cacttccttc atccttgttg ttatggaaac cgttgccaga
     1561 gctggaagtt tccacaggct gctggagcta gacaacaatt cagacagaaa gggaaagtcc
     1621 ctgaggcaaa gtctctctag ccagagacct atgcatcccg aatgccacta aggcagtcct
     1681 gaagggccct ccaggtgatg actccagtgt gtcagcccca ctgagcagct atgcaggttg
     1741 actgcccaca ggcatgtgaa acttggttct cagcacttgg cagggatcta tggcataagt
     1801 ggagagggaa ggtgtactgg acggcggaga ggagggctcc ctggccccta agtgtggatg
     1861 cagagaggtg gagcccagga agggtctctg cttaggctgc agggtaccga atggcagagg
     1921 cactggtaga gatcatttgg acactcggag ttgaaagtta caggcaatct gttacacttg
     1981 gctctgaatc ctatcaatca aggaaataac ccgttctctg gaagacactg aaacaggaga
     2041 gcaggacttc cgtccactgg gcagggtaca gatgtgtctc agttgtgaag gtctattcct
     2101 ggctgcacag tccccatccg ctcaggtcat cttacccctg tgactgctct cagccctgga
     2161 agaatccaca accatccttc caaggttgtc catccccaca atgactagct gttgctctcc
     2221 aagctaaggg accattccct gctctatgca tatacgtaat gtcacctatt tagtatcatc
     2281 ctatttgaga gtttgaggaa ctgaaacact gttctgttca agcacctggt gctatgcctt
     2341 catattagag caccttctct gaggctgatt ggtgggcagg tagggaagac attgagcaga
     2401 cagtgtccgc tcagttgtcc ttccctccct tccaaggtcc ctccctcttt ccaggacatc
     2461 gcccccccac cccacccctc ctttccacct ccgctaacct ccagacgcac t
//
LOCUS       MUSHOX61A     318 bp ss-mRNA            ROD       04-AUG-1990
DEFINITION  Mouse homeobox-6.1 protein (Hox-6.1) mRNA, 3' end.
ACCESSION   M35986
KEYWORDS    Hox-6.1 gene; homeobox-6.1 protein.
SOURCE      Mouse male kidney, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Sharpe,P.T., Miller,J.R., Evans,E.P., Burtenshaw,M.D. and
            Gaunt,S.J.
  TITLE     Isolation and expression of a new mouse homeobox gene
  JOURNAL   Development 102, 397-407 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      318     homeobox-6.1 protein (AA at 1)
    site         30      210     homeobox
BASE COUNT       94 a     80 c    100 g     44 t
ORIGIN      Chromosome 14E2.
        1 aatcacagtg gggtcggtta cggagcggac cggaggcgcg gccgccagat ctactctcgg
       61 taccagaccc tggaactgga gaaggaattt cacttcaacc gctacctaac tcggcgccgg
      121 cgcatcgaga tcgccaatgc tctgtgcctg accgagcgac agatcaaaat ctggttccag
      181 aacgcgcgca tgaagtggaa aaaagaatct aatctcacgt ccacactctc agggggtggc
      241 ggaggggcaa ccgccgacag cctgggagga aaggaggaaa agcgagaaga gacagaagaa
      301 gagaagcaga aagagtga
//
LOCUS       PEAVICIL      190 bp ds-DNA             PLN       04-AUG-1990
DEFINITION  P.sativum vicilin mRNA, partial cds.
ACCESSION   M35978
KEYWORDS    vicilin.
SOURCE      P.sativum, cDNA to mRNA, clone pRC 2.2.1.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 190)
  AUTHORS   Gatehouse,J.A., Lycett,G.W., Croy,R.R.D. and Boulter,D.
  TITLE     The post-translational proteolysis of the subunits of vicilin from
            pea (Pisum sativum L.)
  JOURNAL   Biochem. J. 207, 629-632 (1982)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   190     vicilin (AA at 2)
BASE COUNT       74 a     30 c     46 g     40 t
ORIGIN      
        1 agagatagag aagattctct tagaagagca tgagaaagag acacatcaca gaagaagcct
       61 tagggataag agacaacaga gccaagaaaa gaatgtaata gtcaaagtat caaagcaagt
      121 gcaaagttac aaagctaagt tgactccagg tgatgttttt gtaattccgg caggtcatcc
      181 tgttgccgta
//
LOCUS       PIGINVOLA    1204 bp ds-DNA             MAM       04-AUG-1990
DEFINITION  Pig involucrin gene, complete cds.
ACCESSION   M34441
KEYWORDS    involucrin.
SOURCE      Pig (Yorkshire) adult skin keratinocyte DNA.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1204)
  AUTHORS   Tseng,H. and Green,H.
  TITLE     The involucrin genes of pig and dog: Comparison of their segments
            of repeats with those of prosimians and higher primates
  JOURNAL   Mol. Biol. Evol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Tseng and H. Green, 23-MAY-1990.
FEATURES       from  to/span     description
    pept          1     1044     involucrin
    rpt         247      831     repeat segment
BASE COUNT      377 a    311 c    363 g    153 t
ORIGIN      
        1 atgtcccagc aatacactct tccagtgacc ctgccccctg ccctcagtca ggagcccctc
       61 aagcctgttt ctcctcccgc cgatacccag caggagcaag tgaaacaggc aactccgctg
      121 cctgctccat gccagaagat gctctcggag ctcccagtgg aggtcccctt ggaggatgcg
      181 gagaaacaca caactcttgt gaagggagtg cctgagcaag agtgtgagcc gcagccacag
      241 gagccacagc agcaggaact gcatgtggaa cagcagcagc agcagcaaga atcacaggtg
      301 caggaactgc atgtggatca gcagcagcag caacaagagt cacaggagca ggaactgcat
      361 gtggatcagc agcagcagca gcaagagtca caggagcagg aactgcatgt ggatcagcag
      421 cagcaacaag agtcacaggt gcaggaactg catgtgggtc atcatcagca gcagcaagag
      481 tcacaggagc aggaactgca tgtggatcat catcagcagc agcaagagtc acaggagcag
      541 gaactgcatg tggatcagca gcagcagcaa caagagtcac aggagcagga actgcatgtg
      601 gatcagcagc agcagcagca agagtcacag gagcaggaac tgcatgtgga tcatcatcag
      661 cagcagcaag agtcacaggt gcaggaactg catgtggatc atcagcagca gcagcaagag
      721 tcacaggagc aggaactgca tgtggatcag caccagcagc agcaagagtc acaggagcag
      781 gaactgcatg tggatcagca gcagcaggag ctccaggtgc aggaagtgca gcagcagcag
      841 cagcagcagc aagaacagca ggaggaccat cagaaagcag aacacctgga gcaggaggaa
      901 gcacaaaggg agcagcagct aaagggacag ctggaacagg agaagaaggg cgtgtaccag
      961 cacctggatc aagagctaac aaagagagat gaacacctgg aaaagaaagg ggagcactgc
     1021 tggagcagca ggagaagcct ctagagccag cagagcagca ggaggggcag ttgaaacagc
     1081 ctgtgcttat tccagctcct ggccaggtcc aggagaccca ccccagtcca actgctgaag
     1141 ggagaagttt tgccccctga gaagcaggac ataagtaacc acctctggac ataagtaacc
     1201 accc
//
LOCUS       FLANPB       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/Wilson-Smith/33 (H1N1) nucleoprotein (seg 5) mRNA,
            complete cds.
ACCESSION   M30746
KEYWORDS    nucleoprotein.
SOURCE      Influenza A /Wilson-Smith/33 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      507 a    318 c    409 g    331 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcacag agtgacatcg aaatcatggc gaccaaaggc
       61 accaaacgat cttacgaaca gatggagact gatggagaac gccagaatgc cactgaaatc
      121 agagcatctg tcggaaaaat gattggtgga attggacgat tctacatcca aatgtgcacc
      181 gaacttaaac tcagtgatta tgagggacgg ctgattcaga acagcttaac aatagagaga
      241 atggtgctct ctgcttttga cgagaggagg aataaatatc tagaagaaca tcccagtgcg
      301 gggaaagatc ctaagaaaac tggaggacct atatacagga gagtagatgg aaagtggatg
      361 agagaactca tcctttatga caaagaagaa ataagacgaa tctggcgcca agctaataat
      421 ggtgacgatg caacggctgg tctgactcac atgatgatct ggcactccaa tttgaatgat
      481 gcaacttacc agaggacaag agctcttgtt cgcacaggaa tggatcccag gatgtgctca
      541 ctgatgcagg gttcaaccct ccctaggagg tctggggccg caggtgctgc agtcaaagga
      601 gttggaacaa tggtgatgga attgatcaga atgatcaaac gtgggatcaa tgatcggaac
      661 ttctggaggg gtgagaatgg acggagaaca aggattgctt atgaaagaat gtgcaacatt
      721 ctcaaaggga aatttcaaac agctgcacaa agagcaatgg tggatcaagt gagagagagc
      781 cggaatccag gaaatgctga gttcgaagat ctcatctttc tagcacggtc tgcactcata
      841 ttgagagggt cagttgctca caagtcctgc ctgcctgcct gtgtgtatgg acctgccgta
      901 gccagtggat acgactttga aagagaggga tactctctag tcggaataga ccctttcaga
      961 ctgcttcaaa acagccaagt atacagccta atcagaccaa atgagaatcc agcacacaag
     1021 agtcaactgg tgtggatggc atgccattct gctgcatttg aagatctaag agtatcaagc
     1081 ttcatcagag ggacgaaagt ggtcccaaga gggaagcttt ccactagagg agttcaaatt
     1141 gcttccaatg aaaacatgga gactatggaa tcaagtaccc ttgaactgag aagcagatac
     1201 tgggccataa ggaccagaag tggagggaac accaatcaac agagggcttc ctcgggccaa
     1261 atcagcatac aacctacgtt ctcagtacag agaaatctcc cttttgacag accaaccatt
     1321 atggcagcat tcactgggaa tacagagggg agaacatctg acatgagaac cgaaatcata
     1381 aggctgatgg aaagtgcaag accagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcagc gagcccgatc gtgccctcct ttgacatgag taatgaagga
     1501 tcttatttct tcggagacaa tgcagaggag tacgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPC       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/swine/Iowa/15/30 (H1N1) nucleoprotein (seg 5) mRNA,
            complete cds.
ACCESSION   M30747
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/swine/Iowa/15/30 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1565; revises [1])
  AUTHORS   Gorman,O.T.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      509 a    318 c    411 g    327 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcg aaatcatggc gtctcaaggc
       61 accaaacgat catacgaaca aatggaaact ggtggagaac gccagaatgc cacggaaatc
      121 agagcatctg tcggaagaat gattggtgga attggaagat tctacatcca aatgtgcacc
      181 gaactcaaac tcagtgatta tgagggacgg ctgattcaga acagcataac aatagagaga
      241 atggtgctct ctgcttttga cgagaggagg aacaaatatc tggaagaaca tccaagcgct
      301 gggaaagacc ctaagaaaac tggaggaccc atatacagga gaatagacgg aaagtggata
      361 agagaactca tcctttatga caaagaagaa ataaggagaa tctggcgcca agccaacaat
      421 ggtgaggata caacagccgg tctgactcac atgatgatct ggcattccaa tttgaatgat
      481 gccacttatc agaggacaag agctcttgtt cgcaccggaa tggatcccag gatgtgctct
      541 ctgatgcagg gttcaactct ccccaggagg tctggagccg caggtgctgc agtcaaagga
      601 gttgggacag tggtgatgga gttgatcaga atgatcaaac gtggaatcaa tgatcggaac
      661 ttctggaggg gtgaaaatgg acgaaggaca agaattgcct atgaaagaat gtgcaacatt
      721 ctcaaaggga aatttcaaac agctgcacag agagcaatga tggatcaagt gagagagagc
      781 cgaaacccag gaaatgctga aatcgaagat ctcatctttc tggcacgatc cgcactcata
      841 ttacgaggat cagttgcaca caagtcctgt ctgcctgcct gtgtgtatgg acttgctgta
      901 gccagtggac atgactttga aagagagggg tactctctgg tcggaataga ccctttcaga
      961 ctgcttcaaa acagccaagt attcagccta atcagaccaa atgaaaaccc agcgcacaag
     1021 agtcaattag tgtggatggc atgccattct gctgcatttg aagatttaag ggtatcaagc
     1081 ttcataagag ggaaaagagt ggttccacga gggcagcttt ccaccagagg ggttcagatt
     1141 gcttccaatg agaacatgga gactatggac tctagtactc ttgaactgag aagcagatac
     1201 tgggccataa ggaccagaag tggaggaaac actaatcaac agagggcatc tgcgggccaa
     1261 atcagcgtgc aacctacatt ctcggtgcag agaaatctcc cttttgagag agcaaccgtt
     1321 atggcagcat tcactgggaa cacagaggga agaacatcag acatgagaac cgaaatcata
     1381 aggataatgg aaagtgcaag accagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gagcccgatc gtgccttcct ttgacatgag taacgaagga
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPD       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/swine/Tennessee/24/77 (H1N1) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30748
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/swine/Tennessee/24/77 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1565; revises [1])
  AUTHORS   Gorman,O.T.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      514 a    307 c    409 g    335 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcaatg agtgacatcg aaatcatggc gtctcaaggc
       61 accaaacgat catatgaaca aatggagact ggtggggaac gccaggatgc cacagaaatc
      121 agagcatctg tcggaagaat gattggtgga atcggaagat tctacatcca aatgtgcact
      181 gaactcaaac tcagtgacta tgagggacga ctgattcaaa atagcataac aatagagaga
      241 atggtgctct ctgcttttga cgagagaagg aataaatacc tagaagagca tcccagtgct
      301 gggaaagatc ctaagaaaac tggaggaccc atatatagaa gagtagacgg aaaatggatg
      361 agggaactca tcctttatga caaagaagaa ataaggagag tttggcgcca agccaacaat
      421 ggtgaagatg caacagccgg ccttacccat attatgattt ggcactccaa tctgaatgat
      481 gccacctatc agagaacaag agctcttgtt cgcactggga tggatcccag aatgtgctcc
      541 ctaatgcaag gttcaacact tcccagaagg tctggagccg caggtgctgc agtaaaagga
      601 gttggaacaa tagcgatgga gttaatcaga atgatcaaac gtgggatcaa tgaccgaaac
      661 ttctggaggg gtgaaaatgg acgaaggaca aggattgcat atgaaagaat gtgcaacatt
      721 ctcaaaggga aatttcagac agctgcccag agggcaatga tggatcaagt gagagaaagt
      781 cggaacccag gaaacgctga aattgaagat ctcattttcc tggcacggtc agcactcatt
      841 ttaagagggt cagttgcaca taagtcttgc ctgcctgctt gtgtgtatgg gcttgcagta
      901 gcgagtgggc atgactttga aagagaagga tattctctgg tcggaataga ccccttcaaa
      961 ctacttcaaa acagtcaagt attcagcctg atcagaccaa atgaaaaccc agctcacaag
     1021 agtcaactgg tgtggatggc atgccactct gccgcatttg aggatttaag agtatcaggc
     1081 ttcataagag ggaagaaagt ggttccaaga ggaaagcttt ccacaagagg ggttcagatt
     1141 gcttcaaatg agaatgtgga agctatggac tctagtaccc tggaactaag aagcaggtac
     1201 tgggccataa ggaccagaag cgggggaaat accaatcaac agaaggcatc cgcaggccag
     1261 atcagtgtgc aacctacatt ctcagtacaa aggaatctcc cttttgagag agcgaccgtt
     1321 atggcagctt tcattgggaa caatgaggga cgaacatcag atatgcgaac tgaaatcata
     1381 aggatgatgg aaagtgcaaa gccagaagat ttgtccttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgaa taatgagggg
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt gaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPE       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/swine/Netherlands/12/85 (H1N1) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30749
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/swine/Netherlands/12/85 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      503 a    307 c    419 g    336 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacattc acatcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaacc ggaggagaac gccagaatgc tactgaaatc
      121 agagcatctg ttgggggaat ggttggtgga attggaagat tctacataca gatgtgcact
      181 gaactcaaac tcagtgacta tgaagggagg ctgatccaga atagcataac aatagagaga
      241 atggttctct ctgcatttga tgagaggagg aacaaatacc tggaagaaca tcccagtgcg
      301 gggaaggacc caaagaaaac tggaggtcca atctacaaaa agagagacgg aaaatggatg
      361 agagagctga ttctgtatga caaagaggag atcaggagga tttggcgtca agcaaacaat
      421 ggtgaagatg ctactgctgg tctcactcat ctgatgattt ggcattccaa cctgaatgat
      481 gccacatatc agagaacaag agctctcgtg cgtactggga tggaccccag aatgtgctct
      541 ctgatgcaag gatcaactct cccgaggaga tctggagctg caggtgcagc agtaaaggga
      601 gttgggacga tggtaatgga actgattcgg atgataaagc gggggatcaa tgatcggaac
      661 ttctggagag gcgaaaatgg gcgaagaaca agaattgcat atgagagaat gtgcaacatc
      721 ctcaaaggga aatttcagac agcagcgcaa cgagcaatga tggaccaggt gcgagaaagc
      781 agaaatcctg ggaatgctga gattgaagac cttatctttc tggcacgatc tgcactcatt
      841 ctgagaggat cagtggctca taaatcctgc ctgcctgctt gtgtatatgg acttgttgtg
      901 gcaagtggat atgactttga aagagaaggg tactctctag tcggaataga tcctttccgt
      961 ctgctccaga acagccaggt gttcagcctc attagaccaa atgagaatcc agcacataag
     1021 agtcagctgg tatggatggc atgccattct gcagcgtttg aagatctgag agtgtcaagt
     1081 ttcatcagag ggacaaaagt ggtcccaaga ggacaactgt ccaccagagg agttcaaatt
     1141 gcttcaaatg aaaacatgga aacaatggac tccagtactc ttgaactgag aagcaaatac
     1201 tgggctataa gaaccaggag cggaggaaac accaaccaac agagagcatc tgcagggcaa
     1261 atcagtgtac aacctacttt ctcggtacag agaaatcttc ctttcgagaa agcgaccatc
     1321 atggcagctt ttacagggaa cactgaaggc agaacatctg acatgaggac tgaaattata
     1381 agaatgatgg aaagtgccag accagaagat gtgtccttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPF       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/equine/London/1416/73 (H7N7) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30750
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/equine/London/1416/73 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      497 a    322 c    418 g    328 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatca aactcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc
      121 agagcatctg ttggaaggat ggtgggagga atcggccggt tttatgttca aatgtgtact
      181 gagctcaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagagagg
      241 atggtgcttt cggcattcga cgaaagaaga aacaagtacc tggaggagca tcccagtgct
      301 ggaaaagacc ccaagaaaac gggaggcccg atatacagaa ggagagatgg gaaatggatg
      361 agagaactca tcctccatga taaagaagaa atcaggagga tctggcgtca ggccaacaat
      421 ggtgaagatg ctactgctgg tctgactcac atgatgatct ggcactccaa tctcaatgac
      481 accacctacc aaagaacaag ggctcttgtt cggactggga tggatcccag aatgtgctct
      541 ctgatgcaag gatcaactct cccacggaga tctggagctg ccggtgctgc agtgaagggt
      601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaac
      661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaacatc
      721 ctcaagggga aattccaaac agcagcacaa cgagcaatga tggaccaagt gagagagggc
      781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcgcgatc agcactcatt
      841 ctgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta
      901 gccagtgggt atgacttcga gaaagaggga tactctctgg ttggaattga tcctttcaaa
      961 ctactccaga acagccaaat tttcagtctg atcagaccga aagaaaatcc agcacacaag
     1021 agccagctgg tgtggatggc atgccattct gcagcatttg aggacctgag agtttcgaat
     1081 ttcattagag gaaccaaagt aatcccaaga ggacagttgg caaccagagg agtgcaaatt
     1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat
     1201 tgggcaataa ggaccaggag tggggggaac accagtcaac agagagcatc tgcaggacag
     1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt
     1321 atggctgcat tcactggaaa cactgagggg aggacttccg acatgagaac ggaaatcata
     1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg
     1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPG       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/equine/Kentucky/2/86 (H3N8) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30751
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/equine/Kentucky/2/86 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      502 a    318 c    411 g    334 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatca aagtcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc
      121 agagcatctg tcggaaggat ggtgggagga atcggccggt tctatgttca gatgtgtact
      181 gagcttaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagaaagg
      241 atggtacttt cggcattcga cgaaagaaga aacaagtacc tcgaggagca tcccagtgct
      301 gggaaagacc ccaagaaaac gggaggcccg atatacagaa ggaaagatgg gaaatggatg
      361 agagaactca tcctccatga taaagaagaa atcatgagga tctggcgtca ggccaacaat
      421 ggtgaagacg ctactgctgg tcttactcat atgatgatct ggcactccaa tctcaatgac
      481 accacatacc aaagaacaag ggctcttgtt cgggctggga tggatcccag aatgtgctct
      541 ctgatgcaag gatcaaccct cccacggaga tctggagctg ccggtgctgc agtaaaaggt
      601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaat
      661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaatatc
      721 ctcaaaggga aattccaaac agcagcacaa cgggcaatga tggaccaagt gagggagggc
      781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcacgatc agcactcatt
      841 ttgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta
      901 gccagtgggt atgactttga gaaggaagga tactctctgg ttggaattga tcctttcaaa
      961 ctactccaga acagccaaat tttcagtcta atcagaccga aagaaaatcc agcacacaag
     1021 agccagttgg tgtggatggc atgccattct gcagcatttg aggacctgag agttttgaat
     1081 ttcattagag gaaccaaagt aatcccaaga ggacagttag caaccagagg agtgcaaatt
     1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat
     1201 tgggcaataa ggaccaggag tggagggaac accagtcaac agagagcatc tgcaggacag
     1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt
     1321 atggctgcat tcactgggaa cactgagcgg aggacttccg acatgagaac ggaaatcata
     1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg
     1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPH       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/gull/Massachussetts/26/80 (H13N6) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30752
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/gull/Massachussetts/26/80 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      492 a    317 c    429 g    327 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt
      121 agggcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact
      181 gaactcaagc tcagtgacaa tgagggacga ttgatccaaa acagcatcac catagagaga
      241 atggtcctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc
      301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg
      361 agagaattag ttctgtatga caaagaagaa ctaagaagaa tctggcggca ggcaaacaat
      421 ggggaagatg cgactgctgg tctcacccac ttgatgatct ggcattccaa tttgaatgac
      481 gccacgtatc agagaactag agcacatgtg cgcacaggaa tggatcccag gatgtgctcc
      541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtgaaggga
      601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat
      661 ttctggagag gtgaaaacgg acgaagaaca agaattgcct atgaaagaat gtgcaacatc
      721 ctcaaaggga aatttcaaac agcagcacag cgggctatga tggatcaggt gcgggaaagc
      781 cggaatcctg gaaatgctga aatagaggac cttatatttc tggctcgatc tgcacttatc
      841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggcggtg
      901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt
      961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag
     1021 agtcaattgg tgtggatggc atgccattct gctgcatttg aggatttgag agtgtcaagt
     1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt
     1141 gcatccaatg agaacatgga aacaatgaat tccagcactc ttgaattgag aagcaaatac
     1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa
     1261 gtcagtgttc aacccacttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc
     1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata
     1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcagacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
     1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPI       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/gull/Astrakhan/227/84 (H13N6) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30753
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/gull/Astrakhan/227/84 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      489 a    323 c    426 g    327 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggtgaac gccagaatgc cactgagatt
      121 cgggcatctg tcggaaggat ggttggagga atcggaagat tctacataca gatgtgcact
      181 gaactcaaac taagtgacaa cgaggggagg ctgatccaga acagtatcac aatagaaaga
      241 atggtccttt ctgcgtttga tgagagaagg aacaggtact tggaggagca ccccagcgct
      301 gggagggacc ccaagaagac tggaggacca atttacagaa ggagagaggg aaaatgggtg
      361 agagaattgg tcctgtatga caaggaagaa ataagaagaa tctggcgaca agcaaacaat
      421 ggagaagatg caactgccgg ccttacccac ttgatgatct ggcattctaa cttgaatgac
      481 gccacatatc agaggactag agcccttgtg cgcactggga tggaccccag gatgtgctcc
      541 cttatgcaag gatcgacact tccaagaagg tctggagcgg ctggtgcagc tgtaaaggga
      601 gttgggacaa tggtgatgga actcatcaga atgataaaaa gaggggtcaa tgaccgtaac
      661 ttctggagag gtgaaaatgg acgaagaaca agaattgcct atgaaagaat gtgcaacatt
      721 ctcaaaggaa aattccaaac agctgcacaa cgagctatga tggatcaggt cagagaaagc
      781 cgtagtcctg gaaatgctga aatcgaggac ctcatctttc tggccaggtc tgcacttatt
      841 ctgaggggag cagtggctca caaatcatgc ctgcctgcct gtgtgcatgg acttgccgta
      901 gcaagtggat acgactttga gagggaggga tactcccttg ttgggataga ccctttccgt
      961 ctgctccaga acagccaggt attcagtcta atccgaccca atgaaaatcc agcacataag
     1021 agtcagttgg tgtggatggc atgccattcc gctgcatttg aggatttgag agtgtcaagc
     1081 ttcatcaggg gagcaagagt gttaccaaga gggcaactat ccacaagagg tgttcaaatt
     1141 gcatccaatg agaacatgga gactatgagt tccagcactc ttgaattgag aagcaaatac
     1201 tgggcaataa ggactagaag cggaggaaac accaaccaac aaagagcatc tgcaggacaa
     1261 atcagtgtgc aacccacttt ctctgtgcaa agaaacctcc cctttgagag agcgaccatc
     1321 atggctgcat tcacaggaaa tgcagaaggc agaacatctg atatgagaac tgagatcata
     1381 cggatgatgg aaaatgcaag accagaggat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taaagaggga
     1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPK       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/gull/Maryland/1824/78 (H13N9) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30755
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/gull/Maryland/1824/78 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      492 a    319 c    428 g    326 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt
      121 agagcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact
      181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga
      241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc
      301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg
      361 agagaattag ttctgtatga caaagaagaa ctaaggagaa tctggcggca ggcaaacaat
      421 ggggaagatg cgaccgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac
      481 gccacgtatc agagaactag agcactagtg cgcacaggga tggatcccag gatgtgctcc
      541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga
      601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat
      661 ttctggagag gtgaaaacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc
      721 ctcaaaggga aatttcaaac agcagcacaa cgggctatga tggatcaagt gcgggaaagc
      781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc
      841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtgaatgg actggctgtg
      901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt
      961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag
     1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt
     1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt
     1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattgag aagcaaatac
     1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa
     1261 gtcagtgttc aacccagttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc
     1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata
     1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc agggacgggg agtcttcgag
     1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga
     1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPL       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/gull/Maryland/1815/79 (H13N6) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30756
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/gull/Maryland/1815/79 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      490 a    319 c    429 g    327 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt
      121 agagcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact
      181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga
      241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc
      301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg
      361 agagaattag ttctgtatga caaagaagaa ctaaggagaa tctggcggca ggcaaacaat
      421 ggggaagatg cgaccgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac
      481 gccacgtatc agagaactag agcactagtg cgcacaggga tggatcccag gatgtgctcc
      541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga
      601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat
      661 ttctggagag gtgaaaacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc
      721 ctcaaaggga aatttcaaac agcagcacaa cgggctatga tggatcaggt gcgggaaagc
      781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc
      841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggctgtg
      901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt
      961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag
     1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt
     1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt
     1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattgag aagcaaatac
     1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa
     1261 gtcagtgttc aacccagttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc
     1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata
     1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga
     1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPN       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/turkey/Minnesota/833/80 (H4N2) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30769
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/turkey/Minnesota/833/80 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      494 a    311 c    429 g    331 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgaaatc
      121 agggcatctg ttgggagaat ggttggtgga atcgggagat tctacataca gatgtgcaca
      181 gaactcaaac tcagcgacta tgaaggaagg ctgatccaaa acagcataac aatagagaga
      241 atggttctct cggcatttga tgagaggaga aacaaatatc tggaagagca tcccagtgct
      301 ggaaaagacc ctaagaagac tggaggtcca atatacagga ggagagatgg gaaatgggtg
      361 agagaactga tcctgtatga caaagaggag atcagaagga tttggcgtca agcgaataat
      421 ggagaagacg caaccgctgg tctcacccat ttgatgatct ggcattccaa tctgaatgat
      481 gccacatatc agagaacaag ggcactggtg cgtactggga tggatcccag gatgtgctct
      541 ctgatgcaag gatcaactct tccgaggagg tctggggctg ctggagcagc agtgaaagga
      601 gttggaacaa tggtgatgga attgattcgg atgatcaagc gagggatcaa tgatcggaat
      661 ttctggagag gtgaaaatgg acggagaaca agaattgcat acgaaagaat gtgcaacatt
      721 ctcaagggga aattccaaac agcagcacaa cgagcaatga tggaccaggt gagggaaagc
      781 cggaatccag ggaatgctga aattgaagat ctaatctttc ttgcacggtc tgctctcatt
      841 ctgaggggat cagtggctca taagtcctgc ctgcctgctt gtgtgtatgg acttgctgtg
      901 gccagtggat acgactttga gagggaggga tactccctag tcggaattga tcctttccgt
      961 ctgctccaaa acagtcaagt cttcagtctt atcagaccaa acgaaaatcc agcacataaa
     1021 agtcagctgg tatggatggc atgtcattct gcagcatttg aagatctgag agtgtcaagc
     1081 ttcatcagag gaacaagagt ggtcccaaga ggacaactgt ccaccagagg agttcagatt
     1141 gcttcaaatg agaacatgga gacaatggac tccagtactc ttgaattgag gagcagatac
     1201 tgggctataa ggaccagaag cggagggaac actaaccagc agagagcatc tgcaggacaa
     1261 atcagcgtac agcccacatt ctctgtgcag agaaacctcc cattcgagag agcaaccatt
     1321 atggcggcat ttacgggaaa cactgaaggc agaacttcag acatgagaac tgagatcata
     1381 aggatgatgg aaaatgccag acctgaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaagga
     1501 tcttatttct tcggagacaa tgcagaggaa tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPO       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/gull/Minnesota/945/80 (H13N6) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30757
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/gull/Minnesota/945/80 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (sites)
  AUTHORS   Gorman,O.T.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990.
            
            The isolate mentioned in the published paper [1] is incorrect and
            should be "A/Gull/Minnesota/945/80".
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      490 a    321 c    426 g    328 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc
       61 accaaacgat cttatgagca aatggaaact ggtggcgagc gccagaatgc caatgacatc
      121 agagcatctg tcgggaggat ggttggggga attggaagat tctacataca gatgtgcact
      181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga
      241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc
      301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg
      361 agagaattag ttctgtatga caaagaagag ctaaggagaa tctggcggca ggccaacaat
      421 ggggaagatg cgactgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac
      481 gccacgtatc aaagaactag agcacatgtg cgcacaggga tggatcccag gatgtgctcc
      541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga
      601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat
      661 ttctggagag gtgaagacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc
      721 ctcaaaggga aatttcagac agcagcacaa cgggctatga tggatcaagt gcgggaaagc
      781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc
      841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggctgtg
      901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt
      961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag
     1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt
     1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt
     1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattaag aagcaaatac
     1201 tgggcaataa ggactagaag tggaggaaac accaaccagc aaagagcatc tgcgggacaa
     1261 gtcagtgttc aacccacttt ctctgtgcag agaaacctcc cctttgaaag ggcgaccatc
     1321 atggctgcat tcacaggaaa tccggagggc agaacatctg atatgagaac ggaaatcata
     1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc agggacgggg agtcttcgag
     1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga
     1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPP       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/equine/Tennessee/5/86 (H3N8) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30758
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/equine/Tennessee/5/86 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      502 a    318 c    411 g    334 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatca aagtcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc
      121 agagcatctg tcggaaggat ggtgggagga atcggccggt tctatgttca gatgtgtact
      181 gagcttaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagaaagg
      241 atggtacttt cggcattcga cgaaagaaga aacaagtacc tcgaggagca tcccagtgct
      301 gggaaagacc ccaagaaaac gggaggcccg atatacagaa ggaaagatgg gaaatggatg
      361 agagaactca tcctccatga taaagaagaa atcatgagga tctggcgtca ggccaacaat
      421 ggtgaagacg ctactgctgg tcttactcat atgatgatct ggcactccaa tctcaatgac
      481 accacatacc aaagaacaag ggctcttgtt cgggctggga tggatcccag aatgtgctct
      541 ctgatgcaag gatcaaccct cccacggaga tctggagctg ccggtgctgc agtaaaaggt
      601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaat
      661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaatatc
      721 ctcaaaggga aattccaaac agcagcacaa cgggcaatga tggaccaagt gagggagggc
      781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcacgatc agcactcatt
      841 ttgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta
      901 gccagtgggt atgactttga gaaggaagga tactctctgg ttggaattga tcctttcaaa
      961 ctactccaga acagccaaat tttcagtcta atcagaccga aagaaaatcc agcacacaag
     1021 agccagttgg tgtggatggc atgccattct gcagcatttg aggacctgag agttttgaat
     1081 ttcattagag gaaccaaagt aatcccaaga ggacagttag caaccagagg agtgcaaatt
     1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat
     1201 tgggcaataa ggaccaggag tggagggaac accagtcaac agagagcatc tgcaggacag
     1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt
     1321 atggctgcat tcactgggaa cactgagcgg aggacttccg acatgagaac ggaaatcata
     1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg
     1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPR       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/duck/New Zealand/31/76 (H4N6) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30760
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/duck/New Zealand/31/76 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      502 a    325 c    412 g    326 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
       61 accaaacgat cttatgaaca gatggagact ggtggagaac gccagaatgc tactgagatc
      121 agagcatctg ttgggagaat ggttggtgga atcggaaggt tctacataca gatgtgcact
      181 gaactcaaac tcagtgacta tgaaggaagg ctaatccaaa acagcataac aatagagaga
      241 atggttcttt ctgcatttga tgaaaggaga aacaagtact tggaggagca ccccagtgct
      301 ggaaaggatc ctaagaaaac tggaggtcca atctatagaa gaagagatgg aaaatgggtg
      361 agagaactga tcctgtacga caaagaggag atcaggagga tttggcgcca agcgaataat
      421 ggagaggacg caactgctgg tcttacccat ttgatgatct ggcattccaa tcttaatgat
      481 gcaacatacc agagaacaag agcccttgtg cgtactggga tggaccccag gatgtgctct
      541 ctgatgcaag gctcaactct cccgaggagg tctggagctg caggtgctgc agtaaaagga
      601 gttggaacaa tggtgatgga gctgattcga atgataaaac gaggaatcaa tgaccgtaac
      661 ttctggagag gcgagaatgg acgaaggaca agaattgctt acgagaggat gtgcaacatt
      721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggatcaggt gcgggagagc
      781 cgaaatcctg gaaatgctga aattgaagac ctcatcttct tggcacgatc tgcactcatt
      841 ctgaggggat cagtggctca caaatcctgc ctacctgctt gtgtatatgg attagctgta
      901 gccagtggat atgactttga aagagaagga tactccctgg tcggaataga ccccttccgt
      961 ctgctccaga acagccaggt attcagcctc atcagaccca atgaaaatcc agcacacaag
     1021 agtcaactgg tgtggatggc atgccactct gcagcatttg aagacctgag agtttcaagc
     1081 ttcatcaggg gaacaagagt gatcccgaga gggcaactgt ccactagagg ggtccaaatt
     1141 gcttcaaatg aaaacatgga aacaatggac tctagcactc tcgaattgag aagcagatac
     1201 tgggccataa ggaccagaag tggaggaaac accaaccaac agcgggcatc tgcgggacaa
     1261 atcagcgtgc aacccacttt ctctgtgcaa agaaatcttc ctttcgaaag agcaaccatc
     1321 atggcagcat ttacagggaa cactgaaggc agaacatctg acatgagaac tgagatcata
     1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcagacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPS       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/grey teal/Australia/2/79 (H4N4) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30761
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/grey teal/Australia/2/79 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      513 a    327 c    401 g    324 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc
       61 accaaacgat cttatgaaca aatggaaact ggtggagaac gccagaatgc tactgaaatc
      121 agatcttctg ttggaagaat ggttggtgga attggaagat tctacataca gatgtgcact
      181 gaattcaaac tcagcgacta tgaaggaagg ctgatccaaa acagcataac aatagagaga
      241 atggtccttt ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagtgct
      301 ggaaaagatc ctaagaagac tggaggtcca atctacagaa ggagagatgg aaagtggatt
      361 agagagctga tcctatgcga caaagaggag atcaggagaa tctggcgcca agcgaataat
      421 ggggaggatg caactgctgg ccttacccat ttgatgatat ggcattccaa cctcaatgat
      481 gccacttacc agagaacaag agcccttgtg cgtactggga tggatcctag aatgtgctct
      541 ctgatgcaag gctcaactct cccgagaaga tctggagctg ctggcgctgc agtaaaagga
      601 gttgggacaa tggtgatgga actgattcgg atgataaaac ggggaatcaa tgaccgaaat
      661 ttctggagag gcgaaaatgg gcgaagaaca agaattgcct atgagagaat gtgcaacatt
      721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggaccaagt gcgggaaagc
      781 cgaaatcctg ggaatgctga aattgaagac cttatatttc tggcacgatc tgcactcatc
      841 ctaagaggat cagtagccca caaatcctgc ctacctgctt gtgtgtatgg attggccgta
      901 gccagtggat atgactttga gaaagaagga tactccctag tcggaataga tcctttccgc
      961 atgctccaga acagccaggt attcagtctc atcaggccca atgaaaattc agcacacaag
     1021 agtcaattgg tatggatggc atgccactct gcagcatttg aagacctgag agtatcaagc
     1081 ttcatcagag gaacaaaagt ggtcccaaga gggaaactct ccaccagagg agtccaaatc
     1141 gcttcaaatg agaacatgga aacaatggac tctacaactc tcgaattgag aagcaggtac
     1201 tgggctataa ggaccagaag cggaggaaac acaaatcagc agcgggcgtc agctggacaa
     1261 atcagcgtac aacccacttt ctctgtgcag aggaatctcc cattcgagag agcaaccatt
     1321 atggcagcat ttaccgggaa cactgaaggc agaacatccg acatgagaac tgagatcata
     1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aagggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPT       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/duck/Czechoslovakia/56 (H4N6) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30762
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/duck/Czechoslovakia/56 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      489 a    315 c    432 g    329 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
       61 accaaacgat cttatgaaca gatggaaact ggtggagagc gccagaatgc cactgagatc
      121 agagcatctg ttgggagaat ggtcggtgga attgggaggt tttacataca gatgtgtact
      181 gaactcaagc tcagtgacta tgaagggaga ctgatccaga acagcataac aatagagaga
      241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgct
      301 gggaaggacc ctaagaaaac tggaggtcca atctatagaa ggagagacgg aaaatgggtg
      361 agagaattga tcctgtacga caaagaggag atcaggagga tttggcgcca agcaaacaat
      421 ggagaagacg caactgctgg tctcactcac ttgatgatct ggcattccaa tctgaatgat
      481 gccacatatc agagaacaag agctcttgta cgtactggga tggacccaag aatgtgctct
      541 ctgatgcaag gatcaactct cccgaggagg tctggagctg ctggtgcagc agtgaaggga
      601 gttgggacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat
      661 ttctggagag gcgaaaatgg acgaagaaca aggattgcat atgagaggat gtgcaatatc
      721 ctcaaaggga agttccaaac agcagcacaa agagcaatga tggaccaggt gcgggagagc
      781 cggaatcctg ggaatgctga aattgaagac cttatcttcc tggcacggtc tgcactcatt
      841 ctgagaggat cagtggctca taagtcctgt ctgcctgctt gtgtgtatgg acttgctgtg
      901 gccagtggat acgactttga gagagagggg tactctctgg tcggaataga tcctttccgt
      961 ctgctccaaa acagccaagt attcagcctc attagaccaa atgagaatcc agcacataag
     1021 agccaactgg tatggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc
     1081 ttcatcagag ggacaagagt ggtcccaaga gggacaactg tcaccagagg atttcaaatt
     1141 gcttcaaatg agaacatgga aacaatggac tccagcactc ttgaactgag gagcagatat
     1201 tgggctataa ggaccaagag cggaggaaac accaaccaac agagagcatc tgcaggacaa
     1261 atcagtgtac aacccacttt ctcggtacag agaaatctcc ctttcgagag agcgaccatc
     1321 atggcggcat ttacagggaa cactgaaggc agaacgtctg acatgaggac tgagatcata
     1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPU       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/duck/Ukraine/2/60 (H11N8) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30763
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/duck/Ukraine/2/60 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      496 a    325 c    423 g    321 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggagatc gccagaatgc cactgagatc
      121 agagcatctg ttgggaggat ggtcagcgga attggaagat tctacataca gatgtgcact
      181 gaactcaaac tcagcgacta tgaagggagg ctgatccaga acagcataac aatagaaaga
      241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgca
      301 gggaaggatc ctaagaaaac cggaggtcca atctatagaa ggagagacgg aaaatggatg
      361 agagagctga tcctgtatga caaagaggag atcaggagaa tttggcgcca agcgaacaat
      421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcattccaa tctgaatgat
      481 gccacatatc aaagaacaag agctctcgtg cgcactggaa tggaccccag aatgtgctcc
      541 ctgatgcaag gatcaactct cccgaggaga tctggagctg ctggcgcagc agtaaaggga
      601 gttgggacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat
      661 ttctggagag gcgaaaatgg acgaagaaca aggattgcat atgagagaat gtgcaacatc
      721 cttaaaggga aattccaaac agcagcacaa cgagcaatga tggaccaagt gcgggaaagc
      781 cggaatcctg ggaatgctga aattgaagac cttatcttcc tggcacgatc tgcactcatt
      841 ctgagaggat cagtggctca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg
      901 gccagtggat acgacttcga gagagaggga tactctctgg tcggaataga tcctttccgt
      961 ctgctccaaa acagccaggt gttcagtctc attagaccaa atgagaatcc tgcacataag
     1021 agccaactgg tgtggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc
     1081 ttcatcagag ggacaagagt ggtcccaaga gggcaactgt ccaccagagg agttcaaatt
     1141 gcttcaaatg aaaacatgga aacaatggac tccagcactc ttgaactgag gagcagatac
     1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc cgcaggacaa
     1261 ataagtgtac aacccacttt ctcggtacag agaaatcttc ctttcgagag agcgaccatc
     1321 atggcggcat ttacagggaa cactgaaggc agaacgtctg acatgaggac tgagatcata
     1381 agaatgatgg agagtgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPV       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/mallard/Astrakhan/244/82 (H?N6) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30764
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/mallard/Astrakhan/244/82 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      500 a    317 c    421 g    327 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatca acatcatggc gtctcaaggc
       61 accaaacgat cttatgaaca gatggaaact ggtggagaac gccagaatgc tactgagatc
      121 agagcatctg ttggaagaat ggttggtgga attgggaggt tttacataca gatgtgcact
      181 gaactcaaac tcagcgacta tgaagggagg ctgatccaga acagcataac aatagagaga
      241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagaaca tcccagtgcg
      301 gggaaggacc cgaagaaaac tggaggtcca atctaccgaa ggagagacgg gaaatggatg
      361 agagagctga ttctgtatga caaagaggag atcaggagga tttggcgtca agcgaacaat
      421 ggagaagacg caactgctgg tctcactcac ctgatgatct ggcattccaa cctaaatgat
      481 gccacatacc agagaacaag agctctcgtg cgtactggga tggaccccag aatgtgctct
      541 ctgatgcaag gatcaactct cccgaggaga tctggagctg ctggtgcagc agtaaaggga
      601 gtcgggacga tggtgatgga actaattcgg atgataaagc gagggattaa cgatcggaat
      661 ttctggagag gcgaaaacgg acgaagaaca aggattgcat atgagagaat gtgcaacatc
      721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggatcaggt gcgagaaagc
      781 agaaatcctg ggaatgctga aattgaagat ctcatctttc tggcacggtc tgcactcatc
      841 ctgagaggat cagtggccca taagtcctgc ttgcctgctt gtgtgtacgg acttgctgtg
      901 gccagtggat atgactttga gagagaaggg tactctctgg tcgggataga tcctttccgt
      961 ctgcttcaaa acagccaggt cttcagtctc atcagaccaa atgagaatcc agcacataag
     1021 agtcaactgg tatggatggc atgccattct gcagcatttg aggacctgag agtctcaagt
     1081 ttcatcagag ggacaagagt ggtcccaaga ggacaactat ccaccagagg agttcaaatt
     1141 gcttcaaatg agaacatgga aacaatggac tccagtactc ttgaactgag aagcagatat
     1201 tgggctataa gaaccaggag tggaggaaac accaaccaac agagagcatc tgcaggacaa
     1261 atcagtgtac aacccacttt ctcggtacag agaaatcttc ctttcgaaag agcgaccatc
     1321 atggcggcat ttacagggaa cactgaaggc agaacatctg acatgaggac tgaaatcata
     1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaagga
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPW       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/budgerigar/Hokkaido/1/77 (H4N6) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30765
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/budgerigar/Hokkaido/1/77 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      500 a    322 c    428 g    315 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcacag agtgacatcc acgacatggc gtctcaaggc
       61 accaaacgat cctatgagca gatggaaact ggtggagaac gccagaatgc tactgagatc
      121 agagcatctg ttgggagaat ggtcggtgga attggaagat tctacataca gatgtgcact
      181 gaaatcaaac tcagtgacta tgaagggagg ctgattcaga acagcataac aatagagaga
      241 atggtcctct ctgcatttga ggaaaggagg aacaaatacc tggaagagca ccccagtgcg
      301 ggaaaggacc cgaagaaaac tggaggtcca atctacagaa ggagagatgg aaaatgggtg
      361 agagaactga tcctgtatga caaagaggag ataaggagga tctggcgcca agcgaacaat
      421 ggagaagacg caactgcagg tctcactcac ctgatgatct ggcattccaa cctgaatgat
      481 gccacatatc agagaacaag agccctcgtg cgtactggga tggaccccag aatgtgctct
      541 ctgatgcaag gatcaactct cccgagaaga tctggggcag ctggtgcagc aataaaggga
      601 gttgggacga tggtgatgga gctgattcgg atgataaaac ggggaatcaa tgatcggaat
      661 ttctggagag gcgaaaatgg gcgaagaaca aggactgcat acgagagaat gtgcaacatc
      721 ctaaaaggaa aattccaaac agcagcacaa cgagcaatga tggatcaggt gcgagaaagc
      781 aggaatcctg gaaatgctga gattgaagac cttatatttc tggcacggtc tgcactcatt
      841 ctgagaggat cagtagctca taagtcctgc ctgcctgctt gtgtatatgg gcttgcagtg
      901 gccagtgggt acgactttga gagagaggga tactctctgg tcggaataga tcctttccgc
      961 ctgcttcaaa acagtcaggt gttcagcctc attagaccaa atgagaatcc agcacataaa
     1021 agccaactag tctggatggc ttgccattct gcagcatttg aagacctgag agtctcaagc
     1081 ttcatcagag ggacaagact agtcccaaga ggacaactgt ccactagagg agttcaaatt
     1141 gcttcaaatg agaacatgga agcaatggac tccaatactc ttgagctgag gagcagatac
     1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc tgcaggacaa
     1261 atcagcgtgc aacctacttt ctcggtacag aggaatctcc ctttcgagag agctaccatc
     1321 atggcggcat ttacagggaa caatgaagga aggacttctg acatgaggac tgagatcata
     1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaggga
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPX       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/ruddy turnstone/New Jersey/47/85 (H4N6) nucleoprotein
            (seg 5) mRNA, complete cds.
ACCESSION   M30766
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/ruddy turnstone/New Jersey/47/85 RNA, and cDNA to viral
            RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      491 a    317 c    423 g    334 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgaaatc
      121 agagcatctg ttgggagaat ggttggtgga atcggaagat tctacataca gatgtgcact
      181 gaactcaaac tcagtgacta tgaagggaga ctgatccaaa acagcataac aatagagaga
      241 atggttctct cggcatttga cgagaggaga aacaaatatc tggaagagca ccccagtgct
      301 gggaaagacc ctaaaaagac tggaggtcca atctacagga ggagagatgg gaaatggatg
      361 agagaactca tcctatatga caaagaggag atcagaagga tttggcgtca agcgaataat
      421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcattccaa tctgaatgat
      481 gccacatacc agagaacaag ggcacttgtg cgtactggga tggatcccag gatgtgctct
      541 ctgatgcaag gctcaactct tccgaggagg tctggggctg ctggagcagc agtgaaaggg
      601 gttggaacaa tggtgttgga attgatccgg atgatcaagc gagggatcaa tgatcggaat
      661 ttctggagag gcgaaaatgg aaggaggaca aggattgctt atgaaagaat gtgcaacatt
      721 ctcaaaggga aattccaaac agcagcacaa cgagcaatga tggatcaggt gagggaaagc
      781 cggaatcccg ggaatgctga aattgaagat ctcatctttc ttgcacggtc tgctctcatt
      841 ctgagaggat cagtggccca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg
      901 gccagtggat acgactttga aagggaggga tactccctag tcggaattga tcctttccgt
      961 ttgcttcaaa acagtcaagt cttcagtctt atcagaccaa acgaaaatcc agcacataaa
     1021 agtcagctgg tatggatggc atgccactct gcagcatttg aagatctgag agtgtcaagc
     1081 ttcattagag gaacaagagt agtcccaaga ggacaactgt ccaccagagg agttcagatt
     1141 gcttcaaatg agaacatgga gacaatggac tccagcactc ttgaattgag gagcagatac
     1201 tgggctataa ggaccagaag cggagggaac actaaccagc agagagcatc tgcagggcaa
     1261 atcagcgtac agcccacatt ctctgtgcag aggaacctcc cattcgagag agcaaccatt
     1321 atggcggcat ttacaggaaa cactgaaggc agaacttcag acatgagaac tgagatcata
     1381 aggatgatgg aaaatgccag tcctgaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga
     1501 tcttatttct tcggagacag tgcagaggaa tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPY       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/tern/South Africa/61 (H5N3) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30767
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/tern/South Africa/61 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      493 a    322 c    428 g    322 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggagagc gccagaatgc cactgagatc
      121 agagcatctg ttgggagaat ggtcggtgga atcgggagat tctacataca gatgtgcact
      181 gaactcaaac tcagtgacta tgaagggagg ctgatccaga acagcataac aatagagaga
      241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgcg
      301 gggaaggacc cgaagaaaac tggaggtcca atctatagaa ggagagacgg aaaatggatg
      361 agagaactga tcctgtatga caaagaggag atcaggagga tttggcgcca agcaaacaat
      421 ggagaagacg caactgctgg tctcactcat ctgatgatct ggcattccaa cttgaatgat
      481 gccacatatc agagaacaag agctcttgtg cgtactggga tggaccccag aatgtgctct
      541 ctgatgcaag gatcaactct cccgaggaga tctggagccg ctggtgcagc agtgaaggga
      601 gttggaacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat
      661 ttttggagag gcgaaaatgg acgaagaaca aggattgcat atgagagaat gtgcaacatc
      721 ctcaaaggga aattccaaac agcagcacaa cgagcaatga tggaccaggt gcgagaaagc
      781 cggaatcctg ggaatgctga aattgaagac cttatctttc tggcacggtc tgcactcatt
      841 ctgagagggt cagtggctca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg
      901 gccagtggat acgactttga gagagaggga tactctctgg tcggaataga ccctttccgt
      961 ctgctccaaa acagccaggt attcagcctc attagaccaa aagagaatcc agcacataag
     1021 agccaattgg tatggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc
     1081 ttcatcagag ggacaagagt ggtcccaaga ggacaactgt ccaccagagg atttcaaatt
     1141 gcttcgaatg agaacatgga ggcaatggac tccagtactc ttgaactgag gagcagatac
     1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc tgcaggacaa
     1261 atcagcgtac aacccacttt ctcagtacag agaaatcttc ctttcgagag agcgaccatc
     1321 atggcggcat ttacagggaa cactgaaggc agaacatctg acatgaggac tgagatcata
     1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       FLANPZ       1565 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Influenza A/chicken/Pennsylvania/1/83 (H5N2) nucleoprotein (seg 5)
            mRNA, complete cds.
ACCESSION   M30768
KEYWORDS    nucleoprotein.
SOURCE      Influenza A/chicken/Pennsylvania/1/83 RNA, and cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1565)
  AUTHORS   Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G.
  TITLE     Evolution of the nucleoprotein gene of influenza A virus
  JOURNAL   J. Virol. 64, 1487-1497 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by O.T.Gorman, 14-DEC-1989.
FEATURES       from  to/span     description
    pept         46     1542     nucleoprotein
BASE COUNT      492 a    313 c    427 g    333 t
ORIGIN      
        1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc
       61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgagatc
      121 agagcatctg ttgggagaat ggttggtgga atcggaaggt tctacataca gatgtgcact
      181 gaactcaaac tcagcgacca tgaagggaga ctgatccaga acagcataac aatagagaga
      241 atggttctct cggcatttga tgagagaaga aacagatatc tggaagagca tcccagtgct
      301 gggaaagatc ctaagaagac tgggggtcca atctacagaa ggagagatgg gaaatgggtg
      361 agagaattga tcctctatga caaagaggag atcagaagaa tttggcgtca agcaaataat
      421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcactccaa cctgaatgat
      481 gccacatatc agagaacaag ggcacttgtg cgtaccggga tggaccccag gatgtgctct
      541 ctgatgcaag gctcaacact tccaaggagg tctggtgctg ctggagcagc cgtgaaaggg
      601 attggaacaa tggtaatgga attgatacgg atgatcaaac gagggatcaa tgataggaat
      661 ttctggagag gcgaaaatgg gcggagaaca agaattgcat acgagagaat gtgtaacatt
      721 ctcaagggga aatttcaaac agctatacaa cgggcaatga tggaccaagt gagggaaagc
      781 cggaatcctg ggaatgctga aattgaagat ctcatctttc ttgcgcggtc tgctctcatt
      841 ctgaggggat cagtggcaca taagtcctgc ctgcctgctt gtgtgtatgg acttgctgtg
      901 gccagtggat acgattttga gagggaggga tactccctag tcgggattga tcctttccgt
      961 ctgctccaaa acagtcaagt tttcagtctt ataagaccaa acgaaaatcc agcacataaa
     1021 agtcagctgg tatggatggc atgtcactct gcagcatttg aggatctgag agtatcaagc
     1081 ttcattagag gaacaagagt ggtcccaaga gggcaacttt ccaccagagg agttcagatt
     1141 gcttcaaatg agaacatgga tgcaatagac tccagtactc ttgaattgag gagcagatac
     1201 tgggccataa gaaccagaag cggaggaaac acaaatcagc agagagcatc tgcaggacaa
     1261 atcagcgtac agcccacatt ctcagtgcag agaaacctcc cattcgagag agcaaccatc
     1321 atggcggcat tcacaggaaa cacagaaggc agaacttcag acatgagaac tgagattata
     1381 aggatgatgg agaatgccag acctgaagat gtgtctttcc aggggcgggg agtcttcgag
     1441 ctctcggacg aaaaggcgac gaacccgatc gtgccttcct ttgacatgag taacgagggg
     1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt
     1561 ctact
//
LOCUS       HUMCEAB      2690 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human carcinoembryonic antigen (CEA) hsCGM1 gene, exons 1 and 2.
ACCESSION   M22433 D51537
KEYWORDS    carcinoembryonic antigen.
SOURCE      Human fetal liver (lambda-hsCHM1-1 library) DNA, clone hsCHM1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2690)
  AUTHORS   Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
            Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
            Zimmermann,W.
  TITLE     Analysis of the size of the carcinoembryonic antigen (CEA) gene
            family: Isolation and sequencing of N-terminal domain exons
  JOURNAL   Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            J.A. Thompson 07-FEB-1989.
FEATURES       from  to/span     description
    pept       1250     1313     carcinoembryonic antigen hsCHM1, exon 1
                                 /nomgen="CEA" /map="19q13.1-q13.3"
                                 /hgml_locus_uid="LN0164S"
               2158  /  2517     carcinoembryonic antigen hsCHM1, exon 2
    IVS        1314     2157     carcinoembryonic antigen hsCHM1, intron A
    IVS        2518  >  2690     carcinoembryonic antigen hsCHM1, intron B
BASE COUNT      823 a    637 c    653 g    577 t
ORIGIN      Chromosome 19q31.1-q31.3.
        1 gaattccaca gcaataacca cgatgacaac caccatgtac tcaacacccg cctgggcacg
       61 gggctcccac agcagctcac ttattcccaa caactctgca aggaggattt taccatcctc
      121 cttttacaaa tcagggaatc aaggatcata gaagccacgt gcacttgtcc aagtcaacat
      181 agttaagtga cagaaccatt agctgtcccc aggtacatct ggacataaag ttcatgttta
      241 tgccactgtg tcagcatttc caaaaactga ttttaggcga aacgtaagta agctttttaa
      301 aaactttaat acttatgcgt ttattttaat atacattgag aaaacattta agcacacatc
      361 aaatctgtaa tttcatggac aatattgcat aagacaagga tgttttgtct ccaactcctg
      421 gcctcaagcc atcctcccac catagctctc gagtagctgg gatcacagac tgagccaccg
      481 atccctgcta ggacaggatg ttttgtaaac taaatttatt tagaaaaaag gatgaaatat
      541 ataataataa aggtggtaca agctagagag aaaatcataa agtcagccta gaaatgtctg
      601 gtgtctggat gacataaagc tacagcactg tgaagcctca ttctcagtta ctcccaggaa
      661 attagagtca cataatgctg cagaaagaac agctcagaat cttagatccg gctttagccc
      721 tagatatatc catttgtagg accccagaca tctctgtgac ctccttgctg ggagtaaatc
      781 caaccttccc agacatgtga gaacagtaag aagaccctgc acacacaaag gagtttctcc
      841 gtcacagaga aaataacacc aggttcaggg accccaggga ctctgcatgg tgctgacaga
      901 cccaaggcca aggcagagca gaggtccacg ctggggaggg agggtcatcc tgttatgaaa
      961 cagggatcca agtaagcctt gcttctcaga gcctggtctg ggcaactcaa atgtagacag
     1021 aaggccccaa ggaagaagag aaaatgaggc aaaactgaga ggggagggga cagagaggtg
     1081 acctgggcag agcttcaccc atgaccctgg aaagtgctcc tgccctggga ggaggctcag
     1141 catggaaaga ggaaggacag cagagcctaa gtcacagtag ccctgactac agcattcctg
     1201 gagcccaggc tcttttccac agaggaggaa agagcaggca gcagagacca tggggccccc
     1261 ctcagcctct ccccacagag aatgcatccc ctggcagggg cttctgctca caggtgagtg
     1321 gaggattcct gggagtgggc aagaggaggg atcacagaga atggctgggg tctcctgggg
     1381 aggatggggc tctgataggg gacagaaggc ttctgctgaa gcctcagggg agagaacatc
     1441 agagagggac acgggtcaca acaagacaat cacattgaac tgggattgat aagagggagg
     1501 aaaatccatt gatcatgttt tccaagttaa tcattactgg ccactacaat tagaaaatga
     1561 taagaataag aattacatca gggtgatact ttaaataaaa atataaccag ggcactaaaa
     1621 cctgtctttg ccccaaccac aagttgcaaa ataaccacca ctccttaact catccaccag
     1681 tatttgcaat caaattttag gcactggcgt acaacaaata tcagacaagt ctctgtgttc
     1741 aaagagctta cactcttgca gagatgaaga tagacaccca aagagatcta gaatgtgagt
     1801 tcaggtgttg acaagagccc tggagggaac agagcagaaa aaggtcagaa agggacgccc
     1861 cagggtctct agaggaggtg tcaggggagg gatctcccaa ggatgccctg atgtgagcag
     1921 gatctgaggg cagtggggag ggagccatgc agacccctgg ggaaggggat tccacacagg
     1981 aaaatgccaa ggttagaggt gctgaagaaa gaaaggtcac gttactgacc ttaaccaagt
     2041 gggacacacc tacactctca aggctgaagg gagaagagac tctctcagga cccagggccc
     2101 catctttcca tcccaataca tgggtaccaa tattgactga tgctttctcc ctcctagcct
     2161 cacttctaaa cttctggaac ccgcccacca ctgccaagct cactattgaa tccacgccgt
     2221 tcaatgtcgc agaggggaag gaggtgcttc tacttgtcca caatctgccc caacatcttt
     2281 ttggctacag ctggtacaaa ggggaaagag tggatggcaa cagtctaatt gtaggatatg
     2341 taataggaac tcaacaagct accccagggg ccgcatacag cggtcgagag acaatataca
     2401 ccaatgcatc cctgctgatc cacaatgtca cccagaatga cataggattc tacaccctac
     2461 aagtcataaa gtcagatctt gtgaatgaag aagcaactgg acagttccat gtataccgtg
     2521 agtatttcca catgacctct ggaggttggg ggtcagttct acttcccaca tatgggattg
     2581 tacggcctgg gctgtgcctc tggccctctc tgcattacat tctgtatcag ggtttggaca
     2641 tttagtgcag gacacacacg ggggagacaa acttccacag actagaattc
//
LOCUS       HUMCEAC       781 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human carcinoembryonic antigen (CEA) hsCGM2 gene, exon 2.
ACCESSION   M22434 D51537
KEYWORDS    carcinoembryonic antigen.
SOURCE      Human fetal liver (lambda-hsCGM2-1 library) DNA, clone hsCGM2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 781)
  AUTHORS   Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
            Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
            Zimmermann,W.
  TITLE     Analysis of the size of the carcinoembryonic antigen (CEA) gene
            family: Isolation and sequencing of N-terminal domain exons
  JOURNAL   Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            J.A. Thompson 07-FEB-1989.
FEATURES       from  to/span     description
    pept    /   221  /   583     carcinoembryonic antigen hsCGM2, exon 2 (AA at
                                 223) /nomgen="CEA" /map="19q13.1-q13.3"
                                 /hgml_locus_uid="LN0164S"
    IVS     <     1      220     carcinoembryonic antigen, intron A
    IVS         584  >   781     carcinoembryonic antigen, intron B
BASE COUNT      220 a    195 c    184 g    182 t
ORIGIN      Chromosome 19q31.1-q31.3.
        1 ctgcagatcc ctggggaaga ggattccgaa cagggaaatg taaggtcaga ggtgctgata
       61 ggggacatgc tgctgtcatt gatccagtag gacacacaca cacacacaca cttacttcaa
      121 gatgggggtg ggtgaagaga cctgctcagg atccagggcc ccatctttcc accccaatac
      181 ataggtccca atattgactg atgttctctc cccctcctag cctcgctttt aaccttctgg
      241 aacctgccaa acagtgccca gaccaatatt gatgtcgtgc cgttcaatgt cgcagaaggg
      301 aaggaggtcc ttctagtagt ccataatgag tcccagaatc tttatggcta caactggtac
      361 aaaggggaaa gggtgcatgc caactatcga attataggat atgtaaaaaa tataagtcaa
      421 gaaaatgccc cagggcccgc acacaacggt cgagagacaa tataccccaa tggaaccctg
      481 ctgatccaga acgtcaccca caatgacgca ggattctata ccctacacgt tataaaagaa
      541 aatcttgtga atgaagaagt aaccagacaa ttctacgtat tctgtgagtg atacctccat
      601 gacttctggg tgctgggggc cagttctact tcatacacac ggggttgtca ggcctgggtt
      661 gtgcctgtgt ccccatctac attttatcca gtgttggagt ttgggcattt agtgaaggac
      721 acacatgggg gagacaaact tctacagacc agaatccctt tcctgcatcc agaccctgca
      781 g
//
LOCUS       HUMPSBGA     1042 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human pregnancy-specific B-1-glycoprotein gene, exon 2.
ACCESSION   M22437 D51537
KEYWORDS    glycoprotein; pregnancy-specific B-1 glycoprotein.
SOURCE      Human fetal liver (lambda-PSBG-1 library) DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1042)
  AUTHORS   Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
            Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
            Zimmermann,W.
  TITLE     Analysis of the size of the carcinoembryonic antigen (CEA) gene
            family: Isolation and sequencing of N-terminal domain exons
  JOURNAL   Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            J.A. Thompson 07-FEB-1989.
FEATURES       from  to/span     description
    pept    /   497  /   862     pregnancy-specific B-1 glycoprotein, exon 2
                                 (AA at 499) /nomgen="B1G1" /map="6"
                                 /hgml_locus_uid="LG0073P"
    IVS     <     1      496     pregnancy-specific B-1 glycoprotein, intron A
    IVS         863  >  1042     pregnancy-specific B-1 glycoprotein, intron B
BASE COUNT      289 a    241 c    274 g    238 t
ORIGIN      Chromosome 6.
        1 gagctcacgc tgtcatgggg gggaagacag acatgcaaag agatctagaa tgtgaggtca
       61 ggtgttgaca agaaccctgg agggagcaga gcagggaaag gtcagaaagg gaagacccag
      121 ggtctctgaa gcaggcatca ggaaagaagt ctaacgatgc cctgatgtaa gcaggacctg
      181 agggcattgt ggagggggcc atgcggaccc ctggggaaga ggattccaaa cagaaaaatg
      241 ccaacgtcag aagtgttgaa ggaatgggcg tcatgctgct gaccttgacc tagtaggaca
      301 gtaggacaca cacacataca cacacacaat cacacatgcc gcttttgtgt gtgtgtgtgt
      361 gtgtgtgttt gtatgtgtgt gtgtgtctgt gttgtcaagg ctgaggactg aagagacctt
      421 ctcaggaccc agggccccat gttttcacac caatacatag gtctcaatat tgactgatgt
      481 tctctccacc tcctagcatc acttttaaac ttctggaacc tgcccaccac tgcccaagtc
      541 acgattgaag ccgagccaac caaagtttcc gaggggaagg atgttcttct acttgtccac
      601 aatttgcccc agaatcttac cggctacatc tggtacaaag ggcaaatgag ggacctctac
      661 cattacatta catcatatgt agtagacggt gaaataatta tatatgggcc tgcatatagt
      721 ggacgagaaa cagcatattc caatgcatcc ctgctgatcc agaatgtcac ccgggaggac
      781 gcaggatcct acaccttaca catcataaag ggagatgatg ggactagagg agtaactgga
      841 cgtttcacct tcaccttaca ccgtaagtga ttccacatga tccctgggtg ttgggggaca
      901 ggggtcactt ctacttcaca cacacaggat tgtcaggcct ggactctgcc tgtgtcactc
      961 actgcgttat gtcccatgct gggatttggg catttagtgc aggacacaca cagaggagac
     1021 aaatttcaac agatcagaat tc
//
LOCUS       ASNMTATP9     225 bp ds-DNA             ORG       04-AUG-1990
DEFINITION  A.nidulans mitochondrion ATPase subunit 9 gene, complete cds.
ACCESSION   M35967
KEYWORDS    ATPase.
SOURCE      A.nidulans mitochondrion DNA.
  ORGANISM  Mitochondrion Aspergillus nidulans
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Plectomycetes; Eurotiales; Trichocomaceae; Aspergillus nidulans.
REFERENCE   1  (bases 1 to 225)
  AUTHORS   Brown,T.A., Ray,J.A., Waring,R.B., Scazzocchio,C. and Davies,R.W.
  TITLE     A mitochondrial reading frame which may code for a second form of
            ATPase subunit 9 in Aspergillus nidulans
  JOURNAL   Curr. Genet. 8, 489-492 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1      225     ATPase subunit 9
BASE COUNT       66 a     29 c     51 g     79 t
ORIGIN      
        1 atgttacaat cagctagaat aataggaaca ggattagcta caacaggttt aatcggagca
       61 ggtgtaggta tcggtgtagt atttggtgcc ttaatcttag gagttgcaag aaaccctgca
      121 ttaagaggtc aactgttttc atatgcaatt ttaggatttg catttgcgga agcaactgga
      181 ttatttgctt taatgatggc ttttttatta ttatatgtag cttaa
//
LOCUS       DOGPHA2       562 bp ss-mRNA            MAM       04-AUG-1990
DEFINITION  Canine phospholipase A-2 mRNA, complete cds.
ACCESSION   M35301
KEYWORDS    phospholipase A-2.
SOURCE      Canine pancreas, cDNA to mRNA.
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 562)
  AUTHORS   Kerfelec,B., LaForge,K.S., Puigserver,A. and Scheele,G.
  TITLE     Primary structures of canine pancreatic lipase and phospholipase
            A-2 messenger RNAs
  JOURNAL   Pancreas 1, 430-437 (1986)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by G.Scheele, 06-MAR-1990.
FEATURES       from  to/span     description
    pept         29      469     phospholipase A-2 precursor
    sigp         29       73     phospholipase A-2 signal peptide
    matp         95      466     phospholipase A-2
    mRNA    <     1      562     phospholipase A-2 mRNA
BASE COUNT      143 a    167 c    128 g    124 t
ORIGIN      
        1 ttctcttgct cagcaacttg accacaaaat gaaattcctc gtactggctg ctctgctcac
       61 agtggccgct gccgaggggg gcatcagccc gcgggcagtt tggcagttcc gcaacatgat
      121 caagtgcacg atccccgaga gtgacccctt gaaggactac aacgactacg gctgctactg
      181 tggcctgggt ggatccggca cccctgtgga tgagttggac aagtgttgcc agacgcatga
      241 ccactgctac tcagaagcca agaaactgga cagctgtaaa ttcctcctgg acaaccccta
      301 caccaaaatc tactcatact catgctccgg ctctgagatc acctgcagca gcaaaaacaa
      361 agactgccag gcctttatct gcaactgtga ccgcagtgct gccatctgct tttcaaaggc
      421 tccgtataac aaggagcaca agaacctgga cactaagaag tactgctaga attgagtctc
      481 ctctctggaa agcatcattc ctacatgcct cgtggccttc accctaccct gtgtcctcca
      541 ataaagcact ttgttgaaag gc
//
LOCUS       DOGPLIP      1493 bp ss-mRNA            MAM       04-AUG-1990
DEFINITION  Canine lipase mRNA, complete cds.
ACCESSION   M35302
KEYWORDS    lipase.
SOURCE      Canine pancreas, cDNA to mRNA.
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 1493)
  AUTHORS   Kerfelec,B., LaForge,K.S., Puigserver,A. and Scheele,G.
  TITLE     Primary structures of canine pancreatic lipase and phospholipase
            A-2 messenger RNAs
  JOURNAL   Pancreas 1, 430-437 (1986)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by G.Scheele, 06-MAR-1990.
FEATURES       from  to/span     description
    pept         35     1438     lipase precursor
    sigp         35       85     lipase signal peptide
    matp         86     1435     lipase
    mRNA    <     1     1493     lipase mRNA
BASE COUNT      410 a    377 c    378 g    328 t
ORIGIN      
        1 cgaaagggag ggaaggccca gaacatccca caggatggtg agcatctgga caatcgcgct
       61 gtttctgctg ggagcagcca aagcaaagga agtttgctat gaacaaatcg gatgcttttc
      121 tgacgcggag ccctgggccg ggacagcaat caggcctctg aaagttctcc cctggagccc
      181 cgagagaatc ggcacccgct tcctgctcta caccaacaag aacccaaaca actttcaaac
      241 tctccttccc tctgatccat caacgattga ggcatcaaat tttcaaacag acaagaagac
      301 ccggttcacc atccatggct tcataaacaa gggagaagag aactggctgc tggatatgtg
      361 caagaacatg ttcaaggttg aggaggtgaa ctgcatctgc gtggactgga agaaaggttc
      421 ccaaacctca tatacacagg ctgccaacaa cgtgcgggtg gtgggcgccc aggtggccca
      481 gatgctcagc atgctctcgg caaactacag ctactcacct tcccaagtcc agctcatcgg
      541 ccacagccta ggagcccacg tggctgggga ggcagggagc aggactccag gtctgggcag
      601 gattacaggg ttggatcctg tagaagcaag tttccagggc actcctgaag aggttcgact
      661 tgatcccact gatgctgact ttgttgatgt gattcacaca gatgcagctc ccctgatccc
      721 attcctgggt tttggaacaa gccaacagat gggtcacctt gacttcttcc ccaatggtgg
      781 agaggaaatg ccaggatgca agaagaatgc cctgtcacag atcgtgaacc tagatggcat
      841 ttgggaagga actcgggact tcgtggcttg caatcacctg agaagttaca agtattactc
      901 agagagcatc ctcaaccctg atggatttgc ttcctacccc tgtgcttcct acagggcctt
      961 tgaatctaac aagtgcttcc cctgcccaga tcaagggtgc ccacagatgg gtcactatgc
     1021 tgataaattt gctgtcaaga caagtgatga gacacagaaa tacttcctga acaccggaga
     1081 ttccagcaat tttgctcgct ggagatacgg ggtttctata acattgtctg ggaaaagagc
     1141 cactggtcag gctaaagttg ctttgtttgg aagtaaggga aatactcatc aattcaatat
     1201 cttcaagggg attctcaaac caggctctac tcattccaat gagtttgatg caaagcttga
     1261 tgttggaaca attgagaaag tcaagtttct ttggaataac aacgtggtaa acccaacctt
     1321 tcccaaagtg ggtgcagcca agatcaccgt gcaaaaggga gaggagaaaa cagtgcacag
     1381 cttctgcagc gaaagcaccg tgagggagga tgttctgctc accctcacgc cctgttagcg
     1441 tccagcgcaa cccgggcacc gtgttaacag caataaaaac cactgatgca ttt
//
LOCUS       HUMPGDZ1     1106 bp ss-mRNA            PRI       04-AUG-1990
DEFINITION  Human MHC class II lymphocyte antigen (HLA-DNA) gene, complete cds.
ACCESSION   M31525 M35125
KEYWORDS    cell surface glycoprotein; class II gene; lymphocyte antigen;
            major histocompatibility complex.
SOURCE      Human B-cell cell line PGF, cDNA to mRNA, clone PGDZ1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1106)
  AUTHORS   Young,J.A.T. and Trowsdale,J.
  TITLE     The HLA-DNA (DZA) gene is correctly expressed as a 1.1kb mature
            mRNA transcript
  JOURNAL   Immunogenetics 31, 386-388 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.T.Young, 22-JAN-1990.
FEATURES       from  to/span     description
    pept         77      829     MHC HLA-DNA precursor
                                 /hgml_locus_uid="LV0117X" /nomgen="HLA-DNA"
                                 /map="6p21.3"
    sigp         77      151     MHC HLA-DNA signal peptide
    matp        152      826     MHC HLA-DNA
    mRNA          1     1106     MHC HLA-DNA mRNA
BASE COUNT      204 a    354 c    289 g    259 t
ORIGIN      
        1 cttcttcttt acctccgcct tgttcctgtc ctcaccacac ggactgagac tgatttgatt
       61 aaagcaccag agtgtaatgg ccctcagagc agggctggtc ctggggttcc acaccctgat
      121 gaccctcctg agcccgcagg aggcaggggc caccaaggct gaccacatgg gctcctacgg
      181 acccgccttc taccagtctt acggcgcctc gggccagttc acccatgaat ttgatgagga
      241 acagctgttc tctgtggacc tgaagaaaag cgaggccgtg tggcgtctgc ctgagtttgg
      301 tgactttgcc cgctttgacc cgcagggcgg gctggccggc atcgccgcaa tcaaagccca
      361 tctggacatc ctggtggagc gctccaaccg cagcagagcc atcaacgtgc ctccacgggt
      421 gaccgtgctc cccaagtctc gggtggagct gggccagccc aacatcctca tctgcatcgt
      481 ggacaacatc ttcccccctg tgatcaatat cacctggctg cgcaacggcc aaactgtcac
      541 tgagggagtg gcccagacca gcttctattc ccagcctgac catttgttcc gcaagttcca
      601 ctacctgccc ttcgtgccct cagccgagga cgtctatgac tgccaggtgg agcactgggg
      661 cctggatgcg ccactcctca ggcattggga gctccaggtg cctattccac caccagatgc
      721 catggagacc ctggtctgtg ccctgggcct ggccatcggc ctggtgggct tcctcgtggg
      781 caccgtcctc atcatcatgg gcacatatgt gtccagtgtc cccaggtaat gatccttctg
      841 agagaaatga cttgtgggag acaccctgca gatcctcatg ggtttgtgac agcccctgcg
      901 tgctcagtgc cctttaagtg catcccgctg tgctgacttt gagtgggatc aacatctgtc
      961 ctacgggtcc cctctttttt ggccccagta ttcatggcag ggtttgttgg acacctacta
     1021 gcttcccttc ccattcaaca cacacacaca ttcttgctct acccaaagct ctggctggca
     1081 gcactaaatg ctttggtggt gtttgc
//
LOCUS       LACPEPXP     3963 bp ds-DNA             BCT       04-AUG-1990
DEFINITION  L.lactis X-prolyl dipeptidyl aminopeptidase (pepXP) gene,
            complete cds.
ACCESSION   M35865
KEYWORDS    X-prolyl dipeptidyl aminopeptidase.
SOURCE      L.lactis lactis (strain NCDO 763, ML3) DNA.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 3963)
  AUTHORS   Nardi,M., Chopin,M.-C., Chopin,A., Cals,M.-M. and Gripon,J.-C.
  TITLE     Cloning and DNA sequence analysis of an X-prolyl dipeptidyl
            aminopeptidase determinant from Lactococcus lactis subsp. lactis
            NCDO 763
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Chopin, 02-JUL-1990.
            
                   Author address: A.Chopin
                   Institut National de la Recherche Agronomique
                   Lab de G/032n/032tique Microbplenne
                   Institut de Biotechnologie CRJ
                   78352 Jouy-en-Josas Cedex
                   FRANCE
FEATURES       from  to/span     description
    pept       1105     3396     X-prolyl dipeptidyl aminopeptidase (pepXP) (EC
                                 3.4.14.5)
    pept        933       64 (c) ORF1 (put.)
    pept       3398     3883     ORF3 (put.)
BASE COUNT     1261 a    748 c    733 g   1221 t
ORIGIN      
        1 actagtcatt gcacttatgg tcaaataaaa aaattactga caactctgtc agtaattttt
       61 ttattaaaga taaatcattt taaataatgc aacggctgca agtgatgcaa gaattggtgc
      121 caaaactgga acccatgcat accaccattt agatgaacct ttagcttcac ccaaaactga
      181 ttttggtaag agactgtgaa ccaaacgtgg accaaagtca cgcgccggat tcaaaccagg
      241 acctgtaggt ccaccaagtg caactacgag tcccataaca aggaatccta ggaacaagtg
      301 agcaatcatt ttagaagctg aagctcctga tgcttgaacc caaatttgat tcataacgtc
      361 tgatgaagaa acatctgctc cttgaccttt caagtagttt gtcatccaag taattgattg
      421 actaccaaag aagatatttg tcgcagcaac cgcaccgaag aataatacaa atgaacctaa
      481 gaactcattc aagaaaccat taatggttgc ccccaaacga gttttttcac tgttatcatc
      541 aacgttgtcg atagttgaga aagttccaag aatagcattt ggattttgtg ttttaagata
      601 gtatggacga taaaccataa cgattaataa ttgtccaaac atcgcaccta atacttgcgc
      661 aataatgtat tgagcaacgt gtgcccaagg aaaaagtcct gaggctgcaa gtccaagtgt
      721 gaaagctggg ttgatttgtg aagtaatatt accaaaagca actgctggca acattacgcc
      781 aagtccgtaa ccccaaccaa taatcatcca agattgtgcg tgagcttttg tacctttaag
      841 ttcaacgtta gcaaccgcac cattacccat aataatgagt agggcagttc caacaaattc
      901 agtgatatat ttcactgtcc atgtaacatc cattttttat ggaaccctcc aaaatttgct
      961 tttaatataa taacctcagc tattatacca gaaaagcacg tttgcttcaa atattcattg
     1021 aaaatagcgc gcctttagtc tgattaatca ctgaaaattt gctataatta aaagtattat
     1081 catgtttatt acggaggatt taaaatgcgc tttaaccatt tttcaattgt tgacaaaaat
     1141 tttgatgaac aattagcaga acttgatcag cttggttttc gttggtctgt tttttgggat
     1201 gaaaagaaaa tcttaaaaga ttttctcatt caaagtccta gtgatatgac tgctctacaa
     1261 gcaactgctg aattagatgt cattgaattt ttaaaatcct ctatagaatt agattgggaa
     1321 attttctgga atattgcttt gcaacttctt gattttgtac caaactttga ttttgagatt
     1381 ggcaaagctt ttgagtatgc aaaaaatagc aacctccctc aaatcgaagc agaaatgacc
     1441 acagaaaata tcatttctgc tttctactac ctcctttgta cgcgtcgaaa aactggaatg
     1501 attttagttg aacattgggt atcagaggga cttcttcctt tagataatca ttatcatttt
     1561 ttcaatgaca aatcgttggc aacctttgat tcttcacttc ttgaacgtga agtcctttgg
     1621 gttgaaagtc cagtcgatag tgaacaacgg ggtgaaaatg atttgataaa aattcaaatc
     1681 attcgtccaa aaagtaccga aaaacttcca gtggtaatga cggcgagtcc ttatcattta
     1741 gggattaatg ataaagcaaa tgatttagcc ttacacgata tgaatgtaga acttgaagaa
     1801 aaaaccagtc acgaaattca tgttgaacaa aagctcccgc aaaaattatc agcgaaagca
     1861 aaggaacttc cgattgttga caaagctcct tatcgtttta ctcatggttg gacttattct
     1921 ttaaatgact atttcttgac tagaggattt gcctctatct atgtggccgg tgttggaacc
     1981 cgttcttctg acggctttca aacctcaggc gattaccaac aaatttatag tatgacagct
     2041 gtcattgact ggttgaatgg tcgagctcgt gcttatactt ctcgtaaaaa gacacatgaa
     2101 atcaaagcct cttgggcaaa tggtaaagtg gctatgactg gaaaatctta tctgggaacc
     2161 atggcttatg gagccgctac tactggagtt gaaggtctag agcttatctt agccgaagct
     2221 ggaatttctt cttggtataa ttattaccgt gaaaatggcc ttgtccgctc tcctggcggt
     2281 tttcctggtg aagatctaga tgttcttgct gcgctcactt attcacgtaa tcttgatggt
     2341 gctgatttct taaaaggtaa tgctgaatac gaaaaacgat tggcagaaat gactgctgct
     2401 ttagaccgca aatctggtga ttacaatcaa ttttggcatg accgaaatta tctgataaat
     2461 actgacaaag ttaaggctga tgtcttaatt gttcatggac tacaagattg gaatgtgaca
     2521 ccagagcaag cctataattt ctggaaagct ctgccagaag gtcatgctaa acatgccttt
     2581 ttacaccgtg gggctcatat ttatatgaat tcttggcagt ccattgattt ttctgaaacg
     2641 attaatgctt actttgtcgc aaaattactg gatagagatt taaatttaaa tctcccacct
     2701 gttattttac aggaaaattc taaagaccaa gtatggacga tgatgaatga ttttggtgcc
     2761 aatacccaaa tcaaactccc tcttggtaaa acggccgttt cttttgctca attcgacaat
     2821 aattatgacg atgagacttt caaaaagtat tctaaagatt tcaatgtctt taaaaaagac
     2881 ttatttgaaa acaaagccaa tgaagctgtc attgatttag aacttccgtc aatgctgaca
     2941 attaatggtc cagttgagct tgaactcaga ctaaaattaa atgacacgaa aggcttctta
     3001 tctgctcaaa ttcttgattt tggtcagaaa aaacgtttgg aagataaagt cagagtgaaa
     3061 gattttaaag tgcttgaccg tgggcgaaat ttcatgttag acgacttagt tgaactccca
     3121 cttgttgaga gcccttatca gttagtcact aaaggcttta ctaatctcca aaatcaaagt
     3181 ttactgacag tcagtgattt aaaggctgac gaatggttta caataaaatt tgaactacag
     3241 ccaaccattt atcatttaga aaaagctgac aagcttcggg tcattctcta tagtactgac
     3301 tttgaacata cggttcgtga taatcgtaaa gtgacttacg agattgattt atctcaatct
     3361 aaactcatta ttcctattga aagtgtgaaa aattaatatg gcaattattg gaattcttgg
     3421 aactccttac aacacagttg aacgttctcc tttttggtgg aacaaagtaa gctatacacg
     3481 tcaatctttt attgatgttt tccaggaatt aggtcacact gtgattgttc ttcctgttga
     3541 caaaaccgaa aatataaaaa attatctgcc attggtcgat aaaattgtac tgactggtgg
     3601 cgttgatgtc agtccttatc tctatggcga ggaacctcat gctcaattag gaacgactga
     3661 cccaattcgt gaccgttttg agcttgccgc aattaaagct gctctagaag cgaataaacc
     3721 tattcttggt gtatgtcgtg gcctacaatt gctcaatgtt tattttggag gaactcttta
     3781 tcaagacttg agcctaactt caagtcagat aaagcacctc caaagtccaa ctccccaaga
     3841 agttccgaca catcacatta gcgtcgaaaa agaagttctt tagatttttt acctgaaaat
     3901 tacatggtca attctttcca ccatcaagtc attaaggatt tgggtcaggg cttgcaggcc
     3961 att
//
LOCUS       PIGA1AG       668 bp ss-mRNA            MAM       04-AUG-1990
DEFINITION  Swine alpha-1 acid glycoprotein mRNA, 3' end.
ACCESSION   M35990
KEYWORDS    alpha-1 acid glycoprotein.
SOURCE      Swine, cDNA to mRNA.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 668)
  AUTHORS   Stone,R.T. and Maurer,R.A.
  TITLE     Cloning and developmental regulation of alpha-1 acid glycoprotein
            in swine
  JOURNAL   Dev. Genet. 8, 295-304 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      552     alpha-1 acid glycoprotein (AA at 1)
    mRNA    <     1      668     alpha-1 acid glycoprotein mRNA
BASE COUNT      181 a    186 c    170 g    131 t
ORIGIN      
        1 ccgctgtgcg ccaacttgac ggcagtgccc atcaccaatg ccaccttgga cttgatctcc
       61 ggcaagtggt attatatcgg ctcggccttc cgaaaccccc agtacaatga gtcggccaga
      121 tccatccagg cagccttctt tttctttgat cccaagcccg cggaggacaa gataaacctc
      181 agagagtacc agaccatagg gaaccagtgc atctataacg acagctccct gaaggtccat
      241 cgggagaatg gaagcctatc gaaacacgag atgggcagag aacacgttgc tgacctgctg
      301 ttaaccaagg tccccaagac cttcatgctc atcaattcct tgcacgataa gaacaacgtg
      361 gggctctcct tctacgccga caaggcggag gtgactccgg agcagatgaa agagttccat
      421 gatgctattg agtgcacggg aatccacaag tcagaaatca catacaccga cgagaaaaag
      481 gatctgtgtg ggccgctgga gaagcagcat gaggaggaaa ggaagaagga gaaggagaag
      541 gaggggtcct agcattgggt caggtccttg ggcaccttgg aggcccatcc tcagcacctc
      601 ccacccaccc tttgcacctc aattctctcc ctcagttgca tcaataaaac ttcgcatttg
      661 gtacaggc
//
LOCUS       RATMTNDI      946 bp ds-DNA             ORG       04-AUG-1990
DEFINITION  Rat NADH-dehydrogenase (NDI) gene, complete cds.
ACCESSION   M35826
KEYWORDS    NADH-dehydrogenase.
SOURCE      Rat mitochondrion DNA.
  ORGANISM  Mitochondrion Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; 
            Rattus norvegicus.
REFERENCE   1  (bases 1 to 946)
  AUTHORS   Quagliariello,C. and Spena,A.
  TITLE     Sequenza nucleotidica del gene NDI del DNA mitocondriale di fegato
            di ratto
  JOURNAL   Boll. Soc. Ital. Biol. Sper. 64, 211-218 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1      946     NADH-dehydrogenase (NDI) (att start codon)
BASE COUNT      294 a    299 c    103 g    250 t
ORIGIN      
        1 attaatatcc taacactcct aatcccaatc ttaattgcca taggccttct caccctagta
       61 gaacggaaaa tcctaggcta catacaatta cgcaaaggcc ccaacaacga aggcccatat
      121 ggtaaactac aaccatttgc agatgccata aaactattca taaaagaacc catacgccct
      181 ctaaccacct caatatcact atttattatc gccccaaccc tctcccttac actagctcta
      241 agcctatgaa ttcccttacc aataccccac ccccttatca acctcaacct aggcatacca
      301 tttattctag ccacatccag cctttcggtc tactccattc tatgatcggg atgagcctca
      361 aattcaaaat actccctatt cggagcccta cgagccgttg cccaaaccat ctcttacgaa
      421 gtcacaatag ccttatacct cttatccgtc ctcctaataa gcggctcctt ctccctacaa
      481 atacttatca ctacacaaga acatatctga ctattaatcc ccgcctgacc aatagccata
      541 atatgataca tttcaaccct cgcagaaaca aatcgagctc ccttcgactt aacagaagga
      601 gaatcagaat tagtctcagg ctttaacgtc gaatacgccg caggaccatt cgccctattc
      661 ttcatagccg agtacaccaa cattattcta ataaacgccc taacatcaat tgtattccta
      721 ggccccttat atcatatcaa ttaccctgaa ttatactcaa ccagcttcat aacagaaaca
      781 ctacttctat ccacaacttt cctatgaatt cgagcatcct acccccgttt tcgatatgac
      841 caactaatgc acctcctatg aaaaaatttc ctcccactaa cactagcatt ctgcatatga
      901 tacatttccc tgccaatttt cctagcagga attccaccct acacat
//
LOCUS       RICPETB      1682 bp ds-DNA             PLN       04-AUG-1990
DEFINITION  Rice chloroplast apocytochrome b6 (petB) gene, complete cds.
ACCESSION   M35995
KEYWORDS    apocytochrome b6.
SOURCE      Rice (L. cv. Labelle) chloroplast DNA.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1682)
  AUTHORS   Cote,J.-C., Wu,N.-H. and Wu,R.
  TITLE     Nucleotide sequence of the rice chloroplast apocytochrome b6 gene
            (petB)
  JOURNAL   Plant Mol. Biol. 11, 873-874 (1988)
  STANDARD  simple staff_review
COMMENT     Authors indicate two potential petB products for the given
            sequence.  The first product contains an intron, and has its
            initiation codon at position 67.  The second product has no intron,
            and is read straight through from position 829.
FEATURES       from  to/span     description
    pept         67       72     apocytochrome b6, exon 1 (alt.)
                886     1527     apocytochrome b6, exon 2 (alt.)
    pept        829     1527     apocytochrome b6 (alt.)
    IVS          73      885     apocytochrome b6 intron A (put.) (no splice
                                 consensus)
BASE COUNT      499 a    290 c    318 g    575 t
ORIGIN      
        1 gctctacatt tctagacatt ctggtagttc gaccgtggaa ttttttggtt tcggtatctc
       61 tggaatatga gtgtgtgact tgttagaatt tgctcctatt gataatacat agaaagcacc
      121 tgttatctct atcaagatga ttctaattcg tcggatatta tttattctag tatctggaac
      181 acgaaataga tagagtggat caagaaaaaa aaatgaaact atgattcata ttaactattc
      241 agacctcgca accagactga aaaaaattca agtagttctt aataaaaata aaaaaagaaa
      301 atttcttcct tccaattttg tttgcccaaa aaacaacttt ttttctctcg attttgtcga
      361 gtcattacac cgattcaata aatgatcatc aagcggttct tattcgaaga acccttgcct
      421 tttgtttagc ttgagactca atcatcgtgg ctctagtatg aatctaaggt tttaattgaa
      481 ctgattcata ggatcgcaac aagataattt ctaccagaaa actactccaa tttttgcttt
      541 atttatttat ctagtaaaac aagagtaaat ctgcattacg cacaaaaaaa agaaatccaa
      601 aatagggaag agaaaaatca agaggcctct aatgatcaac atttgggaaa gaaagataga
      661 cgagccaact tgagattttt tggcattatc atcacaaaga agaaattctg gatttttctt
      721 atttcatatc ttcaaggcaa atcgacccaa cccagtggct gatgaagttt tgaacctttt
      781 ttctaatatc cgttgaaaat ttgtgtgttt ctgtttgagc cgtacgagat gaaattctca
      841 tatacggttc tcggaggggg gttcgggtta gttacctatc tcaataaagt atatgattgg
      901 tttgaggaac gtcttgagat tcaggcaatt gcagatgata taactagtaa atatgttcct
      961 cctcatgtca acatatttta ttgtttaggg ggaattacac ttacttgttt tctagtacaa
     1021 gttgctaccg gttttgctat gactttttac tatcgcccaa ccgttacaga ggctttttcc
     1081 tcggttcaat acataatgac cgaggccaac tttggttggt taatccgatc agttcatcga
     1141 tggtcagcaa gtatgatggt tctaatgatg atcctgcacg tatttcgtgt gtatctcaca
     1201 ggtgggttta aaaaaccccg cgaattaact tgggtcactg gtgtggtttt agctgtatta
     1261 actgcatcgt ttggtgtaac tggttattct ttaccttggg atcaaattgg ttattgggca
     1321 gtcaaaattg tgacaggtgt gcctgacgcg attccggtaa taggatcacc tttagtggag
     1381 ttattacgtg gaagtgctag tgtgggtcaa tccactttga ctcgttttta tagtttacat
     1441 acctttgtac ttcctctgct tactgccgta tttatgttaa tgcactttct aatgatacgt
     1501 aagcaaggta tttcgggccc tttataagga aggcatctca tagagagttc taattctcat
     1561 atatcatatc gggtaggttg tggtatttca ttgctacaaa catgggttat tctaaaataa
     1621 gacatgtcat ttagatactt ctcttcaact ccgaactatt gtgatacaaa taatacaaat
     1681 ag
//
LOCUS       VAZTHYK      1524 bp ds-DNA             VRL       04-AUG-1990
DEFINITION  Varicella-zoster virus thymidine kinase gene, complete cds.
ACCESSION   M36160
KEYWORDS    thymidine kinase.
SOURCE      Varicella-zoster virus (strain Oka vaccine) DNA.
  ORGANISM  Varicella-zoster virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 1524)
  AUTHORS   Mori,H., Shiraki,K., Kato,T., Hayakawa,Y., Yamanishi,K. and
            Takahashi,M.
  TITLE     Molecular analysis of the thymidine kinase gene of thymidine
            kinase-deficient mutants of Varicella-zoster virus
  JOURNAL   Intervirology 29, 301-310 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        420     1445     thymidine kinase
BASE COUNT      434 a    354 c    327 g    408 t      1 others
ORIGIN      
        1 ctggcgcata ccctcgcaaa actggtgata cttagtaggg gtatgtatat tagcgctaaa
       61 acggcaagat tttaattcca ctataaaaca aacggtcttt ccggcaccac tggattccgt
      121 ttgtataata caaacacaat cggggcgtcg gcgtcccaaa tttacttcaa acgacattga
      181 tatgcgtaca gccctttgaa catccacgtg ggataacggc gacaggagtt ttgccagcct
      241 cgggttgaac gcgtccgcga aacctcgacg tacgttatca atatcctttt tgagtacatc
      301 gtaaaaacga gtgtggcaac gttgtcccaa acgaaaacac ttggcccgaa ttcgactagc
      361 ggacatattt gaagttccgt cccagaagat aacctaagac gcgtttgtct acaataaaca
      421 tgtcaacgga taaaaccgat gtaaaaatgg gcgttttgcg tatttatttg gacggggcgt
      481 atggaattgg aaaaacaacc gccgccgaag aatttttaca ccactttgca ataacaccaa
      541 accggatctt actcattggg gagcccctgt cgtattggcg taaccttgca ggggaggacg
      601 ctatttgcgg aatttacgga acacaaactc gccgtcttaa tggagacgtt tcgcctgaag
      661 acgcacaacg cctcacggct cattttcaga gcctgttctg ttctccgcat gcaattatgc
      721 atgcgaaaat ctcggcattg atggacacaa gtacatcgga tctcgtacaa gtaaataagg
      781 agccgtataa aattatgtta tccgaccgac acccaatcgc ctcaactata tgttttccct
      841 tgtccagata cttagtggga gatatgtccc cagcggcgct tcctgggtta ttgtttacgc
      901 ttcccgctga accccccggg accaacttgg tagtttgtac cgtttcactc cccagtcatt
      961 tatccagagt aagcaaacgg gccagaccgg gagaaacggt taatctgccg tttgttatgg
     1021 ttctgagaaa tgtatatata atgcttatta atacaattat atttcttaaa actaacaact
     1081 ggcacgcggg ctggaacaca ctgtcatttt gtaatgatgt atttaaacag aaattacaaa
     1141 aatccgagtg tataaaacta cgcgaagtac ctgggattga agacacgtta ttcgccgtnc
     1201 ttaaacttcc ggagctttgc ggagagtttg gaaatattct gccgttatgg gcatggggaa
     1261 tggagaccct ttcaaactgc ttacgaagca tgtctccgtt cgtattatcg ttagaacaga
     1321 caccccagca tgcggcacaa gaactaaaaa ctctgctacc ccagatgacc ccggcaaaca
     1381 tgtcctccgg tgcatggaat atattgaaag agcttgttaa tgccgttcag gacaacactt
     1441 cctaaatata cctagtattt acgtatgtac cagtaaaaag atgatacaca ttgtcatact
     1501 cgcgtgtacg tgtttttctt tttt
//
LOCUS       HUMCAMA      2175 bp ss-mRNA            PRI       04-AUG-1990
DEFINITION  Human calmodulin mRNA, complete cds.
ACCESSION   J04046
KEYWORDS    calmodulin.
SOURCE      Human teratoma cell line, cDNA to mRNA, clone lambda-ht6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2175)
  AUTHORS   Fischer,R., Koller,M., Flura,M., Mathews,S., Strehler-Page,M.-A.,
            Krebs,J., Penniston,J.T., Carafoli,E. and Strehler,E.E.
  TITLE     Multiple divergent mRNAs code for a single human calmodulin
  JOURNAL   J. Biol. Chem. 263, 17055-17062 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2175; revises [1])
  AUTHORS   Fischer,R., Koller,M., Flura,M., Mathews,S., Strehler-Page,M.-A.,
            Krebs,J., Penniston,J.T., Carafoli,E. and Strehler,E.E.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by R.Fischer, 28-JUL-1988, and [2] 18-JUL-1990.
            
               [2] Author address: M.Koller
               Biochemistry Dept.
               Swiss Federal Inst. of Technology
               Universitaetsstr. 16
               Zurich,  CH
               biochemie@czheth5a
FEATURES       from  to/span     description
    pept        104      553     calmodulin
    mRNA          6     2175     CALM mRNA
BASE COUNT      430 a    626 c    592 g    527 t
ORIGIN      
        1 tgagtgtgga ggcgcggacg cgcggcggag ctggaactgc tgcagctgct gccgccgccg
       61 gaggaacctt gatccccgtg ctccggacac cccgggcctc gccatggctg accagctgac
      121 tgaggagcag attgcagagt tcaaggaggc cttctccctc tttgacaagg atggagatgg
      181 cactatcacc accaaggagt tggggacagt gatgagatcc ctgggacaga accccactga
      241 agcagagctg caggatatga tcaatgaggt ggatgcagat gggaacggga ccattgactt
      301 cccggagttc ctgaccatga tggccagaaa gatgaaggac acagacagtg aggaggagat
      361 ccgagaggcg ttccgtgtct ttgacaagga tgggaatggc tacatcagcg ccgcagagct
      421 gcgtcacgta atgacgaacc tgggggagaa gctgaccgat gaggaggtgg atgagatgat
      481 cagggaggct gacatcgatg gagatggcca ggtcaattat gaagagtttg tacagatgat
      541 gactgcaaag tgaaggcccc ccgggcagct ggcgatgccc gttctcttga tctctctctt
      601 ctcgcgcgcg cactctctct tcaacactcc cctgcgtacc ccggttctag caaacaccaa
      661 ttgattgact gagaatctga taaagcaaca aaagatttgt cccaagctgc atgattgctc
      721 tttctccttc ttccctgagt ctctctccat gcccctcatc tcttcctttt gccctcgcct
      781 cttccatcca cgtcttccaa ggcctgatgc attcataagt tgaagccctc cccagatccc
      841 cttggagcct ctgccctcct ccagcccgga tggctctcct tcattttggt ttgtttcctc
      901 ttgtttgtca tcttattttg ggtgctgggg tggctgccag cctgtcccgg gacctgctgg
      961 gagggacaag aggccctccc caggcagaag agcatgccct ttgccgttgc atgcaaccag
     1021 ccctgtgatt ccacgtgcag atcccagcag cctgttgggg caggggtgcc aagagaggca
     1081 ttccagaagg actgaggggg cgttgaggaa ttgtggcgtt gactggatgt ggcccaggac
     1141 tgggtcgagg gggccaactc acagaagggg actgacagtg ggcaacactc acatcccact
     1201 ggctgctgtt ctgaaaccat ctgattggct ttctgaggtt tggctgggtg gggactgctc
     1261 atttggccac tctgcagatt ggacttgccc gcgttcctga agcgctctcg agctgttctg
     1321 taaatacctg gtgctaacat cccatgccgc tccctcctca cgatgcaccc accgccctga
     1381 gggcccgtcc taggaatgga tgtggggatg gtcgctttgt aatgtgctgg ttctcttttt
     1441 ttttctttcc cctctatggc ccttaagact ttcattttgt tcagaaccat gctgggctag
     1501 ctaaagggtg gggagaggga agatgggccc caccagctct caagagaaac gcacctgcaa
     1561 taaaacagtc ttgtcggcca gctgcccagg gacggcagct acagcagcct ctgcgtcctg
     1621 gtccgccagc acctcccgct tctccgtggt gacttggcgc cgcttcctca catctgtgct
     1681 ccgtgccctc ttccctgcct cttccctcgc ccacctgcct gcccccatac tccccagcgg
     1741 agagcatgat ccgtgccctt gcttctgact ttcgcctctg ggacaagtaa gtcaatgtgg
     1801 gcagttcagt cgtctgggtt ttttcccctt ttctgttcat ttcatctggc tccccccacc
     1861 acctccccac cccacccccc accccctgct tcccctcact gcccaggtcg atcaagtggc
     1921 ttttcctggg acctgcccag ctttgagaat ctcttctcat ccaccctctg gcacccagcc
     1981 tctgagggaa ggagggatgg ggcatagtgg gagacccagc caagagctga gggtaagggc
     2041 aggtaggcgt gaggctgtgg acattttcgg aatgttttgg ttttgttttt tttaaaccgg
     2101 gcaatattgt gttcagttca agctgtgaag aaaaatatat atcaatgttt tccaataaaa
     2161 tacagtgact acctg
//
LOCUS       BMEGDH1      2834 bp ds-DNA             BCT       04-AUG-1990
DEFINITION  B. megaterium glucose dehydrogenase gene and ORFs.
ACCESSION   D90043
KEYWORDS    glucose dehydrogenase.
SOURCE      Bacillus megaterium (strain IAM1030) DNA.
  ORGANISM  Bacillus megaterium
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2834)
  AUTHORS   Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
            Urabe,I. and Okada,H.
  TITLE     Active and silent isozyme genes of glucose dehydrogenase from
            Bacillus megaterium IAM1030
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Toshihide
            Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan.
FEATURES       from  to/span     description
    pept       1964     2749     glucose dehydrogenase (EC 1.1.1.47)
    ORF         181      867     ORF1
    ORF        1086     1946     ORF2
    signal       89       94     put. -35 region for ORF1
    signal      116      121     put. -10 region for ORF1
    signal      167      172     ORF2 ribosome binding site
    signal      893      928     termination signal
    signal      972      977     put. -35 region for ORF2
    signal      997     1003     put. -10 region for ORF2
    signal     1070     1076     ORF2 ribosome binding site
    signal     1949     1954     glucose dehydrogenase ribosome binding site
    signal     2751     2783     termination signal for glucose dehydrogenase
BASE COUNT      896 a    442 c    644 g    852 t
ORIGIN      
        1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga
       61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag
      121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa
      181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg
      241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt
      301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat
      361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca
      421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat
      481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag
      541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta
      601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat
      661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct
      721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt
      781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt
      841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg
      901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt
      961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt
     1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt
     1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt
     1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt
     1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg
     1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta
     1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca
     1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg
     1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag
     1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa
     1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat
     1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca
     1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg
     1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc
     1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga
     1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag
     1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga
     1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg
     2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa
     2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt
     2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt
     2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt
     2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga
     2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt
     2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa
     2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat
     2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg
     2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc
     2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc
     2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca
     2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga
     2821 ctggtgttga attc
//
LOCUS       BMEGDH2      1202 bp ds-DNA             BCT       04-AUG-1990
DEFINITION  B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene.
ACCESSION   D90044
KEYWORDS    glucose dehydrogenase; isozyme.
SOURCE      Bacillus megaterium (strain IAM1030) DNA.
  ORGANISM  Bacillus megaterium
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1202)
  AUTHORS   Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
            Urabe,I. and Okada,H.
  TITLE     Active and silent isozyme genes of glucose dehydrogenase from
            Bacillus megaterium IAM1030
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Toshihide
            Mitamura Department of Fermentation Technology Osaka University 2-1
            Yamada-oka Suita, Osaka 565 Japan Phone:  06-877-5111 x4373 Fax:
            06-876-9036
FEATURES       from  to/span     description
    pept        125      910     glucose dehydrogenase
    signal       27       32     put. -35 region
    signal       49       55     put. -10 region
    signal      111      116     SD sequence
    signal      928      959     termination signal
BASE COUNT      427 a    190 c    268 g    317 t
ORIGIN      
        1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa
       61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt
      121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt
      181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg
      241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc
      301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc
      361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt
      421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg
      481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa
      541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc
      601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa
      661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa
      721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg
      781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt
      841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg
      901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt
      961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg
     1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa
     1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg
     1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg
     1201 ag
//
LOCUS       BMOPTTHP1    1023 bp ss-mRNA            INV       04-AUG-1990
DEFINITION  B.mori PTTH mRNA.
ACCESSION   D90082
KEYWORDS    PTTH; preproPTTH.
SOURCE      B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to
            mRNA, clones P1, P2, C2, C9 and C19.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 1023)
  AUTHORS   Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
            Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
  TITLE     Molecular cloning of the Bombyx mori prothoracicotropic hormone
  JOURNAL   Science 247, 1333-1335 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hironori
            Ishizaki Department of Biology, School of Science, Nagoya
            University Chikusa-ku Nagoya 464-01 Japan Phone:  052-781-5111
            x2472 Fax:    052-783-0719 Telex:  SCUNAG J: 447-7323
FEATURES       from  to/span     description
    pept         34      708     preproPTTH
    matp        379      708     PTTH subunit
    signal      768      773     polyadenylation signal
    signal      826      831     polyadenylation signal
    signal      997     1002     polyadenylation signal
    signal     1004     1009     polyadenylation signal
BASE COUNT      358 a    179 c    186 g    300 t
ORIGIN      
        1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc
       61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg
      121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac
      181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg
      241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta
      301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa
      361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct
      421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc
      481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc
      541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct
      601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg
      661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt
      721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata
      781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc
      841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt
      901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca
      961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt
     1021 aaa
//
LOCUS       BMOPTTHP4     944 bp ss-mRNA            INV       04-AUG-1990
DEFINITION  B.mori preproPTTH mRNA.
ACCESSION   D90083
KEYWORDS    PTTH; preproPTTH.
SOURCE      B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to
            mRNA, clone P4 and C21.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 944)
  AUTHORS   Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
            Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
  TITLE     Molecular cloning of the Bombyx mori prothoracicotropic hormone
  JOURNAL   Science 247, 1333-1335 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hironori
            Ishizaki Department of Biology, School of Science, Nagoya
            University Chikusa-ku Nagoya 464-01 Japan Phone:  052-781-5111
            x2472 Fax:    052-783-0719 Telex:  SCUNAG J: 447-7323
FEATURES       from  to/span     description
    pept    <     1      631     preproPTTH
    matp        302      631     PTTH subunit
    signal      691      696     polyadenylation signal
    signal      749      754     polyadenylation signal
    signal      918      923     polyadenylation signal
    signal      925      930     polyadenylation signal
BASE COUNT      337 a    163 c    177 g    267 t
ORIGIN      
        1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt
       61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag
      121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga
      181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa
      241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag
      301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa
      361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg
      421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag
      481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga
      541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag
      601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt
      661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa
      721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag
      781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt
      841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata
      901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa
//
LOCUS       HUMMTSDHB     958 bp ss-mRNA            ORG       04-AUG-1990
DEFINITION  Human mitochondrial succinate-ubiquinone oxidoreductase
            (EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA.
ACCESSION   D90047
KEYWORDS    Ip; complex II; iron sulfur subunit; sdh B;
            succinate-ubiquinone oxidoreductase.
SOURCE      Human liver mitochondrion, cDNA to mRNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 958)
  AUTHORS   Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M.
  TITLE     Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning
            of iron sulfur(Ip) subunit of liver mitochondria
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Kiyoshi
            Kita Department of Parasitology Juntendo University 2-1-1,Hongo
            Bunkyo-ku, Tokyo 113 Japan Phone:  03-813-3111 x3542 Fax:
            03-814-9300
FEATURES       from  to/span     description
    pept    <     1      789     succinate-ubiquinone oxidoreductase (sdh B)
                                 (AA at 1)
    site         74       95     iron-sulfur binding site I
    site        167      179     iron-sulfur binding site II
    site        224      236     iron-sulfur binding site III
BASE COUNT      319 a    212 c    204 g    223 t
ORIGIN      
        1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt
       61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg
      121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag
      181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt
      241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac
      301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat
      361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag
      421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag
      481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc
      541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg
      601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc
      661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat
      721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct
      781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt
      841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata
      901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS       HUMNCAW      2287 bp ss-mRNA            PRI       04-AUG-1990
DEFINITION  Human nonspecific cross-reacting antigen (NCA-W272) mRNA.
ACCESSION   D90064
KEYWORDS    CEA; CEA gene family; PI-anchored membrane protein.
SOURCE      Human white blood cells, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2287)
  AUTHORS   Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and
            Matsuoka,Y.
  TITLE     Characterization of a cDNA clone encoding a new species of the
            nonspecific cross-reacting antigen (NCA), a member of the CEA gene
            family
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Fumiko
            Arakawa First Department of Biochemistry School of Medicine Fukuoka
            University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone:
            092-801-1011  x2892 Fax:   092-801-3600
FEATURES       from  to/span     description
    ORF          87     1136     nonspecific cross-reacting antigen ORF
BASE COUNT      618 a    593 c    453 g    623 t
ORIGIN      
        1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag
       61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc
      121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca
      181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc
      241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa
      301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag
      361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg
      421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg
      481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca
      541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc
      601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc
      661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac
      721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga
      781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag
      841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt
      901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa
      961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca
     1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag
     1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc
     1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa
     1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact
     1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc
     1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa
     1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt
     1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc
     1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga
     1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga
     1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc
     1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt
     1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt
     1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa
     1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca
     1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat
     1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga
     2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt
     2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac
     2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc
     2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg
     2281 gctgatc
//
LOCUS       PIGDESTN     1666 bp ss-mRNA            MAM       04-AUG-1990
DEFINITION  Porcine destrin mRNA.
ACCESSION   D90053 J05290
KEYWORDS    actin-binding protein; cofilin; destrin.
SOURCE      Pig adult brain, cDNA to mRNA, clone PD2.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1666)
  AUTHORS   Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S.,
            Iida,K. and Yahara,I.
  TITLE     Destrin, a mammalian actin-depolymerizing protein, is closely
            related to cofilin: Cloning and expression of porcine brain destrin
            cDNA
  JOURNAL   J. Biol. Chem. 265, 5768-5773 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Kenji
            Moriyama Department of Biophysics and Biochemistry Faculty of
            Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan
            Phone: 03-821-2111  x4408
FEATURES       from  to/span     description
    pept         54      551     destrin
    mRNA    <     1     1666     destrin mRNA
    signal     1645     1660     polyadenylation signal
BASE COUNT      492 a    322 c    372 g    480 t
ORIGIN      
        1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt
       61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt
      121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag
      181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa
      241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg
      301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc
      361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg
      421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc
      481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat
      541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc
      601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg
      661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct
      721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga
      781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt
      841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt
      901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt
      961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac
     1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca
     1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca
     1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt
     1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta
     1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag
     1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc
     1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg
     1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga
     1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt
     1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta
     1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taactt
//
LOCUS       RATPMP70X    3324 bp ss-mRNA            ROD       04-AUG-1990
DEFINITION  Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA.
ACCESSION   D90038 J05256
KEYWORDS    PMP70; peroxisomal membrane protein.
SOURCE      Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181,
            189,201].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3324)
  AUTHORS   Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T.
  TITLE     The 70-kDa Peroxisomal Membrane Protein Is a Member of the
            Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Keiju
            Kamijo Department of Biochemistry Shinshu University School of
            Medicine Matsumoto 390 Japan Phone:  263-35-4600 x5182 Fax:
            263-33-6458
FEATURES       from  to/span     description
    pept         36     2015     peroxisomal membrane protein (PMP70)
    signal     3271     3276     polyadenylation signal (put.)
BASE COUNT      921 a    658 c    773 g    972 t
ORIGIN      
        1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga
       61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc
      121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg
      181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac
      241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta
      301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac
      361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact
      421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg
      481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag
      541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc
      601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta
      661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg
      721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa
      781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta
      841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa
      901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct
      961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg
     1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca
     1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc
     1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg
     1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta
     1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg
     1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa
     1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg
     1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat
     1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc
     1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa
     1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg
     1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg
     1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac
     1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca
     1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc
     1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa
     1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt
     2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta
     2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca
     2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat
     2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg
     2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga
     2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct
     2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt
     2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat
     2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt
     2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt
     2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg
     2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac
     2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg
     2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg
     2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca
     2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc
     3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg
     3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa
     3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt
     3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat
     3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag
     3301 attaaaaaaa aaaaaaagga attc
//
LOCUS       VACSANT      1525 bp ds-DNA             VRL       04-AUG-1990
DEFINITION  Vaccinia virus surface (S) antigen gene.
ACCESSION   D90076
KEYWORDS    S gene; surface antigen.
SOURCE      Vaccinia virus DNA.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 1525)
  AUTHORS   Ueda,Y., Morikawa,S. and Matsuura,Y.
  TITLE     Identification and nucleotide sequence of the gene encoding a
            surface antigen induced by vaccinia virus
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Yoshiaki
            Ueda National Institute of Health Gakuen, Musashimurayama Tokyo
            190-12 Japan Phone:  0425-61-0771 Fax:    0425-65-3315
FEATURES       from  to/span     description
    pept        382     1437     surface antigen S
BASE COUNT      568 a    229 c    258 g    470 t
ORIGIN      
        1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata
       61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat
      121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa
      181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat
      241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac
      301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata
      361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca
      421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc
      481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt
      541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt
      601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa
      661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta
      721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca
      781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc
      841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga
      901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat
      961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta
     1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat
     1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa
     1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact
     1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga
     1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc
     1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt
     1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata
     1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt
     1501 gctatagaat gaatgaggat acgcg
//
LOCUS       YSCA1         881 bp ds-DNA             PLN       04-AUG-1990
DEFINITION  S. cerevisiae acidic ribosomal protein A1 (YSCA1).
ACCESSION   D90072 X13682
KEYWORDS    acidic ribosomal protein; ribosomal protein.
SOURCE      S. cerevisiae (strain IFO-40028) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 881)
  AUTHORS   Mitsui,K. and Tsurugi,K.
  TITLE     Identification of A1 protein as the fourth member of 13 kDa-type
            acidic ribosomal protein family in yeast Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 315; 631 to 881)
  AUTHORS   Mitsui,K. and Tsurugi,K.
  TITLE     Identification of A1 protein as the fourth member of 13 kDa-type
            acidic ribosomal protein family in yeast Saccharomyces cerevisiae
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989)
  STANDARD  full staff_entry
REFERENCE   3  (bases 277 to 742)
  AUTHORS   Tsurugi,K. and Mitsui,K.
  TITLE     cDNA and deduced amino acid sequence of acidic ribosomal protein A1
            from Saccharomyces cerevisiae
  JOURNAL   Nucleic Acids Res. 16, 3574-3574 (1988)
  STANDARD  simple automatic
COMMENT     These data kindly submitted in computer readable form by: Kazuhiro
            Mitsui Department of Biochemistry Yamanashi Medical college Tamaho,
            Nakakoma-gun Yamanashi 409-38 Japan Phone:  0552-73-1111 x2257
FEATURES       from  to/span     description
    pept        313      633     acidic ribosomal protein A1
    signal      125      135     UASrpg box1
    signal      182      192     UASrpg box2
    signal      716      721     poly(A) signal
    variant     303      303     a in [1]; g in [3]
    variant     684      685     tt in [1]; t in [3]
    variant     719      719     a in [1]; t in [3]
BASE COUNT      270 a    165 c    157 g    289 t
ORIGIN      
        1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt
       61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca
      121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa
      181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca
      241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa
      301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac
      361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca
      421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac
      481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct
      541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc
      601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg
      661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa
      721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg
      781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc
      841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c
//
LOCUS       YSCCDC23X    3107 bp ds-DNA             PLN       04-AUG-1990
DEFINITION  S. cerevisiae CDC23 protein gene.
ACCESSION   D90081
KEYWORDS    CDC23 protein.
SOURCE      Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line
            D22) DNA, clone YX34.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3107)
  AUTHORS   Doi,A. and Doi,K.
  TITLE     Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces
            cerevisiae
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Kenji Doi
            The Institute of Scientific and Industrial Research Osaka
            University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111
            Fax:    06-877-4977
FEATURES       from  to/span     description
    ORF         765     2645     ORF for CDC23
    site        153      161     calcium-binding site
BASE COUNT     1010 a    595 c    629 g    873 t
ORIGIN      
        1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact
       61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt
      121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc
      181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt
      241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg
      301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct
      361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt
      421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga
      481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag
      541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat
      601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact
      661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc
      721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc
      781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac
      841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag
      901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag
      961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt
     1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt
     1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc
     1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta
     1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc
     1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga
     1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag
     1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa
     1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact
     1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat
     1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta
     1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg
     1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc
     1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg
     1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt
     1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg
     1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact
     1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt
     2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca
     2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca
     2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt
     2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg
     2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca
     2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat
     2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg
     2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa
     2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta
     2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata
     2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat
     2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc
     2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact
     2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc
     2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc
     2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga
     3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga
     3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag
//
LOCUS       BIM127       8482 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Bovine immunodeficiency-like virus, complete proviral genome.
ACCESSION   M32690
KEYWORDS    complete genome.
SOURCE      Bovine immunodeficiency-like virus (isolate HXB3), proviral DNA,
            clone 127.
  ORGANISM  Bovine immunodeficiency-like virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 8482)
  AUTHORS   Garvey,K.J., Oberste,M.S., Elser,J.E., Braun,M.J. and Gonda,M.A.
  TITLE     Nucleotide sequence and genome organization of biologically active
            proviruses of the bovine immunodeficiency-like virus
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_review
COMMENT     Kindly submitted in computer-readable form prior to publication by
            Dr. M.A. Gonda, Frederick Cancer Research Facility, Frederick, MD.
            Clones 127 and 106 (see <BIV106>, Accession number M32691) were
            obtained from a single virus isolation; the latter, 106, has 8391
            nucleotides as the result of a sizeable deletion in env and two
            minor deletions in the LTR R regions. The Q and S orfs are named by
            analogy with similar orfs in visna; other short reading frames can
            be identified.
FEATURES       from  to/span     description
    pept        316     1746     gag polyprotein
    pept    <  1572     4742     pol polyprotein
    pept       4601     5197     protein Q (put.)
    pept       5228     5539     protein S (put.)
    pept       5415     8129     env polyprotein
BASE COUNT     2699 a   1798 c   2017 g   1968 t
ORIGIN      
        1 ttcagctcgt gtagctcatt agctccgagc tccccaacct acagcctgag aggcactggc
       61 tcggttgggt agccagcctt tcgggtaata aaggcttgtt ggcattcggc atctacccgt
      121 gcctcctgtc ttgtcttact cgagcgaacc cacaactccg tcctgctgag ctcacagctc
      181 gcggggcggt gaagaacacc caacagttgg cgcccaacgt ggggctcgag taagagagac
      241 tcggctcgag taaaagaaga cccagctcga acgagaagac tccggacagg tgagtagttg
      301 cgtgttttcc ccgggatgaa gagaagggag ttagaaaaga agcttcgtaa ggttagggtg
      361 acaccccaac aggataaata ttatactata gggaatcttc aatgggccat tagaatgata
      421 aatctaatgg ggatcaaatg tgtgtgtgac gaggagtgct cggcagcaga ggtagccctt
      481 atcataaccc aattttcagc tttagactta gaaaattctc ctatcagagg taaggaggag
      541 gtggccataa aaaatactct gaaggttttc tggtccctgc tggcggggta caaaccagag
      601 agtacagaaa cggccctagg atattgggag gcctttacat atagagaaag ggaggccaga
      661 gctgataagg aaggcgaaat taagagtatt tacccttccc taacacagaa cacacagaat
      721 aagaagcaga catcgaatca gacaaacact caatcattac cagctatcac tactcaagat
      781 ggtactccta ggtttgatcc tgacctcatg aagcagctta agatctggtc agacgccact
      841 gaaagaaatg gggttgacct tcatgcagtg aatatattag gggtcattac agcaaaccta
      901 gtacaggaag aaattaaact cctcttgaat agtacaccca agtggagatt agatgtacaa
      961 cttatagaat caaaagtaag agagaaagaa aatgcccaca gaacgtggaa acagcatcat
     1021 ccagaagccc caaaaacaga tgaaatcatc ggtaaggggc ttagttctgc tgaacaagcc
     1081 accctgatct cagtagaatg cagagaaact ttcagacagt gggtgctgca ggcagctatg
     1141 gaggtggcac aggcaaaaca tgctacccca ggtcccatca acattcatca gggacccaag
     1201 gagccgtaca cagactttat aaatagatta gtggcagccc ttgaaggtat ggcggctcca
     1261 gaaaccacaa aagaatactt actccaacat ctatctattg atcatgccaa tgaagactgc
     1321 cagtctattc taagaccttt gggacccaac accccaatgg agaaaaaatt agaagcatgt
     1381 agggtagtgg gatctcagaa atcaaagatg caatttttgg tagcagctat gaaagaaatg
     1441 gggatccaat caccaattcc agcagtcttg cctcacacac cagaagcata tgcctcccaa
     1501 acctcagggc ccgaggatgg taggagatgt tacggatgtg ggaagacagg acatttgaag
     1561 aggaattgta aacagcaaaa atgctaccat tgtggcaaac ctggccacca agcaagaaac
     1621 tgcaggtcaa aaaacgggaa gtgctcctct gccccttatg ggcagaggag ccaaccacag
     1681 aacaattttc accagagcaa catgagttct gtgaccccat ctgcaccccc tcttatatta
     1741 gattagacaa acagcctttt ataaaggtgt tcataggggg aagatgggta aaagggttag
     1801 tagacactgg agcagatgag gtagtgctta agaacataca ttgggatagg ataaaagggt
     1861 atccagggac accaattaaa caaattgggg taaatggagt aaatgtggcc aaaaggaaga
     1921 cccacgtaga gtggagattt aaggataaga ctgggataat tgatgtcttg ttctcagata
     1981 ctcctgtaaa cctttttggg agatctcttc tacgtagcat agtgacttgc ttcaccctac
     2041 ttgttcacac agaaaaaatc gaacccctac ccgtcaaggt aaggggacca gggcctaagg
     2101 taccccagtg gcccttgaca aaagaaaagt atcaggctct taaggaaatt gtgaaagatc
     2161 ttttagcaga aggaaaaatt tccgaagctg cttgggataa cccatataat accccagttt
     2221 ttgttataaa gaaaaaggga acgggaagat ggaggatgct aatggatttt agggaattaa
     2281 ataagataac agttaaagga caagaattct ctacaggctt accttaccct ccaggaatta
     2341 aggaatgtga acacttaact gcaatagata taaaagatgc ctactttact atccctttac
     2401 atgaggactt tagacccttt acagccttct ctgtagtccc tgtaaatcga gaaggaccta
     2461 tagagaggtt ccagtggaat gttctaccac aaggatgggt atgtagccct gccatttatc
     2521 agactaccac ccagaagatt atagaaaaca ttaaaaagag tcacccagat gtcatgttgt
     2581 atcaatatat ggatgatttg ttgattgggt ctaataggga tgatcataag caaatagtgc
     2641 aggaaatcag ggataagtta ggatcatatg gtttcaagac tccagatgaa aaggtccagg
     2701 aagagagagt gaaatggatc ggttttgagc tcacacccaa gaaatggcgt tttcagccca
     2761 ggcaactaaa gataaaaaac ccactcacag taaatgaatt acagcaatta gtaggtaatt
     2821 gtgtttgggt acagccagaa gtaaaaatcc ctctataccc cttaaccgat ctactgaggg
     2881 ataagaccaa tctccaagaa aagatacaac taacaccaga agccatcaag tgtgtagaag
     2941 aattcaatct aaaactaaaa gatccagaat ggaaagatag aataagagaa ggagcagaat
     3001 tagtcataaa aatacagatg gttcctcggg gcatagtatt tgatctgttg caagatggaa
     3061 atcccatatg gggaggagta aaaggactaa attatgatca ttcaaacaaa ataaaaaaga
     3121 tacttagaac tatgaatgag ctgaacagaa cagtggtaat tatgacagga agagaagcta
     3181 gtttcctgct tcctgggtct tctgaagatt gggaagcggc actccagaag gaagaaagtc
     3241 taacacaaat attcccagta aagttttata ggcactcctg cagatggacc tccatatgtg
     3301 ggccagtaag agaaaatcta accacctact atactgacgg agggaagaaa gggaaaacag
     3361 ctgcagcagt atattggtgt gaaggaagga ctaagtcaaa ggtatttcca ggaaccaatc
     3421 aacaggcgga attgaaggcc atatgcatgg ctctcttgga tggaccacca aaaatgaata
     3481 tcataacaga tagtagatac gcctatgagg gaatgagaga agaaccagaa acgtgggcca
     3541 gggaaggaat ctggctggag attgccaaga tattgccctt taagcagtac gtgggggtcg
     3601 ggtgggtgcc tgcacataaa gggataggag gaaatacaga ggcagatgaa ggagttaaga
     3661 aagccttaga acagatggcc ccgtgtagcc ctcctgaggc cattctatta aaaccaggag
     3721 aaaaacaaaa tctggagaca gggatctaca tgcaggggct tagaccacaa agcttcctcc
     3781 caagagcaga cttaccagta gccatcacag gaaccatggt agattcagag ctacagctac
     3841 agctacttaa cataggaact gagcatataa gaatccaaaa agatgaggtc ttcatgacct
     3901 gtttcctaga aaatatcccc tcagccactg aagatcatga gagatggcat acctcaccag
     3961 acattttggt taggcagttc catctcccta agagaatagc taaagagata gtagccagat
     4021 gccaagaatg taaaaggaca accactagcc cagtcagagg aacaaacccc agaggtcgat
     4081 tcttatggca gatggacaat actcactgga ataaaacaat tatttgggta gcagtagaga
     4141 caaattcagg attagtggaa gctcaggtga tccctgaaga aacagcacta caagtagctc
     4201 tctgcatttt acagctaatc cagagatata cagttcttca cttacatagt gacaacgggc
     4261 cgtgctttac tgcacacagg atagaaaatc tatgtaagta tctggggatc acaaaaacta
     4321 cgggaatacc ctacaaccca caatcccagg gagttgtaga aagagcccac agagatctaa
     4381 aagacagatt ggcagcttat cagggagatt gtgaaaccgt agaagcagcc cttagcctcg
     4441 cattagtttc tttaaataaa aaaagagggg gaataggggg ccatacacca tatgaaatat
     4501 acctagaatc agaacatacc aaataccaag accaactaga acaacaattt tcaaaacaaa
     4561 aaattgaaaa gtggtgttac gtaaggaaca gaagaaagga atggaaagga ccctacaaag
     4621 tgttgtggga cggagacggg gcagcagtaa tagaggaaga gggaaaaaca gccttatatc
     4681 cacaccgtca tatgcgcttc atcccccccc cagattcaga tatccaagat gggagttcgt
     4741 gaggcagaca gaatacagca tgaccgcgtg cgtaagaaaa gggaaattag tccttactta
     4801 ccagtacgcg atctggaaaa gagtctggac gatagaaaca ggatttacag atccaagtct
     4861 gtttatgacc ccagctggaa cacacaccac tgaagaaata ggtcacttag atctcttttg
     4921 gcttaggtac tgttcatgtc cgcatgagat gcccccgtgg ctagacttcc ttagaggcac
     4981 cctcaatcta cgcatttcct gtcgacgcgc tcttcaagcg tcagtgttga ctagcacccc
     5041 tagacactcc ctccaacgct tagctgcact tcagctgtgc actaacgcat gtctctgttg
     5101 gtacccgtta ggacgcatca acgacaccac cccgttgtgg ttgaactttt cgtctgggaa
     5161 ggaaccaacg atccaacaac tgagtggcca cccctaactc gtcgtaacat tcatagattg
     5221 tggcaatatg cccggacctt gggtggcgat gataatgttg ccacagccca aagaaagctt
     5281 tggaggaaag ccaattggct ggcttttctg gaacacgtgc aaaggaccta ggcgggactg
     5341 tccacattgt tgttgtccca tatgtagttg gcattgtcag ctttgctttt tgcagaaaaa
     5401 tctaggaatc aactatggat caggacctag acggcgcgga acgcggggaa aggggaggag
     5461 gatccgaaga actgcttcag gaggagatca acgaagggag gctgacagcc agagaagctt
     5521 tacaaacatg gatcaataac ggtgagatcc acccttgggt cctggcagga atgctgtcca
     5581 tgggagtagg aatgctacta ggagtatatt gtcagttacc agacacactg atttggatac
     5641 taatgtttca attatgcctt tattggggtt tgggtgaaac atctagagaa ttagacaagg
     5701 atagttggca gtgggtcaga agtgtattta taatagcaat attgggaact ctcactatgg
     5761 caggaactgc tttggccgac gacgatcaaa gtactttaat ccccaatatc acaaaaattc
     5821 ctacaaagga cacggaaccc ggttgcacct atccgtggat attaatcctc ttgattttgg
     5881 ctttcatact gggaattctg ggtataatac ttgtcttgag acgcagcaac tcggaggata
     5941 tattggcagc cagagatacc atagattggt ggctctcagc taatcaggaa atacctccaa
     6001 agtttgcttt cccaataata ttaatatctt cccctctagc aggcataata ggatattatg
     6061 tcatggaaag gcacttagag atcttcaaaa agggatgtca aatttgtggg agcctgagca
     6121 gcatgtgggg aatgcttttg gaagaaattg gcaggtggct cgcacgtagg gaatggaatg
     6181 ttagtagagt aatggttatc ctcttaatca gcttcagttg gggaatgtat gtcaataggg
     6241 taaatgcctc agggtcacat gtagccatgg tcaccagccc tccagggtac cgcatagtga
     6301 atgataccag ccaggcacct tggtattgct tctcctcggc accaatccca acgtgtagtt
     6361 cctctcagtg gggagacaaa tattttgagg agaaaataaa cgagacactg gtcaaacagg
     6421 tgtatgaaca ggccgcgaaa cattcgagag ccacatggat tgaacctgat ctattggagg
     6481 aagcagtcta tgagctagct ctgttatcag ctaatgacag tcgtcaggtg gtggtagaaa
     6541 atggtacaga cgtatgtagc tcacagaact cgagcacaaa caaaggccac ccaatgacgc
     6601 ttctaaagtt gagagggcag gtgtcagaaa cttggatagg gaattcctcc ctccagtttt
     6661 gtgtccagtg gccatatgtc ttggtaggtc ttaataatag tgatagtaat attagcttca
     6721 attcgggaga ttggatagca accaattgta tgcacccaat tacactaaat aaaagtgcac
     6781 aagatctagg aaaaaatttt ccgagactaa catttcttga cggacaactg tcccagttga
     6841 agaacacact gtgcggacat aacacaaact gtttgaaatt tggaaacaag tccttcagta
     6901 caaattccct aatactatgc caagacaacc ccatcggcaa cgacaccttt tatagcctaa
     6961 gtcattcctt ctcaaaacag gcctctgccc ggtggattct tgtaaaggtc cccagctatg
     7021 ggtttgtggt agtaaatgac acagatacac caccatccct ccgcatccga aagcctcgag
     7081 cagtcggact agcaatattc ctgcttgtgc tggctatcat ggccatcaca tcctccttgg
     7141 tggcagctac aacgctcgtg aaccagcaca cgacggctaa ggttgtggag agggttgtgc
     7201 aaaatgtgtc atatattgct caaacccagg accaattcac ccacctgttc aggaatataa
     7261 acaacagatt aaatgtccta caccatagag tttcatactt ggagtatgta gaggaaatca
     7321 gacaaaaaca agtattcttt ggttgcaaac ctcatggaag gtattgccac tttgactttg
     7381 gaccagagga agttggatgg aacaatagtt ggaatagcaa aacttggaat gatctacaag
     7441 atgagtatga taagatagaa gaaaaaatat taaaaattcg agtggactgg ctcaatagct
     7501 ccctgagtga cacacaggac acctttggcc tggagacctc tatttttgac catttagtgc
     7561 aattgtttga ttggacttct tggaaagact ggataaaaat cattatagta atcattgtac
     7621 tttggcttct gataaagatt ctcctaggta tgttaagaag ctgcgccaag gtcagccaga
     7681 attaccaaca tctcccggcg gaggaggagg acggggacac agagccagaa agctccccgg
     7741 cgagaggaga cccggcttct ggaagtctct acgagaattg gttgaacaaa ataggagaaa
     7801 gcaagaacga cgcctatcgg gtctggacag aagaatacaa cagcttgagg atcttgttcg
     7861 ccacatgtcg ctgggatctc ctgacccctc aactccttca gcttccgttc tttctgttaa
     7921 ccctcctgct caaactcctt tgggacatct tccgccacgc tcctatttta aacttaaaag
     7981 ggtggactgt ggggcagggt gggacctcag gacaacagca gcccccggac ttcccatatg
     8041 tgaattggac tggatccagg gaacaaaata acccagaagg gggattagac tctggggctt
     8101 ggtatgaagg cctgagaggt tctcagtaga ttgtaagtct tcggcgagac tgcatgtctg
     8161 cacgtagaca ggaaatgttt atcttctcag ctgattgtgg ttaggccgat tactggaaac
     8221 tagacaacct gattcattag tggttaagat tatgcataag tgctcgcaat gatgtagctg
     8281 cttacgcttg cttactccgc cctgaaacgc ctaccttaac acgcaacacg cccacctgta
     8341 agaatatata aaccatatct tcactctgta cttcagctcg tgtagctcat tagctccgag
     8401 ctccccaacc tacagcctga gaggcactgg ctcggttggg tagccagcct ttcgggtaat
     8461 aaaggcttgt tggcattcgg ca
//
LOCUS       CEAVCG       9180 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Caprine arthritis encephalitis virus, complete proviral genome.
ACCESSION   M33677
KEYWORDS    complete genome.
SOURCE      Caprine arthritis encephalitis virus (isolate Clements) proviral
            DNA, infectious molecular clone.
  ORGANISM  Caprine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9180)
  AUTHORS   Saltarelli,M., Querat,G., Konings,D.A.M., Vigne,R. and
            Clements,J.E.
  TITLE     Nucleotide sequence and transcriptional analysis of an infectious
            clone of CAEV
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Kindly provided in computer-readable form prior to publication by
            Dr. Janice Clements, Johns Hopkins University, Baltimore, MD., 10
            April 1990. The annotation shown below is tentative.
FEATURES       from  to/span     description
    pept        512     1858     gag polyprotein
    pept    <  1717     5046     pol polyprotein (NH2-terminus uncertain)
    pept       5006     5695     protein Q (put.)
    pept       5688     5951     protein S (put.)
    pept       6012     8837     env polyprotein
    ORF        8405     8794     ORF 4
BASE COUNT     3505 a   1466 c   2301 g   1908 t
ORIGIN      
        1 gagttctagg agagtccctc ctagtctctc ctctccgagg aggtaccgag acctcaaaat
       61 aaaggagtga ttgccttact gccgagtgga gagtgattac tgagcggccg gtgtatcggg
      121 agtcgtccct taatctgtgc aataccagag cggctctcgc agctggcgcc caacgtgggg
      181 cccgaggaga agaaaagaaa gcggccctga gaactcggct tctgaaaaag aggaagagga
      241 caagttgcta tagcaacaag agagaagaag tagagcaaag gtccagtggc tcggaaaaag
      301 aggaactgaa acttcgggga cgcctgaagg agtaaggtaa gtgactctgc tgtacgcggg
      361 gcgaggcaga ggtttccttc taaattgaaa gagaagtgtt gctgcgagag gtcttggtgg
      421 tcgagaatcc tgtacaaaaa aaaggaggga tctcggtcag gaccaggacc cctgggagta
      481 atacaacagc aacaccgtaa gaaaatccgc catggtgagt ctagatagag acatggcgag
      541 gcaagtctcc ggggggaaaa gagattatcc tgagctcgaa aaatgtatca agcatgcatg
      601 caagataaaa gttcgactca gaggggagca cttgacagaa ggaaattgtt tatggtgcct
      661 taaaacatta gattacatgt ttgaggacca taaagaggaa ccttggacaa aagtaaaatt
      721 taggacaata tggcagaagg tgaagaatct aactcctgag gagagtaaca aaaaagactt
      781 tatgtctttg caggccacat tagcgggtct aatgtgttgc caaatgggga tgagacctga
      841 gacattgcaa gatgcaatgg ctacagtaat catgaaagat gggttactgg aacaagagga
      901 aaagaaggaa gacaaaagag aaaaggaaga gagtgtcttc ccaatagtag tgcaagcagc
      961 aggagggaga agctggaaag cagtagattc tgtaatgttc cagcaactgc aaacagtagc
     1021 aatgcagcat ggcctcgtgt ctgaggactt tgaaaggcag ttggcatatt atgctactac
     1081 ctggacaagt aaagacatac tagaagtatt ggccatgatg cctggaaata gagctcaaaa
     1141 ggagttaatt caagggaaat taaatgaaga agcagaaagg tggagaagga ataatccacc
     1201 acctccagca ggaggaggat taacagtgga tcaaattatg ggggtaggac aaacaaatca
     1261 agcagcagca caagctaaca tggatcaggc aaggcaaata tgcctgcaat gggtaataaa
     1321 tgcattaaga gcagtaagac atatggcgca caggccaggg aatccaatgc tagtaaagca
     1381 aaaaacgaat gagccatatg aagattttgc agcaagactg ctagaagcaa tagatgcaga
     1441 gccagttaca cagcctataa aagattatct aaagctaaca ctatcttata caaatgcatc
     1501 agcagattgt cagaagcaaa tggatagaac actaggacaa agagtacaac aagctagtgt
     1561 agaagaaaaa atgcaagcat gtagagatgt gggatcagaa gggttcaaaa tgcaattgtt
     1621 agcacaagca ttaaggccag gaaaaggaaa agggaatgga cagccacaaa ggtgttacaa
     1681 ctgtggaaaa ccgggacatc aagcaaggca atgtagacaa ggaatcatat gtcacaactg
     1741 tggaaagaga ggacatatgc aaaaagaatg cagaggaaag agagacataa ggggaaaaca
     1801 gcagggaaac gggaggaggg ggatacgtgt ggtgccgtcc gctcctccta tggaataact
     1861 tcagcaccac ctatggttca ggtccgcata ggttcccagc agaggaactt gttatttgat
     1921 accggggcgg accgaactat agttagatgg catgagggct cgggaaaccc agccggaagg
     1981 ataaaactgc aaggaatagg aggaatagta gaaggagaaa aatggaataa tgtagaatta
     2041 gaatataaag gagaaacaag aaagggaaca atagtagtgt taccacaaag tccagtagaa
     2101 gtattaggac gagataacat ggcccgattt ggaataaaga taataatggc aaatttagag
     2161 gaaaaaagaa tcccaattac aaaagtaaaa ttgaaagagg gatgtacggg tccacatgtc
     2221 ccacaatggc cattaacaga agagaaatta aaaggtctaa cagaaatcat agataaatta
     2281 gtggaagaag gaaaactagg aaaggcaccc ccacattgga catgtaatac tccaatcttt
     2341 tgcataaaaa agaaatcagg gaagtggaga atgttaatag atttcagaga attgaacaaa
     2401 cagacagaag atttaacaga agcgcagtta ggactcccgc atccgggagg actacaaaag
     2461 aaaaaacatg ttacaatatt ggacatagga gatgcatatt ttactatacc cctatatgaa
     2521 ccatatcgag agtacacatg ttttactcta ttaagtccta ataatctagg accatgtaaa
     2581 agatactatt ggaaagtgct gccacaaggt tggaaattga gtccatctgt atatcaattt
     2641 actatgcagg agatcttaga ggattggata cagcagcatc cagaaattca atttggcata
     2701 tatatggatg atatttacat aggaagtgat ttagaaatta aaaagcatag agaaatagtg
     2761 aaagatttag ccaattatat tgcccaatat ggattcactc tgccagaaga gaagagacaa
     2821 aagggatatc cagcaaaatg gctaggattt gaactacacc cgcagacctg gaaatttcag
     2881 aagcatacat tacctgaatt aacaaaggga acaataacat taaataaatt acagaaatta
     2941 gtaggagaat tagtatggag acaatccata attgggaaaa gcattcctaa cattctgaaa
     3001 ttaatggaag gagatagaga attacaaagt gaaagaaaaa ttgaagaagt acatgtgaaa
     3061 gaatgggaag catgtaggaa aaaattagaa gaaatggaag gaaattatta taataaagac
     3121 aaagatgtct atggacaatt ggcttgggga gacaaagcta tagaatatat agtgtatcag
     3181 gagaaaggga aaccattatg ggtaaatgtg gttcacaata taaagaacct aagcatcccg
     3241 caacaggtta ttaaagcagc gcaaaaatta acccaagaag tcatcattag gacaggaaaa
     3301 ataccatgga tattgttgcc agggaaagaa gaagattgga gactagaatt gcaattaggg
     3361 aacatcacat ggatgccaaa attttggtcc tgttatcgag gacatacaag atggagaaaa
     3421 agaaatataa tagaagaagt agtagaaggg cctacatatt atacagatgg aggaaaaaag
     3481 aataaagtag gaagtctagg gttcatagta tcaacagggg aaaaatttag aaagcatgaa
     3541 gagggcacaa accagcaact agaattaaga gccatagagg aagctctaaa acaagggcct
     3601 caaacaatga atttagtaac agatagtaga tatgcatttg aatttttatt aagaaattgg
     3661 gatgaagaag taataaagaa tccaattcaa gcaagaatta tggaaattgc ccacaagaaa
     3721 gataggatag gagtgcattg ggtgccagga cataaaggga ttccccaaaa tgaagaaata
     3781 gacaaatata tttcggaaat atttcttgca aaagaaggag aaggaattct cccaaaaaga
     3841 gaagaggatg cagggtatga tttaatatgc ccagaagagg ttaccataga gccaggacaa
     3901 gtgaaatgca tccccataga gctaagatta aatttaaaga aatcacaatg ggctatgatt
     3961 gctacaaaaa gcagcatggc tgccaaagga gtgttcacac aaggaggaat catagactca
     4021 ggatatcagg gacaaataca ggtaataatg tataatagca ataaaatagc agtagtcata
     4081 ccccaaggga gaaaatttgc acaattaata ttaatggata aaaagcatgg aaaattggaa
     4141 ccctgggggg aaagcagaaa aacagaaagg ggagaaaaag gatttgggtc tacaggaatg
     4201 tattggatag aaaatattcc tctggcagag gaagaccaca caaaatggca tcaagatgcc
     4261 cgatcattgc atctagaatt tgaaattcca agaacagcag cagaagacat agtaaatcaa
     4321 tgtgaaatat gcaaagaagc gaggacacct gcagtaatta gaggcggaaa caaaaggggg
     4381 gtaaatcatt ggcaagtgga ttatacccat tatgaaaata tcatactatt agtatgggta
     4441 gaaacaaatt caggactaat atatgcagaa aaagtaaaag gagaatcagg gcaagaattc
     4501 agaataaaag tgatgcattg gtatgcatta tttggtccag agtcattgca gtcagacaat
     4561 ggacctgcat ttgcagcaga gcccacacag ctgttaatgc aatacctagg agtaaaacac
     4621 acaacaggca taccttggaa tccacagtct caggctatag tagaaagggc acatcaacta
     4681 ttgaaaagca ctttaaagaa gttccagcca caatttgtcg ctgtagaatc agccatagca
     4741 gcagccctag tcgccataaa tataaaaaga aagggtgggc tggggacaag ccctatggat
     4801 atttttatat ataataaaga acagaaaaga ataaataata aatataataa aaattctcaa
     4861 aaaattcaat tctgttatta cagaataagg aaaagaggac atcaggagag tggaaaggac
     4921 caacccaggt actgtggaaa ggggaaggag ccaattgtgg taaaggatat agaaagtgaa
     4981 aagtatttag taatacctta caaagatgca aaattcatcc cgccaccaac aaaagaaaag
     5041 gaataaaaaa cctggaccag aattaccctt agcactatgg atacatatag cagaaagcat
     5101 taatggggat agctcatggt acataacaat gagactgcaa cagatgatgt ggggaaaaag
     5161 aggaaataag ttacaatata agaatgaaga cagggaatat gaaaattggg aaattacatc
     5221 atggggatgg aaaatgcacc taaggagagt gaaacaatgg atacaagaca acaggagagg
     5281 aagcccatgg cagtacaaag taggaggaac atggaaaagt ataggagtgt ggttcctgca
     5341 agcaggagat tacagaaagg tagacaggca cttctggtgg gcatggagga tactgatatg
     5401 ttcctgcagg aaagaaaagt ttgatataag agaatttatg agaggaagac atagatggga
     5461 tttgtgcaaa tcctgtgctc aaggagaagt agtaaagcat actagaacaa aaagtctgga
     5521 aagactagta ctgctacaga tggtagaaca gcatgtgttt caagtattgc cattgtggag
     5581 agccaggaga agtagtacaa cagatttccc atggtgcagg gacacaacgg gatacacgca
     5641 tgcgtggtct gtccaggagt gctggttgat ggaatatctc ttagaggatg agtgaagaac
     5701 tgcctcaaag aagggagaca catccagaag aacttgtaag gaacgtacgg gaaagagaaa
     5761 gggatacatg gcaatggaca agcatcagag tacctgcgga aatactgcaa agatggcttg
     5821 ctatgcttag gtcaggcaga aatagaaaga aagtgtatag agaaatgcaa aaatggatgt
     5881 ggatacatcc caaggcgcct gtgattaggg cctgtggatg cagactatgt aacccggggt
     5941 ggggaacata atcaagggaa taataaatgc aaataaatgt aactaacaag tagcaaaagt
     6001 gtctgtgtta gatggatgct ggggccagat acatgcgctt aactgggaag gaaaactggg
     6061 ttgaagtaac catggacgga gagaaggaaa ggaaaagaga aggtttcact gcgggacagc
     6121 aaggtaagta tcaaccccag gtaagtaagc aaatagggaa cagaaatact aacccatgct
     6181 ttgcctataa agggatattc ctatggagga tatcactaac aatgtggata ttgctaggga
     6241 taaatatgtg tgtcagtgca gaggattaca taacactaat atcagatccc tatgggttct
     6301 cacccataaa aaatgtgtct ggggtaccag tgacttgtgt aacaaaagaa ttcgcaaaat
     6361 ggggatgtca accactagga gcgtaccctg atccagaaat agaatacaga aatgtgagtc
     6421 aggaagtagt gaaagaagta tatcaagaga attggccatg gaatacatat cattggcctc
     6481 tctggcaaat ggagaatgtt aggtactggt taaaagaaaa tatgcaagaa aatcaacaga
     6541 gaaaaaataa tacaaaagag ggtatagagg aattattagc aggaactata aggggaagat
     6601 tctgtgtacc atacccattt gccttgttaa aatgcacaaa gtggtgctgg tatacagcgg
     6661 ccataaacaa cgagtcagga aaagcaggaa aaataaaaat aaattgcaca gaagcaagag
     6721 cagtctcctg tacagaggac atgccattag cctcaataca aagagcatat tgggatgaga
     6781 aagacagaga gagcatggcc tttatgaata tcaaagcatg tgatagcaac ctaaggtgtc
     6841 agaaaagacc tggagggtgt atggaaggat accctatccc agtaggagca gaaataatcc
     6901 ctgaaagtat gaaataccta aggggagcaa agagtcagta tgggggaata aaagataaga
     6961 atggagaatt aaaattacca ttaacattaa gagtgtgggt aaaattagca aatgtgtcag
     7021 aatgggtaaa tgggacaccc ccggattggc aagacagaat taacggatcc aaaggaataa
     7081 atgggacgct ctggggagag cttaacagta tgcatcacct aggatttgcc cttagccaga
     7141 acggcaaatg gtgtaactac accggggaaa taaaattagg gcaagaaaca ttccaatatc
     7201 attacaagcc aaactggaac tgtaccggga attggacgca atatccggtg tggcaagtga
     7261 ttagaaacct ggatatggtg gaacatatga caggagaatg tgtgcagaga ccacaaaggc
     7321 acaatataac agtaggaaat ggaaccataa cagggaattg cagtacaaca aactgggatg
     7381 gatgtaattg ctcacgatca ggaaactacc tatataacag ctctgaggga ggattgttat
     7441 taattctgtg cagacaaaac agcaccctaa caaggatcct gggaacaaat acaaattgga
     7501 caactatgtg gggaatatac aaaaattgtt caggatgcga gaatgcaaca ttagacaaca
     7561 caggagaagg aaccttagga ggtgtagcta ataagaactg tagcttgcct cataaaaatg
     7621 agagcaacaa gtggacttgt gccccaagac aaagagatgg aaaaacagat tcgctataca
     7681 tagcaggagg aaaaaagttt tggacacgaa ttaaggccca attcagctgt gaaagtaaca
     7741 taggacaatt agatggaatg ttgcatcagc aaatactatt gcaaaaatat caagtaatta
     7801 aggtaagagc ttatacatat ggggtgatag aaatgccaga aaactatgca aaaacaagaa
     7861 tcataaacag gaaaaaaaga gaactcagcc acaagaggaa gaagagaggc gttggcttgg
     7921 tcattatgct agttatcatg gcaatagtag ctgccgcagg ggcttctctg ggagtcgcaa
     7981 acgcgattca gcagtcttac actaaggcag ctgtccagac ccttgctaat gcaactgctg
     8041 cacagcagga tgtgttagag gcaacctatg ccatggtaca gcatgtggct aaaggcgtac
     8101 gaatcttgga agctcgagtg gaagctatca cagatagaat aatgctatac caagaattgg
     8161 attgttggca ctatcatcaa tactgtataa cctctacaaa aacagaagta gcaaaatata
     8221 tcaattggac gaggtttaag gataattgca catggcagca gtgggagaga ggattacagg
     8281 ggtatgatac aaacttaaca atactgttaa aggaatcagc agcaatgaca caactagcag
     8341 aagagcaagc aaggaggata ccagaagtat gggaaagttt aaaagacgtc tttgattggt
     8401 caggatggtt ctcatggcta aagtatattc ctattatagt agtaggatta ttaggatgca
     8461 ttctgataag agctgtgata tgtgtatgtc aacctcttgt gcagatatac agaactctaa
     8521 gtaccccgac ataccaacgg gtcacagtca tcatggaaac aagagcagac gtcgcaggag
     8581 aaaatcagga ttttggcgat ggcttagagg aatcagacaa cagcgaaaca agcgaaagag
     8641 tgacagtaca gaaagcttgg agccgtgcct gggagctttg gcagaactca ccctggaagg
     8701 agccatggaa aaggggcctg ctgaggctgc tcgtccttcc gctgacgatg ggaatctgga
     8761 taaatggatg gcttggagaa caccacaaaa ataaaaaaag aaagggtgac tgtgagacat
     8821 gggctaaaga ggactaataa caagctaggc caaattcctg taaatcactt ggggggttat
     8881 aagaaaagca agttcactat gacaaagcaa aatgtaaagg ccaaattcct gtaaatcact
     8941 tggggggtta taagaaaagc aagttcacta tgacaaagca aaatgtaacc gcaagtgctg
     9001 acagatgtaa cagctgacat atcagctgat gcttgctcat gctgacactg tagctctgag
     9061 ctgtatataa ggagaagctt gctgcttgca cttcagagtt ctaggagagt ccctcctagt
     9121 ctctcctctc cgaggaggta ccgagacctc aaaataaagg agtgattgcc ttactgccga
//
LOCUS       CEAVLTRM      522 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Caprine arthritis encephalitis virus proviral long terminal repeat.
ACCESSION   M14149
KEYWORDS    long terminal repeat.
SOURCE      Caprine arthritis encephalitis virus (CAEV) unintegrated proviral
            DNA.
  ORGANISM  Caprine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 522)
  AUTHORS   Hess,J.L., Pyper,J.M. and Clements,J.E.
  TITLE     Nucleotide sequence and transcriptional activity of the caprine
            arthritis-encephalitis virus long terminal repeat
  JOURNAL   J. Virol. 60, 385-393 (1986)
  STANDARD  full staff_review
COMMENT     [1] presents the LTR for CAEV in a fashion that includes sequence
            data downstream of the 5' LTR (bases 499-522) and sequence data
            upstream of the 3' LTR (bases 1-48).  For this reason, two
            "pre-msg" entries are found in the FEATURES table, one for the
            5'end of the unspliced transcript and the other for the 3' end of
            the unspliced transcript.  Similarly, both the TATA box and the
            polyadenylation signal are annotated.  The CAEV LTR and the Visna
            LTR are approximately 50% homologous.  [1] did not find a
            significant level of homology of CAEV LTR with any other retroviral
            LTR.  [1] showed that, unlike Visna and HIV, CAEV LTR is not
            transactivated.  Plasmids constructed with CAEV LTR and CAT showed
            no evidence of transactivated up-regulation in four cell types
            (SCP, GSM, SAM and primary goat macrophages).  This may explain
            CAEV's lack of cytotoxicity.
FEATURES       from  to/span     description
    pre-msg <     1      420     genomic mRNA, 3' end (see COMMENT)
    pre-msg     336  >   522     genomic mRNA, 5' end (see COMMENT)
    site        336      336     mRNA cap site
    LTR          49      498     long terminal repeat
    rpt          87      157     71 bp direct repeat, 5' copy
    rpt         158      228     71 bp direct repeat, 3' copy
    rpt         336      420     R repeat
    binding     499      515     Lys-tRNA primer binding site
    signal      306      310     TATA box
    signal      393      398     poly-A signal
BASE COUNT      163 a    108 c    137 g    114 t
ORIGIN      84 bp upstream of AluI site.
        1 aaatggatgg cttggagaac accacaaaaa taaaaaaaga aagggtggac tgtgagacat
       61 gggctaaaga ggactaataa caagctaggc caaattcctg taaatcactt ggggggttat
      121 aagaaaagca agttcactat gacaaagcaa aatgtaaagg ccaaattcct gtaaatcact
      181 tggggggtta taagaaaagc aagttcacta tgacaaagca aaatgtaacc gcaagtgctg
      241 acagatgtaa cagctgacat atcagctgat gcttgctcat gctgacactg tagctctgag
      301 ctgtatataa ggagaagctt gctgcttgca cttcagagtt ctaggagagt ccctcctagt
      361 ctctcctctc cgaggaggta ccgagacctc aaaataaagg agtgattgcc ttactgccga
      421 gtggagagtg attactgagc ggccggtgta tcgggagtcg tcccttaatc tgtgcaatac
      481 cagagcggct ctcgcagctg gcgcccaacg tggggcccga gg
//
LOCUS       CEAVPOL       651 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Caprine arthritis encephalitis virus proviral pol gene, partial
            cds.
ACCESSION   K03327
KEYWORDS    .
SOURCE      Caprine arthritis encephalitis virus proviral DNA.
  ORGANISM  Caprine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Chiu,I.-M., Yaniv,A., Dahlberg,J.E., Gazit,A., Skuatz,S.F.,
            Tronick,S.R. and Aaronson,A.
  TITLE     Nucleotide sequence evidence for relationship of AIDS retrovirus to
            lentiviruses
  JOURNAL   Nature 317, 366-368 (1985)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 651)
  AUTHORS   Chiu,I.-M.
  JOURNAL   Unpublished (1985) Ohio State U. Medical Sch., Columbus, Ohio
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            I.-M.Chiu, 09-OCT-1985.
FEATURES       from  to/span     description
    pept    <     1  >   651     pol protein (reverse transcriptase; AA at 1)
BASE COUNT      269 a    102 c    122 g    158 t
ORIGIN      434 bp upstream of BglII site
        1 gtcccacaat ggccattgac agaagaaaaa ttaaaaggtc tatcagaaat agtagataaa
       61 ttagtaaaaa aaaaaaaact aggaaaggca cccccacatt ggacatgtaa tactccaata
      121 ttttgcataa aaaagaaatc aggaaaatgg aaaatgttaa tagatttcag aaaattaaac
      181 aaacaaacag aggatttaac agaagcgcaa ttaggacttc cgcatccggg gggactacaa
      241 aagaaaaaac atgttacaat attagacata ggagatgcat attttactat acccctatat
      301 aaaccatatc gagagtacac ttgctttact ctattaagtc ctaataatct aggaccatgt
      361 aaaagatatt attggaaggt tctaccacaa ggttggaaac tgagtccatc tgtatatcaa
      421 tttaccatgc aggagatctt aggggaatgg atacaggagc atcctgaaat tcaatttaga
      481 atatatatgg atgatatcta cataagaagt gatttagaaa taaaaaagca tagagagata
      541 gtggaggaat tagccaatta tattgcccaa tatagattca ctctgccaga agagaaaagg
      601 caagagagat atccagcaaa atggttaggg tatgaactac atcctcaggc c
//
LOCUS       EAEVWU5      2580 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Equine arthritis encephalitis virus, envelope gene, complete cds.
ACCESSION   X16988
KEYWORDS    envelope protein.
SOURCE      Equine arthritis encephalitis virus, clone pWU5.
  ORGANISM  Equine arthritis encephalitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 2580)
  AUTHORS   McGuire,T.C., Lacy,P.A. and O'Rourke,K.
  TITLE     cDNA sequence of the env gene of a pathogenic equine infectious
            anemia lentivirus variant
  JOURNAL   Nucleic Acids Res. 18, 196 (1990)
  STANDARD  full staff_review
COMMENT     Kindly submitted in computer-readable form by K. O'Rourke,
            Washington State University, Pullman WA.
FEATURES       from  to/span     description
    pept          1     2580     env polyprotein
BASE COUNT      891 a    428 c    565 g    696 t
ORIGIN      Start of env cds.
        1 atggtcagca tcgcattcta tggggggatc ccagggggaa tctcaacccc tattacccaa
       61 cagtcagaaa aatccaagta tgaggaaaac acaatgtttc aaccttattg ttataataat
      121 gacagtaaga acagcatggc agaatcgaag gaagcaagag accaagaaat gaacctgaaa
      181 gaagaatcta aagaagaaaa aagaagaaat gactggtgga aaataggtat gtttctgtta
      241 tgcttagcag gaactactgg aggaatactt tggtggtatg aaggactccc acagcaacat
      301 tatatagggt tggtagcgat agggggaaga ttaaacggat ctggccaatc aaatgctata
      361 gaatgctggg gttccttccc ggggtgtaga ccatttcaaa attacttcag ttatgagacc
      421 aatagaagca tgcatatgga taataatact gctacattat tagaagctta tcatagagag
      481 ataacattca tttataagtc ttcttgcaca gatagtgatc attgtcaaga gtatcaatgt
      541 aaaaaagtta atcttaattc ctctgactcc tctaactctg tacgtgttga ggatgtaacg
      601 aacacagcgg aatattgggg atttaaatgg ctagaatgta atcaaacaga aaattttaag
      661 actatattag tacctgaaaa tgaaatggta aatatcaatg atactgatac ctggatacct
      721 aaggggtgta atgagacgtg ggcaagagtg aaacgttgtc ctatagatat tttatatggg
      781 atacatccaa tcaggctgtg tgtacagcca ccattttttc tggtacagga gaaagggatt
      841 gctgatactt ctagaattgg caattgtggg ccaacaatat ttcttggggt tttagaagat
      901 aataagggag tagtacgggg ggattataca gcctgcaatg tgagtcgcct aaatataaat
      961 agaaaggatt atacagggat ctatcaagta cctatatttt atacatgtac tttcactaac
     1021 ataacttcct gcaataatga gccaataatc agtgttatca tgtatgaaac aaaccaggta
     1081 caatatttat tgtgtaataa taataatagt aataattata attgtgtagt acaaagtttt
     1141 ggagttatag gacaggcaca cttagaactg cctagaccta acaaaagaat aaggaatcaa
     1201 agctttaacc aatataactg ctctataaat aacaaaacag aattagaaac atggaagtta
     1261 gtaaagactt ctggcataac tcctttacct atttcttctg aagctaacac tggactaatt
     1321 agacataaga gagattttgg tataagtgca atagtggcag ctattgtagc cgctactgct
     1381 attgctgcta gcgctactat gtcttatgtt gctctaactg aggttaacaa aataatggaa
     1441 gtacaaaatc atacttttga ggtagaaaat agtactctaa atggtatgga tttaatagaa
     1501 cgacaaataa agatattata tgctatgatt cttcaaacac atgcagatgt tcaactgtta
     1561 aaggaaagac aacaggtaga ggagacattt aatttaattg gatgtataga aagaacacat
     1621 gtattttgtc atactggtca tccctggaat atgtcatggg gacatttaaa tgagtcaaca
     1681 caatgggatg actgggtaag caaaatggaa gatttaaatc aagagatact aactacactt
     1741 catggagcca ggaacaattt ggcacaatcc atgataacat tcaatacacc agatagtata
     1801 gctcaatttg gaaaagacct ttggagtcat attggaaatt ggattcctgg attgggagct
     1861 tccattataa aatatatagt gatgtttttg cttatttatt tgttactaac ctcttcgcct
     1921 aagatcctca gggccctctg gaaagtgacc agtggtgcag ggtcctccgg cagtcgttac
     1981 ctgaagaaaa aattccatca caaacatgca tcacgagaag acacctggga ccaggcccaa
     2041 cacaacatac acctagcagg cgtgaccggt ggatcagggg acaaatacta caagcagaag
     2101 tactccagga acgactggaa tggagaatca gaggagtaca acaggcggcc aaagagctgg
     2161 gtgaagtcaa tcgaggcatt tggagagagc tatatttccg agaagaccaa aggggagatt
     2221 tctcagcctg gggcggctat caacgagcac aagaacggct ctggggggaa caatcctcac
     2281 caagggtcct tagacctgga gattcgaagc gaaggaggaa acatttatga ctgttgcatt
     2341 aaagcccaag aaggaactct cgctatccct tgctgtggat ttcccttatg gctattttgg
     2401 ggactagtaa ttatagtagg acgcatagca ggctatggat tacgtggact cgctgttata
     2461 ataaggattt gtactagagg cttaaatttg atatttgaaa taatcagaaa aatgcttgat
     2521 tatattggaa gagctttaaa tcctggcaca tctcatgtat caatgcctca gtatgtttag
//
LOCUS       EIAV         8344 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Equine infectious anemia virus (EIAV), complete proviral genome.
ACCESSION   M16575 M11337 K03334 M14855
KEYWORDS    complete genome.
SOURCE      Equine infectious anemia virus proviral DNA, clones 1369 and 409-2
            [1].
  ORGANISM  Equine infectious anemia virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 8229)
  AUTHORS   Kawakami,T., Sherman,L., Dahlberg,J., Gazit,A., Yaniv,A.,
            Tronick,S.R. and Aaronson,S.A.
  TITLE     Nucleotide sequence analysis of equine infectious anemia virus
            proviral DNA
  JOURNAL   Virology 158, 300-312 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 324 to 5122)
  AUTHORS   Stephens,R.M., Casey,J.W. and Rice,N.R.
  TITLE     Equine infectious anemia virus gag and pol genes: Relatedness to
            visna and AIDS virus
  JOURNAL   Science 231, 589-594 (1986)
  STANDARD  full staff_review
REFERENCE   3  (bases 2300 to 2950)
  AUTHORS   Chiu,I.-M.
  JOURNAL   Unpublished (1985) Meloy Labs Inc, Springfield, VA 22151
  STANDARD  full staff_review
REFERENCE   4  (sites; pept)
  AUTHORS   Chiu,I.-M., Yaniv,A., Dahlberg,J.E., Gazit,A., Skuatz,S.F.,
            Tronick,S.R. and Aaronson,A.
  TITLE     Nucleotide sequence evidence for relationship of AIDS retrovirus to
            lentiviruses
  JOURNAL   Nature 317, 366-368 (1985)
  STANDARD  full staff_review
REFERENCE   5  (bases 4769 to 8048; 8069 to 8344)
  AUTHORS   Payne,S.L.
  JOURNAL   Unpublished (1987) Dept Biochem, Louisiana St Univ, Baton Rouge, LA
  STANDARD  full staff_review
REFERENCE   6  (bases 4769 to 8048; 8069 to 8228)
  AUTHORS   Rushlow,K., Olsen,K., Stiegler,G., Payne,S.L., Montelaro,R.C. and
            Issel,C.J.
  TITLE     Lentivirus genomic organization: The complete nucleotide sequence
            of the env gene region of equine infectious anemia virus
  JOURNAL   Virology 155, 309-321 (1986)
  STANDARD  full staff_review
REFERENCE   7  (sites; correction of 5348 [1] personal communication)
  AUTHORS   Tronick,S.R. and Fuller,F.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Correction of [1] by [7] leads to agreement with [6] regarding the
            env cds. The sequence starting with coordinate 8230 is horse
            cellular DNA.
FEATURES       from  to/span     description
    pept        465     1925     gag polyprotein (put.)
    pept    <  1682     5122     pol polyprotein (AA at 1682; NH2-terminus
                                 uncertain)
    pept       5312     7891     envelope polyprotein
    pept    <  7234     7641     3' ORF protein (AA at 7233; NH2-terminus
                                 uncertain)
    pre-msg     209     8193     EIAV genomic RNA
    cell       8230     8344     horse cellular DNA
    binding     324      342     primer (Lys-tRNA) binding site
    variant     342      342     c in [1], t in [2]
    variant    2341     2341     t in [1], c in [2],[3]
    variant    2349     2349     t in [1], c in [2]
    variant    2436     2436     g in [1], a in [3]
    variant    2677     2677     t in [1], a in [2]
    variant    2865     2870     tggaag in [1], agtaaa in [2]
    variant    2870     2870     g in [1], a in [3]
    variant    3777     3777     g in [1], a in [2]
    variant    3858     3858     g in [1], a in [2]
    variant    4529     4530     ac in [1], ca in [2]
    variant    4573     4573     a in [1], g in [2]
    revision   5348     5348     gg in [6],[7]; [g] in [1]
    variant    6348     6348     a in [1], g in [5],[6]
    variant    6350     6350     g in [1], a in [5],[6]
    variant    7060     7060     c in [1], t in [5],[6]
    variant    8048     8069     gagttccgcatttgtgacgcgt in [1], gt in [5],[6]
    variant    8073     8073     g in [1], c in [5],[6]
    variant    8098     8098     a in [1], g in [5],[6]
    variant    8104     8104     g in [1], a in [5],[6]
    variant    8118     8119     ga in [1], gca in [5],[6]
    mRNA        210     8193     EIAV genomic mRNA
BASE COUNT     2984 a   1358 c   1846 g   2156 t
ORIGIN      
        1 tgtggggttt ttatgagggg ttttataaat gattataaga gtaaaaagaa agttgctgat
       61 gctctcataa ccttgtataa cccaaaggac tagctcatgt tgctaggcaa ctaaaccgca
      121 ataaccgcat ttgtgacgcg agttccgcat ttgtgacgcg ttaagttcct gtttttacag
      181 tatataagta cttgtgttct gacaattggg actcagattc tgcggtctga gtcccttctc
      241 tgctgggctg aaaaggcctt tgtaataaat ataattctct actcagtccc tgtctctagt
      301 ttgtctgttc gagatcctac agttggcgcc cgaacaggga cctgagaggg gcgcagaccc
      361 tacctgttga acctggctga tcgtaggatc cccgggacag cagaggagaa cttacagaag
      421 tcttctggag gtgttcctgg ccagaacaca ggaggacagg taagatggga gaccctttga
      481 catggagcaa ggcgctcaag aagttagaga aggtgacggt acaagggtct cagaaattaa
      541 ctactggtaa ctgtaattgg gcgctaagtc tagtagactt atttcatgat accaactttg
      601 taaaagaaaa ggactggcag ctgagggatg tcattccatt gctggaagat gtaactcaga
      661 cgctgtcagg acaagaaaga gaggcctttg aaagaacatg gtgggcaatt tctgctgtaa
      721 agatgggcct ccagattaat aatgtagtag atggaaaggc atcattccag ctcctaagag
      781 cgaaatatga aaagaagact gctaataaaa agcagtctga gccctctgaa gaatatccaa
      841 tcatgataga tggggctgga aacagaaatt ttagacctct aacacctaga ggatatacta
      901 cttgggtgaa taccatacag acaaatggtc tattaaatga agctagtcaa aacttatttg
      961 ggatattatc agtagactgt acttctgaag aaatgaatgc atttttggat gtggtacctg
     1021 gccaggcagg acaaaagcag atattacttg atgcaattga taaaatagca gatgattggg
     1081 ataatagaca tccattaccg aatgctccac tggtggcacc accacaaggg cctattccca
     1141 tgacagcaag gtttattaga ggtttaggag tacctagaga aagacagatg gagcctgctt
     1201 ttgatcagtt taggcagaca tatagacaat ggataataga agccatgtca gaaggcatca
     1261 aagtgatgat tggaaaacct aaagctcaaa atattaggca aggagctaag gaaccttacc
     1321 cagaatttgt agacagacta ttatcccaaa taaaaagtga gggacatcca caagagattt
     1381 caaaattctt gactgataca ctgactattc agaacgcaaa tgaggaatgt agaaatgcta
     1441 tgagacattt aagaccagag gatacattag aagagaaaat gtatgcttgc agagacattg
     1501 gaactacaaa acaaaagatg atgttattgg caaaagcact tcagactggt cttgcgggcc
     1561 catttaaagg tggagccttg aaaggagggc cactaaaggc agcacaaaca tgttataact
     1621 gtgggaagcc aggacattta tctagtcaat gtagagcacc taaagtctgt tttaaatgta
     1681 aacagcctgg acatttctca aagcaatgca gaagtgttcc aaaaaacggg aagcaagggg
     1741 ctcaagggag gccccagaaa caaactttcc cgatacaaca gaagagtcag cacaacaaat
     1801 ctgttgtaca agagactcct cagactcaaa atctgtaccc agatctgagc gaaataaaaa
     1861 aggaatacaa tgtcaaggag aaggatcaag tagaggatct caacctggac agtttgtggg
     1921 agtaacatat aatctagaga aaaggcctac tacaatagta ttaattaatg atactccctt
     1981 aaatgtactg ttagacacag gagcagatac ttcagtgttg actactgcac attataatag
     2041 gttaaaatat agagggagaa aatatcaagg gacgggaata ataggagtgg gaggaaatgt
     2101 ggaaacattt tctacgcctg tgactataaa gaaaaagggt agacacatta agacaagaat
     2161 gctagtggca gatattccag tgactatttt gggacgagat attcttcagg acttaggtgc
     2221 aaaattggtt ttggcacagc tctccaagga aataaaattt agaaaaatag agttaaaaga
     2281 gggcacaatg gggccaaaaa ttcctcaatg gccactcact aaggagaaac tagaaggggc
     2341 taaagagata gtccaaagac tattgtcaga gggaaaaata tcagaagcta gtgacaataa
     2401 tccttataat tcacccatat ttgtaataaa aaagaggtct ggcaaatgga ggttattaca
     2461 agatctgaga gaattaaaca aaacagtaca agtaggaacg gaaatatcca gaggattgcc
     2521 tcacccggga ggattaatta aatgtaaaca catgactgta ttagatattg gagatgcata
     2581 tttcactata cccttagatc cagagtttag accatataca gctttcacta ttccctccat
     2641 taatcatcaa gaaccagata aaagatatgt gtggaattgt ttaccacaag gattcgtgtt
     2701 gagcccatat atatatcaga aaacattaca ggaaatttta caacctttta gggaaagata
     2761 tcctgaagta caattgtatc aatatatgga tgatttgttc gtgggaagta atggttctaa
     2821 aaaacaacac aaagagttaa tcatagaatt aagggcaatc ttactggaag agggttttga
     2881 gacaccagat gataaattac aagaagtgcc accttatagc tggctaggtt atcaactttg
     2941 tcctgaaaat tggaaagtac aaaaaatgca attagacatg gtaaagaatc caacccttaa
     3001 tgatgtgcaa aaattaatgg ggaatataac atggatgagc tcaggggtcc cagggttgac
     3061 agtaaaacac atagcagcta ctactaaggg atgtttagag ttgaatcaaa aagtaatttg
     3121 gacggaagag gcacaaaaag agttagaaga aaataatgag aagattaaaa atgctcaagg
     3181 gttacaatat tataatccag aagaagaaat gttatgtgag gttgaaatta caaaaaatta
     3241 tgaggcaact tatgttataa aacaatcaca aggaatccta tgggcaggta aaaagattat
     3301 gaaggctaat aagggatggt caacagtaaa aaatttaatg ttactgttgc aacatgtggc
     3361 aacagaaagt attactagag taggaaaatg tccaacgttt aaggtaccat ttaccaaaga
     3421 gcaagtaatg tgggaaatgc aaaaaggatg gtattattct tggctcccag aaatagtata
     3481 tacacatcaa gtagttcatg atgattggag aatgaaattg gtagaagaac ctacatcagg
     3541 aataacaata tacactgatg ggggaaaaca aaatggagaa ggaatagcag cttatgtgac
     3601 cagtaatggg agaactaaac agaaaaggtt aggacctgtc actcatcaag ttgctgaaag
     3661 aatggcaata caaatggcat tagaggatac cagagataaa caagtaaata tagtaactga
     3721 tagttattat tgttggaaaa atattacaga aggattaggt ttagaaggac cacaaagtcc
     3781 ttggtggcct ataatacaaa atatacgaga aaaagagata gtttattttg cttgggtacc
     3841 tggtcacaaa gggatatgtg gtaatcaatt ggcagatgaa gccgcaaaaa taaaagaaga
     3901 aatcatgcta gcataccaag gcacacaaat taaagagaaa agagatgaag atgcagggtt
     3961 tgacttatgt gttccttatg acatcatgat acctgtatct gacacaaaaa tcatacccac
     4021 agatgtaaaa attcaagttc ctcctaatag ctttggatgg gtcactggga aatcatcaat
     4081 ggcaaaacag gggttattaa ttaatggagg aataattgat gaaggatata caggagaaat
     4141 acaagtgata tgtactaata ttggaaaaag taatattaaa ttaatagagg gacaaaaatt
     4201 tgcacaatta attatactac agcatcactc aaattccaga cagccttggg atgaaaataa
     4261 aatatctcag agaggggata aaggatttgg aagtacagga gtattctggg tagaaaatat
     4321 tcaggaagca caagatgaac atgagaattg gcatacatca ccaaagatat tggcaagaaa
     4381 ttataagata ccattgactg tagcaaaaca gataactcaa gaatgtcctc attgcactaa
     4441 gcaaggatca ggacctgcag gttgtgtcat gagatctcct aatcattggc aggcagattg
     4501 cacacatttg gacaataaga taatattgac ttttgtagag tcaaattcag gatacataca
     4561 tgctacatta ttatcaaaag aaaatgcatt atgtacttca ttggctattt tagaatgggc
     4621 aagattgttt tcaccaaagt ccttacacac agataacggc actaattttg tggcagaacc
     4681 agttgtaaat ttgttgaagt tcctaaagat agcacatacc acaggaatac catatcatcc
     4741 agaaagtcag ggtattgtag aaagggcaaa taggaccttg aaagagaaga ttcaaagtca
     4801 tagagacaac actcaaacac tggaggcagc tttacaactt gctctcatta cttgtaacaa
     4861 agggagggaa agtatgggag gacagacacc atgggaagta tttatcacta atcaagcaca
     4921 agtaatacat gagaaacttt tactacagca agcacaatcc tccaaaaaat tttgttttta
     4981 caaaatccct ggtgaacatg attggaaggg acctactagg gtgctgtgga agggtgatgg
     5041 tgcagtagta gttaatgatg aaggaaaggg aataattgct gtaccattaa ccaggactaa
     5101 gttactaata aagccaaatt gagtattgtt gcaggaagca agacccaact accattgtca
     5161 gctgtgtttc ctgaggtctc taggaattga ttacctcgat gcttcattaa ggaagaagaa
     5221 taaacaaaga ctgaaggcaa tccaacaagg aagacaacct caatatttgt tataaggttt
     5281 gatatatggg attatttggt aaaggggtaa catggtcagc atcgcattct atggggggat
     5341 cccaggggga atctcaaccc ctattaccca acagtcagaa aaatctaagt gtgaggagaa
     5401 cacaatgttt caaccttatt gttataataa tgacagtaag aacagcatgg cagaatcgaa
     5461 ggaagcaaga gaccaagaaa tgaacctgaa agaagaatct aaagaagaaa aaagaagaaa
     5521 tgactggtgg aaaataggta tgtttctgtt atgcttagca ggaactactg gaggaatact
     5581 ttggtggtat gaaggactcc cacagcaaca ttatataggg ttggtggcga tagggggaag
     5641 attaaacgga tctggccaat caaatgctat agaatgctgg ggttccttcc cggggtgtag
     5701 accatttcaa aattacttca gttatgagac caatagaagc atgcatatgg ataataatac
     5761 tgctacatta ttagaagctt atcatagaga gataacattc atttataagt cttcttgcac
     5821 agatagtgat cattgtcaag agtatcaatg taaaaaagtt aatcttaatt cctctgactc
     5881 ctctaactct gtacgtgttg aggatgtaac gaacacagcg gaatattggg gatttaaatg
     5941 gctagaatgt aatcaaacag aaaattttaa gactatatta gtacctgaaa atgaaatggt
     6001 aaatatcaat gatactgata cctggatacc taaggggtgt aatgagacgt gggcaagagt
     6061 gaaacgttgt cctatagata ttttatatgg gatacatcca atcaggctgt gtgtacagcc
     6121 accatttttt ctggtacagg agaaagggat tgctgatact tctagaattg gcaattgtgg
     6181 gccaacaata tttcttgggg ttttagaaga taataaggga gtagtacggg gggattatac
     6241 agcttgcaat gtgcgtcgcc taaatataaa tagaaaggat tatacaggga tctatcaagt
     6301 acctatattt tatacatgta ctttcactaa cataacttcc tgcaataatg agccaataat
     6361 cagtgttatc atgtatgaaa caaaccaggt acaatattta ttgtgtaata ataataatag
     6421 taataattat aattgtgtag tacaaagttt tggagttata ggacaggcac acttagaact
     6481 gcctagacct aacaaaagaa taaggaacca aagctttaac caatataact gctctataaa
     6541 taacaaaaca gaattagaaa catggaagtt agtaaagact tctggcgtaa ctcctttacc
     6601 tatttcttct gaagctaaca ctggactaat tagacataag agagattttg gtataagtgc
     6661 aatagtggca gctattgtag ccgctactgc tattgctgct agcgctacta tgtcttatgt
     6721 tgctctaact gaggttaaca aaataatgga agtacaaaat catacttttg aggtagaaaa
     6781 tagtactcta aatggtatgg atttaataga acgacaaata aagatattat atgctatgat
     6841 tcttcaaaca catgcagatg ttcaactgtt aaaggaaaga caacaggtag aggagacatt
     6901 taatttaatt ggatgtatag aaagaacaca tgtattttgt catactggtc atccctggaa
     6961 tatgtcatgg ggacatttaa atgagtcaac acaatgggat gactgggtaa gcaaaatgga
     7021 agatttaaat caagagatac taactacact tcatggagcc aggaacaatt tggcacaatc
     7081 catgataaca ttcaatacac cagatagtat agctcaattt ggaaaagacc tttggagtca
     7141 tattggaaat tggattcctg gattgggagc ttccattata aaatatatag tgatgttttt
     7201 gcttatttat ttgttactaa cctcttcgcc taagatcctc agggccctct ggaaagtgac
     7261 cagtggtgca gggtcctccg gcagtcgtta cctgaagaaa aaattccatc acaaacatgc
     7321 atcgcgagaa gacacctggg accaggccca acacaacata cacctagcag gcgtgaccgg
     7381 tggatcaggg gacaaatact acaagcagaa gtactccagg aacgactgga atggagaatc
     7441 agaggagtac aacaggcggc caaagagctg ggtgaagtca atcgaggcat ttggagagag
     7501 ctatatttcc gagaagacca aaggggagat ttctcagcct ggggcggcta tcaacgagca
     7561 caagaacggc tctgggggga acaatcctca ccaagggtcc ttagacctgg agattcgaag
     7621 cgaaggagga aacatttatg actgttgcat taaagcccaa gaaggaactc tcgctatccc
     7681 ttgctgtgga tttcccttat ggctattttg gggactagta attatagtag gacgcatagc
     7741 aggctatgga ttacgtggac tcgctgttat aataaggatt tgtattagag gcttaaattt
     7801 gatatttgaa ataatcagaa aaatgcttga ttatattgga agagctttaa atcctggcac
     7861 atctcatgta tcaatgcctc agtatgttta gaaaaacaag gggggaactg tggggttttt
     7921 atgaggggtt ttataaatga ttataagagt aaaaagaaag ttgctgatgc tctcataacc
     7981 ttgtataacc caaaggacta gctcatgttg ctaggcaact aaaccgcaat aaccgcattt
     8041 gtgacgcgag ttccgcattt gtgacgcgtt aagttcctgt ttttacagta tataagtact
     8101 tgtgttctga caattgggac tcagattctg cggtctgagt cccttctctg ctgggctgaa
     8161 aaggcctttg taataaatat aattctctac tcagtccctg tctctagttt gtctgttcga
     8221 gatcctacac aaaggattag caatcagaat aatattagac tactcaaaag caacacataa
     8281 agccagaaga cagctgagtc atgttttcaa attctgacaa aaaattattt acaacttaga
     8341 attc
//
LOCUS       HIV2BEN     10359 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Human immunodeficiency virus type 2 (HIV-2), complete proviral
            genome.
ACCESSION   M30502
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 2 (isolate BEN) proviral DNA,
            clones MK[2,6].
  ORGANISM  Human immunodeficiency virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Kirchhoff,F., Jentsch,K., D Bachmann,B., Stuke,A., Laloux,C.,
            Lueke,W., Stahl-Henning,C., Schneider,J., Nieselt,K., Eigen,M. and
            Hunsmann,G.
  TITLE     A novel proviral clone of HIV-2: Biological and phylogenetic
            relationship to other primate immunodeficiency viruses
  JOURNAL   Virology 177, 305-311 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 10359)
  AUTHORS   Kirchhoff,F., Jentsch,K., D Bachmann,B., Stuke,A., Laloux,C.,
            Lueke,W., Stahl-Henning,C., Schneider,J., Nieselt,K., Eigen,M. and
            Hunsmann,G.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Kindly submitted prior to publication by Dr. F. Kirchhoff, DPZ,
            Gottingen, 0551/3851-0. HIV2BEN was isolated from a German AIDS
            patient(with predominantly neurological complications) who was
            probably infected in Mali (Klemm et al., J Neuro 235,304,1988).
            The sequence was determined from two clones: bases 1903 to 5057 and
            7293 to 9131 were from MK2 and bases 1 to 1902, 5058 to 7292 and
            9132 to 10359 were from MK6.
FEATURES       from  to/span     description
    pept       1103     2668     gag polyprotein
    pept    <  2326     5754     pol polyprotein (NH2 terminus uncertain)
    pept       5423     6070     vif protein
    pept       5898     6239     vpx protein
    pept       6239     6502     vpr protein
    pept       6402     6697     tat protein, exon 2 (first expressed exon)
               8861     8957     tat protein, exon 3 (AA at 8862)
    pept       6628     6697     rev protein, exon 2 (first expressed exon)
               8861     9102     rev protein, exon 3
    pept       6704     9286     env polyprotein
    pept       9120     9893     nef protein
    LTR           1      855     5' LTR
    LTR        9505    10359     3' LTR
    binding     859      875     primer (Lys-tRNA) binding site
    signal    10211    10216     poly-A signal
BASE COUNT     3506 a   2132 c   2598 g   2123 t
ORIGIN      Left end of viral genome (5'LTR)
        1 tgcaagggat gttttacagt aggaggagac atagaatcct agacatatac ctagaaaaag
       61 aggaagggat aataccagat tggcagaatt atactcatgg gccaggagta aggtacccaa
      121 tgtacttcgg gtggctgtgg aagctagtat cagtagaact ctcacaagag gcagaggaag
      181 atgaggccaa ctgcttagta cacccagcac aaacaagcag acatgatgat gagcatgggg
      241 agacattagt gtggcagttt gactccatgc tggcctataa ctacaaggcc ttcactctgt
      301 acccagaaga gtttgggcac aagtcaggat tgccagagaa agaatggaag gcaaaactga
      361 aagcaagagg gataccatat agtgaataac aggaacaacc atacttggtc aaggcaggaa
      421 gtagctacta agaaacagct gaggctgcag ggactttcca gaaggggctg taaccaaggg
      481 agggacatgg gaggagctgg tggggaacgc cctcatactt actgtataaa tgtacccgct
      541 tcttgcattg tattcagtcg ctctgcggag aggctggcag atcgagccct gagaggttct
      601 ctccagcact agcaggtaga gcctgggtgt tccctgctgg actctcacca gtacttggcc
      661 ggtactgggc agacggctcc acgcttgctt gcttaaagac ctcttcaata aagctgccag
      721 ttagaagcaa gttaagtgtg tgttcccatc tctcctagtc gccgcctggt cattcggtgt
      781 tcatctgagt aacaagaccc tggtctgtta ggacccttct cgctttggga atccaaggca
      841 ggaaaatccc tagcaggttg gcgcccgaac agggacttga agaggactga gaagccctgg
      901 aactcggctg agtgaaggca gtaagggcgg caggaacaaa ccacgacgga gtgctcctag
      961 aaaggcgcgg gccgaggtac caaaggcggc gtgtggagcg ggagtgaaag aggcctccgg
     1021 gtgaaggtaa gtacctacac caaaaactgt agccagaaaa ggcttgttat cctaccttta
     1081 gacaggtaga agattgtggg agatgggcgc gagaaactcc gtcttgagag ggaaaaaagc
     1141 agacgaatta gaaaaagtta ggttacggcc cggcggaaag aaaaagtaca ggttaaaaca
     1201 tattgtgtgg gcagcgaatg aattggataa attcggattg gcagagagcc tgttggagtc
     1261 aaaagaaggt tgccaaaaga ttctcagagt tttagatcca ttagtaccaa cagggtcaga
     1321 aaatttaaaa agccttttta ataccgtctg cgtcatttgg tgcttgcacg cagaagagaa
     1381 agtgaaagat actgaggaag caaagaaact agcacagaga catctagtgg cagaaactgg
     1441 aactgcagag aaaatgccaa atacaagtag accaacagca ccacctagtg ggaaaagagg
     1501 aaactacccc gtgcaacaag cgggtggcaa ctatgtccat gtgccactga gcccccgaac
     1561 tctaaatgca tgggtaaaat tagtggagga aaagaagttc ggggcagaag tagtgccagg
     1621 atttcaggca ctctcagaag gctgcacgcc ctatgatatt aatcaaatgc ttaattgtgt
     1681 gggcgatcac caagcagcta tgcaaataat cagagagatt attaatgaag aagcagcaga
     1741 ctgggattcg cagcacccaa taccaggccc cttaccagca ggacagctca gagacccaag
     1801 agggtctgac atagcaggaa caacaagcac agtagatgaa cagatccagt ggatgtatag
     1861 gccacaaaat cccgtaccgg tagggaacat ctacagaaga tggatccaaa tagggctgca
     1921 aaagtgtgtc agaaagtaca acccaactaa catcttagac ataaaacagg gaccaaaaga
     1981 accgttccaa agctatgtag acaggttcta caaaagcttg agggcagaac aaacagaccc
     2041 agcagtaaaa aattggatga cccaaacgct gctaatacag aatgccaacc cagactgcaa
     2101 gttagtacta aaaggactgg ggatgaatcc caccctagaa gagatgctaa ccgcctgcca
     2161 gggggtaggc ggaccaggcc agaaagccag gctaatggct gaagccctaa aagaggctat
     2221 gggaccaagc cctatcccat ttgcagcagc ccaacaaaga aaggcaatta ggtattggaa
     2281 ctgtggaaag gagggacact cggcaagaca gtgccgagca cctagaagac agggctgctg
     2341 gaagtgtggc aagccaggac acatcatggc aaactgcccg gaaagacagg caggtttttt
     2401 agggttgggc ccacggggaa agaagcctcg caacttcccc gtgacccaag cccctcaggg
     2461 gctgatacca acagcacctc cggcagatcc agcagcggaa ctgttggaga gatatatgca
     2521 gcaagggaga aagcagaggg agcagaggga gagaccatac aaagaggtga cggaggactt
     2581 gctgcacctc gagcagagag agacacctca cagagaggag acagaggact tgctgcacct
     2641 caattctctc tttggaaaag accagtagtc acagcgtaca tcgaggatca gccggtagaa
     2701 gtcttactag acacaggggc tgatgactca atagtagcag gaatagaatt aggggacaat
     2761 tacactccaa aaatagtagg gggaataggg ggatttataa acaccaaaga atacaaaaat
     2821 gtagaaataa aagtactaaa taaaagagta agagccacca taatgacagg agatacccca
     2881 atcaacatct ttggcagaaa tattctgaca gccttaggca tgtcattaaa tttaccagtt
     2941 gccaagatag agccaataaa agtaacattg aagccaggga aagatggacc aaggctgaaa
     3001 caatggcccc taacaaaaga gaaaatagaa gcactaaaag agatctgtga aaaaatggaa
     3061 aaagagggcc agctagaaga ggcacctcca actaatcctt ataatacccc cacatttgca
     3121 attaagaaaa aggacaagaa caaatggagg atgctgatag attttagaga actaaataag
     3181 gtgactcaag atttcacaga aattcagcta ggaattccac acccggcagg actagccaaa
     3241 aagaaaagga tctctatatt agatgtaggg gatgcctatt tttccatacc actacatgaa
     3301 gattttaggc agtatactgc atttacccta ccagcagtaa acaatatgga accaggaaaa
     3361 agatatatat ataaagtctt gccacaagga tggaagggat caccagcaat ttttcaatac
     3421 acaatgaggc aagtcttaga acctttcaga aaagcaaacc cagatgtcat tctcatccag
     3481 tacatggatg atatcttaat agctagtgac aggacaggtt tagagcatga caaagtggtc
     3541 ctgcagctaa aagaacttct aaatggccta gggttttcta ctccagatga gaagttccaa
     3601 aaagaccctc catttcaatg gatgggctgt gaactatggc caactaaatg gaagctgcag
     3661 aaactacaac tgccccagaa agacatatgg acagtcaatg acatccaaaa gctagtggga
     3721 gtcttaaatt gggcggcaca aatctattca ggaataaaaa ccaaacactt atgtagacta
     3781 attagaggaa aaatgacact cacagaagaa gtgcagtgga cagaactagc agaagcagag
     3841 ctagaagaaa acaaaattat cttgagccag gaacaagaag gatattatta ccaagaagaa
     3901 aaagaattag aggcaacaat ccaaaaaagc caaggacatc aatggacata caaaatacac
     3961 caggaagaga aaatcctaaa agtaggaaag tatgcaaaga taaaaaatac ccataccaat
     4021 ggggtcagat tactagcaca ggtagttcag aaaataggaa aagaggcact agtcatttgg
     4081 ggacggatac caaaatttca cctgccagtg gagagagaga cctgggagca gtggtgggat
     4141 aactactggc aagtgacatg gatcccagag tgggactttg tatctacccc accactggtc
     4201 aggttaacat ttaacctagt aggagatcct ataccaggcg cagagacctt ctacacagat
     4261 ggatcatgca atagacagtc aaaagaggga aaagcaggat atgtaacaga tagaggaaaa
     4321 gacaaagtaa aagtattaga acaaactacc aatcagcagg cagaattaga agtctttcgg
     4381 atggcactgg cagactcagg cccaaaggtt aatatcatag tagattcaca gtatgtaatg
     4441 gggatagtag caggccagcc aacagagtca gaaaatagaa tagtgaacca gatcatagaa
     4501 gaaatgataa agaaggaagc agtctatgtt gcatgggtcc cagcccataa aggcatagga
     4561 ggaaaccagg aagtagacca tttagtaagt caaggcatca gacaagtatt attcctggaa
     4621 aagatagagc ccgctcaaga ggaacatgaa aaatatcata gcattataaa agaactaacc
     4681 cataaatttg gaatacccct tctagtagca agacagatag taaactcatg tgcccaatgc
     4741 caacagaaag gagaagccat acatgggcaa gtaaatgcag aaataggcgt ttggcaaatg
     4801 gactacacac acttagaagg aaaaatcatt atagtagcag tacatgttgc aagtggattc
     4861 atagaagcag aagtcatccc acaggaatca ggaaggcaga cagcactctt cctattaaaa
     4921 ctggccagta ggtggccaat aacgcacttg cacacagaca atggccccaa cttcacttca
     4981 caggaagtga agatggtggc atggtgggta ggtatagaac aatcctttgg agtaccttac
     5041 aacccacaaa gccagggagt agtagaagca atgaatcacc acctaaagaa tcagataagt
     5101 agaattagag aacaggcaaa tacaatagaa acaatagtac tgatggcagt tcattgcatg
     5161 aattttaaaa gaaggggagg aataggggat atgaccccag cagaaagact aatcaacatg
     5221 attaccacag aacaagaaat acaattcctc caaagaaaaa attcaaattt taaaaatttc
     5281 caggtctatt acagagaagg cagagatcag ctgtggaaag gacctggtga actactgtgg
     5341 aagggagaag gagcagtcat agtcaaggta gggacagaca taaaagtagt accaagaagg
     5401 aaggccaaga ttatcaggga ctatggagga agacaggaac tggatagtag tccccacctg
     5461 gagggtgcca gggaggatgg agaaatggca tgcccttgtc aagtacctga aatacagaac
     5521 aaaagaccta gaggaggtgc gctatgttcc ccaccacaag gtgggatggg catggtggac
     5581 ttgcagcagg gtaatattcc cactacaagg aaaaagtcat ctagaaatac aggcatattg
     5641 gaacctaaca ccagaaaaag gatggctctc ctctcatgca gtaagattaa cctggtatac
     5701 agaaaagttc tggacagatg ttaccccaga ctgtgcagac atcctaatac atagcactta
     5761 tttctcttgc tttacggcag gtgaagtaag aagagccatc agaggggaaa agttattgtc
     5821 ctgctgcaac tatccccaag ctcataaagc acaggtacca tcacttcaat acctagccct
     5881 agtagtagta caacaaaatg acagacccca gagaaagggt accgccagga aacagtggag
     5941 aagagaccat tggagaggcc ttcgagtggc tagagaggac catagaagcc ttaaacaggg
     6001 aggcagtgaa ccatctgccc cgagagctca ttttccaggt gtggcaaagg tcctggagat
     6061 attggcatga tgaacaaggg atgtcagcaa gctacacaaa gtatagatat ttgtgcctaa
     6121 tgcaaaaagc tatatttaca catttcaaga gagggtgcac ttgctggggg gaggacatgg
     6181 gccgggaagg attggaagac caaggacctc cccctcctcc ccctccaggt ctagtctaat
     6241 gactgaagca ccaacagagt ttcccccaga agatgggacc ccacggaggg acttagggag
     6301 tgactgggta atagaaactc tgagggaaat aaaggaagaa gccttaagac attttgatcc
     6361 ccgcttgcta attgctcttg gctactatat ccataataga catggagaca cccttgaagg
     6421 cgccagagag ctcattaaaa ccctacaacg agccctcttc gtgcacttca gagcgggatg
     6481 taaccgctca agaattggct aaacaaggag aagaactcct tgcccagctg caccgacccc
     6541 tagagccatg cactaacaaa tgctattgta agcgatgcag tttccattgc cagctgtgtt
     6601 tctcgaaaaa ggggctcgga atatcatatg agcgaaaggg cagacgaaga aggactccaa
     6661 ggaaaactaa gactccttcg ccttctgcac cagacaagtg agtatggagc ctggtaggaa
     6721 tcagctgttt gttgtcattt tactaacaag tgcttgctta gtatattgta gccagtatgt
     6781 gactgttttc tatggcatac ccgcgtggaa aaatgcatct attcccttat tttgtgcaac
     6841 taaaaataga gacacttggg ggaccataca gtgcttgcca gacaatgatg attatcagga
     6901 aataatttta aatgtgacag aggcttttga tgcatggaat aatacagtga cagaacaagc
     6961 agtagaagat gtctggcatc tatttgagac atcaataaaa ccatgtgtca agctaacacc
     7021 tctatgtgtg gcaatgaatt gtagcagggt tcaagggaat accacgaccc cgaatcccag
     7081 gacctcgagt tccacaacct cgagaccacc cacatccgca gcctccataa taaatgaaac
     7141 ttctaactgc atagaaaaca acacatgcgc aggattaggg tatgaggaga tgatgcaatg
     7201 tgagttcaat atgaaggggt tagaacaaga taagaaaagg aggtataagg acacatggta
     7261 tttagaagat gtggtttgtg acaacacaac agctggcaca tgttacatga gacattgcaa
     7321 cacatcaatc atcaaagagt catgtgataa gcactattgg gatgctatga ggtttagata
     7381 ctgtgcacca ccgggctttg ccctattaag atgtaatgat accaactatt caggctttga
     7441 acctaagtgc actaaagtag tagctgcttc atgcacaagg atgatggaaa cgcaaacttc
     7501 tacttggttt ggctttaatg gcactagagc agaaaataga acatatatct attggcatgg
     7561 cagagataat aggactatca ttagcttaaa caagtattat aatctcacaa tgcgttgtaa
     7621 gagaccagga aataagacag ttttaccaat aacacttatg tcaggattag tgtttcactc
     7681 tcagccaatc aacacaaggc ctaggcaggc atggtgccgg tttggaggca gatggaggga
     7741 agccatgcag gaggtgaagc aaacccttgt acaacatccc agatacaaag gaatcaatga
     7801 tacagggaaa attaacttta cgaaaccggg agcaggctca gacccggaag tggcatttat
     7861 gtggactaac tgcagaggag aatttctcta ctgtaacatg acttggttcc tcaattgggt
     7921 agaagacaag aaccaaacac ggcgcaacta ttgccatata aagcagataa ttaatacctg
     7981 gcataaagta gggaaaaatg tatatttgcc tcctagggaa ggggagttgg cctgtgaatc
     8041 aacagtaacc agcataattg ctaacattga catagataaa aatcggactc ataccaacat
     8101 tacctttagt gcagaagtgg cagaactgta ccgattagaa ctgggagact acaaattaat
     8161 agaaataaca ccaattggct tcgcacctac agatcagaga aggtactcct caactccagt
     8221 gaggaacaaa agaggtgtgt tcgtgctagg gttcttgggt tttctcgcga cagcaggttc
     8281 tgcaatgggc gcgcggtccc tgacgctgtc agcccagtcc cggactttac tggccgggat
     8341 agtgcagcaa cagcaacagc tgttggacgt agtcaagaga caacaagaaa tgttgcgact
     8401 gaccgtctgg ggaacgaaaa acctccaggc aagagtcact gctatcgaga agtacctaaa
     8461 gcatcaggca cagctaaatt catggggatg tgcgtttaga caggtctgcc acactactgt
     8521 accgtgggta aatgactctt tatcgcctga ctggaaaaat atgacatggc aggagtggga
     8581 gaaacaagtc cgctacctag aggcaaatat cagtcaaagt ttagaagaag cccaaattca
     8641 acaagaaaag aatatgtatg aattacaaaa attaaatagc tgggatattc ttggcaactg
     8701 gtttgactta acctcctggg tcaagtatat tcaatatgga gtgcatatag tagtgggaat
     8761 aatagcttta agaatagcaa tctatgtagt gcaattgtta agtagattta gaaagggcta
     8821 taggcctgtt ttctcttccc cccccggtta tctccaacag atccatatcc acaaggaccg
     8881 gggacagcca gccaacgaag gaacagaaga agacgtcgga ggcgacagtg gttacgactt
     8941 gtggccttgg ccaataaact atgtgcagtt cctgatccac ctactgactc gcctcttgat
     9001 cgggctatac aacatctgca gagacttact atccaagaac tccccgaccc gccgactgat
     9061 ctcccagagt ctaacagcaa tcagggactg gctgagactt aaggcggccc aactgcaata
     9121 tgggtgcgag tggatccaag aagctttcca agcattcgcg aggactacga gagagactct
     9181 tgcgggcgcg tggggatggt tatgggaagc agcgcgacgc atcgggaggg gaatactcgc
     9241 agttccaaga agaatcaggc agggagcaga actcgccctc ctgtgaggga cagcagtatc
     9301 agcagggaga gtacatgaac agcccatgga gaaacccagc aacagaaaga cagaaagatt
     9361 tgtataggca gcaaaatatg gatgatgtag attctgatga tgatgaccta ataggagttc
     9421 ctgttacacc aagagtacca cggagagaaa tgacctataa attggcaata gatatgtcac
     9481 attttataaa agaaaaaggg ggactgcaag ggatgtttta cagtaggagg agacatagaa
     9541 tcctagacat atacctagaa aaagaggaag ggataatacc agattggcag aattatactc
     9601 atgggccagg agtaaggtac ccaatgtact tcgggtggct gtggaagcta gtatcagtag
     9661 aactctcaca agaggcagag gaagatgagg ccaactgctt agtacaccca gcacaaacaa
     9721 gcagacatga tgatgagcat ggggagacat tagtgtggca gtttgactcc atgctggcct
     9781 ataactacaa ggccttcact ctgtacccag aagagtttgg gcacaagtca ggattgccag
     9841 agaaagaatg gaaggcaaaa ctgaaagcaa gagggatacc atatagtgaa taacaggaac
     9901 aaccatactt ggtcaaggca ggaagtagct actaagaaac agctgaggct gcagggactt
     9961 tccagaaggg gctgtaacca agggagggac atgggaggag ctggtgggga acgccctcat
    10021 acttactgta taaatgtacc cgcttcttgc attgtattca gtcgctctgc ggagaggctg
    10081 gcagatcgag ccctgagagg ttctctccag cactagcagg tagagcctgg gtgttccctg
    10141 ctggactctc accagtactt ggccggtact gggcagacgg ctccacgctt gcttgcttaa
    10201 agacctcttc aataaagctg ccagttagaa gcaagttaag tgtgtgttcc catctctcct
    10261 agtcgccgcc tggtcattcg gtgttcatct gagtaacaag accctggtct gttaggaccc
    10321 ttctcgcttt gggaatccaa ggcaggaaaa tccctagca
//
LOCUS       HIV2D205     7817 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Human immunodeficiency virus type 2 (HIV-2), gag, pol, vif,
            vpX, vpR, partial env.
ACCESSION   X16109
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 2 (isolate D205) viral RNA.
  ORGANISM  Human immunodeficiency virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 7817)
  AUTHORS   Dietrich,U., Adamski,M., Kreutz,R., Seipp,A., Kuhnel,H. and
            Rubsamen-Waigmann,H.
  TITLE     A highly divergent HIV-2-related isolate
  JOURNAL   Nature 342, 948-950 (1990)
  STANDARD  full staff_review
COMMENT     This isolate, also called HIV2-ALT by the authors, was taken from a
            healthy Ghanian woman. Tree analysis reveals that it likely arose
            prior to the branching of HIV2s and SIVs, i.e.  the sooty mangabey
            and macaque isolates. [1] calls attention to an extension of the
            gag-pol overlap region resulting in a dilated (by 20 residues) pol
            orf. The env cds begins at 6671; tat-exon1 at 6375 and rev-exon1 at
            6601, with the possible donor splice site at 6665.
FEATURES       from  to/span     description
    pept       1079     2644     gag polyprotein
    pept    <  2299     5475     pol polyprotein (NH2 terminus uncertain)
    pept       5396     6046     vif protein
    pept       5877     6212     vpX protein
    pept       6212     6517     vpR protein
    pept       6375  /  6664     tat protein, exon 2 (first expressed exon)
    pept       6601  /  6664     rev protein, exon 2 (first expressed exon)
    pept       6744  >  6875     env polyprotein
    site       6876     6878     env polyprotein in-frame stop codon
BASE COUNT     2705 a   1602 c   1930 g   1580 t
ORIGIN      5'-terminus of 5'LTR
        1 tggaagggat gtattatagt gagagaagac acagaatatt agacacatat tttgagaatg
       61 aagaaggcat tgtgtctggc tggcaaaact atactcatgg gccagggata aggcatccca
      121 aatactttgg ttggctgtgg aagctggtac cagtagaggt gccagcagcg acccgagagg
      181 aggaggaaac ccattgccta atgcacccgg cacagatctc ctcatgggat gacatccatg
      241 gggagactct tatctggcag tttgattccc tcctggcata tgattatgtg gctttcaata
      301 ggtttccaga agagttcggg taccagtcag ggctaccaga ggaggagtgg aaggctagac
      361 taaaagcaag agggatacct acagattagg caagagacag cagcataaac aggaactagc
      421 tgacactgca caagaaggaa actagcagac actgcaggga ctttccaaaa gaggctgtaa
      481 catgggagga gcaagggagg ggttggttgg gaacgcccat taaatctctg tataaatgta
      541 cccgcttctc gcattgtatt cagtcgctct gcggagaggc tggcagattg agccctggga
      601 ggttctctcc agcactagca ggtagagcct gggtgttccc tgctagactc tcaccagcac
      661 taggccagtg ttgggcagac ggctccacgc ttgcttgctt aaaaccctct taataaactg
      721 ccaattagaa gcaagtaagt gtgtgttccc atctctccta gtcgcgcctg gtcactcggt
      781 gctccactga tagaccctgg tctgttagga cccttcttgc tttgggaaac cgaagcggga
      841 aaatccctag cagattggcg cccgaacagg gacctgggaa gagtgagagt cttcaagagc
      901 acagctgagt gaaggcagta agggcggcag gaatcaacca cgacggagag ctcctgtaaa
      961 agcgcaggcc ggtaccaggc agcgtgagga gcgggaggag aagaggctcc gggagtaagg
     1021 taagtactcc aaattttgca ttgtctactc taaagaggga gtagggcata agtgggagat
     1081 gggcgcgaga ggctccgtcc tgtcagggaa gaaaacagat gaattagaaa aagttaggtt
     1141 acggcccggc ggaaagaaaa agtatatgct aaaacatgta gtatgggcag tgaacgaatt
     1201 agacagattt ggattagcag agagcctgtt ggaatcaaaa gaaggatgtc aaaaaattct
     1261 taaagtctta gctccattag taccaacagg gtcagaaaat ttaaaaagcc tttttaatat
     1321 cgtctgcgtc attttttgcc tgcacgcaga agagaaagtg aaagatacag aggaagcaaa
     1381 aaagatagca cagagacatc tagcggcgga cacagaaaaa atgccagcta caaataaacc
     1441 aacagcacca cctagcggcg gaaattatcc agtgcagcaa ctggctggca actacgtcca
     1501 cctgccgcta agcccccgaa ccttaaatgc ttgggtaaag ttagtagaag aaaagaagtt
     1561 cggggcagaa gtagtaccag gatttcaggc actatcagaa ggatgcaccc cttatgatat
     1621 aaatcagatg ctaaattgtg taggagaaca tcaggcagcc atgcaaatta ttagagaaat
     1681 aatcaatgag gaagcagcag actgggacca gcaacacccg tcaccaggcc caatgccggc
     1741 aggacaactt agggacccaa gagggtcaga tatagcagga accaccagca cagtagagga
     1801 acagatacag tggatgtaca gggcccaaaa tcctgtccca gtgggaaaca tttatagaag
     1861 atggattcaa ttaggattgc agaaatgtgt ccgaatgtac aatcctacca acatattaga
     1921 cataaagcag ggaccaaagg agcccttcca aagctatgta gatagattct acaaaagctt
     1981 acgggcagaa caaacagacc cagcagtgaa aaattggatg acacaaacac tgctgattca
     2041 gaatgctaac ccagattgca agttagtgct taagggcttg ggaatgaatc ccaccttaga
     2101 ggaaatgcta acggcctgcc aagggatagg aggcccaggg cagaaggcaa ggctaatggc
     2161 cgaagcctta aaagaggccc taacacctgc acccataccg tttgctgccg ttcaacaaaa
     2221 agcagggaag agagggacag tgacatgctg gaactgtggc aaacagggac acacagccag
     2281 gcaatgcagg gcccctagaa gacagggatg ctggaaatgt ggaaaaacag gacacatcat
     2341 gtcaaaatgc ccagaaagac aggcgggttt tttagggtta ggaccctggg gaaagaagcc
     2401 tcgcaacttc cccatgaccc aagtgcctca gggagtgaca ccatctgcac ccccgatgaa
     2461 cccagcagag ggcatgacac ctcggggggc gacaccatct gcgccccctg cagatccagc
     2521 agtggagatg ctgaaaagtt acatgcagat ggggagacaa cagagagaga gccgagagag
     2581 accctacaag gaggtgacag aggatttgct gcacctcaat tctctctttg gagaagacca
     2641 gtagtcaaag catgtatcga gggtcagtca gtagaagtat tactagacac aggagttgac
     2701 gactcaatag tagcagggat agaattaggt agcaattaca ccccaaaaat agtaggaggg
     2761 ataggagggt tcataaatac caaagaatac aaagatgtag aaatagaagt agtgggaaaa
     2821 agagtaaggg caactataat gacaggagat accccaataa acatttttgg cagaaatatt
     2881 ttaaatacct tgggcatgac tttaaatttc ccagtggcaa aggtagaacc agtaaaagtt
     2941 gagttaaaac ctggaaaaga tgggccaaag atcagacaat ggcctctatc cagggaaaag
     3001 atactagccc tcaaagaaat ctgtgaaaaa atggaaaagg agggacaatt agaagaagca
     3061 ccccctacta atccatacaa cacacccacc tttgccataa agaagaaaga taagaacaaa
     3121 tggagaatgc taatagactt tagagagtta aacaaggtaa cccaggactt cacagaagtc
     3181 aactgggtat tccccacccg gcaggtggca gaaaaaagga gaataacagt aatagatgtg
     3241 ggagatgcct acttcagtat cccactagat ccaaacttca gacagtatac agcattcacc
     3301 ttgccatcag taaacaatgc agagccagga aagagataca tttataaagt tctaccacaa
     3361 gggtggaagg ggtcccagtc aatttgtcaa tactccatga gaaaggtatt agatcctttt
     3421 agaaaggcca acagcgatgt cattataatt cagtacatgg atgacatcct tatagcaagt
     3481 gacagaagtg atctggagca cgacagggta gtgtcccaac taaaagagtt attaaatgac
     3541 atgggattct ctaccccaga agaaaagttc caaaaagacc ctccgttcaa atggatgggt
     3601 tatgagctct ggccaaaaaa gtggaaactg caaaaaatac aactgccaga aaaagaagtt
     3661 tggacagtga atgcaattca aaaactggta ggagtattaa actgggcagc tcaactcttt
     3721 cctggaatta agacaaggca catatgcaaa ctaattaggg gaaagatgac cctaacagaa
     3781 gaagtacagt ggacagaact agcagaagca gagctacagg agaataaaat catcttagaa
     3841 caggaacaag aaggatccta ctacaaggaa agggtaccgc tagaagcaac agtacagaaa
     3901 aacctagcaa atcagtggac atacaaaatt catcagggaa ataaagtcct aaaagtagga
     3961 aaatatgcaa aggttaaaaa cacgcacacc aacggggtaa gactactggc acatgtagtt
     4021 cagaaaatag gcaaagaagc cctagtcatc tggggagaga taccagtgtt ccatctgcca
     4081 gtagaaagag agacatggga ccagtggtgg acagattact ggcaagtaac ctggatccca
     4141 gagtgggact ttgtctcgac cccaccatta ataagactag cctacaacct agtcaaagac
     4201 cccctagaag ggagagaaac ctactacaca gatgggtcct gcaatagaac ctcaaaggaa
     4261 ggaaaagcag gatatgtcac tgacagggga aaagataagg ttaaagtgtt agaacagaca
     4321 acaaaccaac aagcagaact tgaagcattt gcattagcat taacagactc agaaccacaa
     4381 gttaacatca tagtagattc acaatatgtc atgggaataa tagctgcaca gccaacagaa
     4441 acagaatcac caatagtagc aaaaataatt gaagaaatga tcaaaaaaga ggcagtatat
     4501 gtaggatggg taccagctca caagggactg ggtggtaatc aggaagtaga ccacctagta
     4561 agtcaaggaa tcagacaggt cttgttccta gaaaaaatag aaccagccca ggaagagcat
     4621 gaaaaatatc atggcaatgt aaaagaactg gtccataaat tcggaattcc acaattagtg
     4681 gcaaaacaga tagtaaattc ctgtgataaa tgccaacaaa aaggggaagc tattcatgga
     4741 caggtaaatg cagacctagg gacatggcag atggactgta cacatttaga aggaaaaatt
     4801 ataatagtgg cagtccatgt agccagtggg tttatagaag cagaggtaat accccaagag
     4861 acaggaagac agacagctct cttcctacta aagttggcca gcagatggcc tatcacacac
     4921 ctacacacag acaacggtgc caacttcacc tcaccaagtg taaagatggt agcctggtgg
     4981 gtaggaatag aacaaacttt tggagtaccc tataacccac aaagtcaagg agtagtggaa
     5041 gcaatgaacc atcacctgaa aaatcaaata gacagactca gagaccaagc agtatcaata
     5101 gagacagttg tactaatggc aactcactgc atgaatttta aaagaagggg aggaataggg
     5161 gatatgaccc ctgcagaaag actagttaac atgataacca cagagcaaga aatacagttc
     5221 ttccaagcaa aaaatttaaa atttcaaaat ttccaggtct attacagaga aggcagagat
     5281 caactctgga agggacctgg tgaactattg tggaaagggg aaggagcagt catcataaag
     5341 gtagggacag aaatcaaagt agtacccagg agaaaagcaa aaattataag gcactatgga
     5401 ggaggaaaag gattggattg tagtgccgac atggaggata ccaggcaggc tagagagatg
     5461 gcacagtctg attaagtatc ttaagtatag aacaggagag ttgcaacagg tctcttatgt
     5521 ccctcaccac aaggtaggat gggcttggtg gacttgcagt agaataatat ttcccctaaa
     5581 caaaggagca tggctagaag tccaaggata ttggaaccta accccagaaa ggggattctt
     5641 gagctcctat gctgtaagac taacatggta tgagaggaac ttttatacag atgtaacacc
     5701 tgatgtggca gaccagctac tgcatgggtc ttatttctct tgcttttcag ccaatgaagt
     5761 aaggagagcc atcaggggag aaaagatatt gtcctactgc aactatccat cagctcacga
     5821 agggcaggta ccaagcttac agtttctagc cctaagggtc gtacaggaag gaaaaaatgg
     5881 atcccaggga gagagtgcca ccaggaaaca gcgacgaaga aacagtagga gaagcattcg
     5941 cttggctaga aagaacaata acagagctca acagggtagc ggtcaaccat ttgccccgag
     6001 aacttatttt ccaggtctgg cagaggtctt gggcatactg gcgtgaggaa cagggcatgt
     6061 caattagcta taccaaatat agatacttgt tgctaatgca gaaagcaatg tttgtgcact
     6121 atacaaaggg ctgtaggtgc ctgcaggagg gccatgggcc agggggatgg agatcaggac
     6181 ctcctcctcc tcctccccca ggcctggcct aatggcagaa gcagccccag agatccctcc
     6241 agagaacgag aacccacaaa gagaaccgtg ggaagagtgg ataggggaga tcctggagga
     6301 aataaagcaa gaagccttaa agcattttga tcctcgcttg ctaactgcgc ttggtaactt
     6361 tatctacagt aggcatggag atacccttgc aggagcagga gagctcatta aaatcctcca
     6421 acgagccctc ttcctccact tcagagccgg ttgtcaacac tcaaggattg gacaatcagg
     6481 gggaggaaat cctctctcaa ctataccgcc cccttaaggc atgcgataat acatgctact
     6541 gtaagaaatg ctgctaccat tgccagcttt gttttcttaa aaagggtctt gggatatgtt
     6601 atgaccgctc gagaaggaga tctgcaaaaa gagctaagac tactgcacct tctgcaccag
     6661 acaagtgagt atggcatatt ttagcagccg cctgcctatt gcgctcctgc ttataggtat
     6721 cagtgggttt gtatgtaaac aatatgttac tgtcttctat ggcatacccg catggaggaa
     6781 cgcaacagtt cccctcattt gtgcaaccac aaacagagac acctggggaa ctgtacagtg
     6841 tctcccagac aatggtgact acactgagat caggctaaac ataacagagg cttttgatgc
     6901 atgggataat acagtgacac aacaggcagt agatgatgtg tggagactct ttgaaacctc
     6961 cataaaacca tgtgtcaaac taaccccact gtgtgtggca atgaactgta gtaaaaccga
     7021 aacaaaccca gggaatgcca gtagtactac caccactaag cctactacca cctctcgtgg
     7081 gctgaaaacg attaacgaaa cagacccatg cataaaaaat gacagctgca caggactagg
     7141 agaagaggaa ataatgcaat gtaattttag tatgacggga ctaagaagag atgagctaaa
     7201 acaatataaa gacacctggt actcagaaga tttagagtgt aataatacca ggaagtatac
     7261 cagcagatgc tatataagaa cctgcaacac aacaattatc caagagtcat gtgacaaaca
     7321 ttattgggac agcttaaggt ttaggtattg tgctcccccg gggttttttc tactaagatg
     7381 taatgatacc aactattcag gcttcatgcc caactgcagt aaggtagtag cgtcctcctg
     7441 cacaagaatg atggaaacac agtcctctac atggtttggc ttcaatggta caagggcaga
     7501 gaacaggaca tatatatatt ggcatgaaaa agacaatagg accatcataa gcttaaatac
     7561 atactataat ttgtcaatac actgtaagag gccaggaaac aagacggttg taccaataag
     7621 aaccgtgtca ggactacttt tccattcaca gcctatcaat aagagaccca gacaagcttg
     7681 gtgctggttt aagggaaact ggacagaagc cataaaggag gtgaaaagga ccatcataaa
     7741 acatcccagg tataaaggag gtgcaaaaaa tatcacaagc gtaaagttag tatcagaaca
     7801 tggaaaaggt tcagatc
//
LOCUS       HIV2GH1      9480 bp ds-DNA             VRL       04-AUG-1990
DEFINITION  Human immunodeficiency virus type 2 proviral DNA, gag, pol, vif,
            vpX, vpR, tat, rev, env and nef genes.
ACCESSION   M30895
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 2 MOLT-4 cell (isolate GH1)
            proviral DNA, clone 8.
  ORGANISM  Human immunodeficiency virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9480)
  AUTHORS   Hasegawa,A., Tsujimoto,H., Maki,N., Ishikawa,K.-I., Miura,T.,
            Fukasawa,M., Miki,K. and Hayami,M.
  TITLE     Genomic divergence of HIV-2 from Ghana
  JOURNAL   AIDS Res. Hum. Retroviruses 5, 593-604 (1989)
  STANDARD  full staff_review
COMMENT     GH1 was isolated from a Ghanian AIDS patient; virus was cloned in
            MOLT-4 clone 8 cells.
FEATURES       from  to/span     description
    pept        544     2112     gag polyprotein
    pept    <  1770     4937     pol polyprotein
    pept       4867     5514     vif protein
    pept       5342     5680     vpX protein
    pept       5680     5997     vpR protein
    pept       5843     6138     tat protein, exon 2 (first expressed exon)
               8281     8377     tat protein, exon 3 (AA at 8282)
    pept       6069     6138     rev protein, exon 2 (first expressed exon)
               8281     8522     rev protein, exon 3 (AA at 8283)
    pept       6148     8364     env polyprotein (see COMMENT)
    pept       8540     9307     nef protein
BASE COUNT     3257 a   1957 c   2349 g   1917 t
ORIGIN      
        1 cagtcgctct ggcagaggct ggcagattga gccctgggag gttctctcca gcactagcag
       61 gtagagcctg ggtgttccct gctagactct caccagtgct tggccggcac tgggcagacg
      121 gctccacgct tgcttgctta aagacctctt aataaagctg ccaattagaa gcaagttaag
      181 tgtgtgttcc catctctcct agtcgccgcc tggtcattcg gtgttcatct gaataacaag
      241 accctggtct gttaggaccc ttccgctttg ggaatccaag gcaggaaaat ccctagcagg
      301 ttggcgccca acagggactt gaaagggact gagaagccct ggaacacggc tagtgaaggt
      361 agtaaggcgg caggaacaaa ccacgacgga gtgctcctag aaaagcgcgg gccgaggtac
      421 caaaggcggc gtgtggagcg ggagtgaaag aggcctccgg gtgaaggtaa gtgcctacac
      481 cgaaaaccgt agccagaaaa ggctagttat cctatctttt agacaggtag aagattgtgg
      541 gagatgggcg cgagaaactc cgtcttgaga gggaaaaaag cagatgaatt agaaaagatt
      601 aggttacggc ccagcggaaa gaaaaagtac aggctaaaac atattgtgtg ggcagcgaac
      661 gaattggaca aattcggatt ggcagagagc ctgttggagt caaaagaagg ttgccaaaag
      721 attctcacag ttttagatcc attagtacca acagggtcag aaaatttaaa aagccttttt
      781 aacacagtct gcgtcatttg gtgcttgcac gcagaagaga aagtgaaaga tactgaagaa
      841 gcaaaaaaac tagtacagag acatctaggg gcagaaacag gaactgcaga aaaaatgcca
      901 agtacaagta gaccaacagc accacctagc gggagaggaa gaaacttccc tgtacaacag
      961 acaggcggtg gcaactatat ccacgtgcca ctgagccccc gaactctaaa tgcttgggta
     1021 aaattagtag aggacaagaa gttcggggca gaagtagtgc caggatttca agcactctca
     1081 gaaggctgca cgccctatga tatcaaccaa atgcttaatt gtgtgggcga tcaccaagca
     1141 gctatgcaaa taatcagaga gattatcaat gacgaagcag cagattggga tgcacagcac
     1201 ccaataccag gccccttacc agcagggcag cttagagacc caagggggtc tgacatagca
     1261 ggaacaacaa gcacagtaga agaacagatc cagtggatgt ataggccaca aaatcccgtg
     1321 ccggtaggga acatctacag aagatggatc cagatagggc tacagaagtg tgtcaggatg
     1381 tacaacccaa ctaacatctt agacgtaaag cagggaccaa aggaaccgtt ccagagctat
     1441 gtggacaggt tctataaaag cttgagggca gaacaaacag atccggcagt aaagaactgg
     1501 atgacccaaa cgctgctaat acagaatgcc aacccagact gcaagttagt actaaaagga
     1561 ctggggatga atcccaccct agaagagatg ctgactgcct gtcagggggt aggtggacca
     1621 ggccaaaaag ccagactaat ggccgaagcc ctgaaggagg ctttgacacc accccctatc
     1681 ccatttgcag cagcccaaca aagaaaggta attaggtgtt ggaactgcgg aaaggaggga
     1741 cactcggcga gacagtgccg agcgcctaga agacagggct gctggaaatg tggcaagaca
     1801 ggacacgtca tggcaaaatg cccagaaaga caggcaggtt ttttagggat gggtccatgg
     1861 ggaaagaagc cccgcaactt ccccgtggcc caagctcctc cggggctgat accaacagca
     1921 cccccagcag atccagcagt ggatctattg gaaagatata tgcagcaggg gagagagcag
     1981 agggagcaga gggagagacc atacaaagag gtgacggaag acttactgca cctcgagcag
     2041 ggaaaagcac ctcacagaga ggcgacagag gacttgctgc acctcaattc tctctttgga
     2101 aaagaccagt agtcacagca tacatcgagg tacagccggt agaagtctta ttagacacag
     2161 gggctgatga ctcaatagtg gcaggaatac agttagggga caattacgtt ccaaaaatag
     2221 tggggggaat agggggattc ataaacacca aagagattaa aaatatagaa ataaaagtac
     2281 taaataaaag agtaagagcc accataatga caggagatac cccaatcaac atctttggca
     2341 gaaatatcct gacagcctta ggcatgtcat taaatttacc aatcgccaag atagaaccaa
     2401 taaaagtaac gttgaagcca gggaaagatg gaccaagact gagacaatgg cccctaacaa
     2461 aagagaaaat agaagcacta agagagatct gcgaaaaaat ggaaaaggag ggccaactag
     2521 aagaggcacc tccaactaat ccttataata cccccacatt tgcaattaag aaaaaggaca
     2581 agaacaaatg gaggatgcta atagacttta gagaactaaa cagggtgact caagatttca
     2641 cagaaattca gctaggaatt ccacacccgg caggattagc caaaaagaaa aggatcactg
     2701 tactagatgt aggggatgcc tacttttcca taccgctaca tgaagatttt aggcagtata
     2761 ctgcatttac cctaccatca gtaaacaatg cagaaccagg aaaaagatat atatataagg
     2821 tcttaccaca aggatggaag ggatcaccag caatttttca acacacaatg aggcaagtct
     2881 tagaaccttt cagaaaagca aacccagatg tcattctcat ccaatacatg gatgatattt
     2941 taatagctag tgacaggaca ggtttagagc atgacaaagt agtcctgcag ctaaaagaac
     3001 ttctgaatgg cctagggttc tctaccccag atgagaagtt ccaaaaagac cctccgcttc
     3061 aatggatggg ctatgaacta tggccaacta aatggaagct gcagaaatta caactgcccc
     3121 agaaagaaat atggacagtc aacgacatcc agaagctggt gggagtctta aattgggcgg
     3181 cacaaatcta tccaggaata aaaaccaaac atttgtgtag gctaatcaaa ggaaaaatga
     3241 cactcacaga agaagtgcag tggacagagt tagcagaagc agagctagaa gaaaacaaaa
     3301 ttatcttaag ccaggaacaa gaaggatact attaccaaga agaaaaagaa ttagaggcaa
     3361 caatccaaaa aaatcaagac aatcagtgga catacaaaat acaccaggaa gagaaaatcc
     3421 taaaagtagg gaaatatgca aagataaaaa atacccatac caatggggtc agattactag
     3481 cacaggtagt tcagaaaata ggaaaagaag cactagtcat ctggggacgg ataccaaaat
     3541 tccacttacc ggtggagaga gagacctggg agcagtggtg ggataactac tggcaagtga
     3601 catggatccc agagtgggac ttcgtatcta ccccaccact agtcaggtta acatttaacc
     3661 tagtaggaga tcctatacca ggcgcagaga ccttctacac agatggatca tgcaatagac
     3721 agtcaaagga aggaaaagca cgatatgtga cagatagagg aagagacaag gtaagagtac
     3781 tagaacgaac taccaatcag caagcagaat tagaagcctt tgcgatgaca ctgacagact
     3841 cgggtccaaa agttaatatc atagtagact cacagtatgt aatgggaata gtagtaggcc
     3901 aaccaacaga gtcagaaagt agaatagtaa accaaatcat tgaggacatg ataaagaaag
     3961 aagcagtcta tgtcgcgtgg gtcccagccc ataaaggcat aggaggaaac caggaagtag
     4021 accatttagt aagtcagggc atcagacaag tattgttcct agaaaggata gagcccgccc
     4081 aagaagaaca tgaaaaatat catagcaata tgaaagaact aacccataaa tttggaatac
     4141 cccaactagt agcaagacag atagtaaaca catgtgccca atgccaacag aaaggagagg
     4201 ccatacatgg gcaggtaaat gcagaaatag gtgtctggca aatggactgc acacacttag
     4261 aaggaaagat cattatagta gcagtacatg ttgcaagtgg attcatagaa gcagaagtta
     4321 tcccacagga atcaggaagg cagacagcgc tcttcctatt aaaactggcc agtaggtggc
     4381 caataacaca cttacacaca gacaatggct ccaacttcac ttcacaggaa gtgaagatgg
     4441 tggcatggtg gataggtata gagcaatcct ttggagtacc ttacaaccca caaagccagg
     4501 gagtagtaga agcaatgaat caccacttaa agaatcagat aagtagaatt agagaacagg
     4561 caaatacaat agaaaccata gtactaatgg cagttcattg catgaatttt aaaagaaggg
     4621 gaggaatagg ggatatgacc ccagcagaaa gactaatcaa catgattacc acagaacaag
     4681 aaatacaatt cctccaaaga aaaaattcaa attttaaaaa cttccaggtc tattacagag
     4741 aaggcagaga tcagctgtgg aaaggacctg gagaactact gtggaaggga gacggagcag
     4801 tcatagtcaa ggtaggggca gacataaaag taataccaag aaggaaggcc aagattatca
     4861 gagactatgg aggaaggcaa gaactggata gttcccacct ggagggtgcc agggaggagg
     4921 atggagaggt ggcatagtct tgtcaagtac ctaaaataca gaacaagaga cctagaggag
     4981 gtgcgctatg ttccccacca caaggtggga tgggcgtggt ggacttgcag cagggtaata
     5041 ttccctctaa aaggagaaag tcatctagag atacaggcat attggaacct aacgccagaa
     5101 aaaggatggc tctcctctca ttcagtgaga ataacctggt atacagaaag gttctggaca
     5161 gatgttaccc cagactatgc agacatccta atacatagca cttatttctc ttgctttacg
     5221 gcaggtgaag taagaagagc catcagaggg gaaaagttat tgtcctgctg caactatccc
     5281 caagctcata aagtacaggt accatcactt caatacctag ccctagtggt agtgcaacaa
     5341 aatgacagac cccagagaaa gggtaccgcc aggaaacagt ggagaagaga ccattggaga
     5401 ggccttcgag tggctagaca ggactataga agccttaaac agagaggcag tgaaccatct
     5461 gccccgagag ctcattttcc aggtgtggca aaggtcctgg agatactggc atgatgacca
     5521 agggatgtca ccaagctaca caaagtatag atatttgtgc ctgatgcaga aagctgtatt
     5581 tatacatttc aagagagggt gcacttgcct ggggggagga catggaccgg gaggatggag
     5641 atcaggacct ccccctcctc cccctccagg tctagtctaa tgactgaagc accaacagag
     5701 tttcccccag aagatgggac cccacggagg gaactagggg gtgactgggt aataagaatt
     5761 ctgggagaaa taaaggaaga agccttaaag cattttgatc cccgcttgct aatcgctctt
     5821 ggcaactata tccatagtag acatggagac acacctgaag gcgccagaga gctcattaga
     5881 atcctacaac gagccctctt cgtgcacctc agagcagggt gtaaccgctc aagaattagc
     5941 caaacaagga gaagaactcc tttcccagct gcaccgaccc ctagaggcat gtactaactc
     6001 atgctattgt aagcagtgca gcttccattg ccagctgtgt ttcttgaaaa aagggctcgg
     6061 gatatggtat gcacgaaaaa gcagacggag aagaactcca agaaagacta agactcattc
     6121 gtcttctgca tcagacaagt gagtaagatg tgtggtaaga gtctactatg tgttgccagc
     6181 ttgctagcta gtgcttactt agtatattgc acccaatatg tgactgtttt ctatggcgtg
     6241 cccgtgtgga gaaatgcatc cattcccctc ttttgtgcaa ctaaaaacag agatacttgg
     6301 ggaaccatac agtgcaagcc agacaatgat gattatcaag aaataacctt gaatgtgaca
     6361 gaggctttcg acgcatggga taatacagta acagaacaag cagtagaaga tgtctggagt
     6421 ctatttgaga catcaataaa accatgcgtc aaactaacac ctctatgtgt agcaatgagt
     6481 tgtaacagca ccactaacaa cacaaccaca acagggagca caacagggat gtcagagata
     6541 aatgaaactt ctccatcata ctcagacaac tgcacaggat tggggaagga agagatagtc
     6601 aattgtcagt tctatatgac aggattagag agagataaga aaaaacagta taatgaaaca
     6661 tggtactcaa aagatgtagt ttgtgaatca aataatacca aagatgggaa aaacagatgt
     6721 tacatgaacc attgcaacac atcagtcatc acagagtcat gtgataagca ctattgggat
     6781 gctataaagt ttagatactg tgcaccaccg ggttatgccc tactaagatg caatgatacc
     6841 aattattcag gcttcgagcc aaagtgttct aaagtagtag cttctacatg cacaaggatg
     6901 atggaaacgc aaacctccac ttggtttggc tttaatggca ctagggcaga gaatagaaca
     6961 tatatctatt ggcatggcag agataatagg actatcatca gcttaaacaa gtattataac
     7021 ctttctatac attgtaagag accaggaaac aagacagttg taccaataac actcatgtca
     7081 gggttagtgt ttcactccca gccaatcaat acaagaccta ggcaagcatg gtgctggttc
     7141 aaaggcaagt ggagggaagc catgcaggag gtaaagcaaa cccttataaa acatcccagg
     7201 tataaaggaa ccaacgacac aaagaatatt aactttacaa aaccaggaag aggctcagac
     7261 ccagaagtgg catacatgtg gactaactgc agaggagaat ttctctactg caacatgact
     7321 tggttcctca attgggtaga aaacagaccg aatcagacac agcacaatta tgcgccgtgc
     7381 catataaggc aaataattaa tacctggcac aaggtaggga aaaatgtata tttgcctcct
     7441 agagaaggac agttgacctg caactcaaca gtgaccagca taattgctaa cattgacgta
     7501 aatagtaacc agacaaatat tacctttagt gcagaggtgg cagaactata ccgattagaa
     7561 ttgggagatt ataaattaat agaagtaaca ccaattggct tcgcacctac gagagagaaa
     7621 agatattcct cggctccagt gaggaacaaa agaggtgtgt ttgtgctagg gttcttgggt
     7681 tttctcgcga cagcaggttc tgcaatgggc gcggcgtccc tgacgctgtc ggctcagtcc
     7741 cggactttac tggccgggat agtgcagcaa cagcaacagc tgttggacgt agtcaagaga
     7801 caacaagaaa tgttgcgact gaccgtctgg ggaacgaaaa atctccaggc aagagtcact
     7861 gctatcgaga agtacttaaa ggaccaggca cagttaaatt catgggggtg tgcgtttagg
     7921 caggtctgcc acactactgt accatgggta aatgattctt tatcgcctga ctggaacaat
     7981 atgacatggc aggaatggga aaaacaagtc cgctacctag aggcaaatat cagtcaaagt
     8041 ttagaacagg cacaaattca acaagaaaag aatatgtatg aattacaaaa gctaaatagc
     8101 tgggatgtct ttggcaactg gtttgattta acctcctgga tcaagtatat tcaatatgga
     8161 gtttatatag tagtaggagt gatagtttta agaatagcaa tatatatagt gcaattgtta
     8221 agtagactta gaaagggcta taggcctgtt ttctcttccc ctcccggtta tctccaacag
     8281 atccatatcc acacggaccg gggacagcca gccaacgaag gaacagaaga agacgacaga
     8341 gacgacgatg gttacgactt gtagccttgg ccgataaact atatacattt cctgatccac
     8401 ctactgactc gcctcttgac cgggctatac aagatctgca gagacttact atccacgaac
     8461 tccccgaccc accgactgat ctcccagaat ctaacagcaa tcagggactg gctgagactt
     8521 aaagcggcct acctgcaata tgggggcgag tggatccaag aagcattcca agcattcgca
     8581 aagactacga gagagactct tgcgagcgca tggggggggt tatgtgcagc agtgcaacgc
     8641 gtcgggaggg gaatactcgc agtcccaaga aggatcaggc aaggggcaga aatcgccctc
     8701 ctgtgaggga cagcagtatc ggcagggaga tttcatgaac accccatgga gaaccccggc
     8761 aatagaagga cagaaaaagt tatacaagca gcaaaatatg gatgatatag attctagtga
     8821 tgatgaccta gtgggagttc ctgttacacc aagagtacca ctaagagcaa tgacctataa
     8881 attggcagta gatatgtcac attttataaa gaaaagggga ctggatggga tgttttacag
     8941 tagggataga catagaatcc tagacttgta cctagaaaag gaggaaggga taataccaga
     9001 ttggcagaat tatactcatg ggccaggagt aaggtaccca atgtgtttcg ggtggctatg
     9061 gaagctagta ccagtagatg tctcacaaga ggcagaggac gatgagacca actacttaac
     9121 acacccagca caaacaagca gacatgatga cgagcatgga gaaacactac tttggcggtt
     9181 tgaccccacg ctggcctatg attacaaggc tttcattctg catccagaag agtttgggca
     9241 caagtcagga ttgccagaga aagagtggaa ggcaaaactg aaagcaagag ggataccata
     9301 tagttaacaa caggaacagc catacttggt caaggcagga agtagctact gagaacagct
     9361 gaggctgcag ctgcagggac tttccagaag gggctgtaac caagggaggg acatgggagg
     9421 agctggtggg gaacgccctc atacttactg tataaatgta ctcgcttctt acattgtatt
//
LOCUS       HIV2NIHZA    9431 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Human immunodeficiency virus type 2 (HIV-2) proviral gag, pol, vif,
            vpX, vpR, tat, rev, env and nef genes.
ACCESSION   J03654
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 2 (isolate FG) proviral DNA,
            clone NIHZ.
  ORGANISM  Human immunodeficiency virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9431)
  AUTHORS   Zagury,J.F., Franchini,G., Reitz,M., Collalti,E., Starcich,B.,
            Hall,L., Fargnoli,K., Jagodzinski,L., Guo,H.G., Laure,F. and et,al.
  TITLE     Genetic variability between isolates of human immunodeficiency
            virus (HIV) type 2 is comparable to the variability among HIV type
            1
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 5941-5945 (1988)
  STANDARD  full staff_review
COMMENT     Hard copy of sequence kindly provided by G. Franchini (August
            1987).
            
            The NIHZ clone has a large deletion in the nef cds and LTR relative
            to other HIV2 sequences; nevertheless, it is biologically active.
FEATURES       from  to/span     description
    pept        547     2106     gag polyprotein
    pept    <  1824     4931     pol polyprotein (NH2 terminus uncertain; AA at
                                 1824)
    pept       4861     5508     vif protein
    pept       5336     5674     vpX protein
    pept       5674     5991     vpR protein
    pept       5837     6132     tat protein, exon 2 (first expressed exon)
               8272     8368     tat protein, exon 3 (AA at 8273)
    pept       6063     6132     rev protein, exon 2 (first expressed exon)
               8272     8525     rev protein, exon 3 (AA at 8274)
    pept       6139     8709     env polyprotein
    pept       8543     9085     nef protein
    LTR     <     1      301     5' LTR
    LTR        8928  >  9431     3' LTR
    rpt     <     1      177     R repeat 5' copy
    rpt        9254     9431     R repeat 3' copy
    binding     305      321     primer (Lys-tRNA) binding site
    signal     9404     9409     poly-A signal
BASE COUNT     3213 a   1913 c   2348 g   1957 t
ORIGIN      Cap site of genomic RNA.
        1 agtcgctctg gcagaggctg gcagattgag ccctgggagg ttctctccag cactagcagg
       61 tagagcctgg gtgttccctg ctagactctc accagtgctt ggccggcgct gggcagacgg
      121 ctccacgctt gcttgcttaa agacctctca ataaagctgc caattagaag caagttaagt
      181 gtgtgctccc atctctccta gtcgccgcct ggtcatccgg tgttcacctg tataacaaga
      241 ccctggtctg ttaggaccct tcttgctttg ggaaaccgag gcaggcggaa aatccctagc
      301 aggttggcgc ccgaacagga cttgaagaag actgagaagc cttggaacaa gggctgggtg
      361 aaggcagtaa gggcggcagg aacaaaccag cacggagtgc tcctagaaag gcgcgggccg
      421 aggtaccaaa gcggcgtgtg gacgggagtc gagaggcctc cgggtgaagg gtaagtacct
      481 acaccaaaaa ctgtagccaa agagggcttg ttatcctacc tttagacagg tagaagattg
      541 tgggagatgg gcgcgagaaa ctccgtcttg agagggaaaa aagcagatga attagaaaaa
      601 attaggttac ggcccggcgg aaagaaaaag tacaagttaa aacatattgt gtgggcagcg
      661 aatgaattgg acagattcgg attagcagag agcctgttgg agtcaaaaga aggttgccaa
      721 aaaattctta cagttttaga tccattagta ccgacaggtt cagaaaattt aaaaagcctt
      781 tttaatactg tctgcgtcat ctggtgcata cacgcagaag agaaagtgaa agatactgaa
      841 ggagcaaaac aaatagtaca gagacatcta gtggcagaaa caggaactgc agagaaaatg
      901 ccaaatacaa gtagaccaac agcaccacct agcgggaaaa acttccccgt gcagcaagta
      961 gctggcaact atacccatat accactgagt cccggaactc taaatgcttg ggtaaaatta
     1021 gtggaggaaa agaaattcgg ggcagaagta gtgccaggat ttcaggcact ctcagaaggc
     1081 tgcacgccct atgatattaa tcagatgctt aattgtgtgg gcgaccatca agcagctatg
     1141 caaataatca gggaaattat taatgaggaa gcagcagact gggatgttgc acatcccata
     1201 ccaggcccct taccagcggg gcagcttaga gagccaagag ggtctgacat agcagggaca
     1261 acaagcacag tagaagagca gatccagtgg atgtttagac cacaaaatcc tgtaccagta
     1321 ggcaacatct ataggagatg gattcagata ggactgcaaa agtgcgtcag gatgtacaat
     1381 ccaaccaaca tcctagacat aaatcaagga ccaaaggaac cattccaaag ctatgtagat
     1441 aggttctaca aaagcttgag ggcagaacaa acagatccag cagtaaaaaa ttggatgacc
     1501 caaacactgc taatacagaa tgccaaccca gattgtaaat tagtactaaa aggattagga
     1561 atgaatccta ccttagaaga aatgctgacc acctgtcagg gggtaggggg accaggccag
     1621 aaagctagat taatggcaga agccctaaag gaggtcatgg caccagcccc catcccattt
     1681 gcagcagccc agcagagaaa gacattcaag tgctggaatt gtggaaagga aggacactcg
     1741 gcaagacaat ggagcgcgcc tagaagacag ggctgctgga agtgtggtaa gtcaggacat
     1801 gtcatggcaa attgcccaga tagacaggct ggttttttag ggattggcca ctggggaaag
     1861 aaggcccgca acttccccgc ggcccaagtc ccgcaggggc taacaccaac agcaccccca
     1921 ttggatccag cagtggacct actggagaaa tatatgcagc aaggaaaaag gcaaagggag
     1981 cagagagaga gaccgtacaa ggaagtgaca gaggacttac tgcgtttcga gcaggcagag
     2041 acaccatgca gggagacgac agaggacttg ctgcacctca attctctctt tggaaaagac
     2101 cagtagtcac agcacacatt gagggtcagc cagtagaagt tttgttagac acaagggcta
     2161 acgactcaat agtagcagga atagagttag ggagcaatta tagtccaaaa atagtaggag
     2221 gaataggggg attcataaat accaaggaat ataaaaatgt agaaatagaa gtcctaggta
     2281 aaagggtaaa agccaccata atgacaggtg ataccccgat caacattttt ggcagaaatg
     2341 ttctgacagc tttaggcatg tcattaaacc tgccagttgc caagatagaa ccaataaaaa
     2401 taatgctaaa gccagggaaa gatggaccaa gactaaaaca atggccttta acaaaagaaa
     2461 aaatagaagc actaaaagaa atctgtgaaa aaatggaaaa agaaggccag ctagaagaag
     2521 cacctccaac taatccttat aataccccca catttgcaat caagaaaaag gacaaaaaca
     2581 aatggagaat gctaatagac tttagggagc taaacaaggt aactcaggat ttcacagaaa
     2641 ttcagttagg aattccacac ccagcaggat tggccaaaaa gaggagaatt actgtactag
     2701 atgtagggga tgcttacttt tccataccac tacatgagga ctttagacag tatactgcat
     2761 tcactctacc atcagtaaac aatgcagaac caggaaaaag atatatatac aaagtcctac
     2821 cacaggggtg gaaggggtca ccagcaattt ttcaatacac aatgaggcag atcttagaac
     2881 cattcagaaa agcaaacgag gatgtcatta tcattcagta catggatgat atcttaatag
     2941 ccagcgacag gacagactta gaacatgaca aagtggtcct gcagctaaaa gaacttctaa
     3001 atggactagg attttccacc ccagatgaga aattccagaa agaccctcca tatcgctgga
     3061 tgggctatga attatggcca actaaatgga agttgcaaaa aatacagctg ccccaaaaag
     3121 aagtatggac agtcaatgac atccaaaagc tagtgggtgt cctaaattgg gcagcacaaa
     3181 tctacccagg gataaagacc aaacacctat gtaggctaat tagaggaaaa atgacactca
     3241 cagaagaggt acagtggaca gaattagcag aagcagagct agaggaaaac agaattatct
     3301 taagccagaa acaagaagga cactattacc aggaagaaaa aaagttagaa gcaacagtcc
     3361 agaaagatca agacaatcag tggacatata aggtacacca gggggagaaa attctcaaag
     3421 gtgggaaaat atgcaaagat aaaaaatacc cataccaacg ggtcagattg ttagcacagg
     3481 tagttcaaaa aataggaaaa gaagcactag tcatttgggg acggatacca aaatttcacc
     3541 taccagtaga gagagatacc tgggagcagt ggtgggataa ctactggcaa gtaacatgga
     3601 tcccagactg ggactttgta tctaccccac cactggtcag gctagcattt aacctggtag
     3661 gagagcctgt accaggcgca gaaactttct acacagatgg atcctgcaat aggcagtcaa
     3721 aagaagggaa agcaggatat ataacagata gagggagaga cagggtaaaa gtattagagc
     3781 aaactaccaa tcagcaagca gaattagaag cctttgcaat ggcactaaca gactcaggtc
     3841 caaaagctaa tattatagta gactcacaat atgtaatggg gatagtagca ggccaaccaa
     3901 cagagtcaga gaatagaata gtaaatcaga tcatagaaga aatgataaag aaagaagcaa
     3961 tctatgttgc gtgggtccca gcccacaaag gcataggagg aaaccaggaa gtagatcatt
     4021 tagtaagtca gggcattaga caagtattat tcctagagaa aatagaaccc gcgcaggaag
     4081 aacatgaaaa atatcatagc aatataaaag aactgtccca taaatttgga atacccaagc
     4141 tagtggcaag acaaatagta aacacatgtg cccatgtgca acagaaaggg gaggctatac
     4201 atgggcaagt aaatgcagaa ctaggcactt ggcaaatgga ctgcacacat ttagaaggaa
     4261 aagtcattat agtagcagta catgttgcaa gtggatttat agaggcagaa gttatcccac
     4321 aggaatcagg aaggcaaaca gcactgttct tactgaaact ggccagtagg tggccaataa
     4381 cacacctgca cacagataat ggtgccaact tcacttcaca ggaagtgaaa atggtagcat
     4441 ggtgggtagg tatagaacaa acctttggag tgccttacaa tccacaaagc caaggagtag
     4501 tagaagcaat gaatcatcat ctaaaaaatc agatagacag aattagagag caggcaaata
     4561 cagtagaaac aatagtatta atggcagttc attgcatgaa ttttaaaaga aggggaggaa
     4621 taggggatat gaccccagca gaaagaataa tcaatatgat caccacagaa caagaaatac
     4681 aattcctcca agcaaaaaat tcaaaattaa aaaattttcg ggtctatttc agagaaggca
     4741 gagatcagct gtggaaagga cctggggaac tgctgtggaa gggagacgga gcagtcatag
     4801 tcaaggtagg gacagaaata aaagtagtac caagaaggaa agccaagatc attaaagact
     4861 atggaggaag gcaagagatg gatagtggtt cccatttgga gggtgccagg gaggatggag
     4921 agatggcata gcttggtcaa gtacttgaaa tacagaacaa aagacttaga gaaggtatgc
     4981 tatgttcccc atcataaggt ggggtgggca tggtggactt gcagcagggt aatattccca
     5041 ctgaaagaga acagtcatct ggagatacaa gcatattgga acttaacacc agaaaaagga
     5101 tggctctcct ctcactcagt gagaataact tggtatacag aaaagttctg gacagatgtt
     5161 accccagact gtgcagatac cctaatacat agcacttatt tctcttgctt tacggcaggt
     5221 gaagtaagaa gagccatcag aggggaaaaa ttattgtcct gctgcaaata cccccgagcg
     5281 catagatccc aggtaccgtc acttcaattt ctggccttag tggtagtgca acaaaatgac
     5341 agatcccagg gaaacagtgc caccaggaaa cagcggcgag gagactatcg aagaggcctt
     5401 cgcatggcta gacaggacag tagaggctat aaacagagag gcagtgaatc accacccacg
     5461 agagctcatt ttccaggtct ggcagaggtc ctggagatac tggcatgatg aacaagggat
     5521 gtcaacaagc tacacaaaat atagatattt gtgcttaata cagatggcaa tgtacatgca
     5581 tgctaagaga gatggcactt gcctgggggg aggcatgggc cagaagggtg gagatcaggg
     5641 acctccccct cctccacctc caggtttagt ctaatgactg aagcaccaac agaactcccc
     5701 ccggaggata ggaccccacc gagggagcca ggggatgcat gggtaataga aatcctgaga
     5761 gaaatagaag aagaagcttt aaggcatttt gaccctcgct tgctaactgc tcttggcaga
     5821 tatatctata ctaggcatgg agacaccctt gaaggcgcca gagagctcat tagaatcctg
     5881 caacgagccc tcttcgcgca cttcagagca ggatgtggcc actcaagaat tggccagaca
     5941 aggggaggaa atcctctctc agctataccg acccctagag gcatgcacta actcctgcta
     6001 ttgcaagaaa tgctgctacg attgccagct gtgtttttta caaaaggggc tcgggatatg
     6061 gtatgaccga aagggcagac gaagaaggac tccaaagaaa actaaggctc atccgtcttc
     6121 tgcatcagac aagtgagtat gaagggtagt aagaatcaac tgctgattgc tattgtacta
     6181 gctagtgctt acctaataca ttgcaagcaa tttgtgactg ttttctacgg catacccgcg
     6241 tggaggaatg catccattcc cctgttttgt gcaaccaaaa atagagatac ttggggaaca
     6301 atacagtgct taccagacaa tgatgattat caggaaataa ctctaaatgt gacagaggct
     6361 ttcgatgcat ggaataatac agtaacagaa caagcagtag aggatgtctg gaatctattt
     6421 gagacatcaa taaaaccatg tgtcaaatta acacccttat gtgtagcaat gaactgtaca
     6481 aggaacatga ccacatggac agggaggaca gacacccaaa atatcacaat tataaatgac
     6541 acttcccatg cgcgtgcaga caactgcaca ggattaaagg aggaagaaat gatcgactgt
     6601 cagtttagta tgacaggatt agagagagac aagagaaaac agtatactga agcatggtac
     6661 tcaaaagatg tggtttgtga caataacacc tcaagtcaga gcaagtgtta catgaaccat
     6721 tgcaacacat cagtcatcac agagtcatgt gataagcact attgggatgc tatgaggttt
     6781 agatactgtg caccaccggg ttttgcctta ctaagatgca atgataccaa ttattcaggc
     6841 tttgcaccta actgctctaa agtagtagct gctacatgca ccagaatgat ggaaacgcaa
     6901 acttctacat ggtttggctt taatggcact agagcagaaa atagaacata tatctattgg
     6961 catggtaaag ataacagaac tattatcagc ttaaataact tttataatct cactatgcat
     7021 tgtaagaggc cgggaaataa gacagtgtta ccaataacat ttatgtcagg gtttaagttt
     7081 cactcccagc cagtcatcaa taaaaaaccc aggcaagcat ggtgttggtt cgaaggccaa
     7141 tggaaggaag ccatgcagga ggtgaaggag acccttgcga aacatcccag atataaaggg
     7201 aacaggagcc gcacagagaa tattaaattt aaagcaccag ggagaggctc agacccagaa
     7261 gtaacataca tgtggactaa ctgcagagga gaatctctct actgcaacat gacttggttc
     7321 ctcaattggg tagagaacag gacgggtcag aaacagcgca attatgcacc gtgccgtata
     7381 aggcaaataa ttaatacctg gcacagggta gggaaaaatc tatatttgcc tcccagggaa
     7441 ggggagttga cctgcaactc aacggtgacc agcataattg ccaacattga tgcgggagat
     7501 caaacaaata ttacctttag tgcagaggcg gcagaactat accgattgga attgggggat
     7561 tacaaattag tagaaatcac accaattggc ttcgcaccta catcagtaaa gagatactcc
     7621 tctgctcacc agagacatac aagaggtgtg tttgtgctag ggttcctagg ttttctcgca
     7681 acagcaggtt ctgccatggg cgcggcgtcc ttgacgctgt cagctcagtc tcggacttta
     7741 ttggccggga tagtgcagca acagcaacag ctgttggatg tggtcaagag acaacaagaa
     7801 atgttgcgac tgaccgtctg gggaacaaaa aatctccagg caagagtcac tgctatagag
     7861 aagtacctaa aggaccaggc gcagctaaat tcatggggat gtgcgtttag acaagtctgc
     7921 cacacttctg taccatgggt aaatgatacc ttgacacctg attggaacaa tatgacgtgg
     7981 caggaatggg aacaaaaagt ccgctacctg gaggcaaata tcagtcaaag tctagaacaa
     8041 gcacaaattc agcaagaaaa gaatatgtat gagctacaaa aattaaatag ctgggatgtt
     8101 tttaccaatt ggcttgactt cacctcctgg gtcaggtata ttcaatatgg agtttatgta
     8161 gtagtaggaa tagtagcttt aagaatagta atatatatag tacagatgtt aagtagactt
     8221 agaaagggct ataggcctgt tttctcttcc ccccccggtt acatccaaca gatccatatc
     8281 cacaaggacc aggaacagcc agccagagaa gaaacagaag aagacgttgg aagcaacggt
     8341 ggagacagat cttggccttg gccgatagca tatatacatt tcctgatccg cctgctgatt
     8401 cgcctcttga ccgggctata caacatctgc agggacttac tatccaggat ctccccgatc
     8461 ctccaaccaa tcttccagag tctccagaga gcactaacag caatcagaga ctggctgagg
     8521 cttaaagcag cctacctgca gtatgggtgc gagtggatcc aagaagcgtt ccaagccctt
     8581 gcaaggacta caagagagac tcttgcaggc gcggggagag acttgtggag ggcgctgcaa
     8641 cgaatcggga gggggatact tgcagtccca cgaaggatca ggcagggagc agaactcgcc
     8701 ctcctgtgag ggacagcgat atcagcaggg agattttgta aataccccat ggagaacccc
     8761 agcagcagaa agggagaaag aattgtacaa acagcaaaat atggatgatg tagatctaga
     8821 tgatgatgac caagtaggat tccctgtcac accaagagta ccattaagac caatgacatt
     8881 caaattggca gtagatatgt ctcattttat aaaagaaaaa gggggactgg aagggctgtt
     8941 ttatagtcag agaagacata gaatcttaga cttatactta gacaaggctt ttactctgta
     9001 cccagaggaa tttgggcata attcaggact gccagagaaa gagtggaagg cgagactgaa
     9061 agcaagggga ataccattta gttaaaaaca ggaacagcta tacttggtca gggcaggaag
     9121 tagctactga gaacagctga gactgcaggg actttccaga aggggctgta accaggagag
     9181 ggacatggga ggagctggtg gggaacgccc ttcatacttt ctgtataaat gtacccgctg
     9241 cttgcattgt attcagtcgc tctggcagag gctggcagat tgagccctgg gaggttctct
     9301 ccagcactag caggtagagc ctgggtgttc cctgctagac tctcaccagt gcttggccgg
     9361 cgctgggcag acggctccac gcttgcttgc ttaaagacct ctcaataaag ctgccaatta
     9421 gaagcaagtt a
//
LOCUS       HIV2RODZ     9671 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Human immunodeficiency virus type 2, complete proviral genome.
ACCESSION   M15390
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 2 (isolate ROD) proviral DNA,
            clones lambda-ROD[27,35].
  ORGANISM  Human immunodeficiency virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 9361 to 9671)
  AUTHORS   Clavel,F., Guyader,M., Guetard,D., Salle,M., Montagnier,L. and
            Alizon,M.
  TITLE     Molecular cloning and polymorphism of the human immunodeficiency
            virus type 2
  JOURNAL   Nature 324, 691-695 (1986)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 9671)
  AUTHORS   Guyader,M., Emerman,M., Sonigo,P., Clavel,F., Montagnier,L. and
            Alizon,M.
  TITLE     Genome organization and transactivation of the human immuno-
            deficiency virus type 2
  JOURNAL   Nature 326, 662-669 (1987)
  STANDARD  full staff_review
COMMENT     The ROD isolate was obtained from offshore Senegal in 1985. Its
            sequence was determined from two clones: bases 1 to 2658 are from
            ROD 27 and bases 2659 to 9671 are from ROD 35.  An infectious clone
            of this virus has been constructed by Keith Peden, Molecular
            Biology and Genetics, Johns Hopkins University School of Medicine,
            Baltimore, MD. 21205 (301)955-3652.
FEATURES       from  to/span     description
    pept        546     2114     gag polyprotein
    pept    <  1829     4939     pol polyprotein (NH2 terminus uncertain; AA at
    pept       4869     5516     vif protein
    pept       5344     5682     vpX protein
    pept       5682     5999     vpR protein
    pept       5845     6140     tat protein, exon 2 (first expressed exon)
               8307     8403     tat protein, exon 3 (AA at 8308)
    pept       6071     6140     rev protein, exon 2 (first expressed exon)
               8307     8539     rev protein, exon 3 (AA at 8309)
    pept       6147     8723     env polyprotein
    pept       8557     9327     nef protein
    LTR     <     1      299     5' LTR
    LTR        8942  >  9671     3' LTR
    rpt     <     1      173     R repeat 5' copy
    rpt        9497     9671     R repeat 3' copy
    binding     303      319     primer (Lys-tRNA) binding site
    signal     9649     9654     poly-A signal
BASE COUNT     3314 a   1972 c   2401 g   1984 t
ORIGIN      Cap site of genomic RNA.
        1 ggtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
       61 gtagagcctg ggtgttccct gctagactct caccagcact tggccggtgc tgggcagacg
      121 gccccacgct tgcttgctta aaaacctctt aataaagctg ccagttagaa gcaagttaag
      181 tgtgtgctcc catctctcct agtcgccgcc tggtcattcg gtgttcacct gagtaacaag
      241 accctggtct gttaggaccc ttcttgcttt gggaaaccga ggcaggaaaa tccctagcag
      301 gttggcgcct gaacagggac ttgaagaaga ctgagaagtc ttggaacacg gctgagtgaa
      361 ggcagtaagg gcggcaggaa caaaccacga cggagtgctc ctagaaaggc gcgggccgag
      421 gtaccaaagg cagcgtgtgg agcgggagga gaagaggcct ccgggtgaag gtaagtacct
      481 acaccaaaaa ctgtagccga aagggcttgc tatcctacct ttagacaggt agaagattgt
      541 gggagatggg cgcgagaaac tccgtcttga gagggaaaaa agcagatgaa ttagaaagaa
      601 tcaggttacg gcccggcgga aagaaaaagt acaggctaaa acatattgtg tgggcagcga
      661 ataaattgga cagattcgga ttagcagaga gcctgttgga gtcaaaagag ggttgtcaaa
      721 aaattcttac agttttagat ccaatggtac cgacaggttc agaaaattta aaaagtcttt
      781 ttaatactgt ctgcgtcatt tggtgcatac acgcagaaga gaaagtgaaa gatactgaag
      841 gagcaaaaca aatagtgcgg agacatctag tggcagaaac aggaactgca gagaaaatgc
      901 caagcacaag tagaccaaca gcaccatcta gcgagaaggg aggaaattac ccagtgcaac
      961 atgtaggcgg caactacacc catataccgc tgagtccccg aaccctaaat gcctgggtaa
     1021 aattagtaga ggaaaaaaag ttcggggcag aagtagtgcc aggatttcag gcactctcag
     1081 aaggctgcac gccctatgat atcaaccaaa tgcttaattg tgtgggcgac catcaagcag
     1141 ccatgcagat aatcagggag attatcaatg aggaagcagc agaatgggat gtgcaacatc
     1201 caataccagg ccccttacca gcggggcagc ttagagagcc aaggggatct gacatagcag
     1261 ggacaacaag cacagtagaa gaacagatcc agtggatgtt taggccacaa aatcctgtac
     1321 cagtaggaaa catctataga agatggatcc agataggatt gcagaagtgt gtcaggatgt
     1381 acaacccgac caacatccta gacataaaac agggaccaaa ggagccgttc caaagctatg
     1441 tagatagatt ctacaaaagc ttgagggcag aacaaacaga tccagcagtg aagaattgga
     1501 tgacccaaac actgctagta caaaatgcca acccagactg taaattagtg ctaaaaggac
     1561 tagggatgaa ccctacctta gaagagatgc tgaccgcctg tcagggggta ggtgggccag
     1621 gccagaaagc tagattaatg gcagaggccc tgaaagaggt cataggacct gcccctatcc
     1681 cattcgcagc agcccagcag agaaaggcat ttaaatgctg gaactgtgga aaggaagggc
     1741 actcggcaag acaatgccga gcacctagaa ggcagggctg ctggaagtgt ggtaagccag
     1801 gacacatcat gacaaactgc ccagatagac aggcaggttt tttaggactg ggcccttggg
     1861 gaaagaagcc ccgcaacttc cccgtggccc aagttccgca ggggctgaca ccaacagcac
     1921 ccccagtgga tccagcagtg gatctactgg agaaatatat gcagcaaggg aaaagacaga
     1981 gagagcagag agagagacca tacaaggaag tgacagagga cttactgcac ctcgagcagg
     2041 gggagacacc atacagggag ccaccaacag aggacttgct gcacctcaat tctctctttg
     2101 gaaaagacca gtagtcacag catacattga gggtcagcca gtagaagtct tgttagacac
     2161 aggggctgac gactcaatag tagcaggaat agagttaggg aacaattata gcccaaaaat
     2221 agtaggggga atagggggat tcataaatac caaggaatat aaaaatgtag aaatagaagt
     2281 tctaaataaa aaggtacggg ccaccataat gacaggcgac accccaatca acatttttgg
     2341 cagaaatatt ctgacagcct taggcatgtc attaaatcta ccagtcgcca aagtagagcc
     2401 aataaaaata atgctaaagc cagggaaaga tggaccaaaa ctgagacaat ggcccttaac
     2461 aaaagaaaaa atagaagcac taaaagaaat ctgtgaaaaa atggaaaaag aaggccagct
     2521 agaggaagca cctccaacta atccttataa tacccccaca tttgcaatca agaaaaagga
     2581 caaaaacaaa tggaggatgc taatagattt cagagaacta aacaaggtaa ctcaagattt
     2641 cacagaaatt cagttaggaa ttccacaccc agcagggttg gccaagaaga gaagaattac
     2701 tgtactagat gtaggggatg cttacttttc cataccacta catgaggact ttagaccata
     2761 tactgcattt actctaccat cagtgaacaa tgcagaacca ggaaaaagat acatatataa
     2821 agtcttgcca cagggatgga agggatcacc agcaattttt caacacacaa tgagacaggt
     2881 attagaacca ttcagaaaag caaacaagga tgtcattatc attcagtaca tggatgatat
     2941 cttaatagct agtgacagga cagatttaga acatgatagg gtagtcctgc agctcaagga
     3001 acttctaaat ggcctaggat tttctacccc agatgagaag ttccaaaaag accctccata
     3061 ccactggatg ggctatgaac tatggccaac taaatggaag ttgcagaaaa tacagttgcc
     3121 ccaaaaagaa atatggacag tcaatgacat ccagaagcta gtgggtgtcc taaattgggc
     3181 agcacaactc tacccaggga taaagaccaa acacttatgt aggttaatca gaggaaaaat
     3241 gacactcaca gaagaagtac agtggacaga attagcagaa gcagagctag aagaaaacag
     3301 aattatccta agccaggaac aagagggaca ctattaccaa gaagaaaaag agctagaagc
     3361 aacagtccaa aaggatcaag agaatcagtg gacatataaa atacaccagg aagaaaaaat
     3421 tctaaaagta ggaaaatatg caaaggtgaa aaacacccat accaatggaa tcagattgtt
     3481 agcacaggta gttcagaaaa taggaaaaga agcactagtc atttggggac gaataccaaa
     3541 atttcaccta ccagtagaga gagaaatctg ggagcagtgg tgggataact actggcaagt
     3601 gacatggatc ccagactggg acttcgtgtc taccccacca ctggtcaggt tagcgtttaa
     3661 cctggtaggg gatcctatac caggtgcaga gaccttctac acagatggat cctgcaatag
     3721 gcaatcaaaa gaaggaaaag caggatatgt aacagataga gggaaagaca aggtaaagaa
     3781 actagagcaa actaccaatc agcaagcaga actagaagcc tttgcgatgg cactaacaga
     3841 ctcgggtcca aaagttaata ttatagtaga ctcacagtat gtaatgggga tcagtgcaag
     3901 ccaaccaaca gagtcagaaa gtaaaatagt gaaccagatc atagaagaaa tgataaaaaa
     3961 ggaagcaatc tatgttgcat gggtcccagc ccacaaaggc atagggggaa accaggaagt
     4021 agatcattta gtgagtcagg gtatcagaca agtgttgttc ctggaaaaaa tagagcccgc
     4081 tcaggaagaa catgaaaaat atcatagcaa tgtaaaagaa ctgtctcata aatttggaat
     4141 acccaattta gtggcaaggc aaatagtaaa ctcatgtgcc caatgtcaac agaaagggga
     4201 agctatacat gggcaagtaa atgcagaact aggcacttgg caaatggact gcacacattt
     4261 agaaggaaag atcattatag tagcagtaca tgttgcaagt ggatttatag aagcagaagt
     4321 catcccacag gaatcaggaa gacaaacagc actcttccta ttgaaactgg caagtaggtg
     4381 gccaataaca cacttgcata cagataatgg tgccaacttc acttcacagg aggtgaagat
     4441 ggtagcatgg tggataggta tagaacaatc ctttggagta ccttacaatc cacagagcca
     4501 aggagtagta gaagcaatga atcaccatct aaaaaaccaa ataagtagaa tcagagaaca
     4561 ggcaaataca atagaaacaa tagtactaat ggcaattcat tgcatgaatt ttaaaagaag
     4621 ggggggaata ggggatatga ctccatcaga aagattaatc aatatgatca ccacagaaca
     4681 agagatacaa ttcctccaag ccaaaaattc aaaattaaaa gattttcggg tctatttcag
     4741 agaaggcaga gatcagttgt ggaaaggacc tggggaacta ctgtggaaag gagaaggagc
     4801 agtcctagtc aaggtaggaa cagacataaa aataatacca agaaggaaag ccaagatcat
     4861 cagagactat ggaggaagac aagagatgga tagtggttcc cacctggagg gtgccaggga
     4921 ggatggagaa atggcatagc cttgtcaagt atctaaaata caaaacaaag gatctagaaa
     4981 aggtgtgcta tgttccccac cataaggtgg gatgggcatg gtggacttgc agcagggtaa
     5041 tattcccatt aaaaggaaac agtcatctag agatacaggc atattggaac ttaacaccag
     5101 aaaaaggatg gctctcctct tattcagtaa gaataacttg gtacacagaa aagttctgga
     5161 cagatgttac cccagactgt gcagatgtcc taatacatag cacttatttc ccttgcttta
     5221 cagcaggtga agtaagaaga gccatcagag gggaaaagtt attgtcctgc tgcaattatc
     5281 cccgagctca tagagcccag gtaccgtcac ttcaatttct ggccttagtg gtagtgcaac
     5341 aaaatgacag accccagaga gacagtacca ccaggaaaca gcggcgaaga gactatcgga
     5401 gaggccttcg cctggctaaa caggacagta gaagccataa acagagaagc agtgaatcac
     5461 ctaccccgag aacttatttt ccaggtgtgg cagaggtcct ggagatactg gcatgatgaa
     5521 caagggatgt cagaaagtta cacaaagtat agatatttgt gcataataca gaaagcagtg
     5581 tacatgcatg ttaggaaagg gtgtacttgc ctggggaggg gacatgggcc aggagggtgg
     5641 agaccagggc ctcctcctcc tccccctcca ggtctggtct aatggctgaa gcaccaacag
     5701 agctcccccc ggtggatggg accccactga gggagccagg ggatgagtgg ataatagaaa
     5761 tcttgagaga aataaaagaa gaagctttaa agcattttga ccctcgcttg ctaattgctc
     5821 ttggcaaata tatctatact agacatggag acacccttga aggcgccaga gagctcatta
     5881 aagtcctgca acgagccctt ttcacgcact tcagagcagg atgtggccac tcaagaattg
     5941 gccagacaag gggaggaaat cctctctcag ctataccgac ccctagaaac atgcaataac
     6001 tcatgctatt gtaagcgatg ctgctaccat tgtcagatgt gttttctaaa caaggggctc
     6061 gggatatgtt atgaacgaaa gggcagacga agaaggactc caaagaaaac taagactcat
     6121 ccgtctccta caccagacaa gtgagtatga tgaatcagct gcttattgcc attttattag
     6181 ctagtgcttg cttagtatat tgcacccaat atgtaactgt tttctatggc gtacccacgt
     6241 ggaaaaatgc aaccattccc ctcttttgtg caaccagaaa tagggatact tggggaacca
     6301 tacagtgctt gcctgacaat gatgattatc aggaaataac tttgaatgta acagaggctt
     6361 ttgatgcatg gaataataca gtaacagaac aagcaataga agatgtctgg catctattcg
     6421 agacatcaat aaaaccatgt gtcaaactaa cacctttatg tgtagcaatg aaatgcagca
     6481 gcacagagag cagcacaggg aacaacacaa cctcaaagag cacaagcaca accacaacca
     6541 cacccacaga ccaggagcaa gagataagtg aggatactcc atgcgcacgc gcagacaact
     6601 gctcaggatt gggagaggaa gaaacgatca attgccagtt caatatgaca ggattagaaa
     6661 gagataagaa aaaacagtat aatgaaacat ggtactcaaa agatgtggtt tgtgagacaa
     6721 ataatagcac aaatcagacc cagtgttaca tgaaccattg caacacatca gtcatcacag
     6781 aatcatgtga caagcactat tgggatgcta taaggtttag atactgtgca ccaccgggtt
     6841 atgccctatt aagatgtaat gataccaatt attcaggctt tgcacccaac tgttctaaag
     6901 tagtagcttc tacatgcacc aggatgatgg aaacgcaaac ttccacatgg tttggcttta
     6961 atggcactag agcagagaat agaacatata tctattggca tggcagagat aatagaacta
     7021 tcatcagctt aaacaaatat tataatctca gtttgcattg taagaggcca gggaataaga
     7081 tagtgaaaca aataatgctt atgtcaggac atgtgtttca ctcccactac cagccgatca
     7141 ataaaagacc cagacaagca tggtgctggt tcaaaggcaa atggaaagac gccatgcagg
     7201 aggtgaagga aacccttgca aaacatccca ggtatagagg aaccaatgac acaaggaata
     7261 ttagctttgc agcgccagga aaaggctcag acccagaagt agcatacatg tggactaact
     7321 gcagaggaga gtttctctac tgcaacatga cttggttcct caattggata gagaataaga
     7381 cacaccgcaa ttatgcaccg tgccatataa agcaaataat taacacatgg cataaggtag
     7441 ggagaaatgt atatttgcct cccagggaag gggagctgtc ctgcaactca acagtaacca
     7501 gcataattgc taacattgac tggcaaaaca ataatcagac aaacattacc tttagtgcag
     7561 aggtggcaga actatacaga ttggagttgg gagattataa attggtagaa ataacaccaa
     7621 ttggcttcgc acctacaaaa gaaaaaagat actcctctgc tcacgggaga catacaagag
     7681 gtgtgttcgt gctagggttc ttgggttttc tcgcaacagc aggttctgca atgggcgcgg
     7741 cgtccctgac cgtgtcggct cagtcccgga ctttactggc cgggatagtg cagcaacagc
     7801 aacagctgtt ggacgtggtc aagagacaac aagaactgtt gcgactgacc gtctggggaa
     7861 cgaaaaacct ccaggcaaga gtcactgcta tagagaagta cctacaggac caggcgcggc
     7921 taaattcatg gggatgtgcg tttagacaag tctgccacac tactgtacca tgggttaatg
     7981 attccttagc acctgactgg gacaatatga cgtggcagga atgggaaaaa caagtccgct
     8041 acctggaggc aaatatcagt aaaagtttag aacaggcaca aattcagcaa gagaaaaata
     8101 tgtatgaact acaaaaatta aatagctggg atatttttgg caattggttt gacttaacct
     8161 cctgggtcaa gtatattcaa tatggagtgc ttataatagt agcagtaata gctttaagaa
     8221 tagtgatata tgtagtacaa atgttaagta ggcttagaaa gggctatagg cctgttttct
     8281 cttccccccc cggttatatc caacagatcc atatccacaa ggaccgggga cagccagcca
     8341 acgaagaaac agaagaagac ggtggaagca acggtggaga cagatactgg ccctggccga
     8401 tagcatatat acatttcctg atccgccagc tgattcgcct cttgaccaga ctatacagca
     8461 tctgcaggga cttactatcc aggagcttcc tgaccctcca actcatctac cagaatctca
     8521 gagactggct gagacttaga acagccttct tgcaatatgg gtgcgagtgg atccaagaag
     8581 cattccaggc cgccgcgagg gctacaagag agactcttgc gggcgcgtgc aggggcttgt
     8641 ggagggtatt ggaacgaatc gggaggggaa tactcgcggt tccaagaagg atcagacagg
     8701 gagcagaaat cgccctcctg tgagggacgg cagtatcagc agggagactt tatgaatact
     8761 ccatggaagg acccagcagc agaaagggag aaaaatttgt acaggcaaca aaatatggat
     8821 gatgtagatt cagatgatga tgaccaagta agagtttctg tcacaccaaa agtaccacta
     8881 agaccaatga cacatagatt ggcaatagat atgtcacatt taataaaaac aaggggggga
     8941 ctggaaggga tgttttacag tgaaagaaga cataaaatct taaatatata cttagaaaag
     9001 gaagaaggga taattgcaga ttggcagaac tacactcatg ggccaggagt aagataccca
     9061 atgttctttg ggtggctatg gaagctagta ccagtagatg tcccacaaga aggggaggac
     9121 actgagactc actgcttagt acatccagca caaacaagca agtttgatga cccgcatggg
     9181 gagacactag tctgggagtt tgatcccttg ctggcttata gttacgaggc ttttattcgg
     9241 tacccagagg aatttgggca caagtcaggc ctgccagagg aagagtggaa ggcgagactg
     9301 aaagcaagag gaataccatt tagttaaaga caggaacagc tatacttggt cagggcagga
     9361 agtaactaac agaaacagct gagactgcag ggactttcca gaaggggctg taaccaaggg
     9421 agggacatgg gaggagctgg tggggaacgc cctcatattc tctgtataaa tatacccgct
     9481 agcttgcatt gtacttcggt cgctctgcgg agaggctggc agattgagcc ctgggaggtt
     9541 ctctccagca gtagcaggta gagcctgggt gttccctgct agactctcac cagcacttgg
     9601 ccggtgctgg gcagacggcc ccacgcttgc ttgcttaaaa acctccttaa taaagctgcc
     9661 agttagaagc a
//
LOCUS       HIV2ST       9672 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Human immunodeficiency virus type 2 (HIV-2), complete proviral
            genome.
ACCESSION   M31113
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 2 (isolate HIV-2/ST) proviral
            DNA, clone JSP4-27.
REFERENCE   1  (bases 1 to 9672)
  AUTHORS   Kumar,P., Hui,H., Kappes,J.C., Haggarty,B.S., Hoxie,J.A.,
            Arya,S.K., Shaw,G.M. and Hahn,B.
  TITLE     Molecular characterization of an attenuated human immunodeficiency
            virus type 2 isolate
  JOURNAL   J. Virol. 64, 890-901 (1990)
  STANDARD  full staff_review
REFERENCE   2  (sites; corrections bases 6952, 7003, 7926)
  AUTHORS   Hahn,B.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Kindly provided in computer-readable form by B. Hahn, University of
            Alabama, Birmingham. The corrections of nts 6952, 7003 and 7926,
            from 'c' to 't' result in amino acid changes.
            
            This isolate, obtained from an asymptomatic Senegalese woman, has
            low pathogenicity, infecting cells at a relatively slow rate and
            causing little cell death or fusion.
FEATURES       from  to/span     description
    pept        548     2113     gag polyprotein
    pept    <  1717     1767     pol polyprotein (NH2 terminus uncertain; AA at
                                 1717)
               1771     4938     pol polyprotein
    pept       4868     5515     vif protein
    pept       5343     5681     vpX protein
    pept       5681     5776     vpR protein
               5780     5998     vpR protein
    pept       5844     6139     tat protein, exon 2 (first expressed exon)
               8291     8387     tat protein, exon 3 (AA at 8292)
    pept       6070     6139     rev protein, exon 2 (first expressed exon)
               8291     8544     rev protein, exon 3 (AA at 8293)
    pept       6149     8728     env polyprotein
    site       1768     1770     pol in-frame stop codon
    site       5777     5779     vpR in-frame stop codon
    revision   6952        1     t in [2]; c in [1]
    revision   7003        1     t in [2]; c in [1]
    revision   7926        1     t in [2]; c in [1]
    pept       8562     9329     nef protein
    pre-msg       1     9672     genomic mRNA
    pre-msg       1     9672     tat, rev, nef subgenomic mRNA
    rpt        9499     9672     R repeat 3' copy
    binding     305      321     primer (Lys-tRNA) binding site
    signal     9650     9655     poly-A signal
BASE COUNT     3304 a   1991 c   2412 g   1965 t
ORIGIN      
        1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
       61 gtagagcctg ggtgttccct gctagactct caccagtgct tggccggcac tgggcagacg
      121 gctccacgct tgcttgctta aaagacctct taataaagct gccagttaga agcaagttaa
      181 gtgtgtgctc ccatctctcc tagtcgccgc ctggtcattc ggtgttcatc taaagtaaca
      241 agaccctggt ctgttaggac cctttctgct ttgggaaacc aaggcaggaa aatccctagc
      301 aggttggcgc ccgaacaggg acttgaagaa gactgagaag ccttggaaca cggctgagtg
      361 aaggcagtaa gggcggcagg aacaaaccac gacggagtgc tcctagaaaa gcgcaggccg
      421 aggtaccaag ggcggcgtgt ggagcgggag tgaaagaggc ctccgggtga aggtaagtgc
      481 ctacaccaaa tacagtagcc agaagggctt gttatcctac ctttagacgg gtagaagatt
      541 gtgggagatg ggcgcgagaa actccgtctt gagagggaaa aaagcagacg aattagaaaa
      601 gattaggtta cggcccggcg gaaagaaaaa atataggcta aaacatattg tgtgggcagc
      661 gaatgaattg gacagattcg gattggcaga gagcctgttg gagtcaaaag agggttgcca
      721 aaaaattctt acagttttag atccattagt accgacaggg tcagaaaatt taaaaagcct
      781 ttttaatact gtctgcgtca tttggtgtat acacgcagaa gagaaagcga aagatactga
      841 agaagcaaaa caaaaggtac agagacatct agtggcagaa acaaaaacta cagaaaaaat
      901 gccaagtaca agtagaccaa cagcaccacc tagcgggaac ggaggaaact tccccgtaca
      961 acaagtggcc ggcaactata cccatgtgcc actaagtccc cgaaccctaa atgcttgggt
     1021 aaaactagta gaggaaaaga agttcggggc agaagtagtg ccaggatttc aggcactctc
     1081 agaaggctgc acgccctatg atattaatca aatgcttaat tgtgtgggcg accatcaagc
     1141 agctatgcaa ataatcaggg aaattattaa tgaagaagca gcagattggg acgcacaaca
     1201 cccaatacca ggccccttac cagcggggca gctcagggag ccaaggggat ctgacatagc
     1261 agggacaaca agcacagtag aagagcagat ccagtggatg tttaggccac aaaatcctgt
     1321 accagtagga agcatctata gaagatggat ccagataggg ctacagaagt gcgtcaggat
     1381 gtacaaccca accaacatcc tagacataaa acagggacca aaggagccat tccagagtta
     1441 tgtagataga ttctacaaga gcttgagggc agaacaaaca gatccagcag taaaaaattg
     1501 gatgacccaa acactgctag tgcagaatgc caacccagac tgtaagttag tactaaaagg
     1561 actagggata aatcctacct tagaagaaat gctaaccgcc tgtcaggggg taggtggacc
     1621 aggccagaaa gccagattaa tggcagaagc cttaaaggag gccatggcac cagcccccat
     1681 cccatttgca gcagcccaac agagaaggac aattaagtgc tggaattgcg gaaaggaagg
     1741 gcactcggca agacaatgcc gagcacctag aagacaaggc tgctggaaat gtggcaaggc
     1801 aggacacatc atggcaaaat gcccagaaag acaggcgggt tttttagggt tgggcccatg
     1861 gggaaagaag ccccgcaatt tccctgtggc ccaaatcccg caggggctga caccaacagc
     1921 acccccgata gacccagtag aggacctact agagaagtac atgcagcaag ggaaaaggca
     1981 gagagagcag agagagaggc catacaaaga agtgacagag gacttcctgc agctcgagaa
     2041 acaagagaca ccatgcagag agacgacaga ggacttgctg cacctcaatt ctctctttgg
     2101 aaaagaccag tagtcacagc acatgttgag ggccagccag tagaagtttt gctagacaca
     2161 ggggctgacg actcaatagt agcaggcgta gagttaggga gcaattatag tccaaagata
     2221 gtagggggaa tagggggatt cataaatacc aaagaatata aaaatgtaga aataagagta
     2281 ttaaataaaa gagtaagagc caccataatg acaggtgata ccccaatcaa catttttggc
     2341 agaaacattc tgacagcctt aggcatgtca ttaaatctac cagtcgccaa gatagaacca
     2401 ataaaaataa tgctgaagcc aggaaaggat ggaccaaaac tgagacaatg gcccttaaca
     2461 aaagaaaaaa tagaggcact aaaagagatc tgtgagaaaa tggaaagaga gggccagcta
     2521 gaggaggcac ctccaactaa tccttataat acccccacat ttgcaatcaa gaaaaaggac
     2581 aaaaacaaat ggagaatgct aatagatttt agagaactaa acaaggtaac tcaagacttc
     2641 acagaaatcc agttaggaat tccacaccca gcaggactag ccaagaagaa acgaattact
     2701 gtcctagatg taggggatgc ttacttttcc ataccactac atgaggattt tagacagtat
     2761 actgcattta ctctaccatc aataaacaat gctgaaccag gaaaaagata catatataaa
     2821 gtctcaccac agggatggaa gggatcacca gcaatttttc agtacacaat gaggcaggtc
     2881 ttagaaccat tcagaaaagc aaacccggat atcattctca ttcagtacat ggatgatatc
     2941 ttgatagcca gcgacaggac agatttagaa catgacagag tggttctgca gctaaaggaa
     3001 cttctaaatg gcctgggatt ttccacccca gatgagaagt tccaaaaaga ccctccatac
     3061 caatggatgg gctatgaact gtggccaact aaatggaagc tgcaaagaat acaattgccc
     3121 caaaaggaag tatggacagt caatgacatc caaaaactgg tgggtgtcct aaattgggca
     3181 gcacaaatct acccagggat aaagaccaga aacttatgta ggttaatcag aggaaaaatg
     3241 acactcacag aagaggtaca gtggacagaa ttagcagaag cggaactaga agaaaacaaa
     3301 atcatcttaa gccaggaaca agaaggatgc tattaccaag aggaaaagga gctagaagca
     3361 acagtccaaa aagatcaaga caatcagtgg acatataaga tacaccaggg aggaaaaatt
     3421 ctaaaagtag gaaaatatgc aaaggtaaaa aatacccaca ccaacggagt cagactccta
     3481 gcacaagtag ttcaaaaaat aggaaaagaa gcactagtca tttggggacg aataccaaaa
     3541 tttcacctac cagtagaaag agatacctgg gaacagtggt gggataacta ctggcaagtg
     3601 acatggatcc cagactggga cttcatatct accccgccac tggtcagatt agtatttaac
     3661 ctggtgaaag atcccatact aggcgcagaa accttctaca cagatggatc ctgcaataag
     3721 caatcaagag aaggaaaagc aggatacata acagatagag gaagagacaa ggtgaggcta
     3781 ttagagcaaa ccaccaatca gcaagcagaa ttagaagcct ttgcgatggc agtaacagac
     3841 tcaggtccaa aggccaacat tatagtagac tcacaatatg taatgggaat agtagcaggc
     3901 caaccaacag agtcagagag taaaatagta aatcaaatca tagaagaaat gataaaaaag
     3961 gaagcaatct atgttgcatg ggtcccagcc cataaaggca taggaggaaa tcaggaggta
     4021 gatcacttag taagtcaggg catcagacaa gtattattcc tagagaaaat agaacccgct
     4081 caggaggaac atgaaaaata tcatagcaat gtaaaagaac tatcccataa atttggactg
     4141 cccaaattag tggcaagaca aatagtaaac acatgcaccc aatgtcagca gaaaggggag
     4201 gctatacatg ggcaagtaaa tgcagaatta ggcacttggc aaatggactg cacacactta
     4261 gaaggaaaaa tcattatagt agcagtacat gttgcaagtg gatttataga agcagaagtc
     4321 atcccacagg aatcaggaag gcaaacggca ctcttcctac taaaactggc cagtaggtgg
     4381 ccaataacac atttgcacac agacaatggt gccaacttca cttcacagga agtaaagatg
     4441 gtggcatggt ggataggtat agaacaatcc ttcggagtac cttacaatcc acaaagccaa
     4501 ggagtagtgg aagcaatgaa tcaccaccta aaaaatcaga taagcagaat tagagagcag
     4561 gcaaacacag tagaaacaat agtactaatg gcagttcatt gcatgaattt taaaaggagg
     4621 ggaggaatag gggatatgac cccagcagaa agactaatca atatggtcac tgcagaacag
     4681 gaaatacaat tcctccaagc aaaaaattca aaattacaaa attttcgggt ctatttcaga
     4741 gaaggcagag atcagctgtg gaaaggacct ggggaactac tgtggaaggg ggacggagca
     4801 gtcatagtca aggtaggggc tgacataaaa ataataccaa gaaggaaagc taagatcatc
     4861 aaagactatg gaggaaggca agagatggat agcggttcca acttggaggg tgccagggag
     4921 gatggagagg tggcatagcc ttatcaagta tctaaaatac agaacaggag atctagagaa
     4981 ggtgtgctat gttccccacc ataaggtggg atgggcgtgg tggacttgca gcagggtaat
     5041 attcccatta aaaggagaaa gtcatctgga gatacaggca tactggaacc taacaccaga
     5101 aaaaggatgg ctctcctcct attcagtaag actaacttgg tatacagaaa aattctggac
     5161 agatgttacc ccagactgtg cggactccct aatacatagc acttatttct cttgctttac
     5221 ggcaggcgaa gtaagaagag ccatcagagg ggaaaagcta ttatcctgct gcaactaccc
     5281 ccaagcccat aagtaccagg taccgtcact ccagtttctg gccttagtgg tagtgcaaca
     5341 aaatggcagg ccccagagag acaataccac caggaaacag tggcgaagaa actatcggag
     5401 aggccttcga gtggctagac aggacggtag aagccataaa cagagaggca gtgaaccacc
     5461 tgccccgaga gcttattttc caggtgtggc aaaggtcctg gagatactgg catgatgaac
     5521 aaggaatgtc aataagttac acaaagtata gatatttgtg cctaatgcag aaagctatgt
     5581 tcatacattc taagagaggg tgcacttgcc tggggggagg acatgggccg ggaggatgga
     5641 gatcaggacc tccccctcct ccccctccag gtctagtcta atgactgaag caccaacaga
     5701 gtctcccccg gaggatagga ccccaccgag ggagccaggg gatgagtggg taatagaaac
     5761 cctgagagag ataaaataag aagctttaaa gcactttgac cctcgcttgc taattactct
     5821 tggcaactat atctatgcta gacatggaga cacccttgaa ggcgccagag ggctcattag
     5881 gatcctacaa cgagccctcc tcttgcactt cagagcagga tgcggccgct caaggattgg
     5941 tcagcccagg ggacgaaatc ctttatcagc tataccaacc cctagaggca tgcgataaca
     6001 aatgttactg taaaaagtgc tgctaccatt gccagatgtg ttttttaaac aaggggctcg
     6061 ggatatggta tgaacgaaag ggcagaagaa gaagaactcc gaagaaaact aaggctcatt
     6121 cgtcttctgc atcagacaag tgagtaagat gtgtggtagg aatcaactat ttgttgccag
     6181 cttgctagct agtgcttgct taatatattg cgtccaatat gtgactgttt tctatggcgt
     6241 gcccgtgtgg agaaatgcat ccattcccct cttttgtgca actaaaaata gagatacttg
     6301 gggaaccata cagtgcttgc cagacaatga tgactatcag gaaatagctt taaatgtgac
     6361 agaggccttc gacgcatgga ataatacagt aacagaacaa gcagtagaag atgtctggag
     6421 tctatttgag acatcaataa aaccatgcgt caaactaaca cccttatgtg tagcaatgcg
     6481 ttgtaacagc acaactgcaa aaaacacaac ctccacacca acaaccacca caacagcaaa
     6541 cacaacaata ggagagaatt cttcatgcat acgcacagac aactgcacag ggttgggaga
     6601 agaagagatg gtcgactgtc agttcaatat gacaggatta gagagggata agaaaaaact
     6661 atataatgaa acatggtact caaaagatgt agtctgtgaa tcaaatgaca ccaagaaaga
     6721 gaaaacatgt tacatgaacc actgcaacac atcagtcatc acagagtcat gtgacaagca
     6781 ctattgggat actatgaggt ttagatattg tgcaccaccg ggttttgccc tgctaagatg
     6841 caatgatacc aattattcag gctttgagcc caattgttct aaggtagtag ctgctacatg
     6901 tacaaggatg atggaaacgc aaacctccac ttggtttggc tttaatggca ctagggcaga
     6961 aaatagaaca tatatctatt ggcatggtag ggataataga actatcatta gcttaaacaa
     7021 gttttataat ctcaccgtac attgtaagag gccaggaaac aagacagttg taccaataac
     7081 actcatgtca gggttagtgt ttcactccca gccaatcaat agaagaccca ggcaagcatg
     7141 gtgctggttc aaaggcgagt ggaaggaagc catgaaggag gtgaagctaa cccttgcaaa
     7201 acatcccagg tataaaggaa ccaacgacac agaaaaaatt cgttttatag cgccaggaga
     7261 acgctcagac ccagaagtgg catacatgtg gactaactgc agaggagaat ttctctactg
     7321 caatatgact tggttcctca attgggtaga aaacagaacg aatcagacac agcacaatta
     7381 tgtgccatgc catataaagc aaataattaa tacctggcac aaggtaggga aaaatgtata
     7441 tttgcctcct agggaaggac agttaacctg caactctaca gtgaccagca taattgctaa
     7501 cattgacgga ggagagaacc agacaaatat tacctttagt gcagaggtgg cagaactata
     7561 ccgattagaa ttgggggatt ataaattgat agaagtaaca ccaattggct ttgcacctac
     7621 accagtaaaa agatactcct ctgctccagt gaggaataaa agaggtgtat tcgtgctagg
     7681 gttcttaggt tttctcacga cagcaggagc tgcaatgggc gcggcgtcct tgacgctgtc
     7741 ggctcagtct cggactttat tggccgggat agtgcagcaa cagcaacagc tgttggacgt
     7801 ggtcaagaga caacaagaaa tgttgcgact gaccgtctgg ggaacaaaaa atctccaggc
     7861 aagagtcact gctatcgaga aatacttaaa ggaccaggcg caactaaatt catggggatg
     7921 tgcgtttaga caagtctgcc acactactgt accatgggta aatgacacct taacgcctga
     7981 ttggaacaac atgacatggc aggaatggga gcaacgaatc cgcaacctag aggcaaatat
     8041 cagtgaaagt ttagaacagg cacaaatcca gcaagaaaag aacatgtatg aactacaaaa
     8101 attaaatagc tgggatgttt ttggcaactg gtttgattta acctcctgga tcaaatatat
     8161 tcagtatgga gtttatatag tagtaggaat aatagtttta agaatagtaa tatatgtagt
     8221 acaaatgtta agtagactta gaaagggcta taggcctgtt ttctcttccc cccccgctta
     8281 cttccaacag atccatatcc acaaggaccg ggaacagcca gccagagaag aaacagaaga
     8341 agacgttgga aacagcgttg gagacaattg gtggccctgg ccgataagat atatacattt
     8401 cctgatccgc cagctgattc gcctcttgaa cagactatac aacatctgca gggacttact
     8461 atccaggagc ttccagaccc tccaactaat ctcccagagt cttcggagag cattgacagc
     8521 agtcagagac tggctgagat ttaacacagc ctacctgcaa tatgggggcg agtggatcca
     8581 agaagcgttc cgagccttcg cgagggctac gggagagact cttacaaacg cctggagagg
     8641 cttctggggg acactgggac aaattgggag gggaatactt gcagtcccaa gaaggatcag
     8701 gcagggggca gaaatcgccc tcctgtgagg gacggcggta tcaacaggga gattttatga
     8761 ataccccatg gagagcccca gcagaagggg agaaaggctc gtacaagcaa caaaatatgg
     8821 atgatgtaga ttcagatgat gatgacctag taggggtccc tgtcacacca agagtaccat
     8881 taagagaaat gacatatagg ttggcaagag atatgtcaca tttgataaaa gaaaaggggg
     8941 gactggaagg gctgtattac agtgatagga gacgtagagt cctagacata tacttagaaa
     9001 aggaagaggg aataattgga gactggcaga actatactca tggaccagga gtaaggtatc
     9061 caaagttctt tgggtggtta tggaagctag taccagtaga tgtcccacaa gagggagatg
     9121 acagtgagac tcactgctta gtgcatccag cacaaacaag caggtttgat gacccgcatg
     9181 gagaaacatt agtttggagg tttgacccca cgctagcttt tagctacgag gcctttattc
     9241 gatacccaga ggagtttggg tacaagtcag gcctgccaga ggatgaatgg aaggcaagac
     9301 tgaaagcaag agggataccg tttagctaaa aacaggaaca gctatacttg gtcagggcag
     9361 gaagtaacta acagaaaaca gctgagactg cagggacttt ccagaagggg ctgttaccag
     9421 gggagggaca tgggaggagc cggtggggaa cgccctcata ctttctgtat aaatgtaccc
     9481 gctactcgca ttgtattcag tcgctctgcg gagaggctgg cagattgagc cctgggaggt
     9541 tctctccagc actagcaggt agagcctggg tgttccctgc tagactctca ccagtgcttg
     9601 gccggcactg ggcagacggc tccacgcttg cttgcttaaa agacctctta ataaagctgc
     9661 cagttagaag ca
//
LOCUS       HIVBOS1       253 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Human immunodeficiency virus type 1, envelope region.
ACCESSION   M27910
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 1 (isolate 1153), clone 1153-1.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 253)
  AUTHORS   Ou,C.-Y.et.al.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     [1] kindly submitted in computer-readable form by C.-Y. Ou, CDC,
            Atlanta. The sequence below, designated 1153-1, appears to be a
            major form of 11 sibling sequences taken from a Boston isolate.
BASE COUNT       98 a     46 c     54 g     55 t
ORIGIN      Approximately 1170 nt from env start (BRU coordinates).
        1 ttcaacacaa ctgtttaata gtacttggac ttggaatggt actgaaaggt caaataacac
       61 tgaaggaaat atcacactcc catgcagaat aaaacaaatt ataaacatgt ggcaggaagt
      121 aggaaaagca atgtatgccc ctcccatcag aggacaaatt agatgttcat caaatattac
      181 agggctacta ttaacaagag atggtggtaa tgagaccaac gagaccgaga ccttcagacc
      241 tggaggagga aat
//
LOCUS       HIVJFL       2553 bp ss-RNA             VRL       04-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 proviral env gene, 5' end.
ACCESSION   M31451
KEYWORDS    envelope protein.
SOURCE      Human immunodeficiency virus type 1 (isolate JFL) proviral DNA,
            clone I-BL2.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 2553)
  AUTHORS   McNearney,T., Westervelt,P., Thielan,B., Trowbridge,D.B.,
            Garcia,J., Whittier,R. and Ratner,L.
  TITLE     Limited sequence heterogeneity among biologically distinct human
            immunodeficiency virus type 1 isolates from individuals involved in
            a clustered infectious outbreak
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 1917-21 (1990)
  STANDARD  full staff_review
COMMENT     This sequence was kindly provided prior to publication by Dr.  Lee
            Ratner, Washington University School of Medicine, St. Louis.  Clone
            I-BL2 and twelve other clones sequenced from seven isolates, taken
            from one donor and two TA-AIDS recipients, all have pre-mature stop
            codons in env. The isolate itself is described as monocytropic.
FEATURES       from  to/span     description
    pept          1     1851     envelope polyprotein
               1855     1992     envelope polyprotein
               1996  >  2553     envelope polyprotein
    site       1852     1854     envelope polyprotein in-frame stop codon
    site       1993     1995     envelope polyprotein in-frame stop codon
BASE COUNT      890 a    435 c    603 g    625 t
ORIGIN      Env start codon.
        1 atgagagtga aggagatcag gaagaattgt cagcacttat ggagatgggg caccatgctc
       61 cttggaatgt tgatgatctg tagtgctgca ggaaaattgt gggtcacagt ctattatggg
      121 gtacctgtat ggaaagaagc aaccaccact ctattttgtg catcagacgc taaagcatat
      181 gacacagagg tacataatgt ttgggccaca catgcctgtg tacccacaga ccctaaccca
      241 caagaagtaa aattggaaaa tgtgacagaa aattttaaca tgtggaaaaa tgacatggta
      301 gaacagatgc atgaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaaa
      361 ttaaccccac tctgtgttac tttaaattgc attaatttga agaatgttac taatatcaat
      421 aatagttggg gaaagatgga ggaaggagaa ataaaaaact gctctttcaa taccaccaca
      481 agcataaaaa ataagatgca gagagaatat gcactttttt ataaacttga tgtagtacca
      541 atagataagg ataatacaag ctatacattg ataaattgta acacctcaac cattacacag
      601 gcctgtccaa aggtatcctt tgaaccaatt ccaatacatt attgtacccc ggctggtttt
      661 gcgattctaa agtgtaatga taagaagttc aatggaaaag gaccatgtaa aaatgtcagc
      721 acagtacaat gtacacatgg aattaggcca gtagtgtcaa ctcacttgct gttaaatggc
      781 agtctagcag aaggagaggt agtaattaga tctgaaaatt tcatggacaa tgctaaaacc
      841 ataatagtac agctgaataa atctatagca attaattgta caagacctaa caacaataca
      901 agaaaaagta tcactctagg accagggagg gcattttata caacaggaga cataatagga
      961 gacataagac aagcacattg taaccttagt agagtacaat ggaataacac tttaaaacag
     1021 atagttataa agttaagaga acagtttggg aataaaacaa tagtctttaa gcaatcctca
     1081 ggaggggacc cagaaattgt aatgcacact tttaattgtg gaggggaatt tttctactgt
     1141 aattcaacac agctgtttaa tagtacttgg ctgtccaatg gtacttggaa tgatactgaa
     1201 gagtcaaaag acactagaga acctatcata ctcccatgca gaataaaaca aattataaac
     1261 aggtggcagg aagtaggaaa agcaatgtat gcccctccca tcaacggaca aattagatgt
     1321 tcatcaaata ttacagggct actattaaca agagatggtg gtaaagagaa tggggccgag
     1381 atcttcagac ctggaggagg aaatatgagg gacaattgga gaagtaaatt atataaatat
     1441 aaagtagtaa aaattgagcc attaggagta gcacccacca aggcaaagag aagagtggtg
     1501 cagagaaaaa aaagagcagt gggaacgata ggagctatgt tccttgggtt cttgggagca
     1561 gcaggaagca ctatgggcgc agcgtcgatg acgctgacgg tacaggccag actattattg
     1621 tctggtatag tgcaacagca aaacaatttg ctgagggcta ttgaggcgca acagcatctg
     1681 ttgcaactca cagtctgggg catcaagcag ctccaggcaa gagtcctggc tgtggaaaga
     1741 tacctacagg atcaacagct cctggggatt tggggttgct ctggaaaact catttgcacc
     1801 actactgtgc cttggaatgc tagttggagt aataaatctc tggatgagat ttaaaataac
     1861 atgacctgga tgcagtggga gagagaaatt agcaattaca caagcctaat atacacctta
     1921 attgaagaat cgcagaacca acaagaaaag aatgaactag aattattgga attggataag
     1981 tgggcaagtt tgtgaaattg gtttgacata acaaactggc tgtggtctat aaaaatattc
     2041 ataatgatag tagcaggttt agtaggttta agaatagttt ttgctgtgct ttctatagtg
     2101 aatagagtta ggcagggata ctcaccattg tcattccaga cccgcctccc aaccccgagg
     2161 ggacccgaca ggcccggagg aatcgaagaa gaaggtggag agagagacag agacagatcc
     2221 ggttacttag tgactggatt cttagcactt atctgggaag acctgtggag cctgtgcctc
     2281 ttcagctacc accgcttgag agacttactc ttgattgtaa cgaggattgt ggaactcctg
     2341 ggacgcaggg ggtgggaagc cctcaaatat tggtggaatc tcctacagta ttggagtcag
     2401 gaactaaaga atagtgctgt tagtttgctt aacgctacag ctatagcagt agctgagggg
     2461 acagatagga ttatagaagt agtacaaaga attggtagag ctatcctcca catacctgta
     2521 agaataagac agggcttaga aagggctttg cta
//
LOCUS       RATPKATA1    4644 bp ds-DNA             ROD       04-AUG-1990
DEFINITION  Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene,
            exons 1-5.
ACCESSION   D90055 J05269
KEYWORDS    3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
            peroxisomal protein.
SEGMENT     1 of 4
SOURCE      Rat (Sprague-Dawley) liver DNA, clone PT147.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4644)
  AUTHORS   Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
  TITLE     Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of twO
            closely related but differentially regulated genes
  JOURNAL   J. Biol. Chem. 265, 4600-4606 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Makoto
            Hijikata Virology Division, National Cancer Center Research
            Institute Tsukiji 5-1-1 Chuo-ku, Tokyo 104 Japan Phone:
            (03)542-2511 Fax:   (03)545-3567 The sequence 1487-1594 encodes
            presumptive leader peptide.
FEATURES       from  to/span     description
    pept       1487     1687     3-ketoacyl-CoA thiolase, exon 1
               1827     1920     3-ketoacyl-CoA thiolase, exon 2
               2800     2857     3-ketoacyl-CoA thiolase, exon 3
               4064     4143     3-ketoacyl-CoA thiolase, exon 4
               4381  +  4423     3-ketoacyl-CoA thiolase, exon 5
    pre-msg    1373  >  4644     3-ketoacyl-CoA thiolase mRNA and introns
    IVS        1688     1826     3-ketoacyl-CoA thiolase, intron A
    IVS        1921     2799     3-ketoacyl-CoA thiolase, intron B
    IVS        2858     4063     3-ketoacyl-CoA thiolase, intron C
    IVS        4144     4380     3-ketoacyl-CoA thiolase, intron D
    signal     1223     1228     GC box
    signal     1263     1268     GC box
    signal     1295     1300     GC box
    signal     1346     1351     GC box
    signal     1202     1206     CAAT box
    signal     1217     1221     CAAT box
    signal     1327     1331     CAAT box
    signal     1238     1244     enhancer core
    signal     1464     1470     enhancer core
BASE COUNT     1013 a   1303 c   1238 g   1090 t
ORIGIN      
        1 ggatcccacg cggggcctcc cgcagacatg gcaagcaacc ctgggccccg gttctgtacg
       61 gtgcgctcgc ttgttgagct tcctacttcc cttaaggctc ctgccaaccc ctcctccttt
      121 cgctttccga gaagctccgc cccttgctag aatctggaat ccggacccac aacacgcagc
      181 ctgaaattca acccacttga gttctcacct ccccttgaag gtgtgagagc ttatagaaac
      241 cgcctctgac gtccgccttg gcccaccgac ctgtcctggg cgatctggta gtggtggcta
      301 tgcgtggtga agtgattcag ccttgtttag gacttgacag gagttacagc tcttccgcac
      361 agtaatttag attctggtaa tagggaggga agagacggac tggcttcgta agcagtaacc
      421 tgaaggctct gtctacctta gccatctggg acaagtggtg accgtgcttt ttaaagggaa
      481 agtccacgac cttgcccgag aatagacatt tagggcatcc tgtcctttgt tatcttagac
      541 agacaacact accttccgag ggtctgcaaa aagcacacgg acctatcact ttactcttta
      601 acgttgggct tagaccaagg aagagtgttt cgttatttcg ctcggttaat ttacttaggt
      661 gtcaatcacc agtactagat ctcccccttc tcctcccccg ggcttagtag aattccttgg
      721 ggaaagtggg cattaatggc gttatcccaa acgaacgtag aacaggctta agacatgtgg
      781 ggcagtgcca cagttagaac ctgcctgcca ttcagcacct tcacaccagc cggacaccca
      841 aaattaattt gttttctgag agagcttctg tgacagccca atagggttgg ggttgaacta
      901 agtacctcaa cattatctag gtcctcggtt ctgggttttg agacccgggg ccgagcacag
      961 caaattttga ggagaaacaa gtgtcttccc tggctcttcc agcaaaggga gaagccctga
     1021 gtcgagctcc gccttccacg ccgctgcgcg gtgacctcag aaggagggac tccctcgaga
     1081 ctgagttcct cttaggaaac cagaagtttg gggcgcctcc tttccgtgag tctaccaaag
     1141 gttgtgcaaa cactagtttc cagtctgcct ttctagggga ggattaagag tacgccttat
     1201 gattggcttc cgtaagattg gcgggcgggg ctttgcgtgg aaagagtctt aggacgggtc
     1261 ttgggcgggg ctctggggca aggggtgagt gggtgggcgg ccctcaagca gacacccctt
     1321 aatctaattg gttgtgtcta agagtgggcg ggccctctaa agaccccatc gcaccactct
     1381 cagcattatc ctacccgagc tctgatccac tttcaggcct cgtgaggtag agggctggcc
     1441 tgcgcctgcg cctgccatca ttttggtttg ttaagcaagg cagagcatga gcgagtcggt
     1501 gggacgcacc tccgcgatgc atcggctgca ggtagtgctg ggccacctgg ccggccgacc
     1561 cgagtcgagc tccgcgctgc aagccgcgcc ctgctccgct accttcccgc aggcttcggc
     1621 ctccgacgtg gtggtggtgc acggacggcg cacccccatc ggccgcgccg gccgcggcgg
     1681 cttcaaggtg aggcccctcg ggcgggcgag aggagcagaa ggcggccaga gtgggtggga
     1741 tcctgccctc gggggtccgg tgagcgcaga gacggcgggt ggggatgaag gagttgggct
     1801 caccgctgct gcacctcgcc ctgcaggaca ccacccccga cgagcttctg tcggccgtgt
     1861 tgaccgcggt tctccaggac gtgaagctaa agcctgagtg tttgggagac atctctgtgg
     1921 gtgagctccc ctgagggtcc tgcaccaccc cacccagcct ctctttctta ccagaacctg
     1981 ggtttcactg agcaagacct acggattcca ccgatttcag tgggatggtc aaagggccct
     2041 tttacagggt tcgcctaaga tcattgggaa acacagacat ctacattatg gttcataaca
     2101 gtagcagaat tacagttctg aagtagcaat gaaaaataat tttgtggttg ggggtcacca
     2161 cagcatgagg aactgtatta aaagggtcac agcactaaga tagttgagaa ccattgatct
     2221 aaagcccaga agttcctgga gacttccagg aggtgagaaa acgcagttct ctacctacat
     2281 gaacatcctg acccgggtac gggctgcaga tgggttcacc ccaccctgcg ctatccttga
     2341 gttacaacag agtataggac atggttttgt ccccaggtgc cagcctgctc ctcgtgcctg
     2401 caccatcgca gggtttggag gaggatactt ggggaggtga acagttgcag gatctgacct
     2461 ctttcaccca cagcccgtct tctcacagcg ttcaggctgt gcactgtgtc tcttacccta
     2521 gtgtctctcc tactaccgag caagcacccc actccccacc ccaaccgtga tttagaagag
     2581 cataccttgg ccccttcagc actgggctgg tagatgactg gctgttgggt agcttggctc
     2641 cccattttcc agactccaga cttctatagc ttcacacgag cctgggaaaa tcctaaagtg
     2701 atccgttaga aagatctgga atgtttcctc tctggaactc acagctgaac ctcagtgtgc
     2761 agaacactac ttggtcctga ataaggtttc tatctacagg taacgtactt gagccaggag
     2821 ccggagcagt catggcgcgc attgcccaat ttctgaggta acccttctac ctctggtggg
     2881 ggattctacc accatgaacc agtccgtagg tgacaggtta tcaggttgga cagtagatag
     2941 caagtcagcc agaatccttg gacccaacag tgctcctact gggttaacca acccagaagc
     3001 aggataccca ggctttggga ccagagccaa ggtgaccaga tgcttcccag ggccgaactt
     3061 ttgttaccct tggaagcatg gactctgacc tcatatgtca gataaataat cacgtgacca
     3121 cttgatcagg atctttactt catttgctac ttgagctgcc tgagaaccct gggctcagtc
     3181 agttatcaga agcttgacgg tagtggggtg ggggaagcag ggtctggaca tcttggccct
     3241 tcaggtaggg aagcagtagc tcatttgcag gattaagagc agcagtagcc agagtccaaa
     3301 ccggggaaaa ccttttgtct agagacacgg aaagggtcag acttagcctt cctgtccctc
     3361 ttgtaacttt cctgttcatc tctcacctgc ttcttcattt tgactttccc ttctctgctg
     3421 tactggccac cctctcccta tacactcgct ctgcctattc tgtcccaata gagagctctc
     3481 tccctgctca cctagctctg gccagtccct gcccagacac ctcggaagct ctcaagcatc
     3541 agatgagtga ccttcttgtc caccaagtca ctatgtgcct ccctgtcgct aagacacttg
     3601 caatggaggg tctggaaaca catgcctgct accttgagct tctaagctca ggcaggggga
     3661 gctcgtttgg tccacccaca ggcactcttc ctctccttca gaagttttga caaaggaaag
     3721 gtagacatct ttgcccacaa agagactagc caccagccta agagctaggt ctcatgggca
     3781 aggcttccaa ccatctcctc agttgcaagt ggctgtagcg acagggtttg cctagtagtt
     3841 gacccctctg catctatcta ccctagtggg agcaaggcca gctgttacct cctggtctgt
     3901 acactgctgt gggccataat gggaccctaa agactcgtgg gtcctgcttt ccatgacaca
     3961 gggtcgagtc cggggctgac cacacatcca aggatgacta accaccatct cccccaggga
     4021 tggctgcaca tccaggctga ccgccatctc ccctctgttt cagtggcatc ccagagaccg
     4081 tgcctctgtc agcagtcaac agacagtgtt catcgggact gcaggcagtg gccaacattg
     4141 ctggtaagta gtgattcctg ggtgcccatg agtggaccct gaccttggtc cttcctcagg
     4201 accctaggca ggaagagtgg aggggcaccc attgccacac cccaagcctt tttgggtgct
     4261 cccgggtctt ccatgaggct ccgttgaatg cacccctgag aacagggagg gaacgggctg
     4321 tgctgctccc agggcgggcc cgctccggcc aaggataatt atggagtgct ttcgttgcag
     4381 gtggcatcag aaatgggtct tacgacattg gcatggcctg tgggtaagat ggccttccgc
     4441 cgggccttat taaccaacag caataatcaa actaatttct ctctcagttc ggaggggccg
     4501 tgattccagg cttctccccg cccacgaaga cagctgaata gggtgggctt ccagaatgtg
     4561 tctaggagtg gcccaaccct ggctgctcag gaatgggcca atcttagcac aagcagtaac
     4621 cggaattggg tcctctaagg tacc
//
LOCUS       RATPKATA2    1205 bp ds-DNA             ROD       04-AUG-1990
DEFINITION  Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene,
            exon 6.
ACCESSION   D90056 J05269
KEYWORDS    3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
            peroxisomal protein.
SEGMENT     2 of 4
SOURCE      Rat (Sprague-Dawley) liver DNA, clone PT147.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1205)
  AUTHORS   Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
  TITLE     Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
            closely related but differentially regulated genes
  JOURNAL   J. Biol. Chem. 265, 4600-4606 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Makoto
            Hijikata Virology Division, National Cancer Center Research
            Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
            (03)542-2511 Fax:   (03)545-3567
FEATURES       from  to/span     description
    pept    +   283  +   381     3-ketoacyl-CoA thiolase, exon 6
    pre-msg <     1  >  1205     3-ketoacyl-CoA thiolase mRNA and introns
    IVS     <     1      282     3-ketoacyl-CoA thiolase intron D
    IVS         382  >  1205     3-ketoacyl-CoA thiolase intron E
BASE COUNT      241 a    351 c    302 g    311 t
ORIGIN      
        1 ggtaccagta ggctgtgcta gggactgcct gtttcctgca tctgccctgg gagtgtgtct
       61 ggtagagtgc cgggtcctaa gcttgaggga aagaaggctc ggctaccttg taacctccag
      121 ctgcccttct caggcaggct tcctagagag cttacctcac tccccaaggg ctccctgggc
      181 ctttattccc tagcacttgt cacacagatg gctgtcccta gggcttccta tgaaaactga
      241 gagctcattc cttcttcttc tccccctctt cctcctcctc agggtggagt ccatgtccct
      301 gtctaacaga gggaaccctg ggaatatttc ctcccgcctg ctggagagtg acaaagccag
      361 agactgcctg attcctatgg ggtgagtgtt catgggaagc tgtgtatgta tccactcacc
      421 gttctgcctg tccttcacca gacagcagca tactcttctc tgactggcag tactcagctt
      481 cctcttcttc ctctgactgt ctcatgctcg acgggaactg ggtatgagat caatgctgcc
      541 cttgacctcg ctttcctgag gtccccccta tacaacacac acacacacac acacacacac
      601 acacacacac acactcgttc ctctccaact ctacaaatgt ccaaatgcac ctcactctac
      661 attactgtgc ctgaggtgcc tgtgtcttag ggaaggcagt gggagtcctc aaagccttgg
      721 gaggcctcaa tagttgaaga cacggggagc ttggcttttg ttattgcggg ggagggctcg
      781 ggctctgaga taccttgtaa gaagtctgtt ccatctctcc agtgctttat aattctgcat
      841 ccgtgtctct gagctctcct cagtttgtcc aagggcaagg cattctggga ggctcaccat
      901 aggagaacca taggctggcc tggctttctc cacatcctgt gtgtttccaa gagcagccta
      961 gaagccgcag agaaactggg gccagatagt gtccccagca tcctcctgcc tatgtcttat
     1021 gggcagagac gactcacccg gggggaacag ttccttctgt aggccccacg ttcaggctgc
     1081 tcttcattct ttgaggtgtg agtctttgag catccgtgga catgggcgct cagtatttaa
     1141 gatgagccag gagtccctgg ggcgtggact gaagctctgt ctctgacgtc gacgccagag
     1201 gtcca
//
LOCUS       RATPKATA3     470 bp ds-DNA             ROD       04-AUG-1990
DEFINITION  Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene,
            exon 7.
ACCESSION   D90057 J05269
KEYWORDS    3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
            peroxisomal protein.
SEGMENT     3 of 4
SOURCE      Rat (Sprague-Dawley) liver DNA, clone PT147.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 470)
  AUTHORS   Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
  TITLE     Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
            closely related but differentially regulated genes
  JOURNAL   J. Biol. Chem. 265, 4600-4606 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Makoto
            Hijikata Virology Division, National Cancer Center Research
            Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
            (03)542-2511 Fax:   (03)545-3567
FEATURES       from  to/span     description
    pept    +    80  +   160     3-ketoacyl-CoA thiolase, exon 7
    pre-msg <     1  >   470     3-ketoacyl-CoA thiolase mRNA and introns
    IVS     <     1       79     3-ketoacyl-CoA thiolase intron E
    IVS         161  >   470     3-ketoacyl-CoA thiolase intron F
BASE COUNT      120 a     91 c    154 g    105 t
ORIGIN      
        1 cgcaagccta gtccttcatc atgtccctgg ccctcaggca ggcagaccca cggcctctgc
       61 tttcctgtgt gtgcactagg ataacctcgg agaatgtggc tgagcggttt ggcatctcac
      121 ggcagaagca agatgccttc gcgctggcct ctcagcagaa gtgagtgcag agggatgggc
      181 tgagggtggg gctggagagt ggatcgggta gagttttgcc aagaaacaaa ctgtaggttg
      241 atgtccttat gcaggtgaag gcaggtacag gatagatcga agcctgtcgt tagatgagaa
      301 ggaaggatga gcaggagaaa agttttaggg aggaggagac cggagagagg aaagagcagc
      361 caggagaact tggaggcaga tgttaacatt tctctctgca catttacagg tcgttatgac
      421 tgttcttaag ggatggatac agggctttgt atgtctcggt gaacaataac
//
LOCUS       RATPKATA4    2264 bp ds-DNA             ROD       04-AUG-1990
DEFINITION  Rat peroxisomal 3-ketocayl-CoA thiolase (EC 2.3.1.16) A gene,
            exons 8-12.
ACCESSION   D90058 J05269
KEYWORDS    3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
            peroxisomal protein.
SEGMENT     4 of 4
SOURCE      Rat (Sparague-Dawley) liver DNA, clone PT147.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2264)
  AUTHORS   Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
  TITLE     Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
            closely related but differentially regulated genes
  JOURNAL   J. Biol. Chem. 265, 4600-4606 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Makoto
            Hijikata Virology Division, National Cancer Center Research
            Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
            (03)542-2511 Fax:    (03)545-3567
FEATURES       from  to/span     description
    pept    +   301      491     3-ketocayl-CoA thiolase, exon 8
                633      812     3-ketocayl-CoA thiolase, exon 9
               1089     1144     3-ketocayl-CoA thiolase, exon 10
               1230     1375     3-ketocayl-CoA thiolase, exon 11
               1696     1771     3-ketocayl-CoA thiolase, exon 12
    pre-msg <     1     2030     3-ketocayl-CoA thiolase mRNA and introns
    IVS     <     1      300     3-ketoacyl-CoA thiolase intron F
    IVS         492      632     3-ketocayl-CoA thiolase intron G
    IVS         813     1088     3-ketocayl-CoA thiolase intron H
    IVS        1145     1229     3-ketocayl-CoA thiolase intron I
    IVS        1376     1695     3-ketocayl-CoA thiolase intron J
    signal     2012     2017     polyadenylation signal
BASE COUNT      446 a    615 c    703 g    500 t
ORIGIN      
        1 gagtctgctc agtgaaaccc aggttctggt aagaaagaga ggctgggtgg ggtggtgcag
       61 gaccctcagg ggagctcacc cacagagatg tctcccaaac actcaggctt tagcttcacg
      121 tcctggagaa ccgcggtcaa cacggccgac agaagctcgt cgggggtggt gtcctgcagg
      181 gcgaggtgca gcagcggtga gcccaactcc ttcatcccca cccgccgtct ctgcgctcac
      241 cggacccccg agggcaggat cccaaacatg cctatgtctt actgcccacg gtgctggcag
      301 ggcagcaagt gcccagagca aaggctgctt ccgtgctgag atcgtacctg tgacaaccac
      361 tgtcctcgat gacaagggtg acaggaaaac catcaccgtg tctcaggatg agggtgtccg
      421 ccccagcacc accatggagg gcctggccaa gctgaagcct gccttcaagg atggaggctc
      481 taccacggct ggtgagcgtg gccaggagta gggccaatct cagtggcttg ttctactctg
      541 agacctggag atgaccaggc cgcctgggaa ggtgatggtg gggggctggc tctcctcagg
      601 cctgctgacg tctcctggtc caccttctcc aggaaactcc agtcaggtga gtgatggagc
      661 agccgccgtc ctgctggccc ggaggtccaa ggctgaagaa ctgggcctcc ccatccttgg
      721 cgtcctgagg tcctatgcag tggtcggggt ccctcctgac atcatgggca tcggacctgc
      781 ctatgccatc cctgcggcct tgcagaaagc aggtgaggtg gctccttctc accctgtgct
      841 tggatccttc attacctgga tctgggactg gggagtgctg gggtttgacc ttctgggtgc
      901 tgtggatgga gcgggctagc accctacacc caagaccttc ctggcactac agctggctgg
      961 gtctcactct cttgtcccta gaagacaaac atcagactcc tgggatctag gcttgggata
     1021 gagtcctggg ggagtccaaa gtcctgccca gtggttgttg tcaattctgg cctatgcctc
     1081 acccacaggg ctgactgtga atgacataga catctttgag atcaatgagg cctttgcaag
     1141 tcaggtgagc ctgggtgttg tggtaggatt gactgagcca gcagttcaga gcagatggag
     1201 actttagctc ctgcgcttgc tccttccagg ccctctactg tgtggagaag ctgggaattc
     1261 ctgcagagaa ggtgaacccc ctggggggtg caatagccct gggccacccc ctgggctgca
     1321 ccggagcaag gcaggtggtc acgctgctca atgagctgaa gcgccgaggc agacggtaag
     1381 gctgctcctc gtggggtgca gtgggggtta ttagagctgg ggtgtctgat agctggggcg
     1441 tggggaggtg agcacacacg caggggttct gaacttggga atggaggggt gagagcctgt
     1501 cttccagaag ccttgcttca aaaacagcat aaccatgcat gccacctggg tgtgtgtctg
     1561 tccggggctc ggttcccaca gatctgctgt gctggcttgg actctgccca tgaacccttg
     1621 tgactgcagc ctgcctgatg ggagcatggg ttggggtttg gctgactcga gccaacactg
     1681 gttctttttc cccagggctt atggcgtggt gtccatgtgc attgggactg ggatgggagc
     1741 cgctgctgtc tttgaatacc ctgggaactg aggccctgac tgcaggcact acccagagag
     1801 tcctatagta gtgtctggag agggatggta caggagccat cttcgtggga cactcagcag
     1861 tggagggatt tgtcacagca cttcaattca gaagatgtag tcgatgttgg aacaggaggt
     1921 ggaactgccc tgtcaagtac cccaagccat gctaaagtga gcatgggaca cccaggttgc
     1981 aaagccatct gtacctctga cggatgggtg tagtaaatgt gtgctgtctc attttggtgg
     2041 ctgtcatatc cttcgcaggc agctttcgcc tcctggattc tgtgtataaa tgattaagaa
     2101 ccagggtgtc tccggtgaga tctagctgta tgcaggctct ggggacctgt catgaaagac
     2161 ctcagctgac accattgccc tagaagaggc tcccccatcc caggagtcat ctagccccta
     2221 gccacccccc cccccgctct ccagctgtgg gctatgcaga attc
//
LOCUS       RATPKATB1    2234 bp ds-DNA             ROD       04-AUG-1990
DEFINITION  Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
            exons 1-3.
ACCESSION   D90059 J05269
KEYWORDS    3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
            peroxisomal protein.
SEGMENT     1 of 5
SOURCE      Rat (Sprague-Dawley) liver DNA, clone PTB1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2234)
  AUTHORS   Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
  TITLE     Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
            closely related but differentially regulated genes
  JOURNAL   J. Biol. Chem. 265, 4600-4606 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Makoto
            Hijikata Virology Division, National Cancer Center Research
            Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
            (03)542-2511 Fax:    (03)545-3567 The sequence, 1251-1328 encodes
            leader peptide.
FEATURES       from  to/span     description
    pept       1251     1421     peroxisomal 3-ketoacyl-CoA thiolase, exon 1
               1562     1655     peroxisomal 3-ketoacyl-CoA thiolase, exon 2
               1985  +  2042     peroxisomal 3-ketoacyl-CoA thiolase, exon 3
    pre-msg    1223  >  2234     peroxisomal 3-ketoacyl-CoA thiolase mRNA and
                                 introns
    IVS        1422     1561     peroxisomal 3-ketoacyl-CoA thiolase intron A
    IVS        1656     1984     peroxisomal 3-ketoacyl-CoA thiolase intron B
    signal      848      854     enhancer core
    signal     1037     1045     enhancer core
    signal     1114     1118     CAAT box
    signal     1163     1167     CAAT box
    signal     1192     1198     TATA box homolog
    site       1099     1108     NFK-B site
BASE COUNT      460 a    614 c    647 g    513 t
ORIGIN      
        1 agtgctctta accactgagc tatctctcca gccctccact ttattttctg aggcggtatc
       61 tctaaacacg gtgatttgac tagggtaggg agccactctg cccctcggat cctctgcctt
      121 ccccacctaa gcactaagga agcctgccta gcatcgatgt gtgcagtggg gatctgatct
      181 ctggtctccc acttatgggg tcgagcactt taccctctga gccctctcgc actcccccct
      241 cctttcgagt tttgggctct cctggggact gagaaaaccc tgggtgattg tcgtggtttg
      301 gggctgacct gtcactgtga ttggggaacc cagccttact ctttaaggtt ggtaaaacat
      361 tctagatcca gggagcccca attcatccac tcagcaggaa accattgacc agctaatgta
      421 tcaggcacac tgcagacacc tctgctcagg ggggaggaga gtgggcgctg caggggtgtc
      481 catagatgag agcatttcaa atgggcatcc agaggtgtca agacccgcag gacactctca
      541 gagacctttg aaccacttca gtgctttaat aaaaccggtt cttccagcct gttagctgcc
      601 aaagattagg cctgagaatc tcaattactc aagtatccag cttggggcca gggacccagg
      661 agagacaaga ggcaaatgga acagaaaaca gagtcgggac ctctatctcc attcatagtg
      721 taaagctgtt accatggctc cagaacctcc tgccttttgt gtcatagtgt gcgatggtca
      781 tatttgaggg agggtatcct ggttcatgtt cagaaaccca gaagacttcg aggtcctacc
      841 tacagtgtgg attgcaaaat gctagaaacg acaggaaagt tccagacact ggaactggga
      901 ggggtttcaa aagtgcgcac aaactttttt ttaaattata tttgaaaaaa aaatcttatt
      961 ttgtagccca gactggcctt gaacttgcgc agaaaccact tgcaccatca caggtgcaga
     1021 cggatgagct taaggagtgg ttttgaagtt tatttgacca gcattgagaa gctgcgaggt
     1081 ggtggtggga tggatggtgg gactttcctt ctgattggct gtagaatggg taggcgggac
     1141 ttatgaaaaa tgcaaggctc ccattggctc tgccctgtgc cagccggtgt cttttattct
     1201 ctggggtcag cacccagagc ccaggctaat cggtgggacg cacctccgcg atgcatcggc
     1261 tgcaggtagt gctgggccac ctggccggcc ggtccgagtc gagctccgcg ctgcaagccg
     1321 cgccctgctc tgcgggcttc ccgcaggcct cggcctccga cgtggtggtg gtgcacggac
     1381 ggcgcacccc catcggccgc gccggccgcg gcggcttcaa ggtgaggccc ctcgggcggg
     1441 cgaggggagc agaaggcggc catagtgggt gggatcctgg ccctcggggg tccggtgacc
     1501 gcagagacgc cgggtgggaa tgaaggagct gggctcaccg ctgttgcgcc tcgccctgca
     1561 ggacaccacc cccgacgagc ttctgtcggc cgtgttgacc gcggttctcc aggatgtgaa
     1621 gctaaagcct gagtgtttgg gagacatctc tgtgggtgag ctcccctgag ggtcctgcac
     1681 cggctcttcc ccgccctcct ccttcccggg atccgtctct cacctcacag agcagagtta
     1741 ctgaagggca ggagtctgga cctctgaggt ctgaaccagg ctgcccccga tggccagtct
     1801 caagcctaga accttgggcg ccccctcgtg gcacatctgc caactgagag gcgctggaaa
     1861 gcgggaacct ctctcgaaag gctgtgattc ttggagagat ctggactgtc tcctgtctgg
     1921 ggcatgtttt acttcaggcc tcgggacacc acctggcact gagtcaggac tcttctgtcc
     1981 acaggcaatg tgcttcagcc cggggccgga gccgcaatgg cacgcatcgc ccagttcctg
     2041 aggtaatctt tctggatccg tcttagttgt ccaccatgac tgccgaaggc ccttggcact
     2101 gtgtgtgtca cggtggaaga tcaggttgaa tgtagagact gaggcaactg gagtccactg
     2161 acctgacctc tgctcgataa ccttttccct acttagaggg agtgtgttgc tggcccatgg
     2221 ccaaggcgcc cggg
//
LOCUS       RATPKATB2     913 bp ds-DNA             ROD       04-AUG-1990
DEFINITION  Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
            exon 4 and 5.
ACCESSION   D90060 J05269
KEYWORDS    3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
            peroxisomal protein.
SEGMENT     2 of 5
SOURCE      Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 913)
  AUTHORS   Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
  TITLE     Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
            closely related but differentially regulated genes
  JOURNAL   J. Biol. Chem. 265, 4600-4606 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Makoto
            Hijikata Virology Division, National Cancer Center Research
            Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
            (03)542-2511 Fax:    (03)545-3567
FEATURES       from  to/span     description
    pept    +   353      432     peroxisomal 3-ketoacyl-CoA thiolase, exon 4
                673  +   715     peroxisomal 3-ketoacyl-CoA thiolase, exon 5
    pre-msg <     1  >   913     peroxisomal 3-ketoacyl-CoA thiolase mRNA and
                                 introns
    IVS         433      672     peroxisomal 3-ketoacyl-CoA thiolase intron D
BASE COUNT      173 a    234 c    275 g    231 t
ORIGIN      
        1 tgtctcagat gagcacaaga aatgctgttc catctcagag tcttcaagat agaggaaaga
       61 ggttgtctgg gcccacggca ggcctccctg gtccctcttg agaactatgg gacagatgag
      121 aatgtgcttg ggggctcttg tgtgtagcta cggagacaga tctgcacagt agctgcttct
      181 gctcttgtgg atctgctcca gtgggggttg cctgttgtcc tgagctatat gcacactgct
      241 ggggctcttg tgggggccca gaagtctggt ggggcctgct ttccatgaga gagggtcatg
      301 tccagggatg gctgcacgtc cagggctgac cgccatctcc cctctgtttc agtggcatcc
      361 cagagaccgt gcctctgtca gcagtcaaca gacagtgttc atcgggactg caggcagtgg
      421 ccaacattgc tggtaagtga tgattcctgg gtgcccatga gtggaccctg accttggtcc
      481 ttgaccctca ggaccctagg caggaagagt ggagggacac ccattgccat accccaagcc
      541 tttttgggtg ctcccggggt cttccgtgag gctccgttga atgcactccc gagaacaggg
      601 agggaacggg ctgtgctgct ccagggcggg cccgttccgg ccaaggataa ttatggagtg
      661 ctcttggtgc aggtggcatc agaaatgggt cttacgacat tggcatggcc tgtgggtaag
      721 atggccttcc cgccgggctt attaaccaat gcaataatca aactaaaatt tttctctcct
      781 tgactgtgat tctccttgcc cactgtgaca gctggctggg ctgggctccc agggatgcag
      841 cccaggttgt ggctctgagt atcgtgacct cttttgttct cacaactttg cttgtcacat
      901 tggggatctg cag
//
LOCUS       RATPKATB3     145 bp ds-DNA             ROD       04-AUG-1990
DEFINITION  Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
            exon 6.
ACCESSION   D90061 J05269
KEYWORDS    3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
            peroxisomal protein.
SEGMENT     3 of 5
SOURCE      Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 145)
  AUTHORS   Hijikata,M., Wen,J.-k., Osumi,T. and Hashimoto,T.
  TITLE     Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
            closely related but differentially regulated genes
  JOURNAL   J. Biol. Chem. 265, 4600-4606 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Makoto
            Hijikata Virology Division, National Cancer Center Research
            Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
            (03)542-2511 Fax:    (03)545-3567
FEATURES       from  to/span     description
    pept    +    22  +   120     peroxisomal 3-ketoacyl-CoA thiolase, exon 6
    pre-msg <     1  >   145     peroxisomal 3-ketoacyl-CoA thiolase mRNA and
                                 introns
BASE COUNT       32 a     38 c     41 g     34 t
ORIGIN      
        1 gagctcactc tttctcctca gggtggagtc catgaccctg tctgagagag gcaaccctgg
       61 gaatatttcc tcccgtcttc tggagaacga gaaggccaga gactgcctga ttcctatggg
      121 gtgagtggcg acaggaatct ctcaa
//
LOCUS       RATPKATB4     369 bp ds-DNA             ROD       04-AUG-1990
DEFINITION  Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
            exon 7.
ACCESSION   D90062 J05269
KEYWORDS    3-ketoacyl-CoA thiolase; fatty acid beta-oxidation;
            peroxisomal protein.
SEGMENT     4 of 5
SOURCE      Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 369)
  AUTHORS   Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T.
  TITLE     Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two
            closely related but differentially regulated genes
  JOURNAL   J. Biol. Chem. 265, 4600-4606 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Makoto
            Hijikata Virology Division, National Cancer Center Research
            Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone:
            (03)542-2511 Fax:    (03)545-3567
FEATURES       from  to/span     description
    pept    +   242  +   322     peroxisomal 3-ketoacyl-CoA thiolase, exon 7
    pre-msg <     1  >   369     peroxisomal 3-ketoacyl-CoA thiolase mRNA and
                                 introns
BASE COUNT       66 a     94 c    123 g     86 t
ORIGIN      
        1 ggggggaaca gttccttctg taggccccac gttcaggctg ctcttcactc tttgaggtgt
       61 gagtctttga gcatccgtgg acatgggcgc tcagtattta agatgagcca ggagtccctg
      121 gggcgtggac tgaagctctg tctctgacgt ggacgcccag aggtgcaagt ctagtccttc
      181 atcgtgtccc tggccctcag gccagcagac ccatggcctc tgctttcctg tgtcgcacta
      241 ggataacctc ggagaatgtg gctgagcggt ttggcatctc acggcagaag caggatgcct
      301 tcgcgctggc ctctcagcag aagtgagtgc agagggatgg gctgaggtgg gcagagatgc
      361 tgagctggg
//
LOCUS       RATPKATB5    2787 bp ds-DNA             ROD       04-AUG-1990
DEFINITION  Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene,
            exons 8-12.
ACCESSION   D90063 J05269
KEYWORDS    3-ketoacyl-coA thiolase; fatty acid beta-oxidation;
            peroxisomal protein.
SEGMENT     5 of 5
SOURCE      Rat (Sprague-Dawley) liver DNA, clone PTB2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2787)
  AUTHORS   Hijikata,M., Wen,J.-k., Osumi,T. and Hashimoto,T.
  TITLE     Rat Peroxisomal 3-Ketoacyl-CoA Thiolase Gene: Occurrence of two
            closely related but differentially regulated genes
  JOURNAL   J. Biol. Chem. 265, 4600-4606 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Makoto
            Hijikata Virology Division, National Cancer Center Research
            Institute 5-1-1 Tsukiji, Chuo-ku Tokyo 104 Japan Phone: 03-542-2511
            Fax:    03-545-3567
FEATURES       from  to/span     description
    pept    +   987     1177     peroxisomal 3-ketoacyl-CoA thiolase, exon 8
               1326     1505     peroxisomal 3-ketoacyl-CoA thiolase, exon 9
               1782     1837     peroxisomal 3-ketoacyl-CoA thiolase, exon 10
               1923     2068     peroxisomal 3-ketoacyl-CoA thiolase, exon 11
               2387     2462     peroxisomal 3-ketoacyl-CoA thiolase, exon 12
    pre-msg <     1     2742     peroxisomal 3-ketoacyl-CoA thiolase mRNA and
                                 introns
    IVS        1178     1325     peroxisomal 3-ketoacyl-CoA thiolase intron H
    IVS        1506     1781     peroxisomal 3-ketoacyl-CoA thiolase intron I
    IVS        1838     1922     peroxisomal 3-ketoacyl-CoA thiolase intron J
    IVS        2069     2386     peroxisomal 3-ketoacyl-CoA thiolase intron K
    signal     2724     2729     polyadenylation signal
BASE COUNT      572 a    722 c    864 g    629 t
ORIGIN      
        1 ctcacggtgg ggcagagatg gcctgagctg gggctctcac tgtggtacta aggtgttggt
       61 tagccccatg actttagctg ggaggtggaa tggttcagcc cggagtgtgg ggtgggtctt
      121 cccagttctt aactattcac tggggtcggt ctctctagag acctcctcct cctctaccct
      181 gcttcttagg acagaaggac ccagttctga acagacatgg tggtcactgg tgatccttag
      241 gcctggagtg tggggtcagt ggtgctaaga ccggatcccc ctcctaccac aagatgaaga
      301 cctgggtttg cagcggtggc ctgaggccat cattccagaa aaccagagcc ttaggcccgt
      361 gggaaacaga agggcaggtt ggccactcaa atcatggatt gagaacagag ggaggatgga
      421 gtgtaagcca ggtggcctca ccttttagag ttggccagca gtgtgcgttc cctggaagcc
      481 gtgcgttcct gcccacaggc atgactctac ccacttgttc tgttcaccct gacaacagcg
      541 tgggaatcag gaaataaatc caggaggtgc acctcgcttt taaatttaag aggggaagca
      601 ggtgaagttt cgcttttagc agtcaaacat gggccttgaa gaatccggca cagacattcc
      661 caggggtgca gcctcgttgt ggaagagacc tctgagcagg ctaagcagct agcggattaa
      721 gaatcaggag acagaatagc agctctgaac tccctgctgt tggctgggga tgagtactgc
      781 caaaccggga aaatcaggag ctgctgctga gtctgggaga gcttcgggtg cgcctttccc
      841 tccctgggga atcttagctg tcactcagag ctccatcagg gccacaaagg gctgtggttg
      901 cattgccttt ctgagttagc caagcccagg gactgggagg aaagactaca aaggggcccg
      961 actctcgctg tccatggtgc tggcagggca gcaagcgccc agagcaaagg ctgcttccgt
     1021 gctgagatcg tgcctgtgac aaccactgtc ctcgatgaca agggtgacag gaaaaccatc
     1081 accgtgtctc aggatgaggg tgtccgcccc agcaccacca tggagggcct ggccaagctg
     1141 aagcctgcct tcaaggatgg aggctctacc acggctggtg agcgtgggcc aggattaggg
     1201 ccgatctcgg tggcttgttc tactctgaga cctggagctg accaggccgc ctgggaaggt
     1261 gatagtcagg ggctggctct ctcaggccct gcctgacgtc tcctcgtccc acccttcctc
     1321 cccaggaaac tccagtcagg tgagtgatgg agcagccgcc gtcctgctgg cccggaggtc
     1381 caaggctgaa gaactgggcc tccccatcct tggcgtcctg aggtcctatg cagtggtcgg
     1441 ggtccctcct gacatcatgg gcatcggacc tgcctatgcc atccctgcgg ccttgcagaa
     1501 agcaggtgag gtggctcctt ctcaccctgt gcttggatcc ttcattacct ggatctggga
     1561 ctggggagtg ctggggtttg accttctggg tgctgtggat ggagcgggct agcaccctac
     1621 acccaagacc ttcctggcac tacagctggc tgggtctcac tctcttgtcc ctagaagaca
     1681 aacatcagac tcctgggatc taggcttggg atagagtcct gggggagtcc aaagtcctgc
     1741 ccagtgattg ttgtcaattc tggcctatgc ctcacccaca gggctgactg tgaatgacat
     1801 agacatcttt gagatcaatg aggcctttgc aagtcaggtg agcctgggtg ttgtggtagg
     1861 attgactggg ccagcagttc agagcagatg gagactttag ctcctgcgct tgctccttcc
     1921 aggccctcta ctgtgtggag aaggtgggaa ttcctgcaga aaaggtgaac cccctggggg
     1981 gtgcaatagc cctgggccac cccctgggct gcaccggagc aaggcaggtg gtcacgctgc
     2041 tcaatgagct gaagcgccga ggcagacggt aaggctgctc ctcgtggggt gctgtggggg
     2101 ttattagagc tggggtgtcc aatagctggg ggcgtgggga ggtgagcaca cacacagggg
     2161 ttctgaactg ggaatggagg ggcgagacct gtcttccaga agccttgctt caaaaacagc
     2221 atgactgtgc ctgccactgg gtgtgtgtct gtccggggct cggttcccac agatctgctg
     2281 tgctggcttg gactctgccc atgaaccctt gtggctgcag ccctggctgg tgggagcatg
     2341 ggttggggtt tggctactcg agccaacact ggttcttttt ccccagggct tatggcgtgg
     2401 tgtccatgtg catcgggact gggatgggag ccgctgctgt ctttgaatac cctgggaact
     2461 gaggccctga ctgcaggcac tacccagaga gtcctatagt agtgtctgga gagggatggt
     2521 acagaagcca tcttcgtggg acactcagca gtggagggat ttgtcacagc actttaattt
     2581 agaaaatgta atcgcaatac gggcacaagg gcatccaatc cactcctctc agttgaacca
     2641 aagacagcgc acgtcacacg tgaccacagg acaacatgtt tctgagggac agtggcctca
     2701 tgaatagtgg gcatgagggg catagtaaat gtatgttatc tcatcttgct tgcagtcatt
     2761 tcccccaggc cggctcttgc tttgttg
//
LOCUS       NPOCAP       2102 bp ds-DNA             VRL       04-AUG-1990
DEFINITION  Orgyia pseudotsugata polyhedrosis virus capsid protein (P39)
            gene, complete cds.
ACCESSION   M21042 M23429
KEYWORDS    capsid protein; structural protein.
SOURCE      Orgyia pseudotsugata polyhedrosis virus DNA, clone cosmid #58.
  ORGANISM  Orgyia pseudotsugata polyhedrosis virus
            Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; 
            Nuclear polyhedrosis viruses.
REFERENCE   1  (bases 1 to 2102)
  AUTHORS   Blissard,G.W., Quant-Russell,R.L., Rohrmann,G.F. and Beaudreau,G.S.
  TITLE     Nucleotide sequence, transcriptional mapping, and temporal
            expression of the gene encoding p39, a major structural protein of
            the multicapsid nuclear polyhedrosis virus of Orgyia pseudotsugata
  JOURNAL   Virology 168, 354-362 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by G.W.Blissard, 03-NOV-1988.
            
            The p39 protein is found in both the PDV and BV virion phenotypes;
            P39 mRNAs are approximately 2600 nt in length, are expressed late
            (24-48 hpi), and have two major transcription initiation sites.  At
            each major transcription start site, two transcripts are initiated
            within the ATAAG consensus.  Transcripts initiate at positions 2
            and 3 of the ATAAG consensus.
FEATURES       from  to/span     description
    pept        124     1179     capsid protein
    mRNA         27  >  1179     capsid protein mRNA (alt.)
    mRNA         28  >  1179     capsid protein mRNA (alt.)
    mRNA         75  >  1179     capsid protein mRNA (alt.)
    mRNA         76  >  1176     capsid protein mRNA (alt.)
BASE COUNT      493 a    607 c    585 g    417 t
ORIGIN      Map position 57-58.
        1 ccggcgcagc gcaccgtttc tttatataag aatctaaaat tagatacaag aggtcttggc
       61 taaaattgat tgtataagag atttcttgct ctatcaaaac gtcggccccc atcgccgcag
      121 aaaatggcgc tcgtgtcacc cggggtgtcg tcgcgacggt cgaccaacca ctgcatcttt
      181 ggcgcgattg aaccgttcga ctcgtgcgtc acgtacaggt cgccatgctc gtcggacgcg
      241 tcggtggacg acggctggtt catctgcgac taccacctga agttgcgctt caaaatgtcc
      301 aaaatggtgc tgcccattta cgacgaagac gacaaccagt acaagcgcac catagcgcgg
      361 catttggtgg ggcacaaaga aaggggcgta aagcgcattc tggtgcccac cagagccaac
      421 tacatgaccg tgttcaattt gccgggcatg atgctggccg agcagctgat ttttcatttg
      481 atctacgaca accgtttgga agtcaatcgc atttgcgcct cgttgaaaaa caacgaaaac
      541 ttcattgaca acacgtacag cgttgtggag agcgtgtaca gcgcgacgcg caacattctg
      601 tcgctcacag acccgcaagc gtactgctcg cgcgtggcca acgacgacgt gcgctttttc
      661 gacgcgaacg tagtcgacaa caactaccag gcgggcaacg gcgacaccgt gtttaacaac
      721 atgcccgggt ttttgcgcaa cctaatcagg cgcgccgtgg cgcccgaaac gctgcagatc
      781 gacagcgaag acctgcgcct gcgaaactgc aacacctgtg tgattaacaa caccggcttg
      841 gtagcaaccg tgacaaacac cgagctgtac aacccggtgc gcagctcgga catcattaaa
      901 acgcggccca atcgcttgca gattagaaac gttttgaagt ttgagggcga cacacgcgcg
      961 ttggagcgca ccttgggccg ctacgaggaa taccccatgt acgtgccctt gtttttgggc
     1021 taccagctgg tcaacctgca aaacgacatt ttgcgcgcca acaacttttt acccgcgccg
     1081 tttggcgtgc cgcaggccgt aaacaacctg gaggcccaag cgcccgcagc gcccgcaccc
     1141 gcagcgcccg cgcccgcgcc cgcagcgccc gttgtttaat tccgttgcca tggacacggt
     1201 gaggctgcag tgccacattt gctgctcggt gggcgaaatc aaaaattatt ttttgcagcc
     1261 cgtcgacgca atcaccattt tgcccattgt ggagctgcac acgtgccggc accaattgtg
     1321 cgtcatgtgc gtgcgcaaaa ttgcgcagcg cggccgagac aagcgcgtcg agtgccccat
     1381 gtgccgccgc aaaaacgcgc actttaacgt gtacagcgtg aaccgtaatt cggtggatgt
     1441 gttgcgctgc agcgtggccg acgtgcgcga gcacgggcgc tttggcggat tggcggacgc
     1501 cgcgtcgctg gctcgtgggc tgtttgagcc cagcttgctt gaagccgagc ccgcgccgga
     1561 caactcgttt ggcccaaacg agttacaact tgtgctaaaa cgtctaaagg cgcaaattga
     1621 ggcgcaaaca agaactaatt atgatttgca actgcaagca actgctttgg agcgaacaat
     1681 cgaagaagca aacgaccgct tgggtaaaag ccgcggcgac tacagcgacg cgtgcaagct
     1741 gatggacgag cttcgcggcg acaggctgcg cgcggagcgc gcggtcaagg cgctcgccga
     1801 cgcgcacgcc cagtgggcgg acaagaacgc caaaatgcgg cgcgagaacg acaggttgac
     1861 aaatgaaaac attggtttga ttcgggacaa caatttattt aaacaaaaca ctgcacgcaa
     1921 acgtaaaatc gctccgtaac cgcctttgtt tgcgcgtcaa agtccacctc gtagacgtgt
     1981 cgctcgttta cgctgccgca gcggtcccag ttgtccgtcg tgcaccacgc cgcattgtgt
     2041 cggcggcaca aatacagcgc ttcgttgaat tgcgccggcg gcagattgaa gcggccgagc
     2101 tc
//
LOCUS       PT2REGA       462 bp ds-DNA             PHG       04-AUG-1990
DEFINITION  Bacteriophage T2 translational repressor (regA) gene, complete cds.
ACCESSION   M27737
KEYWORDS    RNA binding protein; translational repressor.
SOURCE      Bacteriophage T2, cDNA to viral mRNA.
  ORGANISM  Bacteriophage T2
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 462)
  AUTHORS   Miller,E.S. and Jozwik,C.E.
  TITLE     Sequence analysis of conserved regA and variable orf43.1 genes in
            T4-like bacteriophages
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.S.Miller, 08-SEP-1989.
            
            The T2 RegA protein coding sequence corresponds to positions
            9839-10207 of the T4 regA sequence contained in the entry with
            accession number M10160.  The complete sequence reported herein
            would correspond to positions 9804-10266.
            
            An RNA secondary structure is detected by primer extension
            reactions on mRNA templates at positions 407-424.  An RNA nuclease
            sensitive site is noted at position 431.
FEATURES       from  to/span     description
    pept         36      404     translational repressor protein (RegA)
BASE COUNT      169 a     66 c     88 g    139 t
ORIGIN      
        1 gaaacaactc aaaaaactag cattggaatg gtaaaatgat tgaaattact cttaaaaaac
       61 ctgaagattt tctgaaagta aaagaaactt tgactcgtat gggaattgct aataataaag
      121 ataaagttct gtatcagtcc tgtcatattc ttcagaaaaa aggactatac tatatcgttc
      181 attttaaaga aatgcttcgt atggatggtc gccaagttga aatgacagaa gaagatgaag
      241 ttcgtcgtga ttcgattgca tggctattag aagattgggg actaattgaa atcgttcctg
      301 gtcaaagaac ttttatgaaa gatttaacta ataacttccg agttatttct tttaaacaaa
      361 aacatgaatg gaaactcgtt cctaaatata cgattggtaa ttaagcaagg ggcttcggcc
      421 ccttatttgg agtataatat atcaagagcc taataactcg gg
//
LOCUS       PT4G55G43   13062 bp ds-DNA             PHG       04-AUG-1990
DEFINITION  Bacteriophage T4 genes 55, alpha-gt, 47, 46, 45, 44, 62, regA,
            and 43.
ACCESSION   M10160 J02510 X00769
KEYWORDS    DNA polymerase; DNA polymerase accessory protein;
            alpha-glucosyltransferase; exonuclease; polymerase;
            translational repressor.
SOURCE      Bacteriophage T4 DNA.
  ORGANISM  Bacteriophage T4
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 7513 to 8269)
  AUTHORS   Spicer,E.K., Noble,J.A., Nossal,N.G., Konigsberg,W.H. and
            Williams,K.R.
  TITLE     Bacteriophage T4 gene 45: Sequences of the structural gene and its
            protein product
  JOURNAL   J. Biol. Chem. 257, 8972-8979 (1982)
  STANDARD  full staff_review
REFERENCE   2  (bases 7513 to 10735)
  AUTHORS   Spicer,E.K. and Konigsberg,W.H.
  TITLE     ;
  JOURNAL   (in) Mathews,C.K., Kutter,E.M., Mosig,G. and Berget,P.B. (Eds);
            Bacteriophage T4: 299-301;
            American Society for Microbiology, Washington, DC (1983)
  STANDARD  full staff_review
REFERENCE   3  (bases 9747 to 10265)
  AUTHORS   Trojanowska,M., Miller,E.S., Karam,J., Stormo,G. and Gold,L.
  TITLE     The bacteriophage T4 regA gene: Primary sequence of a translational
            repressor
  JOURNAL   Nucleic Acids Res. 12, 5979-5993 (1984)
  STANDARD  full staff_review
REFERENCE   4  (bases 8257 to 9272)
  AUTHORS   Spicer,E.K., Nossal,N.G. and Williams,K.R.
  TITLE     Bacteriophage T4 gene 44 DNA polymerase accessory protein:
            Sequences of gene 44 and its protein product
  JOURNAL   J. Biol. Chem. 259, 15425-15432 (1984)
  STANDARD  full staff_review
REFERENCE   5  (bases 1 to 7084)
  AUTHORS   Gram,H. and Rueger,W.
  TITLE     Genes 55, alpha-gt, 47, and 46 of bacteriophage T4: The genomic
            organization as deduced by sequence analysis
  JOURNAL   EMBO J. 4, 257-264 (1985)
  STANDARD  full staff_review
REFERENCE   6  (bases 6985 to 7512)
  AUTHORS   Hsu,T. and Karam,J.
  JOURNAL   Unpublished (1985) Med U of South Carolina, Charleston, SC.
  STANDARD  full staff_review
REFERENCE   7  (bases 9781 to 10291)
  AUTHORS   Adari,H.Y., Rose,K., Williams,K.R., Konigsberg,W.H., Lin,T.-C. and
            Spicer,E.K.
  TITLE     Cloning, nucleotide sequence, and overexpression of the
            bacteriophage T4 regA gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 1901-1905 (1985)
  STANDARD  full staff_review
REFERENCE   8  (bases 7513 to 12985; revises [1],[2],[4],[7])
  AUTHORS   Spicer,E.K.
  JOURNAL   Unpublished (1986) Yale U School of Med, New Haven, CT.
  STANDARD  full staff_review
REFERENCE   9  (bases 7513 to 12985; revises [1],[2],[4],[7],[8])
  AUTHORS   Spicer,E.K.
  JOURNAL   Unpublished (1987) Yale U School of Med, New Haven, CT.
  STANDARD  full staff_review
REFERENCE   10 (bases 10205 to 13062)
  AUTHORS   Spicer,E.K., Rush,J., Fung,C., Reha-Krantz,L.J., Karam,J.D. and
            Konigsberg,W.H.
  TITLE     Primary Structure of T4 DNA Polymerase: Evolutionary relatedness to
            eucaryotic and other procaryotic DNA polymerases
  JOURNAL   J. Biol. Chem. 263, 7478-7486 (1988)
  STANDARD  full staff_entry
REFERENCE   11 (bases 9184 to 9903)
  AUTHORS   Rush,J., Lin,T.-C., Quinones,M., Spicer,E.K., Douglas,I.,
            Williams,K.R. and Konigsberg,W.H.
  TITLE     The 44P subunit of the T4 DNA polymerase accessory protein complex
            catalyzes ATP hydrolase
  JOURNAL   J. Biol. Chem. 264, 10943-10953 (1989)
  STANDARD  full staff_entry
REFERENCE   12 (bases 10212 to 10215)
  AUTHORS   Andrake,M., Guild,N., Hsu,T., Gold,L., Tuerk,C. and Karam,J.
  TITLE     DNA polymerase of bacteriophage T4 is an autogenous translational
            repressor
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 7942-7946 (1988)
  STANDARD  full staff_entry
COMMENT     Gene 45 protein is required for both T4 DNA replication and T4 late
            gene transcription [1].  Gene 44 protein is a DNA polymerase
            accessory protein required for T4 DNA replication [4].  Gene 55
            protein associates with the modified host RNA polymerase and is
            necessary for late RNA transcription [5].  Gene 43 encodes a T4 DNA
            polymerase [8].  [5] notes several open reading frames not included
            in the FEATURES table.
            
            A draft entry and computer-readable copy of the sequence in [8],[9]
            were kindly provided by E.K.Spicer 13-MAR-1986 and 27-JUL-1987.
FEATURES       from  to/span     description
    pept         25      582     g55 protein
    pept       2356     3558     alpha-glucosyltransferase (alpha-gt)
    pept       3735     4754     exonuclease g47
    pept       5198     6880     exonuclease g46 (gtg start codon)
    pept       7579     8262     DNA polymerase accessory protein g45
    pept       8313     9272     DNA polymerase accessory protein g44
    pept       9274     9837     DNA polymerase accessory protein g62
    pept       9839    10207     translation repressor protein (regA)
    pept      10286    12982     DNA polymerase (g43)
    revision   8707     8707     g in [8]; c in [2],[4]
    revision   8835     8835     c in [8]; g in [2],[4]
    revision   8950     8950     g in [8]; c in [2],[4]
    revision   9135     9136     cg in [8]; gc in [2],[4]
    revision   9484     9484     c in [8]; g in [2]
    revision   9929     9929     g in [2],[3],[8]; aaag in [7]
    revision  10212    10215     gggg in [12]; gg in
    revision  10551    10552     ct in [8]; tc in [2]
    revision  10807    10807     a in [9]; c in [8]
    revision  12597    12599     ttg in [9]; tg in [8]
    revision  12605    12607     aaa in [9]; aa in [8]
    revision  12609    12610     tt in [9]; t in [8]
    revision  12612    12613     tt in [9]; t in [8]
    revision  12618    12619     aa in [9]; a in [8]
    revision  12644    12646     aaa in [9]; aa in [8]
BASE COUNT     4471 a   1954 c   2512 g   4125 t
ORIGIN      About 45.5 kb on genomic map.
        1 tatgaattga gctaagaggt gaaaatgtca gaaactaagc ctaaatataa ttacgtaaac
       61 aataaagagc ttttacaagc tattattgat tggaaaacag aattagcaaa taataaagac
      121 ccaaataaag tagttcgtca gaatgatact atcggattag ccattatgct tattgcagaa
      181 ggcttatcta aacgtttcaa cttttcagga tacacccagt cttggaaaca agaaatgatt
      241 gcagatggta tagaagcttc tattaagggg cttcacaatt ttgatgaaac gaaatataaa
      301 aacccacatg cgtatataac tcaagcttgt tttaatgcat tcgtccaacg tattaaaaaa
      361 gaacgtaagg aagttgcaaa gaaatatagt tacttcgttc acaatgtcta tgacagccgt
      421 gacgacgata tggttgcgtt agtagatgaa acttttattc aagacatcta tgataaaatg
      481 acgcattacg aagaatcaac ctatagaaca ccgggggctg aaaagaaaag tgttgtagat
      541 gattctccta gtttggattt tttatatgag gctaacgatt aacctctccg gattcttgga
      601 agaaatacct gaagttgaag ctattcccta tttacttaaa atgtatctca gggaagtttt
      661 agctcttgac attgatattg atccagaaaa tccgtatgat accgctttta aatctaatgg
      721 tgtagaatta aactatcggt atcatttaac agatgatgat ttttatttta tattagagaa
      781 ataatatgac tgataaaccc gaaattaatg atgaagtgga aaagcttatt tcttctattg
      841 aagaaaagaa ccgtcttgaa gcagaaagaa aagcaaataa gttattgtct aaaaacaaac
      901 gcgaactgaa tcgtctttat aagcacgctc agatcgcagc tgaaaataat aattttgctc
      961 aatacgaata tgctatcaag aaaagtcggg atattctaaa acagccatat aacgatgaac
     1021 tcatcagtat tctttggaag actactagat cgcagattga ggatatgatt gatgcttaca
     1081 cacgtaaaat tcaagcgtct taaaattaat gcaggattta ctgaatcttt gaatggtcat
     1141 ctttgcgtga aaatttctga aaaagaatac cgtgatagtt caatcaaaga agttaatcct
     1201 cctattgtaa gagcagaccc taatatgaaa gtgtgggttg attcttatca agtcaaaaaa
     1261 tggtggcaac tgtgaaagat gaacatccga cttctgaaat agattataat aaaattcgtt
     1321 cctctagaga ggaaatgatg agacgcttta aagaggctca tgataaagct aaagcagaag
     1381 gaactataac atataagcgc ataaaattta aaagttctaa cgagcctctg tatggcgtat
     1441 tatgtggata ggagcttcgg ctcctatatt gctttataaa ttttttggta aaataaacta
     1501 aaatgaatta tttaaacata tataatttac tgatcgaaaa aggactaaac agaggcaatg
     1561 ataaaagtct tttgacatac tatactgaaa cacatcatat aattcctaga tgtatgggtg
     1621 gtacagatga taaaactaat ttagttcttt tgacaccaga agaacatttt accgcgcatt
     1681 tattattatt taaaatttat agacttccta aattggcttt agctattcgt atgatgtgtt
     1741 atagttctga cggaacaaga ctaaataata aaatgtatgg atggattaaa acagctgtat
     1801 cttcttcaat atcggaaagc atgaaagagt tctggaaaga tgatgataat aaaaaataca
     1861 tgtctaacgc tagacgaaat gctggaaaac caatttatca atatgattta aatggaaatt
     1921 ttattcgcaa atatagatgt attacagatg ctgcagaaga tatgtcatat agctgttcaa
     1981 cttcaattaa acaatgtgta gatggtaaaa ggaaaaccgc tggaggattt caatggaaat
     2041 attattattc tgataacata ggaaagccct ctaggatgtc taatgcaact aaacaaaaaa
     2101 tgtctaaatc taaacgtggg ataactcaga aaagaaatgt gccagtattt caatatgata
     2161 ctactggaaa attattacga gtatttccta gaattaaaga cgcagctgta tccgttaaag
     2221 gatgcatgtc taatattaaa aagtgcatat ctggaaaaag caaaattgct tatggatatg
     2281 tttgggccta ttcgtagcgt tttatagaaa ataaaatatt atttacatga tttattaaat
     2341 gaaaagagga aaactatgcg tatttgcatt tttatggctc gaggtcttga aggttgtggt
     2401 gtaacaaaat tctcactcga gcaacgtgat tggtttatta aaaatggtca tgaagtaact
     2461 ttggtttatg ctaaagataa atcatttact cgtacaagtt ctcatgacca caaatcattt
     2521 tcaattccag ttattttagc taaagaatac gataaagcac ttaagctagt aaatgattgt
     2581 gatattctaa ttattaattc tgttcctgct acttccgttc aagaagctac gattaataac
     2641 tataaaaaac ttttagataa tattaaacct tctattcgtg ttgtagttta tcagcatgat
     2701 cattctgttc tttctttgcg tcgaaatttg ggattagaag aaactgttcg tcgagctgat
     2761 gttattttta gccattctga taatggtgat tttaataaag ttctgatgaa agaatggtat
     2821 ccagaaactg tttctctgtt tgatgatatt gaagaagcac cgacagtata taattttcag
     2881 cctcctatgg atattgtgaa ggttcggtca acttattgga aagatgtttc tgaaattaac
     2941 atgaatatca accgttggat tggtcgtacg actacatgga aaggttttta ccagatgttt
     3001 gattttcatg aaaaattctt aaaacctgct ggtaaatcca ctgtaatgga aggtctggaa
     3061 cgttcccctg cttttattgc aattaaggaa aaaggtattc cgtatgaata ttacggtaat
     3121 cgtgagattg ataaaatgaa tctcgcgccg aatcaaccgg cacaaatcct agattgttat
     3181 attaatagtg aaatgcttga acgaatgagt aaatctggct ttggatatca gttgagtaaa
     3241 cttaaccaga aatacttaca acgctcactc gaatatactc atctcgagct tggtgcatgt
     3301 ggaacaattc cggtattttg gaaatctact ggcgaaaatt taaaattccg tgttgataat
     3361 actcctttga cctcgcatga tagcggtatc atttggtttg atgaaaatga tatggaatca
     3421 acatttgaac gtattaaaga actgtcatct gaccgagctc tttatgaccg tgagcgagaa
     3481 aaagcatatg aatttttgta tcagcatcaa gattcaagct tctgctttaa agaacagttt
     3541 gacattatta caaaataaag ggcttcggcc ctttagcttt atacggagtt tgatataatg
     3601 atatttcttg gatatgtgat actttttctt gcattttatc tattcactag agcatgttgg
     3661 attgggttct ttagcactcc agatggattt atttcaataa ttttattttg catttcaatg
     3721 acggttcttg atatatgaaa attttaaatt taggtgattg gcatttaggc gttaaagctg
     3781 atgatgagtg gattcgcggt attcaaattg atggaattaa gcaagcgata gaatattcta
     3841 agaaaaatgg aattactacc tggattcaat acggtgatat ttttgatgtg cgaaaagcga
     3901 tcacacataa aactatggag tttgcccgtg aaatagttca aacgcttgat gatgctggaa
     3961 ttacattaca cactattgta ggaaatcacg atctccacta taaaaatgta atgcatccaa
     4021 atgcttctac tgagcttttg gctaaatatc ctaatgttaa agtgtatgat aagcctacta
     4081 cagtagattt tgacgggtgt ttgattgatt tgattccttg gatgtgcgaa gaaaatactg
     4141 gtgaaattct tgagcatatc aaaacttcat ctgcttcttt ttgtgttggt cactgggaac
     4201 tgaatggatt ttatttttat aaaggaatga aatctcatgg tcttgaaccc gatttcctta
     4261 agacttataa agaagtgtgg tctggtcact tccatactat ttctgaggct gcaaacgtca
     4321 gatatattgg gacaccatgg acactaaccg caggtgacga gaatgaccct cgtgggttct
     4381 ggatgtttga tacagaaaca gaacgaacgg aatttattcc aaacaatact acctggcatc
     4441 gtagaattca ttatccattt aaaggaaaaa ttgactataa agattttaca aatctatcag
     4501 tacgtgttat agtaactgaa gtagacaaaa atctgacgaa gttcgaatct gaactagaaa
     4561 aagttgtgca ttcattacga gttgtgtcaa agattgataa ctctgtcgag tcagatgaca
     4621 gtgaagaagt tgaagttcaa tcgcttcaga cattaatgga agagtatatt aatgcaattc
     4681 cagacatcac tgattctgac cgtgaagcac ttattcaata tgcaaatcag ctatatgtag
     4741 aggcaacaca atgacttttg atgaatttaa aaatgttatg atgagtcagc attttaaatg
     4801 tgaagtaaaa gatgatattg gtcataaaga aattattgaa tactggtttg aaccgttaga
     4861 ggttgaagat aattgtatta aaaaggttac tgtctgcact gactgggctg tatcttttaa
     4921 cttcaacatt ttagataatg acacacctaa atcattacga gatatggctg tatcttgtat
     4981 taaggatgca tactgtgaag ttttcgacat ttgacattaa tgatgaattc atagcaaata
     5041 ttgattatac cgaagaagat tctagatatg ttggaataat ttatatcaca tcaaaaacag
     5101 cacaaggcgt tgtttgcatg gctgaatttg atgaatactt tttagattat gatgatatga
     5161 tagaatggtc taaaagatac attaaaagga atcttttgtg aagaatttta aacttaatag
     5221 agtaaaatat aaaaatataa tgtcagttgg tcaaaatggg attgacattc aattagataa
     5281 ggttcaaaaa actcttatta ctggacgaaa tggcggtggt aagtctacta tgctagaagc
     5341 catcacattt gggctttttg gtaagccatt tcgcgatgta aagaaaggtc aattaataaa
     5401 cagcacaaat aagaaagaac ttttagttga actgtggatg gaatatgatg agaaaaagta
     5461 ctatatcaaa agaggacaaa aaccaaacgt tttcgaaatc accgttaacg gtacacgtct
     5521 taatgaatct gccagcagta aagatttcca agcagaattt gaacagctta tcggaatgtc
     5581 atatgccagt ttcaagcaga ttgttgtcct tggtacagca gggtataccc ctttcatggg
     5641 tttgtcgacc cctgcgcgaa gaaagcttgt ggaagacctg cttgaggtag gaacattagc
     5701 tgaaatggat aagcttaata aagcactaat acgtgaatta aattcgcaga accaagtgct
     5761 tgatgttaaa aaagatagta ttatccaaca aattaaaata tataatgata atgttgaacg
     5821 ccagaaaaaa ttaacgggtg acaaccttac tcgtctacaa aatatgtatg atgatttggc
     5881 aaaagaagct agaacgctaa aatcggaaat agaagaagct aacgaaagat tagttaatat
     5941 tgttttggac gaagacccga ctgatgcatt taataaaatc ggtcaagaag catttttaat
     6001 taaatcaaaa attgactcgt ataataaagt cattaatatg tatcacgaag gtggattatg
     6061 tccaacctgt ttgtcacaat tgagttccgg tgataaagtt gtttctaaaa ttaaagataa
     6121 agtttctgaa tgcacacatt cgtttgaaca gctttcaaca catcgtgata atttaaaagt
     6181 tcttgttgat gaataccgag ataatattaa aacccagcag tcgttggcaa atgatattcg
     6241 caataaaaag caatctctga tcgcagcagt agataaagct aaaaaagtta aagcggctat
     6301 agaaaaagca tcttctgagt ttattgacca tgctgatgaa atagcactgc ttcaagaaga
     6361 acttgataaa attgttaaga caaaaactaa tttagtaatg gaaaaatacc accgaggaat
     6421 tttgactgat atgctcaaag attctggtat taaaggtgct attattaaaa agtacattcc
     6481 attatttaat aagcagatta accattatct taaaataatg gaagcggatt atgtgtttac
     6541 attagatgaa gaatttaatg agacaattaa atcccgtggt cgtgaagatt ttagttatgc
     6601 ttcatttagt gaaggtgaaa aggcacgaat cgatattgct cttttattta cttggcgtga
     6661 tattgcttct atcgtatctg gtgttagtat tagtacatta attcttgatg aagtgtttga
     6721 tgggtcattt gatgccgaag gtattaaagg tgtagctaat attataaatt caatgaaaaa
     6781 cactaacgtt tttataattt cgcataaaga ccatgacccg caagaatatg gtcagcatct
     6841 tcaaatgaag aaagttggtc gatttactgt aatggtttaa tttataagag attatgcttt
     6901 aatttattag agtataatct ctatggagga aaaacatgga atattcaact ggacagcatc
     6961 tattaactat tcctgaaata aaacgatata ttctgagaaa taatttttct aatgaagagc
     7021 atatagttac tgaatctatg cttaggaatg catttaaagc agaatataca aaaataatgt
     7081 ccaatagaaa tgaagcttgg actgttactg attattatga ctaaaggtgt attatgacta
     7141 aaattactgt gaattatact gttgatgtaa aagatattca gccaaaacac gtgcgttctg
     7201 aatcaaatcc acaaaaccaa aataaaattc gtcgagcatg tgttttgtct ctttctgata
     7261 acgcaatgga agttattcag aacaaaatta aatctgcacc tgctcgtcat gcgtattatg
     7321 aagctatcga tcgtgaagta agtaataaat ggattgaact aatgcgcaaa catactacag
     7381 aatccctaaa cgccggtgct aaatttatta tgacttcatg tggtgaaacg ccttgaagat
     7441 gattattgcg gtaatgcaga tgaacgtcta attgttgctg ctcaaattgt tgcggaaaca
     7501 attgcggctg attttaacgt taattgcttt attaaattag ttataaaatt aaatctcatt
     7561 tgaattgaag gaaattacat gaaactgtct aaagatacta ctgctctgct taaaaatttc
     7621 gctactatta actctggtat tatgcttaaa tccggtcaat ttattatgac tcgcgcagtt
     7681 aatggtacaa cttatgcgga agcaaatatt tctgacgtta ttgattttga tgtagcaatt
     7741 tacgatttga acggttttct cggtattctg tctttagtta atgatgcaga aatttcccag
     7801 tcagaagatg gaaatattaa aattgctgat gcccgctcaa caattttttg gcgagcagcc
     7861 gatccgagta cagatgttgc tcctaataaa ccaattccat tcccggtagc atctgctgtt
     7921 actgaaatta aagctgaaga ccttcaacag ctgttgcgtc tatctcgtgg tctgcaaatt
     7981 gatacaattg ctatcacggt aaaagaaggt aaaatcgtaa ttaacggttt taataaagta
     8041 gaagattctg ctctgacccg tgttaaatat tctttgactc ttggtgatta tgatggtgaa
     8101 aatacattta atttcattat caatatggca aatatgaaaa tgcaaccagg aaattataaa
     8161 cttctgcttt gggcaaaagg taaacaaggt gctgctaaat ttgaaggtga acacgcgaat
     8221 tatgtggtag ctcttgaagc tgattctacc cacgattttt aatagagggc ttcggccttt
     8281 ataatttaca ctaaaacttg aatgaggaaa ttatgattac tgtaaatgaa aaagaacaca
     8341 ttcttgaaca gaaatatcgt ccatctacta tcgatgaatg tattcttccc gcttttgata
     8401 aagaaacctt taaatctatt acaagtaaag gtaagattcc acatattatt cttcattctc
     8461 cttctccagg aacaggtaaa acaactgtag caaaagcatt atgtcatgat gtaaatgctg
     8521 atatgatgtt tgtgaatggg tcagattgta aaattgattt cgttcgtggt cctttgacta
     8581 attttgccag cgccgcttca tttgatggtc gtcaaaaagt aatcgttatt gatgaatttg
     8641 accgttcagg gttagcagag tctcagcgac atcttcgttc ctttatggaa gcttatagtt
     8701 caaactgtag tattattatt actgctaata atattgatgg tattattaaa ccgcttcagt
     8761 cacgctgccg agttattaca ttcggtcaac caactgatga agataaaatt gaaatgatga
     8821 agcagatgat tcgtcgattg actgaaatct gcaagcatga aggaattgct atagctgata
     8881 tgaaagttgt agcagctttg gttaaaaaga attttcctga ttttcgtaaa actattggcg
     8941 agctcgatag ttattcgtct aaaggtgttt tggatgctgg tattttatca ctggttacta
     9001 acgatcgtgg tgctattgat gatgttcttg agtctctcaa aaataaagat gttaaacaac
     9061 tcagagcttt agcaccaaaa tatgcggctg attattcgtg gttcgtgggt aaacttgccg
     9121 aagaaatcta ttcacgtgta actccacaaa gtattattcg tatgtacgaa attgtcggcg
     9181 aaaataatca gtatcatggt attgcagcta atactgaatt gcatttagct tatcttttca
     9241 ttcaattagc atgcgaaatg cagtggaagt gatatgagct tatttaaaga tgatattcaa
     9301 ttaaacgagc atcaagttgc ttggtattca aaagattgga cagctgtcca atccgctgct
     9361 gattctttta aagaaaaagc agaaaatgaa ttttttgaaa taattggagc tattaataat
     9421 aaaactaaat gctctattgc tcaaaaagat tattcaaaat tcatggttga aaatgcatta
     9481 tcacaatttc cagagtgtat gccagctgta tatgctatga atttaattgg atcaggctta
     9541 agtgatgaag ctcattttaa ttatctaatg gctgcagttc ctcgtggtaa aagatatggt
     9601 aaatgggcaa aactggttga agattccacc gaagtattga ttattaagtt acttgctaag
     9661 cggtatcaag ttaatacaaa tgatgcaatt aactataaat caattcttac taaaaatgga
     9721 aaacttcctt tagtattaaa agaactaaaa ggtttagtca cggatgattt tttgaaagaa
     9781 gtgactaaga acgtaaaaga acagaaacaa ctcaaaaaac tagcattgga atggtaaaat
     9841 gattgaaatt actcttaaaa aacctgaaga ttttctgaaa gtaaaagaaa ctttgactcg
     9901 tatgggaatt gctaataata aagataaagt tctatatcag tcctgtcata ttcttcagaa
     9961 aaaaggacta tactatatcg ttcattttaa agaaatgctt cgtatggatg gtcgtcaagt
    10021 tgaaatgaca gaagaagatg aagttcgtcg tgattcgatt gcatggctat tagaagattg
    10081 gggactaatt gaaatcgttc ctggtcaaag aacttttatg aaagatttaa ctaataactt
    10141 ccgagttatt tcttttaaac aaaaacatga atggaagctc gttcctaaat atacgattgg
    10201 taattaagca aggggcttcg gccccttatt tggagtataa tatatcaaga gcctaataac
    10261 tcgggctata aactaaggaa tatctatgaa agaattttat atctctattg aaacagtcgg
    10321 aaataacatt gttgaacgtt atattgatga aaatggaaag gaacgtaccc gtgaagtaga
    10381 atatcttcca actatgttta ggcattgtaa ggaagagtca aaatacaaag acatctatgg
    10441 taaaaactgc gctcctcaaa aatttccatc aatgaaagat gctcgagatt ggatgaagcg
    10501 aatggaagac atcggtctcg aagctctcgg tatgaacgat tttaaactcg cttatataag
    10561 tgatacatat ggttcagaaa ttgtttatga ccgaaaattt gttcgtgtag ctaactgtga
    10621 cattgaggtt actggtgata aatttcctga cccaatgaaa gcagaatatg aaattgatgc
    10681 tatcactcat tacgattcaa ttgacgatcg tttttatgtt ttcgaccttt tgaattcaat
    10741 gtacggttca gtatcaaaat gggatgcaaa gttagctgct aagcttgact gtgaaggtgg
    10801 tgatgaagtt cctcaagaaa ttcttgaccg agtaatttat atgccattcg ataatgagcg
    10861 tgatatgctc atggaatata tcaatctttg ggaacagaaa cgacctgcta tttttactgg
    10921 ttggaatatt gaggggtttg acgttccgta tatcatgaat cgtgttaaaa tgattctggg
    10981 tgaacgtagt atgaaacgtt tctctccaat cggtcgggta aaatctaaac taattcaaaa
    11041 tatgtacggt agcaaagaaa tttattctat tgatggcgta tctattcttg attatttaga
    11101 tttgtacaag aaattcgctt ttactaattt gccgtcattc tctttggaat cagttgctca
    11161 acatgaaacc aaaaaaggta aattaccata cgacggtcct attaataaac ttcgtgagac
    11221 taatcatcaa cgatacatta gttataacat cattgacgta gaatcagttc aagcaatcga
    11281 taaaattcgt gggtttatcg atctagtttt aagtatgtct tattacgcta aaatgccttt
    11341 ttctggtgta atgagtccta ttaaaacttg ggatgctatt atttttaact cattgaaagg
    11401 tgaacataag gttattcctc aacaaggttc gcacgttaaa cagagttttc cgggtgcatt
    11461 tgtgtttgaa cctaaaccaa ttgcacgtcg atacattatg agttttgact tgacgtctct
    11521 gtatccgagc attattcgcc aggttaacat tagtcctgaa actattcgtg gtcagtttaa
    11581 agttcatcca attcatgaat atatcgcagg aacagctcct aaaccgagtg atgaatattc
    11641 ttgttctccg aatggatgga tgtatgataa acatcaagaa ggtatcattc caaaggaaat
    11701 cgctaaagta tttttccagc gtaaagactg gaaaaagaaa atgttcgctg aagaaatgaa
    11761 tgccgaagct attaaaaaga ttattatgaa aggcgcaggg tcttgttcaa ctaaaccaga
    11821 agttgaacga tatgttaagt tcagtgatga tttcttaaat gaactatcga attacaccga
    11881 atctgttctc aatagtctga ttgaagaatg tgaaaaagca gctacacttg ctaatacaaa
    11941 tcagctgaac cgtaaaattc tcattaacag tctttatggt gctcttggta atattcattt
    12001 ccgttactat gatttgcgaa atgctactgc tatcacaatt ttcggccaag tcggtattca
    12061 gtggattgct cgtaaaatta atgaatatct gaataaagta tgcggaacta atgatgaaga
    12121 tttcattgca gcaggtgata ctgattcggt atatgtttgc gtagataaag ttattgaaaa
    12181 agttggtctt gaccgattca aagagcagaa cgatttggtt gaattcatga atcagttcgg
    12241 taagaaaaag atggaaccta tgattgatgt tgcatatcgt gagttatgtg attatatgaa
    12301 taaccgcgag catctgatgc atatggaccg tgaagctatt tcttgccctc cgcttggttc
    12361 aaagggcgtt ggtggatttt ggaaagcgaa aaagcgttat gctctgaacg tttatgatat
    12421 ggaagataag cgatttgctg aaccgcatct aaaaatcatg ggtatggaaa ctcagcagag
    12481 ttcaacacca aaagcagtgc aagaagctct cgaagaaagt attcgtcgta ttcttcagga
    12541 aggtgaagag tctgtccaag aatactacaa gaacttcgag aaagaatatc gtcaacttga
    12601 ctataaagtt attgctgaag taaaaactgc gaacgatata gcgaaatatg atgataaagg
    12661 ttggccagga tttaaatgcc cgttccatat tcgtggtgtg ctaacttatc gtcgagctgt
    12721 tagcggttta ggtgtagctc caattttgga tggaaataaa gtaatggttc ttccattacg
    12781 tgaaggaaat ccatttggtg acaagtgcat tgcttggcca tcgggtacag aacttccaaa
    12841 agaaattcgt tctgatgtgc tatcttggat tgaccactca actttgttcc aaaaatcgtt
    12901 tgttaaaccg cttgcgggta tgtgtgaatc ggctggcatg gactatgaag aaaaagcttc
    12961 gttagacttc ctgtttggct gatagaataa atctaggacc tccaggtccc tttttcatac
    13021 aagtaatata aatctatact tatgaaaaag agatgattct gg
//
LOCUS       PT6REGA       462 bp ss-mRNA            PHG       04-AUG-1990
DEFINITION  Bacteriophage T6 translational repressor protein (regA), complete
            cds.
ACCESSION   M27738
KEYWORDS    RNA binding protein; translational repressor.
SOURCE      Bacteriophage T6, cDNA to viral RNA.
  ORGANISM  Bacteriophage T6
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 462)
  AUTHORS   Miller,E.S. and Jozwik,C.E.
  TITLE     Sequence analysis of conserved regA and variable orf43.1 genes in
            T4-like bacteriophages
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by E.S. Miller, 08-SEP-1989.
            
            The T2 RegA protein coding sequence corresponds to positions
            9839-10207 of the T4 regA sequence contained in the entry with
            accession number M10160.  The complete sequence reported herein
            would correspond to positions 9804-10266.
            
            An RNA secondary structure is detected by primer extension
            reactions on mRNA templates at positions 407-424.  An RNA nuclease
            sensitive site is noted at position 431.
            
               Author address: E.S.Miller
               Dept. of Microbiology
               Box 7615
               North Carolina State University
               Raleigh, NC 27695
               E-mail: esm@cgc.ncsu.edu
FEATURES       from  to/span     description
    pept         36      404     translational repressor protein (regA)
    site        431      431     RNA endonuclease site
BASE COUNT      170 a     64 c     89 g    139 t
ORIGIN      
        1 gaaacaactc aaaaaactag cattggaatg gtaaaatgat tgaaattact cttaaaaaac
       61 ctgaagattt tctgaaagta aaagaaactt tgactcgtat gggaattgct aataataaag
      121 ataaagttct gtatcagtcc tgtcatattc ttcagaaaaa aggattatac tatatcgttc
      181 attttaaaga aatgcttcgt atggatggtc gccaagttga aatgacagaa gaagatgaag
      241 ttcgtcgtga ttcaatagca tggttgttag aagattgggg actgattgaa atcgtacctg
      301 gtcaaagaac ttttatgaaa gatttaacta ataacttccg agttatttct tttaaacaaa
      361 aacatgaatg gaaactcgtt cctaaatata cgattggtaa ttaagcaagg ggcttcggcc
      421 ccttatttgg agtataatat atcaagagcc taataactcg gg
//
LOCUS       RB18REGA      674 bp ds-DNA             PHG       04-AUG-1990
DEFINITION  Bacteriophage RB18 translational repressor protein (regA) and
            Orf43.1, complete cds.
ACCESSION   M33533 M27739
KEYWORDS    RNA binding protein; translational repressor.
SOURCE      Bacteriophage RB18, cDNA to viral mRNA.
  ORGANISM  Bacteriophage RB18
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 674)
  AUTHORS   Miller,E.S. and Jozwik,C.E.
  TITLE     Sequence analysis of conserved regA and variable orf43.1 genes in
            T4-like bacteriophages
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by E.S.Miller, 03-APR-1990.
            
            The T2 RegA protein coding sequence corresponds to positions
            9839-10207 of the T4 regA sequence contained in the entry with
            accession number M10160.  The complete sequence reported herein
            would correspond to positions 9815-10255.
            
               Author address: E.S.Miller
               Dept. of Microbiology
               Box 7615
               North Carolina State University
               Raleigh, NC 27695
               E-mail: esm@cgc.ncsu.edu
FEATURES       from  to/span     description
    pept         25      393     translational repressor protein (regA)
    pept        395      616     Orf43.1 peptide
    site        643      643     RegB endonuclease site
BASE COUNT      251 a    106 c    115 g    202 t
ORIGIN      
        1 aaaaactagc attggaatgg taaaatgatt gaaattactc ttaaaaaacc tgaagatttt
       61 ctgaaagtaa aagaaacttt gactcgtatg ggaattgcta ataataaaga taaagttctg
      121 tatcagtcct gtcatattct tcagaaaaaa ggactatact atatcgttca ttttaaagaa
      181 atgcttcgta tggatggtcg ccaggttgaa atgacagaag aagatgaagt tcgtcgtgat
      241 tcgattgcat ggctattaga agattgggga ctgattgaaa tcgtccctgg tcaaagaact
      301 tttatgaaag atttaactaa taacttccga gttatttctt ttaaacaaaa acatgaatgg
      361 aaactcgttc ctaaatatac gattggtaat taatatgact gctataactc cacaagaata
      421 catggcgtct cttaaagaaa aatataatct ttctgcaaca gaaacacttt ttgatttacc
      481 agaaaacctc caactaaaat ttcaggtaga atttcaaaaa ttaattcatc cagaacaaaa
      541 acactttact gcagtcgtta agtcaattaa tgcagatgga atgacaattt ttcaccgaca
      601 aatagtacta atttaagcaa ggggcttcgg ccccttattt ggagtataat atatcaagag
      661 cctaataact cggg
//
LOCUS       RB51REGA      700 bp ss-mRNA            PHG       04-AUG-1990
DEFINITION  Bacteriophage RB51 translational repressor protein (regA) and
            Orf43.1, complete cds.
ACCESSION   M33532
KEYWORDS    RNA binding protein; translational repressor.
SOURCE      Bacteriophage RB51, cDNA to viral mRNA.
  ORGANISM  Bacteriophage RB51
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 700)
  AUTHORS   Miller,E.S. and Jozwik,C.E.
  TITLE     Sequence analysis of conserved regA and variable orf43.1 genes in
            T4-like bacteriophages
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by E.S.Miller, 03-APR-1990.
            
            The T2 RegA protein coding sequence corresponds to positions
            9839-10207 of the T4 regA sequence contained in the entry with
            accession number M10160.  The complete sequence reported herein
            would correspond to positions 9815-10255.
            
               Author address: E.S.Miller
               Dept. of Microbiology
               Box 7615
               North Carolina State University
               Raleigh, NC 27695
               E-mail: esm@cgc.ncsu.edu
FEATURES       from  to/span     description
    pept         25      393     translational repressor protein (regA)
    pept        404      640     Orf43.1 peptide
    site        671      671     RegB endonuclease site
BASE COUNT      255 a    114 c    125 g    206 t
ORIGIN      
        1 aaaaactagc attggaatgg taaaatgatt gaaattactc ttaaaaaacc tgaagatttt
       61 ctgaaagtaa aagaaacttt gactcgtatg ggaattgcta ataataaaga taaagttctg
      121 tatcagtcct gtcatattct tcagaaaaaa ggactatact atatcgttca ttttaaagaa
      181 atgcttcgta tggatggccg tcaagttgaa atgacagaag aagatgaagt tcgtcgtgat
      241 tcgattgcat ggctgttaga agattgggga ctgattgaaa tcgttcctgg tcaaagaact
      301 tttatgaaag atttaactaa taacttccga gttatttctt ttaaacaaaa acatgaatgg
      361 aaactcgttc ctaaatatac gattggtaat taaggtaaat tatatgacac aatcagagat
      421 tcagtttacc gaagcttttc taatattcga aaatctaaaa caaacatatg gtctgtcttc
      481 tacccaactg gtttttgacc tcccggaaga taaacaaata gaattccaaa aagaattcca
      541 acgattagtt tacccacgcc aacgcgcatt tcatgcgatt atcaaaacca caaataaaga
      601 tggcaagtcc gttatatcgc gttgtacaat agaaatttaa agcaaagggg cttcggcccc
      661 ttatttggag tataatatat caagagccta ataactcggg
//
LOCUS       HUMQBPC1      235 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein gene, 5' flank
            with an LTR-like sequence.
ACCESSION   M35760 J05438
KEYWORDS    ubiquinone-binding protein.
SEGMENT     1 of 6
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 235)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Common protein binding sites in the 5'-flanking regions of human
            genes for cytochrome c1 and for ubiquinone-binding protein
  JOURNAL   J. Biol. Chem. 265, 8159-8163 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by H.Suzuki, 08-AUG-1989.
FEATURES       from  to/span     description
    site         26       33     Mt3 element
    site        103      110     Mt1 element
    site        114      122     Mt4 element
BASE COUNT       52 a     59 c     56 g     68 t
ORIGIN      
        1 tagccaaccg ggttcggatt gtgcagtcca actccagcca atggagtcag gacacagtag
       61 cagggacaag ctgcgttaga cataaaaacc tctgctttcc tttgtttagg gtgctctcgt
      121 ggcaaccaga cttaccagga gctctattct gcaaaagtaa atttgccttg ctgagagacc
      181 ctttgtcctt tggctcagtg ttggttcttc tttgcagcac cgagcatttg tttcc
//
LOCUS       HUMQBPC2     1567 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human ubiquinone-binding protein (QP) gene, exon 1.
ACCESSION   M35761 M26705 J05438
KEYWORDS    ubiquinone-binding protein.
SEGMENT     2 of 6
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1567)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Common protein-binding sites in the 5'-flanking regions of human
            genes for cytochrome c-1 and ubiquinone-binding protein
  JOURNAL   J. Biol. Chem. 265, 8159-8163 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 888 to 1232)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by H.Suzuki, 08-AUG-1989.
FEATURES       from  to/span     description
    pept       1151  +  1169     ubiquinone-binding protein (QP), exon 1
    IVS        1170  >  1567     ubiquinone-binding protein intron A
    site        425      432     Mt3 element
    site        525      533     Mt1 element
    site        559      567     Mt4 element
BASE COUNT      442 a    339 c    339 g    447 t
ORIGIN      
        1 cttcctgatt agtagtgtag cagtatagca tatgacccta gaattaagag ccccaaattc
       61 gatctggatt ttcagattct ggaagaagaa cctggactct tctagtcatg tgctcttgga
      121 caagttattc aacctttctg agcctcagct ttctcctctc tacaatgaga taattacagt
      181 acttttagaa ttaaaagata gtttagggga gtgagagatc acctccaacc ctccctcccc
      241 tttgctctta tgtgggctct ttggtgaatc taggaaccaa attgacacat ggcagattaa
      301 caagagaaaa gcatactaat tttactagtt ttacatgtac atagggatcc gcacaaaaga
      361 gtgaagtggc caatgtaaga tgcctttata ctttttacac gaactacaaa tcagaagaaa
      421 ggggatctgg ctaggggcag taaatttctc gcggagtcat taggagaaaa gtgtgtgtgt
      481 tggggtgtaa acttagtgga agataacggt tacttcggaa agtatattca ggtccaaagc
      541 agcccccagt ttcgtctctg gtgataggtg ctattttctc cctctggtaa gggacgggta
      601 ctccagagga atcctggagg aagctttatg cgcttgctgc atgcaggaag agataggtca
      661 tcaaaccttt tctgaaacta caatttctcc aatattttca actcgaaata atcaatgtac
      721 caattccata tatttttggg atggcacatc catcagtcct tcaatagaaa gtaagacgat
      781 tgcccttaaa acatttagct tcctgcctgg cacaaagagc tgaattaata ctgtccacgc
      841 tcccatagct attaacagca aataacggga acccagcttt aagccttggt ctcctaattc
      901 ctgacctgtg ttcttcccat cacatcacaa ccaatacttc ttagaatgta gcacgcaaaa
      961 tgccccgcct actccattgg cctcaccctg gccgagaggt gcttcagcag gtgctgaacg
     1021 gcagtcgtca gaactgcgcc tgcgcaagcg gcctttctct gttcgcgatg tgacgtaacg
     1081 cgcctgcgga ctgggcccag cttgtcctct atgacttacc cagaaggcaa cgcttctctt
     1141 tctggtcaaa atggctggta agcaggccgg taagtaactg ggggtcttct tggccggcat
     1201 cgtcgcgctt caccgtttat ttttgcctgc aggtcaacgg atcacaggaa ataattgtaa
     1261 acttttcatt tcaagagagc tggagttgta gggaaaagac agtgagattt cagcatagac
     1321 tggcaggtct aaatgatcct catcctctaa aatttttact cgatattaca gttatttcat
     1381 ttggccacat tctttaaaaa tgtatgtatt ttggggagga ttttgatacc attagtcatt
     1441 taaacattaa tatggattcg taaaagaagt ccatttacct cctcctaagc aactaagttc
     1501 ccccaaggga atggatattg aaaaaggatg agaaataggt aattttaaaa ctgtttgggg
     1561 aacaggg
//
LOCUS       HUMQBPC3      236 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein gene, intron A
            with an LTR-like sequence.
ACCESSION   M33945 J05438
KEYWORDS    ubiquinone-binding protein.
SEGMENT     3 of 6
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 236)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Common protein binding sites in the 5'-flanking regions of human
            genes for cytochrome c1 and for ubiquinone-binding protein
  JOURNAL   J. Biol. Chem. 265, 8159-8163 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by H.Suzuki, 08-AUG-1989.
FEATURES       from  to/span     description
    site         26       33     Mt3 element
    site        103      110     Mt1 element
    site        114      122     Mt4 element
BASE COUNT       57 a     55 c     54 g     70 t
ORIGIN      
        1 aaccgggttc agcttagatt gtacggtcca actctagtca atggagtcag gatacagtag
       61 cagggacaag ctgcgttaga cataaaaacc tctgctttcc tttgttcggg gtgctctcgt
      121 ggcaaccgaa cctatgagga gcaaccttat gcaaaagtaa atttgccttg ctgagagacc
      181 ctttgtcctt tgtctcagtg ctagttcttc tttgcagcac caagcatttg ttttca
//
LOCUS       HUMQBPC4      114 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein (QP) gene, exon 2.
ACCESSION   M26706
KEYWORDS    ubiquinone-binding protein cytochrome bc1 complex subunit QP.
SEGMENT     4 of 6
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 114)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    23  +    94     ubiquinone-binding protein (QP), exon 2
    IVS     <     1       22     ubiquinone-binding protein intron A
    IVS          95  >   114     ubiquinone-binding protein intron B
BASE COUNT       34 a     17 c     29 g     34 t
ORIGIN      
        1 ttaacagtgt gacaattatc agtttcagca tcaggcaagt ggctggatgg tattcgaaaa
       61 tggtattaca atgctgcagg attcaataaa ctgggtaagc agctgtgctg ttta
//
LOCUS       HUMQBPC5      208 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein (QP) gene, exon 3.
ACCESSION   M26707
KEYWORDS    QP gene; ubiquinone-binding protein.
SEGMENT     5 of 6
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 208)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    21  +   188     ubiquinone-binding protein (QP), exon 3
    IVS     <     1       20     ubiquinone-binding protein intron B
    IVS         189  >   208     ubiquinone-binding protein intron C
BASE COUNT       73 a     36 c     48 g     51 t
ORIGIN      
        1 aactttgtca tctatcatag ggttaatgcg agatgataca atatacgagg atgaagatgt
       61 aaaagaagcc ataagaagac ttcctgagaa cctttataat gacaggatgt ttcgcattaa
      121 gagggcactg gacctgaact tgaagcatca gatcttgcct aaagagcagt ggaccaaata
      181 tgaagagggt agcacagctt attactcc
//
LOCUS       HUMQBPC6      233 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein (QP) gene, exon 4.
ACCESSION   M26730
KEYWORDS    QP gene; ubiquinone-binding protein.
SEGMENT     6 of 6
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 233)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    21       97     ubiquinone-binding protein (QP), exon 4
    IVS     <     1       20     ubiquinone-binding protein intron C
BASE COUNT       82 a     23 c     49 g     79 t
ORIGIN      
        1 atatttcatt cattctttag aaaatttcta ccttgaaccg tatctgaaag aggttattcg
       61 ggaaagaaaa gaaagagaag aatgggcaaa gaagtaatca tgtagttgaa gtctgtggat
      121 gcagctgtta tgaagatggt taaacttgaa acaaacaatt ttaagaatta tttggtctga
      181 agatgtttta ctttaaataa atgtctattg taatggctgg agtttttgaa ttc
//
LOCUS       HUMQBPCA      498 bp ss-mRNA            PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein mRNA, complete cds.
ACCESSION   M26700
KEYWORDS    ubiquinone-binding protein.
SOURCE      Human fibroblast SV40 transformed cell line GM637, cDNA to mRNA,
            clones 2 and 6.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 498)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         53      388     ubiquinone-binding protein
BASE COUNT      172 a     72 c    122 g    132 t
ORIGIN      
        1 agcttgtcct ctatgactta cccagaaggc aacgcttctc tttctggtca aaatggctgg
       61 taagcagggc gtttcagcat caggcaagtg gctggatggt attcgaaaat ggtattacaa
      121 tgctgcagga ttcaataaac tggggttaat gcgagatgat acaatatacg aggatgaaga
      181 tgtaaaagaa gccataagaa gacttcctga gaacctttat aatgacagga tgtttcgcat
      241 taagagggca ctggacctga acttgaagca tcagatcttg cctaaagagc agtggaccaa
      301 atatgaagag gaaaatttct accttgaacc gtatctgaaa gaggttattc gggaaagaaa
      361 agaaagagaa gaatgggcaa agaagtaatc atgtagttga agtctgtgga tgcaggtgtt
      421 atgaagatgg ttaaacttga aacaaacaat tttaagaatt atttggtctg aagatgtttt
      481 actttaaata aatgtcta
//
LOCUS       HUMQBPIC       75 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein (HQPI) gene,
            exon 2.
ACCESSION   M37387
KEYWORDS    ubiquinone-binding protein.
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     3  /    75     ubiquinone-binding protein (HQP1), exon 2 (AA
                                 at 5)
BASE COUNT       22 a     11 c     21 g     21 t
ORIGIN      
        1 agtttcagca tcggcaagtg gctggatggt attcgaaaat ggtattacaa tgctgcagga
       61 ttcaataaac tgggt
//
LOCUS       HUMQBPIIC     172 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein (HQPII) gene,
            complete cds.
ACCESSION   M26701
KEYWORDS    ubiquinone-binding protein.
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 172)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
COMMENT     Intron A starts after nucleotide 98.
FEATURES       from  to/span     description
    pept         80  /   170     ubiquinone-binding protein (HQPII), exon 1
    IVS         171  >   172     HQPII intron A
    site         98       99     site of 2.3 kb intron
BASE COUNT       42 a     40 c     47 g     43 t
ORIGIN      
        1 gactgaacgc gcctgcggac tgggcccagc ttgtcctcta tgacttaccc agaaggcaac
       61 gcttctcttt ctggtcaaaa tggctggtaa gcaggccgtt tcagcatcag gcaagtggct
      121 ggatggtatt cgaaaatggt attacaatgc tgcaggattc aataaactgg gt
//
LOCUS       HUMQBPIIIC    100 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein (HQPIII) gene,
            complete cds.
ACCESSION   M26702
KEYWORDS    ubiquinone-binding protein.
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         80  /    98     ubiquinone-binding protein (HQPIII), exon 1
    IVS          99  >    99     HQPIII intron
BASE COUNT       20 a     30 c     27 g     23 t
ORIGIN      
        1 gactgaacgc gcctgcggac tgggcccagc ttgtcctcta tgacttaccc acaaggcaac
       61 gcttctcttt ctggtcaaaa tggctggtaa gcaggccggt
//
LOCUS       HUMQBPIVC     465 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein (HQPIV) pseudogene,
            complete cds.
ACCESSION   M26703
KEYWORDS    pseudogene; ubiquinone-binding protein.
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 465)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps      80      392     pseudo-ubiquinone-binding protein (HQPIV),
                                 exon 1
BASE COUNT      180 a     66 c    110 g    109 t
ORIGIN      
        1 aaaataaaat aaaataaatt agataaataa ataaataaaa agagagaggg agaaggcaac
       61 atttcacgtt tgggtcaaga tggctggcag gctggctacc gcagcatcag acaggtagct
      121 ggatggtatt tgaaaatggt atgacaatgc tgcaggattc gagaaactaa agtaaatgtg
      181 agatgagaca atatatgaag atgaagatgt aaaggaagcc ataagacggc tttctaacgg
      241 caggatgttt cgcattagaa cactggacct atgagacagc agatcttgcc ttaatagcac
      301 ggaccaaaca tgaagaggat aaattctacc atgagccata tctgaaaccg cttatttggg
      361 aaagagaaga atgagaatgg tcaaagaagt catcacatat ttgaagttcg tggatgtagc
      421 tgcctcaagt atttttatga aaatggttaa acctgaaaca tataa
//
LOCUS       HUMQBPVC      357 bp ds-DNA             PRI       04-AUG-1990
DEFINITION  Human mitochondrial ubiquinone-binding protein (HQPV) pseudogene,
            complete cds.
ACCESSION   M26704
KEYWORDS    pseudogene; ubiquinone-binding protein.
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 357)
  AUTHORS   Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T.
  TITLE     Isolation of a single nuclear gene encoding human
            ubiquinone-binding protein in complex III of mitochondrial
            respiratory chain
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 371-378 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps      71      169     pseudo-ubiquinone-binding protein (HQPV)
BASE COUNT      159 a     51 c     64 g     83 t
ORIGIN      
        1 catttttagt tcccaagagc cgggcataaa caagagacct gagggaatcc tcctctctta
       61 aaaaacaaag cagtggacca actatgaaga ggaaaatttc tgccttgaac tgtatctgaa
      121 agatgttatt tgggaaagaa aagaaagaga agaatgggca aagaagtaat cacatggttg
      181 aagtctgtgg atgcaactgt tatgaagatg gttaaacttg aaataaacaa ttttaagaat
      241 tatttggtct gcagatattt tactttaaat aaatgtctat tgtaatggta aaaaaaacaa
      301 aaacaaaaac aaaacaaaac aaaaacaaaa acaaaacaaa accaaaaaaa cctcagg
//

GenBank-Updates@genbank.bio.net (08/07/90)

LOCUS       RATKV1AA     3291 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Rat potassium channel-Kv1 gene, complete cds.
ACCESSION   M27158 M37145
KEYWORDS    potassium channel-Kv1; transmembrane protein.
SOURCE      Rat brain DNA and cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 477 to 3291)
  AUTHORS   Swanson,R., Marshall,J., Smith,J., Williams,J., Boyle,M.B.,
            Folander,K., Luneau,C.J., Antanavage,J., Oliva,C., Buhrow,S.A.,
            Bennett,C., Stein,R.B. and Kaczmarek,L.M.
  TITLE     Cloning and expression of cDNA and genomic clones encoding three
            delayed rectifier potassium channels in rat brain
  JOURNAL   Neuron 4, 929-939 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3291)
  AUTHORS   Luneau,C.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Computer readable copy of sequence [1],[2] kindly submitted by
            R.J.Swanson, 16-AUG-1989, for release after publication.
FEATURES       from  to/span     description
    pept        761     2569     potassium channel-Kv1
BASE COUNT      741 a    905 c    897 g    748 t
ORIGIN      
        1 acaatacgat agcaacatga cagttatgaa gtagcaacaa aaaataaatt tagggtgtgt
       61 gtgtgggggg atggtcaaca caacatgagg aactgtatta aaggtcgaga agcactgctc
      121 taaaagaaaa acataggaag gactctcttc ttcaatctct tacactggaa ctcagctaaa
      181 agcctttccc ttccatctaa agcagttccc accctctggc gtcccctggc tttcagtgca
      241 ttcccaggtg atctgagatc ctgaagctag ctattctcat cctgttgccc tgggagtttc
      301 agcactcctg gtacaacctg ttcctcgctc ctcctgctct tttcctcctc cccttcctgt
      361 cttgcggcag gggccccagc tgaaccctcc agagagacac tctccactcc gtccccaaag
      421 gagaaacgcc cgggggagga gcagaggagg gagccacaga gggcggaggc cggggagctc
      481 agccagagag gggctgctgg aggttgcatc tgctggaagg aggtttttcc cgctgcctgg
      541 taaccagctg ccagaacact gagagggaga gagagaggca gggagcaggg gcagcagctt
      601 cttgacgtca ggaccagcca aggatcgggc cagctacccc ggccagccta gagtcagcgg
      661 ggccctcggc tggagaggag acctacgcga ggcttggagc gtgagttggg ggtgtgggaa
      721 ccggttgtct ggggcgcagc atgccccctg cccccggacc atggagatct ccctggtgcc
      781 cctggagaat ggcagtgcca tgaccctcag aggaggaggg gaggcagggg caagctgtgt
      841 gcagaccccc aggggagagt gtgggtgccc tccgacgtct ggactcaata atcagtccaa
      901 agaaacactg ttgagggggc gcacaacgct cgaggatgcg aaccagggtg gacggccttt
      961 gccccctatg gctcaggagc tgccacaacc tagaaggcta tctgctgagg atgaggaggg
     1021 agaaggcgac cctggcctgg gcacagtgga ggaggaccag gctcctcagg atgcagggtc
     1081 actccatcac cagcgagtcc tcataaacat ctccgggttg cgtttcgaga cgcagctggg
     1141 caccctggca cagtttccca acaccctcct gggggaccca gccaagcgcc tgcactactt
     1201 cgaccccttg agaaatgaat acttcttcga ccgcaaccgg cccagcttcg atggcatttt
     1261 gtactactac cagtctgggg gccgcctgcg caggcccgtc aatgtctccc tggatgtgtt
     1321 tgcagatgag atccgctttt accagctggg ggacgaggcc atggagcgct tccgggagga
     1381 tgagggcttc atcaaggaag aggagaagcc cctgccccgc aacgagttcc agcgccaggt
     1441 atggcttatc ttcgaatacc cagaaagctc tgggtccgca agagccatcg ccatagtgtc
     1501 ggtcctggtc attctcatct ctatcatcac cttctgcctg gagactctgc ctgagttcag
     1561 ggatgagcgg gagctgctac gccatccccc agtgccgccc cagcccccag cccctgcccc
     1621 tgggatcaat ggcagcgtct ctggagcact ttcctctggc cctacggtgg ctccactctt
     1681 gcctaggaca ctggccgatc cattcttcat cgtggagacc acatgtgtga tctggttcac
     1741 ttttgagttg ctcgtgcgct tctttgcctg ccccagcaag gcagaattct ctcggaatat
     1801 catgaacatc atcgatgtcg tggccatctt cccctacttt atcaccctgg gcaccgagct
     1861 ggcagagcaa caaccagggg gagggggtca gaatgggcag caggccatgt ccctagccat
     1921 cctcagggtg atccgcctgg tgcgggtgtt tcggatcttc aagctctccc gccactccaa
     1981 gggactgcag atcctgggta agaccttgca agcatccatg cgggaactcg ggctactcat
     2041 cttcttcctc ttcattggag tcatcctctt ctccagcgct gtctacttcg cagaggcaga
     2101 caatcacggg tcccatttct ctagtatccc agatgccttc tggtgggcag tagtcactat
     2161 gaccactgta ggctatggag acatgagacc catcactgta gggggcaaga tcgtgggttc
     2221 actgtgcgcc atagctgggg tcctcaccat tgccctgcct gtccccgtca tcgtctccaa
     2281 ctttaattac ttctatcatc gggagacaga ccacgaggag caggctgccc tgaaggaaga
     2341 acaaggcaac cagaggcggg agtctgggct ggacacaggg ggtcaacgga aggtcagctg
     2401 cagcaaggcc tccttttgca agactggggg ttccctggag agttctgaca gtatcagaag
     2461 gggtagctgt cctctagaaa agtgtcacct caaggccaag agcaacgtgg acttgcggag
     2521 gtccctgtat gccctctgtc tggacactag ccgtgaaaca gatttgtaaa gagagatcca
     2581 ggcagactgg caccagtgaa gctggccaca ggggtgcccc ttgagcctgg gcatctgctt
     2641 tacaccaccg agtatttaag cccacctggt cacccaccga tgtctgcctt tctttcaccc
     2701 ttctcacctt cctgccctct aacttcccca ttttaactcc tctttccata acccccaggg
     2761 tcgcctattt ttaacaagta tgacattcca tgacgcagga gccgttgaag tgctgagccc
     2821 tcactggaag atggatgcat tcatagccag ttttctacac ccagcagagg gataatcaaa
     2881 caaatttgac tttaaaaagc ttagatccca agagaatcca tagcattccc tacccgtgtt
     2941 ccaaatttac tttacatggg attacatttg tgtatgggca aatattattt ttatggccga
     3001 tgactgcatt ttgtacagca gtgcacatga agaatatttg ggtgtgctct taagatatac
     3061 tttgaattat gaaagtgttg ggggggtcac catttcctgg gacactcagg gacaaagacc
     3121 ctaaaagagt tgttcaggtt tctgtgttct caagcttctt attattctgg gatgtactat
     3181 tggtgcttct ggtctaggag agagaatgtt ctagaagacc agcaaatcta atcttgtgac
     3241 tgtcttcaca gaagcatttt aaggatgttg gaagaaggat ttggagaatt c
//
LOCUS       RATKV2AA     2098 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Rat potassium channel-Kv2 gene, partial cds.
ACCESSION   M27159 M37146
KEYWORDS    potassium channel-Kv1; transmembrane protein.
SOURCE      Rat brain DNA and cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 622 to 2098)
  AUTHORS   Swanson,R., Marshall,J., Smith,J., Williams,J., Boyle,M.B.,
            Folander,K., Luneau,C.J., Antanavage,J., Oliva,C., Buhrow,S.A.,
            Bennett,C., Stein,R.B. and Kaczmarek,L.M.
  TITLE     Cloning and expression of cDNA and genomic clones encoding three
            delayed rectifier potassium channels in rat brain
  JOURNAL   Neuron 4, 929-939 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2098)
  AUTHORS   Luneau,C.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Computer readable copy of sequence [1],[2] kindly submitted by
            R.J.Swanson, 16-AUG-1989, for release after publication.
FEATURES       from  to/span     description
    pept        865  >  2098     potassium channel-Kv2
BASE COUNT      358 a    588 c    698 g    453 t      1 others
ORIGIN      
        1 agtgtgcatc tctctctctc tctctccagc atccaggctt gctgcagccc ctagctgggt
       61 aaggggtgtg atgtgagagt ggggtgggag ggggcacagg gcggggcctg ccacgtcact
      121 tggagagtgt gttaaggagg aagggcagag ctgagagctg agcctgctgc tgctgttgct
      181 gctgctgctg ttgctgctgc ttgctgttgc tgctgctgct ggggctgctg cgggtgcggg
      241 caaggattga tgcggtgggg aggtgggtct ctcgctccgg ncccgggcag gcccagggcg
      301 gttccgaggc ctgtagaaca gctcctggga agaaggtggc ggctgcaatc gcgaccttgg
      361 ccagacctag ttcgtggtgg acgtagggcg gaggcggagg ccgagcccgg gcaggagtct
      421 ttggcgagcc agagggaggc gcatctggcg cttcggtacc agcggcagcc ttgtgagtct
      481 tgtgaggctg gagaagtgca gcaggacctg gtgagagcta gcaagtctgg agggcgcctg
      541 tgggtcccga gcccagtttt aggcaccacc gagagctaag ccaagcgtct tttcgggcag
      601 ccagtttcac cgcgcgacag tttgcgggtt ccaggcatct cagaaatctt gagcacggag
      661 gcgcggctac tgagagccag agccacatcc cagacctagc ctggcagaga gaccagctgc
      721 agggttcacc gacctaaccg ccaggtcaga gcacgggccc caccctaaag gagggcgcag
      781 ccggagctgg gaagccggtg ccgcgctccg gagctcgtgt cgtgggcgcc gtcctagtgg
      841 cggggagcgc accgccgagg tgacatgaga tcggagaaat ccctgacgct ggcggcgccg
      901 ggggaggtcc gtgggccgga gggggagcaa caggatgcgg gtgagttcca ggaggccgag
      961 ggcggcggcg gctgctgtag tagtgagagg ctggtgatca acatctctgg gctgcgctac
     1021 gagacgcagc tgcgcacctt gtcgctgttc cctgacacgc tgctaggaga ccctggccgc
     1081 agagtccgct tctttgaccc cttgaggaat gagtacttct ttgaccgcaa ccgacccagc
     1141 ttcgacgcta tcctttatta ctaccagtcg gggggtcgcc tgcgcaggcc ggttaacgtg
     1201 ccccttgaca tctttatgga agagattcgc ttctatcagt tgggagatga agccctggcg
     1261 gccttccggg aggatgaggg ttgcctgccc gaaggtggtg aggatgagaa gccactcccc
     1321 tcccagcctt tccagcgaca ggtctggctc ctctttgagt atccggagag ttctgggccc
     1381 gcccgaggca ttgccatcgt ctcagtgttg gtcatcctca tctccattgt catcttttgc
     1441 ctggagacct tgcctcagtt ccgtgcagat gggcgcggtg gaagcaacga ggggagtggg
     1501 acccgcatgt ccccggcctc cagggggagc cacgaggagg aagatgaaga cgaggattcc
     1561 tatgcatttc ctggtagcat tccctctggg gggttgggga ccggaggaac ttcttcattt
     1621 agtactctcg ggggttcctt cttcacagac cccttcttcc tggtggaaac tctgtgtatc
     1681 gtctggttca cctttgagct cctggtgcgc ttctctgcct gtcccagcaa ggcggccttc
     1741 tttcgcaata tcatgaacat cattgacttg gtggccatct tcccctactt tatcaccctg
     1801 ggcaccgagc tagtgcaacg tcacgagcag cagcctgtga gtggtggcag tggtcagaat
     1861 gggcagcagg ccatgtccct agccatcctc agggtgatcc gcctggtccg ggtgtttcgg
     1921 atcttcaagc tctcccgcca ctccaagggg ttgcagatcc tgggtaagac cttgcaagca
     1981 tccatgcggg agctcgggct actcatcttc ttcctcttca ttggagtcat cctcttctcc
     2041 agcgctgtct acttcgcaga ggcagatgac gttgactcgc tcttccctag catcccag
//
LOCUS       RATKV3AA     2264 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Rat potassium channel-Kv3 gene, complete cds.
ACCESSION   M31744
KEYWORDS    potassium channel-Kv3; transmembrane protein.
SOURCE      Rat brain DNA and cDNA to mRNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (sites)
  AUTHORS   Swanson,R., Marshall,J., Smith,J., Williams,J., Boyle,M.B.,
            Folander,K., Luneau,C.J., Antanavage,J., Oliva,C., Buhrow,S.A.,
            Bennett,C., Stein,R.B. and Kaczmarek,L.M.
  TITLE     Cloning and expression of cDNA and genomic clones encoding three
            delayed rectifier potassium channels in rat brain
  JOURNAL   Neuron 4, 929-939 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2264; for [1])
  AUTHORS   Luneau,C.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Computer readable copy of sequence [1],[2] kindly submitted by
            R.J.Swanson, 16-AUG-1989, for release after publication.
            
               [2] Author Address
               Merch, Sharp, and Dohme Research Labs
               W42-300
               West Point, PA  19486
FEATURES       from  to/span     description
    pept        508     2085     potassium channel-Kv3
BASE COUNT      436 a    684 c    695 g    449 t
ORIGIN      
        1 gggttcctcc aaggccagtc tggcagggta ggggttccgc ggggtctggg tagtggggtg
       61 caagtcgaag gcgacccagt gacaaaagaa gggcgaggaa caggggtggc aagcggaaag
      121 cggagccgcg cgcgcagtgg gcgggaaacc cgggggcggc ggcagagagg cgggcgcgcg
      181 cctcgcgggg aggggtgtgg aggagggagg aggaggcgtc tggacggtgc agccgggcgg
      241 gccgcgccag ccctgacaga gcctcgcggc agccgcagca acaggcggcg gggccgggct
      301 cgtggggcag aggcggtggt ggggcgggga gctccaggag gagcgactcg gagctggcca
      361 ccgccagcag ggaaggagag catcgccgcc tcccgcgctc ggccgccgca gccctccgcc
      421 cgccaccgcg cccaccctgc gctggacccc gcgggcggcg gcgcgcgcat cctgcagagc
      481 ccaggctacg ccgagctgcc gccagacatg accgtggtgc ccggggacca cctgctggag
      541 ccagaagcgg cgggaggcgg cggcggggac ccgcctcagg gaggctgtgt cagtggcggc
      601 ggctgcgacc gctacgaacc gctgccgccc gcgctgcccg ccgcgggcga gcaggattgc
      661 tgcggggagc gcgtggtcat caacatctcc gggctgcgct tcgagacaca gctcaagacc
      721 ctctgccagt tccctgagac gctgctaggc gaccccaagc ggcgcatgcg gtacttcgac
      781 ccgctccgca atgagtactt ctttgaccgc aacagaccca gcttcgacgc catcctctac
      841 tactaccagt ccgggggccg catccgccgg ccggtcaacg tgcccatcga catcttctcc
      901 gaggagatcc gcttctacca actgggtgag gaggctatgg agaagttccg tgaggacgag
      961 ggcttcctgc gggaggagga gcgacccctg ccccgccgtg acttccagcg ccaggtgtgg
     1021 ctgctcttcg aataccccga gagctccggg ccggcccggg gcattgccat cgtgtcagtg
     1081 ctggtcattc tcatctccat tgtcatcttc tgcttggaga cactacccga gtttcgcgac
     1141 gagaaggact atcccgcctc tccgtcgcag gacgtgtttg aggctgccaa caacagcacg
     1201 tcgggggcct cctctggagc ctccagcttc tcggacccct tcttcgtagt ggagaccctg
     1261 tgcatcatct ggttctcctt tgagctgctg gtgcgattct ttgcttgccc cagtaaagcc
     1321 accttctcca gaaatatcat gaacctgata gacattgtag ccatcatccc ttattttatt
     1381 actctgggca ctgagctggc tgagcgacag ggtaatgggc agcaggctat gtcactggcc
     1441 atcctgaggg tcatccgcct agtaagggtc ttccgcatct tcaagctctc ccgccattct
     1501 aaggggctgc agatcctggg acagacactg aaggcttcca tgcgagagct ggggctgctc
     1561 attttcttcc ttttcattgg ggtcatcctt ttctccagtg cagtctactt tgctgaggca
     1621 gacgaccctt cttcgggttt taacagtatc ccggatgcct tctggtgggc cgtggtaacc
     1681 atgacaactg tcggttatgg tgatatgcac ccagtgacca taggaggcaa gattgtgggc
     1741 tctctttgtg ccatcgcagg tgtcttgacc attgcattgc cggttcctgt gattgtttcc
     1801 aacttcaatt acttctacca ccgggagaca gaaggggaag agcaagccca gtacatgcac
     1861 gtgggaagct gccagcacct ctcctcttca gcagaggagc tccgaaaagc ccggagtaac
     1921 tccactctga gtaagtcgga gtatatggtg atcgaagagg ggggtatgaa ccacagcgcc
     1981 ttcccccaga cccccttcaa aacgggcaac tccactgcca cttgcaccac caacaataat
     2041 cccaactcct gtgtcaacat caaaaagata ttcactgatg tctaatagat gatacgattg
     2101 ccattctgtg cccagtattg tgtggaacat gcccccttgg tctgtgtatg cccttgattt
     2161 atacatttcc agaccattca tcaaggaaag tacatgaaga agtggaaaag cacacttcat
     2221 tctccctctc cctattgctt catactgaaa caggtgcctg gttt
//
LOCUS       RATKV3AB     2956 bp ss-mRNA            ROD       07-AUG-1990
DEFINITION  Rat mRNA for putative potassium channel subunit protein (RCK3).
ACCESSION   X16001
KEYWORDS    membrane protein; potassium channel protein.
SOURCE      Rattus rattus.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2956)
  AUTHORS   Pongs,O.
  TITLE     ;
  JOURNAL   Unpublished (1989) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 2956)
  AUTHORS   Stuehmer,W., Ruppersberg,J.P., Schoerter,K.H., Sakmann,B.,
            Stocker,M., Giese,K.P., Perschke,A., Baumann,A. and Pongs,O.
  TITLE     Molecular Basis of Functional Diversity of Voltage Gated Potassium
            Channels in Mammalian Brain
  JOURNAL   EMBO J. 8, 3235-3244 (1989)
  STANDARD  simple automatic
COMMENT     *source: developmental stage=adult;tissue=brain-cortex; *source:
            clone=RCK3;
            
            [1]  Author address Pongs O., Ruhr Universitaet Bochum, Fakultaet f
            Chemie , Lehrstuhl f Biochemie, Universitsstr 150, D-4630 Bochum, F
            R G.
            
            Submitted (31-JUL-1989) on tape to the EMBL Data Library
FEATURES       from  to/span     description
    pept          8     1585     put. RCK3 protein (AA 1-525)
BASE COUNT      733 a    774 c    690 g    759 t
ORIGIN      
        1 gccagacatg accgtggtgc ccggggacca cctgctggag ccagaagcgg cgggaggcgg
       61 cggcggggac ccgcctcagg gaggctgtgt cagtggcggc ggctgcgacc gctacgaacc
      121 gctgccgccc gcgctgcccg cggcgggcga gcaggattgc tgcggggagc gcgtggtcat
      181 caacatctcc gggctgcgct tcgagacaca gctcaagacc ctctgccagt tccctgagac
      241 gctgctaggc gaccccaagc ggcgcatgcg gtacttcgac ccgctccgca atgagtactt
      301 cttcgaccgc aacagaccca gcctcgacgc catcctctac tactaccagt ccgggggccg
      361 catccgccgg ccggtcaacg tgcccatcga catcttctcc gaggagatcc gcttctacca
      421 actgggtgag gaggctatgg agaagttccg tgaggacgag ggcttcctgc gggaggagga
      481 gcgacccctg ccccgccgtg acttccagcg ccaggtgtgg ctgctcttcg aataccccga
      541 gagctccggg ccggcccggg gcattgccat cgtgtcagtg ctggtcattc tcatctccat
      601 tgtcatcttc tgcttggaga cactacccga gtttcgcgac gagaaggact atcccgcctc
      661 tccgtcgcag gacgtgtttg aggctgccaa caacagcacg tcgggggcct cctctggagc
      721 ctccagcttc tcggacccct tcttcgtagt ggagaccctg tgcatcatct ggttctcctt
      781 tgagctgctg gtgcgattct ttgcttgccc cagtaaagcc accttctcca gaaatatcat
      841 gaacctgata gacattgtag ccatcatccc ttattttatt actctgggca ctgagctggc
      901 tgagcgacag ggtaatgggc agcaggctat gtcactggcc atcctgaggg tcatccgcct
      961 agtaagggtc ttccgcatct tcaagctctc ccgccattct aaggggctgc agatcctggg
     1021 acagacactg aaggcttcca tgcgagagct ggggctgctc attttcttcc ttttcattgg
     1081 ggtcatcctt ttctccagtg cagtctactt tgctgaggca gacgaccctt cttcgggttt
     1141 taacagtatc ccggatgcct tctggtgggc cgtggtaacc atgacaactg tcggttatgg
     1201 tgatatgcac ccagtgacca taggaggcaa gattgtgggc tctctttgtg ccatcgcagg
     1261 tgtcttgacc attgcattgc cggttcctgt gattgtttcc aacttcaatt acttctacca
     1321 ccgggagaca gaaggggaag agcaagccca gtacatgcac gtgggaagct gccagcacct
     1381 ctcctcttca gcagaggagc tccgaaaagc ccggagtaac tccactctga gtaagtcgga
     1441 gtatatggtg atcgaagagg ggggtatgaa ccacagcgcc ttcccccaga cccccttcaa
     1501 aacgggcaac tccactgcca cttgcaccac caacaataat cccaactcct gtgtcaacat
     1561 caaaaagata ttcactgatg tctaatagat gatacgattg ccattctgtg cccagtattg
     1621 tgtggaacat gcccccttgg tctgtgtatg cccttgattt atacatttcc agaccattca
     1681 tcaaggaaag tacatgaaga agtggaaaag cacacttcat tctccctctc cctattgctt
     1741 catactgaaa caggtgcctg gttttgcaag tgggctgcat tctctcagct ctttttctct
     1801 ctccctctct ctctccctgt ttcttaattt tgtgaacaac aaacttacat taagcttggt
     1861 ttcttgtgca agccctaaag ctacatcctg agaggaaatg aaactataga gtcagagtaa
     1921 ctgtgtgacc tcaaaaatca aaagatagtt gtttctttac taagtaaagg aggcagaact
     1981 taaatgatgc tcctgttcgg tggacttttc agcattattt cctgactaag tgtaactact
     2041 taacctctgg atgtgtgggt ggaaagtcta actagagtgt tgacgtgtaa accctctata
     2101 attttttttt ttttggactg gcctttcagt ttgatctccc ctctaagaat ttcaggttca
     2161 cataactttg aaccaaaagg gaaatgcccc aaatgtcctg atctgactat taactcttcg
     2221 gagtttgcaa agcattttga aaacattaga cacagattcc aatgaagttg tgagttcatg
     2281 cccaagtcaa tatctacgaa agtctagaaa ctcattttca gtgctgccag gagaaacaac
     2341 agaaattcct gatgcatctg agagataagc ttctgcagta tcacaggaag attaaagtgg
     2401 cagacacccc ttccagcgga agttactaat tcggacctga ctgatgcagt tcccatagca
     2461 acccgtgttt cctgggaaac ccgaaaaagg ttgtcatggc atcttttgct ctctagcccc
     2521 acccccaccc caggccgtgc gtttctacag taacctttcc agatggttcc tacttacacg
     2581 acttcataag aaaaaacact atttgaataa agcgcacaaa ttaagtgaaa gactatatag
     2641 aggtgaaatg aatcacaaaa tgctttttta cattttatat aaatcattga tgttatccca
     2701 taacgactgg atcagaaagg aaactgtcgt ctttgaaatg ttacatgtaa ggcatgatct
     2761 gaattaaatt atgtcagctg tttggtaaca atttttaaag atgcttctgt acagttcttt
     2821 cactaagaaa tgacaagtca acagatgaaa cgaaaacagc aatataaaca tggtctgtac
     2881 ataagcaagt gagagtcagt gtacccgaag tcttacacga taccttcaga tttcctcaat
     2941 taaaaaaaaa aaaaag
//
LOCUS       CEUMTFVLA    2682 bp ds-DNA             ORG       07-AUG-1990
DEFINITION  C.unicolor mitochondrial Phe-, Val-tRNA, 12S, and 16S rRNA genes,
            complete, and Leu-tRNA gene, 5' end.
ACCESSION   M35875
KEYWORDS    12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu;
            transfer RNA-Phe; transfer RNA-Val.
SOURCE      C.unicolor liver mitochondrion cell line DH-alpha-5 DNA, clone
            PCUNMR 9.0-18.
  ORGANISM  Mitochondrion Cervus unicolor
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae; 
            Cervinae; Cervus unicolor.
REFERENCE   1  (bases 1 to 2682)
  AUTHORS   Miyamoto,M.M., Kraus,F. and Ryder,O.A.
  TITLE     Phylogeny and evolution of antlered deer determined from
            mitochondrial DNA sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.M.Miyamoto, 03-JUL-1990.
FEATURES       from  to/span     description
    tRNA          1       69     Phe-tRNA
    rRNA         70     1024     12S ribosomal RNA
    tRNA       1025     1091     Val-tRNA
    rRNA       1092     2663     16S ribosomal RNA
    tRNA       2664  >  2682     Leu-tRNA
BASE COUNT     1010 a    555 c    467 g    650 t
ORIGIN      
        1 gttgatgtag cttaaacaac aaagcaaggc actgaaaatg cctagatgag tatattaact
       61 ccataaacat acaggtttgg tcccagcctt cctattaacc cttaatagac ttacacatgc
      121 aagcatccgc accccagtga aaatgccccc taagttaata agactaagag gagctggtat
      181 caagcacaca tccgtagctc acgacacctt gcacagccac acccccacgg gagacagcag
      241 tgataaaaat taagccataa acgaaagttt gactaagcca tattaatcag ggttggtaaa
      301 tttcgtgcca gccaccgcgg tcatacgatt aacccaagtt aataggcaca cggcgtaaag
      361 cgtgttaaag cactatttca aataaagtta aactccaatt aagctgtaaa aagccataat
      421 tgcaacaaaa ataaacaacg aaagtaactt tacaaccgct gaaacacgat agctaggacc
      481 caaactggga ttagataccc cactatgcct agccttaaac acaaatagtt gtgtaaacaa
      541 aactattcgc cagagtacta ccggcaatag cttaaaactc aaaggacttg gcggtgcttt
      601 acacccttct agaggagcct gttctataat cgataaaccc cgataaacct caccattcct
      661 tgctaataca gtctatatac cgccatcttc agcaaaccct aaaaaggtac aaaagtaagc
      721 acaatcataa tacataaaaa cgttaggtca aggtgtaacc tatggaacgg aaagaaatgg
      781 gctacatttt ctaatctaag aaaatccaac acgaaagtta ttatgaaatt aataaccaaa
      841 ggaggattta gcagtaaact aagaatagag tgcttagttg aattaggcca tgaagcacgc
      901 acacaccgcc cgtcaccctc ctcaagtagg cacaatacac tcaaatttat ttacacgtat
      961 taatcatatg agaggagaca agtcgtaaca aggtaagcat actggaaagt gtgcttggat
     1021 aaatcaagat atagcttaaa taaagcatct agtttacacc tagaagattt cacatatcac
     1081 gaatatcttg aaccaattct agcccgcaag cccattcaca ctaaattatc aatgtaacat
     1141 aaaataaaac atttattcaa caacaaaaag tataggagat agaaatttta atacggcgct
     1201 atagagaaag taccgtaagg gaacgatgaa agaaaaaaat caaagtacaa aaaagcaaag
     1261 attacccctt gtaccttttg cataatgagt taactagtaa aaacttaaca aaatgaattt
     1321 cagctaagta ccccgaaacc agacgagcta cttatgaaca atttatcgag aaccaactca
     1381 tctatgtagc aaaatagtga gaagatttgt aagtagaggt gaaacgccca acgagcctgg
     1441 tgatagctgg ttgtccagaa aatgaatatt agttcagctt taaaaatacc aaaaatatga
     1501 acaaattata atgtattttt aaaagttagt ctaaaaaggt acagcctttt agaaatggat
     1561 acaaccttaa ctagagagta agttttaaca ttaaaccata gtaggcctaa aagcagccac
     1621 caattaagaa agcgttaaag ctcaacaata aaatagtatt aatttcaata ataaatagtc
     1681 aactcctaat ctaatactgg actaatctat taagaataga agcaataatg ttaatatgag
     1741 taacaagaag taacttctcc ccgcataagt ttaagtcagt atctgataat actctgacta
     1801 ttaacagcaa aataagaata acctaactat aaataactta ttaactatac tgttaatccg
     1861 acacaggaat gcacttaagg aaagattaaa agaagtaaaa ggaactcggc aaacactaaa
     1921 ccccgcctgt ttaccaaaaa catcacctcc agcataacta gtattggagg cactgcctgc
     1981 ccagtgacaa ccgttaaacg gccgcggtat cctgaccgtg caaaggtagc ataatcactt
     2041 gttctctaaa tagggacttg tatgaatggc cacacgaggg ttttactgtc tcttacttcc
     2101 aatcagtgaa attgaccttc ccgtgaagag gcgggaatat actaataaga cgagaagacc
     2161 ctatggagct ttaactactt agcccaaaga aacaaatttc attaccaagg aaacaacaac
     2221 actctttatg ggctaacagc tttggttggg gtgacctcgg agaacaagaa aacctccgag
     2281 cgattttaaa gactagacct acaagtcgaa tcacacaatc gtttattgat ccaaaaaatt
     2341 gatcaacgga acaagttacc ctagggataa cagcgcaatc ctattcaaga gtccatatcg
     2401 acaatagggt ttacgacctc gatgttggat caggacatcc cgatggtgca accgctatca
     2461 aaggttcgtt tgttcaacga ttaaagtcct acgtgatctg agttcagacc ggagtaatcc
     2521 aggtcggttt ctatctatta tgtatttctc ccagtacgaa aggaccagag aaataaggcc
     2581 aacttcaaat aagcgcctta aattagttaa tgatatcatc ttaattaacc tcacaaacaa
     2641 actttgccct agagaagggc tttgttaagg tggcagagcc gg
//
LOCUS       CPDBA          76 bp ss-RNA             INV       07-AUG-1990
DEFINITION  C.colpoda 5.8S ribosomal RNA.
ACCESSION   M35558
KEYWORDS    5.8S ribosomal RNA.
SOURCE      C.colpoda ribosomal RNA.
  ORGANISM  Colpidium colpoda
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Van Bell,C.T.
  TITLE     5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
            (Ciliophora: Hymenostomatida)
  JOURNAL   J. Mol. Evol. 22, 231-236 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >    76     5.8S ribosomal RNA
BASE COUNT       21 a     14 c     20 g     21 t
ORIGIN      
        1 atgcgaattg cagaacacgc gagtcatcag atctttgaac gcaattggtg ggggtgtaaa
       61 aaccttcatg ttcgtt
//
LOCUS       ECOLIVRA      729 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli high-affinity branched-chain amino acid transport repressor
            (livR) gene, complete cds.
ACCESSION   M36020
KEYWORDS    high-affinity branched-chain amino acid transport repressor.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 729)
  AUTHORS   Antonucci,T.K., Wagner,L.M. and Oxender,D.L.
  TITLE     Cloning, expression, and nucleotide sequence of livR, the repressor
            for high-affinity branched-chain amino acid transport in
            Escherichia coli
  JOURNAL   Proteins 1, 125-133 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        229      729     high-affinity branched-chain amino acid
                                 transport repressor (livR)
BASE COUNT      181 a    192 c    170 g    186 t
ORIGIN      
        1 cggcatgtca tttcctctct ctgcaattgt gtacaaatgt acacaacctt gctaatgttg
       61 tgttccctct cttatttgac tgatctatga ccgtaaattc ttcacgtaat gcattgaaac
      121 gccgaacctg gcattgttta tgttcttctt tttgccaggc ctgttaatgg cgtcctgggc
      181 aacccgtacg cctctatccg cgatattctc tctgtctcga tcgctgaaat gggcgtgttc
      241 tctttggtct ggatctccat tgccggcgcg caggtacgtc cgcgatcatt gcgccgaaca
      301 acagcgcgcg gttctttcgc aatacataag cacgaacaag aaacagataa ccacgcctgc
      361 cataacacca tcaagccgaa ctcttcacat gcttggtgca cctgcgaatg taaccgaagg
      421 tcggtggtta gcacgcataa agccagcata attccggcga ataacaaccc ggcagcgttc
      481 cccatgccga aactgaatgt gctgaactgg aaggtgatca tcccgatcat cagcccaata
      541 acaaagaagg cgcagaatgc cagcaggtca gtgacctgga ctgtgaatcg agataaagcc
      601 gatgcgatcc agtgttttta cgcggcgggc atcgtggatc ttgtaaaagg tcactttgtt
      661 aagcacgacg ttgtcatcta tcggcatctc aatctggcta gaatgacgcg gttaaggaag
      721 caaccgtga
//
LOCUS       GCHBA          76 bp ss-RNA             INV       07-AUG-1990
DEFINITION  G.chattoni 5.8S ribosomal RNA.
ACCESSION   M35559
KEYWORDS    5.8S ribosomal RNA.
SOURCE      G.chattoni ribosomal RNA.
  ORGANISM  Glaucoma chattoni
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Van Bell,C.T.
  TITLE     5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
            (Ciliophora: Hymenostomatida)
  JOURNAL   J. Mol. Evol. 22, 231-236 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >    76     5.8S ribosomal RNA
BASE COUNT       21 a     13 c     20 g     22 t
ORIGIN      
        1 atgcgaattg cagaatccgc gagtcatcag atctttgaac gcaagtggtg gaggtgtaaa
       61 aaccttcatg tttgtt
//
LOCUS       HYRMTFVLA    2680 bp ds-DNA             ORG       07-AUG-1990
DEFINITION  H.inermis mitochondrial Phe-, Val-tRNA, 12S, and 16S rRNA genes,
            complete, and Leu-tRNA gene, 5' end.
ACCESSION   M35876
KEYWORDS    12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu;
            transfer RNA-Phe; transfer RNA-Val.
SOURCE      H.inermis liver and spleen mitochondrion cell line DH-alpha-5 DNA,
            clone PHINMR 4.0-6.
  ORGANISM  Mitochondrion Hydropotes inermis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae; 
            Odocoileinae; Hydropotes inermis.
REFERENCE   1  (bases 1 to 2680)
  AUTHORS   Miyamoto,M.M., Kraus,F. and Ryder,O.A.
  TITLE     Phylogeny and evolution of antlered deer determined from
            mitochondrial DNA sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by M.M.Miyamoto, 03-JUL-1990.
FEATURES       from  to/span     description
    tRNA          1       70     Phe-tRNA
    rRNA         71     1027     12S ribosomal RNA
    tRNA       1028     1094     Val-tRNA
    rRNA       1095     2661     16S ribosomal RNA
    tRNA       2662  >  2680     Leu-tRNA
BASE COUNT     1010 a    526 c    468 g    676 t
ORIGIN      
        1 gttaatgtag cttaaaaaat aaagcaaggc actgaaaatg cctagatgag tatatttaac
       61 tccataaaca caaaggtttg gtcccagcct tcctattgac ctttaataga cttacacatg
      121 caagcatccg catcccagtg aaaatgccct ccaagttaat aagactaaga ggagctggta
      181 tcaagcacac acccgtagct catgacacct tgcttagcca cacccccacg ggagacagca
      241 gtgataaaaa ttaagccatg aacgaaagtt tgactaagcc atattaatta gggttggtaa
      301 atttcgtgcc agccaccgcg gtcatacgat taacccaagt taataggcat acggcgtaaa
      361 gcgtgttaaa gcaacatata aaaatagagt taaactttaa ttaagctgta aaaagccata
      421 attattaacg aaaataaata acgaaagtaa ctttacaatc gctgaaacat gatagctaag
      481 acccaaactg ggattagata ccccactatg cttagcccta aacacaaata gttataataa
      541 caaaactgtt cgccagagta ctaccggcaa tagcttaaaa ctcaaaggac ttggcggtgc
      601 tttatatcct tctagaggag cctgttctat aatcgataaa ccccgatata cctcaccacc
      661 ccttgctaat gcagtctata taccgccatc ttcagcaaac cctaaaaagg agcaaaagta
      721 agcataatca taatacataa aaacgttagg tcaaggtgta acctatggag tgggaagaaa
      781 tgggctacat tttctaattt aagaaaactt aatacgaaag ttattatgaa attaataacc
      841 aaaggaggat ttagcagtaa actaagaata gagtgcttag ttgaactagg caatgaagca
      901 cgcacacacc gcccgtcacc ctcctcaaat aagcacaata cgcttaaatt tattagtatg
      961 tataaactgt atgagaggag acaagtcgta acaaggtaag catactggaa agtgtgcttg
     1021 gataaatcaa gatatagctt aaataaagca cctagtttac acctagaaga tttcacacac
     1081 cacgaatatc ttgaactaat tctagcccat aaatttattt acactaaact attattaata
     1141 ttataaacaa agcatttatc tataattaaa gtataggaga tagaaatttt taatatgacg
     1201 ctatagagaa agtaccgtaa gggaatgatg aaagaagaaa attaaagtac aaaaaagcag
     1261 agattacccc tcgtaccttt tgcataatga gttaactagt aaaaacttaa caaaacgaat
     1321 tttagctaag taccccgaaa ccagacgagc tacttataaa cagtttatca agaaccaact
     1381 catctatgtg gcaaaatagt gagaagattt ataagtagag gtgaaacgcc taacgagcct
     1441 ggtgatagct ggttgtccag aaaatgaata ttagttcagc tttaaaaata ccaaaaatat
     1501 aaacaaatca cactgtattt ttaaaagtta gtctaaaaag gtacagcctt ttagaaatgg
     1561 atacaacctt gactagagag taaaatttaa taacaccata gtaggcctaa aagcagccat
     1621 caattaagaa agcgttaaag ctcaacaata agtttacatt aatttcaata attataaatc
     1681 aactcctagc ttaatactgg actaatctat aaaaatagaa gcaataatgt taatatgagt
     1741 aacaagaagt atattctccc tgcataagtt taagtcagta tctgataata tcctgactat
     1801 taacagcaaa ataagaataa ctcaactata aatgacttat taattctact gttaatccaa
     1861 cacaggaatg cacttaagga aagattaaaa gaagtaaaag gaactcggca aacactaaac
     1921 cccgcctgtt taccaaaaac atcacctcca gcataactag tattggaggc actgcctgcc
     1981 cagtgacaac cgttaaacgg ccgcggtatc ctgaccgtgc aaaggtagca taatcacttg
     2041 ttctctaaat aaggactagt atgaatggcc aaacgagggt tttactgtct cttacttcca
     2101 atcagtgaaa ttgaccttcc cgtgaagagg cgggaatata ataataagac gagaagaccc
     2161 tatggagctt taactactta acccaaagaa acaaacttaa ccactaaggt gacaacaata
     2221 ttctctatgg gttaacagct ttggttgggg tgacctcgga gaacaaaaaa tcctccgagc
     2281 gattttaaag actagaccta caagtcgcat cgcacaatcg tttattgatc caaaaattga
     2341 tcaacggaac aagttaccct agggataaca gcgcaatcct atttaagagt ccatatcgac
     2401 aatagggttt acgacctcga tgttggatca ggacatcctg gtggtgcagc cgctatcaaa
     2461 ggttcgtttg ttcaacgatt aaagtcctac gtgatctgag ttcagaccgg agtaatccag
     2521 gtcggtttct atctattgtg tatttctccc agtacgaaag gaccagagaa ataaggccaa
     2581 ctaaaaaaaa gcgccttaac ccaattaatg atctcatctc aattaaaatc acaaacaaac
     2641 tccgccctag aaaagggctt tgttaaggtg gcagagccgg
//
LOCUS       M36004         59 bp ds-DNA             BAD       07-AUG-1990
DEFINITION  Figure 1. Insert of plasmiod p WL144 and nucleic acid sequence of
            selA-selB overlap.
ACCESSION   M36004
REFERENCE   1  (bases 1 to 59)
  AUTHORS   Forchhammer,K., Ruecknagel,K.-P. and Boeck,A.
  TITLE     Purification and biochemical characterization of SELB, a
            translation factor involved in selenoprotein synthesis
  JOURNAL   J. Biol. Chem. 365, 9346-9350 (1990)
  STANDARD  simple staff_review
COMMENT     Secondary reference to EMBL entry X16644 for Nature 342, 453-456
            (1989)
FEATURES       from  to/span     description
BASE COUNT       14 a      9 c     18 g     18 t
ORIGIN      
        1 gagcaacggt ttttggagat gttgttgaaa tgattattgc gactgccgga cacgttgac
//
LOCUS       M36102        611 bp ds-DNA             BAD       07-AUG-1990
DEFINITION  Figure 2. Nucleotide sequence of AVP-NpII precursor mRNA from
            bovine hypothalamus.
ACCESSION   M36102
REFERENCE   1  (bases 1 to 611)
  AUTHORS   Brownstein,M.J.
  TITLE     biosynthesis of vasopressin and oxytocin
  JOURNAL   Ann. Rev. Physiol. 45, 129-135 (1983)
  STANDARD  unannotated staff_entry
COMMENT     Secondary reference to Nature 295, 299-303, accession number
            J00001.
FEATURES       from  to/span     description
BASE COUNT       92 a    227 c    205 g     86 t      1 others
ORIGIN      
        1 ngcacagtct acagagcagc actgcgcacg tgtgcccacg cgtgccagga tgcccgacgc
       61 cacactgccc gcctgcttcc tcagcctgct ggccttcacc tctgcttgct acttccagaa
      121 ctgcccaagg ggcggcaaga gggccatgtc cgacctggag ctgagacagt gtctcccctg
      181 cggccccggg ggcaaaggcc gctgcttcgg gcccagcatc tgctgcgggg acgagctggg
      241 ctgcttcgtg ggcacggccg aggcgctgcg ctgccaagag gagaactacc tgccgtcgcc
      301 ctgccagtcc ggccagaagc cctgcgggag cgggggccgc tgcgccgccg ccggcatctg
      361 ctgcaacgat gagagctgcg tgaccgagcc cgagtgccgg gaaggtgtcg gcttcccccg
      421 ccgcgttcgc gccaacgacc ggagcaacgc gaccctgctg gacgggccga gcggggcctt
      481 gttgctgcgg ctggtgcagc tggcgggggc gccggagccc gcggagcccg cccagcccgg
      541 cgtctactga ggcgcgcccc ccccctcccc acccctgccc tcgcagcacg aaaaataaac
      601 gttttaaagg c
//
LOCUS       MMUMTFVLA    2682 bp ds-DNA             ORG       07-AUG-1990
DEFINITION  M.reevesi mitochondrial Phe-, Val-tRNA, 12S, and 16S rRNA genes,
            complete, and Leu-tRNA gene, 5' end.
ACCESSION   M35877
KEYWORDS    12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu-tRNA;
            transfer RNA-Phe; transfer RNA-Val.
SOURCE      M.reevesi liver and brain mitochondrion cell line JM109 DNA, clone
            PMREMR 4.8-16.
  ORGANISM  Mitochondrion Muntiacus reevesi
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae; 
            Cervulinae; Muntiacus reevesi.
REFERENCE   1  (bases 1 to 2682)
  AUTHORS   Miyamoto,M.M., Kraus,F. and Ryder,O.A.
  TITLE     Phylogeny and evolution of antlered deer determined from
            mitochondrial DNA sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.M.Miyamoto, 03-JUL-1990.
FEATURES       from  to/span     description
    tRNA          1       69     Phe-tRNA
    rRNA         70     1028     12S ribosomal RNA
    tRNA       1029     1196     Val-tRNA
    rRNA       1197     2664     16S ribosomal RNA
    tRNA       2659  >  2682     Leu-tRNA
BASE COUNT     1001 a    543 c    463 g    675 t
ORIGIN      
        1 gttaatgtag cttaagcaat aaagcaaggc actgaaaatg cctagatgag tgtactagct
       61 ccataaacat ataggtttgg tcccagcctt cctattaacc cttaatagac ttacacatgc
      121 aagcatccac atcccagtga aaatgccctc taagttaata agactaagag gagctggtat
      181 caagcacaca tccgtagctc acgacgcctt gctcagccac acccccacgg gagacagcag
      241 tgacaaaaat taagccataa acgaaagttt gactaagcca tattaattag ggttggtaaa
      301 tttcgtgcca gccaccgcgg tcatacgatt aacccaagtt aataggtata cggcgtaaag
      361 cgtgttaaag cactatttcc aaataaagct aaatttcaat taagctgtaa aaagccataa
      421 ttgcaacaaa aataaataac gaaagtaact ttacagttgc tgaaacacga tagctaggac
      481 ccaaactggg attagatacc ccactatgcc tagccctaaa cacaaatagt ttccacaaac
      541 aaaactattc gccagagtac taccggcaat agcttaaaac tcaaaggact tggcggtgct
      601 ttataccctt ctagaggagc ctgttctata atcgataaac cccgatagac ctcaccattc
      661 ctcgctaata cagtctatat accgccatct tcagcaaacc ctaaaaagga ataaaagtaa
      721 gcgcaatcat aatacgtaaa aacgttaggt caaggtgtaa cctatgggat gggaagaaat
      781 gggctacatt ttctaactta agaataattc atatacgaaa gttattatga aattaataac
      841 caaaggagga tttagcagta aactaagaat agagtgctta gttgaattag gccatgaagc
      901 acgcacacac cgcccgtcac cctcctcaag taagtacgat atgctcaaac ttatttacat
      961 atattaacca tatgagagga gataagtcgt aacaaggtaa gcatactgga aagtgtgctt
     1021 ggataaatca agatatagct taaataaagt atctagttta cacctagaag atttcactat
     1081 accacgaata tcttgaacca attctagccc ataagccgat tcacactaaa ttaccaattt
     1141 attataaata aaacatttac ctaccattaa aagtatagga gatagaaatt ttaatatggc
     1201 gctatagaga tagtaccgta agggaacgat gaaagaaaaa aatcaaagta caaagaagca
     1261 aagattaccc cttgtacctt ttgcataatg agttaactag caaaaactta acaaaatgaa
     1321 ttttagctaa gtaccccgaa accagacgag ctacttatga acaatttatc gagaaccaac
     1381 tcatctatgt agcaaaatag tgagaagatt tgtaagtaga ggtgaaacgc ctaacgagcc
     1441 tggtgatagc tggttgtcca gaaaatgaat attagttcag ctttaaaaat accaaaaata
     1501 ttaacaaatt ataatgtatt tttaaaagtt agtctaaaaa ggtacagcct tttagaaacg
     1561 gatacaacct taattagaga gtaaaactta atacaaatca tagtaggctt aaaagcagcc
     1621 accaattaag aaagcgttaa agctcaacaa taaaacaacc ttaattccaa caacaagtaa
     1681 tcaactccta acccaatact ggactaatct attaaaaata gaagcaataa tgttaatatg
     1741 agtaacaaga agtatttctc cctgcatgag tttaagtcag tatctgataa tattctgact
     1801 attaacagca agataagaat aacctaccca taaataattt attaattata ctgttaatcc
     1861 aacacaggaa tgcacttaag gaaagattaa aagaagtaaa aggaactcgg caaacactaa
     1921 accccgcctg tttaccaaaa acatcacctc tagcattact agtattagag gcactgcctg
     1981 cccagtgaca accgttaaac ggccgcggta tcctgaccgt gcaaaggtag cataatcact
     2041 tgttctctaa ataaggactt gtatgaatgg ccacacgagg gttttactgt ctcttacttc
     2101 caatcagtga aattgacctt cccgtgaaga ggcgggaata tattaataag acgagaagac
     2161 cctatggagc tttaactact tagcccaaag aaacaaaatt tatttctaag gaaacaacaa
     2221 cattctctat gggctaacag ctttggttgg ggtgacctcg gagaacaaaa aatcctccga
     2281 gcgattttaa agactagacc cacaagtcaa atcacataat cgctcattga tccaaataat
     2341 tgatcaacgg aacaagttac cctagggata acagcgcaat cctattcaag agtccatatc
     2401 gacaataggg tttacgacct cgatgttgga tcaggacatc ccgatggtgc aaccgctatc
     2461 aaaggttcgt ttgttcaacg attaaagtcc tacgtgatct gagttcagac cggagtaatc
     2521 caggtcggtt tctatctatt atgtatttct cccagtacga aaggaccaga gaaataaggc
     2581 caacttcaaa caagcgcctt aaattaatta atgatattat cttaattaac tatacaaaca
     2641 gaccctaccc tagaaaaggg ttttgttaag gtggcagagc cg
//
LOCUS       ODOMTFVLA    2677 bp ds-DNA             ORG       07-AUG-1990
DEFINITION  O.virginianus mitochondrial Phe-, Val-tRNA, 12S and 16S rRNA,
            genes, complete, and Leu-tRNA gene, 5' end.
ACCESSION   M35874
KEYWORDS    12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu;
            transfer RNA-Phe; transfer RNA-Val.
SOURCE      O.virginianus brain mitochondrial cell line JM109 DNA, clone POVIMR
            4.1-19.
  ORGANISM  Mitochondrion Odocoileus virginianus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae; 
            Odocoileinae; Odocoileus virginianus.
REFERENCE   1  (bases 1 to 2677)
  AUTHORS   Miyamoto,M.M., Kraus,F. and Ryder,O.A.
  TITLE     Phylogeny and evolution of antlered deer determined from
            mitochondrial DNA sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.M.Miyamoto, 03-JUL-1990.
FEATURES       from  to/span     description
    tRNA          1       69     Phe-tRNA
    rRNA         70     1022     12S ribosomal RNA
    tRNA       1023     1091     Val-tRNA
    rRNA       1092     2658     16S ribosomal RNA
    tRNA       2659  >  2677     Leu-tRNA
BASE COUNT     1000 a    550 c    467 g    660 t
ORIGIN      
        1 gttaatgtag cttaaacaat aaagcaaggc actgaaaatg cctagatgag tataattact
       61 ccataaacac agaggtttgg tcccagcctt cctattaatc cctagcagac ttacacatgc
      121 aagtatcctc atcccagtga aaatgccctc caagtcaata aaactaagag gagctggtat
      181 caagctcaca cccgtagctc atgacgcctt gctcagccac acccccacgg gagacagcag
      241 tgataaaaat taagctataa acgaaagttt gactaagtca tgctaattag ggttggtaaa
      301 tttcgtgcca gccaccgcgg tcatacgatt aacccaagtt aataggcaca cggcgtaaag
      361 cgtgtttaag cactatgcca aatagagtta aactccaatt aagctgtaaa aagccataat
      421 caagacgaaa ataaataacg aaagtaactt tacaaccgct gaaacacgat agctaagatc
      481 caaactggga ttagataccc cactatgctt agccctaaac ataaatagtt atataaacaa
      541 aactattcgc cagagtacta ccggcaatag cttaaaactc aaaggacttg gcggtgcttt
      601 atacccttct agaggagcct gttctataat cgataaaccc cgatagacct taccacccct
      661 tgctaataca gtctatatac cgccatcttc agcaaaccct aaaaaggaac aaaagtaagc
      721 acaatcatta tacataaaaa cgttaggtca aggtgtaacc tatggagtgg aaagaaatgg
      781 gctacatttt ctaatctaag aaaactcttt acgaaagtta ctatgaaatt agtaaccaaa
      841 ggaggattta gcagtaaact aagaatagag tgcttagttg aattaggcca tgaagcacgc
      901 acacaccgcc cgtcaccctc ttcaaatagg cacaatacac ttaaatttaa ttatacgtat
      961 taatcatatg agaagagata agtcgtaaca aggtaagcat actggaaagt gtgcttggat
     1021 aatcaagata tagcttaaac taaagcatct agtttacacc tagaagattt cacatattat
     1081 gaatatcttg aactaaccct agcccacaaa tccatttata ctaaattatc aagatactac
     1141 aaataaaaca tttatccact gttaaaagta taggagatag aaattttaga tatggcgcta
     1201 tagagaaagt accgtaaggg aacgatgaaa gaaaaaatta aagtacaaaa aagcaaagat
     1261 taccccttgt accttttgca taatgagtta actagtaaaa acttaacaaa atgaatttca
     1321 gctaagtacc ccgaaaccag acgagctact tatgaacaat ttatcgagaa tcaactcatc
     1381 tatgtggcaa aatagtgaga agattcataa gtagaggtga aacgcctaac gagcctggtg
     1441 atagctggtt gtccaggaaa tgaatattag ttcagcttta aaaaatacca ataatataaa
     1501 caaatatact gtatttttaa aagttagtct aaaagggaac agccttttag aaacggatac
     1561 aaccttaact agagagtaaa acctgataat accatagtag gcccaaaagc agccaccaat
     1621 taagaaagcg ttaaagctca acaataaaac tatcttaatc ccaataacaa ataatcaact
     1681 cctagcctta atactggact aatctataaa aatagaagca ataatgttaa tatgagtaac
     1741 aagaagtatc ttctccccgc acgagtttaa gtcagtacct gataatattc tgaccattaa
     1801 cagcaaaata agaataacct aattataaat aacttactta ttatactgtt gatccaacac
     1861 aggagtgcac ttaaggaaag attaaaagaa gtaaaaggaa ctcggcaaac accaaacccc
     1921 gcctgtttac caaaaacatc acctccagca taactagtat tgggggcatt gcctgcccag
     1981 tgacaaccgt taaacgccgc ggtatcctga ccgtgcaaag gtagcataat catttgttct
     2041 ctaaataagg acttgtatga atggccaaac gagggtttta ctgtctctta cttccaatca
     2101 gtgaaattga ccttcccgtg aagaggcggg aatactataa taagacgaga agaccctatg
     2161 gagctttaac tacttagtcc aaagaaataa attcactaac caaggaaaca acaatattct
     2221 tcatggacta acagctttgg ttggggtgac ctcggagaac agaaaatcct ccgagcgatt
     2281 ttaaagacta gactcacaag tcaaatcaca caatcgctta ttgatccaaa aaattgatca
     2341 acggaacaag ttaccctagg gataacagcg caatcctatt caagagtcca tatcgacaat
     2401 agggtttacg acctcgatgt tggatcagga caccccgatg gtgcaaccgc tatcaaaggt
     2461 tcgtttgttc aacgattaaa gtcctacgtg atctgagttc agaccggagt aatccaggtc
     2521 ggtttctatc tattacgtat ttctctcagt acgaaaggac cagagaaata gggccaactt
     2581 agaacaagcg ccttaaatta actaatgatt tactcttaat tagatacaca aacaaatctc
     2641 gccctagaaa agggctttgt taaggtggca gagcccg
//
LOCUS       SHPTGA        470 bp ss-mRNA            MAM       07-AUG-1990
DEFINITION  Ovine thyroglobulin mRNA, 3' end.
ACCESSION   M36148
KEYWORDS    thyroglobulin.
SOURCE      Ovine, cDNA to mRNA.
  ORGANISM  Ovis sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 470)
  AUTHORS   Durand,J., Malthiery,Y., Chabaud,O. and Lissitzky,S.
  TITLE     L'extremite C-terminale de la thyroglobuline ovine presente de
            fortes homologies interspecifiques
  JOURNAL   C R Soc Biol (Paris) 181, 258-266 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      385     thyroglobulin (AA at 2)
    mRNA    <     1      470     thyroglobulin mRNA
BASE COUNT      122 a    140 c    112 g     96 t
ORIGIN      
        1 ggggcagttt actctggagg agaaaagcct ttccctgaaa atcatgcagt atttttccaa
       61 cttcatccga tccggaaatc ccaactaccc tcatgagttc tccaggagag cgcctgaatt
      121 cgcagcccct tggcctgact tcgtcccccg tgatggcgca gagagctaca aggagctcag
      181 tgtcctgctc cccaaccgac agggcctgaa aaaagccgac tgctccttct ggtccaagta
      241 catccagtct ctgaaggcct cagcagatga aactaaggac gggctgttag cagagagcga
      301 agaggaggac cagctggctg gctctgggct gacagaggac ctcctgggtc tcccagagtt
      361 agcctccaag acctacagca agtgaccagc cccatcccaa atgccacctt ggacacctta
      421 ttctccaaca tagccgctat caataaagtg tctacatgtc agaaaaagtc
//
LOCUS       TELBA          75 bp ss-RNA             INV       07-AUG-1990
DEFINITION  T.leucophrys 5.8S ribosomal RNA.
ACCESSION   M35555
KEYWORDS    5.8S ribosomal RNA.
SOURCE      T.leucophrys ribosomal RNA.
  ORGANISM  Tetrahymena leucophrys
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Van Bell,C.T.
  TITLE     5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
            (Ciliophora: Hymenostomatida)
  JOURNAL   J. Mol. Evol. 22, 231-236 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >    75     5.8S ribosomal RNA
BASE COUNT       22 a     13 c     20 g     20 t
ORIGIN      
        1 atgcgaattg cagaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa
       61 accttcatgt ttgta
//
LOCUS       TEPBA          74 bp ss-RNA             INV       07-AUG-1990
DEFINITION  T.paravorax 5.8S ribosomal RNA.
ACCESSION   M35556
KEYWORDS    5.8S ribosomal RNA.
SOURCE      T.paravorax ribosomal RNA.
  ORGANISM  Tetrahymena paravorax
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE   1  (bases 1 to 74)
  AUTHORS   Van Bell,C.T.
  TITLE     5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
            (Ciliophora: Hymenostomatida)
  JOURNAL   J. Mol. Evol. 22, 231-236 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >    74     5.8S ribosomal RNA
BASE COUNT       21 a     14 c     18 g     21 t
ORIGIN      
        1 atgcgaattg cagaatccgc gagtcatcag atctttgaac gcaagtggtg aaggctaaaa
       61 ccttcatgtt tgtt
//
LOCUS       TEUBA          75 bp ss-RNA             INV       07-AUG-1990
DEFINITION  T.patula 5.8S ribosomal RNA.
ACCESSION   M35553
KEYWORDS    5.8S ribosomal RNA.
SOURCE      T.patula ribosomal RNA.
  ORGANISM  Tetrahymena patula
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Van Bell,C.T.
  TITLE     5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
            (Ciliophora: Hymenostomatida)
  JOURNAL   J. Mol. Evol. 22, 231-236 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >    75     5.8S ribosomal RNA
BASE COUNT       22 a     13 c     19 g     21 t
ORIGIN      
        1 atgcgaattg caaaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa
       61 accttcatgt ttgtt
//
LOCUS       TEXBA          75 bp ss-RNA             INV       07-AUG-1990
DEFINITION  T.vorax 5.8S ribosomal RNA.
ACCESSION   M35554
KEYWORDS    5.8S ribosomal RNA.
SOURCE      T.vorax ribosomal RNA.
  ORGANISM  Tetrahymena vorax
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Van Bell,C.T.
  TITLE     5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
            (Ciliophora: Hymenostomatida)
  JOURNAL   J. Mol. Evol. 22, 231-236 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >    75     5.8S ribosomal RNA
BASE COUNT       22 a     13 c     20 g     20 t
ORIGIN      
        1 atgcgaattg cagaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa
       61 accttcatgt ttgta
//
LOCUS       TEYBA          75 bp ss-RNA             INV       07-AUG-1990
DEFINITION  T.pyriformis 5.8S ribosomal RNA.
ACCESSION   M35552
KEYWORDS    5.8S ribosomal RNA.
SOURCE      T.pyriformis ribosomal RNA.
  ORGANISM  Tetrahymena pyriformis
            Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; 
            Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Van Bell,C.T.
  TITLE     5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina
            (Ciliophora: Hymenostomatida)
  JOURNAL   J. Mol. Evol. 22, 231-236 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >    75     5.8S ribosomal RNA
BASE COUNT       22 a     13 c     20 g     20 t
ORIGIN      
        1 atgcgaattg cagaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa
       61 accttcatgt ttgta
//
LOCUS       WHTRDNA       140 bp ds-DNA             PLN       07-AUG-1990
DEFINITION  Wheat ribosomal RNA spacer DNA.
ACCESSION   M36062
KEYWORDS    .
SOURCE      Wheat seedling DNA, clone pTA250.
  ORGANISM  Triticum sp.
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 140)
  AUTHORS   Appels,R. and Dvorak,J.
  TITLE     The wheat ribosomal DNA spacer region: Its structure and variation
            in populations and among species
  JOURNAL   Theor. Appl. Genet. 63, 337-348 (1982)
  STANDARD  simple staff_review
BASE COUNT       15 a     49 c     42 g     34 t
ORIGIN      
        1 ggatccgcgt caacttccgt ccacgacggc cgccgggcct tttttcccgt gcccacgtac
       61 agcccgttca cgggtccgtg taagggtccg tgtacgcggc gtctgccacg tacgtggttt
      121 tgcccggttt tccatggcgc
//
LOCUS       HUMINT01       42 bp ss-mRNA            PRI       07-AUG-1990
DEFINITION  Human leukocyte adhesion glycoprotein p150,95 mRNA, exon 1.
ACCESSION   M29165 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     1 of 7
SOURCE      Human cell line HL-60, cDNA to mRNA, clone lambda-X47.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 42)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     cDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  simple automatic
FEATURES       from  to/span     description
    mRNA    <     1  >    42     P150,95 mRNA, exon 1
BASE COUNT        9 a     15 c     10 g      8 t
ORIGIN      1 bp upstream of EcoRI site; chromosome 16p11-13.1.
        1 gaattcctgc cactcttcct gcaacggccc aggagctcag ag
//
LOCUS       HUMINT02     3690 bp ds-DNA             PRI       07-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            2 - 6.
ACCESSION   M29482 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     2 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3690, exons only)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3690)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   Unpublished (1989) Center for Blood Res., Boston MA 02115
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
            
FEATURES       from  to/span     description
    pept       1028     1064     integrin alpha subunit precursor, exon 2
                                 (first expressed exon)
               1666     1771     integrin alpha subunit precursor, exon 3
               2391     2494     integrin alpha subunit precursor, exon 4
               2795     2865     integrin alpha subunit precursor, exon 5
               3020  +  3131     integrin alpha subunit precursor, exon 6
    sigp       1028     1064     integrin alpha subunit signal peptide
               1666     1685     integrin alpha subunit signal peptide
    matp       1686     1771     integrin alpha subunit
               2391     2494     integrin alpha subunit
               2795     2865     integrin alpha subunit
               3020  +  3131     integrin alpha subunit
    pre-msg <     1  >  3690     P150,95 mRNA and introns
    IVS     <     1      975     P150,95 intron A
    IVS        1065     1665     P150,95 intron B
    IVS        1772     2390     P150,95 intron C
    IVS        2495     2794     P150,95 intron D
    IVS        2866     3019     P150,95 intron E
    IVS        3132  >  3690     P150,95 intron F
BASE COUNT      764 a   1050 c   1020 g    856 t
ORIGIN      1 bp upstream of BamHI site; chromosome 16p11-13.1.
        1 ggatcccttg ggcccaggag ttcgaagcag cagtgaacta tgcacccact gcactccagc
       61 ctgggtggca gagcaagacc ctgtttctga aattaaaaaa aaaaattgat gtacattagg
      121 gggcttccac ggcctgagct gcttcccctt gctttcctcc cagtggccct gaccttgtct
      181 cttacaactt cccaccctga ctgtctggtt acccattgct gatttcacac acagaccctc
      241 ctgtaccctg cctcatccat gtctggctgc tctgtcatct cccaactttg gttgctttca
      301 atgctcagct caagcaccac ctctttcagg aagccttctc agaaagccac accttcacaa
      361 cccgggtgag gcaccctgtg gtctctgtgc ttccccctca cagcaatgaa cttgctgttt
      421 atacatctgc ctctccactg accccagggc tggtgctttg tggtttatat tttcttcccc
      481 acctagcaga gggcttgcat ctccaggctc aaattaggct tcttgaataa atgatgaata
      541 aatgagtgaa tgaatgaatg aacaaatact cgctctgtgc tcctcctagg gacccggatc
      601 ccccactcct tggcccagac tttccaggtc agagtggagg cctcccacca gggtttcctt
      661 taggggtcct gaggggtggg catctgccca aaccccctcc agtctggctg aaatttcaag
      721 gtcaaggggt ccttctggca gtcaagggtg agcctgggag gggcagggca gggatttgca
      781 tccatctaag caaagggcat caagccaagt catctgatga gagtgactcc ggttgggggg
      841 tgggggcgtg tgggagccga gcctgtcctc ggatcagttg cgtactctgc ccgccccctc
      901 tgactcatgc tgacaatctt cttccttccc ctggccacct ctctgcccac ttgcttcctc
      961 agtaccttgg tccagctctt cctgcaacgg cccaggagct cagagctcca catctgacct
     1021 tctagtcatg accaggacca gggcagcact cctcctgttc acaggtgagc ctggacccca
     1081 atgaagtagg gctggggacc caggcccaag ggagccaggg ccctgaactg ggggctcagg
     1141 ctggggggtt aggatctggg taggaagaga gactcagtca agcctgaggg ggaggcaggc
     1201 acatagggtt tgagatttgg agtttgtgga gggagaggat attgatgaac caattttggg
     1261 agagttccag agatgctgga agagaggcca gttgtctctg tactgcagag atttttaaaa
     1321 taggcagaat gcgccaactt gtgctctgtg gacaggatgc tttggtccgc aagttttcct
     1381 ggacgcactc tcatagcgcc cgaggtgcac gttggggaaa gatccttttt agagcctggg
     1441 tactgctctg cagaaatgga gaactgcaac tcgatagtgg atggtgggca aggggcatcc
     1501 ctggaccctg ggaaggagag aaggggatga gttgggtgtc cagaagaccc aggcaccccg
     1561 ggcatcaggc tcggagggga gattgggacg ctggggccgg gggtggaggg cagccaggca
     1621 gaaggaagac ccttctccaa agctctcttc ccacctcttt cccagcctta gcaacttctc
     1681 taggtttcaa cttggacaca gaggagctga cagccttccg tgtggacagc gctgggtttg
     1741 gagacagcgt ggtccagtat gccaactcct ggtgaggccc aggtggtgct cctttggctc
     1801 catccatcct ctccctgctc aggccccatc cccccggccc tgccctgtta tttgcaaact
     1861 ctcctctctg tctggtgtag cgactgccct ggctaatgaa gatttgcctt gaaggcaggc
     1921 acggtctcac agctaacatt tacagagcag taagtgcagt gccaggctca tcacaggtgg
     1981 atgctgattt agtccacacg acagcctgtg agtaggaatc agtcgtgcaa caaacactta
     2041 tttgtttttt ctttcttttt ttctatacat ttaaaaatat atagagacag ggtctcacta
     2101 tgttgcctgg gttggtctca aactcctggg ctcaagcaat cctcccgcct cagcctccca
     2161 aagtgctggg attccaggtg tgagccacca cacccagact caacaaatat ttcttgtctc
     2221 catacgccag agaatccaac agacagaaat cccttccaca tggactttaa attattaaaa
     2281 tccatcttgc agatgaggaa gctgaggctc agggagggaa cgcaaacttg ccggagtggc
     2341 agctgtcggc gtccacactc ttacctaaag tgttctttgt ctcctcgcag ggtggtggtt
     2401 ggagcccccc aaaagataac agctgccaac caaacgggtg gcctctacca gtgtggctac
     2461 agcactggtg cctgtgagcc catcggcctg cagggtgagt caccgcccct cccgggaccc
     2521 agggccgggc tcccaggctt ccctgctcca ggggcccgtg gactcccgga gtgtcacttt
     2581 cagcttccct gtgtctgaga ccctcaccct cagatatgct tcctggcccc ttaaggcctc
     2641 cccgcccatc gcactcccgc agctctgtca agacccgaca gcttccttca ccgtcagacc
     2701 tccttgtctc ccaggtggag gtgacccctg cccagctctt ccacagcctt ctctgtaggg
     2761 cccgagagtg accatgcaca tatctgtccc acagtgcccc cggaggccgt gaacatgtcc
     2821 ctgggcctgt ccctggcgtc taccaccagc ccttcccagc tgctggtgag tggccctggg
     2881 tcacaggagg cttctgaggg agggagggag gagccggggc cgccgggggc tgggactctc
     2941 ctgtagggtg gaggttccgg catctgaggg tgggaggtac atgccaggga gtgcccccag
     3001 cagcccgctg tgtccccagg cctgcggccc caccgtgcac cacgagtgcg ggaggaacat
     3061 gtacctcacc ggactctgct tcctcctggg ccccacccag ctcacccaga ggctcccggt
     3121 gtccaggcag ggtgagtgtc gggaccacca aggctttgag gagctcacgc acatccaatt
     3181 gggggtgcgg tgggctagag acagtcttgc cagagtggat cagaaagaag ggatctggaa
     3241 aaagagttac ctcgtgttgc agtggttcct gacgctgctg cccgcacatc ctgccgatcg
     3301 ccgcacgctg ccggaccttt cctgtgacct taacctctcc aagcctcagt ttcttcatct
     3361 gttggatggg gataataaca cacccagcac tgaaagcaac acaggatgat tcatggccag
     3421 gggttagcac agcagctagc accaggcgac acccatgccg gccagctgtt gttattttta
     3481 gaggagagga ctattttcat ccaatgggtc ctgggatatg accaattggt ttgtgccgta
     3541 gtttaggaaa ggtcagtgaa agtgcagtgt gagcaacgtg tgtgtgtaca tgtgtgtata
     3601 tgtatgcatg tgtatacatg tgcacatgca catgtacatg catgtgtgtg catgtatgtg
     3661 tgtgtgtgca tgtgcatgca ggttgagacg
//
LOCUS       HUMINT03     4863 bp ds-DNA             PRI       07-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            7 - 15.
ACCESSION   M29483 Y00093
KEYWORDS    integrin; protein p150,95.
SEGMENT     3 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 294 to 3967; exons only)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 4863)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   Unpublished (1989) Center for Blood Res., Boston MA 02115
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +   294      424     integrin alpha subunit precursor, exon 7
                511      656     integrin alpha subunit precursor, exon 8
                901     1054     integrin alpha subunit precursor, exon 9
               1659     1809     integrin alpha subunit precursor, exon 10
               2428     2501     integrin alpha subunit precursor, exon 11
               2665     2794     integrin alpha subunit precursor, exon 12
               3207     3349     integrin alpha subunit precursor, exon 13
               3531     3671     integrin alpha subunit precursor, exon 14
               3760  +  3969     integrin alpha subunit precursor, exon 15
    matp    +   294      424     integrin alpha subunit
                511      656     integrin alpha subunit
                901     1054     integrin alpha subunit
               1659     1809     integrin alpha subunit
               2428     2501     integrin alpha subunit
               2665     2794     integrin alpha subunit
               3207     3349     integrin alpha subunit
               3531     3671     integrin alpha subunit
               3760  +  3969     integrin alpha subunit
    pre-msg <     1  >  4861     P150,95 mRNA and introns
    IVS     <     1      293     P150,95 intron F
    IVS         425      510     P150,95 intron G
    IVS         657      900     P150,95 intron H
    IVS        1055     1658     P150,95 intron I
    IVS        1810     2427     P150,95 intron J
    IVS        2502     2664     P150,95 intron K
    IVS        2795     3206     P150,95 intron L
    IVS        3350     3530     P150,95 intron M
    IVS        3672     3759     P150,95 intron N
    IVS        3970  >  4863     P150,95 intron O
BASE COUNT      947 a   1358 c   1460 g   1097 t      1 others
ORIGIN      Chromosome 16p11-13.1.
        1 acctgtgatc gccccctcgc ctcccaaagt actgggatta cacggtgagc caccacgcct
       61 ggctcaatca cagcctcttt aggcaacttt aagagaatga agggccttgt tccaggcaag
      121 gggttaggga acgtctgccc ctgatgagga gaggacccag ggtgtggagc ctgactccca
      181 tcgccagact aggggcttag ggaggaaggg ttttggagag tgagctcttg caggagccac
      241 ggtcctggac tccaggagtg tcacttggag gacggtgcca cctccttccc cagagtgccc
      301 aagacaggag caggacattg tgttcctgat cgatggctca ggcagcatct cctcccgcaa
      361 ctttgccacg atgatgaact tcgtgagagc tgtgataagc cagttccaga gacccagcac
      421 ccaggtgtgc tttgggggag ggaggctgct gggggtgggt gcttggatcc tggtgatagg
      481 cctcagccca gccctgtgtg cttctcccag ttttccctga tgcagttctc caacaaattc
      541 caaacacact tgactttcga ggaattcagg cgcacgtcaa accccctcag cctgttggct
      601 tctgttcacc agctgcaagg gtttacatac acggccaccg ccatccaaaa tgtcgtgtga
      661 gtcctgattt cttccaggca cagtcccaaa gcacccaggt cttcccttgg cctcatctga
      721 tctccacgag aaggggacag gcagggacca aaatccagcc cgtgataccc ttgccaagct
      781 ggggcctctg ggtgggactg gggcctccca aaggaaaagg catcttctaa ttttcacaag
      841 ggcaccaggg gctagtgtgg tttggttcac aggcctctaa gacctctcct ttcctgatag
      901 gcaccgattg ttccatgcct catatggggc ccgtagggat gccaccaaaa ttctcattgt
      961 catcactgat gggaagaaag aaggcgacac gctggattat aaggatgtca tccccatggc
     1021 tgatgcagca ggcatcatcc gctatgcaat tggggtaggc ctgggatggc ttcccacttc
     1081 tcccacggct tcctctcagg gcaactcccc tttctgtgta tgatgttctt ttctctttga
     1141 gacagggtct tgctctatca cccaggaagt ggtgcaatcc tagctcactg cagccttgaa
     1201 ctcctgggct ccagtgatcc tcccaccccg cctcccagta gtcgggacca caggtgtgtg
     1261 ccatcaagcc tggctatttt ctttttggtt gagatggggt cttgctatgt tgcccaggct
     1321 ggtctcaaat tcctggcctt aagcaattct gccaccttgg tctcccaaag gcacagggga
     1381 ttacaggcgt gaaccaccgc caacaacatc cctttcaagg atagaaacac cagctctctc
     1441 ggctcttact gccttaagga tgaaaactct gccccagact ggagaccatg atgatccttt
     1501 ctcctaaact ccctgatgct gtccgggctt cgtgtttctc ctgtgtccac cgggtgtgat
     1561 catgttgatc ttgtggggtt attggaagat gttgcaccca gtgcacacag gcacatttga
     1621 tttattattt ttactgagtt gatcttttct ggggacaggt tggattagct tttcaaaaca
     1681 gaaattcttg gaaagaatta aatgacattg catcgaagcc ctcccaggaa cacatattta
     1741 aagtggagga ctttgatgct ctgaaagata ttcaaaccca actgagggag aagatctttc
     1801 ccattgaggg tgagtctgaa gggagctctt cgcttgggga atcctcagcc gttaacacct
     1861 ttccacttag aacccgaggc tccgtgaaac aggtagacag cgtctcggtt ctcctgcttt
     1921 cccgggaccc cgatagccat gtctgtcagc ttgtccccac tgacgtcccc cagcactgtc
     1981 agagctgccc caaagtggcc ccagggatgg ccctgctccc cacagagagt gatctcacac
     2041 caccaccggc tccactgcag aacaaaagca gtccaggccc aacccaggag acccttccac
     2101 ccacaccggg ccctacccag cccacatccc accagccact cactcccctg ggcaaggggc
     2161 acacggacac ctggccccct cggtctgctt gtagacctgt ggggggccct gatgaggacc
     2221 agatcggtgc tgccatcgct gtccacatcc atggagcaga ggggggcccc gaagtcggag
     2281 ctgatctgga ggcagagcct ggtccctgtc acaggcacca gctctccctg tagcctccag
     2341 tcttagcttc tcctaaagct gaagtgttct tggacctggc aaagcccgtc tccctccctg
     2401 gcactcaagc gtcatgcctt accccaggta cggagaccac aagcagtagc tccttcgaat
     2461 tggagatggc acaggagggc ttcagcgctg tgttcacacc tgtgcgtggg gccccttagg
     2521 ccgatgatgt gccgtgaggg gagggggggc agggaaggcc agggtgggtg tcaggtgggt
     2581 aagaggcgca aggcggaagg catatctctg gtcatgctgt cttcctgctc tcggctctgc
     2641 tcagccctgg aatcctttct ccaggatggc cccgttctgg gggctgtggg gagcttcacc
     2701 tggtctggag gtgccttcct gtacccccca aatatgagcc ctaccttcat caacatgtct
     2761 caggagaatg tggacatgag ggactcttac ctgggtgaga aacagccagg ggttggggac
     2821 aggtgggaga tgcactgccc agggtggggt ccagggttct ggggaagggg taggggnatg
     2881 ggggctgtgc tgcccagtgt ggggcccagc ttctggggag ggaggatggg cactgtgctg
     2941 cccggggtgg gttccagggt tctggggagg gggaatgggg gctgtgctgc ctggggtggg
     3001 aatccagggt tctggggaga ggggatgggc gctgtgctgc ctggggtggg ttccagggtt
     3061 ctggggagag aggatggggg ctgcattgcc cagggtgggg tccagggttc tggggagggg
     3121 agatggtgct gtgctgcccg gggtgggaat ccagggttct ggggaggggg aatgggggcc
     3181 tttgtgctga ggcctgggcc cctcaggtta ctccaccgag ctggccctct ggaaaggggt
     3241 gcagagcctg gtcctggggg ccccccgcta ccagcacacc gggaaggctg tcatcttcac
     3301 ccaggtgtcc aggcaatgga ggatgaaggc cgaagtcacg gggactcagg ttgggcgtga
     3361 caggagccac aggccgggaa ttcagggtag gggaggtggc tgggcagaga agaggatgga
     3421 ggggctttga gggccttggg ggaggtcctg gtacctgggg agaggtggga cctggcccac
     3481 agggctgcct ctggcaggga caggcagcat gacccagctc tgcccttcag atcggctcct
     3541 acttcgggcc ctccctctgc tccgtggacg tagacagcga cggcagcacc gacctggtcc
     3601 tcatcgggcc cccccattac tacgagcaga cccgaggggc ccaggtgtct gtgtgtccct
     3661 tgcccagggg ggtgagtggc tgatgggcct ggtgtgtgtg gggtctggtg tgggtgaggg
     3721 gttgcccggg ttgggcctgg cactgttttt tttctgcagt ggagaaggtg gtggtgtgat
     3781 gctgttctct acggggagca gggccacccc tggggtcgct ttggggcggc tctgacagtg
     3841 ctgggggatg tgaatgggga caagctgaca gacgtggtca tcggggcccc aggagaggag
     3901 gagaaccggg gtgctgtcta cctgtttcac ggagtcttgg gacccagcat cagcccctcc
     3961 cacagccagg tgaggccgtg tcccatttct gtcactagag cagcctgctt cttgcctctc
     4021 ccactctgtc atactggaaa actgtccctt tttacctttt cctacctccc ttgcccagct
     4081 ctgagcacct tgtagcagtg gcgtggtctc agctcactgc aacctccgcc tcccaggttc
     4141 aagcgattct ctctgcctca gcctccagag tagctgggat tacaggcatg caccaccatg
     4201 tccggttatt ttttgtattt tagtagagac acgtttcgcc atgttggcta ggctggtctt
     4261 gaactcctga cctcaggtga tctgcctgtc tcggcctccc aaagtgctgg gattataggc
     4321 gtgagccgcc atgcccaggc ccctgccagt tttacaaggt acacaggtca ggcacagaaa
     4381 acccatttta cagatggaat ctgggacact aggaagacaa gggccttggt ttgttggagg
     4441 ttcagagtgg gtccgagatg gtgaaggaac tccggcctcc tgacctctaa cccggtgtgc
     4501 agtctcccgg ctccctgctg ctcaccactt aggtccagtc atttcaacct ccctccacct
     4561 gcccctctcc tccctggatg ctacatgatt ttattccctt cctgccatca aggtcccacc
     4621 aaatgcccat ccctgcagcc tccctccacc ccaagggtag cagggttccc tgagaacgaa
     4681 gggctgcctt tcttggcaaa agtcaagaaa gctctgttaa aaaataggca aagggcctgc
     4741 tccctggtgg ctcacatctg taattccgac actttgggag gctgaggcag gaggatcact
     4801 tgaggccagg agtttcaggc cagctgggca acataggggg accccatctc tagaaaaaat
     4861 ttt
//
LOCUS       HUMINT04     2746 bp ds-DNA             PRI       07-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            16 - 21.
ACCESSION   M29484 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     4 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 382 to 2672; exons only)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2746)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   Unpublished (1989) Center for Blood Res., Boston MA 02115
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +   382      512     integrin alpha subunit precursor, exon 16
                628      791     integrin alpha subunit precursor, exon 17
                924     1078     integrin alpha subunit precursor, exon 18
               1656     1787     integrin alpha subunit precursor, exon 19
               1875     1948     integrin alpha subunit precursor, exon 20
               2531  +  2672     integrin alpha subunit
    matp    +   382      512     integrin alpha subunit
                628      791     integrin alpha subunit
                924     1078     integrin alpha subunit
               1656     1787     integrin alpha subunit
               1875     1948     integrin alpha subunit
               2531  +  2672     integrin alpha subunit
    pre-msg <     1  >  2746     P150,95 mRNA and introns
    IVS     <     1      381     P150,95 intron O
    IVS         513      627     P150,95 intron P (no splice consensus)
    IVS         792      923     P150,95 intron Q
    IVS        1079     1655     P150,95 intron R
    IVS        1788     1874     P150,95 intron S
    IVS        1949     2530     P150,95 intron T
    IVS        2673  >  2746     P150,95 intron U
BASE COUNT      602 a    821 c    746 g    577 t
ORIGIN      Chromosome 16p11-13.1.
        1 gaattcctat cctgagcatg gctaaactct gagctaatag tatcattata gaaagatgag
       61 gaaacggagg cacagacaga ttgagtcctt gcccacggcc tcgtggctca tacgtggagg
      121 agtcagaatt ggaactagag actgatcgaa tgaatgacac tcgggtcacc aggacacctt
      181 cctatctcca ctcttacatc tgtttcttag caatcatctc ccaactccta cctcctcttt
      241 tcaggttctt cttggtgaca tctgttacaa ctcacccctt ctctcccttt ccgatggtcc
      301 tacctccata ttccccttgt tacttatttc caacttcttc cctagtttcc atcttgattc
      361 acccttctct cctctggcca gcggatcgcg ggctcccagc tctcctccag gctgcagtat
      421 tttgggcagg cactgagcgg gggtcaagac ctcacccagg atggactggt ggacctggct
      481 gtgggggccc ggggccaggt gctcctgctc aggtgagagc agactttctc agaggctccc
      541 catgtggtcc taggttcaga tgggggtgcc cacccacgtg gtgctcccac cagcgacggc
      601 tgtcctcagc tcggtgctct gcccgcagac cagacctgtg ctctgggtgg gggtgagcat
      661 gcagttcata cctgccgaga tccccaggtc tgcgtttgag tgtcgggagc aggtggtctc
      721 tgagcagacc ctggtacagt ccaacatctg cctttacatt gacaaacgtt ctaagaacct
      781 gcttgggagc cgtgagtccc ctcccctcca acccaggaca ccctgacctc tggagtcccc
      841 catcccaggc ccctgtctcc caccctgctc attgtccacc caaggagttc ctgtctcaac
      901 gccgtccctg cgaccgccta caggtgacct ccaaagctct gtgaccttgg acctggccct
      961 cgaccctggc cgcctgagtc cccgtgccac cttccaggaa acaaagaacc ggagtctgag
     1021 ccgagtccga gtcctcgggc tgaaggcaca ctgtgaaaac ttcaacctgc tgctcccggt
     1081 gcgtctgggc atgaacgtgg gtggcggccg cgctggggct ggcagaaggc agggcaggga
     1141 gagaacaggc tgtgttccgg cctccctgtg gctcagccca gcacaggacc agccatgcag
     1201 gacgtgctta ctgcacgtta gccagtgagt gagtgagcga gcaaacaagt gatgagatcg
     1261 tctgcaattt ccagggccac acgattggat ttcaggaaag agaattgggc aacctgagag
     1321 agctctgggc ttaccttctg gcttttcagg cattcactga cagggttatc gagctgctcc
     1381 tggagacagc cttgcctggg ccatgggcat aggtggccaa aacagtcatt gctgatcggg
     1441 aggtctgggg gggggaggaa aaaaacaaag acaaacaagg ggagaggaca gagagggtgt
     1501 cagggaggca tcctgaaggc ggtgacgctg agcaggctct ggaggaagtg aagcagagcg
     1561 ggagctgggc agaggcagga taagaactgc ggatgaggcc gagcgcagct cttaccctcc
     1621 ccttaccctc gctccccgcg acgcccgtcc cccagagctg cgtggaggac tctgtgaccc
     1681 ccattacctt gcgtctgaac ttcacgctgg tgggcaagcc cctccttgcc ttcagaaacc
     1741 tgcggcctat gctggccgcc gatgctcaga gatacttcac ggcctccgtg agtcctggca
     1801 ctgggtctcc cagagagggt gcacagcgtg gggcctgggt ctcggagaaa accccccgtt
     1861 gccttcccac gcagctaccc tttgagaaga actgtggagc cgaccatatc tgccaggaca
     1921 atctcggcat ctccttcagc ttcccagggt gagcgcccca ccttagatgc cctactgccc
     1981 cagcctcctt cctggaatct gggactcctg cctctgctct ccctaacatt gtctcatcct
     2041 atagtcaaaa cccaggtgtc ttggctgggc acagtggctc actcctgtaa tccagcactt
     2101 tgggaggccg aggtgggagg acttttgagg ccaggagtta gggttacgac ctgggcaaca
     2161 gagcgacacc catttccaca aaaacaaaac aacaacaaca acaacaacaa caacaacaac
     2221 aacaacatca cttgagtgtg gtagagcatg cctatagtcc cagctacttg ggaggctgaa
     2281 gcttaaggct tgcttgagct ctggagttgg aggtctgcag tgagccataa tcacaccact
     2341 gcactccagc ctgggtgaaa gagcaggact ctgtctctta aaaaaaaaga agaagaagaa
     2401 gaagaagaag aagaacccag gggtccgtcc cctgtctatc tcccaaatcc ccacccaccc
     2461 cattttatcc cagaccattt ctagcctcag tcacagaatc atcttatcct ttccttcacc
     2521 tgatacccag cttgaagtcc ctgctggtgg ggagtaacct ggagctgaac gcagaagtga
     2581 tggtgtggaa tgacggggaa gactcctacg gaaccaccat caccttctcc caccccgcag
     2641 gactgtccta ccgctacgtg gcagagggcc aggtgcacct ctggggaagg aggaggaggc
     2701 agggctgggc gttagcgtag attcccgtgc gggttcagaa cccggg
//
LOCUS       HUMINT05     1006 bp ds-DNA             PRI       07-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            22 - 24.
ACCESSION   M29485 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     5 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 202 to 665)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1006)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   Unpublished (1989) Center for Blood Res., Boston MA 02115
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +   203      319     integrin alpha subunit precursor, exon 22
                404      483     integrin alpha subunit precursor, exon 23
                582  +   665     integrin alpha subunit precursor, exon 24
    matp    +   203      319     integrin alpha subunit
                404      483     integrin alpha subunit
                582  +   665     integrin alpha subunit
    pre-msg <     1  >  1006     P150,95 mRNA and introns
    IVS     <     1      202     P150,95 intron U
    IVS         320      403     P150,95 intron V
    IVS         484      581     P150,95 intron W
    IVS         666  >  1006     P150,95 intron X
BASE COUNT      228 a    286 c    229 g    263 t
ORIGIN      Chromosome 16p11-13.1.
        1 ttctatcctg gtgacagagt gagacctggt ctcaaaacaa acaaacaaac aaaatataag
       61 cttaaggtgg gctccaggaa gctttatcac tacttcgtgg cgtgtctttg gaatgctgtt
      121 atattaggtt ggtgcaaaag taattgggtt tttgccattg ctttcaattt caactaatac
      181 tcctctactt tctcatgcct agaaacaagg gcagctgcgt tccctgcacc tgacatgtga
      241 cagcgcccca gttgggagcc agggcacctg gagcaccagc tgcagaatca accacctcat
      301 cttccgtggc ggcgcccagg tcagcctggc ttctgtcccc tcactgctcc cctgccccac
      361 cctgtcttta ctgctctgtg acctctcagt tccttttcct cagatcacct tcttggctac
      421 ctttgacgtc tcccccaagg ctgtcctggg agaccggctg cttctgacag ccaatgtgag
      481 caggtgagcc gggccatggc caggggcagt gcctcatctc cagcctcaca ccccattctc
      541 ctctggggcc tctggcaact gagtctctcc tctttctcca gtgagaacaa cactcccagg
      601 accagcaaga ccaccttcca gctggagctc ccggtgaagt atgctgtcta cactgtggtt
      661 agcaggtcac aggtacccac tgcaggaaaa agggttcttc tctctgaccc tcaaaaagaa
      721 aaaaaaaaaa aaggccttga aacgctgcca cagagggtga gataaggtgt ttgaaagtaa
      781 aaggtcaggt gtttcagaag acaccttcct tcagccaatg ccttcctcga atttgctgtg
      841 tgccaggcag ggtgctgtgg ttattttcca tacattcatt tgacattcat tgaagattta
      901 ctgagccccc attatgtgtg atcaaaccag acatgaaccc tcgccttgtg ggtgtgcctt
      961 gctggatgtc tcctgtgttc cactctcact gcactgcatg ctgagt
//
LOCUS       HUMINT06     1904 bp ds-DNA             PRI       07-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            25 - 30.
ACCESSION   M29486 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     6 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 86 to 1528; exons only)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1904)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   Unpublished (1989) Center for Blood Res., Boston MA 02115
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +    86      164     integrin alpha subunit precursor, exon 25
                280      387     integrin alpha subunit precursor, exon 26
                505      588     integrin alpha subunit precursor, exon 27
                788      901     integrin alpha subunit precursor, exon 28
               1044     1145     integrin alpha subunit precursor, exon 29
               1417  +  1527     integrin alpha subunit precursor, exon 30
    matp    +    86      164     integrin alpha subunit
                280      387     integrin alpha subunit
                505      588     integrin alpha subunit
                788      901     integrin alpha subunit
               1044     1145     integrin alpha subunit
               1417  +  1527     integrin alpha subunit
    pre-msg <     1  >  1904     P150,95 mRNA and introns
    IVS     <     1       85     P150,95 intron X
    IVS         165      279     P150,95 intron Y
    IVS         388      504     P150,95 intron Z
    IVS         589      787     P150,95 intron AA
    IVS         902     1043     P150,95 intron AB
    IVS        1146     1416     P150,95 intron AC
    IVS        1528  >  1903     P150,95 intron AD
BASE COUNT      381 a    545 c    525 g    453 t
ORIGIN      Chromosome 16p11-13.1.
        1 accacctgtc ctctcatgct ctagccaatg ccttctgcag atgcccatgg tagttcacat
       61 ccacttatgc gtcttctctc tccagccacg aacaattcac caaatacctc aacttctcag
      121 agtctgagga gaaggaaagc catgtggcca tgcacagata ccaggtcagg tggtggtgta
      181 cgcaggaaga ccttgggcat ggggtgggag gctgggtagc cggagactgg ggagggattt
      241 ggctttggcg tggctctgcc ctcagtgccc tctgtgcagg tcaataacct gggacagagg
      301 gacctgcctg tcagcatcaa cttctgggtg cctgtggagc tgaaccagga ggctgtgtgg
      361 atggatgtgg aggtctccct cccccaggta cccaaggact gcatgtggct cctccacgaa
      421 tgccctttct acctggattc cttgtgcccc atgtgggtcc ctgatgtccc agctgagaca
      481 cttgttctct gcattttccc ccagaaccca tcccttcggt gctcctcaga gaaaatcgcg
      541 ggcccagcat ctgacttcct ggcgcacatt cagaagaatc ccgtgctggt gaggagggct
      601 ctgggtctgg ccctcactgt aggcccacat cagaggaatt taacccagga gttcatgttc
      661 catatccatc ctgctgaagt accctcttgc attcggatat ggccgctgcc ctcaagtcac
      721 acgcataatg ctgcctccca ccttcacact catctttctc agccccatgc tatttatctg
      781 cccccaggac tgctccattg ctggctgcct gcggttccgc tgtgacgtcc cctccttcag
      841 cgtccaggag gagctggatt tcaccctgaa gggcaacctc agctttggct gggtccgcca
      901 ggtgtgtggg tgcaacgaca gagcccctgc cccagactca ggcgggacct ggcatgtctg
      961 tgcccatctg caagccaggg cacccccaga gctctgagcc tcccccagag ccagttcaac
     1021 aggtttcccc cacccctttg cagatattgc agaagaaggt gtcggtcgtg agtgtggctg
     1081 aaattacgtt cgacacatcc gtgtactccc agcttccagg acaggaggca tttatgagag
     1141 ctcaggtaga gaccatgtgg agggcagcga ccaggcagga aagagggtcc caagggctac
     1201 atctgtggtg ctgggtgggg ggtttgcaag ccttggggga ggagggtgaa ggcctctggg
     1261 caggatagct gtccctaagg gcacgggtgc tgctgtgtct cacctcttgg agcagggcct
     1321 ggggaaggag gggagggagt taaaggttgg ggagcctggg aggagtctgg gatagtagga
     1381 ggatgggagt ctctgacagg gtcacttcca cttcagacga caacggtgct ggagaagtac
     1441 aaggtccaca accccacccc cctgatcgta ggcagctcca ttgggggtct gttgctgctg
     1501 gcactcatca cagcggtact gtacaaagtg agtgttttat gccacccttg acaccaccag
     1561 catctggtcc cgctcttttt gcagagtgag aaggagctca ctttgaaggc agaggcacat
     1621 tcttactggg tcacttcata tgagaaactg cttcccacct gcaatgtcac cgtgccccag
     1681 tggccccctg ctttgtgatt cccaggcttc ctctaatatt tctccctttc tttcctgctc
     1741 ttctccatca ttctacgtgt tcctgacagc agattatcat ataaaagcac agacctgggt
     1801 tgaatgcgac atcaccacgg gttcttttgt cttgaccata ggccagtgtc tgctccactc
     1861 tgggccttga tttccatgtg aggtgatatc acccagctca taga
//
LOCUS       HUMINT07      149 bp ds-DNA             PRI       07-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exon
            31.
ACCESSION   M29487 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     7 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1, and cell line HL-60, cDNA
            to mRNA, clone lambda-X47.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 44 to 149)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 149; revises [1])
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   Unpublished (1989) Center for Blood Res., Boston MA 02115
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +    44      148     integrin alpha subunit precursor, exon 31
    matp    +    44      145     integrin alpha subunit
    pre-msg <     1  >   149     P150,95 mRNA and introns
    IVS     <     1       85     P150,95 intron AC
BASE COUNT       40 a     41 c     37 g     31 t
ORIGIN      Chromosome 16p11-13.1.
        1 actgaatggg cttcctgagt ttcttcttcg tcctcccccc taggttggct tcttcaagcg
       61 tcagtacaag gaaatgatgg aggaggcaaa tggacaaatt gccccagaaa acgggacaca
      121 gacccccagc ccgcccagtg agaaatgat
//
LOCUS       ECOKDPDE     3600 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli transmembrane protein (kdpD) and cytoplasmic protein (kdpE)
            genes, complete cds.
ACCESSION   M36066
KEYWORDS    cytoplasmic protein; kdpD gene; kdpE gene; transmembrane protein.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3600)
  AUTHORS   Walderhaug,M.O., Daniel,J.M., Hesse,J. and Epstein,W.
  TITLE     KdpD and KdpE, regulatory proteins for the kdp operon of
            Escherichia coli, are members of the two component sensor-effector
            class
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.O.Walderhaug, 03-JUL-1990.
            
               Author address: M.O.Walderhaug
               University of Illinois at Chicago
               Department of Microbiology and Immunology
               Box 6998, Chicago, IL 60680
FEATURES       from  to/span     description
    pept        141     2825     transmembrane protein (kdpD)
    pept       2822     3499     cytoplasmic protein (kdpE)
    mRNA         68  >  3499     kpd operon
    signal       30       36     -35 region
    signal       55       60     -35 signal
    binding     130      133     ribosome binding site
BASE COUNT      789 a    968 c   1051 g    792 t
ORIGIN      16 min on K12 map.
        1 acgcgtggcg aaagcgcgta atctcagcgt tgaacagctc acgcaactga tcgcaaaata
       61 cagccaacaa ccgctggtga aatatatcgg ccagccggtt gtcaacattg ttgaactcaa
      121 tctggcgctg gataaacttg atgaataacg aacccttacg tcccgacccc gatcgtctgc
      181 tggaacaaac tgccgcgccg catcggggga agctgaaagt tttcttcggt gcctgtgcag
      241 gcgtcgggaa gacctgggcg atgctggcag aagcccagcg actgcgggcg caagggctgg
      301 atattgtggt tggcgtggta gaaacccacg ggcgaaaaga taccgccgcc atgctggaag
      361 ggctggctgt tctgccgtta aaacgccagg cgtaccgtgg gcggcatatc agcgagtttg
      421 atctcgatgc cgccctcgcc cgccgcccgg cgctgatctt aatggacgaa ctggcgcaca
      481 gtaatgcgcc aggttcccgt catcccaaac gctggcagga tatcgaagaa ctgctggaag
      541 ctggcattga tgttttcact accgtcaacg ttcagcatct ggaaagtctg aatgatgtgg
      601 tcagcggcgt caccggaatt caggtacggg aaaccgtgcc cgatcctttt ttcgatgccg
      661 ccgacgacgt ggtgctggtg gacttgcccc cggacgatct gcgccagcgg ctgaaagaag
      721 gcaaagtcta tattgccggg caggcggagc gcgccattga acattttttc cgcaaaggta
      781 atctgatcgc cctgcgcgaa ctggcactgc gccgtactgc cgatcgcgtt gatgagcaaa
      841 tgcgcgcctg gcgggggcat cctggcgaag agaaagtgtg gcacacgcgc gacgcgatcc
      901 ttttatgcat cggccataac accggcagcg aaaaactggt ccgcgcagcg gcgcggctgg
      961 cgtcacggct gggtagcgtc tggcacgcgg tgtatgttga aacccctgcc ctgcaccgct
     1021 taccggaaaa aaaacgtcgg gcaattctca gcgccttacg tctggccgag gaactgggcg
     1081 cggagacggc aacactttct gatccagcgg aagagaaagc ggtagtgcgt tatgcccgtg
     1141 aacataatct cggcaagatt attctcggtc gcccggcctc gcgccgctgg tggcgtcggg
     1201 aaacgtttgc tgaccgactg gcgcgcatcg cccccgatct cgatcaggtg ctggtcgcgc
     1261 ttgatgaacc acccgcccgc acgattaaca acgcgccgga taaccgctct tttaaagaca
     1321 agtggcgtgt acaaattcag ggatgcgtgg ttgccgccgc gttatgcgcc gttatcacct
     1381 taattgccat gcagtggctg atggcgtttg atgccgccaa cctggtgatg ctgtatctgc
     1441 ttggcgtggt ggtggtggcg ctattttatg gacgctggcc ttcagtggtt gccaccgtca
     1501 ttaatgtagt gagtttcgat ctctttttta tcgccccacg cggcacgctc gccgtctctg
     1561 atgtgcaata tctgctgacc ttcgcggtga tgttaaccgt cgggctggtg atcgggaacc
     1621 ttactgctgg cgtgcgttat caggcgcggg tagcccgtta ccgcgagcaa cgcacacggc
     1681 acttatatga aatgtcgaaa gctctggcgg tgggccgcag tccgcaggat atcgctgcca
     1741 ccagcgaaca atttattgcc tccacgtttc atgcccgcag tcaggtgttg ttgcccgatg
     1801 acaacggtaa attgcagccg ttaacacatc cgcaaggaat gacgccgtgg gacgatgcca
     1861 tcgcgcagtg gagttttgat aaaggcctgc ctgcgggcgc gggcaccgac acgttacccg
     1921 gtgtaccgta ccagattttg ccgctaaaaa gcggcgagaa aacctacggg ctggtggtgg
     1981 tggagccggg gaatctgcgc cagttgatga tcccggaaca gcagcgcctg ctggagacgt
     2041 ttacgctgtt agtcgccaat gcccttgagc ggctgacgct aaccgccagc gaagaacagg
     2101 cgcggatggc aagcgaacgt gaacagatcc gcaacgccct gctggcggcg ctttcgcatg
     2161 atttacgcac gccgcttacg gtgctgtttg gtcaggcaga aatcttaacg ctcgatctgg
     2221 caagcgaagg atcaccccac gcccgccagg ccagcgagat ccgtcagcat gtgctgaaca
     2281 ctacccgact ggtgaataat ctactggata tggcgcgaat tcagtccggc ggctttaatt
     2341 tgaagaaaga gtggttaacg ctggaagaag tagtcggcag cgcgctgcaa atgctggaac
     2401 cgggtttatc gtcgcccatc aatctttctc tgccagaacc gctgacctta atccacgttg
     2461 acgggccact ctttgaacgg gtgctgatta atctgctgga gaacgcggtg aaatatgcgg
     2521 gtgcgcaggc cgaaattggt atcgatgccc acgttgaggg cgaaaatcta caactggatg
     2581 tctgggataa cggccccggt cttccgccag gccaggagca gacgatattt gataagtttg
     2641 ctcgcgggaa taaagagtcg gcagtaccgg gggtagggct tggactggca atttgtcggg
     2701 cgatagtgga tgtacacggg ggcactatta ccgcgttcaa ccgaccggaa ggtggtgcct
     2761 gttttcgtgt tacacttccc cagcaaactg cccctgaact tgaagaattt catgaggata
     2821 tgtgacaaac gttctgattg ttgaagatga acaggctatt cgtcgctttc tgcgcacggc
     2881 gctggagggc gacgggatgc gcgtctttga ggccgaaacg ctgcaacgcg gcttgctgga
     2941 agcggcaacc cgtaagccag atttgattat tctcgatctc ggcctgcccg atggtgatgg
     3001 gattgagttt atccgcgacc tgcgccagtg gagccgggtg ccggtgattg tgctttccgc
     3061 acgcagcgaa gagagcgaca aaatcgccgc gctggatgcc ggagcggatg attatctgag
     3121 taagccgttt ggcattggcg aattgcaggc ccgtctgcgc gtcgcattac gccgccactc
     3181 tgccaccacc gcgcccgatc cgctggtaaa attttccgat gttaccgtcg atttagccgc
     3241 ccgcgtgatt caccggggtg aggaagaggt gcatctcaca ccaattgagt tccgccttgc
     3301 tggtcgctgc tcaacaatgc cggaaaagta ctcacccagc ggcccagtcc ttaaccaggt
     3361 gtgggggcca aacgcggtcg aacacagtca ctatttgcgt atttatatgg gacatctgcg
     3421 acaaaaactg gaacaggatc ccgcccgccc acgccatttc attactgaaa ccggtattgg
     3481 ctatcggttt atgctttgaa tattaatttt aatacagcct gccttttatt aattaaagcc
     3541 gtaataataa atacggcttt ttatcttaaa caacacacaa aaataacaat tcaatatttt
//
LOCUS       KAEUREX      4761 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  K.aerogenes urease subunits A (ureA), B (ureB), C (ureC), and
            urease accessory protein (ureE, ureF and ureG) genes, complete cds.
ACCESSION   M36068
KEYWORDS    urease; urease accessory protein.
SOURCE      K.aerogenes (strain CG253) DNA.
  ORGANISM  Klebsiella aerogenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 4761)
  AUTHORS   Mulrooney,S.B. and Hausinger,R.P.
  TITLE     Sequence of the Klebsiella aerogenes urease genes and evidence for
            accessory proteins facilitating nickel incorporation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.B.Mulrooney, 03-JUL-1990.
            
                 Author address: S.B.Mulrooney
                 Michigan State University
                 Dept of Biochemistry
                 East Lansing, MI 48824
FEATURES       from  to/span     description
    pept        264      566     urease subunit A (ureA)
    pept        576      896     urease subunit B (ureB)
    pept        889     2592     urease subunit C (ureC)
    pept       2602     3078     urease accessory protein (ureE)
    pept       3080     3754     urease accessory protein (ureF)
    pept       3763     4380     urease accessory protein (ureG)
    site       4385     4424     transcription termination signal
BASE COUNT      898 a   1542 c   1452 g    869 t
ORIGIN      
        1 ctctcgccga acgtccctgg gtcggcactt tgctgtgcta tccggctacc gatgccctgc
       61 tcgacggggt gcgcgacgcg ctggcgccgc tcggtctcta cgccggcgcc agcctgaccg
      121 accgcctgct gacggtgcgt ttcctcagtg acgataatct gatttgccag cgggtgatgc
      181 gcgacgtatg gcagtttctg cgccctcatc tcaccggtaa atctcccgta cttccccgaa
      241 tctggctgac ttaagagaac gttatggaac tgaccccccg agaaaaagac aagctgttgc
      301 tgtttaccgc cgcgctggtg gcggagcgtc gcctggcccg cggcctgaag ctcaactatc
      361 cggagtccgt ggccctgatc agcgccttta ttatggaagg cgctcgggac ggcaaaagcg
      421 tggcctcgct gatggaggaa ggccgtcacg tcctgacccg cgagcaggtg atggagggcg
      481 tcccggaaat gatcccggat atccaggtcg aagccacctt cccggacggc tcgaagctgg
      541 tcaccgttca caacccgatt atctgaggta gcgccatgat ccccggtgaa tatcacgtta
      601 agcccggtca gatagccctg aataccggcc gggcaacctg tcgcgtggtc gttgagaacc
      661 acggcgatcg gccgattcag gtcggttcgc actaccattt cgccgaggtt aacccggcgc
      721 tgaagttcga ccgtcagcag gccgccggct atcgcctgaa tatcccggcg ggcacggcgg
      781 tacgctttga acccggccag aaacgcgagg tcgagctggt ggccttcgcc ggtcaccgcg
      841 ccgtcttcgg cttccgcggc gaggtcatgg gccctctgga ggtaaacgat gagtaatatt
      901 tcacgccagg cctatgccga tatgttcggc cccaccgtcg gcgacaaggt gcgcctggca
      961 gataccgagc tgtggatcga ggtggaggac gatttgacca cctacgggga agaggtcaaa
     1021 ttcggcggcg gcaaagtgat ccgcgacggc atgggccagg gacagatgct ggccgccgac
     1081 tgtgtcgacc tggtgctcac caacgcgttg atcgtcgatc actgggggat cgttaaggcc
     1141 gatatcggcg tgaaggacgg ccggatcttc gccatcggca aagccggcaa ccccgacatc
     1201 cagcccaacg tcaccatccc catcggcgct gcgacggaag tgatcgccgc cgaaggaaaa
     1261 attgtcaccg ccggcgggat cgatacccat attcactgga tctgtccgca gcaggcggaa
     1321 gaggcgctgg tctctggcgt gaccaccatg gtcggcggcg gcaccggccc ggccgcgggc
     1381 acccatgcca ccacctgcac cccgggcccg tggtatatct cacgcatgct gcaggcggcc
     1441 gacagcctgc cggtcaatat cggcctgctg ggcaagggaa acgtttctca gccggatgcc
     1501 ctgcgcgagc aggtggcggc aggcgttatt ggcctgaaga tccatgagga ctggggcgcc
     1561 accccggcgg cgatcgactg tgcgttaacc gtcgccgatg aaatggacat ccaggtcgcc
     1621 ctgcacagcg acaccctgaa tgaatccggt tttgtggaag acaccctcgc cgccatcggc
     1681 gggcgcacca tccacacctt ccataccgaa ggggccggcg gcggccatgc gccggacatc
     1741 atcaccgcct gcgcccaccc gaacattttg ccgtcgtcca ccaacccaac gctgccctac
     1801 accctcaaca ccatcgatga acatctcgat atgctgatgg tctgccacca tctggacccg
     1861 gacatcgccg aggacgtggc ctttgccgag tcgcgcattc gccgggaaac catcgctgcg
     1921 gaagacgtgc tgcacgatct cggcgccttc tcgctcacct cctccgattc gcaggccatg
     1981 ggccgcgtcg gggaagtgat tctccgcacc tggcaggtgg cgcatcgcat gaaggtgcag
     2041 cgcggagcgc tggcggagga gaccggggat aacgacaact tccgcgtgaa gcgctacatc
     2101 gccaaataca ccatcaaccc ggcgctgacc cacggcatcg cacacgaagt cggatccatt
     2161 gaggtgggta agctggctga cctcgtggtc tggtcaccag ccttcttcgg cgtgaaaccg
     2221 gccaccgtga tcaaaggcgg catgatcgcc atcgcgccga tgggcgatat caatgcctct
     2281 attccgaccc cgcagccggt gcactaccgc ccgatgtttg gcgcgctggg cagcgcccgc
     2341 catcactgcc gcctcacctt cctgtcgcag gcggcggcag ccaatggcgt tgccgagcgg
     2401 ctgaacctgc gcagcgcgat cgccgtggtg aaaggctgcc gtacggtgca gaaagccgac
     2461 atggtgcaca acagtctgca gcctaacatc accgtcgacg cccagaccta tgaggtgcgg
     2521 gtggatggcg aacttatcac cagcgagccg gcagacgttc tgccgatggc gcaacgatat
     2581 tttctgtttt aaggagagcg gatgctttat ttaactcaac gtctggagat ccccgccgcc
     2641 gcgaccgcca gcgttacgct gccgattgat gttcgcgtca aaagccgggt taaggtcacc
     2701 ctcaacgatg gccgggatgc cggcctgctg ctgccccgcg gcctgctact acgcggcggc
     2761 gatgtgctca gcaacgaaga aggcaccgag tttgtgcagg tgattgccgc tgatgaagag
     2821 gtgtcggtag tgcgctgcga cgatccgttt atgctggcga aggcctgcta ccacctcggc
     2881 aaccgtcacg tgccgctgca gatcatgccg ggcgagctgc gctaccatca cgatcacgtg
     2941 ctggacgata tgctgcgcca gttcggcctg acggtgacct ttggccagct gccgttcgag
     3001 ccggaagccg gcgcttacgc cagcgagagc cacggtcatc atcatgctca tcatgaccac
     3061 cacgctcaca gccactagca tgtcgacagc ggaacaacgc ctgcggctga tgcagctggc
     3121 cagcagcaac ctgccggtag ggggttacag ctggtcccag gggctggagt gggctgtgga
     3181 agccggctgg gtgctggacg tcgcggcctt cgagcgctgg cagcgacgcc agatgacgga
     3241 aggctttttt accgttgacc tgccgctgtt cgcccgcctg taccgcgcct gcgaacaagg
     3301 cgatatcgct gcggcccagc gctggaccgc ctatctgctg gcctgccggg aaactcgtga
     3361 actgcgggag gaagagcgca accgcggcgc ggcgtttgcc cgtctgctga gcgactggca
     3421 gccggactgt ccgccgccgt ggcgctccct gtgccagcaa agccagctcg ccgggatggc
     3481 ctggctcggc gtgcgctggc gtatcgccct gcccgagatg gccctcagcc tgggctatag
     3541 ctggattgag agcgccgtga tggccggcgt caagctggtc cccttcggcc agcaggccgc
     3601 ccagcagctg attttacgtc tttgtgacca ctacgcggcc gagatgcccc gcgcgctggc
     3661 cgcgccggac ggcgatatcg gatcggccac cccgctcgcc gccatcgcct ctgcccggca
     3721 tgaaacccaa tactctcgat tattccgttc ctaggagaag ccatgaactc ttataaacac
     3781 ccgctgcgcg tcggcgtcgg cggcccggtc ggctccggta aaaccgctct gctggaagcg
     3841 ctgtgtaaag cgatgcgcga tacctggcag ctggcggtgg tcactaacga catctatacc
     3901 aaagaagatc agcgcatcct caccgaagcg ggcgcgctgg cgcctgaacg catcgtcggt
     3961 gtggaaaccg gcggctgccc gcatacggcg atccgcgaag atgcctcaat gaacctcgcc
     4021 gccgtggaag cgctgagtga aaagttcggt aacctcgacc ttatcttcgt ggaaagcggc
     4081 ggcgataacc tgagcgccac cttcagcccg gagctggcgg atctgaccat ctacgtcatc
     4141 gatgtggccg aaggggagaa gatcccgcgc aaaggcggac cggggatcac caaatccgat
     4201 ttcctggtga tcaataaaac cgaccttgcc ccctatgtgg gcgcgtcgct ggaggtgatg
     4261 gcgagcgata cccagcgtat gcgcggcgat cgcccatgga ccttcaccaa tctgaagcag
     4321 ggcgacggcc tgagcaccat tatcgccttc ctcgaagaca aaggcatgct tggcaaatag
     4381 gcctgttgca ccagccgggc gcaagcgcgt ccggatggtg cagcctctgc tctctcttta
     4441 tatcatcctg cctccacctc cgcgccacgc ctgccctgca atatggcata aggtttgcta
     4501 attcaagtca tgcctaacca ttaaggaatg actatgtcat cactggatct taaccctgaa
     4561 ttacccgcga caacgcggac ttccggtacc cgggaaacct tagaagatta caccttacgt
     4621 tacgccccgc tgagcttccg ccgctggggt ccgggcgtcg tcgcggtcac cgcgctgggc
     4681 ggcatcgcct atctggccga cttttccatc ggcgccagca tcggtatggc ctggggcacc
     4741 agcaacgcca tctattcgat c
//
LOCUS       MUSTCP1X     1714 bp ss-mRNA            ROD       07-AUG-1990
DEFINITION  Mouse t-complex protein (Tcp-1x) mRNA, 3' end.
ACCESSION   M35797
KEYWORDS    t-complex protein.
SOURCE      Mouse (strain C57BL/6J) 8.5 day-old embryo, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1714)
  AUTHORS   Dudley,K., Shanahan,F., Burtenshaw,M., Evans,E.P., Ruddy,S. and
            Lyon,M.F.
  TITLE     Isolation and characterisation of a cDNA clone corresponding to the
            mouse t-complex gene Tcp-1x
  JOURNAL   Genet. Res. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Dudley, 29-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1      875     t-complex protein (Tcp-1x)
BASE COUNT      454 a    387 c    445 g    428 t
ORIGIN      Chromosome 17, A2/A3 junction.
        1 taaaacgacg gccagtgaat tcatgtctta cgggcaagga ggatatgagc aagcctcctc
       61 acttgactca cctgagaaca ggagtcagga tgggtgaggt gccactggct gacagcatac
      121 tctgtgatgg tctgacagat gcctttcaca actaccacat gggcatcaca gctgaaaacg
      181 tagccaaaaa atggcaagtg agcagagagg cccaggacaa ggttgcagtt ctgtcacaga
      241 acagggcaga gactgctcag aaagctggcc actttgacaa ggagattgtg ccagtgctgg
      301 tgtcttccag aaaaggtctt actgaagtga aaatcgatga atttccccgt catgggagta
      361 accttgaagc catgggaacg ctgaaacctt actttcttac tgatgggaca ggaactgtca
      421 ccccagcgaa cgcatcagga atgaacgatg gcgctgctgc tgtggtcctt atgaagaaga
      481 cagaagctga gaggcggatg ctgaaacctt tagcaagaat agtctcctgg tcccaagccg
      541 gtgtggagcc ttctgttatg ggagtaggac caattccagc cataaagcaa gctgttgcaa
      601 aggcaggctg gtccctggag gatgttgacc tgtttgaaat caatgaagcc tttgcagcgg
      661 tgtctgcggc aatagctaaa gaacttggat taaaccccgg gaaggtcaac atcgatggag
      721 gagccattgc cttgggccat cctctgggag catctggctg caggattcta gtgaccttgc
      781 tgcacaccct ggagagagtg ggcgggaccc gtggtgttgc agccctgtgc attgggggtg
      841 ggatgggagt cgcaatgtgt gttcagagag ggtgaactgc atgacagcca ccaccttgga
      901 cagttcttgt taaatcagtg caacactaag cagaaaggga aaccagagga ccaaactgag
      961 gacaggaacc caggtggaca gcttgctgta ctttaatgtg agacacccaa ggctacggca
     1021 ttcgacctga cactgttgta aataagaggg aaatccaatc agtcatcaag ggctccagag
     1081 tgaacagcat tttcataact tccatgttta tcgtctttca tttctggtga tttctctgat
     1141 cattgttttg tttttgtttc ttgtttgagt taggctacat atatgcctca aaatcgtttt
     1201 gtagatcagg ttgactttga actcccggag atccaccttt ctgtgcctcc tgaatacttg
     1261 aattaaaggc atgcaccatt atggcagggc ccccaaaatg atccattcaa aacatgggga
     1321 ttcggcttct gttacgaact tcaggctctt cactttaact atggttggtt caaacgcccc
     1381 tgcgtgcctg tctctcggtt tcccattgat caaatcaaga ccaatcctgt agtgaaagtc
     1441 ggattcagtg agcctcttcc aggctgaggt ggggtaggta ttccagctgt tctagtcaga
     1501 agactggcag tgccaacaag tgtgcctctg gattgttgaa aggtgggcag caagcagcat
     1561 ccacgtactt atttctctgc ttgacgggat gggatgttac ctgcttcaga tatctacctt
     1621 gacttcccca aaatgatgac tggaactgga actgacagtc aaataaactc tttcctccct
     1681 taaaaaaaaa aaccttgacc gtaagacatg aatt
//
LOCUS       YSCAER2      2551 bp ds-DNA             PLN       07-AUG-1990
DEFINITION  S.cerevisiae heme-regulated and catabolite repressed genes
            repressor (AER2) gene, complete cds.
ACCESSION   M35861
KEYWORDS    catabolite repressed gene repressor; heme-regulated gene repressor.
SOURCE      S.cerevisiae.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2551)
  AUTHORS   Zhang,M., Rosenblum-Vos,L.S., Lowry,C.V., Boakye,K. and
            Zitomer,R.S.
  TITLE     A yeast protein with homology to the B-subunit of G-proteins is
            involved in repression of heme-regulated and catabolite repressed
            genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Zitomer, 02-JUL-1990.
            
              Author address: R.S.Zitomer
              Department of Biological Sciences
              State University of New York
              Albany, NY 12222
FEATURES       from  to/span     description
    pept        247     2388     repressor AER2
BASE COUNT      783 a    628 c    488 g    652 t
ORIGIN      
        1 aaaacaaaat aacaagtaca taagtaaaca aatagagaga aattccccca tctaccccca
       61 attgataaac atcagcgaag caagaacaac tggctgaaca cgtgcccctc tatctgtcct
      121 tctgcaattg tttaattttc tttcacgcca agttaccttt cgcacaccac tactcttttt
      181 ctattgtttt tttgtctttt ttgataagca ggggaagaaa gaaatcagct ttccatccaa
      241 accaatatga ctgccagcgt ttcgaatacg cagaataagc tgaatgagct tctcgatgcc
      301 atcagacagg agtttctcca agtctcacaa gaggcaaata cctaccgtct tcaaaaccaa
      361 aaggattacg atttcaaaat gaaccagcag ctggctgaga tgcagcagat aagaaacacc
      421 gtctacgaac tggaactaac tcacaggaaa atgaaggacg cgtacgaaga agagatcaag
      481 cacttgaaac tagggctgga gcaaagagac catcaaattg catctttgac cgtccagcaa
      541 cagcggcaac agcaacagca gcaacaggtc cagcagcatt tacaacagca acagcagcag
      601 ctagccgctg catctgcatc tgttccagtt gcgcaacaac caccggctac tacttcggcc
      661 accgccactc cagcagcaaa cacaactact ggttcgccat cggccttccc agtacaagct
      721 agccgtccta atctggttgg ctcacagttg cctaccacca ctttgcctgt ggtgtcctca
      781 aacgcccaac aacaactacc acaacagcaa ctgcaacagc agcaacttca acaacagcaa
      841 ccacctcccc aggtttccgt ggcaccattg agtaacacag ccatcaacgg atctcctact
      901 tctaaagaga ccactacttt accctctgtc aaggcacctg aatctacgtt gaaagaaact
      961 gaaccggaaa ataataatac ctcgaagata aatgacaccg gatccgccac cacggccacc
     1021 actaccaccg caactgaaac tgaaatcaaa cctaaggagg aagacgccac cccggctagt
     1081 ttgcaccagg atcactactt agtcccttat aatcaaagag caaaccactc taaacctatc
     1141 ccacctttcc ttttggatct agattcccag tctgttcccg atgctctgaa gaagcaaaca
     1201 aatgattatt atattttata caacccggca ctaccaagag aaattgacgt tgagttacac
     1261 aaatctttgg atcatacttc agttgtttgt tgcgtgaagt tcagtaacga tggtgaatac
     1321 ttagccacag gctgcaacaa aactactcaa gtgtatcgcg tttcagatgg ttctctggtg
     1381 gcccgtctat ctgacgattc tgctgccaat aaccatcgaa attcgatcac tgaaaataac
     1441 accaccacgt ccacggataa caatacaatg acaaccacta ctaccaccac aattactacc
     1501 acagcgatga cttcggcagc agaattggca aaagatgtgg aaaacctgaa cacttcgtct
     1561 tccccatcat ccgacttgta tatccgttca gtgtgttttt ctccagatgg gaaatttttg
     1621 gcaacaggtg ctgaagacag actgattaga atttgggata ttgaaaatag aaagattgtt
     1681 atgattcttc aaggccacga acaagatatt tattcattgg actactttcc ctcaggtgac
     1741 aaattagtct ccggttctgg tgaccgtacc gttcgtattt gggacttacg tacaggccag
     1801 tgttcattga ctttatccat tgaagatggt gttaccaccg tcgctgtatc accaggtgat
     1861 ggtaaataca tcgctgctgg ttctctagat cgtgctgtga gagtttggga ttccgagacc
     1921 ggattcttgg tggaaagact agattcggaa aacgaatccg gtacaggcca caaggactct
     1981 gtttatagcg ttgtcttcac tagagatgga caaagcgttg tatccggctc attagataga
     2041 tctgttaagc tctggaattt gcagaatgca aacaacaaga gcgattcgaa aactccaaat
     2101 tccggcactt gtgaagttac gtatatcggg cataaagact ttgtattgtc cgtggccacc
     2161 acacaaaatg atgagtacat cttgtccggt tccaaagatc gtggtgtcct gttttgggat
     2221 aagaaatccg gcaatccgtt attgatgttg caaggtcata ggaattcagt tatatctgtg
     2281 gctgtggcaa acgggtctcc gctgggtcca gaatataacg tttttgctac tggtagcggt
     2341 gattgtaaag caaggatttg gaagtataaa aaaatagcgc caaattaaga acagaacaca
     2401 aaaggaacac tttacaaatg taactaacta aactattctt caattcattg tcattatatc
     2461 tttctctcta ttctttttta tctacttatt tgttatgtta tgatccaggt acgctttaaa
     2521 acaaacagtt actttcctca ccattcatcc c
//
LOCUS       ALFHISH3A    1089 bp ds-DNA             PLN       07-AUG-1990
DEFINITION  Alfalfa histone H3 (H3-1.1) gene, complete cds.
ACCESSION   M35867
KEYWORDS    histone H3.
SOURCE      Alfalfa (L. cv. Nagyszenasi) DNA, clone lambda-ALH3-1.
  ORGANISM  Medicago sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 1089)
  AUTHORS   Wu,S.C., Boegre,L., Vincze,E., Kiss,G.B. and Dudits,D.
  TITLE     Isolation of an alfalfa histone H3 gene: Structure and expression
  JOURNAL   Plant Mol. Biol. 11, 641-649 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        285      695     histone H3 (H3-1.1)
    signal       93       97     CAAT box
    signal      168      174     TATA box
    signal      817      826     polyA signal
    signal      963      968     polyA signal
    signal     1008     1013     polyA signal
    site          2        6     GATCC motif
    site         38       43     ACGTCA consensus
    site         46       50     GATCC motif
    site         53       46 (c) CGCGGATC octamer
BASE COUNT      305 a    271 c    178 g    335 t
ORIGIN      
        1 ggatcctcat cacacaaaca aaacacatcc acacgccacg tcatcgatcc gcgtgtcgca
       61 aatactccaa ataaacgaca cccgtcgatt aaccaataat caacggccac aattacaccc
      121 cattcaccca cttctcaaat ttcaaaaccc gcacaaaatc atatcactat ataactcacc
      181 cctttcatct cttcttcctc atcactctca attcttcaaa gcacaaaaac aaccttcaag
      241 tttctctgtt tgatactgtt cttttcagtt atttttcata atcaatggca cgtaccaagc
      301 aaaccgctcg caaatccacc ggtggcaaag ctccaaggaa gcaactcgcc acaaaagccg
      361 ctcgcaaatc tgctccggcc accggcggag tgaagaaacc tcaccgtttc aggccaggaa
      421 ccgtcgctct ccgtgagatc cgcaagtacc agaagagcac tgagctcctc atccgcaaac
      481 tccccttcca gcgtcttgtc cgtgagatcg ctcaggattt caagactgat ctccgtttcc
      541 agagctccgt cgtgtcggct ttgcaagaag cggccgaggc ttatctcgtc ggtctctttg
      601 aggatactaa cctctgcgcc attcatgcta agcgtgtcac tatcatgcct aaggatatcc
      661 agctcgctag gcgtatccgt ggcgagcgtg cttgatcttg ttgattcgct ttgttagggt
      721 ttgtgtagat aggttcatga tgtagttaaa tcacaaaccg ttgctataag tttctctatg
      781 gattttgtta tattgtaatg tgcttaacgc ttaatcaatg aaatcgatca tcttttgtta
      841 aactctttgt tcaattactt atgctttttt tttatctttt cttaacccta attttctgtc
      901 attttattac actttccgaa cttttgttat ccctaattgg atttgaaatc aaaattaggg
      961 ttgataaagg catattgtat aatgttgaaa ttcttgttaa atatattgat aaatacaatt
     1021 atttcaaagt aaaaattaaa tgctgttgat tacatcttga aatgtaaaaa taattgcatg
     1081 cataagctt
//
LOCUS       APHYEPPC      400 bp ds-DNA             SYN       07-AUG-1990
DEFINITION  Foot-and-mouth disease virus/SP6 promoter recombinanant DNA.
ACCESSION   M35873
KEYWORDS    .
SOURCE      Foot-and-mouth disease virus (strain O1K), cDNA to viral RNA, clone
            pFMDV-YEP-polyC.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 400)
  AUTHORS   Zibert,A., Maass,G., Strebel,K., Falk,M.M. and Beck,E.
  TITLE     Infectious foot-and-mouth disease virus derived from a cloned full-
            length cDNA
  JOURNAL   J. Virol. 64, 2467-2473 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    RNA          18      396     RNA
    signal        1       17     SP6 promoter
    recomb       29       30     plasmid DNA end/foot-and-mouth disease DNA
                                 start
BASE COUNT       68 a    122 c    112 g     98 t
ORIGIN      
        1 atttaggtga cactatagaa tacaagctct tgaaaggggg cattagggtc tcacccctag
       61 taagccaacg acagtccctg cgttgcactc cacacttacg ttgtacacac gcgggacccg
      121 atgggctatc gttcacccac ctacagctgg actcacggcg ccgcgtggcc atttagctgg
      181 attgtgcgga cgaacacgct tgcgcacctc gcgtgaccgg ttagtactct taccactctc
      241 cgcctacttg gtcgttagcg ctgtcttggg cattcctgtg ggggccgttc gacgctccac
      301 gggaactctc ctgtgtgaca tctacggtga tggggccgtt tcgcgtgggc tggtcgtttg
      361 gactgcttcg gctgtcaccc ggcgcccgcc tttcagctag
//
LOCUS       CYNAAAGFP     282 bp ss-mRNA            SYN       07-AUG-1990
DEFINITION  C.tetragonoloba alpha-amylase/alpha-galactosidase fusion protein
            gene, 5' end.
ACCESSION   M35837
KEYWORDS    alpha-amylase; alpha-galactosidase; fusion protein.
SOURCE      C.tetragonoloba and plasmid DNA, clone pUR2601.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 282)
  AUTHORS   Overbeeke,N., Termorshuizen,G.H.M., Giuseppin,M.L.F.,
            Underwood,D.R. and Verrips,C.T.
  TITLE     Secretion of the alpha-galactosidase from Cyamopsis tetragonoloba
            (Guar) by Bacillus subtilis
  JOURNAL   Appl. Environ. Microbiol. 56, 1429-1434 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         88  >   282     alpha-amylase/alpha-galactosidase fusion
                                 protein
    binding      77       80     ribosomal binding site (put.)
    recomb      180      181     alpha-amylase end/alpha-galactosidase start
BASE COUNT       88 a     56 c     70 g     68 t
ORIGIN      
        1 gctgcaggta accggatccg aattcccggg gatccgtcct gatatgtaaa atataatttg
       61 tataagaaaa tgagagggag aggaaacatg attcaaaaac gaaagcggac agtttcgttc
      121 agacttgtgc ttatgtgcac gctgttattt gtcagtttgc cgattacaaa aacatcagcc
      181 gcggaaaacg gtttgggtca aaccccacca atgggttgga actcctggaa ccacttcggt
      241 tgtgacatta acgaaaacgt tgtcagagaa acagctgatg ca
//
LOCUS       HSODR3E       111 bp ds-DNA             VRL       07-AUG-1990
DEFINITION  Herpesvirus papio 3' dyad region DNA.
ACCESSION   M35835
KEYWORDS    dyad region.
SOURCE      Herpesvirus papio DNA, from baboon B-cell.
  ORGANISM  Herpesvirus papio
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Loeb,D.D., Sung,N.S., Pesano,R.L., Sexton,C.J.,
            Hutchison,C.H.III. and Pagano,J.S.
  TITLE     Plasmid origin of replication of herpesvirus papio: DNA sequence
            and enhancer function
  JOURNAL   J. Virol. 64, 2876-2883 (1990)
  STANDARD  simple staff_entry
BASE COUNT       29 a     30 c     28 g     24 t
ORIGIN      
        1 ggacagcaca tactgcctcg cagatagcat ataccgccgg catgatagca tatgctaccc
       61 agatggatag catacgctac ctcccgggta gtatttgcta tccgggtaag g
//
LOCUS       HUMMHDQB3A    552 bp ss-mRNA            PRI       07-AUG-1990
DEFINITION  Human MHC class II DQ3.1ER (DR4) mRNA, partial cds.
ACCESSION   M35730
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human homozygous cell line ER, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 552)
  AUTHORS   Holbeck,S.L. and Nepom,G.T.
  TITLE     Molecular analysis of DQ-beta-3.1 genes
  JOURNAL   Hum. Immunol. 21, 183-192 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   552     MHC class II DQ3.1ER protein (AA at 3)
                                 /hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1"
                                 /map="6p21.3"
    site        269      270     MHC class II DQ3-beta-1 exon end/MHC class II
                                 DQ3-beta-2 exon start
BASE COUNT      128 a    165 c    162 g     97 t
ORIGIN      Chromosome 6p21.3.
        1 aggatttcgt gtaccagttt aaggccatgt gctacttcac caacgggacg gagcgcgtgc
       61 gttatgtgac cagatacatc tataaccgag aggagtacgc acgcttcgac agcgacgtgg
      121 aggtgtaccg ggcggtgacg ccgctggggc cgcctgacgc cgagtactgg aacagccaga
      181 aggaagtcct ggagaggacc cgggcggagt tggacacggt gtgcagacac aactaccagt
      241 tggagctccg cacgaccttg cagcggcgag tggagcccac agtgaccatc tccccatcca
      301 ggacagaggc cctcaaccac cacaacctgc tggtctgctc agtgacagat ttctatccag
      361 cccagatcaa agtccggtgg tttcggaatg accaggagga gacaaccggc gttgtgtcca
      421 ccccccttat taggaacggt gactggacct tccagatcct ggtgatgctg gaaatgactc
      481 cccagcatgg agacgtctac acctgccacg tggagcaccc cagcctccag aaccccatca
      541 ccgtggagtg gc
//
LOCUS       HUMMHDQB3B    530 bp ss-mRNA            PRI       07-AUG-1990
DEFINITION  Human MHC class II DQ3.1LUY (DR8) mRNA, partial cds.
ACCESSION   M35731
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human homozygous cell line LUY, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 530)
  AUTHORS   Holbeck,S.L. and Nepom,G.T.
  TITLE     Molecular analysis of DQ-beta-3.1 genes
  JOURNAL   Hum. Immunol. 21, 183-192 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept.ps <     1  >   530     MHC class II DQ3.1LUY protein (AA at 3)
                                 /hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1"
                                 /map="6p21.3"
    site        248      249     MHC class II DQ3-beta-1 exon end/MHC class II
                                 DQ3-beta-2 exon start
BASE COUNT      124 a    157 c    154 g     95 t
ORIGIN      Chromosome 6p21.3.
        1 aggatttcgt gtaccagttt aaggccatgt gctacttcac caacgggacg gagcgcgtgc
       61 gttatgtgac cagatacatc tataaccgag aggagtacgc acgcttcgac agcgacgtgg
      121 aggtgtaccg ggcggtgacg ccgctggggc cgcctgacgc cgagtactgg aacagccaga
      181 aggaagtcct ggagaggacc cgggcggagt tggacacggt gtgcagacac aactaccagt
      241 tggagctctg gagcccacag tgaccatctc cccatccagg acagaggccc tcaaccacca
      301 caacctgctg gtctgctcag tgacagattt ctatccagcc cagatcaaag tccggtggtt
      361 tcggaatgac caggaggaga caaccggcgt tgtgtccacc ccccttatta ggaacggtga
      421 ctggaccttc cagatcctgg tgatgctgga aatgactccc cagcatggag acgtctacac
      481 ctgccacgtg gagcacccca gcctccagaa ccccatcacc gtggagtggc
//
LOCUS       LGILG106NL    378 bp ss-mRNA            PLN       07-AUG-1990
DEFINITION  L.gibba negatively light-regulated protein (Lg106) mRNA, 3' end.
ACCESSION   M35866
KEYWORDS    negatively light-regulated protein.
SOURCE      L.gibba (L. G-3), cDNA to mRNA, clone pLg106.
  ORGANISM  Lemna gibba
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Arecidae; Arales; Lemnaceae.
REFERENCE   1  (bases 1 to 378)
  AUTHORS   Okubara,P.A., Flores,S. and Tobin,E.M.
  TITLE     Characterization of a negatively light-regulated mRNA from Lemna
            gibba
  JOURNAL   Plant Mol. Biol. 11, 673-681 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      261     ORF1 (AA at 1)
    pept    <     1       82     ORF2 (AA at 2)
    pept    <     1       62     ORF3 (AA at 3)
BASE COUNT       96 a     96 c    102 g     84 t
ORIGIN      
        1 gacggcggat tattacccaa aaagcctcct ctcatttcca aggaccacga gcgagccttt
       61 gactccgctg actgggcatt aggaaagcaa ggaggccccg tggagaagcc gaagcctctg
      121 gaggcgctac ggccgaaact ccagccaact cagcagcaag ttcgcttgag atcggtttat
      181 gctcctgccg agggcgaaga ttggatccag cggcgcgaca gaagagatgg gcaacgagtg
      241 agtcgcctcg gaggaatgta gtatccatcc tatatctctc tctccgattc ctgaataaaa
      301 cgatctattt tgtggcatct cctgaagaac cctaattgat tagtagttga ggtttgacct
      361 tgggtcaacg gatcagat
//
LOCUS       MUSIGHADM     360 bp ss-mRNA            ROD       07-AUG-1990
DEFINITION  Mouse Ig heavy chain mRNA V-region, partial cds.
ACCESSION   M26808
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain C57BL/6) hybridoma line B1-8.V4, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Brueggemann,M., Mueller,H.-J., Burger,C. and Rajewsky,K.
  TITLE     Idiotypic selection of an antibody mutant with changed hapten
            binding specificity, resulting from a point mutation in position 50
            of the heavy chain
  JOURNAL   EMBO J. 5, 1561-1566 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   360     Ig heavy chain V-region (AA at 1)
BASE COUNT       89 a     92 c     99 g     80 t
ORIGIN      Chromosome 12.
        1 caggtccaac tgcagcagcc tggggctgag cttgtgaagc ctggggcttc agtgaagctg
       61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg
      121 cctggacgag gccttgagtg gattggaggg attgatccta atagtggtgg tactaagtac
      181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aaccctccag cacagcctac
      241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagatacgat
      301 tactacggta gtagctactt tgactactgg ggccaaggca ccactctcac agtctcctca
//
LOCUS       PLYJCRR       270 bp ds-DNA             VRL       07-AUG-1990
DEFINITION  Polyomavirus JC regulatory region DNA.
ACCESSION   M35834
KEYWORDS    regulatory region.
SOURCE      Polyomavirus JC DNA, from human progressive multifocal
            leukoencephalopathy patient CY urine.
  ORGANISM  Polyomavirus JC
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Polyomaviruses.
REFERENCE   1  (bases 1 to 270)
  AUTHORS   Yogo,Y., Kitamura,T., Sugimoto,C., Ueki,T., Aso,Y., Hara,K. and
            Taguchi,F.
  TITLE     Isolation of a possible archetypal JC virus DNA sequence from
            nonimmunocompromised individuals
  JOURNAL   J. Virol. 64, 3139-3143 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        268  >   270     late leader protein
    orgrpl       12  <     1 (c) origin of DNA replication
    signal       15       29     TATA box
BASE COUNT       78 a     66 c     77 g     49 t
ORIGIN      
        1 gcctcggcct cctgtatata taaaaaaaag ggaaggtagg gaggagctgg ctaaaactgg
       61 atggctgcca gccaagcatg agctcatacc tagggagcca accagctgac agccagaggg
      121 agccctggct gcatgccact ggcagttata gtgaaacccc tcccatagtc cttaatcaca
      181 agtaaacaaa gcacaagggg aagtggaaag cagccagggg aacatgtttt gcgagccaga
      241 gctgttttgg cttgtcacca gctggccatg
//
LOCUS       PSEIAAMH     3677 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  P.syringae tryptophan 2-monooxygenase (iaaM) and indoleacetamide
            hydrolase (iaaH) genes, complete cds.
ACCESSION   M11035 M35690
KEYWORDS    iaaH gene; iaaM gene; indoleacetamide hydrolase; indoleacetic acid;
            indoleacetic acid operon promoter region;
            tryptophan 2-monooxygenase; tryptophan-2-oxyreductase.
SOURCE      P.syringae savastanoi (strain EW2009) DNA, clones pLUC2, pCP3, and
            pCJP12.
  ORGANISM  Pseudomonas syringae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 202 to 3677)
  AUTHORS   Yamada,T., Palm,C.J., Brooks,B. and Kosuge,T.
  TITLE     Nucleotide sequences of the Pseudomonas savastanoi indoleacetic
            acid genes show homology with agrobacterium tumefaciens T-DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6522-6526 (1985)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 312)
  AUTHORS   Gaffney,T.D., da Costa e Silva,O., Yamada,T. and Kosuge,T.
  TITLE     The indoleacetic acid operon of Pseudomonas syringae subsp.
            savastanoi: Transcription analysis and promoter identification
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  simple staff_entry
COMMENT     The iaaH and iaaM genes are part of an operon that is borne on a
            plasmid, pIAA, in oleander strains of the pathogen.  There appears
            to be no genetic transformation of host tissue by P.savastanoi.  No
            Pribnow boxes were found, though potential ribosome binding sites
            are located at positions 435-439 and 2164-2168.  Expression of iaaH
            depends upon the promoter for iaaM.
            
            Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Yamada, 14-JAN-1986.
            
            Draft entry and computer-readable sequence for [2] kindly submitted
            by T.D.Gaffney, 28-JUN-1990.
FEATURES       from  to/span     description
    pept        454     2127     tryptophan 2-monooxygenase (EC 1.13.12.3)
    pept       2170     3537     indoleacetamide hydrolase
    pept         82      312     promoter proximal ORF
    mRNA         54  >  3537     indoleacetic acid mRNA
    site         19       47     indoleacetic acid operon promoter
    binding     577      651     FAD binding site
BASE COUNT      813 a    917 c   1046 g    901 t
ORIGIN      1 bp upstream of EcoRI site.
        1 ccacaaaccg gcaagacgtt gaaaatcaat ttcaagcatt gtatgttttt ggtgggacgc
       61 cacttggaaa atctggtgct gatgcccgga cggactcctg cgaactggct ggacaaggcg
      121 atggtggcag tggcccgaat cagacagcgc aagccccagg cggcagtacc cgcggcgctc
      181 ctccaagccc cgcacgctat ggaattcgtt tggagcacca ccaaaacctt taccgaatgc
      241 cattgtccgg cttacacccc ctatacgcaa gccagtgcgt ggtcggccag cgcggtcgtt
      301 agtgcttatt aattttagag gactagcatg gtttgaagga atggttcagg tgttttggat
      361 tttggcggct gccgttaaaa ggtgctgttt cagtttcatt taaagaatcg taatccgggt
      421 agcacgtaag gtcgaagagg gtagagtcgc gttatgtatg accattttaa ttcacccagt
      481 attgatattt tgtacgacta cggtcccttt ctgaaaaaat gtgaaatgac gggaggcata
      541 ggcagctatt cagccggaac gcccacccct cgggtagcga tagtcggtgc cggcatcagt
      601 gggctggtcg ctgcaactga actattacgt gcgggagtca aggacgttgt cttatatgaa
      661 tcgcgtgatc gaatcggggg acgggtatgg tctcaagttt tcgatcagac tcgtccacgt
      721 tacattgcag aaatgggtgc gatgcgcttt cctcccagtg caactggcct tttccactac
      781 ctgaaaaagt ttggtatttc gacgtcgacc acctttccgg atcctggtgt ggtggacacg
      841 gagctgcatt accgtggcaa gcgctatcac tggccagcgg gcaaaaagcc gcccgaatta
      901 ttcaggcgag tctatgaggg gtggcagtct ctattgtccg aaggttacct ccttgaaggc
      961 ggttctttag ttgccccgct ggacattacc gcaatgctga agtcgggtcg tctggaagag
     1021 gcagcgatcg catggcaggg atggctcaat gtattccggg attgttcatt ctataacgcg
     1081 attgtctgta tttttactgg ccgccatccg ccaggcggcg acagatgggc tcgtcctgaa
     1141 gactttgagc tgtttggctc gcttggcata ggctcgggcg ggtttttgcc agtctttcag
     1201 gctggcttta cggaaatact gcggatggtt atcaacggat accaaagtga ccagcgactg
     1261 attccggacg ggatatccag tctggccgcg agactcgctg atcagtcgtt tgacggcaaa
     1321 gcgttaaggg accgcgtttg ttttagccgg gtaggtcgca tttccagaga agctgaaaaa
     1381 atcatcatcc agacggaagc aggagaacag cgtgtatttg atcgagtaat tgtcactagc
     1441 agtaatcggg ccatgcaaat gattcactgc ctcacggata gcgagagctt tctgagtcgt
     1501 gatgtcgctc gtgctgtccg cgaaacccat ctgacaggtt catcgaagct tttcattctc
     1561 acccgaacca aattctggat aaaaaacaag cttcccacca ccatccagtc ggacggtctg
     1621 gtgcgcggcg tctattgtct ggattatcag cccgatgaac ctgaggggca tggcgttgtt
     1681 ctgctcagtt acacgtggga agacgacgct caaaaaatgc tggcgatgcc tgacaagaaa
     1741 acgcgttgcc aggtactggt tgatgacctt gctgcgatac acccgacgtt cgccagttat
     1801 ctcctgcccg ttgatgggga ttatgagcgg tatgtattgc accatgactg gctcaccgat
     1861 ccccattctg cgggcgcttt caaactcaat tatcccggcg aggacgttta ctcgcagcga
     1921 ttgttttttc aaccaatgac agcgaacagt cccaataaag acacggggct ctatctggct
     1981 ggctgcagtt gctcttttgc cggagggtgg atcgaaggtg ctgtccagac agcattgaac
     2041 agtgcttgcg cggtgctgcg cagcaccgga gggcaactgt caaaaggcaa cccgctggac
     2101 tgtatcaacg cctcctatcg ctattaacag tcctgagagc agcgctaagc taatacgggt
     2161 gaaaagagca tgcatgaaat aatcacactt gaatcgctgt gccaggcatt ggctgatggc
     2221 gagattgccg ctgcggagct gcgcgagcgt gcgctcgata ctgaggcccg cttggcccgt
     2281 ttgaactgtt tcatacgtga gggtgatgcg gtcagccagt ttggcgaggc tgatcatgca
     2341 atgaagggta ctccactctg gggaatgccg gtttccttca aggataatat ctgtgttcgc
     2401 ggtttgccgt tgacagccgg gacgcgagga atgtccggtt tcgtatctga tcaggatgct
     2461 gcgattgtca gtcaactcag agccctcggg gctgtggtcg ctggcaagaa caacatgcac
     2521 gaactcagtt tcggagtgac ctccatcaat cctcattggg gaaccgtggg aaaccccgtg
     2581 gcccccggtt attgtgccgg aggtagtagt ggtggaagtg ccgccgcagt ggcaagtgga
     2641 attgttccgc tgtcggtggg gaccgacacg gggggctcga taagaatacc ggcggccttc
     2701 tgcggcatta cgggcttcag acccactact ggacgctggt caacggcagg cattatccct
     2761 gtttctcata caaaggattg cgttgggttg ctgacacgaa cggcaggcga cgccggattt
     2821 ttgtacggac tgttgtcagg caagcagcag tcttttcccc tgagcaggac ggccccctgt
     2881 cgaatcggcc taccggtctc catgtggtcc gatctggatg gcgaggtgga gagggcatgc
     2941 gtgaatgcac tcagcctgct gcgcaagaca gggttcgagt ttattgaaat tgatgatgcc
     3001 gatattgtcg aactgaacca gacactcacg ttcaccgttc cgctttacga attctttgcc
     3061 gaccttgctc agtcattgct ctccttgggc tggaagcacg gtatccatca tatttttgca
     3121 caggttgatg atgccaacgt gaaaggcatc atcaatcacc accttgggga gggcgctata
     3181 aaaccggctc actatttgag ttcactgcaa aatggcgagc tgttgaaaag aaaaatggat
     3241 gagctatttg ctcgtcataa tatcgagctc ctggggtacc ccacggtacc ttgccgggta
     3301 cctcatctgg atcatgctga ccgaccagag tttttcagtc aggcaattcg caataccgac
     3361 ctggccagca atgcgatgct cccctccatt actattcctg tcgggccaga agggcgattg
     3421 ccggtcggct tgagctttga tgctctaaga gggcgagacg cacttttgct gagcagagtc
     3481 agtgccatag agcaggtatt aggttttgta cgaaaagtat tgccgcatac aacgtaggct
     3541 acaggccagc gtgaccatgg ctgcgtagct cttggccagc ttgtcgtagc gggtgccgat
     3601 tcggcggttc tcttttagcc agccaaacat ccgctcaatg atgttgcgct gccgatactt
     3661 tggacgatcg aagagtc
//
LOCUS       SVCG3E         68 bp ss-RNA             VRL       07-AUG-1990
DEFINITION  Spring viremia of carp virus genome, 3' end.
ACCESSION   M35836
KEYWORDS    .
SOURCE      Spring viremia of carp virus RNA.
  ORGANISM  Spring viremia of carp virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae.
REFERENCE   1  (bases 1 to 68)
  AUTHORS   Roy,P., Gupta,K.C. and Kiuchi,A.
  TITLE     Characterization of Spring viremia of carp virus mRNA species and
            the 3' sequence of the viral RNA
  JOURNAL   Virus Res. 1, 189-202 (1984)
  STANDARD  simple staff_entry
BASE COUNT       20 a     13 c     10 g     25 t
ORIGIN      
        1 tgcttctgtt tatttggtaa ctattgtcaa tgtaccaaat aaacagacca ttgtctgtag
       61 tactcaca
//
LOCUS       TTHSUCDA      880 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  T.aquaticus B succinyl-CoA synthetase alpha-subunit (sucD) gene,
            complete cds.
ACCESSION   M35832 X12815
KEYWORDS    sucD gene; succinyl-CoA synthetase alpha-subunit.
SOURCE      T.aquaticus (strain B (NCIB 11247)) DNA.
  ORGANISM  Thermus aquaticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 880)
  AUTHORS   Nicholls,D.J., Sundaram,T.K., Atkinson,T. and Minton,N.P.
  TITLE     Nucleotide sequence of the succinyl-CoA synthetase alpha-subunit
            from Thermus aquaticus B
  JOURNAL   Nucleic Acids Res. 16, 9858-9858 (1988)
  STANDARD  full staff_entry
COMMENT     Gene product mature structure is (alpha beta)2.
            
            Submitted (09-SEP-1988) on tape to the EMBL data library.
FEATURES       from  to/span     description
    pept         14      880     succinyl-CoA synthetase alpha-subunit (sucD;
                                 EC 6.2.1.5)
    binding       2        7     sucD ribosomal binding site (put.)
BASE COUNT      153 a    304 c    310 g    113 t
ORIGIN      
        1 tgggaggtgc ggcgtgatcc tggtgaaccg cgagacccgc gtcctggtcc agggcatcac
       61 cggccgggag gggcagttcc acaccaagca gatgctggac tacggcacca agatcgtcgc
      121 cggggtcacc ccgggcaaag ggggaacgga ggtcctaggg gtccccgtct acgacacggt
      181 gaaggaggcg gtggcccacc acgaggtgga cgcctccatc atcttcgtgc ccgccccggc
      241 cgcggcggac gccgccctgg aagcggccca cgccgggatc cccctcatcg tcctcatcac
      301 cgagggcatc cccaccctgg acatggtgcg ggcggtggag gagatcaagg ccctgggaag
      361 ccgcctcatc ggggggaact gccccgggat catcagcgcc cacctaccca agatcgggat
      421 catgcccggc cacgtcttca agcggggccg ggtggggatc atcagccgct ccggcaccct
      481 tcggcacgag gccgcagccg ccctttccca ggcggggctc ggcaccacca ccacggtggg
      541 gatcgggggc gaccccgtca ggcggctcac cttcaaggac ctcctccccc tcttcaacga
      601 ggacccggag acggaggccg tggtcctcat cggggagatc ggcggctccg acgaggagga
      661 ggcggcggct tgggtgaagg accacatgaa gaagccggtg gtgggcttca tcggaggccg
      721 ctccgccccc aagggcaagc gcatgggcca cgccggggcc atcatcatgg gcaacgtggg
      781 caccccggag tccaagctcc gggccttcgc cgaggcgggc atccccgtgg ccgacaccat
      841 cgacgagatc gtggagctgg tcaagaaggc cctgggctaa
//
LOCUS       MUSMHPCRE     256 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Mouse MHC class I Q even processed pseudogene, exons 7 and 8.
ACCESSION   M30902
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex;
            pseudogene.
SOURCE      Mouse adult (strain SK/CamE1) liver.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 256)
  AUTHORS   Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
            Dilwith,R. and Flaherty,L.
  TITLE     Genetic polymorphisms of Q region genes from wild derived mice:
            Implications for Q region evolution
  JOURNAL   Immunogenetics 31, 315-325 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Flaherty 04-JAN-1990.
FEATURES       from  to/span     description
    pept.ps /    45       82     pseudo-MHC Q even, exon 7 (AA at 46)
                191      219     pseudo-MHC Q even, exon 8
    IVS     <     1       44     pseudo-MHC Q even intron A (no splice
                                 consensus)
    IVS          83      190     pseudo-MHC Q even intron B (no splice
                                 consensus)
BASE COUNT       52 a     52 c     70 g     82 t
ORIGIN      Chromosome 17.
        1 agatatatct ttgtatatat ttttccctag gcagggacag ctcccagagc tctgatatgt
       61 ttctctcaag attgtaaagg tgacattcta tggcctgatt gcagaggggc actgtggaca
      121 tggttgtgtt tcagggactc ccacaatccc ctgtgagtgg tgggttgttg ggatattgtc
      181 ttcattgtgg tggttcctga ccctcgttct ctattatgaa gacagctgcc tggagtggac
      241 tcagtgacag ccagtg
//
LOCUS       MUSMHPCRM     264 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Mouse MHC class I Q odd processed pseudogene, exons 7 and 8.
ACCESSION   M30901
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex;
            pseudogene.
SOURCE      Mouse adult (strain Metkovic 2) liver.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
            Dilwith,R. and Flaherty,L.
  TITLE     Genetic polymorphisms of Q region genes from wild derived mice:
            Implications for Q region evolution
  JOURNAL   Immunogenetics 31, 315-325 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Flaherty 04-JAN-1990.
FEATURES       from  to/span     description
    pept.ps /    45       81     pseudo-MHC Q odd, exon 7 (AA at 45)
                194      225     pseudo-MHC Q odd, exon 8
    IVS     <     1       44     pseudo-MHC Q odd intron A (no splice
                                 consensus)
    IVS          82      193     pseudo-MHC Q odd intron B
BASE COUNT       59 a     53 c     76 g     76 t
ORIGIN      Chromosome 17.
        1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata
       61 gatctctccc agatggtaaa ggtgacactc tggggcctga ttggagaggg gcaatgtgga
      121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat
      181 gttgtcttca cagtgatggg tcatgtctct cattctctag catgaagaca gctgcctgga
      241 ctgcactgag tgacagacga tgtg
//
LOCUS       MUSMHQ4D      261 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Mouse MHC class I Q4d Qb-1 processed pseudogene (d haplotype),
            exons 7 and 8.
ACCESSION   M30896
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex;
            pseudogene.
SOURCE      Mouse adult (strain Balb/c, haplotype d) sperm, clone 65.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 261)
  AUTHORS   Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
            Dilwith,R. and Flaherty,L.
  TITLE     Genetic polymorphisms of Q region genes from wild derived mice:
            Implications for Q region evolution
  JOURNAL   Immunogenetics 31, 315-325 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Flaherty 04-JAN-1990.
FEATURES       from  to/span     description
    pept.ps /    45       82     peudo-MHC Q4d Qb-1, exon 7 (AA at 46)
                191      222     peudo-MHC Q4d Qb-1, exon 8
    IVS     <     1       44     peudo-MHC Q4d Qb-1 intron A (no splice
                                 consensus)
    IVS          83      190     peudo-MHC Q4d Qb-1 intron B (no splice
                                 consensus)
BASE COUNT       54 a     52 c     70 g     85 t
ORIGIN      Chromosome 17.
        1 tatagatata tctttgtata tatttttccc taggcaggga cagctcccag agctctgata
       61 tgtttctctc aagattgtaa aggtgacatt ctatggcctg attgcagagg ggcactgtgg
      121 acatggttgt gtttcaggga ctcccacaat cccctgtgag tggtgggttg ttgggatatt
      181 gtcttcattg tggtggttcc tgaccctcat tctctatcat gaagacagct gcctggagtg
      241 gacttagtga cagccagtgt g
//
LOCUS       MUSMHQ5B      264 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Mouse MHC class I Q5B processed pseudogene (b haplotype),
            exons 7 and 8.
ACCESSION   M30897
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex;
            pseudogene.
SOURCE      Mouse adult (strain B10, haplotype b) liver, clone B2.5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
            Dilwith,R. and Flaherty,L.
  TITLE     Genetic polymorphisms of Q region genes from wild derived mice:
            Implications for Q region evolution
  JOURNAL   Immunogenetics 31, 315-325 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Flaherty 04-JAN-1990.
FEATURES       from  to/span     description
    pept.ps /    45       81     pseudo-MHC Q5b, exon 7 (AA at 45)
                194      225     pseudo-MHC Q5b, exon 8
    IVS     <     1       44     pseudo-MHC Q5b intron A (no splice consensus)
    IVS          82      193     pseudo-MHC Q5b intron B
BASE COUNT       58 a     55 c     77 g     74 t
ORIGIN      Chromosome 17.
        1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata
       61 gatctctccc agatggtaaa ggtgacactc tgggacctga ttggggaggg gcaatgcgga
      121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat
      181 gttgtcttca cagtgatggg tcgtgtccct cattctctag catgaagaca gctgcctgga
      241 ctgcactgag tgacagacga tgtg
//
LOCUS       MUSMHQ5D      264 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Mouse MHC class I Q5d processed pseudogene (d haplotype),
            exons 7 and 8.
ACCESSION   M30898
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex;
            pseudogene.
SOURCE      Mouse adult (strain Balb/c, haplotype d) sperm, clone 65.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
            Dilwith,R. and Flaherty,L.
  TITLE     Genetic polymorphisms of Q region genes from wild derived mice:
            Implications for Q region evolution
  JOURNAL   Immunogenetics 31, 315-325 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Flaherty 04-JAN-1990.
FEATURES       from  to/span     description
    pept.ps /    45       81     pseudo-MHC Q5d, exon 7 (AA at 45)
                194      225     pseudo-MHC Q5d, exon 8
    IVS     <     1       44     pseudo-MHC Q5d intron A (no splice consensus)
    IVS          82      193     pseudo-MHC Q5d intron B
BASE COUNT       58 a     53 c     77 g     76 t
ORIGIN      Chromosome 17.
        1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata
       61 gatctctccc agatggtaaa ggtgacactc tggggcctga ttggggaggg gcaatgtgga
      121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat
      181 gttgtcttca cagtgatggg tcatgtctct cattctctag catgaagaca gctgcctgga
      241 ctgcactgag tgacagacga tgtg
//
LOCUS       MUSMHQ6B      261 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Mouse MHC class I Q6b Qa-2 processed pseudogene (b haplotype),
            exons 7 and 8.
ACCESSION   M30899
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex;
            pseudogene.
SOURCE      Mouse adult (strain B10, haplotype b) liver, clone B2.17.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 261)
  AUTHORS   Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
            Dilwith,R. and Flaherty,L.
  TITLE     Genetic polymorphisms of Q region genes from wild derived mice:
            Implications for Q region evolution
  JOURNAL   Immunogenetics 31, 315-325 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Flaherty 04-JAN-1990.
FEATURES       from  to/span     description
    pept.ps /    45       82     pseud-MHC Q6b Qa-2, exon 7 (AA at 46)
                191      222     pseud-MHC Q6b Qa-2, exon 8
    IVS     <     1       44     pseud-MHC Q6b Qa-2 intron A (no splice
                                 consensus)
    IVS          83      190     pseud-MHC Q6b Qa-2 intron B (no splice
                                 consensus)
BASE COUNT       53 a     52 c     71 g     85 t
ORIGIN      Chromosome 17.
        1 tatagatata tctttgtata tatttttccc taggcaggga cagctcccag agctctgata
       61 tgtttctctc aagattgtaa aggtgacatt ctatggcctg attgcagagg ggcactgtgg
      121 acatggttgt gtttcaggga ctcccacaat cccctgtgag tggtgggttg ttgggatatt
      181 gtcttcattg tggtggttcc tgaccctcgt tctctatcat gaagacagct gcctggagtg
      241 gacttagtga cagccagtgt g
//
LOCUS       MUSMHQ89DA    264 bp ds-DNA             ROD       07-AUG-1990
DEFINITION  Mouse MHC class I Q8/Q9d Qa-2 processed pseudogene (d haplotype),
            region surrounding exons 7 and 8.
ACCESSION   M30900
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex;
            pseudogene.
SOURCE      Mouse adult (strain Balb/c, haplotype d) sperm, clone 46.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 264)
  AUTHORS   Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K.,
            Dilwith,R. and Flaherty,L.
  TITLE     Genetic polymorphisms of Q region genes from wild derived mice:
            Implications for Q region evolution
  JOURNAL   Immunogenetics 31, 315-325 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Flaherty 04-JAN-1990.
FEATURES       from  to/span     description
    pept.ps /    45       81     peudo-MHC Q8/Q9d Qa-2, exon 7 (AA at 45)
                194      225     peudo-MHC Q8/Q9d Qa-2, exon 8
    IVS     <     1       44     peudo-MHC Q8/Q9d Qa-2 intron A (no splice
                                 consensus)
    IVS          82      193     pseudo-MHC Q8/Q9d Qa-2 intron B (no splice
                                 consensus)
BASE COUNT       57 a     54 c     78 g     75 t
ORIGIN      Chromosome 17.
        1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata
       61 gatctctccc aggtggtaaa ggtgacactc tgggacctga ttggggaggg gcaatgtgga
      121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat
      181 gttgtcttca cagtgatggg tcgtgtccct cattctctag catgaagaca gctgcctgga
      241 ctgcactgag tgacagacga tgtg
//
LOCUS       PMUNE1        220 bp ds-DNA             PHG       07-AUG-1990
DEFINITION  Bacteriophage mu genomic left end.
ACCESSION   M34919
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Bacteriophage mu DNA.
  ORGANISM  Bacteriophage mu
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 220)
  AUTHORS   Groenen,M.A.M., Timmers,E. and Van De Putte,P.
  TITLE     DNA sequences at the ends of the genome of Bacteriophage mu
            essential for transposition
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 2087-2091 (1985)
  STANDARD  simple staff_review
BASE COUNT       82 a     32 c     44 g     62 t
ORIGIN      
        1 tgtattgatt cacttgaagt acgaaaaaaa ccgggaggac attggattat tcgggatctg
       61 atgggattag atttggtggg gcttgcaagc ctgtagtgca aattttagtc cttaatcaat
      121 gaaacgcgaa agatagtaaa aaattgcttt tgtttcattg aaaatacgaa aaacaaaaac
      181 actgcaaatc atttcaataa cagcttcaaa aaacgttcaa
//
LOCUS       PMUNE2        220 bp ds-DNA             PHG       07-AUG-1990
DEFINITION  Bacteriophage mu genomic right end.
ACCESSION   M34920
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Bacteriophage mu DNA.
  ORGANISM  Bacteriophage mu
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 220)
  AUTHORS   Groenen,M.A.M., Timmers,E. and Van De Putte,P.
  TITLE     DNA sequences at the ends of the genome of Bacteriophage mu
            essential for transposition
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 2087-2091 (1985)
  STANDARD  simple staff_review
BASE COUNT       71 a     47 c     41 g     61 t
ORIGIN      
        1 gtggtacaca aatttaatca gtatcgctac atcagattcc tgaacaaacg agcaaggaag
       61 cggctaaata ccaaactatt caaggttcag gcatacccta agtgatcccc atgtaatgaa
      121 taaaaagcag taattaatac atctgtttca tttgaagcgc gaaagctaaa gttttcgcat
      181 ttatcgtgaa acgctttcgc gtttttcgtg cgccgcttca
//
LOCUS       STYRGGA       103 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  S.typhimurium 23S rRNA gene.
ACCESSION   M35803
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      S.typhimurium (strain ATCC 13311) DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1  >   103     23S ribosomal RNA
BASE COUNT       23 a     22 c     38 g     20 t
ORIGIN      
        1 ggaagatgta acggggctaa accatgcacc gaagctgcgg cagcgacgct gatgcgttgt
       61 tgggtagggg agcgttctgt aagcctgcga aggtcacctg tga
//
LOCUS       STYRGGB       104 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  S.typhimurium 23S rRNA gene.
ACCESSION   M35804
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      S.typhimurium (strain LB5010) DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 104)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1  >   104     23S ribosomal RNA
BASE COUNT       25 a     22 c     34 g     23 t
ORIGIN      
        1 cctgcgcgga agatgtaacg gggctcaaac catacaccga agctacgggt tcatcctttg
       61 gatgagcggt agaggagcgt tctgtaagcc tgtgaaggtg agtt
//
LOCUS       STYRGGC       185 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  S.typhimurium 23S rRNA gene.
ACCESSION   M35819
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      S.typhimurium (strain ATCC 13311) DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 185)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       38     23S ribosomal RNA
                145  >   185     23S ribosomal RNA
    site         39      144     23S rRNA intervening sequence
BASE COUNT       49 a     36 c     60 g     40 t
ORIGIN      
        1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
       61 aactttacgg agttgacgat tgacggagcg aagcgacgtc aaagcgttca ttaaagtcga
      121 gttggcttag ggatacgttc gttgggtagg ggagcgttct gtaagcctgt gaaggtgtgt
      181 cgtga
//
LOCUS       YEPRGGA        85 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.pestis 23S rRNA gene.
ACCESSION   M35798
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.pestis (strain EV76) DNA.
  ORGANISM  Yersinia pestis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 85)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1  >    85     23S ribosomal RNA
BASE COUNT       15 a     16 c     34 g     20 t
ORIGIN      
        1 atgcaccgaa gctgcggcag cgacgcttag gcgttgttgg gtaggggagc gttctgtaag
       61 ccgttgaagg tgacctgtga gggtt
//
LOCUS       YEPRGGB       120 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.pseudotuberculosis 23S rRNA gene.
ACCESSION   M35799
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.pseudotuberculosis (strain YPIII/pIB1; serotype III) DNA.
  ORGANISM  Yersinia pseudotuberculosis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 120)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1  >   120     23S ribosomal RNA
BASE COUNT       23 a     24 c     47 g     26 t
ORIGIN      
        1 gtcggcctgc gcggaagatg taacggggct aaaccatgca ccgaagctgc ggcagcgacg
       61 cttaggtgtt gttgggtagg ggagcgttct gtaagccgtt gaaggtgacc tgtgagggtt
//
LOCUS       YEPRGGC        83 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35800
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 761/80; serotype 0:6,310) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 83)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1  >    83     23S ribosomal RNA
BASE COUNT       14 a     16 c     34 g     19 t
ORIGIN      
        1 gcaccgaagc tgcggcagcg acgcttaggc gttgttgggt aggggagcgt tctgtaagcc
       61 gttgaaggtg acctgtgagg gtt
//
LOCUS       YEPRGGD       114 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35801
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 8081; serotype 0:8) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 114)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1  >   114     23S ribosomal RNA
BASE COUNT       23 a     24 c     43 g     24 t
ORIGIN      
        1 gtcggcctgc gcggaagatg taacggggct aaaccatgca ccgaagctgc ggcagcgaca
       61 cttaggtgtt gttgggtagg ggagcgttct gtaagccgtt gaaggtgacc tgtg
//
LOCUS       YEPRGGE       104 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.bercovieri 23S rRNA gene.
ACCESSION   M35802
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.bercovieri (strain 3016/84; serotype 0:58,16) DNA.
  ORGANISM  Yersinia bercovieri
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 104)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1  >   104     23S ribosomal RNA
BASE COUNT       21 a     21 c     38 g     24 t
ORIGIN      
        1 cgcggaagat gtaacggggc taaaccatgc accgaagctg cggcagcttc gcttaggtgt
       61 tgttgggtag gggagcgttc tgtaagccgt tgaaggtgac ctgt
//
LOCUS       YEPRGGF       197 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35805
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 132; serotype 0:1) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 197)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       37     23S ribosomal RNA
                159  >   197     23S ribosomal RNA
    site         38      158     23S rRNA intervening sequence
BASE COUNT       52 a     43 c     60 g     42 t
ORIGIN      
        1 agatgtaacg gggctaaacc atgcaccgaa gctgcggcag cgaacgtatc acttaaaacg
       61 cttaacgtga cgcaaagccg gcatgttcaa gccacactaa acgttgagtt ggccggtgtg
      121 ctgacaaacg aacaggcgtt ttaaggatac gttcgttggg taggggagcg ttctgtaagc
      181 cgttgaaggt ggcctgt
//
LOCUS       YEPRGGG       179 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35806
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 1142; serotype 0:2) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 179)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       19     23S ribosomal RNA
                141  >   179     23S ribosomal RNA
    site         20      140     23S rRNA intervening sequence
BASE COUNT       47 a     40 c     52 g     40 t
ORIGIN      
        1 ccatgcaccg aagctgcggc agcgaacgta tcacttaaaa cgcttaacgt gacgcaaagc
       61 cggcatgttc aagccacact aaacgttgag ttggccgatg tgctgacaaa cgaacaggcg
      121 ttttaaggat acgttcgttg ggtaggggag cgttctgtaa gccgttgaag gtagtctgt
//
LOCUS       YEPRGGH       206 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35807
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 6471/76; serotype 0:3) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 206)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       38     23S ribosomal RNA
                160  >   206     23S ribosomal RNA
    site         39      159     23S rRNA intervening sequence
BASE COUNT       54 a     43 c     65 g     44 t
ORIGIN      
        1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacttaaaac
       61 gcttaacgtg acgcaaagcc ggcatgttca agccacacta aacgttgagt tggccggtgt
      121 gctgacaaac gaacaggcgt tttaaggata cgttcgttgg gtaggggagc gttctgtaag
      181 ccgttgaagg tggtctgtga gggctg
//
LOCUS       YEPRGGI       175 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35808
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 7500; serotype 0:5,27) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 175)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       10     23S ribosomal RNA
                132  >   175     23S ribosomal RNA
    site         11      131     23S rRNA intervening sequence
BASE COUNT       46 a     36 c     55 g     38 t
ORIGIN      
        1 gaagctgcgg cagcgaacgt atcacttaaa acgcttaacg tgacgcaaag ccggcatgtt
       61 caagccacac taaacgttga gttggccgat gtgctgacaa acgaacaggc gttttaagga
      121 tacgttcgtt gggtagggga gcgttctgta agccgttgaa ggtgacctgt gaggg
//
LOCUS       YEPRGGJ       190 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35809
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain Ruokola/71; serotype 0:9) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 190)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       30     23S ribosomal RNA
                152  >   190     23S ribosomal RNA
    site         31      151     23S rRNA intervening sequence
BASE COUNT       49 a     43 c     58 g     40 t
ORIGIN      
        1 acggggctaa accatgcacc gaagctgcgg cagcgaacgt atcacttaaa acgcttaacg
       61 tgacgcaaag ccggcatgtt caagccacac taaacgttga gttggccggt gtgctgacaa
      121 acgaacaggc gttttaagga tacgttcgtt gggtagggga gcgttctgta agccgttgaa
      181 ggtggcctgt
//
LOCUS       YEPRGGK       192 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35810
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain C36; serotype 0:15) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 192)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       31     23S ribosomal RNA
                153  >   192     23S ribosomal RNA
    site         32      152     23S rRNA intervening sequence
BASE COUNT       50 a     42 c     59 g     40 t      1 others
ORIGIN      
        1 aacggggcta aaccatgcac cgaagctgcg gcagcgaacg tatcacttaa aacgcttaac
       61 gtgacgcaaa gccggcangt tcaagccaca ctaaacgttg agttggccgg tgtgctgaca
      121 aacgaacagg cgttttaagg atacgttcgt tgggtagggg agcgttctgt aagccgttga
      181 aggtggtctg tg
//
LOCUS       YEPRGGL       177 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35811
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain E701; serotype 0:4,32) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 177)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       29     23S ribosomal RNA
                139  >   177     23S ribosomal RNA
    site         30      138     23S rRNA intervening sequence
BASE COUNT       43 a     40 c     57 g     37 t
ORIGIN      
        1 cggggctaaa ccatgcaccg aagctgcggc agcgaacgta tcacccaagc caattcacgg
       61 cagtggataa tgattgacgg agcgcagcga cgtcaatgcg tccaataaag tctagttggc
      121 ttagggatac gttcgttggg taggggagcg ttctgtaagc cgttgaaggt ggcctgt
//
LOCUS       YEPRGGM       191 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35812
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 8081; serotype 0:8) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 191)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       38     23S ribosomal RNA
                148  >   191     23S ribosomal RNA
    site         39      147     23S rRNA intervening sequence
BASE COUNT       51 a     39 c     63 g     38 t
ORIGIN      
        1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
       61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt
      121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
      181 acctgtgagg g
//
LOCUS       YEPRGGN       158 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35813
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 5081; serotype 0:13a,13b) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 158)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       17     23S ribosomal RNA
                127  >   158     23S ribosomal RNA
    site         18      126     23S rRNA intervening sequence
BASE COUNT       41 a     33 c     51 g     33 t
ORIGIN      
        1 atgcaccgaa cgtgcggcag cgaacgtatc acccaagaca attcacggca gtggataatg
       61 attgacggag cgcagcgacg tcaatgcgtc caataaagtc gagttggctt agggatacgt
      121 tcgttgggta ggggagcgtt ctgtaagccg ttgaaggt
//
LOCUS       YEPRGGO       180 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35814
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 9312/78; serotype 0:13,18) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       38     23S ribosomal RNA
                148  >   180     23S ribosomal RNA
    site         39      147     23S rRNA intervening sequence
BASE COUNT       50 a     36 c     58 g     36 t
ORIGIN      
        1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
       61 aattcacgga agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt
      121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
//
LOCUS       YEPRGGP       180 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35815
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 874/77; serotype 0:20) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       38     23S ribosomal RNA
                148  >   180     23S ribosomal RNA
    site         39      147     23S rRNA intervening sequence
BASE COUNT       49 a     37 c     58 g     36 t
ORIGIN      
        1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
       61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt
      121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
//
LOCUS       YEPRGGQ       180 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35816
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain E736; serotype 0:21) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       38     23S ribosomal RNA
                148  >   180     23S ribosomal RNA
    site         39      147     23S rRNA intervening sequence
BASE COUNT       48 a     38 c     58 g     36 t
ORIGIN      
        1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
       61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccactaaagt
      121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
//
LOCUS       YEPRGGR       180 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.enterocolitica 23S rRNA gene.
ACCESSION   M35817
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.enterocolitica (strain 2139/72; serotype 0:34) DNA.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA    <     1       38     23S ribosomal RNA
                148  >   180     23S ribosomal RNA
    site         39      147     23S rRNA intervening sequence
BASE COUNT       49 a     37 c     58 g     36 t
ORIGIN      
        1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac
       61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt
      121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg
//
LOCUS       YEPRGGS       153 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  Y.bercovieri 23S rRNA gene.
ACCESSION   M35818
KEYWORDS    23S ribosomal RNA; ribosomal RNA.
SOURCE      Y.bercovieri (strain 3016/84; serotype 0:58,16) DNA.
  ORGANISM  Yersinia bercovieri
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 153)
  AUTHORS   Skurnik,M. and Toivanen,P.
  TITLE     Intervening sequences (IVS) in the 23S ribosomal RNA genes of
            pathogenic Yersinia enterocolitica strains. The IVSs in
            Y.enterocolitica and Salmonella typhimurium have common origin
  JOURNAL   Unpublished (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Skurnik, 29-JUN-1990.
            
              Author address: M.Skurnik
              Turku University
              Dept of Medical Microbiology
              Kiinanmyllynkatu 13
              SF-20520 Turku
              FINLAND
              email:mskurnik@finabo
            
FEATURES       from  to/span     description
    rRNA        110  >   153     23S ribosomal RNA
    site    <     1      109     23S rRNA intervening sequence
BASE COUNT       39 a     30 c     51 g     33 t
ORIGIN      
        1 cagcgaacgt atcacccaag acaattcacg gcagtggata atgattgacg gagcgcagcg
       61 acgtcaatgc gtccaataaa gtcgagttgg cttagggata cgttcgttgg gtaggggagc
      121 gttctgtaag ccgttgaagg tgacctgtga ggg
//
LOCUS       ECOINSW        49 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli insertion site for transposon Tn1545.
ACCESSION   M31752
KEYWORDS    .
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 49)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb       26       27     transposon Tn1545 insertion site
BASE COUNT       12 a     10 c      0 g     27 t
ORIGIN      
        1 cttttcccaa catttccttt tttattatta aaaatcattt ttttcttca
//
LOCUS       ECOINSX        49 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli insertion site for transposon Tn1545.
ACCESSION   M31754
KEYWORDS    .
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 49)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb       26       27     transposon Tn1545 insertion site
BASE COUNT       21 a      4 c      6 g     18 t
ORIGIN      
        1 atttgaaaat gctgattctt ttatagataa aaatagcaat gcttaaata
//
LOCUS       ECOINSY        49 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli insertion site for transposon Tn1545.
ACCESSION   M34227
KEYWORDS    .
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 49)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb       26       27     transposon Tn1545 insertion site
BASE COUNT       16 a      8 c      5 g     20 t
ORIGIN      
        1 tgttggattt catttcactt tctccataaa aaaaaacggt aatcatttt
//
LOCUS       ECOINSZ        49 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli insertion site for transposon Tn1545.
ACCESSION   M35413
KEYWORDS    .
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 49)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb       26       27     transposon Tn1545 insertion site
BASE COUNT       18 a      6 c     11 g     14 t
ORIGIN      
        1 cacctcatac atttgatgtt tatcagtaaa aaaatgggag gattaaggg
//
LOCUS       ECOT1545A1    136 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli/transposon Tn1545 5' junction.
ACCESSION   M35311
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 136)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb       26       27     E.coli end/ transposon Tn1545 start
BASE COUNT       49 a     28 c     10 g     49 t
ORIGIN      
        1 cttttcccaa catttccttt tttattttaa aaaatagcat aaaaatctag ttatccgcat
       61 aaaaactgga cttatcacac tttatcaagg tcaaaaccac tcaatttact actaatttac
      121 tacttatgaa tgagct
//
LOCUS       ECOT1545A2    212 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli/transposon Tn1545 3' junction.
ACCESSION   M35315
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb      189      190     transposon Tn1545 end/ E.coli start
BASE COUNT       84 a     29 c     28 g     71 t
ORIGIN      
        1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac
       61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac
      121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact
      181 ttgtttatta ttaaaaatca tttttttctt ca
//
LOCUS       ECOT1545B1    135 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli/transposon Tn1545 5' junction.
ACCESSION   M35312
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 135)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review staff_entry
FEATURES       from  to/span     description
    recomb       26       27     E.coli end/ transposon Tn1545 start
BASE COUNT       53 a     23 c     14 g     45 t
ORIGIN      
        1 atttgaaaat gctgattctt ttatagataa aaatagcata aaaatctagt tatccgcata
       61 aaaactggac ttatcacact ttatcaaggt caaaaccact caatttacta ctaatttact
      121 acttatgaat gagct
//
LOCUS       ECOT1545B2    212 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli/transposon Tn1545 3' junction.
ACCESSION   M35316
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb      189      190     transposon Tn1545 end/E.coli start
BASE COUNT       88 a     29 c     30 g     65 t
ORIGIN      
        1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac
       61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac
      121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact
      181 ttgtttctta taaaaatagc aatgcttaaa ta
//
LOCUS       ECOT1545C1    135 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli/transposon Tn1545 5' junction.
ACCESSION   M35313
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 135)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb       26       27     E.coli end/ transposon Tn1545 start
BASE COUNT       48 a     27 c     13 g     47 t
ORIGIN      
        1 tgttggattt catttcactt tctccattaa aaatagcata aaaatctagt tatccgcata
       61 aaaactggac ttatcacact ttatcaaggt caaaaccact caatttacta ctaatttact
      121 acttatgaat gagct
//
LOCUS       ECOT1545C2    212 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli/transposon Tn1545 3' junction.
ACCESSION   M35317
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb      189      190     transposon Tn1545 end/ E.coli start
BASE COUNT       88 a     28 c     31 g     65 t
ORIGIN      
        1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac
       61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac
      121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact
      181 ttgttttgat aaaaaaaaac ggtaatcatt tt
//
LOCUS       ECOT1545D1    135 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli/transposon Tn1545 5' junction.
ACCESSION   M35314
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 135)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb       26       27     E.coli end/ transposon Tn1545 start
BASE COUNT       51 a     28 c     13 g     43 t
ORIGIN      
        1 cacctcatac atttgatgtt tatcagctaa aaatagcata aaaatctagt tatccgcata
       61 aaaactggac ttatcacact ttatcaaggt caaaaccact caatttacta ctaatttact
      121 acttatgaat gagct
//
LOCUS       ECOT1545D2    212 bp ds-DNA             BCT       07-AUG-1990
DEFINITION  E.coli/transposon Tn1545 3' junction.
ACCESSION   M35318
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Caillaud,F. and Courvalin,P.
  TITLE     Nucleotide sequence of the ends of the conjugative shuttle
            transposon Tn1545
  JOURNAL   Mol. Gen. Genet. 209, 110-115 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    recomb      189      190     transposon Tn1545 end/ E.coli start
BASE COUNT       87 a     26 c     37 g     62 t
ORIGIN      
        1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac
       61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac
      121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact
      181 ttgttttgat aaaaaaatgg gaggattaag gg
//

GenBank-Updates@genbank.bio.net (08/08/90)

LOCUS       MUSMHC4B1     278 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Mouse MHC class II H2-C4-w7 gene (C3H, w7), 5' flank, segment 1.
ACCESSION   M15601
KEYWORDS    class III gene; complement protein; complement system protein;
            major histocompatibility complex; protease.
SEGMENT     1 of 2
SOURCE      Mouse (C3H.W7) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 278)
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  TITLE     Recombination of two homologous MHC class III genes of the mouse
            (C4 and Slp) that accounts for the loss of testosterone dependence
            of sex-limited protein expression
  JOURNAL   J. Immunol. 138, 620-627 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 278; revises [1])
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    revision     81       82     ca in [2]; ac in [1]
BASE COUNT       67 a     94 c     62 g     55 t
ORIGIN      Chromosome 17.
        1 gtcaacagtt attgcacatg cttaatgtaa acctgtagcc tctacatgct tgtgtacctg
       61 cccagacagt atatgacacc cacacccaca cccacaccca cacccacacc caccccaccc
      121 cacttcacac ccaggcagca gtagtggacc caacagggta agtgggtccc tgcttgggtt
      181 gcctgtgttt ctggaagctc aaggtccatg catcagcagt aggggaaggg agggttctca
      241 gcccaggctg catcatcatg gccgcctacc acctgcag
//
LOCUS       MUSMHSLQ2     314 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Mouse MHC class III H2-Slp-w7d gene (C3H,w7), 5' flank, segment 2.
ACCESSION   M15609
KEYWORDS    class III gene; complement protein; complement system protein;
            major histocompatibility complex.
SEGMENT     2 of 2
SOURCE      Mouse (C3H.W7) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  TITLE     Recombination of two homologous MHC class III genes of the mouse
            (C4 and Slp) that accounts for the loss of testosterone dependence
            of sex-limited protein expression
  JOURNAL   J. Immunol. 138, 620-627 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 314; revises [1])
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    revision    119      121     cac in [1]; c in [1]
BASE COUNT       81 a     88 c     72 g     73 t
ORIGIN      About 1064 bp downstream from segment 1; chromosome 17.
        1 tcgactgggg aacccattct agttttgtta ggacctttac ctgtcttatc tatgtgtcat
       61 gttggagctg aatacacgca cacatgggca agcacatgta ttcatacaca cacacacaca
      121 cccacacgca catgcagagg gacacgcaca cgcacacaaa cacacacaca aggggaagtc
      181 cctggtactg ggaataaaca agcaccagat caattttggg gccaagattt ctacagagac
      241 cctggtgttt gggttgcctc cccagctctg gcctagggcc agttctgttt cttggccatc
      301 acgtggtttc ccgg
//
LOCUS       MUSMHSLR2     320 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Mouse MHC class III H2-Slp-w7c gene (C3H,w7), 5' flank, segment 2.
ACCESSION   M15610
KEYWORDS    class III gene; complement protein; complement system protein;
            major histocompatibility complex.
SEGMENT     2 of 2
SOURCE      Mouse (C3H.W7) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 320)
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  TITLE     Recombination of two homologous MHC class III genes of the mouse
            (C4 and Slp) that accounts for the loss of testosterone dependence
            of sex-limited protein expression
  JOURNAL   J. Immunol. 138, 620-627 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 320; revises [1])
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    revision    124      126     aca in [2]; a in [1]
BASE COUNT       83 a     92 c     71 g     74 t
ORIGIN      About 1064 bp downstream from segment 1; chromosome 17.
        1 tcgactgggg aacccattct agttttttta ggacctttac ctgtcttatc tatgtgtcat
       61 gttggagctg aatacacgca cacatgggca agcacatgta ttcgtacaca cacacacaca
      121 cacacaccca cacgcacatg cacagggaca cgcacacgca cacaaacaca cacacaaggg
      181 gaagtccctg gtactgggaa taaacaagca ccagatcaat tttggggcca agatttctac
      241 agagaccctg gtgtttgggt tgcctcccca gctctggcct agggccagtt ctgtttcttg
      301 gccatcacgt ggtttcccgg
//
LOCUS       MUSMHSLS2     294 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Mouse MHC class III H2-Slp-w7b gene (C3H,w7), 5' flank, segment 2.
ACCESSION   M15611
KEYWORDS    class III gene; complement protein; complement system protein;
            major histocompatibility complex.
SEGMENT     2 of 2
SOURCE      Mouse (C3H.W7) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 294)
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  TITLE     Recombination of two homologous MHC class III genes of the mouse
            (C4 and Slp) that accounts for the loss of testosterone dependence
            of sex-limited protein expression
  JOURNAL   J. Immunol. 138, 620-627 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 294; revises [1])
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    revision    232      232     a in [2]; g in [1]
BASE COUNT       76 a     80 c     66 g     72 t
ORIGIN      About 1064 bp downstream from segment 1; chromosome 17.
        1 tcgactgggg aacccattct agttttgtta ggacctttac ctgtcttatc tatgtgtcat
       61 gttggagctg aatacacgca cacatggaca agcacatgta ttcgtacaca cacacacaca
      121 cacacgcaca cgcacacaaa cacacacaca aggggaagtc cctggtactg ggaataaaca
      181 aggaccagat caattttggg gccaagattt ctacagagac cctggtgttt gagttgcctc
      241 cccagctctg gcctagggcc agttctgttt cttggccatc acgtggtttc ccgg
//
LOCUS       MUSMHSLT1     253 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Mouse MHC class III H2-Slp-w7a gene (C3H,w7), 5' flank, segment 1.
ACCESSION   M15605
KEYWORDS    class III gene; complement protein; complement system protein;
            major histocompatibility complex.
SEGMENT     1 of 2
SOURCE      Mouse (C3H.W7) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 253)
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  TITLE     Recombination of two homologous MHC class III genes of the mouse
            (C4 and Slp) that accounts for the loss of testosterone dependence
            of sex-limited protein expression
  JOURNAL   J. Immunol. 138, 620-627 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 253; revises [1])
  AUTHORS   Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D.,
            Pattanakitsakul,S.-N. and Takahashi,M.
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    revision     95       96     ct in [2]; ta in [1]
    revision    138      144     tgggtc in [2]; gg in [1]
    revision    148      150     ctt in [2]; c in [1]
    revision    174      176     tca in [2]; t in [1]
    revision    187      189     atc in [2]; a in [1]
BASE COUNT       58 a     76 c     62 g     57 t
ORIGIN      Chromosome 17.
        1 gtcaacagtt attgcacata cttaatgtaa acctgtagcc tctacatgct tgtgtacctg
       61 cccagacagt atatgacacg cacacaccca cacactcccc cacacccagg cagctgtggt
      121 ggacccaaca gggtaagtgg gtccctgctt gggttgcctg tgtttctgga agctcaatgt
      181 ccacgcatca gcagttgggg aagggagggt tctcagccca ggctgcatca tcatggctgc
      241 ctaccacctg cag
//
LOCUS       BOVCMPR      7913 bp ss-mRNA            MAM       08-AUG-1990
DEFINITION  Bovine cation-independent mannose 6-phosphate receptor protein
            mRNA, complete cds.
ACCESSION   J03527 M15869
KEYWORDS    cation-independent mannose 6-phosphate receptor protein;
            insulin-like growth factor II receptor protein.
SOURCE      Bovine (calf), cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 3267 to 7913)
  AUTHORS   Lobel,P., Dahms,N.M., Breitmeyer,J., Chirgwin,J.M. and Kornfeld,S.
  TITLE     Cloning of the bovine 215-kDa cation-independent mannose
            6-phosphate receptor
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 2233-2237 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 7913; revises [1])
  AUTHORS   Lobel,P., Dahms,N.M. and Kornfeld,S.
  TITLE     Cloning and sequence analysis of the cation-independent mannose
            6-phosphate receptor
  JOURNAL   J. Biol. Chem. 263, 2563-2570 (1988)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.Lobel, 22-JUL-1987 and for [2] 23-NOV-1988.
FEATURES       from  to/span     description
    pept        153     7652     cation-independent mannose 6-phosphate
                                 receptor protein
BASE COUNT     1665 a   2401 c   2413 g   1434 t
ORIGIN      
        1 cgcccctcga gctcccccgg ctcgtccgtg gctcctcgcg ctcctgtccg tctccgcctg
       61 ccgccctggc tgcacacccg tgtcccggac gccgcccccg gccgccgcgc gccgctcgcc
      121 cggctccgga cgcgcagccc gggcccggcg cgatggaggc ggccgccggc cggagctcgc
      181 acctggggcc cgcgcccgcc gggcgcccgc cgcggtgccc gctcctgctg cagctgcagc
      241 tgctgctgct gctgctgctg ctgccgccgg gctgggttcc cggggccgcg ggcacccagg
      301 gcgccgagtt cccagagctg tgcagttata catgggaagc agtggatacc aaaaataaca
      361 tgctttataa aatcaacatc tgtggaaata tgggtgttgc ccagtgtgga ccatcaagtg
      421 ctgtctgtat gcatgacttg aagacagaca gctttcattc tgtgggtgac tctcttttga
      481 aaacagcaag cagatctctt ctggaattta acacaacagt gaactgtaag cagcagaatc
      541 acaaaattca gagtagcatc accttcttat gtgggaaaac cttgggaact cccgagtttg
      601 taactgcaac agattgtgtg cattacttcg agtggaggac tactgcagcc tgcaaaaaga
      661 atatatttaa agcgaataaa gaggtgccct gttacgcttt cgacagagag ctcaagaagc
      721 acgatttaaa cccactgatc aagaccagcg gtgcttactt ggtggacgac tctgacccgg
      781 acacatctct gttcatcaat gtctgcaggg acatagaggt gctccgggcc tcgagtccac
      841 aagtgcgcgt gtgtcccacc ggcgcggccg cctgcctggt gcgaggggac cgcgcgttcg
      901 acgtgggccg gccccaggag gggctgaagc tcgtgagcaa tgacaggctc gtcctgagtt
      961 acgtgaagga aggggccggc cagcccgact tctgtgacgg ccacagcccg gcggtgacca
     1021 tcacgttcgt gtgcccgtcg gagcgcagag agggcaccat tcccaagctc acagcgaaat
     1081 ccaactgccg ctttgagatc gagtgggtca ccgagtacgc ctgccacagg gattacctgg
     1141 aaagccggag ctgctccctg agcagcgcgc agcatgacgt ggccgtcgac ctccagccgt
     1201 tgagccgggt ggaagcctca gactccttgt tctacacctc ggaggcggac gagtatacat
     1261 attatttgag catctgcgga ggaagccaag cgcccatctg taataagaaa gatgctgcag
     1321 tgtgccaagt gaaaaaggca gattccactc aagtcaaagt ggccgggaga ccccagaacc
     1381 tgaccctccg gtactcggat ggagacctca ccttgatcta tttcgggggt gaagagtgca
     1441 gctccggctt ccagcggatg agtgtcatca acttcgagtg caatcagaca gcaggtaaca
     1501 atggcagagg ggctcctgtg ttcaccgggg aggtggactg cacctacttc ttcacgtggg
     1561 atacgaagta cgcctgtgtc cacgagaagg aggccctgct gtgcggcgtc tccgacggga
     1621 aacagcgctt cgacctgtcg gcgctggccc ggcactcaga actggaacaa aattgggaag
     1681 ctgtggatgg cagtcagagg gaagcagaaa agaagcattt cttcattaac atctgccaca
     1741 gggtcctgca gacgggccag gcacggggct gccccgaaga cgcggccgtg tgtgccgtgg
     1801 ataagaatgg aagtaaaaat ctgggcagat ttatttcttc tcccaccaga gagaaaggaa
     1861 atattcagct ctcttactca gatggtgatg agtgcggtgg tggccagaag ataataacaa
     1921 atataacact catgtgcaaa ccaggtgatt tagaaagtgc cccggtgctg acaacctcca
     1981 gggctgacgg ctgcttctac gagtttgagt ggcgcacggc tgcagcctgc gtgctctcca
     2041 ggaccgaggg ggacaactgc actgtctttg actcccaggc agggttttct ttcgacttga
     2101 cgcctctcac gaagaaggac gcctacaagg tcgagacgga caagtacgag ttccacatca
     2161 acgtgtgcgg cccggtgtcc gtgggcgcct gcccgccgga ctcgggggcc tgtcaggtgt
     2221 ccaggagtga taggaagtct tggaacttgg gacgcagcaa tgctaagctt tcgtattacg
     2281 acgggatgat ccagctgacc tacagggacg gcacacccta caataacgag aagcgcacgc
     2341 cgagagccac gctcatcacc ttcctctgtg accgagacgc cggagtgggt ttccccgaat
     2401 atcaggagga agataactct acatacaact tccggtggta caccagttac gcctgcccgg
     2461 aggagccgct ggagtgcatc gtgactgacc ccgtcacact ggaccagtac gacctctcca
     2521 ggctagcgaa atccgagggc ggtcctgggg gaaactggta ctctctggac aacggcgggg
     2581 cacgcagcac gtggcggaag tactacatca acgtgtgtcg tcccctgaac ccggtgccgg
     2641 gctgtgaccg ctacgcgtcc gcctgtcaga tgaagtacca gggcgagcag ggctcgtact
     2701 ctgagaccgt ctccatcagc aacctggggg tggcgaagac gggccccatg gtggaggaca
     2761 gcggcagcct gctcctggag tacgtcaacg gctccgcctg caccaccagc gaccagaggc
     2821 gcaccaccta caccaccagg atccaccttg tctgctctac cggcagcctg tatacccatc
     2881 ccatattttc tctcaactgg gagtgtgtgg tcagcttcct gtggaacacg gcggcagcct
     2941 gtcctatccg aatcaccacg gacatagacc aggtctgctc catcaaggac cccaacagcg
     3001 ggtacgtgtt tgatctgaac ccactgaaca attcccgagg atacgtggtt ttgggcatcg
     3061 ggaagacgtt tctgttcaac gtgtgcggtg acatgcccgc ctgtggcacc ctggatggga
     3121 agccagcttc cggctgcgag gcagaagtcc agatggacga catgaagacc ctgaagccgg
     3181 gcaggctggt gggcctggag aagagcctgc agctgtccac cgagggcttt ataaccctga
     3241 actacacggg gcttccttcc caccccaacg ggagggctga tgccttcatc atccgcttcg
     3301 tctgcaatga tgacgtttac ccagggacac ccaagttcct gcaccaggac atcgactcta
     3361 gcctggggat ccgggacact ttcttcgagt ttgaaaccgc gctggcctgt gtaccttctc
     3421 cggtagattg ccaagtcaca gaccccgccg ggaacgagta tgatctgagt ggcctgagca
     3481 aggccaggaa gccgtggact gcggttgaca cgttcgatga ggggaagaag aggaccttct
     3541 acctgagcgt gtgcacgcct ctcccgtaca ttcccggctg ccacggcacc gctgtggggt
     3601 gctgcctggt gacggaagac agcaagttga acctaggcgt cgtgcagatc agtcctcagg
     3661 tgggcgccaa cgggtccctg agcctcgtct acgtcaacgg ggacaagtgc aagaaccagc
     3721 gtttctccac caggataaac ctcgagtgtg cccacacaac gggctccccg acctttcagc
     3781 tccagaacga ctgtgagtat gtgtttctct ggagaaccgt ggaagcctgt cccgtcgtgc
     3841 gtgcggaagg agactactgc gaggtgagag acccaaggca cggcaacctg tataacctga
     3901 tacctcttgg tctgaacgac actgtcgtgc gggccggcga atacacctat tacttccgcg
     3961 tctgcggaga gctgacatcc ggcgtctgcc caaccagtga caagtccaag gtcatctcat
     4021 catgccagga aaagcgggga ccccagggat ttcaaaaagt ggcaggtctg tttaatcaga
     4081 agctgaccta cgagaatggg gtgctgaaga tgaactacac cgggggcgac acctgccaca
     4141 aggtgtacca gcgttccacc accatctttt tctactgcga ccgcagcacg caggcgcccg
     4201 tgtttctcca ggagacgtcc gattgctcct acctgtttga gtggcgcacg cagtacgcct
     4261 gcccgcccta cgacctgacc gagtgttcgt tcaaaaacga ggctggggaa acctacgacc
     4321 tctcgtctct gtcgaggtac agcgacaact gggaggctgt cacgggcaca gggtccaccg
     4381 agcactacct catcaacgtg tgcaagtccc tgtccccgca ggctggctca gatccgtgcc
     4441 ctccggaggc ggccgtgtgt ctgctgggcg gccccaagcc cgtgaacctg ggcagggtgc
     4501 gggacagtcc tcagtggagc cagggcttga ccctcctgaa gtacgttgac ggtgacttgt
     4561 gtccagacca gattcggaag aagtcaacca ccatccgctt cacgtgcagc gagagccacg
     4621 tgaactccag gcccatgttc atcagcgctg tggaggactg cgaatacacc ttctcctggc
     4681 ccacggccgc cgcctgcgcg gtcaagagca acgtgcacga cgactgccag gtcaccaacc
     4741 ccgccacggg acacctgttt gacctgagct ctctgagcgg ccgggccggc ttcaccgccg
     4801 cctacagcga gaaggggtta gtctacctca gcgtgtgcgg ggacaacgag aactgcgcca
     4861 acggcgtggg ggcctgcttt gggcagacca ggatcagcgt gggcaaggcg agcaagaggc
     4921 tgacctacgt ggaccaggtc ttgcaactgg tgtatgaggg cggctccccc tgcccctcca
     4981 agaccggcct gagctacaag agcgtcatca gcttcgtgtg caggcctgag gtcgggccca
     5041 ccaacaggcc catgctgatc tccctggaca agcgcacgtg cacgcttttc ttctcctggc
     5101 acacacccct ggcctgcgag cagacgaccg agtgctccgt gaggaacggc agctccctca
     5161 tcgacctgtc cccactcatc caccgcaccg ggggttacga agcatacgat gagagtgagg
     5221 acgacggctc cgacaccagc cctgacttct acatcaacat ctgccagccg ctcaacccca
     5281 tgcacgggtt ggcctgcccc gccggcacgg ccgtgtgcaa ggttcccgtg gacggccccc
     5341 cgatagatat tggccgagtg gcaggacctc cgatcctcaa tcccatagcc aacgaagttt
     5401 acttgaactt tgaaagcagc actccttgct tggcggaccg gcacttcaac tacacctcac
     5461 tgatcacgtt ccactgtaag cggggcgtga gcatgggaac gccaaaactg ctgaggacca
     5521 gtgtgtgtga cttcgtgttt gagtgggaga ctcctctggt ctgtcccgac gaagtgaaga
     5581 cggacggctg ctccctcacg gacgaacagc tgtactacag cttcaacctg tccagcctct
     5641 ccaagagcac cttcaaggtg acccgaggcc cgcacaccta cagtgtgggg gtgtgcaccg
     5701 cagccgcagg cctggacgaa ggaggctgca aggacggtgc tgtctgcctg ctgtccggga
     5761 gcaagggggc gtctttcggg cggctggcgt ccatgaagct ggactacagg catcaggacg
     5821 aagctgtcat cctgagttac gccaacggag acacttgccc tccggaaact gaggacggcg
     5881 agccgtgtgt gttccccttc gtgttcaacg ggaagagcta cgaggagtgt gttgtggaga
     5941 gcagggccag gctctggtgc gcgaccaccg ccaactacga cagagaccac gagtggggct
     6001 tctgcaagca ctccaccagc caccggacgt ccgtcatcat cttcaagtgt gacgaggacg
     6061 ccgacgtggg gcggccccag gtcttcagcg aggtgcgcgg ctgtgaggtg accttcgagt
     6121 ggaagacgaa ggtggtctgc cccccgaaga agatggagtg caagttcgtc cagaagcacc
     6181 ggacctacga cctgcggctg ctctcgtccc tcaccggctc ctggtccttc gtccacaacg
     6241 gagcctcgta ctacatcaac ctgtgtcaga aaatatacaa gggaccccag gactgctcgg
     6301 agagagccag cgtgtgcaaa aagagcacct ctggcgaggt gcaggtcctc gggctcgttc
     6361 acacacagaa gctggatgtt gtagatgaca gagtcatcgt aacttactct aaaggccact
     6421 actgtgggga caataagaca gcgtctgctg tcatcgagct gacctgtgcc aagacagtgg
     6481 ggcggccttc gttcacgagg ttcgacgtcg acagctgcac ctaccacttc agctgggact
     6541 cacgagcggc ctgcgccgtg aagcctcagg aggtgcagat ggtgaatggg accatcacca
     6601 acccggccaa cggccggagc ttcagcctcg gggatattta cttcaaacga ttcagcgcct
     6661 ctggggacgt gagaaccaac ggggacaggt acatctacga gatccagctg tcgtccatca
     6721 cgggctccag cagccccgcc tgctctgggg ccagcatctg ccagaggaag gccaacgacc
     6781 agcacttcag tcgcaaagtc ggaacctcca accaaaccag atactacgtt caagatggcg
     6841 acctggatgt ggtgttcacc tcgtcctcca agtgtggaaa agacaagaca aagtctgtgt
     6901 cctccaccat cttcttccac tgtgaccccc tggtgaagga cgggatcccc gagttcagcc
     6961 acgagactgc cgactgccag tacctcttct cctggcacac ctctgccgtg tgcccgctgg
     7021 gggcgggctt cgacgaggag atcgcagggg atgacgccca ggagcacaaa gggctctcag
     7081 agcgcagcca ggcggtcggg gcagtgctca gcctgctgct ggtggcactc accgcctgcc
     7141 tgctcaccct gctgctgtac aagaaggagc gccgggagat ggtaatgagc aggcttacca
     7201 actgctgccg ccggagcgcg aacgtgtcct acaagtactc gaaggtgaac aaggaggagg
     7261 aggccgacga gaacgagacc gagtggctga tggaggagat ccagccgccg gcgccgcggc
     7321 ccgggaagga gggccaggag aacgggcacg tggccgccaa gtcggtgaga gctgccgaca
     7381 cgctgagcgc cctgcacggc gacgagcagg atagcgagga cgaggtcctg acgctgcccg
     7441 aggtgaaggt gcgcccgcca ggccgggctc ctggtgccga aggtggcccc ccgctgcggc
     7501 cactgcctcg gaaggcgcca ccgccgctgc gggcggacga ccgggtgggg ctggtgcgtg
     7561 gggagccggc gcgccggggg cggccccggg cggcggccac gcccatcagc accttccacg
     7621 acgacagcga cgaggacctc ctgcacgtct aggctcgccc gcgccggctc cgaccaaatc
     7681 cgatgggact ccgtgatgct tctgtccttg gcctttaacg aaaactgtcc aaaaaaggga
     7741 agagtgttgg tggtggggga ggaggggggc gcccctctcc gtgggcacgg ggaggaggcg
     7801 cggccgctga tggccgcgcc agccccgggt ctgtccccag ccctcacctt gagcgtggcc
     7861 gccgtgcgct tcctaaaggc gccggggcca gacgcgtctt gggactgagg gct
//
LOCUS       HUMHBBC      1552 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human hemoglobin-related sequence across the breakpoint for Indian
            delta-beta thalassemia.
ACCESSION   M37467
KEYWORDS    hemoglobin.
SOURCE      Human (case V.1) DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1552)
  AUTHORS   Mishima,N. and Gilman,J.G.
  TITLE     The 3' breakpoint of the Indian delta-beta thalassemia deletion
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.G.Gilman, 01-AUG-1990.
                Author address: J.G.Gilman,
                Montefiore Medical Sickle Cell Center,
                111 East 210th Street,
                The Bronx, NY 10467
            
            This sequence of 1552 bases connects the sequence of <HUMHBB> at
            position 73309, with the sequence beginning at position 15 under
            accession number M33706 (position 1464 of this sequence).  In
            <HUMHBB> coordinates, this sequence begins at 73309 and ends at
            74860.
FEATURES       from  to/span     description
BASE COUNT      447 a    271 c    299 g    535 t
ORIGIN      
        1 ttttctcagt cagttaacat tccttcaact agattagttg tgacaaaatt tcaggccttg
       61 ctcaactcta acctcagtct ctccatatat ttctgagagt cttccttctt atttgtttat
      121 aagaatttag taaatggcaa tattgagaag tagaaaattt tggagacttg tttttcatga
      181 accagaacaa tctcaaagca gcaatactaa acaggagcca agcctagaat aaagttgaat
      241 ttctggaatc aacaagctag gggaaagtct cccattaagt caaatatttg acaactggga
      301 ggtcaagtca gaggtgaaaa atgattaaag ataacaagag gtctgaaaac aaagtgacca
      361 attaaaataa aaagaaagag ccaagctggg cacagtggct tgcatctgta gtcccagtaa
      421 cccaggaggc tgaagtgata ggatcacttg agcccaggat ttcagggttg cagtgagcaa
      481 taatgacacc actgctctct agccttggtg actgagatcc tgtctctgag gaaaaacaaa
      541 caaacaagca aaaaacaaac aaacaaacac acagatatta agaacataga agttttgtta
      601 cggtgatgat attctccaac acaggtctga accccttgca gtgcctcggg gctgtttctc
      661 cagtggattc ttgggttctt cacatcttct caactttgtg aaaagatatt catccaaagg
      721 tctcacctgt cttattcaga gcttggattt tggactagct ttttctattt cttgggattc
      781 tctagagcca tatattcaga tataacacag atataaccaa aagaagcaaa ggaggcatca
      841 cagaaggggc tttgtattct ggggaggggg acatagagtt gagagaaata caggaattta
      901 taggactcta ctcaaagagc cctcaaataa agtacggcta gtcttctttg cctaagacaa
      961 tgagcagtcc aggaagttat ccatttcttt tagattttct agtttatgta catagagatg
     1021 ttcatagtag tttttgatga ttatttgtat ttctgtgggg tgagtgataa tattcccttt
     1081 tcatttctaa ctgtgcttat ttggatcttc tctcttgtct tctttattag tctagctagt
     1141 gatctacctt attaattttt tcacaaaacc agctcttgga ctcattaatc ttttgaatgg
     1201 tattttgtgt tttaatatcc ttcagttagc tttgatcttg tttatttctt gtcttctgct
     1261 agatttaggg ttggtttgct cttggttctc tggttctttt agttgtgaca ttaggttgtt
     1321 aatttgaggg ctttaagact ttttgatgtg ggcatttagt gtataaattt ctctcttaac
     1381 actgtctaag ctgtgtccca gagattccgg tatattgtat ctttgttctc attagtttca
     1441 aagaactatt ttatttttgc cttagttttc attatttacc caaaattcat tcaggagcag
     1501 gttgtttaat ttccatgtaa ttgcatagtt ttcagctatt cattagtctt ga
//
LOCUS       HUMHBBTHMU   2337 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human hemoglobin DNA with a deletion causing Indian delta-beta
            thalassemia.
ACCESSION   M33706
KEYWORDS    hemoglobin.
SOURCE      Human (patient B.T.) leukocyte DNA.
REFERENCE   1  (bases 1 to 2337)
  AUTHORS   Mishima,N. and Gilman,J.G.
  TITLE     The 3' breakpoint of the Indian delta-beta thalassemia deletion
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.G.Gilman, 01-AUG-1990.
                Author address: J.G.Gilman,
                Montefiore Medical Sickle Cell Center,
                111 East 210th Street,
                The Bronx, NY 10467
FEATURES       from  to/span     description
    recomb       15       17     overlap of 5' end 3' breakpoint DNA
BASE COUNT      942 a    462 c    401 g    529 t      3 others
ORIGIN      
        1 aagcttgggt tttgagtttt cattatttac ccaaaattca ttcaggagca ggttgtttaa
       61 tttccatgta attgcatagt tttcagctat tcattagtct tgaaatctat ttttattcct
      121 ctgtggtctg agagtgattg atatgatttt tttgcatttg ctgaggattg ttttaggtcc
      181 tcccaaggct gaaccaggaa gaaatcaatt tcctgaagag atccatgatg agctctgaaa
      241 atgattcagt aataaatagc ctatcaaaca aaaaaagccc tggacaatat gattcacaac
      301 caaattctat cagatgcaca aagaagagtt ggaaaaattc ctactgaagc tactccaaaa
      361 aattgagggg gagggattca acctcagctc attttatgag gccagcatca tcctgatacc
      421 aaaacctgat agacacacaa caacaaaaaa agaaaacttc aggccaatat ccttgaacat
      481 tgatgcaaaa atactagcaa gccaaatcca gcaacacatc acaaagctaa tccacctcaa
      541 tcaagtaggc tttatccctg agaagcaaga ttggttcagc atatacaaat tgataaatgt
      601 gattcatcat atatacagca ctaaaaacag aaaccacatg atcatttcaa tatgtgcaaa
      661 aaaagatttt gataaaattc aacttcactt catattaaaa cctctcaaca aactagctat
      721 ttaaggacat gacctcaaaa taatatgagc catctatgac aaacccacag ccaacatcat
      781 acagaatggg caaaacctgg aagcattctt cttgaaaact ggcacaagac aaggatgccc
      841 tctctcacca ctgctattca gtatagtact ggcagtcctg gccagagcaa tcaggcaaga
      901 gaagggaata aagggcatcc aaataggaag agagaaagtc aaagtatccc tgttttctgt
      961 atctagaaaa caccatagtc aaggcccaaa agctccttca gctgataact tcagcaaagt
     1021 ttcaggatat aaaatcaatg tacaaaaatc actagcattc ccatacatca acaatagcca
     1081 agcctagagc caaatcagga agtcaatccc atttaccgtt gccacaaaaa gaataaaata
     1141 cctaaggata tatctaacca aggagttgaa agatctctac aatgagattt acaaaacact
     1201 gctcaaagaa atcagagctg atagaaacaa atggaaaaac tttccatgct catggataaa
     1261 aagaatcaat attgttaaaa tgtccatact gcctagagca atttacagtt tcaaagctat
     1321 tcctatcaaa ctaccaatga cattcttcac agaactagaa aaatacactt taaagttcat
     1381 atggaacaac aacaaaaaaa gtccaaatag cctaggaaat cctaatactc acaggtggga
     1441 attgaaaaat gagaacactt ggacacaggg tggggaacat cacacaccgg ggcctgtcgt
     1501 ggggtggtag gagagnggag ggataacatt aggagatata tctaatgtaa atgacgagtt
     1561 aacgggtgca gcacaccaat atggcacacg tatacatatg taacaaactg cacaggtacc
     1621 gtagaactta gagtacaaaa aaaaaaanga gaaaaagaaa aaagctggag ataacacatt
     1681 gcctaacttc aaaccatacc acagggctac agtaaccaaa acagtntggt actgttacag
     1741 aaaagacaca cagacaaatg gaacagaata gagagcccag aaataagacc acatgcctac
     1801 aaccatctga tctttgacaa agctgataaa aacaagcaat gaggaaaaga caccctattc
     1861 aacaagtcgt gttgggataa ctgactaacc gtatgtagaa gattgaaacc agaccccgtt
     1921 ctatatcata tacacaaata aactcaagat ggattagaga cttaaatgca aagcccaaac
     1981 tataaaaatg cttgaaagca acctagacaa taccattccg gacataagca ggggcaaaga
     2041 tttcatgaca aagatgccaa gagtcattgc agcaaaagca aaaattgaca aataggatct
     2101 aattaaaata aagtttctgc acagcaaaag aaactatcaa ctgagtaaac aacctataga
     2161 atgggagaaa atttttgcaa actatgcatc caacaaaggc ctaatattca gaatctataa
     2221 ggaacttaaa tgaatctaca agaacaaaac aaacaatgcc acaaaaaatg ggtaaaggac
     2281 atgaacagac agtattcaaa agaagacata catgtggcca ccaagcatat gaaaaaa
//
LOCUS       BSUGLPKD     4259 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  B.subtilis glycerol kinase (glpK) and glycerol-3-phosphate
            dehydrogenase (glpD) genes, complete cds.
ACCESSION   M34393
KEYWORDS    glycerol kinase; glycerol-3- phosphate dehydrogenase.
SOURCE      B.subtilis (starin W168) DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 4259)
  AUTHORS   Holmberg,C., Beijer,L., Rutberg,B. and Rutberg,L.
  TITLE     Glycerol catabolism in Bacillus subtilis: Nucleotide sequence of
            the genes encoding glycerol kinase (glpK) and glycerol-3- phosphate
            dehydrogenase (glpD)
  JOURNAL   Unpublished (1990) University of Lund, Dept. of Microbiology,
            Solvegatan 21, S-223 62 Lund , SWEDEN
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.M.E.Holmberg, 22-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      679     ORF 1 (AA at 2)
    pept        698     2188     glycerol kinase (glpK) (EC 2.7.1.30)
    pept       2329     3996     glycerol-3-phosphate dehydrogenase (glpD) (EC
                                 1.1.99.5)
    pept       4128  >  4259     ORF 4
    binding     682      690     ribosome binding site
    binding    2315     2321     ribosome binding site
    binding    4116     4121     ribosome binding site
    signal     2243     2303     transcription termination signal
    signal     4000     4033     transcription termination signal
    signal     4067     4072     -35 region
    signal     4090     4095     -10 region
BASE COUNT     1269 a    853 c   1092 g   1045 t
ORIGIN      
        1 ccatggcgga tacgcggttg gcggcatcag cggtgcccat ttgaatccgg cgctaacgat
       61 agcgcttgca tttgtaggag attttccttg gaaagaggtt ccggtttata ttgcggcgca
      121 aatgatcgga gcgatcatcg gtgcggtgat tatttatctg cattacctcc cgcactggaa
      181 gtcaacggat gatcccgctg ccaagctggg tgttttctca acagggccca gcattccgca
      241 tacatttgca aacgttttaa gcgaagtgat tgggacattt gtccttgtac ttggaatctt
      301 ggccataggt gcaaatcaat ttacagaagg acttaatcct ttaatcgtcg gtttcctcat
      361 tgtagcaatc ggtatttctc tgggaggcac caccggctat gctatcaatc ctgcacgtga
      421 cttaggtccg cggatcgccc acgcttttct tccgattccg gggaagggct catcaaactg
      481 gaaatacgca tgggttccgg tagtcggccc gattttaggc ggatcattcg gcggtgtatt
      541 ttacaacgct gcatttaaag ggcacatcac aagcagcttc tggattgtaa gcgttatatt
      601 ggttgtggta ttgttaggac tctatgttta tacgaaatca cattctgcta aaacattatc
      661 aaattctaaa tatatttaat caaaggggag acatcttatg gaaacgtaca ttttatcctt
      721 agatcagggg acgacaagtt caagagcgat tctgtttaat aaagaaggca aaattgtcca
      781 ctctgctcaa aaggaattta cacaatactt cccgcatcct ggctgggttg agcataatgc
      841 caatgaaatt tggggctctg tcctcgcggt tatcgcctca gtcatctctg aatcaggaat
      901 cagcgcttct caaattgccg gcatcggcat cacgaaccag cgcgagacga cggttgtgtg
      961 ggataaagat acaggaagtc ctgtctataa tgcaatcgtt tggcagtcca gacagacgtc
     1021 cggcatttgt gaggaacttc gtgaaaaagg atataatgat aaattcagag aaaaaacagg
     1081 gcttttaatc gatccttact tctccggcac gaaggtgaag tggattttag acaatgtgga
     1141 aggcgcaaga gaaaaagcgg aaaaaggcga gctgctgttt ggaacgattg atacgtggct
     1201 catttggaaa atgtcaggcg gaaaagcgca tgtgaccgat tactccaatg cctcaagaac
     1261 actgatgttt aatatttacg atttaaaatg ggacgatcaa ctgctcgaca ttctaggcgt
     1321 accgaaatcc atgctccctg aagtgaagcc gtcctctcat gtgtatgcgg agactgttga
     1381 ttatcacttc ttcggaaaaa atatcccgat tgctggagcg gcaggcgacc agcagtccgc
     1441 attgttcggc caggcatgct ttgaagaagg catggggaaa aacacttacg gcacaggatg
     1501 tttcatgctg atgaataccg gggaaaaagc aattaagtcc gaacatgggc ttttgacaac
     1561 aatcgcttgg ggcattgacg gaaaagtgaa ctatgcgtta gaagggagca tttttgtcgc
     1621 aggctctgcc atccagtggc ttagagacgg tttgagaatg ttccaggatt catcgctaag
     1681 cgaatcttat gcagaaaaag tggattcaac tgacggcgtg tatgttgttc cagcatttgt
     1741 cggactggga acgccttact gggacagcga tgtgcgcggt tcggttttcg gcctgacaag
     1801 agggacaaca aaagagcact ttatccgtgc gacactggag tcattggctt atcagaccaa
     1861 agatgtgctt gacgcaatgg aagcagattc aaacatttca ttaaagacgc tccgtgtaga
     1921 cggaggagct gtaaaaaaca atttcctaat gcagttccaa ggagacctgt tgaatgttcc
     1981 tgtggagcgc ccggaaatta atgaaacgac tgcacttggc gcggcttatt tggcgggtat
     2041 cgctgtggga ttctggaagg accgttctga aatcgcgaac cagtggaatc tggataaacg
     2101 gtttgagcct gaattggaag aagaaaaacg aaatgagctg tataaaggct ggcaaaaagc
     2161 cgtgaaagca gctatggctt ttaaataaag taatactatg gtataatggt tacaagttaa
     2221 taagaacggt cctgagatga ggagagacca cagcaccaaa gtgtaagcat gcactttggc
     2281 tgttgtggtc tctttttcta tttaccgtga caacaaggag gaaacgtaat gatgaatcat
     2341 caattttcaa gtcttgaaag agatcgcatg ctgacagaca tgacgaaaaa aacatatgac
     2401 ctatttatta tcggaggagg aattacagga gccggaacag ctcttgacgc ggcatcaagg
     2461 ggaatgaagg tcgcactaag cgaaatgcag gactttgcgg cgggaacatc aagccgatca
     2521 acgaagctgg tacatggcgg cttgcgctat ttaaaacaat ttgaagtgaa aatggtcgct
     2581 gaggtaggaa aagagcgggc gattgtgtat gaaaacggcc cgcatgtcac tacaccggaa
     2641 tggatgctgc ttccgtttca taaaggcggc acattcggtt catttacaac atctattggg
     2701 ttaagggttt atgacttcct tgcaggtgtg aaaaagtcag aacgaagaag catgctttca
     2761 gcaaaagaaa cgctgcaaaa agagcctttg gtgaaaaaag acggcttaaa aggcggcggg
     2821 tactatgtgg aataccgcac tgacgatgcg agactgacca tcgaagtcat gaaggaagcg
     2881 gttaaattcg gggcagagcc tgtgaattac tccaaagtga aggagcttct ttacgaaaaa
     2941 ggcaaagccg tcggcgtatt aattgaagat gtgctgacaa agaaagaata taaagtgtat
     3001 gcgaaaaaaa ttgtcaatgc tacaggccct tgggtcgatc agctcagaga aaaagaccat
     3061 tcgaaaaacg gaaagcattt gcagcataca aaaggcattc accttgtatt tgaccagtct
     3121 gtctttccgc tgaaacaggc tgtatatttc gatacacctg atggccggat ggtatttgcg
     3181 attcctcgtg aaggcaaaac atacgtggga acaacagaca ctgtttacaa agaggcgctg
     3241 gagcatccgc ggatgacaac ggaggatcgt gattatgtca tcaaatcaat caattacatg
     3301 ttcccggaac tgaatatcac tgcgaatgac atcgaatcca gctgggcggg gctgcgtccg
     3361 ctgattcatg aagaaggcaa agatccttct gaaatttcac ggaaagacga gatttggaca
     3421 tctgactcag gcctgatcac cattgccggc ggaaaactga ctggatacag aaaaatggcg
     3481 gacgacatcg ttgatcttgt ccgtgatcgc ttaaaagaag agggcgaaaa ggattttgga
     3541 ccatgtaaaa cgaaaaacat gccaatctct ggcgggcacg tcggcggttc gaaaaatctt
     3601 atgtccttcg ttaccgcgaa aacaaaagaa ggaattgcag ccggtttatc agaaaaagac
     3661 gcaaaacagc ttgcgatcag atacggctct aacgtagatc gcgtctttga ccgggtagaa
     3721 gcgctgaaag atgaggccgc gaaacgcaac attccggttc atattcttgc tgaggcagaa
     3781 tacagtatag aagaagagat gactgcaacc cctgctgact tctttgtccg cagaacggga
     3841 cgtttatttt ttgatatcaa ttgggtaaga acatataaag atgccgttat tgattttatg
     3901 agcgagcgat tccaatggga tgagcaggcg aaaaacaaac atacagaaaa cctcaacaag
     3961 cttttacacg atgcggtcgt accgcttgag caataaatca taacgggctg tctgcagccc
     4021 gttatttctt tttacgtgcc gaaaggggga gatctcaggt tatcaagtga cagaggcacc
     4081 cgcttcatat aaaataatgg cataagctga tacataggag gacgaatatg acttggagaa
     4141 agagctatga acgctggaaa cagacagaac atttagatct ggaattaaaa gagcgcctta
     4201 ttgaattaga gggagatgaa caggcccttg aggactgttt ctataaagac cttgaattc
//
LOCUS       CLCMTPLAS     379 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  C.purpurea mitochondrial plasmid terminal 5' inverted repeat (TIR).
ACCESSION   M35177
KEYWORDS    .
SOURCE      C.purpurea (strain K) mitochondrial plasmid pClK1 DNA.
  ORGANISM  Claviceps purpurea
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Clavicipitales; Clavicipitaceae.
REFERENCE   1  (bases 1 to 379)
  AUTHORS   Duevell,A., Hessberg-Stutzke,H., Oeser,B.,
            Rogmann-Backwinkel,P. and Tudzynski,P.
  TITLE     Structural and functional analysis of mitochondrial plasmids in
            Claviceps purpurea
  JOURNAL   Mol. Gen. Genet. 214, 128-134 (1988)
  STANDARD  simple staff_review
BASE COUNT       97 a     76 c     64 g    142 t
ORIGIN      
        1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt
       61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact
      121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa
      181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac
      241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta
      301 aactttccgc taaaacccgc acgtcttcgt ctgcatggta tgggcgttag attctattct
      361 acaatgtctt ccgccaaaa
//
LOCUS       CLCMTPLASX    379 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  C.purpurea mitochondrial plasmid 3' terminal inverted repeat (TIR).
ACCESSION   M35178
KEYWORDS    .
SOURCE      C.purpurea (strain K) mitochondrial plasmid pClK1 DNA.
  ORGANISM  Claviceps purpurea
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Clavicipitales; Clavicipitaceae.
REFERENCE   1  (bases 1 to 379)
  AUTHORS   Duevell,A., Hessberg-Stutzke,H., Oeser,B.,
            Rogmann-Backwinkel,P. and Tudzynski,P.
  TITLE     Structural and functional analysis of mitochondrial plasmids in
            Claviceps purpurea
  JOURNAL   Mol. Gen. Genet. 214, 128-134 (1988)
  STANDARD  simple staff_review
BASE COUNT      105 a     75 c     62 g    137 t
ORIGIN      
        1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt
       61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact
      121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa
      181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac
      241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta
      301 aactttccgc taaaacccgc acgtcttact ttagctcaac aaagaaagcg tagtgaactt
      361 aaaaaagcac gtcttgctc
//
LOCUS       DDISP60       328 bp ds-DNA             INV       08-AUG-1990
DEFINITION  D.discoideum spore coat 60 (sp60) gene, 5' flank.
ACCESSION   M34546
KEYWORDS    spore coat 60.
SOURCE      D.discoideum (strain Ax4) DNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 328)
  AUTHORS   Fosnaugh,K., Loomis,K. and Loomis,W.F.
  TITLE     Coordinate regulation of the spore coat genes in Dictyosteiium
            discoideum
  JOURNAL   Dev. Genet. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable  sequence for [1] kindly
            submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT      168 a     15 c     11 g    134 t
ORIGIN      
        1 aaaacataga acaaatttaa ataattattt tttattaaat tgtattattt ttttttatta
       61 tttatttatt ttttattttc aaaaaaaaaa aaaaaaagtt aacagacaaa aagtataatc
      121 tatttaatta ttcaaaaaaa aaaatattaa atcattgtag tattttgttc atattcaaat
      181 taaatatcaa tacaattaaa aaaagttaaa atgtaattat taatatatat atatatttat
      241 caatatataa tttaatataa aaaaaaaaaa gaaatttaat aaacataaat atttataaaa
      301 tcataaattt aaaataaata attttttt
//
LOCUS       DDISP70       563 bp ds-DNA             INV       08-AUG-1990
DEFINITION  D.discoideum spore coat 70 (sp70) gene, 5' flank.
ACCESSION   M34547
KEYWORDS    spore coat 70.
SOURCE      D.discoideum (strain Ax4) DNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 563)
  AUTHORS   Fosnaugh,K., Loomis,K. and Loomis,W.F.
  TITLE     Coordinate regulation of the spore coat genes in Dictyosteiium
            discoideum
  JOURNAL   Dev. Genet. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT      207 a     73 c     47 g    236 t
ORIGIN      
        1 gaagtgtgag ttgcgcagtg tgttagtgtg ggtttaatga gagtgttttt tttttttttt
       61 tttttttttt tgaacgtaca tatttatttg tttttattaa tatttcataa atgaatttgt
      121 aaatataaaa ataaattgtt tttttttatt ttattttttt gattatgaca ataatttcaa
      181 taaaatgtaa aatatgataa aaataatttt aattattgaa acacacacac tcactcactt
      241 tttcccacta gtattacact ttccaatgaa aaataaatta atattgaaaa agaaggaaat
      301 ggaaaaaaat gaaaaataaa aataattaaa taaacaataa tattacaaac acactatttc
      361 tttctcaccc acacaaccaa tatactaaca cttctcatac acccaccaac acacctacac
      421 ccagttcttc acattcatac acaaagttta atattacact ttaaaaaaca ttcatattaa
      481 tgatttttat gatttattta tttattaatt tatttaattg tgtattttga ttattgctct
      541 ctacttttta tatatttttt ttc
//
LOCUS       DDISP96       682 bp ds-DNA             INV       08-AUG-1990
DEFINITION  D.discoideum spore coat 96 (sp96) gene, 5' flank.
ACCESSION   M34548
KEYWORDS    spore coat 96.
SOURCE      D.discoideum (strain Ax4) DNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 682)
  AUTHORS   Fosnaugh,K., Loomis,K. and Loomis,W.F.
  TITLE     Coordinate regulation of the spore coat genes in Dictyosteiium
            discoideum
  JOURNAL   Dev. Genet. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by K.Fosnaugh, 24-MAY-1990.
BASE COUNT      321 a     37 c     63 g    261 t
ORIGIN      
        1 tgaaaaaaaa aaaattttgt gtgtgatagt ttgtcaaaaa tcatttgaaa ttaaaatcat
       61 attataatat atattaaaaa aatgttgtat ctttaatttt ttttcatact agtgtgttaa
      121 atgaattaaa gtgtggtaat gaataaatga aaaaaaaaaa aaaattaaag tgagagagtg
      181 tgtaaaagtg gtaatgaatg tgtgtttgag tgtgttaaaa atgtgtttac aaaatataat
      241 aatttaaatt cattcatatt caactatgga gaaagcgtgt tttcaaacac aataaatgtg
      301 taatgaaaaa tacaattaaa taattattta attttttgta tgattaaaaa aaaaagagta
      361 aattagtagt gccaccctaa aacaataata ataataatta aataatatta atatttattt
      421 aattttttta tttaattttt tttcaaaaca attcataata taaatttatt cttttttttt
      481 tttttttttt tttttaaaaa tattcaactc agtaataata ataataaaaa taataacaat
      541 aataaggaat tataatcaaa taataatata gctataatct ataattacta tagtaaaatt
      601 acgtatacat atataaataa ttattattat cattgttttt aaaaataaat aaataaataa
      661 taaaaaaaaa aaaaaaaaaa aa
//
LOCUS       GLAA2GIA     1870 bp ds-DNA             INV       08-AUG-1990
DEFINITION  Giardia lamblia alpha-2 giardin gene, complete cds.
ACCESSION   M34550
KEYWORDS    alpha-2 giardin.
SOURCE      G.lamblia (strain Portland-1) trophozoite DNA, clone pG2.
  ORGANISM  Giardia lamblia
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae.
REFERENCE   1  (bases 1 to 1213)
  AUTHORS   Alonso,R.A. and Peattie,D.A.
  TITLE     Genomic organization of alpha giardin genes in Giardia lamblia
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1214 to 1870)
  AUTHORS   Alonso,R.A. and Peattie,D.A.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.A.Alonso, 25-MAY-1990.
            
                            Author address: R.A.Alonso
                            Dept of Tropical Public Health
                            Harvard School of Public Health
                            665 Huntington Avenue, Boston, MA 02115
FEATURES       from  to/span     description
    pept        215     1105     alpha-2 giardin
BASE COUNT      466 a    468 c    507 g    429 t
ORIGIN      
        1 atactgcaga tacacctcat ggtgtacatg ctggcggcaa gtatcgtata aactctatgg
       61 agacatgggg cagaccttgt ctcgctgctt tgacgagcga atggccccta ttttgcgcac
      121 gctatggcgc gtcacagatg cataggataa gcgaactcat gatggaaatt caaattacct
      181 taaaatatat tttctgagcg cttacattta gaaaatgccg aagctatccc agatcgtcgc
      241 tgacatgaag caggccatcg atgccaagga cgaggcccag atcgcgttca tagcgtcgga
      301 atactctgca gatgcgcgtc agaggattgc gcaggggtac agggaccaat acggcaagga
      361 gctccccgac gacatcaaga aggcgctcaa gggcggctcc gaggagagcc tcctgatgga
      421 tctcttcagc gatcgtcacg aggtccgcgc ccagcacatc cgcgacgcgc tttccggcaa
      481 gaacgaccac atggccttct tcgacacagt cattctgtgc acccccgaag actggcacga
      541 gaccgtcgcc gcctacacga ggatgttcaa gaagcccctc gtcgaggact tcatgaagga
      601 tgtcggccgc aaggagaatt ggtgtctttt catggagaag tggatggctc acgagcgcac
      661 tagccgtgaa ggcagccctg acgaggaagc agagaaactg aacaaggcgt tcagtgagag
      721 cgaccacgac tacatttcca gcttcatggc tggggtgcct cccgaagaat acaagtccat
      781 caacaccagc ttcaagagcc ttacaggaaa gggcatcgac caggccttcg ccaccatcta
      841 tacggggact gactactact ccctgtactg cgcccacttt gctcttcttg gcatgcacaa
      901 gcttgccgcc tacctggtca actgcgcctg caacgataag ggcgacgaga agcgcatgag
      961 gaggatcacg ggcatgatgg tcgacaagtg cttggctgca aagtatgcct acaagaccta
     1021 cgggagtatg aaagccgacg tcgaaaggtg ctttgataag cgcatggctc cgatcctctg
     1081 taccctctgg cgcctaaggg agtgagcgcc tttactgcgg ggtttccttt cgctagtgaa
     1141 ttgcctagcg ggtgtttact gccttctgtt tctttttgcc ctgtcctggc ggtccaagat
     1201 gctatcctgc gatgagaggt ctttcgagaa cggatgcgag aagtctggga ttgctacatg
     1261 ccgtccacct gactgtgagg gtgagtagga ctggagtagc ggcaaggagt gggagctcat
     1321 gttgtccata gagctgtatg cctgatcaat ctgcccattt cctgcttgtc ctctgcgggt
     1381 tttttgtttc agatctgagc tgcctaaccg agaggggccg tactctatgc ttagttctct
     1441 attaaccaac tgaacagcct ttgctcgctt gcttgccagt gtatatttat atctatattt
     1501 caagaacaga gagaggaccg tgatgaagag gagcaggatg agtgtcgcaa cggaccagaa
     1561 cacgacaata agggctatag acatcttaga atgataacca acaaagctta tgaactcctc
     1621 tccactgaaa gaagaggaaa gataccgagt ctcgtaaaac tgaatggtag tcctatcgat
     1681 ggtgtgcccc cagatagggt ctactatcgg ctggttggta agattgcttg cagcggtatc
     1741 gacagctgcc acaaccacta agtagtcaag gctcgtgctt gtacgaagga aggtcaacgg
     1801 gaatgcgaag catggaccag gtgttatttg cttgctcgat aaactcagga gtatatattt
     1861 cagtattgtc
//
LOCUS       KAEHUTC      1625 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  K.aerogenes histidine utilization repressor C (hutC) gene,
            complete cds.
ACCESSION   M34604
KEYWORDS    histidine utilization repressor C.
SOURCE      K.aerogenes (strain W-70) DNA.
  ORGANISM  Klebsiella aerogenes
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1625)
  AUTHORS   Schwacha,A. and Bender,R.A.
  TITLE     Nucleotide sequence of the gene encoding the repressor for
            histidine utilization genes of Klebsiella aerogenes
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.A.Bender, 29-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      669     histidine utilization repressor G (AA at 1)
    pept        731     1456     histidine utilization repressor C (hutC)
    pept       1599  >  1625     histidine utilization repressor U
    signal      667      686     hutC promoter
    signal     1522     1493     Pc promoter
    signal     1531     1557     Puh promoter
    binding    1475     1495     CRP binding site
    site         34       61     helix-turn-helix
BASE COUNT      311 a    476 c    508 g    330 t
ORIGIN      
        1 gtcgacggcg agcagttaga agccgcgcat caggcgctgc gcgaggcggt ggccgactgt
       61 cagcgggcgg gtaaacgcac gctggtcctg ggcggcggcc acgagacggc cttcggccac
      121 ggcgctgggg tactggatgc cttcccaggc gaaaaagtgg gcattattaa tctcgatgcg
      181 catctggatc tgcgctttgc cgactgcgcc agctccggga cgccgttccg ccagttggcg
      241 ctggagtgtg acgcgcagca gcgcggtttt cactatacct gcatcggggt tagccgggcg
      301 gcgaacaccc aggcgctatg ggatgaagcg gcgcgccgcc aggtggctat cgttgaagat
      361 ctggaggtgc tgacagcctt cgaaactcgc gtgttgccgg agcttgagcg caatatcgcg
      421 caattcgacc gcctgtatct gactatcgat ctcgacgtgc tgccggcgcg agaaatgccg
      481 gcggtgtcgg ccccggcggc gctgggcgtg ccgctgggca cgctgctgcg catcgttgag
      541 ccgctgtgcc gcagcggtaa gctgcaggcg gtggatctgg tggagtttaa cccgctgttt
      601 gacattgacg gtcagggcgc tcgcgcggcg gcccgtgtgg catggcaaat cgcccattgg
      661 tggcgctagc gatccgcgct attattttcg tttcgccgct ccgcttatta cgcataagga
      721 aagccaggcc atgtttgcac aacaacctcg ttccgcgccc gcgcctttct atgaaaaggt
      781 gaagcaggcg atcagcgaaa agatccacag cggcgtctgg cgaccgcatg accgtatccc
      841 gtcggaagct gagctggtgg cccagttcgg cttcagccgg atgaccatca atcgcgcgct
      901 gcgcgagctg acggacgaag gcctgctggt caggttgcaa ggggtgggaa cctttgtcgc
      961 cgagccgaaa gggcagtcgg cgctgttcga ggtgcgcagt attgccgcag aaattgtcgc
     1021 ccgtcaccat cagcaccgct gtgaggtact cctgcttgag gagacccggg cggatcatat
     1081 tcaggcgacg gcgctcagcg tcccggaagg cacccgcatc tttcactcgc tgatggtgca
     1141 ttacgaaaac gaggtgccgg tgcaaatcga agatcgctgc gtcaatgccg cagtggtacc
     1201 ggactatctg catcaggact acaccgccac cacgccccat gattacctgt cgctgatcgc
     1261 gccgttaacc gaaggtgaac atattgttga agcggtgcag gccacggcgg aagagtgcgc
     1321 gctgctgcat attcacgccc acgatccgtg cctgctgatc cgtcgtcgca cctggtcgac
     1381 aacccacatc gtctcccacg cgcgcctgct gttccccggt agccgatatc gtctgcaggg
     1441 gcgctttggc tcctgatcca cgctggcgtc agcaaaacgt gattgctgac gcaatataac
     1501 aaaattgtat catttctgtt aaatcctggc ttgcgcatgc ttgtatagac aagtatatgt
     1561 atctacgtaa acaacgttta ttgtcaggag aaaccccgat gtcgcaaagc aaatatcgcc
     1621 agctg
//
LOCUS       MUSMHH2KBM     66 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Mouse major histocompatibility complex antigen H-2Kbm29 mRNA.
ACCESSION   M34542
KEYWORDS    major histocompatibility complex antigen.
SOURCE      Mouse ((C3H.CAS3(R4)XB6)F2, mutant #6414(bm29)), liver, cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 66)
  AUTHORS   Horton,R.M., Loveland,B.E., Parwani,A., Pease,L.R. and
            Fischer Lindahl,K.
  TITLE     Immunological characterization and molecular analysis of the
            H-2Kbm29 mutant
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Weiss,E., Golden,L., Zakut,R., Mellor,A., Fahrner,K., Kvist,R. and
            Flavell,R.A.
  TITLE     The DNA sequence of the H-2Kb gene: Evidence for gene conversion as
            a mechanism for the generation of polymorphism in
            histocompatibility antigens
  JOURNAL   EMBO J. 2, 453-462 (1983)
  STANDARD  full automatic
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.M.Horton, 24-MAY-1990.
            
            Mutation in codon 89 changes bp 43 and 44 to "gc", otherwise this
            sequence is identical to reference 2.
FEATURES       from  to/span     description
    pept    <     1  >    66     MHC H-2Kbm29 (AA at 3)
BASE COUNT       14 a     22 c     19 g     11 t
ORIGIN      
        1 cgagtggacc tgaggaccct gctcggctac tacaaccaga gcgcgggcgg ctctcacact
       61 attcag
//
LOCUS       YSCKGD2      2112 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  S.cerevisiae dihydrolipoyl transsuccinylase (KGD2) gene,
            complete cds.
ACCESSION   M34531
KEYWORDS    alphaketogluterate dehydrogenase complex;
            dihydrolipoyl transsuccinylase.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2112)
  AUTHORS   Repetto,B. and Tzagoloff,A.
  TITLE     Structure and regulation of KGD2, the structural gene for yeast
            dihydrolipoyl transsuccinylase
  JOURNAL   Mol. Cell. Biol. 10, 4221-4232 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Tzagoloff, 29-MAY-1990.
FEATURES       from  to/span     description
    pept        560     1987     dihydrolipoyl transsuccinylase
    mRNA        306  >  1987     dihydrolipoyl transsuccinylase mRNA
    mRNA        451  >  1987     dihydrolipoyl transsuccinylase mRNA (alt.)
    signal      196      200     CAAT box
    signal      228      232     CAAT box
BASE COUNT      649 a    435 c    425 g    603 t
ORIGIN      
        1 tcgaacgtct cgtggaaagc cttgaccgtt gcgtctttac tcgtctctat tcaaccctgt
       61 ggtcctatta gaaccgatct ttgttccatc gtcattacta cttcttgtct cttctctctt
      121 ggggaataat aaacaaaaca attccagcca cccgcatctc cgtcaaagac gatattaaca
      181 accctttttt tgcagccaat caaaactaag gtcctgcccg gagtgagcca ataaagcggc
      241 cacaaattac gcaggaatct atggttccta aagcttattc ttattctaca ttgacaccat
      301 tctactacag ggctcggatc cacacttggt ctatcagcaa atgcaaatcc tttttccctc
      361 aactaggcct acttttcata tataatctct ttttgagctt caggggagga aatatctaac
      421 gtaaaaaaaa gaaaaatccg aaataaaaaa ttaggcagct atctttaacg gtatatatca
      481 gtgaatagat atattgttat aaacttcact accacatttg ttacaaccaa agacacaact
      541 tcagataatt atttaaacaa tgctttccag agcgacgcgt actgcagctg ccaaatcctt
      601 agtaaaatct aaagtggcta gaaatgttat ggctgcttct ttcgtcaaga gacatgcttc
      661 tacaagtttg ttcaaacaag ctaacaaggt cgaatcctta ggttcaatat atttatccgg
      721 caagaaaatt tcagttgcgg cgaatccgtt ctccataact agcaatcgtt ttaaatctac
      781 ctctattgaa gttcctccga tggcagagtc cctgactgaa ggctctttaa aggaatatac
      841 taaaaacgtt ggtgatttta ttaaggagga cgagctgttg gccactattg agaccgataa
      901 aattgatatt gaggtcaatt cgccagtatc aggtactgtt acgaagctaa atttcaaacc
      961 agaggacact gtcactgttg gtgaggagtt agctcaggtc gagcctggtg aagcacctgc
     1021 tgagggttct ggagaatcta agccagagcc taccgaacaa gcggagcatc gcaaggtgtc
     1081 gccgcaaggg aaaactcaag tgaggaaacg gcttcaaaga aagaagctgc tccaaagaaa
     1141 gaagccgctc caaagaaaga agttacagaa ccaaaaaagg actgatcaac caaagaagac
     1201 cgtctctaag gcgcaggaac ccccagtagc ctctaactct ttcacaccat ttccacgtac
     1261 agaaaccagg gtcaaaatga accgtatgag attgaggatt gccgaaagat taaaagagtc
     1321 tcaaaacact gctgcttcct taaccacatt caacgaagtt gacatgtcag ctttgatgga
     1381 aatgaggaaa ctgtataaag atgagattat taagaagacc ggtactaaat tcggattcat
     1441 gggtcttttc tccaaagcat gtaccttggc cgccaaggat attccagccg tcaatggtgc
     1501 cattgaaggt gaccagattg tttatcgtga ttacacagat atttctgttg ctgtggccac
     1561 tccaaagggt ttggttaccc ccgtcgttcg taatgcagag tcattgagtg ttttagatat
     1621 tgagaacgaa attgttcgct tgagtcataa agcgcgtgat ggcaaattaa ccctagaaga
     1681 tatgacgggt ggtactttca ccatatctaa tggtggtgtt tttggttcat tatacggtac
     1741 tcctatcatc aattcaccac aaacagccgt cctaggcttg catggtgtca aagagagacc
     1801 tgtcactgtt aatggacaaa ttgtctcaag accaatgatg tacttggctt tgacttatga
     1861 tcatagattg ctagatggtg agaagctgtt atccttcttg aagactgtta aagagttgat
     1921 tgaagaccct agaaaatgtt gttatggtga tttgaaattt gcagcccata ccaatctgat
     1981 ttcatgatta ttcttgtccg ctattactgt gtgaatctct tctatttaag tatgatattt
     2041 aaagatatgc tatgatgata aatggaacta tgaattaatg ttttcttttt ataatgagat
     2101 atggaaaagc gc
//
LOCUS       RATALADPS    1612 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Rat delta-aminolevulinate dehydratase processed pseudogene,
            complete cds.
ACCESSION   J04764
KEYWORDS    delta-aminolevulinate dehydratase; pseudogene.
SOURCE      Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1612)
  AUTHORS   Bishop,T.R., Frelin,L.P. and Boyer,S.H.
  TITLE     Isolation and characterization of a rat delta-aminolevulinate
            dehydratase processed pseudogene
  JOURNAL   Genomics 7, 629-632 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.R.Bishop, 20-APR-1990.
FEATURES       from  to/span     description
    pept.ps     271     1264     pseudo-delta-aminolevulinate dehydratase
    rpt         101      108     8 bp flanking direct repeat
    rpt        1385     1392     8 bp flanking direct repeat
BASE COUNT      404 a    411 c    418 g    379 t
ORIGIN      
        1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt
       61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt
      121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca
      181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta
      241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc
      301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa
      361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct
      421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga
      481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca
      541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg
      601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat
      661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg
      721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat
      781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac
      841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat
      901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga
      961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca
     1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct
     1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa
     1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc
     1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga
     1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa
     1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga
     1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta
     1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt
     1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg
     1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at
//
LOCUS       MCAAAF1       229 bp ds-DNA             VRL       08-AUG-1990
DEFINITION  Cauliflower mosaic virus aphid acquisition factor 5' end.
ACCESSION   M32809
KEYWORDS    aphid acquisition factor.
SOURCE      Cauliflower mosaic virus viral DNA, clone pCMS31.
  ORGANISM  Cauliflower mosaic virus
            Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE   1  (bases 1 to 229)
  AUTHORS   Vaden,V.R. and Melcher,U.
  TITLE     Recombination sites in Cauliflower mosaic virus DNAs: Implications
            for mechanisms of recombination
  JOURNAL   Virology 177, 717-726 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by U.K.Melcher, 13-MAR-1990.
FEATURES       from  to/span     description
    pept    /    36  >   229     aphid acquisition factor (AA at 38)
    IVS     <     1       35     intron X
BASE COUNT       88 a     43 c     40 g     58 t
ORIGIN      
        1 aattaatcat cttaacaacc tcaatgagat tgtaggaaga agcttactcg gaatatggaa
       61 gatcaactca tacttcggac taagcaaaga cccttcggag tccaaatcaa aaaacccgtc
      121 agtttttaat actgcaaaaa ccatttttaa gagtgggggg gttgattact cgagcctaaa
      181 ggaaataaaa tcccttttag aagctcaaaa cactagaatt aaaagtcta
//
LOCUS       MCAAAFCP     1125 bp ds-DNA             VRL       08-AUG-1990
DEFINITION  Cauliflower mosaic virus aphid acquisition factor (3' end), and
            coat protein (5' end) genes.
ACCESSION   M32813
KEYWORDS    aphid acquisition factor; coat protein.
SOURCE      Cauliflower mosaic virus viral DNA, clone pLW303S.
  ORGANISM  Cauliflower mosaic virus
            Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE   1  (bases 1 to 1125)
  AUTHORS   Vaden,V.R. and Melcher,U.
  TITLE     Recombination sites in Cauliflower mosaic virus DNAs: Implications
            for mechanisms of recombination
  JOURNAL   Virology 177, 717-726 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by U.K.Melcher, 13-MAR-1990.
FEATURES       from  to/span     description
    pept    <     1      246     aphid acquisition factor (AA at 1)
    pept        248      637     ORF
    pept        619  >  1125     coat protein
BASE COUNT      435 a    218 c    223 g    249 t
ORIGIN      
        1 aaaaacccgt cagtttttaa tactgcaaaa accattttta agagtggggg ggttgattac
       61 tcgagccaac taaaagaaat aaaatctctt ttagaatctc aaaatactag aattaaaagt
      121 ctagaaaaag caattcaatc cttagagcgt aagattgaac cagagccctt aactaaagaa
      181 gaagttaaag agcttaaaga atcgattaac tcgatcaaag aaggattaaa gaatattatt
      241 ggctaaaatg gctaatctta atcaaatcca aaaagaagtc tctgaaatcc tcagtgacca
      301 aaaatccatg aaagcggata taaaagctat cttagaacta ttaggatccc aaaatcctat
      361 taaagaaagc ttagaagctg ttgcagcgaa aatcgttaat gacttaagca agctcatcaa
      421 tgattgcccc tgtaacaaag agatattaga agccttaggc aatcagccta aagagcaact
      481 aatagaacaa cctaaagaga aaggcaaagg ccttaatcta ggaaaatatt cttaccccaa
      541 ttacggagta ggaaatgaag aattaggatc ctctggaaac cctaaagctt taacctggcc
      601 cttcaaagct ccagcaggat ggccgaatca attttagaca ggaccattaa ccggttctgg
      661 tataatctgg gagaagattg tctctcagaa agtcaatttg accttatgat aaggttaatg
      721 gaagagtccc ttgacgggga ccaaattatt gatctaacct ctctacctag tgataatttg
      781 caggtcgaac aggttatgac aactaccgaa gactcgatct cggaagaatc agaattcctt
      841 ctagcaatag gagaaacatc tgaagacgaa agcgattcag gagaagaacc tgaattcgaa
      901 caagttcgaa tggatcgaac aggaggaacg gagattccca aagaagaaga tggtgaacca
      961 tctagacaca atgagagaaa gagaaagacc ccggaagacc ggtactttcc aactcaacca
     1021 aagaccattc caggacaaaa acaaacgtct atgggaatgc tcaacatcga ctgccaaacc
     1081 aatcgaagaa ccttaatcga cgactgggca gcagaaatcg gactg
//
LOCUS       MCACPX        106 bp ds-DNA             VRL       08-AUG-1990
DEFINITION  Cauliflower mosaic virus coat protein, partial cds.
ACCESSION   M32814
KEYWORDS    coat protein.
SOURCE      Cauliflower mosaic virus viral DNA, clone pLW303S.
  ORGANISM  Cauliflower mosaic virus
            Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE   1  (bases 1 to 106)
  AUTHORS   Vaden,V.R. and Melcher,U.
  TITLE     Recombination sites in Cauliflower mosaic virus DNAs: Implications
            for mechanisms of recombination
  JOURNAL   Virology 177, 717-726 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by U.K.Melcher, 13-MAR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   106     coat protein precursor (AA at 2)
BASE COUNT       52 a     16 c     18 g     20 t
ORIGIN      
        1 atgcaagaaa acatccaaga agaagtatca taagcgatac aagaagaaat ataaggtcta
       61 taaaccttat aagaagaaga agaaattccg atccggaaaa tacttc
//
LOCUS       MCACTCTP      221 bp ds-DNA             VRL       08-AUG-1990
DEFINITION  Cauliflower mosaic virus cell-to-cell transport protein, 5' end.
ACCESSION   M32812
KEYWORDS    cell-to-cell transport protein.
SOURCE      Cauliflower mosaic virus viral DNA, clone pLW303S.
  ORGANISM  Cauliflower mosaic virus
            Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE   1  (bases 1 to 221)
  AUTHORS   Vaden,V.R. and Melcher,U.
  TITLE     Recombination sites in Cauliflower mosaic virus DNAs: Implications
            for mechanisms of recombination
  JOURNAL   Virology 177, 717-726 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by U.K.Melcher, 13-MAR-1990.
FEATURES       from  to/span     description
    pept         43  >   221     cell-to-cell transport protein
BASE COUNT       93 a     40 c     32 g     56 t
ORIGIN      
        1 ctctaacgag tatccacaga aaaaataacc ttctgtgttg agatggattt gtatccagaa
       61 gaaaataccc aaagcgagca atcgcagaat tctgaaaata atatgcaaat atttaaatca
      121 gaaaattcgg atggattctc ctccgatcta atgatctcaa acgatcaatt aaagaatatc
      181 tctaaaaccc aattaacctt ggaaaaagaa aagatattca a
//
LOCUS       MCAMPX        972 bp ds-DNA             VRL       08-AUG-1990
DEFINITION  Cauliflower mosaic virus matrix protein (put.), partial cds.
ACCESSION   M32811
KEYWORDS    matrix protein.
SOURCE      Cauliflower mosaic virus viral DNA, clone pLW303S.
  ORGANISM  Cauliflower mosaic virus
            Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE   1  (bases 1 to 972)
  AUTHORS   Vaden,V.R. and Melcher,U.
  TITLE     Recombination sites in Cauliflower mosaic virus DNAs: Implications
            for mechanisms of recombination
  JOURNAL   Virology 177, 717-726 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by U.K.Melcher, 13-MAR-1990.
FEATURES       from  to/span     description
    pept    <     1      187     matrix protein (put.) (AA at 2)
    ORF         895  >   972     ORF 7
    mRNA    <     1      283     35S RNA
    signal      253      257     35S TATA box
    binding     882      895     primer binding site
BASE COUNT      321 a    209 c    200 g    242 t
ORIGIN      
        1 ctgtcacttc atcgaaagga cagtagaaaa ggaaggtggc tcctacaaat gccatcattg
       61 cgataaagga aaggctatca ttcaagatgc ctctgccgac agtggtccca aagatggacc
      121 cccacccacg aggagcatcg tggaaaaaga agacgttcca accacgtctt caaagcaagt
      181 ggattgatgt gacatctcca ctgacgtaag ggatgacgca caatcccact atccttcgca
      241 agacccttcc tctatataag caagctcatt tcatttggag aggacacgct gaaatcacca
      301 gtctctctct acaaatctat ctctctctct attctccata ataatgtgtg agtagttccc
      361 agataaggga attagggttc ttatagggtt tcgctcatgt gttgagcata taagaaaccc
      421 ttagtatgta tttgtatttg taaaatactt ctatcaataa aatttctaat tcctaaaacc
      481 aaaatccagt actaaaatcc agatctccta aagtccctat agatctttgt cgtgaatata
      541 aaccagacac gagacgacta aacctggagc ccagacgccg tttgaagcta gaagtaccgc
      601 ttaggcagga ggccgttagg gaaaagatgc taaggcaggg ttggttacgt tgactccccc
      661 gtaggtttgg tttaaatatg atgaagtgga cggaaggaag gaggaagaca aggaaggata
      721 aggttgcagg ccctgtgcaa ggtaagaaga tggaaatttg atagaggtac gttactatac
      781 ttatactata cactaaggga tgcttgtatt ttaccctata ccccctaata accccttatc
      841 gatctaaaga aataatccgc ataagccccc gcttaaaaaa ttggtatcag agccatgaat
      901 aggtctatga ccaaaactca agaggataaa acctcaccaa aatacgaaag agttcttaac
      961 tctaaagata aa
//
LOCUS       MCAORFIX1      74 bp ds-DNA             VRL       08-AUG-1990
DEFINITION  Cauliflower mosaic virus unidentified protein gene, partial cds.
ACCESSION   M32808
KEYWORDS    .
SOURCE      Cauliflower mosaic virus viral DNA, clone CMS31.
  ORGANISM  Cauliflower mosaic virus
            Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE   1  (bases 1 to 74)
  AUTHORS   Vaden,V.R. and Melcher,U.
  TITLE     Recombination sites in Cauliflower mosaic virus DNAs: Implications
            for mechanisms of recombination
  JOURNAL   Virology 177, 717-726 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by U.K.Melcher, 13-MAR-1990.
FEATURES       from  to/span     description
    pept    <     1  /    34     unidentified protein (AA at 2)
    IVS          35  >    74     intron X
BASE COUNT       29 a     12 c     13 g     20 t
ORIGIN      
        1 attatcagtg gacattcacg atgccacagg taaggtatat cttcctttaa tcactaaaga
       61 ggagataaat aaaa
//
LOCUS       MCAORFX2      178 bp ds-DNA             VRL       08-AUG-1990
DEFINITION  Cauliflower mosaic virus unidentified protein gene, 5' end.
ACCESSION   M32810
KEYWORDS    .
SOURCE      Cauliflower mosaic virus viral DNA, clone pCMS31.
  ORGANISM  Cauliflower mosaic virus
            Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE   1  (bases 1 to 178)
  AUTHORS   Vaden,V.R. and Melcher,U.
  TITLE     Recombination sites in Cauliflower mosaic virus DNAs: Implications
            for mechanisms of recombination
  JOURNAL   Virology 177, 717-726 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by U.K.Melcher, 13-MAR-1990.
FEATURES       from  to/span     description
    pept         94  >   178     Unidentified protein
    binding      82       94     primer binding site
BASE COUNT       68 a     38 c     24 g     48 t
ORIGIN      
        1 tgcttgtatt ttaccctata taccctaata accccttatc gatttaaaga aataatccgc
       61 ataagccccc gcttaaaaat tggtatcaga gccatgaatc ggtttaaaga ccaaactcaa
      121 gagggtaaaa cctcatcaaa atacgaaaga gttcttaact ctaaagataa aagatctt
//
LOCUS       MCARTRA       467 bp ds-DNA             VRL       08-AUG-1990
DEFINITION  Cauliflower mosaic virus reverse transcriptase (V) gene, partial
            cds.
ACCESSION   M32815
KEYWORDS    reverse transcriptase.
SOURCE      Cauliflower mosaic virus viral DNA, clone pLW303S.
  ORGANISM  Cauliflower mosaic virus
            Viridae; ds-DNA nonenveloped viruses; Caulimovirus.
REFERENCE   1  (bases 1 to 467)
  AUTHORS   Vaden,V.R. and Melcher,U.
  TITLE     Recombination sites in Cauliflower mosaic virus DNAs: Implications
            for mechanisms of recombination
  JOURNAL   Virology 177, 717-726 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by U.K.Melcher, 13-MAR-1990.
FEATURES       from  to/span     description
    pept    <     1  >   467     reverse transcriptase (AA at 3)
BASE COUNT      157 a    110 c     86 g    114 t
ORIGIN      
        1 agaagatctt ctcttccttc gactgcaagt caggattctg gcaagttctg ctagatcaag
       61 aatcaagacc tctaacggca ttcacatgtc cccaaggtca ctacgaatgg aatgtggtcc
      121 ctttcggcct aaagcaggct ccatccatat tccaaagaca catggacgaa gcatttcgtg
      181 tgttcagaaa gttctgttgc gtttatgtcg acgacattct cgtattcagt aacaacgaag
      241 aagatcacct acttcatgta gcgatgatct tacaaaagtg caatcaacat ggaatcatcc
      301 tttccaaaaa gaaagcacaa ctcttcaaga agaagataaa cttccttggt ctagaaatag
      361 atgaaggaac acataagcct caaggacaca tcttggaaca catcaacaag ttcccagata
      421 cccttgaaga caagaagcaa cttcagagat tcttaggcat actaaca
//
LOCUS       HUMOSTRO     1425 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human osteopontin mRNA, complete cds.
ACCESSION   J04765
KEYWORDS    osteopontin; phosphoprotein.
SOURCE      Human adult osteoblast, cDNA to mRNA, clone HOP-10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1425)
  AUTHORS   Young,M.F., Kerr,J.M., Termine,J.D., Wewer,U.M., Wang,M.G.,
            McBride,O.W. and Fisher,L.W.
  TITLE     cDNA cloning, mRNA distribution and heterogeneity, chromosomal
            location, and RFLP analysis of human osteopontin (OPN)
  JOURNAL   Genomics 7, 491-502 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.F.Young, 04-MAY-1990.
FEATURES       from  to/span     description
    pept         68     1012     osteopontin precursor
                                 /hgml_locus_uid="LK0223T" /nomgen="OPN"
                                 /map="unassigned"
    sigp         68      115     osteopontin signal peptide
    matp        176     1012     osteopontin
    mRNA    <     1     1425     osteopontin mRNA
BASE COUNT      449 a    293 c    296 g    387 t
ORIGIN      Chromosome 4q13.
        1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact
       61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc
      121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga
      181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca
      241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc
      301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga
      361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca
      421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac
      481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg
      541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga
      601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt
      661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga
      721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca
      781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc
      841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt
      901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct
      961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa
     1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa
     1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg
     1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa
     1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg
     1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact
     1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag
     1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag
//
LOCUS       VSVGLYPA     1657 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 84-PN-H) glycoprotein gene,
            complete cds.
ACCESSION   M35207
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 84-PN-H, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      482 a    364 c    364 g    447 t
ORIGIN      
        1 aacagagatc aatctgtttc ctcgacacca tgaagtgcct tttgtactta gctctgttat
       61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggtaattgga
      121 aaaatgttcc ttccagttac cattactgcc catcaagctc agatttgaat tggcataatg
      181 gcctaatagg cacagcctta caggtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac
      301 cgaaatatat aacacattcc atccggtcct tcacaccatc cgtggagcaa tgcaaggaaa
      361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactccc catcatgtac
      481 ttgtcgatga gtacacggga gaatgggttg attcccaatt catcaacggg aaatgcagca
      541 atgacatctg ccctactgtg cataactcca caacctggca ttccgattac aaagttaaag
      601 ggctgtgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggggagc
      661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgct tacgaaactg
      721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg
      781 tttggttcga aatggcagac aaagatctgt ttgctgcagc aaaattccca gaatgtccag
      841 aagggtcaag catatctgct ccatctcaga catcagtgga cgtgagtctc atccaggatg
      901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc
      961 ttcccatctc tccggtagac ctaagctatc ttgctcctaa aaatccagga accggcccag
     1021 ctttcacaat catcaatggt actctgaaat attttgagac caggtacatc agagtcgata
     1081 ttgctgctcc aattctccca agaatggttg gaacaatcag tggaactacc acagaaagag
     1141 aactgtggga agactgggca ccatatgaag atgtggaaat cgggcctaat ggagttctga
     1201 aaaccagttc aggatataaa tttcctttat acatgattgg acatggcatg ctggactctg
     1261 atctccatct cagctcgaag gctcaagtgt ttgaacaccc tcacattcag gatgctgcat
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttgtcc aaaaatccaa
     1381 tcgagctcgt agaagggtgg ttcagcggtt ggaagagctc cattgcttct tttttcttta
     1441 tcataggttt gatcattgga ttattcttgg tcctccgagt tggtatctat ctttgcatca
     1501 agttaaaaca gaccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaaccaga atcggacacg acagagtaat catatttaaa ccaaatcgac ttgtgatatc
     1621 atgctccaag aggctttaat tatgtttcaa ttttatg
//
LOCUS       VSVGLYPB     1657 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 85-CR-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35208
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-CR-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      487 a    358 c    366 g    446 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagtcttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPC     1657 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 87-CR-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35209
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-CR-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      489 a    358 c    364 g    446 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPD     1657 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 84-CR-B) glycoprotein gene,
            complete cds.
ACCESSION   M35210
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 84-CR-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      488 a    357 c    365 g    447 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 gattgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPE     1665 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 87-OA-B) glycoprotein gene,
            complete cds.
ACCESSION   M35211
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-OA-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      476 a    364 c    358 g    467 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
       61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
      301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
      541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
      601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
     1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
     1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcgtct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta
     1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
     1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
     1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS       VSVGLYPF     1665 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 87-VC-B) glycoprotein gene,
            complete cds.
ACCESSION   M35212
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-VC-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      476 a    362 c    358 g    469 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
       61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagccttg caagttaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
      301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
      541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
      601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
     1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
     1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcgtct tagctcaaag gctcaggtat ttgaacatcc tcacattcaa gacgctgctt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta
     1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
     1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
     1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS       VSVGLYPG     1657 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 84-PN-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35213
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 84-PN-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      488 a    356 c    366 g    447 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttctggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcggg tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccga
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctatgcatca
     1501 acttacggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgtttgaa ttgtatg
//
LOCUS       VSVGLYPH     1665 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain Mudd-Summers) glycoprotein gene,
            complete cds.
ACCESSION   M35214
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain Mudd-Summers, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      477 a    366 c    353 g    469 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacca tgaagtgcct tttgtactta gcttttttat
       61 tcatcggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttagtagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtacggac
      301 cgaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gctgaagcag cgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca
      541 atgacatatg ccccactgtc cataactcca caacctggca ttccgactat aaggtcaaag
      601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct aggaaagaag ggcacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg
     1021 tctttaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg
     1141 tactgtggga tgactgggct ccatatgaag acgtgggaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt ttgaacatcc tcacattcaa gacgctgctt
     1321 cgcagcttcc tgatggtgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagtttgt agaaggttgg ttcagtagtt ggaagagctc tattgcctct tttttcttta
     1441 ccatagggtt aatcattgga ctattcttgg ttctccgagt tggtatttat ctttgcatta
     1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 cgtaactcaa atcctgcaca acagattctt catgtttgaa ccaaatcaac ttgtgatatc
     1621 atgctcaaag aggccttaat tatattttaa tttttaattt ttatg
//
LOCUS       VSVGLYPI     1665 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 86-DF-P) glycoprotein gene,
            complete cds.
ACCESSION   M35215
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 86-DF-P, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      477 a    363 c    357 g    468 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
       61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
      301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca
      541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag
      601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga
     1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg
     1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctct tttttcttta
     1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta
     1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc
     1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg
//
LOCUS       VSVGLYPJ     1788 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 82-HD-B) glycoprotein gene,
            complete cds.
ACCESSION   M35216
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 82-HD-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1788)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      508 a    362 c    344 g    574 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat
     1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg
//
LOCUS       VSVGLYPK     1788 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 83-HD-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35217
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 83-HD-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1788)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      508 a    362 c    344 g    574 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat
     1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg
//
LOCUS       VSVGLYPL     1815 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 83-HD-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35218
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 83-HD-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1815)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      512 a    358 c    346 g    599 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gattgctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatt agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcactat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggcctaatt catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taattttatt tttttatttt tattttttat tttttatttt ttatttttta
     1741 ttttttattt tttattttta ttttttatct ttttattttt tattttttat ttttattttt
     1801 atttttattt ttatg
//
LOCUS       VSVGLYPM     1665 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain San Juan 56-NM-B) glycoprotein
            gene, complete cds.
ACCESSION   M35219
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain San Juan 56-NM-B, Indiana
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1665)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      484 a    364 c    355 g    462 t
ORIGIN      
        1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat
       61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga
      121 aaaatgttcc ttctaattac cattattgcc cgtcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagccata caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac
      301 cgaagtatat aacacagtcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg
      421 gatatgcaac tgtgacggat gccgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 tggttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca
      541 attacatatg ccccactgtc cataactcta caacctggca ttctgactat aaggtcaaag
      601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc
      661 tatcatccct gggaaaggag ggcacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gaggcaaggc ctgcaaaatg caatactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag
      841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtcta attcaggacg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttccaatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg
     1021 ctttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga
     1201 ggaccagttc aggatataag tttcctttat acatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcaa gacgctgctt
     1321 cgcaacttcc tgatgatgag agtttatttt ttggtgatac tgggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtagtt ggaaaagctc tattgcctct tttttcttta
     1441 tcatagggtt aatcattgga ctattcttgg ttctccgagt tggtatccat ctttgcatta
     1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgga ccaaatcaac ttgtgatacc
     1621 atgctcaaag aggcctcaat tatatttgag tttttaattt ttatg
//
LOCUS       VSVGLYPN     1812 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 85-ES-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35220
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-ES-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1812)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      512 a    362 c    347 g    591 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt
     1741 tttatttttt attttttatt tttatttttt atctttttat ttcttatttt tatttttatt
     1801 tttattttta tg
//
LOCUS       VSVGLYPO     1861 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 86-ES-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35221
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 86-ES-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1861)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      520 a    363 c    347 g    631 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
     1741 ttttattttt tattttttta ttttttattt tttatttttt atttttattt tttatttttt
     1801 tattttttta ttttttattt ttatttttat ttttattttt atttttattt ttatttttat
     1861 g
//
LOCUS       VSVGLYPP     1802 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 84-GM-B) glycoprotein gene,
            complete cds.
ACCESSION   M35222
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 84-GM-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1802)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      506 a    362 c    348 g    586 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
       61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta
     1741 ttttttattt ttatttttta tctctttatt ttttattttt atttttattt ttatttttta
     1801 tg
//
LOCUS       VSVGLYPQ     1868 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 87-ES-B) glycoprotein gene,
            complete cds.
ACCESSION   M35223
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-ES-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1868)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      520 a    362 c    347 g    639 t
ORIGIN      
        1 aacagagatc aatctgtttt cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat tacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct catcatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catatttcaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
     1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt
     1801 atttttttat ttttttattt tttattttta tttttatttt tatttttatt tttattttta
     1861 tttttatg
//
LOCUS       VSVGLYPR     1860 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 86-ES-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35224
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 86-ES-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1860)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      520 a    362 c    347 g    631 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcta caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt
     1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt
     1801 atttttttat tttttatttt tatttttatt tttattttta tttttatttt tatttttatg
//
LOCUS       VSVGLYPS     1657 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 69-PN-L) glycoprotein gene,
            complete cds.
ACCESSION   M35225
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 69-PN-L, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      487 a    358 c    364 g    448 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcattggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattattgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgccacgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 ctaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca agtgactcct catcatgtat
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggagaac
      661 tctcatccct agggaaggaa ggcacgggat tcagaagcaa ttactttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgac aaagatctat ttgctgcagc aaaattcccg gagtgtcccg
      841 aaggatcgag tatctccgct ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt ggagcaggtc
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctaaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccacct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg ttcagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttgttcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca ataccacaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPT     1657 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 87-CR-B1) glycoprotein gene,
            complete cds.
ACCESSION   M35226
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 87-CR-B1, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      490 a    358 c    363 g    446 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg aatcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPU     1657 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 85-CR-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35227
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-CR-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      486 a    357 c    367 g    447 t
ORIGIN      
        1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga
      121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg
      181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag
      241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg
      421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac
      481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca
      541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag
      601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac
      661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg
      721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg
      781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg
      841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg
      901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta
      961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag
     1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg
     1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga
     1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat
     1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa
     1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta
     1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca
     1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa
     1561 agtaatcaca atactgcaca acagaataat catatttgga ccaaatcgac ttgtgatatc
     1621 atgctttaag aggctttaac gatgttttga ttgtatg
//
LOCUS       VSVGLYPV     1796 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 86-GM-B) glycoprotein gene,
            complete cds.
ACCESSION   M35228
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 86-GM-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1796)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      503 a    362 c    349 g    582 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
       61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttcccattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat agtcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta
     1741 ttttttattt tttattttta ttttttatct ttttattttt atttttattt tttatg
//
LOCUS       VSVGLYPW     1800 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 85-GM-B) glycoprotein gene,
            complete cds.
ACCESSION   M35229
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-GM-B, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1800)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      506 a    362 c    348 g    584 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat
       61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taatttttat tttttatttt tattttttta ttatttttta ttttttattt
     1741 ttatttttta tttttatttt ttcatctttt tattttttat ttttattttt attttttatg
//
LOCUS       VSVGLYPX     1652 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 59-PN-L) glycoprotein gene,
            complete cds.
ACCESSION   M35230
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 59-PN-L, Indiana serotype), cDNA
            to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1652)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      487 a    344 c    367 g    453 t      1 others
ORIGIN      
        1 aacagagatc aatctgtctt cctagcatca tgaaatgnat tttgtacttc gcttttctcc
       61 tggtaggagt gaactgtaaa tttacaatag ttttcccaca taatcaaaaa gggaactgga
      121 aaaatgtgcc atccagttac cattattgtc catctagttc cgatcttaat tggcacaatg
      181 gcctgattgg tacagcatta caagtcaaga tgcccaaaag tcacaaagca attcaggcag
      241 acggctggat gtgtcatgca tccaaatggg tcaccacatg tgacttcagg tggtacggac
      301 ccaaatatat tacgcattct attcggtcat tcacaccctc tgtggaacag tgcagagaaa
      361 gcattgaaca gaccaaacaa ggaatctgga taaatcctgg ctttcctcct cagagttgtg
      421 gatatgcaac tgtgacggac gcggaagcag taattgtcca ggtcacccca catcatgtgc
      481 tcgtagatga gtacacaggc gaatgggtag attcccaatt tttgaacgga aagggcaaag
      541 acgacatatg ccccacagtc cataattcga ccacatggca ttcagactac aaagtcaaag
      601 ggctttgtga tgctaaccta atctcaatgg acatcacttt cttctcagaa gatggtgagc
      661 tgtcctcctt gggcaaagat gggactggat tcagaagcaa ttactttgct tatgaaaatg
      721 gagataaggc ttgccgcatg cagtattgca aacattgggg agtcagactc ccttctggag
      781 tctggttcga gatggcagac aaagatttgt ttgcaggagc caagttcccc gagtgtccag
      841 acggaaccag tatctctgct ccttctcaga catccgtaga tgtaagctta attcaggacg
      901 tcgaaagaat cttggactat tccttgtgtc aagagacttg gagtaaaatc aaggcaggcc
      961 ttccgatttc tccagttgat ctaagctatc tcgctccaaa aaatccaggg acaggtcccg
     1021 ctttcaccat catcaatggg accttgaagt actttgagac cagatacatc agagtggaca
     1081 ttgctgctcc aatcataccc agaatggtgg gaactatcag tggaacatct acagaaagag
     1141 aattgtggga tgactggtac tcttatgaag atgtggagat tgggcccaat ggagtattga
     1201 aaacaacctc aggatataaa tttcctcttt atatgattgg acacggaatg ctggactctg
     1261 atcttcacct gacatcgaag gcccaggttt ttgaacatcc ccacattcag gacgctgcat
     1321 cacagcttcc tgacggcgag actttatttt ttggtgatac cgggctctca aaaaacccca
     1381 ttgaattagt agagggatgg ttcagtgggt ggaaaagctc tattgcttcg ttcttctttg
     1441 tcattggatt gatcattgga ttatatttgg ttttaagggt gggaatatat ctttgtctta
     1501 aactaaagca tacaaaaaag agaaaaattt acacagacat tgagatgaac agacttggaa
     1561 aataaccaaa attgtgtgat ggatttgctt ttttgccaaa gaagcctgtg atatccttct
     1621 cagagaagct ttaatctttt attgacaata tg
//
LOCUS       VSVGLYPY     1657 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 84-PN-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35231
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 84-PN-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1657)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      489 a    367 c    357 g    444 t
ORIGIN      
        1 aacagagatc catctgtttc ctcgacacca tgaagtgcct tttgtactta gctcttttat
       61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggaaattgga
      121 aaaatgttcc ttccagttac cattactgcc cgtcaagttc agatttgaac tggcataacg
      181 gcctaatagg cacagcccta caggtcaaga tgcccaagag tcacaaggct attcaagcag
      241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac
      301 cgaaatatat aacacattcc atccggtcct tcacaccatc tgtggagcaa tgcaaggaaa
      361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg
      421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtaactccc catcatgtac
      481 ttgtcgatga gtacacggga gagtgggtag attcccaatt catcaacggg aaatgcagca
      541 atgacatatg ccctactgtc cataactcca caacctggca ttccgactac aaagtcaaag
      601 ggctgtgtga ttctaacctc atctccatgg acatcacctt cttctcagaa gatggggagc
      661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgca tacgaaacag
      721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg
      781 tctggtttga aatggccgac aaagatctgt ttgctgcagc aaaattccca gaatgtccag
      841 aagggtcaag catctctgct ccatctcaga catcagtgga cgtgagtctt attcaggacg
      901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc
      961 ttcccatctc tccggtagat ctaagctatc ttgctccaaa aaacccagga accgggccag
     1021 cctttacaat aatcaatggt actctgaaat actttgagac cagatacatc agagtcgata
     1081 ttgctgctcc aattctccca agaatggtcg gaacaatcag tggaactacc accgagagag
     1141 aattgtggga cgactgggca ccatatgaag atgttgaaat cggaccaaat ggagttctga
     1201 aaaccagttc aggatataaa ttccctttat atatgattgg acatggcatg ctggactctg
     1261 atctccatct cagctcaaag gctcaggtat ttgaacaccc tcacattcag gatgctgcat
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaatccaa
     1381 tcgagctcgt agaagggtgg ttcagtggtt ggaaaagctc cattgcttct tttttcttta
     1441 tcataggttt gatcattgga ttattcttgg ttctccgagt tggtatctat ctctgcatca
     1501 agttaaaaca taccaaaaaa agaaaaatct atacagacat tgagatgaat cgactcggaa
     1561 agtaactgga atcagacacg acagaataat catatttaaa ccaaatcgac ttgtgatatc
     1621 atgctctaag aggctttaat tatgttttaa ttttatg
//
LOCUS       VSVGLYPZ     1812 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 85-ES-B2) glycoprotein gene,
            complete cds.
ACCESSION   M35232
KEYWORDS    glycoprotein; surface antigen.
SOURCE      Vesicular stomatitis virus (strain 85-ES-B2, Indiana serotype),
            cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1812)
  AUTHORS   Bilsel,P.A. and Nichol,S.T.
  TITLE     Polymerase errors accumulating during natural evolution of the
            glycoprotein gene of vesicular stomatitis Indiana serotype viruses
  JOURNAL   J. Virol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.T.Nichol, 14-JUN-1990.
FEATURES       from  to/span     description
    pept         30     1565     glycoprotein
BASE COUNT      513 a    360 c    346 g    593 t
ORIGIN      
        1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat
       61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga
      121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg
      181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag
      241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac
      301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa
      361 gcattgaaca gacaaaacaa ggaacttggc tgaatccagg cttccctccc caaagttgcg
      421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc
      481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg
      541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag
      601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc
      661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg
      721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg
      781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag
      841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg
      901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc
      961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg
     1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata
     1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg
     1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga
     1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg
     1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt
     1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa
     1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta
     1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta
     1501 aattaaagca caccaataaa agacagattt atacagacat aaagatgaac cgacttggaa
     1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca
     1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat
     1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt
     1741 tttatttttt attttttatt tttatttttt atctttttat tttttatttt tatttttatt
     1801 tttattttta tg
//
LOCUS       ECODKSA      1273 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  E.coli dnaK suppressor (dksA) gene, complete cds.
ACCESSION   M34945
KEYWORDS    dnaK suppressor.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1273)
  AUTHORS   Kang,P.J. and Craig,E.A.
  TITLE     Identification and characterization of a new Escherichia coli gene
            that is a dosage-dependent suppressor of a dnaK deletion mutation
  JOURNAL   J. Bacteriol. 172, 2055-2064 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        229      441     ORF 1
    pept        619     1074     dnaK suppressor (dksA)
BASE COUNT      343 a    301 c    333 g    296 t
ORIGIN      
        1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa
       61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc
      121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata
      181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg
      241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt
      301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg
      361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa
      421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg
      481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg
      541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc
      601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt
      661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa
      721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa
      781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac
      841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac
      901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc
      961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg
     1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag
     1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga
     1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct
     1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt
     1261 acgcatagaa gat
//
LOCUS       STFCYCLI     2180 bp ss-mRNA            INV       08-AUG-1990
DEFINITION  Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds.
ACCESSION   M33880
KEYWORDS    cyclin B.
SOURCE      Starfish (A.pectinifera) egg, cDNA to mRNA, clone
            lambda-gt10-cyc10.
  ORGANISM  Asterina pectinifera
            Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa; 
            Stelleroidea; Asteroidea; Spinulosida; Asterinidae.
REFERENCE   1  (bases 1 to 2180)
  AUTHORS   Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T.
  TITLE     The starfish egg mRNA responsible for meiosis reinitiation encodes
            cyclin
  JOURNAL   Dev. Biol. 140, 241-252 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Kishimoto, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        126     1313     cyclin B (CYC)
    mRNA    <   126     2180     cyclin B mRNA
BASE COUNT      650 a    440 c    482 g    608 t
ORIGIN      
        1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg
       61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga
      121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt
      181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc
      241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag
      301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca
      361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa
      421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct
      481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac
      541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt
      601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc
      661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc
      721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc
      781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg
      841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt
      901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca
      961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca
     1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag
     1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc
     1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat
     1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca
     1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg
     1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt
     1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt
     1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg
     1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa
     1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt
     1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga
     1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc
     1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca
     1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac
     1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat
     1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag
     1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat
     2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg
     2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt
     2161 gtaataaagt taatggaaat
//
LOCUS       ECOCLPPA     1236 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  Escherichia coli ATP-dependent clp protease proteolytic component
            (clpP) gene, complete cds.
ACCESSION   J05534
KEYWORDS    ATP-dependent protease.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1236)
  AUTHORS   Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J.,
            Bowers,B. and Gottesman,S.
  TITLE     Sequence and structure of ClpP, the proteolytic component of the
            ATP-dependent Clp protease of Escherichia coli
  JOURNAL   J. Biol. Chem. 265, 12536-12545 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Gottesman, 23-APR-1990.
FEATURES       from  to/span     description
    pept        378     1001     ATP-dependent protease (clpP)
    mRNA        279  >  1236     clpP mRNA
    pept    <     1      133     ORF
    signal      244      249     -35 region
    signal      267      272     -10 region
    signal      165      193     transcription termination signal
    binding     367      372     ribosome binding site (put.)
BASE COUNT      329 a    291 c    318 g    298 t
ORIGIN      
        1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga
       61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa
      121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc
      181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg
      241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac
      301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt
      361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg
      421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt
      481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta
      541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc
      601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc
      661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt
      721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga
      781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg
      841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat
      901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg
      961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg
     1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt
     1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc
     1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg
     1201 cgcaagctga ttgccggtcc atccgtgtat atctgc
//
LOCUS       HAMACAPT     1837 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA,
            partial cds.
ACCESSION   J05590 M22755
KEYWORDS    N-acetylglucosamine-1-phosphate transferase; glycosyl transferase;
            tunicamycin-resistance.
SOURCE      Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone
            TRG-[4,22].
  ORGANISM  Cricetulus longicaudatus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 889 to 960)
  AUTHORS   Lehrman,M.A., Zhu,X. and Khounlo,S.
  TITLE     Amplification and molecular cloning of the hamster tunicamycin-
            sensitive N-acetylglucosamine-1-phosphate transferase gene: The
            hamster and yeast enzymes share a common peptide sequence
  JOURNAL   J. Biol. Chem. 263, 19796-19803 (1988)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1837)
  AUTHORS   Zhu,X. and Lehrman,M.A.
  TITLE     Cloning, sequence and expression of a cDNA encoding hamster
            UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate
            transferase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and sequence for [1] kindly submitted by M. Lehrman,
            04-APR-1989.  Draft entry and computer-readable sequence for [1]
            kindly submitted by M.Lehrman, 12-JUN-1990.
FEATURES       from  to/span     description
    pept        148     1374     N-acetylglucosamine-1-phosphate transferase
                                 (GPT)
    binding      67       79     dolichol binding site
    binding     222      234     dolichol binding site
BASE COUNT      363 a    515 c    418 g    541 t
ORIGIN      
        1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg
       61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg
      121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg
      181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc
      241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg
      301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc
      361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc
      421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg
      481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct
      541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta
      601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc
      661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat
      721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta
      781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt
      841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga
      901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc
      961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct
     1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg
     1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc
     1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag
     1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt
     1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc
     1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc
     1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca
     1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt
     1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt
     1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc
     1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat
     1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag
     1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc
     1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt
//
LOCUS       HS4NA3ABC   10089 bp ds-DNA             VRL       08-AUG-1990
DEFINITION  Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C
            genes, complete cds.
ACCESSION   M34440
KEYWORDS    nuclear antigen.
SOURCE      Epstein-Barr virus type 2 (isolate AG876) DNA.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 10089)
  AUTHORS   Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E.,
            Rickinson,A. and Kieff,E.
  TITLE     Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their
            EBNA-3A, EBNA-3B and EBNA-3C genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.T.Sample, 23-MAY-1990.
               Kieff Laboratory
               Harvard Medical School
               75 Francis Street
               Boston, MA 02115
FEATURES       from  to/span     description
    pept        189      524     nuclear antigen EBNA-3A, exon 1
                610     3051     nuclear antigen EBNA-3A, exon 2
    pept       3237     3593     nuclear antigen EBNA-3B, exon 1
               3672     6155     nuclear antigen EBNA-3B, exon 2
    pept       6277     6636     nuclear antigen EBNA-3C, exon 1
               6711     9560     nuclear antigen EBNA-3C, exon 2
    IVS         525      609     EBNA-3A intron
    IVS        3594     3671     EBNA-3B intron
    IVS        6637     6710     EBNA-3C intron
BASE COUNT     2495 a   2837 c   2619 g   2138 t
ORIGIN      
        1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct
       61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag
      121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg
      181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag
      241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca
      301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc
      361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac
      421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc
      481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg
      541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt
      601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat
      661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg
      721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg
      781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact
      841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg
      901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg
      961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc
     1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct
     1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac
     1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat
     1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg
     1261 tacgggcaag acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca
     1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg
     1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg
     1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct
     1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa
     1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac
     1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc
     1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc
     1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg
     1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg
     1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa
     1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca
     1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca
     2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag
     2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc
     2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac
     2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg
     2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa
     2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg
     2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg
     2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc
     2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg
     2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg
     2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa
     2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg
     2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct
     2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat
     2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt
     2941 ggccagaatg tcacagacca tgaacctaga agggtggtgg tgtcagctat tgttcacatg
     3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg
     3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat
     3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc
     3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga
     3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg
     3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac
     3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag
     3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca
     3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc
     3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag
     3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc
     3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga
     3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac
     3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct
     3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata
     3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa
     3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag
     4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt
     4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga
     4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata
     4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag
     4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct
     4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg
     4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa
     4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg
     4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca
     4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag
     4621 ccagaacaga gcaaccaaga gccaagcctg attcccctgc ccccacagtg gtccttcggc
     4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg
     4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca
     4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca
     4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga
     4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt
     4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac
     5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc
     5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac
     5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct
     5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc
     5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc
     5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc
     5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc
     5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc
     5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc
     5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag
     5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta
     5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg
     5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac
     5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac
     5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc
     5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct
     6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt
     6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt
     6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg
     6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta
     6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg
     6301 gactctatac agtcacctga caatgcgcgg ggagatgatg tacagaatac tggtgagcat
     6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac
     6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc
     6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta
     6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca
     6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc
     6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc
     6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct
     6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg
     6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat
     6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca
     6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg
     7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag
     7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga
     7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc
     7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt
     7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta
     7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc
     7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg
     7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag
     7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc
     7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac
     7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag
     7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg
     7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag
     7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt
     7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga
     7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta
     7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca agtaagccac
     8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc
     8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc
     8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc
     8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc
     8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc
     8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg
     8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac
     8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg
     8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg
     8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata
     8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc
     8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact
     8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc
     8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt
     8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag
     8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc
     9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac
     9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc
     9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc
     9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac
     9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg
     9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag
     9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa
     9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg
     9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca
     9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga
     9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta
     9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca
     9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg
     9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg
     9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg
     9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag
     9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc
    10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt
    10081 ggaggatcc
//
LOCUS       HUMPAPB      3061 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human prostatic acid phosphatase mRNA, complete cds.
ACCESSION   M34840
KEYWORDS    glycoprotein; prostatic acid phosphatase; secretory protein.
SOURCE      Human prostate, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1199)
  AUTHORS   Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
            Moore,D.L.
  TITLE     Covalent structure, disulfide bonding and identification of
            reactive surface and active site residues of human prostatic acid
            phosphatase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3061)
  AUTHORS   Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
            Moore,D.L.
  TITLE     Covalent structure, disulfide bonding and identification of
            reactive surface and active site residues of human prostatic acid
            phosphatase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.L.Van Etten, 08-JUN-1990.
               Chemistry Department
               Purdue University
               West Lafayette, IN 47907
FEATURES       from  to/span     description
    pept         15     1175     prostatic acid phosphatase precursor (EC
                                 3.1.3.2) /hgml_locus_uid="LA0168F"
                                 /nomgen="ACPP" /map="3q21-qter"
    sigp         15      110     prostatic acid phosphatase signal peptide
    matp        111     1172     prostatic acid phosphatase
BASE COUNT      917 a    642 c    681 g    821 t
ORIGIN      Chromosome 3q21-qter.
        1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc
       61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga
      121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg
      181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg
      241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt
      301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg
      361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc
      421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc
      481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg
      541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag
      601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg
      661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga
      721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat
      781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc
      841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc
      901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga
      961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc
     1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt
     1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa
     1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag
     1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc
     1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca
     1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt
     1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa
     1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt
     1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt
     1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga
     1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc
     1681 aaggagaggc aaagaaagga gatacagtgg agacatctgg aaagttttct ccactggaaa
     1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt
     1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat
     1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata
     1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga
     1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag
     2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag
     2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt
     2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta
     2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca
     2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa
     2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc
     2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac
     2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa
     2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat
     2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac
     2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc
     2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc
     2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat
     2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc
     2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat
     2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta
     3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa
     3061 a
//
LOCUS       LDV3ECPA     1064 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Lactate dehydrogenase-elevating virus capsid protein gene, complete
            cds.
ACCESSION   M34856
KEYWORDS    viral capsid protein.
SOURCE      Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones
            A16, B24, B63, B90, B103, C44, dt4.
  ORGANISM  Lactate dehydrogenase-elevating virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Togaviridae.
REFERENCE   1  (bases 1 to 1064)
  AUTHORS   Godeny,E.K., Speicher,D.W. and Brinton,M.A.
  TITLE     Map location of lactate dehydrogenase-elevating virus (LDV) capsid
            protein (Vp1) gene
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.K.Godeny, 06-JUN-1990.
FEATURES       from  to/span     description
    pept        637      984     capsid protein
    pept        135      650     unidentified ORF
BASE COUNT      267 a    223 c    265 g    309 t
ORIGIN      3' terminus region.
        1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc
       61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc
      121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt
      181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg
      241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt
      301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg
      361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc
      421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg
      481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta
      541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga
      601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc
      661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat
      721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc
      781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga
      841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt
      901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat
      961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg
     1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt
//
LOCUS       MBIFRHADGB   3903 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase
            (frhADGB) gene, complete cds.
ACCESSION   J02914
KEYWORDS    8-hydroxy-5-deazaflavin-reducing hydrogenase.
SOURCE      M.thermoautotrophicum (strain delta-H) DNA.
  ORGANISM  Methanobacterium thermoautotrophicum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanobacteriaceae.
REFERENCE   1  (bases 1 to 3903)
  AUTHORS   Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T.
  TITLE     Cloning, sequence determination, and expression of the genes
            encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin
            reducing hydrogenase from Methanobacterium thermoautotrophicum
            delta-H
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.Alex, 25-MAY-1990.
FEATURES       from  to/span     description
    pept        298     1515     F420 hydrogenase alpha-subunit (frhA)
    pept       1524     2000     F420 hydrogenase delta-subunit (frhD)
    pept       2114     2824     F420 hydrogenase gamma-subuint (frhG)
    pept       2837     3682     F420 hydrogenase beta-subunit (frhB)
    binding     284      293     ribosomal binding site (put.)
    binding    1514     1520     ribosomal binding site (put.)
    binding    2100     2106     ribosomal binding site (put.)
    binding    2823     2832     ribosomal binding site (put.)
BASE COUNT     1201 a    878 c    965 g    859 t
ORIGIN      
        1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta
       61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac
      121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt
      181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt
      241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg
      301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg
      361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg
      421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg
      481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc
      541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac
      601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac
      661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt
      721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac
      781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa
      841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc
      901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa
      961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata
     1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca
     1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt
     1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg
     1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta
     1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag
     1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca
     1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc
     1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa
     1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg
     1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt
     1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg actggcggtc
     1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg
     1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg
     1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa
     1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa
     1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg
     1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc
     2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga
     2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag
     2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact
     2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga
     2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga
     2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac
     2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt
     2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc
     2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc
     2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt
     2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct
     2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg
     2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct
     2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac
     2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata
     2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg
     3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa
     3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga
     3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta
     3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat
     3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat
     3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga
     3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg
     3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg
     3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga
     3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga
     3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat
     3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt
     3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa
     3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt
     3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat
     3901 cga
//
LOCUS       MUSLRCNA     1751 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Mouse loricrin gene, complete cds.
ACCESSION   M34398
KEYWORDS    cell envelope protein; loricrin.
SOURCE      Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1751)
  AUTHORS   Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D.,
            Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M.,
            Yuspa,S.H. and Roop,D.R.
  TITLE     Identification of a major keratinocyte cell envelope protein,
            loricrin
  JOURNAL   Cell 61, 1103-1112 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Rothnagel, 22-MAY-1990.
FEATURES       from  to/span     description
    pept         71     1516     loricrin
    signal     1740     1745     polyA signal
BASE COUNT      223 a    493 c    706 g    329 t
ORIGIN      
        1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc
       61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg
      121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg
      181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg
      241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg
      301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg
      361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg
      421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg
      481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg
      541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg
      601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg
      661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc
      721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg
      781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg
      841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg
      901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg
      961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg
     1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg
     1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc
     1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg
     1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag
     1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg
     1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg
     1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg
     1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac
     1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc
     1561 cgtgggcgcc gatgggctta actttctcat gaatttgcct gaggtttcca aacccttcac
     1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct
     1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca
     1741 ataaatttgc t
//
LOCUS       PFAGIPA      1906 bp ds-DNA             INV       08-AUG-1990
DEFINITION  P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete
            cds.
ACCESSION   J05544
KEYWORDS    glucosephosphate isomerase.
SOURCE      P.falciparum (strain 7G8) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 1906)
  AUTHORS   Kaslow,D.C. and Hill,S.
  TITLE     Cloning metabolic pathway genes by complementation in Escherichia
            coli: Isolation and expression of Plasmodium falciparum
            glucosephosphate isomerase
  JOURNAL   J. Biol. Chem. 265, 12337-12341 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.C.Kaslow, 04-JUN-1990.
FEATURES       from  to/span     description
    pept         28     1803     glucosephosphate isomerase (EC 5.3.1.9)
BASE COUNT      782 a    174 c    289 g    661 t
ORIGIN      Chromosome 14.
        1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat
       61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat
      121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct
      181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa
      241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc
      301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata
      361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat
      421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg
      481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa
      541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat
      601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat
      661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct
      721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt
      781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg
      841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta
      901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga
      961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat
     1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat
     1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca
     1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa
     1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat
     1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa
     1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg
     1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa
     1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt
     1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta
     1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt
     1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat
     1681 gacacaagaa atcaaaagaa atcagataat acctataatt ttaatgaatc tacaaaaaat
     1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag
     1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt
     1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt
//
LOCUS       RATGADA      3198 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Rat glutamic acid decarboxylase (GAD) mRNA, complete cds.
ACCESSION   M34445
KEYWORDS    glutamic acid decarboxylase.
SOURCE      Rat brain and hypothalamus, cDNA to mRNA, (libraries of
            D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and
            pCR22HP.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3198)
  AUTHORS   Wyborski,R.J., Bond,R.W. and Gottlieb,D.I.
  TITLE     Characterization of a cDNA coding for rat glutamic acid
            decarboxylase
  JOURNAL   Mol. Brain Res. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.I.Gottlieb, 23-MAY-1990.
FEATURES       from  to/span     description
    pept        184     1965     glutamic acid decarboxylase
    mRNA    <     1     3198     glutamic acid decarboxylase mRNA
    signal     3174     3179     polyA signal
BASE COUNT      869 a    742 c    767 g    820 t
ORIGIN      
        1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca
       61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc
      121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag
      181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat
      241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc
      301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag
      361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa
      421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct
      481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg
      541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac
      601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac
      661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt
      721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta
      781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc
      841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca
      901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc
      961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc
     1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg
     1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt
     1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat
     1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca
     1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg
     1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc
     1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa
     1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag
     1501 cagtatgacg tctcctacga tacaggggac aaggcgattc agtgtggccg ccatgtggac
     1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc
     1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt
     1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa
     1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc
     1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg
     1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc
     1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc
     1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca
     2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg
     2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa
     2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat
     2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc
     2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca
     2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag
     2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta
     2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca
     2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac
     2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag
     2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt
     2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta
     2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt
     2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt
     2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc
     2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa
     3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta
     3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt
     3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa
     3181 taaaaaatat tctccatg
//
LOCUS       RATTSHRA     5270 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Rat thyroid stimulating hormone receptor mRNA, complete cds.
ACCESSION   M34842
KEYWORDS    TSH receptor; thyroid stimulating hormone receptor;
            transmembrane protein.
SOURCE      Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to
            mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5270)
  AUTHORS   Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C.,
            McBride,O.W. and Kohn,L.D.
  TITLE     Cloning, chromosomal assignment and regulation of the rat
            thyrotropin receptor by thyrotropin, agents which increase cAMP
            levels and thyroid autoantibodies
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.D.Kohn, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         55     2349     thyroid stimulating hormone receptor precursor
    sigp         55      117     thyroid stimulating hormone receptor signal
                                 peptide
    matp        118     2346     thyroid stimulating hormone receptor
    signal     5256     5261     poly-A signal
BASE COUNT     1415 a   1343 c   1176 g   1336 t
ORIGIN      
        1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg
       61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga
      121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag
      181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact
      241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat
      301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa
      361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta
      421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct
      481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct
      541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg
      601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg
      661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga
      721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct
      781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag
      841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac
      901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat
      961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc
     1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag
     1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag
     1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac
     1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca
     1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg
     1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc
     1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc
     1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac
     1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt
     1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc
     1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg
     1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc
     1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac
     1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg
     1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt
     1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc
     1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg
     2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc
     2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag
     2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa
     2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt
     2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca
     2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa
     2401 catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct
     2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc
     2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac
     2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg
     2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag
     2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga
     2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat
     2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc
     2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat
     2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt
     3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta
     3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat
     3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac
     3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga
     3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt
     3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa
     3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct
     3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt
     3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt
     3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata
     3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt
     3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac
     3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat
     3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga
     3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat
     3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag
     3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc
     4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc
     4081 tgtgaagtag ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg
     4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa
     4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga
     4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag
     4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc
     4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg
     4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc
     4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct
     4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg
     4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca
     4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa
     4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta
     4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga
     4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg
     4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc
     4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg
     5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca
     5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt
     5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg
     5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg
//
LOCUS       STMBLIPA      840 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  S.clavuligerus beta-lactamase inhibitory protein gene, complete
            cds.
ACCESSION   M34538
KEYWORDS    beta-lactamase inhibitory protein.
SOURCE      S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP.
  ORGANISM  Streptomyces clavuligerus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 840)
  AUTHORS   Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E.
  TITLE     Isolation and characterization of a beta-lactamase inhibitory
            protein from Streptomyces clavuligerus and cloning and analysis of
            corresponding gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.E.Jensen, 25-MAY-1990.
               Dept. of Microbiology
               University of Alberta
               Edmonton, Alberta Canada T6G 2E9
FEATURES       from  to/span     description
    pept         67      672     beta-lactamase inhibitory protein precursor
                                 (bliP)
    sigp         67      174     beta-lactamase inhibitory protein signal
                                 peptide
    matp        175      669     beta-lactamase inhibitory protein
    binding      52       63     ribosomal binding site
    signal      686      724     transcription termination signal
BASE COUNT      140 a    248 c    320 g    132 t
ORIGIN      
        1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga
       61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc
      121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg
      181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc
      241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc
      301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc
      361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg
      421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg
      481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg
      541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc
      601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg
      661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc
      721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg
      781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc
//
LOCUS       TRBPARPA      763 bp ss-mRNA            INV       08-AUG-1990
DEFINITION  T.brucei procyclin (pPS-1) mRNA, complete cds.
ACCESSION   M34790
KEYWORDS    procyclic acidic repetitive protein; procyclin.
SOURCE      T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone
            pPS-1.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 763)
  AUTHORS   Dorn,P.L., Aman,R.A. and Boothroyd,J.C.
  TITLE     Inhibition of protein synthesis results in super-induction of
            procyclin RNA
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Boothroyd, 04-JUN-1990.
               Stanford University School of Medicine
               Dept of Microbiology and Immunology
               Sherman Fairchild Science Building
               Stanford, CA 94305-5402 USA
FEATURES       from  to/span     description
    pept         29      466     procyclin
    mRNA    <     1      763     procyclin mRNA
    rpt         215      393     12 bp degenerate repeats
BASE COUNT      192 a    215 c    156 g    200 t
ORIGIN      
        1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct
       61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc
      121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg
      181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga
      241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga
      301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga
      361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc
      421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt
      481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc
      541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt
      601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc
      661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag
      721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt
//
LOCUS       W10PPIBCA    4761 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes,
            complete cds, and ippD gene, 3' end.
ACCESSION   M34849
KEYWORDS    antigen.
SOURCE      Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA,
            clones HB101/pHS[4108,5357].
  ORGANISM  Plasmid pWR100
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 4761)
  AUTHORS   Baudry,B., Kaczorek,M. and Sansonetti,P.J.
  TITLE     Nucleotide sequence of the invasion plasmid antigen B and C genes
            (ipaB and ipaC) of Shigella flexneri
  JOURNAL   Microb. Pathog. 4, 345-357 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 538 and 3987 to 4761)
  AUTHORS   Baudry,B., Kaczorek,M. and Sansonetti,P.J.
  JOURNAL   Unpublished (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Baudry, 05-JUN-1990.
               Center for Vaccine Development
               University of Maryland at Baltimore
               10 South Pine Street
               Baltimore, MD 21201
FEATURES       from  to/span     description
    pept        606     1073     polypeptide I (ippI)
    pept       1079     2821     polypeptide B (ipaB)
    pept       2841     3932     polypeptide C (ipaC)
    pept       3983  >  4761     ipaD protein
    pept    <     1      549     unidentified ORF
    binding     595      600     ippI ribosomal binding site
    binding    1065     1070     ipaB ribosomal binding site
    binding    2829     2834     ipaC ribosomal binding site
    binding    3971     3977     ipaD ribosomal binding site
    signal      556      561     ippI -10 region
    signal     2723     2728     ipaC -35 region
    signal     2745     2750     ipaC -10 region
BASE COUNT     1769 a    952 c    805 g   1235 t
ORIGIN      
        1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat
       61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat
      121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct
      181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca
      241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat
      301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc
      361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg
      421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca
      481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat
      541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag
      601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa
      661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca
      721 tttattcata tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct
      781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta
      841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat
      901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag
      961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat
     1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat
     1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga
     1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac
     1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa
     1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct
     1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa
     1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga
     1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat
     1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat
     1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga
     1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca
     1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat
     1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa
     1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg
     1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc
     1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga
     1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt
     2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt
     2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca
     2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc
     2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc
     2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac
     2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac
     2401 cctcacagac cttataccaa agtttctcaa gaatttttct tctcaactgg acgatttaat
     2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc
     2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc
     2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa
     2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga
     2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc
     2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg
     2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata
     2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc
     2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat
     3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg
     3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag
     3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag
     3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga
     3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc
     3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa
     3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc
     3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa
     3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc
     3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt
     3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc
     3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca
     3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca
     3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc
     3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag
     3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg
     3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc
     4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa
     4081 aacaacgacc agttctcatc ctgtaagttc ccttactatg ctcaacgaca cccttcataa
     4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac
     4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc
     4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt
     4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga
     4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata
     4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag
     4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa
     4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc
     4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac
     4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat
     4741 agacaatatg ttaaaaagct t
//
LOCUS       YEPOMPHA      824 bp ss-mRNA            BCT       08-AUG-1990
DEFINITION  Y.enterocolitica cationic outer membrane protein (ompH) gene,
            complete cds.
ACCESSION   M34854
KEYWORDS    cationic outer membrane protein;
            enterobacterial outer membrane protein;
            lipopolysaccharide binding protein; secretory protein;
            structural membrane protein.
SOURCE      Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 824)
  AUTHORS   Hirvas,L., Koski,P. and Vaara,M.
  TITLE     The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and
            comparison to known ompH sequences
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.H.Hirvas, 06-JUN-1990.
               University of Helsinki
FEATURES       from  to/span     description
    pept        324      818     ompH protein precursor (gtg start codon)
    sigp        324      392     ompH protein signal peptide
    matp        393      815     ompH protein
    binding     308      317     ompH ribosomal binding site
    signal      120      125     ompH -35 region
    signal      143      148     ompH -10 region
BASE COUNT      257 a    147 c    198 g    221 t      1 others
ORIGIN      
        1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg
       61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg
      121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt
      181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt
      241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa
      301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag
      361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa
      421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag
      481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc
      541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga
      601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc
      661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta
      721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta
      781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac
//
LOCUS       YSCEXG1A     2056 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase
            (EXG1) gene, complete cds.
ACCESSION   M34341
KEYWORDS    1,3-glucan glucanohydrolase; exo-1,3-B-glucanase.
SOURCE      S.cerevisiae (strain AB320) DNA, clone YEp13.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2056)
  AUTHORS   Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A.,
            Nebreda,A.R., Mendez,E. and del Rey,F.
  TITLE     Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1,
            in the yeast Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.del Rey, 17-MAY-1990.
               Instituto de Microbiologia bioquimica
               Facultad de Biologia
               CSIC/Universidad de Salamanca
               37008 Salamanca, SPAIN
FEATURES       from  to/span     description
    pept        363     1709     exo-1,3-B-glucanase/1,3-B-D-glucan
                                 glucanohydrolase precursor (EXG1, EC 3.2.1.58)
    sigp        363      482     exo-1,3-B-glucanase/1,3-B-D-glucan
                                 glucanohydrolase signal peptide
    matp        483     1706     exo-1,3-B-glucanase/1,3-B-D-glucan
                                 glucanohydrolase
    mRNA        265  >  1709     EXG1 mRNA
    signal      153      159     TATA box
BASE COUNT      610 a    395 c    396 g    655 t
ORIGIN      Chromosome XII-right arm.
        1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg
       61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac
      121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa
      181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat
      241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt
      301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta
      361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta
      421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa
      481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg
      541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg
      601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg
      661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg
      721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc
      781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca
      841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt
      901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca
      961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat
     1021 acttggacac tgttattggt atcgaattga ttaatgagcc attgggtcct gttctagaca
     1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca
     1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact
     1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct
     1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg
     1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga
     1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg
     1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt
     1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct
     1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg
     1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc
     1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat
     1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt
     1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa
     1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta
     1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat
     1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt
     2041 gagcttttcc gccggc
//
LOCUS       ACCPCACGH    2391 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  A.calcoaceticus gamma-carboxymuconolactone decarboxylase (pcaC) and
            protocatechuate 3,4-dioxygenase alpha and beta subunit (pcaGH)
            genes, complete cds.
ACCESSION   M33798
KEYWORDS    gamma-carboxymuconolactone decarboxylase;
            protocatechuate 3,4-dioxygenase.
SOURCE      A.calcoaceticus DNA, clone pZR2.
  ORGANISM  Acinetobacter calcoaceticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 2391)
  AUTHORS   Hartnett,C., Neidle,E.L., Ngai,K.-L. and Ornston,N.
  TITLE     DNA sequences of genes encoding Acinetobacter calcoaceticus
            protocatechuate 3,4-dioxygenase: Evidence indicating shuffling of
            genes and of DNA sequences within genes during their evolutionary
            divergence
  JOURNAL   J. Bacteriol. 172, 956-966 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        324      728     gamma-carboxymuconolactone decarboxylase
                                 (pcaC) (EC 4.1.1.44)
    pept        750     1463     protocatechuate 3,4-dioxygenase, alpha subunit
                                 (pcaH) (EC 1.13.11.3)
    pept       1491     2120     protocatechuate 3,4-dioxygenase, beta subunit
                                 (pcaG) (EC 1.13.11.3)
    ORF     <     1      203     ORF (AA at 3)
    binding     312      315     ribosomal binding site (put.)
    binding     737      740     ribosomal binding site (put.)
    binding    1478     1482     ribosomal binding site (put.)
BASE COUNT      670 a    463 c    533 g    725 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttatca aacccaacgt tacttgctct gtttcatttt atgtgcaggt attgccgtca
       61 atggtgctca atcttcaatg ccagtattga gtgcgcgttt ctatccaacg caatgtcgtg
      121 caacaggtgt ggcatggatg tctggaattg gccgttttgg tcgccgtatt tggtgcatgg
      181 attggtgcag ttttattagg taataactgg tcatttacca tgatcctgag catgttgata
      241 tcctgctcgt gccgctgcaa ttgcaatttt tgttaaatcc cttgttggca catacagatg
      301 caacttgatg tgaggtttct cttatgaatg atgaacaacg ttataaacaa gggctggaag
      361 tccgtacaga agttctgggt gaaaagcatg ttaaccgttc acttgagaac ctaaacgact
      421 tcaatcagga ttttcaaaac tttattagcc gttttgcatg gggagaggtt tggtctcgtc
      481 ctggacttcc tcgtcatacg cgtagtttgg tcacaattgc agtgttgttg gcgcttgggc
      541 gtgaagatga gctgcgtatc gatttacgtg cctgtttcaa caatggtgtg accaaagatg
      601 aattaaaaga gttgattcta cattgttcac tttatgcagg cttacccgct tcaaatgctg
      661 cgatgcatat ggcagaagag gtctttaagg acttgggaat tgcacctgaa aaagtaaaca
      721 aggattaatg gatttagagg gagtagaaaa tgtctcaaat tatttgggga gcttacgctc
      781 agtcgaatac cgaagatcat ccgccagctt atcgaccagg ttataaaacc agtgtgttac
      841 gctcgcctaa gaatcgactc atttctattg cagaaacttt aagtgaagtc actgcaccac
      901 attttagtgc tgataaattt ggtccaaaag acaatgactt aatcttgaac tatgccaaag
      961 atggtttgcc gattggtgaa cgtgtcattg tgcatggtta tgtacgtgac cagtttggtc
     1021 gacctgtaaa aaatgcactt gttgaagtat ggcaagccaa tgcatctggt cgttatcgtc
     1081 atccaaatga tcaatatatc ggtgccatgg atcctaattt cggtggttgt ggccgtatgt
     1141 tgactgatga taatggttat tatgtattcc gtacgattaa accaggtcca tacccatggc
     1201 gtaaccgtat caatgaatgg cgtcctgctc acattcactt ctctttaatt gcagatggtt
     1261 gggcacaacg tttaatttcg cagttctatt ttgaaggcga tacattgatt gattcttgcc
     1321 caatcctgaa aaccattcct tctgagcaac aacgtcgtgc gctaattgca ttagaagaca
     1381 agagcaattt cattgaagct gacagccgtt gttatcgctt tgacattacg cttcgtggtc
     1441 gcgcggactt acttcgaaaa tgacttaact taatggatgg agtaaaaaac atgaatggtt
     1501 ggaattttca ggaactcaaa gaaaccccat ctcaaacagg tggtccatat gtccatattg
     1561 gtcttttgcc aaaacaagcc aatatcgaag tatttgaaca caatttaggt aacaaccttg
     1621 tacaggacaa tacgcaaggt caacgcatcc gattagaagg tcaagtgttt gatgggctta
     1681 gtttacctct acgtgacgta ctgattgaaa tctggcaagc agataccaat ggcgtttacc
     1741 caagtcaagc tgatactcaa ggtaaacaag tcgatccgaa ctttttgggt tggggccgta
     1801 caggtgcaga ttttggtaca ggcttctgga gttttaatac cattaagcca ggtgctgttc
     1861 caggccgtaa aggttcaact caagcaccac atatttcact gattattttt gcgcgtggta
     1921 tcaacattgg tcttcacact cgtgtctatt ttgatgacga agcagaagca aatgcaaaag
     1981 atcctgtttt aaacagtatc gagtgggcga cacgtcgtca aacactggtt gccaaacgtg
     2041 aagagcgtga tggtgaagtg gtttatcgtt ttgatattcg tattcaaggt gaaaacgaaa
     2101 cggtattttt cgatatctaa atttttgact atggaacggg ccgtatatct ttcggtatac
     2161 agcctccata attttacaag gaaacgtaaa tgttcaatac aaacaaactg gttaaacaac
     2221 tttgtctaag tggtgcgttg ttgatgtcat tgcctatatt gtcaacgacc tatgcggctg
     2281 aaaatactgt accagcatct aaatcaactt atgttgtaaa aaatttaaac attggtgatt
     2341 tacctgttaa aaccctcgtg ccaatcactg ctaaaacaag agagcaagct t
//
LOCUS       HAMSAA3A      530 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  M.auratus serum amyloid A (SAA3) mRNA, complete cds.
ACCESSION   M33431
KEYWORDS    serum amyloid A protein.
SOURCE      M.auratus (strain LVG) embryo fibroblast, cDNA to mRNA, clone
            pAA825.
  ORGANISM  Mesocricetus auratus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 530)
  AUTHORS   Gervais,C. and Suh,M.
  TITLE     Serum amyloid A protein-related mRNA expression in herpes simplex
            virus type 2-transformed hamster cells
  JOURNAL   Mol. Cell. Biol. 10, 4412-4414 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Suh, 30-MAR-1990.
FEATURES       from  to/span     description
    pept         42      410     serum amyloid A
    signal      510      515     polyA signal
BASE COUNT      147 a    122 c    140 g    121 t
ORIGIN      
        1 ggggacctct agagtcactc accaggcatc tcgaaagcag gatgaagcct ttccttgcca
       61 tcattttttg cttcttgatc ctgggagttg acagccaaag gtggttccag ttcatgaaag
      121 aagctggcca agggagtaca gacatgtgga gagcttactc tgacatgaga gaggccaact
      181 ggaaaaactc agataaatac ttccatgccc gggggaacta tgatgctgcc aaaaggggac
      241 caggaggagc ctgggctgct aaagtgatca gtgatgccag agagggtatt cagaggttca
      301 caggacgtgg agcagcggac tcaagagctg accagtttgc caacaaatgg ggccggagtg
      361 gtaaagaccc caaccacttt cgacctgctg gcctgcctag taaatactga gtttcatctt
      421 catgctattc ccagccatgc accccaaggt tgcatttact agattaagtt agttcctgag
      481 ctctggacgt gtaagcaccc gaaatttaga ataaatacct gtgaaatgct
//
LOCUS       HEIBEXBA      795 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  H.influenza (strain RM153) capsulation protein (bexB) gene, 5' end.
ACCESSION   M33787
KEYWORDS    capsulation protein.
SOURCE      H.influenza (type b, strain RM153) DNA.
  ORGANISM  Haemophilus influenzae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 795)
  AUTHORS   Kroll,J.S. and Moxon,E.R.
  TITLE     Capsulation in distantly related strains of Haemophilus influenzae
            type b: genetic drift and gene transfer at the capsulation locus
  JOURNAL   J. Bacteriol. 172, 1374-1379 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >   795     capsulation protein (bexB)
BASE COUNT      188 a    133 c    177 g    297 t
ORIGIN      
        1 atgcaatatg gtgataaaac aacatttaaa cagtcattag ctattcaagg acgggtgatt
       61 aacgcgttgc ttatgcgtga aataatcact cgttatgggc gtcaaaatat tggctttttt
      121 tggctatttg ttgagccttt gctcatgact ttttttattg taatgatgtg gaaatttatc
      181 cgtgctgata aattttctac tttaaatatg attgcctttg tgatgacggg gtatccaatg
      241 gcaatgatgt ggcgaaatgc ttcaaaccgt gcgattggtt ccatttctgc aaacttgagt
      301 ttactttatc accgaaatgt tcgagtgcta gataccattt ttactcgcgt gttgcttgag
      361 gttgctggtg cttctattgc tcaaattctt tttatggcaa tcttagtcat gattgattgg
      421 attgatgcac cgcatgatgt gttttatatg ctcattgcat ggttcctcat ggcaatgttt
      481 gcttttggct tggggttaat catttgtgcg attgctcagc aatttgatgt gtttggtaaa
      541 atttggggaa cactcagttt cgttttatta ccaatttctg gcgcattctt cttcgtccat
      601 aaccttccag cacaggcaca atctattgca ctttggtttc cgatgattca tggtacagaa
      661 atgttccgcc atggctattt tggcgataca gtggttactt acgaaagtat aggctttttg
      721 gttgtgagtg atctagcctt gttgttactg ggcttagtga tggttaaaaa ctttagtaaa
      781 ggagttgagc cacaa
//
LOCUS       HEIBEXBB      795 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  H.influenza (strain RM926) capsulation protein (bexB) gene, 5' end.
ACCESSION   M33788
KEYWORDS    capsulation protein.
SOURCE      H.influenza (type b, strain RM926 ATCC 8468) DNA, clones
            lambda-926.[8,11,14].
  ORGANISM  Haemophilus influenzae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Pasteurellaceae.
REFERENCE   1  (bases 1 to 795)
  AUTHORS   Kroll,J.S. and Moxon,E.R.
  TITLE     Capsulation in distantly related strains of Haemophilus influenzae
            type b: genetic drift and gene transfer at the capsulation locus
  JOURNAL   J. Bacteriol. 172, 1374-1379 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1  >   795     bexB protein
BASE COUNT      188 a    128 c    178 g    301 t
ORIGIN      
        1 atgcaatatg gtgatcaaac aacgtttaaa cagtcattag cgattcaagg aagagtgatt
       61 aatgcattac ttatgcgtga aataatcact cgttatgggc gcaaaaatat tggctttcta
      121 tggctatttg ttgagccttt actcatgact ttttttattg tgatgatgtg gaaatttatt
      181 cgtgctgata aattttcaac tttaaatatg attgcttttg tcatgaccgg gtatcctatg
      241 gcgatgatgt ggcgaaatgc ttctaatcga gctattggtt cgatttctgc aaacttgagt
      301 ttactttatc accgaaatgt tcgagtgcta gatactatct ttactcgagt attgcttgag
      361 gttgcaggtg cttcgattgc tcaaatcctt tttatggcag ttttagtgct tattggttgg
      421 attgatgcac cacgtgatgt gttttatatg cttatggcat ggtttctcat ggctatgttt
      481 gcttttgcct taggccttat tatttgtgcg gtagctcaac aatttgatgt gttcggtaaa
      541 atttggggaa ccttaagttt cgtattgtta ccaatttctg gtgcattttt ctttgttcac
      601 aaccttccat cacaggctca atctatcgcg ctttggctcc caatgattca tggtacggaa
      661 atgtttcgtc atggctactt tggggatacg gtcgttactt atgagagtat cggattttta
      721 gtcgtaagtg atttggcctt attgctaatg ggcttggtga tggttaaaaa ctttagtaag
      781 gggattgagc cacaa
//
LOCUS       PSEARCDA     1502 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  P.aeruginosa membrane protein arcD (complete cds) and arcA (5' end)
            genes.
ACCESSION   M33223
KEYWORDS    membrane protein.
SOURCE      P.aeruginosa (strain PAO1) DNA, clone pME183.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1502)
  AUTHORS   Luethi,E., Baur,H., Gamper,M., Brunner,F., Villeval,D.,
            Mercenier,A. and Haas,D.
  TITLE     The arc operon for anaerobic arginine catabolism in Pseudomonas
            aeruginosa contains an additional gene, arcD, encoding a membrane
            protein
  JOURNAL   Gene 87, 37-43 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Haas, 26-MAR-1990.
FEATURES       from  to/span     description
    pept         24     1472     membrane protein (arcD)
    pept       1494  >  1502     membrane protein (arcA)
    binding       8       15     ribosome binding site (put.)
BASE COUNT      247 a    507 c    443 g    305 t
ORIGIN      
        1 attcctatag ggagattccg aaaatgtccc aagaatccag ccaaaaactc cgactaggag
       61 cgctaacggc tctggttgtc ggctcgatga tcggaggggg catcttctcc ctcccgcaaa
      121 acatggcagc cagcgccgat gtcggcgccg tcctgatcgg atgggcgatc accgccgtcg
      181 gcatgctcac cctcgctttc gtcttccaga ccctcgccaa ccgcaagccc gaactcgacg
      241 gcggcgtgta cgcctacgcc aaggccggct tcggcgacta catgggtttc tcctcggcct
      301 ggggctactg gatcagcgcc tggctgggca acgtcggcta cttcgtcctg ttgttcagca
      361 ccctcggcta cttcttcccg atcttcggca agggcgacac cgtcgcggcg atcgtctgcg
      421 cctcggtcct gctctgggcc ctgcacttcc tggtgctgcg cgggatcaag gaggcggcgt
      481 tcatcaacac cgtcaccacc gtggccaagg tcgtgccgct gttcctgttc atcctgatct
      541 gcctgttcgc cttcaagctg gacatcttca ccgccgacat ctggggcaag agcaacccgg
      601 acctgggcag cgtgatgaac caggtgcgca acatgatgct ggtcaccgtc tgggtgttca
      661 tcggcatcga gggcgcgagc atcttctcct cccgcgcgga aaaacgttcc gacgtcggca
      721 aggccaccgt gatcggcttc atcaccgtcc tgctcctgct ggtgctggtc aacgtgctgt
      781 ccatgggcgt gatgacccag ccggaactgg ccaagctgca gaacccgtcg atggcgctgg
      841 tactcgagca tgtggtcggc cactggggcg ccgtgctgat cagcgtcggc ctgctgatct
      901 cgctgctggg cgcgctgctc tcctgggtgc tgctgtgcgc cgagatcatg ttcgccgccg
      961 ccaaggacca caccatgccg gagttcctgc gccgcgagaa cgccaaccag gtgccggcca
     1021 acgccctgtg gctgaccaac atctgcgtac aggtgttcct ggtggtggtg ttcttcacct
     1081 cgggcgaccc ggacggcatg gacccgtaca ccaagatgct gctcctggcc acctcgatga
     1141 tcctgatccc gtacttctgg tccgccgcct atggcctgct gctgaccctg aagggcgaga
     1201 cctacgagaa cgatgcccgc gaacgcagca aggacctggt catcgccggc atcgccgtgg
     1261 cctacgcggt ctggctgctc tacgcggggg gcctgaagta cctgctgctg tccgccctgc
     1321 tgtatgcgcc cggcgcgatc cttttcgcca aggccaagca cgaggtcggc cagccgatct
     1381 tcaccggaat cgaaaaactg atcttcgccg cagtcgtcat tggcgccctg gtggctgcct
     1441 acggcctcta cgacggtttc ctcaccctct gattactcca ctggagacgt gaaatgagca
     1501 cg
//
LOCUS       HUMSPTB      6765 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human beta-spectrin (SPTB) mRNA, complete cds.
ACCESSION   J05500
KEYWORDS    beta-spectrin; spectrin.
SOURCE      Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and
            V252.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 6765)
  AUTHORS   Winkelmann,J.C., Chang,J.-G., Tse,W.T., Scarpa,A.L.,
            Marchesi,V.T. and Forget,B.G.
  TITLE     Full length sequence of the cDNA for human erythroid beta-spectrin
  JOURNAL   J. Biol. Chem. 265, 11827-11832 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Winkelmann, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         96     6509     beta-spectrin /nomgen="SPTB" /map="14"
                                 /hgml_locus_uid="LS0033T"
    mRNA    <     1     6765     SPTB mRNA
    signal     6716     6722     poly-A signal
BASE COUNT     1626 a   1822 c   2146 g   1171 t
ORIGIN      Chromosome 14q23-q24.
        1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag
       61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa
      121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg
      181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg
      241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg
      301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca
      361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc
      421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac
      481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc
      541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag
      601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag
      661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta
      721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca
      781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc
      841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct
      901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc
      961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg
     1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca
     1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca
     1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca
     1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag
     1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg
     1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc
     1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata
     1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga
     1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg
     1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc
     1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg
     1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg
     1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga
     1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg
     1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc
     1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg
     1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg
     2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc
     2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc
     2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt
     2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga
     2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc
     2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg
     2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc
     2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat
     2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgcaggcc ctgcgggagc
     2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg
     2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga
     2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca
     2761 ggttcgacat cctggaccag gagatgaaga ccttgatgac tcagattgat ggtgtgaacc
     2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc
     2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg
     2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt
     3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag
     3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg
     3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga
     3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat
     3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg
     3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca
     3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg
     3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc
     3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg
     3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg
     3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat
     3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga
     3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg
     3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg
     3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc
     3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct
     3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc
     4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc
     4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc
     4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa
     4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg
     4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcggtcagac gacccgggca
     4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga
     4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg
     4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa
     4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg
     4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta
     4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg
     4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg
     4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc
     4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt
     4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct
     4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc
     4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg
     5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg
     5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca
     5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa
     5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc
     5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca
     5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg
     5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc
     5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga
     5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca
     5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg
     5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg
     5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc
     5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct
     5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc
     5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca
     5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc
     6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt
     6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc
     6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg
     6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca
     6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg
     6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga
     6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg
     6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac
     6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg
     6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg
     6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag
     6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa
     6721 attttattgt ataaatatat caccatttac atgaggggaa acact
//
LOCUS       STYEUTBC     2526 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes,
            complete cds.
ACCESSION   J05518
KEYWORDS    ethanolamine ammonia-lyase.
SOURCE      S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2526)
  AUTHORS   Faust,L.R.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M.
  TITLE     Cloning, sequencing, and expression of the genes encoding the
            adenosylcobalamin-dependent ethanolamine ammonia-lyase of
            Salmonella typhimurium
  JOURNAL   J. Biol. Chem. 265, 12462-12466 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.M.Babior, 08-MAY-1990.
FEATURES       from  to/span     description
    pept        141     1499     ethanolamine ammonia-lyase (eutB)
    pept       1518     2378     ethanolamine ammonia-lyase (eutC)
    binding     130      133     ribosome binding site
    binding    1507     1510     ribosome binding site
BASE COUNT      563 a    687 c    779 g    497 t
ORIGIN      
        1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga
       61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt
      121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt
      181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg
      241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa
      301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc
      361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca
      421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc
      481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc
      541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc
      601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg
      661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc
      721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga
      781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc
      841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct
      901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt
      961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct
     1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg
     1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc
     1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga
     1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc
     1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc
     1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg
     1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg
     1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat
     1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg
     1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc
     1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg
     1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc
     1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg
     1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg
     1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc
     1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg
     1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga
     2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact
     2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca
     2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg
     2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc
     2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg
     2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga
     2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg
     2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt
     2521 gcctcc
//
LOCUS       XELPCNA      1018 bp ss-mRNA            VRT       08-AUG-1990
DEFINITION  X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete
            cds.
ACCESSION   M34080
KEYWORDS    nuclear protein; proliferating cell nuclear antigen.
SOURCE      X.laevis oocyte, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1018)
  AUTHORS   Leibovici,M., Gusse,M., Bravo,R. and Mechali,M.
  TITLE     Characterization and developmental expression of Xenopus
            proliferating cell nuclear antigen (PCNA)
  JOURNAL   Dev. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Leibovici, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         28      813     proliferating cell nuclear antigen (PCNA)
    mRNA    <     1     1018     PCNA mRNA
BASE COUNT      284 a    223 c    237 g    274 t
ORIGIN      
        1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc
       61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca
      121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact
      181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag
      241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg
      301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt
      361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa
      421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc
      481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca
      541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa
      601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat
      661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca
      721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg
      781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc
      841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa
      901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa
      961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac
//
LOCUS       YSCAAC2A     1333 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete
            cds.
ACCESSION   M34076 J05542
KEYWORDS    ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE      S.cerevisiae (strain W303-1B) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1333)
  AUTHORS   Kolarov,J., Kolarova,N. and Nelson,N.
  TITLE     A third ADP/ATP-translocator gene in yeast
  JOURNAL   J. Biol. Chem. 265, 12711-12716 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Nelson, 08-MAY-1990.
FEATURES       from  to/span     description
    pept        235     1158     ADP/ATP-translocator protein (AAC2)
BASE COUNT      388 a    209 c    301 g    435 t
ORIGIN      
        1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc
       61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg
      121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa
      181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt
      241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt
      301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat
      361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt
      421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat
      481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg
      541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct
      601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc
      661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg
      721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc
      781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct
      841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg
      901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga
      961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa
     1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg
     1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc
     1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc
     1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag
     1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc
     1321 agccactgtt gtt
//
LOCUS       YSCAAC3      1308 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete
            cds.
ACCESSION   M34075 J05542
KEYWORDS    ADP/ATP translocase; ADP/ATP-translocator protein.
SOURCE      S.cerevisiae (strain W303-1B) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1308)
  AUTHORS   Kolarov,J., Kolarova,N. and Nelson,N.
  TITLE     A third ADP/ATP-translocator gene in yeast
  JOURNAL   J. Biol. Chem. 265, 12711-12716 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by N.Nelson, 08-MAY-1990.
FEATURES       from  to/span     description
    pept         78     1034     ADP/ATP-translocator protein (AAC3)
BASE COUNT      353 a    228 c    263 g    464 t
ORIGIN      
        1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata
       61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc
      121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg
      181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag
      241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca
      301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta
      361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt
      421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg
      481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat
      541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg
      601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac
      661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga
      721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt
      781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa
      841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga
      901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag
      961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta
     1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc
     1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata
     1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa
     1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc
     1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt
//
LOCUS       VSVPNA       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 06/85-NM-E) nucleoprotein gene,
            complete cds.
ACCESSION   M31845
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 06/85-NM-E, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      438 a    267 c    293 g    331 t
ORIGIN      
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
       61 aaaattaccg gccaacgagg atccggttga atacccggct gattacttca aaaataatac
      121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catatgtata
      181 ccagggtatc aagtccggta atccatccat cctccacata aatgcttatc tctacgctgc
      241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg
      301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc
      361 acttcctgat gggaaatcag accctgaccg ttctgctgag gacgataaat ggcttcccat
      421 ctacatccta ggtctctaca gagtgggcag atctaaagtt acggattaca gaaagaaact
      481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
      541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
      601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg
      661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa
      721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
      781 cgagctctgc cagatgatgt atccgggaca agaaattgac aaagcagatt catacatgcc
      841 gtatatgatt gactttgggt tatctcagaa atccccctat tcatcagtga agaatccagc
      901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaacgc
      961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
     1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag acaagtacac
     1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
     1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
     1201 cgcaagaaga gcagtcaact ctctgcaatc acttcgtgag aagacaattg gcaaatatgc
     1261 caaggcagag tttgacaagt gacaaaattt aagatatcac tatcactatt atattctatg
     1321 ttatatatg
//
LOCUS       VSVPNB       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 11/83-CA-B) nucleoprotein,
            complete cds.
ACCESSION   M31846
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 11/83-CA-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      437 a    265 c    296 g    331 t
ORIGIN      
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
       61 gaaattaccg gccaacgagg atccggtcga atacccggct gattacttca aaaataatac
      121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata
      181 ccaggggatc aagtccggta atccatccat cctccacata aatgcttatc tctatgctgc
      241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg
      301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc
      361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
      421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaact
      481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctcgtcga
      541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
      601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg
      661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa
      721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
      781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
      841 gtatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
      901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc
      961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
     1021 tgctgttggg tcctcagcag acattgaaca gcagttctat attggagaag acaaatacac
     1081 aacagaaaaa gatgatggtc tgaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
     1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
     1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgaa aagacaattg gcaaatatgc
     1261 caaggcagag tttgacaagt gacaaaattt aagataccac tatcactatt atattctatg
     1321 ttatatatg
//
LOCUS       VSVPNC       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 01/84-SN-P1) nucleoprotein
            gene, complete cds.
ACCESSION   M31847
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 01/84-SN-P1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      438 a    268 c    292 g    331 t
ORIGIN      
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
       61 aaaattaccg gccaacgagg atccggttga atacccggct gattacttca aaaataatac
      121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata
      181 ccagggtatc aagtccggta atccatccat cctccacata aatgcttatc tctacgctgc
      241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg
      301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc
      361 acttcctgat gggaaatcag accctgaccg ttctgctgag gacgataaat ggcttcccat
      421 ctacatccta ggtctctaca gagtgggcag atctaaagtt acggattaca gaaagaaact
      481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
      541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
      601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg
      661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa
      721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
      781 cgagctctgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
      841 gtatatgatt gactttgggt tatctcagaa atccccctat tcatcagtaa agaatccagc
      901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaacgc
      961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
     1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag acaagtacac
     1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
     1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
     1201 cgcaagaaga gcagtcaact ctctgcaatc acttcgtgag aagacaattg gcaaatatgc
     1261 caaggcagag tttgacaagt gacaaaattt aagatatcac tatcactatt atattctatg
     1321 ttatatatg
//
LOCUS       VSVPND       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 07/83-GA-P) Phosphoprotein and
            nucleocapsid genes.
ACCESSION   M31848
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 07/83-GA-P, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      435 a    272 c    297 g    325 t
ORIGIN      
        1 aacagatatc aaaatggccc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
       61 caaattaccg gccaacgaag atccggttga atacccggcc gattatttca aaaataatac
      121 caatatagtg ttatatgtga gcaccaaagt agcactaaat gatttgaggg catacgtata
      181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc
      241 attgaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg
      301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gtcgaagaac tgaagacagc
      361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
      421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaatt
      481 actggacggg cttgaaaatc aatgcaaagt ggcgtcgacc agatttgaga gtctagtcga
      541 agatggtctc gacttctttg acatatggga gaacgatcca aatttcacca agatagtggc
      601 tgcagtggat atgttcttcc acatgtttaa aaagcatgag cgtgctccaa tcagatacgg
      661 aaccatagtc tcacgattca aggactgtgc agcacttgcg acatttggac atctcagcaa
      721 agtcagtggg ctctcaattg aagaactcac aacatgggtc ctgaataggg aggttgcaga
      781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
      841 gtatatgatc gactttggat tatcccagaa atccccatat tcatcagtga agaatccagc
      901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaacgc
      961 tagacagcct gacgacatcg aatacacctc actaacttgt gcaagtttgc tgctgtcatt
     1021 cgctgttggg tcatcagcag acattgaaca gcaattctat attggagagg ataaatacac
     1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
     1141 agactggctt ggctggtatg atgacaatgg ggggaaaccc acaccagata tgctcaactt
     1201 cgcgagaaga gcagttaact ccctgcagtc actccgtgag aagacaattg gcaaatatgc
     1261 caaggcagag tttgacaagt gacaaaactt aatataccac tatcactatt atattctatg
     1321 ttatatatg
//
LOCUS       VSVPNE       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 11/82-VC-B2) nucleoprotein
            gene, complete cds.
ACCESSION   M31849
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 11/82-VC-B2, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
BASE COUNT      435 a    263 c    296 g    335 t
ORIGIN      
        1 aacagacatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
       61 taaattaccg gccaacgagg atccggttga atacccggct gattacttca aaaataatac
      121 caatatagtc ttatatgtga gcaccaaagt ggcgctaaat gatttgagag catacgtata
      181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc
      241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg
      301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gttgaagaac tgaagacagc
      361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
      421 ctacatccta ggtctttaca gagtgggcag atccaaagtt acggattaca gaaagaaact
      481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
      541 ggatggtctt gacttttttg acatatggga gaatgatccg aatttcacca agatagttgc
      601 tgcagtggat atgttcttcc acatgtttaa aaagcatgaa cgtgctccaa tcagatacgg
      661 aaccatagtc tcaagattca aggactgtgc agcacttgcg acatttggac atctcagcaa
      721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
      781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
      841 atatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
      901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaagaatgc
      961 tagacagcct gacgacattg aatacacttc actaacttgt gcaagtttac tgctgtcgtt
     1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag acaaatacac
     1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
     1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
     1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgag aagacaatcg gcaaatatgc
     1261 caaggcagag tttgacaagt gacaaaattt aagatatcac tatcactatt atattctatg
     1321 ttatatatg
//
LOCUS       VSVPNF       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 07/84-OA-B) nucleoprotein gene,
            complete cds.
ACCESSION   M31850
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 07/84-OA-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      435 a    260 c    298 g    336 t
ORIGIN      
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcca ttattcagcc
       61 taagttaccg gccaacgagg atccgattga atacccggct gattacttca aaaataatac
      121 caatatagta ttatatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata
      181 tcaggggatc aagtccggta atccatccat actccacata aatgcttatc tttacgctgc
      241 attaaaggga gtggaaggaa ctttggacag agactgggtt agctttggaa gaacaattgg
      301 aaagagagag gagaatgtaa aaatcttcga tctagtgaaa gttgaagaac tgaagacagc
      361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
      421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaact
      481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
      541 ggatggtcta gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
      601 tgcagtggat atgttcttcc acatgtttaa aaagcatgaa cgtgctccaa tcagatacgg
      661 aaccatagtc tcaagattca aggactgtgc agcacttgcg acattcggac atctcagcaa
      721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
      781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
      841 gtatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
      901 ttttcatttc tggggacaac ttgcggcact cttgctaaga tcgactcgag caaagaatgc
      961 tagacagcct gatgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
     1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag ataaatacac
     1081 aacagaaaaa gatgatggtc ttaagaaatc ggatgtccca ccaaaaggaa ggaatgtcgt
     1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
     1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgag aagacaattg gcaaatatgc
     1261 caaggcagag tttgacaagt gacaaaattt aagataccac tgtcactatt atattctatg
     1321 ttatatatg
//
LOCUS       VSVPNG       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain ../52-GA-P) nucleoprotein gene,
            complete cds.
ACCESSION   M31851
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain ../52-GA-P, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      428 a    260 c    302 g    339 t
ORIGIN      
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
       61 taaattaccg gccaacgagg atccggttga atacccggcc gattacttca aaaataatac
      121 caatatagtg ttgtatgtga gcacaaaagt agcactaaat gatttgaggg catacgtata
      181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc
      241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg
      301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gttgaggaac tgaagacagc
      361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
      421 ctacatccta ggtctttaca gggtgggcag atctaaagtt acggattaca gaaagaaatt
      481 attggacggg cttgagattc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
      541 agatggtctc gacttctttg atgtatggga gaatgatcca aatttcacca agatagtggc
      601 tgcagtggat atgttcttcc acatgttcaa aaagcatgag cgtgctccaa tcagatacgg
      661 aaccatagtc tcacgattca aggactgtgc agcacttgcg acatttggac atctcagcaa
      721 agtcagtggt ctttcaattg aagaactcac aacatgggtc ctaaataggg aagttgcaga
      781 cgagctatgt cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
      841 gtatatgatc gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
      901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc
      961 tagacagcct gacgacatcg agtacacttc actaacttgt gcaagtttgc tgctgtcatt
     1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag ataaatacac
     1081 aacagaaaaa gatgatggtc ttaggaaatc agatgttcca ccaaaaggaa gaaatgtcgt
     1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctccaatt
     1201 cgcaaggaga gcagttaact ctctgcagtc acttcgtgag aagacaattg gcaaatatgc
     1261 caaggccgag tttgacaagt gacaaaactt aagataccac tatcactatt atattctatg
     1321 ttatatatg
//
LOCUS       VSVPNH       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 10/84-GM-P) nucleoprotein gene,
            complete cds.
ACCESSION   M31852
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 10/84-GM-P, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      441 a    274 c    280 g    334 t
ORIGIN      
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcaa taattcagcc
       61 caaactaccg gccaatgagg atccggtaga atatccagct gattacttca aaaataatac
      121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttaaggg cctatgtcta
      181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc
      241 actaaaggga gtagaaggga cattagacag agactgggtc agctttggaa gaacaatcgg
      301 aaagagggag gagaatgtca aaatcttcga tctagttaaa gttgaggaac tgaagacagc
      361 tcttcctgac gggaaatcgg atccagatcg atcagctgaa gatgataaat ggcttcctat
      421 ttacatccta ggcctctacc gagtaggtag atccaaagtt acagattaca gaaagaaact
      481 cctggacggt ctagaaaatc agtgtaaagt ggcatcaacc agatttgaaa gccttgttga
      541 agatggtctt gacttcttcg acatctggga aaatgatccc aatttcacta aaatagttgc
      601 tgcagtggat atgttcttcc acatgttcaa gaagcatgaa cgtgctccga tcagatacgg
      661 aaccatagtc tcacgattca aagactgtgc agcacttgct acatttggac atctcagcaa
      721 ggtcagcgga ctttcaattg aggacctaac aacatgggtt ctgaatcgag aagtcgcaga
      781 cgaactatgc caaatgatgt acccaggtca agaaattgat aaggcggatt catacatgcc
      841 gtatatgatt gattttgggt tatcccaaaa atctccttat tcatcagtta agaatccagc
      901 ctttcacttc tggggtcaac tggctgcact cttgctaaga tcaactcgag caaaaaatgc
      961 tcggcaaccc gatgacattg agtacacttc actgacttgt gcaagtttac tactgtcgtt
     1021 tgcagttgga tcttcagcag acattgaaca acaattctac atcggagagg acaaatacac
     1081 aacagaaaaa gatgacggtc tcaagaaatc tgatgtccca ccgaaaggaa gaaatgttgt
     1141 cgactggctt ggttggtatg atgacaatgg aggaaaaccc acaccagata tgctcaactt
     1201 cgcacgaaga gcagtcagtt cgttgcagtc acttcgcgag aaaacaattg gcaagtacgc
     1261 caaggcagaa tttgacaaat gaccaagttt gacatatcaa tatcaatatt atattctgtg
     1321 ctatatatg
//
LOCUS       VSVPNI       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 11/84-HD-B1) nucleoprotein
            gene, complete cds.
ACCESSION   M31853
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 11/84-HD-B1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      442 a    278 c    283 g    326 t
ORIGIN      
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa taattcagcc
       61 taaactaccg gccaatgagg atccggtaga atatccagct gattacttca aaaataatac
      121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttgaggg cctatgtcta
      181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc
      241 actaaaggga gtagagggaa cattagacag agactgggtt agctttggaa gaacaatcgg
      301 aaaaagggag gagaatgtca aaatcttcga cctggtcaaa gttgaagaac tgaagacagc
      361 tcttcctgac ggaaaatcag atccagaccg atcagccgaa gatgataaat ggcttcctat
      421 ttacatccta ggcctctatc gagtaggtag atccaaagtg acagattaca gaaagaaact
      481 cctggatggt ctagaaaatc agtgcaaagt ggcatcaacc agatttgaaa gcctggttga
      541 agatggtctt gacttcttcg acatctggga aaatgatccc aatttcacta aaatagttgc
      601 tgcagtggac atgttcttcc acatgttcaa gaagcatgaa cgtgctccga tcagatacgg
      661 aaccatagtc tcacgattca aagactgtgc ggcacttgct acatttggac atctcagcaa
      721 agtcagcgga ctttcaattg aggacctaac aacatgggtt ctgaatagag aagtcgcaga
      781 tgaactatgc caaatgatgt acccaggcca agaaattgat aaggcggatt catacatgcc
      841 atatatgatt gactttgggt tatcccaaaa atctccgtat tcatcagtga agaatccagc
      901 ctttcacttc tggggtcaac tggctgcact cttgctaaga tcgactcggg caaaaaatgc
      961 taggcaacct gatgacattg agtacacttc actgacctgt gcaagtttac tcctgtcatt
     1021 tgcagttgga tcttctgcag acattgagca acaattctac atcggggaag acaaatacac
     1081 aacagagaag gatgacggtc tcaagaaatc cgaagtccca ccgaaaggaa gaaatgttgt
     1141 agactggctt ggttggtacg atgacaacgg aggaaaaccc acacctgata tgctcaactt
     1201 cgcacgaaga gcagtcagct ctttgcagtc acttcgtgag aaaacaatcg gcaaatacgc
     1261 taaagcagaa tttgataaat gaccaggttt gacatatcaa tatcaatatt atattctgtg
     1321 ctatatatg
//
LOCUS       VSVPNJ       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 09/82-HD-B) nucleoprotein gene,
            complete cds.
ACCESSION   M31854
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 09/82-HD-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      437 a    265 c    296 g    331 t
ORIGIN      
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc
       61 gaaattaccg gccaacgagg atccggtcga atacccggct gattacttca aaaataatac
      121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata
      181 ccaggggatc aagtccggta atccatccat cctccacata aatgcttatc tctatgctgc
      241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg
      301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc
      361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
      421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaact
      481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctcgtcga
      541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc
      601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg
      661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa
      721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga
      781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
      841 gtatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
      901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc
      961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt
     1021 tgctgttggg tcctcagcag acattgaaca gcagttctat attggagaag acaaatacac
     1081 aacagaaaaa gatgatggtc tgaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt
     1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt
     1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgaa aagacaattg gcaaatatgc
     1261 caaggcagag tttgacaagt gacaaaattt aagataccac tatcactatt atattctatg
     1321 ttatatatg
//
LOCUS       VSVPNK       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain ../49-UT-B1) nucleoprotein
            gene, complete cds.
ACCESSION   M31855
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain ../49-UT-B1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      428 a    261 c    302 g    338 t
ORIGIN      
        1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcca taattcagcc
       61 taaactaccg gccaacgagg atccggttga atacccggcc gattacttca aaaataatac
      121 caatatagtg ttgtatgtga gcacaaaagt agcactaaat gatttgaggg catacgtata
      181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc
      241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg
      301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gttgaggaac tgaagacagc
      361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat
      421 ctacatccta ggtctttaca gggtgggcag atctaaagtt acggattaca gaaagaaatt
      481 attggacggg cttgagattc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga
      541 agatggtctc gacttctttg atgtatggga gaatgatcca aatttcacca agatagtggc
      601 tgcagtggat atgttcttcc acatgttcaa aaagcatgag cgtgctccaa tcagatacgg
      661 aaccatagtc tcacgattca aggactgtgc agcacttgcg acatttggac atctcagcaa
      721 agtcagtggt ctttcaattg aagaactcac aacatgggtc ctaaataggg aagttgcaga
      781 cgagctatgt cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc
      841 gtatatgatc gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc
      901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc
      961 tagacagcct gacgacatcg agtacacttc actaacttgt gcaagtttgc tgctgtcatt
     1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag ataaatacac
     1081 aacagaaaaa gatgatggtc ttaggaaatc agatgttcca ccaaaaggaa gaaatgtcgt
     1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctccaatt
     1201 cgcaaggaga gcagttaact ctctgcagtc acttcgtgag aagacaattg gcaaatatgc
     1261 caaggccgag tttgacaagt gacaaaactt aagataccac tatcactatt atattctatg
     1321 ttatatatg
//
LOCUS       VSVPNL       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 10/82-CR-B) nucleoprotein gene,
            complete cds.
ACCESSION   M31856
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 10/82-CR-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      431 a    268 c    298 g    332 t
ORIGIN      
        1 aacagacatc aaaatggctc ctacagtcaa gagaattgtc aatgactcta ttattcatcc
       61 taaattgcca gcccatgaag atccggtgga atacccggct gattacttca aaaacaatac
      121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gatttgaggg cttatgtcta
      181 ccaggggatc aagtctggca atccatctat actccacata aatgcttacc tttatgcagc
      241 attgaagggg gtagaaggga ccttagacag agactgggtc agcttcggac gcacaattgg
      301 gaaaagagaa gagagtgtga aaatctttga ccttgttaaa gttgaggaat taaagacagc
      361 ccttcctgat ggaaaatcag atcccgaccg ttcagctgag gatgacaaat ggcttcccat
      421 ctatatcctg ggtctttaca gagtaggcag atcaaaagtt actgattaca gaaagaaatt
      481 gctggatggt ttagagaatc agtgcaaagt agcatcgacc agattcgaaa gcctcgttga
      541 agatggattg gatttctttg acatatggga gaatgatccg aacttcacca agatagttgc
      601 agcggtagat atgttctttc atatgttcaa gaaacatgag cgtgcaccaa tcagatacgg
      661 aaccatagtc tcacggttca aagactgtgc agcattagcg acatttgggc atctaagcaa
      721 ggtcagtgga ctctctattg aagatctcac aacatgggtc cttaatagag aagttgcaga
      781 cgagctatgc cagatgatgt atccggggca agaaattgat aaggcggatt cttacatgcc
      841 atatatgatt gactttggat tgtcgcagaa atcaccatac tcatcagtta aaaacccagc
      901 tttccacttc tggggtcaac tagcagcact cctattgaga tctactcgcg caaaaaatgc
      961 ccgacagcca gatgacattg aatatacatc attaacttgt gcaagtctac tgctgtcatt
     1021 tgcagttggg tcctcagcag acattgagca gcagttctat attggagaag ataaatacac
     1081 aacagacagg gatgatggtc tgaagaaatc agatgtacca cccaaaggga ggaatgtcgt
     1141 agactggctc ggctggtatg acgataatgg aggaaaacct acaacggaca tgctgaactt
     1201 cgcaagaaga gcagtcagct ccctacaagc acttcgcgag aaaacaatcg gcaaatatgc
     1261 caaggcagag tttgataaat gaccaaactc tgcataccac tatcaatatt atattctatg
     1321 ttacatatg
//
LOCUS       VSVPNM       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 12/82-HD-B) nucleoprotein gene,
            complete cds.
ACCESSION   M31857
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 12/82-HD-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      441 a    267 c    291 g    330 t
ORIGIN      
        1 aacagacatc aaaatggctc ctacagtcaa gagaattgtc aatgactcta ttattcatcc
       61 taaactgcca gcccatgaag atccggtgga atacccggct gattacttca aaaacaatac
      121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gatttgaggg cttatgtcta
      181 ccaagggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
      241 attgaaggga gtagaaggga ccttggacag agactgggtc agcttcggac gcacaattgg
      301 gaaaagagaa gagagtgtga aaatctttga ccttgttaaa gtggaagaat taaagacagc
      361 acttcctgat ggaaaatcag atccagaccg ttcagctgag gatgataaat ggcttccaat
      421 ttatatcctg ggtctctaca gagtaggtag atcaaaagtt actgattacc gaaagaaatt
      481 gctggacggt ttagagaatc agtgcaaagt agcatcgacc agattcgaaa gcctagttga
      541 agacggattg gatttctttg acatatggga gaatgatcca aacttcacca agatagttgc
      601 agcagtagat atgttctttc atatgttcaa gaaacatgag cgtgcaccaa tcagatacgg
      661 aaccatagtc tcacggttca aagactgtgc agcattggcg acatttgggc atctcagcaa
      721 ggtcagtgga ctctctattg aagatctcac aacatgggtt ctaaatagag aagttgcaga
      781 cgagctatgc cagatgatgt atccagggca agaaattgat aaggcggatt cttacatgcc
      841 atatatgatt gactttggat tgtcgcagaa atcaccatac tcatcagtta aaaacccagc
      901 tttccacttc tggggtcaac tcgcagcgct cctattaaga tctactcgcg caaaaaatgc
      961 cagacaacca gatgacattg aatatacatc attaacttgt gcaagtctac tgctgtcatt
     1021 tgcagttggg tcctcagcag acattgagca gcagttctat attggagaag acaaatacac
     1081 aacagataag gatgatggtc taaagaaatc agatgtacca cccaaaggga ggaatgtcgt
     1141 agactggctc ggctggtatg acgataatgg gggaaaacct acaccagaca tgctgaactt
     1201 cgcaagaaga gcagtcagct ccctacaagc acttcgtgag aaaacaatcg gcaaatatgc
     1261 caaggcagag tttgataaat gaccaaactc tacataccac tatcaatatt atattctatg
     1321 ttacatatg
//
LOCUS       VSVPNN       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 10/85-HD-B1) nucleoprotein
            gene, complete cds.
ACCESSION   M31858
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 10/85-HD-B1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      437 a    264 c    294 g    334 t
ORIGIN      
        1 aacagacatc aaaatggctc ctacagttaa gagaatcgtc aatgactcta ttattcatcc
       61 taaactgcca gcccatgaag atccggtgga atacccagct gattacttca aaaacaatac
      121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gatttgaggg cttacgtcta
      181 ccaagggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
      241 attgaaggga gtagaaggga ctttggacag agactgggtc agctttggac gcacgattgg
      301 gaaaagagaa gagagtgtga aaatatttga ccttgttaaa gtggaagaat taaagacagc
      361 acttcctgat ggaaaatcag atccagaccg ttcagctgag gatgataaat ggcttcccat
      421 ttatatcctg ggtctctaca gagtaggtag atcaaaagta actgattacc gaaagaaatt
      481 gctggatggt ttagagaatc aatgcaaagt agcatcgacc agattcgaaa gcctagttga
      541 agatggattg gatttctttg acatatggga gaatgatccg aacttcacca agatagttgc
      601 agcggtagat atgttctttc atatgttcaa aaaacatgag cgggcaccaa tcagatacgg
      661 aaccatagtc tcacggttca aagactgtgc cgcattggcg acatttgggc atctcagcaa
      721 ggtcagtgga ctctctattg aagatctcac aacatgggtc cttaataggg aagttgcaga
      781 cgagctatgc cagatgatgt atccagggca agaaattgat aaggcggatt cttacatgcc
      841 atatatgatt gactttggat tatcgcagaa atcaccatac tcatcagtta aaaacccagc
      901 tttccacttc tggggtcaac ttgcagcgct cctattaaga tctactcgcg caaaaaatgc
      961 ccgacagcca gatgacattg aatatacatc attgacttgt gcaagtctac tgctgtcatt
     1021 tgcagttggg tcctcagcag acattgagca gcagttctat attggagaag ataaatacac
     1081 aacagataag gatgatggtc taaagaaatc agatgtacca cccaaaggga ggaatgtcgt
     1141 agactggctc ggctggtatg acgataatgg gggaaaacct acaccagaca tgctgaactt
     1201 cgcaagaaga gcagtcagct ccctacaagc acttcgtgag aaaacaattg gtaaatatgc
     1261 caaggcagag tttgataaat gaccaaactc tacataccac tatcaatatt atattctatg
     1321 ctacatatg
//
LOCUS       VSVPNO       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 07/83-NC-P) nucleoprotein gene,
            complete cds.
ACCESSION   M31859
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 07/83-NC-P, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      430 a    270 c    293 g    336 t
ORIGIN      
        1 aacagacatc aaaatggctc ctacagtcaa gagaatcgtt aacgactcta ttattcatcc
       61 caagctgcca gcccatgaag atccggtaga atacccggct gattacttca aaaacaatac
      121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gacttgaggg cttatgtcta
      181 ccaggggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
      241 attgaaagga gtagaaggga ccttggatcg agactgggtc agcttcggac gcaccattgg
      301 aaaaagagaa gagagtgtga aaatctttga tcttgtcaaa gtggaagagt taaaaacagc
      361 acttcctgac ggaaaatcag accctgatcg ttcagctgag gatgacaaat ggcttcccat
      421 ttatatcctg ggtctctaca gagttggtag atcaaaagtt actgattacc ggaagaaatt
      481 gctagatggt ctagagaatc agtgcaaggt agcatcaacc agattcgaaa gtttagtcga
      541 agatggattg gatttctttg acatatggga gaacgatccg aacttcacca agatagttgc
      601 agcagtagat atgttctttc atatgttcaa gaaacatgag cgtgcaccaa tcagatatgg
      661 aaccatagta tcacggttca aagactgtgc cgcattagca acatttgggc atctcagtaa
      721 agtcagtgga ctttctattg aggatctcac aacatgggtc ctaaataggg aagttgcaga
      781 cgaactatgc cagatgatgt acccagggca agaaattgat aaagcggatt cctacatgcc
      841 atatatgatt gactttggat tgtcccagaa atcaccatat tcatcagtga aaaacccagc
      901 cttccacttc tggggtcaac ttgcagcact cctattaaga tctactcgcg caaagaatgc
      961 tcgacagccg gatgacattg aatatacatc attgacttgt gcaagtctat tactgtcatt
     1021 tgcagttggg tcctcagcag acattgaaca gcagttctat attggagaag ataagtacac
     1081 aacagataag gatgatggtc tgaagaaatc agatgtacca cccaaaggga ggaatgttgt
     1141 agactggctc ggctggtatg atgataatgg gggaaaaccc acaccggaca tgctaaattt
     1201 cgcaagaaga gcagtcaact ccctacaagc actccgtgag aaaacgattg gcaagtatgc
     1261 caaggccgag ttcgataaat gaccagattc tacctactac tatcaatatt atattctctg
     1321 ttatatatg
//
LOCUS       VSVPNP       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 01/85-PN-B1) nucleoprotein
            gene, complete cds. genes.
ACCESSION   M31860
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain 01/85-PN-B1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      437 a    265 c    293 g    334 t
ORIGIN      
        1 aacagacatc aaaatggctc ctacagtcaa gagaatcgtc aacgactcta ttatccatcc
       61 taagctgcca gcccatgaag atccggtaga atacccggct gattacttca agaacaatac
      121 gaatatcgtg ttgtatgtaa gcactaaagc agcattaaat gatttaaggg catatgtcta
      181 ccaggggatt aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
      241 attaaaggga gtagaaggga ccttggacag agattgggtc agtttcggac gcaccattgg
      301 gaaaagagaa gagaatgtga aaatatttga tcttgttaaa gtggaagaat taaaaacagc
      361 acttcctgat ggaaaatcag accctgatcg ttcggcagag gatgataaat ggctccccat
      421 ttatatccta ggtctctaca gagtgggtag atcgaaagtt actgattacc ggaagaaatt
      481 gctagatggt ctagagaacc agtgcaaagt agcatcaacc agatttgaaa gtctagtcga
      541 agatggattg gatttttttg acatatggga aaatgatccg aatttcacca agatagttgc
      601 ggcagtagac atgttctttc atatgttcaa gaaacatgag cgtgcaccaa taagatatgg
      661 aaccatagta tcacggttca aggactgtgc agcgttagca acatttggac atctcagtaa
      721 agtcagcgga ctttctattg aagatctcac aacatgggtc ctcaataggg aagttgcaga
      781 cgaactctgc cagatgatgt acccaggtca agaaattgat aaatcggatt cttacatgcc
      841 atatatgatt gactttggat tgtcccagaa atcgccgtac tcatcagtaa aaaacccagc
      901 tttccacttc tggggtcaac ttgcagcact cctgttaaga tctactcgtg caaaaaatgc
      961 ccgacagcca gatgacattg aatatacatc actgacttgt gcaagcctac tactgtcatt
     1021 tgcagtcggg tcttcagcag acatcgaaca gcagttttat attggagaag acaagtacac
     1081 aacagaaaag gatgatggtc taaggaactc agatgtacca cccaaaggga ggaatgttgt
     1141 agactggctc agctggtatg acgataatgg aggaaaacca acaccggaca tgctaaattt
     1201 tgcaaaaaga gcggtcaact ccttgcaggc actccgtgag aaaacgattg gcaagtatgc
     1261 caaggcagag ttcgataaat gaccagatta tacataccgc tatcaatatt atattctctg
     1321 ttatatatg
//
LOCUS       VSVPNQ       1329 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain ../60-PN-B) nucleoprotein gene,
            complete cds.
ACCESSION   M31861
KEYWORDS    nucleoprotein.
SOURCE      Vesicular stomatitis virus (strain ../60-PN-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         14     1282     nucleoprotein
BASE COUNT      434 a    273 c    290 g    332 t
ORIGIN      
        1 aacagacatc aaaatggctc ctacagtcaa gagaatcgtt aacgactctg ttattcatcc
       61 taagctacca gcccatgagg atcccgtaga atatccagct gattacttca aaaacaatac
      121 aaatatcata ttatatgtta gcactaaagt ggcactgaat gatttaaggg cttatgtcta
      181 ccaggggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc
      241 attgaaggga gtagaaggga ctttagacag agactgggtc agcttcggac gaaccattgg
      301 gaaaagagaa gagagtgtga aaatctttga tcttgtcaaa gtggaagagt taaaaacagc
      361 acttcctgat ggaaaatcag accctgatcg ttcggccgag gatgacaaat ggcttcccat
      421 ttatatctta ggtctctaca gagtgggtag atcaaaggtt actgattacc ggaagaaatt
      481 gctagacggt ctagagaacc agtgcaaagt agcatcaacc agattcgaaa gtctagtcga
      541 agatggattg gatttttttg acatatggga aaatgatccg aacttcacca agatagttgc
      601 agcagtagac atgttctttc acatgttcaa gaaacatgag cgagcaccaa tcagatatgg
      661 aaccatagta tcacggttca aggactgtgc agcattagca acatttgggc atctcagtaa
      721 agtcagtgga ctttctattg aggatcttac aacatgggtc ctcaataggg aagttgcaga
      781 tgaactatgc cagatgatgt atccggggca agaaattgat aaagctgatt cttacatgcc
      841 atatatgatt gactttggat tgtcccagaa atcaccatat tcatcagtta aaaacccagc
      901 cttccacttc tggggtcaac ttgcagcact cctgttaaga tctactcgtg caaaaaatgc
      961 ccgacagcca gatgacattg aatatacgtc attgacctgt gcaagtctac tactgtcctt
     1021 tgcagttgga tcctcagcag acattgaaca gcagttctat attggagaag ataagtacac
     1081 gacagacaag gatgatggtc taaagaaatc agatgcacca cccaaaggga ggaatgttgt
     1141 agactggctc ggttggtatg acgataatgg gggaaaaccc acaccggaca tgctaaattt
     1201 cgcaagaaga gcggtcaact ccctacaagc actccgtgag aaaacgattg gcaaatatgc
     1261 caaggcagag ttccataaat gaccagatta tacccaccgc taccaatatt atatcctatg
     1321 ttacatatg
//
LOCUS       VSVPPAA       856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 11/84-HD-B1) phosphoprotein
            gene, complete cds.
ACCESSION   M31871
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 11/84-HD-B1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      290 a    163 c    182 g    221 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttac ttagctactt atgataattt
       61 ggattctgcc ttgcaggatg ccaatgaatc tgaagaaaga cgagaggata aatatctcca
      121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccggatga
      241 aagaatagac aagtgggaag aggacttgcc tgaattagaa aatattgatg atgatatacc
      301 ggtgaccttt cctaattgga cacagcctgt aatgaaggaa aatggaggag agaaatcatt
      361 gtctctgttc cctccagttg gactaacaaa gattcaaaca gagcaatgga gaaaaaccat
      421 tgaggcagtt tgtgagagct caaaatattg gaatttatca gaatgtcaaa ttctgaactt
      481 ggaagactgc ctcaccctca aaggccgagt gatgactcct gattgtagtt cctcagtaaa
      541 atctcaaaat tctattcaga gttcggaatc tctctactcc tctcattccc caggtcccac
      601 actcaaggga tcagaatcca tcaatctatg ggatttgaag tccacagaag tacaattgat
      661 ctctaagaga gctggagtta aggacatgac agtcaaattg acagacttct tcgaaagtga
      721 ggaagagtat tactcagtat gcccagaagg ggcaccagat ttaataggag ctatcatcat
      781 gggattgaaa tataagaaac tattcaatca agcaagaatg aagtatcgtc tctaagtctt
      841 tctcaggatc agtatg
//
LOCUS       VSVPPAB       856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 09/82-HD-B) phosphoprotein
            gene, complete cds.
ACCESSION   M31872
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain , New Jersey serotype), cDNA to
            viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      283 a    168 c    185 g    220 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttac ttagctactt atgataattt
       61 ggattctgcc ttgcaggatg ccaatgaatc tgaggaaagg cgagaggata aatatctcca
      121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccagatga
      241 aagaatagac aagtgggaag aagatttgcc tgaattagaa aagattgatg atgatatacc
      301 ggtgaccttt cctaattgga cacagcctgt aatgaaggaa aatggaggag agaaatcatt
      361 gtctctgttc cctccagttg gattaacaaa aattcaaaca gaacaatgga gaaaaaccat
      421 tgaggcggtt tgtgagagtt caaaatattg gaatctatca gaatgccaaa ttctgaactt
      481 agaagactgc ctaaccctca aaggccgagt gatgactcct gactgtagtt cttcagtcaa
      541 atctcaaaat tctgtccaga gttcggaatc tctctactcc tctcattccc caggccccac
      601 actcaaggtg tcagagtcca tcaatctatg ggatttaaag tccacagaag tacaactgat
      661 ctctaagaga gccggagtta aggacatgac agtcaaattg acagacttct tcggaagtga
      721 ggaagagtat tattcagtat gcccagaagg ggcaccagac ttgatgggag ctatcatcat
      781 gggattgaaa tataagaaac tcttcaatca ggcgcgaatg aagtatcgtc tttaattcct
      841 tctcatgatc agtatg
//
LOCUS       VSVPPAC       856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain ../49-UT-B1) phosphoprotein
            gene, complete cds.
ACCESSION   M31873
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain ../49-UT-B1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      284 a    161 c    185 g    226 t
ORIGIN      
        1 aacagatatc atggacagtg ttgataggct caagacttac ttagccactt atgataattt
       61 ggattctgcc ttgcaggatg ccaatgaatc tgaggaaaga cgagaggata aatatctcca
      121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg attataatgc tgaacatctt acgatgctgt caccggatga
      241 aagaatagac aagtgggaag aagatttgcc tgaattagaa aagattgatg atgatatacc
      301 ggtgaccttt tctgattgga cacagcctgt aatgaaggaa aatgggggag agaaatcatt
      361 gtctctgttc cctccagtcg ggttaacaaa gattcaaaca gaacaatgga aaaaaaccat
      421 tgaggcggtt tgtgagagtt caaaatattg gaatttatca gaatgccaaa ttcttaactt
      481 ggaagacagc ctcactatca aaggccgatt gatgactcct gattgtagtt cttcagtaaa
      541 atctcaaaat tctgtccgga ggtcagaacc tctctactcc tctcattctc caggtccccc
      601 actcaaggta tcagagtcca tcaatttatg ggatttaaag tccactgaag tacaattgat
      661 ctccaagaga gccggagtta aggacatgac agtcaaattg acagacttct ttggaagtga
      721 ggaagagtat tattcagtat gcccagaagg ggcgccagac ttgatgggag ctatcatcat
      781 gggactgaag tacaagaaac tcttcaatca ggcaagaatg aaatatcgtc tttaattcct
      841 tttcatgatc aatatg
//
LOCUS       VSVPPAD       855 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 10/82-CR-B) phosphoprotein
            gene, complete cds.
ACCESSION   M31874
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 10/82-CR-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 855)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      275 a    153 c    199 g    228 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
      121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga
      241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgatatacc
      301 ggtcaccttc cctaattgga ctcaacctgt aatgaaagag aatggtgggg agaagtcatt
      361 atctttgttt cctcccgttg gattgaccaa ggttcagacc gaacaatgga ggaagacgat
      421 tgaggcagtc tgtgagagct caaaatattg gaatttatca gagtgccgga ttgtaacttc
      481 agatgattgt cttattctca gaggacgagt tatgactcct gattgcagtt catcagttaa
      541 atctcagaac tccgtacgga gttcagaatc tctttactcc tcccattcac caggacctgc
      601 atctagggca tcagagtcaa tgaatttatg ggacttgaag tcaacagaag tccaattgac
      661 ctcaaagaga gcaggagtca aagacatgac agtcaagtta acagacttct ttggaagtga
      721 ggaagagtat tattcagtat gcccggaagg agcatcagat ttgatgggag caattattat
      781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaatatcgac tttaactcct
      841 tactacagtc aatat
//
LOCUS       VSVPPAE       856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 12/82-HD-B) phosphoprotein
            gene, complete cds.
ACCESSION   M31875
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 12/82-HD-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      277 a    153 c    197 g    229 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
      121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga
      241 gagagtagac aagtgggagg aagacttgcc tgaattagag aagattgatg atgatatacc
      301 ggtcactttc cctaattgga ctcaacctgt aatgaaagag aatggtgggg agaagtcatt
      361 atctttgttt cctcccgttg gattgaccaa gattcagacc gaacaatgga gaaagacgat
      421 tgaggcagtc tgtgagagct caaaatactg gaatttatca gagtgccgga ttgtaacttc
      481 agatgattgt cttattctca gaggacgagt tatgactcct gattgcagtt catcagttaa
      541 atctcagaat tctgtacaga gttcagaatc tctctactcc tcccattcac caggacctgt
      601 tcctagggca tcagagtcaa tgaatttatg ggacttgaag tcaacagaag tccaactgat
      661 ctcaaagaga gcaggagtca aagacatgac agtcaagtta acagacttct ttggaagtga
      721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caattattat
      781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaataccgac tttaactcct
      841 tactacagtc aatatg
//
LOCUS       VSVPPAF       856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 10/85-HD-B1) phosphoprotein
            gene, complete cds.
ACCESSION   M31876
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 10/85-HD-B1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      279 a    160 c    195 g    222 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
      121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccagacga
      241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgatatacc
      301 ggtcatcttc cctaattgga ctcaacctgt aatgaaagag aatggtgggg agaagtcatt
      361 atctttgttt cctcccattg gattgaccaa ggttcagacc gaacaatgga gaaagacgat
      421 tgaggcagtc tgtgagagct caaaatactg gaatttatca gagtgccgga ttgtaacttc
      481 agatgattgt cttattctca gaggacgagt tatgactcct gattgcagtt catcagttaa
      541 atctcagaat tctgtacaaa gttcagaatc tctctactcc tccccttcac caggacctgc
      601 acctagggca tcagagtcaa tgaatttatg ggacctgaag tcaacagaag tccaactgac
      661 ctcaaagaga gcaggagtca aagacatgac agtcaagtta acagacttct ttggaagtga
      721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caatcattat
      781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaataccgac tttaactcct
      841 tactacagtc aatatg
//
LOCUS       VSVPPAG       856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 07/83-NC-P) phosphoprotein
            gene, complete cds. genes.
ACCESSION   M31877
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 07/83-NC-P, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      279 a    148 c    199 g    230 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaggagaga agagaggata aatatctcca
      121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga
      181 atcgtcagat tcagatactg actataatgc tgagcatctt acgatgttgt cgccagatga
      241 aagaatagac aagtgggaag aggacttgcc tgaattagag aagattgatg atgatatacc
      301 ggtcaccttc cctaattgga ctcaacctgt aatgaaggag aatggtgggg agaagtcatt
      361 atctttgttt cctccagttg gattgactaa ggttcaaaca gaacaatgga ggaagactat
      421 tgaggcagtc tgtgagagct caaaatattg gaatttatca gagtgccgga ttgtaacttc
      481 agatgattgt ctcattctca gaggacgagt tatgactcct gattgcagct catcagttaa
      541 atctcagaat tctgtacaga gttcagaatc tctctactcc tcccattcac caggacctgc
      601 acctagagca tcagagtcaa cgaatctatg ggatttgagg tctacagagg tgcaattgac
      661 ttcaaagaga gcaggagtta aggatatgac agtaaagtta acagatttct ttgaaagcga
      721 agaagagtat tattcagtat gcccagaggg agcatcagac ttaatgggag caattattat
      781 gggactgaag cataagaaac tctttaatca agcaagaatg aaatatcgac tttaacttct
      841 tactgcagac aatatg
//
LOCUS       VSVPPAH       856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 01/85-PN-B1) phosphoprotein
            gene, complete cds.
ACCESSION   M31878
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 01/85-PN-B1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      276 a    156 c    198 g    226 t
ORIGIN      
        1 aacagatatc atggacagtg tcgatcggct caagacgtat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
      121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aagaggaaga
      181 atcgtcagat tcagatactg actataatgc tgagcatctt acgatgttgt cgcccgagga
      241 aagaatagac aagtgggaag aagacttgcc tgaattggag aagatcgatg acgatatact
      301 ggtcacattc cctagttgga ctcaacctgt aataaaagag aatggtgggg agagatcatt
      361 atctttattt cctccagtcg gattgactca ggttcagaca gaacagtgga agaagaccat
      421 tgagacagtc tgtgagagct caaaatattg gaatttgtca gagtgccaga ttgtaacttc
      481 aggaaattgt cttattttga ggggacaagt tatgacctct gattgcagcc catcagctaa
      541 atctaagaat tctgtacaga gttccgagtc tccttcccct tccaattcac ccgaacatgc
      601 atctagagca tcagcttcac cgaatctatg ggactttaag tttacagagg tgcaattgat
      661 ctcaaaaaga gcaggagtta aggacatgac agtaaaatta acagacttct ttgaaagcga
      721 agaagagtat tattcagtat gcccggacgg agcatctgat ttgatggggg caattgttat
      781 gggcctgaag cataagaaac tctttaatca ggcaaggatg aaatatcgac tttaacttct
      841 tactgcagac aatatg
//
LOCUS       VSVPPAI       856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain ../60-PN-B) phosphoprotein
            gene, complete cds.
ACCESSION   M31879
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain ../60-PN-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      271 a    163 c    206 g    216 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc cgaagagaga agagaggata aatatctcca
      121 ggacctcttc atcgaagatc aaggagataa accaactccg tcgtattatc aggaggaaga
      181 atcgtcagat tcagatactg actacaatgc tgagcatctt acgatgttgt cgccagatga
      241 aagaatagac aagtgggaag aggacttgcc agaattagag aagatcgatg atgatatacc
      301 ggtcaccttc cctaattgga ctcaacctgt aatgaaggag aatggtgggg agaagtcatt
      361 atcattgttt cctccagttg gattgactaa ggttcagaca gaacagtgga agaagaccat
      421 cgaggcagtc tgtgagagct caaaatattg gaacttatca gagtgccgga ttgtgacttc
      481 agatgattgt ctcattctga ggggacgagt tatgactcct gattgcagct catcagttaa
      541 atctcagaat tctgtacaga gttccgagtc tctctgctcc tcccattcac caggacctgc
      601 ccctagagca tcagattcaa cgaatctatg ggacttgaag tccacagagg tgcaactagc
      661 ctctaagaga gcaggagtta aggacatgac agtaaagtta acagacttct ttgaaagcga
      721 agaagagtat tattcagtat gcccagaagg agcatccgac ttgatggggg caattattat
      781 gggactgaag cataagaaac tctttaacca ggcaagaatg aaatatcgga tttaacttct
      841 tactgcggac aatatg
//
LOCUS       VSVPPAJ       856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain ../76-EC-M) phosphoprotein
            gene, complete cds.
ACCESSION   M31880
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain ../76-EC-M, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      278 a    156 c    197 g    225 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg ccaatgaatc tgaagagaga agagaggata aatatctcca
      121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aagaagaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga
      241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgacatacc
      301 ggttaccttt cctaattgga ctcaacctgt catgaaagag aatggtgggg agaaatcatt
      361 atctctgttt cctcccgttg gattgaccaa gtttcagacc gaacaatgga ggaagacgat
      421 tgaagcggtc tgtgaaagct caaaatattg gaatttatca gaatgccgga tggtaacttc
      481 agatgattgt ctgattctca gaggacgagt tataactcct gattgcagtt catcagttaa
      541 atctcagaat tctgtacgga gttcagaatc tctctattca tcccactcac caggacctgc
      601 acctggggca tcagggtcaa tgaatttatg ggacttgaag tcaacagaag tccaactgac
      661 ctcaaagaga gcaggagtca aggacatgac agtcaagtta acagacttct ttggaagtga
      721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caattattat
      781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaatatcgac tttaacccct
      841 tactacagtc aatatg
//
LOCUS       VSVPPR        856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain ../76-EC-M) phosphoprotein
            gene, complete cds.
ACCESSION   M31862
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain ../76-EC-M, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      278 a    156 c    197 g    225 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg ccaatgaatc tgaagagaga agagaggata aatatctcca
      121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aagaagaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga
      241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgacatacc
      301 ggttaccttt cctaattgga ctcaacctgt catgaaagag aatggtgggg agaaatcatt
      361 atctctgttt cctcccgttg gattgaccaa gtttcagacc gaacaatgga ggaagacgat
      421 tgaagcggtc tgtgaaagct caaaatattg gaatttatca gaatgccgga tggtaacttc
      481 agatgattgt ctgattctca gaggacgagt tataactcct gattgcagtt catcagttaa
      541 atctcagaat tctgtacgga gttcagaatc tctctattca tcccactcac caggacctgc
      601 acctggggca tcagggtcaa tgaatttatg ggacttgaag tcaacagaag tccaactgac
      661 ctcaaagaga gcaggagtca aggacatgac agtcaagtta acagacttct ttggaagtga
      721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caattattat
      781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaatatcgac tttaacccct
      841 tactacagtc aatatg
//
LOCUS       VSVPPS        856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 06/85-NM-E) phosphoprotein gene,
            complete cds.
ACCESSION   M31863
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 06/85-NM-E, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and itch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      288 a    161 c    190 g    217 t
ORIGIN      
        1 aacagagatc atggacaata ttgatcggct caaaacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
      121 agacctcttc atcgaagatc agggagataa accaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataattc tgaacatctt acgatgttgt cgccggatga
      241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagatcgatg atgatatacc
      301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact
      361 atctctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat
      421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
      481 agatgattgt cttatcctca aaggccgagt tatgactcct gactgcagtt catcaatcaa
      541 atctcagaac tccacacagg gttctgaatc tctatcctcc tcacattcac ccggtccagc
      601 acccaaggtc aaggagtcaa ccagtctatg ggacttaaga tcaacggaag tacaattgac
      661 ttcgaagagg gctggagtga aagacatgat ggtgaaattg acagactttt ttgggagtga
      721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
      781 gggactaaaa cataagaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat
      841 tccgatgatc gatatg
//
LOCUS       VSVPPT        856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 11/83-CA-B) phosphoprotein
            gene, complete cds.
ACCESSION   M31864
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 11/83-CA-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      292 a    158 c    186 g    220 t
ORIGIN      
        1 aacagagatc atggacagta ttgatcggct caaaacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
      121 agacctcttc atcgaaaatc agggagataa aacgactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataattc tgaacatctt acgatgttgt cgccggatga
      241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
      301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact
      361 atctctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat
      421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
      481 agatgattgt cttatcctca aaggccgagt tatgactcct gactgcagtt catcaatcaa
      541 atctcagaac tccatacaga gttccgaatc tctctcctct tcacattcac ccggtccagc
      601 acccaaagtc aaggagtcaa ccagtctatg ggacttaaga tcaacagaag tacaattgat
      661 ctcgaagagg gctggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga
      721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
      781 gggactaaaa cataagaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat
      841 ttcgatgatc aatatg
//
LOCUS       VSVPPU        856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 01/84-SN-P1) phosphoprotein
            gene, complete cds.
ACCESSION   M31865
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 01/84-SN-P1, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      289 a    160 c    190 g    217 t
ORIGIN      
        1 aacagagatc atggacaata ttgatcggct caaaacttat ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
      121 agacctcttc atcgaagatc agggagataa accaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataagtc tgaacatctt acgatgttgt cgccggatga
      241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagatcgatg atgatatacc
      301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact
      361 atctctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat
      421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
      481 agatgattgt cttatcctca aaggccgagt tatgactcct gactgcagtt catcaatcaa
      541 atctcagaac tccacacagg gttctgaatc tctatcctcc tcacattcac ccggtccagc
      601 acccaaggtc aaggagtcaa ctagtctatg ggacttaaga tcaacggaag tacaattgac
      661 ttcgaagagg gctggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga
      721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
      781 gggactaaaa cataagaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat
      841 tccgatgatc gatatg
//
LOCUS       VSVPPV        856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 07/83-GA-P) phosphoprotein
            gene, complete cds.
ACCESSION   M31866
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 07/83-GA-P, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      296 a    161 c    188 g    211 t
ORIGIN      
        1 aacagagatc atggacagta ttgatcggct caaaacttac ttggctactt atgataattt
       61 ggattctgcc ctgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
      121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataatgt tgaacatctt acgatgttat cgccggatga
      241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
      301 agtcactttt tctgactgga cacaacctgt aatgaaggaa aacggagggg agaaatcact
      361 atctctgttt cctccggttg gattaacaaa agttcagaag gaccaatgga ggaagacaat
      421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
      481 agaagattgt cttatcctca aaggccgaat tatgactccc gactgcagct catcaatcaa
      541 atctcagaac tccatacaga gttccgaatc gctctcctcc tcgcattcac ccggtccagc
      601 acccaaagtc acggagtcaa ctagtctatg ggatttaaaa tcgacagaag tacaactgat
      661 atcgaagagg gccggagtaa aagacatgat ggtgaagttg acagactttt ttggaagtga
      721 agaggaatat tattcagtat gcccaaaagg ggcaccagac ttgatgggag caatcatcat
      781 gggactaaaa cataaaaagc tttttaatca agcaagaatg aagtatcgtc tctaattaat
      841 tccgatgaac gatatg
//
LOCUS       VSVPPW        856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 11/82-VC-B2) phosphoprotein
            gene, complete cds.
ACCESSION   M31867
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 11/82-VC-B2, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      291 a    164 c    187 g    214 t
ORIGIN      
        1 aacagagatc atggacagta ttgatcggct caaaacttac ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
      121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccggatga
      241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
      301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact
      361 atctctgttc cctccggttg gattaacaaa agttcagacg gaacaatgga ggaagacaat
      421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc
      481 agatgattgt cttatcctca aaggccgagt gatgactcct gactgcagtt catcaatcaa
      541 atctcagaat tccatacaga gttctgaatc tctctcctcc tcgcattcac ccggtccagc
      601 acccaaagtc acggagtcaa ctagtctatg ggacttaaaa tcaacagaag tacaactgat
      661 ctcgaagagg gccggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga
      721 agatgaatat cattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
      781 gggactaaaa cataaaaaac tctttaatca agcaagaatg aagtatcgtc tctaattcat
      841 tccgatgatc gatatg
//
LOCUS       VSVPPX        856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 07/84-OA-B) phosphoprotein
            gene, complete cds.
ACCESSION   M31868
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 07/84-OA-B, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      294 a    158 c    186 g    218 t
ORIGIN      
        1 aacagagatc atggacagta ttgatcggct caaaacttac ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca
      121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccggatga
      241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
      301 agtcactttt tctgattgga cacaacctgt aatgaaggaa aacggagggg agaaatcact
      361 atctctgttt cctccggttg gattaacaaa aattcagacg gaccaatgga ggaagacaat
      421 tgaggcagtc tgtgagagct caaaatattg gaatttgtca gaatgccaaa ttatgaactc
      481 agataattgt cttatcctca aagtccgagt tatgactcct gactgcagtt catcaatcaa
      541 atctcagaac tccatacaga gttccgaatc tttctcctcc tcgcattcac cgggtccagc
      601 actaaaagtc acggagtcaa ctagtctatg ggacttaaga tcaacagaag tacaactgat
      661 ctcgaagagg gccggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga
      721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat
      781 gggactaaaa cataaaaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat
      841 tccgatgatc gatatg
//
LOCUS       VSVPPY        856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain ../52-GA-P) phosphoprotein
            gene, complete cds.
ACCESSION   M31869
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain ../52-GA-P, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      292 a    164 c    189 g    211 t
ORIGIN      
        1 aacagagatc atggatagtg ttgatcggct caaaaattac ttggctactt atgataattt
       61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca
      121 agacctcttc atcgaagatc aaggagataa atcaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccggatga
      241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc
      301 agtcactttt tctgactgga cacaacctgt aatgaaggaa aacggagggg agaaatcact
      361 ctatctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat
      421 tgatgcggtc tgtgagagct ccaaatattg gaatttgtca gaatgccaga ttatgaactc
      481 agaagattgt cttatcctca aaggccgaat tatgactcct gactgcagct catcaatcaa
      541 atctcagaac tccatacaga gttccgaatc gccctcctcc tcgcattcac ccggtccagc
      601 acccaaagtc acggaatcaa ctagtctatg ggacttaaaa tcaacagacg tacaactgat
      661 ctcgaagagg gccggagtga aagacatgac ggtgaagttg acagactttt ttggaagtga
      721 agaggaatat tattcagtat gcccaaaagg ggcaccagac ttgatgggag caatcatcat
      781 gggactaaaa cataaaaagc tttttaatca agcaagaatg aagtatcgtc tctaattaat
      841 tccgatgaac gatatg
//
LOCUS       VSVPPZ        856 bp ss-RNA             VRL       08-AUG-1990
DEFINITION  Vesicular stomatitis virus (strain 10/84-GM-P) phosphoprotein
            gene, complete cds.
ACCESSION   M31870
KEYWORDS    phosphoprotein.
SOURCE      Vesicular stomatitis virus (strain 10/84-GM-P, New Jersey
            serotype), cDNA to viral RNA.
  ORGANISM  Vesicular stomatitis virus
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Rhabdoviridae; Vesiculovirus.
REFERENCE   1  (bases 1 to 856)
  AUTHORS   Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T.
  TITLE     Phosphoprotein and nucleocapsid protein evolution of vesicular
            stomatitis virus, New Jersey
  JOURNAL   J. Virol. 64, 2498-2504 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            S.T. Nichol, 05-FEB-1990.
FEATURES       from  to/span     description
    pept         11      835     phosphoprotein
BASE COUNT      283 a    166 c    183 g    224 t
ORIGIN      
        1 aacagatatc atggacagtg ttgatcggct caagacttac ttagctactt atgataattt
       61 ggattctgcc ctgcaggatg ccaatgaatc tgaggaaaga cgagaggata aatatctcca
      121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga
      181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccggatga
      241 aagaatagac aagtgggaag aagacttgcc tgaattagaa aagattgatg atgatatacc
      301 ggtaaccttt cctaattgga cacagcctgt aatgaaggaa aatggaggag agaaatcatt
      361 gtccctgttc cctccagttg ggttaacaaa gattcaaaca gaacaatgga gaaaaaccat
      421 tgaggcggtt tgtgagagtt caaaatattg gaatttatca gaatgccaaa ttcttaactt
      481 agaagactgc ctcactctca aaggccgagt tatgactcct gattgtagtt cttcagtcaa
      541 atctcaaaat tctgtccgga gttcagaatc tctctactct tctcattccc caggtcccac
      601 actcaaggta tcagagtcca tcaatctatg ggatttaaag tccacagaag tacaattgat
      661 ctctaagaga gctggagtta aggacatgac agtcaaattg acagacttct tcggaagtga
      721 ggaagagtat tattcagtat gcccagaagg ggcaccagac ttgatgggag ccatcatcat
      781 gggattgaag tacaagaaac tcttcaatca ggcaagaatg aagtatcgcc tttaattcct
      841 tcttatgatc agtatg
//
LOCUS       HUMCYPIIF    1825 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human cytochrome P450IIF1 protein (CYP2F) mRNA, complete cds.
ACCESSION   J02906
KEYWORDS    cytochrome P450.
SOURCE      Human lung, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1825)
  AUTHORS   Nhamburo,P.T., Kimura,S., McBride,O.W., Kozak,C.A.,
            Gelboin,H.V. and Gonzalez,F.J.
  TITLE     The human CYP2F gene subfamily: Identification of a cDNA encoding a
            new cytochrome P450, cDNA-directed expression and chromosome
            mapping
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.T.Nhamburo, 18-MAY-1990.
FEATURES       from  to/span     description
    pept         56     1531     cytochrome P450IIF1 /nomgen="CYP2F"
                                 /map="19q13.1-q13.2" /hgml_locus_uid="LV0170W"
    mRNA    <     1     1813     CYP2F mRNA
BASE COUNT      356 a    596 c    470 g    403 t
ORIGIN      
        1 gcaggctcag cgcatcccag ccagtgtctc ctgcagctca gcagctgcct tcaccatgga
       61 cagcataagc acagccatct tactcctgct cctggctctc gtctgtctgc tcctgaccct
      121 aagctcaaga gataagggaa agctgcctcc gggacccaga cccctctcaa tcctgggaaa
      181 cctgctgctg ctttgctccc aagacatgct gacttctctc actaagctga gcaaggagta
      241 tggctccatg tacacagtgc acctgggacc caggcgggtg gtggtcctca gcgggtacca
      301 agctgtgaag gaggccctgg tggaccaggg agaggagttt agtggccgcg gtgactaccc
      361 tgcctttttc aactttacca agggcaatgg catcgccttc tccagtgggg atcgatggaa
      421 ggtcctgaga cagttctcta tccagattct acggaatttc gggatgggga agagaagcat
      481 tgaggagcga atcctagagg agggcagctt cctgctggcg gacgtgcgga aaactgaagg
      541 cgagcccttt gaccccacgt ttgtgctgag tcgctcagtg tccaacatta tctgttccgt
      601 gctcttcggc agccgcttcg actatgatga tgagcgtctg ctcaccatta tccgccttat
      661 caatgacaac ttccaaatca tgagcagccc ctggggcgag ttgtacgaca tcctagaccc
      721 cagattcccg agcctcctgg actgggtgcc tgggccgcac caacgcatct tccagaactt
      781 caagtgcctg agagacctca tcgcccacag cgtccacgac caccaggcct cgtctccccg
      841 ggacttcatc cagtgcttcc tcaccaagat ggcagaggag aaggaggacc cactgagcca
      901 cttccacatg gataccctgc tgatgaccac acataacctg ctctttggcg gcaccaagac
      961 ggtgagcacc acgctgcacc acgccttcct ggcactcatg aagtacccaa aagttcaagc
     1021 ccgcgtgcag gaggagatcg acctcgtggt gggacgcgcg cggctgccgg cgctgaagga
     1081 ccgcgcggcc atgccttaca cagacgcggt gatccacgag gtgcagcgct ttgcagacat
     1141 catccccatg aacttgccgc accgcgtcac tagggacacg gcctttcgcg gcttcctgat
     1201 acccaagggc accgatgtca tcaccctcct taacaccgtc cactacgacc ccagccagtt
     1261 cctgacgccc caggagttca accccgagca ttttttggat gccaatcagt ccttcaagaa
     1321 gagtccagcc ttcatgccct tctcagctgg gcgccgtctg tgcctgggag agctgctggc
     1381 gcgcatggag ctctttctgt acctcaccgc catcctgcag agcttttcgc tgcagccgct
     1441 gggtgcgccc gaggacatcg acctgacccc actcagctca ggtcttggca atttgccgcg
     1501 gcctttccag ctgtgcctgc gcccgcgcta acgccccggc ccttccagat tcgcctgtga
     1561 gcgatgaggc ccacccatgt gggttgctac gtccccttct tggtccacag tctgccctca
     1621 tccctctggc agtcacgctg tcttccctgc atgctgtgcc tgccgcgtgc ccttccccca
     1681 tccctccaat ctgtgccccg tctgcagggc agaggcagat gtggcatgtc tttttgtacc
     1741 cacagagctt gttctatggc acgccctttt ctaggctttt tgtatcattt cttagtacat
     1801 tgtaatagat tcaaaccagt cttgg
//
LOCUS       MUSADA3A      891 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Mouse adenosine deaminase (ada) gene, 3' end.
ACCESSION   J05523
KEYWORDS    adenosine deaminase.
SOURCE      Mouse fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 891)
  AUTHORS   Maa,M.-C., Chinsky,J.M., Ramamurthy,V., Martin,B.D. and
            Kellems,R.E.
  TITLE     Identification of transcription stop sites at the 5' and 3' ends of
            the murine adenosine deaminase gene
  JOURNAL   J. Biol. Chem. 265, 12513-12519 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by V.Ramamurthy, 18-MAY-1990.
BASE COUNT      241 a    207 c    262 g    181 t
ORIGIN      
        1 atagctggcc accttgcctt gattctatca aaggcattcc agtcccagca tcaagcacat
       61 tctgtcctta tgcctccctt tcagacatca gtcatggtgg ctagcaagca tcatgtactg
      121 ggaggaccca aatacacaaa gactctcagt aaggcaaacg gtgttgtaaa ggcctacatg
      181 aggggactag tgatggctaa ggatgaaggc aggcagcaaa caaagctgat gagagctgag
      241 cagagcacag gagccttgaa gagccccttg gcactgtgcc cggtgtgcca gctgggtcca
      301 ctgctcattc ggttggacag tgagcagccg ttgagtaggt acccgtcacg tgccaaacac
      361 tgttagtcac agagggcatg agacagcctg tgctggaaca gctcagttca tagggccatg
      421 gagatgggga gaaaggggcg cctctgtcag agacaagctg tggtctggga aggccttagc
      481 actaaaagca ccacaatgag aagcaaccgc cagaagcagg gcccgcaggc ctttgttcca
      541 gctgcaaaga gaaaggaaaa agtggggaat aagagttggg gctgcggagg gggtggggag
      601 cattgtgcag gttccgtact tgaacagaaa gcagggacca acacaaggaa ggccaaaggg
      661 ctcagcctgg gaaactttgt gaggctatgg aggtaagggg cagggtattc tcccgaggca
      721 tagagactgt cataccaatg atggtcattg gctgggcgag cctaagaaga ctccagtttc
      781 ccttgacctc agttaaatct attacatgag aaactgggtc cctgtcccca acattgtccc
      841 tgttccgtgt tcctaagtca tttctgggca ataaggactg agaactctag a
//
LOCUS       YSCG6PD      3782 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  S.cerevisiae glucose-6-phosphate dehydrogenase (ZWF1) gene and ORF,
            complete cds.
ACCESSION   M34709
KEYWORDS    glucose-6-phosphate dehydrogenase.
SOURCE      S.cerevisiae (strain S288C) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3782)
  AUTHORS   Nogae,I. and Johnston,M.
  TITLE     Isolation and characterization of the ZWF1 gene of Saccharomyces
            cerevisiae, encoding glucose-6-phosphate dehydrogenase
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Johnston, 18-MAY-1990.
FEATURES       from  to/span     description
    pept       1585     3102     glucose-6-phosphate dehydrogenase (ZWF1) (EC
                                 1.1.1.49)
    pept        246     1037     pot. ORF
    mRNA       1472  >  3102     ZWF1 mRNA
BASE COUNT     1066 a    899 c    887 g    930 t
ORIGIN      
        1 atcgattagc aaaaccgtag agaagatcat tgcgcataag aaacaaaaag aaaataccgg
       61 tgcggaccgc aagcctctat tatctgctgt gtgtccggga tttcttatat acacggaaaa
      121 gaccaaaccg cagttggtgc ctatgctgct caatgtcaag tcgcctcagc aaattacggg
      181 ctcgttgatc agagctacgt tcgagagtct ctatcgcgcg tgagtccttt taccatctgt
      241 cgctgatgcc atgttttgac aagaaactag aagcgtcaag gccagaatcg cttgacgatg
      301 ggattgactg tgtcattacg ccacgtgaaa tagtgacaat gttacaagag ttgaatttag
      361 acttcaagtc gtttcttact gaggacacaa gtttatatgg cgctatcacc gccgggtggg
      421 acccgcgagt ccactgggcg tcgaatctgg gaggcacttg tggtggctat gcctaccagt
      481 acgtaacggc tgtccaacga ctacacccag gaagtcagat gatagttctg gagggtagaa
      541 acagcgacat tgtcgagtac cgattgctgc acgatgatcg cataatagcc gccgccagcg
      601 agctttccgg gtttagaaac atccaaaatc tggtgcgtaa actgacctcg ggctccgggt
      661 cagagcgcaa gaggaacatc agctctgcgg aagagagacg aacaggtcca aaggcgaact
      721 cacgcgagat ggcgccagcc gccacggccg atccctacca ctcggactac atcgaggtaa
      781 acgcgtgtcc gggtgcatgc atgaacggtg gtggcctact caatggcgag caaaacagtc
      841 tcaagcggaa gcagttggtt cagacactga ataagcgtca cggagaggag ctagcaatgg
      901 tagatcctct aaccctggga cccaagctag aagaggcggc accgcccgct ttcgctagag
      961 tacgtcttcg cgcccgtcaa gcaggccgtc gaaaaggatc tcgtctctgt tgggagcacc
     1021 tggtaagtaa ggtgtagttt tgcacccgtg tacataagcg tgaaatcacc acaaactgtg
     1081 tgtatcaagt acatagtgac atttaaataa tagcaagaac aacaataata gtagcgctac
     1141 tggaagcacc acgtaatagt ggaaaagaac tggaaaaacc gctataagat gcatactccg
     1201 gcggtcttac gcggagatac aagcttccaa cggtgctaaa agcccggttt cggctcggcc
     1261 ggaggaggaa gagagacgaa aaaaaaaaaa atgactaaaa aaaaaatgga atattattaa
     1321 tgtgggattt ttggctcaag gtgtggtggc cccttttcta agggtggcga attcttcaat
     1381 gtacggaaaa ctcgccaagg ctatcccata tataagcaaa ctgtgggttc atctatatac
     1441 cgacacataa cacctaaagt ggcttcctcc tgcccctctc tcccttttct ccactcaccc
     1501 ctccttctcc cccttccccc tctccaattg gctgtataga cagaaagagt aaatccaata
     1561 gaatagaaaa ccacataagg caagatgagt gaaggccccg tcaaattcga aaaaaatacc
     1621 gtcatatctg tctttggtgc gtcaggtgat ctggcaaaga agaagacttt tcccgcctta
     1681 tttgggcttt tcagagaagg ttaccttgat ccatctacca agatcttcgg ttatgcccgg
     1741 tccaaattgt ccatggagga ggacctgaag tcccgtgtcc taccccactt gaaaaaacct
     1801 cacggtgaag ccgatgactc taaggtcgaa cagttcttca agatggtcag ctacatttcg
     1861 ggaaattacg acacagatga aggcttcgac gaattaagaa cgcagatcga gaaattcgag
     1921 aaaagtgcca acgtcgatgt cccacaccgt ctcttctatc tggccttgcc gccaagcgtt
     1981 tttttgacgg tggccaagca gatcaagagt cgtgtgtacg cagagaatgg catcacccgt
     2041 gtaatcgtag agaaaccttt cggccacgac ctggcctctg ccagggagct gcaaaaaaac
     2101 ctgggggccc tctttaaaga agaagagttg tacagaattg accattactt gggtaaagag
     2161 ttggtcaaga atcttttagt cttgaggttc ggtaaccagt ttttgaatgc ctcgtggaat
     2221 agagacaaca ttcaaagcgt tcagatttcg tttaaagaga ggttcggcac cgaaggccgt
     2281 ggcggctatt tcgactctat aggcataatc agagacgtga tgcagaacca tctgttacaa
     2341 atcatgactc tcttgactat ggaaagaccg gtgtcttttg acccggaatc tattcgtgac
     2401 gaaaaggtta aggttctaaa ggccgtggcc cccatcgaca cggacgacgt cctcttgggc
     2461 cagtacggta aatctgagga cgggtctaag cccgcctacg tggatgatga cactgtagac
     2521 aaggactcta aatgtgtcac ttttgcagca atgactttca acatcgaaaa cgagcgttgg
     2581 gagggcgtcc ccatcatgat gcgtgccggt aaggctttga atgagtccaa ggtggagatc
     2641 agactgcagt acaaagcggt cgcatcgggt gtcttcaaag acattccaaa taacgaactg
     2701 gtcatcagag tgcagcccga tgccgctgtg tacctaaagt ttaatgctaa gacccctggt
     2761 ctgtcaaatg ctacccaagt cacagatctg aatctaactt acgcaagcag gtaccaagac
     2821 ttttggattc cagaggctta cgaggtgttg ataagagacg ccctactggg tgaccattcc
     2881 aactttgtca gagatgacga attggatatc agttggggca tattcacccc attactcaag
     2941 cacatagagc gtccggacgg tccaacaccg gaaatttacc cctacggatc aagaggtcca
     3001 aagggattga aggaatatat gcaaaaacac aagtatgtta tgcccgaaaa gcacccttac
     3061 gcttggcccg tgactaagcc agaagatacg aaggataatt agaaaaatgc aagcacattc
     3121 atttatcggc taagtcactg aaattttttt tttcgagtga ttttatcttg caagctcact
     3181 ctctttgttc ttttctcctc tataatggca tctccccccc accaactcct ctctacttgc
     3241 gtgtgtattt gtttgtatac atgtgtaaat atatatacat aaaagaatgt cgtctcatgt
     3301 attttttaac ttttagccgc ggtcagtgac attttggctt tcccaccatt ccacgtctga
     3361 aaaaaaaaaa aaaaaaagag ataaaatatg aattgaatat atatcaaaaa tgtctgcaaa
     3421 aatttttacg aatcagtccg attggacttg tacttatcat tgatttgctc gttatgcgtc
     3481 ttatccactt gattagctat tccctgtaaa gtcttagaca ctgcctcagt ggccccaatt
     3541 aacggccgaa taattgccac aggtgcagct ctagcaacat aaactgctgc ggcgctcggc
     3601 cctcccggtt tatcatcctt catctgtccc tttgccctcc ataccgcatc gtacgcaata
     3661 tgcatatgct tttctaatga actatacgcc tccttcaaac ccgtgggcag gtcgaggggc
     3721 tgatctgcgt acagactcac tattgtgggt tcgttgtgat ctcccggtga agacatatcg
     3781 at
//
LOCUS       BOVLMCB       326 bp ds-DNA             SYN       08-AUG-1990
DEFINITION  Synthetic bovine liver microsomal lipase-solubilized cytochrome b-5
            gene, complete cds.
ACCESSION   J02905
KEYWORDS    cytochrome b-5.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 326)
  AUTHORS   Funk,W.D., Lo,T.P., Mauk,M.R., Brayer,G.D., MacGillivray,R.T.A. and
            Mauk,A.G.
  TITLE     Mutagenic, electrochemical, and crystallographic investigation of
            the cytochrome b-5 oxidation-reduction equilibrium: Involvement of
            asparagine-57, serine-64, and heme propionate-7
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            G.Mauk, 16-MAY-1990.
FEATURES       from  to/span     description
    pept         27      311     microsomal cytochrome b-5
    binding      16       19     ribosome binding site
BASE COUNT      106 a     82 c     77 g     61 t
ORIGIN      
        1 gtacggtacc gtaaaggaga acaacaatga gcaaagctgt caaatactac actctggaac
       61 aaatccaaaa acacaacaac tcgaaatcga cgtggctgat cctcgactac aaagtatacg
      121 acctgactaa attcctggaa gagcacccgg gaggcgaaga agtcctgcgc gaacaggccg
      181 gcggtgacgc gactgaaaac tttgaggatg ttggccatag taccgacgct cgtgaactgt
      241 cgaaaacgtt catcatcggt gagctgcacc cggacgatcg ttctaaaatc accaaaccgt
      301 ccgaatctta atgagaattc gtctga
//
LOCUS       DROEST6P     5424 bp ds-DNA             INV       08-AUG-1990
DEFINITION  D.melanogaster carboxylesterase 6 and P (Est-6 and Est-P) genes,
            complete cds.
ACCESSION   M33780 M33781
KEYWORDS    carboxylesterase.
SOURCE      D.melanogaster (strain Dm145) adult DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 929 to 4027)
  AUTHORS   Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and
            Richmond,R.C.
  TITLE     Molecular analysis of duplicated esterase genes in Drosophila
            melanogaster
  JOURNAL   Mol. Biol. Evol. 7, 9-28 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 5424)
  AUTHORS   Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and
            Richmond,R.C.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.C.Richmond, 19-APR-1990.
FEATURES       from  to/span     description
    pept       1173     2559     carboxylesterase 6 (Est-6) precursor (EC
                                 3.1.1.1), exon 1
               2611     2858     carboxylesterase 6 precursor, exon 2
    sigp       1173     1235     carboxylesterase 6 signal peptide
    matp       1236     2559     carboxylesterase 6
               2611     2855     carboxylesterase 6
    pept       3052     4438     carboxylesterase P (Est-P) precursor (EC
                                 3.1.1.1), exon 1
               4495     4742     carboxylesterase P precursor, exon 2
    sigp       3052     3111     carboxylesterase P signal peptide
    matp       3112     4438     carboxylesterase P
               4495     4739     carboxylesterase P
    pre-msg    1132     2867     Est-6 mRNA and intron (alt.)
    pre-msg    1132     2998     Est-6 mRNA and intron (alt.)
    IVS        2560     2610     Est-6 intron A
    IVS        4439     4494     Est-P intron A
    signal     1104     1110     TATA box
BASE COUNT     1536 a   1117 c   1235 g   1536 t
ORIGIN      Chromosome 3L 69A1-5.
        1 aagcttgcta tatatctatc tgtatatacc tatttgctta gaccgaactt aatgattttc
       61 cctttttcca aaacttacac aaatatatac ctaatacctt acactgaaaa aactgttcaa
      121 ttcactatgt acttattagt agttcttata aaccttgtta ttactaatta cgtacgaaaa
      181 cagaacggga aataaatatc taaaaattta aatagtaaac tgagctgatg tttctacaca
      241 aggatgaaca caagtatcaa agtgctcata ggtacagcat ttctagaaca caattacacc
      301 agtcttttgc tagcaataaa ttttcactca tctcccactt ttctctagaa ctatcctgaa
      361 tttatttatt tgattctatt ttgtcgcaag cacactcaat tcttgatctt tttaataata
      421 atcgtagcgg aaagctgtat ctgtacaggc aagaccaccg cccctcaaaa tgccaacgaa
      481 ctacagtcgg attcaaaagc gtgcacctgc aagtgcaagt tgtatgatat cttgcaccat
      541 tttatttcga tacacgtttt ggtacttcga tttaatatgg ctaattgtta ttcagaaaat
      601 aaacacaaaa acttaggaca catgttcaaa aatagaaata tgtacatatt tgttgcaaac
      661 cttaaatctt attaattatc tttatctata atcaaaaagc ctactcattt gttaaatggt
      721 ttcgttttaa agtgcattat caaattcatc tggataaaca ttcaaacaaa aaattgcatg
      781 catgtaaagt aattaatcaa gttgactcga agcgattgtg caatataagt ctcacctgaa
      841 gtgggattgg caactgcttg ggatgcatgt cgaaaatcta tattaagccc agtcaaatat
      901 tttaagcgta aagtaacaaa cctaatttgt atacggctat cgttttaatt cgcacacgcc
      961 tatcaactgg atgatgttca cactagagtt actcccattt gaaagccggg cattggaaaa
     1021 ctaatctcat gcgtgcccag atctcaattg agactggttg actggatgtt caggtggccg
     1081 ggtggcgata agccgatcga tggaataaaa ggggccgcaa ttgccgcatc tcaagatagt
     1141 tgcggtctga attcgccgga gtgaggagca acatgaacta cgtgggactg ggacttatca
     1201 ttgtgctgag ctgcctttgg ctcggttcga acgcgagtga tacagatgac cctctgttgg
     1261 tgcagctgcc ccagggcaag ctacgtggtc gcgataatgg aagctactac agctacgaat
     1321 cgattcccta cgccgaaccg cccactggcg atctacgatt cgaggctcca gagccgtaca
     1381 aacaaaagtg gtcggatata ttcgatgcca ccaaaacccc ggtggcgtgc ctgcagtggg
     1441 atcagttcac gcctggggcc aacaaattgg taggagagga ggattgccta accgtcagcg
     1501 tctacaagcc gaagaatagc aagaggaata gctttccggt ggtggcccac attcacggag
     1561 gtgcctttat gttcggtgca gcatggcaaa atggacacga gaacgtgatg cgtgagggca
     1621 aattcattct ggtgaagata agctatcgcc tggggccatt gggtttcgtg agcaccggcg
     1681 atagggatct tcccggaaac tatggactga aagatcaacg gctggctctc aaatggatta
     1741 agcagaatat agccagtttt ggtggagaac cgcagaacgt actgttggtt ggtcactccg
     1801 ctggaggagc ttcggtccat ctgcagatgc ttcgtgaaga tttcggccag ctggccaggg
     1861 cggcattctc gtttagtgga aatgctctag atccatgggt tatacagaag ggagcaagag
     1921 gacgagcctt tgaactggga cgcaacgtgg gatgtgaatc ggctgaagac tcgaccagcc
     1981 tgaagaaatg cctaaagtca aagccagcca gtgaattagt caccgccgtc cgtaaattcc
     2041 ttatattttc ctatgtgccc tttgctccat ttagtcctgt attggagcca tcggatgctc
     2101 cagacgccat tatcacccag gatcccaggg atgtcattaa gagcggaaag ttcggacagg
     2161 ttccgtgggc tgtttcctat gtcacagagg atggtggcta caatgccgcc ttgcttttga
     2221 aggaacggaa atctggaata gttatcgatg atctaaacga gcgttggctt gagttggcac
     2281 catatttact attctaccgg gacacgaaga ccaaaaagga tatggacgac tactcgcgga
     2341 aaattaagca ggagtatata ggcaatcaga gatttgacat cgaaagctat tcagaattgc
     2401 agcggctatt cacggatatt ctcttcaaga atagcacgca ggagtcattg gatcttcatc
     2461 gcaaatatgg aaagagtcct gcctacgctt atgtctatga caatccagcc gaaaaaggaa
     2521 tcgcacaggt cctggccaat cgaaccgatt atgattttgg taaggaaatc gtacttttaa
     2581 atggacttag ttaaatcatt acttttatag gaactgtaca cggtgacgac tactttttga
     2641 tattcgaaaa tttcgtacga gatgtggaaa tgcgtccgga tgagcagata atttcgagaa
     2701 attttatcaa tatgctggca gattttgctt cgagtgataa tggctctcta aaatatggtg
     2761 aatgcgattt caaagataat gtaggtagtg agaaattcca attattagct atttatattg
     2821 atggctgcca gaataggcag catgtggaat ttccgtaagt tacatgaata aaatcaaaaa
     2881 tttttcgttc tgtgtaattt ttaattattt aattattctc aactggcttt aaatatcatt
     2941 tgtacaaaac gtgtttgtcg ctttatattt tggtttttgt tgtttcttta taaagaataa
     3001 taaacgtttg tttacccgca gtttgattag tactttgtgg agttccagat catgagtata
     3061 ttcaaacggc tgttgtgcct gactttgctg tggatagcag ctttagaatc tgaagctgat
     3121 cccttgattg ttgagataac aaatggaaaa atccgtggca aagataatgg gttgtactac
     3181 agctacgaat cgattcccta tgccgagcat ccaactggtg ccctccgttt tgaagcacct
     3241 cagccgtata gtcatcattg gactgatgtt ttcaatgcca cgcagtctcc agttgagtgc
     3301 atgcagtgga atcagtttat aaacgaaaac aataagctga tgggtgatga ggattgctta
     3361 acggtaagca tctataagcc aaagaaaccc aatcggagca gctttcctgt cgtagtactc
     3421 ctgcatggag gtgctttcat gttcggtagt ggatccatat atggacacga ctccattatg
     3481 cgtgagggaa ctttgcttgt ggtaaaaata agctatcgtc ttggaccatt gggttttgca
     3541 agtaccggcg atagacactt gccgggaaac tatggtctaa aggatcaacg tctggcccta
     3601 caatggatca agaagaacat tgctcacttt ggtggaatgc cagataatat tgtgctcatt
     3661 ggtcactctg caggcggtgc ttcggctcat ttgcagctgt tgcacgagga tttcaaacat
     3721 ttggccaaag gagcgatttc ggtgagcggc aatgcattgg atccttgggt catacagcag
     3781 ggtggacgac gacgtgcatt tgaactgggt cgtattgtcg gttgtggaca cacaaatgtc
     3841 tccgcagaac tcaaggactg cttgaagtct aagccggcta gcgatatagt ctctgctgtc
     3901 cgaagcttcc ttgtgttttc ctatgtaccc ttcagtgctt ttggacctgt tgtggagccg
     3961 tcagatgcac cagacgcctt tctaaccgag gacccaagag cagtgattaa gagcgggaag
     4021 tttgcccaag tcccttgggc tgtgacgtac accactgagg acgggggata caacgctgct
     4081 cagctgttgg aaagaaacaa attaactggc gagagttgga ttgacctact caatgatcga
     4141 tggtttgatt gggcaccata cttgctcttc tatcgggacg ccaagaaaac catcaaagat
     4201 atggatgatc tttcatttga tctcaggcag cagtatctag cagatcggcg attcagtgtg
     4261 gaaagttatt ggaacgtgca gcgaatgttt actgatgttc ttttcaagaa tagcgtgcca
     4321 agtgcaatag atcttcaccg aaagtatggc aaaagtccgg tttattcttt tgtctacgat
     4381 aatcctaccg attccggagt gggtcaattg ctttccaatc gaacagatgt acattttggt
     4441 gggtacatac ttgcgtttca aatacttaaa gctctagctt tcttattatt aaaggtactg
     4501 tccacggaga tgactttttc ttgattttca atacagctgc ataccgtatc ggcattcgtc
     4561 cggatgaaga agttatttca aaaaagttta taggtatgct ggaggatttc gcactcaacg
     4621 ataagggaac attaacattt ggagaatgta atttccaaaa taatgtgaac agcaaggaat
     4681 atcaagtgct gcgtatttca cgaaacgctt gtaaaaacga ggaatatgct cggtttccct
     4741 aaggaataaa tgcatttaaa aaatgtaaac agtataataa tgtaattaaa gaacaaattt
     4801 attacacctt tatgccgcct caactgtcag aggtttttca ataagttcgt cttcttcttc
     4861 tgggtcgttt tcctcgccct ccacattccc atacatctca atcttgctgt cgaggaagcg
     4921 aaaatccttg aggatatgca acttgttgtg aaaccagcaa tggaaatgaa tttaatggcg
     4981 atgtttgcca aacagcagag tagcaggcag tgtgctccac gggaatattg gcttgcagta
     5041 ctccatttag agtgtaggat tgcacatgat tgtgctcacc accgattaag acacaatcgt
     5101 ccacaaagtc acatagatgc acgcgtccgg ggaagctaaa gacacatgtg ctttccatgg
     5161 aacgcaggtg gaaaattgaa gctttgggtc caccaccgca gagcagccaa tcctcgttga
     5221 cagccactgc tcctatccat ttgccccaat ccggacgtaa cagatttggg ttcttgtaag
     5281 gctccagcat tgaagtgtgc tgctgctgct tggtgctcca cactcgcaca gtgccatcct
     5341 cggcgccaga gaagatctgt ccgttggcat tgcccaccac actgtgcacg tagtctgtgt
     5401 gtccgcggta ctcgcgctga attc
//
LOCUS       ECOLASNII    1530 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  E.coli L-asparaginase II (ansB) gene, complete cds.
ACCESSION   M34277
KEYWORDS    L-asparaginase II.
SOURCE      E.coli (strain K-12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1530)
  AUTHORS   Jennings,M.P. and Beacham,I.R.
  TITLE     Analysis of the Escherichia coli gene encoding L-asparaginase II,
            ansB, and its regulation by cyclic AMP receptor and FNR proteins
  JOURNAL   J. Bacteriol. 172, 1491-1498 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        385     1431     L-asparaginase II (ansB) precursor
    sigp        385      450     L-asparaginase II signal peptide
    matp        451     1428     L-asparaginase II
    mRNA        323  >  1431     ansB mRNA
BASE COUNT      409 a    381 c    382 g    358 t
ORIGIN      
        1 aaatgggcgc gaaagcggtg ctgaaaagcg gcggtaaccc attacagaat gtgctgggaa
       61 gcctgggaag cctggggggg ctgcaatcct caatccaaac cgagtggaaa aagcaggaaa
      121 aagatttcca gcagtttggc aaagatgttt gtagccgcgt tgtgactctg gaagatagcc
      181 gcaaagccct ggtcgggaat ttaaaataat cctctatttt aagacggcat aatacttttt
      241 tatgccgttt aattcttcgt tttgttacct gcctctaact ttgtagatct ccaaaatata
      301 ttcacgttgt aaattgttta acgtcaaatt tcccatacag agctaaggga taatgcgtag
      361 cgttcacgta actggaggaa tgaaatggag tttttcaaaa agacggcact tgccgcactg
      421 gttatgggtt ttagtggtgc agcattggca ttacccaata tcaccatttt agcaaccggc
      481 gggaccattg ccggtggtgg tgactccgca accaaatcta actacacagt gggtaaagtt
      541 ggcgtagaaa atctggttaa tgcggtgccg caactaaaag acattgcgaa cgttaaaggc
      601 gagcaggtag tgaatatcgg ctcccaggac atgaacgata atgtctggct gacactggcg
      661 aaaaaaatta acaccgactg cgataagacc gacggcttcg tcattaccca cggtaccgac
      721 acgatggaag aaactgctta cttcctcgac ctgacggtga aatgcgacaa accggtggtg
      781 atggtcggcg caatgcgtcc gtccacgtct atgagcgcag acggtccatt caacctgtat
      841 aacgcggtag tgaccgcagc tgataaagcc tccgccaacc gtggcgtgct ggtagtgatg
      901 aatgacaccg tgcttgatgg ccgtgacgtc accaaaacca acaccaccga cgtagcgacc
      961 ttcaagtctg ttaactacgg tcctctgggt tacattcaca acggtaagat tgactaccag
     1021 cgtaccccgg cacgtaagca taccagcgac acgccattcg atgtctctaa gctgaatgaa
     1081 ctgccgaaag tcggcattgt ttataactac gctaacgcat ccgatcttcc ggctaaagca
     1141 ctggtagatg cgggctatga tggcatcgtt agcgctggtg tgggtaacgg caacctgtat
     1201 aaatctgtgt tcgacacgct ggcgaccgcc gcgaaaaccg gtactgcagt cgtgcgttct
     1261 tcccgcgtac cgacgggcgc taccactcag gatgccgaag tggatgatgc gaaatacggc
     1321 ttcgtcgcct ctggcacgct gaacccgcaa aaagcgcgcg ttctgctgca actggctctg
     1381 acgcaaacca aagatccgca gcagatccag cagatcttca atcagtacta atcgcctcgc
     1441 cccggtatcg tgccggggct ttttcacttc agactcacgt ccattgccaa ttttaattac
     1501 cctaatgata atcaccggaa taaattattt
//
LOCUS       HUMPADPRP    2085 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human poly(ADP-ribose) polymerase (PADPRP) gene, exon 1.
ACCESSION   M34337
KEYWORDS    poly(ADP-ribose) polymerase.
SOURCE      Human peripheral blood leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2085)
  AUTHORS   Ogura,T., Nyunoya,H., Takahashi-Masutani,M., Miwa,M.,
            Sugimura,T. and Esumi,H.
  TITLE     Characterization of a putative promoter region of the human poly
            (ADP-ribose) polymerase gene: Structural similarity to that of the
            DNA polymerase beta gene
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 701-710 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1943  /  2062     poly(ADP-ribose) polymerase
                                 /hgml_locus_uid="LZ0154E" /nomgen="PPOL"
                                 /map="1q41-q42"
    pre-msg    1777  >  2085     PPOL mRNA and introns
    IVS        2063  >  2085     PPOL intron A
BASE COUNT      495 a    535 c    550 g    505 t
ORIGIN      
        1 tttagggatg atatagttgt caacccagag atggcatgat catgcctttt gacttggtca
       61 ttctctaagt aaaactttta tttgttccat catattttcc acttattctg tttaccttca
      121 aaatatcttt tttttttttt tttgagacag ggtcacactg tcacccaggc tagagtccag
      181 tggcactatc atggctcacc acagcctcaa ccttcagggc tcaggtgatc ctcccacttc
      241 agcctcccga gtagatggga ctacaggcac ctgccaccac ccccagctaa tttttgtaga
      301 gacaaggttt tgccatgttg tccaggctgg tcttgaactc ctgggctcaa gggatccggc
      361 cacctcagcc tcccaaagtg ctaggattat aggcatgagc cactgtgccc agcctacctt
      421 caacgtatct aactggttac taacttttag gattcggcct atgtctcaca accttcttgc
      481 ttactcaaca tccttgtctc ttaagccact agcttcttct ctatggttaa cactttttat
      541 gagttttatt catctgctta tttttcttat cctctatacc agaattgaat attttcaaat
      601 aaagcacact catgttacaa tctttgaaat ggaaaaaaaa aatgcatagg attagaaaag
      661 aaaccaattt taataaacta tattttgaag tatagttcta tattaaacaa caagatctag
      721 gccaggtgca gtggctcatg cctgtaatcc cagcaatttg ggaagtcgag gtgggaggat
      781 tgcttgaggc caggggttca agaccagcct gggcaacatg gagagattcc ccatctcttt
      841 ctttacacac acacacacac acacacacaa aatatctgat agcaacaggt gcagtcatta
      901 ccacaatttc gagtagtgat gagcttaata atatttcgag ttatcaccaa caactgtaaa
      961 ctaacatgaa aacgtctgtg atgactattg cccacaaagt cacaggtact gctaatactc
     1021 ctggtatttg tagtaaattc ataataaagg aaatgctagg tttcagttgg tattttgtcc
     1081 cgacggtctg tggacggcag gttagaacgc ccgtccaagc caggagggtg gacctagcac
     1141 tgcagggtcc acctcgggcc aatcaactat attcccgagg cgggggcctg cgcttcccgg
     1201 acccagctgc cctcagggga gagaggacac acttaagagt ttggggccgg cgtggtagct
     1261 catgcccctg atcccagcac ttcgggaggc tgaggcgtga agatcacttg tagcaggagt
     1321 ttgagaccag tctagccaac ttggcgagac cctgtcccta aaaaaaattt ttttttaatt
     1381 agccagttgt ggtgagcgcc tgtagtccca gctactcggg aggctgaggt gggaggatcg
     1441 ctgggctcag gagttccaga ctgcagtgag ccatgatggc ggcactgcac tccagcgcgg
     1501 tgagactcag tctcaaaaat aaaaggggga ggggttgggg gtaaaattag ttgtgaaatc
     1561 aagtaagact tcctgggaca gaacaatcaa aggggtggcg ccgggtcctc caaagagcta
     1621 ctagctcagc ccaagccccg cctcggcccc cagggcagcg gccgcagagc tccacccggc
     1681 aggcgcccgg gaaactccgc cccccggccg gcagggcgcg cgcccgccgg ccccgccccg
     1741 tggacgcggg ttccgtggcg ttcccgcggc caggcatcag caatctatca gggaacggcg
     1801 gtggccggtg cggcgtgttc ggtgcgctct ggccgctcag ccgtggcggc tgggtgagcg
     1861 cacgcgaggc ggcgaggcgg caagcgtgtg tttctaggtc gtggcgtcgg gcttccggag
     1921 ctttggcggc agctagggga ggatggcgga gtcttcggat aagctctatc gagtcgagta
     1981 cgccaagagc gggcgcgcct cttgcaagaa atgcagcgag agcatcccca aggactcgct
     2041 ccggatggcc atcatggtgc aggtgcgggc cgctgtgcgg cgggg
//
LOCUS       HUMREVERB1    295 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human Rev-ErbA-alpha mRNA, 5' end.
ACCESSION   M34339
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Human fetal skeletal muscle, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 295)
  AUTHORS   Lazar,M.A., Jones,K.E. and Chin,W.W.
  TITLE     Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription
            from the noncoding DNA strand of a thyroid hormone receptor gene
            results in a related protein that does not bind thyroid hormone
  JOURNAL   DNA 9, 77-83 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        204  >   295     Rev-ErbA-alpha protein (put.)
BASE COUNT       58 a    107 c     74 g     56 t
ORIGIN      
        1 ccgttgcctc aacgtccaac ccttcctgca gggctgcagt ccggccaccc caagcccttg
       61 ctgcagggtg cttcggatcc tgatcgtgag tcgcggggtc cactccccgc ccttagccag
      121 tgcccagggg gcaacagcgg cgatcgcaac ctctagtttg agtcaaggtc cagtttgaat
      181 gaccgctctc agctggtgaa gacatgacca ccctggactc caacaacaac acaggtggcg
      241 tcatcaccta cattggctcc agtggctcct ccccaagccg caccagccct gaatc
//
LOCUS       HUMREVERB2    285 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human Rev-ErbA-alpha mRNA, partial cds.
ACCESSION   M34340
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Human fetal skeletal muscle, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 285)
  AUTHORS   Lazar,M.A., Jones,K.E. and Chin,W.W.
  TITLE     Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription
            from the noncoding DNA strand of a thyroid hormone receptor gene
            results in a related protein that does not bind thyroid hormone
  JOURNAL   DNA 9, 77-83 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   285     Rev-ErbA-alpha protein (put.) (AA at 1)
BASE COUNT       62 a     79 c     99 g     45 t
ORIGIN      About 1.7 kb after segment 1.
        1 ctcgtaaagg agagagaagt gcagagttcg attctgtaca agggggcagc ggcagaaggc
       61 cggccgggcg ggtcactggg cgtccacccg gaaggacagc agcttctcgg aatgcatgtt
      121 gttcagggtc cgcaggtccg gcagcttgag cagcagcttg gtgaagcggg aagtctccaa
      181 gggccggttc ttcagcacca gagcccgaag agcccgcagc agcgtctcct ggagctgctc
      241 caccgaagcg gaattctcca tgcccgagcg gtctgcagag acaag
//
LOCUS       HUMTFEB      1526 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human TFEB protein mRNA, partial cds.
ACCESSION   M33782
KEYWORDS    .
SOURCE      Human BJAB B cell line, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1526)
  AUTHORS   Carr,C.S. and Sharp,P.A.
  TITLE     A helix-loop-helix protein related to the immunoglobulin E
            box-binding proteins
  JOURNAL   Mol. Cell. Biol. 10, 4384-4388 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.S.Carr, 19-APR-1990.
FEATURES       from  to/span     description
    pept    <     1  >  1526     TFEB protein (AA at 1)
BASE COUNT      324 a    495 c    493 g    214 t
ORIGIN      
        1 ggtggagtga cagccggagc ccgggcgccg gcgcggggct cggtgacagc ggaggcggcg
       61 gcccgggcgg gacgagggag cggccgcggc gccgacggcg ggcgggaagg gcggacgggg
      121 cgggcctccc tggtggcacg gggccggagc aggccgggag caggtggcgg gcacagcttg
      181 cgggccaggc acccgaactt gcgacaagtt gccggagccg gggcgcgggc ggcggacaga
      241 ttgaccttca gagcgaggga gccagcgccg gcagccacca tggcgtcacg catagggttg
      301 cgcatgcagc tcatgcggga gcaggcgcag caggaggagc agcgggagcg catgcagcaa
      361 caggctgtca tgcattacat gcagcagcag cagcagcagc aacagcagca gctcggaggg
      421 ccgcccaccc cggccatcaa tacccccgtc cacttccagt cgccaccacc tgtgcctggg
      481 gaggtgttga aggtgcagtc ctacctggag aatcccacat cctaccatct gcagcagtcg
      541 cagcatcaga aggtgcggga gtacctgtcc gagacctatg ggaacaagtt tgctgctgcc
      601 cacatcagcc cagcccaggc tctccgaaac ccccaccagc cgcctcccca ggggtgcgag
      661 ctggacacgt gctgtcctcc tccgctggca acagtgctcc caataagccc catggccatg
      721 ctgcacattg gctccaaccc tgagagggag ttggatgatg tcattgacaa cattatgcgt
      781 ctgacgatgt ctcttggcta catcaatcct gaaatgcaga tgcccaacac gctacccctg
      841 tccagcagcc acctgaatgt gtacagcagc gacccccagg tcacagcctc cctggtgggc
      901 gtcaccagca gctcctgccc tgcggacctg acccagaagc gagagctcac agatgctgag
      961 agcagggccc tggccaagga gcggcagaag aaagacaatc acaacttaat tgaaaggaga
     1021 cgaaggttca acatcaatga ccgcatcaag gagttgggaa tgctgatccc caaggccaat
     1081 gacctggacg tgcgctggaa caagggcacc atcctcaagg cctctgtgga ttacatccgg
     1141 aggatgcaga aggacctgca aaagtccagg gagctggaga accactctcg ccgcctggag
     1201 atgaccaaca agcagctctg gctccgtatc caggagctgg agatgcaggc tcgagtgcac
     1261 ggcctcccta ccacctcccc gtccggcatg aacatggctg agctggccca gcaggtggtg
     1321 aagcaggagc tgcctagcga agagggccca ggggaggccc tgatgctggg ggctgaggtc
     1381 cctgaccctg agccactgcc agctctgccc ccgcaagccc cgctgcccct gcccacccag
     1441 ccaccgtccc cattccatca cctggacttc agccacagcc tgagctttgg gggcagggag
     1501 gacgagggtc ccccgggcta ccccga
//
LOCUS       MUSFCERII    2237 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Mouse low affinity IgE receptor (Fc-epsilon-RII) mRNA, complete
            cds.
ACCESSION   M34163
KEYWORDS    IgE receptor.
SOURCE      Mouse (strain Balb/c) splenic B cell, cDNA to mRNA, clone p23.18.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2237)
  AUTHORS   Gollnick,S.O., Trounstine,M.L., Yamashita,L.C., Kehry,M.R. and
            Moore,K.W.
  TITLE     Isolation, characterization, and expression of cDNA clones encoding
            the mouse Fc receptor for IgE (Fc-epsilon-RII)
  JOURNAL   J. Immunol. 144, 1974-1982 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        175     1170     low affinity IgE receptor (Fc-epsilon-RII)
    mRNA    <     1     2237     Fc-epsilon-RII mRNA
BASE COUNT      592 a    622 c    533 g    490 t
ORIGIN      
        1 ggagaagact actgtcttca acacactagc ctgagctacc ttatccaagt gctccacata
       61 ttccagaagg agaaggacag acttcaagtt caaatcactt ccagagctgt gagtgacaag
      121 tgccttggca ggtagtgcac gcctcatcac tgaaaggatc caaacaagac tgccatggaa
      181 gaaaatgaat actcaggata ctgggaacct cctagaaagc gttgctgctg tgcaagacgt
      241 gggacacagc tcatgttggt ggggctgctg agcacagcaa tgtgggctgg cctgctggcc
      301 ctgcttcttc tgtggcactg ggaaacggag aagaatctaa aacagctggg agacactgca
      361 attcagaatg tctctcatgt taccaaggac ttacaaaaat tccagagtaa tcaattggcc
      421 cagaagtccc aggttgttca gatgtcacaa aacttgcaag aactccaagc tgaacagaag
      481 caaatgaaag ctcaggactc tcggctctcc cagaacctga ccggactcca ggaggatcta
      541 aggaacgccc aatcccagaa ctcaaaactc tcccagaacc tgaacagact ccaagacgat
      601 ctagtcaaca tcaaatccct gggcttgaat gagaagcgca cagcctccga ttctctagag
      661 aaactccagg aagaggtggc aaagctgtgg atagagatac tgatttcaaa gggaactgca
      721 tgcaacatat gtcccaagaa ctggctccat ttccaacaga agtgctacta ttttggcaag
      781 ggctccaagc agtggatcca ggccaggttc gcctgcagtg acctgcaagg gcgactagtc
      841 agcatccaca gccaaaagga acaggacttc ctgatgcaac acatcaacaa gaaggattcc
      901 tggattggcc tccaggatct caatatggag ggagagtttg tatggtcgga cgggagccct
      961 gtgggttata gcaactggaa tccaggggag cccaataacg ggggccaggg tgaggactgt
     1021 gtgatgatgc ggggatccgg ccagtggaac gacgccttct gccgcagcta cttggatgca
     1081 tgggtgtgtg agcagctggc aacatgtgag atatctgccc ccttagcctc tgtgactcca
     1141 acaaggccca ccccaaaaag tgaaccctga caaacttctg ctcacactct tctggatttc
     1201 tcctctacct ttatcgtgga aacagctggg ccctgaggat acccctatca gggcccaggg
     1261 ctctctgtga ccgaaggctt tgattatgtt cccacccata ctgaagcagc tggtggatgc
     1321 cagctcctgc cagctaccca gaaaccctct ccagctctcc agctaagctg gccatcccat
     1381 tccatctgcc ttcctcaaac ctgggcccca gccttgctag ctccctgact acgggcatgc
     1441 atgtgggcag ctgagccaac cagggagctg ctgagaacaa agatttcgaa ggcttctttt
     1501 gcagtcccca cctcctatca agttccccac tttctccccc tcggcatcag agaacagggg
     1561 ttccctttcc ccaggatctg ggatgagtcc tcccatcaag tttgcatcag tggtcccagg
     1621 actccgaccc tccttggagg ctaccaggtg tgctcctggt gcgggaggta ttgaaggaac
     1681 tctaaacagc tccagcaagg cgagcctggc tctgtctggt aggcctggcc cttctctccc
     1741 attccttcta ccttactaaa agctgttaga gaacagtcct aaagctagcc cccaaggtct
     1801 attcccttat ttggccactt cctcctcctg aggctgacta caaggtccag ctatccaagt
     1861 actgaagtct aacatcaaaa gccccctttg tctcacctaa gtagcaatgc ccaatcaaaa
     1921 tacaccatca catcatagcc cagtctaaca gaccgccctt tttctcttca taaaattaca
     1981 cctgcaacca ggcgtagtgg tgcaggcctt tagtcccagc acttgggagg cagagacaag
     2041 cgaatttctg agttcgaggc cagcctggtc tacaaagtga gttccaggac agccagggct
     2101 acacagagaa accctgtctc gaagaaagaa aaaaaaaaaa aattacacct gcgaggtcac
     2161 ttgggctgct gtttttctgc ctgagtcaga gggcagccac ttaacttttc ttccctgctt
     2221 aataaaggat ctctgtg
//
LOCUS       RATATM1      1585 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Rat alpha-tropomyosin (alpha-TM) gene, exon 1b.
ACCESSION   M34137
KEYWORDS    alpha-tropomyosin.
SEGMENT     1 of 2
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1585)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
COMMENT     [1] reports isolation of exon 1b which is the first exon coding for
            the alternatively expressed low-M-r alpha-tropomyosins.
FEATURES       from  to/span     description
    pept       1275  /  1406     low-M-r alpha-tropomyosin
BASE COUNT      360 a    408 c    449 g    368 t
ORIGIN      
        1 agaggtgttc cagctttgga agataagtga ggcaggctgt tttcttttct acctttgttt
       61 ccattctcca cctcccacat ttaagcaagt tcaggactga ctctttgaca gaggtgtagg
      121 attaaaaagc tcggagactt gacaggcttt aaaaaaaaca tagcggaaga aggaatgaca
      181 gaggcgggtt cttacagact gtgtccatgt ggctttccag ctggtaacca gaggttctgt
      241 cacaggtgtt tgtggtcttg cagacccggg ttcctcagat tccagatatc aggctgaatt
      301 gcctgaagtt gcacagttcg ttatggggta cagacaaaat cttatggtag cgtttaatta
      361 catagtttga gttcttaaat aagtgaggca tatgtaaatg ttttatatca tctgataact
      421 tcaacagcct tgtgaagtgt cctgtactat tatactcgtt tttaagagag gaaatccctg
      481 agaagtcatt tgtccaaagg cacacagcca gcagagactt aagagccaga ttgaaacctt
      541 aaaaagatgt atttagaaca ggagatttaa ccactacgct ttgccagcta gtagtttaca
      601 cttggtgaga tagtttgggt gttgggattc ttgtagtgag gatcttgtgt tctgtgttct
      661 tcacaaggat ctttaggata ttcctatttc aggctccgag ttccttgttc agactctggg
      721 aaatcgggag ttgccaatgc ctttagtatt cggaatgatg gcgttcttta caactcaaag
      781 agtttaggga acgttcaagg tctaggttcc tacccttact cagcttcagg ggaatctcga
      841 tttcagagcg ctccaaatgg ccagcaggag tcgccatagg ctttagcatc cagttcctct
      901 acttggagtc acccaggctc cgcggaatgg atgcaaacgc cctcctgcgg tagcctcgct
      961 ctaggcaaag cagcccagag ccagccattc ccgcgcgcac gcgcagtggt tccaagggta
     1021 gccagttcct cgggtccacc ggccccgccc gcctgcccgc cgcgcccatc ccctgcgttc
     1081 gcgcacccgc ccgcggctct ctagtgcggc cctggggccg tgacttccgg gctgcttctt
     1141 agccgcaggg ggcgccggcg ttgcaaagcg ggcctgggcg ggacgaaccg gacctgggct
     1201 ccgtaactgc aggagcccag cagtagcagc tgcagagcgc ggaggcagag cacagcctca
     1261 gccgccgcct cgccatggcg ggtagcagct cgctggaggc ggtgcggaga aagatccgga
     1321 gcctgcagga acaggcagat gccgcggagg agcgtgcggg cagcctgcag cgagagctgg
     1381 accaggagcg gaagctgcgg gaaaccgtaa gggatccacc catcacccga ggcccccacg
     1441 ccgcacctgg cgcacctagg ccaactggca cccggctact cagcctgccg gtctatctcc
     1501 tcggtctaaa gtaaaacctc caagggggag caagagtgct tgctgttgag aaggttctgg
     1561 gggcgagcat gaaatgggtc ccggg
//
LOCUS       RATATM2       462 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Rat alpha-tropomyosin (alpha-TM) gene, exon 9c.
ACCESSION   M34138
KEYWORDS    alpha-tropomyosin.
SEGMENT     2 of 2
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 462)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
COMMENT     [1] reports isolation of exon 9c which is the final exon coding for
            the alternatively expressed brain alpha-tropomyosins TMBr-1 and
            TMBr-3.
FEATURES       from  to/span     description
    pept    /   375      448     alpha-tropomyosin
BASE COUNT       97 a    141 c    100 g    124 t
ORIGIN      Unknown number of bp after segment 1.
        1 ctcgcctaga atctctgctg agctcccaca aagcatgagt gtctggggct gccctccagc
       61 ctggccttag tccccaaatc tgttgtccta aactacacat aagtgagtca ctgggtggct
      121 ttaatgaggt gcagaaggag cattggttct gcacctcttc gcctctgctc tacaccaagg
      181 aaagattctg gcccctcctt tattctgcca gacctccatg cgcctcaagg gcacagggca
      241 gagggcctgg ggtcagggta ccttgccgct gtcctttgta ttgtgttctc agtctgttaa
      301 ccgtttctca ctgactcttg ctaacctctt ccttctgcct cttcttttct gctaaccctt
      361 gctgacctgc ccagatcaac tctaccatca actcgagcaa aaccgccgtc taactaatga
      421 actaaagctg gccctgaatg aggattaaaa ccctgggcca ag
//
LOCUS       RATPLPA1     1658 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, exons 1 and 2.
ACCESSION   M34108
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     1 of 5
SOURCE      Rat liver DNA, clone lambda-rPLPg10-2.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1658)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       1477  +  1577     parathyroid hormone-like peptide (PLP)
                                 precursor, exon 2 (first expressed exon)
    sigp       1477  +  1577     parathyroid hormone-like signal peptide
    pre-msg     951  >  1658     PLP mRNA and introns
    IVS        1259     1452     PLP intron A
    IVS        1578  >  1658     PLP intron B
BASE COUNT      325 a    507 c    466 g    360 t
ORIGIN      
        1 gggaaccagt tacacacaca cacacacaca cacacacaca cacacacaca cacacacact
       61 ggtaagaacc agtagcccac aacattctcc ccctctcccc acccacccac ccagcagcgc
      121 tgagaagctg aaacgactgg cataaggtta ctcagtgaat ctatagcagg acatagagta
      181 ggttatcaga ggaagagtgg ctaagttgga ttctgtgtgc tccaggttct ctgagatcac
      241 aaagtgggtc ttcgatcctt gttcacacac cctttctagg atctgctcca gatgtgatgt
      301 ccccagctgt tctcaaacag ttaagaccaa aaatcgcctt tgctctcctg ggtgctaccc
      361 ttagtcccct ccccagggga aggccatgcc aggcaggaag gctggcttgc aacagtgcga
      421 gcttgcaaac gagcttggga ctcttgggat tcgcaggcca cccggagttc ctccttgccg
      481 ccggtgggga ccagggctcc tggacctggg ctcctggact caggcggcag caggctggtt
      541 caggaggagg agagcgtcag atcggcccac gcggcgactg gggtggggct ccgtggctcc
      601 cagctcccct gtgctccccg cccccgccgg cccgctcacc ccagtctgct ccgccggcgc
      661 gcgttcctcg ggcgccacct ctttgcgact cgctcacttc tcagcaggtt ggccacagag
      721 cgtgtgaacc ttccaaggct gggctcacag ctcctctcca acctgcgccg cccagctggg
      781 cgtccgcccc gccgccggaa ccgcctcgcg ccacctggcc ctcgcatcca cgacacgcgc
      841 gcctgcaact tgttcaaggg cgttgtggaa tcaactttcc ggaagcaacc agcccaccgg
      901 aggaggtaga ccgacagcta tgtatatata cgctctctgc cgcaagtggc tttgcaccca
      961 gaggcctggt gcgcaggaca gctgactcct gcgggcacac tcgcgtttga agaggggttc
     1021 gccctgcccc gcgacccaga gcgctgccgc caagactaat tagacattgc tatgggagcc
     1081 acagcaacgc gtcacgcatc cccgacgcct atgtaaaacg gttcggtctt cgctctttca
     1141 gaggaaactc tctgattgct ttttcccttc tcgggtccct tttttttttt tttttttttt
     1201 ttttttttgc ctgtgcggtt tgagagaggc acagttggag gcgctgattc ctacaaaagt
     1261 aagtgctcag gggctcggaa acactttttt gctttctttt tcttgttttc ttttttcctt
     1321 ataacttgtg agatgccctc gacattggag tggcttgttg ggaagccacg gggttccaaa
     1381 agagggagag gggcagtgag aggaagagat agatggagag ccaggactga ccctcgggtt
     1441 ccactcttgc aggtcccaga gccagcgagc ggcacgatgc tgcggaggct ggttcagcag
     1501 tggagcgtcc tggtgttcct gctcagctac tccgtgccct cccgcgggcg ctcggtggag
     1561 gggcttggtc gcaggctgta agtaccggat cttcctcccg ggtgagggat ggaggagacc
     1621 agcaccagag gtgccgaact gaggcgggca atgcggaa
//
LOCUS       RATPLPA2      144 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, intron B
            (partial).
ACCESSION   M34109
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     2 of 5
SOURCE      Rat liver DNA, clones lambda-rPLPg[10-2,9-0].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 144)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    IVS     <     1  >   144     parathyroid hormone-like peptide intron B
BASE COUNT       42 a     26 c     21 g     55 t
ORIGIN      3 kb after segment 1.
        1 gcagaacact atgttttgac tgaagtccaa gttgcctctg ataaactttt gcccaatttt
       61 tcttttaatt tttttacagt aaccaaagac aaggataaag tataaatcca cttatttttc
      121 catgctggca ctttggtttg taat
//
LOCUS       RATPLPA3      145 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, intron B
            (partial).
ACCESSION   M34110
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     3 of 5
SOURCE      Rat liver DNA, clones lambda-rPLPg[10-2,9-0].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 145)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    IVS     <     1  >   145     parathyroid hormone-like peptide intron B
BASE COUNT       22 a     34 c     34 g     55 t
ORIGIN      640 bp after segment 2.
        1 agtctccata gggtcaatcc actgtcatgt gcttcttatg ctgtgctgtt tcttgtcccc
       61 ttgctgtgac actcacttct gcctttcttg gacttgagaa cacgttgggg acttttgaga
      121 atcggagtcc tgttttgtgg acttt
//
LOCUS       RATPLPA4      920 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, exon 3.
ACCESSION   M34111
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     4 of 5
SOURCE      Rat liver DNA, clone lambda-rPLPg9-0.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 920)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    94  +   516     parathyroid hormone-like peptide (PLP)
                                 precursor, exon 3
    sigp    +    94      100     parathyroid hormone-like signal peptide
    matp        101  +   516     parathyroid hormone-like peptide
    IVS     <     1       93     PLP intron B
    IVS         517  >   920     PLP intron C
BASE COUNT      198 a    353 c    166 g    203 t
ORIGIN      1.3 kb after segment 3.
        1 tccctactct agaggtcaac tgttcaccca tctcattcct tcaagtccaa cacgggcagg
       61 tcctcagctg acaactgtct tctcccctga cagaaaacgc gcggtgtctg agcaccagct
      121 actgcatgac aagggcaagt ccatccaaga cttgcgccgc cgtttcttcc tccaccatct
      181 gattgcggag atccacacag ctgaaatcag agctacctcg gaggtgtccc ctaactccaa
      241 acctgctccc aacaccaaaa accaccctgt gcggtttggg tcagacgacg agggcagata
      301 cctaactcag gaaaccaaca aggtggagac gtacaaagag cagccactca agacgcccgg
      361 gaagaagaag aaaggcaagc cggggaaacg cagagaacag gagaaaaaaa agcgaaggac
      421 tcggtctgcc tggccaggca caactgggag tggcctgctt gaggaccccc agccccacac
      481 ctccccgacc tcgacctccc tggagcccag ctcaaggtat tgtctgtccg ctgtgctcag
      541 tagggctccc ctccctccac cggcctgggt cccagagatg cagaatcctc tgtatcttga
      601 tctgagacta catcagttag gccacaatct cagtccattc ctggttagcc tgctagacaa
      661 tttaatttcc ctacttccct cccaccagcc caatcaacca tccttcctcc ctccctccct
      721 ccttccctcc ctcactccct ccctccctcc atccctccct ccctaccatc ctccctccct
      781 tccttctccc tttcctttcc atcccctccc ctcttctccc tttcctttcc atcccctccc
      841 ctcttctcct cccctctctc tgattcctgt cttcctcact cctccccaca catagggtct
      901 tcctcaagcc aaggctgact
//
LOCUS       RATPLPA5      855 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Rat parathyroid hormone-like peptide (PLP) gene, exon 4.
ACCESSION   M34112
KEYWORDS    parathyroid hormone-like peptide.
SEGMENT     5 of 5
SOURCE      Rat liver DNA, clone lambda-rPLPg9-0.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 855)
  AUTHORS   Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D.
  TITLE     Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence
            of the rat gene and comparison with the human homologue
  JOURNAL   Mol. Endocrinol. 4, 441-446 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    21       30     parathyroid hormone-like peptide (PLP)
                                 precursor, exon 4
    matp    +    21       27     parathyroid hormone-like peptide
    pre-msg <     1      449     PLP mRNA and introns
    IVS     <     1       20     PLP intron C
BASE COUNT      280 a    139 c    157 g    279 t
ORIGIN      3.6 kb after segment 4.
        1 ccccttcctt gtcttcacag gacgcattga aactttcacc gaagatcttc caagacacgt
       61 tacaggattc tgtaatagta aacatatgga aagtattaga catatttatt gcctgtacat
      121 actgtaaatg cattgggatc aaactgtctc cccaggaaac tgcacatggg tcatgttacc
      181 ataatttatt ttgtcaactg atgtatttat ttgtaaatgt atcttggtgc tgctgactct
      241 gtttttttgt aacataatgc actttaggta tacatatcaa gtatgtcgat gaatttaaca
      301 cgaaaaagtg tctctatttt gtggttcatt ttgatgagtt ctgaaatata attatctaga
      361 ctgatctccc tctgcgcatg taaaaatggc actattttaa atttgtaaag aatgtctaat
      421 aaaatataat ctaattacac cgtgacttac aatgttggtt ttaattcttt aaggtttcta
      481 atcagagaaa cacagtacaa ttttttttta aattctactt gaataaaatc ctaagcttga
      541 tgtgtttgtg tggggatcta ctccaatgtg tccgcattag cttttagtta aaaaccactt
      601 aaaatgtatt ttgccaatat ttcagctcaa gtcaatagaa tggctgaaca gcctgaactt
      661 caagttgaaa agaaagacaa gaaagggcaa aaaataagca gagaagtcag gagaaagggt
      721 caaagagggt tgtccagagt taaaaccaga gaagtgggtc cagacttgaa gactctgaat
      781 aaattttgcc aatgtaaagt ttagtttgca agggcttgct ctcgattcat aacaataata
      841 aaagatctgc tcggg
//
LOCUS       RATTMBR1A    1050 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Rat brain alpha-tropomyosin (TMBr-1) mRNA, complete cds.
ACCESSION   M34135
KEYWORDS    alpha-tropomyosin.
SOURCE      Rat brain, cDNA to mRNA, clone pOk4.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1050)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         41      886     brain alpha-tropomyosin (TMBr-1)
    mRNA    <     1     1050     TMBr-1
BASE COUNT      336 a    220 c    306 g    188 t
ORIGIN      
        1 ctctggctcc tctgcctaag ggcccctcgc caccgccacc atggacgcca tcaagaagaa
       61 gatgcagatg ctgaagctcg acaaagagaa cgccttggat cgagcagagc aggcggaggc
      121 tgacaagaag gctgcggaag accggagcaa gcagctggaa gatgagctgg tgtcactgca
      181 aaagaaactc aaggccactg aagatgaact ggacaaatac tccgaggctc tcaaagatgc
      241 ccaggagaaa ctggagctgg cggagaaaaa ggccacagat gctgaagctg acgtagcatc
      301 tctgaacaga cgcatccagc tggttgagga ggagttggat cgcgctcagg agcgtctggc
      361 cacagctcta cagaagctgg aggaggctga gaaggctgca gatgagagtg agagaggcat
      421 gaaagtcatt gaaagccgag cccaaaaaga tgaagaaaag atggagattc aggagatcca
      481 gctgaaagag gccaagcaca ttgctgaaga tgctgaccga aagtatgaag aggtggcccg
      541 taagctggtc atcatcgaga gcgatctgga gcgtgcggag gagagggctg agctctcgga
      601 aggcaaatgt gccgagcttg aagaagagtt gaaaacggtg acgaacaact tgaagtcact
      661 ggaggctcag gctgagaagt actctcagaa agaagacaag tatgaagagg agatcaaggt
      721 tctctctgac aagctgaagg aggctgagac ccgggctgag tttgcagaga gatcagtaac
      781 caaattggag aaaagcattg atgacttaga agatcaactc taccatcaac tcgagcaaaa
      841 ccgccgtcta actaatgaac taaagctggc cctgaatgag gattaaaacc ctgggccaag
      901 ttctaggaac ggagtccatg ttcagaaaat ctatgactct cggggttaaa aaatgataac
      961 ttctagagaa atgggaagga aatgcattgg ctggaaggcc agccttgcct atgttttcct
     1021 ctgtatatct taattaaatc ttcaataacc
//
LOCUS       RATTMBR2A    1004 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Rat brain alpha-tropomyosin (TMBr-2) mRNA, complete cds.
ACCESSION   M34134
KEYWORDS    alpha-tropomyosin.
SOURCE      Rat brain, cDNA to mRNA, clone pOk15.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1004)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        137      892     brain alpha-tropomyosin (TMBr-2)
    mRNA    <     1     1004     TMBr-2
BASE COUNT      265 a    242 c    321 g    176 t
ORIGIN      
        1 cttagccgca gggggcgccg gcgttgcaaa gcgggcctgg gcgggacgaa ccggacctgg
       61 ctccgtaact gcaggagccc agcagtagca gctgcagagc gcggaggcag agcacagcct
      121 cagccgccgc ctcgccatgg cgggtagcag ctcgctggag gcggtgcgga gaaagatccg
      181 gagcctgcag gaacaggcag atgccgcgga ggagcgtgcg ggcagcctgc agcgagagct
      241 ggaccaggag cggaagctgc gggaaaccgc tgaagctgac gtagcatctc tgaacagacg
      301 catccagctg gttgaggagg agttggatcg cgctcaggag cgtctggcca cagctctaca
      361 gaagctggag gaggctgaga aggctgcaga tgagagtgag agaggcatga aagtcattga
      421 aagccgagcc caaaaagatg aagaaaagat ggagattcag gagatccagc tgaaagaggc
      481 caagcacatt gctgaagatg ctgaccgaaa gtatgaagag gtggcccgta agctggtcat
      541 catcgagagc gatctggagc gtgcggagga gagggctgag ctctcggaag gcaaatgtgc
      601 cgagcttgaa gaagagttga aaacggtgac gaacaacttg aagtcactgg aggctcaggc
      661 tgagaagtac tctcagaaag aagacaagta tgaagaggag atcaaggttc tctctgacaa
      721 gctgaaggag gctgagaccc gggctgagtt tgcagagaga tcagtaacca aattggagaa
      781 aagcattgat gacttagaag ataagtttct ttgcttctct cctcccaaga ctccttcatc
      841 aagccggatg tcccacctct ctgagctctg catctgtctg ctctccagct gacccaggtt
      901 tctttctagt gcccacccac cctagggcca ggcacagacc gtgctttcta ttgtacagag
      961 gtgatcctcc cagtgtaaaa taaacactgt acgctagtcc tgtt
//
LOCUS       RATTMBR3A    2488 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Rat brain alpha-tropomyosin (TMBr-3) mRNA, 3' end.
ACCESSION   M34136
KEYWORDS    alpha-tropomyosin.
SOURCE      Rat brain, cDNA to mRNA, clone pOk10.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2488)
  AUTHORS   Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M.
  TITLE     Three novel brain tropomyosin isoforms are expressed from the rat
            alpha-tropomyosin gene through the use of alternative promoters and
            alternative RNA processing
  JOURNAL   Mol. Cell. Biol. 10, 1729-1742 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      604     brain alpha-tropomyosin (TMBr-3)
    mRNA    <     1     2488     TMBr-3
BASE COUNT      713 a    531 c    579 g    665 t
ORIGIN      
        1 tgaagctgac gtagcatctc tgaacagacg catccagctg gttgaggagg agttggatcg
       61 cgctcaggag cgtctggcca cagctctaca gaagctggag gaggctgaga aggctgcaga
      121 tgagagtgag agaggcatga aagtcattga aagccgagcc caaaaagatg aagaaaagat
      181 ggagattcag gagatccagc tgaaagaggc caagcacatt gctgaagatg ctgaccgaaa
      241 gtatgaagag gtggcccgta agctggtcat catcgagagc gatctggagc gtgcggagga
      301 gagggctgag ctctcggaag gcaaatgtgc cgagcttgaa gaagagttga aaacggtgac
      361 gaacaacttg aagtcactgg aggctcaggc tgagaagtac tctcagaaag aagacaagta
      421 tgaagaggag atcaaggttc tctctgacaa gctgaaggag gctgagaccc gggctgagtt
      481 tgcagagaga tcagtaacca aattggagaa aagcattgat gacttagaag atcaactcta
      541 ccatcaactc gagcaaaacc gccgtctaac taatgaacta aagctggccc tgaatgagga
      601 ttaaaaccct gggccaagtt ctaggaacgg agtccatgtt cagaaaatct atgactctcg
      661 gggttaaaaa atgataactt ctagagaaat gggaaggaaa tgcattggct ggaaggccag
      721 ccttgcctat gttttcctct gtatatctta attaaatctt caataaccaa gaagttttta
      781 tgatacatta tgtatatggg aaaatattta taccaatcaa ttcagctctt cctgcatcag
      841 caggccctct ttgaagtgat gataatgagg actcccccac caccaccatc cccccaaaaa
      901 aaggaaaaaa ttttctattt cctctaaaca gctgtgagtc ctgctttaaa tatctcttct
      961 acatttgtat tcctaacatc ctcctaacat gtatagtgtt tgaaatgtga tccaacgtgt
     1021 acctatgctt gggcaaaata gcttttgaaa acaggagctc gtaccacaag tccctggttg
     1081 aaaggtatgc tttgttaagt ttaacggtgt tggaatctga ttatcgtgtc atgctgttaa
     1141 gcgggatact ataaaaagac taagagaatg tccctaatga agtgtgcatg agatgtgttc
     1201 atataaccat ttgcgagccg tagaaataaa tctatttaaa aagtggtcag aaccttcctt
     1261 acgtcatctt cgttttctgt gaggtcgtgt tctgacccac ctacccgatc tttaactgcc
     1321 tgtaactcag aatcctgtgc tggtattaac gtcttcttcg gtctgtgtaa caatgctgta
     1381 cctgtatttg gtttttgacc ctcccttgtt ccccgtagca cgcagtgggt gagccttcac
     1441 caaacacacc ctgcacgtta cgtcagggtg ggttgcaggt gtaggtgcac ccggggcctt
     1501 gtgtagatgt gagtgacttc caaagcatgt gtagtgccac tcatctgtgt gtgactggtg
     1561 tttgcttact aaatgtacaa accactaccg agtgtgaagc tcagccattc atgactcaat
     1621 tttctgatcc catcacagag aaagtggcac atgccaaaga agaaaacctt agtatgcacc
     1681 agatgctgga ccagacttta ctggagctaa acaacatgtg aaaccctcct tagctgcggc
     1741 cacattctct cgtttccccc tgttgtttta ttttgtcttt aaacacatgc ttaccatgaa
     1801 accccttcaa tgcttttttt ttttatatat atatatacct ttaccactgt cactgcaaca
     1861 tctgcagaga gccagctagg gcagggggtg gggaaagaca cacagaaagc aggtgtactt
     1921 gttatttaaa tgtgccattt cccaggttga cattgccaca cttcacagag tttaggcatg
     1981 cagtttgctt agccagtgta ggaagcctca cgaaaacaaa aggattaaca gcctaagtgc
     2041 caatggtggg gtcatcagga agactgatgt tggagacaat caggtgcaga ttggtgctaa
     2101 tttaaacaaa agaacccctg gtgtcttttg ttcagtctaa tacaatttag acttctgtcc
     2161 aacactaatt tatttttgtc ttgagtagca ctgcgagagg atatgggttc catgtgcccg
     2221 tgttcctcac agccaagggt ttgaaagcca cgctggtctt gtaagattcc cattccttcc
     2281 cagctgacat gcttgtggtc ccttccaact cagtggggag agggctcagt gctgttcgcc
     2341 cccccccccc ccaattctcc atgtcagggt aagagatcta ggcactacat atgtaattgg
     2401 taaagaaaag gcattttctc gagagttata actatgtaaa cattgtataa tgatatgaaa
     2461 taaaatgcac attgtaggac attttcta
//
LOCUS       RSPNIFHD     1375 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  R.rubrum dinitrogenase reductase (nifH) gene, complete cds; and
            dinitrogenase alpha subunit (nifD') gene, 5' end.
ACCESSION   M33774
KEYWORDS    dinitrogenase alpha subunit; dinitrogenase reductase.
SOURCE      R.rubrum DNA.
  ORGANISM  Rhodospirillum rubrum
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 1375)
  AUTHORS   Lehman,L.J., Fitzmaurice,W.P. and Roberts,G.P.
  TITLE     The cloning and functional characterization of the nifH gene of
            Rhodospirillum rubrum
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.J.Lehman, 18-APR-1990.
FEATURES       from  to/span     description
    pept        367     1254     dinitrogenase reductase (nifH)
    pept       1370  >  1375     dinitrogenase alpha subunit (nifD)
    binding     355      358     ribosome binding site
    binding    1357     1361     ribosome binding site
    signal      145      160     upstream activating sequence
    signal      175      190     upstream activating sequence
    signal      280      296     ntr promoter
BASE COUNT      287 a    404 c    431 g    253 t
ORIGIN      
        1 ggggatccca aggagggacc ctccgtcctg accaagaact gtgcgttcgt gttttcgccc
       61 attcctttat caccgatcgg gcggcgacgg gaagggcgtt ctgggcagta gcgcaccatc
      121 gcgcatagcg tcgcgccgcc ccgctgttgg gtccgcgaca gggttttggg gcgttgtcgg
      181 gtttgcgaca cggttgaaaa taaccctaca ttccttgtta gatagattta ttcctttgat
      241 taaaaacgat aattttcaaa aaaatcgggc aaatcaggtt tggcacgctg ggtgcattaa
      301 cggggtgagc gtggcggtgg agcttccccg ccttaggacc ggatcacgaa gaaaaggatt
      361 gattccatga gcgcactgcg tcagatcgcg ttctacggca agggcggtat cggcaagtcg
      421 acgacgtccc agaacacgct ggccgctctg gtcgaaatgg gccagcggat cctgattgtc
      481 ggctgcgatc ccaaggccga ttccacccgc ctgatcctga acaccaagct gcaggacacc
      541 gttcttcatc tggccgccga ggccggctcg gtcgaagacc tcgacgtcgc cgatgtggtg
      601 aagatcggct acaagggcat caagtgcacc gaatccggtg gtccggagcc cggcgtcggc
      661 tgtgccggcc gtggcgtgat caccgccatc aacttccttg aggaaaacgg cgcctacgac
      721 gacctcgatt acgtctctta cgacgtgctg ggcgacgtgg tttgcggtgg cttcgccatg
      781 ccgatccgcg agaacaaggc tcaggaaatc tacatcgtca tgtcgggcga gatgatggcg
      841 ctgtatgccg ccaacaacat cgccaagggc attttgaaat acgcccacac cggtggggtt
      901 cgtctgggcg gtctgatctg caacgagcgt cagactgata aggaagtcga actggccgag
      961 gccctggccg gccgtttggg ctgccgcctc atccacttcg tgccgcgcga caacggcgtt
     1021 cagcacgccg agctgcgccg ccaaacggtg atccaatacg cccccgacag caagcaggcc
     1081 ggtgagtatc gcacgctggc gaccaagatc cacaacaact ccggccaggg cgtggtgccg
     1141 accccgatca ccatggaaga cctggaagag atgctgatgg aattcggcat catgaagtcc
     1201 gacgaagagg cgctggccga gctcgaagcc aaggaatccg ccgccgccaa ttaaggccgt
     1261 cgccgtgcga cgttcccccc ggccggccgg tcaggcgcag gggcgacgga agcgggcgac
     1321 ctccaagaag gcctcttcgc gaggcgatag tcgatcagag gagagggata tgagc
//
LOCUS       YSCMTFDEL      70 bp ds-DNA             ORG       08-AUG-1990
DEFINITION  S.cerevisiae mitochondrial novel junction sequence.
ACCESSION   M34150
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site         33       44     novel junction site
BASE COUNT       11 a     25 c     19 g     15 t
ORIGIN      
        1 cttcggagtt cggccccccc ataagggggg gacctcactc cttccccact gcactggatg
       61 cgggacttat
//
LOCUS       YSCMTGAP12    457 bp ds-DNA             ORG       08-AUG-1990
DEFINITION  S.cerevisiae mitochondrial gap 12 sequence.
ACCESSION   M34149 X15185
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 457)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
COMMENT     The gap 12 sequence is a segment of yeast mitochondrial DNA that
            had not been determined previously.
BASE COUNT      161 a     57 c     57 g    182 t
ORIGIN      
        1 tacaatttat aatttaataa agaaggaaat aaataataat aactcctttt ggggttccgg
       61 tggggttcac acctttataa ataataaata aagatgttta ctcctcttcg gggttcggtc
      121 ccctttttgg gttccggaac taattaatat tttatataat aataataata tattaatata
      181 atttcattat taataaatat ctcctgcggg gttcggtccc ccccgtaagg ggggggtcct
      241 cactccttcg gagcgtacta ttattataaa taattatata ttataatata attaaaaagt
      301 attataattg aaacgaaaat tgtaatttta aatggaataa taattattat atatttaata
      361 tatttaataa agttataata tctctttcta ccggactatt ttattttatt ttattttatt
      421 tttataaaga aaaatagtat aatattatct tctcctc
//
LOCUS       YSCMTORI1A     70 bp ds-DNA             ORG       08-AUG-1990
DEFINITION  S.cerevisiae mitochondrial ori1 gene, 5' flank.
ACCESSION   M34153
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
BASE COUNT       19 a     11 c     24 g     16 t
ORIGIN      
        1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag
       61 ggggtgggtg
//
LOCUS       YSCMTORI2A     70 bp ds-DNA             ORG       08-AUG-1990
DEFINITION  S.cerevisiae mitochondrial ori2 gene, 5' flank.
ACCESSION   M34152
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA         32  >    70     ori2 mRNA (+/- 4 bp)
BASE COUNT       23 a      0 c     18 g     29 t
ORIGIN      
        1 tttttttata agataatttt tgtaaatata taagtaataa attaagtttt atagggggag
       61 ggggtgggtg
//
LOCUS       YSCMTORI7A     70 bp ds-DNA             ORG       08-AUG-1990
DEFINITION  S.cerevisiae mitochondrial ori7 gene, 5' flank.
ACCESSION   M34154
KEYWORDS    .
SOURCE      S.cerevisiae mitochondrial DNA.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 70)
  AUTHORS   Skelly,P.J. and Clark-Walker,G.D.
  TITLE     Conversion at large intergenic regions of mitochondrial DNA in
            Saccharomyces cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 1530-1537 (1990)
  STANDARD  simple staff_review
BASE COUNT       19 a     11 c     24 g     16 t
ORIGIN      
        1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag
       61 ggggtgggtg
//
LOCUS       MXAFRZE      2493 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  M.xanthus gliding motility regulatory protein (frzE) gene, complete
            cds.
ACCESSION   M35192
KEYWORDS    gliding motility regulatory protein.
SOURCE      M.xanthus (strain DZF1) DNA.
  ORGANISM  Myxococcus xanthus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; 
            Myxococcaceae.
REFERENCE   1  (bases 1 to 2493)
  AUTHORS   McCleary,W.R. and Zusman,D.R.
  TITLE     FrzE of Myxococcus xanthus is homologous to both CheA and CheY of
            Salmonella typhimurium
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.R.McCleary, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         64     2397     gliding motility regulatory protein (frzE)
BASE COUNT      375 a    848 c    926 g    344 t
ORIGIN      
        1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg
       61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc
      121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag
      181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt
      241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag
      301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc
      361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc
      421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg
      481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc
      541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc
      601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc
      661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc
      721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc
      781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg
      841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg
      901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc
      961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc
     1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag
     1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg
     1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg
     1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc
     1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc
     1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg
     1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc
     1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag
     1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc
     1561 accatcaccc tgcgcctgcc gcagtcgctg gcgttgatga aggtgctgct ggtgcgcctg
     1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg
     1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg
     1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc
     1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac
     1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc
     1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc
     1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc
     2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag
     2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag
     2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg
     2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg
     2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc
     2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga
     2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg
     2461 caaggggctg cgtgcgctcg cggcccgggg cct
//
LOCUS       RATPCBB       463 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Rat PCB-binding protein mRNA, complete cds..
ACCESSION   J05536
KEYWORDS    PCB binding protein; binding protein.
SOURCE      Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone
            pPCB6.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 463)
  AUTHORS   Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J.,
            Gillner,M., Gustafsson,J.-A. and Lund,J.
  TITLE     Cloning, structure, and expression of a rat binding protein for
            polychlorinated biphenyls: Homology to the hormonally regulated
            progesterone-binding protein uteroglobin
  JOURNAL   J. Biol. Chem. 265, 12690-12693 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            J.Lund, 15-JUN-1990.
FEATURES       from  to/span     description
    pept         58      348     PCB binding protein precursor
    sigp         58      114     PCB binding protein signal peptide
    matp        115      345     PCB binding protein
BASE COUNT      117 a    156 c     84 g    106 t
ORIGIN      
        1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg
       61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg
      121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat
      181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag
      241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg
      301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat
      361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta
      421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc
//
LOCUS       YSBCPR       2990 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds.
ACCESSION   M35199
KEYWORDS    NADPH-cytochrome P450 reductase.
SOURCE      C.tropicalis (ATCC 750) DNA, clone pTS1.
  ORGANISM  Candida tropicalis
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina.
REFERENCE   1  (bases 1 to 2990)
  AUTHORS   Sutter,T.R., Sanglard,D. and Loper,J.C.
  TITLE     Isolation and characterization of the alkane-inducible NADPH-
            cytochrome P450 oxidoreductase gene from Candida tropicalis:
            Identification of invariant residues within similar amino acid
            sequences of divergent flavoproteins
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            T.R.Sutter, 15-JUN-1990.
            
            Author name and address
               T.R.Sutter
               CIIT, P.O. Box 12137
               Research Triangle Park, NC 27709
FEATURES       from  to/span     description
    pept        554     2596     NADPH-cytochrome P450 reductase
    signal      198      204     TATA box
BASE COUNT      988 a    442 c    558 g   1002 t
ORIGIN      
        1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa
       61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg
      121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag
      181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag
      241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag
      301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa
      361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata
      421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta
      481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc
      541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt
      601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt
      661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa
      721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa
      781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc
      841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt
      901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac
      961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac
     1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg
     1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt
     1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa
     1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga
     1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt
     1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga
     1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa
     1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat
     1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt gatactgtta ttgaattgaa
     1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat
     1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg
     1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga
     1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc
     1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt
     1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt
     1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac
     1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca
     2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag
     2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg
     2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg
     2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt
     2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa
     2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa
     2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc
     2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga
     2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca
     2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta
     2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga
     2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat
     2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc
     2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta
     2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt
     2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta
//
LOCUS       CPCURA5A     2029 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  C.neoformans orotidine monophosphate pyrophosphorylase (URA5) gene,
            complete cds.
ACCESSION   M34606
KEYWORDS    orotidine monophosphate pyrophosphorylase.
SOURCE      C.neoformans (strain B-3501) asexual phase cell line B-3501 DNA,
            clone pURA5g2.
  ORGANISM  Cryptococcus neoformans
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina; Filobasidiales.
REFERENCE   1  (bases 1 to 2029)
  AUTHORS   Edman,J.C. and Kwon-Chung,K.J.
  TITLE     Isolation of the URA5 gene from Cryptococcus neoformans var
            neoformans and its use as a selective marker for transformation
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Edman, 29-MAY-1990.
            
              Author address: J.C.Edman
              University of California San Francisco
              Hormone Research Institute
              San Francisco, CA 94143-0534
              email: edman@socrates.ucsf.edu
            
FEATURES       from  to/span     description
    pept        714      814     orotidine monophosphate pyrophosphorylase
                                 (URA5), exon 1 (E.C. 2.4.2.10)
                867     1095     orotidine monophosphate pyrophosphorylase,
                                 exon 2
               1145     1492     orotidine monophosphate pyrophosphorylase,
                                 exon 3
    pre-msg <   714     1527     URA5 mRNA and introns
    IVS         815      866     URA5 intron A
    IVS        1096     1144     URA5 intron B
BASE COUNT      512 a    515 c    479 g    523 t
ORIGIN      
        1 gatcttgggg atggtattga agacgatcca gaaaaagtct ctccatccct cccacgttcg
       61 ctgttcggtt tctttgcaaa gagcgaagtt gctcgacctt tgacagaccc taaaatctgg
      121 tcgtattttg cgttcaaaga tcgcttactg cgaccgcgat tttttagagg tggatttttt
      181 gttgcgagac tagcgcatcg gataggcatg ggtgatatac gtttgcgttc cttatgaggg
      241 gtatcatcag tggcaaatta ccgactagat gaagaagacg acctacaccc cgctaaccgg
      301 tgtccgatcg acatgatcca cggtgtccaa agcgaagaag gacatgatga ttggagttcg
      361 cttacctttg gtataaacca gtgttgtttc tgaagaggtg aggaagaaaa tgatctaggg
      421 caataacaga aatcgtttga agaagaagta aaagaaaaga cacacgtcag acaagcatca
      481 tcatcattca ttttgaatac tcatcacggc accttcttcc ttgcatgcgc attattcatg
      541 atgagatgct cgcagccgcc ggatttcggc tgcaatcaag aacgatatcg cgtacggcga
      601 tgtccgcctc cactccatct tccaacaacc accacagtgc ccgataacac tacgacgaaa
      661 attgttcttt tttggatctt gtcgacaacg gcgaccaaaa caagaacttg atcatgtcct
      721 cccaagccct cgactccgcc aaagttgcct tcatcgaggc tgccatcgaa catggcgtgc
      781 ttcttttcgg caactttacc ttgaagtccg gccggtgagc catattgcag cgcttcacag
      841 tccaatcgaa tctgacatgt gttcagccaa tccccttact tcttcaatgc cggtctcctt
      901 tactcttcat cgcttctctc aactaccgct caggcttacg ccaaggtact ttcctcttct
      961 aggattcctg actttgacgt cctcttcggc ccagcttaca agggtatctc cttggctgct
     1021 gtctccgctg taagccttta tcagcaaacc ggcaaagata tcggctactg ctacaacagg
     1081 aaggagaaga aggacgtgag tctgtcctaa ccagtgcgac agcgatgagc tcataagcca
     1141 gtagcacggt gagggcggta ctatggtcgg tgcgcctctc aagggacgaa tcgtcatcat
     1201 cgacgatgtt ctcacctctg gcaaggccat ccgtgaagct attgacattc tcaaggcctc
     1261 ccctgaagcg aagcttgtcg gaattgtcca gcttgtcgac agacaagaga aaggccagag
     1321 cggtagcggc aagagtaccg tacaggaggt tgaggaagag ttcggtgtgc ctgtcgagcc
     1381 tattattggt ttggacgaca ttgtgaagta cttagaaagc tccggcaagt gggaaaagga
     1441 gctgcaagag gtcaggaagt acagggcgga gtacggtgtt cagaggtctt aagggttttc
     1501 ttcttaaatg cacgggttta ggtctagcta atcaagttcc gacatattac aagtttgtaa
     1561 gcttgtatca aaggaactta agtacaggca ggcgtcctga ggcgacaaag gaagctgtaa
     1621 tatgattgtt ggctgtcaat cttcatcgta tctactttgt caatactgac ttcaatgacc
     1681 caataataca attttattag tgttgaccca gaatggttag caggaaactc cccttctctt
     1741 cctctcaatc ccaatcatac ttcatatctc ctgctccccc catttccgtc ttcctcgatg
     1801 actccctggt cccatccctc ccacctcctg gaggcaagct ggagcacctg gacctaatgg
     1861 gtcgtcgccc caaattgcct ccacctctta agacaatcat cgtccaatca aactctgaac
     1921 tatcttccaa gccaatggcc ggatctggac acaagatgat gtcgaagccg cttgatgtgc
     1981 cagggttggt ccgtggcctg gagacgtaag agcgggtagt actgggatc
//
LOCUS       ECOCHEA      2190 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  E.coli cheA protein (complete cds), motB protein (3' end),
            and cheW protein (5' end) genes.
ACCESSION   M34669
KEYWORDS    cheA gene; cheW gene; motB gene.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2190)
  AUTHORS   Kofoid,E.C. and Parkinson,J.S.
  TITLE     Tandem translational starts in the cheA locus of E.coli
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.C.Kofoid, 01-JUN-1990.
            
              Author address: E.C.Kofoid
              University of Utah
              Dept of Biology
              Salt Lake City, UT 84112
              email: kofoid@bioscience.utah.edu
FEATURES       from  to/span     description
    pept    <     1      104     MotB protein
    pept        109     2073     CheA protein (alt.)
    pept        400     2073     CheA protein (alt.)
    pept       2094  >  2190     CheW protein
BASE COUNT      575 a    546 c    609 g    460 t
ORIGIN      42 min on K12 map.
        1 atgaaaacgc cgaaagccag aatgagccag taagcgccct ggaaaaacct gaggttgcac
       61 cacaggtcag tgttcccaca atgccatcag ccgaaccgag gtgacagcgt gagcatggat
      121 ataagcgatt tttatcagac attttttgat gaagcggacg aactgttggc tgacatggag
      181 cagcatttgc tggttttgca gccggaagcg ccagatgccg aacaattgaa tgccatcttt
      241 cgggctgccc actcgatcaa aggaggggca ggaacttttg gcttcagcgt tttgcaggaa
      301 accacgcatc tgatggaaaa cctgctcgat gaagccagac gaggtgagat gcaactcaac
      361 accgacatta tcaatctgtt tttggaaacg aaggacatca tgcaagaaca gctcgacgct
      421 tataaacagt cgcaagagcc ggatgccgcc agcttcgatt atatctgcca ggccttgcgt
      481 caactggcat tagaagcgaa aggcgaaacg ccatccgcag tgacccgatt aagtgtggtt
      541 gccaaaagtg aaccgcaaga tgagcagagt cgcagtcagt cgccgcgacg aattatcctt
      601 tcgccgctga aggccgggga agtcgacctg ctggaagaag aactgggaca tctgacaacg
      661 ttaactgacg tggtgaaagg ggcggattcg ctctcggcaa tattaccggg cgacatcgcc
      721 gaagatgaca tcacagcggt actctgtttt gtgattgaag ccgatcagat tacctttgaa
      781 acagtagaag tctcgccaaa aatatccacc ccaccagtgc ttaaactggc agccgaacaa
      841 gcgccaaccg gccgcgtgga gcgggaaaaa acgacgcgca gcaatgaatc caccagcatc
      901 cgtgtagcgg tagaaaaggt tgatcaatta attaacctcg tcggcgagct ggttatcacc
      961 cagtccatgc ttgcccagcg ttccagcgaa ctggacccgg ttaatcatgg tgatttgata
     1021 accagcatgg ggcagttaca acgtaacgcc cgtgatttgc aggaatcagt gatgtcgatt
     1081 cgcatgatgc cgatggaata tgtttttagt cgctatcccc ggctggtgcg tgatctggcg
     1141 ggaaaactcg gcaagcaggt agaactgacg ctggtgggca gttctactga actcgacaaa
     1201 agcctgatag aacgcattat cgacccgctg acccacctgg tacgcaatag cctcgatcac
     1261 ggtattgaac tgccagaaaa acggctcgcc gcaggtaaaa acagcgtcgg aaatttaatt
     1321 ctgtctgccg aacatcaggg cggcaacatt tgcattgaag tgaccgacga tggggcgggg
     1381 ctaaaccgtg agcgaattct ggcaaaagcg gcctcgcaag gtttgactgt cagcgaaaac
     1441 atgagcgacg acgaagtcgc gatgctgata tttgcacctg gcttctccac ggcagagcag
     1501 gtcaccgacg tctccgggcg cggcgtcggc atggacgtcg ttaaacgtaa tatccagaag
     1561 atgggcggtc atgtcgaaat ccagtcgaag cagggtactg gcactacgat ccgcatttta
     1621 ctgccgctga cgctggccat cctcgacggc atgtccgtac gcgttgcgga tgaagttttc
     1681 attctgccgc tgaatgctgt tatggaatca ctgcaacccc gtgaagccga tctccatcca
     1741 ctggccggcg gcgagcgggt gctggaagtg cggggtgaat atctgcccat cgtcgaactg
     1801 tggaaagtgt tcaacgtcgc gggcgcgaaa accgaagcca cccagggaat tgtggtgatc
     1861 ttacaaagtg gcggtcgccg ctacgccttg ctggtggatc aattaattgg tcaacaccag
     1921 gttgtggtta aaaaccttga aagtaactat cgcaaagtcc ccggcatttc tgctgcgacc
     1981 attcttggcg acggcagcgt ggcactgatt gttgatgtct ccgccttgca ggcgataaac
     2041 cgcgaacaac gtatggcgaa caccgccgcc tgaatgagta aaaaggtaac aatatgaccg
     2101 gtatgacgaa tgtaacaaag ctggccagcg agccgtcacg gcaggaattt ctggtattta
     2161 cccttggtga tgaagagtac ggtattgatg
//
LOCUS       HAMHSP60A    2726 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  C.griseus chaperonin (HSP60) non-functional pseudogene.
ACCESSION   M34562
KEYWORDS    chaperonin; pseudogene.
SOURCE      C.griseus cell line CHO DNA, clone GC-1.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 2726)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
    IVS        1738     1824     HSP60 intron A
BASE COUNT      915 a    513 c    579 g    719 t
ORIGIN      
        1 catggtgatt tttctttaga tattctgggc ctcctgaaac tgacaagatt ctgtaaagca
       61 aaagacataa tcaacaagat aagacagcag tgcaaacaat gggaaaagtt cttcaccaac
      121 cccatatcta gccaagggca aatttctaaa acatacaaag aactgatgaa agcagtcaac
      181 aaaacaccaa ataatcgccc tcactctccg ccaccagcct gctttgccag atgagaccag
      241 tgtcccgggc gctggctcct catgtcactc gggcctatgc caatgatgta aaatgtgggg
      301 caaatccttg atccttaaag cttcaagttg tagacctttt agccgatgct gtagctgtta
      361 caatggggcc aaagggaaga acagtgatta ttgaacagag ttgaggaagt cccaaagtaa
      421 caaaagatgg ggtcattttt acaaaggcaa ttgatctaaa ggatacatac aaataatatc
      481 agagctaaac ttgttcaagt tgttgccaat aactcaaatg aagaggctgg gatggcacta
      541 ccactgctac tgttctggca agttctattg ccaacgaagg ctttgagaag atcagcaaag
      601 ggtctaatcc agtagaaatc tggagaagtg tgatgttggc tgttgatgct gtaattgctg
      661 aactaacaaa acaatctaaa catgtgacaa cccatgaaga aatttctcag gttgctacaa
      721 tttctgcaaa tggacacata gacattggaa acatcatatc tgatgcaatg aaacatgttg
      781 gagggaaaga cgtcatcaca ctgaaggata gaaataccct gaatgatgaa ttagaaatta
      841 ttggaggcat gaagtttgaa agaagatgta tttctcagac atcaataagt caaaaatgtg
      901 aattccaaga tgcctatgtt ctattgagtg aaaagaaaat ttctattgtc gagcccatta
      961 ctgctcatag gaaacaaatg gtgatatttg ctgaagatgt tgatggagaa gctctaagca
     1021 cactggtttt gaacaggcta aaagctggtc ttcagatcgt agcagttaaa gctccaggtt
     1081 tttggggaca ataggaagaa taagcttaaa gatacggcta ttgctactga tggtgagcaa
     1141 tttggagaac aggggctaaa actaaatatt gatgatggtc aagttcatga tttaggaaaa
     1201 tttggagagg tcattgtccc caaagatgat gccatgcttt tgaaaggaac tgatgactaa
     1261 actcaaattg agaaaccaat tcaagaaatc actaagcagc tagacatcac aactagtgaa
     1321 tatgaaaagg aaaagctaaa cgaaacactt gctaaacttt cagatggtgc agctgtgttt
     1381 aaggttggag gaacaaggga tgttgaagtg aaggagaaga aagacagagt tacaaatgct
     1441 ctcaatgcta caagagcagc tgttgaaaag ggcattgttt taggacttag ctgtgctctg
     1501 cctctgtgca tcccagcctt ggattcctta aagccttcaa atgaagatca gaaaataggt
     1561 atagaaatta ttaaaagagc actcaaaatt cctgcagtga caattgctga gaatgcaggt
     1621 gttgaataat ctttgagagt tgagaaaatt ctgcagggtt cctagggagt tggttattag
     1681 gctatgcccg caaattttgt gaacatggta gaaaaaggaa tcattaatcc aacaaaggta
     1741 aatgagtcag caattcttaa acattggatg gcttgttcta tggtatgtgt tttcattaat
     1801 cattttataa acattttttt gtaggttgta agaactgctt tagtagatgc ctccttgcta
     1861 acttcagcag aagctgtagt gacagaaatt ccaaaagaag agaaggacct tggaatgcgc
     1921 acacatgcct ttaatcctag cacttgcgag gcagaggcag gcagatatct gtgagttcga
     1981 ggccagcctg gactccggag tgagtgccag gatagtctcc aaagctatcc aaagagaaac
     2041 cctgtcttga aaaacaaaca aacaaaaaag acatttgtac attcctgaaa aaaacccaaa
     2101 taatccaatt aaactagggg tcagcctaat aaagataccc gaataggttt ttacatctta
     2161 agagaaaaat ctactaataa atgacttgga gcaggtcctt cacagacaat atttagtgag
     2221 agtaggtagc ctcatgtggg ttagtagttc cagttgcttc aaaatgccct caccttttgt
     2281 cataggctca tgtggaggat ctactccatt ggatggtgat attaaaagta gcacatgggt
     2341 acatgcctta agatattaag atccccaagt aaaaactgca tctcagatgc tctctgtttc
     2401 ttgggccatg atatgggcag tgagatttct gtcacacttg ccaggaaaca cttatgatct
     2461 gtctgtcaca atctcttata ttgtcaagta cttaaccaaa cacaatcacc catgacctct
     2521 catatcctcc tggacttaga caggaggcac aagtctactc aggtataaat atttttactt
     2581 ctgatgcccc acagtaacct acaatggctt cagtccagta cacaggtaag taaacccacc
     2641 aatataccat tccagctgat tgataacctc ttagatacat cccatatgct gcagttccca
     2701 aatggagaca acccaaagtt tcttga
//
LOCUS       HAMHSP70A    2243 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Chinese hamster heat shock protein (cognate form of hsp70) mRNA,
            complete cds.
ACCESSION   M34561
KEYWORDS    heat shock protein; stress protein.
SOURCE      Chinese hamster cell line CHO, cDNA to mRNA, clone RS 23.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 2243)
  AUTHORS   Ahmad,S., Ahuja,R., Venner,T.J. and Gupta,R.S.
  TITLE     Identification of a protein altered in mutants resistant to
            microtubule inhibitors with the constitutive form of the major heat
            shock protein (hsp70)
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN% GUPTAR@,SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
    pept         50     1990     heat shock protein (hsp70)
    mRNA    <     1     2243     hsp70 mRNA
    signal     2220     2225     poly-A signal
BASE COUNT      654 a    486 c    551 g    552 t
ORIGIN      
        1 gttttgtgtg gtctcgtcgc cagcacagcc gggcctacag caagcaacca tgtctaaggg
       61 acctgcagtt ggtattgatc ttggcaccac ctactcctgt gtgggtgtct tccagcatgg
      121 aaaggtggag ataattgcca atgaccaggg taaccgaacc acgccaagct atgttgcttt
      181 tactgacaca gaacgattaa ttggggatgc cgcgaagaat caggttgcaa tgaaccccac
      241 caacacagtt tttgatgcca aacgtctgat cggacgtagg tttgatgatg ctgttgttca
      301 gtctgatatg aagcactggc ccttcatggt ggtgaacgat gcaggcaggc ccaaggtcca
      361 agtagaatac aaaggagagg caaaaagttt ctacccagag gaggtgtcct ctatggttct
      421 gaccaagatg aaggaaattg cagaagccta tctcgggaag actgttacca atgctgtggt
      481 cacagtgcca gcctacttca atgattctca gcgacaggct acaaaagatg ctggaactat
      541 tgctggcctg aatgtcctgc gaattatcaa tgaaccaact gctgctgcta ttgcttatgg
      601 gctagataag aaggttggag ctgaaaggaa tgtgctcatt tttgacttgg gaggtggtac
      661 ttttgatgtg tctatcctca ctattgagga tggaattttt gaagtcaaat caacagccgg
      721 agacacccac ttgggtggag aagactttga taacagaatg gtcaaccatt tcattgctga
      781 gtttaagcga aacgacaaga aggacatcag cgagaacaag agagctgtca ggcgtctgcg
      841 tactgcctgt gaacgggcaa agcgcaccct ctcctccagc acccaggcca gtattgagat
      901 tgattctctc tatgagggaa ttgatttcta tacctccatt acccgtgctc gattcgaaga
      961 attgaatgct gacctgttcc gaggcactct ggaccctgta gagaaggccc ttcgagatgc
     1021 taaactagac aagtcacaga tccatgacat tgtcttggtg ggtggttcca ccagaatccc
     1081 caagattcag aagcttctgc aagacttctt caatggaaaa gaactgaata agagcattaa
     1141 ccccgatgaa gctgttgctt atggtgcagc tgtccaggcg gccattctat ctggagacaa
     1201 gtctgagaat gttcaggatt tgttgctctt ggatgtcact cctctttccc ttggcattga
     1261 aactgctggt ggagtcatga ctgtcctcat caagcgcaat accaccatcc ctaccaagca
     1321 gacgcagact ttcactacct attctgacaa tcagcctggt gtgctcattc aggtctatga
     1381 aggcgaaagg gccatgacca aggataacaa cctgcttgga aagtttgaac ttacaggcat
     1441 ccctccagca ccccgtggtg ttcctcagat tgaagttact tttgacattg atgccaatgg
     1501 catcctcaat gtctctgctg tagataagag cacaggcaag gagaacaaga tcactatcac
     1561 caacgacaag ggccgcttga gcaaggagga tattgaacgc atggtccagg aagctgagaa
     1621 gtacaaagct gaagatgaga agcagagaga taaggtttcc tccaagaatt cactggagtc
     1681 ctatgctttc aacatgaaag caacagttga agacgagaaa cttcaaggca agatcaatga
     1741 tgaggacaaa cagaagattc tagacaagtg taatgaaatc atcagttggc tggataagaa
     1801 tcagactgca gagaaggaag aatttgaaca ccagcagaaa gaactggaga aagtctgcaa
     1861 ccccatcatt accaagctgt accagagtgc tggtggcatg cctggaggaa tgcccggggg
     1921 cttccctggt ggtggagctc ctccatctgg tggtgcttct tcagggccta ccattgaaga
     1981 ggtggattaa gtcagtccaa gtagaggtgt agcattgttc cacagggaaa catttgaagg
     2041 acccaaattt gcaaagtttc atggcagttt ttaaatttaa actgttaaca ataaattact
     2101 gggcattctc aatacttgaa catggaacat gtacacagga aataaaacat tgctctttat
     2161 aagcactgta tcctaagtgg aaaactcaat gtctaatatg gtggaaaatg gaatgtctaa
     2221 ataaaactat ttaaattggc acc
//
LOCUS       HAMTCP1A     1845 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Chinese hamster T-complex protein 1 (TCP-1) mRNA, complete cds.
ACCESSION   M34665
KEYWORDS    T-complex protein 1.
SOURCE      Chinese hamster cell line CHO, cDNA to mRNA, clone pRG 501.
  ORGANISM  Cricetulus griseus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 1845)
  AUTHORS   Ahmad,S. and Gupta,R.S.
  TITLE     Cloning of a Chinese hamster ovary cell protein homologous to the
            mouse T-complex protein TCP-1: Structural similarity to the
            ubiquitous "Chaperonin" family of heat shock proteins
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 01-JUN-1990.
            
              Author address: R.S.Gupta
              McMaster University
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR@,SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
    pept         89     1759     T-complex protein 1
    signal     1818     1823     poly-A signal
BASE COUNT      530 a    368 c    472 g    475 t
ORIGIN      
        1 gttccccgct gtggtggccg cagtccggcc tctgagcgga gatcccggcc gtccgcggtg
       61 ttagcgaggc cagccaggtt ccctaacgat ggagggccct ttgtccgtgt tcggggaccg
      121 cagcactgga gaggcgatcc gctcccagaa tgttatggct gcagcttcca ttgccaacat
      181 tgtgaaaagt tctctggggc cagttggctt ggataaaatg ttggtggatg acattggtga
      241 tgtaaccatt accaacgatg gggccaccat cctgaagtta ctggaggtag aacaccctgc
      301 agctaaagtt ctttgtgagc tggctgactt gcaagacaaa gaagtgggag atggaactac
      361 ctctgtggta attattgcag cggagcttct gaaaaatgca gatgaactag tcaaacagaa
      421 aattcatcca acatcagtta ttagtggcta tcggcttgcc tgcaaggaag cagtgcgcta
      481 tatcagtgag aacctaatta ttaacacaga cgaacttgga agagactgtc tgattaatgc
      541 cgctaagaca tccatgtctt ccaaaattat tggaataaat ggtgatttct ttgctaatat
      601 ggtggtagat gctgtacttg ctgtcaaata cacagatctg agaggccagc ctcgatatcc
      661 agtcaattct gttaatattc tgaaagccca tggaagaagt caggtagaaa gcatgctgat
      721 caatggctat gcactgaact gtgtggtggg atctcagggc atgcccaaga ggatagttaa
      781 tgcaaaaatt gcatgtcttg acttcagcct gcagaaaaca aaaatgaagc ttggtgtgca
      841 ggtggttatt acagatcctg agaaattgga ccaaattaga cagagagaat cagacatcac
      901 caaggagaga attgaaaaga ttctggcgac tggggccaat gttattctga ccactggtgg
      961 gatagatgat atgtgtctga agtactttgt ggaggctggt gccatggctg ttagacgagt
     1021 tctaaaaagg gatctcaagc gtatcgctaa agcttctgga gcaagtatcc tgtccacact
     1081 ggccaatttg gaaggcgaag aaacttttga agctaccatg ctggggcaag cagaagaagt
     1141 ggtccaggag agaatttgtg atgatgagct gatcttaatc aaaaatacca aagctcgtac
     1201 atctgcatca atcatcttac gaggagcaaa tgatttcatg tgtgatgaaa tggagcgatc
     1261 tttacatgat gctctttgtg tggtgaagag agttttggag tcaaaatctg tagtcccagg
     1321 aggaggtgct gtagaagccg ccctgtccat ataccttgaa aactatgcaa ccagtatggg
     1381 gtctcgggaa cagcttgcta ttgcagagtt tgcaaggtct cttcttgtta ttcctaatac
     1441 actggcagtg aatgctgccc aagactccac tgacctggtt gccaagttac gagcttttca
     1501 caatgaggct caagttaacc cagaacgtaa aaacctgaag tggattggtc ttgatttgat
     1561 caatggaaaa ccacgagaca acaagcaagc aggggtgttt gaaccaacca tagttaaagt
     1621 gaagagcctg aagtttgcaa cagaagctgc aatcactatt cttcgaatcg atgatctgat
     1681 aaaattacac ccggaaacta aagacgataa acatggaagt tatgaaaatg ctgttcactc
     1741 tggagccctt gatgactgat tggatttccc ttttatttat agcagtgtca ggtgcaatgt
     1801 agccttgggt atctcacatt aaagtacagt gagctgtcca cctga
//
LOCUS       HUMHP601A    1941 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human chaperonin (HSP60) non-functional pseudogene 1.
ACCESSION   M34660
KEYWORDS    chaperonin; pseudogene.
SOURCE      Human fetal liver DNA, clone PS2-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 111 to 1941)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1941)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
BASE COUNT      624 a    329 c    446 g    541 t      1 others
ORIGIN      
        1 atatgaaata aagcctttat gtgctctacc taaattaata aatttattta ttttttttng
       61 tatatttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct
      121 gcctcgccat gcgacatgtc ctgccagcac cccacagaaa tgcttccatt ccattaccca
      181 cagtctttca ccagatgaga ccagtgtcca ggctactggc tcctcacctc acttgaagtg
      241 atggtaaaga tgtaaaattt ggtgctgatg tccgagcctt aatgcttaaa agtgtagacc
      301 tttttgtcga tgtagccatt acaatggggc caaagggaag aacagtaatt attgaacata
      361 gctgggcaag tcccaaagta acaaaagatg gtgtgactga tgcaaagtca attgacttaa
      421 aagataaata taaaagcatt ggagctaaac ttgtccaaga tgttgccaat aacacagatg
      481 aagagactgg ggatggcact atcacttgct gttgtactgg tatgctctat ttccaaataa
      541 ggtttccaga aggttagcaa aggtgctaat ccagtggaaa tcaggagagg tgtatgttag
      601 ctgatgctgt aattgctgaa cttaaaaagc agtctaaacc tgtgaccaaa cctgaagaaa
      661 ttgcacaggt tgctacaatt tctgcaaatt gagacaaaga aattggcaac atcatctctg
      721 atgcaatgaa aaaagttgga agaaagggca tcatcacagt aaaggatgga aaaacactga
      781 ctgtagaatt agaaattatt gaaggcatga aatttgatcg aggatatatt tctccatact
      841 ttattaatac atcaaaaggt gagaaatgtg aattccagga tgcctatgtt ctgttgcatg
      901 aaaagcaaaa tttctacgtg ctccacgtcc attgtaactg ctcttgcaaa ttgccaatgc
      961 ttactgtaag cctttggtca taattgctgg agatattgat ggagaagctc taactacact
     1021 catcctgaat aggctaaagg ttggtcttca ggttgtggca gtcaaggctc cagggtttgg
     1081 tgacaataga aagaaccagc ttaaagatac ggttattgct actggtggta cagtgtttgg
     1141 agaagagggc tcacactaaa tcttgaagac gttcagcctc gtgacgtagg agagttggag
     1201 aggtactgtg atcaagatat gccatgctct taaaagcaaa aggtaacaag tctcaaatta
     1261 aaaatgtgtt caagaaatca ttgaccagtc agatgtcaca actatgtaat tgaaaaggaa
     1321 aaagtgagtg gagaaacttt cagatggagt agctgtgctg aaggttggtg gcaagtgatg
     1381 ttgaagttaa tgaagagaaa gacagagtta taggtgcact taatgctaca agagctgctg
     1441 ttgaagaagg cattgtttag ggagggggtt gtgccctgct tcgatgcatt ccagccttgg
     1501 actcattcac tccagctaat gaagataaaa taattggtat agaaattatt aaaagaacac
     1561 tcaaaattcc agcaatgaca attcgtaaga atgcaggtgt tgatgcattt ttgatagttg
     1621 agaaaattat gtaaagttcc tcagaagttg gttatgatac tatgttagga gatgtcgtga
     1681 atatggtgga aaaagacatt attgacccaa caaaggttgt gagaactgct tcattggatg
     1741 ctgctggcat gccctctata ttaactacag cagctgttgt agtcacagaa attcctaaag
     1801 aagggaacag ccctggaatg ggtgcatgtg gaatgggagg tggcctgttc taattcctag
     1861 aatagtgctt tacctttatt aatgaattgt gataggaacc caaggcagtg ttcctcacca
     1921 ataacttcag agaagtcagt t
//
LOCUS       HUMHP602A    1920 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human chaperonin (HSP60) non-functional pseudogene 2.
ACCESSION   M34662
KEYWORDS    chaperonin; pseudogene.
SOURCE      Human fetal liver DNA, clone PS2-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 117 to 1920)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1920)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
BASE COUNT      619 a    324 c    431 g    544 t      2 others
ORIGIN      
        1 tggattatgn aaatnaaagc ctttatgtgc tctgcctaaa ttaatatttt attttttttt
       61 gatacttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct
      121 gcctcacctc accatgcaca tgtcctgcca ccaccccaca gaaatgcttc acttacccac
      181 agtctttcgc cagatgagac cagtgtccag ggtactggct cctcaccctc acttgaagga
      241 tggtaaagat gtaaaatttg gtgctgatgt ctgcgcctta atgcttcaaa gtgtagactt
      301 ttgtcagtaa ttattgaaca tagctgggaa gtcccaaagt aacaaaagat ggtctgactg
      361 atgcaaagtc aattgactta aaggataaat ataaaagcat tggagctaaa cttctccaag
      421 atgttgccta aaacacaaca aatggagaga ctggggatgg cactatcact gctgctgtac
      481 tggtatgctc tatttccaaa taagccttcc agaaggttag caaaggtgct aatccagtgg
      541 aaatcatcag agagtgtgat gttagctgtt gatgctgtaa ttgctgaact taaaaagcag
      601 tctaaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga
      661 cacaaagaaa attggtaaca tcatctctga tgcaatgaaa aagtttggaa gaaagggcat
      721 catcacagta aaggatggaa aaacactgac tgatgaatta gaaattattg aaggcatgaa
      781 aaatttgatc gaggatatat ttctccatac tttcttaata catcaaaagg tgagaaatgt
      841 gaattccagg atgcctatgt tctactgagt gaaaagaaca tttctggtgt ccagtccatt
      901 gtacctgctc ttgaaattgc cagtgcttac cttaagcctt tggtcacaat tgctgaagat
      961 attgatgaag aaacgttaag tacgctcatc ttgaataggc taaaagttgg tcttcaggtt
     1021 gtagcagcca aagtttcagg ttttggtgac aacggagcca gcttaaagat atggctattg
     1081 ctactggtag tgcagtgttt ggagaagagg ggttaaccct aatcttgaag atgctcagcc
     1141 tcatgactca ggaaaatttg gagaggtcat tgtgccaaag tagtgctatg tatttaaaag
     1201 gaaaagatga caaagcttaa attgaaaaat gtattcaaga aatcattaag tagttagata
     1261 tcataactag tggatatgaa aagaaaaaac tgaatgaata tatggcaaaa ctttcagatg
     1321 gagtagccgt gttgatggtt ggttggtgag acaagtgatg ttcaggtgaa agacaagaaa
     1381 gacagattta ccgatgtcct taatgctaca agagctgcta ttgaagaagg cattgttttg
     1441 ggagggggtt gtgccctgct tcgagcattc cactcattca ctccagctaa tgaagataaa
     1501 aataatggca tagaaattat taaaagaaca ctcaaaattc cagcaatgac tattgctaag
     1561 aatgcaggtg ttgatttttg atagttgaga aaattatgca aagttcctca gaagttggtt
     1621 atgatactat gttaggagat gtcgtgaata tggtggaaaa agacattatt gacccaacaa
     1681 aggttgtgag aactgcttta ttggatgctg cttgcatggc tctctattaa ctacagcagc
     1741 tgttgtagtc acaaaaattc ctaaagaagg gaacagccct ggaatgggtg caatgtgtgg
     1801 aatgggaggt ggcctgttct aattcctaga atagtgcttt acctttatta atgaattgtg
     1861 atagggagcc caaggcgtgt tcctcaccaa taacttcaga gaagtcagtt ggagaaaatg
//
LOCUS       HUMHP603A    1912 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human chaperonin (HSP60) non-functional pseudogene 3.
ACCESSION   M34661
KEYWORDS    chaperonin; pseudogene.
SOURCE      Human fetal liver DNA, clone PS5-4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 99 to 1912)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1912)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
BASE COUNT      614 a    319 c    449 g    527 t      3 others
ORIGIN      
        1 ctttatgtgc tcttanctaa attaataatt tatttanntt tttttgtata tttgctatgt
       61 cacctttaga aagtaagagg cagctctcac ttgctgctga tgacctgcct cgccatgcac
      121 atgtctgcca gcaccccaca gaaatgcttc cattacccac agtctttcac cagatgagac
      181 cagtgtccag ggtactggct cctcacctca cttgaagtga tggtaaagat gtaaaatttc
      241 ctgctgatgt ccgagcctta atgcttcaaa gtgtagactt ttttgtcgat gctgtagcca
      301 ttacaatggg gccaaaggga agaacagtaa ttattgaaca tagctgggga agtcccaaag
      361 taacaaaaga tggtgtaact gatgcaaagt caattgactt aaaggataaa tataaaagca
      421 ttggagctaa acttgtccaa gatgttgcca ataacacaga tgaagagact gggggatggc
      481 actatcactg ctgctgtact ggtatgctct atattcaaat aaggcttcca gaaggttagc
      541 aaaggtgcta atccagtgga aatcaagaga ggtgtgatgt tagctgttga tgctgtaatt
      601 gctgaactta aaaagcagtg taaacctgtg accaaacctg aaattgcaca ggttgctaca
      661 atttctgcaa atggagacaa agaaattggt aacatctctg atgcaatgaa aaagtttgga
      721 agaaagggca tcacagtaaa ggatggaaaa acactgactg atgaattaga aattattgaa
      781 ggcatgaaaa tttgatcgag gatatatttc tccatacttt attaatacat caaaaggtga
      841 gaaatgtgaa ttccaggatg cctatgttct gttgcatgaa aagaaaattt ctagtgtcca
      901 gtccattgta actgctcttg aaattgccaa tgcttactgt aagcctttgg tcataattgc
      961 tggagacatt gatggagaag ctctaactac actcatcctg aataggctaa aggttggtct
     1021 tcaggttgtg gcagtcaaag ctccagggtt tgggtacaat agaaagaacc agcttaaaga
     1081 tacggtcatt gctactggtg gtacagtgtt tggagaagag ggctgacact aaatcttgaa
     1141 gacgttcagc ctcgtgacgt aggagaagtt ggagaggtca ctgtgaacaa agattatgcc
     1201 atgctcttaa aaggaaaagg taacaagtct caaattgaaa aatgtgttca agaaatcatt
     1261 gaccagtcag atgtcacaac tagtgaatac gaaaaggaaa aagtgagtgg agaaactttc
     1321 agatggagta gctgtgctga aggttgttgg tgggacaagt gatgttgaag tgaatgaaga
     1381 gaaagacaga gttacaggtg cacttaatgc tacaagagct gctgttgaag aaggcattgt
     1441 ttagggaggg ggttgtgccc tgcttcgatg cattccagcc ttggactcat tcactccagc
     1501 taatgaagat aaataattgg tatagaaatt attaaaagaa cactcaaaat tccagcaatg
     1561 acaattgcta agaatgcagc tgttgatggc attttgatag ttgagaaaat tatgtaaagt
     1621 gcctcagaag ttggttatga tactatgtta ggagatgtcg tgaatatggt gaaaaagaca
     1681 ttattgaccc aacaaaggtt gtgagaactg cttcattgga tgctgctggc atggcctctc
     1741 tattaactac agcagctgtt gtagtcacag aaattcctaa agaagagaac agccctggaa
     1801 taggtgcaat gtgtggaatg ggaggtggcc tgttcaaatt cctagaatag tgctttacct
     1861 ttattaatga attgtgatag gaacccaagg cagtgttcct caccaataac tt
//
LOCUS       HUMHP604A    2019 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human chaperonin (HSP60) non-functional pseudogene 4.
ACCESSION   M34663
KEYWORDS    chaperonin; pseudogene.
SOURCE      Human fetal liver DNA, clone PS5-4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 123 to 2019)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 2019)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
BASE COUNT      657 a    327 c    479 g    556 t
ORIGIN      
        1 agagaaatgt aatatgaaat aaagctttat gtgctctacc taaattaata atttatttat
       61 ttttttttgt atatttgcta tgtcaccttt agaaagtaag aggcagctct cacttgctga
      121 ttgatgacct gatgatgacc tgcctcgcca tgcacatgtc ctgccagcac cccacagaaa
      181 tgcttccatt acccacagtc tttcaccaga tgagaccagt gtccaggcta ctggctactg
      241 gctcctcacc tcacttgaag tgatggtaaa gatgtaaaat ttggtgctga tgtccgagcc
      301 ttaatgcttc aaagtgtata gcttttgtcg agctgttagc cattacaaat gggccaaagg
      361 gaagaacagt agattattga acatagctgg ggaagtccca aagtaacaaa agagtggtgt
      421 gactgatgca aaggcaattg acttaaagga taaatataaa agcattggag ttaaacttgt
      481 ccaagatgtt gccaataaca cagatgaaga agactgggga tggcactatc actgctgctg
      541 tactggtatg ctctatttcc aaataaggct tccagaaggt tagcaaaggt gctaatccgt
      601 ggaaatcagg agaggtgtga tgttagctgt tgatgctgta attgctgaac ttaaaaagca
      661 gtctaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga
      721 gacaaagaaa ttggtaacat catctctgat gcaatgaaaa actttggaag aaagggcatc
      781 atcacagtaa aggatggaaa aacactgact gatgaattag aaattattga aggcatgaaa
      841 atttgatcga ggatatattt ctccatactt tattaataca tcaaaaggtg agaaatgtga
      901 attccaggat gcctatgttc tgttgcatga aaagaaaatt tctagtgtcc agtccattgt
      961 aactgctctt gaaattgcca atgcttagtg taagcctttg gtcatatatt gctggagaca
     1021 ttgatggaga agctctaact acactcatcc tgaataagct aaaggttggt cttcaggttg
     1081 tggcagtcaa agctaaagct ccagggtttg gtgacaatag aaagaaccag cttaaagata
     1141 cggttattgc tactggtggt acagtgtttg gagaagaggg ctgacactaa atcttgaaga
     1201 cgttcagcct cgtgacgtag gagaagttgg agaggtcact gtgatcaaga ttatgccatg
     1261 ctcttaaaag gaaaaggtaa caagtctcaa attgaaaaat gtgtgcaaga aatcgttgac
     1321 cagtcagatg tcacaactat gtaatacgaa aaggaaaaag tgagtgaaga aactttcaga
     1381 tggagtagct gtgctgaagg ttggtgggac aagtgatgtt gaagtgaatg aagagaaaga
     1441 cagagttata ggtgcactta atgctacaag agctgctgtt gaagaaggca ttgtttaggg
     1501 agggggttgt gccctgcttc gatgcattcc agccttggac tcattcactc cagctaatga
     1561 agataaaata attggtatag aaattattaa aagaacactc aaaattccag caatgacaat
     1621 tgctaagaat gcaggtgttg atgcattttt gatagttgag aaaattatgt aaagttcctc
     1681 agaagttggt tatgatacta tgttaggaga tgtcgtgaat atggtggaaa aagacattat
     1741 tgacccaaca aaggttgtga gaactgcttc attggatgct gctgcatggc ctctctatta
     1801 ttaactacag cagctgttgt agtcacagaa attcctaaag aagggaacag ccctggaatg
     1861 ggtggaatgg gtgcaatgat atgtggaatg ggaggtggcc tgttctaatt cctagaatag
     1921 tgctttacct ttattaatga attgtgatag gaaacccaag gcagtgttcc tcaccaataa
     1981 cttcagagaa gtcagttgga gaaaatgaag aaaaaggct
//
LOCUS       HUMHSP60A    2202 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human chaperonin (HSP60) mRNA, complete cds.
ACCESSION   M34664
KEYWORDS    chaperonin.
SOURCE      Human placenta cDNA to mRNA, and DNA, clone PGEM-10.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2202)
  AUTHORS   Venner,T.J., Singh,B. and Gupta,R.S.
  TITLE     Nucleotide sequence and novel structural features of human and
            Chinese hamster HSP60 (chaperonin) gene families
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.S.Gupta, 29-MAY-1990.
            
              Author address: R.S.Gupta
              McMaster University
              Dept of Biochemistry
              1200 Main Street West
              Hamilton
              Ontario, CANADA L8N 3Z5
              email: IN%GUPTAR.@SSCVAX.McMASTER.CA
            
FEATURES       from  to/span     description
    pept         25     1746     chaperonin (HSP60)
BASE COUNT      699 a    371 c    538 g    594 t
ORIGIN      
        1 cacgcttgcc gccgccccgc agaaatgctt cggttaccca cagtctttcg ccagatgaga
       61 ccggtgtcca gggtactggc tcctcatctc actcgggctt atgccaaaga tgtaaaattt
      121 ggtgcagatg cccgagcctt aatgcttcaa ggtgtagacc ttttagccga tgctgtggcc
      181 gttacaatgg ggccaaaggg aagaacagtg attattgagc agggttgggg aagtcccaaa
      241 gtaacaaaag atggtgtgac tgttgcaaag tcaattgact taaaagataa atacaagaac
      301 attggagcta aacttgttca agatgttgcc aataacacaa atgaagaagc tggggatggc
      361 actaccactg ctactgtact ggcacgctct atagccaagg aaggcttcga gaagattagc
      421 aaaggtgcta atccagtgga aatcaggaga ggtgtgatgt tagctgttga tgctgtaatt
      481 gctgaactta aaaagcagtc taaacctgtg accacccctg aagaaattgc acaggttgct
      541 acgatttctg caaacggaga caaagaaatt ggcaatatca tctctgatgc aatgaaaaaa
      601 gttggaagaa agggtgtcat cacagtaaag gatggaaaaa cactgaatga tgaattagaa
      661 attattgaag gcatgaagtt tgatcgaggc tatatttctc catactttat taatacatca
      721 aaaggtcaga aatgtgaatt ccaggatgcc tatgttctgt tgagtgaaaa gaaaatttct
      781 agtatccagt ccattgtacc tgctcttgaa attgccaatg ctcaccgtaa gcctttggtc
      841 ataatcgctg aagatgttga tggagaagct ctaagtacac tcgtcttgaa taggctaaag
      901 gttggtcttc aggttgtggc agtcaaggct ccagggtttg gtgacaatag aaagaaccag
      961 cttaaagata tggctattgc tactggtggt gcagtgtttg gagaagaggg attgaccctg
     1021 aatcttgaag acgttcagcc tcatgactta ggaaaagttg gagaggtcat tgtgaccaaa
     1081 gacgatgcca tgctcttaaa aggaaaaggt gacaaggctc aaattgaaaa acgtattcaa
     1141 gaaatcattg agcagttaga tgtcacaact agtgaatatg aaaaggaaaa actgaatgaa
     1201 cggcttgcaa aactttcaga tggagtggct gtgctgaagg ttggtgggac aagtgatgtt
     1261 gaagtgaatg aaaagaaaga cagagttaca gatgccctta atgctacaag agctgctgtt
     1321 gaagaaggca ttgttttggg agggggttgt gccctccttc gatgcattcc agccttggac
     1381 tcattgactc cagctaatga agatcaaaaa attggtatag aaattattaa aagaacactc
     1441 aaaattccag caatgaccat tgctaagaat gcaggtgttg aaggatcttt gatagttgag
     1501 aaaattatgc aaagttcctc agaagttggt tatgatgcta tggctggaga ttttgtgaat
     1561 atggtggaaa aaggaatcat tgacccaaca aaggttgtga gaactgcttt attggatgct
     1621 gctggtgtgg cctctctgtt aactacagca gaagttgtag tcacagaaat tcctaaagaa
     1681 gagaaggacc ctggaatggg tgcaatgggt ggaatgggag gtggtatggg aggtggcatg
     1741 ttctaactcc tagactagtg ctttaccttt attaatgaac tgtgacagga agcccaaggc
     1801 agtgttcctc accaataact tcagagaagt cagttggaga aaatgaagaa aaaggctggc
     1861 tgaaaatcac tataaccatc agttactggt ttcagttgac aaaatatata atggtttact
     1921 gctgtcattg tccatgccta cagataattt attttgtatt tttgaataaa aaacatttgt
     1981 acattcctga tactgggtac aagagccatg taccagtgta ctgctttcaa cttaaatcac
     2041 tgaggcattt ttactactat tctgttaaaa tcaggatttt agtgcttgcc accaccagat
     2101 gagaagttaa gcagcctttc tgtggagagt gagaataatt gtgtacaaag tagagaagta
     2161 tccaattatg tgacaacctt tgtgtaataa aaatttgttt aa
//
LOCUS       MUSGCANF     4049 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Mouse guanylate cyclase/atrial natriuretic factor receptor mRNA,
            complete cds.
ACCESSION   J05504
KEYWORDS    atrial natriuretic factor receptor; guanylate cyclase.
SOURCE      Mouse (strain C57B1/6J) Leydig cell line MA-10, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 4049)
  AUTHORS   Pandey,K.N. and Singh,S.
  TITLE     Molecular cloning and expression of murine guanylate cyclase/atrial
            natriuretic factor receptor cDNA
  JOURNAL   J. Biol. Chem. 265, 12342-12348 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.N.Pandey, 31-MAY-1990.
FEATURES       from  to/span     description
    pept        348     3521     guanylate cyclase/atrial natriuretic factor
                                 receptor
    signal     4029     4034     poly-A signal
BASE COUNT      879 a   1164 c   1140 g    866 t
ORIGIN      
        1 cagaaaccct cccaaactcc tatagccaca cacacctttc ccggccaaga tccaaacaaa
       61 cctctacttt cctcttccct aggagccaga ctcccttcgg gtgctgcgct cgctctcacc
      121 tgctctaaag cacctccgct ctcggacgct cccaattccg ccctcctgct cgacggcggg
      181 acagtcgcag cctcggcagg cagcttgctc tcgccgctgc ggcttcaacc cagccccctc
      241 cctcgctacg gctgggcgct cttgactccc gaccctcgcc tctgagcccg aggacggcga
      301 tcagaccatg gtgacagcgc tgctccgtcg ctgcgctcgc tgaggccatg ccgcgttccc
      361 gacgcgtccg tccgcgccta agggcgctgc tgctgctacc gccgctgctg ctgctccgaa
      421 gcggccacgc gagcgacctg accgtggctg tggtgctgcc cgtgaccaac acctcgtacc
      481 cgtggtcctg ggcgcgtgta gggccggcgg tggaactggc tctcgggagg gtgaaggctc
      541 ggccggactt gctgccgggt tggacggtcc gtatggtgct gggcagcagc gagaacgcgg
      601 cgggcgtctg ctccgacacc gctgcaccgc tggccgcggt ggatctcaag tgggagcaca
      661 gccccgccgt gttcctgggc cccggctgcg tatactctgc tgccccggtg gaccgcttca
      721 ccgcgcactg gcggttgccg ctgctgacgg ctggcgcccc ggctctgggc atcggggtga
      781 aggatgagta cgcgttaacc acccgcacag gacccagcca tgtcaagctg ggcgacttcg
      841 tgacggcgct gcatcgacgg ctgggctggg agcaccaggc gcttgtgctc tatgcagatc
      901 ggctgggcga cgaccggccg tgcttcttca tagtggaggg gctgtacatg cgggtgcgtg
      961 agcgactcaa catcacagta aatcaccagg agttcgtcga gggcgacccg gaccactaca
     1021 ccaagctact gcggaccgtg cagcgcaagg gcagagttat ctacatctgc agttctccgg
     1081 atgccttcag gaatctgatg cttttggccc tggatgctgg cctgactggg gaggactatg
     1141 ttttcttcca cctggatgtg tttgggcaaa gccttcaggg tgctcagggc cctgttccag
     1201 agaagccctg ggaaagagac gatgggcagg ataggagagc ccgccagcgc tttcaggctg
     1261 caaaaattat tacttacaaa gaacccgata atcctgagta cttggaattc ctgaagcagc
     1321 taaaactctt ggctgacaag aaattcaact tcaccatgga ggatggcctg aaaaatatca
     1381 tcccagcatc cttccatgac gggctcctgc tctatgtcca ggcagtgaca gagactctgg
     1441 cacagggggg cactgtcact gatggagaga acatcactca gcggatgtgg aaccgaagct
     1501 tccaaggtgt gacaggatac ctgaaaattg atagaaatgg agatcgggac actgattctc
     1561 ctctctggga tatggacccc gagacaggtg ccttcagggt tgtcctgaac tttaatggta
     1621 cttcccagga gctgatggct gtgtcagaac acagattata ctggcctctg ggatacccac
     1681 ctcctgacat ccctaaatgt ggctttgaca atgaggaccc agcctgcaac caagaccact
     1741 tttccacact ggaggttctg gctttggtgg gcagcctctc tctggttagc tttctgatcg
     1801 tgtctttctt catatacagg aagatgcagc tggaaaagga gctggtctca gagttgtggc
     1861 gggtgcgctg ggaggacttg cagcccagca gcctggagag gcaccttcgg agcgctggca
     1921 gtcggctgac cctgagtggg cgaggctcca attatggctc cctgctaacc acggagggcc
     1981 agttccaagt ctttgccaag acagcatact ataagggcaa cctcgtggct gtgaaacgtg
     2041 tgaaccggaa acgcattgag ttgacacgaa aagtcctgtt tgaacttaaa catatgcggg
     2101 atgtgcagaa tgagcaattg accagatttg tgggagcttg taccgaccct cccaacatct
     2161 gtatcctcac agagtactgt ccccgtggaa gcctacagga cattctagag aatgagagta
     2221 ttaccctgga ctggatgttt cggtactcac tcaccaatga cattgtcaag ggaatgctct
     2281 ttctacacaa cggggccatt tgttcccatg ggaacctcaa gtcatccaac tgcgtggtag
     2341 atggacgttt tgtgttaaag atcacagact atgggctcga gagcttcaga gacccggagc
     2401 cagagcaagg acacaccctc tttgccaaaa aactgtggac tgcacctgag ctcctgcgaa
     2461 tggcttcccc acctgcccgt ggctcccaag ctggggatgt ctacagtttt ggtatcatcc
     2521 ttcaggaaat tgccctaaga agtggggtct tctatgtgga aggtttggac ctcagcccaa
     2581 aagagatcat tgagcgtgtg actcggggtg agcagccccc attccgacct tccatggatc
     2641 tgcagagcca cctggaggaa ctggggcagc tgatgcagag gtgctgggca gaggatcctc
     2701 aggagcggcc accctttcaa cagatccgcc tggcgctgcg caagttcaac aaggagaaca
     2761 gcagcaacat cctggacaac ctgctgtcac gcatggaaca gtacgccaac aacctggagg
     2821 aactggtaga ggagagaaca cagccttatc tggaggagaa gcgcaaagct gaggccctgc
     2881 tttaccagat tctgcctcac tctgtggctg agcagctgaa gagaggcgag acagtccagg
     2941 ctgaggcatt tgatagtgtt actatctatt tcagtgatat cgtgggcttt acagctcttt
     3001 cagcagagag cacacccatg caggtggtca ccctgctcaa tgatctgtac acctgttttg
     3061 atgctgtcat agacaacttt gatgtgtaca aggtagagac cattggtgat gcttacatgg
     3121 tggtatcagg gctcccagtg aggaatggac agctccatgc ccgagaggta gcccgaatgg
     3181 cacttgcact gctcgatgct gtacgctcct tccgcatcgg ccataggccc caggaacagc
     3241 tgcgcttgcg cattggaatt cacacaggtc ctgtgtgtgc tggtgtggta gggctaaaga
     3301 tgccccgata ctgcctcttt ggagacacag tcaacacagc ttcaagaatg gagtctaatg
     3361 gggaagccct caggatccac ttgtcttcgg agaccaaggc tgtgctggaa gagttcgatg
     3421 gtttcgagct ggagctccga ggggatgtgg aaatgaaggg caaaggcaag gttcgttcct
     3481 attggctcct cggggaccgg ggatgcagct ctcgagcctg acctactgcc ctgctattcc
     3541 ttgtcacctc ccctccctat cccagcaatg acacgggtct ccaacttccc cctctcccac
     3601 agcagctcag ccactgtgga aagattaggg acctaaccag cgcagtcatc agatgtgacc
     3661 tctgagagag gatggagatg gtggggactg gagggggact cctaagttta tagggctgac
     3721 tgaaataccc agtcactccc gtagcacatg ccccgccccc cccccgcccc cccactcagc
     3781 tgcctagcag acagtgattc cttctgccgc cctcaactta gctccactgt gagttagagg
     3841 gagggaaatt gccacctgaa ggaaagagaa aagagattct cggggtttgc aggaggcagg
     3901 cagtcctgtg tcacaaatac tcccctcact cccagtccac cacctgcccc accgacttcc
     3961 cttcccacac agtgcactga ggagaagaga ggcatggggt tgccttgctt ctcctatgag
     4021 caaaacccat taaagtcttt attcctgtg
//
LOCUS       MUSMHH2DR    1415 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  Mouse MHC class I H-2Dr protein mRNA, complete cds.
ACCESSION   M34961
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Mouse (strain B10.RIII) spleen, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1415)
  AUTHORS   Zeling,C. and Pease,L.R.
  TITLE     Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Zeling, 29-MAY-1990.
            
              Author address: C.Zeling
              Mayo Clinic
              323 Guggenheim
              200 1st St. SW
              Rochester, MN 55905
              email: Pease@mayo.edu
            
FEATURES       from  to/span     description
    pept          1     1089     MHC H-2Dr protein precursor
    sigp          1       72     MHC H-2Dr protein signal peptide
    matp         73     1086     MHC H-2Dr protein
BASE COUNT      298 a    398 c    442 g    277 t
ORIGIN      
        1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact
       61 cagacccgcg cgggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc
      121 ctcggggagc cccggtacat agaagtcggc tacgtggacg acacggagtt cgtgcgcttc
      181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag
      241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagat tttccgagtg
      301 aacctgagga ccctgctcag ctactacaac cagagcaagg gcggctctca cacgatccag
      361 tggatgttcg gctgtgacgt ggagtcggac gggcgcctcc tccgcgggta ccagcagtac
      421 gcatacgacg gctgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg
      481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg ctggtgctgc agagagaaag
      541 agggcctacc tagagggcac gtgcgtggag tggctcctca gacacctgaa gaacgggaac
      601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaaa
      661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc
      721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca
      781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctctcgggaa ggagcagaat
      841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct
      901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg
      961 gccattattg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa
     1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt
     1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc
     1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc
     1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc
     1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga
     1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag
     1381 aataagaatt tgaatgtggg tggctggaga gatgg
//
LOCUS       MUSMHH2DS    1415 bp ss-mRNA            ROD       08-AUG-1990
DEFINITION  M.musculus MHC class I H-2Ds protein mRNA, complete cds.
ACCESSION   M34962
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      M.musculus (strain B10.RIII) spleen, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1415)
  AUTHORS   Zeling,C. and Pease,L.R.
  TITLE     Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Zeling, 29-MAY-1990.
            
              Author address: C.Zeling
              Mayo Clinic
              323 Guggenheim
              200 1st St. SW
              Rochester, MN 55905
              email: Pease@mayo.edu
            
FEATURES       from  to/span     description
    pept          1     1089     MHC H-2Dr protein precursor
    sigp          1       72     MHC H-2Dr protein signal peptide
    matp         73     1086     MHC H-2Dr protein
BASE COUNT      292 a    401 c    444 g    278 t
ORIGIN      
        1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact
       61 cagacccgcg ctggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc
      121 ctcggggagc cccggtacat agaagtcggc tacgtggaca acacggagtt cgtgcgcttc
      181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag
      241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagag tttccgagtg
      301 aacctgagga ccgcgctccg ctactacaac cagagcaagg gcggctctca cacggtccag
      361 gtgatgtctg gctgtgacgt ggggtcggac gggcgcctcc tccgcgggta ccagcagcac
      421 gcatacgacg gccgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg
      481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg gtggtgatgc agagcattac
      541 aaggcctacc tagagggcac gtgcgtggag tggctcctca gatacctgga gctcgggaac
      601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaga
      661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc
      721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca
      781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctcttgggaa ggagcagaat
      841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct
      901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg
      961 gccattactg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa
     1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt
     1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc
     1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc
     1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc
     1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga
     1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag
     1381 aataagaatt tgaatgtggg tggctggaga gatgg
//
LOCUS       CREAPCYN      577 bp ss-mRNA            PLN       08-AUG-1990
DEFINITION  C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds.
ACCESSION   J05524
KEYWORDS    apoplastocyanin.
SOURCE      C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone
            PC6-2.
  ORGANISM  Chlamydomonas reinhardtii
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; 
            Volvocales; Chlamydomonadaceae.
REFERENCE   1  (bases 1 to 577)
  AUTHORS   Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and
            Bogorad,L.
  TITLE     Isolation and characterization of a complementary DNA clone for an
            algal pre-apoplastocyanin
  JOURNAL   J. Biol. Chem. 265, 12372-12379 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Hill, 11-MAY-1990.
FEATURES       from  to/span     description
    pept         22      459     apoplastocyanin (PC6-2) precursor
    sigp         22      162     apoplastocyanin signal peptide
    matp        163      456     apoplastocyanin
BASE COUNT       91 a    189 c    171 g    126 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc
       61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt
      121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg
      181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag
      241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc
      301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc
      361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac
      421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat
      481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac
      541 gcagagagcg tgtagctctg taactttttc ggaattc
//
LOCUS       HUMTNC1      1085 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human fast skeletal muscle troponin C gene, exon 1.
ACCESSION   M33771 J05502
KEYWORDS    troponin C.
SEGMENT     1 of 2
SOURCE      Human HeLa cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1085)
  AUTHORS   Gahlmann,R. and Kedes,L.
  TITLE     Cloning, structural analysis, and expression of the human
            fast-twitch skeletal muscle troponin C gene
  JOURNAL   J. Biol. Chem. 265, 12520-12528 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Gahlmann, 20-APR-1990.
FEATURES       from  to/span     description
    pept       1033  +  1035     troponin C, exon 1
    pre-msg     968  >  1085     troponin C mRNA and introns (alt.)
    pre-msg     969  >  1085     troponin C mRNA and introns (alt.)
    IVS        1036  >  1085     troponin C intron A
BASE COUNT      266 a    276 c    306 g    237 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct
       61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc
      121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa
      181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa
      241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc
      301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac
      361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc
      421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca
      481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga
      541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt
      601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc
      661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga
      721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc
      781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt
      841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt
      901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag
      961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg
     1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg
     1081 tggac
//
LOCUS       HUMTNC2      1773 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human fast skeletal muscle troponin C gene, exons 2-6.
ACCESSION   M33772 J05502
KEYWORDS    troponin C.
SEGMENT     2 of 2
SOURCE      Human HeLa cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1773)
  AUTHORS   Gahlmann,R. and Kedes,L.
  TITLE     Cloning, structural analysis, and expression of the human
            fast-twitch skeletal muscle troponin C gene
  JOURNAL   J. Biol. Chem. 265, 12520-12528 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Gahlmann, 20-APR-1990.
FEATURES       from  to/span     description
    pept    +    51      102     troponin C, exon 2
                247      390     troponin C, exon 3
                479      593     troponin C, exon 4
                758      894     troponin C, exon 5
               1506     1537     troponin C, exon 6
    pre-msg <     1     1668     troponin C mRNA and introns
    IVS     <     1       50     troponin C intron A
    IVS         103      246     troponin C intron B
    IVS         391      478     troponin C intron C
    IVS         594      757     troponin C intron D
    IVS         895     1505     troponin C intron E
BASE COUNT      358 a    455 c    628 g    331 t      1 others
ORIGIN      About 2.5 kb after segment 1.
        1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc
       61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag
      121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg
      181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc
      241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg
      301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg
      361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat
      421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc
      481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac
      541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct
      601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag
      661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc
      721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc
      781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg
      841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg
      901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct
      961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc
     1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta
     1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat
     1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac
     1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc
     1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa
     1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa
     1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg
     1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc
     1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca
     1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg
     1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc
     1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc
     1741 ggctggcacc acccggagca cccttgggga tcc
//
LOCUS       ENTLACBOHI   1164 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  Plasmid PDS075 (from E.cloacae) beta-lactamase (Ohio-1) gene,
            complete cds.
ACCESSION   M33655
KEYWORDS    beta-lactamase.
SOURCE      Plasmid PDS075 (from Enterobacter cloacae) DNA.
  ORGANISM  Plasmid PDS075
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1164)
  AUTHORS   Shlaes,D.M., Currie-McCumber,C., Hull,A., Behlau,I. and Kron,M.
  TITLE     The Ohio-1 beta-lactamase is part of the SHV-1 family
  JOURNAL   Antimicrob. Agents Chemother. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.M.Shlaes, 10-APR-1990.
FEATURES       from  to/span     description
    pept        143     1003     Ohio-1 beta-lactamase
    signal       56       62     -35 region
    signal       80       85     -10 region
    binding     126      131     ribosome binding site
BASE COUNT      233 a    345 c    357 g    229 t
ORIGIN      
        1 cgagtaatac tcacatggca ctgacgcaaa atcatcgatt tattcaacaa agccattgtg
       61 aatcagcaaa acgccgggtt attcttattt gtcgcttctt tactcgcctt tatcggccct
      121 cactcaagga tctaccgtgg ttatgcgtta ttttcgcctg tgtattatct ccctgttagc
      181 caccctgccg ctgcgggtac acgccggacc gcagccgctt gagcaaatta aactaagcga
      241 aagccagctg tcgggcagcg taggcatgat agaaatggat ctggccaggc ccggcacgct
      301 gaccgcctgg cgcgccgatg aacgctttcc catgatgagc acctttaaag tagtgctctg
      361 cggcgcaggt ctggcgcggg tggatgccgg tgacgaacag ctggagcgaa agatccacta
      421 tcgccgacag gatctggtgg actactcgcc ggtcagcgaa aaacaccttg ccgacggcat
      481 gacggtcggc gaactctgtg ccgccgccat taccatgagc gataacagcg ccgccaatct
      541 gctgctgcca gccgtcggcg gccccgcagg attgactgcc tttttgcgcc agatcggcga
      601 caacgtcacc cgccttgacc gctgggaaac ggaactgaat gaggcgcttc ccggcgacgc
      661 ccgcgacacc actaccgccc gcagcatggc cgcgaccctg cgcaagctgc tgaccagcca
      721 gcgtctgagc gcccgttcgc aacggcagct gctgcagtgg atggtggacg atcgggtcgc
      781 cggacgtttg atccgctccg tgctgccggc gggctggttt atcgccgata agaccggagc
      841 tggcgaacgg ggtgcgcgcg ggattgtcgc cctgcttggc ccgaataaca aagcagagcg
      901 gattgtggtg atttatctgc gggatacgcc ggcgagcatg gccgagcgaa atcagcaaat
      961 cgccgggatc gccggggcgc tgatcgagca ctggcaacgc taacccggcg gtgggcgcgc
     1021 gcgttatccg gctcgtagca cctcgcaggc gtgccggcga tatgactggc ggcggcatcg
     1081 gagagatggc ttctggctaa tgatggtcgg tgcaacctgg gtgaaaggta acgccataaa
     1141 cggtggccac ctgattgtgt ttct
//
LOCUS       HUMTRPY1B    2609 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human tryptase-I gene, complete cds.
ACCESSION   M33494
KEYWORDS    serine protease; tryptase-I.
SOURCE      Human adult skin DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2609)
  AUTHORS   Vanderslice,P., Ballinger,S., Tam,E.K., Goldstein,S.M.,
            Craik,C.S. and Caughey,G.H.
  TITLE     Human mast cell tryptase: Multiple cDNAs and genes reveal a
            multigene serine protease family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Vanderslice 02-APR-1990.
FEATURES       from  to/span     description
    pept        458      518     tryptase-I, exon 1
                673      844     tryptase-I, exon 2
                954     1219     tryptase-I, exon 3
               1346     1509     tryptase-I, exon 4
               1595     1759     tryptase-I, exon 5
    pre-msg     225     2072     tryptase-I mRNA
    IVS         248      457     tryptase-I, intron A
    IVS         519      672     tryptase-I, intron B
    IVS         845      953     tryptase-I, intron C
    IVS        1220     1345     tryptase-I, intron D
    IVS        1510     1594     tryptase-I, intron E
    signal      131      135     pot. CAAT box
    signal      197      201     pot.TATA box
BASE COUNT      422 a    941 c    776 g    470 t
ORIGIN      
        1 accagctgac aggtggagct gccagtctcc agtgctcagc cctcagcggg gcctgcctgg
       61 cagccccaca cacagagggc atcggggtgg cgggggcacg tgttacacgg gggccctggg
      121 tctgagtcat ccacttcctc cgagtctgga tgggaggacc cagcgcccct cctccgcccc
      181 ctcctgatct ggaaggataa atggggaggg gagagccact gggtagaagg aacagggagt
      241 ggccagggta agtccccact ctcagagacc ctgacatcag cgtcacctgg agcagagtgg
      301 cccagcctca gactcagagc accaagaccc aggcccgcag gcctggaccc accccggtcc
      361 ccccgtccca gctccattct tcaccccaca atctgtagcc cccagccctg ccctgtgagg
      421 cccggccagg cccacgatgc tcctccttgc tccccagatg ctgaatctgc tgctgctggc
      481 gctgcccgtc ctggcgagcc gcgcctacgc ggcccctggt gagtcccagc cggggtccac
      541 cctgcccctc accacattcc acaggtcagg gcctgggtgg gttctgggga ggtcgggctg
      601 gccccccaca cagggaaggg ctgggcccag gcctggggct gcttcctggt cctgacctgg
      661 cacctgcccc agccccaggc caggccctgc agcgagtggg catcgtcggg ggtcaggagg
      721 cccccaggag caagtggccc tggcaggtga gcctgagagt ccacggccca tactggatgc
      781 acttctgcgg gggctccctc atccaccccc agtgggtgct gaccgcagcg cactgcgtgg
      841 gaccgtgagt ctcccggggc ctggaggggt ggggaagggc tggatgtgag ccctggctcc
      901 cgggtgctcc tgggggctgc ccagggccct gagtgggatc ctccgctgcc cagggacgtc
      961 aaggatctgg ccgccctcag ggtgcaactg cgggagcagc acctctacta ccaggaccag
     1021 ctgctgccgg tcagcaggat catcgtgcac ccacagttct acaccgccca gatcggagcg
     1081 gacatcgccc tgctggagct ggaggagccg gtgaacgtct ccagccacgt ccacacggtc
     1141 accctgcccc ctgcctcaga gaccttcccc ccggggatgc cgtgctgggt cactggctgg
     1201 ggcgatgtgg acaatgatgg tgggtctggg gacagtggag gtggggccag ggtcttagcc
     1261 acagcccagc ccctgggtcc ctctgggctc caggtggggg ttgcccggcc ccctcctgag
     1321 gctgcaccct cttccccacc tgcagagcgc ctcccaccgc catttcctct gaagcaggtg
     1381 aaggtcccca taatggaaaa ccacatttgt gacgcaaaat accaccttgg cgcctacacg
     1441 ggagacgacg tccgcatcgt ccgtgacgac atgctgtgtg ccgggaacac ccggagggac
     1501 tcatgccagg tgggccccgc ctgtcccccg ccccccgccc cccaaccccc actcccaggc
     1561 ctgttcggcg agcgctgacc tctgaccttc ccagggcgac tccggagggc ccctggtgtg
     1621 caaggtgaat ggcacctggc tgcaggcggg cgtggtcagc tggggcgagg gctgtgccca
     1681 gcccaaccgg cctggcatct acacccgtgt cacctactac ttggactgga tccaccacta
     1741 tgtccccaaa aagccgtgag tcaggcctgg gttggccacc tgggtcactg gaggaccaac
     1801 ccctgctgtc caaaacacca ctgcttccta cccaggtggc gactgccccc cacaccttcc
     1861 ctgccccgtc ctgagtgccc cttcctgtcc taagccccct gctctcttct gagccccttc
     1921 ccctgtcctg aggacccttc cctatcctga gcccccttcc ctgtcctaag cctgacgcct
     1981 gcaccgggcc ctccagccct cccctgccca gatagctggt ggtgggcgct aatcctcctg
     2041 agtgctggac ctcattaaag tgcatggaaa tcactggtgt gcatcgctgt gtttctggtt
     2101 gtggatgtca ctgggagaga aggggtccag gtgtgctgag gacacctgcc acagtgtgag
     2161 gtcctagccc tcaaggcaca gccagtcacc gtgggaccac tggaggacca acccctgctg
     2221 tccaaaacac cactgcttcc tacccaggtg gcgactgccc cccacacctt ccctgccccg
     2281 tcctgagtgc cccttcctgt cctaagcccc ctgctctctt ctgagcccct tcccctgtcc
     2341 tgaggaccct tccctatcct gagccccctt ccctgtccta agcctgacgc ctgcaccggg
     2401 ccctccagcc ctcccctgcc cagatagctg gtggtgggcg ctaatcctcc tgagtgctgg
     2461 acctcattaa agtgcatgga aatcactggt gtgcatcgct gtgtttctgg ttgtggatgt
     2521 cactgggaga gaaggggtcc aggtgtgctg aggacacctg ccacagtgtg aggtcctagc
     2581 cctcaaggca cagccagtca ccgtgggac
//
LOCUS       HUMTRY1A     1137 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human tryptase-I mRNA, 3' end.
ACCESSION   M33491
KEYWORDS    serine protease; tryptase-I.
SOURCE      Human adult skin, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1137)
  AUTHORS   Vanderslice,P., Ballinger,S.M., Tam,E.K., Goldstein,S.M.,
            Craik,C.S. and Caughey,G.H.
  TITLE     Human mast cell tryptase: Multiple cDNAs and genes reveal a
            multigene serine protease family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3811-3815 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Vanderslice, 02-APR-1990.
FEATURES       from  to/span     description
    pept    <     1      824     tryptase-I (AA at 3)
    signal     1119     1124     poly-A signal
BASE COUNT      196 a    410 c    328 g    203 t
ORIGIN      
        1 tgaatctgct gctgctggcg ctgcccgtcc tggcgagccg cgcctacgcg gcccctgccc
       61 caggccaggc cctgcagcga gtgggcatcg tcgggggtca ggaggccccc aggagcaagt
      121 ggccctggca ggtgagcctg agagtccacg gcccatactg gatgcacttc tgcgggggct
      181 ccctcatcca cccccagtgg gtgctgaccg cagcgcactg cgtgggaccg gacgtcaagg
      241 atctggccgc cctcagggtg caactgcggg agcagcacct ctactaccag gaccagctgc
      301 tgccggtcag caggatcatc gtgcacccac agttctacac cgcccagatc ggagcggaca
      361 tcgccctgct ggagctggag gagccggtga acgtctccag ccacgtccac acggtcaccc
      421 tgccccctgc ctcagagacc ttccccccgg ggatgccgtg ctgggtcact ggctggggcg
      481 atgtggacaa tgatgagcgc ctcccaccgc catttcctct gaagcaggtg aaggtcccca
      541 taatggaaaa ccacatttgt gacgcaaaat accaccttgg cgcctacacg ggagacgacg
      601 tccgcatcgt ccgtgacgac atgctgtgtg ccgggaacac ccggagggac tcatgccagg
      661 gcgactccgg agggcccctg gtgtgcaagg tgaatggcac ctggctgcag gcgggcgtgg
      721 tcagctgggg cgagggctgt gcccagccca accggcctgg catctacacc cgtgtcacct
      781 actacttgga ctggatccac cactatgtcc ccaaaaagcc gtgagtcagg cctgggttgg
      841 ccacctgggt cactggagga ccaacccctg ctgtccaaaa caccactgct tcctacccag
      901 gtggcgactg ccccccacac cttccctgcc ccgtcctgag tgccccttcc tgtcctaagc
      961 cccctgctct cttctgagcc ccttcccctg tcctgaggac ccttccctat cctgagcccc
     1021 cttccctgtc ctaagcctga cgcctgcacc gggccctcca gccctcccct gcccagatag
     1081 ctggtggtgg gcgctaatcc tcctgagtgc tggacctcat taaagtgcat ggaaatc
//
LOCUS       HUMTRY2A     1128 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human tryptase-II mRNA, 3' end.
ACCESSION   M33492
KEYWORDS    serine protease; tryptase-II.
SOURCE      Human adult skin, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1128)
  AUTHORS   Vanderslice,P., Ballinger,S., Tam,E.K., Goldstein,S.M.,
            Craik,C.S. and Caughey,G.H.
  TITLE     Human mast cell tryptase: Multiple cDNAs and genes reveal a
            multigene serine protease family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Vanderslice, 02-APR-1990.
FEATURES       from  to/span     description
    pept    <     1      826     tryptase-II (AA at 2)
    signal     1121     1126     poly-A signal
BASE COUNT      190 a    409 c    329 g    200 t
ORIGIN      
        1 gctgaatctg ctgctgctgg cgctgcccgt cctggcgagc cgcgcctacg cggcccctgc
       61 cccaggccag gccctgcagc gagtgggcat cgttgggggt caggaggccc ccaggagcaa
      121 gtggccctgg caggtgagcc tgagagtcca cggcccatac tggatgcact tctgcggggg
      181 ctccctcatc cacccccagt gggtgctgac cgcagcgcac tgcgtgggac cggacgtcaa
      241 ggatctggcc gccctcaggg tgcaactgcg ggagcagcac ctctactacc aggaccagct
      301 gctgccggtc agcaggatca tcgtgcaccc acagttctac accgcccaga tcggagcgga
      361 catcgccctg ctggagctgg aggagccggt gaaggtctcc agccacgtcc acacggtcac
      421 cctgccccct gcctcagaga ccttcccccc ggggatgccg tgctgggtca ctggctgggg
      481 cgatgtggac aatgatgagc gcctcccacc gccatttcct ctgaagcagg tgaaggtccc
      541 cataatggaa aaccacattt gtgacgcaaa ataccacctt ggcgcctaca cgggagacga
      601 cgtccgcatc gtccgtgacg acatgctgtg tgccgggaac acccggaggg actcatgcca
      661 gggcgactcc ggagggcccc tggtgtgcaa ggtgaatggc acctggctgc aggcgggcgt
      721 ggtcagctgg ggcgagggct gtgcccagcc caaccggcct ggcatctaca cccgtgtcac
      781 ctactacttg gactggatcc accactatgt ccccaaaaag ccgtgagtca ggcctgggtt
      841 ggccacctgg gtcactggag gaccaacccc tgctgtccaa aacaccactg cttcctaccc
      901 aggtggcgac tgccccccac accttccctg ccccgtcctg agtgcccctt cctgtcctaa
      961 gccccctgct ctcttctgag ccccttcccc tgtcctgagg acccttcccc atcctgagcc
     1021 cccttccctg tcctaagcct gacgcctgca ccgggccctc cggccctccc ctgcccaggc
     1081 agctggtggt gggcgctaat cctcctgagt gctggacctc attaaagt
//
LOCUS       HUMTRY3A     1081 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human tryptase-III mRNA, 3' end.
ACCESSION   M33493
KEYWORDS    serine protease; tryptase-III.
SOURCE      Human adult skin, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1081)
  AUTHORS   Vanderslice,P., Ballinger,S., Tam,E.K., Goldstein,S.M.,
            Craik,C.S. and Caughey,G.H.
  TITLE     Human mast cell tryptase: Multiple cDNAs and genes reveal a
            multigene serine protease family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Vanderslice, 02-APR-1990.
FEATURES       from  to/span     description
    pept    <     1      805     tryptase-III (AA at 2)
BASE COUNT      181 a    398 c    317 g    185 t
ORIGIN      
        1 gctgcccgtc ctggcgagcc gcgcctacgc ggcccctgcc ccaggccagg ccctgcagcg
       61 agtgggcatc gttgggggtc aggaggcccc caggagcaag tggccctggc aggtgagcct
      121 gagagtccgc gaccgatact ggatgcactt ctgcgggggc tccctcatcc acccccagtg
      181 ggtgctgacc gcagcgcact gcgtgggacc ggacgtcaag gatctggccg ccctcagggt
      241 gcaactgcgg gagcagcacc tctactacca ggaccagctg ctgccggtca gcaggatcat
      301 cgtgcaccca cagttctaca ccgcccagat cggagcggac atcgccctgc tggagctgga
      361 ggagccggtg aaggtctcca gccacgtcca cacggtcacc ctgccccctg cctcagagac
      421 cttccccccg gggatgccgt gctgggtcac tggctggggc gatgtggaca atgatgagcg
      481 cctcccaccg ccatttcctc tgaagcaggt gaaggtcccc ataatggaaa accacatttg
      541 tgacgcaaaa taccaccttg gcgcctacac gggagacgac gtccgcatcg tccgtgacga
      601 catgctgtgt gccgggaaca cccggaggga ctcatgccag ggcgactccg gagggcccct
      661 ggtgtgcaag gtgaatggca cctggctgca ggcgggcgtg gtcagctggg gcgagggctg
      721 tgcccagccc aaccggcctg gcatctacac ccgtgtcacc tactacttgg actggatcca
      781 ccactatgtc cccaaaaagc cgtgagtcag gcctggggtg tccacctggg tcactggagg
      841 accagcccct cctgtccaaa acaccactgc ttcctaccca ggcggcgact gccccccaca
      901 ccttccctgc cccgtcctga gtgccccttc ctgtcctaag ccccctgctc tcttctgagc
      961 cccttcccct gtcctgagga cccttcccca tcctgagccc ccttccctgt cctaagcctg
     1021 acgcctgcac cgggccctcc ggccctcccc tgcccaggca gctggtggtg ggcgctaatc
     1081 c
//
LOCUS       YSCAPN1      1815 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  S.cerevisiae apurinic endonuclease (APN1) gene, complete cds.
ACCESSION   M33667
KEYWORDS    3'-repair diesterase; apurinic endonuclease.
SOURCE      S.cerevisiae (strain EJ2169) DNA, clone pSCP8.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1815)
  AUTHORS   Popoff,S.C., Spira,A.I., Johnson,A.W. and Demple,B.
  TITLE     Yeast structural gene (APN1) for the major apurinic endonuclease:
            Homology to Escherichia coli endonuclease 4
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 4193, 4197 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.C.Popoff, 10-APR-1990.
FEATURES       from  to/span     description
    pept        340     1443     apurinic endonuclease (APN1)
    signal      265      269     TATA box
BASE COUNT      607 a    287 c    372 g    549 t
ORIGIN      Chromosome 11.
        1 cccgagcaca agaaaataaa aaattgaaca aaaataagaa taaagtcaca aagggaagaa
       61 gatgaggggg ttctttttcc agactttctt cttttggtcc ttcaccttgg atatatgaac
      121 atagtagtat tattgaacac gaactttttt catctaatac ttctcttttt ttgccagttt
      181 ttctgggaac ttggaaactg gaatttttca ttcttacctg gtttcttttt cttcatttat
      241 ttacatgttg gaaatcattg taaatataag atgacaaact ccgaataaga aacacaaaac
      301 gcaacattaa taagcttttg gcatatcgga accatcgtaa tgccttcgac acctagcttt
      361 gttagatctg ctgtctcgaa atacaaattt ggtgcgcaca tgtcaggtgc cggtggaatt
      421 tctaatagtg taactaatgc atttaacact ggctgtaatt cgtttgccat gtttttaaaa
      481 tctccaagaa agtgggtttc tccgcagtat acacaggagg aaatagataa atttaagaaa
      541 aactgtgcaa cttacaatta taacccattg acggacgtct taccgcatgg ccaatacttt
      601 attaatttgg ctaatccgga tagggaaaag gcagaaaaga gttatgagtc attcatggac
      661 gatctgaaca ggtgtgagca attagggata ggcctgtata acttacatcc aggttctact
      721 ttgaaaggag accatcagtt gcagttaaaa caactagcct catatttgaa caaagcaatt
      781 aaggaaacga aatttgttaa aattgtatta gaaaatatgg ctggtactgg aaatttagta
      841 ggaagttctc tggtagattt gaaggaggtt attggaatga ttgaagataa atcaagaatc
      901 ggcgtttgca tagatacatg ccatacattt gcagcaggct acgatatcag caccactgaa
      961 acgtttaaca atttttggaa agagttcaat gatgtaatag ggttcaaata tctaagtgct
     1021 gttcacttaa atgattccaa ggctccttta ggaagcaata gagatttgca tgaacgcttg
     1081 ggtcaaggtt atttgggtat agatgtgttt agaatgatcg cgcactctga atacctgcag
     1141 ggtatcccta ttgtcttgga aactccatac gaaaatgatg aaggctatgg taatgaaatc
     1201 aaacttatgg agtggttgga atcaaagagt gagagcgaat tgttagaaga caaggagtat
     1261 aaagaaaaaa atgatacgct gcagaaatta ggtgctaaat cgcgtaagga acagttagac
     1321 aagtttgagg ttaaacaaaa gaagcgagct gggggcacca agaggaagaa agccactgca
     1381 gagccaagcg ataatgatat cttgtcacaa atgacaaaga agaggaagac taagaaagaa
     1441 taatgagaag cgagaagaat tttaaatacg taatcaattt ttgtagatta tctcaacgta
     1501 cgtaactaaa tgctaccctt cgtcatccaa gatgctgttg cccatttcat agaaactact
     1561 tacttggttt atagatgttt gcgttgcgtt acagaataag gcagcctttt ttttttttag
     1621 attttcattt ttcacatact gatggattag cagattataa aagggtaaag ttaaaatcat
     1681 tccaaacttg aactaaggaa ggcgctagac catcctctta tggaacgtta ttcctaagca
     1741 tagaaagttt tttacttgta atcttgaacg cggggtgtgt tctttatctt ttttttcttt
     1801 tttacacaat tagac
//
LOCUS       YSCCKA2      2393 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  S.cerevisiae casein kinase-II alpha' subunit gene, complete cds.
ACCESSION   M33759
KEYWORDS    casein kinase-II.
SOURCE      S.cerevisiae (strain S288C) DNA, (library of M.Snyder), clone
            ScA2-1.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2393)
  AUTHORS   Padmanabha,R., Chen-Wu,J.L.-P., Hanna,D.E. and Glover,C.V.C.
  TITLE     Isolation, sequencing, and disruption of the yeast CKA2 gene:
            Casein kinase II is essential for viability in Saccharomyces
            cerevisiae
  JOURNAL   Mol. Cell. Biol. 10, 4089-4099 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.V.C.Glover, 19-APR-1990.
FEATURES       from  to/span     description
    pept        520     1539     casein kinase-2
    signal      337      342     TATA box
BASE COUNT      771 a    470 c    455 g    697 t
ORIGIN      
        1 agatctacat actgaatgaa ttgtcccctc aactctgaag ttgatttact tgctgtaggt
       61 cgattgctcc aaagttggat atccctaatg accaaacttc cctgctttcc gctaagagta
      121 aaatttagtg tgcataattt ccgtgacatc aaaagtcagg cgaattggtg aatgctgtcc
      181 cctttcaagg cagtctatca tttcttagtt acctgccaac aggcctttat caggcctcaa
      241 agaaaaaaat ttccttgcgt agcccgcttt tcgcccgaac ctgacaaagc aaacagacga
      301 tcgaaaaatt tccaacgatc ttatctaggt ggaagttata aaaagaagaa taaagcatcg
      361 tcaacctttt taacctgaat tgggaaacaa cgggcacaca catttgttgg agtttgtatg
      421 attgctaaaa aataaaaagt aattaaaaaa acatcataga gttgtaaggt ctcagggaaa
      481 tagaaggaac aataaaccta aaagaataga agaaacagaa tgccattacc tccgtcaaca
      541 ttgaaccaga aatctaatag agtctactct gtagctaggg tgtacaagaa tgcctgcgag
      601 gagagaccac aagaatactg ggactacgaa caaggggtga ccatcgattg gggaaagatt
      661 tccaattacg aaattatcaa caaaattgga agagggaaat attccgaagt gttcagcggt
      721 agatgtattg taaacaacca gaagtgtgtt attaaagttt taaaaccagt taaaatgaaa
      781 aaaatttata gagagttgaa aattctgacc aatctaacag gcggccccaa tgttgttggc
      841 ctttatgata tagtacaaga cgctgactcc aaaatacctg ctttgatctt tgaggaaatc
      901 aaaaatgttg atttcagaac tttatatcct acattcaaac ttcctgacat ccagtattat
      961 ttcacgcaat tattgattgc gttagactac tgtcactcca tgggcataat gcacagagac
     1021 gtaaagcctc agaatgtcat gattgatcct acggaacgta aactaaggct gatcgattgg
     1081 ggcctggcgg agttctacca tccaggtgta gattacaacg ttcgtgtcgc ttcgcgttac
     1141 cacaagggac cagaactttt agtaaacttg aaccaatatg actactccct agacttatgg
     1201 tcagtaggat gcatgctagc agctattgtc ttcaaaaaag aacctttttt caaagggtcg
     1261 tctaatccag atcaactggt aaagattgcc acagtactag gaaccaagga actgttaggc
     1321 tatttgggta agtacgggtt gcacttacca tctgaatacg acaacattat gagagacttt
     1381 acaaaaaaat cgtggacaca ctttataacc tccgagacca aattagctgt tcctgaagtg
     1441 gttgatttaa tcgacaattt attaaggtat gaccatcaag aaagattaac agcaaaggag
     1501 gctatggatc ataagttttt caaaacgaag tttgaataat tggacaacaa tactcttagc
     1561 aaggcaattc tttttccacc actgatttcc atttataaaa agcaggggtt ttcctttttt
     1621 gtagcttgat attagagaat atccttaatg tgaatattaa attgcaaact ctatttaaca
     1681 acgaatgtag attattatat gaccagaaaa tcactttata tagatataag agccttccgt
     1741 aatgctatgc aagttttgat gaattattat attgtcgtga ggcgtgttcc gaggagtttc
     1801 gtcgccattg agatagagat gctcgtaatc gtacgtgtct ggaagtgcct gccgacggta
     1861 gcgatgtcta ctctcgtccc agacaatgga atagttttga aattgttcta catcatttat
     1921 tagaacgcac gattcacgaa atctcccacg tttgtcaata tctcgcctac gaacagtgtc
     1981 atcgaatgat attgaacgct gagagagcca tttacctttt gagtaaaata cgctgccttt
     2041 tttcagcaat gaagaaccac ttccctcctt tcttacatcc aataattgct tcccttgttc
     2101 ctttatattc cccataggaa agtccgttgg agtaggttgc tctctttcat gtattccaac
     2161 cagatcaggt ccgtaaagaa cagtaacgtc tatttcttta ttccagttta tgcttagtgg
     2221 atcaattttg tattcatcta aattgaagtg ctttatggac cacctcctcc atatacaatg
     2281 atacaaccta gccatatata tgctatagtc aaactcattt tcagaagagc aactgggtaa
     2341 cctcaataat ttatgcttgc gcattttcag caacaagcgc caggtagtgg aca
//
LOCUS       BSURPOF      3078 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB),
            complete cds.
ACCESSION   M34995
KEYWORDS    RNA polymerase; minor sigma-37 factor.
SOURCE      B.subtilis (strains 168 and Marburg), clones
            lambda-[21,gt11,1,11,21,31].
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 368 to 2297)
  AUTHORS   Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W.
  TITLE     Gene encoding the 37,000-dalton minor sigma factor of Bacillus
            subtilis RNA polymerase: Isolation, nucleotide sequence,
            chromosomal locus, and cryptic function
  JOURNAL   J. Bacteriol. 169, 771-778 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3078)
  AUTHORS   Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W.
  TITLE     Similar organization of the sigB and spoIIA operons encoding
            alternate sigma factors of Bacillus subtilis RNA polymerase
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.W.Price, 18-FEB-1987.  Draft entry and computer-readable
            sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990.
FEATURES       from  to/span     description
    pept        200      529     ORF V
    pept        526     1008     ORF W
    pept        968     1762     37 kd minor sigma factor (rpoF, sigB; ttg
                                 start codon)
    pept       1762     2361     ORF X
    mRNA        131     2368     sigB operon mRNA
    signal      131      168     sigB operon promoter
    signal     2368     2401     sigB transcription termination signal
    binding     188      194     ORF V ribosomal binding site
    binding     511      519     ORF W ribosomal binding site
    binding     954      961     sigB ribosomal binding site
    binding    1750     1753     ORF X ribosomal binding site
BASE COUNT      999 a    575 c    755 g    749 t
ORIGIN      40 degrees on B.subtilis map.
        1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt
       61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt
      121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg
      181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat
      241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct
      301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat
      361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg
      421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt
      481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct
      541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg
      601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc
      661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg
      721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa
      781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac
      841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat
      901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg
      961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga
     1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg
     1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca
     1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc
     1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg
     1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc
     1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga
     1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg
     1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac
     1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca
     1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct
     1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat
     1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga
     1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta
     1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct
     1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac
     1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc
     1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg
     2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt
     2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc
     2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata
     2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta
     2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat
     2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt
     2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg
     2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac
     2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga
     2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac
     2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca
     2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag
     2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa
     2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt
     2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt
     2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc
     3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg
     3061 caaactgctg cgaagctt
//
LOCUS       HUMERPR      1624 bp ss-mRNA            PRI       08-AUG-1990
DEFINITION  Human erythropoietin receptor mRNA, complete cds.
ACCESSION   M34986
KEYWORDS    erythropoietin receptor.
SOURCE      Human cell line OCI-M1, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1624)
  AUTHORS   Ehrenman,K. and St John,T.
  TITLE     Sequence of the human erythropoietin gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.St.John, 07-JUN-1990.
            
              Author address: T.St.John
              Hutchinson Cancer Res Center
              1124 Columbia St.
              Seattle, WA
              Email: TSTJOHN@FHCRCVAX
            
FEATURES       from  to/span     description
    pept          1     1527     erythropoietin receptor precursor
    sigp          1       72     erythropoietin receptor signal peptide
    matp         73     1524     erythropoietin receptor
BASE COUNT      276 a    537 c    498 g    313 t
ORIGIN      
        1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct
       61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc
      121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg
      181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc
      241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt
      301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta
      361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat
      421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc
      481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac
      541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag
      601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc
      661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg
      721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg
      781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag
      841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac
      901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc
      961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg
     1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc
     1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg
     1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc
     1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct
     1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca
     1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct
     1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta
     1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg
     1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata
     1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc
     1621 agga
//
LOCUS       HUMGFP40H    4379 bp ds-DNA             PRI       08-AUG-1990
DEFINITION  Human P40 T-cell and mast cell growth factor (hP40) gene, complete
            cds.
ACCESSION   M30135
KEYWORDS    T-cell growth factor; cytokine; mast cell growth factor;
            megakeryoblastic leukemia cell growth factor.
SOURCE      Human (cell line CESS) DNA, clone lambda-H40.3a1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4379)
  AUTHORS   Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
            Van Snick,J.
  TITLE     Human P40/Il9: Expression in activated CD4+ cells, genomic
            organization, and comparison with the mouse gene
  JOURNAL   J. Immunol. 144, 4235-4241 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Van Snick, 12-DEC-1989.
FEATURES       from  to/span     description
    pept        796      909     P40 T-cell and mast cell growth factor
                                 precursor, exon 1
               1022     1057     P40 T-cell and mast cell growth factor
                                 precursor, exon 2
               1140     1172     P40 T-cell and mast cell growth factor
                                 precursor, exon 3
               2461     2592     P40 T-cell and mast cell growth factor
                                 precursor, exon 4
               4096     4215     P40 T-cell and mast cell growth factor
                                 precursor, exon 5
    sigp        796      849     P40 T-cell and mast cell growth factor signal
                                 peptide
    matp        850      909     P40 T-cell and mast cell growth factor
               1022     1057     P40 T-cell and mast cell growth factor
               1140     1172     P40 T-cell and mast cell growth factor
               2461     2592     P40 T-cell and mast cell growth factor
               4096     4212     P40 T-cell and mast cell growth factor
    IVS         910     1021     hP40 intron A
    IVS        1058     1139     hP40 intron B
    IVS        1173     2460     hP40 intron C
    IVS        2593     4095     hP40 intron D
    signal      742      747     TATA box
    signal     4354     4359     polyA signal
    rpt        2965     3265     Alu repeat
    binding     370      376     AP-2 binding site
    binding     624      629     IRF-1 binding site
    binding     627      633     AP-1 binding site
BASE COUNT     1208 a    921 c    904 g   1346 t
ORIGIN      
        1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt
       61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc
      121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca
      181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct
      241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact
      301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc
      361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag
      421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa
      481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt
      541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt
      601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa
      661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt
      721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc
      781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc
      841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac
      901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat
      961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta
     1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac
     1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg
     1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg
     1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat
     1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt
     1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt
     1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag
     1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa
     1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc
     1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg
     1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc
     1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc
     1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat
     1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac
     1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc
     1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg
     1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt
     2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc
     2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga
     2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc
     2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa
     2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa
     2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt
     2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag
     2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg
     2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac
     2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt
     2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc
     2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc
     2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct
     2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag
     2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa
     2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg
     3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg
     3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat
     3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca
     3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc
     3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt
     3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac
     3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg
     3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc
     3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt
     3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca
     3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca
     3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact
     3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga
     3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt
     3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc
     3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc
     3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg
     4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa
     4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac
     4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg
     4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc
     4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga
     4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa
//
LOCUS       MEFDNABP     1710 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  M.fervidus DNA binding protein HMfB gene, complete cds.
ACCESSION   M34778
KEYWORDS    DNA binding protein HMfB.
SOURCE      M.fervidus DNA.
  ORGANISM  Methanothermus fervidus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanothermaceae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
  TITLE     DNA binding protein HMf, isolated from the hypothermophilic
            archaeal Methanothermus fervidus is most closely related to
            histones
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1710)
  AUTHORS   Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Sandman, 01-JUN-1990.
            
               Author address: K.Sandman
               Department of Microbiology
               Ohio State University
               484 W. 12th Ave.
               Columbus, Ohio 43210
               E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU
FEATURES       from  to/span     description
    pept        578      787     DNA binding protein HMfB
    pept       1662      805 (c) ORF285
    signal      450      457     promoter
    signal      523      530     promoter
    signal      790      814     transcription termination signal
BASE COUNT      619 a    263 c    265 g    561 t      2 others
ORIGIN      1 bp upstream of HindIII site.
        1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa
       61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag
      121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag
      181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg
      241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat
      301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat
      361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa
      421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt
      481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt
      541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat
      601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct
      661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag
      721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa
      781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat
      841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc
      901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt
      961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat
     1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa
     1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg
     1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata
     1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc
     1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc
     1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc
     1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat
     1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc
     1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt
     1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt
     1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa
     1681 gttttgtttt acccattgct tctaaatatt
//
LOCUS       MUSINT1A     5607 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds.
ACCESSION   K02593 M34750
KEYWORDS    Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene.
SOURCE      Mouse (BALB/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1098 to 5606)
  AUTHORS   Van Ooyen,A. and Nusse,R.
  TITLE     Structure and nucleotide sequence of the putative mammary oncogene
            int-1; Proviral insertions leave the protein-encoding domain intact
  JOURNAL   Cell 39, 233-240 (1984)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 5607)
  AUTHORS   Nusse,R., Theunissen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A.,
            Otte,A., Schuuring,E. and van Ooyen,A.
  TITLE     The Wnt-1 (int-1) oncogene promoter and its mechanism of activation
            by insertion of proviral DNA of the mouse mammary tumor virus
  JOURNAL   Mol. Cell. Biol. 10, 4170-4179 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [2] kindly submitted
            by R.Nusse, 31-MAY-1990.
FEATURES       from  to/span     description
    pept       1779     1882     Wnt-1 protein (exon 1)
               2452     2705     Wnt-1 protein (exon 2)
               3279     3544     Wnt-1 protein (exon 3)
               4002     4490     Wnt-1 protein (exon 4)
    IVS        1883     2451     Wnt-1 cds intron A
    IVS        2706     3278     Wnt-1 cds intron B
    IVS        3545     4001     Wnt-1 cds intron C
    site       1133     1134     MMTV insertion site in tumor 17 [1]
    site       1394     1395     MMTV insertion site in tumor 35 [1]
    site       1631     1632     MMTV insertion site in tumor 102 [1]
    site       4495     4496     MMTV insertion site in tumor 53 [1]
    mRNA       1428     5398     Wnt-1 mRNA (alt.)
    mRNA       1595     5398     Wnt-1 mRNA (alt.)
    signal     1562     1566     TATA box
    signal     1573     1577     TATA box
BASE COUNT     1188 a   1681 c   1519 g   1219 t
ORIGIN      Chromosome 15.
        1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc
       61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac
      121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac
      181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa
      241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg
      301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg
      361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac
      421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct
      481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact
      541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac
      601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag
      661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg
      721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga
      781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca
      841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact
      901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc
      961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca
     1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg
     1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga
     1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg
     1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg
     1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc
     1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc
     1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg
     1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt
     1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa
     1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag
     1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact
     1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt
     1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc
     1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg
     1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg
     1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag
     1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct
     2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc
     2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct
     2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt
     2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg
     2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg
     2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg
     2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt
     2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga
     2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat
     2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg
     2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa
     2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc
     2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag
     2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg
     2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg
     2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc
     3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag
     3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag
     3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact
     3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac
     3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat
     3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc
     3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg
     3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc
     3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg
     3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg
     3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag
     3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag
     3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt
     3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca
     3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac
     3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt
     3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc
     4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc
     4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc
     4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg
     4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat
     4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt
     4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc
     4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg
     4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc
     4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg
     4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc
     4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga
     4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc
     4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac
     4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac
     4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc
     4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa
     4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc
     5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc
     5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac
     5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga
     5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac
     5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct
     5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat
     5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca
     5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga
     5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat
     5581 gaccctaaat gagaccaact agggatc
//
LOCUS       MUSP40M      3808 bp ds-DNA             ROD       08-AUG-1990
DEFINITION  Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete
            cds.
ACCESSION   M30136
KEYWORDS    T-cell growth factor; cytokine; mast cell growth factor.
SOURCE      Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210
            DNA, clone lambda-S40.1a.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3808)
  AUTHORS   Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
            Van Snick,J.
  TITLE     Human P40/Il9: Expression in activated CD4+ cells, genomic
            organization, and comparison with the mouse gene
  JOURNAL   J. Immunol. 144, 4235-4241 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Van Snick, 21-NOV-1989.
FEATURES       from  to/span     description
    pept        644      757     P40 T-cell and mast cell growth factor
                                 precursor, exon 1
                846      881     P40 T-cell and mast cell growth factor
                                 precursor, exon 2
                965      997     P40 T-cell and mast cell growth factor
                                 precursor, exon 3
               2184     2315     P40 T-cell and mast cell growth factor
                                 precursor, exon 4
               3393     3512     P40 T-cell and mast cell growth factor
                                 precursor, exon 5
    sigp        644      697     P40 T-cell and mast cell growth factor signal
                                 peptide
    matp        698      757     P40 T-cell and mast cell growth factor
                846      881     P40 T-cell and mast cell growth factor
                965      997     P40 T-cell and mast cell growth factor
               2184     2315     P40 T-cell and mast cell growth factor
               3393     3509     P40 T-cell and mast cell growth factor
    IVS         758      845     mP40 intron A
    IVS         882      964     mP40 intron B
    IVS         998     2183     mP40 intron C
    IVS        2316     3392     mP40 intron D
    signal      589      594     TATA box
    binding     227      233     AP-2 binding site
    binding     475      480     IRF-1 binding site
    binding     478      484     AP-1 binding site
BASE COUNT     1012 a    903 c    869 g   1024 t
ORIGIN      Chromosome 13.
        1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca
       61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt
      121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag
      181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc
      241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga
      301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt
      361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt
      421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga
      481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg
      541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta
      601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat
      661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg
      721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga
      781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac
      841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag
      901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt
      961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc
     1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct
     1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat
     1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga
     1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa
     1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac
     1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac
     1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc
     1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca
     1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta
     1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac
     1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc
     1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct
     1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta
     1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat
     1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg
     1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc
     1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg
     2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa
     2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt
     2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt
     2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa
     2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca
     2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc
     2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag
     2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg
     2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct
     2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag
     2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg
     2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag
     2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat
     2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc
     2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct
     2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc
     3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga
     3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg
     3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga
     3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa
     3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt
     3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc
     3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc
     3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga
     3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg
     3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag
     3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga
     3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac
     3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat
     3781 aggtaggatg gagcaaagac ttggtaat
//
LOCUS       PIGMHDQAA    1105 bp ss-mRNA            MAM       08-AUG-1990
DEFINITION  Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete
            cds.
ACCESSION   M29938
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone
            ZQ30-1.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1105)
  AUTHORS   Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
            LeGuern,C.
  TITLE     Class II genes of miniature swine: III. Characterization of an
            expressed pig class II gene homologous to HLA-DQA
  JOURNAL   Immunogenetics 31, 52-56 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept         13      777     MHC SLA-DQ alpha chain precursor
    sigp         13       81     MHC SLA-DQ alpha chain signal peptide
    matp         82      774     MHC SLA-DQ alpha chain
    mRNA    <     1     1105     MHC SLA-DQ mRNA
BASE COUNT      254 a    290 c    255 g    306 t
ORIGIN      
        1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc
       61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta
      121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag
      181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa
      241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg
      301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact
      361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac
      421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt
      481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc
      541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat
      601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag
      661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc
      721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc
      781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa
      841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc
      901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg
      961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc
     1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
     1081 aaacaataaa tcacttcatg aggtt
//
LOCUS       PIGMHDQAB    1174 bp ss-mRNA            MAM       08-AUG-1990
DEFINITION  Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete
            cds.
ACCESSION   M29939
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1174)
  AUTHORS   Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
            LeGuern,C.
  TITLE     Class II genes of miniature swine: III. Characterization of an
            expressed pig class II gene homologous to HLA-DQA
  JOURNAL   Immunogenetics 31, 52-56 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept         69      836     MHC SLA-DQ alpha chain precursor
    sigp         69      137     MHC SLA-DQ alpha chain signal peptide
    matp        138      833     MHC SLA-DQ alpha chain
BASE COUNT      273 a    313 c    266 g    322 t
ORIGIN      
        1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg
       61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga
      121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg
      181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat
      241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta
      301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca
      361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt
      421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca
      481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt
      541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca
      601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata
      661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga
      721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct
      781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca
      841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac
      901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct
      961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga
     1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc
     1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
     1141 aaacaataaa tcacttcatg aggtttttgt actg
//
LOCUS       PIGMHDQBA     212 bp ds-DNA             MAM       08-AUG-1990
DEFINITION  Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end.
ACCESSION   M29944
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Pig (haplotype c/c) adult spleen DNA, clone D26.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F.,
            Gustafsson,K. and LeGuern,C.
  TITLE     Class II genes of miniature swine: II. Molecular identification and
            characterization of B (beta) genes from SLA-c haplotype
  JOURNAL   Immunogenetics 31, 1-6 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept    <     1       12     MHC SLA-DO beta chain
BASE COUNT       38 a     58 c     52 g     64 t
ORIGIN      
        1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg
       61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc
      121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct
      181 cttacccttc tgcctttctt cctcctactt gt
//
LOCUS       PP2LEND       266 bp ds-DNA             PHG       08-AUG-1990
DEFINITION  Bacteriophage P2 left end DNA.
ACCESSION   M34756
KEYWORDS    .
SOURCE      Bacteriophage P2 DNA.
  ORGANISM  Bacteriophage P2
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 266)
  AUTHORS   Ziermann,R. and Calendar,R.
  TITLE     Characterization of the cos sites of Bacteriophages P2 and P4
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Ziermann, 05-JUN-1990.
            
              Author address: R.Ziermann
              University of California
              Dept. Mol. Cell Biology
              Berkeley, CA 94720
              Email: calendar@garnet.berkeley.edu
            
FEATURES       from  to/span     description
    site          1       19     cohesive end of phage DNA
    site         20      266     left end of phage DNA
BASE COUNT       41 a     58 c     99 g     68 t
ORIGIN      Map position 0.0-0.8%.
        1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct
       61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt
      121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg
      181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc
      241 gagccagccg tttatctcgc ggatcc
//
LOCUS       RCABCHH       314 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  R.capsulata bacteriochlorophyll biosynthesis protein (bchH and
            F108) genes, 3' end, and 5' end respectively.
ACCESSION   M34843
KEYWORDS    .
SOURCE      R.capsulata DNA.
  ORGANISM  Rhodobacter capsulatus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Yang,Z. and Bauer,C.E.
  TITLE     Rhodobacter capsulatus genes involved in early steps of the
            bacteriochlorophyll biosynthetic pathway
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.E.Bauer, 04-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1       79     bacteriochlorophyll biosynthesis protein
                                 (bchH) (AA at 2)
    pept        108  >   314     bacteriochlorophyll biosynthesis protein
                                 (F108)
BASE COUNT       59 a     87 c    119 g     49 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga
       61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg
      121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg
      181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg
      241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca
      301 agcgcgtgct gcag
//
LOCUS       TREANTAT      999 bp ds-DNA             ORG       08-AUG-1990
DEFINITION  T.evansi AnTat 3/3 kinetoplast DNA minicircle.
ACCESSION   M34848
KEYWORDS    .
SOURCE      T.evansi AnTat 3/3 kinetoplast minicircle DNA.
  ORGANISM  Kinetoplast Trypanosoma evansi
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Trypanosoma evansi.
REFERENCE   1  (bases 1 to 999)
  AUTHORS   Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
            Steinert,M. and Hamers,R.
  TITLE     Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Bajyana Songa, 05-JUN-1990.
FEATURES       from  to/span     description
    site        367      516     kinetoplast minicircle common region (ORI)
BASE COUNT      391 a     78 c    203 g    327 t
ORIGIN      
        1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
       61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
      121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
      181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
      241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
      301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
      361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
      421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
      481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
      541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
      601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
      661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
      721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
      781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
      841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
      901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
      961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS       VIBSCRRE     1524 bp ds-DNA             BCT       08-AUG-1990
DEFINITION  V.alginolyticus Scr repressor protein gene (scrR), complete cds,
            and sucrose transport (scrA) protein, 5' end.
ACCESSION   M35009
KEYWORDS    Scr repressor protein; sucrose transport protein.
SOURCE      V.alginolyticus DNA.
  ORGANISM  Vibrio alginolyticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 1524)
  AUTHORS   Blatch,G.L. and Woods,D.R.
  TITLE     Nucleotide sequence and analysis of the Vibrio alginolyticus Scr
            repressor-encoding gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.L.Blatch, 10-JUN-1990.
            
              Author address: G.L.Blatch
              University of Cape Town
              Dept. Microbiology
              Private Bag
              Rondebosch
              7700 SOUTH AFRICA
              Email: bltgre01.uctvax@f4.n494.z5.fidonet.org
            
FEATURES       from  to/span     description
    pept         36  <     1 (c) sucrose transport protein (scrA)
    pept        271      555     Scr repressor protein (scrR)
    pept        791     1090     ORF5
    binding     264      267     scrR ribosomal binding site
    binding     780      785     ORF5 ribosomal binding site
    signal      178      183     scrA -10 region
    signal      203      208     scrA -35 region
    signal      190      195     scrR -10 region
    signal      162      167     scrR -10 region
    signal      762      767     ORF5 -35 region
    signal      742      747     ORF5 -35 region
    site         78       91     repressor operator site (put.)
    site        229      242     repressor operator site (put.)
    site         12       71     ScrR helix-turn-helix motif
BASE COUNT      466 a    344 c    321 g    393 t
ORIGIN      
        1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc
       61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg
      121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt
      181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc
      241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg
      301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa
      361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc
      421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg
      481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt
      541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga
      601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt
      661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac
      721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag
      781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt
      841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga
      901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg
      961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga
     1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta
     1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc
     1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt
     1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt
     1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat
     1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct
     1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca
     1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa
     1501 gccaattgtg atattgaaga agct
//
LOCUS       XELNRP1A     2431 bp ss-mRNA            VRT       08-AUG-1990
DEFINITION  X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA,
            complete cds.
ACCESSION   M34894
KEYWORDS    RNA-binding protein; nervous system-specific protein.
SOURCE      X.laevis stage 24, cDNA to mRNA, clones 24-39.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2431)
  AUTHORS   Richter,K., Good,P.J. and Dawid,I.B.
  TITLE     A developmentally regulated, nervous system-specific gene in
            Xenopus encodes a putative RNA-binding protein
  JOURNAL   New Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.Good, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         74     1117     pot. RNA-binding protein (nrp-1B)
    mRNA    <     1     2431     nrp-1B mRNA
    signal     2407     2418     poly-A signal
    site         22       27     RNP2 consensus sequence
    site         58       68     RNP1 consensus sequence
    site        111      116     RNP2 consensus sequence
    site        147      157     RNP1 consensus sequence
    site        921      941     alternative exon (put.)
BASE COUNT      663 a    522 c    543 g    703 t
ORIGIN      
        1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt
       61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc
      121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact
      181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt
      241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa
      301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt
      361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt
      421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga
      481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac
      541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa
      601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt
      661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat
      721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc
      781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc
      841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc
      901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag
      961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag
     1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc
     1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct
     1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct
     1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga
     1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat
     1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg
     1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat
     1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta
     1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa
     1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat
     1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac
     1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg
     1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc
     1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat
     1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc
     1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga
     1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct
     2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg
     2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc
     2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat
     2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg
     2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca
     2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata
     2401 gttcaaaata aactatgaat tttattcatt t
//
LOCUS       XELNRP1B     2324 bp ss-mRNA            VRT       08-AUG-1990
DEFINITION  X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA,
            complete cds.
ACCESSION   M34895
KEYWORDS    RNA-binding protein; nervous system-specific protein.
SOURCE      X.laevis stage 24, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2324)
  AUTHORS   Richter,K., Good,P.J. and Dawid,I.B.
  TITLE     A developmentally regulated, nervous system-specific gene in
            Xenopus encodes a putative RNA-binding protein
  JOURNAL   New Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.Good, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         29     1051     pot. RNA-binding protein (nrp-1B)
    mRNA    <     1     2324     nrp-1B mRNA
    signal     2306     2311     poly-A signal
    site         22       27     RNP2 consensus sequence
    site         58       68     RNP1 consensus sequence
    site        111      116     RNP2 consensus sequence
    site        147      157     RNP1 consensus sequence
BASE COUNT      647 a    467 c    520 g    690 t
ORIGIN      
        1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc
       61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac
      121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt
      181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca
      241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga
      301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat
      361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca
      421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg
      481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt
      541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc
      601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat
      661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag
      721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac
      781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg
      841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc
      901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat
      961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat
     1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg
     1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct
     1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat
     1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta
     1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa
     1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa
     1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct
     1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa
     1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc
     1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact
     1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg
     1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt
     1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct
     1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag
     1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag
     1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt
     1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta
     2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac
     2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt
     2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat
     2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa
     2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc
//
LOCUS       YSJSACDHY    2569 bp ds-DNA             PLN       08-AUG-1990
DEFINITION  Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds.
ACCESSION   M34929
KEYWORDS    saccharopine dehydrogenase.
SOURCE      Y.lipolytica (strain W29) DNA.
  ORGANISM  Yarrowia lipolytica
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2569)
  AUTHORS   Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and
            Gaillardin,C.
  TITLE     Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia
            lipolytica
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.M.Fournier, 06-JUN-1990.
            
            ORF1 is probably not translated, since a lacZ fusion/expression
            experiment did not produce results.
FEATURES       from  to/span     description
    pept        649     1929     ORF1
    pept       1764      655 (c) saccharopine dehydrogenase
    mRNA        582     2060     ORF1 mRNA (alt.)
    mRNA        598     2060     ORF1 mRNA (alt.)
    mRNA        599     2060     ORF1 mRNA (alt.)
    mRNA        602     2060     ORF1 mRNA (alt.)
    mRNA        606     2060     ORF1 mRNA (alt.)
    mRNA        609     2060     ORF1 mRNA (alt.)
    mRNA       1807      639 (c) ORF2 mRNA (alt.)
    mRNA       1809      639 (c) ORF2 mRNA (alt.)
    mRNA       1811      639 (c) ORF2 mRNA (alt.)
    mRNA       1812      639 (c) ORF2 mRNA (alt.)
    mRNA       1814      639 (c) ORF2 mRNA (alt.)
    signal      521      528     TATA box
    signal      565      575     TATA box
BASE COUNT      668 a    573 c    666 g    662 t
ORIGIN      
        1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag
       61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt
      121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca
      181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa
      241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag
      301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc
      361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc
      421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt
      481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca
      541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt
      601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc
      661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt
      721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc
      781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc
      841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac
      901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt
      961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca
     1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt
     1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc
     1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt
     1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata
     1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc
     1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc
     1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct
     1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg
     1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg
     1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc
     1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt
     1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag
     1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg
     1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag
     1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg
     1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga
     1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa
     2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca
     2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg
     2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc
     2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct
     2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag
     2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga
     2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat
     2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat
     2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc
//

GenBank-Updates@genbank.bio.net (08/09/90)

LOCUS       INS43AAA      130 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  Insertion sequence IS2-43.
ACCESSION   M25093
KEYWORDS    RNA polymerase binding site; insertion sequence;
            insertion sequence IS2.
SOURCE      Insertion sequence IS2 DNA.
  ORGANISM  Insertion sequence IS2
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 130)
  AUTHORS   Sommer,H., Cullum,J. and Saedler,H.
  TITLE     IS2-43 and IS2-44: New alleles of the insertion sequence IS2 which
            have promoter activity
  JOURNAL   Mol. Gen. Genet. 175, 53-56 (1979)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
BASE COUNT       41 a     23 c     22 g     44 t
ORIGIN      
        1 cctaagacat caatcatctg ttctccaatg actagtctaa aaactagtat taagactatc
       61 acttatttaa gtgatatact tatttaagtg atattggttg tctggagatt cagggggcca
      121 gtctaatacc
//
LOCUS       PSCIS1IN      146 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  Plasmid pDG128 insertion element IS1 target region sequence.
ACCESSION   M25018
KEYWORDS    insertion element; insertion element IS1.
SOURCE      Plasmid pDG128, a derivative of Plasmid pSC101, DNA, clone
            128/10R7.
  ORGANISM  Plasmid pSC101
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 146)
  AUTHORS   Sommer,H., Schumacher,B. and Saedler,H.
  TITLE     A new type of IS1-mediated deletion
  JOURNAL   Mol. Gen. Genet. 184, 300-307 (1981)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
BASE COUNT       38 a     33 c     34 g     41 t
ORIGIN      
        1 gctgcgaaaa tgccttatct ggcctacaga ttcgatgcga ttcgtaggtc ggataagatg
       61 cgcaagcatc gcatccgaca ataagtgccg aatgcgacct acattcacat ggcgcttttt
      121 acatctgacg gtttttattg aagtta
//
LOCUS       BRVRNASA      197 bp ss-mRNA            VRL       09-AUG-1990
DEFINITION  Berne virus ORF5 mRNA, 5'end.
ACCESSION   M33503 M33501
KEYWORDS    core protein.
SOURCE      Berne virus (strain P138/72) viral RNA.
  ORGANISM  Berne virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Toroviridae.
REFERENCE   1  (bases 1 to 197)
  AUTHORS   Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M.
  TITLE     A 3'-coterminal nested set of independently transcribed mRNAs is
            generated during Berne virus replication
  JOURNAL   J. Virol. 64, 331-338 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        137  >   197     ORF5
    mRNA        113  >   197     RNA5
BASE COUNT       50 a     27 c     38 g     82 t
ORIGIN      
        1 ttatttcttc ttcctacttt gtggctactt gggttttgtt ggtggtggtt attattttag
       61 tatttataat tataagtttt tgtattagta attaagtagg ttagtgagag acactatctt
      121 tagagaaaga gccaagatga attctatgct taatccaaat gctgtgccat ttcaaccatc
      181 acctcaggtt gttgcat
//
LOCUS       BRVRNASB      179 bp ss-RNA             VRL       09-AUG-1990
DEFINITION  Berne virus ORF3 mRNA, 5' end.
ACCESSION   M33502
KEYWORDS    core protein.
SOURCE      Berne virus (strain P138/72) viral RNA.
  ORGANISM  Berne virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Toroviridae.
REFERENCE   1  (bases 1 to 179)
  AUTHORS   Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M.
  TITLE     A 3'-coterminal nested set of independently transcribed mRNAs is
            generated during Berne virus replication
  JOURNAL   J. Virol. 64, 331-338 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        153  >   179     ORF3
BASE COUNT       52 a     17 c     34 g     76 t
ORIGIN      
        1 ttataatctt cttcctactt ggattacatg gcttacttta ggttttagtt tgtttagtat
       61 agtaataagt ggtattaata ttattttgtt ttttgaaatg aatggtaagg tgaagaaaag
      121 ttagtcactt tctttagaag aaggttgcca aaatgtttga gaccaattat tggccattt
//
LOCUS       CHKGLOBA     1204 bp ds-DNA             VRT       09-AUG-1990
DEFINITION  Chicken pie-alpha-globin gene, fragment H3/H4.
ACCESSION   M30485
KEYWORDS    pie-alpha-globin.
SOURCE      Chicken AEV transformed erythroblast DNA, fragment H3/H4.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 1204)
  AUTHORS   Broders,F., Zahraoui,A. and Scherrer,K.
  TITLE     The chicken alpha-globin gene domain is transcribed into a
            17-kilobase polycistronic RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA         91  >  1204     pie-alpha-globin mRNA fragment H3/H4 (put.)
BASE COUNT      282 a    252 c    263 g    407 t
ORIGIN      
        1 ggatctatct agttgctgca gtcgtttgta tgaaggttgg atccatcctg ttttgtactg
       61 gatgactgcc ttcaattcac tggcaatcta ggatcaaatg tgtcctagag aacattcaat
      121 atcgcttttt ttctaagctg ttgcaagcca gaatggttac ttttgagctg atctcggtgg
      181 agcagttgag ttgttgtaag ttatttctta atggctccag aaaattacat catttaggtg
      241 ctataactct ccatttccat cttgtatgcg taattgcatt tcttgaatac ttcagacatt
      301 aatttcccgt cctacctgca ggttactggt gtgtattggc tatacagatt acttttccac
      361 agatgtaacc ctaggtcttt tgaatataga tcccatctat tgtctgctta gagaccccga
      421 taaccctccc gataaatcag agtccatgtt ttttgacagt atatcggtgt gaacatctgg
      481 attttagtgc aatatgctag tagcaatctg agtccccgtt tctaagacag agtcatttag
      541 tccgagaatg gctgtttaag actccaaatg gcagtcttga gtcttttagt gactgtactc
      601 gttcctctac tgagggcagt cttgagtgtt ttagtgactg taccctgtct cttaacttga
      661 ccggtctgat agatcttaaa tgacagtcgt ggccgcaatt tcaaatggaa gagctaggag
      721 tctcaggaac cgtcgccctt gtttactctt atgtttaccc gttaagccgt catgaaaagg
      781 atttttctgt agagaacggt tatatgagtt gtattccatc tagggtcacg gcccctagac
      841 caaccaacga cgagtcgatt tgttgtctgg cactttctgt gacttcaagt tttgtggctt
      901 tctctattaa ctttccccac aacgtaactg tctaacttag atgttggcgc gagaactaca
      961 gtctgaggga cttgtcaaga gctggcacac tcgcctttat gttaaagtgt gtcctttgtc
     1021 gatactggta ctaatgctta agctcgagcg ggcccctaga ccaacgacga gtcgatttgt
     1081 tgtctggctc tttctgtgac ttcaagtttt gtggctttct ctattaactt tcccacaacg
     1141 taactgtcta attagatgtt ggcgcgagaa tacagtctga gggattgtca agagtggact
     1201 ggtt
//
LOCUS       CHKGLOBB      582 bp ds-DNA             VRT       09-AUG-1990
DEFINITION  Chicken pie-alpha-globin gene, fragment H10.
ACCESSION   M30486
KEYWORDS    pie-alpha-globin.
SOURCE      Chicken AEV transformed erythroblast DNA, fragment H10.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 582)
  AUTHORS   Broders,F., Zahraoui,A. and Scherrer,K.
  TITLE     The chicken alpha-globin gene domain is transcribed into a
            17-kilobase polycistronic RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        241  >   582     pie-alpha-globin fragment mRNA H10
BASE COUNT      171 a    128 c    108 g    175 t
ORIGIN      
        1 tccaaaaaac ttactctgct tgtaaatgtc gtctcctttt tcggagacaa aaacttgata
       61 ccttcttgcc ttgtccgaag tcactttatc ggttatagga cccaagtttt gggccttgct
      121 agaaggatac aattccctat gaccgccgta ttttggggta ctcgcattcg cccgacatcg
      181 agtggacctc ctttttttct cttgtcgttc gtagaggtta tcgaggtccc cccatatata
      241 ataaccctat cgtgagttta gacttcctac aaaaacttct gtcgtttaat gttttcgtac
      301 cgtcacggtg actgtccagt aatcaaagtt gtcactgtct aaaaagattc gacaacttcg
      361 tcttaccaat gcgaaaactc gactagagac actcgtcaac tcacacattc aataaagaat
      421 taccgaggtc ttttaatgta gtgaaatcac gatattgaga ggtaaaggta gaaacatacg
      481 cattaaccta aagaacttat gaagtctgta attaaaggac cacaagcaat acgaaagaca
      541 atgtatttct tctaacgtcg gataagtatt aggatggacg tc
//
LOCUS       ECOPHOAA      600 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  E.coli alkaline phosphatase (phoA) gene, 5' end.
ACCESSION   M33536
KEYWORDS    alkaline phosphatase.
SOURCE      E.coli (strain K-12) cell line BW7710 DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 600)
  AUTHORS   Agrawal,D.K. and Wanner,B.L.
  TITLE     A phoA structural gene mutation that conditionally affects
            formation of the enzyme bacterial alkaline phosphatase
  JOURNAL   J. Bacteriol. 172, 3180-3190 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.K.Agrawal, 03-APR-1990.
            
            The phoA503 mutation does not interfere with export of active
            enzyme but does interfere with assembly.
FEATURES       from  to/span     description
    pept        283  >   600     alkaline phosphatase precursor (phoA) (EC
                                 3.1.3.1)
    sigp        283      345     alkaline phosphatase signal peptide
    matp        346  >   600     alkaline phosphatase
    variant     413      413     c in wild type; t in phoA503 mutation
BASE COUNT      159 a    130 c    151 g    160 t
ORIGIN      Map position 8.7 minutes; 1 bp upstream of HindIII site.
        1 aagctttgga gattatcgtc actgcaatgc ttcgcaatat ggcgcaaaat gaccaacagc
       61 ggttgattga tcaggtagag ggggcgctgt acgaggtaaa gcccgatgcc agcattcctg
      121 acgacgatac ggagctgctg cgcgattacg taaagaagtt attgaagcat cctcgtcagt
      181 aaaaagttaa tcttttcaac agctgtcata aagttgtcac ggccgagact tatagtcgct
      241 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt
      301 gcactggcac tcttaccgtt actgtttacc cctgtgacaa aagcccggac accagaaatg
      361 cctgttctgg aaaaccgggc tgctcagggc gatattactg cacccggcgg tgctcgccgt
      421 ttaacgggtg atcagactgc cgctctgcgt gattctctta gcgataaacc tgcaaaaaat
      481 attattttgc tgattggcga tgggatgggg gactcggaaa ttactgccgc acgtaattat
      541 gccgaaggtg cgggcggctt ttttaaaggt atagatgcct taccgcttac cgggcaatac
//
LOCUS       GCOEARA      1771 bp ds-DNA             PLN       09-AUG-1990
DEFINITION  G.tikvahiae McLachlan 18S ribosomal RNA gene.
ACCESSION   M33640
KEYWORDS    18S ribosomal RNA.
SOURCE      G.tikvahiae McLachlan (isolate Pomquet Harbour-Nova Scotia) DNA.
  ORGANISM  Gracilaria tikvahiae McLachlan
            Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; 
            Florideophycideae; Gigartinales; Gracilariaceae.
REFERENCE   1  (bases 1 to 1771)
  AUTHORS   Liu,Q.-Y., Bird,C.J., Rice,E.L., Murphy,C.A. and Ragan,M.A.
  TITLE     Nucleotide sequence of the 18S ribosomal RNA gene from the red alga
            Gracilaria tikvahiae mclachlan
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Ragan 08-APR-1990.  Atlantic Research Lab, National Research
            Council of Canada, 1411 Oxford Street, Halifax, Nova Scotia CANADA
            B3H 3Z1
FEATURES       from  to/span     description
    rRNA          1     1771     18S ribosomal RNA
BASE COUNT      445 a    371 c    501 g    454 t
ORIGIN      
        1 ccacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc
       61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct
      121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccata
      181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt
      241 tttttctgat cgcactcgtt gcgacgcacc gttcaaattt ctgacctatc aactttggat
      301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg
      361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc
      421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta
      481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc
      541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc
      601 tcgtagtcgg attttggcgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg
      661 cctttgtgga tgggagtcag gtggtgcttc actggatcgc ttggctgccg ccaccgttta
      721 ctgtgaaaaa attagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata
      781 atagaatagg acccggtcct attttgttgg tttgtttgaa tcgggtaatg attaagaggg
      841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg
      901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga
      961 tcgaagacga tcagataccg tcgtagtctt tactataaac gatgaggact ggagatcgga
     1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg
     1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg
     1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg
     1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt
     1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag
     1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc
     1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc
     1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat
     1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa
     1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct
     1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat
     1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt
     1741 ttccgtaggt gaacctgcag aaggatcaag c
//
LOCUS       HS6MCP       4440 bp ds-DNA             VRL       09-AUG-1990
DEFINITION  Human herpesvirus type 6 major capsid protein (MCP) gene, complete
            cds.
ACCESSION   M33515
KEYWORDS    major capsid protein.
SOURCE      Human herpesvirus type 6 DNA.
  ORGANISM  Human herpesvirus type 6
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 4440)
  AUTHORS   Littler,E., Lawrence,G., Liu,M.-Y., Barrell,B.G. and Arrand,J.R.
  TITLE     Identification, cloning, and expression of the major capsid protein
            gene of human herpesvirus 6
  JOURNAL   J. Virol. 64, 714-722 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        235     4272     major capsid protein (MCP)
BASE COUNT     1422 a   1169 c    785 g   1064 t
ORIGIN      
        1 tatcgtgaac gatatttggc ccggacgttt gaaaaatttt ctctatgatt gactcgatct
       61 tttccagaac tacaggcatg gatcgcgcta aacgagtttc ctcgtcgcga gacacttcag
      121 cggtcagatc acacgaatct ataaaaactg gaatcgaccg tgcacaagtg gaaccaaaac
      181 atgaattaac tattaaagtt tcacaattac cggtgtgctg cataacgccg aaacatggaa
      241 aattggcagg cgaccgaaat tttacctaag atcgaagcac ctctaaatat tttcaatgac
      301 attaaaacat acacagccga acaacttttt gacaatttgc gaatttattt cggtgacgat
      361 ccgagccgtt acaacatcag ttttgaagcc ttactcggaa tctactgcaa caaaatagaa
      421 tggattaact ttttcaccac gccgatcgcc gttgcagcga acgtaatccg cttcaatgat
      481 gtgagtcgaa tgaccctcgg gaaggttctc ttctttattc aattacctag agtcgctaca
      541 ggaaacgacg taactgcttc aaaagaaacc accatcatgg tagccaaaca ctcagaaaaa
      601 caccccataa acatatcgtt cgatttgagc gctgcctgtc tggaacatct ggaaaacaca
      661 tttaaaaaca cagtcatcga tcagatttta aacatcaatg cgttacatac agtcttaaga
      721 tctttaaaga attcagccga ttcgctcgag cgaggtttga ttcacgcatt catgcaaacc
      781 ttattgagaa aatctccccc gcaatttatc gtcctgacca tgaatgagaa caaagtacat
      841 aataaacaag ctctgagccg agtacagcgc agcaacatgt ttcagagcct gaagaacaga
      901 ttgttaacgt cattattttt tttgaacagg aataataata tttcatatat ctatagaatt
      961 ctaaacgaca tgatggaatc ggtcacggaa agcattctaa atgatacgaa caactacact
     1021 tccaaagaaa acgtccccct agatggtgtt ttattaggac cgatcggctc tatccaaaaa
     1081 ctcaccagca tactctccca gtacatctcc acacaagtcg tctccgcccc aatctcatat
     1141 ggtcacttta ttatgggcaa agaaaacgca gtgactgcga ttgcataccg tgcaatcatg
     1201 gccgatttta ctcaattcac cgtgaacgcc gggacagaac aacaagacac taacaacaaa
     1261 tcagaaatct tcgacaaaag ccgcgcgtac gccgacctaa agctgaacac gttgaaattg
     1321 ggagataaat tagtcgcatt cgaccaccta cacaaagttt acaaaaacac agacgtcaac
     1381 gatccgctag aacagagctt acaactaaca ttctttttcc ctttgggtat ctacataccg
     1441 agcgagaccg gtttcagtac aatggaaaca cgtgtgaaat taaacgacac catggaaaac
     1501 aacctaccca ccagcgtttt tttccacaat aaagaccaag tcgtgcagcg aattgatttt
     1561 gccgacatat taccgtcggt ttgccatccc attgtccacg actcgaccat cgtcgaacga
     1621 ctcatgaaaa gcgaaccatt gcctaccggc caccgctttt cccaactatg tcaactaaaa
     1681 attacccgag aaaacccagc caggatctta cagaccttat acaacttata cgaaagtcga
     1741 caagaagtac ccaaaaacac caacgtctta aaaaacgaat taaacattga agatttttac
     1801 aaaccggaca atccaacact gccgaccgaa agacacccct tcttcgatct cacgtatatc
     1861 cagaaaaacc gagccacaga agtactctgc acaccaagaa taatgatagg caacatacct
     1921 ttaccgttag ctccagtctc tttccacgaa gcccgtacaa atcaaatact ggaacatgca
     1981 aagacgaact gccaaaagta cgacttcacc ctcaaaattg tcaccgaaag cttgacgagt
     2041 ggctcgtacc cagaattggc ttacgttatc gagaccttag tgcatggaaa caagcatgct
     2101 tttatgatcc taaaacaagt aattagccag tgtatttctt attggtttaa catgaaacat
     2161 atacttcttt tttgcaacag cttcgagatg atcatgctaa tctctaacca catgggcgac
     2221 gaactgatcc cgggagcagc tttcgctcac tacagaaatc ttgtgtcgct aattcgccta
     2281 gtgaagagaa caatctctat ctccaacctc aacgagcaac tttgcggcga acctctggtg
     2341 aatttcgcca acgcgttgtt cgacggacgt ctgttctgcc cgttcgtcca taccatgccc
     2401 agaaacgaca cgaatgcaaa aataacagcg gatgatacac cactgacaca gaacaccgta
     2461 agagttagaa attacgaaat atccgatgtg caaagaatga atctaataga ttcaagcgtc
     2521 gtctttaccg acaatgacag accatcgaac gaaaccacca tcctgagcga gatattttac
     2581 ttctgcgtac tcccggcact atcaaataac aaggcctgtg gcgctggcgt caacgtaaag
     2641 gaactagttc tagacttatt ctacacggaa ccgttcatca gtccagatga ttatttccag
     2701 gagaatccga ttaccagcga cgttctaatg tctctgatcc gagaaggtat gggccctggc
     2761 tacaccgtag ccaacacatc ctgtatcgca aaacagttgt ttaaatcgct aatctacatt
     2821 aatgaaaata cgaaaatatt ggaagtggaa gtctccttag atcccgcgca gcgacacggc
     2881 aactccgttc attttcaatc actacaacac attctataca acgggctttg cctgatctca
     2941 ccgatcacca ccctaagacg gtactatcaa ccaatcccat ttcatcgatt cttctccgac
     3001 ccgggaatct gcggcaccat gaatgctgat atccaagttt tcctaaatac atttcctcac
     3061 tgtcaaagaa acgacggcgg ttttcctctc ccgcccccat tagcattaga attttataat
     3121 tggcaacgaa caccgttttc cgtgtactca gccttctgcc ccaattccct gttgagcatt
     3181 atgacgcttg ccgccatgca ctcaaaattg tctcccgttg ccatagcgat ccaaagcaaa
     3241 aacaaaatcc atccgggctt tgcggccaca ctagtccgga cggataattt cgacgtcgag
     3301 tgcctattat acagttccag agcagccaca tctataattt tagacgatcc cacggtcacc
     3361 gcggaagcta aagatatcgc aaccacttac aacttcaccc agcacctaag ttttgtagat
     3421 atgggcttag gttttagctc taccaccgcc actgccaatc ttaagcgaat taaatcagat
     3481 atggggagca agatacaaaa ccttttctcc gccttcccga tacacgcgtt taccaacgcg
     3541 gacataaata cgtggattcg acatcacgtc gggatagaaa aacctaatcc ctccgagagc
     3601 gaagcactaa acatcataac gttcggcgga attaacaaaa acccaccctc catactactg
     3661 catggtcaac aagctatctg cgaagttata ctgaccccgg ttacgacaaa cattaacttt
     3721 ttcaaatcgc cccacaaccc aagaggcagg gaatcatgta tgatgggaac ggacccgcac
     3781 aacgaagagg cggctagaaa agcattgtac gaccacaccc aaacagacag cgatacattc
     3841 gccgcaacca caaacccttg ggcatctcta ccaggctcct taggcgatat tctatacaac
     3901 acggcacaca gagaacaact atgttacaac cccaagacat acagtcccaa cgctcaattt
     3961 tttaccgaat ctgacatctt aaaaacaaac aagatgatgt acaaagtgat aagcgaatac
     4021 tgcatgaaat cgaactcgtg tttaaacagc gatagcgaaa tacaatactc gtgctctgag
     4081 ggcacggata gcttcgtaag cagaccatgc cagttcttac aaaacgctct gcctcttcac
     4141 tgttcatcca accaagctct attagagagt cggtctaaaa ccggcaatac gcagatcagc
     4201 gaaacccatt attgtaatta cgccatagga gaaaccatac ctttccaact cattatcgaa
     4261 tcatccatat aaaatggaaa ccgtctactg cactttcgat cacaaactgt cactttccga
     4321 tatcagcacc ctatgcaagc tcatgaacat cgtcataccg atcccagctc accaccatct
     4381 aataggtagc ggcaatttag gtctttatcc catcgtctcc tccaacaaag attacgtcca
//
LOCUS       HUMSEXREPB    916 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human sex chromosome repeat, clone pDP330.
ACCESSION   M33524
KEYWORDS    sex chromosome repeat.
SOURCE      Human cell line OXENII DNA, clone pDP320.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 916)
  AUTHORS   Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and
            Page,D.C.
  TITLE     Human sex-chromosome-specific repeats within a region of
            pseudoautosomal/Yq homology
  JOURNAL   Genomics 7, 625-628 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.M.C.Fisher, 03-APR-1990.
FEATURES       from  to/span     description
    site          1      388     low copy flanking sequence
    rpt         389      916     sex chromosome repeat
BASE COUNT      206 a    228 c    229 g    253 t
ORIGIN      Chromosome Yp.
        1 gaattcaggc ctcagtgtat gtctgtaaca caacagacag ggtctgcagg ggtcgaagta
       61 ttttgtcatc aaagaggaag gaatgatcat tcatcataaa aggcaagaca tctttggtgc
      121 aaggaaaact caagaaaaat accgcagacc atgcaatgag gcactggtcg atggagtgtt
      181 gtaaacccgt cttcccagag tggcatgcac atggatccct cagcacatgg gtgacacaca
      241 gactatgctt cagcaggtct gtctgggccc aagacacatt gtttctcatc agctcccagg
      301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct
      361 tcccagagtt gttacaaaga aatcccaaag actgggcccc ttaaacaaca accttgattc
      421 tcacagtcct tgaggctaga agtctgagat caagctatgg ccagggctgg ttcctcctga
      481 ggcctctctc cttgggttgt agatgctgtc ttctccctgt gtcctcacag ggttgtccct
      541 ctgtgtgtgt ctgtgtcctc atctcctctt cttatgaggt gtcttagtcc atttcaggct
      601 gctgtcacag catgccgtag actgggtggc ttatcagcaa cagacattga ttctcccaca
      661 gtcctggaag ctggacgtct gagatcaggg tatgggcagg gctgcttcct cctgaggcct
      721 ctgtcctggg cttgtagatg ctgtcttctc catgtgtccc catgtggtca tccctctgtg
      781 ggtgtgtctg tttcctcatc tgctcttcta atgagatgtc ttagtccatt gcaggctgct
      841 atcacagaat accataggct gggtggctta taaaccacag agttttattc ttccacagtc
      901 ctggaggctg gaattc
//
LOCUS       HUMSEXRPA     918 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human sex chromosome repeat, clone pDP316.
ACCESSION   M33523
KEYWORDS    sex chromosome repeat.
SOURCE      Human cell line OXENII DNA, clone pDP316.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 918)
  AUTHORS   Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and
            Page,D.C.
  TITLE     Human sex-chromosome-specific repeats within a region of
            pseudoautosomal/Yq homology
  JOURNAL   Genomics 7, 625-628 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.M.C.Fisher, 03-APR-1990.
FEATURES       from  to/span     description
    site          1      388     low copy flanking sequence
    rpt         389      918     sex chromosome repeat
BASE COUNT      207 a    242 c    226 g    243 t
ORIGIN      Chromosome Yp.
        1 gaattcaggc ctcagtgtct gtctgtaacc caacagacgg tgtctgcaga gatcgaagta
       61 ttttgtcgtc gaagaggaag gaatgatcat tcatcacaaa aagcaagaca tctttggtgc
      121 aaggaaaact cgaggaaaat accgcagacc atgcaatgag gcactggttg acggtgtgtt
      181 ataaacccgt cttcccagag tggcatgcac acggatccct caggacatgg gtgacacaca
      241 gactatgctt cagcaggtct gtctgggccc aagacacagt gtttctcatc agctcccagg
      301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct
      361 tcccagagtt gttacaatgc aatcccaaag actgggcagc ttaaacaaca accttgattc
      421 tcccacagtc ctggaagctg gaagtctgag atcaaggtgt gggcagggcg gttcctcctg
      481 agtcctctct cctgggcttg tagatgccgt cttctccctg agtccccacg tggtcatccc
      541 tctgtgtgcg tctgtgtcct catctcctct tcttatgagg tgtcttagtc catttcaggc
      601 tgctgtcaca gcataccata gactgggtgg cttataagca acagacattg attctcccac
      661 agccctggag gctggacgtc ttgagatcag gatatgggca aggctgtttc ctcctgaggc
      721 ctctgtcctg ggcttgtaga caccatcttc tccctgtgtc cccacgtggt catccctcta
      781 tgtgcatgtc tgtgtcctca tctgctcttc ttatgagatg tcttagtcca ttgcaggctg
      841 ctatcacaga ataccatagg ctgggtggct tacaaaccac agacttttat tctcccacag
      901 tcctggaggc tggaattc
//
LOCUS       IRICAP       2461 bp ds-DNA             VRL       09-AUG-1990
DEFINITION  Iridescent virus type 1 capsid protein gene, complete cds.
ACCESSION   M33542
KEYWORDS    capsid protein.
SOURCE      Iridescent virus type 1 DNA.
  ORGANISM  Iridescent virus type 1
            Viridae; ds-DNA nonenveloped viruses; Iridoviridae.
REFERENCE   1  (bases 1 to 2461)
  AUTHORS   Tajbakhsh,S., Lee,P.E., Watson,D.C. and Seligy,V.L.
  TITLE     Molecular cloning, characterization, and expression of the Tipula
            iridescent virus capsid gene
  JOURNAL   J. Virol. 64, 125-136 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        601     1995     capsid protein
    mRNA        587  >  2461     capsid protein mRNA ( 5' end +/- 5 bp)
BASE COUNT      717 a    462 c    443 g    839 t
ORIGIN      
        1 gaaggtgttg aaagatctac tgaaataggc ttcattagca tttttatttt gtccacaaat
       61 tcattatttt taataggctg ttcttcacct ttattcgcat attcaaagta atcgattaaa
      121 tttttttgaa tatggacgat atcatccatg aacataaacc aaacttcata atatatagta
      181 tggagtaacg ggttaattaa accattgatt ccttttaatt gttttggatt aatgaggttt
      241 aaatcatcat aaattttttc tatttttttt aaattttttc gagcaatttt taaatttgat
      301 ttaaccaaac aaacttcctc tactttaatt gttacggttg gtacttttaa accattaatt
      361 ttatttttag aggaagaaca acgctttatt aaagcgttgg aatccattaa tcgcttgttt
      421 tatcataggt tattttttaa ctataaaaaa ataactaaat tactacagtt accaatatgt
      481 cggcattagt tctccttcat attttcgtat tttataccct taaatttaac ctaatcaatt
      541 tctacattta tttttgggtt caaaattttt agccgaaata ttgctactaa taaattaaac
      601 atgtctatgt cctcatcgaa tataacctca gggtttatcg atatcgccac ttttgacgaa
      661 atcgaaaaat atatgtatgg cggcccaaca gcaacagcat actttgttag agaaattaga
      721 aagtcgactt ggttcactca agtaccagtt ccactatcta gaaatactgg taatgcggct
      781 tttggacaag aatggtcggt atctatatca cgtgctggag attatttgtt gcagacctgg
      841 ttacgagtca atatcccacc agttactctt agtggtctac ttggtaacac ttactcttta
      901 agatggacca aaaatttaat gcataacttg attcgtgaag ccaccattac ctttaatgat
      961 ttggttgcag ctcgatttga taactatcat ttggatttct ggtctgcttt caccgtacct
     1021 gccagcaaac gcaatgggta tgataacatg attggtaatg tctcttcttt aattaatcca
     1081 gttgctccgg gtggtacttt gggtagcgta ggtggtatta accttaatct tccacttcca
     1141 tttttcttct ctcgagatac tggtgtagca ctaccaacag ctgctctacc ttacaatgag
     1201 atgcaaatca actttaattt cagagattgg catgagcttt tgattttgac taacagtgct
     1261 ctagtaccac cagcaagtcc atatgttcca attgttgtag gtactcatat ttcagctgct
     1321 ccagttttag gaccagttca agtatgggct aactatgcca tcgtctccaa cgaagaacgt
     1381 cgtagaatgg gttgtgccat tcgagacatt ttgattgaac aggttcaaac ggcaccacgt
     1441 caaaattatg tacctttgac caatgctagt ccaacatttg atattcgttt ctctcatgca
     1501 atcaaagcat tattctttgc tgtacgaaat aaaacatctg cagcagaatg gtcaaattat
     1561 gctacttctt ctccagttgt tactggtgca acggttaact acgaaccaac aggttctttt
     1621 gaccctattg ccaatacaac attgatttat gagaacacta atcgtttggg tgccatggga
     1681 tcagattact tctctttgat taatccattc tatcatgctc caactattcc atcattcatt
     1741 ggatatcatt tgtactcata ttctcttcac ttttatgact tggatccgat gggttctacc
     1801 aattacggta aactcactaa tgtgtctgtt gtaccccaag ctagtccggc agcaattgcg
     1861 gcagcaggag gtactggtgg tcaagcaggt tcagattacc ctcaaaatta tgaatttgtc
     1921 atattagctg tcaataataa tattgtcaga atatcaggtg gagaaacacc acaaaattac
     1981 atagcagttt gttaaggtaa tttgtaacgc tccacaacag gcggaagtgg tctcgtgaga
     2041 gaccgatatt gaggttttat caaccttaat ttgaatcatg aattaacatg atactttggt
     2101 accgtctagt cggcttatat gtcgggctaa tggtcttttt tgatcatcaa gtggctataa
     2161 gtggtacgtc gacgacagtc gacacctagt ggtttaataa aggtttttta cccaaattaa
     2221 actggaacag gcaaggttga tgaaaacggt caaaattcag atagtctcgg gggctatttt
     2281 ggacaagacc gtcggtgcag ctaatgcgta agcatcagtg atatcgctat cgactgggtc
     2341 atcaatcggt tgtcctatct gactttttaa agtctcagga tggctcaatg tacagtcagc
     2401 ccgcagtaag gtgtattccg agctgtcttt gaggataaaa gtaaacttga aaaagaagct
     2461 t
//
LOCUS       MUSIGHAAR     363 bp ss-mRNA            ROD       09-AUG-1990
DEFINITION  Mouse Ig rearranged H-chain mRNA V-D-J region, partial cds.
ACCESSION   M33679
KEYWORDS    diversity exon; immunoglobulin heavy chain; joining exon;
            processed gene; variable region.
SOURCE      Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 363)
  AUTHORS   Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J.
  TITLE     Clustered heavy chain somatic mutations shared by anti p
            azophenylarsonate antibodies confer enhanced affinity and ablate
            the cross-reactive idiotype
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by B.Parhami-Seren, 11-APR-1990.  Massachusetts General Hospital,
            Jackson 1402, Blossom Street Receiving, Boston, MA 02114
FEATURES       from  to/span     description
    pept    <     1  >   363     Ig heavy chain V-D-J region (AA at 1)
BASE COUNT       98 a     83 c     89 g     93 t
ORIGIN      
        1 gaggttcagc ttcagcagtc tggagctgag ttgatgaggc ctgggtcctc agtgacgatg
       61 tcctgcaagg cttccggata tgcaatcaca agctacggtt taaactgggt gaaacagagg
      121 cctggacagg gcctggaatg ggttggatat attcatcctg gaaaaggtta tattcactac
      181 aatgaaaaat tcaagggcaa gaccacactg actgtagaca aatcctccaa tacagcctac
      241 atgcaggtca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcgttt
      301 tttgacattt acatgtatta ctttgactac tggggccagg gcaccactct cacagtctcc
      361 tca
//
LOCUS       MUSIGKABF     324 bp ss-mRNA            ROD       09-AUG-1990
DEFINITION  Mouse Ig rearranged L-chain mRNA V-J region, partial cds.
ACCESSION   M33678
KEYWORDS    immunoglobulin light chain; joining exon; processed gene;
            variable region.
SOURCE      Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J.
  TITLE     Clustered heavy chain somatic mutations shared by anti p
            azophenylarsonate antibodies confer enhanced affinity and ablate
            the cross-reactive idiotype
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by B.Parhami-Seren, 11-APR-1990.   Massachusetts General Hospital,
            Jackson 1402, Blossom Street Receiving, Boston, MA 02114
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig light-chain V-J region (AA at 1)
BASE COUNT       96 a     77 c     73 g     77 t      1 others
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagntgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctactac acatcaaaat taaagtcagg agtcccatca
      181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagtga cctggagcat
      241 gaagacattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga
      301 ggcaccaagt tggaaatcaa acgg
//
LOCUS       MUSTCVYAN    2567 bp ds-DNA             ROD       09-AUG-1990
DEFINITION  Mouse T cell receptor rearranged beta-chain gene, V-2 region, 5'
            end.
ACCESSION   M33500
KEYWORDS    T cell receptor; beta-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c) DNA, hybridoma B.1.1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2567)
  AUTHORS   Ratanavongsiri,J., Igarashi,S., Mangal,S., Kilgannon,P., Fu,A. and
            Fotedar,A.
  TITLE     Transcription of the T cell receptor beta-chain gene is controlled
            by multiple regulatory elements
  JOURNAL   J. Immunol. 144, 1111-1119 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       2544  >  2567     T cell receptor beta-chain V-2 region
                                 precursor
    sigp       2544  >  2567     T cell receptor beta-chain signal peptide
    mRNA       2478  >  2567     T cell receptor beta-chain mRNA
BASE COUNT      708 a    560 c    583 g    716 t
ORIGIN      
        1 ctaaagttct tggctactgt tgtgtgcact ttgagtaatg attaagatgc attgggacag
       61 ggggtggaga aatgtcccaa ggaggtagcc atgacctcca acactggtcc tgtggaggcc
      121 ccgaggagct agctagccat ctgatctgga aacaagaggc ttaacctggc tcagtactga
      181 aagctggtca agataagagg gggcaggcag atacctggag gcactgacct tgggaggcag
      241 gaaggttagc aagggagata actggagtgt gagagacatt ctgatcccaa tcttgttaga
      301 ggattaggct gaagagggtt cagtgtgaag ctcagtaaac tgagaagggc ctaggtttcc
      361 ttctcctgga gtctgcttgg ctggacagag cacactgtcc ttagaaaagc aacagagctc
      421 tcctggagga gctaggagcc actgacttca gacccaggga atatcttctc taccctcttc
      481 cttctggctc ttaaggaggc tcacagggag cttatttagc tttttaagga gatttataga
      541 ggctggagga acttgttttt tcaaaagtaa atgctctaga aaaatgaagg ttgaaggtgt
      601 tatcaaactt gtgggtcaaa gctaaatgaa aaaaaaaatc aaaagaagga catgtctatt
      661 cccaacataa gcagaagact tttattataa atatggtggg agaccatagt cagagacaga
      721 gacagctggg aaaggccagc atgaacttga ccctgagcct ggacatctga ggacttgggg
      781 gagcaggtgg gaagaaagaa gagagaaaag agagaagagg ggagaccagg agagtaaaga
      841 gtagacaaaa ggacagcata gcaaaaatag ctggatttat aggggaaggt agctggggaa
      901 aaggcagccc atcccctggg ctggagaagt ttagattaga gggtctgtat tctggccata
      961 tcatatacta ggtaggacta aggaatgctg agtgaagctg gcatccaggt ccacaatgac
     1021 atgttaaata agaacttcag ttagccattt gctttgggat tgaggcataa taaacgccag
     1081 taccccaagc cagctctgtc cacttgtcct cagtaagtga acttaaacag ccaaaccagt
     1141 aatctaaata actaactaac taactaacta aatcaatcaa tcaatcaatc aataaaagta
     1201 gaaaagattt tttcagtgta aacacattgg taacatggaa aaagatccag agatccagta
     1261 aactccctgt gtcagtcttg gggacctgca ggcaagatgg aagtttagag ggccaaggat
     1321 aagcaatcta gctcaaagta tggtcctgcc ctgcattgac ccattgccta ggcttgttaa
     1381 agctgtgtga aatctctttc caggagatac attcccactc tcgctggtgc ctttcctttc
     1441 ttccatgttt tcctggggaa atttctcttt ctttggggtc acttttatca atagcctgct
     1501 gttcagattg aaagactgtc tctttagaat gtctttattt ctgccaggtc agttatagaa
     1561 agtggcatgt tttcctttat tcaggacaaa actcccattt tgattttctg cttgcattcc
     1621 tggagtcaga cagatgagta ttcactgcat acagcctcgt ataaccctgc aaccacctcc
     1681 acatgttcac ttaaatggag acattttact ctcttgcaag agcttgaaac tcaaactcag
     1741 atctgtgaaa ctataaatcc agtttccttc catccctgct cctggagtga tgaccctgag
     1801 actaattatc aataaatgcc tagagcataa gctccagcta gttctctgac ttgctctcaa
     1861 cttattatgc cttttattct aacccagctt tagctacatg gctggtttcc tctccttgtc
     1921 ttcttacttc agtctcctca gcattacagc tcgaatctct gttctatttc tcaagttcct
     1981 ctacctgctg gattatgtcc ttttcctcag tgttccaggc aatctctact tttattctat
     2041 cttgagtgac tagttacttc tgctcagctc ccatgattct gacctcctgt gttttgcagg
     2101 caaatcttcc atgccctctc ctactatttc ccagaattct ctctattcct gctggatgtc
     2161 ccacctactt cctgcatcag ctcattggcc ataagctttt ttattgacag gtgatactta
     2221 acacatatca cttccaggaa tatctgttca ccactgagaa gatgcagggg cccagtcact
     2281 gcactcagtt ctgtagtgag tgtacaatgt gcatgagtgt ggatgagaga gcattgctca
     2341 gaccacagga aagggtgcaa accttcagtt tgaggttttc actttagagg aaagcttagt
     2401 cagtttcctg aggaagtcac accctttgga acctcagccc caagacttaa gtttctcgtt
     2461 accaccttac tggtttggat tctcttctct tgcctgatgc cctgcatgcc ccacagagat
     2521 agagagaacc tgaggtctca gagatgtggc agttttgcat tctgtgc
//
LOCUS       R751TRA       578 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  Plasmid R751 traJ and traK genes, 5'end.
ACCESSION   M25422
KEYWORDS    inverted repeat; transfer origin region.
SOURCE      Plasmid R751 (strain HB101, Inc P-beta) DNA.
  ORGANISM  Plasmid R751
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 578)
  AUTHORS   Lanka,E. and Euerste,J.P.
  TITLE     Conjugative transfer of promiscuous IncP plasmids: Interaction of
            plasmid-encoded products with the transfer origin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            E.Lanka, 17-JUL-1989.
FEATURES       from  to/span     description
    pept         48  <     1 (c) traJ protein
    pept        403  >   578     traK protein
    signal      243      211     promoter PL
    signal      266      294     promoter PR
    rpt          49       64     inverted repeat
    rpt         118      157     inverted repeat
    rpt         296      331     inverted repeat
BASE COUNT      141 a    168 c    163 g    106 t
ORIGIN      
        1 cggccgtgtt ccttttcgtc gttctccatg cctcgcctcg tctctcatgc cggcggtagc
       61 cggctgcctc gcagagcagg atgacccgtt gagcgccccc ggcgcgaata agggacagtg
      121 aagatagata accggctcgc cggttagcta acttcacaca tcctgcccgc cttacggcgt
      181 taataacacc aaggaaagtc tacaccagcc attacgattt atccgcaact atcgcgctat
      241 caggccgcaa aagcagcaac ggatatagcg aaacccgcca caatggccca taatgccgct
      301 atcgaagcgt gccaatgcac gccgatagcg gactttttgc gtttccgtag cgccgcttag
      361 tagcgttaca tttgcgatga gaggattaga tggacgaaca cgatgccaaa gacctacccc
      421 gaagagctgg ctgaatgggt gaagggacgg gaagccaaga agccgcgcca ggacaagcac
      481 gtggtcgcgt tcctggccgt caagagcgac gttcaagcgg cgctcgatgc gggctatgcg
      541 atgaaaacga tctgggagca catgaaggaa accggccg
//
LOCUS       RP4TRAB       571 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  Plasmid RP4 traJ and traK genes, 5' end.
ACCESSION   M25423
KEYWORDS    inerted repeat; transfer origin region.
SOURCE      Plasmid RP4 (strain HB101, IncP-alpha) DNA.
  ORGANISM  Plasmid RP4
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 571)
  AUTHORS   Lanka,E. and Euerste,J.P.
  TITLE     Conjugative transfer of promiscuous IncP plasmids: Interaction of
            plasmid-encoded products with the transfer origin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            E.Lanka, 17-JUL-1989.
FEATURES       from  to/span     description
    pept         26  <     1 (c) traJ protein
    pept        394  >   571     traK protein
    rpt          48       63     inverted repeat
    rpt         118      157     inverted repeat
    signal      219      192     promoter PL
    rpt         281      318     inverted repeat
    signal      246      272     inverted repeat
BASE COUNT      135 a    148 c    181 g    107 t
ORIGIN      
        1 ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc ggcggtagcc
       61 ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata agggacagtg
      121 aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg ctgacgccgt
      181 tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata tcgtgcgaaa
      241 aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta tgcagcggaa
      301 aagcgctgct tccctgctgt tttgtggaat atctaccgac tggaaacagg caaatgcagg
      361 aaattactga actgagggga caggcgagag acgatgccaa agagctacac cgacgagctg
      421 gccgagtggg ttgaatcccg cgcggccaag aagcgccggc gtgatgaggc tgcggttgcg
      481 ttcctggcgg tgagggcgga tgtcgaggcg gcgttagcgt ccggctatgc gctcgtcacc
      541 atttgggagc acatgcggga aacggggaag g
//
LOCUS       STAREPEBR    2389 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  S.aureus ethidium resistance (ebr) and replication protein (repA)
            genes, complete cds.
ACCESSION   M33479
KEYWORDS    ethidium resistance protein; replication protein.
SOURCE      S.aureus plasmid DNA.
  ORGANISM  Staphylococcus aureus
            Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; 
            Micrococcaceae.
REFERENCE   1  (bases 1 to 2389)
  AUTHORS   Liao,J., C,-H., Moghazeh,S.L. and Projan,S.J.
  TITLE     Genetic mapping and nucleotide sequence of pWBG32, an ethidium
            bromide resistance plasmid naturally occurring in Staphylococcus
            aureus
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.J.Projan, 30-MAR-1990. Public Health Res Inst, 455 First
            Avenue, RM 1166, New York, NY 10016
FEATURES       from  to/span     description
    pept       1153     1476     ethidium resistance protein (ebr)
BASE COUNT      796 a    403 c    290 g    900 t
ORIGIN      
        1 ggtcaatatc tttaagataa tctaaatcgc cattttttaa tttatttctt gcgtctttaa
       61 ataatccaga ataaacaaga atttgtttcc ctttaagaga tttataaaat gcgtcgaaca
      121 ctttctgatt aattaaatag tcactatcct taccagaata tttagccatt tcatataatt
      181 ctttattgct attttgctta attttttgaa catgaacttg cgtaatttca gaaattcctg
      241 ttacatctcg ccataaattt aaccattctt tttgactaat ataagctttt gtatctttaa
      301 aatatgattt attaacggcc atcaaaacat gaaaatgcgg attataatca tcacgctttg
      361 agttatacgt tatctctaat tttcttacat aacctttagt gatcgcattt acttttttgc
      421 gtttaaacat cttttgaaag gcatgattat aattcttaat ttcactttct aaatgctcat
      481 ctgtaacgtt tggtgtcgta agtgtcaaaa agataaattg cttatcttct tcttgcttaa
      541 tatattgcat cattaacgat aatcctaatg catcttttct tgctttacgc cacgcacata
      601 ccggacaaaa tcgattctta caaggattcg atttatataa tttctttttt tcaaattttt
      661 tatccgtcac aaaagacaaa aatgtattac aatttttaac caaatccatt tgatctcccc
      721 gatatgacgt tcaataaaat ttttaaatac ttgatttctt tgctttttct cagtatactt
      781 ttccatacga taatacacaa aaacaactta gttttctcaa aaactatgca taaaaaagtt
      841 gcttttttct ccttttcttt ttttttcgtt tggattagac acctaaaacg atacaatagt
      901 atgctagaaa aagcaacttt ttttgtgctt caaaccagtt ataccaatga attgaaaggg
      961 ttatacatcg ccgggaatag ttacccttat tatcaagaca agaagaaact cgttttcaac
     1021 tcgtttcaaa aacctttcaa aaaccatcaa tccacaaaaa taccacgcga atgacactca
     1081 aaatacaaga ctacaattaa aaaatactta gaataaaatt aaataaaata cgaaaattaa
     1141 aaggagttaa aaatgcctta tatttattta ataatagcca taagtactga agttattgga
     1201 agtgcatttc ttaaatcttc agaaggcttt tcaaaattta taccatcctt aggaacaata
     1261 atttcatttg gaatttgttt ctatttttta agtaaaacaa tgcaacacct accactaaat
     1321 ataacttatg caacttgggc gggactaggt ttagtcttaa caaccgtagt ctcaataatt
     1381 attttcaaag aacaaataaa tctaataact atagtatcta tagttttaat catagtcggc
     1441 gtagtttcgt taaacatttt cggaacatcg cattaattgc tttattccaa ttgctttatt
     1501 gacgttgagc ctcggaaccc ttaacaatcc caaaacttgt cgaatggtcg gcttaatagc
     1561 tcacgctatg ccgacattcg tctgcaagtt tagttaaggg ttcttctcaa catcaataaa
     1621 ttttctcggc ataaatgcca tgctataata gatacacgtc ttctcttagc gtttcatagt
     1681 attatcctcg tttattatac ttataattat aggggaaggc ttagagctat cattttgata
     1741 gctctttatt tttgttcaaa catttattca aaatcagaat gcctttattt tttaatttta
     1801 aggggtattt tgaagaatta agggttattt atatagtttt atacctaaaa acttatatcg
     1861 gctcttaaaa cgcaaataag agccgaataa aaataattgc ttttcacaaa caaaaatttg
     1921 agcaaaacca gtgttgaatt ttttagacac tgcccatcta catgcaaatt taaaaattgg
     1981 cataaaaaat gggcaaccat gctggttgaa cgctatagtt cctgcagggg caaaaaagca
     2041 taaaaaaacg ctagctttga tgagctaacg ttagttataa aattcagtaa tatgcttttg
     2101 taattcaata gattctcttt cttttttagc ttgtcttttt ttaaaacctt ctgaatttct
     2161 agaagcctta tatatatcca ttattttttt ataatcaatg tcgtaaccat atttttgtaa
     2221 ctcttctaca aaaaacttat cgcaatttaa tatcattttt cttcctcgat ttcgtttatc
     2281 atttgatgat ttattttttc tttttcttgt tcagttaaat cataaatttc acttgctaag
     2341 tattcttttt gattccaaat ataaaaaatt tgataaatat attcagtcg
//
LOCUS       XANAVR       2100 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  X.campestris avirulence protein (avrBs1) gene, complete cds.
ACCESSION   M32142 J03672
KEYWORDS    avirulence protein.
SOURCE      X.campestris (strain E3, race 2, pv. vesicatoria) DNA.
  ORGANISM  Xanthomonas campestris
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 2100)
  AUTHORS   Ronald,P.C. and Staskawicz,B.J.
  TITLE     The avirulence gene avrBs-1 from Xanthomonas campestris pv.
            vesicatoria encodes a 50-kD protein
  JOURNAL   Mol. Plant Microb. Interact. 1, 191-198 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            P.Ronald, 15-FEB-1990.
FEATURES       from  to/span     description
    pept        308      622     ORF1
    pept        713     2050     ORF2
BASE COUNT      656 a    423 c    505 g    516 t
ORIGIN      
        1 ccattgtcgg cggttatccg ggtacttggc gtacaccaaa caactggggc aatgctggca
       61 aatcacgtga cgaagccttg gcagacgagc aacagaggat tcaagcgctt aaatcgcaag
      121 agacggtaca tatcttccat cgcaaagatg tcaagagcga acccgcaacc cacgcggggc
      181 gacgttaagt aagccactga tttttagcga agaagagctt gtgagagctg cgggcgccaa
      241 atatgtacgt ttgacagtga cagatcatct ttcaccacgg gcggacgata ttgatgcgtt
      301 tattgcaatg gagcgggaga tggcccatga tgagagactg catgtacatt gtggtatggg
      361 cctaggccgt acgacaatat ttattgtcat gcatgacata ctaagaaatg ctgcaatgtt
      421 atcgtttgat gatatcatcg aacggcaacg taaatttaat ccagggcgaa gcttggataa
      481 taataaagac gtttctgaca aggggcgctc agaatttcgt aatgaacggt cagagttcct
      541 tcctctattc tacgagtacg ccaagcaaaa tccaaagggc cagccattgt tatggtccga
      601 atggctcgac cacaatgcat aaatcgcaag tacattttcg gctatgacgg acttgtgctc
      661 gatgcgctgg cggctttctc gataaatatc aattaatata aatatcgaac taatgtccga
      721 catgaaagtt aatttctctt caaaaataat agattcaaca cccagtgaag aggaggtcgc
      781 cactcagcaa gatagttata cgaaatctgg actggtggcg ccatcgctcg attcacaagc
      841 cttgaaaaaa gcacctagaa aaagagtaat aaaagaaaat atagctgctt tgcacacctc
      901 atcgttagag cgagttcatc aaaagaaggt attagttcag aatttagcgc agttgcagag
      961 agggttggct aagataaatg gtagagtcga actcgaagag ctaattgatg gattttcagt
     1021 caaggaattg ctaataaaaa gaaatccaaa gattgctgaa gagtatggag aaggaaatcc
     1081 tttaatgatt cgatctctaa gattttcaaa cccccaagag gtgactagta agcttggggc
     1141 ggaaggaaaa acgccagcca aaagagaggt tgatacgatt tgcaataaat ccacgctgca
     1201 tgacattgtc atgacgcccg cctcccttgt aaaaaaggaa gtgcggatga acctgatatc
     1261 tgaagtccca agggcgaagg ataaacaaaa atacagaggt cttccttcag tcgtatatgg
     1321 ccaaagcagc cgccgtagtg aatcagacta tctaacgtct cgaaatggtt tcggcgacgt
     1381 gcactctttg aaatccaata acgcatttaa ttccgactac gaaaaaatat gtgggtcgct
     1441 tagccatgcc gaaaagttgg ggttaattga aaggaatctt actcccttta taaggcatga
     1501 tccagataga atctccaccg actttgttca ctctattgaa gaattggctg aacaccagat
     1561 gctattgcaa tcaagaaaac ctgccagtgc tttgcggcat aatgaatatt gcaccaagct
     1621 tgaactgtgg gatgctaaag ctatagcagt tggtgaatct cgtgccttgg cggtcgctac
     1681 cctgattgaa tttaatttgg agatgttgtc gatagcacaa gagatagatg atgatgggca
     1741 caagagtaaa atggtcgccg attttatcga gcgccaacta tcatggcttg gcccacaaac
     1801 cgcacttgac agcaagtcaa cgcttgaaag ggtttcagcg gtgaccatac aagaaaggga
     1861 atttatcgct aatgagatta gccgatcgtt gcgtcaaggt gtttcacttt gcacttacga
     1921 taaagatgaa gcaggaagtc atatccgtga aatgagtttg ttggatttta gggttgaaga
     1981 aatcatagag gggataagta tttttatttc ctccaagctt ttacatgtta caaatgcagg
     2041 agaagcgtaa gagaagaagt atccgccaca atcgtgcgac ggaccgacgt cctaacgccc
//
LOCUS       YSCSCD25     5055 bp ds-DNA             PLN       09-AUG-1990
DEFINITION  S.cerevisiae SCD25 gene, complete cds.
ACCESSION   M26647 M31771
KEYWORDS    Ras protein; SCD25 gene; cell division cycle.
SOURCE      S.cerevisiae (strain OL136) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 2129 to 5055)
  AUTHORS   Boy-Marcotte,E., Damak,F., Camonis,J., Garreau,H. and Jacquet,M.
  TITLE     The C-terminal part of a gene partially homologous to CDC25 gene
            suppresses the CDC25-5 mutation in Saccharomyces cerevisiae
  JOURNAL   Gene 77, 21-30 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3880)
  AUTHORS   Damak,F., Boy-Marcotte,E., Le-Roscouet,D., Guilbaud,R. and
            Jacquet,M.
  TITLE     SCD25, a CDC25 like gene, which contains a RAS activating domain is
            a dispensable gene of Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990) See COMMENT for author address
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by E.Boy-Marcotte, 02-AUG-1989, for [2] by F.Damak,
            01-FEB-1990.  Laboratoire IGD, Groupe des laboratoires de biologie
            cellulaire, Centre universitaire d'Orsay, 91405-Orsay Cedex
FEATURES       from  to/span     description
    pept        128     3880     SCD25 protein
    pept       4319  >  5055     ORF X
BASE COUNT     1638 a    973 c    900 g   1544 t
ORIGIN      
        1 ctgcaggctc gcaaaattta aggttccctt ctacaatagt agtcaaaatt gcttttttgc
       61 atataacaaa gtgaaaaaaa aaaatatgag agacatatct aaaagacata tataatctgc
      121 caccataatg agttgcactg cgtcatatgc cggcatgaca actccggtga aagataagga
      181 aggccacggg attccatgct tacaacctat cgatgtagtg gaatgtacct atcaatattt
      241 tacaaaatca cggaataaac tgtctttaag ggtaggcgat ttgatttacg tactcactaa
      301 aggttctaat ggctggtggg atggtgttct tatcagacac agcgctaata ataataataa
      361 taattcgttg atactagaca gaggttggtt ccccccttct tttacacggt ccattctaaa
      421 cgaactacac ggggtgcctg acatcggtaa tgaattggaa atatttcaag cgggtcttaa
      481 tcttaaactg gaattatcaa gcaacccagt gatcttatca ttggaagact ttttagactg
      541 ctgtcgcgat attgaattca aggaacaact ggcttggtca cctactcccg tccacgaaag
      601 gaaaggctgc tgtgagctgc tgtactataa ccaggattta gatgtttatt gtcgcacgtt
      661 accatattta ccacaaaatc aagttgaaac cgtgaacgac tattcgtctt ttcctgcaat
      721 atcgaagatt gctggtaaaa agatgcctat aacgtcaagc cccgatctgt tctatctcaa
      781 tgattgtgat gtcgtctatt ggtatgacct cactcgctta gtgtgtcatt atgttaattt
      841 aacagagcgc gacctattgg caaatgaacg ggaaaagttt ctaacttcct tggatttatt
      901 aacagctcaa ataacctatg tttatatgct tttcaggaat ctccgtttag ttgaagatag
      961 tttcaaaaaa accctcaaaa aactaattta caccttgtct aggttttcaa taaatgcaaa
     1021 tatttggttt cattccacat cgtttgaaga aagagaagcc atagcctccc agaaggatcc
     1081 agaaagaaga tcccctcttc tacagtcaat cctaggaacc ttccaaaaat ttcattttct
     1141 actgcgtcta ctacatttcc tctcaaatcc taacgaactt acaatactgc ctcaattgac
     1201 tcctcgattt ttcaaggatt ctttcaatac aatttcatgg aataacccgt ttttgcgtac
     1261 agtcttcaac cagcatatgt ccatgacctt accgagacag atgattaaag ccgttgctgg
     1321 cgcttcagga attgttgcgg aaaatattga tgaaattcca gcttccaaac agggcacttt
     1381 catctcgtca gaaacgtctc accattcacc atcagccccg tttcaaagaa ggagaagagg
     1441 taccattttc tctaatgtgt caggaagttc cgatgagtct gacaccatat ggtccaaaag
     1501 gaaaaaacca tacccgctaa atgaagaaac tctaagcctt gtaagggcca ggaagaagca
     1561 gcttgatggt aaactaaaac aaatgatcaa aagtgctaat gaatatctca gtaacacggc
     1621 taatttcaaa atgttgaatt ttgaaatgaa cttcaaaacc tacgaagaag taagcggaac
     1681 aattcctata attgatattc tggaaaacct agatttaact atttttctaa acttgagaga
     1741 gttgggagat gagaatagag tttttgacga agatgtcttt gacgaagatg tcgctattgg
     1801 tgatgaagat aaagagtttt tgaaacactc tttatcatcc ctatcgtata tcttatccga
     1861 ctattttaat atgaagcaat attttcatga attgtcgccc acgcatttga cattagagga
     1921 tcctttcgtt ttctcgccaa tgcaaaacga cttgcctacc ggttattatg aaccaatgaa
     1981 accttcatcc ttgaatttag ataatgccaa ggataagaag aatgggagcc aaaatactga
     2041 tatccaagag gaggaagatg aatatgagcc agacccggat agtcttattc tcttccacaa
     2101 cctcatcaat caagattctg atttcaatga tctaaagttt tttaatctcg cccacgtttt
     2161 taaaaaatcc tgtgatgact attttgatgt gcttaaacta gccattgagt tcgtgaatca
     2221 attaattcta gaaagagaga atttgttaaa ttatgctgct agaatgatga aaaacaatat
     2281 cacggaattg ctattgcgcg gggaagaagg ctatgggtcc tatgacggcg gtgaaactga
     2341 aaaaagtgac acgaatgctg tttatgcaga ttcagatact aaagacaatg acgaatggcg
     2401 tgacagccaa gtcaaattac cgaggtattt gcagcgcgag tatgacagtg aactgatttg
     2461 gggctctaac aataggatta aaggtggttc taaacacgca ctgatctctt acttgacaga
     2521 taatgaaaag aaggacctat ttttcaatat tactttttta atcactttca gaagcatctt
     2581 tactacaacg gagtttttaa gctacttgat ctcgcaatat aatttggatc caccagagga
     2641 tttgtgcttt gaagaataca atgaatgggt gacgaaaaag cttataccgg ttaaatgtag
     2701 ggtggttgag attatgacaa cctttttcaa gcaatattgg ttcccgggct atgatgagcc
     2761 cgatcttgcg accctaaatc tggattattt tgcgcaagta gcaatcaagg aaaatataac
     2821 aggatctgtg gaattactaa aggaggtcaa tcagaagttt aaactaggta atatacaaga
     2881 agcgactgca ccaatgaaaa cgttagatca acagatctgc caggaccatt actcgggcac
     2941 tttatactct accacggaat ccattttggc cgtcgatcca gttttatttg ccactcaatt
     3001 aacgatacta gagcatgaaa tttattgtga gataaccact tttgattgtt tgcaaaaaat
     3061 ttggaagaac aagtatacaa aatcgtatgg ggcttcaccg ggtttgaacg agtttatcag
     3121 ttttgccaat aaactgacaa atttcatatc ctactctgtt gtaaaggagg ctgataaaag
     3181 taagcgcgcc aagctactct ctcattttat ttttatcgca gaatattgta ggaaattcaa
     3241 taacttttct tccatgactg acatcatttc agcattatat tcttcaccaa tttatcgttt
     3301 agagaaaacc tggcaggcag ttattcctca aacgagagat ctattgcagt cactgaacaa
     3361 gttgatggat cccaagaaaa atttcataaa ttacagaaac gagctgaagt ctttacatag
     3421 cgctccctgc gtaccgtttt tcggcgttta tttatctgat ctaaccttta ctgattccgg
     3481 aaatccggat tatcttgtct tggaacatgg tttaaagggt gtccatgatg agaagaaata
     3541 tataaacttc aacaaaagga gcagacttgt tgatatctta caagagatca tatatttcaa
     3601 gaaaacacat tatgatttca ctaaagatcg gacggtaatt gaatgtatat caaattcatt
     3661 ggaaaacatc ccccatattg agaaacaata ccaattatca ttaattattg aaccaaaacc
     3721 aagaaagaaa gtcgttccga attccaattc gaataataaa tcacaagaaa aatccaggga
     3781 tgaccaaacc gatgaaggaa aaacatccac taagaaagac agatttccaa aatttcaatt
     3841 acataagaca aagaaaaaag ctcccaaggt ttctaagtaa cggcgccgta tgttcgattt
     3901 ccttctctcg gtggattaat tattttgttt gttttctcct gttatattat ttattgatca
     3961 ctatagtaaa ctatgtccgt catcaagccc gacggctgct atcccacaat gttgatcgta
     4021 ttgtttgcct agtttattat atatttgctt atttatagca taccataata tttaaatgcc
     4081 ctcaaatttt tggccgtagc gacatcgcga taattccaat tccctttaaa aaattgcgcc
     4141 tgagtataag ttaattcagc cagttctcca aattaaaatc gcatactcct gaacctatca
     4201 acagattgtc ctcgcatact tttctatacc aaggtctctt ctgaacatat attagcagtg
     4261 gttaatttta aagagatcat aaagaaaatt ttgtctaaaa aagattaata taaagacaat
     4321 gtcttcacta gaagtggtag atgggtgccc ctatggatac cgaccatatc cagatagtgg
     4381 cacaaatgca ttaaatccat gttttatatc agtaatatcc gcctggcaag ccgtcttttt
     4441 cctattgatt ggtagctatc aattgtggaa actttataag aacaataaag taccacccag
     4501 atttaagaac tttcctacat taccaagtaa aatcaacagt cgacatctaa cgcatttgac
     4561 caatgtttgc tttcagtcca cgcttataat ttgtgaactg gccttggtat cccaatctag
     4621 cgatagggtt tatccattta tactaaagaa ggctctgtac ttgaatctcc ttttcaattt
     4681 gggtatttct ctccctactc aatacttagc ttattttaaa agtacatttt caatgggcaa
     4741 ccagcttttc tattacatgt ttcaaattct tctacagctc ttcttgatat tgcagaggta
     4801 ctatcatggt tctagtaacg aaaggcttac tgttattagc ggacaaactg ctatgatttt
     4861 agaagtgctc cttcttttca attctgtggc aatttttatt tatgatctat gcatttttga
     4921 gccaattaac gaattatctg aatactacaa gaaaaatggg tggtatcccc ccgttcatgt
     4981 actatcctat attacattta tctggatgaa caaactgatt gtggaaactt accgtaacaa
     5041 gaaaatcaaa gatct
//
LOCUS       ADAMLPA1      630 bp ds-DNA             VRL       09-AUG-1990
DEFINITION  Simian adenovirus 30 major late promoter region DNA.
ACCESSION   M31631
KEYWORDS    promoter.
SEGMENT     1 of 3
SOURCE      Mastadenovirus s30 viral DNA.
  ORGANISM  Mastadenovirus s30
            Unclassified.
REFERENCE   1  (bases 1 to 630)
  AUTHORS   Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T.,
            Lee,S.G. and Hung,P.P.
  TITLE     Conservation of essential sequences in the major late promoter and
            tripartite leader of the simian adenovirus type 30
  JOURNAL   Gene 89, 275-277 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            C.L.Hsiao, 22-JAN-1990.
FEATURES       from  to/span     description
    pre-msg     126  >   630     leader sequence 1 mRNA and intron
    IVS         167  >   630     leader sequence 1 intron A
    signal       95      100     TATA box
    signal      273      292     downstream promoter element
    site         64       73     upstream regulatory sequence
    site        210      220     downstream regulatory sequence
BASE COUNT      128 a    150 c    214 g    138 t
ORIGIN      Map position 16.0-16.6.
        1 acggtgtgca ggcagaggtc cccgtcctcc gcatccaaaa aggtgattgg cttgtaggtg
       61 taagtcacgt gaccttcctt tgggggcggg gggcgataaa agggggcggc gccgtcgtcg
      121 ccgtcactgt cctctgcgtc gctgtggacg atcgccagct gctcgggtga gtagaggcgc
      181 tcgaaggcgg gcatgacgtc ggcgctgagg gtgtcagttt ctacaaacga ggaggatttg
      241 atgttaacct gcccggagcg atgcctttga gaagggcggg gtcgagctgg tcggcaaaaa
      301 caattttttt attgtccagc ttagtggcaa aggacccgta gagggcgtag gtcgtaagaa
      361 gcttcttgct ttttttccca cagctcgcga ttcaagaggt actcttggcg gttctgccag
      421 tactcgggaa gcggaaaccc ctgcgcgtcg gctcggtaag cgcccagcat gtaaaattcg
      481 ttaggcgctg acgatgcatt tgattaactg ctgcgtaggc acttgacgcc aggacctgaa
      541 ggcggagaaa tccaccggat cggagaactt gtcgaggaag gcgtgtagcc agtcgcagtc
      601 gcaaggtaag ctgaggacgg tttccggggg
//
LOCUS       ADAMLPA2      135 bp ds-DNA             VRL       09-AUG-1990
DEFINITION  Simian adenovirus 30 leader region 2 DNA.
ACCESSION   M34220
KEYWORDS    promoter.
SEGMENT     2 of 3
SOURCE      Mastadenovirus s30 viral DNA.
  ORGANISM  Mastadenovirus s30
            Unclassified.
REFERENCE   1  (bases 1 to 135)
  AUTHORS   Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T.,
            Lee,S.G. and Hung,P.P.
  TITLE     Conservation of essential sequences in the major late promoter and
            tripartite leader of the simian adenovirus type 30
  JOURNAL   Gene 89, 275-277 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            C.L.Hsiao, 22-JAN-1990.
FEATURES       from  to/span     description
    IVS     <     1       35     leader sequence 2 intron N-1
    IVS         108  >   135     leader sequence 2 intron N
    site         36      107     leader sequence 2
BASE COUNT       28 a     36 c     36 g     35 t
ORIGIN      About 0.8 kb after segment 1; map postion 26.1-26.6.
        1 aggtcgtaag aagcttcttg ctttttttcc cacagctcgc gattcaagag gtactcttgg
       61 cggttctgcc agtactcggg aagcggaaac ccctgcgcgt cggctcggta agcgcccagc
      121 atgtaaaatt cgtta
//
LOCUS       ADAMLPA3      147 bp ds-DNA             VRL       09-AUG-1990
DEFINITION  Simian adenovirus 30 leader sequence 3 DNA.
ACCESSION   M34221
KEYWORDS    promoter.
SEGMENT     3 of 3
SOURCE      Mastadenovirus s30 viral DNA.
  ORGANISM  Mastadenovirus s30
            Unclassified.
REFERENCE   1  (bases 1 to 147)
  AUTHORS   Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T.,
            Lee,S.G. and Hung,P.P.
  TITLE     Conservation of essential sequences in the major late promoter and
            tripartite leader of the simian adenovirus type 30
  JOURNAL   Gene 89, 275-277 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            C.L.Hsiao, 22-JAN-1990.
FEATURES       from  to/span     description
    IVS     <     1       35     leader sequence 3 intron N-1
    IVS         123  >   147     leader sequence 3 intron N
    site         36      122     leader sequence 3
BASE COUNT       34 a     33 c     53 g     27 t
ORIGIN      About 2.5 kb after segment 2; map position 19.3-19.9.
        1 ggcgctgacg atgcatttga ttaactgctg cgtaggcact tgacgccagg acctgaaggc
       61 ggagaaatcc accggatcgg agaacttgtc gaggaaggcg tgtagccagt cgcagtcgca
      121 aggtaagctg aggacggttt ccggggg
//
LOCUS       TFEMERA      1730 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  T.ferrooxidans mercuric reductase (merA) gene, complete cds.
ACCESSION   M32353
KEYWORDS    mercuric reductase.
SOURCE      T.ferrooxidans (strain E-15) DNA, clones pTM31[4,5].
  ORGANISM  Thiobacillus ferrooxidans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Colorless sulfur bacteria.
REFERENCE   1  (bases 1 to 1730)
  AUTHORS   Inoue,C., Sugawara,K., Shiratori,T., Kusano,T. and Kitagawa,Y.
  TITLE     Nucleotide sequence of the Thiobacillus ferrooxidans chromosomal
            gene encoding mercuric reductase
  JOURNAL   Gene 84, 47-54 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         65     1702     mercuric reductase
BASE COUNT      323 a    542 c    574 g    291 t
ORIGIN      
        1 gcgaccgacg gctgcgaaac gcccgccccg cgtagctgag cacatagaca ctttggagga
       61 tattatgacc gagaacgcgc ccaccgaact cgctatcact ggcatgacct gcgacggttg
      121 cgccgcgcat gtgcgcaaag cactcgaagg cgtgcccggc gtacgcgagg cgcaggtgtc
      181 ctacccggat gccacggccc gggtcgtgct ggagggcgag gtgccgatgc agcggctaat
      241 caaggcggtg gttgcaagtg gctatggtgt gcatccacgg agcgacggtg cctcctccac
      301 aaacgatgga caggagctac acatcgctgt gatcggcacc ggcggagcgg cgatggcgtg
      361 cgcattgaag gctgtcgagc ggggcgcgcg cgtgacgctg atcgaacgca gcaccatcgg
      421 cggcacctgc gtgaacatcg gttgcgtgcc gtccaagatc atgatccgcg ccgcccatat
      481 cgcccacctc cgccgggaaa gcccattcga tggcggcatc caggcggtcg cgccgaccat
      541 ccagcgcaca gcgctgctgg tccaacagca ggcccgtgtc gatgaactgc gtcacgccaa
      601 gtacgaaggc atcctggacg gcaacccggc catcaccgtt ctgcgcggtg aagcgcgttt
      661 caaggacagc cggagtgttg tcgtccattt gaacgatggt ggcgagcgcg tcgtaatgtt
      721 cgaccgctgc ctggttgcca cgggcgccag tccggccgtg ccgccgattc ccggcttgaa
      781 agacactcct tattggacct ccaccgaagg gctggtcagc gaatcgatcc ccgagcgtct
      841 ggccgtgatc ggctcgtcgg tggtggcgct ggaactggcg caagccttcg cccggctcgg
      901 cagccatgtg acgatcctgg cgcgcggcac cttgttcctc cgggaagacc cggccatcgg
      961 tgaggccatc acggcggcgt ttcgcgccga aggcatcgag gtgctggagc acacccaggc
     1021 cagccaggtc gcttatgcgg atggcgaatt tgtgctagcc accgggcacg gcgaactgcg
     1081 cgccgataag ctgctggtcg ccactggtcg cgcaccgaac acacgccgcc tgaatctgga
     1141 agcggcgggc gtggccatca atgcgcaagg ggccatcgtc atcgaccagg gtatgcgcac
     1201 gaacagcccg aacatttacg ccgctggcga ctgcaccgac cagccgcaat tcgtctacgt
     1261 ggcggcagcg gccggcaccc gtgcggccat caacatgatg ggcggtagtg cagccctgga
     1321 cttgacggcg atgccagccg tggtgttcac cgatccgcaa gtggcgactg tgggttacag
     1381 cgcggaagcg catcgcgacg gcatcgaaac cgacagccgc atgacgctcg acaacgtgcc
     1441 gcgggcgctc gccaatttca atacacgcgg cttcatcaag ctggtagccg aagtgggcag
     1501 tggctcgcta atcggcgtgc aggtggtcgc cccggaagcg ggcgagctga tccagactgc
     1561 cgcgctggcg attcgtaacc ggatgacggt acaggaactg gctgaccagt tgtttcccta
     1621 cctgacgatg gtcgaagggc tgaagcttgc tgcccagacc ttcaccaggg atgtgaagca
     1681 gttgtcctgc tgtgcgggtt gagacggatt gataaaggag tccctgttgc
//
LOCUS       MMTELPMA      830 bp ss-RNA             VRL       09-AUG-1990
DEFINITION  Mouse mammary tumor virus (MMTV) phorbol myristate acetate induced
            mRNA, clone 14.
ACCESSION   M37198 M19737 M19738 M22729
KEYWORDS    .
SOURCE      Mouse mammary tumor virus, cDNA to viral RNA, clone 14, passed in
            EL4.E1 cells.
  ORGANISM  Mouse mammary tumor virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses.
REFERENCE   1  (bases 1 to 202; 692 to 720)
  AUTHORS   Elliott,J.F., Pohajdak,B., Talbot,D.J., Shaw,J. and Paetkau,V.
  TITLE     Phorbol diester-inducible, cyclosporine-suppressible transcription
            from a novel promoter within the mouse mammary tumor virus env gene
  JOURNAL   J. Virol. 62, 1373-1380 (1988)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 830)
  AUTHORS   Paetkau,V.H.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept        168      575     PMA-induced transcript
    mRNA          1  >   830     PMA-induced transcript mRNA
    revision      1        1     c in [2]; g in [1]
    revision     39       39     g in [2]; a in [1]
    site         94       95     intron site
    revision    119      119     a in [2]; g in [1]
    site        706      707     site of 491 bp deletion relative to MMTV virus
BASE COUNT      233 a    183 c    184 g    230 t
ORIGIN      
        1 cactgccaga tcgcctttaa gaaggacgcc ttctgggagg gagacgagtc tgctcctcca
       61 cggtggttgc cttgcgcctt ccctgaccaa ggggtgcctt gcgaagagcc ttgaccaaat
      121 gcagtcagat cttaacgtgc ttcttttaaa aaagaaaaaa gggggaaatg ccgcgcctgc
      181 agcagaaatg gttgaactcc cgagagtgtc ctacacctag gggagaagca gccaaggggt
      241 tgtttcccac caaggacgac ccgtctgcgc acaaacgggt gagcccatca gacaaagaca
      301 tattcattct ctgctgcaaa cttggcatag ctctgctttg cctggggcta ttgggggaag
      361 ttgcggttcg tgctcgcagg gctctcaccc ttgactcttt taatagctct tctgtgcaag
      421 attacaatct aaacaattcg gagaactcga ccttcctcct gaggcaagga ccacagccaa
      481 cttcctctta caagccgcat cgattttgtc cttcagaaat agaaataaga atgcttgcta
      541 aaaattatat tttaccaata agaccaatcc aataggtaga ttattagtta ctatgttaag
      601 aaatgaatca ttatctttta gtactatttt tactcaaatt ctgttgttag aaatgggaat
      661 agaaaataga aagagacgct caacctcaat tgaagaacag gtgcaaggat gtgagacaag
      721 tagtttcctg acttggtttg gtatcaaatg ttttgatcta agctctgaat gttctattct
      781 cctatgttct tttgcaactt atccaaggtc ttatgtaaat ggcttagtaa
//
LOCUS       MUSPBGD1     2663 bp ds-DNA             ROD       09-AUG-1990
DEFINITION  Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 1.
ACCESSION   M28663 M29949 J04981
KEYWORDS    hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT     1 of 4
SOURCE      Mouse (strain C3H) DNA, clone PBGD.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2663)
  AUTHORS   Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
  TITLE     The mouse porphobilinogen deaminase gene: Structural organization,
            sequence, and transcriptional analysis
  JOURNAL   J. Biol. Chem. 264, 14829-14834 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Grandchamp, 06-OCT-1989.
FEATURES       from  to/span     description
    pept        505      537     porphobilinogen deaminase (housekeeping) exon
                                 1 (EC 4.3.1.8)
               2191     2307     porphobilinogen deaminase (housekeeping) exon
                                 2
               2457  +  2510     porphobilinogen deaminase (housekeeping) exon
                                 3
    pep$       2475  +  2510     porphobilinogen deaminase (erythroid sp.) exon
                                 1
    pre-msg     341  >  2663     PGB deaminase (hk) mRNA and introns
    IVS         538     2190     PGB deaminase (hk) intron A (no splice
                                 consensus)
    IVS        2308     2456     PGB deaminase (hk) intron B
    IVS        2511  >  2663     PGB deaminase (hk) intron C
    pre-msg    2192  >  2663     PGB deaminase (ery.sp.) mRNA and introns
    IVS        2511  >  2663     PGB deaminase (ery.sp.) intron A
    binding    1126     1133     NPE binding site
    binding     312      317     Sp1 binding site
    binding     329      334     Sp1 binding site
    site       2101     2107     CACCC box
    site       2118     2124     CACCC box
BASE COUNT      622 a    661 c    707 g    673 t
ORIGIN      Chromosome 9
        1 ccacccccac cccacacaca cacacaaagt aaatagggct ggagagctta gtggttaaga
       61 gcactgactg ctctttcaga ggtcctgagt tcaattccca gaaaccacat ggtgctcaca
      121 accatctgca atagggtctg atgccctttt ctggtgtgtc taaagaagag agcaatggtg
      181 tactcatata cataaaataa ttttttttaa aaagtaaaag ataataaaaa ttgaaaagga
      241 aaaaaatctt tttgagttgt tctgtgcagt ggacttgagc gaaaaggctg gctatgtcgc
      301 aatcctaatt cccgcccaga ggaaggcacc gccccgttga gggagggcag cggacgtgac
      361 gcagagctca gcaggtcctg cagccggagt gaagtgcggg ctcgggcccc atgtgccttc
      421 agtcccggcc ggcccaggtc gtcggcttct gcagacacca ggggaccgca gcggcactgc
      481 cgcgcctgcg ccctgggcgg agtcatgtcc ggtaacggcg gcgcggccac aaccgcggtg
      541 agttctgagc cggtgaccga tgacccgcac ttctcggggc tttctgggtg caacgattgg
      601 ccccgggttg ccatgttctc gtcgtctatt ggtcggaata gttagctgtc atttttcccc
      661 ccccacacct caaggttttt tttaaagggc cagtaactag gttgccctaa ggcagggaag
      721 gagtgatctc gagcagtggg ggcggggttg tgagtggaaa ggtggtccgc cctgggattc
      781 catccctgta ggctctggct ggatctctgt tgttcccgac cagtaaagga ttatgcacag
      841 acaagatcct tttcacgaag aaggggctga ggcaaatcca gctatctcgg aatacgatcc
      901 acttcattca ggggagagca caccccactt cttaaaactg tatacaaaca tcttggaggt
      961 tacacgcctt ctcccgttct ccgttatgaa gtcacccagc cttagccacc cacaaaagtc
     1021 ctagtagaga cacacctgaa ttgctattgt gagcggggga acccacccct gggccttgtc
     1081 atttctggcc tgcctggaaa gttctgaact tgtgggcagg ctgcctgaga taaggctgag
     1141 ctgggaagct tgcttatctc ctgcccaggc agtaagcagt agtcttggct atgaaaacat
     1201 ttttagagca ctgggttagg gtaggaaggc ctggatttca gcacccactt tctgtctgtt
     1261 catagctgtg agatgtttag acagtaattt gaccactctg catctttgct tctgtgacac
     1321 gggtggaagt acctaccctg tctaacctag tagggttgtt gcaaggacaa tatgcagaca
     1381 ctgctcaaat gctgttctgg gtcaatcaat taaaaaacaa attgtttgaa cttagcaatt
     1441 cctttctatg ggctccctgt tgtccgaaat ttctgtgtta tttcaagccc agctaaattg
     1501 caaaggctat ctcagagtcg tttgttggag gaatcttcgc agtggagtag actggagtcc
     1561 aagagcaagt tttcaccttc agtgaccaag aacttgagtg tctggttata gaagaacctg
     1621 tgagatgagg aacctggtgc agggaagggg gacaatctgt acagtgactc ctgtcccctt
     1681 tgtatcagac tgcagaaccc agttctacct gcttggccct agacaccttt atccaaggcg
     1741 ccttaacaaa agaaagaggt gtgtcctttt gagctcttgg ctctggctta agacaccaga
     1801 ggaaacccgt aggcaatgac tgttaggcag tttattcttg tagtcttctg ggacttcttg
     1861 aggcatgagg tggcctttaa tttaacaagc ccttgatggg atgatgttcc caaagtcacc
     1921 caccaagggc atgaaagggc tgtacattag cttggttgat ttcagtcctt gttaggagta
     1981 catcctggtg tctcacccag ggcttagtga ggccttctca agtgcctgag ttgttgtgga
     2041 cagtgagctt gttctctagc aatgggaggc ttcagctgtc ctgccccagc ttctgtaggc
     2101 cccaccctcc agcagggccc accctcactg tgccgaggct gatgggcctt atcattttgc
     2161 ccacctggct gtgtgcagcc ctcccactca gaacctcctt ggccaggctg ggctttgggg
     2221 ctcagtgtcc tgttgctgct gccacaacag atcctattac agcttttctt ctggtcttgc
     2281 ttctctggat cccgtagagg gcagaaggta ccaaggaaga ttcaaggacc agtcctggga
     2341 gtctctcctt cctagcagcc tcacctgcct aggacccggg agtcctctct cctaagcctg
     2401 tgatcctagt tctttgaatg aggaaaagat cgtaacctag ggactttctt ctgcaggaag
     2461 aaaacggctc aaagatgagg gtgattcgag tgggcacccg taagagccag gtgagtacag
     2521 acatagcgcg ttgcctcaag aattgtaatg ctcacgggtc actagtggga accaaaggct
     2581 agcatcgagc aaataagagt gtgtgagagt cgatttcatg ggggatggca gctcacttcc
     2641 tctgaaaaga gagtctctgg agc
//
LOCUS       MUSPBGD2     2763 bp ds-DNA             ROD       09-AUG-1990
DEFINITION  Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 2.
ACCESSION   M28664 M29950 J04981
KEYWORDS    hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT     2 of 4
SOURCE      Mouse (strain C3H) DNA, clone PBGD.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2763)
  AUTHORS   Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
  TITLE     The mouse porphobilinogen deaminase gene: Structural organization,
            sequence, and transcriptional analysis
  JOURNAL   J. Biol. Chem. 264, 14829-14834 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Grandchamp, 06-OCT-1989.
FEATURES       from  to/span     description
    pept    +    52      124     porphobilinogen deaminase (housekeeping), exon
                                 4 (EC 4.3.1.8)
                487      536     porphobilinogen deaminase (housekeeping), exon
                                 5
                619      674     porphobilinogen deaminase (housekeeping), exon
                                 6
                983     1060     porphobilinogen deaminase (housekeeping), exon
                                 7
               1971     2048     porphobilinogen deaminase (housekeeping), exon
                                 8
               2143  +  2218     porphobilinogen deaminase (housekeeping), exon
                                 9
    pep$    +    52      124     porphobilinogen deaminase (erythroid sp.) exon
                                 2
                487      536     porphobilinogen deaminase (erythroid sp.) exon
                                 3
                619      674     porphobilinogen deaminase (erythroid sp.) exon
                                 4
                983     1060     porphobilinogen deaminase (erythroid sp.) exon
                                 5
               1971     2048     porphobilinogen deaminase (erythroid sp.) exon
                                 6
               2143  +  2218     porphobilinogen deaminase (erythroid sp.) exon
                                 7
    pre-msg <     1  >  2763     PGB deaminase (hk and ery.sp.) mRNA and
                                 introns
    IVS     <     1       51     PGB deaminase (hk) intron C; ery.sp. intron A
    IVS         125      486     PGB deaminase (hk) intron D; ery.sp. intron B
    IVS         537      618     PGB deaminase (hk) intron E; ery.sp. intron C
    IVS         675      982     PGB deaminase (hk) intron F; ery.sp. intron D
    IVS        1061     1970     PGB deaminase (hk) intron G; ery.sp. intron E
    IVS        2049     2142     PGB deaminase (hk) intron H; ery.sp. intron F
    IVS        2219  >  2763     PGB deaminase (hk) intron I; ery.sp. intron G
BASE COUNT      704 a    605 c    726 g    728 t
ORIGIN      Chromosome 9; 150 bp upstream of segment 1.
        1 gaaaggcagt ggccagggga ggtgagaaac catctgactc tctttcccca gctggctcgc
       61 atacagaccg agactgtggt ggcgatgctg aaagccttgt accctggcat acagtttgaa
      121 atcagtaagt tttcttgaga ggagtgattg gtagtgaacg ggaagccagt gaaccggagg
      181 acagggcatc tctcgtttgc ctgtggtcaa agcctgcctt gtaagactat tctggctgct
      241 tgtgaaggga aagaaagatt gtctcctgtg cacatctcct ccagctgccc gggctagcct
      301 gacatttcca tactttctgc tttgggttct tttatgagta tgtctgcttt ttctgtcggt
      361 gtgtgtatct gagagagtta ggggctgggt cttctatgcc tcagactcca ctgtgaatcc
      421 agtcaaggcc tgaacgaggg gtgactcagt aggtgttaat gggtatctga ttgactctct
      481 cctcagttgc tatgtccacc acgggagaca agattgttga tactgcactc tctaaggtaa
      541 cgccagtcct tgtcccattc ttcttgtccc tctcccacgt gtaaggggtt cactctgagg
      601 ctctctcttg cctggcagat tggagagaag agcctgttta ccaaggagct agaaaacgcc
      661 ctggaaaaaa acgagtgagt gaggatggag gaatgtggta ccccgagcct agaaccccaa
      721 agtggctctc caatattggc aggattgtcg ggttagactg tggagctcac aggctttcac
      781 agagaagaga gccttgcctt ggagtagcct aactacctgg ggaatcagac tgccggggga
      841 aaggggtaga gtagttgaga agagaccagg tcttagatct taagatgcta tcttcctgaa
      901 cggtcaagga tgctggggtg ggtggtggag ataaggtcac ctactcaaag cctctctctg
      961 tgcctccccc tgccgtctcc agagtggacc tggttgttca ctccctgaag gatgtgccta
     1021 ccatactacc tcctggcttt actattggag ccatctgcaa gtaagcgggg aggacatgca
     1081 tgggacggag ggccctgggc aggattaatc ctactgtggg aatctttgag tttttttttt
     1141 ttttttttcc atttggaact taaccgctta gccgtctgtt ttgaaggttc tcagacatag
     1201 tgtggcagga aagccaattg gttgacttgg ttgactattt agagtttgtg gagttgggct
     1261 cagtggcacg gacctgaaat cccagctact gggaggctaa gacaggatca gagattctgg
     1321 gccagcctgg gctacagagg gatttgaacc agcctgagga acttagattg tgccttaggg
     1381 gcacagaagg ctggcttaca gtggcttagg tggtaaaggc attttttgct gtcaagccaa
     1441 tgacctgagt tcagtccgtg gggtgcactt ggtgaaagaa gagggttgaa tcccacaagt
     1501 tgtcatctga ctcatgcata catgctgtag aatgtttatg ctcctcatcc ctcaatgaaa
     1561 atggaaacaa tcaaggaaat gaaatataaa acctgctggg tggtggtgcg cacgcctata
     1621 atcccagcac ttgggaggca gaggcaggtg aattcaacct ggtctacaaa gtgagttcca
     1681 ggactataca gagaaaccca gtcttaaaaa caaaacaaaa ctaaacaaca acaacaacaa
     1741 caacaaaaaa gaaaaaacaa agaaagaaat ataaaacctt tccaaagaaa ataaaatgaa
     1801 tttggcctgg tggctcatgc tataatctca gcattcagag agctgaggca ggagggttat
     1861 tgtgagttaa aggctagctg gggtacagag aaaattttag gtcacctggg ctagagttaa
     1921 ccctatctcc aaatgctaat acctttattt catcatcatt tgctttgcag acggcaaaac
     1981 ccttgtgatg ctgttgtctt tcacccaaag tttattggaa agaccctgga aaccttgcca
     2041 gagaaaaggt gagtgggcct agtgtgcggg ggagagaggc ctggacagtg gagaacagtt
     2101 ggcagcctgg gttaagttta attctaaact ctctctgagc agtgccgtgg gaaccagctc
     2161 tctgaggaga gtggctcagc tacagagaaa gttccccaac ctggaattca agagtattgt
     2221 atcctttcag aagaaggagg ggaaaaagag ggaaagaagg accttccgaa gcaagtggtc
     2281 catgcggtca gggggtcgtc tttccatctg tccgtccacc cacccaccca cccatccatc
     2341 catccatcca cacatccaca gtcctttaat gttttgcttt tttttttttt tcctgagaca
     2401 gggtttctct gtgtagcctg gctgtcctgg aactcacttt gtagaccaag ctggcctgca
     2461 aagtgagaaa tccgcctgcc tctgcctcct gagtgctggg attaaaggtg cgccaccact
     2521 gcctggcacc ctaatgtttt ttaaactcag gcctggcaat gaggacaatt tgcaaaacaa
     2581 acatggttcc ttgttctata cagctgacat gttagacaga caggcaggca ctgcagatac
     2641 tgaccggtga ccactcctgg tgcagggaca gaggcgcttc tgcttttact ttctgtgctg
     2701 ctaagtggtt ttggttttta cagtgaatat gtgatatgtt tcataaaagt aatttttttt
     2761 tct
//
LOCUS       MUSPBGD3      800 bp ds-DNA             ROD       09-AUG-1990
DEFINITION  Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 5.
ACCESSION   M28665 M29951 J04981
KEYWORDS    hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT     3 of 4
SOURCE      Mouse (strain C3H) DNA, clone PBGD.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 800)
  AUTHORS   Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
  TITLE     The mouse porphobilinogen deaminase gene: Structural organization,
            sequence, and transcriptional analysis
  JOURNAL   J. Biol. Chem. 264, 14829-14834 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Grandchamp, 06-OCT-1989.
FEATURES       from  to/span     description
    pept    +   402      515     porphobilinogen deaminase (housekeeping), exon
                                 10 (EC 4.3.1.8)
                739  +   777     porphobilinogen deaminase (housekeeping), exon
                                 11
    pep$    +   402      515     porphobilinogen deaminase (erythroid sp.) exon
                                 8
                739  +   777     porphobilinogen deaminase (erythroid sp.) exon
                                 9
    pre-msg <     1  >   790     PGB deaminase (hk and ery.sp.) mRNA and
                                 introns
    IVS     <     1      401     PGB deaminase (hk) intron I; ery.sp. intron G
    IVS         516      738     PGB deaminase (hk) intron J; ery.sp. intron H
    IVS         778  >   790     PGB deaminase (hk) intron K; ery.sp. intron I
BASE COUNT      181 a    201 c    182 g    236 t
ORIGIN      Chromosome 9; 500 bp upstream of segment 2.
        1 ctgtacccca gctagccttt aactcacaat aaccctcctg cctcagctct ctgaatgctg
       61 agattatagc catgagccac caggccaaat tcattttata tttctttctt tcttttttct
      121 tttttgttgt tgttgttgtt gttgtttagt tttgttttgt ttttaagact gggaaactct
      181 gtatagtcct ggaactcact ttgtagacca gatttagcct tgaattcatg gagatctgta
      241 tctgcctcca gtgctgggat ttaaaggtgt atacaccacc actcaacaaa aacacaacaa
      301 aaacaaaagt tttttaaaag ttagctagag gggggaaaag agactgtggg gcagagggtg
      361 cactgggtag gtcttgactt ctccttagca acgctccaca gcggggaaac ctcaacaccc
      421 gccttcggaa gctggatgag ctgcaggaat tcagtgccat tgtcctggct gtggctggcc
      481 tacagcgcat gggctggcag aaccgggtgg gccaggtagg agctgccctg ttctgcttcc
      541 cattgaatct gcctctctcc tgccttgatt tcttggtgac cattctgcca acaacactac
      601 aaccagaagc ccaggctagg gatattggga ctcattgctg gatttcctac ctgtgccttc
      661 cccaggcttc ctagattgca aaccctagct cactgccttt gaacatcccc tatcccacca
      721 tcttgtctct ctccacagat tttgcaccca gaggaatgca tgtatgctgt gggtcaggta
      781 ggtaggtttg cctggagaga
//
LOCUS       MUSPBGD4     1386 bp ds-DNA             ROD       09-AUG-1990
DEFINITION  Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 4.
ACCESSION   M28666 M29952 J04981
KEYWORDS    hydroxymethylbilanesynthase; porphobilinogen deaminase.
SEGMENT     4 of 4
SOURCE      Mouse (strain C3H) DNA, clone PBGD.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1386)
  AUTHORS   Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B.
  TITLE     The mouse porphobilinogen deaminase gene: Structural organization,
            sequence, and transcriptional analysis
  JOURNAL   J. Biol. Chem. 264, 14829-14834 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Grandchamp, 06-OCT-1989.
FEATURES       from  to/span     description
    pept    +   154      273     porphobilinogen deaminase (housekeeping), exon
                                 12 (EC 4.3.1.8)
                459      512     porphobilinogen deaminase (housekeeping), exon
                                 13
                618      704     porphobilinogen deaminase (housekeeping), exon
                                 14
                783      956     porphobilinogen deaminase (housekeeping), exon
                                 15
    pep$    +   154      273     porphobilinogen deaminase (erythroid sp.) exon
                                 10
                459      512     porphobilinogen deaminase (erythroid sp.) exon
                                 11
                618      704     porphobilinogen deaminase (erythroid sp.) exon
                                 12
                783      956     porphobilinogen deaminase (erythroid sp.) exon
                                 13
    pre-msg <     1     1279     PGB deaminase (hk and ery.sp.) mRNA and
                                 introns
    IVS     <     1      153     PGB deaminase (hk) intron K; ery.sp. intron I
    IVS         274      458     PGB deaminase (hk) intron L; ery.sp. intron J
    IVS         513      617     PGB deaminase (hk) intron M; ery.sp. intron K
    IVS         705      782     PGB deaminase (hk) intron N; ery.sp. intron L
BASE COUNT      372 a    305 c    343 g    366 t
ORIGIN      Chromosome 9; 80 bp upstream of segment 3.
        1 atcagtagtt cctgaaacct gttcatacct tgcacctcta tccatcaata atgttaaaga
       61 caggtttgtt gttatgcata acccaggaag cagtagaggt gtgtttctca tcttagctct
      121 attactagag aagaacagcc tgttgttctt tagggggccc tagccgtgga agtccgagcc
      181 aaggaccagg atatcttgga cctagtgagt gtgttgcacg atcctgaaac tctgcttcgc
      241 tgcattgctg aaagggcttt tctgaggcac ctggtaagat gggctcctcc catggtgttg
      301 tggggaaacc aggaagggca gtagggaggg agatttgtca agtactcagt atgtaatgtt
      361 ttgtatgtat ggagaggacc ttgatctggc ctcttgaggt ctgtggtcaa aagtggtgtt
      421 aaaggccctt agagctcaaa ggaacaatat cattgcagga aggaggctgc agcgtgcccg
      481 tagcagtgca tacagtgata aaggatgggc aagtaagcca gggaaatgga tgaggggagg
      541 gactgtcatt tccatgtgca cccaaacatc taagtaactt tctttaaaca tcctggtaca
      601 aacattttat ttcctagctg tacctgactg gtggagtatg gagtctagat ggctcagata
      661 gcatgcaaga gactatgcag gccaccatcc aggtccctgt tcaggtattg actgggagat
      721 gaggaggaat aaatagaact cttgtaatct tcctcttacc aaaattgtaa cctgtcatcc
      781 agcaagaaga tggtccagaa gatgacccac aactggttgg aatcactgcc cggaacattc
      841 caagaggagc ccagctagct gctgagaacc tgggcatcag cctggccagc ttgctgctca
      901 acaaaggagc caagaacatc ctggatgttg cacggcagct taatgatgtg cgctaactgg
      961 tctgtagggc acaggaaccc tggctgccac tccagtgcct acttctggct tccaagtgcc
     1021 ctgtgctcca tccctagggg tgtgattatc ccaggaaatt gaaccacagg gttgttgaga
     1081 cttccacttt ggaagatatg cctcaccttg gggcctccat atctgccttt ccctcagtag
     1141 ttgggggctt catctcttta gagaaagtcc atgccaatct ttgaatgtaa ccaataccac
     1201 taataaacca gtttagaatg tggttcttct gatagagttg gggaagatat gaataaaccc
     1261 aaagcccttt taaacttgaa tgagtctgag acctttctgt tgtaaaacac gctgtgattt
     1321 gcctcatgtt ctcaaaaaaa aaaaaaaaaa tcagccttta attcctacag cctgtcttca
     1381 gtcgac
//
LOCUS       HUMIBP3     10884 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human insulin-like growth factor-binding protein-3 gene, complete
            cds.
ACCESSION   M35878 M35879 M35880 M35881 M35882 M35883 M35884 M35885 M35886
            M36121 M36122 J05537 J05538
KEYWORDS    insulin-like growth factor-binding protein-3.
SOURCE      Human leukocyte DNA and, cDNA to mRNA, clone #HL1006d.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 10884)
  AUTHORS   Cubbage,M.L., Suwanichkul,A. and Powell,D.R.
  TITLE     Insulin-like growth factor binding protein-3: Organization of the
            human chromosomal gene and demonstration of promoter activity
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.R.Powell, 03-JUL-1990.
            
            The sequence presented here appears in Figures 2 and 3 of ref. [1].
FEATURES       from  to/span     description
    pept       2038     2440     insulin-like growth factor-binding protein-3
                                 precursor (IGFBP-3), exon 1
               5726     5952     insulin-like growth factor-binding protein-3
                                 precursor, exon 2
               6497     6616     insulin-like growth factor-binding protein-3
                                 precursor, exon 3
               8212     8337     insulin-like growth factor-binding protein-3
                                 precursor, exon 4
    sigp       2038     2118     insulin-like growth factor-binding protein-3
                                 signal peptide, exon 1
    matp       2119     2440     insulin-like growth factor-binding protein-3,
                                 exon 1
               5726     5952     insulin-like growth factor-binding protein-3,
                                 exon 2
               6497     6616     insulin-like growth factor-binding protein-3,
                                 exon 3
               8212     8334     insulin-like growth factor-binding protein-3,
                                 exon 4
    pre-msg    1906    10775     IGFBP-3 mRNA and introns
    IVS        2441     5725     IGFBP-3 intron A
    IVS        5953     6496     IGFBP-3 intron B
    IVS        6617     8211     IGFBP-3 intron C
    signal      102     1905     bp 3 promoter
    binding    1808     1821     Sp1 and AP-2 binding domains
    signal     1876     1881     TATA box
    site       5728     5736     potential ASN-linked glycosylation site
    site       5917     5926     potential ASN-linked glycosylation site
    site       7087     7255     region homologous to IGFBP-3 genomic sequence
    signal    10751    10756     Poly-A signal
BASE COUNT     2796 a   2578 c   2737 g   2773 t
ORIGIN      
        1 ctgcagacct gggacctcaa gaattgcatt tgatgccgaa cccagctcta atttcagagt
       61 caaggtctct gcgagtattt aaggaacgga tgtaaacctg ggggattcgt tttgtttcct
      121 tcaattttcc aatgaaatca gagatcctgt tcttgggtgt caacgcagat actagaagga
      181 ggtgatacaa gagaaaggaa acagcaagcg acgattatgg cacggtttcc tgtaaacaag
      241 gttgagtgta gccacagcct gagcactgtg ggagaagagc tcataagaaa atgacggtgc
      301 tgggccttcg tcaccccggg gccctccatt gttcttgtct ttggtctctt tttatttgta
      361 gaggtccaat tatttattta tttagtacaa gagggaacga aattgatctt tccattctaa
      421 aaggagagta tatatgtata aaaggaagct gtatagatat gggggaagag gtggacaggg
      481 ggaaaagggg agaggacgag agagagaaag ggagggagag ggacaaggag agacactggg
      541 cgagagatcg attaggagag acagaaatga tgaatgaaga ttaacttcac ccaaggcttc
      601 gtcgctggag gggaatggag gagctcctga tttgctatta ctactccaaa ctgcaaaggg
      661 ctccttcaag tcacctatcc acctcctaag gcaagcgtcc aatttcaaca gcgttcagga
      721 aagtctcctc ccgcggaggt ctcaccgctt cccactccac ccccacaaac tctttggaaa
      781 agtgccttga aaaatttaat cctcaatcca atcctggacc accagcgtcc tctgttggtc
      841 accgaaggag ggggtgcgca gacaaaactg aagaaactcg agtgccagag aaggccgaca
      901 ggagttacag cgacctcagc gcgcaattgc gccccgaact ttactgaaaa gtgtttagat
      961 tgcagagata agctagaatc ccaacgcatc gagaatacag taatacgaag tcgccttcaa
     1021 aaaatgacaa tgaaaattgc ctattaaagg actatttggt taattacgtt tcagcagtgc
     1081 ccagtttatt gtctttatta ttcttttgtc gtgggtgtaa actccatttg aaaacataat
     1141 cagggagaat acccaagaca agaagaacag ttgtcattta aaatatttga aaagccctgc
     1201 cttaaggagc attcgcttgc cggtccactc ttaattgggg acttgcggtg tagcaacacg
     1261 tgagagtctt cttgcgttga gaagtaagcc tggaaaggcg aaggccccgg ggcatcttca
     1321 gatgcgtatt tgtgggcccc tggggatata aacagcccag cgggtgtaaa ttaaaccccg
     1381 cagtgccttg gctccctgag acccaaatgt aagtcagaaa tgtcccaaga cttcgcctgc
     1441 caacggaatt aaattttaga aagctccacg aggtacacac gaatgcggag cgctgtatgc
     1501 cagtttcccc gacaccggct cgccgcaggg agacctcacc ccgagagcgg aaggggtaag
     1561 ggcggcgggg tcaaggagat cgggggtgct gagttggcca ggagtgactg gggtgaccgg
     1621 gggtgctgag gtggcctgga gtgccggggt ggccgggcac accttggttc ttgtagacga
     1681 caaggtgacg ggctccgggc gtgagcacga ggagcaggtg cccgggcgag tctcgagctg
     1741 cacgcccccg agctcggccc cggctgctca gggcgaagca cgggccccgc agccgtgcct
     1801 gcgccgaccc gcccccctcc caacccccac tcctgggcgc gcgttccggg gcgtgtcctg
     1861 ggccaccccg gcttctatat acgggccggc gcgcccgggc cgcccagatg cgagcactgc
     1921 ggctgggcgc tgaggatcag ccgcttcctg cctggattcc acagcttcgc gccgtgtact
     1981 gtcgccccat ccctgcgcgc ccagcctgcc aagcagcgtg ccccggttgc aggcgtcatg
     2041 cagcgggcgc gacccacgct ctgggccgct gcgctgactc tgctggtgct gctccgcggg
     2101 ccgccggtgg cgcgggctgg cgcgagctcg gggggcttgg gtcccgtggt gcgctgcgag
     2161 ccgtgcgacg cgcgtgcact ggcccagtgc gcgcctccgc ccgccgtgtg cgcggagctg
     2221 gtgcgcgagc cgggctgcgg ctgctgcctg acgtgcgcac tgagcgaggg ccagccgtgc
     2281 ggcatctaca ccgagcgctg tggctccggc cttcgctgcc agccgtcgcc cgacgaggcg
     2341 cgaccgctgc aggcgctgct ggacggccgc gggctctgcg tcaacgctag tgccgtcagc
     2401 cgcctgcgcg cctacctgct gccagcgccg ccagctccag gtgagccgcc cgccaggtgc
     2461 gctgcgtgca gcaccgccac tggcgccgaa gggcctgggg gttgctgggt gccgctgcgg
     2521 gagactccgc ttttcttctc actggagata atatgtgggg aaactgaagg cgctccggga
     2581 aaggtgaagg cggtcgccga gggaccctcc ccagccggcc ctctacttgc tcgattctct
     2641 aagtgcagag tacttgtaaa ttgcaaagcg ctttcagtga aaatgggtaa aggtttccgg
     2701 agctgagggg agcggtaccg atgtttagct gttggaaaga tcctggacac aggagattct
     2761 cctcgccccg cacgggtgca cacggactgc aatcccaggg atgcttgggg atggggggat
     2821 ataggcggat ttggaccaag gaaggtgggt aggcacgttg taggaaatag tacctctctt
     2881 ttaaaatact gactttgcac agccttttgg tttgcaaagc aatgtctagt cccggtatgt
     2941 ccaaaaacaa gtaaagtgga ttcgggtttt gatatcttct gcggttggaa aacctgaagc
     3001 tgaaaaagaa gtaacttctt aaggttaccc agcggccaca acagagtgta ggtttgaact
     3061 ccgcgtgcca ctttcagtac cataccattc ttacaactcg ggccacccct gcacctgcgc
     3121 cgacctcaaa caaacttcca ggtgcgtggt gggtgcgggc aatgtggact aagtcaattt
     3181 caatgacacg gcaagggaat tggaatcagt cctaggctgt ctcccttctt aatctgaaat
     3241 gggggggggg aatgagatgt tgttaagggg agccccagaa gaggaaaaat gcaaacattt
     3301 ggcagagtta ccctcttgct tagccactat cagtatcagg cagacagcga ctctggtaag
     3361 ggcatcacat tgttccctta aaaaaaggag cgggggttgt ttaaatggat ttggcagctg
     3421 ttctttcaag cattcttagc cagcctcacc tagttatatg agaaataaag ttcctgcctt
     3481 gcacagctga aggctgggag aattctcccc atcctaattc ccccaactcc ccaacgatca
     3541 cgttggacag atgtcactgg gcaggccccc atctagggct agcaggatga acagtccctt
     3601 tataatttat gtagctgtag agttccacgc ccgggtgaag ttattttctg gctcggcaag
     3661 gctggctctg ttcacccctg agaaatgctg gattcatgga aaggcaagat gcctgaaaca
     3721 tacactggct ctggtcagct gttaaagctg ctggaggcat ttgtctctcg gggcaaagtt
     3781 atgtcatttg ccaagtgtcg tacattattg tgcattttgg ggtattcaaa aagtgatctt
     3841 agaaatactg atacacatcg tcattcttgg gctttagcaa tcatcatgat taccacctta
     3901 gtagcactgt agtataggtt gatgtgagtt ataagattat aaaaagatct aagtgacttc
     3961 tagaatctat ttgacaaaaa aaggtaaatt ttcgacagtc aaaagtcaca attatctgtt
     4021 gcttaaatag aactgttttg tcttcatgcc ctagtctgca gcccaggcat taagaagaaa
     4081 ccaaggaaat ttaagaaatt actcaaggtt cttagaaaag aagtataaat acgtttattt
     4141 acatgttctt agagtattta cattcttagt atctctttta tctcagtatt tccttgaaaa
     4201 agaaagcaag ctaagattaa aagaaattga aaccaaatcc tcgcaggtag ggacctcctc
     4261 tgtgaggctc tgtgctggac cctgggaatg tgtgcttccc aaggtatgaa accccttggg
     4321 gaactttaca gcaggacctc agtgagctgt ttggcaggtg aggaaactaa gacccagaga
     4381 ggagagggac tttcctaagg ccctggtgag tgacctgcca gtagccactt ccaggggaga
     4441 gcagagcatc tgcagccaaa tcattgcagc cccaggtagc tttctagata gactgtggac
     4501 cagatgggcc acctgagctc cctgctaggg ttacacatta tagccctgtt tgtgtagtag
     4561 agaaatttca tgactctcaa ttgtggactt aagccgatgc ctccagacct tggcatggtc
     4621 cacaggccct gggagcatgg gctctgaatg tagcctttga tccccatagc ggtcttacag
     4681 cccctccaag ttcattctga agaaggaatg gagtgagaat cctggctgca gatccagtct
     4741 tgaatttagt catatactta aaattccaat tcaactgtta acattccagc atccatttta
     4801 agcatcagac tttcttcatt tagcactttt tattataaaa gggagatctg ctggaggggg
     4861 atttctccta ccccaccccc acccagggaa ggaaaagctc tttggcactt agaagtctga
     4921 gccgtgagtg ggactttggc attgtctgca tccatgtgct gctgtgttca cccggggtga
     4981 aaaggactca cttaggcagg caccagcaag atgcacaggg tctgtgtaga ccttgagttt
     5041 tagagatgta acggggacct agaaaacaag ccaccaacat gcttgcatga ttctgagccc
     5101 ctgaggcaaa acgctttgca ggtaataatt cagttttccc atctgagctg gacaccaagc
     5161 tcttataagc gtgtttacct ggtagcattg aggacggtac tggtcaacct tggaattccc
     5221 ataagggctt gttacaactc agactcgtgc cgccactcca gcgtttccgg agtggagaat
     5281 gtgcatttct tccaagtccc cgggctgccg ctgctcccgc gggtgggagg accacacttg
     5341 gagttgactg caaaatttct gagccggcgc tgcagcagcc tcccgtggct caggtctgcc
     5401 ccctgccggt ggaagatgaa gcatactgcc ttcacctact gaggggcact gaagcgtttg
     5461 tctgccttct ttagttgcag ctacttagga agagcacctg tcagattgac tttcaaacag
     5521 ataacttctt gaggtagagc aaccaccatg tagtgagtag tatgatggaa taatacttca
     5581 tcgaggtatt taaaaaaaaa acctcacttg gattgccaac taatattgtc atttacatgt
     5641 gacctggttg caacgttaag atttttacaa gactgtgata gatattgatg actctcatgt
     5701 gtttgtctct cttgggcgtt ttaaggaaat gctagtgagt cggaggaaga ccgcagcgcc
     5761 ggcagtgtgg agagcccgtc cgtctccagc acgcaccggg tgtctgatcc caagttccac
     5821 cccctccatt caaagataat catcatcaag aaagggcatg ctaaagacag ccagcgctac
     5881 aaagttgact acgagtctca gagcacagat acccagaact tctcctccga gtccaagcgg
     5941 gagacagaat atgtgagagc ttttcctctt gttaaaggag gagggcaaga cctgccaagc
     6001 ctgggtactc agagcctctt gagggcaatt cttactcaac aaaccccagc gcctggctga
     6061 tgggtgggca acccctagcc cctctgtgcc ctacctctct cctctcctta cataaagaat
     6121 attgaccctt ttggagaatc ttatgaggat caagctgaaa taacactctt aaaagcatat
     6181 gggatgtcat aaagacctct gcagataatg aaaatattct cataaagata gttttattta
     6241 cttcatcctc tatgcttgtt gacctgctat tggttccatg ccagcttctg tgccttactc
     6301 tgggaagagc aaaaaggaga cagggagtga tggttagctt attcggggga ctttcgtgct
     6361 acatcagaca taaggtatct gaggagcaaa ttacaggtcc cacttttggt agttgtgcag
     6421 catcgtaaga tttttaaagc acacattcta gagtaaaaac tgtgactctg ttgctctggt
     6481 ccttcctgat ccccagggtc cctgccgtag agaaatggaa gacacactga atcacctgaa
     6541 gttcctcaat gtgctgagtc ccaggggtgt acacattccc aactgtgaca agaagggatt
     6601 ttataagaaa aagcaggtga gtgaggtcct cagtgtgttt tcttcctctt ctgttgacac
     6661 agaggagaaa cccatgtcac cagcgcccag gctcttgtgg ccatagctct aactctgagc
     6721 ctgtgcagca ccagtgccca ggacttggtg ccagtctcag gaggtcagac caagggctgc
     6781 tttgacttgt tgctctgagt gctgctatat tggccataat cctcaaccct agtgcctttc
     6841 caccacccgc ttcccactcc tgtcctttca atggttcacc cacaggcgga caagatgctg
     6901 cccagtggca ccctttataa actgcaagtg gacatgttaa cacatttgtt aatgctgcgt
     6961 cagggagtga catttcaaac aactattata gtcagtttcc aagaagtgtg acatgaggtc
     7021 ataccacaaa aaagcttacc ctgaaatccc acaatcgtcc cctttcctac tgatgccttc
     7081 ccgatagtga gcaggttgca atattaagat tttgaaaagg ctgttgctag atgttggtga
     7141 ctcgtgtgtc tctgtctccc ttgggctttt caaggaaatg ctagtgagtg gggggatgac
     7201 tgcagcatgg ccagcttgga gagcccagcc atccccagca cataccaggt gtctgtcttg
     7261 gcgtggaggg gatggaactt gaaatcagac actcggtcca tgctggggat ggccagtctc
     7321 tccaaactgg catgtggtct tcctccgagt cactggcatt tccctagaaa gtccaagtga
     7381 gaagaaggca tgagagtcat caacatcaaa caacagtctt ttcaaaatct ttatattgca
     7441 acatagtccc attcctggaa aaggaatgga gtgagaatcc tggctacaca tcagccccaa
     7501 atgtagtcat tgcctaaaat cccaattaac ctgaaaatga tcaaacaaat ttaagatata
     7561 gtaatattaa gctgtaataa atatgcttct ataggctttg tgttatgtga tggcactatt
     7621 tcaattggct ttctaattgg acaattgata ctatgctatc tacagaattg gcctttggag
     7681 acctaagtga gccacagtgg cctcagggtg accatatact aggattcata gcagtggcca
     7741 cagtcagaag cctaagcttt cctccattgc cattgctcgt ttataccacg tttctgtcaa
     7801 agtcatattc attcaacaaa gtcatactga gaaggtgtca tgtgaggctg gatgtgggct
     7861 ccaaagtcat agctgtgaca ttcgcaggca gcgggatgtt ctcagttcca catttggcag
     7921 agaagtcagt caagaggttc tacaagggct ggtgtccacc ttatactcct agaaacacaa
     7981 aactgccccc acccccgctt tcttggagca ggaagttaca cccacacgca tgcacaggcg
     8041 cacactcagc gggcctaggc agcgtggctc ttgtgttgcc ttagctgaaa tttctgttgt
     8101 gctttctcag catagcagag tcacgctggc aaaccatcat gcgccctggc caccgacctg
     8161 acaccagacc caggagcatt cacttctctg tcttctgttt ctctcccaca gtgtcgccct
     8221 tccaaaggca ggaagcgggg cttctgctgg tgtgtggata agtatgggca gcctctccca
     8281 ggctacacca ccaaggggaa ggaggacgtg cactgctaca gcatgcagag caagtagacg
     8341 cctgccgcaa gggtgagtac tcaggagggg cagcctgggc tccagggcct cactgtcctt
     8401 ggaccagcct caggggctgg gcgtggccac tggccttccc caggcttaca gacccaggag
     8461 ctgcagctca gggccagaaa gagcaaagca aataggacag agccctcaga agggtgcagg
     8521 gagagggaga ccccatcaac ccaaccaaac aagtgtgggg aaggaggccg gccagtgcac
     8581 ctcagggaca ctctgcttta tctcagatac ctcacagcac ctaagctatc attcatccac
     8641 acacaaagtg aagattttca aagttaggct ttacccgtga gtctggaggt catttatctt
     8701 cacagagaac gtttatcgca gactgctaag atacatgttc taattaagat gtgatgtgag
     8761 aacgctgaat gctcgttgga gactcagttg aagtgcagct ttttttctgt caaatatata
     8821 atgaatattc tgttagtctg tggctaatat aattttaata aagttaattt aaatctgata
     8881 gaaaaatgaa attttaaacg ataattttag agaatgctat tatatccagt cttctttttt
     8941 cttttaataa atgagggaac tattggggga aaggaataaa tacattttct ttcattttat
     9001 taagacaaat ttagtaagca gaagaaattt gcatgtttag ttataagggt ttcttttttc
     9061 cttacaagtt ggaaaaaata attctaattt aagggtaact ctttgacaat gaacactgtg
     9121 agcagcatct ggtactcgtt gctttgtttg aaaacatgag ttgagacccc agccgcactt
     9181 gcagcctagt gccattagcc tgcaggctgt gctggatatc tcagggcaag agtcgagccc
     9241 ttttgatttt ggggggatta tttcaatata tttgcttttt ctttttgttt tagttaatgt
     9301 ggagctcaaa tatgccttat tttgcacaaa agactgccaa ggacatgacc agcagctggc
     9361 tacagcctcg atttatattt ctgtttgtgg tgaactgatt ttttttaaac caaagtttag
     9421 aaagaggttt ttgaaatgcc tatggtttct ttgaatggta aacttgagca tcttttcact
     9481 ttccagtagt cagcaaagag cagtttgaat tttcttgtcg cttcctatca aaatattcag
     9541 agactcgagc acagcaccca gacttcatgc gcccgtggaa tgctcaccac atgttggtcg
     9601 aagcggccga ccactgactt tgtgacttag gcggctgtgt tgcctatgta gagaacacgc
     9661 ttcaccccca ctccccgtac agtgcgcaca ggctttatcg agaataggaa aacctttaaa
     9721 ccccggtcat ccggacatcc caacgcatgc tcctggagct cacagccttc tgtggtgtca
     9781 tttctgaaac aagggcgtgg atccctcaac caagaagaat gtttatgtct tcaagtgacc
     9841 tgtactgctt ggggactatt ggagaaaata aggtggagtc ctacttgttt aaaaaatatg
     9901 tatctaagaa tgttctaggg cactctggga acctataaag gcaggtattt cgggccctcc
     9961 tcttcaggaa tcttcctgaa gacatggccc agtcgaaggc ccaggatggc ttttgctgcg
    10021 gccccgtggg gtaggaggga cagagagacg ggagagtcag cctccacatt cagaggcatc
    10081 acaagtaatg gcacaattct tcggatgact gcagaaaata gtgttttgta gttcaacaac
    10141 tcaagacgaa gcttatttct gaggataagc tctttaaagg caaagcttta ttttcatctc
    10201 tcatcttttg tcctccttag cacaatgtaa aaaagaatag taatatcaga acaggaagga
    10261 ggaatggctt gctggggagc ccatccagga cactgggagc acatagagat tcacccatgt
    10321 ttgttgaact tagagtcatt ctcatgcttt tctttataat tcacacatat atgcagagaa
    10381 gatatgttct tgttaacatt gtatacaaca tagccccaaa tatagtaaga tctatactag
    10441 ataatcctag atgaaatgtt agagatgcta tatgatacaa ctgtggccat gactgaggaa
    10501 aggagctcac gcccagagac tgggctgctc tcccggaggc caaacccaag aaggtctggc
    10561 aaagtcaggc tcagggagac tctgccctgc tgcagacctc ggtgtggaca cacgctgcat
    10621 agagctctcc ttgaaaacag aggggtctca agacattctg cctacctatt agcttttctt
    10681 tattttttta actttttggg gggaaaagta tttttgagaa gtttgtcttg caatgtattt
    10741 ataaatagta aataaagttt ttaccattaa aaaaatatct ttccctttgt tattgaccat
    10801 ctctgggctt tgtatcacta attattttat tttattatat aataattatt ttattaaaat
    10861 gttccctgct ttccctttta gcaa
//
LOCUS       PINCABII2     583 bp ss-mRNA            PLN       09-AUG-1990
DEFINITION  Pinus sylvestris cab II/2 mRNA for chlorophyll a/b-binding protein.
ACCESSION   M37489 X14507
KEYWORDS    Cab gene; chlorophyll a/b-binding protein; thylakoid protein.
SOURCE      P.sylvestris cotyledones cDNA to mRNA, clone pINE ab 11.
  ORGANISM  Pinus sylvestris
            Eukaryota; Plantae; Embryobionta; Pinophyta; Pinicae; Pinatae; 
            Pinaceae.
REFERENCE   1  (bases 1 to 583)
  AUTHORS   Jansson,S.
  TITLE     ;
  JOURNAL   Unpublished (1989) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 583)
  AUTHORS   Jansson,S. and Gustafsson,P.
  TITLE     Type I and type II genes for the chlorophyll a/b-binding protein in
            the gymnosperm Pinus sylvestris (Scots pine): cDNA cloning and
            sequence analysis
  JOURNAL   Plant Mol. Biol. 14, 287-296 (1990)
  STANDARD  simple automatic
COMMENT        [1]  Author address Jansson,S.
               Plant Physiology
               Umea University
               S-901 87 Umea
               Sweden
            
FEATURES       from  to/span     description
    pept    <     1      455     chlorophyll a/b-binding protein
BASE COUNT      135 a    133 c    171 g    144 t
ORIGIN      
        1 cggagctgtt ggttaaaaac ggggtgaaat ttggggaagc tgtgtggttc aaggccgggg
       61 cgcagatatt ctcagaggga ggccttgact acctggggaa ccccaacctg atccacgcgc
      121 agagcattct agccatctgg gcctgccagg ttgttctcat gggattgatt gaaggataca
      181 gagtgggagg aggacccctt ggagaagggt tggaccctct gtacccaggg gatgccttcg
      241 acccactggg gctggccgac gaccccgagg ccaaggcgga gctgaaggtg aaggagatta
      301 agaacggtcg gctggccatg ttctccatgt tcggtttctt cgttcaggca atcgtgaccg
      361 ggaagggccc cattgaaaat ctctacgacc acttggcgga ccccgttgcc aacaatgcct
      421 gggcctacgc caccaatttc gttcctggca agtgaaggtg acggaaaata aaagaggcct
      481 gtgatctgtg catcaatcat ttgacagcct tagtgttaat aaaatatgtt ctttcagctg
      541 tatgtatttg ttggtgatct tcgttaataa aatattttct ttc
//
LOCUS       RATMHCIAB    1563 bp ss-mRNA            ROD       09-AUG-1990
DEFINITION  Rat MHC class I cell surface antigen mRNA.
ACCESSION   M25319
KEYWORDS    antigen; cell surface antigen; class I gene; glycoprotein;
            histocompatibility antigen; major histocompatibility complex.
SOURCE      Rat cDNA to mRNA, clone pARI.5.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1563)
  AUTHORS   Radojcic,A., Stranick,K.S., Locker,J., Kunz,H.W. and Gill,T.J.III.
  TITLE     Nucleotide sequence of a rat class I cDNA clone
  JOURNAL   Immunogenetics 29, 134-137 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and sequence in computer readable form for [1] kindly
            provided by J.J.Rushton 24-JAN-1990.
FEATURES       from  to/span     description
    pept    <     1     1134     MHC class I antigen (AA at 1)
    sigp    <     1       60     MHC class I antigen signal peptide
    site         61      330     alpha-1 domain (exon 2)
    site        381      606     alpha-2 domain (exon 3)
    site        607      882     alpha-3 domain (exon 4)
    site        883     1131     transmembrane and cytoplasmic domains (exons
                                 5, 6, 7, and 8)
    signal     1535     1541     poly-A signal
BASE COUNT      324 a    412 c    471 g    356 t
ORIGIN      
        1 gcaccgcgca cgctgctcct gctgttggcg gccgccctgg ccccgaccca gattcacgcg
       61 ggctcacact cgctgcggta tttcgacatc accgtgtccc ggcccggcct cggggagccc
      121 cggttcatct ctgtcggcta cgtggacgac acggagttcg tgcgctacga cagcgacgca
      181 gagaatccga gattcaagcc gcgggtccgg tggatggagc gggaggggcc ggagtattgg
      241 gagcggatca cacggatcgc caaggaaagc gagcagattt accgagtggg cctgaggacc
      301 ctgcgcggtt actacaacca gagcgagggc ggctctcaca ccatccagag attgtctggc
      361 tgtgaggtgg ggtcggacgg gatcctcctc cgcgggtatg agcagttcgc ctacgacggc
      421 cgcgattaca tcgccctgaa cgaagacctg aaaacgtggg cggcggcgga ctttgcagca
      481 gggatcaccc ggaacaagtt ggagcgggat ggtgaggcag agagactcag ggcctacctt
      541 gaaggcggga gcgtggagtg gctccgcaga tacttggagc tcaggaagga gacgctgctg
      601 cgctcagaac ccccaaaggc acatgtgacc cttcactcca gacctgaagg tgatgtgacc
      661 ctgaggtgct gggccttggg cttctaccct gctgacatat tcctgacctg gctgttgaat
      721 ggggaggacc tgacccagga catggaactt gtggagacca ggcctgcagg ggatggaacc
      781 ttccagaagt gggcatctgt ggtggtgcct cttgggaagg agcagaatta cacatgccat
      841 gtggagcatg aggggctgcc tgagccgctc accctgagat gggagggtcc tccctccgcc
      901 aactccaaca cgggaatgtc tgttattctt ggaactgtgg ccatcattgc agttatggcc
      961 atcattgcag ctgtggcctt cattggacct gttgtgagga agaggtggat aaaaacagct
     1021 tttcttctca caagtggaaa aggaggagac tacacccctg ctccaggcag ggacagctcc
     1081 cagagctctg atgtgtctct cccagattgt aaagccatga agacagctgc ttgaggtgaa
     1141 ctggatgccg gccgatgtgt tcaggtctct cttgtgacat ccggagccct cggttctctt
     1201 tggacaccga tgcctgggat tccctatgat cctatgactt cggtataggg gactatggga
     1261 cccggcccaa ccctacacac cgggacccta tccctgcact gtttgtgttt cctttcacag
     1321 ccaaccttgc tggttcagcc tgggttgggg cctggacatc tgcatcctat cactcagtgg
     1381 tgctttgaac tgcaactcct cacttctaca ctgagaataa gaatctgagt gtgaacttga
     1441 ctgttcacat ccttgacaca gtgttgactg ctttttaaat tactggattg agaatactta
     1501 gaggttgttt tttgtttttg ttttgttttg ttttaaataa atggcaggtg gagaagcttc
     1561 cag
//
LOCUS       HUMINT01       42 bp ss-mRNA            PRI       09-AUG-1990
DEFINITION  Human leukocyte adhesion glycoprotein p150,95 mRNA, exon 1.
ACCESSION   M29165 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     1 of 7
SOURCE      Human cell line HL-60, cDNA to mRNA, clone lambda-X47.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 42)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     cDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  simple automatic
FEATURES       from  to/span     description
    mRNA    <     1  >    42     P150,95 mRNA, exon 1
BASE COUNT        9 a     15 c     10 g      8 t
ORIGIN      1 bp upstream of EcoRI site; chromosome 16p11-13.1.
        1 gaattcctgc cactcttcct gcaacggccc aggagctcag ag
//
LOCUS       HUMINT02     3690 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            2 - 6.
ACCESSION   M29482 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     2 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3690, exons only)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3690; exons and intron/exon boundaries only)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 2782-2788 (1990)
  STANDARD  full staff_entry
REFERENCE   3  (bases 1 to 3690; exons and intron/exon boundaries)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 12750-12752 (1990)
  STANDARD  full staff_entry
REFERENCE   4  (bases 1 to 3690; exons and intron/exon boundaries; revises [3])
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
            
FEATURES       from  to/span     description
    pept       1028     1064     integrin alpha subunit precursor, exon 2
                                 (first expressed exon)
               1666     1771     integrin alpha subunit precursor, exon 3
               2391     2494     integrin alpha subunit precursor, exon 4
               2795     2865     integrin alpha subunit precursor, exon 5
               3020  +  3131     integrin alpha subunit precursor, exon 6
    sigp       1028     1064     integrin alpha subunit signal peptide
               1666     1685     integrin alpha subunit signal peptide
    matp       1686     1771     integrin alpha subunit
               2391     2494     integrin alpha subunit
               2795     2865     integrin alpha subunit
               3020  +  3131     integrin alpha subunit
    pre-msg <     1  >  3690     P150,95 mRNA and introns
    IVS     <     1      975     P150,95 intron A
    IVS        1065     1665     P150,95 intron B
    IVS        1772     2390     P150,95 intron C
    IVS        2495     2794     P150,95 intron D
    IVS        2866     3019     P150,95 intron E
    IVS        3132  >  3690     P150,95 intron F
BASE COUNT      764 a   1050 c   1020 g    856 t
ORIGIN      1 bp upstream of BamHI site; chromosome 16p11-13.1.
        1 ggatcccttg ggcccaggag ttcgaagcag cagtgaacta tgcacccact gcactccagc
       61 ctgggtggca gagcaagacc ctgtttctga aattaaaaaa aaaaattgat gtacattagg
      121 gggcttccac ggcctgagct gcttcccctt gctttcctcc cagtggccct gaccttgtct
      181 cttacaactt cccaccctga ctgtctggtt acccattgct gatttcacac acagaccctc
      241 ctgtaccctg cctcatccat gtctggctgc tctgtcatct cccaactttg gttgctttca
      301 atgctcagct caagcaccac ctctttcagg aagccttctc agaaagccac accttcacaa
      361 cccgggtgag gcaccctgtg gtctctgtgc ttccccctca cagcaatgaa cttgctgttt
      421 atacatctgc ctctccactg accccagggc tggtgctttg tggtttatat tttcttcccc
      481 acctagcaga gggcttgcat ctccaggctc aaattaggct tcttgaataa atgatgaata
      541 aatgagtgaa tgaatgaatg aacaaatact cgctctgtgc tcctcctagg gacccggatc
      601 ccccactcct tggcccagac tttccaggtc agagtggagg cctcccacca gggtttcctt
      661 taggggtcct gaggggtggg catctgccca aaccccctcc agtctggctg aaatttcaag
      721 gtcaaggggt ccttctggca gtcaagggtg agcctgggag gggcagggca gggatttgca
      781 tccatctaag caaagggcat caagccaagt catctgatga gagtgactcc ggttgggggg
      841 tgggggcgtg tgggagccga gcctgtcctc ggatcagttg cgtactctgc ccgccccctc
      901 tgactcatgc tgacaatctt cttccttccc ctggccacct ctctgcccac ttgcttcctc
      961 agtaccttgg tccagctctt cctgcaacgg cccaggagct cagagctcca catctgacct
     1021 tctagtcatg accaggacca gggcagcact cctcctgttc acaggtgagc ctggacccca
     1081 atgaagtagg gctggggacc caggcccaag ggagccaggg ccctgaactg ggggctcagg
     1141 ctggggggtt aggatctggg taggaagaga gactcagtca agcctgaggg ggaggcaggc
     1201 acatagggtt tgagatttgg agtttgtgga gggagaggat attgatgaac caattttggg
     1261 agagttccag agatgctgga agagaggcca gttgtctctg tactgcagag atttttaaaa
     1321 taggcagaat gcgccaactt gtgctctgtg gacaggatgc tttggtccgc aagttttcct
     1381 ggacgcactc tcatagcgcc cgaggtgcac gttggggaaa gatccttttt agagcctggg
     1441 tactgctctg cagaaatgga gaactgcaac tcgatagtgg atggtgggca aggggcatcc
     1501 ctggaccctg ggaaggagag aaggggatga gttgggtgtc cagaagaccc aggcaccccg
     1561 ggcatcaggc tcggagggga gattgggacg ctggggccgg gggtggaggg cagccaggca
     1621 gaaggaagac ccttctccaa agctctcttc ccacctcttt cccagcctta gcaacttctc
     1681 taggtttcaa cttggacaca gaggagctga cagccttccg tgtggacagc gctgggtttg
     1741 gagacagcgt ggtccagtat gccaactcct ggtgaggccc aggtggtgct cctttggctc
     1801 catccatcct ctccctgctc aggccccatc cccccggccc tgccctgtta tttgcaaact
     1861 ctcctctctg tctggtgtag cgactgccct ggctaatgaa gatttgcctt gaaggcaggc
     1921 acggtctcac agctaacatt tacagagcag taagtgcagt gccaggctca tcacaggtgg
     1981 atgctgattt agtccacacg acagcctgtg agtaggaatc agtcgtgcaa caaacactta
     2041 tttgtttttt ctttcttttt ttctatacat ttaaaaatat atagagacag ggtctcacta
     2101 tgttgcctgg gttggtctca aactcctggg ctcaagcaat cctcccgcct cagcctccca
     2161 aagtgctggg attccaggtg tgagccacca cacccagact caacaaatat ttcttgtctc
     2221 catacgccag agaatccaac agacagaaat cccttccaca tggactttaa attattaaaa
     2281 tccatcttgc agatgaggaa gctgaggctc agggagggaa cgcaaacttg ccggagtggc
     2341 agctgtcggc gtccacactc ttacctaaag tgttctttgt ctcctcgcag ggtggtggtt
     2401 ggagcccccc aaaagataac agctgccaac caaacgggtg gcctctacca gtgtggctac
     2461 agcactggtg cctgtgagcc catcggcctg cagggtgagt caccgcccct cccgggaccc
     2521 agggccgggc tcccaggctt ccctgctcca ggggcccgtg gactcccgga gtgtcacttt
     2581 cagcttccct gtgtctgaga ccctcaccct cagatatgct tcctggcccc ttaaggcctc
     2641 cccgcccatc gcactcccgc agctctgtca agacccgaca gcttccttca ccgtcagacc
     2701 tccttgtctc ccaggtggag gtgacccctg cccagctctt ccacagcctt ctctgtaggg
     2761 cccgagagtg accatgcaca tatctgtccc acagtgcccc cggaggccgt gaacatgtcc
     2821 ctgggcctgt ccctggcgtc taccaccagc ccttcccagc tgctggtgag tggccctggg
     2881 tcacaggagg cttctgaggg agggagggag gagccggggc cgccgggggc tgggactctc
     2941 ctgtagggtg gaggttccgg catctgaggg tgggaggtac atgccaggga gtgcccccag
     3001 cagcccgctg tgtccccagg cctgcggccc caccgtgcac cacgagtgcg ggaggaacat
     3061 gtacctcacc ggactctgct tcctcctggg ccccacccag ctcacccaga ggctcccggt
     3121 gtccaggcag ggtgagtgtc gggaccacca aggctttgag gagctcacgc acatccaatt
     3181 gggggtgcgg tgggctagag acagtcttgc cagagtggat cagaaagaag ggatctggaa
     3241 aaagagttac ctcgtgttgc agtggttcct gacgctgctg cccgcacatc ctgccgatcg
     3301 ccgcacgctg ccggaccttt cctgtgacct taacctctcc aagcctcagt ttcttcatct
     3361 gttggatggg gataataaca cacccagcac tgaaagcaac acaggatgat tcatggccag
     3421 gggttagcac agcagctagc accaggcgac acccatgccg gccagctgtt gttattttta
     3481 gaggagagga ctattttcat ccaatgggtc ctgggatatg accaattggt ttgtgccgta
     3541 gtttaggaaa ggtcagtgaa agtgcagtgt gagcaacgtg tgtgtgtaca tgtgtgtata
     3601 tgtatgcatg tgtatacatg tgcacatgca catgtacatg catgtgtgtg catgtatgtg
     3661 tgtgtgtgca tgtgcatgca ggttgagacg
//
LOCUS       HUMINT03     4863 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            7 - 15.
ACCESSION   M29483 Y00093
KEYWORDS    integrin; protein p150,95.
SEGMENT     3 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 294 to 3967; exons only)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 4863)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 2782-2788 (1990)
  STANDARD  full staff_entry
REFERENCE   3  (bases 1 to 4863; exons and intron/exon boundaries)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 12750-12751 (1990)
  STANDARD  full staff_entry
REFERENCE   4  (bases 1 to 4863; exons and intron/exon boundaries; revises [3])
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +   294      424     integrin alpha subunit precursor, exon 7
                511      656     integrin alpha subunit precursor, exon 8
                901     1054     integrin alpha subunit precursor, exon 9
               1659     1809     integrin alpha subunit precursor, exon 10
               2428     2501     integrin alpha subunit precursor, exon 11
               2665     2794     integrin alpha subunit precursor, exon 12
               3207     3349     integrin alpha subunit precursor, exon 13
               3531     3671     integrin alpha subunit precursor, exon 14
               3760  +  3969     integrin alpha subunit precursor, exon 15
    matp    +   294      424     integrin alpha subunit
                511      656     integrin alpha subunit
                901     1054     integrin alpha subunit
               1659     1809     integrin alpha subunit
               2428     2501     integrin alpha subunit
               2665     2794     integrin alpha subunit
               3207     3349     integrin alpha subunit
               3531     3671     integrin alpha subunit
               3760  +  3969     integrin alpha subunit
    pre-msg <     1  >  4861     P150,95 mRNA and introns
    IVS     <     1      293     P150,95 intron F
    IVS         425      510     P150,95 intron G
    IVS         657      900     P150,95 intron H
    IVS        1055     1658     P150,95 intron I
    IVS        1810     2427     P150,95 intron J
    IVS        2502     2664     P150,95 intron K
    IVS        2795     3206     P150,95 intron L
    IVS        3350     3530     P150,95 intron M
    IVS        3672     3759     P150,95 intron N
    IVS        3970  >  4863     P150,95 intron O
BASE COUNT      947 a   1358 c   1460 g   1097 t      1 others
ORIGIN      Chromosome 16p11-13.1.
        1 acctgtgatc gccccctcgc ctcccaaagt actgggatta cacggtgagc caccacgcct
       61 ggctcaatca cagcctcttt aggcaacttt aagagaatga agggccttgt tccaggcaag
      121 gggttaggga acgtctgccc ctgatgagga gaggacccag ggtgtggagc ctgactccca
      181 tcgccagact aggggcttag ggaggaaggg ttttggagag tgagctcttg caggagccac
      241 ggtcctggac tccaggagtg tcacttggag gacggtgcca cctccttccc cagagtgccc
      301 aagacaggag caggacattg tgttcctgat cgatggctca ggcagcatct cctcccgcaa
      361 ctttgccacg atgatgaact tcgtgagagc tgtgataagc cagttccaga gacccagcac
      421 ccaggtgtgc tttgggggag ggaggctgct gggggtgggt gcttggatcc tggtgatagg
      481 cctcagccca gccctgtgtg cttctcccag ttttccctga tgcagttctc caacaaattc
      541 caaacacact tgactttcga ggaattcagg cgcacgtcaa accccctcag cctgttggct
      601 tctgttcacc agctgcaagg gtttacatac acggccaccg ccatccaaaa tgtcgtgtga
      661 gtcctgattt cttccaggca cagtcccaaa gcacccaggt cttcccttgg cctcatctga
      721 tctccacgag aaggggacag gcagggacca aaatccagcc cgtgataccc ttgccaagct
      781 ggggcctctg ggtgggactg gggcctccca aaggaaaagg catcttctaa ttttcacaag
      841 ggcaccaggg gctagtgtgg tttggttcac aggcctctaa gacctctcct ttcctgatag
      901 gcaccgattg ttccatgcct catatggggc ccgtagggat gccaccaaaa ttctcattgt
      961 catcactgat gggaagaaag aaggcgacac gctggattat aaggatgtca tccccatggc
     1021 tgatgcagca ggcatcatcc gctatgcaat tggggtaggc ctgggatggc ttcccacttc
     1081 tcccacggct tcctctcagg gcaactcccc tttctgtgta tgatgttctt ttctctttga
     1141 gacagggtct tgctctatca cccaggaagt ggtgcaatcc tagctcactg cagccttgaa
     1201 ctcctgggct ccagtgatcc tcccaccccg cctcccagta gtcgggacca caggtgtgtg
     1261 ccatcaagcc tggctatttt ctttttggtt gagatggggt cttgctatgt tgcccaggct
     1321 ggtctcaaat tcctggcctt aagcaattct gccaccttgg tctcccaaag gcacagggga
     1381 ttacaggcgt gaaccaccgc caacaacatc cctttcaagg atagaaacac cagctctctc
     1441 ggctcttact gccttaagga tgaaaactct gccccagact ggagaccatg atgatccttt
     1501 ctcctaaact ccctgatgct gtccgggctt cgtgtttctc ctgtgtccac cgggtgtgat
     1561 catgttgatc ttgtggggtt attggaagat gttgcaccca gtgcacacag gcacatttga
     1621 tttattattt ttactgagtt gatcttttct ggggacaggt tggattagct tttcaaaaca
     1681 gaaattcttg gaaagaatta aatgacattg catcgaagcc ctcccaggaa cacatattta
     1741 aagtggagga ctttgatgct ctgaaagata ttcaaaccca actgagggag aagatctttc
     1801 ccattgaggg tgagtctgaa gggagctctt cgcttgggga atcctcagcc gttaacacct
     1861 ttccacttag aacccgaggc tccgtgaaac aggtagacag cgtctcggtt ctcctgcttt
     1921 cccgggaccc cgatagccat gtctgtcagc ttgtccccac tgacgtcccc cagcactgtc
     1981 agagctgccc caaagtggcc ccagggatgg ccctgctccc cacagagagt gatctcacac
     2041 caccaccggc tccactgcag aacaaaagca gtccaggccc aacccaggag acccttccac
     2101 ccacaccggg ccctacccag cccacatccc accagccact cactcccctg ggcaaggggc
     2161 acacggacac ctggccccct cggtctgctt gtagacctgt ggggggccct gatgaggacc
     2221 agatcggtgc tgccatcgct gtccacatcc atggagcaga ggggggcccc gaagtcggag
     2281 ctgatctgga ggcagagcct ggtccctgtc acaggcacca gctctccctg tagcctccag
     2341 tcttagcttc tcctaaagct gaagtgttct tggacctggc aaagcccgtc tccctccctg
     2401 gcactcaagc gtcatgcctt accccaggta cggagaccac aagcagtagc tccttcgaat
     2461 tggagatggc acaggagggc ttcagcgctg tgttcacacc tgtgcgtggg gccccttagg
     2521 ccgatgatgt gccgtgaggg gagggggggc agggaaggcc agggtgggtg tcaggtgggt
     2581 aagaggcgca aggcggaagg catatctctg gtcatgctgt cttcctgctc tcggctctgc
     2641 tcagccctgg aatcctttct ccaggatggc cccgttctgg gggctgtggg gagcttcacc
     2701 tggtctggag gtgccttcct gtacccccca aatatgagcc ctaccttcat caacatgtct
     2761 caggagaatg tggacatgag ggactcttac ctgggtgaga aacagccagg ggttggggac
     2821 aggtgggaga tgcactgccc agggtggggt ccagggttct ggggaagggg taggggnatg
     2881 ggggctgtgc tgcccagtgt ggggcccagc ttctggggag ggaggatggg cactgtgctg
     2941 cccggggtgg gttccagggt tctggggagg gggaatgggg gctgtgctgc ctggggtggg
     3001 aatccagggt tctggggaga ggggatgggc gctgtgctgc ctggggtggg ttccagggtt
     3061 ctggggagag aggatggggg ctgcattgcc cagggtgggg tccagggttc tggggagggg
     3121 agatggtgct gtgctgcccg gggtgggaat ccagggttct ggggaggggg aatgggggcc
     3181 tttgtgctga ggcctgggcc cctcaggtta ctccaccgag ctggccctct ggaaaggggt
     3241 gcagagcctg gtcctggggg ccccccgcta ccagcacacc gggaaggctg tcatcttcac
     3301 ccaggtgtcc aggcaatgga ggatgaaggc cgaagtcacg gggactcagg ttgggcgtga
     3361 caggagccac aggccgggaa ttcagggtag gggaggtggc tgggcagaga agaggatgga
     3421 ggggctttga gggccttggg ggaggtcctg gtacctgggg agaggtggga cctggcccac
     3481 agggctgcct ctggcaggga caggcagcat gacccagctc tgcccttcag atcggctcct
     3541 acttcgggcc ctccctctgc tccgtggacg tagacagcga cggcagcacc gacctggtcc
     3601 tcatcgggcc cccccattac tacgagcaga cccgaggggc ccaggtgtct gtgtgtccct
     3661 tgcccagggg ggtgagtggc tgatgggcct ggtgtgtgtg gggtctggtg tgggtgaggg
     3721 gttgcccggg ttgggcctgg cactgttttt tttctgcagt ggagaaggtg gtggtgtgat
     3781 gctgttctct acggggagca gggccacccc tggggtcgct ttggggcggc tctgacagtg
     3841 ctgggggatg tgaatgggga caagctgaca gacgtggtca tcggggcccc aggagaggag
     3901 gagaaccggg gtgctgtcta cctgtttcac ggagtcttgg gacccagcat cagcccctcc
     3961 cacagccagg tgaggccgtg tcccatttct gtcactagag cagcctgctt cttgcctctc
     4021 ccactctgtc atactggaaa actgtccctt tttacctttt cctacctccc ttgcccagct
     4081 ctgagcacct tgtagcagtg gcgtggtctc agctcactgc aacctccgcc tcccaggttc
     4141 aagcgattct ctctgcctca gcctccagag tagctgggat tacaggcatg caccaccatg
     4201 tccggttatt ttttgtattt tagtagagac acgtttcgcc atgttggcta ggctggtctt
     4261 gaactcctga cctcaggtga tctgcctgtc tcggcctccc aaagtgctgg gattataggc
     4321 gtgagccgcc atgcccaggc ccctgccagt tttacaaggt acacaggtca ggcacagaaa
     4381 acccatttta cagatggaat ctgggacact aggaagacaa gggccttggt ttgttggagg
     4441 ttcagagtgg gtccgagatg gtgaaggaac tccggcctcc tgacctctaa cccggtgtgc
     4501 agtctcccgg ctccctgctg ctcaccactt aggtccagtc atttcaacct ccctccacct
     4561 gcccctctcc tccctggatg ctacatgatt ttattccctt cctgccatca aggtcccacc
     4621 aaatgcccat ccctgcagcc tccctccacc ccaagggtag cagggttccc tgagaacgaa
     4681 gggctgcctt tcttggcaaa agtcaagaaa gctctgttaa aaaataggca aagggcctgc
     4741 tccctggtgg ctcacatctg taattccgac actttgggag gctgaggcag gaggatcact
     4801 tgaggccagg agtttcaggc cagctgggca acataggggg accccatctc tagaaaaaat
     4861 ttt
//
LOCUS       HUMINT04     2746 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            16 - 21.
ACCESSION   M29484 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     4 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 382 to 2672; exons only)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2746)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 2782-2788 (1990)
  STANDARD  full staff_entry
REFERENCE   3  (bases 1 to 2746; exons and intron/exon boundaries)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 12750-12751 (1990)
  STANDARD  full staff_entry
REFERENCE   4  (bases 1 to 2746; exons and intron/exon boundaries; revises [3])
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +   382      512     integrin alpha subunit precursor, exon 16
                628      791     integrin alpha subunit precursor, exon 17
                924     1078     integrin alpha subunit precursor, exon 18
               1656     1787     integrin alpha subunit precursor, exon 19
               1875     1948     integrin alpha subunit precursor, exon 20
               2531  +  2672     integrin alpha subunit
    matp    +   382      512     integrin alpha subunit
                628      791     integrin alpha subunit
                924     1078     integrin alpha subunit
               1656     1787     integrin alpha subunit
               1875     1948     integrin alpha subunit
               2531  +  2672     integrin alpha subunit
    pre-msg <     1  >  2746     P150,95 mRNA and introns
    IVS     <     1      381     P150,95 intron O
    IVS         513      627     P150,95 intron P (no splice consensus)
    IVS         792      923     P150,95 intron Q
    IVS        1079     1655     P150,95 intron R
    IVS        1788     1874     P150,95 intron S
    IVS        1949     2530     P150,95 intron T
    IVS        2673  >  2746     P150,95 intron U
BASE COUNT      602 a    821 c    746 g    577 t
ORIGIN      Chromosome 16p11-13.1.
        1 gaattcctat cctgagcatg gctaaactct gagctaatag tatcattata gaaagatgag
       61 gaaacggagg cacagacaga ttgagtcctt gcccacggcc tcgtggctca tacgtggagg
      121 agtcagaatt ggaactagag actgatcgaa tgaatgacac tcgggtcacc aggacacctt
      181 cctatctcca ctcttacatc tgtttcttag caatcatctc ccaactccta cctcctcttt
      241 tcaggttctt cttggtgaca tctgttacaa ctcacccctt ctctcccttt ccgatggtcc
      301 tacctccata ttccccttgt tacttatttc caacttcttc cctagtttcc atcttgattc
      361 acccttctct cctctggcca gcggatcgcg ggctcccagc tctcctccag gctgcagtat
      421 tttgggcagg cactgagcgg gggtcaagac ctcacccagg atggactggt ggacctggct
      481 gtgggggccc ggggccaggt gctcctgctc aggtgagagc agactttctc agaggctccc
      541 catgtggtcc taggttcaga tgggggtgcc cacccacgtg gtgctcccac cagcgacggc
      601 tgtcctcagc tcggtgctct gcccgcagac cagacctgtg ctctgggtgg gggtgagcat
      661 gcagttcata cctgccgaga tccccaggtc tgcgtttgag tgtcgggagc aggtggtctc
      721 tgagcagacc ctggtacagt ccaacatctg cctttacatt gacaaacgtt ctaagaacct
      781 gcttgggagc cgtgagtccc ctcccctcca acccaggaca ccctgacctc tggagtcccc
      841 catcccaggc ccctgtctcc caccctgctc attgtccacc caaggagttc ctgtctcaac
      901 gccgtccctg cgaccgccta caggtgacct ccaaagctct gtgaccttgg acctggccct
      961 cgaccctggc cgcctgagtc cccgtgccac cttccaggaa acaaagaacc ggagtctgag
     1021 ccgagtccga gtcctcgggc tgaaggcaca ctgtgaaaac ttcaacctgc tgctcccggt
     1081 gcgtctgggc atgaacgtgg gtggcggccg cgctggggct ggcagaaggc agggcaggga
     1141 gagaacaggc tgtgttccgg cctccctgtg gctcagccca gcacaggacc agccatgcag
     1201 gacgtgctta ctgcacgtta gccagtgagt gagtgagcga gcaaacaagt gatgagatcg
     1261 tctgcaattt ccagggccac acgattggat ttcaggaaag agaattgggc aacctgagag
     1321 agctctgggc ttaccttctg gcttttcagg cattcactga cagggttatc gagctgctcc
     1381 tggagacagc cttgcctggg ccatgggcat aggtggccaa aacagtcatt gctgatcggg
     1441 aggtctgggg gggggaggaa aaaaacaaag acaaacaagg ggagaggaca gagagggtgt
     1501 cagggaggca tcctgaaggc ggtgacgctg agcaggctct ggaggaagtg aagcagagcg
     1561 ggagctgggc agaggcagga taagaactgc ggatgaggcc gagcgcagct cttaccctcc
     1621 ccttaccctc gctccccgcg acgcccgtcc cccagagctg cgtggaggac tctgtgaccc
     1681 ccattacctt gcgtctgaac ttcacgctgg tgggcaagcc cctccttgcc ttcagaaacc
     1741 tgcggcctat gctggccgcc gatgctcaga gatacttcac ggcctccgtg agtcctggca
     1801 ctgggtctcc cagagagggt gcacagcgtg gggcctgggt ctcggagaaa accccccgtt
     1861 gccttcccac gcagctaccc tttgagaaga actgtggagc cgaccatatc tgccaggaca
     1921 atctcggcat ctccttcagc ttcccagggt gagcgcccca ccttagatgc cctactgccc
     1981 cagcctcctt cctggaatct gggactcctg cctctgctct ccctaacatt gtctcatcct
     2041 atagtcaaaa cccaggtgtc ttggctgggc acagtggctc actcctgtaa tccagcactt
     2101 tgggaggccg aggtgggagg acttttgagg ccaggagtta gggttacgac ctgggcaaca
     2161 gagcgacacc catttccaca aaaacaaaac aacaacaaca acaacaacaa caacaacaac
     2221 aacaacatca cttgagtgtg gtagagcatg cctatagtcc cagctacttg ggaggctgaa
     2281 gcttaaggct tgcttgagct ctggagttgg aggtctgcag tgagccataa tcacaccact
     2341 gcactccagc ctgggtgaaa gagcaggact ctgtctctta aaaaaaaaga agaagaagaa
     2401 gaagaagaag aagaacccag gggtccgtcc cctgtctatc tcccaaatcc ccacccaccc
     2461 cattttatcc cagaccattt ctagcctcag tcacagaatc atcttatcct ttccttcacc
     2521 tgatacccag cttgaagtcc ctgctggtgg ggagtaacct ggagctgaac gcagaagtga
     2581 tggtgtggaa tgacggggaa gactcctacg gaaccaccat caccttctcc caccccgcag
     2641 gactgtccta ccgctacgtg gcagagggcc aggtgcacct ctggggaagg aggaggaggc
     2701 agggctgggc gttagcgtag attcccgtgc gggttcagaa cccggg
//
LOCUS       HUMINT05     1006 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            22 - 24.
ACCESSION   M29485 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     5 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 202 to 665)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1006)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 2782-2788 (1990)
  STANDARD  full staff_entry
REFERENCE   3  (bases 1 to 1006; exons and intron/exon boundaries)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 12750-12751 (1990)
  STANDARD  full staff_entry
REFERENCE   4  (bases 1 to 1006; exons and intron/exon boundaries; revises [3])
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +   203      319     integrin alpha subunit precursor, exon 22
                404      483     integrin alpha subunit precursor, exon 23
                582  +   665     integrin alpha subunit precursor, exon 24
    matp    +   203      319     integrin alpha subunit
                404      483     integrin alpha subunit
                582  +   665     integrin alpha subunit
    pre-msg <     1  >  1006     P150,95 mRNA and introns
    IVS     <     1      202     P150,95 intron U
    IVS         320      403     P150,95 intron V
    IVS         484      581     P150,95 intron W
    IVS         666  >  1006     P150,95 intron X
BASE COUNT      228 a    286 c    229 g    263 t
ORIGIN      Chromosome 16p11-13.1.
        1 ttctatcctg gtgacagagt gagacctggt ctcaaaacaa acaaacaaac aaaatataag
       61 cttaaggtgg gctccaggaa gctttatcac tacttcgtgg cgtgtctttg gaatgctgtt
      121 atattaggtt ggtgcaaaag taattgggtt tttgccattg ctttcaattt caactaatac
      181 tcctctactt tctcatgcct agaaacaagg gcagctgcgt tccctgcacc tgacatgtga
      241 cagcgcccca gttgggagcc agggcacctg gagcaccagc tgcagaatca accacctcat
      301 cttccgtggc ggcgcccagg tcagcctggc ttctgtcccc tcactgctcc cctgccccac
      361 cctgtcttta ctgctctgtg acctctcagt tccttttcct cagatcacct tcttggctac
      421 ctttgacgtc tcccccaagg ctgtcctggg agaccggctg cttctgacag ccaatgtgag
      481 caggtgagcc gggccatggc caggggcagt gcctcatctc cagcctcaca ccccattctc
      541 ctctggggcc tctggcaact gagtctctcc tctttctcca gtgagaacaa cactcccagg
      601 accagcaaga ccaccttcca gctggagctc ccggtgaagt atgctgtcta cactgtggtt
      661 agcaggtcac aggtacccac tgcaggaaaa agggttcttc tctctgaccc tcaaaaagaa
      721 aaaaaaaaaa aaggccttga aacgctgcca cagagggtga gataaggtgt ttgaaagtaa
      781 aaggtcaggt gtttcagaag acaccttcct tcagccaatg ccttcctcga atttgctgtg
      841 tgccaggcag ggtgctgtgg ttattttcca tacattcatt tgacattcat tgaagattta
      901 ctgagccccc attatgtgtg atcaaaccag acatgaaccc tcgccttgtg ggtgtgcctt
      961 gctggatgtc tcctgtgttc cactctcact gcactgcatg ctgagt
//
LOCUS       HUMINT06     1904 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exons
            25 - 30.
ACCESSION   M29486 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     6 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 86 to 1528; exons only)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1904)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 2782-2788 (1990)
  STANDARD  full staff_entry
REFERENCE   3  (bases 1 to 1904; exons and intron/exon boundaries)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 12750-12751 (1990)
  STANDARD  full staff_entry
REFERENCE   4  (bases 1 to 1904; exons and intron/exon boundaries; revises [3])
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +    86      164     integrin alpha subunit precursor, exon 25
                280      387     integrin alpha subunit precursor, exon 26
                505      588     integrin alpha subunit precursor, exon 27
                788      901     integrin alpha subunit precursor, exon 28
               1044     1145     integrin alpha subunit precursor, exon 29
               1417  +  1527     integrin alpha subunit precursor, exon 30
    matp    +    86      164     integrin alpha subunit
                280      387     integrin alpha subunit
                505      588     integrin alpha subunit
                788      901     integrin alpha subunit
               1044     1145     integrin alpha subunit
               1417  +  1527     integrin alpha subunit
    pre-msg <     1  >  1904     P150,95 mRNA and introns
    IVS     <     1       85     P150,95 intron X
    IVS         165      279     P150,95 intron Y
    IVS         388      504     P150,95 intron Z
    IVS         589      787     P150,95 intron AA
    IVS         902     1043     P150,95 intron AB
    IVS        1146     1416     P150,95 intron AC
    IVS        1528  >  1903     P150,95 intron AD
BASE COUNT      381 a    545 c    525 g    453 t
ORIGIN      Chromosome 16p11-13.1.
        1 accacctgtc ctctcatgct ctagccaatg ccttctgcag atgcccatgg tagttcacat
       61 ccacttatgc gtcttctctc tccagccacg aacaattcac caaatacctc aacttctcag
      121 agtctgagga gaaggaaagc catgtggcca tgcacagata ccaggtcagg tggtggtgta
      181 cgcaggaaga ccttgggcat ggggtgggag gctgggtagc cggagactgg ggagggattt
      241 ggctttggcg tggctctgcc ctcagtgccc tctgtgcagg tcaataacct gggacagagg
      301 gacctgcctg tcagcatcaa cttctgggtg cctgtggagc tgaaccagga ggctgtgtgg
      361 atggatgtgg aggtctccct cccccaggta cccaaggact gcatgtggct cctccacgaa
      421 tgccctttct acctggattc cttgtgcccc atgtgggtcc ctgatgtccc agctgagaca
      481 cttgttctct gcattttccc ccagaaccca tcccttcggt gctcctcaga gaaaatcgcg
      541 ggcccagcat ctgacttcct ggcgcacatt cagaagaatc ccgtgctggt gaggagggct
      601 ctgggtctgg ccctcactgt aggcccacat cagaggaatt taacccagga gttcatgttc
      661 catatccatc ctgctgaagt accctcttgc attcggatat ggccgctgcc ctcaagtcac
      721 acgcataatg ctgcctccca ccttcacact catctttctc agccccatgc tatttatctg
      781 cccccaggac tgctccattg ctggctgcct gcggttccgc tgtgacgtcc cctccttcag
      841 cgtccaggag gagctggatt tcaccctgaa gggcaacctc agctttggct gggtccgcca
      901 ggtgtgtggg tgcaacgaca gagcccctgc cccagactca ggcgggacct ggcatgtctg
      961 tgcccatctg caagccaggg cacccccaga gctctgagcc tcccccagag ccagttcaac
     1021 aggtttcccc cacccctttg cagatattgc agaagaaggt gtcggtcgtg agtgtggctg
     1081 aaattacgtt cgacacatcc gtgtactccc agcttccagg acaggaggca tttatgagag
     1141 ctcaggtaga gaccatgtgg agggcagcga ccaggcagga aagagggtcc caagggctac
     1201 atctgtggtg ctgggtgggg ggtttgcaag ccttggggga ggagggtgaa ggcctctggg
     1261 caggatagct gtccctaagg gcacgggtgc tgctgtgtct cacctcttgg agcagggcct
     1321 ggggaaggag gggagggagt taaaggttgg ggagcctggg aggagtctgg gatagtagga
     1381 ggatgggagt ctctgacagg gtcacttcca cttcagacga caacggtgct ggagaagtac
     1441 aaggtccaca accccacccc cctgatcgta ggcagctcca ttgggggtct gttgctgctg
     1501 gcactcatca cagcggtact gtacaaagtg agtgttttat gccacccttg acaccaccag
     1561 catctggtcc cgctcttttt gcagagtgag aaggagctca ctttgaaggc agaggcacat
     1621 tcttactggg tcacttcata tgagaaactg cttcccacct gcaatgtcac cgtgccccag
     1681 tggccccctg ctttgtgatt cccaggcttc ctctaatatt tctccctttc tttcctgctc
     1741 ttctccatca ttctacgtgt tcctgacagc agattatcat ataaaagcac agacctgggt
     1801 tgaatgcgac atcaccacgg gttcttttgt cttgaccata ggccagtgtc tgctccactc
     1861 tgggccttga tttccatgtg aggtgatatc acccagctca taga
//
LOCUS       HUMINT07      653 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human leukocyte adhesion protein p150,95 alpha subunit gene, exon
            31.
ACCESSION   M29487 Y00093
KEYWORDS    integrin; leukocyte adhesion glycoprotein; protein p150,95.
SEGMENT     7 of 7
SOURCE      Human DNA, (library pWE15), clone 30.1, and cell line HL-60, cDNA
            to mRNA, clone lambda-X47.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 44 to 149)
  AUTHORS   Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A.
  TITLE     CDNA cloning and complete primary structure of the alpha subunit of
            a leukocyte adhesion glycoprotein
  JOURNAL   EMBO J. 6, 4023-4028 (1987)
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 653; revises [1])
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 2782-2788 (1990)
  STANDARD  full staff_entry
REFERENCE   3  (bases 1 to 653; exons and intron/exon boundaries)
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  TITLE     Genomic structure of an integrin alpha subunit, the leukocyte
            p150,95 molecule
  JOURNAL   J. Biol. Chem. 265, 12750-12751 (1990)
  STANDARD  full staff_entry
REFERENCE   4  (bases 1 to 653; exons and intron/exon boundaries; revises [3])
  AUTHORS   Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by J.Garcia-Aguilar, 19-OCT-1989.
FEATURES       from  to/span     description
    pept    +    44      148     integrin alpha subunit precursor, exon 31
    matp    +    44      145     integrin alpha subunit
    pre-msg <     1  >   149     P150,95 mRNA and introns
    IVS     <     1       85     P150,95 intron AC
BASE COUNT      126 a    195 c    150 g    182 t
ORIGIN      Chromosome 16p11-13.1.
        1 actgaatggg cttcctgagt ttcttcttcg tcctcccccc taggttggct tcttcaagcg
       61 tcagtacaag gaaatgatgg aggaggcaaa tggacaaatt gccccagaaa acgggacaca
      121 gacccccagc ccgcccagtg agaaatgatc cctctttgcc ttggacttct tctcccgcga
      181 ttttccccac ttacttaccc tcacctgtca ggctgacggg gaggaaccac tgcaccaccg
      241 agagaggctg ggatgggcct gcttcctgtc tttgggagaa aacgtcttgc ttgggaaggg
      301 gcctttgtct tgtcaaggtt ccaactggaa acccttagga cagggtccct gctgtgttcc
      361 ccaaaaggac ttgacttgca atttctacct agaaatacat ggacaatacc cccaggcctc
      421 agtctccctt ctcccatgag gcacgaatga tctttctttc ctttcctttt tttttttttt
      481 cttttctttt tttttttttt tgagacggag tctcgctctg tcacccaggc tggagtgcaa
      541 tggcgtgatc tcggctcgct gcaacctccg cctcccgggt tcaagtaatt ctgctgtctc
      601 agcctcctgc gtagctggga ctacaggcac acgccacctc gcccggcccg atc
//
LOCUS       PEAHSP177A    772 bp ss-mRNA            PLN       09-AUG-1990
DEFINITION  Pisum sativum 17.7 kDa heat shock protein (hsp17.7) mRNA,
            complete cds.
ACCESSION   M33901
KEYWORDS    heat shock protein.
SOURCE      P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 772)
  AUTHORS   Lauzon,L.M., Helm,K. and Vierling,E.
  TITLE     A cDNA clone from Pisum sativum encoding a low molecular weight
            heat shock protein
  JOURNAL   Nucleic Acids Res. 18, 4274-4274 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Vierling, 01-MAY-1990.
               University of Arizona
               Department of Biochemistry Biological Sciences West Building
               Tucson, AZ 85721
FEATURES       from  to/span     description
    pept         65      523     17.7 kDa heat shock protein (hsp17.7)
BASE COUNT      254 a    127 c    170 g    221 t
ORIGIN      
        1 caaaaatcaa aacgtgcgac aaacacaaaa tcatcccaca aagaaagcaa tggatttcag
       61 gctaatggat ttggattctc cactcttcaa cactctccat catataatgg acctcaccga
      121 cgacacaacc gagaagaact taaacgctcc aactcgaaca tatgtccgtg acgcaaaggc
      181 aatggctgca actccagcgg acgtgaaaga gcatccaaat tcatacgtgt ttatggtgga
      241 catgcctggg gtgaaatctg gtgacataaa ggttcaggtg gaagatgaga atgtgctatt
      301 gataagtggc gagaggaaga gagaagaaga gaaagaaggt gttaaatatt tgaagatgga
      361 aagaaggatt ggtaagttga tgaggaaatt tgtgttacct gagaatgcga atattgaagc
      421 tatctctgct atttctcaag atggtgttct tacggttaca gttaataaat tgcctccacc
      481 tgaacctaag aaaccaaaaa ctattcaagt taaggttgct tgatcggtgt acgatttcat
      541 gtcaacaaat cagaaggaat gtttgtcttt ttagttggtt tgtgtagcaa tggttttgtg
      601 tgttttcgcc tagttggccc tatatatgat gatcatcatg cgatgtaatt tgtaacaata
      661 tgacatgaat gaattttaat tacttggttt ttctgcttgt aacattgttg cgttgccccc
      721 atgataaaat tgagaaactg aagtattaaa gaaaagaaaa tgtttcattt ac
//
LOCUS       PEAHSP179A    700 bp ss-mRNA            PLN       09-AUG-1990
DEFINITION  Pisum sativum 17.9 kDa heat shock protein (hsp17.9) mRNA,
            complete cds.
ACCESSION   M33900
KEYWORDS    heat shock protein.
SOURCE      P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 700)
  AUTHORS   Lauzon,L.M., Helm,K. and Vierling,E.
  TITLE     A cDNA clone from Pisum sativum encoding a low molecular weight
            heat shock protein
  JOURNAL   Nucleic Acids Res. 18, 4274-4274 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Vierling, 01-MAY-1990.
               University of Arizona
               Department of Biochemistry Biological Sciences West Building
               Tucson, AZ 85721
FEATURES       from  to/span     description
    pept    <     1      469     17.9 kDa heat shock protein (hsp17.9)
BASE COUNT      209 a    123 c    170 g    198 t
ORIGIN      
        1 gataattcca agagtcttcg gtactggacg aagaaccaat gcattcgatc cattctcatt
       61 agatttatgg gacccattcc agaacttcca actcgcaaga tccgccaccg gaaccaccaa
      121 cgagacggca gcttttgcca acgctcacat tgactggaag gaaacaccgg aggctcacgt
      181 gttcaaggct gatcttcccg gagtgaagaa ggaagaagtg aaagttgaaa tagaagaaga
      241 tcgtgtgctc aagataagcg gagagaggaa aactgaaaag gaagacaaga acgacacctg
      301 gcaccgtgtt gagcgtagtc aggggagttt cctccgccgt ttcaggttgc cggaaaatgc
      361 taaagttgat caggtgaagg ctgctatgga aaacggtgtt cttaccgtta ctgttcctaa
      421 agaggaggtt aagaagcctg aagctaagcc cattcagatt acaggatgag ctcttattct
      481 tcctatattt tgatgtttgt gtctcttaat aaaatgttaa aataaaacaa ataataattg
      541 tgtgtagtcg agttccagct ttaagagatt gagacatgta tggacttggc tattacttaa
      601 gtgtagtagt ttgtgagtat tttgttgggt tatgttagtg tgtatgcaaa taactttttt
      661 gagtatgtga aagtttcttt tgattaagct gtatttatcc
//
LOCUS       PEAHSP181A    862 bp ss-mRNA            PLN       09-AUG-1990
DEFINITION  Pisum sativum 18.1 kDa heat shock protein (hsp18.1) mRNA,
            complete cds.
ACCESSION   M33899
KEYWORDS    heat shock protein.
SOURCE      P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 862)
  AUTHORS   Lauzon,L.M., Helm,K. and Vierling,E.
  TITLE     A cDNA clone from Pisum sativum encoding a low molecular weight
            heat shock protein
  JOURNAL   Nucleic Acids Res. 18, 4274-4274 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Vierling, 01-MAY-1990.
               University of Arizona
               Department of Biochemistry Biological Sciences West Building
               Tucson, AZ 85721
FEATURES       from  to/span     description
    pept         12      488     18.1 kDa heat shock protein (hsp18.1)
BASE COUNT      278 a    128 c    207 g    249 t
ORIGIN      
        1 ctatatcaaa catgtctctg attccaagtt tctttagtgg ccgaaggagc aatgttttcg
       61 atcctttctc cctggacgtc tgggatcctt tgaaggactt tccattttca aattcttcac
      121 cttccgcttc attccctcgt gagaatcctg cttttgtgag cacacgagtt gactggaagg
      181 aaacaccgga agcgcatgtt ttcaaggctg atcttcctgg gctgaaaaag gaggaagtga
      241 aagttgaagt tgaagatgat agggttctac agataagcgg agagagaagc gttgagaaag
      301 aagataagaa tgatgaatgg catcgcgtgg aacgtagcag tggaaagttc ttaagaaggt
      361 tcagattgcc tgagaatgct aaaatggata aagtgaaagc ttccatggag aacggcgttc
      421 tgacagtgac cgttccaaaa gaagagataa agaaggctga ggttaagtct attgagattt
      481 ctggttaaac ttagaatgag ctatgttact ctgttgcttt tcttggttat aatgttttcc
      541 tttttgtggc gtgtgcaaga aataaatggt catgtaattc tgaaatgtta atgtataaat
      601 aaataagtaa acagttgttg ttggttattc agaggtgtta tagtattcat attgtaatgt
      661 atcagaatga atcttgagaa aagagctgct ataaatagag cttgaagttt taaataaaaa
      721 aaaaggttcc agaaaggaat aaaaaactgg taacagctag cagagagaaa aagctcaaac
      781 cactgtgtta aggtgaacag cggaagaaaa tgaagagatg ttcatagccc ttcttcttga
      841 gtctctccaa gatggagaat tc
//
LOCUS       PEAHSP227A    795 bp ss-mRNA            PLN       09-AUG-1990
DEFINITION  Pisum sativum 22.7 kDa heat shock protein (hsp22.7) mRNA,
            complete cds.
ACCESSION   M33898
KEYWORDS    heat shock protein.
SOURCE      P.sativum (cv Little Marvel) leaf, cDNA to mRNA.
  ORGANISM  Pisum sativum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 795)
  AUTHORS   Lauzon,L.M., Helm,K. and Vierling,E.
  TITLE     A cDNA clone from Pisum sativum encoding a low molecular weight
            heat shock protein
  JOURNAL   Nucleic Acids Res. 18, 4274-4274 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Vierling, 01-MAY-1990.
               University of Arizona
               Department of Biochemistry Biological Sciences West Building
               Tucson, AZ 85721
FEATURES       from  to/span     description
    pept         44      637     22.7 kDa heat shock protein (hsp22.7)
BASE COUNT      261 a    122 c    170 g    242 t
ORIGIN      
        1 ccaagttcca aacctcaaga acaaaaaaca cacatttcta agtatgagtc tgaaacctct
       61 aaacatgtta ctcgttccat ttcttctgct tattctcgcg gctgattttc ctttgaaagc
      121 aaaagcatca ctactaccat tcatagattc tcccaacact ctcttatcgg atctctggtc
      181 tgatcgtttc ccagatccgt ttcgcgtctt agaacaaatt ccctatggag ttgagaaaca
      241 cgaaccatcc ataacattgt cacatgctag agtagactgg aaggaaactc cagagggaca
      301 tgtgataatg gtggacgtgc ctgggttgaa aaaagatgat ataaagatag aagtggaaga
      361 gaatagggtg ctaagagtga gtggtgagag gaagaaagaa gaagataaaa aaggagatca
      421 ttggcacaga gttgaaagat cttatggaaa gttctggagg cagtttaaat tacctcaaaa
      481 tgttgatttg gattctgtca aagctaaaat ggaaaacggt gttcttactt taactcttca
      541 taagttgtcg catgataaga ttaaaggtcc tagaatggtt agtattgtgg aagaggatga
      601 caaaccatct aagatcgtca atgatgagtt gaaataatta tgtgatttgt actcataaaa
      661 atgaaaaatg ttttttcatt gtgttatttg tgaataaagg aatgttacct atgatattgg
      721 ttgtttgttg tatgtcaact aaagagtgct gtaaaggctt gttaatttca tagtgaataa
      781 cttgttggct tttgt
//
LOCUS       ECOHGRF       140 bp ds-DNA             SYN       09-AUG-1990
DEFINITION  Synthetic human growth hormone releasing factor (hGRF) gene,
            complete cds.
ACCESSION   M26106
KEYWORDS    growth hormone releasing factor; somatocrinin.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 140)
  AUTHORS   Cravador,A., Jacobs,P., Van Elsen,A., Lacroix,C., Colau,B.,
            Van Alphen,P., Herzog,A. and Bollen,A.
  TITLE     Total DNA synthesis and cloning in Escherichia coli of a gene
            coding for the human growth hormone releasing factor
  JOURNAL   Biochimie 67, 829-834 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          2      139     synthetic human growth hormone releasing
                                 factor (hGRF)
BASE COUNT       30 a     35 c     36 g     39 t
ORIGIN      
        1 catgtacgct gacgctatct tcactaactc ttaccgtaaa gttctgggtc agctgtctgc
       61 tcgtaaactg ctgcaggaca tcatgtctcg tgagcagggt gaatctaacc aggaacgtgg
      121 tgctcgtgct cgtctgtaag
//
LOCUS       HUMACALX      724 bp ss-mRNA            PRI       09-AUG-1990
DEFINITION  Human calcitonin mRNA, complete cds.
ACCESSION   M26095
KEYWORDS    calcitonin.
SOURCE      Human cell-line BEN, cDNA to mRNA, clone hBEN-JR2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 724)
  AUTHORS   Craig,R.K., Riley,J.H., Edbrooke,M.R., Broad,P.M., Foord,S.M.,
            Al-Kazwini,S.J., Holman,J.J. and Marshall,I.
  TITLE     Expression and function of the human calcitonin/alpha-CGRP gene in
            health and disease
  JOURNAL   Biochem. Soc. Symp. 52, 91-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         35      460     calcitonin precursor
    sigp         35      109     calcitonin signal peptide
    matp        287      382     calcitonin
    matp        383      457     flanking peptide
BASE COUNT      163 a    195 c    200 g    166 t
ORIGIN      
        1 ggtgagcccc gagattctgg ctcagagagg tgtcatgggc ttccaaaagt tctccccctt
       61 cctggctctc agcatcttgg tcctgttgca ggcaggcagc ctccatgcag caccattcag
      121 gtctgccctg gagagcagcc cagcagaccc ggccacgctc agtgaggacg aagcgcgcct
      181 cctgctggct gcactggtgc aggactatgt gcagatgaag gccagtgagc tggagcagga
      241 gcaagagaga gagggctcca gcctggacag ccccagatct aagcggtgcg gtaatctgag
      301 tacttgcatg ctgggcacat acacgcagga cttcaacaag tttcacacgt tcccccaaac
      361 tgcaattggg gttggagcac ctggaaagaa aagggatatg tccagcgact tggagagaga
      421 ccatcgccct catgttagca tgccccagaa tgccaactaa actcctccct ttccttccta
      481 atttcccttc ttgcatcctt cctataactt gatgcatgtg gtttggttcc tctctggtgg
      541 ctctttgggc tggtattggt ggctttcctt gtggcagagg atgtctcaaa cttcagatgg
      601 gaggaaagag agcaggactc acaggttgga agagaatcac ctgggaaaat accagaaaat
      661 gagggccgct ttgagtcccc cagagatgtc atcagagctc ctctgtcctg ctttctgaat
      721 gtgc
//
LOCUS       HUMCALARP     234 bp ds-DNA             PRI       09-AUG-1990
DEFINITION  Human calcitonin gene, exon 5.
ACCESSION   M26094
KEYWORDS    calcitonin.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 234)
  AUTHORS   Craig,R.K., Riley,J.H., Edbrooke,M.R., Broad,P.M., Foord,S.M.,
            Al-Kazwini,S.J., Holman,J.J. and Marshall,I.
  TITLE     Expression and function of the human calcitonin/alpha-CGRP gene in
            health and disease
  JOURNAL   Biochem. Soc. Symp. 52, 91-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    36      195     calcitonin precursor, exon 5 (AA at 37)
    matp         54      165     alpha-calcitonin related peptide
    matp        166      192     carboxyl-terminal-flanking peptide (PDN-21)
    IVS     <     1       35     alpha-calcitonin related peptide intron C
BASE COUNT       59 a     63 c     61 g     51 t
ORIGIN      
        1 cagatcttct cttctttctc catcctgcaa atcagaatca ttgcccagaa gagagcctgt
       61 gacactgcca cctgtgtgac tcatcggctg gcaggcttgc tgagcagatc agggggtgtg
      121 gtgaagaaca actttgtgcc caccaatgtg ggttccaaag cctttggcag gcgccgcagg
      181 gaccttcaag cctgagcagc tgaacgactc aagaaggtca caataaagct gaac
//
LOCUS       PIPVGB       1883 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  Plasmid pIP630 (from S. aureus) virginiamycin B hydrolase (vgb)
            gene, complete cds.
ACCESSION   M36022
KEYWORDS    virginiamycin B hydrolase; virginiamycin-resistance.
SOURCE      Plasmid pIP630 (from Staphylococcus aureus) DNA.
  ORGANISM  Plasmid pIP630
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1883)
  AUTHORS   Allignet,J., Loncle,V., Mazodier,P. and El Solh,N.
  TITLE     Nucleotide sequence of a Staphylococcal plasmid gene, vgb, encoding
            a hydrolase inactivating the B components of virginiamycin-like
            antibiotics
  JOURNAL   Plasmid 20, 271-275 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        641     1540     virginiamycin B hydrolase
BASE COUNT      641 a    284 c    375 g    583 t
ORIGIN      
        1 agatctacgg attttcgcca tgccacgaaa ttagcatcat gctagcaagt taaacgaaca
       61 ctgacatgat atattagtgg ttagctatat ttttttactt tgcaacagaa ccattattat
      121 ggtttcttaa aaaaatacaa tgctttttcg ttccttttta ttcatcttcc aattctttgg
      181 catgactgtg tgcattttaa atttgttcag caaatgtgcc gtgtaatgga atacttttta
      241 aatactgtgt aatgataatg caaggcacat actaaaagga atcttcgatt ttgttggctt
      301 attatttgac ttttcataac aattatctta aggttaaaca aatcaataat cgaaagggtg
      361 aaaaaaagca catgatcata taatcctaat tttaaaagaa atcgatattt tggccttggg
      421 ttcaatttca aagtggtttt ggaatgaact ctatttgtta tcggcttttt tctgagatag
      481 gattaatgta atgtgctttt ttggctttaa aaagaccttt gttatccaaa aagtcttttt
      541 aagtgtcctt atccgtgcca cattgcctcc tatctcgaaa aaagagatgg aggctatttt
      601 tgttttggaa atttaattta aataaaacgg aggggataga atggaattta aattacaaga
      661 attaaatctt actaaccaag atacaggacc atatggtata accgtttcag ataaggggaa
      721 agtttggatt acacaacata aagcaaatat gataagttgc atcaatttag atggaaaaat
      781 tacagagtac ccactaccga caccagatgc aaaagtcatg tgtttaacta tatcctcaga
      841 tggggaagtt tggtttactg agaatgcagc aaacaaaata gggaggatta caaaaaaagg
      901 gattattaag gaatatacat tgcctaaccc agattcagca ccctacggta ttacagaagg
      961 accaaatgga gatatatggt ttacagaaat gaatggcaac cgtattggac gtattacgga
     1021 cgacggtaaa attcgtgaat acgagctgcc taataaagga tcttaccctt cttttatcac
     1081 tttgggttct gataatgccc tgtggttcac agaaaatcaa aataatgcta ttggtagaat
     1141 tacagaaagt ggggatatta cagagtttaa aattcctaca cctgcatcag gaccagttgg
     1201 tattacaaag gggaacgacg atgctttatg gtttgtggaa attatcggta ataagatagg
     1261 gcgaataact cctctggggg aaattaccga attcaaaatt ccaacgccaa acgctcgacc
     1321 tcatgcaatt actgctggag caggaattga tttatggttt actgaatggg gggctaataa
     1381 aataggaagg ctgacaagca ataatataat tgaggaatac ccaattcaaa tcaaaagtgg
     1441 tgaaccacat ggcatttgtt tcgatggtga aacaatttgg tttgcaatgg agtgtgacaa
     1501 gataggcaaa ttaactctca ttaaggataa tatggagtga gtcttttgaa tttaaacaat
     1561 gaccatggac ctgatcccga aaatatttta ccgataaaag ggaatcggaa tcttcaattt
     1621 ataaaaccta ctataacgaa cgaaaacatt ttggtggggg aatattctta ttatgatagt
     1681 aagcgaggag aatcctttga agatcaagtc ttatatcatt atgaagtgat tggagataag
     1741 ttgattatag gaagattttg ttcaattggt cccggaacaa catttattat gaatggtgca
     1801 aaccatcgga tggatggatc aacatatcct tttcatctat tcaggatggg ttgggagaag
     1861 tatatgcctt ccttaaaaga tct
//
LOCUS       ECOLIVHMGF   8703 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  E.coli leucine-specific transport (LS-BP; LIV-BP) system (livHMGF)
            genes, complete cds.
ACCESSION   J05516 M13166 M10426 M10427 K02178
KEYWORDS    heat shock protein;
            high affinity branched-chain amino acid transport system;
            htpR gene; isoleucine binding protein; leucine binding protein;
            leucine binding protein; livJ gene; livK gene;
            valine binding protein.
SOURCE      E.coli (K12 strain AE404) isolate W3110 DNA, clone pOX[1,15].
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 90 to 1312)
  AUTHORS   Landick,R., Vaughn,V., Lau,E.T., VanBogelen,R.A., Erickson,J.W. and
            Neidhardt,F.C.
  TITLE     Nucleotide sequence of the heat shock regulatory gene of E. coli
            suggests its protein product may be a transcription factor
  JOURNAL   Cell 38, 175-182 (1984)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1312; revises [1])
  AUTHORS   Vaughn,V.
  JOURNAL   Unpublished (1985) Univ Michigan Med School, Ann Arbor MI 48109
  STANDARD  full staff_review
REFERENCE   3  (bases 1407 to 2507 and 3503 to 4609)
  AUTHORS   Landick,R. and Oxender,D.L.
  TITLE     The complete nucleotide sequences of the Escherichia coli LIV-BP
            and LS-BP genes: Implications for the mechanism of high-affinity
            branched-chain amino acid transport
  JOURNAL   J. Biol. Chem. 260, 8257-8261 (1985)
  STANDARD  full staff_review
REFERENCE   4  (bases 4610 to 5696)
  AUTHORS   Nazos,P.M., Antonucci,T.K., Landick,R. and Oxender,D.L.
  TITLE     Cloning and characterization of livH, the structural gene encoding
            a component of the leucine transport system in Escherichia coli
  JOURNAL   J. Bacteriol. 166, 565-573 (1986)
  STANDARD  simple staff_review
REFERENCE   5  (bases 1136 to 8703)
  AUTHORS   Adams,M.D., Wagner,L.M., Graddis,T.J., Landick,R., Antonucci,T.K.,
            Gibson,A.L. and Oxender,D.L.
  TITLE     Nucleotide sequence and genetic characterization reveal six
            essential genes for the LIV-I and LS transport systems of
            Escherichia coli
  JOURNAL   J. Biol. Chem. 265, 11436-11443 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and sequence in computer readable form for [2] kindly
            provided by V.Vaughn, 15-NOV-1985.
            
            Draft entry and computer-readable sequence for [1] kindly submitted
            by M.D.Adams 19-APR-1990, for release after publication.
            
            The htpR (also known as "hin") gene product appears homologous to
            the sigma factor of RNA polymerase, and the two proteins are
            predicted to have similar secondary structures.  In addition, two
            regions of the predicted htpR product resemble protein-DNA contact
            points conserved in known DNA-binding proteins.
            
            The htpR gene encodes a protein, which appears to be identical in
            size (33 kd by migration on two dimensional polyacrylamide gel) and
            isoelectric point with the protein, F33.4, normally present in
            E.coli but deficient in an htpR mutant.
            
            A region homologous to the rpoD gene is located at positions
            508-549 [2].
            
            The E.coli LIV-I and LS AA transport systems are high-affinity,
            periplasmic, binding protein-dependent systems that utilize the
            leucine-, isoleucine-, valine-binding protein (LIV-BP) and
            leucine-specific binding protein (LS-BP), respectively.  These two
            binding proteins interact with a common set of membrane proteins to
            transport branched-chain AAs into the cytoplasm.  The two BP genes
            are encoded in a regulon that also contains the genes for the
            common membrane protein components.
FEATURES       from  to/span     description
    pept       1407     2510     LIV-BP precursor (livJ)
    sigp       1407     1475     LIV-BP signal peptide
    matp       1476     2507     LIV-BP mature protein
    pept       3503     4612     LS-BP precursor (livK)
    sigp       3503     3571     LS-BP signal peptide
    matp       3572     4609     LS-BP mature peptide
    pept       4660     5586     leucine-specific binding protein (livH)
    pept       5583     6857     livM product
    pept       6854     7621     livG product
    pept       7623     8336     livF product
    mRNA       1303  >  4612     livJ mRNA [2] /nomgen="livJ"
    ORF        3078     2566 (c) ORF19
    revision    101      102     ct in [2]; cgt in [1]
    revision    270      270     a in [2]; g in [1]
    revision    280      280     a in [2]; g in [1]
    revision    284      284     a in [2]; g in [1]
    revision    304      304     a in [2]; g in [1]
    revision    310      310     a in [2]; g in [1]
    revision    322      324     caa in [2]; agg in [1]
    revision    328      328     g in [2]; a in [1]
    revision   1453     1453     c in [5]; g in [3]
    revision   3832     3832     t in [5]; c in [3]
BASE COUNT     2039 a   2176 c   2425 g   2062 t      1 others
ORIGIN      76 min on the K12 map.
        1 ctgcacggat caacattacg ccacttacgc ctgaataata aaagcgtgta tactctttcc
       61 tgcaatgggt tccgtagcag ggaaagagac cccgttgtct cttcccggta tttcatctct
      121 atgtcacatt ttgtgcgtaa tttattcaca agcttgcatt gaacttgtgg ataaaatcac
      181 ggtctgataa aacagtgaat gataacctcg ttgctcttaa gctctggcac agttgttgct
      241 accactgaag cgccagaaga tatcgattga gaggatttga atgactgaca aaatgcaaag
      301 tttagcttta gccccagttg gcaacctgga ttcctacatc cgggcagcta acgcgtggcc
      361 gatgttgtcg gctgacgagg agcgggcgct ggctgaaaag ctgcattacc atggcgatct
      421 ggaagcagct aaaacgctga tcctgtctca cctgcggttt gttgttcata ttgctcgtaa
      481 ttatgcgggc tatggcctgc cacaggcgga tttgattcag gaaggtaaca tcggcctgat
      541 gaaagcagtg cgccgtttca acccggaagt gggtgtgcgc ctggtctcct tcgccgttca
      601 ctggatcaaa gcagagatcc acgaatacgt tctgcgtaac tggcgtatcg tcaaagttgc
      661 gaccaccaaa gcgcagcgca aactgttctt caacctgcgt aaaaccaagc agcgtctggg
      721 ctggtttaac caggatgaag tcgaaatggt ggcccgtgaa ctgggcgtaa ccagcaaaga
      781 cgtacgtgag atggaatcac gtatggcggc acaggacatg acctttgacc tgtcttccga
      841 cgacgattcc gacagccagc cgatggctcc ggtgctctat ctgcaggata aatcatctaa
      901 ctttgccgac ggcattgaag atgataactg ggaagagcag gcggcaaacc gtctgaccga
      961 cgcgatgcag ggtctggacg aacgcagcca ggacatcatc cgtgcgcgct ggctggacga
     1021 agacaacaag tccacgttgc aggaactggc tgaccgttac ggcgtttccg ctgagcgtgt
     1081 acgccagctg gaaaagaacg cgatgaaaaa attgcgtgct gccattgaag cgtaatttcc
     1141 gctattaagc agagaaccct agatgagagt ccggggtttt tgttttttgg gcctctgtaa
     1201 taatcaattt cccctccggc aaaacgccaa tccccacgca gattgttaat aaactgtcaa
     1261 aatagctatt ccaatatcat aaaaatcggg atatgtttta gcagagtatg ctgctaaagc
     1321 acgggtagtc atgcataaaa cgaaataaag tgctgaaaaa caacatcaca acacacgtaa
     1381 taaccagaag aatggggatt ctcaggatga acacaaaggg caaagcgtta ctggcaggat
     1441 tgatcgcgct ggcattcagc aatatggctc tggcagaaga tattaaagtc gcggtcgtgg
     1501 gcgcaatgtc cggtccggtt gcgcagtacg gtgaccagga gtttaccggc gcagagcagg
     1561 cggttgcgga tatcaacgct aaaggcggca ttaaaggcaa caaactgcaa atcgcaaaat
     1621 atgacgatgc ctgtgatccg aaacaggcgg ttgcggtggc gaacaaagtc gttaacgacg
     1681 gcattaaata tgtgattggt cacctctgtt cctcatcaac gcagcctgcg tcggatatct
     1741 acgaagacga aggcattttg atgatcaccc cagcggcaac cgcgccggag ctgaccgccc
     1801 gtggctatca gctgatcctg cgaaccaccg gcctggattc cgaccaaggg ccgacggctg
     1861 ccaaatatat tcttgagaaa gtgaaaccgc agcgtattgc tatcgttcac gacaaacagc
     1921 aatacggcga aggtctggcg cgagcggtgc aggacggcct gaagaaaggc aatgcaaacg
     1981 tggtgttctt tgatggcatc accgccgggg aaaaagattt ctcaacgctg gtggcgcgtc
     2041 tgaaaaaaga gaatatcgac ttcgtttact acggcggtta tcacccggaa atggggcaaa
     2101 tcctgcgtca ggcacgcgcg gcagggctga aaactcagtt tatggggccg gaaggtgtgg
     2161 ctaacgtttc gctgtctaac attgcgggcg aatcagcgga agggctactg gtgaccaaac
     2221 cgaagaacta cgatcaggtt ccggcgaaca aacccattgt tgacgcgatc aaagcgaaaa
     2281 aacaggaccc aagtggcgca ttcgtttgga ccacctacgc cgcgctgcaa tctttgcagg
     2341 cgggcctcaa tcagtctgac gatccggctg aaatcgccaa atacctgaaa gcgaactccg
     2401 tggataccgt aatgggcccg ctgacctggg atgagaaagg cgatctgaaa ggctttgagt
     2461 tcggcgtatt tgactggcac gccaacggca cggccaccga tgcgaagtaa tcattaatcg
     2521 gcaactttgg gttgccgcca aattgctaat atcgagtacg ttgcttcatg ccggatgcgg
     2581 cgtaaacgcc ttatccggcc tacaagatcc aaagaaatca gtaaattgca acacacattg
     2641 taggcctgat aagcgtagcg catcaggcaa tacacttttg aaatcggact tgacgattaa
     2701 cacttctccc agccgccctg ttgtgccgta aaccccagcg cctgcataaa cgccgtcatc
     2761 acaccgcgat cttccacgcc gcagccgcca tccaccagca tgaaacgcca agattgttac
     2821 gcaaaacctc ttccagcaga tattgcccca ccgcgacggc gggtgacttc ccgcacgcgc
     2881 agggaatcca gtgctccctc ggtgccgctt aaggttgccc gcgcggcgcg agcaggcgct
     2941 cgttaaacgc gcggcgtaga tacggtggtt atcgtcaacc tgtaacgagg aaggggaata
     3001 ctcncggcca agatcttttg cgaggtcaat ccggtcttgg tcgctaaatt tttctaatcg
     3061 aatgatggtc agcttcatgg gtaacccgtg taaatcacaa aagtgtaacc agtgtagcga
     3121 aataatttaa tcggaggctt tctctttttt atttcttttg gcaggtgatt aattttttaa
     3181 cagcaataat tacaaaatta aaacattaga gaatgaaaaa tgtccagcat aatcccctga
     3241 atgatagtga attattccgc ccctttgtgc cgttatttta tgctgacaaa ggcacttttt
     3301 tctgtttgtc tatcaataaa ttcggaatat tatctgttct taatcgactg aaaaatgggg
     3361 attttaatcg ctattatcac aaaatactgc gctaacccct taatcagaca ggcaaaaaca
     3421 gtgcagtata aaaaaagaac agtctgattt gttaacacat aaaaacaaag caacacaaca
     3481 tcacgaatgg ggatttttga ctatgaaacg gaatgcgaaa actatcatcg cagggatgat
     3541 tgcactggca atttcacaca ccgctatggc tgacgatatt aaagtcgccg ttgtcggcgc
     3601 gatgtccggc ccgattgccc agtggggcat aatggaattt aacggcgcgg agcaggcgat
     3661 taaagacatt aatgccaaag ggggaattaa gggcgataaa ctggttggcg tggaatatga
     3721 cgacgcatgc gacccgaaac aagccgttgc ggtcgccaac aaaatcgtta atgacggcat
     3781 taaatacgtt attggtcatc tgtgttcttc ttctacccag cctgcgtcag atatctatga
     3841 agacgaaggt attctaatga tctcgccggg agcgaccgcg ccggaactaa cccaacgcgg
     3901 ttatcaacac attatgcgta ctgccgggct ggactcttcc caggggccaa cggcggcaaa
     3961 atacattctt gagacggtga agccccagcg catcgccatc atccacgaca aacaacagta
     4021 tggcgaaggg ctggcgcgtt cggtgcagga cgggctgaaa gcggctaacg ccaacgtcgt
     4081 cttcttcgat ggtattaccg ccggggagaa agatttctcc gcgctgatcg cccgcctgaa
     4141 aaaagaaaac atcgacttcg tttactacgg cggttactac ccggaaatgg ggcagatgct
     4201 gcgccaggcc cgttccgttg gcctgaaaac ccagtttatg gggccggaag gtgtgggtaa
     4261 tgcgtcgttg tcgaacattg ccggtgatgc cgccgaaggc atgttggtca ctatgccaaa
     4321 acgctatgac caggatccgg caaaccaggg catcgttgat gcgctgaaag cagacaagaa
     4381 agatccgtcc gggccttatg tctggatcac ctacgcggcg gtgcaatctc tggcgactgc
     4441 ccttgagcgt accggcagcg atgagccgct ggcgctggtg aaagatttaa aagctaacgg
     4501 tgcaaacacc gtgattgggc cgctgaactg ggatgaaaaa ggcgatctta agggatttga
     4561 ttttggtgtg ttccagtggc acgccgacgg ttcatccacg gcagccaagt gatcatccca
     4621 ccgcccgtaa aatgcgggcg ggtttagaaa ggttacctta tgtctgagca gtttttgtat
     4681 ttcttgcagc agatgtttaa cggcgtcacg ctgggcagta cctacgcgct gatagccatc
     4741 ggctacacca tggtttacgg cattatcggc atgatcaact tcgcccacgg cgaggtttat
     4801 atgattggca gctacgtctc atttatgatc atcgccgcgc tgatgatgat gggcattgat
     4861 accggctggc tgctggtagc cgcgggattc gtcggcgcaa tcgtcattgc cagcgcctac
     4921 ggctggagta tcgaacgggt ggcttaccgc ccggtgcgta actctaagcg cctgattgca
     4981 ctcatctctg caatcggtat gtccatcttc ctgcaaaact acgtcagcct gaccgaaggt
     5041 tcgcgcgacg tggcgctgcc gagcctgttt aacggtcagt gggtggtggg gcatagcgaa
     5101 aacttctctg cctctattac caccatgcag gcggtgatct ggattgttac cttcctcgcc
     5161 atgctggcgc tgacgatttt cattcgctat tcccgcatgg gtcgcgcgtg tcgtgcctgc
     5221 gcggaagatc tgaaaatggc gagtctgctt ggcattaaca ccgaccgggt gattgcgctg
     5281 acctttgtga ttggcgcggc gatggcggcg gtggcgggtg tgctgctcgg tcagttctac
     5341 ggcgtcatta acccctacat cggctttatg gccgggatga aagcctttac cgcggcggtg
     5401 ctcggtggga ttggcggcat tccgggggcg atgattggcg gcctgattct ggggattgcg
     5461 gaggcgctct cttctgccta tctgagtacg gaatataaag atgtggtctc attcgccctg
     5521 ccgattctgg tgctgctggt gatgccgacc ggtattctgg gtcgcccgga ggtagagaaa
     5581 gtatgaaacc gatgcatatt gcaatggcgc tgctctctgc cgcgatgttc tttgtgctgg
     5641 cgggcgtctt tatgggcgtg caactggagc tggatggcac caaactggtg gtcgacacgg
     5701 cttcggatgt ccgttggcag tgggtgttta tcggcacggc ggtggtcttt ttcttccagc
     5761 ttttgcgacc ggctttccag aaagggttga aaagcgtttc cggaccgaag tttattctgc
     5821 ccgccattga tggctccacg gtgaagcaga aactgttcct cgtggcgctg ttggtgcttg
     5881 cggtggcgtg gccgtttatg gtttcacgcg ggacggtgga tattgccacc ctgaccatga
     5941 tctacattat cctcggtctc gggctgaacg tggttgttgg tctttctggt ctgctggtgc
     6001 tggggtacgg cggtttttac gccatcggct tacacttttg cgctgctcaa tcactattac
     6061 ggcttgggct tctggacctg cctgccgatt gctggattaa tggcagcggc ggcggcttcc
     6121 tgctcggttt tccggtgctg cgtttgcgcg gtgactatct ggcgatcgtt accctcggtt
     6181 tcggcgaaat tgtgcgcata ttgctgctca ataacaccga aattaccggc ggcccgaacg
     6241 gaatcagtca gatcccgaaa ccgacactct tcggactcga gttcagccgt accgctcgtg
     6301 aaggcggctg ggacacgttc agtaatttct ttggcctgaa atacgatccc tccgatcgtg
     6361 tcatcttcct ctacctggtg gcgttgctgc tggtggtgct aagcctgttt gtcattaacc
     6421 gcctgctgcg gatgccgctg gggcgtgcgt gggaagcgtt gggtgaagat gaaatcgcct
     6481 gccgttcgct gggcttaagc ccgcgtcgta tcaagctgac tgcctttacc ataagtgccg
     6541 cgtttgccgg ttttgccgga acgctgtttg cggcgcgtca gggctttgtc agcccggaat
     6601 ccttcacctt tgccgaatcg gcgtttgtgc tggcgatagt ggtgctcggc ggtatgggct
     6661 cgcaatttgc ggtgattctg gcggcaattt tgctggtggt gtcgcgcgag ttgatgcgtg
     6721 atttcaacga atacagcatg ttaatgctcg gtggtttgat ggtgctgatg atgatctggc
     6781 gtccgcaggg cttgctgccc atgacgcgcc ggcaactgaa gctgaaaaac ggcgcagcga
     6841 aaggagagca ggcatgagtc agccattatt atctgttaac ggcctgatga tgcgcttcgg
     6901 cggcctgctg gcggtgaaca acgtcaatct tgaactgtac ccgcaggaga tcgtctcgtt
     6961 aatcggccct aacggtgccg gaaaaaccac ggtttttaac tgtctgaccg gattctacaa
     7021 acccaccggc ggcaccattt tactgcgcga tcagcacctg gaaggtttac cggggcagca
     7081 aattgcccgc atgggcgtgg tgcgcacctt ccagcatgtg cgtctgttcc gtgaaatgac
     7141 ggtaattgaa aacctgctgg tggcgcagca tcagcaactg aaaaccgggc tgttctctgg
     7201 cctgttgaaa acgccatcct tccgtcgcgc ccagagcgaa cggctcgacc gcgccgcgac
     7261 ctggcttgag cgcattggtt tgctggaaca cgccaaccgt caggcgagta acctggccta
     7321 tggtgaccag cgccgtcttg agattgcccg ctgcatggtg acgcagccgg agattttaat
     7381 gctcgacgaa cctgcggcag gtcttaaccc gaaagagacg aaagagctgg atgagctgat
     7441 tgccgaactg cgtaatcatc acaacaccac tatcttgttg attgaacacg atatgaagct
     7501 ggtgatggga atttcggacc gaatttacgt ggtcaatcag gggacgccgc tggcaaacgg
     7561 tagcccggag cagatccgta ataacccgga cgtgatccgt gcctatttag gtgaggcata
     7621 agatggaaaa agtcatgttg tcctttgaca aagtcagcgc ccactacggc aaaatccagg
     7681 cgctgcatga ggtgagcctg catatcaatc agggcgagat tgtcacgctg attggcgcga
     7741 acggggcggg gaaaaccacc ttgctcggca cgttatgcgg cgatcccggt gccaccagcg
     7801 ggcgaattgt gtttgatgat aaagacatta ccgactggca gacagcgaaa atcatgcgcg
     7861 aagcggtggc gattgtcccg gaagggcgtc gcgtcttctc gcggatgacg gtggaagaga
     7921 acctggcgat gggcggtttt tttgctgaac gcgaccagtt ccaggagcgc ataaagtggg
     7981 cgtatgagct gtttccacgt ctgcatgagc gccgtattca gcgggcgggc accatgtccg
     8041 gcggtgaaca gcagatgctg gcgattggtc gtgcgctgat gagcaacccg cgtttgctac
     8101 tgcttgatga gccatcgctc ggtcttgcgc cgattatcat ccagcaaatt ttcgacacca
     8161 tcgagcagct gcgcgagcag gggatgacta tctttctcgt cgagcagaac gccaaccagg
     8221 ggctaaagct ggcggatcgc ggctacgtgc tggaaaacgg ccatgtagtg ctttccgata
     8281 ctggtgatgc gctgctggcg aatgaagcgg tgagaagtgc gtatttaggc gggtaataac
     8341 acgttgattg atagggagtc aaaagactcc tttgagacag gtgacaaatg taaaattgcc
     8401 tgatgcgctg cgcttatcag gcctactggg tgagtggcaa tatgttgaat ttgcacgatc
     8461 ttgtaggcct gataagcgtt taccgcgcat ccggcatgaa acgatgagca atctgtagag
     8521 tttgattcag accttctata ttttcccgct tatccgtgcc ccatctccca ttttccctca
     8581 cccacgccgt caccgccttg tcatctttct gacaccttac tatcttacaa atgtaacaaa
     8641 aaagttattt ttctgtaatt cgagcatgtc atgttacccc gcgagcataa aacgcgtgaa
     8701 ttc
//
LOCUS       BOVGOA        472 bp ss-mRNA            MAM       09-AUG-1990
DEFINITION  B.taurus go-alpha mRNA, 3' end.
ACCESSION   J02900
KEYWORDS    go-alpha.
SOURCE      B.taurus retina, cDNA to mRNA, clone GO3.1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (sites)
  AUTHORS   Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M.,
            Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M.
  TITLE     Multiple forms of go-alpha mRNA: Analysis of the 3'-untranslated
            regions
  JOURNAL   Biochemistry 29, 5069-5076 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 472)
  AUTHORS   Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M.,
            Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by S.R.Price, 12-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1        3     go-alpha (AA at 1)
BASE COUNT      130 a    133 c     88 g    121 t
ORIGIN      
        1 tgacctcttg tcctgtatag caacctattt ggtaatgatt ccagcactca cagaaaagct
       61 tgcacacata cacacacacc ccacccctcc ccactaacaa atgcaagttg gtaaacaaat
      121 tccaaaaagg cataacaaac cttatatata tagacaaata tatattaaag ttttttagtc
      181 tgtactagaa agagcttcag acagaactga ccaccattcc attgctcatc aatttcctgg
      241 gacagcacct gagcgtgcgc ttacgcgcgt acacacacat agacacgcac tgcgatacaa
      301 gtcctgattt gggagtccgt ccttttaaaa acagccacat gctttcacgc tctgagaccc
      361 acccgtttct gtgagcaggg ggagggcaag gaaagccctg gcctcagtcc agccttttct
      421 ctgcttccac ctgctcaggc tgtgtgctct tggttctgtc ctgcacttgt gt
//
LOCUS       CAJCAT       1334 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  C.coli plasmid C-589 chloramphenicol acetyltransferase (cat) gene,
            complete cds.
ACCESSION   M35190
KEYWORDS    chloramphenicol acetyltransferase.
SOURCE      C.coli plasmid C-589 DNA.
  ORGANISM  Campylobacter coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 1 to 1334)
  AUTHORS   Wang,Y. and Taylor,D.E.
  TITLE     Chloramphenicol resistance in Campylobacter coli, nucleotide
            sequence, expression and cloning vector construction
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.E.Taylor, 15-JUN-1990.
FEATURES       from  to/span     description
    pept        309      932     chloramphenicol acetyltransferase (cat)
    mRNA        277  >   932     chloramphenicol acetyltransferase mRNA
    signal      242      271     promoter
    binding     297      301     ribosome binding site
    signal      960     1006     transcriptional termination signal
BASE COUNT      433 a    232 c    282 g    387 t
ORIGIN      
        1 attcccacaa cgccggaaac aagccgtgcc acgagcttat aataaaagag ggaagagaag
       61 cgtatttttc ctcacttccg gtgaaggata tcgagaaaaa tctaaatgat aacggaattc
      121 cgtcgtcggt atcgtatgga gcggacaacg agtaaaagag tgaccgccga gataacccat
      181 tgctcggcgg tgttcctttc caagttaatt gcgtgatata gattgaaaag tggatagatt
      241 tatgatatag tggatagatt tatgatataa tgagttatca acaaatcgga atttacggag
      301 gataaatgat gcaattcaca aagattgata taaataattg gacacgaaaa gagtatttcg
      361 accactattt tggcaatacg ccctgcacat atagtatgac ggtaaaactc gatatttcta
      421 agttgaaaaa ggatggaaaa aagttatacc caactctttt atatggagtt acaacgatca
      481 tcaatcgaca tgaagagttc aggaccgcat tagatgaaaa cggacaggta ggcgtttttt
      541 cagaaatgct gccttgctac acagtttttc ataaggaaac tgaaaccttt tcgagtattt
      601 ggactgagtt tacagcagac tatactgagt ttcttcagaa ctatcaaaag gatatagacg
      661 cttttggtga acgaatggga atgtccgcaa agcctaatcc tccggaaaac actttccctg
      721 tttctatgat accgtggaca agctttgaag gctttaactt aaatctaaaa aaaggatatg
      781 actatctact gccgatattt acgtttggga agtattatga ggagggcgga aaatactata
      841 ttcccttatc gattcaagtg catcatgccg tttgtgacgg ctttcatgtt tgccgttttt
      901 tggatgaatt acaagacttg ctgaataaat aaaatcccag tttgtcgcac tgataaaaac
      961 cctttaggaa ctaaagggcg cacttctata ctctctgtcg agagtagtgc gtcctgcgga
     1021 gcttcattcc cggtcagcgc gcttatcaat atatctatag aatgggcaaa gcataaaaac
     1081 ttgcatggac taatgcttga aacccaggac aataacctta tagcttgtaa attctatcat
     1141 aattgtggtt tcaaaatcgg ctccgtcgat actatgttat acgccaactt tgaaaacaac
     1201 tttgaaaaag ctgttttctg gtatttaagg ttttagaatg caaggaacag tgaattggag
     1261 ttcgtcttgt tattaattag cttcttgggg tatctttaaa tactgtagaa agaggaagga
     1321 aataataaat ggct
//
LOCUS       CLOCBA       5120 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  C.acetobutylicum beta-D-galactosidase (cbgA) and
            beta-D-galactosidase regulatory protein (cbgR) genes, complete cds.
ACCESSION   M35107
KEYWORDS    beta-D-galactosidase; beta-D-galactosidase regulatory protein.
SOURCE      C.acetobutylicum (strain NCIB2951) DNA.
  ORGANISM  Clostridium acetobutylicum
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1260 to 5120)
  AUTHORS   Hancock,K.R., Rockman,E., Pearce,L., Maddox,I.S. and Scott,D.B.
  TITLE     Clostridium acetobutylicum beta-galactosidase gene, cbgA, is
            positively regulated in Escherichia coli by a novel regulatory
            gene, cbgR
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 5120)
  AUTHORS   Scott,D.B., Hancock,K.R., Pearce,L. and Maddox,I.S.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by D.B.Scott, 11-JUN-1990.
            
               Author address:D.B.Scott:
               Molecular Genetics Unit
               Department of Microbiology and Genetics
               Massey University
               Palmerston North, New Zealand
               E-mail:D.B.Scott@massey.ac.nz
FEATURES       from  to/span     description
    pept       1560     4253     beta-D-galactosidase (cbgA)
    pept       4500     4805     beta-D-galactosidase regulatory protein (cbgR)
BASE COUNT     1921 a    683 c    876 g   1640 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattccttt tcatatatat ctttaatatt tctactggaa tagaagaggt tgctcaatac
       61 aaaaaatgct tctttaaaac tatttgaaac tacttctgaa atattttcta gcttactaaa
      121 tagagaatta taatttttat catcaaaatt tagaattaca actatgattt cgttttcaat
      181 attagcaatt tgtatattat aattgctatt taatccgtct aaagaaaatt ctttgccgat
      241 ttctgaaatt gtaaaatcaa taatttcatg gcgtttgcta taattatcat atatttcttt
      301 gcgtttaaac caaataagca aaatgattga aaagtaaata tgtatcaaag tagttaaagt
      361 caggatcatg tcaaaacctg atataaggcg atttaaggcg ctattagtga gacttaaaga
      421 gtttccttct aaagtatttc ttttcatttt tattgaaatc ttttttagag tacttaataa
      481 ctcagaagga tttagagaag gttttaaaat ataatcaaca gcaccatttt gaaaagatga
      541 tttaacatat tcaaaatcgc tataactact taagatgata attcttatct taggatattt
      601 gtcctgcaca aatttagcta attcaacccc atttatttgg ggcattacaa catcagaaat
      661 tataatgtca ggaatatcct tttttatcat ttccagagct tcttgaccat tagaagcctg
      721 tcctataatt tgaaagcctt ctttttccca atcaatcata tgagttatgc cttgccgcat
      781 aataaattca tcatcaacaa ctaaattttt actatattcg ttcaatagta tagcacccct
      841 tattctaaaa ttaccacaac atagataaat attgcttaat actattatac cttatagatt
      901 tattgtatgt atctgtatac gttacgttaa ttcatctaca aatttatatg agttttggtt
      961 gcacttttag agaaaatctt tttgtctatg gtcttattgt cctataatgg tcaaatcatc
     1021 tttaccaaag tctcttgatt taaagagata aaaacaccac tgatccatta ttcctcattt
     1081 tggtaatgaa cctatgcggt tgaagatatt aatcagatgt ctaaatactt tagaaaaaaa
     1141 gacctttact aatatcttca atatttacac ccctattcta aaattaccac aagatagata
     1201 aatattgctt aatactgatt ataccttata gattaaaggt tttcaattaa acaataaatt
     1261 actttagtaa agtttagtaa aatataattg attttttact aaaaagataa taaaatgaaa
     1321 ctataaattt agttaatagc ataaatctaa catcagaaga taggataaat taaagaagta
     1381 atgtaattga ttacgaaaca aaatctcata ttaatattag cccataattt ttttattctc
     1441 atatatgttt aagtattaat taaatgtgac tttataaaaa ggttgcattt agttaatacg
     1501 attaacaact ttaatttaaa aaagcaataa ctctacaaag tgaaagtgag ggggtaagta
     1561 tgattaataa taaaccgtca ttagattggc tagaaaatcc ggaaatattt agagttaata
     1621 gaatagatgc tcattctgat acttggtttt atgaaaaatt tgaggatgtt aaattagaag
     1681 acaccatgcc tcttaagcaa aatttaaatg gaaaatggag attttcatat agtgaaaatt
     1741 catcattaag aattaaagag ttttataagg atgagtttga cgtaagttgg attgattata
     1801 ttgaagttcc aggtcatatt cagcttcaag gatatgataa atgtcaatat attaatacta
     1861 tgtatccttg ggaaggtcac gatgaattaa gaccacctca tatttcaaaa acatataatc
     1921 cggtgggaag ctatgtaaca ttttttgaag ttaaagatga actcaaaaat aagcagactt
     1981 ttatttcttt tcaaggtgtt gaaacagcat tttacgtatg ggtaaatgga gaatttgtag
     2041 gatatagcga agatacattt acaccatcag aatttgatat tactgattat ttaagagagg
     2101 gagaaaataa acttgcagtt gaggtttata aaaggagtag cgcaagttgg atagaagatc
     2161 aagatttctg gagattttca ggcatcttta gagatgtata tttatatgca gttccagaaa
     2221 ctcatgtaaa tgatatattt ataaaaacag atttatatga cgatttcaaa aacgcaaagt
     2281 taaatgctga acttaaaatg attggaaatt cagaaacaac agttgaaaca tatttagaag
     2341 ataaagaagg aaataaaata gctatatctg aaaagattcc gttctctgat gagttgactt
     2401 tatatttaga tgcgcaaaat ataaacctat ggagtgcaga agagcctaac ttatatacac
     2461 tttatatttt agtgaataaa aaagatggta atttaattga ggttgtaact caaaagatag
     2521 ggtttaggca ctttgaaatg aaggataaaa ttatgtgtct aaaatggaaa cgtattatct
     2581 ttaaaggcgt aaaccgtcac gaatttagcg caagacgtgg acgctcaatt acgaaagagg
     2641 acatgttgtg ggatattaag ttcttgaaac aacacaatat taatgctgtt agaacatcac
     2701 attatccaaa tcaaagttta tggtacagac tttgcgatga atacgggatt tatttaatag
     2761 atgaaacaaa tttagaaagc catggttcat ggcaaaagat ggggcagatt gaaccatcat
     2821 ggaatgtgcc aggaagtctt ccacagtggc aggcagcagt tttagatcga gcatcatcaa
     2881 tggttgaaag agataaaaat catccatctg tacttatttg gtcatgtggt aatgaatcct
     2941 atgcgggtga agatatttat cagatgtcta aatactttag aaaaaaagat ccttcacgtt
     3001 tagtgcacta tgaaggggta actagatgca gagaatttat gacacgacga catgaaagta
     3061 gaatgtatgc aaaggcagca gaaatagaag aatatcttaa tgataatccg aagaaacctt
     3121 atatacagct gcgatacatg cactcaatgg gtaactcaac tggtggaatg atgaaataca
     3181 cagaacttga agataaatat ttgatgtatc aaggtggatt catttgggat tacggcgatc
     3241 aggcgttgta tagaaaactt ccagatggaa aagaagttct agcttatgga ggagacttta
     3301 cagatcgtcc aacagactat aatttctctg gaaatggttt gatttatgca gatagaacta
     3361 tatcacctaa agcacaggaa gttaagtatc tatatcaaaa cgtaaaatta gaaccagatg
     3421 aaaaaggggt gactattaag aatcaaaatc tttttgttaa tactgataaa tatgatttat
     3481 actatatcgt tgaaagagat ggaaaactaa taaaagatgg ttatctaaat gtatctgtag
     3541 ctccagatga agaaaaatat atagaacttc caataggaaa ttacaatttt cctgaagaaa
     3601 ttgtacttac aacctcatta agattagcac aagctacact ttgggcagaa aaaggatatg
     3661 aaatagcatt tggacaaaag gttattaaag aaaaatcaga tatgaataat cataattcag
     3721 agtctaaaat gaagatcatt catggagatg taaacatagg ggttcacgga aaagatttca
     3781 aggctatatt ctctaaacaa gagggaggaa tcgtatcctt gagatataat aataaggagt
     3841 ttataacgag aacgccaaaa actttctatt ggagagcaac aacagataat gatagaggaa
     3901 atagacatga atttagatgc agtcaatggc tggctgctac tatggggcag aagtatgtgg
     3961 atttttcagt tgaggaattt gatgagaaga ttacattata ttatacttat caattgccaa
     4021 cagtgccatc tactaatgtt aagataactt atgaagtatc tggagaagga ataattaaag
     4081 taaatgttaa gtataaagga gttagcggat tacctgaatt gcctgtacta ggaatggatt
     4141 ttaaattatt agccgaattt aattcattta gctggtatgg aatggggcca gaagaaaact
     4201 atatagacag atgtgaaggt gcaaaacttg gaatatatga gagtacacaa tagaaaatct
     4261 atcaaggtat ttagtaccac aagaatgtgg taacaggata ggaactagat gggtagtagt
     4321 taaaaatcat aagaatgaag gtcttaaatt tacttatgtt aaagttccat ttgaatttag
     4381 tgttttacca tacagcagca tggaattaga aaattcactt catatagaag aattaccatc
     4441 tgttaatttt acacattgtg aatataatag gtaaacaaat gggtgttggc ggagatgcaa
     4501 tgctggggag caccatgata cctaaattct gtatagattc aagtaaggat ttagaatata
     4561 gttttataat ttctaaaatt atactacgca catatgggaa ctatagatat ccaaaacaaa
     4621 acttagactt atgcaataat ttacgaaagg acaggtactc tgttgtttcg gttactaaga
     4681 ataagttgag gctttctaac atcataagtt gcaccatttc agcatgctcc cgagacaagc
     4741 tcgtgacaag caaaaatgga acaacttatg atgaagaaat gcctgcaaca tattctttaa
     4801 tgtaacactg cacaaaagag tacctgtcct ttctgatata gcagattttt caagctataa
     4861 gtatatctca cgaaatcata aatattttga ttccgaaaag ctatgaaaat atcgctgaag
     4921 gttctaagca gctggttgtg tgcaccttag catgctccaa ctttcagttt gacaagctaa
     4981 aatggaacaa tctacagctc aagaaacttt aacagctcat tttcaaatgt tttctacaca
     5041 aatatattta tatttctagt gaagatatga aattaaattt ttagcgactt tgtaaatatg
     5101 ttaatctaat atacgaattc
//
LOCUS       ECOPNCB      1490 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  E.coli nicotinic acid phosphoribosyl transferase (pncB) gene,
            complete cds.
ACCESSION   J05568
KEYWORDS    nicotinic acid phosphoribosyl transferase.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1490)
  AUTHORS   Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J.,
            Meesters,H.A.R. and Witholt,B.
  TITLE     Variation of cofactor levels in Escherichia coli: Sequence analysis
            and expression of the pncB gene encoding nicotinic acid
            phosphoribosyl transferase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by P.Terpstra, 31-MAY-1990.
            
            Nicotinic acid phosphoribosyl transferase is the first enzyme of
            the three enzyme Preiss-Handler pathway leading to the synthesis of
            NAD.  The protein sequence shows similarity to orotate
            phosphoribosyl transferase (pyr5) from Dictyostelium discoideum
            (acc P09556, SWISS-PROT)
FEATURES       from  to/span     description
    pept        216     1418     nicotinic acid phosphoribosyl transferase
                                 (pncB) (EC 2.4.2.11)
    mRNA        158     1450     nicotinic acid phosphoribosyl transferase mRNA
                                 (3' end put.)
    signal      124      129     -35 region
    signal      146      151     -10 region
    rpt         170      185     inverted repeat
    binding     197      202     ribosome binding site
    signal     1426     1450     rho-independent transcription termination
                                 signal
BASE COUNT      348 a    374 c    364 g    404 t
ORIGIN      
        1 tgttgcgtaa tgcgtatgca gaatcttcat cttttcaggt acaaacgcct ttattgctac
       61 atttttataa catacagcgc gtaatgccat cgaccagaaa ggtggcatat ggtgtgatcg
      121 gggttcaata aattgcgaaa caaggtatac tccagcagtt cctgaagatg tttattgtac
      181 taaacgctcc tgtacgagga cgctactgcg cacctatgac acaattcgct tctcctgttc
      241 tgcactcgtt gctggataca gatgcttata agttgcatat gcagcaagcc gtgtttcatc
      301 actattacga tgtgcatgtc gcggcggagt ttcgttgccg aggtgacgat ctgctgggta
      361 tttatgccga tgctattcgt gaacaggttc aggcgatgca gcacctgcgc ctgcaggatg
      421 atgaatatca gtggctttct gccctgcctt tctttaaggc cgactatctt aactggttac
      481 gcgagttccg ctttaacccg gaacaagtca ccgtgtccaa cgataatggc aagctggata
      541 ttcgtttaag cggcccgtgg cgtgaagtca tcctctggga agttcctttg ctggcggtta
      601 tcagtgaaat ggtacatcgc tatcgctcac cgcaggccga cgttgcgcaa gccctcgaca
      661 cgctggaaag caaattagtc gacttctcgg cgttaaccgc cggtcttgat atgtcgcgct
      721 tccatctgat ggattttggc acccgtcgcc gtttttctcg cgaagtacaa gaaaccatcg
      781 ttaagcgtct gcaacaggaa tcctggtttg tgggcaccag caactacgat ctggcgcgtc
      841 ggctttccct cacgccgatg ggaacacagg cacacgaatg gttccaggca catcagcaaa
      901 tcagcccgga tctagccaac agccagcgag ctgcacttgc tgcctggctg gaagagtatc
      961 ccgaccaact tggcattgca ttaaccgact gcatcactat ggatgctttc ctgcgtgatt
     1021 tcggtgtcga gttcgctagt cggtatcagg gcctgcgtca tgactctggc gacccggttg
     1081 aatggggtga aaaagccatt gcacattatg aaaagctggg aattgatcca cagagtaaaa
     1141 cgctggtttt ctctgacaat ctggatttac gcaaagcggt tgagctatac cgccacttct
     1201 cttcccgcgt gcaattaagt tttggtattg ggactcgcct gacctgcgat atcccccagg
     1261 taaaacccct gaatattgtc attaagttgg tagagtgtaa cggtaaaccg gtggcgaaac
     1321 tttctgacag ccctggcaaa actatctgcc atgataaagc gtttgttcgg gcgctgcgca
     1381 aagcgttcga ccttccgcat attaaaaaag ccagttaata tcatcaggga gctaatcggc
     1441 tccctttttt tacctttaat tccgaaatct ttcgctgcat ttgcgaattc
//
LOCUS       NEUCCON13    2728 bp ds-DNA             PLN       09-AUG-1990
DEFINITION  N.crassa conidiation-specific protein (con-13) gene, complete cds.
ACCESSION   M35120
KEYWORDS    conidiation-specific protein.
SOURCE      N.crassa (strain 74-OR23-1A) DNA, clone pCon10a.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 2728)
  AUTHORS   Hager,K.M. and Yanofsky,C.
  TITLE     Genes expressed during conidiation in Neurospora crassa: Molecular
            characterization of con-13
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.M.Hager, 12-JUN-1990.
            
               Author address:K.M.Hager:
               Dept. of Physiology
               UCLA Medical School
               10833 Le Conte Avenue
               Los Angeles, CA 90024-1751
               E-mail:COTRAN%VXBIO.SPAN@STAR.STANFORD.EDU
FEATURES       from  to/span     description
    pept       1009     1275     conidiation-specific protein (con-13), exon 1
               1333     1847     conidiation-specific protein, exon 2
               1910     2150     conidiation-specific protein, exon 3
    pre-msg     922     2367     con-13 mRNA and introns (alt.)
    pre-msg     927     2367     con-13 mRNA and introns (alt.)
    pre-msg     936     2367     con-13 mRNA and intron (alt.)
    pre-msg     946     2367     con-13 mRNA and intron (alt.)
    IVS        1276     1332     con-13 intron A
    IVS        1848     1909     con-13 intron B
    site       2364     2367     polyadenylation site
BASE COUNT      653 a    695 c    720 g    660 t
ORIGIN      Linkage group IV.
        1 gatctcatca tctgaaacgc cgcctgagtc aatgactctt ggcaatcggg ctctgcgtcc
       61 ggctagatag acagcgtccc actgatacag acttggtaag ctgccacagt tgccaagttt
      121 ttatatcgat tattctttga acttccaagg acagtcttca agggcgcttt ctgtctcagc
      181 atcgggagat atgacgcccg tggttcgtat accaatggtt cggcactaag gcgctgcatt
      241 tgactcggag atattgacgc ctgccccctt ttgagaggag actgagtgag cgaggcccaa
      301 tactatcacc acagttgcgg ttagctgccg agacttatcg gtcaacaccg aaatattggc
      361 ccagaagggc aacaaaacgg gctgtcgatg gcttgcaacc attgatatcc ctgattgcca
      421 ttcctacact accgcccatt cttcattcaa acctgactct cttactccct ttacagtcta
      481 gcagatctgg acgtacctgc atgtaatgcg gccaacgggg ctggtaagct gaacacacca
      541 ttcggagcgg ctggcaagtc tgtcatgccc gatcgacagc acatgtacta gactatctta
      601 agcctagttc cgtgttcaga aacatccggt ttgattgcga atcaacagta cattgatgtt
      661 catccaccgg actctaaacc gatcagctaa ttgttggcgg agcggagttc atcgcgggcg
      721 taggaaacaa ggttgatgtt acccgtaaat ggaaatcgtg cttcgctcac ggcgttgctc
      781 cgaagtaggg tgaagaggtc cgttggctgt gatggtttgc gctggtgtgt gtcaacgctt
      841 agtgatgctg gtgatccaac tccgatccaa atgacaaagc aatgcatata agaaggactg
      901 ggcatcacca acagcgcaac ggcggcagac acgaagccct agctcgacaa gcagccttca
      961 taccccgacc aaaaagtcac acttgtcgta ccgtaacctc gtcgcaagat gccccaggct
     1021 catttcttcg cgttgctgct tgcagccgtt gtaccggccg ttttggcgga cggtcccccg
     1081 gaatcgatgg gcgagaagtt cagcggcctc aacgttctgg atgggaacgg cggacttcaa
     1141 agtttgaccc cgacacccta caccataagt caatggcctt ggggtactgt acccaagctg
     1201 tgctatgaca cgtctgtcaa caacaagtac tgcaacccgt acgatctcga agtatacgat
     1261 gtcagataca cggatgtagg taaaagactt gcctcggatt cggaacctgt gcttacctta
     1321 acttgacaat agtgccccat tcccaccacc gtctgccgat gcaagaactc acctatggcc
     1381 atagacacca ttgcgcagcg tgtcggccaa ctccctgtca aggctcgcca gtataatggc
     1441 tatgtgtcca gctttgcggg agacatgtgc tcagcctaca gcgatagctt caacaactac
     1501 ttctttggcg actgcggcaa ttccgagtcc gtcttcttcc atgagctcag ccacaacctt
     1561 gaccgtcacg ttgcaggggc gtccatcaac gattggtact ccctttcgca agactggaag
     1621 gataccgttg ccaaggacac ttgcgtcgca gaccactatt ccaaggccag ctggctcgag
     1681 gcatatgccc aggtgggagt catggctgga tacgatgcta cggtacagtc tatctatacc
     1741 caaaatgtcg gctgtatggt caatcaggtc aagaaggtgg ttggacagtt gaacagtgtc
     1801 tggcgtaaac agcctgggca gatgtgcgat cgttactgga tcaaggagta agtttctttc
     1861 aacaagaccc attttcttga tgaccctgtg ctgaccggaa tgtaaacagc accacggttt
     1921 gcatgggacc tgatgcggaa gccagtggcc actgtcaagc atccaaagct gatgtcgcgg
     1981 cggagtctgg tggtgtaaac ccagtgttgc cggacgggca gcagaagaag cacgacgcct
     2041 tggtcaagga gcttcagcgt cacgccgagg ccgcggccgg catttcttcc ggaaaaccgg
     2101 cggccgatag aaagaccaag ggtaagaagg gtaccaaatt cagggtctga agcgggaact
     2161 atgatcgatt ccaggtcctg ggctctagct gtgagttcag tcagggtgtt gaggaagttg
     2221 cgaggcctca gttgtgagcg acgtcatcaa accgtctcct tttgggataa tgataacctt
     2281 ttatttctgg ataactggga caggttaggc tgtctttgtc gatagactag gtacgtaaga
     2341 attgatttga tgcttgttcg atgcttttaa gttgttgtcg cttgtggttg cgaggtagtc
     2401 ggcaggtttg tttggataga cgggagacgc ccactcgcac ccagggcgat gaataacgaa
     2461 ggccgatggc tctttccatg tgggaaatac acaagtctgg cattgtccac ttgtttgtct
     2521 tcgagcgggg ttacgatttc tgtcaagccc tttgctcctt tcttccgaga acaaaggaag
     2581 ttttcgatcc agatcgccaa catccgaaaa gggaggaata gttcgatcga tgtaccttga
     2641 cggctcggcc atcgatctga tctgcatttc ccactctgga ttccagggga agggtcatat
     2701 gatggaaacg agatcgaaac ccattgag
//
LOCUS       VVUVVHAB     2237 bp ds-DNA             BCT       09-AUG-1990
DEFINITION  V.vulnificus cytolysin (vvhA) and vvhB gene (pot.), complete cds.
ACCESSION   M34670
KEYWORDS    cytolysin; cytotoxin; hemolysin; toxin.
SOURCE      V.vulnificus (strain EDL174) DNA, clone pCVD702.
  ORGANISM  Vibrio vulnificus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 2237)
  AUTHORS   Yamamoto,K., Wright,A.C., Kaper,J.B. and Morris,J.G.
  TITLE     The cytolysin gene of Vibrio vulnificus: Sequence and relationship
            to Vibrio cholerae El Tor hemolysin
  JOURNAL   Infect. Immun. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.B.Kaper, 29-MAY-1990.
FEATURES       from  to/span     description
    pept        745     2160     cytolysin (vvhA) precursor
    sigp        745      804     cytolysin signal peptide
    matp        805     2157     cytolysin
    pept        237      743     pot. cytolysin (vvhB)
    signal       55       60     -35 region
    signal       80       85     -10 region
    signal       87       92     -35 region
    signal      110      115     -10 region
    signal      184      189     -35 region
    signal      206      211     -10 region
    signal     2185     2219     transcription termination signal
    binding      54       69     CRP binding site
    binding      59       74     Fur binding site
    binding     185      199     Fur binding site
    binding     226      231     vvhB ribosome binding site
    binding     730      735     vvhA ribosome binding site
BASE COUNT      639 a    498 c    509 g    591 t
ORIGIN      
        1 tatattagat cacttttaaa acaataatag atcagatatt aatctgttga ttttgtgata
       61 atgagccaaa aaatactttt attttattta tatgaaatat tttcaggatt attaataaat
      121 agccaacagg attttggtgc atatctattc tcaaggacga accaaacaat ctccatacaa
      181 atattaatgt tatggagaaa ataacaataa taacccttac tcgtaatgag gaatctatgc
      241 ttaataacaa aaatagaaat gtaggacgcc ttaccctact ctgctgtttg tttgcggcga
      301 atacttttgc tgatgttcaa attttgggca gcgaaagtga gctttcacaa accattgccg
      361 atcagtacca acaaaatgtc acgctgttta acggccagct aaacagtaat gatgtgttgt
      421 atgtcaatgt aggaacagca accgatgacg aaatcactca agcaaaaagt catatcatct
      481 ccggtagcac cgtggtgatt gatttgactc aaattgctgg tgacgacgca aggcttgatt
      541 ggagccaaaa actcactggt ttaggactgt cagcgcctgt tgtggttacg ggggtttatc
      601 aaggcgacgc cttagtcaat gcgattgtca gcgatgtcac cgacgagaat gacaacccaa
      661 tcaacgatcc ccaagccgag ttagagagcg ttaaactttc tctcactcat gccctagacc
      721 gcttccaatc tgagggaaaa taagatgaaa aaaatgactc tgtttaccct ttctctttta
      781 cgtaccgcgg tacaggttgg cgcacaagaa tatgtgccga ttgttgagaa acctatttac
      841 atcaccagct caaagattaa gtgtgtgttg cacacaagcg gtgatttcaa cgccacacga
      901 gactggtgta atgcgggtgc ttccatcgat gttcgcgtca atgtggcaca aatgcgctcg
      961 gtacaatcgg caacgtcaga tggttttact cctgacgcca aaattgtccg tttcaccgtc
     1021 gatgccgaca agcctggcac gggtattcat ttggttaacg agctacagca agatcacagc
     1081 tggttccaga gttgggcaaa ccgccgcact tacattggtc cattcgccag cagttacgac
     1141 ctttgggtga aacccgtttc tggttacaca ccgaaaaaag cccgtgacct accgcagaat
     1201 gagaacaaaa actaccaaca ccgcgatact tacggttact ccatcggtat taacggcaaa
     1261 gtaggtgcgg aagtgaacaa agacggcccg aaagtgggtg gcgaagtcag tggctcattt
     1321 acctacaact actcgaagac cttggtgttt gatacaaaag actatcgcat caacaaccgt
     1381 tcatcattga gtgattttga tatttcattc gagcgtgaat ttggggaatg tgatgaactg
     1441 cgccgccaag agcttggatg ctatttcacc gccgctcact ggggcagtgg ctgggtattt
     1501 gataagacga agttcaaccc tatctcttat tccaacttca aaccgaacta tgacgttttg
     1561 tacgaagcgc ccgtgtctga aactggcgta acggattttg agatgggcgt gaaactcaac
     1621 tatcgtgcac gctttggtac cgttcttcct tcagcgctgt tttcggttta cggctctgcg
     1681 ggctcgtcaa ccaacagcag tactgtgaaa caacgtattc gcatcgactg gaatcaccca
     1741 ctgtttgaag cggaacgaca cgttacactg cagtcactga gcaacaacga tctctgcctg
     1801 gatgtttatg gtgagaacgg tgacaaaacg gttgcgggtg gttcggttaa cggctggagc
     1861 tgtcacggca gttggaacca agtttggggc ctagataaag aagaacgtta tcgtagccga
     1921 gtggcatccg atcgttgttt gaccgtaaac gcagacaaaa cgctcacagt cgaacagtgt
     1981 ggtgcgaact tagcacagaa atggtattgg gaaggcgata agctcattag ccgctatgtt
     2041 gatggcagta atactcgcta ccttctaaac attgttggtg gtcgtaatgt tcaagtaacc
     2101 cctgaaaatg aagcaaatca ggcgcgttgg aaacccacat tacaacaagt caaactctag
     2161 gctctgttga ccttagcgat atccaaacgc tccctgtata ctagggagcg tttttcttta
     2221 ttcgccatct attcgtc
//
LOCUS       TOBCPCG    155844 bp ds-DNA   circular  ORG       09-AUG-1990
DEFINITION  N.tabacum (var. Bright Yellow 4) chloroplast, complete genome.
ACCESSION   Z00044
KEYWORDS    16S ribosomal RNA; 23S ribosomal RNA; 4.5S ribosomal RNA;
            5S ribosomal RNA; ATP synthetase; ATPase; NADH dehydrogenase;
            RNA polymerase; autonomous replication; carboxylase; chloroplast;
            complete genome; cytochrome; cytochrome b559; cytochrome b6;
            cytochrome f; initiation factor; phosphoprotein; ribosomal protein;
            ribosomal protein L14; ribosomal protein L16; ribosomal protein L2;
            ribosomal protein L20; ribosomal protein L22;
            ribosomal protein L23; ribosomal protein L33;
            ribosomal protein S11; ribosomal protein S12;
            ribosomal protein S15; ribosomal protein S18;
            ribosomal protein S19; ribosomal protein S2; ribosomal protein S3;
            ribosomal protein S7; ribosomal protein S8;
            ribulose bisphosphate carboxylase; transfer RNA-Ala;
            transfer RNA-Arg; transfer RNA-Asn; transfer RNA-Asp;
            transfer RNA-Cys; transfer RNA-Glu; transfer RNA-Gly;
            transfer RNA-His; transfer RNA-Ile; transfer RNA-Leu;
            transfer RNA-Lys; transfer RNA-Met; transfer RNA-Phe;
            transfer RNA-Pro; transfer RNA-Ser; transfer RNA-Thr;
            transfer RNA-Trp; transfer RNA-Tyr; transfer RNA-Val.
SOURCE      Nicotiana tabacum (var. Bright Yellow 4) chloroplast DNA, clone
            pHC79 (IR-A and IR-B).
  ORGANISM  Chloroplast Nicotiana tabacum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae; Nicotiana tabacum.
REFERENCE   1  (sites)
  AUTHORS   Shinozaki,K., Ohme,M., Tanaka,M., Wakasugi,T., Hayashida,N.,
            Matsubayashi,T., Zaita,N., Chunwongse,J., Obokata,J.,
            Yamaguchi-Shinozaki,K., Ohto,C., Torazawa,K., Meng,B.Y., Sugita,M.,
            Deno,H., Kamogashira,T., Yamada,K., Kusuda,J., Takaiwa,F., Kato,A.,
            Tohdoh,N., Shimada,H. and Sugiura,M.
  TITLE     The complete nucleotide sequence of tobacco chloroplast genome: Its
            gene organization and expression
  JOURNAL   EMBO J. 5, 2043-2049 (1986)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 155844)
  AUTHORS   Sugiura,M.
  JOURNAL   Unpublished (1986) Biology Dept, Nagoya Univ., Nagoya 464, Japan
  STANDARD  full staff_review
COMMENT     The circular tobacco chloroplast DNA sequence is presented in a
            linearized form by cutting at the junction between IRA and LSC.
            The DNA strand which codes for the large subunit of
            ribulose-1,5-bisphosphate carboxylase is designated as A strand and
            the complementary strand as B strand.  The nucleotide sequence of
            the B strand is presented.
            
               Large single copy region (LSC):      1-86684    (86684 bp)
               Inverted repeat B       (IR-B):  86685-112023   (25339 bp)
               Small single copy region (SSC): 112024-130505   (18482 bp)
               Inverted repeat A       (IR-A): 130506-155844   (25339 bp)
            
            Rps12 consists of three exons.  There are two sets of exons 2 and
            3.  One set is located on the same strand in IR-B 28 kb upstream of
            exon 1.  The oteher set is located on the opposite strand in IR-A
            69 kb downstream of exon 1.  The tobacco rps12 gene probably
            consists of three transcription units and requires trans-splicing.
            
            The chloroplast DNA segments capable of replication in yeast, ars1
            and ars2, are located at positions 112768-113117 and 14570-15088
            respectively.
            
            Seven open reading frames, (RF236, RF548, RF862, stop codon to stop
            codon) and (ORF151, ORF90, ORF80, ORF134, start codon to stop
            codon) are present near the rpoB gene.  Four or these ORFs show
            some homology to portions of the beta'-subunit sequence of E.coli
            RNA polymerase [1].
            
            Most open reading frames indicated in FEATURES are from start codon
            to stop codon.  The intron boundaries for the ndhA and ndhB are not
            known and thus the largest possible intron is indicated (from stop
            codon to stop codon).
FEATURES       from  to/span     description
    tRNA         80        6 (c) His-tRNA (GUG)
    tRNA       4407     4371 (c) Lys-tRNA (UUU), exon 1
               1844     1810 (c) Lys-tRNA (UUU), exon 2
    tRNA       7487     7416 (c) Gln-tRNA (UUG)
    tRNA       8719     8632 (c) Ser-tRNA (GCU)
    tRNA       9499     9521     Gly-tRNA (UCC), exon 1
              10213    10260     Gly-tRNA (UCC), exon 2
    tRNA      10430    10501     Arg-tRNA (UCU)
    tRNA      28783    28854     Cys-tRNA (GCA)
    tRNA      31999    31926 (c) Asp-tRNA (GUC)
    tRNA      32191    32108 (c) Tyr-tRNA (GUA)
    tRNA      32323    32251 (c) Glu-tRNA (UUC)
    tRNA      33172    33243     Thr-tRNA (GGU)
    tRNA      37223    37132 (c) Ser-tRNA (UGA)
    tRNA      38050    38120     Gly-tRNA (GCC)
    tRNA      38421    38348 (c) fMet-tRNA (CAU)
    tRNA      47111    47197     Ser-tRNA (GGA)
    tRNA      48577    48505 (c) Thr-tRNA (UGU)
    tRNA      49288    49322     Leu-tRNA (UAA), exon 1
              49826    49875     Leu-tRNA (UAA), exon 2
    tRNA      50232    50304     Phe-tRNA (GAA)
    tRNA      54390    54353 (c) Val-tRNA (UAC), exon 1
              53781    53747 (c) Val-tRNA (UAC), exon 2
    tRNA      54581    54653     Met-tRNA (CAU)
    tRNA      68880    68807 (c) Trp-tRNA (CCA)
    tRNA      69118    69045 (c) Pro-tRNA (UGG)
    tRNA      88770    88697 (c) Ile-tRNA (CAU)
    tRNA      96507    96427 (c) Leu-tRNA (CAA)
    tRNA     102459   102530     Val-tRNA (GAC)
    tRNA     104547   104583     Ile-tRNA (GAU), exon 1
             105291   105325     Ile-tRNA (GAU), exon 2
    tRNA     105390   105427     Ala-tRNA (UGC), exon 1
             106137   106171     Ala-tRNA (UGC), exon 2
    tRNA     109973   110046     Arg-tRNA (ACG)
    tRNA     110699   110628 (c) Asn-tRNA (GUU)
    tRNA     116067   116146     Leu-tRNA (UAG)
    tRNA     131830   131901     Asn-tRNA (GUU)
    tRNA     132556   132483 (c) Arg-tRNA (ACG)
    tRNA     137139   137102 (c) Ala-tRNA (UGC), exon 1
             136392   136358 (c) Ala-tRNA (UGC), exon 2
    tRNA     137982   137946 (c) Ile-tRNA (GAU), exon 1
             137238   137204 (c) Ile-tRNA (GAU), exon 2
    tRNA     140070   139999 (c) Val-tRNA (GAC)
    tRNA     146022   146102     Leu-tRNA (CAA)
    tRNA     153759   153832     Ile-tRNA (CAU)
    rRNA     102758   104246     16S rRNA
    rRNA     106325   109134     23S rRNA
    rRNA     109236   109338     4.5S rRNA
    rRNA     109595   109715     5S rRNA
    rRNA     132934   132814 (c) 5S rRNA
    rRNA     133293   133191 (c) 4.5S rRNA
    rRNA     136204   133395 (c) 23S rRNA
    rRNA     139771   138283 (c) 16S rRNA
    RNA       32347    31836 (c) Asp-tRNA, Tyr-tRNA, Glu-tRNA RNA precursor
    pept       1595      534 (c) PSII 32kd protein (psbA)
    pept       6211     6172 (c) ribosomal protein S16 (rps16), exon 1
               5311     5094 (c) ribosomal protein S16 (rps16), exon 2
    pept      12148    10625 (c) ATPase alpha subunit (atpA)
    pept      13452    13308 (c) ATPase I subunit (atpF), exon 1
              12612    12203 (c) ATPase I subunit (atpF), exon 2
    pept      14099    13854 (c) ATPase III subunit (atpH)
    pept      16001    15258 (c) ATPase a subunit (atpI)
    pept      16938    16228 (c) ribosomal protein S2 (rps2)
    pept      27501    24289 (c) RNA polymerase beta subunit (rpoB)
    pept      34462    35523     PSII D2 protein (psbD)
    pept      35471    36892     PSII 44kd protein (psbC)
    pept      38873    38571 (c) ribosomal protein S14 (rps14)
    pept      41200    38996 (c) PSI P700 apoprotein A2 (psaB)
    pept      43478    41226 (c) PSI P700 apoprotein A1 (psaA)
    pept      48133    47528 (c) ribosomal protein S4 (rps4)
    pept      55276    54875 (c) ATPase epsilon subunit (atpE)
    pept      56769    55273 (c) ATPase beta subunit (atpB)
    pept      57587    59020     RuBisCO large subunit (rbcL)
    pept      64327    65289     cytochrome f (petA)
    pept      66860    66741 (c) PSII component (psbF)
    pept      67121    66870 (c) PSII cytochrome b559 (psbE)
    pept      70123    70323     ribosomal protein L33 (rpl33)
    pept      70510    70815     ribosomal protein S18 (rps18)
    pept      71401    71015 (c) ribosomal protein L20 (rpl20)
    pept      72326    72213 (c) ribosomal protein S12 A (rps12A), exon 1
             100851   100620 (c) ribosomal protein S12 A (rps12A), exon 2
             100083   100058 (c) ribosomal protein S12 A (rps12A), exon 3
    pept      72326    72213 (c) ribosomal protein S12 B (rps12B), exon 1
             141678   141909     ribosomal protein S12 B (rps12B), exon 2
             142446   142471     ribosomal protein S12 B (rps12B), exon 3
    pept      74950    76476     PSII P680 apoprotein (psbB)
    pept      77098    77319     PSII 10kd phosphoprotein (psbF)
    pept      77449    77454     cytochrome b6 (petB), exon 1
              78208    78849     cytochrome b6 (petB), exon 2
    pept      79845    80264     cytochrome b/f complex subunit 4 (petD)
    pept      81465    80452 (c) RNA polymerase alpha subunit (rpoA)
    pept      81947    81531 (c) ribosomal protein S11 (rps11)
    pept      82465    82175 (c) RF96
    pept      83004    82600 (c) ribosomal protein S8 (rps8)
    pept      83544    83173 (c) ribosomal protein L14 (rpl14)
    pept      85093    85085 (c) ribosomal protein L16 (rpl16), exon 1
              84064    83669 (c) ribosomal protein L16 (rpl16), exon 2
    pept      85896    85240 (c) ribosomal protein S3 (rps3)
    pept      86348    85881 (c) ribosomal protein L22 (rpl22)
    pept      86680    86402 (c) ribosomal protein S19 (rps19)
    pept      88231    87841 (c) ribosomal protein L2 (rpl2), exon 1
              87174    86741 (c) ribosomal protein L2 (rpl2), exon 2
    pept      88531    88250 (c) ribosomal protein L23 (rpl23)
    pept     100004    99537 (c) ribosomal protein S7 (rps7)
    pept     125398   125135 (c) ribosomal protein S15 (rps15)
    pept     142525   142992     ribosomal protein S7 (rps7)
    pept     153998   154279     ribosomal protein L23 (rpl23)
    pept     154298   154688     ribosomal protein L2 (rpl2), exon 1
             155355   155788     ribosomal protein L2 (rpl2), exon 2
    mRNA       1680      441 (c) psbA mRNA (441 +/- 2 bp)
    mRNA      57025    54637 (c) atpB, atpE mRNA (alt.; 54637 +/- 1 bp)
    mRNA      57025    54676 (c) atpB, atpE mRNA (alt.; 54676 +/- 2 bp)
    mRNA      57405    59161     rbcL mRNA
    IVS        4370     1845 (c) Lys-tRNA intron
    IVS        6171     5312 (c) rps16 intron
    IVS        9522    10212     Gly-tRNA intron (no splice consensus)
    IVS       13307    12613 (c) atpF intron
    IVS       49323    49825     Leu-tRNA intron (no splice consensus)
    IVS       54352    53782 (c) Val-tRNA intron
    IVS       77455    78207     petB intron (no splice consensus)
    IVS       85084    84065 (c) rpl16 intron
    IVS       87840    87175 (c) rpl2 intron
    IVS       98349    97838 (c) ndhB intron
    IVS       72326    72213 (c) rps12A intron A
    IVS      141678   141909     rps12A intron B (no splice consensus)
    IVS      142446   142471     rps12A intron C (no splice consensus)
    IVS      104584   105290     Ile-tRNA intron (no splice consensus)
    IVS      105428   106136     Ala-tRNA intron (no splice consensus)
    IVS      123219   122140 (c) ndhA intron
    IVS      137101   136393 (c) Ala-tRNA intron
    IVS      137945   137239 (c) Ile-tRNA intron
    IVS      144180   144690     ndhB intron (no splice consensus)
    IVS      154689   155354     rpl2 intron (no splice consensus)
    rpt       86685   112023     inverted repeat B (IR-B)
    rpt      130506   155844     inverted repeat A (IR-A)
    site          1    86684     large single copy region (LSC)
    site     112024   130505     small single copy region (SSC)
    ORF        3658     2129 (c) ORF-509A cds
    ORF        7724     8020     ORF98 cds
    ORF       19753    17165 (c) RF862 cds
    ORF       20277    19873 (c) ORF134 cds
    ORF       20423    20181 (c) ORF80 cds
    ORF       20646    20374 (c) ORF90 cds
    ORF       21475    20765 (c) RF236 cds
    ORF       23127    21481 (c) RF548 cds
    ORF       24283    23828 (c) ORF151 cds
    ORF       37558    37241 (c) ORF105 cds
    ORF       37586    37774     ORF62 cds
    ORF       44264    44497     ORF77 cds
    ORF       45394    45146 (c) ORF82 cds
    ORF       46464    46240 (c) ORF74A cds
    ORF       48933    49145     ORF70A cds
    ORF       51457    50981 (c) ORF158 cds
    ORF       52417    51563 (c) bhpB cds
    ORF       52659    52297 (c) bhpA cds
    ORF       59785    61323     ORF512 cds
    ORF       62630  > 62630     ORF184
    ORF       63407    64096     ORF229
    ORF       66168    66467     ORF99A
    ORF       67580    67269 (c) ORF103
    ORF       72686    72465 (c) ORF73
    ORF       73547    73323 (c) ORF74B
    ORF       82162    82049 (c) ORF37
    ORF       88883    90628     ORF581
    ORF       90598    95724     ORF1708
    ORF       95815    96078     ORF87
    ORF      146472    96057 (c) ORF115
    ORF       96116    96394     ORF92
    ORF       96553    96792     ORF79
    ORF       98889    98350 (c) ndhB, exon 1
              97837    97047 (c) ndhB, exon 2
    ORF      140581   101948 (c) ORF131
    ORF      102099   102311     ORF70B
    ORF      110820   110593 (c) ORF75
    ORF      111025   112077     ORF350
    ORF      114198   112066 (c) ndhF
    ORF      116250   117191     ORF313
    ORF      118958   117429 (c) ndhD
    ORF      119860   119555 (c) ndhE
    ORF      120383   120084 (c) ORF99B
    ORF      120612   120196 (c) ORF138
    ORF      121512   121009 (c) ORF167
    ORF      123840   123217 (c) ndhA, exon 1
             122109   121597 (c) ndhA, exon 2
    ORF      125023   123842 (c) ORF393
    ORF      126482   125796 (c) ORF228
    ORF      127561   126740 (c) ORF273
    ORF      131501   127767 (c) ORF1244
    ORF      131709   131936     ORF75
    ORF      140186   140581     ORF131
    ORF      140430   140218 (c) ORF70B
    ORF      145976   145737 (c) ORF79
    ORF      146125   146472     ORF115
    ORF      146413   146135 (c) ORF92
    ORF      146714   146451 (c) ORF87
    ORF      151931   146805 (c) ORF1708
    ORF      153646   151901 (c) ORF581
    ORF       96404    96057 (c) ORF 115
    anticdn      45       43 (c) His-tRNA anticodon gtg
    anticdn    4376     4374 (c) Lys-tRNA anticodon ttt
    anticdn    7455     7453 (c) Gln-tRNA anticodon ttg
    anticdn    8685     8683 (c) Ser-tRNA anticodon gct
    anticdn   10222    10224     Gly-tRNA anticodon tcc
    anticdn   10463    10465     Arg-tRNA anticodon tct
    anticdn   28815    28817     Cys-tRNA anticodon gca
    anticdn   31965    31963 (c) Asp-tRNA anticodon gtc
    anticdn   32157    32155 (c) Tyr-tRNA anticodon gta
    anticdn   32289    32287 (c) Glu-tRNA anticodon ttc
    anticdn   33204    33206     Thr-tRNA anticodon ggt
    anticdn   37189    37187 (c) Ser-tRNA anticodon tga
    anticdn   38082    38084     Gly-tRNA anticodon gcc
    anticdn   38387    38385 (c) fMet-tRNA anticodon cat
    anticdn   47145    47147     Ser-tRNA anticodon gga
    anticdn   48544    48542 (c) Thr-tRNA anticodon tgt
    anticdn   49841    49843     Leu-tRNA anticodon taa
    anticdn   50265    50267     Phe-tRNA anticodon gaa
    anticdn   54357    54355 (c) Val-tRNA anticodon tac
    anticdn   54614    54616     Met-tRNA anticodon cat
    anticdn   68846    68844 (c) Trp-tRNA anticodon cca
    anticdn   69084    69082 (c) Pro-tRNA anticodon tgg
    anticdn   88737    88735 (c) Ile-tRNA anticodon cat
    anticdn   96474    96472 (c) Leu-tRNA anticodon caa
    anticdn  102491   102493     Val-tRNA anticodon gac
    anticdn  104579   104581     Ile-tRNA anticodon gat
    anticdn  105423   105425     Ala-tRNA anticodon tgc
    anticdn  110007   110009     Arg-tRNA anticodon acg
    anticdn  110667   110665 (c) Asn-tRNA anticodon gtt
    anticdn  116101   116103     Leu-tRNA anticodon tag
    anticdn  131862   131864     Asn-tRNA anticodon gtt
    anticdn  132522   132520 (c) Arg-tRNA anticodon acg
    anticdn  137106   137104 (c) Ala-tRNA anticodon tgc
    anticdn  137950   137948 (c) Ile-tRNA anticodon gat
    anticdn  140038   140036 (c) Val-tRNA anticodon gac
    anticdn  146055   146057     Leu-tRNA anticodon caa
    anticdn  153792   153794     Ile-tRNA anticodon cat
BASE COUNT    47824 a  29991 c  28992 g  49037 t
ORIGIN      2692 bp upstream of BamHI site.
        1 ttatgggcga acgacgggaa ttgaacccgc gcatggtgga ttcacaatcc actgccttga
       61 tccacttggc tacatccgcc ccctcgccta cttacattcc gtttttacat tatttaaatt
      121 agaaaacaaa agattcaagt tcgaatatag ctcttctttc ttatttcaat gatattatta
      181 tttcaaagat aagagatatt caaagataag agataagaag aagtcaaaat ttgatttttt
      241 ttttggaaaa aaaaaatcaa aaagatatag taacattagc aagaagagaa acaagttcta
      301 tttcacaatt taaacaaata caaaatcaaa atagaatact caatcatgaa taaatgcaag
      361 aaaataacct ctccttcttt ttctataatg taaacaaaaa agtctatgta agtaaaatac
      421 tagtaaataa ataaaaagaa aaaaagaaag gagcaatagc accctcttga tagaacaaga
      481 aaatgattat tgctcctttc ttttcaaaac ctcctataga ctaggccagg atcttatcca
      541 tttgtagatg gagcttcgat agcagctagg tctagaggga agttgtgagc attacgttca
      601 tgcataactt ccataccaag gttagcacgg ttaatgatat cagcccaagt attaattaca
      661 cggccttgac tgtcaactac agattggttg aaattgaaac catttaggtt gaaagccata
      721 gtgctgatac ctaaagcggt aaaccagata cctactacag gccaagcagc taggaagaag
      781 tgtaacgaac gagagttgtt gaaactagca tattggaaga tcaatcggcc aaaataacca
      841 tgagcggcta cgatgttata agtttcttcc tcttgaccga atctgtaacc ttcattagca
      901 gattcatttt ctgtggtttc cctgatcaaa ctagaagtta ccaaggaacc atgcatagca
      961 ctgaataggg agccgccgaa tacaccagct acgcctaaca tgtgaaatgg gtgcataagg
     1021 atgttgtgct cagcctggaa tacaatcatg aaattgaaag taccagagat tcctagaggc
     1081 ataccatcag aaaaacttcc ttgaccaatt gggtagatca agaaaactgc ggtagcagct
     1141 gcaacaggag ctgaatatgc aacagcaatc caaggtcgca tacccagacg gaaactaagc
     1201 tcccactcac gacccatgta acaagctacg ccaagtaaga agtgtagaac aattagttca
     1261 taaggaccac cgttgtataa ccattcatca acggatgccg cttcccagat tgggtaaaaa
     1321 tgtaaaccta tagctgcaga agtaggaata atggcaccgg aaataatatt gtttccgtaa
     1381 agtagagacc ctgaaacagg ttcacgaata ccatcaatgt ctactggagg agcagcaatg
     1441 aaggcaataa taaatacaga agttgccgtc aataaggtag ggatcatcaa aacaccaaac
     1501 catccaatgt aaagacggtt ttcagtgcta gttatccagt tacagaagcg accccatagg
     1561 ctttcgcttt cgcgtctctc taaaattgca gtcatggtaa aatcttggtt tatttaatca
     1621 tcagggactc ccaagcacac tagttttcta caaatcaaaa tagaaaatgg aaggcttttt
     1681 attcaacagt ataacatgac ttatatactc gtgtcaacca aggtgtatgt agatctattc
     1741 aaatttttaa tgaagttgat tggaaaaata cggacttctc tacagaaaat tagaatttcg
     1801 atatgctagt gggttgcccg ggattcgaac ccggaactag tcggatggag tagataagtt
     1861 ccttgttaaa taaaataaat gttaatctta aattaaataa acaagtaaag acccctcccc
     1921 aagccgtgct tgcatttttc attgcacacg gctttcccta tgtatacatc agttcctttc
     1981 ttatagaaat tagaaagact ttaaaaagtt gaatactcag ttgatttacc ccttaattac
     2041 tattacaatc aacatttcag aatagtgaaa tttttttatc tcttcatcat ttagaaacaa
     2101 atttccattt agaaaatcta agaatgaatc attgataatt cgccagatca ttgatacaaa
     2161 aaatatccaa ataccaaatc cgacttctat atactcccca caaactagaa gaagctcgtg
     2221 ggaaggtcaa agaaagaact tgttcttccg acgttaagaa ttcttccaat aattccgagc
     2281 ccgatctttt caaaaaagtg cgtacagtac ttttgtgttt ccgagctaaa gttctagcac
     2341 aagaaagtcg aagtatatac tttattcgat ataaagtctt ttttttggaa gatccgctat
     2401 aataatgaaa aagatttctg catatacgcc caaatcggtc aataatatca gaatctgata
     2461 aatcggacca aaccggttta ctaatgggat gccctaatac ggtacaaaag tttgctttag
     2521 ctaatgatcc aatcaaagga ataattggaa caagggtatc gaacttctta attgcattat
     2581 tgattagaaa tgaattttct aacatttgac tacgtaccat tgaaggattt agtcgcacac
     2641 ttgaaagata gcccataaag tcacgggaat gattggataa ttggtttata tggatccttc
     2701 ctgtgtgaaa gcacagagaa caatgacatt gccaaaaatt gacaaggtaa aatttccatt
     2761 tattcatcaa aagaaacgtc ccttttgaag ccagaatgga ttttccttga tacctaacat
     2821 aatgcatgaa aggatccttg aataaccata gggtaacctg aaaatcctta gcaaagactt
     2881 ctacaagacg ttctattttt ccatagaaat atattcgttc aagaagggct ccaaaagatg
     2941 ttgatcgtaa atgagaagat tggttccgta gaaagacgaa agtggattcg cattcatata
     3001 cataagaatt atataagaag aagaagaatc tttgattttt ttttgaaaag gagtaaccgg
     3061 gcttctttga agtaataaga ctattcaaat tccaaaattc atggagaaag aatcgtaata
     3121 aatgtaaaga agaggcatct tttacccaat agcgaagagt ttgaaccaag atttccagat
     3181 gaacagggta gggtattagt atatctaata cataatttag atgtgaaaaa ttgtcctcta
     3241 aaaaaggaaa tgttgaatga attgatcgta aattataaga tttaaaaatc tttttgcctt
     3301 cgaaagaaga taaagaagat attaatcgta gagaaaacgg aatttccaca ataaacgcaa
     3361 atccctctga tatcatttga gaatacaaac tcttgttgca ccccaaaaat gaatttttgt
     3421 tagaatcatt agtagaaata agaaaatgat tctgttgata cattcgagta attaaacgtt
     3481 tcacaattag gaaacttaat ttattgttat aacctggatt ttccaacaaa atcgacctat
     3541 ttctatttaa accatgatca tgagcaagtg cataaatata ctcctgaaag ataagtggat
     3601 ataggaagtt gtgttgttgc gatctatctg gctgtaaata tctttggatt tcttccattt
     3661 gaaattcgat ttgaaccaaa gacggaagat tttgagggtt atcaaatgat acatagtgcg
     3721 atacagttaa aacaaagtat tttagtaaga atagatacct tggatacagg taaacttctc
     3781 aacggattct ctatcatctc tttttttttt cgtttcgttt aattggtcta tgttatagtg
     3841 ttataggata ataagatggt tagaaatcct ttattttttc aacctaatcg ctcttttgac
     3901 ttcggaaaaa actttcttta tcaatatact gtttcttcta cacacacatc tccgtaatag
     3961 aaaatggtaa tagttaggat tcattaaaaa aatggagaat ccactcatgg gacaagaaac
     4021 ccttcccgca tcaggcacta ataaattttt aacgtctaat tagatcggga atcattcaaa
     4081 ttaagaacaa aagctcgttg ctttttcttt ctttccctat aatttaattg aagccgcagc
     4141 cctatccatt tattcattcg acccaacttt attttgttcc gttccaagaa ttctaacacg
     4201 gttttatacc catctaggaa caatgaaata ttctcagaac tttccgttga tacgacatgc
     4261 tatttttacc attcattccc tttcaggatc agtcgtggtc ttccaaactt taccgagagt
     4321 atggacgaat ccctcacttc atccatatgt gtaaaagaga ctagccgcac ttaaaagccg
     4381 agtactctac cgttgagtta gcaacccgaa gaaaatatcg aagaaaaata aataaagaga
     4441 ttagacaaga caaccaaaaa ccattgaagg aataaatcta aaaaaataca ttcacatttt
     4501 cgaattaatt taaaacataa aataaaacta aatagatcca cttcatttat cacaatgaat
     4561 tatatttgtt cgatacactc tgttgtcata aatattgaat agtgaaaaaa aaaaaagaaa
     4621 tttcaattga caacaataaa aaatattaaa aaaaaggact tgtgttagat tggcactaca
     4681 aatctaatcc aaataaaata gatacaaaaa agtatagatg agagaataaa ttaagtggaa
     4741 aacaaaacta caatttattt agatttattt aatccataat ggattcaatc aagttaagtg
     4801 agataagcaa acttgatttc ttttttagtt ttagtagagt tccaatgaaa aacggaaaac
     4861 cacccaattg aaggaaatgc ccgaattttc tatttcgagg atcaataaaa taagaggttt
     4921 tgtcgttata gaacacggaa ttcaatggaa gcaatgataa aaaaatacaa atagaaaagg
     4981 aaagggagga aatacaaaaa aatagaagag aaaagtcata caaagttata tacaaatgac
     5041 tacccccctt tttgtatttc cttaatttat ttccttaatt gaatttcggt tgattaggat
     5101 tgattaggac gaagttcctt aaaaacctcc gccttcttta aaatatcctg aacagttcct
     5161 gtaggttgag cccctttttc aaggaaatat aaaatagcag gaacatttaa ataagtttga
     5221 ttctttatcg gatcataaaa acccactttc cgaagatctt ttccttctct tcgggatcga
     5281 acatcaattg caacgattcg atagacggct cattgggatt gatgtagatg aacaacaccc
     5341 cccctagaaa cgtataggaa gctttctcct cgtacggctc gagaaaaatg attgattcga
     5401 ggttttatct ctgtatggaa ttctatctaa gaaatgacaa ctgggtccat aaaatgatca
     5461 aatcaattaa agatgtaagt cttttttttt cttctttctt cctgaaaatg aaaaagaaac
     5521 cattcgtact ctcataactc aagttggata actttcaaac agttcaaagg aaaatctttc
     5581 ggcaatttca tttattgagc ggtctttcct ccttttatgt ttgtctcgtt taaaatggat
     5641 ttggattctt cagttcgatc cagttattaa gacaataaaa aaggtgtttc cttgttctgg
     5701 gatcctttat ctttgtttta ttttaaatca ttgggtttag acattacttc ggtgcttttt
     5761 aatcctttca aaatggcagc aacatacccc ttttgcgatt tctatgaaag aatcctacag
     5821 acgatggatt cccgcgtgaa acactttgga tcgaaaagtt tgaatcaatt ccaaggaatt
     5881 tttgaattgg aaacttgctc gaattggatt ctttcgattt ccataccgaa aatatattta
     5941 cgaagttgtt ccaatttttt tattgattgg cattaaccct agactcttgc cccgagaaat
     6001 aaattaatac tttctactcg agctccatca tggactattt acattccaag acaacaaaaa
     6061 agaggggttc taatgaaaca gaaccaatga tgtcgagcca agagcacctt cattcctaca
     6121 taaaatggtg gatgtacaaa tccacaacgg atcctgtcct tcaagtcgca cgttgctttc
     6181 taccacatcg tttcaaacga agttttacca taacattcct ctaagaaccg gtctggaatt
     6241 gattcaatta tggaatcatg aatagtcatt ggttgggctg atgtataaac accataatct
     6301 atactttgtt ctatatctat atactataga gataggtgga taaatatttt tctttagtaa
     6361 gaccccatcg ctaatattaa tttatctaac atattaatta atatttaata tataaatata
     6421 tatagaaata ataataaata agaataataa taaataagac gaataaatga gttctttttg
     6481 attctgcatc ttcacgtgac tcaataggag agattgacct atttcagact tcttcaaata
     6541 gcaaagattc cgcttataag gaatgattaa aactatttat atttctaaat ttagaaagtt
     6601 cccttttcga catcattatt tgaagaaaat ttgatagtta aagatcactt ttgatcatct
     6661 taggaaagaa aaaagataag tctttctttt ttaattgaat catcaacgat ttcaatgatc
     6721 taaaatagat aaatacacca aacaacaaat ccaatttttt tttatgagat ggataaaaaa
     6781 agattaatat aaggtaagat tttcattctt attctttttt ttttttttca tctgattgat
     6841 aaaatccaaa gaatggggag ggtttcgtat ctatcaattc gatcaaatag actgagcaat
     6901 tgtcaccgtt tatagatatt gaaatgaatg ccttcccatt actgattaac tcctatctac
     6961 cccattctat gggcctgatg cagcataaat caaaagaaaa gaggggggtg tcctagtctt
     7021 tttgattttt acgaaatgcg agctgtctag gcacaaagcc aaacaagtcc agattaagtc
     7081 aagtttttgc tcctattttt tgatatttta gcctaactca ttgattaaga attaagagac
     7141 ttagtgaatt taattagtac caaaaatccc ctcttggcga aaagtcaaga aatccacaaa
     7201 aaagaaaatg gaatctaatt aggctaattt aggggataga gaatacgaga tagggaatat
     7261 agattctttc gcatctcgat tccgtttttg aaaaaaaaaa atgattcatc gaagaaaaaa
     7321 atcagaaaca acaatcacat tccagctaac atttcgattt taaacagaac attgttaaaa
     7381 aagcaatcta tattctcata gaatatatat atgttctggg acggaaggat tcgaacctcc
     7441 gaatagcggg accaaaaccc gttgccttac cacttggcca cgccccattt agatttctat
     7501 tcgatactaa gaaagtatat tgcttgtttt gtttgtttgt caactctagt ccaaatatct
     7561 atagaataga ttagattggt actaggattt tgcgatgttt ttggtatgtg tagatataga
     7621 attcaactta atttattgat cattacatat aattcaatta agatattgta tgaaaatatg
     7681 attttttcga ttctcctttg agaaaaggag gatttttgat tgggtgggtt caaagaaaaa
     7741 gaagtatttt ttgtttacct tacttacttt ccctttcctt atatcaataa cgcaatcaaa
     7801 atgcaattat ctctccaaga acaaaaagtc tgttatgctt aataccttta gtttgatcgg
     7861 tatctgtctt aattcgaccc ttttttcgag tagttttttc ttcggcaaat tgcccgaggc
     7921 ctatgctttt ttgaatccaa tcgtagatat tatgccagtc atacctctgt ttttttttct
     7981 cttagccttt gtttggcaag ctgctgtaag ttttcgatga gatccttaat aatatcctag
     8041 aaaattcatg atttattcga gaaaaattct aaaataaata aaatcagata agctttaccg
     8101 tttgaaacct cgattcaaac attgaaattc ttggatagtc acgagaaatc cggcttaact
     8161 tatttcctta ttttttgacg ctttcccttc cagtgaaaga ccttattagg ctcctcacaa
     8221 tacctaattg tgtatataaa aaaattttgg ttaatgacaa actcttagta gaaaagaatt
     8281 tatgaaaatt cttttagaga aagagcttca ttgcttggtg tcaaactagg atatgcggta
     8341 gaaaaatgga tgatctattc tctttttttc aaaaaaaatc atcttggaga ttgtgtaatg
     8401 cttactctca aactcttcgt ttacacagta gtgatatttt ttgtttctct cttcatcttt
     8461 ggattcctat ctaatgatcc cggacgtaat cctggacgtg aagaataaaa taaaaaaggt
     8521 ttttccttgc ttgattttcc aattttctta tgatttggtc tattccacac atttaactaa
     8581 gaataagaac aaaggatttc gaaatttgaa aaaaaaaaat caagtcatca acggaaagag
     8641 agggattcga accctcggta cgattaactc gtacaacgga ttagcaatcc gccgctttag
     8701 tccactcagc catctctccc aattgaaaaa gataattact acatgagata gcacataaga
     8761 taaaggaaag aatctttctt tctctctttt cttctttcta tattatatag atatgtacaa
     8821 cttttatcat caatttcctt tatttcttta tctaaagtaa aggaagggct cagaagagcc
     8881 aagaatatca agaaaaataa agaagacctc ttttctttgt cttgattttg ttcgaaagga
     8941 ccctcttatt ctcatggcct ggtctggtca gtacccagcc gggcctcttt tgttccaacg
     9001 aatttgaatt tgaaaactaa aaagcctgtt atagttgtaa tatttcattt taattgaata
     9061 gttaatattc aagcaacaag aaaaaattcc cattttttgt aaaagtaaaa taaaatatat
     9121 aaaatagaaa attcgatcaa aataaaagtc tcatttctct ttctgctttt ttattttatg
     9181 tttaccacct tactggacta aaaaaaagaa gctttcgagt attccacaat gcatttttat
     9241 gttatgattt tagtggtttt gacgagccgt atctctatca aaactcctcc agcaaaagaa
     9301 aagataaaac taaattctgt aatttagtta tttaaatgaa ccctcgtttc caaatctcat
     9361 caaattggaa tccccccagg aaaaaagatc aacactctaa tttggatgat tctgtgacga
     9421 ccctatctta tcctatcttg attaccacaa ttcccctgtt cgacaaaagt tgcatttgta
     9481 tacaataatc ggattgtagc gggtatagtt tagtggtaaa agtgtgattc gttctattat
     9541 cccttaaata gttaaagggt ccttcggttt gattcgtatt ccgatcaaaa acttgatttc
     9601 taaaaaggat ttaatccttt tcctctcaat gacagattcg agaacaaata cacattctcg
     9661 tgatttgtat ccaagggtca cttagacatt gaaaaattgg attatgaaat tgcgaaacat
     9721 aattttggaa ttggatcaat acttccaatt gaataagtat gaataaagga tccatggatg
     9781 aagatagaaa gttgatttct aatcgtaact aaatcttcaa tttcttattt gtaaagaaga
     9841 aattgaagca aaatagctat taaacgatga ctttggttta ctagagacat caacatattg
     9901 ttttagctcg gtggaaacaa aacccttttc ctcaggatcc tattaaatag aaatagagaa
     9961 cgaaataact agaaaggttg ttagaatccc ctcttctaga aggatcatct acaaagctat
    10021 tcgttttatc tgtattcaga ccaaaagctg acatagatgt tatgggtaga attctttttt
    10081 tttttcgaat tttgttcaca tcttagatct ataaattgac tcatctccat aaaggagccg
    10141 aatgaaacca aagtttcatg ttcggttttg aattagagac gttaaaaata atgaatcgtc
    10201 gtcgactata acccctagcc ttccaagcta acgatgcggg ttcgattccc gctacccgct
    10261 ctatatctat ttattctaaa tattttaatg tattcattaa atcaaattta gtttattagt
    10321 attagtacat cattgaatat acaattccaa aaattctttc acatccgatt ctttctgttt
    10381 tttttttcaa acaaaaagtt aaaatacgaa aaaaaaatca gaatgaaaag cgtccattgt
    10441 ctaatggata ggacagaggt cttctaaacc tttggtatag gttcaaatcc tattggacgc
    10501 aatttatttc catatatatt tttttttaga tttcgatagc aagaaagact gtttgaatat
    10561 ttgaatccaa gacgcttgat tccttttttt tattaagatt aagacaaaag tgatcaatat
    10621 ttctttatgc ttgttcctga agtataaaac ggtccatttg ttcctgaata gcttctttca
    10681 aaagggcttc tgcttcctcg gtaaatgtct tggtagaaga tatgatttct tggaactgag
    10741 gtttattagt ttttaagtaa gtacgtagct caacaagaaa tttccttacc tgtccaactt
    10801 ctaatgaatc aagatagccg tttgttccgg tataaatagt cattatctgc tcttctaccg
    10861 tgagaggagc tgattgggat tgtttaagca attcacgtaa tcgttgacct cttgccaatt
    10921 gattctgagt agctttatcg agatcagaag caaattgtgc aaaggcttct aattctgcga
    10981 attgtgctag ttctaatttt aatttaccag ctacttgttt catggctttt atttgagctg
    11041 cggaccccac tctggaaacg gagataccca cattaatagc aggtctgatt ccagaattga
    11101 ataggtcggc ggataagaag atttgtccat cagtaatgga aattacatta gtaggaatat
    11161 aagccgaaac atctcccgat tgggtttcaa ctattggtaa ggcggtcata cttccttcac
    11221 ctaaactaga acttaattta gcggctcttt ccaaaaggcg tgaatgcaaa taaaaaacat
    11281 ctcctagata agcttcacga ccgggcggtc ttcgtaatag aagagacatt tggcgataag
    11341 cttgcgcttg tttggaggga tcatcataaa tgattaaagt gtgtcgttca cgatacataa
    11401 aatattcagc cagagctgct cctgtataag gagcaaggta ttgtaatgta gcaggggaat
    11461 ctgccgtttc ggctaccaca atagtgtatt ccatcgctcc cctttcctgt aaagtagtta
    11521 cgacctgggc cacagaagat gctttttgcc caatagctac ataaacacat attacatttt
    11581 gaccttgttg attgaggatc gtatctgtgg ctactgctgt tttaccggtc tgtctgtccc
    11641 caataattaa ttctcgctga ccacgtccta tagggatcat cgaatcaata gcaataagcc
    11701 cggtttgaag aggctcatat acggaacggc gcgaaataat acccggggcg gcagattcga
    11761 ttaatcgaaa ttcagaagct gaaatttcac ctctaccatc aataggttta gccagggcat
    11821 ttataacacg acccaaataa gcctcactca cgggtatctg agcaattctt cccgttgctt
    11881 ttacagaact tccttcttgt atcaataaac catcgcccat taatacaaca ccaacattat
    11941 ttgattccaa attcagagca atgcctattg taccctcttc aaattcgact aattcacccg
    12001 ccattacttc atcaagaccg tgaatacgag caatgccgtc gcctacttga agtacggtac
    12061 cggtatttac aatctttact tctctattat attgttcaat acgttcacgg ataatattac
    12121 taatttcgtc agctcgaatg gttaccatga ttctttcttt attatttttt gaaagaaaaa
    12181 aataatacct acagtagaaa gactaatcag ttatttcttt cattgttccc aacatgccaa
    12241 tattggacct aatggtacgt aaatgtaact cgttgttcaa acaactattc agagttccta
    12301 gagctcctcg taaggcttgt tggaaaaccc gttgtcggac ttgattaatc gccctttgct
    12361 gttcaaactg aatcgtttcg tttttgtaat tttctaattg ttccaaagtc ttataagttg
    12421 aattaatcaa attcaatttt tctcgttcta tttcagagta tccattcact cgaaactgct
    12481 cggcttcgct ttcgactttc cgtaagcgag aacgagcttt ttcgagttgt tcaatagccc
    12541 ctccacgcag ttcttctgaa tttcgaatag tattcaagat cctctgtttt cgattatcta
    12601 ataaatcact taatgaaagt agattatttt tccattcctt tccaaaattc cataatccct
    12661 tcccgaacca aacatgaatc tttcgattca tttggctctc acgctcaatt acttaaggta
    12721 aattctcata tcttttttta tgaatgtaat gagcctatct tctcttcttt gttcatattc
    12781 caaaaagata tcgaaactaa tgtaatacca aaatattcgg aggactcttc tgacaaaata
    12841 aaaaatatgt aattgtcagc aaagttgttt cttttttttt ttttcaaatc caaaaagctc
    12901 ttcttactta gaataggtcg tcgattcagc attagataaa gggggtaaaa tccccgtttt
    12961 tacaatttac aataagcggt tcaaatcatt ttatcaatat gagtatccta tatcgataaa
    13021 atatttattt tgaaaccacc tctatattaa catagtggta gaaagagtac catgctgcgt
    13081 ctagacttca aacagtttgt tttaaccatg ttaatagttc cacattattg gttaatagag
    13141 aatcaaaatt gatttaccaa tgaatcgcga aatgctatgg ttcttacata taatttctga
    13201 atttattcag aagtaattcg cgagatcatg cacctctctt tcctagttat aacggaaaag
    13261 ggtacagctg ggtggtccag cctattcttg aaataaacaa ctcgcacaca ctccctttcc
    13321 aaaaaaaatc aatacaccaa gcactacact tagatttatt ggatttgttg ctaaaatatc
    13381 ggtattaaac ccgaaactcc cggcagatgg ccagtggccc aaagaaacga aagaatcggt
    13441 tacgtttttc atatgatctc ctcttataga tagactaaaa aatcgaacag agttcttttt
    13501 gtagcacttc gcccctcttt ttatttattc ttttattttt tctgaaattg agtcaaaaaa
    13561 taaaaaatat tcgagttagt tataaattat gaactaacga actagccctt ttattggtta
    13621 ttggaacact aacacttact aaaaagagtt tcccttggtc tatgaacggg aaggatgaaa
    13681 gcgagtcagt atgctaattc ctcatccgca aatcagccct tcccgtaggt tcttttctca
    13741 aagaataaag aattggagga gggaaatctt gatagaattt gaaaaagcaa acgacaagtc
    13801 gaaggcaata aaatatgaaa aatgtattta tttttcatat ttctaagcta agattaaaca
    13861 aaaggattcg caaataaaag tgctaatgct acaaccagtc cataaattgt taaagcttcc
    13921 ataaaagcta gactaagcaa tagcgtacct cgtatttttc cctctgcctc aggctgtctc
    13981 gcgataccct ctacagcttg acccgcagca gtcccttgac caactccggg tccaatagaa
    14041 gcaagcccta cggccaatcc agccgcaata acggaagcgg cagaaatcag tggattcatg
    14101 ataagttcct cgtaccaaaa aaaagaaatg gttaacgata caatcaacca atgagttatg
    14161 acttaattat tccctcgcta ggaatcatcc agtcgaagta actaagaact tcggattgaa
    14221 gtaataagat tattgaatca tcagaactac ttcgatatat cttttttact ttttagccac
    14281 agagtctttg tgaacccata cgactttcgt tcttccattt cttggttcga actgttagtt
    14341 gaattatttc ttgatttcat ccgtttattc attcaattca cagtcacaag gggccggaag
    14401 gacttctagt ctattagaat cccctagagt agtaaaatta tatctttagt tcatttcata
    14461 tataactagc actagtcaat atctaatatc acatatacat gtctttcttc cataacgtaa
    14521 accaagcatt catcttagat tcaatcctat tcgagaatca agcgtcgaaa catctagaag
    14581 ggttggctta tagttattca attacagata cctccctctc ctaaccgacc ctttctaaaa
    14641 tactcaaaaa aatccctttt ttgtaaattc ttttgaacct taccttttct tattattcca
    14701 cctagataaa tctaaatgga caaattgatt aggccgaata attccatatg tatagaaata
    14761 tcattatttg attgatctaa gttcatgcaa tttattaata aaaatgaata atttatttat
    14821 taattattaa tattttggtc aatcgttgaa taaaatcaac tgaaagggaa atcgtttcgc
    14881 cctttttaat ttaatttaat tacacgtcgt aaacctatac aacaagaatt ataattattg
    14941 acaaaaattc ttatattcaa attgttttaa caatgaatta ataatgagat ggactaagca
    15001 atctaaagtg aatattcatt gagacgaagt atgatattaa gtgaaggaaa ggggaatttt
    15061 aggaaaaaga tctttttttt ttagatcttt ttccccttac tctttaatat catcgtaatt
    15121 tttttgctat cactctagat cgtatataaa atagttgtat atttagattc ccctattcta
    15181 ttccctaagt taagtaattc tcttgagcca cccaccatat ttatacattg ctttgggcta
    15241 agctaaataa gactatttca atgatggccc tccatggatt cacctatata agccgcggct
    15301 aaagttgcaa aaataagagc ttgaatacca cttgtaaata atccaaggag catgacaggt
    15361 ataggaacta ctaaaggtac taaagaaaca agaacaacaa ctactaattc atcagctaag
    15421 atattcccga aaagtcgaaa actaagtgat aaaggttttg tgaaatcttc taagatgtta
    15481 atgggtaaaa ggattggggt tggttgaata tattttccga aataacctaa tccttttttt
    15541 gtaagacccg catagaaata tgccactgat gtgagtaaag ccaaagcaac agtagtattt
    15601 atatcattcg tgggtgcggc taactcccca tgaggtaatt gtatgatttt ccaaggtaaa
    15661 agagctcctg accaattaga aacaaaaata aataaaaaca tagtgccaat aaaaggaacc
    15721 cagggcccat attcttcgcc aatttgagtt ttactcacat ctcgaataaa ttcaagaaca
    15781 tattcgaaga aattctgacc cccggtcgga atggtttgtg ggttccgaac agctatagtg
    15841 gctgaaccta ataagatagc aattacaacc caagaagtaa taagtacttg gccatgtact
    15901 tggaaacccc ctatttgcca atagaaatgt tggcctactt ccacaccgga tatatcgtat
    15961 aaccccttta gagtattgat ggaacatgat agaacattca tattgccttg ccctctgaaa
    16021 aaattgaact ttaaacaaaa ttttttgatt caaccatctc tttgtctact tgaatcggat
    16081 attttgaata ccaactaaga tttagaatac taataaatca cataatatcc ccagctattt
    16141 ttatctcttt tttgaaattc agaaatagta agcgattcca taagggattt ctgaagtaag
    16201 ttatttatct tattatgtta ttattaatca aggatttctt atatagctag aacgaccctc
    16261 acaaattgcg aatactaatt tgttaagaat taatcggatt gaggatatgg cgtcatcatt
    16321 cgctggaatt gaaatatctg cgagatcggg gtcacaattt gtatcggtta aacaaattgt
    16381 tggaattcct aaagtaatac actctcgcag ggccgtatat tcttcgtgct gatcaacgat
    16441 gattacaata tcgggtaccc ctgtcatata tttaatcccg cccagatatg tttgcaagcg
    16501 agataattgt cttttcaaca tagcagcatc tcttttcggg agacggttga gtctccctgt
    16561 tttttgttcc attctcaagt ccctgaactt atgaagtctt gtttcggtag tggaccaatt
    16621 cgttaacata ccgccaagcc attttttatt aacataatga caccgggccc ttattgcagc
    16681 ccactctact gaatcagctg ctttattttt ggtaccaaca attaagaatt gttttcccct
    16741 acttgctgcg tcaaaaacta aatcacaagc ttctgataaa aaacgagcag ttctagtaag
    16801 atttgtaata tgaataccct tacgctttgc agaaatataa ggcgccattt taggattcca
    16861 tttcctagta ccatgaccaa aatgaactcc tgcctccatc atctcttcca aatttatgtt
    16921 ccaatatctt cttgtcattt ctctccacac cccccctttt ttttttattc tttttcaaaa
    16981 aaaaaaaaag agacgaggaa ccctgaactg aaataaataa ttgttccgat ggaaccttct
    17041 cttctaccgt agattggacg tagatacacg acccaaacca ttattctttt ctattcatta
    17101 ttctttttat taccaaagca aataaccata ccaaatgcag atagcgaaag agatgaatcc
    17161 gttgttagga atcattaaat cctataaacg attgttcggg tatatcgtgg aaattttttg
    17221 aaagacaaga atcaaataat tttttgtggt ggaacaaaat atctctcatc tccccctcga
    17281 atagattctt tttttttgtt tccaaaggaa tgttgttatg ttgttttgaa gggtgcacta
    17341 atcccttgaa tccggtacca acgggtatca ccccccccaa aacaacgttc tctttcaggc
    17401 ctttcaacca atcgatacga ccccggagag ccgcttttgc taaaactcga gcagtttctt
    17461 gaaaactcgc ttcagatatg aaactttgag tattgagaga tgctcttgtt attcccaata
    17521 agacgactcg gtaacagatc gcttcttcca aagcgcgccc cattcgttct gctcgcaaca
    17581 atccaataag ttctccgggt gaaaaaacat tagacattcc atcttctgaa accaacactt
    17641 ttgatgttat ttgacgtaca ataatttcta gatgcctatt atgaatctgc accccctggg
    17701 agcgataaac ctgttggatt ttattaacca aagagattcg gctttgcgct atagttagct
    17761 cagcaccaat caagaatccc caaggaattc caagaattct tgttatacat ttgttccaac
    17821 cctcaatcct cttttctaga ttcatggata ttgaatcaac cgaacgcact tctaacacct
    17881 gttctacttt tggaagacct tgtgttatat caccagatct cgatttttca tatataaatg
    17941 taactaatgt atctccttcg taaagggttt ccccataatg gccatgaaca gttgctccgg
    18001 gggtggccaa ataaggctta gctgatcgta tcactatcga atcaacttga acaagtataa
    18061 cttgacccga tttgaggggc ggtccatttt tggctataca tacattttca caaataaact
    18121 gtccaagact aattatttta gatgtctctt cacaataatt gtgatggaga aaataccaat
    18181 tcaaattgaa tggatttaaa ataatgttac gacacggatc gggattaaaa atttttccat
    18241 tttcatccat taaataatat ttaaatttaa tcacttgaaa agtctgtttt aaattgtcaa
    18301 gttgcaaata gttagttact aagatctgat tatgagttat taaatggtaa gatgaataaa
    18361 aattctcaat tggaagggat gttcctaaag ggcccaatga attcctaatt ggaattaggg
    18421 gatctttttt aattgatttt tttatcacac tgtgatattt tacatccttg aatggcccca
    18481 ttcgagaaca attggctgct gacaaaatta tcaacgactg acattcctta tttctattta
    18541 acaacgtatg aatagttcct tgaggttggt taatagattg ttgaattttt gccttggaat
    18601 aggaataaat ggaagaaaag gggttgatat tggtacaatc tgatccatta tcagagagca
    18661 atcctgaccc cgacggatca ttcctttttc cgatatacga aataggggat ttcactaagt
    18721 tgattcttag gaaatgtcga atcaaaccat ttgtccttat ttcaacaaaa gaagcacggg
    18781 cttcttcgca agaagaactt tttttgtctt ggttccaatt taatactaaa caagtccgaa
    18841 ctaattgaat acttgtgtca gaaattcctc gaatcggttt gccatttcca taaaggatat
    18901 aattgacaat tcgaagttgc acattatccc tttcctgcaa tggatccggt ggaaaaaggg
    18961 ttgctaaatt tataccgtcc gttatttcat atgtgacgac aggtcgaact aaaacaaaaa
    19021 actttttctt gctaggtgta atccgttgga catagatcca atttttcact tttttggatt
    19081 ccttggaatt tctttttcct gttcctggtg gtatcaaaac gccggtatgt cgggatatct
    19141 tatctgtctc tccaggaaaa tggatatctc cagaaaagat tttaagttca attcgttttt
    19201 tttttctctc cacccggacc aacccaccga ctcggcttct tagatttaag gtgatttgtg
    19261 tatctacccc aacgatacta ttgttccgta ccattatgga agaagatccg ggcaagatat
    19321 gcacctcttc aggaatgaaa aaaaatcgat ctactttcat ttggtatttt ggcctaaatt
    19381 ccttgactcc tcgatactca agcaaatcct cttttttgat gactgaatgc gtttctacag
    19441 tcccatattt aataatgccc gaactctttc ttctgtatcg aggatcatcg aaataagcaa
    19501 gaatactatt tcgacggaaa ataccattta cggggatttc aatcgagata cctgaacagg
    19561 gcattagttc attctcgagt tcttgaatcg agtgtagtgg aatgatgaat ttatttcttc
    19621 gcctttttga caataaatca gaattctcgt ggagaatagg cgaatatacg agattatact
    19681 gaccagcaca tataattcga ttaaggtctg aataatcagg aatcctatct tcttttttac
    19741 cagaaaaatc gaactaaata atttctgcct cgcctgatcg ttggttactc gagaggttag
    19801 aagtatatct tcgcttgcca gaaagaaaat gcgcattcat ttgatcctga tccttgtgga
    19861 tcgaaaggta gactagactg gacctcgagg ccttcctaat aatatccata aatggcttgt
    19921 ttttggtaat agatgaacat taccgtatgt aaattcgggt gcatgataga catcggtact
    19981 ccagtgcatt tctccgtctg aatcagaata aatatgtttt cgaaccttct ctttaaaatt
    20041 caaagtggat attcctgcgc gaatctcagc aattacttgt tctgattcta catattgatc
    20101 gttttgaact aaaagcaaac ttttgggtgg aatattcaca ttatgtagaa tatcttcact
    20161 ctcaatagtt acatacaagt ctatagaaca tagaaaggcg ggatgcccat gacgtgtacg
    20221 tgtcggatga accaaatcct cattgaattt gatttttcca ttagatgggg ctcgcacatg
    20281 ttctgcagta ccccccgtga atatctccgg tatgaaaagt tcttaatgtt aattgagtac
    20341 ccggttctcc aatcgattga cctgcaataa tacctacagc ttccccaatt caaccaggtc
    20401 gccatgagta ggactccggc cataacataa tcgacaaatc caagatgtac tcctacaagt
    20461 aaagggagtt cgaatagaga ttggttgtgc ccgaaaggtt atgaatcgat ttacaagtcc
    20521 aatgccaatg tcttgatttc tagtggcaat acatcgcgga cccatgtata tatcatctgc
    20581 taatacacga ccaattaatg tttggataaa aatcctttcc ggcatcatcc cattccgagg
    20641 actcacagaa ataccccggg cggtgcacaa tccgttcgac gtacaacaat gtgttgaact
    20701 acttcaacaa gtctgcgagt gagatatcca gcgtctgatg ttcgtacagc agtatccaca
    20761 atctttaggg ctcgtagcaa gaaatgatgt attctgttaa agagagtcct tcgcgtaaat
    20821 tgctttgaat aggtaaatca atcatttgtc cttgaggatc tgacattaat cctctcatac
    20881 ctactaattg atgtacctga gatgcatttc ctctagctcc cgagaaagac attatatgaa
    20941 ctggattaaa agggtcagtc atcctaaaat taggattcat ttcttgtcgc aaatattcac
    21001 ttgtagcata ccatatttca atggattgac gtaatttttc taccgcgtgt acattcccat
    21061 aatgatggtg tttttccaaa atcaaacttt gttgttcagc atcttgaact agccatccct
    21121 tagaaggtat tgttaaaaga tcatcaattc ctaatgaaat ggatgtagca gtagcttgtt
    21181 ggaaccccag agtttttact tgatccagga tatgtgatgt atatgccatt ccgaagtgat
    21241 ctattaatct actaataagt cgtttcatgg cagttccgtt tatcgcttta ttgtgaaaga
    21301 ccagattggc ccgttctgcc ataagtacct ccatattccg ctgagtagaa ttcgacaatg
    21361 ggtttgagtc ggtgattgta aaacttcctt ttatcgatct tgattcgcgt ataaattccg
    21421 gaactatgga cctagctgaa ccggagagcc ccgaagtccc acgggtatca tagaattacg
    21481 ttaggtacca gatgaatagg cccgagaaaa cccctgtata gcttcttcga tttctcgata
    21541 aagagcaata tgaccaacag tggttcgaat gtatataaaa aggatttgtt tttttagact
    21601 tcttactatt agatagtgtc cataaatctc ataaaaagta cctaaagatt catagtgaac
    21661 ttcgatggga gtttctcttg aagcaataac gcgttgatct agtcgccacc ggagccacaa
    21721 aggactatct aaattgattc gtttctgccg ataagcccca attgcatcat aggaattaga
    21781 aaaaaacggt tctttcgtat acttatagtg actattgtca cttctttttt gattttgata
    21841 gtttctgcga ttacatggat tatatctatt tacacaaata cctcgatgat ttccgctcgt
    21901 taatacatag agtccaataa gcatatcttg cgttggtacg gaaatgggat ccccaatagc
    21961 cggagacaaa agattcatat gagaaaacat aagtaaacgg gcctctactt gagcctccaa
    22021 ggataaaggt acatgaacag ccatttgatc tccatcaaaa tcagcattga atcccttgca
    22081 aactaatgga tgtaaacaaa tagcgcgccc ctccactaaa acgggctgga atgcctgtat
    22141 gcctaatcta tgcaaagtgg gtgctctatt cagcaataca ggatgtccct gcataacttc
    22201 ttgaaggatt tcccaaacaa tcggctcttt ttctcgaatt ttactcttag caactcctat
    22261 gttcgaagca agatgttgtc taattagacc acgaattaca aatgtctgga aaagttctat
    22321 tgctatttca cgaggcaatc cacatcgatg taatgaaagt gaaggaccca cgacaatgac
    22381 agaacgtcct gaataatcga ctcgtttgcc aagcagagtc tcacgaaatc ttccctcttt
    22441 gccttcaatt acatcagaaa atgacttgta aactttatta tgaccgtccc tcattggttg
    22501 tccccggatt ccattatcaa gaagtgtatc cacggcttct tgtactaatt tctcctgaca
    22561 cattactaat tctcctggcg tagatctact tgttgttaat agatcggtaa gggtattgtt
    22621 ccgatagata actcttctat agagttcatt aatatctgag ctcattagtt tacccccatc
    22681 tatctgaatg atcggtctca actcaggagg aagaactggt aatagacaca aaaccatcca
    22741 ttctggctct atatttgttc gaataaaatg cttagccaat tccacgcgtc taaccaaaaa
    22801 gtcctttctt cttccaactt ttcgatcttc ccattcattc cctgtgtgcc cttcttcccc
    22861 caattcttcc cattctacca acgaattctc tataataatt cgtaaatcta gatcggctaa
    22921 ttgttctcgg atagcacccg cgccagtaga gatttctcga ttgcgaaatg tatcgaaacc
    22981 ctgggtagta aaaaaaagtg ggatgctgta tttccaagat tggatttcat attcgaataa
    23041 acctcgtaat cgtaagaaag tgggcttttt agttatgggc ctagcaaaag aaaaattggg
    23101 ataggattct ataggatctc ccccccttca aaatcggacg tgaaagtttc ctttcatccg
    23161 gctcaagtag gtacaccaaa taaggaaagg agttctcgtt ttcaaactct agaaaatccc
    23221 aaaataaaaa ggtctactcc ttactcaagt tcccagtgaa gacgaaacaa gatttcagtg
    23281 attccgtctt ctattaattc tttattcaaa ttcaattcca acaaataaaa tagaaaattc
    23341 ttgagtagtc tacttccctt tgaatgataa atcccttaac tcttaataat taaaggaata
    23401 ccttggaacc cataagggat ttacttgtct atatattgtt ccattcgatc ttttaggtcc
    23461 cgacttcacc tcgatggtta ggccaccacg cccttaaagt ctatacgcga tagatagact
    23521 cctagaacca tgacatattt gcttacttga acataatttc tttccacgaa aagaaaggaa
    23581 atgtttcatt ctacaaaata aaaagctttt tttacgatgg tacaaataga aattcctctt
    23641 tatttgattt gttacgaaat cgaccataga tcaattccct ttttatttgg gagtattgac
    23701 tacaccccaa ttctgagctt catgttactc tttccaagtg cacatgtcag gtccagggca
    23761 tcccaattgg attgactggg atgacagttt ctccttccga gtctgtaaaa tcagaatttc
    23821 gatcaaatca cacatcgcag tatactaggc cttctaattc tttaagaggt ttatctaaaa
    23881 gattcgcaat ataactagga agacgtttta aataccacac atgggttact gggcatgcga
    23941 gtttgatata gcccatttga taccttcgta tccgagaatc aacaaattcg accccgcatt
    24001 gttcacaaaa tttcgggtct tctttttcat ctccgattac tcgataattt ccacaagcac
    24061 aaattccgct ttttatagga ccaaaattct tcacaaaata atccatcttt tccggtttgt
    24121 tagttttgta atgaaaagta tagggttttg ttacctctcc aactatctct ccattaggca
    24181 ggattttagt ggcccaagca cttatttgtt gaggagaaac tgatccaatt cggagctgtt
    24241 gatgtttata tcgatcgatc atagaagaaa aattattatt cattccgatt aagcttcctt
    24301 cctattaatc tggaagttct tctcagatac aaggaaatga ttcagttcca gagctaaaga
    24361 tcgtagttct cgaacgagca atcgaaaaga ttctggagca tcttcgggat taggtattgt
    24421 tcccccaatg atcgtagtac caagtacttc ctggcgagct ctaatatgat ccgatttata
    24481 agtaagcatc tcttgtaaaa tatgagcaac cccaaaccct tctagagccc aaacctccat
    24541 ttctcctacc cgctgtcccc cctgtttggc tcttcctcta aggggttgtt gtgtaacaag
    24601 cgcataatgt ccactggagc gcccatggat tttatcatca acttgatgaa ttaatttcaa
    24661 gatataaggc tttcctatta taacgggttg ttcaaaagga ttccccgtcc ttccatcaaa
    24721 tattctgctt tttcctggat attcgggttc aaatacccat ggattcgctg tttgcttact
    24781 ggcttcatat aattcagaaa acacaagttt tctcgaagct tcttgttcat atctctcatc
    24841 aaaaggtgct attcgataat gtctgtctag cagactccct gctaacccta gtgaacattc
    24901 aaatatctgt cctacattca ttcgtgaagg tactcctaat gggttaaaga ccatatcaac
    24961 ggatcttcca tcttgtaaat aaggcatatc ttgtctaggc aaaattttgg aaatgatacc
    25021 tttatttccg tgtcttccag ctactttatc gcctactttg atttcacgtt tctgtaaaat
    25081 atatacacga atcgtttcgg gattataact agaaccaccc ctcttctgga tccacctcac
    25141 atcaataacc cgacccctgc cacctatagg tagttttaga caagtttctt ttgaagtaga
    25201 tacctgaata ccaagtatag ctcgtaacaa tctatcttcc ggggcatacg acgattcttt
    25261 cacgacctgg ggtgttaatt tacctactaa aatatcacct gtctctaccc aagatcccag
    25321 catcacaatt ccatttttat ctaaattgcg gagtaaatgg gcttctaaat gcggtatttc
    25381 attagttact ttttcagggc cttggcttgt cacatgagtc tgaatttcat atttccgtat
    25441 gtgaaaagaa gtataaatat cttcatatac caaacgctcg ctaataagta ctgcatcttc
    25501 agaattgtaa ccctcccacg gcatataagc tactaatacg tttttcccca aagcaagttc
    25561 gccaccaacc gtagcagcac catccgctaa aatttgtccc tttttaatgc atttaccccg
    25621 aggaacctgg agtttttgat gcatacaagt atttttattg gaacgttgat atataactaa
    25681 tggaatgctt agaatatctc cattacctgc taaaagaatc ttgtcagtat tggtataaac
    25741 gacccttccc tcgcgttcgg ctatagcaag agcccccgaa tctagagctg cttgtcgttc
    25801 caacccagtt ccaacaatgc atttctcgga gcgagaaaga ggaactgctt gacgttgcat
    25861 attagaactc attaaagctc gattcgcatc attatgttcg ataaaaggaa tgagggaagc
    25921 tccaatagaa aaatattgaa aaggaaaaat acttcgaaga tgaacctgtt cccatgcaat
    25981 agtcaagaat tcttgacggt atcgagctgg aacaacctgt tcttcctgaa tatcctgatt
    26041 taaggctaaa gaatttcctg ccgctaccat atagtattca tctctacctg gtgataaata
    26101 aagcatccgt accccggttg acctctcaga aatttcataa aaagggcttt ctagagatcc
    26161 ccaatgacca atcctcgcat gaattgctaa ggatccaata agtccaacat tgattccttc
    26221 agatgtgtca attgggcaaa tacgtccata gtgactagga tggatatctc gtatccgaaa
    26281 actagcagtg cgccctgtca gtcctccagg gcccaaataa cttaattttc tcccatgaac
    26341 tatttgtgtc aatggattag ttcgatccaa aacttgagat aatgggtgta aaccgaaaaa
    26401 ggattcataa gtagttgtta atggagttga ggttaccaaa ttctgaggtg tcggtatcaa
    26461 tttatgccga attgctccac atatagtccc ccgaaccaca ttttctaaac gaaccagagc
    26521 caatccgaat tgatcttgta aaagatctgc tacagaacga atacgtttat ttttcaaatg
    26581 attcatatcg tcaagtgcac ccattccaaa tttcagccca atcaaatgat cggcggctgc
    26641 caatatatct cgtggtaaca aaaatgtatt gttctggggt atatcaaggt tcagtcttcg
    26701 gttcatattt cgtcgaccaa tccttcctaa ttcacatctt tgttgaaaga atttcttttg
    26761 taattcctta cataaggatt cagaaaatac cggatcgcca cctacacaag caaattgttg
    26821 ataaaactcc aaaatggcat tttcttttga cccaattttt tttctctcct tatcactcag
    26881 aaaagacaaa aaaatttcag gatagcaaac attctctaga atttctctta gattcaaacc
    26941 catagctgat gatagaacta gaatagatat tttttgtttc ctacttacac gagcccatat
    27001 ccttgctttt ctatcaattt ctaattctga tcttcctccc caatctgata ttatggtgcc
    27061 ggtatagacc gaaattccgt tatggtccaa ttctgatcgg taataaatac cgggactttg
    27121 caatatttga ttgatcacaa ttctatatat tccattgact atagaagttc ccagggaatt
    27181 cattagagga atgtttccga taaaaattgt ttgttcttgc atatccctac tgtttttcca
    27241 aattaatccc gcggatacat ataattcaga agaatatgtg agtgattcat acacagcatc
    27301 tctttccttt atcaagggtt cgaccaattg atatgtttcc acaaataatt gaaattcaat
    27361 ttcttgatct gtatcttcaa tttttggaaa cttataaagt tcttccgtca aaccttgatc
    27421 aatgaaccta caaaatcctt caaattgtat ctgattaaat ccaggtattg tagatattcc
    27481 ctcatttcca tccccgagca tttttaattt cccatttatc aaaaaatacc actattggtt
    27541 cattcttcat ctaattagat agattagata aatgatctag caatgatggc atttctattt
    27601 tgtttaccga atcacatgaa attttaccca actccatatc tggaatgtat gaaatacgta
    27661 tgaacggagg aagaaagaga attttctact taaattgaat tggaatttat tggaattttc
    27721 aacagataca aatggaaaga aattgataaa acatccctag aaacagactt ctgctactta
    27781 gacttattaa ttaagttata gaattttgta tagaatatca aaacaaaaat gattccattt
    27841 ctaccattat tatgataata cacattccaa cctgcttgaa taccagaaaa ataaatggat
    27901 tcgacatttg atcttttcgc tgagataaag gcataaaaat aagaaagaat atatagaatt
    27961 agaatcggtt ttttagcatt taaccccctt ttctgttatg gatttcgttg ctaaaaaaat
    28021 gatttgtaga gaagagagag attttgttta cggatttttg aatagaatac gattgtgaag
    28081 tgtataagaa aagaaggttt gtatggctta accacgtgtg gagatatcta taatatccgt
    28141 ctttcttctc ttttattgtt ttattgtcgt tctctgttct attcggggca acccgggttg
    28201 tgctctatga aaacagaatt tcaattttct attcaattca aaattcaaat tgaagtatga
    28261 tacttttctg atatctgata attctctatc ggaacatata taaataatat ataccgtcta
    28321 acaatttctc ttgggggttt acatatactc ataattgttg ttataattaa aattgagaag
    28381 gattttttga ttgaaaaaat ccatactgat tagttatata tcaagttgta ttttcttatg
    28441 tcattaggaa aacaaaattt ggagattcaa atccaagaat cattcatgca ttctaagtca
    28501 atagttaatg gttccgattt tcagaaattt gaattttgga ttttgcgact gaaaatccac
    28561 atttgatttt tcaatagaaa ggtaagagaa agctttgaac attatgaatt tggagatcga
    28621 aattgaaagg atgaatcaaa cccaatcaaa agggaagaag gattaggatt tctttgactt
    28681 ttaggaaaaa ttaaggaaaa cagaactcaa ggtgcaagta caataaaaaa gcagttcagt
    28741 aatcctggaa agttttcatc tattttgtat ttgtagcatt ttggcgacat ggccgagtgg
    28801 taaggcagag gactgcaaat ccttttttcc ccagttcaaa tccgggtgtc gcctgatcaa
    28861 caaaaaactc gaaatctctt cttttcttct gttctgttga tataacccgc cgaatgattc
    28921 cccagcagaa gcagagaaag cagactgttg atacttgttt gattctaaac atctggtctg
    28981 ggggtttttc taaaaaattg taaatatctt tgcattgcat atttaggctt caaggaaata
    29041 ttcgaatgct agaggggcta tcaagacttc gcaattacct tctactacaa atcaaaattt
    29101 tctattatta atgcattgta taatgactgg accttgaatt agattggaga gcccgatagg
    29161 aaatctaaat agttgtggaa gggggcggaa gatactttat tatatacgag gaactcacga
    29221 aaatctctga gtgctcaagc atccaatcaa ttgaaatgag ggtcaacaaa aaaagaatag
    29281 gacctattat tcctacatgt tccattagta acattccctt gagatgttac tgcagatttt
    29341 gcttgtgttt aatctttccc gattagaaat cctataggaa tttcttataa aatgagcgaa
    29401 tttattggat tggtttatta atagtcttcg ttctttttga ctctgcgcca ttgattccac
    29461 tattattagt gaggaataac ggaacaattc ctttatattt atagagatag gggacataat
    29521 tcatatggat atagtaagtc ttgcttgggc tgctttaatg gtagtcttta ctttttccct
    29581 ttcactcgta gtgtggggaa gaagtggact ctaggggtcc tactaattga gttaaggaag
    29641 caaactgtat caatatcaat tgctttcgag atcgttctgc aacacgtttt gaacaaaatc
    29701 aaaatatctt cattttgaaa ttccattgga ctcgactgga gtaatgtatt ataggaatca
    29761 tcctctttca atcaaagagc tatttcaacg attcccatgt ttgtagttcg aaaggaagag
    29821 gatcccagga aatttattcg aacctaattc ttccgaaatt ttctattcca atcaacggcc
    29881 tcttacaggt gatactgagg agggccggac ccttttttta tttctttctc tctttactgt
    29941 tcaaagaaga ggtagttttg ttaagtgtat acgcactttg tatgagaaag aaaggatata
    30001 aacatagtgg ttgtctaacg agatactatg cagaataaga tcttcagatg agtcacatat
    30061 tgcgcattta ccgctttcga atttttgaaa ttggatttat gctttatcga cttatttcat
    30121 atcatggttc aggcgttaaa aatcggtgag gtttactctt ccttttcgat gcccgtggaa
    30181 ctactgtcaa tggtttactc aattacttct tgggaatgtt aaaaaaaaga ttactacgtg
    30241 attttttgaa tctgcctata tctatcgctt ttccttcatt gatttgattc tttcaataga
    30301 taccgagatt cagattggaa atcaaaaatc tagtaattca aactataaga cataagagta
    30361 atttagattg atcagaacaa atagatatag caaataaatg gaattggatg ctatgtcaat
    30421 cccatatatg gaattgatat tcacatatat caagataata ttgtagattg atctatagat
    30481 ccatatcaaa agatccatat caaatgcagc ctctatcttt attttattcc agggggcagc
    30541 tttataacta caatctaact aataaatagt atggtagaaa gaaatagatg aatctttctt
    30601 tctaccatac tatctatcta ttagaatact gccgattcta gtccatacat tttcatttaa
    30661 gacatgaaat tagaatcttt ttcattttat ttcgtcaatt ttggctaaga actcagaagt
    30721 caagtttcat tcaaattagt taataattaa tcgttttgac tgactgtttt tacgtaaatg
    30781 ataagtagaa aagcggtagg aactagaata aatagtgcag tagcaataaa tgcaagaata
    30841 tttacttcca taatctcatc ggttttttac ttcgcaataa ctcgggattt aatcccatag
    30901 agatgataaa tctttggcct gtaaattcaa tgaatgaata ttacctctcg atgatcttga
    30961 atcggatcaa tatcatgaat aacaatatct gaactatcaa atcaattcgt cgtcgagaat
    31021 tgaatagtat aacataggaa gttcttttat ccataccgcc ccaaacttgg attcctgacc
    31081 caatccaaaa ttcctttatt tatttatcat tatcattttt tctcatctgt tctttttttc
    31141 tctctaatct atctagttcc ttcttgtaca atcatctgat gaagtctcat caaatagctc
    31201 ttccacttcc agtggtcaca catagttaca aacccaaaca aacaataaaa gctaaatgga
    31261 aaaagaaagg agtttagaac taaactattt ttgacttgga agacaaagaa gtgtgataaa
    31321 gatgagaccg tataaaatga atattcatca aattgactat tttccgattt gttctttcgt
    31381 cgatgggggc cttaaaacaa aatgaaaaat cggaaaaatg attcattccc ctttctaaga
    31441 ggagtaggat ctttcctttc ccctcctttc ttcgtagatt attagccccg ggacacctat
    31501 accaaaagct cagtgtgcaa tttgcatgaa atctattttt caacttcaaa ctagtaagtg
    31561 aggttccata aatccgtagc cagaaaaata aattgttttt ttttttgttt tttctgggaa
    31621 agtattttct tatattaaat tttgtattgg acaagaaagg aattcccctt gtgtatgcgc
    31681 gcctcaaaaa ggtatagtac tcgattccat tacatgcatc gggggcaatc gaaaaagcca
    31741 gcatttcttg gaatactgac tataatgcta ccaataatcg tactaatcca accgcatatg
    31801 tctttctcct accaaaagga aagaaaaaag aaataaggat ttcccctttg ctttgacaat
    31861 gaaattctgc ccccggtccc cttcataaaa agggagagat ttattgatat atttattgga
    31921 tccatcggga ctgacggggc tcgaacccgc agcttccgcc ttgacagggc ggtgctctga
    31981 ccaattgaac tacaatccca gggaaatacg ggatctagca gaaaatttga ttctttttta
    32041 tctccggatc gggtatttct gaagtacgaa gggggttata tcatctcatg gcggattggc
    32101 gaatttttgg gccgagctgg atttgaacca gcgtagacat attgccaacg aatttacagt
    32161 ccgtccccat taaccgctcg ggcatcgacc caagaagaat caattttaga cttattggta
    32221 atccatgatc aacttccttt cgtagtaccc tacccccagg ggaattcgaa tccccgctgc
    32281 ctccttgaaa gagagatgtc ctaaaccact agacgatggg ggcctgcttg accaaccgcc
    32341 atcatactat gatcatagta tgatcagttt tttgaaattg tcaatataat cgaatgattc
    32401 tatccgaggg atctttcccc ctttcagaat tgcatagaat ttttttattc gtcattgatg
    32461 aattattcat tagaatcgcc attagaaatc tagtagtagt attttttttt ttttggaatt
    32521 atttcaattg aatttctttc gattatttta gtttagatta tttagtattt agaattttct
    32581 ttttttatta taaataaaaa aaaaattaat aaatacaaaa aatagaaata ataaggaaga
    32641 gtaggatttt tgcagggaat gattggtccg tcagaaaagg aaaaaggtgt gaaattctat
    32701 ttctttcact ttcatttgat tcattgttaa gacgagatat ccttatctcc ctcccaccaa
    32761 gacaggaaat taacaaacga gaaatctagt aagcgggatc aagaagaaaa ttcttttttc
    32821 tccaagaatt tagttcagga gacaagtaga atctcttcat tccatgattc gatgaaatat
    32881 cttgaatttt atgttgaatt gctaggtgta tgtacatgta tcaatcaagt gaattttgtt
    32941 ctggtgggat caattcaata aaagaaaaaa agcaattcga gtcggtcttg aaacaattca
    33001 ttgcattttc tcctagactt cctaggtaaa tccattttat tattcaacaa tgagccacta
    33061 gacactatgt atctactgca tgtacttatg catatatact tatgtttata atatatgtac
    33121 ctatagatat tttatccaca tagtgaataa ttccggaatt aaatcaaaaa ggccctttta
    33181 actcagtggt agagtaacgc catggtaagg cgtaagtcat cggttcaaat ccgataaggg
    33241 gctttgtaaa actccaatct agtattcata tttgagggga gaattgtatt tttatttgta
    33301 ataaaaaaag taactaactg gataatacat tatcattata cttaattatt atacttagtt
    33361 ataaagttga acatttgttt agtcaatttt cattattatg aatttctgaa taatgaaaag
    33421 tcacttcttg aactcaccga atattcctat tttccattat accaaccaaa tccattcgaa
    33481 aggttagaaa tcaacaaaag aaaaagtaag tggacctgac ctattgaatc atgactatat
    33541 ccgctattct gatattaaaa ttcgatagag atgaaattgg agcagttgat ttttttttaa
    33601 tttcattttt ttgttttgga ttccacaaga atttgtcgat atttccgatt aaatcttctt
    33661 gttactagat tttctatagg aaaaattata ggaataaatt gttattcctt tcctctacag
    33721 agaaaccttt cttccaagtc acaccataag agccatttat tatctttctt tgattccaga
    33781 tcaaagatta atttcatcat taatttctat ctagattata tatctatatt atattaagta
    33841 gattgtagat ttcgatgtat atctatcaga tcgtggcttc atgtaccaaa tatttcaata
    33901 tcgttgcatc cggtattttt gttttgttcc aacagtgtga tgaagaatag atccgagaaa
    33961 gagactttca ttttcagtct cttatttatt ttatttttat tgaattttcg attttctaaa
    34021 aggaaaatct aaaaggaaaa atagtagatt atctcttttt ctaacagata aaagaatcta
    34081 aaaataaata ttcgatcgaa ctgtcttttt tccttcgatc cgtggaaaga tatactctgg
    34141 ggttttagat ttatttatat gaagtatgaa ggaaagggat cgcttggtcc ttgaagagtt
    34201 ctttcaaaac aaaggattga ttgaattgtc ttattaggac aattaatggt tcatatgctt
    34261 agtcagaagg aataatccaa tggagttcat ggatttacct aggtcagttt atgggctaat
    34321 caataaagca tttttatctt cgaaacccat tggaaagggc agtgcaagag aaatcataca
    34381 aaaatgatcg aatcttcgga cgccccgaaa aagatatgag gtgctcggaa atggtcgaag
    34441 tagttgaata ggaggatcac tatgactata gcccttggta agtttaccaa agacgaaaat
    34501 gatttatttg atattatgga tgactggtta cggagggacc gtttcgtttt tgtaggctgg
    34561 tccggtctat tgctctttcc ttgtgcctat ttcgctgtag ggggttggtt cacaggtaca
    34621 acctttgtaa cttcatggta tacccatgga ttggccagtt cttatttgga aggctgcaat
    34681 ttcttaactg ccgcggtttc tactcctgct aatagtttag cacattcgtt gttgttacta
    34741 tggggtcctg aagcacaagg agattttact cgttggtgtc aattgggggg tctgtggact
    34801 tttgttgctc tccatggagc ttttggccta ataggtttca tgttacgtca attcgagctt
    34861 gctcgatctg ttcaattgag accttataat gcaatcgcat tctctggtcc aattgctgtt
    34921 tttgtttctg tatttctgat ttatccactg ggtcagtctg gttggttctt tgcacctagt
    34981 tttggtgtag cagctatatt tcgattcatc ctcttttttc aagggtttca taattggacg
    35041 ttgaacccat ttcatatgat gggagttgcc ggtgtattgg gcgctgcttt gctatgcgcc
    35101 attcatggtg ctaccgtaga aaatacttta tttgaagacg gtgatggtgc aaatacattc
    35161 cgtgctttta acccaactca agccgaagaa acttattcaa tggtcaccgc taaccgcttt
    35221 tggtcccaaa tctttggggt tgctttttcc aataaacgtt ggttacattt ctttatgtta
    35281 tttgtaccag taaccggttt atggatgagt gctcttggag tagtcggtct agccctgaac
    35341 ctacgtgcct atgacttcgt ttctcaggaa attcgcgcag cggaagatcc tgaatttgag
    35401 actttctaca ccaaaaatat tctcttaaac gaaggtattc gcgcttggat ggcggctcaa
    35461 gatcagcctc atgaaaacct tatattccct gaggaggttc taccacgtgg aaacgctctt
    35521 taatggaact ttagccttag ctggtcgtga ccaagaaacc actggtttcg cttggtgggc
    35581 cgggaatgcc cgacttatca atttatccgg taaactacta ggggctcatg tagcccatgc
    35641 tggattaatc gtattctggg ccggagcaat gaacctattt gaagtggccc atttcgtacc
    35701 agagaagcct atgtatgaac aaggattaat tttacttccc cacctagcta ctctaggttg
    35761 gggggtaggc cctgggggag aagttataga cacctttcca tactttgtat ctggagtact
    35821 tcatttaatt tcttctgcag tattgggctt tggcggcatt tatcatgcac ttctgggacc
    35881 tgagacactt gaagaatctt ttcccttctt tggttatgtc tggaaagatc gaaataaaat
    35941 gaccacaatt ttaggtattc acttaatctt gttaggtcta ggtgcttttc ttctagtatt
    36001 caaggctctt tattttgggg gcgtatatga tacctgggct ccgggagggg gagatgtaag
    36061 aaaaattacc aacttgaccc ttagcccgag tatcatattt ggttatttac taaaatcccc
    36121 ttttggaggg gaaggatgga ttgttagtgt ggacgattta gaagatataa tcggaggaca
    36181 tgtatggtta ggttccattt gtatacttgg tggaatctgg catatcttaa ccaaaccctt
    36241 cgcatgggct cgacgcgcac ttgtatggtc tggagaggct tacttatctt atagtttagg
    36301 ggctttatcc gtctttggtt tcattgcttg ttgttttgtc tggttcaata ataccgctta
    36361 tcctagtgaa ttttacggac ctactggacc agaagcttct caagctcaag catttacttt
    36421 tctagttaga gaccaacgtc ttggggctaa cgtgggatcc gctcaaggac ctactggttt
    36481 aggtaaatat ctaatgcgtt ccccgactgg agaagtcatt tttggaggag aaactatgcg
    36541 tttttgggat ctgcgtgctc catggttaga gcctctaagg ggtccaaatg ggttagactt
    36601 gagtaggttg aaaaaagaca tacaaccttg gcaggaacgg cgttccgcag aatatatgac
    36661 tcatgctcct ttaggttctt taaattccgt gggtggtgta gctaccgaga tcaatgcagt
    36721 caattatgtc tctcctagaa gttggttagc tacctctcat tttgttctag gattcttctt
    36781 cttcgtaggt catttgtggc acgcgggaag ggctcgtgca gctgcagcag gatttgaaaa
    36841 aggaattgat cgtgactttg aacctgttct ttccatgacc cctcttaatt gagatgagac
    36901 aggagatcca atgcttgaat gaagtaaaaa tcactttgat tcaatcatac atcttggaat
    36961 cagcctaagt attccttttt tgtattcctt ttttcttttt ttttttcaat tcattttatc
    37021 taatttattt ttctggcttg gctaggtggg atagccgagc cattcccttt tctttcggat
    37081 agcaggttgg gcaaaaccac taaagaaaaa aatctattca attagcaaaa aaggagagag
    37141 agggattcga accctcgata gttctttgtt aaaactatac cggttttcaa gaccggggct
    37201 atcaaccgct cagccatctc tccgaaagac tatttttatt ttattcctcc gaatagaaca
    37261 tggccatagg ggtggatacc cccactatct gtactatctg taaaaagatc tcaggtgcga
    37321 atccaccggt cgatctatct atccgtatat agatatatga tctagcatgc ccatttgtga
    37381 aataaaaaat aaaattccat ttccccccac tccatgtacg aataaagtgc gaaaggggga
    37441 gtagtaataa gtcatataga atcaatggat tcatgataaa gtaaaatccc tcgatgacat
    37501 attttatcac aattaatatt ttttggctga tagagggatc aaatggtata tagttcattt
    37561 gttggtagct tggaggatta aaagcatgac tcttgctttc caattggctg tttttgcatt
    37621 aattgctact tcattaatct tattgattag cgtacccgtt gtatttgctt ctcctgatgg
    37681 ctggtcaagt aacaaaaatg ttgtattttc tggtacatcc ttatggattg gattagtctt
    37741 tctggtgggt atccttaatt ctctcatctc ttgaacctat tcgtcgcaga cccaaaacca
    37801 aaatgacccc cctaattttt ctcggttgtg agacacatta aattggaatc taagtcccca
    37861 aagaaaacgc aaatcaaata aagaaaacaa aaaaattaga ggggggtcaa acttcttgaa
    37921 taaaaagaat acaattaaaa aaataattgg aatcgttccg aagagaatat gtgtcccggc
    37981 actgcacaaa aaagatccgg ttatatatca tatatgtggg tacatattgt gtatcaagaa
    38041 caaaaaaatg cggatatggt cgaatggtaa aatttctctt tgccaaggag aagatgcggg
    38101 ttcgattccc gctatccgcc caagatccaa gataaagtaa ttttattact atttatttat
    38161 tatttaattt cataaatagc attaaatata tccttaaatt aaggatttgg tatagttggc
    38221 cgtgatagtg tagtgattct atccctcccc tacgttttct ttttccttcc acccccaaaa
    38281 agcgaaaggc gggaattaat tactagttaa cagagtcaac cctaaaatag tttggcaaaa
    38341 caagatgttg cggagacagg atttgaaccc gtgacctcaa ggttatgagc cttgcgagct
    38401 accaaactgc tctaccccgc gccgaagata agaactgaaa actaatagat aaacaaggat
    38461 taaatgcgcc cctccaccct atctgtacaa atagaatagc ccatttatac agaatggtaa
    38521 aggggcttct atgatcatcg accatagaaa tagaaatgaa gcgttaatcc ttaccaactt
    38581 gatcttgttg ctcctggcaa caaacatgca tgaaccattt cacgaagtat gtgtccggat
    38641 agtccaaagt ctcgatagtt agctctcggc cttccggtca aaaaacaacg tcgatgaagg
    38701 cgtgtaggtg cactattccg tggtggggat tgtaactttc cataaatttc ccatttgtca
    38761 ctcaacgacg gaaccttgct tatttctttc tttgaggatc gacgaatcga atgatatttc
    38821 tgttccaatt tttgcctctt cttctccctc tgaatcaaac ttttccttgc cataatggtt
    38881 gaattcctat tagtatccat gatacaagtc gaatcctaga tgtagaaata gaagaaggtg
    38941 gaccccctct ccgtcgaaag aaatgagatt atcgcagata cacacattaa aaatattaac
    39001 caaatttgcc cgacgtagag gcaatcaaga aagccgcata agtgaatata taacctacag
    39061 aaaagtgagc taatccaacc aatcttgctt gtacaatgga aagggccact ggtttatctc
    39121 tccagcgaat caaattggcc aaaggtgtgc gttcatgagc ccatgctaaa gtttcaatca
    39181 attcctgcca atatccacgc caagaaatta agaacataaa tccagtagcc caaacaagat
    39241 gtccaaataa gaacatccat gcccaaaccg ataaactatt cataccaaaa ggattatatc
    39301 cgttgataag ttgtgaagag tttaaccata aataatccct taaccagccc atcaaataag
    39361 tggaagattc attaaactgt gaaacgttac cctgccataa tgtgatgtgc ttccaatgcc
    39421 aataaaaagt aacccatcca atagtattta acatccaaaa aactgccaaa taaaacgcgt
    39481 cccatgccga aatatcacaa gtaccgcctc gtcctgggcc atcgcacgga aaactataac
    39541 cgaaatcctt tttatctggc attaacttgg aaccacgtgc atctaaagca ccttttacta
    39601 agatcaatgt agttgtatgt aaaccaagag caatagcatg atgaaccaaa aagtctccag
    39661 gacctattgt taaaaataat gaattactat tttcattaac agcatttaac caacccggca
    39721 accagatgct tcgacccgca ttgaatgctg gaccactcgt tgaagataaa agtacatcga
    39781 acccatatga agttttacca tgagcggatt gtatccattg agcaaatata ggttcaatca
    39841 agatttgctt ctccggagtg ccaaaggcaa gcatgacatc attatgaaca taaagtccca
    39901 gggtatggaa tcccagaaag aggctggccc aacttaaatg agatatgata gcttctttat
    39961 gctctaacat tcttgccaat acattatctt cattttgctc cggattgtaa tctctaatga
    40021 aaaatatagc tccatgagca aaagctcctg tcatgatgaa tcctgcgata tattggtggt
    40081 gggtatataa tgcagcttga gtagtaaagt cttgtgctat gaatgcataa gcaggtaaag
    40141 agtacatgtg ttgagctacc aaagaagtaa taacccctaa agaagctaga gcaaggccta
    40201 attgaaaatg aagcgaatta ttgattgtgt cataaagacc cttatgtcca cgccccaatc
    40261 gtcccccggg gggaatatgt gcatctaaaa ggtctttcat actgtgccca atcccgaaat
    40321 tggttctata catatgacca gcaacgagaa aaataaatgc aatagctaaa tggtgatggg
    40381 caatatcagt cagccataaa ctttgcgttt gtggatggaa tcccccgaga agagttagaa
    40441 tggcagttcc cgccccttgg gcggtaccaa ataaatgact acttgaatcg gggttttgag
    40501 cataaagatt ccattgacct gtaaaaagtg ggcctaaccc ttggggatgc ggtaatacat
    40561 ctaagaaatt attccaccga acgtactccc ctctggatgc aggaatagca acatgaacta
    40621 aatgccctgt ccaagccaag gaacttacgc caaagagtcc tgacaaatga tgattcagac
    40681 gagattcggc atttttgaac caggaaacgc tcggtttcca tttcggttgt aggtgtaacc
    40741 aacctgctat taaggatatg gcagaaagaa ataatagaaa aagagcacca gtataaagat
    40801 cttcattagt gcgtaaaccg attgtatacc accactgata aacaccagaa taagcgatat
    40861 tcactgggcc aagagcaccc cctcgagtaa aagcttccac ggccggttga ccaaaatgag
    40921 gatcccaaat tgcatgagca ataggtctta catgtaaagg gtcctgtacc cacgactcaa
    40981 aatttccttg ccaagctaca tgaaacagat ttccggaagt ccacagaaaa attattgcta
    41041 attgaccaaa gtgagaagca aaaatattct gataaagacg ttcctcagta atatcatcat
    41101 gactctcgaa gtcatgtgcg gtagcaatac caaaccaaat acgacgagta gtggggtcct
    41161 gagctaagcc ttggctaaac cttggaaatc gtaatgccat aatgcttttc aaatcctcct
    41221 agccattatc ctactgcaat aattcttgct aagaagaatg cccatgttgt ggcaattcca
    41281 cccagaaggt aatgggttac tcctacagca cgtccttgta taatgctcaa ggctctcggc
    41341 tgagtagcag gagcaacttt taatttatta tgagcccaaa cgatggattc aataagttct
    41401 tgccaataac cacgtccact gaatagaaac attaaactaa aagcccagac aaaatgagca
    41461 cctaggaaaa aaaggccata tgcagataat gaagaaccat aagactgaat tacctgggat
    41521 gcctgtgccc ataagaaatc gcggagccac ccattaatag taatagaact ttgcgcaaag
    41581 tttcctcccg tgatatgagt tactacccct tgatcactta cactgcccca aacatctgac
    41641 tgcattttcc aactgaaatg gaatattact accgaaattg cattgtacat ccagaatagt
    41701 cctaagaaga catgatccca ggccgatact tgacatgtac cccctcttcc aggtccatca
    41761 caaggaaaac gaaaaccaag gtttgcttta tccggtgtca aacgggaact gcgagcaaat
    41821 agaacacctt tcaagagtat cagtgccgtc acatgaatcg taaatgcatg aatgtgatgt
    41881 accaagaaat ccgcggttcc taatggaata ggcaacaaag ccaccttgcc acccactgcc
    41941 actaaatcac caccccccca agttaaactg gtacttgctg ttgcaccagg agccgttgca
    42001 ccaggtgcta aagcatgggt gttttgtatc cattgagcaa aaacgggttg taattgtata
    42061 gcggtatctg aaaacatatc ttgaggacgc cctaaagcgc tcatggtatc attatgaata
    42121 tacaaaccaa aactgtgaaa gcctagaaat atacatgccc agttgagatg ggatatgatt
    42181 gcatcacgat gtctaaggac acgatctaat agatcgttgt accgagtagt tggatcataa
    42241 tctcttacca taaaaatggc tgcatgcgcg gcagcaccaa ctatgagaaa tccaccaatc
    42301 cacatgtgat gtgtgaacaa tgacagttgt gtaccatagt cagtagctag atacggataa
    42361 gggggcatgg aatacatatg gtgagctaca acaatggtta aagagcctaa catagctaag
    42421 ttaagagata attgagcatg ccatgacgtt gttaggatct catataggcc tttatggccc
    42481 tgacctgtaa atggaccttt atgagcttct aaaatatctt ttagtccatg accaataccc
    42541 cagttggtcc tatacatgtg acccgctatc aggaaaagaa ttgcaatagc taaatggtga
    42601 tgggcaatat cagtcagcca cagaccccca gttactggat ctaatcctcc acgaaaagta
    42661 agaaagtccg catattttga ccaattcaag gtgaaaaatg gggttgctcc ctcggcaaaa
    42721 ctgggataaa gttgagccaa aagatctcga ttcaagataa attcatgagg aagtggtatc
    42781 tctttaggat ctactccagc gtttagaaat tggttaatcg gtaaagatac atgtacttga
    42841 tgccccgccc aagagagaga cccaagtcct agtagccctg ccaaatggtg attcagcata
    42901 gattctacat cttgaaacca agccaatttt ggcgccgctt tatgataatg aaaccaacca
    42961 gcaaaaagca ttaacgctgc aaagaccaat gccccaattg ctgtacaata gagttgtaat
    43021 tcactagtta ttccagatgc tcgccaaatc tgaaaaaaac cagaggttat ttgtattcct
    43081 cggaaacccc cgcctacgtc accatttaat atttcttggc ccactattgg ccaaaccacc
    43141 tgggcactag gcccaatgtg agttggatca cttagccacg cttcataatt agaaaaacga
    43201 gcaccgtgga aatacatgcc gctcagccaa agaaagatga tggagagttg accgaaatgt
    43261 gcactaaata cttttcgaga gatctcctcc aaatcactgg tatggctatc gaaatcgtga
    43321 gcatcagcat gtaggttcca gatccaagtg gtagtatcag gccctttagc tattgttctt
    43381 gagaaatgac ccggtctggc ccattcctcg aacgaagttt ttacgggatc cctatctacc
    43441 aaaattttaa cttctggttc cggcgaacga ataatcattg agtcctcctc tttccggaca
    43501 acacatacaa agagacccgc caacagtcaa ataattagtg aaccttagag atagagagat
    43561 atttctataa ttagttcgtt tctcttctat ttttctatct cccatctatc tattttcttt
    43621 agttatttac tagagcaatt atgatctgga agtcgatccg gggcaagtgt tcggatctat
    43681 tatgacatag ccttgaggcg ctcaacggac cttttaacct tctaaaaacc tttttgggct
    43741 ttggattgat ccaaaaacga cttttttgtg caacctagtg tatattcata gaagttatta
    43801 gatggagctc tttaattttt tacctagaag attttaatta ctctattcca aatcacgcga
    43861 gtagccatta gacattacta agagacatcc ccgctatata tatttagtga ttcgagggtt
    43921 tattttatta gttttaataa taagaatttt gtttaattta atataataaa caaagtctat
    43981 tttgtactct atctgtgtat ccttttttat tcctaaaaaa tagcagatga aatagaaggc
    44041 ttagaaggga gataatgaaa ttatgtgatt gggtcttcca aaagcaaagg aataatccgt
    44101 tttttagtta actgatctga tgggtccaac aaacaataaa ttataacaaa tatctaaatt
    44161 ctaaataaaa aaatcaaaaa taatagacta agattctaaa taaaggataa taaataaacg
    44221 ggatcttctt ttattcgaaa cgtctcgtga tcttcaacca attatgcgct tcaatataat
    44281 taccgggagt aagcgctata gcctgtttcc aatactcagc ggcttgatcg aaccaagcct
    44341 ctgcaatttc agaatctccc tgttgaatgg cctgttctcc ccggccggaa taggtagttc
    44401 aattccttcc cttagaaccg tacttgagaa tttcttacct catacggctc agcagtcaat
    44461 tcttttggtg tcccattttg atctatacca tatctaataa aatctaatga gatttctcat
    44521 ggatctatcc cagttttagg gttaaccaaa agccaaatag gttaattaca tgagtttcaa
    44581 actgaaattt ggatgaataa tccgtttatt tagttttatc ttttttccca ccttcagaag
    44641 aataaagcat aggcatttct actagtgtta gaattttatg aaaggtaact atctcggttt
    44701 catagataaa tttatataga atctttgaaa aagactttct ttcataagaa agaaaatact
    44761 tactatcttt gggatctgat cctacaccgc tgctcaagac tttagtggat cgactctatt
    44821 acataagtta attcctaatt tttatttcac atcatgagat aagtatttct tccatcatga
    44881 cataagtacg cagttattat tgtatcggcc caaaacctcg ctaattgatc tttacggtgc
    44941 ttcctctatc tctatcaatt aaagccttat atccatagaa aaaagttgct aggcattttt
    45001 attttttcct attttgactt ctatgaagtt tctttctttg ctacagctga taaaaatcgt
    45061 tgttttagac gatgcatatg tagaaagcct atttggttct actagttact ttactagatt
    45121 tttctttttt tttttttttt tctttctata gtggagatag tcgcacgtaa tgacagatca
    45181 cggccatatt attaaaagct tgtggtaaga atgggtttcg ttctagtgct cgaaaataat
    45241 attccaaagc tttcgtatgt tctccattac ttgtgtggat aagccctata ttatagagta
    45301 tataacttcg atcataggga tcaatttcta gtcgcatagc ttcataataa ttctgcaaag
    45361 cttccgcgta atttccttcg gattgagccg acatccgtta cggtcgtcat tcaattgaaa
    45421 gaatctccgt tccagaaccg tacgtgagat tttcacctca tacggctcct cccttatgtg
    45481 cataatgaga ataatacata gaatcaaaaa agattcaacg atgaaaatat tctcattatg
    45541 aactcagcag ggctagtgtt tttacaagaa atctctagcc aaccttcctg caagagattc
    45601 tttcttaaca tcaagcctat tgggactaga tagaaatgat aagataactc caacaatttc
    45661 tttgttttta acgcctccta atttccagga attagtcact tcaatagcct tcgatggtta
    45721 tacgggtatc caaaggacga acgagatgga tgtttgttgt cccaaccatt cttttagtcc
    45781 caagcccgct aaggaaaggg ctgacttaga acaaagtttt cgtgttgttg attcctaggt
    45841 gtagtgcttc ttcccctctg ctgcctatta gcgctagtag agtaggattg acccgtaata
    45901 cagaacctct aggcgtaacc tttcgcttaa tactagaatc gagaatcgaa acatagcatc
    45961 tgaggttgca ttaatcgagg atacacgaca gaaggaattg ttctatttcc aaacttcacc
    46021 ttcaaaaagc gtagattttt tcaaaaattt tctcgaatca cgtgtttttc tcctcgtaag
    46081 actgagagaa atgactaaat atgaaataaa aaaaaaaaaa gaatcaaatc gcaccatctc
    46141 tgtaataggt aaatgcctct ttttctcctg aagttgtcgg aattactcgt aataagatat
    46201 tggctacaat tgaaaaggtc ttatcaataa aatttccatt tatccgtgat ctaggcatag
    46261 gtagcaatcc attctagaat tcttctcatt acctctcatg ggaaaaagat cccacaaaga
    46321 aaagaattgt atagtacgaa ataacataaa aacttctttt ttttttaaga aaaaaacaaa
    46381 agatatgaat cctctattcc aattgttcct ttttgacagg aatcgataag aaataagaaa
    46441 tatttcaagg cgattcgatt tcatactaat gtagtagtat aggaactatt ccgatttcgg
    46501 tgaagttaca aattcgaaga actcgagaaa ttttgattga atcatgatac aaattacaaa
    46561 gaagaaaaaa gaccgaataa tcattctatg atgaaaatag aataactgcc aattttgtgt
    46621 acataacggg tatacactat acaatcaaat ctaaattttt tttatgaatt tctattctaa
    46681 tagaggggta ggtgtttgtt gttgagaact ccaaaaccga aaagtaattt gaaaattttt
    46741 ctggtatgga atcatagtct atataattag aattatgatt taagagtatc cattaactat
    46801 agtctaaaag atatagacca tcaatcagtt gattcgttct aattcattga attaatccgt
    46861 tataaaatat cagaaaaaga aaaagaaggg aacgttgttt tgcaaacatg aatcgaattt
    46921 tttttttcac aatttttacg caaaattgta tctttatccc ggagcctcga aggaaagaaa
    46981 aatcgttctt tgctttgact ttgatgaaaa attttcagtt aaaatggatt gatcatacct
    47041 atccaataat ggaatatgga ttatgactga ctcgctattc actcggtttt tgggtcataa
    47101 tcgttatgta ggagagatgg ccgagtggtt gaaggcgtag cattggaact gctatgtagg
    47161 cttttgttta ccgagggttc gaatccctct ctttccgtac cttcgcttaa ttcaccaatt
    47221 ttactaacaa caagggctca aatagcaatg gataccatta ttccaacagc tagacccttc
    47281 tttgatctaa agatatagat tctcaattcc taattgctgt gacgcgtaaa atagaatact
    47341 aaaaaataat aataatcaaa atactggaaa gaaaagagta gacaaggaat gaaaatagat
    47401 ccttggtcta tgatacaaaa atgggggaaa tccagatcaa actcggattt atcttactta
    47461 accttaggtt aatttacttc gcctaaaggg aagaaaattt tccgaaccct cggtttcagt
    47521 ctgaggttta agtctgacga gaataatatt ctacgactag caattcattt attttcaaac
    47581 cgacccattt actatctatt atttgattga ctaatccttt atattggaat gggtgaaggg
    47641 tcaaatggtt tggtaattcc tcatgagggg atgaatcgag agaaatttga atcagagctc
    47701 tggatttttg ttcatccttt gccgtaataa tatctcgggg tttgcagcga taactcggta
    47761 tatctactat acgaccatta actaaaatat gtcgatggtt aactaattga cgggctgcgg
    47821 gaatagttga agccataccc aatcgaaaaa ggatgttatc caaacgcatt tcaagtaatt
    47881 gtagtaaaac ttgacctgtt gaccccttgg cttttctggc gatacgaacg tatttaagta
    47941 attgtcgttc tgtaagacca taatgaaaac gcaatttttg tttttcttct agacgaatac
    48001 gatattgaga ttttttcccg gaacgcgatt ggtttctaag atcacttccg ttcctaggct
    48061 ttttattagt tagtcctggt aaagccccca ggcggcgtat ttttttgaaa cgaggtcctc
    48121 ggtaacgcga cataaagact ccttattctt atttcttatt tagtatttcg aattaattct
    48181 tatttctatt tattttattt tttattgaat tttattttac agaataaacc taaactaaaa
    48241 ctaaactgaa tctaaatgaa gcgaagttta ctgaaatagt gtacttgtac tattactata
    48301 aagaaaagaa gaatgggatg aattggataa atatacagac ccccttctat tatatatata
    48361 atcctttccc gacataattg gaagttccta taataaattg atagcttttg gaaaaggaag
    48421 aaggcgctat ttcaatattc tttgatttca aaggaacatt atcaatcatc taaaaaatgg
    48481 aataaaaaaa aaagaatagg gaaaagccgg ctatcggaat cgaaccgatg accatcgcat
    48541 tacaaatgcg atgctctaac ctctgagcta agcgggccca cataacagaa atcttatatg
    48601 catagtaatt gactaaacta ttggaattgg aatcttagtt attaactatt caatattata
    48661 ttgaatattc tagaacataa ggattaatat agcgatatag aatttcgatt tatcacaatt
    48721 ctaataacaa ttctaatact aatattatta aatagtgatt gtaaatattg ttaatattct
    48781 ttttttttca ttttccattt gaatggtaaa tgttcttttt catttctttt tttgtcattt
    48841 gaaatccttt tgatttttta ttacagttct atattttatt ctatatcata tatatctctc
    48901 attctatatt tatttcaaat tctaattgtt taatggaatg gttagttata actaatgaga
    48961 cattcctccg ctttcaggcg aaagtgaaga taaaaaaaaa gaatcgaccg ttcaagtatt
    49021 ccaaattgaa tggcaaaatg gcaggaagag agacatatag atggggtata tatccatcta
    49081 tattgaattg cggattccga aatgataaaa tcatttttga ttggacaaaa aaaggtctcc
    49141 tatagaagat agttaagaaa atcaaagagg agaaaacacg ttttcgagat aggaatcggt
    49201 atctaatgaa ttcaatggtt ccagtataaa tgaaagaaaa agaaaaagga atgacatcac
    49261 aacgagatcc taatctcaaa aagaaagggg gatatggcga aatcggtaga cgctacggac
    49321 ttaattggat tgagccttgg tatggaaact tactaagtga tcactttcaa attcagagaa
    49381 accctggaat taacaaaaat gggcaatcct gagccaaatc ctgttttccg aaaacaaaca
    49441 aaggttcaga aaaaaaggat aggtgcagag actcaatgga agctattcta acaaatggag
    49501 ttaaatgcgt tggtagagga atctttacat cgaaacttca gaaagaaaaa gaatgaagtg
    49561 aaggataaac gtatatacat acgtattgaa tactatatca aaatcaaatg attaatgatg
    49621 acccgaatct gtattttttc tataaaaaat agaagaattg gtgtgaatcg attctacatt
    49681 gaagaaagaa tcgaatattc attgatcaaa ccattcactc catagtctga tagatctttt
    49741 gaagaactga ttaatcggac gagaataaag atagagtccc gttctacatg tcaataccgg
    49801 caacaatgaa atttatcgta agaggaaaat ccgtcgactt taaaaatcgt gagggttcaa
    49861 gtccctctat ccccaaaaag actatttcac tccccaacta tttatccgac cccctttcct
    49921 tagcggttcc aaattcctta tctttctcat tcactctatt cttttagaaa tggatttgag
    49981 cgtaaatggc tttctcttat cacaagtctt gtgatatata tgatacacat agaaatgaac
    50041 gtctttgagc aaggaatccc tagttgaatg attccctatc aatatcatta ctcatactga
    50101 aacttacaaa gtcatctttt tgaagatcga agaaattccc cggctttgag aaaattttta
    50161 atctactttt gtccttgtaa ttgacataga ccccagttct ctaataaaat gaggatacta
    50221 cattgggaat agccgggata gctcagttgg tagagcagag gactgaaaat cctcgtgtca
    50281 ccagttcaaa tctggttcct ggcacatgat taatttgtat gggtctctct tccctcgaat
    50341 taatttctaa ttaattgata tgaatcaaca tacatattct tttagagtct agattagaat
    50401 aatagcttta tccagtttgg cgagatatac cccatctatg ttctagatgg gtagagtttc
    50461 ttagataaag tatctaaaag aattggattc tatctcctct tttttttctc ctctcgttca
    50521 accgaatttg aatacgtaat acatattcga aaggttcaat tggttaattg ttgaaaggct
    50581 caaaagtcga atccgaatct aggggggttg aaatagacaa gattcagctc agatccaaag
    50641 aaatagaatc cgatattctc tcatttcttt gtcttttctt tcatattcga tttcttcatt
    50701 ccggatttct ccattccttc ctatatgcct ttctagaacc catctaagta atgtgcgcag
    50761 tacaaagttc atgatgcaga actcatttgg ttcatcctat tggtgtgacc catccgaaat
    50821 aagtatcttc caaataaatg tgagaattcc aatgaatccc taattgtctt tttttgttag
    50881 cctatcgata attccctaaa ttagacctgc ttaatctaga acagaacgtg caatccttga
    50941 atatctgaaa ttgtctaagt ggaaatagct ttcttatcat tcaatgagca tcttgtattt
    51001 cataaaaatt gggggcaata taatccttac gtaagggcca tcctatccaa ctttcaggca
    51061 ttaagatacg tttcaagcgt ggatgattat cataagagat tcccaacata tcatatgatt
    51121 ctcgttcttg aaaatccaca cttttccaaa cccagaaaac agacggaatt ctaggattcc
    51181 tcctggaggc aaatactttt atgcatacct cctctggttg atccacacca tcctctattc
    51241 tcgtaagatg atacacacta gctaacagcc cgccaggcgc tacatcatag gcacattgag
    51301 agcggagata gttgtaccca tatacataaa aaatgacagc aatggaatgc caatcctcgg
    51361 gctttatttg taaagtctct attccttggt aatcaaagcc caaagatcta tgaattagcc
    51421 catgcttgac tagccaagca gacaaacgac cctgcatctt ttttatctct cccgcatttt
    51481 tatttatata agtatttcac atttacgatg aaatttctga aaattgaccc accacttttt
    51541 attctggaca aaggaatcct gtctaattca ctaattcggg ggaagatact gaatttttgt
    51601 atttgaaaaa gatttccgta gggatctctg aagtagatgg gggttgataa agaactcttt
    51661 gatcataatt tcccgtatga atactgtgtt gaacatgaaa cttgtgattg gtagtaaaac
    51721 accgattcgc tcgttgagac ctaattcgat cttcatagag ttctcgagat attttcttac
    51781 gaagttttgt tatagcatct ataaccgctt ccggtttagg tgggcaacct ggcaaatata
    51841 catctacagg aattagctta tcgactcccc gaacagtact ataagaatcg gtactgaaca
    51901 tcccgcctgt aattgtacag gctcccatag caataacata ttttggttca ggcatttgct
    51961 catataatct cactaaagag ggggccattt tcattgttac tgttccggct gttaaaatta
    52021 gatccgcttg tctaggactc gatcttggta ctagtccata acgatcaaag tcgaagcgtg
    52081 agcctattag tgaagcaaat tcaatgaagc aacaactggt accatagaga agcggccata
    52141 aactagagag tcttgaccaa tttgaaagat catttaatgt agttgaaata actgaatttt
    52201 gggttgttcg atcaagtaaa ggaaactgaa tggaattcat aactgtctca atcttatttt
    52261 ttccgttttt ctttttattg tctgaatatt caggagctaa gaccattcca atgccccctt
    52321 tcgccatgca taaactaaac caataattaa gataagcacg aaaatgaaag cttctataaa
    52381 tacagataca cccaatacgt cgaaactcat tgcccatgga taaagaaaaa ccgtttcaac
    52441 atcaaaaaca acaaaaacta gagcaaacat ataataacgg attcgaaatt gtaaccaagc
    52501 atcgcccatt ggttctatac ccgactcata agtagaaagt ttctccggcc ctttgctaat
    52561 cggggctaac actccggaaa ttaaaaatgc caaaatagga acaaggatag atattattag
    52621 aaatgcccaa aaaaaatcat attcgtaaag cagaaacata aacgcactcc tatgaacgtg
    52681 gaaaatatac cggattcgat tggtcgattc gaattggaat tgtcaagtca tccataacta
    52741 tttagtcaaa acaagaattc attttgatcg aaccgtctag tttgctttgt ttattggttt
    52801 attgtagggc atatctcatt gcaagattca tcgactggaa tccgatttta tttccattat
    52861 acttatttcc attttattta gttagtagaa ccttctaact atatattact cttatacaaa
    52921 ttctcttgtt tctcttgttt tcatccagga ttttctctaa agacggggaa ttctaaatta
    52981 attacttatc ttatttcttc tttaattaga aattctttaa agatttctat ttttttctat
    53041 aaatagaatc aggaggtctt ttttcttatt ttttcttagt gatttagaat agaacaagta
    53101 atcaaataga agagaatgta taggaatttc catctcaaga tttagaagat cttgtgttgg
    53161 tatattcctt attattatta tttaataata gtattagggt tcgaatccag gtgacggggt
    53221 ttttcttggt tgaatacaga aaaagaggac tggccttttt cgtgttgtgc ttcgctaggt
    53281 cgaggtaagt aaggtatacg aaggaaaagc ctatttgaca atgaaagtga ccaaaggtat
    53341 tcgtttttca aaaaacttta gcttgtacac aaatacagca ggcccttcct aaatccatgt
    53401 gaattcctct tcgtagtttt tcatttcacc aggcccgtga aatgatttga cttccacaac
    53461 tcaataagat tggggatatc aaaagaaagg gagtctcact aattctttta ttgtggatat
    53521 gaatatgtaa ttcgcctccg aagattaatg acgaaaggtt ggtttcttta tccgcaattg
    53581 aaaaaatcaa tatcgattgg atccgttgat atgcattttt tctttcatct gcttaaacga
    53641 ttgccgtgag taaacttata ggaataattg gatttcactt agttacaagc aagaaataat
    53701 aatgaagaaa tgaaaattat agaatttttt ggattttgca tttttatagg gctatacgga
    53761 ctcgaaccgt agaccttctc ggtaaaacag gtcaaactta ttattattaa aatgatctga
    53821 actgtttcaa agacccaaca tgcatttttt ttgcattggg ctctttcatt aactgatata
    53881 aatatcagtt agtctgccat tttttttctt gacagaaaaa aagataagga aatggctcca
    53941 tgtgctctga ttcattattt gggagcatta ccaaagtgtt tcaaaggtgg gattatcttg
    54001 acgtaggtct gtctctggcc tagatcaacc taagttaaat gaagtctcta tcgttctgct
    54061 gaaaaaatca aatatgaaac ttcatacacc ttaaagttca tatgacgaaa agagattttt
    54121 ttgaggtcct tatactcatt atgcctagca ttgaatagac tgggtattca ccttatcaag
    54181 atctcaaatc aatgatgggg tctgtttggc acctcctaaa tgggcgtcca aattggaccg
    54241 aactctttgt caggctatgg ttccctcaaa gttatggagt aagacatcga tttctcaaca
    54301 agatcaattt ttctgattgt atgatgaact cccttgaaaa acattggcgc gcgtgtaaac
    54361 gagttgctct accaactgag ctatagccct tagtgcttgt gatacatatt ttatcatgta
    54421 gataaattct tgtcaagata aatattccat gatccaacat caacaatctt tgatctcttt
    54481 gagcggtatt ccttagatta gtattgctta ttaagtaata tgatatttat aatccatcga
    54541 caggatgggt ttcatttggt tctctttggg atgataaatg acctacttaa ctcagtggtt
    54601 agagtactgc tttcatacgg cgggagtcat tggttcaaat ccaatagtag gtaaaactta
    54661 ttagatacca gagtcaatgg tatctaataa ggtttacgac ccacccttag tgatattgat
    54721 tttttgattt tgtatctttt ctatttcatt tttgaatttg aatttttgca tcagaattgg
    54781 attctgtttg attgtatttg attgtattca cccgacagaa tctaaatagg attagaaaga
    54841 gaacttcttt ttattattcg aacgtaccaa ctagttatga aatcggattg atagcctcca
    54901 cccgtgttct agctcgtcgg agagctagat ttgcctcaat tttttgtctc cttccttcag
    54961 cctttttcac attagcttcc gctagttcaa gagtttgctg agcttcttgt ggatcaatgt
    55021 cactaccctt ctccgcatca tttactaaaa cagtgatctc attattgcct attctagcaa
    55081 aaccacccat cagagccatc gttaaccatt ggtcgttaag acgtattctc aaaatcccta
    55141 tatctacagc tgtggcaata ggggcgtgat ttggtaatat gccaatttga ccgctattag
    55201 tagataaaac aatttcttcc acttctgaat cccaaacaat tcgattaggg gtcagtacac
    55261 taagatttaa ggtcatttct tcaaattgct ctccatttct aagttcatag ccttcgcggt
    55321 agcttcatcg atattaccta ccaaataaaa ggcctgttca ggaagaccat ctaattctcc
    55381 ggaaaggatc aattgaaatc ctcgaattgt ttctgctaga ccaacatatt tacctggaga
    55441 accggtaaat acttctgcta cgaaaaaggg ttgtgataag aaacgctcaa tttttcgcgc
    55501 tcttgctacg agtaaacgat cctcttcgga taattcgtcc aatccaagga tagctataat
    55561 gtcctgaagt tctttgtaac gttgtaaagt ttgcttaact ctttgggcgg tttcgtaatg
    55621 ttcctcacca acgatccgag gttgaagcat ggttgacgtt gaatctaaag gatctactgc
    55681 tggataaata cctttggcag ccaatcctct tgatagtacg gtagtagcat ctaaatgtgc
    55741 aaatgtcgta gcaggagcag ggtcggtcaa atcgtctgcg ggtacataaa ctgcttgaat
    55801 agaggttatg gacccttctt tggtagaagt aattctttct tgtaaagaac ccatttcggt
    55861 actcagggtg ggttgataac ccacagcgga aggcattcta cccaataagg ccgatacttc
    55921 ggatcctgct tggacgaaac ggaagatatt gtcaataaaa agaagtacgt cttgctcatt
    55981 aacatctcgg aaatattccg ccatagttag ggcagtcaaa ccaactctca tacgagctcc
    56041 cggcggttca ttcatctgac cataaactag ggctactttt gattctgcaa tattttcttc
    56101 attaattact ccagattctt tcatttccat gtaaagatca tttccttccc gagtacgttc
    56161 acccactccg ccaaatacgg atacgccccc gtgagcttta gcaatattgt taatcaattc
    56221 cataataagt actgttttac ccactccagc tcccccgaat agtccgattt ttcctccacg
    56281 gcgataaggg gctaaaagat ctactacttc aattcctgtt tcaaaaatag ataattttgt
    56341 atccaactgt ataaaggcgg gcgcagatct atgaatagga gacgttgtac tagtatctac
    56401 aggccctaaa ttatcaacag gttctccgag cacgttaaaa attcgtccca gagtcgctcc
    56461 cccgaccgga acacttatag gagctcctgt gtcaatcact tccattcctc tcgttagacc
    56521 ctctgtagca ctcatagcta tagccctaac tcgattattt cctaataatt gctgtacctc
    56581 acaagccaca ttaattggtt gaccaacact atctcgacct tgaactacca gagcgttata
    56641 aatattcggc atcttgcccg ggggaaaggc tacatctagt accggaccga tgatttggac
    56701 gacacgcccc gggttttttt tttcaagcgt ggaaacccca gaaccagaag tagtaggatt
    56761 gattctcata ataataaaat aaataaatat gtcgaaatgt ttttgcaaaa attatcgaat
    56821 tcaaaataaa tgtccgctag cacgtcgatc ggttaattca ataaaatggg aattagcact
    56881 cgatttcgtt ggcaccatgc aattgaaccg attcaattgt ttacttattc actgagactg
    56941 agtgaatttg caagcccacc caacctattt taattttaaa atctcaagtg gatgaatcag
    57001 aatcttgaga aagtctttca tttgtctatc attatagaca atcccatcca tattatctat
    57061 tctatggaat tcgaacctga actttatttt ctatttctat tacgattcat tatttgtatc
    57121 taattggctc ctcttcttat ttatttttga tttcaatttc agcatatcga tttatgccta
    57181 gcctattctt ttctttgtgt ttttctttct tttttatacc tttcatagat tcatagagga
    57241 attccgtata ttttcacatc taggatttac atatacaaca tataccactg tcaaggggga
    57301 agttcttatt atttaggtta gtcaggtatt tccatttcaa aaaaaaaaaa agtaaaaaag
    57361 aaaaattggg ttgcgctata tatatgaaag agtatacaat aatgatgtat ttggcaaatc
    57421 aaataccatg gtctaataat caaacattct gattagttga taatattagt attagttgga
    57481 aattttgtga aagattccta tgaaaagttt cattaacacg gaattcgtgt cgagtagacc
    57541 ttgttgttgt gagaattctt aattcatgag ttgtagggag ggatttatgt caccacaaac
    57601 agagactaaa gcaagtgttg gattcaaagc tggtgttaaa gagtacaaat tgacttatta
    57661 tactcctgag taccaaacca aggatactga tatattggca gcattccgag taactcctca
    57721 acctggagtt ccacctgaag aagcaggggc cgcggtagct gccgaatctt ctactggtac
    57781 atggacaact gtatggaccg atggacttac cagccttgat cgttacaaag ggcgatgcta
    57841 ccgcatcgag cgtgttgttg gagaaaaaga tcaatatatt gcttatgtag cttacccttt
    57901 agaccttttt gaagaaggtt ctgttaccaa catgtttact tccattgtag gtaacgtatt
    57961 tgggttcaaa gccctgcgcg ctctacgtct ggaagatctg cgaatccctc ctgcttatgt
    58021 taaaactttc caaggtccgc ctcatgggat ccaagttgaa agagataaat tgaacaagta
    58081 tggtcgtccc ctgttgggat gtactattaa acctaaattg gggttatctg ctaaaaacta
    58141 cggtagagcc gtttatgaat gtcttcgcgg tggacttgat tttactaaag atgatgagaa
    58201 cgtgaactca caaccattta tgcgttggag agatcgtttc ttattttgtg ccgaagcact
    58261 ttataaagca caggctgaaa caggtgaaat caaagggcat tacttgaatg ctactgcagg
    58321 tacatgcgaa gaaatgatca aaagagctgt atttgctaga gaattgggcg ttccgatcgt
    58381 aatgcatgac tacttaacgg ggggattcac cgcaaatact agcttggctc attattgccg
    58441 agataatggt ctacttcttc acatccaccg tgcaatgcat gcggttattg atagacagaa
    58501 gaatcatggt atccacttcc gggtattagc aaaagcgtta cgtatgtctg gtggagatca
    58561 tattcactct ggtaccgtag taggtaaact tgaaggtgaa agagacataa ctttgggctt
    58621 tgttgattta ctgcgtgatg attttgttga acaagatcga agtcgcggta tttatttcac
    58681 tcaagattgg gtctctttac caggtgttct acccgtggct tcaggaggta ttcacgtttg
    58741 gcatatgcct gctctgaccg agatctttgg ggatgattcc gtactacagt tcggtggagg
    58801 aactttagga catccttggg gtaatgcgcc aggtgccgta gctaatcgag tagctctaga
    58861 agcatgtgta aaagctcgta atgaaggacg tgatcttgct caggaaggta atgaaattat
    58921 tcgcgaggct tgcaaatgga gcccggaact agctgctgct tgtgaagtat ggaaagagat
    58981 cgtatttaat tttgcagcag tggacgtttt ggataagtaa aaacagtaga cattagcaga
    59041 taaattagca ggaaataaag aaggataagg agaaagaact caagtaatta tccttcgttc
    59101 tcttaattga attgcaatta aactcggccc aatcttttac taaaaggatt gagccgaata
    59161 caacaaagat tctattgcat atattttgac taagtatata cttacctaga tatacaagat
    59221 ttgaaataca aaatctagaa aactaaatca aaatctaaga ctcaaatctt tctattgttg
    59281 tcttggatcc acaattaatc ctacggatcc ttaggattgg tatattcttt tctatcctgt
    59341 agtttgtagt ttccctgaat caagccaagt atcacacctc tttctaccca tcctgtatat
    59401 tgtccccttt gttccgtgtt gaaatagaac cttaatttat tacttatttt tttattaaat
    59461 tttagatttg ttagtgatta gatattagta ttagacgaga ttttacgaaa caattatttt
    59521 tttatttctt tataggagag gacaaatctc ttttttcgat gcgaatttga cacgacatag
    59581 gagaagccgc cctttattaa aaattatatt attttaaata atataaaggg ggttccaaca
    59641 tattaatata tagtgaagtg ttcccccaga ttcagaactt tttttcaata ctcacaatcc
    59701 ttattagtta ataatcctag tgattggatt tctatgctta gtctgatagg aaataagata
    59761 ttcaaataaa taattttata gcgaatgact attcatctat tgtattttca tgcaaatagg
    59821 gggcaagaaa actctatgga aagatggtgg tttaattcga tgttgtttaa gaaggagttc
    59881 gaacgcaggt gtgggctaaa taaatcaatg ggcagtcttg gtcctattga aaataccaat
    59941 gaagatccaa atcgaaaagt gaaaaacatt catagttgga ggaatcgtga caattctagt
    60001 tgcagtaatg ttgattattt attcggcgtt aaagacattc ggaatttcat ctctgatgac
    60061 acttttttag ttagtgatag gaatggagac agttattcca tctattttga tattgaaaat
    60121 catatttttg agattgacaa cgatcattct tttctgagtg aactagaaag ttctttttat
    60181 agttatcgaa actcgaatta tcggaataat ggatttaggg gcgaagatcc ctactataat
    60241 tcttacatgt atgatactca atatagttgg aataatcaca ttaatagttg cattgatagt
    60301 tatcttcagt ctcaaatctg tatagatact tccattataa gtggtagtga gaattacggt
    60361 gacagttaca tttatagggc cgtttgtggt ggtgaaagtc gaaatagtag tgaaaacgag
    60421 ggttccagta gacgaactcg cacgaagggc agtgatttaa ctataagaga aagttctaat
    60481 gatctcgagg taactcaaaa atacaggcat ttgtgggttc aatgcgaaaa ttgttatgga
    60541 ttaaattata agaaattttt gaaatcaaaa atgaatattt gtgaacaatg tggatatcat
    60601 ttgaaaatga gtagttcaga tagaattgaa cttttgatcg atccgggtac ttgggatcct
    60661 atggatgaag acatggtctc tctagatccc attgaatttc attcggagga ggagccttat
    60721 aaagatcgta ttgattctta tcaaagaaag acaggattaa ccgaggctgt tcaaacaggc
    60781 ataggccaac taaacggcat tcccgtagca attggggtta tggattttca gtttatgggg
    60841 ggtagtatgg gatccgtagt cggagagaaa atcacccgtt tgattgaata cgctgccaat
    60901 caaattttac cccttattat agtgtgtgct tctggggggg cgcgcatgca ggaaggaagt
    60961 ttgagcttga tgcaaatggc taaaatatcg tctgctttat atgattatca attaaataaa
    61021 aagttatttt atgtatcaat ccttacatct ccgacaactg gtggagtgac agctagtttt
    61081 ggtatgttgg gggatatcat tattgccgaa cccaacgcct acattgcatt tgcaggtaaa
    61141 agagtaattg aacaaacatt gaataaaaca gtacccgaag gttcacaagc agctgaatac
    61201 ttattccaga agggtttatt cgacctaatt gtaccacgta atcttttaaa aagcgttctg
    61261 agtgagttat ttaagctcca cgcctttttt cctttgaatc aaaagtcaag caaaatcaag
    61321 tagagcacta agttcaatta ttttatttgt gtttgtagca aaaaagtagt tagtttgtcg
    61381 gaatcaaagt aaataagata ataatggcgc tttctttggt gatagaagat ctaattgtag
    61441 aaagaatcaa aactaaagtt gaggataact ctttttttga cctatattcc tgattacgaa
    61501 tcaagaagcc tttatcaaca agagtgagtt cttcctttcg tgaaattagg aaaataaaac
    61561 gaatttcttc ttcttgtctt aggtatataa tttgaaattc aaatatagat aatagagttt
    61621 tgtatctttc tctatctccc gaaaaaccat tttagctaaa aattcatgtt gggtcggatt
    61681 cgaacgaatc tttcgataat ctgtaagaaa ctctttatct atttttagaa aattagaaga
    61741 caagaacaaa agacaaagaa atgaagaaaa ataataaagt ttattatgat acatatcttt
    61801 ctcatgtagg ggatgaataa gtccatttat ttagttctac agttctacat tctttgcact
    61861 tattatacct actcagttag atttagatat atagatactt agatctatac taagaatttc
    61921 aaattcttca aattctatta ataataaata ttatctaatt tctaattagt aattagaatt
    61981 caaattctta atttaattat aattattaca agatatcttt atttatataa taacataata
    62041 acagatacaa atagtaaatc gaggtacccc ttctatgaca aatttgaacc ttccatctat
    62101 ttttgtgccg ttagtaggcc tagtctttcc ggcaattgca atggcttctt tatttcttca
    62161 tgttcaaaaa aataagattg tttagatccg ctgggaccca atctcatcca tttttttttt
    62221 gaaaacgtgg acttgtatca taacacagat atctatttat tggaatatag tataacatgt
    62281 gatttccacc gaacataaag gaaaaaactc ttatgcccgc agaaatatga tatatggata
    62341 tatcaattct aacaattttc aaatagatca ggatcgctgg atggctgaaa tgtagtcggt
    62401 gaatctctat gtatatcgat atgtatagtg ggatcgtatt aaataaagag tatgttatta
    62461 ttttagattt aaccaatttg atgaattact cctaaaggtt gacatcaaac tagtgctagt
    62521 tcacctcaaa ctagtgctag ttgatgagag ttacttcgga aacaaaaaag taaagtcaaa
    62581 tttctctggg gtattatctc aattccaata aaatgcaatc gggtaaagta tgacttggcg
    62641 atcagaacat atatggatag aacttataac ggggtctcga aaaataagta atttctgctg
    62701 ggcctttatc ctttttttag gttcattagg cttcttatta gttggaactt ccagttatct
    62761 tggtagaaat ttgatatctt tttttccgcc tcagcaaatc attttttttc cacaaggact
    62821 cgtgatgtct ttctacggaa ttgcgggtct ctttattagc tcttatttgt ggtgcacaat
    62881 ttcctggaat gtaggtagtg gttatgatcg attcgataga aaggaaggaa tagtctgtat
    62941 ttttcgttgg ggatttccgg gaaaaaatcg tcgcatattc ctccgattcc ttataaaaga
    63001 tattcagtcc gttagaatag aagttaaaga gggtatttct gctcgtcgtg ttctttatat
    63061 ggacatccga ggccaggggt ccattccctt gactcgtact gatgagaatt tgactccacg
    63121 agaaattgaa caaaaggctg ctgaattagc ctatttcttg cgtgtaccaa ttgaagtatt
    63181 ttgagaaatt gagatatcag tatcaggaaa caatattctg aatttcttca ttcgaagtga
    63241 attcttagct tttttctgga ttctttctag attcaaagac taaccacaaa atcacaaaga
    63301 aaatagattc attagtccga taccttgtat aaaactcatg tgtgtaagaa atattcgatc
    63361 gcatagagtg tacgaatggg ttgattaaca attcacagat gaaaaaatgg caaaaaagaa
    63421 agcattcact cctcttttct atcttgcatc tatagtattt ttgccctggt ggatttcttt
    63481 ctcagttaat aaatgtctgg aatcttgggt taccaattgg tggaatactg ggcaatccga
    63541 aatttttttg aataatattc aagaaaagag tcttctagaa aaattcatag aattagagga
    63601 actcctcttc ttggacgaaa tgatcaagga atactcggaa acacatctcg aagagtttgg
    63661 gataggaatc cataaagaaa cgatccaatt aatcaagata caaaatgaga atcgtatcca
    63721 tacgattttg cacttctcga caaatatcat ctgttttatt attctaagcg ggtattcaat
    63781 tttgggtaat gaaaaacttg ttattcttaa ctcttgggct caggaattcc tatataactt
    63841 aagtgacaca gtaaaagctt tttctattct tttattaact gatttatgta tcggattcca
    63901 ttcaccccac ggttgggaat taatgattgg ctctatctat aaagattttg gatttgttca
    63961 taatgatcaa atcatatctg gtcttgtttc cacctttcca gtcattctcg atacaatttt
    64021 taaatattgg attttccgtt atttaaatcg tctgtctccg tcacttgtag ttatttatca
    64081 ttcaatgaat gactgataaa ggatccattg atattaatct aatccaatta gaatgcttgg
    64141 tactttgtag ttgtacataa gcaaagtatt gaaaatcata tttactcttt ctatttctaa
    64201 ccatcgggga gattcatcct atattattcc tagattattc cagcaaatag cagaatcgtg
    64261 gctagggaac tatactagcg acctacccaa tttattgtag aaattttcgc gatcaatgat
    64321 tggaccatgc aaactagaaa tgctttttct tggctaaaga aacagattac tcgatctatt
    64381 tccgtatcgc tcatgatata tatcttaact cggacatcca tttcaagtgc atatcccatt
    64441 tttgcacagc agggttatga aaatccacga gaagcgactg ggcgtattgt atgtgccaat
    64501 tgccatttag ctaataagcc cgtggagatt gaggttccac aagcggtact tcctgatact
    64561 gtatttgaag cagttgttcg aattccttat gatatgcaac tgaaacaggt tcttgctaat
    64621 ggtaaaaggg gggggttgaa cgtgggggct gttcttattt taccggaggg gtttgaatta
    64681 gctcctcccg atcgtatttc tcccgagatg aaagaaaaga ttggcaattt gtcttttcag
    64741 agctatcgcc ccaataaaaa aaatattctt gtgataggcc ctgtccctgg tcaaaaatat
    64801 agtgaaataa ccttccctat tctttccccg gaccctgcta ctaagaagga tgttcacttc
    64861 ttaaaatatc ctatatacgt aggcgggaac aggggaaggg gtcagattta tcccgacggc
    64921 agcaagagta acaatactgt ttataatgct acagcagcag gtatagtaag caaaatcata
    64981 cgaaaagaaa agggtgggta tgagataacc ataacggatg cgtcggatgg acgtcaagtg
    65041 gttgatatta tccctcccgg accagaactt cttgtttccg agggcgaatc tatcaaattt
    65101 gatcaaccat taacgagtaa tcctaatgta ggcggatttg gtcagggaga tgcagaaata
    65161 gtacttcaag atccattacg tgtccaagga cttttgttct tcttggcatc tgttattttg
    65221 gcacaaatct ttttggttct taaaaagaaa cagttcgaga aggttcaatt ggccgaaatg
    65281 aatttctaga ttcgcagatt tgtcgacatc aagttcgtaa aaagaaccaa attcttgttg
    65341 gcgattattt atgatcaaaa aaatgaaatt ctgaaaactc ctttgtctta tttatactct
    65401 tcttcaaaat ctacatacta tgtggtacaa gggattccca gcatctcgta gaaaaagagt
    65461 atgtaatgta gaatttgaag aagagtattt gactttcatt atttttattt cgttttttaa
    65521 aattggagta gtgtgactat gttactattg acagatttca atgccataag acgtatcaat
    65581 agttttctat tctaaataga aagaaagtca aatttgtcta aatactagac ataaggaagc
    65641 aggggataaa tgcggggaac aaaaaattct aggagggatt atttgtcttc ctagtcttcg
    65701 acacaagaaa ggggtgtaga aaaatccttt tttcttgtgt cgaaacgaaa gagtaatgat
    65761 tcttgatcct gtttgttaaa aattcctagt cttggtttcg atttttccag atgtatcaga
    65821 aaccctttac cttaccccca ccccctttac gtataatata ctaagtggtg gacaaacaaa
    65881 acaaaaaaag agaggaaatt ttattaatta aataaaactt cttcaatcaa cttatcttat
    65941 acaaaatttg atgatgaaat atgaaaacaa taaaaaataa atagagtaat gtaatagaga
    66001 gagtaaggtt ctacattaga ttagtataga aaggatttgc acgatatcta atatattata
    66061 gcagccaaga aattgagtga ttccttcttt cttccaactt tgaaagtacc gatagatact
    66121 atcatagaaa aagaagaggt ggtccgaata gtgaattttt caaaaacatg atcagaaaaa
    66181 tgagaaaaat ggagtttttg aaaagaaaaa gaaatccatt ttatcattta gacgaaaaaa
    66241 atattatgat tcttaagaac tcaacgggcc cttccccttc gaatcaaaca aacaaagaag
    66301 ggaattccgt tgagttctta cgctttcatg ttgacgactc aattcattcg attactagag
    66361 ggatgaaccc aatccggaat atgaaccata aaagaaaata cctattaaac cgattacaag
    66421 aataccagct acagtaccta ttatccaaag aggaatcctt ccagtagtat cggccattta
    66481 ccccacttcc ctccagattt catcaagtgg tcatgctaga gacataaaca gtcatggata
    66541 attaaattat gagatccttc cgaatgagct aagagaatct tattgattct ctttcgtttt
    66601 cttaattgaa gaaataattg gaaaataaaa cagcaagtac aaaaatgagt aataaccccc
    66661 agtagagact ggtacgattc aattcaacat tttgttcgtt cgggtttgat tgtgtcgtag
    66721 ctctataatt cggattaagt ttatcgttgg atgaactgca ttgctgatat tgatcccaaa
    66781 aaaaagacgg taggtacagc taggccgtga acagccaacc atcgtactgt aaaaattgga
    66841 taggttcgat ctatagtcat tagggcctcc taaaacgatc tactaaattc atcgagttgt
    66901 tccaaaggat caaaacggcc agttattaat ggaattcctt gtcggctctc tgtaaaatac
    66961 tcgtttggcc gagggcttcc aaacacatcg taagctaaac cggtgctgac aaataaccaa
    67021 cccgcaatga atagggaagg tatagtaatg ctatgaatga cccagtatcg aatactggta
    67081 ataatatcag caaacgaacg ttctcctgtg cttccagaca tgctgagctc cacatattct
    67141 tgtacagtca aagaagatcg attccgtaaa agatgagatc agtaaatgac aattcactga
    67201 aatttcatct ttgtgagatc gtcaatattg taccgaaggc gtctttagag tataccgaat
    67261 cagtatagct atccttcttc tgacacagca acgcaatttg aaatagtatc aaaagtaagt
    67321 actaaataat ttcttttttc ctttacttgt tgatgtaaaa tcatcttcca ttcaatagaa
    67381 aattctttca attcaacgaa agagattctc atattcacac aatttaagta gatgcgagat
    67441 atagaaattt gcttttcgta gttgtggaag cagttttgtt gttggaatcc tttttttaaa
    67501 gaagaagtta atggtcgagt aagaaataag agtagtagat catattcgag gaaagaaaaa
    67561 atcgaataat tggaatccat agttgtgatg cattgttgtg gatctcgatc caaaggttct
    67621 ttcttgatct agctacaagg atggggcagt agggaaagat aaaatgtgga acctaataga
    67681 aattactagt tttagaatct agttggacaa aaaaaagatt ttttcaagcg attgtgtgat
    67741 aactttttct tcttctccat cattcaagat attatgtgaa ttaatatatt actaaatcta
    67801 atgagttaaa cttaaatgaa agtaaaaaga aaaagtttta taaggtaact gttcgcttta
    67861 aaatcgaaaa tggagtcgat acaattcaac agaatctaag aaatgatcaa attcgaaaat
    67921 catttctatt tttattctat aaaaattcaa gtttcatttt tgaatgcagt tagacgatac
    67981 agctcttatt agtttaatag tttactcaag agttactcaa tgaatcggtt gattggaatt
    68041 gcgggatgga tagatgttac agatgatgaa tcaatttctt ttatatgtct gtcactttat
    68101 ctttgttagt gctgtctgcc tataatgata gataaatcaa aaacttttca ttcaacttat
    68161 tctttcaatt gaaattgaga tttttgccta tcctcctatt ttattttgaa aaatttgaaa
    68221 cttaggtaag tgctttttaa acatatgtat aaaaagaaca tatttcattt aatttagccc
    68281 cttcatgctt actataacta gttatttcgg ttttctatta gcggctttaa ctataacctc
    68341 agctctattt attggtctga gcaagatacg acttatttaa actgaatatt taaaatgaac
    68401 aattcataaa aagaaatcct tctgtgggat tacgcgtatt ctatatttac ttacgttacc
    68461 aattgtcaat tcttgttcat tgtcattgag attcatgtca attcggatta atatttaggt
    68521 atcgatatta cctctttttt tctcctttca aacaaataaa aatgattgaa gtttttctat
    68581 ttggaatcgt gttaggtcta attcctatta ctttggctgg attattcgta actgcatatt
    68641 tacaatatag gcgtggtgat cagttggacc tttgattaat taacatctct ttttgattga
    68701 cctcctcctt tctttaattc acaggcacag gaggtcaaat tccgattgtt gtgaaagtta
    68761 ctgaatgaat ctattttatt ctaattcgat ctaagaagaa aaaaatcacg ctctgtagga
    68821 tttgaaccta cgacatcggg ttttggagac ccacgttcta ccgaactgaa ctaagagcgc
    68881 tttcttatca gaatagataa gactgtaaac aaaaggattc ttttcataac cccaatacat
    68941 tttgtatgca tatactagaa tagcatgata aaaatcaaag attatgtcca atttgaggcg
    69001 atctcaattg atccctcgtt actgctcctt tgagcagtaa taggtaggga tgacaggatt
    69061 tgaacctgtg acattttgta cccaaaacaa acgcgctacc aagctgcgcc acatcccttc
    69121 aattgttcca cagtgtaatt gtagagaatt cctgtcttgt tttccacatg gttatttcct
    69181 ccattgatat atacaaattt tctgctcatt tcgtcttttt ggtctcattt aacatataat
    69241 agtaaaataa aaggaaaaga cttctcttat agattatata gaaaatactt atatacaatt
    69301 atatacaaaa tatataaata cagaacccgt cgtaaaaatc aattagtatt tttcggaaat
    69361 tctcggtaag aaagaagggg atgtattttt tttttctgtt ttaagaaaag gaaaatctta
    69421 tttcccgaat cattgtacat tgcaatttga attaggaatt ctgtgtccaa ctctaagcag
    69481 cccttaacta catatgcatc tgattatata tgtattatct attccaacaa ataatacaaa
    69541 agaaggaggt ttttcaatgc gagatctaaa aacatatctc tctgtggcac cagtactaag
    69601 tacgctatgg ttcggggctt tagcaggtct attgatagag attaatcgtt ttttcccgga
    69661 tgcgttgaca ttcccctttt tttcattcta gttattgtca tgggaaggaa tgaagaagat
    69721 tagagatcca atcaaatatt ggtgatgaat ccctctcccc ctcttttctc ttttttccct
    69781 ttttagaata agggaggaaa gagaaagaat aaaaaaagtg gattcaacat tcgggctcaa
    69841 gttcgaatta actgaatatt aataatagag gaatgggggt agaatagaag atctagggca
    69901 agagtattat acaagatact taaatgatta cttcaatttg aaatatactt tagaaaaatc
    69961 gttgtatttt actatgactt tgctttacta ttactttatt ttcttgattt taatctttta
    70021 cttttagaat tggatttcaa gttagtaact tctattttat cctttcttcg ttttgaatcg
    70081 aaaatagaag agttgagtaa atcaaaaatc caaaggaggt tcatggccaa ggggaaagat
    70141 gtccgagtaa cggtgatttt ggaatgtact agttgtgtcc gaaacagtgt tgataaggta
    70201 tcaagaggta tttccagata tattactcaa aagaaccggc acaatacgcc taatcgatta
    70261 gaattgaaaa aattctgtcc ctattgttac aaacatacga ttcatgggga gataaagaaa
    70321 tagagcgaac caagtacctg tgtcttaccc tttcaaggaa ggggaaaaaa tgacattata
    70381 tatataacat atttaaatag aaaataaaca aatcttattt tttaaaaatc ctattttggg
    70441 tggatttaaa ctgaattaga attaagaaat aggattttag ggataaggaa taaattaaac
    70501 aaacaaacca tggataaatc caagcgacct tttcttaaat tcaagcgatc ttttcgtagg
    70561 cgtttgcccc cgattcaatc gggggatcga attgattata gaaacatgag tttaattagt
    70621 cgatttatta gtgaacaagg aaaaatatta tcaagacgag tgaatagatt gaccttgaaa
    70681 caacaacgat taattactct tgctataaaa caagctcgta ttttatcttt gttacccttt
    70741 ctcaataatg agaaacaatt tgaaagaacc gagtcgaccg ctagaactac tggttttaaa
    70801 gcccgaaata aataggctta ctttttcttc acttgaatca taattacaag aatctagatt
    70861 tgagtatcgt gtcgtaagaa aaaaaatgaa tcggaaaaaa agatttcttt ttttattgaa
    70921 ttgaacgtgt tcattcattt tgactacttt agcatatttt ctcatagaaa tttctactct
    70981 accttcccgg agttcattct ccggggaact ccatttaaat tattctggtg gattctttcc
    71041 aatctacttc ctttatgatt tcgttcgaaa tcatataaag acaattccta tttgatatag
    71101 ctatttgtgc aagtatttta cggttaagaa gcaactgtct cttgtacaga tcgtgtatta
    71161 atctactata actataggat actccccttt cgcgaattac tgcgtttatc cgagtgatcc
    71221 acaaacgacg aaaatctctc tttttcctat ccctatcccg atgagccgaa actaaagctc
    71281 ttattttctg ttgagtaata gttcgagtaa gccttgaatg agccccccga aagcttgatg
    71341 caaataaacg aatttttgtt ctacgtctcc gagctatata tccccgttta attctggtca
    71401 ttgaataaat gaaactttga cgaataacta atcgattgcc tttctttcag ttattctttt
    71461 cccccttcct agtctattaa taacaaaacg gatttttcca atgtataaaa taaaaattcc
    71521 aatggctttg gctactctaa ccttcccgac cacgattttt tctttttttt ttttttaggt
    71581 atttcactgc gaaataagaa agaaataaaa aattgtattt tcctaggtat caaaaatcta
    71641 gtaaataaaa gaaatcaaaa aataaagtag tgggttcctt cgtttctatg gttacttctt
    71701 aaacggtgag gtcttctcta tacaccggag cctttacttt atactttaat ttaatattta
    71761 atcaactaat tgatgttatt gggaacttgt atagttcaca ctctttggct ctacccatga
    71821 attatccagt aataggtctt tcacaatcag atctacctat acagtaagcg gtatttaatt
    71881 atgaaagttt gctgggtagc tgaccctctt agtccgttct tgccagagtg ggagcctgcc
    71941 taatctttat gttttatgct ttttaaataa gatttcctcc gcttaatgga taaccatttg
    72001 ttaccaatgg agaatttctt atcatctgtg attggattta caccaacgga aaccataaac
    72061 ttcatacaca atagagggat atgagagagt tttttttaaa taatgaatgg agttccttct
    72121 tccatcctat cccattcacc ggtactgatc attgatactg taaaagtcgt tttcttgctt
    72181 ttgtgccagc tcatgatcta aacgagtcgc acatacaccc tagtacatgt tcctcgacgc
    72241 tgaggacagc cccgaagagc gggggatttc gtgacatttc tgattggctg tcttgtattt
    72301 ctaataagtt gtttaatagt tggcatgttg aatcgtatac ataatatgat gggttggttt
    72361 agattgatcc taaccgaatg atgatgaatt acttctattt aatagaatat tcaattcgaa
    72421 gataaaatct caaatcacag atttgcgcga aatccatgtt attttcattc aaccgctaca
    72481 agatcaacaa ttccataagc ttgggcttct gttgctgaca taaaaacatc tctttccata
    72541 tcttcggata caacccataa gggtttcccc gttctttgta cataaaccct tgtgagggtt
    72601 tcacgcagtt tcagcagttc ttccgcttcc aggacaaatt cgcctgtttg tgcctcataa
    72661 aaagaactag caggttgatg gatcattacc ctgatgatat aacaaaataa aagcttcccc
    72721 tatctcgcat gataaagcaa agagaaaaga aagataaaga atagaaaaaa gatagaattg
    72781 aaccaaccgt acaggccatc ttttgtgcat acggcctcta caagaaaatt gacctcccct
    72841 cctttctatt gaagaaagag aaaaaataga atctatcaga ctcagatggg taaatgatca
    72901 aattccgatc cttcctttcg gaggagttaa aaaatactat gatggctccg ttgctttata
    72961 tgtttatttt ttcttttttt ttttttgtct gtgattcacg aatcccaaag tttcttttta
    73021 atccgatcaa ataaggaaaa aagtcttttt tttttttttt cgtactcttt cataacataa
    73081 atattgttaa gaactctccg gcatgaaaac aaaaaagttt gtgacgctga actgaactcc
    73141 cgatagataa gagaaaatcg gaaatacccc ttatctcata ctactctctc gatacagaat
    73201 ctaatgtttt gaaaaaaaaa caatacaaaa atttctcata tcgaattcga agtgccatgc
    73261 tattattact tagtattcat atggcgaagg catagtcttc ttttttctct caaataaaaa
    73321 cctcattggc gccaagcgtg agggaatgct agacgtttgg taatttctcc tccgaccagg
    73381 ataaaagatc ccattgaagc ggctaatccc atgcatattg tatggacatc tggtcgcaca
    73441 aattgcatag tatcataaat agccacccca ggtattaccc agcccccagg agagtttata
    73501 aacaaataca gatctttggt ctcatcctcg atactgagat ataccataag accaataagt
    73561 tgattcgaaa tctcgctatc aacctcttgg cctaaaaaaa gtaatctttc tcgataaagt
    73621 cggttgatta gggtaaaatt gtatccctta ggaaccgtac atgcgccttt tgatgcatac
    73681 ggttcaaaaa aaaaatggtg aatcaatgta tagattccag tcctctttct ttttttctag
    73741 aaaggttctt tcttacttct aacgaaaggg cttttcttcg attttttaat aaagacgagt
    73801 tttgactcct tttttatatt ttcgattttc cattataaaa tttgaagtta taagaaaggg
    73861 tcattaaact tatcgaatta acttctcatt gatgtattct ttcatcgaga tttaatccaa
    73921 accgcgatgg tattttcttg ttcctgaatg ggtctgtttc atctttttag gtttatgctc
    73981 tactccgggt aaagatccgc ccgatttgga tttgtacata taggacaaat gctcccatta
    74041 ccatttcttt ttgtatttct tttttttttt caattcattt tatacaagta tttcttagag
    74101 ttgagataac tttgcttgac aattaggatc tctttacaaa gaaaaaatat gaatagcaat
    74161 catagatatc ttaccaatcc aattgggttt tttctaaacg gagcctggat acttcatttt
    74221 tttagtccaa ccaagccaac cataaattat tctaattgaa tttttctaat tgataatagt
    74281 aatatgaatc ccctcaaaaa tggatctaat tgcacttcac gctccaaatt tttgatgatt
    74341 aaatttatct ttcttgggtg aaacggggga tatctcgatc gggggagaga acggggaaat
    74401 accatatgac ccaatatatc tgacaagtcg cactatacgt caacccaaga tgcatcttcc
    74461 tctccaggac ttcggaaagg gacttttgga acaccaatag gcattaaatg aaagaaagaa
    74521 ctaaatacta tatttcactt tgaggtggaa acgtaacaat tttttttatt gtctttataa
    74581 tattcatatt ggtttttatc gtatttattt tatccataga ttataaaaat tcataaagaa
    74641 agacagaatg aataaactca aattattacg aataggtctt tctaatgata aataagtatg
    74701 gactcattcg ctcatagaaa atgggatcaa ctcccccatt gcgtattggt acttatcgag
    74761 tatagaataa atctgcttct ctttgttcct acgaacagaa ttgttccatt attaccaaca
    74821 gaatagaaca cccttgttcg gaaataatcg actgaacaag agtggtccat aggatagtca
    74881 tattatagtc ttttccaatg caataaagtt acgtagtgtc tatttatctt tgatataagg
    74941 ggtatttcca tgggtttgcc ttggtatcgt gttcataccg ttgtattgaa tgatcccggt
    75001 cggttgcttt ctgttcatat aatgcataca gctctggttg ctggttgggc cggttcgatg
    75061 gctctgtatg aattagcggt ttttgatcct tctgatcctg ttcttgatcc aatgtggaga
    75121 cagggtatgt tcgttatacc cttcatgact cgtttaggaa taaccaattc atggggcggt
    75181 tggagtatca caggggggac tgtaacgaat ccgggtattt ggagttacga aggtgtagct
    75241 ggagcacata ttgtgttttc tggcttatgc tttttggcag ctatctggca ttgggtgtat
    75301 tgggatctag aaatattttg tgatgaacgt acaggaaaac cttctttgga tttgccaaag
    75361 atctttggaa ttcatttatt tctctcaggg gtggcttgct ttggttttgg tgcatttcat
    75421 gtaacaggct tgtatggtcc cggaatatgg gtgtccgacc cttatggact aacgggaaaa
    75481 gtacaacctg taaatccagc gtggggcgtg gaaggttttg atccttttgt tccaggagga
    75541 atagcctctc atcatattgc agcaggaaca ttgggcatat tagcgggcct attccatctt
    75601 agcgtccgtc cgccacaacg tctatacaaa ggattgcgta tgggaaatat tgaaaccgtc
    75661 ctttccagta gtatcgctgc tgtctttttt gcagcttttg ttgttgccgg aactatgtgg
    75721 tatggttcgg caacaacccc gattgaatta tttgggccca ctcgttacca atgggatcag
    75781 gggtacttcc agcaagaaat atatcgaaga gttagtgctg ggctagcaga aaatcaaagt
    75841 ttatcagaag cctggtctaa aattcctgaa aaattagctt tttatgatta catcggcaat
    75901 aatccggcaa aagggggatt attcagagcg ggctcaatgg ataacgggga tggaatagcg
    75961 gttggatggt taggacaccc tatctttaga gataaagaag gccgtgaact ttttgtacgt
    76021 cgtatgccta ctttttttga aacatttccg gtcgttttgg tagatggcga tggaattgtt
    76081 agagccgatg ttccttttag aagggcagaa tcgaagtata gtgttgaaca agtaggtgta
    76141 actgttgagt tctacggcgg tgaactcaac ggcgtcagtt atagtgatcc tgctactgtg
    76201 aaaaaatatg ctagacgtgc tcaattgggt gaaatttttg aattagatcg tgctactttg
    76261 aaatccgatg gtgtttttcg tagcagtcca aggggttggt ttacttttgg gcatgcttcg
    76321 tttgctttgc tcttcttctt cggacacatt tggcatggtg ctagaacctt gttcagagat
    76381 gtttttgctg gtattgaccc agatttagat gctcaagtcg aatttggagc attccaaaaa
    76441 cttggagatc caactacaaa aagacaggca gcctgataca acattacttt ggtatctttc
    76501 tttcgccctt attttctttc ttttactttt attgacatag ggtaccagag aaatctttat
    76561 ttgaatcaac ttcgttttta ctcttgttcg ttctttatcc ggaagatgac aaaaaaaaga
    76621 aaataaaaag aaacaaacag gtatgaaagc tataattgta aaccacgatc gaatctatgg
    76681 aagcattggt ttatacattc ctcttagtct cgactctagg gataattttt ttcgctatct
    76741 tttttcgaga accgcctaaa gttccaacta aaaagaacta aaaaggtgaa ataattcttc
    76801 attatctcag ttgaagtact gagcctcccg ataccgggag gctcagtact tcaactagtc
    76861 tccatgttcc tcgaatggat ctcttagttg ttgagaaggt tgcccaaaag cggtatataa
    76921 ggcgtaccca gtaaaactta caagtaaacc agatataaag atggcgacta gggttgctgt
    76981 ttccattctt atcatattta taaaatttca agaccccaat ggatctatga taggatcgtt
    77041 tatttacaac ggaatggtat acaaagtcaa cagatctcaa tgaatacaat aggatttatg
    77101 gctacacaaa ctgttgaaaa cagttctaga tctggtccaa gacgaactgc ggtaggagat
    77161 ttattaaaac cattgaattc ggaatatggt aaagtagctc ctgggtgggg aactactcct
    77221 ttgatgggtg tcgcaatggc cttatttgcg gtatttctat ctattatttt ggagatttat
    77281 aattcttccg ttttattgga tggaatttca atgaattaga tctataagaa ccgcaaagtt
    77341 cttgcttttg agtccaaaat gaatcattta gagctccgat ttctagtcca ttctattttc
    77401 ttttggtagt tcgatcgtgg aatttctttg tttctgtatt tccggagtat gagtgtgtga
    77461 cttgttataa ttgatcctat tgatagtaca gagaatgggt ctgtcatctt gatagagatg
    77521 gttctacttc gtcagatatt tattctaata tttggaacac gaaatagatt aagaaatatt
    77581 tgaactatga ttcatactta atattcagac ctcgtgtccg ggctccaaaa aattttcaaa
    77641 caaagaattc taatttctaa atcgaaagat tcttttcttt caacccctat ttatattttg
    77701 accaaaagca aaacctttct ttgaattttt agtcattcta tttattcagg gaataagtga
    77761 tgatccgagg attcttactc agggaatcct tgatttgatt taggttaggt ttttttattg
    77821 aatcatcgtg gttctagtat gaatctgagg ttttaatcga ttcatagggt cttaacaaga
    77881 gaattcctat caataataaa gaaaacaaat aataaaagcc atattccaca aaaacaaatt
    77941 ctagaaagaa atagggaaaa agagaattca agaggcccat aagtatcaaa ataaagataa
    78001 agacgactgc gccaacttga tattttggta ttatcgccac aaagaagagc tttcggattt
    78061 tccagagaag atgggatcag aacttaataa atttaaaact ttctattcca tatccgttgc
    78121 aactagtatt tgggtgtttt tgcttgagct gtacgagatg aaagtctcat atacggttct
    78181 cagaggggga gttccgccta tctcaataaa gtatatgatt ggttcgaaga acgtctcgag
    78241 attcaagcaa ttgcggatga tataactagt aaatacgttc ctccccacgt caatatattt
    78301 tattgtttag ggggaattac gcttacttgt tttttagtac aagtagctac tgggtttgct
    78361 atgacttttt actatcgtcc gaccgttact gaggcttttg cttctgttca atacataatg
    78421 actgaagcca actttggttg gttaatccga tcagttcatc gatggtcggc aagtatgatg
    78481 gtcctaatga tgatcctgca tgtatttcgt gtgtatctca ccggcggatt taaaaaacct
    78541 cgcgaattga cttgggttac aggtgtggtt ctggctgtat taaccgcatc ttttggcgta
    78601 actggttatt ccttaccttg ggaccaagtc ggttattggg cagtgaaaat agtaacaggt
    78661 gtccctgacg ctattcctgt aataggatca cccttggtcg aattattgcg cggaagcgct
    78721 agtgtgggac aatctacttt gacccgtttt tatagtttac acacttttgt attgccgctt
    78781 cttactgccg tatttatgtt aatgcacttt ccaatgatac gtaaacaagg tatttctggg
    78841 cctttataga gaaaagaaaa atagatccta aatatttgta atcaatcatt tatcacttgg
    78901 tggaggaata tatagtattt cattgctaca agtatggatt attgaaaata ataagacatg
    78961 gatttggata tttcccttta actattcatg tcaactaaac ggggggattg aagggaattt
    79021 tgtgaagaga aaatggatta tgggagtgtg tgacttgaac tattgattgg tctgtgtaga
    79081 tatatgcctg ccacatggga attcacaacc aaatgtgtct ttgttccaat cgccgtgtaa
    79141 gccctataca gaggataggc tggttcgctt aaagagaatc ttttctatga tcaggtccga
    79201 atcatgttgt acatgagcag gctccgtaag atccagtata agtgaactag ataaaacgga
    79261 atcaagattc cgttttatct agttcactta taagattaaa tagtatgtaa atgtattcat
    79321 ttcctctgca gtgacacgat caatactact atcggagtga aacaagggat ctaaagaaga
    79381 agagaggcta gactatatta gtaacaagca aaccttgtat gtgtatctcc aaatattttg
    79441 gagataaata ccaattagaa ggtctgagac gacccagaaa gcacttgatc atatcatgat
    79501 ctgatttgta agcctacttg ggtcttgagt atttacttgt aagaacggaa ttctttgttt
    79561 tgtaatggat agttgcaact ccgtaaaaaa gaattcagtc aaatttttct tacattgaac
    79621 cattcctata tcatatatgt gtatgtgtaa atacaggtac catatatata ttttatatgg
    79681 atatatggag tcatttggtt ctttttattc ttgctcgagc tggatgatta aaaattatca
    79741 tgtccagttc cctcggggga tggatctata agaattcacc tatcccaata acaaaaaaac
    79801 ctgacttgaa tgatcctgta ttaagagcta aattggctaa aggtatgggt cataattatt
    79861 atggagagcc cgcatggccc aatgatcttt tatatatttt tccagtagta attctaggta
    79921 ctattgcatg taatgtaggc ttagccgttt tagaaccatc aatgattggt gaaccggcag
    79981 atccatttgc aacccctttg gaaatattac ctgaatggta tttctttcct gtatttcaaa
    80041 tacttcgtac agtgcccaat aaattattgg gggttctttt aatggtttca gtacctgcgg
    80101 gattattaac agtacctttt ttagagaatg ttaataaatt ccaaaatcca tttcgccgtc
    80161 cagtagcgac gactgtcttt ttgattggta ccgcagtcgc cctttggttg ggcattggtg
    80221 caacattacc tattgataaa tccctaactt taggtctttt ttaaattttt aaattgattc
    80281 aattgtgaaa taacacgaca tgtgtatcta gggaatagtt tcttcaaagc gaattctccc
    80341 tagatacatc tattcaattt aattctgaat ttattttgaa tatatgatat attaatatat
    80401 taattgtgct aaagagtttc aatctatttt cactaagtaa gtccaataga tttaaaactt
    80461 attttttgct aaatcaatta cgaaatattt ttctaaaatg cccaatatcc gttttacatc
    80521 ttcgctacga aaatgttcaa ttttcataag atcttcttgg ctgttattca aaaggtccaa
    80581 caatgtatat atattggaca ttttgaggca attatagatc ctggaaggca attctgattg
    80641 gtcaataaaa atcgatttca atgctatttt ttttttgttt tttatgagtt tagccaattt
    80701 atcatgaaag gtaaaagggg ataaaggaac cgtgtgttga ttgtcctgta aatataagtt
    80761 gtcttcctcc atatgtaaaa agggaataaa taaatcaatt aaatttcggg atgcttcatg
    80821 aagtgcttct ttcggagtta aacttccgtt tgtccatatt tcgagaaaaa gtatctcttg
    80881 tttttcattc ccattcccat aagaatgaat actatgattc gcgtttcgaa caggcatgaa
    80941 tacagcatct ataggataac ttccatcttg aaagttatgt ggcgttttta taagatatcc
    81001 acgatttctc tctatttgta atccaataca aaaatcaatt ggttccgtta aactggctat
    81061 atgttgtgta ttatcaacga tttctacata aggcggcaag atgatatctt gggcagttac
    81121 agatccagga cccttgacac aaatagatgc gtcagaagtt ccatatagat tacttcttaa
    81181 tataatttct ttcaaattca ttaaaatttc atgtaccgat tcttgaatgc ccgttatggt
    81241 agaatattca tgtgggactt tctcagattt tacacgtgtg atacatgttc cttctatttc
    81301 tccaagtaaa gctcttcgca tcgcaatgcc tattgtgtcg gcttggcctt tcataagtgg
    81361 agacagaata aagcgtccat aataaaggcg tttactgtct gttcttgatt caacacactt
    81421 ccactgtagt gtccgagtag atactgttac tttctctcga accatagtac tattatttga
    81481 ttagatcatc gaatctttta tttctcttga gatttcttca atgttcagtt ctacacacgt
    81541 ctttttttcg gaggtctaca gccattatgt ggcataggag ttacatcccg tacgaaagtt
    81601 aatagtatac cacttcgacg aatagctcgt aatgctgcat ctcttccgag accgggacct
    81661 tttatcatga cttctgctcg ttgcatacct tgatccacta ctgtacggat agcgtttgct
    81721 gctgcggttt gagcagcaaa cggtgttcct cttctcgtac ctttgaatcc agaagtaccg
    81781 gcggaggacc aagaaactac tcgaccccgt acatctgtaa cagtgacaat ggtattattg
    81841 aaacttgctt gaacatgaat aactcccttt ggtattctac gtgcaccctt acgtgaacca
    81901 atacgtccat tcctacgcga actaattttc ggtatagctt ttgccatatt ttatcatctc
    81961 gtaaatatga gtcagagata tatggatata tccatttcat gtcaaaacag attctttatt
    82021 tgtacatcgg ctcttctggc aagtctgatt atccctgtct ttgtttatgt ctcgggttgg
    82081 aacaaattac tataattcgt ccccgcctac ggattagtcg acatttttca caaattttac
    82141 gaacggaagc tcttattttc atatttctca ttccttacct taattctgaa tctatttctt
    82201 ggaagaaaat aagtttcttg aaatttttca tctcgaattg tattcccacg aaaggaatgg
    82261 tgaagttgaa aaacgaatcc ttcaaatctt tgttgtggag tcgataaatt atacgccctt
    82321 tggttgaatc ataaggactt acttcaattt tgactctatc tcctggcagt atccgtataa
    82381 aactatgccg gatctttcct gaaacataat ttataatcag atctaaacaa acccggaaca
    82441 gaccgttggg aaggcgattc agtaattaaa gcttcatgac tcctttttgg ttcttaaagt
    82501 ccctttgagg tatcaactaa taagaaagat attagacaac cccccttttt tctttttcac
    82561 aaataggaag tttcgaatcc aatttggata ttaaaaggat taccagatat aacacaaaat
    82621 ctctccacct attccttcta gtcgagcctc tcggtctgtc attatacctc gagaagtaga
    82681 aagaattaca atccccattc cacctaaaat tcgcggaatt cgttgataat tagaatagat
    82741 tcgtagacca ggtcgactga ttcgttttaa atttaaaata tttctatagg gtcttttcct
    82801 attccttcta tgtcgcaggg ttaaaaccaa aaaatatttg tttttttctc gatgttttct
    82861 cacgttttcg ataaaacctt ctcgtaaaag tatttgaaca atattttcgg taatattagt
    82921 agatgctatt cgaaccaccc tttttcgatc catatcagca tttcgtatag aagttattat
    82981 ctcagcaata gtgtccctac ccatgatgaa ctaaaattat tggggcctcc aaatttgata
    83041 taatcaacgt gttttttact tatttttttt ttgaatatga tatgaattat taaagatata
    83101 tgcgtgagac acaatctact aattaatcta tttctttcaa ataccccact agaaacagat
    83161 cacaatttca ttttataata cctcgggagc taatgaaact attttagtaa aatttaattc
    83221 tctcaattcc cgggcgattg caccaaaaat tcgagttcct tttgatttcc ttccttcttg
    83281 atcaataaca actgcagcat tgtcatcata tcgtattatc atcccgttgt cacgtttgag
    83341 ttctttacag gtccgcacaa ttacagctct gactacttct gatctttcta ggggcatatt
    83401 tggtacggct tctttgatca cagcaacaat aacgtcacca atatgagcat atcgacgatt
    83461 gctagctcct atgattcgaa tacacatcaa ttctcgagcc ccgctgttat ccgctacatt
    83521 taaatgggtc tgaggttgaa tcattttttt aatccgttct ttgaatgcaa agggcgaaga
    83581 aaaaaaagaa atatttttgt ccaaaaaaaa agaaacatgc ggtttcgttt catatctaag
    83641 agccctttcc gcattttttt ctattacatt acgaaataat gaattgagtt cgtataggca
    83701 ttttagatgc tgctagtgaa atagcccttc tggctatatt ttctgttact ccacccattt
    83761 cataaagtat tcgacccggt ttaacaacag ctacccaata ttcaggggat ccttttcctg
    83821 aacccatacg tgtttctgcg ggtcttagtg taactggttt gtctggaaat atacgtaccc
    83881 atatttttcc accacgacgt gcatttcgtg tcattgctcg tcggcctgct tctatttgtc
    83941 tagatgtaat ccaagcaggt tcaagtgcct gaagagcata tttaccgaaa gaaatatgat
    84001 tacctcgatg agatattccc ttcattcttc ctctatgttg tttacggaat ctggttcttt
    84061 tggggttata gttgatggtt gtttctgaat tccatctcta ctacagaacc ggacgtgaga
    84121 gtttcttctc atccagctcc tcgcgaataa aaggattcaa aaaatttaat tagaattaag
    84181 ctagaatagt caatcttaag ttaagatata tatgtattta ctgagtaata ccttgaacgt
    84241 gggattcttt gagatttcat tcaatctatt agtaatttgt atatcttgtt tgaatagata
    84301 actaaacttt tgagttttat aaatagaaat ctaaaaaaaa attgtattat tataccaaat
    84361 ccttattttg tcctttattg tattgtccta aattttgcaa taaaaaaagt tttcgcgggc
    84421 gaatattgac tctttcaatc cctatttcat ttgtagggtt aactcgtgac ttctcagatc
    84481 tccgaataca tgaattaatc tctggttcgt tccgccatcc cgaccagtga atcattaaga
    84541 ttcctttttc aatagaatct tttgcattca caagttccgt cgttcccatc acttcttact
    84601 taatggttag gtccgaattc tacaatggag ctcagaatga aattggttct tgagtcaatc
    84661 ttctcagtct ttattggctc gaagctcttg attttttgtt ctatttctat aagaagattc
    84721 attttattat ggtatgaatg cgtattgatg ctttattaca ctgcctttta tgagattact
    84781 catagacctt acatattgga attttatatc attggtattc tttttctctc tttctctcat
    84841 ccttccattt atccacatct tttttgtcta ttttgcttta caacttagaa tcagatttcc
    84901 ttttttgttt atgcaaaaga tttcagttgc tacaaagata tgacctatat atcatatctt
    84961 gactggttct ttagatccag ataatgcgaa gtgatgggtt ggttattagt tctatagttt
    85021 ttagttcata ctatgtgggc tggtcttttt taatcctaac cctaaaaaac caacggagtc
    85081 acacactaag catagcaatt atatcaaatg gtcaatcgaa tttttattca accttataga
    85141 attaagaatt agaaatgttt cccttgattg attagaaaaa gaatgaattt gtcttttttt
    85201 gttcaatcat tggatagaag ggaaagacaa gtagtaaaat tattcctcgt ctagaaatat
    85261 ccaaattttg atgcccaata ctccatagat agttcgaact gtataagagc aataatcaat
    85321 tttcgctcga atcgtttgta ggggaaccct accttctctg atccattcga cacgtgcaat
    85381 ttcttttccg tcgatacgcc ccgcaatttg tatttgaatt ccttttgtat ctgcttgttc
    85441 tgttaattca atagcctttt tcattgcttt tcgaaaggaa actctattct ttaattgtcc
    85501 agctataaat tctgcaagaa tattagggtt tccataaggt tttgcaattc ttgtgacagc
    85561 aatgttcagt tttcggttta cacaatgaaa ttctttttgt aaggtcgttt gtaattcttc
    85621 gattccgcgc ggtcgacttt ctattaataa ttttgggaat cccataaaga ttatgacctg
    85681 gatcagatcg attctttttt gaatctctat acgtgcaatt ccctcgacgc cagaggacgt
    85741 tctcatattc ttttgtacat aattcttgat acaatctctt attttttgat cttcttgtaa
    85801 accttcagaa taattttttg gttgtgaaaa ccaaagggaa tgatgacctt gggttgtacc
    85861 cagtctgaaa ccaagtggat ttattttttg tcccataatc ccccactatt atacatatca
    85921 cgatacggca tagctgtaga tttttttttc catctcgttt tttttaacga atacatctct
    85981 acatattcat catctaaaga tatatctttc attacaatag ttatatgaca ggtcgatctt
    86041 tttattggaa aactacgtcc tcgagctcga ggtttcaatt tcttcacagt agtacctcca
    86101 ttgacttcgg ctttactaat gactaaattg gcttcgctgg aacccatatt gtaactagca
    86161 tttgctgctg cagaataaat caatttcaaa atgggataac atgctcgata gggcatgagt
    86221 tctagtatca taagcgtttc ctcataggaa cggccgcgaa tttgattaat tactcttcgt
    86281 gctttgtcag cagacataga tatatgttca cctaaagcat atacttctgt ttttttcttc
    86341 tttagcataa ggtttgcctc ctactactga atcataagca tctagatttt ttttattaat
    86401 attaacgacg agatctatta tcgctttttg catgtcctct aaaatttaat gtaggtgcaa
    86461 attctcccaa tttgtggcct accatactat ccgttatata aataggcaaa tgctcttttc
    86521 cattatggat agcaatcgta tgaccgatca ttgtgggtat aatggtagat gcccgggacc
    86581 aagttactat tatttctttt tctgcttttg tgttaagctt atcaattttt tttaataaat
    86641 gattggctac aaagggattt ttttttagtg aacgtgtcac aagcttactc ctattttttt
    86701 tttttttgta aaaacgaaga atttaattcg attttctctc ctatttacta cggcgacgaa
    86761 gaatcaaatt atcactatat ttattccttt ttctacttct tcttccaagt gcaggataac
    86821 cccaaggggt tgtgggtttt tttctaccaa ttggggctct cccttcacca cccccatggg
    86881 gatggtctac agggttcata actactcctc ttactacagg acgcttacct agccaacgct
    86941 tagatccggc tctacccaaa cttttctggt tcaccccaac attccccact tgtccgactg
    87001 ttgctgagca gtttttggat atcaaacgga cctccccaga aggtaatttt aatgtggccg
    87061 atttcccctc ttttgcaatc agtttcgcta cagcacccgc tgctctagct aattgtccac
    87121 cctttccaag tgtgatttct atgttatgta tggccgtgcc taagggcata tcggttgaag
    87181 tagattcttc ttttgatcaa tcaaaacccc ttcccaaact gtacaagctt cttccaaagc
    87241 atacttcttt ctggatgtag atgatgatat ctatacagat ggatcttata tatatcgtag
    87301 aatgaagtac cacatgggtg gatatatata tgaatccaaa tctgccgaat cactcatgtt
    87361 atgatcttct acatcctggg tcttcccgtt ccgtcatctg gcttatgttc ttcatgtagc
    87421 attcagaccg aatgactcta tgaaattacg tcgatacttc cacatattat gggtaacgta
    87481 ggagacatct ctatttttcc cccggggaat ctttagaatt cccactgctt aactttcaat
    87541 tcgcctctga ccatcaaatg aaatgtgaat aacccgtcct cctctctttg aaagaagggg
    87601 cgcttccggt tctgtcggtg cttgaaacaa ttttgtcttc tccatattac tatatctcta
    87661 gagtcaataa ttttatatga ggaactactg aactcaatca cttgctgccg ttactcttca
    87721 gttttctgtt gaggtctatc ctgcagaggt actcaaattg gatcagtgat cgatttctag
    87781 gtttcgtcgt aaacctaatt ggttatttcc aattacgtaa atcaatagtt caaaccgcac
    87841 tcaaaggtag ggcatttccc atttttatag gaacttctgt accagaaaca atggtatctc
    87901 caattatagc ccctctggga tgtaaaatat atctcttctc accatcccca tagtgtatga
    87961 gacaaatgta tgcatttcga ttagggtcgt attctatggt tacgattcta ccatatatgt
    88021 ctttttcatt ccgtcgaaaa tcgattttac ggtatagacg cttatgacct ccccctctat
    88081 gccttgcggt aatgattcct ctggcattac gacctttacc acaatgatgc tgtccataga
    88141 tcaaattatt tcgtggattg gatttcactt gactgtctac ggttccattg cgtgtgctcg
    88201 gggtagaagt tttgtataaa tgtatcgcca tgctattaag tatttttttt taagttcttt
    88261 tctttctaag aggtggaata gaataacccg gttgaagcgt aatgatcata cgtctgtaat
    88321 gcattgtatg tcccataata ggtcccattc ttctactctt tcccggaagt cgatgactat
    88381 tcatagctat taccttgaca ccaaagaaga gttcgaccca atgctttatt tctgtcctag
    88441 ttgatcctga ttcgacatta gaagtatatt gatttttccc caataaccga atacttttgt
    88501 ctgtaaatac tgcatatttg attccatcca taaatcgatt ttcttcccta tgagttatag
    88561 tctcaataag aatgctagtt cttactgttc atatattatg atatgaatat accacaccaa
    88621 ttcgttatgt atggatgatg agattccatt gatacagagc caattccaat agacttattg
    88681 gagggtccca ttggcgtgca tccagtagga attgaaccta cgaattcgcc aattatgagt
    88741 tgggcgcttt aaccattcag ccatggatgc ttagcgggga tcctcgtaca tggtgaataa
    88801 ccaaattcca attgaaatga aatctttagg ataaatcaat gcaatttagt taggataaat
    88861 caatgcaatt taggaggaat caatgagagg acatcaattc aaatcctgga ttttcgaatt
    88921 gagagagata ttgagagaga tcaagaattc tcaccatttc ttagattcat ggacccaatt
    88981 caattcagcg ggatccttca ttcacatttt tttccaccaa gaacgttttc taaaactctt
    89041 tgacccccga atttggagta tcctactttc acgcaattca cagggttcaa caagcaatcg
    89101 atatttcacg atcaagggtg taatactctt tgtagtagcg gtccttatat atcgtattaa
    89161 caatcgaaat atggtcgaaa gaaaaaatct ctatttgata gggcttcttc ctatacctat
    89221 gaattccatt ggacccagaa atgatacatt ggaagaatcc gttgggtctt ccaatatcaa
    89281 taggttgatt gtttcgctcc tgtatcttcc caaaggaaaa aagatctctg agagttgttt
    89341 cctgaatccg aaagagagta cttgggttct cccaataact aaaaagtgta gcatgcctga
    89401 atctaactgg ggttcgcgtt ggtggaggaa ctggatcgga aaaaagaggg attctagttg
    89461 taagatatct aatgaaaccg tcgctggaat tgagatctta ttcaaagaga aagatctcaa
    89521 atatctggag tttctttttg tatattatat ggatgatccg atccgcaagg accatgattg
    89581 ggaattgttt gatcgtcttt ctctgaggaa gagtcgaaat agaatcaact tgaattcggg
    89641 accgctattc gaaatcttag tgaaacactg gatttcttat ctcatgtctg cttttcgtga
    89701 aaaaatacca attgaagtgg agggtttctt caaacaacaa ggggctgggt caactattca
    89761 atcaaatgat attgagcatg tttcccatct cttctcgaga aacaagtggg ctatttcttt
    89821 gcaaaactgt gctcaatttc atatgtggca attccgccaa gatctcttcg ttagttgggg
    89881 gaagaatccg cccgaatcgg attttttgag gaacgtatcg agagagaatt ggatttggtt
    89941 agacaatgtg tggttggtaa acaaggatcg gttttttagc aaggtacaga atgtatcgtc
    90001 aaatattcaa tatgattcca caagatctag tttcgttcaa gtaacggatt ctagccaact
    90061 gaaaggatct tctgatcaat ccagagatca tttggattcc attagtaatg aggattcgga
    90121 atatcacaca ttgattaatc aaagagagat tcaacaacga aaagaaagat cgattctttg
    90181 ggatccttcc tttcttcaaa cggaacgaaa agagatagaa tcaggccgat tcccgaaatg
    90241 cctttctgga tattcctcaa tgtcccggct attcacggaa cgtgagaagc agatgattaa
    90301 tcatctgttt ccggaagaaa tcgaagaatt tcttgggaat cctacaagat ccgttcgttc
    90361 ttttttctct gatagatggt cagaacttca tctgggttcg aatcctactg agaggtccac
    90421 tagggatcag aaattgttga agaaacaaca agatctttct tttgtccctt ccaagcgatc
    90481 ggaaaataaa gaaatggtta atatattcaa gataattacg tatttacaaa ataccgtctc
    90541 aattcatcct atttcatcag atccgggatg tgatatggtt ccgaagatga accggatatg
    90601 gacagttcca ataagatttc attcttgaac aaaaatccat tttttgattt atttcatcta
    90661 ttccatgacc ggaacagggg aggatacacg ttacactacg attttgaatc agaagagaga
    90721 tttcaagaaa tggcagatct attcactcta tcaataaccg agccggatct ggtgtatcat
    90781 aagggatttg ccttttctat tgattcctgc ggattggatc aaaaacaatt cttgaatgag
    90841 gccagggatg aatcgaaaaa gaaatcttta ttggttctac ctcctatttt ttatgaagag
    90901 aatgaatctt tttctcgaag gatcagaaaa aaatgggtcc ggatctcctg cgggaatgat
    90961 ttggaagatc caaaaccaaa aatagtggta tttgctagca acaacataat ggaggcagtc
    91021 actcaatata gattgatccg aaatctgatt caaatccaat atagtaccta tgggtacata
    91081 agaaatgtat tgaatcgatt ctttttaatg aatagatccg atcgcaactt cgaatatgga
    91141 attcaaaggg atcaaatagg aaaggatact ctgaatcata gaactataat gaaatatacg
    91201 atcaaccaat atttatcgaa tttgaaaaag agtcagaaga aatggttcga gcctcttatt
    91261 ttgatttctc gaaccgagag atccatgaat cgggatcctg atgcatatag atacaaatgg
    91321 tccaatggga gcaagaattt ccaggaacat ttggaacagt ccgtttcgga gcagaagagc
    91381 cgttttcaag tagtgttcga tcgattacgt attaatcaat attcgattga ttggtctgag
    91441 gttatcgaca aaaaagattt gtctaagcca cttcgtttct ttttgtccaa gtcacttctt
    91501 tttttgtcca agttgctttt ctttttgtct aactcacttc cttttttctg tgtgagtttc
    91561 ggaaatatcc ccattcatag gtccgagatc tacatctatg aattgaaagg tccgaatgat
    91621 caactctgca atcagttgtt agaatcaata ggtcttcaaa ttgttcattt gaaaaaatgg
    91681 aaacccttct tattggacga tcatgatact tcccaaaaat cgaaattctt gatcaatgga
    91741 ggaacaatat caccattttt gttcaataag ataccaaagt ggatgattga ctcattccat
    91801 actagaaata atcgcaggaa atcctttgat aacccggatt cctatttctc aatgatattc
    91861 cacgatcaag acaattggct gaatcccgtg aaaccatttc atagaagttc attgatatct
    91921 tctttttata aagcaaatcg acttcgattc ttgaataatc cacatcactt ctgcttctat
    91981 tggaacacaa gattcccctt ttctgtggaa aaggcccgta tcaataattc tgattttacg
    92041 tatggacaat tcctcaatat cttgttcatt cgcaacaaaa tattttcttt gtgcgtcggt
    92101 aaaaaaaaac atgctttttg ggggagagat actatttcac caatcgagtc acaggtatct
    92161 aacatattca tacctaacga ttttccacaa agtggtgacg aaacgtataa cttgtacaaa
    92221 tctttccatt ttccaagtcg atccgatcca ttcgttcgta gagctattta ctcgatcgca
    92281 gacatttctg gaacacctct aacagagggg caaatagtca attttgaaag aacttattgt
    92341 caacctcttt cagatatgaa tctatctgat tcagaaggga agaacttgca tcagtatctc
    92401 aatttcaatt caaacatggg tttgattcac actccatgtt ctgagaaaga tttatcatcc
    92461 gaaaagagga aaaaacggag tctttgtcta aagaaatgcg ttgagaaagg gcagatgtat
    92521 agaacctttc aacgagatag tgctttttca actctctcaa aatggaatct attccaaaca
    92581 tatatgccat ggttccttac ttcgacaggg tacaaatatc taaatttgat atttttagat
    92641 actttttcag acctattgcc aatactaagt agcagtcaaa aatttgtacc catttttcat
    92701 gatattatgc atggatcagg tatatcatgg cgaattcttc agaaaaaatt gtgtcttcca
    92761 caatggaatc tgataagtga gatctcgagt aagtgtttac ataatcttct tctgtccgaa
    92821 gaaatgattc atcgaaataa tgagtcacca ttgatatcga cacatctgag atcgccaaat
    92881 gctcgggagt tcctctattc aatccttttc cttcttcttg ttgctggata tctcgttcgt
    92941 acacatcttc tctttgtttc ccgggcctct agtgagttac agacagagtt cgaaaaggtc
    93001 aaatctttga tgattccatc atctatgatt gagttgcgaa aacttctgga taggtatcct
    93061 acatctgaac cgaattcttt ctggttaaag aatctctttc tagttgctct ggaacaatta
    93121 ggagattctc tagaagaaat acggggttct gcttctggcg gcaacatgct tggtcccgct
    93181 tatggggtca aatcaatacg ttctaagaag aaagattgga atatcaatct catcgagatc
    93241 atcgatctca taccaaatcc catcaatcga atcacttttt cgagaaatac gagacatcta
    93301 agtcatacaa gtaaagagat ctattcattg ataagaaaaa gaaaaaacgt gaacggggat
    93361 tggattgatg ataaaataga atcctgggtc gcgaacagtg attcgattga tgatgaagaa
    93421 agagaattct tggttcagtt ctccacctta acgacagaaa ataggattga tcaaattcta
    93481 ttgagtctga ctcatagtga tcgtttatca aagaatgact ctggttatca aatgattgaa
    93541 caaccgggag caatttactt acgatactta gttgacattc ataaaaagca tctaatgaat
    93601 tatgagttca atccatcctg tttagcagaa agacggatat tccttgctca ttatcagaca
    93661 atcacttatt cacaaacttc gtgtggggaa aatagttttc atttcccatc tcatggaaaa
    93721 cccttttcgc tccgcttagc cttatccccc tctaggggta ttttagtgat aggttctata
    93781 ggaactggac gatcctattt ggtcaaatac ctagcgacaa actcctatgt tcctttcatt
    93841 acggtatttc tgaacaagtt cctggataac aagcctaaag gttttcttct tgatgagatc
    93901 gatattgatg atagtgacga tattgatgat agtgacaatc ttgatgctag tgacgatatc
    93961 gatcgtgacc ttgatacgga gctgaaactg ctaactagga tgaatgggct aactatggat
    94021 atgatgccgg aaatagaccg attttatatc acccttcaat tcgaattagc aaaagcaatg
    94081 tctccttgca taatatggat tccaaacatt catgatctgg atgtgaatga gtcgaatgac
    94141 ttagccctcg gtctattagt gaaccatctc tccagggatt gtgaaagatg ttctactaga
    94201 aatattcttg ttattgcttc gactcatatt ccccaaaaag tggatcccgc tctaatagct
    94261 ccgaataaat taaatacgtg cattaagata cgaaggcttc ttcttccaca acaacgaaag
    94321 cactttttca ctctttcata tactagggga tttcacttgg aaaagaaaat gttccatact
    94381 aacggattcg ggtccataac catgggttcc aatgcacgag atcttgtagc acttaccaat
    94441 gaggtcctat cgattagtat tacacagaag aaatcaatta tagacactaa tacaattaga
    94501 tccgctcttc atagacaaac ttgggatttg cgatcccagg taagatcggt tcaggatcat
    94561 gggatccttt tctatcagat aggaagggct gtagcacaaa atgtacttct aagtaattgc
    94621 cccatagatc ctatatctat ctatatgaag aagaaatcat gtaacgaagg ggattcttat
    94681 ttgtacaaat ggtacttcga gcttggaacg agcatgaaga gattaacgat acttctttat
    94741 cttttgagtt gttctgccgg atcggtcgct caagatcttt ggtctttatc cggacccgat
    94801 gaaaaaaatg ggatcacttc ttatggactc gttgagaatg attctgatct agttcatggc
    94861 ctattagaag tagaaggcgc tctggtggga tcttcacgga cagaaaaaga ttgcagtcag
    94921 tttgataatg atcgagtgac attgcttctt cggcccgaac cgaggaatcc cttagatatg
    94981 atgcaaaacg gctcttgttc tatccttgat cagagatttc tctatgaaaa atatgaatcg
    95041 gagtttgaag aaggggaggg agaaggagcc cttgacccgc aggaggattt attcaatcac
    95101 atagtttggg ctcctagaat atggcgccct tggggctttc tatttgattg tatcgaaagg
    95161 cccaatgaat tgggatttcc ctattggtcc aggtcatttc ggggcaagcg gatcatttat
    95221 gatgaagagg atgagcttca agagaatgat tcggagttct tgcagagtgg aaccatgcag
    95281 taccagacac gagatagatc ttccaaagaa caaggccttt ttcgaataag ccaattcatt
    95341 tgggaccctg cagatccact ctttttccta ttcaaagatc agccccctgg ctctgtgttt
    95401 tcacatcgag aattatttgc agatgaagag atgtcaaagg ggcttcttac ttcccaaaca
    95461 gaccctccta catctatata taaacgctgg tttatcaaga atacgcaaga aaagcacttc
    95521 gaattgttga ttaatcgtca gagatggctt agaaccaaca gttcattatc taatggatct
    95581 ttccgttcta atactctatc cgagagttat cagtatttat caaatctgtt cctatctaac
    95641 ggaacgctat tggatcaaat gacaaagaca ttgttgagaa aaagatggct tttcccggat
    95701 gaaatgaaaa ttggattcat gtaacaggag aaagatttcc cattccttag ccggaaagat
    95761 atgtggccat gaaagaggga ttaagtggaa cagaattgac tgggtggtag agtcgtggaa
    95821 acgcttgttt cttccatatt ttggacctta gctccatgga agaatatgtt actgctgaaa
    95881 cacggaagaa ttgaaatctt agatcaaaac actatgtatg gatggtatga actgcctaaa
    95941 caagaattct tgaacagcaa acaaccagtt cagatattca cgaccaagaa gtactggatt
    96001 ctctttcgga taggccctga aaggagaagg aaggctggaa tgccaacagg cgtctattat
    96061 attgaattta cccgatagtc cccattttgg gaacgtccag tgccaaagtc actgaatggg
    96121 taagtcgcca atccctggac tatgtaatgt actttatctg ctgggttacg ggcgggcatt
    96181 ttaccagagg tttctaatct acccttgtgt gattcctgtt gaagcatata ctcggggggt
    96241 gggtgcaggg cggacgattt taaagcggac tccccattca ttagatagag aagatcacca
    96301 agatttcgcg atccgctgcc gaatttattc caattccaag agctcggatc gaatcggtat
    96361 atcaataccg attcgatccg agctctctta ttgagaatgc tcattcaatg agcattctca
    96421 atattatgcc ttgaagagga ctcgaacctc cacgctattt agcacgagat tttgagtctc
    96481 gcgtgtctac catttcacca ccaaggcatc ttgaaagtga atcgtattcc atgaatatga
    96541 tatctatcta gtgtgatgta tggaatatat gacaaaggtg gatctattga tcggtcatgt
    96601 catataggcc cgagttggac atccaattgc ttcgatttga attatccgga gaatgcaatg
    96661 cctgatatat atcaaaaaga tggacaatca aacctatttc tcgattcact caaagaggtg
    96721 aatagggtcc caatagagat atgtaaaaag caggtccgat tacgcgtatt cctaatccta
    96781 aatggaatgt aatgatgtag gaatccatat gtaaacatag tatctattta gataggcccg
    96841 aatgacccct tctcataatg agaatgtata taaccctatt ccggcctggt ccggtatgga
    96901 atgaacttat aatcatggaa tcgactcgat catcagatta taagttcata accctagccc
    96961 attcccattt tgggcggaac agatctacta attctttgat tccagttagt aagagggatc
    97021 ttgaactaag aaatagaccc tagaagctaa aaaaggctat cctgagcaat tgcaataatt
    97081 gggttcattg atattcctgg tatagtagat gctatcacac atacaatcat actcaattcg
    97141 atggaattgt ttgatcttaa aggggatctt ctataatttc gcacgtgagg ggttatttct
    97201 tggtttcgtc cagtcattaa taactttatt atttttagat aatagtagat agaaacaacg
    97261 cttgtaagga gtcctattaa aaccaagaaa tataggcctg cctgccatcc acaccagaat
    97321 aaatagagtt ttccgaaaaa acctgctagt ggaggaagac ctcctaggga taagagacat
    97381 agggctaaag agagagccaa aaaaggatct tttgtgtata atcctgcata atctcgaatg
    97441 ttatcagttc cggtacgtag accaaataat acaatgcaag caaaagttcc tagattcatg
    97501 gagatataga acagcatata agttatcatg cttgcatatc catcatttga gtctccaaca
    97561 attattccaa taattacata tccgatttgg cctatggacg aatatgcaag catacgtttc
    97621 atgcttgttt gagtaatagc aatgagattt cccaatatca tgctaagaat agctaggatt
    97681 tccagaagaa gatgccattc gtttgatgag aaataaaaag gaatatcgaa aattcgagtg
    97741 gctgaagctg aagcagctac tttcgaagta acagaaagaa aagcaacgac tggagtggga
    97801 gagtcagagt cgaaaagagg attcctcact tctttctctc attcaaaacc gtgcatgaga
    97861 ctttcatctc acacggctcc taagtgataa aagaaagaag aacccatttt ctttcttttt
    97921 tgattacctt cctcgcgtat gtataagacc gaatccattc gatttctaaa aaggattact
    97981 aatccttaac ttttcgagga atccttcatc agtggttgtg aatgactgat tttttcaatc
    98041 ttttcgacct tggtttcgta ggagcaagtc agaaagattg agaaatagaa ccatctgatt
    98101 taattcgttc tcaatagcca cgagatgatc atcttagggt gatccttttg tcgacggatg
    98161 ctcttattac actcgtagtc tctgaaggat gagaaccaac tatgtagcat ctacatcgag
    98221 aattcaagta ttgtatacgt cattagtccg atcctttgta ggaactaccc gtaataacga
    98281 acttgcaaaa tggatctgtt tatcataaag agattcgtcg ttcctgaccc tgcttcacct
    98341 taattgttat ttgaacaagt aaaagttctg tcttggtccg agtggggata gcatttctct
    98401 tctgcatgtc catggagttt tgaaaaatcc aaacatctca gagatagata gagaggtagg
    98461 aatttctcga acgaaccgca ctccttcgta tacgtcagga gtccattgat gagaaggggc
    98521 tggggaaagc ttgaacccaa ttcctacggt aatgaatatg agcgcaattg aaattcctgg
    98581 ggagttatac atttgtgtat tgataagacc gtttactatt tcttgaagct caatctctcc
    98641 cccggatgaa ccatatagcc aagagaaacc atgaaccaga atagaagagc ttgccccacc
    98701 catgagtaaa tatttcatag tagcctcatt agaccgtaca tctttcttgg tatatccaga
    98761 taataggtag gagcataaac tgaaacattc tggggctaca aagatagtta ttaaatcgtt
    98821 agcaccgcat aaaaacattc cccctagagt agctgttaat acgaataaga gaaactctgt
    98881 tatagccatt tctgtacatt caatgtactc tacggataga ggaatacata gagttgaaca
    98941 tagtaaaata agaaattgaa agatttcgtt gaaattgttc gtttggaaat ttcccgaaaa
    99001 gctaatcata ggttcttctc tccatcggaa caatagggcc gttatgctca ttactaaact
    99061 tgttgaagag atgaaatata accaaggtat atctttttga tcagaggttg aatcgatcat
    99121 cagaagaaga attaggccaa aaattaggat acattctggg aaaatcaaac ttccatcgaa
    99181 gagaagcaaa tgaaaggctt tcataaaaat tctcgtagaa tcgagaatga agttttcatt
    99241 ctgtacatgc cagatcatga attagtaact gcttccaatt tccaaaaaaa atcccaattg
    99301 tgtcgaactt tccatttttg gaatagttac ggaatctcca tgaataggat caaaccttat
    99361 tccatggtat ttacatgagg ttcctcttta agaaagtccc cgagaggctt agttgatcca
    99421 tgatttatgt ttcatctttc cttttcgttt gtttcgagaa atctatcgat caattccgat
    99481 tctttctttt tctcttgatt cttttccgat cgagatgtat agatcctgtt catggattaa
    99541 cgaaaatgtg caaaagctct atttgcctct gccattctat gagtctcttc ctttttgcgt
    99601 atggcatcgc cactcccttt ggcagcatcc actaattcgg aacttaattt gaaagccata
    99661 tttcgacccg gacgttttcg ggatgccgct aataaccaac gaatggcaag tgcttttcct
    99721 tgtgtggatc ctatttcaat gggaacttga tgagtcgatc cacctacacg tcttgctttt
    99781 actgttatat cgggagttac tccacgtatt gcttgacgta aaacggatag tggatttgtt
    99841 tctgtctttt gttgaatctt tttcacggct cgatagataa tttgataagc caatgatttt
    99901 tttccgtgtt tcagaatacg gttaaccaac atgttaacta atcgattacg ataaattgga
    99961 tcggattttg ctgttttttt ttctgcagta cctcgacgtg acatgagcgt gaaaggggtt
   100021 caagaatcag ttttcttttt ataagggcta aaatcactta ttttggcttt tttaccccat
   100081 attgtagggt ggatctcgaa agatatgaaa gatctccctc caagccgtac atacgacttt
   100141 catcgaatac ggctttccgc agaattctat atgtatctat gagatcgagt atggaattct
   100201 gtttactcac tttaaattga gtatccgttt ccctcccttt cctgctagga ttggaaatcc
   100261 tgtattttac atatccatac gattgagtcc ttgggtttcc gaaatagtgt aaaaagaagt
   100321 gcttcgaatc attgctattt gactcggacc tgttctaaaa aagtcgaggt atttcgaatt
   100381 gtttgttgac acggacaaag tcagggaaaa cctctgaaat tatttcaata ttgaaccttg
   100441 gacatataag agttccgaat cgaatctctt tagaaagaag atcttttgtc tcatggtagc
   100501 ctgctccagt ccccttacga aactttcgtt attgggttag ccatacactt cacatgtttc
   100561 tagcgattca catggcatca tcaaatgata caagtcttgg ataagaatct acaacgcact
   100621 agaacgccct tgttgacgat cctttactcc gacagcatct agggttcctc gaacaatgtg
   100681 atatctcaca ccgggtaaat ccttaaccct tccccctctt actaagacta cagaatgttc
   100741 ttgtaaatta tggccaatac cgggtatata agcagtgatt tcaaatccag aggttaatcg
   100801 tactctggca actttacgta aggcagagtt tggttttttt ggggtgatag tggaaaagtt
   100861 gacagataag tcacccttac tgccactcta cagaaccgta catgagattt tcacctcata
   100921 cggctcctcg ttcaattctt tcgaattcat tggatccttt ccgcgttcga gaatcccccc
   100981 cttcttccac tccgccccga agagtaacta ggaccaattt agtcacgttt tcatgttcca
   101041 attgaacact gtccattttt gattattctc aaaggataag attattctct ttaccaaaca
   101101 tatgcggatc caatcacgat cttatatata agaagaacaa aagatctttc ttgatcaatc
   101161 cctttgcccc tcattcttca agaataagga agatcctttt caagtttgaa tttgttcatt
   101221 tggaatctgg gttcttctac ttcatattta tttaatatga atattttccc tctctttttt
   101281 ttatatcatt ccttaagtcc cataggtttg atcctgtaga atttgaccca ttttctcatt
   101341 gaacgaaagg tacgaaataa atcagattga taaaagtacc atgtgaaatc ttcggttttt
   101401 ccccttcctc gatccctatc ccataggtta ggtacagtgt ttgaatcaat agagaacctt
   101461 ttcttctgta tgaatcgata ttattccatt ccaaatcctt cccgatacct cccaaggaaa
   101521 atctcgaatt tggatcccaa attgacgggt tagtgtgagc ttatccatgc ggttatgcac
   101581 tctttgaata ggaatccgtt ttctgaaaga tcctggcttt cgtactttgg tgggtctccg
   101641 agatcctttc gatgacctat gttgaaggga tatctatcta atccgatcga ttgcgtaaag
   101701 cccgcggtag caacggaacc ggggaaagta tacagaaaag acagttcttt tctattatat
   101761 tagtattttc tattatatta gatatattag actattatat tagattagta ttagttagtg
   101821 atcccgactt agtgagtctg atgaattgtt ggcaccagtc ctacattttg tctctgtgga
   101881 ccgaggagaa aaggggctcg gcgggaagag gagtgtacca tgagagaagc aaggaggtca
   101941 acctctttca aatatacaac atggattctg gcaatgtagt tggactctca tgtcgatccg
   102001 aatgaatcat cctttccacg gaggtaaatc tttgcctgct aggcaagagg atagcaagtt
   102061 ccaaattctg tctcggtagg acatgtattt ctattactat gaaattcata aatgaagtag
   102121 ttaatggtag ggttaccatt atcctttttg tagtgacgaa tcttgtatgt gttcctaaga
   102181 aaaggaattt gtccattttt cggggtctca aaggggcgtg gaaacgcata agaactcttg
   102241 aatggaaaag agatgtaact ccagttcctt cggaatcggt agtcaatcct atttccgata
   102301 ggggcagttg acaattgaat ccgattttga ccattatttt catatccgta atagtgcgaa
   102361 aagaaggccc ggctccaagt tgttcaagaa tagtggcgtt gagtttctcg accctttgac
   102421 ttaggattag tcagttctat ttctcgatgg ggcggggaag ggatataact cagcggtaga
   102481 gtgtcacctt gacgtggtgg aagtcatcag ttcgagcctg attatcccta agcccaatgt
   102541 gagtttttct agttggattt gctcccccgc cgtcgttcaa tgagaatgga taagaggctc
   102601 gtgggattga cgtgaggggg cagggatggc tatatttctg ggagcgaact ccgggcgaat
   102661 atgaagcgca tggatacaag ttatgccttg gaatgaaaga caattccgaa tccgctttgt
   102721 ctacgaacaa ggaagctata agtaatgcaa ctatgaatct catggagagt tcgatcctgg
   102781 ctcaggatga acgctggcgg catgcttaac acatgcaagt cggacgggaa gtggtgtttc
   102841 cagtggcgga cgggtgagta acgcgtaaga acctgccctt gggaggggaa caacagctgg
   102901 aaacggctgc taataccccg taggctgagg agcaaaagga ggaatccgcc cgaggagggg
   102961 ctcgcgtctg attagctagt tggtgaggca atagcttacc aaggcgatga tcagtagctg
   103021 gtccgagagg atgatcagcc acactgggac tgagacacgg cccagactcc tacgggaggc
   103081 agcagtgggg aattttccgc aatgggcgaa agctgacgga gcaatgccgc gtggaggtag
   103141 aaggcccacg ggtcgtgaac ttcttttccc ggagaagaag caatgacggt atctggggaa
   103201 taagcatcgg ctaactctgt gccagcagcc gcggtaatac agaggatgca agcgttatcc
   103261 ggaatgattg ggcgtaaagc gtctgtaggt ggctttttaa gtccgccgtc aaatcccagg
   103321 gctcaaccct ggacaggcgg tggaaactac caagctggag tacggtaggg gcagagggaa
   103381 tttccggtgg agcggtgaaa tgcgtagaga tcggaaagaa caccaacggc gaaagcactc
   103441 tgctgggccg acactgacac tgagagacga aagctagggg agcgaatggg attagatacc
   103501 ccagtagtcc tagccgtaaa cgatggatac taggcgctgt gcgtatcgac ccgtgcagtg
   103561 ctgtagctaa cgcgttaagt atcccgcctg gggagtacgt tcgcaagaat gaaactcaaa
   103621 ggaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgat gcaaagcgaa
   103681 gaaccttacc agggcttgac atgccgcgaa tcctcttgaa agagaggggt gccttcggga
   103741 acgcggacac aggtggtgca tggctgtcgt cagctcgtgc cgtaaggtgt tgggttaagt
   103801 cccgcaacga gcgcaaccct cgtgtttagt tgccatcgtt gagtttggaa ccctgaacag
   103861 actgccggtg ataagccgga ggaaggtgag gatgacgtca agtcatcatg ccccttatgc
   103921 cctgggcgac acacgtgcta caatggccgg gacaaagggt cgcgatcccg cgaggtgagc
   103981 taaccccaaa aacccgtcct cagttcggat tgcaggctgc aactcgcctg catgaagccg
   104041 gaatcgctag taatcgccgg tcagccatac ggcggtgaat tcgttcccgg gccttgtaca
   104101 caccgcccgt cacactatgg gagctggcca tgcccgaagt cgttacctta accgcaagga
   104161 gggggatgcc gaaggcaggg ctagtgactg gagtgaagtc gtaacaaggt agccgtactg
   104221 gaaggtgcgg ctggatcacc tccttttcag ggagagctaa tgcttgttgg gtattttggt
   104281 ttgacactgc ttcacacccc caaaaaaaag aagggagcta cgtctgagtt aaacttggag
   104341 atggaagtct tctttccttt ctcgacggtg aagtaagacc aagctcatga gcttattatc
   104401 ctaggtcgga acaagttgat aggaccccct tttttacgtc cccatgttcc ccccgtgtgg
   104461 cgacatgggg gcgaaaaaag gaaagagagg gatggggttt ctctcgcttt tggcatagcg
   104521 ggcccccagt gggaggctcg cacgacgggc tattagctca gtggtagagc gcgcccctga
   104581 taattgcgtc gttgtgcctg ggctgtgagg gctctcagcc acatggatag ttcaatgtgc
   104641 tcatcggcgc ctgaccctga gatgtggatc atccaaggca cattagcatg gcgtactcct
   104701 cctgttcgaa ccggggtttg aaaccaaact cctcctcagg aggatagatg gggcgattcg
   104761 ggtgagatcc aatgtagatc caactttcga ttcactcgtg ggatccgggc ggtccggggg
   104821 ggaccaccac ggctcctctc ttctcgagaa tccatacatc ccttatcagt gtatggacag
   104881 ctatctctcg agcacaggtt tagcaatggg aaaataaaat ggagcaccta acaacgcatc
   104941 ttcacagacc aagaactacg agatcgcccc tttcattctg gggtgacgga gggatcgtac
   105001 cattcgagcc gtttttttct tgactcgaaa tgggagcagg tttgaaaaag gatcttagag
   105061 tgtctagggt tgggccagga gggtctctta acgccttctt ttttcttctc atcggagtta
   105121 tttcacaaag acttgccagg gtaaggaaga aggggggaac aagcacactt ggagagcgca
   105181 gtacaacgga gagttgtatg ctgcgttcgg gaaggatgaa tcgctcccga aaaggaatct
   105241 attgattctc tcccaattgg ttggaccgta ggtgcgatga tttacttcac gggcgaggtc
   105301 tctggttcaa gtccaggatg gcccagctgc gccagggaaa agaatagaag aagcatctga
   105361 ctacttcatg catgctccac ttggctcggg gggatatagc tcagttggta gagctccgct
   105421 cttgcaattg ggtcgttgcg attacgggtt ggatgtctaa ttgtccaggc ggtaatgata
   105481 gtatcttgta cctgaaccgg tggctcactt tttctaagta atggggaaga ggaccgaaac
   105541 gtgccactga aagactctac tgagacaaag atgggctgtc aagaacgtag aggaggtagg
   105601 atgggcagtt ggtcagatct agtatggatc gtacatggac ggtagttgga gtcggcggct
   105661 ctcccagggt tccctcatct gagatctctg gggaagagga tcaagttggc ccttgcgaac
   105721 agcttgatgc actatctccc ttcaaccctt tgagcgaaat gcggcaaaag aaaaggaagg
   105781 aaaatccatg gaccgacccc atcatctcca ccccgtagga actacgagat caccccaagg
   105841 acgccttcgg catccagggg tcacggaccg accatagaac cctgttcaat aagtggaacg
   105901 cattagctgt ccgctctcag gttgggcagt cagggtcgga gaagggcaat gactcattct
   105961 tagttagaat gggattccaa ctcagcacct tttgagtgag attttgagaa gagttgctct
   106021 ttggagagca cagtacgatg aaagttgtaa gctgtgttcg ggggggagtt attgtctatc
   106081 gttggcctct atggtagaat cagtcggggg acctgagagg cggtggttta ccctgcggcg
   106141 gatgtcagcg gttcgagtcc gcttatctcc aactcgtgaa cttagccgat acaaagcttt
   106201 atgatagcac ccaatttttc cgattcggcg gttcgatcta tgatttatca ttcatggacg
   106261 ttgataagat ccatccattt agcagcacct taggatggca tagccttaaa agtgaagggc
   106321 gaggttcaaa cgaggaaagg cttacggtgg atacctaggc acccagagac gaggaagggc
   106381 gtagtaatcg acgaaatgct tcggggagtt gaaaataagc atagatccgg agattcccga
   106441 atagggcaac ctttcgaact gctgctgaat ccatgggcag gcaagagaca acctggcgaa
   106501 ctgaaacatc ttagtagcca gaggaaaaga aagcaaaagc gattcccgta gtagcggcga
   106561 gcgaaatggg agcagcctaa accgtgaaaa cggggttgtg ggagagcaat acaagcgtcg
   106621 tgctgctagg cgaagcagcc cgaatgctgc accctagatg gcgaaagtcc agtagccgaa
   106681 agcatcacta gcttatgctc tgacccgagt agcatggggc acgtggaatc ccgtgtgaat
   106741 cagcaaggac caccttgcaa ggctaaatac tcctgggtga ccgatagcga agtagtaccg
   106801 tgagggaagg gtgaaaagaa cccccatcgg ggagtgaaat agaacatgaa accgtaagct
   106861 cccaagcagt gggaggagcc agggctctga ccgcgtgcct gttgaagaat gagccggcga
   106921 ctcataggca gtggcttggt taagggaacc caccggagcc gtagcgaaag cgagtcttca
   106981 tagggcaatt gtcactgctt atggacccga acctgggtga tctatccatg accaggatga
   107041 agcttgggtg aaactaagtg gaggtccgaa ccgactgatg ttgaagaatc agcggatgag
   107101 ttgtggttag gggtgaaatg ccactcgaac ccagagctag ctggttctcc ccgaaatgcg
   107161 ttgaggcgca gcagttgact ggacatctag gggtaaagca ctgtttcggt gcgggccgcg
   107221 agagcggtac caaatcgagg caaactctga atactagata tgacctcaaa ataacagggg
   107281 tcaaggtcgg ctagtgagac gatgggggat aagcttcatc gtcgagaggg aaacagcccg
   107341 gatcaccagc taaggcccct aaatgatcgc tcagtgataa aggaggtagg ggtgcagaga
   107401 cagccaggag gtttgcctag aagcagccac ccttgaaaga gtgcgtaata gctcactgat
   107461 cgagcgctct tgcgccgaag atgaacgggg ctaagcgatc tgccgaagct gtgggatgta
   107521 aaaatacatc ggtaggggag cgttccgcct tagagagaag cctccgcgcg agcggtggtg
   107581 gacgaagcgg aagcgagaat gtcggcttga gtaacgcaaa cattggtgag aatccaatgc
   107641 cccgaaaacc taagggttcc tccgcaaggt tcgtccacgg agggtgagtc agggcctaag
   107701 atcaggccga aaggcgtagt cgatggacaa caggtgaata ttcctgtact gccccttgtt
   107761 ggtcccgagg gacggaggag gctaggttag ccgaaagatg gttatcggtt caagaacgta
   107821 aggtgtccct gctttgtcag ggtaagaagg ggtagagaaa atgcctcgag ccaatgttcg
   107881 aataccaggc gctacggcgc tgaagtaacc catgccatac tcccaggaaa agctcgaacg
   107941 actttgagca agagggtacc tgtacccgaa accgacacag gtgggtaggt agagaatacc
   108001 taggggcgcg agacaactct ctctaaggaa ctcggcaaaa tagccccgta acttcgggag
   108061 aaggggtgcc tcctcacaaa gggggtcgca gtgaccaggc ccgggcgact gtttaccaaa
   108121 aacacaggtc tccgcaaagt cgtaagacca tgtatggggg ctgacgcctg cccagtgccg
   108181 gaaggtcaag gaagttggtg acctgatgac aggggagccg gcgaccgaag ccccggtgaa
   108241 cggcggccgt aactataacg gtcctaaggt agcgaaattc cttgtcgggt aagttccgac
   108301 ccgcacgaaa ggcgtaacga tctgggcact gtctcggaga gaggctcggt gaaatagaca
   108361 tgtctgtgaa gatgcggact acctgcacct ggacagaaag accctatgaa gcttcactgt
   108421 tccctgggat tggctttggg cctttcctgc gcagcttagg tggaaggcga agaaggcctc
   108481 cttccggggg ggcccgagcc atcagtgaga taccactctg gaagggctag aattctaacc
   108541 ttgtgtcagg acctacgggc caagggacag tctcaggtag acagtttcta tggggcgtag
   108601 gcctcccaaa aggtaacgga ggcgtgcaaa ggtttcctcg ggccggacgg agattggccc
   108661 tcgagtgcaa aggcagaagg gagcttgact gcaagaccca cccgtcgagc agggacgaaa
   108721 gtcggcctta gtgatccgac ggtgccgagt ggaagggccg tcgctcaacg gataaaagtt
   108781 actctaggga taacaggctg atcttcccca agagctcaca tcgacgggaa ggtttggcac
   108841 ctcgatgtcg gctcttcgcc acctggggct gtagtatgtt ccaagggttg ggctgttcgc
   108901 ccattaaagc ggtacgtgag ctgggttcag aacgtcgtga gacagttcgg tccatatccg
   108961 gtgtgggcgt tagagcattg agaggacctt tccctagtac gagaggaccg ggaaggacgc
   109021 acctctggtg taccagttat cgtgcccacg gtaaacgctg ggtagccaag tgcggagcgg
   109081 ataactgctg aaagcatcta agtagtaagc ccaccccaag atgagtgctc tcctattccg
   109141 acttccccag agcctccggt agcacagccg agacagcgac gggttctctg cccctgcggg
   109201 gatggagcga cagaagtttt tttgagaatt caagagaagg tcacggcgag acgagccgtt
   109261 tatcattacg ataggtgtca agtggaagtg cagtgatgta tgcagctgag gcatcctaac
   109321 agaccggtag acttgaacct tgttcctaca tgacctgatc aattcgatca ggcactcgcc
   109381 atctattttc attgttcaaa tctttgacaa cacgaaaaaa ccattgttca actctttgac
   109441 aacatgaaaa aaccaaaagc tctgccctcc ctctctatct atccaaggga tggaagggca
   109501 gaggcctttg gtgtcccctc cagtcaagaa ttggggcctc acaatcacta gccaatatgc
   109561 ttttctctca tgcctttctt cgttcatggt tcgatattct ggtgtcctag gcgtagagga
   109621 accacaccaa tccatcccga acttggtggt taaactctac tgcggtgacg atactgtagg
   109681 ggaggtcctg cggaaaaata gctcgacgcc aggatgataa aaagcttaac acctctcatt
   109741 cttattactt tttcaatatg aaaacgaaaa aaaaaaaaat gaaaaatcaa aaggtcgttt
   109801 tattcaaaac cccaattgtg acatcccttc tctcccactt cacacctcgg aacgcaccct
   109861 tcttatagag ataaacgcgc cttcacatct tcttaacccg aaatggctgg ggagaggaaa
   109921 ggttcctttt tttgagggta ctcccgggaa cagatccagt ggagacgggg tggggcctgt
   109981 agctcagagg attagagcac gtggctacga accacggtgt cgggggttcg aatccctcct
   110041 cgcccacaac cggcccaaaa gggaagtacc tttccctctg ggggtaggaa aatcatgatc
   110101 gggatagcga accaaaagct atggaacttg ggtgtgggtc ttttgtcgaa atggaatggc
   110161 ttttcttttt ctctttttat ttatcgtgaa tgggggaatc attacacata gtatgcccgg
   110221 tcagcatatt tttttgtttt acgccccgta actcttcctc agccaggctt gggcagaata
   110281 gcagagcaag tattagtagc ataacaaaaa agccttcctc gtcattaata tctttgctcg
   110341 cggcaattgt gacctctcgg gagaatcgat gactgcatct ttgatgcagt gctagtatat
   110401 ctgagacttc ttaattggct agttgtaaat agccccaggg ctatggaaca aaggattatc
   110461 tcggacctag accgaggtat tgatggtgat tttctaatct cgcagaacag aatgtgatac
   110521 gatgagatag aatgcaatag aaacaaagac agggaacggg ttacctactc ttaacgggca
   110581 aagcgagccc ctttattctg aattctttaa ttcagaatca atcaaatctc cccaagtagg
   110641 attcgaacct acgaccaatc ggttaacagc cgaccgctct accactgagc tactgaggaa
   110701 caacaggaga ttcgatctca tagagttcaa ttcccgttcc caacccatga ccaatatgag
   110761 ctcgaagctt ccttcgtaac tcccggaact tcttcgtagt ggctccctta catgcctcat
   110821 ttcagaggga acctcaaagt ggctctattt cattatattc catccatatc ccaattccat
   110881 tcatttaata tccctttggt gtcattgaca taacagatgt cgtttctagt ctatctcttt
   110941 ctatttcttt tctatatatg gaaagttcaa aaatcatcat ataataatcc agaaattgca
   111001 atagaaaaga aataagggag gtttgtgatg atttttcaat cttttctact aggtaatcta
   111061 gtatccttat gcatgaagat aatcaattcg gtcgttgtgg tcggactcta ttatggattt
   111121 ctgaccacat tctccatagg gccctcttat ctcttccttc tccgagctct ggttatggaa
   111181 gaaggaaccg agaagaaggt atcagcaaca actggtttta ttacggggca gctcatgatg
   111241 ttcatatcga tctattatgc gcctctgcat ctagcattgg gtagacctca tacaataact
   111301 gtcctagctc taccatatct tttgtttcat ttcttctgga acaatcacaa acactttttt
   111361 gattatggat ctactaccag aaattcaatg cgtaatctca gcattcaatg tgtattcctg
   111421 aataatctca tttttcaatt attcaaccat ttcattttac caagttcaat gttagccaga
   111481 ttagtcaaca tttatctctt tcgatgcaac agcaagatct tatttgtaac aagtggtttt
   111541 gttggttggt taattggtca cattttattc atgaaatggc ttggattggt attagtctgg
   111601 atacggcaaa atcattctat tagatcgaat aagtacattc gatctaataa gtaccttgtg
   111661 ttagaattga gaaattctat ggctcggatc tttagtattc tcttatttat tacctgtgtc
   111721 tactatttag gcagaatacc ctcacccatt cttactaaga aactaaaaga agcctcaaaa
   111781 acagaagaaa gggtggaaag tgaggaagaa agagatgtag aaatagaaac agcttccgaa
   111841 atgaagggga ctaaacagga acaagaggga tccactgaag aagatcctta tccttctcct
   111901 tccctttttt cggaagaaag gtgggatccg gacaaaatcg atgaaacgga agaaatccga
   111961 gtgaatggaa aggacaaaat aaaggataaa ttccactctc accttacaga gacaggctat
   112021 aattgtaatt gtgaattaaa aaaaacagaa aataaggaat ttgattcaca aagttgaaaa
   112081 gagtaagtaa taaactaata aaaagattga aacataagct aaatacaaga aaagataaga
   112141 agagatgcgt ccgcccccta tatatttgat accttctcct acaatgaaac taataacccc
   112201 aaccccgtta tcagtcccat caattactcg tcgatcaaaa aaatgagtaa attcagctaa
   112261 tcctcttatc ccaccaacta agaatcttgt ataaaaagca tctatgtaag cacgattata
   112321 tgaccaatca tatatgccat ttataatttt gtcccacaga attctcttag gacccttttt
   112381 aacaaaagaa ttaattaact caaaattttt taaagaagaa taaatgggtt tatataaaaa
   112441 ggatgctata aatattccga aataagctaa ccatttcgat aatatatcca aattccctcc
   112501 ctcttggttg aaaggaattc ctatagatcc aacaaacaaa gtaaagagtc ctaatacaaa
   112561 tattgggaat agcatagtat tgtccgattc ataaggatag gaataaaccg ctttatgctc
   112621 aaaatgagca atagtcataa aaggtcgtgt catctttctt ccatttttat caattggata
   112681 tttagttttt gcaaaaaaat aagtactttc attattattc atagttaata aacaagagtt
   112741 tttcttaact ccgtttttac cccatagaga tattgaatag aagggggttt tttgtttccc
   112801 accataattt ggaaaatgag cgtttaaatg cccttcaaaa gtaagtaaat agatccgaaa
   112861 catataaaat gcggttaatc ccgccgtggc ccaagctatt attgcgaaaa ttggcgaata
   112921 caaccaacta tcattaagaa tttcatcttt ggaccaaaaa caagcaagag gtggaatacc
   112981 acaaagagaa agtgtaccta ataaaaatgt gattttgcta attggtacat gttttcttaa
   113041 acctcccata agacccatat tctgactttt agctggagaa tatccaacaa tagtttccat
   113101 tgaatgaata atggatccgg atcctaaaaa taataatgct ttggaataag catgagtaat
   113161 caaatgaaat aaagcgcttc gataagaccc cataccaaga gctaacatca tataacccaa
   113221 ttgagacatt gtggaatagg ctaaacctct cttaatgtct ttttgagcaa gagctaaagt
   113281 agctcctaat aatactgtta ttattcctat aaccgagatc aaatacatta tgtaaggtat
   113341 aactctgaaa agaggaagaa gccgagctac aagaaaaatt cccgccgcta ccatagtagc
   113401 agcatgtata agagccgaaa tgggagtagg cccctccatg gcatcaggta accatacatg
   113461 aagggggaat tgggcggatt tagcaactgc accggcaaat aagagaacag cacataaagt
   113521 aacaaataaa aaatcgactt cattattata aatcaagtta ttgaatattt cgaataaatc
   113581 cctaaattcg aaactccctg ttatccaata aaaacctaaa attcctaata ataaaccaaa
   113641 atcccctaca cgattagtta caaacgcttt ttgacaagca tttgccgcaa caggtcgtgt
   113701 aaaccaaaat cctattaata gataggaaca cagcccaacc aattcccaaa aaatataaat
   113761 ttgtatcaaa ttcgaactag taactaatcc caacatggaa gtactgaaaa aactcatata
   113821 agcaaaaaat ctcaaatagc cttgatcatg agccatataa ttatcactat aaataagaac
   113881 cataattcca accgtagtga ttaatattga cataatagaa gtaagtgggt cgatcaagta
   113941 tccgaagtct aaagaaaaat cattattgat gatccaagac catacatatt gataaaaaga
   114001 actgctattt atttgctgaa tagacaggta gattgaaaaa accatgacta tgcttaacaa
   114061 taaaacactc tgaaaagccc acatacggcg aaaacttttt gttgccgttg gaaaaagaaa
   114121 aagtcccgct cctattaaca tagggactgg aagtggaatg aaaggtatga tccacgcata
   114181 ttcatatgtc tgttccataa aaaagttttg aattcttaat taattgtttc cgattcaccg
   114241 gatcttacct cttttgaaag gagtcaataa aaagtcaaaa tatggactaa ctgaaactaa
   114301 tttaaaactt aaatcgaatt ttctattctt acttattctg agtctttgct aaatacttca
   114361 actattgaaa tcaagaagtt acaattggtc aaatgatatg aaagggatta attactagtc
   114421 tcttttgaaa taggcctatt tttctccaag tttgaccagt gaatcgaacg gggattcaag
   114481 tttttcattt catgaagtaa aaatgcggtt cttatcttta aacctttcga ggtattttat
   114541 tgcatgtaaa tgaaatgtgg aaccataaat agaaatcgag tattttttgg attctttatt
   114601 ttattttttc tttttattaa gttcaactaa tttcctttct acagaacagc cgattagcaa
   114661 attctatagg tatagatttt atgaatcaaa aataatgtga aataaagata ccagtcaata
   114721 gagaaccttt tttttacaat tatgaatgtt ttatggaata gaaaaacttg aaaaaaacac
   114781 atattgacct tcttttttta tttccagtat tatgcaattt tcacacatct tttgcctatc
   114841 tcgataatgt tttattttag gacgacacta ttagctcgaa aataaatagt agtaaaaaga
   114901 attcgttttg aacaatagat gtctttcaca tccagctata acaatgagta attttttaat
   114961 ttctaaatgg cagttccaaa aaaacgcact tcgacatcaa aaaagcgtat tcgtaaaaat
   115021 atttggaaaa ggaagggata ttcgatcgca ttaaaggctt tttcattagc gaaatctctt
   115081 tctaccggga attcaaaaag tttttttgta cgccaaacaa aaataaataa gtaataaaac
   115141 gttcgaataa tttgaatcaa cttgaaaaaa gaattcaatt attcttaaat tattcaatta
   115201 gataataatt gaataattta acgatttccc tttcatattt gatattgatt agctcaccaa
   115261 tcaatacgta atggaactcg cttcgctttt ctgattgata gataaaataa tagaattagg
   115321 aaatcctcta tttactgaat aataactttt ttgttgacaa aagagtaaac atcatttcta
   115381 ttccaaggtg gggagtttca ttttccccat cgacctattt gcagaattcc attaaaaaaa
   115441 aattctatat ttccattcta tttccatatc tatagaagaa cgtatataaa aatctttagt
   115501 gaaattagtg aaagttaaga actcattgaa actaattgat tctattttga aacctttttg
   115561 ttttgtctaa ctttctaact ctttattttc tctgaattat tatatagata cccatgtata
   115621 tcttgccctt aacccaatag agaaaattgc ttaatgaaat tctgtatgac tggttgtcaa
   115681 ttttgagcga tgcaaaatag gttcttttct ttctattttg tcttcaaaat ccattttttg
   115741 ttttagattt ctgaaataaa ataaatagga aatagctgat taaacaatga aaacaaaaaa
   115801 tttgggaact ctattcctta attgagtata gaacggttta gttacaagag ttcaattcga
   115861 ggaaagcata aaatatggga aagtcccagg ttaaataaaa aaaactaaga ctctaaactc
   115921 aaatctaaaa taatgaacct tcaacttcaa attcctattt gaacaacttt ttattgttat
   115981 tgatccattt gaatcattac taaactaaaa tagcttcctc aatctcgacg attgcttatt
   116041 cataggctat tatgagttca agacaggccg ctatggtgaa attggtagac acgctgctct
   116101 taggaagcag tgctaatgca tctcggttcg agtccgagtg gcggcatacc gtcttctaaa
   116161 aaggataaat agatcttata atgaattcaa ttcccgattt cctttttaga attatgtaat
   116221 taagggactc ttctttttta agatttttta tgatattttc aaccttagag catatattaa
   116281 ctcacatttc cttttcgatc gtttcaattg taattacaat tcatttgata acctttttag
   116341 tcgatgaaat cgtaaaacta tacgattcat cagaaaaggg cataatagtt acttttttct
   116401 gtataacagg attattagtt actcgttgga tttcttctgg acatttccca ctaagcgatt
   116461 tatatgaatc attaattttc ctttcatgga gtttctccct tattcatata attccgtatt
   116521 tcaaaaaaaa tgttttaatt ttaagtaaaa taactggccc tagtgctatt tttacccaag
   116581 gctttgctac gtcaggtatt ttaactgaaa tacaccaatc tgtaatatta gtacctgctc
   116641 ttcaatccga gtggttaata atgcacgtaa gtatgatgat attgggctat gcagctcttt
   116701 tatgtggatc attattatca gtagcacttc tagtgattac atttcgaaaa aacagacagc
   116761 ttttttataa gagcaatggt tttttaaacg agtcattttt cttgggtgaa aatgttttac
   116821 aaaatacttc ttttttttct gctaaaaatt attacaggtc ccaattgatt caacaattgg
   116881 attattggag ttatcgggtt attagtttag gatttacttt tttaaccata ggaatccttt
   116941 cgggagcggt atgggctaat gaagcgtggg ggtcgtattg gaattgggac ccaaaagaaa
   117001 cttgggcatt tattacttgg atcgtatttg caatttattt acatactcga acaaatagaa
   117061 atttgcgggg tgcaaattct gcaattgtag cgtctatagg ctttcttata atttggatat
   117121 gctattttgg ggtcaatctt ttaggaatag ggttacatag ttatggttct tttccatcaa
   117181 catttaattg aattcaagac aagttattac aaatacaaga gcgggcggcg cattgtatga
   117241 accagcgtgc ggaccgtgtg aatcatcaat acaatatttg attcacacgg ttttctacca
   117301 tatgtagttc aatttcattg tttttactta acttaagagt taagagaaga aaaaaagtct
   117361 tctttttttc attgtccaag aatgtttttc aaaacaaaca taggtttttt ttatttcagt
   117421 catccaaatt atctataaaa aaaattagat agaataactt cgaccttgtc aactgctaat
   117481 gaaagaacga aatccgggta tataccaata cctattacgg gtaaaaagat ggagatcgaa
   117541 agaaataact ctcgcggtcc agaatcaaaa aaagaatcct tcggggcatt aaatagcttg
   117601 tatccataga acatctggcg tgacatagat aatgaataaa taggagttaa tatcattcca
   117661 attgccatta caaaagtaat tagtattttt ggaattaaaa gatatttttg gccggtaatt
   117721 attccaaaaa atactatcaa ttcggcaaca aaaccactca tacctggtaa tgcaagggaa
   117781 gccatcgaaa agctactgaa catcgtgaac atttttggca ttggaatagc tattccgccc
   117841 atttcgtcaa gataaacaag gcggattcta tcataagtcg ttcccgccaa gaaaaaaagt
   117901 gcagcaccaa taaatccatg agatattatt tgtaaaaggg ctccattaag tcccgtgtcg
   117961 gttagagaac taattcctat aattatgaaa cccatatgag agacagagga ataggctatt
   118021 ctttttttta aattccgttg gccaagagat gttaaagctg catagattat ttgtattgta
   118081 cctattatca tcaaccaagg agaaaatata gaatgggcat gaggtaataa ttccatattg
   118141 attcgaatta atccatacgc tcccattttt aataaaattc cggctagaag catacaagta
   118201 ctgtaatgtg cttctccatg ggtatctggt aaccatgtgt gtagggggat aatgggcgat
   118261 ttgacagcaa aagcaataaa aaatccaata tagaatatta tttctaaaac cacaggatat
   118321 gactgattaa ctgatgtttc aaaatttaat gttggttcat tagaaccata taaagcaaga
   118381 cccaaaactc ccattaagag aaaaacagaa ccccccgccg tgtacaaaat aaattttgta
   118441 gctgagtaca gacgtttctt tcctccccac atgcatagaa gtagataaac aggaattaat
   118501 tctaactccc acatgatgaa aaaaagtaaa aggtcccgag acgaaaatga tccaatttga
   118561 ccactgtaca ttgctaacat gagaaaatgg aataatcgag aatctcgagt aactggccaa
   118621 gccgctaaag tagctaaagt agtgataaat cctgttaata aaatgggtcc tatagaaagt
   118681 ccatctattc ctaatctcca atggaaatca aaaaaattga tccatttata atcctccact
   118741 agttggatta atggatcatc cgattggaaa tgataacaaa atgcataagt cgttagaagg
   118801 agttctaaaa tacatataca tatcgtatac cacctaatta ccctatttcc tttatgggga
   118861 agaaagaaaa ttaaggaacc cgcaaatatt ggaaaaacta caattattgt taaccaagga
   118921 aaataattcg tagtaaagac aagatacact tggaccataa aaacccgtgc tcaaaatatt
   118981 gtgattttcg agcacaggtt tgtcggtaaa aaaaattaaa tggattcaag tagagttttc
   119041 tcgaacgtat caataagcta gacccatact gcgagttgtt tcatgccata aataaactcg
   119101 gacactcaag aaatctgttg gacaggcgga ttcacatctc ttacaaccaa cacagtcctc
   119161 tgttcgtgga gcagaagcaa tttgtttagc cttacaaccg tcccaaggta tcatttctaa
   119221 tacatcggtg ggcaggctcg gacacattga gtacatccta tacacgtatc ataaatcttt
   119281 actgaatgtg acattgggtc tatacgtttt tgaatgttag aaattttcga tctagtaaac
   119341 ttagaaacga atcatataat catatattta tataccagat gaatcaatga gttatcataa
   119401 ttttctaatc aacccccttc tggattggtt tatgagatat gagagagggc caaaatactt
   119461 tgatttctta tgttttgcaa acaagatcac accttacgta gcaaacatgc taattaaaat
   119521 cgatttatca atattagaat ctagatgatt aatactaatt attcaacaaa tttgattggt
   119581 tgatacgagt tgattttctg ttacggtaaa ttgatgaaac aatagccagt ccaatggctg
   119641 cttcagcggc tgcaatagct ataacaaaaa ttgagaaaat gtctcctttt aattgacgat
   119701 tatcaaaaaa atcagaaaat gttacaaaat ttatattaac cgcattcaat ataagttcaa
   119761 gacacataag ggctctaacc atatttcgac ttgtgatcaa tccatagatc ccgatagaaa
   119821 ataaataggc actcaaaaca agtacatgtt cgagaatcat taaacaactc cttatcaatc
   119881 tcgactcctt tcaatatgaa caacaattca accgatttaa ttgactagta tataacaagt
   119941 atggaacaaa gaaatatatt ggtactagat tgacctaaag tctttctatt tatacaacag
   120001 gaattcaaat agaattgaag gaaaatgaat gtgataagac agaacaaaat tttatttgaa
   120061 ttccaagttt taatagaaat tttttattga cgagctacag caattgcacc tattaaagca
   120121 actaaaagga ttattgaaat cagttcaaat ggaagaaaaa aatctgttga taaatgaatt
   120181 ccaatttgtt gactattact tataaaatct tgctctataa tctggtttga tcttgtagtc
   120241 caaataatcc cgtaccatga cgtatctgaa atagtagtaa ttagtgaaat aaaaagactt
   120301 atacaaacca tcgaagtaat tccatctcct acggtccaaa gatgaaaatc tttgtaatat
   120361 tctgaaccat tcatgaacat cacagcaaaa atgattaaaa catttatagc tcctacgtaa
   120421 ataagtactc gcagcagcta caaaatagga gttagataga atatagaata acgatgtaca
   120481 aacaagaacc aatcccaagg aaaaggcaga ataaattgga ttgggaagta ataccactcc
   120541 tagaccccct aatataagac ccgaccctag aaagactaaa agaaaatcat gtattggttc
   120601 agataaatcc attttttatc aaaaatcaaa aacgaagaat ttcatgactt tattgacctg
   120661 accaggaaaa aagaagtttt tcaatttttt atgatacttc ttaattgtta attgaatgaa
   120721 attgtaatgg gtatgaattg acgtagatgc ttttatttta ttggaccact atcaattctt
   120781 tattcgtcga acgagtagtt taaacctatc gattttggat atcatttatc tactttgaaa
   120841 ccattactat tattataact ataatataga aatccgtttt gttttcaatc taaattaagc
   120901 taggagtctc attaaccaac cactagtttg aattgaacaa gcaaaaatat cattctttta
   120961 gatccgaact aagccttcgt aattcggaat ttttttcgaa tttagggttt attcattttt
   121021 tatttgaggt aaattcgaaa ttgttcgaat tgtgtaatca tcaattactg acattggtaa
   121081 gcgacccaaa gcgatttgat tataattcaa ttcgtgacga tcataagtag aaagttcata
   121141 ttcttcggtc attgataaac aatttgttgg acaatactca acgcaattac cacaaaatat
   121201 acagattcca aaatcaatac tgtaattaag caatcgtttc tttcgaatat cagtttccaa
   121261 cttccaatca acaacgggta aatctatagg acatacacgc acacatactt cacaagcaat
   121321 gcatttatca aattcaaagt ggattcggcc tcggaaacgt tccgatgtga tcaatttttc
   121381 gtaggggtat tgaatagtta caggtaaacg atttgcgtgg gacagggtaa tcatgaaacc
   121441 ttggccgatg tatctggcgg ctcgtattgt ttgttgacca taatttatga attcagttat
   121501 catagggagc atatttagaa tatctataaa aaagatttta tgcttgtttc tttctcttgt
   121561 ttgagacaag tcgtgaatct agaatattgt agtcttttac agtgaaagaa gttgggacga
   121621 ggttgtcaat aatagattac ctagagaaat aggtaaaaga aatttccacc caagatttaa
   121681 tagttggtcc attctcagcc tcggtaaagt ccatcttgtt gcaataggaa tgaacaaaaa
   121741 caaataagtt ttggctaatg tgataaagat accaattagt gttccaaaga ctttacccct
   121801 tttatttatg ccaaatagct caggaacaaa tatgtacgga atagaaagat tccaacctcc
   121861 caaataaaga actgttacaa ataatgaaga aactagtaga ttcagatatg aagcaactgt
   121921 aaaatcaaac caaatttgat acctgaatat tcggtttgat accctgctac taattcttct
   121981 tctgcttctt ggtaaatcaa aaggtaatct ttcacactcg gctagagaag aaattagaaa
   122041 aacgataaac ccgatgggtt gacgccacaa attccacccc caaaagccat attttgactg
   122101 cgcttccact atatcaactg tacttaaact gttagataat catagtcgat gataacatca
   122161 ctgtgcccat cgctattaca gaaccgtacg tgagattttc atctcatacg gctcctcaga
   122221 ggtcacaaat aaatctaagg accctttcct attctttatc ttgatatgtt tgtcagatag
   122281 agtaaaaatc tatcctaagg tcccaaatta gaccaatgga attctgtctg ctatatttaa
   122341 aactaataaa tacgggcttc tgaattgatc tcatctttta agaattttca tttttctttg
   122401 ttgattaata accttatcat taaataaaat gcgctttata gcaatatcac atatacattt
   122461 caacctcgaa ttctcaatta cgaaaaaaat tagagagtcc attagttcat gaatcatgac
   122521 aaaaaatttc tctctcgaac tagaaatcaa aatggaatta taggaaagaa agaataaaaa
   122581 caaaaaaaga aaaaagtaag aaaaaaaaag acatcccccc tttttgcttt tgcaattaga
   122641 ttcttttctt tctatttcta ttttatttca ttcctattct cctttctcag aaaaagggcc
   122701 tttaaccaaa gtaaaagatt acttcgttct tgatagttat ttacttactc agtggatagg
   122761 aacatactct ggatcagaat catggggagt acttcttgat catttctacg aacgtaaagc
   122821 cccaattcga attcctttta tgtacagaaa tatcctcttg gataacttac ataatctcaa
   122881 ttactaatcc tttgtgtatc ttggtcttcc taaccatcca ctcatttttg ctttcaacct
   122941 cccgttgtgg aaatccatct atggtaatag acagtaaaaa ctccatacag ttgatctttt
   123001 gaacccgctt caagctatca tgacaattca ccaatcttgg ggtaaacaat ctctattgct
   123061 tatgtttact tttttcacca tttgattctt gtacatagga aatgagactc aaccttttta
   123121 ctgcaaattt agaagccgtt ttctttcact catataacta tctggtttag ttcatcaacc
   123181 caaatgctga ataaaaatga aaatatatat attcaatcaa atctttttac ctttgtttct
   123241 agaaagaaaa gaatttggag aaattttagg tctcaccgaa tcacacgtag agatattgat
   123301 aacacacata gagctaatgg tattttcata actaattgat tgagcagctg cccgtagacc
   123361 acctaaaaaa gaatatttat tatttgatcc atatcccgac ataagaagtc caacgggagc
   123421 aatacttgaa atggcaatcc agaaaaaaac accaatacta agatcggcta gaacaaggtg
   123481 atcaccaaaa ggaattactg aataacttag aaagatggat attactgcta tggatggtcc
   123541 gatactgaat aaacgagtat ctcctgtaga tggaataagg ttctctttca aaagtagttt
   123601 tgtcccatct gctagagctt gaagaattcc taaagggcca gcatattcag gtccgatacg
   123661 ttgttgtatt cctgcagata tttctctttc taaccaaaca attactagta cacctattgt
   123721 gattcctaat acaagagtca aaatagggaa aagcatccat atgatcccat agacttcttt
   123781 taaggattcc aatttggaaa aagaattgat agtttctatt tctgttgtat caattatcat
   123841 ttcaacgatc aacttctccc ataatgatat ctatgctacc tagtattgtc ataatatcag
   123901 ccaatttcat tcttttaact aactgaggaa gaatttgcaa attgataaaa cctggtgggc
   123961 gaattttcca tctccaagga aaaacgctct gatctcctat gagaaaaatt cccaattctc
   124021 cttttggggc ttcaactctc acataaagtt cttgtttcga caattcaaaa gttggagaag
   124081 gttttttact aataaaccga tattcaaaat cattccattc aggatctttt aatctgtcaa
   124141 aacgtcggat ttctaaattt tcgtaaggcc ctcctggaat tccttccaga gcctgttgaa
   124201 taatctttat ggattctgtc atttcaccga ttcgtactaa ataacgagct aatgaatccc
   124261 cttctcgttg ccattgaacc tgccaatcaa attcgtcgta agactcataa tgatcaactt
   124321 tacgaagatc ccattctatt ccggaagctc gtagcattgg tcccgataac ccccaattta
   124381 atgcttcgtc tcccccaata atgcctacgc cttcaactcg ttctaaaaaa ataggattcc
   124441 gggtaataag tttttgatac tcagcaaccc ctgttaaaaa ataatcgcaa aaatccaaac
   124501 atttatctat ccagccatag ggtagatcgg cagccactcc cccgatacga aaataattat
   124561 gcatcattcg cataccggtg gcagcttcga agaggtcata tatcaattct ctttctcgaa
   124621 aaatatagaa gaaaggggtc tgcgcaccaa tatccgccat aaaagggcct agccataaca
   124681 aatgagaagc tatccgactc aactccaaca taatgactct gatatagcta gcccttttag
   124741 gtacttgaat attgcctaat tgttcgggtc catttatggt tattgcttct gtgaacatag
   124801 tagctaaata atcccaacgt gttacataag gcaaatattg tataattgtt cggttttccg
   124861 caattttctc catccctcta tgtaaataac ccaatattgg ttcgcagtcg acaacatctt
   124921 caccatctag agtaacgatg agtcgaagaa caccgtgcat tgatgggtgc tgaggcccca
   124981 tattgactat catgaggtct tttcttgtag ttggtgcagt cataagtttt ttaccgattc
   125041 attcttccat gaattgctga aagtgaaaag aagttcatca aaatttaatc gaaacatata
   125101 agtgaaaatg aaatgactct tcaaataaat caaattaacg agtttttgtc tctcgaatgt
   125161 ccaactgatt aattaattct ttataacgta ctctattttt ttttgacaaa taagctagga
   125221 gtcgttgacg ttttcccaaa attttcttca aacctctctg agataaatag tcttttttgt
   125281 gcaattctaa atgtgaagta agtctccgta tcttattggt gaaattgaat acttgaaatt
   125341 caacagatcc tctcttttct tcttgagaaa taactgaaat gacagaattt tttaccataa
   125401 aagaatttcc cctttcttta ttttacagat atggatttta tcgaatttta tcgatcagta
   125461 ataataatgc cagtaatttg aacgtggtat atagacttaa tttctttatg aactcctaat
   125521 tttatcaatt ccaataaatt aatcaaattc aaaatttgat tcagatagga atccaaaaag
   125581 atggtaggta cttttttttt cattcacaaa agcgactaat ttaaacctaa aatcctaaaa
   125641 tgaagaagat tttgttgatt cctttctaga tctaatcgat actttattga tttagtatcg
   125701 tctactcgaa ttagattcga atgagatgta agaaaaagca tgtgtacatt tgtttacttt
   125761 cagatactct atacgaaaca ggatatatag tactatcaat ttattttcaa ttgtggatac
   125821 atatgtatcc ttaagatact gaaacgacta ccattattgg tatcaaacca ataacgattc
   125881 atacaagcta aatcttctaa tcgataatta ggccaaagaa agaacttcaa tttaattaat
   125941 tcatttttct ctttataaag aggtttcctt tcatccaaaa attgactcca gttttttaca
   126001 ttgttttcgt tgcaaaatac tgaatttcta tcgatgccat tccaattcaa agaattaaac
   126061 aaacttcgaa ttctcaattc tctacgacgt ctagaccata aaatattttc aggaacaagc
   126121 aaatcaaaat gatttttgtc tgtatttatt ctttgagttt gaggttgcag aatgaattca
   126181 tcaaaattct ttttatcaac atatctttgt tcggggtatc tttgattagt ttggtgttta
   126241 cttttatgaa ccaatgaaat acctatggtt tgatacataa taaattgtcc attatttttt
   126301 acagacaacc gaataggttc gataattaat atccccttct tcatcaattc tgtaagagtt
   126361 aaattcttct gaatcagcat tatatccaaa ctcatttctc tcctttgaat tgacgatata
   126421 gcaattttgc ttggatttat cagtcgaagc aggagacaat ataccttgat attctcgatc
   126481 attctttgat tcaaagcatc gttccatctc aattgaaaaa gcaaataacg tttcaagaac
   126541 aaatctagtt ctgcttccgt gttgcttttg tattgttttt tctttttacc cttctttgtg
   126601 tctgattccg cgtaatcttt tttaagagcg ttttgatgtt ttgagagaac agggcccaga
   126661 tttcctttgt tttctatatc tgatccacgc tctttttctc cttgacttgc gggttctttt
   126721 gcttcttgaa ttcgattctt tattttttta tttgatcgta gaaaaaagtt ttgtttttgg
   126781 tttttattga tgtttttatt tgactaacat tttcatttgt attcaaattt aaaagaagta
   126841 atttgcttgg tataatccac ggttttattt tatatacatt ataaagtggt acaaattctg
   126901 ggaagaacca aaattccaga ttcaatatgg gacgatttaa tattttttca ttcattccca
   126961 tccaatcaaa aaaggctttt ttcgaatttt tttgattgtt ttctggattt tgatgaatcg
   127021 taagataaaa aaagcctttt ttatcaattt tatcaattat ttgataatta ttaataccaa
   127081 ttttagtatt tggattactg ttggtatcga tcttaaccca ggcctcaata tcttcttttt
   127141 gtctaagaga aaaatggata attttccaat caaaatattt tctatcgaga tttctttcta
   127201 tatatagaat attgcctttt cttagataat tattgatatg aagattgccg agcatatcaa
   127261 aaaggttgtg tttggacgtg ttggaattag aagaaatttc gaggttctta tttacttgaa
   127321 agggtaatct agaaataaaa gagtcatttt ttttttcata attaatcgat ttatatgcta
   127381 aaagatcata tctataacat ttttgaaaat tatctttttg gtttgctaat gaatagagct
   127441 cagaatcatt ttcttttttg taatgaatta attggtcttt ttcatatgaa ttccatttgt
   127501 ttaaatttcg attttgagcc atacaacctt gattaaccct atttcgccat ttttgtggca
   127561 ttaatctaga ccatctaatc tgagataaat cgtattgata atgccgtctt aaccagtttt
   127621 tccattgatt gattctataa ctctgaagtt tcttatgttt taattcagaa tgaaatattc
   127681 ctagtgttcg aaaatagtcc tttattttag tcttaaggaa aaaagacgtt ctgttatatt
   127741 gaagaacaga tcttaattta gacaaattaa taacttgggg ttgtgataat ttgtaaaata
   127801 cgatatgctt gtgataagta ggataaatca aaaaaaatat gtgaattttt cttactaata
   127861 ttataaagtg acttttttat agtcgaaata aagtgaattt ttttttgatt attaattttt
   127921 tcttgattta tttcattatt ggaaatgtat ttatcaatca atttgtttgt tgattcaaga
   127981 aagagttgtg tattaattct gggaatatta atgatagata aaaatagatc gatgtataat
   128041 ctttgaatga ataattttag aaaataatgg aatttccata ttaatcgagt atttcttctt
   128101 tttaatattt ggaaaatctt ttttggcgat tcgaattttt taatattatt tgttttatta
   128161 ggactaatgt ctatttctgg agttactttc tttttctctt ttgtaattct ttctatttga
   128221 tttttgattg tacttgttct atcagtcaaa tccttcattt tgctttctat cagtgaagaa
   128281 tttggccaat ttccagattc aatttgacta aatgattcgt taattatctg attactcatt
   128341 agagaatctt tttctttttt cgtttcattc gattcatcta tttctttgag tctaaataat
   128401 acaattggat ttacttttga aagttctttt ttcatttttt ttataaatag actacttttg
   128461 ataagccatt ttttggtttc ttttgaaatt cttcgaaata attttatttt tcctttgaaa
   128521 acttttagag ttataaaata tttctttttg aattttccaa tttttttttc gagttcctta
   128581 aaaatgggct caaaaaaaga agggcgtttt cggggagaac caaagggaag ttcagcttcc
   128641 attccccaaa ctgttaaaaa acaaaaatca tctttttgtt ttttcttttt cattagctct
   128701 ccacgggagg agtacagttt agatatatgc caaggtttca gacaaaaagg aaataatatt
   128761 ttgatctgaa tgccatcttt caaccaattt tttggaaatt ctgtttctga taattgaaca
   128821 ccattataag tacatttaat atgcatttct ctattccatt cctgcaaatc ttcagaccat
   128881 tcaggaagtt gcaagactaa catacgcccg agatttttgg ctattatcaa tgaaggtaat
   128941 acaatatatt ttcgaagaat tgattgagtt attaacatgt aacctcttat tatttgcgca
   129001 aaaggaatgg tatcccaggc ttctgctatc tctatccgtg ctttttcctt tcttttgttc
   129061 tccccttttt tgtccttttc ctttttctct tctctttttg tttgttcttc tctagactct
   129121 agaatcttga attctccttc tttacctgac caatttcgaa aaattggttt aatcagtcca
   129181 gagatatcaa aagaaaaaag aaaggggggg gttattctgt caagaaaaag gggggaatgc
   129241 acatttgctt gaaagagttt ccaaataact gttttgcgcc tttgagcccg catagagcct
   129301 ttgattatac ctcgccgaaa atctggttgt tgcgaatagc gtattaaagc cacttccttt
   129361 gtttgatctt gatctgcggt atcagtatct ttggtatcag gatcgttatt ctggttgttg
   129421 gcagtaaaaa tcactacacg tttggctttt cttgaacgaa tttgatgatc cagtggtacg
   129481 ccctcttgat agtcacccga ttgttgttcc aattcggtga ttaatttatg tgaccagcga
   129541 ggtatttttt tactgatttc ttttattcca atcgattttt tttcagatgt tgtcccatta
   129601 ggagcaattg cattgaatac aaattttaca aatttagttc ttttttctga attcactctt
   129661 ccctgttctt ggtctgaaaa taaagaaagg tctttcaaat ttaaactcga ttttggttcg
   129721 ttaccaaatt cattgattaa agttaagaac tcgtcaattt ctgttgataa tggtttttta
   129781 gcaaccgtat ccactttttg ttccaattct tggtaatcag tattcggaag aaagatagta
   129841 tgaatcctat ttattctaac cctctctttc aaattttcta gcgaagtatt gtttatgatt
   129901 gaaggtgaaa actttttttt gattgttcct cgatatggtc catttaacaa aggatcatac
   129961 attttaggca cgtattcttt tttagtatca tcattacaca atctagtcct tgtttcaagt
   130021 atatcgagag aaaaagattc cttgtctaga acttcaagtc gatttaaaaa ttccttattc
   130081 agattattac ttttttcttt gttggtagaa atccactgat tgtccagttc attagggagt
   130141 gttttttgga gtgacaatag gggtatcctt ctttttatca ttttccaaaa agttgataaa
   130201 cttggcgggt atgtaaaaga tattctttgt tttccatcac ttttacatgt gttaaaaaaa
   130261 tattgtgaca tttccgttct tatggcctgt tcaaatcgat tattctttat gtagcgaaat
   130321 ggtcgattcc atcgattata atcgaaaaga agactcacaa gaggctgttg aaaccagaag
   130381 aggtctttat tttcattttt tttatcaagc agttgcaatt taaaaatttc tgtattcccc
   130441 gtgttattat tattcagata agaatcctca taatcataaa ttggactatt actagtatta
   130501 atattattat agcctgtctc tgtaaggtga gagtggaatt tatcctttat tttgtccttt
   130561 ccattcactc ggatttcttc cgtttcatcg attttgtccg gatcccacct ttcttccgaa
   130621 aaaagggaag gagaaggata aggatcttct tcagtggatc cctcttgttc ctgtttagtc
   130681 cccttcattt cggaagctgt ttctatttct acatctcttt cttcctcact ttccaccctt
   130741 tcttctgttt ttgaggcttc ttttagtttc ttagtaagaa tgggtgaggg tattctgcct
   130801 aaatagtaga cacaggtaat aaataagaga atactaaaga tccgagccat agaatttctc
   130861 aattctaaca caaggtactt attagatcga atgtacttat tcgatctaat agaatgattt
   130921 tgccgtatcc agactaatac caatccaagc catttcatga ataaaatgtg accaattaac
   130981 caaccaacaa aaccacttgt tacaaataag atcttgctgt tgcatcgaaa gagataaatg
   131041 ttgactaatc tggctaacat tgaacttggt aaaatgaaat ggttgaataa ttgaaaaatg
   131101 agattattca ggaatacaca ttgaatgctg agattacgca ttgaatttct ggtagtagat
   131161 ccataatcaa aaaagtgttt gtgattgttc cagaagaaat gaaacaaaag atatggtaga
   131221 gctaggacag ttattgtatg aggtctaccc aatgctagat gcagaggcgc ataatagatc
   131281 gatatgaaca tcatgagctg ccccgtaata aaaccagttg ttgctgatac cttcttctcg
   131341 gttccttctt ccataaccag agctcggaga aggaagagat aagagggccc tatggagaat
   131401 gtggtcagaa atccataata gagtccgacc acaacgaccg aattgattat cttcatgcat
   131461 aaggatacta gattacctag tagaaaagat tgaaaaatca tcacaaacct cccttatttc
   131521 ttttctattg caatttctgg attattatat gatgattttt gaactttcca tatatagaaa
   131581 agaaatagaa agagatagac tagaaacgac atctgttatg tcaatgacac caaagggata
   131641 ttaaatgaat ggaattggga tatggatgga atataatgaa atagagccac tttgaggttc
   131701 cctctgaaat gaggcatgta agggagccac tacgaagaag ttccgggagt tacgaaggaa
   131761 gcttcgagct catattggtc atgggttggg aacgggaatt gaactctatg agatcgaatc
   131821 tcctgttgtt cctcagtagc tcagtggtag agcggtcggc tgttaaccga ttggtcgtag
   131881 gttcgaatcc tacttgggga gatttgattg attctgaatt aaagaattca gaataaaggg
   131941 gctcgctttg cccgttaaga gtaggtaacc cgttccctgt ctttgtttct attgcattct
   132001 atctcatcgt atcacattct gttctgcgag attagaaaat caccatcaat acctcggtct
   132061 aggtccgaga taatcctttg ttccatagcc ctggggctat ttacaactag ccaattaaga
   132121 agtctcagat atactagcac tgcatcaaag atgcagtcat cgattctccc gagaggtcac
   132181 aattgccgcg agcaaagata ttaatgacga ggaaggcttt tttgttatgc tactaatact
   132241 tgctctgcta ttctgcccaa gcctggctga ggaagagtta cggggcgtaa aacaaaaaaa
   132301 tatgctgacc gggcatacta tgtgtaatga ttcccccatt cacgataaat aaaaagagaa
   132361 aaagaaaagc cattccattt cgacaaaaga cccacaccca agttccatag cttttggttc
   132421 gctatcccga tcatgatttt cctaccccca gagggaaagg tacttccctt ttgggccggt
   132481 tgtgggcgag gagggattcg aacccccgac accgtggttc gtagccacgt gctctaatcc
   132541 tctgagctac aggccccacc ccgtctccac tggatctgtt cccgggagta ccctcaaaaa
   132601 aaggaacctt tcctctcccc agccatttcg ggttaagaag atgtgaaggc gcgtttatct
   132661 ctataagaag ggtgcgttcc gaggtgtgaa gtgggagaga agggatgtca caattggggt
   132721 tttgaataaa acgacctttt gatttttcat tttttttttt ttcgttttca tattgaaaaa
   132781 gtaataagaa tgagaggtgt taagcttttt atcatcctgg cgtcgagcta tttttccgca
   132841 ggacctcccc tacagtatcg tcaccgcagt agagtttaac caccaagttc gggatggatt
   132901 ggtgtggttc ctctacgcct aggacaccag aatatcgaac catgaacgaa gaaaggcatg
   132961 agagaaaagc atattggcta gtgattgtga ggccccaatt cttgactgga ggggacacca
   133021 aaggcctctg cccttccatc ccttggatag atagagaggg agggcagagc ttttggtttt
   133081 ttcatgttgt caaagagttg aacaatggtt ttttcgtgtt gtcaaagatt tgaacaatga
   133141 aaatagatgg cgagtgcctg atcgaattga tcaggtcatg taggaacaag gttcaagtct
   133201 accggtctgt taggatgcct cagctgcata catcactgca cttccacttg acacctatcg
   133261 taatgataaa cggctcgtct cgccgtgacc ttctcttgaa ttctcaaaaa aacttctgtc
   133321 gctccatccc cgcaggggca gagaacccgt cgctgtctcg gctgtgctac cggaggctct
   133381 ggggaagtcg gaataggaga gcactcatct tggggtgggc ttactactta gatgctttca
   133441 gcagttatcc gctccgcact tggctaccca gcgtttaccg tgggcacgat aactggtaca
   133501 ccagaggtgc gtccttcccg gtcctctcgt actagggaaa ggtcctctca atgctctaac
   133561 gcccacaccg gatatggacc gaactgtctc acgacgttct gaacccagct cacgtaccgc
   133621 tttaatgggc gaacagccca acccttggaa catactacag ccccaggtgg cgaagagccg
   133681 acatcgaggt gccaaacctt cccgtcgatg tgagctcttg gggaagatca gcctgttatc
   133741 cctagagtaa cttttatccg ttgagcgacg gcccttccac tcggcaccgt cggatcacta
   133801 aggccgactt tcgtccctgc tcgacgggtg ggtcttgcag tcaagctccc ttctgccttt
   133861 gcactcgagg gccaatctcc gtccggcccg aggaaacctt tgcacgcctc cgttaccttt
   133921 tgggaggcct acgccccata gaaactgtct acctgagact gtcccttggc ccgtaggtcc
   133981 tgacacaagg ttagaattct agcccttcca gagtggtatc tcactgatgg ctcgggcccc
   134041 cccggaagga ggccttcttc gccttccacc taagctgcgc aggaaaggcc caaagccaat
   134101 cccagggaac agtgaagctt catagggtct ttctgtccag gtgcaggtag tccgcatctt
   134161 cacagacatg tctatttcac cgagcctctc tccgagacag tgcccagatc gttacgcctt
   134221 tcgtgcgggt cggaacttac ccgacaagga atttcgctac cttaggaccg ttatagttac
   134281 ggccgccgtt caccggggct tcggtcgccg gctcccctgt catcaggtca ccaacttcct
   134341 tgaccttccg gcactgggca ggcgtcagcc cccatacatg gtcttacgac tttgcggaga
   134401 cctgtgtttt tggtaaacag tcgcccgggc ctggtcactg cgaccccctt tgtgaggagg
   134461 caccccttct cccgaagtta cggggctatt ttgccgagtt ccttagagag agttgtctcg
   134521 cgcccctagg tattctctac ctacccacct gtgtcggttt cgggtacagg taccctcttg
   134581 ctcaaagtcg ttcgagcttt tcctgggagt atggcatggg ttacttcagc gccgtagcgc
   134641 ctggtattcg aacattggct cgaggcattt tctctacccc ttcttaccct gacaaagcag
   134701 ggacacctta cgttcttgaa ccgataacca tctttcggct aacctagcct cctccgtccc
   134761 tcgggaccaa caaggggcag tacaggaata ttcacctgtt gtccatcgac tacgcctttc
   134821 ggcctgatct taggccctga ctcaccctcc gtggacgaac cttgcggagg aacccttagg
   134881 ttttcggggc attggattct caccaatgtt tgcgttactc aagccgacat tctcgcttcc
   134941 gcttcgtcca ccaccgctcg cgcggaggct tctctctaag gcggaacgct cccctaccga
   135001 tgtattttta catcccacag cttcggcaga tcgcttagcc ccgttcatct tcggcgcaag
   135061 agcgctcgat cagtgagcta ttacgcactc tttcaagggt ggctgcttct aggcaaacct
   135121 cctggctgtc tctgcacccc tacctccttt atcactgagc gatcatttag gggccttagc
   135181 tggtgatccg ggctgtttcc ctctcgacga tgaagcttat cccccatcgt ctcactagcc
   135241 gaccttgacc cctgttattt tgaggtcata tctagtattc agagtttgcc tcgatttggt
   135301 accgctctcg cggcccgcac cgaaacagtg ctttacccct agatgtccag tcaactgctg
   135361 cgcctcaacg catttcgggg agaaccagct agctctgggt tcgagtggca tttcacccct
   135421 aaccacaact catccgctga ttcttcaaca tcagtcggtt cggacctcca cttagtttca
   135481 cccaagcttc atcctggtca tggatagatc acccaggttc gggtccataa gcagtgacaa
   135541 ttgccctatg aagactcgct ttcgctacgg ctccggtggg ttcccttaac caagccactg
   135601 cctatgagtc gccggctcat tcttcaacag gcacgcggtc agagccctgg ctcctcccac
   135661 tgcttgggag cttacggttt catgttctat ttcactcccc gatgggggtt cttttcaccc
   135721 ttccctcacg gtactacttc gctatcggtc acccaggagt atttagcctt gcaaggtggt
   135781 ccttgctgat tcacacggga ttccacgtgc cccatgctac tcgggtcaga gcataagcta
   135841 gtgatgcttt cggctactgg actttcgcca tctagggtgc agcattcggg ctgcttcgcc
   135901 tagcagcacg acgcttgtat tgctctccca caaccccgtt ttcacggttt aggctgctcc
   135961 catttcgctc gccgctacta cgggaatcgc ttttgctttc ttttcctctg gctactaaga
   136021 tgtttcagtt cgccaggttg tctcttgcct gcccatggat tcagcagcag ttcgaaaggt
   136081 tgccctattc gggaatctcc ggatctatgc ttattttcaa ctccccgaag catttcgtcg
   136141 attactacgc ccttcctcgt ctctgggtgc ctaggtatcc accgtaagcc tttcctcgtt
   136201 tgaacctcgc ccttcacttt taaggctatg ccatcctaag gtgctgctaa atggatggat
   136261 cttatcaacg tccatgaatg ataaatcata gatcgaaccg ccgaatcgga aaaattgggt
   136321 gctatcataa agctttgtat cggctaagtt cacgagttgg agataagcgg actcgaaccg
   136381 ctgacatccg ccgcagggta aaccaccgcc tctcaggtcc cccgactgat tctaccatag
   136441 aggccaacga tagacaataa ctcccccccg aacacagctt acaactttca tcgtactgtg
   136501 ctctccaaag agcaactctt ctcaaaatct cactcaaaag gtgctgagtt ggaatcccat
   136561 tctaactaag aatgagtcat tgcccttctc cgaccctgac tgcccaacct gagagcggac
   136621 agctaatgcg ttccacttat tgaacagggt tctatggtcg gtccgtgacc cctggatgcc
   136681 gaaggcgtcc ttggggtgat ctcgtagttc ctacggggtg gagatgatgg ggtcggtcca
   136741 tggattttcc ttccttttct tttgccgcat ttcgctcaaa gggttgaagg gagatagtgc
   136801 atcaagctgt tcgcaagggc caacttgatc ctcttcccca gagatctcag atgagggaac
   136861 cctgggagag ccgccgactc caactaccgt ccatgtacga tccatactag atctgaccaa
   136921 ctgcccatcc tacctcctct acgttcttga cagcccatct ttgtctcagt agagtctttc
   136981 agtggcacgt ttcggtcctc ttccccatta cttagaaaaa gtgagccacc ggttcaggta
   137041 caagatacta tcattaccgc ctggacaatt agacatccaa cccgtaatcg caacgaccca
   137101 attgcaagag cggagctcta ccaactgagc tatatccccc cgagccaagt ggagcatgca
   137161 tgaagtagtc agatgcttct tctattcttt tccctggcgc agctgggcca tcctggactt
   137221 gaaccagaga cctcgcccgt gaagtaaatc atcgcaccta cggtccaacc aattgggaga
   137281 gaatcaatag attccttttc gggagcgatt catccttccc gaacgcagca tacaactctc
   137341 cgttgtactg cgctctccaa gtgtgcttgt tccccccttc ttccttaccc tggcaagtct
   137401 ttgtgaaata actccgatga gaagaaaaaa gaaggcgtta agagaccctc ctggcccaac
   137461 cctagacact ctaagatcct ttttcaaacc tgctcccatt tcgagtcaag aaaaaaacgg
   137521 ctcgaatggt acgatccctc cgtcacccca gaatgaaagg ggcgatctcg tagttcttgg
   137581 tctgtgaaga tgcgttgtta ggtgctccat tttattttcc cattgctaaa cctgtgctcg
   137641 agagatagct gtccatacac tgataaggga tgtatggatt ctcgagaaga gaggagccgt
   137701 ggtggtcccc cccggaccgc ccggatccca cgagtgaatc gaaagttgga tctacattgg
   137761 atctcacccg aatcgcccca tctatcctcc tgaggaggag tttggtttca aaccccggtt
   137821 cgaacaggag gagtacgcca tgctaatgtg ccttggatga tccacatctc agggtcaggc
   137881 gccgatgagc acattgaact atccatgtgg ctgagagccc tcacagccca ggcacaacga
   137941 cgcaattatc aggggcgcgc tctaccactg agctaatagc ccgtcgtgcg agcctcccac
   138001 tgggggcccg ctatgccaaa agcgagagaa accccatccc tctctttcct tttttcgccc
   138061 ccatgtcgcc acacgggggg aacatgggga cgtaaaaaag ggggtcctat caacttgttc
   138121 cgacctagga taataagctc atgagcttgg tcttacttca ccgtcgagaa aggaaagaag
   138181 acttccatct ccaagtttaa ctcagacgta gctcccttct tttttttggg ggtgtgaagc
   138241 agtgtcaaac caaaataccc aacaagcatt agctctccct gaaaaggagg tgatccagcc
   138301 gcaccttcca gtacggctac cttgttacga cttcactcca gtcactagcc ctgccttcgg
   138361 catccccctc cttgcggtta aggtaacgac ttcgggcatg gccagctccc atagtgtgac
   138421 gggcggtgtg tacaaggccc gggaacgaat tcaccgccgt atggctgacc ggcgattact
   138481 agcgattccg gcttcatgca ggcgagttgc agcctgcaat ccgaactgag gacgggtttt
   138541 tggggttagc tcacctcgcg ggatcgcgac cctttgtccc ggccattgta gcacgtgtgt
   138601 cgcccagggc ataaggggca tgatgacttg acgtcatcct caccttcctc cggcttatca
   138661 ccggcagtct gttcagggtt ccaaactcaa cgatggcaac taaacacgag ggttgcgctc
   138721 gttgcgggac ttaacccaac accttacggc acgagctgac gacagccatg caccacctgt
   138781 gtccgcgttc ccgaaggcac ccctctcttt caagaggatt cgcggcatgt caagccctgg
   138841 taaggttctt cgctttgcat cgaattaaac cacatgctcc accgcttgtg cgggcccccg
   138901 tcaattcctt tgagtttcat tcttgcgaac gtactcccca ggcgggatac ttaacgcgtt
   138961 agctacagca ctgcacgggt cgatacgcac agcgcctagt atccatcgtt tacggctagg
   139021 actactgggg tatctaatcc cattcgctcc cctagctttc gtctctcagt gtcagtgtcg
   139081 gcccagcaga gtgctttcgc cgttggtgtt ctttccgatc tctacgcatt tcaccgctcc
   139141 accggaaatt ccctctgccc ctaccgtact ccagcttggt agtttccacc gcctgtccag
   139201 ggttgagccc tgggatttga cggcggactt aaaaagccac ctacagacgc tttacgccca
   139261 atcattccgg ataacgcttg catcctctgt attaccgcgg ctgctggcac agagttagcc
   139321 gatgcttatt ccccagatac cgtcattgct tcttctccgg gaaaagaagt tcacgacccg
   139381 tgggccttct acctccacgc ggcattgctc cgtcagcttt cgcccattgc ggaaaattcc
   139441 ccactgctgc ctcccgtagg agtctgggcc gtgtctcagt cccagtgtgg ctgatcatcc
   139501 tctcggacca gctactgatc atcgccttgg taagctattg cctcaccaac tagctaatca
   139561 gacgcgagcc cctcctcggg cggattcctc cttttgctcc tcagcctacg gggtattagc
   139621 agccgtttcc agctgttgtt cccctcccaa gggcaggttc ttacgcgtta ctcacccgtc
   139681 cgccactgga aacaccactt cccgtccgac ttgcatgtgt taagcatgcc gccagcgttc
   139741 atcctgagcc aggatcgaac tctccatgag attcatagtt gcattactta tagcttcctt
   139801 gttcgtagac aaagcggatt cggaattgtc tttcattcca aggcataact tgtatccatg
   139861 cgcttcatat tcgcccggag ttcgctccca gaaatatagc catccctgcc ccctcacgtc
   139921 aatcccacga gcctcttatc cattctcatt gaacgacggc gggggagcaa atccaactag
   139981 aaaaactcac attgggctta gggataatca ggctcgaact gatgacttcc accacgtcaa
   140041 ggtgacactc taccgctgag ttatatccct tccccgcccc atcgagaaat agaactgact
   140101 aatcctaagt caaagggtcg agaaactcaa cgccactatt cttgaacaac ttggagccgg
   140161 gccttctttt cgcactatta cggatatgaa aataatggtc aaaatcggat tcaattgtca
   140221 actgccccta tcggaaatag gattgactac cgattccgaa ggaactggag ttacatctct
   140281 tttccattca agagttctta tgcgtttcca cgcccctttg agaccccgaa aaatggacaa
   140341 attccttttc ttaggaacac atacaagatt cgtcactaca aaaaggataa tggtaaccct
   140401 accattaact acttcattta tgaatttcat agtaatagaa atacatgtcc taccgagaca
   140461 gaatttggaa cttgctatcc tcttgcctag caggcaaaga tttacctccg tggaaaggat
   140521 gattcattcg gatcgacatg agagtccaac tacattgcca gaatccatgt tgtatatttg
   140581 aaagaggttg acctccttgc ttctctcatg gtacactcct cttcccgccg agcccctttt
   140641 ctcctcggtc cacagagaca aaatgtagga ctggtgccaa caattcatca gactcactaa
   140701 gtcgggatca ctaactaata ctaatctaat ataatagtct aatatatcta atataataga
   140761 aaatactaat ataatagaaa agaactgtct tttctgtata ctttccccgg ttccgttgct
   140821 accgcgggct ttacgcaatc gatcggatta gatagatatc ccttcaacat aggtcatcga
   140881 aaggatctcg gagacccacc aaagtacgaa agccaggatc tttcagaaaa cggattccta
   140941 ttcaaagagt gcataaccgc atggataagc tcacactaac ccgtcaattt gggatccaaa
   141001 ttcgagattt tccttgggag gtatcgggaa ggatttggaa tggaataata tcgattcata
   141061 cagaagaaaa ggttctctat tgattcaaac actgtaccta acctatggga tagggatcga
   141121 ggaaggggaa aaaccgaaga tttcacatgg tacttttatc aatctgattt atttcgtacc
   141181 tttcgttcaa tgagaaaatg ggtcaaattc tacaggatca aacctatggg acttaaggaa
   141241 tgatataaaa aaaagagagg gaaaatattc atattaaata aatatgaagt agaagaaccc
   141301 agattccaaa tgaacaaatt caaacttgaa aaggatcttc cttattcttg aagaatgagg
   141361 ggcaaaggga ttgatcaaga aagatctttt gttcttctta tatataagat cgtgattgga
   141421 tccgcatatg tttggtaaag agaataatct tatcctttga gaataatcaa aaatggacag
   141481 tgttcaattg gaacatgaaa acgtgactaa attggtccta gttactcttc ggggcggagt
   141541 ggaagaaggg ggggattctc gaacgcggaa aggatccaat gaattcgaaa gaattgaacg
   141601 aggagccgta tgaggtgaaa atctcatgta cggttctgta gagtggcagt aagggtgact
   141661 tatctgtcaa cttttccact atcaccccaa aaaaaccaaa ctctgcctta cgtaaagttg
   141721 ccagagtacg attaacctct ggatttgaaa tcactgctta tatacccggt attggccata
   141781 atttacaaga acattctgta gtcttagtaa gagggggaag ggttaaggat ttacccggtg
   141841 tgagatatca cattgttcga ggaaccctag atgctgtcgg agtaaaggat cgtcaacaag
   141901 ggcgttctag tgcgttgtag attcttatcc aagacttgta tcatttgatg atgccatgtg
   141961 aatcgctaga aacatgtgaa gtgtatggct aacccaataa cgaaagtttc gtaaggggac
   142021 tggagcaggc taccatgaga caaaagatct tctttctaaa gagattcgat tcggaactct
   142081 tatatgtcca aggttcaata ttgaaataat ttcagaggtt ttccctgact ttgtccgtgt
   142141 caacaaacaa ttcgaaatac ctcgactttt ttagaacagg tccgagtcaa atagcaatga
   142201 ttcgaagcac ttctttttac actatttcgg aaacccaagg actcaatcgt atggatatgt
   142261 aaaatacagg atttccaatc ctagcaggaa agggagggaa acggatactc aatttaaagt
   142321 gagtaaacag aattccatac tcgatctcat agatacatat agaattctgc ggaaagccgt
   142381 attcgatgaa agtcgtatgt acggcttgga gggagatctt tcatatcttt cgagatccac
   142441 cctacaatat ggggtaaaaa agccaaaata agtgatttta gcccttataa aaagaaaact
   142501 gattcttgaa cccctttcac gctcatgtca cgtcgaggta ctgcagaaaa aaaaacagca
   142561 aaatccgatc caatttatcg taatcgatta gttaacatgt tggttaaccg tattctgaaa
   142621 cacggaaaaa aatcattggc ttatcaaatt atctatcgag ccgtgaaaaa gattcaacaa
   142681 aagacagaaa caaatccact atccgtttta cgtcaagcaa tacgtggagt aactcccgat
   142741 ataacagtaa aagcaagacg tgtaggtgga tcgactcatc aagttcccat tgaaatagga
   142801 tccacacaag gaaaagcact tgccattcgt tggttattag cggcatcccg aaaacgtccg
   142861 ggtcgaaata tggctttcaa attaagttcc gaattagtgg atgctgccaa agggagtggc
   142921 gatgccatac gcaaaaagga agagactcat agaatggcag aggcaaatag agcttttgca
   142981 cattttcgtt aatccatgaa caggatctat acatctcgat cggaaaagaa tcaagagaaa
   143041 aagaaagaat cggaattgat cgatagattt ctcgaaacaa acgaaaagga aagatgaaac
   143101 ataaatcatg gatcaactaa gcctctcggg gactttctta aagaggaacc tcatgtaaat
   143161 accatggaat aaggtttgat cctattcatg gagattccgt aactattcca aaaatggaaa
   143221 gttcgacaca attgggattt tttttggaaa ttggaagcag ttactaattc atgatctggc
   143281 atgtacagaa tgaaaacttc attctcgatt ctacgagaat ttttatgaaa gcctttcatt
   143341 tgcttctctt cgatggaagt ttgattttcc cagaatgtat cctaattttt ggcctaattc
   143401 ttcttctgat gatcgattca acctctgatc aaaaagatat accttggtta tatttcatct
   143461 cttcaacaag tttagtaatg agcataacgg ccctattgtt ccgatggaga gaagaaccta
   143521 tgattagctt ttcgggaaat ttccaaacga acaatttcaa cgaaatcttt caatttctta
   143581 ttttactatg ttcaactcta tgtattcctc tatccgtaga gtacattgaa tgtacagaaa
   143641 tggctataac agagtttctc ttattcgtat taacagctac tctaggggga atgtttttat
   143701 gcggtgctaa cgatttaata actatctttg tagccccaga atgtttcagt ttatgctcct
   143761 acctattatc tggatatacc aagaaagatg tacggtctaa tgaggctact atgaaatatt
   143821 tactcatggg tggggcaagc tcttctattc tggttcatgg tttctcttgg ctatatggtt
   143881 catccggggg agagattgag cttcaagaaa tagtaaacgg tcttatcaat acacaaatgt
   143941 ataactcccc aggaatttca attgcgctca tattcattac cgtaggaatt gggttcaagc
   144001 tttccccagc cccttctcat caatggactc ctgacgtata cgaaggagtg cggttcgttc
   144061 gagaaattcc tacctctcta tctatctctg agatgtttgg atttttcaaa actccatgga
   144121 catgcagaag agaaatgcta tccccactcg gaccaagaca gaacttttac ttgttcaaat
   144181 aacaattaag gtgaagcagg gtcaggaacg acgaatctct ttatgataaa cagatccatt
   144241 ttgcaagttc gttattacgg gtagttccta caaaggatcg gactaatgac gtatacaata
   144301 cttgaattct cgatgtagat gctacatagt tggttctcat ccttcagaga ctacgagtgt
   144361 aataagagca tccgtcgaca aaaggatcac cctaagatga tcatctcgtg gctattgaga
   144421 acgaattaaa tcagatggtt ctatttctca atctttctga cttgctccta cgaaaccaag
   144481 gtcgaaaaga ttgaaaaaat cagtcattca caaccactga tgaaggattc ctcgaaaagt
   144541 taaggattag taatcctttt tagaaatcga atggattcgg tcttatacat acgcgaggaa
   144601 ggtaatcaaa aaagaaagaa aatgggttct tctttctttt atcacttagg agccgtgtga
   144661 gatgaaagtc tcatgcacgg ttttgaatga gagaaagaag tgaggaatcc tcttttcgac
   144721 tctgactctc ccactccagt cgttgctttt ctttctgtta cttcgaaagt agctgcttca
   144781 gcttcagcca ctcgaatttt cgatattcct ttttatttct catcaaacga atggcatctt
   144841 cttctggaaa tcctagctat tcttagcatg atattgggaa atctcattgc tattactcaa
   144901 acaagcatga aacgtatgct tgcatattcg tccataggcc aaatcggata tgtaattatt
   144961 ggaataattg ttggagactc aaatgatgga tatgcaagca tgataactta tatgctgttc
   145021 tatatctcca tgaatctagg aacttttgct tgcattgtat tatttggtct acgtaccgga
   145081 actgataaca ttcgagatta tgcaggatta tacacaaaag atcctttttt ggctctctct
   145141 ttagccctat gtctcttatc cctaggaggt cttcctccac tagcaggttt tttcggaaaa
   145201 ctctatttat tctggtgtgg atggcaggca ggcctatatt tcttggtttt aataggactc
   145261 cttacaagcg ttgtttctat ctactattat ctaaaaataa taaagttatt aatgactgga
   145321 cgaaaccaag aaataacccc tcacgtgcga aattatagaa gatccccttt aagatcaaac
   145381 aattccatcg aattgagtat gattgtatgt gtgatagcat ctactatacc aggaatatca
   145441 atgaacccaa ttattgcaat tgctcaggat agcctttttt agcttctagg gtctatttct
   145501 tagttcaaga tccctcttac taactggaat caaagaatta gtagatctgt tccgcccaaa
   145561 atgggaatgg gctagggtta tgaacttata atctgatgat cgagtcgatt ccatgattat
   145621 aagttcattc cataccggac caggccggaa tagggttata tacattctca ttatgagaag
   145681 gggtcattcg ggcctatcta aatagatact atgtttacat atggattcct acatcattac
   145741 attccattta ggattaggaa tacgcgtaat cggacctgct ttttacatat ctctattggg
   145801 accctattca cctctttgag tgaatcgaga aataggtttg attgtccatc tttttgatat
   145861 atatcaggca ttgcattctc cggataattc aaatcgaagc aattggatgt ccaactcggg
   145921 cctatatgac atgaccgatc aatagatcca cctttgtcat atattccata catcacacta
   145981 gatagatatc atattcatgg aatacgattc actttcaaga tgccttggtg gtgaaatggt
   146041 agacacgcga gactcaaaat ctcgtgctaa atagcgtgga ggttcgagtc ctcttcaagg
   146101 cataatattg agaatgctca ttgaatgagc attctcaata agagagctcg gatcgaatcg
   146161 gtattgatat accgattcga tccgagctct tggaattgga ataaattcgg cagcggatcg
   146221 cgaaatcttg gtgatcttct ctatctaatg aatggggagt ccgctttaaa atcgtccgcc
   146281 ctgcacccac cccccgagta tatgcttcaa caggaatcac acaagggtag attagaaacc
   146341 tctggtaaaa tgcccgcccg taacccagca gataaagtac attacatagt ccagggattg
   146401 gcgacttacc cattcagtga ctttggcact ggacgttccc aaaatgggga ctatcgggta
   146461 aattcaatat aatagacgcc tgttggcatt ccagccttcc ttctcctttc agggcctatc
   146521 cgaaagagaa tccagtactt cttggtcgtg aatatctgaa ctggttgttt gctgttcaag
   146581 aattcttgtt taggcagttc ataccatcca tacatagtgt tttgatctaa gatttcaatt
   146641 cttccgtgtt tcagcagtaa catattcttc catggagcta aggtccaaaa tatggaagaa
   146701 acaagcgttt ccacgactct accacccagt caattctgtt ccacttaatc cctctttcat
   146761 ggccacatat ctttccggct aaggaatggg aaatctttct cctgttacat gaatccaatt
   146821 ttcatttcat ccgggaaaag ccatcttttt ctcaacaatg tctttgtcat ttgatccaat
   146881 agcgttccgt tagataggaa cagatttgat aaatactgat aactctcgga tagagtatta
   146941 gaacggaaag atccattaga taatgaactg ttggttctaa gccatctctg acgattaatc
   147001 aacaattcga agtgcttttc ttgcgtattc ttgataaacc agcgtttata tatagatgta
   147061 ggagggtctg tttgggaagt aagaagcccc tttgacatct cttcatctgc aaataattct
   147121 cgatgtgaaa acacagagcc agggggctga tctttgaata ggaaaaagag tggatctgca
   147181 gggtcccaaa tgaattggct tattcgaaaa aggccttgtt ctttggaaga tctatctcgt
   147241 gtctggtact gcatggttcc actctgcaag aactccgaat cattctcttg aagctcatcc
   147301 tcttcatcat aaatgatccg cttgccccga aatgacctgg accaataggg aaatcccaat
   147361 tcattgggcc tttcgataca atcaaataga aagccccaag ggcgccatat tctaggagcc
   147421 caaactatgt gattgaataa atcctcctgc gggtcaaggg ctccttctcc ctccccttct
   147481 tcaaactccg attcatattt ttcatagaga aatctctgat caaggataga acaagagccg
   147541 ttttgcatca tatctaaggg attcctcggt tcgggccgaa gaagcaatgt cactcgatca
   147601 ttatcaaact gactgcaatc tttttctgtc cgtgaagatc ccaccagagc gccttctact
   147661 tctaataggc catgaactag atcagaatca ttctcaacga gtccataaga agtgatccca
   147721 tttttttcat cgggtccgga taaagaccaa agatcttgag cgaccgatcc ggcagaacaa
   147781 ctcaaaagat aaagaagtat cgttaatctc ttcatgctcg ttccaagctc gaagtaccat
   147841 ttgtacaaat aagaatcccc ttcgttacat gatttcttct tcatatagat agatatagga
   147901 tctatggggc aattacttag aagtacattt tgtgctacag cccttcctat ctgatagaaa
   147961 aggatcccat gatcctgaac cgatcttacc tgggatcgca aatcccaagt ttgtctatga
   148021 agagcggatc taattgtatt agtgtctata attgatttct tctgtgtaat actaatcgat
   148081 aggacctcat tggtaagtgc tacaagatct cgtgcattgg aacccatggt tatggacccg
   148141 aatccgttag tatggaacat tttcttttcc aagtgaaatc ccctagtata tgaaagagtg
   148201 aaaaagtgct ttcgttgttg tggaagaaga agccttcgta tcttaatgca cgtatttaat
   148261 ttattcggag ctattagagc gggatccact ttttggggaa tatgagtcga agcaataaca
   148321 agaatatttc tagtagaaca tctttcacaa tccctggaga gatggttcac taatagaccg
   148381 agggctaagt cattcgactc attcacatcc agatcatgaa tgtttggaat ccatattatg
   148441 caaggagaca ttgcttttgc taattcgaat tgaagggtga tataaaatcg gtctatttcc
   148501 ggcatcatat ccatagttag cccattcatc ctagttagca gtttcagctc cgtatcaagg
   148561 tcacgatcga tatcgtcact agcatcaaga ttgtcactat catcaatatc gtcactatca
   148621 tcaatatcga tctcatcaag aagaaaacct ttaggcttgt tatccaggaa cttgttcaga
   148681 aataccgtaa tgaaaggaac ataggagttt gtcgctaggt atttgaccaa ataggatcgt
   148741 ccagttccta tagaacctat cactaaaata cccctagagg gggataaggc taagcggagc
   148801 gaaaagggtt ttccatgaga tgggaaatga aaactatttt ccccacacga agtttgtgaa
   148861 taagtgattg tctgataatg agcaaggaat atccgtcttt ctgctaaaca ggatggattg
   148921 aactcataat tcattagatg ctttttatga atgtcaacta agtatcgtaa gtaaattgct
   148981 cccggttgtt caatcatttg ataaccagag tcattctttg ataaacgatc actatgagtc
   149041 agactcaata gaatttgatc aatcctattt tctgtcgtta aggtggagaa ctgaaccaag
   149101 aattctcttt cttcatcatc aatcgaatca ctgttcgcga cccaggattc tattttatca
   149161 tcaatccaat ccccgttcac gttttttctt tttcttatca atgaatagat ctctttactt
   149221 gtatgactta gatgtctcgt atttctcgaa aaagtgattc gattgatggg atttggtatg
   149281 agatcgatga tctcgatgag attgatattc caatctttct tcttagaacg tattgatttg
   149341 accccataag cgggaccaag catgttgccg ccagaagcag aaccccgtat ttcttctaga
   149401 gaatctccta attgttccag agcaactaga aagagattct ttaaccagaa agaattcggt
   149461 tcagatgtag gatacctatc cagaagtttt cgcaactcaa tcatagatga tggaatcatc
   149521 aaagatttga ccttttcgaa ctctgtctgt aactcactag aggcccggga aacaaagaga
   149581 agatgtgtac gaacgagata tccagcaaca agaagaagga aaaggattga atagaggaac
   149641 tcccgagcat ttggcgatct cagatgtgtc gatatcaatg gtgactcatt atttcgatga
   149701 atcatttctt cggacagaag aagattatgt aaacacttac tcgagatctc acttatcaga
   149761 ttccattgtg gaagacacaa ttttttctga agaattcgcc atgatatacc tgatccatgc
   149821 ataatatcat gaaaaatggg tacaaatttt tgactgctac ttagtattgg caataggtct
   149881 gaaaaagtat ctaaaaatat caaatttaga tatttgtacc ctgtcgaagt aaggaaccat
   149941 ggcatatatg tttggaatag attccatttt gagagagttg aaaaagcact atctcgttga
   150001 aaggttctat acatctgccc tttctcaacg catttcttta gacaaagact ccgttttttc
   150061 ctcttttcgg atgataaatc tttctcagaa catggagtgt gaatcaaacc catgtttgaa
   150121 ttgaaattga gatactgatg caagttcttc ccttctgaat cagatagatt catatctgaa
   150181 agaggttgac aataagttct ttcaaaattg actatttgcc cctctgttag aggtgttcca
   150241 gaaatgtctg cgatcgagta aatagctcta cgaacgaatg gatcggatcg acttggaaaa
   150301 tggaaagatt tgtacaagtt atacgtttcg tcaccacttt gtggaaaatc gttaggtatg
   150361 aatatgttag atacctgtga ctcgattggt gaaatagtat ctctccccca aaaagcatgt
   150421 ttttttttac cgacgcacaa agaaaatatt ttgttgcgaa tgaacaagat attgaggaat
   150481 tgtccatacg taaaatcaga attattgata cgggcctttt ccacagaaaa ggggaatctt
   150541 gtgttccaat agaagcagaa gtgatgtgga ttattcaaga atcgaagtcg atttgcttta
   150601 taaaaagaag atatcaatga acttctatga aatggtttca cgggattcag ccaattgtct
   150661 tgatcgtgga atatcattga gaaataggaa tccgggttat caaaggattt cctgcgatta
   150721 tttctagtat ggaatgagtc aatcatccac tttggtatct tattgaacaa aaatggtgat
   150781 attgttcctc cattgatcaa gaatttcgat ttttgggaag tatcatgatc gtccaataag
   150841 aagggtttcc attttttcaa atgaacaatt tgaagaccta ttgattctaa caactgattg
   150901 cagagttgat cattcggacc tttcaattca tagatgtaga tctcggacct atgaatgggg
   150961 atatttccga aactcacaca gaaaaaagga agtgagttag acaaaaagaa aagcaacttg
   151021 gacaaaaaaa gaagtgactt ggacaaaaag aaacgaagtg gcttagacaa atcttttttg
   151081 tcgataacct cagaccaatc aatcgaatat tgattaatac gtaatcgatc gaacactact
   151141 tgaaaacggc tcttctgctc cgaaacggac tgttccaaat gttcctggaa attcttgctc
   151201 ccattggacc atttgtatct atatgcatca ggatcccgat tcatggatct ctcggttcga
   151261 gaaatcaaaa taagaggctc gaaccatttc ttctgactct ttttcaaatt cgataaatat
   151321 tggttgatcg tatatttcat tatagttcta tgattcagag tatcctttcc tatttgatcc
   151381 ctttgaattc catattcgaa gttgcgatcg gatctattca ttaaaaagaa tcgattcaat
   151441 acatttctta tgtacccata ggtactatat tggatttgaa tcagatttcg gatcaatcta
   151501 tattgagtga ctgcctccat tatgttgttg ctagcaaata ccactatttt tggttttgga
   151561 tcttccaaat cattcccgca ggagatccgg acccattttt ttctgatcct tcgagaaaaa
   151621 gattcattct cttcataaaa aataggaggt agaaccaata aagatttctt tttcgattca
   151681 tccctggcct cattcaagaa ttgtttttga tccaatccgc aggaatcaat agaaaaggca
   151741 aatcccttat gatacaccag atccggctcg gttattgata gagtgaatag atctgccatt
   151801 tcttgaaatc tctcttctga ttcaaaatcg tagtgtaacg tgtatcctcc cctgttccgg
   151861 tcatggaata gatgaaataa atcaaaaaat ggatttttgt tcaagaatga aatcttattg
   151921 gaactgtcca tatccggttc atcttcggaa ccatatcaca tcccggatct gatgaaatag
   151981 gatgaattga gacggtattt tgtaaatacg taattatctt gaatatatta accatttctt
   152041 tattttccga tcgcttggaa gggacaaaag aaagatcttg ttgtttcttc aacaatttct
   152101 gatccctagt ggacctctca gtaggattcg aacccagatg aagttctgac catctatcag
   152161 agaaaaaaga acgaacggat cttgtaggat tcccaagaaa ttcttcgatt tcttccggaa
   152221 acagatgatt aatcatctgc ttctcacgtt ccgtgaatag ccgggacatt gaggaatatc
   152281 cagaaaggca tttcgggaat cggcctgatt ctatctcttt tcgttccgtt tgaagaaagg
   152341 aaggatccca aagaatcgat ctttcttttc gttgttgaat ctctctttga ttaatcaatg
   152401 tgtgatattc cgaatcctca ttactaatgg aatccaaatg atctctggat tgatcagaag
   152461 atcctttcag ttggctagaa tccgttactt gaacgaaact agatcttgtg gaatcatatt
   152521 gaatatttga cgatacattc tgtaccttgc taaaaaaccg atccttgttt accaaccaca
   152581 cattgtctaa ccaaatccaa ttctctctcg atacgttcct caaaaaatcc gattcgggcg
   152641 gattcttccc ccaactaacg aagagatctt ggcggaattg ccacatatga aattgagcac
   152701 agttttgcaa agaaatagcc cacttgtttc tcgagaagag atgggaaaca tgctcaatat
   152761 catttgattg aatagttgac ccagcccctt gttgtttgaa gaaaccctcc acttcaattg
   152821 gtattttttc acgaaaagca gacatgagat aagaaatcca gtgtttcact aagatttcga
   152881 atagcggtcc cgaattcaag ttgattctat ttcgactctt cctcagagaa agacgatcaa
   152941 acaattccca atcatggtcc ttgcggatcg gatcatccat ataatataca aaaagaaact
   153001 ccagatattt gagatctttc tctttgaata agatctcaat tccagcgacg gtttcattag
   153061 atatcttaca actagaatcc ctcttttttc cgatccagtt cctccaccaa cgcgaacccc
   153121 agttagattc aggcatgcta cactttttag ttattgggag aacccaagta ctctctttcg
   153181 gattcaggaa acaactctca gagatctttt ttcctttggg aagatacagg agcgaaacaa
   153241 tcaacctatt gatattggaa gacccaacgg attcttccaa tgtatcattt ctgggtccaa
   153301 tggaattcat aggtatagga agaagcccta tcaaatagag attttttctt tcgaccatat
   153361 ttcgattgtt aatacgatat ataaggaccg ctactacaaa gagtattaca cccttgatcg
   153421 tgaaatatcg attgcttgtt gaaccctgtg aattgcgtga aagtaggata ctccaaattc
   153481 gggggtcaaa gagttttaga aaacgttctt ggtggaaaaa aatgtgaatg aaggatcccg
   153541 ctgaattgaa ttgggtccat gaatctaaga aatggtgaga attcttgatc tctctcaata
   153601 tctctctcaa ttcgaaaatc caggatttga attgatgtcc tctcattgat tcctcctaaa
   153661 ttgcattgat ttatcctaac taaattgcat tgatttatcc taaagatttc atttcaattg
   153721 gaatttggtt attcaccatg tacgaggatc cccgctaagc atccatggct gaatggttaa
   153781 agcgcccaac tcataattgg cgaattcgta ggttcaattc ctactggatg cacgccaatg
   153841 ggaccctcca ataagtctat tggaattggc tctgtatcaa tggaatctca tcatccatac
   153901 ataacgaatt ggtgtggtat attcatatca taatatatga acagtaagaa ctagcattct
   153961 tattgagact ataactcata gggaagaaaa tcgatttatg gatggaatca aatatgcagt
   154021 atttacagac aaaagtattc ggttattggg gaaaaatcaa tatacttcta atgtcgaatc
   154081 aggatcaact aggacagaaa taaagcattg ggtcgaactc ttctttggtg tcaaggtaat
   154141 agctatgaat agtcatcgac ttccgggaaa gagtagaaga atgggaccta ttatgggaca
   154201 tacaatgcat tacagacgta tgatcattac gcttcaaccg ggttattcta ttccacctct
   154261 tagaaagaaa agaacttaaa aaaaaatact taatagcatg gcgatacatt tatacaaaac
   154321 ttctaccccg agcacacgca atggaaccgt agacagtcaa gtgaaatcca atccacgaaa
   154381 taatttgatc tatggacagc atcattgtgg taaaggtcgt aatgccagag gaatcattac
   154441 cgcaaggcat agagggggag gtcataagcg tctataccgt aaaatcgatt ttcgacggaa
   154501 tgaaaaagac atatatggta gaatcgtaac catagaatac gaccctaatc gaaatgcata
   154561 catttgtctc atacactatg gggatggtga gaagagatat attttacatc ccagaggggc
   154621 tataattgga gataccattg tttctggtac agaagttcct ataaaaatgg gaaatgccct
   154681 acctttgagt gcggtttgaa ctattgattt acgtaattgg aaataaccaa ttaggtttac
   154741 gacgaaacct agaaatcgat cactgatcca atttgagtac ctctgcagga tagacctcaa
   154801 cagaaaactg aagagtaacg gcagcaagtg attgagttca gtagttcctc atataaaatt
   154861 attgactcta gagatatagt aatatggaga agacaaaatt gtttcaagca ccgacagaac
   154921 cggaagcgcc ccttctttca aagagaggag gacgggttat tcacatttca tttgatggtc
   154981 agaggcgaat tgaaagttaa gcagtgggaa ttctaaagat tccccggggg aaaaatagag
   155041 atgtctccta cgttacccat aatatgtgga agtatcgacg taatttcata gagtcattcg
   155101 gtctgaatgc tacatgaaga acataagcca gatgacggaa cgggaagacc caggatgtag
   155161 aagatcataa catgagtgat tcggcagatt tggattcata tatatatcca cccatgtggt
   155221 acttcattct acgatatata taagatccat ctgtatagat atcatcatct acatccagaa
   155281 agaagtatgc tttggaagaa gcttgtacag tttgggaagg ggttttgatt gatcaaaaga
   155341 agaatctact tcaaccgata tgcccttagg cacggccata cataacatag aaatcacact
   155401 tggaaagggt ggacaattag ctagagcagc gggtgctgta gcgaaactga ttgcaaaaga
   155461 ggggaaatcg gccacattaa aattaccttc tggggaggtc cgtttgatat ccaaaaactg
   155521 ctcagcaaca gtcggacaag tggggaatgt tggggtgaac cagaaaagtt tgggtagagc
   155581 cggatctaag cgttggctag gtaagcgtcc tgtagtaaga ggagtagtta tgaaccctgt
   155641 agaccatccc catgggggtg gtgaagggag agccccaatt ggtagaaaaa aacccacaac
   155701 cccttggggt tatcctgcac ttggaagaag aagtagaaaa aggaataaat atagtgataa
   155761 tttgattctt cgtcgccgta gtaaatagga gagaaaatcg aattaaattc ttcgttttta
   155821 caaaaaaaaa aaaaatagga gtaa
//
LOCUS       TIPNPSS      6425 bp ds-DNA             SYN       09-AUG-1990
DEFINITION  A.tumefaciens T-DNA vector containing octopine T-DNA borders and
            markers: neomycin-phosphotransferase - octopine synthase (3' end)
            and Sp/Sm adenyltransferase. complete cds.
ACCESSION   M35007
KEYWORDS    neomycin phosphotransferase;
            streptomycin/spectinomycin adenyltransferase.
SOURCE      N.tabacum T-DNA inserts in A.tumefaciens DNA.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6425)
  AUTHORS   Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and
            Depicker,A.
  TITLE     Cloning and sequence analysis of truncated T-DNA inserts from
            Nicotiana tabacum
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.R.Gheysen, 01-JUN-1990.
FEATURES       from  to/span     description
    pept       2782     1985 (c) neomycin phosphotransferase (gtg start codon)
    pept.ps    5009     4042 (c) streptomycin/spectinomycin adenyltransferase
                                 (gtg start codon)
    mRNA    /  1811     1105 (c) octopine synthase (3' end)
    recomb       24       25     T-DNA end/plant DNA start
    recomb       40       41     plant DNA end/T-DNA start
    recomb     1094     1095     plant DNA end/T-DNA start
    recomb     2786     2787     T-DNA end/plant DNA start
    recomb     3044     3045     T-DNA end/plant DNA start
    recomb     3354     3355     plant DNA end/T-DNA start
    recomb     5585     5586     T-DNA end/plant DNA start
    recomb     6389     6390     T-DNA end/plant DNA start
    signal     1650     1645 (c) poly-A signal
    signal     1686     1681 (c) poly-A signal
    site       1634     1633 (c) major poly-A site
    site          1       24     left T-DNA border
    site        372      395     24 bp border-like sequence
    site       1569     1592     24 bp border-like sequence
    site       1669     1692     24 bp border-like sequence
    site       1779     1756 (c) 24 bp border-like sequence
    site       2128     2105 (c) 24 bp border-like sequence
    site       2449     2472     24 bp border-like sequence
    site       2485     2462 (c) 24 bp border-like sequence
    site       3660     3683     24 bp border-like sequence
    site       3875     3898     24 bp border-like sequence
    site       4359     4336 (c) 24 bp border-like sequence
    site       5868     5891     24 bp border-like sequence
BASE COUNT     1509 a   1754 c   1710 g   1452 t
ORIGIN      
        1 cggcaggata tattcaattg taaatggctt catgtccggg aaatctacat ggatcagcaa
       61 tgagtatgat ggtcaatatg gagaaaaaga aagagtaatt accaattttt tttcaattca
      121 aaaatgtaga tgtccgcagc gttattataa aatgaaagta cattttgata aaacgacaaa
      181 ttacgatccg tcgtatttat aggcgaaagc aataaacaaa ttattctaat tcggaaatct
      241 ttatttcgac gtgtctacat tcacgtccaa atgggggctt agatgagaaa cttcacgatc
      301 gatgccttga tttcgccatt cccagatacc catttcatct tcagattggt ctgagattat
      361 gcgaaaatat acactcatat acataaatac tgacagtttg agctaccaat tcagtgtagc
      421 ccattacctc acataattca ctcaaatgct aggcagtctg tcaactcggc gtcaatttgt
      481 cggccactat acgatagttg cgcaaatttt caaagtcctg gcctaacatc acacctctgt
      541 cggcggcggg tcccatttgt gataaatcca ccatcacaat agatagtcta atggacgaaa
      601 aaggcgaata tttcgatgct gagattcgac gcaattaatt cgagaaaaat cccgtgattg
      661 atgctgttga gttaccaata atatgggcag cgaaggccat ttaattataa gatcctgcaa
      721 gcctcgtcgt cctggccgga ccacgctatc tgtgcaaggt ccccggcccc ggacgcgcgc
      781 tccatgagca gagcgcccgc cgccgaggcg aagagtcggg cggcgccctg cccgtcccac
      841 caggtcaaca ggcggtaacc ggcctcttca tcgggaatgc gcgcgacctt cagcatcgcc
      901 ggcatgtccc cctggcggac gggaagtatc cagctcgacc aaagcggcca tcgtgcctcc
      961 ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc
     1021 gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa
     1081 ccaactcgcg aggggatcga gcccctgctg agcctcgaca tgttgtcgca aaattcgccc
     1141 tggacccgcc caacgatttg tcgtcactgt caaggtttga cctgcacttc atttggggcc
     1201 cacatacacc aaaaaaatgc tgcataattc tcggggcagc aagtcggtta cccggccgcc
     1261 gtgctggacc gggttgaatg gtgcccgtaa ctttcggtag agcggacggc caatactcaa
     1321 cttcaaggaa tctcacccat gcgcgccggc ggggaaccgg agttcccttc agtgaacgtt
     1381 attagttcgc cgctcggtgt gtcgtagata ctagcccctg gggccttttg aaatttgaat
     1441 aagatttatg taatcagtct tttaggtttg accggttctg ccgctttttt taaaattgga
     1501 tttgtaataa taaaacgcaa ttgtttgtta ttgtggcgct ctatcataga tgtcgctata
     1561 aacctattca gcacaatata ttgttttcat tttaatattg tacatataag tagtagggta
     1621 caatcagtaa attgaacgga gaatattatt cataaaaata cgatagtaac gggtgatata
     1681 ttcattagaa tgaaccgaaa ccggcggtaa ggatctgagc tacacatgct caggtttttt
     1741 acaacgtgca caacagaatt gaaagcaaat atcatgcgat cataggcgtc tcgcatatct
     1801 cattaaagca gggggtgggc gaagaactcc agcatgagat ccccgcgctg gaggatcatc
     1861 cagccggcgt cccggaaaac gattccgaag cccaaccttt catagaaggc ggcggtggaa
     1921 tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc
     1981 ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg
     2041 cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat
     2101 cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg ccacagtcga
     2161 tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca tcgccatggg
     2221 tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac agttcggctg
     2281 gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg gcttccatcc
     2341 gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat
     2401 caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg gcaggagcaa
     2461 ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag tcccttcccg
     2521 cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc agccacgata
     2581 gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc ttgacaaaaa
     2641 gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct
     2701 gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa cctgcgtgca
     2761 atccatcttg ttcaatccac atgatcagat ctctaggcgc gtgggtgcgg acgtagtcag
     2821 cgccattgcc gatcgcgtga agttccgccg caaggccgct ggacccagat cctttacagg
     2881 aaggccaacg gtggcgccca agaaggattt ccgcgacacc gagaccaata gcggaagccc
     2941 caacgccgac ttcagctttt gaaggttcga cagcacgtgc agcgatgttt ccggtgcggg
     3001 gctcaagaaa aatcccatcc ccggatcgag gatgagccgg tcggcagcga ccccgctccg
     3061 tcgcaaggcg gaaacccgcg cctcgaagaa ccgcacaatc tcgtcgagcg cgtcttcggg
     3121 tcgaaggtga ccggtgcggg tggcgatgcc atcccctgcg ctgagtgcat aaccaccagc
     3181 ctgcagtccg cctcagcaat atcgggatag agcgcagggt caggaaatcc ttggatatcg
     3241 ttcaggtagc ccacgccgcg cttgagcgct agcgcgggtt tccggttgga agctgtcgat
     3301 tgaaacacgg tgcatctgat cggacagggc gtctaagagc ggcgcaatac gtctgatctc
     3361 atcggccggc gatacaggcc tcgcgtccgg atggctggcg gccggtccga catccacgac
     3421 gtctgatccg actcgcagca tttcgaccgc cgcggtgaca gcgttggtgg ggtctagcag
     3481 tacgtcaatc gaagaaggag tcctcggtga gattcagaat gccgaacacc gtcaccatgg
     3541 cgtcggcctc cgcagcgact tccacgatgg ggatcgggcg agcaaaaagg cagcaattat
     3601 gagccccata cctacaaagc cccacgcatc aagcttttga ccctgaagca actaggcaat
     3661 ggctgtaatt atgacgacgc cgagtcccga accagactgc ataagcaaca accgacagaa
     3721 tggatttcga aaccagagaa agaaaataaa tgcgatgcca taaccgatta tgaacaacgg
     3781 cggaaggggc aagcttagta aatgcctcgc tagattttaa tgcggatgtt gcgattactt
     3841 cgccaactat tgcgataaca agaaaaagcc agcctttcat gatatatctc ccaatttgtg
     3901 tagggcttat tatgcacgct taaaaataat aaaagcagac ttgacctgat agtttggctg
     3961 tgagcaatta tgtgcttagt gcatctaatc gcttgagtta acgccggcga agcggcgtcg
     4021 gcttgaacga attgttagac attatttgcc gactaccttg gtgatctcgc ctttcacgta
     4081 gtggacaaat tcttccaact gatctgcgcg cgaggccaag cgatcttctt cttgtccaag
     4141 ataagcctgt ctagcttcaa gtatgacggg ctgatactgg gccggcaggc gctccattgc
     4201 ccagtcggca gcgacatcct tcggcgcgat tttgccggtt actgcgctgt accaaatgcg
     4261 ggacaacgta agcactacat ttcgctcatc gccagcccag tcgggcggcg agttccatag
     4321 ocgttaaggt ttcatttagc gcctcaaata gatcctgttc aggaaccgga tcaaagagtt
     4381 cctccgccgc tggacctacc aaggcaacgc tatgttctct tgcttttgtc agcaagatag
     4441 ccagatcaat gtcgatcgtg gctggctcga agatacctgc aagaatgtca ttgcgctgcc
     4501 attctccaaa ttgcagttcg cgcttagctg gataacgcca cggaatgatg tcgtcgtgca
     4561 caacaatggt gacttctaca gcgcggagaa tctcgctctc tccaggggaa gccgaagttt
     4621 ccaaaaggtc gttgatcaaa gctcgccgcg ttgtttcatc aagccttacg gtcaccgtaa
     4681 ccagcaaatc aatatcactg tgtggcttca ggccgccatc cactgcggag ccgtacaaat
     4741 gtacggccag caacgtcggt tcgagatggc gctcgatgac gccaactacc tctgatagtt
     4801 gagtcgatac ttcggcgatc accgcttccc tcatgatgtt taactttgtt ttagggcgac
     4861 tgccctgctg cgtaacatcg ttgctgctcc ataacatcaa acatcgaccc acggcgtaac
     4921 gcgcttgctg cttggatgcc cgaggcatag actgtacccc aaaaaaacag tcataacaag
     4981 ccatgaaaac cgccactgcg ccgttaccac cgctgcgttc ggtcaaggtt ctggaccagt
     5041 tgcgtgaggc catacgctac ttgcattaca gcttacgaac cgaacaggct tatgtccact
     5101 gggttcgtgc cttcatccgt ttccacggtg tgcgtcaccc ggcaaccttg ggcagcagcg
     5161 aagtcgaggc atttctgtcc tggctggcga acgagcgcaa ggtttcggtc tccacgcatc
     5221 gtcaggcatt ggcggccttg ctgttcttct acggcaagtg ctgtgcacgg atctgccctg
     5281 gcttcaggag atcggaagac ctcggccgtc cgggcgcttg ccggtggtgc tgaccccgga
     5341 tgaagtggtt cgcatcctcg gttttctgga aggcgagcat cgtttgttcg cccagcttct
     5401 gtatggaacg ggcatgcgga tcagtgaggg tttgcaactg cgggtcaagg actggatttc
     5461 gatcacggca cgatcatcgt gcgggagggc aagggctcca aggatcgggc cttgatgtta
     5521 cccgagagct tggcacccag cctgcgcgag cagctgtctc gtgcacgggc atggtggctg
     5581 aaggactagg ccgagggccg cagcggcgtt gcgcttcccg acgcccttga gcggaagtat
     5641 ccgcgcgccg ggcattcctg gccgtggttc tgggtttttg cgcagcacac gcattcgacc
     5701 gatccacgga gcggtgtcgt gcgtcgccat cacatgtatg accagacctt tcagcgcgcc
     5761 ttcaaacgtg ccgtagaaca agcaggcatc acgaagcccg ccacaccgca caccctccgc
     5821 cactcgttcg cgacggcctt gctccgcagc ggttacgaca ttcgaaccgt gcaggatctg
     5881 ctcggccatt ccgacgtctc tacgacgatg atttacacgc atgtgctgaa agttggcggt
     5941 gccggagtgc gctcaccgct tgatgcctgc cgcccctcac tgtgagaggt agggcagcgc
     6001 aagtcaatcc tagcggattc actacccctg cgcgaaggcc atcggtgccg catcgaacgg
     6061 ccggttgcgg aaagtcctcc ctgcgtccgc tgatggccgg cagcagcccg tcgttgaagg
     6121 atccctgaaa gcgacgttgg atgttaacat ctacaaattg ccttttctta cgaccatgta
     6181 cgtaagcgct tacgtttttg gtggaccctt gaggaaactg gtagctgttg tgggcctgtg
     6241 gtctcaagat ggatcattaa tttccacctt cacctacgat ggggggcatc gcaccggtga
     6301 gtaatattgt acggctaaga gcgaatttgg cctgtagacc tcaattgcga gctttctaat
     6361 ttcaaactat tcgggcctaa cttttggtgt gatgatgctg actggcagga tatataccgt
     6421 tgtaat
//
LOCUS       TOBPRMMG      200 bp ds-DNA             PLN       09-AUG-1990
DEFINITION  N.tabacum promoter activating a promoterless nptII marker gene.
ACCESSION   M34757
KEYWORDS    .
SOURCE      N.tabacum (strain SR1) DNA.
  ORGANISM  Nicotiana tabacum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 200)
  AUTHORS   Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and
            Depicker,A.
  TITLE     Cloning and sequence analysis of truncated T-DNA inserts from
            Nicotiana tabacum
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.R.Gheysen, 01-JUN-1990.
FEATURES       from  to/span     description
    pept        198  >   200     ORF
    mRNA        130  >   200     mRNA (5' end +/- 2 bp)
    recomb      193      194     T-DNA end/plant DNA start
    signal       25       30     CAAT box
    signal       47       55     CAAT box
    signal      101      107     TATA box
BASE COUNT       64 a     53 c     32 g     51 t
ORIGIN      
        1 caagcctcgc tagtcaaaag tgtaccaaac aacgctttac agcaagaacg gaaatgcgcg
       61 tgacgctcgc ggtgacgcca tttcgccttt tcagaaatgg ataaatagcc ttgcttccta
      121 ttatatcttc ccaaattacc aatacattac actagcatct gaatttcata accaatctcg
      181 atacaccaaa tcggatcatg
//

GenBank-Updates@genbank.bio.net (08/10/90)

LOCUS       HUMMPRCI     9081 bp ss-mRNA            PRI       10-AUG-1990
DEFINITION  Human cation-independent mannose 6-phosphate receptor mRNA,
            complete cds.
ACCESSION   J03528
KEYWORDS    mannose 6-phosphate receptor.
SOURCE      Human placenta, cDNA to mRNA (library of J.E.Sadler), clones
            PP[4,112,188), RB3 and RC5.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 9081)
  AUTHORS   Oshima,A., Nolan,C.M., Kyle,J.W., Grubb,J.H. and Sly,W.S.
  TITLE     The human cation-independent mannose 6-phophate receptor: Cloning
            and sequence of the full-length cDNA and expression of functional
            receptor in COS cells
  JOURNAL   J. Biol. Chem. 263, 2553-2562 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by R.Miller, 04-JAN-1988.
FEATURES       from  to/span     description
    pept        148     7623     mannose 6-phosphate receptor precursor
                                 /hgml_locus_uid="LY0166H" /nomgen="M6PR"
                                 /map="12"
    sigp        148      267     mannose 6-phosphate receptor signal peptide
    matp        268     7620     mannose 6-phosphate receptor
    mRNA    <     1     9081     MPR-CI mRNA
BASE COUNT     2243 a   2233 c   2476 g   2129 t
ORIGIN      Chromosome 12; 475 bp upstream of EcoRI site.
        1 gagcccagtc gagccgcgct cacctcgggc tcccgctccg tctccacctc cgcctttgcc
       61 ctggcggcgc gaccccgtcc cgggcgcggc cccagcagtc gcgcgccgtt agcctcgcgc
      121 ccgccgcgca gctccgggcc cggcgcgatg ggggccgccg ccggccggag cccccacctg
      181 gggcccgcgc ccgcccgccg cccgcagcgc tctctgctcc tgctgcagct gctgctgctc
      241 gtcgctgccc cggggtccac gcaggcccag gccgccccgt tccccgagct gtgcagttat
      301 acatgggaag ctgttgatac caaaaataat gtactttata aaatcaacat ctgtggaagt
      361 gtggatattg tccagtgcgg gccatcaagt gctgtttgta tgcacgactt gaagacacgc
      421 acttatcatt cagtgggtga ctctgttttg agaagtgcaa ccagatctct cctggaattc
      481 aacacaacag tgagctgtga ccagcaaggc acaaatcaca gagtccagag cagcattgcc
      541 ttcctgtgtg ggaaaaccct gggaactcct gaatttgtaa ctgcaacaga atgtgtgcac
      601 tactttgagt ggaggaccac tgcagcctgc aagaaagaca tatttaaagc aaataaggag
      661 gtgccatgct atgtgtttga tgaagagttg aggaagcatg atctcaatcc tctgatcaag
      721 cttagtggtg cctacttggt ggatgactcc gatccggaca cttctctatt catcaatgtt
      781 tgtagagaca tagacacact acgagaccca ggttcacagc tgcgggcctg tccccccggc
      841 actgccgcct gcctggtaag aggacaccag gcgtttgatg ttggccagcc ccgggacgga
      901 ctgaaggtgg tgcgcaagga caggcttgtc ctgagttacg tgagggaaga ggcaggaaag
      961 ctagactttt gtgatggtca cagccctgcg gtgactatta catttgtttg cccgtcggag
     1021 cggagagagg gcaccattcc caaactcaca gctaaatcca actgccgcta tgaaattgag
     1081 tggattactg agtatgcctg ccacagagat tacctggaaa gtaaaacttg ttctctgagc
     1141 ggcgagcagc aggatgtctc catagacctc acaccacttg cccagagcgg aggttcatcc
     1201 tatatttcag atggaaaaga atatttgttt tatttgaatg tctgtggaga aactgaaata
     1261 cagttctgta ataaaaaaca agctgcagtt tgccaagtga aaaagagcga tacctctcaa
     1321 gtcaaagcag caggaagata ccacaatcag accctccgat attcggatgg agacctcacc
     1381 ttgatatatt ttggaggtga tgaatgcagc tcagggtttc agcggatgag cgtcataaac
     1441 tttgagtgca ataaaaccgc aggtaacgat gggaaaggaa ctcctgtatt cacaggggag
     1501 gttgactgca cctacttctt cacatgggac acggaatacg cctgtgttaa ggagaaggaa
     1561 gacctcctct gcggtgccac cgacgggaag aagcgctatg acctgtccgc gctggtccgc
     1621 catgcagaac cagagcagaa ttgggaagct gtggatggca gtcagacgga aacaaagaag
     1681 aagcattttt tcattaatat ttgtcacaga gtgctgcagg aaggcaaggc acgagggtgt
     1741 cccgaggacg cggcagtgtg tgcagtggat aaaaatggaa gtaaaaatct gggaaaattt
     1801 atttcctctc ccatgaaaga gaaaggaaac attcaactct cttattcaga tggtgatgat
     1861 tgtggtcatg gcaagaaaat taaaactaat atcacacttg tatgcaagcc aggtgatctg
     1921 gaaagtgcac cagtgttgag aacttctggg gaaggcggtt gcttttatga gtttgagtgg
     1981 cacacagctg cggcctgtgt gctgtctaag acagaagggg agaactgcac ggtctttgac
     2041 tcccaggcag ggttttcttt tgacttatca cctctcacaa agaaaaatgg tgcctataaa
     2101 gttgagacaa agaagtatga cttttatata aatgtgtgtg gcccggtgtc tgtgagcccc
     2161 tgtcagccag actcaggagc ctgccaggtg gcaaaaagtg atgagaagac ttggaacttg
     2221 ggtctgagta atgcgaagct ttcatattat gatgggatga tccaactgaa ctacagaggc
     2281 ggcacgccct ataacaatga aagacacaca ccgagagcta cgctcatcac ctttctctgt
     2341 gatcgagacg cgggagtggg cttccctgaa tatcaggaag aggataactc cacctacaac
     2401 ttccggtggt acaccagcta tgcctgcccg gaggagcccc tggaatgcgt agtgaccgac
     2461 ccctccacgc tggagcagta cgacctctcc agtctggcaa aatctgaagg tggccttgga
     2521 ggaaactggt atgccatgga caactcaggg gaacatgtca cgtggaggaa atactacatt
     2581 aacgtgtgtc ggcctctgaa tccagtgccg ggctgcaacc gatatgcatc ggcttgccag
     2641 atgaagtatg aaaaagatca gggctccttc actgaagtgg cttccatcag taacttggga
     2701 atggcaaaga ccggcccggt ggttgaggac agcggcagcc tccttctgga atacgtgaat
     2761 gggtcggcct gcaccaccag cgatggcaga cagaccacat ataccacgag gatccatctc
     2821 gtctgctcca ggggcaggct gaacagccac cccatctttt ctctcaactg ggagtgtgtg
     2881 gtcagtttcc tgtggaacac agaggctgcc tgtcccattc agacaacgac ggatacagac
     2941 caggcttgct ctataaggga tcccaacagt ggatttgtgt ttaatcttaa tccgctaaac
     3001 agttcgcaag gatataacgt ctctggcatt ggtaagattt ttatgtttaa tgtctgcggc
     3061 acaatgcctg tctgtgggac catcctggga aaacctgctt ctggctgtga ggcagaaacc
     3121 caaactgaag agctcaagaa ttggaagcca gcaaggccag tcggaattga gaaaagcctc
     3181 cagctgtcca cagagggctt catcactctg acctacaaag ggcctctctc tgccaaaggt
     3241 accgctgatg cttttatcgt ccgctttgtt tgcaatgatg atgtttactc agggcccctc
     3301 aaattcctgc atcaagatat cgactctggg caagggatcc gaaacactta ctttgagttt
     3361 gaaaccgcgt tggcctgtgt tccttctcca gtggactgcc aagtcaccga cctggctgga
     3421 aatgagtacg acctgactgg cctaagcaca gtcaggaaac cttggacggc tgttgacacc
     3481 tctgtcgatg ggagaaagag gactttctat ttgagcgttt gcaatcctct cccttacatt
     3541 cctggatgcc agggcagcgc agtggggtct tgcttagtgt cagaaggcaa tagctggaat
     3601 ctgggtgtgg tgcagatgag tccccaagcc gcggcgaatg gatctttgag catcatgtat
     3661 gtcaacggtg acaagtgtgg gaaccagcgc ttctccacca ggatcacgtt tgagtgtgct
     3721 cagatatcgg gctcaccagc atttcagctt caggatggtt gtgagtacgt gtttatctgg
     3781 agaactgtgg aagcctgtcc cgttgtcaga gtggaagggg acaactgtga ggtgaaagac
     3841 ccaaggcatg gcaacttgta tgacctgaag cccctgggcc tcaacgacac catcgtgagc
     3901 gctggcgaat acacttatta cttccgggtc tgtgggaagc tttcctcaga cgtctgcccc
     3961 acaagtgaca agtccaaggt ggtctcctca tgtcaggaaa agcgggaacc gcagggattt
     4021 cacaaagtgg caggtctcct gactcagaag ctaacttatg aaaatggctt gttaaaaatg
     4081 aacttcacgg ggggggacac ttgccataag gtttatcagc gctccacagc catcttcttc
     4141 tactgtgacc gcggcaccca gcggccagta tttctaaagg agacttcaga ttgttcctac
     4201 ttgtttgagt ggcgaacgca gtatgcctgc ccacctttcg atctgactga atgttcattc
     4261 aaagatgggg ctggcaactc cttcgacctc tcgtccctgt caaggtacag tgacaactgg
     4321 gaagccatca ctgggacggg ggacccggag cactacctca tcaatgtctg caagtctctg
     4381 gccccgcagg ctggcactga gccgtgccct ccagaagcag ccgcgtgtct gctgggtggc
     4441 tccaagcccg tgaacctcgg cagggtaagg gacggacctc agtggagaga tggcataatt
     4501 gtcctgaaat acgttgatgg cgacttatgt ccagatggga ttcggaaaaa gtcaaccacc
     4561 atccgattca cctgcagcga gagccaagtg aactccaggc ccatgttcat caacgccgtg
     4621 gaggactgtg agtacacctt tgcctggccc acagccacag cctgtcccat gaagagcaac
     4681 gagcatgatg actgccaggt caccaaccca agcacaggac acctgtttga tctgagctcc
     4741 ttaagtggca gggcgggatt cacagctgct tacagcgaga aggggttggt ttacatgagc
     4801 atctgtgggg agaatgaaaa ctgccctcct ggcgtggggg cctgctttgg acagaccagg
     4861 attagcgtgg gcaaggccaa caagaggctg agatacgtgg accaggtcct gcagctggtg
     4921 tacaaggatg ggtccccttg tccctccaaa tccggcctga gctataagag tgtgatcagt
     4981 ttcgtgtgca ggcctgaggc cgggccaacc aataggccca tgctcatctc cctggacaag
     5041 cagacatgca ctctcttctt ctcctggcac acgccgctgg cctgcgagca agcgaccgaa
     5101 tgttccgtga ggaatggaag ctctattgtt gacttgtctc cccttattca tcgcactggt
     5161 ggttatgagg cttatgatga gagtgaggat gatgcctccg ataccaaccc tgatttctac
     5221 atcaatattt gtcagccact aaatcccatg cacggagtgc cctgtcctgc cggagccgct
     5281 gtgtgcaaag ttcctattga tggtcccccc atagatatcg gccgggtagc aggaccacca
     5341 atactcaatc caatagcaaa tgagatttac ttgaattttg aaagcagtac tccttgctta
     5401 gcggacaagc atttcaacta cacctcgctc atcgcgtttc actgtaagag aggtgtgagc
     5461 atgggaacgc ctaagctgtt aaggaccagc gagtgcgact ttgtgttcga atgggagact
     5521 cctgtcgtct gtcctgatga agtgaggatg gatggctgta ccctgacaga tgagcagctc
     5581 ctctacagct tcaacttgtc cagcctttcc acgagcacct ttaaggtgac tcgcgactcg
     5641 cgcacctaca gcgttggggt gtgcaccttt gcagtcgggc cagaacaagg aggctgtaag
     5701 gacggaggag tctgtctgct ctcaggcacc aagggggcat cctttggacg gctgcaatca
     5761 atgaaactgg attacaggca ccaggatgaa gcggtcgttt taagttacgt gaatggtgat
     5821 cgttgccctc cagaaaccga tgacggcgtc ccctgtgtct tccccttcat attcaatggg
     5881 aagagctacg aggagtgcat catagagagc agggcgaagc tgtggtgtag cacaactgcg
     5941 gactacgaca gagaccacga gtggggcttc tgcagacact caaacagcta ccggacatcc
     6001 agcatcatat ttaagtgtga tgaagatgag gacattggga ggccacaagt cttcagtgaa
     6061 gtgcgtgggt gtgatgtgac atttgagtgg aaaacaaaag ttgtctgccc tccaaagaag
     6121 ttggagtgca aattcgtcca gaaacacaaa acctacgacc tgcggctgct ctcctctctc
     6181 accgggtcct ggtccctggt ccacaacgga gtctcgtact atatgaatct gtgccagaaa
     6241 atatataaag ggcccctggg ctgctctgaa agggccagca tttgcagaag gaccacaact
     6301 ggagacgtcc aggtcctggg actcgttcac acgcagaagc tgggtgtcat aggtgacaaa
     6361 gttgttgtca tgtactccaa aggttatccg tgtggtggaa ataagaccgc atcctccgtg
     6421 atagaattga cctgtacaaa gacggtgggc agacctgcat tcaagaggtt tgatatcgac
     6481 agctgcactt actacttcag ctgggactcc cgggctgcct gcgccgtgaa gcctcaggag
     6541 gtgcagatgg tgaatgggac catcaccaac cctataaatg gcaagagctt cagcctcgga
     6601 gatatttatt ttaacctgtt cagagaatct ggggacatga ggaccaatgg ggacaactac
     6661 ctgtatgaga tcctactttc ctccatcaca agctccagaa acccggcgtg ctctggagcc
     6721 aacatatgcc aggtgaagcc caacgatcag cacttcagtc ggaaagttgg aacctctgac
     6781 aagaccaagt actaccttca agacggcgat ctcgatgtcg tgtttgcctc ttcctctaag
     6841 tgcggaaagg ataagaccaa gtctgtttct tccaccatct tcttccactg tgaccctctg
     6901 gtggaggacg ggatccccga gttcagtcac gagactgccg actgccagta cctcttctct
     6961 tggtacacct cagccgtgtg tcctctgggg gtgggctttg acagcgagaa tcccggggac
     7021 gacgggcaga tgcacaaggg gctgtcagaa cggagccagg cagtcggcgc ggtgctcagc
     7081 ctgctgctgg tggcgctcac ctgctgcctg ctggccctgt tgctctacaa gaagaagagg
     7141 agggaaacaa tgataagtaa gctgacctct tgctgtagga gaagttccaa cgtgtcctac
     7201 aaatactcaa aggtgaataa ggaagaagag acagatgaga atgaaacaga gtggctgatg
     7261 gaagagatcc agctgcctcc tccacggcag ggaaaggaag ggcaggagaa cggccatatt
     7321 accaccaagt cagtgaaagc cctcagctcc ctgcatgggg atgaccagga cagtacggat
     7381 gaggttctga ccatcccaga ggtgaaagtt cactcgggca ggggagctgg ggcagagagc
     7441 tcccacccag tgagaaacgc acagagcaat gcccttcagg agcgtgagga cgatagggtg
     7501 gggctggtca ggggtgagaa ggcgaggaaa gggaagtcca gctctgcaca gcagaagaca
     7561 gtgagctcca ccaagctggt gtccttccat gacgacagcg acgaggacct cttacacatc
     7621 tgactccgca gtgcctgcag gggagcacgg agccgcggga cagccaagca cctccaacca
     7681 aataagactt ccactcgatg atgcttctat aattttgcct ttaacagaaa ctttcaaaag
     7741 ggaagagttt ttgtgatggg ggagagggtg aaggaggtca ggccccactc cttcctgatt
     7801 gtttacagtc attggaataa ggcatggctc agatcggcca cagggcggta ccttgtgccc
     7861 agggttttgc cccaagtcct catttaaaag cataaggccg gacgcatctc aaaacagagg
     7921 gctgcattcg aagaaaccct tgctgcttta gtcccgatag gtatttgacc ccgatatatt
     7981 ttagcatttt aattctctcc ccctatttat tgactttgac aattactcag gtttgagaaa
     8041 aaggaaaaaa aaacagccac cgtttcttcc tgccagcagg ggtgtgatgt accagtttgt
     8101 ccatcttgag atggtgaggc tgtcagtgta tggggcagct tccggcggga tgttgaactg
     8161 gtcattaatg tgtcccctga gttggagctc attctgtctc ttttctcttt tgctttctgt
     8221 ttcttaaggg cacacacacg tgcgtgcgag cacacacaca tacgtgcaca gggtccccga
     8281 gtgcctaggt tttggagagt ttgcctgttc tatgccttta gtcaggaatg gctgcacctt
     8341 tttgcatgat atcttcaagc ctgggcgtac agagcacatt tgtcagtatt tttgccggct
     8401 ggtgaattca acctgcccaa agattgattt gtgtgtttgt gtgtgtgtgt gtgtgtgtgt
     8461 gtgtgtgtgt gagtggagtt gaggtgtcag agaaaatgaa ttttttccag atttggggta
     8521 taggtctcat ctcttcaggt tctcatgata ccacctttac tgtgcttatt tttttaagaa
     8581 aaaagtgttg atcaaccatt cgacctataa gaagccttaa tttgcacagt gtgtgactta
     8641 cagaaactgc atgaaaaatc atgggccaga gcctcggccc tagcattgca cttggcctca
     8701 tgctggaggg aggctgggcg ggtacagcgc ggaggaggag ggaggccagg cgggcatggc
     8761 gtggaggagg tgggaggccg ggcggtcaca gcatggagga ggagggaggc gctgctggtg
     8821 ttcttattct ggcggcagcg cctttcctgc catgtttagt gaatgacttt tctcgcattg
     8881 tagaattgta tatagactct ggtgttctat tgctgagaag caaaccgccc tgcagcatcc
     8941 ctcagcctgt accggtttgg ctggcttgtt tgatttcaac atgagtgtat tttttaaaat
     9001 tgatttttct cttcattttt ttttcaatca actttactgt aatataaagt attcaacaat
     9061 ttcaataaaa gataaattat t
//
LOCUS       HUMMRA       5185 bp ss-mRNA            PRI       10-AUG-1990
DEFINITION  Human mannose receptor mRNA, complete cds.
ACCESSION   J05550
KEYWORDS    mannose receptor.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 5185)
  AUTHORS   Taylor,M.E., Conary,J.T., Lennartz,M.R., Stahl,P.D. and
            Drickamer,K.
  TITLE     Primary structure of the mannose receptor contains multiple motifs
            resembling carbohydrate-recognition domains
  JOURNAL   J. Biol. Chem. 265, 12156-12162 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Drickamer, 18-MAY-1990.
FEATURES       from  to/span     description
    pept        104     4474     mannose receptor precursor
    sigp        104      157     mannose receptor signal peptide
    matp        158     4471     mannose receptor
BASE COUNT     1602 a    999 c   1198 g   1386 t
ORIGIN      
        1 gggaacttgg attaggtgga gaggcagttg gggggcctcg ttgttttgcg tcttagttcc
       61 gccctcctgt ccatcaggag aaggaaagga taaaccctgg gccatgaggc tacccctgct
      121 cctggttttt gcctctgtca ttccgggtgc tgttctccta ctggacacca ggcaattttt
      181 aatctataat gaagatcaca agcgctgcgt ggatgcagtg agtcccagtg ccgtccaaac
      241 cgcagcttgc aaccaggatg ccgaatcaca gaaattccga tgggtgtccg aatctcagat
      301 tatgagtgtt gcatttaaat tatgcctggg agtgccatca aaaacagact gggttgctat
      361 cactctctat gcctgtgact caaaaagtga atttcagaaa tgggagtgca aaaatgacac
      421 acttttgggg atcaaaggag aagatttatt ttttaactac ggcaacagac aagaaaagaa
      481 tattatgctc tacaagggat cgggtttatg gagcaggtgg aagatctatg gaaccacaga
      541 caatctgtgc tccagaggtt atgaagccat gtatacgcta ctaggcaatg ccaatggagc
      601 aacctgtgca ttcccgttca agtttgaaaa caagtggtac gcagattgca cgagtgctgg
      661 gcggtcggat ggatggctct ggtgcggaac cactactgac tatgacacag acaagctatt
      721 tggatattgt ccattgaaat ttgagggcag tgaaagctta tggaataaag acccgctgac
      781 cagcgtttcc taccagataa actccaaatc cgctttaacg tggcaccaag cgaggaaaag
      841 ctgccaacaa cagaacgctg agctcctgag catcacagag atacatgagc aaacatacct
      901 gacaggatta accagttcct tgacctcagg actctggatt ggacttaaca gtctgagctt
      961 caacagcggt tggcagtgga gtgaccgcag tcctttccga tatttgaact ggttaccagg
     1021 aagtccatca gctgaacctg gaaaaagctg tgtgtcacta aatcctggaa aaaatgctaa
     1081 atgggaaaat ctggaatgtg ttcagaaact gggctatatt tgcaaaaagg gcaacaccac
     1141 tttaaattct tttgttattc cctcagaaag tgatgtgcct actcactgtc ctagtcagtg
     1201 gtggccgtat gccggtcact gttacaagat tcacagagat gagaaaaaaa tccagaggga
     1261 tgctctgacc acctgcagga aggaaggcgg tgacctcaca agtatccaca ccatcgagga
     1321 attggacttt attatctccc agctaggata tgagccaaat gacgaattgt ggatcggctt
     1381 aaatgacatt aagattcaaa tgtactttga gtggagtgat gggacccctg taacgtttac
     1441 caaatggctt cgtggagaac caagccatga aaacaacaga caggaggatt gtgtggtgat
     1501 gaaaggcaag gatgggtact gggcagatcg gggctgtgag tggcctcttg gctacatctg
     1561 caagatgaaa tcacgaagcc aaggtccaga aatagtggaa gtcgaaaaag gctgcaggaa
     1621 aggctggaaa aaacatcact tttactgcta tatgattgga catacgcttt caacatttgc
     1681 agaagcaaac caaacctgta ataatgagaa tgcttattta acaactattg aagacagata
     1741 tgaacaagcc ttcctgacta gtttcgttgg cttaaggcct gaaaaatatt tctggacagg
     1801 actttcagat atacaaacca aagggacttt tcagtggacc atcgaggaag aggttcggtt
     1861 cacccactgg aattcagata tgccagggcg aaagccaggg tgtgttgcca tgagaaccgg
     1921 gattgcaggg ggcttatggg atgttttgaa atgtgatgaa aaggcaaaat ttgtgtgcaa
     1981 gcactgggca gaaggagtaa cccacccacc gaagcccacg acgactcccg aacccaaatg
     2041 tccggaggat tggggcgcca gcagtagaac aagcttgtgt ttcaagctgt atgcaaaagg
     2101 aaaacatgag aagaaaacgt ggtttgaatc tcgagatttt tgtcgagctc tgggtggaga
     2161 cttagctagc atcaataaca aagaggaaca gcaaacaata tggcgattaa taacagctag
     2221 tggaagctac cacaaactgt tttggttggg attgacatat ggaagccctt cagaaggttt
     2281 tacttggagt gatggttctc ctgtttcata tgaaaactgg gcttatggag aacctaataa
     2341 ttatcaaaat gttgaatact gtggtgagct gaaaggtgac cctactatgt cttggaatga
     2401 tattaattgt gaacacctta acaactggat ttgccagata caaaaaggac aaacaccaaa
     2461 acctgagcca acaccagctc ctcaagacaa tccaccagtt actgaagatg ggtgggttat
     2521 ttacaaagac taccagtatt atttcagcaa agagaaggaa accatggaca atgcgcgagc
     2581 gttttgcaag aggaattttg gtgatcttgt ttctattcaa agtgaaagtg aaaagaagtt
     2641 tctatggaaa tatgtaaaca gaaatgatgc acagtctgca tattttattg gtttattgat
     2701 cagcttggat aaaaagtttg cttggatgga tggaagcaaa gtggattacg tgtcttgggc
     2761 cacaggtgaa cccaattttg caaatgaaga tgaaaactgt gtgaccatgt attcaaattc
     2821 agggttttgg aatgacatta actgtggcta tccaaacgcc ttcatttgcc agcgacataa
     2881 cagtagtatc aatgctacca cagttatgcc taccatgccc tcggtcccat cagggtgcaa
     2941 ggaaggttgg aatttctaca gcaacaagtg tttcaaaatc tttggattta tggaagaaga
     3001 aagaaaaaat tggcaagagg cacgaaaagc ttgtataggc tttggaggga atctggtctc
     3061 catacaaaat gaaaaagagc aagcatttct tacctatcac atgaaggact ccactttcag
     3121 tgcctggact gggctgaatg atgtcaattc agaacacacg ttcctttgga cggatggacg
     3181 aggagtccat tacacaaact gggggaaagg ttaccctggt ggaagaagaa gcagtctttc
     3241 ttatgaagat gctgactgtg ttgttattat tggaggtgca tcaaatgaag caggaaaatg
     3301 gatggatgat acctgcgaca gtaaacgagg ctacatatgc cagacacgat ccgacccttc
     3361 cttgactaat cctccagcaa cgattcaaac agatggcttt gttaaatatg gcaaaagcag
     3421 ctattcactc atgagacaaa aatttcaatg gcatgaagcg gagacatact gcaagcttca
     3481 caattccctt atagccagca ttctggatcc ctacagtaat gcatttgcgt ggctgcagat
     3541 ggaaacatct aatgaacgtg tgtggatcgc cctgaacagt aacttgactg ataatcaata
     3601 cacttggact gataagtgga gggtgaggta cactaactgg gctgctgatg agcccaaatt
     3661 gaaatcagca tgtgtttatc tggatcttga tggctactgg aagacagcac attgcaatga
     3721 aagtttttac tttctctgta aaagatcaga tgaaatccct gctactgaac ccccacaact
     3781 gcctggcaga tgcccggagt cagatcacac agcatggatt cctttccatg gtcactgtta
     3841 ctatattgag tcctcatata caagaaactg gggccaagct tctctggaat gtcttcgaat
     3901 gggttcctct ctggtttcca ttgaaagtgc tgcagaatcc agttttctgt catatcgggt
     3961 tgagccactt aaaagtaaaa ccaatttttg gataggattg ttcagaaatg ttgaagggac
     4021 gtggctgtgg ataaataaca gtccggtctc ctttgtcaac tggaacacag gagatccctc
     4081 tggtgaacgg aatgattgtg tagctttaca tgcgtcttct gggttttgga gtaatattca
     4141 ctgttcttcc tacaaaggat atatttgtaa aagaccaaaa attattgatg ctaaacctac
     4201 tcatgaatta cttacaacaa aagctgacac aaggaagatg gacccttcta aaccgtcttc
     4261 caacgtggcc ggagtagtca tcattgtgat cctcctgatt ttaacgggtg ctggccttgc
     4321 cgcctatttc ttttataaga aaagacgtgt gcacctacct caagagggcg cctttgaaaa
     4381 cactctgtat tttaacagtc agtcaagccc aggaactagt gatatgaaag atctcgtggg
     4441 caatattgaa cagaatgaac actcggtcat ctagtacctc aatgcgattc tgagatattt
     4501 gaatttcata aaattgtaac tgaaatttaa aatttttagt tcaatgtgat tgttttcttt
     4561 aaaatgagta ctgaattgta ctggtctgtc cttttttcct ttgcctaatt gaagaaataa
     4621 ttgcttgttt tctagcctgg caagatattt tcataaaaga gggataacaa tgctgattac
     4681 taccttttaa aatattttag ataaatgcac agcaccacag caccacatct aagcattagt
     4741 gatgggtagc tgatgtcagc ttcatgtgga ttttaagcac tctagaaaca atgaagcttc
     4801 ttggcatatt ttaaggagct cccaaaatgt gttacctatt aaattgtaac tcagcaagta
     4861 gaagaccatt tgaaaagtca ggtacaaatt tcctcaagtg gcataaaaat gtagtcagtt
     4921 ttctctttta ccagttttta tttccactcc aattatttag aactttattt gtacatgtgc
     4981 agaagaataa ggcagctgag aatcttgttt cccccaagag agttttacag gctgagtgtt
     5041 gcaaatgtgt tctttgtcct gttatatgta tatcaggaat acaaggatgt gaaataaaac
     5101 tgtaaatttg cataactgga tgtacttaga taatgtgaaa taaacattaa agacaaggtc
     5161 tatttttaat aaaaaaaaaa aaaaa
//
LOCUS       BFRACP       1788 bp ss-RNA             PHG       10-AUG-1990
DEFINITION  Bacteriophage fr maturation and coat protein genes, complete cds,
            and replicase gene, 5' end.
ACCESSION   M31635
KEYWORDS    coat protein; maturation protein; replicase.
SOURCE      Bacteriophage fr RNA.
  ORGANISM  Bacteriophage fr
            Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; 
            Leviviridae.
REFERENCE   1  (bases 1 to 1788)
  AUTHORS   Berzin,V., Avots,A., Jansone,I., Gintnere,L. and Tsimanis,A.
  TITLE     Sequence of the genes coding for the A-protein and coat protein of
            bacteriophage fr
  JOURNAL   Nucleic Acids Res. 15, 6741-6741 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        121     1302     maturation protein (gtg start codon)
    pept       1328     1720     coat protein
    pept       1754  >  1788     replicase protein
BASE COUNT      443 a    454 c    473 g    418 t
ORIGIN      
        1 cccctttcgg ggtcctgctc gacttcctgt caagctaaat gccattttta atgtctttag
       61 cgagacgcta ccatggctat cgctgtaggt agccgcaatt ccattgctag ggagcctcgt
      121 gtgcgaaagt tcataccaac cgagagaatg agtaagagcc acgtggtctc cgtccgcgaa
      181 tatgctgacg gggaactcga ggataactct ttaccgttga tttaccgatc caattggtcg
      241 ccgggccagt acacaagtac gggtccgcgc acaaaggaat ggcactaccc atcgtcctac
      301 tcacgtgggg cgataggaat caaggctctt gatcaaggta agtacgcacg attgggcaca
      361 tcgtggggcc gcgagttcga agaacgtgcc ggttatggca tgtctatcga cgcacgtagt
      421 tgttatagcc tattccccgt tagtcaaaac ttaacgtgga tagacgtacc aacgaacgta
      481 gctaatcgcg ccacgactga ggtcttaggt aaggtcactc aaggtaattt taaccttggc
      541 gtggcccttg ctgaagcccg gtctacggcc tcacaactgt cgacgcaaac catcgcgttg
      601 attaaggcgt acaccgcggc tcgccgcggg aactggcgcc aggcactccg ctacttagcg
      661 ctgaacgaga atcgaaaatt caattcgaag tccgtcgcaa gcaggtggct ggagttgcag
      721 ttcgggtgga tgccgcttct cagcgatatc caaggtgcgt atgagatgct caccaaagtg
      781 catcttaagg catttatgcc tatgcgtgcc gtgagtcaag tcggacaaaa cgtcagtttg
      841 tctggccggc tcacttctcc ggctgcaagc tataagtcta cgtgcaacat atcacgacgc
      901 attgtgatat ggttttacat aaacgatgca cgtctggctt ggctgtcctc cttagggatt
      961 ttgaacccgc taggaatagt gtgggaaaag gtcccgttct ctttcctggt cgattggttg
     1021 ctgccggttg ggaacatgct tgaggggctt accgccccga taggctgttc gtatcaatcg
     1081 ggaacagtaa ccgacgtaat atcaggagag tcgacaataa tcgccgatga catctatggt
     1141 tgggatacag tgcgacctgc aaccgctaag gtgcaaatca gtgctgtcca ccggggggta
     1201 caaagcgtgt ggcccacaac gggcgtatac gttaagtcac ctttctcgat ggtccatacc
     1261 ttagatgcct tggcactttt caggcaacgc ctctggaaat agaggcccta accgaaggga
     1321 gagccacatg gcttcgaact ttgaagagtt cgttctcgtc gacaatggcg gaacgggaga
     1381 tgtaaaagtc gctccgagca acttcgctaa cggggttgca gaatggatca gctcgaactc
     1441 acgttctcag gcttacaaag tgacctgtag cgtgcgtcag agctctgcga acaatcggaa
     1501 atacaccgtc aaggtcgagg tcccgaaagt ggcaactcag gtccaaggcg gcgttgagct
     1561 tcctgttgcg gcgtggcgct cgtacatgaa tatggaatta actattccgg tattcgcgac
     1621 gaacgacgac tgtgccttaa tcgttaaggc attgcaaggc acctttaaaa ctggtaaccc
     1681 aattgcaaca gccatcgcag ccaactcggg aatctactaa gaaacccgtg ccattccaac
     1741 atgaggaata cccatgtcaa aatcaacaaa gaagttcaac tctttatg
//
LOCUS       HUMERCC1A    1098 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human excision repair protein (ERCC-1) mRNA, complete cds.
            cDNA clone pcDE.
ACCESSION   M28650
KEYWORDS    alternative splicing; excision repair protein.
SOURCE      Human, cDNA to mRNA, clone pcDE.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1098)
  AUTHORS   Hoeijmakers,J.H.J.
  TITLE     Characterization of genes and proteins involved in excision repair
            of human cells
  JOURNAL   J. Cell Sci. Suppl. 6, 111-125 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        143     1036     excision repair protein
    site        863      916     alternatively spliced out exon
BASE COUNT      243 a    352 c    312 g    191 t
ORIGIN      
        1 aagtgctgcg agccctgggc cacgctggcc gtgctggcag tgggccgcct cgatccctct
       61 gcagtctttc ccttgaggct ccaagaccag caggtgaggc ctcgcggcgc tgaaaccgtg
      121 aggcccggac cacaggctcc agatggaccc tgggaaggac aaagaggggg tgccccagcc
      181 ctcagggccg ccagcaagga agaaatttgt gatacccctc gacgaggatg aggtccctcc
      241 tggagtggcc aagcccttat tccgatctac acagagcctt cccactgtgg acacctcggc
      301 ccaggcggcc cctcagacct acgccgaata tgccatctca cagcctctgg aaggggctgg
      361 ggccacgtgc cccacagggt cagagcccct ggcaggagag acgcccaacc aggccctgaa
      421 acccggggca aaatccaaca gcatcattgt gagccctcgg cagaggggca atcccgtact
      481 gaagttcgtg cgcaacgtgc cctgggaatt tggcgacgta attcccgact atgtgctggg
      541 ccagagcacc tgtgccctgt tcctcagcct ccgctaccac aacctgcacc cagactacat
      601 ccatgggcgg ctgcagagcc tggggaagaa cttcgccttg cgggtcctgc ttgtccaggt
      661 ggatgtgaaa gatccccagc aggccctcaa ggagctggct aagatgtgta tcctggccga
      721 ctgcacattg atcctcgcct ggagccccga ggaagctggg cggtacctgg agacctacaa
      781 ggcctatgag cagaaaccag cggacctcct gatggagaag ctagagcagg acttcgtctc
      841 ccgggtgact gaatgtctga ccaccgtgaa gtcagtcaac aaaacggaca gtcagaccct
      901 cctgaccaca tttggatctc tggaacagct catcgccgca tcaagagaag atctggcctt
      961 atgcccaggc ctgggccctc agaaagcccg gaggctgttt gatgtcctgc acgagccctt
     1021 cttgaaagta ccctgatgac cccagctgcc aaggaaaccc ccagtgtaat aataaatcgt
     1081 cctcccaggc caggctca
//
LOCUS       MZECPPETE    3916 bp ds-DNA             ORG       10-AUG-1990
DEFINITION  Maize chloroplast cytochrome b559 alpha- and beta-subunit (psbE and
            F), cytochrome b6-f complex (petE), and photosystem II polypeptide,
            complete cds, Pro-tRNA and Trp-tRNA.
ACCESSION   J04502
KEYWORDS    cytochrome; photosystem II protein; transfer RNA-Pro;
            transfer RNA-Trp.
SOURCE      Maize (strain FR9 cms + FR37) DNA, clone pZmc503.
  ORGANISM  Chloroplast Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE   1  (bases 2565 to 2725)
  AUTHORS   Haley,J. and Bogorad,L.
  TITLE     A 4-kDa maize chloroplast polypeptide associated with the
            cytochrome b-6-f complex: Subunit 5, encoded by the chloroplast
            petE gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1534-1538 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3916)
  AUTHORS   Haley,J. and Bogorad,L.
  TITLE     Expression of the maize chloroplast petE gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by L.Bogorad, 07-MAY-1989.
            
            Author address: J.Haley
               Department of Cellular and Developmental Biology
               Harvard University Biological Laboratories
               16 Divinity Ave
               Cambridge, MA 02138
               Ph: 617-495-4260
FEATURES       from  to/span     description
    pept        454      332 (c) ORF40
    pept        696      580 (c) photosystem II protein (psbL)
    pept        838      719 (c) cytochrome b559 beta subunit (psbF)
    pept       1100      849 (c) cytochrome b559 alpha subunit (psbE)
    pept       2337     2432     ORF31
    pept       2596     2709     cytochrome b6-f complex subunit 5 (petE)
    pept       3499     3627     ORF42
    tRNA       2903     2830 (c) Trp-tRNA
    tRNA       3115     3042 (c) Pro-tRNA
BASE COUNT     1242 a    684 c    658 g   1332 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatcctctc ccctctttct tgttgcttca taagagtgaa tccattttat gggcgaaagg
       61 cgggggcttt aaatcgaccg atggtttgct tcactcacta acatcattaa caaacaaaag
      121 aataaatgga gggattccaa ccatcagagc aaaggttttc tctttgttat ttttacaaat
      181 agaaataggt aaccaatttc tagattatgg aacaaaaccg cgttataaca ataagaattc
      241 cgcgggccct ttccgctcta atcagataaa ggggagtaag gacccgctaa gttcctactt
      301 tttcatgttt acaatctgga tcctccgatt actatagaga tgaacccaat ccagaatacg
      361 aaccgtaaaa gaaaaaacct attaaaccaa tcacaagaat accagttaca gtacctatca
      421 gccaaagagg aattcttcca gtagtatcgg ccatttcccc tactttcctc cacattttct
      481 caagtggtca tgctagagac aaaaacagtc atggatagtt ataaagatgg tatccttcca
      541 aatgggataa gagaattctt actactctct ttctttctct caattgaaga agtaattgga
      601 aaataaaaca gcaagtacaa aaatgagtaa taaaccccag tatagactgg tacgattcaa
      661 ttcaacattt tgttcattcg ggtttgattg tgtcatagtt ctatagttgg aatttggttt
      721 atcgttggat gaactgcatt gctgatattg atcccaagaa aaaaaccgtg ggtacagcta
      781 atccgtgaac agccagccat cgcactgtaa aaataggata ggttcgatct atggtcattg
      841 ggggcctcct aaaaggatct actaaattca tctagttgtt ctaaagaatc aaaacggtcg
      901 gttattaatg gaattccttg tcggctttcc gtgaaatact cgtttggtcg aggacttcca
      961 aacacgtcat aagctaaacc cgtactgaca aataaccaac ccgcaatgaa tagggaaggt
     1021 atagtaatgc tatgaataac ccagtatcga atactggtaa taatatcagc aaaagaacgt
     1081 tctcccgtgc ttccagacat gctgagctcc caaaattttt attcaaaaaa ggaattgatt
     1141 ccgtaaaaga tgggatccac cagtaaatag aaaattactg atatttcatc cttgtgagat
     1201 tgtcaatttt gtaccaaagg tgtattttga gtataccgaa ttagtatagc tatccttcct
     1261 atggcacagc aatccagttt tgcttggtcc cgaaacagaa ttcctttttt ctcttttttg
     1321 ttccttgtct ataggaaaat tacatgttat tcaaggcatc aatagaaccc cacaattttt
     1381 tgggtcctac ttattttcat tgtcttcgga atagtagaat aatttaattt ggaatagcgg
     1441 ccaagatctt gggaaaatct aagttaatga ttaataagat aaagaattta ggaaggatat
     1501 tctcatattg acgcaataca aagataagta tatgcaaaat cgatcccttt ttagttaaag
     1561 gaatttaatt ggttagcata aaataatatc taataaatag aaaatcgaat agcggataat
     1621 ccgttatgaa agaaacggaa tacattcttt gaagaatcaa gattcgtaat caatccttgt
     1681 cttgtttgtt ggattaggtc taattttctt gaccaaacag caagcatgga actttacaag
     1741 aagaacttaa aagaaaagga taattgaagt aactttgaat ccacttttgt tggggttcaa
     1801 aaaacgaata aaaataaata aaaaaaagta aatttaagga atagttccct ttttgagggg
     1861 gccctcgggg gtcgtggaat gcttttcttc tcctcttatt ccatatggaa tacaatgagt
     1921 taaaatttgc tccaaaaaga ggatgaaatc catcctatta aaaaagaaat aatccgaatc
     1981 cgaaatagaa agaatttttt ttttttcaaa ttcaattctt tatttatctt ttattccaaa
     2041 attctcccga aaatccaatt tcatttttca atggggttag atgatctagt tcttaatatt
     2101 attagtttac ttaactgaca gattccacaa caaatctctt gattcggaat tagggactca
     2161 tgttccgtct gatgaatcga ttttctcttt tttttacact tctgtatctc actctatctt
     2221 gttttttagt attatctaaa ataaccgatg aattatgaat tttccataac ttaaacttag
     2281 gtaagtgctt taccaacata tgtagtgtag taaaaaaaat ggaattgaac cccttcatgc
     2341 ttactataac tagttatttc ggttttctac tggctgcttt aactataacc ccagctctat
     2401 ttattagctt gaacaagata cgtcttattt gaaatgaatt gaatagaaga atctttcttt
     2461 tggattcttg gtattctaga ctcttttcca cactaattac caattctttt cttggtcatt
     2521 gagattcgtg ggtagtttag actattattt agagatagat cgtacctctt tttttatccc
     2581 ctcgaacaaa tcgaaatgat tgaagttttt ctatttggaa tcgtcttagg cctaattcct
     2641 attactttag cgggattatt cgtgactgcg tatttgcaat acaggcgtgg ggatcagttg
     2701 gatctttgat tgagtaatat ttcttttttg attgacctcc tctctggtct ggaggaggtc
     2761 aaattggagt tgcaattcta ctttgttttt ttttaagtta ttttagtttc gacataagat
     2821 atatggaatc acgctctgta ggatttgaac ctacgacatc gggttttgga gacccgcgtt
     2881 ctaccaaact gaactaagag cgctttcaaa acaaaaaaaa aaaagaaaat ccttttctat
     2941 tcctaacgtg tctcacgtcc gtatagtatc cacaaattca agttataccc actttaatcg
     3001 atctccccac tactgcccat aacgaagaga gaattaatag gtagggatga caggatttga
     3061 acctgtgaca ttttgtaccc aaaacaaacg cgctaccaag ctgcgctaca tcccttttcc
     3121 aaattgttgt acaatgccat tgtacacaat tcctttcttg ttttccacat cgtaattttc
     3181 ttctatttct ttatccatat agaactttct tgtcatttct tgtttttggt ctcatataag
     3241 tcaaggaagg ggtataaact aaaatccagt caaatttcac ctataaaaga aagactacta
     3301 ttccttagta atgtatagga agaaggggtc atctttttta gggataggaa aatctcgtct
     3361 atacggttca ttctatatat atataatatt tattttgttt ttttagttag gaatttcgcc
     3421 taaacaaaag aaatacaaag gatcttgggc aagagtatct gatcatatat gtattccaat
     3481 acggaaggag gattttcaat gcgggatata aaaacatatc tctctgtagc acccgtgcta
     3541 agtactctat ggtttggggc tttagcaggt ttattgatag aaatcaatcg tttattccca
     3601 gatgctttgt cattcccttt tttttaatta tagttattcc tatgcgagag atagaattct
     3661 tcgtgacatg acgaaaattc cccctttttg aattcttttt tagtatatga agcaaaaaga
     3721 aagaaaagat ggataaggat tgtattcttt aattatttct ctatgtttta ttacttaatt
     3781 tacgaatttc aaaaattttg tattctattg gattggattt gttcgagaat tcgaagaatt
     3841 acaacaaaat ctttagaaat cacattttta gttaggaact tctatggatt ttattcttct
     3901 tcttttcttt ggatcc
//
LOCUS       RATLOX       2672 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Rat aorta lysyl oxidase mRNA, complete cds.
ACCESSION   J02903
KEYWORDS    lysyl oxidase.
SOURCE      Rat neonatal aorta, cDNA to mRNA, (library of Clontech), clones 7,
            13, 8-1 and IIB.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2672)
  AUTHORS   Trackman,P.C., Pratt,A.M., Wolanski,A., Tang,S.-S., Offner,G.D.,
            Troxler,R.F. and Kagan,H.M.
  TITLE     Cloning of rat aorta lysyl oxidase cDNA: Complete codons and
            predicted amino acid sequence
  JOURNAL   Biochemistry 29, 4863-4870 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            P.C.Trackman, 07-FEB-1990.
FEATURES       from  to/span     description
    pept        287     1516     lysyl oxidase precursor
    sigp        287      349     pot. lysyl oxidase signal peptide
    matp        350     1513     lysyl oxidase
    site       2252     2274     region of dyad symmetry
BASE COUNT      650 a    763 c    622 g    637 t
ORIGIN      192 bp upstream of TaqI site.
        1 cttctacttc agacactgtg cgctctcccg gaccgtcgtg cgccgctccc cgtcgccttc
       61 caggactggg aaaggggaga ggaggacggt gccacgtccg acggcctcct gggctggggg
      121 cagggtctgc cgctcgccct tggcaccagt ccctgcgacc catccccgag cctcgccctc
      181 ctcctccctg ctcgaagagg tctccctcct tcgcgggatc tgagtccctg tcttcatttt
      241 tctcctagcc acgtccctcc ccgagaaggg acgagccggg agcatcatgc gtttcgcctg
      301 gaccgtgctc tttctgggac agctgcagtt ctgtcccctt ctccgctgcg ccccgcaggc
      361 cccgcgcgag cctcccgccg cccccggtgc ctggcgccag acaatccaat gggagaacaa
      421 cgggcaggtg ttcagtctgt tgagcctggg ggcgcagtac cagcctcagc gacgccgcga
      481 ctccagcgcc actgccccga gagccgacgc aacgctgcag cacagccacg cacgcccatt
      541 ctgctgctgc gtgacaaccg cactgcctct gcccgtgcga ggactccaag cccatctggg
      601 gtcgccgcgg gtcgtccccg gcccgcagcc cgccactggt tccaagttgg tttctcgccg
      661 tcgggggccg gcgatggagc ctcaaggcgc gcagaaccgg actgcgtcgc cacagcctcc
      721 gcagctcagt aatctgaggc cacccagcca cgtagatcgc atggtggcga cgacccctac
      781 aatccctaca agtactccga cgacaacccc tattataact actatgacac ttatgagaga
      841 ccggtccggg agcaggcacc gacctggata tggcaccggt tacttccagt acggtctccc
      901 ggacctggta cccgatccct actacatcca ggcatccacg tacgtacaaa agatgtctat
      961 gtacaacctg agatgcgctg cggaagaaaa ctgcctggcc agttcagcat atagggcgga
     1021 tgtcagagac tatgaccaca gggtactgct acgatttcct cagagagtga aaaaccaagg
     1081 gacgtctgac ttcttaccaa gccgcccccg ctactcctgg gagtggcaca gctgccacca
     1141 acattaccac agcatggatg aattcagcca ctacgacctg ctggatgcca gcacacagag
     1201 gagagtggcc gagggccaca aagcaagctt ctgtctggag gacacttcct gtgattatgg
     1261 gtaccacaga cgatttgcct gtactgcaca cacacagggg ttgagtcccg gatgttatga
     1321 tacttatgca gcagacatag actgccagtg gattgatatt acagatgtac aacccggaaa
     1381 ttacattcta aaggtcagtg taaaccccag ctacctggtg cctgaatcag actacagtaa
     1441 caatgtcgta cgctgtgaaa ttcgctacac aggacatcac gcctatgcct caggctgcac
     1501 catttcaccg tattagaaag aagctcacct tcccaaagga tgaagcagta cctggtgttt
     1561 ggacctatga aaaccgtaga ttagcttaag taggaagact tagatatttt aaaaggcaaa
     1621 cggaaaaaca acaaagaagg ttttgtttgg actctttcac aacaaatcac ataactggat
     1681 tttgagtgtt taaatcagca ttagattggc acattttaaa tacttattca tgttgcttta
     1741 tgaagtaatg gtgtttcaat tctgtgggtg catagtgggc tctttcaaag aattctgaat
     1801 ttcttacctt cttttgaaat tatagtgcaa aaagaagagg atattttaat gaatgagcca
     1861 caatttgaac tgattacttt ctaaattgcc agacccatga gacaatgatg atgggtttgt
     1921 atttgcctca acatagattc gctttttaaa aagggtgttc ctattgtata ggcaaaaatg
     1981 gatacacttg gtgctgagga agggtcaaat actaactatt gttgtcacga aatataggtc
     2041 tacagcagag agatggtgag tatatattca gatagttaca tccctatata aactatgttt
     2101 acattttaga tgcttttctt tctgttaatt gcttaatctc actctgactt gaggtacaac
     2161 ttctgttttg gaatgaatta gataattcca gattctggtt tgataattgt tgacattccc
     2221 ccatgctact ttttctgagg gcagaaacgt ctaatgtgac gactcttcac attaccatta
     2281 cgaggataca cagcacagcg aaatcattcc gatgacaggt gtgatagatg gagagctaac
     2341 atgcaactgc cgagtgtttc actgttagcc agaactaagt cacttgcccc acacagcaat
     2401 tacaccatga atctctaaca tcacaacctt ctttcaaata cccacggact catccatcct
     2461 tccatccgtc atccatccat ccgtccgtcc gtccgtcctg actgcctagt gccactgtct
     2521 ggctaggcac acccactatc aacctggttc acctgtcatg gcagcctgta cccacccccg
     2581 ccacacaccc cgacgctggc ctatagtgca aaggttgtgc gggctggtcc ttcccacaat
     2641 gcagtactgt aatccccgtc cctcctggag cc
//
LOCUS       RHMDCTABA     764 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  R.meliloti dctA and dctB genes, 5' end.
ACCESSION   M33555
KEYWORDS    regulatory protein.
SOURCE      R.meliloti DNA.
  ORGANISM  Rhizobium meliloti
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 764)
  AUTHORS   Wang,Y.-P., Birkenhead,K., Boesten,B., Manian,S. and O'Gara,F.
  TITLE     Genetic analysis and regulation of the Rhizobium meliloti genes
            controlling C-4-dicarboxylic acid transport
  JOURNAL   Gene 85, 135-144 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        726  >   764     dctA protein
BASE COUNT      149 a    239 c    252 g    124 t
ORIGIN      
        1 cccgggccgc ttgctgaccg tgcccatggc gaaatgttcg gccatgccgt cggcggacgg
       61 cgagccggaa ataatcgcgg aaggcatagt cgttgccgac gaagctcgtc ggctcctgcc
      121 agttgctggc cgcgacggcg acgccgctgc ggtcgatcag ataaatgacc gcggcttcgg
      181 cgcttgtcgc cagggcctcg agcttacggt tgatgcggtc gagcgacggc ctgtccggcg
      241 aaagcaatgc gccacgaatg gctgcgtcgt cggcgaggac aagcggcagg gcgcgctgcc
      301 gttccacgac tgctcgaaga agcgaggctt tcaggctggc gtcgatccgg ctctgaccgg
      361 caaggccggc gagcgcctgc gaccggccgt agtctcgcgc aagaagaagg ccggctgcca
      421 gaaggaccag cgcgactgcc gcgaaaacga gccaggaccg tcgggcccgg ctgcgaacgc
      481 atgcgatcgc tcgcttctgc aggaagtttg accatgcgaa cattgtgcat gttttcgccc
      541 aggacgccag cacttctgtg cggaaatccg cacatatcca cgaacggcaa gcgagcaccg
      601 ctcccaaaaa tgtcatggat tcaatcgcat aggccttccg cggcaaactg gcacgcatgt
      661 tgctgacaag ctccacaagg cagccactgc tgtcgatctt cggaaagcgg cccgggaggc
      721 ccggcatgtt gccggactgg gcctgccacg tggaggatat catg
//
LOCUS       BSUCOMGA     6278 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  B.subtilis comG-[1,2,3,4,5,6,and 7] proteins in comG operon,
            complete cds.
ACCESSION   M29691 M22854
KEYWORDS    exogenous DNA-binding.
SOURCE      B.subtilis (strain 168) DNA, clone pED4.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 6278)
  AUTHORS   Albano,M., Breitling,R. and Dubnau,D.A.
  TITLE     Nucleotide sequence and genetic organization of the Bacillus
            subtilis comG operon
  JOURNAL   J. Bacteriol. 171, 5386-5404 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by D.A.Dubnau, 02-MAR-1989.
            
            ORF1 is required for the ability of competent cultures to resolve
            into two populations with different cell densities on Renografin
            gradients, as well as for full expression of comE, another late
            competent locus. ORF1 shows significant similarity to the virB
            ORF11 protein from Agrobacterium tumefaciens, which is probably
            involved in T-DNA transfer.  The N-terminal sequences of comG ORF3
            and, to a lesser extent, the comG ORF4 and 5 proteins are similar
            to a class of pilin proteins from members of the genera
            Bacteroides, Pseudomonas, Neisseria and Moraxella.  All of the comG
            proteins, except ORF1, possess hydrophobic domains that are
            potentially capable of spanning the bacterial membrane and may be
            part of the DNA transport machinery.
FEATURES       from  to/span     description
    pept       1298     2368     comG-1 protein (ttg start codon)
    pept       2420     3391     comG-2 protein
    pept       3405     3701     comG-3 protein
    pept       3691     4122     comG-4 protein (ttg start codon)
    pept       4106     4453     comG-5 protein
    pept       4863     5237     comG-7 protein
    ORF        4365     4862     comG-6 protein (put.)
    ORF     <     1      886     ORF1 (AA at 2)
    ORF        6126     6276     ORF2
    mRNA       1278  >  6278     comG mRNA
    signal     1100     1124     pot. transcription termination signal
    signal     5501     5533     transcription termination signal
    signal     5696     5715     transcription termination signal
    signal     1243     1248     comG -35 region
    signal     1266     1271     comG -10 region
BASE COUNT     1960 a   1122 c   1383 g   1813 t
ORIGIN      1 bp upstream of BglII site.
        1 agatctaatc catttttctc actggcctca gtgtgaaaag tggtttgaaa ataaccatca
       61 cgttaatttt ttgcgagtag atacaactga aacggaaaat gaagcagtat ttgggtcgat
      121 tgtttatgat caggggcttg gtgaagaaaa agaccatact gtttttcact tttatatcac
      181 cagacaatat ttttttacaa tcaactttga cttttcaatt ttgagagaga ttaaaggcaa
      241 agaagttgtt cggcaaatgg aaagagcgga caatgcgata gaggggtttt taattcttct
      301 cggcgaacta atgaatgcgt atttaatcgg tgttgatgaa tttgaagtca agctgagaaa
      361 gctcagatgg caaattaaag acgacaatag caaaagcatt ttaaaccgcg tccatctcct
      421 gcgccatgaa ctgatgattt ggaaaaattt gatattaagc gctaaaaaaa ttgaaatggc
      481 gttgaaagaa acctttttac ctcaaaatga agggaaaaag gattatcagc ggacacaact
      541 gaagattgac aggggattta catacatcag cgaatttgaa ggggagctta acaatctgct
      601 gcattcagag gaagtcatta cctcacatag ggggaatgaa attgtaaaag cgctgaccat
      661 tttcacgacg ctttttactc cgattacagc tctgggtgcc ttatggggga tgaacttttc
      721 agtgatgccg gaactgaatt ggaaatacgg atatctcttt tccctcttat tgattgtcac
      781 atctacagtt ctgatctatc tctatttgag aaaaaaaggc tggacgggag atatgctgca
      841 ggagcggaag aagaaaaaga aacctcgaaa aaggcggact ctataggatg tttcatattt
      901 tgtgcagcgt gccccgcttt ttcaccagac atatcagggt gaccggatac gatgtcaagg
      961 ggcttatgac agagcattaa atccgcagtt tatcgattct tgaaaatgac caaatgaccg
     1021 gtattgttgc attaggcgat ctttccgttg agaaagatac tggtcaataa gcgaaaacag
     1081 cataatgaaa atggaatcta gcaggcatgg tgaccatgtc tgctttttta tttataggga
     1141 aaattataat gacaggggta cattcagttg aaagtctttt ttcttgccag aaagaattgg
     1201 tttttcagca tataacatct cacaaaatca cgttttccct gtttgattac cttttcttct
     1261 ttttctacaa tatgcgttga aaggagaggg aatcaaattg gattcaatag aaaaggtaag
     1321 caaaaacttg attgaagagg catatctaac aaaggcttct gatattcaca ttgtgccgag
     1381 ggagcgggac gctatcattc attttcgggt cgatcatgcc ttgctgaaaa aaagggacat
     1441 gaaaaaagaa gagtgcgtaa gactgatttc acattttaaa tttctttcag caatggatat
     1501 aggtgaaagg cgaaagccgc aaaacggttc gcttacgtta aagttgaaag agggaaatgt
     1561 tcatttaaga atgtcaacgc tgcccacaat taatgaagaa agcctcgtga tcagagtgat
     1621 gccccaatac aatatccctt cgattgataa attgtcgcta tttccgaaga caggagccac
     1681 attactctcg tttttaaaac attcccatgg catgctcatt tttaccgggc cgactggttc
     1741 agggaagact accacattat actctctcgt tcaatatgca aaaaaacact ttaatcgaaa
     1801 tattgtcaca ttagaggacc ctgttgaaac aagggacgaa gatgttcttc aggttcaggt
     1861 gaatgaaaaa gccggtgtaa cttattccgc aggtctgaaa gcaattttgc gccatgaccc
     1921 cgatatgatt attttaggtg agatcagaga cgcggaaaca gctgaaattg cggtgcgggc
     1981 agcgatgacg ggacatctgg tactaacgag ccttcatacg agagacgcaa agggcgcaat
     2041 ttacagactg cttgaattcg gtatcaatat gaatgaaatc gaacagactg tcattgcaat
     2101 agcggctcag cgcttggttg atttggcttg cccgttttgt gaaaacggat gttcatcagt
     2161 gtattgccga cagtcacgaa atactaggag agctagcgtt tatgagcttc tatacgggaa
     2221 aaatcttcag caatgtatcc aggaggcaaa aggaaatcat gcaaattacc aatatcaaac
     2281 gcttcgtcaa attatcagaa aaggaattgc gctcggctat ttaacgacaa acaactatga
     2341 ccggtgggtt tatcatgaaa aagattagaa agtctggttg ttaaaggatc aagccaggtt
     2401 attaaagagg ctcggtgaaa tgactgcggg cggatataca cttctggatg gattacgcct
     2461 gatggaactt cagatgaata agaggcaggc ggctgacttg actgattcgg tcacttgttt
     2521 gagggaaggg gctccgtttt atcaagtact aaagagtttg tcatttcata aggaagccgt
     2581 aggtatttgt tattttgctg aaacacatgg tgaactgcct gcttcaatga tccagagcgg
     2641 agagctgctg gaacgaaaaa ttgcacaggc agaccagctg aaaagagtgc tgcgctatcc
     2701 gcttttcctc atctttacgg tcgctgtcat gttttatatg ttacagtcca tcatcattcc
     2761 tcagttttcc ggtatctatc aatcgatgaa tatggaaacc tcacgttcaa ccgatatgct
     2821 ttttgctttt tttcagcata ttgatcttgt gatcattttg cttgttcttt ttacagcagg
     2881 tatcgggatt tattattggc ttgtgtttaa gaaaaaatca cctgcccggc aaatgctgat
     2941 ttgtatcagg attcctttgg ttggaaagct tgtaaagctg tttaacagct actttttttc
     3001 tttgcagcta agcagccttt taaaatcagg cctctcaatt tatgacagcc ttaatgcatt
     3061 taaacatcaa acgtttctcc ctttctaccg ctgcgaggct gaacaattga ttgaacggct
     3121 aaaagccggt gagtcaattg aatccgctat ttgtggaagc cttttttatg aaactgattt
     3181 atcaaaagtc atatctcacg gccagctgag cggccgattg gatcgggagc ttttcacata
     3241 cagccaattc atattacagc ggctggaaca caaagcgcaa aaatggacag gcatccttca
     3301 gccaatgatt tatggatttg ttgcagcgat gatcttactt gtgtatttat ctatgcttgt
     3361 gcctatgtat cagatgatga atcaaatgtg aaaggaagag gctgatgaat gagaaaggat
     3421 ttacacttgt tgaaatgtta atcgtgctct ttattatttc gattttgctt ttaattacga
     3481 taccgaacgt cacgaaacat aatcaaacca ttcaaaaaaa gggctgtgaa ggcttacaaa
     3541 acatggttaa ggcacaaatg actgcatttg agcttgatca tgaaggacaa actccgagcc
     3601 ttgccgattt acagtcagag ggctatgtga aaaaggatgc tgtctgtcca aatggtaagc
     3661 gcattatcat caccggcgga gaagttaagg ttgaacatta aattaaacga ggagaagggg
     3721 tttacccttt tagaaagttt gcttgtgtta agccttgcct ctatcctcct ggtggccgtc
     3781 ttcactacac ttcctcctgc ttatgacaat acagctgtcc gacaggcagc aagtcagctg
     3841 aaaaatgata ttatgctcac acagcagact gctatttccc gtcaacaaag aacaaaaatt
     3901 ctctttcata aaaaagaata tcaattagtc attggtgata cggttattga acgtccgtat
     3961 gcaacgggac tttctataga actgctgaca ttaaaagacc gtttggaatt taatgagaaa
     4021 gggcacccga atgcaggcgg aaaaatacga gtaaaaggcc atgccgttta tgacataaca
     4081 gtttatctag ggagcgggag agtcaatgtg gagagaaaat aaaggttttt ctacaataga
     4141 aacaatgtct gcgctaagcc tgtggctgtt tgtgctgctg acagtcgtcc ccttgtggga
     4201 caagctgatg gctgatgaaa aaatggcgga atcacgagaa attggctatc agatgatgaa
     4261 tgagagcatt agcaaatatg tcatgagtgg tgaaggagcc gcgtcaaaaa cgattacaaa
     4321 gaacaatcat atctatgcaa tgaagtggga ggaggagggc gaatatcaaa acgtatgtat
     4381 caaagccgca gcttataaag aaaaatcatt ttgcctcagc attttgcaga cagaatggct
     4441 acacgcttct taacgtatta ttttcgctct cagtcttttt gctcatatca ggatcgttag
     4501 ctgcgattat ccatctgttt ttgtctcgac agcaggaaca tgacggtttc acacagcagg
     4561 aatggatgat ttcgatagaa cagatgatga atgaatgcaa ggaatcacag gcagttaaga
     4621 cagccgagca tgggagcgtg ttaatctgca ccaatctttc cggacaagac atccgttttg
     4681 acatttatca ttcaatgata agaaaaagag tggatggcaa agggcatgtt ccgattttag
     4741 atcatattac tgccatgaaa gctgatattg aaaatggtgt tgttttgctg aaaattgaga
     4801 gtgaagacca aaaagtgtat caaactgctt ttccagtcta ttcgtattta ggaggggggt
     4861 gaatgtatcg tacaagaggg tttatttatc cagctgttct ttttgtgtca gcgcttgtgc
     4921 tgttaatcgt gaactttgtt gctgctcaat atatttcacg ctgcatgttt gagaaggaaa
     4981 caaaagagtt atacatagga gagaatttgc ttcaaaatgg ggtgcttctt tcgattcggc
     5041 atgttctaga ggaacggaaa ggccaggagg gtacgcagca atttctatat ggacgggttt
     5101 cttattacat tcatgataca tcgataaaag aacaaaaaga aatcaactta agagtgtcaa
     5161 cggattcggg aacagaaaga actgcacaga tcgtgtttga ccaaaaacag aaaaaactgc
     5221 tgagatggac agaataaaac agtgtaaagg gtataaaaaa agtcatgtga gacaacactc
     5281 ataataattg aatgatgagg tgatcacgtg aaaacgaatg attatgttaa atatatgacg
     5341 cagcaatttg tcaaatatat agatactccg agagatgagc gaaaagaacg aaaagaggtg
     5401 cggaaagaaa caaaaacgcc tgtttcccag cagtggttcg gtattttacc ctatggcttc
     5461 cgactttggc tgaaacggaa aaaataaccg caaataaacg aataaggtcc ttcaaaaaat
     5521 ggaggacctt attgatattc ttctaatatg gcaattttat tgaccttttg gctataagga
     5581 tcaaatgaaa tcgtcacaaa aacgccgaat tcttttgacc cttccctcag agttaaatgg
     5641 tattgcttca ctgcttcatc ttttctttta cggtcccata ctttttgttt gaacagtacc
     5701 tgtgcgagcg ggtacctttt ttttgcttct tttacagcaa tctcttccca tttggacatg
     5761 tggcgggcgg ttacaagcgg tgtttcttct gcgtgagcgg ctgtggtgcc aaagacgaga
     5821 agagatagac aaatcacaca ttgtttgatc atcatgctgt cacctttctt tgtttattat
     5881 taccaaataa taatgggata tgcatttaac ttctcacata acaatcccaa aaatttctaa
     5941 aaaattgaaa aaatgagcaa tactgagcaa gactttgtaa tatgatgaaa acattctttt
     6001 aaacgaacaa aatgagcgat ttcggtgttt ttaaatctat aaatcgttga ttatactcta
     6061 tttgtgaagt tctttaaaga gaacgattgt catatcaagt tacagtgttt tacaggaggt
     6121 aagatatgtt tcgattgttt cacaatcagc aaaaggcgaa gacgaaactg aaagttctgc
     6181 ttatctttca gctttcagtc attttcagtc tgactgccgc aatatgctta caattttcga
     6241 tgatacaagc gctgcttttc atgatattga aacatttg
//
LOCUS       CELTC1A       234 bp ds-DNA             INV       10-AUG-1990
DEFINITION  C.elegans Tc1(1664) element with a DNA insert.
ACCESSION   M22301 M29977
KEYWORDS    Tc1 element; Tc1 repetitive element.
SOURCE      C.elegans (var. Bristol, strain N2) DNA, clone pCeh45.
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 234)
  AUTHORS   Harris,L.J. and Rose,A.M.
  TITLE     Structural analysis of Tc1 elements in Caenorhabditis elegans var.
            Bristol (strain N2)
  JOURNAL   Plasmid 22, 10-21 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            L.J.Harris, 18-JAN-1989.
FEATURES       from  to/span     description
    rpt     <     1  >   234     Tc1(1664) insertion element
    rpt          85       94     direct repeat copy A
    rpt         107      116     direct repeat copy B
    rpt         128      137     direct repeat copy C
    recomb       87       88     Tc1(1664) DNA end/inserted DNA start
    recomb      141      142     inserted DNA end/Tc1(1664) DNA start
BASE COUNT       67 a     36 c     32 g     99 t
ORIGIN      3 bp upstream of EcoRV site.
        1 gatatccact tttggttttt tgtgtgtaac ttttttctca agcatccatt tgacttgaat
       61 ttttccgtgt gcataaagcg aaatgtttcg caaattaagc aaataaagtt tcgcaaagaa
      121 ttaaattgtt tcgcaaacga aacgcaaatt tgcggaccaa acattacatg attatcgatt
      181 ttttctgaat tttatttcaa ttttttgatt ttttcgtttt tccaattttc atta
//
LOCUS       HUMDEF1A      448 bp ss-mRNA            PRI       10-AUG-1990
DEFINITION  Human neutrophil peptide (defensin) 1 mRNA, complete cds.
ACCESSION   M21130 M22160
KEYWORDS    cytotoxic protein; defensin 1; microcidal protein;
            neutrophil peptide.
SOURCE      Human leukocyte (myeloid cell line HL-60), cDNA to mRNA, clones
            HNP1a and HNP1b.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 448)
  AUTHORS   Daher,K.A., Lehrer,R.I., Ganz,T. and Kronenberg,M.
  TITLE     Isolation and characterization of human defensin cDNA clones
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 7327-7331 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            K.Daher, 25-NOV-1988.
            
            Submission form for [1] kindly provided by M.Kronenberg,
            01-JAN-1989.
FEATURES       from  to/span     description
    pept         51      335     neutrophil peptide 1 precursor /nomgen="DEF1"
                                 /map="8p23" /hgml_locus_uid="LR0109A"
    sigp         51      107     neutrophil peptide 1 signal peptide
    matp        243      332     neutrophil peptide 1
    mRNA    <     1      448     neutrophil peptide 1 mRNA
BASE COUNT      106 a    127 c    110 g    105 t
ORIGIN      Unreported.
        1 tctgccctct ctggtcaccc tgcctagcta gaggatctgt gaccccagcc atgaggaccc
       61 tcgccatcct tgctgccatt ctcctggtgg ccctgcaggc ccaggctgag ccactccagg
      121 caagagctga tgaggttgct gcagccccgg agcagattgc agcggacatc ccagaagtgg
      181 ttgtttccct tgcatgggac gaaagcttgg ctccaaagca tccaggctca aggaaaaaca
      241 tggcctgcta ttgcagaata ccagcgtgca ttgcaggaga acgtcgctat ggaacctgca
      301 tctaccaggg aagactctgg gcattctgct gctgagcttg cagaaaaaga aaaatgagct
      361 caaaatttgc tttgagagct acagggaatt gctattactc ctgtaccttc tgctcaattt
      421 cctttcctca tcccaaataa atgccttg
//
LOCUS       MUSMIPAA      753 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Mouse macrophage inflammatory protein (MIP)-1a mRNA, complete cds.
ACCESSION   X12531 M19382
KEYWORDS    lymphokine; macrophage inflammatory protein.
SOURCE      Mouse cell line RAW264.7., cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 753)
  AUTHORS   Davatelis,G., Tekamp-Olson,P., Wolpe,S.D., Hermsen,K., Luedke,C.,
            Gallegos,C., Coit,D., Merryweather,J. and Cerami,A.
  TITLE     Cloning and characterization of a cDNA for murine macrophage
            inflammatory protein (MIP), a novel monokine with inflammatory and
            chemokinetic properties
  JOURNAL   J. Exp. Med. 167, 1939-1944 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by G.Davatelis, 27-MAY-1988.
FEATURES       from  to/span     description
    pept         71      349     macrophage inflammatory protein precursor
    sigp         71      139     macrophage inflammatory protein signal peptide
    matp        140      346     macrophage inflammatory protein
    mRNA    <     1      753     MIP mRNA
    signal      753      753     polyA signal
BASE COUNT      190 a    203 c    143 g    217 t
ORIGIN      
        1 aagcagcagc gagtaccagt cccttttctg ttctgctgac aagctcaccc tctgtcacct
       61 gctcaacatc atgaaggtct ccaccactgc ccttgctgtt cttctctgta ccatgacact
      121 ctgcaaccaa gtcttctcag cgccatatgg agctgacacc ccgactgcct gctgcttctc
      181 ctacagccgg aagattccac gccaattcat cgttgagtat tttgaaacca gcagcctttg
      241 ctcccagcca ggtgtcattt tcctgactaa gagaaaccgg cagatctgcg ctgactccaa
      301 agagacctgg gtccaagaat acatcactga cctggaactg attgcctgag agtcttggag
      361 gcagcgagga accccccaaa cctccatggg tcccgtgtag agcaggggct tgagccgaac
      421 attcctgcca cctgcatagc tccatctcct ataagctgtt tgctgccaag tagccacatc
      481 gagggactct tcacttgaaa ttttatttaa tttaatccta ttggtttaat actatttaat
      541 tttgtaattt attttattgt catacttgta tttgtgacta tttattctga aagacttcag
      601 gacacgttcc tcaaccccca tctccctccc agttgttcac actgtttggt gacagctatt
      661 ctaggtagac atgatgacaa agtcatgaac tgacaatgta caatagatgc tttgtttata
      721 ccagagaagt aataaatatg ccctttaaca agt
//
LOCUS       ONGSSDNA      154 bp ds-DNA             INV       10-AUG-1990
DEFINITION  O.volvulus species-specific DNA.
ACCESSION   J03990 M22037
KEYWORDS    .
SOURCE      O.volvulus (strain Liberian) DNA, clone C1A1-2.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 154)
  AUTHORS   Harnett,W., Chambers,A.E., Renz,A. and Parkhouse,R.M.E.
  TITLE     An oligonucleotide probe specific for Onchocerca volvulus
  JOURNAL   Mol. Biochem. Parasitol. 35, 119-126 (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            W.Harnett, 18-JAN-1989.
BASE COUNT       50 a     23 c     27 g     54 t
ORIGIN      
        1 gaattcaaaa aaatatagtc gaatattttt cttaggaccc agtttgaagg tacgtacccg
       61 ttttttgaaa ttatatgtca taggtcatct gttaataatc aattttgcaa aatgcgtttt
      121 tcgccgaaaa aatcgccgtg taaatgtgga attc
//
LOCUS       RHMDCTAB     2300 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  R.meliloti C-4-dicarboxylate carrier protein (dctA) gene, complete
            cds, and dctB protein gene, partial cds.
ACCESSION   M26399 M29729
KEYWORDS    C4-dicarboxylate carrier protein; membrane protein;
            transport protein.
SOURCE      R.meliloti (strain 2011) DNA, clone pRmSC121.
  ORGANISM  Rhizobium meliloti
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 2300)
  AUTHORS   Engelke,T., Jording,D., Kapp,D. and Puehler,A.
  TITLE     Identification and sequence analysis of the Rhizobium meliloti dctA
            gene encoding the C4-dicarboxylate carrier
  JOURNAL   J. Bacteriol. 171, 5551-5560 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Jording, 13-OCT-1989.
FEATURES       from  to/span     description
    pept        750     2111     C4-dicarboxylate carrier A (put.)
    pept        786     2111     C4-dicarboxylate carrier A' (put.)
    pept        555  <     1 (c) dctB protein
    signal      582      597     activator sequence
    signal      673      689     5' promoter region
BASE COUNT      388 a    758 c    707 g    447 t
ORIGIN      
        1 gtcgacgcgc cgggaaatat aaagcccggg ccgattgctg accgtgccca tggcgaaatg
       61 ttcggccatg ccgtcgcgga cggcgagccg gaaataatcg cggaaggcat agtcgttgcc
      121 gacaaagctc gtcggctcct gccagttgct ggccgcgacg gcgacgccgc tccggtcgat
      181 cagataaatg accgcggctt cggcgcttgt cgccagggcc tcgagcttac ggttgatgcg
      241 gtcgagcgac ggcctgtccg gcgaaagcaa tgcgccacga atggctgcgt cgtcggcgag
      301 gacaagcggc agggcgcgct gccgttccac gactgctcga agaagcgagg ctttcaggct
      361 ggcgtcgatc cggctctgac cggcaaggcc ggcgagcgcc tgcgaccggc cgtagtctcg
      421 cgcaagaaga aggccggctg ccagaaggac cagcgcgact gccgcgaaaa cgagccagga
      481 ccgtcgggcc cggctgcgaa gcgcatgcgg atcgctcgct tctgcaggaa gtttgaccat
      541 gcgaacatgg tgcatgtttt cgcccaggac gccagcactt ctgtgcggaa atccgcacat
      601 atccacgaac ggcaagcgag caccgctccc aaaaatgtca tggattcaat cgcataggcc
      661 ttccgcggca aactggcacg catgttgctg acaagctcca caaggcagcc actgctgtcg
      721 atcttcggaa agcggcccgg gaggccggca tgttgccgga ctgggcctgc cacgtggagg
      781 atatcatgat catcgaacat tccgcggagg tccgcggcaa gacacccctt taccgccatc
      841 tctatgttca ggtgctggcg gcgatcgccg cgggcatcct gctcgggcat ttctatcctg
      901 atatcggcac ggagctcaaa ccgcttggcg acgccttcat caggctcgtg aagatgatca
      961 tcgcgccggt gatcttcctg acggtcgcga ccgggattgc cggcatgacc gatctcgcca
     1021 aggtcggccg cgtcgccggc aaggcgatga tctactttct cgccttctcc accctcgcgc
     1081 ttgtcgtcgg cctcgtcgtc gcaaacgtgg tgcagccggg cgcgggcatg catatcgatc
     1141 cggcctcgct ggacgccaag gcggtcgcga cctatgccga gaaagcgcat gagcagtcga
     1201 tcaccggctt cctgatgaac atcatcccga cgacgctcgt cggcgccttc gccgaaggcg
     1261 acatcctgca ggtcctgttc atctcggtgc tcttcggtat ctcgctggcg atcgtcggca
     1321 agaaagccga gcccgtggtc gatttcctgc aggcgctgac gctgccgatc ttccggctcg
     1381 ttgcgatcct gatgaaggcc gccccgatcg gcgccttcgg cgccatggcc ttcaccatcg
     1441 gcaagtacgg tatcgcctcg atcgccaatc tcgccatgct gatcggcacc ttctatctga
     1501 cgtcgtttct cttcgtcttc atcgttctcg gcgcggtcgc acgctataac ggcttctcga
     1561 tcctctcgct catccgctac atcaaggagg agctgctgct ggtgctcggg acgtcctctt
     1621 cggaggcggc tcttccgggc ctcatgaaca agatggagaa ggccggctgc aagcgctcgg
     1681 tcgtcggtct cgtcattccg accggttatt ccttcaacct ggacggcacc aatatctaca
     1741 tgacccttgc ggccctgttc atcgcccagg cgaccgatac gccgctctca tacggcgacc
     1801 agattctgct gctcctcgtc gcaatgctga gctcgaaggg tgcggccggc attaccggcg
     1861 ccggcttcat cacgcttgcc gcaacgctct cggtcgttcc ctccgtgccg gtcgccggca
     1921 tggcgctgat cctcggcatc gaccgcttca tgtcggaatg ccgcgccctg accaatttcg
     1981 tcggcaacgc ggttgcgacg atcgtggtgg cgaagtggga gggcgagctc gatcaggcgc
     2041 agctttccgc agctctcggc ggcgaggcgt ccgtcgaggc catcccggcg gtcgtccagc
     2101 ccgccgaata agcttgcctc ccagggcgag accgcacgct ggtgcgcttg gcgcggtccg
     2161 gttctccggg ccgcgctttt ttatcgccgg ccaggcctga gggggtgggc actcggcaca
     2221 agatcagatg ccgtgcagca cctgcgtcgc cttcactgcc gccgaggccc ggttttcgac
     2281 gccgagcttc acatagatct
//
LOCUS       HUMMYOHP     6016 bp ss-mRNA            PRI       10-AUG-1990
DEFINITION  Human perinatal myosin heavy chain mRNA, complete cds.
ACCESSION   M36769 M35250 Y00821
KEYWORDS    myosin; myosin heavy chain.
SOURCE      Human fetal muscle, cDNA to mRNA, clone pSMHCP.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 2651 to 6016)
  AUTHORS   Leinwand,L.A.
  TITLE     ;
  JOURNAL   Unpublished (1989) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 2651 to 6016)
  AUTHORS   Feghali,R. and Leinwand,L.A.
  TITLE     Molecular genetic characterization of a developmentally regulated
            human perinatal myosin heavy chain
  JOURNAL   J. Cell Biol. 108, 1791-1797 (1989)
  STANDARD  simple automatic
REFERENCE   3  (bases 1 to 2650)
  AUTHORS   Karsch-Mizrachi,I., Feghali,R., Shows,T.B. and Leinwand,L.A.
  TITLE     Generation of a full-length human perinatal myosin
            heavy-chain-encoding cDNA
  JOURNAL   Gene 89, 289-294 (1990)
  STANDARD  full staff_review
COMMENT     Author address: L.A.Leinwand Albert Einstein College of Medicine
            1300 Morris Park Avenue Bronx, N.Y. 10461, USA.
            
            Submitted (12-APR-1989) on tape to the EMBL data library. Draft
            entry and computer-readable sequence for [3] kindly submitted by
            I.Karsch-Mizrachi, 18-JUN-1990.
FEATURES       from  to/span     description
    pept         74     5887     perinatal myosin heavy chain
    mRNA    <     1     6016     perinatal myosin heavy chain mRNA
BASE COUNT     1953 a   1300 c   1586 g   1177 t
ORIGIN      
        1 gtggaacact tctgaacctg catttttatc tggaactcca gaagcagaat cctttgctaa
       61 ataaatcgca gccatgagtg cgagctcaga cgctgagatg gctgtttttg gcgaagctgc
      121 tccctacctt cgaaaatcag aaaaggagcg gattgaggcc caaaacaagc cgtttgatgc
      181 taaaacatct gtctttgtgg cggagcccaa ggaatcctat gtgaagagca ctatacaaag
      241 caaagaagga gggaaagtaa ccgtaaagac tgaaggtgga gcaactctaa ctgtcaggga
      301 agaccaagtc ttccctatga accctccgaa atatgacaaa attgaggaca tggccatgat
      361 gactcatcta cacgagcctg gagtgctgta caacctcaaa gagcgctatg cagcctggat
      421 gatctacacc tactcaggcc tcttctgtgt caccgtcaac ccctacaagt ggctgccggt
      481 gtacaagccc gaggtggtgg ctgcctacag aggcaaaaag cgccaggagg ccccgcccca
      541 catcttctcc atctctgaca atgcctatca gttcatgttg actgatcgag agaatcagtc
      601 catcctgatc accggagaat ctggtgccgg aaagactgtg aacaccaagc gtgtcatcca
      661 atactttgca acaattgcag ttactggaga gaagaagaag gatgaatctg gcaaaatgca
      721 ggggactctg gaagatcaaa tcatcagcgc caatccccta ctggaggcct ttggcaatgc
      781 caaaaccgtg aggaatgaca actcctctcg ctttggtaaa ttcattagaa tccactttgg
      841 tactacaggg aagctggcat ctgctgatat agaaacatat cttttagaaa agtccagagt
      901 tactttccag ctaaaggcgg aaagaagcta ccatattttt tatcagatca cttccaataa
      961 gaagccagat ctaattgaaa tgctcctgat caccaccaac ccatatgact atgccttcgt
     1021 cagtcagggg gagatcacag ttcccagtat tgatgaccaa gaagagttga tggccactga
     1081 tagtgccatt gacatcctgg gcttcactcc tgaagagaaa gtgtccatct ataaactcac
     1141 aggggctgtg atgcattatg ggaacatgaa attcaagcaa aagcagcgtg aggagcaagc
     1201 tgagccagat ggcacagaag tcgctgacaa ggcagcctat ctccagagtc tgaactctgc
     1261 agacctactc aaagccctct gctaccctag ggtcaaggtt ggcaatgagt atgtcaccaa
     1321 aggccagact gtgcagcagg tgtacaatgc cgtgggtgct ctggccaaag ccgtctacga
     1381 gaagatgttc ctgtggatgg tcacccgcat caaccagcag ctggacacca agcagcccag
     1441 gcagtacttc atcggggtct tggacattgc tggctttgaa atctttgatt ttaacagcct
     1501 ggagcagctg tgcatcaact tcaccaacga gaaactgcaa cagtttttca accaccacat
     1561 gtttgtgcta gagcaggagg agtacaagaa ggaaggcatc gagtggacgt tcattgactt
     1621 tgggatggac ctggctgcct gcattgagct cattgagaag ccactgggca tcttctccat
     1681 cctggaagag gagtgcatgt tccctaaggc cacggacacc tccttcaaga acaagctgta
     1741 tgaccagcac ctgggcaagt ctgccaactt ccagaagccc aaggtggtca aaggcaaggc
     1801 tgaggcccac ttctctctga ttcactatgc tggcactgtg gactacaaca ttactggctg
     1861 gctggacaaa aataaggacc ccctgaatga tactgtggtt gggctgtacc agaagtctgc
     1921 aatgaagact ctagccagtc tcttttccac gtatgctagt gctgaagcag atagcagcgc
     1981 gaagaaaggt gctaagaaaa agggctcttc tttccagact gtgtctgccc ttttcaggga
     2041 aaatttaaat aaattgatga cgaatctgag gagcacacac cctcacttcg tacggtgtat
     2101 cattcccaat gaaaccaaaa ctcctggggc aatggaacat gaacttgtgt tgcaccagct
     2161 gaggtgtaat ggtgtgctgg aaggcatccg catctgtagg aaaggattcc caagcagaat
     2221 cttatatggt gatttcaaac aaagatacaa ggttttaaat gcaagtgcta ttccagaggg
     2281 acagttcatt gacagcaaga aggcttctga gaaacttctt gcatctattg atattgatca
     2341 tactcaatat aaatttggac ataccaaggt tttcttcaaa gctggacttc tgggtcttct
     2401 ggaagaaatg agagatgaaa aattagccca aattataaca agaacacaag ctgtctgtag
     2461 gggattccta atgagggtag aatatcagaa gatgttgcaa aggagagaag cacttttctg
     2521 catccagtat aatgtccgtg ccttcatgaa cgtcaagcac tggccctgga tgaaactctt
     2581 tttcaagatt aagcccctcc tcaagagtgc agagaccgag aaagagatgg ccaccatgaa
     2641 ggaagaattc cagaaaacca aagatgaact cgccaagtca gaggcaaaac ggaaggagct
     2701 agaggaaaaa atggtcactc tcttaaaaga gaaaaatgac ctgcaactcc aggttcaatc
     2761 tgaagcagat agcttggctg atgcagagga aaggtgtgag caactgatta aaaacaaaat
     2821 ccaacttgag gccaaaatca aagaggtgac tgaaagagct gaggaggagg aagagatcaa
     2881 tgctgagctg acagccaaga agagaaaact ggaggatgaa tgttcagaac tcaagaaaga
     2941 cattgatgac cttgagctga cactggccaa ggttgagaag cagaaacatg ccacggagaa
     3001 caaggtgaaa aatcttacag aagagatggc aggcctggat gaaaccattg caaaactgtc
     3061 caaggagaag aaggctctcc aagagaccca ccagcagacc ctggatgacc tgcaggcaga
     3121 ggaggacaaa gtcaacatcc tgaccaaagc taaaaccaag ctagaacagc aagtggatga
     3181 tcttgaaggg tctctggaac aagaaaagaa gcttcgaatg gatctagaaa gagcaaagcg
     3241 gaaactggag ggtgacctca aattggccca agaatccaca atggatatgg aaaatgacaa
     3301 acagcaactt gatgaaaagc ttgaaaagaa agaatttgaa atcagcaatt tgataagcaa
     3361 aattgaagat gagcaagctg tagaaattca actacagaag aagatcaaag agttgcaggc
     3421 ccgcattgag gagctggggg aagaaatcga ggcagagagg gcgtcccgag ccaaagcgga
     3481 gaagcagcgc tctgacctct cccgggaact ggaggagatc agcgagaggc tggaagaagc
     3541 cggtggggca acttctgctc aggtggaatt gaacaagaag cgggaggctg agtttcagaa
     3601 actgcgcagg gacctggagg aggccaccct gcagcatgaa gctatggtgg ctgctcttcg
     3661 gaagaagcac gcagacagta tggctgagct tggggagcag attgacaact tgcagcgggt
     3721 caaacagaag ctggagaagg agaagagtga gctgaagatg gagactgatg acctcagcag
     3781 taacgcagag gccatttcca aagccaaggg ccaccttgaa aagatgtgcc gctctctaga
     3841 agatcaagtg agtgggctta agaccaagga agaggagcag cagcggctga tcaatgacct
     3901 cacagcacag agagcgcgcc tgcagacaga agcgggtgaa tattctcgac aattagatga
     3961 gcaagatgct ttagtctctc agctttcaag gagcaagcaa gcatctactc agcagattga
     4021 agagctgaaa catcaactag aggaagaaac taaagccaag aacgccctgg cacacgccct
     4081 gcagtcctcc cgccatgact gcgacctgct gcgggaacag tatgaggaag agcaggaagg
     4141 caaagctgag ctgcagaggg cgctgtccaa ggccaacagt gaggttgccc agtggagaac
     4201 caaatacgag acggatgcca tccagcgcac agaggagctg gaggaggcca agaaaaagtt
     4261 ggcccagcgc ctgcaagaag ctgaggaaca tgtagaagct gtgaacgcca aatgtgcttc
     4321 ccttgagaag acgaagcagc ggctccagaa tgaagttgaa gacctcatgc ttgatgtgga
     4381 aaggtctaat gcagcctgtg cagcccttga taagaagcaa aggaactttg acaaggtcct
     4441 atcagaatgg aagcagaagt atgaggaaac tcaggctgaa cttgaggcct cccagaagga
     4501 gtcacgttct cttagcactg agctgttcaa ggtgaagaat gtctatgagg aatccctgga
     4561 tcaactcgaa acgctaagaa gagcacataa gaacttgcaa caggagattt ctgacctcac
     4621 tgagcagatt gcagagggag gaaagcaaat tcatgaattg gagaaaataa agaagcaagt
     4681 agaacaagag aaatgtgaaa ttcaggctgc tttagaggaa gcagaggcat ctcttgaaca
     4741 tgaagaagga aagattctgc gtatccagct tgagttaaac caagtcaagt ctgaagttga
     4801 tagaaaaatc gcagaaaagg atgaggaaat tgaccagctg aagagaaacc acactagagt
     4861 cgtggagaca atgcagagca cgctggatgc agagattaga agcagaaatg atgctctgag
     4921 agtcaagaag aaaatggaag gagatctgaa tgaaatggaa atccagctga accatgccaa
     4981 tcgcttagct gcagagagtt taaggaacta caggaacacc caaggaatcc tgaaggaaac
     5041 ccagctccac ctggatgatg ctctccgggg ccaggaggac ctcaaggaac agctggcaat
     5101 tgtggagcgc agagccaacc tgctgcaggc tgagatcgag gagctgtggg ccactctgga
     5161 acagacagag agaagcagga aaatcgccga acaggagctc ctggatgcca gtgagcgtgt
     5221 ccagctcctc cacacccaga ataccagtct cattaacacc aagaagaaat tagaaaatga
     5281 cgtttcccaa ctccaaagtg aagtggaaga agtaatccaa gaatcacgca atgcagaaga
     5341 gaaagccaag aaggccatca ctgatgctgc catgatggct gaggagctga agaaggaaca
     5401 ggacaccagc gcccacctgg agcggatgaa gaagaacctg gagcagacgg tgaaggacct
     5461 gcagcatcgt ctagatgagg ccgagcagct ggcgctgaag ggtgggaaga agcagatcca
     5521 gaaactggag gccagggtac gtgagcttga aggagaggtt gaaaatgaac agaaacgtaa
     5581 tgcagaggct gttaaaggtt tacggaaaca tgaccgacga gtaaaagaac tcacctacca
     5641 gactgaagaa gatcgcaaga atgttctcag gctgcaggac ttggtagata aattacaggc
     5701 gaaggtgaaa tcatacaaga gacaagctga ggaggctgag gaacaatcca atgctaatct
     5761 atctaaattc cgcaaactcc agcatgagct ggaggaggcc gaggaacggg ctgacattgc
     5821 tgagtcccag gtcaacaaat tgcgagtgaa gagccgagag gttcacacaa aaatcagtgc
     5881 agagtaaaca cacctgcctg atgctatcaa gaggctgaag aaaggcacaa aatgtgctat
     5941 ttttggtcac ttgctttatg acgtttattt tcctgttaaa gctgaataaa taaaaactac
     6001 agtaaatgta tacatt
//
LOCUS       CDCMANABD    2155 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  C.saccharolyticum beta-D-mannanase (manA) gene, complete cds.
ACCESSION   M36063
KEYWORDS    beta-D-mannanase.
SOURCE      C.saccharolyticum DNA, clone lambda-NZP2.
  ORGANISM  Caldocellum saccharolyticum
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 2155)
  AUTHORS   Leuthi,E., Bhana Jasmat,N., Grayling,R.A., Love,D.R. and
            Bergquist,P.L.
  TITLE     Cloning, sequence analysis and expression in Escherichia coli of a
            gene coding for a beta-mannanase from the extremely thermophilic
            bacteria Caldocellum saccharolyticum
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Luethi, 03-JUL-1990.
            
              Author address: E.Luethi
              University of Auckland
              Department of Cellular and Molecular Biology
              Private Bag
              Auckland
              NEW ZEALAND
              email:SYSTEM@CELVAX.AUKUNI.AC.NZ
            
FEATURES       from  to/span     description
    pept        841     1881     beta-D-mannanase precursor (E.C. 3.2.11.78)
    sigp        841      963     beta-D-mannanase signal peptide
    matp        964     1878     beta-D-mannanase
    pept    <     1      685     ORF1 (AA at 2)
    binding     818      830     manA ribosome binding site
BASE COUNT      721 a    334 c    590 g    510 t
ORIGIN      
        1 gggtagcaat acatggtttg gattccaggc atggtcgatg cagagggtag cggagtatta
       61 ctatgtgaca ggagataaag atgcaggggc actgcttgag aagtgggtaa gctggataaa
      121 gagtgtagtg aagttgaata gtgatggtac atttgcgata ccatcgacgc ttgattggag
      181 tgggcagcca gacacatgga atgggacata tacaggtaat ccgaacttgc atgtgaaggt
      241 agtagattat gggacggatt taggaataac ggcatcactt gcgaatgcac tactttatta
      301 cagtgcaggg acgaagaagt atggggtatt tgatgaggaa gcgaagaatt tagcgaagga
      361 attgctggac aggatgtgga agttatacag ggatgagaaa ggtttatcgg cgccagagaa
      421 gagagcggac tacaagaggt tctttgagca agaggtatac attccggcag gatggacagg
      481 gaagatgccg aatggagatg taataaagag cggagttaag tttatagaca taaggagcaa
      541 gtacaaacaa gatcctgatt ggccgaagtt agaggcggca tacaagtcag ggcaggtacc
      601 ggagttcaga tatcacaggt tctgggcaca gtgtgacata gcaattgtta atgcaacata
      661 tgaaattctg ttcggtaatc aataatgagt aggtaaatgg aaatttagcg gggtggcaca
      721 tctataagtt tggtgtgctg cctcgctaaa atcctgtatg gaagtgttcg aaaaatagta
      781 caaaaaaatg gcgaggtaaa atggcaaaaa tgcataaaaa agagggggtt aggttttaag
      841 atgagactaa aaacaaaaat aagaaagaaa tggttgagtg tactatgtac agttgttttt
      901 ttattgaaca ttttgtttat agcaaatgta acgattttac ccaaagttgg tgcggctaca
      961 tctaatgatg gagtagtgaa gatagatacg agtacgttaa taggaaccaa tcacgcacat
     1021 tgctggtaca gagatagact tgatacggca ttgcgtggaa ttaggtcatg gggtatgaac
     1081 tctgtgaggg tagtgctgag taatggttac cgatggacga agataccagc aagtgaagtg
     1141 gcaaatatta tatcattgtc aagaagtctt ggtttcaaag ctattatatt agaagtgcac
     1201 gacacaacag gatatggaga agatggggca gcatgttcat tggcacaagc agtggaatat
     1261 tggaaggaga taaagagcgt attagacggt aacgaagatt ttgtaattat aaacattggt
     1321 aatgagccgt atgggaacaa taactatcaa aactgggtta atgacacgaa gaacgctata
     1381 aaagcactta gagatgcagg attcaagcac acgataatgg tggatgcgcc gaactggggt
     1441 caggattggt ctaatactat gagagataat gcccagagca taatggaagc agatccgctg
     1501 cgcaatttgg tattttcgat tcatatgtat ggcgtataca atacagcaag caaggtcgaa
     1561 gagtacatca aatcatttgt tgataagggg ttaccattgg ttattggaga atttggacat
     1621 cagcacacag atggtgaccc tgatgaagaa gctattgtca ggtatgcaaa acagtacaag
     1681 ataggattat ttagttggtc gtggtgtgga aattcgagct atgttgggta tttggacatg
     1741 gtaaacaact gggaccccaa taatccaact ccatggggac aatggtataa aactaatgca
     1801 attggtacat cttctacacc aacacctacg tcgacagtaa cgccaacgcc acccccacgc
     1861 caacaccaac accgacagta acagcgacac caacaccaac acctacacct gttagcacac
     1921 ctgcgacaag tgggcagata aaggtactgt atgctaacaa ggagacaaac agcactacaa
     1981 acacgataag gccgtggttg aaggtagtga atagtggtag cagtagcata gatttgagca
     2041 gggtaacgat aaggtactgg tacacggtag atggtgagag ggcacagact ggatatcaga
     2101 ctgggcacag ataggagcaa gcaatgtcaa cattcaagtt tgtgaagctg agagt
//
LOCUS       MUSMLHRCA    1431 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Mouse lymph node homing receptor mRNA, complete cds.
ACCESSION   M36005
KEYWORDS    lymph node homing receptor.
SOURCE      Mouse cell line EL-4/MEL-14hi, cDNA to mRNA, clone Ly-22+(C3H).
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1431)
  AUTHORS   Siegelman,M.H., Cheng,I.C., Weissman,I.L. and Wakeland,E.K.
  TITLE     The mouse lymph node homing receptor is identical with the
            lymphocyte cell surface marker Ly-22: Role of the EGF domain in
            endothelial binding
  JOURNAL   Cell 61, 611-622 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         52     1170     lymph node homing receptor precursor
    sigp         52      165     lymph node homing receptor signal peptide
    matp        166     1167     lymph node homing receptor
BASE COUNT      412 a    323 c    362 g    334 t      0 others
ORIGIN      Chromosome 1.
        1 ggtggaggag gctgaggctg cagagagact tgcagagaga cccagcaagc catggtgttt
       61 ccatggagat gtgagggtac ttactggggc tcgaggaaca tcctgaagct gtgggtctgg
      121 acactgctct gttgtgactt cctgatacac catggaactc actgttggac ttaccattat
      181 tctgaaaagc ccatgaactg ggaaaatgct agaaagttct gcaagcaaaa ttacacagat
      241 ttagtcgcca tacaaaacaa gagagaaatt gagtatttag agaatacatt gcccaaaagc
      301 ccttattact actggatagg aatcaggaaa attgggaaaa tgtggacatg ggtgggaacc
      361 aacaaaactc tcactaaaga agcagagaac tggggtgctg gggagcccaa caacaagaag
      421 tccaaggagg actgtgtgga gatctatatc aagagggaac gagactctgg gaaatggaac
      481 gatgacgcct gtcacaaacg aaaggcagct ctctgctaca cagcctcttg ccagccaggg
      541 tcttgcaatg gccgtggaga atgtgtggaa actatcaaca atcacacgtg catctgtgat
      601 gcagggtatt acgggcccca gtgtcagtat gtggtccagt gtgagccttt ggaggcccct
      661 gagttgggta ccatggactg catccacccc ttgggaaact tcagcttcca gtccaagtgt
      721 gctttcaact gttctgaggg aagagagcta cttgggactg cagaaacaca gtgtggagca
      781 tctggaaact ggtcatctcc agagccaatc tgccaagtgg tccagtgtga gcctttggag
      841 gcccctgagt tgggtaccat ggactgcatc caccccttgg gaaacttcag cttccagtcc
      901 aagtgtgctt tcaactgttc tgagggaaga gagctacttg ggactgcaga aacacagtgt
      961 ggagcatctg gaaactggtc atctccagag ccaatctgcc aagagacaaa cagaagtttc
     1021 tcaaagatca aagaaggtga ctacaacccc ctcttcattc ctgtagccgt catggtcacc
     1081 gcattctcgg ggctggcatt tctcatttgg ctggcaaggc ggttaaaaaa aggcaagaaa
     1141 tctcaagaaa ggatggatga tccatactga ttcatccttt gtgaaaggaa agccatgaag
     1201 tgctaaagac aaaacattgg aaaataacgt caagtcctcc cgtgaagatt ttacacgcag
     1261 gcatctccca cattagagat gcagtgtttg ctcaacgaat ctggaaggat ttcttcatga
     1321 ccaacagctc ctcctaattt cccctcgctc attcatccca ttaaccctat cccataatgt
     1381 gtgtctatac agagtagtat tttatcatct tttctgtgga ggaacaagca a
//
LOCUS       MUSMLHRCB    1431 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Mouse lymph node homing receptor mRNA, complete cds.
ACCESSION   M36058
KEYWORDS    lymph node homing receptor.
SOURCE      Mouse cell line EL-4/MEL-14hi, cDNA to mRNA, clone Ly-22-(SJL).
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1431)
  AUTHORS   Siegelman,M.H., Cheng,I.C., Weissman,I.L. and Wakeland,E.K.
  TITLE     The mouse lymph node homing receptor is identical with the
            lymphocyte cell surface marker Ly-22: Role of the EGF domain in
            endothelial binding
  JOURNAL   Cell 61, 611-622 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         52     1170     lymph node homing receptor
    sigp         52      165     lymph node homing receptor signal peptide
    matp        166     1167     lymph node homing receptor
BASE COUNT      412 a    322 c    362 g    335 t      0 others
ORIGIN      Chromosome 1.
        1 ggtggaggag gctgaggctg cagagagact tgcagagaga cccagcaagc catggtgttt
       61 ccatggagat gtgagggtac ttactggggc tcgaggaaca tcctgaagct gtgggtctgg
      121 acactgctct gttgtgactt cctgatacac catggaactc actgttggac ttaccattat
      181 tctgaaaagc ccatgaactg ggaaaatgct agaaagttct gcaagcaaaa ttacacagat
      241 ttagtcgcca tacaaaacaa gagagaaatt gagtatttag agaatacatt gcccaaaagc
      301 ccttattact actggatagg aatcaggaaa attgggaaaa tgtggacatg ggtgggaacc
      361 aacaaaactc tcactaaaga agcagagaac tggggtgctg gggagcccaa caacaagaag
      421 tccaaggagg actgtgtgga gatctatatc aagagggaac gagactctgg gaaatggaac
      481 gatgacgcct gtcacaaacg aaaggcagct ctctgctaca cagcctcttg ccagccaggg
      541 tcttgcaatg gccgtggaga atgtgtggaa actatcaaca atcacacgtg catctgtgat
      601 gcagggtatt acgggcccca gtgtcagtat gtggtccagt gtgagccttt ggaggcccct
      661 gagttgggta ccatggactg catccacccc ttgggaaact tcagcttcca gtccaagtgt
      721 gctttcaact gttctgaggg aagagagcta cttgggactg cagaaacaca gtgtggagca
      781 tctggaaact ggtcatctcc agagccaatc tgccaagtgg tccagtgtga gcctttggag
      841 gcccctgagt tgggtaccat ggactgcatc caccccttgg gaaacttcag cttccagtcc
      901 aagtgtgctt tcaactgttc tgagggaaga gagctacttg ggactgcaga aacacagtgt
      961 ggagcatctg gaaactggtc atctccagag ccaatctgcc aagagacaaa cagaagtttc
     1021 tcaaagatca aagaaggtga ctataacccc ctcttcattc ctgtagccgt catggtcacc
     1081 gcattctcgg ggctggcatt tctcatttgg ctggcaaggc ggttaaaaaa aggcaagaaa
     1141 tctcaagaaa ggatggatga tccatactga ttcatccttt gtgaaaggaa agccatgaag
     1201 tgctaaagac aaaacattgg aaaataacgt caagtcctcc cgtgaagatt ttacacgcag
     1261 gcatctccca cattagagat gcagtgtttg ctcaacgaat ctggaaggat ttcttcatga
     1321 ccaacagctc ctcctaattt cccctcgctc attcatccca ttaaccctat cccataatgt
     1381 gtgtctatac agagtagtat tttatcatct tttctgtgga ggaacaagca a
//
LOCUS       RATMHRT1B     562 bp ds-DNA             ROD       10-AUG-1990
DEFINITION  Rat MHC class II A-beta RT1.B-b-beta gene, partial cds.
ACCESSION   M36151
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Rat (strain BUF-(RT-b)) DNA.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 562)
  AUTHORS   Figueroa,F., Guenther,E. and Klein,J.
  TITLE     MHC polymorphism pre-dating speciation
  JOURNAL   Nature 335, 265-267 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   562     MHC A-beta RT1.B-b-beta cell surface
                                 glycoprotein (AA at 1)
BASE COUNT      122 a    171 c    173 g     96 t
ORIGIN      
        1 agagactccc caagggattt cgtgtaccag ttcgagggcc agtgctacta caccaccggg
       61 acgcagcgca tgcggctcgt gaccagacac atctacaacc gggaggagta cgtgcgcttc
      121 gacagcgacc tgggcgagta ccgcgcgctg accgagctgg ggcggccctc agccgagtac
      181 tggaataagc agtacctcga gcagacgcgg gccgagctgg acagggtctg cagatacaac
      241 tacgaggggc cgggggctct cacctccctg agacggcttg agcagcccaa tgtggccatc
      301 tccctgtcca ggacagaggc ccttaaccac cacaacctgc tggtctgctc agtgacagat
      361 ttctacccag cccagatcaa agtgcgctgg ttccggaatg gccaggagga gacgacgggg
      421 gtcgtgtcca cacagcttat taggaatggg gactggacct tccagatcct ggtcatgctg
      481 gagatcacgc ctcagcgggg agatgtgtac acctgccatg ttgaccaccc cagccttcag
      541 agccctgtca cagtggagtg gc
//
LOCUS       WHTMTTGPAA   1743 bp ds-DNA             ORG       10-AUG-1990
DEFINITION  Wheat mitochondrial Pro-tRNA gene.
ACCESSION   M36018
KEYWORDS    transfer RNA-Pro.
SOURCE      Wheat mitochondrial DNA, clone H-P2.
  ORGANISM  Mitochondrion Triticum aestivum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Triticum aestivum.
REFERENCE   1  (bases 1 to 1743)
  AUTHORS   Joyce,P.B.M., Spencer,D.F. and Gray,M.W.
  TITLE     Multiple sequence rearrangements accompanying the duplication of a
            Pro-tRNA gene in wheat mitochondrial DNA
  JOURNAL   Plant Mol. Biol. 11, 833-843 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA        831      905     Pro-tRNA
BASE COUNT      483 a    396 c    414 g    450 t
ORIGIN      
        1 aagcttcatg aaggaagggt ccaaggacct aagcgacaac ctctacgaaa aagaaaaacc
       61 tatatataga tagtggtatg tgttcgctga cactaaatag acttggctcg gggcagaact
      121 gagtggctag gctaagttcc ctagtcttct attggcctac ccacccctac gacatggagg
      181 cgcttaccct tctccaacaa gcgttccacc acgcggcgat tgattaccgc tttccgtctc
      241 aatcagttca gaccaagccc cttctctgat cgggggaagg tcattctaaa acatcagatc
      301 tcccattgat gctgatcaaa aataagttta tccacctaga atatatgata ggtaccgaaa
      361 gattggatgt catttctaaa caccaggcgg gacgagcagc cctataccac gtgtagccac
      421 actctagtgt ccttttctac ttagttggac agatcacttc agaaaatcgt ataaaaatca
      481 agcaagaaaa cggatgcgct aacgcgcaac ggctttcgcg ctagttgctc aaaaaatcgt
      541 ataaaaatca agcaagaaaa aggttctggc aggctgcgtg ggactgtaaa tcctctttcg
      601 ctggctgggc cctttggact cgaaatccaa acggagtgag tggttcgatt ccactctcag
      661 aacgagactg aacgaaataa aatgcaagtg aaacgagacg agaatcaaat tgtaggcttc
      721 tttcctaaaa gcggtggttc tcgcctcccc gtgcccaaag cggggtgggc gacacagcgc
      781 tgcggttctt ttctttatcg atcgggtaga tccatatgtt ctgaggggga cgaggtgtag
      841 cgcagtctgg tcagcgcatc tgttttgggt acagagggcc ataggttcga atcctgtcac
      901 cttgatgtgg tattcacaca atggggccga agtgcaaagc cccgcagcct atccgtggtc
      961 gggaaggcag gggtattaaa agtgaaactt ggcttgcgtg tgcgcgcaca caaaggagac
     1021 tgagttggat tgcaaagcct ctcagacgga aggaacctgt tttaaaaacc cctttctaca
     1081 atcttgagaa atctcacgca agcaactaca cgccgcaagt cttcggtatt gtcgcaagaa
     1141 ttgaaaagat gactccgctt tttctttcga gcatgcaagg cctcagtaga gcgagtctgt
     1201 gcatgagtag cgcagtttac aaactagtga atggtcaaat gaataggctc gtttcatcag
     1261 gggtaaccaa ggagtaagaa tgatttgata aagtaaacat tccctttgag tgcggttcct
     1321 tcagaacaga ctcttccctt ttcagattgc ttctttgggc cacttgagca tcgactgggc
     1381 aaggttggat tccctctact tcggggtggc atctatctca catgttttac attgatccat
     1441 agccctccgt tagataggaa cagctttgct aaatactgag aactctcgaa tagaatatga
     1501 gaagggaaag atcctttaca taaggaacta ttgatcatag gcgtacactc tgaaaagaat
     1561 tgaatgaaac tctctgtttc gtcactttga tatcattagc tgtctctgta gctggctctc
     1621 atgtctggtg ttccaagtaa aaacttttca actagctggg tcaccgctgt tttaaaccct
     1681 gacttggaat cagagccata aacgagtgat tcggtcggac ggacgattga gttcttaaag
     1741 ctt
//
LOCUS       YSCRSDSBA      76 bp ss-tRNA            PLN       10-AUG-1990
DEFINITION  S.cerevisiae Lys-tRNA.
ACCESSION   M36150
KEYWORDS    transfer RNA-Lys.
SOURCE      S.cerevisiae tRNA, clone alpha-S288c.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 76)
  AUTHORS   Madison,J.T., Boguslawski,S.J. and Teetor,G.H.
  TITLE     Nucleotide sequence of a lysine transfer ribonucleic acid from
            bakers' yeast
  JOURNAL   Science 176, 687-689 (1972)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       76     Lys-tRNA
    anticdn      34       36     anticodon stt
    modified      1        1     p
    modified      9        9     m2a
    modified     16       16     d
    modified     17       17     d
    modified     20       20     d
    modified     26       26     m22g
    modified     27       27     p
    modified     28       28     p
    modified     37       37     t6a
    modified     46       46     m7g
    modified     47       47     d
    modified     48       48     m5c
    modified     55       55     p
    modified     58       58     m1a
    modified     67       67     p
BASE COUNT       15 a     18 c     22 g     16 t      5 others
ORIGIN      
        1 tccttgttag ctcagddggd agagcgttcg gctsttaacc gaaatgdcag gggttcgagc
       61 cccctatgag gagcca
//
LOCUS       TNCPOLY      3660 bp ss-RNA             VRL       10-AUG-1990
DEFINITION  Tobacco necrosis virus RNA dependent RNA polymerase and coat
            protein genes, complete cds.
ACCESSION   M33002
KEYWORDS    RNA dependent RNA polymerase; coat protein; polymerase.
SOURCE      Tobacco necrosis virus (strain A, isolate TNV-A-FM1B) Wk6 cell
            line, cDNA to viral RNA, clones pFM2[0,1,2,3].
  ORGANISM  Tobacco necrosis virus
            Viridae; ss-RNA nonenveloped viruses.
REFERENCE   1  (bases 1 to 3660)
  AUTHORS   Meulewaeter,F., Seurinck,J. and van Emmelo,J.
  TITLE     Genome structure of tobacco necrosis virus strain A
  JOURNAL   Virology 177, 699-709 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Meulewaeter, 20-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         60      665     RNA dependent RNA polymerase (put.)
                                 /transl_except=(666..668)
                669     2234     RNA dependent RNA polymerase
    pept       2218     2436     ORF 3
    pept       2440     2610     ORF 4
    pept       2613     3443     coat protein
    pept       3467     3646     ORF 6
    site        666      668     readthrough codon
BASE COUNT     1034 a    890 c    895 g    841 t
ORIGIN      
        1 agtattcata ccaagaatac caaataggtg caaggcctta ctcagctaaa gagtctaaaa
       61 tggagctacc aaaccaacac aagcaaacgg ccgccgaggg tttcgtatct ttcctaaact
      121 ggctatgcaa cccatggaga cgacagcgaa cagtcaacgc tgcagttgcg ttccaaaaag
      181 atcttctcgc cattgaggat tccgagcatt tggatgacat caatgagtgt ttcgaggagt
      241 ctgctggggc acaatctcag cgaactaagg ttgtcgccga cggagcatat gcccccgcaa
      301 aatccaacag gacccgccga gttcgtaagc agaagaagca caagtttgta aaatatcttg
      361 tcaacgaagc tcgtgccgag tttggattgc ccaaaccaac tgaggcaaac agacttatgg
      421 tccaacattt cttgctcaga gtgtgcaagg attggggcgt tgttactgcc cacgtacacg
      481 gcaatgttgc actagctttg ccactggtgt tcatcccaac ggaagatgat ctgctatcac
      541 gagcattgat gaacacacat gctactagag ccgctgtacg aggcatggac aatgtccaag
      601 gggaggggtg gtggaacaat aggttgggga ttgggggcca ggtcggactg gccttccggt
      661 ccaaataggg gtgccttgaa aggaggccag gattctccac gtccgtttcg cgtggggaac
      721 atcctgatct ggtggtcata ccatcagggc gccctgagaa acagcgtcag ttgttacgct
      781 atagtggtat aggcggccat ttattaatcg gcatccacaa caactctctt tccaacctgc
      841 gtaggggctt gatggaaaga gtattctatg tcgaggggcc caatgggctt caagacgccc
      901 ctaagcccgt caagggagct tttcgaaccc ttgataagtt tcgtgatctc tatactaaaa
      961 atagttggcg tcatacccct gtaactagtg aacaattcct aatgaattac acgggcagga
     1021 aactgactat ttacagagag gcggttgata gtttgtcgca tcaacccctt agctcacgag
     1081 atgcgaaact aaagacattc gtgaaggccg aaaaattaaa tctttctaag aagcctgacc
     1141 ctgctcccag ggtcatccaa cctagatcgc ctcggtataa cgtttgtttg ggcaggtacc
     1201 tccgacatta tgagcatcac gcgtttaaaa ccattgccaa gtgctttggg gaaatcacgg
     1261 tcttcaaagg gtttactctg gagcaacaag gggaaatcat gcgctcgaag tggaataaat
     1321 atgttaatcc cgtcgcagtc ggactcgacg ccagtcgttt cgaccaacac gtgtctgttg
     1381 aagcactcga gtatgagcat gaattttacc tcagagacta cccaaatgat aaacagctaa
     1441 aatggctgct aaagcagcaa ttgtgcaacg taggaacggc attcgccagt gacggcatta
     1501 taaaatacaa gaagaagggt tgtagaatga gcggagacat gaacacgagc ttgggcaact
     1561 gcattctaat gtgcgccatg gtctacgggt tgaaagaaca cttaaacatc aatttgtccc
     1621 ttgcaaataa tggggatgac tgcgtcattg tctgtgagaa agcggattta aagaaattga
     1681 caagcagcat cgagccatat ttcaagcagt ttggattcaa gatggaagtg gaaaaacccg
     1741 tggatatatt tgagcgcata gaattttgcc aaacccaacc tgtgttcgat ggatcccagt
     1801 acatcatggt acgcaaacct tctgtggtaa catctaaaga cgtcactagc cttatcccat
     1861 gtcaaacgaa agcacaatac gcagaatggc tgcaagctgt aggtgagtgt ggcatgagca
     1921 ttaacggtgg gattcctgtc atgcagaatt tctaccaaaa gctccaaact ggcatccgcc
     1981 gcacaaaatt caccaagacc ggcgagttcc agacgaacgg attggggtat cactctagat
     2041 atatgcatag agtggcccgg gttccttcgc ctgaaacccg tttatccttc tatctagctt
     2101 tcggtatcac accagacctc caagaagcat tggagatctt ctatgatacc cacaagcttg
     2161 agttggatga tgttatccca actgatacct accaagtgtc aggagagcat ttgatcaatg
     2221 gattaccaaa ctgatgtaac ggaggacaat gtgcaaatac gcggtcgggc taggagcgtt
     2281 gagggtaaga aacacaatgg ttcgggatta actggcgtta agcgtcacgc ggtgagcgaa
     2341 acatctcaga aatcacagca aggtactggc aatggaacta tgaccaatat agccgaagaa
     2401 cagaccatta ccgtgacata caactttaac ttttaagtta tggctgcgtg tcgctgttgt
     2461 gatacttcac caggtattac actattccct tactttgcaa ttctcatcct tatattggca
     2521 atacttgttg tagggactcc caatcaacaa tatcaccatt ctccaagcac ttacgagtac
     2581 aagactcaac acatttcgat cgcaaaatag acatggcagg aaagaagaac aacaacaacg
     2641 gtcagtatat aatactgcgt actccagagc aacaggtgga gatagaccag cgcaacgccc
     2701 gtcgtgctca aatgggtcgc atgaagaagg ctagacagcc cgttcagcga tacttacagc
     2761 aacacgggtt gcgaaacgga ttgtccggta gagggggcta catagtggct cccacctccg
     2821 ggggggttgt cactcgaccc atagtgccga aattctccaa caggggagat tccactatag
     2881 tccgtaacac tgagattttg aacaaccaaa tcttagcggc gctaggcgca ttcaatacaa
     2941 caaactccgc actgattgca gcagcaccat catggctggc tagcatcgct gatctttaca
     3001 gtaaatacag atggctctca tgtgagatca tctacattcc aaaatgcccc accaccacca
     3061 gtggatcaat tgccatggct ttcacatacg acagaaatga cgctgcaccc accgcaaggg
     3121 ctcagctgtc acaatcttac aaggccatca attttccacc gtatgcggga tacgacggag
     3181 cagcatattt gaattcgaac cagggagctg ggtcagccat cgccgttcaa cttgatgtta
     3241 ccaagttgga caagccatgg taccccacta tctcctctgc cggcttcggg gcgctcagcg
     3301 tcctcgatca gaaccaattc tgccccgcgt cccttgtggt cgctagcgat gggggacccg
     3361 ctactgctac tccagcaggg gaccttttca tcaagtacgt gattgagttc attgaaccaa
     3421 tcaacccaac aatgaacgtc tagttctttg tactgtaact tggctaatgc ctaaggtgga
     3481 gtcacaccat tggagacgga gacggatcct gggaaacagg cttgacgggc ggggggtggt
     3541 gcccccgacg acgcatcact ccggatacca atggtacacc actatggcag ggtctgccaa
     3601 ggtcttgtgc accaagaacc cctggaaacg ggggggaggg gggtagcaca tatcatccag
//
LOCUS       BMORGA1       120 bp ds-DNA             INV       10-AUG-1990
DEFINITION  B.mori 5.8S ribosomal RNA upstream region.
ACCESSION   M36169
KEYWORDS    5.8S ribosomal RNA; ribosomal RNA.
SEGMENT     1 of 2
SOURCE      B.mori DNA.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 120)
  AUTHORS   Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and
            Ishikawa,H.
  TITLE     Comparative studies on the rDNA of the silkworm, Bombyx mori and
            its presumed ancestor
  JOURNAL   Comp. Biochem. Physiol. 88, 761-766 (1987)
  STANDARD  simple staff_review
BASE COUNT       34 a     15 c     20 g     51 t
ORIGIN      
        1 cgaaacaaaa actcacagac gttttagttt tcttaacgtg tgtgtgcgtg cgttgtacca
       61 ccgtgtttta tatgtttatt attatatatt ttgagattcg aatattataa tatttatgta
//
LOCUS       BMORGA2       354 bp ds-DNA             INV       10-AUG-1990
DEFINITION  B.mori 5.8S ribosomal RNA.
ACCESSION   M36302
KEYWORDS    5.8S ribosomal RNA; ribosomal RNA.
SEGMENT     2 of 2
SOURCE      B.mori DNA.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and
            Ishikawa,H.
  TITLE     Comparative studies on the rDNA of the silkworm, Bombyx mori and
            its presumed ancestor
  JOURNAL   Comp. Biochem. Physiol. 88, 761-766 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA        139      305     5.8S rRNA
BASE COUNT      103 a     74 c     74 g    103 t
ORIGIN      About 50 bp after segment 1.
        1 gctcgtgtct ctctcgtcgt cgttgacggt ttcgtaaatt aacacaaaat acattttatt
       61 ttatttattg tttttttttc gacaagaaaa aaaaaaaaga aattttttat ttttatttaa
      121 acattacaga aaaaacgaaa atcattaccc tggacggtgg atcacttggc tcgcgggtcg
      181 atgaagaacg cagttaactg cgcgtcatag tgtgaactgc aggacacatt tgaacatcga
      241 catttcgaac gcacattgcg gtccgtggag acacatccag gaccactcct gtctgagggc
      301 cggctgcata aaaaacatta atcacattgc gtggctcgtg gcgcgctcgt gatc
//
LOCUS       BMORGB1       120 bp ds-DNA             INV       10-AUG-1990
DEFINITION  B.mandarina 5.8S ribosomal RNA upstream region.
ACCESSION   M36170
KEYWORDS    5.8S ribosomal RNA; ribosomal RNA.
SEGMENT     1 of 2
SOURCE      B.mandarina DNA.
  ORGANISM  Bombyx mandarina
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 120)
  AUTHORS   Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and
            Ishikawa,H.
  TITLE     Comparative studies on the rDNA of the silkworm, Bombyx mori and
            its presumed ancestor
  JOURNAL   Comp. Biochem. Physiol. 88, 761-766 (1987)
  STANDARD  simple staff_review
BASE COUNT       34 a     15 c     20 g     51 t
ORIGIN      
        1 cgaaacaaaa actcacagac gttttagttt tcttaacgtg tgtgtgcgtg cgttgtacca
       61 ccgtgtttta tatgtttatt attatatatt ttgagattcg aatattataa tatttatgta
//
LOCUS       BMORGB2       138 bp ds-DNA             INV       10-AUG-1990
DEFINITION  B.mandarina 5.8S ribosomal RNA.
ACCESSION   M36303
KEYWORDS    5.8S ribosomal RNA; ribosomal RNA.
SEGMENT     2 of 2
SOURCE      B.mandarina DNA.
  ORGANISM  Bombyx mandarina
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 138)
  AUTHORS   Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and
            Ishikawa,H.
  TITLE     Comparative studies on the rDNA of the silkworm, Bombyx mori and
            its presumed ancestor
  JOURNAL   Comp. Biochem. Physiol. 88, 761-766 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1       89     5.8S rRNA
BASE COUNT       35 a     39 c     34 g     30 t
ORIGIN      About 50 bp after segment 1.
        1 ctgcaggaca catttgaaca tcgacatttc gaacgcacat tgcggtccgt ggagacacat
       61 ccaagaccac tcctgtctga gggccggctg cataaaaaac attaatcaca ttgcgtggct
      121 cgtggcgcgc tcgtgatc
//
LOCUS       BOVINTERA    1003 bp ss-mRNA            MAM       10-AUG-1990
DEFINITION  Bovine interleukin-1 alpha mRNA, complete cds.
ACCESSION   M36182
KEYWORDS    interleukin-1 alpha.
SOURCE      Bovine macrophage, cDNA to mRNA.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1003)
  AUTHORS   Leong,S.R., Flaggs,G.M., Lawman,M. and Gray,P.W.
  TITLE     The nucleotide sequence for the cDNA of bovine interleukin-1 alpha
  JOURNAL   Nucleic Acids Res. 16, 9053-9053 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         33      839     interleukin-1 alpha precursor
    sigp         33      368     interleukin-1 alpha signal peptide
    matp        369      836     interleukin-1 alpha
    mRNA    <     1     1003     interleukin-1 alpha mRNA
BASE COUNT      313 a    218 c    215 g    257 t
ORIGIN      
        1 tgttgctagc tcggttcagc aaagaagtga agatggccaa agtccctgac ctctttgaag
       61 acctgaagaa ctgttacagt gaaaatgaag actacagttc tgaaattgac cacctctctc
      121 tcaatcagaa gtccttctat gatgcaagct atgagccact tcgtgaggac cagatgaata
      181 agtttatgtc cctggatacc tcggaaacct ctaagacatc caagcttagc ttcaaggaga
      241 atgtggtgat ggtggcagcc agtgggaaga ttctgaagaa gagacggttg agtttaaatc
      301 agttcatcac cgatgatgac ctggaagcca ttgccaataa tacagaagaa gaaatcatca
      361 agcccagatc agcacattac agcttccaga gtaacgtgaa atacaacttt atgagagtca
      421 tccaccagga atgcatcctg aacgacgccc tcaatcaaag tataattcga gatatgtcag
      481 gtccatacct gacggctact acattaaata atctggagga ggcagtgaaa tttgacatgg
      541 ttgcttatgt atcagaagag gattctcagc ttcctgtgac tctaagaatc tcaaaaactc
      601 aactgtttgt gagtgctcaa aatgaagacg aacccgtctt gctaaaggag atgcctgaga
      661 cacccaaaat catcaaagat gagaccaacc tcctcttctt ctgggaaaag catggctcta
      721 tggactactt caaatcagtt gcccatccaa agttgtttat tgccacaaag caagaaaaat
      781 tggtgcacat ggcaagtggg ccgccctcga tcactgactt tcagatattg gaaaaatagc
      841 cttgactgtg cactctactt acttgtaaag tggtgaccat ccgtatgtac tatgtacatg
      901 aaggagtcga gcccttcact gttagtcact cgctgagcat gtgctgagct tttgtaattc
      961 taaatgaatg tttactctct ttgtaagaga gaacacaaag tcc
//
LOCUS       CHKCALMA      450 bp ss-mRNA            VRT       10-AUG-1990
DEFINITION  Chicken calmodulin mRNA, complete cds.
ACCESSION   M36167
KEYWORDS    calmodulin.
SOURCE      Chicken, cDNA to mRNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 450)
  AUTHORS   Iida,Y.
  TITLE     cDNA sequences and molecular evolution of calmodulin genes of
            chicken and eel
  JOURNAL   Bull. Chem. Soc. Jpn. 57, 2667-2668 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1      450     calmodulin
BASE COUNT      160 a     62 c    122 g    106 t
ORIGIN      
        1 atggctgatc aactgacaga agagcagatt gcagaattca aagaagcttt ttcactattt
       61 gacaaggatg gtgatggtac tataactaca aaggagttgg ggactgtgat gagatcactt
      121 ggtcagaacc ccacagaagc agaattacag gacatgatca atgaagtaga cgctgatggc
      181 aatggcacaa ttgacttccc agagtttctg acaatgatgg caagaaaaat gaaagataca
      241 gatagcgaag aagaaattag agaagcgttc cgtgtgtttg acaaggatgg taatggttac
      301 attagtgctg cagaacttcg tcatgtgatg acaaatcttg gggagaagct aacagatgaa
      361 gaagttgatg aaatgattag ggaagcagac attgatggtg atggtcaagt aaactatgaa
      421 gagtttgtac agatgatgac agcgaagtga
//
LOCUS       FSBCALMA      450 bp ss-mRNA            VRT       10-AUG-1990
DEFINITION  E.electricus calmodulin mRNA, complete cds.
ACCESSION   M36168
KEYWORDS    calmodulin.
SOURCE      E.electricus, cDNA to mRNA.
  ORGANISM  Electrophorus electricus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; 
            Actinopterygii; Cypriniformes; Characoidei; Electrophoridae.
REFERENCE   1  (bases 1 to 450)
  AUTHORS   Iida,Y.
  TITLE     cDNA sequences and molecular evolution of calmodulin genes of
            chicken and eel
  JOURNAL   Bull. Chem. Soc. Jpn. 57, 2667-2668 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1      450     calmodulin
BASE COUNT      133 a     96 c    137 g     84 t
ORIGIN      
        1 atggcagatc agctgactga ggaacagatt gctgagttca aggaggcgtt ttccctcttt
       61 gacaaagatg gtgacggcac catcaccacc aaagagctgg gtactgtgat gcgctctctg
      121 ggccaaaacc ccaccgaggc agagctgcag gacatgatca atgaagtgga tgctgacggc
      181 aatggaacaa tagacttccc ggagttcctg accatgatgg ccaagaaaat gaaggacaca
      241 gacagtgaag aagagatccg agaagccttc cgagtttttg acaaggacgg taatggctac
      301 atcagtgcag ccgagttgcg acatgtcatg actaacttgg gcgagaagct gacggacgag
      361 gaggtggatg agatgatccg agaggccgac atcgatggcg acggccaggt gaactatgaa
      421 gagttcgtgc aaatgatgac tgcaaagtga
//
LOCUS       HALTRL1L       88 bp ss-tRNA            RNA       10-AUG-1990
DEFINITION  H.volcanii Leu-tRNA-1.
ACCESSION   M35741
KEYWORDS    transfer RNA; transfer RNA-Leu.
SOURCE      H.volcanii tRNA.
  ORGANISM  Halobacterium volcanii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Gupta,R.
  TITLE     Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
            and three serine tRNAs
  JOURNAL   Syst. Appl. Microbiol. 7, 102-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       88     Leu-tRNA
    anticdn      37       39     Leu-tRNA anticodon gag
    modified     29       29     m22g
    modified     40       40     m1g
    modified     42       42     m5c
    modified     59       59     m5c
    modified     65       65     m1g
    modified     66       66     p
    modified     67       67     p
    modified     68       68     cm
    modified     69       69     m1i
BASE COUNT       16 a     29 c     29 g     13 t      1 others
ORIGIN      5' end of mature tRNA.
        1 gcgtgggtag ccaanccagg ccaacggcgc agcgttgagg gcgctgtcct gtagaggtcc
       61 gccggttcaa atccggtccc acgcacca
//
LOCUS       HALTRL2L       88 bp ss-tRNA            RNA       10-AUG-1990
DEFINITION  H.volcanii Leu-tRNA-2.
ACCESSION   M35742
KEYWORDS    transfer RNA; transfer RNA-Leu.
SOURCE      H.volcanii tRNA.
  ORGANISM  Halobacterium volcanii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Gupta,R.
  TITLE     Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
            and three serine tRNAs
  JOURNAL   Syst. Appl. Microbiol. 7, 102-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       88     Leu-tRNA
    anticdn      37       39     Leu-tRNA anticodon cag
    modified     29       29     m22g
    modified     40       40     m1g
    modified     60       60     m5c
    modified     65       65     m1g
    modified     66       66     p
    modified     67       67     p
    modified     68       68     cm
    modified     69       69     m1i
BASE COUNT       18 a     28 c     25 g     16 t      1 others
ORIGIN      5' end of mature tRNA.
        1 gcagggatag ccaantctgg ccaacggcgc agcgttcagg gcgctgtctc ataggagtcc
       61 gcaggttcaa atcctgctcc ctgcacca
//
LOCUS       HALTRL3L       87 bp ss-tRNA            RNA       10-AUG-1990
DEFINITION  H.volcanii Leu-tRNA-3.
ACCESSION   M35743
KEYWORDS    transfer RNA; transfer RNA-Leu.
SOURCE      H.volcanii tRNA.
  ORGANISM  Halobacterium volcanii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 87)
  AUTHORS   Gupta,R.
  TITLE     Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
            and three serine tRNAs
  JOURNAL   Syst. Appl. Microbiol. 7, 102-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       87     Leu-tRNA
    anticdn      36       38     Leu-tRNA anticodon caa
    modified     28       28     m1i
    modified     39       39     m1g
    modified     41       41     p
    modified     59       59     m5c
    modified     64       64     m1g
    modified     65       65     p
    modified     66       66     p
    modified     67       67     cm
    modified     68       68     m1i
BASE COUNT       20 a     27 c     25 g     14 t      1 others
ORIGIN      5' end of mature tRNA.
        1 gcgagggtag ctaantcagg aaaaagcggc ggactcaaga tccgctcccg taggggtccg
       61 tgggttcaaa tccctcccct cgcacca
//
LOCUS       HALTRL4L       86 bp ss-tRNA            RNA       10-AUG-1990
DEFINITION  H.volcanii Leu-tRNA-4.
ACCESSION   M35744
KEYWORDS    transfer RNA; transfer RNA-Leu.
SOURCE      H.volcanii tRNA.
  ORGANISM  Halobacterium volcanii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 86)
  AUTHORS   Gupta,R.
  TITLE     Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
            and three serine tRNAs
  JOURNAL   Syst. Appl. Microbiol. 7, 102-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       86     Leu-tRNA
    anticdn      35       37     Leu-tRNA anticodon tag
    modified     27       27     m22g
    modified     38       38     m1g
    modified     48       48     m5g
    modified     53       53     m1g
    modified     54       54     p
    modified     55       55     p
    modified     56       56     cm
BASE COUNT       16 a     27 c     28 g     14 t      1 others
ORIGIN      5' end of mature tRNA.
        1 gcgcgggtag ccaantggcc aaaggcgcag cgcttaggac gctgtggtgt agaccttcgc
       61 aggttcgaac cctgtcccgc gcacca
//
LOCUS       HALTRL5L       88 bp ss-tRNA            RNA       10-AUG-1990
DEFINITION  H.volcanii Leu-tRNA-5.
ACCESSION   M35745
KEYWORDS    transfer RNA; transfer RNA-Leu.
SOURCE      H.volcanii tRNA.
  ORGANISM  Halobacterium volcanii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Gupta,R.
  TITLE     Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
            and three serine tRNAs
  JOURNAL   Syst. Appl. Microbiol. 7, 102-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       88     Leu-tRNA
    anticdn      37       39     Leu-tRNA anticodon taa
    modified     29       29     m2g
    modified     40       40     m1g
    modified     42       42     p
    modified     65       65     m1g
    modified     66       66     p
    modified     67       67     p
    modified     68       68     cm
BASE COUNT       16 a     28 c     29 g     14 t      1 others
ORIGIN      5' end of mature tRNA.
        1 gcgggggtgg ctganccagg ccaaaagcgg cggacttaag atccgctccc gtaggggttc
       61 gcgagttcga atctcgtccc ccgcacca
//
LOCUS       HALTRS1S       88 bp ss-tRNA            RNA       10-AUG-1990
DEFINITION  H.volcanii Ser-tRNA-1.
ACCESSION   M35746
KEYWORDS    transfer RNA; transfer RNA-Ser.
SOURCE      H.volcanii tRNA.
  ORGANISM  Halobacterium volcanii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Gupta,R.
  TITLE     Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
            and three serine tRNAs
  JOURNAL   Syst. Appl. Microbiol. 7, 102-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       88     Ser-tRNA
    anticdn      37       39     Ser-tRNA anticodon gct
    modified     29       29     m22g
    modified     40       40     t6a
    modified     60       60     m5c
    modified     61       61     m5c
    modified     65       65     m1g
    modified     66       66     p
    modified     67       67     p
    modified     68       68     cm
BASE COUNT       15 a     31 c     27 g     14 t      1 others
ORIGIN      5' end of mature tRNA.
        1 gttgcggtag ccaancctgg cccaaggcgc tgggttgcta actcagtggc gtcaagcccc
       61 cggggttcga atccccgccg caacgcca
//
LOCUS       HALTRS2S       88 bp ss-tRNA            RNA       10-AUG-1990
DEFINITION  H.volcanii Ser-tRNA-2.
ACCESSION   M35747
KEYWORDS    transfer RNA; transfer RNA-Ser.
SOURCE      H.volcanii tRNA.
  ORGANISM  Halobacterium volcanii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 88)
  AUTHORS   Gupta,R.
  TITLE     Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
            and three serine tRNAs
  JOURNAL   Syst. Appl. Microbiol. 7, 102-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       88     Ser-tRNA
    anticdn      36       38     Ser-tRNA anticodon cga
    modified     13       13     p
    modified     28       28     m22g
    modified     36       36     ac4c
    modified     41       41     p
    modified     60       60     m5c
    modified     65       65     m1g
    modified     66       66     p
    modified     67       67     p
    modified     68       68     cm
    modified     69       69     m1i
BASE COUNT       20 a     28 c     22 g     17 t      1 others
ORIGIN      5' end of mature tRNA.
        1 gccgaggtag cctancccgg ccaaggcggt agattcgaaa tctactgtcc attcggacac
       61 gtgagttcaa atctcaccct cggcgcca
//
LOCUS       HALTRS3S       84 bp ss-tRNA            RNA       10-AUG-1990
DEFINITION  H.volcanii Ser-tRNA-3.
ACCESSION   M35748
KEYWORDS    transfer RNA; transfer RNA-Ser.
SOURCE      H.volcanii tRNA.
  ORGANISM  Halobacterium volcanii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Halobacteriales; Halobacteriaceae.
REFERENCE   1  (bases 1 to 84)
  AUTHORS   Gupta,R.
  TITLE     Transfer RNAs of Halobacterium volcanii: Sequences of five leucine
            and three serine tRNAs
  JOURNAL   Syst. Appl. Microbiol. 7, 102-105 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       84     Ser-tRNA
    anticdn      33       35     Ser-tRNA anticodon gga
    modified     25       25     m22g
    modified     56       56     m5c
    modified     61       61     m1g
    modified     62       62     p
    modified     63       63     p
    modified     64       64     cm
    modified     65       65     m1i
BASE COUNT       14 a     26 c     28 g     15 t      1 others
ORIGIN      5' end of mature tRNA.
        1 gccaggatgg ccgancggta aggcgcacgc ctggaaagcg tgttccctct gggatcgggg
       61 gttcaaatcc ctctcctggc gcca
//
LOCUS       HUMIGHVY      920 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human Ig germline H-chain gene D5-region.
ACCESSION   M35647
KEYWORDS    diversity exon; germline; immunoglobulin heavy chain.
SOURCE      Human placental DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 920)
  AUTHORS   Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T.
  TITLE     Human immunoglobulin D segments: Isolation of a new D segment and
            polymorphic deletion of the D-1 segment
  JOURNAL   Immunol. Lett. 17, 329-334 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /   604  /   633     Ig H-chain D-region (AA at 604)
    signal      456      462     heptamer recombination signal
    signal      488      497     nonamer recombination signal
    signal      575      583     nonamer recombination signal
    signal      596      602     heptamer recombination signal
    signal      634      640     heptamer recombination signal
    signal      654      662     nonamer recombination signal
BASE COUNT      256 a    269 c    255 g    140 t
ORIGIN      
        1 ggatccaggc agcaacatca cgggagaatg cagggctccc agacatccca accctcttgc
       61 aggcctctcc tgggaagaga ctgcatccac caccaaacag catggagccc gctgcatagt
      121 aactgagtca gtgactgacc tggagggcag gggagcagtg agccggagcc cagaccatag
      181 ggaccgagac cagccgctga catcccgagc ccctcactgg tggccccaga aaatatggaa
      241 acaggacgga cccacattcc cagctggacc agggcggaga ctgctgagcc ccgagcacca
      301 gccccaagaa acaccaggca aaatcatcag aggaggctcc tgagaaagag aggaggggag
      361 gtctccctca ccagcaaatg cttcccttaa ccaaacacag ggtccacgca actcccccca
      421 ggacaaagga ggagccccct gtacagcact gggctcagag tcctctccaa gacaccctga
      481 gtttcagaca aaaaccccct ggaaatcaca gtatcagcag gagaactggc cagagacagc
      541 aaggggggac tccgtgacac ccgtggggac aggaggattt tgtgggggct tgtgtcactg
      601 tgagaatatt gtaatagtac tactttctat gcccacagtg acacagcccc agtcccaaag
      661 ccctgctgta aatgcttcca cttctggagc tgagggggag cgtctgggaa gtagggccta
      721 gggtgtccac caatgccaaa acacaccaga cttcccccag acatcacccc caccagccag
      781 cgagcagagt aaacagaaaa tgagaagcaa gctgggaact gctggcacag gccccaagga
      841 aagagctggg gttgtgcaag agaagatgca ggcagagcta gcagggcctt ttactgtttc
      901 tgctttctgg cagagaattc
//
LOCUS       HUMIGHVZ      688 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human Ig germline H-chain gene D4-region.
ACCESSION   M35648
KEYWORDS    diversity exon; germline; immunoglobulin heavy chain.
SOURCE      Human peripheral blood cell DNA, clone D31.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 688)
  AUTHORS   Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T.
  TITLE     Human immunoglobulin D segments: Isolation of a new D segment and
            polymorphic deletion of the D-1 segment
  JOURNAL   Immunol. Lett. 17, 329-334 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    70  /   100     Ig H-chain D5-region (AA at 71)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
    signal       42       50     nonamer recombination signal
    signal       63       69     heptamer recombination signal
    signal      101      107     heptamer recombination signal
    signal      121      129     nonamer recombination signal
BASE COUNT      185 a    160 c    209 g    134 t
ORIGIN      
        1 tcagcaggag aactagccag agacagcaag aggggacagg aggattttgt gggggctcgt
       61 gtcactgtgt ggatattgta gtagtaccag ctgctatgcc cacagtgaca cagccccatt
      121 cccaaagccc tgctgtaaac gcttccactt gctggagctg aggggctggg gggagcgtct
      181 gggaagtagg cctaggtggc catcaatgcc caaaacgcac cagactcccc cccagacatc
      241 accccactgg ccagtgagca gagtaaacag aaaatgagaa gcagctggga agcttccaca
      301 ggcccaagga aagagctttg gcgggtgtgt aagaggggat gcgggcagag cgagcagggc
      361 cttttgctgt ttctgctttc ctgtgcagag agttcagcag ctggtgttca agatcgatgg
      421 ctgggagtga gccgagacgg acagtgtggg aagggcacag ggaaggagaa gcagccgcta
      481 tcctacactg tcatctttca agagtttgcc ctgtgcccac aatgctgcat catgggatcg
      541 ttaacactga tgtagacaca gctaaagaga gaatcagtga atggatttgc agcacagatc
      601 tgaataaatt ctcagaatgt ggagccacac agaagcaagc acacggaaag tggcctgatc
      661 gaatttcaag tacagtggca cttcagct
//
LOCUS       HUMIGHWA      690 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human Ig germline H-chain gene D4-region.
ACCESSION   M35649
KEYWORDS    diversity exon; germline; immunoglobulin heavy chain.
SOURCE      Human peripheral blood cell DNA, deletion clone D30.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 690)
  AUTHORS   Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Hongo,T.
  TITLE     Human immunoglobulin D segments: Isolation of a new D segment and
            polymorphic deletion of the D-1 segment
  JOURNAL   Immunol. Lett. 17, 329-334 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    70  /   100     Ig H-chain D5-region (AA at 71)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
    signal       42       50     nonamer recombination signal
    signal       62       68     heptamer recombination signal
    signal      101      107     heptamer recombination signal
    signal      121      129     nonamer recombination signal
BASE COUNT      186 a    160 c    212 g    132 t
ORIGIN      
        1 tcagcaggag aactagccag agacagcaag aggggacagg aggattttgt gggggctcgt
       61 gtcactgtgt ggatattgta gtagtaccag ctgctatgcc cacagtgaca cagccccatt
      121 cccaaagccc tgctgtaaac gcttccactt gctggagctg aggggctggg gggagcgtct
      181 gggaagtagg cctaggtggc catcaatgcc caaaacgcac cagactcccc cccagacatc
      241 accccactgg ccagtgagca gagtaaacag aaaatgagaa gcagctggga agcttccaca
      301 ggcccaagga aagagctttg gcgggtgtgt aagaggggat gcgggcagag cgagcagggc
      361 cttttgctgt ttctgctttc ctgtgcagag agttcagcag ctggtgttcg agatcaatgg
      421 ctgggagtga gccgagcagg acagcgtggg aagagcacag ggaaggagga gcagccgcta
      481 tcctacactg tcatctttcg aaagtttgcc ttgtgcccac actgctgcat catgggatcg
      541 ttaacactga tgtagacaca gctaaagaga gaatcatgga atggatttgc agcacagatc
      601 tgaataaatt ctcagaatgt ggagcagcac agaagcaagc acacagaaag tggcctgatg
      661 caaggacaag ttcagtggca ctagcatgct
//
LOCUS       HUMIGHWB      681 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human Ig germline H-chain gene D1-region.
ACCESSION   M35650
KEYWORDS    diversity exon; germline; immunoglobulin heavy chain.
SOURCE      Human peripheral blood cell DNA, clone D31.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 681)
  AUTHORS   Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T.
  TITLE     Human immunoglobulin D segments: Isolation of a new D segment and
            polymorphic deletion of the D-1 segment
  JOURNAL   Immunol. Lett. 17, 329-334 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /    69  /    99     Ig H-chain D5-region (AA at 70)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
    signal       41       49     nonamer recombination signal
    signal       61       67     heptamer recombination signal
    signal      100      106     heptamer recombination signal
    signal      120      128     nonamer recombination signal
BASE COUNT      187 a    156 c    203 g    135 t
ORIGIN      
        1 ccagcaagat cgggctccag tgacaccgag acaggacgga ggattttgtg ggggctcgtg
       61 tcactgtgag gatattgtac taatggtgta tgctataccc acagtgacac agccccattc
      121 ccaaagccct actgcaaaga ttccacttgc tgggctgagg ggctggggag cgtctgggaa
      181 atagggctag ggaccagcaa tgcccaaaac gcagattccg tcccatacat caccccattc
      241 accagtcagc agagtaaaca gaaaatgaga agcagctggg aagcttccac aggcccaagg
      301 aaagagcttt ggcgggtgtg taagagggga tgcgggcaga gcgagcaggg ccttttgctg
      361 tttctgcttt cctgtgcaga gagttcagca gctggtgttc gagatcaatg gctgggagtg
      421 agccgagcag gacagcgtgg gaagagcaca gggaaggagg agcagccgct atcctacact
      481 gtcatctttc gaaagtttgc cttgtgccca cactgctgca tcatgggatc gttaacactg
      541 atgtagacac agctaaagag agaatcatgg aatggatttg cagcacagat ctgaataaat
      601 tctcagaatg tggagcagca cagaagcaag cacacagaaa gtggcctgat gcaaggacaa
      661 gttcagtggc actagcatgc t
//
LOCUS       HUMTCAGVR     508 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human T-cell receptor germline gamma-chain gene V4-region
            (V4; subgroup I), exons 1 and 2.
ACCESSION   M36285
KEYWORDS    T-cell antigen receptor; T-cell receptor gamma-chain; germline;
            variable region.
SOURCE      Human EBV-transformed lymphoblastoid cell line DNA, clone 601.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 508)
  AUTHORS   Font,M.P., Chen,Z., Bories,J.C., Duparc,N., Loiseau,P., Degos,L.,
            Cann,H., Cohen,D., Dausset,J. and Sigaux,F.
  TITLE     The V-gamma locus of the human T cell receptor gamma gene
  JOURNAL   J. Exp. Med. 168, 1383-1394 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1       43     T-cell receptor gamma-chain (V4), exon 1
                                 /hgml_locus_uid="LE0226F" /nomgen="TCRGV4"
                                 /map="7p15"
                159  >   508     T-cell receptor gamma-chain (V4), exon 2
    IVS          44      158     T-cell receptor gamma-chain (V4) intron A
BASE COUNT      128 a    119 c    122 g    139 t
ORIGIN      
        1 atgcagtggg ccctagcggt gcttctagct ttcctgtctc ctggtgagtg cgctgcctac
       61 agagaggatc atgggttttg ttttctttat tttcttcttt tgcaaggatt gccatactaa
      121 ggaattcctc attatatttt gtgttgttcc cattgcagcc agtcagaaat cttccaactt
      181 ggaagggaga acgaagtcag tcatcaggca gactgggtca tctgctgaaa tcacttgtga
      241 tcttgctgaa ggaagtaccg gctacatcca ctggtaccta caccaggagg ggaaggcccc
      301 acagcgtctt ctgtactatg actcctacac ctccagcgtt gtgttggaat caggaatcag
      361 cccagggaag tatgatactt acggaagcac aaggaagaac ttgagaatga tactgcgaaa
      421 tcttattgaa aatgactctg gagtctatta ctgtgccacc tgggatgggc acagtgattc
      481 agatccgccc tacaccacac tgaaaacc
//
LOCUS       HUMTCAGVS     509 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human T-cell receptor germline gamma-chain gene V5-region
            (V5; subgroup I), exons 1 and 2.
ACCESSION   M36286
KEYWORDS    T-cell antigen receptor; T-cell receptor gamma-chain; germline;
            variable region.
SOURCE      Human EBV-transformed lymphoblastoid cell line DNA, clone 601.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 509)
  AUTHORS   Font,M.P., Chen,Z., Bories,J.C., Duparc,N., Loiseau,P., Degos,L.,
            Cann,H., Cohen,D., Dausset,J. and Sigaux,F.
  TITLE     The V-gamma locus of the human T cell receptor gamma gene
  JOURNAL   J. Exp. Med. 168, 1383-1394 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1       43     T-cell receptor gamma-chain (V5), exon 1
                                 /hgml_locus_uid="LF0226G" /nomgen="TCRGV5"
                                 /map="7p15"
                160  >   508     T-cell receptor gamma-chain (V5), exon 2
    IVS          44      159     T-cell receptor gamma-chain (V5) intron A
BASE COUNT      131 a    117 c    122 g    139 t
ORIGIN      
        1 atgcggtggg ccctactggt gcttctagct ttcctgtctc ctggtgagta cgctgcctac
       61 agagaggctc acaggttggg ttttgttttg tttacttctt ttgaaagggg tgcgatacaa
      121 aggaatacct cattaaattt tgtgttgttc ccattgcagc cagtcagaaa tcttccaact
      181 tggaaggggg aacgaagtca gtcacgaggc cgactaggtc atctgctgaa atcacttgtg
      241 accttactgt aataaatgcc ttctacatcc actggtacct acaccaggag gggaaggccc
      301 cacagcgtct tctgtactat gacgtctcca actcaaagga tgtgttggaa tcaggactca
      361 gtccaggaaa gtattatact catacaccca ggaggtggag ctggatattg atactacgaa
      421 atctaattga aaatgattct ggggtctatt actgtgccac ctgggacagg cacagtgatt
      481 cagacctgtc ctacaccaca ctgaaaatc
//
LOCUS       MUSNM23A      664 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Mouse tumor metastatic process-associated protein (NM23) gene mRNA,
            3' end.
ACCESSION   M35970
KEYWORDS    tumor metastatic process-associated protein.
SOURCE      Mouse cell line K-1735, cDNA to mRNA, clone pNM23.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 664)
  AUTHORS   Steeg,P.S., Bevilacqua,G., Kopper,L., Thorgeirsson,U.P.,
            Talmadge,J.E., Liotta,L.A. and Sobel,M.E.
  TITLE     Evidence for a novel gene associated with low tumor metastatic
            potential
  JOURNAL   J Natl Cancer Inst 80, 200-204 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      501     tumor metastatic process-associated protein
                                 NM23 (AA at 1)
    mRNA    <     1      664     NM23 mRNA
BASE COUNT      150 a    154 c    201 g    159 t
ORIGIN      
        1 cagtcgcagc cggcggtaaa gccttgtcat ctgaagggga ccatggccaa cagtgagcgt
       61 accttcattg ccatcaagcc tgatggggtc cagcgggggc tggtgggcga gatcatcaag
      121 cggttcgagc agaaggggtt ccgccttgtt ggtctgaagt ttctgcaggc ttcagaggac
      181 cttctcaagg agcactacac tgacctgaag gaccgcccct tctttactgg cctggtgaaa
      241 tacatgcact caggaccagt ggttgctatg gtctgggagg gtctgaatgt ggtgaagaca
      301 ggccgcgtga tgcttggaga gaccaacccc gcagactcta agcctgggac catacgagga
      361 gacttctgca ttcaagttgg caggaacatc attcatggca gcgattctgt aaagagcgca
      421 gagaaggaga tcagcttgtg gtttcagcct gaggagctgg tggagtacaa gagctgtgcg
      481 cagaactgga tctatgagtg ataggacggt gccggttttc tacctgctta ctcttgttct
      541 cacaggcagg ggaccagcaa ccctagatat ttctggaact tctttgacct ggaaggaacc
      601 tttgggagcc gtgactccct gtgcagtgtt acgtgccact gttagattaa agtgtttaat
      661 ctgt
//
LOCUS       SYNHGHOA      582 bp ds-DNA             SYN       10-AUG-1990
DEFINITION  Synthetic human growth hormone gene, complete cds.
ACCESSION   M36282
KEYWORDS    growth hormone.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 582)
  AUTHORS   Ohtsuka,E., Taniyama,Y., Iwai,S., Kitano,K., Miyamoto,S., Ohgi,T.,
            Sakuragawa,Y., Fujiyama,K., Ikari,T., Kobayashi,M., Miyake,T.,
            Shibahara,S., Tokunaga,T., Ikehara,M., Ono,A., Ueda,T., Tanaka,T.,
            Baba,H., Miki,T., Sakurai,A. and Oishi,T.
  TITLE     The synthesis of human growth hormone gene
  JOURNAL   Nucleic Acids Symp Ser 12, 79-82 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1      579     growth hormone
BASE COUNT      146 a    153 c    129 g    154 t
ORIGIN      
        1 atgttcccaa ctattccact gagtcgcctg ttcgataacg cgatgctgcg tgcgcatcgt
       61 ctgcaccaac tggctttcga cacttaccag gagttcgaag aagcatacat cccgaaagaa
      121 cagaaataca gcttccttca gaacccacag acctcgttgt gtttctctga aagtatcccg
      181 accccttcta accgcgaaga gacccagcag aaatcgaacc ttgaactgct tcgtatctcg
      241 ctgcttctca ttcagtcgtg gctggagcca gtacagttcc tgcgttcggt tttcgcaaac
      301 tcactggtat acggtgcgtc tgacagtaac gtttacgacc tgctgaaaga ccttgaagaa
      361 gggatccaga ccctgatggg tcgcctggaa gatggttcac cacgcactgg tcagatcttc
      421 aaacagactt actccaaatt cgatactaac tctcataacg atgatgctct gctgaaaaac
      481 tacggcctgc tgtactgttt ccgtaaagat atggataaag ttgaaacttt cctgcgtatc
      541 gttcagtgtc gttctgttga agggtcgtgt ggcttctaat ag
//
LOCUS       AC2TFDC       803 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  Plasmid pAC27 (from Pseudomonas putida) pyrocatechase (tfdC)
            gene, complete cds.
ACCESSION   M36279
KEYWORDS    pyrocatechase.
SOURCE      Plasmid pAC27(from Pseudomonas putida, strain AC867) DNA.
  ORGANISM  Plasmid pAC27
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 803)
  AUTHORS   Ghosal,D. and You,I.-S.
  TITLE     Nucleotide homology and organization of chlorocatechol oxidation
            genes of plasmids pJP4 and pAC27
  JOURNAL   Mol. Gen. Genet. 211, 113-120 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         21      803     pyrocatechase (tfdC)
BASE COUNT      174 a    240 c    250 g    139 t
ORIGIN      
        1 atcttgaaag gagacgagtc atggataaac gagttgccga ggtcgcaggc gcgatcgtcg
       61 aggcagtacg caaaattttg ctggacaagc gcgtcacgga agccgaatac cgcgcgggtg
      121 tcgactatct caccgaggtc gcacagacgc gggaaaccgc gctgcttctg gacgttttcc
      181 tgaacagcac catcatcgaa ggcaaggcgc agcgctcgcg gacctctgcg cctgcgatcc
      241 aggggccgta cttcctggaa ggtgctcctg tagttgaagg cgtcctcaag acctacgata
      301 ccgacgacca caaaccgctg atcattcgcg gtacggtgcg ctcggacacg ggcgagttgc
      361 tcgctggcgc tgtcatcgac gtgtggcact cgacgcctga tggcttgtac agcgggatcc
      421 acgacaacat ccccgtggac tactaccgcg gaaaactcgt gacggattcc cagggcaact
      481 atcgcgtgcg caccacgatg ccagtgccat accagatccc ctacgagggg ccgactgggc
      541 gtctgctggg ccacctgggc agccatacct ggcgtccgcc gcacgtgcac ttcaaggtgc
      601 gcaaggacgg tttcgaaccg ttgaccacgc aatactactt cgaagggggc aaatgggtgg
      661 acgatgactg ctgtcacggc gtcacccccg acctgattac gcccgagacg atcgaggacg
      721 gggtgcgggt catgaccctg gacttcgtaa tcgagcgtga gcaggccgag caacgcaagt
      781 cggctacgga gacagtggca tga
//
LOCUS       AFATFDC       788 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  Plasmid pJP4 (from Alcaligenes eutrophus) pyrocatechase (tfdC)
            gene, complete cds.
ACCESSION   M36280
KEYWORDS    pyrocatechase.
SOURCE      Plasmid pJP4 (from Alcaligenes eutrophus, strain JMP134) DNA.
  ORGANISM  Plasmid pJP4
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 788)
  AUTHORS   Ghosal,D. and You,I.-S.
  TITLE     Nucleotide homology and organization of chlorocatechol oxidation
            genes of plasmids pJP4 and pAC27
  JOURNAL   Mol. Gen. Genet. 211, 113-120 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         21      788     pyrocatechase (tfdC)
BASE COUNT      178 a    203 c    240 g    167 t
ORIGIN      
        1 gttttcatga cggaggcaaa gtgaacaaaa gagtcaagga tgttgtcgat gcgatcgtcg
       61 cggcggtaca gcgggtgctc gaccagaagg aggttactga ggcggaatac cgaactgcgg
      121 ttcattacct catgcaagta gccgaacagc gtgaaaccgc tcttctttgc gatgtttttt
      181 tcaatagcac cgtggctgcg acgaaggctc gtattagcga aggttcaacg cctgctatcg
      241 agggacccta ctatcgcgac gacgcccctc tggtcgatga ccggctcaag acttacgaca
      301 cggacgacca caagcctttg cttatccagg gaacggtcaa agcggtcgac gggagcgttg
      361 tcgaggacgt gacgattgat gtctggcatt cgacgcccga tgggaagtac agcggtttcc
      421 atgacgacat cccgactgat ttttatcgag ggaagctcag ggtgggcacc gatggcagct
      481 tccgcgtgcg cacaacgatg ccggtgccgt atcagatccc ggatcagggt cccacgggcg
      541 cattgctcga aaccatgggt ggtcactcgt ggcgtcccgc tcatgtacat ttcaaggtga
      601 aggcgccggg atatgaaacg ttgaccacgc agtactactt cgaaggtgga gattggatca
      661 cggacgactg ttgcaacggc gtccagtcta gtctgattac tcccgatatt gtggaagagg
      721 gtgtccgcct gatgaacatc aacttcgtta tcgagcccgc gcgtgcgcag gcgggggcaa
      781 acccgtga
//
LOCUS       ECOHSEII      530 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  E.coli heat-stable enterotoxin II (STII) gene, complete cds.
ACCESSION   M35729
KEYWORDS    enterotoxin II.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 530)
  AUTHORS   Picken,R.N., Mazaitis,A.J., Maas,W.K., Rey,M. and Heyneker,H.
  TITLE     Nucleotide sequence of the gene for heat-stable enterotoxin II of
            Escherichia coli
  JOURNAL   Infect. Immun. 42, 269-275 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        166      381     enterotoxin II (STII)
BASE COUNT      188 a     86 c     88 g    168 t
ORIGIN      
        1 taaataccta caacgggtga ttgacactac actcattaac tatactgcaa gtagcattaa
       61 aaatcttaat aaaggagagc ttcgtcacat ttttttgact tgactcatat aaaagcccac
      121 tggtataagt tttattgctt atagcaataa ggttgaggtg attttatgaa aaagaatatc
      181 gcatttcttc ttgcatctat gttcgttttt tctattgcta caaatgccta tgcatctaca
      241 caatcaaata aaaaagatct gtgtgaacat tatagacaaa tagccaagga aagttgtaaa
      301 aaaggttttt taggggttag agatggtact gctggagcat gctttggcgc ccaaataatg
      361 gttgcagcaa aaggatgcta atatatttat caatagcatt cagcaccata tacacaaaaa
      421 taatttttca taaaaagaac tctataaaat aaatattttt tgtgacaatg tcctaacgca
      481 agacggacat tgtccatttc tcactgcagg taaatgatct gtaaatagtc
//
LOCUS       RAT6P26B     1086 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Rat 6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase mRNA,
            3' end.
ACCESSION   M15685
KEYWORDS    6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase.
SOURCE      Rat liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1086)
  AUTHORS   Colosia,A.D., Lively,M., El-Maghrabi,M.R. and Pilkis,S.J.
  TITLE     Isolation of a cDNA clone for rat liver 6-phosphofructo 2-
            kinase/fructose 2,6-bisphosphatase
  JOURNAL   Biochem. Biophys. Res. Commun. 143, 1092-1098 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      915     6-phosphofructo 2-kinase/fructose
                                 2,6-bisphosphatase (AA at 1)
BASE COUNT      290 a    279 c    264 g    253 t
ORIGIN      
        1 gcagaaaaca tcaagcaagt gaaacttggt agtcctgatt acatagactg tgaccaagaa
       61 aaggttttgg aagactttct aaagagaata gagtgctatg agatcaacta ccaacctttg
      121 gatgaggaat tggacagcca cctgtcctac atcaagatct tcgacgtggg cacacgctac
      181 atggtaaatc gagtgcagga ccacgttcag agccgtacag cctactacct catgaacatc
      241 catgtcacac ctcgatctat ctacctatgc cgccatggtg agagtgaact caaccttaga
      301 ggccgcattg gaggtgactc tggcctctca gctcggggca agcagtatgc ctatgcacta
      361 gccaacttca tccggtctca aggcatcagc tccctgaaag tatggactag ccacatgaag
      421 aggaccattc agaccgctga agccctaggt gtcccctatg aacagtggaa ggccctgaat
      481 gagattgatg cgggtgtctg tgaagagatg acctatgaag aaattcagga acactaccct
      541 gaggaatttg cactacggga ccaggataaa tatcgttacc gctatccgaa gggagagtcc
      601 tatgaggatc tggttcagcg tcttgaacca gttataatgg agctagaacg gcaagaaaat
      661 gtactggtga tctgtcacca ggctgtcatg cggtgcctcc tggcatactt cctggataaa
      721 agttcagatg agctgcccta tctcaagtgt cctctgcata ctgtgctcaa actcacacct
      781 gtggcttatg gctgcagagt ggagtccatc tacctgaatg tggaggctgt gaacacacac
      841 cgggacaagc ctgagaatgt ggacatcacc cgtgaagctg aggaagcctt ggacactgta
      901 cctgcccatt actgagccct tttcaagtga tcagattgtc tctgttctca cccccttcct
      961 cctgtaggag ctgctgccct tgttctcctt aagcagactc tggctatggc ctgagagtgc
     1021 cctacctcca gtgaagaagt ccgtagcagc tcccaaacag gtctcaattc ctagccacaa
     1081 ctaagg
//
LOCUS       RATFABP       491 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Rat fatty liver acid binding protein (FABP) mRNA, complete cds.
ACCESSION   M35991
KEYWORDS    fatty acid binding protein.
SOURCE      Rat adult liver, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 491)
  AUTHORS   Gordon,J.I. and Lowe,J.B.
  TITLE     Analyzing the structures, functions and evolution of two abundant
            gastrointestinal fatty acid binding proteins with recombinant DNA
            and computational techniques
  JOURNAL   Chem. Phys. Lipids 38, 137-158 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         40      423     fatty acid binding protein (FABP)
    mRNA    <     1      491     fatty acid binding protein mRNA
BASE COUNT      161 a     95 c    128 g    107 t
ORIGIN      
        1 ctgttggtgg cagctgggaa aggaaacctc attgccacca tgaacttctc cggcaagtac
       61 caagtgcaga gccaagagaa ctttgagccc ttcatgaagg cgatgggtct gcctgaggac
      121 ctcatccaga aagggaagga catcaagggg gtgtcagaaa tcgtgcatga agggaagaaa
      181 gtcaaactca ccatcaccta tgggtccaag gtgatccaca atgagttcac cttgggggag
      241 gagtgcgaac tggagaccat gactggggaa aaggtcaagg cagtggttaa gatggagggt
      301 gacaataaaa tggtgacaac tttcaaaggc ataaagtccg tgactgaatt caatggagac
      361 acaatcacca ataccatgac actgggtgac atcgtctaca agagagtcag caagagaatt
      421 tagacaaggc tgtatttcat attcttttac agtataaaat taatacaata aagttacctc
      481 tgttttgaaa t
//
LOCUS       RATFABPX      564 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Rat intestinal fatty acid binding protein (FABP) mRNA, complete
            cds.
ACCESSION   M35992
KEYWORDS    fatty acid binding protein.
SOURCE      Rat adult small intestinal epithelium, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 564)
  AUTHORS   Gordon,J.I. and Lowe,J.B.
  TITLE     Analyzing the structures, functions and evolution of two abundant
            gastrointestinal fatty acid binding proteins with recombinant DNA
            and computational techniques
  JOURNAL   Chem. Phys. Lipids 38, 137-158 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         13      411     fatty acid binding protein (FABP)
BASE COUNT      179 a    108 c    146 g    131 t
ORIGIN      
        1 acagctgaca tcatggcatt tgatggcact tggaaagtag accggaatga gaactatgaa
       61 aagttcatgg agaaaatggg cattaacgtg gtgaagagga agcttggagc tcatgacaac
      121 ttgaaactga cgatcacaca ggaaggaaat aaattcacag tcaaagaatc aagcaacttc
      181 cgaaacattg atgttgtgtt tgaactcggc gtcgactttg cctatagtct agcagatgga
      241 acagaactca ctgggacctg gaccatggag ggaaataaac ttgttggaaa attcaaacgt
      301 gtagacaatg gaaaggagct gattgctgtc cgagagattt ctggtaacga actaatccaa
      361 acctacacat atgaaggagt ggaggccaag cggatcttta agaaggaata ggccagcttc
      421 ttggagcctg gcgcaatgca gaagggctag cttggtgtcg gattcctttc tccatctcgc
      481 tgataatgca ggctcactca ctgtgtcatg agcactgctt ccaagccttg tcagagccaa
      541 agaagtaaaa gttgattagg actt
//
LOCUS       BSTRR5SA      117 bp ds-DNA             RNA       10-AUG-1990
DEFINITION  B.stearothermophilus 5S RNA.
ACCESSION   M25591
KEYWORDS    5S ribosomal RNA; ribosomal RNA.
SOURCE      B.stearothermophilus RNA.
  ORGANISM  Bacillus stearothermophilus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 117)
  AUTHORS   Zimmermann,J. and Erdmann,V.A.
  TITLE     Binding sites of E.coli and B.stearothermophilus ribosomal proteins
            on B.stearothermophilus 5S RNA
  JOURNAL   Nucleic Acids Res. 5, 2267-2288 (1978)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rRNA          1      117     5S ribosomal RNA
BASE COUNT       25 a     36 c     38 g     18 t
ORIGIN      
        1 cctagtggtg atagcggagg ggaaacaccc gttcccatcc cgaacacgga agttaagccc
       61 tccagcgccg atggtagttg gggccagcgc ccctgcaaga gtaggccgct gctaggc
//
LOCUS       CHKVIM1      2842 bp ds-DNA             VRT       10-AUG-1990
DEFINITION  Chicken vimentin gene, exons 1 and 2.
ACCESSION   M15850 J02759 M25923
KEYWORDS    intermediate filament; vimentin.
SEGMENT     1 of 3
SOURCE      Chicken DNA, library of Dodgson.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 2842)
  AUTHORS   Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M.
  TITLE     The chicken vimentin gene: Nucleotide sequence, regulatory
            elements, and comparison to the hamster gene
  JOURNAL   J. Biol. Chem. 262, 8112-8120 (1987)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Zehner,Z.E. and Paterson,B.M.
  TITLE     The chicken vimentin gene: Aspects of organization and
            transcription during myogenesis
  JOURNAL   Ann. N.Y. Acad. Sci. 455, 79-94 (1985)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by B.A.Roe, 11-MAY-1987.
FEATURES       from  to/span     description
    pept        824     1368     vimentin, exon 1
               1458  +  1518     vimentin, exon 2
    pre-msg     737  >  2842     VIM mRNA
    IVS        1369     1457     VIM cds intron A
    IVS        1519  >  2842     VIM cds intron B
BASE COUNT      569 a    827 c    882 g    564 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttgagg atttctgtcc tgcctttgag gctctttggg gatcacttct aatgaccttt
       61 ttgtaagagc tctttcccga gacagtgacc gtgtgcctac agaaactaaa agcaaacaga
      121 ccctcggtta aaaccgctcc tgccgctccc ccgctgacca ggagcgctgt gcccgaagca
      181 aagcgatgcc cctcctgcag ccgtgcgagt gcgcagcgcc gtccgcaatg agcgcaactc
      241 cgtcggcacc gagcaggctg ggcgggcccc gggagctgcg ggagcgcgct gagcccatga
      301 gcacaggacg ggaggctgag gggggccgtc ggaagcagct aaccccaaac ccaccgacgg
      361 ttcacgtttc tatctgtttg ctttggacgt tcggtagccc ttcgtgaata gcatcactcc
      421 cccgatctcg gtgcctttca gagagctaag aaagaaggga cggggctcgc acaaagttca
      481 acgcatgcat ttttaagcac tcggcccgtt ctaggagcgc tggcgagcac cgcagcccgt
      541 cccgccctcg gaatcactgc gctgtcgaag gggcgtaggg cgcaaggcac ctcggcagcc
      601 gcccgggtcc ctccgccttt gtcccagcgc cgccaccgcc gccctccctc cgcccctccc
      661 cgggcccgac gcggcattgg cgacggacgg ggcggggcgg cgcggcggcg ggcggggcgg
      721 cggccccata agaggggccc cggggtgggg acgccgctct tcttcgccgc cgcgctccga
      781 gcccgtcgct cccggattac aaagccgctc cgttcctcgc gccatgagct tcaccagcag
      841 caagaactcc tcgtaccgcc gcatgttcgg cgggggcagc cggcccagca gcggcacccg
      901 ctacatcacg tccagcaccc gctattccct gggcagcgcc ctgcggccca gcagcgcccg
      961 ctacgtgtcc gcctcgcccg gcggcgtgta ccgcaccaag gcgacgtcgg tgcggctgcg
     1021 gagcagcatg ccgcccatgc ggatgcacga cgccgtggac ttcaccctgg cggacgccat
     1081 caacacggag ttcaaggcga accgcaccaa cgagaaggta gagctgcagg agctcaacga
     1141 ccgcttcgcc aactacatcg acaaggtgcg cttcctggag cagcagaaca agatcctgct
     1201 ggccgagctg gagcagctca agggcaaagg cacgtcccgc ttgggcgacc tgtacgagga
     1261 ggagatgcgg gacgtgcggc gccaggtgga ccagctgacc aacgacaagg cccgcgtcga
     1321 ggtggagcgc gacaacctgg ccgacgacat catgcgcctg cgggagaagt gagtggggcc
     1381 gcgggccggg gccgggaagc gggcgggcac ccacccctcc ctcacctcac ctcacctcac
     1441 cctgtcgctg cttgcaggtt gcaggaggag atgctgcagc gggaggaggc cgagagcacc
     1501 ctgcagtcct tccgacaggt gggcgcggcg ccggggcggt gggggcggaa ggtcggggcg
     1561 gggctgggct gggggctgct ctgcccgcac aaaggcgctc gcgcagcctc ttgttctccc
     1621 cgttgttttc tttagggggg tgccgcagtc cgggggggag ggtgaggcta ttgtcccgct
     1681 ttatccggtc tggccggctc cacaaaagcg ctccgacgtg gaatcgcttt ctgatccaat
     1741 agttgtgaaa acaggccatc tgcgcgccgg gctgaagggt attaatggtt tctatgggat
     1801 tcacagagga aatgcagatc caggcattat ggccggtgtg gtggcagcag attgaaataa
     1861 tggatccctt tgtgtcggag gggagggtgc cgtgggctgc attcctacca aatgcgtaat
     1921 ggtgggggca tattcctggg ggagactgtg tgaaatacag aactccttct ggatgagcca
     1981 cgtcggtctg cgctctcaga agactgtccc acatttgaag tatttcctat cagtggtgtc
     2041 tgccgatcgg aagcagacaa tgggctcctt gggatgagct cgcagcagct gtctgcgtcc
     2101 ccttgacgtg cattaaaaat gtagaattta atcctatata agcaaagatg tgaatgtaag
     2161 aaacagtgca ctggagatga gtgtgtagga agctgcggtg tcacacagat gaaccaaagc
     2221 cgatgtgaca gccaaaacca actgtgaagc ctgtgaaaag ttgtgagctc aagttctgat
     2281 ttgtgcaaaa acagctctcg gcttggattg agggctgttt ggactagcag atgtttagga
     2341 atgtgccttc tcaaaacaag ataacttcct gaaaactgct ggtgtgaaac aacactgctg
     2401 tcactcctct gagctggaac gagctgccct gtatctccag acagctgggc gcaggctgac
     2461 cgtcccatag agctctccat aggcaatagg tgtcaccatg caggaagttg tcatttgaat
     2521 tttagcattt tccctagcgg tcttagtgag acagtgtggg agcagacaca tggcagggtc
     2581 tgggaatgag gtgtccactg acgttctggg ggctgaggct tgaaatagca gctccaggta
     2641 ctgcctttaa cctgtgctgc tgtgttacca ggctgagccg atccaggttg ttgccaagtc
     2701 agtgcagatg catttccctt tttttgtttt aaattactag agttcaagca caagctcctt
     2761 tgacttaaag ctagtttaat catgcgtgct tctctggtgc gcttcaaggc atgtcttcct
     2821 gtccctccaa ccgatgaagc tt
//
LOCUS       CHKVIM2       685 bp ds-DNA             VRT       10-AUG-1990
DEFINITION  Chicken vimentin gene, exons 3 and 4.
ACCESSION   M15851 J02759 M25923
KEYWORDS    intermediate filament; vimentin.
SEGMENT     2 of 3
SOURCE      Chicken DNA, library of J.B.Dodgson.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 685)
  AUTHORS   Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M.
  TITLE     The chicken vimentin gene: Nucleotide sequence, regulatory
            elements, and comparison to the hamster gene
  JOURNAL   J. Biol. Chem. 262, 8112-8120 (1987)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Zehner,Z.E. and Paterson,B.M.
  TITLE     The chicken vimentin gene: Aspects of organization and
            transcription during myogenesis
  JOURNAL   Ann. N.Y. Acad. Sci. 455, 79-94 (1985)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by by B.A.Roe, 11-MAY-1987.
FEATURES       from  to/span     description
    pept    +     7      102     vimentin, exon 3
                246  +   407     vimentin, exon 4
    pre-msg <     1  >   685     VIM mRNA
    IVS     <     1        6     VIM cds intron B
    IVS         103      245     VIM cds intron C
    IVS         408  >   685     VIM cds intron D
BASE COUNT      189 a    142 c    158 g    196 t
ORIGIN      About 3 kb after segment 1.
        1 gatcaggatg ttgacaatgc ctctctggca ggccttgatc ttgagcgccc cgttgagtcc
       61 ctgcaagaag aaattgtctt cttgaagaag cttcatgatg aggtaagctg aataatgagg
      121 tctgtgtctg ggcaccgtca gtcaagtcaa caagtaaatt gtctctggtt ctggccccga
      181 gtcctaacct gtgcattttc tgcatagtgt tgtttattaa ctccaagtgt ctacttctct
      241 ctaaggaaat ccgggaactg caggctcaac tccaggaaca gcacatccaa atcgatatgg
      301 atgtttctaa gcctgatctt actgctgccc tgcgcgatgt tcgtcaacaa tatgaaagcg
      361 ttgctgctaa gaatcttcag gaagctgaag agtggtacaa gtccaaagta agtacaatgc
      421 gttttaggat tatgcagatg cagtaggcat tcacatgaat tttaggtaac agctttgctg
      481 tgttccttca gtactgaatg ttagcacagc cacagagtag ggtagtcttc aaactatcag
      541 tcagggaaga atacagttgg aaaagtcatt tgtgccctaa gtgcagcatc agactaatgg
      601 gctcatttcc tgttcttccc tattcttgta gctggaggca gttcttaagt atatacctaa
      661 gaaaggaatg aacaaatgca ttaaa
//
LOCUS       CHKVIM3      2857 bp ds-DNA             VRT       10-AUG-1990
DEFINITION  Chicken vimentin gene, exons 5-9.
ACCESSION   M15852 J02759 M25922
KEYWORDS    intermediate filament; vimentin.
SEGMENT     3 of 3
SOURCE      Chicken DNA, library of J.B.Dodgson.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 2857)
  AUTHORS   Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M.
  TITLE     The chicken vimentin gene: Nucleotide sequence, regulatory
            elements, and comparison to the hamster gene
  JOURNAL   J. Biol. Chem. 262, 8112-8120 (1987)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Zehner,Z.E. and Paterson,B.M.
  TITLE     The chicken vimentin gene: Aspects of organization and
            transcription during myogenesis
  JOURNAL   Ann. N.Y. Acad. Sci. 455, 79-94 (1985)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by by B.A.Roe, 11-MAY-1987.  Polyadenylation signals are located at
            positions 2546-2551, 2594-2599, 2829-2834, and 2850-2855.  Multiple
            mRNAs are found for vimentin, each using a different
            polyadenylation site.
FEATURES       from  to/span     description
    pept    +   159      284     vimentin, exon 5
                901     1121     vimentin, exon 6
               1216     1259     vimentin, exon 7
               1685     1770     vimentin, exon 8
               2256     2297     vimentin, exon 9
    pre-msg <     1  >  2297     VIM mRNA
    IVS     <     1      158     VIM cds intron D
    IVS         285      900     VIM cds intron E
    IVS        1122     1215     VIM cds intron F
    IVS        1260     1684     VIM cds intron G
    IVS        1771     2255     VIM cds intron H
BASE COUNT      816 a    536 c    611 g    892 t      2 others
ORIGIN      About 0.5 kb after segment 2.
        1 tcagatagga aaggtcagtg taacgttcat gtttaaatat ttcaagcgct agaacttccc
       61 tgtaatggct tcagttgcct ccaaatcctg ggtctagaag tggtccatta caccattttt
      121 gtgaagtgtt atgctaacca atctgggtgt ttccacagtt tgcagatctc tccgaagctg
      181 ctaataggaa caatgatgcc ctgcgccagg ccaaacaaga agctaatgaa tatcgcagac
      241 agattcagtc tctcacctgt gaagttgatg cccttaaagg aagtgtaagt agacgttctt
      301 cctgattcag ctttttaagc actatgctgt agtttagggg atgtgtgtgg ctgcagctca
      361 tctcccacag aagacagcag tgcacccatc cttcctgaaa aagactctct gggctccatt
      421 tgtctgcatg tactttgtag tgcactgctc agattgctac actggactgg tctttctcaa
      481 ggtgtaaaca atacagaatg agctatacga agtggaaaag tgcagatctt gcacttccta
      541 gaatacagtt gcctgaaagg ctggcttgaa atggaagtaa ctgtatagta ctcctgtaga
      601 ttccctcacc aagtactgtc tgcatgctga cttgtactgt atgtgtttgg cttactgtgg
      661 gggtttgctt tgctgtctgg tagaataaat gtgaaacatc tctcatcaga caccagaata
      721 acacacctgg actttgcaag gggaggataa ggacatctcc ttatggggac atatttgagc
      781 atgttcagaa aggacactgt cattcaactt ccctttgcgt ttttttctta agttatgaag
      841 ggtcttgaca ttagaaagag tggctgaaat gtagttaaat ttgactttat tctctcctag
      901 aatgaatccc tggagcgcca gatgcgtgaa atggaggaga attttgctgt tgaagctgct
      961 aactaccagg acactattgg ccgcctgcag gatgagattc agaacatgaa ggaagaaatg
     1021 gctcgccatc ttcgtgagta ccaggacctg ctgaatgtaa agatggctct tgatattgag
     1081 attgctacct acagaaaact gctggaggga gaagagagca ggtaaaaatc acatgcaggc
     1141 acctaccttc tagcagttac agtttagaga ttccttacca ttgtctgact tgatctattt
     1201 cttcttttcc ttcaggatta acatgcctat tccaaccttt gcttctttga acctgagagg
     1261 taagctgttt catttgggcc tgtgtaagca gctgttttta atctttagca aaaaggtcat
     1321 ggatagttcc ttttgctgta gtatttgaac tggtgtaaat gcagttgtat atgaaagtca
     1381 ttcttacaca caactgccag tatttgcagt tacttaattt tggatgagag gtggtacttt
     1441 gttcctgggt gtacacgtaa gggagtatca atacataaca agtgacttgg atttctgttg
     1501 aagctgcttt aagaggaata agaagaaaat aagcaaagct agtttataat tcattatcat
     1561 gcaaaagaga agaggacttc tttgtataca aaactgggat tctgaaagta gatactgaca
     1621 ttcctcagtt ctttgaggag gcacttcatc tagactcttg gtaacaatac attttcttct
     1681 gcagaaacca acattgagtc tcagccaatt gttgacactc actcgaagag gacacttcta
     1741 attaagaccg tggaaactag agatggacag gttggtggtt actttgattt tttttcattc
     1801 tatctcagct ggttactttg cccagtgctg tagtctgggc cactcattgg tgcagcctgt
     1861 atatggcaga cagtgtctgg cagtccaagc tgcatggctg aatgtgctga tgtagcaaat
     1921 gtacttctgt ggaatagacc tcaaaacaga tctgagaaca gcttgagtga ctgacagccc
     1981 gtgtttagag gagactttag gttcagttga gttgcaagtc ctcaagattt ctcccattgc
     2041 ataagaaaaa tgttgcagga gaagcgttta ttaggggcag gtagtgtatt tgtactaaat
     2101 tgaagtttag atggttaagc gtgtgcagct accatttaaa cttctcattc ataggcaaat
     2161 gctgatacct ttgttcacat ctgtgctact atttgtttca gttctaactc tttataacaa
     2221 ctgtcttaag cttnnatgcc tattttccaa cacaggttat taatgaaact tcccagcatc
     2281 acgatgactt ggagtaaagt gaagatgcaa acttaatgca ggagaaattc ttaccagcaa
     2341 ggttttaaaa agttcatgtc ttaaaggaag aaacagcttt caagtgcctt tctccagttt
     2401 tccatgagcg caagattatt atgctaggaa ataggtctta gatcttgcaa actgactctc
     2461 cctgaaggat tagagtttac aatggagtct agtttacaaa tagcaatatc ttgtgctgca
     2521 atactgtttt taagtatctg aatttaataa aactgctttt tccagcacag tatgagcaac
     2581 ctgtcgctac ttcaataaat ctttggaaaa tggctcttga tgtgttctaa tttaacttca
     2641 tgactttctg caaagccata acttaatgct ggaattacta tacggttgac aactccagta
     2701 ctgattgtgt gaaatattgt tttcagatta actagacaaa ctgtcttccc atttactgct
     2761 taggttttgg aaccaattaa aatggactat aactggcaga tgcataatgt attgatactt
     2821 atcagttgaa taaaatgata cttcaagcta ataaaaa
//
LOCUS       CHKVIMN       591 bp ds-DNA             VRT       10-AUG-1990
DEFINITION  chicken intermediate filament protein vimentin gene, 3' end.
ACCESSION   X00185 J00915
KEYWORDS    vimentin.
SOURCE      Chicken DNA and cDNA to embryonic muscle mRNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 591)
  AUTHORS   Zehner,Z.E. and Paterson,B.M.
  TITLE     Characterization of the chicken vimentin gene: Single copy gene
            producing multiple mRNAs
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 911-915 (1983)
  STANDARD  simple staff_review
REFERENCE   2  (bases 30 to 591)
  AUTHORS   Zehner,Z.E. and Paterson,B.M.
  TITLE     Vimentin gene expression during myogenesis: Two functional
            transcripts from a single copy gene
  JOURNAL   Nucleic Acids Res. 11, 8317-8332 (1983)
  STANDARD  simple staff_review
COMMENT     
FEATURES       from  to/span     description
    mRNA    <     1      308     vimentin mRNA (alt.)
    mRNA    <     1      367     vimentin mRNA (alt.)
    mRNA    <     1      591     vimentin mRNA (alt.)
BASE COUNT      193 a    101 c    105 g    192 t
ORIGIN      
        1 cagcatcacg atgacttgga gtaaagctga agtgaagatg caaacttaat gcaggagaaa
       61 ttcttaccag caacctttta aaaagttcat gtcttaaagg aagaaacagc tttcaagtgc
      121 ctttctccag tttttccatg agcgcaagat tattatgcta ggaaataggt cttagatctt
      181 gcaaactgac tctccctgaa ggattagagt ttacaatgga gtctagttta caaatagcaa
      241 tatcttgtgc tgcaatactg tttttaagta tctgaattta ataaaactgc tttttccagc
      301 acagtatgag caacctgtcg ctacttcaat aaatctttgg aaaatggctc ttgatgtgtt
      361 ctaatttaac ttcatgactt tctggaaagc cataacttaa tgctggaatt actatacggt
      421 tgacaactcc agtactgatt gtgtggaata ttgttttcag attaactaga caaactgtct
      481 tcccatttac tgcttaggtt ttggaaccaa ttaaaatgga ctataactgg cagatgcata
      541 atgtattgat acttatcagt tgaataaaat gatacttcaa gctaataaaa a
//
LOCUS       ECORRD       1541 bp ss-rRNA            RNA       10-AUG-1990
DEFINITION  E.coli 16S ribosomal RNA.
ACCESSION   J01859 M24828 M24911 K02555 M24996
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      Escherichia coli rRNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1541)
  AUTHORS   Ehresmann,C., Stiegler,P., Carbon,P. and Ebel,J.-P.
  TITLE     Recent progress in the determination of the primary sequence of the
            16 S RNA of Escherichia coli
  JOURNAL   FEBS Lett. 84, 337-341 (1977)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1541)
  AUTHORS   Carbon,P., Ehresmann,C., Ehresmann,B. and Ebel,J.-P.
  TITLE     The sequence of Escherichia coli ribosomal 16 S RNA determined by
            new rapid gel methods
  JOURNAL   FEBS Lett. 94, 152-156 (1978)
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 1541)
  AUTHORS   Carbon,P., Ehresmann,C., Ehresmann,B. and Ebel,J.-P.
  TITLE     The complete nucleotide sequence of the ribosomal 16-S RNA from
            Escherichia coli: Experimental details and cistron heterogeneities
  JOURNAL   Eur. J. Biochem. 100, 399-410 (1979)
  STANDARD  full staff_review
REFERENCE   4  (bases 1346 to 1541)
  AUTHORS   Taylor,B.H., Prince,J.B., Ofengand,J. and Zimmermann,R.A.
  TITLE     Nonanucleotide sequence from 16S ribonucleic acid at the peptidyl
            transfer ribonucleic acid binding site of the Escherichia coli
            ribosome
  JOURNAL   Biochemistry 20, 7581-7588 (1981)
  STANDARD  full staff_review
REFERENCE   5  (bases 1442 to 1541)
  AUTHORS   Schnare,M.N. and Gray,M.W.
  TITLE     3'-terminal sequence of wheat mitochondrial 18S ribosomal RNA:
            Further evidence of a eubacterial evolutionary origin
  JOURNAL   Nucleic Acids Res. 10, 3921-3932 (1982)
  STANDARD  full staff_review
REFERENCE   6  (bases 1 to 1541)
  AUTHORS   Ehresmann,C., Stiegler,P., Fellner,P. and Ebel,J.-P.
  TITLE     The determination of the primary structure of the 16s ribosomal rna
            of escherichia coli: III. Further studies
  JOURNAL   Biochimie 57, 711-748 (1975)
  STANDARD  simple staff_entry
REFERENCE   7  (bases 1 to 1541)
  AUTHORS   Ehresmann,C., Stiegler,P., Fellner,P. and Ebel,J.-P.
  TITLE     the determination of the primary structure of the 16S ribosomal RNA
            of Escherichia coli: (2) nucleotide sequences of products from
            partial enzymatic hydrolysis
  JOURNAL   Biochimie 54, 901-967 (1972)
  STANDARD  simple staff_entry
REFERENCE   8  (sites)
  AUTHORS   Zimmermann,R.A. and Singh-Bergmann,K.
  TITLE     Binding sites for ribosomal proteins S8 and S15 in the 16S RNA of
            Escherichia coli
  JOURNAL   Biochim. Biophys. Acta 563, 422-431 (1979)
  STANDARD  simple staff_entry
REFERENCE   9  (bases 558 to 890)
  AUTHORS   Gregory,R.J., Zeller,M.L., Thurlow,D.L., Gourse,R.L., Stark,M.J.R.,
            Dahlberg,A.E. and Zimmermann,R.A.
  TITLE     Interaction of ribosomal proteins S6, S8, S15 and S18 with the
            central domain of 16S ribosomal RNA from Escherichia coli
  JOURNAL   J. Mol. Biol. 178, 287-302 (1984)
  STANDARD  simple staff_review
COMMENT     [2], [3] contain updated sequence data for the original work by the
            same laboratory [1]. There were too many discrepancies between [1]
            and [2], [3] to list each revision in our sites table. The sequence
            shown is from [3]. [1], [2], [3] point to a number of cistron
            heterogeneities. There is uncertainty, however, with regard to
            assigning these various heterogeneities to specific cistrons. The
            RNA method used by [1], [2], [3] gives the average of all the
            cistrons present in the cell [3]. The heterogeneities are
            classified by their relative proportions into major, minor and
            undetermined species. The sequence shown corresponds to the major
            species. The heterogeneities were annotated as variations in the
            sites table. It is not known which of the residues "c" (base 633)
            or "a" (base 641) undergoes a deletion, giving rise to the minor
            component "atctg". [3] suggests the existence of one or two mutated
            cistrons among the known seven cistrons of ribosomal RNA.
            
            With the exception of a single base deletion, this sequence is
            identical to the current 16S rDNA sequence for the E.coli rRNB
            gene.
FEATURES       from  to/span     description
    rRNA          1     1541     16S rRNA
    variant      76       76     g in major species; c in undetermined species
    variant      80       80     a in major species; c in undetermined species
    variant      88       89     tc in major species; tgc in undetermined
                                 species
    variant      90       92     ttt in major species; substitution of one of
                                 the t by c in undetermined species
    variant     179      179     t in major species; c in minor species
    variant     182      182     c in major species 1; t in major species 2
    variant     193      194     ca in major species; cga in undetermined
                                 species
    variant     267      272     tcacct in major species; ccacct or tcaccc in
                                 minor species
    variant     283      285     ccc in major species; cc in minor species
    conflict    348      349     ag in [3]; ga in [2]
    modified    526      526     m7g = 7-methylguanosine
    variant     630      631     ct in major species; cct in minor species
    variant     632      634     gca in major species; ga in minor species (see
                                 comment)
    variant     640      642     tac in major species; tc in minor species (see
                                 comment)
    conflict    853      855     ttc in [3]; ctt in [2]
    revision    915      916     tg in [3]; tcg in [1],[2] (typing error)
    variant     965      965     g in undetermined species 1; 2-methylguanosine
                                 in species 2
    conflict   1027     1029     ctt in [3]; ttc in [2]
    variant    1071     1072     gt in major species; gtt in undetermined
                                 species
    variant    1074     1074     t in major species; a in undetermined species
    modified   1099     1099     m5c = 5-methylcytidine
    modified   1206     1206     m2g = 2-methylguanosine
    variant    1280     1280     c in major species; t in minor species
    variant    1320     1322     tcg in major species; tg in minor species
    modified   1401     1401     m4c = n4-methylcytidine
    modified   1402     1402     cm = 2'-o-methylcytidine
    variant    1405     1405     t in major species; c in minor species
    variant    1406     1406     5-methylcytidine in major species; c in minor
                                 species
    conflict   1490     1493     gaag in [3]; gag in [4]
    modified   1497     1497     um = 2'-o-methyluridine
    modified   1517     1517     m66a = n6,n6-dimethyladenosine
    modified   1518     1518     m66a = n6,n6-dimethyladenosine
BASE COUNT      389 a    352 c    487 g    313 t
ORIGIN      5' end of mature rRNA.
        1 aaattgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa
       61 gtcgaacggt aacaggaaga agcttgctct ttgctgacga gtggcggacg ggtgagtaat
      121 gtctgggaaa ctgcctgatg gagggggata actactggaa acggtagcta ataccgcata
      181 acgtcgcaag accaaagagg gggaccttcg ggcctcttgc catcggatgt gcccagatgg
      241 gattagctag taggtggggt aacggctcac ctaggcgacg atccctagct ggtctgagag
      301 gatgaccagc cacactggaa ctgagacacg gtccagactc ctacgggagg cagcagtggg
      361 gaatattgca caatgggcgc aagcctgatg cagccatgcc gcgtgtatga agaaggcctt
      421 cgggttgtaa agtactttca gcggggagga agggagtaaa gttaatacct ttgctcattg
      481 acgttacccg cagaagaagc accggctaac tccgtgccag cagccgcggt aatacggagg
      541 gtgcaagcgt taatcggaat tactgggcgt aaagcgcacg caggcggttt gttaagtcag
      601 atgtgaaatc cccgggctca acctgggaac tgcatctgat actggcaagc ttgagtctcg
      661 tagagggggg tagaattcca ggtgtagcgg tgaaatgcgt agagatctgg aggaataccg
      721 gtggcgaagg cggccccctg gacgaagact gacgctcagg tgcgaaagcg tggggagcaa
      781 acaggattag ataccctggt agtccacgcc gtaaacgatg tcgacttgga ggttgtgccc
      841 ttgaggcgtg gcttccggag ctaacgcgtt aagtcgaccg cctggggagt acggccgcaa
      901 ggttaaaact caaatgaatt gacgggggcc cgcacaagcg gtggagcatg tggtttaatt
      961 cgatgcaacg cgaagaacct tacctggtct tgacatccac ggaagttttc agagatgaga
     1021 atgtgccttc gggaaccgtg agacaggtgc tgcatggctg tcgtcagctc gtgttgtgaa
     1081 atgttgggtt aagtcccgca acgagcgcaa cccttatcct ttgttgccag cggtccggcc
     1141 gggaactcaa aggagactgc cagtgataaa ctggaggaag gtggggatga cgtcaagtca
     1201 tcatggccct tacgaccagg gctacacacg tgctacaatg gcgcatacaa agagaagcga
     1261 cctcgcgaga gcaagcggac ctcataaagt gcgtcgtagt ccggattgga gtctgcaact
     1321 cgactccatg aagtcggaat cgctagtaat cgtggatcag aatgccacgg tgaatacgtt
     1381 cccgggcctt gtacacaccg cccgtcacac catgggagtg ggttgcaaaa gaagtaggta
     1441 gcttaacctt cgggagggcg cttaccactt tgtgattcat gactggggtg aagtcgtaac
     1501 aaggtaaccg taggggaacc tgcggttgga tcacctcctt a
//
LOCUS       HUMRSALPA     320 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alphoid repetitive DNA, subclone pRI-680-05, dimer 1.
ACCESSION   M28029
KEYWORDS    alphoid repetitive sequence; repetitive DNA.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 320)
  AUTHORS   Zaitsev,I.Z. and Rogaev,E.I.
  TITLE     Structural analysis of alphoid DNA of primates
  JOURNAL   Mol. Biol. 20, 530-538 (1986)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt           1      320     alphoid repeat, dimer 1
BASE COUNT      100 a     54 c     59 g    107 t
ORIGIN      
        1 tgatgtgtgt attcaacgca cagagttgaa cgttccttta gacagagtag attggaaaca
       61 ctctttttgt agaattttca agtggaggta tcaagcgctt tgaggccaat gatagaaaag
      121 gaaatacctt cgtataataa ttagacagaa tcattctcag aaaccgcttt gcaatgtgtg
      181 cgttcaactc acagtgttta acctttcttt tcatacagtt gtttcgaaac actcttttta
      241 cagaatctac aagtaaatat ttggacctct ttgaagtctt cgttagaaat gagatttctt
      301 catataatac tagctagaag
//
LOCUS       HUMRSALPB     314 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alphoid repetitive DNA, subclone pRI-680-05, dimer 2.
ACCESSION   M28030
KEYWORDS    alphoid repetitive sequence; repetitive DNA.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Zaitsev,I.Z. and Rogaev,E.I.
  TITLE     Structural analysis of alphoid DNA of primates
  JOURNAL   Mol. Biol. 20, 530-538 (1986)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt           1      314     alphoid repeat, dimer 2
BASE COUNT       89 a     51 c     65 g    109 t
ORIGIN      
        1 ttctctcagt aacttgtatt tgctcggtgc tatcaactct cagagttgaa ctttccttta
       61 gaaacagtag atttgaaact ctctttttgc ggaatttgta agtggagata tcaagagttt
      121 tgaggccaat ggtagaaaag gaaatatctt cgtatgtaaa ctagatagaa tcattctcag
      181 aaactacttt ggcacgtgtg cgttcaactc acagtgttta acctttcttt tcatagagca
      241 gtttggaaac actcagtttg taaagtcagc aactggatat ttggatgtat ttgattcctt
      301 cgttggaaat ggga
//
LOCUS       HUMRSALPC    2550 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alphoid repetitive DNA, subclone pHS53.
ACCESSION   M28031 M28033 M28032
KEYWORDS    alphoid repetitive sequence; repetitive DNA.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2550)
  AUTHORS   Zaitsev,I.Z. and Rogaev,E.I.
  TITLE     Structural analysis of alphoid DNA of primates
  JOURNAL   Mol. Biol. 20, 530-538 (1986)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt           1      850     alphoid pentamer
    rpt         851     1700     alphoid pentamer
    rpt        1701     2550     alphoid pentamer
BASE COUNT      743 a    496 c    524 g    787 t
ORIGIN      
        1 gaatctgcaa gtggatattt ggacttctct gaggatttcg ttggaaacgg gataaacttc
       61 ccagaactac acggaagcat tgtgagaaac ttctttgtga tgtttgcatt caactcacag
      121 agttgaacct tgctttcata gttcagcttt caaacactct ttttgtagaa tctgcaagtg
      181 gatatttgga ccactttgtg gccttccttc gaaacgggta tatcttcaca tcaaacctag
      241 accgaagcat tctcagaatg tttcctttga tgactgcatt caactcacag aggtgaacaa
      301 tcctgctgat ggagcagttt tgaaactctc tttctttgga ttctgcaagt ggatatgtgg
      361 acctctgtga agatttcttt ggaaacgggt tcatcttcac agaaaaacta aacagaagca
      421 ttctcagaaa ctgctttgtg atgtttgtgt tccacttcag gaattaaact ttcctcttga
      481 cagagcagct ctgaaaccct cttattctag aatctgcaag tggacatttg gaggactttg
      541 aggcctgtgg tggaaaagga aaatcttcac ataaaaacta gatggaagca ttctcagaaa
      601 ctactttgtg atgattgcat tcgactcaca gagttgaaca ttcctataga tagagcaggt
      661 tgtaaacaat ctttttgtag aatctgcgat tggagatttg gactgctttg aggcctactg
      721 tagtaaagga aataacttca tctaaaaacc aaacggaagc attcacagac aattcttagt
      781 gataattgca ttgatctaac agagctgaac attcctttag atggcgtagt ttccaaacac
      841 actttctgta gaatctgcaa gtggatattt ggacttctct gaggatttcg ttggaaacgg
      901 gataaacttc ccagaactac acggaagcat tgtgagaaac ttctttgtga tgtttgcatt
      961 caactcacag agttgaacct tgctttcata gttcagcttt caaacactct ttttgtggaa
     1021 tctgcaagtg gatatttgga ccactttgtg gccttccttc gaaacgggta tatcttcaca
     1081 tcaaacctag accgaagcat tctcagaatg tttcctgtga cgactgcatt caactcacag
     1141 agatgaacaa tcctgctgat ggagcagttt tgaaactctc tttctttgga ttctgcaagt
     1201 tgatatgtgg acctctgtga agatttcgtt ggaaacgggt tcatcttcac agaaaaacta
     1261 aacagaagca ttctcagaaa ctgctttgtg atgtttgtgt tccacttcaa gaattgaact
     1321 ttcctcttga cagagcagct ctgaaaccct ctttttctag aatctgcaag tggacatttg
     1381 gagggctttg aggcctgtgg tgcaaaagga aaatcttcac ataaaaacta gatggaagca
     1441 ttctcagaaa ctactttgtg atgattgcat tcgactcaca gagttgaaca ttcctataga
     1501 tagagcaggt tgtaaacaat ctttttgtag aatctgcgat tggagatttg gactgctttg
     1561 aggcctactg tagtaaagga aataacttca tctaaaatcc aaacggaagc attcacagac
     1621 aattcttagt gataattgca ttgatctaac agagctgaac attcctttag atggcgtagt
     1681 ttccaaacac actttctcta gaatctgcaa gtggatattt ggacttctct gaggatttcg
     1741 ttggaaacgg gataaacttc ccagaactac acggaagcat tgtgagaaac ttctttgtga
     1801 tgtttgcatt caactcacag agttgaacct tgctttcata gttcagcttt caaacactct
     1861 ttttgtggaa tctgcaagtg gatatttgga ccactttgtg gccttccttc gaaacgggta
     1921 tatcttcaca tcaaacctag acagaagcat tctcagaatg tttcctgtga cgactgcatt
     1981 caactcacag aggtgaacaa ccctgctgat ggagcagttt tgaaactctc tttctttgga
     2041 ttctgcaagt ggatatgtgg acctctgtga agatttcgtt ggaaacggtt tcatcttcac
     2101 agaaaaacta aacaggagca ttctcagaaa ctgctttgtg atgtttgtgt tccacttcaa
     2161 gaattgaact ttcctcttga cagagcagct ctgaaaccct ctttttctag aatctgcaag
     2221 tggacatttg gagggctttg aggcctgtgg tggaaaagga aaatcttcac ataaaaacta
     2281 gatggaagca ttctcagaaa ctactttgtg atgattgcat tcgactcaca gagttgaaca
     2341 ttcctataga tagagcaggt tgtaaacaat gtttttgtag aatctgcgat tggagatttg
     2401 gactgctttg aggcctactg tagtaaagga aataacttca tctaaaaacc aaacgcaagc
     2461 attcacagac aattcttagt gatcattgga ttgaactaac agagctgaac attcctttag
     2521 atggagaagt ttccaaacac actttctgca
//
LOCUS       HUMRSALPD     134 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alphoid repetitive DNA, subclone p340-11A.
ACCESSION   M28034
KEYWORDS    alphoid repetitive sequence; repetitive DNA.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 134)
  AUTHORS   Zaitsev,I.Z. and Rogaev,E.I.
  TITLE     Structural analysis of alphoid DNA of primates
  JOURNAL   Mol. Biol. 20, 530-538 (1986)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt           1      134     alphoid repeat
BASE COUNT       47 a     23 c     27 g     37 t
ORIGIN      
        1 tcacagagtt gaacgatcct ttacacagag cagacttgta acactctttt tgtggaattt
       61 gcaagtggag atttcagccg ctttgaagtc aaaggtagaa aaggaaatat cttcctataa
      121 aaactagaca gaat
//
LOCUS       HUMRSALPE     146 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alphoid repetitive DNA, subclone p340-23A.
ACCESSION   M28035
KEYWORDS    alphoid repetitive sequence; repetitive DNA.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 146)
  AUTHORS   Zaitsev,I.Z. and Rogaev,E.I.
  TITLE     Structural analysis of alphoid DNA of primates
  JOURNAL   Mol. Biol. 20, 530-538 (1986)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt           1      146     alphoid repeat
BASE COUNT       46 a     28 c     26 g     43 t      3 others
ORIGIN      
        1 ttttttccca actcacacga gttaacgatc ctttacacag agcagacttg tannnctctc
       61 tttatggaat ttgcaagtgg agatttcagc cgctttgagg tcaatggtag aaaaggaaat
      121 atcttcctat aaaaactaga cagaat
//
LOCUS       HUMRSALPF     171 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alphoid repetitive DNA, subclone p340-10A.
ACCESSION   M28036
KEYWORDS    alphoid repetitive sequence; repetitive DNA.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 171)
  AUTHORS   Zaitsev,I.Z. and Rogaev,E.I.
  TITLE     Structural analysis of alphoid DNA of primates
  JOURNAL   Mol. Biol. 20, 530-538 (1986)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt           1      171     alphoid repeat
BASE COUNT       53 a     27 c     38 g     53 t
ORIGIN      
        1 aattctcaga gtcttctttg tgatgtgtgc tttcaactca ccgagataaa gatttctctt
       61 gatagagcaa attggaaaca ctctttttgt agaatttgca agggtacatt gagagcgctt
      121 tcaggcctat ggtagaaaag ggaatatctt tccataaaag gtagacagaa g
//
LOCUS       HUMRSALPG     137 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alphoid repetitive DNA, subclone p340-11B.
ACCESSION   M28037
KEYWORDS    alphoid repetitive sequence; repetitive DNA.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 137)
  AUTHORS   Zaitsev,I.Z. and Rogaev,E.I.
  TITLE     Structural analysis of alphoid DNA of primates
  JOURNAL   Mol. Biol. 20, 530-538 (1986)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt           1      137     alphoid repeat
BASE COUNT       33 a     27 c     30 g     47 t
ORIGIN      
        1 gattctcaga aactcctttg tgatgtgtgc gttcaactca cagagtttaa cctttctttt
       61 catagagcag ttaggaaaca ctctgtttgt gaagtctgca agtggatatt cagacctctt
      121 tgaggccttc gttggaa
//
LOCUS       HUMRSALPJ     158 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alphoid repetitive DNA, subclone p340-03B.
ACCESSION   M28040
KEYWORDS    alphoid repetitive sequence; repetitive DNA.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 158)
  AUTHORS   Zaitsev,I.Z. and Rogaev,E.I.
  TITLE     Structural analysis of alphoid DNA of primates
  JOURNAL   Mol. Biol. 20, 530-538 (1986)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt           1      158     alphoid repeat
BASE COUNT       44 a     31 c     40 g     43 t
ORIGIN      
        1 gcaactcaga aactactttg tgatgtgtgc attcaactca ccgagtgcaa ccgagcgcaa
       61 gatagagcag tttggaaaca ttgtgtctgt agaatctgca agtggatata tggaccgctt
      121 tgaggccttc gttggaaacg ggatttcttc ctataaag
//
LOCUS       HUMRSALPK     138 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alphoid repetitive DNA, subclone p340-10B.
ACCESSION   M28039
KEYWORDS    alphoid repetitive sequence; repetitive DNA.
SOURCE      Human DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 138)
  AUTHORS   Zaitsev,I.Z. and Rogaev,E.I.
  TITLE     Structural analysis of alphoid DNA of primates
  JOURNAL   Mol. Biol. 20, 530-538 (1986)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt           1      138     alphoid repeat
BASE COUNT       37 a     27 c     32 g     42 t
ORIGIN      
        1 caatctcaga aactactttg tgatgtgtgc attcaactca ccgagtgcaa cattcctctt
       61 gatagagcag tttggaaaca ttgtgtctgt agaatctgca agtggatata tggaccgctt
      121 tgaggccttc gttggaaa
//
LOCUS       HUMHBGF1     1082 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human heparin-binding growth factor 1 (hbgf-1), exon 1.
ACCESSION   M23017
KEYWORDS    acidic fibroblast growth factor; heparin-binding growth factor 1.
SEGMENT     1 of 3
SOURCE      Human fetal liver DNA, clone ChIK117-1.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1082)
  AUTHORS   Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M.
  TITLE     Cloning of the gene coding for human class 1 heparin-binding growth
            factor and its expression in fetal tissues
  JOURNAL   Mol. Cell. Biol. 9, 2387-2395 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.-M.Chiu, 13-MAR-1989.
FEATURES       from  to/span     description
    pept        602  +   770     heparin-binding growth factor 1, exon 1
                                 /nomgen="FGFA" /map="5q31.3-q33.2"
                                 /hgml_locus_uid="LD0150G"
    pre-msg     568  >  1082     heparin-binding growth factor 1 mRNA and
                                 intron
    IVS         771  >  1082     heparin-binding growth factor 1 intron A
BASE COUNT      258 a    285 c    262 g    277 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttccct taacatacta accctttact ttccctgttg tgtccctgaa aggcctcctg
       61 tgcctttggc tgcaggtccc gaacgtccag gccatctgtg ctatctgctt cgcggtacct
      121 caccaacgca acgtgagggt ggagggcaga accttggtcc tggcctctca gcttttgtgg
      181 gtttcagcca gaccctaggt gttattttag tgcaactttg gtgtttaatt tgaggatgtg
      241 tgtggaccag aaggagggac caaaacatga ttcttttccc catggtcaga tgattaaatt
      301 tgaagttcta aaaaatgcag tttggtccaa agctgtgtcc aattgggaag agagaaaaat
      361 gccctggaaa cccctcccag gcctgggacc atccttcctt aaccaccagc cacctcacag
      421 gcccgcggac tgcgggcatc acctgggcag gctgtgctta ctcactaccc gggaaccctg
      481 tgccctggag ctgtccttcc tctcttcaaa gtgcattttg tgcctttgct ggaagaaccg
      541 actacaggtt tgttcaattt cttacagtct tgaaagcgcc acaagcagca gctgctgagc
      601 catggctgaa ggggaaatca ccaccttcac agccctgacc gagaagttta atctgcctcc
      661 agggaattac aagaagccca aactcctcta ctgtagcaac gggggccact tcctgaggat
      721 ccttccggat ggcacagtgg atgggacaag ggacaggagc gaccagcaca gtaagcccat
      781 ctctatggca ccccccttcc ctttctgaca tcttctgtag tcaaggtggg aggaaggtgc
      841 acatttaagt acaggtactt gcttctccaa ggttctattc aggcatgaca cattcagagg
      901 tggagtcaca taaatgcgta aaatgtctgg gaaatgaaaa tagggacttg tgggggccac
      961 cacttaccca aacgtgtcct atttcaagtt ttttaaagca ctctctgctg acccaacaga
     1021 acgggctgcc ggtgctcaat tgctgtatgt tttcccaggt ttctgtaact agtgaaagat
     1081 ct
//
LOCUS       HUMHBGF2      427 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human heparin-binding growth factor 1 (hbgf-1), exon 2.
ACCESSION   M23086
KEYWORDS    acidic fibroblast growth factor; heparin-binding growth factor 1.
SEGMENT     2 of 3
SOURCE      Human adult liver DNA, clone ChIL208-1.4.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 427)
  AUTHORS   Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M.
  TITLE     Cloning of the gene coding for human class 1 heparin-binding growth
            factor and its expression in fetal tissues
  JOURNAL   Mol. Cell. Biol. 9, 2387-2395 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.-M.Chiu, 13-MAR-1989.
FEATURES       from  to/span     description
    pept    +   186  +   289     heparin-binding growth factor 1, exon 2
                                 /nomgen="FGFA" /map="5q31.3-q33.2"
    pre-msg <     1  >   427     heparin-binding growth factor 1 mRNA and
                                 introns
    IVS     <     1      185     heparin-binding growth factor 1 intron A
    IVS         290  >   427     heparin-binding growth factor 1 intron B
BASE COUNT      106 a     85 c    108 g    128 t
ORIGIN      Unknown number of bp after segment 1.
        1 cagctttctt tggaaggcaa agaaaaaggg actgtatttc tatgttttga ttaatctgag
       61 gctcatcctg agggctccgt gaaatgaatg agcagaattt tccatggcca actgtcctgg
      121 ctgccgggtc ctatcggcaa aagcgtagtg tttatttact tttgctcgtg ttatttttat
      181 tccagttcag ctgcagctca gtgcggaaag cgtgggggag gtgtatataa agagtaccga
      241 gactggccag tacttggcca tggacaccga cgggctttta tacggctcag taagtatgaa
      301 gctgacatgc ttccagacgt tggccaaggt ttgaggtttc cagaaatctt gttacatgga
      361 gtgaggcaaa ctataaagca acaattagtc tctgtttgtt attttttcca gaaggattcc
      421 caccctc
//
LOCUS       HUMHBGF3      664 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human heparin-binding growth factor 1 (hbgf-1), exon 3.
ACCESSION   M23087
KEYWORDS    acidic fibroblast growth factor; heparin-binding growth factor 1.
SEGMENT     3 of 3
SOURCE      Human fetal liver DNA, clone ChIK105a-2.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 664)
  AUTHORS   Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M.
  TITLE     Cloning of the gene coding for human class 1 heparin-binding growth
            factor and its expression in fetal tissues
  JOURNAL   Mol. Cell. Biol. 9, 2387-2395 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.-M.Chiu, 13-MAR-1989.
FEATURES       from  to/span     description
    pept    +   304      498     heparin-binding growth factor 1, exon 3
                                 /nomgen="FGFA" /map="5q31.3-q33.2"
    pre-msg <     1  >   498     heparin-binding growth factor 1 mRNA and
                                 intron
    IVS     <     1      303     heparin-binding growth factor 1 intron A
BASE COUNT      181 a    153 c    142 g    188 t
ORIGIN      Unknown number of bp after segment 2.
        1 tgaggactct tagaagtgct cttatcagta gcatcttaat tactttacaa tggattttaa
       61 atggaaagga agtttacaat aatagcaaat gcatattgac agctctttag tgcccggtgc
      121 tgttctaagt ccttatgact accctgtgaa ataagttcca ccatgacccc aattttcctg
      181 aaaaggagac tgaggcatgg agagctttag tattttgccc aatgtcacac agctagtaaa
      241 tggggacccc catgtgaaac tactcactga ttgtcctact ctcttgtggt tttatctttt
      301 tagcagacac caaatgagga atgtttgttc ctggaaaggc tggaggagaa ccattacaac
      361 acctatatat ccaagaagca tgcagagaag aattggtttg ttggcctcaa gaagaatggg
      421 agctgcaaac gcggtcctcg gactcactat ggccagaaag caatcttgtt tctccccctg
      481 ccagtctctt ctgattaaag agatctgttc tgggtgttga ccactccaga gaagtttcga
      541 ggggtcctca cctggttgac ccaaaaatgt tcccttgacc attggctgcg ctaaccccca
      601 gcccacagag cctgaatttg taagcaactt gcttctaaat gcccagttca cttctttgca
      661 gagc
//
LOCUS       HUMASAT3     1741 bp ds-DNA             PRI       10-AUG-1990
DEFINITION  Human alpha satellite and satellite 3 DNA.
ACCESSION   M21305
KEYWORDS    alpha satellite; satellite 3; satellite DNA.
SOURCE      Human DNA, clone pTR9.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1741)
  AUTHORS   Vissel,B., Earle,E. and Choo,K.H.
  TITLE     A clone containing contiguous satellite 3 and alpha satellite
            sequences, separated by a direct repeat of 9 bp
  JOURNAL   Unpublished (1989) Royal Children's Hospital, Victoria, Australia
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by B.Vissel 29-MAR-1989.
            
            Indirect evidence suggests that this sequence may represent a
            natural junction sequence between alpha satellite and satellite 3
            sequences on chromosome 9.
FEATURES       from  to/span     description
    site          1     1482     human satellite 3 DNA
    site       1501     1741     human alpha satellite DNA
    rpt        1483     1491     direct repeat 1A
    rpt        1492     1500     direct repeat 1B
    signal        5        9     caat box (put.)
    signal       71       77     tata box (put.)
    ORF          38      207     ORF 1 (alt.)
    ORF          43      172     ORF 1 (alt.)
    signal      630      634     caat box (put.)
    signal      718      722     tata box (put.)
    ORF         752     1109     ORF 2 (alt.)
    ORF         757      925     ORF 2 (alt.)
BASE COUNT      661 a    152 c    534 g    394 t
ORIGIN      
        1 gaattcaata gaatggaatg gtatcagctc gagtggaatg gaatggaatg gaatggaatg
       61 gcatggaatc gtataaagtg gaatggaatc aactcgagtg gaatggaatg gaatggaatg
      121 gaatggaatg cagtacaatg caatagaatg gaatggaatg aactcgagtt gactggaatg
      181 gaatggaatg gaatgcattt gaattgaatg gacctgaaga gaatggaatg gaatggaatg
      241 gaatgaagtg gcctcgaatg caatagatta gattgcaacg gaatcatctt gagtggaatg
      301 gagtgaaatg gaacggaatg gaatggaatg gaatggaata gaatggaatg gaatggaatg
      361 gaatagaatg gaatggaatg gaatggaatg aaatgcaata gaatggaatg gtatcaactc
      421 cagtggaatg gaatggaatg caatggaatg gaatggaatg gaatggaatg gaatcaattc
      481 acctggaacg gaatggaatg caatggaatg gaatggaatg gaatgcagtg gaatggaatg
      541 gaatagaatg gaatggaatg gaatggaatg gaatccactt gagtggaatg gaatggaatg
      601 gaatggattc gaacggaatg gaatggaatg caatggaatg gaatgcatgt gattgaatgg
      661 acacgaaaag aattgaatgg aaatgaatgg aatcgaatga agtggcctcg aatgcaatat
      721 aatggaatgg aatcaagtcg agtggaatgg aatggaatgg gttggaatgg aatggaatgg
      781 aatggaatgc attagaatgg aatggaatca actcgagtgg aatggaatgg aatggaatgg
      841 aatggaatgg aatgcaatgg aatggaatgg aatggaatgg aatggaatgg aatggaatgc
      901 aatggaatgg aatggaatgc aatagaatgg aatggaatca cctcgactgg aatggaatgg
      961 aatggaatgc atttgaattg aatggacccg aagagaatgg aatggaatgg aatggaagga
     1021 gtggcctcaa atgcaataga atgcaatgga atcatctcga gtggaatgga gtgaaatgga
     1081 atggaatgga atggaatgga atgctataga atgaaatgga atgaagtcaa gtggaatgga
     1141 atggaatgga atgggttgga agggaatgaa atggaatgca atagaatgga aaggaatgga
     1201 atggaatgaa ctggccaatg caatagaatt gaatggaatc aactggagtg gaatggaatg
     1261 gaatggaatg gattggaatg gaatgcaata gaatggaatg gaatcaactc gagtaaaatg
     1321 gaatggaatg gaatggaatg gaatggaatg gaatggaatt gaatggaatg gaatggaatg
     1381 catttgaatt gaatggacac gaaaagaatg gattggaatg gaatggaatg aagtagcctc
     1441 gagtgcaatg atatggaatg gcatcaactc gagtggaatg gagtgaaaaa ggtgaaaaag
     1501 gaaatacctt cacacaaaaa ctagacagaa gcattctcag aatcttcctt ttgatgtgtg
     1561 cattcaactc acagagttga acctttcttt tgatagagca gttttgaaac actctctttg
     1621 taggatctgc aactggacat ttggagctct ttgataccta cagtgaaaaa ggaaatatct
     1681 tcacataaaa acttcacaga agcattctca aaaacatttt tcagatgtgt gcattgaatt
     1741 c
//
LOCUS       DROLINEJA    5154 bp ds-DNA             INV       10-AUG-1990
DEFINITION  D.melanogaster LINE element J-1.
ACCESSION   M22874
KEYWORDS    LINE element; dispersed repetitive element;
            long interspersed element.
SOURCE      D.melanogaster DNA, clone J-1.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 5154)
  AUTHORS   Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V.
  TITLE     The Drosophila mobile element jockey belongs to LINEs and contains
            coding sequences homologous to some retroviral proteins
  JOURNAL   Gene 70, 253-262 (1988)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    pept    /   414     2165     ORF1, nucleic-acid binding protein (AA at 1)
                                 (put.)
    pept       2162     4912     ORF2, reverse transcriptase (put.)
    rpt         115     5134     jockey dispersed repetitive element
BASE COUNT     1580 a   1226 c   1072 g   1276 t
ORIGIN      
        1 catacccaac ctgaacacgt ttttctcaat atttgtgcca caaaatactt cgccagtaaa
       61 ttaacatgaa gtcgctaaat gcgattacaa ttgacgacgc aatcatatac aaaaaaaaat
      121 cattcacatg ggagatgagc aatcgagtgg acgtgttcac agaagtcgcg agataaaaca
      181 aaaacgtaat tgtgatccat cacaaacatc tgcgcagatc gtgtgcttat ctcacaaaca
      241 aaatctattt ttagtcactg cataacggtg acggcttcgg ttcgcgaaac ttatcagcaa
      301 ctagcaattt ctaagctgtg ttgtttttgc ccctcgccct gcgcgctgcg caagcgggag
      361 gttgttacaa tttaccttac aagtaaaccg gtaaatctta tcgtgtttag taaatatcaa
      421 ttgcattata cggcataagt ataaagacaa ttgatataat ggagaattca tttgctcaat
      481 cgcgacctag caatgggtgc gataaatttg agaaaatgag gaaagtagca ggtgttgagc
      541 caggagaatt acgctcccaa ctccgcgcca gctgtgcagt tgtttcccct aacctggaag
      601 gtatgccaac tcaatctgcg gtctccagct taatggtgac aatcagcagc aacaccaatg
      661 caagtgttac ctgcactatt tctaacgtac aggccaacat gatctgtact cctacataca
      721 ctgattgcac aaccgtgacc actagcattt gcccaactac gccttatgac aatggactgc
      781 cgacacctct gtcatcactg cccaataagc catctaaagc gaattgcccc tttcaagcac
      841 atgatcgtac tgtcaacagg aaacgaaaag gcgtgtctca gcccccatta cctatcctca
      901 ccccttctcc aagccgtaaa actaaaaggc aggccactat gccactcaat gaggaggcct
      961 ctacctccac tgcagcagca ttaaataaca atcgcttcgc gcttttgtcc gctgaagcgg
     1021 agaatatgga gcaagacgtg tcggatgctg attctgacat tgaagactct gctgcccgag
     1081 atggtggtgg acaatccgct aaatatagca aacccccagc catatgcgta ccaagtgtaa
     1141 gcgatccggt caccttggaa cgggctctca atctgagcac cggctcctca aactactaca
     1201 tccgcatttc tagatttggt gtatccagaa tctatacagc caaccctgat gctttccgca
     1261 ccgctgtaaa agaactaaat aagttaaatt gtcaattctg gcatcaccaa cttaaagaag
     1321 aaaaacccta cagagtagtg cttaaaggaa tccatgctaa tgttcctagt tcgcagatag
     1381 aacaagcatt tagtgatcac ggctatgagg tccttaatat ctattgcccc agaaagtctg
     1441 actggaagaa cattcaggta aacgaagatg ataatgaagc tacaaaaaac ttcaaaacta
     1501 gacaaaattt gttttatatt aatcttaaac aaggcccgaa tgttaaagag tctcttaaga
     1561 taactcgact tggcagatac agagtcactg ttgagcgcgc tacacgtaga aaagaactgc
     1621 tacaatgtca aagatgccaa atttttggac actctaagaa ctattgcgcc caggatccta
     1681 tttgtggtaa atgtagtggt ccccatatga ccgggttcgc tttgtgcata agtgacgtat
     1741 gtctgtgtat aaattgtggt ggtgatcatg tctcgacaga caaaagctgc cctgtcagag
     1801 cagagaaagc caagaagcta aaaccaaggt ccaggctacc gatgactaat aatattgcca
     1861 cactcaaacc tccacaacgt tcttcaagcg gttacatacc agctgaggca ttaagaacca
     1921 acatctctta tgctgatatt gctcgacgca acacgactca atctagggct cgtgctactg
     1981 tgcaggctga agttatacca acgtcggaca atagccttaa caataaattt atgacgttag
     2041 acaactccat tcgggccatc aatacgagaa tggacgaact atttaagctt atacacgaaa
     2101 ctgtagaggc taataaagct ttcagagaac tggttcaggt tctaattaca cgtattccta
     2161 aatgactcaa ccaaccttaa aaatcggatt gtggaacgct cgcggattaa caaggggctc
     2221 tgaggagctt cggatattcc tcagcgatca cgatatagac gtaatgctta ccacggaaac
     2281 acacatgcga gttggtcagc gcatctatct cccagggtat cttatgtatc acgcccacca
     2341 ccccagtggt aacagtagag gtggctctgc agtcatcata aaatctagac tttgtcacag
     2401 ccctctgaca cctatctcta ctaatgacag gcagatagcg agagtgcacc tgcaaacatc
     2461 ggttgggacc gtcactgtag ctgctgttta tctacctcca gcagaaagat ggatagtaga
     2521 tgacttcaaa tccatgtttg ctgcgttagg caacaaattt attgctggtg gtgattacaa
     2581 tgccaaacat gcatggtggg ggaacccaag atcctgtcct agaggtaaaa tgttgcaaga
     2641 agtcattgca catgggcaat accaagttct ggctacgggc gaacccactt tctactctta
     2701 caaccctttg ttaacaccat cagcccttga tttttttata acctgtgggt acggcatggg
     2761 caggctagat gtacaaactc tccaggaact ctcgtcggac catcttccta ttctggctgt
     2821 attgcacgct acgccgttaa agaaaccaca acgcgtacga ctacttgccc ataatgctga
     2881 cataaacata ttcaaaaccc atcttgaaca gctgagtgag gtaaatatgc aaattctgga
     2941 ggcggtggac attgataatg ccacaagcct tttcatgagc aaactaagtg aggctgctca
     3001 gcttgctgca ccgagaaatc ggcatgaagt agaggccttc agaccacttc aacttccttc
     3061 cagtatattg gcactgctca ggctaaaacg aagagttcga aaagaatatg ctagaacagg
     3121 tgatccccgc atgcaacaga tccacagtag actggccaac tgcctgcata aggcccttgc
     3181 tcgaagaaag caggcccaaa tagatacctt cttggataac ttgggtgctg acgcgagcac
     3241 aaattactca ctgtggcgta tcacgaaacg gttcaaagct cagcccaccc caaaatcagc
     3301 aatcaaaaat ccgtctggtg gctggtgtcg cactagcttg gaaaaaactg aagtgttcgc
     3361 taacaacctt gagcaacgtt ttacacccta taactatgca ccggaaagtc tctgtcgtca
     3421 ggttgaagaa tacttggaat cgccctttca aatgagcctg cctctgagtg ctgtcacact
     3481 ggaagaagtg aagaatttaa tagccaagct gccacttaag aaagctcctg gagaagatct
     3541 tcttgataat agaaccatta gacttctccc agatcaagca ttgcagttcc ttgccttaat
     3601 attcaacagc gttcttgatg ttggctactt tccgaaagct tggaaatcgg cgagcataat
     3661 tatgatccat aagactggaa aaacaccgac agacgttgac tcgtacaggc ccaccagctt
     3721 actcccatct ctgggtaaaa ttatggagag gctgatccta aacaggctgc tcacatgcaa
     3781 ggatgttacc aaagcgattc ccaaatttca gtttggcttc cggttgcagc acggtactcc
     3841 tgagcaacta catagagtag tgaactttgc tctggaagct atggaaaaca aggagtatgc
     3901 agtaggtgcc tttcttgata ttcaacaggc atttgacaga gtctggcacc ctgggctcct
     3961 gtacaaagcg aagaggctgt tcccgccgca gctatatttg gttgttaaaa gtttcctgga
     4021 agaacgcaca ttccacgtct ctgttgatgg gtacaaatca tcaatcaagc caattgcagc
     4081 tggagttcct caaggaagcg ttcttggccc aaccctatac tcagtttttg cttcggacat
     4141 gcctactcac acaccagtca cagaggtaga cgaagaagat gtgctcatag ccacctacgc
     4201 tgacgatact gctgtgctca cgaaaagtaa aagtatcctg gctgccactt ctggtctaca
     4261 ggaatacctg gatgcattcc agcaatgggc tgagaactgg aatgtgcgca tcaacgctga
     4321 gaagtgtgcc aatgtgacgt tcgccaaccg aacaggtagc tgtccgggtg tcagtctgaa
     4381 tggaagactg atcagacacc atcaggctta taaatacctt ggtattaccc tcgataggaa
     4441 gctcaccttc agcaggcaca tcacaaatat tcagcaagcg ttcaggacca aggttgctcg
     4501 gatgtcttgg ctcattgcac cacgcaacaa actgtcgctt ggctgcaagg tcaatattta
     4561 caagtccata ttggccccct gcctgttcta cggcctgcag gtatacggca ttgctgcgaa
     4621 gagtcacctt aataagatcc ggattttaca ggcgaagacc ttaagaagaa tttcgggggc
     4681 tccttggtat atgagaacaa gagacatcga acgcgacctc aaggtgccca aattaggaga
     4741 caagctccag aacatcgccc aaaaatatat ggaaaggctt aatgtacacc ccaacagcct
     4801 agcaaggaag ctaggaactg cagctgtggt caatgctgac cctcggacta gagtcaaaag
     4861 aagactcaag cgacaccacc ctcatgacct ccctaacctg gttttgacct agaaagtctt
     4921 agttttaaaa ttcattagaa taatcaaata aataataatt actatgttat atcaactatt
     4981 ataattctcc ctatcatttt tagattaaaa atctgttagt cttaagtaac caagacacat
     5041 tgtaaaataa aataatttaa gcagatcaaa ttaagttgcc gcatgggtaa cagtgcgttg
     5101 atcaaataat aaaaacatca taaaaaaaaa aaaaatatac aaaattaatg atgg
//
LOCUS       DROLINEJB     490 bp ds-DNA             INV       10-AUG-1990
DEFINITION  D.melanogaster LINE element J-1.
ACCESSION   M22875
KEYWORDS    LINE element; dispersed repetitive element;
            long interspersed element.
SOURCE      D.melanogaster DNA, clone J-0.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 490)
  AUTHORS   Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V.
  TITLE     The Drosophila mobile element jockey belongs to LINEs and contains
            coding sequences homologous to some retroviral proteins
  JOURNAL   Gene 70, 253-262 (1988)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt     <     1      490     jockey dispersed repetitive element
BASE COUNT      218 a     64 c     51 g    157 t
ORIGIN      
        1 tttgttatta aaattcatta gaataatcaa ataaataata attactatgt tatatcaact
       61 attataattc tccctatcat ttttaggatt aaaaatctgt tagtcttaag taaccaagac
      121 acattgtaaa ataaaataat ttaagcagat caaattaagt tgccgcatgg gtaacagtgc
      181 gttgatcaaa taataaaaac atcatcatat aaagacctcc ctaacctggt tttgacctag
      241 aaagtcttag ttttaaaatt cattagaata atcaaataaa taataattac tatgttatat
      301 caactattat aattctccct atcattttta ggattaaaaa tctgttagtc ttaagtaacc
      361 aagacacatt gtaaaataaa ataatttaag cagatcaaat taagttgccg catgggtaac
      421 agtgcgttga tcaaataata aaaacatcat cgtatataat taaaaaaaaa aaaaaaaaaa
      481 aaaaaaaaaa
//
LOCUS       DROLINEJC     367 bp ds-DNA             INV       10-AUG-1990
DEFINITION  D.melanogaster LINE element J-1.
ACCESSION   M22876
KEYWORDS    LINE element; dispersed repetitive element;
            long interspersed element.
SOURCE      D.melanogaster DNA, clone J-4.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 367)
  AUTHORS   Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V.
  TITLE     The Drosophila mobile element jockey belongs to LINEs and contains
            coding sequences homologous to some retroviral proteins
  JOURNAL   Gene 70, 253-262 (1988)
  STANDARD  simple staff_entry
COMMENT     
FEATURES       from  to/span     description
    rpt     <     1      367     jockey dispersed repetitive element
BASE COUNT      135 a     76 c     53 g    103 t
ORIGIN      
        1 ctatctcccg atctctattg tacaccccaa cagcctagca aggaagctag gaactgcagc
       61 tgtggtcaat gctagccctc agcgctagag tcaaaagaag actcaagcga caccaccctc
      121 atgacctccc taacctggtt ttgacctaga aagtcttagt tttaaaattc attagaataa
      181 tcaaataaat aataattact atgttatatc aactattata attctcccta tcatttttag
      241 gattaaaaat ctgttagtct taagtaacca agacacattg taaaataaaa taatttaagc
      301 agatcaaatt aagttgccgc atgggtaaca gtgcgttgat caaataataa aaacatcatc
      361 gtaaaaa
//
LOCUS       HUMPOVRA     1254 bp ss-mRNA            PRI       10-AUG-1990
DEFINITION  Human poliovirus receptor mRNA, clone H20A.
ACCESSION   M24407
KEYWORDS    oncogene; poliovirus receptor; transmembrane protein;
            tyrosine kinase; viral receptor.
SOURCE      Human Hela cell cDNA to mRNA, clone H20A.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1254)
  AUTHORS   Mendelsohn,C.L., Wimmer,E. and Racaniello,V.R.
  TITLE     Cellular receptor for poliovirus: Molecular cloning, nucleotide
            sequence, and expression of a new member of the immunoglobulin
            superfamily
  JOURNAL   Cell 56, 855-865 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1254; revises [1])
  AUTHORS   Racaniello,V.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence kindly provided
            by V.Racaniello, 27-APR-1989.
FEATURES       from  to/span     description
    pept          1     1254     poliovirus receptor
    revision   1020     1020     a in [1]; g in [1]
BASE COUNT      255 a    389 c    362 g    248 t
ORIGIN      
        1 atggcccgag ccatggccgc cgcgtggccg ctgctgctgg tggcgctact ggtgctgtcc
       61 tggccacccc caggaaccgg ggacgtcgtc gtgcaggcgc ccacccaggt gcccggcttc
      121 ttgggcgact ccgtgacgct gccctgctac ctacaggtgc ccaacatgga ggtgacgcat
      181 gtgtcacagc tgacttgggc gcggcatggt gaatctggca gcatggccgt cttccaccaa
      241 acgcagggcc ccagctattc ggagtccaaa cggctggaat tcgtggcagc cagactgggc
      301 gcggagctgc ggaatgcctc gctgaggatg ttcgggttgc gcgtagagga tgaaggcaac
      361 tacacctgcc tgttcgtcac gttcccgcag ggcagcagga gcgtggatat ctggctccga
      421 gtgcttgcca agccccagaa cacagctgag gttcagaagg tccagctcac tggagagcca
      481 gtgcccatgg cccgctgcgt ctccacaggg ggtcgcccgc cagcccaaat cacctggcac
      541 tcagacctgg gcgggatgcc caatacgagc caggtgccag ggttcctgtc tggcacagtc
      601 actgtcacca gcctctggat attggtgccc tcaagccagg tggacggcaa gaatgtgacc
      661 tgcaaggtgg agcacgagag ctttgagaag cctcagctgc tgactgtgaa cctcaccgtg
      721 tactaccccc cagaggtatc catctctggc tatgataaca actggtacct tggccagaat
      781 gaggccaccc tgacctgcga tgctcgcagc aacccagagc ccacaggcta taattggagc
      841 acgaccatgg gtcccctgcc accctttgct gtggcccagg gcgcccagct cctgatccgt
      901 cctgtggaca aaccaatcaa cacaacttta atctgcaacg tcaccaatgc cctaggagct
      961 cgccaggcag aactgaccgt ccaggtcaaa gagggacctc ccagtgagca ctcaggcata
     1021 tcccgtaacg ccatcatctt cctggttctg ggaatcctgg tttttctgat cctgctgggg
     1081 atcgggattt atttctattg gtccaaatgt tcccgtgagg tcctttggca ctgtcatctg
     1141 tgtccctcga gtacagagca tgccagcgcc tcagctaatg ggcatgtctc ctattcagct
     1201 gtgagcagag agaacagctc ttcccaggat ccacagacag agggcacaag gtga
//
LOCUS       HUMPOVRB     1179 bp ss-mRNA            PRI       10-AUG-1990
DEFINITION  Human poliovirus receptor mRNA, clone H20B.
ACCESSION   M24406
KEYWORDS    oncogene; poliovirus receptor; transmembrane protein;
            tyrosine kinase; viral receptor.
SOURCE      Human Hela cell cDNA to mRNA, clone H20B.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1179)
  AUTHORS   Mendelsohn,C.L., Wimmer,E. and Racaniello,V.R.
  TITLE     Cellular receptor for poliovirus: Molecular cloning, nucleotide
            sequence, and expression of a new member of the immunoglobulin
            superfamily
  JOURNAL   Cell 56, 855-865 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1179; revises [1])
  AUTHORS   Racaniello,V.R.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence kindly provided
            by V.Racaniello, 27-APR-1989.
FEATURES       from  to/span     description
    pept          1     1179     poliovirus receptor
    revision   1020     1020     a in [1]; g in [1]
BASE COUNT      234 a    368 c    341 g    236 t
ORIGIN      
        1 atggcccgag ccatggccgc cgcgtggccg ctgctgctgg tggcgctact ggtgctgtcc
       61 tggccacccc caggaaccgg ggacgtcgtc gtgcaggcgc ccacccaggt gcccggcttc
      121 ttgggcgact ccgtgacgct gccctgctac ctacaggtgc ccaacatgga ggtgacgcat
      181 gtgtcacagc tgacttgggc gcggcatggt gaatctggca gcatggccgt cttccaccaa
      241 acgcagggcc ccagctattc ggagtccaaa cggctggaat tcgtggcagc cagactgggc
      301 gcggagctgc ggaatgcctc gctgaggatg ttcgggttgc gcgtagagga tgaaggcaac
      361 tacacctgcc tgttcgtcac gttcccgcag ggcagcagga gcgtggatat ctggctccga
      421 gtgcttgcca agccccagaa cacagctgag gttcagaagg tccagctcac tggagagcca
      481 gtgcccatgg cccgctgcgt ctccacaggg ggtcgcccgc cagcccaaat cacctggcac
      541 tcagacctgg gcgggatgcc caatacgagc caggtgccag ggttcctgtc tggcacagtc
      601 actgtcacca gcctctggat attggtgccc tcaagccagg tggacggcaa gaatgtgacc
      661 tgcaaggtgg agcacgagag ctttgagaag cctcagctgc tgactgtgaa cctcaccgtg
      721 tactaccccc cagaggtatc catctctggc tatgataaca actggtacct tggccagaat
      781 gaggccaccc tgacctgcga tgctcgcagc aacccagagc ccacaggcta taattggagc
      841 acgaccatgg gtcccctgcc accctttgct gtggcccagg gcgcccagct cctgatccgt
      901 cctgtggaca aaccaatcaa cacaacttta atctgcaacg tcaccaatgc cctaggagct
      961 cgccaggcag aactgaccgt ccaggtcaaa gagggacctc ccagtgagca ctcaggcata
     1021 tcccgtaacg ccatcatctt cctggttctg ggaatcctgg tttttctgat cctgctgggg
     1081 atcgggattt atttctattg gtccaaatgt tcccgtgagg tcctttggca ctgtcatctg
     1141 tgtccctcga gtgagcatca ccagagctgc cgtaattga
//
LOCUS       MUSSNAP25    2040 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Mouse SNAP-25 mRNA encoding synaptosomal associated protein 25,
            complete cds.
ACCESSION   M22012
KEYWORDS    synaptosomal associated protein 25.
SOURCE      Mouse (strain BALB/c) adult neural brain cell, cDNA to mRNA, clones
            pMuBr8, p8.1, p8.51, p8.52, p8.53, and p8.71.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2040)
  AUTHORS   Oyler,G.A., Higgins,G.A., Hart,R.A., Battenberg,E., Billingsley,M.,
            Bloom,F.E. and Wilson,M.C.
  TITLE     The identification of a novel synaptosomal associated protein,
            SNAP25, differentially expressed by neuronal subpopulations
  JOURNAL   J. Cell Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            M.Wilson 18-JAN-1989
FEATURES       from  to/span     description
    pept        164      784     synaptosomal associated protein 25
    mRNA    <     1     2037     synaptosomal associated protein 25 mRNA
BASE COUNT      613 a    462 c    430 g    535 t
ORIGIN      Chromosome 2.
        1 cccgaggttt ggagctgtct ttccttccct ccctacccgg cggctcctcc actcttgcta
       61 cctgcaggga tcagcggaca gcatcctctg aagaagacaa ggttccttaa ctaagcacca
      121 ctgacttgct ggccccggcg cccagcaacc ccccaccact accatggccg aggacgcaga
      181 catgcgtaat gaactggagg agatgcagag gagggctgac cagctggctg atgagtccct
      241 ggaaagcacc cgtcgcatgc tgcagctggt cgaagagagt aaagatgctg gcatcaggac
      301 tttggttatg ttggatgagc aaggcgaaca actggaacgc attgaggaag ggatggacca
      361 aatcaataag gatatgaaag aagcagaaaa gaatttgacg gacctaggaa aattctgcgg
      421 gctttgtgtg tgtccctgta acaagcttaa atccagtgat gcttacaaaa aagcctgggg
      481 caataatcag gatggagtag tggccagcca gcctgcccgt gtggtggatg aacgggagca
      541 gatggccatc agtggtggct tcatccgcag ggtaacaaac gatgcccggg aaaatgaaat
      601 ggatgaaaac ctagagcagg tgagcggcat catcggaaac ctccgtcata tggccctaga
      661 catgggcaat gagattgaca cccagaatcg ccagattgac aggatcatgg agaaggctga
      721 ctccaacaaa accagaattg atgaagccaa ccaacgtgca acaaagatgc tgggaagtgg
      781 ttaaatctgc cgttctgctg tgctgtcctc caatgttgtt ggacaagaga gaagagagct
      841 ccttcatgct tctctcatgg tattacctag taagacttac acacacacac acacacacac
      901 acacacacac acacacacac acacacacac agagtagtca cccccattgt aaatgtctgt
      961 gtggtttgtc agcttcccaa tgataccatg tgtcttttgt tttctccggc tctctttctt
     1021 tgccaaaggt tgtacatagt ggtcatctgg tgactctatt tcctgactta agagttcttg
     1081 ggtctctctc tttcttttct cagtggcgtt tgctgaatga caacaattta ggaatgctca
     1141 atgtactgtt gatttttctc aatacacagt attgttcttg taaaactgtg acttaccaca
     1201 gagctactac cacagtcctt tcttagggtg tcaggctctg aatctctcca aatgtgctct
     1261 ctttggttcc tcagtgctat tctttgtctt tatgatttca taattagaca atgtgaaatt
     1321 acataacagg cattgcacta aaagtgatgt gatttatgca tttatgcatg agaactaaat
     1381 agacttttag atcctactta aacaaaaact tccatgacag tagcatactg acaagaaaac
     1441 acacacaaca gcaacaataa caaagcaaca actacgcatg ctcagcattg ggacactgtc
     1501 aagattaagt cataccagca aaacctgcag ctgtgtcacc ttcttctgtc aacatacaga
     1561 ctgatcataa tgatcccttc tttacacaca cacacacaca cacacacaca cacacacaca
     1621 caaatggaat ttaaccaact tcccagaatt gatgaagcaa atatatgttt ggctgaaact
     1681 attgtaaatg ggtgtaatat agggtttgtc gaatgctttt gaaagctctg ttttccagac
     1741 aatactcttg tgtggaaaac gtgaagatct tctaagtctg gctcttgtga tcaccaaacc
     1801 ctggtgcatc agtacaacac tttgcgctaa tctagagcta tgcacaacca aattgctgag
     1861 atgtttagta gctgataaag aaacctttaa aaaattatat aaatgaatga aatatagata
     1921 aactgtgaga taaatatcat tacagcatgt atattaaatc cctcctgtct cctctgttgg
     1981 tttgtgaagt gatttgacat tttgtagcta gtttaaaatt attaaaaatt atagatgtta
//
LOCUS       CHKNCAMC1      26 bp ss-mRNA            VRT       10-AUG-1990
DEFINITION  Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12.
ACCESSION   M24014 J04140
KEYWORDS    neural cell adhesion molecule.
SEGMENT     1 of 6
SOURCE      Chicken embryonic day 10 cardiac muscle cDNA to mRNA, clone
            pEC101B.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 7 to 18)
  AUTHORS   Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
  TITLE     Four exons encode a 93-base-pair insert in three neural cell
            adhesion molecule mRNAs specific for chicken heart and skeletal
            muscle
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 26)
  AUTHORS   Prediger,E.A.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
FEATURES       from  to/span     description
    pept    <     1  +    18     neural cell adhesion molecule, exon 12 (AA at
                                 1)
    IVS          19  >    26     intron
BASE COUNT       10 a      6 c      5 g      5 t
ORIGIN      
        1 tcaagacaca gccagttcgt aagtaa
//
LOCUS       CHKNCAMC2      29 bp ds-DNA             VRT       10-AUG-1990
DEFINITION  Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12A.
ACCESSION   M23991 J04140
KEYWORDS    neural cell adhesion molecule.
SEGMENT     2 of 6
SOURCE      Chicken embryonic day 10 cardiac muscle DNA, clone 6B1, and cDNA to
            mRNA, clone pEC101B.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 7 to 21)
  AUTHORS   Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
  TITLE     Four exons encode a 93-base-pair insert in three neural cell
            adhesion molecule mRNAs specific for chicken heart and skeletal
            muscle
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 6; 22 to 29)
  AUTHORS   Prediger,E.A.
  JOURNAL   Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021
  STANDARD  full staff_entry
FEATURES       from  to/span     description
    pept    +     7  +    21     neural cell adhesion molecule exon 12A
    IVS     <     1        6     N-CAM intron
    IVS          22  >    29     N-CAM intron
BASE COUNT        6 a     10 c      6 g      7 t
ORIGIN      
        1 ctgcaggtat ccctcactca cgtaagtgc
//
LOCUS       CHKNCAMC3      54 bp ds-DNA             VRT       10-AUG-1990
DEFINITION  Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12B.
ACCESSION   M23992 J04140
KEYWORDS    neural cell adhesion molecule.
SEGMENT     3 of 6
SOURCE      Chicken embryonic day 10 cardiac muscle DNA, clone 12AC3, and cDNA
            to mRNA, clone pEC101B.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 13 to 46)
  AUTHORS   Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
  TITLE     Four exons encode a 93-base-pair insert in three neural cell
            adhesion molecule mRNAs specific for chicken heart and skeletal
            muscle
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 12; 47 to 54)
  AUTHORS   Prediger,E.A.
  JOURNAL   Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021
  STANDARD  full staff_entry
FEATURES       from  to/span     description
    pept    +    13  +    46     neural cell adhesion molecule exon 12B
    IVS     <     1       12     N-CAM intron
    IVS          47  >    54     N-CAM intron
BASE COUNT       11 a     13 c     12 g     18 t
ORIGIN      
        1 ttctttcttt agcaagtgct gctgccccca ccatgtttag cagagagtga gtat
//
LOCUS       CHKNCAMC4      62 bp ds-DNA             VRT       10-AUG-1990
DEFINITION  Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12C.
ACCESSION   M23993 J04140
KEYWORDS    neural cell adhesion molecule.
SEGMENT     4 of 6
SOURCE      Chicken embryonic day 10 cardiac muscle DNA, clone 7C1, and cDNA to
            mRNA, clone pEC101B.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 13 to 54)
  AUTHORS   Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
  TITLE     Four exons encode a 93-base-pair insert in three neural cell
            adhesion molecule mRNAs specific for chicken heart and skeletal
            muscle
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 12; 55 to 62)
  AUTHORS   Prediger,E.A.
  JOURNAL   Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021
  STANDARD  full staff_entry
FEATURES       from  to/span     description
    pept    +    13  +    54     neural cell adhesion molecule exon 12C
    IVS     <     1       12     N-CAM intron
    IVS          55  >    62     N-CAM intron
BASE COUNT       18 a     20 c     12 g     12 t
ORIGIN      
        1 atgtttccat agcaactcag cctctccctg catccgaaag cacagagcca ccaagtgagt
       61 ag
//
LOCUS       CHKNCAMC5       3 bp ss-mRNA            VRT       10-AUG-1990
DEFINITION  Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12D.
ACCESSION   M23994 J04140
KEYWORDS    neural cell adhesion molecule.
SEGMENT     5 of 6
SOURCE      Chicken embryonic day 10 cardiac muscle cDNA to mRNA clone pEC101B.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 3)
  AUTHORS   Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
  TITLE     Four exons encode a 93-base-pair insert in three neural cell
            adhesion molecule mRNAs specific for chicken heart and skeletal
            muscle
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
  STANDARD  full staff_entry
COMMENT     Exon 12D represents a very small exon.
FEATURES       from  to/span     description
    pept    +     1  +     3     neural cell adhesion molecule, exon 12D
BASE COUNT        2 a      0 c      1 g      0 t
ORIGIN      
        1 aag
//
LOCUS       CHKNCAMC6      33 bp ss-mRNA            VRT       10-AUG-1990
DEFINITION  Chicken cardiac neural cell adhesion (N-CAM) gene, exon 13.
ACCESSION   M37649 J04140
KEYWORDS    neural cell adhesion molecule.
SEGMENT     6 of 6
SOURCE      Chicken embryonic day 10 cardiac muscle cDNA to mRNA clone pEC101B.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 33)
  AUTHORS   Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A.
  TITLE     Four exons encode a 93-base-pair insert in three neural cell
            adhesion molecule mRNAs specific for chicken heart and skeletal
            muscle
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 33)
  AUTHORS   Prediger,E.A.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     
FEATURES       from  to/span     description
    pept    +    13  >    33     neural cell adhesion molecule, exon 13
    IVS     <     1       12     intron
BASE COUNT        9 a     11 c      7 g      6 t
ORIGIN      
        1 ccattgtttc aggggaaccc agtgcaccca aac
//
LOCUS       BADLDH       1147 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  B.caldolyticus lactate dehydrogenase (LDH) gene, complete cds.
ACCESSION   M19394 M19386
KEYWORDS    lactate dehydrogenase.
SOURCE      B.caldolyticus (strain DSM 405) DNA, clone pLDHY1.
  ORGANISM  Bacillus caldolyticus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1147)
  AUTHORS   Zuelli,F., Weber,H. and Zuber,H.
  TITLE     Nucleotide sequences of lactate dehydrogenase genes from the
            thermophilic bacteria Bacillus stearothermophilus, B. caldolyticus
            and B.caldotenax
  JOURNAL   Biol. Chem. Hoppe-Seyler 368, 1167-1177 (1987)
  STANDARD  full staff_entry
COMMENT     Draft entry and  computer-readable sequence for [1] kindly provided
            by F.Zuelli, 27-MAY-1988.
FEATURES       from  to/span     description
    pept        146     1099     lactate dehydrogenase (AC 1.1.1.27)
BASE COUNT      282 a    275 c    321 g    269 t
ORIGIN      301 bp upstream of NcoI site.
        1 tgctcccgtt gtttgccgcg gcgtctgtca tgaaatggac aaacaatagt caaacaatcg
       61 ccacaatgcc gcatgcatcg cggtgcgcct ttcgcgtaaa atagtttgtg aaagtgttca
      121 cattgtattg agggggatga atgcaatgaa aaacaacggt ggaacccgag tagtggtcat
      181 cggcaccggg tttgtcggcg ccagttatgc gtttgcctta atgaatcaag ggattgccga
      241 tgagatcgtg ctcatcgatg cgaatgaaag caaggccata ggcgatgcga tggacttcaa
      301 ccatgggaaa gtatttgcgc cgaagccggc tgacatttgg cacggcgatt acgatgattg
      361 ccgcgatgcc gatttggttg tcatttgcgc cggcgccaac caaaaaccgg gcgagacgcg
      421 gcttgatctt gtggacaaaa acattgccat tttccgctcg atcgttgagt cggtcatggc
      481 atccggattt caaggactgt ttctcgtcgc caccaatccg gtcgacattt taacgtacgc
      541 gacgtggaaa ttcagcggcc tgccgcatga gcgggtgatc ggttcgggga cgattttaga
      601 tacggcgcgg ttccgctttt tgttgggcga gtatttctct gtcgctccgc aaaatgttca
      661 tgcctatatt attggggaac acggcgacac tgaactcccg gtctggagcc aggctgatat
      721 cggcggcgtg ccgatccgca agctggtcga gtccaaaggg gaagaagcgc aaaaagagct
      781 cgagcgcatt tttgtcaatg tgcgcgatgc cgcctaccaa attattgaga aaaaaggagc
      841 gacgtactac ggaattgcga tgggacttgc ccgcgtgacg cgcgccattt tgcataacga
      901 aaacgctatt ttgaccgtat cggcctacct cgatggccca tatggggagc gcgacgtcta
      961 catcggcgtg ccggctgtca ttaaccgcaa tggcatccgc gaggtgatcg aaattgaatt
     1021 gaatgatgac gaaaaaaatc gattccatca tagcgcagcc acattaaaaa gcgtgctagc
     1081 ccgttttttt acgcgatgag gcaacaaacg aaaagttgtg ccaacagtct ttctcgtcca
     1141 aacaggg
//
LOCUS       BMELDHA      1423 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  B.megaterium L-lactate dehydrogenase gene.
ACCESSION   M22305
KEYWORDS    L-lactate dehydrogenase.
SOURCE      B.megaterium (strain DSM 090) DNA, clone pMH/pML.
  ORGANISM  Bacillus megaterium
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1423)
  AUTHORS   Waldvogel,S., Weber,H. and Zuber,H.
  TITLE     Nucleotide sequence of the L-lactate dehydrogenase gene from the
            mesophilic bacterium B.megaterium. Preparation and properties of a
            hybrid lactate dehydrogenase comprising moieties of the B.
            megaterium and B.stearothermophilus enzymes
  JOURNAL   Biol. Chem. Hoppe-Seyler 368, 1391-1399 (1987)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1423; revises [1])
  AUTHORS   Suter,F.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            submitted by S.Waldvogel 02-OCT-1987, and [2] by F.Suter
            16-MAY-1989.
            
               Author address [2]
               F.Suter
               ETH-Hoenggerberg
               Inst. f. Molekularbiologie und Biophysik
               CH 8093 ZUERICH
               Switzerland
FEATURES       from  to/span     description
    pept        250     1206     L-lactate dehydrogenase (AA at 1)
BASE COUNT      449 a    263 c    277 g    434 t
ORIGIN      
        1 taacatacaa atagagaatg ttaaactctt tagctttctg caaaacaagt tgcctttact
       61 caacacttta aacgtttaag atctttctaa atgtggcact tcatgtagaa ggacattttt
      121 ttaaatcgcc ctgtaaaatt gtttactatt ttttaaaggg tgtgattttt atcacagctg
      181 aagctattta ttcttgttac actaacattg tgaaaaacat cacaaaataa aattcaaagg
      241 atgatacaaa tgaaaacaca atttacacca aaaacacgaa aagttgccgt tatcggaact
      301 ggttttgttg gctcaagcta cgctttttca atggtgaatc aaggtattgc caatgaatta
      361 gtgttaatcg atatgaacaa agaaaaagca gaaggtgaag cacgtgatat caatcatgga
      421 atgccatttg ccacaccgat gaaaatctgg gctggagatt ataaagactg tgctgacgct
      481 gatttagcag ttattacagc gggcgctaat caagctccag gggaaacacg cttagatcta
      541 gttgaaaaaa acgttaaaat tttcgaatgc attgtaaaag atattatgaa cagcggattt
      601 gacggcatca ttttagtggc aacaaatcca gttgatattc tcgcacacgt tacacaaaaa
      661 gtatcaggat taccaaacgg acgggtaatt ggttcaggaa cgattcttga cacagctcgc
      721 ttccgctact tgttaagcga ctatttcgaa gtagattctc gcaacgtcca cgcttatatt
      781 atgggggaac atggagatac ggaatttcct gtttggagcc acgcgcaaat tggcggtgtg
      841 aagctcgaac attttatcaa tactgccgct attgaaaaag aaccggatat gcagcatcta
      901 ttcgaacaaa cccgcgatgc ggcttaccat attattaatc gaaaaggagc gacttattac
      961 ggaattgcaa tggggcttgt acgcattacc aaggctattt tagatgatga aaattctatt
     1021 ttaacagtat ctgctttatt agaaggacaa tacggtattt ctgatgtgta tatcggcgta
     1081 ccagctatca ttaataaaaa cggcgtgcgt caaattattg aattgaattt aactcctcac
     1141 gaacagcagc agctcgagca ctctgctagc attcttaagc aaactcgcga cagagctttt
     1201 gtgtaacatc taaagatttt tgcgggggac tcccccgcac cttgttttaa atatgtagta
     1261 gaggtgtatg aatatgacgt ggactcaggt atataatcct ttagataata tttggctttc
     1321 tgcactaatt gcactcattc ccattatctt tttctttatt gctttaactc ttttgaaatt
     1381 aaaaggacac attgcttgcc ggtattacgg tgcttctttc tat
//
LOCUS       BSTLDH       1150 bp ds-DNA             BCT       10-AUG-1990
DEFINITION  B.stearothermophilus lactate dehydrogenase (LDH) gene, complete
            cds.
ACCESSION   M19396 M19386
KEYWORDS    lactate dehydrogenase.
SOURCE      B.stearothermophilus (strain NCIB 8924) DNA, clone pLDHY1.
  ORGANISM  Bacillus stearothermophilus
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1150)
  AUTHORS   Zuelli,F., Weber,H. and Zuber,H.
  TITLE     Nucleotide sequences of lactate dehydrogenase genes from the
            thermophilic bacteria Bacillus stearothermophilus, B. caldolyticus
            and B. caldotenax
  JOURNAL   Biol. Chem. Hoppe-Seyler 368, 1167-1177 (1987)
  STANDARD  full staff_entry
COMMENT     Draft entry and  computer-readable sequence for [1] kindly provided
            by F.Zuelli, 27-MAY-1988.
FEATURES       from  to/span     description
    pept        149     1102     lactate dehydrogenase (EC 1.1.1.27)
BASE COUNT      283 a    270 c    321 g    276 t
ORIGIN      304 bp upstream of NcoI site.
        1 gtgctcccgt tgtttgccgc ggcgtctgtc atgaaatgga caaacaatag tcaaacaatc
       61 gccacaatcg cgcatgcatt gcggtgcgcc tttcgcgtaa aatatttata tgaaagtgtt
      121 cgcattatat tgagggagga tgaatgcaat gaaaaacaac ggtggagccc gagtagtggt
      181 catcggcgcc gggtttgtcg gcgccagtta tgtgtttgcc ttaatgaatc aagggattgc
      241 cgatgagatc gtgctcatcg atgcgaatga aagcaaggcc ataggcgatg cgatggactt
      301 caaccatggg aaagtatttg cgccgaagcc ggttgacatt tggcacggcg attacgatga
      361 ttgccgcgat gccgatttgg ttgtcatttg cgccggcgcc aaccaaaaac cgggcgagac
      421 gcggcttgat cttgtggaca aaaacattgc cattttccgc tcgatcgttg agtcggtcat
      481 ggcatccgga tttcaaggac tgtttctcgt cgccaccaat ccggtcgaca ttttaacgta
      541 cgcgacgtgg aaattcagcg gcctgccgca tgagcgggtg atcggttcgg ggacgatttt
      601 agatacggcg cggttccgct ttttgttggg cgagtatttc tctgtcgctc cgcaaaatgt
      661 tcatgcctat attattgggg aacacggcga cactgaactc ccggtctgga gccaggctta
      721 tatcggcgtc atgccgatcc gcaagctggt cgagtccaaa ggggaagaag cgcaaaaaga
      781 tctcgagcgc atttttgtca atgtgcgcga tgccgcctac caaattattg agaaaaaagg
      841 agcgacgtac tacgggattg cgatggggct tgcccgcgtg acgcgcgcca ttttgcataa
      901 cgaaaacgct attttgaccg tatcagccta cctcgatggc ctatatgggg agcgcgacgt
      961 ctacatcgga gtgccggctg tcattaaccg caatggcatc cgcgaggtga tcgaaattga
     1021 attgaatgat gacgaaaaaa atcgattcca tcatagcgca gctacattaa aaagcgtgct
     1081 agcccgtgct tttacgcgat gaggcaacaa acgaaaagtt gtgccgacag tctttctcgt
     1141 ccaaacaggg
//
LOCUS       MUSIGVDJA     351 bp ss-mRNA            ROD       10-AUG-1990
DEFINITION  Mouse Ig active H-chain mRNA variable region (V186-2 DFL16.1 J2)
            (AA at 1) anti-NP hybridoma, clone H1-7.
ACCESSION   M18548
KEYWORDS    immunoglobulin; immunoglobulin heavy chain; joining exon;
            rearranged gene; variable region.
SOURCE      Mouse (strain C57BL/6J) anti-NP hybridoma mRNA, clone H1-7.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 351)
  AUTHORS   Blier,P.R. and Bothwell,A.
  TITLE     A limited number of B cell lineages generates the heterogeneity of
            a secondary immune response
  JOURNAL   J. Immunol. 139, 3996-4006 (1987)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            A.Bothwell 03-MAY-1988.
FEATURES       from  to/span     description
    ORF     <     1  >   351     Ig heavy-chain V-region (V-D-J2) (AA at 1)
    recomb      293      294     V(186-2) region end/ D(FL16.1) start
    recomb      308      309     D(FL16.1) region end/J2 start
BASE COUNT       84 a     92 c     89 g     79 t      7 others
ORIGIN      Unreported.
        1 caggtcaaac tgcagcagcc tggggctgag cttgtgaagc ctggggcttc agnnaagctg
       61 tcctgcaagg cttctggcta caccttcacc acctncttga tncactgggt gaaacagagg
      121 cctggacgag gccttgagtg gattggaagg attgatccta atggtggtgg tactaagtac
      181 gatgnaagtt caagagcaag gccacactga ctgtagacga accctccagc acagcctaca
      241 tgcagctcag cagcctgaca tctgaggact ctgcggncta ttattgtgca acctattact
      301 acggtctcta ctttaactac tgggnaaggc accactctca cagtctcctc a
//

GenBank-Updates@genbank.bio.net (08/13/90)

LOCUS       MUSCZSOD      535 bp ss-mRNA            ROD       13-AUG-1990
DEFINITION  Mouse Cu-Zn superoxide dismutase mRNA, complete cds.
ACCESSION   M35725
KEYWORDS    Cu-Zn superoxide dismutase.
SOURCE      Mouse SWR/J liver, cDNA to mRNA, clone pS61-10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 535)
  AUTHORS   Bewley,G.C.
  TITLE     cDNA and deduced amino acid sequence of murine Cu-Zn superoxide
            dismutase
  JOURNAL   Nucleic Acids Res. 16, 2728-2728 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          5      469     Cu-Zn superoxide dismutase (EC 1.15.11)
    mRNA    <     1      469     Cu-Zn superoxide dismutase mRNA
BASE COUNT      148 a    112 c    159 g    116 t
ORIGIN      
        1 aagcatggcg atgaaagcgg tgtgcgtgct gaagggcgac ggtccggtgc agggaaccat
       61 ccacttcgag cagaaggcaa gcggtgaacc agttgtgttg tcaggacaaa ttacaggatt
      121 aactgaaggc cagcatgggt tccacgtcca tcagtatggg gacaatacac aaggctgtac
      181 cagtgcagga cctcatttta atcctcactc taagaaacat ggtggcccgg cggatgaaga
      241 gaggcatgtt ggagacctgg gcaatgtgac tgctggaaag gacggtgtgg ccaatgtgtc
      301 cattgaagat cgtgtgatct cactctcagg agagcattcc atcattggcc gtacaatggt
      361 ggtccatgag aaacaagatg acttgggcaa aggtggaaat gaagaaagta caaagactgg
      421 aaatgctggg agccgcttgg cctgtggagt gattgggatt gcgcagtaaa cattccctgt
      481 gtggtctgaa gtctcagact caactgctac cctcaaacca ttaaactgta atctg
//
LOCUS       MZECPSIIA     721 bp ds-DNA             ORG       13-AUG-1990
DEFINITION  Maize chloroplast ribosomal protein S11 (rpS11) gene, complete cds,
            and rpoA gene, 5' end.
ACCESSION   M35831
KEYWORDS    ribosomal protein S11; rpS11 gene; rpoA gene.
SOURCE      Maize chloroplast DNA.
  ORGANISM  Chloroplast Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE   1  (bases 1 to 721)
  AUTHORS   Markmann-Mulisch,U. and Subramanian,A.R.
  TITLE     Nucleotide sequence of maize chloroplast rpS11 with conserved amino
            acid sequence between eukaryotes, bacteria and plastids
  JOURNAL   Biochem. Int. 17, 655-664 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        200      631     ribosomal protein S11 (rpS11)
    pept        695  >   721     rpoA gene product
BASE COUNT      252 a    120 c    147 g    202 t
ORIGIN      
        1 agacaggggt aatctttcga aaaagaagct tttctttcta atagcttttc tttctaatta
       61 aagtaaaaag tacccacgga aatgcccaaa ttgcaaataa aaaaaatgaa agtaaaggat
      121 atattttaac ctgaaacgga tatctttgta tcttttttct ttttgttatt tctaactcat
      181 atttatgaga taataaaata tgacaaaagc tataccaaaa ataggttcac gtaagaaagt
      241 gcgtattggt ttgcgtagga atgcccgttt tagtttacgg aagagtgcac gtagaataac
      301 aaaagggatt attcatgttc aagctagttt caacaatacc attataaccg ttacagaccc
      361 acaaggtcgg gtcgttttct ggtcctccgc aggtacttgt ggattcaaaa gctcaagaaa
      421 agcatcaccc tatgctggtc aaagaacagc agtagatgct attcgtacag tgggtttgca
      481 acgagcagaa gttatggtaa aaggtgctgg tagcggaaga gatgccgcat tacgagccat
      541 tgctaaaagt ggtgtacggt taagttgtat acgcgatgta acacctatgc cgcataatgg
      601 atgtcgacct cctaaaaaaa gacgtctgta aaaaaaaatg aaaccgcttt caagagaaat
      661 aaacgattca atgatcaaat aatactagtc tgttatggtt cgagaagaga taacaggatc
      721 c
//
LOCUS       ACUCPCG       103 bp ds-DNA             ORG       13-AUG-1990
DEFINITION  A.calamus chloroplast 4.5S rRNA gene.
ACCESSION   M36166
KEYWORDS    4.5S ribosomal RNA.
SOURCE      A.calamus chloroplast DNA.
  ORGANISM  Chloroplast Acorus calamus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Arecidae; Arales; Araceae; Acorus calamus.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Bobrova,V.K., Troitsky,A.V., Ponomarev,A.G. and Antonov,A.S.
  TITLE     Low-molecular-weight rRNAs sequences and plant phylogeny
            reconstruction: Nucleotide sequences of chloroplast 4.5S rRNAs from
            Acorus calamus (Araceae) and Ligularia calthifolia (Asteraceae)
  JOURNAL   Pl. Syst. Evol. 156, 13-27 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      102     4.5S rRNA
BASE COUNT       30 a     20 c     31 g     22 t
ORIGIN      
        1 taaggtcacg gcgagacgag ccgtttatca ttacgatagg tgccaagtgg aagtgcagtg
       61 atgtatgcag ctgaggcatc ctaacagacc gagagatttg aac
//
LOCUS       LIGCPCG       103 bp ds-DNA             ORG       13-AUG-1990
DEFINITION  L.calthifolia chloroplast 4.5S rRNA gene.
ACCESSION   M36165
KEYWORDS    4.5S ribosomal RNA.
SOURCE      L.calthifolia chloroplast DNA.
  ORGANISM  Chloroplast Ligularia calthifolia
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Asterales; Asteraceae; Ligularia calthifolia.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Bobrova,V.K., Troitsky,A.V., Ponomarev,A.G. and Antonov,A.S.
  TITLE     Low-molecular-weight rRNAs sequences and plant phylogeny
            reconstruction: Nucleotide sequences of chloroplast 4.5S rRNAs from
            Acorus calamus (Araceae) and Ligularia calthifolia (Asteraceae)
  JOURNAL   Pl. Syst. Evol. 156, 13-27 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      103     4.5S rRNA
BASE COUNT       29 a     20 c     32 g     22 t
ORIGIN      
        1 gaaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagtg
       61 atgtatgcag ctgaggcatc ctaacagacc ggtagacttg aac
//
LOCUS       MBICPR1A      128 bp ss-rRNA            RNA       13-AUG-1990
DEFINITION  M.thermoautotrophicum 5S ribosomal RNA.
ACCESSION   M36186
KEYWORDS    5S ribosomal RNA; ribosomal RNA.
SOURCE      M.thermoautotrophicum (strain delta-H; variant 1) RNA.
  ORGANISM  Methanobacterium thermoautotrophicum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanobacteriaceae.
REFERENCE   1  (bases 1 to 128)
  AUTHORS   Willekens,P., Huysmans,E., Vandenberghe,A. and De Wachter,R.
  TITLE     Archaebacterial 5S ribosomal RNA: Nucleotide sequence in two
            methanogen species, secondary structure models, and molecular
            evolution
  JOURNAL   Syst. Appl. Microbiol. 7, 151-159 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      128     5S rRNA
BASE COUNT       18 a     30 c     41 g     39 t
ORIGIN      
        1 ggggcatggc ggtcatggcg caggggttat acctgatctc gtttcgatct cagtagttaa
       61 gtcctgctgc gttgtgggtg tgtactgcgg ttttttgctg tgggaagccc acttcactgc
      121 cagccact
//
LOCUS       MBICPR2A      131 bp ss-rRNA            RNA       13-AUG-1990
DEFINITION  M.thermoautotrophicum 5S ribosomal RNA.
ACCESSION   M36187
KEYWORDS    5S ribosomal RNA; ribosomal RNA.
SOURCE      M.thermoautotrophicum (strain delta-H; variant 2) RNA.
  ORGANISM  Methanobacterium thermoautotrophicum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanobacteriaceae.
REFERENCE   1  (bases 1 to 131)
  AUTHORS   Willekens,P., Huysmans,E., Vandenberghe,A. and De Wachter,R.
  TITLE     Archaebacterial 5S ribosomal RNA: Nucleotide sequence in two
            methanogen species, secondary structure models, and molecular
            evolution
  JOURNAL   Syst. Appl. Microbiol. 7, 151-159 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      131     5S rRNA
BASE COUNT       17 a     30 c     41 g     43 t
ORIGIN      
        1 gggttttggc ggtcatggcg tgggggttta tacctgatct cgtttcgatc tcagtagtta
       61 agtcctgctg cgttgtgggt gtgtactgcg gttttttgct gtgggaagcc cacttcactg
      121 ccagaccacc c
//
LOCUS       MESCPRA       130 bp ss-rRNA            RNA       13-AUG-1990
DEFINITION  M.barkeri 5S ribosomal RNA.
ACCESSION   M36188
KEYWORDS    5S ribosomal RNA; ribosomal RNA.
SOURCE      M.barkeri (strain DSM 800) RNA.
  ORGANISM  Methanosarcina barkeri
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanomicrobiales; Methanosarcinaceae.
REFERENCE   1  (bases 1 to 130)
  AUTHORS   Willekens,P., Huysmans,E., Vandenberghe,A. and De Wachter,R.
  TITLE     Archaebacterial 5S ribosomal RNA: Nucleotide sequence in two
            methanogen species, secondary structure models, and molecular
            evolution
  JOURNAL   Syst. Appl. Microbiol. 7, 151-159 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      130     5S rRNA
BASE COUNT       27 a     40 c     33 g     30 t
ORIGIN      
        1 ggagtttggc gccatagcgg cagtgtaact cctgtaccca tcccgaacac agtagataag
       61 cctgcccgcg ttccttactg tactgaagtg tgcgagcctt cgggaactct ggatcgctgc
      121 catactcacc
//
LOCUS       RICCPQBP     1442 bp ds-DNA             ORG       13-AUG-1990
DEFINITION  O.sativa L. chloroplast Q-B protein (psbA) gene, complete cds.
ACCESSION   M36191
KEYWORDS    Q-B protein; thylakoid membrane protein.
SOURCE      O.sativa L. (var. Labelle) chloroplast DNA.
  ORGANISM  Chloroplast Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Oryza sativa.
REFERENCE   1  (bases 1 to 1442)
  AUTHORS   Wu,N.-H., Cote,J.-C. and Wu,R.
  TITLE     Structure of the chloroplast psbA gene encoding the Q-B protein
            from Oryza sativa L
  JOURNAL   Dev. Genet. 8, 339-350 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        257     1318     Q-B protein (psbA)
    mRNA        177  >  1442     Q-B protein mRNA
BASE COUNT      350 a    275 c    302 g    515 t
ORIGIN      
        1 ggttcgagtc ccgggcaacc catatggaaa ctagaaagga aaaggagcaa tctgagtttt
       61 gatttttcac tcacttcatt tacaaaattt tttggtttgg taaattttgt gtatggatat
      121 acaactgtcg gggctggctt ggttgacatt ggtatatagt ctatattata ctgttaaata
      181 acaagccttc tattatcttt ctagttaata cgtgtgcttg ggagtccttg caatttgaat
      241 aaaccaagat cttaccatga ctgcaatttt agagagacgc gaaagtacaa gcctgtgggg
      301 tcgcttctgc aactggataa ctagcaccga aaaccgtctt tacatcggat ggttcggtgt
      361 tttgatgatc cctaccttat tgaccgcaac ttctgtattt attatcgcct tcatcgctgc
      421 ccctccagta gatattgatg gtattcgtga gcctgtttct ggttctttac tttacggaaa
      481 caatattatc tctggtgcca ttattcctac ttctgcggcg atcggattgc acttttaccc
      541 aatttgggaa gctgcatctg ttgatgagtg gttatacaat ggtggtcctt atgagctaat
      601 tgttctacac ttcttacttg gtgtagcttg ttatatgggt cgtgagtggg aactgagttt
      661 ccgtctgggt atgcgtcctt ggattgctgt tgcatattca gctcctgttg cagctgctac
      721 tgctgttttc ttgatttacc ctattggtca aggaagtttc tctgatggta tgcctttagg
      781 aatatctggt actttcaact ttatgattgt attccaggca gagcataaca tccttatgca
      841 tccatttcac atgttaggtg tagctggtgt attcggcggt tccctattca gtgctatgca
      901 tggttccttg gtaacctcta gtttgatcag ggaaaccact gaaaacgaat ctgctaatga
      961 gggttacaga tttggtcaag aggaagaaac ttataatatt gtggccgctc atggttattt
     1021 tggccgatta atcttccaat atgctagttt taacaactct cgttctttac acttcttctt
     1081 ggctgcttgg cctgtagtag gaatttggtt taccgcttta ggtattagta ctatggcttt
     1141 caatctaaac ggattcaatt tcaaccaatc tgtagttgat agccaaggtc gcgttattaa
     1201 tacttgggct gatatcatca accgtgctaa tcttggtatg gaagtaatgc acgaacgtaa
     1261 tgctcacaac ttccctctag acctagctgc tcttgaagtt ccatctctta atggataagg
     1321 cttttctgct aacatatagc aatttttgaa gaaaggaaag ctagaaatac ccaatatctt
     1381 gctgaagcaa gatattgggt atttcttttt ttttttattt tgaatctttc tattctgaat
     1441 tc
//
LOCUS       RATC7AH7A    3561 bp ss-mRNA            ROD       13-AUG-1990
DEFINITION  Rat cytochrome P450 cholesterol 7-alpha-hydroxylase (P450 VII)
            mRNA, complete cds.
ACCESSION   J05509
KEYWORDS    cholesterol 7-alpha-hydroxylase; cytochrome P450.
SOURCE      Rat (strain Sprague-Dawley) cholestyramine-treated female liver,
            cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3561)
  AUTHORS   Li,Y.C., Wang,D.P. and Chiang,J.Y.L.
  TITLE     Regulation of cholesterol 7-alpha-hydroxylase in the liver:
            Cloning, sequencing, and regulation of cholesterol
            7-alpha-hydroxylase mRNA
  JOURNAL   J. Biol. Chem. 265, 12012-12019 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Chiang,J.Y., 08-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept         47     1558     cholesterol 7-alpha-hydroxylase (EC
                                 1.14.13.17)
    mRNA    <     1     1983     P450 VII mRNA
BASE COUNT     1048 a    719 c    695 g   1099 t
ORIGIN      
        1 gctaggcaaa gagtctcccc tttggaaatt ttcctgcttt tgcaaaatga tgactatttc
       61 tttgatttgg ggaattgccg tgttggtgag ctgttgcata tggtttattg ttggaataag
      121 gagaaggaaa gctggtgaac ctcctttgga gaacgggttg attccgtacc tgggctgtgc
      181 tctgaaattt ggatctaatc ctcttgagtt cctaagagct aatcaaagga agcatggtca
      241 cgtttttacc tgcaaactga tggggaaata tgtccatttc atcacaaact ccctgtcata
      301 ccacaaagtc ttatgtcatg gaaaatattt tgactggaaa aaatttcatt acactacttc
      361 tgcgaaggca tttggacaca gaagcattga cccaaatgat ggaaatacca cggaaaatat
      421 aaacaacact tttaccaaaa ccctccaggg agatgctctg tgttcacttt ctgaagccat
      481 gatgcaaaac ctccaatctg tcatgagacc tcctggcctt cctaaatcaa agagcaatgc
      541 ctgggtcacg gaagggatgt atgccttctg ttaccgagtg atgtttgaag ccggctatct
      601 aacactgttt ggcagagata tttcaaagac agacacacaa aaagcactta ttctaaacaa
      661 ccttgacaac ttcaaacaat ttgaccaagt ctttccggca ctggtggcag gccttcctat
      721 tcacttgttc aagaccgcac ataaagctcg ggaaaagctg gctgagggat tgaagcacaa
      781 gaacctgtgt gtgagggacc aggtctctga actgatccgt ctacgtatgt ttctcaatga
      841 cacgctctcc acctttgacg acatggagaa ggccaagacg cacctcgcta tcctctgggc
      901 atctcaagca aacaccattc ctgcaacctt ttggagctta tttcaaatga tcaggagtcc
      961 tgaagcaatg aaagcagcct ctgaagaagt gagtggagct ttacagagtg ctggccaaga
     1021 gctcagctct ggagggagtg ccatttactt ggatcaagtg caactgaatg acctgccggt
     1081 actagacagc atcatcaagg aggctctgag gctttccagt gcatccttga atatccgcac
     1141 agctaaggag gacttcactc tccatcttga ggacggttcc tataacatcc gaaaagatga
     1201 catgatagct ctttatccac agttaatgca cttggatcct gaaatctacc cagacccttt
     1261 gactttcaaa tatgaccggt accttgatga aagcgggaaa gcaaagacca ccttctacag
     1321 taatggaaac aagctgaagt gtttctacat gcccttcgga tcaggcgcga caatatgtcc
     1381 tggaagactc tttgccgtcc aagaaatcaa gcagtttttg atcctgatgc tctcctgctt
     1441 tgaactggag tttgtggaga gccaagtcaa gtgtccccct ctagaccagt cccgggcagg
     1501 cttgggaatt ttgccaccac tacatgatat tgagtttaaa tataaactga aacactgata
     1561 cgtggttgga agaagcgaac actggatgat gtcacttggc ggctgagagt catcacttaa
     1621 acaggccttc gggaccaatg ctcactgatg cgccctagcg actggattag tgggaagaac
     1681 tttgttctcg ctgcccacat tcctgggtgt tcacatagct ggggccagag cttcatcact
     1741 ttcagaaagc aatgtctttt gtatttattt tcaaaatgaa gatattccaa ttggcaggat
     1801 atttttccta aggaaattgc tttatatttt tatgaaaact accaattaat tatgaaaggc
     1861 ttgaaattca cgttttagtg aaattactga tttttcacta gtaaggttct tcaggtgtga
     1921 aactgtatta taaaaatgtt gtaatgggtc acactgtgct ttgcataaag gtaaaggaaa
     1981 ctatgtttca gccttttctg tgtctatgag attcgaaaat aatcttactg ttctagaaac
     2041 actggggaga atccgacatc gtctagctat attttatttt actgttgcga gaaattttaa
     2101 ttccagtttt caactacctt atctttcccc cattttgaca tgcatgccaa tgagaagagt
     2161 attttttagg aattaacaag gcacctccca gaaccctacc ctgagacttt taagccttta
     2221 atcccagcac tcgagaagta agagccaggc aagatctctg agtctgaggt tattctggtc
     2281 tacatcagct ccagataagc caggactaca gaatgggatc ttgtctaaaa aatacagcta
     2341 atctttatgt cataactgat tatgaatcaa cctaaaagat aaattttcaa tcaggactca
     2401 gagaaaatga gcaattaaaa aacttagctc tgaggtatgt ggaattcatt aagtacaagt
     2461 tgacattaca tgttctttaa aaatagttta tgttttatct ctaaatgccc tgcagatgaa
     2521 gaataataat gaaaagttga ataatactgt ttaaacacta agtgcaataa tgctttggta
     2581 atgtacttta agagaatcat tagccgtgcc agttttacta aaatatattt atatgtaaat
     2641 tatatttatc tttttcttat accataaata taaaaatatt gcaacattta ggtaattttt
     2701 aaaattacat acctttcaga aaattcgaat ggtatgaatg tttgtatgtt ttttaacttt
     2761 gaacagaaca tttaaattat tcatctacgg tgatttttat cttatttatt tctttttgtc
     2821 tcattcatat cttgaagaaa tccaaaaata tctgaaggaa tcgctcactc aaatctctcc
     2881 ctatcgttac agaaaaattc aataccatct ctttttcctc ggggactgaa gcagggtctc
     2941 gtgggtcccg agcagaggct cctgctgcag cgagctttat ccacgggact ccttaaacct
     3001 tttaaaatct tatcactatt atcatgcagt tattacctaa gtaggatatt tccctttcct
     3061 ttttcatttc agcagagtcc cttagcaacc caggctgact gggaccctcc atgtagctta
     3121 agctgtgaac tcactgtact tctgttttca cttattttag gaagtaattt tccctatcag
     3181 aaattttaat tgtttagatg atgtataaga gtaacacaat tctgttatat actaatctgt
     3241 agtaactaat tgttcttaga cagttgatga ctctcaattg atgatcatac atctttccat
     3301 ggcttcttga atgcccattt ctcatacaca gaataatggg tttcacggtg atatcttcct
     3361 ttcatgtctt tattcttgtg cggtgatggt tggcaaatga tacccatgga gcaaggttac
     3421 tcttcctatt tctgtgcagc ctaagtgtta agaaataaat tttttaaata cttagaggga
     3481 aggcacattt tgtgtcatat gtgagtgaca tgtgacacac agactagcaa tccatgagta
     3541 aaattttatt gggattaccg g
//
LOCUS       MUSTEAA      2397 bp ss-mRNA            ROD       13-AUG-1990
DEFINITION  Mouse tea (20.5) mRNA, complete cds.
ACCESSION   M32484
KEYWORDS    cell surface protein.
SOURCE      Mouse (strain AKR/J) T-lymphoma cell line SL12.4.10, cDNA to mRNA,
            clone 20.5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2397)
  AUTHORS   MacLeod,C.L., Finley,K., Kakuda,D., Kozak,C.A. and Wilkinson,M.F.
  TITLE     Activated T cells express a novel gene on chromosome 8 that is
            closely related to the murine exotropic retroviral receptor
  JOURNAL   Mol. Cell. Biol. 10, 3663-3674 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            C.L.MacLeod, 02-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        410     1771     Tea gene product (20.5)
BASE COUNT      548 a    571 c    587 g    691 t
ORIGIN      Chromosome 8.
        1 gggtgtcttt cctcatcgct gccctggcct cggttatggc cggcctttgc tatgctgaat
       61 ttggggcccg agtacccaag actggatctg cgtatctata cacttacgtc acggtcggag
      121 agctgtgggc cttcatcact ggctggaatc tcatcctgtc atatgtcata ggtacgtcca
      181 gtgtcgcaag agcatggagt ggcacctttg acgaacttct taataaacag attggccagt
      241 ttttcaaaac gtacttcaaa atgaattaca ctggtctggc agagtatcca gacttctttg
      301 ccgtgtgcct tgtattactc ctggcaggtc ttttatcttt tggagtaaaa gagtctgctt
      361 gggtgaataa atttttacag ctattaatat cctggtcctt ctctttgtca tggtggctgg
      421 gtttgtgaaa ggaaatgtgg ctaactggaa gatcagtgaa gagtttctca aaaatatatc
      481 agcaagtgct agagaaccac cttctgagaa cggaacaagc atctacgggg ctggcggctt
      541 tatgccctat ggctttacag ggacgttggc tggtgctgca acgtgctttt atgcctttgt
      601 gggctttgac tgcattgcaa caaccggtga agaggttcgg aatccacaaa aggcgatccc
      661 catcggaata gtgacgtcct tacttgtctg ctttatggct tactttgggg tttctgcagc
      721 tttaacgctt atgatgcctt actacctcct ggatgagaaa agtccactcc cagtcgcgtt
      781 tgagtatgtc agatggggcc ccgccaaata cgttgtcgca gcaggctccc tctgcgcctt
      841 atcaacaagt cttcttggat ccattttccc aatgcctcgt gtaatctatg ctatggcgga
      901 ggatgggttg cttttcaaat gtctagctca aatcaattcc aaaacgaaga caccagtaat
      961 tgctactttg tcatcgggtg cagtggcagc tgtgatggcc tttctttttg acctgaaggc
     1021 cctcgtggac atgatgtcta ttggcaccct catggcctac tctctggtgg cagcctgtgt
     1081 gcttattctc aggtaccaac ctggcttgtg ttacgagcag cccaaataca cccctgagaa
     1141 agaaactctg gaatcatgta ccaatgcgac tttgaagagc gagtcccagg tcaccatgct
     1201 gcaaggacag ggtttcagcc tacgaaccct cttcagcccc tctgccctgc ccacacgaca
     1261 gtcggcttcc cttgtgagct ttctggtggg attcctggct ttcctcatcc tgggcttgag
     1321 tattctaacc acgtatggcg tccaggccat tgccagactg gaagcctgga gcctggctct
     1381 tctcgccctg ttccttgtcc tctgcgctgc cgtcattctg accatttgga ggcagccaca
     1441 gaatcagcaa aaagtagcct tcatggtccc gttcttaccg tttctgccgg ccttcagcat
     1501 cctggtcaac atttacttga tggtccagtt aagtgcggac acttggatca gattcagcat
     1561 ctggatggcg cttggctttc tgatctattt cgcctatggc attagacaca gcttggaggg
     1621 taaccccagg gacgaagaag acgatgagga tgccttttca gaaaacatca atgtagcaac
     1681 agaagaaaag tccgtcatgc aagcaaatga ccatcaccaa agaaacctca gcttaccttt
     1741 catacttcat gaaaagacaa gtgaatgttg atgctggccc tcggtcttac cacgcatacc
     1801 ttaacaatga gtacactgtg gccggatgcc accatcgtgc tgggctgtcg tgggtctgct
     1861 gtggacatgg cttgcctaac ttgtacttcc tcctccagac agcttctctt cagatggtgg
     1921 attctgtgtc tgaggagact gcctgagagc actcctcagc tatatgtatc cccaaaacag
     1981 tatgtccgtg tgcgtacatg tatgtctgcg atgtgagtgt tcaatgttgt ccgttattag
     2041 tctgtgacat aattccagca tggtaattgg tggcatatac tgcacacact agtaaacagt
     2101 atattgctga atagagatgt attctgtata tgtcctaggt ggctggggaa atagtggtgg
     2161 tttctttatt aggtatatga ccatcagttt ggacatactg aaatgccatc ccctgtcagg
     2221 atgtttaaca gtggtcatgg gtggggaagg gataaggaat gggcattgtc tataaattgt
     2281 aatgcatata tccttctcct acttgctaag acagctttct taaacggcca gggagagtgt
     2341 ttctttcctc tgtatgacaa gatgaagagg tagtctgtgg ctggagatgg ccaatcc
//
LOCUS       MUS195A      1183 bp ss-mRNA            ROD       13-AUG-1990
DEFINITION  Mouse 19.5 mRNA, complete cds.
ACCESSION   M32486
KEYWORDS    cell surface protein.
SOURCE      Mouse (strain AKR/J) T lymphoma cell line AKR/J SL12.4, cDNA to
            mRNA, clone 19.5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1183)
  AUTHORS   MacLeod,C., Fong,A., Seal,B., Walls,L. and Wilkinson,M.
  TITLE     Isolation of novel cDNA clones from T-lymphoma
  JOURNAL   Cell Growth and Differ. 1, 271-279 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            C.L. MacLeod, 02-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         55      876     19.5 protein
BASE COUNT      297 a    250 c    255 g    381 t
ORIGIN      Chromosome 16.
        1 cctaactgac aaagtgggga gagtaaggtg tgcgcaaaca ggacaagttg ggtcatgggg
       61 agtttcaaag gacatgctct ccctgggagt ttcttcttcg ccatgggctt ttggtggact
      121 atgaagaaca tcctgaaatc tgtctacaaa aggcaaactc gaacctgcta ccttaactct
      181 aaaacattat tacgtcggac agagatttgg gaaggagttg ttgtgctttt aatgtctctc
      241 actggtatag ctggtgaaca gtttatctca ggaggacctg ccttgatctt gcataaagat
      301 ggccagtgga accagatcct gggctggcat cacacaacca tgtacttatt ctttgggcta
      361 cagggtataa cccaaatcat atgtttcact actaatgtac ttccactttc ctcaagcaag
      421 ttaatgttat caattgccat ctttgtggag acatttatgt tctacaacca cacacacggt
      481 cgggaaatga ttgacatttt tgtacaccaa cttctggtct tcgttggcac attttcgggt
      541 ctggttgcct tcttggagtt cctcgtaaag aacaacgcac ttctggagct cctgcggtgc
      601 agtctcctca tgtttcaagg aacctggttc tggcagatgg cgtttgtgct gtacccccca
      661 tgtggaagtg ctacatggaa cctgtcagat attcaaaata aaatgtttct ctcaatgtgc
      721 ttttgctggc attatgcatc aatccttatc ctcattggag taaaatatgc tttggccaac
      781 tggttagtca agtctaggct gaggaagggc tgcacctcag aagttggact cctgaagcat
      841 gctgaccgtg agcaagaatc agaagaagaa gtatgatctt gaagtctttc ttgataagcc
      901 ttctcccttt gcgttgcctt tgttcatggc tttgtttcct gacctctggt ctcaagaaca
      961 cttgtctgag gctgactcca tgctgtttgt acttccagtt ttgttaaagt gttggacttt
     1021 aagtatctta ctttcagctc tgaaagaacc atgagtgata aattcacttt ttacactgtg
     1081 catgccatgt aattcaagac caatcataat tgttttccaa agtttagttt cgtgtccatt
     1141 tattaaaaat attttttttt attttccggg tagatacctt caa
//
LOCUS       MUSCYR61A    2018 bp ss-mRNA            ROD       13-AUG-1990
DEFINITION  Mouse Cyr61 mRNA, complete cds.
ACCESSION   M32490
KEYWORDS    cyr61 product.
SOURCE      Mouse (strain BALB/c) fibroblast cell line 3T3, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2018)
  AUTHORS   O'Brien,T.P., Yang,G.P., Sanders,L. and Lau,L.F.
  TITLE     Expression of cyr61, a growth factor-inducible immediate-early gene
  JOURNAL   Mol. Cell. Biol. 10, 3569-3577 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            L.F.Lau, 02-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        190     1329     Cyr61 product
    mRNA    <     1     2018     Cyr61 mRNA
    site       1453     1551     49 bases repeat
BASE COUNT      500 a    536 c    527 g    455 t
ORIGIN      
        1 agaccgtgag cgagagcgcc ccagagaagc gcctgcaatc tctgcgcctc ctccgccagc
       61 acctcgagag aaggacaccc gccgcctcgg ccctcgcctc accgcactcc gggcgcattt
      121 gatcccgctg ctcgccggct tgttggttct gtgtcgccgc gctcgccccg gttcctcctg
      181 cgcgccacaa tgagctccag caccttcagg acgctcgctg tcgccgtcac ccttctccac
      241 ttgaccagac tggcgctctc cacctgcccc gccgcctgcc actgccctct ggaggcaccc
      301 aagtgcgccc cgggagtcgg gttggtccgg gacggctgcg gctgctgtaa ggtctgcgct
      361 aaacaactca acgaggactg cagcaaaact cagccctgcg accacaccaa ggggttggaa
      421 tgcaatttcg gcgccagctc caccgctctg aaagggatct gcagagctca gtcagaaggc
      481 agaccctgtg aatataactc cagaatctac caaaacgggg aaagcttcca gcccaactgt
      541 aaacaccagt gcacatgtat tgatggcgcc gtgggctgca ttcctctgtg tccccaagaa
      601 ctgtctctcc ccaatctggg ctgtcccaac ccccggctgg tgaaagtcag cgggcagtgc
      661 tgtgaagagt gggtttgtga tgaagacagc attaaggact ccctggacga ccaggatgac
      721 ctcctcggac tcgatgcctc ggaggtggag ttaacgagaa acaatgagtt aatcgcaatt
      781 ggaaaaggca gctcactgaa gaggcttcct gtctttggca ccgaaccgcg agttcttttc
      841 aaccctctgc acgcccatgg ccagaaatgc atcgttcaga ccacgtcttg gtcccagtgc
      901 tccaagagct gcggaactgg catctccaca cgagttacca atgacaaccc agagtgccgc
      961 ctggtgaaag agacccggat ctgtgaagtg cgtccttgtg gacaaccagt gtacagcagc
     1021 ctaaaaaagg gcaagaaatg cagcaagacc aagaaatccc cagaaccagt cagatttact
     1081 tatgcaggat gctccagtgt caagaaatac cggcccaaat actgcggctc ctgcgtagat
     1141 ggccggtgct gcacacctct gcagaccaga actgtgaaga tgcggttccg atgcgaagat
     1201 ggagagatgt tttccaagaa tgtcatgatg atccagtcct gcaaatgtaa ctacaactgc
     1261 ccgcatccca acgaggcatc gttccgactg tacagcctat tcaatgacat ccacaagttc
     1321 agggactaag tgcctccagg gttcctagtg tgggctggac agaggagaag cgcaagcatc
     1381 atggagacgt gggtgggcgg aggatgaatg gtgccttgct cattcttgag tagcattagg
     1441 gtatttcaaa actgccaagg ggctgatgtg gacggacagc agcgcagccg cagttggaga
     1501 atgccaaggg gctgatgtgg acggacagca gcgcagccgc agttggagaa gacttcgctt
     1561 catagtactg gagcgggcat tattgctcca tattggagca tgtttacgga tgacgttctg
     1621 ttttctgttt gtaaattatt tgctaagtgt atttttttgc tccagacccc cccccccttt
     1681 cttggttcta caattgtaat agagacaaaa taagattagt tgggccaagt gaaagccctg
     1741 cttgtccttt gacagaagta aatgaaagcg cctctcattc cttcccgagc ggagggggac
     1801 actctgtgag tgtccttggg gcagctacct gcactctaaa actgcaaaca gaaaccaggt
     1861 gttttaagat tgaatgtttt tttatttatc aaagtgtagc ttttggggag ggaggggaaa
     1921 tgtaatactg gaataatttg taaatgattt taattttata tcagtgaaga gaatttattt
     1981 ataaaattaa tcatttaata aagaaatatt tacctaaa
//

GenBank-Updates@genbank.bio.net (08/14/90)

LOCUS       RATNCAMC     1195 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Rat neural cell adhesion molecule (NCAM-C) gene, exon 1.
ACCESSION   M32612
KEYWORDS    neural cell adhesion molecule.
SOURCE      Rat (strain Sprague-Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1195)
  AUTHORS   Chen,A., Reyes,A. and Akeson,R.
  TITLE     Transcription initiation sites and structural organization of the
            extreme 5' region of the rat neural cell adhesion molecule gene
  JOURNAL   Mol. Cell. Biol. 10, 3314-3324 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Akeson 06-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept       1142  /  1193     neural cell adhesion molecule, exon 1 (NCAM-C)
    pre-msg     947  >  1193     NCAM-C mRNA and introns (alt.)
    pre-msg     948  >  1193     NCAM-C mRNA and introns (alt.)
    pre-msg     950  >  1193     NCAM-C mRNA and introns (alt.)
    pre-msg     951  >  1193     NCAM-C mRNA and introns (alt.)
    pre-msg     952  >  1193     NCAM-C mRNA and introns (alt.)
    pre-msg     955  >  1193     NCAM-C mRNA and introns (alt.)
    pre-msg     956  >  1193     NCAM-C mRNA and introns (alt.)
    IVS        1194  >  1195     NCAM-C intron 1.
    site        307      484     homopurine-homopyrimidine region
BASE COUNT      324 a    275 c    333 g    263 t
ORIGIN      
        1 gaattcttgc tcttgctcct gtgccgtaat acacaccgaa aagccagggc agcagtggag
       61 ggagaggaag agaggattcc aatacttgca agcagtttct ggtaatttct gaatactact
      121 tctaaatgtc tctgcatttt cttctaactg cccccccgca cccccccccc acttctgaga
      181 tctgagttgt gctgctgtga gttgctccct ctgtgggaag gtaaacttca taggtaagag
      241 aaaacaaaca acaaagcaaa cctccacaat cctcatttct ctccaaattt ccaatttcct
      301 tcacttagga ggaagaggag gaagaggagg aagagggtga ggaggaggaa gaggaagaag
      361 aggaagaaga ggaagaagag gaagaagagg aagaagagga agaagaggat gagggaaagg
      421 aggagaagag gaggaggagg aggaggaaga aaaggagaag aaggagaagg agaaaaggag
      481 gaagccattt gcttctattg caaggacttt agtaactaca tggggtgggt gtgggggctc
      541 tacaactttg aaaagcgagc tgaatctaaa aattctttct ttttttttcc taattattta
      601 aacgttcaaa ttcccgatta aggaacgctg gagggcagaa ggatctccaa aggcttgagt
      661 gaaggaaaag ggtgaaaaga agaaatccca gcttctgcct ggaaggttct gtgtgaaaga
      721 gcccggctcc cttggtgact ccaggccgtg ttttgcaggc ggccgcatct gcctcccctg
      781 tctcttacct cctggatgtt aggaactatt tgtggtcggc gtggtggaag gacacagtga
      841 ggttctcacc tccgcccccc gcccgtcgct cgcatcccca gttccatcaa agccaacccg
      901 ggccagcgca gggatctccg agttgcgagt gtgctgaggc tggggctgtc actcattctc
      961 cgctcagcgg tgaacgcagc tcggcagtgg ctggcaagaa acaattctgc aaaaataatc
     1021 atacccagcc tggcaattgt ctgctcctcg gtccattgct ccgccgccgt ccacagtcgc
     1081 ttgcaaggga aggcactgaa tttaccgcgg ccagaacatc cctcccagcc ggcagtttac
     1141 aatgctgcga actaaggatc tcatctggac tttgtttttc ctgggaactg caggt
//
LOCUS       YSCTEC1A     2308 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae TEC1 protein gene, complete cds.
ACCESSION   M32797
KEYWORDS    TEC1 protein.
SOURCE      S.cerevisiae (strain GRF18) haploid cell DNA, clone pBC43-2.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2308)
  AUTHORS   Laloux,I., Dubois,E., Dewerchin,M. and Jacobs,E.
  TITLE     TEC1, a gene involved in the activation of Ty1 and Ty1-mediated
            gene expression in Saccharomyces cerevisiae: Cloning and molecular
            analysis
  JOURNAL   Mol. Cell. Biol. 10, 3541-3550 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by I.M.Laloux 13-MAR-1990.
FEATURES       from  to/span     description
    pept        395     1855     TEC1 protein (TEC1)
    signal      289      294     TATA box
BASE COUNT      781 a    513 c    410 g    604 t
ORIGIN      Chromosome 2, right arm.
        1 tgcgtgtttc agattgcgtt tgcttattat tctgggaaac tctccggaaa caagctgagc
       61 tggactccac gaatggtccc tcacaaggct cttggctcat gcacagttaa aaagcacctg
      121 tagctaccat cagcaatcca tggtgctgga gtttctctcg atggtacggc ccatagctgg
      181 agaaaatagc ttattaccaa aaaagtaatt cccccgcaga ggtatatatt aagtttgaaa
      241 caaaaggctt gcttaatttt ttacatatga caaccctttt cttactttta taaatgttta
      301 tgaaacagga gattcttcga ctggtgccag attcctccca tatcgacaag aagaataatc
      361 cacctatttc aacaattctg atacctgttt aaccatgagt cttaaagaag acgactttgg
      421 caaggataat tctagaaata tagaatcata tactggtaga atttttgacg tatatataca
      481 aaaagattcg tattcacagt cggccttgga tgatatgttt ccagaagccg tagtttcaac
      541 cgccgcttgt gtgaaaaatg aagcggagga taacatcaat ctcatagaca cgcatcctca
      601 attcgaactg gtaaatactg gactgggtgc taaatcggac gatttgaaat ctccatcagc
      661 aaaggctacg ttcactgaca agcagaggaa gaatgaagta ccaaatatat ctgtgagcaa
      721 ctactttccc ggacaaagta gcgaaacgtc gtcaacaacg gaatcttgga ctatcggttg
      781 tgataagtgg tcagaaaagg tagaagaggc attccttgag gcacttagac tgataatgaa
      841 aaatgggacc acaaaaataa aaataagaaa tgccaatttt ggaagaaacg agctgatttc
      901 attatatatc aagcacaaaa ccaacgagtt cagaaccaaa aagcaaattt cttcccatat
      961 tcaagtctgg aagaagacca tacaaaacaa aatcaaggac tcgctgaccc tatcatcaaa
     1021 ggagaaggag cttctacacc ttatcgaaca tggcgctgaa caaactactg aaaactcaaa
     1081 cctgttttat gacatatttg aagaaattat cgactctcta ccttcagtca gtgattctgg
     1141 aagtttaacc cctaaaaacc tctatgtaag taataatagc agtggattgt cagtacattc
     1201 aaaactgctt acgccaatca ctgcttccaa cgagaaaaaa attgaaaatt tcataaaaac
     1261 taatgctgca tctcaagcca aaacccccct catttacgct aagcacattt atgaaaacat
     1321 agacggctac aagtgcgttc cgtcaaagag gcctcttgaa caactttccc ccacggaact
     1381 ccaccaggga gatcgcccca ataaggctag cttttccaac aagaaggcaa tcctggagag
     1441 tgcaaaaaaa atcgaaatag agcagagaaa gataatcaac aaataccaaa gaatttcccg
     1501 catacaagaa catgaaagta atcctgagtt cagttccaat tccaattccg gttcagagta
     1561 cgaatcggag gaagaagtag tcccaagatc agccacagtc acacaactcc aaagcagacc
     1621 agtgccatac tacaagaata atggaatgcc ctactcactc tccaaagtac gaggaaggcc
     1681 catgtatcca agacctgctg aagatgctta caatgccaat tatattcaag gtctgcccca
     1741 gtaccaaaca tcttattttt cgcagctgtt attatcatca ccccagcatt acgaacattc
     1801 tccacatcaa aggaacttta cgccatccaa ccaatcgcat gggaactttt attaaatgtc
     1861 tacatacata catacatctc gtacataaat acgcatacgt atcttcgtag taagaaccgt
     1921 cacagatatg attgagcacg gtacaattat gtattagtca aacattacca gtctcgaaca
     1981 aaagctactc ctgcaacact cttctatcgc acatgtatgg ttcttattgt ttcccgagtt
     2041 cttttttact gacgcgccag aacgagtaag aaagttctct agcgccatgc tgaaattttt
     2101 ttcacttcaa cggacagcga ttttttttct ttttcctccg aaataatgtt gcagcggttc
     2161 tcgatgcctc aagaattgca gaagtaaacc agccaataca catcaaaaaa caactttcat
     2221 tactgtgatt ctctcagtct gttcatttgt cagatattta aggctaaaag gaaatgttgt
     2281 cgagactatc tttattgagt aactcgag
//
LOCUS       YSCACO1A     3130 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae aconitase (ACO1) gene, complete cds.
ACCESSION   M33131
KEYWORDS    aconitase; isocitrate hydro-lyase.
SOURCE      S.cerevisiae (strain S288C) DNA, clone SG7-1.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3130)
  AUTHORS   Gangloff,S.P., Marguet,D. and Lauquin,G.J.-M.
  TITLE     Molecular cloning of the yeast mitochondrial aconitase gene (ACO1)
            and evidence of a synergistic regulation of expression by glucose
            plus glutamate
  JOURNAL   Mol. Cell. Biol. 10, 3551-3561 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.J.-M.Lauquin 21-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        648     2987     aconitase (ACO1) (EC 4.2.1.3)
    binding     357      364     HAP2-HAP3 binding site
    signal      503      513     TATA box
BASE COUNT      907 a    604 c    668 g    951 t
ORIGIN      
        1 gaattccaaa ggctcacaaa tggcattccg tactgatata cttcgcactt tacatatgct
       61 tatataaaaa tgccagtttc ccatcgtcaa ttacgcagag gtagccttcg tttattttct
      121 ctctttttgt atatcattgt attaatcata atccattgta ttttactttg tcttatctgg
      181 ctttaatgaa tgacgccggt cacacgcggg tgcctgattc tcgattgtgc caagccattt
      241 gggcacggtg tcaaattacc taaaaaatgg ccgagagccg caaaagggag gtccgcgggg
      301 ccgggcaata ccctttgttt ttcgagcatt tcggcgccga aatcggaaag gtcctgacca
      361 atcaatagag aaattagtgc atacgagaaa aatttgaacc cttcagttgt tctccgcagg
      421 cgactttaac catcaaacct ccaatcgctg ccggttttcg gaaaggcaag cacaaaaagg
      481 gaggacaagg aaaatctttt gttatatata taatcttccg gttttagagg ttaattagga
      541 ggagtatgtt gcttaattcc gttgtccctt gttctgttca ctctttctta gttattacat
      601 agtagaacga aggtaataaa tactatcatt attactatag atatacaatg ctgtctgcac
      661 gttctgccat caagagaccc attgttcgtg gtcttgcgac agtctccaac ttgactagag
      721 attcaaaagt caaccaaaac ttattagaag atcattcttt tattaactac aagcagaatg
      781 tggaaacgct ggatatcgta agaaaaagat taaacaggcc atttacctac gcggaaaaga
      841 ttttgtacgg tcacttggat gaccctcatg gtcaagatat tcagagaggt gtttcatacc
      901 taaaattaag accagatcgt gttgcctgtc aagatgctac tgctcaaatg gctattttac
      961 aatttatgtc cgctggttta ccacaggttg ctaagccagt cactgtccac tgtgaccatt
     1021 tgattcaagc acaagttggt ggtgaaaaag atttgaagag agctatagat ctaaacaagg
     1081 aagtttatga tttcttggcc tctgccactg cgaaatataa catgggtttc tggaagccag
     1141 gttccggtat cattcaccaa attgttctgg aaaactacgc tttcccaggt gctttgatca
     1201 ttggtactga ctcccataca ccaaatgctg gtggtttagg tcaattggct attggtgttg
     1261 gtggtgctga tgccgttgat gttatggcag gtcgtccatg ggaattgaag gctccaaaga
     1321 tcttaggtgt taagttgact ggtaagatga acggttggac ttctccaaag gatattattt
     1381 tgaaattggc tggtatcaca actgtcaaag gtggtactgg taaaattgtt gaatattttg
     1441 gtgatggtgt tgacactttc tccgctactg gtatgggtac catttgtaat atgggtgctg
     1501 aaatcggtgc taccacatct gttttcccat tcaacaaatc tatgattgaa tatttggaag
     1561 caactggtcg tggtaagatc gctgactttg ctaaattata ccacaaggat ctattatctg
     1621 ctgataagga tgctgaatac gatgaggtcg tcgaaattga cttgaacact ctggaaccat
     1681 acatcaatgg gccatttacc cccgatttgg ctactccagt ttctaagatg aaggaagttg
     1741 ctgttgctaa taactggcca ttggatgtca gagtcggttt gatcggttct tgtaccaatt
     1801 cctcttatga agatatgtct cgttcagcat ccattgtcaa ggacgctgct gctcatggtt
     1861 tgaaatccaa gaccattttc actgttactc caggttctga acaaatcaga gccactattg
     1921 aacgtgatgg ccaattagaa accttcaaag aatttggtgg tatcgttttg gcaaacgcct
     1981 gtggcccatg tattggtcaa tgggatcgta gagatatcaa gaaaggtgac aagaatacta
     2041 ttgtttcctc ttacaacaga aatttcactt ctagaaatga tggtaaccca caaactcatg
     2101 cttttgttgc atctccagaa ttagtaactg cgttcgccat tgcgggtgat ttgagattca
     2161 accctctaac agacaaatta aaggacaagg atggtaatga gttcatgttg aaaccaccac
     2221 atggtcgatg gtttgcctcg aaagaggtta tgatgctggt gagaacactt accaagctcc
     2281 acctgcagac cgtagccacc gttgaagtta aagtttctcc aacttcagac cgtctacaac
     2341 tgttgaaacc attcaaacct tgggatggta aggatgctaa agacatgcca atcttgatta
     2401 aggccgtcgg taagacaact actgatcata tttctatggc tggtccatgg ttgaaataca
     2461 gaggtcattt agaaaacatt tctaataact atatgattgg tgctattaat gctgaaaaca
     2521 agaaggctaa ctgtgttaaa aatgtatata ctggtgaata caaaggtgtt ccagacactg
     2581 ctagagatta cagagaccaa ggtatcaagt gggttgttat tggtgatgaa aactttggtg
     2641 aaggttcctc tcgtgaacac gctgctttgg aaccaagatt cttgggcggt ttcgctatca
     2701 tcacaaagtc tttcgctcgt atccatgaaa ctaacttgaa aaaacaaggt ctattgccat
     2761 tgaacttcaa gaacccagct gactatgaca agatcaaccc tgatgacaga atcgatattc
     2821 tgggtctagc tgaattggct ccaggtaagc ctgtaacaat gagagttcat ccaaagaatg
     2881 gtaagccatg ggatgctgtg ttgacccata ctttcaacga tgagcaaatt gaatggttca
     2941 aatatggttc tgccttaaat aaaattaagg ccgatgagaa gaaataatga aaacattgtt
     3001 ataatctttt aaaggttatt atttattttg tcttctgtac acgtaccctt gtttatcttt
     3061 tctgccttaa atttaatgac gttcggctgg agaagtcaag actatgaaat atatctcgta
     3121 atttatgatc
//
LOCUS       HIVTRA1        59 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1), gag region.
ACCESSION   M27956
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Human immunodeficiency virus type 1 (isolate TRA), clone 01.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 59)
  AUTHORS   Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
            Wain-Hobson,S.
  TITLE     HIV-1 isolates are rapidly evolving quasispecies: Evidence for
            viral mixtures and preferred nucleotide substitutions
  JOURNAL   j acquired immuno defic syndr 2, 344-52 (1989)
  STANDARD  full staff_review
COMMENT     [1] kindly submitted in computer readable form by S. Wain-Hobson.
            Isolated in Paris.  Sibling sequences are available (see 1989:
            I-A-169).
FEATURES       from  to/span     description
    pept    <     1  >    59     gag protein (AA at 2)
BASE COUNT       26 a      8 c     10 g     15 t
ORIGIN      
        1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacca
//
LOCUS       HIVTRA2       268 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1), envelope region.
ACCESSION   M27948
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Human immunodeficiency virus type 1 (isolate TRA), clone 01.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 268)
  AUTHORS   Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
            Wain-Hobson,S.
  TITLE     HIV-1 isolates are rapidly evolving quasispecies: Evidence for
            viral mixtures and preferred nucleotide substitutions
  JOURNAL   j acquired immuno defic syndr 2, 344-52 (1989)
  STANDARD  full staff_review
COMMENT     Sibling sequences are available (see 1989: I-A-170).
FEATURES       from  to/span     description
    pept    <     1  >   268     env protein (AA at 1)
BASE COUNT       95 a     46 c     54 g     73 t
ORIGIN      
        1 caattaaccc cactctgtgt tactttaaat tgcactgatt atttggggaa tgctactaat
       61 accactagta gtagcggggg agggatggag agaggagaaa taaaaaactg ctctttcaat
      121 atcacgacaa gcataagaga taagatgcag agagaatatg cactttttta taaacttgat
      181 gtagtaccaa cagataatga taatactagc tataggttga taagttgtaa cacctcagtc
      241 attacacagg cctgtccaaa agtatcct
//
LOCUS       FLAH1424     1749 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Influenza A/Mallard/Gurjev/263/82 hemagglutinin subtype H14 gene.
ACCESSION   M35997
KEYWORDS    hemagglutinin.
SOURCE      Influenza A/H14/263/82 RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (sites)
  AUTHORS   Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R.
  TITLE     Molecular characterization of a new hemagglutinin, subtype H14, of
            influenza A virus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1749)
  AUTHORS   Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by Y.Kawaoka, 27-JUN-1990.
            
                            Author address: Y.Kawaoka
                            St. Jude Childrens Research Hospital
                            332 N. Lauderdale
                            P.O. Box 318, Memphis, TN
                            38101
BASE COUNT      575 a    360 c    412 g    402 t
ORIGIN      
        1 agcaaaagca ggggaaaatg attgcactca tattggttgc actggctctg agccacactg
       61 cttattctca gatcacaaat gggacaacag gaaaccccat tatatgcttg gggcatcatg
      121 cagtggaaaa cggcacatct gttaaaacac taacagacaa tcacgtagaa gttgtgtcag
      181 ctaaagaatt agttgagacg aaccacactg atgaactgtg cccaagcccc ttgaagcttg
      241 tcgacgggca agactgccac ctcatcaatg gtgcattggg gagtccaggc tgtgaccgtt
      301 tgcaggacac cacttgggat gtcttcattg aaaggcccac tgcagtagac acatgttatc
      361 cattcgacgt cccagattac cagagtctca gaagcatcct agcaagcagt gggagtttgg
      421 agttcatcgc cgaacaattc acctggaatg gtgtcaaagt tgacggatca agcagtgctt
      481 gtttgagggg cggtcgcaac agcttcttct cccgactaaa ctggctaacc aaagcaacaa
      541 atggaaacta tggacctatt aacgtcacta aagaaaatac gggctcttat gtcaggctct
      601 atctctgggg agtgcatcac ccatcaagcg ataatgagca aacggatctc tacaaggtgg
      661 caacagggag agtaacagta tctacccgct cggaccaaat cagtattgtt cccaatatag
      721 gaagtagacc gagggtaagg aatcagagcg gcaggataag catctactgg accctagtaa
      781 acccagggga ctccatcatt ttcaacagta ttgggaattt gattgcacca agaggccact
      841 acaaaataag caaatctact aagagcacag tgcttaaaag tgacaaaagg attgggtcat
      901 gcacaagccc ttgcttaact gataaaggtt cgatccaaag tgacaaacct tttcagaatg
      961 tatcaaggat tgctatagga aactgcccga aatatgtaaa gcaagggtcc ctgatgttag
     1021 caactggaat gcgcaacatc cctggcaaac aggcaaaggg cttatttggg gcaattgctg
     1081 gattcattga aaatggttgg caaggcctga ttgatgggtg gtatggattc aggcaccaaa
     1141 atgctgaagg aacaggaact gctgcagacc tgaagtcaac tcaggcagcc attgatcaga
     1201 taaatggcaa gctgaacaga ttgatagaga agacaaatga aaaatatcac caaatagaaa
     1261 aggaattcga acaggtggaa ggaagaatac aagaccttga gaagtacgtt gaggacacta
     1321 agattgattt gtggtcatac aatgctgaat tgctagtagc actagagaat cagcacacaa
     1381 tagatgtcac agactccgaa atgaacaagc tttttgaaag agtaagaagg caattaagag
     1441 agaatgcaga agatcaaggc aacggttgtt tcgagatatt ccatcagtgt gacaacaatt
     1501 gtatagaaag cattagaaac ggaacttatg accacaacat ctacagggat gaagccatca
     1561 acaatcgaat caaaataaat cctgtcactt tgacgatggg gtacaaggac ataatcctgt
     1621 ggatttcttt ctccatgtca tgctttgtct tcgtggcact gattctggga tttgttctat
     1681 gggcttgtca aaacgggaat atccgatgcc aaatctgtat ataaagaaaa aacacccttg
     1741 tttctactc
//
LOCUS       FLAH14244    1716 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Influenza A/Mallard/Gurjev/244/82 hemagglutinin subtype H14 gene.
ACCESSION   M35996
KEYWORDS    hemagglutinin.
SOURCE      Influenza A/H14/244/82 RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (sites)
  AUTHORS   Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R.
  TITLE     Molecular characterization of a new hemagglutinin, subtype H14, of
            influenza A virus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1716)
  AUTHORS   Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by Y.Kawaoka, 27-JUN-1990.
            
                            Author address: Y.Kawaoka
                            St. Jude Childrens Research Hospital
                            332 N. Lauderdale
                            P.O. Box 318, Memphis, TN
                            38101
BASE COUNT      562 a    346 c    405 g    403 t
ORIGIN      
        1 ttggttgcac tggctctgag ccaaactgct tattctcaga tcacaaatgg gacaacagga
       61 aaccccatta tatgcttggg gcaccatgca gtggaaaacg gcacatctgt taaaacacta
      121 acagacaatc acgtggaagt tgtgtcagct aaagaattag ttgagacgaa acacactgat
      181 gaactgtgcc caagcccctt gaagcttgtc gacgggcaag actgcgacct catcaatggt
      241 gcattgggga gtccaggctg tgatcgtttg caggacacca cttgggatgt cttcattgaa
      301 aggcccactg cagtagacac atgttatcca ttcgacgtcc cagattacca gagtctcaga
      361 agcattctag caagcagtgg gagtctggag ttcatcgccg aacaattcac ctggaatggt
      421 gtcaaagttg acggatcaag cagtgcttgt ttgaggggcg gtcgcaacag cttcttctcc
      481 cggctaaact ggctaaccaa agcaacaaat ggaaactatg gacctattaa cgtcactaaa
      541 gaaaatacgg gctcttatgt caggctctat ctctggggag tgcatcaccc atcaagcgat
      601 aatgagcaaa cggatctcta caaggtggca acagggagag tgacagtatc tacccgctcg
      661 gatcaaatca gtattgttcc caatatagga agtagaccga gagtaaggaa tcagagcggc
      721 aggataagca tctactggac cctagtaaac ccaggggact ccatcatttt caacagtatt
      781 gggaatttga ttgcaccaag aggccattac aaaataagta aatctactaa gagcacagtg
      841 cttaaaagtg acaaaaggat tgggtcatgc acaagccctt gcttaactga taaaggttcg
      901 atccaaagtg acaaaccttt tcagaatgta tcaaggattg ctataggaaa ctgcccgaag
      961 tatgtaaagc aagggtcctt gatgttagca actggaatgc gcaacatccc tggcaaacag
     1021 gcaaagggct tatttggggc aattgctgga ttcattgaaa atggttggca aggcctgatt
     1081 gatgggtggt atggattcag gcaccaaaat gctgaaggaa caggaactgc tgcagacctg
     1141 aagtcaactc aggcagccat tgatcagata aatggcaagc tgaacagatt aatagaaaaa
     1201 acaaatgaaa aatatcacca aatagaaaag gaattcgaac aggtggaagg aagaatacaa
     1261 gaccttgaga agtacgttga ggacactaag attgatttgt ggtcatacaa tgctgaattg
     1321 ctagtcgcac tagagaatca gcacacaata gatgtcacag actccgaaat gaacaagctt
     1381 tttgaaagag taagaaggca attaagagag aatgcagaag atcaaggcaa cggttgtttc
     1441 gagatattcc atcagtgtga caacaattgt atagaaagta ttagaaacgg aacttatgac
     1501 cacaacatct acagggatga agccatcaac aatcgaatca aaataaatcc tgtcactttg
     1561 acgatggggt acaaggacat aatcctgtgg atttctttct ccatgtcatg ctttgtcttc
     1621 gtggcactga ttttgggatt tgttctatgg gcttgtcaaa acgggaatat ccgatgccaa
     1681 atctgtatat aaagaaaaaa cacccttgtt tctact
//
LOCUS       DOGCPLA01     458 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 1 (non-coding).
ACCESSION   M28140 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     1 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 458)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pre-msg     386  >   458     pancreatic lipase mRNA and intron
    IVS         419  >   458     pancreatic lipase intron A
    signal      274      277     CAAT box
    signal      351      354     TATA box
BASE COUNT      133 a     94 c     99 g    132 t
ORIGIN      
        1 ttggagggca tttccaccta ttctaagtga gaaatactac cctccattat tggattttgc
       61 tgctgtgttc ttactggaac ttgtaaacta tttgatgata acatagaaaa tgggaaagag
      121 gggccgcatt ggctcagaat tacatgcaca ggtacatcgg tacaaaggta gggatctttt
      181 acctcctttt aatcaagcat tctttcatct tcccagactg tgggaagatc tgactactat
      241 aattagtctt gttttgtgta tttgagcaga ttacaatatt aagtgcacac agcgtgttct
      301 atcatcacct gctgtttcct tgaagcggcc tccaaatgat aaggctccag tataaaaaaa
      361 aaaaaaaagg cctgggtact tgattgaaag ggagggaagg cccagaacat cccacagggt
      421 aagtcacctc ttctgtcctc ccaccagggt tgaagcag
//
LOCUS       DOGCPLA02     129 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 2.
ACCESSION   M28141 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     2 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 129)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         41  +    89     pancreatic lipase precursor, exon 2 (EC
                                 3.1.1.3) (first expressed exon)
    sigp         41  +    89     pancreatic lipase signal peptide
    IVS     <     1       40     pancreatic lipase intron A
    IVS          90  >   129     pancreatic lipase intron B
BASE COUNT       36 a     26 c     37 g     30 t
ORIGIN      About 38 bp after segment 1.
        1 aataatcggt gagactgaga atgttccgaa tttattgtag atggtgagca tctggacaat
       61 cgcgctgttt ctgctgggag cagccaaagg taggaaacgt cacaccggcc cctcgggaag
      121 gacttaaat
//
LOCUS       DOGCPLA03     235 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 3.
ACCESSION   M28142 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     3 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 235)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    41  +   195     pancreatic lipase precursor, exon 3
    sigp    +    41       42     pancreatic lipase signal peptide
    matp         43  +   195     pancreatic lipase
    IVS     <     1       40     pancreatic lipase intron B
    IVS         196  >   235     pancreatic lipase intron C
BASE COUNT       59 a     66 c     59 g     51 t
ORIGIN      About 506 bp after segment 2.
        1 ggggagctaa gccgtgagca gagtcatcct tcctctacag caaaggaagt ttgctatgaa
       61 caaatcggat gcttttctga cgcggagccc tgggccggga cagcaatcag gcctctgaaa
      121 gttctcccct ggagccccga gagaatcggc acccgcttcc tgctctacac caacaagaac
      181 ccaaacaact ttcaagtgag acctctgtca ttttagtgcc attgtgactg gagtg
//
LOCUS       DOGCPLA04     206 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 4.
ACCESSION   M28143 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     4 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 206)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    41  +   166     pancreatic lipase precursor, exon 4
    matp    +    41  +   166     pancreatic lipase
    IVS     <     1       40     pancreatic lipase intron C
    IVS         167  >   206     pancreatic lipase intron D
BASE COUNT       55 a     60 c     44 g     47 t
ORIGIN      About 445 bp after segment 3.
        1 ggctacaaaa taatgcgacc ctttcccact tttatcacag actctccttc cctctgatcc
       61 atcaacgatt gaggcatcaa attttcaaac agacaagaag acccggttca tcatccatgg
      121 cttcatagac aagggagaag agaactggct gctggatatg tgcaaggtag gggccggctc
      181 ccaggctcac agtctgccct gctccc
//
LOCUS       DOGCPLA05     215 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 5.
ACCESSION   M28144 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     5 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 215)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    41  +   175     pancreatic lipase precursor, exon 5
    matp    +    41  +   175     pancreatic lipase
    IVS     <     1       40     pancreatic lipase intron D
    IVS         176  >   215     pancreatic lipase intron E
BASE COUNT       44 a     58 c     67 g     46 t
ORIGIN      About 2090 bp after segment 4.
        1 gtgcccctgt catgacatcc tggtgtctct ccaacatcag aacatgttca aggttgagga
       61 ggtgaactgc atctgcgtgg actggaagaa aggttcccaa acctcatata cacaggctgc
      121 caacaacgtg cgggtggtgg gcgcccaggt ggcccagatg ctcagcatgc tctcggtgag
      181 tctgccggct gggctcctgc tgaggtaggc agtgc
//
LOCUS       DOGCPLA06     189 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 6.
ACCESSION   M28145 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     6 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 189)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    41  +   149     pancreatic lipase precursor, exon 6
    matp    +    41  +   149     pancreatic lipase
    IVS     <     1       40     pancreatic lipase intron E
    IVS         150  >   189     pancreatic lipase intron F
BASE COUNT       49 a     55 c     49 g     36 t
ORIGIN      About 957 after segment 5.
        1 accacaaaaa tgttctgaaa atatgatctt ccttctccag gcaaactaca gctactcacc
       61 ttcccaagtc cagctcatcg gccacagcct gggagcccac gtggctgggg aggcagggag
      121 caggactcca ggtctgggca ggattacagg taagacccaa ggggcaggct ccagattttg
      181 tccccagta
//
LOCUS       DOGCPLA07     380 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 7.
ACCESSION   M28146 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     7 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 380)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +     1  +   120     pancreatic lipase precursor, exon 7
    matp    +     1  +   120     pancreatic lipase
    IVS         121  >   380     pancreatic lipase intron G
BASE COUNT       99 a     89 c    101 g     91 t
ORIGIN      About 1340 bp after segment 6.
        1 ggttggatcc tgtagaagca agtttccagg gcactcctga agaggttcga cttgatccca
       61 ctgatgctga ctttgttgat gtgattcaca cagatgcagc tcccctgatc ccattcctgg
      121 gtgagcccaa tgacgcttta gttgcaaaca cgtgcaataa tgtttgaacc aaaaagagcc
      181 ctgtgtggtg aaaagctcat tgctgctcca aacatttcag gttttggaac aagccaacag
      241 atgggtcacc ttgacttctt ccccaatggt ggagaggaaa tgccaggatg caagaagaat
      301 gccctgtcac agatcgtgga cctagatggc atttgggaag gtaaagccca agcagaacgg
      361 ggcgtcactt ccctggggtg
//
LOCUS       DOGCPLA08     198 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 8.
ACCESSION   M28147 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     8 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 198)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    41  +   159     pancreatic lipase precursor, exon 8
    matp    +    41  +   159     pancreatic lipase
    IVS     <     1       40     pancreatic lipase intron G
    IVS         160  >   198     pancreatic lipase intron H
BASE COUNT       38 a     61 c     40 g     59 t
ORIGIN      About 1366 bp after segment 7.
        1 ccccaaccac ccgtgcagac ttccttgttt ctttctccag gaactcggga cttcgtggct
       61 tgcaatcacc tgagaagtta caagtattac tcagagagca tcctcaaccc tgatggattt
      121 gcttcctacc cctgtgcttc ctacagggcc tttgaatctg taagctatcg tccttccttg
      181 agtccctcgt ggacggtt
//
LOCUS       DOGCPLA09     211 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 9.
ACCESSION   M28148 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     9 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 211)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    42  +   171     pancreatic lipase precursor, exon 9
    matp    +    42  +   171     pancreatic lipase
    IVS     <     1       41     pancreatic lipase intron H
    IVS         172  >   211     pancreatic lipase intron I
BASE COUNT       55 a     56 c     41 g     59 t
ORIGIN      About 675 bp after segment 8.
        1 gatttacatc ttacagctgg ctctctctct tttctcctta gaacaagtgc ttcccctgcc
       61 cagatcaagg gtgcccacag atgggtcact atgctgataa atttgctgtc aagacaagtg
      121 atgagacaca gaaatacttc ctgaacaccg gagattccag caattttgct cgtaagtttt
      181 accttgaccc tgctcaccca tgtcaaggag a
//
LOCUS       DOGCPLA10     189 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 10.
ACCESSION   M28149 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     10 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 189)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    41  +   149     pancreatic lipase precursor, exon 10
    matp    +    41  +   149     pancreatic lipase
    IVS     <     1       40     pancreatic lipase intron I
    IVS         150  >   189     pancreatic lipase intron J
BASE COUNT       47 a     38 c     43 g     61 t
ORIGIN      About 1871 bp after segment 9.
        1 gtgcatgagt caattgtact cttttctatc ccctgtgcag gctggagata cggggtttct
       61 ataacattgt ctgggaaaag agccactggt caggctaaag ttgctttgtt tggaagtaag
      121 ggaaatactc atcaattcaa tatcttcaag taagtttcct ctttcagcgt cacgtctcac
      181 tggatgcta
//
LOCUS       DOGCPLA11     248 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 11.
ACCESSION   M28150 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     11 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 248)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    41  +   208     pancreatic lipase precursor, exon 11
    matp    +    41  +   208     pancreatic lipase
    IVS     <     1       40     pancreatic lipase intron J
    IVS         209  >   248     pancreatic lipase intron K
BASE COUNT       72 a     42 c     59 g     75 t
ORIGIN      About 756 after segment 10.
        1 gtttctatgc atgagaatgt ttctatctct ttatttcaag ggggattctc aaaccaggct
       61 ctactcattc caatgagttt gatgcaaagc ttgatgttgg aacaattgag aaagtcaagt
      121 ttctttggaa taacaacgtg gtaaacccaa cctttcccaa agtgggtgca gccaagatca
      181 ccgtgcaaaa gggagaggag aaaacagtgt atgtatatct ttgctggatt gtgtctaagg
      241 atgttggt
//
LOCUS       DOGCPLA12     184 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Canine pancreatic lipase gene, exon 12.
ACCESSION   M28151 J04961
KEYWORDS    pancreatic lipase.
SEGMENT     12 of 12
SOURCE      Canine spleen DNA, clones L[13,20,4].
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 184)
  AUTHORS   Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and
            Scheele,G.A.
  TITLE     Structure of the canine pancreatic lipase gene
  JOURNAL   J. Biol. Chem. 264, 12895-12901 (1989)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    37      100     pancreatic lipase precursor, exon 12
    matp    +    37       97     pancreatic lipase
    pre-msg <     1      156     pancreatic lipase mRNA and intron
    IVS     <     1       36     pancreatic lipase intron K
    signal      135      140     poly-A signal
BASE COUNT       44 a     60 c     39 g     41 t
ORIGIN      About 3005 bp after segment 11.
        1 tcacaacact tacaaagctc tcattttggt ctgcaggcac agcttctgca gcgaaagcac
       61 cgtgagggag gatgttctgc tcaccctcac gccctgttaa cgtccaggcg caacccgggc
      121 accgtgttaa cagcaataaa aaccactgat gcatttaccc actcccacgt ctgtcgctgc
      181 tgtc
//
LOCUS       MZECPSECX     395 bp ds-DNA             ORG       14-AUG-1990
DEFINITION  Maize chloroplast 50S ribosomal protein (secX) gene, complete cds.
ACCESSION   M35956
KEYWORDS    50S ribosomal protein.
SOURCE      Maize chloroplast DNA.
  ORGANISM  Chloroplast Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE   1  (bases 1 to 395)
  AUTHORS   Markmann-Mulisch,U., von Knoblauch,K., Lehmann,A. and
            Subramanian,A.R.
  TITLE     Nucleotide sequence and linkage map position of the secX gene in
            maize chloroplast and evidence that it encodes a protein belonging
            to the 50S ribosomal subunit
  JOURNAL   Biochem. Int. 15, 1057-1067 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        165      278     50S ribosomal protein (secX)
    binding     151      154     secX ribosomal binding site (put.)
BASE COUNT      153 a     55 c     74 g    113 t
ORIGIN      
        1 gaagcgtacc gaagactcga aggatactga agatttgaag gataccaaag attcaaagga
       61 ttaggttttt ctggggtaat aacttccaag tttcaaaatt taagtgaaga gacttatttt
      121 ttccaaaaga atagattcat agtttaagaa aggaataccc atatatgaaa ataagagctt
      181 ccgttcgtaa aatttgtaca aaatgtcgac tgattcgcag gcgtgggcga attagagtta
      241 tttgttccaa tccgaagcat aaacaaagac aggggtaatc tttcgaaaaa gaagcttttc
      301 tttctaatag cttttctttc taattaaagt aaaaagtacc cacggaaatg cccaaattgc
      361 aaataaaaaa aatgaaagta aaggatatat tttaa
//
LOCUS       ATHTGYA       210 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  A.thaliana tRNA-Tyr gene.
ACCESSION   M35957
KEYWORDS    transfer RNA-Tyr.
SOURCE      A.thaliana leaf DNA, clone pAtT1.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 210)
  AUTHORS   Stange,N., Gross,H.J. and Beier,H.
  TITLE     Wheat germ splicing endonuclease is highly specific for plant
            pre-tRNAs
  JOURNAL   EMBO J. 7, 3823-3828 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA         71      107     Tyr-tRNA, exon 1
                120      155     Tyr-tRNA, exon 2
    RNA          71      181     precursor Tyr-tRNA (+/- 5 bp)
    IVS         108      119     Tyr-tRNA intron (no splice consensus)
BASE COUNT       59 a     37 c     44 g     70 t
ORIGIN      
        1 ctaacttaat acaaatcatt gtgttcatag aatatttaga tcagtacaca tgcatgaaat
       61 agaatacaat ccgaccttag ctcagttggt agagcggagg actgtagtag acgcagatta
      121 tccttaggtc actggttcga atccggtagg tcggaatttg ctcccacatg agagcttttt
      181 atttttcttt cgttgtgaca ttaaggtttt
//
LOCUS       ATHTGYB       210 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  A.thaliana tRNA-Tyr gene.
ACCESSION   M35958
KEYWORDS    transfer RNA-Tyr.
SOURCE      A.thaliana leaf DNA, clone pAtT3.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 210)
  AUTHORS   Stange,N., Gross,H.J. and Beier,H.
  TITLE     Wheat germ splicing endonuclease is highly specific for plant
            pre-tRNAs
  JOURNAL   EMBO J. 7, 3823-3828 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA         71      107     Tyr-tRNA, exon 1
                120      155     Tyr-tRNA, exon 2
    RNA          71      178     precursor Tyr-tRNA (+/- 5 bp)
    IVS         108      119     Tyr-tRNA intron (no splice consensus)
BASE COUNT       66 a     34 c     41 g     69 t
ORIGIN      
        1 tcataagttt aaatttgatt aaaaagcttc tcttgaatca tcaacatgca taaagtgtta
       61 taatacaaaa ccgaccttag ctcagttggt agagcggagg actgtagttg acgcagataa
      121 tccttaggtc actggttcga atccggtagg tcggatcatt aaaattgaaa gttttttttt
      181 tccatttcgt ttcaaagtgc ttagacaaga
//
LOCUS       STMCHOPA     3104 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  Streptomyces sp. cytochrome-P450-like protein (choP) gene and
            cholesterol oxidase (choA) gene, complete cds.
ACCESSION   M31939 J03356
KEYWORDS    cholesterol oxidase; cytochrome-P450-like protein.
SOURCE      Streptomyces sp. (strain SA-COO) DNA, clone pCO-[1,3].
  ORGANISM  Streptomyces sp.
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 963 to 3104)
  AUTHORS   Ishizaki,T., Hirayama,N., Shinkawa,H., Nimi,O. and Murooka,Y.
  TITLE     Nucleotide sequence of the gene for cholesterol oxidase from a
            Streptomyces sp
  JOURNAL   J. Bacteriol. 171, 596-601 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1450)
  AUTHORS   Horii,M., Ishizaki,T., Paik,S.-Y., Manome,T. and Murooka,Y.
  TITLE     An operon containing the gene for cholesterol oxidase and a
            cytochrome P-450-like protein from Streptomyces sp
  JOURNAL   J. Bacteriol. 172, 3644-3653 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and clean copy of sequence for [1] kindly provided by
            Y.Murooka, 01-DEC-1989.  Draft entry and printed sequence for [2]
            kindly submitted by Y.Murooka, 02-FEB-1990, for release after
            publication.
FEATURES       from  to/span     description
    mRNA        100  >  1283     choP mRNA
    pept        141     1286     cytochrome-P450-like protein (choP)
    pept       1361     3001     cholesterol oxidase precursor (choA, EC
                                 1.1.3.6)
    sigp       1361     1486     cholesterol oxidase signal peptide
    matp       1487     2998     cholesterol oxidase
    mRNA       1384  >  3104     choA mRNA
    signal       65       70     -35 region
    signal       89       94     -10 region
    binding     129      134     choP ribosomal binding site (put.)
    binding    1340     1345     choA ribosomal binding site (put.)
BASE COUNT      518 a   1234 c    932 g    420 t
ORIGIN      
        1 ggtacccccg gtaaccgaac cccgtgcccg tacccgcagg tagcccgccg ggggaacgat
       61 ctcgttgaca gccttcacat cgcctccata cggtcatttc gacggcgccg aacatgccga
      121 tcactcgtga aagggcatac atgacccagg ctgcgccggt gaccttctcg acagtccgtg
      181 agaattactt cggtccgccg gcggagatgc aggcactgcg gcacaaggca cccgtgacgc
      241 gcacggcctt cgccgacggc cgccccggct ggctcgtcac cggatattcc gccgcgcgtg
      301 cggtgttatc cgattcccgt ttcaccgccc gcggagaacg cgagcacccc gccgtgccgc
      361 gcgccgcgac cctggaggac gagcgctgcc gcaggctgat cgccggacag ttcaccgccc
      421 gccggatgcg gcagctcacc gggcggaccg agcggatcgt ccgcgaacac ctcgacgcca
      481 tggagcacat gggctccccg gccgacctcg tcgagcactt cgccctcccc gtcccctcgc
      541 tcgtcatcgc cgaactgctc ggcgtgcccc cccccgaccg ggagcacttc cagcacgaca
      601 ccctccggtg gggcggcttc ggccggtcca ccgaggaggt caccgaggcc ttcgtctccc
      661 tcggcggcca actgcagcgc ctcgtgcggc tcaagcgcac cgagccgggc gacgacctcc
      721 tcagcggcct gatcgccgcc gaccccgcgc tcaccgacga ggaactcgcc tcgatcgcct
      781 tcctgttgct ggtcgccggc cacgggacca ccgcccacca gatcgccctc ggcgccttcc
      841 tgctgctcga acaccccgac cagctggccg cgctgcgcgc cgaccccgcc ctgacggaga
      901 gcgcggtcga ggagctgctg cgccacctga gcgtcgtcca ccacggcccc acccgtgccg
      961 cgctgcagga cgccgacatc gagggcaccc ccgtcaaggc gggcgaggtc gtcgtggtct
     1021 cgctcggcgc cgccaaccgc gaccccgcac gcttcgagcg gcccgacgcc gtggatgtca
     1081 cccgcgagga caccggccac ctcgccttcg ggcacggcat gcaccagtgc ctcggccggc
     1141 aactggcccg catcgaactg cgcgtcgccc tcaccgccct cctcgaacgc ttcccgcacc
     1201 tgcgcctggc ctgccccgcg gcggagatcc cgctccgcca cgacatgcag gtctacggcg
     1261 ccgaccggct gccggtcgcc tggtgaccgt ccccgtcacc cccccgaacc accttcccca
     1321 cccctgaaga aacgtctcgg aaaggtcagg tagaccccgc gtgactgcac aacagcacct
     1381 gtcccgccgc cgcatgctcg gcatggccgc cttcggcgcc gccgccctcg ccgggggcac
     1441 caccatcgcc gccccccgtg cggccgccgc cgccaagtcc gcggcggaca acggcggtta
     1501 cgtccccgcc gtcgtcatcg gcaccggcta cggcgcggcc gtctccgcgc tgcgcctcgg
     1561 cgaggcgggt gtgcagaccc tgatgctgga gatgggccag ctgtggaacc agcccggccc
     1621 cgacggcaac atcttctgcg gcatgctcaa cccggacaag cggtccagct ggttcaagaa
     1681 ccgcaccgag gccccgctcg gcagcttcct ctggctcgac gtcgtcaacc ggaacatcga
     1741 cccctacgcg ggtgtcctgg accgtgtgaa ctacgaccag atgtcggtct acgtgggccg
     1801 cggcgtcggc ggcggctcgc tcgtcaacgg cggcatggcc gtggagccca agcgctcgta
     1861 cttcgaggag atcctcccgc gggtcgactc ctccgagatg tacgaccgct acttcccccg
     1921 cgccaactcc atgctccgcg tcaaccacat cgacaccaag tggttcgagg acaccgagtg
     1981 gtacaagttc gcccgcgtct cgcgcgagca ggcgggcaag gccggtctcg gcaccgtctt
     2041 cgtccccaac gtctacgact tcggctacat gcagcgcgag gccgcgggcg aggtgcccaa
     2101 gtccgccctg gcgaccgagg tcatctacgg caacaaccac ggcaagcaga gcctggacaa
     2161 gacctacctg gccgccgcac tcggcaccgg caaggtcacc atccagaccc tgcaccaggt
     2221 caagacgatc cgtcagacga aggacggcgg ctacgcgctg accgtcgagc agaaggacac
     2281 cgacggcaag ctcctggcca ccaaggagat ctcctgccgc tacctgttcc tcggcgcggg
     2341 cagcctcggc tccaccgaac tgctggtgcg cgcccgcgac accggcaccc tgccgaacct
     2401 caactccgag gtgggcgcgg gctggggccc caacggcaac atcatgaccg cccgggccaa
     2461 ccacatgtgg aaccccaccg gcgcccacca gtcctccatc cccgccctcg gcatcgacgc
     2521 gtgggacaac agcgactcct cggtcttcgc ggagatcgcc cccatgccgg ccggcctgga
     2581 gacgtgggtc agcctctacc tcgcgatcac caagaacccc cagcgcggca ccttcgtgta
     2641 cgacgccgcg acggaccgcg cgaagctcaa ctggacccgt gaccagaacg cccccgcggt
     2701 caacgcagcc aaggcgctgt tcgaccggat caacaaggcg aacggcacga tctaccggta
     2761 cgacctcttc ggcacccagc tgaaggcctt cgccgacgac ttctgctacc acccgctcgg
     2821 cggctgcgtc ctgggcaagg cgacggacga ctacggccgc gtcgccggtt acaagaacct
     2881 ctacgtgacc gacggttcgc tgatcccggg ttccgtcggc gtcaacccgt tcgtgaccat
     2941 cacggcgctg gccgagcgga acgtcgagcg catcatcaag caggacgtca cggcgtcgta
     3001 atcacgcacc gaccgggccg gaagggaaga acggcccggt caccgcacca tcccggaagg
     3061 aacgcaccat tcccgtgaac gccgacgaca acagcctgtg gatc
//
LOCUS       STMCEFDA     2494 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  S.clavuligerus isopenicillin N epimerase (cefD) gene and
            deacetoxycephalosporin C synthetase (DAOCS) gene, complete cds.
ACCESSION   M32324 M24140
KEYWORDS    deacetoxycephalosporin C synthetase; isopenicillin N epimerase.
SOURCE      S.clavuligerus DNA, clone pOW380.
  ORGANISM  Streptomyces clavuligerus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1520 to 2494)
  AUTHORS   Kovacevic,S., Weigel,B.J., Tobin,M.B., Ingolia,T.D. and Miller,J.R.
  TITLE     Cloning, characterization, and expression in Escherichia coli of
            the Streptomyces clavuligerus gene encoding deacetoxycephalosporin
            C synthetase
  JOURNAL   J. Bacteriol. 171, 754-760 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1706)
  AUTHORS   Kovacevic,S., Tobin,M.B. and Miller,J.R.
  TITLE     The beta-lactam biosynthetis genes for isopenicillin N epimerase
            and deacetoxycephalosporin C synthetase are expressed from a single
            transcript in Streptomyces clavuligerus
  JOURNAL   J. Bacteriol. 172, 3952-3958 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.R.Miller, 26-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept        284     1480     isopenicillin N epimerase (cefD)
    pept       1559     2494     deacetoxycephalosporin C synthetase (DAOCS)
    mRNA        152  >  1706     cefD mRNA
BASE COUNT      387 a    931 c    791 g    385 t
ORIGIN      
        1 ccatgggtga taactcctgt ggagcgggac aaaaggtcga aagattcaaa tgccgagcag
       61 cccggacggt ttactgcata atgtgggcta cctatgctgg acgacaggac tcttgaagtg
      121 ctcttcggct ggtcttcaga atctcttcgc tatttttccg aagcggacag aaaatcttcc
      181 tgttcgagat cgagtcatga ctgctacgct gcatcgggcc gatcgaattt cgatggctgc
      241 actgaattga ctgttcttct ggatttgcgg gagatgcgtt gacatggcgg tagccgactg
      301 ggaagaagcc cgcggccgta tgctgctcga ccccaccgtc gtcaacctca acaccggctc
      361 cgggggaccg ctgccgcgca gcgccttcga gcgggtcacc ggcttccgcg cccatctcgc
      421 ggccgagccg atggacttcc tgctccgcga ggtccccgca ctgctgtggc aggcgcggga
      481 gagcctcgcc cgcctcatcg gcggggaccc gctgcgcctc gccctggcca ccaacgtcac
      541 cgccgccgtc aacctcgtcg cgtcgtcact gcgcctcgaa gcgcccggcg agatcctgct
      601 cagcgacgac gagtacacgc ccatgcgctg gtgctgggag cgggtcgccc ggcggcacgg
      661 cctggagctg aggacgttcc ggctgcccga gctgccctcg gacccggccg agatcaccgc
      721 ggcggcggtc gccgcgatgg gaccgcgcac ccggctgttc ttcttcagcc atgtcgtctc
      781 cacgaccggg ctgatcctgc ccgccgccga actgtgcgag gaggcccgcg cacggggcat
      841 caccaccgtg gtcgacggcg cccacgcacc cggcttcctc gacctcgacc tctcccggat
      901 cccctgcgac ttctacgcgg gcagcggcca caaatggctg ctcgccccca ccggggtcgg
      961 ctttctccac ctcgcccccg gccgcctgga agaactggag cccacccagg tgagctgggc
     1021 gtacgagccc ccggagggca gcggcccgcc ggccgcgcgc gaccgcttcg gcagcacacc
     1081 cgggctgcgc agactcgaat gcgaggggac ccgggacatc tgcccctggc tcgccacacc
     1141 ggagtcgatc gacttccagg ccgagctggg ccccggggcg atccgcgccc gccgccgcga
     1201 gctgacggac cacgcgcgcc gcctgctcgc cgaccgcccg ggccgcacgc tcctcacccc
     1261 cgactccccg gagctgtccg gcggcatggt ggcctaccgg ctgcccccgg gaaccgacgc
     1321 ggccgagctg cgccgggggc tctgggagcg cttccggatc gaggccgcgg tggccgagca
     1381 gccgcccggg ccggtgctcc ggatctcggc gaacttctac accaccgaag aggagatcga
     1441 ccgcctggcg gacgcgctgg acgcgctgac gggcgagtga tccccccggc tcgcggaccg
     1501 cctcccccgc gctgttgacc acccggttca cggattacga gaggatcagt gagagttgat
     1561 ggacacgacg gtgcccacct tcagcctggc cgaactccag cagggcctgc accaggacga
     1621 gttccgcagg tgtctgaggg acaagggcct cttctatctg acggactgcg gtctgaccga
     1681 caccgagctg aagtcggcca aggacctcgt catcgacttc ttcgagcacg gcagcgaggc
     1741 ggagaagcgc gccgtcacct cgcccgtccc caccatgcgc cgcggcttca ccgggctgga
     1801 gtcggagagc accgcccaga tcaccaatac cggcagctac tccgactact cgatgtgcta
     1861 ctcgatgggc accgcggaca acctcttccc gtccggtgac ttcgagcgga tctggaccca
     1921 gtacttcgac cgccagtaca ccgcctcccg cgcggtcgcc cgggaggtcc tgcgggcgac
     1981 cgggaccgag cccgacggcg gggtcgaggc cttcctcgac tgcgagccgc tgctgcggtt
     2041 ccgctacttc ccgcaggtcc ccgagcaccg cagcgccgag gagcagcccc tgcggatggc
     2101 gccgcactac gacctgtcga tggtcaccct catccagcag acaccctgcg ccaacggctt
     2161 cgtcagcctc caggccgagg tcggcggcgc gttcacggac ctgccctacc gtccggacgc
     2221 cgtcctcgtc ttctgcggcg ccatcgcgac cctggtgacc ggcggccagg tcaaggcccc
     2281 ccggcaccat gtcgcggccc cccgcaggga ccagatagcg ggcagcagcc gcacctccag
     2341 tgtgttcttc ctccgtccca acgcggactt caccttctcc gtcccgctgg cgcgcgagtg
     2401 cggcttcgat gtcagcctgg acggcgagac cgccacgttc caggattgga tcgggggcaa
     2461 ctacgtgaac atccgccgca catccaaggc atag
//
LOCUS       AMTMTCYB      225 bp ds-DNA             ORG       14-AUG-1990
DEFINITION  Salamander (A.tigrinum) mitochondrial cytochrome b, partial cds.
ACCESSION   M26230 M28046
KEYWORDS    cytochrome; cytochrome b.
SOURCE      A.tigrinum liver mitochondrial DNA.
  ORGANISM  Mitochondrion Ambystoma tigrinum
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Caudata; Ambystomatidae; Ambystoma tigrinum.
REFERENCE   1  (bases 1 to 225)
  AUTHORS   Kocher,T.D., Thomas,W.K., Meyer,A., Edwards,S.V., Paeaebo,S.,
            Villablanca,F.X. and Wilson,A.C.
  TITLE     Dynamics of mitochondrial DNA evolution in animals: Amplification
            and sequencing with conserved primers
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 6196-6200 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 225)
  AUTHORS   Kocher,T.D. and White,T.J.
  TITLE     Evolutionary analysis via PCR
  JOURNAL   (in) Erlich,H. (Ed.);
            PCR Technology: Princip. and Appl. for DNA amplification: 137-147;
            Stockton Press, New York (1989).
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1],[2] kindly
            submitted by T.D.Kocher, 18-JUL-1989.
               Bernie 12-SEP-1989 initial full entry [1]
FEATURES       from  to/span     description
    pept    <     1  >   225     cytochrome b (AA at 1)
BASE COUNT       68 a     39 c     35 g     83 t
ORIGIN      
        1 acagggttat ttttagctat acattataca gcagatacat catcagcatt ctcatccgta
       61 gcccacattt gccgagatgt aaattatggt tgacttatac gaaatattca cgcaaacggc
      121 gcttcattct tttttatttg tatctttctt catattggtc gaggaatata ttacggctca
      181 tatatgttca aagaaacatg aaacattgga gtaattttat tattt
//
LOCUS       BLYLEC        972 bp ss-mRNA            PLN       14-AUG-1990
DEFINITION  Barley root-specific lectin mRNA, complete cds.
ACCESSION   M29280 M24846
KEYWORDS    lectin.
SOURCE      Barley (var. Betzes) embryo, cDNA to mRNA, clone BLc3.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 972)
  AUTHORS   Lerner,D.R. and Raikhel,N.V.
  TITLE     Cloning and characterization of root-specific barley lectin
  JOURNAL   Plant Physiol. 91, 124-129 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and sequence for [1] kindly submitted by N.Raikhel,
            16-MAY-1989.
FEATURES       from  to/span     description
    pept         16      654     lectin precursor
    sigp         16       93     lectin signal peptide (put.)
    matp         94      606     lectin
BASE COUNT      197 a    285 c    291 g    199 t
ORIGIN      
        1 cagaaaacaa gaaggatgaa gatgatgagc accagggccc tcgctctcgg cgcggccgcc
       61 gtcctcgcct tcgcggcggc gaccgcgcac gcccagaggt gcggcgagca gggcagcaac
      121 atggagtgcc ccaacaacct ctgctgcagc cagtacgggt actgcggcat gggcggcgac
      181 tactgcggca agggctgcca gaacggcgcc tgctacacca gcaagcgctg cggcactcag
      241 gccggcggca agacatgccc taacaaccac tgctgcagcc agtggggtta ctgcggcttc
      301 ggcgccgagt actgcggcgc cggctgccag ggcggcccct gccgcgccga catcaagtgc
      361 ggcagccagg ccggcggcaa gctttgcccc aacaacctct gctgcagcca gtggggttac
      421 tgcggcctcg gctccgagtt ctgcggcgag ggctgccagg gcggtgcttg cagcaccgac
      481 aagccgtgcg gcaaggccgc cggcggcaaa gtttgcacca acaactactg ctgcagcaag
      541 tggggatcct gtggcatcgg cccgggctac tgcggcgcag gttgccagag cggcggctgc
      601 gacggtgtct tcgccgaggc catcgccgcc aactccactc ttgtcgcaga atgatgatct
      661 tgctaatggc agtattattg caacgacgaa taatccgtgg cagttttgtt gccacgtacg
      721 gtctcccttc acttactttt agcactagtc cttaataatt ctccagcctt gcaatatgac
      781 gtgcaggttg ctacatgcat ggacatattg cagtgagaag tactgtgtgg caatataggg
      841 tgtactattg ttgccacaaa tttagttctt tcttgttacg tacgtacagt tgtcaggatg
      901 catgcatccc cgttgtaatg ttggagtact ccatgatttc gttgcaatat atatattgcc
      961 atgagtctaa ag
//
LOCUS       CHKMTCYB      225 bp ds-DNA             ORG       14-AUG-1990
DEFINITION  Chicken mitochondrial cytochrome b, partial cds.
ACCESSION   M26231 M28048
KEYWORDS    cytochrome; cytochrome b.
SOURCE      Chicken mitochondrial DNA.
  ORGANISM  Mitochondrion Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae; Gallus gallus.
REFERENCE   1  (bases 1 to 225)
  AUTHORS   Kocher,T.D., Thomas,W.K., Meyer,A., Edwards,S.V., Paeaebo,S.,
            Villablanca,F.X. and Wilson,A.C.
  TITLE     Dynamics of mitochondrial DNA evolution in animals: Amplification
            and sequencing with conserved primers
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 6196-6200 (1989)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 225)
  AUTHORS   Kocher,T.D. and White,T.J.
  TITLE     Evolutionary analysis via PCR
  JOURNAL   (in) Erlich,H. (Ed.);
            PCR Technology: Princip. and Appl. for DNA amplification: 137-147;
            Stockton Press, New York (1989).
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1],[2] kindly
            submitted by T.D.Kocher, 18-JUL-1989.
               Bernie 12-SEP-1989 initial full entry [1]
               Mia 13-AUG-1990 citation update
FEATURES       from  to/span     description
    pept    <     1  >   225     cytochrome b (AA at 1)
BASE COUNT       56 a     84 c     35 g     50 t
ORIGIN      
        1 accgccctac tactagccat gcactacaca gcagacacat ccctagcctt ctcctccgta
       61 gcccacactt gccggaacgt acaatacggc tgactcatcc ggaatctcca cgcaaacggc
      121 gcctcattct tcttcatctg tatcttcctt cacatcggac gaggcctata ctacggctcc
      181 tacctctaca aggaaacctg aaacacagga gtaatcctcc tcctc
//
LOCUS       HUMMUC18A    2940 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human MUC18 gene mRNA (3' variant), complete cds.
ACCESSION   M29277
KEYWORDS    MUC18 glycoprotein; cell adhesion molecule;
            immunoglobulin-like protein; integral membrane glycoprotein.
SOURCE      Human (isolate JuSo) melanoma (cell line Mel JuSo), cDNA to mRNA,
            clone zapy1, drop 1, drop 4, drop 4.2, drop 4.7, drop 4.11.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2940)
  AUTHORS   Lehmann,J.M., Riethmueller,G. and Johnson,J.P.
  TITLE     MUC18, a marker of tumor progression in human melanoma, shows
            sequence similarity to the neural cell adhesion molecules of the
            immunoglobulin superfamily
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 9891-9895 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            J.P.Johnson, 06-OCT-89.
FEATURES       from  to/span     description
    pept          8     1819     MUC18 glycoprotein precursor
    sigp          8       91     MUC18 glycoprotein signal peptide
    matp         92     1816     MUC18 glycoprotein
    site        122      373     immunoglobin like (V set) domain I
    site        464      691     immunoglobin like (V set) domain II
    site        793      985     immunoglobin like (C2 set) domain
    site       1073     1246     immunoglobin like (C2 set) domain
    site       1334     1522     immunoglobin like (C2 set) domain
BASE COUNT      668 a    861 c    856 g    555 t
ORIGIN      
        1 gggaagcatg gggcttccca ggctggtctg cgccttcttg ctcgccgcct gctgctgctg
       61 tcctcgcgtc gcgggtgtgc ccggagaggc tgagcagcct gcgcctgagc tggtggaggt
      121 ggaagtgggc agcacagccc ttctgaagtg cggcctctcc cagtcccaag gcaacctcag
      181 ccatgtcgac tggttttctg tccacaagga gaagcggacg tcatcttccg tgtgcgccag
      241 ggccagggcc agagcgaacc tggggagtac gagcaagcgg ctcagcctcc aggacagagg
      301 ggctactctg gccctgactc aagtcacccc ccaagacgag cgcatcttct tgtgccaggg
      361 caagcgcctc ggtcccagga gtaccgcatc cagctccgcg tctacaaagc tccggatgcc
      421 aaacatccag gtcaaccccc tgggcatccc tgtgaacagt aaggagcctg aggaggtcgc
      481 tacctgtgta gggaggaacg ggtaccccat tcctcaagtc atctggtaca agaatggccg
      541 gcctctgaag gaggagaaga accgggtcca cattcagtcg tcccagactg tggagtcgag
      601 tggtttgtac accttgcaga gtattctgaa ggcacagctg gttaaagaag acaaagatgc
      661 ccagttttac tgtgagctca actaccggct gcccagtggg aaccacatga aggagtccag
      721 ggaagtcacc gtccctgttt tctacccgac agaaaaagtg tggctggaag tggagcccgt
      781 gggaatgctg aaggaagggg accgcgtgga aatcaggtgt ttggctgatg gcaaccctcc
      841 accacacttc agcatcagca agcagaaccc cagcaccagg gaggcagagg aagagacaac
      901 caacgacaac ggggtcctgg tgctggagcc tgcccggaag gaacacagtg ggcgctatga
      961 atgtcaggcc tggaacttgg acaccatgat atcgctgctg agtgaaccac aggaactact
     1021 ggtgaactat gtgtctgacg tccgagtgag tcccgcagcc cctgagagac aggaaggcag
     1081 cagcctcacc ctgacctgtg aggcagagag tagccaggac ctcgagttcc agtggctgag
     1141 agaagagaca gaccaggtgc tggaaagggg gcctgtgctt cagttgcatg acctgaaacg
     1201 ggaggcagga ggcggctatc gctgcgtggc gtctgtgccc agcatacccg gcctgaaccg
     1261 cacacagctg gtcaagctgg ccatttttgg ccccccttgg atggcattca aggagaggaa
     1321 ggtgtgggtg aaagagaata tggtgttgaa tctgtcttgt gaagcgtcag ggcacccccg
     1381 gcccaccatc tcctggaacg tcaacggcac ggcaagtgaa caagaccaag atccacagcg
     1441 agtcctgagc accctgaatg tcctcgtgac cccggagctg ttggagacag gtgttgaatg
     1501 cacggcctcc aacgacctgg gcaaaaacac cagcatcctc ttcctggagc tggtcaattt
     1561 aaccaccctc acaccagact ccaacacaac cactggcctc agcacttcca ctgccagtcc
     1621 tcataccaga gccaacagca cctccacaga gagaaagctg ccggagccgg agagccgggg
     1681 cgtggtcatc gtggctgtga ttgtgtgcat cctggtcctg gcggtgctgg gcgctgtcct
     1741 ctatttcctc tataagaagg gcaaggctgc cgtgcaggcg ctcagggaag caggagatca
     1801 cgctgccccc gtctcgtaag accgaacttg tagttgaagt taagtcagat aagctcccag
     1861 aagagatggg cctcctgcag gcagcagcgg tgacaagagg gctccgggag accagggaga
     1921 gaaatacatc gatctgaggc attagccccg aatcacttca gctcccttcc ctgcctggac
     1981 cattcccagc tccctgctca ctcttctctc agccaaagct caaagggact agagagaagc
     2041 ctcctgctcc cctcgcctgc acaccccctt tcagagggcc actgggttag gacctgagga
     2101 cctcacttgg ccctgcaagg cccgcttttc agggaccagt ccaccaccat ctcctccacg
     2161 ttgagtgaag ctcatcccaa gcaaggagcc ccagtctccc gagcgggtag gagagtttct
     2221 tgcagaacgt gttttttctt tacacacatt atgctgtaaa tacgctcgtc ctgccagcag
     2281 ctgagctggg tagcctctct gagctggttt cctgccccaa aggctggcat tccaccatcc
     2341 aggtgcacca ctgaagtgag gacacaccgg agccaggcgc ctgctcatgt tgaagtgcgc
     2401 tgttcacacc cgctccggag agcaccccag cagcatccag aagcagctgc agtgcaagct
     2461 tgcatgcctg cgtgttgctg caccaccctc ctgtctgcct cttcaaagtc tcctgtgaca
     2521 ttttttcttt ggtcagaggc caggaactgt gtcattcctt aaagatacgt gccggggcca
     2581 ggtgtggctc acgcctgtaa tcccagcact ttgggaggcc gaggcggcgg atcacaaagt
     2641 cagacgagac catcctggct aacacggtga aaccctgtct ctactaaaaa tacaaaaaaa
     2701 aattagctag gcgtagtggt tggcacctat agtcccagct actcggaagg ctgaagcagg
     2761 agaatggtat gaatccagga ggtggagctt gcagtgagcc gagaccgtgc cactgcactc
     2821 cagcctgggc aacacagcga gactccgtct cgagccggcc ggttgcgcgg gccctcggac
     2881 cctcagagag gcgagggttc gagggcacga gttcgaggcc aacctggtcc acatgggttg
//
LOCUS       HUMTELB       262 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human DNA from the end of the chromosomal telomere.
ACCESSION   M19947 J04078
KEYWORDS    repetitive sequence.
SOURCE      Human placental DNA, clone pHuR 143.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 262)
  AUTHORS   Moyzis,R.K., Buckingham,J.M., Cram,L.S., Dani,M., Deaven,L.L.,
            Jones,M.D., Meyne,J., Ratliff,R.L. and Wu,J.R.
  TITLE     A highly conserved repetitive DNA sequence, (TTAGGG)n, present at
            the telomeres of human chromosomes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 6622-6626 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence [1] kindly submitted by
            S.Jett 27-JUL-88.
            
            The (ttaggg)n telomeric sequence is found at the ends of all human
            chromosomes as well as in some other higher eukaryotic genomes.
FEATURES       from  to/span     description
    site         80       81     rare insertion/substitution
    site        243      244     rare insertion/substitution
BASE COUNT       44 a      0 c    131 g     87 t
ORIGIN      Unreported.
        1 tttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg
       61 gttagggtta gggttaggga gtagggttag ggttagggtt agggttaggg ttagggttag
      121 ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag
      181 ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag
      241 ggtgttaggg ttagggttag gg
//
LOCUS       LCOLHICDH    1320 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  L.confusus L-2-hydroxyisocaproate dehydrogenase (L-HicDH) gene,
            complete cds.
ACCESSION   M31425 M28050
KEYWORDS    L-2-hydroxyisocaproate dehydrogenase.
SOURCE      L.confusus (strain DSM20196) DNA.
  ORGANISM  Lactobacillus confusus
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 1320)
  AUTHORS   Lerch,H.-P., Frank,R. and Collins,J.
  TITLE     Cloning, sequencing and expression of the L-2-hydroxyisocaproate
            dehydrogenase-encoding gene of Lactobacillus confusus in
            Escherichia coli
  JOURNAL   Gene 83, 263-270 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        328     1260     L-2-hydroxyisocaproate dehydrogenase
    ORF     <     1      243     ORF (AA at 1)
    mRNA        301     1304     LDH mRNA (3' end put.)
    signal     1267     1303     transcription termination signal
    binding     318      323     ribosome binding site
    signal      267      272     -35 region
    signal      290      295     -10 region
BASE COUNT      333 a    263 c    353 g    371 t
ORIGIN      
        1 ttctttttgt acaccatgac aatgtttgct gctaagacat cttcgccagt tgaaacggcc
       61 cgtttgtcag gtatggcgca agctggtggt tactttatgt ctgcattcgg tccaatgctt
      121 tatggtatgg cttttaccgc aaatccgaat ggggtcatcc aaaacgtcgt ttatttggta
      181 ttggtgattg tgatgattgt cgctgccgtg atgatggcga tgactaagca tttgtttgac
      241 taaaataatg atgagatgtt gttcagtggt gaacaacgtc tcttttttgt ataatgaaca
      301 tacgacaaaa acaatttggg ggatattatg gcacgtaaga ttggaattat cggccttgga
      361 aacgttgggg ctgcagtagc gcacggattg attgcacaag gtgtagccga cgactacgtc
      421 tttattgatg caaacgaagc aaaggtgaag gctgatcaaa ttgatttcca agacgcaatg
      481 gcgaacttgg aagcgcacgg taacattgtg attaacgatt gggcagcctt ggctgatgct
      541 gatgttgtga tttcaacact ggggaacatc aagttgcaac aagacaaccc aaccggtgac
      601 cgttttgctg agttgaagtt taccagcagc atggtgcaat cagtcggcac aaacttgaag
      661 gaatctggtt tccacggcgt attggtcgtg atttcaaacc cggtcgacgt gattacggcc
      721 ttgttccaac acgtgactgg tttcccagct cacaaggtta tcggaaccgg tactttgctt
      781 gacacggcgc gtatgcaacg tgcagttggt gaggcgtttg atttggatcc acgttctgtt
      841 tcaggttaca acttgggtga gcacggtaac tcacaattcg tagcttggtc aacggtgcgc
      901 gtgatgggtc aaccaatcgt gacgttggct gatgccggcg atattgactt ggcggccatc
      961 gaagaggaag cacgtaaggg tggcttcacg gtcttgaatg gtaagggcta cacgagttat
     1021 ggtgttgcaa cgtcagcaat ccgcattgcc aaggctgtta tggctgacgc gcatgctgaa
     1081 ttggttgtct caaatcgtcg cgatgacatg ggaatgtact tgtcataccc agcgattatt
     1141 ggtcgcgatg gtgtcttggc agaaacgacg cttgatttga cgacggatga gcaagaaaag
     1201 cttttgcaat cacgtgacta catccaacaa cgtttcgacg aaattgtgga tacactctaa
     1261 aaacacaaaa agtggcttca atctggtgat tggaccactt ttttctaatt gtgcatgaaa
//
LOCUS       SIVAGM266     300 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Simian (African green monkey) immunodeficiency virus, isolate 266,
            partial pol cds.
ACCESSION   M22974
KEYWORDS    .
SOURCE      Simian (African green monkey) immunodeficiency virus proviral DNA,
            isolate 266.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Li,Y., Naidu,Y.M., Daniel,M.D. and Desrosiers,R.C.
  TITLE     Extensive genetic variability of simian immunodeficiency virus from
            African green monkeys
  JOURNAL   J. Virol. 63, 1800-1802 (1989)
  STANDARD  full staff_review
COMMENT     Hard copy of sequence kindly provided by R.C. Desrosiers,
            25-JAN-1989.
FEATURES       from  to/span     description
    pept    <     1  >   300     pol polyprotein (AA at 1)
    pre-msg <     1  >   300     genomic mRNA
    pre-msg <     1  >   300     tat, rev, nef subgenomic mRNA
BASE COUNT      108 a     59 c     74 g     59 t
ORIGIN      
        1 gcaggactac tggcaggcag ttggatacct gactggacct ttgtaagcgt cccacctctg
       61 gtaacattgt ggtacacctt aacaaaggaa cccatcccag gagaagatgt gtattatgta
      121 gatggagctt gcaacaggaa ctcaagagaa ggaaaagcag ggtatataac tcagcaaggg
      181 aaacagagag tggagaagct agaaaatacc acaaatcagc aagcagaatt aacagctata
      241 aaaatggcct tagaagacag tggtcccagg gtaaacattg ttacagactc tcaatatgca
//
LOCUS       STYPOL3E      454 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  S.typhimurium DNA polymerase III epsilon subunit gene, 5' end.
ACCESSION   M26045 M29726
KEYWORDS    polymerase III.
SOURCE      S.typhimurium (strain LT2) DNA, clone pFF16.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 454)
  AUTHORS   Lancy,E.D., Lifsics,M.R., Kehres,D.G. and Maurer,R.
  TITLE     Isolation and characterization of mutants with deletions in dnaQ,
            the gene for the editing subunit of DNA polymerase III in
            Salmonella typhimurium
  JOURNAL   J. Bacteriol. 171, 5572-5580 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by R.A.Maurer, 10-JUL-1989.
FEATURES       from  to/span     description
    pept         59  >   454     polymerase III epsilon (editing) subunit
BASE COUNT      107 a     90 c     96 g    104 t     57 others
ORIGIN      
        1 gtagacttcc tgtaattgaa atcgataaca aaacgcaagt ctgacataaa tgaccgatat
       61 gagcactgca attacacgac agatcgtcct cgataccgaa accaccggta tgaatcagat
      121 aggcgcgcac tatgaaggtc acaagattat tgagatcggt gcggttgagg tgataaaccg
      181 tcgtctgacc ggcaacaatt ttcatgttta cctgaagccc gatcgccttg tcgatccaga
      241 ggcttttggc gtacacggta ttgccgatga gtttctgctg gataagccgg tttttgctga
      301 tgtggtcgat gagtttcttg attatatnnn nggcgcggag ctggtcatcc ataacgcatn
      361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncctaaaac
      421 caatactttc tgcaaagtta ccgacagcct ggcg
//
LOCUS       RHMSYRMA     2636 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  R.meliloti syrM gene, complete cds.
ACCESSION   M33495
KEYWORDS    symR gene.
SOURCE      R.meliloti (strain 1021) DNA, clones pMB[1,2].
  ORGANISM  Rhizobium meliloti
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 2636)
  AUTHORS   Long,S.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 850 to 1990)
  AUTHORS   Barnett,M.J. and Long,S.R.
  TITLE     DNA sequence and translational product of a new
            nodulation-regulatory locus: SyrM has sequence similarity to NodD
            protein
  JOURNAL   J. Bacteriol. 172, 3695-3700 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.R.Long, 02-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        991     1971     SyrM protein
BASE COUNT      534 a    769 c    792 g    541 t
ORIGIN      
        1 gcatgcgcga cgagctgctg aacgagagtc tgttcttcgg tctcagccat gccagcaggg
       61 ccatttccaa atggtcgacg actgcaatac gttccggccg cactcgtcgc tcggataccg
      121 caccccggca gcctatcctg ggatcatcgc cgcaaccgag ccagttggtg tatcgaaaac
      181 ggtcgaggct ctaatcgccg ctggatgaaa agttcagtgg caggtccgaa aggccacgta
      241 ttcggagcca acctcctcta gcgatcgctc tctcctgaaa acttgacagg gcgtcccccg
      301 accgccttgc cgaggcggcg cgtgcccgac ggcggcctcg acagcaagcc ggtctcgacg
      361 gtcttctatc ggcgctgcag gcaatgggca tttgaaaccg tttcaagagt ctccttaaca
      421 gagctcgcgc cgggtgaaga ccctgtatga tggatagtga agtgcagggg cggctgctta
      481 ccggccgttg agtgaggcta tgtatccaaa acgtggatga cagtcatgcg aaaacgcgat
      541 ttctcctgtc tcgccgaatg ccattacaag aacgttagat ataaccggag ggcagaagtg
      601 ggcagcgaga tcaattccga cagactcctt tgcaatgaaa gcatgatttg cgaccagggc
      661 ctcagatgtg ttgaggcagt tttcttcagt aagtttcagg cggggccact ttggcgagat
      721 cattttgtgg cttgctcgtt ggcactgaaa tatgatgtga gccggcaatt cctcgaaagc
      781 attctggacc agcgggtctt cgacatccat caggctaggc ctgcgaggcc cggggggcgt
      841 cacagccaag tacttctgat taacggaaaa tcacgacgct tcccatttgt cggcgggcca
      901 ggaaagtctt tggcacgcac acccatattg tgacgacctg gatcttcggg atgtggcgcg
      961 ggaacattac ctgccgagga gaatagatcc atggatcagc ccacttggaa gcggccgcat
     1021 agggccaaat ttgccggtgt cagtgacgcc gcacaacagc ggcaaatgcc gaaccttgcg
     1081 tccatcgacc taaacctact ggtggatctt gaggctctgc tgcagtatcg gcacattact
     1141 caagcggccc agcacgtcgg ccgtagccaa ccggcgatga gcagagcctt gtcaaggctg
     1201 cgcggcatgt taaaggacga tcttctggtt gccggctcta gaggcttggt cctgacgcct
     1261 ctggctgaat gcttgaccca gatgctgcct tcagtactag acgcgattcg ccagatgatg
     1321 aacctcagct tggctccggc gcaacggcga tggaaggtga caatggctat gcccgatcac
     1381 caagcggtgg ttctgctgcc acatctgttg ccgcggttac acgagcgcgc ccctcatctc
     1441 gacattgtga ccgatccgct tttgggcggc gcgctggggt tacttgaaca aggtgagatc
     1501 gacgtggtcg tcgggcagat gggcgccgct ccgctcggct acttgcggcg cagactctac
     1561 gccgacagct tcacctgcgt gctacgccac aatcacccgg ccttggcgca ggagtggacg
     1621 atcgaggctt tcgcggcctt gcgccacgtc gccattgcct cggaacctga cgagcttttc
     1681 ggtcagatct atgacaggtt aaccaaactc ggactgcagc gtggtgatcc gatggtggtt
     1741 tccaccgtac tgaccgcggc ggtcttgatc gcggcgactg actcagtgct ggttgtacca
     1801 agccgcgtgg cgacccgtgt cgccgccatg ctgtcgctcg cggtgattcc cccgcctgtg
     1861 gaactgagac catacgaagt cgcgctgatc tggcacgagc ggtgccatcg agacccagag
     1921 catcgctggc tgcgtggcga aatcgccgcc gcagcgtcca cggccggtta gacgaagact
     1981 agcgaatagg cgaggatggg tgagcataat gaatgacgac ggtttgatga ttgactatcc
     2041 cggggcggtg gaggacccgg tcgatgcgcg atttgcccga tgactcacac ccgggagggg
     2101 gtgagtccga gcgcgcctac cacatgcggc tgggtccaga caccgccgtt ttcgtcaacc
     2161 ggccgaagtc gcaagggggc acgagcgagg atcgagttgc aacgatgagc cgtcatagcg
     2221 tcatagcggc tgtgcacgcc ccaatcccgg cgggttcgcg gcgttgtgtc aaatgctcct
     2281 ctcatcgcta ccgatgacaa tacgcgaccc tgctccgaac gtaggtgttc cgtcagtcgc
     2341 cccacgtcgc cagcgcggac tgcgatccgc tgagacggaa ctatgcttag ccgccgcatt
     2401 ccgtcgcggt gtgggaaatt gcacgcgcag cagagtcttg acgatgctaa aaccatgagg
     2461 agcccttgga ggcaccggtc ttttcagagg tgctctctct gctgcgcgcc atccatactt
     2521 tggacgatct gcaccgtcga gaacgcaata acgctcctgg cttctgccta acgatacggt
     2581 ccgacccggc gtgaatcgac atgagcttag cgtacccatc gtcgcaaggt gcatgc
//
LOCUS       ECCCPL        615 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  E.granulosus cyclophilin mRNA, 3'end.
ACCESSION   J04664 M24616
KEYWORDS    cyclophilin.
SOURCE      E.granulosus, cDNA to mRNA, clone EA21.
  ORGANISM  Echinococcus granulosus
            Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Cestoda; 
            Eucestoda; Cyclophyllidea; Taeniidae.
REFERENCE   1  (bases 1 to 615)
  AUTHORS   Lightowlers,M.W., Haralambous,A. and Rickard,M.D.
  TITLE     Amino acid sequence homology between cyclophilin and a cDNA-cloned
            antigen of Echinococcus granulosus
  JOURNAL   Mol. Biochem. Parasitol. 36, 287-290 (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Lightowlers, 02-MAY-1989.
            
            This E.granulosus antigen exhibits 71% amino acid homology with
            human cyclophilin.
FEATURES       from  to/span     description
    pept    <     1      488     antigen ORF (cyclophilin) (AA at 3)
    mRNA    <     1      615     CPL mRNA
BASE COUNT      145 a    135 c    166 g    169 t
ORIGIN      
        1 tgggcgtgaa gtgcttcttc gacatctcca ttggcggtaa gccagcgggt cgcatcgtct
       61 ttgccctgtt tgatgacgtc ccaaagacag tagagaactt ccgtgcactg tgcaccggtg
      121 agaagggttt cggatacaag ggctcgaagt ttcaccgaat tattcccggt tttatgtgcc
      181 aaggtggtga ttttactgcc gggaatggta ccggtggcaa gagcatatac gggagcaaat
      241 ttgaggatga aaatttcaat cacaagcaca gcaagccgat gatgctctcg atggcgaatg
      301 cgggtaagaa caccaatggc tcgcaattct tcatcactac cgccgtcacc agctggcttg
      361 atgggaagca tgttgtcttt ggtgaggtag aaagtggcga agatgttgtt aaggatatgg
      421 aggccgtcgg tagcagtagt ggcaagactt cccaagaagt actgattact gactgcggtc
      481 agctgtaaat caatcggccg aacgttcact ctattccctt tattaattgt atcgcgctaa
      541 tcctgctcct cattgcttct ttacccgttt gtggccgtaa cgtgtttcaa caatgaagtc
      601 gttgtgcgta ttttg
//
LOCUS       ECOINTDLP    2269 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  E.coli ArgU-tRNA gene overlap with prophage DLP12 integrase gene,
            complete cds.
ACCESSION   M27155
KEYWORDS    insertion element IS3; integrase; transfer RNA-ArgU.
SOURCE      E.coli (strain K12) DNA, clones pDC1 and pDM1614.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2269)
  AUTHORS   Lindsey,D.F., Mullin,D.A. and Walker,J.R.
  TITLE     Characterization of the cryptic lambdoid prophage DLP12 of
            Escherichia coli and overlap of the DLP12 integrase gene with the
            tRNA gene ArgU
  JOURNAL   J. Bacteriol. 171, 6197-6205 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by D.F.Lindsey, 16-AUG-1989.
            
            A homologous region to bacteriophage P22 excisionase is located at
            positions 1423-1205.  Positions 1727-1422 are 97.4 % identical to
            phage lambda positions 31627-31322 encoding the C-terminal end of
            exonuclease, and bases 1726-2126 are 92 % identical to phage lambda
            bases 39972-40372 encoding the C-terminal 102 residues of the acid
            P gene product and the N-terminal 31 residues of the ren gene
            product.
FEATURES       from  to/span     description
    tRNA         73      149     Arg-tRNA
    tRNA         48      227     Arg-tRNA precursor (alt.)
    tRNA         48      237     Arg-tRNA precursor (alt.)
    pept       1328      165 (c) DLP12 integrase
    trns       2127  >  2269     insertion element IS3 (alpha-4/beta-4)
    signal       14       19     -35 region
    signal       36       41     -10 region
BASE COUNT      539 a    596 c    512 g    622 t
ORIGIN      
        1 agatcaaaag ccattgactc agcaagggtt gaccgtataa ttcacgcgat tacaccgcat
       61 tgcggtatca acgcgccctt agctcagttg gatagagcaa cgaccttcta agtcgtgggc
      121 cgcaggttcg aatcctgcag ggcgcgccat tacaattcaa tcagttacgc cttctttata
      181 tcctccataa tttcagagtg ggacatattt gggacattat caccaaaaat gtcgtctatt
      241 ttcctcgcat gctctgtcaa atgattaggc gcaaggtgag catacctacg aaccatttct
      301 atggactccc atccgcccat ttcctgaagc actgataatg ggacgcctga ctgaatcagc
      361 cagcttgccc aggtgtgtct gaggtcatgg aaacggaaat cttcaattcc tgcacgacga
      421 caagctgata gccatgatgt cttgctgtcg atgcgcatct tcctgaccgc aggcgttgat
      481 gttccatctg ctcgcttagc cgccttggta tgtacaaaca cccatttgtg atgcttgcct
      541 atttgatcac gcaacacttt acaggcggta tcgttcagcg ccacaccaat ggcgcggttt
      601 gatttgctct cttctggatt cacccaggca actcgtcgct gcatgtcgat ttgttgccat
      661 tccagattta tgatgttcga ctttctcaga ccagttgcca gcgcaaactt gacgacagat
      721 ttcagtggtt cggggcactc atcaataagg cgttttgctt cctccttttc cagccatctg
      781 actcgcttgt ttctgaccgc tggtatcttg atgacaggcg ctttttccag ccacttccag
      841 tcgcgttctg cagcacggag aatggccttt atcatggcaa gatgctttgc ctttgtctga
      901 gttgatactg gctttggttc ataaacaggc agttctttac ctttcctgat ggcggcctga
      961 actttctgtt tccatatttc tttcgtcttt ctgttatgca ttctgcttac agcagagtaa
     1021 atctttgcct ccgagatatc tttaagcctt ataccctcaa aatgttcaag ccagaactca
     1081 atccggcttt tatctgaatc gagagatttt ttatcagctt tttcctcaag ccatcttagg
     1141 caggcctctt caaaagtgac atcaggtaaa tcccctagct tttctactcg ccagagttct
     1201 gcttttcgct tgtcgtgcaa ctcctgagct tgccgtttgt cctttgtgcc aagagattcc
     1261 ttaattcgtt tcccgcccgg gagcgaatac gaggcatacc atatttcatt tctgcggaag
     1321 agtgacattt tctttcctct gttatgccat cacccgcgct cacctggaca gtatgcagcg
     1381 gagactgaag cgccgcaatg caggcttgcc gtgttgtgag gtacccggat attatcgtga
     1441 ggatgcgtca tcgccattgc tccccaaata caaaaccaat ttcagccagt gcctcgtcca
     1501 ttttttcgat gaactccggc accatctcgt caaaactcgc catgtacttt tcattccgct
     1561 caatcacgac ataatgcagg ccttcacgct tcatgcgcgg gtcatagttg gcaaagtacc
     1621 aggcatcttt tcgcgtcacc cacatgctgt actgcacctg ggccatgtaa gccgatttta
     1681 ttgcctcgaa accaccgagc cggaatttca tgaaatcccg ggaggtacga gtattgccgg
     1741 aagcgtggcc tgtatccgga tgcagagtct tatccgtgga aatcgaacgc gcattactgg
     1801 ttggttacca acttgtacca gaacatgcgg gccaatgcgc tggctgacgc ggaattacgg
     1861 cgcaaggctg ccgatgaact gacctgtatg acagcgcgaa ttaaccgtgg tgagacgata
     1921 cctgaaccag taaaacaact tcctgttatg ggcggtagac ctctaaatcg tgcacaggct
     1981 ctggcgaaga tcgcagaaat taaagctaag ttcggactga aaggagcaag tgtatgacgg
     2041 gcaaagaggc aattattcat tacctgggga cgcataagag cttctgtgca caggacgttg
     2101 ccgcggtaac aggcgcaacc gtaatctgat cttacccagc aatagtggac acgcggctaa
     2161 gtgagtaaac tctcagtcag aggtgactca catgacaaaa acagtatcaa ccagtaaaaa
     2221 accccgtaaa cagcattcgc ctgaatttcg cagtgaagcc ctgaagctt
//
LOCUS       MHVJHM2CA      49 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Mouse hepatitis virus leader RNA, clone JHM-a.
ACCESSION   M22059
KEYWORDS    .
SOURCE      MHV mouse hepatitis virus (strain JHM) RNA, clone JHM-a.
  ORGANISM  Murine hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 49)
  AUTHORS   Makino,S. and Lai,M.M.C.
  TITLE     Evolution of the 5'-end of genomic RNA of murine coronaviruses
            during passages in vitro
  JOURNAL   Virology 169, 227-232 (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT       16 a     10 c      4 g     19 t
ORIGIN      47 bp upstream of 5' end of genomic RNA.
        1 cttgtagttt aaatctaatc taatctaaac tttataaacg gcacttcct
//
LOCUS       MHVJHM2CB      44 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Mouse hepatitis virus leader RNA, clone JHM-b.
ACCESSION   M23059
KEYWORDS    .
SOURCE      MHV mouse hepatitis virus (strain JHM) RNA, clone JHM-b.
  ORGANISM  Murine hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 44)
  AUTHORS   Makino,S. and Lai,M.M.C.
  TITLE     Evolution of the 5'-end of genomic RNA of murine coronaviruses
            during passages in vitro
  JOURNAL   Virology 169, 227-232 (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT       14 a      9 c      4 g     17 t
ORIGIN      47 bp upstream of 5' end of genomic RNA.
        1 cttgtagttt aaatctaatc taaactttat aaacggcact tcct
//
LOCUS       MHVJHM2CC      49 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Mouse hepatitis virus leader RNA, clone B1-c.
ACCESSION   M23060
KEYWORDS    .
SOURCE      MHV mouse hepatitis virus (strain B1) RNA, clone B1-c.
  ORGANISM  Murine hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 49)
  AUTHORS   Makino,S. and Lai,M.M.C.
  TITLE     Evolution of the 5'-end of genomic RNA of murine coronaviruses
            during passages in vitro
  JOURNAL   Virology 169, 227-232 (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT       16 a     10 c      4 g     19 t
ORIGIN      47 bp upstream of 5' end of genomic RNA.
        1 cttgtagttt aaatctaatc taatctaaac tttataaacg gcacttcct
//
LOCUS       MHVJHM2CD      44 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Mouse hepatitis virus leader RNA, clone B1-d.
ACCESSION   M23061
KEYWORDS    .
SOURCE      MHV mouse hepatitis virus (strain B1) RNA, clone B1-d.
  ORGANISM  Murine hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 44)
  AUTHORS   Makino,S. and Lai,M.M.C.
  TITLE     Evolution of the 5'-end of genomic RNA of murine coronaviruses
            during passages in vitro
  JOURNAL   Virology 169, 227-232 (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT       14 a      9 c      4 g     17 t
ORIGIN      47 bp upstream of 5' end of genomic RNA.
        1 cttgtagttt aaatctaatc taaactttat aaacggcact tcct
//
LOCUS       MHVJHM2CE      45 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Mouse hepatitis virus leader RNA, clone JHM-2c-e.
ACCESSION   M23062
KEYWORDS    .
SOURCE      MHV mouse hepatitis virus (strain JHM-2c) RNA, clone JHM-2c-e.
  ORGANISM  Murine hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 45)
  AUTHORS   Makino,S. and Lai,M.M.C.
  TITLE     Evolution of the 5'-end of genomic RNA of murine coronaviruses
            during passages in vitro
  JOURNAL   Virology 169, 227-232 (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT       14 a     10 c      4 g     17 t
ORIGIN      47 bp upstream of 5' end of genomic RNA.
        1 cttgtagttt aaatctaatc taatctaatc taaacggcac ttcct
//
LOCUS       MHVJHM2CF      40 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Mouse hepatitis virus leader RNA, clone JHM-2c-f.
ACCESSION   M23063
KEYWORDS    .
SOURCE      MHV mouse hepatitis virus (strain JHM-2c) RNA, clone JHM-2c-f.
  ORGANISM  Murine hepatitis virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Coronaviridae.
REFERENCE   1  (bases 1 to 40)
  AUTHORS   Makino,S. and Lai,M.M.C.
  TITLE     Evolution of the 5'-end of genomic RNA of murine coronaviruses
            during passages in vitro
  JOURNAL   Virology 169, 227-232 (1989)
  STANDARD  simple staff_entry
COMMENT     Draft entry kindly provided by S.Makino, 03-JAN-1989.
BASE COUNT       12 a      9 c      4 g     15 t
ORIGIN      47 bp upstream of 5' end of genomic RNA.
        1 cttgtagttt aaatctaatc taatctaaac ggcacttcct
//
LOCUS       MUSPDGFA      906 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse platelet-derived growth factor A chain (PDGFA) mRNA,
            complete cds.
ACCESSION   M29464
KEYWORDS    platelet-derived growth factor;
            platelet-derived growth factor A chain.
SOURCE      Mouse F9 embryonal carcinoma cell line, cDNA to mRNA, clone F9A5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 906)
  AUTHORS   Mercola,M.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
REFERENCE   2  (sites for [1])
  AUTHORS   Mercola,M., Wang,C., Kelly,J., Brownlee,C., Jackson-Grusby,L.,
            Stiles,C. and Bowen-Pope,D.
  TITLE     Selective expression of PDGF A and its receptor during early mouse
            embryogenesis
  JOURNAL   Dev. Biol. 138, 114-122 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Mercola, 26-OCT-1989.
FEATURES       from  to/span     description
    pept         63      653     platelet-derived growth factor A chain
BASE COUNT      217 a    250 c    270 g    169 t
ORIGIN      
        1 cgggctggag cgcccgcccc ggccctcgcc tccctgccga gcttccagcg cctcgggacg
       61 cgatgaggac ctgggcttgc ctgctgctcc tcggctgcgg atacctcgcc catgccctgg
      121 ccgaggaagc cgagataccc cgggagttga tcgagcggct ggctcgaagt cagatccaca
      181 gcatccggga cctccagcga ctcttggaga tagactccgt aggggctgag gatgccttgg
      241 agacaagtct gagagcccat gggtcccatg ccattaacca tgtgcccgag aagcggcctg
      301 tgcccattcg caggaagaga agtattgagg aagccattcc tgcagtttgc aagaccagga
      361 cggtcattta cgagatacct cggagccagg tggaccccac atcggccaac ttcctgatct
      421 ggcccccatg tgtggaggtg aagcgctgca ctggctgttg taacaccagc agcgtcaagt
      481 gccagccttc acgggtccac caccgcagtg tcaaggtggc caaagtggag tatgtcagga
      541 agaagccaaa attgaaagag gtccaggtga ggttagagga acacctggag tgtgcatgtg
      601 cgacctccaa cctgaaccca gaccatcggg aggaggagac agatgtgagg tgagatagag
      661 ccggccgccc ctctcccggg gacacgatgt gcgtggcgtg tgacattcct gaacatacta
      721 tgtatggtgc ttcattgcca atgtgcgtgc ggtctttgtt ctcctccgtg aaaaaccgtg
      781 tccgaggaca aagagacagt gtccgtttgt tcagtgtgac atcaaagcaa gtactgtagc
      841 actcagagag acagtgagaa agcttccttg tcacagagag cgaaaacaaa accacaaaaa
      901 agaaaa
//
LOCUS       ORFPRTPS      660 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Orf virus homologue of retroviral pseudoprotease gene, complete
            cds.
ACCESSION   M30023 J04371
KEYWORDS    pseudoprotease.
SOURCE      Orf virus (strain NZ2) DNA.
  ORGANISM  Orf virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Parapoxvirus.
REFERENCE   1  (bases 1 to 660)
  AUTHORS   Mercer,A.A., Fraser,K.M., Stockwell,P.A. and Robinson,A.J.
  TITLE     A homologue of retroviral pseudoproteases in the parapoxvirus, orf
            virus
  JOURNAL   Virology 172, 665-668 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and clean copy of sequence [1] kindly submitted by
            A.A.Mercer 12-JUL-1989.
FEATURES       from  to/span     description
    pept        100      579     pseudoprotease homologue
BASE COUNT      157 a    170 c    166 g    167 t
ORIGIN      
        1 gctggacttg cgccgcgcca tgctgacccg cccctgattt ttctggagaa agtgtaaatt
       61 gtacaccccg tagtcgatcg gccgctcgcc accctagcca tggagttctg ccacacggaa
      121 actcttcagg ttgttcgcct cagccaaaac gccactatac cagccagagg atctcctggt
      181 gccgcaggat tagacctatg tagtgcatac gactgtgtca tcccttcgca ttgcagtcgc
      241 gtagtattta ctgaccttct gataaagccc ccttctggtt gctacggacg catagcacca
      301 cgttctggcc ttgcagtcaa acactttatc gacgtgggcg ctggcgtgat cgacgaagac
      361 taccgaggca acgtcggagt ggtgctgttt aactttggaa acagtgactt tgaggttaag
      421 aagggagaca ggatagctca gctaatatgc gaacgcatat cgtgcccagc agttcaagag
      481 gttaactgct tggacaacac tgaccgcggc gactctggat ttggttctac tggatcgggt
      541 gcttgtggtg gccgcgatac agcgtggtac atcagctaat caaaattgta aaatgtagct
      601 tctttttatt cgagagtctc gcacagttgc gtagataaca ctaattacaa cacactttaa
//
LOCUS       SP8ALU156     192 bp ds-DNA             PHG       14-AUG-1990
DEFINITION  Bacteriophage SP82 Alu156 promoter region DNA.
ACCESSION   M22133 M20661 J03904
KEYWORDS    promoter.
SOURCE      Bacteriophage SP82 (in B.subtilis) DNA, clone Alu156.
  ORGANISM  Bacteriophage SP82
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 192)
  AUTHORS   McAllister,C.F. and Achberger,E.C.
  TITLE     Effect of polyadenine-containing curved DNA on promoter utilization
            in Bacillus subtilis
  JOURNAL   J. Biol. Chem. 263, 11743-11749 (1988)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by E.C.Achberger, 14-SEP-1988.
            
            The nucleotide sequence contains a promoter from bacteriophage SP82
            that is recognized by the major B.subtilis RNA polymerase, the
            sigma-43-containing enzyme.  DNA with this sequence displays
            sequence-dependent DNA curvature.
FEATURES       from  to/span     description
    site        131      136     -35 region
    site        147      152     -10 region
    mRNA        159  >   174     Bacteriophage SP82 mRNA start
    mRNA        156  >   192     Alu156 mRNA
BASE COUNT       62 a     45 c     38 g     47 t
ORIGIN      
        1 gaattccccg gatccgtcct tccaagaaag atatcctaac agcacaagag cggaaacacg
       61 ttttgttcta catccagaac aacctctgct aaaattcctg aaaaattttg caaaaagttg
      121 ttgactttct ctacgaggtg tggcataata atcttaacaa cagcaggacg ctaggacgga
      181 tccggggaat tc
//
LOCUS       SP8BAL129     156 bp ds-DNA             PHG       14-AUG-1990
DEFINITION  Bacteriophage SP82 Bal129 promoter region DNA.
ACCESSION   M22134 M20660 J03904
KEYWORDS    promoter.
SOURCE      Bacteriophage SP82 (in B.subtilis) DNA, clone Bal129.
  ORGANISM  Bacteriophage SP82
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   McAllister,C.F. and Achberger,E.C.
  TITLE     Effect of polyadenine-containing curved DNA on promoter utilization
            in Bacillus subtilis
  JOURNAL   J. Biol. Chem. 263, 11743-11749 (1988)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by E.C.Achberger, 14-SEP-1988.
            
            The nucleotide sequence contains a promoter from bacteriophage SP82
            that is recognized by the major B.subtilis RNA polymerase, the
            sigma-43-containing enzyme.  This is the site of transcription
            initiation for a major early mRNA for Bacteriophage SP82.
FEATURES       from  to/span     description
    site         60       65     -35 region
    site         83       88     -10 region
    mRNA         95  >   126     Bacteriophage SP82 mRNA start
    mRNA         95  >   156     Bal129 mRNA
BASE COUNT       60 a     29 c     29 g     38 t
ORIGIN      
        1 gaattccccg gatccgtcac ccctaagaaa aaatatctac agaaaatatg aaaaagttgt
       61 tgacatttct tcccatccat gctataataa agtcatagag aacaacacta tcaaatgaat
      121 ggagagattg atatgatgaa tggggatgac ggatcc
//
LOCUS       XELBGAI      2027 bp ds-DNA             VRT       14-AUG-1990
DEFINITION  X.laevis adult beta-globin gene (beta-AI), 5' flank.
ACCESSION   M27933
KEYWORDS    beta-globin; globin.
SOURCE      X.laevis adult DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2027)
  AUTHORS   Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
            Knoechel,W.
  TITLE     Sequence analysis of the upstream regions of Xenopus laevis
            beta-globin genes and arrangement of repetitive elements within the
            globin gene clusters
  JOURNAL   Mol. Biol. Rep. 14, 17-26 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by W.Knoechel, 13-SEP-1989.
BASE COUNT      792 a    266 c    284 g    685 t
ORIGIN      
        1 agatcttgat acgttaactt tactagaaaa taatttaaac cccaatagcc tggttttgct
       61 tccaatatgg tttaattata ccttagtttt caggataatg gatctttctg taatttggat
      121 cttcatgcct taactgtacc agaaaatcat ttaaacttta aataaaccca atttgcttcc
      181 agtacagttt aattatatct tagtttggat aagtacaagg tactgtttta ttattacagt
      241 gaaaaaggta atcattttaa aaaaaaaata tatattattt ggataaaatg gagtctatgt
      301 gtgatggcct ttccgtaatt ctcggtttct ggcaaacgga tctcatacct gtaataggta
      361 tataaaaaac acacattaaa aaatactaca tatatattta tattcttttt tttttttaaa
      421 gtgtgtaaat tcatgtcttt aaaataataa aatgtattta tatatatata tatatatata
      481 tatatatata tatatatata tatatatata tatatatata tacttcaaca aaaaatttgc
      541 caaattcata catacaaaaa aaataaaata ataattttaa ataattgaat ctgtctagct
      601 gtttatattc tctgctctgc tggatctgac tcctgaaaaa atgtgcagaa gccatttgat
      661 ttacagagct ggaggagaat ggctacatta gtttaaaagc cagaaccagg agaggatgca
      721 ggcaacaaaa atggatacac acaaattaac gtctattaca attatattta caaataacct
      781 taaagccaac ttttttaaaa attattatat attgtaaagt tgcttagaaa ccaatttttt
      841 acttataggg agcaaaaaat agggagatcc tgtaaaacag aagctgcacc aaacatagat
      901 caagctatcg agctttccat acgtatacat ttatttgaaa ggcactgtta aggagccacg
      961 gtgctgtaca gtgcataaaa gtacaatata tatatataaa agtatacaca gggaagacaa
     1021 atcacacaat gaatatacac agagctcata tcagaacaaa cagcttaagt gctttgtggt
     1081 aagagacaca gtgggaagga ggtccctgtc ccgtagagct tacagtctta cagagctcag
     1141 ttcaatccat atgactcaac catttaatta ataaaataat ttgcctttta atcattaatt
     1201 aattccacac ttccatgtat aaatggaata tatatgtaag atttatatgt aatagctata
     1261 tatgtaagat ttgatatttt ttttgtagga ataaaatgaa aatcaggcaa ataaaaaaca
     1321 acatatatat gtttaaaaac ggtgttaatt tctatgcaac atgacatgaa aaagactttt
     1381 caatattttt acatatgtat acataagata tatgattgaa ccatttgaat aataaaatag
     1441 cttttgcctt accatcatta attattctac catactgtat gtaaaaagca ttgctatatg
     1501 taagatttga ttatattttg ttgtaggaat aaaatgaatt ccaggcatat aaaaacacat
     1561 ttataaaaaa catttataaa aaacactaca tatacatata catatatata tatatatata
     1621 tatatatata tatatatatg tatatatata agtttaaaaa gtgtgttaat ttataatgtc
     1681 tttctggaaa tagaatttca cacttcattg tatacaaaat tattaatatt tgtaatattt
     1741 gattatatta tgttgtaggg ataaaatgaa taccaggcat ataaaaacac actttaaaaa
     1801 aaaaaaaata catagataat aatttaataa tttgtattta ttttttctta atattctagc
     1861 tctgctgtaa taaaaaaaac atgcatctaa aagtggtgcc aaatgggagg gtacaaatgg
     1921 gctgggcaaa tgtaacgtgt gcttatccta gccaatcaac aggcagagtg gaaaggggca
     1981 gtgcatcctt acagctacat aaagtctgat ggatggagaa ttagagc
//
LOCUS       XELBGLI      6777 bp ds-DNA             VRT       14-AUG-1990
DEFINITION  X.laevis larval beta-globin gene (beta-LI), 5' flank.
ACCESSION   M27934
KEYWORDS    beta-globin; globin.
SOURCE      X.laevis larval DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 6777)
  AUTHORS   Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
            Knoechel,W.
  TITLE     Sequence analysis of the upstream regions of Xenopus laevis beta-
            globin genes and arrangement of repetitive elements within the
            globin gene clusters
  JOURNAL   Mol. Biol. Rep. 14, 17-26 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by W.Knoechel, 13-SEP-1989.
BASE COUNT     2121 a   1247 c   1228 g   2181 t
ORIGIN      
        1 gaattcaaag cttttttttt tattaaacag ttttattgta ttttcaaacg aaaaacaagc
       61 agaggtaaga cagtcaacag gttaacatta atgctgcgtg aagggtacta tacattgtgt
      121 ttacattaca acttgttgga cattgatatg tcacttctgt gaatttgaag ctttacattt
      181 aactaaaatt tgtgatggaa tgtctaacct gcatcccagt ccaaagaaat ttcaaagtag
      241 aagatgacat aatgttggta gagatagtga tgagcggatt ttttgccagg tatggatttt
      301 ggcaaaattc cgcgcttctt cgtctgcaat tttttttccc aaaactgcag caaaaatcca
      361 ccataaccaa aaagtcacaa agacaaaatt gtcgcagaga caagaaagtc acagtaagac
      421 ttgatgctcc tgattcactt gcactgacca caccactgta ttaaagggca gagaggggac
      481 tataatgcag agacccatgg tccaggctcc tatgaccatg gggtctgctg tatagctgag
      541 ctagttacac cagtaaggcc aaaacaaaat ggacttgcat gctggaataa tatgcaaaac
      601 tgctgcagtg cctgttttta atctgctggg taaagagtgc aattacaaat gttaggataa
      661 ttgcttatca tactctgctg catacactta ggggcccatt tacttagctc gagtgaagga
      721 atagaggaaa aaaacttaga atttcgattg ttttttttgg ctacttcgac ttcgacctta
      781 gacttcgaat cgaacgattc aaactaaaaa tcgtttgact attcgaccat tcgatagtca
      841 aagtactgtc tctttaagaa aaaactttga ccacctagtt cgccacatta aagctaccga
      901 agtcaatgtt agcctatggg gaaggtcccc atatgctttg ctagcttttt ttggtcaaaa
      961 ataaaccatt cgatcgatgg attaaaatcc ttcgaatcga tcgaacgaat aatgctaaat
     1021 cctttgactt cgatattcga actcgaagga tttaacttcg acagtcgaaa atcgagggtt
     1081 aattaaccct cgatattcga ccttaagtaa atttgcactt attattgcaa atatttgggt
     1141 ccatgacaga gtcatctgta tataatgtga aattacaaat actggtgcct cccctgtttt
     1201 actttgctct atgtgagaaa aataatggag tcagtgccat acatatcctt gtgtgtatgg
     1261 tggaaattgt agatgtcttg ggggcaaatt tactaaaggg cgaagtggct aacgctaggg
     1321 aaaattcgcc agcgttacgt caatttgcca cttcgacaat ttagtttacg gttaccatgg
     1381 cgaaaattcg ctagcaatgt aaatagacca gcgcaacttc acaccctaac gctggcgaag
     1441 tcaggatgcc cacattcccc ctacatttcc taacatatgg cacctgaatt atactagggg
     1501 cacatgtgta gggctttttt taagtttccc tgggcctctg tagtgttatg tatttgctgc
     1561 agcaatatac atgtatacaa atttccaatc ggtagcgtaa cctcgaaccg ctgatcgtaa
     1621 catcactagc gcaacttcgc aaatgattgg taacttgtgt gcaacttcgg atcttcgtga
     1681 atttgcgcag ccactgcgaa gctatgcctg gcgaagtgcg gcgaatgcaa gtctcgggat
     1741 ctccgcaggt aagtaaattt gccccatggt cagaggcaag gccagattat gtactaggtg
     1801 acctaagaat caatactgtc cattctaaaa gtgcaagttc ataagtgccc gcaactacag
     1861 aaacaatagg ggagaactaa caatctgttg taaacaacat tacaaggttg gctccctcat
     1921 tgtttatatt atagctgtat aactgtaatg atgagtacga tctaagatat aatgaatctt
     1981 attgcaggca aaacaatcct gttgattaat taatgcttaa attatcagaa attacaaaaa
     2041 cctcaggtcc tgtgcattct ggataacagg tcccatacct gtactaaaac atgggaccag
     2101 ggtgtctgca ttgatcaatc acctctttta tgattgtttt gggccatcac tctacttcaa
     2161 gatgctgatg atatattacc aataaatgtt atattatata cttaaaaatc ttaattgaat
     2221 taatatagtc aaatccttga tggagacaga cctagtagta tcatggataa taaaactagc
     2281 aacagcaagc attggcccga cttgccatct tggagtcttg aaggaatctt ccacctttga
     2341 ggaaaattgg agacagcttg tctatttttc aacctcttct aatatctaat tgaagaagat
     2401 ccttacatac tgtatgtggt ggaaaatgca tgtttcttta aagatatgct gattgttgca
     2461 ccaatctttg ctcaaagatc ttataagaaa tctttaagca tgactgtctg caactatgac
     2521 tattataaaa tcctttccat gtagagtttt catccttttt gtgggtcaaa ggctgcccct
     2581 cagcaatatc aggggaatga aattaaagtc acaaagagca aaacaattcg caccaatagg
     2641 actaaaaatc cacatctcgc aatgcaatat tgttccttaa actgttattg taattgcgaa
     2701 ttttaattgg ccattgcgga ttttaattgc gcactcttaa gaagtgcttg aagttgtcgt
     2761 aatcttttgg agcaaacata acgacttttt cattaagagg tttaattaca ttgacgcatt
     2821 ggcgcaaact ataaaatttg caaatggtct tccactgtcg gaagtggtcg caaaacagtt
     2881 tctgggctcg caaaagctat attaaatttg cgaaagcaaa atgtgttcgc gcaaaggtat
     2941 aacttttgca ttgcgaatag ttttccgtta gcaactttta ttgcattccc ctgtaaatat
     3001 ctaataagca tggcctcgag cccaaaagac acctttttag gtaaagaaat aaatggggat
     3061 ttcattctat aagtaattga atttgcacta aatattagta agtcggtttc ttgccctact
     3121 ccaaccaaac tcaagaactt tcatttatta aagcacaaga aactctaact cacatattaa
     3181 caaatagtta tagttggtca aattgtagct cagttaaggg tatattatat atttctgttt
     3241 gttcgggtgt ggtgggccag tttttgaaac agtcaactgt tttacttaca gcagatgtcc
     3301 aggtggcttg ccatatcttt gtcaaaaaca aatatattgt cagtattgtt ttttcaacat
     3361 ctgccttagt tagataagaa ttgacaatat agaaccaagg gtatctaaaa atgctgctct
     3421 gccttgcaat ctaatggtgg gtgggtccaa tgatttgtgt atttgcctga aaaaaaggga
     3481 atattgttct ctcccttacc ttttttccca aagaaattgt ttcttttaat gtgtccaaaa
     3541 tacagcaact tcagtcttgt gatttgagct tcaagtgaga tcagagacat gatttgctca
     3601 gggatccatt tgtttgtgtt cctttcttcc acagtattct caaaagtctt tactaaaacc
     3661 aaaatttatt agtacatttc cttgtactgc cacatttaca tctattaaga gtgacatcaa
     3721 atactataac tggacaattc ccaaagtaac ctcagtacat gttaaaatat cgttgacgtc
     3781 ttccatgtct cattctaagt gtcaatctgc tacttgacta taagattttt gttgtttata
     3841 agtgacccag taaggcaaaa gctatacata actagctgcc cacaaactgg ccaatataaa
     3901 gggagaagga aaattgttgt tcccactgga gttgttcccc tggttgggga aaaaatacta
     3961 ttttgtatac aaaatgctgt tctggggtca ccaggagaga gcttctgatg ttcagggcca
     4021 ggtagtgaca taagcctgag aataagactt aggggcacat tcaagctcgg gtgaatgaat
     4081 agagggaaaa aaactcctcg actatcgaat tggcgtaaat tcgcctgagt agaatgattc
     4141 aaatagattg agcgaaaaaa cgctgcgact attcgcccat cgatagtcga agtattgtct
     4201 cttttaaaaa tcatttgact gcctacttcg ccagataaaa cctaccgaat tgctttaaaa
     4261 gcctatggga aagtcccata ggcttctttt ctacgttttt gatcgaataa aaaggcattc
     4321 gatcgaatat tcgatcgaat gaaaatcctt cgattgaata ttcgatcgtg cccattcgat
     4381 tattcgccag cgcgtaaatt tgcccgaatt ccctattcga ttccattctc cagtcgaatt
     4441 tcgagggatt taacccctcg aaattcgacc cttgatacat ctgcccctta gtgtgccaac
     4501 ttgctcattg tgtgcatgtg tgtgacatgc cataaggctc tcttattaag cgcatgtatg
     4561 tgatgaaaca taaccatccc cactgggagc tccttcatgg tttagcagaa tagcgctcac
     4621 taccagcttt ttattcaaaa actgatattg tttccctcaa ccagagtata agctctatta
     4681 gcttgcacca tcagtggggg aatttttttt cccctattag gtttccttta agctgcaaac
     4741 ttgacctctc cttcccatct gcagtatatt gaccaatata agggaccaac cccacagtaa
     4801 gatatctatt gtgtatgttt caaaatccca ttaggtaagg acagtacatt tatgtggtcc
     4861 ctataggccc tcattatgat ctaattattg ggtcaatccg tcgtttttgg tacagtggtg
     4921 ccagccttga actagagtgg taaaagaggg ctttgttggc tctttgagca tatcatagag
     4981 ccttcagcaa aagttcactt tttaaatgta caccaatgaa tggagatttt tgaggccccc
     5041 aaaattgtat tgctgtagat cctgcaacag ccaatgatcc ctttatctgc tctgaaatct
     5101 tttttgtcgc tgctgctgct actggttaaa tacagtatag ttgaaaaaat ataggctttg
     5161 agaataaaac ctgatgttca tttgcttttt aattattact ttacatcccc tttaaaaata
     5221 tatacacatc actattccat gcattacact catttttaat tagacaaatc tataagaaat
     5281 tctgcgagat gacacttttc atgataagca ttttgtaaaa ttgtaatatg ttcagttttt
     5341 ttttttaaaa gttcaatgcc acactttatt tcaaaatgta ttaaggtgca gtaattatat
     5401 taaataaatg tattgtaggg tacatgaata tatgtaacat ttaaaatgtg tgtttatgca
     5461 cttctttcaa gtacagtaca tttgcactgt gatcaaatat taatttgaac tttaacagtc
     5521 ctatctctac acctttatct tgtcctgggg atcagtctgt tttttagtga tatcttgtaa
     5581 cacagaactt taaacaaaag ggctccgttt tgcacgtaga cctgtttgtg aatccatggc
     5641 aattctgcca cctaaagcat acataacatt tagcatcttt ttttggtgtt ttttagacag
     5701 atgatggtat agccatttgt gcaaataaaa tcagatattt tatcccaaat tatttgtgct
     5761 gttagttgta tagggtttca acaaaatatc ttatttatca tttagagcaa atacttatgt
     5821 gttacagtat ctgcaagtag tcaagtttga gcttaaaatt cccataattc ataattaagg
     5881 ggatggctta gtataaaaaa acgtggaaaa aaaaacgtgt acagttatgc ttttatattg
     5941 ccttgtaagt tcttttttat actattatta ttttaatgac cacgttttga attattgcat
     6001 ggatttatga aaaccagttt aattgcaaag aggctcctaa aaattattta ttataagtta
     6061 aaatttagta tatgcgtgca tgtatatgta acaatgcact ctcatatcta gtaaaaatca
     6121 aagttgaagt aaagtgtata actaagtttg acctttctca ggcattaatg atcccagagg
     6181 aaggccacac tatgtgacca aaacattgga ctacatttat taaatacatt taccttgatt
     6241 tcttcaacac aatttgaaag ttcctccatg agctaatata aatttataaa gagagagagt
     6301 gagtaaaaca tttttatcag aaaacagtgg cagagtaaat tctttcatac ttacaaaaga
     6361 gtgctactat gcgcaacatt aacttgacat ttttgaattg tacctaatgc aattcatgat
     6421 atttaaattg aatacattaa ttttaattat ttaattgtcc tgaaatctct acaggttcaa
     6481 aaaaataatt ccatttatta catttatttt gtacacttaa ttatctactg ttaagtgtca
     6541 caattgccct catttgatgt gggtttaagt ttcatgttgt tataaagaat caactttaca
     6601 atttaagaac tatatggcat tccacatata caaaagatat attagcttaa ggttaaaaat
     6661 ttattttgaa ggcaataggg tggggtggag gaaaaaaaat atgacacagc agaaatgcac
     6721 aatgggtgtg actcagcatg gccatataaa gcaaggccaa caactcaaag gaacagc
//
LOCUS       XELBGLII      910 bp ds-DNA             VRT       14-AUG-1990
DEFINITION  X.laevis larval beta-globin gene (beta-LII), 5' flank.
ACCESSION   M27932
KEYWORDS    beta-globin; globin.
SOURCE      X.laevis larval DNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 910)
  AUTHORS   Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and
            Knoechel,W.
  TITLE     Sequence analysis of the upstream regions of Xenopus laevis
            beta-globin genes and arrangement of repetitive elements within the
            globin gene clusters
  JOURNAL   Mol. Biol. Rep. 14, 17-26 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable copy of sequence [1] kindly
            submitted by W.Knoechel, 13-SEP-1989.
BASE COUNT      322 a    152 c    128 g    308 t
ORIGIN      
        1 ggatcccttg tctggaaacc agttatccag agggctccaa attatggaaa ggccatctcc
       61 catagactca attttaatca aattattatt attttttttt tacaaattaa tgcctttttc
      121 aatgtattaa taaaacagta ccttgatccc aaaattggag gcaaaacaat ccagtttgtt
      181 ttatttaatg tttaaatatt ttttttaata ttgttttgat ccaaattaca gaaaggcccc
      241 ttatccagaa aaacctccat ttaggataag gataacaggt ccaatacatt cataccctgt
      301 acaaatctat gctatgttta attacttata aatagatcca catttcaatg gatatttcta
      361 gaatatcgta ataacggtat atacttgttc aaagacaaac acatttaatg acctatgcct
      421 aactggaata acagtcaagg aaatttaatg gaataatagg tatttcggag ctttccattt
      481 attaacccta caaacaacta gttgttgttt caggaaacag cagtagttct atttggctta
      541 catcttgaac aaaagcaaag ttgctatagt tttctttttc gtgtaaggaa agaaatgact
      601 tgtgtcttta tctctacatt aaaaatgtat ctgccacaca gaatactttc tttttttaac
      661 ttatctatag ataacgtatg tgcacccaaa ttgtagctgt gttacatcag cataattaag
      721 tgcacacatg aagaaaaaaa atgacagatt gacaaaatgt tatattatat ggtaaggtct
      781 cttggataat agcccttatc agtcataact ggttacaaat acagaaaaaa tgaggtgaca
      841 cagcataaat gatatgaata cgtcactaac ttacacccct ataaatcaca aggttaaaat
      901 attttttttt
//
LOCUS       YSCADK1      2367 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae adenylate kinase (ADK1) gene, complete cds, and
            histone (H2A-1) gene, 3' end.
ACCESSION   M18455
KEYWORDS    adenylate kinase; histone.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2367)
  AUTHORS   Konrad,M.
  JOURNAL   Unpublished (1988)
  STANDARD  full staff_entry
REFERENCE   2  (sites for [1])
  AUTHORS   Konrad,M.
  TITLE     Analysis and in vivo disruption of the gene coding for adenylate
            kinase (ADK1) in the yeast Saccharomyces cerevisiae
  JOURNAL   J. Biol. Chem. 263, 19468-19474 (1988)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly provided
            by M.Konrad 21-JUL-1988.
FEATURES       from  to/span     description
    pept        917     1585     adenylate kinase (EC 2.7.4.3)
    pept    <     1      359     histone H2A-1 protein (AA at 3)
    ORF        2009  >  2367     ORF
BASE COUNT      777 a    473 c    435 g    682 t
ORIGIN      1 bp upstream of HindIII site; chromosome 4.
        1 aagcttctca atctagatct gctaaggctg gtttgacatt cccagtcggt agagtgcaca
       61 gattgctaag aagaggtaac tacgcccaaa gaattggttc tggtgctcca gtctacttga
      121 ctgctgtctt ggaatatttg gccgctgaaa ttttagaatt agctggtaat gctgctaggg
      181 ataacaagaa gaccagaatt attccaagac atttgcaatt ggctatcaga aatgatgacg
      241 aattgaacaa gctattgggt aacgttacca ttgcccaagg tggtgttttg ccaaacatcc
      301 atcaaaactt gttgccaaag aagtctgcca aggctaccaa ggcttctcaa gaattataag
      361 atcggttctg gtattttaaa gaaggcggaa ggaactaaac tgcttctcca ttgtaactat
      421 attaagtgtc tattagacct ttctgttttt cctttttatt tttctatata taatttattg
      481 ggtatataat taacggtaac atatatcatg catgatataa attttttgtt gtgtcttcct
      541 gttttctctg ttgctctgtt taggttcatt gggcactgtt gcttacttga acatgattga
      601 gtagcaaaca cattctgtgc tttgttcact ttgatagtgt gacgaaattt tcacaaccaa
      661 aggaaaaaag ctcacgattg cgcgatcctg tcacggagaa ctgtccgcag cagcccgcgg
      721 cttctgtcct tattcgccca tttttttttg attttcgact ttttcactct ggctagtttt
      781 attacgcata tttgtatata tatatacgca taaatttctg aaatggatta cctggtttat
      841 cagaaacgag agctttaata aagaaacatt aacgtttctc tggtaaagtc accacacagc
      901 atcaaatata acagtaatgt ctagctcaga atccattaga atggtcctaa ttggcccacc
      961 tggtgccggt aaaggtactc aagctccaaa tttgcaagag cgtttccatg ccgctcactt
     1021 ggccactggt gacatgttga gatctcaaat cgcaaagggc actcaattag gtttggaagc
     1081 aaagaaaatt atggaccaag gtggtttagt ctctgatgac attatggtta acatgatcaa
     1141 ggatgaattg accaacaatc cagcttgtaa gaatgggttc atcttggacg gtttcccaag
     1201 aaccattcct caggctgaaa aattggacca aatgttgaaa gaacaaggaa ctcctttgga
     1261 aaaagccatc gaattgaagg ttgatgatga attgttggtt gccagaatta ccggtagatt
     1321 aattcaccca gcctctggca gatcctacca caagatcttt aacccaccaa aggaagacat
     1381 gaaggatgac gtcaccggtg aagctttagt tcaaagatct gatgacaatg cagacgcctt
     1441 gaagaagaga ttagctgctt accatgctca aaccgaacca attgttgact tttacaaaaa
     1501 gaccggtatc tgggctggtg ttgatgcttc ccaacctcct gctactgttt gggctgacat
     1561 cttgaacaag ctaggtaagg attaatgacc ttgcgcaatg tcttctaaat atcttttctt
     1621 ttttttaaat taatgaaaat ataataaatt tttgtatgtg agtacatata tccgcagtat
     1681 actataaatg aaataataac ataagtgttt gtaactggct ataacgtcct taaacatgtg
     1741 cactgccatt aagtacttag tcaatatttt tatcgttgag aacgaacgca aatcatatta
     1801 ctaaatagat gaacaaaagc gcatcacata aacaaaaacg gtataacagt aatagtgaag
     1861 ctttttataa acctataacg aaataagcgc ttttcaggtt ttataattca gtcagagttc
     1921 taactggaca tcgttttgca ggggataaaa aaaaaaagga agcttcaacc cacaatacca
     1981 aaaaagcgaa gaaaacagcc aatgccaaat gacaataaga cacccaaatg ggtccagcac
     2041 tcccaagttt actaaaaaac cagtaacccc gaatgataaa attcctgaaa gagaagaaaa
     2101 atccaatgaa gtgaagacac ctaaaattcc attattcacg tttgccaaaa gcaaaaacta
     2161 ttcaaggccg tcaaccgcaa ttcatacctc acctcatcaa ccaagtgatg taaaaccgac
     2221 ttcccataaa cagttgcaac agccaaaatc ctcaccactg aaaaaaaata actataattc
     2281 ttttcctcac tcaaatctgg aaaaaataag caacagcaaa ctactctccc ttcttcggtc
     2341 caaaacgtca gcaggaagaa ttgaatc
//
LOCUS       YSCSSB1A     1842 bp ss-mRNA            PLN       14-AUG-1990
DEFINITION  Yeast (S.cerevisiae) heat shock protein 70 (YG101, SSB1) mRNA,
            complete cds.
ACCESSION   M25395
KEYWORDS    heat shock protein.
SOURCE      Yeast (S.cerevisiae, strain GRF-18) DNA, and cDNA to mRNA, clone
            F430.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (sites for [2])
  AUTHORS   Normington,K., Kohno,K., Kozutsumi,Y., Gething,M.-J. and
            Sambrook,J.
  TITLE     S.cerevisiae encodes an essential protein homologous in sequence
            and function to mammalian BiP
  JOURNAL   Cell 57, 1223-1236 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1842)
  AUTHORS   Normington,K.D.
  JOURNAL   Unpublished (1989) Univ, of Texas SW Medical Center, Dallas, Texas
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by K.D.Normington, 09-JUN-1989.
FEATURES       from  to/span     description
    pept          1     1842     heat shock protein 70
BASE COUNT      510 a    385 c    418 g    529 t
ORIGIN      
        1 atggctgaag gtgttttcca aggtgctatc ggtatcgatt taggtacaac ctactcttgt
       61 gttgctactt acgaatcctc cgttgaaatt attgccaacg aacaaggtaa cagagtcacc
      121 ccatctttcg ttgctttcac tccagaagaa agattgattg gtgatgctgc caagaaccaa
      181 gctgctttga acccaagaaa cactgtcttc gatgctaagc gtttgattgg tagaagattc
      241 gacgacgaat ctgttcaaaa ggacatgaag acctggcctt tcaaggttat cgacgtcgat
      301 ggtaacccag tcatcgaagt ccaatacttg gaagaaacca agactttctc cccacaagaa
      361 atttccgcta tggttttgac caagatgaag gaaattgctg aagctaagat tggtaagaag
      421 gttgaaaagg ccgtcattac tgtcccagct tactttaacg acgctcaaag acaagctacc
      481 aaggatgccg gtgccatttc tggtttgaac gttttgcgta tcatcaacga acctactgcc
      541 gctgctattg cttacggtct aggtgctggt aagtccgaaa aggaaagaca tgttttgatt
      601 ttcgatttgg gtggtggtac tttcgatgtt tccttgttgc acattgctgg tggtgtttac
      661 actgttaaat ctacttccgg taacactcac ttgggtggtc aagatttcga caccaacttg
      721 ttggaacact tcaaggccga attcaagaag aagactggtt tggacatctc cgacgatgcc
      781 agagctttga gaagattgag aactgctgct gaaagagcta agagaacctt atcttctgtc
      841 actcaaacta ccgttgaagt tgactctttg tttgacggtg aagatttcga atcctctttg
      901 actagagcta gatttgaaga cttgaacgcc gcattgttca agtctacttt ggaacctgtt
      961 gaacaagttt tgaaggatgc taagatctct aagtctcaaa tcgacgaagt tgtcttggtt
     1021 ggtggttcca ccagaattcc aaaggtccaa aagttgttgt ctgacttctt tgacggtaag
     1081 caattggaaa aatctattaa cccagatgaa gctgttgctt acggtgctgc tgttcaaggt
     1141 gctatcttga ccggccaatc cacatctgac gaaaccaagg acttgttgtt gttagatgtt
     1201 gctccattat ctctaggtgt tggtatgcaa ggtgacatgt tcggtatcgt tgttccaaga
     1261 aacactactg ttccaaccat caagagaaga acctttacta catgtgctga caaccaaacc
     1321 accgttcaat tcccagtcta ccaaggtgaa cgtgttaact gtaaagaaaa cactttgttg
     1381 ggtgaattcg acttgaagaa catcccaatg atgccagctg gtgaaccagt cttggaagct
     1441 atcttcgaag ttgatgctaa cggtatcttg aaggttactg ccgtcgaaaa gtctaccggt
     1501 aagtcttcta acatcactat ctctaacgct gttggtagat tgtcttctga agaaattgaa
     1561 aagatggtta accaagctga agagttcaag gctgccgatg aagcttttgc caagaagcac
     1621 gaagctagac aaagattgga atcctacgtt gcctccatcg aacaaactgt cactgaccca
     1681 gtcttgtctt ctaaattgaa gagaggttcc aagtccaaga ttgaagctgc tttgtccgat
     1741 gctttggctg ctttgcaaat cgaagaccca tctgctgatg aattgagaaa ggctgaagtt
     1801 ggtttgaaga gagttgtcac caaggccatg tcttctcgtt aa
//
LOCUS       ATHGPA1      2030 bp ss-mRNA            PLN       14-AUG-1990
DEFINITION  A.thaliana G protein alpha subunit 1 (GP-alpha-1) mRNA, complete
            cds.
ACCESSION   M32887
KEYWORDS    G protein.
SOURCE      A.thaliana (strain Landsberger) flow DNA and, cDNA to mRNA.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 2030)
  AUTHORS   Ma,H., Yanofsky,M.F. and Meyerowitz,E.M.
  TITLE     Molecular cloning and characterization of GPA1, a G protein alpha
            subunit gene from Arabidopsis thaliana
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3821-3825 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.M.Meyerowitz, 16-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        600     1751     G protein alpha subunit 1 (GP-alpha-1)
    pept        490      552     ORF1
    pept        568      591     ORF2
    mRNA        401     1885     GP-alpha-1 mRNA
    mRNA        401     1921     GP-alpha-1 mRNA (alt.)
    signal      312      318     pot. TATA box
    rpt          17       36     inverted repeat 1A
    rpt         188      207     inverted repeat 1B
    rpt         323      339     inverted repeat 2A
    rpt         341      357     inverted repeat 2B
BASE COUNT      631 a    346 c    413 g    640 t
ORIGIN      Chromosome 2 near erecta.
        1 gttaacttaa tagtatataa aataaaaatg catataggtt ccgtaattaa tctcttatcg
       61 tcacgagagg cacatctttt tccaacattt gaccactctc tctctctctc tcaggacctt
      121 tcggcgtaat ttcgtcttcc cctttgctta acattttctt tctttctttt tgaccaaata
      181 ttaaaaatat atccattttt attttatttt taattaaatt cataatttgc atttgtaata
      241 gaaaaaaaaa gaagagaata aatccaaaag agtgaagcaa aaacattaaa gcggaaagaa
      301 agtggtaaaa caataataga aacaggagaa gcagaagtac tacttcttct tcttctgctc
      361 tcttctcaga ccttgttttg tactttcttc ttcttcttct ttcttcttct tgtttgcgaa
      421 ctccgatatc ttcttcacta cctttgactc catttctttt tcttcttcag gtgtaggcat
      481 tgtcttgtta tgagaagcaa ctgtagctgg aagctcaagt atttgttttt agctgtggag
      541 cttgaatctt gatagttttc gacttctatg ttattacctg tggggatata gaaacaatca
      601 tgggcttact ctgcagtaga agtcgacatc atactgaaga tactgatgag aatacacagg
      661 ctgctgaaat cgaaagacgg atagagcaag aagcaaaggc tgaaaagcat attcggaagc
      721 ttttgctact tggtgctggg gaatctggaa aatctacaat ttttaagcag ataaaacttc
      781 tattccaaac gggatttgat gaaggagaac taaagagcta tgttccagtc attcatgcca
      841 atgtctatca gactataaaa ttattgcatg atggaacaaa ggagtttgct caaaatgaaa
      901 cagattctgc taaatatatg ttatcttctg aaagtattgc aattggggag aaactatctg
      961 agattggtgg taggttagac tatccacgtc ttaccaagga catcgctgag ggaatagaaa
     1021 cactatggaa ggatcctgca atccaggaaa cttgtgctcg tggtaatgag cttcaggttc
     1081 ctgattgtac gaaatatctg atggagaact tgaagagact atcagatata aattatattc
     1141 caactaagga ggatgtactt tatgcaagag ttcgcacaac tggtgtcgtg gaaatacagt
     1201 tcagccctgt gggagagaat aaaaaaagtg gtgaagtgta ccgattgttt gacgtgggtg
     1261 gacagagaaa tgagaggagg aaatggattc atctgtttga aggtgtaaca gctgtgatat
     1321 tttgtgctgc catcagcgag tacgaccaaa cgctctttga ggacgagcag aaaaacagga
     1381 tgatggagac caaggaatta ttcgactggg tcctgaaaca accctgtttt gagaaaacat
     1441 ccttcatgct gttcttgaac aagttcgaca tatttgagaa gaaagttctt gacgttccgt
     1501 tgaacgtttg cgagtggttc agagattacc aaccagtttc aagtgggaaa caagagattg
     1561 agcatgcata cgagtttgtg aagaagaagt ttgaggagtt atattaccag aacacggcgc
     1621 cggatagagt ggacagggta ttcaaaatct acaggacgac ggctttggac cagaagcttg
     1681 taaagaaaac gttcaagctc gtagatgaga cactaagaag gagaaattta ctggaggctg
     1741 gccttttatg accttattat tacatatctc tagtaaatta cctctcctta ttattataag
     1801 aaaaactcga aaactgaatg accgtgtaat ttatctttcg ggacaaaaga cttagcgatt
     1861 caaaatctaa tgtgtctcga tggctacgac tagtttctat tttatcattg tttttgttaa
     1921 cattcctctg tctttgactt cttatttttt ttctcatcaa aaacatctca ttttgatctt
     1981 gtttttgggg ttatattatt attaaaatga ggcatccaca tcccgaaatc
//
LOCUS       BLYABA        998 bp ss-mRNA            PLN       14-AUG-1990
DEFINITION  Barley abscisic acid (ABA) mRNA, complete cds.
ACCESSION   M36000
KEYWORDS    abscisic acid.
SOURCE      Barley (H.vulgare L. cv. Himalaya) seed, cDNA to mRNA, clone pHV
            A1-1.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 998)
  AUTHORS   Hong,L., Uknes,S.J. and Ho,T.-h.D.
  TITLE     Cloning and characterization of a cDNA encoding a mRNA rapidly-
            induced by ABA in barley aleurone layers
  JOURNAL   Plant Mol. Biol. 11, 495-506 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        120      761     abscisic acid (ABA)
    mRNA    <     1      998     ABA mRNA
BASE COUNT      249 a    284 c    323 g    142 t
ORIGIN      
        1 gtgccggtag taaatcatga gcatctcttg cgactcgaaa cgtagtacag caacagccta
       61 aagcgagtcc gagtggtgat tccagttcgt gtttgtttga gctagatcgt gagacgaaga
      121 tggcctccaa ccagaaccag gggagctacc acgccggcga gaccaaggcc cgcaccgagg
      181 agaagaccgg gcagatgatg ggcgccacca agcagaaggc ggggcagacc accgaggcca
      241 ccaagcagaa ggccggcgag acggccgagg ccaccaagca gaagaccggc gagacggccg
      301 aggccgccaa gcagaaggcc gccgaggcca aggacaagac ggcgcagacg gcgcaggcgg
      361 ccaaggacaa gacgtacgag acggcgcagg cggccaagga gcgcgccgcc cagggcaagg
      421 accagaccgg cagcgccctc ggcgagaaga cggaggcggc caagcagaag gccgccgaga
      481 cgacggaggc ggccaagcag aaggccgccg aggcaaccga ggcggccaag cagaaggcgt
      541 ccgacacggc gcagtacacc aaggagtccg cggtggccgg caaggacaag accggcagcg
      601 tcctccagca ggccggcgag acggtggtga acgccgtggt gggcgccaag gacgccgtgg
      661 caaacacgct gggcatggga ggggacaaca ccagcgccac caaggacgcc accaccggcg
      721 ccaccgtcaa ggacaccacc accaccacca ggaatcacta gacgcatgcg ttcgcgctta
      781 atttccgttc ctttagtcgt gtttggtcgt tcgagggcct tctacatatt tcatatttgt
      841 atgtttccac tctttcatga tttccgctca tttagtgtaa agtttgcctc cgatttgatg
      901 tactcgtctc tggttctgta atgagttata atccatgggc tttggtgtaa atggataacg
      961 aggacactcg aaggcggcaa taaagttgta tgtgatcg
//
LOCUS       MVARRNAG     1466 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  M.vannielii 16S ribosomal RNA gene.
ACCESSION   M36507
KEYWORDS    16S rRNA; small subunit ribosomal RNA.
SOURCE      M.vannielii DNA.
  ORGANISM  Methanococcus vannielii
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanococcales; Methanococcaceae.
REFERENCE   1  (bases 1 to 1466)
  AUTHORS   Jarsch,M. and Boeck,A.
  TITLE     Sequence of the 16S ribosomal RNA gene from Methanococcus
            vannielii: Evolutionary implications
  JOURNAL   Syst. Appl. Microbiol. 6, 54-59 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1     1466     16S ribosomal RNA
BASE COUNT      335 a    372 c    473 g    286 t
ORIGIN      
        1 attccggttg atcccgccgg aggctactgc tattggggtt cgactaagcc atgcgagtct
       61 atggtttcgg ccatggcgga cggctcatta acacgtggtt aacttaacct caggtggagc
      121 ataaccttgg gaaactgagg ataattctcc ataagaaaag cagtctggaa cgattctttt
      181 ctgaaagcat atgcgcccga ggataggact gcgctcgatt aggtagttgg tggggtaatg
      241 gcccaccaag cctacgatcg atacgggcct tgagagaggg agcccggaga tggggactga
      301 gacacggccc caggccctac ggggcgcagc aggcgcgaaa cctccgcaat gcacgaaagt
      361 gcgacggggg gaccccaagt gctcatgcac agcatgggct tttatcaagt gtaaacagct
      421 tgaggaataa gggctgggca agttcggtgc cagcagccgc ggtaataccg acggcccgag
      481 tggtagccac tcttattggg cctaaagcgt ccgtagccgg tccagtaagt ccctgtttaa
      541 attctctggc ttaaccagag gactggcagg gatactgctg gacttgggac cgggagagga
      601 caagggtact ccaggggtag cggtgaaatg tgttgatcct tggaggacca cctatggcga
      661 aggcacttgt ctggaacggg tccgacggtg agggacgaaa gccaggggcg cgaaccggat
      721 tagatacccg ggtagtcctg gccgtaaact ctgcgaacta ggtgtcacct gggcctcgag
      781 cccaggtggt gccgaaggga agccgttaag ttcgccgcct ggggagtacg gtcgcaagac
      841 tgaaacttaa aggaattggc gggggagcac cacaacgggt ggagcctgcg gtttaattgg
      901 attcaacgcc gggcatctca ccacgagcga cagcatgatg acggccaggt tgacgacctt
      961 gcctgaagcg ctgagaggtg gtgcatggcc atcgtcagct cgtaccgcga ggcgtcctgt
     1021 taagtcaggt aacgagcgag acccgtgccc tatgttgcga ctactttctc cggaaggtaa
     1081 gcactcatag gggaccgcta gcgctaagct agaggaagga gcgggcaacg ataggtccgc
     1141 atgccccgaa tctcctgggc tacacgcggg ctacaatggc taggacaatg ggctgctacc
     1201 ctgaaaaggg acgcgaatct ccgaaaccta gtcgtagttc ggatcgtggg ctgtaactcg
     1261 cccacgtgaa gctggaatcc gtagtaatcg cagttcataa tactgcggtg aatgtgtccc
     1321 tgctccttgc acacaccgcc cgtcacacca cccgagttgg gttcaggtga ggccttggcc
     1381 tttggctagg gtcgaacctg ggctcagcga ggggggtgaa gtcgtaacaa ggtagccgta
     1441 ggggaacctg cggctggatc acctcc
//
LOCUS       TOM2A11       526 bp ss-mRNA            PLN       14-AUG-1990
DEFINITION  Tomato fruit-specific protein mRNA, complete cds.
ACCESSION   M21775 X13741
KEYWORDS    fruit-specific protein.
SOURCE      Tomato (strain UC82B) red fruit stage, cDNA to mRNA, clone 2A11.
  ORGANISM  Lycopersicon esculentum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 526)
  AUTHORS   Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M.
  TITLE     Isolation and characterization of a fruit-specific cDNA and the
            corresponding genomic clone from tomato
  JOURNAL   Plant Mol. Biol. 13, 639-651 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.E.Rose, 09-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept         34      324     fruit-specific protein
BASE COUNT      163 a     97 c     92 g    174 t
ORIGIN      
        1 tgctcatcaa ttagcaatta atccaaaacc attatggctg ccaaaaattc agagatgaag
       61 tttgctatct tcttcgttgt tcttttgacg accactttag ttgatatgtc tggaatttcg
      121 aaaatgcaag tgatggctct tcgagacata cccccacaag aaacattgct gaaaatgaag
      181 ctacttccca caaatatttt gggactttgt aacgaacctt gcagctcaaa ctctgattgc
      241 atcggaatta ccctttgcca attttgtaag gagaagacgg accagtatgg tttaacatac
      301 cgtacatgca acctgttgcc ttgaacaata tcaatgatct atcgatcgat ctatctatct
      361 atttatctgt ctctgcgcgt atagtgttgt ctgtaccttt ggtgtgaaga atatgaataa
      421 agggatacat atatctagat atattctagg taatgtccta ttgtatttaa aatttgtagc
      481 aatgattgtt tgaataaaaa cataccatga gtgaaataat tattcc
//
LOCUS       TOM3H11       774 bp ss-mRNA            PLN       14-AUG-1990
DEFINITION  Tomato fruit-specific protein mRNA, complete cds.
ACCESSION   M21776 X13742
KEYWORDS    fruit-specific protein.
SOURCE      Tomato (strain UC82B) red fruit stage, cDNA to mRNA, clone 3H11.
  ORGANISM  Lycopersicon esculentum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 774)
  AUTHORS   Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.E.Rose, 09-DEC-1989, for release after publication.
            
               Author address: R.E.Rosen,
               Calgene Inc.
               1920 Fifth St.
               Davis, CA 95616
FEATURES       from  to/span     description
    pept        214      504     fruit-specific protein
BASE COUNT      247 a    143 c    119 g    265 t
ORIGIN      
        1 tttttttgag caaagggcaa ctcagatatc caaagatgaa tccaacatat agcttacagc
       61 tgggagaaca ttgtctaact cttctgaaat ttaaatgtta tccagaatcc ttcatcataa
      121 aataatatca aaatgcaaat ctattttttc tactcttgtc tagcttcaac tttcttcttc
      181 tgctcatcaa ttagcaatta atccaaaacc attatggctg ccaaaaattc agagatgaag
      241 tttgctatct tcttcgttgt tcttttgacg accactttag ttgatatgtc tggaatttcg
      301 aaaatgcaag tgatggctct tcgagacata cccccacaag aaacattgct gaaaatgaag
      361 ctacttccca caaatatttt gggactttgt aacgaacctt gcagctcaaa ctctgattgc
      421 atcggaatta ccctttgcca attttgtaag gagaagacgg accagtatgg tttaacatac
      481 cgtacatgca acctgttgcc ttgaacaata tcaatgatct atcgatcgat ctatctatct
      541 atttatctgt ctctgcgcgt atagtgttgt ctgtaccttt ggtgtgaaga atatgaataa
      601 agggatacat atatctagat atattctagg taatgtccta ttgtatttaa aatttgtagc
      661 aatgattgtt tgaataaaaa cataccatga gtgaaataat tattccacat taattcacgt
      721 atttatttca cttatgatac gtatttttgt tcctttcgcg taaaaaaaaa aaaa
//
LOCUS       TOMFSPG      4654 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  Tomato fruit-specific protein gene, complete cds.
ACCESSION   M21777 X13743
KEYWORDS    fruit-specific protein.
SOURCE      Tomato (strain VFNT Cherry) DNA.
  ORGANISM  Lycopersicon esculentum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae.
REFERENCE   1  (bases 1 to 4654)
  AUTHORS   Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M.
  TITLE     Isolation and characterization of a fruit-specific cDNA and the
            corresponding genomic clone from tomato
  JOURNAL   Plant Mol. Biol. 13, 639-651 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.E.Rose, 09-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept       1379     1445     fruit-specific protein, exon 1
               2160     2383     fruit-specific protein, exon 2
    pre-msg    1326     2640     mRNA and introns
    IVS        1446     2159     fruit-specific protein intron
    signal     1064     1067     TATA-box (put.)
    signal     1135     1138     TATA-box (put.)
    signal     1190     1193     TATA-box (put.)
    signal     1291     1297     TATA box (put.)
    signal     2640     2640     polyA signal
BASE COUNT     1709 a    723 c    642 g   1580 t
ORIGIN      
        1 ctcgagccct ttaaaaagta tagtcaatat ttacggtgac cgtgaatttc ttaattatga
       61 tatataattt aaaagaaatc atgatcacat tctactgatg agaacatgtg ctaatcaagg
      121 gaaaacatgg atgtgaaaaa tactttttgt taaaagtaaa aaaaaatgtg aaattttgtt
      181 agttatttac tacctataca ttatttgagc atgtgcaaac tttacaaata cctaatagaa
      241 gattttcacc tgcctgtata tatgtaaatt aattataatg aacactctca cataaaataa
      301 ttatcagtat atacattaat acttgccctc cacaatgaat taaataaaat gtagaacatg
      361 atctacactt caataaaact aagaccataa agaataattt caaaatatac acatgtcaac
      421 aataaattat ttgcatatta tattaactta ctaaacaatc tttacttttg aaatataaaa
      481 ataatcaagt tataagtctg ctcaaagtaa agcacttgtt agactcatct gattttgaga
      541 aggtaagcaa attgatggtg cataatagtc acaagtaaaa tataaaatag atttcattag
      601 taaaattgtt ttttactttc tttatatata attatcaata tccttcaatg gtaggttaat
      661 tatattgtta acttcttgtt gaattaaagc aataagacaa gaatattaaa gataaaagaa
      721 caataaaaat agaaagacta agagataaga gttttcttat tcttctttca ataagtatca
      781 tcaagtgtat acaatataaa tttttgtatt tttgatctat ctatttataa tgttatatat
      841 aagcatacaa aagatcagtc ataaatatga ctttaatcat gaaaataatg aaagagatta
      901 tgaaggcgta aggttactag aataatagtc attaaaaaaa ggggttatct ttataattga
      961 ataattgatg aagtaatgga gataattagt gagcataaat ttttttaaaa aaatggacat
     1021 ttacactata atattttata acactttccc ttaaacatct aggtataaat aatgagtctt
     1081 gtcaaaatct tagtaggaaa aattctgtga aattttttta gtgaaaacaa atgatataaa
     1141 tatcttgaat actcattatt tgttgtctca ttaaaaatct tatctgacct ataaaataaa
     1201 ttatttgctc aactcaaaat agtttttcat tctaaaatta gtataattat tagtgaatat
     1261 ttaattaaca taattgtata ctaaggggcc tataaattgg attcttctca aagaaaaata
     1321 aaatcaccac acaactttct tcttctgctc atcaattagc aattaatcca aaaccattat
     1381 ggctgccaaa aattcagaga tgaagtttgc tatcttcttc gttgttcttt tgacgaccac
     1441 tttaggttca caacacttct cccttatttt gttttcttaa tttcttggaa gtcatatgca
     1501 tgtgtttggt atcatggtat atatataaag gaaaatattt ttcttaatta ctggttttct
     1561 aatgtttggt aggtaatcgg aaattattat gagataatga acttgcaaag tcattattat
     1621 ataacttttt ttttatactt tgatttaaga attcattttt ctcattttat ataaacttat
     1681 ttttcaacag aaaatatttt tcgaactatt caaacacacc ctaagacatt acatatatat
     1741 atatatacac cctccgtttt atattactta atgcctattg agttggccca ccctttaaga
     1801 atgattcaat tagagatatg ttttactaaa ttaacctatg ctttaagact ctaaatttgg
     1861 ctattactat tttacgttgt aatttaatga caaacatttc ataatgacta tagtctgaac
     1921 ttaattagac agacgtatct atagtttgct tactaatgat tcatagctat atatttggag
     1981 aggagagaga caaacgatat taagaaaggg aggagagagg cgaggtaaat ctgaaataga
     2041 gaagagaaag gcaaccaatt ttgatcatct atcatacttt tgattattat ttttattata
     2101 tgtacgttta cattacagtt ttcgaattct tacattaatc ttaatcataa tatatacagt
     2161 tgatatgtct ggaatttcga aaatgcaagt gatggctctt cgagacatac ccccacaaga
     2221 aacattgctg aaaatgaagc tacttcccac aaatattttg ggactttgta acgaaccttg
     2281 cagctcaaac tctgattgca tcggaattac cctttgccaa ttttgtaagg agaagacgga
     2341 ccagtatggt ttaacatacc gtacatgcaa cctgttgcct tgaacaatat caatgatcta
     2401 tcgatcgatc tatctatcta tttatctgtc tctgcgcgta tagtgttgtc tgtacctttg
     2461 gtgtgaagaa tatgaataaa gggatacata tatctagata tattctaggt aatgtcctat
     2521 tgtatttaaa atttgtagca atgattgttt gaataaaaac ataccatgag tgaaataatt
     2581 attccacatt aattcacgta tttatttcac ttatgatacg tatttttgtt cctttcgcgt
     2641 agatttttga tccttttccc ttttgaatat taaacattaa acacaaataa tgtttattaa
     2701 attaagttaa tatttttatt tagctattta tatttttatt tgaaatcaaa cttgataaat
     2761 atttataaag ataattaaca agtaatgtga cactaacacc atgtaatatt atcttgtcgt
     2821 tatttatgat aatattttaa aattataatt tcagttaaaa aattattaaa aaaacatact
     2881 tttaaaaagt gagttagcct ccgctaccca catacttatg aattggacta gttgtttttt
     2941 gacccacaaa aagaatgggc taattaaacc tgacctatca aatttcagaa tctgcataga
     3001 ttagtccgaa cgaaatgagt cagcccgtat tgaacaaaat atcaacaagg acgttatgta
     3061 aagatgttta agaaggaaaa aagatttcta atacatatgg actttcaata tcccaacttt
     3121 gtctggcgat ctgaaccctg cttagtttgt tgatcattaa cttgtcttgc tatgtattta
     3181 agatttaaac tttatatgtt taaacttaca gaaaatacat ataaatctct caagacttgg
     3241 caacataatt tactttagta cttaaactac atgaaaattt aaatatcctt ttaacatctt
     3301 tgaagtgaat taaattatca caatccgagc ctacaccttg gacgtggccg gcactcaaga
     3361 accagtgctg gtccccaagc taaccctcat cctgactgac tacaagcgga aggctaactt
     3421 aagtatacaa aagcttaaaa ctgaataaaa taaactttac aaggttttaa cacaaatgaa
     3481 caactttgaa gaaaataata tattcaacta gccataaaat agacaacttt agtctttaaa
     3541 acatttaata aaataaatgc aaaatataga ctccttaact aaactgacta tctatggagc
     3601 ctctaattga taaagatgga agtcgggaca agaccacgac atcctgacta aactgagaag
     3661 taaataaaat cccccggaaa aaaaggagcc tcaccatggc taactcgaac tcggggatat
     3721 atcaatgaag ctcctgttga tgatcttgaa gacatgtctc tgcatcatca aaagatgcag
     3781 gccaaatggc tcagtacgta aaatgtacga gtatgtaagg gaaattctaa agtataacat
     3841 aagcttgata cttgaataaa aggaaacata cttacctctt ttcaactcaa ctcaaattaa
     3901 gaataagata ctcaactcaa agattaggta ttcaacgcaa atatggcact ctactcaatg
     3961 aagtacaaat taactcagga tactcgactt aagatactca actcccgaca ctcaactgaa
     4021 ctcatttcaa tataaagcag cttaaaacaa gttcagtata aagtaaagtt gtttaaaaac
     4081 atgatgtcaa ctctgtgtgt ataataagga tacaacataa ctttgaaatg tatataaaaa
     4141 tacaattaac tgatgtatat aaaaatacat taatctatgg gagattctct aaccgacaac
     4201 catcacttaa gggctaagat gatgatatag cgatctaccg cacgctgcca tcgcatctta
     4261 tacccggcca aaggtataag acctgaactg cctaatgaat ccactaataa actgttaaaa
     4321 ggaatcatct aaaaagtatg acccttttct acccatagtg gctaacatgg tttatggggg
     4381 ctgtgagtta tctgaactct cccccatatc ggtgctcaat actactccaa aaaatatact
     4441 gctcttatgt ttaaaaacat actgattctg tggtttgaaa ttattgctta aagcttagat
     4501 ttttgaaaag ctctcttttg aaaatcgtag tttccttttt cttctattaa agctagacat
     4561 aggctatgta gaactctagc ttaccttcct tctcaaaagt ttgaaaacat ttgcttagat
     4621 tcttagggac tacttagttc ccttgttgga attc
//
LOCUS       ACLP322P     1337 bp ds-DNA             SYN       14-AUG-1990
DEFINITION  Synthetic plasmid pWH1266 origin of replication (ori) region.
ACCESSION   M36473
KEYWORDS    .
SOURCE      A.calcoaceticus lwoffi plasmid and pBR322 DNA, clone pWH1266.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 1337)
  AUTHORS   Hunger,M., Schmucker,R., Kishan,V. and Hillen,W.
  TITLE     Analysis and nucleotide sequence of an origin of DNA replication in
            Acinetobacter calcoaceticus and its use for Escherichia coli
            shuttle plasmids
  JOURNAL   Gene 87, 45-51 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    orgrpl      310      337     origin of replication
BASE COUNT      447 a    229 c    251 g    410 t
ORIGIN      
        1 gatcgtagaa atatctatga ttatcttgaa gaacgcaacc ctatagcagc tattgaaatt
       61 gatgatttaa ttgaagaaaa gacagattta gttgttgata atcgactgat ggggcgcaca
      121 ggcagacaga aagatactag ggagttagtg atacatccgc attatgtggt tgtatatgac
      181 atcactgata taatacggat actcagagtg ctacacacat cgcaggagtg gtcatgactt
      241 actcatgtac tttggattat ttagtgttat aaaatcctga tttataaatt ttttttgtta
      301 aaaaagataa aagccccttg caattgcttg gggctttacc gtaatttatg gggtacagat
      361 cttcgatact gacatatcgg caatcgaaag cattaaggtt tgacgaccgc taatgatttc
      421 accacagggg cttaatgtac ctgtcttaaa ttctaaggtt ttaactcgct ttgtcaagca
      481 tagaccccaa aaatttagcc aatgtctgta actcaatctg tccatgtgtg ggtgatgagg
      541 tacagtgacg ctagcacaca tcggaaaaac gctattacta ggggaactga acagagtagc
      601 ggacgcaatg agtagtcatt taattggcgg ttatgagcgt gttcaggcgg tgctatcaat
      661 cgtaatcata acagtggcag cttgatacag tgatgtcatc cctgatgcga aagcgaccga
      721 ccgacggtac atcgaatggg aatactttag ggtgattttt aagaatcgct ctagggtgag
      781 tatttcccat tcagctctgc tccctccctc tggtacttta atcaaaagca ctactaaaca
      841 tatgttttta aataaaaaat attgatatag agataatatt agtaagaata attaaacaat
      901 tgaatataga taaatcattg ttaaataaag attaattatt aaaatgaatg tatacttata
      961 tataaatcaa tgatttaaaa tatttgataa agaaaacttt tcaaaaaaaa tataattgag
     1021 attgtgtcat ttcggtcaat tcttaatatg ttccacgcaa gttttagcta tggtgctaaa
     1081 cagaaatttg ctgaaaaaga acttttcact gaactggtta aaatgtaagc agcctgagag
     1141 ccgccaaaaa ttttaaaaac aaaccgcctt aatcatcttc aaaaaatacc tctaaaacct
     1201 caccatttgc gttttaagac ccatatttca tcctgccctt atgttcccat gctgatagct
     1261 ataaagtgtc tgtaatcgct tcctatgacg ttctaggctg ttgataactt ttggaacaac
     1321 gcaaaatgtt aaaatcc
//
LOCUS       ATHUXP1A     1078 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  A.thaliana ubiquitin extension protein (UBQ1) gene, complete cds.
ACCESSION   J05507
KEYWORDS    gene UBQ1; ubiquitin extension protein.
SOURCE      A.thaliana (variety Columbia) DNA.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 1078)
  AUTHORS   Callis,J., Raasch,J.A. and Vierstra,R.D.
  TITLE     Ubiquitin extension proteins of Arabidopsis thaliana: Structure,
            localization, and expression of their promoters in transgenic
            tobacco
  JOURNAL   J. Biol. Chem. 265, 12486-12493 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Callis, 08-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept         44      146     ubiquitin extension protein (UBQ1)
                234      320     ubiquitin extension protein
                406      508     ubiquitin extension protein
                888      954     ubiquitin extension protein
               1031     1057     ubiquitin extension protein
    pre-msg <     1  >  1078     UBQ1 mRNA and introns
    IVS         147      233     UBQ1 intron A
    IVS         321      405     UBQ1 intron B
    IVS         509      887     UBQ1 intron C
    IVS         955     1030     UBQ1 intron D
BASE COUNT      295 a    195 c    214 g    374 t
ORIGIN      
        1 ccttgtttag ggtttctacg tgagagaaga cgaaacacaa aagatgcaga tcttcgtgaa
       61 aaccttgacc ggcaagacca tcactctcga ggtcgagagc agcgacacca tcgacaatgt
      121 caaggccaag atccaagaca aagaaggtat cattcttcct cactcaatct ggattcttct
      181 ctttagcttt ttgaaattca gatctcttat catttacttg tttctccttt aaggaatccc
      241 tccggatcag cagagattga tcttcgccgg aaagcagctc gaagatggcc gtactttggc
      301 tgactacaac atccagaaag gtacgaaatc atccgaatcc ttctgttgat catttcgatg
      361 atctgattgt ataaactcta atggattgtt atcatttgta aacagaatct acacttcatc
      421 ttgtgttgag gcttagagga ggtattattg agccttcctt gatgatgctt gctcgtaagt
      481 acaatcagga taagatgatt tgccgcaagt aagtttcaat ttctccctaa gattgtagtt
      541 ttgttgatta atagttcaaa gtaccaaatg tttattagtg tgttagggtt ttagttgttt
      601 taatgggaat taacttagct tctcaagcct tatacaatgc gtgttgaggt tataaatgtt
      661 cccaatatca aacatttcca ttctctaaac tttgtcagct ggtcaaattt ctcatctttc
      721 cagatgattt tcttgaatgg tacatttttg caaatggttt tgaaaatgtg ttgaagcttg
      781 tatttttgat tgctgaaaca tcttgtttgt tggatagttt gtcatctatg gtccgatctg
      841 atgtctcaat gagattattc taatgtaaat ttcgcatctt ttggtaggtg ctatgctcgt
      901 cttcacccaa gagctgtcaa ctgcaggaag aagaagtgtg gtcacagcaa ccaggtttgt
      961 ttttgttact tgtttctttt tggtgtttat cagctgaaat gatatctgaa caaacttttt
     1021 ttatttgcag ttgaggccta agaagaagat caagtagaga gactcttatc aagaatcc
//
LOCUS       ATHUXP2A      976 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  A.thaliana ubiquitin extension protein (UBQ2) gene, complete cds.
ACCESSION   J05508
KEYWORDS    gene UBQ2; ubiquitin extension protein.
SOURCE      A.thaliana (variety Columbia) DNA.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 976)
  AUTHORS   Callis,J., Raasch,J.A. and Vierstra,R.D.
  TITLE     Ubiquitin extension proteins of Arabidopsis thaliana: Structure,
            localization, and expression of their promoters in transgenic
            tobacco
  JOURNAL   J. Biol. Chem. 265, 12486-12493 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Callis, 08-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept         62      164     ubiquitin extension protein (UBQ2)
                243      329     ubiquitin extension protein
                433      535     ubiquitin extension protein
                771      837     ubiquitin extension protein
                929      955     ubiquitin extension protein
    pre-msg <     1  >   976     UBQ2 mRNA and introns
    IVS         165      242     UBQ2 intron A
    IVS         330      432     UBQ2 intron B
    IVS         536      770     UBQ2 intron C
    IVS         838      928     UBQ2 intron D
BASE COUNT      262 a    164 c    209 g    341 t
ORIGIN      
        1 cttcattttt tacactgcaa acgccgtttc gagacttccg gcgaagaaga cgaacgcaaa
       61 gatgcagatt ttcgtgaaaa cgctaaccgg caagaccatc accctcgagg tcgagagcag
      121 tgacaccatc gacaatgtca aggccaagat ccaggacaaa gaaggtatat tttctaatgc
      181 tttttaaaaa ttgtaacttt gaaattgtga tctcttagtg attcgttttt gctctgtttc
      241 aggaattcct ccggaccagc agaggttgat cttcgccgga aaacagcttg aagatggtcg
      301 taccttggct gactacaaca tccagaaagg tctctttctc gaatttgaat gttttggatt
      361 ctttagaaaa tgaaactagt atagctatga ttgttttgtt gatccaattc tgttttgttt
      421 actataaaac agagtcgact ttgcatcttg ttctgaggct taggggaggt atcattgagc
      481 cttcattgat gatgttggct cgtaagtata accaagacaa aatgatttgt cgcaagtaag
      541 atctctgatc ttttctcttt gttgttctaa gtgtaccatg tccatgtgtg atgattatag
      601 taaaaatggg gaattaactt aggggtgaac tattattcat tttctacgtg gattatgatt
      661 ctcttagttt gctgataaat gtattgaaat ctatctagtt ctttcactgt aacactatgg
      721 ttgtgtgaaa actgtgtctc atgtttgagt tgttaacctt gttggtttag gtgttatgct
      781 cgtctccacc caagagctgt gaactgcagg aagaagaagt gtggccatag caaccaggtt
      841 tggttttttg tctctttaga ttttaaaatt ccacagaatt tgtttgattt gttagttgtt
      901 aagtgtgatt ttctgtttat ttttgcagtt gaggccaaag aagaagatca agtgatcttt
      961 tgttcaccaa aagctt
//
LOCUS       ATHUXP5A      720 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  A.thaliana ubiquitin extension protein (UBQ5) gene, complete cds.
ACCESSION   J05539
KEYWORDS    gene UBQ5; ubiquitin extension protein.
SOURCE      A.thaliana (variety Columbia) DNA.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 720)
  AUTHORS   Callis,J., Raasch,J.A. and Vierstra,R.D.
  TITLE     Ubiquitin extension proteins of Arabidopsis thaliana: Structure,
            localization, and expression of their promoters in transgenic
            tobacco
  JOURNAL   J. Biol. Chem. 265, 12486-12493 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Callis, 08-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept        112      585     ubiquitin extension protein (UBQ5)
    site        339      340     ubiquitin-extension protein junction
BASE COUNT      209 a    159 c    164 g    188 t
ORIGIN      
        1 aagcccaaat aaaataaggg tagtaacgta attgagctaa gaaaccctag aggtctcctt
       61 cgcctacaaa atccccattt tgataatctt cagcagccgt tgcctcaaaa gatgcagatc
      121 ttcgtgaaaa ccctaacggg gaagaccata acccttgagg ttgaatcatc cgacaccatc
      181 gacaatgtga aggcgaagat ccaagacaag gaaggaatcc caccggacca gcagcgattg
      241 attttcgccg ggaaacagct tgaagacggc cgtaccctcg ccgactacaa catccagaag
      301 gaatcgacgc ttcatctcgt cctccgtctc cgtggtggtg ctaagaagag gaagaagaag
      361 acttacacca agccgaagaa gatcaagcac aagcataaga aggttaagct cgctgttctt
      421 cagttttaca aaatcgatgg atctggaaag gttcagcgtt tgaggaagga gtgccctaac
      481 gcaacctgtg gagctgggac tttcatggcg agtcattttg atcgccatta ctgtggtaag
      541 tgtggactca cctacgttta ccagaaagaa ggagttgaag cttgatctgt tgtagcggta
      601 gatcgatcct tctctttctc tcttatcttt taaaaactgc atttctattt gggaattttg
      661 taagctcttt aatttgagtt atcatggatt ctatgttgaa catctttcgt tctggattat
//
LOCUS       ATHUXP6A      681 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  A.thaliana ubiquitin extension protein (UBQ6) gene, complete cds.
ACCESSION   J05540
KEYWORDS    gene UBQ6; ubiquitin extension protein.
SOURCE      A.thaliana (variety Columbia) DNA.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 681)
  AUTHORS   Callis,J., Raasch,J.A. and Vierstra,R.D.
  TITLE     Ubiquitin extension proteins of Arabidopsis thaliana: Structure,
            localization, and expression of their promoters in transgenic
            tobacco
  JOURNAL   J. Biol. Chem. 265, 12486-12493 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Callis, 08-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept        102      575     ubiquitin extension protein (UBQ6)
    site        329      330     ubiquitin-extension protein junction
BASE COUNT      191 a    154 c    153 g    183 t
ORIGIN      
        1 gagctccaaa accctagttt ctcaaccact actcttttat ttcttctcac cacttaaaga
       61 gtttccccag aaattttctt ccgccgtaaa agcaaaaaaa gatgcagatc ttcgtgaaaa
      121 ccctaacggg gaagacgatc actctcgagg tcgagtcctc tgacaccatc gacaatgtca
      181 aggccaagat ccaagacaag gaaggaatcc caccggacca gcagcgattg attttcgccg
      241 gaaagcagct cgaagacgga cgtaccttag ccgattacaa catccagaag gaatcaacgc
      301 ttcaccttgt ccttcgtctc cgtggaggtg ctaagaagag gaagaagaag acctacacca
      361 agcctaagaa gatcaagcac aagcacaaga aggtcaagct cgctgttctt cagttctaca
      421 aggttgatgg ttcaggtaag gttcagcgtt tgaggaagga gtgccctaac gcaacctgtg
      481 gtgctgggac tttcatggcg agtcatttcg atcgtcacta ctgtggtaag tgtggtctca
      541 cctacgttta ccagaaagaa ggagctcagg aatgattttc atctcgatct ctatcatttt
      601 gaattgaata ctgctttttt tttggaattt ggaagttgtt tttggatgtt gtggatctta
      661 tgttgaactt gtttgaattt c
//
LOCUS       BLYG1HORDA   1614 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  Barley gamma-1 hordein storage protein gene, complete cds.
ACCESSION   M36378
KEYWORDS    gamma-hordein; storage protein.
SOURCE      Barley (cv. Carina) leaf DNA, clone lambda-hor-gamma-1.
  ORGANISM  Hordeum vulgare
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 1614)
  AUTHORS   Cameron-Mill,N. and Brandt,A.
  TITLE     A gamma-hordein gene
  JOURNAL   Plant Mol. Biol. 11, 449-461 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        379     1296     gamma-1 hordein precursor
    sigp        379      435     gamma-1 hordein signal peptide
    matp        436     1293     gamma-1 hordein storage protein
    signal     1372     1377     polyA signal
    signal     1428     1433     polyA signal
BASE COUNT      516 a    433 c    271 g    394 t
ORIGIN      
        1 agatccagtt tatccctcac aacttttaaa acatgacaaa gttagtttta gaaggattgg
       61 aagatagata cttatgtgag atgtaaagtg aataagatga gtcagcacct attatcaacc
      121 tcatcgtgca cttagaattt atgtacaacc acaaagcaac tttgatgatg tgtccaaaat
      181 tacctttgta tgtagtgcca ccaaacacaa acattccaaa tgatcagttc gataagcacg
      241 aacgcacatt aaaagaggga agaaaccatg aaaaggaacc atgatatgac atctataaag
      301 aggcctacat tttgtagatc acataccatc atccatcctt tacattacta agaaaggttt
      361 agctaacaca aatccaccat gaagatctta atcatcctta caatccttgc catggcgact
      421 accttcgcca cctctgagat gcaagtcaac cctagtgtcc aagtacaacc gacacaacaa
      481 caaccatatc ccgagtctca gcaaccattc atcagtcaat cacaacaaca atttccccag
      541 ccacaacaac cattccctca gcaaccacaa cagccatttc ctcaatccca gcagcaatgc
      601 ctccagcaac cacaacacca atttccccaa cccacacaac aattccccca acggccacta
      661 ctaccattta ctcacccgtt cctaacattc cccgaccaac tactccctca gccaccacac
      721 caatcattcc cccagccccc acaatcatac cctcagccac cgttacaacc ttttcctcag
      781 cccccgcaac aaaaatatcc cgagcaacca caacaaccat ttccttggca acaaccaact
      841 attcagctat atctccaaca acaactgaat ccgtgcaagg agtttctctt gcaacagtgc
      901 aggcctgtgt cattgttatc gtacatctgg tcaaagatcg tgcaacagag cagttgtcgg
      961 gtgatgcagc aacaatgttg cctgcaattg gcacagatcc ctgaacaata caaatgcaca
     1021 gcaatcgata gcattgtgca tgccatcttc atgcagcaag gccaaagaca aggcgtgcag
     1081 atcgtgcaac aacaacctca gccgcaacag gtgggtcaat gtgtgctcgt ccagggtcag
     1141 ggcgtcgtcc aaccgcaaca actagctcaa atggaagcaa tcagaacgtt ggtgttgcag
     1201 tccgttccat ccatgtgcaa ctttaatgtc ccacctaact gctccaccat caaagcacca
     1261 tttgttggtg tagtcaccgg tgttggagga caatgaaaag atgagagcta taataattag
     1321 tacatggatc atcgttgcat aatcgatgga ccaatcgatg taataacgat aaataaagtg
     1381 tcgtgcacca tgatgtgtga ttcgaccagg acttgtccaa tcgtgggaat aaaagtcata
     1441 aaaagttctt ctaagtgctt catttcttat actttttccc attcattgtg tatgtaaaag
     1501 tcgatacaaa actggaatcg taagtctatc cgatctatga tttagtttag acaactagat
     1561 taatgttcct tcggtgttag tcgtgcattg tagtaggttc aatctggtgg tacc
//
LOCUS       BOLSLSGRA    1286 bp ss-mRNA            PLN       14-AUG-1990
DEFINITION  B.campestris S-locus-specific glycoprotein mRNA, partial cds.
ACCESSION   M36301
KEYWORDS    S-glycoprotein; S-locus-specific glycoprotein; glycoprotein.
SOURCE      B.campestris, cDNA to mRNA.
  ORGANISM  Brassica campestris
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 1286)
  AUTHORS   Takayama,S., Isogai,A., Tsukamoto,C., Ueda,Y., Hinata,K.,
            Okazaki,K. and Suzuki,A.
  TITLE     Sequences of S-glycoproteins, products of the Brassica campestris
            self-incompatibility locus
  JOURNAL   Nature 326, 102-105 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1     1257     S-6-glycoprotein
BASE COUNT      345 a    278 c    316 g    347 t
ORIGIN      
        1 ggatccgtcg tcttgattct attttgtcct gccttttcga tcaacacttt gtcgtctaca
       61 gaatctctta gaatctcaag caacagaaca cttgtatctc caggtaataa cttcgaactc
      121 ggcttcttcc gaaccaactc aagttctcgt tggtatctcg ggatatggta caagaaattg
      181 ctcgacagaa cctatgtatg ggttgccaac agagataacc cactctccaa tgccattgga
      241 accctcaaaa tctcaggcaa taatcttgtc ctccttggtc acaccaataa atctgtttgg
      301 tcgacgaatc ttactagagg aaatgagaga cttccggtgg tggcagacgt tctctctaat
      361 ggaaacttcg tgatgcgaga ctccagtaac aacgacgcaa gtgaatactt gtggcaaagt
      421 ttcgattacc ctacggatac tttgcttcca gagatgaaac tgggttacga cctcaaaaca
      481 gggttgaaca ggttccttac atcatggaga agttcagatg atccatcaag cggggatttc
      541 tcgtacaagc tcgaaacccg aagccttcct gagttttatc tatggcatgg gatctttcca
      601 atgcatcgga gtggtccatg gaatggagtc cgatttagtg gcataccaga ggaccaaaag
      661 ctgagttaca tggtgtacaa cttcacagag aatagtgaag aggtcgctta tacattccga
      721 atgaccaaca acagcatcta ctcgagattg acactaagtt ccgaaggcta ttttcagcga
      781 cttacgtgga atccgtcaat agggatatgg acagcgttct ggtcttctcc agtggacccc
      841 cagtgcgata catacataat gtgcgggcct tacgcttact gtggcgtgaa cacatcacct
      901 gtttgtaact gtatccaagg gttcaatccc cggaatatac agcagtggga tcagagagtc
      961 tgggcaggtg ggtgtataag gaggacgcgg cttagctgca gtggagatgg ttttacaagg
     1021 atgaagaaca tgaagctgcc agaaactacg atggcgattg tcgaccgcag tattggtgtg
     1081 aaagaatgtg agaagaggtg ccttagcgat tgtaattgta ctgcttttgc aaatgcggat
     1141 atccggaatg gtgggacggg ttgtgtgatt tggaccggac ggcttgacga tatgcggaat
     1201 tacgttgctc acggtcaaga tctttatgtc agattggctg ttgctgacct tgtttagctc
     1261 tttctcttaa aataaagcac ggatcc
//
LOCUS       BOVFPKPA     1835 bp ss-mRNA            MAM       14-AUG-1990
DEFINITION  Bovine fructose 6-phosphate, 2-kinase: fructose 2,6-bisphosphatase
            mRNA, complete cds.
ACCESSION   M34241
KEYWORDS    bifunctional enzyme; fructose 2,6-bisphosphatase;
            fructose 6-phosphate,2-kinase.
SOURCE      Bovine heart, cDNA to mRNA, clones BH[9G,7H].
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1835)
  AUTHORS   Sakata,J. and Uyeda,K.
  TITLE     Bovine heart fructose 6-phosphate, 2-kinase/fructose
            2,6-bisphosphatase: Complete amino acid sequence and localization
            of phosphorylation sites
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4951-4955 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Uyeda, 11-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept    /    30     1622     fructose 6-phosphate,2-kinase:fructose
                                 2,6-bisphosphatase (EC 2.7.1.105 and EC
                                 3.1.3.46)
BASE COUNT      466 a    493 c    493 g    383 t
ORIGIN      
        1 caaccgacat ctcctgaaga gttgccatgt ccgggaatcc tgcctcttcc tcagaacaga
       61 acaacaatag ctatgaaacc aaagcaagtc tccgaatatc agagaagaaa tgttcatggg
      121 catcttacat gacaaactct ccaaccctca tcgttatgat tggcttgcca gcccgtggca
      181 aaacctacgt gtccaagaaa cttacacgct acctcaactg gattggggtg cccaccaaag
      241 tgtttaatct tggggtgtat cggcggcaag cagtcaagtc ctataagtcc tatgacttct
      301 tccggcatga caacgaggag gccatgaaga ttcgcaaaca gtgtgccctg gtggcactga
      361 aagacgtgaa ggcgtatctc acggaggaga gcgggcagat tgcggtgttt gatgctacca
      421 ataccactcg agagaggagg gacttgattt tgaactttgc cgaggagaat tctttcaagg
      481 tgttctttgt ggagtccgtg tgcgatgatc ctgatgtcat tgctgccaac atcctggagg
      541 taaaggtgtc gagccccgac taccctgaaa ggaacaggga gaatgtgatg gatgacttcc
      601 taaagaggat tgagtgctac aaagtcacct atcaacccct tgacccagac agccatgaca
      661 aggatctttc tttcatcaag gtgataaacg tgggccagag atttctcgtg aacaaagtcc
      721 aggactacat ccagagcaag atcgtctact acctcatgaa tatccacgtc caccctcgca
      781 ccatctacct ttgccggcat ggagagagcg agttcaacct cttggggaag attgggggtg
      841 actcaggcct ctcagtgcga ggaaaacagt ttgcccaggc tctaaggaag tttctggagg
      901 aacaggagat agcagacctc aaagtgtgga cgagccagtt gaaaaggact atccagaccg
      961 cagaatctct gggggtgacc tacgagcagt ggaagattct gaacgagatt gatgctggcg
     1021 tgtgcgagga gatgacttac gcggagattc aggagcagta tccggatgag tttgcgcttc
     1081 gagatgaaga gaaatatctg taccgatatc ctggagggga gtcataccag gacctcgtgc
     1141 agcggttgga gccagtcatc atggagctgg agcgccaggg caatgtcctc gttatctccc
     1201 accaggctgt catgcgctgc ctcctggcct acttcttgga caagggtgca gatgagctac
     1261 catacctgag gtgccctctc cataccatct tcaaacttac tcctgtggcc tacgggtgca
     1321 aagtggaaac aattaaactc aatgtggagg ctgtgaacac gcaccgtgac aagccaacta
     1381 acaactttcc caagagccaa acccctgtaa ggatgagaag gaacagcttt acgcctctgt
     1441 ccagttcgaa tacaatcagg cgtccaagaa attacagtgt tgggagccgg cccctccagc
     1501 ccctcagccc gctccgtgcc ctggacacgc aagaaggggc cgaccagccg aagacccaag
     1561 cagagacctc gcgggctgca cacaggctcc cgtctccagc gccccccacg tcgccctcct
     1621 gatggcgtgg aggctgaggc cagaacgctc ccccggagac tggggtctgc tgaagacctg
     1681 ggcgtgtcag ctccactggg gctggagcag gaagtcccgt ggggctcgga cattcggggc
     1741 cacccgagtg agcctggaaa agctgccctc catttcttcc cctcatgcct aatgaaaacc
     1801 gtgaaaagct cggcttccct gtacctggcc cctcg
//
LOCUS       BOVTHYRA     2045 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Bovine thyroglobulin gene, 5' end.
ACCESSION   M35823
KEYWORDS    thyroglobulin.
SOURCE      Bovine DNA, clone CBT1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 2045)
  AUTHORS   Ledent,C., Parnentier,M. and Vassart,G.
  TITLE     Tissue-specific expression and methylation of thyroglobulin-
            chloromyhenicol acetyltransferase fusion gene in transgenic mice
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by C.Ledent, 27-JUN-1990.
FEATURES       from  to/span     description
    mRNA       2037  >  2045     thyroglobulin mRNA, exon 1
    signal     2006     2011     TATA box
    rpt         826     1025     bovine monomeric repeat
    rpt        1712     1816     bovine monomeric repeat
BASE COUNT      467 a    508 c    579 g    491 t
ORIGIN      Chromosome 14q11-q15.
        1 gatccaggag ggagaggtga gagggccaga ggtgaggagc ctgcagagag aatggagacc
       61 caagccagga gcgggctcac ggaggccaga ccaaggagtt ggtgctgagt ttccgtagag
      121 catcctcctg gctcacgagg catcagatga tggtaggagc tgagtcatcg tgagaaagag
      181 gttttcacgg tgtggaggca gcgcctggca tggctctctc caagctcatc gcttcattgc
      241 tggagtggag agaagcacct ctccttccag gtacactgtt ttgactcaga aggaagcccg
      301 ggcggctctc accctcactg actctgctgt gcctcttccc aggggccagg ccagggagaa
      361 tgactcagca cacaagctgg cctgtgcaag accctgcctg catgcatggg tgggggaagc
      421 atgaggagtg ggctttctgg tagacctcag tcatggtctc agcattattt tttaaaagga
      481 ttttatttat ttgttttgat gtggaccagt tttaaaatct ttattgaatt tgttaccatg
      541 ttgcttctgt tttatgtttt ggttttttgg ccacgaggca tgggggatgt tagctccccg
      601 accagggact gaacccttgc cccctgcatg gaaaggtcaa gtcttaactt cttgactgcc
      661 taggaagtcc cggtttcaac attttagaaa attcatctgc tttagaacac ctggacctgg
      721 agagctggca agggctgcac atggccaggc tcacggtaat attacagcgt ctccatagcc
      781 tgggcagaac ttggcttgtg tgattggggc tgtgggctcc ccaggtggtg ctagtggtaa
      841 agaacccttc tgccagtgca ggagacataa gagatgcggg ttctatccct ggatcaggaa
      901 gatcccctgg agaaggaaat ggcaacctgc tccaggattc ttgcctggag aatcccaagg
      961 acagaagagc ctggcgggcc gtagtccatg gcgttgcaaa gagtcagaca caaccgaagc
     1021 gacttagcac ccaatggggg ctggagaggg ccaagctggg ggattctaat tctccagtaa
     1081 atgccagcat gtcttctcaa atcatcttta cctaaaacct aaggctcatt gtgatctccc
     1141 tcactcccaa gtgctctccc acacctgccc acctaccttc tgaacaccca tggagcctgt
     1201 ctccatcccc caccccttta atgggagaag ccactggaga aagctcactg atgtacaggc
     1261 tttatgtgga agacaagctt cctgctgcct tttggttgtc tgacgtcctg ggacagaggg
     1321 gaaaggggga tgactacgag tatgactgtg cgtgtgtttg gcttatctca tcaaaatctc
     1381 tacattctgt gttaatggat ctgcctgttt tgttccctgc catatcctca tggcctagaa
     1441 tagtgtctgc ttctctatca gactctaaag aaacattgct aggagggaag gaaggagcat
     1501 ggatgaggag ggagggagca ttgtgtttct ctcacggtgg gcctgaacgt gtggcccacc
     1561 aagttgttaa ctttggcctt tacccctgaa gatgaattat gaagccacac ccccagttct
     1621 tccttggtgg ctcagatggt caagaatcca cctgcaatgc gggagacctg ggtttgatcc
     1681 ctgggttggg aagatcccct ggagaaggga atggctaccc actccagtat tctggcctgg
     1741 agaatcccat ggacagagga gcctggcggg atgcagtcca tggggtctca gagagtcaga
     1801 tgtgactgag cgactttcac acacattcgt ccctggttct gctcccctac agcctccaca
     1861 agattttcac cccacactgg ccacatgagt gtcctccagg ggaacagacg caggtggagg
     1921 acctccttgt gaccagcaga gaaaacaggg tgggcactgc ttccctgagt gcctgtgggt
     1981 gggggctaag tacccacagc agtgctataa aggctccttg gccagagccc taaggtgggc
     2041 agcag
//
LOCUS       BRPTUBBA     4571 bp ds-DNA             INV       14-AUG-1990
DEFINITION  B.pahangi beta-tubulin gene, complete cds.
ACCESSION   M36380
KEYWORDS    beta-tubulin.
SOURCE      B.pahangi DNA, clone pBTY.
  ORGANISM  Brugia pahangi
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Gnathostomatidae.
REFERENCE   1  (bases 1 to 4571)
  AUTHORS   Guenette,S., Prichard,R.K., Klein,R.D. and Matlashewski,G.
  TITLE     Characterization of a beta-tubulin gene and beta-tubulin gene
            products of Brugia pahangi
  JOURNAL   Biochem. Pharmacol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Guenette, 11-JUL-1990.
FEATURES       from  to/span     description
    pept        396      452     beta-tubulin, exon 1
                891      999     beta-tubulin, exon 2
               2115     2341     beta-tubulin, exon 3
               2424     2552     beta-tubulin, exon 4
               2718     2880     beta-tubulin, exon 5
               3002     3192     beta-tubulin, exon 6
               3299     3394     beta-tubulin, exon 7
               3630     3816     beta-tubulin, exon 8
               4047     4234     beta-tubulin, exon 9
    pre-msg <   395  >  4400     beta-tubulin mRNA and introns
    IVS         453      890     beta-tubulin intron A
    IVS        1000     2114     beta-tubulin intron B
    IVS        2342     2423     beta-tubulin intron C
    IVS        2553     2717     beta-tubulin intron D
    IVS        2881     3001     beta-tubulin intron E
    IVS        3193     3298     beta-tubulin intron F
    IVS        3395     3629     beta-tubulin intron G
    IVS        3817     4046     beta-tubulin intron H
    signal       66       69     reverse CAAT box
    signal       90       95     weak TATA box
    signal     4395     4400     polyA signal
BASE COUNT     1278 a    783 c    910 g   1600 t
ORIGIN      
        1 ggtgtgcaga tagcacgagg aaatgagagg agcagagaag tagggcgcag tcagacagtt
       61 ggatgattgc gcctcccgtg tcacttccct atataccttt gcaaggagga catgacaaag
      121 ttcgaggtta tttctcgacg cgtctctctg tatactcaca tgtctgtctg tctgtctgtt
      181 tgtccgtctg cctgtctgcc ttggttgcta ggaagcccgt cgactttctc cacagtttag
      241 tgagtgtttg tgtgatggat aggatggttg gcgcgcggat gggtcggcca aacagttctg
      301 tgactgattg tagttgccgg atgacttgat cagtaattac cggattgatt gcaggtctca
      361 tttcggtcga caagatttca ttaagtgttt aagctatgag agaaattgtc cacgttcaag
      421 ctggtcaatg tggcaaccag attggtgcca aggtattgat tttctgctct ttctctttat
      481 cctttaggaa gatgactcat cagagggtat ctttccgcag aaattttccc gagatataat
      541 tcccacttcc cctcattgaa attgttttgg tatttaggta tcatcgtcaa aatattccat
      601 ttcatttctt aatgtgctat cgtttatcat ctgctcttcg attgtatctt ctcccttctt
      661 tgcttcattg attcctaatg caatacataa tacctaccat ctatcacggt taataaaatt
      721 cgatgttaga tcacaaagat caagttgtga taatctttcg tataataagt tgttgttttc
      781 aacaatctgc agttaggtaa gatgtctgag gagcaagcaa aatgtggatg taggggatta
      841 ctgaagttaa aaattgaaaa ttttaaacat aacttgagtt tttttttcag ttctgggaag
      901 taatatcgga tgaacatggt gttcaacctg atggtacata caaaggtgat tcagacctgc
      961 aaattgaacg aatcaacgtc tactataatg aagcgaatgg ttggtccatt attggagttt
     1021 tttttcttct ttgctttgga atgttctttt ttgttttctc tactagttgt tgccactcgt
     1081 tttcacacac attatacata gttctacata cacatgcata ccacacatag tcccatatat
     1141 aaacaattgt tgtctgacat cgaacattag ttcggcggat gaggttcatc agggcgtcct
     1201 caaaataatg tgttcgagaa ttaaatgaag aatgtctgcg tctcttttca aggtccatcg
     1261 tggtgaaatc atttgagaca taaggagatg gatggactgt ttaaattaga taaatttaaa
     1321 tacaagtttc agaataaccg acataatcaa agatgttttt gttacagaaa tatttgggaa
     1381 tgattatagc aagattgtaa aacagatatt ttgacatcta cctgtctttg atcacaacta
     1441 ttgtttatga tatttttggg atttttgcaa tgtaatgaca cttattttta gaatatcgta
     1501 aacactttct gacctaactt gtgatttgtt cgaagcctgg tagtgaaatt tatttggcgg
     1561 ttgtgaaaat acagtaaata accgcactat ataaggaaag aataacattg tcactttaac
     1621 caaacaaatc tcaaattgtc gtcacttcaa ccaaaacctt atcaaacagt gtttttgacc
     1681 tattctaact tctaacccca gtgtttcttg attgtcttga ttgttattct ataacaccta
     1741 caaagaataa gaattccttc aattcaaact tttcacgagt tattcgtctt tgtcagtcta
     1801 cactatgagg acttcgtcgt caacttattc caagataaaa aactttcagt gcaacagtct
     1861 tctttttttt ctcatctctt cgtaatggaa ttaaaaccaa acaagacgtt cgtcatatgt
     1921 tagcttgtca gtaatgtttg taatacatct ttattttcgt ctttttgttt ttcctctctt
     1981 cattatgcag tgaaaaggag tatttttctt taggttttct gtaaagatat acttgataat
     2041 agattgccat aatgaaccaa gcatcgttgt aaatcttcac ttcgtctcaa gcgatattaa
     2101 ttttttgtgt tcagggggca aatatgtgcc acgagcagtc cttgttgatt tggaaccagg
     2161 taccatggat tctattcgag gaggtgagtt cgggcaacta ttccgacctg acaattttgt
     2221 ttttgggcaa agtggagctg gcaacaactg ggctaaggga cattatacgg aaggtgcgga
     2281 actagttgat aatgtgttgg acgtgatacg aaaagaagct gagggatgcg attgtcttca
     2341 ggtacggatt gccatagttt tataagacat tttaattgtg gatgtgtttt tattttaaag
     2401 aaaaaattaa aaaatatttt cagggatttc aactaacgca ttcacttggt ggtggtaccg
     2461 gttccggcat gggaacattg ctgatctcga aaattcgtga ggagtatccg gatcgaatta
     2521 tgagctcttt ttcggttgtg ccatcgccca aagtatgtat ttgaatttct gattcttatt
     2581 tttaacatct gttatatcta ttttaaacat agctccataa cttttaagtg gcggttgtct
     2641 cacaaagtgg tttttcggct atcatttcat tttcatagtt gaagaaaaag tatgctagac
     2701 gtattaatga tttccaggta tcagatgttg tgttggaacc ctacaatgca acattatcag
     2761 tccaccaact agttgaaaac actgacgaaa ctttctgcat tgataacgag gctttgtatg
     2821 acatctgctt ccgaacgttg aagttggcaa atccaactta cggtgacctc aaccatttgg
     2881 gttcgtcctt taatttattc ttcgaactgt ccggtttaaa tgcattctat ttaactattt
     2941 ataaaattcg gatttaattg taatttattc aactcttcta aaggttaaaa tttctattta
     3001 gtgtctgtga caatgtcggg agtaacaact tgcttacgtt tccctggaca gttgaacgcc
     3061 gatctccgta aacttgccgt caatatggtg ccattcccac ggttgcattt ctttatgcca
     3121 ggatttgctc ctctctctgc tcgtgatgct gctgcttatc gagccctcaa tgttgctgaa
     3181 cttactcaac aggtctctgt ttatttttat ggtcagggtc tctttttttg aaagtaatcc
     3241 atttataatc gctttaaatg tctcctgttt tatgtttttt aatatcttat ttcttcagat
     3301 gtttgatgcc aaaaatatga tggcagcatg tgatccgcgt catggtcgtt acctaaccgt
     3361 agctgccatg ttccgaggta gaatgtctat gcgggtaagt gaattttgat aatctccatg
     3421 ttaaactcag ccgctgaagc atgaagattt taatccatgt gatgtcggtt tttcttgacc
     3481 tgaacttagt ggtgtatcac ttggtcttag ttatgttctt cctatttcca gagtgcttgc
     3541 ttaactgcaa ttcttttgtg gaagcttgtt tttgatgtag ggtttttttc ttcatcgctg
     3601 tttcaaagtt tttaataagt aggtcttagg aagtagacga gcaaatgatg caagtacaga
     3661 ataagaattc atcgtatttc gttgaatgga ttccaaataa cgtaaagaca gctgtttgcg
     3721 acattccacc acgtggatta aagatgagcg caacatttat tggaaataca acagctatac
     3781 aagaactttt caagcgaatt tccgaacagt ttactggtga acttattaat tccatctcat
     3841 ttgagcttag tttgatagct aaaaactaaa tttctggcaa gattgaaaag agatacaatt
     3901 gggtatgtgt aggttacatg atttttttgc catattatga gttttatgca aaaggtgaac
     3961 atttttaata tgtttgttcc gaatcgtctg aacttgcgtt ctatggaggt tatacttaag
     4021 tcgatccgat tttttaatga tttcagccat gttccgacgt aaagcattct tgcattggta
     4081 tactggcgaa ggtatggatg aaatggaatt cacggaagcg gagagtaata tgaatgactt
     4141 ggtgtccgaa tatcaacaat atcaggatgc gacggctgat gaagaaggtg atcttcagga
     4201 aggtgaatcg gaatacattg aacaggaaga gtgagcacaa agtgtcagct ttgtgaaaaa
     4261 ggaactttta cagtatttta tttatcgtct tttccatgtt tttatgttgt tacattttat
     4321 tttgttactc gctttcgaca tatattagca agttttaaag aattatagcg gcaaatctca
     4381 ttgttgtact tctcaataaa ttacaatcaa aagttaatga ttcctaatgc aaagtcgtaa
     4441 ggcagaaatc acgtgaaaaa cagtaatcat gattctctga aagttccaat ttgaaataaa
     4501 gtactaattt tatgtgggca gtggacggta ttccgtgaca gaagatgcta cagaagattg
     4561 cagcagtttt t
//
LOCUS       BSPCELALKA   3498 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  Bacillus sp. alkaline cellulase gene, complete cds.
ACCESSION   M27420
KEYWORDS    alkaline cellulase; endo-beta-1,4-glucanase.
SOURCE      Bacillus sp. (strain KSM-635) DNA, clone pBC100.
  ORGANISM  Bacillus sp.
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 3498)
  AUTHORS   Ozaki,K., Shikata,S., Kawai,S., Ito,S. and Okamoto,K.
  TITLE     Molecular cloning and nucleotide sequence of a gene for alkaline
            cellulase from Bacillus sp. KSM-635
  JOURNAL   J. Gen. Microbiol. 136, 1327-1334 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by S.Ito, 30-AUG-1989, for release after publication.
FEATURES       from  to/span     description
    pept        605     3430     alkaline cellulase precursor (EC 3.2.1.4)
    sigp        605      691     alkaline cellulase signal peptide
    matp        692     3427     alkaline cellulase
    binding     530      536     SD1 ribosomal binding site
    binding     595      600     SD2 ribosomal binding site
    signal      471      476     -35 region
    signal      494      499     -10 region
BASE COUNT     1133 a    605 c    704 g   1056 t
ORIGIN      
        1 ggcacctgag gaagtagcga atcctgttct tgatgcacat ccatacttaa cttctggatt
       61 tgccttcatg agtcgtgatg aaaacggaag tgcaccatta catggactgt ttgcatttaa
      121 ttattcggca ctgattagct gtggcatttc cgcttctgct ctttctggaa tgaagtacgg
      181 ggtcccaaga cttgtcactg ccattgccga tcagttattc caagatgatc gagacgagat
      241 tctaaaggac ttctttgagt atgatgagaa ggagtttgtc ggaaactggc ctttaaacgt
      301 ctaaatgaac ataatagcga aagggcttaa ccaaaaatat gaattgaacc cacataaatt
      361 tgtgggtttt tattaatcaa aaaaatggta aataaaccta ttttaacaat gcttataacc
      421 atttttctat ttattgcata aaaaaatcag taaaaaaatt catttatatg tagacgtaaa
      481 ttaacaaata ttatattata tatacgaaag cggtttcgaa aaatagagga aggaggagag
      541 tttttagttt ttgttgtttg tttattgtaa gcgtttacta ttaatacatt tctgggaggt
      601 tattatgaaa ataaagcaaa ttaaacaatc tttatctttg cttttaatca tcacactcat
      661 tatgtcacta tttgttccta tggcttcagc aaacacaaat gagtctaagt ctaatgcatt
      721 tcctttttct gatgttaaaa aaacttcttg gtcttttcca tatataaagg atttatatga
      781 gcaagaagtt attacaggaa catctgcaac aacgttctct ccaacagatt ccgttactcg
      841 tgcacaattt acagtgatgc ttacccgtgg tcttggacta gaagcatctt ctaaagatta
      901 cccttttaaa gatcgtaaaa actgggctta caaagaaatt caagctgcat atgaagctgg
      961 aattgtaact gggaaaacaa acggtgaatt tgcaccaaat gaaaacatta ctcgtgaaca
     1021 aatggctgct atggccgtac gtgcttatga atacttagaa aatgagctat ctttaccaga
     1081 agagcaaaga gaatataatg actcttcttc tatttcaacc tttgctcaag atgctgttca
     1141 aaaagcatac gtattagagc taatggaagg aaatacagat ggatattttc aaccaaaaag
     1201 aaactctact agagaacagt ctgctaaagt tatctctact ttactttgga aagtagctag
     1261 tcatgattat ttataccata cagaagctgt taaaagccct tcagaagctg gtgcgcttca
     1321 gttagtagaa ctaaacggac aattaacact agctggtgaa gatggtactc ccgttcaatt
     1381 acgtggaatg agtacacatg gcctacaatg gttcggtgaa atcgtaaacg aaaacgcttt
     1441 cgtagcacta tcgaatgatt ggggatctaa catgattcgt ctcgctatgt acattggcga
     1501 aaatggatat gcaacaaacc ctgaagtaaa agatttagtt tatgaaggaa ttgaattagc
     1561 gtttgagcac gatatgtatg taattgttga ctggcatgta catgctcctg gtgatcctag
     1621 agcggatgta tactcaggtg cttatgattt cttcgaagaa attgctgatc attacaaaga
     1681 tcatccgaaa aaccattata tcatttggga actagcaaac gaaccaagtc caaataataa
     1741 cggtggacct ggattaacaa atgatgaaaa aggttgggaa gctgtaaaag aatatgcaga
     1801 gccaatcgtt gaaatgttgc gtgaaaaagg tgacaacatg attttagttg gaaatcctaa
     1861 ctggagccaa cgtcctgact tatcagctga caacccaatt gatgcagaaa atatcatgta
     1921 ttctgttcac ttctacacag gctcacatgg cgcttctcac attggttacc ctgaaggaac
     1981 accaagctct gaacgttcta atgttatggc taacgttcgt tatgctctag acaatggcgt
     2041 tgctgtgttt gcgacagagt ggggtacgag tcaagcgaat ggagatggag gaccttattt
     2101 tgatgaagct gatgtttggc ttaatttctt aaacaaacat aacattagct gggcaaactg
     2161 gtcgttaacg aacaaaaatg agatttctgg agcatttaca ccttttgagc ttggtagaac
     2221 agatgctaca gatcttgatc caggtgctaa tcaagtatgg gcacccgagg aactaagttt
     2281 atctggtgaa tatgttcgtg ctcgtattaa aggaattgag tatacaccta tcgaccgcac
     2341 aaaattcaca aagcttgttt gggattttaa cgatggaaca acacaaggat tccaagttaa
     2401 tggagacagc cctaacaaag aaagcattac tttaagtaat aataatgatg cattacaaat
     2461 tgaaggatta aatgtaagta atgatatttc tgaaggaaac tactgggata atgtacgcct
     2521 gtcagctgat ggctggagtg aaaatgtaga tattttaggt gctacagagc ttacaattga
     2581 tgttatcgtt gaagaaccga caacagtttc aattgctgct attccacaag gacctgctgc
     2641 tggctgggct aacccgacta gagcaattaa agtaactgaa gacgatttcg aatctttcgg
     2701 agatggatac aaagctctcg taactattac ttctgaagat tcaccttcac ttgaaaccat
     2761 tgcaactagt cctgaagaca atacaatgag caatatcatt ctatttgtag gtactgaaga
     2821 tgcagatgtt atttctttag ataatatcac ggtttctggt actgagattg aaattgaagt
     2881 tattcacgat gaaaaaggaa cagcaacact tccttctact tttgaagatg gaactcgcca
     2941 aggctgggat tggcatacag aatcaggagt taagacagct cttacaattg aagaagctaa
     3001 tggatctaac gctctttcat gggaatatgc gtatcctgaa gtaaaaccaa gtgatggttg
     3061 ggctactgct cctcgtctag acttctggaa agacgaacta gttcgtggca caagcgacta
     3121 tattagtttt gacttttaca tcgatgcagt tcgtgcttct gaaggtgcta tatcaattaa
     3181 cgccgttttc caaccacctg caaacgggta ttggcaagaa gttccaacta catttgaaat
     3241 tgatttaaca gagcttgatt ctgcaactgt aacttctgat gagttgtatc attatgaagt
     3301 aaaaattaac attagagaca ttgaggctat tacagacgat acagagcttc gtaacttatt
     3361 actaatcttt gctgatgaag acagtgattt tgctggtaga gtttttgttg ataatgtaag
     3421 atttgaataa tttaaaaaca gtagatagag agactctcta tctacctgtt tattgcttac
     3481 tattcgtctt ccactttt
//
LOCUS       BSUPHOA3A     743 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  B.subtilis alkaline phosphatase (phoAIII) gene, complete cds.
ACCESSION   M33634
KEYWORDS    alkaline phosphatase.
SOURCE      B.subtilis (strain 168) DNA.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 743)
  AUTHORS   Bookstein,C., Edwards,C.W., Kapp,N.V. and Hulett,F.M.
  TITLE     The Bacillus subtilis 168 alkaline phosphatase III gene: Impact of
            a phoAIII mutation on total alkaline phosphatase synthesis
  JOURNAL   J. Bacteriol. 172, 3730-3737 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.M.Hulett, 06-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        503  >   743     alkaline phosphatase precursor (phoAIII, EC
                                 3.1.3.1)
    sigp        503      598     alkaline phosphatase signal peptide
    matp        599  >   743     alkaline phosphatase
BASE COUNT      221 a    149 c    131 g    242 t
ORIGIN      
        1 gatccgtaaa catggctgct tccattttat ttctgtatgt aaatttggat atttattttt
       61 tatccaagtt aacacctctt ccgttctttc tcggtggaac gggtcagcaa tccccgctaa
      121 gtactcagaa aaaacgtcca tgttgttccc tcctaaaata aaatgatgcc actccatatg
      181 aactgctcag acatgtgctt acacaatacc tcttgccaac gtaatacttt aaattgttca
      241 agatgtaagc gggttgcttc tatgcgacta tagcagattt cagtccgcct atccacggcc
      301 agtatctatt cctccaattt atttctttag tactatcgct ctttccttct aaaacttctc
      361 ataaaagaat aaccattatt taagggtgcc agttcattat tcttgtaaat ccaatcttta
      421 aaatcgatta atactagctt aacagtttaa aaatataatt gggttgtcat tgagattcat
      481 ctatatttag gaggttatcc agttgaaaaa attcccaaag aaattactgc ctatcgcggt
      541 tttatcatca attgcgttca gcagcttagc cagcggcagt gtgcctgaag ccagcgccca
      601 ggaaaagaaa aaggggaacc aagacgaaat taaaaatgtt attgttctga ttggtgatgg
      661 tatgggtgtg tcttatacgt ctgcttatcg atacttaaag gataataaaa agacaaaagt
      721 tgtggagcct acggcttttg atc
//
LOCUS       CHTSIGAA     2180 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  C.trachomatis sigma-A protein (sigA) gene, complete cds.
ACCESSION   M36475
KEYWORDS    sigma factor; sigma-70 homolog.
SOURCE      C.trachomatis (strain MoPn) DNA.
  ORGANISM  Chlamydia trachomatis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE   1  (bases 1 to 2180)
  AUTHORS   Engel,J.N. and Ganem,D.
  TITLE     A polymerase chain reaction-based approach to cloning sigma factors
            from eubacteria and its application to the isolation of a sigma -70
            homolog from Chlamydia trachomatis
  JOURNAL   J. Bacteriol. 172, 2447-2455 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        360     2075     sigma-A protein (sigA)
BASE COUNT      679 a    380 c    517 g    603 t      1 others
ORIGIN      
        1 tctagaggga tttgcccaat gggagcgaga tttattaaca gaatctcgaa cagaacgctc
       61 tggcaaaatc gtaattaaaa cgaaagcttt tggagccact ccgttctact attgtgtagt
      121 aacgttagat aaaggacctt tggcagaaca tgtattaggg gttttatacc ctgcaaaagc
      181 aagttttttt acaaatcttt cctatattta atgataaata gggatgtcgg tgtaggaagt
      241 ttttctaggg ccgaatctga tcagctggtt ttttatctaa aaatccctag atttcttgtt
      301 ttccttaaga taaactgtca ctttctaggc cgatttttcc ttagttttaa tttgttagca
      361 tgcgcatcca tacgctagat agtcaacgtg ctgatgcagc ccaagaagaa gaaatccaaa
      421 gaaagttaga agagctagtc actcttgcta aggatcaagg gttcatcacg tatgaagaaa
      481 ttaatgaaat tcttccccct tctttcgatt cgccagaaca gatagatcaa gttttaattt
      541 ttctggcggg gatggacgtt caagtcttaa accaagcaga cgtagagcgg cagaaagaaa
      601 gaaaaaaaga agctaaagag ctagaagggt tggctaagcg ttctgaggga acgcctgatg
      661 atccagtgcg tatgtatctg aaggaaatgg gtacagttcc tcttcttaca agagaagagg
      721 aggtggagat ttctaaaagg atagaaaaag ctcaggtaca aatagaaaga attattttac
      781 gctttcgtta ttcgactaaa gaagcggttt ctattgcgca atacttaatt aatggtaagg
      841 aacgatttga taagatcgtt tccgnaaaag aggtggaaga taagacacat tttcttaatc
      901 ttttgccgaa gttaatctct ttgctgaaag aagaggactc ttacttagaa gagcggcttt
      961 tggctttgaa agatccggca ctgtccaagc aagatcaagc aaagttgaac gatgagcttg
     1021 aaaaatgccg tattcgaaca caagcttatc tgagatgttt tcattgtcgt cacaatgtca
     1081 cggaagattt tggagaagtt gtttttaaag cttatgactc gttcttacag ttggaacaac
     1141 aaatcaatga tttaaaggtt cgtgctgaaa gaaataagtt tgctgcagca aaactagctg
     1201 cagcccgacg caagctttac aaacgagaag ttgcagcggg acgtactctt gaagagttta
     1261 aaaaagatgt gcggatgttg cagcgttgga tggataaaag ccaggaagca aagcaagaga
     1321 tggtggaatc caacttacgt ttggtgatct ctatcgcgaa aaaatatacc aaccgagggc
     1381 tgtctttctt ggatttgatt caagaaggaa atatgggctt aatgaaagct gttgaaaaat
     1441 ttgagtatcg ccggggttat aaattttcaa cttatgccac ttggtggatt cgtcaggctg
     1501 tgacgcgagc tattgccgat caggcaagaa ccattcgaat tcctgttcat atgatcgaga
     1561 ccattaataa ggtgcttcgt ggagccaaga aattaatgat ggaaactgga aaagagccta
     1621 cgcccgaaga actcggagag gaactaggtt tcactccaga ccgtgttcga gaaatttata
     1681 agatcgctca gcatccgatt tctttacagg ctgaggttgg agatagtgga gaaagctctt
     1741 ttggagattt cttggaagat acagctgttg aatctccggc agaggcaaca ggctactcca
     1801 tgttgaaaga caaaatgaag aaagtgctaa aaacgcttac tgatcgcgaa cgttttgttt
     1861 tgatccatcg gtttggcctt ttagatggcc gtcccaaaac tttggaagag gtaggctccg
     1921 cgttcaacgt gacacgagag cggattcgac aaatcgaagc caaagctttg cgaaaaatgc
     1981 gtcatcctat ccgttccaaa cagctacgag cattcttgga tttattggaa gaagagaaga
     2041 ctggttcggg caagattaag agttataaga attaaggatt ccttttcata aggagtatgc
     2101 ttgtatcggt tagacgtaac gaactttcgt gtttgggtat ctataggggt ctcagagcag
     2161 gaacgctatc acaagcaacc
//
LOCUS       CUSRGSBHA    1137 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  Melon 17S rRNA, 5.8S rRNA, and 25S rRNA gene region.
ACCESSION   M36377
KEYWORDS    17S ribosomal RNA; 25S ribosomal RNA; intraribosomal spacer region.
SOURCE      C.melo 4-wk old cotyledon DNA, clone pML17.
  ORGANISM  Cucumis melo
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Violales; Cucurbitaceae.
REFERENCE   1  (bases 1 to 1137)
  AUTHORS   Kavanagh,T.A. and Timmis,J.N.
  TITLE     Structure of melon rDNA and nucleotide sequence of the 17-25S
            spacer region
  JOURNAL   Theor. Appl. Genet. 76, 673-680 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA    <     1      259     17S rRNA
    rRNA        476      638     5.8S rRNA
    rRNA        876  >  1137     25S rRNA
BASE COUNT      253 a    333 c    324 g    227 t
ORIGIN      
        1 atagatcatt gcaattgttg gtcttcaacg aggaattcct agtaagcgcg agtcatcagc
       61 tcgcgttgac tacgtccctg ccctttgtac acaccgcccg tcgctcctac cgattgaatg
      121 gtccggtgaa gtgttcggat cgcggcgacg tgggcggttc gctgcccgcg acgtcgcgag
      181 aagtccactg aaccttatca tttagaggaa ggagaagtcg taacaaggtt tccgtaggtg
      241 aactgcggaa ggatcattgt cgatgcctaa acatcaaacg acccgcgaac gcgtttaaaa
      301 acaaactgtt cgcgttaggg gcggggggaa gcatgctctt tggctgcctc ctccccttcc
      361 aacgcgttta aacaaaaccc cgcgcaggtc gcgccaagga acttgaaatg aattcgcctg
      421 tcccctgccc cggcctcggc gtgcggggga tggagcattc tagtcgtatt actaacaacg
      481 actctcggca acggatatct cggctctcgc atcgatgaag aacgtagcga aatgcgatac
      541 ttggtgtgaa ttgcaggatc ccgcgaacca ccgagtcttt gaacgcaagt tgcgcccgga
      601 gccttctggc cgagggcacg tctgcctggg cgtcacgcat cgctgccccc accacacaac
      661 tctccccatg cggggtcgtt gtgaaggcag ggacacacac tggcctcccg tacgcaccgt
      721 cgtgcggatg gcttaaattt gagtcctcga tgctcgtcgt cgcgacacta cggtggttga
      781 ttcaactcgg tgacgcgtct cgacctcgac gtcgacttca cggactcctt cacgaccctt
      841 cgaacgccgc cccttaaaag gacgacgctc tcgacgcgac cccaggtcag gcgggactac
      901 ccgtgagagt ttaagcatat caataagcgg aggaaaagaa acttacaagg attcccctag
      961 taacggcgag cgaaccggga agagcccagc ttgagaatcg ggcgtcctcg acgtccgaat
     1021 tgtagtctgg agaagcgtcc tcagcggcgg accgggcaca agtccctgga aggggcgcca
     1081 gagagggtga gagagcccgt tgcgctcgga ccctgtcgca ccacgaggcg ctgtcaa
//
LOCUS       DOGFIXA      3080 bp ss-mRNA            MAM       14-AUG-1990
DEFINITION  Canine blood clotting factor IX mRNA, complete cds.
ACCESSION   M33826
KEYWORDS    blood clotting cascade protein; blood clotting factor IX; protease.
SOURCE      Dog hemophilic liver, cDNA to mRNA, clone dF9.28.
  ORGANISM  Canis familiaris
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae.
REFERENCE   1  (bases 1 to 3080)
  AUTHORS   Axelrod,J.H., Read,M.S., Brinkhous,K.M. and Verma,I.M.
  TITLE     Phenotypic correction of factor IX deficiency in skin fibroblasts
            of hemophilic dogs
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5173-5177 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.H.Axelrod, 23-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         22     1380     factor IX protein precursor
    sigp         22      138     factor IX signal peptide
    matp        139     1377     factor IX blood circulatory protein
    matp        577     1377     factor IX activated protein
    matp        673     1377     factor IX activated protein
    mRNA    <     1     3080     factor IX mRNA
BASE COUNT      878 a    640 c    677 g    885 t
ORIGIN      Chromosome X.
        1 aagcagcgcc tgaataggat catggcagaa gcatcgggcc tcgtcaccgt ctgcctttta
       61 ggatatctac tcagtgccga atgtgcagtt tttcttgatc gtgaaaatgc caccaaaatt
      121 ctgagtcggc caaagaggta taattcaggt aaactggaag agtttgttcg agggaacctt
      181 gagagagaat gtatagaaga aaagtgcagt tttgaagaag cacgggaagt ttttgaaaac
      241 actgaaaaaa ccactgaatt ttggaagcaa tatgttgatg gagatcaatg tgaatccaat
      301 ccatgtttaa atgacggtgt atgcaaggat gacattaatt cctatgaatg ttggtgtcga
      361 gctggatttg aaggaaagaa ctgtgaatta gatgtaacat gcaacattaa gaatggcaga
      421 tgcaagcagt tttgtaaatt gggccccgat aacaaggtgg tttgttcctg tactacggga
      481 taccaacttg cggaagacca aaggtcctgt gaaccagcag tgccatttcc atgtggaaga
      541 gtttctgtcc ctcacatttc tatgacacgc acccgtgctg aaactctttt ttccaatatg
      601 gactatgaaa attcaactga agtggaaaaa attttggata acgtcaccca accgcttaac
      661 gacttcactc gagttgttgg tggaaaagat gccaaaccag gtcaattccc ttggcaggtc
      721 cttttgaatg ggaaagttga tgcattctgc ggaggttcca tcatcaatga aaaatgggtg
      781 gtaactgcag cccactgtat tgagcctgat gttaaaatta ccatagttgc aggtgagcat
      841 aacaccgaga agagggaaca tacagagcag aagcgaaacg tgattcgcac tattcttcac
      901 cacagctata atgcaactat taataagtac aaccatgaca tcgcccttct ggaactggat
      961 gagcccttaa cgctgaacag ctatgtaaca cctatttgca ttgctgacag ggaatactcg
     1021 aacatcttcc tcaaatttgg gtctggctat gtgagtggct gggggagagt cttcaacaaa
     1081 gggcgatcgg cttcaattct tcaatacctt aaagttccac ttgttgaccg agccacgtgc
     1141 cttcggtcca cgaagttcac catttataac aacatgttct gtgctggctt ccatgaggga
     1201 ggtaaagatt catgccaggg cgatagtggg ggaccccatg tcaccgaagt agaaggcata
     1261 agtttcttaa ctgggattat tagctggggt gaagagtgtg cgatgaaagg gaagtatgga
     1321 atatatacca aggtgtcccg gtatgtcaac tggattaaag aaaagacgaa gctcacctaa
     1381 agaataatgt atttccaagg ttgacacgtt tagggtagaa aatggacaag gtcctttact
     1441 aactaatcac tttttttatc tctttagatt tgcactatat acattctctg catactgctt
     1501 tttctctttc tggggagaaa tctatctaga attcctattt tactagacta agtgaattag
     1561 aaaatgtaat cactacggga atgtactgtg atgggacttg tgaccactcc cacaggtcta
     1621 gcccttggca ccattgtgag gttaggttat tcctccgtcc cgtcagtgtt aagtttctcc
     1681 actggggcaa ctccctgatt ctccctcctt ggcagcattc catgttccag acctttctta
     1741 cctttcccat ggaaagcatc atcaaaatgt gttagatcta catccaggat atttgatcta
     1801 gttcagaaca agtctaacac ctcacttgtt aaggaagaac acaggagcaa ctgacaggtt
     1861 gcaactcacc agaaaacact atttcctttt ctgtatgctt attcctgctt ccttatctct
     1921 tctgtttcct aatcctgaaa tcagtgtctc tctttctctt tctctctctc tttttcctac
     1981 agaggtttaa aggagggaag ggacacatca tgctgtttta ctactgtcca cagttataca
     2041 tgtctatcaa acccagactt gctttcagtt tggtctttga cttgcttttc ggagcatagg
     2101 gatgaagcca ggtgcctgaa gaccttcagg aaaatttgtc tgaaagagtc atgttactga
     2161 aatacatggc atcgtggaag gaatgaccga cccatcagaa tacgtccaat aagcatttgt
     2221 aattgtgttg tgatagaggt aactaagaag agtgacatca actcctgcgt ctcatccccc
     2281 atgtgaaaaa actgtgaact aaaagagaac agccagtgtg caacgtagaa ctagtagagt
     2341 cttaaggaag aattcattgg tgggtctcca gtactggccg gagctaagga agaagttgcc
     2401 ctcgaccaaa gaacatgagc accctgtctc ttaaactagc atgtccccac agtggagaag
     2461 gatgtactgg tggcttaaag gcatgagtca ggggtgtctg ggtggctcag tggttgagcc
     2521 tttggctcag ggcatgatcc cagcgtcctg ggattgagtc ccgcatcagg cttcctgcag
     2581 ggggcctgct tctccctctg cttatgtctc tgcttctctg tgtgtgtgtg tgtgtctctc
     2641 atgaaaaaaa taaataaaat ctttaaaaaa aaaaaaggta tgagtcattc cagtgagcca
     2701 tctcactccc ccttttctgg tttcaggttc tctaggcaac cttttgatta taattaggcc
     2761 ttccatattg aattttctaa agagttgctg accaacctct gcatgtactt tagtgtgtta
     2821 ggctggctcc ctttatgaat taataaactg ttgttctggt tgatactgta gctttttgtg
     2881 aatgtggctg atgtgaatca gtcatcccac atgttgttat atatgagact actgacaaaa
     2941 tcatgtttgg gaccactttg acctgaccaa gctgctgcct tcacctcacc cccagccagg
     3001 catttacttt agtccttcaa atattaagta ccaataaatg tcttttcaaa tttcaaaaaa
     3061 aaaaaaaaaa aaaaaaaaaa
//
LOCUS       ENHSREHPA     722 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  E.histolytica serine rich protein (SHEHP) mRNA, complete cds.
ACCESSION   M34438
KEYWORDS    serine rich protein.
SOURCE      E.histolytica (strain HM1:IMSS), cDNA to mRNA.
  ORGANISM  Entamoeba histolytica
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Lobosa; Gymnamoeba; Schizopyrenida; Tubulina; 
            Entamoebidae.
REFERENCE   1  (bases 1 to 722)
  AUTHORS   Stanley,S.L.Jr., Becker,A., Kunz-Jenkins,C., Foster,L. and Li,E.
  TITLE     Molecular cloning and expression of a membrane antigen of Entamoeba
            histolytica possessing multiple tandem repeats
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4976-4980 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.L.Stanley,Jr., 22-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept          4      705     serine rich protein precursor
    sigp          4       42     serine rich protein signal peptide
    matp         43      702     serine rich protein
    ORF         188      688     ORF2
    site        247      582     multiple tandem repeats
BASE COUNT      315 a    121 c    118 g    168 t
ORIGIN      
        1 tcaatgttcg catttttatt gtttattgca ttcactagtg caactaatat cattcttgat
       61 ttggatcaag aagtaaaaga tactaatatc tacggtgtat ttttaaagaa cgaagctagt
      121 cctgaaaagc ttgaagaagc tgaagaaaaa gaaaaaagta gctcagcaaa accagaatca
      181 agttcaaatg aagataatga agatgatgaa gatgaaaaag caagttcaag tgataactca
      241 gaatcaagct caagtgataa accagataat aaaccagaag caagttcaag tgataaacca
      301 gaagcaagtt caagtgataa accagataat aaaccagaag caagttcaag tgataaacca
      361 gataataaac cagaagcaag ttcaagtgat aaaccagata ataaaccaga agcaagttca
      421 agtgataaac cagataataa accagaagca agctcaagtg ataaaccaga taataaacca
      481 gaagcaagct caactaataa accagaagca agctcaacta ataaaccaga agcaagctca
      541 actaataaac cagaagcaag ctcaactaat aaaccagaag caagctcaac tagtaattca
      601 aatgataaat caggaagtag ttcagataac gataataata accttgatgc tgcatcaagt
      661 ccattcattg ttttctgtgc tatcattata gctatcatct tctaaatgag ttatttgact
      721 tt
//
LOCUS       EUBBAIOA     6212 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  Eubacterium sp. bile acid-inducible operon (baiBCDEA2F), complete
            cds.
ACCESSION   M36292
KEYWORDS    7-alpha-dehydroxylation; bile acid-inducible operon.
SOURCE      Eubacterium sp. DNA.
  ORGANISM  Eubacterium sp.
            Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE   1  (bases 1 to 6212)
  AUTHORS   Mallonee,D.H., White,W.B. and Hylemon,P.B.
  TITLE     Cloning and sequencing of a bile acid-inducible operon from
            Eubacterium sp. VPI-12708
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.Mallonee, 10-JUL-1990.
               Medical College of Virginia-VCU
               Dept. of Microbiology
               Box 678 MCV Station
               Richmond, VA 23298-0678
FEATURES       from  to/span     description
    pept         24     1586     baiB protein
    pept       1634     3256     baiC protein
    pept       3235     3552     baiD protein
    pept       3597     4097     baiE protein
    pept       4130     4879     baiA2 protein
    pept       4932     6212     baiF protein
BASE COUNT     1864 a   1324 c   1698 g   1326 t
ORIGIN      
        1 aaaagatatt aagcattaag aaaatgcaca aaaaatcagc gtgtgagagg gagggcaagg
       61 agttgaagcg tgactttttt aacaagttta atttggggac atcgaacttt gtcacgccgg
      121 gaaaacagtt ggaatacgtt tcggaatgca agccagattc tactgcggtc atttgcttag
      181 ataaagaaca gaactgttcc gttattactt ggcatcagct gcacgtctat tccagccagc
      241 tggcatggta ccttatagaa aatgagattg gcccggggtc gatcgtactt acaatgtttc
      301 cgaacagcat cgagcacatt attgcggtat ttgcaatctg gaaggcgggc gcctgctata
      361 tgcccatgtc ctataaggcg gcggaatccg agatcaggga ggcctgcgat accatccacc
      421 cgaatgcggc ttttgcggaa tgcaagattc caggattaaa attctgcctt agcgcagacg
      481 agatatatga ggcgatggaa ggaagatcca aggagatgcc ttcggaccgt ctggccaatc
      541 cgaacatgat atccttatca ggcggaacca gcggaaagat gaagttcatc cgtcagaacc
      601 ttccatgcgg gctggacgat gagacgatca gaagctggtc tttgatgtct ggaatgggat
      661 ttgagcagcg ccagctgctg gtaggcccgc tgtttcatgg cgcgcctcac tccgcggcgt
      721 ttaatggact gttcatgggc aacaccctgg tactgaccag gaacctttgc ccgggaaata
      781 tcctgaacat gattaagaaa tataagattg aatttataca gatggtgccg accctgatga
      841 accggcttgc caaactggag ggagtcggaa aagaagactt tgcatccctg aaggcgctgt
      901 gccatacagg gggcgtctgt tctccctggc ttaagcagat ctggatcgac ctgctggggc
      961 ctgaaaagat ctatgagatg tattccatga cggaatgcat cggccttacc tgcatccggg
     1021 gagacgagtg ggtgaagcat ccgggaagca tcggacggcc agtgggcgat agcaaggtgt
     1081 ctatccggga tgagaatggc aaggaagttg cgccttttga gattggcgag atctatatga
     1141 cagcgccggc ctcctatctg gttaccgagt acatcaattg ggaaccgctg gaagtgaaag
     1201 agggaggctt ccgaagcgta ggggatatcg gctacgtgga tgagcagggc tatctgtact
     1261 tttctgaccg gcgcagcgac atgctggtat caggcggaga aaacgtgttc gccaccgaag
     1321 tcgagacggc gcttttgaga tataaggata tcctggacgc tgtagtggta gggataccgg
     1381 atgaagatct ggggcgaagg ctccatgcgg tcattgagac agggaaagag ataccggcag
     1441 aggaactgaa aacattcctg agaaagtatc tgactccata taagatacca aagacgttcg
     1501 agttcgtaag gagcatacga aggggagaca atggaaaggc cgacaggaag cggatcctgg
     1561 aagattgtat tgcccgcggg ggatgattct ataaatgcaa agaaaacaaa ttatataaag
     1621 gaggagtaac aaaatgagtt acgaagcact tttttcacca ttcaaggtca gaggactgga
     1681 acttaaaaac cgtatcgtcc tgcctggaat gaacaccaag atggcaaaga acaagcacga
     1741 cataggcgag gatatgatag cctaccatgt tgccagggca aaagcgggat gcgcgttaaa
     1801 tatatttgaa tgcgtagcat tatgtccggc gcctcacgct tatatgtata tggggcttta
     1861 tacggaccat catgtagaac agcttaagaa attgacggat gcagtccatg aagcaggcgg
     1921 caagatgggc atccagctgt ggcatggagg attcagcccg cagatgttct ttgacgagac
     1981 caacaccctg gaaactccgg acactcttac ggtagagagg attcatgaga tcgtagaaga
     2041 attcggacgc ggcgcaagga tggctgttca ggctggattt gacgcagtag aattccatgc
     2101 ggctcacagt tatctgcctc acgagttctt aagccctgga atgaacaaac gtacggatga
     2161 gtacggcgga agttttgaga accgctgcag attctgttat gaagtcgttc aggcaatccg
     2221 ttccaatatc ccggatgaca tgccattctt tatgcgtgca gactgcatcg acgaattaat
     2281 ggaacagacc atgacagagg aagagatcgt tacatttatc aataagtgcg cagaacttgg
     2341 cgtggatgtg gcagaccttt cccgtggaaa cgcgacttca ttcgcaaccg tatatgaagt
     2401 tccgccattc aacctggctc atggcttcaa catagagaat atttacaaca tcaaaaagca
     2461 gatcaatatc ccggttatgg gagttggccg tatcaataca ggagagatgg caaacaaggt
     2521 cattgaagaa ggcaagtttg acctggtagg catcggacgc gcccagcttg cagatccaaa
     2581 ctggatcacc aaagtaagag aaggcaaaga agacctgatc cgccactgta tcggatgtga
     2641 ccagggatgc tatgacgcag tcatcaatcc aaagatgaag catatcacct gcacccacaa
     2701 tccaggattg tgcttagagt atcagggaat gccaaagaca gacgctccta agaaagtcat
     2761 gatcgtagga ggcggaatgg caggcatgat cgctgcggaa gtattaaaga ccagaggcca
     2821 taacccggta atcttcgagg catccgacaa gcttgcagga cagttcaggc tggcaggcgt
     2881 agcgccgatg aagcaggatt gggcagatgt tgcagaatgg gaagcaaaag aagtagagcg
     2941 ccttggaatc gaagtacgtc tgaataccga agtgactgca gagaccatca aggaattcaa
     3001 tccggataat gtcatcatcg cagtaggctc tacctatgcg ctgcctgaga ttccgggaat
     3061 cgacagccca agcgtatact cccagtatca ggtactgaaa ggggaagtaa atccgacagg
     3121 ccgtgtagcc gttatcggat gcggactggt tggtacggaa gtcgcagaac ttctggcatc
     3181 cagaggcgca caggtaatcg cgatcgagag gaagggcgta ggtaccggcc ttagatgctt
     3241 cgcagaatgt tcatgaaccc ggaattcaaa tattacaaga tcgccaagat gtccggaaca
     3301 aatgtcaccg ctttagagca gggcaaggtt cactacatca tgacagacaa gaagaccaaa
     3361 gaagtgacgc agggagtcct ggaatgcgac gctaccgtta tctgtacagg aattaccgca
     3421 cgtccaagcg atgggcttaa ggcaagatgc gaagaacttg gaatcccggt tgaggtgatc
     3481 ggagacgctg ctggcgcaag agactgcacg atcgcgacac gcgaaggcta tgacgcagga
     3541 atggcaatct agaaaatcag aacttatcaa tcttacatat agaaaggatg atacatatga
     3601 cattagaaga gagagttgaa gcattagaaa aagaattgca ggagatgaag gatattgagg
     3661 caatcaagga actgaaagga aagtatttcc gctgcctgga cggaaagatg tgggatgagc
     3721 tggagaccac cctgtcacca aatatcgtaa cctcttattc caacgggaaa ctggtattcc
     3781 atagcccgaa ggaagttacc gattacttaa agagctcgat gccaaaagaa gagatcagca
     3841 tgcatatggg ccacacgccg gagatcacca ttgacagcga gactacggct acgggcagat
     3901 ggtatctgga agatagactg atctttacgg acggtaagta caaagacgta ggaatcaatg
     3961 gcggcgcgtt ctatacagac aaatatgaga agatagacgg ccagtggtac atccttgaaa
     4021 ccggctatgt acgaatctat gaagaacatt tcatgcgtga tccaaagatc catatcacga
     4081 tgaacatgca caaataagaa tattgtaaaa gaaaggcagg agtaagagta tgaatctcgt
     4141 acaagacaaa gttacgatca tcacaggcgg cacaagaggt attggattcg ccgctgccaa
     4201 aatatttatc gacaatggcg caaaagtatc catcttcgga gagacgcagg aagaagtaga
     4261 tacagcgctt gcacagttaa aagaacttta tccggaagaa gaggttctgg gattcgcgcc
     4321 ggatcttaca tccagagacg cagttatggc agcggtaggc caggtagcac agaaatatgg
     4381 cagactggat gtcatgatca acaatgcagg aattaccagc aacaacgtat tctccagagt
     4441 gtctgaagaa gagttcaagc atattatgga catcaacgta acaggcgtat tcaacggcgc
     4501 atggtgcgca taccagtgca tgaaggatgc caaaaagggc gttatcatca acacggcatc
     4561 cgttacaggc atcttcggat cactctcagg cgtaggatat ccggccagca aggcaagcgt
     4621 gatcggactc acccatggac ttggaagaga gatcatccgc aagaatatcc gtgtagtagg
     4681 agtggctcct ggagttgtga acacggatat gaccaatggc aatcctccgg agatcatgga
     4741 aggatatctg aaggcgcttc cgatgaagag aatgcttgag ccggaagaga tcgctaatgt
     4801 atacctgttc ctggcatctg acttggcaag cggcattacg gctactacgg tcagcgtaga
     4861 cggggcttac agaccataat tttaattttt actaagtaga atatgtgata tagaaaagga
     4921 gatataaaaa catggctgga ataaaagatt ttccaaaatt cggagctctt gcagggctta
     4981 agatacttga cagcggatct aacatcgccg gacctttagg cggaggcctt ctggcagaat
     5041 gcggagcaac ggtcatccat tttgaaggac caaagaaacc tgataaccag agaggatggt
     5101 acggctatcc acagaatcac cgtaatcagc tgtctatggt agcagacatc aaatctgaag
     5161 aaggaagaaa gatcttcctt gatctgatca aatgggcaga tatctgggta gagtcatcca
     5221 aaggcggaca gtatgacagg ctgggacttt ccgatgaagt catctgggaa gtaaatccta
     5281 agattgccat cgtgcacgta tccggatatg gacagacagg agacccgtct tacgttacac
     5341 gtgcatccta tgacgcagta ggccaggcat tcagcggcta tatgtcactg aacggaacaa
     5401 cggaagcgct gaagatcaat ccttatctga gcgatttcgt atgcggactt accacatgct
     5461 gggctatgct tgcctgctat gtaagcacca ttcttaccgg aaaaggcgaa tctgttgacg
     5521 ttgcacagta cgaagcgctg gcacgtatca tggacggacg tatgatccag tacgctacag
     5581 acggcgtgaa gatgccaaga accggcaata aggatgcgca ggctgccctg ttcagcttct
     5641 acacctgtaa agacggacgt acgatcttta tcggaatgac tggcgcggaa gtatgtaaga
     5701 gaggcttccc gatcatcgga cttccggtac ctggaaccgg agacccggac ttcccggaag
     5761 gcttcacagg ctggatgatc tatactcctg taggacagag aatggaaaag gctatggaga
     5821 agtatgtatc tgagcatacg atggaagaag tagaggctga gatgcaggca caccagattc
     5881 catgccagag agtatacgag ctggaagact gcctgaacga tcctcactgg aaagcacgtg
     5941 gaactattac ggagtgggat gacccgatga tgggacatat cacaggcctt ggactgatca
     6001 acaagttcaa gagaaatcct tccgaaatct ggagaggcgc tccgctgttc ggtatggata
     6061 accgcgatat cctgaaagac ctgggatatg acgatgcaaa gatcgatgaa ctctatgagc
     6121 agggcatcgt caatgaattc gaccttgaca ctactatcaa acgctataga ctggatgaag
     6181 taattccaca tatgagaaag aaagaggagt aa
//
LOCUS       FDICPECDA    2371 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  F.diplosiphon phycoerythrin-associated linker protein (cpeCD) gene,
            complete cds.
ACCESSION   M33832
KEYWORDS    cpeC gene; cpeD gene; phycoerythrin-associated linker protein.
SOURCE      F.diplosiphon DNA.
  ORGANISM  Fremyella diplosiphon
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria.
REFERENCE   1  (bases 1 to 2371)
  AUTHORS   Federspiel,N.A. and Grossman,A.R.
  TITLE     Characterization of the light-regulated operon encoding the
            phycoerythrin-associated linker proteins from the cyanobacterium
            Fremyella diplosiphon
  JOURNAL   J. Bacteriol. 172, 4072-4081 (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy for [1] kindly submitted by N.A.Federspiel,
            24-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        454     1314     phycoerythrin-associated linker protein (cpeC)
    pept       1355     2104     phycoerythrin-associated linker protein (cpeD)
    mRNA        267  >  2371     cpeCD mRNA
    binding     440      444     cpeC ribosomal binding site
    binding    1339     1343     cpeD ribosomal binding site
BASE COUNT      679 a    529 c    514 g    649 t
ORIGIN      
        1 gatctgctca agcagtagat ttattaatca aagcaggggc gacggagtga agtggggatt
       61 ggggactggg gatgaggagg atgaggggga tgaggaggat gagggggatg aggggaaata
      121 accattaccc attccccatt ccccattacc caatttccca tgcccattac aaatagtttg
      181 tgcaaattga gtgcaaaatt ctctcatttc atcaagattt acaaatcttg atgtacatct
      241 ttccgcgctc ataagaaaat gatcataaca aagcgtataa aacgcctaaa accttgagca
      301 tctaggaaat cacagctttc atgcttatgg aagctggtaa tttgggttgc aacaaatcaa
      361 tccaaagaat tgatgagcct ggatgagcaa tctgaaattt atttacaaat cgaaacaaat
      421 ttcttaaact ttcgttaaca ggagaaacaa ttaatgccat ttggaccagc ttcacgcttg
      481 ggagtcagcc tatttgatga aactcctccc gttgagtggg taccaggtcg ctcacaagaa
      541 gaagcagaaa caatcattcg ggcaatctat cggcaagtat taggtaatgc ctatgtgatg
      601 gaaagtgagc ggcttgctgt gcctgaatcc cagtttaagc ggggtgagtt gagcgtccgc
      661 gagtttgtca gagcagtggc taaatctgaa ctatatcgtt ctcgcttttt caccagttgt
      721 gcgcgctacc gagccattga actcaacttc cgccatctat tgggtcgtcc accactagat
      781 ttagaagaaa tgcgctccca cagcacaatc cttgatactc aagggtttga agctgagatt
      841 gattcttata tcgatggtga tgagtatcag tctacttttg gcgagaacat tgtaccttac
      901 atccgaggct ataaaaccga agcgcttcag agcatggtgc aatttactca taccttccaa
      961 ctggtacgag gtgcttctag cagcagcctg aagggtgact tatctggcaa ggctcctaag
     1021 ctgaatgcat tagtgattca aagcacacca acagcagtaa tttcacctgc tagcgctgga
     1081 gcaaccttct cgacaccacc tactggtgcc cgtacccgtc ttggagtcga tgctagtgct
     1141 ggtggcaaag tttaccgcat tgaagttaca ggttatcgtg ccaaaacctt caataatatt
     1201 tccaagtttc gccgttccaa tcaagtcttt ctggtgccct acgaaaagct ctctcaagag
     1261 tatcaacgga ttcaccagca aggcggcgtg atcgcaagta tcactcctgt ataaattagg
     1321 tgcaaactta aaaaattgag gagcagaaat tttaatggca tcccagacaa ttcttgaact
     1381 ttggccctct agtagcttag aagaagttca aactattatc cgtgcagttt acaaacaggt
     1441 tttaggcaac cctcatgtta tggagagtga gcggttggtg acagcagaat cacaattatg
     1501 cgatcgctcc atcaccgtgc gggaatttgt ccgcagcgtt gccaagtctg atttttatcg
     1561 caaccgctac ttccaatcct gcgctcccta ccgatttgta gaacttaact tcttacattt
     1621 gcttggtcgc gcaccccagg atcaaagaga agtttccgaa cacatcgttc gtactgtagc
     1681 tgaaggctac gatgctgaaa ttgactccta tatcgatagt agtgaatatg aagcagcctt
     1741 tggtgaaaac gtagtgcctt actatcgtgg tagaagtagc gaagccaact ccaagcaagt
     1801 aggcttcaac cgcatatttg cccttgatcg cggccctgcc caaattgata gtgcagttaa
     1861 atcggctcaa ttggtctatg ctgttgctac taacagcgcc aacgcgatca aagcctcttc
     1921 atccacagtc attggctctg gaactgaaaa acgattcaaa atcttggtgc aaggttccaa
     1981 attcgacagt ccccgacgca tcagtaccac tgagtacatt gttccagcta gtaagatgac
     2041 tccccaaatt cagcggatta atcgtacttc tggcaaaatc gtcagcatta ctgaaattgt
     2101 ctaaccttta acagggtggg cattaatact tgtctgttct aaatgttgaa ccgattagta
     2161 ttaacaactc gtaattcgta gttactaatt cgtaattacg ttacggatag ggatttagct
     2221 cccacctgta acggcatgat tttatagaag tcggagactc aaaccctcag cttaattacg
     2281 aattacgaat tacgaattac gaattagtaa tttaggttga gacattgccc gccccagatc
     2341 attcactttt ttaatttgta attttcaaaa a
//
LOCUS       HIVPRBAC      582 bp ds-DNA             SYN       14-AUG-1990
DEFINITION  Synthetic HIV type-I protease (syn BAC) gene, complete cds.
ACCESSION   M36450
KEYWORDS    HIV type-1 protease; HIV-PR gene.
SOURCE      Synthetic oligonucleotide DNA, clone pET3AM.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 582)
  AUTHORS   Cheng,Y.-S.E., McGowan,M.H., Kettner,C.A., Schloss,J.V.,
            Erickson-Viitanen,S. and Yin,F.H.
  TITLE     High-level synthesis of recombinant HIV-1 protease and the recovery
            of active enzyme from inclusion bodies
  JOURNAL   Gene 87, 243-248 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          7      573     HIV type-I protease (BAC)
BASE COUNT      120 a    125 c    170 g    167 t
ORIGIN      
        1 gaattcatgg aagatctggc tttcctgcaa ggtaaggctc gtgagttctc ttctgaacag
       61 actcgtgcta actctccgac tcgtcgtgag ctgcaggttt ggggtcgtga taacaactct
      121 ccgtctgagg ctggcgccga tcgtcagggt accgtttctt ttaactttcc gcaggttact
      181 ctgtggcagc gtccactggt tactattaag attggtggtc agctgaagga agctctgctg
      241 gataccggtg ctgatgatac tgtactggag gagatgtctc tcccgggccg ttggaagccg
      301 aagatgatcg gtggtattgg tggtttcatt aaggtacgtc agtatgatca gattctgatt
      361 gagatctgtg gccataaggc gatcggtacc gttctggtgg gcccgactcc ggttaacatt
      421 atcggccgta acctgctgac tcagatcggt tgtactctga actttccgat ttctccgatt
      481 gagactgttc cggttaagct gaagccgggt atggatggtc cgaaggttaa gcagtggcca
      541 ctgactgagg agaagattaa ggctctggtt taatagaagc tt
//
LOCUS       HNNMTPIT     1336 bp ds-DNA             ORG       14-AUG-1990
DEFINITION  H.annuus mitochondrial plasmid pIT DNA.
ACCESSION   M36422
KEYWORDS    mitochondrial plasmid.
SOURCE      H.annuus (strain CANP3) mitochondrial plasmid pIT DNA, clone
            pITC20.
  ORGANISM  Mitochondrion Helianthus annuus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Asterales; Asteraceae; Compositae; Helianthus annuus.
REFERENCE   1  (bases 1 to 1336)
  AUTHORS   Perez,C., Dujon,B., Heizmann,P. and Berville,A.
  TITLE     Sequence of a mitochondrial plasmid of sunflower (Helianthus
            annuus) and its relationship to other mitochondrial plasmids
  JOURNAL   Plant Sci. 58, 59-69 (1988)
  STANDARD  simple staff_entry
BASE COUNT      382 a    314 c    290 g    350 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattctttt gttccctcaa caagcaacta atcacaagga aagaaaacaa aagacaaaga
       61 aaggcggttt tgagacataa aaaataaatg ctaaacttgc tttttaatct caaaaggtcg
      121 atcttgttgt tttgagaact caaatgcagc tatcattatc ggggtgctcc gccacccctc
      181 gtttccatca tccctcgact gcagtcgagg gtacgtctcc cttccctaaa aaaagatcga
      241 gctttttact tattacagaa gagagagagc accttacaga agagagacac atcgaccgtt
      301 ttttacttat tacagaagag agatagcacc ttacagagag atagcacatc gaccgttttt
      361 tacttattac agaagagaga gacctaaaaa ggttcaaatg agaaatatct taagtgatcg
      421 tggaaactga gctttgtaaa ggagtagaga gtgtaagacg gggttgcgag tctttgcgag
      481 ttgggtagtc acctctccaa cccattccgg acgttgcccc aagcccccta gcagcgctct
      541 cgaaactcta caatgcgcag acaagaaggc gattgagtta acggaagcgt ctcaaaaaca
      601 ttctttttga agaacaagaa gaagactcag cgctgggtaa gtggtttaag acttttcccg
      661 accctcggct ccttggggag tagtgccccc aagacacagc cacgccaatc cgacgcggct
      721 ccatacccct agtttagtat atagagctgt ttttcttttt ttctctgagg agcaaagaaa
      781 aaagtaaaaa atttttttat ttttggacgg ccagcccggt aggaaggtga gataaatctc
      841 cccctccccg gcctccccct ctgttaaggt cgagtcctta ctcaattcat cggatgaacg
      901 gactcggctg gacttgttag ctgcataaaa gcatagaaca tagcccggag aatgaagatc
      961 acccgaactt cattcccctg gctacaaccg gcagcaggag aagctcttct ctttgctggc
     1021 caagttatcg ccatggggcc cccccccctc ccatggcttg ctttgttgct gctcttttac
     1081 tgcctatagc tgacaagtcc gctctccccg acttaacacc gagaaaaaca aaaaacagct
     1141 ctatatacta ggggaggaga gcggagtgag tgcaggccct tcaacgagag cattgttttg
     1201 tggtccgggt tcggtattga attatgaaac tcctttctta ttgatgattt acggcctttt
     1261 aaaacagttg ttcagtctca aatggtaccc tatattgatc ttgaaagaaa tcattttgtc
     1321 tgtcaagatg gtcata
//
LOCUS       HSE1GBA      3958 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Equine herpesvirus type 1 glycoprotein B (gB) gene, complete cds.
ACCESSION   M36298
KEYWORDS    antigen; glycoprotein B; structural protein.
SOURCE      Equine herpesvirus type 1 (strain Ab1) DNA, from COS cells.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 3958)
  AUTHORS   Bonass,W.A., Elton,D.M., Stocks J,-M., Killington,R.A.,
            Meredith,D.M. and Halliburton,I.W.
  TITLE     Molecular analysis of the Equine herpesvirus type-1 strain Ab1,
            glycoprotein B gene and its expression in COS cells
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.A.Bonass, 10-JUL-1990.
               Leeds University
               Dept. of Microbiology
               Leeds LS2 9JT, UNITED KINGDOM
FEATURES       from  to/span     description
    pept        603     3545     glycoprotein B precursor (gB)
    sigp        603      860     glycoprotein B signal protein
    matp        603     3542     glycoprotein B
    mRNA        475  >  3958     glycoprotein B mRNA
    signal      370      373     CAT box
    signal      453      458     TATA box
    signal     3554     3560     polyA signal
    site       2165     2166     internal cleavage site
BASE COUNT     1019 a   1075 c   1010 g    854 t
ORIGIN      
        1 gttaacgtgt atggcgaagt gctgctacat acgtacgtat cgtcctacaa cgggttttgc
       61 gccaggcgcg ggttttgcgc ggcggtgagt cgagcgggta ccatcataga taaccgctct
      121 agcacgtccg cgttcgactc gcatcagttc atgaaggcgg cgctgcttcg ccaccccatt
      181 gaccagtcgc tcatgccgtc cataacacac aagtttttcg agctgatcaa cgggcccgtg
      241 tttgacaacg ctggccacaa ctttgcgcag ccgccaaaca cggcattata ttacagcgtt
      301 gaaaacgttg ggttgttacc gcatctcaag gaggaactag ctcggtttat gattactgcg
      361 gctaaaggtg attggtcaat tagcgagttt caaaggtttt attgctttga gggagtgaca
      421 ggtgtgacgg ccacgcagcg gctggcgtgg aaatatatcg gggagctcat cctagccgcc
      481 gcagtattct cctcggtttt ccactgtgga gaggtgcgcc tcctgcgcgc agatcgtacc
      541 tacccggact ccagcggcgc acagcgctgc gtgagcggca tttacataac ctacgaggcg
      601 tcatgtcctc tggttgccgt tctgtcggcg gctccacatg gggcaattgg cgcggagacg
      661 gtggtgattt acgacagcga cgtgttctct ctcctgtatg cagtgctcca gcagctggct
      721 cctggatcgg gagccaacta ggcaatgttg gaaacttact cgccaccccc cacccgctgg
      781 gaaagccggc atcatcgagg gtgggcacaa tagttctagc ctgtttgttg ctttttggaa
      841 gctgtgttgt tagagccgta cccaccacgc caagcccccc aactagtact cccacttcca
      901 tgtcaacgca ctcccatggg acagtagacc ctacgctgct ccccacagaa acgcccgacc
      961 cactcagact ggctgtgcgc gagtccggta tactcgctga ggatggagac ttttacacct
     1021 gcccaccgcc taccggatcc accgtcgtac gcatcgaacc acctagaact tgccccaagt
     1081 ttgaccttgg gagaaacttc acggagggga ttgctgttat ttttaaggaa aacatcgctc
     1141 cctacaaatt cagggcaaac gtatactaca aggacatcgt tgtaacacgt gtgtggaaag
     1201 gatacagcca tacgtccctg tccgacagat acaatgacag ggttccggtt tcggtggagg
     1261 agatcttcgg tctcatcgac agtaagggaa aatgttcgtc aaaggccgag tacctcagag
     1321 ataacatcat gcaccacgcg taccacgacg acgaggacga ggtggagctt gatttggtgc
     1381 cgtccaagtt tgcaactccg ggggccagag cctggcagac caccaacgat actacgtctt
     1441 acgtggggtg gatgccatgg aggcactaca cgtcaacgtc tgtcaactgc atcgtcgagg
     1501 aggtggaggc gcggtccgtc tacccctacg actccttcgc cctgtccacc ggtgatattg
     1561 tgtacgcgtc tccgttttac ggcctgaggg ctgccgctcg catagagcac aatagctacg
     1621 cgcaggacag tttcaggcaa gttgaagggt acaggccccg cgacttagac agtaaactac
     1681 aagccgaaga gccggttacc aaaaatttta tcactacccc gcatgtcacc gtcagctgga
     1741 actggaccga gaagaaagtc gaggcgtgta cgctgaccaa atggaaagag gtcgacgaac
     1801 tcgtcaggga cgagttccgc gggtcctaca gatttactat tcgatccatc tcgtctacgt
     1861 ttatcagtaa cactactcaa tttaagttgg aaagtgcccc ccttactgaa tgtgtatcca
     1921 aagaagcaaa ggaagccata gactcgatat acaaaaagca gtacgagtct acgcacgtct
     1981 ttagcggtga tgtggaatat tacctggcac gcggggggtt cttaattgca ttcagaccta
     2041 tgctctccaa cgaactcgcc aggctgtacc tgaacgagct tgtgagatct aaccgcacct
     2101 acgacctaaa aaatctattg aaccccaatg caaacaataa caataacacc acgcgaagac
     2161 gcaggtctct cctgtcagta ccagaacctc agccaaccca agatggtgtg catagagaac
     2221 aaattctaca tcgcttgcac aaacgagcag tggaggcaac ggcaggtacc gattcttcca
     2281 acgtcaccgc caaacagctg gagctcatca aaaccacgtc gtctatcgag tttgccatgc
     2341 tacagtttgc atacgatcac atccaatccc acgtcaatga aatgctaagt agaatagcaa
     2401 ctgcgtggtg taccctccaa aacaaagagc ggaccctatg gaacgaaatg gtgaagatta
     2461 acccgagcgc catagtctcc gcaacccttg acgagcgagt tgcagcgagg gtcctggggg
     2521 acgtgatagc tataacgcac tgcgccaaaa tagagggcaa cgtgtacttg caaaactcca
     2581 tgcgctcgat ggacagtaac acgtgctact cccgcccccc cgtaacattt acaattacta
     2641 agaatgcaaa caacagaggg tcgatagaag gccagctggg agaggagaac gagattttca
     2701 cggagcgcaa gctgatcgag ccgtgcgccc tcaatcagaa gcgctacttt aagtttggca
     2761 aagagtacgt ttactacgag aactacacgt tcgtccgcaa agtgcccccc acggaaatcg
     2821 aggttatcag cacgtacgtt gaactaaact tgaccctttt ggaagaccgc gagtttctgc
     2881 ccctggaggt gtacacgcgg gctgagctgg aggacaccgg cctgctagac tacagcgaaa
     2941 tacagcgccg caaccagctc cacgctctca ggttttacga catcgacagc gtggtcaacg
     3001 tggacaatac cgcagtgatt atgcagggga tcgccagctt tttcaagggc ctgggtaaag
     3061 tgggggaggc cgtgggaacg ctcgttctcg cggccgccgg cgctgttgtt tcaaccgtat
     3121 ctggaatagc ttcgttttta aacaacccat ttggggggct agccatcggc ctgctggtaa
     3181 tcgccggcct ggtagctgcg ttttttgctt acagatatgt aatgcagatc cgcagtaacc
     3241 ccatgaaagc tctatacccc ataacaacaa aggccttgaa aaacaaagcc aaaacttcct
     3301 acggccagaa cgaggaggac gatgggagcg actttgatga ggccaagctt gaagaggctc
     3361 gcgaaatgat caaatacatg tctatggttt cggccctgga aaagcaggaa aagaaagcta
     3421 taaagaaaaa cagtggggtt ggcctgatcg ccagtaacgt ctcaaagctg gccctgcgaa
     3481 ggcgcggtcc caaatatacc cgactccaac agaacgatac catggaaaat gaaaaaatgg
     3541 tttaaacatg tttaataaat attatgacac gtactcaaag tgtgacctca tatttgcata
     3601 accactttct agttccggcc ccaaggatat ttaagcctag tatctccgcc gaggtttcat
     3661 cctcattcac caactcacac ttagagttga cgcttcctct tgcgcctttg ctctcgccgc
     3721 tcctgtgtta gcgtatactg cccaagaaat ggattctcca cgcggtatct ccacagctac
     3781 cggtgatgcc cacgccgagg ccgcggtttc cccagccgcg gaaatccaga taaaaacgga
     3841 agcccccgat gtagacggac cagaagccac tactgagtgt ttagaccaca cctacaccca
     3901 acagacaagc gggggtgatg gcctagatgc tatcgatacg gacgatctct ggagatgg
//
LOCUS       HSE1GPEIA    4707 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Equine herpes virus type 1 glycoproteins gE, gI and 10 kDa protein
            genes, complete cds.
ACCESSION   M36299
KEYWORDS    glycoprotein gE; glycoprotein gI.
SOURCE      Equine herpesvirus type 1 (strain Ab1) DNA, clone pUC19.
  ORGANISM  Equine herpesvirus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 4707)
  AUTHORS   Elton,D.M., Bonass,W.A., Killington,R.A., Meredith,D.M. and
            Halliburton,I.W.
  TITLE     DNA sequence analysis of the 4.7Kb BamHI to EcoRI fragment of the
            Equine herpesvirus type-1 short unique region
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.M.Elton, 10-JUL-1990.
               University of Leeds
               Dept of Microbiology
               Leeds LS2 9JT, UNITED KINGDOM
FEATURES       from  to/span     description
    pept        594     1868     glycoprotein I precursor (gI)
    sigp        594      701     glycoprotein I precursor (gI)
    matp        702     1865     glycoprotein I precursor (gI)
    pept       2101     3753     glycoprotein E precursor (gE)
    sigp       2101     2208     glycoprotein E precursor (gE)
    matp       2209     3750     glycoprotein E precursor (gE)
    pept       4478  >  4707     10 kDa protein
    ORF     <     1      484     ORF1
    ORF        3750     4140     ORF4
    signal      544      550     gI TATA box
    signal     1924     1929     gI polyA signal
    signal     1955     1958     gE CAT box
    signal     4139     4144     gE polyA signal
BASE COUNT     1204 a   1291 c   1107 g   1105 t
ORIGIN      
        1 gaattcaact tcccacaagg agagcatatg acatggttga agttctggtt cgtctacgat
       61 ggtggaaacc taccagtgca gttttatgaa gcccaggcat tcgcaagacc cgtgcctccg
      121 gataaccacc ctggatttga ttctgttgag tcggagatta cacaaaataa aacagacccg
      181 aaaccaggcc aggcggaccc caaacccaat cagcctttta agtggcccag catcaaacac
      241 ttggccccaa gactcgatga ggtggatgag gtcatagagc ccgtaacaaa gcccccaaaa
      301 acgtctaaga gcaactctac gtttgtgggc atcagcgtcg gtttgggtat cgccggccta
      361 gtattggtgg gcgtcattct atacgtctgc ttgcgtcgga agaaggaact gaaaaagtct
      421 gcacagaacg gcttgactcg cctacgctcg acctttaagg atgttaaata tacccagctt
      481 ccgtaaacag tgttgcgtaa cctgctggga ggtgtccacg gccttaaagc ttcgcggttt
      541 ggagatataa cgcacaacct acaacaaacg cgacacagca agtagtagtc gctatggcca
      601 aactcactgg gatgttcagc gctgcgatat tactgtctat ggctatatgc tcaaccgcaa
      661 tcatatatcg cggagaacat atgagcatgt acctaaacgc cagctctgag tttgccgtgt
      721 accccactga tcagtccctt gttttggttg gccacttgct ctttctcgac ggacaacgct
      781 tacccaccac caactatagt gggctgatcg aattgattca ttacaactac tccagcgttt
      841 gctacactgt tatccaaacg atatcgtatg aatcatgccc gcgtgtagcc aacaatgctt
      901 tcagatcgtg cctccacaaa acttctaagc actaccacga ctatttccga gtcaatgcct
      961 ctgttgaaac caacgttctc ttaaacatca caaagccaca gcctacagat tccggggcgt
     1021 atatccttcg cgtaaaactt gaccacgcgc caaccgcaga tgtttttgga gtttccgcct
     1081 ttgtttacga tctaaaatct aaaacggtcc ccgatccaat gcccaccaca caaacggtag
     1141 aacctacaac gagctatgtg tcgactccca catacgacta taccgatgac gtaaccaccg
     1201 aaactgaatc cacatcaaca tctacccaac aggcgatgac ctccactcaa acccctagcg
     1261 ctacatgggg aacccagcta accacagagc tgccgacaaa cgaaactgtg gttattggtc
     1321 aggaggccct gttatgccat tggttccagc catcgacaag ggtgccgacc ctgtatctgc
     1381 atctgttggg acgcactggc aatctcccgg aagatgttct actggtcgaa gactctgagt
     1441 ttcttcgtac cacatcgcct gcacataggc cttctgcatc acccgctgac ggtgatgatt
     1501 ttaaacagac aaactcaact tcccttaagg cgcgcaacaa gatcgtcgca atggtggtta
     1561 tcccgaccgc gtgtgtacta atgctcctgt tggtggttgt cggtgccatc ataaacggtg
     1621 ccgtgcgcaa acatttattg agttgcgcaa gccgcaggat ctaccgctcc ggacaggggg
     1681 gcgcatcggc ggccgaacgg agacggctga cttgcggtcc tactttagcc gcgtcatcgg
     1741 agtcgctggc cgacgataca acgtcatcac ctccaacccc caaaccttcg aagaaaacca
     1801 agttggagac cgatccgctt atggaacagc tgaaccggaa actggaggcc atcaaagaag
     1861 aatcatagtt gtgggggtag atggggttgg tattaaagtt tgtgtattat cgattttata
     1921 tttattaaaa tttgtgaaac ataaacatct tgtgcaatgt ttacattatt tgtgattggg
     1981 acggtccact gggaggtggt acaactcggg tttaaagctc tggatgtttg gtaggaaact
     2041 cacagttctc cactttggcg tcaaagcaat cagacgtcta attcgaagta gaacgtcaca
     2101 atggagctgt tggccgcaag tcgcgcttgt atattttttg ggctagtaac agtactcgat
     2161 gcgtggggag tccaacaagt tgaactttcc gagggggctt gggctatgat cgacggaagg
     2221 gacgttttaa cccctactaa cacaactact cgggtcacaa aggcctggac gtttttggaa
     2281 acccctcccg gttgcgctgg cgacatatca gttaagaagg tgtgcgtgag ccatagtctg
     2341 tgcgaagata acattataat aggaaagcac tgtaacctct taactgggga acatggcatt
     2401 gcgttggccg agtttaacgt agtaaacgga tcgctgcgca gaacagacga tgtgtacttt
     2461 gtgaatggta cagtctttcc aatccttgcc gaaacccgca gcgtcctaca aatccatagg
     2521 gcaaccccct ctatcgcagg ggtttacacc ctccacgttt ccatcgacgg aatgatgaaa
     2581 cactccgtcg tgctgctcac cgtcaagaag ccgcccaaac aaccgcaacc acgcttgcgc
     2641 gttaagaccc cgccacccgt aaccgttcct caggttcccg taaagaccca cacggatttt
     2701 gtggtgcacg gataccactc gcgcgtgtac gctgatggcg aatctttcga gctgtcggtg
     2761 aacctggagt cacatatcgt agagcccagc ttcagcgcgg agattcagtg gtactatatg
     2821 aatacatcat cgtcatcatg cgatctattt cgagttttcg aaacctgcat ctttcacccg
     2881 acagccatgg cctgcctgca cccggaacaa cacacctgca gcttcacatc ccccatcaga
     2941 gcgaccaaga tcctacaccg ggtgtatgga aactgcagcg atcatggaaa ttcgtggcct
     3001 tctaggtgcc atagcactct gctgggcaat cgtctatact ttattcaacc agcacagaac
     3061 agagtggacc tgttgttcaa agacactccc gcgtcggcta ccgggctgta tgtgtttgta
     3121 ttattgtaca acggacatcc ggaggcgtgg acgtatacgc tgctgtcaac cgcaaatcac
     3181 tttatgaatg tgcttactga cgtgacccgc ccacggctag gagagcactt ttatacggac
     3241 ctcgggcaca aaatcatcac tcctcatcca tctgtagcta ccactgaaga gttgggagct
     3301 tggactcgac actacctcgc ctttttgctg gttattatct gcacgtgcgc ggcgctgcta
     3361 gttgcattgg tggtgtgggg ctgtattctc tacatccgaa gcaaccgtaa gccgtatgaa
     3421 gtgctgaacc cctttgaaac ggtttacacg agcgttccaa gcaacgaccc ctcggacgag
     3481 gtcttggtgt ttgagcgcct agcttcggac tctgacgact ccttcgactc tgattcagac
     3541 gaagagttgg aatacccacc acctcccaaa ccagctccac agctcccacc ataccagttt
     3601 gtagacgggg gagacgcccc tagcggcagg tccggattca aggtttggtt ccgcgataca
     3661 cccgaggcgt ccccggttcc tcttcataaa ccaacgctac agggtccaga ctacagccgg
     3721 gtagcgtcga agctaaagtc gatactaaaa tgagcagcaa cagcgataac acagagtgct
     3781 tcgggggagt caactatgcc gagggaatgc gcaacggtaa acgcaaccct gtcagaaaca
     3841 gcacctttca agagtatctc gacgcgcgta acgcgcgtta tcccagatcc ggctcaacct
     3901 ccgattccga cgaggactac acaaccagat caaagtacga gtcagatgtc agcgagttta
     3961 aaaaaatgat ggatctggaa actctacctc ccccaaaggc tgagccgcaa gctcagaagg
     4021 ccgagcctga tgctgcgaag gaggagccag tcagcaccac tagctacatc ttaaacgaat
     4081 gggtggctcc tatgattggg cattttctgg caatgtgtat gtatgagttg cttttcaaat
     4141 aaaaacaaac attaacccct gtaaacatcc gtttgtctac tgtgtatgat agagttaaac
     4201 ccaaccctag agagttatgt atttaatgcc ctgggacccc gcggaagtca tatatccctc
     4261 ggccccctca tttgggcgca cattgcctgc ccggcggcag tcttactccc ttagctcgcc
     4321 ctcttgcata agataaacta ttcccctccc agctagtttc acccaccaga ttaagcgagg
     4381 ttttccctct cagcgatcac ttttcaccac cgaagaacag gccctcatcg gtttccctcc
     4441 gtgttttccc atccatctat ccaaccacta cattttcatg gagaaggcgg aggctgccgc
     4501 agttgttata cccctgtcag tttccaaccc cagctaccgt ggaagcggta tgtccgacca
     4561 agaagtaagc gaagaacaat ctgctggaga tgcctgggtg tctgcagcaa tggcagccgc
     4621 agaggcggtg gctgctgccg ctacctccac cggaattgat aacactaacg actacacgta
     4681 caccgctgct tctgagaatg gggatcc
//
LOCUS       HUMADRA2RA   2072 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human alpha-2-adrenergic receptor (alpha-2 c2) gene, complete cds.
ACCESSION   M34041
KEYWORDS    G-protein coupled receptor protein; alpha-2-adrenergic receptor;
            plasma membrane protein.
SOURCE      Human placenta DNA, clone alpha-2 C2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2072)
  AUTHORS   Lomasney,J.W., Lorenz,W., Allen,L.F., King,K., Regan,J.W.,
            Yang-Feng,T.L., Caron,M.G. and Lefkowitz,R.J.
  TITLE     Expansion of the alpha-2-adrenergic receptor family:
            Characterization of a human alpha-2-adrenergic receptor subtype,
            the gene for which is located on chromosome 2
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5094-5098 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.W.Lomasney, 03-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept        413     1765     alpha-2-adrenergic receptor (alpha-2 C2)
                                 /hgml_locus_uid="LY0100G" /nomgen="ADRA2RL1"
                                 /map="2"
BASE COUNT      316 a    705 c    660 g    391 t
ORIGIN      Chromosome 2.
        1 ctgcaggcgc ggctggggcg aaacgtgcga gctgagcggg cgcaaggtcc tccgcgcctc
       61 ctttaagaac cggcccagcc cggcccgcgc ccccagagcg tacggcatcc gcgtggcgga
      121 gggcgcgact ttctccggtc ccgggcggga cggggacggc ggcgggacaa cttgggaaac
      181 ttctctgggg cggacggcag ggaccccggg caccgtggag gaggatgtag gagggcggct
      241 gctggtcctg ggtgttcccg acctcctagg ccccgctcgt ccaggccatg gggctccagc
      301 gccctcgcgg cgccggaggg gcgacgctct tgtctagccg agccgggcag cgctgtcgtc
      361 cacggtgcgc actgggcggg cagcgctccc tctgcccacc tcccgccccg tcatggacca
      421 ccaggacccc tactccgtgc aggccacagc ggccatagcg gcggccatca ccttcctcat
      481 tctctttacc atcttcggca acgctctggt catcctggct gtgttgacca gccgctcgct
      541 gcgcgcccct cagaacctgt tcctggtgtc gctggccgcc gccgacatcc tggtggccac
      601 gctcatcatc cctttctcgc tggccaacga gctgctgggc tactggtact tccggcgcac
      661 gtggtgcgag gtgtacctgg cgctcgacgt gctcttctgc acctcgtcca tcgtgcacct
      721 gtgcgccatc agcctggacc gctactgggc cgtgagccgc gcgctggagt acaactccaa
      781 gcgcaccccg cgccgcatca agtgcatcat cctcactgtg tggctcatcg ccgccgtcat
      841 ctcgctgccg cccctcatct acaagggcga ccagggcccc cagccgcgcg ggcgccccca
      901 gtgcaagctc aaccaggagg cctggtacat cctggcctcc agcatcggat ctttctttgc
      961 tccttgcctc atcatgatcc ttgtctacct gcgcatctac ctgatcgcca aacgcagcaa
     1021 ccgcagaggt cccagggcca agggggggcc tgggcagggt gagtccaagc agccccgacc
     1081 cgaccatggt ggggctttgg cctcagccaa actgccagcc ctggcctctg tggcttctgc
     1141 cagagaggtc aacggacact cgaagtccac tggggagaag gaggaggggg agacccctga
     1201 agatactggg acccgggcct tgccacccag ttgggctgcc cttcccaact caggccaggg
     1261 ccagaaggag ggtgtttgtg gggcatctcc agaggatgaa gctgaagagg aggaagagga
     1321 ggaggaggag gaggaagagt gtgaacccca ggcagtgcca gtgtctccgg cctcagcttg
     1381 cagccccccg ctgcagcagc cacagggctc ccgggtgctg gccaccctac gtggccaggt
     1441 gctcctgggc aggggcgtgg gtgctatagg tgggcagtgg tggcgtcgaa gggcgcacgt
     1501 gacccgggag aagcgcttca ccttcgtgct ggctgtggtc attggcgttt ttgtgctctg
     1561 ctggttcccc ttcttcttca gctacagcct gggcgccatc tgcccgaagc actgcaaggt
     1621 gccccatggc ctcttccagt tcttcttctg gatcggctac tgcaacagct cactgaaccc
     1681 tgttatctac accatcttca accaggactt ccgccgtgcc ttccggagga tcctgtgccg
     1741 cccgtggacc cagacggcct ggtgagcccg cctgcgctgc ccctgtgggt tggtgcggtg
     1801 gcgccggggt caccctgctt cttgccctgc tgtgtgtggc tgcctcccct gggctttctg
     1861 ctccctgccc agatcctgta ggcctcatct taggaacccc ttgggagggg tgggcagggg
     1921 ggctgctagc aagggtccca gtgaagcttc cccttgccgg cttagctgtg ggggacccct
     1981 tctccaccct ctccctgagc acaggccgat ggaggtggtt caaatcctct ggaacatagc
     2041 caagaccagg agaagagaga gcactttctt cc
//
LOCUS       HUMARF1BA    1816 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human ADP-ribosylation factor (ARF1) mRNA, complete cds.
ACCESSION   M36340 M25203
KEYWORDS    ADP-ribosylation factor.
SOURCE      Human foreskin fibroblast, cDNA to mRNA, clone pCD2.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 29 to 866)
  AUTHORS   Bobak,D.A., Nightingale,M.S., Murtagh,J.J., Price,S.R., Moss,J. and
            Vaughan,M.
  TITLE     Molecular cloning, characterization, and expression of human
            ADP-ribosylation factors: Two guanine nucleotide-dependent
            activators of cholera toxin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 6101-6105 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1816)
  AUTHORS   Kahn,R.A., Kern,F.G., Clark,J., Gelmann,E.P. and Rulka,C.
  TITLE     Human ADP-ribosylation factors (ARF's): A functionally conserved
            family of GTP-binding proteins
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer copy of sequence [1] kindly submitted by
            D.Bobak, 02-JUN-1989.
            
            Draft entry and computer-readable sequence for [2] kindly submitted
            by R.A.Kahn, 10-JUL-1990.
               National Cancer Institute
               Lab of Biol. Chem.
               DCT Bldg. 37, RM 5D-02
               Bethesda, MD 20892
FEATURES       from  to/span     description
    pept         78      623     ADP-ribosylation factor (ARF1)
BASE COUNT      405 a    512 c    469 g    430 t
ORIGIN      
        1 ggggaaaacc aacgcctggc tcggagcagc agcctctgag gtgtccctgg ccagtgtcct
       61 tccacctgtc cacaagcatg gggaacatct tcgccaacct cttcaagggc ctttttggca
      121 aaaaagaaat gcgcatcctc atggtgggcc tggatgctgc agggaagacc acgatcctct
      181 acaagcttaa gctgggtgag atcgtgacca ccattcccac cataggcttc aacgtggaaa
      241 ccgtggagta caagaacatc agcttcactg tgtgggacgt gggtggccag gacaagatcc
      301 ggcccctgtg gcgccactac ttccagaaca cacaaggcct gatcttcgtg gtggacagca
      361 atgacagaga gcgtgtgaac gaggcccgtg aggagctcat gaggatgctg gccgaggacg
      421 agctccggga tgctgtcctc ctggtgttcg ccaacaagca ggacctcccc aacgccatga
      481 atgcggccga gatcacagac aagctggggc tgcactcact acgccacagg aactggtaca
      541 ttcaggccac ctgcgccacc agcggcgacg ggctctatga aggactggac tggctgtcca
      601 atcagctccg gaaccagaag tgaacgcgac ccccctccct ctcactcctc ttgccctctg
      661 ctttactctc atgtggcaaa cgtgcggctc gtggtgtgag tgccagaagc tgcctccgtg
      721 gtttggtcac cgtgtgcatc gcaccgtgct gtaaatgtgg cagacgcagc ctgcggccag
      781 gctttttatt taatgtaaat agtttttgtt tccaatgagg cagtttctgg tactcctatg
      841 caatattact cagctttttt tattgtaaaa agaaaaatca actcactgtt cagtgctgag
      901 aggggatgta ggcccatggg cacctggcct ccaggagtcg ctgtgttggg agagccggcc
      961 acgcccttgg cttagagctg tgttgaaatc cattttggtg gttggtttta acccaaactc
     1021 agtgcatttt ttaaaatagt taagaatcca agtcgagaac acttgaacac acagaaggga
     1081 gaccccgcct agcatagatt tgcagttacg gcctggatgc cagtcgccag cccagctgtt
     1141 cccctcggga acatgaggtg gtggtggcgc agcagactgc gatcaattct gcatggtcac
     1201 agtagagatc cccgcaactc gcttgtcctt gggtcaccct gcattccata gccatgtgct
     1261 tgtccctgtg ctcccacggt tcccaggggc caggctggga gcccacagcc accccactat
     1321 gccgcaggcc gccctaccca ccttcaggca gcctatggga cgcaggcccc atctgtccct
     1381 cggtccgcgt gtggccagag tggtccgtcg tccccaacac tcgtgctcgc tcagacactt
     1441 tggcaggatg tctggggcct caccagcagg agcgcgtgca agccgggcag gcggtccacc
     1501 tagacccaca gcccctcggg agcaccccac ctctgtgtgt gatgtagctt tctctccctc
     1561 agcctgcaag ggtccgattt gccatcgaaa aagacaacct ctactttttt cttttgtatt
     1621 ttgataaaca ctgaagctgg agctgttaaa tttatcttgg ggaaacctca gaactggtct
     1681 atttggtgtc gtaggaacct cttactgctt tcaatacacg attagtaatc aactgttttg
     1741 tatacttgtt ttcagttttc atttcgacaa acaagcactg taattatagc tattagaata
     1801 aaatctctta actatt
//
LOCUS       HUMARF2A     1529 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human ADP-ribosylation factor 2 (ARF2) mRNA, complete cds.
ACCESSION   M36341 M31890
KEYWORDS    ADP-ribosylation factor 2.
SOURCE      Human breast cancer cell line MDA-MB 231, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 766)
  AUTHORS   Monaco,L., Murtagh,J.J., Newman,K.B., Tsai,S.-C., Moss,J. and
            Vaughan,V.
  TITLE     Selective amplification of an mRNA and related pseudogene for a
            human ADP-ribosylation factor, a guanine nucleotide-dependent
            protein activator of cholera toxin
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 66 to 1529)
  AUTHORS   Kahn,R.A., Kern,F.G., Clark,J., Gelmann,E.P. and Rulka,C.
  TITLE     Human ADP-ribosylation factors (ARF's): A functionally conserved
            family of GTP-binding proteins
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            L.Monaco, 02-FEB-1990.
            
            Draft entry and computer-readable sequence for [2] kindly submitted
            by R.A.Kahn, 10-JUL-1990.
               National Cancer Institute
               Lab of Biol. Chem.
               DCT Bldg. 37, RM 5D-02
               Bethesda, MD 20892
FEATURES       from  to/span     description
    pept        130      672     ADP-ribosylation factor 2 (ARF2)
BASE COUNT      427 a    294 c    295 g    513 t
ORIGIN      
        1 ctgcctccct ctttcttcct ccgctctttc tcttccctct cgtttagttt gcctggagct
       61 tgaaaggaga aagcacgggg tcgccccaaa ccccttctgc ttctgcccat cacaagtgcc
      121 actaccgcca tgggcctcac tatctcctcc ctcttctccc gactatttgg caagaagcag
      181 atgcgcattt tgatggttgg attggatgct gctggcaaga caaccattct gtataaactg
      241 aagttagggg agatagtcac caccattcct accattggtt ttaatgtgga aacagtagaa
      301 tataagaaca tttgtttcac agtatgggat gttggtggtc aagatagaat taggcctctc
      361 tggaagcatt acttccagaa tacccagggt cttatttttg tggtagatag caacgatcgt
      421 gaaagaattc aggaagtagc agatgagctg cagaaaatgc ttctggtaga tgaattgaga
      481 gatgcagtgc tgctactttt tgcaaacaaa caggatttgc caaatgctat ggccatcagt
      541 gaaatgacag ataaactagg gcttcagtct cttcgtaaca gaacatggta tgttcaagcc
      601 acttgtgcaa cacaaggaac tggtctgtat gaaggacttg actggctgtc aaatgagctt
      661 tcaaaacgtt aaatgaaatt ggatatctaa ccaaggacat gtttgataaa attggtctag
      721 gcttgttaca acaaaattag tttgtatctt ggttattaaa cagtatctgg gactggtttg
      781 ggcagaatat taaacttatt ttgttgccaa ttattgttta ccgagtataa tgttgctatt
      841 tagcaatgtg cttggtttta aagaaattct ccttgggaaa aaagtatcct cttttaattt
      901 tacttcccat aagcgtaaat gcctggacat agctcttgtg aacctttaaa taaattgttt
      961 gagtgttttt gagccccaga caaataatgt tttaaagtta tcccttgcta ctttactgat
     1021 acctttatca ttcctgagac agtttgctaa tttaaaaatg tagcattcca tttgtattta
     1081 tttctctccc ttgccaaaaa gattttctaa tactgcttgt accagccaga gaaagatcca
     1141 aaacactact cagctctctt gcactgagga aatttttccc cctacattga ctcctggcct
     1201 acatcagcca aacttaacct tggtggggtt tggatttgat agccaattag ttctgtgctg
     1261 gttgcaaaga attgatattt agatggtttt taatactcag cagattgtct tcccatattg
     1321 tgtctttttt atgttgcatg ttgcttttgt tatcagcctg attttttgct cagtatatga
     1381 tagttctgct gatgttttgt ttattgggca gacatatctt cattaagagt ttttggaaaa
     1441 ctcatcaaat tcgatgaata cattttcttc ataacccatt tggaattatt cctaataaaa
     1501 tgataaaata cgtaaaaaaa aaggaattc
//
LOCUS       HUMCDC25HS   2055 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human cdc25Hs mRNA, complete cds.
ACCESSION   M34065
KEYWORDS    mitotic inducer.
SOURCE      Human HeLa cell line D98/AH-2, cDNA to mRNA, clone BSK1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2055)
  AUTHORS   Sadhu,K., Reed,B.I., Richardson,H. and Russell,P.
  TITLE     Human homolog of fission yeast cdc25 mitotic inducer is
            predominantly expressed in G-2
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5139-5143 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Russell, 08-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept        211     1632     CDC25Hs ORF
    mRNA    <     1     2055     cdc25Hs mRNA
BASE COUNT      576 a    472 c    477 g    530 t
ORIGIN      
        1 caggaagact ctgagtccga cgttggccta cccagtcgga aggcagagct gcaatctagt
       61 taactacctc ctttccccta gatttccttt cattctgctc aagtcttcgc ctgtgtccga
      121 tccctatcta ctttctctcc tcttgtagca agcctcagac tccaggcttg agctaggttt
      181 tgtttttctc ctggtgagaa ttcgaagacc atgtctacgg aactcttctc atccacaaga
      241 gaggaaggaa gctctggctc aggacccagt tttaggtcta atcaaaggaa aatgttaaac
      301 ctgctcctgg agagagacac ttcctttacc gtctgtccag atgtccctag aactccagtg
      361 ggcaaatttc ttggtgattc tgcaaaccta agcattttgt ctggaggaac cccaaaatgt
      421 tgcctcgatc tttcgaatct tagcagtggg gagataactg ccactcagct taccacttct
      481 gcagaccttg atgaaactgg tcacctggat tcttcaggac ttcaggaagt gcatttagct
      541 gggatgaatc atgaccagca cctaatgaaa tgtagcccag cacagcttct ttgtagcact
      601 ccgaatggtt tggaccgtgg ccatagaaag agagatgcaa tgtgtagttc atctgcaaat
      661 aaagaaaatg acaatggaaa cttggtggac agtgaaatga aatatttggg cagtcccatt
      721 actactgttc caaaattgga taaaaatcca aacctaggag aagaccaggc agaagagatt
      781 tcagatgaat taatggagtt ttccctgaaa gatcaagaag caaaggtgag cagaagtggc
      841 ctatatcgct ccccgtcgat gccagagaac ttgaacaggc caagactgaa gcaggtggaa
      901 aaattcaagg acaacacaat accagataaa gttaaaaaaa agtatttttc tggccaagga
      961 aagctcagga agggcttatg tttaaagaag acagtctctc tgtgtgacat tactatcact
     1021 cagatgctgg aggaagattc taaccagggg cacctgattg gtgatttttc caaggtatgt
     1081 gcgctgccaa ccgtgtcagg gaaacaccaa gatctgaagt atgtcaaccc agaaacagtg
     1141 gctgccttac tgtcggggaa gttccagggt ctgattgaga agttttatgt cattgattgt
     1201 cgctatccat atgagtatct gggaggacac atccagggag ccttaaactt atatagtcag
     1261 gaagaactgt ttaacttctt tctgaagaag cccatcgtcc ctttggacac ccagaagaga
     1321 ataatcatcg tgttccactg tgaattctcc tcagagaggg gcccccgaat gtgccgctgt
     1381 ctgcgtgaag aggacaggtc tctgaaccag tatcctgcat tgtactaccc agagctatat
     1441 atccttaaag gcggctacag agacttcttt ccagaatata tggaactgtg tgaaccacag
     1501 agctactgcc ctatgcatca tcaggaccac aagactgagt tgctgaggtg tcgaagccag
     1561 agcaaagtgc aggaagggga gcggcagctg cgggagcaga ttgcccttct ggtgaaggac
     1621 atgagcccat gataacattc cagccactgg ctgctaacaa gtcaccaaaa agacactgca
     1681 gaaaccctga gcagaaagag gccttctgga tggccaaacc caagattatt aaaagatgtc
     1741 tctgcaaacc aacaggctac caacttgtat ccaggcctgg gaatggatta ggtttcagca
     1801 gagctgaaag ctggtggcag agtcctggag ctggctctat aaggcagcct tgagttgcat
     1861 agagatttgt attggttcag ggaactctgg cattcctttt cccaactcct catgtcttct
     1921 cacaagccag ccaactcttt ctctctgggc ttcgggctat gcaagagcgt tgtctacctt
     1981 ctttctttgt attttccttc tttgtttccc cctctttctt ttttaaaaat ggaaaaataa
     2041 acactacaga atgag
//
LOCUS       HUMDBTG      1401 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human delta-beta-thalassemia gene region DNA.
ACCESSION   M36375
KEYWORDS    delta-beta-thalassemia.
SOURCE      Human 30 yr old japanese female blood cell DNA, clone Jp-thal.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1401)
  AUTHORS   Shiolawa,S., Yamada,H., Takihara,Y., Matsunaga,E., Ohba,Y.,
            Yamamoto,K. and Fukumaki,Y.
  TITLE     Molecular analysis of japanese delta-beta-thalassemia
  JOURNAL   Blood 72, 1771-1776 (1988)
  STANDARD  simple staff_entry
BASE COUNT      390 a    341 c    237 g    433 t
ORIGIN      
        1 gagctcttta gtttaattag attccattgg tcaatttttg cttttgctgc aattgctttt
       61 cacgctttca tcatgaaatc tgtgcccgtg tttatatcat gaatagtatt gccttgattt
      121 ttttctaggc tttttatagt ttggggtttt tcatttaagt ctctaatagc tcatagaatc
      181 tttcttcaac tcctgcaatc aaagggaaac gtaataggat tcttgtaaca gagggaactc
      241 tattagcaaa catgggtcca agaacagctc cccccaccac caccaccacc acccgccccc
      301 accatccctg ctcctggagc tggagtgcaa tggcacaatc acctactcct cccaggttca
      361 agcaattccc ttgcctcagc ctcccgagta gctaggacta caggtacatg ccaccacacc
      421 cagctaattt tttgtatttt agtagagatg gggtttcacc atgttggcca ggatggtctc
      481 gatctcctga cctcgtgatc tgcccacttc agcctcccaa agtgctggga ttacaggcgt
      541 gagccactgc tcccggccaa taatagcttt ttaaaaaaaa ttctgtgatt cttaaaccag
      601 tcttgagttg tcctgcgcta aaactaatga gcaaatatta aactcttctt aaacagttct
      661 ctttaagtgt ctctttctcc cccactgaga attcttccat gacagagcct ataacttaaa
      721 catccctatt accagactac caggaacatc taaatagaca gccctcacaa caatgcttca
      781 cattcattac tatcacaaca ccattattgg gtattaataa ctccccttta tgggtgagaa
      841 agcagggtaa gattagataa tcgagtctca gaggaaacat acgacgttct caaattcacc
      901 caacaagcca atggcagatt caggaattga gctctgttct tactccaaac gccatgctct
      961 cgcaaactcc ttatttaaat ttttttttct ttttttactg tggcatgaac aaacacatgc
     1021 tgatgactgt agtctatatt tggaataact attttgaacg ctacaacgat atttttgagt
     1081 tgacataacg tatgaaaata cactactgag ttcatgctca catttactct gaatacttcc
     1141 gcgtttgttt catatactat ctgaaggtta tacagtacct aatttattta taactgtctt
     1201 caatgcacta caacactgac cttgtgcaca gaattcttgg gtctggaagg tgaggttatt
     1261 tgatcctgcc ccctgctatt tgaaggatca tcccacacag cagagaatct attaatgcaa
     1321 agcaagtgag agagagttat ctcaacctca ctaacttact tgcagaaagg ttctgtacta
     1381 ctatagctgc ggaatcaccc c
//
LOCUS       HUMHER3A     4975 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human epidermal growth factor receptor (HER3) mRNA, complete cds.
ACCESSION   M34309
KEYWORDS    HER3 gene; epidermal growth factor receptor.
SOURCE      Human cell line MDA-MB-361, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4975)
  AUTHORS   Plowman,G.D., Whitney,G.S., Neubauer,M.G., Green,J.M.,
            McDonald,V.L., Todaro,G.J. and Shoyab,M.
  TITLE     Molecular cloning and expression of another epidermal growth factor
            receptor-related gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 4905-4909 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.D.Plowman, 16-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept        199     4227     HER3 protein precursor
    sigp        199      255     HER3 signal peptide
    matp        256     4224     HER3 protein
BASE COUNT     1195 a   1309 c   1327 g   1144 t
ORIGIN      Chromosome 12q13.
        1 ctctcacaca cacacacccc tcccctgcca tccctccccg gactccggct ccggctccga
       61 ttgcaatttg caacctccgc tgccgtcgcc gcagcagcca ccaattcgcc agcggttcag
      121 gtggctcttg cctcgatgtc ctagcctagg ggcccccggg ccggacttgg ctgggctccc
      181 ttcaccctct gcggagtcat gagggcgaac gacgctctgc aggtgctggg cttgcttttc
      241 agcctggccc ggggctccga ggtgggcaac tctcaggcag tgtgtcctgg gactctgaat
      301 ggcctgagtg tgaccggcga tgctgagaac caataccaga cactgtacaa gctctacgag
      361 aggtgtgagg tggtgatggg gaaccttgag attgtgctca cgggacacaa tgccgacctc
      421 tccttcctgc agtggattcg agaagtgaca ggctatgtcc tcgtggccat gaatgaattc
      481 tctactctac cattgcccaa cctccgcgtg gtgcgaggga cccaggtcta cgatgggaag
      541 tttgccatct tcgtcatgtt gaactataac accaactcca gccacgctct gcgccagctc
      601 cgcttgactc agctcaccga gattctgtca gggggtgttt atattgagaa gaacgataag
      661 ctttgtcaca tggacacaat tgactggagg gacatcgtga gggaccgaga tgctgagata
      721 gtggtgaagg acaatggcag aagctgtccc ccctgtcatg aggtttgcaa ggggcgatgc
      781 tggggtcctg gatcagaaga ctgccagaca ttgaccaaga ccatctgtgc tcctcagtgt
      841 aatggtcact gctttgggcc caaccccaac cagtgctgcc atgatgagtg tgccgggggc
      901 tgctcaggcc ctcaggacac agactgcttt gcctgccggc acttcaatga cagtggagcc
      961 tgtgtacctc gctgtccaca gcctcttgtc tacaacaagc taactttcca gctggaaccc
     1021 aatccccaca ccaagtatca gtatggagga gtttgtgtag ccagctgtcc ccataacttt
     1081 gtggtggatc aaacatcctg tgtcagggcc tgtcctcctg acaagatgga agtagataaa
     1141 aatgggctca agatgtgtga gccttgtggg ggactatgtc ccaaagcctg tgagggaaca
     1201 ggctctggga gccgcttcca gactgtggac tcgagcaaca ttgatggatt tgtgaactgc
     1261 accaagatcc tgggcaacct ggactttctg atcaccggcc tcaatggaga cccctggcac
     1321 aagatccctg ccctggaccc agagaagctc aatgtcttcc ggacagtacg ggagatcaca
     1381 ggttacctga acatccagtc ctggccgccc cacatgcaca acttcagtgt tttttccaat
     1441 ttgacaacca ttggaggcag aagcctctac aaccggggct tctcattgtt gatcatgaag
     1501 aacttgaatg tcacatctct gggcttccga tccctgaagg aaattagtgc tgggcgtatc
     1561 tatataagtg ccaataggca gctctgctac caccactctt tgaactggac caaggtgctt
     1621 cgggggccta cggaagagcg actagacatc aagcataatc ggccgcgcag agactgcgtg
     1681 gcagagggca aagtgtgtga cccactgtgc tcctctgggg gatgctgggg cccaggccct
     1741 ggtcagtgct tgtcctgtcg aaattatagc cgaggaggtg tctgtgtgac ccactgcaac
     1801 tttctgaatg gggagcctcg agaatttgcc catgaggccg aatgcttctc ctgccacccg
     1861 gaatgccaac ccatgggggg cactgccaca tgcaatggct cgggctctga tacttgtgct
     1921 caatgtgccc attttcgaga tgggccccac tgtgtgagca gctgccccca tggagtccta
     1981 ggtgccaagg gcccaatcta caagtaccca gatgttcaga atgaatgtcg gccctgccat
     2041 gagaactgca cccaggggtg taaaggacca gagcttcaag actgtttagg acaaacactg
     2101 gtgctgatcg gcaaaaccca tctgacaatg gctttgacag tgatagcagg attggtagtg
     2161 attttcatga tgctgggcgg cacttttctc tactggcgtg ggcgccggat tcagaataaa
     2221 agggctatga ggcgatactt ggaacggggt gagagcatag agcctctgga ccccagtgag
     2281 aaggctaaca aagtcttggc cagaatcttc aaagagacag agctaaggaa gcttaaagtg
     2341 cttggctcgg gtgtctttgg aactgtgcac aaaggagtgt ggatccctga gggtgaatca
     2401 atcaagattc cagtctgcat taaagtcatt gaggacaaga gtggacggca gagttttcaa
     2461 gctgtgacag atcatatgct ggccattggc agcctggacc atgcccacat tgtaaggctg
     2521 ctgggactat gcccagggtc atctctgcag cttgtcactc aatatttgcc tctgggttct
     2581 ctgctggatc atgtgagaca acaccggggg gcactggggc cacagctgct gctcaactgg
     2641 ggagtacaaa ttgccaaggg aatgtactac cttgaggaac atggtatggt gcatagaaac
     2701 ctggctgccc gaaacgtgct actcaagtca cccagtcagg ttcaggtggc agattttggt
     2761 gtggctgacc tgctgcctcc tgatgataag cagctgctat acagtgaggc caagactcca
     2821 attaagtgga tggcccttga gagtatccac tttgggaaat acacacacca gagtgatgtc
     2881 tggagctatg gtgtgacagt ttgggagttg atgaccttcg gggcagagcc ctatgcaggg
     2941 ctacgattgg ctgaagtacc agacctgcta gagaaggggg agcggttggc acagccccag
     3001 atctgcacaa ttgatgtcta catggtgatg gtcaagtgtt ggatgattga tgagaacatt
     3061 cgcccaacct ttaaagaact agccaatgag ttcaccagga tggcccgaga cccaccacgg
     3121 tatctggtca taaagagaga gagtgggcct ggaatagccc ctgggccaga gccccatggt
     3181 ctgacaaaca agaagctaga ggaagtagag ctggagccag aactagacct agacctagac
     3241 ttggaagcag aggaggacaa cctggcaacc accacactgg gctccgccct cagcctacca
     3301 gttggaacac ttaatcggcc acgtgggagc cagagccttt taagtccatc atctggatac
     3361 atgcccatga accagggtaa tcttgggggg tcttgccagg agtctgcagt ttctgggagc
     3421 agtgaacggt gcccccgtcc agtctctcta cacccaatgc cacggggatg cctggcatca
     3481 gagtcatcag aggggcatgt aacaggctct gaggctgagc tccaggagaa agtgtcaatg
     3541 tgtagaagcc ggagcaggag ccggagccca cggccacgcg gagatagcgc ctaccattcc
     3601 cagcgccaca gtctgctgac tcctgttacc ccactctccc cacccgggtt agaggaagag
     3661 gatgtcaacg gttatgtcat gccagataca cacctcaaag gtactccctc ctcccgggaa
     3721 ggcacccttt cttcagtggg tctcagttct gtcctgggta ctgaagaaga agatgaagat
     3781 gaggagtatg aatacatgaa ccggaggaga aggcacagtc cacctcatcc ccctaggcca
     3841 agttcccttg aggagctggg ttatgagtac atggatgtgg ggtcagacct cagtgcctct
     3901 ctgggcagca cacagagttg cccactccac cctgtaccca tcatgcccac tgcaggcaca
     3961 actccagatg aagactatga atatatgaat cggcaacgag atggaggtgg tcctgggggt
     4021 gattatgcag ccatgggggc ctgcccagca tctgagcaag ggtatgaaga gatgagagct
     4081 tttcaggggc ctggacatca ggccccccat gtccattatg cccgcctaaa aactctacgt
     4141 agcttagagg ctacagactc tgcctttgat aaccctgatt actggcatag caggcttttc
     4201 cccaaggcta atgcccagag aacgtaactc ctgctccctg tggcactcag ggagcattta
     4261 atggcagcta gtgcctttag agggtaccgt cttctcccta ttccctctct ctcccaggtc
     4321 ccagcccctt ttccccagtc ccagacaatt ccattcaatc tttggaggct tttaaacatt
     4381 ttgacacaaa attcttatgg tatgtagcca gctgtgcact ttcttctctt tcccaacccc
     4441 aggaaaggtt ttccttattt tgtgtgcttt cccagtccca ttcctcagct tcttcacagg
     4501 cactcctgga gatatgaagg attactctcc atatcccttc ctctcaggct cttgactact
     4561 tggaactagg ctcttatgtg tgcctttgtt tcccatcaga ctgtcaagaa gaggaaaggg
     4621 aggaaaccta gcagaggaaa gtgtaatttt ggtttatgac tcttaacccc ctagaaagac
     4681 agaagcttaa aatctgtgaa gaaagaggtt aggagtagat attgattact atcataattc
     4741 agcacttaac tatgagccag gcatcatact aaacttcacc tacattatct cacttagtcc
     4801 tttatcatcc ttaaaacaat tctgtgacat acatattatc tcattttaca caaagggaag
     4861 tcgggcatgg tggctcatgc ctgtaatctc agcactttgg gaggctgagg cagaaggatt
     4921 acctgaggca aggagtttga gaccagctta gccaacatag taagaccccc atctc
//
LOCUS       HUMMEHA       959 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human microsomal epoxide hydrolase (HmEH) mRNA, partial cds.
ACCESSION   M36374
KEYWORDS    microsomal epoxide hydrolase.
SOURCE      Human liver, cDNA to mRNA, clone HmEH7.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 959)
  AUTHORS   Craft,J.A., Jackson,M.R. and Burchell,B.
  TITLE     Partial nucleotide sequence of a cloned cDNA for human liver
            microsomal epoxide hydrolase
  JOURNAL   Biochem. Soc. Trans. 15, 708-709 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   959     microsomal epoxide hydrolase (AA at 3) (EC
                                 3.3.2.3)
BASE COUNT      217 a    272 c    251 g    219 t
ORIGIN      
        1 cttcagtgct gggctttgcc atctactggt tcatctcccg ggacaaagag gaaactttgc
       61 cacttgaaga tgggtggtgg gggccaggca cgaggtccgc agccagggag gacgacagca
      121 tccgcccttt caaggtggaa acgtcagatg aggagatcca cgacttacac cagaggatcg
      181 ataagttccg tttcacccca cctttggagg acagctgctt ccactatggc ttcaactcca
      241 actacctgaa gaaagtcatc tcctactggc ggaatgaatt tgactggaag aagcaggtgg
      301 agattctcaa caagtaccct cacttcaaga ctaaaattga agggctggac atccacttca
      361 tccacgtgaa gcccccccag ctgcccgcag gccatacccc gaagcccttg ctgatggtga
      421 acggctggcc cggctctttc tacgagtttt ataagatcat cccactcctg actgacccca
      481 agaaccatgg cctgagcgac gagcacgttt ttgaagtcat ctgcccttcc atccctggct
      541 atggcttctc agaggcatcc tccaagaagg ggttcaactc ggtggccacc gccaggatct
      601 tttacaagct gatgctgcgg ctgggcttcc aggaattcta cattcaagga ggggactggg
      661 ggtccctgat ctgcactaat atggcccagc tggtgcccag ccacctgaaa ggcctgcact
      721 tgaacatggc tttggtttta agcaacttct ctaccctgac cctcctcctg ggacagcgtt
      781 tcgggaggtt tcttggcctc actgagaggg atgtggagct gctgtacccc gtcaaggaga
      841 aggtattcta cagcctgatg agggagagcg gctacatgca catccagtgc accaagcctg
      901 acaccgtagg ctctgctctg aatgactctc ctgtgggtct ggctgcctat attctagag
//
LOCUS       HUMRNPSMBA    847 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human small nuclear ribonucleoprotein particle SmB mRNA, 3' end.
ACCESSION   M34081
KEYWORDS    lupus autoantigen; small nuclear ribonucleoprotein particle.
SOURCE      Human HeLa cell, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 847)
  AUTHORS   Elkon,K.B., Hines,J.J., Chu J,-L. and Parnassa,A.P.
  TITLE     Epitope mapping of recombinant HeLa SmB and B' peptides obtained by
            the polymerase chain reaction
  JOURNAL   J. Immunol. 145, 636-643 (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy for [1] kindly submitted by K.B.Elkon 23-APR-1990,
            for release after publication.
FEATURES       from  to/span     description
    pept    <     1      675     small nuclear ribonucleoprotein particle (SmB)
    mRNA    <     1      847     snRNP SmB mRNA
BASE COUNT      170 a    251 c    252 g    174 t
ORIGIN      
        1 aagatgctgc agcatattga ttacaggatg aggtgcatcc tgcaggacgg ccggatcttc
       61 attggcacct tcaaggcttt tgacaagcac atgaatttga tcctctgtga ctgtgatgag
      121 ttcagaaaga tcaagccaaa gaactccaaa caagcagaaa gggaagagaa gcgagtcctc
      181 ggtctggtgc tgctgcgagg ggagaatctg gtctcaatga cagtagaggg acctcctccc
      241 aaagatactg gtattgctcg agttccactt gctggagctg ccgggggccc agggatcggc
      301 agggctgctg gcagaggaat cccagctggg gttcccatgc cccaggctcc tgcaggactt
      361 gctgggccag tccgtggggt tggcgggcca tcccaacagg tgatgacccc acaaggaaga
      421 ggtactgttg cagccgctgc agctgctgcc acagccagta ttgccggggc tccaacccag
      481 tacccacctg gccgtggggg tcctccccca cctatgggcc gaggagcacc ccctccaggc
      541 atgatgggcc cacctcctgg tatgagacct cctatgggtc ccccaatggg gatcccccct
      601 ggaagaggga ctccaatggg catgccccct ccgggaatgc ggcctcctcc ccctgggatg
      661 cgaggccttc tttgaccctt ggccacagag tatggaagta gctccgcaga ggcgtgggct
      721 cgattcctca gggccacgtt accacagacc tgtttgtttc ttatgctgtt gttcgtggag
      781 tctcatggga ttgtctggtt tccttacagg gccccctccc ccgggaatgc gcccaccaag
      841 gccctag
//
LOCUS       HUMRNPSMBB    702 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human small nuclear ribonucleoprotein particle SmB' mRNA, 3' end.
ACCESSION   M34082
KEYWORDS    lupus autoantigen; small nuclear ribonucleoprotein particle.
SOURCE      Human HeLa cell, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 702)
  AUTHORS   Elkon,K.B., Hines,J.J., Chu J,-L. and Parnassa,A.P.
  TITLE     Epitope mapping of recombinant HeLa SmB and B' peptides obtained by
            the polymerase chain reaction
  JOURNAL   J. Immunol. 145, 636-643 (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy for [1] kindly submitted by K.B.Elkon 23-APR-1990,
            for release after publication.
FEATURES       from  to/span     description
    pept    <     1      702     small nuclear ribonucleoprotein particle
                                 (SmB')
BASE COUNT      147 a    215 c    212 g    128 t
ORIGIN      
        1 aagatgctgc agcatattga ttacaggatg aggtgcatcc tgcaggacgg ccggatcttc
       61 attggcacct tcaaggcttt tgacaagcac atgaatttga tcctctgtga ctgtgatgag
      121 ttcagaaaga tcaagccaaa gaactccaaa caagcagaaa gggaagagaa gcgagtcctc
      181 ggtctggtgc tgctgcgagg ggagaatctg gtctcaatga cagtagaggg acctcctccc
      241 aaagatactg gtattgctcg agttccactt gctggagctg ccgggggccc agggatcggc
      301 agggctgctg gcagaggaat cccagctggg gttcccatgc cccaggctcc tgcaggactt
      361 gctgggccag tccgtggggt tggcgggcca tcccaacagg tgatgacccc acaaggaaga
      421 ggtactgttg cagccgctgc agctgctgcc acagccagta ttgccggggc tccaacccag
      481 tacccacctg gccgtggggg tcctccccca cctatgggcc gaggagcacc ccctccaggc
      541 atgatgggcc cacctcctgg tatgagacct cctatgggtc ccccaatggg gatcccccct
      601 ggaagaggga ctccaatggg catgccccct ccgggaatgc ggcctcctcc ccctgggatg
      661 cgagggcccc ctcccccggg aatgcgccca ccaaggccct ag
//
LOCUS       HUMTPMYOC    1191 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human skeletal muscle beta-tropomyosin mRNA, complete cds.
ACCESSION   X06825 M36268
KEYWORDS    actin-binding protein; beta-tropomyosin.
SOURCE      Human skeletal muscle, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1191)
  AUTHORS   Liautard,J.P.
  JOURNAL   Unpublished (1988) CRBM du CNRS, 34033 Montpelier Cedex, France
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 1191)
  AUTHORS   Widada,J.S., Ferraz,C., Capony,J.-P. and Liautard,J.-P.
  TITLE     Complete nucleotide sequence of the adult skeletal isoform of human
            skeletal muscle beta-tropomyosin
  JOURNAL   Nucleic Acids Res. 16, 3109-3109 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        107      961     beta-tropomyosin
    mRNA    <     1     1191     beta-tropomyosin mRNA
BASE COUNT      321 a    320 c    368 g    182 t
ORIGIN      
        1 cccgctccgt cctcctcgcc tgccaccggt gcacccagtc cgctcaccca gcccagtccg
       61 tccggtcctc accgcctgcc ggccggccca ccccccaccg caggccatgg acgccatcaa
      121 gaagaagatg cagatgctga agctggacaa ggagaacgcc atcgaccgcg ccgagcaggc
      181 cgaagccgac aagaagcaag ctgaggaccg ctgcaagcag ctggaggagg agcagcaggc
      241 cctccagaag aagctgaagg ggacagagga tgaggtggaa aagtattctg aatccgtgaa
      301 ggaggcccag gagaaactgg agcaggccga gaagaaggcc actgatgctg aggcagatgt
      361 ggcctccctg aaccgccgca ttcagctggt tgaggaggag ctggaccggg cccaggagcg
      421 cctggctaca gccctgcaga agctggagga ggccgagaag gcggctgatg agagcgagag
      481 aggaatgaag gtcatcgaaa accgggccat gaaggatgag gagaagatgg aactgcagga
      541 gatgcagctg aaggaggcca agcacatcgc tgaggattca gaccgcaaat atgaagaggt
      601 ggccaggaag ctggtgatcc tggaaggaga gctggagcgc tcggaggaga gggctgaggt
      661 ggccgagagt aaatgtgggg acctagagga ggagctgaaa attgttacca acaacttgaa
      721 atccctggag gcccaggcgg acaagtattc caccaaagaa gataaatatg aagaggagat
      781 caaactgttg gaggagaagc tgaaggaggc tgagacccga gcagagtttg ccgagaggtc
      841 tgtggcaaag ttggagaaaa ccatcgatga cctagaagat gaagtctatg cccagaagat
      901 gaagtacaag gccattagcg aggaactgga caacgcactc aatgacatca cctccctctg
      961 agccccacgc ccagcgtgcc acctcagctc tcttctctcc tctcctttcc attctctcta
     1021 tggggagggg agagcaggca ggaggagcag aaattgccaa cattgcacag ccaggctggg
     1081 agcagcctag ggagagcccc catcatgccc accacccact ctggcactgg cttcatcctt
     1141 tacctatccc cttccaccct cctttgcttg cttaataaat tctgaacttg g
//
LOCUS       HUMTRHA      1322 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human thyroid releasing hormone (TRH) mRNA, complete cds.
ACCESSION   M36317
KEYWORDS    thyroid releasing hormone.
SOURCE      Human brain, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1322)
  AUTHORS   Mandel,G. and Goodman,R.H.
  TITLE     Using the brain to screen cloned genes
  JOURNAL   Trends Neurosci. 10, 101-104 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        103      870     thyroid releasing hormone precursor (TRH)
    sigp        103      174     thyroid releasing hormone signal peptide
    matp        175      867     thyroid releasing hormone
BASE COUNT      315 a    344 c    372 g    291 t
ORIGIN      
        1 tccttggatt cgggagtatt gcaaactcta cccagccagt ttgcactctt cagctcagca
       61 tcttggaaag ctctgcagag tctccacttc gcagactcca ggatgccggg accttggttg
      121 ctgctggctc tggctttgat cttcacccta actggtatcc ctgaatcctg cgccttgccg
      181 gaggcagccc aggaggaagg tgcagtgact cctgaccttc ctggcctgga gaatgttcag
      241 gtccggccag aacgtcgatt cttgtggaaa gacctccagc gggtgagagg ggacctcggt
      301 gctgccttag actcctggat cacaaaacgc cagcatccag gcaaaaggga ggaggaggaa
      361 aaagacattg aagctgaaga gaggggagac ttgggagaag ggggagcctg gagactccac
      421 aaacgacagc accccggccg acgtgccaac caggacaagt attcatgggc agatgaggag
      481 gacagtgact ggatgccacg gtcctggtta ccagatttct ttctggattc ctggttctca
      541 gatgtccccc aagtcaagcg gcagcaccct ggcaggcgat ccttcccctg gatggagtct
      601 gatgtcacca agaggcaaca tccaggccgg aggttcatag atcccgagct ccaaagaagc
      661 tgggaagaaa aagagggaga gggtgtctta atgcctgaga aacgccagca tcctggcaaa
      721 agggcattgg gtcatccctg tgggccccag gggacttgtg gtcaaacagg cctgctccag
      781 cttctaggtg acctgagcag gggtcaggag accctggtga agcaaagccc acaagtggaa
      841 ccctgggaca aggaacctct ggaggagtaa ggccagagtc aggctttagg tctaggatga
      901 tgtaagccct gtattcccta tcctgttccc ttcactagct gtcctctctt agatgctaac
      961 cctgggccct ctgtacatcg tccacccaaa ccccttcctt accgacttca gagactttag
     1021 aaagccagtc aggaagttaa aaccctactt atcccttcaa gcaagggggt gggagtcaca
     1081 ccccttcagc actggccaag atggttcttt cacaccttct aagggacctc ctcagaaagg
     1141 aagggtagaa ttgaaatgtt ttggtgttaa aacttctgta atctgcccca tgtggtaaga
     1201 ctgacctggt atagcttcag cgcatcctcc aaggttgggg tccctgagca gtttgggaga
     1261 tgtttagata tgtccttggg gtggggggac gcatcttgta tataaccctg ctcttgggaa
     1321 tc
//
LOCUS       MAARNA1A      163 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-1.
ACCESSION   M35975
KEYWORDS    RNA-1 leader sequence.
SOURCE      Alfalfa mosaic virus (strain AlMV-S) genomic RNA.
  ORGANISM  Alfalfa mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE   1  (bases 1 to 163)
  AUTHORS   Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
  TITLE     Structure of the 5'-terminal untranslated region of the genomic
            RNAs from two strains of alfalfa mosaic virus
  JOURNAL   Nucleic Acids Res. 11, 2815-2826 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        103  >   163     115 kDa protein
    ORF          92  >   163     ORF1
    ORF         107  >   163     ORF2
BASE COUNT       39 a     38 c     24 g     62 t
ORIGIN      5' end of RNA-1.
        1 ggtttttatc ttacacacgc ttgtgcaaga tagttaatcc atttattttt tccttgtatt
       61 ttccacagta ttacgttcat tcaatactgt gatgaattca ctatgaatgc tgacgcccta
      121 tccaccgatt cctgccttag tatgcgagaa cctttatccc atg
//
LOCUS       MAARNA1B      115 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-1.
ACCESSION   M35976
KEYWORDS    RNA-1 leader sequence.
SOURCE      Alfalfa mosaic virus (strain AlMV-B) genomic RNA.
  ORGANISM  Alfalfa mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE   1  (bases 1 to 115)
  AUTHORS   Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
  TITLE     Structure of the 5'-terminal untranslated region of the genomic
            RNAs from two strains of alfalfa mosaic virus
  JOURNAL   Nucleic Acids Res. 11, 2815-2826 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        101  >   115     115 kDa protein
    ORF         105  >   115     ORF1
BASE COUNT       29 a     25 c     18 g     43 t
ORIGIN      5' end of RNA-1.
        1 ggtttttatc ttacacacgc ttgtgcaaga tagttaatcc atttattttc cttgcgcttt
       61 ccacagcatt acgttcattc aatactgtga agaattcatt atgaatgctg acgcc
//
LOCUS       MAARNA2A      108 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-2.
ACCESSION   M36389
KEYWORDS    RNA-2 leader sequence.
SOURCE      Alfalfa mosaic virus (strain AlMV-S) genomic RNA.
  ORGANISM  Alfalfa mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE   1  (bases 1 to 108)
  AUTHORS   Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
  TITLE     Structure of the 5'-terminal untranslated region of the genomic
            RNAs from two strains of alfalfa mosaic virus
  JOURNAL   Nucleic Acids Res. 11, 2815-2826 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         56  >   108     100 kDa protein
BASE COUNT       26 a     19 c     19 g     44 t
ORIGIN      5' end of RNA-2.
        1 ggtttttatc ttttcgcgat tcaaaagata agtttcagag tctaatcttt tcaatatgtt
       61 cactcttttg agatgtcttg gatttggcgt caacgacctt accgatat
//
LOCUS       MAARNA2B      109 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-2.
ACCESSION   M36390
KEYWORDS    RNA-2 leader sequence.
SOURCE      Alfalfa mosaic virus (strain AlMV-B) genomic RNA.
  ORGANISM  Alfalfa mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE   1  (bases 1 to 109)
  AUTHORS   Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
  TITLE     Structure of the 5'-terminal untranslated region of the genomic
            RNAs from two strains of alfalfa mosaic virus
  JOURNAL   Nucleic Acids Res. 11, 2815-2826 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         57  >   109     100 kDa protein
BASE COUNT       25 a     19 c     20 g     45 t
ORIGIN      5' end of RNA-2.
        1 ggtttttatc ttttcgcgat tgaaaagata agtttcctga gtctaatctt ttcaatatgt
       61 tcactctttt gagatgtctt ggatttggcg tcaacgacct taccgatat
//
LOCUS       MAARNA3B      305 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-3.
ACCESSION   M36391
KEYWORDS    RNA-3 leader sequence.
SOURCE      Alfalfa mosaic virus (strain AlMV-S) genomic RNA.
  ORGANISM  Alfalfa mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE   1  (bases 1 to 305)
  AUTHORS   Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
  TITLE     Structure of the 5'-terminal untranslated region of the genomic
            RNAs from two strains of alfalfa mosaic virus
  JOURNAL   Nucleic Acids Res. 11, 2815-2826 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        259  >   305     35 kDa protein
BASE COUNT       87 a     57 c     48 g    113 t
ORIGIN      5' end of RNA-3.
        1 ggttttcatc ttacacacgc ttgtgcaaga tagttaatca ttccaattca actcaattaa
       61 cgttttttac agtgtaattc gtacttttcg taagtaagtt tctgtaaaag cgtttcttgt
      121 tttaatttgg cctaacacgt aattcgtact cttcgtgagt aagttgtgtt agccatacct
      181 atcctttaaa ttcctgtcaa tttaaaaaga aaatcattcc cattcgcgta attcgtactc
      241 ttcgtgagta agttgcaaat ggagaataca aaaacaaatg cctcgagttc tggaatgttt
      301 tcttc
//
LOCUS       MAARNA3C      290 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-3.
ACCESSION   M36392
KEYWORDS    RNA-3 leader sequence.
SOURCE      Alfalfa mosaic virus (strain AlMV-B) genomic RNA.
  ORGANISM  Alfalfa mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Alfamoviridae.
REFERENCE   1  (bases 1 to 290)
  AUTHORS   Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L.
  TITLE     Structure of the 5'-terminal untranslated region of the genomic
            RNAs from two strains of alfalfa mosaic virus
  JOURNAL   Nucleic Acids Res. 11, 2815-2826 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        244  >   290     35 kDa protein
BASE COUNT       90 a     52 c     41 g    107 t
ORIGIN      5' end of RNA-3.
        1 ggttttaata ccattttcaa aatattccaa ttcaactcaa ttaacccttt tacagtgtaa
       61 ttcgtacttt tcgtaagtaa gtttctgtaa aagcgtttct tgttttaatt tggcctaaca
      121 cgtaattcgt actcttcgtg agtaagttgt gttagccata tctatccttt aaatttttgt
      181 caaatttaaa aaagaaaatc attcccattc gcgtaattcg tactcttcgt gagtaagttg
      241 caaatggaga atacaaaaac aaatgcctcg agaactggaa tatcttcttc
//
LOCUS       MBFRRDA      1476 bp ds-DNA             RNA       14-AUG-1990
DEFINITION  M.formicicum 16S rRNA.
ACCESSION   M36508
KEYWORDS    16S ribosomal RNA.
SOURCE      M.formicicum (strain DSM 1312) rRNA.
  ORGANISM  Methanobacterium formicicum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanobacteriaceae.
REFERENCE   1  (bases 1 to 1476)
  AUTHORS   Lechner,K., Wich,G. and Boeck,A.
  TITLE     The nucleotide sequence of the 16S rRNA gene and flanking regions
            from Methanobacterium formicicum: The phylogenetic relationship
            between methanogenic and halophilic archaebacteria
  JOURNAL   Syst. Appl. Microbiol. 6, 157-163 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1     1476     16S rRNA
BASE COUNT      348 a    355 c    472 g    301 t
ORIGIN      
        1 agtccgtttg atcctggcgg aggccactgc tattgggttt cgattaagcc atgcaagtcg
       61 aacgatcctt cggggtcgtg gcgtacggct cagtaacacg tggataacct aaccttagga
      121 ctgggataac cctgggaaac tggggataat accggatatg tagggctgcc tggaatggtt
      181 ccctattgaa atgttccgac gcctaaggat ggatctgcgg cagattaggt agttggcggg
      241 gtaaatgccc accaagccag taatctgtac gggttgtgag agcaagagcc cggagatgga
      301 acctgagaca aggttccagg ccctacgggg cgcagcaggc gcgaaacctc cgcaatgcac
      361 gaaagtgcga cgggggaaac ccaagtgcca ctcttaacgg ggtggctttt cttaagtgta
      421 aaaagctttt ggaataagag ctgggcaaga ccggtgccag ccgccgcggt aacaccggca
      481 gctcaagtgg tggccgtttt tattgggcct aaagcgttcg tagccggctt gataagtctc
      541 tggtgaaatc tcacggctta accgtgagaa ttgctggaga tactattagg cttgaggccg
      601 ggagaggtta gcggtactcc cggggtaggg gtgaaatcct ataatcccgg gaggaccacc
      661 tgtggcgaag gcggctaact ggaacgggcc tgacggtgag taacgaaagc caggggcgcg
      721 aaccggatta gatacccggg tagtcctggc cgtaaacgat gtggacttgg tgttgggatg
      781 gctccgagct gccccagtgc cgaagggaag ctgttaagtc caccgcctgg gaagtacggt
      841 cgcaagactg aaacttaaag gaattggcgg gggagcacca caacgcgtgg agcctgcggt
      901 ttaattggat tcaacgccgg acatctcacc aggggcgaca gcagaatgat agccaggttg
      961 atgaccttgc ttgacaagct gagaggaggt gcatggccgc cgtcagctcg taccgtgagg
     1021 cgtcctgtta agtcaggcaa cgagcgagac ccacgccctt agttaccagc ggatccttcg
     1081 ggatgccggg cacactaagg ggaccgccag tgataaactg gaggaaggag tggacgacgg
     1141 taggtccgta tgccccgaat cccctgggct acacgcgggc tacaatggtt aggacaatgg
     1201 gttccgacac tgaaaggtgg aggtaatctc ctaaacctgg ccttagttcg gattgagggc
     1261 tgtaactcgc cctcatgaag ctggaatgcg tagtaatcgc gtgtcataac cgcgcggtga
     1321 atacgtccct gctccttgca cacaccgccc gtcacgccac ccaaaaaggg tttggatgag
     1381 gccatagtct ttggttatgg tcgaatctag gttctttgag gagggcgaag tcgtaacaag
     1441 gtagccgtag gggaacctgc ggctggatca cctcct
//
LOCUS       MSHLTRERA     190 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Harvey murine sarcoma virus LTR 75-bp direct-repeat enhancer
            region.
ACCESSION   M26322
KEYWORDS    enhancer region; long terminal repeat.
SOURCE      Harvey murine sarcoma virus, cDNA to viral RNA, clone pM14-1.
  ORGANISM  Harvey murine sarcoma virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Oncovirinae; Type C oncovirus group; 
            Mammalian type C oncoviruses; Murine sarcoma viruses.
REFERENCE   1  (bases 1 to 190)
  AUTHORS   Ostrowski,M.C., Huang,A.L., Kessel,M., Wolford,R.G. and Hager,G.L.
  TITLE     Modulation of enhancer activity by the hormone responsive
            regulatory element from mouse mammary tumor virus
  JOURNAL   EMBO J. 3, 1891-1899 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rpt          21       95     75 bp direct-repeat enchancer
    rpt          96      170     75 bp direct-repeat enchancer
BASE COUNT       31 a     54 c     48 g     57 t
ORIGIN      
        1 ggctggaccg catctgggga ccatctgttc ttggccctga gccggggcag gaactgtcta
       61 ccacagatat cctgtttggc ccatattcag ctgttctatc tgttcttggc cctggagccg
      121 ggcaggaact gtctaccaca gatatcctgt ttggcccata ttcagctgtt tctctgttcc
      181 tgaccttgat
//
LOCUS       MUSKERDA     1381 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse keratin D mRNA, complete cds.
ACCESSION   M36376
KEYWORDS    keratin D.
SOURCE      Mouse F9 teratocarcinoma stem cell, cDNA to mRNA, clone B53.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1381)
  AUTHORS   Alonso,A., Weber,T. and Jorcano,J.L.
  TITLE     Cloning and characterization of keratin D, a murine endodermal
            cytoskeletal protein induced during in vitro differentiation of F9
            teratocarcinoma cells
  JOURNAL   Roux's Arch. Dev. Biol. 196, 16-21 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         50     1321     keratin D
    mRNA    <     1     1381     keratin D mRNA
    site         28       42     TCTCC motif
BASE COUNT      366 a    373 c    401 g    241 t
ORIGIN      
        1 aactcctgtt ctggtctctc gcttcgctct cctctcctct ccagacaaga tgagcttcac
       61 aacccgctcc accaccttct ccaccaacta ccggtccctg ggctctgtgc ggactcccag
      121 ccagcgggtc cggcctgcca gcagcgcagc cagcgtctat gcaggtgctg ggggctccgg
      181 gtcacggata tccgtgtccc gctccgtctg gggtggctct gtggggtccg caggcctggc
      241 gggaatgggt ggaatccaga ccgagaagga gaccatgcaa gacctgaacg atcgcctggc
      301 cagctaccta gacaaggtga agagtctgga aactgagaac aggagactgg agagcaaaat
      361 ccgggaacat ctggagaaga aggggcccca gggcgtcaga gactggggcc actacttcaa
      421 gatcatcgaa gacctgaggg ctcagatcct tgcgaattct gtggacaatg cccgcatcgt
      481 cttgcagatc gacaatgccc gccttgccgc cgatgacttt agagtcaagt atgagacaga
      541 actagccatg cgccagtctg tggagagcga catccatgga ctccgcaagg tggtagatga
      601 caccaacatc acaaggctgc agctggagac agaaatcgag gcactcaagg aagaacttct
      661 gttcatgaag aagaatcatg aagaggaagt ccaaggtctg gaagcccaga ttgccagctc
      721 tggattgact gtggaagtgg atgcccccaa atctcaggac ctcagcaaga tcatggcgaa
      781 catccgcgcc cagtatgaag cgctgggcca gaagaaccgc gaggaactgg acaagtactg
      841 gtctcagcag attgaggaga gtaccacagt tgtcaccacc aagtctgccg aaatcaggga
      901 cgctgagacc acactcacgg agctgagacg caccctccag accttggaga ttgacttgga
      961 ctccatgaaa aaccagaaca tcaacttgga gaacagcctc ggggatgtgg aggcccgata
     1021 caaggcacag atggagcagc tcaatggggt ccttctgcat ctggagtcag agctggcaca
     1081 aactcgggca gagggccagc gccaggccca ggaatatgaa gccctcttga acatcaaggt
     1141 gaagcttgag gcagagattg ccacctaccg ccgcttgctg gaggatggag aagatttcag
     1201 tctcaacgat gccctagact cgagcaactc catgcaaact gtgcagaaga caactacccg
     1261 taagatcgtg gatggcagag tggtgtccga gactaatgac accagagttc tgaggcactg
     1321 aggcagagaa ggagggaacc cctaggaact gagggaccaa taaaagttga gagctcactg
     1381 g
//
LOCUS       MUSMBPMLD1    792 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse myelin basic protein (shi-mld) gene, exon 1.
ACCESSION   M36273 M36274
KEYWORDS    myelin basic protein; shi-mld gene.
SEGMENT     1 of 2
SOURCE      Mouse mld mutated DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 792)
  AUTHORS   Okano,H., Tamura,T., Miura,M., Aoyama,A., Ikenaka,K.,
            Oshimura,M. and Mikoshiba,K.
  TITLE     Gene organization and transcription of duplicated MBP genes of
            myelin deficient (shi-mld) mutant mouse
  JOURNAL   EMBO J. 7, 77-83 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    mRNA        713  >   792     shi-mld mRNA
    signal      623      628     GC box
    signal      630      634     CAAT box
    signal      680      685     TATA box
BASE COUNT      194 a    234 c    206 g    158 t
ORIGIN      Chromosome 18.
        1 cctccagcgg aatccgggag agggattgct cagtgccctg cttccctgga ctgtaagctg
       61 cagaaagatg tgggaagtcc tgttctccac tgagaacact aaaagcacct tttgtcaaac
      121 gaccgcttca catctggggc ttgtgcactg gtggcctttt aaaccagaga caacccacaa
      181 gatacctaac ctgcggggct ctctggtaca gtgagcaact caggaaatgc tttggcttga
      241 ttgctgtggg ctctcaggcc atcgccctct ggagtggttc ttttaatgag aacctgaaga
      301 ttggcccctg agccatgtat accaagcaag ctcaatccag gttagctccc tctggttggg
      361 gcaagctaac gtgctccttg ggccccgcgc gtaactgtgc gttttatagg agacagctag
      421 ttcaagaccc caggaagaaa gcggctttgt ccccctctag gcctcgtaca ggcccacatt
      481 catatctcat tgttgttgca ggggaggcag atgcgatcca gaacaatggg acctcggctg
      541 aggacacggc ggtgacagac tccaagcaca cagcagaccc aaagaataac tggcaaggcg
      601 cccacccagc tgacccaggg aaccgccccc acttgatccg cctcttttcc cgagatgccc
      661 cgggaaggag gacaacacct tcaaagacag gccctcagag tccgacgagc ttcagaccat
      721 ccaagaagac cccacagcag cttccggagg cctggatgtc atggcatcac agaagagacc
      781 ctcacagcga tc
//
LOCUS       MUSMBPMLD2    200 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Figure 7. Nucleotide sequences near the inverted exon 3.
ACCESSION   M36275
KEYWORDS    myelin basic protein; shi-mld gene.
SEGMENT     2 of 2
SOURCE      Mouse mld mutated DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 200)
  AUTHORS   Okano,H., Tamura,T., Miura,M., Aoyama,A., Ikenaka,K.,
            Oshimura,M. and Mikoshiba,K.
  TITLE     Gene organization and transcription of duplicated MBP genes of
            myelin deficient (shi-mld) mutant mouse
  JOURNAL   EMBO J. 7, 77-83 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    /    43  /   144     shi-mld myelin basic protein, exon 3 (AA at
                                 43)
    pre-msg <     1  >   200     shi-mld mRNA and introns
    IVS     <     1       42     shi-mld intron B
    IVS         145  >   200     shi-mld intron C
BASE COUNT       54 a     62 c     40 g     44 t
ORIGIN      Chromosome 18; about 11 kbp after segment 1.
        1 ctacgccctc tccatcctca gctgctcgct tctctctttc aggactcaca cacgagaact
       61 acccattatg gctccctgcc ccataattcg cagcacggcc ggacccaaga tgaaaaccca
      121 gtagtccatt tcttcaagaa cattgtaagt gactgtgcac aggggaacca agaaatcatc
      181 aagggaaagg ttgtggatcc
//
LOCUS       MUSMIGP4A    1284 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse monokine induced by gamma interferon (MIG) mRNA, complete
            cds.
ACCESSION   M34815
KEYWORDS    MIG protein; gamma interferon-induced;
            monokine induced by gamma interferon;
            platelet factor 4-related protein; secreted protein.
SOURCE      Mouse (strain BAB/14) macrophage cell line RAW 264.7, cDNA to mRNA,
            clone 119/24.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1284)
  AUTHORS   Farber,J.M.
  TITLE     A macrophage mRNA selectively induced by gamma-interferon encodes a
            new member of the platelet factor 4 family of cytokines
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5238-5242 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.M.Farber, 04-JUN-1990, for release after publication.
FEATURES       from  to/span     description
    pept         95      475     gamma interferon induced monokine precursor
                                 (MIG)
    sigp         95      157     gamma interferon induced monokine signal
                                 peptide
    matp        158      472     gamma interferon induced monokine
    signal        6       16     TATA region
BASE COUNT      401 a    316 c    247 g    320 t
ORIGIN      
        1 tttcctaaat aaatatgatc cccaagaaca tgctctctaa agacattctc ggacttcact
       61 ccaacacagt gactcaatag aactcagctc tgccatgaag tccgctgttc ttttcctttt
      121 gggcatcatc ttcctggagc agtgtggagt tcgaggaacc ctagtgataa ggaatgcacg
      181 atgctcctgc atcagcacca gccgaggcac gatccactac aaatccctca aagacctcaa
      241 acagtttgcc ccaagcccca attgcaacaa aactgaaatc attgctacac tgaagaacgg
      301 agatcaaacc tgcctagatc cggactcggc aaatgtgaag aagctgatga aagaatggga
      361 aaagaagatc aaccaaaaga aaaagcaaaa gagggggaaa aaacatcaaa agaacatgaa
      421 aaacagaaaa cccaaaacac cccaaagtcg tcgtcgttca aggaagacta cataagagac
      481 cattacttta ccaacaagca ccctgaatct taatgggttt tagattgtac tgaaaagcct
      541 tccctggcag agcagccttt aatacatagg cttttaatac attaactcaa ctacaaaaca
      601 taaagtgtta atttgaaatt ataactaact ttaggaagtt aattgcaaaa ctccaatagt
      661 aacaattgct agaggcaaaa actctgtgtt ctacacagcc aacaaaattt catcacgccc
      721 ttgagcctag tcgtgataac atcagatctg ggcaagtgtc cctttccttc atagctatcc
      781 aatgcacaac agctgtctgg cttccagagc cacacatttg gcagcctccg gagacttctg
      841 aggctcacgt caccaagtcc caggcctgtc tgtttgctgg tgagctagat agacctcacc
      901 aagctggaga ggccctcggc agctgcattt gggtcagcct agagcccctg cacacattgt
      961 gtctcagaga tggtgctaat ggttttgggg ttctacagtg gagaccacca gagttggcct
     1021 tcagaacctc ccacgtagct ttcgagacca tgggatttca ttattaactt gatcccatct
     1081 tcagagctta ttctaagttt gcctcttcaa taaaactctc ctagaaggtt gtggctgtag
     1141 cttagtggca gaacacttgg tgttgcaggg accaggtcct tcactaacag tgcaaaaact
     1201 taaccaattt aaagaacatt ttctggctac tcaaattctc ttaaatttat tcctgtttca
     1261 caagtaaaca cttcgctgct atct
//
LOCUS       MUSZFPB1      561 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse zinc finger (Zfp-35) gene, exon 1.
ACCESSION   J04770
KEYWORDS    zinc finger protein.
SEGMENT     1 of 3
SOURCE      Mouse (strain 129) cosmid library pcos2EMBL DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 561)
  AUTHORS   Cunliffe,V., Williams,S. and Trowsdale,J.
  TITLE     Genomic analysis of a mouse zinc finger gene Zfp-35, which is
            unregulated during spermatogenesis
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            Cunliffe,V., 06-JUN-1990.
FEATURES       from  to/span     description
    pre-msg     233  >   561     zinc finger protein (Zfp-35) mRNA and introns
    IVS         286  >   561     Zfp-35 intron A
BASE COUNT      141 a    135 c    149 g    136 t
ORIGIN      Chromosome 18 bands B3-C.
        1 ccctagaagt tttgacatcg aaccacagca aagataagtt tgagtcaagc gggtccccct
       61 atgctaaagc tgggagtcgc tttagggaac ttcctgttcc taattcacag agcgcgtact
      121 tgggagggtg caccatagag ctgtgatcct cccacattcc tagagcggcc aagcgaggtc
      181 gcattgcatc caaacggatg ttgaaggaat tgtgggtggc ggcgtttgac cggtagtttc
      241 ttcatcgtcg gagagtaagg gaggcttctt gggcatggag agcaggtggg ttcttggcct
      301 agtgttctgt ggacagcaag ttcgtgctcc ctgggccttt cattcaactg catgaacgtc
      361 cgagatgttg gggctggtgg aataacgaag aggatgagcc cacactaatc agcctgcaga
      421 gctggaaaag agatacctga cttgggttag aatcgaagga acgagagaga ataacatact
      481 cttaattttt aaagccagta ttaccccccc cccccccaca cacacacacc gaattatcgt
      541 tccatttacg cctctacttt a
//
LOCUS       MUSZFPB2      340 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse zinc finger (Zfp-35) gene, exon 2.
ACCESSION   M36145
KEYWORDS    zinc finger protein.
SEGMENT     2 of 3
SOURCE      Mouse (strain 129) cosmid library pcos2EMBL DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 340)
  AUTHORS   Cunliffe,V., Williams,S. and Trowsdale,J.
  TITLE     Genomic analysis of a mouse zinc finger gene Zfp-35, which is
            unregulated during spermatogenesis
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            V,Cunliffe, 06-JUN-1990.
FEATURES       from  to/span     description
    pept        231  +   279     zinc finger protein (Zfp-35)
    pre-msg <     1  >   340     Zfp-35 mRNA and introns
    IVS     <     1       66     Zfp-35 intron A
    IVS         280  >   340     Zfp-35 intron B
BASE COUNT       90 a     79 c     66 g    105 t
ORIGIN      Chromosome 18 bands B3-C; unknown number of bps after segment 1.
        1 gtcaccaaga ggaaacgcac atgaaagtta aatatatttt ttctccttta cgtttttcat
       61 atctagaaac tctagtctca gaaatgacat ctctcctctg tacaatggtt ctcatctgag
      121 ctcagggcta ccaagttgcc tgctggcttc aaaccattat gtggacaaaa ggatcctaca
      181 gaagaaatgg taattctcca gttgcttcct gaggtgtcca gttgtgccct atggagatcc
      241 aattcagtta cgaatctcaa gaccatcact ttctgtcagg tgagcctcag tgcatttcag
      301 gttacgtggg tgatagcagt attcttcact gtctacactc
//
LOCUS       MUSZFPB3     2268 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse zinc finger (Zfp-35) gene, exon 3.
ACCESSION   M36146
KEYWORDS    zinc finger protein.
SEGMENT     3 of 3
SOURCE      Mouse (strain 129) cosmid library pcos2EMBL DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2268)
  AUTHORS   Cunliffe,V., Williams,S. and Trowsdale,J.
  TITLE     Genomic analysis of a mouse zinc finger gene Zfp-35, which is
            unregulated during spermatogenesis
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            V,Cunliffe, 06-JUN-1990.
FEATURES       from  to/span     description
    pept    +    87     1780     zinc finger protein (Zfp-35)
    pre-msg <     1     2166     Zfp-35 mRNA and introns
    IVS     <     1       86     Zfp-35 intron B
    signal     2125     2130     polyA signal
    signal     2148     2153     polyA signal
BASE COUNT      685 a    488 c    526 g    569 t
ORIGIN      Chromosome 18 bands B3-C; unknown number of bps after segment 2.
        1 ctgcagggaa atactcattt gttctacctg gagctttctt acaacatgga gcgcattttc
       61 catttagatt ttgtttatta ttccagatgg tgagactaag atcaagattg gagagccagc
      121 tacagaggag gaaatgacag gaaagattgg aacagtgact gaagagtccg gcagccttga
      181 ggaggatgtt ccccatgact ccagaggcaa agaattccgt gaatttggag aagaattgaa
      241 cgatcagatg cttttcagaa gaagacagta taactgtgat gaatgtgacc aaagctttgc
      301 ttggagtaca ggtctcatta ggcaccaaag aactcattgg aaaccttacg aatgtgagga
      361 atgtggaaag gcctttcgga tgagctcagc cctggttctg catcagagaa ttcatactgg
      421 agagaagccc tatccttgta gttggtgtat taaaagtttt agtcggagct cagaccttat
      481 taaacatcaa agagtccaca ctggtgaaaa accttacaag tgtgatgagt gtggcaaggc
      541 cttcagtcag agctcagatc ttatgataca tcagagaatc cacacaggag aaaaacccta
      601 ccaatgtagt cattgtagta aaagctttag ccagcactca ggcatggtta aacatctgag
      661 aatccacact ggagagaagc cttatatgtg taaccattgt tacaaacatt tcagtcagag
      721 ctctgatctt ataaaacatc aaaggatcca cactggggag aaaccataca agtgtgatgt
      781 gtgtgggaag gcctttagtc agagctctga tcgtattctc catcagagaa tccacactgg
      841 ggagaagcca tatccgtgtg ctcagtgtaa caaaagtttt agtcaaaatt cagaccttat
      901 taaacacaga aggatccaca ctggagagaa accgtataaa tgtagtgaat gtgggaaggc
      961 ttttaaccag agctcagtcc ttattctgca tcaaagaatt cataccggag agaaacccta
     1021 tccgtgtaac cagtgtacca aaagctttag taggctttca gatcttatta atcatcaacg
     1081 aattcacacc ggagagaagc cttacccatg tagtcagtgc agtaaaatgt ttagtagaag
     1141 gtcacacctt gttaaacatt acagaatcca cactggggag aagccctatg agtgtgataa
     1201 gtgtggcaaa actttcagtc agagttccaa ccttattctc caccagagaa tccacactgg
     1261 agagaaacca tacccgtgca acagctgttc taaaagcttc agtcgcggtt cagatctcat
     1321 aaagcatcag agggtccaca ctggagaaaa accatacacg tgcaatctgt gcagtaagag
     1381 tttcagtcaa agttcagacc tcactaaaca tcagagagtg cattctgggg agaagcccta
     1441 ccactgtagt agttgtaaca aagcctttcg tcagagttct gaccttattc tccaccacag
     1501 agttcacacg ggagaaagac catatgcgtg tacacagtgc cctagaagtt tcagtcaaaa
     1561 gtcagacctg attaaacacc agagaattca cactggagag aagccatata aatgtatgtg
     1621 tgggaaggct ttcagtcagt gctcagcctt tacccttcat cagagaatcc acactggaga
     1681 gaaaccgtat ccctgtgctc agtgtggcaa aagcttcagt cagcgctctg atctggttaa
     1741 ccatcaaaga gttcacgctg atcaaaaact acaaatgtga ctgatacctg tgtgaagcct
     1801 tcagcactgt gaccaatctt actggatatc agaggatctg caggagccgt ttgcacctct
     1861 ctgaaacgag aaaaaggaag agtgcctccc aggcggaatc aaagcaggtc agttggcaga
     1921 gttcccggct tgctgatgca tggaacaatg agaagaggtg tgcttgaggc tcaagtcaag
     1981 ggctgaatct tctgttgacc ttggtggaag gaacggggct tactggatgg gaacagatgt
     2041 cttgtgtgta tattgcagca aattgttaat actggcttgt ggctttgctc cccatggtac
     2101 tgtagtataa aaaaggctat gagaaataca ctcactgctg gcgtggtaat gacctgcagc
     2161 cctcccatgt cagtctcttg catcgttttt ctgtctttcc tataatcatc ctcacttccc
     2221 cctcaggggc tgttcgactt cacgccggcc ggactagttc aaggatcc
//
LOCUS       PP1PAR1       285 bp ds-DNA             PHG       14-AUG-1990
DEFINITION  Bacteriophage miniplasmid P1 parA gene, 5' end.
ACCESSION   M36424
KEYWORDS    centromeric-like DNA.
SEGMENT     1 of 2
SOURCE      Bacteriophage P1 plasmid prophage DNA, from E.coli.
  ORGANISM  Bacteriophage P1
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 285)
  AUTHORS   Austin,S. and Abeles,A.
  TITLE     The partition functions of P1, P7, and F miniplasmids
  JOURNAL   Basic Life Sci. 30, 215-226 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        232  >   285     parA protein
    binding     218      222     parA ribosomal binding site
    signal      149      154     parA -35 region
    signal      171      176     parA -10 region
BASE COUNT       98 a     57 c     60 g     70 t
ORIGIN      
        1 aagcttggct gcccattgaa gcaggaaatc accaaaatga ttcaggctac aacctgaacg
       61 tagaagaaat ccgcgtcctt tatgcgtgga ggatgccaaa gcatgttgtg acacacttgg
      121 caaaggagta agcatgcaga gaatgctatg tacaagcatc tacgcataca ttattatttt
      181 atgcagcatt tttaattaaa ttcaaaaata cagcataaag gatgactttc gatgagtgat
      241 tccagccagc ttcacaaggt tgctcaaaga gcaaacagaa tgctc
//
LOCUS       PP1PAR2       178 bp ds-DNA             PHG       14-AUG-1990
DEFINITION  Bacteriophage miniplasmid P1 parB gene, 3' end.
ACCESSION   M36425
KEYWORDS    centromeric-like DNA.
SEGMENT     2 of 2
SOURCE      Bacteriophage P1 plasmid prophage DNA, clone pALA161, from E.coli.
  ORGANISM  Bacteriophage P1
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 178)
  AUTHORS   Austin,S. and Abeles,A.
  TITLE     The partition functions of P1, P7, and F miniplasmids
  JOURNAL   Basic Life Sci. 30, 215-226 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       51     parB protein
BASE COUNT       62 a     42 c     21 g     53 t
ORIGIN      2150 bp after segment 1.
        1 tcgataaaaa gccgaagcct taaactttcg ccattcaaat ttcactatta actgactgtt
       61 tttaaagtaa attactctaa aatttcaagg tgaaatcgcc acgatttcac cttggatttt
      121 accttcctcc cctcctcccg aaaaaaataa aaaaattgct tgtcacgaga aagtcaac
//
LOCUS       PP7C4RNA      612 bp ds-DNA             PHG       14-AUG-1990
DEFINITION  Bacteriophage P7 c4-RNA (c4, orfx, ant1) gene, 5' end.
ACCESSION   M35139
KEYWORDS    antirepressor protein; antisense RNA; c4 repressor.
SOURCE      Bacteriophage P7 (strain c1.48ts) DNA, from E.coli (strain C600).
  ORGANISM  Bacteriophage P7
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 612)
  AUTHORS   Citron,M. and Schuster,H.
  TITLE     The c4 repressors of bacteriophages P1 and P7 are antisense RNAs
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Schuster, 06-JUL-1990.
FEATURES       from  to/span     description
    pept        362      583     orfx
    pept        580  >   612     ant1 protein
    binding     346      354     orfx ribosomal binding site
    mut         278      278     c in P7c1.48ts; t in P7c4.2ts
    signal       46       74     promoter 51a
    signal       86      114     promoter 51b
    site         31       47     Op 51
    site        146      152     presumptive c4 target site a1
    site        160      167     presumptive c4 target site b1
    site        271      278     c4 element b'
    site        285      291     c4 element a'
    site        315      321     c4 target site a2
    site        344      351     c4 target site b2
BASE COUNT      147 a    158 c    128 g    179 t
ORIGIN      
        1 acaaataata tatctattgc acgacactaa tctgctctaa taaatttgta tttttaagtc
       61 acgaatgcta tcttttcgca tcatattgac cttttaatcg ttcaggctta tagttccgcc
      121 gtcgtagcaa attctgcgac cgggtttagc agcctgaatg attgtgcgga cagccgcaga
      181 tatccgatat tgcggtattt ttgtgtccgt aaaaccgcgt tacgcccaaa ttatggtggg
      241 gcgtgatggg gaggcttcgg cctgctggtt tcacaatcgc cagtctgcta accccgtcac
      301 gtcctgccac ctgtttagca gcgggtagca ggttgttaaa cctgattgtg aggccgtaac
      361 tatggttaat gccaatcctt gcgcacgcca agaattcatc tggcgtttct attcctgtaa
      421 aaaacaccac tatcacttcg tcatcgcagc aacagaagac gaagcacgct ctcaattgcc
      481 tgatggcccc tgcattttta ctgcccgttt ttcaactaac tcgcgcaatt cacttagtta
      541 ctggaacctc cccttctctg ccgacgttca ggggggttta tgaaaaaacc tctcgtcacc
      601 cgtaatgaca ta
//
LOCUS       PSEPGLAA     1986 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  P.solanacearum polygalacturonase A (PGA, pglA) gene, complete cds.
ACCESSION   M33692
KEYWORDS    alpha-1,4-endopolygalacturonase A.
SOURCE      P.solanacearum Race 1 (strain AW) DNA, clone pJE8.
  ORGANISM  Pseudomonas solanacearum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1986)
  AUTHORS   Huang,J. and Schell,M.A.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1715)
  AUTHORS   Huang,J. and Schell,M.A.
  TITLE     DNA sequence analysis of pglA and mechanism of export of its
            polygalacturonase product from Pseudomonas solanacearum
  JOURNAL   J. Bacteriol. 172, 3879-3887 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Schell, 13-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        100     1689     polygalacturonase A precursor (pglA)
    sigp        100      162     polygalacturonase A signal peptide
    matp        163     1686     polygalacturonase A
    mRNA         58  >  1986     pglA mRNA
    signal       27       59     promoter region
    signal       27       31     -30 Pseudomonas promoter consensus
    binding      89       92     pglA ribosomal binding site (put.)
BASE COUNT      351 a    685 c    616 g    334 t
ORIGIN      
        1 ctgcagcacg cccttcgcgc agacgaaggc gtgccgctgc ttcgctgatc ccggcgcatt
       61 gcgtcctcac aaaagacctc tcttttcgga gaattgacct tgaatcaccg ctacacactc
      121 ctcgcactcg ccgccgcagc cctgtctgcc ggagcgcacg caaccggtac gtccgtcacg
      181 gccccgtggg gcgaggtggc cgagcccagc ctgccggcgg attcggcggt ctgcaagacg
      241 ctgtcggcct cgatcacgcc gatcaaaggc tccgtcgatt cggtggacgg caaccccgcc
      301 aactcccagc cggacgcgag ccgcatccag tccgccatcg acaactgccc ggccggccag
      361 gccgtcaagc tcgtgaaggg cagcgccggc gaatccggct tcctgagcgg ctcgctgaag
      421 ctcaagtcgg gcgtgacgct gtggatcgac accggcgtga ccctgttcgc ctcgcgcaac
      481 ccggccgact acgacaatgg cctgggcacc tgcggcaccg ccaccaccag caacgacaag
      541 tcgtgcaacg cgctgatcgt ggcgcgcgat acggcgggca gcggcatcgt cggcgccggc
      601 gccatcgatg gccgcggcgg cagcctggtc accagcggcc cgaacgcgaa ccggctcacc
      661 tggtgggaca tcgcttacct gaacaagacc aaggggctga accagcagaa cccgcgcctg
      721 atccagacgt ataacggcag cgcgttcacg ctgtacggcg tgacagtgca gaactcgccc
      781 aacttccata tcgtcaccac cggcacgtcg ggcgtaacgg catggggcat caagatcgtg
      841 acgccgagcc tggcctacgc cgtggcgggc tacaagtgcc cgtccggctc cacgcccgac
      901 aaggtcacgc ccgcgacctg cttcacgccc gagacggtca agaacaccga cggcttcgac
      961 ccggggcaat cgaccaacgt ggtgctggcc tattcctaca tcaacacggg tgacgaccac
     1021 gtcgcggtca aggccagcag cggcccgacg cgcaacctgc tcttcgcgca caaccatttc
     1081 tactatgggc acggcctgtc gatcggcagc gaaaccaata ccggcgtgag caacatgctg
     1141 gtgaccgacc tgacgatgga cggcaacgac agctcggccg gcaacggcct gcgcatcaag
     1201 tccgatgcct cgcgcggcgg caaggtcacc aacatcgtct acgacggcat ctgcatgcgc
     1261 aacgtcaagg agccgctggt gttcgacccc ttctacagca gcgtgaaggg ttcgctctat
     1321 ccgaacttca ccaacatcgt cgtcaagaac ttccatgacc tgggcagcgc caagtccatc
     1381 aagcgcacga tgacgttcct gggctacaaa gccaacaagc agaagaaccc gctgacgatc
     1441 acgctcgaca acgtggtctt cgacggcacg ctgccggcct tcgaggggtc ccactacggc
     1501 gggccggctt cgcccaacgg ggtgcacttc accttcggcg gcaccggtcc ggtgagcttt
     1561 gccgacgcca tcgtcacgtc gtcgaccacc gatgtgacgg tgaccggcac gccgggcacg
     1621 gccgcggcgg tggattgcag caaggccttc gtgccgctga agtccgtggc accgacctcg
     1681 ccgatctgat cgggcgactc cgctcagcgg tacccgttct gccgcttgta ctggcggatg
     1741 gcttcgtcgt ggttggggtc ctcgtgtgca ttgtgcgcgc gctgatcttc gaactcacgg
     1801 atggcattgg cctgagcctc cgtcagtttg ggcagcggcg gcttgcgttg gggtgacggg
     1861 gtggtgctgt acgtcaccgg ctggctgtcg gcggagttcg tcttcgtagg cgatgtggtg
     1921 tccgcccttt tcaccgtggt agtcgttgcg cgcttcggtc tcgcggtcca gttgggtccg
     1981 ctgcag
//
LOCUS       RATCATHHA    1362 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Rat cathepsin H mRNA, complete cds.
ACCESSION   M36320 Y00708
KEYWORDS    cathepsin H; cysteine protease.
SOURCE      Rat (strain Wistar) kidney, cDNA to mRNA, clone lambda-H44.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1362)
  AUTHORS   Ishidoh,K.
  JOURNAL   Unpublished (1987) Tokyo Metro Inst of Med Sci,Tokyo 113, Japan
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 1362)
  AUTHORS   Ishidoh,K., Imajoh,S., Emori,Y., Ohno,D., Kawasaki,H., Minami,Y.,
            Kominami,E., Katunuma,N. and Suzuki,K.
  TITLE     Molecular cloning and sequencing of cDNA for rat cathepsin H
  JOURNAL   FEBS Lett. 226, 33-37 (1987)
  STANDARD  simple staff_entry
COMMENT        EMBL features not translated to GenBank features:
               key        from     to       description
            
               SITE        873    874       processing site
               SITE        211    213       pot. glycosylation site
               SITE        298    301       pot. glycosylation site
               SITE        685    688       pot. glycosylation site
FEATURES       from  to/span     description
    pept          4     1005     cathepsin H pre-pro-peptide
    sigp          4      342     cathepsin H signal peptide
    matp        343     1002     cathepsin H
    mRNA    <     1     1362     cathepsin H mRNA
    signal     1320     1325     poly A signal
BASE COUNT      369 a    341 c    343 g    309 t
ORIGIN      
        1 gcgatgtgga ctgcgctgcc cctgctgtgc gctggggcct ggctgctgag tgctggggcc
       61 accgctgagc tgaccgtgaa tgccatagaa aagtttcact ttacgtcatg gatgaaacag
      121 catcaaaaga cgtacagctc gagggagtac agccacaggc tgcaggtgtt tgccaacaac
      181 tggaggaaga ttcaagccca caaccagagg aaccacacat ttaaaatggg attgaaccag
      241 ttttcagata tgagctttgc cgaaataaaa cacaaatacc tctggtcaga gcctcagaat
      301 tgctcagcca ccaaaagtaa ctacctccgt ggtactggcc cctacccatc ctccatggac
      361 tggaggaaga aaggaaatgt cgtttcacca gtgaagaacc agggggcctg tggcagctgc
      421 tggactttct caaccactgg ggccctagag tcagctgtgg ctattgccag tgggaaaatg
      481 atgaccttgg ctgagcagca gctggtggat tgtgcccaga acttcaacaa tcatggctgc
      541 caaggaggtc tccccagcca ggccttcgag tacatcctgt acaacaaggg catcatggga
      601 gaggacagct acccttacat aggcaagaat ggtcagtgca aattcaaccc agaaaaggcc
      661 gtcgcgttcg tcaagaatgt tgtcaacatc acactcaatg atgaggctgc aatggtagag
      721 gctgtggccc tatacaatcc tgtgagcttt gcctttgagg tgactgaaga ttttatgatg
      781 tataaaagtg gtgtctactc cagtaactcc tgtcataaaa ctccagataa agtaaaccat
      841 gcagtcctgg ctgttggcta tggagaacag aatggattac tctactggat tgtgaaaaac
      901 tcttggggct ccaactgggg gaacaatggg tacttcctca ttgagcgtgg aaagaacatg
      961 tgtggcctgg ctgcctgtgc ctcctacccc atccctcagg tataagccac ggctgcacag
     1021 gccaactgct tggcagacaa agggaggaac tggtcctacg atgagaatgc cgccctggag
     1081 aaagttgttc agaaatccac ccagaggccc tctcactcct gagtctagac gcctaaagac
     1141 aagtaaggaa gaacttgacc agcaacaagc ccgcccatgt gacgacatca ccagccatac
     1201 gctttgtttg aatatggttt ttaaatgacc caaaaccatg tggacctaga atcttctctt
     1261 ttcagctctc ttcatatagg gaaagctccg atggcttacc ttttctatgt tgtatattca
     1321 ataaatgcag tgaacgcctg cctgggctgg gcctagaccc ct
//
LOCUS       RATDRTKCA     471 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Rat delayed-rectifier-type K+ channel mRNA, complete cds.
ACCESSION   M36461
KEYWORDS    delayed-rectifier-type K+ channel.
SOURCE      Rat neonatal heart and ovariectomized, diethylstilbestrol-primed
            uterus, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 471)
  AUTHORS   Folander,K., Smith,J.S., Antanavage,J., Bennett,C., Stein,R.B. and
            Swanson,R.
  TITLE     Cloning and expression of the delayed-rectifier I-s-K channel from
            neonatal rat heart and diethylstilbestrol-primed rat uterus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2975-2979 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         41      433     delayed-rectifier-type K+ channel
BASE COUNT      110 a    132 c    122 g    107 t
ORIGIN      
        1 agagttttgc tccacatcag ggaaaccgtg aagccccagg atggccctgt ccaattccac
       61 aactgttctg ccttttctgg cctcgctgtg gcaggaaaca gatgagccgg gtggcaatat
      121 gtcggcggac ttggctcgta ggtcccagct ccgagatgac agcaagctgg aggctctcta
      181 tatcctcatg gtgctgggtt tcttcggctt cttcaccctg ggcatcatgc tgagttacat
      241 ccgatccaag aagctggaac actcgcacga ccctttcaac gtgtacatcg agtcggacgc
      301 ctggcaggag aaaggcaagg ccctcttcca ggcccgcgtt ctggagagct tcagagcttg
      361 ctatgtcatt gaaaaccagg cggccgtaga acaacctgcc acacaccttc ctgaactgaa
      421 gccactgtca tgaaccccat agttaattaa tagacaagtg ataagtgggt c
//
LOCUS       RATPAPA      1603 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Rat prostatic acid phosphatase (rPAP) mRNA, complete cds.
ACCESSION   M32397
KEYWORDS    prostatic acid phosphatase.
SOURCE      Rat (strain Sprague-Dawley) prostate, cDNA to mRNA, clone
            rPAP-[145,343A].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1603)
  AUTHORS   Roiko,K., Jaenne,O.A. and Vihko,P.
  TITLE     Primary structure of rat secretory acid phosphatase and comparison
            to other acid phosphatases
  JOURNAL   Gene 89, 223-229 (1900)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Vihko, 02-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         41     1186     prostatic acid phosphatase precursor (rPAP; EC
                                 3.1.3.2)
    sigp         41      133     prostatic acid phosphatase signal peptide
    matp        134     1183     prostatic acid phosphatase
    mRNA    <     1     1603     rPAP mRNA
    signal     1583     1588     polyA signal
BASE COUNT      434 a    414 c    387 g    368 t
ORIGIN      
        1 cacctcctac agctgccaca agcagctctc ttcccccacc atgagagctg tccctctgca
       61 cctcgtcggg acagcaagcc tcacccttgg cttcttgctc ctgctatctc tccgcctgga
      121 cccaggccaa gccaaggagt tgaagtttgt gacattggtg ttccggcatg gagaccgagg
      181 tcccatcgag acctttccta atgaccccat taaggaatcc tcgtggccac aaggatttgg
      241 ccaactcacc aagtggggca tgggacagca ctacgaactc ggaagttata taaggagaag
      301 atacgggaga ttcttgaaca actcctataa acatgaccag gtttatatcc gaagcacaga
      361 tgttgacagg actctgatga gcgctatgac aaacctcgca gccctgtttc cccctgaggg
      421 gatcagcatc tggaatccca gactgctctg gcagcccatc ccagtgcaca ccgtgtctct
      481 ctctgaggat cggttgctat acctgccttt cagggactgt cctcgctttc aagaactcaa
      541 gagtgagact ttaaaatctg aggagttcct gaagaggctt caaccatata aaagcttcat
      601 agacaccttg ccatcgctgt cgggattcga ggaccaggat ctttttgaaa tctggagtag
      661 gctttacgac cctttatatt gcgagagtgt tcacaatttc accttccgca cctgggccac
      721 agaggacgcc atgactaagt tgaaggagtt gtcagaatta tctctgttat ctctttatgg
      781 aattcacaag cagaaagaga aatctagact ccaggggggc gtcctggtca atgaaattct
      841 caagaacatg aagcttgcaa ctcaaccaca gaaggccagg aagttgatca tgtattctgc
      901 atatgacact actgtgagtg gcctgcagat ggcgctagag ctttataatg gacttctacc
      961 tccctacgct tcctgccaca taatggaatt gtaccaggat aatgggggga ccttcgtgga
     1021 gatgtactac cggaatgaga cccagaacga gccctaccca ctcacgctgc cgggctgtac
     1081 ccacagctgc cctctggaga agtttgcaga gctactggac cccgtgatcc cccaggactg
     1141 ggccacagag tgtatgggca caagcaacca ccaagcgtcg ctgtaaaaga acaaccagtt
     1201 gccagtgtaa tcaggaaaag agttaggtca aacttgtggg tctgccgggg atccatgatt
     1261 agcacaaaga actctattgc tgaaggattc gaagattttg cttgagcggg tgagtaaagc
     1321 tgccctggga atgtctgatg taaaaacagg tttcagacga agcctaccaa gttagcgtgc
     1381 aagcatgaaa aagaagacac ttatcccagg ctaatgggca aaggacctga ttcacctgac
     1441 aacaacgctc acctgttccc ctgccttctc tcacgaggca aaaagatgag acaggaggac
     1501 acctggagga gttttccata cccagcctag aacgatcaga ttttatatgt atgtcaaaag
     1561 acgctgggta gcagaactga ataataaaag cctcttggtg tcc
//
LOCUS       RHLNIFHA      524 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  R.leguminosarum nitrogen fixation protein nifH gene, 5' end.
ACCESSION   M36435
KEYWORDS    nitrogen fixation protein.
SOURCE      R.leguminosarum DNA, clone pGB5.
  ORGANISM  Rhizobium leguminosarum
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae.
REFERENCE   1  (bases 1 to 524)
  AUTHORS   Roelvink,P.W., Harmsen,M., van Kammen,A. and van den Bos,R.C.
  TITLE     The nifH promoter region of Rhizobium leguminosarum: Nucleotide
            sequence and promoter elements controlling activation by NifA
            protein
  JOURNAL   Gene 87, 31-36 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        435  >   524     nifH protein
    mRNA        364  >   524     nifH mRNA
    binding     420      426     nifH ribosomal binding site
BASE COUNT      124 a    139 c    137 g    124 t
ORIGIN      
        1 gttaacccta tgcgggcacc agacgcggca tggcgcttgt catcgccgga gacagcgctt
       61 tgcggcaagt gatcagaatt tgttacggcc ggtctggtga tggctgctct aacaaagtcc
      121 tagaagctac gcatggataa atgagacatc ctgccggttg gtcagatatt gcacaaacgg
      181 ccaccattca ttggccctca gagggtgaca atctatttcg acattgtcat cacctttgtc
      241 ggttacttga caagcgtttg tgttctcaat cccgtatttt acgcccacat agctggaatc
      301 gcggcgaatt attgttcgtc agctcaatcg gccgcttggc acgaatcttg agagctattg
      361 agaggcagcg gaacggccgc cgcattccgt tgcgggtaac caaattgctt cgaacacatg
      421 aaggaacgcc aagcatggca gctctgcgtc agatcgcatt ctatggaaaa ggcggaattg
      481 gcaagtccac tacgtcccaa aacacgctgg ccgcccttgt cgac
//
LOCUS       STYAHPCFA    2608 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  S.typhimurium alkyl hydroperoxide reductase (ahpCF) gene, complete
            cds.
ACCESSION   J05478
KEYWORDS    alkyl hydroperoxide reductase; oxidoreductase.
SOURCE      S.typhimurium (strain TN1379) DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 2608)
  AUTHORS   Tartaglia,L.A., Storz,G., Brodsky,M.H., Lai,A. and Ames,B.N.
  TITLE     Alkyl hydroperoxide reductase from Salmonella typhimurium: Sequence
            and homology to thioresoxin reductase and other flavoprotein
            disulfide oxidoreductases
  JOURNAL   J. Biol. Chem. 265, 10535-10540 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Tartaglia, 05-JUN-1990.
FEATURES       from  to/span     description
    pept        166      720     alkyl hydroperoxide reductase C22 protein
                                 (aphC)
    pept        969     2534     alkyl hydroperoxide reductase F52a protein
                                 (aphF)
    mRNA        142  >  2608     ahpCF mRNA
    binding     155      160     aphC ribosomal binding site (put.)
    binding     959      964     aphF ribosomal binding site (put.)
    signal      105      110     ahpCF -35 region
    signal      129      134     ahpCF -10 region
BASE COUNT      652 a    655 c    724 g    577 t
ORIGIN      13 minutes on S.typhimurium map.
        1 tcaggcggcc tttttacttt agatggctga cagggcgcag tgttcacaat gtcgcccgaa
       61 tcgggttgtt agttaacgct tattgatttg ataatggaaa cgcattagcc gaatcagcaa
      121 aatttcgtta acttactcct caacgaaaac acggaggaag tatagatgtc cttaattaac
      181 accaaaatca aacctttcaa aaaccaggcg ttcaaaaacg gtgaattcat tgaagtcacc
      241 gagaaagata ccgaaggccg ctggagcgtc ttcttcttct acccggccga ttttaccttt
      301 gtttgcccga ctgaactggg tgacgttgcc gaccattacg aagaactgca gaagctgggc
      361 gtagacgttt attccgtctc aaccgatact cacttcacgc acaaagcatg gcacagcagc
      421 tctgaaacta tcgcaaaaat caaatatgcg atgatcggcg acccgactgg cgccctgacc
      481 cgtaacttcg acaacatgcg tgaagatgaa ggtctggcag acgtgcgacc tttgttgttg
      541 acccgcaggg tatcatccag gcgatcgaag ttaccgctga aggtatcggc cgtgacgcgt
      601 ctgacctgct gcgtaaaatt aaagcagcac agtacgttgc cgctcaccca ggcgaagtat
      661 gcccggcgaa atggaaagaa ggcgaacgac tctggctcca tccttagacc tggtcggtaa
      721 aatctaaatt tccgtcgtct ttcacgccat agctgcgttg gcgtcgcctg ctcaccccgg
      781 tcacttactt cagtaagctc ccggggattc tcaggctagc cgccttactc tgacgtgaaa
      841 tactttggaa attgtttcag tcatgggcgc acccgcgccc atttcatttc agcaccatga
      901 cgcaagttgc attaatgcag tcccgcgaag tcggcttgca tgatgatgtt ttcagagagg
      961 gaagaagaat gctcgacaca aatatgaaaa cccagctcag ggcttacctt gagaaactga
     1021 ccaaacctgt tgagctgatt gccacgctgg atgacagcgc taaatcggcg gaaatcaagg
     1081 aactgctggc tgaaatcgcc gagctgtccg acaaagtcac gtttaaagaa gacaatactc
     1141 tgccggtgcg caagccgtct ttcctgatta cgaatccagg ttctcagcaa gggccgcgct
     1201 ttgccggttc tccgctgggc cacgaattta cctcgctggt tttggcgctg ctgtggaccg
     1261 gcggtcatcc gtcaaaagaa gcgcagtcgc tgctggagca gattcgcgat attgacggcg
     1321 attttgaatt cgaaacttat tattcactct cctgccataa ctgcccggac gtagtgcagg
     1381 cgctgaacct gatggcggtg ctgaacccgc gtattaaaca tacggcaatt gatggcggca
     1441 ccttccagaa cgaaatcacc gaacgtaacg tgatgggcgt tccggcggta tttgttaacg
     1501 gtaaagagtt cggccaggga cgtatgactc tgaccgagat tgtcgctaaa gtggatactg
     1561 gcgcagaaaa acgcgcggcg gaagcgctga acaaacgcga tgcgtatgac gtactgattg
     1621 tcggctccgg cccggcgggc gcggcggcgg cggtctactc ggcgcgcaaa ggcatccgta
     1681 ccggtctgat gggcgagcgc ttcggcggtc aggtgctgga taccgtggat atcgaaaact
     1741 atatctcggt gccgaaaacc gaaggccaga aactggcggg cgcgctgaaa gcgcatgtca
     1801 gcgattacga cgttgacgtt atcgacagcc agagcgcaag taaactggtt ccggccgcaa
     1861 cagaaggcgg cttgcatcag attgaaacgg cgtccggcgc agtattgaaa gcgcgtagta
     1921 tcatcatcgc caccggcgcg aaatggcgca atatgaatgt gccgggcgag gatcagtatc
     1981 gcaccaaagg cgtcacctat tgtccgcact gcgacggtcc gttatttaaa gggaagcgtg
     2041 tggcggtcat cggcggcggc aactctggcg ttgaagcggc tatcgatctg gcaggtattg
     2101 tagaacatgt tacgttgctg gagttcgcgc cggagatgaa ggcggatcag gtgctacagg
     2161 acaaagtgcg tagtctgaaa aatgtcgata tcattttgaa tgcgcaaacg acagaagtga
     2221 agggcgacgg ttcgaaagtg gtgggactgg agtatcgtga ccgtgtcagc ggcgatattc
     2281 atagcgtcgc gctggcaggg atttttgtgc agattggtct gttgccgaat acccactggc
     2341 tggaaggcgc gcttgagcgt aaccgcatgg gcgagatcat tattgacgcc aaatgtgaaa
     2401 ccagcgttaa gggcgtcttc gccgcaggcg attgcaccac cgttccttac aaacagatta
     2461 tcatcgccac gggcgaaggg gcgaaagcct cgttgagcgc ctttgattat ctgattcgca
     2521 ccaaaatcgc ataaaaagaa gtaagcacac ctgcaaggcc agttaactgg ccatcgtaaa
     2581 tggccgatag tgtaagctat tcgggcct
//
LOCUS       TMGCG        6355 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Tobacco mild green mosaic virus complete genome.
ACCESSION   M34077 M22483
KEYWORDS    coat protein.
SOURCE      Tobacco mild green mosaic virus (strain U2-TMV), cDNA to viral RNA,
            from N.tabacum cv. Samsum.
  ORGANISM  Tobacco mild green mosaic virus
            Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; 
            Tobamovirus.
REFERENCE   1  (bases 6127 to 6355)
  AUTHORS   Garcia-Arenal,F.
  TITLE     Sequence and structure at the genome 3' end of the U2-strain of
            Tobacco mosaic virus, a histidine-accepting tobamovirus
  JOURNAL   Virology 167, 201-206 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 6355)
  AUTHORS   Solis,I. and Garcia-Arenal,F.
  TITLE     The complete nucleotide sequence of the genomic RNA of the
            tobamovirus tobacco mild green mosaic virus
  JOURNAL   Virology 177, 553-558 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Garcia-Arenal, 10-FEB-1989.  The RNA appears to have a tRNA-
            like, L-shaped structure at the 3' terminus, linked to a quasi-
            continuous double-helical stalk, with five pseudoknots involved in
            the formation of the whole structure.  However, the structure of
            U2-TMV RNA is less stringently conserved than the 3' termini of
            "vulgare" and other histidine-accepting tobamoviruses.
            
            Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Garcia-Arenal, 08-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept         71     3406     126 kDa protein
    pept         71     3403     183 kDa protein
               3407     4900     183 kDa protein
    pept       4890     5660     28.5 kDa protein
    pept       5666     6145     coat protein
BASE COUNT     1933 a   1112 c   1489 g   1821 t
ORIGIN      
        1 gatgttttaa tagttttcga caacaacaat taaaacaaaa acaacatatt acaaacaaca
       61 aacaacaaca atggcacaca tacaatctat aattagcaac gcccttcttg aaagcgtgag
      121 tggtaaaaac actctcgtta atgaccttgc aagaaggcgc atgtacgata cggccgtgga
      181 agaatttaac gcccgcgacc gtagaccaaa ggtcaacttt tccaaaacta ttagcgaaga
      241 gcaaacgctt ctagtctcca acgcgtaccc ggagttccag attacctttt ataatactca
      301 aaatgccgta cacagtttgg ctggaggttt gagagcatta gaattggaat atctgatgct
      361 acaagttccc tatggatcgc cgacatatga tataggtggg aactttgcag cacatttgtt
      421 caaaggcagg gattacgtgc attgctgtat gcccaatctg gacatacgag atataatgag
      481 gcacgaagga caaaaggact caattgagat gtatttgtcc agattgtctc gttctaacaa
      541 ggtaattcct gagtttcaaa gggaggcttt taacaggtat gcagaagctc ccaacgaagt
      601 ctgctgctct aaaacttttc aggattgtcg aatacatccg ccagagaata gtggtagaag
      661 atacgctgtt gctctgcaca gtttgtatga tattcctgtg catgagtttg gagctgcgtt
      721 aatatctaag aatatacatg tatgttatgc agcttccatt ttggcagaag cattattact
      781 agaccagacg gaggttacgc ttaatgaaat aggcgcaact ttcaaaagag aaggtgatga
      841 tgtttctttt ttctttgctg atgaaagtac tttaaattat agtcataaat acaaaaatat
      901 cttgcattat gtagttaaat cttactttcc tgcttctagt agaatagttt actttaagga
      961 atttttagtc actagggtta atacttggtt ttgtaaattt accaaagtag atacctatat
     1021 tctgtacaag agtgttagac aagtagggtg tgatagtgat cagttctatg aggcgatgga
     1081 agacgccttt gcttacaaga aaaccttggc catgttcaac actgaaagag caatctttag
     1141 agacacggct tcggttaact tttggttccc taagatgaag gacatggtga tagtaccgct
     1201 gtttgagggt tctattacca gcaaaaagat gacaaggagt gaggtcattg ttaatcgtga
     1261 cttcgtttac acagtgctta atcatatcag aacatatcaa gccaaagcgt taacttacca
     1321 gaacgtatta tctttcgtgg agtctataag atcccgcgtg ataatcaatg gtgttactgc
     1381 taggtctgaa tgggatgtag ataaagcaat tcttcaaccc ttgtcaatga ctttcttctt
     1441 gcagactaag ctggctgcgc ttcaagacga tatagtaatg ggaaagtttc ggtgcttgga
     1501 taagaccact tctgaactta tttgggatga ggtgggcaaa ttttttggaa acgttttccc
     1561 cactatcaaa gagagattgg tgagcaggaa aattctggat gtaagtgaga atgctctgaa
     1621 gatcaagatc ccagatctgt atgtcacatg gaaagacagg ttcgtagctg aatacaccaa
     1681 gtctgaggag ttaccgcatc tagatatcaa gaaggactta gaagaagctg agcaaatgta
     1741 cgacgcgtta tcagaattat ctatccttaa gggtgctgat aatttcgata tcgcgaagtt
     1801 caaagacatg tgcaaggctt tagatgttag tcctgatgtg gcagcacgag taatcgttgc
     1861 agtggccgag aatagaagcg gtttaactct tacttttgat aagccaaccg aggagaatgt
     1921 ggctaaggct cttaaaagca cggcgtctga ggccgtggta tgtcttgaac cgacatccga
     1981 agaggtgaac gtaaataaat tttctattgc tgagaaaggg agattgcctg tgtgtgcaga
     2041 aagtcatggt ttgacgaatg ctaacttaga gcaccaggag ttggagtccc tcaacgattt
     2101 ccataaggct tgcgtggata gtgtgattac aaagcaaatg gcatcggttg tctacactgg
     2161 ctcactcaaa gttcaacaaa tgaagaacta tgtggacagt ttggcagctt cgttgtccgc
     2221 cactgtatca aatctatgca agtcactaaa ggatgaagtc gggtatgatt ctgattccag
     2281 ggagaaagtt ggtgtttggg atgtcacttt gaaaaagtgg ctcctcaaac ctgcggccaa
     2341 aggtcattca tggggagttg tcctggatta caaggggaaa atgtttactg cacttctatc
     2401 ttatgaagga gatagaatgg tgactgagag cgactggagg agggtggctg tatcatctga
     2461 tacaatggta tattctgata ttgcaaagct ccaaaatctg aggaaaacaa tgagagacgg
     2521 tgaaccccac gaacctactg caaagatggt acttgtggat ggggtgcctg gttgtggaaa
     2581 gtacaaagga gattttgaaa gatttgatct tgatgaggat ttgatcttgg ttcctggaaa
     2641 acaagctgct gctatgatca gaagaagggc taattcatct ggactgataa gagccacaat
     2701 ggacaatgtg agaacggtag attcacttct aatgcatcca aaaccgcgat cacacaagag
     2761 gctttttatt gatgaagggt tgatgctgca caccggttgt gttaacttcc tggtgcttat
     2821 ctctggttgc gacatcgcat acatttacgg agatacacag cagattcctt tcattaacag
     2881 agttcagaat ttcccgtatc ccaaacattt tgagaagctg caagtggatg aagttgagat
     2941 gaggaggacc acactgagat gcccaggtga tgtgaatttt ttcctacaat cgaagtacga
     3001 aggagcggtg acaaccactt caactgtaca acgatcggtc tcatctgaga tgataggcgg
     3061 taagggagta ctaaacagtg tttccaaacc actaaaaggg aaaattgtaa ctttcactca
     3121 ggctgataaa tttgagttag aggagaaggg ctataagaat gtgaacaccg ttcatgagat
     3181 ccaaggagaa acctttgaag atgtgtcgct ggtcagattg acggcaactc cactgactct
     3241 gatttccaag tcttccccgc atgttctagt cgctctgact agacacacaa agagcttcaa
     3301 atattacacc gtagtgttag atcctttagt acagataatt agtgatttgt cttctttaag
     3361 ctccttcctt ttagaaatgt atatggtaga agcaggtagt agatagcaat tacagatgga
     3421 tgcagtgttc aaaggtcata atctctttgt ggcaacacct aaatcaggag actttccaga
     3481 tctacagttc tattacgatg tatgcctccc tggtaatagt actatactta acaagtatga
     3541 tgctgttacc atgaggttac gtgataatag tcttaatgtg aaggattgtg ttcttgattt
     3601 ttccaaaagt attccgatgc caaaggaggt gaaaccatgt ctagagccag ttttgcgtac
     3661 cgcggcggaa ccgccaaggg ctgcaggact actcgaaaat ctggttgcaa tgattaaaag
     3721 aaatttcaac gcaccagacc tgacggggac gattgacatt gagagcaccg catctgttgt
     3781 agtagataag ttttttgata gctattttat taaaaaagaa aaatacacaa aaaatattgc
     3841 tggagtgatg acgaaggatt caatgatgag atggttggaa aacaggaaag aagtactatt
     3901 ggacgacttg gctaactaca attttacaga tctgccggcc atcgatcagt acaagcacat
     3961 gatcaaggct caaccaaaac agaaattgga cctttcaatt cagaatgaat accctgctct
     4021 gcaaacaatt gtctaccatt cgaagcagat caacggtatt ttggccggtt tctcagagct
     4081 tacaaggttg ctgctcgagg catttgattc taagaagttt cttttcttta ctaggaaaac
     4141 tccagaacag attcaagaat ttttctcgga tctcgactcg cacgttccta tggatgtgtt
     4201 agaactggat atttctaagt atgataagtc acagaacgag tttcattgtg ctgtagagta
     4261 tgaaatatgg aaaagattgg gtctcaatga gtttttggcc gaagtgtgga aacaagggca
     4321 caggaaaaca actttgaagg attacattgc tggaatcaag acatgtctgt ggtatcaaag
     4381 gaaaagcggt gatgtgacta ctttcatcgg caatactgtt ataatagcag cttgcttggg
     4441 ttcaatgtta ccgatggaaa aggtcataaa aggtgctttt tgtggagacg attccgtttt
     4501 gtattttcca aagggtttgg atttccctga cattcagtca tgtgctaatc tcatgtggaa
     4561 ttttgaggcc aaactgtata gaaagaggta cggttacttt tgtggtagat acatcataca
     4621 ccatgataag ggagcaatag tgtattatga tcctttgaag ttgatctcca aacttggggc
     4681 aaaacatatc aaggattatg atcacttaga agagttaagg gtgtctttgt gcgatgttgc
     4741 ttgttcgctc ggaaactggt gcttaggctt tccgcagctg aacgcagcta tcaaggaggt
     4801 tcataaaacc gcgattgatg gttcgtttgc ttttaattgt gttaacaaat ttttgtgtga
     4861 taaattttta tttagaactt tgtttttaaa tggctgttag tctcagagat actgtcaaaa
     4921 ttagcgagtt cattgatctt tcgaaacagg atgagatact tccggcattc atgactaagg
     4981 tcaagagtgt tagaatatcg actgtggaca agattatggc tgttaagaat gatagtcttt
     5041 ctgatgtaga tttacttaaa ggtgttaagt tagttaagaa agggtatgtg tgcttagctg
     5101 atttggtagt gtctggggag tggaatctcc cggataactg ccgtggtggt gtcagtgttt
     5161 gtattgtaga taagagaatg aaaaggagta aggaagcaac gctgggtgcg tatcacgccc
     5221 ctgcttgcaa aaagaatttt tcttttaagc taatccctaa ttattcaata acatccgagg
     5281 atgctgagaa gcacccgtgg caagtgttag tgaatatcaa aggagtggct atggaagaag
     5341 gatactgtcc tttatctttg gagttcgttt caatttgtgt agtacataaa aataatgtaa
     5401 gaaaaggttt gagggaacgt attttgagtg tgacagacgg ctcgccaatt gaactcactg
     5461 aaaaggttgt tgaggagttc gtggatgaag taccaatggc tgtgaaactc gaaaaggttc
     5521 cggaaaacaa aaaagaaatg gtaggtaata atgttaataa taagaaaata aataacagtg
     5581 gtaagaaggg ttttaaaatt gaggaaattg aggataatgt aagtgatgac gagtctatcg
     5641 cgtcatcgag tacgttttaa tcaatatgcc ttatacaatc aactctccga gccaatttgt
     5701 ttacttatct tccgcttacg cagatcctgt gcagctgatc aatctgtgta caaatgcatt
     5761 gggtaaccag tttcaaacgc aacaagctag gacaacagtc caacagcaat ttgcggatgc
     5821 ctggaaacct gtgcctagta tgacagtgag atttcctgca tcggatttct atgtgtatag
     5881 atataattcg acgcttgatc cgttgatcac ggcgttatta aatagctttg atactagaaa
     5941 tagaataata gaggttgata atcaacccgc accgaatact actgaaatcg ttaacgcgac
     6001 tcagagggta gacgatgcta ctgtagctat aagggcttca atcaataatt tggctaatga
     6061 actggttcgt ggaactggca tgttcaatca agcaggcttt gagactgcta gtggacttgt
     6121 ctggaccaca actccggcta cttagctatt gttgtgagat ttcctaaaat aaagtcgctg
     6181 aagacttaaa attcagggtg gctgatacca aaatcagcag tggttgttcg tccacttaaa
     6241 tataacgatt gtcatatctg gatccaacag ttaaaccatg tgatggtgta tactgtggta
     6301 tggcgtaaaa catcggagag gttcgaatcc tcccctaacc gccggtagcg gccca
//
LOCUS       TRNTNPIA     1333 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  Transposon Tn21 integrase (tnpI) gene, complete cds, and
            adenyltransferase (aadA), 5' end.
ACCESSION   M33633
KEYWORDS    integrase.
SOURCE      Transposon Tn21 DNA, clone pMON90.
  ORGANISM  Transposon Tn21
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1333)
  AUTHORS   Mercier,J., Lachapelle,J., Couture,F., Lafond,M., Vezina,G.,
            Boissinot,M. and Levesque,R.C.
  TITLE     Structural and functional characterization of tnpI a recombinase
            locus in Tn21 and related transposons
  JOURNAL   J. Bacteriol. 172, 3745-3757 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.C.Levesque, 06-APR-1990, for release after publication.
FEATURES       from  to/span     description
    ORF         209      613     potential ORF1
    ORF         665     1060     potential ORF2
    ORF        1150      137 (c) potential ORF3
    ORF        1298  >  1333     adenyltransferase (aadA)
    site       1288     1291     integration site GTTA
BASE COUNT      286 a    396 c    408 g    243 t
ORIGIN      
        1 ggatccatca ggcaacgacg ggctgctgcc ggccatcagc ggacgcaggg aggactttcc
       61 gcaaccggcc gttcgatgcg gcaccgatgg ccttcgcgca ggggtagtga atccgccagg
      121 attgacttgc gctgccctac ctctcactag tgaggggcgg cagcgcatca agcggtgagc
      181 gcactccggc accgccaact ttcagcacat gcgtgtaaat catcgtcgta gagacgtcgg
      241 aatggccgag cagatcctgc acggttcgaa tgtcgtaacc gctgcggagc aaggccgtcg
      301 cgaacgagtg gcggagggtg tgcggtgtgg cgggcttcgt gatgcctgct tgttctacgg
      361 cacgtttgaa ggcgcgctga aaggtctggt catacatgtg atggcgacgc acgacaccgc
      421 tccgtggatc ggtcgaatgc gtgtgctgcg caaaaaccca gaaccacggc caggaatgcc
      481 cggcgcgcgg atacttccgc tcaagggcgt cgggaagcgc aacgccgctg cggccctcgg
      541 cctggtcctt cagccaccat gcccgtgcac gcgacacgtg ctcgcgcagg ctgggtgcca
      601 agctctcggg taacatcaag gcccgatcct tggagccctt gccctcccgc acgatgatcg
      661 tgccgtgatc gaaatccaga tccttgaccc gcagttgcaa accctcactg atccgcatgc
      721 ccgttccata cagaagctgg gcgaacaaac gatgctcgcc ttccagaaaa ccgaggatgc
      781 gaaccacttc atccggggtc agcaccaccg gcaagcgccg cgacggccga ggtcttccga
      841 tctcctgaag ccagggcaga tccgtgcaca gcaccttgcc gtagaagaac agcaaggccg
      901 ccaatgcctg acgatgcgtg gagaccgaaa ccttgcgctc gttcgccagc caggacagaa
      961 atgcctcgac ttcgctgctg cccaaggttg ccgggtgacg cacaccgtgg aaacggatga
     1021 aggcacgaac ccagtggaca taagcctgtt cggttcgtaa gctgtaatgc aagtagcgta
     1081 tgcgctcacg caactggtcc agaaccttga ccgaacgcag cggtggtaac ggcgcagtgg
     1141 cggttttcat ggcttgttat gactgttttt tggggtacag tctatgcctc gggcatccaa
     1201 gcagcaagcg cgttacgccg tgggtcgatg tttgatgtta tggagcagca acgatgttac
     1261 gcagcagggc agtcgcccta aaacaaagtt aaacatcatg agggaagcgg tgatcgccga
     1321 agtatcgact caa
//
LOCUS       TTERRDA      1504 bp ss-rRNA            RNA       14-AUG-1990
DEFINITION  T.tenax 16S rRNA.
ACCESSION   M35966
KEYWORDS    16S ribosomal RNA.
SOURCE      T.tenax RNA.
  ORGANISM  Thermoproteus tenax
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Thermoproteales; Thermoproteaceae.
REFERENCE   1  (bases 1 to 1504)
  AUTHORS   Leinfelder,W., Jarsch,M. and Bock,A.
  TITLE     The phylogenetic position of the sulfur-dependent archaebacterium
            Thermoproteus tenax: Sequence of the 16S rRNA gene
  JOURNAL   Syst. Appl. Microbiol. 6, 164-170 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    rRNA          1     1504     16S rRNA
BASE COUNT      297 a    448 c    563 g    196 t
ORIGIN      
        1 aaaccggttg atcctgccgg acctgaccgc tatcggggtg gggctaagcc atgcgagtcg
       61 cgcgcccggg gcgccgggcg cggcgcacgg ctcagtaaca cgtacccaac ctaacctcgg
      121 gagggggaca accccgggaa actggggctg atcccccata ggggaagggc gctggaaggc
      181 cccttcctcc aaagggatcg cgggcgatct cccgcggtcc gcccgagggt gggggtacgg
      241 cccatcaggt tgttggcggg gtaacggccc gccaagccga agacgggtag gggcggtgag
      301 agccgtgagc cccgagatgg gcactgagac aagggcccag gccctacggg gtgcagcagg
      361 cgcgaatact ccgcaatgcg ggcaaccgcg acggggccac cccgagtgcc gggcgaagag
      421 cccggctttt gcccggtgta aggagccggg cgaataagcg gggggtaagt ctggtgtcag
      481 ccgccgcggt aataccagcc ccgcgagtgg tcagggtgat tactgggctt aaagcgcccg
      541 tagccggccc ggcaagtcgc tcctgaaatc cccaggctca acctgggggc agggggcgat
      601 actgccgggc tagggggcgg gagaggccgc cggtactccg ggggtagggg cgaaatccta
      661 taatccccgg aggaccacca gtggcgaaag cgggcggcca gaacgcgccc gacggtgagg
      721 ggcgaaagcc gggggagcaa aggggattag atacccctgt agtcccggcc gtaaacgatg
      781 cgggctagct gtcggccggg cttagggccc ggccggtggc gtagggaaac cgttaagccc
      841 gccgcctggg gagtacggcc gcaaggctga aacttaaagg aattggcggg ggggcaccac
      901 aaggggtgaa gcttgcggct taattggagt caacgccgga aaccttaccc ggggcgacag
      961 caggatgaag gccaggctaa cgaccttgcc ggacgagctg agaggaggtg catggccgtc
     1021 gtcagctcgt gccgtgaggt gtccggttaa gtccggcaac gagcgagacc cccaccccta
     1081 gttgctaccc cgctcttcgg ggcggggggc acactagggg gactgccggc gtaagccgga
     1141 ggaaggaggg ggcgacggca ggtcagtatg ccccgaaacc ccggggctgc acgcgagctg
     1201 caatggcggg gacagcggga tccgaccccg aaagggggag gcaatcccgt aaaccccgcc
     1261 ccagtaggga tcgagggctg caactcgccc tcgtgaacgt ggaatcccta gtaaccgcgt
     1321 gtcaccaacg cgcggtgaat acgtccctgc cccttgcaca caccgcccgt cgcaccaccc
     1381 gagggagttc tctgcgaggc ccctcgcttg gggcaaccca ggtgggggga cgagcagaga
     1441 actcccgagg ggggtgaagt cgtaacaagg tagccgtagg ggaacctgcg gttggatcac
     1501 ctcc
//
LOCUS       VACPOLRDA   15212 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Vaccinia virus RNA polymerase (rpo30), polyA polymerase large
            subunit and DNA polymerase genes, complete cds.
ACCESSION   M36339
KEYWORDS    DNA polymerase; RNA polymerase; polyA polymerase large subunit.
SOURCE      Vaccinia virus (strain WR) DNA.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 15212)
  AUTHORS   Gershon,P., Jones,E.V., Moss,B. and Ahn,B.Y.
  TITLE     Eukaryotic transcriptional elongation factor homolog is an integral
            subunit of the vaccinia virus encoded RNA polymerase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 2041 to 8280)
  AUTHORS   Gershon,P., Jones,E.V., Moss,B. and Ahn,B.Y.
  TITLE     Eukaryotic transcriptional elongation factor homolog is an integral
            subunit of the vaccinia virus encoded RNA polymerase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Authorin copy for [1] kindly submitted by B.Y.Ahn, 10-JUL-1990.
               National Institutes of Health
               NIAID/LVD
               9000 Rockville Pike
               Bethesda, Maryland 20892
FEATURES       from  to/span     description
    pept       1621      182 (c) polyA polymerase large subunit
    pept       3831     1618 (c) ORF1
    pept       4530     3958 (c) ORF2
    pept       5365     4586 (c) RNA polymerase subunit (rpo30)
    pept       5414     6439     ORF3
    pept       6576     8279     ORF4
    pept       8361     8861     ORF5
    pept       8986     9807     ORF6
    pept      12834     9814 (c) DNA polymerase
    pept      12866    13153     ORF7
    pept      13537    13148 (c) ORF8
BASE COUNT     4992 a   2561 c   2400 g   5259 t
ORIGIN      
        1 aagcttcctg cgtactcaaa gtagataaac cctcatcgcc cgcgtgtgag agaagacctt
       61 cgtccccgtc cagatgcgag agaatgaata accctggaaa acaagttccg tttatgagga
      121 cggacatgct acaaaatatg ttcgcggcta atcgcgacaa cgtggcgtcg agacttttga
      181 actaaaatac aattatatcc ttttcgatat taataaatcc gtgtcgtcca ggttttttat
      241 ctctttcagt atgtgaatag ataggtattt tatctctatt catcatcgaa tttaagagat
      301 ccgataaaca ttgtttgtat tctccagatg tcagcatctg atacaacaat atatgtgcac
      361 ataaacctct ggcacttatt tcatgtacct tccccttatc actaaggaga atagtatttg
      421 agaaatatgt atacatgata ttatcatgaa ttagatatac agaatttgta acactctcga
      481 aatcacacga tgtgtcggcg ttaagatcta atatatcact cgataacaca ttttcatcta
      541 gatacactag acatttttta aagctaaaat agtctttagt agtgacagta actatgcgat
      601 tattttcatc gatgatacat ttcatcggca tattattacg cttaccatca aagactatac
      661 catgtgtata tctaacgtat tctagcatgg ttgccatacg cgcattaaac ttttcaggat
      721 ctttggatag atcttccaat ctatctattt gagaaaacat ttttatcatg ttcaatagtt
      781 gaaacgtcgg atccactata tagatattat ctataaagat tttaggaact acgttcatgg
      841 tatcctggcg aatattaaaa ctatcaatga tatgattatc gttttcatct tttatcacca
      901 tatagtttct aagatatggg attttactta atataatatt atttcccgta ataaatttta
      961 ttagaaatgc caaatctata agaaaagttc tagaattagt ctgaagaata tctatatcgc
     1021 cgtatagtat atttggatta attagatata gagaatatga tccgtaacat atacaacttt
     1081 tattatggcg tctaagatat tcttccatca acttattaac atttttgact agggaagata
     1141 cattatgacg tcccattact tttgccttgt ctattactgc gacgttcata gaatttagca
     1201 tatctcttgc caattcttcc attgatgtta cattataaga aattttagat gaaattacat
     1261 ttggagcttt aatagtaaga actcctaata tgtccgtgta tgtggtcact aatacagatt
     1321 gtagttctat aatcgtaaat aatttaccta tattatatgt ttgagtctgt ttagaaaagt
     1381 agctaagtat acgatctttt atttctgatg cagatgtatc aacatcggaa aaaaatcttt
     1441 ttttattctt ttttactaaa gatacaaata tgtctttgtt aaaaacagtt attttctgaa
     1501 tatttctagc ttgtaatttt aacatatgat attcgttcac actaggtact ctgcctaaat
     1561 aggtttctat aatctttaat gtaatattag gaaaagtatt ctgatcagga ttcctattca
     1621 ttttgaggat ttaaaactct gattattgtc taatatggtc tctacgcaaa ctttttcaca
     1681 gagcgataga gtttttgata actcgttttt cttaagaaat ataaaactac tgtctccaga
     1741 gctcgctcta tcttttattt tatttaattc gatacaaact cctgatactg gttcagaaag
     1801 taattcatta attttcagtc ctttatagaa gatatttaat atagataata caaaattttc
     1861 agttcttgat atcgatctga ttgatcctag aactagatat attaataacg tgctcattag
     1921 gcagtttatg gcagcttgat aattagatat agtatattcc agttcatatt tattagatac
     1981 cgcattgccc agattttgat attctatgaa ttcctctgaa aataaatcca aaataactaa
     2041 acattctatt ttttgtggat tagtgtactc tcttccctct atcatgttca ctactggtgt
     2101 ccacgatgat aaatatctag agggaatata atatagtcca taggatgcca atctagcaat
     2161 gtcgaataac tgtaatttga ttcttcgttc ttcattatga attgattctt gaggtataaa
     2221 cctaacacaa attatattat tagacttttc gtatgtaatg tctttcatgt tataagtttt
     2281 taatcctgga atagaatcta ttttaatgag gcttttaaac gcagagttct ccaacgagtc
     2341 aaagcataat actctgttgt ttttcttata tacgatgtta cgattttctt ctttgaatgg
     2401 aataggtttt tgaattagtt tataattaca acataataga taaggaagtg tgcaaatagt
     2461 acgcggaaaa aacataatag ctcccctgtt ttcatccatg gttttaagta aatgatcact
     2521 ggcttcttta gtcaatggat attcgaacat taaccgtttc atcatcattg gacagaatcc
     2581 atatttctta atgtaaagag tgatcaaatc attgtgttta ttgtaccatc ttgttgtaaa
     2641 tgtgtattcg gttatcggat ctgctccttt ttctattaaa gtatcgatgt cgatctcgtc
     2701 taagaattca actatatcga catatttcat ttgtatacac ataaccatta ctaacgtaga
     2761 atgtatagga agagatgtaa cgggaacagg gtttgttgat tcgcaaacta ttctaataca
     2821 taattcttct gttaatacgt cttgcacgta atctattata gatgccaaga tatctatata
     2881 attattttgt aagatgatgt taactatgtg atctatataa gtagtgtaat aattcatgta
     2941 tttcgatata tgttccaact ctgtctttgt gatgtctagt ttcgtaatat ctatagcatc
     3001 ctcaaaaaat atattcgcat atattcccaa gtcttcagtt ctatcttcta aaaaatcttc
     3061 aacgtatgga atataataat ctattttacc tcttctgata tcattaatga tatagttttt
     3121 gacactatct tctgtcaatt gattcttatt cactatatct aagaaacgga tagcgtccct
     3181 aggacgaact actgccatta atatctctat tatagcttct ggacataatt catctattat
     3241 accagaatta atgggaacta ttccgtatct atctaacata gttttaagaa agtcagaatc
     3301 taagacctga tgttcatata ttggttcata catgaaatga tctctattga tgatagtgac
     3361 tatttcattc tctgaaaatt ggtaactcat tctatatatg ctttccttgt tgatgaagga
     3421 tagaatatac tcaatagaat ttgtaccaac aaactgttct cttatgaatc gtatatcatc
     3481 atctgaaata atcatgtaag gcatacattt aacaattaga gacttgtctc ctgttatcaa
     3541 tatactattc ttgtgataat ttatgtgtga ggcaaatttg tccacgttct ttaattttgt
     3601 tatagtagat atcaaatcca atggagctac agttcttggc ttaaacagat atagtttttc
     3661 tggaacgaat tctacaacat tattataaag gactttgggt agataagtgg gatgaaatcc
     3721 tattttaatt aatgcgatag ccttgtcctc gtgcagatat ccaaacgctt ttgtgatagt
     3781 atggcattca ttgtctagaa acgctctacg aatatctgtg acagatatca tctttagaga
     3841 atatactagt cgcgttaata gtactacaat ttgtattttt taatctatct caataaaaaa
     3901 attaatatgt atgattcaat gtataactaa actactaact gttattgata actagaatca
     3961 gaatctaatg atgacgtaac caagaagttt atctactgcc aatttagctg cattattttt
     4021 agcatctcgt ttagattttc catctgcctt atcgaatact cttccgtcga tgtctacaca
     4081 ggcataaaat gtaggagagt tactaggccc cactgattca atacgaaaag accaatctct
     4141 cctagtaatt tggcagtact cattaataac ggtgacaggg ttagcacctt tccaatcaat
     4201 aattttttta gccggaataa catcatcaaa agacttatga tcctctctca ttgatttttc
     4261 gcgggataca tcatctatta tgacgtcagc catagcatca gcatccggct tatccgcctc
     4321 cgttgtcata aaccaacgag gaggaatatc gtcggagctg tacaccatag cactacgttg
     4381 aagatcgtac agagctttat taacttctcg cttctccata ttaagttgtc tagttagttg
     4441 tgcagcagta gctccttcga ttccaatggt tttaatagcc tcacacacaa tctctgcgtt
     4501 agaacgctcg tcgatataga ttttagacat ttttagagag aactaacgca atcagtaata
     4561 aaactaattt attttatcat tttttttatt catcatcctc tggtggttcg tcgtttctat
     4621 cgaatgtagc tctgattaac ccgtcatcta taggtgatgc tggttctgga gattctggag
     4681 gagatggatt attatctgga agaatctctg ttatttcctt gttttcatgt atcgattgcg
     4741 ttgtaacatt aagattgcga aatgctctaa atttgggagg cttaaagtgt tgtttgcaat
     4801 ctctacacgc gtgtctaact agtggaggtt cgtcagcggc tctagtttga atcatcatcg
     4861 gcgtagtatt cctactttta cagttaggac acggtgtatt gtatttctcg tcgagaacgt
     4921 taaaataatc gttgtaactc acatccttta ttttatctat attgtattct actcctttct
     4981 taatgcattt tataccgaat aagagatagc gaaggaattc tttttcggtg ccgctagtac
     5041 ccttaatcat atcacatagt gttttatatt ccaaatttgt ggcaatagac ggtttatttc
     5101 tatacgatag tttgtttctg gaatcctttg agtattctat accaatatta ttctttgatt
     5161 cgaatttagt ttcttcgata ttagattttg tattacctat attcttgatg tagtactttg
     5221 atgatttttc catggcccat tctattaagt cttccaagtt ggcatcatcc acatattgtg
     5281 atagtaattc tcggatatca gtagcggtta ccgccattga tgtttgttca ttggatgagt
     5341 aactactaat gtatacattt tccatttata acacttatgt attaactttg ttcatttata
     5401 ttttttcatt attatgttga tattaacaaa agtgaatata tatatgttaa taattgtatt
     5461 gtggttatac ggctacaatt ttataattag tgaaagtcag tgtccgatga tcaatgacga
     5521 tagctttact ctgaaaagaa agtatcaaat cgatagtgcg gagtcaacaa taaaaatgga
     5581 taagaagagg acaaagtttc agaatagagc caaaatggta aaagaaataa atcagacaat
     5641 aagagcagca caaactcatt acgagacatt gaaactagga tacataaaat ttaagagaat
     5701 gattaggact actactctag aagatatagc accatctatt ccaaataatc agaaaactta
     5761 taaactattc tcggacattt cagccatcgg caaagcatca cggaatccaa gtaagatggt
     5821 atatgctctg ctgctttaca tgtttcccaa tttgtttgga gatgatcata gattcattcg
     5881 ttatagaatg catccaatga gtaaaatcaa acacaagatc ttctctcctt tcaaacttaa
     5941 tcttattaga atattagtgg aagaaagatt ctataataat gaatgcagat ctaataaatg
     6001 gagaataatt ggaacacaag ttgataaaat gttgatagct gaatctgata aatatacaat
     6061 agatgcaagg tataacctaa aacccatgta tagaatcaag ggaaaatctg aagaagatac
     6121 cctctttatc aaacagatgg tagaacaatg tgtgacatcc caggaattgg tggaaaaagt
     6181 gttgaagata ctgtttagag atttgttcaa gagtggagaa tacaaagcgt acagatacga
     6241 tgatgatgta gaaaatggat ttattggatt ggatacacta aaattaaaca ttgttcatga
     6301 tatagttgaa ccatgtatgc ctgttcgtag gccagtggct aagatactgt gtaaagaaat
     6361 ggtaaataaa tactttgaga atccgctaca tattattggt aaaaatcttc aagagtgcat
     6421 tgactttgtt agtgaatagg catttcatct ttctccaata ctaattcaaa ttgttaaatt
     6481 aataatggat agtataaata gttattagtg ataaaatagt aaaaataatt attagaataa
     6541 gagtgtagta tcatagataa ctctcttcta taaaaatgga ttttattcgt agaaagtatc
     6601 ttatatacac agtagaaaat aatatagatt ttttaaagga tgatacatta agtaaagtaa
     6661 acaattttac cctcaatcat gtactagctc tcaagtatct agttagcaat tttcctcaac
     6721 atgttattac taaggatgta ttagctaata ccaatttttt tgttttcata catatggtac
     6781 gatgttgtaa agtgtacgaa gcggttttac gacacgcatt tgatgcaccc acgttgtacg
     6841 ttaaagcatt gactaagaat tatttatcgt ttagtaacac aatacaatcg tacaaggaaa
     6901 ccgtgcataa actaacacaa gatgaaaaat ttttagaggt tgccaaatac atggacgaat
     6961 taggagaact tataggcgta aattatgact tagttcttaa tccattattt cacggagggg
     7021 aacccatcaa agatatggaa atcatttttt taaaactgtt taagaaaaca gacttcaaag
     7081 ttgttaaaaa attaagtgtt ataagattac ttatttgggc ttacctaagc aagaaagata
     7141 caggcataga gtttgcggat aatgatagac aagatatata tactctattt caacaaactg
     7201 gtagaatcgt ccatagcaat ctaacagaaa cgtttagaga ttatatcttt cccggagata
     7261 agactagcta ttgggtgtgg ttaaacgaaa gtatagctaa tgatgcggat attgttctta
     7321 atagacacgc cattaccatg tatgataaaa ttcttagtta tatatactct gagataaaac
     7381 agggacgcgt taataaaaac atgcttaagt tagtttatat ctttgagcct gaaaaagata
     7441 tcagagaact tctgctagaa atcatatatg atattcctgg agatatccta tctattattg
     7501 atgcaaaaaa cgacgattgg aaaaaatatt ttattagttt ttataaagct aattttatta
     7561 acggtaatac atttattagt gatagaacgt ttaacgagga cttattcaga gttgttgttc
     7621 aaatagatcc cgaatatttc gataatgaac gaattatgtc tttattctct acgagtgctg
     7681 cggacattaa acgatttgat gagttagata ttaataacag ttatatatct aatataattt
     7741 atgaggtgaa cgatatcaca ttagatacaa tggatgatat gaagaagtgt caaatcttta
     7801 acgaggatac gtcgtattat gttaaggaat acaatacata cctgtttttg cacgagtcgg
     7861 atcccatggt catagagaac ggaatactaa agaaactgtc atctataaaa tccaagagta
     7921 gacggctgaa cttgtttagc aaaaacattt taaaatatta tttagacgga caattggctc
     7981 gtctaggtct tgtgttagat gattataaag gagacttgtt agttaaaatg ataaaccatc
     8041 ttaagtctgt ggaggatgta tccgcattcg ttcgattttc tacagataaa aaccctagta
     8101 ttcttccatc gctaatcaaa actattttag ctagttataa tatttccatc atcgtcttat
     8161 ttcaaaggtt tttaagagat aatctatatc atgtagaaga attcttggat aaaagcatcc
     8221 atctaaccaa gacggataag aaatatatac ttcaattgat aagacacggt agatcataga
     8281 acagaccaaa tatattatta ataatttgta tatacataga tataattatc acatattaaa
     8341 aattcacaca tttttgataa atgggaactg ctgcaacaat tcagactccc accaaattaa
     8401 tgaataaaga aaatgcagaa atgattttgg aaaaaattgt tgatcatata gttatgtata
     8461 ttagtgacga atcaagtgat tcagaaaata atcctgaata tattgatttt cgtaacagat
     8521 acgaagacta tagatctctc attataaaaa gtgatcacga gtttgtaaag ctatgtaaaa
     8581 atcatgcaga gaaaagttct ccagaaacgc aacaaatgat tatcaaacac atatacgaac
     8641 aatatcttat tccagtatct gaagtactat taaaacctat aatgtccatg ggtgacataa
     8701 ttacatataa cggatgtaaa gacaatgaat ggatgctaga acaactctct accctaaact
     8761 ttaacaatct ccgcacatgg aactcatgta gcataggcaa tgtaacgcgt ctgttttata
     8821 cattttttag ttatctgatg aaagataaac taaatatata agtataatcc cattctaata
     8881 ctttaacctg atgtattagc atcttattag aatattaacc taactaaaag acataacata
     8941 aaaactcatt acatagttga taaaaagcgg taggatataa atattatggc tgccaccgtt
     9001 ccgcgttttg acgacgtgta caaaaatgca caaagaagaa ttctagatca agaaacattt
     9061 tttagtagag gtctaagtag accgttaatg aaaaacacat atctatttga taattacgcg
     9121 tatggatgga taccagaaac tgcaatttgg agtagtagat acgcaaactt agatgcaagt
     9181 gactattatc ccatttcgtt gggattactt aaaaagttcg agtttctcat gtctctatat
     9241 aaaggtccta ttccagtata cgaagaaaaa gtaaatactg aattcattgc taatggatcg
     9301 ttctctggta gatacgtatc atatcttcga aagttttctg ctcttccaac aaacgagttt
     9361 attagttttt tgttactgac ttccattcca atctataata tcttgttctg gtttaaaaat
     9421 actcagtttg atattactaa acacacatta ttcagatacg tctatacaga taatgccaaa
     9481 cacctggcgt tggctaggta tatgcatcaa acaggagact ataagccttt gtttagtcgt
     9541 ctcaaagaga attatatatt taccggtccc gttccaatag gtatcaaaga tataaatcac
     9601 cctaatctta gtagagcaag aagtccatcc gattatgaga cattagctaa tattagtact
     9661 atattgtact ttaccaagta tgatccggta ttaatgtttt tattgtttta cgtacctggg
     9721 tattcaatta ctacaaaaat tactccagcc gtagaatatc taatggataa actgaatcta
     9781 acaaagagcg acgtacaact gttgtaaatt attttatgct tcgtaaaatg taggttttga
     9841 accaaacatt ctttcaaaga atgagatgca taaaacttta ttatccaata gattgactat
     9901 ttcggacgtc aatcgtttaa agtaaacttc gtaaaatatt ctttgatcac tgccgagttt
     9961 aaaacttcta tcgataattg tttcatatgt tttaatattt acaagttttt tggtccatgg
    10021 tacattagcc ggacaaatat atgcaaaata atatcgttct ccaagttcta tagtttctgg
    10081 attattttta ttatattcag taaccaaata catattaggg ttatctgcgg atttataatt
    10141 tgagtgatgc attcgactca acataaataa ttctagagga gacgatctac tatcaaattc
    10201 ggatcgtaaa tctgtttcta aagaacggag aatatctata catacctgat tagaattcat
    10261 ccgtccttca gacaacatct cagacagtct ggtcttgtat gtcttaatca tattcttatg
    10321 aaacttggaa acatctcttc tagtttcact agtaccttta ttaattctct caggtacaga
    10381 ttttgaattc gacgatgccg agtatttcat cgttgtatat ttcttcttcg attgcataat
    10441 cagattctta tataccgcct caaactctat tttaaaatta ttaaacaata ctctattatt
    10501 aatcagtcgt tctaactcct ttgctatttc tatggactta tctacatctt gactgtctat
    10561 ctctgtaaac acggagtcgg tatctccata cacgctacga aaacgaaatc tgtaatctat
    10621 aggcaacgat gttttcacaa tcggattaat atctctatcg tccatataaa atggattact
    10681 taatggattg gcaaaccgta acataccgtt agataactct gctccattta gtaccgattc
    10741 tagatacaag atcattctac gtcctatgga tgtgcaactc ttagccgaag cgtatgagta
    10801 tagagcacta tttctaaatc ccatcagacc atatactgag ttggctacta tcttgtacgt
    10861 atattgcatg gaatcataga tggccttttc agttgaactg gtagcctgtt ttaacatctt
    10921 tttatatctg gctctctctg ccaaaaatgt tcttaatagt ctaggaatgg ttccttctat
    10981 cgatctatcg aaaattgcta tttcagagat gaggttcggt agtctaggtt cacaatgaac
    11041 cgtaatatat ctaggaggtg gatatttctg aagcaagagc tgattattta tttcttcttc
    11101 caatctattg gtactaacaa cgacaccgac taatgtttcc ggagatagat ttccaaagat
    11161 acacacatta ggatacagac tgttataatc aaagattaat acattattac taaacatttt
    11221 ttgttttgga gcaaatacct taccgccttc ataaggaaac ttttgttttg tttctgatct
    11281 aactaagata gttttagttt ccaacaatag ctttaacagt ggacccttga tgactgtact
    11341 cgctctatat tcgaatacca tggattgagg aagcacatat gttgacgcac ccgcgtctgt
    11401 ttttgtttct actccataat actcccacaa atactgacac aaacaagcat catgaataca
    11461 gtatctagcc atatctaaag ctatgtttag attataatcc ttatacatct gagctaaatc
    11521 aacgtcatcc tttccgaaag ataatttata tgtatcatta ggtaaagtag gacataatag
    11581 tacgacttta aatccatttt cccaaatatc tttacgaatt actttacata taatatcctc
    11641 atcaacagtc acataattac ctgtggttaa aacctttgca aatgcagcgg ctttgccttt
    11701 cgcgtccgta gtatcgtcac cgatgaacgt catttctcta actcctctat ttaatacttt
    11761 acccatgcaa ctgaacgcgt tcttggatat agaatccaat ttgtacgaat ccaatttttc
    11821 agatttttga atgaatgaat atagatcgaa aaatatagtt ccattattgt tattaacgtg
    11881 aaacgtagta ttggccatgc cgcctactcc cttatgacta gactgatttc tctcataaat
    11941 acagagatat acagcttcct ttttgtccgg agatctaaag ataatcttct ctcctgttaa
    12001 taactctaga cgattagtaa tatatctcag atcaaagtta tgtccgttaa aggtaacgac
    12061 atagtcgaac gttagttcca acaattgttt agctattcgt aacaaaacta tttcagaaca
    12121 taaaactagt tctcgttcgt aatccatttc cattagtgac tgtatcctca aacatcctct
    12181 atcgacggct tcttgtattt cctgttccgt taacatctct tcattaatga gcgtaaacaa
    12241 taatcgttta ccacttaaat cgatataaca gtaacttgta tgcgagattg ggttaataaa
    12301 tacagaagga aacttcttat cgaagtgaca ctctatatct agaaataagt acgatcttgg
    12361 gatatcgaat ctaggtattt ttttagcgaa acagttacgt ggatcgtcac aatgataaca
    12421 tccattgtta atctttgtca aatattgctc gtccaacgag taacatccgt ctggagatat
    12481 cccgttagaa atataaaacc aactaatatt gagaaattca tccatggtgg cattttgtat
    12541 gctgcgtttc tttggctctt ctatcaacca catatctgcg acggagcatt ttctatcttt
    12601 aatatctaga ttataactta ttgtctcgtc aatgtctata gttctcatct ttcccaacgg
    12661 cctcgcatta aatggaggag gagacaatga ctgatatatt tcgtccgtca ctacgtaata
    12721 aaagtaatga ggaaatcgta taaatacggt ctcaccattt cgacatctgg atttcagata
    12781 taaaaatctg ttttcaccgt gactttcaaa ccaattaatg caccgaacat ccatttatag
    12841 aatttagaaa tatattttca tttaaatgaa tcccaaacat tggggaagag ccgtatggac
    12901 cattattttt atagtacttt cgcaagcggg tttagacggc aacatagaag cgtgtaaacg
    12961 aaaactatat actatagtta gcactcttcc atgtcctgca tgtagacggc acgcgactat
    13021 cgctatagag gacaataatg tcatgtctag cgatgatctg aattatattt attatttttt
    13081 catcagatta tttaacaatt tggcatctga tcccaaatac gcgatcgatg tgacaaaggt
    13141 taacccttta taaacttaac ccattataaa acttatgatt agtcacgact gaaataaccg
    13201 cgtgattatt ttttggtata attctacacg gcatggtttc tgtgactatg aattcaaccc
    13261 ccgttacatt agtgaaatct ttaacaaaca gcaagggttc gtcaaagaca taaaactcat
    13321 tgtttacaat cgaaatagac cccctatcac acttaaaata aaaaatatcc ttatccttta
    13381 ccaccaaata aaattctgat tggtcaatgt gaatgtattc acttaacagt tccacaaatt
    13441 tatttattaa ctccgaggca catacatcgt cggtattttt tatggcaaac tttactcttc
    13501 cagcatccgt ttctaaaaaa atattaacga gttccattta tatcatccaa tattattgaa
    13561 atgacgttga tggacaaatg atacaaataa gaaggtacgg tacctttgtc caccatctcc
    13621 tccaattcat gctctatttt gtcattaact ttaatgtatg aaaacagtac gccacatgct
    13681 tccatgacag tgtgtaacac tttggataca aaatgtttga cattagtata attgttcaag
    13741 actgtcaatc tataatagat agtagctata atatattcta tgatggtatt gaagaagatg
    13801 acaaccttgg catattgatc atttaacaca gacatggtat caacagatag cttgaatgaa
    13861 agagaatcag taattggaat aagcgtcttc tcgatggagt gtccgtatac caacatgtct
    13921 gatattttga tgtattccat taaattattt agttttttct ttttattctc gttaaacagc
    13981 atttctgtca acggacccca acatcgttga ccgattaagt tttgattgat ttttccgtgt
    14041 aaggcgtatc tagtcagatc gtatagccta tccaataatc catcgtctgt gtgtagatca
    14101 catcgtacac tttttaattc tctatagaag agcgacagac atctggagca attacagaca
    14161 gcaatttctt tattctctac agatgtaaga tacttgaaga cattcctatg atgatgcaga
    14221 attttggata acacggtatt gatggtatct gttaccataa ttcctttgat ggctgatagt
    14281 gtcagagcac aagatttcca atctttgaca atttttagca ccattatctt tgttttgata
    14341 tctatatcag acagcatggt gcgtctgaca acacaaggat taagacggaa agatgaaatg
    14401 attctctcaa catcttcaat ggataccttg ctattttttc tggcattatc tatatgtgcg
    14461 agaatatcct ctagagaatc agtatccttt ttgatgatag tggatctcaa tgacatggga
    14521 cgtctaaacc ttcttattct atcaccagat tgcatggtga tttgtcttct ttcttttatc
    14581 ataatgtaat ctctaaattc atcggcaaat tgtctatatc taaaatcata atatgagatg
    14641 tttacctcta caaatatctg ttcgtccaat gttagagtat ttacatcagt tttgtattcc
    14701 aaattaaaca tggcaacgga tttaatttta tattcctcta ttaagtcctc gtcgataata
    14761 acagaatgta gataatcatt taatccatcg tacatggttg gaagatgctt gttgacaaaa
    14821 tctttaattg tcttgatgaa ggtgggacta tatctaacat cttgattaat aaaatttata
    14881 acattgtcca taggatactt tgtaactagt tttatacaca tctcttcatc ggtaagccca
    14941 gacagaatat cgtgaacagg tggtatatta tattcatcag atatacgaag aacaatgtcc
    15001 aaatctatat tgtttaatat attatataga tgtagtgtag ctcctacagg aatatcttta
    15061 actaagtcaa tgatttcatc aaccgttaga tctattttaa agttaatcat ataggcattg
    15121 atttttaaaa ggtatgtagc cttgactaca ttctcattaa ttaaccattc caagtcactg
    15181 tgtgtaagaa gattatattc tatcataagc tt
//
LOCUS       YSCERD2A     3327 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae ER lumen protein retaining receptor (ERD2) gene,
            complete cds.
ACCESSION   M34777
KEYWORDS    ER lumen protein retaining receptor; HDEL receptor.
SOURCE      S.cerevisiae (strain SEY2102) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3327)
  AUTHORS   Semenza,J.C., Hardwick,K.G., Dean,N. and Pelham,H.R.B.
  TITLE     ERD2, a yeast gene required for the receptor-mediated retrieval of
            luminal ER proteins from the secretory pathway
  JOURNAL   Cell 61, 1349-1357 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Semenza, 01-JUN-1990, for release after publication.
FEATURES       from  to/span     description
    pept       1337     1358     ER lumen protein retaining receptor, exon 1
                                 (ERB2)
               1456     2093     ER lumen protein retaining receptor, exon 2
    ORF        2492     2960     ORF1
    IVS        1359     1455     ERB2 intron A
    site       1359     1364     consensus splice donor
    site       1416     1422     branchpoint sequence
    site       1453     1455     acceptor sequence
BASE COUNT      996 a    639 c    574 g   1118 t
ORIGIN      
        1 aagcttcctg acggaatgtt aattctcgtt gaccctgagc aggctgttga gcccggggat
       61 ttctgcatag ccagacttgg gggtgatgag tttaccttca agaaactgat ccatccaaac
      121 catttttggg tttagttgct gcttccgctg gtatccttca agatgtcatt gaaggtaagt
      181 acgatcttga ggccggcgaa aacaaattca acttttaaat caaaatttta gaataaaagt
      241 ctactgcatt aacaaagata aatttctaat ttttttatgt atattatatt ttttggaaga
      301 gaaaatattt gtagtgcttc tctacactgt tttttttttt tcgttcatgt tgtctttcct
      361 ctcctgaata tatttaattt aataaatcta tcttatctga catatagaag ctgtttcgat
      421 agagttttcg actggccaac agttgttact gactggataa aaatttacga tttttcctcc
      481 aatcttggct ttcaaaatga gaaataagag tcttgaaatt caaagaatgt atctatgtat
      541 gaatatagaa tatgctcaaa tatttagttt tgtataagga agttctaaat aattttccat
      601 ggcgtgcttt taccaaagta ctgaacaggg agattattta tcattttttt tcttactctt
      661 cgagttgaag tcgattgtag gtgatgacat gaacattaaa taaaatgtac ttgaacaaat
      721 cagaccagca aacacccaca tcgcaatggt actgggtttt ctcgttattc tgtggaagaa
      781 ttcaattacc ggattattgt taaaaaagcc cattagaagt atcctaacga ttgattaaag
      841 taaattgtta ttcacgctac tatcgcaatc gtccaactga accttattct ttattgtata
      901 taatctctgc cgagaaattc ttatacttat gtatctggga attcaccagg ttagaatacc
      961 ttcccctcct caatttctaa tttttttttc tacccttatt tttctctctt gcttcatcgc
     1021 tctcaaagct aaaagctaat acagcaaaca aaagtgttat taggaattca attgagtcgt
     1081 gaaagaaaaa taatattcgt cttacttgag gactcacttt ggtcaattga gattaaagtg
     1141 tagtctgcac cactgaattc ctcgttttcc tttaaagatc cgggatggct tatctgcggt
     1201 acccggactg aaaacgacgt tttagctaaa tattgaacac gttaaaaata agttcaaatt
     1261 tcgaggcaga tgaatttgct gagggcactg ttcaagagtg agatatacgc gcaagttgag
     1321 gcataatcag taagcaatga atccgtttag aatcttaggt atgttactat ttggagtttc
     1381 atgaggcttt tcccgccgta gatcgaaccc aatcttacta acagagaaag ggctttttcc
     1441 cgaccatcaa gacaggtgat ttatcacatc taaccagtat actgatcctg attcataata
     1501 tcaagaccac aaggtacatt gaaggtattt ctttcaagac ccaaacgttg tacgctttgg
     1561 ttttcataac acgatacttg gatctcttga cttttcactg ggtatcccta tacaatgctc
     1621 taatgaaaat atttttcatt gtatctaccg cttacattgt agtgctatta caagggtcta
     1681 aaagaaccaa caccattgcg tataatgaaa tgcttatgca tgataccttt aagatccagc
     1741 atttactaat tgggagtgct ctaatgagtg tttttttcca tcacaagttc acttttcttg
     1801 aattagcatg gagtttttct gtatggttgg agagtgtggc tattctacct caattgtaca
     1861 tgctatctaa gggtgggaag actagaagtc taactgttca ttatattttt gccatgggat
     1921 tatacagagc attgtatatt cctaactgga tttggaggta cagcacggaa gataaaaaat
     1981 tggacaagat tgccttcttc gcgggacttt tgcaaactct gttgtactct gatttctttt
     2041 acatttacta cactaaagtc atcagaggaa agggtttcaa actgccaaaa taaaaaaaaa
     2101 acctgtttta aatagtgttc gcagcaatat atacaaattt caatatatca atgctatacc
     2161 aaaagaaaaa aaaaatacat aaaattgatg ttgatattgg tctatttctt tttatactat
     2221 gatatgtatg cattaatctc tttttagctc ataaaattct ttccttacac catccttggt
     2281 gacaataagg atttccagcc catcacccac ttgtatatgt ctttctgtag cagaagtgaa
     2341 cgagtctctc accagtttga tgacttcttc cacggacaag tatttcaaag gctttttgac
     2401 tttaccgttt gtacctggct catattgaat ttttgaaatt aacctgattg tccaaaaatg
     2461 gcatgatcaa tgatgccgca gcaccacctg ctctacactg ttctctttcg taggagccaa
     2521 actgggtcga acgaatagac agcgccctta ccatcttcgt caagacccgc aatgatcgta
     2581 tgaacgtagt aagggaaaaa cctcttcccg tacagaagat gttgaatgtt ccttgctgca
     2641 gagtttatag atagtttttt gtcgttgtgg tcgaaatggt accattttac actatttttg
     2701 aatcttttta ctaaagcgtc gccgtctgct gcaaatccat tcgccgacat gactatgtta
     2761 tcaccacaat caaaaacctt gggttcataa cgagaattaa ttgagtaatc ggtgatattt
     2821 cttgtatcgc ctgctaacac tgcgaaatct tcacctgcaa tgcccaggat tgtaccacca
     2881 ttatcaccgt aaggattgaa ttgatgttca atgggtgtat ttgacgcctc cgaagagtat
     2941 tctgatgcaa tagtggccat attttgttta actttatagt tcaatagtct tggctactct
     3001 ctttccaact cagttcacct tgttattata ccgcttgttt ttgccaccct ttgagtttct
     3061 cgatccttta agttggaaaa gataaaaaca attatcgctc cgtacgaaca ccaaatctct
     3121 gcgacgcaga gacggcaata caaaatcaga aaaaaaaaac aatttatcct atgcttcact
     3181 agtcacaggc tctacgaaac attctatcgc ctcactgcct ttacatcact taagcaagaa
     3241 gttttaaata ttacggcaat tcctttgcag caataccttt actttatttc gagagaaaat
     3301 agatcgtaaa atgcctactc caagctt
//
LOCUS       YSCLYS2A     4980 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae alpha-aminoadipate reductase (LYS2) gene, complete
            cds.
ACCESSION   M36287 M14967 M14968
KEYWORDS    LYS2 gene; alpha-aminoadipate reductase.
SOURCE      S.cerevisiae (strain S288C) DNA, clone pDP6.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 746 and 3921 to 4980)
  AUTHORS   Fleig,U.N., Pridmore,R.D. and Philippsen,P.
  TITLE     Construction of LYS2 cartridges for use in genetic manipulations of
            Saccharomyces cerevisiae
  JOURNAL   Gene 46, 237-245 (1986)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 4632)
  AUTHORS   Morris,M.E. and Jinks-Robertson,S.
  TITLE     Nucleotide sequence of the LYS2 gene of Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.Jinks-Robertson, 10-JUL-1990.
               Emory University
               Dept. of Biology
               1510 Clifton Road N.E.
               Atlanta, GA 30322
FEATURES       from  to/span     description
    pept        297     4475     alpha-aminoadipate reductase (LYS2; EC
                                 1.2.1.31)
BASE COUNT     1523 a    942 c   1026 g   1489 t
ORIGIN      Chromosome II.
        1 tctagaggca tcgcacagtt ttagcgagga aaactcttca atagttttgc cagcggaatt
       61 ccacttgcaa ttacataaaa aattccggcg gtttttcgcg tgtgactcaa tgtcgaaata
      121 cctgcctaat gaacatgaac atcgcccaaa tgtatttgaa gacccgctgg gagaagttca
      181 agatatataa gtaacaagca gccaatagta taaaaaaaaa tctgagttta ttacctttcc
      241 tggaatttca gtgaaaaact gctaattata gagagatatc acagagttac tcactaatga
      301 ctaacgaaaa ggtctggata gagaagttgg ataatccaac tctttcagtg ttaccacatg
      361 actttttacg cccacaacaa gaaccttata cgaaacaagc tacatattcg ttacagctac
      421 ctcagctcga tgtgcctcat gatagttttt ctaacaaata cgctgtcgct ttgagtgtat
      481 gggctgcatt gatatataga gtaaccggtg acgatgatat tgttctttat attgcgaata
      541 acaaaatctt aagattcaat attcaaccaa cgtggtcatt taatgagctg tattctacaa
      601 ttaacaatga gttgaacaag ctcaattcta ttgaggccaa tttttccttt gacgagctag
      661 ctgaaaaaat tcaaagttgc caagatctgg aaaggacccc tcagttgttc cgtttggcct
      721 ttttggaaaa ccaagatttc aaattagacg agttcaagca tcatttagtg gactttgctt
      781 tgaatttgga taccagtaat aatgcgcatg ttttgaactt aatttataac agcttactgt
      841 attcgaatga aagagtaacc attgttgcgg accaatttac tcaatatttg actgctgcgc
      901 taagcgatcc atccaattgc ataactaaaa tctctctgat caccgcatca tccaaggata
      961 gtttacctga tccaactaag aacttgggct ggtgcgattt cgtggggtgt attcacgaca
     1021 ttttccagga caatgctgaa gccttcccag agagaacctg tgttgtggag actccaacac
     1081 taaattccga caagtcccgt tctttcactt atcgcgacat caaccgcact tctaacatag
     1141 ttgcccatta tttgattaaa acaggtatca aaagaggtga tgtagtgatg atctattctt
     1201 ctaggggtgt ggatttgatg gtatgtgtga tgggtgtctt gaaagccggc gcaacctttt
     1261 cagttatcga ccctgcatat cccccagcca gacaaaccat ttacttaggt gttgctaaac
     1321 cacgtgggtt gattgttatt agagctgctg gacaattgga tcaactagta gaagattaca
     1381 tcaatgatga attggagatt gtttcaagaa tcaattccat cgctattcaa gaaaatggta
     1441 ccattgaagg tggcaaattg gacaatggcg aggatgtttt ggctccatat gatcactaca
     1501 aagacaccag aacaggtgtt gtagttggac cagattccaa cccaacccta tctttcacat
     1561 ctggttccga aggtattcct aagggtgttc ttggtagaca tttttccttg gcttattatt
     1621 tcaattggat gtccaaaagg ttcaacttaa cagaaaatga taaattcaca atgctgagcg
     1681 gtattgcaca tgatccaatt caaagagata tgtttacacc attattttta ggtgcccaat
     1741 tgtatgtccc tactcaagat gatattggta caccgggccg tttagcggaa tggatgagta
     1801 agtatggttg cacagttacc catttaacac ctgccatggg tcaattactt actgcccaag
     1861 ctactacacc attccctaag ttacatcatg cgttctttgt gggtgacatt ttaacaaaac
     1921 gtgattgtct gaggttacaa accttggcag aaaattgccg tattgttaat atgtacggta
     1981 ccactgaaac acagcgtgca gtttcttatt tcgaagttaa atcaaaaaat gacgatccaa
     2041 actttttgaa aaaattgaaa gatgtcatgc ctgctggtaa aggtatgttg aacgttcagc
     2101 tactagttgt taacaggaac gatcgtactc aaatatgtgg tattggcgaa ataggtgaga
     2161 tttatgttcg tgcaggtggt ttggccgaag gttatagagg attaccagaa ttgaataaag
     2221 aaaaatttgt gaacaactgg tttgttgaaa aagatcactg gaattatttg gataaggata
     2281 atggtgaacc ttggagacaa ttctggttag gtccaagaga tagattgtac agaacgggtg
     2341 atttaggtcg ttatctacca aacggtgact gtgaatgttg cggtagggct gatgatcaag
     2401 ttaaaattcg tgggttcaga atcgaattag gagaaataga tacgcacatt tcccaacatc
     2461 cattggtaag agaaaacatt actttagttc gcaaaaatgc cgacaatgag ccaacattga
     2521 tcacatttat ggtcccaaga tttgacaagc cagatgactt gtctaagttc caaagtgatg
     2581 ttccaaagga ggttgaaact gaccctatag ttaagggctt aatcggttac catcttttat
     2641 ccaaggacat caggactttc ttaaagaaaa gattggctag ctatgctatg ccttccttga
     2701 ttgtggttat ggataaacta ccattgaatc caaatggtaa agttgataag cctaaacttc
     2761 aattcccaac tcccaagcaa ttaaatttgg tagctgaaaa tacagtttct gaaactgacg
     2821 actctcagtt taccaatgtt gagcgcgagg ttagagactt atggttaagt atattaccta
     2881 ccaagccagc atctgtatca ccagatgatt cgtttttcga tttaggtggt cattctatct
     2941 tggctaccaa aatgattttt accttaaaga aaaagctgca agttgattta ccattgggca
     3001 caattttcaa gtatccaacg ataaaggcct ttgccgcgga aattgacaga attaaatcat
     3061 cgggtggatc atctcaaggt gaggtcgtcg aaaatgtcac tgcaaattat gcggaagacg
     3121 ccaagaaatt ggttgagacg ctaccaagtt cgtacccctc tcgagaatat tttgttgaac
     3181 ctaatagtgc cgaaggaaaa acaacaatta atgtgtttgt taccggtgtc acaggatttc
     3241 tgggctccta catccttgca gatttgttag gacgttctcc aaagaactac agtttcaaag
     3301 tgtttgccca cgtcagggcc aaggatgaag aagctgcatt tgcaagatta caaaaggcag
     3361 gtatcaccta tggtacttgg aacgaaaaat ttgcctcaaa tattaaagtt gtattaggcg
     3421 atttatctaa aagccaattt ggtctttcag atgagaagtg gatggatttg gcaaacacag
     3481 ttgatataat tatccataat ggtgcgttag ttcactgggt ttatccatat gccaaattga
     3541 gggatccaaa tgttatttca actatcaatg ttatgagctt agccgccgtc ggcaagccaa
     3601 agttctttga ctttgtttcc tccacttcta ctcttgacac tgaatactac tttaatttgt
     3661 cagataaact tgttagcgaa gggaagccag gcattttaga atcagacgat ttaatgaact
     3721 ctgcaagcgg gctcactggt ggatatggtc agtccaaatg ggctgctgag tacatcatta
     3781 gacgtgcagg tgaaaggggc ctacgtgggt gtattgtcag accaggttac gtaacaggtg
     3841 cctctgccaa tggttcttca aacacagatg atttcttatt gagatttttg aaaggttcag
     3901 tccaattagg taagattcca gatatcgaaa attccgtgaa tatggttcca gtagatcatg
     3961 ttgctcgtgt tgttgttgct acgtctttga atcctcccaa agaaaatgaa ttggccgttg
     4021 ctcaagtaac gggtcaccca agaatattat tcaaagacta cttgtatact ttacacgatt
     4081 atggttacga tgtcgaaatc gaaagctatt ctaaatggaa gaaatcattg gaggcgtctg
     4141 ttattgacag gaatgaagaa aatgcgttgt atcctttgct acacatggtc ttagacaact
     4201 tacctgaaag taccaaagct ccggaactag acgataggaa cgccgtggca tctttaaaga
     4261 aagacaccgc atggacaggt gttgattggt ctaatggaat aggtgttact ccagaagagg
     4321 ttggtatata tattgcattt ttaaacaagg ttggattttt acctccacca actcataatg
     4381 acaaacttcc actgccaagt atagaactaa ctcaagcgca aataagtcta gttgcttcag
     4441 gtgctggtgc tcgtggaagc tccgcagcag cttaaggttg agcattacgt atgatatgtc
     4501 catgtacaat aattaaatat gaattaggag aaagacttag cttcttttcg ggtgatgtca
     4561 cttaaaaact ccgagaataa tatataataa gagaataaaa tattagttat tgaataagaa
     4621 ctgtaaatca gctggcgtta gtctgctaat ggcagcttca tcttggttta ttgtagcatg
     4681 aatcatattt gccttttttt cctgtaattc aatgattctt gcttctatac tatcctcaat
     4741 gcaaaacctt gtgatcttca caggtcgata ctgaccaatt ctatgaactc tatcaccact
     4801 ttgccattca acactagggt tccaccatgg gtctaaaatg aatacttgcg aagcttcaca
     4861 aagattcaaa gcaacaccgc ccgcctttaa actgaccaag aaaacctcgc attgaatgtt
     4921 gttcatgaaa tacttgatgg tttcatctct ttgcgtcggt gacatactac cctgaagctt
//
LOCUS       YSCMTATPSA   2820 bp ds-DNA             ORG       14-AUG-1990
DEFINITION  Yeast mitochondrial oxi3 gene, 3' end, ATPase subunit 3 (aap1) and
            ATPase subunit 6 (oli2) gene, complete cds.
ACCESSION   M36379
KEYWORDS    oli2 region.
SOURCE      Yeast (strain J69-1B) mitochondrial DNA, clone.
  ORGANISM  Mitochondrion Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae.
REFERENCE   1  (bases 1 to 2820)
  AUTHORS   Novitski,C.E., Macreadie,I.G., Maxwell,R.J., Lukins,H.B.,
            Linnane,A.W. and Nagley,P.
  TITLE     Biogenesis of mitochondria: Genetic and molecular analysis of the
            oli2 region of mitochondrial DNA in Saccharomyces cerevisiae
  JOURNAL   Curr. Genet. 8, 135-146 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       63     cytochrome c oxidase subunit I (oxi3)
    pept       1019     1165     ATPase subunit 3 (aap1)
    pept       1872     2651     ATPase subunit 6 (oli2)
    binding     701      709     ribosomal binding site C (put.)
    binding    1814     1821     ribosomal binding site A (put.)
    binding    1835     1845     ribosomal binding site B (put.)
BASE COUNT     1159 a    200 c    183 g   1278 t
ORIGIN      
        1 gaattcttat taacttctcc accagctgta cactcattta atacaccagc tgtacaatct
       61 taagttataa tatttaatta tttacttaat aattaaaaaa aaaagtaaat attatatcta
      121 aaacttaata atataataat aatattctta taaaaaaaaa tatataaaaa aaaaatatat
      181 aaaatttatt aaaatatctt aatatattat cctttttggg aactatataa ataaatacta
      241 atataatcct attatatata tatatatata aaataatata tatatataat taatataaat
      301 aatatttata attatttttt taataatata tataatttaa tatattaatt aatattatat
      361 aattattaaa tatattataa tattattatt attttataat aaaaatattt ttaatactaa
      421 ttattattta ttatttataa atatataaat agtatgttta atattattaa tactaaaaaa
      481 aagtataatt ataattagga tctacaatac atttatctga taaatattaa tattaatatt
      541 aatatttata ttaataaagg gattaaatta attgtatcca atttaattaa attatagata
      601 tattatttat aatattaata tattgtttta ttaaaaaggt aaaaatagtt tttattttat
      661 atataaatat aggatataaa taaatatatt atatatatat taagaatata tttatatttt
      721 acatataatt atttataata taaatatctc cgcaaagccg gattaatgta attatttaat
      781 aattttattt aataatttat taaaataaat atttacattt aataatattt atattatgtc
      841 agttatttta tattaatgtt taatctatta taatattttt ttttataaat atattattta
      901 tattaataaa tatatatatt atttttataa tatatatata tttttattaa atatttatta
      961 aatatttatt aaattattat aatgttgtta ttaatcttat taaaaaatat atataaaaat
     1021 gccacaatta gttccatttt attttatgaa tcaattaata tatggtttct tattaatgat
     1081 tctattatta attttattct cacaattctt tttacctatg atcttaagat tatatgtatc
     1141 tagattattt atttctaaat tataatatat attattaatt tatttattca tataaatatt
     1201 attattatat ataaatatta ataatattta tacttattta ataataataa taaaataaaa
     1261 aaaataatta taatttaata tatttaatat atttccttac agactatata tttatatata
     1321 tatattaaat acaatttaat ttaatttaat tatgttattt attaaataaa gttatattat
     1381 aatataataa caatattata tattattata taattataat atattttaat ataattatca
     1441 aaagaaataa taaaaaatat taataagaat ataatttaat aattattaaa aaaattctta
     1501 tttatagtcc ggtccgcccc cgcggggcgg accccaaagg aggagtaata aaaattatta
     1561 aatacaaata ttatatatat ataattcatt atatatatat atatataata attaatctta
     1621 ttttttatat atttatttat atatctattt atatttatat tttatatata tttatttata
     1681 tatctaaggg gtttggtccc tcccccgtaa gtataagtat acgggggagg gtccctcact
     1741 atttatattt atatttatat ttatatttat atttatattt tatatatttt atatatttat
     1801 aaataaagta taataagata taattatgat taattattta taagttatag ttttataaat
     1861 ttataattat tatgtttaat ttattaaata catatattac atcaccatta gatcaatttg
     1921 agattagact attatttggt ttacaatcat catttattga tttaagttgt ttaaatttaa
     1981 caacattttc attatatact attattgtat tattagttat tacaagttta tatctattaa
     2041 ctaataataa taataaaatt attggttcaa gatgattaat ttcacaagaa gctatttatg
     2101 atactattat aaatatgctt aaaggacaaa ttggaggtaa aaattgaggt ttatatttcc
     2161 ctatgatctt tacattattt atgtttattt ttattgctaa tttaattagt atgattccat
     2221 attcatttgc attatcagct catttagtat ttattatctc tttaagtatt gttatttgat
     2281 taggtaatac tattttaggt ttatataaac atggttgagt attcttctca ttattcgtac
     2341 ctgctggtac accattacca ttagtacctt tattagttat tattgaaact ttatcttata
     2401 ttgctagagc tatttcatta ggtttaagat taggttctaa tatcttagct ggtcatttat
     2461 taatggttat tttagctggt ttactattta attttatgtt aattaattta tttactttag
     2521 tattcggttt tgtaccttta gctatgatct tagctattat gattttagaa ttcgctattg
     2581 gtattatcca atcttatgtt tgacttatct taacagcatc atacttaaaa gatacattat
     2641 acttacatta aattataaaa taaaaattat ataaaaaaaa tataataata ataataatat
     2701 aaaataaaaa ataaaaaata aaacaatgaa aaaacaaaat ttaaattcta ttttattaat
     2761 gtatattaat tatattatta attattttaa taatattcat aaaaatcaat taaaaaaaga
//
LOCUS       YSCSPO7A     1846 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae SPO7 gene, complete cds, and FUN14 and FUN37 genes, 5'
            end.
ACCESSION   M36073
KEYWORDS    antimutator phenotype; premeiotic DNA replication.
SOURCE      S.cerevisiae (strain AB972) DNA, clone lambda-G12b and pLF31.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1846)
  AUTHORS   Whyte,W., Koepp,L.H., Lamb,J., Crowley,J.C. and Kaback,D.B.
  TITLE     Molecular cloning of chi I DNA from S.cerevisiae
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.B.Kaback, 05-JUL-1990.
FEATURES       from  to/span     description
    pept        564     1343     SPO7 protein
    pept       1624  >  1832     FUN14 protein
    pept        375  <     1 (c) FUN37 protein
BASE COUNT      556 a    308 c    415 g    567 t
ORIGIN      Chromosome I.
        1 gtcgactgtg gtgttgtcac tactcaacgt attcgcacta ctaacactga aattgaggtt
       61 tggttgcaat tgtctgtatg tttcggtggc atcttgtaat gggatatcag tagagttgcg
      121 catgaatttc tccaattgct gtgcatcgga gtataaataa ctcagagaac catttatcct
      181 ggacctcgta gaaaaatcta aagaattgaa tgtattggga gtagatttgt tggaaatttg
      241 caggtgtatt gctgagggaa ttcggaaatc taataatgtt ctcgatgtgg ccgttatatc
      301 ctcgtagcta ttttgcgtac tccaatgggt gctctgataa aatgccctta gtacttggtc
      361 catatagggt agcatcaaga tcggtcttct ctgttcgtgt ctttttccta acgtatattt
      421 gctttgtttc ttcactcaac aataaagtca aagtaaaatt aaatactaat tattcttaaa
      481 agggaagatg cgaaatttag cgaaaatcta ttgattatac acacaaagga agaaaggtag
      541 tggaaagcta aataaaggag gtcatggagc cagagagcat aggcgatgtg gggaaccatg
      601 cccaggatga tagtgccagt atagtgtccg ggcctcgcag gcgttctact agcaagacat
      661 ccagtgcgaa gaatatacgg aactccagta atatctctcc agcatcgatg attttcagga
      721 atttgttgat actggaggat gatttaagac gccaagctca cgaacaaaag atactgaagt
      781 ggcaattcac tttgttctta gcgtctatgg ccggtgtagg cgcatttacc ttctacgaac
      841 tttatttcac ttcagattat gtcaagggcc tccatagggt tattttgcaa ttcactcttt
      901 ctttcatttc cattactgta gttctttttc atatcagtgg acaatataga agaactatcg
      961 tcattccaag aagatttttt acctctacta ataaagggat taggcagttt aatgtgaagc
     1021 tagttaaagt acagtctacg tgggacgaga aatacacaga ttcagtaaga tttgtgagtc
     1081 gaacaattgc ttattgtaat atttattgtt tgaaaaaatt tctgtggctt aaagacgata
     1141 atgccattgt gaaattttgg aaaagtgtca cgatacaatc ccaaccgagg atcggagctg
     1201 tggatgtgaa attagtcctc aaccccagag catttagtgc agagattaga gaaggatggg
     1261 agatttatag agacgagttt tgggccaggg aaggtgctag aagacgcaaa caagcgcacg
     1321 aactccgacc taaatcagaa tgaaagagtt ggagggcttc ttccttcgaa taagaggtca
     1381 tatttaccta tgtaaaattg taaccatcta tgttcacaca taaattatat tttatacatt
     1441 attagaagtg aagctgttgt gtcgtgaaaa ttttacaaat ccgtcatttc atatttaagt
     1501 tttccaacaa gtgctagaaa acctaggggt tgttgaaatt ggttaaacaa ggcatcttat
     1561 tatacataca acagcataac gctagagggg caagaaggaa gaacttaaaa taataggtgt
     1621 aaaatgactt tggcttttaa tatgcaacgg ttggtgtttc gtaatttgaa tgttgggaag
     1681 cgcatgttca agaacgtccc cttatggagg tttaatgtcg ccaataaatt aggaaagccc
     1741 ttaactcgct ctgtagggtt aggcggtgct ggcatagttg ctggtggctt ttacttgatg
     1801 aatcgccagc cttctaagtt gatattcaat gattctttag gggcag
//
LOCUS       YSKERD2A     1248 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  K.lactis ER lumen protein retaining receptor (ERD2) gene, complete
            cds.
ACCESSION   M34844
KEYWORDS    ER lumen protein retaining receptor; HDEL receptor.
SOURCE      K.lactis (SEY2102) DNA.
  ORGANISM  Kluyveromyces lactis
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1248)
  AUTHORS   Lewis,M.J., Sweet,D.J. and Pelham,H.R.B.
  TITLE     The ERD2 gene determines the specificity of the luminal ER protein
            retention system
  JOURNAL   Cell 61, 1359-1363 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Lewis, 06-JUN-1990, for release after publication.
FEATURES       from  to/span     description
    pept        390      414     ER lumen protein retaining receptor, exon 1
                                 (ERD2)
                572     1206     ER lumen protein retaining receptor, exon 2
    IVS         415      571     ERD2 intron A
    site        415      420     consensus splice donor
    site        528      534     branchpoint sequence
    site        569      571     acceptor sequence
BASE COUNT      368 a    235 c    202 g    443 t
ORIGIN      
        1 gaattcttca ctcttcaccc ctcctctcta acgagactcg gataaacaga ctgtaacgat
       61 gacgacgacg acgatgatga taatgattat gatgatgatg attgaatgtg ccatttgacg
      121 tttttgacct gtaaacccct tttgtaaata gcctctgtcc tttttaagta cttatgcggt
      181 gtctgttaat acaatctttg tggctcacgt gatttattat ttttctcaat gtgggccaaa
      241 attgcaatct cttaactctg aaaaataaaa caaggcagta ctaatcgctt cttaaacact
      301 tgttctttaa agattgaaat aaactaccga catttgtcgt tctaagttaa attcaaatca
      361 aaacagcttt caattgagtg tgtcacatta tgttgaacgt tttcagaata gcaggtatgt
      421 gaaaagaact gattttatat cattgttgga cgtcaacctc gatcggaacg ttatatcttg
      481 gttctgaatg atgatttcca tcggcttatc tctactggca gcatctttac taacattgtt
      541 attaccaact attttgtaat cgagtgttta ggtgatttct ctcatttggc tagtatcatc
      601 attttgatac aatcgatcac aacatctaac tcagttgatg gtatctcatt gaaaactcaa
      661 ctgctataca ccttggtctt tatcacacgt tatttgaacc tatttaccaa atggacctcc
      721 ttgtacaact tcttaatgaa aattgttttc atttcatctt cggtttacgt cattgtgtta
      781 atgcgccaac aaaaatttaa aaaccctgtc gcatatcaag acatgatcac cagagatcaa
      841 tttaaaatca agtttttaat agtaccatgc attctcctag gattaatttt caattatcgt
      901 ttcagtttta tacaaatatg ctggtccttc tctctatggt tggaaagtgt tgcaatcctt
      961 cctcaattgt ttatgttgac taaaacaggt aaagcaaaac aattgacatc tcattatatt
     1021 tttgcattgg ggttataccg tgccttgtac attccaaatt ggatatggag gtattatacc
     1081 gaagaaagat tcgataaatt atcagttttc accggtgtga ttcaaacttt agtgtactct
     1141 gatttcttct acatctatta tcagaaggtt atcaaactcg gtggagattt agaattacca
     1201 caatagatga caggatttgc atctccactt gcagtcagtt cgaagctt
//
LOCUS       CELCKINII    3180 bp ds-DNA             INV       14-AUG-1990
DEFINITION  C.elegans casein kinase II-alpha gene, complete cds.
ACCESSION   J05274
KEYWORDS    casein kinase II-alpha.
SOURCE      C.elegans DNA.
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 3180)
  AUTHORS   Hu,E. and Rubin,C.S.
  TITLE     Casein kinase II from Caenorhabditis elegans: Properties and
            developmental regulation of the enzyme; Cloning and sequence
            analyses of cDNA and the gene for the catalytic subunit
  JOURNAL   J. Biol. Chem. 265, 5072-5080 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        250      288     casein kinase II-alpha, exon 1
                533      703     casein kinase II-alpha, exon 2
                763      890     casein kinase II-alpha, exon 3
                940     1024     casein kinase II-alpha, exon 4
               1084     1380     casein kinase II-alpha, exon 5
               1838     1962     casein kinase II-alpha, exon 6
               2018     2099     casein kinase II-alpha, exon 7
               2301     2456     casein kinase II-alpha, exon 8
    pre-msg     211     3073     casein kinase II-alpha mRNA and introns
    IVS         289      532     casein kinase II-alpha intron A
    IVS         704      762     casein kinase II-alpha intron B
    IVS         891      939     casein kinase II-alpha intron C
    IVS        1025     1083     casein kinase II-alpha intron D
    IVS        1381     1837     casein kinase II-alpha intron E
    IVS        1963     2017     casein kinase II-alpha intron F
    IVS        2100     2300     casein kinase II-alpha intron G
BASE COUNT      974 a    580 c    544 g   1082 t
ORIGIN      
        1 actttcgcag cgggaatatc tcaccacgca cgagttcaca tttggttttt tctctgcgct
       61 ctctcgcatt tgcgtactct tttcatgttc tggcatgtat tttcaaaatt attcgggttt
      121 tatcagtttt cccgagattt tactatttct actgaaaatt tcgttctaat aattcaatta
      181 taacagccgt tttgtcaaaa cttgctcaaa aataacgtgt tacttgtttc agatcgttat
      241 tcttgggaaa tgcctccgat tcctagccgg gcgagggtct acgccgaggt gagcaagttt
      301 gtatagccag gagttaatta ttgaacagaa tatattgctc atacataact cacaatgacg
      361 gaattaacgt tatccgtttt agttgtggaa tactgtgaaa cactgtacgc cacatttgtt
      421 agattttata aatgttcata ggaatttata gtctcgtaaa gaaaaactat ttgtaaaagt
      481 cactaaatta tgtttttatg agtttatata ttataaatca caattgtttc aggtgaatcc
      541 gtcgagacca cgcgagtact gggactatga ggctcacatg atcgaatggg gacaaatcga
      601 tgattatcag ctggtacgca agttgggacg tggaaaatat tcggaggttt ttgaaggatt
      661 caagatgagc actgacgaga aagttgttgt caagattctc aaggtaagtt ttacaaaaat
      721 aattgtaata aagctttcaa aatgtaattt gtttgttttc agccagtcaa aaagaagaag
      781 atcaagcgtg agattaaaat tttggagaat cttcgtggcg ggacaaacat catcaccctc
      841 ctcgacgtcg tcaaggatcc tatttcgaga actccggctc ttattttcga gttggtttac
      901 tatttgaaga tttttagaat taataatttc tatttgaagg catgttaaca attcggactt
      961 caagcaattg tatcaaaccc tcagtgacta tgatatccgt tactacctct acgagctctt
     1021 gaaggtactt ttttcgtctg tccattaatt tttgaattca taacgtttca atttattctc
     1081 caggctctcg acttctgcca ttctcaagga attatgcaca gagatgttaa gccgcacaat
     1141 gtgatgattg acgccgaaaa acgggaactt cgattaatag attggggatt agccgaattc
     1201 tatcatccaa gacaggacta taatgttcgt gttgcttcta gatatttcaa gggacccgag
     1261 cttcttgtcg attatcaatg ttacgattat tcgttggata tgtggagttt gggatgcatg
     1321 ttggctagta tgatcttcag gaaggaacca ttcttccacg gccatgataa ctacgatcag
     1381 gtaagcactt taagctgaag tagcaaagaa ccgcaaatcc acaaatttca ttaaaatcca
     1441 atattggaat atagaaaaca tttgatttat ctgattatgg cttttatttc agtgagaaca
     1501 atggggaaaa acgcttaaaa aattaccgga aagtttaaaa tttccgaaaa aaaccgaaaa
     1561 ttttccaatt ttgagcattt ttcaattgtt ccagctgcaa aaaactattt tcagataaag
     1621 aaattatttt tttgaattcc atagctaaaa ttttactgat aagtttggct tattccgtag
     1681 aatttcttgc cggcgattga ttcgcacccc tgagctgaaa tttaaatgaa taaataatat
     1741 tgtcagggta tttattaaat attcatcaat aatttttatg atttttaata tttataaatt
     1801 cacgaaatct gattttctta taaaaagttt tttgcagcta gtccgcattg caaaggtatt
     1861 gggaactgat gagctttacg agtacattgc gagatatcac attgatctcg atccacgctt
     1921 caacgatatt ctcgggcgac actcaagaaa acgctgggaa aggtatattt tgatcgaagc
     1981 gaagtttcat gtttttctta aaattcattt gttgtagatt tatccacgct gaaaatcaac
     2041 atcttgtcac cccagaggct cttgatttcc ttgataaact tcttcgctac gatcacgcgg
     2101 taagaatcgg aaaaaagaga aattacacca aattagttac tgaaaaccag tttaacaaga
     2161 acacagagaa ataaaaacaa ttttcacttg ttttattgaa aaaaaagaca tttgatactc
     2221 gaaactaaac tataaaatat cttttatttg ttaaattaat ttatttccag aatcttttaa
     2281 ttaaatttca ccatattcag gaacgcctaa ctgctcaaga ggccatgggt cacgaatact
     2341 tccgcccagt tgttgaagca catgctcgtg ccaatggaac tgaacaagct gacggacagg
     2401 gtgcatcgaa ttccgcttct tctcaatcct ccgatgccaa aattgacggc gcttaaaatt
     2461 tctttctatt ttttttttaa ttttcctgtt gactatttcc ttttttatta atttccacac
     2521 cactcctctc catcattatt cctttgctct aaaaattggc taagatatta aatcttcccc
     2581 gcttctccga aaatttccct ctacatacca caaaagtgcc taatcttttc gtgagaaacg
     2641 ttctttgcaa attctatttt taggattttt ttgtgtaatt catgcctctg aaattccgtt
     2701 tcaattttaa atcttttttg tcccatttca tattgttctc cttcttctcc cctttcctct
     2761 tcctcaaaca accacaatca tcatttttat aaatcgaatc acagttcgtg agaaaaaaaa
     2821 ctgttacttg tatatcaata ttgtctttgg ttttgaaaaa cctactattg ttcctatgtt
     2881 gagttgtgaa ctatgttcac ttttagagtt ttagatttaa aatgctcaac ggacgagtgc
     2941 tccttcgatc gattgctcgc cggacaatgg catcgaaaat tgagccagga tctcgcagcc
     3001 gatttatttg gatgtccagg ctaccgcgcc aatggtaatt aaatggaaat aataaagcat
     3061 gattaaaaat aatataaaaa acttgtatct ggaaaaaatt aacgaaactt ttcagataca
     3121 tttagatttc gaatgagttt ttttctaact aggcacagtt tagctagtag gcattctaaa
//
LOCUS       DMORGX       1650 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  D.mobilis ribosomal 16S RNA gene.
ACCESSION   M36474
KEYWORDS    ribosomal 16S RNA.
SOURCE      D.mobilis DNA.
  ORGANISM  Desulfurococcus mobilis
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Thermoproteales; Desulfurococcaceae.
REFERENCE   1  (bases 1 to 1650)
  AUTHORS   Kjems,J., Garrett,R.A. and Ansorge,W.
  TITLE     The sequence of the 16S RNA gene and its flanking region from the
            archaebacterium Desulfurococcus mobilis
  JOURNAL   Syst. Appl. Microbiol. 9, 22-28 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA         71     1565     16S ribosomal RNA
BASE COUNT      323 a    492 c    602 g    233 t
ORIGIN      
        1 ccgacgaggg ggagggagcc acttaagccg agcgctccag aaaccccggc cccaacccca
       61 agcggccgta actccggttg atcctgccgg tcccgaccgc tatcggggtg gggctaagcc
      121 atgggagtcg cacgctccgc cgctgcgggg cgtggcggac ggctgagtaa cacgtggcta
      181 acctaccctc gggaggggga taacaccggg aaactggtgc taatccccca taggggagga
      241 ggcctggaag ggttcctccc cgaaagggtg tggcaggggt taacgctgct acaccgcccg
      301 aggatggggc tacggcccat taggttgttg gcggggtaac ggcccgccaa gccgataatg
      361 ggtaggggcc gtgagagcgg gagcccccag atgggcactg agacaagggc ccaggcccta
      421 cggggcgcac caggcgcgaa acctccgcaa tgcgggaaac cgtgacgggg ccaccccgag
      481 tgccccctta cgggggcttt tccccgctgt aggaaggcgg gggaataagc ggggggcaag
      541 tctggtgtca gccgccgcgg taataccagc cccgcgagtg gtcgggacga ttattgggcc
      601 taaagcgccc gtagccggcc cggcaagtcc cctcctaaat tcccgggctc aacccgggga
      661 ctggagggga tactgccggg ctagggggtg ggagaggccg agggtactcc cggggtaggg
      721 gcgaaatcct ataatcccgg gaggaccacc agtggcgaag gcgctcggct ggaacacgcc
      781 cgacggtgag gggcgaaagc cgggggagcg aaccggatta gatacccggg tagtcccggc
      841 tgtaaacgat gcgggctagg tgttgggtgg gcttagagcc cacccagtgc cgcagggaag
      901 ccgttaagcc cgccgcctgg ggagtacggc cgcaaggctg aaactcaaag gaattggcgg
      961 gggagcacca caaggggtgg agcctgcggt tcaattggag tcaacgccgg gaatctcacc
     1021 gggggagaca gcaggatgac ggccaggtta aaggccttgc ctgacgcgct gagaggaggt
     1081 gcatggccgt cgccagctcg tgctgtgaag tgtccggtta agtccggaaa cgagcgagac
     1141 ccccacccct agttgctacc cggggctacg gctccggggc acactagggg gactgccgcc
     1201 gtttaaggcg gaggaaggag ggggccacgg caggtcagca tgccccgaac cccccgggct
     1261 acacgcgggc tacaatggcg gggacagcgg gatccgaccc cgaaaggggg aggcaatccc
     1321 tcaaaccccg ccgtggttgg gatcgagggc tgcaactcgc cctcgtgaac gaggaatccc
     1381 tagtaaccgc gcgtcaacat cgcgcggtga atacgtccct gctccttgca cacaccgccc
     1441 gtcgctccac ccgaggggag ggggagtgag gcccggcccc ttgggtcggg tcgaactccc
     1501 cctccctgag gggggagaag tcgtaacaag gtagccgtac cggaaggtgc ggctggatca
     1561 cctcctgcct caggccgggg ctggggccgg ggctggaatg cgctaaggct taagtggctc
     1621 ccgcctcctc tattcattga tgcaacacgg
//
LOCUS       ECOHBB221    3703 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  E.coli beta-globin gene, complete cds.
ACCESSION   M34058
KEYWORDS    beta-globin.
SEGMENT     1 of 2
SOURCE      E.coli (strain HB101) DNA, isolated from patient GD.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3703)
  AUTHORS   Thein,S.L., Hesketh,C., Taylor,P., Temperley,I.J., Hutchinson,R.M.,
            Old,J.M., Wood,W.G., Clegg,J.B. and Weatherall,D.J.
  TITLE     Molecular basis for dominantly inherited inclusion body
            beta-thalassemia
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.L.Thein, 07-MAY-1990.
            
                Author address: S.L.Thien
                Institute of Molecular Medicine,
                MRC Molecular Haematology Unit
                John Radcliffe Hospital
                Oxford, OX3 9DU U.K.
FEATURES       from  to/span     description
    pept       1612     1703     beta-globin, exon 1
               1834     2056     beta-globin, exon 2
               2907  +  3015     beta-globin, exon 3
    IVS        1704     1833     beta-globin intron A
    IVS        2057     2906     beta-globin intron B
    IVS        3016  >  3703     beta-globin intron C
    signal     1484     1492     CAAT box
    signal     1532     1538     TATA box
BASE COUNT     1055 a    707 c    736 g   1205 t
ORIGIN      
        1 tctatttatt tagcaataat agagaaagca tttaagagaa taaagcaatg gaaataagaa
       61 atttgtaaat ttccttctga taactagaaa tagaggatcc agtttctttt ggttaaccta
      121 aattttattt cattttattg ttttatttta ttttatttta ttttatttta ttttgtgtaa
      181 tcgtagtttc agagtgttag agctgaaagg aagaagtagg agaaacatgc aaagtaaaag
      241 tataacactt tccttactaa accgacatgg gtttccaggt aggggcagga ttcaggatga
      301 ctgacagggc ccttagggaa cactgagacc ctacgctgac ctcataaatg cttgctacct
      361 ttgctgtttt aattacatct tttaatagca ggaagcagaa ctctgcactt caaaagtttt
      421 tcctcacctg aggagttaat ttagtacaag gggaaaaagt acagggggat gggagaaagg
      481 cgatcacgtt gggaagctat agagaaagaa gagtaaattt tagtaaagga ggtttaaaca
      541 aacaaaatat aaagagaaat aggaacttga atcaaggaaa tgattttaaa acgcagtatt
      601 cttagtggac tagaggaaaa aaataatctg agccaagtag aagacctttt cccctcctac
      661 ccctactttc taagtcacag aggctttttg ttcccccaga cactcttgca gattagtcca
      721 ggcagaaaca gttagatgtc cccagttaac ctcctatttg acaccactga ttaccccatt
      781 gatagtcaca ctttgggttg taagtgactt tttatttatt tgtatttttg actgcattaa
      841 gaggtctcta gttttttatc tcttgtttcc caaaacctaa taagtaacta atgcacagag
      901 cacattgatt tgtatttatt ctatttttag acataattta ttagcatgca tgagcaaatt
      961 aagaaaaaca acaacaaatg aatgcatata tatgtatatg tatgtgtgta catatacaca
     1021 tatatatata tatttttttt cttttcttac cagaaggttt taatccaaat aaggagaaga
     1081 tatgcttaga actgaggtag agttttcatc cattctgtcc tgtaagtatt ttgcatattc
     1141 tggagacgca ggaagagatc catctacata tcccaaagct gaattatggt agacaaaact
     1201 cttccacttt tagtgcatca atttcttatt tgtgtaataa gaaaattggg aaaacgatct
     1261 tcaatatgct taccaagctg tgattccaaa tattacgtaa atacacttgc aaaggaggat
     1321 gtttttagta gcaatttgta ctgatggtat ggggccaaga gatatatctt agagggaggg
     1381 ctgagggttt gaagtccaac tcctaagcca gtgccagaag agccaaggac aggtacggct
     1441 gtcatcactt agacctcacc ctgtggagcc acaccctagg gttggccaat ctactcccag
     1501 gagcagggag ggcaggagcc agggctgggc ataaaagtca gggcagagcc atctattgct
     1561 tacatttgct tctgacacaa ctgtgttcac tagcaacctc aaacagacac catggtgcac
     1621 ctgactcctg aggagaagtc tgccgttact gccctgtggg gcaaggtgaa cgtggatgaa
     1681 gttggtggtg aggccctggg caggttggta tcaaggttac aagacaggtt taaggagacc
     1741 aatagaaact gggcatgtgg agacagagaa gactcttggg tttctgatag gcactgactc
     1801 tctctgccta ttggtctatt ttcccaccct taggctgctg gtggtctacc cttggaccca
     1861 gaggttcttt gagtcctttg gggatctgtc cactcctgat gctgttatgg gcaaccctaa
     1921 ggtgaaggct catggcaaga aagtgctcgg tgcctttagt gatggcctgg ctcacctgga
     1981 caacctcaag ggcacctttg ccacactgag tgagctgcac tgtgacaagc tgcacgtgga
     2041 tcctgagaac ttcagggtga gtctatggga cccttgatgt tttctttccc cttcttttct
     2101 atggttaagt tcatgtcata ggaaggggag aagtaacagg gtacagttta gaatgggaaa
     2161 cagacgaatg attgcatcag tgtggaagtc tcaggatcgt tttagtttct tttatttgct
     2221 gttcataaca attgttttct tttgtttaat tcttgctttc tttttttttc ttctccgcaa
     2281 tttttactat tatacttaat gccttaacat tgtgtataac aaaaggaaat atctctgaga
     2341 tacattaagt aacttaaaaa aaaactttac acagtctgcc tagtacatta ctatttggaa
     2401 tatatgtgtg cttatttgca tattcataat ctccctactt tattttcttt tatttttaat
     2461 tgatacataa tcattataca tatttatggg ttaaagtgta atgttttaat atgtgtacac
     2521 atattgacca aatcagggta attttgcatt tgtaatttta aaaaatgctt tcttctttta
     2581 atatactttt ttgtttatct tatttctaat actttcccta atctctttct ttcagggcaa
     2641 taatgataca atgtatcatg cctctttgca ccattctaaa gaataacagt gataatttct
     2701 gggttaaggc aatagcaata tttctgcata taaatatttc tgcatataaa ttgtaactga
     2761 tgtaagaggt ttcatattgc taatagcagc tacaatccag ctaccattct gcttttattt
     2821 tatggttggg ataaggctgg attattctga gtccaagcta ggcccttttg ctaatcatgt
     2881 tcatacctct tatcttcctc ccacagctcc tgggcaacgt gctggtctgt gtgctggccc
     2941 atcactttgg caaagaattc accccaccag tgcagccaca cctatcagtg gtgtggctaa
     3001 tgccctggcc cacaagtatc actaagctcg ctttcttgct gtccaatttc tattaaaggt
     3061 tcctttgttc cctaagtcca actactaaac tgggggatat tatgaagggc cttgagcatc
     3121 tggattctgc ctaataaaaa acatttattt tcattgcaat gatgtattta aattatttct
     3181 gaatatttta ctaaaaaggg aatgtgggag gtcagtgcat ttaaaacata aagaaatgaa
     3241 gagctagttc aaaccttggg aaaatacact atatcttaaa ctccatgaaa gaaggtgagg
     3301 ctgcaaacag ctaatgcaca ttggcaacag ccctgatgcc tatgccttat tcatccctca
     3361 gaaaaggatt caagtagagg cttgatttgg aggttaaagt tttctatgct gtattttaca
     3421 ttacttattg ttttagctgt cctcatgaat gtcttttcac tacccatttg cttatcctgc
     3481 atctctcagc cttgactcca ctcagttctc ttgcttagag ataccacctt tcccctgaag
     3541 tgttccttcc atgttttacg gcgagatggt ttctcctcgc ctggccactc agccttagtt
     3601 gtctctgttg tcttatagag gtctacttga agaaggaaaa acagggggca cggtttgact
     3661 gtcctgtgag cccttcttcc ctgcctcccc cactcacagt gac
//
LOCUS       ECOHBB222     552 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  E.coli beta-globin gene, complete cds.
ACCESSION   M34059
KEYWORDS    beta-globin.
SEGMENT     2 of 2
SOURCE      E.coli (strain HB101) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 552)
  AUTHORS   Thein,S.L., Hesketh,C., Taylor,P., Temperley,I.J., Hutchinson,R.M.,
            Old,J.M., Wood,W.G., Clegg,J.B. and Weatherall,D.J.
  TITLE     Molecular basis for dominantly inherited inclusion body
            beta-thalassemia
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.L.Thein, 07-MAY-1990.
            
                Author address: S.L.Thein
                Institute of Molecular Biology,
                MRC Molecular Haematology Unit
                John Radcliffe Hospital,
                Oxford, OX3 9DU, U.K.
FEATURES       from  to/span     description
    pept    +   169      272     beta-globin, exon 4
    IVS     <     1      168     beta-globin intron C
BASE COUNT      153 a    113 c    108 g    178 t
ORIGIN      
        1 tctgcatata aattgtaact gatgtaagag gtttcatatt gctaatagca gctacaatcc
       61 agctaccatt ctgcttttat tttatggttg ggataaggct ggattattct gagtccaagc
      121 taggcccttt tgctaatcat gttcatacct cttatcttcc tcccacagct cctgggcaac
      181 gtgctggtct gtgtgctggc ccatcacttt ggcaaataat tcaccccacc agtgcaggct
      241 gcctatcaga aagtggtggc tggtgtggct aatgccctgg cccacaagta tcactaagct
      301 cgctttcttg ctgtccaatt tctattaaag gttcctttgt tccctaagtc caactactaa
      361 actgggggat attatgaagg gccttgagca tctggattct gcctaataaa aaacatttat
      421 tttcattgca atgatgtatt taaattattt ctgaatattt tactaaaaag ggaatgtggg
      481 aggtcagtgc atttaaaaca taaagaaatg aagagctagt tcaaaccttg ggaaaataca
      541 ctatatctta aa
//
LOCUS       HUMETN1       682 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human endothelin (ETN) gene, complete cds.
ACCESSION   M25377 J04819 J05489
KEYWORDS    endothelin.
SEGMENT     1 of 4
SOURCE      Human leukocyte DNA and human umbilical vein endothelial cell cDNA
            to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 682)
  AUTHORS   Bloch,K.D., Friedrich,S.P., Lee,M.-E., Eddy,R.L., Shows,T.B. and
            Quertermous,T.
  TITLE     Structural organization and chromosomal assignment of the gene
            encoding endothelin
  JOURNAL   J. Biol. Chem. 264, 10851-10857 (1989)
  STANDARD  simple staff_review
REFERENCE   2  (bases 2 to 421)
  AUTHORS   Lee,M.-E., Bloch,K.D., Clifford,J.A. and Quertermous,T.
  TITLE     Functional analysis of the endothelin-1 gene promoter: Evidence for
            an endothelial cell-specific cis-acting sequence
  JOURNAL   J. Biol. Chem. 265, 10446-10450 (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer readable copy of sequence [1] kindly
            submitted by K.D.Bloch (06-AUG-89).
FEATURES       from  to/span     description
    pept        519  +   582     endothelin prepropeptide, exon 1
                                 /nomgen="EDN1" /map="unassigned"
                                 /hgml_locus_uid="LU0066V"
    pre-msg     252  >   682     ETN mRNA and introns (5' end put.)
    IVS         583  >   682     ETN intron A
BASE COUNT      121 a    178 c    169 g    213 t      1 others
ORIGIN      Chromosome 6.
        1 ggcaggttta gcaaaggtct ctaatgggna ttttattttt cttagccctg cccccgaatt
       61 gtcagacggc ggcgtctgcc tctgaagtta gcagtgattt ctttcgggcc tggccttatc
      121 tccggctgca cgttgcctgt tggtgactaa taacacaata acattgtctg gggctggaat
      181 aaagtcggag ctgtttaccc ccactctaat aggggttcaa tataaaaagc cggcagagag
      241 ctgtccaagt cagacgcgcc tctgcatctg cgccaggcga acgggtcctg cgcctcctgc
      301 agtcccagct ctccaccgcc gcgtgcgcct gcagacgctc cgctcgctgc cttctctcct
      361 ggcaggcgct gcttttctcc ccgttaaagg gcacttgggc tgaaggatcg ctttgagatc
      421 tgaggaaccc gcagcgcttt gagggacctg aagctgtttt tcttcgtttt cctttgggtt
      481 cagtttgaac gggaggtttt tgatcccttt ttttcagaat ggattatttg ctcatgattt
      541 tctctctgct gtttgtggct tgccaaggag ctccagaaac aggtaggcac gctcgttgac
      601 ttgtaagtct cggaattaca agttagtgtg ttcttatcca ccttcatgct tttcttgctt
      661 ctatttttcc ccgttctttt ta
//
LOCUS       MUSKER19     5422 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse keratin 19 gene, complete cds.
ACCESSION   M36120
KEYWORDS    keratin 19.
SOURCE      Mouse (strain C34/HeNCrlBR) embryo fibroblast DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5422)
  AUTHORS   Lussier,M., Filion,M., Compton,J.G., Nadeau,J.H., Lapointe,L. and
            Royal,A.
  TITLE     The mouse keratin 19-encoding gene: Sequence, structure and
            chromosomal assignment
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by M.Lussier, 06-JUL-1990.
FEATURES       from  to/span     description
    pept        375      803     keratin 19, exon 1
               3728     3810     keratin 19, exon 2
               4029     4185     keratin 19, exon 3
               4553     4714     keratin 19, exon 4
               4832     4957     keratin 19, exon 5
               5070     5324     keratin 19, exon 6
    pre-msg     304  >  5422     keratin 19 mRNA and introns
    IVS         804     3727     keratin 19 intron A
    IVS        3811     4028     keratin 19 intron B
    IVS        4186     4552     keratin 19 intron C
    IVS        4715     4831     keratin 19 intron D
    IVS        4958     5069     keratin 19 intron E
    signal      184      188     CAAT box
    signal      278      284     ATA box
    signal     5402     5408     poly-A signal
BASE COUNT     1255 a   1429 c   1431 g   1307 t
ORIGIN      
        1 gatatcgtca ctccggaagt tagagttgtc taagcctgtg tagtaagggg ctgaagggcc
       61 agaaaaggga cgtgacatgt tggcagtagc tttggagtgg gctggggcgg ggcagctctg
      121 ggaaggactg agacctctgg ctcctgggag gggagaggta ggagcagaat cgccaggaat
      181 tgaccaatgg ggaaagagcc catatttgca ctctgggagc ttggaaattt ctgatacccg
      241 ccccttcaac atctccatcc cccttcccgc cccgggcata aaaagccaca ggtgagggcc
      301 ttgtcactcc tcctgcggcc agcagttctc agacctgcgt ccctttttcc ttcgctctgg
      361 tctccctcct catcatgact tcctatagct atcgccagac ctcagctatg tcttcctttg
      421 ggggtacggg cgggggttca gtacgcattg ggtcaggggg tgttttccgc gcacccagca
      481 tccacggggg ctcaggtggc cgcggcgtgt ccgtgtcctc cacccgcttc gtgacctcgt
      541 cctccgggag ctatggcgga gtccgcggtg gaagttttag tgggaccctg gctgtgtctg
      601 atgggctgct gtctggcaat gagaagatca ccatgcaaaa cctcaatgat cgtctcgcct
      661 cctacttgga caaggtgcgc gccctagagc aggccaatgg cgagctggag gtgaagatcc
      721 gcgactggta ccagaagcag ggacccggac cctcccgaga ttacaaccac tactttaaga
      781 ccatcgagga cttgcgcgac aaggtgggta gcggccaggc cctgggaggt gcacctgttg
      841 gggtggtgcg gcagggcggg gaggttgtgg aaactcagcc cgcgggtgga caagcccgtt
      901 actacgggcc gggcaggaac ccgcagggtg gggcacaaag tctgggctgg accctgggat
      961 tcgagtggag gcggggtgga agggaactga gacaggttaa gtgggtgggg ctttcggaag
     1021 cggtggggac ctgcttggat agtcagggat atctatccaa agagagcatt taacacagct
     1081 ttgggctgcc cagagatgaa gataccgagc cgcagcctca gctgggggtg gggtgttggg
     1141 gagggctggg gagctgccgg atattagtct gtaggctcag aggtcacctg agcctgtggc
     1201 cctcttctgg gactcttgag cattgctttc tccagttgtg tgggttgttc ctgtctggtt
     1261 ttctttacag cagctgtcct ctcactctcc tcctctccag cccaggggac cacctaggct
     1321 cacacggagc tacgagaaga atgtaaaacc tgggggctgc acagtaaaac cagatcttaa
     1381 aaaaaaaaac ttccagtgag gtgtggtagc tcaggcctgt aatcccagga gcaggtggat
     1441 gagttcatag ctatgcccag ctacagttaa cttgaatcta gtcagggcta catgagtctc
     1501 taggctccca cccggccgtc aaacagcccg aacagacaga cagacagaca gacagacaga
     1561 cagacagaca gacaacaact ctaacacgaa acaaaagaaa aataaaaaca attattttag
     1621 taagtaagag ggaggtggaa gtcttgaccc tctgcttctg tggacctctc tcctggagtc
     1681 ccccggtcca gggaagacaa cctgcaagtc tctgtgggct ccaacatcca acaggctggt
     1741 tgggtggggc attgccctca ctcctaggga agagctatgt cctgaagaaa tgtaccagag
     1801 accaggcagg cgtttaatag cgtcctctcg ctagtgctcc caacatgcag ttctcaagag
     1861 cattttccag ggctgtgtga acttagcatg gagctagaca cacaacacag caagcagcta
     1921 gagaggtgtt tttttgaccc agtgatggac agctagaatc ctaggcggta cacatctttc
     1981 tttctcttgc tttgtccctt ccccttccac acactcccag cacgcatatc tagaagcaat
     2041 tcttactccg ctgtgaatgc tctcctccgg gtttttatat tctgttgttt tcgtcaacaa
     2101 ccccccaccc ccaaactatc tatgtagctg agcgtggtgg ctgatacgtg gtggcttatc
     2161 cagcattgag aaactgaggc agaaagaagg acgatgaagt taatgccagt tgaaccatac
     2221 agtgagactc tgtctcaaaa tgtcaaacca gagctggggc gcagtggcac acacctttaa
     2281 tcccagcact tggggaggca gagaaggtgg atctctgtga attcaaggcc agtctacaga
     2341 gtgagtccag gacagccagg gctacaaaga gaaatcctgt ctcgaacccc ctagtccttc
     2401 acccccaaag tctaacaaac cagaggaaac caacccaacc caacccaacc caacccaacc
     2461 caacccaacc caacccaacc aaaaacatta aaccaaaata atcctgcaga ttccaacctg
     2521 gtgacaggat aagtggccat gaccttaatc tttgaaaaac actgaaccct ggtactgaat
     2581 tggtggtaaa cttaatgatt tgagtgaacc cctcaaacca gagtcttcat attgggtaaa
     2641 ctcaggattt ccagtcacgt acagtggcag tactaaccta ggagtgaggc tgagttacca
     2701 tgaagagagt acaaggtgcc tggcacatca ccgaagcaga atgtgctcta agtgtgggac
     2761 acagacaggg ataagtgttg cttttctttt ctttaggatt tatttattta cttatctatt
     2821 aatttattta tatgtatatt ggtgttttgc ttgcatgtat gtctgtgtga ggaacttgaa
     2881 ttgcaaacag ttgtgagctg ccacgagggt gctgcaaatt gaactctgtt cctttggaag
     2941 aatatccagg gctcttaatc tccaagttac ctctccagtc cctgggataa atgatccatc
     3001 catccttcct tccttccttc cttccttcct tccttccttc cttccttcct tccgtctttc
     3061 ctctttcttt cttttttttt tttttttggt ttttggtttt tggttttttg agacagagtt
     3121 tctctgtgta gccctggctg gctcaaactc agaaatcctt ctgcctctgc ctcccaagtg
     3181 gtggggttaa aggcatgtgc caccactgcc tggagggata aatgattctt ttttttttaa
     3241 gatttattta tttattatat ctaagtacac tgtagctgtc ttcagacact ctggaagaga
     3301 gcgtcagatc tagtttcgga tggttgtgag ccaccatgtg gttgctggga tttgaactca
     3361 ggaccttagg aagagcagtc agtgctctta accgctgagc catctctcca gcccaggata
     3421 aatgattctt aaagcttcat gtttctaggt ccttttcaga tcacttagag tttcacctgc
     3481 acaggtcttg gccatgccct tctggaattg cctctttgcc tgcctccctt cacccaacct
     3541 cctttacagt gtgttgtgct ccagctctgg ggctgtggat cgatgccaaa gtctcccaga
     3601 gagcagacta gtgggtagtg tcaagggagt gggggctatc cattcctctc tccttctctg
     3661 ggaagaaagt agattgataa tactggcttg ccaaactggt gccaactgcc agcctcctcc
     3721 cttctagatt cttggtgcca ccattgacaa ctccaagatt gtcctacaga ttgacaatgc
     3781 tcgcctggct gcagatgact tcagaaccaa gtaagtgtcc ctgtattggg ggctgcggaa
     3841 gccagggcaa gtgagaggag agcagcggca ggtcttatag tgcttgatga agtagaggcc
     3901 tggactctgc ccttctccac ttgagctggc tcaaagcaat gccccccctc cccgactcct
     3961 aaaccctggg aaagttggca gagacacaga tccagcacta gcctgactgc aactatttct
     4021 tcggaaaggt ttgagacaga acacgccttg cgtctgagcg tggaggccga catcaacggc
     4081 ctgcgccggg tgctggatga gctgactctg gccaggactg acctggagat gcagattgag
     4141 agcctgaagg aggagctggc ctacctgaag aagaaccatg aggaggtggg tcaagctggg
     4201 ccttctgtcc attttgtccc aggattggga gaccctatta tcttgggctt actgaccatg
     4261 acctcatctc ttggccctgg ctttgatcat gcctctggca gtctcagcag ggccatgaca
     4321 tcatcactat gatgacctta ctgaccatga catcatcact atggccctgg ctttgatcat
     4381 gcctctcgca gtctcagcag ggcatggtaa ccatgacatc tactttccag tgctggctcc
     4441 tagatttgta cctccaaacc agccgtcttc agctcagggc ttgggggagt gacctgatcc
     4501 aagttcagca tgtgtaggtc ttgacatgcc cttgactgtg gtgtcttcac aggaaattac
     4561 tgccctgagg agccaggtgg gtggccaggt cagtgtggag gtggattcca ctcccggtgt
     4621 cgacctagcc aagatcctga gtgagatgag aagtcagtat gagatcatgg ccgagaagaa
     4681 ccggaaggat gctgaagcca cctaccttgc tcgggtatgc agaggatgct ggtatcccgt
     4741 gagggtggcg aggagagacc ctgaaccctc acccatccac ccgcagagag aagcttgttg
     4801 cttattttct ccttaacttg tcttgttaca gattgaggag ctgaacaccc aggtcgccgt
     4861 ccactctgag cagatccaga taagcaagac cgaagtcacg gaccttcgac ggaccctcca
     4921 gggccttgag attgagctgc agtcccagct cagcatggta cgtgtctcca ccctacggcc
     4981 tgcacacttg tgccctggca acctcggtgc ctgtgcccgt gttcaggaac ctgcctctgc
     5041 tgagtcacgc cccctttccc tcccctcaga aagctgccct ggaaggcacg ctggcagaga
     5101 cggaggcccg ttatggagtc cagctgtcac agatccagag cgtgatcagc ggttttgaag
     5161 cccagctgag cgacgtgcgt gccgacatag agcgccagaa ccaggagtat aagcagctca
     5221 tggacatcaa gtccaggctg gagcaggaga tcgccaccta ccgcagcctg ctggagggcc
     5281 aggaagccca ctacaacaat ctgcccaccc ccaaggccat ctgagctacc agcgagactc
     5341 ccctgggaag gggcctgact ggggtgataa aagtttactc taacccctcc ctcgacttgt
     5401 caataaaact atcctccaag gg
//
LOCUS       VIBHEMOX     1586 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  V.parahaemolyticus thermolabile hemolysin (TL) gene, complete cds.
ACCESSION   M36437
KEYWORDS    thermolabile hemolysin.
SOURCE      V.parahaemolyticus DNA.
  ORGANISM  Vibrio parahaemolyticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 1586)
  AUTHORS   Taniguchi,H., Hirano,H., Kubomura,S., Higashi,K. and Mizuguchi,Y.
  TITLE     Comparison of the nucleotide sequences of the genes for the
            thermostable direct hemolysin and the thermolabile hemolysin from
            Vibrio parahaemolyticus
  JOURNAL   Microb. Pathog. 1, 425-432 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        124     1380     thermolabile hemolysin precursor (TL)
    sigp        124      183     thermolabile hemolysin signal peptide
    matp        184     1377     thermolabile hemolysin
BASE COUNT      466 a    378 c    361 g    381 t
ORIGIN      
        1 tcgactgtct ggagtattta ctcagaatta gagggtttag tgagaaaaat tctcatctaa
       61 acaatgttat agccaagtat tttttcaatg tgcttgggtc aataaccatt acaagaagag
      121 tgaatgatga aaaaaacaat cacactatta actgcattac tcccgcttgc ttctgcagtt
      181 gccgaagagc caaccttatc accagaaatg gtttcagcgt ctgaagtgat cagcacgcaa
      241 gaaaaccaaa cctataccta tgttcgctgt tggtatcgca ccagctactc gaaagatgat
      301 ccagcgaccg attgggaatg ggcaaaaaac gaagatggta gctacttcac cattgacggc
      361 tactggtgga gctccgtttc atttaaaaac atgttctaca ccaacacgtc gcaaaacgtt
      421 atccgtcagc gttgtgaagc aacattagat ttggcgaacg agaacgcaga cattacgttc
      481 ttcgccgctg acaatcgctt ctcatacaac cacacgatct ggagcaacga cgcagcaatg
      541 cagccagatc aaatcaacaa agtggttgca ctcggtgaca gcttgtctga tacaggcaac
      601 atctttaacg catcacaatg gcgcttccct aacccgaaca gctggttctt aggtcacttc
      661 tccaacggtt ttgtgtggac agaatacatt gccaaagcga agaaccttcc gctctacaac
      721 tgggcagttg gcggcgcggc tggtgagaac caatacatcg cgctaacagg ggttggtgat
      781 caagtttctt cgtacttaac ctacgcaaaa ctggcgaaga actacaaacc agcaaacacc
      841 ttgtttacgc ttgagtttgg tttgaatgac ttcatgaact acaaccgtgg cgttccagaa
      901 gtgaaagcgg attatgcaga agcactgatt cgtttgacgg acgcaggtgc gaagaacttc
      961 atgttgatga cactgccaga tgcgacgaaa gcgcctcagt ttaagtactc aacacaagaa
     1021 gagatcgaca aaattcgtgc gaaagtgctt gagatgaacg agttcatcaa ggcacaagcg
     1081 atgtactaca aagcgcaagg ttacaacatc acgttgtttg atactcacgc cttgttcgag
     1141 acgctaactt ctgcgcccga agagcacggt ttcgtgaacg cgagcgatcc ttgtttggac
     1201 atcaaccgct catcgtctgt cgattacatg tacacccacg cattgcgctc tgagtgtgca
     1261 gcgtctggtg ctgagaagtt tgtgttctgg aatgtcacgc atccaacaac agcaactcac
     1321 cgctatgttg cagagaaaat gctagaaagt agcaacaact tagccgagta ccgtttctaa
     1381 ccggacacgg cttctgagtt gaaaccttat cttcgtacac acgttgataa cgaacacatc
     1441 gtggccattt ttatcgaagg aacgttgtgg tcacagcagt cacaacgcta aacaagttac
     1501 agtggcgcga cgtcggttcc ccctaaagac tggtaaagcg taacctgagt gagaaactgg
     1561 ttgtagcggt tctccaacaa agaagc
//
LOCUS       CELCAPKCC1    195 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
            exon 1.
ACCESSION   M37114 J05289 M35424
KEYWORDS    cAMP-dependent protein kinase catalytic subunit C.
SEGMENT     1 of 7
SOURCE      C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
            lambda-CC[2,3,5].
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 195)
  AUTHORS   Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
  TITLE     Cloning, characterization, and expression of the gene for the
            catalytic subunit of cAMP-dependent protein kinase in
            Caenorhabditis elegans
  JOURNAL   J. Biol. Chem. 265, 6896-6907 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        121  +   190     cAMP-dependent protein kinase catalytic
                                 subunit C, exon 1
    pre-msg       1  >   195     cAMP-dependent protein kinase catalytic
                                 subunit C mRNA and introns
    IVS         191  >   195     intron A
BASE COUNT       63 a     47 c     48 g     37 t
ORIGIN      
        1 gtttgaggta acttaaacag aaagcagcga agaccacaca agagaagact gcaagtgtct
       61 tctgaagtgt gacaacgtcc catccagtcc cgatcgtggt cttcgtctac cagcaaaaca
      121 atgctcaagt ttctgaaacc aaaatcgtcg gacgagggaa gctcgaagga caacaagaac
      181 tcggcgtcgc gtatg
//
LOCUS       CELCAPKCC2     75 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
            exon 2.
ACCESSION   M37115 J05289 M35424
KEYWORDS    cAMP-dependent protein kinase catalytic subunit C.
SEGMENT     2 of 7
SOURCE      C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
            lambda-CC[2,3,5].
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 75)
  AUTHORS   Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
  TITLE     Cloning, characterization, and expression of the gene for the
            catalytic subunit of cAMP-dependent protein kinase in
            Caenorhabditis elegans
  JOURNAL   J. Biol. Chem. 265, 6896-6907 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +     9  +    70     cAMP-dependent protein kinase catalytic
                                 subunit C, exon 2
    pre-msg <     1  >    75     cAMP-dependent protein kinase catalytic
                                 subunit C mRNA and introns
    IVS     <     1        8     intron A
    IVS          71  >    75     intron B
BASE COUNT       26 a     17 c     18 g     14 t
ORIGIN      
        1 attttcagtc aaagaattcc tggacaaggc acgcgaagac ttcaaacaac gatgggagaa
       61 tccggctcag gtaca
//
LOCUS       CELCAPKCC3    142 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
            exon 3.
ACCESSION   M37116 J05289 M35424
KEYWORDS    cAMP-dependent protein kinase catalytic subunit C.
SEGMENT     3 of 7
SOURCE      C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
            lambda-CC[2,3,5].
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 142)
  AUTHORS   Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
  TITLE     Cloning, characterization, and expression of the gene for the
            catalytic subunit of cAMP-dependent protein kinase in
            Caenorhabditis elegans
  JOURNAL   J. Biol. Chem. 265, 6896-6907 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +     9  +   137     cAMP-dependent protein kinase catalytic
                                 subunit C, exon 3
    pre-msg <     1  >   142     cAMP-dependent protein kinase catalytic
                                 subunit C mRNA and introns
    IVS     <     1        8     intron B
    IVS         138  >   142     intron C
BASE COUNT       44 a     27 c     35 g     36 t
ORIGIN      
        1 attttcagaa cacggcgtgt cttgacgact ttgataggat aaaaaccctt ggaaccggat
       61 cattcgggcg cgtgatgctt gtgaagcata agcagtcggg caactattat gccatgaaaa
      121 ttctagacaa acaaaaggtt gt
//
LOCUS       CELCAPKCC4    113 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
            exon 4.
ACCESSION   M37117 J05289 M35424
KEYWORDS    cAMP-dependent protein kinase catalytic subunit C.
SEGMENT     4 of 7
SOURCE      C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
            lambda-CC[2,3,5].
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 113)
  AUTHORS   Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
  TITLE     Cloning, characterization, and expression of the gene for the
            catalytic subunit of cAMP-dependent protein kinase in
            Caenorhabditis elegans
  JOURNAL   J. Biol. Chem. 265, 6896-6907 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +     9  +   108     cAMP-dependent protein kinase catalytic
                                 subunit C, exon 4
    pre-msg <     1  >   113     cAMP-dependent protein kinase catalytic
                                 subunit C mRNA and introns
    IVS     <     1        8     intron C
    IVS         109  >   113     intron D
BASE COUNT       30 a     23 c     27 g     33 t
ORIGIN      
        1 attttcaggt tgtaaaactc aagcaagtgg agcacacgtt gaacgaaaag cgcattctcc
       61 aggctatcga ttttccgttt ttggtgaaca tgacgttctc tctcaagggt gag
//
LOCUS       CELCAPKCC5    222 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
            exon 5.
ACCESSION   M37118 J05289 M35424
KEYWORDS    cAMP-dependent protein kinase catalytic subunit C.
SEGMENT     5 of 7
SOURCE      C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
            lambda-CC[2,3,5].
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 222)
  AUTHORS   Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
  TITLE     Cloning, characterization, and expression of the gene for the
            catalytic subunit of cAMP-dependent protein kinase in
            Caenorhabditis elegans
  JOURNAL   J. Biol. Chem. 265, 6896-6907 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +     9  +   217     cAMP-dependent protein kinase catalytic
                                 subunit C, exon 5
    pre-msg <     1  >   222     cAMP-dependent protein kinase catalytic
                                 subunit C mRNA and introns
    IVS     <     1        8     intron D
    IVS         218  >   222     intron E
BASE COUNT       50 a     60 c     44 g     68 t
ORIGIN      
        1 tatttcagac aactccaact tgtacatggt gctcgaattc atctcgggag gtgagatgtt
       61 ctcacattta cgccgaatcg gacgtttctc cgagccccac tcgcgtttct acgccgctca
      121 aattgtgctt gcctttgagt accttcactc cttagatctc atctatcgtg atctgaaacc
      181 cgagaacttg cttatcgatt ccactggata tctaaaggta gg
//
LOCUS       CELCAPKCC6    397 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
            exon 6.
ACCESSION   M37119 J05289 M35424
KEYWORDS    cAMP-dependent protein kinase catalytic subunit C.
SEGMENT     6 of 7
SOURCE      C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
            lambda-CC[2,3,5].
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 397)
  AUTHORS   Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
  TITLE     Cloning, characterization, and expression of the gene for the
            catalytic subunit of cAMP-dependent protein kinase in
            Caenorhabditis elegans
  JOURNAL   J. Biol. Chem. 265, 6896-6907 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +     9  +   392     cAMP-dependent protein kinase catalytic
                                 subunit C, exon 6
    pre-msg <     1  >   397     cAMP-dependent protein kinase catalytic
                                 subunit C mRNA and introns
    IVS     <     1        8     intron E
    IVS         393  >   397     intron F
BASE COUNT      119 a     73 c     99 g    106 t
ORIGIN      
        1 attttaaggt aaccgatttc ggatttgcga aacgtgtcaa aggacgaacg tggacattgt
       61 gcggcacacc agaatacttg gcacccgaga ttattttgtc aaaaggatac aataaggctg
      121 tcgattggtg ggcacttggt gtgttgatct acgaaatggc cgccggttat ccgccgttct
      181 ttgccgatca accgattcaa atttatgaga agattgtgtc aggaaaagtg aaattcccgt
      241 cgcacttctc gaatgagttg aaagacttgc tgaagaattt gctccaagtt gatcttacaa
      301 agcgatacgg taacttgaaa aatggagttg ccgatatcaa aaatcacaaa tggtttggaa
      361 gtaccgactg gattgccatc tatcagaaga aggtgag
//
LOCUS       CELCAPKCC7    729 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  C.elegans cAMP-dependent protein kinase catalytic subunit C gene,
            exon 7.
ACCESSION   M37120 J05289 M35424
KEYWORDS    cAMP-dependent protein kinase catalytic subunit C.
SEGMENT     7 of 7
SOURCE      C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones
            lambda-CC[2,3,5].
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; 
            Spirurida; Spirurina; Filarioidea; Filariidae.
REFERENCE   1  (bases 1 to 729)
  AUTHORS   Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S.
  TITLE     Cloning, characterization, and expression of the gene for the
            catalytic subunit of cAMP-dependent protein kinase in
            Caenorhabditis elegans
  JOURNAL   J. Biol. Chem. 265, 6896-6907 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    +     9      134     cAMP-dependent protein kinase catalytic
                                 subunit C, exon 7
    pre-msg <     1      729     cAMP-dependent protein kinase catalytic
                                 subunit C mRNA and introns
    IVS     <     1        8     intron F
BASE COUNT      197 a    197 c     86 g    249 t
ORIGIN      
        1 aatttcagat cgaagcaccg ttccttccaa aatgccgtgg ccctggcgac gcatctaact
       61 ttgacgacta cgaggaagag ccgcttcgta tctcgggcac cgagaaatgc gccaaagagt
      121 ttgccgagtt ttaattcatt atctttgtcg attccttcca taagcttttt gtttcaaatt
      181 tcactctctc tctctctctt ttccatattc ctcaaaattg tactagtcca aaacaaaacc
      241 ctgtccctat tgtccggtgg tttgtcttcc ctcacccaaa ccttctcatt ctcagccaaa
      301 ttgaattcaa tattgttgcg acaattgaaa atgatgaaga tgcttactat ttcttccctt
      361 ccttctgaac cccttatttt ttaatgttca tacattttct tgccccgatc ccgtccccct
      421 aataataatt tgtaaaaata atagattgga atttgctcta atatatataa ttattatata
      481 tcaaaatttt ctagtcaacc cctgactttc cctcctccaa ttcagcctcc cgcaaacttt
      541 tttctagaaa agttgacttc tgtctcaaca aagaaatgga atcccacaaa accattaatt
      601 gtatgttttt ttctcatttt ccttttcctc tctcatgtat acatatatat gtattttaaa
      661 cacttttcac cccccccccc cccacacaca ctcaccgcaa aaatccccca gtcgtcataa
      721 aaagtagtt
//
LOCUS       DRODEC1A     3784 bp ds-DNA             INV       14-AUG-1990
DEFINITION  D.melanogaster defective chorion-1 fc125 (dec-1) gene, complete
            cds.
ACCESSION   M35887 M35688 X07697
KEYWORDS    dec-1 protein; eggshell protein.
SOURCE      D.melanogaster ovary, cDNA to mRNA and genomic DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Hawley,R.J. and Waring,G.L.
  TITLE     Cloning and analysis of the dec-1 female-sterile locus, a gene
            required for proper assembly of the Drosophila eggshell
  JOURNAL   Genes Dev. 2, 341-349 (1988)
  STANDARD  simple automatic
REFERENCE   2  (bases 44 to 3784)
  AUTHORS   Waring,G.L., Hawley,R.J. and Schoenfeld,T.
  TITLE     Multiple proteins are produced from the dec-1 eggshell gene in
            Drosophila by alternative RNA splicing and proteolytic cleavage
            events
  JOURNAL   Dev. Biol. (1990) In press
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.L.Waring, 28-JUN-1990.
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               PRM          14     18       put. TATA-box
               SITE         44     44       transcription initiation site
FEATURES       from  to/span     description
    pept        119     3490     fc125 protein precursor (dec-1)
    sigp        119      187     fc125 protein signal peptide
    matp        188     3487     fc125 protein
BASE COUNT     1103 a    999 c   1004 g    678 t
ORIGIN      Chromosome 7C-X.
        1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga
       61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat
      121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag
      181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga
      241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc
      301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag
      361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa
      421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc
      481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg
      541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc
      601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc
      661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc
      721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc
      781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg
      841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca
      901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc
      961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa
     1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca
     1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga
     1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt
     1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag
     1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc
     1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg
     1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc
     1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt
     1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa
     1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag
     1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc
     1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca
     1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa
     1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg
     1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat
     1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat
     1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca
     2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag
     2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca
     2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt
     2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca
     2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag
     2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca
     2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca
     2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga
     2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc
     2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga
     2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg
     2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa
     2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga
     2821 aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca
     2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac
     2941 aatggccagg acaataaacc cgaagcaacc aggcgaagtc ggtggatcgg agagccagaa
     3001 gagcaactct aatcctccaa cgacgttgac accagctcca caagagcaac cgcaagagca
     3061 tcgagtccac aaaagtccat catcagcacc atccgaaacc gaaatcgaaa acgcaccatc
     3121 atctgacccc caagtgggtt ccattttcac ctacggcgag ggattgctgc atccgtttat
     3181 gggtctgctg ccggtggaga ggcccgatga tccgtggaac cagaagccat acgatcctca
     3241 ccatcctctc tacacgggcg gtgggagcta cgatgcctat ctgagggacg gtcgccatcg
     3301 aagggacacg cacatcatgg gtcaaggtac gcagcatggc atcctgacac ccggcatgtt
     3361 ggagagactt cttcggatca agatggattt ccagcgtaga tttcccccat ttatacaagg
     3421 gtatgctgaa tcaccatacg aatcttaccc gtgtagaggt acagcctccg gttctgggca
     3481 agatttctaa gccgaaaact aagaccaagc ccaagaatga ggacgaaccc gtcttcgaat
     3541 tgggagccgc tgagcgtagc ctgttcgagg acgagaccaa cgattcttta gagaaagatc
     3601 ccgaaccaga accggatgag gaggatgacc gggatgtgga agagcccagc gagagcagtg
     3661 aacctagggg attcagcagc aaaaaaagcc gggatgagaa cgacatcgat tacttcaatt
     3721 tcgatgacga ttgatgttga tgattgactt ttattataat tgccaataaa attaatatta
     3781 taaa
//
LOCUS       DRODEC1B     3798 bp ds-DNA             INV       14-AUG-1990
DEFINITION  Drosophila melanogaster defective chorion-1 fc106 (dec-1) gene,
            complete cds.
ACCESSION   M35888 M35688 X07697
KEYWORDS    dec-1 protein; eggshell protein.
SOURCE      D.melanogaster ovary, cDNA to mRNA and genomic DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Hawley,R.J. and Waring,G.L.
  TITLE     Cloning and analysis of the dec-1 female-sterile locus, a gene
            required for proper assembly of the Drosophila eggshell
  JOURNAL   Genes Dev. 2, 341-349 (1988)
  STANDARD  simple automatic
REFERENCE   2  (bases 44 to 3798)
  AUTHORS   Waring,G.L., Hawley,R.J. and Schoenfeld,T.
  TITLE     Multiple proteins are produced from the dec-1 eggshell gene in
            Drosophila by alternative RNA splicing and proteolytic cleavage
            events
  JOURNAL   Dev. Biol. (1990) In press
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.L.Waring, 28-JUN-1990.
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               PRM          14     18       put. TATA-box
               SITE         44     44       transcription initiation site
FEATURES       from  to/span     description
    pept        119     2971     fc106 protein precursor (dec-1)
    sigp        119      187     fc106 protein signal peptide
    matp        188     2968     fc106 protein
BASE COUNT     1106 a   1003 c   1008 g    681 t
ORIGIN      Chromosome 7C-X.
        1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga
       61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat
      121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag
      181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga
      241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc
      301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag
      361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa
      421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc
      481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg
      541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc
      601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc
      661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc
      721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc
      781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg
      841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca
      901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc
      961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa
     1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca
     1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga
     1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt
     1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag
     1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc
     1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg
     1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc
     1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt
     1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa
     1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag
     1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc
     1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca
     1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa
     1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg
     1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat
     1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat
     1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca
     2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag
     2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca
     2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt
     2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca
     2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag
     2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca
     2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca
     2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga
     2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc
     2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga
     2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg
     2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa
     2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga
     2821 aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca
     2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac
     2941 aatggccagg gcgacgcatt tcagacaata aacccgaagc aaccaggcga agtcggtgga
     3001 tcggagagcc agaagagcaa ctctaatcct ccaacgacgt tgacaccagc tccacaagag
     3061 caaccgcaag agcatcgagt ccacaaaagt ccatcatcag caccatccga aaccgaaatc
     3121 gaaaacgcac catcatctga cccccaagtg ggttccattt tcacctacgg cgagggattg
     3181 ctgcatccgt ttatgggtct gctgccggtg gagaggcccg atgatccgtg gaaccagaag
     3241 ccatacgatc ctcaccatcc tctctacacg ggcggtggga gctacgatgc ctatctgagg
     3301 gacggtcgcc atcgaaggga cacgcacatc atgggtcaag gtacgcagca tggcatcctg
     3361 acacccggca tgttggagag acttcttcgg atcaagatgg atttccagcg tagatttccc
     3421 ccatttatac aagggtatgc tgaatcacca tacgaatctt acccgtgtag aggtacagcc
     3481 tccggttctg ggcaagattt ctaagccgaa aactaagacc aagcccaaga atgaggacga
     3541 acccgtcttc gaattgggag ccgctgagcg tagcctgttc gaggacgaga ccaacgattc
     3601 tttagagaaa gatcccgaac cagaaccgga tgaggaggat gaccgggatg tggaagagcc
     3661 cagcgagagc agtgaaccta ggggattcag cagcaaaaaa agccgggatg agaacgacat
     3721 cgattacttc aatttcgatg acgattgatg ttgatgattg acttttatta taattgccaa
     3781 taaaattaat attataaa
//
LOCUS       DRODEC1C     5740 bp ds-DNA             INV       14-AUG-1990
DEFINITION  Drosophila melanogaster defective chorion fc177 (dec-1) gene,
            complete cds.
ACCESSION   M35889 M35688 X07697
KEYWORDS    dec-1 protein; eggshell protein.
SOURCE      D.melanogaster ovary, cDNA to mRNA and genomic DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Hawley,R.J. and Waring,G.L.
  TITLE     Cloning and analysis of the dec-1 female-sterile locus, a gene
            required for proper assembly of the Drosophila eggshell
  JOURNAL   Genes Dev. 2, 341-349 (1988)
  STANDARD  simple automatic
REFERENCE   2  (bases 44 to 5740)
  AUTHORS   Waring,G.L., Hawley,R.J. and Schoenfeld,T.
  TITLE     Multiple proteins are produced from the dec-1 eggshell gene in
            Drosophila by alternative RNA splicing and proteolytic cleavage
            events
  JOURNAL   Dev. Biol. (1990) In press
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.L.Waring, 28-JUN-1990.
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               PRM          14     18       put. TATA-box
               SITE         44     44       transcription initiation site
FEATURES       from  to/span     description
    pept        119     4888     fc177 protein precursor (dec-1)
    sigp        119      187     fc177 protein signal peptide
    matp        188     4885     fc177 protein
BASE COUNT     1716 a   1481 c   1496 g   1046 t      1 others
ORIGIN      Chromosome 7C-X.
        1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga
       61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat
      121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag
      181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga
      241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc
      301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag
      361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa
      421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc
      481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg
      541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc
      601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc
      661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc
      721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc
      781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg
      841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca
      901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc
      961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa
     1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca
     1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga
     1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt
     1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag
     1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc
     1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg
     1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc
     1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt
     1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa
     1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag
     1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc
     1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca
     1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa
     1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg
     1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat
     1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat
     1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca
     2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag
     2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca
     2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt
     2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca
     2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag
     2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca
     2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca
     2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga
     2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc
     2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga
     2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg
     2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa
     2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga
     2821 aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca
     2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac
     2941 aatggccagg acaataaacc cgaagcaacc aggcgaagtc ggtggatcgg agagccagaa
     3001 gagcaactct aatcctccaa cgacgttgac accagctcca caagagcaac cgcaagagca
     3061 tcgagtccac aaaaggttag cacatttcca caggtttggg caagaggcag ggttaaatgc
     3121 gaccacatcc aagggttgcg gatgtggcag gttggattgt ctgtgcggca ggagttgtcg
     3181 atgcggaaga agaggcttgg aatctagagt ggtttctagt agaacttcgg gaacttgcca
     3241 gtgcaaagcc agtcaccgga acaagcgttg tgaatatgga accttggaga ccatcgatga
     3301 gggttctctt aacgaactta gaagagaata taaactggga ctaaaagaga tcacgctgag
     3361 tccagatgaa gatcccgccg aggccctgat gcgctacaat gcggcctcca ttcgcgaagc
     3421 tttggagcga gccagtatgg aaccgctgga gatcggtggg gatcagtatg aagaggatgc
     3481 gcaacaggag cccatggagg aggagcaact acagcatgat cccaacacag agccccagta
     3541 caatcataag gactttgtgc ggctaaccac ttccacagca tctccgatca caagtaccac
     3601 agaagctgcc actccaacag gcagtgactc tacttcagaa gcaactgtga cacctgaagt
     3661 caccaccacc actagcacat ccacaacaac aacaaccgaa agcaccaaag atgagggact
     3721 ggacatgcag caagactcac aggctgaggc tgagagttct catgtcacca agtcgatatc
     3781 gaagcaggag gccganattc atcagctgca cagtattgtg gaggagctga agaatgagat
     3841 ccttaaactg aatttgcgtt gcagcacgat aatatcaaac aatgtggcca aagaaccggt
     3901 cacagagaag aacccacccg tggtggagga accatccaag caggaggata agccaaaggt
     3961 cgaggagaaa gtgattgccg aagagcaagc cccagtggag cccgaggaag atctggaaga
     4021 ggacgaggat agtacctcta ttagtacgac cacagagact ccttcaccaa gtggtagtta
     4081 ctcaacaaaa cctggacttt cccttggatc gcctagagtg gatgaacaaa gtggctcgtc
     4141 caataaactg gactacgacg acgataacaa ttggcaacgc attctggcca atcgtggcta
     4201 tgacacggac tacctaacca aatcccatga gcgtcagttt gcccagggtc aaaacctgga
     4261 gatgcccaag aactgcaatt acgatggaaa tggcagccaa gagtatggcc cctatccgga
     4321 gttccaagca gatgaaccca gcacggatac ggagggcaag gctaaaaggg ctttaagtgt
     4381 gaaacaacag gcgcagctgc tcaatgcggc cctaaatgat agtggaagcg attcgagcga
     4441 tggcaccacc accaccacca caccctctcc gtacgcgatg aggggaaagt ttgtgagacg
     4501 gaggagcacg gcaagaaggg ttcccattcc aaagatagga aaggcgagcg acgaggtatg
     4561 ggtacgatct cctcgtcaag cgaaaatgcc ccaacgaccc aagaagtcca tgtccaagcc
     4621 gaaaaagcag agctcgcagg tgactacgca ggccaccgtg agcagcacta agctggacag
     4681 tttggtggat gtgctcaagg atctggtgcg cctgcaaatt caaaaggaga aaaaatccag
     4741 cctattaaga acccagagca ataacctttc gaaaaccaaa cccaaatcta ttaagcccgt
     4801 caaagtcatc aagcgaaaaa ggttgcgcag gaggcaacac aagtctattg ccacgaccat
     4861 caggagtccc attcaaacga aagcataaga tttcagccca tttaactaat tcacagatcc
     4921 caactaatcc tatacataat ttagatatag tataagcata ttcactattc gttcgcaatt
     4981 cattcatatc catccatctt tcatttgcat atccctcgtc catatcgcca gtccatcatc
     5041 agcaccatcc gaaaccgaaa tcgaaaacgc accatcatct gacccccaag tgggttccat
     5101 tttcacctac ggcgagggat tgctgcatcc gtttatgggt ctgctgccgg tggagaggcc
     5161 cgatgatccg tggaaccaga agccatacga tcctcaccat cctctctaca cgggcggtgg
     5221 gagctacgat gcctatctga gggacggtcg ccatcgaagg gacacgcaca tcatgggtca
     5281 aggtacgcag catggcatcc tgacacccgg catgttggag agacttcttc ggatcaagat
     5341 ggatttccag cgtagatttc ccccatttat acaagggtat gctgaatcac catacgaatc
     5401 ttacccgtgt agaggtacag cctccggttc tgggcaagat ttctaagccg aaaactaaga
     5461 ccaagcccaa gaatgaggac gaacccgtct tcgaattggg agccgctgag cgtagcctgt
     5521 tcgaggacga gaccaacgat tctttagaga aagatcccga accagaaccg gatgaggagg
     5581 atgaccggga tgtggaagag cccagcgaga gcagtgaacc taggggattc agcagcaaaa
     5641 aaagccggga tgagaacgac atcgattact tcaatttcga tgacgattga tgttgatgat
     5701 tgacttttat tataattgcc aataaaatta atattataaa
//
LOCUS       DROGLDYYR    6458 bp ds-DNA             INV       14-AUG-1990
DEFINITION  D.melanogaster glucose dehydrogenase (GLD) gene, complete cds.
ACCESSION   M29298 X07358 X13581 X13582
KEYWORDS    YYRR box; glucose dehydrogenase.
SOURCE      D.melanogaster (strain OregonR) DNA, clone pEG25D20.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1460 to 1531)
  AUTHORS   Cavener,D.R.
  JOURNAL   Unpublished (1988) Vanderbilt Univ,Dept Mol Bio,Nashville,TN 37235
  STANDARD  full staff_entry
REFERENCE   2  (bases 1460 to 1531)
  AUTHORS   Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M.,
            Schonbaum,C. and Xiao,X.
  TITLE     The YYRR box: A conserved dipyrimidine-dipurine sequence element in
            Drosophila and other eukaryotes
  JOURNAL   Nucleic Acids Res. 16, 3375-3390 (1988)
  STANDARD  full staff_entry
REFERENCE   3  (bases 78 to 1142 and 3792 and 4208)
  AUTHORS   Whetten,R., Organ,E., Krasney,P., Cox-Foster,D. and Cavener,D.
  TITLE     Molecular structure and transformation of the glucose dehydrogenase
            gene in Drosophila melanogaster
  JOURNAL   Genetics 120, 475-484 (1988)
  STANDARD  full staff_entry
REFERENCE   4  (bases 1 to 6458)
  AUTHORS   Krasney,P.A., Carr,C.M. and Cavener,D.R.
  TITLE     Evolution of the glucose dehydrogenase gene in Drosophila
  JOURNAL   Mol. Biol. Evol. 7, 155-177 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.R.Cavener, 20-OCT-1990.
FEATURES       from  to/span     description
    pept       3838     4005     glucose dehydrogenase (GLD), exon 2 (EC
                                 1.1.99.10)
               4079     4199     glucose dehydrogenase, exon 3
               4539     6088     glucose dehydrogenase, exon 4
    pre-msg     502     6458     glucose dehydrogenase mRNA and introns
    IVS         837     3828     GLD intron A
    IVS        4006     4078     GLD intron B
    IVS        4200     4538     GLD intron C
    signal      394      400     GLD TATA box
    site       1457     1532     YYRR box tetranucleotide repeat region
BASE COUNT     1782 a   1464 c   1418 g   1785 t      9 others
ORIGIN      Chromosome 3-84C8.
        1 tatctgtctg acattaatat ccaaaaatgt aataaattca catccacaaa ttgtgaaaaa
       61 aaatcaacaa cgcagttctg cagccgttcg actttatttt gccagtgctt cttaacttgg
      121 ctggaaatcg ttaaactcgc aggcgcacga gcaagcagct ttttgtgtgg gttgtttagc
      181 cgaaagcggt ggttgaagaa acctgtgacg tcttagccga agtcaggggt gcttaaagaa
      241 agttttacaa cactagacca tattcatgag taaagggttg agtaataaaa tacataaaac
      301 gtaagaaata ataataatac agattctaaa agttattagg taaaatttag accaatttag
      361 acctactcat tgcaaacact caaaagctcc cgattcagac caagtttcag agagcgcagc
      421 tttgcggcca gctttaagct gtctttcgtt gagttcgagc ttttcgtcag tttaaaaaga
      481 ctggcgcctg ctggtcagaa gctgagtcgg taacggtctg cgtctgcgcg cagttcgaac
      541 aagttgagaa agagaccaac agaaagccca tccaagtgca gtgatcaata cggtaactga
      601 caaaaaccct agaagtcagg gcttaaaaac gattttgcag cgctgccagt gggtttttgt
      661 gtgataaaaa aaaagcggct cagaaaaact tgctgacagc agatagcaca cacgtttttg
      721 tcgtcttcgg tccattgaaa aattttcccg aggcattttc tataaggaat aaacaattaa
      781 ttcaatattt aaagcataga aggaaactag acaccacatc accggactct acgatcgtaa
      841 gttgatgcaa tcgtctttat ttctattatt ttctgccttt tcggtttttg cacaacccca
      901 aaatccaaaa attcgcgatg tccgtttctg gcattgagga agctcaaaag atttggacag
      961 ctttttggcc cgaagtctgc tggaaatttg ccattgtcat aagcccaaga aacggaattt
     1021 tgttcggtct tctgatggct cacggcgata aatttactgc actttgtttg caatagctcc
     1081 aattcgattc ggtttgtttt tctggcacag agcacagtat gcctcacgat ttctgctcta
     1141 gactaaaact ataatcccaa atatgaaaac cgaattagct aattagttaa ggctaaagtc
     1201 tgaactggaa tcttctgact aagatcccac cccttccaaa cgtggacctt acacaattca
     1261 ctcaaaattc gcatagcttt atctggcgcc tttgtggcac tcataaattg cattacttta
     1321 tgattattat tctaatatat gcacggttcc acattaataa gcacccgaaa aactatgcct
     1381 aaacatatgg cagaagcggc aacgagataa ctgtataaca aggcccaaaa tcaagccggt
     1441 atctacggta ttgttcacac tgactggctg actggctgac tggctgactg actgcctgat
     1501 tgagtgactg actgactgac tgactaactg acaggcagct caaagaaacc ttcaaacgca
     1561 gcaacgaaga caatgccaag ctatatcatt gaagttatcg tgcgagtgca aaacaaacca
     1621 ccagaagaca actgcagctc gcgtcggttt tattcgacac ccgtgtcgct tcagtgaaca
     1681 agggattatg caccatgctt tacatccgat aaagttgtta attccaaaaa aaaaagtcta
     1741 attatgacag cgccattgat aatcgtgtca gataagttac acaggaagtg cgagtaatca
     1801 aatgataaca agcaaaaatg tcgttaggct gattaaataa ttgtcttcat gttctcacca
     1861 ttaaacaaaa aaatatttta aaagtcattg gtttctctga attttttatt taaattaaaa
     1921 attctccatc tgctataaca catattttta tttttttaac gaaacattat tttaacttaa
     1981 ctggctaaac tgcagataaa atcagaagtc caagtcctcc aaaaatctgt atattagaga
     2041 tcatttaacc ctaataaacc aggcggaaaa taattaagtc ggggaaaatg tattgcggct
     2101 atcccctttt cgtatcccta ggattttccc accaatttcc acgagccgcc acgcagaagt
     2161 ggacctaaat ctgttagcta ggcaaaactt gccccgaaca acttttgatt ttctgagtgc
     2221 tgcacgactt ttgcacgcgt ttttctcggc gtttgaagtt cgtctgtggc cacatttgtt
     2281 gcatggcttt tactaaactg ctggaaaacg ggctgcgagg tcaactgcct ctcttttcga
     2341 ttttccacgt tcgcgtgagn tgcgtatgtt ctgcgacttt gttttgactg tttttttgtt
     2401 tttctatttt tttgtgggct cgcgctctgc ttcactttcc tggcatcctt caactttaaa
     2461 agcgaaatgc aattgaaatg aatgaaaaat cgtgttccaa atgtcagatg cgagccccat
     2521 tgtggcatcc aaaacagacg aactgtagca attgcatggt cttctcgatc tccgcttagg
     2581 tgattctctt gtttcactct cggtttcagc ttgcgtttct gtttcaatta cttttctgct
     2641 ttggttttgg cttcggtttc tgtttctgtt tctgtttctg ttttcggatg gatacaaatt
     2701 gcaatttgat tgcttcaagc tgttcgagtt aattgaggtc aaaagcgctg tcagcatatt
     2761 ccaaaaatgc actgatttcg gggatcggtt cggtttgggt ttcggtttca attttgggcg
     2821 cctaattaac cgcccaaaaa ggagtgtgaa gagcaatatt ggaggcaagc aattaagatg
     2881 gtttcttcac ataaaaggag cttagaacgg atcccgtttc agagctgctc gggagtcaat
     2941 ctgatcagac ccgggggtca taagtcgacc accaacgaga cgaacaaatg acataattac
     3001 attgcacaat actcaaccgt ggttggctac cgtaattcca tgcagcctca ttgcttctgt
     3061 tttaggcatt tgctcgtcaa tttcgaaatt ccaaatagga aagaaaaggg aagccagcca
     3121 gacatttttg gccgctcggg gctaaggtca tgtgaaacta tgcagaagct gccgccgacg
     3181 gaacaaaagc caagctcgga tgatgagcta tgacttctgt ccgtccggcg ttctttttag
     3241 acctttcgta atcgccaatg cagggggcag ttgccgtttg tttgcaaatg gcaagatata
     3301 gttgccgntt gtttgcaaat ggcnnagata taaaaaatat cttttttgnt ttcaatttca
     3361 tttcgntttt ttcctttcgg ccgcttatgc cgctgtgtgc atgggacaga cgacgcccat
     3421 tgtctctcgg tgagtactct tattaattac agtactgctg ctggagcgtt actatgataa
     3481 tgtttagtct ttgtctcagt ggacaaggtt aaacagaagt tcatgttgcg gtcttcggaa
     3541 aggtttatga ctttcccaat ggctttcaat taatcagctg gaaattaaat aattaaatga
     3601 ccatatcatt ttgcgattgc ttttaataaa aagagtatat tgctgacaac gtctttatcg
     3661 ttctaacttt gtaaaacttt aaacaagcta gaaagagact tagaacacga acttgtagct
     3721 ttatttanaa aaaaagttcg tgctctaagn cntttgtatt ttgaaacttg atcatagttt
     3781 tttacctatt taatacttaa aataagaaat tcccattgat ttttttagtc tatcaacatg
     3841 tccgccagcg cctcagcctg cgattgtttg gtgggcgtac ccactgggcc caccctggcc
     3901 tccacatgtg gtggtagcgc cttcatgctg ttcatgggcc tcctggaggt ctttatccgc
     3961 tcccagtgtg atctcgagga tccctgcgga agggccagca gtcgggtaag ctgtaaatga
     4021 cgaaggccac taagagtggc tacccatatc tcaatataga tttccaacat tattacagtt
     4081 tcgatcggag ccggactacg agtacgattt cattgtcatt ggcggcggct cagcgggctc
     4141 tgtggtggcc tctcgactgt ccgaggtgcc ccaatggaag gtgcttttga ttgaagccgg
     4201 taagggatgt gccccagcaa ctcataccca tacacatacc tacatactta agttacactg
     4261 ctccggcaat tagaggcacg ttatgtggca tttgtttaga agcaattgaa gcttaacagc
     4321 ggcataaact gcaacaatgg caccaacact aatgccactt gggcaagact cggcggcaaa
     4381 gaagcaaccg caactgtcac gagcattatg gagcgataat ttctagtcct aatatagaac
     4441 cgacaaacta atcatacact tatataaact acaaaaaact ataacaacat gatatttttt
     4501 aattatgtat ttaacagtat cttaaccgtc ttacacaggt ggcgatgaac ccgtgggagc
     4561 ccagataccc tcgatgttcc tcaactttat tggcagcgac attgactacc gctacaacac
     4621 ggaacctgag ccaatggcct gtctgtcctc catggagcag cgctgctact ggccacgtgg
     4681 caaggtactt ggcggaacct cggtattgaa cggaatgatg tatgtccgcg gcaaccggga
     4741 ggactatgat gattgggcag cggatggaaa tcctggttgg gcctacaacg acgtgcttcc
     4801 gttcttcaaa aagtcagagg acaacctgga tctggatgaa gtgggcacgg aatatcacgc
     4861 caagggtggc ctcctgccag tgggcaagtt cccatacaac ccacctctat cctacgctat
     4921 tctgaaggcc ggcgaggagt tgggcttctc cgttcacgat ctaaatggtc agaactctac
     4981 tggatttatg attgcccaga tgacggcccg aaatggcatc agatatagct cagcccgggc
     5041 tttcctgcgt ccggctcgca tgcgcaacaa cctgcatatc ctgctgaaca ccacggccac
     5101 caagatcctt attcatccgc acactaagaa tgttctggga gtggaagtca gtgaccaatt
     5161 cggcagcacc cgaaaaatcc tggtcaagaa ggaggtggtt ttaagtgctg gcgctgtgaa
     5221 ctctccgcat atcctactac taagtggagt gggtcccaag gacgaactgc agcaggtgaa
     5281 tgtgaggacc gtgcacaacc taccaggtgt gggcaagaat ctacacaatc acgtgaccta
     5341 ctttaccaac ttcttcattg acgatgcgga cacggcgcca cttaactggg ccacggctat
     5401 ggagtacttg ctctttcggg atggtctcat gtccggcact ggcatttcgg atgtgaccgc
     5461 aaagctggcc actcgctatg ccgacagtcc cgaacgtccc gatctccagc tgtactttgg
     5521 cggctatctg gccagttgtg cccgtacagg acaggtgggc gagttgctct cgaataactc
     5581 tcggtccatt cagatcttcc cagctgtttt gaaccctcga tcgcggggat ttattggcct
     5641 gcgatctgca gatcctctgg agccgccgcg tattgtggcc aattacctaa cacacgagcg
     5701 ggatgtgaag acgctggtgg agggcattaa gtttgtcatc cgattgtcgc aaacgacgcc
     5761 gttgaagcag tacggcatgc ggctggacaa gacggtggtc aagggatgcg aggcccatgc
     5821 ctttggtagc gatgcctact gggagtgcgc cgtgaggcag aacacgggtc cggagaatca
     5881 ccaggctggc tcctgcaaaa tgggtcccag ccacgacccc atggcggtgg tcaaccacga
     5941 gttgcgggtc cacggaattc gcggactccg agtcatggac accagcataa tgcccaaggt
     6001 cagctcggga aacacccatg cccccgccgt gatgatcgct gagaagggcg cctacctgct
     6061 gaagagggcc tggggcgcca aggtctgacg cgtggatgcg acgtggacgt tgcatagagt
     6121 aatttaatca gaatccaagc gaaaattaca gcgagtaaaa ttttaagcga aattacactt
     6181 atggcacctc agatccaaaa ctaaactatt ttaccctaat ccaatcttat ttaatccgat
     6241 cctattatga aggtttctca ctctaggcac accgcgttgc gcaatttcca ttaaagttat
     6301 ctataagtta acaggctccc ttaaaactaa ctactgttac ccactgaata tgccgtcaaa
     6361 cttcttgaac agcacactca gctcaaatta aagacaaata aaacttatta ggtatctagt
     6421 aaatttacta aaattccaaa ttaaatttat gtgaattc
//
LOCUS       DRPGLDYYR    7382 bp ds-DNA             INV       14-AUG-1990
DEFINITION  D.pseudoobscura glucose dehydrogenase (GLD) gene, complete cds.
ACCESSION   M29299 X07359
KEYWORDS    YYRR box; glucose dehydrogenase.
SOURCE      D.pseudoobscura DNA, clone pBy2.1.
  ORGANISM  Drosophila pseudoobscura
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 2131 to 2198)
  AUTHORS   Cavener,D.R.
  JOURNAL   Unpublished (1988) Vanderbilt Univ,Dept Mol Bio,Nashville,TN 37235
  STANDARD  full staff_entry
REFERENCE   2  (bases 2131 to 2198)
  AUTHORS   Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M.,
            Schonbaum,C. and Xiao,X.
  TITLE     The YYRR box: A conserved dipyrimidine-dipurine sequence element in
            Drosophila and other eukaryotes
  JOURNAL   Nucleic Acids Res. 16, 3375-3390 (1988)
  STANDARD  full staff_entry
REFERENCE   3  (bases 1 to 7382)
  AUTHORS   Krasney,P.A., Carr,C.M. and Cavener,D.R.
  TITLE     Evolution of the glucose dehydrogenase gene in Drosophila
  JOURNAL   Mol. Biol. Evol. 7, 155-177 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.R.Cavener, 20-OCT-1990.
FEATURES       from  to/span     description
    pept       3959     4126     glucose dehydrogenase (GLD), exon 2 (EC
                                 1.1.99.10)
               4203     4323     glucose dehydrogenase, exon 3
               5530     7079     glucose dehydrogenase, exon 4
    pre-msg     777  >  7382     GLD mRNA and introns
    IVS        1136     3948     GLD intron A
    IVS        4127     4202     GLD intron B
    IVS        4324     5529     GLD intron C
    site       2129     2200     YYRR box tetranucleotide repeat region
BASE COUNT     1983 a   1667 c   1611 g   2121 t
ORIGIN      Chromosome 3-84C8.
        1 gaattccaat catattaatt ttaatatcat gaagcacacc aggagttttc ttatctattt
       61 gttttttttt ttcaaagcat acatgactct acttttcttt ggtttcatat cactttttgg
      121 aagttctcaa ctctcaaata caaagtatat tctgaacaag tattattaca tattactatc
      181 ttactttttt atcaaaacct ttacaaatat ttttccactt tcaatgccca ggcttggcga
      241 aaacaacaat actagaacgt agttccactc agaggtactt tattttacct ctgttcggtc
      301 cgcttggaaa tcattaaacc acgcagggcc tctccgcgga aagagtgcct ttttgtgtgg
      361 gctgtgtgct gtctgagagc gggggttgaa gaaacctgtg acgtcttatt gaagcatagg
      421 ggtggctggc tgccataatc tggagttttg cagcactgtc aaagctgtat gagtcatcaa
      481 atattcagta aagttcttat agaactaaag cttttaatgc atagttttta ttttgattgt
      541 tatttttaga agttttagaa tgaattgtat ttgtttttac aaaaagaaat ctctcttttg
      601 attaacagca aagcccaact taacgaagcc aaacgacgca gcgcgctagc aagcttgcca
      661 accagcttta agctgccgct tcgtcaaatg aaaccttcgt taagccaaaa gcttacctca
      721 gtggcgcagc gcggcttttg ggaagcttaa aaagcctcgc gctcgccggc cagagactga
      781 gtcggtaacg gtctgcgtct gcgcgcagtt caaaacaggt tgaaaaaaac gaggcccagt
      841 gcttaaaaga agggaacact aaaaacccta gaaggcagcc caacaaatac cgttgtacgc
      901 atgagtgtct ttgtgttgtg tccaagaaaa taagaagaag aagaaactgc aacaagaaac
      961 gtttaaataa attgtttaca attgtgcatt ttttacgagt cgaacagttc gtttttatca
     1021 tctccatgcc attgagcaat tggcctagag gcatttctat ttttcaataa ggaataaata
     1081 attaaatttt caaaaattta acaaacagaa aaggaaacct gaaccgacat agaccgtaag
     1141 ttgatgcaat cgcctgatgc ccgatacctg atggctgatg cctgatgcct gagtttttct
     1201 tttatttcgc tattttctgc gtgtttcggc catgtccgtt ttggcattga ggaagctaaa
     1261 cgaaaactgt tttggccagc ctctgcctct tggactgctg gaaattttac cattgtcata
     1321 agcccaagac ccaagaaacg gaacttcaac agagtcttgt gttgtgtcta ttggtcttct
     1381 gatgcctcat ggcgataaat tgagttgcac ttttattgta tttttgtttt ttgcaatagc
     1441 tccaattcca attcgattcg ttttgtttgc cgtcagtgta catgtctcac gattttactg
     1501 cctctagaag accaaaaccc gttccagttc atagttggag gtccattggc caaatgaatg
     1561 actacaattt gccgcaatgt gtttgagcat taaatttgct atagttttta tccactgatt
     1621 atgttccagt ttttaagcgg aggggaagtt gaaaattgct ggtctgcaaa ttgactccaa
     1681 agtagatgtg aaatttcttg aaatgatgtg aatttttctc agtttaattg aatgatggat
     1741 tgccaatgga tgccaatgaa gttcttcaaa ttattgggtg tgaaagtctc acaaattttg
     1801 tgaatttgat ggggaagcta ttattatgac aggtccaaag atctcacccc atccaaagag
     1861 ccaaagacag agcccttaca caatccccgc actataatca catggtttta tctggcgcct
     1921 ttgtcatggc acgcataaat tgcattactt tatgattata attctaataa atgcacagtt
     1981 cccacattaa taagcgccct cgccctcccc tactcagaaa ccatcgctaa acatatggca
     2041 aggaggtaca actaaaactc caaccaaacc caaaaaaact ctatctctgt ctgcggtatt
     2101 gttcacaccc ctgagcacaa gaccgacata ctgactgact gactgactgg ctgactgact
     2161 gactcactaa ctgactgatt gactgactgg cagactgaca actcaaagaa accttcaaat
     2221 gcaacggcag caatgaagac aatgccaagc gatatcaata caaaaaccac aagaagacaa
     2281 ctgcaacgca actacgatta taatcgacat gcgggatcgg atcgggttgg atcgcttcag
     2341 tgcacaagat attatgtaca tccgataaaa ttgttaatac aaaaaaatca agctgttcat
     2401 aattatgaaa aaaaactcac agataacatt gttgtcgcaa aaggaagtgt gaataggcca
     2461 aagaagaggc acaaaaagtg tcattaatta attgaataat tacagagaca tgacagagag
     2521 cagcacttgc ttaggatttc ccttaagttt tccatagatt tccatgccac aatactggaa
     2581 aagcaaaact tgtcccgaac aacttttaat tttctgattg ctgcacgact tttgcacgcg
     2641 tttttctttg gcttttgtga agttcgcctg tacttgtacc cgtagccaaa gcctgcctgg
     2701 aaaacttgtt gcgaggtcaa ctgcctctct taaagccaca gttgcgtatg cactctggct
     2761 ttagtttggc tttgctttac tttcgtggaa tccttcaact ttaaaacaag aaaaggacac
     2821 caaaatccca ttcaaattga atgaaaaatc gtgtttcaaa tgtcaaatgc gaacaccaaa
     2881 gaaacaaaaa aaaaacagac gaactgtagc aattgtagca tgatcttctc tggtgattct
     2941 ctggctctga ttttttgggt ttcattttca gtttcgggct ttggtggttg tttttgtttt
     3001 tggcttccac gtttggtttg gttgcaaatt gcaatttgat tgctttaagc cgtttcgatt
     3061 taattgaggt caaaaacgtt gtcggattct tcccaaaaat gtgctgattt gtgggttggg
     3121 ttttgggctt taaaagtaca tatttgtcta attgtcatgc ctggatgtgg tatgtagcaa
     3181 ctttatcggt ctccccaaaa ctctcactta aaaatcactc aaatctcacc cgcccagagg
     3241 tcatcaaaac gtttgcaaat gacataatta taatggacat tactcaatct ctcattgctt
     3301 cttgttttag gcagtaggcc ccagcagaga gtacgagtaa tttgaaattt caaataggaa
     3361 aagagccaga gcccgagcct tggccgctcg tccggggcta aggtcatgtg aaactatgca
     3421 gctgactgaa caaaagccaa gagctaagac ccaaccaaga tgatgagctc tgtctgtctg
     3481 acttggcctt ttggtaattg ccaatgctgc tgctgctgct ggccagttgc cgtttgccaa
     3541 atgcccaacg atatcaaaaa agtatctttt tctgatttta attccatttt attcagcttt
     3601 tgtttttact tcttcgactg cgatgacgcc cattgtctgc tgcggttggt acttttatta
     3661 attgcagtac tgctgctgtt gctggagaat tacttagatg attttttgtc tttgccatgc
     3721 cccatggatc ggcaaggtta aacagaagtt tggaattgcg gaatttggaa ttatatttat
     3781 gattttgccc tttggctttg gttttgtgtt taatcaacgg caaattatta aaacaaacaa
     3841 aaatacgagt agatacattt aaatattgaa ttataatttc tgaagctttg tttctcaagt
     3901 tcgtttctta aatctttcgt tttctgctct tttcattcca cagcacagcc ttatcataat
     3961 ggccaccagt ccatcctcct gcgattgttt ggtgggcgta cccacggggc ccaccctcgc
     4021 ctccacctgc ggcggcagcg ccttcatgct gttcatgggc cttttggagg tcttcatacg
     4081 ctcccaatgc gacctcgagg atccttgcgg acgggccagc acgcgggtaa gccaaagatc
     4141 cgtatatggt caagacgtgt ggctaaccag atcacattca atccaatcca ttttgttggc
     4201 agtttcgttc ggagccggat tacgagtacg atttcattgt cattggcggc ggctcggccg
     4261 gctctgtggt tgcctcacgt ctctccgagg tgccccagtg gaaggtgttg ctgatcgaag
     4321 ccggtaagtg ttgttgtttc gtctctcgcg tgtgtgtggg tgtgtgtgag tgttgcatgc
     4381 aacatgttgc cagccgcagt gtgctcacgg gcaattagat gcacgttatg tatgtgcatt
     4441 gtttagaagt atattgaagc ctaacagcac cagcagcagc agcagcacca gcagcagcta
     4501 agacttgggc aaccgcaact ctgccacgag cattatgtaa actaaacgag ctgctgccac
     4561 tgccacctag agcagcggct ccattgctgc ccccaaggca gttccccatg caaatgtctt
     4621 cttgctatat aggatttgct gcatgccccc gccaatgcct catggaggtt ttttacgttc
     4681 cgcttgactt gacttttttg cattggaata attgttggtt ttttgtacct tacatttttt
     4741 tatgaatgaa ttataatgca tgaacaatac ggggaacgtg gaacttcgga acgagtgtgt
     4801 agggcattca cttttcctta acgagaagca aatatgcggt aaactctgct gaagaggctt
     4861 tttagcactg aataatttta catttaattt gcgatatttg aaaactggat ctatagcacc
     4921 aggttttaga ggaaagcatt agcaaatcac ttaaatcttt gatggaatat attacaatcg
     4981 tttaaaaccc aaaaaacaaa acgaaaatcc ctctatttgg tcggagcacc aaaaaacacc
     5041 tcattaaaca tcataatttt tcgccattca tcactaataa cttaattagt tattcaggtg
     5101 tctagaaaat cacgttaata cctctctcta tcgctgggaa aacattctca tagatcccat
     5161 tagacccgtc ttatcagcat tcttctttct ggttggctta ctttgtgggt taccgtctat
     5221 tcgtcgtttt caattgtata cgtgacaatt tttatgccca gtcactaatt acactaattg
     5281 cataatgaaa ccctcgagga aagagaaaca ttttcaaagt tgtcaatcgg gttggcggta
     5341 cttatacaat gccccagtca ggctgcagtt aagtgcaatt agttgtggcc attacttggc
     5401 ttattgtcta tgaaaactac actttcaaag actccaaact catgctcatc acaagtatgg
     5461 tttaatcttg ggttgacatt agacatgcaa ttcatcaaat aaaaattcgt attgtttgta
     5521 ccatttcagg tggcgatgag ccggtgggtg cacagatccc ctccatgttc ctgaatttca
     5581 ttggcagtga cattgactat cgctacaata cggagcccga gaggatggcc tgcttgtcct
     5641 ccatggagca gcgttgctac tggcctcgcg gcaaggtcct aggcggcact tcggtgatga
     5701 atggcatgat gtacatacgc ggcaatcgcg aggactacga cgaatgggcg gcccagggaa
     5761 atccgggctg gtcctaccag gatgtcctgc cgttcttcaa gaaatcagag gataatctcg
     5821 aattggatgc cgtgggcacc gaataccatg ccaagggcgg ccttttgcca gtgggcaagt
     5881 tcccctacaa tcctccactg tcttatgccc tgctgaaggc cggcgaggag atgggcttct
     5941 cggtgcagga tctaaacgga cagaattcga caggcttcat gattgcccag atgacggcca
     6001 ggaatggcat tcgctatagt tcggccaggg ctttcctaag acccgctcgc atgcgtaaca
     6061 atctgcacat attgctgaac accactgtga ccaaggtgct catccatccg ggaaccaaaa
     6121 acgttgtggg tgtggaggtt agcgatcagt ttggcagcat gcgcaagatc cttgtcaaga
     6181 aggaggtgat cgtcagtggt ggtgcggtga actcgccaca gatcctgctg ctgagtggcg
     6241 taggacccaa agaggatctg caaaaggtga atgtccgtcc agtccatcat ctcccgggcg
     6301 tgggaaagaa tctccagaac catgtggcgt acttcacgaa cttcttcatc gatgatgcgg
     6361 atacggcgcc cctcaactgg gccacggcca tggagtactt gctcttccgc gatggcctga
     6421 tgtcgggcac tggcatctcc gatgtcaccg caaagatggc cacccgctgg gccgatcgcc
     6481 ccaatctgcc cgatctgcag ctctattttg gcggctacct ggccagctgt gcccgcaccg
     6541 gacaggtggg agaactcctc tcgaacaact ctcgagccat acagatattc ccggctgtac
     6601 tcaaccccaa gtctcgtggt tacattaccc tgcgctcggc cgatcccctg gatccccccc
     6661 gcatctttgc caactatctg acggatgagc gcgatgttaa gaccctggtg gagggcatca
     6721 agttcgccat aaggctgtcg cagacgtcac ccttgaagca gtacggcatg cggctggaca
     6781 agaccgtggt gaagggctgt gagtcgcacg ccttcgcgag cgatgcctac tgggagtgcg
     6841 ctgtgcgcca gaacaccggg ccggagaatc atcaggcggg ctcctgtaaa atgggtccct
     6901 cccacgatcc catggcggtg gttaatcacg agctgagggt ccatggcgtg cggggactgc
     6961 gcgtgatgga caccagcatt atgcccaagg tgacggcggg caatacccat gcaccggcgg
     7021 tgatgattgc ggagaagggc gcctatctgc tgaagcgggc ctggggtgcc aaggtctgac
     7081 gcgtggatgc gacgtggacg ttgcatagag taatttaatc aaattccaac gaaaattact
     7141 acaaattcaa atttaagcga aaattattat taaggaagct ctgaactttt tctataccta
     7201 ttataaaacc tttttggccc gatccaatcc gatctgaacc catgccccac tctaagcact
     7261 ctgcgttgcg caatctacga ctcgacaatc gctctcgata agtttaacca tctaacagta
     7321 ctattattta aactaattct tgttaaaaac ccattgaaat atcgaaattt cgaatatcac
     7381 tc
//
LOCUS       DRVGLDYYR1    945 bp ds-DNA             INV       14-AUG-1990
DEFINITION  D.pseudoobscura glucose dehydrogenase (GLD) gene, 5' end.
ACCESSION   M29300
KEYWORDS    glucose dehydrogenase.
SEGMENT     1 of 2
SOURCE      D.virilis DNA, clone pCG79.
  ORGANISM  Drosophila virilis
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 945)
  AUTHORS   Krasney,P.A., Carr,C.M. and Cavener,D.R.
  TITLE     Evolution of the glucose dehydrogenase gene in Drosophila
  JOURNAL   Mol. Biol. Evol. 7, 155-177 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.R.Cavener, 20-OCT-1990.
FEATURES       from  to/span     description
    pept        943  >   945     glucose dehydrogenase (GLD; EC 1.1.99.10)
    mRNA        545  >   945     GLD mRNA
BASE COUNT      325 a    163 c    189 g    268 t
ORIGIN      Chromosome 3-84C8.
        1 gcatcctcgg taaaaagtgc acttcgagga tattttttgg tgatatttaa attttttcgg
       61 atttgcttca aacgtattat ttaaagctat acataaaaac gttatgaata cgtgttttaa
      121 cgatttggaa aatttaaacc ggcaacattg gtaaatttgg ggaaaacgtt tgtatgaaag
      181 ttgtttactt accgtccgat cagttttaaa atcattttca aagctatacc cgttagttta
      241 atatatatta ttgatcgttt tcattcaaag tatatatttc gttaacgctc gtaactttaa
      301 tgtttgcatg ttaataaata gtcatttcaa tagttagaat atgtatttat attaaattaa
      361 aaaaataaat attatgccag caagcaattg gagagcgaca agcgagcttt aagctacaca
      421 acgtgaggcg cagctggaac cagttggctc ccgaaccaac tgaaaaacga accagccaac
      481 agctcaagct tcgtattggc gtttttgggg ggagtttaaa aagacgcgcg ctgccggcct
      541 ttcactgagt cgacgccggt ctgcgattgc gcgcagttca aaaccaagtt gctaaaatca
      601 tagccaggcc taaactgcat gcagacttgt gagctgcccg ttaagccaaa gaaggcaaaa
      661 ctaagtactg tgcgtgtgtt actgtgctac tgtgtgtgct aacgaatgaa aaagaaaaaa
      721 aatcaccaaa aaataaaccg aaaaacattt aaaaaatagt ttttgtcaaa ttattgtgca
      781 cgtcttcggt ggttgtgcaa tttgccaaaa aggcattacc gataaggaga ggcacaataa
      841 gaaaataaaa gccaaaagag ctaattgttt gttttgcggc aaaccgaaaa ggaaactggc
      901 caacaactgg acacgggcat agaaaacttg aattaataca acatg
//
LOCUS       DRVGLDYYR2     50 bp ds-DNA             INV       14-AUG-1990
DEFINITION  D.virilis YYRR box located in intron-1 of Gld.
ACCESSION   X07360
KEYWORDS    YYRR box; glucose dehydrogenase.
SEGMENT     2 of 2
SOURCE      D.virilis DNA, clone pCG79.
  ORGANISM  Drosophila virilis
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 50; enum. 1 to 50)
  AUTHORS   Cavener,D.R.
  JOURNAL   Unpublished (1988) Vanderbilt Univ,Nashville,TN 37235
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 50)
  AUTHORS   Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M.,
            Schonbaum,C. and Xiao,X.
  TITLE     Title is missing from this entry
  JOURNAL   Nucleic Acids Res. (1988) In press
  STANDARD  simple automatic
COMMENT     *source: library=Charon 30; clone=pCG79;
            
               EMBL features not translated to GenBank features:
               key        from     to       description
            
               RPT           3     50       tetranucleotide repeat
               SITE          3     50       CTGA motif
BASE COUNT       15 a     14 c     11 g     10 t
ORIGIN      About 2 kbp downstream of segment 1.
        1 aactgactca ctgactcact gactgactga ctgacggact gacagactga
//
LOCUS       HUMSYB1A1     727 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 1 (SYB1) gene, exon 1.
ACCESSION   M36196 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     1 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 727)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept        721  +   722     synaptobrevin 1 (SYB1), exon 1
    IVS         723  >   727     synaptobrevin 1 intron A
    signal      544      549     TATA box
BASE COUNT      130 a    217 c    257 g    123 t
ORIGIN      Chromosome 12.
        1 aagcttcgtg cttgctttgg tgggacggaa ccccgcatcc ggcatctggc accgggagct
       61 tgtggctgtg tctcgaggct cctgcaccct ctcagagcgt tagtcagagg cggggcggtg
      121 gtgcggagcg agggccagag cagggcacca gggctgggcg cgcaccagga agccgccgcc
      181 tccagctgaa tgcgcgagtc ccggcgcggg gtggggcagg ggcgggcacg gtcttcagta
      241 actgcctagc gattggttgg gaggcaccac aaatgcctaa caaactgctg aagcgccctc
      301 cgaagggaca gacctcatct ccaccccgcc ccgtccctcc gccagactgg acgcaaacta
      361 tggtcgcatt gatcactgaa ggatcctcac agcaaccgct cctttccgga gtcggatgag
      421 aggagagttg tgactggcaa ttggcagggg cggggcgggc taggcctgta gcgctgggcg
      481 accgtcctgg gcatggattg ggccgcgggg ttgtcaccgt tatccgggag gcgtggtcag
      541 cactaataaa ggcggaggcc ggcgcggcag ctgcagtaag ttccagcgca cgtagaccgc
      601 ggggtggtcg gcgcgaggcg gagcttggca gttccgtcca cttcagccgc agcgtccctc
      661 gccgggtgtc tcgccgcagc ctccggagag gaacagaccc tcactctctc tgtcagaaaa
      721 atgtgag
//
LOCUS       HUMSYB1A2     153 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 1 (SYB1) gene, exon 2.
ACCESSION   M36197 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     2 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 153)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    +    22  +   148     synaptobrevin 1 (SYB1), exon 2
    pre-msg <     1  >   153     SYB1 mRNA and introns
    IVS     <     1       21     SYB1 intron A
    IVS         149  >   153     SYB1 intron B
BASE COUNT       37 a     49 c     42 g     25 t
ORIGIN      Chromosome 12.
        1 tctgcacatg tctttccaca ggtctgctcc agctcagcca cctgctgaag ggacagaagg
       61 gactgcccca ggtgggggtc cccctggccc tcctcctaac atgaccagta acagacgact
      121 acagcaaacc caggcacaag tggaggaggt agg
//
LOCUS       HUMSYB1A3     186 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 1 (SYB1) gene, exon 3.
ACCESSION   M36198 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     3 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 186)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    +    23  +   181     synaptobrevin 1 (SYB1), exon 3
    pre-msg <     1  >   186     SYB1 mRNA and introns
    IVS     <     1       22     SYB1 intron B
    IVS         182  >   186     SYB1 intron C
BASE COUNT       50 a     36 c     60 g     40 t
ORIGIN      Chromosome 12.
        1 tttgtttctg tgccctcctc aggtggtgga catcatacgt gtgaacgtgg acaaggtcct
       61 ggagagggac cagaagctgt cagagctgga tgaccgagct gatgccttgc aggcaggagc
      121 atcacaattt gagagcagtg ctgccaagct aaagaggaag tattggtgga aaaactgcaa
      181 ggtgaa
//
LOCUS       HUMSYB1A4      79 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 1 (SYB1) gene, exon 4.
ACCESSION   M36199 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     4 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 79)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    +    23  +    74     synaptobrevin 1 (SYB1), exon 4
    pre-msg <     1  >    79     SYB1 mRNA and introns
    IVS     <     1       22     SYB1 intron C
    IVS          75  >    79     SYB1 intron D
BASE COUNT       16 a     14 c     19 g     30 t
ORIGIN      Chromosome 12.
        1 ttttttgtcc atgtttcctc agatgatgat catgctggga gccatctgtg ccatcatcgt
       61 ggtagttatt gtaagtaag
//
LOCUS       HUMSYB1A5     428 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 1 (SYB1) gene, exon 5.
ACCESSION   M36200 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     5 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 428)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    +    22       38     synaptobrevin 1 (SYB1), exon 5
    pre-msg <     1  >   428     SYB1 mRNA and introns
    IVS     <     1       21     SYB1 intron D
BASE COUNT       85 a    139 c     89 g    115 t
ORIGIN      Chromosome 12.
        1 cttcctgatt cctgtgtcca gtctactttt ttacttgaga atgtacccac cccttccctg
       61 ttgtccattg ccatccacat tcatgtcctc tgccctctgt ttgctctctc aacacacttc
      121 cccacccacc gtcctccatt ccagcccagg cttctccatc acccattcct cctttttcgt
      181 tgcgttcatt tgcactctgt ccctcaacac tagaaatgct gctcgtggca caatctaagt
      241 cattacccga agagcaacag tggcgcctcc tccctgcctg ctttttctgt actctcaagt
      301 tcccccaaag ccccaaagag ttggaggcca agggaagggg cagggagggg agtggctgag
      361 gcgaagtacc catgaagctg cccagacttg ggaggagaag agtatcggtg cccatggtga
      421 cttctaga
//
LOCUS       HUMSYB2A1     954 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 2 (SYB2) gene, exon 1.
ACCESSION   M36201 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     1 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 954)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept        948  +   949     synaptobrevin 2 (SYB2), exon 1
    IVS         950  >   954     SYB2 intron A
BASE COUNT      125 a    361 c    305 g    163 t
ORIGIN      Chromosome 17.
        1 ctgcagtcgg gtcgctgacc taagcctggg gcaaagctag gtcttggcca gtcagcagcc
       61 accacgtgaa gcagcccctg cccccatcac ccccggcagc cttgtgccct cctcctcggg
      121 gtcctcagct caccccacct cagtcgggcc tgctccagcg gttaagtacc cctagtctga
      181 accttaatcg tgaggcgggg cagccccctc cccacaccct gtagacactc ccccgtcctc
      241 cgccccctgc ttttcttcct accctcctac ccagtggcca gggaccccag gcttcctctg
      301 agacatccac cagcggtgat gggttgggac caatctcccc cactcctctc caccctagca
      361 gtaaggtccc cgctctcgat actgccccag cacctgcact ggccctctct gatttcccaa
      421 ctccagccca ttccaagtgc tgaaggcggg tcgctaggtg ctgggacctg ggcgcggggc
      481 aagaccttgg gctgtgtggg gtgcaggctg ggtttactgg gttcacgatt gagggagtgg
      541 tctgggggtt aggtgtaggg ccaggctgcg ttgtgctcta aggggcaggc ggcaggcggt
      601 ttgggggggc tgtggtcaga cagggtgggg tggcggtggg tttggggaag ccgcagcagc
      661 ccgccttggg gtcccgcgag tgccgaggga gctgccgggg catggtggcg cggggcccgt
      721 ggtcctgccc ggcacgaccc ccgccccgca gcgccccgca gcgccccgcc cccgcggccc
      781 cgctcgcgac gcgtctcccg cggcgccccg cccccgcccg cagtgcccgg atgcgggtga
      841 cgcgcggccg ccatctttcc gtcccgggca gccagcgcca gtcggagcca gcgcgagcgc
      901 cgccgccatc actgccgctg ccaagtcctc cacccgctgc ccccgccatg tgag
//
LOCUS       HUMSYB2A2     148 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 2 (SYB2) gene, exon 2.
ACCESSION   M36202 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     2 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 148)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    +    23  +   143     synaptobrevin 2 (SYB2), exon 2
    pre-msg <     1  >   148     SYB2 mRNA and introns
    IVS     <     1       22     SYB2 intron A
    IVS         144  >   148     SYB2 intron B
BASE COUNT       25 a     58 c     41 g     24 t
ORIGIN      Chromosome 17.
        1 ttaatccgtc cctgctcctc aggtctgcta ccgctgccac ggccccccct gctgccccgg
       61 ctggggaggg tggtccccct gcaccccctc caaacctcac cagtaacagg agactgcagc
      121 agacccaggc ccaggtggat gaggtgag
//
LOCUS       HUMSYB2A3     186 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 2 (SYB2) gene, exon 3.
ACCESSION   M36203 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     3 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 186)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    +    23  +   181     synaptobrevin 2 (SYB2), exon 3
    pre-msg <     1  >   186     SYB2 mRNA and introns
    IVS     <     1       22     SYB2 intron B
    IVS         182  >   186     SYB2 intron C
BASE COUNT       47 a     49 c     61 g     29 t
ORIGIN      Chromosome 17.
        1 gcctcgtggg cccttctccc aggtggtgga catcatgagg gtgaacgtgg acaaggtcct
       61 ggagcgagac cagaagctgt cggagctgga cgaccgtgca gatgcactcc aggcgggggc
      121 ctcccagttt gaaacaagcg cagccaagct caagcgcaaa tactggtgga aaaacctcaa
      181 ggtaag
//
LOCUS       HUMSYB2A4      77 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 2 (SYB2) gene, exon 4.
ACCESSION   M36204 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     4 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 77)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    +    21  +    72     synaptobrevin 2 (SYB2), exon 4
    pre-msg <     1  >    77     SYB2 mRNA and introns
    IVS     <     1       20     SYB2 intron C
    IVS          73  >    77     SYB2 intron D
BASE COUNT       16 a     24 c     15 g     22 t
ORIGIN      Chromosome 17.
        1 tcctcgcctc tcacccccag atgatgatca tcttgggagt gatttgcgcc atcatcctca
       61 tcatcatcat aggtgag
//
LOCUS       HUMSYB2A5     146 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human synaptobrevin 2 (SYB2) gene, exon 5.
ACCESSION   M36205 J05611
KEYWORDS    intrinsic membrane protein; synaptic vesicle protein;
            synaptobrevin 1.
SEGMENT     5 of 5
SOURCE      Human DNA, clone pcVP18-[1,5].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 146)
  AUTHORS   Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C.
  TITLE     structure and chromosomal localizations of two human genes encoding
            synaptobrevins 1 and 2
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Sudhof, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    +    27       43     synaptobrevin 2 (SYB2), exon 5
    pre-msg <     1  >   146     SYB2 mRNA and introns
    IVS     <     1       26     SYB2 intron D
BASE COUNT       24 a     65 c     19 g     38 t
ORIGIN      Chromosome 17.
        1 tcatcttctt ccccctctct ccacagttta cttcagctct taaatccccg aggagtctgc
       61 cctgcctaga gaagggcctc tcccccaacc ctcagccgtt cctccacctc tcagccatat
      121 ctttcagccc ccactcccct ggatcc
//
LOCUS       SPICPATP     4066 bp ds-DNA             ORG       14-AUG-1990
DEFINITION  Spinach plastid genes atpI-H-F for ATP synthase CF(O) subunits
            IV, III and I.
ACCESSION   X03775 M35501
KEYWORDS    ATP synthase proteolipid; ATP synthetase; ATP synthetase CFo genes;
            atpA gene; atpF gene; atpH gene; atpI gene; synthetase.
SOURCE      Spinach chloroplast DNA.
  ORGANISM  Chloroplast Spinacia oleracea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae; 
            Spinacia oleracea.
REFERENCE   1  (bases 1793 to 2048)
  AUTHORS   Alt,J., Winter,W., Sebald,W., Moser,J.G., Schedel,R.,
            Westhoff,P. and Herrmann,R.G.
  TITLE     Localization and nucleotide sequence of the gene for the ATP
            synthase proteolipid subunit on the spinach plastid chromosome
  JOURNAL   Curr. Genet. 7, 129-138 (1983)
  STANDARD  simple staff_entry
REFERENCE   2  (bases 1 to 4066)
  AUTHORS   Hennig,J. and Herrmann,R.G.
  TITLE     Chloroplast ATP synthase of spinach contains nine nonidentical
            subunit species, six of which are encoded by plastid chromosomes in
            two operons in a phylogenetically conserved arrangement
  JOURNAL   Mol. Gen. Genet. 203, 117-128 (1986)
  STANDARD  simple staff_entry
COMMENT     EMBL features not translated to GenBank features:
               key        from     to       description
            
               OPR        1369   1414       pot. stem-loop structure
               OPR        2049   2079       pot. stem-loop structure
               OPR        3775   3799       stem-loop structure pot.
                                            transcription terminator
               INVREP     2319   2326       inverted repeat A
               INVREP     2334   2341       inverted repeat A'
               INVREP     2522   2534       inverted repeat B
               INVREP     3348   3359       inverted repeat B'
               SITE       2602   2604       pot. translation stop codon
               SITE       3312   3314       pot. translation stop codon
            
            Between positions 2584 and 3340 six stem-loop structures I(1)-I(6)
            are described. They designate secondary structures resembling those
            of class II introns
FEATURES       from  to/span     description
    pept        357     1100     CFo subunit IV (atpI)
    pept       1793     2038     CFo subunit III (atpH) proteolipid
    pept       2434     2578     CFo subunit I (atpF)
               3343     3752     CFo subunit I (atpF)
    pept       3818  >  4066     CF1 subunit alpha (atpA)
    pre-msg    2434     3749     primary transcript of atpF gene
    IVS        2579     3342     put. intron (no splice consensus)
    binding     348      351     atpI ribosome binding site
    binding    1780     1783     atpH ribosome binding site
    binding    2423     2426     atpF ribosome binding site
    binding    3810     3813     atpA ribosome binding site
    signal      206      211     atpI -35 region
    signal      241      246     atpI -10 region
    signal     1693     1698     atpH -35 region
    signal     1710     1715     atpH -35 region
    signal     1724     1728     atpH -10 region
    signal     1744     1749     atpH -10 region
BASE COUNT     1362 a    591 c    777 g   1336 t
ORIGIN      
        1 ctatttgttt aattgataca aattgtaacc cggatctcgc agctctttcg attccggcaa
       61 atgatgacgc tatagcttca attcgattaa ttcataccaa attagtattt gcaatttgtg
      121 aaggccgctc tagctatata agaaatcctt gattcataat aaaatcaaaa attgacttcc
      181 taaattctct atcggttact agattttgaa tctcaaaaac tagttcaaaa taacagggga
      241 tattatgtaa ttaatcagta tccgaaatag aaaattttaa ttaaagtaga caagtcgaga
      301 aagagctgat tgaatcaaaa taattttttt aagttatatt tctgtaagag gacaatatga
      361 atgttctatc atattcaatc aacccgctaa aggggttata tgctatatcg ggtgtggaag
      421 taggtcaaca tttttattgg caaataggag gtttccaaat ccatgacaag gcccttataa
      481 cttcttgggt tgtaattgct atcttattag gttcagctgc tatagctgtt cggagtccgc
      541 aaacaattcc gactggtggt caaaattttt ttgaatatgt ccttgaattc atccgagacg
      601 tgagcaaaac tcaaattggc gaagaatatc gcccgtgggt tccctttatt gggactatgt
      661 ttctatttat ttttgtttct aattggtcag gggctctttt accttggaaa atcatacagt
      721 tacctcatgg ggagttagcc gcacccacga acgatataaa tactactgtt gctttagctt
      781 tactcgcgtc agtagcctat ttctatgcag gtcttacaaa aaaaggatta ggttattttg
      841 gtaaatacat tcaaccaact ccaattcttt tacccattaa catcttagaa gatttcacaa
      901 aaccgctatc acttagtttt cgacttttcg gaaatatatt agcggatgaa ttagtagttg
      961 ttgttcttgt ttctttagta cctttagtgg ttcctatacc tgtcatgttt cttggcttat
     1021 ttacaagtgg tattcaggct cttatttttg caactttagc cgcagcttat ataggcgaat
     1081 ccctggaagg tcatcattga tttgtcttag gaagagttta tctcctagtt tagatatatg
     1141 tgtatgtgtg gctcaagata ctctataaag ataatctatt tagagcatat aaatatccaa
     1201 atacatacag tctagtggta atagaaaaaa cgatattcga gaagtgtaaa aaaaaaagac
     1261 gttggttagt cgagagggga tacccctgtt atatggaatc taatgactat aagctaattc
     1321 ttgcagattc gatgtttcga agaatgttca aaaattcgat tgaatttcaa atataatagg
     1381 cggtttacgt tatgtaagaa acctatgtat attttatatt agatattgac aagttatata
     1441 tgaaagaata tttaatttgc cctacttgaa tttggataga gacaccaacc gacgaagtcc
     1501 tttcctattc gtttatgact gcgaattgaa tggataaaca gacaaaatat aaaaaaaggt
     1561 cgaagaatga ttagaaaaag aaaaaggaaa tggaaaaact caagttgtat tgattcagaa
     1621 agactcaaca atataggaac taaaaaaatc aagtctttct aattatttaa taatattaat
     1681 tattatttca actggataaa tattagcaat ggaataatta agtcataatg cattggttga
     1741 ttgtatcatt aaccatttct tttttttttt tgtgtgtgtg aggaacttta tcatgaatcc
     1801 actgattgct gccgcatccg ttattgctgc tggattggct gtagggttgg cttctattgg
     1861 acctggagtt ggtcaaggta ctgctgcggg acaagctgta gaaggtattg cgagacagcc
     1921 cgaagcagaa ggaaaaatac gaggtacttt attacttagt ttagcgttta tggaagcttt
     1981 aacaatttat ggattggttg tagcattagc gcttttattt gcgaatcctt ttgtttaatc
     2041 cgaaaaggaa aagaaatagg agaaatacac atttctttta tagtcttgaa cttgcaggtt
     2101 gctttttcac atttataaga aaatatcgct cctacacaat tacttattcg ttgagaaaat
     2161 aatacacgga aaggacttaa tttgaggatg aagaattcgt gttacccact cggtttcttc
     2221 tttcctcccc ctttttagtc cgaaggagaa gtgttgcaac aaagagagta tttcgaaatt
     2281 cacatgaaac ctagtaccta attagtaatt ctataaatcc aataagtatt attcttattg
     2341 ggaatctcaa taaaaaaaat aaaattcatt tagaagtagc aaacaattga aataatacaa
     2401 cgattttttt gtttatctat aagaggagat catatgaaaa atgtaaccga ttctttcgtt
     2461 ttcttgggtc actggccatc cgccgggagt ttcgggttta ataccgatat tttagcaaca
     2521 aatctaataa atctcagtgt agtgcttggt gtattgatct tttttggaaa gggagtgtgt
     2581 gcgggttgtt tatttcaaaa ataggttgga ttcaaccaac tgtaccgctt tttttaatta
     2641 gggcgaaagg tgcatgattt cacgaatgac ttctgaaatt aataataaag aaatcatatg
     2701 taagaactag agcatttcgt gatttgttgg taaatatact ttgattctct atcaaccaat
     2761 aatgtgggac cataacatgg ttaaaactta aattgtttga agtccaggca cagcagggta
     2821 ttctttctac caccatgtta atacttaaat ataccgaata ccgaggctaa aattaaataa
     2881 agtaaagaaa catagttaca aatttttcga tatataacac tcatgtcgat aaaattgttg
     2941 gaatctctta tttattgtta taggaaaaat gtttcggctt tttttttatt aagtaaataa
     3001 atgccaaagg ctgagtcgat gacctacgta taaaataaga aacatttttg gatttgaata
     3061 aaaaaaaaac aactttgctg acaattactt atatattttt ttttgggggg tcagaagagt
     3121 tctccgaata ttctagtctt gattagtgat gggtttccat tttttgtcga ctatgaacag
     3181 agaagagagg ataagttcat tacattcaaa aaagatatgg aaatttgcca taaaaaattg
     3241 aagtaattga gcgtgagagc caaatgaatt gaaaaattca cgtttggttc gggaagggat
     3301 catgaatgtt ttgaaatgaa tggaaagata atctactttc attaagtgat ttattagata
     3361 atcgaaaaca gaggatcttg aatactattc gaaattcaga agaactacgc ggaaaggcca
     3421 ttgaacagct ggaaaaagcc cgggctcgct taaagaaagt cgaaatggac gcggatcagt
     3481 ttcgagtgaa tggatattct gaaatagaac gagaaaagat gaatttgatt aattcaactt
     3541 ataaaacttt agaacaattt gaaaattaca aaaacgaaac cattcagttt gaacaacaaa
     3601 aagcgattaa tcaagtccga cagcgggttt tccaacaagc cttacaagga gctctaggaa
     3661 ctctgaatag ttgtttgaac aatgagttac atttacggac catcaatgct aatattggca
     3721 tgtttggtgc gatgaacgaa ataactgatt agtccttcta tttaattgta ggtattattt
     3781 ttttgtttca aaaaaaaaaa gaatttaaga aagactcatg gcaaccattc gagcagatga
     3841 aattagcaaa attatccgtg aacgtattga aggatataat cgagaagtaa aggttgtaaa
     3901 taccggtaca gtgcttcaag tgggtgacgg cattgctcgt attcacggtc ttgatgaagt
     3961 aatggcaggt gaattagtag aatttgaaga gggaacaata ggtattgctc tgaatttgga
     4021 atcaaataat gttggtgttg tattaatggg tgacgggttg atgata
//
LOCUS       DRVSEV1      4245 bp ds-DNA             INV       14-AUG-1990
DEFINITION  D.virilis sevenless gene, exon 2.
ACCESSION   M34543
KEYWORDS    sevenless protein.
SEGMENT     1 of 3
SOURCE      D.virilis DNA.
  ORGANISM  Drosophila virilis
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (sites)
  AUTHORS   Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
  TITLE     Comparison of the sevenless genes of Drosophila virilis and
            Drosophila melanogaster
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5351-5353 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 4245)
  AUTHORS   Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by G.M.Rubin, 24-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept    /  2804  +  3629     sevenless protein, exon 2 (AA at 2805) (5' end
                                 is putative)
    IVS        3630  >  4245     sevenless protein intron A
BASE COUNT     1103 a    957 c    922 g   1261 t      2 others
ORIGIN      
        1 tataagtgtg tcgtattgtt ggccaataat tagccgagaa tgtcaaatca cttgctgtta
       61 attgatgttt atcgagagta gagtcgaact aatcgttgtt acagcttggc tctgcttgtg
      121 ctttgaatag tgtactttga gctgcataaa ttcagctgat agagttggcg cattttcggt
      181 ggcacttcaa gcattatgtg taactattcc agatagatac ctttgtgctc tggaatcgcc
      241 catgacgctg ggtaaatggg gctagctggc tacgtcaacc gaagtttcgt cgcatctttg
      301 tcgttgttga tcgtcttgct ctggcttgcg acttcggctg gggttgtttt tatactctgt
      361 acacatggaa attggacaaa tggggtataa tgggcagacg gtagcaactc gtaaaccatg
      421 aatttcccat atgtatttct gatcgggaca aagagctgcg tcggcctggc tatgtctgtc
      481 cgtccgttaa gataatgaaa attcaatact taacaaacgc agttccagct tttcttggaa
      541 gtttttaaga aatggaagaa acttttatat ttgtgtgcgg ctagagtctt acgcttgggc
      601 ttacacacac ttcttatgct ttatattaag cggcgaaatt tcataacgat cggactataa
      661 ataccgaagt tctgccaaaa cacgttgact ttgatgtggg tgctaggcaa gcaagcggca
      721 tgctttgggg tgtttgtttt tttttgtgta caatctttgt aaacgacagc ttgtcgcttt
      781 gtccagggta acgcttagtc ggtcaattct gatcagagca cttacttgtt tttggagctg
      841 tcgcagtcgt tgtcgcagtc gcaatcgccg gttgccgctt gtcgtttgtc gtttgccgtt
      901 tgccgtttgc cggttccatc agttaagcga cgatcgcggc acggttaagt acgcagcgct
      961 cagatataaa tacatagcca caggctgcag atacatttgt ggatataaag atacggctgt
     1021 gacatattag ccgcgcgact ctctgaagta gcttttgaag ttcaatcccg tcccgcactg
     1081 tcccgcttcg cccaatcttg caaaaagggc tacaataaaa tgcaattaag ctgctcgaat
     1141 caattctcag caaattgaat taaagcgcgt gaagagctcc tctctacaca cacacacccc
     1201 ccctccaccc ctccacccac acccataagc aataacaaca agcccctgat atgtgggcga
     1261 gtgttatccg aaactcgaag tccatagcct cgagctcgag aactttgaga actttgccca
     1321 cggagtgctg tgaaaatata caaaatgcag atgggcataa acagaaatta acgccaaacg
     1381 gaattgctcg ctcatagaat aatgcgaata attaattaat tacgggcaat tttttgcaca
     1441 attttttttt ttttttttgt tttttgatgc aattaaaaat gttttgtgtg catattttct
     1501 ctggcatttt cgcgcgctta attagtgatt tatttatttt atttatgcat gtttttcttt
     1561 tttttttttt tggttttcat ttttctcttg tgctgtgtgt gtttttggag ttaattaata
     1621 aagcggcatt ttaacatttt ttgttatttt atgtgcgaat cggcaaatca attgcatttc
     1681 attaatatat tcctccgacg gaatttatag cgattccatg cggtttcttc ttaatctgca
     1741 tttatgcatt tgtgcggaga aagtgatccc cgtcccaccc aaagacccgc gctctatcac
     1801 tcgcacctaa atatagtttt tcccataagg cttaaattgt tgagggggga cttttgtagt
     1861 ttttgtggtt tgttgtgatt atttgatgat tattttcata tttattgatt atacacccaa
     1921 aaatgttgac aggaatctcc cgatttgccc gattgtattt cactttcccc ttattcagct
     1981 tgcaatcgaa gctagaacct tgtgcttgga tatgtgcaca tatttgggtg tcatatttca
     2041 gtctcgaaan tttcatatag atcgggcaat atgcagtgca gttaacagag aacggggacc
     2101 gaagccaagc cgtggtgtgt ctctataggc agtagctaga gtctgatccg aactccaatc
     2161 tgtgttctct atataaaaag ttctccgatt ctttgccaag tatctgaatc aattgcaagc
     2221 tttttttttt tatgctcagc agttgcgagc aggagacaaa ccaaacttta agccgggctt
     2281 ctatttcagg agattccaat tccaattgtt tgcgtagatc gtaacagatc aaaaataggt
     2341 acattgaact gttttggggc aattggttta gtttattatt gttgcgtgtt tgtcgattaa
     2401 cataatatgt taattcattt atatgcatgt gccgagacgt ttctgatttt tcgagttttt
     2461 cgttttcaat ttcagcaaat tgtgcgctaa aatgtaaatg caaaaacgtc gagccgagct
     2521 tcttcaaatc gaaatcgaac gacaacaaat tagaaaccag cccagagaca gtcgcacaca
     2581 cacacacaca aacatacact cgagcacaca cacacactcg gacacgcaca accattcgca
     2641 cacacacaca cacacacata ccgtggtcaa agtcagagtc ggacggacgg cagctccggc
     2701 cagttccgac cgccggaacg cagagtccca cttgccactt ggcattgccg tggcgtaggt
     2761 cgagcttaat ttttgtgctt cgatgaacat ttaaattgtg gactatgttt tggcgagaag
     2821 atgcagcaca gcagcagcag caacagcaac aacaacaaca gcaacagcag cagcagcaac
     2881 agccgccgca tccacccaaa cgcttgagct ttagtttcaa tgttaaaatc gctgtcaatg
     2941 tgaacaccaa gatgtccacg actcacatca accaggagcg ctccaagcag caaacaacaa
     3001 ccggatctcg atctcgatct cgttccaatt ccaattccag tgtcagttgc aaaggcgacg
     3061 gcgatcgccg tgttcggcgg cacacaacgc ggcttgtggg cctgcgtcag cagctgttgc
     3121 atctgggacg tcagctaaac ccgggccaat tcttggtcac gggtcacggc ggcatctcca
     3181 cgatactaat tgccaatttg ctgctgctgc tgctgctctc gctctgctgc aatgtctgtt
     3241 gccgctccca catcgagccg gatcaaaact tgacaccaac aacaacatca ccagctgctg
     3301 ttgcagttgt tccgatgttg ctgcccctgg cccagaccca tatgcgaccc cagctggaca
     3361 gcgatgtggt cgagaaggtg gccgtttgga caaagcatgt cggtgcggcg ccacccagca
     3421 tcgccgaggg cattgccatc agcagcgtgg tcagaatgcc acccagcatc caaaccccaa
     3481 ccgaaactgt gcgcaggcaa gaacaacaac gacaacagca gcaacagcaa caagaggcgg
     3541 ctgcggctgc ggctgccgac gccgccatcg atgagcgcat cgtactggag cgcgtcaccc
     3601 gcgactgtgt gcagcgctgc attgtggagg tgagtcgcta gttgattaac gggcggttta
     3661 aactataact aaatggaatt gttattgtta ttgttattgt gaccttgagc aacgtgcttg
     3721 tgcaccgtaa gtacagtttg gtatgtccgt tattccatac ttcctggcaa agcgcatgtt
     3781 agttaacaga atccttcaac ttatcgtaac ttacttgctt tgggtgtccc aatgcacttg
     3841 acttgccttg acagttcttt aatactctta aggactttct agttattgta gagctttgct
     3901 tgcactaaac taacttgttc agatcgttct aaaatctttc tggcaaagct catgttagtg
     3961 aactacagca cacttatcat aaagctgtag ctctggcaag caaggcttta agggcttaag
     4021 acttaaagag tttctagtgc tggtatatca atgaagccat aaagcttttt tataaactta
     4081 acttgttccg acatttacat tatacttcct ggcaaagcgt atgttagtaa aatgcagcat
     4141 tttaaacgta tcgtaacttg ctctggatat caacatacgt ggctttaggc catagattgt
     4201 gctagcaagc ttaagaattt aacattaacg ctccganaaa aagtt
//
LOCUS       DRVSEV2      5687 bp ds-DNA             INV       14-AUG-1990
DEFINITION  D.virilis sevenless gene, exon 3,4,5,6 and 7.
ACCESSION   M34544
KEYWORDS    sevenless protein.
SEGMENT     2 of 3
SOURCE      D.virilis DNA.
  ORGANISM  Drosophila virilis
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (sites)
  AUTHORS   Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
  TITLE     Comparison of the sevenless genes of Drosophila virilis and
            Drosophila melanogaster
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 5687)
  AUTHORS   Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by G.M.Rubin, 24-MAY-1990.
FEATURES       from  to/span     description
    pept    +  1897     1971     sevenless protein (sev), exon 3
               4050     4115     sevenless protein, exon 4
               4177     4512     sevenless protein, exon 5
               4574     4810     sevenless protein, exon 6
               4867  +  5186     sevenless protein, exon 7
    IVS     <     1     1896     sev intron B
    IVS        1972     4049     sev intron C
    IVS        4116     4176     sev intron D
    IVS        4513     4573     sev intron E
    IVS        4811     4866     sev intron F
    IVS        5187  +  5687     sev intron G
BASE COUNT     1665 a   1222 c   1168 g   1629 t      3 others
ORIGIN      
        1 ccaacagata taaccatgtc cctctgtcct tttatctgtc accaagcgta ctagtctctt
       61 agttttaaag ctattgtctc gaaacctctc tgttgcaagc agtatatatg ttagtactgg
      121 atcggattgg atcgaagttc ttgtaagaaa aacttctttt caagatatct tgaccaaaac
      181 cgtcctctac ttgttaaact atgctccttc tatataagct caattatatg taatttccat
      241 ccttaagata ggtaaatgat caaaatgcga tgcccaaatg tttataagct ataaacttat
      301 ctaaagtctg gcattttgta gacatttgag ctgtttataa gttcagttta agctgagata
      361 atatgaaaac ttgaatataa tttagtattc ggttggactt tgtattaagt tttgacataa
      421 ataagggctt tttttatagc ttgttgtttc gcagctttaa tgctgagaca gaccataacc
      481 tgcgcttgtc tgttagaaat ttgtctgtta aagacctttg atccattatt ttactatgta
      541 ttactaggtt aaccaagatt cgagaagctc tcgagttgtt tgccttcgct ctgaattggc
      601 atgtcgctat ttttggctgc ccctcgccgc aacaacaatg aatcgatcgt ttgcggaccg
      661 agtcggagat ctctttacga tcgatcacgt gctaaaagcg aaaagaaaga gaaaaaaagg
      721 cagaaactcg tttggcgtga cgccgttaaa tgtcgatttg gtcaatagat ttatgcattc
      781 gtaatgcgcg aatatctcag aggttgttaa aatgcctcta aagtcaataa tcaggtctat
      841 gtttgagttg gcggctgtca cttgagaaaa gtagctaaat caaatcatca aaatagctat
      901 aaaaagctgc attacctaag cagagtgatc tacaaaagcg ctgctgcttt ttgcggagtt
      961 ttcttttcaa aaaaaagtta gactaaataa attagctaca aaatagccag aactgcaggc
     1021 aatggcaaat atgacaagtt tcgtgtgaaa aacagcccat tctagcttgt aaatgacagc
     1081 acaaaggtcc cagctctatt tcggcagtaa gccagaaaaa gggcacttct ttgttttctc
     1141 gctgaaaata gccaaaatct agttttgaaa tagccataac tgactataaa atagccaaac
     1201 tagcagacac ttgcaaatgt cagctattat caagttgggt agttgggtaa aaaaactcta
     1261 gctagaaaat ggcaaaattt gttcgttgcg gatagggaga atagccaagt ctcgctataa
     1321 aatagctaaa ttgacaacac aactctagcc tttttttttc aataaagcta gaacaagggc
     1381 taaatcagct aaaaaagaat ttccggtatg caaatagaaa attctagtta aataaagcag
     1441 aaatagtaat atactagctc tagctctttg ctctatttag cttgaaaatg gccaagtata
     1501 cccaaaaaaa aaaatagctg atttctttct ttgctgtaat cagttagaaa atagtctagt
     1561 ctagctagaa aatagtgaat tgctaacata ctgatccaag ctcactctct ctgttttgct
     1621 tatatttagc ttgaaaatag ctatcttggc aatattggct gtagccaatt ctcgccctgc
     1681 ctgctaccga tctgcctttt cacattgctt ttatctataa atgttaaaga ctgtgaattg
     1741 attcattgcc attgacattg aattgaaaat tcttcccacg atttggattt cccttactct
     1801 ttctttcctt gcccctgctt gctgcggcat ttccgttgca aatcggggct cttttgtgct
     1861 aattgatttt cacttggatt tctttctttt ttccaggagg atctgtttct ggacgagttt
     1921 ggcattaaat gcgagaaggc cgacaacagt gacaagtgct ataaaacacg agtaagtctg
     1981 ccccccaaaa accaacacgt ccccctcgcc gaaacggtgt caagtgtttt gacaataacc
     2041 ccccctcaag catgccccat tatccccaac taagcaaaag gccacaaagt cgcgtaattg
     2101 gaaaccacct ttcaacttta ccagacaacg tgacgacgga ccgagcgacc gaccgaccga
     2161 atgacccata aaaaaaaaca agaacaaatt cccagaaacc agaaacgaaa cgtatttacg
     2221 gctcaaaaac cctgacaaaa aggaatatac ttaagcaaac gcacgaaaag aatggcaatg
     2281 tctgtcttca ttgagccgaa gattaaatac ccttttatac agctcgtaat tgatgagaat
     2341 atcttgcaat ggattctacc ttaccatagg atctatatga tatagtagtc caatgataac
     2401 agaaagcaag gatatctctc acnagtatag aaaaagattc tctagctgtg cgatataaat
     2461 ataaatagaa ncccaacaca aataactttt gcctgtgaca tggctattca caaatgccat
     2521 atgccctttt gatacacact tgcaacagtt acagggtatc aggagtggca agcgagggat
     2581 agccagcggt aaggccatca ggcacttgtt aaggaaaaat cataaacaca ttcgatatcc
     2641 aagccagttc tgcctaacat gtatttcgga ttaaccacag ccagttgagg ttgggagtag
     2701 ggagtgggat cgacctttga tgtgattaag ttaccaagca acacagctga gcagcttttt
     2761 actggctttt aattgtcgcc atgctcgtga gtttagctag ttgcatacga actcaaagtg
     2821 catcaggcgc ggccaatcgt gctccagctc cagacagcaa aggaaaccgt cgaacaacaa
     2881 ttgtaaacgc tgctgtcggc tctcaattag tttgcttgca aggcatcgga tccggcggtt
     2941 aatttccgtt tgagttttac agcaatccat cagggtacaa tagtttatct agaaattagc
     3001 tgtgagcttg gctagcgcta agagacttac agactcgtac ttacctaata ataacatgga
     3061 attttgctat cacgaaatac aagagtagtt attgaacata aaatatattt tcagagcata
     3121 gtctaacagc tcactagtta tctgaattgt gtacgcccaa aggtaaatag tgttgagtgt
     3181 gaaatagttt aaatatgggg aacagttggc tctgcttttt atacccttgc agagggtatt
     3241 ataattttgt cgtgaaatgt gtaacgcata gaaggagaca tctctgaccc cataaagtat
     3301 atatattctt aatcagcatc aacagccgag tcgatatagc catgtccgtc tgttcgtctg
     3361 tccgtctgtc tgtttccatg tcgaactagt ccctcagttt taaagctatc ttaatgaaac
     3421 tttgcagaac tccctctttc tgttgcactc agcacatatg tgaaaaccag ctggatcgga
     3481 ccactatatc atatagctgc cataggaacg atcggtcgaa aattaagttt ttgtatgaaa
     3541 aacattttgt tatcaagata tcttgaccaa actcggcatt tattagtctt actatgctcc
     3601 tcatatatat gcaaaatctt attaagatcg gaccactata tcatatagct gccataggaa
     3661 cgatcggtcg aaaattaagt tgtatgaaaa aacattttgt ttatcaagat atcttgacca
     3721 aactcggcat ttactatttt cccggtactt cttagatagg ggcaaagcac tatgagcatt
     3781 atgaaaaggt tgggtctgca agggtattag atctttggcg tgccgaagat agcccttctt
     3841 tctcgttttt actggataag caaacgagtg cgaggcaaac gagggtatct cctagactac
     3901 ttttgtagtg cactgatatt aatagctttt taaggctgat taaactttca gctggcttta
     3961 gataagcctc actttactat gcacctcctg tcttttccct gatgggccaa ctcaacctca
     4021 actaatctga tcgtaatctc tctttgcagt gcaacaaggg ctgtgcgcaa tggtatcgcg
     4081 cgctgaagga gatcgagccc tgccaggagg cctgtgtgag tgatatctga cagctgccgt
     4141 cttaatgata gcttcttaac tgcattattt tgataggctt ccacacagtt ttatccgtac
     4201 gatatgccct gcattggagc ctgcgagacg gcgcagcgcg attattggca tatgcagcgg
     4261 ctggccatgg ctcggttggt ggagacaacg cagccgcagc tgttggagat gacggatgag
     4321 tcgagcactt tgaccatcaa atgggccatg cagttcccgg agaactattt ggccagtcga
     4381 ccctttaata tacaatacca gcaagtggac aatcaatcgg aacctgaatg gcacaatctg
     4441 gcggactacg actgcgatga atactacgta tgcgaaattc tggaggcgct tgtgccctac
     4501 acccgctaca aggtatgtcc atcatatatt aagtgaaata tctctggaga tttaaatttg
     4561 aactcttttt cagtttcgct tcgagctgcc atttggggag agcagcgaag acgtgctcta
     4621 ttcgccagca acgcctgttt acgaaacgcc gatggagggc gcccccatct ctgcgcccat
     4681 tatcgtggca ctgctggcgc tggatgagca tcatgtgttc gtgcactggc gtcccggacg
     4741 ctatagcaat gcgcccattg aaggctatcg cgtgctcttg accagcgcag gcaacaccag
     4801 ccgcgaacag gtacgtaacg agctccaact gccaagctgc caaatctaaa cgctcatctc
     4861 taccagctgc tgcctgctca acgtactagc tgcatctttg cccagctgca gccgctcacc
     4921 aactacaccg tggcgctgac catgatcaac aaacagggcg agggaccgtc tacggtggtc
     4981 agcattgtga ccaagtcgcc cctggagcca cagcagctgc agagcgtgct gcttgcgagc
     5041 gagcacagca tcatctggca atcactggag ccagcgggcg agacacgcct cctctacacc
     5101 tctgagccgg cggccattag cgattttacc ttttcgcagc gcgagcagcg gctctggctg
     5161 ctggacgaat tgggtcagtt gcacaggtga gcaattggcc atttcagtcg ctcttcaatt
     5221 atcttttcga gagtgaagat tatttttttt tgtaggattt cgaggtgaaa ttacgtaaag
     5281 tttctacagt aggaaaagct ttctagctag atcatcagtc agtcagtcat atatggatgg
     5341 ggagctgcag ttaagctcta gccaagacca gattttgttc agcctcttta gtcacagtta
     5401 aatcagtcag tcagtcaatc actcagtcag tcgaatgaat atataattta gtacgatcaa
     5461 ttgtgactgt tccttgatca tcagtcagtt agttattcaa tcngtttaat caaaaatcaa
     5521 tcagtaaagt aaattggcta ctacagctac aggttgtcca aaaaaatgga tagtttagga
     5581 tacattcaat cagccgtcaa tcaatcagtc agtcataata tgatatactt ctattatgcc
     5641 tctctatgag ttcagctaca gctaagacca actggggtgt gtgttga
//
LOCUS       DRVSEV3      8779 bp ds-DNA             INV       14-AUG-1990
DEFINITION  D.virilis sevenless gene, exons 8, 9, 10, 11 and 12.
ACCESSION   M34545
KEYWORDS    sevenless protein.
SEGMENT     3 of 3
SOURCE      D.virilis DNA.
  ORGANISM  Drosophila virilis
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (sites)
  AUTHORS   Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
  TITLE     Comparison of the sevenless genes of Drosophila virilis and
            Drosophila melanogaster
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5351-5353 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 8779)
  AUTHORS   Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by G.M.Rubin, 24-MAY-1990.
FEATURES       from  to/span     description
    pept    +   340     4973     sevenless protein (sev), exon 8
               5836     6113     sevenless protein (sev), exon 9
               6174     6365     sevenless protein (sev), exon 10
               7286     7722     sevenless protein (sev), exon 11
               7791     8175     sevenless protein (sev), exon 12
    IVS     <     1      339     sev intron H
    IVS        4974     5835     sev intron I
    IVS        6114     6173     sev intron J
    IVS        6366     7285     sev intrn K
    IVS        7723     7790     sev intron L
BASE COUNT     1930 a   2456 c   2435 g   1958 t
ORIGIN      
        1 tctggtagat gagcgttaga ttggcagctt ggcagttgga gctctcaagc tctatatata
       61 tataccattt gtcctgactg attgactgat tgacggactg attggggatc taaatttccc
      121 actcccataa ctcggccaaa actcaaccga ttttcatgag gcttggcttt ttattcatgg
      181 tttggcctgt atatcaattt ggcattttca aagatccttg agagaattca ttttataggc
      241 gccacgggca agggcgggtc taccggctgg tccgtcataa gtacatcgcc caatctctaa
      301 gctctacttc agtttagact tgcaattttc tgtttacagc cagctgctgg atgagacgac
      361 gacaagcgct gcccggcgtc tgcggctgga gctgccctca aatgggagca gccagtggac
      421 gccacgcaag ctgagcctgg actggttgca gcgacgcctc tacatagcag cgcaggcgaa
      481 cagcagcgac ggggcagaag gaggatttga gctgtttagc agcaatctgg agggcggcga
      541 tgtgcagatg gcgggcgtgc agctgggcct ggtcgtggag cagctggagc tggacgccct
      601 gaacggctgg ctcttctggt gcgatgcaga cagcctctgg cgtctcgatc tgagcagcaa
      661 gcagcagctg cgccttacac agcccgcagg agcacccgga cgtttcatgt tggagccgca
      721 gcgctggctg ctgcacgtgc tgctgccgca ggagaatcag ctgctggagc tgagctacga
      781 tggtgggcac aagcatgcgc tggcactgtc caacgacagc tggcgcggat ttgcctggtc
      841 cagcgatcag gcacagctgc tactggcgaa tgagacacag ttgcagctgc tggacggaca
      901 gacgctggtg ccgcttgcca actggtcgcc ggatggcggc tgctgcgctt tgctgcccct
      961 ggaacgtcgc cgccagccgc tgtcactgga gccgccggcg ccgcgcgagt tgcgcgccct
     1021 gctaggagcg cagggcgcgc acataacatg gcagccgccg gcagctaatc cctaccagac
     1081 tgcaacggcg gccgcgcgca atttcagcta cgagctggag gtgctcgacg tggccagcca
     1141 aagcgcctac aacatacgca acatacgcgt gccccacttt ggcctggagc gcctgcaggc
     1201 ggacaatctc tatcagctgc gcgtgcgcgc aaacaatgcg gccggccgtg cgggcgtgtg
     1261 gacagcgccg ctggcgacgc gcacctggcc cctgggcgat catcgtctgc gctgggccac
     1321 gcagcgcggc tcgctgtaca ccacgaacga gctgggcggc cagctgcagc cgctgcccgt
     1381 gcagctggca tcgtcgcccg ggccgctagc cctggtcaat gccagcgtgg cgtactatgt
     1441 cagcgggcgg gagcaatcgc tgcactgcgt caatttgctg cagccgcagc tgagctgcac
     1501 cgacgagcgg ctggagcatg taggcgccgt tgcctatgac tggcgcggcg gtctgctcta
     1561 ctggacggat ctggcgcgcg attgtgtcca gcgtctggat ccgttcagcg gtgagcgcga
     1621 gctgttgccc atctttggcg cccgccattt ggccctcgac tcggcccagg gacatctcta
     1681 ctactcgagc tcggcgcatt tggcgcgccg ttcgctcagc gctctctcca cccatcagcc
     1741 ggagctggag tactatcatg tcaatggact ggccggccag attagcggct tctgtctgga
     1801 tctgccgcag cgacacatct actggcttgt cgccggcaac tcggcgctgc atctctatcg
     1861 cacagcattg tcggcgggcg gcagccaggc cgccgtccca ctgcagctgc tgaccacgct
     1921 gccggcggcg gacgcactgc cgcatacgct gcagcatctg gcgccgctgg gtgcgctgct
     1981 ctggctggcg gcggatggcc ggggcgcgca tctgctgcgg ctcgcggccc aactggagac
     2041 ggacacggat actatgcgtc tgctgccgga gggcctggtc gagccgctca gcgccgtgca
     2101 gctgctggag cgttcggctg ggccgccgcc accgccgccg gacgagggcg tgcgtccgct
     2161 cgccgtgccg ccggacagtg tgcacatcga cgagggcggc cactggaatg actttcgtgt
     2221 gcgctggcag cctgctgcca gtggcggcaa tcacagcgtc tgctacaagc tgctgctgga
     2281 gcacggcagc gagcgtctga tcacgctgga gctcctcacg ccctttgccc gcatcacaca
     2341 actggcccag gcgccgctcg gtctgcgcat cagcattacg ccgcacaccg cctggcgcgc
     2401 cggatccacc acccgggtgc agctggacac gcccgttgcg gcgcccacgc agccgcggcg
     2461 gctgcgcgtc tttgtcgagc gtcaggcggc gccgctgcag ttggcgccga atgtaagcgc
     2521 tctgctgcgc tgggatgtgc ccgaggagca tgcgggcagt cagtcgctgc agtatcgcat
     2581 cagctgctgg cgcggcagcg agttgcattc ggagctgctg ctcaatcaga gcacattgga
     2641 ggcgcgcgtc gagcatctgc agccggagga gacgtatcgt tttcaggtgc aggcccatgt
     2701 ggccgccacg ggcctggccg ccggcgccac cagccatgcg ctgcacgttt cgcccgaggt
     2761 gcaatcggtg ccgcgtctgc tctatgccaa tgccgagcac attggcgaac tggatctgga
     2821 tacgggccat cgcaagcagc tggtgcacac cgccagtcct gtcgagcatt tggttgtgct
     2881 gcagggcgag cagcgtttgc tttgggtcaa cgagcacgtc gagctgttga gccatgtgcc
     2941 gggtaaggcg ccggccaagc tggcacgcat gcgtgccgag gtgctggccc tgaccgtcga
     3001 ctgggtgcag cgcatcgtct actgggccga gctggatgcc gcggacggcg gctgtgtcat
     3061 ctatagcctg gatctctgtc gcttcgatgg acgcattctg cagggcgaac gtctctggag
     3121 cacgccgcgc ggccagctgc tgcgcgatct tgtcgccctg ccgcatgccc gccagcttgt
     3181 ctggctgcag cacgatctgg actcgcgcaa tgccacgctg cagggtcgca gcctggccaa
     3241 tggcagcgcg ctcacctttg agggcgtcac gttaccgctg tggcgcctct tcgagggcag
     3301 ccaggagccg ctggccgaga cccttaatct ggtggatcat ttgggccgtc tgtgcgttta
     3361 ccatgtggcc cggcaactgt gcacgtccag cgcactgcga gcccagctca atctgctgaa
     3421 cgatgatatc ggccagctgg cccaggatcc gggctatctg tatgctctgc gcaatggtag
     3481 tgtgcgcgca tatggccgcc gccggcagca actggaattt ctgctggaac tgcagccgga
     3541 cgaggtacgt ctgctgcgcg cctacaacta tcaggcgtat cccagtcgtc gctgcctgct
     3601 gctgcccacc acggctgccg ccctcgaatc aacgccgtcc agctgcgagg agacgcagtg
     3661 cagcctccag ctgccggcgc tcagcgccgc accagactgc ccgctgccgg tgcccgggct
     3721 taactatcag ctgaatctta gcagcagcag cagaagcgca cagctggagc tgcgctcgct
     3781 gcactccgcc gccggcctga cgctcaacat aagtcaactg cagccctatc aggcctacga
     3841 gctgcgcgct caggtcggca gctactatca gcaacaactg ggccaggagc cgctgcagct
     3901 gcccgtcctg acgctgcaca cggctgcggc cacgcccagt gcgccacgaa atttcagcgg
     3961 acgggcactg agtcccagcg aactggagct cagctggctg gcgccgctgg agctgcgcag
     4021 cgcgagtgtc tattatacgc tgcactggca gctgcagctg gaggataccg aggagcagtc
     4081 acaggagcag cccgcccagg agcagcgcgt ggagacggcg ggcgtgcagc ggctgacggg
     4141 tctgcagccc gcacgactct atcaggtctg gctgcaggcg catgccacgc ccagcaaata
     4201 caatagcagc gggcggctgc tgatccgcag ctatgcgccg ctgccgccgc tgcagctgat
     4261 cgagctgaac gcgtacggaa tgacactcgc ctggccgggc acaccggatg cgctcagttc
     4321 gctcaccctg gaatgtcagt cgctgcgcga acagctccag ttcaatgtgg ccggaaatca
     4381 tacacaaatg cggctggcac cgctgcagcc gaagacgcgc tactcgtgcc gcctggccct
     4441 ggcgtatgcc gccacgccgg gcgcacccat ttactttggt cccagtcatg agtacgaaac
     4501 gctgggtgat gcgcccagcg cgccgggtcg gccgcagctg gagcatatag ccggcgaaat
     4561 ctttcgcgtc agctggacgc cagctctgga caacggttca cccattttgc tatacaatct
     4621 ggaggctctg caggcccggc gcaccaatcg tcgtcggcgg cggcgccgag agactacgct
     4681 ttccctattg ccctgggccg aggagccgct ggtcatcgag gatcagtggc tggacttttg
     4741 taataccacc gagctgagtt gcattgtgcg cgaattgcat acccgtcgcc tgctgctgtt
     4801 tcgcgtccgt gcccgcaatc ggccgcacgg ctggggaccc tacagcgagg acagtgaacg
     4861 catcgctgag ccctttgtct cgcccgagaa gcgcggctcc ctcgtcttgg ccatcattgc
     4921 gcccgctgcc attgtttcca gctgtgtgct cgccttggtg ctcgtgcgca aatgtaagtt
     4981 caaaatgaat atttttccgc ccgacaatca gtcaatcatt caactagtcg tcggtcagtt
     5041 tgtcaatctt tcagtcgaag aatcaatcga taagaaaatg agtggctaaa taagtaagtt
     5101 agtcaatcaa atagtctgtc tttttgtcaa tcagtcgatc aatcaatcgc tcaatccatc
     5161 tggtattgcc attcagccaa ttaaccagtc aatccagcaa tcattcaatt agtagtccgt
     5221 aggttatcag tcaactatga taaccaatca atccgtcagc ttgataattt gtcgtttaac
     5281 tagtcaagaa atcaatgtta atcaagtaat tggccaatta atcagtcagt tcagcagact
     5341 gtctagtata actgtgaatt gaaggttcat gaatctgtca gtcaaacggc caatcaaatc
     5401 aatcatttgg aaaatttttg atcacccaat ctcaaagtaa ttaagtgagg caagcagtca
     5461 attaggcaga taaacagtta atcaggatgt cagtcaatca atcaatcata aatagtctgc
     5521 cacctaatcc atcatgattc aatcagtcaa tcagccaata ctcacgtagt caattgatat
     5581 tccggacaat cagtcagcca gtctgaaatt aatcagtaaa tcaatcgaat atgagtcagt
     5641 caatcaatca attaacacgt aatcaatcaa tcaatcagtc aatggctcat taaatcaata
     5701 agcagatcag aaaaatgttt aatcattcac tcaatcagtc agcctaatca atcatgaccc
     5761 tttcattcaa tcaatcaatc aatcagtcag tcacatacac aagactcgct aattagtaaa
     5821 tctgcgattt tgcagtgcaa aagcgtcgtc atcgcgccaa aaagctgctg cagcagagcc
     5881 gcccaagcat ctggagcaat ctttcggctc tgcagacgca gcagcagctg ctggccgcac
     5941 ggagtcgcac cttctcgatg tcgctgagcg atgcggacat tgcgctgctg ccgcagatca
     6001 actggaaccg gctgacgctg ttgcgtttcc tgggcagcgg cgcctttggc gaggtgtacg
     6061 agggtcagct gcaggccgag gacgaggcgc agccgcagcg tgttgccatc aaggtgagtg
     6121 gagcagcagt tttttttttt gccacaattc aaggattctc ctcgccttta cagagtctgc
     6181 gcaagggcgc cagcgagttc gcggagctgc tgcaggaagc gcagctgatg agcaatttca
     6241 agcacgaaaa catcgtttgc ctgattggca tctgctgtga cacggactcc atatcgctga
     6301 ttatggagca catggaggcc ggcgatttgc tcagctattt gcgtgcggca aggcccagtt
     6361 cacaggtgag cgagcaagca ttaatattta ttatacgctg tagccattgc aaatgggtaa
     6421 agggtaaact gtttttgtgc atatgcatgt gacgcacagt atgtccatgc ctttaatctt
     6481 gatcaggacg cactgcgttc gtctaggcag gttcctcggt ctctcctttc gttttccaaa
     6541 tatcgtaaaa tatcgaaaca aggttttttt tatttacttt atttaattta tttgttaatg
     6601 gtcgacaaac acgagtgtct ttctaattat ttaatacatc aaatcatgaa aattaaatac
     6661 taaaaaccta aaagcatcac cgactagata tataaagcgt taagtagata ggataaagag
     6721 ataccatatg gatatattta gagctcatag cgggaaatag tcagtgagtt cccgaaaata
     6781 tatgtacatg ttgaagatta taaatgctag atacaccaca atatggatga ggattaagcg
     6841 cctttttgga tgcctccaat caaaactatt ccaaaatcaa aaggatttag aggttccgca
     6901 ttcagctttt gtatgtacca acattttgat gccatattta cacgctgaaa gttttacaaa
     6961 gatcggacaa taaacaccaa agttcctcga gaatgccata actgaaaatg tggaaccaaa
     7021 gaaaagtaga cagaagagag acaaatgtga gtctgttgtg gcagccggat acaggatatc
     7081 tctagtcgag cacttgaatt gagcactctg atgccatatt tatacgctga aagtttcatt
     7141 atgatcggac cataaacacc gaagatcctc gagaattcca tatctgaaaa tgtggaatca
     7201 agcaatagta gagattcagc tagaatgtgt gtgtggctac agggaatatt ctagtcgagc
     7261 actcttatgt atggttcgtt tgcaggaagc gctatccaag ctgcagctgc cggagctgtt
     7321 gagcatgtgc ctggacgtgg ccaatggctg cagctatatg gaggatatgc actttgtgca
     7381 tcgcgatctc gcctgccgca attgtctggt gtcggatgga gccgcaattg ggggacggcg
     7441 cattgttaaa attggcgact ttgggctggc acgcgacatc tacaagagcg actattaccg
     7501 gaaggagggc gagggactgc tgcccgtgcg ctggatggcg ctcgagagcc tggtggatgg
     7561 cctatttagc acacagtcag atgtgtgggc cttcggtgtg ctctgctggg agatattcac
     7621 gctcggccag caaccgtatg cggcgcgcaa caacttcgag gtgctcgccc atgtcaagga
     7681 gggcggacgg ctgcagcagc cggagcgctg tcccgagaag ctgtaagtgg gccagctctt
     7741 ccagttaact tctctctact caatcactgt cccatttatg ctgtccacag ttatgcgctg
     7801 ctcttgcaat gctggcgctc tgagccctgg gagagaccca gctttaagcg ctgcctcagc
     7861 acgctgcagg ccctcagctc ggatttgcgg cgcactgaaa tgctggccac ggatgagacg
     7921 ccgttggttt ctgcattgtg cgctttcaag ccggacgcca aggtgcgttt cgatgatgcg
     7981 ccgcagcgtc ttacgctgca cttggacgcc aaagatacgg ttagcacaac cgatgccgat
     8041 acgacaggaa gcccaaccac acccaccgca ccaaccacgc ccaccacgac caccagcaca
     8101 atcgcagtcg taagcacagc tccatcgtcc gaaaatggcc agctctatgc aaatgagggc
     8161 atttcgggcc tttgaaatcc cctttcattt ggatctgccg tcgcaagttg cggcgccaca
     8221 aattgatgaa cattcgcatt cgcatgtttg ggcgcactca accatgtaac gcccacttgc
     8281 cgaaacaccg cccacacgcg catacacgat tgtgggctta gaattatggt ttctatttaa
     8341 tcgatttact attctgtact gaatccaaaa gctaaacaag accaaaagat tgacctcaac
     8401 aagtcctaca tgtaaatcaa atggttctgt ttagtttttc ttttttgttt ttgttgtttt
     8461 tttttttttt ttggtatacc caacaagacg agtatgctta ttcgaatata caagtagagc
     8521 ctaaagtcga atcaataaat atcatgatat ttttgtagaa atagccttta tcattaaatt
     8581 attatatttt gcctcaatca ggctgattga cggttgatga tcaatcaaat ataaggtatg
     8641 cgctactcga aaatattata aataaatgaa ctcaacagta aatgaagata ttcttgtaaa
     8701 accaggctgc atgttttagt ctgcctgatt gattagctaa cggactgact gatcatcagt
     8761 caatcttttc gagtatgcg
//
LOCUS       ECOLAMBAA     186 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  E.coli lambda receptor (lamB) gene, 5' end.
ACCESSION   M24997
KEYWORDS    lamB; lambda receptor.
SOURCE      E.coli (K-12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 186)
  AUTHORS   Clement,J.-M., Hedgpeth,J., Hofnung,M. and Jacob,F.
  TITLE     Sequence de l'ADN correspondant a la region du peptide signal pour
            le recepteur de lambda chez E. coli K12
  JOURNAL   C. R. Acad. Sci., D, Sci. Nat. 289, 1033-1036 (1979)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         82  >   186     lambda receptor precursor
    sigp         82      156     lambda receptor signal peptide
    matp        157  >   186     lambda receptor
BASE COUNT       51 a     46 c     48 g     41 t
ORIGIN      
        1 ccggcgttta agcaccccac aaaacacaca aagctgtcac aggtgatgtg aaaaaagaaa
       61 agcaatgact caggagatag aatgatgatt actctgcgca aacttcctct ggcggttgcc
      121 gtcgcagcgg gcgtaatgtc tgctcaggca atggctgttg atttccacgg ctatgcacgt
      181 tccggt
//
LOCUS       HUMCRPR      1797 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human cysteine-rich peptide mRNA, complete cds.
ACCESSION   M33146
KEYWORDS    cysteine-rich protein.
SOURCE      Human normal term placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1797)
  AUTHORS   Liebhaber,S.A., Emery,J.G., Urbanek,M., Wang,X. and Cooke,N.E.
  TITLE     Characterization of a human cDNA encoding a widely expressed and
            highly conserved cysteine-rich protein with an unusual zinc-finger
            motif
  JOURNAL   Nucleic Acids Res. 18, 3871-3879 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.A.Liebhaber, 22-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept         36      617     cysteine-rich protein
BASE COUNT      405 a    507 c    493 g    392 t
ORIGIN      
        1 cctgccgccc ctgcgccgcc gagccagctg ccagaatgcc gaactgggga ggaggcaaga
       61 aatgtggggt gtgtcagaag acggtttact ttgccgaaga ggttcagtgc gaaggcaaca
      121 gcttccataa atcctgcttc ctgtgcatgg tctgcaagaa gaatctggac agtaccactg
      181 tggccgtgca tggtgaggag atttactgca agtcctgcta cggcaagaag tatgggccca
      241 aaggctatgg ctacgggcag ggcgcaggca ccctcagcac tgacaagggg gagtcgctgg
      301 gtatcaagca cgaggaagcc cctggccaca ggcccaccac caaccccaat gcatccaaat
      361 ttgcccagaa gattggtggc tccgagcgct gcccccgatg cagccaggca gtctatgctg
      421 cggagaaggt gattggtgct gggaagtcct ggcataaggc ctgctttcga tgtgccaagt
      481 gtggcaaagg ccttgagtca accaccctgg cagacaagga tggcgagatt tactgcaaag
      541 gatgttatgc taaaaacttc gggcccaagg gctttggttt tgggcaagga gctggggcct
      601 tggtccactc tgagtgaggc caccatcacc caccacaccc tgcccactcc tgcgcttttc
      661 atcgccattc cattcccagc agctttggag acctccagga ttatttctct gtcagccctg
      721 ccacatatca ctaatgactt gaacttgggc atctggctcc ctttggtttg ggggtctgcc
      781 tgaggtccca ccccactaaa gggctcccca ggcctgggat ctgacaccat caccagtagg
      841 agacctcagt gttttgggtc taggtgagag caggcccctc tccccacacc tcgccccaca
      901 gagctctgtt cttagcctcc tgtgctgcgt gtccatcatc agctgaccaa gacacctgag
      961 gacacatctt ggcacccaga ggagcagcag caacaggctg gagggagagg gaagcaagac
     1021 caagatgagg aggggggaag gctgggtttt ttggatctca gagattctcc tctgtgggaa
     1081 agaggttgag cttcctggtg tccctcagag taagcctgag gagtcccagc ttagggagtc
     1141 actattggag gcagagaggc atgcaggcgg ggtcctagga gcccctgctt ctccaggcct
     1201 cttgcctttg agtctttgtg gaatggatag cctcccacta ggactgggag gagaataacc
     1261 caggtcttaa ggaccccaaa gtcaggatgt tgtttgatct tctcaaacat ctagttccct
     1321 gcttgatggg aggatcctaa tgaaatacct gaaacatata ttggcattta tcaatggctc
     1381 aaatcttcat ttatctctgg ccttaaccct ggctcctgag gctgcggcca gcagagccca
     1441 ggccagggct ctgttcttgc cacacctgct tgatcctcag atgtggaggg aggtaggcac
     1501 tgcctcagtc ttcatccaaa cacctttccc tttgccctga gacctcagaa tcttcccttt
     1561 aacccaagac cctgcctctt ccactccacc cttctccagg gacccttaga tcatcactcc
     1621 acccctgcca ggccccaggt taggaatagt ggtgggagga aggggaaagg gctgggcctc
     1681 accgctccca gcaactgaaa ggacaacact atctggagcc acccactgaa agggctgcag
     1741 gcatgggctg tacccaagct gatttctcat ctggtcaata aagctgttta gaccaga
//
LOCUS       HUMCSPBB      910 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human cytotoxin serine protease-C mRNA, complete cds.
ACCESSION   M36118
KEYWORDS    cytotoxin serine protease-C.
SOURCE      Human lymphokine activated cell, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 910)
  AUTHORS   Klein,J.L., Selvakumar,A., Trapani,J.A. and Dupont,B.
  TITLE     Characterization of a novel human cytotoxic lymphocyte specific
            serine protease cDNA clone (CSP-C)
  JOURNAL   Tissue Antigens (1900) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Klein, 06-JUL-1990.
FEATURES       from  to/span     description
    pept         53      793     cytotoxin serine protease-C precursor
    sigp         53      157     cytotoxin serine protease-C signal peptide
    matp        158      790     cytotoxin serine protease-C
BASE COUNT      231 a    245 c    250 g    184 t
ORIGIN      
        1 ttcgatatcg agctcctcca acagctctga cctgggcagc cttcctgaga aaatgcagcc
       61 attcctcctc ctgttggcct ttcttctgac ccctggggct gggacagagg agatcatcgg
      121 gggccatgag gccaagcccc actcccgccc ctacatggcc tttgttcagt ttctgcaaga
      181 gaagagtcgg aagaggtgtg gcggcatcct agtgagaaag gactttgtgc tgacagctgc
      241 tcactgccag ggaagctcca taaatgtcac cttgggggcc cacaatatca aggaacagga
      301 gcggacccag cagtttatcc ctgtgaaaag acccatcccc catccagcct ataatcctaa
      361 gaacttctcc aacgacatca tgctactgca gctggagaga aaggccaagt ggaccacagc
      421 tgtgcggcct ctcaggctac ctagcagcaa ggcccaggtg aagccagggc agctgtgcag
      481 tgtggctggc tggggttatg tctcaatgag cactttagca accacactgc aggaagtgtt
      541 gctgacagtg cagaaggact gccagtgtga acgtctcttc catggcaatt acagcagagc
      601 cactgagatt tgtgtggggg atccaaagaa gacacagacc ggtttcaagg gggactccgg
      661 ggggcccctc gtgtgtaagg acgtagccca aggtattctc tcctatggaa ataaaaaagg
      721 gacacctcca ggagtctaca tcaaggtctc acacttcctg ccctggataa agagaacaat
      781 gaagcgcctc taacagcagg catgagacta accttcctct gggcctgacc atctctggga
      841 cagaggcaag aatccccaag ggtgggagtc ggggttgcag gactgtaata aatggatcga
      901 gcacgatatc
//
LOCUS       HUMFKBP      1532 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human FK506-binding protein (FKBP) mRNA, complete cds.
ACCESSION   M34539
KEYWORDS    FK506-binding protein;
            immunosuppressive agent FK506-binding protein;
            peptidylprolyl cis-trans isomerase.
SOURCE      Human peripheral blood T-cell, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1532)
  AUTHORS   Maki,N., Sekiguchi,F., Nishimaki,J., Miwa,K., Hayano,T.,
            Takahashi,N. and Suzuki,M.
  TITLE     Complementary DNA encoding the human T-cell FK506-binding protein,
            a peptidylprolyl cis-trans isomerase distinct from cyclophilin
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5440-5443 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            N.Takahasi, 5-16-1990, for release after publication.
FEATURES       from  to/span     description
    pept         79      405     FK506-binding protein (FKBP)
    signal     1507     1512     Poly-A signal
BASE COUNT      341 a    386 c    339 g    466 t
ORIGIN      
        1 gaattcgggc cgccgccagg tcgctgttgg tccacgccgc ccgtcgcgcc gcccgcccgc
       61 tcagcgtccg ccgccgccat gggagtgcag gtggaaacca tctccccagg agacgggcgc
      121 accttcccca agcgcggcca gacctgcgtg gtgcactaca ccgggatgct tgaagatgga
      181 aagaaatttg attcctcccg ggacagaaac aagcccttta agtttatgct aggcaagcag
      241 gaggtgatcc gaggctggga agaaggggtt gcccagatga gtgtgggtca gagagccaaa
      301 ctgactatat ctccagatta tgcctatggt gccactgggc acccaggcat catcccacca
      361 catgccactc tcgtcttcga tgtggagctt ctaaaactgg aatgacagga atggcctcct
      421 cccttagctc cctgttcttg gatctgccat ggagggatct ggtgcctcca gacatgtgca
      481 catgagtcca tatggagctt ttcctgatgt tccactccac tttgtataga catctgccct
      541 gactgaatgt gttctgtcac tcagctttgc ttccgacacc tctgtttcct cttccccttt
      601 ctcctcgtat gtgtgtttac ctaaactata tgccataaac ctcaagttat tcattttatt
      661 ttgttttcat tttggggtga agattcagtt tcagtctttt ggatataggt ttccaattaa
      721 gtacatggtc aagtattaac agcacaagtg gtaggttaac attagaatag gaattggtgt
      781 tggggggggg gtttgcaaga atattttatt ttaatttttt ggatgaaatt tttatctatt
      841 atatattaaa cattcttgct gctgcgctgc aaagccatag cagatttgag gcgctgttga
      901 ggactgaatt actctccaag ttgagagatg tctttgggtt aaattaaaag ccctacctaa
      961 aactgaggtg gggatgggga gagcctttgc ctccaccatt cccacccacc ctccccttaa
     1021 accctctgcc tttgaaagta gatcatgttc actgcaatgc tggacactac aggtatctgt
     1081 ccctgggcca gcagggacct ctgaagcctt ctttgtggcc tttttttttt ttcatcctgt
     1141 ggtttttcta atggactttc aggaattttg taatctcata actttccaag ctccaccact
     1201 tcctaaatct taagaacttt aattgacagt ttcaattgaa ggtgctgttt gtagacttaa
     1261 cacccagtga aagcccagcc atcatgacaa atccttgaat gttctcttaa gaaaatgatg
     1321 ctggtcatcg cagcttcagc atctcctgtt ttttgatgct tggctccctc tgctgatctc
     1381 agtttcctgg cttttcctcc ctcagcccct tctcacccct ttgctgtcct gtgtagtgat
     1441 ttggtgagaa atcgttgctg cacccttccc ccagcaccat ttatgagtct caagttttat
     1501 tattgcaata aaagtgcttt atgcccgaat tc
//
LOCUS       HUMINTA2      272 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human interferon alpha-2' gene, partial cds.
ACCESSION   M28028
KEYWORDS    interferon alpha-2'.
SOURCE      Human leukocytes DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 272)
  AUTHORS   Ovchinnikov,Y.A., Sverdlov,E.D., Monastyrskaya,G.S., Tsarev,S.A.,
            Zaitseva,E.M., Arsenyan,S.G., Chakhmakhcheva,O.G.,
            Novokhatskii,A.S., Aspetov,R.D. and Kuznetsov,V.P.
  TITLE     Expression of mutant human interferon alpha-2 in Escherichia coli
            cells
  JOURNAL   Mol. Biol. 18, 36-45 (1984)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1  >   272     interferon alpha-2' (AA at 2)
BASE COUNT       63 a     76 c     66 g     67 t
ORIGIN      
        1 actggtggcc ctcctggtgc tcagctgcaa gtcaagctgc tctgtgggct gtgatctgcc
       61 tcaaacccac aacctgggta gcaggaggac cttgatgctc ctggcacaga tgaggagaat
      121 ctctcttttc tcctgcttga aggacagaca tgactttgga tttccccagg aggagtttgg
      181 caaccagttc caaaaggctg aaaccatccc tgtcctccat gagatgatcc agcagatctt
      241 caatctcttc agcacaaagg actcatctgc tg
//
LOCUS       HUMMLC1AA     836 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human embryonic myosin alkali light chain (MLC1) mRNA, complete
            cds.
ACCESSION   M36172
KEYWORDS    embryonic myosin alkali light chain.
SOURCE      Human embryo, cDNA to mRNA, clone GT14.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 836)
  AUTHORS   Arnold,H.-H., Lohse,P., Seidel,U. and Bober,E.
  TITLE     A novel human myosin alkali light chain is developmentally
            regulated: Expression in fetal cardiac and skeletal muscle and in
            adult atria
  JOURNAL   Eur. J. Biochem. 178, 53-60 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         57      650     embryonic myosin alkali light chain (MLC1)
BASE COUNT      199 a    238 c    232 g    167 t
ORIGIN      
        1 cagtctctcg gtttcttctc agatcactcc tctgccaaag atcccaacaa gacaacatgg
       61 ctcccaagaa gcctgagcct aagaaggagg cagccaagcc agctccagct ccagctccag
      121 cccctgcacc agcccctgcc ccagctcctg aggctcccaa ggaacctgcc tttgacccca
      181 agagtgtaaa gatagacttc actgccgacc agattgaaga gttcaaagag gccttttcat
      241 tgtttgaccg gaccccgact ggagagatga agatcaccta cggccagtgc ggggatgtac
      301 tgcgggccct gggccagaac cctaccaatg ccgaggtgct gcgtgtgctg ggcaagccca
      361 agcctgaaga gatgaatgtc aagatgctgg actttgagac gttcttgccc atcctgcagc
      421 acatttcccg caacaaggag cagggcacct atgaggactt cgtggagggc ctgcgtgtct
      481 ttgacaagga gagcaatggc acggtcatgg gtgctgagct tcggcacgtc cttgccaccc
      541 tgggagagaa gatgactgag gctgaagtgg agcagctgtt agctgggcaa gaggatgcca
      601 atggctgcat caattatgaa gcctttgtca agcacatcat gtcagggtga agcagagtct
      661 tccaggtgcc tggcccttgg ctttagccat accagggtga gttaaagaga ggccccggct
      721 gggtgagctg agatggagtc ctcgacttat caccacacca ctgccccaag gaccttacag
      781 gccctccctg ttaataaaca gctctaacac ggccaggctg ggctctggga ttctga
//
LOCUS       MUSAIC2B     3113 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse interleukin 3 receptor-like protein (AIC2B) mRNA, complete
            cds.
ACCESSION   M34397
KEYWORDS    IL-3 receptor; interleukin 3 receptor.
SOURCE      Mouse, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3113)
  AUTHORS   Gorman,D.M., Itoh,N., Kitamura,T., Schreurs,J., Yonehara,S.,
            Yahara,I., Arai K,-I. and Miyajima,A.
  TITLE     Cloning and expression of a novel interleukin 3 receptor-like gene:
            Identification of another member of the cytokine receptor gene
            family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5459-5463 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.M.Gorman, 22-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept        288     2978     interleukin 3 receptor-like protein (AIC2B)
                                 precursor
    sigp        288      353     interleukin 3 receptor-like protein signal
                                 peptide
    matp        354     2975     interleukin 3 receptor-like protein signal
                                 peptide
BASE COUNT      727 a    955 c    806 g    625 t
ORIGIN      
        1 gagctgactg ggagatggtg gggagcgagc tactctggca gaactaaatg tcatggggca
       61 cagataaata ggaagagcct gcaactcact ggcacttgga ggctcccgaa ggaggctgcc
      121 tgtcgcccaa gcacagagcc acaaaggatg cagtctagga gggaagaatc acaagccctg
      181 taagatgagt ggagccaaac ccccagccaa gcaccaatac agaaccccgg gacaatgagg
      241 acacccccct gcccatagct tccagtgcag ccaccaaaag tgccaaaatg gaccagcaaa
      301 tggcactcac atgggggctg tgctacatgg cactggtggc tctctgttgg ggacacgggg
      361 tgacagaggc agaagaaacg gtccctctga agactctgca gtgctacaat gactacacca
      421 accacatcat ctgcagctgg gcggacacag aggatgccca ggggctaatc aacatgaccc
      481 tctatcacca gctagagaaa aaacagccag tgtcctgtga gctcagtgag aaactcatgt
      541 ggtcagagtg cccgtcatcc caccgctgtg tgcccagaag atgtgtcatc ccctatacac
      601 gattttccat cacaaacgaa gactactact ccttccggcc agatagtgat ctgggcatcc
      661 agctcatggt gccacttgcc cagaatgtgc agccaccact tcccaagaac gtcagcatca
      721 gctcctctga ggatcgtttc ctgctggagt ggagtgtgtc ccttggggat gcccaggtct
      781 cctggctttc atcaaaggac atagagtttg aggtggctta taagcggctt caggactcct
      841 gggaggatgc ctacagtctc cacactagca aatttcaggt gaatttcgag ccaaagctat
      901 tcctacccaa cagcatctat gcgccccgtg tgcgcactcg gctgtacccg ggttcaagct
      961 tgtctgggag acccagcaga tggagcccag aggctcactg ggactcccag ccaggggaca
     1021 aggcccagcc acagaacctt caatgcttct ttgatgggat ccagtccctc cactgctcct
     1081 gggaggtgtg gacccagacg actggctctg tttcctttgg gctcttctat cgccccagcc
     1141 ctgtagctcc ggaggagaaa tgctctccgg tggtgaagga gccgccgggg gccagtgtct
     1201 acacccgcta ccattgcagt ctacctgtgc ctgagcccag tgcacacagc cagtacacag
     1261 tctctgttaa gcacctggaa caagggaagt tcatcatgag ctataaccac atccagatgg
     1321 agcctccaac cctcaacctg accaagaaca gagacagcta cagcctgcat tgggaaactc
     1381 agaagatggc ttactcattc attgagcaca cattccaggt ccagtacaag aagaaatcgg
     1441 acagctggga ggacagcaag acagagaacc tagatcgagc ccatagcatg gacctctccc
     1501 agctggagcc agacacctca tactgcgcca gggtgagggt caagcccatc tctaactacg
     1561 atgggatctg gagcaagtgg agcgaagagt acacttggaa gactgactgg gtgatgccca
     1621 cgctgtggat agtcctcatc ctggtctttc tcatcctcac cttgctcctg atccttcgct
     1681 ttggctgtgt ctctgtatac aggacgtaca ggaagtggaa ggaaaagatc cccaacccca
     1741 gcaagagcct cctgttccag gatggaggta aaggtctctg gcctcctggc agcatggcag
     1801 ccttcgccac taagaacccc gctctccagg ggccacagag caggcttctt gctgagcaac
     1861 agggggagtc atatgcacat ttggaagaca acaacgtgtc acctctcact atagaggacc
     1921 ctaatataat tcgagttcca ccatccgggc ctgatacaac cccagctgcc tcatccgaat
     1981 ccacagagca acttcccaat gttcaagtag agggaccaac tcctaacaga cctaggaagc
     2041 aattacccag ctttgacttc aatgggccct acctggggcc tccccaatcc cactctctgc
     2101 ctgatctccc agaccagctg ggttcccccc aggtgggtgg gagcctgaag ccagcactgc
     2161 caggctcctt ggagtacatg tgtctggccc ctggaggtca agtgcaactg gttccattgt
     2221 cccaggtgat ggggcagggc caggctatgg atgtgcagtg tgggtccagc ctggagacct
     2281 cagggagccc ttctgtggag ccaaaggaga accctccagt tgagctgagc atggaggaac
     2341 aggaggcacg ggacaaccca gtgactctgc ccataagctc tgggggccct gagggcagta
     2401 tgatggcctc tgattatgtc actcctggag atccggtgct cactctgccc acagggcccc
     2461 tgtctacctc tctgggcccc tctctagggt tgccctcagc ccaaagcccc agtctctgtc
     2521 ttaagctgcc cagggtcccc tctggaagcc cagctctagg gccaccaggg tttgaggact
     2581 atgtggagct gcctccaagt gtgagccagg ctgccaagtc ccctccaggc catcctgctc
     2641 ctcctgtggc aagcagcccc acagtgatcc caggagagcc cagggaggaa gtgggcccag
     2701 catccccaca tcccgaaggc ctccttgttc ttcagcaggt tggggactac tgcttcctcc
     2761 ctggcctggg acctggctcc ctctcaccac acagtaagcc accctctcca agtctgtgtt
     2821 ctgagactga ggacctagtc caggacttgt ctgtcaaaaa gtttccctat cagcccatgc
     2881 cccaggcgcc agccattcag tttttcaagt ccctaaagca tcaggactac ctgtccctgc
     2941 ccccttggga caatagccag tctgggaagg tgtgctgagt ctgtctcctc ccaatctcac
     3001 cagcagcctg gcaccgcagc ctgtggtcct cagcctgagc atcaccacag aagcctctct
     3061 gagttcacac tcctccttgc tcccagccct gacatggcaa tacccccacc tgt
//
LOCUS       MUSIGHADP     363 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION   M36209
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-6.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 363)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   363     Ig heavy-chain V region (AA at 1)
BASE COUNT       88 a     89 c     98 g     88 t
ORIGIN      
        1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
       61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
      121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
      241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagaaatcgc
      301 tactatggta actactggta cttcgatgtc tggggcgcag ggaccacggt caccgtctcc
      361 tca
//
LOCUS       MUSIGHADQ     357 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION   M36210
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H146-24B1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 357)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   357     Ig heavy-chain V region (AA at 1)
BASE COUNT       88 a     83 c     99 g     87 t
ORIGIN      
        1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
       61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
      121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
      241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagaggaaac
      301 tataggtacg acgagtttgc ttactggggc caagggactc tggtcactgt ctctgca
//
LOCUS       MUSIGHADR     336 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION   M36211
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H147-25H1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 336)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   336     Ig heavy-chain V region (AA at 1)
BASE COUNT       81 a     77 c     94 g     84 t
ORIGIN      
        1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
       61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
      121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
      241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatggggc
      301 tactatggta acctactggt acttcgatgt ctgggg
//
LOCUS       MUSIGHADS     354 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H163-130F2.
ACCESSION   M36212
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130F2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   354     Ig heavy-chain V region (AA at 1)
BASE COUNT       86 a     83 c     97 g     88 t
ORIGIN      
        1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
       61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
      121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
      241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatactat
      301 aggtacgaca ggtttgctta ctggggccaa gggactctgg tcactgtctc tgca
//
LOCUS       MUSIGHADT     357 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H163-130H9.
ACCESSION   M36213
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130H9.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 357)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   357     Ig heavy-chain V region (AA at 1)
BASE COUNT       86 a     91 c    102 g     78 t
ORIGIN      
        1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg
       61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg
      121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac
      241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aaggggctac
      301 ggtagccact ggtacttcga tgtctggggc gcagggacca cggtcaccgt ctcctca
//
LOCUS       MUSIGHADU     331 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H2-4C2.
ACCESSION   M36214
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H2-4C2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 331)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   331     Ig heavy-chain V region (AA at 2)
BASE COUNT       94 a     79 c     83 g     75 t
ORIGIN      
        1 acctgagctg aagaagcctg gagagacagt caagatctcc tgcaaggctt ctggatatac
       61 cttcacaaac tatggaatga attgggtgaa gcaggctcca ggaaagggtt taaagtggat
      121 gggctggata aacacctaca ctggagagcc aacatatgct gatgacttca agggacggtt
      181 tgccttccct ttggaaacct ctgccagcac tgcctatttg cagatcaaca acctcaaaaa
      241 tgaggacacg gctacatatt tctgtgcaag atatgactac aactactggt acttcgatgt
      301 ctggggcgca gggaccacgg tcaccgtctc c
//
LOCUS       MUSIGHADV     354 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-1VH.
ACCESSION   M36215
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   354     Ig heavy-chain V region (AA at 1)
BASE COUNT       88 a     86 c     99 g     81 t
ORIGIN      
        1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg
       61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg
      121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac
      241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aagtcagggg
      301 tatggttacg ttgctatgga ctactggggt caaggaacct cagtcaccgt ctcc
//
LOCUS       MUSIGHADW     360 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-10VH.
ACCESSION   M36216
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   360     Ig heavy-chain V region (AA at 1)
BASE COUNT       89 a     90 c     92 g     89 t
ORIGIN      
        1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
       61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
      121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
      241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatcttac
      301 tacggtagta gctactactt tgactactgg ggccaaggca ccactctcac agtctcctca
//
LOCUS       MUSIGHADX     360 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-22VH.
ACCESSION   M36217
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-22.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   360     Ig heavy-chain V region (AA at 1)
BASE COUNT       91 a     89 c     96 g     84 t
ORIGIN      
        1 caggtgcagc tgaaggagtc aggacctggc ctggtggcgc cctcacagag cctgtccatc
       61 acttgcactg tctctgggtt ttcattaacc agctatggtg tacactgggt tcgccagcct
      121 ccaggaaagg gtctggagtg gctgggagta atatgggctg gtggaagcac aaattataat
      181 tcggctctca tgtccagact gagcatcagc aaagacaact ccaagagcca agttttctta
      241 aaaatgaaca gtctgcaaac tgatgacaca gccatgtact actgtgccag agggcactat
      301 ggttacgacg gttactatgc tatggactac tggggtcaag gaacctcagt caccgtctcc
//
LOCUS       MUSIGHADY     342 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-23VH.
ACCESSION   M36218
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-23.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 342)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   342     Ig heavy-chain V region (AA at 1)
BASE COUNT       89 a     83 c     92 g     78 t
ORIGIN      
        1 gaggtccagc ttcagcagtc aggacctgag ctggtgaaac ctggggcctc agtgaagata
       61 tcctgcaagg cttctggata cacattcact gactacaaca tgcactgggt gaagcagagc
      121 catggaaaga gccttgagtg gattggatat atttatcctt acaatggtgg tactggctac
      181 aaccagaagt tcaagagcaa ggcaacattg actgtagaca attcctccag cacagcctac
      241 atggagctcc gcagcctgac atctgaggac tctgcagtct attactgtgc aagagggggg
      301 ctacgccctt ttgcttactg gggccaaggg actctggtca ct
//
LOCUS       MUSIGHADZ     357 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-5VH.
ACCESSION   M36219
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 357)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   357     Ig heavy-chain V region (AA at 1)
BASE COUNT       87 a     88 c    100 g     82 t
ORIGIN      
        1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg
       61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg
      121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac
      241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aagatgccct
      301 tatggtaact acgtttggta cttcgatgtc tggggcgcag ggaccacggt caccgtc
//
LOCUS       MUSIGHAEA     366 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-7VH.
ACCESSION   M36220
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-7.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 366)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   366     Ig heavy-chain V region (AA at 1)
BASE COUNT       90 a     92 c     94 g     90 t
ORIGIN      
        1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagactc
       61 tcctgtgcaa cttctgggtt caccttcact gattactaca tgagctgggt ccgccagcct
      121 ccaggaaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca
      181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc
      241 ctctatcttc aaatgaacac cctgagagct gaggacagtg ccacttatta ctgtgcaaga
      301 gacccccctt actacggcta cgctatggac tactggggtc aaggaacctc agtcaccgtc
      361 tcctca
//
LOCUS       MUSIGHAEB     328 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H230-2VH.
ACCESSION   M36221
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 328)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   328     Ig heavy-chain V region (AA at 2)
BASE COUNT       87 a     86 c     80 g     75 t
ORIGIN      
        1 acctggcctg gtggcgccct cacagagcct gtccatcaca tgcactgtct cagggttctc
       61 attaaccagc tatggtgtaa gctgggttcg ccagcctcca ggaaagggtc tggagtggct
      121 gggagtaata tggggtgacg ggagcacaaa ttatcattca gctctcatat ccagactgag
      181 catcagcaag gataactcca agagccaagt tttcttaaaa ctgaacagtc tgcaaactga
      241 tgacacagcc acgtactact gtgccaaacc ttatgattac gaagggtact actttgacta
      301 ctggggccaa ggcaccactc tcacagtc
//
LOCUS       MUSIGHAEC     331 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H230-5VH.
ACCESSION   M36222
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 331)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   331     Ig heavy-chain V region (AA at 2)
BASE COUNT       74 a     88 c     86 g     83 t
ORIGIN      
        1 ccctgggata ttgcagccct cccagaccct cagtctgact tgttctttct ctgggttttc
       61 actgagcact tctggtatgg gtgtgagctg gattcgtcag ccttcaggaa agggtctgga
      121 gtggctggca cacatttact gggatgatga caagcactat aacccatccc tgaagagccg
      181 gctcacaatc tccaaggata cctccagcaa ccaggtattc ctcaagatca ccagtgtgga
      241 cactgcagat actgccacat actactgtgc tcgaagagat ggttacgacg aggggtttgc
      301 ttactggggc caagggactc tggtcactgt c
//
LOCUS       MUSIGHAED     334 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H250-6VH.
ACCESSION   M36223
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H250-6.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 334)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   334     Ig heavy-chain V region (AA at 2)
BASE COUNT       75 a     81 c     95 g     83 t
ORIGIN      
        1 gggaggctta gtgaagcctg gagggtccct gaaactctcc tgtgcagcct ctggattcac
       61 tttcagtagc tatgccatgt cttgggttcg ccagtctcca gagaagaggc tggagtgggt
      121 cgcagaaatt agtagtggtg gtagttacac ctactatcca gacactgtga cgggccgatt
      181 caccatctcc agagacaatg ccaagaacac cctgtacctg gaaatgagca gtctgaggtc
      241 tgaggacacg gccatgtatt actgtgcaag ggatctcgac tatggtaact cctggtttgc
      301 ttactggggc caagggactc tggtcactgt ctct
//
LOCUS       MUSIGHAEE     354 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H280-10VH.
ACCESSION   M36224
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   354     Ig heavy-chain V region (AA at 1)
BASE COUNT       88 a     90 c     90 g     86 t
ORIGIN      
        1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
       61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
      121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
      241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatctaag
      301 atcccgtact actttgacta ctggggccaa ggcaccactc tcacagtctc ctca
//
LOCUS       MUSIGHAEF     354 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H280-15VH.
ACCESSION   M36225
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   354     Ig heavy-chain V region (AA at 1)
BASE COUNT       95 a     88 c     89 g     82 t
ORIGIN      
        1 caggtccagc ttcagcagtc tggggctgaa ctggcaaaac ctggggcctc agtgaagatg
       61 tcctgcaagg cttctggcta cacctttact agctactgga tgcactgggt aaaacagagg
      121 cctggacagg gtctggaatg gattggatac attaatccta gcactggtta tactgagtac
      181 aatcagaagt tcaaggacaa ggccacattg actgcagaca aatcctccag cacagcctac
      241 atgcaactga gcagcctgac atctgaggac tctgcactgt attactgtgc aagatgggtc
      301 tattactatg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctca
//
LOCUS       MUSIGHAEG     360 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H35-C10VH.
ACCESSION   M36226
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 360)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   360     Ig heavy-chain V region (AA at 1)
BASE COUNT       84 a     87 c    100 g     89 t
ORIGIN      
        1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
       61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
      121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
      241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagagggcga
      301 actatgattc ccgcctggtt tgcttactgg ggccaaggga ctctggtcac tgtctctgca
//
LOCUS       MUSIGHAEH     345 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H35-C7VH.
ACCESSION   M36227
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C7.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 345)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   345     Ig heavy-chain V region (AA at 1)
BASE COUNT       85 a     78 c     96 g     83 t      3 others
ORIGIN      
        1 caggttcagc tgcagcagtc tggagctgaa ctggtaaagc ctggggcttc agtgaagttg
       61 tcctgcaagg cttctggcta caccttcaca agctatgata taaactgggt gaggcagagg
      121 cctgaacagg gacttgagtg gattggatgg atttttcctg gagatggtag tactaagtac
      181 aatgagaagt tcaagggcaa ggccacactg actacagaca aatcctccag cacagcctac
      241 atgcagctca gcaggctgac atctgaggac tctgctgtct atttctgtgc aaactggnac
      301 gggtttgctt actggnncca agggactctg gtcactgtct ctgca
//
LOCUS       MUSIGHAEI     354 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H61-15VH.
ACCESSION   M36228
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H61-15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 354)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   354     Ig heavy-chain V region (AA at 1)
BASE COUNT       90 a     89 c     93 g     82 t
ORIGIN      
        1 caggtgcagc tgaaggagtc aggacctggc ctggtggcgc cctcacagag cctgtccatc
       61 acttgcactg tctctgggtt ttcattaacc agctatggtg tacactgggt tcgccagcct
      121 ccaggaaagg gtctggagtg gctgggagta atatgggctg gtggaagcac aaattataat
      181 tcggctctca tgtccagact gagcatcagc aaagacaact ccaagagcca agttttctta
      241 aaaatgaaca gtctgcaaac tgatgacaca gccatgtact actgtgccag agggcattac
      301 tacggctact atgctatgga ctactggggt caaggaacct cagtcaccgt ctcc
//
LOCUS       MUSIGHAEJ     351 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H9-A15VH.
ACCESSION   M36229
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-A15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 351)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   351     Ig heavy-chain V region (AA at 1)
BASE COUNT       96 a     82 c     91 g     81 t      1 others
ORIGIN      
        1 gttcagctgc agcagtctgg agctgagctg atgaagcctg ggncctcagt gaagatatcc
       61 tgcaaggcta ctggctacag attcagtagc tactggatgg agtgggtaaa gcagaggcct
      121 ggacatggcc ttgagtggat tggagagctt ttacctggaa atagtaatac taattacaat
      181 gagaagttca agggcaaggc cacattcgct gcagatacat cctccaacac agcctacatg
      241 caactcagca gcctgacatc tgaggactct gccgtctatt actgtgcaag agatctgaaa
      301 aatggtgact cgttctactt tgactactgg ggccaaggca ccactctcac a
//
LOCUS       MUSIGHAEK     333 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H9-D3VH.
ACCESSION   M36230
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-D3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 333)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   333     Ig heavy-chain V region (AA at 1)
BASE COUNT       95 a     73 c     84 g     80 t      1 others
ORIGIN      
        1 cagatccagt tggtccagtc tgnacctgag ctgaagaagc ctggagagac agtcaagatc
       61 tcctgcaagg cttctgggta taccttcaca aactatggaa tgaactgggt gaagcaggct
      121 ccaggaaagg atttaaagtg gatgggctgg ataaacacct atactggaga gccaacattt
      181 gctgatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat
      241 ttgcagatca acaacctcaa aaatgaggac acggctacat atttctgtgc aagaagagaa
      301 aattcctact ggtatttcga tgtctggggc gca
//
LOCUS       MUSIGHAEL     334 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            IC5-3A4HV.
ACCESSION   M36231
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma IC5-3A4.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 334)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   334     Ig heavy-chain V region (AA at 2)
BASE COUNT       86 a     69 c     95 g     84 t
ORIGIN      
        1 gggagactta gtgaagcctg gagggtccct gaaactctcc tgtgcagcct ctggattcac
       61 tttcagtagt tatggcatgt cttggtttcg ccagattccg gagaagaggc tggagtgggt
      121 cgcaaccatt agtggtggtg gtagttatat ttactatgta gacagtgtga aggggcgatt
      181 caccatctcc agagacaatg ccaagaacaa cctgtacctg caaatgagca gtctgaggtc
      241 tgaggacacg gccttgtatt attgtgcaag agggaatata atataccacg aggactatgc
      301 tatggactac tggggtcaag gaacctcagt cacc
//
LOCUS       MUSIGHAEM     328 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            L11-2E2HV.
ACCESSION   M36232
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L11-2E2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 328)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   328     Ig heavy-chain V region (AA at 2)
BASE COUNT       85 a     78 c     96 g     69 t
ORIGIN      
        1 ggcagagctt gtgaagccag gggcctcagt caagttgtcc tgcacagctt ctggcttcaa
       61 cattaatgac acctatatgc actgggtgaa gcagaggcct gaacagggcc tggagtggat
      121 tggaaggatt gatcctgcga atggtaatac taaatatgac ccgaagttcc agggcaaggc
      181 cactataaca gcagacacat cctccaacac agcctacctg cagttcagca gcctgacatc
      241 tgaggacact gccgtctatt actgtgatag agaggactac ggtagtaggt acggggggtg
      301 gtacttcgat gtctggggcg cagggacc
//
LOCUS       MUSIGHAEN     331 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            L2-10C1HV.
ACCESSION   M36233
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-10C1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 331)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   331     Ig heavy-chain V region (AA at 2)
BASE COUNT       90 a     78 c     89 g     74 t
ORIGIN      
        1 acctgagctg aagaagcctg gagagacagt caagatctcc tgcaaggctt ctgggtatac
       61 cttcacaaac tatggaatga actgggtgaa gcaggctcca ggaaagggtt taaagtggat
      121 gggctggata aacacctaca ctggagagcc aacatatgct gatgacttca agggacggtt
      181 tgccttctct ttggaaacct ctgccagcac tgcctatttg cagatcaaca acctcaaaaa
      241 tgaggacacg gctacatatt tctgtgcaag atgggactac gggtactggt acttcgatgt
      301 ctggggcgca gggaccacgg tcaccgtctc c
//
LOCUS       MUSIGHAEO     328 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            L2-5D2HV.
ACCESSION   M36234
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-5D2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 328)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   328     Ig heavy-chain V region (AA at 2)
BASE COUNT       76 a     85 c     82 g     83 t      2 others
ORIGIN      
        1 ccctgggata ttgcagccct cccagaccct cagtctgact tgttctttct ctgggttttc
       61 actgagcact tctggtatgg gtgnnagctg gattcgtcag ccttcaggaa agggtctgga
      121 gtggctggca cacatttact gggatgatga caagcgctat aacccatccc tgaagagccg
      181 gctcacaatc tccaaggata cctccagaaa ccaggtattc ctcaagatca ccagtgtgga
      241 cactgcagat actgccacat actactgtgc tcggtcgtat ggtaatgggg attactatgc
      301 tatggactac tggggtcaag gaacctca
//
LOCUS       MUSIGHAEP     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H120-2HV.
ACCESSION   M36235
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H120-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       83 a     89 c     78 g     74 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgta cacgttcgga
      301 ggggggacca agctggaaat aaaa
//
LOCUS       MUSIGHAEQ     317 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H130-5VK.
ACCESSION   M36236
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 317)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   317     Ig heavy-chain V region (AA at 1)
BASE COUNT       91 a     76 c     71 g     79 t
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
      241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttccgtacac gttcggaggg
      301 gggaccaagc tggaaat
//
LOCUS       MUSIGHAER     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H130-6VK.
ACCESSION   M36237
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-6.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       78 a     92 c     78 g     76 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt
      301 gctgggacca agctggagct gaaa
//
LOCUS       MUSIGHAES     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H146-24B1VK.
ACCESSION   M36238
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H146-24B1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       83 a     90 c     75 g     76 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc
      301 tcggggacaa agttggaaat aaaa
//
LOCUS       MUSIGHAET     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H147-25H1VK.
ACCESSION   M36239
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H147-25H1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       84 a     89 c     77 g     74 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact aacgttcgga
      301 ggggggacca agctggaaat aaaa
//
LOCUS       MUSIGHAEU     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H163-130H9.
ACCESSION   M36240
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130-F2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       78 a     92 c     78 g     76 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt
      301 gctgggacca agctggagct gaaa
//
LOCUS       MUSIGHAEV     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H163-130H9.
ACCESSION   M36241
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130H9.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       78 a     92 c     78 g     76 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt
      301 gctgggacca agctggagct gaaa
//
LOCUS       MUSIGHAEW     321 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H2-4C2.
ACCESSION   M36242
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H2-4C2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   321     Ig heavy-chain V region (AA at 1)
BASE COUNT       93 a     79 c     69 g     80 t
ORIGIN      
        1 aatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactc ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
      241 gaagatattg ccacttacta ttgtcagcac tatagtgagc ttccgtggac gttcggtgga
      301 ggcaccaagc tggaaatcaa a
//
LOCUS       MUSIGHAEX     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-1.
ACCESSION   M36243
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       83 a     90 c     75 g     76 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc
      301 tcggggacaa agttggaaat aaaa
//
LOCUS       MUSIGHAEY     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-10.
ACCESSION   M36244
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       83 a     90 c     75 g     76 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc
      301 tcggggacaa agttggaaat aaaa
//
LOCUS       MUSIGHAEZ     303 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-22.
ACCESSION   M36245
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-22.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 303)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   303     Ig heavy-chain V region (AA at 1)
BASE COUNT       82 a     78 c     73 g     70 t
ORIGIN      
        1 tctcacaaat tcatgtccac atcagtagga gacagggtca gcatcacctg caaggccagt
       61 caggatgtgg gtactgctgt agcctggtat caacagaaac cagggcaatc tcctaaacta
      121 ctgatttact gggcatccac ccggcacact ggagtccctg atcgcttcac aggcagtgga
      181 tctgggacag atttcactct caccattagc aatgtgcagt ctgaagactt ggcagattat
      241 ttctgtcagc aatatagcag ctatcctcgg acgttcggtg gaggcaccaa gctggaaatc
      301 aaa
//
LOCUS       MUSIGHAFA     303 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-23.
ACCESSION   M36246
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-23.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 303)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   303     Ig heavy-chain V region (AA at 1)
BASE COUNT       78 a     71 c     70 g     83 t      1 others
ORIGIN      
        1 tctccatcct ccttatctgc ctctctggga gaaagagtca gtctcacttg tcgggcaagt
       61 caggacattg gtagtagctt aaactggctt cagcaggaac cagatggaac tattaaacgc
      121 ctgatctacg ccacatccag tttagattct ggtgtcccca aaaggttcag tggcagtagg
      181 tctgggtcag attattctct caccatcagc agccttgagt ctgaagattt tgtagactat
      241 tactgtctac aatatgctag ttctccgtac acgttcggag gggggaccaa gctgnaaata
      301 aaa
//
LOCUS       MUSIGHAFB     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-5.
ACCESSION   M36247
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       79 a     92 c     77 g     76 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt
      301 gctgggacca agctggagct gaaa
//
LOCUS       MUSIGHAFC     297 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H220-7.
ACCESSION   M36248
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-7.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 297)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   297     Ig heavy-chain V region (AA at 1)
BASE COUNT       78 a     72 c     73 g     69 t      5 others
ORIGIN      
        1 aaattcatgt ccacatcagt aggaganagg gtcagcatca cctgcaaggc cagtcaggat
       61 gtgagtactg ctgtagcctg gtatcaacag aaaccaggac aatctcctaa actactgatt
      121 tactcggcat cctaccggta cactggagtc cctgatcgct tcactggcag tggatctggg
      181 nnggatttca ctttcaccat cagcagtgtg caggctgaag acctggcagt ttattactgt
      241 cagcnacatn atagtactcc gtggacgttc ggtggaggca ccaagctgga aatcaaa
//
LOCUS       MUSIGHAFD     315 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H230-2.
ACCESSION   M36249
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 315)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   315     Ig heavy-chain V region (AA at 1)
BASE COUNT       83 a     80 c     75 g     77 t
ORIGIN      
        1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc
       61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
      121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat
      181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
      241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccattcac gttcggctcg
      301 gggacaaagt tggaa
//
LOCUS       MUSIGHAFE     303 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H230-5.
ACCESSION   M36250
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 303)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   303     Ig heavy-chain V region (AA at 1)
BASE COUNT       77 a     77 c     75 g     74 t
ORIGIN      
        1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc
       61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
      121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat
      181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
      241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccgtggac gttcggtgga
      301 ggc
//
LOCUS       MUSIGHAFF     318 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H250-6.
ACCESSION   M36251
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H250-6.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   318     Ig heavy-chain V region (AA at 1)
BASE COUNT       83 a     81 c     77 g     76 t      1 others
ORIGIN      
        1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc
       61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca
      121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat
      181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct
      241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccgtgnac gttcggtgga
      301 ggcaccaagc tggaaatc
//
LOCUS       MUSIGHAFG     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H280-10.
ACCESSION   M36252
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       79 a     92 c     77 g     76 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt
      301 gctgggacca agctggagct gaaa
//
LOCUS       MUSIGHAFH     312 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H280-15.
ACCESSION   M36253
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig heavy-chain V region (AA at 1)
BASE COUNT       72 a     88 c     79 g     73 t
ORIGIN      
        1 gaaaatgtgc tcacccagtc tccagcaatc atgtctgcat ctccagggga aaaggtcacc
       61 atgacctgca gggccagctc aagtgtaagt tccagttact tgcactggta ccagcagaag
      121 tcaggtgcct cccccaaact ctggatttat agcacatcca acttggcttc tggagtccct
      181 gctcgcttca gtggcagtgg gtctggcacc tcttactctc tcacaatcag cagtgtggag
      241 gctgaagatg ctgccactta ttactgccag cagtacagtg gttacccgta cacgttcgga
      301 ggggggacca ag
//
LOCUS       MUSIGHAFI     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H35-C10.
ACCESSION   M36254
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       79 a     92 c     77 g     76 t
ORIGIN      
        1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc
       61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag
      121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct
      181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag
      241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt
      301 gctgggacca agctggagct gaaa
//
LOCUS       MUSIGHAFJ     302 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H35-C7.
ACCESSION   M36255
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C7.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 302)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   302     Ig heavy-chain V region (AA at 1)
BASE COUNT       75 a     76 c     77 g     74 t
ORIGIN      
        1 gcttctttgg ctgtgtctct agggcagagg gccaccatat cctgcagagc cagtgaaagt
       61 gttgatagtt atggcaatag ttttatgcac tggtaccagc agaaaccagg acagccaccc
      121 aaactcctca tctatcgtgc atccaaccta gaatctggga tccctgccag gttcagtggc
      181 agtgggtcta ggacagactt caccctcaca attaatcctg tggaggctga tgatgttgca
      241 acctattact gtcagcaaag taatgaggat ccgctcacgt tcggtgctgg gaccaagctg
      301 ga
//
LOCUS       MUSIGHAFK     321 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H61-15.
ACCESSION   M36256
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H61-15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
BASE COUNT       95 a     75 c     78 g     71 t      2 others
ORIGIN      
        1 gacatcaaga tgacccagtc tccatcctcc atgtatgcnt cgctgggaga gagagtcact
       61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca
      121 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca
      181 agattcagtg gcagtggatc tgggcaanat tattctctaa ccatcagcag cctggagtct
      241 gacgatacag caacttatta ctgtctacag catggtgaga gcccgtacac gttcggaggg
      301 gggaccaagc tggaaataaa a
//
LOCUS       MUSIGHAFL     324 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H9-A15.
ACCESSION   M36257
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-A15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 324)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   324     Ig heavy-chain V region (AA at 1)
BASE COUNT       72 a     94 c     76 g     82 t
ORIGIN      
        1 caaattgttc tttcccagtc tccagcagtc atgtctgcat ctcctgggga gaaggtcacc
       61 ttgacctgca gtgccagctc aagtgtaagt tccacctact tgtactggta ccagcagaag
      121 ccaggatcct cccccaaact ctggatttat agcacatcca acctggcttc tggagtccct
      181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaatcag cagcatggag
      241 gctgaagatg ctgcctctta tttctgccat cagtggagta gttacccacc gacgttcggt
      301 ggaggcacca agctggaaat caaa
//
LOCUS       MUSIGHAFM     300 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            H9-D3.
ACCESSION   M36258
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-D3.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 300)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   300     Ig heavy-chain V region (AA at 1)
BASE COUNT       81 a     69 c     63 g     87 t
ORIGIN      
        1 cagattacat cctccctgtc tgtctctctg ggagacagag tcaccatcag ttgcagtgca
       61 agtcagggca ttaacagtta tttaaactgg tatcagcaga aaccggatgg aactgttaaa
      121 ctcctgatct atttcacatc aattttatat tcaggagtcc catcaaggtt cagtggcagt
      181 gggtctggga cagattattc tctcaccatc agcaacttgg aacctgaaga tattgccact
      241 tactattgtc agcagtatac tattcttcca ttcacgttcg gctcggggac aaagttggaa
//
LOCUS       MUSIGHAFN     318 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            IC5-3A4.
ACCESSION   M36259
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma IC5-3A4.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 318)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   318     Ig heavy-chain V region (AA at 1)
BASE COUNT       87 a     80 c     75 g     76 t
ORIGIN      
        1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc
       61 atcccctgca aggccagtca ggatgtaggt actgctgtag cctggtatca acagaaacca
      121 ggacaatctc ctaaaatact gatttactgg gcatccaacc ggcaaattgg agtccctgat
      181 cgcttcacag gcagtggatc tgggacagat ttcactctca ccattagcta tgtgcagtct
      241 gaagacttgg cagattattt ctgtcagcaa tatagcacct atcccacgtt cggtgctggg
      301 accaagctgg agctgaaa
//
LOCUS       MUSIGHAFO     312 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            L11-2E2.
ACCESSION   M36260
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L11-2E2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 312)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   312     Ig heavy-chain V region (AA at 1)
BASE COUNT       92 a     76 c     67 g     77 t
ORIGIN      
        1 gacatccaga tgacacagtc tccatcctca ctgtctgcat ctctgggagg caaagtcacc
       61 atcacttgca aggcaagcca agacattaac aagtatatag cttggtacca acacaagcct
      121 ggaaaaggtc ctaggctgtt catacattat acatctacat tacagccagg catcccatca
      181 aggttcagtg gaagtgggtc tgggagacat tattccttca gcatcagcaa cctggagcct
      241 gaagatattg caacttatta ttgtctacag tatgataatc ttctgacgtt cggtggaggc
      301 accaagctgg aa
//
LOCUS       MUSIGHAFP     321 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            L2-10C1.
ACCESSION   M36261
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-10C1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 321)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   321     Ig heavy-chain V region (AA at 1)
BASE COUNT       93 a     77 c     71 g     80 t
ORIGIN      
        1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc
       61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca
      121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca
      181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct
      241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttccgtggac gttcggtgga
      301 ggcaccaagc tggaaatcaa a
//
LOCUS       MUSIGHAFQ     288 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
            L2-5D2.
ACCESSION   M36262
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-5D2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 288)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   288     Ig heavy-chain V region (AA at 1)
BASE COUNT       78 a     65 c     68 g     73 t      4 others
ORIGIN      
        1 cagtctccag cctccctgtc tgcatctgtg ggagaaactg tcaccatcac atgtcgagca
       61 agtgggaata ttcacaatta tttagcatgg tatcagcaga nacagggaaa atctcctcag
      121 ctcctggtct ataatgcaaa aaccttagca gatggtgtgc catcaaggtt cagtggcagt
      181 ggatcaggaa cacaatattc tctcaagatc aacagcctgc agcctgaaga ttttgggagt
      241 tattactgtc aacatttttg gnnnactccg tacacgttcg gagggggg
//
LOCUS       MUSIGHAFR     363 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION   M36207
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H120-2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 363)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   363     Ig heavy-chain V region (AA at 1)
BASE COUNT       85 a     88 c    101 g     89 t
ORIGIN      
        1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata
       61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg
      121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac
      181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac
      241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc gaggatttac
      301 tacggtagta gctactggta cttcgatgtc tggggcgcag ggaccacggt caccgtctcc
      361 tca
//
LOCUS       MUSIGHAFS     348 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse Ig heavy-chain mRNA V region, partial cds.
ACCESSION   M36208
KEYWORDS    immunoglobulin heavy-chain; processed gene; variable region.
SOURCE      Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-5.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W.
  TITLE     A set of closely related antibodies dominates the primary antibody
            response to influenza virus hemagglutinin
  JOURNAL   J. Immunol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Kavaler, 06-JUL-1990.
FEATURES       from  to/span     description
    pept    <     1  >   348     Ig heavy-chain V region (AA at 1)
BASE COUNT       96 a     80 c     93 g     78 t      1 others
ORIGIN      
        1 cagatccagt tggtccagtc tgnacctgag ctgaagaagc ctggagagac agtcaagatc
       61 tcctgcaagg cttctgggta taccttcaca aactatggaa tgaactgggt gaagcaggct
      121 ccaggaaagg gtttaaagtg gatgggctgg ataaacacct acactggaga gccaacatat
      181 gctgatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat
      241 ttgcagatca acaacctcaa aaatgaggac acggctacat atttctgtgc aagaggagat
      301 ggtaactact ggtacttcga tgtctggggc gcagggacca cggtcacc
//
LOCUS       MUSMHEA2A1    478 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse MHC class II A-beta 2 gene, exon 2.
ACCESSION   M30158
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     1 of 2
SOURCE      Mouse (haplotype SAX2) liver DNA.
  ORGANISM  Mus saxicola
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 478)
  AUTHORS   Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
  TITLE     Structure of class II genes in wild mouse Mus saxicola: Functional
            and evolutionary implications
  JOURNAL   Eur. J. Immunol. 20, 1337-1343 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Cam, 27-NOV-1989, for release after publication.
FEATURES       from  to/span     description
    pept    /   154  +   423     A-beta 2 chain, exon 2 (AA at 156)
    IVS     <     1      153     A-beta 2 chain intron A
    IVS         424  >   478     A-beta 2 chain intron B
BASE COUNT      118 a     99 c    132 g    129 t
ORIGIN      
        1 aaataaatgc caataagcat taaatgacat tgactttgtc tcttaatgtt gggcttttga
       61 gcacattgct atctgactta gtgtaccagg tacaggcttt ccctgaggtg cggctggttc
      121 ctaactggtt cctaatcctt ttatggtttc cagagaattt tgtgattcag gcaaaggcag
      181 actgttactt caccaatggg acagaaaagg ttcacttcct ggtcaggttc atcttcaacc
      241 tggaggagta tttacatttc gacagtgacc tggggatgtt cgtggcgctg acggagctgg
      301 gggaacctga cgctgaccag tggaacaaac gcctagatct cctggagatg agcagagccg
      361 ctgtgaacat ggtctgcagg cagaagtaca agctgggggc ccccttcatc gtggagagaa
      421 atggtgggta acagcaaggt ctaggggttc tccgtgagaa ccttgtaata ggcagttt
//
LOCUS       MUSMHEA2A2    526 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse MHC class II A-beta 2 gene, exon 3.
ACCESSION   M30159
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     2 of 2
SOURCE      Mouse (haplotype SAX2) liver DNA.
  ORGANISM  Mus saxicola
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 526)
  AUTHORS   Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
  TITLE     Structure of class II genes in wild mouse Mus saxicola: Functional
            and evolutionary implications
  JOURNAL   Eur. J. Immunol. 20, 1337-1343 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Cam, 27-NOV-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +    49  /   330     A-beta 2 chain, exon 3 (AA at 49)
    IVS     <     1       48     A-beta 2 chain intron B
    IVS         331  >   526     A-beta 2 chain intron C
BASE COUNT      115 a    135 c    136 g    140 t
ORIGIN      
        1 tctagatgct cctgtccttg ctgtgccttt tgtttcttct cttgccagtg cctccagagg
       61 tgacagtgta cccagagagg accccattgc tgcagcagca caacctgctg ctctgctctg
      121 tgacaggctt ctaccccagg gacataagtg tcaagtggtt ccggaatgga caggaggagc
      181 ggtctggggt catgtccact ggccttgtta ggaatggaga ctggaccttc cagacaacag
      241 taatgctgga aatgatccca gagcttggtg acatctacag ctgcctcgtg gagcaccccg
      301 gcctcctgag acctgtttct gtggcatgga gtgagcttca tttagttttc tgtatttctg
      361 gaccttgacc tgtaggagct atcatttctg tgttcatctt gatgacaaga caagttcccc
      421 tgatctaaga atcctagtgc cagggcagga gaaaacagga caggcatcca ttctctgaat
      481 gtccaagaga ggctgttaac ccacaacttg cctgtaatcc tgggac
//
LOCUS       MUSMHEAII1    491 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse MHC class II A-beta 2 gene, exon 2.
ACCESSION   M30156
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     1 of 2
SOURCE      Mouse (haplotype SAX1) liver DNA.
  ORGANISM  Mus saxicola
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 491)
  AUTHORS   Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
  TITLE     Structure of class II genes in wild mouse Mus saxicola: Functional
            and evolutionary implications
  JOURNAL   Eur. J. Immunol. 20, 1337-1343 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Cam, 27-NOV-1989, for release after publication.
FEATURES       from  to/span     description
    pept    /   158  +   427     A-beta 2 chain, exon 2 (AA at 160)
    IVS     <     1      157     A-beta 2 chain intron A
    IVS         428  >   491     A-beta 2 chain intron B
BASE COUNT      123 a     97 c    135 g    136 t
ORIGIN      
        1 aaataaatgt caataagcat taaatgacat taagtgactt tgtctcttaa tgttgggctt
       61 ttgagcacat tgctatctga cttagtgtac caggtacagg ctttcccaga ggtgcggctg
      121 gttcctaact ggttcctaat ccttttatgg tttccagaga attttgtgat tcaggcaaag
      181 gcggactgtt acttcaccaa tgggacagaa aaggttcact tcctggtcag gttcatcttc
      241 aacctggagg aatatttaca tttcgacagt gacctgggga tgtttgtggc gctgacggag
      301 ctgggggaac ctgacgctga ccagtggaac aaacgcctgg atctcctgga gacgagcaga
      361 gccgctgtga acatggtctg caggcagaag tacaatctgg gggccccctt cattgtggag
      421 agaaatggtg ggtaacagca aggtctaggg gttctccgtg agaaccttgt aataggcatt
      481 tgaaaggttt t
//
LOCUS       MUSMHEAII2    592 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse MHC class II A-beta 2 gene, exon 3.
ACCESSION   M30157
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     2 of 2
SOURCE      Mouse (haplotype SAX1) liver DNA.
  ORGANISM  Mus saxicola
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 592)
  AUTHORS   Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
  TITLE     Structure of class II genes in wild mouse Mus saxicola: Functional
            and evolutionary implications
  JOURNAL   Eur. J. Immunol. 20, 1337-1343 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Cam, 27-NOV-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +    49  /   330     A-beta 2 chain, exon 3 (AA at 50)
    IVS     <     1       48     A-beta 2 chain intron B
    IVS         331  >   592     A-beta 2 chain intron C
BASE COUNT      138 a    147 c    148 g    159 t
ORIGIN      
        1 tctagatgct cctgtccttg ctgtgccttt tgtttcttct cttgccagtg tctccagagg
       61 tgacagtgta cccagagagg accccattgc tgcagcagca caacctgctg ctctgctctg
      121 tgacaggctt ctatcccggg aacataagtg tcaagtggtt ccggaatgga caggaggaga
      181 ggtctggggt catgtccact ggccttgtta ggaatggaga ctggaccttc cagacaacag
      241 taatgctgga aataatccca gagcttggtg acatctacag ctgcctcgtg gagcaccccg
      301 gcctcctgag acctgtttct gtggcatgga gtgagcttta ttagttttct gtaattctgg
      361 accttgacct gtaggagctg tcatttctgt gtcttgatga caagacaagt tcccctgatc
      421 taagaatcct aatgccaggg gcaggagaaa acaggacagg catccattct ctgaatgtcc
      481 aagagataga taaggaggct gttaacccac aacttgcctc taaccctggg acctagtatc
      541 ctttagcact ttaatcttta cagacactgc cactggctat aagtcaggga tc
//
LOCUS       MUSMHEB2B1    518 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse MHC class II E-beta 2 gene, exon 2.
ACCESSION   M30160
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     1 of 2
SOURCE      Mouse (haplotype SAX) liver DNA.
  ORGANISM  Mus saxicola
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 518)
  AUTHORS   Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
  TITLE     Structure of class II genes in wild mouse Mus saxicola: Functional
            and evolutionary implications
  JOURNAL   Eur. J. Immunol. 20, 1337-1343 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Cam, 27-NOV-1989, for release after publication.
FEATURES       from  to/span     description
    pept    /    92  +   358     E-beta 2 chain, exon 2 (AA at 94)
    IVS     <     1       91     E-beta 2 chain intron A
    IVS         359  >   518     E-beta 2 chain intron B
BASE COUNT      120 a     94 c    151 g    153 t
ORIGIN      
        1 ctgcaggtgt cacagttcct ttcactgact gccattctgg agcattgtct gtcctcacag
       61 acatcctgta attgtgttca tgtacccaca gcacgttttc tggagcagtt gaaggctgag
      121 tgtcactact tcaatgggaa ggagcgtgtg tggagtgtga ccagattcat ctataaccag
      181 gaagagtttg tccgctttaa cactgacttt gggaagttcc tggcagtgac tgagctgggg
      241 cggcccatag ctgagtactt gaacacccag aaggacatgc tggacaatta tcgtgcctca
      301 gtggacaggt gcagaaataa ctatgacctt gtggatatct tcatgttgaa cttaaaaggt
      361 aagcattaga tagagagtag atgggttgag gggtgtgtgt gtgtgtgtgt gtgtgtgtgt
      421 gtgtgtgtgt gtgtgtaaga gaggatagtt ctgtgtcacg tgtgtggaga tcataagaca
      481 ggttactgat tctcgccatc tatcctatgg tcctgaga
//
LOCUS       MUSMHEB2B2    573 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse MHC class II E-beta 2 gene, exon 3.
ACCESSION   M30161
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     2 of 2
SOURCE      Mouse (haplotype SAX) liver DNA.
  ORGANISM  Mus saxicola
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 573)
  AUTHORS   Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
  TITLE     Structure of class II genes in wild mouse Mus saxicola: Functional
            and evolutionary implications
  JOURNAL   Eur. J. Immunol. 20, 1337-1343 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Cam, 27-NOV-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +    97  /   378     E-beta 2 chain, exon 3 (AA at 97)
    IVS     <     1       96     E-beta 2 chain intron B
    IVS         379  >   573     E-beta 2 chain intron C
BASE COUNT      138 a    155 c    143 g    137 t
ORIGIN      
        1 gagatcttta gcctgggtgg atggaggcag gtaggcagcc cacagggaga acagctctat
       61 cctcagggct ctctctcacc tccctttctc ctctagctga gcccaaggtg actgtgtatc
      121 ctgcaaagac acagcctctg gaacaccaca acctcctggt ctgctctgtg aatgacttct
      181 accctggcag cattgaagtc agatggttcc ggaatggcga ggaggagaag actggagttg
      241 tgtccaccgg actgatccaa aatggagatt ggacctacca gaccctgttg atgctggaga
      301 tggttcctca gggtggagag gtttacacct gccaggtgga gcatcccagc ctgaccagcc
      361 ctgtcacagt ggagtggagt gagtgggaaa cccttgactc tgcaaatgcc caaccaccat
      421 ataataggag tgactttctc tgcctgttcc ctttctctga ccctgtaaat ccctgcctac
      481 cagggaagca catgggtgac tccacaggaa aaatgaatac aatttccaag ttactagctt
      541 ttattcctca cctagttcaa catctatgcc agt
//
LOCUS       MUSMHEBI1     445 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse MHC class II E-beta 1 gene, exon 2.
ACCESSION   M30154
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     1 of 2
SOURCE      Mouse (haplotype SAX) liver DNA.
  ORGANISM  Mus saxicola
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 445)
  AUTHORS   Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
  TITLE     Structure of class II genes in wild mouse Mus saxicola: Functional
            and evolutionary implications
  JOURNAL   Eur. J. Immunol. 20, 1337-1343 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Cam, 27-NOV-1989, for release after publication.
FEATURES       from  to/span     description
    pept    /    11  +   280     E-beta 1 chain, exon 2 (AA at 13)
    IVS     <     1       10     E-beta 1 chain intron A
    IVS         281  >   445     E-beta 1 chain intron B
BASE COUNT      101 a    108 c    161 g     75 t
ORIGIN      
        1 gtctccacag cacggttttt ggaacaagct aaacatgagt gtcatttcta caacggaacg
       61 cagcgcgtgc ggtttctgtt gagacaaatc cacaacaggg aggagaacgt gcgcttcgac
      121 agcgacctgg gcgagttccg cgcagtgaac gagctggggc ggccagacgc cgagtacttc
      181 aacagccaga aggaggtcct ggaggatttg cgggccgctg tggacacgta ctgcagacac
      241 aactacgaga tcttagataa cttccttgtg ccgcggagag gtgagacagg aggcggtggg
      301 tggggcggaa ccacggtgag ggtggggctg tggggagcag caggaggcgg tgcgcacgtg
      361 cgcaggagcc gcaggggatg ctgggttccc tgccgctgga gccacaggcg gctttaagca
      421 gcctcatggc aggagaacag aattc
//
LOCUS       MUSMHEBI2     525 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse MHC class II E-beta 1 gene, exon 3.
ACCESSION   M30155
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SEGMENT     2 of 2
SOURCE      Mouse (haplotype SAX) liver DNA.
  ORGANISM  Mus saxicola
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 525)
  AUTHORS   Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N.
  TITLE     Structure of class II genes in wild mouse Mus saxicola: Functional
            and evolutionary implications
  JOURNAL   Eur. J. Immunol. 20, 1337-1343 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Cam, 27-NOV-1989, for release after publication.
FEATURES       from  to/span     description
    pept    +   102  /   383     E-beta 1 chain, exon 3 (AA at 102)
    IVS     <     1      101     E-beta 1 chain intron C
    IVS         384  >   525     E-beta 1 chain intron D
BASE COUNT      116 a    144 c    141 g    124 t
ORIGIN      
        1 ggatccagtc tggatggatg gaggcaggca ggcaggcagg cagcctacag gaaggacagc
       61 tccaccctca tggcaccttc tcacctctct ttctcctcta gttgagccta cagtgactgt
      121 gtaccctaca aagacacagc ccctggaaca ccacaacctc ctggtctgct ctgtgagtga
      181 cttctaccca ggcagcattg aagtcagatg gttccggaat ggtgaggagg agaagacggg
      241 aattgtgtcc actggcctga tccgaaatgg agactggact ttccagatgc tggtgatgct
      301 ggagacggtt cctcaggtgg aagaggttta cacctgccag gtggagcatc ccagcctgac
      361 cagccctgtc acagtccagt ggagtgagtg ataacttcca tactctgtga atgtcattcc
      421 accagggagg gggcgtggct tatccctgcc tgtcagcttt ctctgcccac atactctttc
      481 cactggtttt atgctgtcct gactttcacc acagcttaaa gggaa
//
LOCUS       MUSMMTVA      448 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse mammary tumor virus long terminal repeat.
ACCESSION   M35959
KEYWORDS    long terminal repeat.
SOURCE      Mouse lymphoma cell line MLA DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 448)
  AUTHORS   Yanagawa,S.-I., Murakami,A. and Tanaka,H.
  TITLE     Extra mouse mammary tumor proviruses in DBA/2 mouse lymphomas
            acquire a selective advantage in lymphocytes by alteration in the
            U3 region of the long terminal repeat
  JOURNAL   J. Virol. 64, 2474-2483 (1990)
  STANDARD  simple staff_entry
BASE COUNT      150 a     70 c     95 g    133 t
ORIGIN      
        1 aatccaatag gtagattatt aattactatg ttaagaaatg aatcattatc ttttagtact
       61 atttttactc aaattcagaa gttagaaatg ggaatagaaa atagaaagag acgctcagcc
      121 tcagttaaag aacaggtgca agagctaacg gccgcaggcc tagaagtaaa aaagggaaag
      181 aagagtgtgc ttgtcaaaat aagacacagg tggtggcaac cagggactta taggggacct
      241 tacatctaca gaccaacaga tgcccccttt cttaaaacaa ggatgtgaga caggtggttt
      301 cctgagtcgg tttggtatca aatgttctaa tctgagctct tagtgttcta ttttcctgtg
      361 ttcttttgga atctatccaa gtcttatgta aatgcttatg taaaccatga tataaaagag
      421 tgctgatttt tttgagtaaa tttgcaac
//
LOCUS       MUSMMTVB      527 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse mammary tumor virus long terminal repeat.
ACCESSION   M35960
KEYWORDS    long terminal repeat.
SOURCE      Mouse lymphoma cell line DL8 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 527)
  AUTHORS   Yanagawa,S.-I., Murakami,A. and Tanaka,H.
  TITLE     Extra mouse mammary tumor proviruses in DBA/2 mouse lymphomas
            acquire a selective advantage in lymphocytes by alteration in the
            U3 region of the long terminal repeat
  JOURNAL   J. Virol. 64, 2474-2483 (1990)
  STANDARD  simple staff_entry
BASE COUNT      169 a     82 c    120 g    156 t
ORIGIN      
        1 aatccaatag gtagattatt aattactatg ttaagaaatg aatcattatc ttttagtact
       61 atttttactc aaattcagaa gttagaaatg ggaatagaaa atagaaagag acgctcagcc
      121 tcagttgaag aacaggtgca agagctaacg gccgcaggcc tagaagtaaa aaagggaaag
      181 aagagtgtgc ttgtcaaaat aggagacagg tggtggcaac cagggactta taggggacct
      241 tacatctaca gacggttagc ttggcttccc agggttcaaa ctgttcttaa aacaaggatg
      301 tgagacaagt gcgcaacggt tagcttggct tcccagggtt caaactgttc ttaaaacaag
      361 gatgtgagac aagtggtttc ctgagttggt ttggtatcaa atgttctaat ctgagctctt
      421 agtgttctat tttcctatgt tcttttggaa tctatccaag tcttatgtaa atgcttatgt
      481 aaaccatgat ataaaagagt gctgattttt ttgagtaaac ttgcaac
//
LOCUS       PFDSYN         85 bp ds-DNA             SYN       14-AUG-1990
DEFINITION  Bacteriophage fd synthetic G2 promoter region DNA.
ACCESSION   M24953
KEYWORDS    promoter.
SOURCE      Synthetic DNA.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 85)
  AUTHORS   Ovchinnikov,Y.A., Efimov,V.A. and Chakhmakcheva,O.G.
  TITLE     Synthesis of a polynucleotide corresponding to the promoter region
            of bacteriophage fd DNA
  JOURNAL   FEBS Lett. 100, 341-346 (1979)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    mRNA    <     1       14     G2 mRNA
BASE COUNT       34 a     17 c     15 g     19 t
ORIGIN      
        1 aaatcagtct ttaccctgtc tattatcgtc agaagcaaag cgaattgcat caaaaagatt
       61 aagaggaagc ccgaaagact tcaaa
//
LOCUS       PRMSPCA       200 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  P.vulgaris spc r-protein operon L24 gene, 3' end, and L5 gene, 5'
            end.
ACCESSION   M36264
KEYWORDS    r-protein.
SOURCE      P.vulgaris DNA.
  ORGANISM  Proteus vulgaris
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 200)
  AUTHORS   Mattheakis,L.C., Vu,L. and Nomura,M.
  TITLE     Nucleotide sequence of the L24/L5 intercistronic region of Proteus
            vulgaris, Serratia marcescens and Salmonella typhimurium
  JOURNAL   J. Mol. Biol. 204, 326-329 (1988)
  STANDARD  simple staff_entry
REFERENCE   2  (sites)
  AUTHORS   Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M.
  TITLE     Translational regulation of the spc operon in Escherichia coli:
            Identification and structural analysis of the target site for S8
            repressor protein
  JOURNAL   J. Mol. Biol. 204, 309-329 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       57     r-protein L24
    pept         71  >   200     r-protein L5
BASE COUNT       64 a     36 c     47 g     53 t
ORIGIN      
        1 ttcgaagacg gcaaaaaagt tcgtttcttc aagtctaata aagaaactat caagtaattt
       61 ggagtatacg atggcgaaac tgcatgatta ctacaaagac gaggtagtcc aaaaactgat
      121 gtctcagttt ggttaccatt ctgtcatgca agtccctcgg gtcgagaaga tcaccctgaa
      181 tatgggtgtt ggtgaagcga
//
LOCUS       PSCTETR      1110 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  Plasmid pSC101 tetracycline resistance element repressor protein
            (tetR) gene, complete cds.
ACCESSION   M36272
KEYWORDS    tetracycline resistance element repressor protein.
SOURCE      Plasmid pSC101 DNA.
  ORGANISM  Plasmid pSC101
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 1110)
  AUTHORS   Brow,M.A.D., Pesin,R. and Sutcliffe,J.G.
  TITLE     The tetracycline repressor protein of pSC101
  JOURNAL   Mol. Biol. Evol. 2, 1-12 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         42      701     tetracycline resistance element repressor
                                 protein (tetR)
BASE COUNT      235 a    298 c    351 g    226 t
ORIGIN      
        1 aagcttatcg atgataagct gtcaaacatg agaattcgcg aatgaacaag ctccaacgcg
       61 aggccgtgat ccgaaccgcg ctcgaactgc ttaacgacgt gggcatggaa ggtctaacga
      121 cgcgccgact ggctgagcgc ctcggggtgc aacagccagc gctctactgg catttcaaga
      181 acaagcgtgc gttgctcgac gcacttgccg aagccatgct gacgataaat cacacgcatt
      241 cgacgccaag ggatgacgac gactggcgtt cgttcctgaa gggcaatgca tgcagttttc
      301 gacgggcgtt gctcgcttat cgcgatggcg cgcgtattca tgccgggacg cggccagccg
      361 cgccgcagat ggaaaaagcc gacgcgcagc ttcgcttcct ttgcgatgct ggcttttcgg
      421 caggtgacgc gacctatgcg ttgatggcaa tcagctactt caccgtcggc gctgttcttg
      481 agcagcaagc tagcgaggca gacgccgagg agcggggcga agatcagttg accacctcag
      541 cgtctacgat gccggcgcgc ctacagagcg cgatgaaaat cgtctacgaa ggcggtccgg
      601 acgcggcatt cgagcgaggc ctggctctca tcatcggcgg tcttgaaaaa atgaggctca
      661 ctacgaacga cattgaggtg ctgaagaatg ttgacgaatg acagggggcg gcaggtgcgg
      721 agggcgcggt tgcttcgtca tatgaagcaa agtcacctag ctgaattaat gggtgtggat
      781 caggcaaccg tgtcgcgctg ggagcggggc acccttgcat tgtcggatgg gaggtggtca
      841 gcggttcttc aattgcttac cgggccttcc gattcatcgt acgacgctgc gctgaagcgt
      901 ctggtgcaat cctccgccca caaagtccat ctggtagcga ccggacacat tgtttgctcg
      961 cggcatctcc ggccaggcaa agggaattgc ggattgacct agccgaactc cttggtaaat
     1021 cgctgcgtgt ttatgcgtcc ccgagatagt tgcggccgac tctgcgctta atgggctcgg
     1081 ttggcatgag gggcggctgg ggtcactcga
//
LOCUS       PTVCOMPL      359 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Potato spindle tuber viroid complete sequence.
ACCESSION   M36163
KEYWORDS    .
SOURCE      Potato spindle tuber viroid RNA.
  ORGANISM  Potato spindle tuber viroid
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 359)
  AUTHORS   Gross,H.J. and Riesner,D.
  TITLE     Viroids: A class of subviral pathogens
  JOURNAL   Angew Chem. Int. Ed. Engl. 19, 231-243 (1980)
  STANDARD  simple staff_entry
BASE COUNT       73 a    108 c    102 g     76 t
ORIGIN      
        1 cggaactaaa ctcgtggttc ctgtggttca cacctgacct cctgagcaga aaagaaaaaa
       61 gaaggcggct cggaggagcg cttcagggat ccccggggaa acctggagcg aactggcaaa
      121 aaaggacggg ggggagtgcc cagcggccga caggagtaat tcccgccgaa acagggtttt
      181 cacccttcct ttcttcgggt gtccttcctc gcgcccgcag gaccacccct cgcccccttt
      241 gcgctgtcgc ttcggctact acccggtgga aacaactgaa gctcccgaga accgcttttt
      301 ctctatctta cttgcttcgg ggcgagggtg tttagccctt ggaaccgcag ttggttcct
//
LOCUS       RATCBFB      1415 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Rat CCAAT binding transcription factor-B subunit (CBF-B) mRNA,
            complete cds.
ACCESSION   M34238
KEYWORDS    CCAAT binding transcription factor-B subunit;
            heteromeric mammalian transcription factor; nuclear protein;
            transcription activator.
SOURCE      Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone
            pCBF-B1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1415)
  AUTHORS   Maity,S.N., Vuorio,T. and de Crombrugghe,B.
  TITLE     The B subunit of a heteromeric rat CCAAT-binding transcription
            factor shows a striking sequence identity with the yeast Hap2
            transcription factor
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5378-5382 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.N.Maity, 11-MAY-1990, for release after publication.
FEATURES       from  to/span     description
    pept        171     1196     CCAAT binding transcription factor-B subunit
                                 (CBF-B)
BASE COUNT      371 a    371 c    417 g    256 t
ORIGIN      
        1 cagcggcagt ggcggcggcg cggcggctgg agcctctgat tgggtttcgg ggtccggtac
       61 tggagccaat cagcgcgggc agcgaaccgg gggagcgagg cacggagtcc ctacctggca
      121 gctgcctggg atctgtagag tgaagcttca ggacacaacg tggccgggcc atggagcagt
      181 atacagcaaa tagcaatagt tccacagagc agatcgtggt gcaggctggc cagattcagc
      241 agcagcagca gggtggtgtc actgctgtcc agctgcagac tgaggcccag gtggcatccg
      301 cctcaggcca gcaagtccag accctccagg tagttcaggg gcagccatta atggtgcaag
      361 tcagtggagg ccagctgatt acatcaactg gccaacccat catggtacag gctgtgcctg
      421 gtggacaagg ccaaactatc atgcaagtac ctgtgtctgg aacacaaggt ttacagcaga
      481 tacagttggt accccctgga cagatccaga tccagggcgg gcaggctgtg caggtgcaag
      541 gccagcaggg acagacccag cagatcatca ttcagcagcc acagaccgcg gtcaccgctg
      601 gccagactca gacacaacaa cagattgctg tccagggaca gcaagtggcc cagactgctg
      661 aagggcagac tattgtctac cagccagtta atgcagatgg cacaatcctc cagcaaggca
      721 tgatcaccat cccagcagcc agtttggcag gggcacagat cgttcagaca ggagccaata
      781 ccaacacaac cagtagtgga caagggactg tcactgtgac actgccagtg gcagggaatg
      841 tggtcaactc aggaggaatg gtcatgatgg taccaggggc tggctctgtg cctgctatcc
      901 aaagaatccc tttacctggg gcagagatgc tggaagaaga gccgttgtat gtgaatgcca
      961 aacagtatca ccgcatcctt aagaggagac aagcgagggc caagctagag gcagaaggga
     1021 agatcccaaa ggaaagaagg aaatacctcc atgagtctcg gcaccgacac gccatggcac
     1081 ggaagcgtgg ggaagggggc cgcttcttct ctccaaaaga aaaggacagt cctcacatgc
     1141 aggatccaaa ccaagctgat gaagaagcca tgacacagat catccgagtt tcctaaccac
     1201 aggagggtgg cagagcggat gagtcacagt cctctccact gttcctggaa atggactgac
     1261 ttccagttgg gcactgacag tctcactctg ccctttctca ggacagaaac tacttagctc
     1321 agtattacag ctgcagtgat ggctgacaaa ctgaagaggc aagcttttgt ctctcccttt
     1381 cattgaggac ctgttcagac tgttggtgac attga
//
LOCUS       RATRTG2       902 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Rat thyroglobulin (rTg-2) mRNA, complete cds.
ACCESSION   M35965
KEYWORDS    thyroglobulin.
SOURCE      Rat (strain Fisher) thyrocyte, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 902)
  AUTHORS   Graves,P.N. and Davies,T.F.
  TITLE     A second thyroglobulin messenger RNA species (rTg-2) in rat
            thyrocytes
  JOURNAL   Mol. Endocrinol. 4, 155-161 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1      732     thyroglobulin (rTg-2)
    mRNA    <     1      902     thyroglobulin mRNA
BASE COUNT      239 a    216 c    234 g    213 t
ORIGIN      
        1 atgatgacct tggtcttgtg ggtctcgact ttgttgagct cagtctgcct ggtagcagcc
       61 aacatctttg agtaccaagt ggatgcacag ccactccgcc cctgtgagct gcaaagggag
      121 aaggcctttc tgaagcagga tgaatatgtt ccacagtgct ctgaagatgg aagtttccag
      181 acagttcagt gccaaaatga tggccaatct tgctggtgtg tggattctga tggcacggaa
      241 gtacctggca gcaggcagct gggaaggcca acagcatgtc tgtccttctg ccagctacac
      301 aagcaacgaa tcctgctgag cagctacatt aacagcacag atgctctcta ccttcctcag
      361 tgccaggatt cagggaacta tgcaccagtg cagtgcgacc tgcagcaagt gcaatgctgg
      421 tgtgtagaca cagaggggat ggaggtgtat ggcacccgcc agcaggggag gccaacacgg
      481 tgtccaagga gctgtgagat aagaagccgc cgtctcctcc atggagtagg agacaaatcg
      541 ccaccccagt gcgatgcaga tggggagttt atgcctgtcc agtgcaagtt tgtcaacacc
      601 acagacatga tgatttttga tctgattcac aactacaaca gcaacaaacc tcacctcaaa
      661 tgccctagta catctgttat actgtggcaa gagtcaagag cacctttaac atgtgctgaa
      721 gaatcatcct aattggaata tagaagacta tggggaaatc ctgagaggca gatcgtctag
      781 tcacctgcaa gaccacctat ggtggccaat ttgacttgtg tgtattgtta tgctgtgtat
      841 agctttctgc caaagactgc ctgccctggc ccaggagaga ataaacacac tgttaattta
      901 tt
//
LOCUS       SMASPCA       201 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  S.marcescens spc operon r-protein L24 gene , 3' end, and L5 gene,
            5' end.
ACCESSION   M36265
KEYWORDS    r-protein.
SOURCE      S.marcescens DNA.
  ORGANISM  Serratia marcescens
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 201)
  AUTHORS   Mattheakis,L.C., Vu,L. and Nomura,M.
  TITLE     Nucleotide sequence of the L24/L5 intercistronic region of Proteus
            vulgaris, Serratia marcescens and Salmonella typhimurium
  JOURNAL   J. Mol. Biol. 204, 326-329 (1988)
  STANDARD  simple staff_entry
REFERENCE   2  (sites)
  AUTHORS   Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M.
  TITLE     Translational regulation of the spc operon in Escherichia coli:
            Identification and structural analysis of the target site for S8
            repressor protein
  JOURNAL   J. Mol. Biol. 204, 309-329 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       57     r-protein L24
    pept         72  >   201     r-protein L5
BASE COUNT       65 a     40 c     46 g     50 t
ORIGIN      
        1 ttcgaagacg ctaaaaaagt ccgtttcttc aagtctaaca gcgaaactat caagtaattt
       61 ggagtaatac gatggcgaaa ctgcatgatt actacaaaga cgaggtagtc aaacaactga
      121 tgtctcagtt tgattacaac tctgtcatgc aagtccctcg ggtcgagaag atcaccctga
      181 acatgggtgt tggtgaagcg a
//
LOCUS       STYSPCA       201 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  S.typhimurium spc operon r-protein L24 gene, 3' end, and L5 gene,
            5' end.
ACCESSION   M36266
KEYWORDS    r-protein.
SOURCE      S.typhimurium DNA.
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 201)
  AUTHORS   Mattheakis,L.C., Vu,L. and Nomura,M.
  TITLE     Nucleotide sequence of the L24/L5 intercistronic region of Proteus
            vulgaris, Serratia marcescens and Salmonella typhimurium
  JOURNAL   J. Mol. Biol. 204, 326-329 (1988)
  STANDARD  simple staff_entry
REFERENCE   2  (sites)
  AUTHORS   Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M.
  TITLE     Translational regulation of the spc operon in Escherichia coli:
            Identification and structural analysis of the target site for S8
            repressor protein
  JOURNAL   J. Mol. Biol. 204, 309-329 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       57     r-protein L24
    pept         72  >   201     r-protein L5
BASE COUNT       67 a     38 c     46 g     50 t
ORIGIN      
        1 ttcgaagacg gtaaaaaagt ccgtttcttc aagtctaaca gcgaaactat caagtaattt
       61 ggagtagtac gatggcgaaa ctgcatgatt actacaaaga cgaagtagtt aacaaactca
      121 tgactgagtt taactacaat tctgtcatgc aagtccctcg ggtcgagaag atcaccctga
      181 acatgggtgt tggtgaagcg a
//
LOCUS       GCOADDA      1771 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  G.verrucosa 18S ribosomal RNA gene.
ACCESSION   M33638
KEYWORDS    18S ribosomal RNA.
SOURCE      G.verrucosa (isolate Norway) male tetrasporangial DNA.
  ORGANISM  Gracilaria verrucosa
            Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; 
            Florideophycideae; Gracilariaceae; Gracilariaceae.
REFERENCE   1  (bases 1 to 1771)
  AUTHORS   Bird,C.J., Rice,E.L., Murphy,C.A., Liu,Q.Y. and Ragan,M.A.
  TITLE     Nucleotide sequences of the 18S ribosomal RNA gene from the red
            algae Gracilara tikvahiae McLachlan, Gracilaria verrucosa (Hudson)
            Papenfuss and Gracilariopsis sp
  JOURNAL   Nucleic Acids Res. 18, 4023-4024 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Ragan, 08-APR-1990.  Atlantic Res. Lab., National Research
            Council of Canada, 1411 Oxford St., Halifax, Nova Scotia, CANADA
            B3H 3Z1
BASE COUNT      454 a    366 c    498 g    453 t
ORIGIN      
        1 caacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc
       61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct
      121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccaaa
      181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt
      241 tttttctgat cgcaattatt gcgacgcacc gttcaaattt ctgacctatc aactttggat
      301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg
      361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc
      421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta
      481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc
      541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc
      601 tcgtagtcgg attttggtgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg
      661 cctttgtgga tgggagctag gtggtgctta attggatcac ctagctgccg ccaccgttta
      721 ctgtgaaaaa aatagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata
      781 atagaatagg acccggtcct attttgttgg tttgcttgaa tcgggtaatg attaagaggg
      841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg
      901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga
      961 tcgaagacga tcagataccg tcgtagtctt tactataaac gatgaggact ggagatcgga
     1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg
     1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg
     1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg
     1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt
     1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag
     1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc
     1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc
     1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat
     1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa
     1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct
     1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat
     1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt
     1741 ttccgtaggt gaacctgcag aaggatcaga a
//
LOCUS       GRO18SAA     1782 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  Gracilariopsis sp. 18S ribosomal RNA gene.
ACCESSION   M33639
KEYWORDS    18S ribosomla RNA.
SOURCE      Gracilariopsis sp. (isolate England-1) diploid tetrasporophyte DNA.
  ORGANISM  Gracilariopsis sp.
            Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; 
            Florideophycideae; Gracilariaceae; Gracilariaceae.
REFERENCE   1  (bases 1 to 1782)
  AUTHORS   Bird,C.J., Rice,E.L., Murphy,C.A., Liu,Q.Y. and Ragan,M.A.
  TITLE     Nucleotide sequences of the 18S ribosomal RNA gene from the red
            algae Gracilara tikvahiae McLachlan, Gracilaria verrucosa (Hudson)
            Papenfuss and Gracilariopsis sp
  JOURNAL   Nucleic Acids Res. 18, 4023-4024 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Ragan, 08-APR-1990.
BASE COUNT      443 a    372 c    515 g    452 t
ORIGIN      
        1 caacctggtt gatcctgcca gtggtatatg cttgtctata ggactaagcc atgcaagtgc
       61 aagtatgagt gtattataca acgaaactgc gaatggctct gtaaaacagc tataatttct
      121 tcggtgctaa atactactcg gataaccgta gtaattctag agctaatacg tgcctacaca
      181 gcgacgcaag tcgtggtaca aattagatac acaagccagc ttttttggtg attctagatt
      241 ttttttctga tcgcacattg ttgcgacgca ccgttcaaat ttctgaccta tcaactttgg
      301 atggtaaggt aatggcttac catggttgtg acgggtaacg gaccgtgggt gcgggactcc
      361 ggagagggag cctgagagac ggctaccaca tccaaggaag gcagcaggcg cgcaacttac
      421 ccaatccgga caccgggagg tagtgacaag aaatagcaat agagggcccg atgggttttc
      481 taattggaat gagaacaaga taaacagctt atcgaggagc cagcagaggg caagtctggt
      541 gccagcagcc gcggtaattc cagctctgta agcgtatacc aaagttgttg cagttaaaac
      601 gctcgtagtc gaattttggc gtgtgatttg ggtcgttctc gcggacgcgc tcaggttgtg
      661 cgcctttgtg gagagggggt gtggtggtgc ttgagtgcgc tgccatgctg ccgccaccgt
      721 ttactgtgaa aaaattagag tgttcaaagc aggcgattgc cctgaataca ttagcatgga
      781 ataatagaat aggacccggt tctattttgt tggtttgttt gaatcaggta atgattaaca
      841 gggacggttg ggggcattcg tattccggcg ctagaggtga aattcttgga ttgtcggaag
      901 acgaacagct gcgaaagcgt ctgccaagga cgttttcatt gatcaagaac gaaagtaagg
      961 ggatcgaaga cgatcagata ccgtcgtagt ctttactata aacgatgagg actggagatc
     1021 ggacaagact gatttatggc ttgttcggca tccttcgaga aatcaaagtg tttgctttct
     1081 ggggggagta tggtcgcaag gctgaaactt aaaggaattg acggaagggc atcaccgggt
     1141 gtggagcctg cggcttaatt tgactcaaca cgggaaaact taccaggtca ggacatagta
     1201 aggattgaca gattgagagc tctttcttga ttctatggtt ggtggtgcat ggccgttctt
     1261 agttggtgga gtgatctgtc tggttaattc cgttaacgag cgagacctgg gcgtgctagc
     1321 taggcgccgt taccaattga gtttggtagc gaggcggtgc cttcctagac ggactgtggg
     1381 cgtctagccc acggaagctc caggcaataa caggtctgag atgcccttag atgtcctggg
     1441 ccgcacgcgt gctacactga acggcccaac gagtggagga tgtgcgagag catttccgaa
     1501 tctccaaatc cgttcgtgat ggggatcgac ggttgcaatt ttccgtcgtc aacgaggaat
     1561 accttgtaag cgcgggtcat catcccgcgc tgaatacgtc cctgcccttt gtacacaccg
     1621 cccgtcgctc ctaccgattg agtggtccgg tgaggccttg ggagggctgg atgcgctgtt
     1681 tttatgcaga cattttggcc tgaacctggt caaaccttat cacttagagg aaggagaagt
     1741 cgtaacaagg tttccgtagg tgaacctgca gaaggatcag aa
//
LOCUS       REO1SIG1     1463 bp ds-RNA             VRL       14-AUG-1990
DEFINITION  Reovirus type 1 sigma-1 protein (S1) gene, complete cds.
ACCESSION   M35963
KEYWORDS    cell-attachment protein; hemagglutinin; sigma-1 protein.
SOURCE      Reovirus type 1 (strain Lang) RNA.
  ORGANISM  Reovirus type 1
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 1463)
  AUTHORS   Nibert,M.L., Dermody,T.S. and Fields,B.N.
  TITLE     Structure of the reovirus cell-attachment protein: A model for the
            domain organization of sigma-1
  JOURNAL   J. Virol. 64, 2976-2989 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         14     1426     sigma-1 protein
BASE COUNT      426 a    291 c    370 g    376 t
ORIGIN      
        1 gctattcgcg cctatggatg catctctcat tacagagata cggaaaatag tactccaact
       61 atctgtatca agcaatggct cccagtcaaa agaaatcgag gaaatcaaga aacaagtcca
      121 ggtcaacgtt gatgatatca gggctgccaa tattaaactc gacggacttg gaagacagat
      181 tgctgacatc agcaatagca tctcaaccat tgagtcaaga ttgggtgaga tggataatcg
      241 acttgtgggt atctcgagtc aggtcacgca attatctaac tcagttagcc agaacactca
      301 gagcatatcc tcattgggtg acagaatcaa tgctgtcgaa ccacgagttg acagtctgga
      361 tacggtcacg tctaatctca ctggacgaac atccactttg gaggcagatg ttggaagctt
      421 acggacagaa ctagcagcgc taacaacacg ggtgacaact gaggttacaa ggttagatgg
      481 tctaatcaat agtggccaga attcgattgg tgagctatcc acaagactat ccaatgtgga
      541 gacgtctatg gtgacgacgg ctggacgggg actgcagaaa aacggaaaca ccttgaacgt
      601 cattgtaggt aatggaatgt ggtttaatag ttctaatcaa ttgcagctcg acctttcggg
      661 gcaatcaaaa ggggtgggat ttgtcggcac aggaatggtg gttaagattg atactaatta
      721 ttttgcttac aatagtaatg gagagattac attggtgagt caaatcaatg aattgccatc
      781 gcgcgtatca acactggaat cagcgaaaat cgattcagtt ttacctccat taaccgtacg
      841 cgaagcgagc ggcgtacgta ccctgagctt tggttatgat acgagcgatt ttacaatcat
      901 caactccgta ctgtcgttac ggtcacgttt gactcttccg acatacaggt accctctgga
      961 gctcgacaca gcaaataata gagtgcaggt ggcagatcgt tttggcatgc gcacgggtac
     1021 ttggacggga caattgcaat atcagcaccc acaattgagt tggagagcaa atgtcacttt
     1081 gaatttgatg aaggtggatg attggttggt gttgagcttt tctcagatga cgactaactc
     1141 aataatggca gatgggaaat ttgtgattaa ttttgtgtct gggttatctt ctggatggca
     1201 gacgggggat actgaaccat cgtcaactat tgatccatgg tctacgacat ttgccgcggt
     1261 ccaatttcta aataacggtc aacgcattga tgcgtttagg atcatgggag tatcggaatg
     1321 gacggatgga gaattagaga ttaagaatta tggtggcaca tacaccggtc atactcaagt
     1381 atattgggct ccgtggacga tcatgtatcc atgcaatgtg aggtgaatct agcgcgaacc
     1441 ctcggcacaa ggggtcaatc atc
//
LOCUS       REO2SIG1     1440 bp ds-RNA             VRL       14-AUG-1990
DEFINITION  Reovirus type 2 sigma-1-s protein (S1) gene, complete cds.
ACCESSION   M35964
KEYWORDS    cell-attachment protein; hemagglutinin; sigma-1-s protein.
SOURCE      Reovirus type 2 (strain Jones) RNA.
  ORGANISM  Reovirus type 2
            Viridae; ds-RNA nonenveloped viruses; Reoviridae.
REFERENCE   1  (bases 1 to 1440)
  AUTHORS   Nibert,M.L., Dermody,T.S. and Fields,B.N.
  TITLE     Structure of the reovirus cell-attachment protein: A model for the
            domain organization of sigma-1
  JOURNAL   J. Virol. 64, 2976-2989 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         14     1402     sigma-1-s protein
BASE COUNT      384 a    314 c    383 g    359 t
ORIGIN      
        1 gctattcgca ctcatgtcgg atctagtgca gctcataaga agggagatct tactgttaac
       61 tgggaatgga gaatcagcca actcgaaaca cgagatcgag gaaattaaga aacaaattaa
      121 agacatctct gctgatgtca acaggatcag taacatcgtt gattcaatcc aaggacaact
      181 gggtggatta tctgtacgcg tgtcagccat tgaatcggga gttagtgaga acggcaatcg
      241 aattgataga ctcgagcgag atgtctccgg catatcggct agcgttagcg gaatcgattc
      301 gcgtttatcc gagctgggtg accgagtcaa tgttgcagaa cagcgaattg gccagttgga
      361 tacagtcacg gataatctcc ttgagcgagc atcaagactg gaaactgaag tatcagccat
      421 tactaatgac cttggatcat tgaatacgag ggtgacgact gaattgaacg atgtccgcca
      481 aactattgct gcgatagaca cgcgtctcac gacactggag accgatgccg tgacgtcggt
      541 tggtcaaggg cttcagaaga ctgggaactc gattaaggtt attgtgggta cggggatgtg
      601 gttcgaccgc aataatgttc tgcagttatt cgtatcgaac cagcagaaag ggttgggatt
      661 catagacaat ggaatggtag tgaaaataga tacccagtat ttcagcttcg atagcaatgg
      721 caacataact ctgaacaaca acataagtgg tctgccggcg cgaacaggtt ccctcgaggc
      781 atctcgtatc gatgtggtag cgccaccgct tgtgatacag tctactggta gcactcggct
      841 actgcgtctc atgtacgagg ctgtggactt cgtggttact aacaacgttc tcacactgag
      901 aaatcgatcg gtcacgccaa cattcaagtt tcctctggag ttgaatagtg ctgataactc
      961 agtgagcatt catagaaatt accgcattag acttgggcaa tggtcaggtc aattggaata
     1021 tcacacgccg agtttgcgtt ggaatgctcc cgtcacggtt aatttgatgc gagtagacga
     1081 ttggctcatt ttgagtttta ctcggttttc gacgagcggc atcttagcgt caggaaagtt
     1141 tgtattgaac ttcgtaactg gtttgtctcc agggtgggcg actgggagta ccgagccctc
     1201 gacaactact aacccactgt caacgacgtt tgctgcaatt cagttcatca atgggtcatc
     1261 tcgcgtagac gcctttagaa tcttgggagt cgcagagtgg aatgccgggg aactagagat
     1321 cacgaattat ggcggaacat atacagcgca taccaatgtc gactgggcgc cgatgaccat
     1381 tatgtaccca tgtctgggct gaggatccgg gtgctccact cggcacagtg gcgactcatc
//
LOCUS       MARCMYC       100 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Woodchuck mutant c-myc gene, intron A and breakpoint region.
ACCESSION   M30385
KEYWORDS    c-myc oncogene.
SOURCE      Woodchuck (individual W64, chronically infected with woodchuck
            hepatitis virus [WHV]) hepatocellularcarcinoma (HCC) DNA.
  ORGANISM  Marmota monax
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; 
            Marmotini.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Moeroey,T., Marchio,A., Etiemble,J., Trepo,C., Tiollais,P. and
            Buendia,M.-A.
  TITLE     Rearrangement and enhanced expression of c-myc in hepatocellular
            carcinoma of hepatitis virus infected woodchucks
  JOURNAL   Nature 324, 276-279 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    IVS     <     1  >   100     c-myc intron A
    recomb       48       49     cellular DNA of unknown function end/c-myc DNA
                                 start
BASE COUNT       24 a     26 c     29 g     21 t
ORIGIN      3 bp upstream of BGlII site.
        1 ggagatctca ggaagtaagg cactcagaga gacagtgggc gattcgtagg attccccacg
       61 gcgattccaa ctcgcccccg ctctttttaa gaagttggcg
//
LOCUS       MARWHV1       120 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Woodchuck DNA/woodchuck hepatitis virus DNA, 5' junction.
ACCESSION   M23563
KEYWORDS    c-myc oncogene.
SEGMENT     1 of 2
SOURCE      Woodchuck (individual W74) hepatocellular carcinoma DNA.
  ORGANISM  Marmota monax
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; 
            Marmotini.
REFERENCE   1  (bases 1 to 120)
  AUTHORS   Hsu,T.-y., Moeroey,T., Etiemble,J., Louise,A., Trepo,C.,
            Tiollais,P. and Buendia,M.-A.
  TITLE     Activation of c-myc by woodchuck hepatitis virus insertion in
            hepatocellular carcinoma
  JOURNAL   Cell 55, 627-635 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1        6     c-myc oncogene
    recomb       92       95     woodchuck DNA end/virus DNA start
BASE COUNT       41 a     23 c     25 g     31 t
ORIGIN      
        1 gcctaagttg aactgttgga gggaggaact ggaatctcct aaattctcac tccttactaa
       61 gggaaagcaa ggaaaaaggt tccttctgac tgaaggcata aaaaaccatt atcctaattt
//
LOCUS       MARWHV2        40 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Woodchuck DNA/woodchuck hepatitis virus DNA, 3' junction.
ACCESSION   M23564
KEYWORDS    c-myc oncogene.
SEGMENT     2 of 2
SOURCE      Woodchuck (individual W74) hepatocellular carcinoma DNA.
  ORGANISM  Marmota monax
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; 
            Marmotini.
REFERENCE   1  (bases 1 to 40)
  AUTHORS   Hsu,T.-y., Moeroey,T., Etiemble,J., Louise,A., Trepo,C.,
            Tiollais,P. and Buendia,M.-A.
  TITLE     Activation of c-myc by woodchuck hepatitis virus insertion in
            hepatocellular carcinoma
  JOURNAL   Cell 55, 627-635 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    recomb       15       16     virus DNA end/woodchuck DNA start
BASE COUNT        5 a     14 c      7 g     14 t
ORIGIN      About 900 bp after segment 1.
        1 agggcagcat tgatctacgc ttcctgcttc cctctttctc
//
LOCUS       BSUPAIA      1367 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  B.subtilis negative regulator of protease synthase and sporulation
            (pai) genes, complete cds.
ACCESSION   M36471
KEYWORDS    regulatory protein.
SOURCE      B.subtilis (strain DB104) DNA, clone pPAI121.
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1367)
  AUTHORS   Honjo,M., Nakayama,A., Fukazawa,K., Kawamura,K., Ando,K.,
            Hori,M. and Furutani Y,
  TITLE     A novel Bacillus subtilis gene involved in negative control of
            sporulation and degradative-enzyme production
  JOURNAL   J. Bacteriol. 172, 1783-1790 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        100      618     negative regulator pai 1
    pept        640     1263     negative regulator pai 2
BASE COUNT      504 a    205 c    283 g    375 t
ORIGIN      
        1 tttttggaat ggtaacaagg tcagtatgtg ttccgtgttt aaatatataa aagggacaca
       61 agacaattga attttaaata aaggggctga atccgaacaa tgagtgtaaa aatgaaaaaa
      121 tgcagccggg aagatttaca aacacttcaa caattgagta ttgaaacatt caatgacatt
      181 tttaaagaac agaactcacc tgaaaatatg aaagcctatt tagaaagcgc atttaacact
      241 gagcagctgg aaaaagagtt atctaatatg tcttcgcaat tcttttttat ttactttgat
      301 catgaaatcg ctggatatgt aaaggtcaat atcgatgatg ctcagtctga agaaatgggt
      361 gctgaatcac ttgaaatcga gagaatttat ataaagaaca gctttcaaaa acatgggctt
      421 ggcaaacatc tgctgaataa agcgatagaa attgcgctgg aacgtaataa aaagaacatt
      481 tggctaggtg tgtgggaaaa aaatgaaaat gccattgcct tttataagaa aatggggttt
      541 gttcagaccg gcgcccactc attttatatg ggtgatgaag aacaaacgga tttaatcatg
      601 gctaaaacac tcatataata tttttcgaag gggatgatga tgtatattcc aaaatatttt
      661 aaagtcacaa atgctgaaga aatttggaat tttgtacaag aaaactcctt tggtacggtt
      721 gtcacaacag aacaaggaaa accaattgcc acacatttgc cgttaggatt caataaaaaa
      781 gacgatcatt actatattac ggggcatttc gcttatggaa atcctcagtg gagaacgttt
      841 gaagcctgtg aagatgtgct tgtgatgttt cagggaccgc acgcttatat ttcttcttcc
      901 tggtattcgc gcgaaaatgt tccgacgtgg aattatcagg ctgtccacat gtatggaaaa
      961 gcaagtatgt tagagaaaga tgaattagca gaagaattaa caatcatgtt ggagaaatac
     1021 gaaaaacatc gtgacaatcc tgtgttatgg gataagcttt ctccaaaact cttagaaagt
     1081 gaattgaaag gtattgttgg atttaagatt aaggtggaag acattcaagc ggcatataaa
     1141 ttaagccaga accgaaatga aacggattat atgaacgtca ttgaacagtt gcaaaatgaa
     1201 gaaaatccaa atgcaaaaca aatggcagaa ctgatggaag ataaactcaa aaagcaaata
     1261 taagcttact ttgctgaaga agtagcagaa aacgccaact caagcggaga accctggcac
     1321 ttggcgtttt tttatgatta atatttatcc cattaaaaat ggaattc
//
LOCUS       HUMA2MGL     2041 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human alpha-2-macroglobulin mRNA, 3' end.
ACCESSION   M36501
KEYWORDS    alpha-2-macroglobulin.
SOURCE      Human liver, cDNA to mRNA, clone pha2ml.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2041)
  AUTHORS   Bell,G.I., Rall,L.B., Sanchez-Pescador,R., Merryweather,J.P.,
            Scott,J., Eddy,R.L. and Shows,T.B.
  TITLE     Human alpha-2-macroglobulin gene is located on chromosome 12
  JOURNAL   Somat. Cell Mol. Genet. 11, 285-289 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1     1932     alpha-2-macroglobulin (AA at 1)
                                 /hgml_locus_uid="LH0012N" /nomgen="A2M"
                                 /map="12p13.3-p12.3"
BASE COUNT      561 a    528 c    481 g    471 t
ORIGIN      
        1 cccgccttcc tagctgtccc agtggagaag gaacaagcgc ctcactgcat ctgtgcaaac
       61 gggcggcaaa ctgtgtcctg ggcagtaacc ccaaagtcat taggaaatgt gaatttcact
      121 gtgagcgcag aggcactaga gtctcaagag ctgtgtggga ctgaggtgcc ttcagttcct
      181 gaacacggaa ggaaagacac agtcatcaag cctctgttgg ttgaacctga aggactagag
      241 aaggaaacaa cattcaactc cctactttgt ccatcaggtg gtgaggtttc tgaagaatta
      301 tccctgaaac tgccaccaaa tgtggtagaa gaatctgccc gagcttctgt ctcagttttg
      361 ggagacatat taggctctgc catgcaaaac acacaaaatc ttctccagat gccctatggc
      421 tgtggagagc agaatatggt cctctttgct cctaacatct atgtactgga ttatctaaat
      481 gaaacacagc agcttactcc agagatcaag tccaaggcca ttggctatct caacactggt
      541 taccagagac agttgaacta caaacactat gatggctcct acagcacctt tggggagcga
      601 tatggcagga accagggcaa cacctggctc acagcctttg ttctgaagac ttttgcccaa
      661 gctcgagcct acatcttcat cgatgaagca cacattaccc aagccctcat atggctctcc
      721 cagaggcaga aggacaatgg ctgtttcagg agctctgggt cactgctcaa caatgccata
      781 aagggaggag tagaagatga agtgaccctc tccgcctata tcaccatcgc ccttctggag
      841 attcctctca cagtcactca ccctgttgtc cgcaatgccc tgttttgcct ggagtcagcc
      901 tggaagacag cacaagaagg ggaccatggc agccatgtat ataccaaaga cctgctggcc
      961 tatgcttttg ccctggcagg taaccaggac aagaggaagg aagtactcaa gtcacttaat
     1021 gaggaagctg tgaagaaaga caactctgtc cattgggagc gccctcagaa acccaaggca
     1081 ccagtggggg atttttacga accccaggct ccctctgctg aggtggagat gacatcctat
     1141 gtgctcctcg cttatctcac ggcccagcca gccccaacct cggaggacct gacctctgca
     1201 accaacatcg tgaagtggat cacgaagcag cagaatgccc agggcggttt ctcctccacc
     1261 caggacacag tggtggctct ccatgctctg tccaaatatg gagcagccac atttaccagg
     1321 actgggaagg ctgcacaggt gactatccag tcttcaggga cattttccag caaattccaa
     1381 gtggacaaca acaaccgcct gttactgcag caggtctcat tgccagagct gcctggggaa
     1441 tacagcatga aagtgacagg agaaggatgt gtctacctcc agacatcctt gaaatacaat
     1501 attctcccag aaaaggaaga gttccccttt gctttaggag tgcagactct gcctcaaact
     1561 tgtgatgaac ccaaagccca caccagcttc caaatctccc taagtgtcag ttacacaggg
     1621 agccgctctg cctccaacat ggcgatcgtt gatgtgaaga tggtctctgg cttcattccc
     1681 ctgaagccaa cagtgaaaat gcttgaaaga tctaaccatg tgagccggac agaagtcagc
     1741 agcaaccatg tcttgattta ccttgataag gtgtcaaatc agacactgag cttgttcttc
     1801 acggttctgc aagatgtccc agtaagagat ctgaaaccag ccatagtgaa agtctatgat
     1861 tactacgaga cggatgagtt tgcaattgct gagtacaatg ctccttgcag caaagatctt
     1921 ggaaatgctt gaagaccaca aggctgaaaa gtgctttgct ggagtcctgt tctcagagct
     1981 ccacagaaga cacgtgtttt tgtatcttta aagacttgat gaataaacac tttttctggt
     2041 c
//
LOCUS       HUMPPA1CO    1500 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human alpha-1 collagen type I mRNA, 5' end.
ACCESSION   M36546
KEYWORDS    alpha-1 collagen type I.
SOURCE      Human, cDNA to mRNA, clone pHUCI.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1500)
  AUTHORS   Tromp,G., Kuivaniemi,H., Stacey,A., Shikata,H., Baldwin,C.T.,
            Jaenisch,R. and Prockop,D.J.
  TITLE     Structure of a full-length cDNA clone for the prepro-alpha1(I)
            chain of human type I procollagen
  JOURNAL   Biochem. J. 253, 919-922 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         84  >  1500     alpha-1 collagen type I
                                 /hgml_locus_uid="LG0047H" /nomgen="COL1A1"
                                 /map="17q21.3-q22"
    sigp         84      149     alpha-1 collagen type I signal peptide
    matp        567  >  1500     alpha-1 collagen type I
    matp        618  >  1500     alpha-1 collagen type I alpha chain
BASE COUNT      257 a    488 c    488 g    267 t
ORIGIN      
        1 ggcacgcgga gtgtgaggcc acgcatgagc ggacgctaac cccctcccca gccacaaaga
       61 gtctacatgt ctagggtcta gacatgttca gctttgtgga cctccggctc ctgctcctct
      121 tagcggccac cgccctcctg acgcacggcc aagaggaagg ccaagtcgag ggccaagacg
      181 aagacatccc accaatcacc tgcgtacaga acggcctcag gtaccatgac cgagacgtgt
      241 ggaaacccga gccctgccgg atctgcgtct gcgacaacgg caaggtgttg tgcgatgacg
      301 tgatctgtga cgagaccaag aactgccccg gcgccgaagt ccccgagggc gagtgctgtc
      361 ccgtctgccc cgacggctca gagtcaccca ccgaccaaga aaccaccggc gtcgagggac
      421 ccaagggaga cactggcccc cgaggcccaa ggggacccgc aggcccccct ggccgagatg
      481 gcatccctgg acagcctgga cttcccggac cccccggacc ccccggacct cccggacccc
      541 ctggcctcgg aggaaacttt gctccccagc tgtcttatgg ctatgatgag aaatcaaccg
      601 gaggaatttc cgtgcctggc cccatgggtc cctctggtcc tcgtggtctc cctggccccc
      661 ctggtgcacc tggtccccaa ggcttccaag gtccccctgg tgagcctggc gagcctggag
      721 cttcaggtcc catgggtccc cgaggtcccc caggtccccc tggaaagaat ggagatgatg
      781 gggaagctgg aaaacctggt cgtcctggtg agcgtgggcc tcctgggcct cagggtgctc
      841 gaggattgcc cggaacagct ggcctccctg gaatgaaggg acacagaggt ttcagtggtt
      901 tggatggtgc caagggagat gctggtcctg ctggtcctaa gggtgagcct ggcagccctg
      961 gtgaaaatgg agctcctggt cagatgggcc cccgtggcct gcctggtgag agaggtcgcc
     1021 ctggagcccc tggccctgct ggtgctcgtg gaaatgatgg tgctactggt gctgccgggc
     1081 cccctggtcc caccggcccc gctggtcctc ctggcttccc tggtgctgtt ggtgctaagg
     1141 gtgaagctgg tccccaaggg ccccgaggct ctgaaggtcc ccagggtgtg cgtggtgagc
     1201 ctggcccccc tggccctgct ggtgctgctg gccctgctgg aaaccctggt gctgatggac
     1261 agcctggtgc taaaggtgcc aatggtgctc ctggtattgc tggtgctcct ggcttccctg
     1321 gtgcccgagg cccctctgga ccccagggcc ccggcggccc tcctggtccc aagggtaaca
     1381 gcggtgaacc tggtgctcct ggcagcaaag gagacactgg tgctaaggga gagcctggcc
     1441 ctgttggtgt tcaaggaccc cctggccctg ctggagagga aggaaagcga ggagctcgag
//
LOCUS       MACTRIIS1     280 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  M.mulatta triosephosphate isomerase gene, exon 1.
ACCESSION   M37561 M35724
KEYWORDS    triosephosphate isomerase.
SEGMENT     1 of 7
SOURCE      M.mulatta DNA.
  ORGANISM  Macaca mulatta
            Unclassified.
REFERENCE   1  (bases 1 to 280)
  AUTHORS   Old,S.E. and Mohrenweiser,H.W.
  TITLE     Nucleotide sequence of the triosephosphate isomerase gene from
            Macaca mulatta
  JOURNAL   Nucleic Acids Res. 16, 9055-9055 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        156  +   270     triosephosphate isomerase, exon 1 (E.C.
                                 5.3.1.1)
    IVS         271  >   280     triosephosphate isomerase intron A
    signal       46       49     CAAT box
    signal       93       98     TATA box
BASE COUNT       50 a     77 c    111 g     42 t
ORIGIN      
        1 gcaggagggc gggcgggggg cagggtctcc gggggaccgg tcgggccatg gcggaggacg
       61 gcgaggaggc ggagttccac ttctcggcgc tctatataag cgggcagtgg cccggctgcg
      121 cgcagacact gaccttcagc gtctcgctcc atgccatggc gccttccagg aagttcttcg
      181 ttggggggaa ctggaagatg aacgggcgga agcagaatct gggggagctc atcggcactc
      241 tgaacgcggc caaggtgccg gcagacaccg gtaagcctcg
//
LOCUS       MACTRIIS2     144 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  M.mulatta triosephosphate isomerase gene, exon 2.
ACCESSION   M37562 M35724
KEYWORDS    triosephosphate isomerase.
SEGMENT     2 of 7
SOURCE      M.mulatta DNA.
  ORGANISM  Macaca mulatta
            Unclassified.
REFERENCE   1  (bases 1 to 144)
  AUTHORS   Old,S.E. and Mohrenweiser,H.W.
  TITLE     Nucleotide sequence of the triosephosphate isomerase gene from
            Macaca mulatta
  JOURNAL   Nucleic Acids Res. 16, 9055-9055 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +   134     triosephosphate isomerase, exon 2 (E.C.
                                 5.3.1.1)
    IVS     <     1       10     triosephosphate isomerase intron A
    IVS         135  >   144     triosephosphate isomerase intron B
BASE COUNT       33 a     38 c     39 g     34 t
ORIGIN      
        1 ttgccctcag aagtggtttg tgctcccccc actgcctata tcgacttcgc ccggcagaag
       61 ctagatccca agattgctgt ggctgcacag aactgctaca aagtgactaa tggggccttt
      121 actggggaga tcaggtgaga tcga
//
LOCUS       MACTRIIS3     105 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  M.mulatta triosephosphate isomerase gene, exon 3.
ACCESSION   M37563 M35724
KEYWORDS    triosephosphate isomerase.
SEGMENT     3 of 7
SOURCE      M.mulatta DNA.
  ORGANISM  Macaca mulatta
            Unclassified.
REFERENCE   1  (bases 1 to 105)
  AUTHORS   Old,S.E. and Mohrenweiser,H.W.
  TITLE     Nucleotide sequence of the triosephosphate isomerase gene from
            Macaca mulatta
  JOURNAL   Nucleic Acids Res. 16, 9055-9055 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +    95     triosephosphate isomerase, exon 3 (E.C.
                                 5.3.1.1)
    IVS     <     1       10     triosephosphate isomerase intron B
    IVS          96  >   105     triosephosphate isomerase intron C
BASE COUNT       23 a     23 c     35 g     24 t
ORIGIN      
        1 cttcctttag ccctggcatg atcaaagact gcggagccac gtgggtggtc ctggggcact
       61 cagagagaag acatgtcttt ggggagtcag atgaggttag tagcc
//
LOCUS       MACTRIIS4     153 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  M.mulatta triosephosphate isomerase gene, exon 4.
ACCESSION   M37564 M35724
KEYWORDS    triosephosphate isomerase.
SEGMENT     4 of 7
SOURCE      M.mulatta DNA.
  ORGANISM  Macaca mulatta
            Unclassified.
REFERENCE   1  (bases 1 to 153)
  AUTHORS   Old,S.E. and Mohrenweiser,H.W.
  TITLE     Nucleotide sequence of the triosephosphate isomerase gene from
            Macaca mulatta
  JOURNAL   Nucleic Acids Res. 16, 9055-9055 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +   143     triosephosphate isomerase, exon 4 (E.C.
                                 5.3.1.1)
    IVS     <     1       10     triosephosphate isomerase intron C
    IVS         144  >   153     triosephosphate isomerase intron D
BASE COUNT       39 a     29 c     49 g     36 t
ORIGIN      
        1 ttctcaacag ctgattgggc agaaagtggc ccatgctctg gctgagggac tcggagtaat
       61 cgcctgtatt ggggagaagc tagatgaaag ggaagctggc atcactgaga aggttgtttt
      121 cgagcagaca aaggtcatcg caggtatctc ttg
//
LOCUS       MACTRIIS5     102 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  M.mulatta triosephosphate isomerase gene, exon 5.
ACCESSION   M37570 M35724
KEYWORDS    triosephosphate isomerase.
SEGMENT     5 of 7
SOURCE      M.mulatta DNA.
  ORGANISM  Macaca mulatta
            Unclassified.
REFERENCE   1  (bases 1 to 102)
  AUTHORS   Old,S.E. and Mohrenweiser,H.W.
  TITLE     Nucleotide sequence of the triosephosphate isomerase gene from
            Macaca mulatta
  JOURNAL   Nucleic Acids Res. 16, 9055-9055 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +     7  +    92     triosephosphate isomerase, exon 5 (E.C.
                                 5.3.1.1)
    IVS     <     1        6     triosephosphate isomerase intron D
    IVS          93  >   102     triosephosphate isomerase intron E
BASE COUNT       26 a     25 c     31 g     20 t
ORIGIN      
        1 ctgcagataa tgtgaaagac tggagcaagg ttgtcctggc ctatgagcct gtgtgggcca
       61 ttggtactgg caagactgca acgccccaac aggtaaccga cg
//
LOCUS       MACTRIIS6     108 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  M.mulatta triosephosphate isomerase gene, exon 6.
ACCESSION   M37571 M35724
KEYWORDS    triosephosphate isomerase.
SEGMENT     6 of 7
SOURCE      M.mulatta DNA.
  ORGANISM  Macaca mulatta
            Unclassified.
REFERENCE   1  (bases 1 to 108)
  AUTHORS   Old,S.E. and Mohrenweiser,H.W.
  TITLE     Nucleotide sequence of the triosephosphate isomerase gene from
            Macaca mulatta
  JOURNAL   Nucleic Acids Res. 16, 9055-9055 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11  +    98     triosephosphate isomerase, exon 6 (E.C.
                                 5.3.1.1)
    IVS     <     1       10     triosephosphate isomerase intron E
    IVS          99  >   108     triosephosphate isomerase intron F
BASE COUNT       26 a     24 c     32 g     26 t
ORIGIN      
        1 cttgttctag gcccaggaag tacatgagaa gctccgagga tggcttaagt ccaacgtctc
       61 tgaagcagtg gctcagagca cccgtatcat ttatggaggt gagtggct
//
LOCUS       MACTRIIS7     632 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  M.mulatta triosephosphate isomerase gene, exon 7.
ACCESSION   M37572 M35724
KEYWORDS    triosephosphate isomerase.
SEGMENT     7 of 7
SOURCE      M.mulatta DNA.
  ORGANISM  Macaca mulatta
            Unclassified.
REFERENCE   1  (bases 1 to 632)
  AUTHORS   Old,S.E. and Mohrenweiser,H.W.
  TITLE     Nucleotide sequence of the triosephosphate isomerase gene from
            Macaca mulatta
  JOURNAL   Nucleic Acids Res. 16, 9055-9055 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    +    11      129     triosephosphate isomerase, exon 7 (E.C.
                                 5.3.1.1)
    IVS     <     1       10     triosephosphate isomerase intron F
BASE COUNT      122 a    201 c    154 g    155 t
ORIGIN      
        1 cccttcccag gctctgtgac tggggcaacc tgcaaggagc tggccagcca gcctgacgtg
       61 gatggcttcc ttgtgggtgg tgcttccctc aagcccgaat tcgtggacat catcaatgcc
      121 aaacaatgag ccccatccac cttccctacc cttcctgcca agccagggac taagcagccc
      181 agaagcccag tcactgcccc ttccccgcac atgcttctga tggtgtcgcc tgctccttcc
      241 tgtggcctca tccaaactgt accttccttt actatcttca gcctgtaatg gttgggacca
      301 ggccaatccc ttctccactt actataatgg ttggaactaa atgtcaccaa ggtggcttct
      361 cctcggctga gagatggaag gggtgggatt tgctcctggg ttccctaggc cctagtgagg
      421 gcaggagaga agccatcctc tcccttctta cacagtgagg ccgagatcct cccctccgaa
      481 ggcaggagtg ccgccctctc ccatggtgcc catgcctgtg tgctgtgtac gtgaaccacc
      541 cacatgtgag ggaataaaca cctagcacta ggtcttgtgg tctgtctgcc ttcactggac
      601 ttgcccagat aatcttcctt tttgaggcag ct
//
LOCUS       MUS2NF3MK    2470 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse zinc finger protein (mkr3) mRNA, 3' end.
ACCESSION   M36514
KEYWORDS    mkr3 gene; zinc finger protein.
SOURCE      Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 15.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2470)
  AUTHORS   Chowdhury,K., Rohdewohld,H. and Gruss,P.
  TITLE     Specific and ubiquitous expression of different Zn finger protein
            genes in the mouse
  JOURNAL   Nucleic Acids Res. 16, 9995-10011 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1     1287     zinc finger protein (mkr3; AA at 1)
BASE COUNT      695 a    501 c    488 g    786 t
ORIGIN      
        1 ataaaaactc acactggtga gaagcccttc cggtgtaaag tatgtgcaag gacttttaga
       61 aattcctcat gccttaagac taactttcga attcacactg gaataaaacc atacaaatgt
      121 aattactgtt ggaaagcctt cactgcacgc tcaggcctta ctaagcatgt actaattcac
      181 aatggtgaga agccctatga gtgtaaggag tgtgggaaag ccttcagtac atcttccggc
      241 cttgttgaac atataagaat tcatacagga gagaagccct ttgaatgtta ccagtgtggg
      301 aaagccttgg tccattcctc atcccttgtt ggacatttaa gaactcacac tggagagaaa
      361 ccctttgagt gtaatcagtg tgacaaaact tttacacgat cttcttatct tcgtattcat
      421 atgcgaactc acaccggaga gaaaccatat gaatgtaaag agtgtgggaa aactttccct
      481 gagcgctcat gccttactaa acacataagg acacacactg gtgaaaggcc ctatgaatgt
      541 aaggagtgtg acaaaggctt tattagcttt gctcagctta ctgtacacat aaaaactcat
      601 agttctgaga gaccctttca gtgtaaggta tgcacaaaat cttttagaaa ctcttcatcc
      661 cttgagaccc acttccgaat tcacactgga gtaaaaccct ataaatgcag ttactgtggg
      721 aaagccttca ctgctcgttc aggccctacc atccatttac gcaatcacac tggggagaaa
      781 tcctatgcat gccaagagtg tggaaaggcc tttagcactt cctcaggcct tattgcacat
      841 ataagaagtc acaaaggaga gaaacccttt gaatgtgacc actgtgggaa ggcctttcct
      901 tcttcatctt atcttaatgt gcatttgaaa attcacactg gggaaaaacc ctttcggtgt
      961 acggtatgtg ggaaaacatt tacgtgttct tcttaccttc cggttcacat gcgaactcac
     1021 actggaggga ggccttttcg gtgtataata tgtggaaggt catttttgtg gtcctcgtac
     1081 cttagagttc acatgcgaat tcacactgga gagaaaccct atgtatgtca gtactgtgga
     1141 aaagctttta cagagcactc aggccttaat aaacatttac ggaaacacac aggagagaaa
     1201 ccatatgaat ataaggaatg tggggaaaac ttcactactt ctgctgatgc taatgaacat
     1261 gaaactcccc actggggaga cctttgaatg taagggatca gaaaattgat agccgttgac
     1321 catcatttgc aatcaaaatt tatagtctat acaagtctta ttgataagca acgtataaaa
     1381 aacattcggt tgctatgtgc tgtaaaaaca actcattctg aggatgctct gagtttaagg
     1441 attgtgggag aaatcaaaat ttcatgatat atgtatgaac tgaaggtttg taatataggg
     1501 aacgaaaacc tttagtgttt ctatgagtgt tgatgcaaga catggagaaa tgtttctatg
     1561 ttcacgtgtg gtctcacagt tcgagaaatg tctttctgaa gtttgggaag taactttaca
     1621 tttgaacaca agctacatgc gtgtaagagg ggatttgctt cagtgtcttg gatggcaaca
     1681 cttggctttc ttgggtgctt ctctatgaac tttgaatatt tctccatttg tgtattttta
     1741 tgacctcaat tccattgtta gttttccttt ttaaaatatt cctgtttgat gtgattgaaa
     1801 cttgttttta tctagtatct gttatatgtc ccatgttggc attgtgaacc taatgacaca
     1861 aactcccatc tcctcactac ttgttagttt atttgttgtt gtttttttcc cctcttttct
     1921 ttggagacag tctcactttc tggcctagta taacatgaaa gttatgtagc ctagactggt
     1981 ccctagctca tggcagtcta gacccagcct tctgagtgca tagattacag gtgcaagcca
     2041 ccatacctgc tctttctact gttttaagta taaacttcac tagcactgaa aaaaccatca
     2101 tgttttacaa ttgtgttcag gttcttaaca caagtaggtt tggatctgac caatacaatc
     2161 ccttactctg agttctaagt ctttggtagc cattccattc tctgcttgga gttcagcttt
     2221 tctttaggct ttgcttctta gatcatgtag tatctgtatt aactctactt agctttttta
     2281 tttttgtgac agggtcttgc tgtagctcaa actgacctca aactcgcaaa cctttttgcc
     2341 tcagacttac tgctacagtt ataggcatgt gccttcatgc ctagctggcg tatatcaatc
     2401 tacataatgt tctcaaggtg gaactgtgtt atggtcagtt attttaagac atctaattat
     2461 cctttggcct
//
LOCUS       MUS2NF4MK    1258 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse zinc finger protein (mkr4) mRNA, partial cds.
ACCESSION   M36515
KEYWORDS    mkr3 gene; zinc finger protein.
SOURCE      Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 14.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1258)
  AUTHORS   Chowdhury,K., Rohdewohld,H. and Gruss,P.
  TITLE     Specific and ubiquitous expression of different Zn finger protein
            genes in the mouse
  JOURNAL   Nucleic Acids Res. 16, 9995-10011 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >  1258     zinc finger protein (mkr4; AA at 1)
BASE COUNT      356 a    335 c    321 g    246 t
ORIGIN      
        1 aaagtccttt ttccaagtat cttctctgtt caggcaccgg agatccacac tggagaaaag
       61 ctctacgact gtagccattg tgggaaaggc ttctcttaca actcagacct caggatacat
      121 cagaagatcc atacaggaga gaaacgccat ggctgcgtcg actgtggcaa agccttcaca
      181 caaaagtcca cactgaggat gcaccagaag atccacacgg gcgagagggc ctatgtctgt
      241 attgagtgtg gacaggcttt tatccagaag acacacctgg tcgcacaccg aagaattcac
      301 tcaagagtga ggccctgtgt atctctcgac cgtgcgaaac ccttcagcag cgctcccaac
      361 ctccttccgc gtaagaaagt tcaaatgaga gagaaatcgt ccatctgcgc ggagtgtggg
      421 aaggccttta cctacaggtc ggagctaatc attcatcaga gaactcacac gggagagaag
      481 ccttatcagt gcggggactg tggcaaagcc ttcacccaga agtcagccct caccgtgcac
      541 cgaagaatcc acacggggga gaagtcgtat gtgtgtgtga agtgcgggct agccttcgtg
      601 caaagggcac acttggatgc acaccaggtg attcacactg gagaaaaacc ttaccagtgt
      661 ggtcattgtg ggaaattctt cacttccaag tcgcaactcc acgtgcacaa gcggattcac
      721 acgggggaaa aaccctatgt gtgcagtaac tgtgggaagg cctttgccaa caggtcaaat
      781 ctcatcacac atcagaaaac tcatacaggg gagaaagcct atgtctgtgc aaggtgtggg
      841 aaagcattca ctcagaggtc agaacctgtt acacaccaga ggatacatac cggagagaag
      901 ccatatgggt gtagaccctg cgggaaagcc tttacccaga aatcacacct cagtatacac
      961 gagaagattc acaccggaga gagacagtac ggatgccgag actgtgggaa agccttcaac
     1021 cagaaatcca tactcatcgt gcaccagaaa atccacacgg gggagaaacc gcaggtgtgt
     1081 gccgagtgcg ggcgagcttt catccgcaag tccaacttca tcacccatca gcgcatccat
     1141 actggcgaga aaccttatgg atgcaccgac tgcggcaagt cctttacctc caagtcgcag
     1201 ctcctagtgc accggccaat ccacaccggc gagaaacctt acgtgtgtgc cgagtgtg
//
LOCUS       MUS2NF5MK    2433 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Mouse zinc finger protein (mkr5) mRNA, 3' end.
ACCESSION   M36516
KEYWORDS    mkr3 gene; zinc finger protein.
SOURCE      Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 10.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2433)
  AUTHORS   Chowdhury,K., Rohdewohld,H. and Gruss,P.
  TITLE     Specific and ubiquitous expression of different Zn finger protein
            genes in the mouse
  JOURNAL   Nucleic Acids Res. 16, 9995-10011 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1     1845     zinc finger protein (mkr5; AA at 1)
BASE COUNT      752 a    516 c    536 g    629 t
ORIGIN      
        1 cagcagtctg tacaggaagc ccaggatctg cttccaaggc aggactcaca tgctgaacgg
       61 gtaacaggca gaacttggag cactaaactt gagtgctcca ctttcagaga tcaggattct
      121 gagtgtacgt ttgaaaggaa tgagcaagag acagtcactc cgaacagagc cttctcagag
      181 gggagagatg gtatgtgtat cgaatctgga agatggtttc atttgaacag ttcagacgag
      241 agaagtcata actgtgactc aggtaaaagt ttctcctcaa atccagtagt tgtaaaggaa
      301 accggaatct gttcaggaaa aaaacttttc caatgtaatg aatgtaagaa aactttcacc
      361 cagagctcct cccttaccgt tcaccagaga attcatactg gagagaagcc ctataaatgc
      421 aatcagtgtg ggaaggcctt cagtgacggg tcctccttcg cacggcacca gaggtaccat
      481 acaggcaaga agccgtacga gtgcccggag tgtggcaaag ctttcataca gaacacctcg
      541 ctcgttcgtc actggaggta ctatcacact ggggagaaac ccttcgattg catcgactgt
      601 gggaaagcct tcagtgacca catagggctt aatcaacaca ggaggattca cactggagag
      661 aaaccataca cgtgtgaagt gtgtcacaaa tcctttaggt acggctcatc cctcactgtg
      721 caccaaagga ttcatactgg agagaaaccg tacgagtgtg agatttgcag aaaagccttc
      781 agccaccatg catccctcac tcagcatcaa cgagtgcatt ctggagaaaa gccttttaaa
      841 tgtaaagagt gtgggaaagc ttttaggcag aatatacacc ttgctagtca ctggaggatc
      901 cataccgggg agaagccctt tgagtgtggg gaatgtggga aatctttcag catcagctca
      961 cagcttgcca cgcatcagag aattcataca ggagagaagc cctttgagtg taaggtttgt
     1021 aggaaagctt ttaggcagaa tatacacctt gctagtcact ggagaatcca taccggggag
     1081 aagccctttg agtgtgggga atgtgggaaa tctttcagca tcagctcaca gcttgccacg
     1141 catcagagaa ttcatacagg agagaagcct tatgaatgta aggtttgtag gaaagcattt
     1201 acgcagaagg ctcaccttgc acaacatcag aagactcata cgggggagaa gccatacgag
     1261 tgcaaggaat gtggcaaggc cttcagccag accactcacc tcattcaaca ccagagggtt
     1321 catactggag agaagcccta taaatgcctg gaatgtggga aggcctttgg tgataactca
     1381 tcctgtactc aacatcagag acttcacact ggcctgaggc cttatgagtg tgtggagtgt
     1441 ggaaagacat tcaagaccaa gtcatccctt atctgttatt gtagatgtca cacgggagag
     1501 aaaccttatg agtgcagtgc atgtggtaaa gcctttagcc actggcagtc ccttagtgta
     1561 catcagagaa ttcattcggg gaagaaaccg tatgaatgta aggaatgtcg gaaaaccttc
     1621 atccagattg gacaccttaa ccaacataag agagtccata ccggggagag aacgtacaac
     1681 tacaagaagg gcaggagggc cttcaggcag actgcacact ttgctcacca tcagcagatt
     1741 cattccggaa agtcacctgc tcaccactct ttgccttcca catcgaatcc tgtggatctc
     1801 ttctccaaat tcgtctggaa tccatcctca ctgccatcat cataacctca tattttcacc
     1861 acacttgttt atcaacttgt ctctttgccc ctttgttcat tcttgtccct tatgagtcct
     1921 tcatattaca accaggttgg attatttttg gagtgtaaat gtaattaatt tgctcattta
     1981 aagcttatat tcactgggga aaaaaaaaga agttagctga aacataaaga aacctcatga
     2041 atctaagaaa tgttccatat cttgctctgg gtagtatatc tgtaaatctt agcaagccct
     2101 gtatgtagta ttttaagaac tttctgactt tgttatcctt taatttttaa tgctgttata
     2161 ttaagaaaaa cacatttata ataaatatgg tattttaaat aaaaagtatt atatcttata
     2221 gctatttttc agataaacaa aaacaaattt gatacaaagt aaaatatgta taaggttggt
     2281 gttccagaac aaaatctgca ggaatattta cagcatggta taaagcaacg aaaccaggtc
     2341 atccaccaac attttgcctt agattttcta atatttcagc atatgagtag cacagtatta
     2401 aaacactata ggtagtagca tttcacagaa ttc
//
LOCUS       ORCHRCOM     1579 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  O.orca highly repetitive element DNA.
ACCESSION   M36451
KEYWORDS    repetitive element.
SOURCE      O.orca DNA.
  ORGANISM  Orcinus orca
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Odontoceta; Delphinidae; Orcininae.
REFERENCE   1  (bases 1 to 1579)
  AUTHORS   Widegren,B., Arnason,U. and Akusjaervi,G.
  TITLE     Characteristics of a conserved 1,579-bp highly repetitive component
            in the killer whale, Orcinus orca
  JOURNAL   Mol. Biol. Evol. 2, 411-419 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt           1     1579     highly repetitive element
BASE COUNT      368 a    358 c    333 g    520 t
ORIGIN      
        1 atcttatgat gtttcttttt ttgtgtgact tatttcatgt agaatcatcg tacctgaatc
       61 cactcattat gctgctacgg gcctgatgac atagatttca ttgctgagtg atattgcatt
      121 gtacgtaagt accacaaagt ctttatccat ttttcgcttt ctgcgatact gaacttgtac
      181 cataaacgag gttcttgtaa acagagccgt cacaaacttt gggatggctg tgtctttttc
      241 attttaattt caactaagct ataggaccat aagtggaagt gccctaggct ctgttgcttt
      301 gttgtttaga tgtttcagga aacaccatac acttctccag agtggctgtt ggcaatttac
      361 atcccgccca tcagcataac aaggctccca attctccatg gcctgtcccg cctttctgga
      421 ttttacactt ttttcagatg gcccttttga ccgtggggca gtgagacttc atgtagtgca
      481 gatttccttt gcaagcttgc ttggttggcc aaaaagttcg tatgcgtttc tttctgaata
      541 tattcaggaa aaaacgcata agaccttttt ggccaagtgc atcattgtgg acgttctgcc
      601 tgttttccta tgctttacat gcaattccag tctacctcct gaaatcggtg tctggcaatt
      661 ttgccccgct ttcaagtcct cttggcagcc ttacttcaat atatttttgg acgatagctg
      721 tcatttataa ctctgcaggt ttgtgaatca cagtgcccct gagctccttt cttcaactgg
      781 ctttcttgtg agctggctgc aacaccgcag gattgcttga ggccctcgtg tggttccggc
      841 agggcacgct gagccttttg ttaattcctc ttcctggtgg gaaatgaggg ttaaatttgc
      901 ccgtccagac acctccagct agtctctcat tgattctccc tattcctgtt cattttccgc
      961 agaaattgca aactgggcca aacaggaggt taaaggcact gactctccaa gtggggagag
     1021 tgttagtaaa gcgtctggaa tcttgcaccc gagtaccagg ggacgaaaac tgagacacat
     1081 ttgaacacgt ttcccgatca cacggtggat catactctgg gttccacatg catgttttag
     1141 ctgaaggaag aatcccttaa acctggagag ttgagaccca tggaatgggt accatgcaat
     1201 atgacttcaa agggtctgta tttgctaact gaaactcact aatcctatca ctgctgcgtt
     1261 tatgccactg tacacacgct tgattccctt tcggagacat ataagtccac aggttttaag
     1321 attcttacta gtcaggtata ttcttaggag tttaatatgg agtgttgagt ccactcgatc
     1381 agcaaggagt agctcttgtc tattaaatat ttggcttatg gaacggtatc tgtgctgatt
     1441 tcaatctctg gttttatgca gcacctcaac tcacctttcc ccttaagcaa gcataagttg
     1501 ctgttctaca tttgagaccc tgttctgttt tgtaattcag ttcctgtgta gccaagttta
     1561 cattccgtgt agtagtgat
//
LOCUS       RICADH2A     3779 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  Rice alcohol dehydrogenase (adh2) gene, complete cds.
ACCESSION   M36469
KEYWORDS    adh2 gene; alcohol dehydrogenase.
SOURCE      Rice DNA.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 3779)
  AUTHORS   Xie,Y. and Wu,R.
  TITLE     Molecular analysis of an alcohol dehydrogenase-encoding genomic
            clone (adh2) from rice
  JOURNAL   Gene 87, 185-191 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        639      669     alcohol dehydrogenase (adh2), exon 1
                768      904     alcohol dehydrogenase, exon 2
               1017     1063     alcohol dehydrogenase, exon 3
               1672     1998     alcohol dehydrogenase, exon 4
               2114     2195     alcohol dehydrogenase, exon 5
               2301     2376     alcohol dehydrogenase, exon 6
               2471     2529     alcohol dehydrogenase, exon 7
               2628     2723     alcohol dehydrogenase, exon 8
               2854     3012     alcohol dehydrogenase, exon 9
               3149     3247     alcohol dehydrogenase, exon 10
    IVS         670      767     adh2 intron A
    IVS         905     1016     adh2 intron B
    IVS        1064     1671     adh2 intron C
    IVS        1999     2113     adh2 intron D
    IVS        2196     2300     adh2 intron E
    IVS        2377     2470     adh2 intron F
    IVS        2530     2627     adh2 intron G
    IVS        2724     2853     adh2 intron H
    IVS        3013     3148     adh2 intron I
BASE COUNT     1002 a    799 c    867 g   1111 t
ORIGIN      
        1 taatcattac caacagatat cttagaaaaa aaagctatta ttcctcaaaa ggaaaaaaaa
       61 ggctattatt ccttacaaga aaaaagaaga aggcaatgca ttatgctttt ctaaatgtct
      121 agattaattc atagttggac agatatgacc gcatatcaac gttaattact gccgatttga
      181 ctgttgatta gacatatctt agcactaaac ctcactgaaa attcggagaa tttactattg
      241 gagagaagaa aaggcatcca tcctccttgt cttcactctg caatctgcat catcatccgc
      301 ggaaaccgga aaaccaccga agctcgaaaa ccggcggagg caagagcagc tagcagcatg
      361 cagcagacgc agagccttcc tcttcttcca agcaaaaaat attcagatga cgcctcctcc
      421 tctttccccc ctcctcccac cttgcttggt tcctttcagg cgtacttgcc cgcgagactc
      481 ctaccactct ctgatcccta tatatacacc ccctcctccc ctctcgtctc ctcaccagaa
      541 ttccaagcaa cgaactgcga gtgattcaag aaaaaagaaa acctgagctt tcgatctcta
      601 cggagtggtt tcttgttctt tgaaaaagag ggggattaat ggcgacaggg aaggtgatca
      661 agtgcaaagg tccgttcttg ctgttcttgg agcggttcgt gagaaaatta gtttgatgtt
      721 ttgtggggag gagaagctga ttagatcggg ttttggggtt tttgcagcgg cggtggcatg
      781 ggaggccggg gaagcgctgt cgatcgagga ggtggaggtt gcgccgccgc agcgcatgga
      841 ggtccgcgtc aagatcctct acaccgccct ctgccacacc gacgtctact tctgggaggc
      901 caaggtaatt aatcagcatc tgccatggct gagctttctc accttctcaa gcttgcaaga
      961 atgatcagtg actaactaac ttggtgtgtg tgtggtttct gctttgatct tcgcaggggc
     1021 aaacacctgt tttccctagg atcttgggcc atgaagctgg agggtatgtg taatctgaac
     1081 caaccctgtc caattcttgt tcttcttcta ctacactgtt tagtggttca gtctattctg
     1141 ttcttgtatt gttttttttt tgttgaaaat ggtttttggt gtttagtgga atttagtgtg
     1201 tacttgtacg tcataggttg tttggtttat ggtattcggt tactaactat agttagtgtg
     1261 cacgttgggg agtcctgatt agtcagtggc attttcatta agttcagtaa agaaatctac
     1321 caagtgtcca attgtgcatg atattttacc tggtttggat aaatcatatg aaggaatcaa
     1381 agaacagcgt cattgatgac atatacaacg acaattggat tcctaggata atatactttt
     1441 cttcatgtgc tcctctatgc tagtgcttta ttactttctc tcctttgatg ttttgaggta
     1501 ctcaaaggaa aaagacacac tgaatcatta tccatcttat gaactgttta aacttgactt
     1561 acttatcact tctctgctca aatggacccc tttttttttt catttccatg gaagatatat
     1621 gtgtaatcct aacaatcaaa ttgatctaac aaaacaattt tttgctcaca gcattgtgga
     1681 gagtgtggga gagggtgtga ccgaactcgc cgcgggcgac catgtcctcc cggtgttcac
     1741 cggcgagtgc aaggagtgtg atcactgcaa atcggaggag agcaacatgt gtgacctcct
     1801 caggatcaac gtcgaccgcg gcgtcatgat cggcgacggc aagtcccgat tcaccatcaa
     1861 ggggaagccc atcttccact ttgttggcac ttccaccttc agcgagtaca ccgtcatcca
     1921 tgtcggctgc ctcgcgaaga tcaacccgga ggcgcctctc gacaaggtct gcattctcag
     1981 ctgcggtttc tcgtccaggt aagatgatgg atggcttcat gattatgatg acgtagtagt
     2041 tttcacagtg aaatttggaa aggtcttttt ttttttggat gcattaagtt gatgggtttt
     2101 gctgttcttg caggtttggt gcgacagtga atgtcgcgaa accgaaaaag ggccagaccg
     2161 tagctatttt cggtcttgga gctgtaggtc ttgctgtaat gggcacacgc acaaagatat
     2221 attcatttcc atttgatttg cagattgttg agatcttgaa actcaaactg atgaatgttt
     2281 gtaaaattac atgattatag gctatggaag gtgccaggct gtctggagca tcgaggatca
     2341 ttggtgtgga cctgaaccct gcgaaattcg aacaaggtac aatcttttat catctttgag
     2401 tgaaaattac atgtttggta tatgttctgg attgaacact gaactctgca acttgttttc
     2461 atctcttcag ctaagaagtt tggctgcact gacttcgtaa accccaagga ccacagcaag
     2521 ccagtgcagg tacatcacat tgcctcacta atttctctta ttatcacaca ctgttcatga
     2581 catggagtgt acatacttaa acatcaactc tgcaatggtg atctcaggtg ctgattgaga
     2641 tgaccaacgg tggactcgac agggccgttg aatgcacggg caacatcaac gccatgatat
     2701 cctgcttcga atgtgtccat gatgtcagtc tctttcttac aaacatgaaa catgatagta
     2761 acaaattgaa gatgatcatt tcagcaatgc ctccagttct gctgtaaact gtatcctgaa
     2821 aactaagagt ggcaatctga aacctgaatc cagggatggg gcgttgcggt gctggtcggc
     2881 gtgaaggatg atgtgttcaa gacccacccg atgaacttcc tcaacgagaa gacgctgaag
     2941 gggacgttga tcttcggtaa ctacaagccg cgcaccgacc tgcccaacgt cgtcgagctg
     3001 tacatgaaga aggtaaaaat aattgcagaa cagaatcaga tttgcaacct gaatgtgcat
     3061 tgttcagaaa tctatggtca gtgattccaa tgagccactg attgatcact attttatgct
     3121 aatgatttct ggtgtttgat gtctgaagga gctggagctg gagaagttca tcacgagcgt
     3181 gccgttctcg gagatcaaca cggcgttcga tctcatgctc aaggggagag tctccgctgc
     3241 gtcatgagta tgtatgagta gagatgatga tgatctatct ctctctctct ctctctgcag
     3301 tttcagtgca tacaagaaac caaaaatgaa agcaaaaaat ggatgagaag tttgaggttt
     3361 tccatttaag tttaataatg gtcgtgagac agggcttgtg tggagtcagc tctgtttact
     3421 ccctcctttc atggcttgat cttgtttatt gtcagcattg tggcttagca gattttcacc
     3481 agttttaccc tgtaaattag taccattctg atcaatcgta ataaactact agcagtgtga
     3541 ggtactattg ctggaactga attgattgtg cagtaaagag gtgaatagca gccagtatga
     3601 aggttttaca atttgaacat cgaaacgaaa tccacacgaa attcataaat ttcatcgcat
     3661 cttcgtgaag cttaccagat tctatcgatt cgatctagct ccaataggtc gatgggagca
     3721 tctattgatt ttttcgacag acgaaaattt tgcaagggac tagctaggga gttgtcgac
//
LOCUS       YSCCTA1A     2785 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae catalase A (CTA1) gene, complete cds.
ACCESSION   M36510
KEYWORDS    catalase A.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2785)
  AUTHORS   Cohen,G., Rapatz,W. and Ruis,H.
  TITLE     Sequence of the Saccharomyces cerevisiae CTA1 gene and amino acid
            sequence of catalase A derived from it
  JOURNAL   Eur. J. Biochem. 176, 159-163 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        820     2367     catalase A (CTA1)
BASE COUNT      917 a    529 c    571 g    768 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcttag aaggtgaaga aatagtacta gattcatatg attgttataa taatggtgaa
       61 ttgttgcctc aactaatact tttggtccaa atcttgacaa ttctttgcca aattccaggt
      121 ttatgcaaac tggacataaa agcaatggaa aggcaagtgg agagaattgt aaagaagtgt
      181 ttacaattga tagaaggtgc ccgcgccact acaaactgta gtgccacatg gaaacgttgt
      241 attatgaagc gtctagccga ttaccccata aaaaagtgcg tttctatcga aaaaccttcg
      301 aaaggaaact cattaacaag ggaagaacta agagatgtta tggctcggag agttttgaaa
      361 agcgaaatag attcgctgca agtttgtgaa gaaaccatcg acaagaatta caaggttatt
      421 cctgatgaaa agctgctaac taatatttta aagagaaagt tgacagagga agaaaaaagc
      481 tctgtcaaac gtccttgcgt gaagaagtga gcggttgttc taaccactat ttaaagccgc
      541 aattagtaat gcaaaaagtt ggccggaatt agccgcgcaa gttggtgggg tcccttaatc
      601 cgaaaaagga cggctttaac aaatataaac tccgaaaatc cccacagtga cagaattgga
      661 gaaacaacca gttttgatat cgccatacat ataaagagat gtagaaagca ttcttcactg
      721 taatgtccaa atcgtacatt tgaatttctt gtaggtttat ttaaaaggta agttaaataa
      781 atataatagt acttacaaat aaatttggaa ccctagaaga tgtcgaaatt gggacaagaa
      841 aaaaatgaag taaattactc tgatgtaaga gaggatagag ttgtgacaaa ctccactggt
      901 aatccaatca atgaaccatt tgtcacccaa cgtattgggg aacatggccc tttgcttttg
      961 caagattata acttaattga ttctttggct catttcaaca gggaaaatat tcctcaaagg
     1021 aatccacatg ctcatggttc tggtgccttc ggctattttg aagtaaccga tgacattact
     1081 gatatctgcg ggtctgctat gtttagtaaa attgggaaaa gaacgaaatg tctaacaaga
     1141 ttttcgactg tgggtggtga taaaggtagt gccgacacgg ttcgtgatcc aagggggttt
     1201 gccaccaaat tctacactga agaaggtaat ttagattggg tctacaataa tacaccggta
     1261 ttctttatca gagacccttc caagttccct cactttatcc acacacagaa gagaaaccca
     1321 caaaccaacc taagggatgc tgacatgttt tgggatttcc tcaccactcc tgaaaatcag
     1381 gtggccattc atcaagtaat gatccttttt tcagaccgtg gtacccctgc caactaccgt
     1441 agtatgcatg gttattctgg tcatacctat aaatggtcca ataaaaacgg agattggcat
     1501 tatgtgcaag ttcatatcaa aaccgatcaa ggaataaaga atttgaccat agaagaggct
     1561 accaaaattg cgggatccaa tccagattac tgccagcagg atttatttga ggctattcag
     1621 aatggaaact atccttcctg gacagtttat attcaaacaa tgaccgaacg cgatgccaaa
     1681 aaattaccat tttcagtctt tgatttgact aaagtatggc ctcaggggca attcccttta
     1741 cggcgtgtgg gtaagattgt tttgaacgag aatccactga acttcttcgc acaggtggaa
     1801 caagctgcct tcgcccccag taccacggtt ccttaccaag aagcaagcgc tgatccagta
     1861 ttacaggccc gtttgttttc atatgcggat gctcatagat acaggctagg tcctaacttc
     1921 catcaaatac ccgtaaactg tccatatgca tctaaatttt tcaatcccgc tatcagagat
     1981 ggaccgatga atgttaacgg caacttcggc tcagaaccta catatttggc caacgataaa
     2041 tcgtacacgt atatccaaca ggacagaccc attcaacaac accaagaggt atggaatggg
     2101 ccagctatcc cttatcattg ggcaacatcc ccaggtgatg tagatttcgt gcaagcaaga
     2161 aatctctacc gcgttttggg taaacaacct ggacagcaaa agaacttggc atataacatc
     2221 ggcattcatg tagaaggcgc ctgtcctcaa atacagcagc gcgtttatga tatgtttgct
     2281 cgtgttgata agggactatc tgaggcaatt aaaaaagtag ctgaggcaaa acatgcttct
     2341 gagctttcga gtaactccaa attttgaaac gctcaagtaa caaatgagtg gcgttgtttc
     2401 cacgacaatt atttatgata gtgtgtattt ttaacacatt ttatttatta caatttatgt
     2461 attttgttat gaattattta tttatacgac taataggtga tgctcatatt ctcgtgttag
     2521 aaagttaaaa aaattatcat ttcacacata ggaaagctcg tcgcgccggg gaaaaagctg
     2581 aggaatctct attattaggg gtaaagttca acacattcag tatgagataa gtgtgtcttc
     2641 aagagagatg cagcactgag tagggaacca agaaacgatg tctgaattac tagatagctt
     2701 tgagacagag tttgcgaaat tttataccga cagcaatctg gaagagacaa accttcaaaa
     2761 atgtcttgat catactcatg aattc
//
LOCUS       YSCILV1A     2766 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae threonine deaminase (ILV1) gene, complete cds.
ACCESSION   M36383
KEYWORDS    ILV1 gene; threonine deaminase.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2766)
  AUTHORS   Kielland-Brandt,M.C., Holmberg,S., Petersen,J.G.L. and
            Nilsson-Tillgren,T.
  TITLE     Nucleotide sequence of the gene for threonine deaminase (ILV1) of
            Saccharomyces cerevisiae
  JOURNAL   Carlsberg Res. Commun. 49, 567-575 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        661      756     ORF1
    pept        778     2508     threonine deaminase (ILV1)
BASE COUNT      797 a    569 c    510 g    890 t
ORIGIN      
        1 gaattcgttc tcattacatg gccgcccacc atgcaccttt acatttcgta tttctttcct
       61 ctctgcgcta tatctttggg aaagatttgc aatcaaggca aacacccgct tgtccccttt
      121 caattcttgt gttatttcaa aaaccatcct ttcataatat atttgtattt ctttcgccct
      181 ggactctacc agttcatatt tagaaccttc cacatatcgt ccaattctgc agcccacatc
      241 tttttccacc acgatacggg aaacagaatg ggtccttgga ttctcgctaa acagtctctc
      301 tattcccctg ttcaaaaccc tcaagatatt tgtttcccgc aacagctgca attgcaattg
      361 atcaatccta tgcgaaaatg ccgagtttat gttattcaag acgcatttta aaaaattcac
      421 tagcggctcc ttgaaattca ttatgtctga tgaatatgaa aaccttttcc tgactaccaa
      481 gactctttaa ctcttctctc tttattgcat attatctctg ctattttgtg acgttcaatt
      541 ttaattgacg cgaaaaagaa aaaataagaa gggcaaaaag aaaaagcgca gcgggtagca
      601 aatttggaat cgcataaaaa gaaaaaaaaa atatcaaaga aaaagagtca tctcaaacat
      661 atgtctgcag atacttcatt atcagctttg aaaacttttg ttgttgctgc tttgagttct
      721 ttcttgtgtg agtgctacaa gccacattta aactaagtca attacacaaa gttagtgatg
      781 tcagctactc tactaaagca accattatgt acggttgttc ggcaaggtaa acagtccaaa
      841 gtgtctggat tgaacctttt gagactaaag gctcatttgc acagacaaca cctgtcacct
      901 tccttgataa aactacactc tgaattgaaa ttggatgagc tgcaaactga taacacccct
      961 gattacgtcc gtttagtttt aaggtcctct gtatacgatg ttattaatga atctccaatc
     1021 tctcaaggtg taggtttgtc ttcccgtcta aacacgaatg tcatcttgaa aagagaagat
     1081 ctattgcctg ttttctcttt caagcttcgt ggtgcctata acatgattgc caagttggac
     1141 gattctcaaa gaaaccaggg tgttattgcc tgttcagctg ggaatcatgc ccaaggtgtg
     1201 gcctttgctg ctaaacactt gaaaatacct gctactatcg ttatgcctgt ttgtacacca
     1261 tctattaagt atcaaaatgt ctcgagatta gggtctcaag tcgtcctata tggtaacgat
     1321 tttgacgagg ctaaggctga atgtgccaaa ttggctgaag agcgtggctt gacgaacatt
     1381 cctcctttcg atcatcctta tgtcattgcc ggtcaaggta ctgtagctat ggaaatccta
     1441 agacaagtac gtaccgctaa taagatcggt gctgtctttg ttcccgtcgg cggtggtggt
     1501 ttaattgctg gtattggtgc ttatttgaaa agggttgctc ctcatatcaa aactattggt
     1561 gttgaaactt acgatgcggc cactttacat aattccttgc aacgcaacca gagaactcct
     1621 ttacctgtgg tgggtacttt tgccgatggt acgtctgtgc gtatgattgg tgaagaaaca
     1681 tttagagtcg cccaacaagt ggttgatgaa gttgttcttg ttaacactga cgaaatctgt
     1741 gctgcagtaa aggatatttt tgaagatact agaagtattg tagaaccatc tggtgccctt
     1801 tcagtagccg gtatgaagaa atacatctct accgtacatc cagaaattga ccacactaaa
     1861 aacacctatg ttcccatcct ttctggtgct aacatgaact ttgatagatt aagatttgtt
     1921 tccgaacgtg ctgttcttgg tgaaggaaag gaagtcttca tgttagttac tttacccgac
     1981 gtccctggtg cgttcaagaa aatgcaaaag atcatccacc caagatctgt cactgaattc
     2041 tcttaccgtt acaatgaaca tcgtcatgag tcctctagtg aagtgcccaa ggcttacatt
     2101 tacacttctt tcagcgtcgt tgacagagaa aaggaaatca agcaagttat gcaacagttg
     2161 aatgctttag gttttgaagc tgtggatatc tccgataacg aattggctaa atctcatggt
     2221 agatacttgg ttggtggtgc ttctaaggtt cctaatgaaa gaattatttc atttgaattc
     2281 cctgaaagac caggtgcctt gactaggttc cttggaggcc taagcgattc ttggaatctt
     2341 actttattcc attatagaaa ccatggtgcc gatatcggta aggttttagc tggtatttcc
     2401 gttcctccaa gggaaaactt aaccttccaa aaattcttgg aagatttagg ctacacttat
     2461 catgatgaaa ctgataacac tgtttatcaa aaattcttga aatattaaaa ccgacaattt
     2521 actttataaa tttacgcaac aacttgttag gtaaaatgaa cgtcgtcata atttaaatac
     2581 tagacagcgc gtgtatgtta ctcatattta tactcatatc atgtgaacct ataaaccaat
     2641 tcaatacaag atttacagtt ctcttgtttt cttcttccag ttttcccctt ttatttcctg
     2701 tagaataaac ttaaacagtg atcattaatt tataattttc tgctttgtaa atacggcggt
     2761 aatgga
//
LOCUS       YSPRPS6A     1940 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.pombe ribosomal protein S6 (rps6) gene, complete cds.
ACCESSION   M36382
KEYWORDS    ribosomal protein S6.
SOURCE      S.pombe DNA.
  ORGANISM  Schizosaccharomyces pombe
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1940)
  AUTHORS   Gross,T., Nischt,R., Gatermann,K., Swida,U. and Kaeufer,N.F.
  TITLE     Primary structure of the ribosomal protein gene S6 from
            Schizosaccharomyces pombe
  JOURNAL   Curr. Genet. 13, 57-63 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        495     1214     ribosomal protein S6 (rps6) precursor
    sigp        495      614     ribosomal protein S6 signal peptide
    matp        615     1211     ribosomal protein S6
    mRNA        478     1240     rps6 mRNA (alt.)
    mRNA        478     1260     rps6 mRNA (alt.)
BASE COUNT      592 a    371 c    355 g    622 t
ORIGIN      
        1 gatcatctct atacgctagc gttttgttga gtaaataatt ctgttaatga tgtacaaacc
       61 attgagttca gctagttatt taaaaaatgc aacgagtata tcttataaaa ttaagtcata
      121 caagacatca atttcctcgt aacgatgata aatcacatct atggaaacat agcagactat
      181 gactttaata ttaattacct ttctccataa attaaacgaa aattaaagat gagatgaaaa
      241 gtgataatta tagtaacttg taagttatat tcaacagtaa aatcctaatt tccttttcaa
      301 ttttctgatt cacaatgtat atattagcta cccgcgtgtt tgtttactaa aaatagtgca
      361 ttaatcatcc atcgcaaaat atttataacg agagatataa agacttgtaa atctatccca
      421 aattgtgact gctcttagag gagaaactta gctagtcaaa aatacaacat tactgccggg
      481 aacctcaacg caaaatgaag ttaaacattt cttatccagc aaatggtaca caaaaattga
      541 ttgaaatcga cgatgatcgt cgtcttcgag tctttatgga gaagagaatg ggacaagagg
      601 ttccaggtga ctctgtaggc ccggagtttg ctggatatgt cttcaaaatt accggtggta
      661 acgacaaaca aggatttccc atgtttcaag gcgtcttgct tcctcaccgt gttcgtcttt
      721 tgcttcgtgc tggccaccct tgctatcgcc cacgccgtga tggtgagcgc aagcgtaaat
      781 ctgtccgtgg ttgcatcgtc ggccaagatt tggctgtcct tgctcttgcc attattaaac
      841 aaggtgaaca agatatccct ggtttgaccg atgttaccgt ccccaagcgt cttggtccaa
      901 agcgtgcctc taagatccgc cgcttcttta acttgtccaa ggaagatgat gtccgtcaat
      961 tcgttattcg tcgtgaggtt gttcccaaaa aggaaggaaa gaagccttat accaaggccc
     1021 ctaagattca acgtcttgtt actcctagaa ctcttcaaca caagcgccac cgctttgctc
     1081 ttaagcgcag acaagctgaa aagaaccgtg aagaggctgc tgagtttgct caacttatgg
     1141 ctaagcgtgt cgctgaggct aagcaaaaga gagaggtcgt caaggctcgc cgtgcctctt
     1201 ctttgaaaaa gtaaataggc atttttaagt aactttggtc atctccatat tattattttg
     1261 ctggaacttt tcttatgggg tttattgttg ttgaataaac gtagttagta acattctcta
     1321 agcatcgttt gtattcactt atgagtagga atgttaactg taattaggat tatcttcacg
     1381 aagctgcttt ttgtggtaat ccttcatgct tttaaataat tttaggatct atatagtaga
     1441 ggtaattaat gatattctaa tcaatctagc ttcaacacac acatgatcaa tttttatacg
     1501 ttttttaatc attcatatac actacgctaa aaccttttac agattgtttt actaaaataa
     1561 ttattatctg cgaaacactc cttctatttt cttgaggata tgctaggaga aaatattata
     1621 ctataagtat aagagggggt ttgataaaac tatttaatta cctctgctca ctagagtttt
     1681 tatcctttat gttcctatct aagtcgccta aatctcaaca atttggcgca gacaaaagtt
     1741 gtttgcgttg cgctagtttc tacaaccaca ccactaatac caactagcaa gaaccagaat
     1801 ttctgagcta ctttaggtag taaaataaga ggtgaattgt gagccccggt ttgctttttt
     1861 agacataagg cgaacacgag ttagtgctgt ctgttgccaa attattttaa ttcggcttta
     1921 tcaaaacaat atagcatcga
//
LOCUS       CNAGMDHA     1334 bp ss-mRNA            PLN       14-AUG-1990
DEFINITION  C.vulgaris glyoxysomal malate dehydrogenase mRNA, complete cds.
ACCESSION   M33148
KEYWORDS    glyoxysomal malate dehydrogenase.
SOURCE      C.vulgaris (variety Sugar Baby) cotyledon, cDNA to mRNA.
  ORGANISM  Citrullus vulgaris
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Violales; Cucurbitaceae.
REFERENCE   1  (bases 1 to 1334)
  AUTHORS   Gietl,C.
  TITLE     Glyoxysomal malate dehydrogenase from watermelon is synthesized
            with an amino-terminal transit peptide
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5773-5777 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Gietl, 22-MAR-1990.
FEATURES       from  to/span     description
    pept         70     1140     glyoxysomal malate dehydrogenase precursor (EC
                                 1.1.1.37)
    sigp         70      180     glyoxysomal malate dehydrogenase signal
                                 peptide
    matp        181     1137     glyoxysomal malate dehydrogenase
    mRNA    <     1     1334     glyoxysomal malate dehydrogenase mRNA
BASE COUNT      340 a    275 c    340 g    379 t
ORIGIN      
        1 caacgctaag ttcccaaagg tttctgatct tgaagcggtt ggtttgtttt tctgtttgtc
       61 aaactaatta tgcagccgat tccggatgtt aaccagcgca ttgctcgaat ctctgcgcat
      121 cttcatcctc ccaagtctca gatggaggag agttcagctt tgaggagggc gaattgccgg
      181 gctaaaggcg gagctcccgg gttcaaagtc gcaatacttg gcgctgccgg tggcattggc
      241 cagccccttg cgatgttgat gaagatgaat cctctggttt ctgttctaca tctatatgat
      301 gtagtcaatg cccctggtgt caccgctgat attagccaca tggacacggg tgctgtggtg
      361 cgtggattct tggggcagca gcagctggag gctgcgctta ctggcatgga tcttattata
      421 gtccctgcag gtgttcctcg aaaaccagga atgacgaggg atgatctgtt caaaataaac
      481 gcaggaattg tcaagactct gtgtgaaggg attgcaaagt gttgtccaag agccattgtc
      541 aacctgatca gtaatcctgt gaactccacc gtgcccatcg cagctgaagt tttcaagaag
      601 gctggaactt atgatccaaa gcgacttctg ggagttacaa tgctcgacgt agtcagagcc
      661 aatacctttg tggcagaagt attgggtctt gatcctcggg atgttgatgt tccagttgtt
      721 ggcggtcatg ctggtgtaac cattttgccc cttctatctc aggtgaagcc tccaagttct
      781 ttcacacaag aagagattag ttacctgact gataggattc aaaatggtgg aacagaagtt
      841 gtcgaggcca aagcaggagc tggctcagca actctctcaa tggcttatgc tgccgttaag
      901 tttgcagatg catgcctcag gggcttaaga ggagatgctg gtgtcattga atgcgcgttt
      961 gtgtcttctc aggtgactga acttccattc tttgcatcaa aagtacgact tggtcgcaat
     1021 ggtatcgaag aagtatactc ccttggcccg ctaaatgagt atgagaggat tggattggag
     1081 aaagcgaaga aagagttggc aggaagcatt gagaagggag tttccttcat cagaagctga
     1141 agagatgcca attaccatta gttttaatag aaacattcca tctcttatag attacttgtg
     1201 ctcaatgttt tcctggagat tgaagttgat tgaaatgata ccacaccacg tatttttata
     1261 ctaataaaac tatatcgcca tcatgtcgat atttaatgca caaccaaaag ggttggatta
     1321 gagtaccttt tatg
//
LOCUS       HAMACAPT     1837 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA,
            partial cds.
ACCESSION   J05590 M22755
KEYWORDS    N-acetylglucosamine-1-phosphate transferase; glycosyl transferase;
            tunicamycin-resistance.
SOURCE      Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone
            TRG-[4,22].
  ORGANISM  Cricetulus longicaudatus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 889 to 960)
  AUTHORS   Lehrman,M.A., Zhu,X. and Khounlo,S.
  TITLE     Amplification and molecular cloning of the hamster tunicamycin-
            sensitive N-acetylglucosamine-1-phosphate transferase gene: The
            hamster and yeast enzymes share a common peptide sequence
  JOURNAL   J. Biol. Chem. 263, 19796-19803 (1988)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1837)
  AUTHORS   Zhu,X. and Lehrman,M.A.
  TITLE     Cloning, sequence and expression of a cDNA encoding hamster
            UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate
            transferase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and sequence for [1] kindly submitted by M. Lehrman,
            04-APR-1989.  Draft entry and computer-readable sequence for [1]
            kindly submitted by M.Lehrman, 12-JUN-1990.
FEATURES       from  to/span     description
    pept        148     1374     N-acetylglucosamine-1-phosphate transferase
                                 (GPT)
    binding      67       79     dolichol binding site
    binding     222      234     dolichol binding site
BASE COUNT      363 a    515 c    418 g    541 t
ORIGIN      
        1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg
       61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg
      121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg
      181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc
      241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg
      301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc
      361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc
      421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg
      481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct
      541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta
      601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc
      661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat
      721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta
      781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt
      841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga
      901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc
      961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct
     1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg
     1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc
     1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag
     1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt
     1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc
     1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc
     1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca
     1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt
     1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt
     1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc
     1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat
     1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag
     1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc
     1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt
//
LOCUS       HS4NA3ABC   10089 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C
            genes, complete cds.
ACCESSION   M34440
KEYWORDS    nuclear antigen.
SOURCE      Epstein-Barr virus type 2 (isolate AG876) DNA.
  ORGANISM  Epstein-Barr virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae; 
            Gammaherpesviridae.
REFERENCE   1  (bases 1 to 10089)
  AUTHORS   Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E.,
            Rickinson,A. and Kieff,E.
  TITLE     Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their
            EBNA-3A, EBNA-3B and EBNA-3C genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.T.Sample, 23-MAY-1990.
               Kieff Laboratory
               Harvard Medical School
               75 Francis Street
               Boston, MA 02115
FEATURES       from  to/span     description
    pept        189      524     nuclear antigen EBNA-3A, exon 1
                610     3051     nuclear antigen EBNA-3A, exon 2
    pept       3237     3593     nuclear antigen EBNA-3B, exon 1
               3672     6155     nuclear antigen EBNA-3B, exon 2
    pept       6277     6636     nuclear antigen EBNA-3C, exon 1
               6711     9560     nuclear antigen EBNA-3C, exon 2
    IVS         525      609     EBNA-3A intron
    IVS        3594     3671     EBNA-3B intron
    IVS        6637     6710     EBNA-3C intron
BASE COUNT     2495 a   2837 c   2619 g   2138 t
ORIGIN      
        1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct
       61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag
      121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg
      181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag
      241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca
      301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc
      361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac
      421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc
      481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg
      541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt
      601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat
      661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg
      721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg
      781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact
      841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg
      901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg
      961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc
     1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct
     1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac
     1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat
     1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg
     1261 tacgggcaag acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca
     1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg
     1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg
     1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct
     1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa
     1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac
     1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc
     1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc
     1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg
     1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg
     1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa
     1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca
     1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca
     2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag
     2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc
     2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac
     2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg
     2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa
     2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg
     2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg
     2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc
     2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg
     2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg
     2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa
     2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg
     2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct
     2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat
     2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt
     2941 ggccagaatg tcacagacca tgaacctaga agggtggtgg tgtcagctat tgttcacatg
     3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg
     3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat
     3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc
     3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga
     3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg
     3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac
     3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag
     3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca
     3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc
     3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag
     3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc
     3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga
     3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac
     3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct
     3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata
     3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa
     3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag
     4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt
     4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga
     4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata
     4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag
     4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct
     4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg
     4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa
     4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg
     4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca
     4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag
     4621 ccagaacaga gcaaccaaga gccaagcctg attcccctgc ccccacagtg gtccttcggc
     4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg
     4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca
     4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca
     4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga
     4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt
     4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac
     5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc
     5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac
     5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct
     5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc
     5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc
     5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc
     5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc
     5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc
     5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc
     5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag
     5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta
     5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg
     5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac
     5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac
     5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc
     5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct
     6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt
     6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt
     6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg
     6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta
     6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg
     6301 gactctatac agtcacctga caatgcgcgg ggagatgatg tacagaatac tggtgagcat
     6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac
     6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc
     6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta
     6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca
     6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc
     6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc
     6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct
     6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg
     6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat
     6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca
     6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg
     7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag
     7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga
     7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc
     7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt
     7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta
     7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc
     7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg
     7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag
     7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc
     7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac
     7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag
     7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg
     7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag
     7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt
     7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga
     7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta
     7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca agtaagccac
     8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc
     8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc
     8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc
     8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc
     8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc
     8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg
     8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac
     8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg
     8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg
     8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata
     8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc
     8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact
     8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc
     8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt
     8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag
     8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc
     9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac
     9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc
     9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc
     9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac
     9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg
     9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag
     9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa
     9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg
     9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca
     9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga
     9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta
     9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca
     9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg
     9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg
     9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg
     9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag
     9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc
    10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt
    10081 ggaggatcc
//
LOCUS       HUMPAPB      3061 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human prostatic acid phosphatase mRNA, complete cds.
ACCESSION   M34840
KEYWORDS    glycoprotein; prostatic acid phosphatase; secretory protein.
SOURCE      Human prostate, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1199)
  AUTHORS   Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
            Moore,D.L.
  TITLE     Covalent structure, disulfide bonding and identification of
            reactive surface and active site residues of human prostatic acid
            phosphatase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3061)
  AUTHORS   Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and
            Moore,D.L.
  TITLE     Covalent structure, disulfide bonding and identification of
            reactive surface and active site residues of human prostatic acid
            phosphatase
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.L.Van Etten, 08-JUN-1990.
               Chemistry Department
               Purdue University
               West Lafayette, IN 47907
FEATURES       from  to/span     description
    pept         15     1175     prostatic acid phosphatase precursor (EC
                                 3.1.3.2) /hgml_locus_uid="LA0168F"
                                 /nomgen="ACPP" /map="3q21-qter"
    sigp         15      110     prostatic acid phosphatase signal peptide
    matp        111     1172     prostatic acid phosphatase
BASE COUNT      917 a    642 c    681 g    821 t
ORIGIN      Chromosome 3q21-qter.
        1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc
       61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga
      121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg
      181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg
      241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt
      301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg
      361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc
      421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc
      481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg
      541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag
      601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg
      661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga
      721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat
      781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc
      841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc
      901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga
      961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc
     1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt
     1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa
     1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag
     1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc
     1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca
     1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt
     1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa
     1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt
     1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt
     1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga
     1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc
     1681 aaggagaggc aaagaaagga gatacagtgg agacatctgg aaagttttct ccactggaaa
     1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt
     1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat
     1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata
     1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga
     1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag
     2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag
     2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt
     2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta
     2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca
     2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa
     2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc
     2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac
     2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa
     2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat
     2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac
     2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc
     2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc
     2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat
     2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc
     2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat
     2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta
     3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa
     3061 a
//
LOCUS       LDV3ECPA     1064 bp ss-RNA             VRL       14-AUG-1990
DEFINITION  Lactate dehydrogenase-elevating virus capsid protein gene, complete
            cds.
ACCESSION   M34856
KEYWORDS    viral capsid protein.
SOURCE      Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones
            A16, B24, B63, B90, B103, C44, dt4.
  ORGANISM  Lactate dehydrogenase-elevating virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Togaviridae.
REFERENCE   1  (bases 1 to 1064)
  AUTHORS   Godeny,E.K., Speicher,D.W. and Brinton,M.A.
  TITLE     Map location of lactate dehydrogenase-elevating virus (LDV) capsid
            protein (Vp1) gene
  JOURNAL   Virology (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.K.Godeny, 06-JUN-1990.
FEATURES       from  to/span     description
    pept        637      984     capsid protein
    pept        135      650     unidentified ORF
BASE COUNT      267 a    223 c    265 g    309 t
ORIGIN      3' terminus region.
        1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc
       61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc
      121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt
      181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg
      241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt
      301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg
      361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc
      421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg
      481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta
      541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga
      601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc
      661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat
      721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc
      781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga
      841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt
      901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat
      961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg
     1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt
//
LOCUS       MBIFRHADGB   3903 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase
            (frhADGB) gene, complete cds.
ACCESSION   J02914
KEYWORDS    8-hydroxy-5-deazaflavin-reducing hydrogenase.
SOURCE      M.thermoautotrophicum (strain delta-H) DNA.
  ORGANISM  Methanobacterium thermoautotrophicum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanobacteriaceae.
REFERENCE   1  (bases 1 to 3903)
  AUTHORS   Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T.
  TITLE     Cloning, sequence determination, and expression of the genes
            encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin
            reducing hydrogenase from Methanobacterium thermoautotrophicum
            delta-H
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.A.Alex, 25-MAY-1990.
FEATURES       from  to/span     description
    pept        298     1515     F420 hydrogenase alpha-subunit (frhA)
    pept       1524     2000     F420 hydrogenase delta-subunit (frhD)
    pept       2114     2824     F420 hydrogenase gamma-subuint (frhG)
    pept       2837     3682     F420 hydrogenase beta-subunit (frhB)
    binding     284      293     ribosomal binding site (put.)
    binding    1514     1520     ribosomal binding site (put.)
    binding    2100     2106     ribosomal binding site (put.)
    binding    2823     2832     ribosomal binding site (put.)
BASE COUNT     1201 a    878 c    965 g    859 t
ORIGIN      
        1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta
       61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac
      121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt
      181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt
      241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg
      301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg
      361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg
      421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg
      481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc
      541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac
      601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac
      661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt
      721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac
      781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa
      841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc
      901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa
      961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata
     1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca
     1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt
     1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg
     1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta
     1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag
     1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca
     1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc
     1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa
     1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg
     1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt
     1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg actggcggtc
     1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg
     1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg
     1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa
     1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa
     1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg
     1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc
     2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga
     2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag
     2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact
     2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga
     2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga
     2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac
     2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt
     2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc
     2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc
     2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt
     2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct
     2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg
     2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct
     2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac
     2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata
     2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg
     3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa
     3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga
     3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta
     3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat
     3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat
     3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga
     3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg
     3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg
     3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga
     3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga
     3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat
     3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt
     3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa
     3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt
     3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat
     3901 cga
//
LOCUS       MUSLRCNA     1751 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse loricrin gene, complete cds.
ACCESSION   M34398
KEYWORDS    cell envelope protein; loricrin.
SOURCE      Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to
            mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1751)
  AUTHORS   Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D.,
            Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M.,
            Yuspa,S.H. and Roop,D.R.
  TITLE     Identification of a major keratinocyte cell envelope protein,
            loricrin
  JOURNAL   Cell 61, 1103-1112 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Rothnagel, 22-MAY-1990.
FEATURES       from  to/span     description
    pept         71     1516     loricrin
    signal     1740     1745     polyA signal
BASE COUNT      223 a    493 c    706 g    329 t
ORIGIN      
        1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc
       61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg
      121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg
      181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg
      241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg
      301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg
      361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg
      421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg
      481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg
      541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg
      601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg
      661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc
      721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg
      781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg
      841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg
      901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg
      961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg
     1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg
     1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc
     1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg
     1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag
     1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg
     1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg
     1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg
     1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac
     1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc
     1561 cgtgggcgcc gatgggctta actttctcat gaatttgcct gaggtttcca aacccttcac
     1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct
     1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca
     1741 ataaatttgc t
//
LOCUS       PFAGIPA      1906 bp ds-DNA             INV       14-AUG-1990
DEFINITION  P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete
            cds.
ACCESSION   J05544
KEYWORDS    glucosephosphate isomerase.
SOURCE      P.falciparum (strain 7G8) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 1906)
  AUTHORS   Kaslow,D.C. and Hill,S.
  TITLE     Cloning metabolic pathway genes by complementation in Escherichia
            coli: Isolation and expression of Plasmodium falciparum
            glucosephosphate isomerase
  JOURNAL   J. Biol. Chem. 265, 12337-12341 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.C.Kaslow, 04-JUN-1990.
FEATURES       from  to/span     description
    pept         28     1803     glucosephosphate isomerase (EC 5.3.1.9)
BASE COUNT      782 a    174 c    289 g    661 t
ORIGIN      Chromosome 14.
        1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat
       61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat
      121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct
      181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa
      241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc
      301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata
      361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat
      421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg
      481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa
      541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat
      601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat
      661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct
      721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt
      781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg
      841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta
      901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga
      961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat
     1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat
     1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca
     1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa
     1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat
     1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa
     1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg
     1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa
     1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt
     1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta
     1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt
     1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat
     1681 gacacaagaa atcaaaagaa atcagataat acctataatt ttaatgaatc tacaaaaaat
     1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag
     1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt
     1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt
//
LOCUS       RATGADA      3198 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Rat glutamic acid decarboxylase (GAD) mRNA, complete cds.
ACCESSION   M34445
KEYWORDS    glutamic acid decarboxylase.
SOURCE      Rat brain and hypothalamus, cDNA to mRNA, (libraries of
            D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and
            pCR22HP.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3198)
  AUTHORS   Wyborski,R.J., Bond,R.W. and Gottlieb,D.I.
  TITLE     Characterization of a cDNA coding for rat glutamic acid
            decarboxylase
  JOURNAL   Mol. Brain Res. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.I.Gottlieb, 23-MAY-1990.
FEATURES       from  to/span     description
    pept        184     1965     glutamic acid decarboxylase
    mRNA    <     1     3198     glutamic acid decarboxylase mRNA
    signal     3174     3179     polyA signal
BASE COUNT      869 a    742 c    767 g    820 t
ORIGIN      
        1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca
       61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc
      121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag
      181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat
      241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc
      301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag
      361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa
      421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct
      481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg
      541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac
      601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac
      661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt
      721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta
      781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc
      841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca
      901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc
      961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc
     1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg
     1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt
     1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat
     1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca
     1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg
     1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc
     1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa
     1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag
     1501 cagtatgacg tctcctacga tacaggggac aaggcgattc agtgtggccg ccatgtggac
     1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc
     1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt
     1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa
     1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc
     1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg
     1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc
     1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc
     1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca
     2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg
     2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa
     2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat
     2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc
     2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca
     2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag
     2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta
     2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca
     2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac
     2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag
     2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt
     2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta
     2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt
     2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt
     2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc
     2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa
     3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta
     3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt
     3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa
     3181 taaaaaatat tctccatg
//
LOCUS       RATTSHRA     5270 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Rat thyroid stimulating hormone receptor mRNA, complete cds.
ACCESSION   M34842
KEYWORDS    TSH receptor; thyroid stimulating hormone receptor;
            transmembrane protein.
SOURCE      Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to
            mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 5270)
  AUTHORS   Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C.,
            McBride,O.W. and Kohn,L.D.
  TITLE     Cloning, chromosomal assignment, and regulation of the rat
            thyrotropin receptor by thyrotropin, agents which increase cAMP
            levels and thyroid autoantibodies
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5677-5681 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.D.Kohn, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         55     2349     thyroid stimulating hormone receptor precursor
    sigp         55      117     thyroid stimulating hormone receptor signal
                                 peptide
    matp        118     2346     thyroid stimulating hormone receptor
    signal     5256     5261     poly-A signal
BASE COUNT     1415 a   1343 c   1176 g   1336 t
ORIGIN      
        1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg
       61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga
      121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag
      181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact
      241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat
      301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa
      361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta
      421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct
      481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct
      541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg
      601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg
      661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga
      721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct
      781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag
      841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac
      901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat
      961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc
     1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag
     1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag
     1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac
     1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca
     1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg
     1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc
     1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc
     1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac
     1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt
     1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc
     1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg
     1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc
     1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac
     1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg
     1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt
     1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc
     1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg
     2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc
     2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag
     2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa
     2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt
     2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca
     2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa
     2401 catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct
     2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc
     2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac
     2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg
     2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag
     2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga
     2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat
     2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc
     2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat
     2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt
     3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta
     3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat
     3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac
     3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga
     3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt
     3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa
     3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct
     3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt
     3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt
     3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata
     3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt
     3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac
     3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat
     3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga
     3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat
     3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag
     3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc
     4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc
     4081 tgtgaagtag ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg
     4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa
     4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga
     4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag
     4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc
     4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg
     4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc
     4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct
     4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg
     4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca
     4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa
     4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta
     4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga
     4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg
     4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc
     4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg
     5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca
     5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt
     5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg
     5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg
//
LOCUS       STMBLIPA      840 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  S.clavuligerus beta-lactamase inhibitory protein gene, complete
            cds.
ACCESSION   M34538
KEYWORDS    beta-lactamase inhibitory protein.
SOURCE      S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP.
  ORGANISM  Streptomyces clavuligerus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 840)
  AUTHORS   Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E.
  TITLE     Isolation and characterization of a beta-lactamase inhibitory
            protein from Streptomyces clavuligerus and cloning and analysis of
            corresponding gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.E.Jensen, 25-MAY-1990.
               Dept. of Microbiology
               University of Alberta
               Edmonton, Alberta Canada T6G 2E9
FEATURES       from  to/span     description
    pept         67      672     beta-lactamase inhibitory protein precursor
                                 (bliP)
    sigp         67      174     beta-lactamase inhibitory protein signal
                                 peptide
    matp        175      669     beta-lactamase inhibitory protein
    binding      52       63     ribosomal binding site
    signal      686      724     transcription termination signal
BASE COUNT      140 a    248 c    320 g    132 t
ORIGIN      
        1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga
       61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc
      121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg
      181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc
      241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc
      301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc
      361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg
      421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg
      481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg
      541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc
      601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg
      661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc
      721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg
      781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc
//
LOCUS       TRBPARPA      763 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  T.brucei procyclin (pPS-1) mRNA, complete cds.
ACCESSION   M34790
KEYWORDS    procyclic acidic repetitive protein; procyclin.
SOURCE      T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone
            pPS-1.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 763)
  AUTHORS   Dorn,P.L., Aman,R.A. and Boothroyd,J.C.
  TITLE     Inhibition of protein synthesis results in super-induction of
            procyclin RNA
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Boothroyd, 04-JUN-1990.
               Stanford University School of Medicine
               Dept of Microbiology and Immunology
               Sherman Fairchild Science Building
               Stanford, CA 94305-5402 USA
FEATURES       from  to/span     description
    pept         29      466     procyclin
    mRNA    <     1      763     procyclin mRNA
    rpt         215      393     12 bp degenerate repeats
BASE COUNT      192 a    215 c    156 g    200 t
ORIGIN      
        1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct
       61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc
      121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg
      181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga
      241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga
      301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga
      361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc
      421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt
      481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc
      541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt
      601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc
      661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag
      721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt
//
LOCUS       W10PPIBCA    4761 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes,
            complete cds, and ippD gene, 3' end.
ACCESSION   M34849
KEYWORDS    antigen.
SOURCE      Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA,
            clones HB101/pHS[4108,5357].
  ORGANISM  Plasmid pWR100
            Prokaryota; Bacteria.
REFERENCE   1  (bases 1 to 4761)
  AUTHORS   Baudry,B., Kaczorek,M. and Sansonetti,P.J.
  TITLE     Nucleotide sequence of the invasion plasmid antigen B and C genes
            (ipaB and ipaC) of Shigella flexneri
  JOURNAL   Microb. Pathog. 4, 345-357 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 538 and 3987 to 4761)
  AUTHORS   Baudry,B., Kaczorek,M. and Sansonetti,P.J.
  JOURNAL   Unpublished (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by B.Baudry, 05-JUN-1990.
               Center for Vaccine Development
               University of Maryland at Baltimore
               10 South Pine Street
               Baltimore, MD 21201
FEATURES       from  to/span     description
    pept        606     1073     polypeptide I (ippI)
    pept       1079     2821     polypeptide B (ipaB)
    pept       2841     3932     polypeptide C (ipaC)
    pept       3983  >  4761     ipaD protein
    pept    <     1      549     unidentified ORF
    binding     595      600     ippI ribosomal binding site
    binding    1065     1070     ipaB ribosomal binding site
    binding    2829     2834     ipaC ribosomal binding site
    binding    3971     3977     ipaD ribosomal binding site
    signal      556      561     ippI -10 region
    signal     2723     2728     ipaC -35 region
    signal     2745     2750     ipaC -10 region
BASE COUNT     1769 a    952 c    805 g   1235 t
ORIGIN      
        1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat
       61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat
      121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct
      181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca
      241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat
      301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc
      361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg
      421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca
      481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat
      541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag
      601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa
      661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca
      721 tttattcata tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct
      781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta
      841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat
      901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag
      961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat
     1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat
     1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga
     1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac
     1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa
     1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct
     1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa
     1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga
     1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat
     1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat
     1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga
     1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca
     1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat
     1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa
     1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg
     1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc
     1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga
     1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt
     2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt
     2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca
     2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc
     2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc
     2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac
     2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac
     2401 cctcacagac cttataccaa agtttctcaa gaatttttct tctcaactgg acgatttaat
     2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc
     2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc
     2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa
     2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga
     2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc
     2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg
     2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata
     2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc
     2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat
     3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg
     3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag
     3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag
     3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga
     3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc
     3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa
     3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc
     3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa
     3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc
     3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt
     3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc
     3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca
     3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca
     3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc
     3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag
     3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg
     3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc
     4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa
     4081 aacaacgacc agttctcatc ctgtaagttc ccttactatg ctcaacgaca cccttcataa
     4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac
     4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc
     4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt
     4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga
     4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata
     4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag
     4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa
     4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc
     4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac
     4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat
     4741 agacaatatg ttaaaaagct t
//
LOCUS       YEPOMPHA      824 bp ss-mRNA            BCT       14-AUG-1990
DEFINITION  Y.enterocolitica cationic outer membrane protein (ompH) gene,
            complete cds.
ACCESSION   M34854
KEYWORDS    cationic outer membrane protein;
            enterobacterial outer membrane protein;
            lipopolysaccharide binding protein; secretory protein;
            structural membrane protein.
SOURCE      Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115.
  ORGANISM  Yersinia enterocolitica
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 824)
  AUTHORS   Hirvas,L., Koski,P. and Vaara,M.
  TITLE     The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and
            comparison to known ompH sequences
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.H.Hirvas, 06-JUN-1990.
               University of Helsinki
FEATURES       from  to/span     description
    pept        324      818     ompH protein precursor (gtg start codon)
    sigp        324      392     ompH protein signal peptide
    matp        393      815     ompH protein
    binding     308      317     ompH ribosomal binding site
    signal      120      125     ompH -35 region
    signal      143      148     ompH -10 region
BASE COUNT      257 a    147 c    198 g    221 t      1 others
ORIGIN      
        1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg
       61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg
      121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt
      181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt
      241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa
      301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag
      361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa
      421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag
      481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc
      541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga
      601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc
      661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta
      721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta
      781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac
//
LOCUS       YSCEXG1A     2056 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase
            (EXG1) gene, complete cds.
ACCESSION   M34341
KEYWORDS    1,3-glucan glucanohydrolase; exo-1,3-B-glucanase.
SOURCE      S.cerevisiae (strain AB320) DNA, clone YEp13.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2056)
  AUTHORS   Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A.,
            Nebreda,A.R., Mendez,E. and del Rey,F.
  TITLE     Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1,
            in the yeast Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.del Rey, 17-MAY-1990.
               Instituto de Microbiologia bioquimica
               Facultad de Biologia
               CSIC/Universidad de Salamanca
               37008 Salamanca, SPAIN
FEATURES       from  to/span     description
    pept        363     1709     exo-1,3-B-glucanase/1,3-B-D-glucan
                                 glucanohydrolase precursor (EXG1, EC 3.2.1.58)
    sigp        363      482     exo-1,3-B-glucanase/1,3-B-D-glucan
                                 glucanohydrolase signal peptide
    matp        483     1706     exo-1,3-B-glucanase/1,3-B-D-glucan
                                 glucanohydrolase
    mRNA        265  >  1709     EXG1 mRNA
    signal      153      159     TATA box
BASE COUNT      610 a    395 c    396 g    655 t
ORIGIN      Chromosome XII-right arm.
        1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg
       61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac
      121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa
      181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat
      241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt
      301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta
      361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta
      421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa
      481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg
      541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg
      601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg
      661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg
      721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc
      781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca
      841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt
      901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca
      961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat
     1021 acttggacac tgttattggt atcgaattga ttaatgagcc attgggtcct gttctagaca
     1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca
     1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact
     1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct
     1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg
     1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga
     1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg
     1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt
     1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct
     1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg
     1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc
     1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat
     1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt
     1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa
     1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta
     1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat
     1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt
     2041 gagcttttcc gccggc
//
LOCUS       CORDTXRA     2529 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  C.diphtheriae toxin repressor (dtxR) gene, complete cds.
ACCESSION   M34239
KEYWORDS    regulatory protein; toxin repressor.
SOURCE      C.diphtheriae (strain C7) DNA, clone pHH2500..
  ORGANISM  Corynebacterium diphtheriae
            Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE   1  (bases 1 to 1390)
  AUTHORS   Boyd,J., Oza,M.N. and Murphy,J.R.
  TITLE     Molecular cloning and DNA sequence analysis of the diphtheria tox
            iron-dependent regulatory element (dtxR) from Corynebacterium
            diphtheriae
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5968-5972 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 2529)
  AUTHORS   Boyd,J., Oza,M.N. and Murphy,J.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.R.Murphy, 11-MAY-1990.
               University Hospital
               Dept Medicine-Rm E613
               88 East Newton Street
               Boston, MA 02118-2393
FEATURES       from  to/span     description
    pept        681     1361     toxin repressor protein (dtxR)
    pept         25      456     ORF1
    pept       1384     2370     ORF3
    signal      595      629     promoter sequence
    binding     664      669     ribosomal binding site (put.)
    binding    1200     1283     DNA-binding region (put.)
BASE COUNT      691 a    615 c    593 g    630 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagctttctc gaatcaaacg cgagatgtat caacagctcg gacgtgaagc aaccaacgaa
       61 gagcttgcgg aagaatctgg cattgaagaa tccaaaattg aaatgttgct tcgccagtct
      121 cgtgacccag taagcctaga tatgccagtt ggtgccgatg aagaggctcc acttggtgac
      181 tttatcgaag attccgaggc gactgatgcc gaatcagctg tagttgcgtc gttgcgtcac
      241 tctgatatca gaacagtctt ggacactttg gagcaacgtg aacaggacgt tattcgcctt
      301 cgttatgggc tcgatgatgg agttccccgc actttagatc aaattggtcg tcaatttgga
      361 ctgtctcgtg aaagagttcg ccaaattgaa cgcgaggtca tgtcgaaact gcgtgacggt
      421 gcccgtgccg aaaaacttcg agcatacgca caataaagcg tttgcttaga tatgcctacc
      481 aataaagaca taaacgccta ttaaaagcaa tctttagatt aggcgttata attaaagtct
      541 catcgaaaaa cgcgctgcgg gactacaacg caacaagaaa actattccat atttttcacg
      601 ctacaattcg ttgtagattg ataggaattg atcaccagca cacaacagtc tccatggcac
      661 tataaggaaa gaggcttaca atgaaggact tagtcgatac cacagagatg tacttgcgta
      721 ctatctatga gctggaagaa gagggagtca cccctcttcg cgctaggatc gctgagcgtc
      781 tggaacaatc tggacctaca gttagccaaa ccgttgcccg tatggagcgc gatggacttg
      841 tcgttgtcgc ctcagaccgc agtctacaaa tgacaccgac aggccgcact ttagcgactg
      901 cagttatgcg taaacatcgc ttagctgagc gccttcttac cgatatcatt ggtctagata
      961 tcaataaagt tcacgatgaa gcctgccgct gggaacacgt tatgagtgac gaagttgaac
     1021 gcaggctcgt gaaagtattg aaagatgtca gtcggtcccc cttcggaaac ccaattccag
     1081 gtctcgacga actcggcgta ggcaattctg acgcggcagc ccccggaact cgcgttattg
     1141 acgctgccac cagcatgccc cgcaaagtac gcattgttca gattaacgaa atctttcaag
     1201 ttgaaacgga tcagtttaca cagctcctcg atgctgacat ccgtgttgga tcagaagtcg
     1261 aaattgtaga tagagacggc cacatcacgt tgagccacaa tggaaaagat gtcgaactcc
     1321 tcgatgatct ggctcacact attcgtatcg aagaactcta aatactaaag gcggcgaaat
     1381 tagatgaaac tcctcgttac cggtggcgcc ggatacgtag gaagtgtctg ttccactgtt
     1441 ctgctcgaac aggggcacga agtaacaatt gtcgataatc ttacaaccgg caatcgcgat
     1501 gctgttccgc taggagccac ttttgtcgag ggtgatatca aagacgttgc ggaaaacgtt
     1561 ttatcatctg attcttttga tgcggttctt cacttcgcgg ctcgttcgct tgtaggcgaa
     1621 tcagttgaaa agccagatga atattggcag cacaacatgg tgaccacact cgcacttctt
     1681 gatgccatga aacgtaataa cgtgcgaaac attgtctttt cttctacagc tgctacatac
     1741 ggcgaacccg aaacagtccc gattactgaa gacgctccta cccacccaac gaatccatac
     1801 ggcgcaacca agctgtctat cgactacgcg atcacatcat acgcacatgc atatgggttt
     1861 gctgcgacga gtctgcggta cttcaacgta gctggcgcat atggtttggt gggtgaaaat
     1921 cgagaaatag aaacccatct cattccgctc gttttacaag tagcactggg acaccgtgac
     1981 aagattttca tgtttggtga cgactggcca actgaagatg gaacccctat ccgcgattac
     2041 attcatattc gtgatcttgc agatgcccat attttggcac ttcaaagtaa cgtcgaaggc
     2101 agccaccgga tctttaatct cggatccggt gaaggatatt cggtcaaaca ggtcattgat
     2161 acctgcagag aagttaccgg tcatcccatc cccgctgagg tagctcctcg acgggctggt
     2221 gatcctgcag tgctcattgc atcatcggct aaggcgcaat ccgaactagg atggaagcca
     2281 caacgcactg atctgcacac gattgtttct gatgcttggg cttttacgtc tcaactcgga
     2341 gacaaagccc atagcgcttc tcgtggataa caacgagtag gtttatcttt tgcagtccgc
     2401 ggtgtgatta cttaagtatt cacaccgcgg attttacatt tgttgactca ccaaaatgtt
     2461 tctgagcttc ctgactccca ttgataacaa gatcaacaat ggatccagat agacctcgat
     2521 aataagctt
//
LOCUS       ECOANSBA     1643 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  E.coli L-asparaginase II (ansB) gene, complete cds.
ACCESSION   M34234
KEYWORDS    L-asparaginase II; ansB gene.
SOURCE      E.coli (strain K12 isolate JM108) DNA, clones lambda-ansB and
            pansB7.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1643)
  AUTHORS   Bonthron,D.T.
  TITLE     L-asparaginase II of Escherichia coli K-12: Cloning and sequencing
            of the ansB gene and its position on the chromosomal map
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by D.T.Bonthron, 11-MAY-1990.
FEATURES       from  to/span     description
    pept        346     1392     L-asparaginase II (ansB) (EC 3.5.1.1)
    sigp        346      411     L-asparaginase II signal peptide
    matp        412     1389     L-asparaginase II
    binding     335      339     ribosomal binding site (put.)
    site       1400     1425     transcription termination signal (put.)
BASE COUNT      442 a    404 c    405 g    392 t
ORIGIN      63.8 min on K12 map.
        1 gcggtaaccc attacagaat gtgctgggaa gcctgggggg gctgcaatcc tcaatccaaa
       61 ccgagtggaa aaagcaggaa aaagatttcc agcagtttgg caaagatgtt tgtagccgcg
      121 ttgtgactct ggaagatagc cgcaaagccc tggtcgggaa tttaaaataa tcctctattt
      181 taagacggca taatactttt ttatgccgtt taattcttcg ttttgttacc tgcctctaac
      241 tttgtagatc tccaaaatat attcacgttg taaattgttt aacgtcaaat ttcccataca
      301 gagctaaggg ataatgcgta gcgttcacgt aactggagga atgaaatgga gtttttcaaa
      361 aagacggcac ttgccgcact ggttatgggt tttagtggtg cagcattggc attacccaat
      421 atcaccattt tagcaaccgg cgggaccatt gccggtggtg gtgactccgc aaccaaatct
      481 aactacacag tgggtaaagt tggcgtagaa aatctggtta atgcggtgcc gcaactaaaa
      541 gacattgcga acgttaaagg cgagcaggta gtgaatatcg gctcccagga catgaacgat
      601 aatgtctggc tgacactggc gaaaaaaatt aacaccgact gcgataagac cgacggcttc
      661 gtcattaccc acggtaccga cacgatggaa gaaactgctt acttcctcga cctgacggtg
      721 aaatgcgaca aaccggtggt gatggtcggc gcaatgcgtc cgtccacgtc tatgagcgca
      781 gacggtccat tcaacctgta taacgcggta gtgaccgcag ctgataaagc ctccgccaac
      841 cgtggcgtgc tggtagtgat gaatgacacc gtgcttgatg gccgtgacgt caccaaaacc
      901 aacaccaccg acgtagcgac cttcaagtct gttaactacg gtcctctggg ttacattcac
      961 aacggtaaga ttgactacca gcgtaccccg gcacgtaagc ataccagcga cacgccattc
     1021 gatgtctcta agctgaatga actgccgaaa gtcggcattg tttataacta cgctaacgca
     1081 tccgatcttc cggctaaagc actggtagat gcgggctatg atggcatcgt tagcgctggt
     1141 gtgggtaacg gcaacctgta taaatctgtg ttcgacacgc tggcgaccgc cgcgaaaacc
     1201 ggtactgcag tcgtgcgttc ttcccgcgta ccgacgggcg ctaccactca ggatgccgaa
     1261 gtggatgatg cgaaatacgg cttcgtcgcc tctggcacgc tgaacccgca aaaagcgcgc
     1321 gttctgctgc aactggctct gacgcaaacc aaagatccgc agcagatcca gcagatcttc
     1381 aatcagtact aatcgcctcg ccccggtatc gtgccggggc tttttcactt cagactcacg
     1441 tccattgcca attttaatta ccctaatgat aatcaccgga ataaattatt ccgcgtgagg
     1501 tttttcgggt gaaaaagcaa tggattgttg gcacggcact gcttatgttg atgactggta
     1561 atgcctgggc agatggcgaa ccgccaactg aaaatatctt aaaagatcaa ttcaaaaagc
     1621 agtatcacgg cattctcaag ctt
//
LOCUS       HUMADE5RC    1946 bp ds-DNA             SYN       14-AUG-1990
DEFINITION  Synthetic human/adenovirus type 5 recombination junction.
ACCESSION   M34061
KEYWORDS    .
SOURCE      Human fetal bone marrow fibroblasts transformed with Adeno5/SV40
            recombinant virus (cell line H13.1) DNA, clone pH13.1SV1.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 1946)
  AUTHORS   Romani,M., De Ambrosis,A., Alhadeff,B., Purrello,M., Gluzman,Y. and
            Siniscalco,M.
  TITLE     Preferential viral integration at the highly recombinogenic
            chromosomal site 1p36 in human cells transformed with an
            Adeno5/SV40 hybrid
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Romani 08-MAY-1990.
FEATURES       from  to/span     description
    recomb      429      430     adenovirus type 5 end/Human DNA start
    rpt         430     1134     tandem Alu repeats
BASE COUNT      492 a    548 c    510 g    396 t
ORIGIN      Chromosome 1p36.
        1 tttgacccgg aacgcggaaa acacctgaga aaaacacctg ggcgagtctc cacgtaaacg
       61 gtcaaagtcc ccgcggccct agacaaatat tacgcgctat gagtaacaca aaattattca
      121 gatttcactt cctcttattc agttttcccg cgaaaatggc caaatcttac tcggttacgc
      181 ccaaaattta ctacaacatc cgcctaaaac cgcgcgaaaa ttgtcacttc ctgtgtacac
      241 cggcgcacac caaaaacgtc acttttgcca catccgtcgc ttacatgtgt tccgccacac
      301 ttgcaacatc acacttccgc cacactacta cgtcacccgc cccgttccca cgccccgcgc
      361 cacgtcacaa actccacccc ctcattatca tattggcttc aatccaaaat aaggtatatt
      421 attgatgata tataatatag aggctgaggc aggagggttg cttgagcctg ggaggctgcg
      481 gctgcagtga gccatgatca caccactgca cttcagcctg ggcaacagag cgagactccg
      541 tctcaaaaaa tagtaatagg ccaggcgcgg tggctcatgc cagtaatccc agcgctttgg
      601 gaggctgagg caggcagatc acctgaggtc aggagttcaa gaccagcctg gccaacatgg
      661 tgaaaccctg tctctactaa aaatacaaaa attagttggg catggtggcg gcgcacacca
      721 gtaatcccag ctactcaaga ggctgagggc acttgggcag cagaggaggt tgtagtgagc
      781 agagatcacg ccactgcact ctagcctggg caacagaggg agactctgtc tcaaaaaata
      841 ataggccggg cacggtggct catgcctgta atcccagcac tttgggaggc cgaggcgggt
      901 ggatcacctg agatcaggag tttaagacca gcctggccaa catggtgaaa ccccatctct
      961 actaaaaata caaaaaatta gccaggcgtg gtggcgggcg tctgtaatcc cagctacttg
     1021 ggaggctgaa gcaggagaat cacttgagcc caggaggcag aggttgcagt gagctgagat
     1081 catgccatta cactccagcc tgggcaacaa gagtgaaact ccgtctcaaa aaataataat
     1141 aataatgata ataataataa taattgcatc tattcactga gcgcctcctg tacaccaggc
     1201 accatgtggc attaggggcc catgctgtca tcatattctg ccagccaccc atggggtggt
     1261 ggggtaattc cccaggccac acctgtccac acctctcttc aggtactgcg tgggcatgtg
     1321 cggagacggc gccaatgact gtggggccct gaaggcggct gatgtcggca tctcgctgtc
     1381 ccaggcagaa gcctcagtgg tctcaccctt cacctcgagc atggccagta ttgagtgcgt
     1441 gcccatggtc atcaggtaag gcaggcaggg accggtgggt gagggctcag cagggctggt
     1501 cagccaacct gggtccccta atgtccgtgc cctgccaccc agggaggggc gctgttccct
     1561 tgacacttcg ttcagcgtct tcaagtacat ggctctgtac agcctgaccc agttcatctc
     1621 cgtcctgatc ctctacacgg tgagtatctg cagagctccg tacccagagc cgtcccagga
     1681 ctagggaggg gacacagccc tgtgccttgg agctggcagg tcccagatag atgacgtggg
     1741 cccagatcct gaccctgcca ctctccggct gtgtggttgg agaggtggca aggcttgggg
     1801 gagatgagaa aaacaccagg cacggagcct ggtacccact aggtcctcac aaatggcagc
     1861 agctgtcatg ggctaacatc ttgaccgcaa ctctgggagc tccccaatct gcagggaaga
     1921 ctgaacggag gcagggccaa aaccct
//
LOCUS       MCLPCCA      1701 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  M.laminosus phycoerythrocyanin (PCC) alpha and beta subunit genes,
            complete cds, and linker protein, 5' end.
ACCESSION   M34254
KEYWORDS    light-harvesting biliprotein; phycoerythrocyanin.
SOURCE      M.laminosus (strain Fischerella PCC 7603) DNA.
  ORGANISM  Mastigocladus laminosus
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria.
REFERENCE   1  (bases 1 to 1701)
  AUTHORS   Eberlein,M. and Kufer,W.
  TITLE     Genes encoding both subunits of phycoerythrocyanin, a light-
            harvesting, biliprotein from the cyanobacterium Mastigocladus
            laminosus
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.Kufer, 11-MAY-1990.
               Botanisches Institut
               Ludwig-Maximilians- Universitaet Muenchen
               Menzingerstr. 67
               D-8000 Muenchen 19
               Federal Republic of Germany
FEATURES       from  to/span     description
    pept        279      797     phycoerythrocyanin beta-subunit
    pept        884     1372     phycoerythrocyanin alpha-subunit
    pept       1528  >  1701     phycoerythrocyanin linker protein precursor
    matp       1531  >  1701     phycoerythrocyanin linker protein
BASE COUNT      526 a    312 c    383 g    480 t
ORIGIN      
        1 agctttacat ttaaggatat ggcgctcaaa gaatgggagt tttacagtta ttaaataaag
       61 ttcataaaaa agaatacaag taatatataa aatacgaaac taaggtttga aagtaattgc
      121 aaatatcaag gcaaacgaaa agataaaggc taattgttgc acgtaaaaat ggtatttttt
      181 tgatttttag gtgtagttat tggcggattt tatctggtct cggtttgtct tgattaaaaa
      241 tgcaatagct gccatgaaat cgaggagaaa atagaaaaat gcttgatgct ttttccagag
      301 tagttgaaca agctgataaa aaaggtgctt acctaagcaa tgatgaaata aatgctttgc
      361 aggctatagt tgcagatagc aacaaacgtt tggatgtagt gaatcggctt acaagcaacg
      421 cctcctcaat tgtggcgaat gcttatcgtg ctttagtagc agaacgtccg caggtattta
      481 atcctggtgg tccttgtttt caccatcgca accaagctgc ttgcatccgc gatttaggat
      541 ttattctgcg ttatgttacg tactccgtac ttgctggtga cactagtgtc atggacgatc
      601 gctgcttgaa tggtttgcgc gaaacctatc aagctcttgg tactcccggt gatgctgtag
      661 cttctggaat taaaaaaatg aaagaagctg cacttaaaat tgctaacgat cccaatggca
      721 ttaccaaagg tgattgcagt caattaatgt ctgaattagc aagttatttt gaccgagcag
      781 cggctgctgt tgcttaatgc agtcaaaggc gcgactaagt gacaaaattt tcaaactctt
      841 ttttagtaat taatttgaca agtttttgga gaagaagcga atcatgaaga ccccgttaac
      901 cgaagcgatc gccgccgccg atcttcgcgg ttcttatttg agcaacacag aattgcaagc
      961 agtatttggt cgtttcaatc gcgctcgtgc tggtttagaa gcagcaagag cattcgctaa
     1021 caacggtaag aaatgggcgg aagcagcagc caatcacgta tatcaaaaat ttccctacac
     1081 tacccaaatg caaggacctc agtatgcttc tactccagaa gggaaagcaa agtgcgtgcg
     1141 ggatatcgac cattacttac gcacaattag ctactgctgt gtagttggcg gtactggtcc
     1201 cttggatgac tatgtagttg caggcttaaa ggaatttaac agcgccctgg gtttatctcc
     1261 cagctggtac atagctgctc tagaatttgt ccgcgacaat cacggtttga ccggtgatgt
     1321 tgctggtgaa gccaacacct acatcaacta tgcaattaat gcattgagct aaatctaaac
     1381 agcgaatgtc ctttgtattt gcttaatagc taatggctga tagctgatgg ctaatggcta
     1441 atggaaaaat gcaattagca attagcagtg agttattggc cattaactgt gtttgagaaa
     1501 aatgataaaa atggagcgat tgttaacatg agtacatcag tcgcagaacg cctcgcaatt
     1561 aaagatgaag tagacaaaaa aattgaattg cgccctaatt ggagcgaaga cgagctgcaa
     1621 attgttttta aaacagccta cgagcaagtt tttggacgtc agggattata tgccagtcaa
     1681 agatttgcaa ctgccgaagc t
//
LOCUS       YSCTSDNAA     238 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-ii.
ACCESSION   M34310
KEYWORDS    .
SOURCE      S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-ii.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 238)
  AUTHORS   Wang,S.-S. and Zakian,V.A.
  TITLE     Sequencing of Saccharomyces telomeres cloned using T4 DNA
            polymerase reveals two domains
  JOURNAL   Mol. Cell. Biol. 10, 4415-4419 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-S.Wang, 16-MAY-1990.
BASE COUNT        0 a      0 c    146 g     92 t
ORIGIN      
        1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg
       61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg
      121 gtgtggtgtg tgggtgtggt gtgtgggtgt ggtgtgggtg tgggtgtggt gtgtgtgggt
      181 gtggtgtgtg ggtgtgggtg tggtgtggtg tgtgtgggtg tgtgggtgtg gtgtgtgt
//
LOCUS       YSCTSDNAB     297 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-i.
ACCESSION   M34311
KEYWORDS    .
SOURCE      S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-i.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 297)
  AUTHORS   Wang,S.-S. and Zakian,V.A.
  TITLE     Sequencing of Saccharomyces telomeres cloned using T4 DNA
            polymerase reveals two domains
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-S.Wang, 16-MAY-1990.
BASE COUNT        0 a      0 c    184 g    113 t
ORIGIN      
        1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg
       61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg
      121 gtgtggtgtg ggtgtgggtg tgggtgtggt gtgtgtgtgg gtgtggtgtg tgggtgtggg
      181 tgtgtgtggg tgtgggtgtg ggtgtggtgt ggtgtgtggg tgtggtgtgt gggtgtggtg
      241 tgtgtgggtg tggtgtgtgt gggtgtgggt gtgtgtgggt gtgggtgtgg tgtgtgg
//
LOCUS       YSCTSDNAC     325 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-ii.
ACCESSION   M34312
KEYWORDS    .
SOURCE      S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-4-ii.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 325)
  AUTHORS   Wang,S.-S. and Zakian,V.A.
  TITLE     Sequencing of Saccharomyces telomeres cloned using T4 DNA
            polymerase reveals two domains
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-S.Wang, 16-MAY-1990.
BASE COUNT        0 a      0 c    199 g    126 t
ORIGIN      
        1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg
       61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg
      121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg gtgtgtgggt gtggtgtggg tgtggtgtgt
      181 gggtgtgtgg gtgtgggtgt gtgtggtgtg tggtgtgtgt ggtgtgggtg tggtgtgtgg
      241 gtgtgggtgt ggtgtgtggg tgtgggtgtg gtgtgtgtgt gtgggtgtgg tgtgtgggtg
      301 tgggtgtggt gtgtgggtgt gtggg
//
LOCUS       YSCTSDNAD     248 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-i.
ACCESSION   M34313
KEYWORDS    .
SOURCE      S.cerevisiae (strain XS595-6C) extrachromosomal plasmid DNA, clone
            YLP108CA-4-i.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 248)
  AUTHORS   Wang,S.-S. and Zakian,V.A.
  TITLE     Sequencing of Saccharomyces telomeres cloned using T4 DNA
            polymerase reveals two domains
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.-S.Wang, 16-MAY-1990.
BASE COUNT        0 a      0 c    151 g     97 t
ORIGIN      
        1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg
       61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg
      121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg tgtgtgtggg tgtggtgtgg gtgtggtgtg
      181 tgggtgtgtg tgggtgtggt gtgtgtgggt gtgtgggtgt ggtgtgtgtg ggtgtggtgt
      241 gtgggtgg
//
LOCUS       PLYORIA       375 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Human polyomavirus BK (strain WWT) origin of replication.
ACCESSION   M34048
KEYWORDS    origin of replication.
SOURCE      Polyomavirus BK (strain WWT) DNA, from human urine.
  ORGANISM  Polyomavirus BK
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Polyomaviruses.
REFERENCE   1  (bases 1 to 375)
  AUTHORS   Sundsfjord,A., Johansen,T., Flaegstad,T., Moens,U., Villand,P.,
            Subramani,S. and Traavik,T.
  TITLE     At least two type of control regions can be found among naturally
            occurring BK virus strains
  JOURNAL   J. Virol. 64, 3864-3871 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Johansen, 07-MAY-1990.
               University of Tromso
               P.O. Box 977
               N-9001, Tromso, NORWAY
               e-mail: terje.johansen@esuit.uit.no
FEATURES       from  to/span     description
    orgrpl       97      119     polyomavirus origin of replication
    site          1      375     regulatory region
    site        143      210     P block
    site        211      249     Q block
    site        250      302     R block
BASE COUNT      123 a     86 c     95 g     71 t
ORIGIN      
        1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
       61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
      121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
      181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
      241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt
      301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta
      361 aactggacaa aggcc
//
LOCUS       PLYORIB       424 bp ds-DNA             VRL       14-AUG-1990
DEFINITION  Human polyomavirus BK (strain UT) origin of replication.
ACCESSION   M34049
KEYWORDS    origin of replication.
SOURCE      Human polyomavirus BK (strain UT) DNA, from human urine.
  ORGANISM  Polyomavirus BK
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae; 
            Polyomaviruses.
REFERENCE   1  (bases 1 to 424)
  AUTHORS   Sundsfjord,A., Johansen,T., Flaegstad,T., Moens,U., Villand,P.,
            Subramani,S. and Traavik,T.
  TITLE     At least two type of control regions can be found among naturally
            occurring BK virus strains
  JOURNAL   J. Virol. 64, 3864-3871 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.Johansen, 07-MAY-1990.
               University of Tromso
               P.O. Box 977
               N-9001, Tromso, NORWAY
               e-mail: terje.johansen@esuit.uit.no
FEATURES       from  to/span     description
    orgrpl       97      119     polyomavirus origin of replication
    site          1      424     regulatory region
    site        143      210     P block
    site        211      249     Q block
    site        250      261     R block
    site        263      315     P block
    site        316      349     Q block
    site        350      361     R block
BASE COUNT      134 a    103 c    105 g     82 t
ORIGIN      
        1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa
       61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct
      121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa
      181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga
      241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa
      301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc
      361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa
      421 ggcc
//
LOCUS       MXAFRZE      2493 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  M.xanthus gliding motility regulatory protein (frzE) gene, complete
            cds.
ACCESSION   M35192
KEYWORDS    gliding motility regulatory protein.
SOURCE      M.xanthus (strain DZF1) DNA.
  ORGANISM  Myxococcus xanthus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; 
            Myxococcaceae.
REFERENCE   1  (bases 1 to 2493)
  AUTHORS   McCleary,W.R. and Zusman,D.R.
  TITLE     FrzE of Myxococcus xanthus is homologous to both CheA and CheY of
            Salmonella typhimurium
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5898-5902 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.R.McCleary, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         64     2397     gliding motility regulatory protein (frzE)
BASE COUNT      375 a    848 c    926 g    344 t
ORIGIN      
        1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg
       61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc
      121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag
      181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt
      241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag
      301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc
      361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc
      421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg
      481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc
      541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc
      601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc
      661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc
      721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc
      781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg
      841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg
      901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc
      961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc
     1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag
     1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg
     1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg
     1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc
     1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc
     1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg
     1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc
     1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag
     1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc
     1561 accatcaccc tgcgcctgcc gcagtcgctg gcgttgatga aggtgctgct ggtgcgcctg
     1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg
     1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg
     1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc
     1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac
     1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc
     1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc
     1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc
     2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag
     2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag
     2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg
     2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg
     2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc
     2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga
     2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg
     2461 caaggggctg cgtgcgctcg cggcccgggg cct
//
LOCUS       RATPCBB       463 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Rat PCB-binding protein mRNA, complete cds..
ACCESSION   J05536
KEYWORDS    PCB binding protein; binding protein.
SOURCE      Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone
            pPCB6.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 463)
  AUTHORS   Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J.,
            Gillner,M., Gustafsson,J.-A. and Lund,J.
  TITLE     Cloning, structure, and expression of a rat binding protein for
            polychlorinated biphenyls: Homology to the hormonally regulated
            progesterone-binding protein uteroglobin
  JOURNAL   J. Biol. Chem. 265, 12690-12693 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            J.Lund, 15-JUN-1990.
FEATURES       from  to/span     description
    pept         58      348     PCB binding protein precursor
    sigp         58      114     PCB binding protein signal peptide
    matp        115      345     PCB binding protein
BASE COUNT      117 a    156 c     84 g    106 t
ORIGIN      
        1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg
       61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg
      121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat
      181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag
      241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg
      301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat
      361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta
      421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc
//
LOCUS       YSBCPR       2990 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds.
ACCESSION   M35199
KEYWORDS    NADPH-cytochrome P450 reductase.
SOURCE      C.tropicalis (ATCC 750) DNA, clone pTS1.
  ORGANISM  Candida tropicalis
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; 
            Deuteromycotina.
REFERENCE   1  (bases 1 to 2990)
  AUTHORS   Sutter,T.R., Sanglard,D. and Loper,J.C.
  TITLE     Isolation and characterization of the alkane-inducible NADPH-
            cytochrome P450 oxidoreductase gene from Candida tropicalis:
            Identification of invariant residues within similar amino acid
            sequences of divergent flavoproteins
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            T.R.Sutter, 15-JUN-1990.
            
            Author name and address
               T.R.Sutter
               CIIT, P.O. Box 12137
               Research Triangle Park, NC 27709
FEATURES       from  to/span     description
    pept        554     2596     NADPH-cytochrome P450 reductase
    signal      198      204     TATA box
BASE COUNT      988 a    442 c    558 g   1002 t
ORIGIN      
        1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa
       61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg
      121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag
      181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag
      241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag
      301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa
      361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata
      421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta
      481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc
      541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt
      601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt
      661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa
      721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa
      781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc
      841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt
      901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac
      961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac
     1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg
     1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt
     1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa
     1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga
     1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt
     1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga
     1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa
     1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat
     1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt gatactgtta ttgaattgaa
     1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat
     1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg
     1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga
     1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc
     1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt
     1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt
     1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac
     1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca
     2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag
     2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg
     2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg
     2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt
     2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa
     2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa
     2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc
     2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga
     2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca
     2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta
     2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga
     2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat
     2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc
     2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta
     2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt
     2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta
//
LOCUS       HLTLYS        642 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  H.rufescens sperm lysin mRNA, complete cds.
ACCESSION   M34388
KEYWORDS    sperm lysin.
SOURCE      H.rufescens testis, cDNA to mRNA, clones r13 and r14.
  ORGANISM  Haliotis rufescens
            Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; 
            Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae.
REFERENCE   1  (bases 1 to 642)
  AUTHORS   Vacquier,V.D., Carner,K.R. and Stout,C.D.
  TITLE     Species-specific sequences of abalone sperm lysin, the sperm
            protein that creates a hole in the egg envelope
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5792-5796 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by V.D.Vacquier, 22-MAY-1990.
FEATURES       from  to/span     description
    pept         16      480     sperm lysin precursor
    sigp         16       69     sperm lysin signal peptide
    matp         70      477     sperm lysin
    mRNA    <     1      642     sperm lysin mRNA
    signal      625      630     poly-A signal
BASE COUNT      202 a    124 c    151 g    165 t
ORIGIN      
        1 cgaacagatt acaagatgaa gctgttggtg ctttgcattt ttgcgatgat ggcgacattg
       61 gcaatgtcac gcagctggca ttatgttgaa cccaagtttt taaacaaggc ttttgaagtt
      121 gcactgaagg tacagatcat tgctggattc gatagagggc tggtcaaatg gcttagagtt
      181 catggcagaa cattgtctac cgttcagaag aaggcgctgt acttcgtcaa tagacgttac
      241 atgcagaccc actgggcgaa ttacatgctc tggataaata agaagatcga tgcactgggc
      301 agaactccag ttgttggtga ctacacacgg ttaggtgccg aaattggaag acgtattgat
      361 atggcttact tttacgactt cctgaaagat aagaacatga tcccaaaata tctgccatac
      421 atggaagaaa ttaatagaat gcgcccggca gatgtcccag ttaaatacat gggcaagtga
      481 aacccatgaa gaagacagct tcatgcaccg tgcatgtcaa cagatacaga aaacatcgtc
      541 aaacacgggg agctatgaca ccatcctaac atgggttggc tatttgttgc acgattttta
      601 aaacggaaag attgtacatt ccagaataaa tagatttact ac
//
LOCUS       HLTLYS2       650 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  H.corrugata sperm lysin mRNA, complete cds.
ACCESSION   M34389
KEYWORDS    sperm lysin.
SOURCE      H.corrugata testis, cDNA to mRNA, clones p1 and p3.
  ORGANISM  Haliotis corrugata
            Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; 
            Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae.
REFERENCE   1  (bases 1 to 650)
  AUTHORS   Vacquier,V.D., Carner,K.R. and Stout,C.D.
  TITLE     Species-specific sequences of abalone sperm lysin, the sperm
            protein that creates a hole in the egg envelope
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5792-5796 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by V.D.Vacquier, 22-MAY-1990.
FEATURES       from  to/span     description
    pept         16      483     sperm lysin precursor
    sigp         16       69     sperm lysin signal peptide
    matp         70      480     sperm lysin
    mRNA          1      650     sperm lysin mRNA
    signal      635      640     poly-A signal
BASE COUNT      194 a    134 c    162 g    160 t
ORIGIN      
        1 ggaacagatt acaagatgaa gctgttggtg ctttgccttt ttgcgatgat ggcgacattg
       61 gctgtgtcgc gccaccgctt ccgttttatt ccacataagt atatacgcaa ggagtttgaa
      121 gttgcactga aggtagagat cattgccgga ttcgatagaa cgctggtcaa atggcttaga
      181 gttcatggcg gacgtttgag tacagttcag aagaaggcgc tgtacttcgt caatagacgt
      241 tacatgcaga cccactggca gaattacatg ctctggatag tgaggaagac cgatgcactg
      301 ggcagacctc cagttgttgc tgactacagt cggttaggtg ccgaaattgg aagacgtatt
      361 gatatggcat atttttacaa cttcctgaac gggaggaaca tgatcccaaa atatctgcca
      421 tacatggagg aaattaatag aatgcgcccg gcagatgtcc cagttgcaaa cagaggcaag
      481 tgaaaccaac gaagaagaaa gctgcatgca ccctgcatgt caacagatac agaaaacatc
      541 gtccgtcaaa cacgaggagc tgtgacagca tccatcccaa catgggttgg ctatttgttg
      601 cacgatttta acacagaaag attgtacatt ccagaataaa tagatttact
//
LOCUS       HUMROSSA     1843 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human 52 kD ribonucleoprotein Ro/SSA, complete cds.
ACCESSION   M34551
KEYWORDS    ribonucleoprotein.
SOURCE      Human thymocyte, cDNA to mRNA, clone FI18.1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1843)
  AUTHORS   Itoh,K., Itoh,Y. and Frank,M.B.
  TITLE     Heterogeneity in human Ro/SSA ribonucleoproteins: The 52 and 60 kd
            Ro/SSA proteins are encoded by separate genes
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by M.B.Frank, 18-MAY-1990.
            
                             Author address: M.B.Frank
                             Oklahoma Medical Res Foundation
                             Arthritis and Immunology program
                             825 WE 13th St, Oklahoma City, OK 73104
FEATURES       from  to/span     description
    pept         39     1466     52 kD Ro/SSA ribonucleoprotein
    signal     1827     1832     poly-A signal
BASE COUNT      474 a    458 c    488 g    423 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcgggc acactgctgt ttaacggcac acttgacaat ggcttcagca gcacgcttga
       61 caatgatgtg ggaggaggtc acatgcccta tctgcctgga ccccttcgtg gagcctgtga
      121 gcatcgagtg tggccacagc ttctgccagg aatgcatctc tcaggttggg aaaggtgggg
      181 gcagcgtctg tcctgtgtgc cggcagcgct ttctgctcaa gaatctccgg cccaatcgac
      241 agctagccaa catggtgaac aaccttaaag aaatcagcca ggaggccaga gagggcacac
      301 agggggaacg gtgtgcagtg catggagaga gacttcacct gttctgtgag aaagatggga
      361 aggccctttg ctgggtatgt gcccagtctc ggaaacaccg tgaccacgcc atggtccctc
      421 ttgaggaggc tgcacaggag taccaggaga agctccaggt ggcattaggg gaactgagaa
      481 gaaagcagga gttggctgag aagttggaag tggaaattgc aataaagaga gcagactgga
      541 agaaaacagt ggaaacacag aaatctagga ttcacgcaga gtttgtgcag caaaaaaact
      601 tcctggttga agaagaacag aggcagctgc aggagctgga gaaggatgag agggagcagc
      661 tgagaatcct gggggagaaa gaggccaagc tggcccagca gagccaggcc ctacaggagc
      721 tcatctcaga gctagatcga aggtgccaca gctcagcact ggaactgctg caggaggtga
      781 taattgtcct ggaaaggagt gagtcctgga acctgaagga cctggatatt acctctccag
      841 aactcaggag tgtgtgccat gtgccagggc tgaagaagat gctgaggaca tgtgcagtcc
      901 acatcactct ggatccagac acagccaatc cgtggctgat actttcagaa gatcggagac
      961 aagtgaggct tggagacacc cagcagagca tacctggaaa tgaagagaga tttgatagtt
     1021 atcctatggt cctgggtgcc cagcactttc actctggaaa acattactgg gaggtagatg
     1081 tgacaggaaa ggaggcctgg gacctgggtg tctgcagaga ctctgtgcgc aggaaggggc
     1141 actttttgct tagttccaag agtggcttct ggacaatttg gttgtggaac aaacaaaaat
     1201 atgaggctgg cacctacccc cagactcccc tccaccttca ggtgcctcca tgccaagttg
     1261 ggattttcct ggactatgag gctggcatgg tctccttcta caacatcact gaccatggct
     1321 ccctcatcta ctccttctct gaatgtgcct ttacaggacc tctgcggccc ttcttcagtc
     1381 ctggtttcaa tgatggagga aaaaacacag cccctctaac cctctgtcca ctgaatattg
     1441 gatcacaagg atccactgac tattgatggc tttctctgga cactgccact ctccccattg
     1501 gcaccgcttc tcagccacaa accctgcctc ttttccccat gaactctgaa ccacctttgt
     1561 ctctgcagag gcatccggat cccagcaagc gagctttagc agggaagtca cttcaccatc
     1621 aacattcctg ccccagatgg ctttgtgatt ccctccagtg aagcagcctc cttatatttg
     1681 gcccaaactc atcttgatca accaaaaaca tgtttctgcc ttctttatgg gacttaagtt
     1741 ttttttttct cctctccatc tctaggatgt cgtctttggt gagatctcta ttatatcttg
     1801 tatggtttgc aaaagggctt cctaaaaata aaaacccgaa ttc
//
LOCUS       TRBBS12      4004 bp ds-DNA             INV       14-AUG-1990
DEFINITION  T.brucei expression site-associated protein (BS1.1 and BS1.2)
            genes, 5' end and complete cds.
ACCESSION   M34520
KEYWORDS    expression site-associated protein.
SOURCE      T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA,
            clone BS1.1-2.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 4004)
  AUTHORS   Hobbs,M.R. and Boothroyd,J.C.
  TITLE     An expression-site-associated gene family of trypanosomes is
            expressed in vivo and shows homology to a VSG gene
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Hobbs, 22-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1      129     expression site-associated protein (BS1.1)
    pept       1318     2151     expression site-associated protein (BS1.2)
    site       3083     3604     RIME sequence
BASE COUNT     1181 a    650 c   1052 g   1121 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattcaact atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg
       61 gcagcagtac atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga
      121 gtgctgtgaa ggaaggatgc gacagaacgt gcgctgctta gtgtgaaaga ttatggtaat
      181 ggagggttgt gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg
      241 ggcaaaagta acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa
      301 ggaaagatgt gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg
      361 agatgtaagg ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa
      421 tttagttaca acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt
      481 ctactgttta atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc
      541 gaatttcact tctccagaat ctttatttta ggtgcgctat agatgatagt ccaaagagaa
      601 ggaattacaa ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga
      661 ttcttttcta tgtcacttgt gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg
      721 aaatggggcg tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt
      781 atgatgcata gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta
      841 ggtggaagcg gctaatgatt gctagcagca ccaacagagg aagttaattg ttgacgtagc
      901 cgtttgtgaa tgagatagaa ggttattggg tggaacatta tcttggaaag agtttttctt
      961 ccactaaaaa tgtgtagagt ggtggtttcg agaaaatggt ttagatattg aggaagaaac
     1021 aaggatttga ggtgcaagtt gatgcgctgt atgatttgtc ttgccgttgc tgttacttat
     1081 tgttgagttg gtatttgagg aaaatgtttt ttcagtgtgc gttgttatat acggctgtga
     1141 gttggtgatg tttatttgaa tgatagcttg ataggataag atgaaataat actaagcatg
     1201 tattcttctg gctgttcgcc gaaatggtaa tctatgttac ccatagcacg aaaaaagttt
     1261 ttttactcag taactgaagc ggaaagtttt aaaaagcata atttaaaaaa gttgataatg
     1321 atgaagtttt ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa
     1381 aataaaagga atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgaactat
     1441 cttaaaggaa tagcgcacag agtaaacagc gagagtgccg tggttacgaa aaaactatca
     1501 gatttgaaaa tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct
     1561 ggcgagcagg attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt
     1621 acattccagc aggagttgga ggaaatgagg aagccatccg cgttagcagc agctgcagct
     1681 gggatagcag ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg
     1741 tcatcacagt tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa
     1801 gaatgttttg atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta
     1861 aaagagtcgg cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat
     1921 agcttgggtg cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga
     1981 acagatactg acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt
     2041 ggtggagtga tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa
     2101 tacgtgatgc tacagagtac ggtgcagtac ggtggaccga ggatccagta aagtgagcat
     2161 atttaaggat gtcatccgcc tgtttgcgcg cttccaagaa gcaaaaaatg aagtgatgaa
     2221 taaaataaaa actactgtgg atgaattggc gaaatgcata gggcagaaag aggttgagct
     2281 cactgatgat cagctttacg aggaatttat ctgggagacc atacacagat tggagctgtc
     2341 aaagagagtg agtgaacaac tctctttagg tgaagaagag gagacgatcc tgaaatctaa
     2401 ctatactgca gaaccggtcc gtggactttc acggtagcgg ggtccaacgc ggcagcagta
     2461 catttgagtg tttctaccgc tgcactttgt ttttcagttt tattgttggg agtgctgtga
     2521 aggaaggatg cgacagaagc tgcgctgctt agtgtgaaag attatggtaa tggagggttg
     2581 tgaaagattg ggggaacaaa aacctatttc ttttatttgg gggaacaaat gggcaaaagt
     2641 aacgtaagtt tccagtggga gtggtatgtg tgtgtgtatg gggctggcta aggaaagatg
     2701 tgagttcggc atgtggtatg tacaagctac gaaaacgtgt gaaacaaaac gagatgtaag
     2761 gggaaaatgt aacaaccaac tatgttaaat ttcaggagac tatttttcaa atttagttac
     2821 aacaaagtaa atgtcaaata atgccaacta taaaggaaaa ctagactaat tctactgttt
     2881 aatgtgacga gaattgtggc gtgaaaccta gtaactgctt atcatggcgt cgaatttcac
     2941 ttctccagaa tctttatttt agtgcgctat agatgatagt ccaaagagaa ggaattacaa
     3001 ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga ttcttttcta
     3061 tgtctcttgt gaaggtcgat ctttttgcta ccccctggcg atgccggcca cctcaacgtg
     3121 gtgccagggt ccagtgaccc cgtatcattg ggggaagcca agagccagca gcgttccttt
     3181 catggggaac actgctgtag ctccggctac gcatcataca gcacagggat cagcaggcgt
     3241 cttgctggga caccgttttt catttgtcgg tccctgggca cgtgccagcg tcgatcagca
     3301 gtatcatccg cgctaagatg ctgctgtccg gtgatgtgga cctccaaaaa aggattgcca
     3361 ttttggcatc ttttggagag tccagggtgg gaggcttctc gccccatctg ctgtattccg
     3421 ttcaactgcg gacgtacaac aaaaaattat agagggtgtg ttaggatgaa tgaaaaaggg
     3481 agactctgcc acagtcgcca gaccgatagc actctcaggg ctctacggtg ttggctgatg
     3541 gccgcgcaga ggggggaaac tctcacgaag gcacgaagaa aattcaaaaa aaaatttttg
     3601 ctaccacagg ctgtaaggat aacgctggag atggagccga ctgtggtttt tcttgtgaat
     3661 gcgtatcttc gtaaccggtt gttaagaagg gatcggacga aggaatttgt ctgagtatga
     3721 ggtaatgtcg aggttttctt tcattggatg tagtaataat ctccaaattt gttggaacat
     3781 ataatattat tgcaataaat gcttgaagag tggaattatt gatggattga cttgagtatc
     3841 atgattccaa tgaacactct tataatcctt cgaatagcaa acgtatgaac aaattaacgt
     3901 agcgtcatgt agacatggac atttttgtaa tgatacgact gctattgcac atctgaaaat
     3961 ttctatgacg gaccagtgcc gttaatcacc gtacaattga attc
//
LOCUS       TRBBS134     2396 bp ds-DNA             INV       14-AUG-1990
DEFINITION  T.brucei expression site-associated protein (BS1.3 and BS1.4)
            genes, 3' and 5' ends.
ACCESSION   M34521
KEYWORDS    expression site-associated protein.
SOURCE      T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA,
            clone BS1.3-4.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 2396)
  AUTHORS   Hobbs,M.R. and Boothroyd,J.C.
  TITLE     An expression-site-associated gene family of trypanosomes is
            expressed in vivo and shows homology to a VSG gene
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Hobbs, 22-MAY-1990.
FEATURES       from  to/span     description
    pept       1836  >  2396     expression site-associated protein (BS1.4)
    ORF     <     1     1206     expression site-associated protein (BS1.3)
BASE COUNT      744 a    357 c    642 g    653 t
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttgcaa ctccatgaag tgttatcatc catttccttc ggtagcttgg gtgtgaaaaa
       61 tattcgagga ggaaatggga gggatggatg caatttggtg agaacagaca ctaacggtat
      121 attaaatggg ggttcaccaa cacggcacaa cttaacgtgg ggtggtggag tgatgaattt
      181 tggatcctat caaaacggaa gtatgtacgt tgagggtggt gaatacggtg atgctacaga
      241 gtacggtgcg gtacggtgga ctgaagatcc cagtaaagtg agcatatttg aggatgtcat
      301 tcgcctgttt gcgcgcttcc aagaagcaaa aaatgcagtg atgaggagaa taaaaactac
      361 tgtggatgaa ttgacgaaat gtatagggca gaaggaggct gaactcacta atgatcagat
      421 ttacgaggaa tttatctggg agaccataca cagattggag ctgtcaaaga gagtgagtga
      481 acaaccctct ttaggtgaag aagaggagac aatcctgaaa tctaactata ctgcagaacc
      541 tgtccgtgga cctttcacgg gagcggggtc caacacagta gcagtacatt tgagtgtttc
      601 taccgctgca ctctgttgtt tagctttatt gttgggagtg ctgtgaggga aggatgcgac
      661 agaaactgcg ctgcttagcg tgaaagatta tggtaatgga gggttgggaa agattggggg
      721 aacaaaaacc tatttctttt atttggggga acaaatgggc aaaagtaacg tgagtttcca
      781 gcgggagtgg tgtgtgtgtg catggggctg actaaggaaa gatgtaagtt cggaatgtgg
      841 tatgtacaag ctacgaaaac gtgtgaaaca aaacgatatg taaggggaaa atgtaacaac
      901 caactatgtt aaatttcaag agactatttt tcaaatttag ttacaacaaa gtaaatgtca
      961 aataatgcca actataaagg aaaactagac taattctact gtttaatgtg acgagaattg
     1021 tggcgtgaaa cctagtaacg accacgtcac gatgtggaat ttcacttctc cagaatcttt
     1081 gttttagtgc gctatagatg atagtccaaa gagaaggaat tgcagttatt ataacccatc
     1141 caaagataga tcaaagactc ctcttctccg agtgattctt ttctatgtca cttgtggaag
     1201 gtcgatcttg gaacaatatc gaacgtgtga agcctgaaat gaggcgttat tattgcttct
     1261 ctttttggtg gttgttagag ttatataccg ctatggtatg atgcacagag gtaacgctaa
     1321 aaataagtga caaggcgttg tagtagggcg tatggtagtt ggaagcggct aatgattgct
     1381 ggcctcacca acagaggaag ttaattgttg acgtagccgt ttgtgaatga gatagaaggt
     1441 tatcgggtgg aacattatct tggaaagagt ttttcatcac ctaaaaatgt gtagagtggt
     1501 ggtttggaga aaatggttta gatattgagg aggaaacaag gatttgaggt gcaagttgat
     1561 gcgctgtaag atttgtatta ccgttggtgt tacttattgt tgagtgagta tttgagggaa
     1621 atgttatggg aggtgcttcg ttgttacata cggctctgag tcagtggtgt tcatttgaat
     1681 gaaatcttga taggataaga tgaaataata ctaagcatgt attcttctgt gctgccgccg
     1741 aaatggtaat ctatcttacc tacagcacga aaaatttatt ttttttactc agtaacaaag
     1801 cggaaagttt taaaagcata atttaaaaaa gtgatatgat gaggttttgg tttgtgttgt
     1861 tggccctttt gggaaaaaaa acacatgcat attatgaaaa tgaaaggaat gcattaaacg
     1921 caaccgccgc taataaagtt tgtgggctat cgaactatct taaaggagta gcgcacagag
     1981 taaacagcga gagtgctgtg gttacggaaa aactatcaga tttgaaaatg agaagcatcc
     2041 agttgcagct atcagtaatg cggaacagag tgccttctgg cgagaaggat tgtaaagaca
     2101 taaggacact cttgaaaaca gtattgagga atgagtttac attccagcag gagttggagg
     2161 aaatgaggaa cgcatccgcg ttagcagcag ctgcagctgg gttggcagct ggaagactgg
     2221 aagagtggat ttttgtattt gcacaggcag cagacaggtc atcacagttt tgcataagtg
     2281 tggggaagca tatcgcggct gagcacggca acttgcaaga atgttttgat ggaacaattg
     2341 gacctgaaac cctttacaaa attgaggatt cacgtgtaaa agagtcggcg cagaaa
//
LOCUS       TRBBS15       714 bp ss-mRNA            INV       14-AUG-1990
DEFINITION  T.brucei expression site-associated protein (BS1.5) gene, partial
            cds.
ACCESSION   M34522
KEYWORDS    expression site-associated protein.
SOURCE      T.brucei (strain 427) bloodstream stage isolate MiTat 1.4, cDNA to
            mRNA, clone BS1.5.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 714)
  AUTHORS   Hobbs,M.R. and Boothroyd,J.C.
  TITLE     An expression-site-associated gene family of trypanosomes is
            expressed in vivo and shows homology to a VSG gene
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Hobbs, 22-MAY-1990.
FEATURES       from  to/span     description
    pept    <     1  >   714     expression site-associated protein (BS1.5) (AA
                                 at 1)
BASE COUNT      226 a    116 c    203 g    169 t
ORIGIN      
        1 agcgagagtg ccgtggttac ggaaaaacta tcagatttga aaatgagaag catccagttg
       61 cagttatcag taatgcggaa cagagtgcct tctggcgaga aggattgtaa agacatcagg
      121 acactcttga aaacagtatt gaggaatgag tttacattcc agcaggagtt ggaggaaatg
      181 aggaacgcat ccgcgttagc agcagctgca gctgggttgg cagctggaag actggaagag
      241 tggatttttg tatttgcaca ggcagcagac aggtcatcac agttttgcat aagtgtgggg
      301 aaaactatcc caccggaaca aaacaacttg caagaatgtt ttgatggaac aattggacct
      361 gaaacccttt acaaaattga ggattcacgt gtaaaagagt ctgcaaagaa aagcttgcaa
      421 ctccatgaag cgttatcatc catttccttc aatagtttgg gtgcggaaag tatccgagga
      481 ggaaatggga aggatggatg taatttggtg agaacagaca ctgacggtat attaaatggg
      541 ggttcaccaa cacggcacaa tttaacgtgg ggtggtggag tgatgaattt tggatcctat
      601 caaaacggaa gtatgtacgt tgagggggga gaatacggtg atgctacaga gtacggtgca
      661 gtacggtgga ctgaagatcc cagtaaagta agcatattta aggatgtcat tcgc
//
LOCUS       ECOPOLBDA    4666 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  E.coli DNA polymerase (polB) gene, 5' flank.
ACCESSION   M35371
KEYWORDS    DNA polymerase; polB gene.
SOURCE      E.coli (strain W3110) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 4666)
  AUTHORS   Chen,H., Sun,Y., Stark,T., Beattie,W. and Moses,R.
  TITLE     Nucleotide sequence and deletion analysis of the polB gene of
            E.coli
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Chen, 20-JUN-1990.
            
              Author address: H.Chen
              Baylor College of Medicine
              Cell Biology and IMG
              One Baylor Plaza
              Houston, TX 77030
              email: hchen@vulture.bcm.tmc.edu
            
FEATURES       from  to/span     description
    signal     1209     1214     -35 region
    signal     1229     1234     -10 region
    binding    1250     1255     ribosome binding site
    site       1193     1208     lexA box
BASE COUNT     1100 a   1278 c   1257 g   1031 t
ORIGIN      
        1 aagcttgcat gcctgcaggt cgactctaga ggatcctcgc tggtggcgcg caccataccg
       61 tcttcagcca tgcactgaac ctcaacgata tgcgccaatt cgccgagatg cacgacattg
      121 aaatcacggt gattgataac gacacacgcc tgccagcgtt taaagacgcg ctgcgctgga
      181 acgaagtgta ttacgggttt cgtcgctaag tagccgcatc cggtatgtaa cgcctgatgc
      241 gacgctgacg cgtcttatct ggcctacacg ctgcgatttt gtaggccgga taagcaaagc
      301 gcatccggca ttcaacgcct gatgcgacgc tggcgcgtct tatcaggcct acgcgctgcg
      361 attttgtagg ccggataagc aaagcgcatc cggcattcaa cgcctgatgc gacgctggcg
      421 cgtcttatca ggcctacacg ctgcgatttt gtaggccgga taagcaaagc gcatccggca
      481 cgaaggagtc aacatgttag aagatctcaa acgccaggta ttagaagcca acctggcgct
      541 gccaaaacac aacctggtca cgctcacatg gggcaacgtc agcgccgttg atcgcgagcg
      601 cggcgtcttt gtgatcaaac cttccggcgt cgattacagc gtcatgaccg ctgacgatat
      661 ggtcgtggtt agcatcgaaa ccggtgaagt ggttgaaggt acgaaaaagc cctcctccga
      721 cacgccaact caccggctgc tctatcaggc attcccctcc attggcggca ttgtgcatac
      781 gcactcgcgc cacgccacca tctgggcgca ggcgggtcag tcgattccag caaccggcac
      841 cacccacgcc gactatttct acggcaccat tccctgcacc cgcaaaatga ccgacgcaga
      901 aatcaacggc gaatatgagt gggaaaccgg taacgtcatc gtagaaacct ttgaaaaaca
      961 gggtatcgat gcagcgcaaa tgcccggcgt tctggtccat tcccacggcc cgtttgcatg
     1021 gggcaaaaat gccgaagatg cggtgcataa cgccatcgtg ctggaagagg tcgcttatat
     1081 ggggatattc tgccgtcagt tagcgccgca gttaccggat atgcagcaaa cgctgctgga
     1141 taaacactat ctgcgtaagc atggcgcgaa ggcatattac gggcagtaat gactgtataa
     1201 aaccacagcc aatcaaacga aaccaggcta tactcaagcc tggttttttg atggattttc
     1261 agcgtggcgc aggcaggttt tatcttaacc cgacactggc gggacacccc gcaagggaca
     1321 gaagtctcct tctggctggc gacggacaac gggccgttgc aggttacgct tgcaccgcaa
     1381 gagtccgtgg cgtttattcc cgccgatcag gttccccgcg ctcagcatat tttgcagggt
     1441 gaacaaggct ttcgcctgac accgctggcg ttaaaggatt ttcaccgcca gccggtgtat
     1501 ggcctttact gtcgcgccca tcgccaattg atgaattacg aaaagcgcct gcgtgaaggt
     1561 ggcgttaccg tctacgaggc cgatgtgcgt ccgccagaac gctatctgat ggagcggttt
     1621 atcacctcac cggtgtgggt cgagggtgat atgcacaatg gcactatcgt taatgcccgt
     1681 ctgaaaccgc atcccgacta tcgtccgccg ctcaagtggg tttctataga tattgaaacc
     1741 acccgccacg gtgagctgta ctgcatcggc ctggaagcgt gcgggcagcg catcgtttat
     1801 atgctggggc cggagaatgg cgacgcctcc tcgcttgatt tcgaactgga atacgtcgcc
     1861 agccgcccgc agttgctgga aaaactcaac gcctggtttg ccaactacga tcctgatgtg
     1921 atcatcggtt ggaacgtggt gcagttcgat ctgcgaatgc tgcaaaaaca tgccgagcgt
     1981 taccgtcttc cgctgcgtct tgggcgcgat aatagcgagc tggagtggcg cgagcacggc
     2041 tttaaaaacg gcgtcttttt tgcccaggct aaaggtcggc taattatcga cggtatcgag
     2101 gcgctgaaat ccgcgttctg gaatttctct tcattctcgc tggaaactgt cgctcaggag
     2161 ctattaggcg aaggaaaatc tatcgataac ccgtgggatc gaatggacga aattgaccgc
     2221 cgtttcgccg aagataaacc tgcgctggca acttataacc tgaaagattg cgagctggtg
     2281 acgcagatct tccacaaaac tgaaatcatg ccatttttac tcgaacgggc aacggtgaac
     2341 ggcctgccgg tggaccgaca cggcggttcg gtggcggcat ttggtcatct ctattttccg
     2401 cgaatgcatc gcgctggtta tgtcgcgcct aatctcggcg aagtgccgcc gcacgccagc
     2461 cctggcggct acgtgatgga ttcacggcca gggctttatg attcagtgct ggtgctggac
     2521 tataaaagcc tgtacccgtc gatcatccgc acctttctga ttgatcccgt cgggctggtg
     2581 gaaggcatgg cgcagcctga tccagagcac agtaccgaag gttttctcga tgcctggttc
     2641 tcgcgagaaa aacattgcct gccggagatt gtgactaaca tctggcacgg gcgcgatgaa
     2701 gccaaacgcc agggtaacaa accgctgtcg caggcgctga aaatcatcat gaatgccttt
     2761 tatggcgtgc tcggcaccac cgcctgccgc ttcttcgatc cgcggctggc atcgtcgatc
     2821 accatgcgtg gtcatcagat catgcggcaa accaaagcgt tgattgaagc acagggctac
     2881 gacgttatct acggcgatac cgactcaacg tttgtctggc tgaaaggcgc acattcggaa
     2941 gaagaagcgg cgaaaatcgg tcgtgcactg gtgcagcacg ttaacgcctg gtgggcggaa
     3001 acgctgcaaa aacaacggct gaccagcgca ttagaactgg agtatgaaac ccatttctgc
     3061 cgttttctga tgccaaccat tcgcggagcc gataccggca gtaaaaagcg ttatgccgga
     3121 ctgattcagg agggcgacaa gcagcggatg gtgtttaaag ggctggaaac cgtgcgcacc
     3181 gactggacgc cgctggccca gcagtttcag caggagctat acctgcgcat cttccgcaac
     3241 gagccatatc aggaatatgt acgcgaaacc atcgacaaac tgatggcggg tgaactggat
     3301 gcgcgactgg tttaccgtaa acgccttcgc cgtccgctga gcgagtatca gcgtaatgtg
     3361 ccgcctcatg tacgcgccgc tcgccttgcc gatgaagaaa accaaaagcg tggtcgcccc
     3421 ttgcaatatc agaatcgcgg caccattaag tacgtatgga ccaccacagg cccggagccg
     3481 cctggactac caacgttcac cactggatta cgaacactat ctgacccgcc agctacaacc
     3541 cgtggcggag ggaatactcc cttttattga ggataatttt gctacactta tgaccgggca
     3601 acttgggcta ttttgagcaa aaaaaagagt tcgccagata ccattttgat gcgtgacgaa
     3661 tgctttgcca tccagtacca tagcgccctt tccattcctg gacctgaata acaccactac
     3721 ctcataagca cggtagcggg tggttattgc ctgcaattaa agatatagag ccgaacacat
     3781 atgcctttta cacttggtca acgctggatc agcgatacag aaagcgaatt gggacttgga
     3841 accgttgtcg cggtggatgc gcgaactgtc actttacttt tcccatctac tggtgaaaac
     3901 cgtctgtacg cacgcagtga ttcccccgtg acccgcgtga tgttcaaccc tggtgatacc
     3961 attaccagcc atgacggctg gcagatgcaa gtcgaagaag taaaagaaga aaatggcttg
     4021 ctgacctata tcggtactcg cctggatact gaagaggtcc ggcgtagccc tgcgtgaagt
     4081 tttccttgat agcaaactgg tgttcagcaa accgcaggca ccgtctgttt gccgggcaga
     4141 ttgaccgtat ggaccgcttt gcgctgcgtt atcgcgcgcg taaatattcc agcgaacagt
     4201 tccgtatgcc gtacagcggc ctgcgcggtc agcgtaccag cctgatccgc atcagctcaa
     4261 catcgctcat gatgttggtc gccgccacgc gccgcgcgtc ctgctggctg acgaagtggg
     4321 tttagggaaa accattgaag ccgggatgat cctgcatcag caactgctct ctggcgctgc
     4381 tgaacgtgtg ctaattatcg tcccggaaac cttacagcat cagtggctgg tagaaatgct
     4441 gcgccgtttc aacctgcgct ttgcgctatt tgatgatgag cgttatgccg aagctcagca
     4501 cgatgcttac aacccgtttg acaccgtgaa gcggcgcacg aaaaacgcga aagcgtttca
     4561 cgataaatgc gaaaacttta gctttcgcgc ttcaaatgaa acagatgtat taattactgc
     4621 tttttattca ttacatgggg atccccgggt accgagctcg aattcc
//
LOCUS       HUMBIGFII    1387 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human insulin-like growth factor binding protein 2 (IGFBP2) mRNA,
            complete cds.
ACCESSION   M35410
KEYWORDS    insulin-like growth factor binding protein 2.
SOURCE      Human 67-year old retina, cDNA to mRNA, clone AS200.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1387)
  AUTHORS   Agarwal,N., Hsieh,C.-L., Sills,D., Swaroop,M., Desai,B.,
            Francke,U. and Swaroop,A.
  TITLE     Sequence analysis, expression and chromosomal localization of a
            gene, isolated from a subtracted human retina cDNA library, that
            encodes an insulin-like growth factor binding protein (IGFBP2)
  JOURNAL   Exp. Eye Res. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Swaroop, 20-JUN-1990.
FEATURES       from  to/span     description
    pept         64     1050     insulin-like growth factor binding protein 2
                                 (IGFBP2) precursor /hgml_locus_uid="LN0220S"
                                 /nomgen="IBP1" /map="7p13-p12"
    sigp         64      180     insulin-like growth factor binding protein 2
                                 signal peptide
    matp        181     1047     insulin-like growth factor binding protein 2
    mRNA    <     1     1387     IGFBP2 mRNA
    signal     1362     1367     poly-A signal
    binding     175      197     ATP binding site
BASE COUNT      232 a    455 c    477 g    223 t
ORIGIN      
        1 gtgccacctg cccgcccgcc cgctcgctcg ctcgcccgcc gcgccgcgct gccgaccgcc
       61 agcatgctgc cgagagtggg ctgccccgcg ctgccgctgc cgccgccgcc gctgctgccg
      121 ctgctgccgc tgctgctgct gctactgggc gcgagtggcg gcggcggcgg ggcgcgcgcg
      181 gaggtgctgt tccgctgccc gccctgcaca cccgagcgcc tggccgcctg cgggcccccg
      241 ccggttgcgc cgcccgccgc ggtggccgca gtggccggag gcgcccgcat gccatgcgcg
      301 gagctcgtcc gggagccggg ctgcggctgc tgctcggtgt gcgcccggct ggagggcgag
      361 gcgtgcggcg tctacacccc gcgctgcggc caggggctgc gctgctatcc ccacccgggc
      421 tccgagctgc ccctgcaggc gctggtcatg ggcgagggca cttgtgagaa gcgccgggac
      481 gccgagtatg gcgccagccc ggagcaggtt gcagacaatg gcgatgacca ctcagaagga
      541 ggcctggtgg agaaccacgt ggacagcacc atgaacatgt tgggcggggg aggcagtgct
      601 ggccggaagc ccctcaagtc gggtatgaag gagctggccg tgttccggga gaaggtcact
      661 gagcagcacc ggcagatggg caagggtggc aagcatcacc ttggcctgga ggagcccaag
      721 aagctgcgac caccccctgc caggactccc tgccaacagg aactggacca ggtcctggag
      781 cggatctcca ccatgcgcct tccggatgag cggggccctc tggagcacct ctactccctg
      841 cacatcccca actgtgacaa gcatggcctg tacaacctca aacagtgcaa gatgtctctg
      901 aacgggcagc gtggggagtg ctggtgtgtg aaccccaaca ccgggaagct gatccaggga
      961 gcccccacca tccgggggga ccccgagtgt catctcttct acaatgagca gcaggaggct
     1021 cgcggggtgc acacccagcg gatgcagtag accgcagcca gccggtgcct ggcgcccctg
     1081 ccccccgccc ctctccaaac accggcagaa aacggagagt gcttgggtgg tgggtgctgg
     1141 aggattttcc agttctgaca cacgtattta tatatggaaa gagaccagca ccgagctcgg
     1201 cacctccccg gcctctctct tcccagctgc agatgccaca cctgctcctt cttgctttcc
     1261 ccgggggagg aagggggttg tggtcgggga gctggggtac aggtttgggg agggggaaga
     1321 gaaattttta tttttgaacc cctgtgtccc ttttgcataa gattaaagga aggaaaagta
     1381 aagtgtg
//
LOCUS       HUMLBPA      1431 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human lipopolysaccharide binding protein (LBP) mRNA,
            complete cds.
ACCESSION   M35533
KEYWORDS    lipopolysaccharide binding protein.
SOURCE      Human liver, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
            Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
  TITLE     Structure and function of lipopolysaccharide binding protein
  JOURNAL   Science (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1431)
  AUTHORS   Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
            Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.S.Tobias, 21-JUN-1990.
            
               Author address: P.S.Tobias
               Department of Immunology, IMM-12
               10466 N. Torrey Pines Rd
               La Jolla, CA 92037
FEATURES       from  to/span     description
    pept          1  >  1431     lipopolysaccharide binding protein (LBP)
                                 precursor
    sigp          1       75     lipopolysaccharide binding protein signal
                                 peptide
    matp         76     1431     lipopolysaccharide binding protein
BASE COUNT      319 a    417 c    359 g    336 t
ORIGIN      
        1 atgggggcct tggcaagagc cctgccgtcc atactgctgg cattgctgct tacgtccacc
       61 ccagaggctc tgggtgccaa ccccggcttg gtcgccagga tcaccgacaa gggactgcag
      121 tatgcggccc aggaggggct attggctctg cagagtgagc tgctcaggat cacgctgcct
      181 gacttcaccg gggacttgag gatcccccac gtcggccgtg ggcgctatga gttccacagc
      241 ctgaacatcc acagctgtga gctgcttcac tctgcgctga ggcctgtccc cggccagggc
      301 ctgagtctca gcatctccga ctcctccatc cgggtccagg gcaggtggaa ggtgcgcaag
      361 tcattcttca aactacaggg ctcctttgat gtcagtgtca agggcatcag catttcggtc
      421 aacctcctgt tgggcagcga gtcctccggg aggcccacag gttactgcct cagctgcagc
      481 agtgacatcg ctgacgtgga ggtggacatg tcgggagatt cggggtggct cttgaacctc
      541 ttccacaacc agattgagtc caagttccag aaagtactgg agagcaggat ttgcgaaatg
      601 atccagaaat cagtgtcctc cgatctacag ccttatctcc aaactctgcc agttacaaca
      661 gagattgaca gtttcgccga cattgattat agcttagtgg aagcccctcg ggcaacagcc
      721 cagatgctgg aggtgatgtt taagggtgaa atctttcatc gtaaccaccg ttctccagtt
      781 accctccttg ctgcagctga ggaacacaac aaaatggtct actttgccat ctcggattat
      841 gtcttcaaca cggccagcct ggtttatcat gaggaaggat atctgaactt ctccatcaca
      901 gatgacatga taccgcctga ctctaatatc cgactgacca ccaagtcctt ccgacccttc
      961 gtcccacggt tagccaggct ctaccccaac atgaacctgg aactccaggg atcagtgccc
     1021 tctgctccgc tcctgaactt cagccctggg aatctgtctg tggaccccta tatggagata
     1081 gatgcctttg tgctcctgcc cagctccagc aaggagcctg tcttccggct cagtgtggcc
     1141 actaatgtgt ccgccacctt gaccttcaat accagcaaga tcactgggtt cctgaagcca
     1201 ggaaaggtaa aagtggaact gaaagaatcc aaagttggac tattcaatgc agagctgttg
     1261 gaagcgctcc tcaactatta catccttaac accctctacc ccaagttcaa tgataagttg
     1321 gccgaaggct tcccccttcc tctgctgaag cgtgttcagc tctacgacct tgggctgcag
     1381 atccataagg acttcctgtt cttgggtgcc aatgtccaat acatgagagt t
//
LOCUS       HUMPEC12L    2344 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human cell 12-lipoxygenase gene, complete cds.
ACCESSION   M35418
KEYWORDS    lipoxygenase.
SOURCE      Human platelet/erythroleukemia cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2344)
  AUTHORS   Funk,C.D., Furci,L. and FitzGerald,G.A.
  TITLE     Molecular cloning, primary structure, and expression of the human
            platelet/erythroleukemia cell 12-lipoxygenase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5638-5642 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.D.Funk, 20-JUN-1990.
FEATURES       from  to/span     description
    pept         42     2033     12-lipoxygenase
    signal     2324     2329     poly-A signal
    mRNA    <     1     2344     12-lipoxygenase mRNA
BASE COUNT      514 a    692 c    642 g    496 t
ORIGIN      
        1 ggaggacccg gctcccctcg cctaagctgc tggggggcgc catgggccgc taccgcatcc
       61 gcgtggccac cggggcctgg ctcttctccg ggtcgtacaa ccgcgtgcag ctttggctgg
      121 tcgggacgcg cggggaggcg gagctggagc tgcagctgcg gccggcgcgg ggcgaggagg
      181 aggagtttga tcatgacgtt gcagaggact tggggctcct gcagttcgtg aggctgcgca
      241 agcaccactg gctggtggac gacgcgtggt tctgcgaccg catcacggtg cagggccctg
      301 gagcctgcgc ggaggtggcc ttcccgtgct accgctgggt gcagggcgag gacatcctga
      361 gcctgcccga gggcaccgcc cgcctgccag gagacaatgc tttggacatg ttccagaagc
      421 atcgagagaa ggaactgaaa gacagacagc agatctactg ctgggccacc tggaaggaag
      481 ggttacccct gaccatcgct gcagaccgta aggatgatct acctccaaat atgagattcc
      541 atgaggagaa gaggctggac tttgaatgga cactgaaggc aggggctctg gagatggccc
      601 tcaaacgtgt ttacaccctc ctgagctcct ggaactgcct agaagacttt gatcagatct
      661 tctggggcca gaagagtgcc ctggctgaga aggttcgcca gtgctggcag gatgatgagt
      721 tgttcagcta ccagttcctc aatggtgcca accccatgct gttgagacgc tcgacctctc
      781 tgccctccag gctagtgctg ccctcgggga tggaagagct tcaggctcaa ctggagaaag
      841 aacttcagaa tggttccctg tttgaagctg acttcatcct tctggatgga attccagcca
      901 acgtgatccg aggagagaag caatacctgg ctgcccccct cgttatgctg aagatggagc
      961 ccaatgggaa gctgcagccc atggtcatcc agattcagcc tcccagcccc agctctccaa
     1021 ccccaacact gttcctgccc tcagaccccc cacttgcctg gctcctggca aagtcctggg
     1081 tccgaaattc agatttccaa ctgcacgaga tccagtatca cttgctgaac actcacctgg
     1141 tggctgaggt catcgctgtc gccaccatgc ggtgcctccc aggactgcac cccatcttca
     1201 agttcccgat cccccatatc cgctacacca tggaaatcaa cacccgggcc cggacccaac
     1261 tcatctcaga tggaggaatt tttgataagg cagtgagcac aggtggaggg ggccatgtac
     1321 agttgctccg tcgggcggca gctcagctga cctactgctc cctctgtcct cctgacgacc
     1381 tggctgaccg gggcctgctg ggactcccag gtgctctcta tgcccatgat gctttacggc
     1441 tctgggagat cattgccagg tatgtggagg ggatcgtcca cctcttctac caaagggatg
     1501 acatagtgaa gggggaccct gagctgcagg cctggtgtcg ggagatcacg gaggtggggc
     1561 tgtgccaggc ccaggaccga ggtttccctg tctccttcca gtcccagagt caactctgcc
     1621 atttcctcac catgtgcgtc ttcacgtgca ctgcccagca tgccgccatc aaccagggcc
     1681 agctggactg gtatgcctgg gtccctaatg ctccatgcac aatgcggatg cccccaccca
     1741 ccaccaagga agatgtgacg atggccacag tgatggggtc actacctgat gtccggcagg
     1801 cctgtcttca aatggccatc tcatggcatc tgagtcgccg ccagccagac atggtgcctc
     1861 tggggcacca caaagaaaaa tatttctcag gccccaagcc caaagctgtg ctaaaccaat
     1921 tccgaacaga tttggaaaag ctagaaaagg agattacagc ccggaatgag caacttgact
     1981 ggccctatga atatctgaag cccagctgca tagagaacag tgtcaccatc tgagccctag
     2041 agtgactcta cctgcaagat ttcacatcag ctttaggact gacatttcta tcttgaattt
     2101 catgctttcc taaagtctct gctgctaagg ctctatttcc tcccccagtt aaacccctac
     2161 attagtatcc cactagccca ggggagcagt aaactttctc tgcaaagact agatcctttt
     2221 ttacgctttg cagaccgcat agtcactgtc tcaactactc agctctcctg ctgcagcatg
     2281 aaggcagcca cagacaacat ggaaatgagt gtgactatgt tccaataaaa ctttatggac
     2341 actg
//
LOCUS       HUMRALBA     1327 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human GTP-binding protein (RALB) mRNA, complete cds.
ACCESSION   M35416
KEYWORDS    GTP-binding protein.
SOURCE      Human retina, cDNA to mRNA, clone AS181.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1327)
  AUTHORS   Hsieh,C.-L., Swaroop,A. and Francke,U.
  TITLE     Chromosomal localization and cDNA sequence of human RALB, a GTP
            binding protein
  JOURNAL   Somat. Cell Mol. Genet. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Swaroop, 20-JUN-1990.
FEATURES       from  to/span     description
    pept        171      791     GTP-binding protein (RALB)
    mRNA    <     1     1327     RALB mRNA
    signal     1303     1308     poly-A signal
BASE COUNT      368 a    273 c    373 g    313 t
ORIGIN      Chromosome cen-q13.
        1 gagcccggca gctcaatgac aaatcggtgg aggacggctg gggtccggcc ccgggagggc
       61 ccggggcgcg tttaagagct gcgggccggg tgcggacggc ggaggcggcg ggactggtcc
      121 ctgctcttca gtgggtcatc tgtgtgtcac agcctcagaa gaccagcgag atggctgcca
      181 acaagagtaa gggccagagc tccttggccc tccacaaggt gatcatggtt ggcagcggag
      241 gcgttggcaa gtcagccctg acgcttcagt tcatgtatga cgagtttgta gaagactatg
      301 aacctaccaa agctgacagt tatagaaaga aagtggttct tgatggggaa gaagttcaga
      361 tagatattct ggacaccgct gggcaagagg actacgcagc cattcgagat aactactttc
      421 ggagtgggga agggtttctt cttgtgttct caatcacaga acatgaatcc tttacagcaa
      481 ctgccgaatt cagggaacag attctccgtg tgaaggctga agaagataaa attccactgc
      541 tcgtcgtggg aaacaagtct gacctagagg agcggaggca ggtgcctgtg gaggaggcca
      601 ggagtaaagc cgaagagtgg ggcgtgcagt acgtggagac gtcagcgaag acccgggcca
      661 acgtggacaa ggtgttcttt gacctaatga gagaaatcag aacaaagaag atgtcagaaa
      721 acaaagacaa gaatggcaag aaaagcagca agaacaagaa aagttttaaa gaaagatgtt
      781 gcttactatg agtgtcaagg tgacggatga agccagctgc tcctaaggac acagggctgg
      841 gttggtaaag agaaggctat ggttgacttc ttgcttgtgc ttcccactct ccccgacttc
      901 attcactcaa acttctttaa atggggaaaa atatttgtga ctctgtggct ggcagaagaa
      961 ataagcccat gcaagtggaa gggctgcttt gtcaggaggt tgtggaattt ctttcttctc
     1021 cccttcttcc ctcccaaaag cttagctatg tataaagtgc cacagatagg aaacagctgt
     1081 taattacaaa gagaaagaat tgtcatagca tcttattttg ttcctagttt tataacatta
     1141 ccatccttcg ttttgaacta cagatgttgt agtgggtttt ggaggaggga gtggagtaag
     1201 atgccctccc acttttatca gtttagtagt agtactgaga aaaatccctt cagctctaag
     1261 aacactgaaa aatccaccga ttttttgggt aagcttcttg gcaataccct gtggatctga
     1321 aacagct
//
LOCUS       LACLACR      1332 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  L.lactis lactose phosphotransferase system repressor (lacR) gene,
            complete cds.
ACCESSION   M35375
KEYWORDS    lactose phosphotransferase system repressor; lactose repressor.
SOURCE      L.lactis (strain MG1820) DNA.
  ORGANISM  Lactococcus lactis
            Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; 
            Lactobacillaceae.
REFERENCE   1  (bases 1 to 1332)
  AUTHORS   Van Rooijen,R.J. and Devos,W.M.
  TITLE     Molecular cloning, transcriptional analysis, and nucleotide
            sequence of LACR, a gene encoding the repressor of the lactose
            phosphotransferase system of Lactococcus lactis
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.J.Van Rooijen, 20-JUN-1990.
            
              Author address: R.J.Van Rooijen
              Netherlands Inst for Dairy Research
              Kernhemseweg 2
              P.O. Box 20
              6710 BA EDE
              THE NETHERLANDS
            
FEATURES       from  to/span     description
    pept        370     1155     lactose repressor (lacR; alt.)
    pept        388     1155     lactose repressor (lacR; alt.)
    mRNA         79     1245     lactose repressor mRNA
    signal     1215  >  1155     transcription termination signal
    binding     353      357     ribosomal binding site
    signal       45       51     -35 region
    signal       68       74     -10 region
BASE COUNT      469 a    207 c    198 g    458 t
ORIGIN      
        1 gatatcaaac attcaaacaa aacgcaacta tttttgttaa ttttttgttt ttttttattt
       61 gtttttttaa aaaatagata acaccgttaa attattgttc atttttgttc atttaatcca
      121 tcacaaaatg gacgtgaaat atctattcag gtattacaaa agtcttttac tttctataac
      181 ttactgatta agaggtccta ctttattttc gtcttataca aaatctgacc taagctaata
      241 tacgtcaatc ctctgttctt atttcatcat ctaacgtttg tttttgtttg aaattgtttg
      301 ttttaccttg aaaatattat cttttatgat acaattaaaa gagaattatc tttggaaaaa
      361 aattacttta tgaaagaaag tcttcatatg aacaaaaaac gacgattaga aaaaatttta
      421 gatatgttaa agattgatgg gaccataacc ataaaagaaa taatagatga actagatatt
      481 tccgatatga cagcccgtag agaccttgat gctctagaag ctgatggact tttaacacgt
      541 actcatggtg gtgcacaatt gctttcctct aaaaagccac ttgaaaagac acatatcgag
      601 aagaaaagtc taaatacaaa agaaaaaatt gacattgcta aaaaagcctg ctctttaatc
      661 aaagatggcg atactatttt tattggaccc ggaactacac ttgtacaact ggcattagaa
      721 ttgaaaggtc gtaaaggtta taaaattcgt gtcattacaa atagtctccc tgtgttcttg
      781 attctaaatg atagcgaaac cattgattta ttgcttcttg gcggtgaata tagagaaata
      841 actggagctt ttgtaggttc aatggcttcg acaaatttaa aagcaatgag atttgccaaa
      901 gcttttgttc gtgcaaatgc tgttacccat aattctattg ctacatatag tgacaaggaa
      961 ggtgtgattc aacaacttgc cctaaacaat gctgtagaaa aattcttatt agtagacagt
     1021 actaaattcg atcgatacga tttctttaac ttctacaatc tagatcaact cgataccatc
     1081 attacagata accagattag ccctcaacac ttagaggaat ttagccagta cactactatt
     1141 ttaaaagcgg actagaatta tgacttataa aaatattgga ctactcttaa ataattagac
     1201 ataaaaaaag caccgtatga atcaaacaat tctacggtgt ttttttgtta tttctaatgt
     1261 atggtttgtc gaaaatatgt acacattatt taactttcca aaaaattgga gttttcttga
     1321 taattggata tc
//
LOCUS       MARCMYCA     1391 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Woodchuck c-myc protein gene, exon 1.
ACCESSION   M35498
KEYWORDS    c-myc protein.
SOURCE      Woodchuck (Marmota monax) DNA.
  ORGANISM  Marmota monax
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; 
            Marmotini.
REFERENCE   1  (bases 1 to 1391)
  AUTHORS   Wei,Y., Hsu,T.Y., Tiollais,P., Buendia,M.A. and Etiemble,J.
  TITLE     Evolutionary conservation of target sequences for cis-acting
            regulation in c-myc exon 1 and its upstream sequences
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Buendia, 21-JUN-1990.
FEATURES       from  to/span     description
    pre-msg     818  >  1391     c-myc mRNA and introns
    IVS        1378  >  1391     c-myc intron 1
    site        159      220     P0 promoter
    site        790      794     TATA box
    site        964      968     TATA box
BASE COUNT      317 a    385 c    383 g    306 t
ORIGIN      
        1 ctcagcgatt agtgcgtctt gcgggaatag ccgcttccca cacccggccg ggtggaagtc
       61 tgagcctgct gggcaaaacg agcgatatct gctgttttgg cagcaaacta ggggattcat
      121 tctgggtggg aagtgcccaa tctagatagc tgtgcataca taatgcataa tgaattacac
      181 tcacacaacc tcaagaaatg taataggtat gtattcataa cactctccaa gtatatgtgg
      241 caaggcattg ctgcgttatt ttaattattc cagaaatcat tttcctccct acctcctctg
      301 tcatttatcc ctaacactcc atatactgaa tgcgcactca taaatattcc ttctgcccgc
      361 ctgtcttcat aagacttatt ttcaaaatgc tgctctttcc ccagccttag ggaggcgccc
      421 ggccgcccgg gacgtgcgtg cgcggccgtg ggtacatggt gtattctcag tgttgagggt
      481 gagggcagct gttccacctg tgttaattgg aacacgcagg acgagaatgc agtttgtcag
      541 agtactgcgc cagaggagca gcagagaaag ggaaaggatt taaacaggag caaaagaaaa
      601 tggtaggcgc gcgcagttaa ttcttgctgc gcccttatac tgtttacatc cgatagctgg
      661 agtgccgggc tgcggggctg agtctcctcc ccttccctca ctcggcagtg cccctcccag
      721 gttcccaaag ccgagggcgg ggagaaagaa aaaaaaaaga ttccgtggaa tccccgccca
      781 ccagcccttt ataatacgag ggtctgcgcg cccgaggacc cctgagctgc gcttctcgtg
      841 gccgccaaca tcgccgcgcc ccggcggccg ctcttggctc ccctcctgcc tagagaaggg
      901 cagggcttct tagaggcttg gcgggaaaaa gaagcgaggg ggagggatcg cgcgtaacag
      961 cagtataaaa gtcgttttcg gggctttatc tcactcgctg tagtaattcc agcgagagac
     1021 agagggagtg agcgggcgaa cccgtgaggt ggaagaaccg agcagagctg ctccgggcgt
     1081 cctgggaagg gaaacccgga gtgaaaggag acttagtctt ctgaccagcg cccccacccc
     1141 agccctcccg cggagcccct ccagggtccg caaccgcgaa actttgccct ttgctgcggg
     1201 cggacacttt gcactggaac ttaaaatacc cgatcgagga cgcgactctc cggagcgggg
     1261 aggctatact gcctatttgg ggacactttt ccccgccttt acccaggacc cgctcctctg
     1321 aaagcgctcc tggctgccgt ttgaaggctg gatttccttc gggtagttga aaacccggta
     1381 agcaccagat c
//
LOCUS       ONGOSTLE      214 bp ds-DNA             INV       14-AUG-1990
DEFINITION  O.volvulus recombinant antigen gene, 3' end.
ACCESSION   M35370
KEYWORDS    T-cell epitope; recombinant antigen.
SOURCE      O.volvulus DNA.
  ORGANISM  Onchocerca volvulus
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Similiidae.
REFERENCE   1  (bases 1 to 214)
  AUTHORS   Colina,K.F., Perler,F.B., Matsumura,I., Meda,M. and Nutman,T.B.
  TITLE     The identification of an Onchocerca-specific recombinant antigen
            containing a T cell epitope
  JOURNAL   Unpublished (19900
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.B.Nutman, 19-JUN-1990.
            
              Author address: T.B.Nutman
              Inst. Lab. of Parasitic Diseases
              NIAID
              National Institutes of Health
              Bldg 4, Rm 126
              9000 Rockville Pike
              Bethesda, MD 20892
              email: tbn@helix.nih.gov
            
FEATURES       from  to/span     description
    pept    <     1       93     recombinant antigen (AA at 1)
    site         91       93     nematode splice junction
BASE COUNT       81 a     37 c     31 g     65 t
ORIGIN      
        1 gaattcagtg taagaagcag cagaacattt caatcattac gaagatatat atacaacatt
       61 tctttcttct tcattcttga gttgcatatg taaattcaaa aataattacg atttaatgaa
      121 ttgagcaagc ataacttttc ccagcaagta taacaaagtt ttgcgaggaa cgaactcaga
      181 aaactttcac ttatgtaaaa ttgcgcacaa gacc
//
LOCUS       PSEIAAL      2766 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  P.syringae IAA-lysine synthetase (iaaL) gene, complete cds.
ACCESSION   M35373
KEYWORDS    IAA-lysine synthetase.
SOURCE      P.syringae savastanoi DNA.
  ORGANISM  Pseudomonas syringae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 2766)
  AUTHORS   Roberto,F.F., Klee,H., White,F., Nordeen,R. and Kosuge,T.
  TITLE     Expression and fine structure of the gene encoding
            N-epsilon-(indole-3-acetyl)-L-lysine synthetase from Pseudomonas
            savastanoi
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5797-5801 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.J.Klee, 20-JUN-1990.
FEATURES       from  to/span     description
    pept       1100     2287     IAA-lysine synthetase (iaaL)
    pept         95     1003     ORF1
BASE COUNT      637 a    760 c    754 g    615 t
ORIGIN      
        1 gaattccata gcgtgcgggg cttggaggag cgccgcggcc tgagtatctg tggctaaccc
       61 ttgcggcttc ggtgctggtc gctgtcgagc agctatgcgt gcagatcctg cgcagtgcag
      121 gcttcggcaa acaggcgatg tggaccctgc tgacggggac ggccgccgtt gccatcgcag
      181 atcccctgct tattgtggcg ttcgatctgg gcctggtggg tgccggcatc gctacctgat
      241 atcgagcctg gtatcggcct gtctggggtt ttactacgtt caccgagtcg cccatctgac
      301 ctgtcgggtc agcctgaaga acctgtcagg tgacatcaga aatatcgggc gaaccgcctt
      361 gccagcggtg attggcaacc tggcaactcc agtgggcatg gcctacgtaa tggctgcgat
      421 ggcgccgttc ggatctcagg cgctggcgac tatcggggtg atcgacaggg tcattcaggt
      481 tgctttttgt gtcgtgttcg ccttgcccgg tgcgctgatc ccgatactgg ggcaaaacct
      541 gggcgcaatg aacactgctc gcgtgtctca agccataaag atgacgtacg gattgttgat
      601 cggctacggc tcagtgacct cgctgttact cattctgctc gctgagccat tagccagctt
      661 gtttcatctc gccgctgaac gccaagtcgt gttcttcgcg ttctgccgat ggggcggcgc
      721 tctggacgct catcgggctg caattcattg ccacctcagt cttcctcagt atggggcgac
      781 cggcgtacgt cacactgttc ggctggttcc gcgccacctg ggaaccatgc cgttcgtgtg
      841 gtatggggca cataaatttg gcagcgtcgg ggtaatgctc gggcagttgc tgggtaacac
      901 catagtggcc ttttgtgcct gcgtggctcg cgcatctgct catgaaaaag atgttggaca
      961 tcgagatcca ttcaataggg aaccgatccc tccacaggag taactgataa tccacgtttt
     1021 gcccaccctt ggctgtcgtc aggtgggcag gatgtccagg atgtccagga aatcaaaaaa
     1081 cggactatag aggactcgca tgactgccta cgatatggaa aaggaatgga gtagaatttc
     1141 cattactgcc gctaaaatcc accagaacaa cgattttgaa ggattcactt atcaggactt
     1201 cagaacccac gtaccgatca tggacaaaga cggcttcgcg gcacagactg aacgctgtct
     1261 agagcgcaat gagcgaaact gcctgatcgg ctttaccagt ggcaccagcg gcaacatcaa
     1321 acgctgttat tactactacg actgcgaagt cgatgaagac agctccctct ccaacgtctt
     1381 ccgcagcaac ggctttattc tgcccggtga tcgctgcgcc aacctgttca cgatcaacct
     1441 gttttctgct ctgaacaaca cgattaccat gatggccggt aactgcggtg ctcacgtcgt
     1501 gtccgtaggt gacatcaccc tggtgaccaa gagccatttc gaagcgctta actcgatcaa
     1561 gctcaacgta ctgctcggcg tgccatccac tatcttgcag ttcatcaatg ccatgcaaca
     1621 taacggtgtg cacatcaata tcgagaaggt tgtcttcacc ggcgagagcc tgaaaacttt
     1681 ccagaagaaa atcatcaggc aagcctttgg cgaacaagtc tccatcgtcg gtgtgtatgg
     1741 cagttccgag ggcggcattc tcggtttcac caacagccct tgccacactg aatacgagtt
     1801 tctgtccgac aagtatttca tcgaaaaaga aggcgacagc atcctcatca cctcgctgac
     1861 ccgagaaaac tttacgccgc tgctgaggta tcgcctagga gacaccgcaa ccctttcgat
     1921 gaaaggcgac aagctctacc tgacagacat ccagcgggag gacatgagct tcaacttcat
     1981 gggcaacctc atcgggctgg gcatcattca gcaaacgatt aaacagacac tgggccgatc
     2041 gctggaaatc caggttcacc tgtcagtgac cgaagagcgc aaggaactgg tgaccgtttt
     2101 cgttcaggcc tctgaagtcg atgaagacga acgcgtcaga atcgaaacag ccatcgccga
     2161 tatccccgac atcaaagagg cgtatcagaa aaaccaaggc accgtgtcgg tcctgcgcaa
     2221 ggatgccaga gactacgcgg tctcggagcg aggcaaaatg ctctacatca tcgaccgccg
     2281 aaactgaatg gctgatgtga acgagtgagt agctgcaccg acggggcctt tggcggtgtc
     2341 ggtgcagttt tttagaggat tcggaagcgc cagaggtcag agtccacgaa actggaacga
     2401 actgggcagc ctgcggctgc aaattgtggg attttgaaat cggttatcat agccgaaatc
     2461 gagtcgatcc ctcctcagca caggcttaca catggcgtca gagaccaaaa aacgtaaacg
     2521 ggcgagccgg gcaaaagcca aggcaaagca gacccgtctc caacgcgccg ggcatactac
     2581 cttcgtgccc gataccgact tttccttcga tatcgatcct ttcggtgatg tcgatctttg
     2641 tagttgctgc cagacaacgt atctgaacga catgtttccc gacgcttctt gcgtaaggct
     2701 ttagatgaga gaagggccag gcggattcgc atcaccgccg tcattcacca cgatgaggag
     2761 ccgcct
//
LOCUS       RABLPBA      1446 bp ss-mRNA            MAM       14-AUG-1990
DEFINITION  Rabbit lipopolysaccharide binding protein (LBP) protein mRNA,
            complete cds.
ACCESSION   M35534
KEYWORDS    lipopolysaccharide binding protein.
SOURCE      Rabbit liver, cDNA to mRNA.
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (ases 1 to 1446ites; for [2]
  AUTHORS   Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
            Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
  TITLE     Structure and function of lipopolysaccharide binding protein
  JOURNAL   Science (1990) In press
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1446)
  AUTHORS   Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D.,
            Mathison,J.C., Tobias,P.S. and Ulevitch,R.J.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.S.Tobias, 21-JUN-1990.
            
               Author address: P.S.Tobias
               Department of Immunology, IMM-12
               10466 N. Torrey Pines Rd
               La Jolla, CA 92037
FEATURES       from  to/span     description
    pept          1  >  1446     lipopolysaccharide binding protein (LBP)
                                 precursor
    sigp          1       78     lipopolysaccharide binding protein signal
                                 peptide
    matp         79     1446     lipopolysaccharide binding protein
BASE COUNT      316 a    454 c    391 g    285 t
ORIGIN      
        1 atggggacct gggccagggc cctgctgggg tccaccctgc tgagcctgct gctcgcagct
       61 gccccgggag ctctgggcac caaccccggc ctcatcacca ggatcaccga caaaggcctg
      121 gagtacgcgg ccagggaggg gctgctggct ctgcagagaa agctcctgga agtcacgctg
      181 ccggattccg atggggactt caggatcaaa catttcgggc gtgcacagta caagttctac
      241 agtctgaaaa tccccagatt cgagctgctc cgtggcaccc tgaggcccct ccccggccag
      301 ggcctgagtc tcgacatctc cgacgcctac atccacgtgc ggggcagctg gaaggtgcgc
      361 aaggcgttcc tgagactgaa gaactccttt gacctgtatg tcaagggcct caccatttcc
      421 gtccacctcg tgttgggcag cgagtcctcc gggaggccca cggtcaccac ctccagctgc
      481 agcagcgaca tccagaacgt ggagttggac atagaggggg acctggagga gctgctgaac
      541 ctcctccaaa gccagatcga tgccaggctg cgcgaagtgc tggagagcaa gatttgcagg
      601 cagattgagg aagccgtgac ggcccacctg cagccttatc tacagacact gccagtcaca
      661 acgcagatcg acagctttgc cggcattgac tacagcttga tggaggcccc ccgggcaaca
      721 gctgggatgt tggatgtgat gtttaagggt gaaattttcc ctctggatca ccgcagccca
      781 gtggacttcc ttgctccagc catgaacctc cccgaggctc acagccgaat ggtctacttt
      841 tccatctccg attacgtctt caacaccgcc agcctggcct accacaagtc agggtactgg
      901 aacttctcca tcacagacgc catggttccg gccgacctca acatccggcg gaccaccaag
      961 tccttccgac ccttcgttcc cctgcttgcc aatctctacc ccaacatgaa cttggagctc
     1021 caagggacag tgaactcgga acaactggtg aacctcagca ccgagaatct gttagaggaa
     1081 cccgagatgg atattgaggc cttggtggtc ctgcccagct ctgccaggga gcctgtcttc
     1141 cggctgggtg tggccactaa tgtgtctgcc acactgacct tgaacaccag gaagatcact
     1201 gggttcctga agccgggaag gctacaggtg gaactgaaag aatccaaagt cggaggattc
     1261 aatgtggagc tgttggaagc tctcctcaac tactacattc tcaacaacct ctaccccaag
     1321 gtcaatgaga agttggccca ccgcttcccg ctccctctgc tgaggcacat tcagctctac
     1381 gacctgcttc tccagaccca cgagaacttc ctgctcgtgg gcgccaacat ccagtacagg
     1441 agagtt
//
LOCUS       RATUKATPA    1484 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  R.norvegicus gastric (H+,K+)-ATPase beta-subunit mRNA, complete
            cds.
ACCESSION   M35535
KEYWORDS    (H+,K+)-ATPase beta-subunit.
SOURCE      R.norvegicus gastric mucosa oxyntic cell, cDNA to mRNA, clone RG4.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1484)
  AUTHORS   Canfield,V.A., Okamoto,C.T., Chow,D., Dorfman,J., Gros,P.,
            Forte,J.G. and Levenson,R.
  TITLE     Cloning of the H,K-ATPase beta subunit: Tissue-specific expression,
            chromosomal assignment, and relationship to Na,K-ATPase beta
            subunits
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by V.A.Canfield, 21-JUN-1990.
            
              Author address: V.A.Canfield
              Yale University School of Medicine
              Dept. of Cell Biology
              P.O. Box 3333
              New Haven, CT 06510
              email: levenson@YALEMED
            
FEATURES       from  to/span     description
    pept        176     1060     gastric (H+,K+)-ATPase beta-subunit (E.C.
                                 3.6.1.3)
    mRNA          1     1484     gastric (H+,K+)-ATPase beta-subunit
BASE COUNT      363 a    457 c    362 g    302 t
ORIGIN      
        1 ctgacttctg ggacagtgga ggacagatag cacgcaagcc ccagccctcc cttatgttta
       61 tagaggcgat agcggagaac tgatagctgg ttctgatgcc tttggcctca cacagaggag
      121 actataagcc ccagaggacg ctccctgggc ccagtccagg caagcaggag aggacatggc
      181 agccctgcag gagaagaagt catgcagcca gcgcatggcc gaattccggc aatactgttg
      241 gaacccggac actgggcaga tgctgggccg caccccagcc cggtgggtgt ggatcagcct
      301 gtactatgca gctttctacg tggtcatgac tgggctcttt gccttgtgca tctatgtgct
      361 gatgcagacc attgatccct acacccccga ctaccaggac cagttaaagt caccgggggt
      421 aaccttgaga ccggatgtgt atggggaaag agggctgcag atttcctaca acatctctga
      481 aaacagctcc tgggctggcc tcacacacac cctccacagc ttcttagcgg gctacacccc
      541 agcatcccag caggacagca tcaactgttc gtctgaaaag tacttcttcc aggagacctt
      601 ttctgctccg aaccatacca agttctcctg caagttcacg gcggacatgc tacagaattg
      661 ctcaggcctg gtggacccca gtttcggctt tgaggaggga aagccctgct tcattattaa
      721 aatgaacagg attgtcaagt tcctgcccag caacaacacg gctccccgag tggactgcac
      781 cttccaggat gacccccaaa agccccggaa ggacattgaa cccctgcagg tccagtacta
      841 tccccccaat ggtaccttca gtctccacta cttcccctac tacggcaaga aagcacagcc
      901 ccactacagc aaccctctgg tggcggcaaa gttcctcaac gtccccaaaa acacgcaagt
      961 cctcatcgtg tgcaagatca tggcggacca cgtgaccttc gacaaccccc acgaccccta
     1021 tgaagggaag gtggagttca agctcacaat acagaagtaa ggagtaggcg tggctgtcca
     1081 ccccagagcc tggtggaccc tgagggacca ctcttcctga ctgacatcat cggctggcca
     1141 gcatgcacgg ccacttcatg gttcagagct gacaccactg cccatctgcc gacagcagga
     1201 agtgctcctt cccagcactc cctgagcacc accagctttg aactgaaacc cgacgtgcgc
     1261 acgcacgttt gcaatcccgt gcggttaaca caggaaccca gagtccggct accactaagg
     1321 gacaacccat ctgtagggca tttctatcct gtgaccattt gtctgtcctg cactttgata
     1381 tgaactatgg gtccacatca gtgtaacact ggtcaccccg gcctccagtt tgtgcttctg
     1441 gggccacagc ccctaggtca ttaaaacaaa ctatagtaaa gtta
//
LOCUS       YSCMYO2A     5675 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  S.cerevisiae myosin-1 isoform (MYO2) gene, complete cds.
ACCESSION   M35532
KEYWORDS    myosin-1.
SOURCE      S.cerevisiae (strain GRF88) DNA, clone 10-2B.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 5675)
  AUTHORS   Johnston,G.C., Prendergast,J.A. and Singer,R.A.
  TITLE     The S.cerevisiae MYO2 gene encodes an essential myosin for
            vectorial transport of vesicles
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.C.Johnston, 21-JUN-1990.
            
              Author address: G.C.Johnston
              Dalhousie University
              7E Tupper Medical Bldg.
              Halifax, N.S.
              CANADA B3H 4H7
              email: JOHNSTON@AC.DAL.CA
            
FEATURES       from  to/span     description
    pept        581     5305     myosin 1 isoform (MYO2)
    binding     138      171     ATP-binding site
    binding     443      523     actin-binding site
    rpt         926      981     heptad repeat
    rpt        1010     1086     heptad repeat
BASE COUNT     1994 a    986 c   1110 g   1585 t
ORIGIN      Chromosome 15 right arm.
        1 gatcaataaa taaataggct cgaagacgcc tcagaactcc ggtcactggt ttgtcttgtt
       61 gatatacgat gtgccaagcg ccgtttctcg atgcttatct ggtttagttt acgctgttaa
      121 aaccaaaacc ccaacagatt ttcgacccta acgtatgtag ggctaaaata gatattgagt
      181 aggttacaat taattattgg caattgcacc tagtgacaca tttacgaaaa cgtagggcaa
      241 aaactattac ccgacccagg gctattttgt gattttttcc ttttttttgt ttatgatcgc
      301 gcttctcgaa aagccaaata tcagaaatcc caaacacgcc ttcatttgat acgattcgta
      361 gcctgcgttt cagagatcta tcaactttgc aaggccaatc agagaacaaa aaagtctcgc
      421 aaagtcattt cacttttctc gcttgaaatt attcgttcga tttctggctg cttgcttgtt
      481 ttttgttttc taaggtacta ttcgacacca ttccattgga cagcgatact tataccattg
      541 tacatatagg acataaaaac agcagatatt acagcgtata atgtcttttg aagtgggtac
      601 acgatgctgg tatccccata aagaattggg ctggattggg gcggaagtaa tcaaaaatga
      661 gttcaacgac ggcaagtacc acctggagtt acaattggaa gacgatgaaa tcgtgtccgt
      721 ggacacaaaa gacttgaata acgataagga ccaatctcta ccgcttctta gaaaccctcc
      781 cattttggaa gcaacggaag atttgacctc tttatcttac ttgaatgagc cagctgtttt
      841 acatgccatc aaacagcgct attctcaatt gaatatctac acatactcgg gtattgttct
      901 gattgctaca aacccttttg atcgtgtcga ccagctttat acacaagaca tgatccaagc
      961 atatgcggga aagcgcagag gtgaactgga acctcacttg tttgccattg ccgaagaagc
     1021 gtataggttg atgaaaaatg acaaacaaaa tcaaaccatt gtggtaagtg gtgaatctgg
     1081 tgctggaaaa acggtttctg ccaagtatat tatgcgttat tttgcttctg tagaagagga
     1141 aaattccgct actgtacaac atcaagtgga aatgtcggaa acagaacaaa agattctagc
     1201 tacaaaccct atcatggaag catttggtaa tgctaagact accagaaatg acaattcttc
     1261 cagatttggt aagtatctag aaattttatt cgataaggac acatctatta ttggagcaag
     1321 gatccgcaca tacttgttgg aacggtccag attagtttac cagccgccaa ttgagagaaa
     1381 ctaccacata ttttatcaat taatggctgg attaccagct caaaccaagg aggaattgca
     1441 tcttaccgat gcctcagatt acttctacat gaaccaaggc ggtgacacca agatcaacgg
     1501 tattgatgat gccaaagaat acaaaattac agtagatgca ttgacattag tcggaatcac
     1561 caaggaaact caacaccaaa tatttaagat cttggccgca cttctgcata tcggtaacat
     1621 agaaattaaa aaaactagaa atgatgcatc actatcagct gatgagccaa acctgaaact
     1681 ggcgtgcgaa ttgctgggaa ttgatgccta caactttgcc aaatgggtca ccaaaaagca
     1741 gatcattaca aggtcagaga aaattgtttc gaatctaaat tatagtcaag ctctggttgc
     1801 caaagattcc gtggctaagt ttatttattc cgcccttttc gattggcttg tggaaaatat
     1861 caacaccgtg ttatgcaacc cggctgtgaa cgaccaaatt agctcattta ttggtgttct
     1921 ggatatttat gggtttgaac attttgaaaa aaattcattt gaacaatttt gtattaacta
     1981 tgccaacgaa aaactacaac aagagttcaa ccaacatgtt ttcaaattag agcaagaaga
     2041 atacgttaaa gaagaaattg aatggtcttt tatagagttt aatgataatc aaccttgtat
     2101 tgatctgatt gaaaacaagt tgggtatttt atcactgctt gacgaagaaa gtaggttacc
     2161 tgctggttcc gacgaatctt ggacccaaaa actttatcaa actttggata aatctcctac
     2221 gaacaaagta ttttctaaac caagattcgg gcaaactaaa tttatcgtga gccattatgc
     2281 tctagatgtc gcttatgatg tggaaggatt tattgaaaaa aatagagaca ccgtatctga
     2341 cggacatttg gaagtgttga aggcttctac caacgagaca ctaataaata tcttagaggg
     2401 attagaaaaa gctgccaaaa aactggaaga agcgaaaaag cttgaattag agcaggctgg
     2461 cagtaaaaag ccaggtccga taagaacggt taacaggaaa cccactttag gttccatgtt
     2521 taagcaatct ttgattgaac taatgaatac catcaactca actaatgttc attatattcg
     2581 ttgtataaag cctaatgcag ataaagaagc ttggcaattt gataatttga tggtgttgtc
     2641 tcaactcaga gcctgtggtg ttttggaaac tattagaata tcttgtgctg ggtttccttc
     2701 taggtggact tttgaagaat ttgtattaag atattacatc ttgataccac atgagcagtg
     2761 ggacctaatc ttcaaaaaaa aggaaactac agaagaagat atcatatcag tggttaaaat
     2821 gatcctagat gctactgtaa aggacaaatc caagtaccag attggtaata caaaaatttt
     2881 cttcaaagca ggtatgcttg catatctgga aaaacttaga agcaataaga tgcataattc
     2941 aattgttatg atccagaaga aaattagagc taaatattac cgtaagcagt atttgcaaat
     3001 atctcaggcc atcaagtatt tgcagaacaa catcaaaggt ttcatcattc gtcaacgcgt
     3061 taatgatgaa atgaaagtta actgtgcaac tttattacag gccgcttaca ggggtcattc
     3121 catccgtgcc aatgtgttca gcgtattgag aacaattaca aatttgcaaa agaaaattag
     3181 aaaggaacta aaacaaagac aactgaaaca agaacatgaa tataatgctg cggtaactat
     3241 tcaaagtaaa gttaggacct ttgagccgag atcgagattt ttacgcacta aaaaagacac
     3301 tgttgttgtc caatctttga tcagaagaag agctgctcaa aggaaattga aacaattgaa
     3361 ggcagacgct aaatcagtta atcatctgaa agaagtgagc tataaattag agaataaagt
     3421 gattgaactg acgcagaatc tagcatccaa ggtcaaagaa aataaagaaa tgacagaaag
     3481 aattaaagaa ctacaggttc aagtggaaga aagtgccaag ttacaagaga cattagaaaa
     3541 tatgaaaaaa gagcacttaa tagatattga taatcagaaa tctaaggata tggaattaca
     3601 aaaaactatt gagaacaatt tgcaatccac tgaacaaact ctaaaggacg ctcaattaga
     3661 gttggaggac atggttaaac aacatgatga attgaaagaa gaatctaaaa agcaacttga
     3721 agaattagag caaacaaaga aaacattggt tgaataccag acattaaacg gagacttgca
     3781 aaacgaagtt aaatctttaa aggaagaaat tgctaggtta caaactgcca tgtcgctggg
     3841 caccgttact actagtgtac tacctcaaac accattaaag gatgtaatgg gaggcggtgc
     3901 ttcaaatttc aacaatatga tgcttgagaa ttccgactta tctcctaatg atttgaatct
     3961 aaagtctaga tctactccat cgtccggaaa caaccacatt gattcattga gtgtcgatcg
     4021 cgaaaatggt gtcaatgcta cacaaatcaa tgaagagtta tacaggttat tggaggacac
     4081 tgaaattttg aatcaagaaa tcacggaagg cctgttaaag ggattcgaag taccggatgc
     4141 tggtgtagct attcaactaa gtaaaagaga cgttgtttat ccggctagaa tactgattat
     4201 agttttaagt gaaatgtgga gatttgggct gaccaagcaa agtgaaagct ttcttgccca
     4261 agtattgact acaattcaaa aagttgtcac tcaattgaag ggtaacgatt taattccaag
     4321 cggtgtattc tggttagcaa acgttagaga gttatactca tttgtggtgt ttgctctaaa
     4381 ctctatttta accgaagaaa cgttcaaaaa cggcatgacc gatgaggagt ataaggagta
     4441 tgtttcattg gtcacagaac taaaggatga tttcgaagct ctaagttata atatatataa
     4501 catttggctg aagaaattgc agaagcaatt gcaaaaaaag gccatcaatg ctgtggtcat
     4561 ctccgaatca ttaccaggtt tcagcgcggg agaaaccagc gggtttttga acaaaatttt
     4621 tgctaacact gaagaatata caatggacga cattttgacc tttttcaaca gcatatactg
     4681 gtgcatgaaa tcttttcata ttgagaatga agtgttccat gctgtagtca caaccttatt
     4741 gaattatgtg gatgcaattt gttttaacga attaatcatg aaacgtaatt tcttgtcgtg
     4801 gaaaaggggt cttcaattga actacaacgt tactagatta gaggaatggt gcaagacgca
     4861 tggcttgaca gatggtactg agtgcttaca acatttgatt cagaccgcta agctactgca
     4921 agtccgtaag tatactatcg aagacattga tatcttaaga ggaatttgtt attcgctaac
     4981 acctgcacaa ttgcaaaaat tgatttcaca ataccaggtg gcagactatg agtctccaat
     5041 tccacaggaa atcttaagat acgttgctga tatagttaag aaagaagctg cgttatcttc
     5101 atcaggtaat gattctaagg gtcacgagca tagcagcagt atatttatca ctccagaaac
     5161 aggtccattt actgacccat tcagtttgat aaagacaaga aaatttgacc aagtagaagc
     5221 ctatatacca gcgtggttat ccttgccctc aactaagaga atagttgacc ttgttgccca
     5281 acaagtcgtt caagacggcc actaaaactg atggcgcgag aaacaaaatt gtacatgaat
     5341 gctaaaaaaa gaaatgacaa aaaaagagaa aaaaaaaaat gaaactacat agttaattaa
     5401 taatagaagt atttgtcaat agtatgataa tgaaatcgat attatggaag atattaaccg
     5461 cgcgccgtat tagtgtacac tatattaaac tacattttgc ttcttactga atttataaat
     5521 tatgattata ttattattac tattatgact actgtatata tttttagaat tagatcggga
     5581 accgatgagc gttagctgaa atggacgacg ataaggaacg ataattacca ctagtaaaat
     5641 aataacaact aagaataaac acattctcat tttta
//
LOCUS       BSURPOF      3078 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB),
            complete cds.
ACCESSION   M34995
KEYWORDS    RNA polymerase; minor sigma-37 factor.
SOURCE      B.subtilis (strains 168 and Marburg), clones
            lambda-[21,gt11,1,11,21,31].
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 368 to 2297)
  AUTHORS   Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W.
  TITLE     Gene encoding the 37,000-dalton minor sigma factor of Bacillus
            subtilis RNA polymerase: Isolation, nucleotide sequence,
            chromosomal locus, and cryptic function
  JOURNAL   J. Bacteriol. 169, 771-778 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3078)
  AUTHORS   Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W.
  TITLE     Similar organization of the sigB and spoIIA operons encoding
            alternate sigma factors of Bacillus subtilis RNA polymerase
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.W.Price, 18-FEB-1987.  Draft entry and computer-readable
            sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990.
FEATURES       from  to/span     description
    pept        200      529     ORF V
    pept        526     1008     ORF W
    pept        968     1762     37 kd minor sigma factor (rpoF, sigB; ttg
                                 start codon)
    pept       1762     2361     ORF X
    mRNA        131     2368     sigB operon mRNA
    signal      131      168     sigB operon promoter
    signal     2368     2401     sigB transcription termination signal
    binding     188      194     ORF V ribosomal binding site
    binding     511      519     ORF W ribosomal binding site
    binding     954      961     sigB ribosomal binding site
    binding    1750     1753     ORF X ribosomal binding site
BASE COUNT      999 a    575 c    755 g    749 t
ORIGIN      40 degrees on B.subtilis map.
        1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt
       61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt
      121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg
      181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat
      241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct
      301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat
      361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg
      421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt
      481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct
      541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg
      601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc
      661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg
      721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa
      781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac
      841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat
      901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg
      961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga
     1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg
     1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca
     1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc
     1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg
     1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc
     1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga
     1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg
     1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac
     1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca
     1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct
     1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat
     1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga
     1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta
     1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct
     1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac
     1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc
     1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg
     2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt
     2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc
     2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata
     2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta
     2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat
     2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt
     2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg
     2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac
     2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga
     2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac
     2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca
     2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag
     2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa
     2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt
     2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt
     2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc
     3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg
     3061 caaactgctg cgaagctt
//
LOCUS       HUMERPR      1624 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human erythropoietin receptor mRNA, complete cds.
ACCESSION   M34986
KEYWORDS    erythropoietin receptor.
SOURCE      Human cell line OCI-M1, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1624)
  AUTHORS   Ehrenman,K. and St John,T.
  TITLE     Sequence of the human erythropoietin gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.St.John, 07-JUN-1990.
            
              Author address: T.St.John
              Hutchinson Cancer Res Center
              1124 Columbia St.
              Seattle, WA
              Email: TSTJOHN@FHCRCVAX
            
FEATURES       from  to/span     description
    pept          1     1527     erythropoietin receptor precursor
    sigp          1       72     erythropoietin receptor signal peptide
    matp         73     1524     erythropoietin receptor
BASE COUNT      276 a    537 c    498 g    313 t
ORIGIN      
        1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct
       61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc
      121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg
      181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc
      241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt
      301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta
      361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat
      421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc
      481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac
      541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag
      601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc
      661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg
      721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg
      781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag
      841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac
      901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc
      961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg
     1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc
     1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg
     1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc
     1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct
     1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca
     1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct
     1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta
     1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg
     1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata
     1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc
     1621 agga
//
LOCUS       HUMGFP40H    4379 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human P40 T-cell and mast cell growth factor (hP40) gene, complete
            cds.
ACCESSION   M30135
KEYWORDS    T-cell growth factor; cytokine; mast cell growth factor;
            megakeryoblastic leukemia cell growth factor.
SOURCE      Human (cell line CESS) DNA, clone lambda-H40.3a1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 4379)
  AUTHORS   Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
            Van Snick,J.
  TITLE     Human P40/Il9: Expression in activated CD4+ cells, genomic
            organization, and comparison with the mouse gene
  JOURNAL   J. Immunol. 144, 4235-4241 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Van Snick, 12-DEC-1989.
FEATURES       from  to/span     description
    pept        796      909     P40 T-cell and mast cell growth factor
                                 precursor, exon 1
               1022     1057     P40 T-cell and mast cell growth factor
                                 precursor, exon 2
               1140     1172     P40 T-cell and mast cell growth factor
                                 precursor, exon 3
               2461     2592     P40 T-cell and mast cell growth factor
                                 precursor, exon 4
               4096     4215     P40 T-cell and mast cell growth factor
                                 precursor, exon 5
    sigp        796      849     P40 T-cell and mast cell growth factor signal
                                 peptide
    matp        850      909     P40 T-cell and mast cell growth factor
               1022     1057     P40 T-cell and mast cell growth factor
               1140     1172     P40 T-cell and mast cell growth factor
               2461     2592     P40 T-cell and mast cell growth factor
               4096     4212     P40 T-cell and mast cell growth factor
    IVS         910     1021     hP40 intron A
    IVS        1058     1139     hP40 intron B
    IVS        1173     2460     hP40 intron C
    IVS        2593     4095     hP40 intron D
    signal      742      747     TATA box
    signal     4354     4359     polyA signal
    rpt        2965     3265     Alu repeat
    binding     370      376     AP-2 binding site
    binding     624      629     IRF-1 binding site
    binding     627      633     AP-1 binding site
BASE COUNT     1208 a    921 c    904 g   1346 t
ORIGIN      
        1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt
       61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc
      121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca
      181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct
      241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact
      301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc
      361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag
      421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa
      481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt
      541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt
      601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa
      661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt
      721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc
      781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc
      841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac
      901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat
      961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta
     1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac
     1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg
     1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg
     1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat
     1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt
     1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt
     1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag
     1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa
     1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc
     1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg
     1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc
     1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc
     1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat
     1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac
     1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc
     1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg
     1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt
     2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc
     2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga
     2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc
     2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa
     2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa
     2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt
     2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag
     2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg
     2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac
     2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt
     2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc
     2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc
     2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct
     2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag
     2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa
     2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg
     3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg
     3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat
     3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca
     3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc
     3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt
     3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac
     3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg
     3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc
     3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt
     3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca
     3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca
     3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact
     3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga
     3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt
     3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc
     3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc
     3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg
     4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa
     4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac
     4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg
     4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc
     4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga
     4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa
//
LOCUS       MEFDNABP     1710 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  M.fervidus DNA binding protein HMfB gene, complete cds.
ACCESSION   M34778
KEYWORDS    DNA binding protein HMfB.
SOURCE      M.fervidus DNA.
  ORGANISM  Methanothermus fervidus
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Methanobacteriales; Methanothermaceae.
REFERENCE   1  (sites; for [2])
  AUTHORS   Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
  TITLE     HMf, a DNA binding protein isolated from the hypothermophilic
            archaeon Methanothermus fervidus, is most closely related to
            histones
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5788-5791 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 1710)
  AUTHORS   Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.Sandman, 01-JUN-1990.
            
               Author address: K.Sandman
               Department of Microbiology
               Ohio State University
               484 W. 12th Ave.
               Columbus, Ohio 43210
               E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU
FEATURES       from  to/span     description
    pept        578      787     DNA binding protein HMfB
    pept       1662      805 (c) ORF285
    signal      450      457     promoter
    signal      523      530     promoter
    signal      790      814     transcription termination signal
BASE COUNT      619 a    263 c    265 g    561 t      2 others
ORIGIN      1 bp upstream of HindIII site.
        1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa
       61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag
      121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag
      181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg
      241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat
      301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat
      361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa
      421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt
      481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt
      541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat
      601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct
      661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag
      721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa
      781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat
      841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc
      901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt
      961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat
     1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa
     1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg
     1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata
     1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc
     1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc
     1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc
     1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat
     1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc
     1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt
     1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt
     1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa
     1681 gttttgtttt acccattgct tctaaatatt
//
LOCUS       MUSINT1A     5607 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds.
ACCESSION   K02593 M34750
KEYWORDS    Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene.
SOURCE      Mouse (BALB/c) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1098 to 5606)
  AUTHORS   Van Ooyen,A. and Nusse,R.
  TITLE     Structure and nucleotide sequence of the putative mammary oncogene
            int-1; Proviral insertions leave the protein-encoding domain intact
  JOURNAL   Cell 39, 233-240 (1984)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 5607)
  AUTHORS   Nusse,R., Theunissen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A.,
            Otte,A., Schuuring,E. and van Ooyen,A.
  TITLE     The Wnt-1 (int-1) oncogene promoter and its mechanism of activation
            by insertion of proviral DNA of the mouse mammary tumor virus
  JOURNAL   Mol. Cell. Biol. 10, 4170-4179 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer readable sequence for [2] kindly submitted
            by R.Nusse, 31-MAY-1990.
FEATURES       from  to/span     description
    pept       1779     1882     Wnt-1 protein (exon 1)
               2452     2705     Wnt-1 protein (exon 2)
               3279     3544     Wnt-1 protein (exon 3)
               4002     4490     Wnt-1 protein (exon 4)
    IVS        1883     2451     Wnt-1 cds intron A
    IVS        2706     3278     Wnt-1 cds intron B
    IVS        3545     4001     Wnt-1 cds intron C
    site       1133     1134     MMTV insertion site in tumor 17 [1]
    site       1394     1395     MMTV insertion site in tumor 35 [1]
    site       1631     1632     MMTV insertion site in tumor 102 [1]
    site       4495     4496     MMTV insertion site in tumor 53 [1]
    mRNA       1428     5398     Wnt-1 mRNA (alt.)
    mRNA       1595     5398     Wnt-1 mRNA (alt.)
    signal     1562     1566     TATA box
    signal     1573     1577     TATA box
BASE COUNT     1188 a   1681 c   1519 g   1219 t
ORIGIN      Chromosome 15.
        1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc
       61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac
      121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac
      181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa
      241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg
      301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg
      361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac
      421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct
      481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact
      541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac
      601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag
      661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg
      721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga
      781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca
      841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact
      901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc
      961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca
     1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg
     1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga
     1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg
     1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg
     1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc
     1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc
     1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg
     1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt
     1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa
     1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag
     1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact
     1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt
     1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc
     1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg
     1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg
     1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag
     1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct
     2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc
     2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct
     2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt
     2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg
     2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg
     2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg
     2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt
     2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga
     2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat
     2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg
     2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa
     2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc
     2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag
     2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg
     2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg
     2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc
     3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag
     3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag
     3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact
     3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac
     3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat
     3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc
     3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg
     3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc
     3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg
     3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg
     3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag
     3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag
     3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt
     3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca
     3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac
     3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt
     3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc
     4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc
     4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc
     4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg
     4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat
     4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt
     4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc
     4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg
     4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc
     4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg
     4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc
     4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga
     4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc
     4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac
     4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac
     4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc
     4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa
     4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc
     5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc
     5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac
     5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga
     5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac
     5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct
     5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat
     5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca
     5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga
     5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat
     5581 gaccctaaat gagaccaact agggatc
//
LOCUS       MUSP40M      3808 bp ds-DNA             ROD       14-AUG-1990
DEFINITION  Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete
            cds.
ACCESSION   M30136
KEYWORDS    T-cell growth factor; cytokine; mast cell growth factor.
SOURCE      Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210
            DNA, clone lambda-S40.1a.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3808)
  AUTHORS   Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and
            Van Snick,J.
  TITLE     Human P40/Il9: Expression in activated CD4+ cells, genomic
            organization, and comparison with the mouse gene
  JOURNAL   J. Immunol. 144, 4235-4241 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.Van Snick, 21-NOV-1989.
FEATURES       from  to/span     description
    pept        644      757     P40 T-cell and mast cell growth factor
                                 precursor, exon 1
                846      881     P40 T-cell and mast cell growth factor
                                 precursor, exon 2
                965      997     P40 T-cell and mast cell growth factor
                                 precursor, exon 3
               2184     2315     P40 T-cell and mast cell growth factor
                                 precursor, exon 4
               3393     3512     P40 T-cell and mast cell growth factor
                                 precursor, exon 5
    sigp        644      697     P40 T-cell and mast cell growth factor signal
                                 peptide
    matp        698      757     P40 T-cell and mast cell growth factor
                846      881     P40 T-cell and mast cell growth factor
                965      997     P40 T-cell and mast cell growth factor
               2184     2315     P40 T-cell and mast cell growth factor
               3393     3509     P40 T-cell and mast cell growth factor
    IVS         758      845     mP40 intron A
    IVS         882      964     mP40 intron B
    IVS         998     2183     mP40 intron C
    IVS        2316     3392     mP40 intron D
    signal      589      594     TATA box
    binding     227      233     AP-2 binding site
    binding     475      480     IRF-1 binding site
    binding     478      484     AP-1 binding site
BASE COUNT     1012 a    903 c    869 g   1024 t
ORIGIN      Chromosome 13.
        1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca
       61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt
      121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag
      181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc
      241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga
      301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt
      361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt
      421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga
      481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg
      541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta
      601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat
      661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg
      721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga
      781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac
      841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag
      901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt
      961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc
     1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct
     1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat
     1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga
     1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa
     1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac
     1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac
     1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc
     1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca
     1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta
     1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac
     1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc
     1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct
     1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta
     1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat
     1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg
     1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc
     1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg
     2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa
     2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt
     2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt
     2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa
     2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca
     2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc
     2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag
     2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg
     2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct
     2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag
     2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg
     2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag
     2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat
     2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc
     2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct
     2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc
     3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga
     3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg
     3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga
     3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa
     3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt
     3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc
     3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc
     3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga
     3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg
     3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag
     3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga
     3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac
     3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat
     3781 aggtaggatg gagcaaagac ttggtaat
//
LOCUS       PIGMHDQAA    1105 bp ss-mRNA            MAM       14-AUG-1990
DEFINITION  Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete
            cds.
ACCESSION   M29938
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone
            ZQ30-1.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1105)
  AUTHORS   Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
            LeGuern,C.
  TITLE     Class II genes of miniature swine: III. Characterization of an
            expressed pig class II gene homologous to HLA-DQA
  JOURNAL   Immunogenetics 31, 52-56 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept         13      777     MHC SLA-DQ alpha chain precursor
    sigp         13       81     MHC SLA-DQ alpha chain signal peptide
    matp         82      774     MHC SLA-DQ alpha chain
    mRNA    <     1     1105     MHC SLA-DQ mRNA
BASE COUNT      254 a    290 c    255 g    306 t
ORIGIN      
        1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc
       61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta
      121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag
      181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa
      241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg
      301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact
      361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac
      421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt
      481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc
      541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat
      601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag
      661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc
      721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc
      781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa
      841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc
      901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg
      961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc
     1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
     1081 aaacaataaa tcacttcatg aggtt
//
LOCUS       PIGMHDQAB    1174 bp ss-mRNA            MAM       14-AUG-1990
DEFINITION  Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete
            cds.
ACCESSION   M29939
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1174)
  AUTHORS   Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and
            LeGuern,C.
  TITLE     Class II genes of miniature swine: III. Characterization of an
            expressed pig class II gene homologous to HLA-DQA
  JOURNAL   Immunogenetics 31, 52-56 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept         69      836     MHC SLA-DQ alpha chain precursor
    sigp         69      137     MHC SLA-DQ alpha chain signal peptide
    matp        138      833     MHC SLA-DQ alpha chain
BASE COUNT      273 a    313 c    266 g    322 t
ORIGIN      
        1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg
       61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga
      121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg
      181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat
      241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta
      301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca
      361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt
      421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca
      481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt
      541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca
      601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata
      661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga
      721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct
      781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca
      841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac
      901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct
      961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga
     1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc
     1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg
     1141 aaacaataaa tcacttcatg aggtttttgt actg
//
LOCUS       PIGMHDQBA     212 bp ds-DNA             MAM       14-AUG-1990
DEFINITION  Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end.
ACCESSION   M29944
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Pig (haplotype c/c) adult spleen DNA, clone D26.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 212)
  AUTHORS   Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F.,
            Gustafsson,K. and LeGuern,C.
  TITLE     Class II genes of miniature swine: II. Molecular identification and
            characterization of B (beta) genes from SLA-c haplotype
  JOURNAL   Immunogenetics 31, 1-6 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.LeGuern, 17-NOV-1989.
FEATURES       from  to/span     description
    pept    <     1       12     MHC SLA-DO beta chain
BASE COUNT       38 a     58 c     52 g     64 t
ORIGIN      
        1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg
       61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc
      121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct
      181 cttacccttc tgcctttctt cctcctactt gt
//
LOCUS       PP2LEND       266 bp ds-DNA             PHG       14-AUG-1990
DEFINITION  Bacteriophage P2 left end DNA.
ACCESSION   M34756
KEYWORDS    .
SOURCE      Bacteriophage P2 DNA.
  ORGANISM  Bacteriophage P2
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 266)
  AUTHORS   Ziermann,R. and Calendar,R.
  TITLE     Characterization of the cos sites of Bacteriophages P2 and P4
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Ziermann, 05-JUN-1990.
            
              Author address: R.Ziermann
              University of California
              Dept. Mol. Cell Biology
              Berkeley, CA 94720
              Email: calendar@garnet.berkeley.edu
            
FEATURES       from  to/span     description
    site          1       19     cohesive end of phage DNA
    site         20      266     left end of phage DNA
BASE COUNT       41 a     58 c     99 g     68 t
ORIGIN      Map position 0.0-0.8%.
        1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct
       61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt
      121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg
      181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc
      241 gagccagccg tttatctcgc ggatcc
//
LOCUS       RCABCHH       314 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  R.capsulata bacteriochlorophyll biosynthesis protein (bchH and
            F108) genes, 3' end, and 5' end respectively.
ACCESSION   M34843
KEYWORDS    .
SOURCE      R.capsulata DNA.
  ORGANISM  Rhodobacter capsulatus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; 
            Purple nonsulfur bacteria.
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Yang,Z. and Bauer,C.E.
  TITLE     Rhodobacter capsulatus genes involved in early steps of the
            bacteriochlorophyll biosynthetic pathway
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.E.Bauer, 04-JUN-1990.
FEATURES       from  to/span     description
    pept    <     1       79     bacteriochlorophyll biosynthesis protein
                                 (bchH) (AA at 2)
    pept        108  >   314     bacteriochlorophyll biosynthesis protein
                                 (F108)
BASE COUNT       59 a     87 c    119 g     49 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga
       61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg
      121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg
      181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg
      241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca
      301 agcgcgtgct gcag
//
LOCUS       TREANTAT      999 bp ds-DNA             ORG       14-AUG-1990
DEFINITION  T.evansi AnTat 3/3 kinetoplast DNA minicircle.
ACCESSION   M34848
KEYWORDS    .
SOURCE      T.evansi AnTat 3/3 kinetoplast minicircle DNA.
  ORGANISM  Kinetoplast Trypanosoma evansi
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae; 
            Trypanosoma evansi.
REFERENCE   1  (bases 1 to 999)
  AUTHORS   Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S.,
            Steinert,M. and Hamers,R.
  TITLE     Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi
  JOURNAL   Mol. Biochem. Parasitol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by E.Bajyana Songa, 05-JUN-1990.
FEATURES       from  to/span     description
    site        367      516     kinetoplast minicircle common region (ORI)
BASE COUNT      391 a     78 c    203 g    327 t
ORIGIN      
        1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt
       61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt
      121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg
      181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt
      241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt
      301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat
      361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg
      421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt
      481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag
      541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag
      601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca
      661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat
      721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac
      781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa
      841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg
      901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat
      961 aaaataatgg ataaaagagt ttataggtga agttgaagt
//
LOCUS       VIBSCRRE     1524 bp ds-DNA             BCT       14-AUG-1990
DEFINITION  V.alginolyticus Scr repressor protein gene (scrR), complete cds,
            and sucrose transport (scrA) protein, 5' end.
ACCESSION   M35009
KEYWORDS    Scr repressor protein; sucrose transport protein.
SOURCE      V.alginolyticus DNA.
  ORGANISM  Vibrio alginolyticus
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Vibrionaceae.
REFERENCE   1  (bases 1 to 1524)
  AUTHORS   Blatch,G.L. and Woods,D.R.
  TITLE     Nucleotide sequence and analysis of the Vibrio alginolyticus Scr
            repressor-encoding gene
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.L.Blatch, 10-JUN-1990.
            
              Author address: G.L.Blatch
              University of Cape Town
              Dept. Microbiology
              Private Bag
              Rondebosch
              7700 SOUTH AFRICA
              Email: bltgre01.uctvax@f4.n494.z5.fidonet.org
            
FEATURES       from  to/span     description
    pept         36  <     1 (c) sucrose transport protein (scrA)
    pept        271      555     Scr repressor protein (scrR)
    pept        791     1090     ORF5
    binding     264      267     scrR ribosomal binding site
    binding     780      785     ORF5 ribosomal binding site
    signal      178      183     scrA -10 region
    signal      203      208     scrA -35 region
    signal      190      195     scrR -10 region
    signal      162      167     scrR -10 region
    signal      762      767     ORF5 -35 region
    signal      742      747     ORF5 -35 region
    site         78       91     repressor operator site (put.)
    site        229      242     repressor operator site (put.)
    site         12       71     ScrR helix-turn-helix motif
BASE COUNT      466 a    344 c    321 g    393 t
ORIGIN      
        1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc
       61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg
      121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt
      181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc
      241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg
      301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa
      361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc
      421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg
      481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt
      541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga
      601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt
      661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac
      721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag
      781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt
      841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga
      901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg
      961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga
     1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta
     1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc
     1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt
     1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt
     1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat
     1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct
     1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca
     1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa
     1501 gccaattgtg atattgaaga agct
//
LOCUS       XELNRP1A     2431 bp ss-mRNA            VRT       14-AUG-1990
DEFINITION  X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA,
            complete cds.
ACCESSION   M34894
KEYWORDS    RNA-binding protein; nervous system-specific protein.
SOURCE      X.laevis stage 24, cDNA to mRNA, clones 24-39.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2431)
  AUTHORS   Richter,K., Good,P.J. and Dawid,I.B.
  TITLE     A developmentally regulated, nervous system-specific gene in
            Xenopus encodes a putative RNA-binding protein
  JOURNAL   New Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.Good, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         74     1117     pot. RNA-binding protein (nrp-1B)
    mRNA    <     1     2431     nrp-1B mRNA
    signal     2407     2418     poly-A signal
    site         22       27     RNP2 consensus sequence
    site         58       68     RNP1 consensus sequence
    site        111      116     RNP2 consensus sequence
    site        147      157     RNP1 consensus sequence
    site        921      941     alternative exon (put.)
BASE COUNT      663 a    522 c    543 g    703 t
ORIGIN      
        1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt
       61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc
      121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact
      181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt
      241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa
      301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt
      361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt
      421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga
      481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac
      541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa
      601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt
      661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat
      721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc
      781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc
      841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc
      901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag
      961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag
     1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc
     1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct
     1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct
     1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga
     1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat
     1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg
     1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat
     1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta
     1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa
     1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat
     1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac
     1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg
     1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc
     1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat
     1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc
     1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga
     1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct
     2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg
     2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc
     2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat
     2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg
     2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca
     2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata
     2401 gttcaaaata aactatgaat tttattcatt t
//
LOCUS       XELNRP1B     2324 bp ss-mRNA            VRT       14-AUG-1990
DEFINITION  X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA,
            complete cds.
ACCESSION   M34895
KEYWORDS    RNA-binding protein; nervous system-specific protein.
SOURCE      X.laevis stage 24, cDNA to mRNA.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 2324)
  AUTHORS   Richter,K., Good,P.J. and Dawid,I.B.
  TITLE     A developmentally regulated, nervous system-specific gene in
            Xenopus encodes a putative RNA-binding protein
  JOURNAL   New Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.J.Good, 05-JUN-1990.
FEATURES       from  to/span     description
    pept         29     1051     pot. RNA-binding protein (nrp-1B)
    mRNA    <     1     2324     nrp-1B mRNA
    signal     2306     2311     poly-A signal
    site         22       27     RNP2 consensus sequence
    site         58       68     RNP1 consensus sequence
    site        111      116     RNP2 consensus sequence
    site        147      157     RNP1 consensus sequence
BASE COUNT      647 a    467 c    520 g    690 t
ORIGIN      
        1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc
       61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac
      121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt
      181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca
      241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga
      301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat
      361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca
      421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg
      481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt
      541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc
      601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat
      661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag
      721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac
      781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg
      841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc
      901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat
      961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat
     1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg
     1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct
     1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat
     1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta
     1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa
     1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa
     1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct
     1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa
     1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc
     1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact
     1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg
     1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt
     1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct
     1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag
     1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag
     1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt
     1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta
     2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac
     2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt
     2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat
     2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa
     2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc
//
LOCUS       YSJSACDHY    2569 bp ds-DNA             PLN       14-AUG-1990
DEFINITION  Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds.
ACCESSION   M34929
KEYWORDS    saccharopine dehydrogenase.
SOURCE      Y.lipolytica (strain W29) DNA.
  ORGANISM  Yarrowia lipolytica
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2569)
  AUTHORS   Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and
            Gaillardin,C.
  TITLE     Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia
            lipolytica
  JOURNAL   Mol. Cell. Biol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.M.Fournier, 06-JUN-1990.
            
            ORF1 is probably not translated, since a lacZ fusion/expression
            experiment did not produce results.
FEATURES       from  to/span     description
    pept        649     1929     ORF1
    pept       1764      655 (c) saccharopine dehydrogenase
    mRNA        582     2060     ORF1 mRNA (alt.)
    mRNA        598     2060     ORF1 mRNA (alt.)
    mRNA        599     2060     ORF1 mRNA (alt.)
    mRNA        602     2060     ORF1 mRNA (alt.)
    mRNA        606     2060     ORF1 mRNA (alt.)
    mRNA        609     2060     ORF1 mRNA (alt.)
    mRNA       1807      639 (c) ORF2 mRNA (alt.)
    mRNA       1809      639 (c) ORF2 mRNA (alt.)
    mRNA       1811      639 (c) ORF2 mRNA (alt.)
    mRNA       1812      639 (c) ORF2 mRNA (alt.)
    mRNA       1814      639 (c) ORF2 mRNA (alt.)
    signal      521      528     TATA box
    signal      565      575     TATA box
BASE COUNT      668 a    573 c    666 g    662 t
ORIGIN      
        1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag
       61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt
      121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca
      181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa
      241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag
      301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc
      361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc
      421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt
      481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca
      541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt
      601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc
      661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt
      721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc
      781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc
      841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac
      901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt
      961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca
     1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt
     1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc
     1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt
     1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata
     1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc
     1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc
     1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct
     1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg
     1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg
     1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc
     1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt
     1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag
     1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg
     1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag
     1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg
     1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga
     1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa
     2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca
     2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg
     2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc
     2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct
     2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag
     2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga
     2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat
     2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat
     2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc
//
LOCUS       HUMVINC      5102 bp ss-mRNA            PRI       14-AUG-1990
DEFINITION  Human vinculin mRNA, complete cds.
ACCESSION   M33308
KEYWORDS    cytoskeletal protein; vinculin.
SOURCE      Human endothelial cells, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 5102)
  AUTHORS   Weller,P.A., Ogryzko,E.P., Corben,E.B., Zhidkova,N.I., Patel,B.,
            Price,G.J., Spurr,N.K., Koteliansky,V.E. and Critchley,D.R.
  TITLE     Complete sequence of human vinculin and assignment of the gene to
            chromosome 10
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 5667-5671 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.A.Weller, 28-MAR-1990.
FEATURES       from  to/span     description
    pept         51     3251     vinculin
BASE COUNT     1379 a   1248 c   1273 g   1202 t
ORIGIN      Chromosome 10.
        1 gaattccact tctctgtcgc ccgcggttcg ccgccccgct cgccgccgcg atgccagtgt
       61 ttcatacgcg cacgatcgag agcatcctgg agccggtggc acagcagatc tcccacctgg
      121 tgataatgca cgaggagggc gaggtggacg gcaaagccat tcctgacctc accgcgcccg
      181 tggccgccgt gcaggcggcc gtcagcaacc tcgtccgggt tggaaaagag actgttcaaa
      241 ccactgagga tcagattttg aagagagata tgccaccagc atttattaag gttgagaatg
      301 cttgcaccaa gcttgtccag gcagctcaga tgcttcagtc agacccttac tcagtgcctg
      361 ctcgagatta tctaattgat gggtcaaggg gcatcctctc tggaacatca gacctgctcc
      421 ttaccttcga tgaggctgag gtccgtaaaa ttattagagt ttgcaaagga attttggaat
      481 atcttacagt ggcagaggtg gtggagacta tggaagattt ggtcacttac acaaagaatc
      541 ttgggccagg aatgactaag atggccaaga tgattgacga gagacagcag gagctcactc
      601 accaggagca ccgagtgatg ttggtgaact cgatgaacac cgtgaaagag ttgctgccag
      661 ttctcatttc agctatgaag atttttgtaa caactaaaaa ctcaaaaaac caaggcatag
      721 aggaagcttt aaaaaatcgc aattttactg tagaaaaaat gagtgctgaa attaatgaga
      781 taattcgtgt gttacaactc acctcttggg atgaagatgc ctgggccagc aaggacactg
      841 aagccatgaa gagagcattg gcctccatag actccaaact gaaccaggcc aaaggttggc
      901 tccgtgaccc tagtgcctcc ccaggggatg ctggtgagca ggccatcaga cagatcttag
      961 atgaagctgg aaaagttggt gaactctgtg caggcaaaga acgcagggag attctgggaa
     1021 cttgcaaaat gctagggcag atgactgatc aagtggctga cctccgtgcc agaggacaag
     1081 gatcctcacc ggtggccatg cagaaagctc agcaggtatc tcagggtctg gatgtgctca
     1141 cagcaaaagt ggaaaatgca gctcgcaagc tggaagccat gaccaactca aagcagagca
     1201 ttgcaaagaa gatcgatgct gctcagaact ggcttgcaga tccaaatggt ggaccggaag
     1261 gagaagagca gattcgaggt gctttggctg aagctcggaa aatagcagaa ttatgtgatg
     1321 atcctaaaga aagagatgac attctacgtt cccttgggga aatatctgct ctgacttcta
     1381 aattagcaga tctacgaaga caggggaaag gagattctcc agaggctcga gccttggcca
     1441 aacaggtggc cacggccctg cagaacctgc agaccaaaac caaccgggct gtggccaaca
     1501 gcagaccggc caaagcagct gtacaccttg agggcaagat tgagcaagca cagcggtgga
     1561 ttgataatcc cacagtggat gaccgtggag tcggtcaggc tgccatccgg gggcttgtgg
     1621 ccgaagggca tcgtctggct aatgttatga tggggcctta tcggcaagat cttctcgcca
     1681 agtgtgaccg agtggaccag ctgacagccc agctggctga cctggctgcc agaggggaag
     1741 gggagagtcc tcaggcacga gcacttgcat ctcagctcca agactcctta aaggatctaa
     1801 aagctcggat gcaggaggcc atgactcagg aagtgtcaga tgttttcagc gataccacaa
     1861 ctcccatcaa gctgttggca gtggcagcca cggcgcctcc tgatgcgcct aacagggaag
     1921 aggtatttga tgagagggca gctaactttg aaaaccattc aggaaagctt ggtgctacgg
     1981 ccgagaaggc ggctgcggtt ggtactgcta ataaatcaac agtggaaggc attcaggcct
     2041 cagtgaagac ggcccgagaa ctcacacccc aggtggtctc ggctgctcgt atcttactta
     2101 ggaaccctgg aaatcaagct gcttatgaac attttgagac catgaagaac cagtggatcg
     2161 ataatgttga aaaaatgaca gggctggtgg acgaagccat tgataccaaa tctctgttgg
     2221 atgcttcaga agaagcaatt aaaaaagacc tggacaagtg caaggtagct atggccaaca
     2281 ttcagcctca gatgctggtt gctggggcaa ccagtattgc tcgtcgggcc aaccggatcc
     2341 tgctggtggc taagagggag gtggagaatt ccgaggatcc caagttccgt gaggctgtga
     2401 aagctgcctc tgatgaattg agcaaaacca tctccccaat ggtgatggat gcaaaagctg
     2461 tggctggaaa catttccgac cctggactgc aaaagagctt cctggactca ggatatcgga
     2521 tcctgggagc tgtggccaag gtcagagaag ccttccaacc tcaggagcct gacttcccgc
     2581 cgcctccacc agaccttgaa caactccgac taacagatga gcttgctcct cccaaaccac
     2641 ctctgcctga aggtgaggtc cctccaccta ggcctccacc accagaggaa aaggatgaag
     2701 agttccctga gcagaaggcc ggggaggtga ttaaccagcc aatgatgatg gctgccagac
     2761 agctccatga tgaagctcgc aaatggtcca gcaagggcaa tgacatcatt gcagcagcca
     2821 agcgcatggc tctgctgatg gctgagatgt ctcggctggt aagagggggc agtggtacca
     2881 agcgggcact cattcagtgt gccaaggaca tcgccaaggc ctcagatgag gtgactcggt
     2941 tggccaagga ggttgccaag cagtgcacag ataaacggat tagaaccaac ctcttacagg
     3001 tatgtgagcg aatcccaacc ataagcaccc agctcaaaat cctgtccaca gtgaaggcca
     3061 ccatgctggg ccggaccaac atcagtgatg aggagtctga gcaggccaca gagatgctgg
     3121 ttcacaatgc ccagaacctc atgcagtctg tgaaggagac tgtgcgggaa gctgaagctg
     3181 cttcaatcaa aattcgaaca gatgctggat ttacactgcg ctgggttaga aagactccct
     3241 ggtaccagta ggcacctggc tgagcctggc tggcacagaa acctctacta aaaagaagga
     3301 aaatgatctg agtcccagga gctgcccaga gttgctggga gctgaaaaat cacatcctgg
     3361 cctggcacat cagaaaggaa tgggggcctc ttcaaattag aagacattta tactcttttt
     3421 tcatggacac tttgaaatgt gtttctgtat aaagcctgta ttctcaaaca cagttacact
     3481 tgtgcaccct ctatcccaat aggcagactg ggtttctagc ccatggactt cacataagct
     3541 cagaatccaa gtgaacacta gccagacact ctgctctgcc cttgttccct aggggacact
     3601 tccctctgtt tctctttcct tggctcccat tcactcttcc agaatcccaa gacccagggc
     3661 ccaggcaaat cagttactaa gaagaaaatt gctgtgcctc ccaaaattgt tttgagcttt
     3721 ccatgttgct gccaaccata ccttccttcc ctgggctgtg ctacctgggt ccttttcaga
     3781 agtgagcttt gctgctacag gggaaggtgg cctctgtgga gccccagcat atgggggcct
     3841 ggattcattt cctgcccttc ctcagtttaa tccttctagt ttcccacaat ataaaactgt
     3901 acttcactgt caggaagaaa tcacagaatc atatgattct gcttttacca tgcccctgag
     3961 caatgtctgt gctagggaaa ctccccgtcc catatcctgc ctcagcccgc caaggtagcc
     4021 atcccatgaa cacactgtgt cctggtgctc tctgccactg gaagggcaga gtagccaggg
     4081 tgtggccctg ccatcttccc agcagggcca ctcccggcac tccatgctta gtcactgcct
     4141 gcagaggtct gtgctgaggc cttatcattc attcttagct cttaattgtt cattttgagc
     4201 tgaaatgctg cattttaatt ttaaccaaaa catgtctcct atatcctggt ttttgtagcc
     4261 ttcctccaca tcctttctaa acaagatttt aaagacatgt aggtgtttgt tcatctgtaa
     4321 ctctaaaaga tcctttttaa attcagtcct aagaaagagg agtgcttgtc ccctaagagt
     4381 gtttaatggc aaggcagccc tgtctgaagg acacttcctg cctaagggag agtggtattt
     4441 gcagactaga attctagtgc tgctgaagat gaatcaatgg gaaatactac tcctgtaatt
     4501 cctacctccc tgcaaccaac tacaaccaag ctctctgcat ctactcccaa gtatggggtt
     4561 caagagagta atgggtttca tatttcttat caccacagta agttcctact aggcaaaatg
     4621 agagggcagt gtttcctttt tggtacttat tactgctaag tatttcccag cacatgaaac
     4681 cttatttttt ccaaagccag aaccagatga gtaaaggagt aagaaccttg cctgaacatc
     4741 cttccttccc acccatcgct gtgtgttagt tcccaacatc gaatgtgtac aacttaagtt
     4801 ggtcctttac actcaggctt tcactatttc ctttaaaatg aggatgatta ttttcaaggc
     4861 cctcagcata tttgtatagt tgcttgcctg atataaatgc aatattaatg cctttaaagt
     4921 atgaatctat gccaaagatc acttgttgtt ttactaaaga aagattactt agaggaaata
     4981 agaaaaatca tgtttgctct cccggttctt ccagtggttt gagacactgg tttacacttt
     5041 atgccggatg tgcttttctc caatatcagt gctcgagaca cagtgaagca aattaaaaaa
     5101 aa
//
LOCUS       HUMPSBGA01    134 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 1.
ACCESSION   M32622
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     1 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 134; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept         50  +   113     pregnancy-specific beta-1-glycoprotein 4
                                 (PSG4) precursor, exon 1
    sigp         50  +   113     pregnancy-specific beta-1-glycoprotein 4
                                 signal peptide
    IVS         114  >   134     PSG4 intron A
BASE COUNT       33 a     44 c     36 g     21 t
ORIGIN      Chromosome 19q13.2-13.3.
        1 ggatcctagg cttatctcca cagaggagaa cacacaagca gcagagacca tggggcccct
       61 ctcagcccct ccctgcacac acctcatcac ctggaagggg gtcctgctca caggtgagga
      121 gagaacttcc tggg
//
LOCUS       HUMPSBGA02   1010 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 2.
ACCESSION   M32623 M22436 D51537
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     2 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2] and hsCGM4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1010)
  AUTHORS   Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
            Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
            Zimmermann,W.
  TITLE     Analysis of the size of the carcinoembryonic antigen (CEA) gene
            family: Isolation and sequencing of N-terminal domain exons
  JOURNAL   Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 1010; for [2])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Thompson, 07-FEB-1989; for [2],[3], 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +   468  +   833     pregnancy-specific beta-1-glycoprotein 4
                                 (PSG4) precursor, exon 2
    sigp    +   468      505     pregnancy-specific beta-1-glycoprotein 4
                                 signal peptide
    matp        506  +   833     pregnancy-specific beta-1-glycoprotein 4
    IVS     <     1      467     PSG4 intron A
    IVS         834  >  1010     PSG4 intron B
BASE COUNT      281 a    225 c    267 g    237 t
ORIGIN      Chromosome 19q13.2-13.3, about 650 bp after segment 1.
        1 gagctcacac agtcatcggg gggggaagac agacatgcta agtgatctag aatgtgagat
       61 caggtgttga caagaaccct ggagggagga gagcagggaa aggtcagaaa gggaagaccc
      121 agggtctctg aaggaggtat caggaaagaa gtctaaggat gccctgatgt gagcaggacc
      181 tgagggcagt gtggaggggg ccgtgcggac cctggggaag aggaatccaa aaagaaaaat
      241 gccaaggtca gaagtgttga aggaatgggg gtcatgctgc tgatcttgac ctagtgggac
      301 agtaggacac acacacatac actcacgccc ctttagtgtg tgtatgtgtt tgtatgtgtg
      361 tgtttgtgtg tcttcaaggc tgaggattga agagaccttc tcaggaccca tcttttcacc
      421 ccaatacata ggtctcaata ttaactgatg ctctctgtac ctcctagcat cacttttaaa
      481 cttctggaat ccgcccacaa ctgcccaagt cacgattgaa gcccagccac ccaaagtttc
      541 tgaggggaag gatgttcttc tacttgtcca caatttgccc cagaatcttg ctggctacat
      601 ttggtacaaa gggcaaatga catacctcta ccattacatt acatcatatg tagtagacgg
      661 tcaaagaatt atatatgggc ctgcatacag tggaagagaa agagtatatt ccaatgcatc
      721 cctgctgatc cagaatgtca cgcaggagga tgcaggatcc tacaccttac acatcataaa
      781 gcgacgcgat gggactggag gagtaactgg acatttcacc ttcaccttac accgtgagtg
      841 attccacatg atccctgggt gttgggggac aggggtcact tctacttcac acacacagga
      901 ttctcaggcc tggactctgc ctgtgtccct ctctgcatta agtccatgct ggggtttggg
      961 catttagtgc aggacacaca gaggagacaa atttcaacag atcagaattc
//
LOCUS       HUMPSBGA03    430 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 3.
ACCESSION   M32624
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     3 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 430; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +   124  +   402     pregnancy-specific beta-1-glycoprotein 4
                                 (PSG4) precursor, exon 3
    matp    +   124  +   402     pregnancy-specific beta-1-glycoprotein 4
    IVS     <     1      123     PSG4 intron B
    IVS         403  >   430     PSG4 intron C
BASE COUNT      108 a    117 c    103 g    102 t
ORIGIN      Chromosome 19q13.2-13.3, about 5500 bp after segment 2.
        1 ttccatgggc ttttaaggac ttgggtgggc tgagaggtgg gaaatgccaa ctctgattga
       61 aaaatgcctt tggaagaatc aaagatgcca cacagggcaa acctctctct gttttctgca
      121 cagtggagac tcccaagccc tccatctcca gcagcaactt aaatcccagg gaggccatgg
      181 aggctgtgat cttaacctgt gatcctgcga ctccacccgc aagctaccag tggtggatga
      241 atggtcagag cctccctatg actcacaggt tgcagctgtc caaaaccaac aggaccctct
      301 ttatatttgg tgtcacaaag tatattgcag gaccctatga atgtgaaata cggaacccag
      361 tgagtgccag ccgcagtgac ccagtcaccc tgaatctcct ccgtgagtat cctctgttcc
      421 tctgtgagcc
//
LOCUS       HUMPSBGA04    573 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 4.
ACCESSION   M32625
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     4 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 573; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +   181      365     pregnancy-specific beta-1-glycoprotein 4
                                 (PSG4) precursor, exon 4
                369  +   435     pregnancy-specific beta-1-glycoprotein 4
                                 (PSG4) precursor, exon 4
    matp    +   181  +   435     pregnancy-specific beta-1-glycoprotein 4
    IVS     <     1      180     PSG4 intron C
    IVS         436  >   573     PSG4 intron D
    site        366      368     internal stop codon
BASE COUNT      146 a    149 c    132 g    146 t
ORIGIN      Chromosome 19q13.2-13.3, about 270 bp after segment 3.
        1 agcttgtgat gggagaaaca ggtgtaaggt ctcaggctcc agatcagtga acacagcggg
       61 gatttggctg ggacttcagt gttgcgactt ggctcacagg gtcactgtgg cccttccaca
      121 gaccaggatt ttcccttccc tctgacaatg tcacctgtga ctttattctc tttgctccag
      181 atggcctgga tgcccccacc atttcttcct catacaccta ttaccataca ggggaattcc
      241 ccaatctctc ctgcctcaca gactctcacc cactggcaga gcattcttgg ctgattgatg
      301 ggaagttcca gcaatcagca caagtgtgct ttatccccca aatcactaaa acatatagag
      361 gggtttaggt ctgtttcatc cataactcag ccactgatgg aacaaatctc ataatcaaga
      421 ggatcatagt gcctggtaag tggatccctg gagcattggc aatatgtttt ccagtgaagc
      481 tatctatgct atcagggaaa agccaccgtc cctcgtcaaa gggagaggga aaatcaaaaa
      541 cccagcacag ggaatatgtt tcgtctccgg ggg
//
LOCUS       HUMPSBGA05    647 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 5.
ACCESSION   M32626
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     5 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 647; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +   158  +   436     pregnancy-specific beta-1-glycoprotein 4
                                 (PSG4) precursor, exon 5
    matp    +   158  +   436     pregnancy-specific beta-1-glycoprotein 4
    IVS     <     1      157     PSG4 intron D
    IVS         437  >   647     PSG4 intron E
BASE COUNT      183 a    168 c    147 g    149 t
ORIGIN      Chromosome 19q13.2-13.3, about 1800 bp after segment 4.
        1 gatcatcatg catctatctt gtgacagaag caccagctat tggctttcaa ggactcgggt
       61 gggctgagag gtgggagatg ccaactctga ttgaagtatg cctgtggagg aatcaaaggt
      121 gccacacagg acaatcttct ctctgttatc cacacagcaa agctgtccaa gccctacatc
      181 acaatcaaca acttaaaccc cagagagaat aaggatgtct taaccttcac ctgtgaacct
      241 aagagtaaga actacaccta catttggtgg ctaaatggtc agagcctccc tgtcagtccc
      301 agggtaaagc gacccattga aaacaggatc ctcattctac ccaatgtcac gagaaatgaa
      361 acaggacctt atcaatgtga aatacgggac cgatatggtg gcatccgcag tgacccagtc
      421 accctgaatg tcctctgtga gtatcttttt gttcctctgt gggccaggac accagcttaa
      481 atccaaacca ccagaggcca ggtctctcag tctctctcca gtccaagtat agacaccttt
      541 acttctggac atccgaggtg gccatgactc cctgccctgg gaaaacctag gtaggcacag
      601 cttaaccaag aatataaggg gaggggacgc tcttgtcatg gagactt
//
LOCUS       HUMPSBGA06    558 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 6.
ACCESSION   M32627
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     6 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 558; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +    45  /   299     pregnancy-specific beta-1-glycoprotein 4
                                 (PSG4) precursor, exon 6
    matp    +    45  /   299     pregnancy-specific beta-1-glycoprotein 4
    IVS     <     1       44     PSG4 intron E
    IVS         300  >   558     PSG4 intron F
BASE COUNT      161 a    144 c    103 g    150 t
ORIGIN      Chromosome 19q13.2-13.3, about 200 bp after segment 5.
        1 ttcctcggat gacatcacct atggctttat tctctttgcg ccagatggtc cagacctccc
       61 cagcatttac ccttcattca cctattaccg ttcaggagaa aacctctact tgtcctgctt
      121 cgccgagtct aacccacggg cacaatattc ttggacaatt aatgggaagt ttcagctatc
      181 aggacaaaag ctctctatcc cccaaataac tacaaagcat agtgggctct atgcttgctc
      241 tgttcgtaac tcagccactg gcaaggaaag ctccaaatcc atcacagtca aagtctctgg
      301 taagtggatc ccagcatcct tggcaatagg gatttaggtg gagtctatct ggcattcagg
      361 gaagagtgag gaaaacattt ttattcccag cctgtgtccc atgggcacaa gcaaatccca
      421 aattctactc ctgaacactc ccaatttgtc taagaacttc caaaacttta acaaacaggc
      481 tgatatcttc ttaaaattcc cagcctagac caagcaggga gaacattgat ttcattgaaa
      541 taattgacaa tattgagg
//
LOCUS       HUMPSBGA07    621 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 1.
ACCESSION   M32628
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     7 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 621; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept        360  +   423     pregnancy-specific beta-1-glycoprotein 5
                                 (PSG5) precursor, exon 1
    sigp        360  +   423     pregnancy-specific beta-1-glycoprotein 5
                                 signal peptide
    IVS         424  >   621     PSG5 intron A
BASE COUNT      176 a    135 c    193 g    113 t      4 others
ORIGIN      Chromosome 19q13.2-13.3, about 7000 bp after segment 6.
        1 tatgtgtgtg actgtgtggg tctgtgtgtg tctcttctgt gtgtgtgtgt ctgcacaaag
       61 tgtggttgag gtttggtgaa agaatcactg ctgaaaaatg cagaggcctc cacaattccc
      121 agggacctga aacacagaca aaaggaaaaa cagaaggagg acaaggaggc aggactgaga
      181 gaggagggga cagagaggtg tcctgggcct gaccccaccc atgagcctga gaagtgctcc
      241 tgccctggag agaggctcag cacagaagga ggaaggacag cacagcctac agccgtgctc
      301 aggaagtttc tggatcctag gctcagctcc acagaggaga acacgcaggc gcagagacca
      361 tggggcccct ctcagcccct ccctgcacac agcacatcac ctggaagggg ctcctgctca
      421 caggtgagga gagaacttcc tgggagagga caggaggagg aagaagagtg actggattgg
      481 gtctcctgga gaggatnnnn ttctaaaaaa taaaagaagt ccagcacttt gggaggatga
      541 ggtgggtgga ttatgagatc aggagttcaa ggtcagtcct gccaacacag tgaagccctg
      601 tctctactaa aaatacaaaa a
//
LOCUS       HUMPSBGA08    173 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, intron
            A.
ACCESSION   M32629
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     8 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 173; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    IVS     <     1  >   173     pregnancy-specific beta-1-glycoprotein 5
                                 intron A
BASE COUNT       56 a     29 c     62 g     26 t
ORIGIN      Chromosome 19q13.2-13.3, about 560 bp after segment 7.
        1 gagctcacgc tgtcatgggg aggaagacag acatgcaaag agatctagaa tgtgaggtca
       61 ggtgttgaca agaaccctgg agggagcaga gcagggaaag gtcagaaagg gaagacccag
      121 ggtctctgaa ggacgtgtca ggaaagaagt ctagagatgc cctgatgtga gca
//
LOCUS       HUMPSBGA09    547 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 2.
ACCESSION   M32630
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     9 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 547; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +     9  +   374     pregnancy-specific beta-1-glycoprotein 5
                                 (PSG5) precursor, exon 2
    sigp    +     9       47     pregnancy-specific beta-1-glycoprotein 5
                                 signal peptide
    matp         48  +   374     pregnancy-specific beta-1-glycoprotein 5
    IVS     <     1        8     PSG5 intron A
    IVS         375  >   547     PSG5 intron B
BASE COUNT      149 a    139 c    117 g    142 t
ORIGIN      Chromosome 19q13.2-13.3, about 300 bp after segment 8.
        1 cctcctagca tcacttttaa acttctggaa cctgcctatc actgctcaag tcacgattga
       61 agccctgcca cccaaagttt ccgaggggaa ggatgttctt ctacttgtcc acaatttgcc
      121 tcagaatctt gctggctaca tctggtacaa aggacaactg atggacctct accattacat
      181 tacatcatat gtagtagacg gtcaaataaa tatatatggg cctgcataca ctggacgaga
      241 aacagtatat tccaatgcat ccctgctgat ccagaatgtc acccgggaag acgcaggatc
      301 ctacacctta cacatcataa agcgaggtga taggactaga ggagtaactg gatatttcac
      361 cttcaactta taccgtgagt gattccacat gatccctggg tgttggggga caggggtcat
      421 ttctacttca cacacacagg attgtcaggc ctgcactgtg cctgtgtccc tctctgcatt
      481 atgtcccatg ctgcggtttg ggcatttagt gcaggacaca cacagaggag acaaatttca
      541 acagatc
//
LOCUS       HUMPSBGA10    458 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 3.
ACCESSION   M32631
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     10 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 458; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +   167  +   445     pregnancy-specific beta-1-glycoprotein 5
                                 (PSG5) precursor, exon 3
    matp    +   167  +   445     pregnancy-specific beta-1-glycoprotein 5
    IVS     <     1      166     PSG5 intron B
    IVS         446  >   458     PSG5 intron C
BASE COUNT      117 a    122 c    108 g    111 t
ORIGIN      Chromosome 19q13.2-13.3, about 5200 bp after segment 9.
        1 gagctctcag attgccatgc atctgtcttg taacacacac acctgccatg ggcttttaag
       61 gacttgggtg ggctgacagg tgggaaatgc caactctgat tgaaaagtgc ctttggagga
      121 atcaaaggtg ccacacaggg caatcttctc tctgttttct gcaaagtgga aactcccaag
      181 ccctccatct ccagtagcaa cttaaacccc agggaggcca tggaggctgt gatcttaacc
      241 tgtgatcctg cggcttcaga cacaagctac ctgtggtgga tgaatggtca gagcctccct
      301 atgactcaca ggttgcagct gtccaaaacc aacaggaccc tctttatatt tggtgtcaca
      361 aagtatattg caggacccta tgaatgtgaa atacggaacc cagtgagtgc cagccacagt
      421 gacccagtca ccctgaatct cctccgtgag tatcctct
//
LOCUS       HUMPSBGA11    359 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 4.
ACCESSION   M32632
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     11 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 359; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +    95      267     pregnancy-specific beta-1-glycoprotein 5
                                 (PSG5) precursor, exon 4
                272  +   347     pregnancy-specific beta-1-glycoprotein 5
                                 (PSG5) precursor, exon 4
    matp    +    95  +   347     pregnancy-specific beta-1-glycoprotein 5
    IVS     <     1       94     PSG5 intron C
    IVS         348  >   359     PSG5 intron D
    site        268      270     internal stop codon
BASE COUNT       89 a    108 c     65 g     97 t
ORIGIN      Chromosome 19q13.2-13.3, about 300 bp after segment 10.
        1 acttggctca cagggtcact gtggcccttc cacagaccag gattttccct tccctctgac
       61 aatgtcacct gtgactttat tctctttgct ccagatggcc tggatgcccc caccatttct
      121 tcctcataca cctattacca tacaggggaa gtccccaatc tctcctgcct cacagactct
      181 cacccactgg cagagcattc ttggctgatt gatgggaagt tccagcaatc agcacaagtg
      241 tgctttattc cccaaatcac caaaacatag agggatctat gtctgtttca tccataactc
      301 agccactgat ggaacaaatc tcataatcaa gaggatcata gtccctggta agtggatcc
//
LOCUS       HUMPSBGA12    381 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 5.
ACCESSION   M32633
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     12 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 381; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +    33  +   311     pregnancy-specific beta-1-glycoprotein 5
                                 (PSG5) precursor, exon 5
    matp    +    33  +   311     pregnancy-specific beta-1-glycoprotein 5
    IVS     <     1       32     PSG5 intron D
    IVS         312  >   381     PSG5 intron E
BASE COUNT      111 a    105 c     81 g     84 t
ORIGIN      Chromosome 19q13.2-13.3, about 2000 bp after segment 11.
        1 acaggacaat cttctctctg ttatccacac agtgaagctg cccaagccct acatcaccat
       61 caacaactca aaacccaggg agaataagga tgtcttagcc ttcacctgtg aacctaagag
      121 tgagaactac acctacattt ggtggctaaa tggtcagagc ctcccggtca gtcccagggt
      181 aaagcgaccc attgaaaaca ggatcctcat tctacccagt gtcacgagaa atgaaacagg
      241 accctatgaa tgtgaaatac gggaccgaga tggtggcatg cgcagtgacc cagtcaccct
      301 gaatgtcctc tgtgagtatc ttttgttcct ctgtgggcca ggacaccagc ttaaatctaa
      361 atgaccacaa gccagtcctc t
//
LOCUS       HUMPSBGA13    540 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 6.
ACCESSION   M32634
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     13 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 540; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +    40  +   294     pregnancy-specific beta-1-glycoprotein 5
                                 (PSG5) precursor, exon 6
    matp    +    40  +   294     pregnancy-specific beta-1-glycoprotein 5
    IVS     <     1       39     PSG5 intron E
    IVS         295  >   540     PSG5 intron F
BASE COUNT      132 a    142 c    108 g    158 t
ORIGIN      Chromosome 19q13.2-13.3, about 300 bp after segment 12.
        1 ggatgacatc acctgtggct tcattcttct ttgctccaga tggtccagac ctccccagca
       61 tttacccttc attcacctat taccgttcag gagaaaacct ctacttgtcc tgcttcgcgg
      121 aatctaaccc accggcagag tatttttgga caattaatgg gaagtttcag caatcaggac
      181 aaaagctctc tatcccccaa attactacaa agcatagagg gctctatact tgctctgttc
      241 gtaactcagc tactggcaag gaaagctcca aatccatgac agtcgaagtc tctggtaagt
      301 ggatcccagc atccttggca atagggtttt aggtggagtc tacctggctt tcagagaaga
      361 gttaggaaaa catttttatt cccagcctgt gtcccatggg cacaagcaaa tcccaaattc
      421 tcctcctgaa cccttccaat ttgtctctac aaactctctt ctccttgttt ttctgttttc
      481 tcatggctga ccttgtgtct ggcctaagaa agttagggag ggggctttat cagccctgag
//
LOCUS       HUMPSBGA14    320 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 7.
ACCESSION   M32635
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     14 of 14
SOURCE      Human placenta DNA, clones cosPSG4/5-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 320; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +   202      245     pregnancy-specific beta-1-glycoprotein 5
                                 (PSG5) precursor, exon 7
    matp    +   202      245     pregnancy-specific beta-1-glycoprotein 5
    IVS     <     1      201     PSG5 intron F
    IVS         286  >   320     PSG5 intron G
BASE COUNT       94 a     78 c     54 g     94 t
ORIGIN      Chromosome 19q13.2-13.3, about 4300 bp after segment 13.
        1 ggatcaccat cctagaaaca tggtgagagg gaaaaaaaaa actgcaagaa tataatcatc
       61 tcccatcaat ttcccaagag gaataatgta gtacttgaag aaacaattat agagtacctc
      121 atgctacatg cctcttcctg aggctccccc atgtaaaata acatcacctt cattccttct
      181 ttacttttct ttccatgaca gctccttcag gaataggacg tcttcctctc cttaatccaa
      241 tatagcagcc gtgaagtcat ttctgtattt caggaagact ggcaggtatg atggcctttt
      301 ctcttatcct ggtcctgcag
//
LOCUS       HUMPSBGB1     237 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 4.
ACCESSION   M32617
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     1 of 5
SOURCE      Human leukocyte DNA, clone lambda-PSG6/7-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 237; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    <     1  +    14     pregnancy-specific beta-1-glycoprotein (PSG7),
                                 exon 4 (AA at 2)
    IVS          15  >   237     PSG7 intron D
BASE COUNT       76 a     52 c     50 g     59 t
ORIGIN      Chromosome 19q13.2-13.3.
        1 gatcatagtc cctggtaagt ggatcctgga gcattggcaa tatgttttcc agggaagtct
       61 atctggctat cagggaagag ccacctgccc tctgcagagg agagggaaaa tcaaaactca
      121 ggaaagggaa tatatttctg ctccaaaacc accaggtttt cctgtacctc actctttcta
      181 gatcattctt cagactatgc gcactaacaa tgaacaatct gaaaagaaat taagaaa
//
LOCUS       HUMPSBGB2     435 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 5.
ACCESSION   M32618
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     2 of 5
SOURCE      Human leukocyte DNA, clone lambda-PSG6/7-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 435; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +    25  +   303     pregnancy-specific beta-1-glycoprotein 7
                                 (PSG7), exon 5
    IVS     <     1       24     PSG7 intron D
    IVS         304  >   435     PSG7 intron E
BASE COUNT      120 a    123 c     87 g    105 t
ORIGIN      Chromosome 19q13.2-13.3; about 1800 bp after segment 1.
        1 atcttctctc tgttattcac acagtgaagc tgcctaagcc ctacatcacc atcaataact
       61 taaaccccag ggagaataag gatgtctcaa ccttcacctg tgaacctaag agtgagaact
      121 acacctacat ttggtggcta aatggtcaga gcctcccggt cagtcccagg gtaaagcgac
      181 gcattgaaaa caggatcctc attctaccca gtgtcacgag aaatgaaaca ggaccctatc
      241 aatgtgaaat acgggaccga tatggtggca tccgcagtga cccagtcacc ctgaatgtcc
      301 tctgtgagta tccttgagta tcctttcttt cctctgtggg ccaggacaca ccagcttaac
      361 tccaaacgac cagaggccag tcctctcagt ctctctccgg tccaagtata gacatattta
      421 cttctggaca tccga
//
LOCUS       HUMPSBGB3     274 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 6.
ACCESSION   M32619
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     3 of 5
SOURCE      Human leukocyte DNA, clone lambda-PSG6/7-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 274; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept    +     8  /   262     pregnancy-specific beta-1-glycoprotein 7
                                 (PSG7), exon 6
    IVS     <     1        7     PSG7 intron E
    IVS         263  >   274     PSG7 intron F
BASE COUNT       72 a     78 c     52 g     72 t
ORIGIN      Chromosome 19q13.2-13.3; about 300 bp after segment 2.
        1 cctccagatg gtccagacct ccccagaatt tacccttcat tcacctatta ccattcagga
       61 gaaaacctct acttgtcctg ttttgcggac tctaacccac cggcacagta ttcttggaca
      121 attaatggga agtttcagct atcaggacaa aagctttcta tcccccagat tactacaaag
      181 catagcgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga aagctccaaa
      241 tccgtgacag tcagagtctc tggtaagtgg atcc
//
LOCUS       HUMPSBGB4     206 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 6 (PSG6) gene, exon 1.
ACCESSION   M32620
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     4 of 5
SOURCE      Human leukocyte DNA, clone lambda-PSG6/7-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 206; for [1])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1],[2] kindly
            submitted by J.Thompson, 06-MAR-1990.
FEATURES       from  to/span     description
    pept         50  +   113     pregnancy-specific beta-1-glycoprotein 6
                                 (PSG6) precursor, exon 1
    sigp         50  +   113     pregnancy-specific beta-1-glycoprotein 6
                                 signal peptide
    IVS         114  >   206     PSG6 intron A
BASE COUNT       59 a     50 c     66 g     31 t
ORIGIN      Chromosome 19q13.2-13.3; about 7800 bp after segment 3.
        1 ggatcctagg ctcatctcca caggggagaa cacacagaca gcagagacca tgggacccct
       61 ctcagcccct ccctgcactc agcacatcac ctggaagggg ctcctgctca caggtgagga
      121 gaggacttcc tgggagagga caggaggagg aagcagagtg actggatggg gtatctggag
      181 aggatggggt tctaaaaaat aaaaga
//
LOCUS       HUMPSBGB5    1026 bp ds-DNA             PRI       14-AUG-1990
DEFINITION  Human pregnancy-specific beta-1-glycoprotein 6 (PSG6) gene, exon 2.
ACCESSION   M32621 M22435 D51537
KEYWORDS    pregnancy-specific beta-1-glycoprotein.
SEGMENT     5 of 5
SOURCE      Human leukocyte DNA, clones lambda-PSG6/7-1 and hsCGM3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1026)
  AUTHORS   Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H.,
            Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and
            Zimmermann,W.
  TITLE     Analysis of the size of the carcinoembryonic antigen (CEA) gene
            family: Isolation and sequencing of N-terminal domain exons
  JOURNAL   Biochem. Biophys. Res. Commun. 158, 996-1004 (1989)
  STANDARD  full staff_review
REFERENCE   2  (sites)
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  TITLE     The human pregnancy-specific glycoprotein genes are tightly linked
            on the long arm of chromosome 19 and are coordinately expressed
  JOURNAL   Biochem. Biophys. Res. Commun. 167, 848-859 (1990)
  STANDARD  full staff_review
REFERENCE   3  (bases 439 to 846; for [2])
  AUTHORS   Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C.,
            Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D.,
            Ammaturo,D. and Hardman,N.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.A.Thompson, 07-FEB-1989; for [2],[3], 06-MAR-1990.
            
            This gene was named human carcinoembryonic antigen (hsCGM3) in [1],
            but [2] says it is actually one of the pregnancy-specific
            beta-1-glycoproteins.
FEATURES       from  to/span     description
    pept    +   484  /   846     pregnancy-specific beta-1-glycoprotein 6
                                 (PSG6) precursor, exon 2
    sigp    +   484      521     pregnancy-specific beta-1-glycoprotein 6
                                 signal peptide
    matp        522  /   846     pregnancy-specific beta-1-glycoprotein 6
    IVS     <     1      483     PSG6 intron A
    IVS         847  >  1026     PSG6 intron B
BASE COUNT      291 a    235 c    264 g    236 t
ORIGIN      Chromosome 19q13.2-13.3; about 1200 bp after segment 4.
        1 gagctcacac tctcatgggg aggaagacag acatgcaaag agatatagaa tgtgaggtca
       61 ggtgttgaca agaaccctag agggagcaga gcagggaaag gtcagaaagg gaagacccag
      121 ggtctctgaa gcaggcatca ggaaagaagt ctaaggatgc cctgatgtga gcaggacctg
      181 agggcagtgt ggagggggcc gtgcggaccc ctggggaaga ggattgcaaa cagaaaaatg
      241 ccaaggtcag gagtgttgaa ggaatggggg tcatgctgct gaccttgacc tagtaggaca
      301 gtaggacaca cacacataca cacacacaaa cacacatgcc cttttgtgtg tgtgtgtttg
      361 tatgtgtgtg tgtgcatatc ttcaaggctg atgattgaag agaccttctc aggacacagg
      421 gccccatctt ttcaccccaa tacataggtc caaatattaa ctgatgctgt ctctacctcc
      481 tagcatcact tttaaacttc tggaacctgc ccaccactgc ccaagtaata attgaagcca
      541 agccacccaa agtttccgag gggaaggatg ttcttctact tgtccacaat ttgccccaga
      601 atcttactgg ctacatctgg tacaaagggc aaatgacgga cctctaccat tacattacat
      661 catatgtagt acacggtcaa attatatatg ggcctgccta cagtggacga gaaacagtat
      721 attccaatgc atccctgctg atccagaatg tcacacagga ggatgcagga tcctacacct
      781 tacacatcat aaagcgaggc gatgggactg gaggagtaac tggatatttc actgtcacct
      841 tatactgtga gtgattccgc atgatccctg ggtgttgggg ggcaggggtc atttctactt
      901 cacacacaca gaattgtcag gcctggactc tgcctgtgtc actctctgca ttatgtccca
      961 tgctggggtt tgggcattta gtgcaggaca cacacagagg agacacattt caacagatca
     1021 gaattc
//
LOCUS       MSQRTR1A      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 16bs.
ACCESSION   M35448
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 16bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       76 a     78 c     64 g    131 t
ORIGIN      
        1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
       61 agcttgatga ggttctacta aaagctaatc gatctcttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
      181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttactca atcttcactc tttaagctt
//
LOCUS       MSQRTR1B      348 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 916.
ACCESSION   M35449
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 916.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      348     T1 retroposon
BASE COUNT       79 a     75 c     63 g    131 t
ORIGIN      
        1 gtgttttgtc catccgtgac cttggtatat actcgacagt cgtcttaact ttaaactgca
       61 gcttgatgag gttctactaa aagctaatcg aactcttggg tttattttac gttttacctc
      121 tatttttaga gattaaagct tcttaagaaa cctttattat gctctggtaa ggcctcttct
      181 tgaatatgct agcatcatct ggaatcctcc tactattgat ggctgttcga gaattgaaag
      241 cattcagcgc ctttttacca gggttgcttt tcgtcgtttg ttcggtgctg cctcactacc
      301 tccctatgaa acgagattgc agttattcaa tcttcactct ttaagctt
//
LOCUS       MSQRTR1C      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 52a.
ACCESSION   M35450
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 52a.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       78 a     76 c     64 g    131 t
ORIGIN      
        1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
       61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcccattc
      181 ttgaatatgc tagtatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gccctactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS       MSQRTR1D      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone L34.
ACCESSION   M35451
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone L34.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       77 a     77 c     64 g    131 t
ORIGIN      
        1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
       61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
      181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gccctactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS       MSQRTR1E      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 914a.
ACCESSION   M35452
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 914a.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       77 a     76 c     65 g    131 t
ORIGIN      
        1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
       61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctgtggta aggcctcttc
      181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS       MSQRTR1F      343 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 13bs.
ACCESSION   M35453
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 13bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 343)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      343     T1 retroposon
BASE COUNT       78 a     73 c     60 g    132 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta taatttacag tcgtcttaac tttaaactgc
       61 agcttgatga gattctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
      181 ttgaatatgc tagtatcatc tggaattgtc ctactattga tggctgttcg agaattgaat
      241 gccttcagcg cctctttacc aggattgctt ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcagtta ttcaatcttc actctttaag ctt
//
LOCUS       MSQRTR1G      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 12bs.
ACCESSION   M35454
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 12bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       78 a     79 c     62 g    130 t
ORIGIN      
        1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
       61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcttaagaa acccttattg tgctctggta aggcctcttc
      181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctttttacc agggttgctt ttcatcgttt tttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttactca atcttcactc tttaagctt
//
LOCUS       MSQRTR1H      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 9bs.
ACCESSION   M35456
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 9bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       77 a     77 c     64 g    131 t
ORIGIN      
        1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
       61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
      181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS       MSQRTR1I      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 6bs.
ACCESSION   M35457
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 6bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       78 a     76 c     64 g    131 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
       61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
      181 ttgaatatgc tagtatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS       MSQRTR1J      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 4bs.
ACCESSION   M35461
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 4bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       76 a     77 c     64 g    132 t
ORIGIN      
        1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc
       61 agcttgatga ggttctacta aaagctaatc gatctcttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc
      181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt
//
LOCUS       MSQRTR1K      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 1bs.
ACCESSION   M35462
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 1bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       78 a     80 c     64 g    127 t
ORIGIN      
        1 gtgttgtgtc catccgtgac cttggtataa tactcgacag tcgtcttaac tttaaactgc
       61 agcttgatga ggtgctacta aaagctaatc gaacccttgg gtttatttta cgttttacct
      121 ctatttttag agatcaaagc ttcctaagca acctttattg tgctttggta aggcctattc
      181 ttgaatatgc tagtatcctc tggaatcctc ctactattga ttgctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgctt ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1L      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 5bs.
ACCESSION   M35463
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 5bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       81 a     80 c     64 g    124 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
       61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
      121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1M      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 3bs.
ACCESSION   M35464
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 3bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       80 a     80 c     65 g    124 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
       61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
      121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc aagtatcatc tggagtcctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1N      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 11bs.
ACCESSION   M35465
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 11bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       82 a     80 c     64 g    123 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
       61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacca
      121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1O      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 14bs.
ACCESSION   M35466
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 14bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       81 a     82 c     64 g    122 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
       61 agcttgatga ggtgctacta aaagcaaacc gaacccttgg gtttatctta cgttttacct
      121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgtct gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1P      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 31b.
ACCESSION   M35467
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 31b.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       80 a     80 c     65 g    124 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
       61 agcttgatga ggtgctgcta aaagcaaatc gaacccttgg gtttatctta cgttttacct
      121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1Q      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 15bs.
ACCESSION   M35468
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 15bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       81 a     80 c     64 g    124 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
       61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
      121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1R      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 17bs.
ACCESSION   M35469
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 17bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       82 a     79 c     64 g    124 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgacttaac tttaaactgc
       61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
      121 ctatttttag agattaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1S      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 18bs.
ACCESSION   M35470
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 18bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       82 a     81 c     64 g    122 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
       61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
      121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc aagtatcatc tggaaccctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgatag cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1T      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 19bs.
ACCESSION   M35471
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 19bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       81 a     82 c     64 g    122 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
       61 agcttgatga ggtgctacta aaagcaaacc gaacccttgg gtttatctta cgttttacct
      121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgtct gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       MSQRTR1U      349 bp ds-DNA             INV       14-AUG-1990
DEFINITION  A.gambiae T1 retroposon, clone 28bs.
ACCESSION   M35472
KEYWORDS    retroposon; retrotransposon.
SOURCE      A.gambiae (colony G3) DNA, clone 28bs.
  ORGANISM  Anopheles gambiae
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; 
            Culicoidea; Culicidae; Anophelelinae.
REFERENCE   1  (bases 1 to 349)
  AUTHORS   Besansky,N.J.
  TITLE     Evolution of the T1 retroposon family in the Anopheles gambiae
            complex
  JOURNAL   Mol. Biol. Evol. 7, 229-246 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    trns          1      349     T1 retroposon
BASE COUNT       79 a     80 c     64 g    126 t
ORIGIN      
        1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc
       61 agcttgatgt ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct
      121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc
      181 ttgaatatgc tagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa
      241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac
      301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt
//
LOCUS       SHRMTRRDA    1136 bp ss-RNA             RNA       14-AUG-1990
DEFINITION  Brine shrimp mitochondrion 16S ribosomal RNA precursor.
ACCESSION   X12965 M21833
KEYWORDS    16S ribosomal RNA; ribosomal RNA.
SOURCE      Brine shrimp mitochondrion, cDNA to RNA, clones pArLSU-[1,2].
  ORGANISM  Mitochondrion Artemia salina
            Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; 
            Branchiopoda; Sarsostraca; Anostraca; Artimiidae; Artemia salina.
REFERENCE   1  (sites)
  AUTHORS   Sastre,L.
  TITLE     ;
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   2  (bases 1 to 1136; enum. 1 to 1136)
  AUTHORS   Sastre,L.
  TITLE     ;
  JOURNAL   Unpublished (1988) see COMMENT for author address
  STANDARD  simple automatic
REFERENCE   3  (bases 518 to 1129)
  AUTHORS   Palmero,I., Renart,J. and Sastre,L.
  TITLE     Isolation of cDNA clones coding for mitochondrial 16S ribosomal RNA
            from the crustacean Artemia
  JOURNAL   Gene 68, 239-248 (1988)
  STANDARD  full staff_review
REFERENCE   4  (bases 1 to 1136; revises [1],[2],[3])
  AUTHORS   Sastre,L.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     Author address: Sastre L., Instituto de Investigaciones Biomedicas
            del CSIC, Av Arzobispo Morcillo 4, 28029 Madrid, Spain.
            
            Submitted (22-SEP-1988) on tape to the EMBL data library. [1]
            Submitted (17-NOV-1988) to the EMBL data library. [2]
            
            Draft entry and computer-readable sequence for [3] kindly provided
            by L.Sastre, 01-MAR-1989; for [4], 12-MAY-1989.
FEATURES       from  to/span     description
    rRNA          1     1136     16S ribosomal RNA
    revision    548      550     cct in [2]; ct in [1]
    revision    555      557     act in [2]; at in [1]
    revision    778      780     gtc in [2]; gc in [1]
    revision    906      908     tta in [2]; ta in [1]
    revision    981      982     tc in [2]; ttc in [1]
    revision   1003     1007     cagcc in [2]; cc in [1]
    revision   1111     1112     ta in [4]; tca in [2]
BASE COUNT      382 a    196 c    218 g    340 t
ORIGIN      
        1 atttaaaaca atattatatt agtaaaagaa tcgaatttat tcaaggtcta agtactgaaa
       61 aggaaagtct actaattaca aagaagttat tgtttgtacc ttttgtatca gggtttatta
      121 aatttcaatc tttattgctg aatcccgaaa caaggtgatc tatattggtt aaagatgatt
      181 gtttcataaa tctcttcaag accatatcgt tgcgaaaata tttcgtacct ttatatctgg
      241 ttggcagaat actcctttag ggtactccgc aaggaagatt ttctagggga cgagctctag
      301 aaaaaattat attataatta tcaagaaaaa catgactagg cttaaaagtt gcctcggttt
      361 aataaccatt gttagtaaaa tgataatttc aaaataatta attaagactg gccgatgcta
      421 attacataaa tttagcaatc acaatgatag aattagtaga actataaatg cacttaacac
      481 aatattcctc acaaaatccc aaaggaattc ggcaaaactc atctccgcct gtttaacaaa
      541 gacatcgcct cttaactttg ggaggtcgag cctgcccact gatgaattaa agggccgtgg
      601 tatactgacc atgcgaaggt agcataatca ttagcctttt gatttgaggc tggaatgaat
      661 ggtttgacga gagatggtct gtctcttcga ttaaattgaa gttaatcttt aagtgaaaaa
      721 gcttaaatgt acttggaggg cgataagacc ctatagatct ttacatttaa ttcttttgtc
      781 ttgcggtagg taattagaca gagtaaaaca atgttcggtt ggggcgacgg taagaacaga
      841 ataaacactt acaacataaa cacatcaata aatgaccatt gatccttaga tgaataaaga
      901 ccaagttacc ttagggataa cagcgtaatt ctttttgaga gttcaaatcg acaaaagagt
      961 ttgcgagcct cgatgttggt tcagggaccc tactcggtgc agcagcctag agaggcagtc
     1021 tgttcgactg ttaaacccct acgtgatctg agttcagacc gacgtaagcc aggttagttt
     1081 ctacccccaa ggcaaatcca ctcatgatag tacgaaagga cctcttgaga ttttat
//
LOCUS       RATI145TR    9852 bp ss-mRNA            ROD       14-AUG-1990
DEFINITION  Rat inositol-1,4,5-triphosphate receptor mRNA, complete cds.
ACCESSION   J05510
KEYWORDS    inositol-1,4,5-triphosphate receptor.
SOURCE      Rat adult brain, cDNA to mRNA, clones pI16, pI17, pI19, and pI25.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 9852)
  AUTHORS   Mignery,G.A., Newton,C.L., Archer,B.T.III. and Suedhof,T.C.
  TITLE     Structure and expression of the rat inositol-1,4,5-triphosphate
            receptor
  JOURNAL   J. Biol. Chem. 265, 12679-12685 (1900)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.C.Suedhof, 19-APR-1990.
FEATURES       from  to/span     description
    pept        330     8579     inositol-1,4,5-triphosphate receptor (alt.,
                                 clone pI16)
    pept        330     1280     inositol-1,4,5-triphosphate receptor, exon 1
                                 (alt., clones pI17, pI19, and pI25)
               1326     8579     inositol-1,4,5-triphosphate receptor, exon 2
                                 (alt., clones pI17, pI19, and pI25)
    IVS        1281     1325     inositol-1,4,5-triphosphate receptor intron A
                                 (no splice consensus)
    signal     9033     9083     poly-A signal
    signal     9832     9837     poly-A signal
    variant     752      752     a in pI16; g in pI25
    variant     802      802     g in pI16; a in pI17
    variant    1923     1923     g in pI16; a in pI17
    variant    2181     2181     g in pI16; a in pI17
    variant    3340     3340     g in pI16; a in pI17
BASE COUNT     2688 a   2324 c   2498 g   2342 t
ORIGIN      
        1 tggggtcact tccttcacac aggactttgg cggaaatctc tacacccgtg tctcgtcatc
       61 cccttcgaga gcattaacta tttaaaacca ggagagaaag cgcacgccga gaggaggtgt
      121 gggtgttccg ctcccatcct aacggaacga gctccctctt cgcggacatg ggattgccca
      181 gcggctgcta acccctgtcc tggtcctgat ccctcaaacc ggcgtggctc cccggtcacc
      241 aaggagctga ctacaaagga tttgcatcct tggctgggag tccattggct acagagtgcc
      301 tgacctgggt caggcttttc aacaaggaca tgtctgacaa aatgtctagt ttcctacata
      361 tcggagacat ttgttctctg tatgcagagg gatctacgaa tggatttatc agcaccttgg
      421 gcttggttga tgaccgttgc gttgtacagc cagaagctgg ggaccttaac aatccaccca
      481 agaaattcag agactgcctc tttaagctat gtcctatgaa tcgatattct gcacagaagc
      541 agttctggaa agctgctaag cccggggcca acagtacaac agatgcggtg ctgctcaaca
      601 aattgcatca tgctgcagac ttggaaaaga agcagaatga gacagaaaac aggaaattgc
      661 tggggacggt catccagtat ggcaacgtga tccagctcct acatttgaaa agcaataaat
      721 acttaactgt gaataagagg cttcctgccc tactggagaa gaacgccatg agagtgacgt
      781 tggatgaggc tggaaatgaa gggtcttggt tttacattca gccgttctac aagctccgct
      841 ccatcggaga cagtgtggtc ataggtgaca aggtagtttt gaaccctgtc aatgctgggc
      901 agccgctaca tgccagcagc catcagctgg tggataaccc gggctgcaat gaggtcaact
      961 ccgtcaactg taacacaagc tggaaaatag tgcttttcat gaaatggagt gataacaaag
     1021 atgacattct aaaaggaggt gatgtggtaa gactcttcca tgctgagcaa gagaagtttc
     1081 tcacgtgcga tgagcacagg aagaagcagc acgtcttcct gaggacaacc ggcagacagt
     1141 cagccacgtc ggccaccagc tccaaagccc tgtgggaagt ggaggtagtc cagcatgacc
     1201 catgtcgggg tggagctggg tactggaata gcctcttccg gttcaagcac ctggctacag
     1261 ggcattactt ggcagcagag gtagaccctg actttgagga agaatgcctg gagtttcagc
     1321 cctcagtgga ccctgatcag gacgcatctc ggagcaggtt gagaaatgcc caagaaaaaa
     1381 tggtgtactc tctggtctct gtgcctgagg gcaacgacat ctcctccatc tttgagctag
     1441 accccacaac tctacgtgga ggtgacagcc ttgtcccaag gaactcctat gtccggctca
     1501 gacacctgtg tacaaacacc tgggttcaca gcacaaacat ccccattgac aaggaagagg
     1561 aaaaacctgt catgctgaag attggtacct ctcccctgaa ggaggacaag gaagcttttg
     1621 ccatagtccc cgtttctcct gctgaggttc gggacctgga ctttgccaat gatgccagca
     1681 aggtgctggg ctccatcgct gggaagttgg aaaagggcac catcacccag aatgagagaa
     1741 ggtctgtcac caagctgctg gaagacttgg tttactttgt cacgggtgga acgaactctg
     1801 gccaagacgt gcttgaagta gtcttctcta agcccaaccg agagcggcag aagctgatga
     1861 gggaacagaa tattctcaag cagatcttca agcttttgca ggcccccttc acggactgcg
     1921 gggatggccc aatgcttcgg ctggaggagc tgggggacca gcggcacgct cctttcagac
     1981 acatttgccg actctgctac agggtcctga gacactcaca gcaagactac aggaagaacc
     2041 aggagtacat agccaagcag tttggcttca tgcagaagca gattggctat gatgtgctgg
     2101 ccgaagacac catcactgcc ctgctccaca acaaccgaaa gctcctggaa aagcacatca
     2161 ccgcagcaga gattgacacg tttgtcagcc tggtgcgaaa gaacagggag cccaggttct
     2221 tggattacct ctctgacctc tgtgtatcca tgaataagtc aatccctgtg acccaggagc
     2281 tcatttgtaa agctgtgctg aatcccacca atgctgacat cctgattgag accaagttgg
     2341 ttctttctcg ttttgagttt gaaggcgttt ccaccggaga gaatgctctg gaagctgggg
     2401 aagatgagga agaggtgtgg ctgttctgga gggacagcaa caaagagatc cgtagcaaga
     2461 gtgttaggga attggcacaa gatgctaaag agggacagaa ggaagacagg gatgtcctca
     2521 gctactacag gtatcagctg aacctctttg caaggatgtg tctggaccgc cagtacctgg
     2581 ccatcaatga aatatccggg cagctggatg ttgacctcat cctccgctgc atgtccgatg
     2641 agaacctccc ctatgacctc agagcatcct tttgccgcct catgcttcac atgcatgtgg
     2701 accgagatcc ccaggagcag gtgacccctg tgaaatacgc ccgtctgtgg tcagaaattc
     2761 cctccgagat cgccattgac gactatgaca gcagtggagc atcgaaggat gaaatcaagg
     2821 agaggtttgc tcagacgatg gagtttgtgg aggagtacct gagagatgtg gtttgtcaaa
     2881 gattcccctt ttctgataag gagaaaaata agctcacatt tgaggttgtg aacttagcca
     2941 ggaacctcat atactttggc ttctacaact tttctgacct tctccgatta accaagatcc
     3001 tcttggcaat cttagactgt gttcatgtga caaccatctt ccccattagc aagatgacga
     3061 aaggcgaaga gaataaaggc agcaacgtga tgagatctat ccatggagtt ggggagctca
     3121 tgacccaggt ggtgcttcgg ggaggaggct ttttgcccat gactcctatg gctgctgccc
     3181 ctgaaggaaa tgtgaagcag gcagagccag agaaagagga tatcatggtc atggacacca
     3241 agttaaagat cattgaaata ctccagttta ttttgaatgt gagattggat tataggatct
     3301 cctgcctcct gtgtatattt aagcgggagt ttgatgaaag caattcccag tcatcagaaa
     3361 catcctccgg aaacagcagc caagaagggc caagtaatgt accaggtgct cttgactttg
     3421 aacacattga agagcaagca gaaggcatct ttgggggaag tgaggagaac acacctctgg
     3481 acttggatga tcatggtggc agaaccttcc tcagggtcct gctccacttg acaatgcatg
     3541 attacccacc cctggtgtct ggggccctgc agctcctctt ccggcatttc agccagaggc
     3601 aggaggtcct tcaggccttc aaacaggttc aactgctggt tactagccaa gatgtggaca
     3661 actacaaaca gatcaagcaa gacttggacc agctaagatc catcgtggag aagtctgagc
     3721 tctgggtata caaaggccag ggtcccgatg agcctatgga cggagcctct ggtgaaaacg
     3781 aacataagaa aactgaggag gggaccagta agccactgaa gcacgaaagc accagcagct
     3841 acaactaccg agtggtgaaa gagattctga ttcgacttag caagctctgt gttcaagaga
     3901 gcgcgtcggt gaggaagagc aggaagcagc agcagcgact gctgaggaac atgggtgccc
     3961 atgctgtggt gctggagctg ctgcagatcc cctatgagaa ggccgaagat accaagatgc
     4021 aggagatcat gaggctggct cacgaatttc tgcagaattt ctgcgcaggc aaccagcaga
     4081 accaagcttt gctacataaa cacataaacc tgtttctcaa cccagggatc ctggaagcag
     4141 tgaccatgca gcacatcttc atgaacaact tccagctgtg cagtgagatc aacgagagag
     4201 tggtccagca tttcgttcac tgcatagaga cccacggtcg aaatgtccag tatatcaagt
     4261 ttctccagac tattgtcaag gccgaaggga aattcattaa aaagtgccaa gacatggtca
     4321 tggctgagct tgtcaattct ggagaggacg tcctcgtgtt ctacaatgac agagcctctt
     4381 tccagactct gatccagatg atgcggtccg agcgtgaccg gatggatgag aacagccctc
     4441 tcttcatgta ccacatccat ctggtggagc tcttggctgt gtgcacagag ggcaagaatg
     4501 tgtacacaga gatcaagtgc aactccctgc tcccgctcga tgacatcgtt cgtgtggtca
     4561 ctcatgaaga ctgtatcccc gaggttaaga ttgcttacat caacttcctg aatcactgct
     4621 atgtggacac tgaggtggag atgaaggaga tttacacaag taatcacatg tggaagttgt
     4681 ttgagaattt cctcgtggac atctgcaggg cctgtaacaa cacaagtgac aggaagcatg
     4741 cagactcggt tctggagaag tacgtcactg aaatcgtgat gagcattgtc accaccttct
     4801 tcagctctcc cttctcagac cagagtacca ctctgcagac tcgccagcct gtctttgtgc
     4861 agctcctgca aggcgtgttc cgagtttacc actgcaactg gttgatgccg agccagaaag
     4921 cctcggtgga gagctgtatc cgcgtgctct ctgacgtagc caagagccgg gccatagcca
     4981 ttcctgttga cctagacagc caagtcaaca acctcttcct gaagtcccac aacattgtgc
     5041 agaaaacagc catgaactgg cggttatcag cccgcaacgc tgctcgtaga gactctgtcc
     5101 tggcagcttc cagagactac cgaaatatta ttgagaggtt acaggacatc gtgtctgccc
     5161 tggaggaccg actcaggccc ctggtccagg ccgagctgtc tgtgctcgtg gatgttctac
     5221 acagaccgga actgcttttc cctgagaaca cggatgccag gaggaaatgc gaaagtggag
     5281 gtttcatctg caagctcata aaacacacca agcagctgct agaggagaat gaagagaaac
     5341 tgtgcattaa agtcttacag accctcaggg aaatgatgac caaagataga ggctatggag
     5401 aaaagcaaat ttccattgat gaattggaaa atgccgagct gccacagcca ccggaagctg
     5461 agaactccac agaggagctt gaaccaagtc cacccctgag gcaactggaa gaccataaaa
     5521 ggggtgaggc actccggcaa attttggtca accgttacta tggaaacatc agaccttcag
     5581 gaagaagaga gagccttacc agctttggca atggtccact atcaccagga ggacccagca
     5641 agcctggggg aggaggggga ggtcctgggt ccggctccac aagccggggt gagatgagcc
     5701 tggctgaggt tcaatgtcat ctcgacaagg aaggcgcctc caatctggtc atcgacctca
     5761 taatgaatgc atccagcgac cgagtattcc acgaaagcat tctcctggcc atcgcacttc
     5821 tggaaggagg caacaccact atccagcact catttttctg ccggctgacg gaagataaga
     5881 aatcagagaa gttcttcaag gtcttttacg atcgaatgaa ggtggcccag caggaaatca
     5941 aggcgacagt gaccgtgaac accagcgatt tgggaaacaa aaagaaagat gatgaagtgg
     6001 accgggatgc cccatcccga aagaaagcca aagagcccac aacacagata acagaagagg
     6061 tccgggatca gctcttggaa gcatctgctg ccaccagaaa agcctttacc accttccgga
     6121 gggaggccga ccctgatgac cattaccaat ctggggaggg cacccaggct acaacggaca
     6181 aagccaagga tgacctagag atgagtgcag tcatcaccat catgcagcct atcctgcgct
     6241 tcctgcagct cctgtgtgaa aaccacaacc gcgatctgca gaatttcctt cgttgccaaa
     6301 ataataagac caactacaac ttggtgtgtg agacactgca gtttctggac tgtatctgtg
     6361 ggagcacaac tggaggcctt ggtcttctcg gactgtacat aaatgaaaag aatgtagctc
     6421 ttatcaacca aaccctggag agtctgactg agtactgtca agggccttgc catgaaaacc
     6481 agaactgcat cgccacccac gagtccaatg gcatcgatat catcacagcc ctcatcctta
     6541 acgatatcaa ccctctgggg aagaaacgga tggacctggt gttagaactg aagaacaatg
     6601 cttcaaagct actcctggcc atcatggaaa gcagacacga tagtgaaaat gcagagagga
     6661 tcctgtacaa catgaggccc aaggagctgg tggaagtgat caagaaggcc tacatgcaag
     6721 gcgaagtgga gtttgaggat ggggagaacg gtgaggatgg agcagcctcc cccaggaacg
     6781 tgggccataa catctacatc ctcgctcacc agttggctcg gcataacaaa gaacttcaaa
     6841 ccatgctgaa acctggaggc caggtggatg gcgatgaagc tctggagttc tatgcgaagc
     6901 acacagcaca aattgagatt gtcagactgg accgaactat ggaacagatt gtcttccctg
     6961 tgcccagcat ctgtgaattt ctgactaagg aatcgaaact tcgaatatat tacaccacag
     7021 agagggatga gcaaggcagc aagatcaatg acttcttcct gcgctccgag gacctcttta
     7081 atgagatgaa ctggcagaag aaacttcgag cccagcctgt cttgtactgg tgcgcccgaa
     7141 acatgtcttt ctggagcagc atctccttca acctggccgt cctgatgaac ctgctggtgg
     7201 cgtttttcta cccatttaaa ggagtgagag gaggaacact agagccgcac tggtccggcc
     7261 tcctgtggac agccatgctc atctctctgg caattgtcat tgcgctgccc aagccccatg
     7321 gcatccgggc cttaattgct tctacaatcc tacgactgat attttcagtt ggattacagc
     7381 ccacactgtt tctgctggga gctttcaatg tctgcaataa aatcatcttc ctgatgagct
     7441 ttgtgggcaa ctgtgggacc ttcaccagag gctaccgagc catggttctg gacgttgagt
     7501 tcctctatca tttgctgtat ctactgatct gtgccatggg cctcttcgta catgagtttt
     7561 tctatagttt gctgcttttt gatttagtgt acagagagga gactttgctt aatgtcatta
     7621 aaagtgtcac ccgcaatgga cggcccatca tcttgacagc ggccctggct ctgattctgg
     7681 tttacctgtt ctcaattgtg ggctacctgt tcttcaagga tgactttatc ttggaagtag
     7741 acaggttgcc caatgagaca gctggtccag aaactggtga gagtttggcc aacgatttcc
     7801 tgtactctga tgtgtgccgg gtagagacgg gggagaactg cacctctccg gcacccaaag
     7861 aagagctgct ccctgtggaa gaaacggaac aggataagga gcacacgtgt gagacactgc
     7921 tcatgtgcat cgtcaccgtt ctgagtcacg ggctgcggag tgggggcgga gtaggagatg
     7981 tgctcaggaa gccatccaaa gaggagcctc tgtttgctgc aagggtgatc tatgacctcc
     8041 tcttcttctt catggtcatc atcattgtcc tgaacctcat ttttggggtc atcatcgaca
     8101 cctttgctga cctgaggagt gagaagcaga agaaggaaga gatcttaaaa accacgtgct
     8161 tcatttgtgg cttggaaagg gacaagtttg acaataagac cgtcaccttt gaagagcaca
     8221 ttaaggaaga acacaacatg tggcactatc tgtgcttcat cgtgctggtg aaagtgaagg
     8281 actccaccga gtacaccggg cccgagagct acgtggcaga gatgatcagg gaaagaaacc
     8341 ttgattggtt ccccaggatg agagccatgt ccctggtcag cagtgactct gaaggggaac
     8401 agaacgagct gaggaacctg caggagaagc tggagtccac catgaagctg gtcaccaatc
     8461 tttctggcca gctgtcagaa ctaaaggacc agatgacaga acaaaggaag cagaaacaaa
     8521 gaattggcct tctaggacat cctcctcaca tgaatgtcaa cccacagcag ccagcctaag
     8581 caaatgagac ggagggactg tactcaaccc tttttaaatt attgtcagcg tgggtctggc
     8641 tcttggttct gatctgccct ctaaggttac atgtgcgctt agtacatttg taaatactcg
     8701 gttttgtatt gtatgtatat gattgcaatt ctcagaggtt tggactttcg tattgtaatt
     8761 agctctgttg gcatggtgac ttgtcactcc tgccaaaaat attaaaaatg ccttttttgg
     8821 aaggactaca gaaagtacct gatttgcact tgaaccagat tatagattta aaaagtattt
     8881 gacatgtatt ttgttattta aaactagaat agccagtatt tatgtttttt ataaaactgt
     8941 gcaatacaaa ttatgcaatc accataactc tgtaactcct gagtgtctta agggagtgcg
     9001 catctttgaa gctgatctgt taatacttat gtaataaatg gttaaatatc aaatgatgat
     9061 gctgctgcca aaattatatt aatagcaagt ttctggcccc tgggcaattt tgtaccttgt
     9121 aattatccta tggtgatgct gtttctcgtt gctaatggca ttagtgcctc gtatcctagt
     9181 gataactcca ggtctgtgaa ccattcaaac aacattcatt tttgagacaa agcaacttta
     9241 gtttcaaggg taattttaag cttcaaaatt aatcatttaa agtatttctt taagagagcc
     9301 atgttagagg ctcacacgtt agcttgaaag gagtcgatga aataattttt ttaaagggaa
     9361 ctttttacac gacgtttgga ataacagcat attgctgacc agtcagtgtc atctcccagg
     9421 tgaattttga tgtcacgtta tagtcaaatg agctagctta tggtttctag attctcttca
     9481 tctgaaccaa catgatgcag taggtgagaa gttattatgc atatatacac atatacatcc
     9541 atataaggca aagtaggagc tgtcccctta ggatgcatag ctgccccctt agggtgcaga
     9601 gctcaacact gacagtgctg ttctgagaga gccacgtttg ggttttattt ctttgtcaca
     9661 tgatttcttt tctggatgtg tgcaagtata acaggaagtg ttttctacct gttgcttgtt
     9721 ttgtacctgg gtctcgtttt actagaccat ctctgcacaa aagtttaaaa aatgaaccgt
     9781 aagcagagtt ctgaagcaag tcaagtttgt aagtgcatac ctgaaattat ttaataaatg
     9841 acgcagaata ct
//

GenBank-Updates@genbank.bio.net (08/15/90)

LOCUS       RHPNIFDK     3500 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  Parasponia rhizobium nifD and nifK genes coding for the alpha- and
            beta-subunits of the Mo-Fe protein of nitrogenase, complete cds.
ACCESSION   X01139
KEYWORDS    nitrogenase.
SOURCE      Parasponia rhizobium (strain ANU289) DNA, clones pR289nif-[3,4,5].
  ORGANISM  Parasponia rhizobium
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Hamamelidae; Urticales; Ulmaceae.
REFERENCE   1  (bases 1 to 3500)
  AUTHORS   Weinman,J.J., Fellows,F.F., Gresshoff,P.M., Shine,J. and Scott,K.F.
  TITLE     Structural analysis of the genes encoding the molybdenum-iron
            protein of nitrogenase in the Parasponia rhizobium strain ANU289
  JOURNAL   Nucleic Acids Res. 12, 8329-8344 (1984)
  STANDARD  simple staff_review
COMMENT     EMBL features not translated to GenBank features:
               key        from     to       description
            
               PRM         108    124       consensus promotor sequence
               SITE        135    135       transcription start
               RBS         163    168       pot. ribosome binding site
               RBS        1753   1758       pot. ribosome binding site
               SITE       3322   3355       pot. stem-loop structure
FEATURES       from  to/span     description
    pept        176     1678     Mo-Fe protein alpha-subunit
    pept       1767     3308     Mo-Fe protein beta-subunit
BASE COUNT      826 a    957 c    992 g    725 t
ORIGIN      1 bp upstream of EcoRI site.
        1 gaattctccg tgcaaagcgc gatgtcgcct tcgcaacaac aaccagcccc atcggacgaa
       61 acgcgctaac tgtttttatt tattctgctt tttgtgctcg cgccgcgctg gcatgctcgt
      121 tgcagtcttg ttcaagaagc tgctcccgca cagttaattc ttgaaggaca tcagcatgag
      181 tctcgccacg acccagagca tcgcagaaat cagggctcgc aataaagagc tgatcgagga
      241 ggtgctgaaa gtctatccgg agaagaccgc gaaacggcgt gccaagcacc tcaacgttca
      301 ccaagccggc aagtcggact gcggggtcaa gtccaacatc aaatcaatac ctggtgtgat
      361 gacaatcaga ggctgcgcct atgcaggatc caaaggggtg gtctggggac cgatcaagga
      421 catggtccat atcagccatg gcccggtcgg ctgtggtcag tattcgtggg gctcgcgtcg
      481 caactattat gttggcacga cgggcgtcga tagtttcgtg accctgcagt tcacctccga
      541 cttccaggaa aaggacatcg tatttggcgg cgacaagaag ctgatcaaag tccttgacga
      601 aatccaggag ctgttcccgc tcaacaacgg catcaccatc caatcggaat gcccgatcgg
      661 actgatcggg gacgacatcg aggctgtgtc aagatcgaaa tccaaagaat acggcggcaa
      721 gaccatcgtg cctgttcgct gtgagggctt tcgcggcgtg tcgcaatcgc ttggccacca
      781 cattgccaat gacgcggtgc gcgattggat cttcgacaag ctagagcccg agggcgaacc
      841 aaagttccag ccgacgccct acgacgttgc gatcatcgga gactacaata ttggcggcga
      901 tgcctggtca tcgcgcattc tgctggaaga aatgggcttg cgggtgattg cgcagtggtc
      961 cggcgacggt tccctcgccg aactcgaagc aacgccgaag gcaaagctca atattctgca
     1021 ttgctaccgt tccatgaact acatctcccg ccacatggag gagaagtttg gcatcccctg
     1081 gtgcgagtac aacttcttcg gaccgtcgaa gatcgcagaa tcgctgcgca agattgcggg
     1141 ctatttcgac gacaagatca aggaaggcgc cgagcgagta attgaaaaat accagccact
     1201 ggtggacgcc gtaatcgcaa aatatcgccc ccgcctggag ggcaagactg tgatgctgta
     1261 cgtcggcggg cttcgtccac gtcatgtgat tggcgcgtac gaggatctcg gcatggaagt
     1321 cgtgggcacc ggatacgagt tcggccacaa cgacgattat cagcgcaccg cccagcacta
     1381 cgttaaggac agcacgctca tctacgacga cgtcaatggc tatgaattcg agcgcttcgt
     1441 cgaaaaggtc caaccagatc tggttggctc gggcatcaag gagaaatacg ttttccaaaa
     1501 gatgggtgtg ccgttcccgg agatgcattc ctgggactat tccggcccat atcacggcta
     1561 tgacggcttt gcgatcttcg cgcgggacat ggacatggct gtcaactcgc cgatctggaa
     1621 gaagacgaag gccccctgga aggaagctgc gaagccgaag ctcttggctg cagaataaca
     1681 agcacttggt tccacaatag agcgatcaat cccgctctct gcggagagct ggggcgacat
     1741 catttcgata gtgaaggatc ttaacaatgg cgcagagtgc agaccatgtg ctcgatcatc
     1801 tcgaactgtt ccgcggtcca gaataccaac aaatgctggc cgacaagaag atgttcgaga
     1861 atccccgcga tcctgccgag gtcgaacgta tccgagcagt gacgaaaacg cccgaatatc
     1921 gcgagaagaa ttttgcggag gcgcttgcgg taaatccggc caaggcttgc cagccgcttg
     1981 gcgccgtatt cgtctcggtt ggttttgaag gcacgctgcc cttcgtccat ggctcgcagg
     2041 gctgcgtggc ctattaccgc agccatctgt cgcggcactt caaggagccg agctcctgcg
     2101 tgtcttcgtc gatgacggaa gacgccgctg tattcggggg gctgaacaat atgatcgatg
     2161 gcctcgccaa cagctacaac atgtacaaac ccaagatgat ttgctcgacg acctgcatgg
     2221 ccgaggtgat cggcgatgac ctgaacgcct tcatcaagac atcaaaagaa aaaggctcgg
     2281 ttcggcggag ttcgactcct ttcgcgcaca ctccagcgtt cgtcggcagc cacgtcaccg
     2341 gctatgacaa cgcactcaag ggcattctcg agcacttttg gaacggcaag gccggaacgg
     2401 cgccgaagct ggagcgcaaa ccaaacgagg caatcaacat catcggcggt ttcgatggca
     2461 ataccgttgg aaaccttcgt gagatcaagc gaatcttagc gttgatgggc atcaaacaca
     2521 cgattctcgc cgataactct gaagtcttcg ataccccgac tgatggcgag ttccggatgt
     2581 atgacggcgg tacccacgtg gaggacacgg ccaacgcgat tcacgccaag gcgacaatct
     2641 ccatgcagca atggtgtacg gaaaaaacgc tgccgttcgt gtccgagcat ggacaggacg
     2701 ttgtgtcttt caattacccg gtaggtgtat ccgcgacgga tgatcttctc gtggccttgt
     2761 cacgcatcag cggcaaggag attccggagc aactcgcgcg agagcgtggc cgcttggttg
     2821 atgccatcgc ggattccagc gcgcatatcc atggcaagaa gttcgcgatc tacggcgatc
     2881 cggatctctg ctatgggttg gctgcctttc tgctcgaact cggcgccgag cctactcatg
     2941 tgctgtccac caacggcaac aacgtggcag gagaaaatgc gacgctgttt gcaggctcgc
     3001 catttggaga acttccagcc tatccgggac gagacctctg gcacatgcgc tcgctcttgt
     3061 tcacagagcc ggttgacttt ctgattggca acacccatgg caagtacctg gagcgtgaca
     3121 ctggaacgcc attgatccgc atcggctttc caatttttga tcggcatcac catcaccgct
     3181 tccctgtatg gggctatcag ggcggcctga atgtgctggt gaagatcctc gacaagatct
     3241 tcgacgaaat cgacaagaag accagcgttc ttggcaaaac tgactacagt ttcgacatca
     3301 ttcgttgatg acgggcagtg cgcgtgggct cgccgaaaca gcggcgagcc cacgctgggc
     3361 actggttgac attgaaattt tcttccgctg agaggaaaat gctgatgagt tcgtctagtc
     3421 ggccacggtc cagggtattt tcaggcgaac cgggctgccg aagaatggaa gtaagtcgga
     3481 ggctgagcgc aagaagggct
//
LOCUS       RHPNIFH      2030 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  Parasponia rhizobium nitrogenase (nifH) gene, iron protein
            component.
ACCESSION   K00487
KEYWORDS    nifH gene; nitrogenase; unidentified reading frame.
SOURCE      Parasponia rhizobium (strain ANU289) DNA.
  ORGANISM  Parasponia rhizobium
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Hamamelidae; Urticales; Ulmaceae.
REFERENCE   1  (bases 1 to 2030)
  AUTHORS   Scott,K.F., Rolfe,B.G. and Shine,J.
  TITLE     Nitrogenase structural genes are unlinked in the nonlegume symbiont
            Parasponia rhizobium
  JOURNAL   DNA 2, 141-148 (1983)
  STANDARD  full staff_review
COMMENT     [1] states the iron protein subunit is encoded on a separate operon
            from other components of the nitrogenase enzyme complex, unlike
            previously studied nitrogen-fixing prokaryotes.
FEATURES       from  to/span     description
    pept        576     1460     nifH (nitrogenase iron protein)
    mRNA        421  >  1460     nifH mRNA
BASE COUNT      430 a    572 c    618 g    410 t
ORIGIN      5 bp upstream of PstI site
        1 ctgcagggcc cttgtaaggc gcttcttgct gcctttaagc tcatgcgcac cgatctgatc
       61 agctggatca atcgggaggt cagccgcaca attgatctcg tcatcctcga ccacgaaccc
      121 catcgccggc cacttgcctt gaggttctga cctcgacctg catattgctc tccgcggatt
      181 gccgccactg gcttgcaaga agaggagcaa gtcccgttcc agttgaggaa atcgaaccag
      241 atcatgccaa accggcgttt tccggttgat gggtgtggcc gttgttcgtt ttctgacagc
      301 cgcgcagatc ctgtccggtg caaacctccc tggggtagct cagcggctcg ttggcttttt
      361 agagcgtaat caagaagctt aataagcgcg gacagtgttg gcatggcgat tgctgttgag
      421 ttgcagcaac actgagtgag ggctgggtgc acgccgacgc gtaagacgag cgatgcgctc
      481 cttcccttga acccgtgtgc cccgtttctg agagagaaac aagctcgcgt gtcggaagca
      541 cgcaactttt ggcaaatcgg ttgatggaga acaacatgtc ttcactgaga caaatcgcgt
      601 tctacggaaa gggcggcatc ggcaagtcga ccacgtccca gaatacgttg gcggcactgg
      661 ccgagatggg ccagaaaatc ctgatcgtgg gatgcgatcc taaggcggac tcgacgcgcc
      721 tcatcctgca cgcgaaggcg caggacacga ttttgagcct tgcagcgagc gctggcagcg
      781 tggaagacct cgaactcgag gacgtgatga aggtcggcta caaggacatc cgatgcgtgg
      841 agtccggtgg tcccgagccg ggtgtcggct gcgcgggccg cggcgtcatc acctcgatca
      901 atttcctgga ggagaacggc gcctatgaga acattgacta tgtctcatat gacgtgctcg
      961 gcgacgtcgt ttgcggtggc tttgcgatgc cgatccggga aaacaaggcg caggagatct
     1021 atatcgtgat gtctggagaa atgatggcaa tgtatgccgc aaacaatatc tccaaaggta
     1081 tcctgaaata cgccaactct ggcggcgtgc ggctgggcgg cctgatctgc aacgagcggc
     1141 agaccgataa ggagctggag ctggcggagg cgctggccaa gaagttaggt actcagctga
     1201 tctacttcgt gccgcgcgac aatgtggtgc agcatgccga gctacggcgc atgacggtgc
     1261 tggagtatgc ccctgagtcg cagcaggccg atcactatcg caatcttgcg accaaggttc
     1321 acaacaatgg cggcaaaggc atcattccga ctccgatctc catggatgag ctcgaggaca
     1381 tgctgatgga gcatggcatt atgaagcccg tcgacgaatc catcgtcggc aagaccgccg
     1441 ccgaactcgc ggcctcgtaa aggtcgcggg tcgcggcctt gtgaaggcgc gcgacggatg
     1501 ccggtctccc tcacccccca tccggggaga ccggcattct gacgattatc tgaccagcca
     1561 gagtggagct ggcaaccgtg accgctatgg gaacccaaaa catcatgaca ggagcgcact
     1621 tccttccgct tatggcttct tgcgccgtcg aggcgagcag caaggtgcaa agaggaattg
     1681 cgacctaccg agcgctcact ggcgtcctcc tgaagaggcc gacattgcga ccgacagcaa
     1741 tttcgattgc catgtcctgg cgtcaatcct ggcggccgct cgatggatgg tggcccgctt
     1801 cccgagcgcc ctgtccgcca ccagctggcg accctgctcg cagcaatttc catcggttga
     1861 ggtcgatatc tcggagcagc tcctggcgtc taagtgcgat gagaatgacg agatcgcgat
     1921 ggtgcgcgat cttttgctca agcaacgctc gacggacggg catattcggg ctggctagcc
     1981 gcgatgattg cgcgccgcgc catagagcca gatcacctgt gggaagatct
//
LOCUS       RHPHBEM      1520 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  P.andersonii haemoglobin gene, complete cds.
ACCESSION   M36509
KEYWORDS    haemoglobin.
SOURCE      P.andersonii DNA.
  ORGANISM  Parasponia andersonii
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Hamamelidae; Urticales; Ulmaceae.
REFERENCE   1  (bases 1 to 1520)
  AUTHORS   Landsmann,J., Dennis,E.S., Higgins,T.J.V., Appleby,C.A.,
            Kortt,A.A. and Peacock,W.J.
  TITLE     Common evolutionary origin of legume and non-legume plant
            haemoglobins
  JOURNAL   Nature 324, 166-168 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        198      313     haemoglobin, exon 1
                436      550     haemoglobin, exon 2
                877      993     haemoglobin, exon 3
               1153     1293     haemoglobin, exon 4
    IVS         314      435     haemoglobin intron A
    IVS         551      876     haemoglobin intron B
    IVS         994     1152     haemoglobin intron C
BASE COUNT      470 a    281 c    276 g    493 t
ORIGIN      
        1 ttatcttact aaaaagaaaa cgaaaataaa aaacccaaag atatggctcc ccaataccct
       61 gaagagttac acacgatccc cattttttct actatatata cagagtgcct tcaccagatt
      121 ttccaaacac actccaacat atcccattgc ccaaataaaa atttctcagc ttttagtccc
      181 ctcaacccac agaagccatg agcagctcag aagttaacaa agttttcaca gaggagcagg
      241 aagctctggt ggtgaaagca tgggctgtaa tgaagaagaa ctctgctgaa ctgggtcttc
      301 aattcttcct caagtaagtc aaaattatat atagtacact ttttatttac tttgcttctt
      361 ttatagacca agtttttgaa taaaagggta ctattttttt ttcctgaaaa aaattggttg
      421 attgaaactt tgcaggatat ttgagattgc accgtctgcc aagaacttgt tctcttattt
      481 gaaggactct ccggttcctt tggagcagaa cccaaagctc aagccccatg ctacgactgt
      541 cttcgttatg gtaaagccaa cttttgttct cctattccct tatcctaatt ttacaagaat
      601 ctaatgttaa taaaatagta ttttgcctat ttaaacaacc aaaaatttag acacaactat
      661 ataaaacatt taaattcttg tggtttatga taccttgatc tacaatgatt ccaacttccc
      721 gtgttgcatt tatgagttgt gctagcaaca gtcgcatcac agtcgtctat tccagaaagg
      781 acgactgtga ctcttgagac atatcaaagc aaagctcagc aatttttatg tttctcactt
      841 gctctgttct ttttctctgg tacttgtcct ggaaagacat gtgagtctgc ggttcaactt
      901 cggaaagccg gaaaagtgac agtgaaagaa tcagacttga aaagaattgg ggctatccac
      961 ttcaaaactg gcgtagttaa tgaacatttt gaggtactac cctggccact tagtagatat
     1021 aattccctaa gtgtaatcca aacatttgtt gtttagagtc aaattattat tattctgtat
     1081 ggtggttctt gaataatcga tcttattatg gtatttacta attatattat gcatgggaaa
     1141 aacgatttgt aggtcacaag gtttgcactt ttggagacca taaaggaagc agtaccagaa
     1201 atgtggtcac ctgagatgaa gaacgcatgg ggagtagctt atgatcagtt ggttgctgcc
     1261 atcaagttcg aaatgaaacc ctccagtact tgagaatttt tatagttctt ggaacaattg
     1321 ggtttgaata atgtgacaaa acttatactt aattacgttt gcatgagaga gaggtaataa
     1381 ttgcatagtg tataacttgc atatgtatca tagtgtgacg caatctctcc acttgtgttg
     1441 ttcatcttgt tcaaaaggaa ttagtctttc actttacatt ttgggtggaa gtatggaatg
     1501 aaatcagagt ttcattgatt
//
LOCUS       PT7RNAA       266 bp ds-DNA             PHG       15-AUG-1990
DEFINITION  Bacteriophage T7 RNA polymerase gene 1, 3' end.
ACCESSION   M24964 M24965
  ORGANISM  Bacteriophage T7
            Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE   1  (bases 1 to 266)
  AUTHORS   Osterman,H.L. and Coleman,J.E.
  TITLE     T7 ribonucleic acid polymerase-promoter interactions
  JOURNAL   Biochemistry 20, 4884-4892 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      201     RNA polymerase (gene 1; AA at 1)
    mRNA    <     1      266     gene 1 mRNA
BASE COUNT       72 a     66 c     63 g     65 t
ORIGIN      
        1 ccggctgacg ctgcgaacct gttcaaagca gtgcgcgaaa ctatggttga cacatatgag
       61 tcttgtgatg tactggctga tttctacgac cagttcgctg accagttgca cgagtctcaa
      121 ttggacaaaa tgccagcact tccggataaa ggtaacttga acctccgtga catcttagag
      181 tcggacttcg cgttcgcgta acgccaaatc aatacgactc actatagagg gacaaactca
      241 aggtcattcg caagagtggc ctttat
//
LOCUS       PT7RNAB       139 bp ds-DNA             PHG       15-AUG-1990
DEFINITION  Bacteriophage T7 class III RNA polymerase promoter L1 fragment.
ACCESSION   M24966
  ORGANISM  Bacteriophage T7
            Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE   1  (bases 1 to 139)
  AUTHORS   Osterman,H.L. and Coleman,J.E.
  TITLE     T7 ribonucleic acid polymerase-promoter interactions
  JOURNAL   Biochemistry 20, 4884-4892 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA         58  >   139     L1 mRNA
BASE COUNT       44 a     25 c     28 g     42 t
ORIGIN      
        1 cggtatttaa ttaaatattc tccctgtggt ggctcgaaat taatacgact cactataggg
       61 agaacaatac gactacggga gggttttctt atgatgacta taagacctac taaaagtaca
      121 gactttgagg tattcactc
//
LOCUS       PT7RNAC       141 bp ds-DNA             PHG       15-AUG-1990
DEFINITION  Bacteriophage T7 L2 nonpromoter fragment.
ACCESSION   M24967
  ORGANISM  Bacteriophage T7
            Viridae; ds-DNA nonenveloped viruses; Podoviridae.
REFERENCE   1  (bases 1 to 141)
  AUTHORS   Osterman,H.L. and Coleman,J.E.
  TITLE     T7 ribonucleic acid polymerase-promoter interactions
  JOURNAL   Biochemistry 20, 4884-4892 (1981)
  STANDARD  simple staff_review
BASE COUNT       36 a     35 c     35 g     35 t
ORIGIN      
        1 cggaagtgct ggcattttgt ccaattgaga ctcgtgcaac tggtcagcga actggtcgta
       61 gaaatcagcc agtacatcac aagactcata tgtgtcaacc atagtttcgc gcactgcttt
      121 gaacaggttc gcagcgtcag c
//
LOCUS       SIVSMMM7     1210 bp ss-RNA             VRL       15-AUG-1990
DEFINITION  Simian immunodeficiency virus (SIV) pol region.
ACCESSION   M27256
KEYWORDS    .
SOURCE      Simian immunodeficiency virus (isolate SMM-M7).
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 1210)
  AUTHORS   Li,Y.
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_entry
COMMENT     This sequence corresponds to the 3' third of the pol gene.  Kindly
            provided in computer-readable form by Yen Li.
            
               Author address:Y.Li
               New England Regional Primate Research Center
               Southborough, Massachusetts 01772
               (508-481-0400).
BASE COUNT      478 a    207 c    269 g    256 t
ORIGIN      
        1 gcccggccag taatccgccc accattgctc ccgaatttcg acccctcctc tagtcagatt
       61 agtgttcaat ttggtaaagg atcccatcga agaaatagga acattttatg tggatggctc
      121 ttgcaataaa cagtcaaaag agggaaaagc aggatacata acagacagaa ggaggagcaa
      181 aataaagttc ttagaacaga ctaccaatca gcgagcagaa ttagaagcct ttctcatggc
      241 agtaacagat tcaggagcag aggcaaatat tatagtagat tctcaatatg tgatggggat
      301 agtgacaagg caacccactg aatcagaaag taaaatagta aatcagataa tagaagaaat
      361 gatcaaaaag acagcagtat atgtgacata ggtaccagct cataaaggtc taggaagaaa
      421 tcaagaaata gaccatttag ttagtcaaag gattaggcaa gtcttgttcc tagaaaagat
      481 agaaccagcc caagaagagc acgaaaaata tcacagcaat gtaaaagaat tggtctttaa
      541 atttaggata ccaagattag tagcaaaaca gatagtagat acctgtgata aatgccagca
      601 gaaaggagaa gctatacata gacaggtaaa cacagagtta agaatttggc aaatagactg
      661 cacacaccta gagggcaaag ttgttatagt agcagtacat gtggctagtg gattcataga
      721 ggcagaagta atcccacaag aaacaggaag acagacagca ttgttcctgt taaaattagc
      781 tagcaggtgg cccatcacac acctgcacac agataatggt gctaactttg cttcgcaaga
      841 agtaaagatg gtagcctagt gggcagatat agaacacacc tttaaggtac catataatcc
      901 acaaagtcaa agagtagtag aagcaatgaa tcatcaccta aagaatcaga tagagagaat
      961 tagagagcag gcaaattcag tagaaacaat agtgctcatg gcagttcatt gcatgaattt
     1021 taaaagaagg ggaggaatag gggatatgac cccagcagaa agattaatta atatgatcac
     1081 cacagaacaa gaaatacaat tccaacaatc aaaaaattca aaatttaaaa attttcgggt
     1141 ctatttcaga gaaggcagag accaactgtg gaaaggaccc ggtgaattac tgtggaaagg
     1201 ggaaggagca
//
LOCUS       ADEAD5A       180 bp ds-DNA             VRL       15-AUG-1990
DEFINITION  Adenovirus type 5 packaging domain region.
ACCESSION   M36423
KEYWORDS    .
SOURCE      Adenovirus type 5 (strain dl309) DNA.
  ORGANISM  Mastadenovirus h5
            Viridae; ds-DNA nonenveloped viruses; Adenoviridae.
REFERENCE   1  (bases 1 to 180)
  AUTHORS   Graeble,M. and Hearing,P.
  TITLE     Adenovirus type 5 packaging domain is composed of a repeated
            element that is functionally redundant
  JOURNAL   J. Virol. 64, 2047-2056 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    site          1      156     packaging domain
BASE COUNT       48 a     27 c     52 g     53 t
ORIGIN      
        1 gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag taaatttggg
       61 cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga agtgaaatct
      121 gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg gcatttgacc
//
LOCUS       CAJFJAAB     1932 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  C.coli flagellin (flaB) gene, complete cds.
ACCESSION   M35141
KEYWORDS    flaB gene; flagellin.
SOURCE      C.coli (strain VC167, serogroup LIO 8) DNA.
  ORGANISM  Campylobacter coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 1 to 1932)
  AUTHORS   Guerry,P., Logan,S.M., Thornton,S. and Trust,T.J.
  TITLE     Genomic organization and expression of Campylobacter flagellin
            genes
  JOURNAL   J. Bacteriol. 172, 1853-1860 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        211     1932     flagellin (flaB)
    mRNA        185  >  1932     flagellin mRNA
BASE COUNT      638 a    325 c    387 g    582 t
ORIGIN      
        1 taacaaatcc aagcctagta gtaatactag gcttttttat ttctaaataa aacttggaac
       61 attctttagc gtttactgta atttatacaa atccaagcct agtagtaata ctaggctttt
      121 tttatttcta aataaaattt caatttgaat caaaacttgg aacacttctt gctttaatct
      181 tttcgatgca atattttgaa aggatttaaa atgggtttta gaataaacac caacatcggt
      241 gcattgaacg cacatgcaaa ttcagttgtt aatgctaggg agcttgacaa gtctttaagt
      301 agacttagct caggtcttag aatcaactcc gcagcagatg atgcttcagg gatggcgata
      361 gcagattctt tgcgttcaca agcagcaact ttaggtcaag ctataaacaa tggtaatgat
      421 gctataggta tcttgcaaac tgcagataag gctatggatg agcaacttaa aatcttagat
      481 accatcaaga ctaaagcgac tcaagctgct caagatggtc aaagcttaaa aacaagaact
      541 atgcttcaag cagacatcaa ccgtttgatg gaagaacttg ataatatcgc aaataccact
      601 tcatttaatg gcaaacaact tttaagtggt ggttttacca atcaagaatt ccaaatcggt
      661 tcaagttcaa atcaaactat taaagcaagt ataggagcaa ctcagtcttc taaaatcggt
      721 gtaacaagat ttgaaacagg ttcacaaagt ttttcttcag gcactgtagg acttactatt
      781 aaaaactaca acggtatcga agattttaaa tttgatagtg tagtgatttc tacttcagta
      841 ggaacaggtc ttggagcttt ggctgaagag atcaacagaa atgcagataa aacaggaatt
      901 cgtgcaactt ttgatgtaaa atctgtagga gcctatgcaa taaaagcagg aaatacttct
      961 caggattttg ctatcaatgg ggttgttatc ggacaaataa attataatga cggtgataac
     1021 aatggtcaac ttatctcagc tatcaatgct gtaaaagata caactggtgt tcaagcctct
     1081 aaagatgaaa atggtaaact tgttcttact tcggccgatg gtagagggat taaaatcaca
     1141 ggtagcatag gtgtaggagc tggtatattg cacactgaaa attatggaag gttatcttta
     1201 gttaaaaatg atggtagaaa tatcaatata agtggaacag gtctttcagc tataggtatg
     1261 ggtgctacag acatgatttc tcaatcttca gtatctctaa gagagtcaaa agggcaaatt
     1321 tcagcagcca atgctgatgc tatgggcttt aatgcttata atggcggcgg cgctaagcaa
     1381 attattttcg cttctagtat tgcaggattt atgtctcagg ctggttcagg cttctctgct
     1441 ggttcgggat tttcagtagg tagtggtaaa aattattcag ccattttatc agcttctata
     1501 cagatagtat ctagcgcagc ttctatcagt agcacctatg ttgtttctac tggttcaggt
     1561 ttctctgctg gttcaggtaa ttctcaattt gcagctttaa gaataagtac agtaagtgct
     1621 catgatgaaa ctgcaggtgt aactacactt aagggtgcaa tggctgtgat ggatatagca
     1681 gaaactgcta ttaccaattc tgatcaaatc agagcggata taggtgctgt gcaaaatcag
     1741 ctccaagtaa cgataaataa tattaccgta acccaggtaa atgttaaagc agcagaatca
     1801 accataagag atgtggattt cgctgcagaa agtgcaaatt tttctaagta caatatcctt
     1861 gcgcagtcgg gttcatatgc tatgagccaa cgtaacgctg tgcaacaaaa tgtcttaaaa
     1921 cttttacaat aa
//
LOCUS       CAJFLA       1719 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  C.coli flagellin gene, complete cds.
ACCESSION   M26945
KEYWORDS    flagellin.
SOURCE      C.coli (strain VC167) DNA.
  ORGANISM  Campylobacter coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic/microaerophilic, motile, helical/vibrioid bacteria.
REFERENCE   1  (bases 1 to 1719)
  AUTHORS   Logan,S.M., Trust,T.J. and Guerry,P.
  TITLE     Evidence for posttranslational modification and gene duplication of
            Campylobacter flagellin
  JOURNAL   J. Bacteriol. 171, 3031-3038 (1989)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept          1     1719     flagellin
BASE COUNT      563 a    284 c    365 g    507 t
ORIGIN      
        1 atgggatttc gtattaacac aaatgttgca gcattaaatg ctaaagcaaa ttcggatcta
       61 aacagcagag cattagatca atcactttca agactcagtt caggtcttag aatcaactcc
      121 gcagcagatg tagcttcagg gatggcgata gcagatagtt taagatctca ggcaaatact
      181 ttgggtcagg ctatatctaa tggtaatgat gctttaggta tcttgcaaac tgcagataag
      241 gctatggatg agcaacttaa aatcttagat accatcaaga ctaaagcgac tcaagctgct
      301 gaagatggtc aaagcttaaa aacaagaact atgcttcaag cagacatcaa ccgtttgatg
      361 gaagaacttg ataatatcgc aaataccact tcatttaatg gcaaacaact tttaagtggt
      421 ggttttacca atcaagaatt ccaaatcggt tcaagttcaa atcaaactat taaagcaagt
      481 ataggagcaa ctcagtcttc taaaatcggt gtaacaagat tgaacaggtt cacaaagttt
      541 tcttcaggca ctgtagggct tactatcaaa aactacaacg gtatcgaaga ttttaaattt
      601 gatagtgtag tgatttctac ttcagtagga acaggtcttg gagctttggc tgaagagatc
      661 aacagaaatg cagataaaac aggaattcgt gcaacttttg atctaaaatc tgtaggagcc
      721 tatgcaataa aagcaggaaa tacttctcag gattttgcta tcaatggggt tgttataggt
      781 aaggttgatt attcagatgg tgatgagaat ggttctttaa tttcagctat caatgctgta
      841 aaagatacaa ctggtgttca agcctctaaa gatgaaaatg gtaaacttgt tcttacttcg
      901 gccgatggta gagggattaa aatcacaggt agcataggtg taggagctgg tatattgcac
      961 actgaaaatt atggaaggtt atctttagtt aaaaatgatg gtagagatat caatataagt
     1021 ggaacaggtt tttcagctat aggtatgggt gctacagaca tgatttctca atcttcagta
     1081 tctctaagag agtcaaaagg gcaaatttca gcagccaatg ctgatgctat gggctttaat
     1141 gcttataatg gcggcggcgc taagcaaatt attttcgctt ctagtattgc agggtttatg
     1201 tctcaggctg gttcaggctt ctctgctggt tcgggatttt cagtaggtag tggtaaaaat
     1261 tattcagcca ttttatcagc ttctatacag atagtatcta gcgcagcttc tatcagtagc
     1321 acctatgttg tttctactgg ttcaggtttc tctgctggtt caggtaattc tcaatttgca
     1381 gctttaagaa taagtacagt aagtgctcat gatgaaactg caggtgtaac tacacttaag
     1441 ggtgcaatgg ctgtgatgga tatagcagaa actgctatta ccaatcttga tcaaatcaga
     1501 gcggatatag gttctgtgca aaatcaaatc acatcgacta taaacaacat tactgtaacc
     1561 caggtaaatg ttaaatcagc agaatcacaa atcagagatg tagattttgc aagcgagagt
     1621 gcaaattact ctaaagcaaa tatattggct caaagtggtt cttatgctat ggctcaagca
     1681 aattcaagcc agcaaaatgt tttaagatta ctacagtag
//
LOCUS       CHKLNKPA1     215 bp ds-DNA             VRT       15-AUG-1990
DEFINITION  Chicken cartilage link protein gene, exon 2.
ACCESSION   M35035
KEYWORDS    cartilage link protein.
SEGMENT     1 of 5
SOURCE      Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
            lambda gLP532 and lambda gLP12.1.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 51 to 65 and 182 to 195)
  AUTHORS   Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
            Argraves,W.S., Sparks,K.J. and Goetinck,P.
  TITLE     Structure of the chicken link protein gene: Exons correlate with
            the protein domains
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 215)
  AUTHORS   Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
            Argraves,W.S., Sparks,K.J. and Goetinck,P.
  TITLE     Structure of the chicken link protein gene: Exons correlate with
            the protein domains
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1,2] kindly
            submitted by I.Kiss, 04-JUN-1990.
FEATURES       from  to/span     description
    pept         87  +   186     cartilage link protein, exon 2 (first
                                 expressed exon
    pre-msg <     1  >   215     cartilage link protein mRNA and introns
    IVS     <     1       60     cartilage link protein intron A
    IVS         187  >   215     cartilage link protein intron B
BASE COUNT       63 a     43 c     47 g     62 t
ORIGIN      
        1 gaattccata aagggttcca aaaaattgat gagcctttct gttatgtgat gcccttacag
       61 tgaagaagat tcttgtgact gtgaagatga caagtctact ctttctggtg ctgatttctg
      121 tctgctgggc agaacctcat cctgacaact caagcctgga gcatgagagg attattcaca
      181 tccaaggtaa ggaaatacat cagaaaacgc ctttt
//
LOCUS       CHKLNKPA2     460 bp ds-DNA             VRT       15-AUG-1990
DEFINITION  Chicken cartilage link protein gene, exon 3.
ACCESSION   M35036
KEYWORDS    cartilage link protein.
SEGMENT     2 of 5
SOURCE      Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
            lambda gLP39.13 and lambda gLP33.7.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 49 to 63 and 429 to 442)
  AUTHORS   Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
            Argraves,W.S., Sparks,K.J. and Goetinck,P.
  TITLE     Structure of the chicken link protein gene: Exons correlate with
            the protein domains
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 460)
  AUTHORS   Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
            Argraves,W.S., Sparks,K.J. and Goetinck,P.
  TITLE     Structure of the chicken link protein gene: Exons correlate with
            the protein domains
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.Kiss, 04-JUN-1990.
FEATURES       from  to/span     description
    pept    +    59  +   433     cartilage link protein, exon 3
    pre-msg <     1  >   460     cartilage link protein mRNA and introns
    IVS     <     1       58     cartilage link protein intron B
    IVS         434  >   460     cartilage link protein intron C
BASE COUNT      145 a     90 c    113 g    112 t
ORIGIN      
        1 tctgtaaaag gtggagtgca gactaattct cctttttgtt tttctccttg aattgtagaa
       61 gaaaatggac cccgcctact tgtggtagca gaacaagcta agatcttctc tcagcgaggt
      121 ggcaacgtca cactgccttg taaattttac catgaacaca catcaacagc tggctcagga
      181 acccacaaaa tccgggtcaa gtggaccaaa ctcacctcag attacctcaa agaagtggat
      241 gtctttgtcg caatgggaca ccacagaaag agctacggaa agtatcaggg cagagtgttt
      301 ctgagggaaa gcagtgagaa cgatgcctct cttataatca cgaatataat gctggaggat
      361 tatgggagat acaagtgcga agtgattgaa ggattagagg acgacacagc agtggtagct
      421 ctgaatttgg aaggtaggta acatctaatg tagacttaaa
//
LOCUS       CHKLNKPA3     427 bp ds-DNA             VRT       15-AUG-1990
DEFINITION  Chicken cartilage link protein gene, exon 4.
ACCESSION   M35037
KEYWORDS    cartilage link protein.
SEGMENT     3 of 5
SOURCE      Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
            lambda gLP33.7 and lambda gLP10.1.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 44 to 58 and 352 to 365)
  AUTHORS   Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
            Argraves,W.S., Sparks,K.J. and Goetinck,P.
  TITLE     Structure of the chicken link protein gene: Exons correlate with
            the protein domains
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 427)
  AUTHORS   Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
            Argraves,W.S., Sparks,K.J. and Goetinck,P.
  TITLE     Structure of the chicken link protein gene: Exons correlate with
            the protein domains
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.Kiss, 04-JUN-1990.
FEATURES       from  to/span     description
    pept    +    54  +   356     cartilage link protein, exon 4
    pre-msg <     1  >   427     cartilage link protein mRNA and introns
    IVS     <     1       53     cartilage link protein intron C
    IVS         357  >   427     cartilage link protein intron D
BASE COUNT       99 a    108 c    105 g    115 t
ORIGIN      
        1 aaaaaccctt ctagtgggga ttacccccag ctcacctctt tttgccattt caggtgttgt
       61 tttcccctat tctccacgtc tgggtcgtta caacctaaac ttccatgagg ctcagcaagc
      121 ttgcctggac caggactcca tcattgcctc cttcgaccag ctctacgagg cctggaggtc
      181 agggctggac tggtgcaatg ctggctggct cagtgatggt tcagtgcagt accctatcac
      241 caagcccaga gagccctgtg gagggaagaa tacggtgccc ggtgtcagaa actatggctt
      301 ctgggataaa gagaggagcc gatatgatgt tttctgcttt acttcaaact tcaatggtaa
      361 gaacctggtt tacatttacc ttgcaagggt ctttttccat gctttaaaaa gaaagagatg
      421 ccagcgg
//
LOCUS       CHKLNKPA4     826 bp ds-DNA             VRT       15-AUG-1990
DEFINITION  Chicken cartilage link protein gene, exon 5.
ACCESSION   M35038
KEYWORDS    cartilage link protein.
SEGMENT     4 of 5
SOURCE      Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
            lambda gLP10.1 and lambda gLP39.23.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 15 to 29)
  AUTHORS   Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
            Argraves,W.S., Sparks,K.J. and Goetinck,P.
  TITLE     Structure of the chicken link protein gene: Exons correlate with
            the protein domains
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1 to 826)
  AUTHORS   Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
            Argraves,W.S., Sparks,K.J. and Goetinck,P.
  TITLE     Structure of the chicken link protein gene: Exons correlate with
            the protein domains
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.Kiss, 04-JUN-1990.
FEATURES       from  to/span     description
    pept    +    25      314     cartilage link protein, exon 5
    pre-msg <     1  >   826     cartilage link protein mRNA and introns
    IVS     <     1       24     cartilage link protein intron D
    signal      786      792     AATAAA sequence
BASE COUNT      262 a    166 c    162 g    236 t
ORIGIN      
        1 atggctccct ccgtctctcc ccaggtcgtt tttactacct aatacaccca accaagctga
       61 cctatgatga agccgtgcag gcctgcctga aggatggcgc tcagattgcc aaggttgggc
      121 agatattcgc tgcctggaag ctccttggtt atgaccgctg tgatgccggc tggctggcag
      181 acggcagcgt ccgctacccc atctccagac ccagaaagcg ctgcagcccc aacgaggctg
      241 ccgtccgctt tgtaggcttt cctgataaaa agcacaagct gtatggtgtc tactgtttca
      301 gagcttacaa ctgaaaatac ctagagctgc aacagtcttt aattcattaa gaacatgtga
      361 aatatttcga tatgaactcg tgcaagttac caaaactgtg ataaaccttt cttacttact
      421 gtagagtcat tttcataaac caaaaccatt aatttgtttt tgtttctgtt taaatatttt
      481 tgtaaaagta tcattccata gatatttaaa aataatataa gtttaatgga agctctaggt
      541 aagaagagcc aaattcttta agctacgtca tcccaacaaa atataatttt catgaatggg
      601 gcatgcaata gagcttgaca attgctagga cacaattatg gaatgtaagg ctactcaaag
      661 cagaagcttt taaaagcaca aattttacat gtttgtaccc gtttgagata cacagcaaat
      721 tgattgtatc tggagttttg aattaagatg tttttgttta taggggtcag tgaggttttg
      781 caaaaaataa aaattaaaaa aaaaaaaaaa aaaaaaaaag gccgcc
//
LOCUS       CHKLNKPA5     217 bp ds-DNA             VRT       15-AUG-1990
DEFINITION  Chicken cartilage link protein gene, exon 6.
ACCESSION   M35039
KEYWORDS    cartilage link protein.
SEGMENT     5 of 5
SOURCE      Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones
            lambda gLP10.1 and lambda gLP39.23.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 217)
  AUTHORS   Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K.,
            Argraves,W.S., Sparks,K.J. and Goetinck,P.
  TITLE     Structure of the chicken link protein gene: Exons correlate with
            the protein domains
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by I.Kiss, 04-JUN-1990.
FEATURES       from  to/span     description
    pre-msg <     1      217     cartilege link protein mRNA and intron
    signal       44       49     poly-A signal
    signal       98      103     poly-A signal
BASE COUNT      102 a     24 c     22 g     69 t
ORIGIN      
        1 tataatattt aatatttctt aagctattta cacatcacaa gaaaataaaa aattggaaaa
       61 aaaaatcaaa tgatcaagtc ttagaagaag attattgaat aaaatctgaa accagctatt
      121 aaggtttaga agagaagaag tactttattt ccttacatct tatctgtatc taaatataca
      181 tctgtttttt aaactatcaa tgaaaaaaaa aaaaaaa
//
LOCUS       CHTCRPA      3012 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  C.trachomatis 9-kD and 60-kD cysteine-rich and 15 kD serine-rich
            outer membrane protein genes, complete cds.
ACCESSION   M35148 M23180 M35161
KEYWORDS    cysteine-rich outer membrane protein;
            serine-rich outer membrane protein.
SOURCE      C.trachomatis (serovar L1) DNA.
  ORGANISM  Chlamydia trachomatis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE   1  (bases 1 to 753 and 1715 to 2577)
  AUTHORS   Lambden,P.R., Everson,J.S., Ward,M.E. and Clarke,I.N.
  TITLE     Sulfur-rich proteins of Chlamydia trachomatis: Developmentally
            regulated transcription of polycistronic mRNA from tandem promoters
  JOURNAL   Gene 87, 105-112 (1990)
  STANDARD  simple staff_review
REFERENCE   2  (bases 483 to 3012)
  AUTHORS   Clarke,I.N., Ward,M.E. and Lambden,P.R.
  TITLE     Molecular cloning and sequence analysis of a developmentally
            regulated cysteine-rich outer membrane protein from Chlamydia
            trachomatis
  JOURNAL   Gene 71, 307-314 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        185      451     9-kDa cysteine-rich outer membrane protein
    pept        703     2259     60-kDa cysteine-rich outer membrane protein
                                 precursor
    sigp        703      735     60-kD serine-rich outer membrane protein
                                 signal peptide
    matp        736     2256     60-kDa cysteine-rich outer membrane protein
    pept       2437     2889     15-kDa serine-rich outer membrane protein
    mRNA         93     2296     CrP operon mRNA (alt.)
    mRNA        159     2296     CrP operon mRNA (alt.)
    mRNA        160     2296     CrP operon mRNA (minor alt.)
    mRNA       2406     2965     SrP mRNA
BASE COUNT      898 a    537 c    678 g    899 t
ORIGIN      
        1 tttgtttgct ttgatttgct aattacctgt tattagacga tttgttttaa aaaacaattg
       61 atataatttt tattttataa tgtaatattg tctatgaggg ctagtttctt ttattattaa
      121 aagaattgct tttatcgata aaagaaactt caagagccct tttctagaaa ggagtctgga
      181 agttatgaaa aaaactgctt tactcgctgc tttatgtagt gttgtttctt taagtagttg
      241 ttgtcgtatc gttgactgtt gcttcgaaga tccatgcgca cctatccaat gttcaccttg
      301 tgaatctaag aagaaagacg tagacggtgg ttgcaactct tgtaacgggt atgtcccagc
      361 ttgcaaacct tgcggagggg atacgcacca agatgctgaa catggccctc aagctagaga
      421 aattccagtt gacggcaaat gcagacaata ggtagcgcaa gttaagagcc tacccacaac
      481 agatgtagtt agtaaggaag ttggcttcct tactaactat ttcggctaac aagaaaatgt
      541 tgagggtaaa agttagttaa taacaatttc tacccgatgg cagacaaaaa ataatctatg
      601 cgaataggag atcctatgaa caaactcatc agacgagcag tgacgatctt cgcggtgact
      661 agtgtggcga gtttatttgc tagcggggtg ttagagacct ctatggcaga gtttatctct
      721 acaaacgtta ttagcttagc tgacaccaaa gcgaaagaca acacttctca taaaagcaaa
      781 aaagcaagaa aaaaccacag caaagagact cccgtaaacc gtaaaaaggt tgctccggtt
      841 catgagtcta aagctacagg acctaaacag gattcttgct ttggcagaat gtatacagtc
      901 aaagttaatg atgatcgtaa tgttgaaatc acacaagctg ttcctaaata tgctacggta
      961 ggatctccct atcctgttga aattactgct acaggtaaaa gggattgtgt tgatgttatc
     1021 attactcagc aattaccatg tgaagcagag ttcgtacgca gtgatccagc gacaactcct
     1081 actgctgatg gtaagctagt ttggaaaatt gaccgcttag gacaaggcga aaagagtaaa
     1141 attactgtat gggtaaaacc tcttaaagaa ggttgctgct ttacagctgc aacagtatgc
     1201 gcttgtccag agatccgttc ggttacaaaa tgtggacaac ctgctatctg tgttaaacaa
     1261 gaaggcccag agaatgcttg tttgcgttgc ccagtagttt acaaaattaa tgtagtgaac
     1321 caaggaacag caacagctcg taacgttgtt gttgaaaatc ctgttccgga tagttacgct
     1381 cattcttctg gacagcgtgt actaacgttt actcttggag atatgcaacc tggagagcac
     1441 agaacaatta ctgtagagtt ttgtccgctt aaacgtggtc gtgctaccaa tatagcaatg
     1501 gtttcttact gtggaggaca taaaaataca gcaagcgtaa caactgtgat caacgagcct
     1561 tgcgtacaag taagtattgc aggagcagat tggtcttatg tttgtaagcc tgtagaatat
     1621 gtgatctccg tttccaatcc tggagatctt gtgttgcgag atgtcgtcgt taaagacact
     1681 ctttctcccg gagtcacagt tcttgaagct gcaggagctc aaatttcttg taataaagta
     1741 gtttggactg tgaaagaact gaatcctgga gagtctctac agtataaagt tctagtaaga
     1801 gcacaaactc ctggacaatt cacaaataat gttgttgtga agagctgctc tgactgtggt
     1861 acttgtactt cttgcgcaga agcgacaact tactggaaag gagttgctgc tactcatatg
     1921 tgcgtagtag atacttgtga ccctgtttgt gtaggagaaa atactgttta ccgtatttgt
     1981 gtcaccaaca gaggttctgc agaagataca aatgtttctt taatgcttaa attctctaaa
     2041 gaactgcaac ctgtatcctt ctctggacca actaaaggaa cgattacagg caatacagta
     2101 gtattcgatt cgttacctag attaggttct aaagaaactg tagagttttc tgtaacattg
     2161 aaagcagtat cagctggaga tgctcgtggg gaagcgattc tttcttccga tacattgact
     2221 gttccagttt ctgatacaga gaatacacac atctattaat ctttgatttt atcgatgtgt
     2281 aggtgccgtc cagggattcc tgggcggctt tttttgttat ctatatgaaa ataaaagagt
     2341 tcattttcgt tctcagagca tattctagat gggtttttga aaaaaataag tgtttgtgta
     2401 gactccctgc tcacaaccaa aaaaggaatg taaaatatga gcactgtacc cgttgttcaa
     2461 ggagctggat cttccaattc ggcacaggat atttccacta gttctgtacc attaacactg
     2521 caagggcgta tatcgaatct tctatcttcc actgcattta aggtgggatt agtggtgatg
     2581 ggactacttt tagtgatggc tacgatattc ctagtttcgg cagcttcgtt tgtaaatccc
     2641 atctatctag ctattcctgc tattgtggga tgcgtgaata tctgcgtagg aattttatcc
     2701 atggaaggat actgttctcc ggagagatgg agcttatgta agaaggtatt aaaggcttca
     2761 gaagatatca tcgatgatgg gcagataaac aactctaata aagtgtttac tgatgagagg
     2821 ttgaatgcca taggtggggt agtggaatct ctatctagaa gaaatagtct ggtggatcag
     2881 acccaatgat aagagattgc tctataggca aaagatgata gcggcagttt ttatggatga
     2941 tctgctgaca gatgatgtat ggaaagggag gaggaaagag tcctcctccc agattttatt
     3001 gagctggagt tt
//
LOCUS       DDIGP80A     1545 bp ss-mRNA            INV       15-AUG-1990
DEFINITION  D.discoideum membrane-associated glycoprotein (gp80) mRNA,
            complete cds.
ACCESSION   M36545
KEYWORDS    gp80 gene; membrane-associated glycoprotein.
SOURCE      D.discoideum, cDNA to mRNA.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; 
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 1545)
  AUTHORS   Siu,C.-H., Wong,L.M., Lam,T.Y., Kamboj,R.K., Choi,A. and Cho,A.
  TITLE     Molecular mechanisms of cell-cell interaction in Dictyostelium
            discoideum
  JOURNAL   Biochem. Cell Biol. 66, 1089-1099 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          1     1545     membrane-associated glycoprotein (gp80)
                                 precursor
    sigp          1       48     membrane-associated glycoprotein (gp80) signal
                                 peptide
    matp         49     1542     membrane-associated glycoprotein (gp80)
BASE COUNT      502 a    332 c    209 g    502 t
ORIGIN      
        1 atgaaatttt tattagtatt gataatatta tataatattt taaatagtgc acattcagct
       61 ccaacaataa cagctgtttc aaatggaaaa tttggtgttc caacatatat taccattaca
      121 ggtactggat ttacaggaac tccagttgta actattggtg gccagacctg tgatccagtt
      181 attgtagcca ataccgcatc gttacaatgc caattttctg ctcaattagc tccaggaaat
      241 tcaaattttg atgttattgt aaaggttggt ggtgtaccat ctacaggtgg taatggtctt
      301 tttaaatata cacctccaac tctttcaaca atatttccaa ataatggaag aattggtatg
      361 attttagttg atggaccatc caatatatct ggatacaaat taaatgtgaa cgactctatt
      421 aactctgcta tgttatctgt tactgctgat tcagtatccc caacaattta tttcctcgtg
      481 ccaaatacaa tcgctggtgg tctacttaat cttgaactca ttcaaccatt tggcttttca
      541 acaattgtaa cttccaaatc agtgttttct ccaaccatta catcaatcac cccattagct
      601 tttgatctca caccaaccaa tgtaaccgtc actggtaaat actttgttac tacagctagt
      661 gttacaatgg gaagtcatat ctatacagga ttgactgttc aagatgatgg aacaaattgt
      721 catgttattt ttactactcg ttcagtttat gaatcatcaa atactataac tgctaaagct
      781 tcaacaggtg tcgatatgat ttatttagac aatcaaggta atcaacaacc aataactttt
      841 acatataacc caccaaccat tacttcaaca aaacaagtca atgactctgt tgagatctca
      901 acaaccaata ctggtactga tttcactcaa atttctttaa ccatgggaac ctcaagccca
      961 acaaaccttg taatcactgg tacaaatgaa aagattgtta taactcttcc acatgctctt
     1021 ccagaaggtg aaattcaatt caatttgaaa gctggtatct caaatgttgt cacatcaact
     1081 ttattagtta ctccggttat aaatagtgtc actcaagcac ctcacaatgg tggaagtatt
     1141 acaatttcag gtatcttttt aaacaatgcc catgtttcga ttgttgttga ccaaaatact
     1201 actgatatag tttgtgctcc agattcaaat ggtgaatcaa tcatttgtcc agttgaagct
     1261 ggtagtggta ctattaattt agtcgttaca aactataaaa actttgcttc agatccaact
     1321 attaaaactg aagccacaac ctctacaacc tatacaattc cagacactcc aactccaact
     1381 gatacagcca ccccatctcc aactccaact gaaacagcca ccccatctcc aactccaaaa
     1441 ccaaccagca caccagaaga aactgaagca ccttcatcag caacaactct tatttcacca
     1501 ttatctttaa ttgttatttt catttctttt gttttattaa tttaa
//
LOCUS       ECOMANXF     1474 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  E.coli enzyme III-Man function protein (manX (ptsL)) gene,
            complete cds, and manY (pel) gene, 5' end.
ACCESSION   M36404
KEYWORDS    enzyme III-Man function protein; manX gene; manY gene; pel gene;
            ptsL gene.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1474)
  AUTHORS   Saris,P.E.J., Liljestroem,P. and Palva,E.T.
  TITLE     Nucleotide sequence of manX (ptsL) encoding the enzyme III-Man
            (II-a-Man) function in the phosphotransferase system of Escherichia
            coli K-12
  JOURNAL   FEMS Microbiol. Lett. 49, 69-73 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        258     1205     enzyme III-Man function protein (manX (ptsL))
    pept       1268  >  1474     manY (pel) gene product
    mRNA        120  >  1474     manXYZ operon mRNA (5' end put.)
BASE COUNT      411 a    319 c    376 g    368 t
ORIGIN      
        1 cctttgcaaa cgaatgtgac aaggatattt tacctttcga aatttctgct aatcgaaagt
       61 taaattacgg atcttcatca cataaaataa ttttttcgat atctaaaata aatcgcgaaa
      121 cgcaggggtt tttggttgta gcccttatct gaatcgattc gattgtggac gacgattcaa
      181 aaatacatct ggcacgttga ggtgttaacg ataataaagg aggtagcaag tgaccattgc
      241 tattgttata ggcacacatg ggttggggct gcagagcagg ttgcttaaaa cggcagaaag
      301 tgctgttagg cgagcaggaa aacgtcggct ggatcaattt cgttccaggt gaaaatgccg
      361 aaacgctgat tgaaaagtac aacgctcagt tggcaaaact cgacaccact aaaggcgtgc
      421 tgtttctcgt tgatacatgg ggaggcagcc cgttcaatgc tgccagccgc attgtcgtcg
      481 acaaagagca ttatgaagtc attgcaggcg ttaacattcc aatgctcgtg gaaaggttaa
      541 tggcccgtga tgatgaccca agctttgatg aactggtggc actggcagta gaaacaggcc
      601 gtgaaggcgt gaaagcactg aaagccaaac cggttgaaaa agccgcgcca gcacccggtg
      661 ccgcagcacc aaaagcggct ccaactccgg caaaaccaat ggggccaaac gactacatgg
      721 ttattggcct tgcgcgtatc gacgaccgtc tgattcacgg tcaggtcgcc acccgctgga
      781 ccaaagaaac caatgtctcc cgtattattg ttgttagtga tgaagtggct gcggataccg
      841 ttcgtaagac actgctcacc caggttgcac ctccgggcgt aacagcacac gtagttgatg
      901 ttgccaaaat gattcgcgtc tacaacaacc cgaaatatgc tggcgaacgc gtaatgctgt
      961 tatttaccaa cccaacagat gtagagcgtc tcgttgaagg cggcgtgaaa atcacctctg
     1021 ttaacgtcgg tggtatggca ttccgtcagg gtaaaaccca ggtgaataac gcggtttcgg
     1081 ttgatgaaaa agatatcgag gcgttcaaga aactgaatgc gcgcggtatt gagctggaag
     1141 tccgtaaggt ttccaccgat ccgaaactga aaatgatgga tctgatcagc aaaatcgata
     1201 agtaacgtat tgtgttgatt atcactcagt tttcacactt aagtcttacg taaacaggag
     1261 aagtacaatg gagattacca ctcttcaaat tgtgctggta tttatcgtag cctgtatcgc
     1321 aggtatggga tcaatcctcg atgaatttca gtttcaccgt cctctaatcg cgtgtaccct
     1381 ggtgggctat cgttcttggg gatatgaaaa ccggtattat tatcggtggt acgctggaaa
     1441 tgatcgcgct gggctggatg aacatcggtg ctgc
//
LOCUS       FSCCKPA      1428 bp ss-mRNA            VRT       15-AUG-1990
DEFINITION  T.californica creatine kinase mRNA, complete cds.
ACCESSION   M36427
KEYWORDS    creatine kinase.
SOURCE      T.californica electric organ, cDNA to mRNA, clone CK52g8.
  ORGANISM  Torpedo californica
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; 
            Chondrichthyes; Elasmobranchii; Euselachii; Neoselachii; 
            Squalomorphii; Torpediniformes; Torpedinoidea; Torpedinidae.
REFERENCE   1  (bases 1 to 1428)
  AUTHORS   West,B.L., Babbitt,P.C., Mendez,B. and Baxter,J.D.
  TITLE     Creatine kinase protein sequence encoded by a cDNA made from
            Torpedo californica electric organ mRNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 7007-7011 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         90     1235     creatine kinase (E.C. 2.7.3.2)
BASE COUNT      348 a    398 c    394 g    288 t
ORIGIN      
        1 ggtcacccac accagcggta gttccagcac caagcaggac aaggtccaga gtggttcacc
       61 gtgcgccagg agtcagccaa cctccaacca tgcctttcgg aaacactcac aataaatgga
      121 agctgaacta ttcggcggcg gaagaattcc ccgacctcag caagcacaac aaccacatgg
      181 ccaaggcttt aaccctggac atctacaaga aacttcggga caaggagact ccaagtggct
      241 tcaccctcga tgatatcatc cagacaggag tggacaaccc aggtcacccc ttcatcatga
      301 ccgtgggctg cgtggctggc gatgaggaat gctacgaggt tttcaaggac ctgttcgatc
      361 ccgtcattga ggaccgccac ggtggctaca aaccaactga caagcacaag actgacctga
      421 accaggagaa cctgaagggc ggcgatgacc tcgacccgaa ttacgtcctg agcagccggg
      481 tgcgcactgg ccgcagcatc aagggcatcg ccctgcctcc tcactgcagc cgcggggagc
      541 gccgtctggt tgagaagctc tgcatagacg gtctcgccac cttgacgggc gagttccagg
      601 gcaagtacta ccccctctcc tccatgtctg atgcagagca gcagcagctg atcgatgacc
      661 acttcctgtt tgacaaaccc atctctcctc tgcttctcgc ctctggcatg gctcgggact
      721 ggcccgatgg ccggggcatt tggcataaca acgacaagac cttcctggtc tgggtcaacg
      781 aggaggacca cctccgagtc atctcgatgc agaaaggtgg caacatgaag gaggtcttca
      841 ggcgcttctg cgttggtctg aagaagatcg aggacatttt cgtgaaggct ggccgtggct
      901 tcatgtggaa cgagcacctg ggctacgtcc tgacctgccc gtccaacctg ggcactggcc
      961 tccgtggtgg tgtccacgtg aaaatccctc acctctgcaa gcacgagaag ttcagcgagg
     1021 tcctcaagag aacgaggctg cagaaacgtg ggacaggtgg agtggatacc gcagcggttg
     1081 gcagcatcta tgacatctcc aacgccgacc gtctgggctt ctccgaggtg gaacaggtcc
     1141 agatggtggt ggacggtgtg aagctgatgg tcgagatgga gaagaggctg gaaaatggga
     1201 aaagcatcga tgacctgatg ccggctcaga agtagacctt gggttggctg ggtgcctgcc
     1261 actctgagat gccttgaaat atcacaggtc gcgaactttg aactttccca ctccaatctt
     1321 tcttggccac agatctcgtg tctcaaatga ggaagcagaa ggtttggttt catcacattc
     1381 agatttgcta gacacaattt taaccttgat gacacattaa taaaatat
//
LOCUS       HUMLBPP2A    1541 bp ss-mRNA            PRI       15-AUG-1990
DEFINITION  Human phosphatase 2A-beta catalytic subunit mRNA, complete cds.
ACCESSION   M36511
KEYWORDS    phosphatase 2A-beta catalytic subunit.
SOURCE      Human lung fibroblast, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1541)
  AUTHORS   Hemmings,B.A., Wernet,W., Mayer,R., Maurer,F., Hofsteenge,J. and
            Stone,S.R.
  TITLE     The nucleotide sequence of the cDNA encoding the human lung protein
            phosphatase 2A-beta catalytic subunit
  JOURNAL   Nucleic Acids Res. 16, 11366-11366 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         22      951     phosphatase 2A-beta catalytic subunit
BASE COUNT      436 a    296 c    327 g    482 t
ORIGIN      
        1 ccgagcccca gcccggccgc catggacgac aaggcgttca ccaaggagct ggaccagtgg
       61 gtcgagcagc tgaacgagtg taagcagctg aacgagaacc aagtgcggac gctgtgcgag
      121 aaggcaaagg aaattttaac aaaagaatca aatgtgcaag aggttcgttg ccctgttact
      181 gtctgtggag atgtgcatgg tcaatttcat gatcttatgg aactctttag aattggtgga
      241 aaatcaccgg atacaaacta cttattcatg ggtgactatg tagacagagg atattattca
      301 gtggagactg tgactcttct tgtagcatta aaggtgcgtt atccagaacg cattacaata
      361 ttgagaggaa atcacgaaag ccgacaaatt acccaagtat atggctttta tgatgaatgt
      421 ctgcgaaagt atgggaatgc caacgtttgg aaatatttta cagatctctt tgattatctt
      481 ccacttacag ctttagtaga tggacagata ttctgcctcc atggtggcct ctctccatcc
      541 atagacacac tggatcatat aagagccctg gatcgtttac aggaagttcc acatgagggc
      601 ccaatgtgtg atctgttatg gtcagatcca gatgatcgtg gtggatgggg tatttcacca
      661 cgtggtgctg gctacacatt tggacaagac atttctgaaa cctttaacca tgccaatggt
      721 ctcacactgg tttctcgtgc ccaccagctt gtaatggagg gatacaattg gtgtcatgat
      781 cggaatgtgg ttaccatttt cagtgcaccc aattactgtt atcgttgtgg gaaccaggct
      841 gctatcatgg aattagatga cactttaaaa tattccttcc ttcaatttga cccggcgcct
      901 cgtcgtggtg agcctcatgt tacacggcgc accccagact acttcctata aatttctcct
      961 gggaaacctg cctttgtatg tggaagtata cctggctttt taaaatatat gtatttaaaa
     1021 acaaaaagca acagtaatct atgtgtttct gtaacaaatt gggatctgtc ttggcattaa
     1081 accacatcat ggaccaaatg tgccatacta atgatgagca tttagcacaa tttgagactg
     1141 aaatttagta cactatgttc tagataggtc agtctaacag tttgcctgct gtatttatag
     1201 taaccatttt cctttggact gttcaagcaa aaaaggtaac taactgcttc atctcctttt
     1261 gcgcttattt ggaaatttta gttatagtgt ttaactggca tggattaata gagttggagt
     1321 tttattttta agaaaaattc acaagctaac ttccactaat ccattatcct ttattttatt
     1381 gaaatgtata attaacttaa ctgaagaaaa ggttcttctt gggagtatgt tgtcataaca
     1441 tttaaagaga tttcccttca tttaaactaa attactgttt tatgttgatc tgcatatttc
     1501 tgtatatttg tcatgacagt gcttgcatcc tatttggtgt g
//
LOCUS       HUMPDEGA      978 bp ss-mRNA            PRI       15-AUG-1990
DEFINITION  Human cGMP phosphodiesterase gamma-subunit (PDEG) mRNA,
            complete cds.
ACCESSION   M36476
KEYWORDS    cGMP phosphodiesterase gamma-subunit.
SOURCE      Human retina, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 978)
  AUTHORS   Tuteja,N., Danciger,M., Klisak,I., Tuteja,R., Inana,G.,
            Mohandas,T., Sparkes,R.S. and Farber,D.B.
  TITLE     Isolation and characterization of cDNA encoding the gamma-subunit
            of cGMP phosphodiesterase in human retina
  JOURNAL   Gene 88, 227-232 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        102      365     cGMP phosphodiesterase gamma-subunit (PDEG)
    mRNA    <     1      978     PDEG mRNA
BASE COUNT      213 a    341 c    257 g    167 t
ORIGIN      
        1 ccgcactcac agcacagccc cctgagaccc gccctgcact tgaccgcagc aggagggagt
       61 ccaggagcca aggttgccgc ggtgtctccg tcagcctcac catgaacctg gaaccgccca
      121 aggctgagtt ccggtcagcc accagggtgg ccgggggacc tgtcaccccc aggaaagggc
      181 cccctaaatt taagcagcga cagaccaggc agttcaagag caagccccca aagaaaggcg
      241 ttcaagggtt tggggacgac atccctggaa tggaaggcct gggaacagac atcacagtca
      301 tctgcccttg ggaggccttc aaccacctgg agctgcacga gctggcccaa tatggcatca
      361 tctagcacga ggcccctgct gaagtccaga ccctccccct cctgcccact atgctaaacc
      421 ctgctcagga ttcctgttga ggagatgacc tccctagccc cagatggcac ctggacacca
      481 ggatgggact gcaacctcag gtctccccct acatattaat accagtcacc aggagcccac
      541 cacctccctc taggatgccc cctcagggtg gccaggccct gctcaacatc tggagacaca
      601 ggcccacccc tcagtcctgc ccacagagag gcttggtcgg tctccactcc cagggagaac
      661 gggaagtgga ccccagcccg ggagcctgct ggaccccaga tcgtcccctc ctcccagctg
      721 gaaagctagg gcaggtctcc ccagagtgct tctgcacccc agccccctgt cctgcctgta
      781 aggggataca gagaagctcc ccgtctctgc atcccttccc aggggggtgc ccttagtttg
      841 gacatgctgg gtagcaggac tccagggcgt gcacggtgag cagatgaggc cccaagctca
      901 tcacaccagg gggccatcct tctcaataca gcccgccctt gcagtcccta tttcaaaata
      961 aaattagtgt gtccttgc
//
LOCUS       HUMSON3A     1449 bp ds-DNA             PRI       15-AUG-1990
DEFINITION  Human son3 protein gene, partial cds.
ACCESSION   M36428
KEYWORDS    son3 protein.
SOURCE      Human placenta DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1449)
  AUTHORS   Berdichevskii,F.B., Chumakov,I.M. and Kiselev,L.L.
  TITLE     Determination of the nucleotide sequence of the son3 fragment of
            the human genome: Identification of a new protein with an unusual
            structure and homology with DNA-binding proteins
  JOURNAL   Mol. Biol. 22, 639-646 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >  1449     son3 protein (AA at 1)
BASE COUNT      487 a    348 c    329 g    285 t
ORIGIN      
        1 cgggctctgc tcagccctaa agaaagtagt ggaggagaaa aagaagtacc tccccctcct
       61 aaagagacac tgcctgattc aggattttct gccaatattg aggatattaa tgaagcagat
      121 ttagtgagac cgttacttcc taaggacatg gaacgtctta caagccttag agctggcatt
      181 gaaggacctt tacttgcaag tgatgttgga cgtgacagat ctgctgccag cccggttgta
      241 agtagtatgc cagaaagagc ttcagagtct tcttcagagg aaaaagatga ttatgaaatt
      301 tttgtaaaag ttaaggacac tcacgaaaaa agcaagaaaa ataagaaccg tgataagggg
      361 gagaaagaga agaaaagaga tcctcattta agatctcgaa gtaagcgttc caaatcttct
      421 gaacacaaat cacgcaagcg taccagtgaa tctcgttcta gggcaagaaa gagatcatct
      481 aagtccaagt ctcatcgctc tcagacacgt tcacggtcac gttcaagacg caggaggaga
      541 agcagcagat caagatcaaa gtctagagga agaagatctg tatcaaaaga gaagcgcaaa
      601 agatctccaa agcacagatc caagtctagg gaaagaaaaa gaaaaagatc aagctccagg
      661 gataaccgaa agacagttag agctcgaagt cgaaccccaa gtcgtcggag tcggagtcat
      721 actccaagtc gtcgacgaag gtctagatct gtgggtagaa gaaggagctt tagcatttcc
      781 ccaagccgcc gcagccgcac ccccagccgc cgcagccgca cccccagccg ccgcagccgc
      841 acccccagcc gccgcagccg cacccccagc cgccggagcc gcacccctag ccgtcggagc
      901 cgcaccccaa gccgccggag aagatcaagg tctgtggtaa gaagacgaag cttcagtatc
      961 tcaccagtca gattaaggcg atcaagaaca cccttaagaa gaaggtttag cagatctccc
     1021 atccgtcgta aaagatccag gtcttctgaa cgaggcagat cacccaaacg tctgacagat
     1081 ttggataagg ctcaattact tgaaatagcc aaagctaatg cagctgccat gtgtgctaag
     1141 gctggtgtcc ctttaccacc aaacctaaag cctgcacctc cacctactat agaagagaaa
     1201 gttgctaaaa agtcaggagg agctactata gaagaactaa ctgagaaatg taaacagatc
     1261 gcacagagta aagaagatga tgatgtaata gtgaataaac ctcatgtttc ggatgaagag
     1321 gaagaagaac ctccttttta tcatcatccc tttaaactca gtgaacccaa acctattttt
     1381 ttcaatctga atattgctgc agcaaaacca actccaccaa aaagccaggt aacattaaca
     1441 aaagaattc
//
LOCUS       MYXGFA       2269 bp ds-DNA             VRL       15-AUG-1990
DEFINITION  Myxoma virus growth factor and M-T9 genes, complete cds.
ACCESSION   M15806 M35234
KEYWORDS    M-T9 gene product; growth factor.
SOURCE      Myxoma virus (strain Lausanne) DNA, clone pMYH-1.
  ORGANISM  Myxoma virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Leporipoxvirus.
REFERENCE   1  (bases 1 to 1421)
  AUTHORS   Upton,C., Macen,J.L. and McFadden,G.
  TITLE     Mapping and sequencing of a gene form myxoma virus that is related
            to those encoding epidermal growth factor and transforming growth
            factor alpha
  JOURNAL   J. Virol. 61, 1271-1275 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 584 to 2269)
  AUTHORS   Upton,C., Macen,J.L., Wishart,D.S. and McFadden,G.
  TITLE     Myxoma virus and malignant rabbit fibroma virus encode a serpi
            protein important for virus virulence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Computer-readable sequence for [1] kindly provided by C.Upton,
            09-MAY-1987.  Draft entry and computer-readable sequence for [2]
            kindly submitted by C.Upton, 14-JUN-1990.
            
              Author address: C.Upton
              University of Alberta
              Dept of Biochemistry
              471 Med Sci Bldg
              Edmonton
              Alberta, CANADA T6G 2H7
              email: USERCU11@ualtamts
            
FEATURES       from  to/span     description
    pept        204      461     growth factor
    pept        717     2246     M9-R gene product
BASE COUNT      685 a    441 c    540 g    603 t
ORIGIN      1239 bp upstram of DdeI site; about 13 kb from 3' viral end.
        1 ttaaacaaga tacaacatac ggacgcggct atgttctcgg aagtcataga cggtattgtc
       61 gcggaagaac agcaggtgat tggatttatt cagaaaaaat gtaaatataa cacgacatac
      121 tacaatgtac gtagcggcgg gtgtaaaata tccgtctatc taaccgcggc agttgttggc
      181 tttgtcgcat acggaatact aaaatggtac cgagggacct agtcgcaact ctcttatgtg
      241 cgatgtgtat tgtacaggca acgatgcctt cgttggataa ttatctgtat attattaaac
      301 gtattaaact atgtaacgac gactataaaa actattgtct aaataacgga acctgtttca
      361 ccgtagcatt aaacaatgtt tcacttaacc cgttttgtgc gtgtcatatt aactacgtgg
      421 gaagccgatg tcagtttatt aatctaatta ccattaagta acccgtttta catgtataat
      481 aatacatacg tatttttaga taactttaat aaataacatt gtataaactt acttatcata
      541 tacggtacac ataacgaata acactacatg tttttatata tacataggtt tggaaaaaac
      601 ttaatcacga acgtatcatt agacaatgac tccatctagg aggggttttg ggaactacgt
      661 acacgatata ttcacatcgc gaaaacataa ataataattt tttacaacga ttcacgatgt
      721 cgcgcacttt attgagattt ctggaagatg gtgcaatgag cgacgtaaca gtcgtcgccg
      781 gggactcgac gtttctcggg cataaagtta ttttatctct tcactcggat tacttctatc
      841 gtctgtttaa tggagacttt acctcgcccg atacggttac gctggacgcg acggacgatg
      901 ccgttcgtac ggtgtttacg tatatgtacg cgggatgtga cgggttaaac gatcgtacga
      961 tagacgattt acaatccatt atcgtattgg cggactacct gggtataacg aaactggtgg
     1021 acgaatgcgt acgtcgtatc gtatctaaag tggacgtatt aaactgcgta ggggtatata
     1081 cgtttgcgga gacgtatcat ataacggact tgcagcgggc ggccaaaacg tttttaacag
     1141 aactactggg gtctaaagaa gcgttcgaag aactatccca agacgatgcg gttatcgcgt
     1201 taagggaaac gcgtaacatt gtcgatagac gatccattct tagagcgatc ctgttatggg
     1261 ttcgaaaatg tccagatcgt atcgaacaac taaaggtgtt agtcgccgcc gtagacgacg
     1321 tagacgacga tgacaacgta tatacgatct acgagagata cgctgaagaa ctaaaggata
     1381 tgatcgcgtg tccattatcc tataattgcg tcgttgtggt cgacagagat agatacgttc
     1441 gcctcattaa cccagacacc ctatggagta aacgcgtgac gtacatacgt aaacgcgcca
     1501 taggcgatcg attcaccgtc gtttgtatga acaacgttct atactgttta gggggtacgt
     1561 tagacggggc acccacgtgt gacgtgttgg cctacgatct actgacgaac gaatacagtt
     1621 taatgccgga gatgggacac tatagacgta atgcgtcggc gtgtatcgta aatggatata
     1681 tatacgtcgt aggaggcgta gacgaagaaa acagattaat cggttccgta gagtactggc
     1741 aacccggaat ggaggaatgg cacgacgctc cttatctaca ggcgaacgta gaaacggcta
     1801 cggtgtgtta caggaacgag ttgtggatcg taggaggcac cgtggactta tatcatccca
     1861 cgtttataag cgcagttaag aaattaacag acaatcgatg gatgtcgatg gaacctcttc
     1921 ccgaaccacg atcgggtgct acgaccgtcg tgtataataa tcgattatac tgcataggcg
     1981 gaaggataca cggtggcgcg tacacaaatc acgtctacaa ctatttagac gagtcacgta
     2041 cgtgggaacg ggtaggggat atggcgaacg tacgcagaaa tcccagttgt tgtgtgtaca
     2101 ataaggcgat ttacgtattg ggagggaata caaacgccgt agagaaatac aacgggtgga
     2161 agtggcaaga ggtaggtaat atatccacgt atcccgcgtg taataatacc gcgtatccat
     2221 ttttttatac caacgacgag atataaaacg agtatgatat acaagtcgt
//
LOCUS       MYXMAP1A     2204 bp ds-DNA             VRL       15-AUG-1990
DEFINITION  Myxoma virus MAP1 gene, complete cds, and M-T8 gene, 5' end.
ACCESSION   M35233
KEYWORDS    M-T8 gene product; MAP1 gene product serpi protein.
SOURCE      Myxoma virus (strain Lausanne) DNA, clone pBU-3.
  ORGANISM  Myxoma virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Leporipoxvirus.
REFERENCE   1  (bases 1 to 2204)
  AUTHORS   Upton,C., Macen,J.L., Wishart,D.S. and McFadden,G.
  TITLE     Myxoma virus and malignant rabbit fibroma virus encode a serpi
            protein important for virus virulence
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.Upton, 14-JUN-1990.
            
              Author address: C.Upton
              University of Alberta
              Dept of Biochemistry
              471 Med Sci Bldg
              Edmonton
              Alberta, CANADA T6G 2H7
              email: USERCU11@ualtamts
            
FEATURES       from  to/span     description
    pept        363     1472     MAP1 gene product
    pept       1450  >  2204     M-T8 gene product
BASE COUNT      592 a    520 c    569 g    523 t
ORIGIN      
        1 ggatccgtaa caacacgtgt gtcgtagcgt atacataatg ccgtaaatga cagtcataaa
       61 accatcgagt cgtcccaggc cgaggaaaaa caaaaatata aaagtaaata catacagaac
      121 gagcgccatg gatctctctc cgggaagtgt ccacgagggt atcgtatatt ttaaagacgg
      181 aatattcaaa gtccgcctac tcggatacga gggacacgag tgtattcttt tggactatct
      241 gaactacagg caagacacgt tggatcggtt gaaggaacga ctcgtgggac gcgtgattaa
      301 aacgcgagtc gttcgcgcgg acggtttata cgtggacctg cgacgttttt tttgagggtt
      361 aaatgaagta tctggtcctc gtcttatgtt taacgtcgtg cgcgtgtcga gatatcggac
      421 tatggacgtt ccgatacgtc tacaacgaaa gcgacaacgt cgtgttctca ccgtacggct
      481 tgacctccgc gttgtccgtg ttacggatcg cggcgggcgg taacacgaaa cgagaaatag
      541 acgtccccga atccgtcgtg gaggactccg acgcctttct cgcgttacgg gagttgttcg
      601 tagacgcatc cgttccgtta cgtcccgagt ttacggcgga gttctcctcg cgattcaata
      661 cctccgtgca acgcgtgacg tttaactcgg agaacgtcaa agacgtcatt aactcgtacg
      721 ttaaggataa gacgggagga gacgtcccac gcgtattgga cgcctcccta gaccgagata
      781 ctaaaatgct gctattgagc tccgttcgta tgaagacgag ctggagacac gtattcgacc
      841 cttcgttcac gacggatcaa cctttttatt ccggaaacgt cacatacaag gtacgtatga
      901 tgaataaaat agatacgttg aaaacggaga cgtttacgct tagaaacgtg ggatactccg
      961 taacggaact gccgtataaa cggcgtcaaa cggccatgtt gctcgtcgtt ccggacgact
     1021 tgggagagat cgtgcgggcc ctcgatcttt ctctagtacg cttctggata cgcaacatga
     1081 ggaaagacgt gtgtcaggtg gtaatgccca agttctccgt cgaatcggtc ctggatctga
     1141 gggacgccct ccagagactg ggggtgcgag acgcgttcga tccatcccgg gcggacttcg
     1201 gtcaggcgtc cccgtcgaac gatctatacg tcacgaaggt gttacagacg tccaagatag
     1261 aggcggacga acggggaacg acggcgtcga gcgacacagc catcaccctc atccccagga
     1321 acgccctcac ggcgatcgtg gcgaacaaac cgtttatgtt tctcatctat cacaagccta
     1381 caacgaccgt gttgtttatg ggaacgataa caaagggtga aaaagtaata tacgatacgg
     1441 agggtcgaga tgatgtcgta tcctctgtat aaactctttt tgaagggtaa actatgcgac
     1501 gtcgaaatcg tcgcggaagg caaaagcatc cgagcgcatc ggttggtgct ttccgcgtat
     1561 tctaaatact tttacaactt gtttaatggg aatttcttag aaaaaaacgt agacgtaatc
     1621 gacttagaag cggattataa aaccgtattt gacgtgattt attacatgta tacagaatcg
     1681 atagaattac acaaagggaa taccgaatcc attttctcat tggttcatta cctacagatt
     1741 aaacccctga ttaaaaaatg tatctacgag tttaacagca tcgtgaacga agaaaactgt
     1801 atacgtctgt ttaagttcgc cgaattatac gacctgtccg agttgaaacg cagggcgcga
     1861 tggcttatgc ccagtctcgt tatgaatgag aaagatcgcc tgcgggagat gtccttggac
     1921 gacctatccc tgatgttagt ccagatacgg aacacggtcg atcgaagtat cgctttgtcg
     1981 gcgatcacgg aatggataca gacaaacgtt cgcgaacgta ggagacacgc cgtccatctg
     2041 gcgacgtgtt taggggatgt cccaggaacc gcatcctcca gagccgtata caaacactac
     2101 atgtcggaac tacgtattcg ggttacggaa tttcaaccgg cgtatcacaa ctgcgtcgtg
     2161 tacctgggag gatcgatgaa aggtcgcgtc accgccctgg atcc
//
LOCUS       MZEMT2BATP   2054 bp ss-mRNA            ORG       15-AUG-1990
DEFINITION  Maize mitochondrial F-1-ATPase subunit-2 mRNA, complete cds.
ACCESSION   M36087
KEYWORDS    ATPase subunit-2.
SOURCE      Maize (inbred line A188) embryo kernel mitochondrion, cDNA to mRNA.
  ORGANISM  Mitochondrion Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE   1  (bases 1 to 2054)
  AUTHORS   Ehrenshaft,M. and Brambl,R.
  TITLE     Respiration and mitochondrial biogenesis in germinating embryos of
            maize
  JOURNAL   Plant Physiol. 93, 295-304 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          6     1667     F-1-ATPase subunit-2
BASE COUNT      421 a    539 c    558 g    536 t
ORIGIN      
        1 cggccatggc gtcccgccgg gtcgtctcct cgctcctccg ctccgcgtcc cgcctgcggg
       61 ccgcctcgcc cgctgctcca cgaccgcgcg cgccaccgca ccgcccgtcc ccggccgggt
      121 acctcttcaa ccgcgctgcc gcctacgcct cttccgccgc ggcccaggcg gcacctgcca
      181 ccccgccgcc ggccaccggg aagaccgggg ggggcaagat caccgacgag ttcaccggcg
      241 ctggcgccat cggccaggtg tgccaggtga tcggcgccgt cgttgacgtg cgcttcgatg
      301 agggcctccc gcccatcctc acggcgctcg aggtgctcga caacaacatc cgcctcgtgc
      361 tcgaggtggc gcagcacctt ggcgagaaca tggtgcgcac catcgctatg gacggcacgg
      421 aggggctcgt ccgcggccag cgcgtcctca acactggctc ccccatcacc gtgcctgttg
      481 gcagggctac ccttggacgc atcataaatg ttattggtga accgattgat gagaagggtg
      541 acataaagac aaaccacttc ctccctattc atcgtgaagc ccctgctttt gttgagcagg
      601 ccactgagca gcaaattctt gttactggaa tcaaggtcgt ggatcttctt gcaccctacc
      661 aaaggggtgg aaagattggt ctcttcggtg gtgcaggagt gggtaaaact gtgctcatta
      721 tggagttgat caacaatgtt gctaaggccc atggtggttt ctctgtgttt gctggtgttg
      781 gagaacgtac ccgtgaaggt aatgatctgt acagggaaat gattgaaagt ggtgtcatta
      841 agctagatga caagcagagc gaaagcaagt gtgctcttgt ttacgggcag atgaatgagc
      901 ccccgggtgc tcgtgctcgt gttgggttga ctggtttgac tgttgctgaa catttccgtg
      961 atgctgaagg acaagatgtg cttctgttta ttgacaacat tttccgtttt actcaggcaa
     1021 actctgaggt gtctgctctt cttggacgta tcccatctgc tgtgggatac cagccaaccc
     1081 ttgccactga tcttggagga ctgcaagagc gtattacgac aacaaagaag ggttctatta
     1141 catctgtgca ggccatctac gtgcctgccg atgacttgac ggatcctgct cctgctacta
     1201 cctttgccca tcttgatgct acaactgtgt tgtcacgaca gatctctgag cttggtattt
     1261 atcctgctgt tgatccactg gattccacat caagaatgct ttctccccac gtgctgggtg
     1321 aggatcacta caacactgct cgtggtgtgc agaaggttct tcagaactac aaaaatcttc
     1381 aggatattat tgctatcttg ggtatggatg agctcagtga ggatgacaag ctgacagtcg
     1441 cccgtgcaag aaagattcag cgtttcctga gccagccttt ccatgtcgct gaagttttca
     1501 cgggtgctcc aggaaagtat gtggagctga aggaaagcgt gaagagtttc cagggtgttt
     1561 tggatgggaa gtatgatgac ctccctgagc agtcattcta catggttggt ggcattgagg
     1621 aagtcattgc taaggctgag aaaattgcca aggagtctgc ttcataagga ggcttcttgc
     1681 ttgttcaacc ctgtacaagt tccatttttg gattttaagc gtttatttat gcttttccca
     1741 gttaggcatg acgagctgga gagtccatct cctgctgaga gatgtttgtt ttacccttct
     1801 ttgcttcctc caccttacac ccaaataagc aactgcagtg ccgttggttt tggctgcacc
     1861 caaactacat gactgaagaa acttgtggcc tgtgtaacgc gaatccatca gaacgccaaa
     1921 gttatggctt ctggttgtgg caaattatgg ttcctccctg ttcggttgag tggttgcatt
     1981 ctggaggtat tgttctggac tcaggctaat gattgtgcgt gcaactgttt cggagtcatt
     2041 tcaaagggtt atcc
//
LOCUS       PFAMTSSU      935 bp ds-DNA             ORG       15-AUG-1990
DEFINITION  P.falciparum mitochondrial small subunit rRNA gene.
ACCESSION   M23443
KEYWORDS    small subunit ribosomal RNA.
SOURCE      P.falciparum (strain C10) mitochondrial DNA.
  ORGANISM  Mitochondrion Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea; 
            Microsporida; Haemosporina; Plasmodiidae; Plasmodium falciparum.
REFERENCE   1  (bases 1 to 935)
  AUTHORS   Gardner,M.J., Bates,P.A., Ling,I.T., Moore,D.J., McCready,S.,
            Gunasekera,M.B.R., Wilson,R.J.M. and Williamson,D.H.
  TITLE     Mitochondrial DNA of the human malarial parasite Plasmodium
            falciparum
  JOURNAL   Mol. Biochem. Parasitol. 31, 11-18 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1  >   935     small subunit ribosomal RNA
BASE COUNT      376 a     91 c    132 g    336 t
ORIGIN      
        1 aagcttgata aagtaatatt tcttttagga agacagtatt attaaaatat tgtaaacttt
       61 ttattttatt tttaaatatt gataaaaata aaaaatagta tttgctattt tctgtgccag
      121 cagcagcggt aatacagaaa tgcaagcgtt attcatttta ttaggcgtaa agcgttttaa
      181 ggttttatat taattttatg tttaaatatt taaattaaat ttaaaataaa ttaataaata
      241 ataatataat agagtattat aaaagtatta agaatttttt gagaagtagt gaaatacaat
      301 gatacaaaaa agaatatcaa aggcggaagc ataatactat ataattactg acacttaaaa
      361 acgaaagcta aggtagcaaa taggattaga taccctagta gtcttagctg taaactatga
      421 atattttata tttatatttt ataaatataa taactaacgt gataaatatt ccgcctgagt
      481 agtatattcg caagaatgaa attcaaagga attgacggga gcttatacaa gtggtggaac
      541 atgtggctta attcgatgca acacgataaa ccttaccaaa atttaacaat atttttaata
      601 ttaagaaatt aatattttaa taaaatatat aggtagtgca tggctgtcgt cagttcgtgc
      661 tgtgaagtgt taattttagt attataacga acgtaacctt ttataaaaaa aatttttata
      721 ataaataata ataaagatta cgtcaagtca ttatgctcct tatattttgg gctgctcacg
      781 tgttacataa aatattacaa tattttatta tatgttaaat ataataatta aaatatattt
      841 atagttcaga ttataaattg aaactcattt atataaagat ggaatcacta gtaatcgcta
      901 atcagaatta tagcggtgaa taagttctta agctt
//
LOCUS       PSEALGR3A     120 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  P.aeruginosa alginate synthesis regulatory protein (algR3) gene,
            5' end.
ACCESSION   M35259
KEYWORDS    alginate synthesis regulatory protein.
SOURCE      P.aeruginosa (strain 8882) DNA.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 120)
  AUTHORS   Kato,J., Misra,T.K. and Chakrabarty,A.M.
  TITLE     AlgR3, a protein resembling eukaryotic histone H1, regulates
            alginate synthesis in Pseudomonas aeruginosa
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2887-2891 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         70  >   120     alginate synthesis regulatory protein (algR3)
    mRNA         45  >   120     algR3 mRNA
BASE COUNT       24 a     39 c     37 g     20 t
ORIGIN      
        1 cgaacccgtt ggcgagaggg ggtttgcggg tctagtatgg gcgcaaccac gtccgcctgg
       61 aggcacgtca tgtcggccaa caagaagccc gtcaccaccc ccttgcacct gttgcagcaa
//
LOCUS       STYOMPH       992 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  S.typhimurium cationic 16 kD outer membrane protein (ompH) gene,
            complete cds.
ACCESSION   J05101 M36486
KEYWORDS    ompH gene; outer membrane protein.
SOURCE      S.typhimurium (strain LT2 subline, isolate SH5014) DNA, clones
            pUCHS[14,16].
  ORGANISM  Salmonella typhimurium
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 992)
  AUTHORS   Koski,P., Rhen,M., Kantele,J. and Vaara,M.
  TITLE     Isolation, cloning, and primary structure of a cationic 16 kDa
            outer membrane protein of Salmonella typhimurium
  JOURNAL   J. Biol. Chem. 264, 18973-18980 (1989)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 992)
  AUTHORS   Koski,P., Hirvas,L. and Vaara,M.
  TITLE     Complete sequence of the ompH gene encoding the 16-kDa cationic
            outer membrane protein of Salmonella typhimurium
  JOURNAL   Gene 88, 117-120 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.S.Vaara 02-SEP-1989.
FEATURES       from  to/span     description
    pept        311      796     cationic outer membrane protein precursor (gtg
                                 start codon)
    sigp        311      370     cationic outer membrane protein signal peptide
    matp        371      793     cationic outer membrane protein
    signal      142      147     -35 region
    signal      165      170     -10 region
    signal      854      879     transcription termination signal
    binding     293      305     ribosome binding site
BASE COUNT      281 a    224 c    260 g    227 t
ORIGIN      334 bp upstream of PstI site.
        1 gatccgtcat ctgcgccgtc agatgtaccg gattacagcg atccaggcaa catccgtatg
       61 tccgcgggta tcgcattaca atggatgtcc cattggggcc gttggtcttc tcctacgccc
      121 agccgtttaa aaagtacgat ggagacaaag ccgagcagtt ccagtttaac attggtaaaa
      181 cctggtaatt gttcactgca aaggaatgca ttggtagtgt agcgatgact tttggcgatg
      241 cccccaggga tcgccaggcc acgcaaagag ctgtaccttc gggtgcaaat gggatggtaa
      301 ggagtttatt gtgaaaaagt ggttattagc tgcaggtctt ggtttggcga tggtaacgtc
      361 cgcacaggct gctgacaaaa ttgcaatcgt caacatgggt aatctgttcc aacaggttgc
      421 gcagaagacg ggtgtatcca atacactgga aaacgaattt aaaggccgtg cggctgaact
      481 gcaaaaaatg gaaaccgatc tgcaatctaa aatgcagcgt ctgcaatcca tgaaagcagg
      541 tagcgatcgt actaagctgg aaaaagacgt gatgtctcag cgccagactt tcgcacaaaa
      601 agcgcaggct tttgagaaag atcgcgctcg tcgttccaac gaagaacgca acaaactggt
      661 gactcgtatc cagactgcgg tgaaaaaagt ggctaacgac cagagtatcg atctggtggt
      721 agacgcaaac accgttgctt acaacagcag cgatgtgaaa gacatcaccg ctgacgtact
      781 gaaacaggtt aaataagtaa tgcccttcaa ttcgactggc tgacttagca gaacagttgg
      841 atgcagaatt acacggtgat ggcgatatcg tcatcaccgg cgttgcgtcc atgcaatgtg
      901 caacaacagg ccacattacg tttatggtga atcctaagta ccgtgaacac ttaggtttat
      961 gccaggcttc tgcggttgtc atgacgcagg ac
//
LOCUS       SIVAGM155    9794 bp ds-DNA             VRL       15-AUG-1990
DEFINITION  Simian immunodeficiency virus (SIV), complete genome.
ACCESSION   M29975
KEYWORDS    .
SOURCE      Simian immunodeficiency virus (isolate 155) proviral DNA, clone 4.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9794)
  AUTHORS   Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
            Olmstead,R.A. and Hirsch,V.M.
  TITLE     Simian immunodeficiency viruses from African green monkeys display
            unusual genetic diversity
  JOURNAL   J. Virol. 64, 1086-92 (1990)
  STANDARD  full staff_entry
COMMENT     Kindly submitted prior to publication and in a computer-readable
            form by Phillip Johnson, Georgetown University, Rockville MD (301-
            496-2976). The 155 isolate is from a monkey imported from Kenya.
FEATURES       from  to/span     description
    pept        931     2493     gag polyprotein
    pept       2199     5342     pol polyprotein
    pept       5260     5958     vif protein
    pept       5741     6100     vpx protein
    pept       6051     6268     tat protein, exon 2 (first expressed exon)
               8492     8633     tat protein, exon 3 (AA at 8493)
    pept       6208     6268     rev protein, exon 2 (first expressed exon)
               8492     8700     rev protein, exon 3 (AA at 8494)
    pept       6275     8581     env polyprotein
    pept       8724     9416     nef protein
    LTR           1      726     5' LTR
    LTR        9070     9794     3' LTR
    rpt           1      625     R repeat 5' copy
    rpt        9578     9794     R repeat 3' copy
    binding     727      744     primer (Lys-tRNA) binding site
    signal     9675     9680     poly-A signal
BASE COUNT     3321 a   1905 c   2450 g   2118 t
ORIGIN      
        1 tggatgggat ttattactcc gataggagaa ataagatcct taatctgtat gccctcaatg
       61 aatggggaat cattgatgat tggaacgcat ggtcaaaagg acctgggata agatacccga
      121 ggtgctttgg cttctgcttc aagctagtac cggttgccct gcatgaggaa gcagaaacat
      181 gtgaaaggca ttgcttggta cacccagcac aactgcatga agaccctgat ggtataaatc
      241 atggagaaat attggcatgg aagtttgatc caatgttggc tgttcagtac gacccctcaa
      301 gggagtactt tacagactta tattcaacag ttggtacagg aaactagccg accacaggct
      361 tgcggtttcc tggttgccta ggagatgaca ttaagaactg ctgacgggac tttccagcac
      421 gggactttcc aaggcgggac atgggcggta cggggagtgg ctttaccctc agagctgcat
      481 aaaagcagat gctcgctggc ttgtaactca gtctcttact aggagaccag cttgagcctg
      541 ggtgttcgct ggttagccta acctggttgg ccaccagggg taaggactcc ttggcttaga
      601 aagctaataa acttgcctgc attagagctt atctgagtca agtgccctca ttaatgcctc
      661 actcttgaac gggagaagtt ccttactggg ttctctctca aacccaggcg agagaaactc
      721 cagcatggcg cccgaacagg gacttgagtg aaggcacgta cagctgagaa gacgtcggac
      781 gcgaaggaac cgcggggtgc gacgtgaccg agaagggctc ggtgagtagg cttctcgagt
      841 gccgggaaaa agctcgagcc tagttagagg actaggaagg gccgtagccg taactactct
      901 gggcaagtag ggcaggcgga cgggtacgta atgggggcgg ctacctcagc actgaatagg
      961 agacaattag atgaatttga gcatatacga cttcgcccga acggaaagaa aaagtatcaa
     1021 attaaacatt taatatgggc aggcaagaag atggaccgct tcggcctcca tgagaagtta
     1081 ttggagacag aggaaggttg taaaaagatc atagaagttc tctctcccct agaaccaaca
     1141 gggtcggaag gaatgaaaag tctgtataat ctggtgtgcg tattgctttg cgtccaccaa
     1201 gaaaagaaag tgaaagacac agaggaagct ttagcaatag taagacaatg ctgccaccta
     1261 gtggacaaag aaaaaactgc agttacgcca cctggtggac agcagaaaaa taacacagga
     1321 ggaacagcga cacctggtgg cagccaaaat tttcccgcac aacagcaagg gaatgcatgg
     1381 gtgcatgtac cactttcacc tcgcacccta aatgcatggg taaaagcagt agaagagaaa
     1441 aaatttgggg cagaaatagt acccatgttc caagccctct cagaaggctg caccccatat
     1501 gacatcaatc agatgcttaa tgtcttagga gatcatcagg gggccttgca aatagtgaaa
     1561 gaaataatta atgaggaagc agcccagtgg gatgtaaccc acccaccgcc ggcaggcccc
     1621 ttgccagcgg gacagctcag ggatccgggg ggatcagata tagcagggac cactagtaca
     1681 gtgcaagagc agctagagtg gatctatact gctaacccaa gggtagatgt aggggccatc
     1741 tatcgaagat ggatcatcct agggttacaa aaatgtgtaa aaatgtacaa tccagtgtct
     1801 gttttagata tcagacaagg gcccaaagaa ccattcaaag attatgtaga cagattctat
     1861 aaagcaataa gagcagaaca agcttcagga gaagtcaaac aatggatgac agaatctttg
     1921 ctcattcaga atgccaaccc agattgcaaa gtaattttga agggcctagg gatgcacccc
     1981 actcttgaag aaatgctgac agcctgtcaa ggggtgggag gcccaagtta caaagccaaa
     2041 gtcatggcag aaatgatgca gaacctgcag agtcagaaca tggtacagca gggaggtgga
     2101 aggggaagac caagaccccc gccaaagtgt tacaactgtg gaaaatttgg ccacatgcag
     2161 aggcagtgtc ctgagccaag aaaaataaaa tgtcttaaat gtggaaagcc agggcactta
     2221 gcaaaagact gcaggggaca ggtgaatttt ttagggtatg gccggtggat ggggacaaaa
     2281 ccaagaaatt ttcccgcagc cactcttggg gcggaaccaa gtgcgccccc tccaccgaac
     2341 aactctacac cttacgaccc agcaaagaag ctcctgcagc agtatgcaga gaaagggaaa
     2401 caaatgagaa atcagaacag aaacccccca gcgaacaatc cagattggaa cgagggatat
     2461 tctttgaact ccctctttgg agaagaccaa taaggacctg tataatagga ggaactgccg
     2521 ttaaggcatt attagataca ggggcagatg acactataat aaaggataca gatttacaat
     2581 taaggggatc atggagacca aaaatagtag gaggaattgg gggagggtta aacgtaaaag
     2641 aatatgataa tgtagaagta caattggaag acaagatatt aagaggaaca gtcctcatag
     2701 gagcaactcc catcaatatc ataggaagaa actttttagc ccaggcagga gccaaattag
     2761 tgatggggca attgtcgcag acaataccaa tcaccccggt acgcttaaag gaaggggcca
     2821 gaggaccacg attgaagcaa tggccactct ctaaagaaaa aataatagcc ctgcaagaaa
     2881 tttgcaaaac attagaggaa gaaggaaaat taagcagggt agggggagac aatgcataca
     2941 atacaccagt attctgtata aggaaaaaag acaaatcaca gtggagaatg ctggtagatt
     3001 tcagggaact caacaaagct acacaagact tctttgaagt ccaattaggt ataccccatc
     3061 cagcagggtt aaagaaaatg aagcaaataa ccattataga tgtgggggat gcatattata
     3121 gcataccact ggatcctgag tttagaaaat acacagcttt caccatccct acggtaaaca
     3181 atgagggacc aggcataaga tatcaattta attgcctacc gcagggctgg aagggatccc
     3241 cgacaatttt ccaaaacaca gcatcaaaaa ttctagaaga aataaagaaa gaattaaaac
     3301 agctgacgat tgtccagtac atggatgacc tctgggtagg atcacaagaa gagggtccaa
     3361 agcatgatca gctagtacaa acacttagga atagattgca agaatgggga ttagaaacac
     3421 cagagaaaaa ggtgcaaaga gaacctccct ttgagtggat gggatataaa ttatggcctc
     3481 ataaatggaa gttacaaagt atagaattag agaagaaaga acaatggaca gtgaatgatc
     3541 ttcagaaatt ggtagggaaa ttaaattggg cagcacaatt atatccagga ttgagaacaa
     3601 aaaatatctg taagctactt agaggaaaga aaaatttatt agacgtggta gaatggaccc
     3661 cagaggcaga agcagagtac gaagaaaaca aggagatcct aaaaacagag caagaaggta
     3721 cttattatgc accagaaaaa ccccttaggg cagcagtaca gaaattagga gatgggcaat
     3781 ggtcatacca attcaagcag gaaggaaaaa tcttaaaggt agggaagttc gccaaacaga
     3841 aagctactca caccaatgag ttgcgtgtac tagcaggagt agtacagaaa atagggaaag
     3901 aggccctagt aatttgggga caattaccca cttttgaact cccagtggag agggacacat
     3961 gggaacaatg gtgggcagac tattggcaag tcagttggat acccgaatgg gactttgtca
     4021 gtgttccgcc cttagtaact ttgtggtata cactgactaa ggaacccatc ccgggagagg
     4081 atgtctacta tgtagatgga gcctgtaata gacagtcgaa agagggaaaa gcaggctaca
     4141 taacccaaca aggcaaacaa agagtacaac agctagaaaa cacaacaaat caacaagctg
     4201 aactgacagc cataaaaatg gccttggagg atagcggccc taaagtcaat atagtaacag
     4261 attcacaata tgcgatgggc atattgacag cacagcccac acagagtgac tccccactag
     4321 tagaacaaat aatagcacag atggtacaga aagaagccat ctatctgcaa tgggtacctg
     4381 ctcataaagg tatagggggc aatgaagaaa tagacaaatt agtaagcaag ggagttagaa
     4441 gaatattgtt cattggcagg atagaagaag cacaagaaga acatgatagg tatcacagta
     4501 actggagaaa tctagcagac acatttggat tgccacaaat agtagctaaa gaaattgtag
     4561 caatgtgccc aaaatgtcaa gtaaaagggg aaccaataca tggacaagta gatgcttcac
     4621 caggagtgtg gcagatggac tgcacacata tagaaggaaa aatagtgata gtagcggtcc
     4681 atgtagccag tgggtttata gaagcagagg ttatccctag ggaaacagga aaagagacag
     4741 caaagttctt gttaaaaata ataggaagat ggcccatcac tcacctccat acagataatg
     4801 gaccaaattt cacttctcag gaagtagctg ctatgtgctg gtggggaaag gtagaacaca
     4861 caacgggggt accatataat ccacagtccc agggatctat agaaagtatg aacaaacaat
     4921 tgaaagagat aattggaaaa ataagagatg actgtcaata tacagaaaca gcagtactta
     4981 tggcctgcca cattcacaat tttaaaagaa agggaggaat aggggggcta acagctgcag
     5041 agagactaat aaatatgata acaacacaat tagaaatcaa cactctacaa accaaaatcc
     5101 aaaaaatttt gaattttaga gtctactaca gagaaggcag agatccagtg tggaagggac
     5161 ctgctcgcct gatctggaaa ggagaaggcg cggtagttct caaggaaggt gaagaactga
     5221 aggtagttcc gagaaggaaa gcaaaaatca taaaagacta tgagccaaga aaaacattgg
     5281 gtgatgagac tcacctggaa ggtgcaggag gaagtgatca ccaaatggca ggggatagtt
     5341 agatattgga tgaataaaag gaatctgaaa tgggaataca aaatgcatta tcaaatcact
     5401 tgggcatggt acactatgag cagatatgta atacccctcc caggaagtgg agaaatccat
     5461 gtggatatct attggcattt agctccaaaa caaggatggc tctcaactta tgcagtagga
     5521 atacaatatg ttagcctagt aaatgataaa tatagaacag aattagatcc caatacagca
     5581 gactccatga tacattgtca ttattttacc tgttttacag atagagccat ccaacaggca
     5641 ctaaggggaa acaggttcat cttctgtcaa tttccaggag gacataaact aacaggtcag
     5701 gtaccctcct tgcaatattt agcattacta gcccatcaaa atggcctcag gaagagatcc
     5761 cagagaggag agaccaggag gactagaaat ttgggatctc agcagggagc cgtgggacga
     5821 atggctcaga gatatggtag aagaaatcaa caacgaagcc aaactgcatt ttggccgaga
     5881 actcctatac caagtatgga attattgtca ggaggaaggg gagagacagg gaagacccat
     5941 agcggaaagg gcatataagt attatcgctt agttcagaaa gctctctttg tgcatttccg
     6001 gtgtggatgt cgcaggagac aaccctttga gccatacgag gagaggagaa atggacaagg
     6061 gggaggaaga ccaggacgtg tcccaccagg acttgattaa acaatacagg aaaccccttg
     6121 agacatgtac aaataaatgc ttttgcaaaa aatgctgtta tcattgccaa ttctgcttct
     6181 tacggaaagg actaggtatt acctatcatg cctttaggac cagaagaaag aagattgctt
     6241 cggctgatcg cattcctgta ccgcagcagt aagtatgaca aagttcttag gaatttttat
     6301 agtattagga atagggatag gaatagggat aagtacaaaa cagcagtgga taacagtgtt
     6361 ctatggagta ccagtatgga aaaacagctc agtccaagct ttttgcatga cacctactac
     6421 taggttgtgg gcaactacta attgcatacc agatgatcat gactatacag aagtaccact
     6481 gaatataaca gagccatttg aagcatgggc agacagaaat cccttagtag cacaagcagg
     6541 aagtaacatt cacctgctgt ttgaacagac attaaagccc tgtgtaaagc tatcacctct
     6601 atgtatcaaa atgaattgtg tagagttaaa aggctccgca acctctaccc cagcaacctc
     6661 tactacggca ggaaccaaac taccctgtgt tagaaataaa acagactcca acctacagtc
     6721 atgcaacgac accatcatag aaaaggagat gaatgacgag gcagcgtcaa actgcacctt
     6781 tgctatggct gggtacatta gggaccaaaa gaagaattac tcagtagtat ggaatgatgc
     6841 agaaatcttt tgtaagcgta gtacatcgca taatgggaca aaagagtgct atatgatcca
     6901 ctgtaatgat tcagttataa aggaagcttg tgataagaca tattgggatg aattaagact
     6961 aagatattgt gctccagcag gatacgcttt gcttaaatgt aatgattggg attatgcagg
     7021 atttaagcca gaatgttcta atgtttcagt agtgcattgc acaactttaa tgaatacaac
     7081 agtaaccact ggtctgttat tgaatggaag ctattcagaa aatcgaaccc agatctggca
     7141 aaaacatgga gtgagcaatg actcagtgtt aatcttgctc aataagcatt ataacctgac
     7201 agttacatgc aaaaggccag ggaataagac agtcttgcca gtaacgataa tggcaggatt
     7261 agtcttccac tcacagaagt ataatacaag actaaggcag gcctggtgcc acttccaggg
     7321 caattggaaa ggagcttgga aggaagtaca agaggaaata gtaaaattac caaaagaacg
     7381 gtaccaaggc accaatgata caaacaaaat ctttttgcaa agacaatttg gagacccaga
     7441 agcagcaaat ctatggttca actgtcaagg ggaattcttc tactgtaaaa tggactggtt
     7501 tttaaattat ctgaataatt taacagtgga tgctgatcat aatcattgta aaaacaacgc
     7561 agggaaaggt cgaagtccag gtccctgtgt acagagaact tatgttgcct gccatatccg
     7621 atctgtcata aatgattggt atactatatc aaagaaaaca tatgctccac caagagaagg
     7681 acatttgcag tgcacgtcca cagttactgg gatgacagta gagctaaact ataataacca
     7741 gaacaggaca aatgtaacat tgagtcccca gatagaaacc atctgggcgg cagaattggg
     7801 cagatacaaa ttggtagaga ttacaccaat tggatttgca cccacagaag tcaggcgata
     7861 cacgggaggc caagagaggc aaaaacgagt cccgttcgtg ctagggttcc taggcttctt
     7921 gggagctgct gggactgcaa tgggagcagc ggcgacagcc ctgacggtcc agtctcagca
     7981 tttacttgct gggatattgc agcagcagaa gaatctgctg gcggctgtgg gagctcaaca
     8041 gcagatgttg aagctgacca tttggggtgt gaaaaacctc aatgcccgcg tcacagctct
     8101 tgagaagtac ctggcggatc aggcacggtt aaacgcttgg gggtgcgcgt ggaaacaagt
     8161 atgtcataca acagtaccct ggacgtggaa taatacacca gagtggaata atatgacctg
     8221 gttggagtgg gaaaaacaga tagaaggatt ggagggcaac ataacaaaac aattggaaca
     8281 ggcaagggaa caagaggaaa agaatttgga tgcttatcaa aagttgtcag actggtcgag
     8341 tttttggtct tggttcgatt tttcaaaatg gctgaacatt ttaaagatag gctttttggc
     8401 agtaataggc gttatagggt taagattgct ttacacatta tatacttgca tagctagggt
     8461 taggcagggt tactctcctt tatctcctca gatccatatc catccgtgga agggacagcc
     8521 agacaacgca ggagagccag aagaaggtgg aagaacaggc aaaagcaaat ctacgcatta
     8581 gcagaaagaa tttgggggac gagacaagag gaccagttgg tgcaggcaat tgaccaattg
     8641 gttcttgaca ctcagcatct ggttacacaa cagctgcctg accctccttc tcaagcttag
     8701 aagcgcctgg cagtacttac aatatgggct tggggagctc aaagccgcag cacaagaagc
     8761 agttaaccat ctggcgagct ttgcacgcaa cgcggcacac cagatatggc ttgcttgcag
     8821 atccgcttat cgggcaatca tcaactctcc aagaagagtg cgacaagggc ttgaggaagt
     8881 ccttaattag gaagagaaat ggcaacatga ctccagaagg aagacgtcta caggacgggg
     8941 accaatggga tgaatggtca gatgaagaag atgaagtggg atttccagta agaccaagag
     9001 tgccactaag acaaataaca tacaaacttg cagtagattt ttcgcacttt ttaaaagaaa
     9061 agggaggact ggatgggatt tattactccg ataggagaaa taagatcctt aatctgtatg
     9121 ccctcaatga atggggaatc attgatgatt ggaacgcatg gtcaaaagga cctgggataa
     9181 gatacccgag gtgctttggc ttctgcttca agctagtacc ggttgccctg catgaggaag
     9241 cagaaacatg tgaaaggcat tgcttggtac acccagcaca actgcatgaa gaccctgatg
     9301 gtataaatca tggagaaata ttggcatgga agtttgatcc aatgttggct gttcagtacg
     9361 acccctcaag ggagtacttt acagacttat attcaacagt tggtacagga aactagccga
     9421 ccacaggctt gcggtttcct ggttgcctag gagatgacat taagaactgc tgacgggact
     9481 ttccagcacg ggactttcca aggcgggaca tgggcggtac ggggagtggc tttaccctca
     9541 gagctgcata aaagcagatg ctcgctggct tgtaactcag tctcttacta ggagaccagc
     9601 ttgagcctgg gtgttcgctg gttagcctaa cctggttggc caccaggggt aaggactcct
     9661 tggcttagaa agctaataaa cttgcctgca ttagagctta tctgagtcaa gtgccctcat
     9721 taatgcctca ctcttgaacg ggagaagttc cttactgggt tctctctcaa acccaggcga
     9781 gagaaactcc agca
//
LOCUS       SIVAGM3      9625 bp ds-RNA             VRL       15-AUG-1990
DEFINITION  Simian immunodeficiency virus (SIV) proviral, complete genome.
ACCESSION   M30931
KEYWORDS    complete genome.
SOURCE      Simian immunodeficiency virus (isolate AGM3) from African Green
            monkey proviral genomic DNA.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9625)
  AUTHORS   Baier,M., Garber,C., Mueller,C., Cichutek,K. and Kurth,R.
  TITLE     Complete nucleotide sequence of a simian immunodeficiency virus
            from African green monkeys: A novel type of intragroup divergence
  JOURNAL   Unpublished (1990); Paul-Ehrlich-Institute, 6070 Langen 1, Germany
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Baier 20-DEC-1989.
            
            This sequence was taken from an infectious molecular clone (used
            for heterologous infection of the pigtail macaque). The 3' LTR
            sequence does not appear to match the 5' LTR sequence.
FEATURES       from  to/span     description
    pept        431     1996     gag polyprotein
    pept    >  1687     4827     pol polyprotein (NH2 terminus uncertain)
    pept       4763     5461     vif protein
    pept       5244     5603     vpX protein
    pept       5554     5771     tat protein, exon 2 (first expressed exon)
               8013     8154     tat protein, exon 3 (AA at 8014)
    pept       5711     5771     rev protein, exon 2 (first expressed exon)
               8013     8221     rev protein, exon 3 (AA at 8015)
    pept       5778     8411     env polyprotein
    pept       8245     8934     nef protein
BASE COUNT     3324 a   1827 c   2383 g   2091 t
ORIGIN      
        1 cagtctctta ctaggagacc agcttgagcc tgggtgttcg ctggttagcc taacctggtt
       61 ggccaccagg ggtaaggact ccttggctta gaaagctaat aaatcttcgc tgcattagag
      121 cttctctgag tcaagtgccc tcattgacgc ctcactcttg aacgggtaaa acttccttac
      181 tgggttctct ctcaacccag gcgagagaaa ctccagcagt ggcgcccgaa cagggacttg
      241 acttgagtga aggcacgtac agctgagaag acgtcggacg cgaaggaagg cgcggggtgc
      301 gacgtgacca agaagggctt ggtgagtagg cttctcgagt gccgggaaaa agctcgagcc
      361 tagttagagg actaggaagg gccgtagcca taactactct gggcaagtag ggcaggcgga
      421 cgggtacgca atgggggcgg ctacctcagc actaaatagg agacaattag acaaatttga
      481 gcatatacga cttcgcccga ccggaaagaa aaagtaccaa attaaacatt taatatgggc
      541 aggcaaggaa atggagcgct tcggcctcca tgagagatta ctagaatcag aagaaggatg
      601 taagaagatc atagaagtac tctacccgct agaaccaaca gggtcggagg gcttaaaaag
      661 tctgtttaac cttgtgtgcg tattgttttg cgtacacaaa gataaggaag tgaaagacac
      721 agaagaagca gtagcaatag taagacaatg ctgccatcta gtggagaaag aaagaaatgc
      781 agaaagaaat acaacagaga catctagtgg acaaaagaaa aatgacaagg gagtaacagt
      841 gccacctggt ggcagtcaaa atttcccagc acaacaacag ggaaatgcat ggatacatgt
      901 gcccttgtca ccacgcacct taaatgcgtg ggtaaaagca gtagaggaga aaaaattcgg
      961 agcagaaata gtgcccatgt tccaggcttt atcagaaggg tgcacaccct atgacatcaa
     1021 tcaaatgctt aatgtcctgg gagaccatca aggggcgcta caaatagtaa aagaaatcat
     1081 caatgaggaa gcagcccagt gggatatagc tcacccacca ccagcaggac cattaccagc
     1141 aggacaactc agagacccta gaggctctga catagcagga accaccagca cagtgcaaga
     1201 acagctggaa tggatataca cagccaatcc cagagtagat gtgggtgcca tctatagaag
     1261 gtggattatc ctggggttgc aaaaatgtgt aaaaatgtac aacccagtgt ctgtcttaga
     1321 cataagacag gggcccaaag aagcattcaa agactacgta gataggttct acaaagcaat
     1381 aagagctgag caggcctcag gagaagtaaa acagtggatg acagaatcat tactcattca
     1441 gaatgctaat ccagactgta aagtcatcct aaagggcctg ggaatgcatc ccactctaga
     1501 agaaatgtta actgcctgtc aaggagtggg aggaccaagt tacaaagcaa aagtgatggc
     1561 agaaatgatg caaaatatgc aaagccagaa catgatgcaa cagggcggtc agagaggaag
     1621 accaagaccc ccagtaaagt gttacaattg tggaaaattt ggccatatgc aaagacaatg
     1681 ccctgaacca agaaagatga gatgcttgaa atgtgggaaa ccagggcatt tagcaaaaga
     1741 ttgcagagga caggtaaatt ttttagggta tggccggtgg atgggagcga aacccagaaa
     1801 ttttcccgcc gctactcttg gggtggagcc aactgcgccc cctccaccga gtccatacga
     1861 ccctgcaaag aagctcctgc agcaatatgc agacaagggg aagcagttga gggaacaaag
     1921 gaaaaaacca ccagcagtga atcccgattg gacagaggga tattctttga actccctctt
     1981 tggagaagac caataaaaac agtttacata gaaggggtcc ccatcagagc attattagat
     2041 acgggggcag atgataccat tataaaagaa gcagatttac aattatcagg aacatggaaa
     2101 ccaaaaataa tagggggcat tggaggggga ctcaatgtaa aagagtatag tgatagggaa
     2161 gtaagattgg aagacaaaat tttgagaggg accatattga taggaagcac tcccataaac
     2221 ataattggaa gaaatatatt agcaccagca ggagccaaat tagtaatggg tcaactgtca
     2281 gaacaaattc ccattacccc tgtgaaatta aaagaagggg ctagaggacc tttcttaaaa
     2341 caatggcccc tctccaaaga aaaaataaaa gccttacagg aaatatgtga ccaattagag
     2401 aaagaaggaa aaattagcaa gataggagga gagaatgcat acaacactcc agtgttttgc
     2461 ataaagaaaa aagacaagtc acaatggaga atgttagtag attttaggga actaaacaaa
     2521 gcaacacaag attttttcga agtacagtta ggcatacctc atccatcagg gttcgaaaag
     2581 atgacggaaa taacagtatt agacataggg gatgcctatt attcaatacc attagaccca
     2641 gagtttagaa agtataccgc ttttaccatt ccatcagtaa ataatcaagg gccaggtact
     2701 agatatcagt tcaactgtct tccacaagga tggaagggat ccccaactat ttttcagaac
     2761 acagcagctt ccattctaga agaaataaaa aaggagttaa aacccctaac cattgtgcaa
     2821 tacatggatg acctatgggt agggtctcag gaagatgaat acacgcatga tcggttggta
     2881 gaacaactaa gaatgaaatt aagtgcctgg ggattagaaa caccagacaa gaaagtacag
     2941 aaaaaaccac cttatgagtg gatgggatac aaattgtggc cacacaagtg gcagataagc
     3001 agcatagaat tagaagacaa agaagaatgg actgtaaatg atatacaaag actagtgggg
     3061 aaactaaatt gggcagcaca gctttaccca ggactcagaa ctaaaaactt gtgtaaatta
     3121 atcagaggaa aaaagaactt actagaaaca gtaacctgga cagaggaagc agaagcagaa
     3181 tatgcagaaa acaaagagat cttaaaaacg gaacaggaag ggacctacta caaaccagga
     3241 agacccatca gagcagcagt gcaaaaacta gaaggaggtc aatggagtta ccaattcaag
     3301 caagagggac aagtattaaa agtaggtaaa tacacaaagc agaaaaacac tcataccaat
     3361 gagttccgtg tattggcagg attagtacaa aaactttgta aagaatcttt agttatatgg
     3421 ggagagttgc cagtccttga actcccaata gagagggaag tatgggaaca atggtgggct
     3481 gattactggc aggtaagttg gattccagac tgggaatttg tcagtacccc acccctagta
     3541 aaattatggt ataccctgac aaaagaaccc ataccaaagg aagatgtcta ctatgtggat
     3601 ggagcttgta atagaaattc aagggaagga aaagcaggat atatcacaca atatgggaaa
     3661 caaagggtgg aaaaattaga aaatacaaca aaccagcaag cagaattaat ggccataaaa
     3721 atggcactag aagatagtgg gcctaatgta aacatagtaa cagattcaca atatgcaatg
     3781 ggaatattaa ctgcccaacc cacacagagt gactcaccct taatagaaca aattatagca
     3841 ctaatggtac aaaaacatca gatatacttg caatgggtac cagcagacaa agggatagga
     3901 ggcaatgaag agatagataa actagtaagt caagggatga ggaaaatttt atttttagaa
     3961 aaaatagaag aagcccagga ggaacatgaa aggtaccata ataattggag gaacttagca
     4021 gacacttatg ggctaccaca aattgtggca aaagaaatag tagccatgtg tccaaaatgt
     4081 cagataaaag gggaaccagt ccatgggcaa gtagatgcct cgccaggggt atggcaaatg
     4141 gactgtacac atttagaagg caaggtaatc atagtagcag tccatgtagc cagtggattc
     4201 atagaagcag aagttatacc tagagaaaca gggaaagaaa cagcaaaatt tttattaaag
     4261 atactaagta gatggcccat aacccaactg catacagaca atggacccaa ttttacgtct
     4321 caagaagtag cagcaatgtg ttggtgggga aaaatagaac acaccacagg tgtaccctat
     4381 aaccctcaat cacaaggctc tatagagagt atgaataaac agttaaaaga aataattggg
     4441 aaaataagag atgactgtca atacacagaa acagcagtac ttatggcatg ccacatccac
     4501 aattttaaaa gaaagggagg aatagggggg ttaacaccgg cagagagatt aatcaatatg
     4561 attactacac aattagaatt acaacaccta caaaccaaaa ttcaaaaaat tttaaatttt
     4621 agagtctact acagagaagg gagagatcct gtctggaaag gaccaggaca gttaatttgg
     4681 aaaggggaag gtgcagtggt catcaaagga ggtgtggaat taaaagaata cccaagaagg
     4741 aaagcaaaaa ttataaagga ttatgaacca agaaaaagaa tgggtgatga gagtaacttg
     4801 gaaggtgccg gaggagctga taactaaatg gcaagggata gtgaggtact ggatgaggac
     4861 tagaaaatta gactggaaat atcgaatgca ctaccaaatt acatgggcat ggtacacaat
     4921 gagtagatat gagatacccc tagggcaaca tggaagtata catgtagatc tatattggca
     4981 tctgacacca gaaaagggat ggctatcaac atatgctgag gggatacagt atctaagcaa
     5041 tagggatcct tggtatagga cagaattgga tcctgcaaca gcagatagcc tgatacatac
     5101 ccattatttt acttgtttta cagaaagggc catcaggaaa gccctattgg gacagaggtt
     5161 caccttctgt cagttccccg agggacacaa gaaaacagga caggtaccct ctttgcaata
     5221 cttagctctc cttgcacacc aaaatggcct caggcagaga tcccagagaa gcaagaccgg
     5281 gggaactaga aatatgggat ttgagcaggg agccgtggga cgaatggcta agagacatgc
     5341 tagaagatat caatcaggaa gccaagatgc attttgggcg cgagctcctg ttccaagtat
     5401 ggaactattg tcaggaggag ggagaaagga atcgcactcc catgctagaa agggcttata
     5461 aatattataa attggtgcaa aaagctctct ttgtgcattt ccggtgtgga tgccgcagaa
     5521 gacaaccctt tgaaccatac gaagaaagga gggatggaca agggggagga cgagcagggc
     5581 gcgtaccacc aggacttgat tgaacaactc aaagcacccc tgaagcggtg tacaaacaag
     5641 tgctattgta aatgttgctg ttatcactgt cagctttgct ttttacaaaa gggattaggt
     5701 gttacctatc atgcccctag gatcagaaga aagaagattg ctccgcttga tcgctttcct
     5761 gaacaaaaac agtgagtatg aagctgacat tactgatagg gatactatta atagggatag
     5821 gagtagtgct taatacaagg caacaatggg tcacagtatt ttatggagta ccagtatgga
     5881 aaaacagctc agtacaggct ttctgcatga cacccaccac cagactatgg gcaactacta
     5941 actcgatacc agatgatcat gactacacag aggtaccatt aaacatcact gaaccatttg
     6001 aagcatgggc tgacagaaac cccttagtag cacaagcagg aagtaatata cacctgctat
     6061 ttgagcagac tctgaagcca tgtgtaaaat tatcaccttt gtgcattaaa atgtcctgtg
     6121 tagaattgaa ctcctctgag cctaccacca ctcctaaaag taccacggcc tcaacaacca
     6181 atatcacagc ctcaacaacc actttgccgt gtgtccagaa caagacaagt actgtgttag
     6241 aatcatgtaa tgaaacaatc atagaaaagg aattaaatga agagcctgct tctaattgta
     6301 catttgcaat ggcagggtat gtaagagatc agaaaaagaa gtattcagtg gtgtggaatg
     6361 atgcagaaat catgtgtaag aagggtaaca attctaacag agaatgttat atgattcatt
     6421 gtaatgattc agttataaaa gaagcctgtg ataaaacata ttgggatgag ttaagattaa
     6481 ggtactgtgc cccggcaggg tttgctttat taaaatgcaa cgattatgat tatgcagggt
     6541 ttaagacaaa ctgttctaat gtttcagtgg tgcattgtac taacttgata aatacaacag
     6601 tgactactgg actgttgttg aatgggagct actcagagaa tcgaacccag atatggcaga
     6661 aacatagagt aagcaatgac tcagtgttag tgttatttaa taaacattac aatctaacag
     6721 ttacttgcaa aagaccagga aacaaaacag tcttaccagt aacaatcatg gcagggctag
     6781 tgtttcattc tcagaggtac aatacaaggc tgagacaagc ttggtgtcac ttccagggca
     6841 actggagagg agcctggaaa gaagtaaaaa atgaaatagt aaaattacca aaagatagat
     6901 accaaggaac caatgatact gaagagattt atctgcagag actatttgga gatccagaag
     6961 cagcaaattt atggtttaat tgtcaggggg aattcttcta ttgtaaaatg gattggtttc
     7021 taaattacct gaataatcgt acagtagatc cggaccataa tccgtgtaat ggtacgaagg
     7081 gaaaaggtaa ggcaccagga ccctgtgcac aaagaacata tgttgcttgc catatacgat
     7141 ctgtcattaa tgattggtac acactatcaa ggaaaaccta tgcaccgcca agagaagggc
     7201 acttgcaatg cacatccacg gtaacgggta tgtcagtgga gctaaattac aatagtaaga
     7261 acaggactaa tgtaacatta agtccccaga tagaaaccat ctgggcagca gaattgggca
     7321 ggtacaaatt agtagaaatt acaccaattg gcttcgcacc cacagaagta agaaggtata
     7381 cgggaggtca tgacagaaca aagcgagtcc cgttcgtgct agggttccta ggcttcttag
     7441 gagctgctgg gactgcaatg ggagcagcgg cgacagccct gacggtccag tctcagcatt
     7501 tacttgctgg gatactgcag cagcagaaga atctgctggc ggctgtggag gctcaacagc
     7561 agatgttgaa gctgaccatt tggggtgtga aaaacctcaa tgcccgcgtc acagctcttg
     7621 agaagtacct agaggaccag gcgcggttga atgcttgggg gtgcgcatgg aagcaagtct
     7681 gtcatacaac cgtaccgtgg cagtggaata ataggacccc tgattggaat aatatgactt
     7741 ggctggaatg ggaaagacag atatcgtatt tggaaggtaa cataacaaca caattagagg
     7801 aagccagagc acaggaggag aagaatttgg atgcatacca aaaattaagt agttggtcag
     7861 atttctggtc ttggttcgat ttctcaaagt ggctgaacat tctaaaaata ggatttttgg
     7921 atgtactagg tattatagga ttaagattgc tttatacagt atattcttgc atagctaggg
     7981 ttaggcaggg ttactctcct ctttctccac agatccatat ccacccgtgg aagggacagc
     8041 cagacaacgc agaagggcca ggagaaggtg gagacaagcg caagaacagc tccgagcctt
     8101 ggcagaaaga atctggcaca gcagagtgga agagcaactg gtgcaagcga ttgaccaatt
     8161 ggtgctcgat cagcagcatc tggctataca acagttgcct gaccctccta gttcatctta
     8221 ggagcgcttt ccagtacata caatatgggc ttggggaact caaagccgca gcacaagaag
     8281 cagttgtcgc tttggcacgc cttgcacaaa acgcgggcta ccagatatgg cttgcttgca
     8341 gatccgctta tagggcaatc atcaactctc caagaagagt gcgacaaggc cttgaaggaa
     8401 tccttaatta ggaagagaaa tggtaaaatg actccagaag gaagaaaatt acaagaagga
     8461 gataaatggg atgaatggtc tgatgaagaa gatgaagtag gatttccagt aagaccaaga
     8521 gtgccgctaa gacaaatgac ctataaatta gcggtggact tttcgcactt tttaaaagaa
     8581 aaggggggac tggatgggat ttattactcc gacaggagga atcagatcct aaacctgtac
     8641 gccctcaatg agtggggaat cattgatgat tggaatgctt ggtcagaagg accaggaatc
     8701 agatacccaa gatgcttcgg cttctgcttt aaattggtac cagtagacct gcatgaggaa
     8761 gcagagactt gtgagagaca ttgcctggtg catccagcac aagtgaggga agaccctgat
     8821 ggaatcaacc atggagaagt cttggtctgg aagtttgatc ccatgttagc agtccaatat
     8881 gaccctaaca gaaaatatct cactgacatg catgatcttg gcaagaggaa gtagctaacc
     8941 gcaggcttgt ggttaagcac atcaccatgg tgatgacatt aagaactgct gacgggactt
     9001 tccagcaagg gactttccag ggcgggtcat gggcggtacg gggagtggct ttaccctcag
     9061 agctgcataa aagcagatgc tcgctggctt gtaactcagt ctcttactag gagaccagct
     9121 tgagcctggg tgttcgctgg ttagcctaac ctggttggcc accaggggta aggactcctt
     9181 ggcttagaaa gctaataaat cttcgctgca ttaggcagag acttgtgaga gacattgcct
     9241 ggtgcatcca gcacaagtga gggaagaccc tgatggaatc aaccatggag aagtcttggt
     9301 ctggaagttt gatcccatgt tagcagtcca atatgaccct aacagaaaat atctcactga
     9361 catgcatgat cttggcaaga ggaagtagct aaccgcaggc ttgtggttaa gcacatcacc
     9421 atggtgatga cattaagaac tgctgacggg actttccagc aagggacttt ccagggcggg
     9481 tcatgggcgg tacggggagt ggctttaccc tcagagctgc ataaaagcag atgctcgctg
     9541 gcttgtaact cagtctctta ctaggagacc agcttgagcc tgggtgttcg ctggttagcc
     9601 taacctggtt ggccaccagg ggtaa
//
LOCUS       SIVAGM691     683 bp ss-RNA             VRL       15-AUG-1990
DEFINITION  Simian immunodeficiency virus (SIV) long terminal repeat.
ACCESSION   M33719
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Simian immunodeficiency virus (isolate ver-1 (692)) from African
            green monkey proviral DNA.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 683)
  AUTHORS   Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
            Olmstead,R.A. and Hirsch,V.M.
  TITLE     Simian immunodeficiency viruses from African green monkeys display
            unusual genetic diversity
  JOURNAL   J. Virol. 64, 1086-92 (1990)
  STANDARD  full staff_entry
COMMENT     Kindly submitted prior to publication in computer-readable form by
            Phillip Johnson. The ver-1 isolate is from a monkey imported from
            Ethiopia.
            
               Author address:Phillip Johnson
               Georgetown University
               Rockville, MD
               (301-496-2976)
FEATURES       from  to/span     description
    LTR           1      683     long terminal repeat
BASE COUNT      174 a    149 c    187 g    173 t
ORIGIN      
        1 tggatgggat ttattactcc gaaagaaggg aaaagatttt gaacctgtat gcattaaatg
       61 aatggggaat catagatgat tggcaagctt atactccagg tccaggcatc agatatccaa
      121 gatgctttgg gttctgtttt gaattagtgc cagtggacct tagtgaggaa gcgcaaggat
      181 gtgaaaggca ctgtctggtc catcctgctc aattacagga ggatccagat ggtatctggc
      241 atggagaaac attggtctgg agattcaatc ccatgctagc atgcaaggcc atgccaggag
      301 tgttcaatga catgcatgca acagtgggga agtagcttgc ggttagcgcg tccgggacct
      361 gtgtaccaac cagcatagca accatgctaa tgagctaggg actttccaga aggggagtgg
      421 tttaaccctc agatattgta tataagcaga tgctcttggg cttgtaactc agtgctctta
      481 ctaggagcca gctagagcct gggtgttcgc tggtagccta acctggactg gccctccagg
      541 ggtaagagcc tccacggctt gaatgcttaa taaaccttgc ctgcattaga agtacttcga
      601 gtcgtgtggt cccattgccg cctccgttca cgggaatcct caatactggg ttctctcttg
      661 cccaggggag agaaactcca gca
//
LOCUS       SIVAGM692    1542 bp ss-DNA             VRL       15-AUG-1990
DEFINITION  Simian immunodeficiency virus (SIV) gag gene, complete cds.
ACCESSION   M29974
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Simian immunodeficiency virus (isolate ver-1 (692)) from African
            green monkey proviral DNA, clone ver-1(692).
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 1542)
  AUTHORS   Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
            Olmstead,R.A. and Hirsch,V.M.
  TITLE     Simian immunodeficiency viruses from African green monkeys display
            unusual genetic diversity
  JOURNAL   J. Virol. 64, 1086-92 (1990)
  STANDARD  full staff_entry
COMMENT     Kindly submitted prior to publication in computer-readable form by
            Phillip Johnson. The ver-1 isolate is from a monkey imported from
            Ethiopia.
            
               Author address:Phillip Johnson
               Georgetown University
               Rockville, MD
               (301-496-2976)
FEATURES       from  to/span     description
    pept          1     1542     gag polyprotein
BASE COUNT      532 a    299 c    405 g    306 t
ORIGIN      
        1 atgggttcgg gttcctcagc actgtcaggg agaaaattag accaatttga acatatacgt
       61 cttcgcccga acggaaagaa aaagtaccaa ttgaaacatt taatatgggc aggcaaggaa
      121 atggagcgct ttggcctcca tgaaaagttg ttagaaacag aagaggggtg taaaaagatc
      181 atagaagtat tgcttccctt agaaccaacc gggtcggaag gtttaaaaag cctgttcaat
      241 ttgacctgcg tcatttgctg cattcatcag gaagcgaaag tgaaagacac agaggaagca
      301 gtaataagaa taaagcaaca gtgccatcta gtggacaaag gtgagaatgc agccaaagga
      361 atagataaga caacaccgac acctagtggt aggagtcaaa attacccggc acaacagcag
      421 aataatgtat gggtacatgt gccacttagc cccagaacat taaatgcttg ggtaaaagta
      481 attgaagaaa agaaatttgg agcagagata gttcccatgt ttcaggccct gtcagaagga
      541 tgtaccccat atgatgtgaa ccaaatgttg aatgttctag gagaccatca gggggccctg
      601 cagatagtga aagaggtcat caatgaagaa gctgcccagt gggacattac acatccccca
      661 ccagcagggc cgctcccagc agggcaattg agagatccaa gggggtcaga catagcaggg
      721 actactagta ccattcaaga acaactagaa tggatttaca cagccaaccc aagaatagac
      781 gtgggagcta tctataggag atgggtaata gcagggctgc aaaaatgtgt cagaatgtat
      841 aatccaacag gggttctgga tataagacaa ggaccaagag aatcttttag cgattatgta
      901 gatagattct acaaggccct gagagcagaa caagcctctc aggatgttaa gaattggatg
      961 acagacactc tgttgattca aaatgctaac ccagagtgta aggtcattct gaaagggcta
     1021 ggcatgcacc ctaccttgga agaaatgctt acggcatgcc agggagtagg gggaccccaa
     1081 tacaaagcca aattgatggt agaaatgatg aatcaaatgc agggggtcaa catggtacag
     1141 caagcaggaa taggaggtag agggagagga agaccagtta aatgctacaa atgtggaaaa
     1201 tttgggcatg tgcagaaaaa ttgcactcaa aaagggccag tagtatgcct gaaatgtgga
     1261 aaacctggcc attttgctcg agattgcaga ggagcagtaa attttttagg gtatggcagg
     1321 tggatgggag caaaaccaaa aaatttttta gaacacagag cagcagtccc ctccgcccct
     1381 ccaccgccgc acaacccagg ggcgtacgac gaagccactc ggcttctgga gaaatatacc
     1441 caagagggag cccaacaaag gagaaaagta gagaagagct cccaagcggg gagggaggaa
     1501 gaggattatt ccttgaaatc cctctttgga gaagaccaat aa
//
LOCUS       SIVAGM90      723 bp ss-RNA             VRL       15-AUG-1990
DEFINITION  Simian immunodeficiency virus (SIV) long terminal repeat.
ACCESSION   M33718
KEYWORDS    .
SOURCE      Simian immunodeficiency virus (isolate 90) from African green
            monkey proviral DNA, PCR clone 03F.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 723)
  AUTHORS   Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
            Olmstead,R.A. and Hirsch,V.M.
  TITLE     Simian immunodeficiency viruses from African green monkeys display
            unusual genetic diversity
  JOURNAL   J. Virol. 64, 1086-92 (1990)
  STANDARD  full staff_entry
COMMENT     Kindly submitted in computer-readable form prior to publication by
            Phillip Johnson, Georgetown University, Rockville, MD (301-496-
            2976).
            
               Author address:Phillip Johnson
               Georgetown University
               Rockville, MD
               (301-496-2976)
FEATURES       from  to/span     description
    LTR           1      723     long terminal repeat
BASE COUNT      187 a    160 c    202 g    174 t
ORIGIN      
        1 tggatgggat ttattactcc gaaaggagga atagaatcct caacctatat gctcttaatg
       61 aatggggaat cattgatgat tggaatgcat ggtcagcagg accaggcata agatatcccc
      121 gctgctttgg cttttgcttc aagttagtac cggtagagat gcatgaagag gcagaaacct
      181 gtgagagaca ttgcttggtg catcctgcac aagtaaaaga ggaccccgat ggcatcagtc
      241 atggagagac cttggtctgg aagtttgacc cctatgttag cagtgcagta tgacccaaac
      301 agacagtatt tagaagacat gcatgcactg gtgaagagga agtagctaac cgcaggcttg
      361 tggttaagcc gttgccgggg agatgacatt tgaaactgct gacaagggac tttccaaggg
      421 actttccagg gcgggccatg ggcggtacgg ggagtggttt taccctcaga gctgcataaa
      481 agcagatgct cgctggcttg taactcagtc tcttactagg agaccagctt gagcctgggt
      541 gttcgctggt tagcctaacc tggttggcca ccaggggtaa ggactccttg gcttggaaag
      601 ctaataaaca ttgcctgcat tagagcttat ccgagtcaag tgccctcatt gacgcctcac
      661 tcaagcaggg gaaccgttcc ttactgggtt ctctctctga cccaggcgag agaaactcca
      721 gca
//
LOCUS       SIVMNDGB1    9215 bp ss-RNA             VRL       15-AUG-1990
DEFINITION  Simian immunodeficiency virus (SIV) gag, pol, vif, vpR, tat,
            rev, env and nef genes.
ACCESSION   M27470 X15781
KEYWORDS    .
SOURCE      Simian immunodeficiency virus (isolate GB1) from African mandrill.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9215)
  AUTHORS   Tsujimoto,H., Hasegawa,A., Maki,N., Fukasawa,M., Miura,T.,
            Speidel,S., Cooper,R.W., Moriyama,E.N., Gojobori,T. and Hayami,M.
  TITLE     Sequence of a novel simian immunodeficiency virus from a
            wild-caught African mandrill
  JOURNAL   Nature 341, 539-541 (1989)
  STANDARD  full staff_entry
COMMENT     The mandrill virus is distinct from all other primate immuno-
            deficiency viruses, thus it can be regarded as a type 4 virus.
            There is neither a vpX nor a vpU coding sequence. The splice sites
            and coding regions for tat and rev are tentative.
FEATURES       from  to/span     description
    pept        450     1958     gag polyprotein
    pept    <  1745     4774     pol (NH2-terminus uncertain; AA at 1745)
    pept       4728     5246     vif
    pept       5227     5541     vpR protein
    pept       5471     5730     tat protein, exon 2 (first expressed exon)
               7950     8037     tat protein, exon 3 (AA at 7951)
    pept       5590     5677     rev protein, exon 2 (first expressed exon)
               7950     8167     rev protein, exon 3 (AA at 7952)
    pept       5661     8126     env polyprotein
    pept       8170     8814     nef
    binding     275      291     primer (Lys-tRNA) binding site
    signal     9191     9196     poly-A signal
BASE COUNT     3323 a   1478 c   2196 g   2218 t
ORIGIN      
        1 ggagtctcta ctacagaggc taagggttgt atctctgagc agatcccctt agagcaagga
       61 ccagagtcct gagtgactgg gtctgagcac ctcactcggg gctgatcacc tcgaggtagt
      121 ggaactcctt gcttgcttgc tattgtcttc aataaagtaa cttagaatta gagcaagtga
      181 gtaagtgtta tccattgtgc gcctctcttc taaacctgtt gtgttctcat ttagagaaca
      241 gaaggacttc tagttaaccc tagaagcctt tcagtggcgc ccgaacagga cttgaagaga
      301 ggcactgaca cttgaggcag agcactccgc ctggaagaag caggttgaag gagagtggac
      361 tggtctgaag acgccaggag gtgagtcagt gggactgact ttacaagaat tagttgtacc
      421 ctagtgtaag gggcagcata gtcagagcaa tgggtaatgg gaactctgcc ttgttaggga
      481 ctgatttgga taaatttgag aaaataagat taaagagagg tggtaaaaaa tgttatagat
      541 tgaaacacct ctgttggtgt aaaggtgaat tagatagatt tggcttatcg gataaactcc
      601 ttgaaacaca gcaaggatgt gaaaaaatcc tctcagtatg ttggccatta tatgaccaag
      661 gatcagataa tctaaaagct ttggtaggga cagtctgtgt tgtagcctgc atacacgcag
      721 gtatagaaat taagagcaca caagatgctt taaaaaaatt aaaagtcata acaagaaagg
      781 aagaaaagca ggaggatgaa agtaagaatt tccctgtaca aagggatgca gcaggacagt
      841 atcagtatac tccaataagt cctaggatta tacagacatg ggtaaaaaca gtggaagaaa
      901 agaagtggaa accggaggtc atccctctat tctcagcatt gacagaagga gcaatcagtc
      961 atgatttgaa tatcatgctg aatgcagtag gagatcatca gggagcaatg caagtcttaa
     1021 aagatgtaat taatgagcaa gcagcagaat gggatctaac acatcctcaa caacaaccag
     1081 cacaaccagg aggaggatta aggacccctt caggctctga tatagcagga actacttcta
     1141 cagtggaaga acaattggca tggatgaata tgcaacaaaa tgcaatcaat gtaggaacaa
     1201 tctataagag ttggattata ctgggcatga atagattggt aaaaagtcat tgtccaataa
     1261 gtataacaga tgtaagacag ggaccaaagg aagcttttaa agactatgta gatagattct
     1321 acaatgtaat gagagcagaa caagcttcag gagaagtaaa gatgtggatg cagcagcatc
     1381 tgcttataga aaatgcaaac ccagaatgca agcagatttt gagaagctta gggaaaggag
     1441 caactttaga ggaaatgttg gaagcatgtc agggagtagg tgggccacaa cataaagcca
     1501 gattaatggc agaaatgatg agaacagtgg taggacaatc acaaaatttt gtgcagcaga
     1561 gagggcctca aagaggacca gttagacaac ctactggaag gaaacctatc tgcttcaact
     1621 gtaataaaga agggcatgta gcaaggttct tcaaggcccc tagaaggaaa gggtgctgga
     1681 attgtggagc aatggatcat cagaaagctc aatgccctaa gccagctcag cagcagaggg
     1741 ttaatttttt agggtatggc ccttggggtc cctccaaacc ggggaattat ccggcacaag
     1801 aggtgactcc aacagctcca ccattagagg agaaacctct gcagaaaact ctgagcactt
     1861 atcagaaatt agggagaggg ctcaggcaga agatgaagga ggagaagaga gaggaggatt
     1921 ttcattccct gagtactctc tttcaagaag accaatagaa gaggtctcag tggatggtgt
     1981 cactataaga gctctactag atacaggagc tgatgatacc atctttaatg aaagaaatat
     2041 aaaattaaaa ggaaattggc agccaaaaat tataggggga ataggtggaa acttaagagt
     2101 aaaacagtat gataatgtat atgtagaaat aagagggaag ggaacatttg ggacagtatt
     2161 gataggacct actccaatag atataatagg gagaaacata atggaaaaat taggaggaaa
     2221 attaatattg gcacaattgt ctgataaaat accaataaca aaagtgaaat taaaaccagg
     2281 agtagatgga cccagaataa aacaatggcc tttaagtaaa gagaaaatag ttggtcttca
     2341 gaaaatatgt gatagattag aggaggaagg aaaaattagt agggtagatc caggaaataa
     2401 ttacaataca cctatctttg ccataaagaa gaaggataaa aatgaatgga gaaaattaat
     2461 agactttaga gaattaaaca agttaacaca ggattttcat gaattacagt taggtatacc
     2521 tcacccagca ggaataaaaa agtgtaaaag aataacagtc ctagatatag gggatgccta
     2581 ttttagtata cctctggatc cagattatag accctatact gcctttacgg taccatcagt
     2641 taataatcaa gcaccaggaa aaagatacat gtataatgtt cttcctcaag ggtggaaggg
     2701 aagtccatgt atctttcaag ggacagtagc atcactgctg gaggtattta gaaagaacca
     2761 tccaacagta cagttatatc aatacatgga tgatttgttt gtagggtcag actatacagc
     2821 agaagagcat gagaaagcta tagtagaatt aagggcttta ttaatgacat ggaacttaga
     2881 aacacctgaa aagaaatatc agaaagaacc tccctttcat tggatggggt atgagttaca
     2941 cccagataag tggaagatag aaaaggttca actaccagaa ttagcagaac agccaacagt
     3001 aaatgaaata cagaaattgg taggtaaatt aaattgggct gcacagttat atcctgggat
     3061 caaaacaaaa caactgtgca agctaataag aggaggacta aacataacag agaaagtcac
     3121 aatgacagaa gaagcaagac tggaatatga acaaaataaa gagatcttgg ctgaagaaca
     3181 agaagggtct tattatgatc ctaataagga attatatgta agatttcaga aaacaacagg
     3241 aggagatata tcatttcaat ggaagcaagg aaataaggtt ttaagagcag ggaaatatgg
     3301 gaaacagaaa acagcacata gtaatgacct catgaaattg gcaggtgcta cgcagaaggt
     3361 aggaagagaa agtatagtaa tctggggttt tgtaccaaaa atgcagatac ccactacaag
     3421 ggagatatgg gaagattggt ggcatgagta ttggcagtgt acatggatac cagaagtaga
     3481 atttatcagc acacctatgt tagaaaggga atggtatagc ttgtccccag aacctctaga
     3541 gggggtagaa acatattatg ttgatggagc agctaacagg gacagtaaaa tgggaaaagc
     3601 aggatatatt acagatagag gttttcaaag ggtagaagaa tatctaaata ccaccaatca
     3661 gcagacagaa ttacatgcag taaaactagc tctagaagat agtggaagtt atgttaacat
     3721 agtaacagat tcacaatatg tagtaggtat actagcaagc agacctactg aaacagatca
     3781 ccccatagta aaggaaataa tagaattaat gaaaggaaaa gaaaaaattt atttaagttg
     3841 gctaccagca cacaaaggga taggagggaa tgagcaaata gataagctag taagttcagg
     3901 aatcagaaaa gtcttattcc tacaaaatat agaaccagca caggaagaac atgagaaata
     3961 tcatagcaat gaagcacaat taagagagaa attccactta ccagctctag tagccaaaca
     4021 gattgtgcaa agttgcagta agtgctgtca tcatggagag cccataaagg gacagacaga
     4081 tgcttcactt ggagtctggc agatagattg cacacatctg gaaaatcaaa ttattatagt
     4141 agcagtgcat gtagcttcag gcttcatgaa ggcagaagtt ataacagcag aaactggaaa
     4201 aaagacagca gagtttctgt taaagttagc agcacaatgg cctattagta aactacacac
     4261 agataatggg cctaacttta ctagtcagga agtagaaacc atgtgttggt ggttagggat
     4321 agaacacaca tttggaatac cctataaccc acaaagtcag ggggtagtgg aaaataaaaa
     4381 taagtatcta aaagaattga ttgagaaaat aagagaagat tgcaaagaat taaaaacagc
     4441 agtagccatg gccacattca ttcataattt taaacaaagg ggaggactag gggggatgac
     4501 agcaggagag agaatagtaa atatgatcaa tacagaatta gaatatcaat atcaacaaaa
     4561 tcaaatttca aaaaatttaa attttaaggt ttacttcaga gaaggaagag atcagctgtg
     4621 gaaaggacct ggtatccttt tgtggaaagg agaaggggca gtagttttaa aatatcaaga
     4681 agagataaag atagtaccta gaagaaagtg taaaataata aaagattatg gagagagtgg
     4741 aaagaatagt caggttaact tggaaagtgt ctagtcagag aatagaaaag tggcactggt
     4801 tagtaagaag acagatggca tgggccactg caaataatga ggaaggatgt tggtggctgt
     4861 atcctcattt tatggcttat aatgaatggt atacttgcag taaagtagtg attataataa
     4921 atagggacat aagattaata gttagaagct attggcattt gcaaatagag gtaggatgct
     4981 taagtactta tgcagtaagc atagaagcag tagttagacc gccacccttt gagaaagagt
     5041 ggtgtacaga gataactcca gaggtagcag atcatctaat acatttacat ttttatgact
     5101 gcttcatgga cagtgcagtt atgaaagcca tcaggggaga agaagtgtta aaagtttgta
     5161 gatttccagc tggccataaa gcacaaggtg ttctctcttt gcagtttctc tgcttgagag
     5221 tcatctatgg gccagaagag agatgagcaa gtatcagaag atcaaggacc tcccagagag
     5281 ccatacaatc agtggctagc agatactatg gaggaaataa aggaagaagc aagaaagcac
     5341 ttccctctca ttatcctaaa tgcagtatca gaatattgtg tgcaaaacac agggagtgag
     5401 gaagaggcct gtgagaaatt tattacctta atgaatagag ccatttgggt ccacctagct
     5461 caagggtgtg atggaacctt cagggaaaga agaccacaac tgcccccctc aggattcagg
     5521 ccaagaggag atagattata agcaactgct agaagagtat tatcagcctt tgcaagcttg
     5581 tgagaataaa tgctggtgca agaaatgctg ctttcattgt atgctttgct ttcaaaagaa
     5641 gggtttagga ataaggtacc atgtctacag gaaacgtgta ccaggaacta ataagaagat
     5701 acctggtagt ggtgaagaag ctatacgaag gtaagtatga agtgtccagg tctttttctt
     5761 atactatgtt tagcctacta gtaggtatta taggaaaaca atatgtgaca gtcttctatg
     5821 gagtaccagt atggaaggaa gctaaaacac atttgatttg tgctacagat aattcaagtc
     5881 tctgggtaac cactaattgc ataccttcat tgccagatta tgatgaggta gaaattcctg
     5941 atataaagga aaattttaca ggacttataa gggaaaatca gatagtttat caagcatggc
     6001 atgctatggg aagtatgtta gataccatac ttaagccatg tgtaaagatt aacccatatt
     6061 gtgttaagat gcaatgtcag gaaacagaaa atgtatcagc aacaacagct aagcctataa
     6121 ctacacctac tactacatct acagttgcaa gtagtacaga gatttactta gatgtagata
     6181 aaaataatac agaagaaaag gtagagagga atcatgtatg taggtataac ataacaggac
     6241 tatgcaggga ttcgaaggaa gaaatagtaa caaattttag aggggatgat gtgaaatgtg
     6301 aaaataatac ttgctatatg aatcattgta atgagtcagt taatacagaa gactgtcaga
     6361 agggactttt gataagatgt attttaggtt gtgtgcctcc aggatatgtc atgttaagat
     6421 ataatgagaa gttaaataat aataaattgt gtagcaatat atcagcagtg cagtgtactc
     6481 agcacttagt agccacagta agtagctttt ttggctttaa tggaactatg cataaggaag
     6541 gagaattgat acccatagat gataaatata ggggcccaga ggaatttcat caaaggaagt
     6601 ttgtctataa ggtgccagga aaatatggct taaagataga atgtcacaga aaaggaaata
     6661 ggtcagtagt gagtactcca tcagctacag gattattatt ttatcatggg ttagaacctg
     6721 gaaagaattt aaagaaaggc atgtgcacct tcaaaggacg ttgggggtta gcactttgga
     6781 gtctagctaa agaactaaat aaattaaatg actccatcaa agtgaaccag acctgtaaaa
     6841 attttactag cactggagag gagaacaaac aaaacacgga caagcaaaag gagtttgcca
     6901 aatgcataaa gactcttaag atagataatt atactacatc aggagataga gcagcagaaa
     6961 tgatgatgat gacatgtcaa ggtgaaatgt tcttctgtaa tgtaacaaga atcatgaggg
     7021 catggaatga tcctaatgag aagaagtggt atccttatgc ctcatgtcaa attaggcaaa
     7081 tagtagatga ctggatgcaa gtaggaagaa agatatattt accacctaca tcaggattta
     7141 ataatcacat aaggtgtaca catagggtaa cagaaatgta ctttgaaatg caaaagatag
     7201 atagtaatga aacaaaaatg caaattaaat tcttgcctcc cagtgaaacc tccaatcaat
     7261 ttgttgctta tggagctcat tataaattag tcaaaataat gccaattggc atagcaccta
     7321 cagatgtgaa aagacacact ttacctgaac atcataaaga gaagagagga gcagtaatac
     7381 ttggtatcct tggtctgctc tcgctggcag gatccgcgat gggctcagtg tcggtggcac
     7441 tgactgtcca atctcagtct ttggtgactg ggatagtgga acaacaaaaa cagttgttga
     7501 agctcataga gcaacagtct gaactcttaa aactcaccat atggggagta aagaatttac
     7561 agactcgcct gaccagtttg gagaattata tcaaggacca agctttgctg tctcaatggg
     7621 ggtgttcatg ggcacaggtg tgtcatactt ctgtagagtg gactaataca agcatcactc
     7681 caaattggac atcagaaact tggaaggaat gggagacaag aactgattat ctgcaacaaa
     7741 acattacaga aatgttaaaa caggcatatg atcgagagca aagaaacaca tatgaattac
     7801 agaagttagg agaccttaca tcttgggcaa gttggtttga ctttacttgg tgggttcaat
     7861 acttaaaatg gggagttttc ttagtgttag gaattatagg attaagaatt ttgttagcct
     7921 tatggaatac aataagtagg tttaggcagg gctatcgacc tgtcttttca caggactgcc
     7981 agcagaacct ataccgcaaa cggccagaca acggagaaga agaaagcaac agcttagaac
     8041 taggagagca caactccgag aacttgaagg aagaatcctt aaacagatcc ttgatagagg
     8101 acctgaccag ctttgccagg gagtgaccaa tttggctttg gctgaaaaat ctgagagcag
     8161 caattgaata tgggttcctc gcagtccaag aagcgatcag aagcttgggt tcgctactcg
     8221 tcagctttgc ggcaattagt tggagggccg gttacaccgg atggctacaa gcaaatagaa
     8281 tcttcacagg gtgcagagaa gcaatcattg ctgcggggac gtgcatatgg cacatactca
     8341 gaaggattag acaaagtgca gaacgacccc ttaactaaag atgagaaact tgacttaaca
     8401 cagcaggatc cagaagagga ggaagaagtt ggatttcctg tgtgtcgcca agtttcctta
     8461 agagtgccat catacaaaga tctgatagac ttctctcatt ttataaaaga aaagggggga
     8521 ctgggaggga tatattatag caggagaaga gaagaaatcc tagatctcta tgcagagaat
     8581 gagtggggat ttgaacctgg atggcaacag tatacgacag gtccaggaac cagatatcct
     8641 aagacatttg gattcctgtt taagctggaa ccagtgagca gagctatagg agatgagtat
     8701 gcagctaaca atcatctgtt acactcctcc cagttatgtc ctcaggaaga tccagaagga
     8761 gagaccctca tgtggtctgg gaccctcatc ttgcctatga ctttgcagca ttaacatatc
     8821 accctgagtg tttcaataag gctaagagta ttgaacatct gccattttgg aagaggaagt
     8881 agcctaaccg caaaaccaca tcctactgca gaactgtagt tgcttggcaa cctgcttagc
     8941 aacctggact ggcgcttgcg cgctaggaag ggactttcca aacagggagg gggaggctcg
     9001 ccccatgctg ctatataagc agctgcattt cgcttgttcg ggagtctcta ctacagaggc
     9061 taagggttgt atctctgagc agatcccctt agagcaagga ccagagtcct gagtgactgg
     9121 gtctgagcac ctcactcggg gctgatcacc tcgaggtagt ggaactcctt gcttgcttgc
     9181 tattgtcttc aataaagtaa cttagaatta gagca
//
LOCUS       SIVMNE       9628 bp ss-RNA             VRL       15-AUG-1990
DEFINITION  Simian immunodeficiency virus (SIV) complete proviral genome.
ACCESSION   M32741
KEYWORDS    complete genome.
SOURCE      Simian immunodeficiency virus from captive Macaque nemestrina
            proviral DNA, clone 8.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9628)
  AUTHORS   Benveniste,R.E., Heidecker,G., Greenwood,J. and Gonda,M.A.
  TITLE     ;
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Kindly submitted in computer-readable form by R. Benveniste.  The
            gag protein sequence was reported in J. Virol. 62, 2587-2595, 1988.
            This molecular clone, after transfection into T-cell lines,
            produces infectious viral particles. In particular, clone 8 has
            been inoculated intravenously into two pig-tailed macaques causing
            CD4 lymphocyte depletion; see J. Virol. 62, 2091-2101, 1988.  The
            env cds is truncated as is the case with MM251 and MM142.
            
               Author address:R.Benveniste
               National Cancer Institute
               Frederick, MD
               (301-698-5836)
FEATURES       from  to/span     description
    pept        533     2053     gag polyprotein
    pept    <  1708     4878     pol polyprotein (NH2-terminus uncertain)
    pept       4808     5452     vif protein
    pept       5280     5618     vpX protein
    pept       5619     5924     vpR protein
    pept       5770     6065     tat protein, exon 2 (first expressed exon)
               8280     8376     tat protein, exon 3 (AA at 8281)
    pept       5996     6065     rev protein, exon 2 (first expressed exon)
               8280     8533     rev protein, exon 3 (AA at 8282)
    pept.ps    6072     8779     env protein (premature stop codon)
    pept       6072     8276     env protein
               8280     8717     env protein
    pept       8551     9342     nef protein
    site       8277     8279     env protein in-frame stop codon
BASE COUNT     3294 a   1807 c   2379 g   2148 t
ORIGIN      5' end of 5' LTR R region (putative mRNA start).
        1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
       61 gtagagcctg ggtgttccct gctagactct caccagcact tggccggtgc tgggcagagt
      121 ggctccacgc ttgcttgctt aaagacctct tcaataaagc tgccttttag aagtaagcca
      181 gtgtgtgctc ccatctctcc tagtcgccgc ctggtcaact cggtactcga taataagaag
      241 accctggtct gttaggaccc tttctgcttt gggaaaccga agcaggaaaa tccctagcag
      301 attggcgccc gaacagggac ttgaaggaga gtgagagact cctgagtacg gctgagtgaa
      361 ggcagtaagg gcggcaggaa ccaaccacgg cggagtgctc ctagaaaggc gcgggtcggt
      421 accagacggc gtgaggagcg ggagagaaga ggcctccggt tgcaggtaag tgcaacacaa
      481 aaaagagata gctgtctttt atccaggaag ggataataag atagagtggg agatgggcgc
      541 gagaaactcc gtcttgtcag ggaagaaagc agatgaatta gaaaaaatta ggctacgacc
      601 cggcgggaag aaaaagtaca tgttgaagca tgtagtatgg gcagcaaatg aattagatag
      661 atttggatta gcagaaagcc tgttggagaa caaagaagga tgtcaaaaaa tactttcggt
      721 cttagctcca ttagtgccaa caggctcaga aaatttaaag agcctttata atactgtctg
      781 cgtcatctgg tgcattcacg cagaagagaa agtgaaacac actgaggaag caaaacagat
      841 agtgcagaga cacctagtgg tggaaacagg aacagcagaa actatgccaa aaacaagtag
      901 accaacagca ccatctagtg gcagaggagg aaattaccca gtacaacaag taggtggtaa
      961 ctatacccac ctaccattaa gcccgagaac attaaatgcc tgggtaaaat tgatagagga
     1021 gaagaaattt ggagcagaag tagtgccagg atttcaggca ctgtcagaag gctgcacccc
     1081 ctatgacatt aatcagatgt taaattgtgt gggagaacat caagcagcta tgcagattat
     1141 cagagaaatt ataaacgagg aggctgcaga ttgggacttg cagcacccac aacaagctcc
     1201 acaacaagga cagcttaggg agccgtcagg atcagacatt gcaggaacaa ctagtacagt
     1261 agatgaacaa atccagtgga tgtacagaca acagaacccc ataccagtag gcaacattta
     1321 caggagatgg atccaactgg ggttgcaaaa atgtgtcaga atgtataacc caacaagcat
     1381 tctagatgta aaacaagggc caaaagagcc atttcagagc tatgtagaca ggttctacaa
     1441 aagcttaaga gcagaacaaa cagatccagc agtaaagaat tggatgactc aaacactgct
     1501 gattcaaaat gctaacccag attgcaagct agtgctgaag gggctgggta tgaatcccac
     1561 cctagaagaa atgctgacgg cttgtcaagg agtaggagga ccaggacaaa aggcaagatt
     1621 aatggcagaa gccctgaaag aggcccttgc accagggcca ctcccttttg cagcagccca
     1681 acagaaggga ccaagaaagc caattaagtg ttggaattgt gggaaagagg gacactctgc
     1741 aaggcaatgc agaaccccaa gaagacaggg ctgctggaaa tgtggacaaa tgggccatgt
     1801 tatggccaaa tgcccagaca gacaggcagg ttttttaggc tttggcccat ggggaaagaa
     1861 gccccgcaat ttccccatgg cccaaatgca tcaggggctg acgccaactg ctcccccaga
     1921 ggacccagct gtggatctgc taaaaaacta catgcagttg ggcaaacagc agagagaaag
     1981 caaaaggaag ccttacaagg aggtgacaga ggatttgctg cacctcaatt ctctctttgg
     2041 agaagaccag tagtcactgc tcatattgag ggacagcctg cagaagtatt attagataca
     2101 ggggctgatg attctattgt agcaggaata gagttaggtc cacattatac cccaaaaata
     2161 gtaggaggaa taggaggttt tattaatact aaagaataca aaaatgtaaa aatagaagtt
     2221 ttaggcaaaa ggattaaagg gacaatcatg acaggggaca ccccgattaa catttttggt
     2281 agaaatttgc taacagctct ggagatgtct ctaaatttcc ccatagctaa ggtagagcct
     2341 gtaaaagtca ccttaaagcc aggaaaagat ggaccaaaat tgaggcagtg gccattatca
     2401 aaagaaaaga tagttgcatt aagagaaatc tgtgaaaaga tggaaaagga tggtcagttg
     2461 gaggaagctc ccccgaccaa tccatacaac acccccacat ttgccataaa gaaaaaggac
     2521 aagaacaaat ggagaatact gatagatttt agggaactaa ataaggtcac tcaggacttt
     2581 acagaagtcc aattgggaat accacaccct gcaggactag caaaaaggaa gaggatcaca
     2641 gtactggatg taggtgacgc atatttctcc atacctctag atgaagaatt taggcagtac
     2701 actgctttta ctttaccatc agtaaataat gcagaaccag gaaaacgata catttataag
     2761 gttctgcctc aggggtggaa ggggtcacca gccatcttcc aacacactat gagaaatgtg
     2821 ctggaaccct tcaggaaggc aaatccagat gtgaccttag tccagtatat ggatgacatc
     2881 ttagtagcta gtgacaggac agacctggaa catgacaggg tagttttaca gttaaaggaa
     2941 ctcttaaata gcatagggtt ttctacccca gaagagaagt tccaaaaaga tcccccattt
     3001 caatggatgg ggtatgaatt gtggccaaca aaatggaagt tgcaaaagat agagttgcca
     3061 caaaaagaga cctggacagt gaatgatata cagaagttag taggagtatt aaattgggca
     3121 gctcaaattt atccaggtat aaaaaccaaa catctctgta ggttaattag aggaaaaatg
     3181 actctaacag aggaagttca gtggactgag atggcagagg cagaatatga ggaaaataaa
     3241 ataattctca gtcaggaaca agaaggatgt tattaccaag aaggcaagcc attagaggcc
     3301 acggtaataa agaatcagga caatcagtgg tcttataaga ttcaccaaga agacaaaata
     3361 ctaaaagtag gaaaatttgc aaagataaaa aatacacata ccaatggagt tagactatta
     3421 gcacatgtaa tacagaaaat aggaaaggaa gcaatagtga tctggggaca ggtcccaaaa
     3481 ttccacttac cagttgagaa agatgtatgg gaacagtggt ggacagacta ttggcaggta
     3541 acctggatac cgaaatggga ttttatctca acaccaccac tagtaagatt agtcttcaat
     3601 ctggtaaagg accctataaa gggagaagaa acctattatg tagatggatc atgtaataaa
     3661 cagtcaaaag aagggaaagc aggatatatc acagataggg gcaaagacaa agtaaaagtc
     3721 ttagaacaga ctactaatca acaagcagaa ttggaagcat ttctcatggc attggcagac
     3781 tcagggccaa aggcaaatat tatagtagat tcacaatatg ttatgggaat aataacagga
     3841 tgccctacag aatcagagag caggctagtt aaccaaataa tagaagaaat gattaaaaag
     3901 acagaaattt atgtagcatg ggtgccagca cacaaaggta taggaggaaa ccaagaaata
     3961 gaccacctag ttagtcaagg gattagacaa gttctcttct tggaaaagat agagccagca
     4021 caagaagaac atgataaata ccatagtaat gtaaaagaat tggtattcaa atttggatta
     4081 cccagactag tggccaaaca gatagtagac acatgtgata aatgtcatca gaaaggagaa
     4141 gctatacatg ggcaggtaaa ttcagatcta gggacttggc aaatggattg tacccatcta
     4201 gagggaaaaa taatcatagt tgcagtacat gtagctagtg gattcataga agcagaagta
     4261 attccacaag agacaggaag acagacagca ctatttctgt taaaattggc aagcagatgg
     4321 cctattacgc atctacacac agataatggt gccaactttg cttcgcaaga agtaaagatg
     4381 gttgcatggt gggcagggat agagcacacc tttggggtac catacaatcc acagagtcag
     4441 ggagtagtgg aagcaatgaa tcaccatcta aaaaatcaaa tagatagaat cagggaacaa
     4501 gcaaattcaa tggaaaccat agtattaatg gcagttcatt gcatgaattt taaaagaagg
     4561 ggaggaatag gggatatgac tccagcagaa agattactta acatgatcac tacagaacaa
     4621 gaaatacaat tccaacaatc aaaaaactca aaatttaaaa attttcgggt ctattacaga
     4681 gaaggcagag atcagctgtg gaaaggacct ggtgagctat tgtggaaagg ggaaggagca
     4741 gtcgtcttaa aggtagggac agacattaag gtagtaccca gaagaaaggc taagattatc
     4801 aaagattatg gaggaggaaa agaggtggat agcagttccc acatggagga taccggagag
     4861 gctagagagg tggcatagcc tcataaaata tctgaaatat aaaactaaag atctacaaaa
     4921 ggtttgctat gtgccccatc ataaggtcgg atgggcatgg tggacctgca gcagagtaat
     4981 cttcccacta caagaaaaaa gccaattaga agtacaaggg tattggaatt tgacaccaga
     5041 aagagggtgg ctcagtactc atgcagtgag aataacctgg tactcaagga acttttggac
     5101 agatgtaaca ccagactgtg cagacatttt actgcatagc acttatttcc cttgctttac
     5161 agcgggagaa gtgagaaggg ccatcagggg agaacaactg ctgtcttgct gcaggttccc
     5221 gagagctcat aagacccagg taccaagtct acagtactta gcactgagag tagtaagtta
     5281 tgtcagatcc cagagagaga atcccacctg gaaacagtgg agaagagaca ataggagaag
     5341 ccttcgaatg gctaaacaga acagtagagg agataaacag agaggcagta aaccacctac
     5401 caagggagtt gattttccag gtttggcaaa ggtcttggga atactggcat gatgaacaag
     5461 ggatgtcgca aagctatgta aagtacagat acttgtgttt aatacaaaag gctttattta
     5521 tgcattgcaa gaaaggctgt agatgtctag gggaaggaca tggggcaggg ggatggagac
     5581 caggacctcc tcctcctccc cctccaggac tagcataaat ggaagaaaga cctccagaag
     5641 atgaaggccc acaaagggaa ccatgggatg aatgggtagt ggaggttctg gaggaactga
     5701 aagaagaagc tttaaaacat tttgatcctc gcttgctaac tgcgcttggt aatcatatct
     5761 ataatagaca tggagacacc cttgagggag caggagaact cattaaaatc ctccaacggg
     5821 cgctcttcat gcacttcaga ggcggctgca cccactctag aatcggccaa tctggaggag
     5881 gaaatcctct ctcaactata ccgccctcta gaagaatgct ataacacatg ctattgcaaa
     5941 aagtgttgct accattgcca gttttgtttt cttaaaaagg gcttggggat atgttatgag
     6001 cagtcacgca gaagaagaag aactccgaag aaggctaagg ctaatacatc ttctgcatca
     6061 aacaagtaag tatgggatgt cttgggaatc agctgcttat cgccatcttg tttctaagtg
     6121 cctatgggat ctattgcatt caatatgtca cagtctttta tggtgtacca gcttggagga
     6181 atgcgacaat tcccctcttc tgtgtaacca ggaataggga tacttgggga acaactcagt
     6241 gcctaccaga taatgatgat tattcagaat tggcccttaa tattacagaa agctttgatg
     6301 cttgggagaa tacagtcaca gaacaggcaa tagaggatgt atggcatctc tttgagacct
     6361 caataaagcc ttgtgtaaaa ttaaccccat tatgcattac tatgaaatgc aacaaaagtg
     6421 agacagataa atggggattg acaaaatcat caacaacaac agcaccaaca gcaataccaa
     6481 caaaagcaga ggcaataaaa gtggtcaatg agaatagtcc ttgtataaat catgataatt
     6541 gcacaggctt ggaacaagag ccaatgataa gctgtaaatt caacatgaca gggttaaaaa
     6601 gagacaagag aagagagtac aatgaaactt ggtactctgc agatttggtt tgtgaacaag
     6661 gtaatagcac tgaaaatgaa agtagatgtt acatgaatca ctgtaacact tctgttattc
     6721 aagaatcttg tgacaaacat tattgggatg ctattagatt taggtattgt gcacctccag
     6781 gttatgcttt gcttagatgt aatgacacaa attattcagg ctttatgcct aactgttcta
     6841 aggtggtggt ctcttcatgc acaagaatga tggagacaca gacttctact tggtttggct
     6901 ttaatggaac tagagcagaa aatagaactt atatttactg gcatagcaaa gataatagga
     6961 ctataattag tttgaataag tattataatc taacaatgaa atgtagaaga ccaggaaata
     7021 agacagtttt accagtcacc atcatgtctg gattggtttt ccactcacaa ccaatcaatg
     7081 ataggccaaa acaggcatgg tgtaggtttg aaggaaattg gaaggaggca ataaaagagg
     7141 taaagcagac cattgtcaaa catcccaggt atactggaac taacaatact gataaaatca
     7201 atttgacggc tcctggagga ggagatccgg aagttacctt catgtggaca aattgcagag
     7261 gagagtttct ctactgtaaa atgaattggt ttctaaattg ggtagaagat aagaatctga
     7321 ctggaactac ccagaagcca caggaacggc ataaaaggaa ttacgtgcca tgtcatatta
     7381 gacaaataat caacacttgg cataaagtag gcagaaatgt ttatttgcct ccaagagagg
     7441 gagacctcac gtgtaattcc acagtgacca gtctcatagc aaacatagat tggattgatg
     7501 gaaaccaaac taatatcacc atgagtgcag aggtggcaga actgtatcga ttggaattgg
     7561 gagattataa attagtagag atcactccaa ttggcttggc ccccacaaat gtgaagaggt
     7621 acactactgg tggcacctca agaaataaaa gaggggtctt tgtgctaggg ttcttaggtt
     7681 ttctcgcaac ggcaggttct gcaatgggcg cggcgtcgtt gacgctgacc gctcagtccc
     7741 ggactttatt ggctgggata gtgcagcaac agcaacagct gttggacgtg gtcaagagac
     7801 aacaagaatt gttgcgactg accgtctggg gaacaaagaa cctccagact agagtcactg
     7861 ccatcgagaa gtacttaaag gaccaggcgc agctaaatgc ttggggatgt gcatttagac
     7921 aagtctgcca tactactgta ccatggccaa atgcaaatct aacaccaaat tggaacaatg
     7981 agacttggca agagtgggag cgaaaggttg acttcttgga ggaaaatata acggcccttt
     8041 tagaagaggc acaaattcaa caagaaaaga acatgtatga attacaaaag ttgaatagct
     8101 gggatgtgtt tggcaattgg tttgaccttg cttcttggat aaggtatata caatacggag
     8161 tttatatagt tgtaggagta atactgttaa gaatagtgat ctatatagta caaatgctag
     8221 ctaagttaag gcaagggtat aggccagtgt tctcttcccc accttcttat ttccagtaga
     8281 cccatatccg acaggaccag gcactgccaa ccaaagaagg aacagaagga gacggtggag
     8341 gcagcggtgg caacagctcc tggccttggc agatagaata tattcatttc ctgatccgcc
     8401 aactaatacg cctcttgact tggttattca gcaactgcag aaccttgcta tcgagagcat
     8461 accagatcct ccaaccaata ttccagagat tctccacgac cctacagaga atccgagaag
     8521 tcctcaggac tgaactaacc tacctacaat atgggtggag ctacttccaa gaggcggtcc
     8581 aagtcgcctg gagatctgcg acagagactc ttgcgggcgc gtggggagac ttatgggaga
     8641 ctctgggaag agttggaaga tggatactcg caatccctag gaggatcaga caagggctcg
     8701 agcttactct cttgtgaggg acagaaatac aatcagggac agtttatgaa tactccatgg
     8761 aaaaacccag ctggagagag ggaaaaatta gcatacagaa aacaaaatat agatgatata
     8821 gatgaagaag ataatgactt ggtaggggta ccagtgaggc cacgagttcc cttaagaata
     8881 ataagttaca aattggcagt agatatgtct cattttataa aagaaaaggg gggactggaa
     8941 gggatttatt acagtgaaag aagacataaa atcttagaca tgtacttaga aaaggaagaa
     9001 ggcatcatgc cagattggca gaattacacc tcgggaccag gacctagata cccaaagaca
     9061 tttggctggc tatggaaatt agtccctgta aatgtatcag atgaggcaca ggagggtgag
     9121 gagaattatt tactgcatcc agctcaaact tcccagtggg atgacccttg gggagaggtt
     9181 ctagtatgga agtttgatcc aactctagcc tacacttatg aggcatatat tagataccca
     9241 gaagagtttg gaagcaagtc aggcctgtca gaggaagagg ttagaagaag gctaaccgca
     9301 agaggcctct taaaaatggc tgacaagagg gaaactagct gagacagcag ggactttcca
     9361 taaggggatg tcatggggag gtactgggga ggagccggtc gggaacaccc actttcttga
     9421 tgtataaata tcactgcatt tcgctctgta ttcagtcgct ctgcggagag gctggcagat
     9481 tgagccctgg gaggttctct ccagcactag caggtagagc ctgggtgttc cctgctagac
     9541 tctcaccagc acttggccgg tgctgggcag agtggctcca cgcttgcttg cttaaagacc
     9601 tcttcaataa agctgccttt tagaagta
//
LOCUS       SIVAGM677    2438 bp ss-RNA             VRL       15-AUG-1990
DEFINITION  Simian immunodeficiency virus LTR and gag gene, complete cds.
ACCESSION   M29973
KEYWORDS    .
SOURCE      Simian immunodeficiency virus (isolate 677,(gri-1)) from African
            green monkey.
  ORGANISM  Simian immunodeficiency virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 2438)
  AUTHORS   Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T.,
            Olmstead,R.A. and Hirsch,V.M.
  TITLE     Simian immunodeficiency viruses from African green monkeys display
            unusual genetic diversity
  JOURNAL   J. Virol. 64, 1086-92 (1990)
  STANDARD  full staff_entry
COMMENT     Kindly submitted prior to publication by P. Johnson, Georgetown
            University, Rockville MD (301-496-2976). The remainder of this
            complete genomic sequence will become available later in 1990.  The
            gri-1 isolate is from a monkey imported from Ethiopia.
            
               Author address:P.Johnson
               Georgetown University
               Rockville, MD
               (301-496-2976)
FEATURES       from  to/span     description
    pept        897     2438     gag polyprotein
    LTR           1      688     5' LTR
    rpt         461      588     R repeat 5' copy
    binding     689      706     primer (Lys-tRNA) binding site
BASE COUNT      752 a    503 c    689 g    494 t
ORIGIN      
        1 tggatgggat atattactct gaaagaagag aaaagatcct gaatttgtat gccttgaacg
       61 agtggggaat aatagatgat tggcaagctt actcaccagg cccggggata aggtacccga
      121 gagtctttgg cttctgcttt aagctagtcc cagtggacct gcatgaggag gcacgcaact
      181 gtgagagaca ctgtctgatg catccagcac agatggggga agatcctgat ggaatagatc
      241 atggagaagt cttggtctgg aagtttgacc cgaagttggc ggtggagtac cgcccggaca
      301 tgtttaagga catgcacgaa catgcaaagc gctagtgtca gcactttgcg gttgggactt
      361 tccgccaggg actttccaca gtgggtggat cggaggcggt acaggggcgg tactgggagt
      421 ggctttcccc tcagagctgc ataaaagcag atgctcgctg gcttgtaact cagtctctta
      481 ctaggagacc agctagagcc tgggtgttcg ctggttagcc taacccggtt ggccaccggg
      541 ggtaaggact ccttggcttc atatagctca ataaacctgc tcgcttagtc gctatattgg
      601 agtcaagtgc tcattgctgc gccgagcctc tagaggtgaa cctctcttac tgggttctcc
      661 tgtacccagg tgggagaaac tccagcagtg gcgcccgaac agggacttga gaagaggcat
      721 cggcaccgac cgctgagttg ctgagcgtcg gagagggacg actcaggtag ggtgagagcc
      781 tacgagtttt ttgctaccta gtcagcgaga aaggctaggc cgcgacaggg gcgcgggtcc
      841 cattagtggc aaccaaccca gttggacgaa gggttggtag gggacgggtc ggagcaatgg
      901 gcgggggtca ctcagcactg tcagggagaa gcctcgacac gttcgagaag attaggctac
      961 gtccgaacgg gaaaaagaag taccaaatta aacatttaat atgggcagga aaagaaatgg
     1021 aacgatttgg gttacatgag aaacttttag aaacaaaaga aggctgtcaa aaaatcatag
     1081 aagttttaac cccgttggaa ccgacaggct ccgaggggct aaaagctctg tttaatttgt
     1141 gctgcgtcat ttggtgcatt cacgcagaac agaaagtgaa agacacagag gaagctgtag
     1201 taacagttaa gcaacactac catctagtgg acaaaaatga gaaagcagct aaaaagaaaa
     1261 atgagacaac agcgccacct ggtggcgaat caagaaatta cccagtagta aatcagaata
     1321 atgcctgggt acaccagcct ttgtctccgc gcacgttaaa tgcgtgggtc aaatgcgtgg
     1381 aggaaaaaag gtggggagca gaagtagtcc ccatgttcca agcactctca gagggatgtc
     1441 tctcctatga tgtaaatcag atgctcaatg taataggaga ccatcagggg gcattacaaa
     1501 ttcttaagga agtcattaat gaagaagcag cagagtggga caggacacac agaccaccag
     1561 ctggcccgtt accagcaggg cagctaagag acccgacagg gtcagatata gcaggaacta
     1621 ccagctcaat tcaggaacaa atagagtgga ccttcaatgc caatccaaga atagacgtag
     1681 gggcacaata cagaaaatgg gttattttgg gcttacaaaa ggtagtgcag atgtacaatc
     1741 cccaaaaggt cctagacatt cgacagggac ctaaagaacc cttccaggac tatgtagaca
     1801 gattctataa agccctgaga gcagaacaag caccacagga tgttaaaaat tggatgacac
     1861 aaactttgct tatccagaat gccaatccgg attgtaaatt gattctgaaa ggattgggaa
     1921 tgaatccaac cttggaggaa atgctaatag cttgccaggg agtaggaggg ccacaacata
     1981 aggctaagct aatggtagaa atgatgagta atggacagaa tatggtccaa gtgggacctc
     2041 agaaaaaggg cccccgaggg ccgctaaaat gctttaattg tggcaaattt ggacatatgc
     2101 aaagggaatg caaggcacca agacagatca aatgctttaa gtgcggcaaa attggccata
     2161 tggcaaaaga ctgcaagaat ggacaggcaa attttttagg gtatggccat tggggaggag
     2221 cgaaaccaag aaattttgtg caatacagag gagacacagt tggtctggaa ccaacagccc
     2281 ccccaatgga aacagcttac gatccagcaa agaagctcct ccagcagtat gcagagaagg
     2341 gacagcgcct gagagaggag agagaacaga caaggaaaca gaaggagaaa gaagtggagg
     2401 atgtttcctt gagctccctc tttggaggag accaatga
//
LOCUS       BOVMHDQBQ1    624 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DQ-beta gene, exon 2.
ACCESSION   M30008
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein.
SEGMENT     1 of 2
SOURCE      Bovine (Holstein individual 2042) DNA, clone Q1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 624)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept    /   172  +   438     MHC DQ-beta cell surface glycoprotein, exon 2
                                 (AA at 174)
    pre-msg <     1  >   624     MHC DQ-beta mRNA and introns
    IVS     <     1      171     MHC DQ-beta intron A
    IVS         439  >   624     MHC DQ-beta intron B
BASE COUNT      103 a    192 c    243 g     86 t
ORIGIN      Chromosome 23.
        1 cccgggttca cagcgggagg cgcagggccg ggctggagcg caacaggggt tgagaggcgg
       61 cgggtttcag gtttagggac cctctggcgg cggcggcacc tccccatctg gccgagcggc
      121 gccgcgtggg gctgtggggc tgagcctgac cgagcggctg tctccccgca gaggatttcg
      181 tggtccagtt taagggcctg tgttacttca ccaacgggac ggagcgagtg cggctcgtgg
      241 tcagacacat ctacaaccgg gaggagtacg cgcggtttga cagcgacgtg aacgagtacc
      301 gggcggtgac ctctggggcg ccgcacgccg agtactggaa cagccagaag gacctcctgg
      361 agcagaggcg ggccgaggtg gacagggtgt gcagacacaa ctaccaggtg gctgccccct
      421 tcacctggca gcggctaggt gagtacgggc tgccctccgc gggcccgccc tccacccgag
      481 actcagcgcg ggagggggcc gggtctccag ggcggggttc ccaggcccgc atagggacag
      541 ggaggccggg gcttcgcgga ggggcaggga ccgacgctcc gcggaaatgg acactcgcag
      601 ccctggacct ctccccgcag aggc
//
LOCUS       BOVMHDQBQ2   1151 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DQ-beta gene, exons 3 and 4.
ACCESSION   M30007
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein.
SEGMENT     2 of 2
SOURCE      Bovine (Holstein individual 2042) DNA, clone Q1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1151)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept    +   113      394     MHC DQ-beta cell surface glycoprotein, exon 3
                870  /   980     MHC DQ-beta cell surface glycoprotein, exon 4
    pre-msg <     1  >   980     MHC DQ-beta mRNA and introns
    IVS     <     1      112     MHC DQ-beta intron B
    IVS         395      869     MHC DQ-beta intron C
BASE COUNT      243 a    310 c    324 g    274 t
ORIGIN      Chromosome 23, about 3.7 kb after segment 1.
        1 tggaatccgg ggatcttcct actctggaac cgaggaagga ctcttctcca tgggagacgt
       61 gctgtgcggt ctcatgtctc actgtgtctt ttcctgtctg ttcctccctc agtggaacct
      121 acagtgacca tctccccgtc caggactgag gctctaaacc accacaacct gctggtctgc
      181 tcggtgacag atttctatcc gggccagatc aaggttcggt ggttccggaa tgaccgggag
      241 gagacagctg gtgttgtgtc cacccctctt attaggaacg gggactggac cttccagatc
      301 ctcgtgatgc tggaaatgac cccccagcga ggagatgtct acacctgccg cgtggagcac
      361 cccagcctcc agagtcccat ctcagtggag tggcgtaagg gcacttggtc tcctttcact
      421 gtgggcccta caggataggg cagacagagc ttcccgggtt catcccatct cacctctagt
      481 ccccagcatc cctactgaaa tcagaggaca caagagtgct catacctcat agcaggggca
      541 ttggaagagc ctagttacat tgtctttcca gatacgggag ctcactcaca caccatggcc
      601 ccagagcccc acccagggag ctctgcagga gtgacaggtc caaggttatg catgtgtcct
      661 tgaggggcag ggattggctt tctctgctta ttcaccttcc cagtctgtcc aaggatcttt
      721 tgctgggtcc ctcacctggg ggtggttaga atgaagaact gagttcccct ggtacttcca
      781 cttcctgtac ctcagactgg acttcaggat tctcaaggga cactgtggga tgtggagaca
      841 aatgctgaca ctcaggctct gctccccagg ggcgcagtct gaatctgccc agagcaagat
      901 gctgagtggt gttgggggct tcgtgctggg gctgatcttc ctcgggctgg gcctcattat
      961 ccgtcacagg agccagaagg gtaaggagct ctggggacat ggggaagact ttgactggga
     1021 ccttcttctc agggaggctc tagatgtagc tcttttccct gaccctgaca taaaggaggt
     1081 taaggtggtg gcaggaagaa acaagcaacc tagggagaga ctgaagtctt actttactga
     1141 ttgaaaggta g
//
LOCUS       BOVMHDQBY1    779 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DQ-beta gene, exon 1.
ACCESSION   M30006
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein.
SEGMENT     1 of 4
SOURCE      Bovine (Holstein individual 2042) DNA, clone Y1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 779)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept        467  +   575     MHC DQ-beta cell surface glycoprotein, exon 1
    pre-msg     467  >   779     MHC DQ-beta mRNA and introns
    IVS         576  >   779     MHC DQ-beta intron A
    signal      351      357     CAAT box
    signal      384      391     TATA box
    site        292      304     X box
    site        324      333     Y box
BASE COUNT      204 a    179 c    164 g    232 t
ORIGIN      Chromosome 23.
        1 ggatcctgaa gggctacagt ccatggggtc gtaaagagta gaacacaact cattaattaa
       61 cactttcact tttattttcc catacctcaa attctaagaa caacaggttt taaataaata
      121 tcacagaaat atctactctt gaatcatttt ttttcattat ttaaactcct aaggcattca
      181 atattcagat attttataac tgagagaaca ttttcatctc tatccagtgt aatttgatta
      241 ggacacagtg ccaggcatta gattaagaac cttcaaaaaa aaaatgtcta cccagaaaca
      301 gatgaagttt ttccgctcca ctgctgattg gtcccttttc tagggactct ccaatcttgc
      361 catacatgga agctctcata ggctttttat tctgtgaagt aggctcacca gatccactgt
      421 gtttgagctg tgttgactac cattagttct tcctttgttc tcaattatgt ttgggatggt
      481 ggctctgcgg atccccagag ccctctggac agcagttgtg atggtgaccc tggtgatgct
      541 gagcacccca ggggctgagg gcagagactc accaagtaag tgcagggcag ctgctccctg
      601 gagccaccac actggggagc aggctctgag ggacccttgg gctggggtgt gatcttggga
      661 tactgtcttt tatcacacat ttcctcccat tgggaatgag ggctatgtta cattctcatt
      721 tccaccctct aaggacaagg tgaggacaat tcccctccca caggtttaac cctgggaat
//
LOCUS       BOVMHDQBY2    977 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DQ-beta gene, exon 2.
ACCESSION   M30005
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein.
SEGMENT     2 of 4
SOURCE      Bovine (Holstein individual 2042) DNA, clone Y1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 977)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept    +   559  +   825     MHC DQ-beta cell surface glycoprotein, exon 2
    pre-msg <     1  >   977     MHC DQ-beta mRNA and introns
    IVS     <     1      558     MHC DQ-beta intron A
    IVS         826  >   977     MHC DQ-beta intron B
BASE COUNT      191 a    264 c    338 g    182 t      2 others
ORIGIN      Chromosome 23, about 0.9 kb after segment 1.
        1 actggcgcaa ctgttggaag gcgatcggtg cgggcctctt cgctattagc cagctggacg
       61 aaagggggat gtgctgcaag gcgattaagt tgggtaacgc cagggttttc ccagtcacga
      121 cgttgtaaaa cgacgccagt gccaagctta attctacagg tcctttctca tcccttgaac
      181 tctcctgttg tcgtttgtct ctgaggttcc caggagttca gggtaaaatg ggatttaatg
      241 tgagaatctt ttaagtatag agatggatgc aaaatcaacc tgccgccctg tttacttgat
      301 tctgagcctc tagggatcac aggtcctagg gctctctcag cgtcaggcct cctcacatcc
      361 tgggagccct cagagggggc ggnaagcccg ggttcacagc gggaggcgca gggccgggct
      421 ggagcggaac agggtttgag aggcggctgg tttcaggttt aaagaccccg tggcggcggc
      481 ggcacctccc catctggccg agcggcgccg cgtggggctg tggggctgag cctgacagag
      541 cggctgtctc ccccgcagag gatttcgtgg tccagtttat gggccagtgt tatttcacca
      601 acgggacgga gcgggtgcgg tacgtgacca gatacatcta caaccaggag gagtacgcgc
      661 gcttcgacag cgactggggc gagtaccggg cgctgacccg ctggcggccg gccgccgagt
      721 actggaacag ccagaaggac atcctggagc agacgtgggc cgaggtggac agggtgtgca
      781 gaaacaacta ccaggtggaa gcccccttca cctggcagcg gcaaggtgag tgccggnctc
      841 tccgcggggc cgccctccac ccgccaggac ttcgcgcagg gagggactga gtcctccgag
      901 gcggtcccca gaccctcgaa tgggacagag gggcgctgag ggacagggga ccgagggcac
      961 agcgtatggg gcggggg
//
LOCUS       BOVMHDQBY3   1199 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DQ-beta gene, exons 3 and 4.
ACCESSION   M30004
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein.
SEGMENT     3 of 4
SOURCE      Bovine (Holstein individual 2042) DNA, clone Y1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1199)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept    +   129  +   410     MHC DQ-beta cell surface glycoprotein, exon 3
    pept    +   907  +  1017     MHC DQ-beta cell surface glycoprotein, exon 4
    pre-msg     467  >  1199     MHC DQ-beta mRNA and introns
    IVS     <     1      128     MHC DQ-beta intron B
    IVS         411      906     MHC DQ-beta intron C
    IVS        1018  >  1199     MHC DQ-beta intron D
BASE COUNT      263 a    338 c    321 g    277 t
ORIGIN      Chromosome 23, about 3.7 kb after segment 2.
        1 atctaaatcc aagccttgga atccaacgat ctttccactc tggtatcaag gaatgactcc
       61 tgcccatggg agacatgctg tgcggtctca tgtctcactg tgtcttttcc tgtctgttcc
      121 tccctcagtg gaacctacag tgaccatctc cccgtccagg acagaggctc taaaccacca
      181 caacctgctg gtctgctcgg tgacggattt ctatccgggc cagatcaagg ttcggtggtt
      241 ccggaatgac cgggaggaga cagccggcgt tgtgtccacc cctcttatag ggaatgggga
      301 ctggaccttc cagatcctcg tgatgctgga aatgaccccc cagcgaggag atgtctacac
      361 ctgccgcgtg gagcacccca gcctccagag ccccatcatg gtggagtggc gtaagggcac
      421 ttggtttcct ttcactgtgg gcctaccgga cagggcagac agagcttccc ctgtccatgc
      481 cctctcatcc cttgtcccca gcatcactac tgaactggaa atcacaggac acaagagtgc
      541 tcatgcctcc tagcacaggc atcagaagag ccaaatcaca ttgtcttttc acatacaggg
      601 agctcactgt acacatcatg gccccagagc ccagcctggt agctctgtag aactgactgg
      661 tgaccatagt cttaaggtct aaggttatgg aagtgtccct gagagcaggg atccactttc
      721 accttctctc acctgcccac tgtgtccaaa gatctgttgg tgggtccctc ccctggggtg
      781 gtcagaatgg agagccacgt tcccctgaca cctccacctc ctgtacctca gactagacct
      841 caagcttcct aaaggaatac catgagatgt ggggacaaac gctgacactc gggctctgct
      901 ccccaggggc acagtctgaa tctgcccaga gcaagatgct gagtggtgtt gggggcttcg
      961 tgctggggct gatcttcctc gggctgggcc tcattatccg tcacaggagc cagaagggta
     1021 aggaactctg gggaaatggg aagatgggct gtgattcaga ccctctgttc agatcagcct
     1081 ctgcctctga atgtagctct ttcctcctga tcctgaaacg gggaggcggg gctggggatg
     1141 ggaggaaatg aacaacctag ggagacattg gagtttgact ttactagttt gaaagggta
//
LOCUS       BOVMHDQBY4    883 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DQ-beta gene, exon 5.
ACCESSION   M30003
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein.
SEGMENT     4 of 4
SOURCE      Bovine (Holstein individual 2042) DNA, clone Y1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 883)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept    +   521      534     MHC DQ-beta cell surface glycoprotein, exon 5
    pre-msg <     1      841     MHC DQ-beta mRNA and introns
    IVS     <     1      520     MHC DQ-beta intron D
    site        263      276     MHC DQ-beta g/t cluster implicated to
                                 contribute additional information to
                                 polyadenylation
BASE COUNT      200 a    201 c    226 g    256 t
ORIGIN      Chromosome 23, about 0.3 kb after segment 3.
        1 tttgtgtcat gagatctttt gtagacattg tgacccctag cagaaggtgc tctatttctg
       61 ttctgtgtca gtgggattgt gggacaggta aaggagggaa gggtgtgaga tgagtgtgcc
      121 tgggcgcagt gtctcattca tgacctgttc cctgctatgg aatcaagagt tagggaagaa
      181 gtttctgtag gaggttctgt aggaagctcc tgaggttgtt ccccagaacc aggccataac
      241 tttgatggca cctttctgtg aaacttggag ccagagctct ggtttgaaag atagacacca
      301 ggatatcacc tactttgtgc cacatgttgg tgcctactgc ctgtgggcat ttataagtga
      361 ttgaatgtgg tagaaagaag gtgaactatc actgcaattt actaaaaaat tgaaatcttc
      421 atatccctca gaaggacaac agctgcttcc tggcttccca tgcctccttg ttaggttgaa
      481 tgtgcgtgcc tgtgtgctga tcactctctc tcttctacag ggctcatgcg ctgactcctg
      541 aggatatttt gggattggtg tttgctcttc tataatgtgt gcctgatctt gcccggaatt
      601 cccagattcc tgtcagcctg tcccactctg agatcagagt caggtcacca ggtcatttcc
      661 cgtggccatc ccccaaccac ggatctggct gtgatgctgc ttcctccact gaccctggaa
      721 tctctgcctg tgcgttgtca gctgaatcta ctcagatccc aaaagcttct gacatagaca
      781 tcagaagggg gacggagagt gtccccgcta gtctttagcc cagtgtttag aagctattaa
      841 tcagataaga gagacacctc aaggttgatg gagtttcacc agg
//
LOCUS       BOVMHDRB1     459 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DR-beta gene, exon 2.
ACCESSION   M30012
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein.
SEGMENT     1 of 3
SOURCE      Bovine (Holstein individual 2042) DNA, clone A1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 459)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept    /    21  +   290     MHC DR-beta cell surface glycoprotein, exon 2
                                 (AA at 23)
    pre-msg      21      290     MHC DR-beta mRNA and introns
    IVS     <     1       20     MHC DR-beta intron A
    IVS         291  >   459     MHC DR-beta intron B
BASE COUNT      108 a     92 c    169 g     90 t
ORIGIN      Chromosome 23.
        1 gatctatcct ctctctgcag cacatttcct ggagtattct aagagcgagt gtcatttctt
       61 caacgggacc gagcgggtgc ggttcctgga cagatactac actaatggag aagagaccgt
      121 gcgcttcgac agcgactggg gcgagttccg ggcggtgacc gagctggggc cgcaggaccg
      181 cgagtactgg aacagccaga aggacttcct ggaggagaag cgggccgagg tggacagggt
      241 gtgcagacac aactacgggg gtatggagag tttcactgtg cagcggcgag gtgagcgcgg
      301 gggtggactg gccagtgtgg agcagtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt
      361 gtgtgagaga gagagagaga gacagagaca gagacagaga cagagataga cagacagaaa
      421 cagagatact tcactcactc tggtcgagtg tgtaccgac
//
LOCUS       BOVMHDRB2     427 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DR-beta gene, exon 3.
ACCESSION   M30013
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein.
SEGMENT     2 of 3
SOURCE      Bovine (Holstein individual 2042) DNA, clone A1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 427)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept    +    71  +   352     MHC DR-beta cell surface glycoprotein, exon 3
    pre-msg <     1  >   427     MHC DR-beta mRNA and introns
    IVS     <     1       70     MHC DR-beta intron B
    IVS         353  >   427     MHC DR-beta intron C
BASE COUNT       95 a    124 c    107 g    101 t
ORIGIN      Chromosome 23, about 2.7 kb after segment 1.
        1 ctgaaaggca gctaaccaag gagacttact ctgttgtcct cactgattcc ctccaccttt
       61 tctctcctag tggagcctac agtgactgtg tatcctgcaa agactcagcc cctgcagcac
      121 cacaacctcc tggtctgctc tgtgaacggt ttctacccag gccacattga agtcaggtgg
      181 ttccggaacg cccatgaaga ggaggctggg gtgatctcca caggcctgat ccagaatgga
      241 gactggacct tccagaccat ggtgatgctt gaaacagttc ctcagagtgg agaggtctac
      301 acctgccaag tggatcaccc cagccggacg agccctatca cagtagaatg gagtgagctt
      361 tctgatctca taaatccctc acccactgtg gagggggctt gctttcctct gagtgtcccc
      421 tgagtgt
//
LOCUS       BOVMHDRB3     276 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DR-beta gene, exon 4.
ACCESSION   M30014
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein.
SEGMENT     3 of 3
SOURCE      Bovine (Holstein individual 2042) DNA, clone A1.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 276)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept    +   116  /   226     MHC DR-beta cell surface glycoprotein, exon 4
    pre-msg <     1  >   226     MHC DR-beta mRNA and introns
    IVS     <     1      115     MHC DR-beta intron C
BASE COUNT       61 a     63 c     68 g     84 t
ORIGIN      Chromosome 23, about 0.35 kb after segment 2.
        1 attctgattc ttccgggtag ccttctttcc tcattcccat agttcacaat ttcagcatca
       61 caattagaga agagaatttg ggataaaaat gactaaaact ggcttctttt ctcaggggca
      121 cggtctgact ctgctcagag caagatgatg agtggagtcg ggggcttcgt tctgggtctg
      181 ctcttccttg ccgtggggct cttcatctac ttcaggaatc agaaaggtaa ggagcttgtt
      241 ctttggacag ctgagcctcc ccactgactt ttggag
//
LOCUS       BOVMHDRBE1    483 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DR-beta pseudogene, exon 1.
ACCESSION   M30011
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein;
            pseudogene.
SEGMENT     1 of 4
SOURCE      Bovine (Holstein individual 2042) DNA, clone E4.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 483)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept.ps     167  +   236     pseudo-MHC DR-beta, exon 1
    pre-msg <   167  >   483     pseudo-MHC DR-beta mRNA and introns
    IVS         237  >   483     pseudo-MHC DR-beta intron A
    signal        9       14     CAAT box
BASE COUNT      114 a    100 c    124 g    145 t
ORIGIN      Chromosome 23.
        1 gagctcaccc aatccaggaa caaagatatg agccatttgt tggtatcact tggaatgtgg
       61 gtggaggagg gctcatgtct ttactgagtg agacttccct gctcccccac accttgtctt
      121 ttcctgttct ccagcatggt gtgactgttt ccccagaggc tcctggatgg cagctctgac
      181 agtgatactg atggtgatga accctcccct ggcttgggcc agggacaccc acataagtgc
      241 gtacctttcc ggcgggggtg aggggggtga gctatcatgg gatgggggga aggaagggag
      301 ctagctttgt cactgtattc aggccatgtc ccttaaaatt gtgacatatt cttcatacta
      361 tatatagtgg ctaagctgag tctgaataat tggtaacatt ttctgatgtt catatgtaac
      421 atcagtgtac cttatggtat atttcaatat ataggggaat ttattcattc acattatatt
      481 gaa
//
LOCUS       BOVMHDRBE2    929 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DR-beta pseudogene, exon 2.
ACCESSION   M30010
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein;
            pseudogene.
SEGMENT     2 of 4
SOURCE      Bovine (Holstein individual 2042) DNA, clone E4.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 929)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept.ps +   228  +   500     pseudo-MHC DR-beta, exon 2
    pre-msg <     1  >   929     pseudo-MHC DR-beta mRNA and introns
    IVS     <     1      227     pseudo-MHC DR-beta intron A
    IVS         501  >   929     pseudo-MHC DR-beta intron B
BASE COUNT      203 a    209 c    302 g    215 t
ORIGIN      Chromosome 23, about 5.4 kb after segment 1.
        1 gtcgaccact gaagccactt ggagacctga ggggtctcct ctgcccacct tcgcctccct
       61 gcactgtagg cagatgaaag aagggcccgt ggtagttcag gggtgcctgt ggagccaatg
      121 agggagccct agtggccttc ctgtgcttgg gcagccctca ttggtggccg tcacatcagt
      181 tccttcctgg gagcccacca ggtgaccgaa tcctggtgtg cccacagcac atttgatggt
      241 gcagggcaag tccgagtgtc atttctccat ccggactgag caggtacgat tcttggccag
      301 atacttctat aaccagaagg agttggtgca ttttgtcagc aacgatgtgg gtgagttcag
      361 ggcagtgacc gagcggggca ggctcttcgc tgagagttgg aatcatcaga aggacttagt
      421 ggagtgaacg caggctgtgg tggacacgtt ctgcagatac aactactgga ttggggagag
      481 cttcatcctg cagcagcaag gtgagcacag gggtgggcgg ccaggggact ggggacagtg
      541 tgtgtgtgtg tgtgtgtgtg tgagagagag agagagagac aaagagatag agagactgag
      601 tcccggtgaa tgtgttgtat tatgagcaag tatgcttaag gagagttcct gtgagagcat
      661 gttgcctgga gaaatgacac ttggacttgc cctgcaccat gaaatttgct gtgggaacag
      721 caggattcgg tcaccctggt gggctcccag gaaggaactg atgtgacggc caccaatgac
      781 gggctgccca agcacaggag ggccactagt gctccctcat tggctttaca ggcacccctc
      841 aactaccatg ggttcttctt tcatctgcct gtatgacttt gtcagttatt gtgaaggaag
      901 agacagtgtg tgtggtgggg ggagtacct
//
LOCUS       BOVMHDRBE3    548 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DR-beta pseudogene, exon 3.
ACCESSION   M30002
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein;
            pseudogene.
SEGMENT     3 of 4
SOURCE      Bovine (Holstein individual 2042) DNA, clone E4.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 548)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept.ps +    12  +   293     pseudo-MHC DR-beta, exon 3
    pre-msg <     1  >   548     pseudo-MHC DR-beta mRNA and introns
    IVS     <     1       11     pseudo-MHC DR-beta intron B
    IVS         294  >   548     pseudo-MHC DR-beta intron C
    site        425      427     in-frame stop codon
BASE COUNT      122 a    139 c    134 g    153 t
ORIGIN      Chromosome 23, about 5.8 kb after segment 2.
        1 tttcctccta gtggaggatc ctacagtgac tgtgtatcct gcaaagaccc agcctctgca
       61 gcaccacaac ctcctggtct gctctgtgaa tggtttctat ccaggacacg ttgaagtcag
      121 gtggttccag aacggccatg aagaggctgg agtgatctcc acaggcctga tccagaatgg
      181 agactggacc ttccagaccg tggtgatgct tgaaacagtt cctcagagtg gagaggtcta
      241 cgcctgccaa gtggagcacc ccagccggac gagccctctc acagtggaat ggagtgagaa
      301 gctttctgat ctcgtaagtt cctcacccac caagaagggg gcttgctcac ctctgagtgt
      361 caggtttctc ctctctccat accatatttt ttatttgctt catgctcttt ctttcttagc
      421 acaaattgtt ggggagtagc tctgtgatag cctgtgttag aaatcctctg atagtttaca
      481 gatatcgttt gatagtttct atcaatacct atacctgctg gtgagacagt tcttcctggc
      541 aggcagag
//
LOCUS       BOVMHDRBE4    206 bp ds-DNA             MAM       15-AUG-1990
DEFINITION  Bovine MHC class II DR-beta pseudogene, exon 4.
ACCESSION   M30009
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility protein;
            pseudogene.
SEGMENT     4 of 4
SOURCE      Bovine (Holstein individual 2042) DNA, clone E4.
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 206)
  AUTHORS   Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J.
  TITLE     The nucleotide sequence of bovine MHC class II DQB and DRB genes
  JOURNAL   Immunogenetics 31, 37-44 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            M.A.M.Groenen, 20-NOV-1989.
FEATURES       from  to/span     description
    pept.ps +    86  /   196     pseudo-MHC DR-beta, exon 4
    pre-msg <     1  >   196     pseudo-MHC DR-beta mRNA and introns
    IVS     <     1       85     pseudo-MHC DR-beta intron C
BASE COUNT       47 a     42 c     54 g     63 t
ORIGIN      Chromosome 23, about 0.35 kb after segment 3.
        1 cttccaggca accttcttct cccatcctca aaagcttagg gaagttggat tgggataaga
       61 tcactgaaac ttacttcttt tctaggggca tgatctgact ctgctcagag caggatgatg
      121 agtggagtca ggggctttgt tgtgggtctg ctcttccttg ggatcaggtt gttcatctac
      181 tttaggaatc agaaaggtaa ggatcc
//
LOCUS       VECPCE30      143 bp ds-DNA             SYN       15-AUG-1990
DEFINITION  Expression vector pCE30, partial sequence.
ACCESSION   M36426
KEYWORDS    expression vector.
SOURCE      Synthetic DNA.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 143)
  AUTHORS   Elvin,C.M., Thompson,P.R., Argall,M.E., Hendry,P., Stamford,N.P.J.,
            Lilley,P.E. and Dixon,N.E.
  TITLE     Modified bacteriophage lambda promoter vectors for overproduction
            of proteins in Escherichia coli
  JOURNAL   Gene 87, 123-126 (1990)
  STANDARD  simple staff_entry
BASE COUNT       37 a     35 c     40 g     31 t
ORIGIN      
        1 agggcagcat tcaaagcaga aggctttggg gtgtgtgata cgaaacgaag cattgggatc
       61 cccgggaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc
      121 caacttaatc gccttgcagc aca
//
LOCUS       CHKPPPTH     1723 bp ss-mRNA            VRT       15-AUG-1990
DEFINITION  Chicken parathyroid hormone mRNA, complete cds.
ACCESSION   M36522
KEYWORDS    parathyroid hormone.
SOURCE      Chicken parathyroid gland, cDNA to mRNA, clones cPTH-[11,12,3].
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 1723)
  AUTHORS   Khosla,S., Demay,M., Pines,M., Hurwitz,S., Potts,J.T.Jr. and
            Kronenberg,H.M.
  TITLE     Nucleotide sequence of cloned cDNAs encoding chicken
            preproparathyroid hormone
  JOURNAL   J. Bone Miner. Res. 3, 689-698 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        128      487     parathyroid hormone precursor
    sigp        128      202     parathyroid hormone signal peptide
    matp        221      484     parathyroid hormone
BASE COUNT      626 a    311 c    331 g    455 t
ORIGIN      
        1 ttttaaagtt agatttaagg gatccactaa accaattcag tagtctttaa atatacttga
       61 catcaagaca cagccatctg ctgacatacc ccaaccagaa aactgttaag gacaatatct
      121 gataaaaatg acttctacaa aaaatctggc caaggccata gtgattttat atgctatatg
      181 tttttttaca aactctgatg gaagaccaat gatgaagaga tcggtgagtg agatgcaatt
      241 aatgcataac cttggagagc atcgacacac tgtggagaga caggactggc ttcagatgaa
      301 gctgcaggat gtgcacagtg cccttgagga tgccaggacc cagaggcctc gaaacaagga
      361 ggatattgtc ctgggggaga taagaaaccg gaggctgctc cctgagcatt tgcgggcagc
      421 agtgcagaag aaatccattg acctggacaa agcttacatg aatgtactct ttaaaactaa
      481 gccatgatga aaagaccaag agcattataa ctgtccaagt aagcacatgt ctgtagatca
      541 ctgaccagtt agggcatttt atttattatt ttttttttaa ctcaaactat gataaggatt
      601 aaaggctcca tgccagactg tagccccact gagatgggta tttcacaact aaatagtaaa
      661 gtgtatttat aggccaccca tggccattgc tgctaactcc caggtatctt ttaaatggct
      721 aatgtaactc attaacttcc aggagaatta aaaacaaatg gcaaaacaaa aaacaacaaa
      781 gaccacctgc aatagaataa gaaagttgaa aaacatttaa gaccagttct accactccta
      841 tatggagagc atttgtctgt aatctttaga cctactagta ctgtaaacta acaacgtaat
      901 ataggcataa ctgcattatg cctagggtta aacttcaagt ttgtcctaat gaaaggaacg
      961 caaacttaaa tccactctta ctttcccaag aaggcctaaa gccagaccaa tgtcagtaac
     1021 atagacaaag ctgcatgata ataacttagg attaaagagt gcgaacatga aaaatagaag
     1081 gaacccaaag cttaagatta aagtagaatg aaataaattg tgcatgaaaa agaagaacga
     1141 agttttacaa gatactgaaa tgaaagggag gtttattaac tttccctctt aattatgagc
     1201 tgtcaccttt tggaactgca ggaacagtga gagcagagat tgtagcatat atgtatgcaa
     1261 agccctaact atagaactgg gaaatggttc aacacgagat aaaaacaaga cttgtttcaa
     1321 ttgttatcat ctctccttca gtcaataatc tatgagtttc tgtatattgt gcttaggcca
     1381 catgggtaag tggctcacat aaaattactc atcttcacat gtgcacttat acagaattgg
     1441 gatttcagtt tgttaaaacc ctgaaattac aaccattaaa atatagaaat caaaacctgg
     1501 gaaccatcag ttaaaatata agcaggattc agaaagaatt tgacaggaac atggatggga
     1561 gaaaatgatg ataataatat agaaaagaaa gcagcaaata taaaatgatt ttgaattgta
     1621 tagacaagta tgtgcttatg acctcgacca cttctgaata ataagaatat ttcccctgta
     1681 gaagtgacag cagtttcctc ccaatgttcc actgtgagaa ttc
//
LOCUS       CUC11SGB     1684 bp ss-mRNA            PLN       15-AUG-1990
DEFINITION  Pumpkin 11-S globulin beta-subunit mRNA, complete cds.
ACCESSION   M36407
KEYWORDS    11-S globulin beta-subunit.
SOURCE      Pumpkin (cv. Kurokawa Amakuri Nankin) cotyledon mRNA, clone
            pPG-beta-2.
  ORGANISM  Cucurbita pepo
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Violales; Cucurbitaceae.
REFERENCE   1  (bases 1 to 1684)
  AUTHORS   Hayashi,M., Mori,H., Nishimura,M., Akazawa,T. and Hara-Nishimura,I.
  TITLE     Nucleotide sequence of cloned cDNA for pumpkin 11-S globulin beta-
            subunit
  JOURNAL   Eur. J. Biochem. 172, 627-632 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         31     1473     11-S globulin beta-subunit precursor
    sigp         31       93     11-S globulin beta-subunit signal peptide
    matp         94      918     11-S globulin beta-subunit gamma-chain
    matp        919     1470     11-S globulin beta-subunit delta-chain
BASE COUNT      457 a    406 c    463 g    358 t
ORIGIN      
        1 ctaatagccc ttctcttctc cataccagca atggctcgct cttctctttt taccttttta
       61 tgtttagcag ttttcatcaa tggctgcctc tctcagattg agcagcagag cccctgggaa
      121 ttccaaggca gcgaagtatg gcaacagcac cgctaccaat ctcctagagc ctgtcgtctt
      181 gagaatcttc gagctcaaga ccccgttcgc cgggctgagg cggaggcgat cttcactgaa
      241 gtctgggacc aggacaacga tgagttccag tgcgccggcg tcaatatgat ccgccataca
      301 atccggccca aaggtctgct tcttcctggt ttctctaatg ctcctaaact catcttcgtc
      361 gcccaaggct tcggtattcg cggcattgca atccccggct gtgcagagac ttaccagact
      421 gatttacgaa gatcgcaatc ggccggatct gcgttcaaag accagcatca gaagatccgc
      481 cccttcagag agggagatct cctcgtcgtc ccggccggag tttctcactg gatgtataat
      541 cgaggacagt ccgatctcgt tttgatcgta ttcgctgaca ctcgcaacgt cgcaaaccaa
      601 atcgatccct acctcagaaa attctacctt gccggaaggc cagagcaggt agaaagaggc
      661 gtagaggaat gggaaagaag tagccgaaag ggatcttccg gcgagaaatc aggcaatata
      721 ttcagcggat ttgcagacga atttctagag gaagctttcc agatcgacgg tggactggtt
      781 aggaagctaa agggagaaga cgacgagaga gacagaatcg tgcaggtcga cgaagatttc
      841 gaggtgcttc taccggagaa agatgaagaa gagagatcga gaggaagata catcgaatca
      901 gaatcagaat cggagaatgg cttagaagaa accatttgca cactccgatt aaagcaaaac
      961 atcggccgat ctgttcgcgc cgacgtgttc aacccacgcg gcggccgaat ctccacggcc
     1021 aactaccata ccctccccat tctccgccaa gtccgcctta gcgccgaacg aggagtcctc
     1081 tacagcaacg cgatggtggc gccgcactac acagtgaaca gtcactcagt gatgtacgcg
     1141 acgagaggca acgcgagagt gcaggtggtg gacaacttcg ggcagtcagt gttcgacggc
     1201 gaggtccggg aaggacaggt actgatgatt ccgcagaact tcgtggtgat taaacgagca
     1261 agcgacagag gattcgagtg gatcgcattc aagacgaacg acaacgcaat cacgaatctg
     1321 ctggcggggc gagtgtcgca gatgaggatg ttgccgctgg gagtgctgtc gaacatgtac
     1381 cggatctcga gagaggaggc gcagaggctg aagtacgggc agcaggagat gagggtgctc
     1441 agccccggaa ggtcgcaggg aagaagagag tgaaaatgaa gaagtgggta gtgggtaatg
     1501 ggtaatggga aatatatata tatggtagta gtaatctaat gtaatttagt gaataaagag
     1561 cgagctttca ggtgatgccg ccgacgagcc ctgcttgtta ccggccggaa aaaatggaga
     1621 aatctctcag aaagacaccg agttttaata ataaaagtaa taatattcgc ctcttttttc
     1681 cttc
//
LOCUS       DROKINLA     2175 bp ds-DNA             INV       15-AUG-1990
DEFINITION  D.melanogaster kinesin-like protein (nod) gene, complete cds.
ACCESSION   M36195
KEYWORDS    kinesin-like protein; nod gene.
SOURCE      D.melanogaster DNA.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera; 
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 2175)
  AUTHORS   Zhang,P., Brodeur,B.A., Goldstein,L.S.B. and Hawley,R.
  TITLE     A kinesin-like protein required for the distributive chromosome
            segregation in Drosophila
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by P.Zhang, 06-JUL-1990.
            
              Author address: P.Zhang
              Albert Einstein College of Medicine
              Molecular Genetics Dept.
              1300 Morris Park Avenue
              Bronx, NY 10461
            
FEATURES       from  to/span     description
    pept         72     2072     kinesin-like protein (nod)
    mRNA          1     2175     nod mRNA
BASE COUNT      557 a    594 c    561 g    463 t
ORIGIN      
        1 caaagtaaaa taattacggt gaatgcaagc caattgtgca ttattcaaac aacttcaatt
       61 cttcaatctg catggagggc gccaaattaa gcgcagttcg gattgcggtc cgcgaggcgc
      121 cgtaccgcca gttcttgggg cgtcgggagc ccagcgtcgt ccagtttccg ccatggagcg
      181 acggaaagtc gttaatagtg gatcagaatg aattccactt cgatcacgcc tttcccgcga
      241 ccatcagcca ggatgagatg taccaggcgc tgatcttgcc gctggtggac aagctgctcg
      301 agggattcca gtgcactgca ctcgcctacg gccagacggg aacgggcaag agctactcaa
      361 tgggcatgac acctccggga gagatactgc ccgagcacct gggtattctg cctcgcgccc
      421 tgggcgacat ttttgagcgc gtgaccgccc ggcaggagaa caacaaggat gcgattcagg
      481 tgtacgcctc cttcatagag atctacaatg agaaaccctt cgatctgctg ggctccacgc
      541 cacatatgcc catggtggcg gcgcgttgcc agcgatgcac ctgccttcct ttgcacagcc
      601 aggcggatct gcatcacatc ttggagctag gcactcgcaa tcgacgcgtt cgtcccacca
      661 atatgaattc caatagttcg cgatcccatg ccatagtcac cattcacgtg aagagtaaaa
      721 cccatcactc gcggatgaat attgtggatc tggccggttc agaaggcgtg cggcgaactg
      781 ggcacgaggg cgtggccagg caggagggcg tcaacatcaa tctgggcctg ttgagcatca
      841 acaaggtggt gatgtccatg gcggcgggcc acacagtgat accataccgc gacagcgtcc
      901 ttaccacagt tctgcaggcc tcgctaaccg cgcagtcgta tctgaccttt ctggcctgca
      961 tcagtccgca tcaatgcgat ctcagcgaga cgttgtccac cctgcgtttt ggcaccagtg
     1021 ccaagaagct tcggctgaat ccgatgcaag tggcgcgcca gaagcaatcg ctggccgcac
     1081 ggacaacaca cgtcttccgc caagcgctat gcacctcgac ggccatcaag tcaaacgcag
     1141 ccaatcataa tagcatagtg gttccaaaat ccaaatatag cacaaccaag ccgctgagcg
     1201 ccgtgctcca tcgaactcgc tccgaacttg gcatgacgcc caaagctaag aaaagggctc
     1261 gcgagctatt ggagctggag gagaccacgc tggagctctc gtctatacac attcaggaca
     1321 gcagtctgag tctgttgggt ttccatagcg atagcgataa ggataggcat ttaatgcctc
     1381 ccccaacagg gcaagagcca aggcaagcca gcagccagaa ctctacgcta atgggcattg
     1441 tcgaagagac cgagcccaag gaatcgtcaa aggtgcaaca gtcaatggtt gcccccacgg
     1501 tgcccacaac tgtacgctgc cagctgttca acaccaccat cagtcccatc agtctacggg
     1561 catccagctc tcagcgagaa cttagcggca tccagccaat ggaggagaca gtagtggctt
     1621 cgccacagca gccatgcctt cgtcgttccg tgcgtctagc gagtagcatg cgttcgcaga
     1681 actatggagc cattcccaag gttatgaatt tgcggcgcag cacgcggctg gcgggaatcc
     1741 gggaacatgc cacctccgtt gttgtgaaaa acgagacgga tgcgataccg caccttcgaa
     1801 gtacagtgca aaaaaaacgt acgcgaaacg tgaaacctgc gcccaaggcc tggatggcca
     1861 ataatacaaa atgttttctg gacctgctta acaatggaaa cgttaagcaa ttgcaggaga
     1921 ttccagggat cggtccaaag tccgccttta gtttggcctt gcacagatcc cgcctgggtt
     1981 gcttcgagaa tctttttcaa gtcaaatccc tgcccatttg gtcgggaaat aaatgggaac
     2041 gattttgtca aattaactgt ctcgacactt gatacaatta ctaattaaat agcattttaa
     2101 ttcgaatata gtatagtgat tgttatttat gtggcatata ctttgatttt acaactatag
     2161 taggagtaaa aaaag
//
LOCUS       HAMCADCA     3902 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Hamster carbamoyl-phosphate synthetase mRNA, partial cds.
ACCESSION   J05503
KEYWORDS    carbamoyl-phosphate synthetase.
SOURCE      Hamster cell line 165-28, cDNA to mRNA, clone pCAD142.
  ORGANISM  Mesocricetus auratus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; 
            Cricetini.
REFERENCE   1  (bases 1 to 3902)
  AUTHORS   Simmer,J.P., Kelly,R.E., Rinker,A.G.Jr., Scully,J.L. and Evans,D.R.
  TITLE     Mammalian carbamyl phosphate synthetase (CPS): cDNA sequence and
            evolution of the CPS domain of the Syrian hamster multifunctional
            protein CAD
  JOURNAL   J. Biol. Chem. 265, 10395-10402 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >  3902     carbamoyl-phosphate synthetase (E.C.6.3.5.5;
                                 AA at 3)
BASE COUNT      829 a   1056 c   1135 g    882 t
ORIGIN      
        1 tcaggcccct ggcaccagag gtttctatta agaccccacg ggtattcaat gcagggggtg
       61 cccctcggat ctgtgccttg gactgcggcc tcaagtataa tcagatcaga tgtctctgcc
      121 agcttggggc tgaggttact gtggtgccct ggaaccacga attagacagt cagaagtatg
      181 atggcctttt tctgagtaat ggacctggcg atcctgcctc ttatcctggt gtggtagcca
      241 cactgaaccg cgtcttgtct gagcccaatc cccgacctgt gtttggaatc tgccttggac
      301 accagctgtt ggctttagcc attggggcca aaacttacaa aatgaggtat ggaaaccgag
      361 gccacaacca gccctgttta ctggtgggca ccgggcgctg ctttctgacg tctcagaatc
      421 acgggtttgc cgtggatgca gactcgctgc cagcaggctg gactccgctc ttcaccaatg
      481 ccaacgactg ttccaacgaa ggcattgtac atgacagcct gccctttttc agtgtccagt
      541 ttcacccaga gcaccgagct ggcccttcag atatggaact gctttttgat gtatttctgg
      601 agactgtgag agaggctgta gctgggaacc ccgggggcca gacagttaaa gagcggttgg
      661 tgcagcgcct ctgtccccct gggcttctca ttcctggttc tgggcttcca ccaccacgga
      721 aggttctgat cctaggctct gggggcctct ccattggcca ggctggagaa tttgactact
      781 caggctctca ggccattaaa gccctgaagg aggagaacat ccagacgctg ctgatcaacc
      841 ccaacattgc tacagtgcag acctcgcagg ggctggcaga caaggtctac ttccttccca
      901 ttacacctca ctacgtaacc caggtgattc ggaatgaacg cccagatggt gtgttactga
      961 cttttggggg ccaaacagcc cttaactgcg gtgtagaact gaccaaagcc ggagtgctag
     1021 ctcggtatgg ggttcgggtc ttgggtacac ctgtggagac cattgaactg actgaggacc
     1081 gacgagcctt cgcggccagg atggctgaga tcggagagca tgtagccccc agcgaagcgg
     1141 caaattctct tgaacaggct caggcagctg ctgagcgact gggctaccct gtgctggtgc
     1201 gtgcagcctt tgccctgggt ggtcttggtt ctggctttgc ttccaccaaa gaggaactct
     1261 cagctcttgt ggctccagct ttcgcccata ccagccaggt gctgatagac aagtctctga
     1321 agggctggaa ggagattgaa tatgaggtgg tgagagacgc ctatggcaac tgtgtgacgg
     1381 tatgtaacat ggagaactta gacccactgg gcatccacac tggtgagtcc atagtggtgg
     1441 cgcccagcca gacgctgaat gacagagagt accaacttct gcgacggaca gctatcaaag
     1501 tcacccagca cctggggatc gtcggggagt gcaacgtgca gtatgccttg aacccggagt
     1561 ctgagcagta ttacatcatt gaagtaaatg ccaggctgtc tcgaagctct gccctggcca
     1621 gtaaggccac aggctatcct ctagcctatg tggcagccaa gctggcgttg ggcattcccc
     1681 tgccggagct caggaactct gtcactgggg gaacagcagc ctttgagcct agcctggact
     1741 actgtgtggt aaagattcct cgatgggacc tcagcaagtt cttgcgtgtc agtacgaaga
     1801 ttgggagctg tatgaagagt gttggtgaag tcatgggcat tggacgctca tttgaagagg
     1861 ccttccaaaa ggccctgcgc atggtggatg agaactgtgt gggcttcgac catacagtga
     1921 agccagtcag tgatgtggag ttggagacac caacagataa gcggatcttt gtggtggctg
     1981 ctgctctgtg ggctggctac tcggtggagc gcctgtatga gctcacacgc atcgactgct
     2041 ggttcctgca tcgaatgaag cgtatcgtga cccacgccca gttgctggaa caacaccgag
     2101 gacagccgtt gtctcaagac ctgctgcacc aggccaagtg cctcggcttc tcagacaaac
     2161 aaattgccct tgcagtcctg agcacagagc tggcggttcg aaagctacgt caggaactgg
     2221 gaatctgccc tgcagtgaaa cagattgaca cagttgcggc tgagtggcca gcacagacca
     2281 attacctgta cctgacatac tggggcaaca cccatgacct cgactttcga actcctcacg
     2341 tcctggtcct tggctctggt gtctaccgca tcggctccag tgttgagttt gactggtgtg
     2401 ccgtcggctg catccagcag ctccggaaga tgggttataa gaccatcatg gtgaactaca
     2461 acccagagac agtcagcaca gactatgaca tgtgcgaccg actctacttt gatgagatct
     2521 cctttgaggt ggtgatggac atctatgagc tggagaaccc cgacggcgtg atcctgtcca
     2581 tgggtggaca gctgcccaac aacatggcca tggctctgca tcggcagcag tgccgagtgc
     2641 tgggcacctc cccggaagcg atcgattcag ctgagaaccg gttcaagttc tcccggcttc
     2701 tagataccat cggcatcagc cagcctcagt ggcgtgaact cagtgacctc gagtctgctc
     2761 gccagttctg ccagactgtg gggtacccct gtgtggtgcg cccctcctat gtgctcagcg
     2821 gtgccgctat gaatgtggcc tacactgatg gggacctgga gcgcttcctg agcagtgcgg
     2881 ccgctgtctc caaggagcac cccgtggtca tctccaaatt catccaggaa gcaaaggaga
     2941 ttgatgtgga cgctgtggcc tgcgatggcg tcgtgtcagc cattgccatc tccgagcacg
     3001 tggagaatgc aggtgtgcat tcaggggatg ctacgctggt caccccccca caagacatca
     3061 cccccaaaac tctggagcgg atcaaagcca ttgtgcatgc cgtggggcag gaactacagg
     3121 tcacagggcc cttcaatctg cagctcattg ccaaggatga ccagctgaaa gttattgagt
     3181 gcaatgtgcg tgtctctcgc tccttcccct tcgtgtctaa gacgctgggt gttgacctag
     3241 tggccttggc cacgaggatc atcatgggag agaaggtaga acccatcgga ctcatgacgg
     3301 gctctggagt cgtgggagta aaggtgcctc agttctcctt ctcgcgcttg gcgggtgctg
     3361 atgtggtgct gggcgtggag atgaccagta ctggagaagt agctggcttt ggagagagcc
     3421 gttgtgaggc ctacctcaaa gccatgctta gcactggctt taagatcccc aagaagaaca
     3481 tcctgctgac catcggcagc tacaagaaca aaagtgagct gctcccgact gtgcggttgc
     3541 tggagagcct gggctatagc ctctacgcca gcctgggtac ggcggacttc tacactgagc
     3601 acggggtcaa ggtgacagct gtggactggc actttgaaga ggctgtggat ggcgagtgcc
     3661 cgccacagcg gagcatcttg gatcagctgg ctgagaatca ctttgagtta gtgattaacc
     3721 tgtcaatgcg tggggccggg ggtcgacggc tttcctcctt cgtcaccaag ggctaccgca
     3781 cgcggcgcct ggctgctgac ttctctgtgc ctctcatcat cgacatcaag tgcaccaaac
     3841 tcttcgtgga ggccctgggt cagattggcc ccgccccgcc tttgaaggtt catgtagact
     3901 gc
//
LOCUS       LEIKPDNP      376 bp ds-DNA             ORG       15-AUG-1990
DEFINITION  L.aethiopica kinetoplast DNA.
ACCESSION   M36194
KEYWORDS    .
SOURCE      Kinetoplast L.aethiopica (strain 1467/85) promastigote, clone R3,.
  ORGANISM  Kinetoplast Leishmania aethiopica
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; 
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 376)
  AUTHORS   Laskay,T., Kiessing,R., Rinke de Wit,T.F. and Wirth,D.F.
  TITLE     Generation of species-specific DNA probes for Leishmania aethiopica
  JOURNAL   Unpublished (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.F.Rinke de Wit, 06-JUL-1990.
            
              Author address: T.F.Rinke de Wit
              Leiden University Hospital
              Rijnsbugerweg 10
              2300 RC Leiden
              THE NETHERLANDS
              email:WBLGIPHAR@HLERUL52.BITNET
            
BASE COUNT      113 a     99 c     70 g     94 t
ORIGIN      
        1 ctctaatagc ccaggaccta tcgtcgccac tctccgaact atagaaagac ccgcgctgta
       61 ggcacaatag gaccaactgt actacctgca gtggctagac cactactggc aaatcaatag
      121 aactattacc tttaactata agtgatttaa ctttaaccta taatagaaca ttattcgtcg
      181 ctcattcccg ggccccacgt agcctttccc atgaagttcg tataccgact ctacggttca
      241 agtttatata ccggttcact ccgttgcacc atggtgacct tacgtcacta gatacaattg
      301 atattaataa ttaaatacag ccaagatagg cggcatgtgc cacagagtag cggcaggaag
      361 ccagccaatg agcata
//
LOCUS       LMIB19KP      938 bp ss-mRNA            INV       15-AUG-1990
DEFINITION  L.migratoria basic 19kD hemolymph protein mRNA, complete cds.
ACCESSION   M36206
KEYWORDS    basic 19k protein.
SOURCE      L.migratoria adult female fat body, cDNA to mRNA, clone
            lambda-LmF2.
  ORGANISM  Locusta migratoria
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Orthoptera; Caelifera; 
            Acrididea; Acridoidea; Acrididae.
REFERENCE   1  (bases 1 to 938)
  AUTHORS   Kanost,M.R., Bradfield,J.Y., Cook,K.E., Locke,J., Wells,M.A. and
            Wyatt,G.R.
  TITLE     Gene structure, cDNA sequence, and developmental regulation of a
            low molecular weight hemolymph protein from Locusta migratoria
  JOURNAL   Arch. Insect Biochem. Physiol. 8, 203-217 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.R.Kanost, 06-JUL-1990.
FEATURES       from  to/span     description
    pept         54      572     basic 19k protein precursor
    sigp         54       95     basic 19k protein signal peptide
    matp        108      569     basic 19k protein
BASE COUNT      231 a    267 c    222 g    218 t
ORIGIN      
        1 agctctgctg tctcctgtcc actccacacc acaggctcag taccaggatc aggatgaagc
       61 tggtggtggc tgcagttctc gcgatggccg cgtcgcggtg gcggcgcctg tcggcccacg
      121 gccaggtgcc gtccagcacg tgcgccgaca tgctgcccgt gcacggcaac gcaatgccca
      181 gcacagccct gccctacacc atcaccgtgt cgcccacctc cgtcaacggc ggcgacaccg
      241 tcagagtgca catctcgggc acggaggagt tccgcggcgt ctacctgcag cgaggagggg
      301 ccaagagcag taggagagtt cctgctgccc gccggagaga acaacaagat cgccctgtcc
      361 gactgcccgc cggacacaac aacgccttct catacatttc gcgcacaccc ctggacacac
      421 tggacatcga ctggaaggca ccatacacca gcgatgaaat cgttttcagg gctactttcg
      481 tcaagagctt ctccgagttc tgggtcggcg ttgagtcacc gaagatcaca ttgggaccgc
      541 tacgtcaact tgacaacgca gttgctgctt agtgactgaa gtcgccatat tcatatacga
      601 gcacatccag tactgatgtc ctagtttatc acaacatcgc cgcaccacca ctttcacgtt
      661 ctctactact aaaatggtag ataaatcgct tattacagct gttagctgca tataagagaa
      721 gcgtttcaaa acgagaaact ctttttgatt ttgtactgag ggaattcaag taaagatttg
      781 acaggcagac gtcaccatct tgttcaagac ttggcatcca gtttgcctgt ctgctgtgtg
      841 tttgtagatg ctcacacttc ttgtgatatt tactaccaca aattttgtac tcaagacttg
      901 aagaattgaa atatattctc taattaatat aaaaaaaa
//
LOCUS       MUSALDAA     8190 bp ds-DNA             ROD       15-AUG-1990
DEFINITION  Mouse aldolase A gene, complete cds.
ACCESSION   J05517
KEYWORDS    aldolase A.
SOURCE      Mouse (strain RIII S/J and Blue Spruce (outbred Swiss Webster))
            adult DNA, clone lambda 16.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 8190)
  AUTHORS   Stauffer,J.K., Colbert,M.C. and Ciejek-Baez,E.
  TITLE     Nonconservative utilization of aldolase A alternative promoters
  JOURNAL   J. Biol. Chem. 265, 11773-11782 (1990)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.K.Stauffer, 15-JUN-1990.
FEATURES       from  to/span     description
    pept       4301     4412     aldolase A, exon 4 (E.C. 4.1.2.13) (first
                                 expressed exon)
               4493     4704     aldolase A, exon 5
               5125     5179     aldolase A, exon 6
               5278     5438     aldolase A, exon 7
               5727     5810     aldolase A, exon 8
               5908     6082     aldolase A, exon 9
               6222     6421     aldolase A, exon 10
               6519     6614     aldolase A, exon 11
    pre-msg    1700  >  6614     aldolase A mRNA and introns (alt.)
    pre-msg    1740  >  6614     aldolase A mRNA and introns (alt.)
    IVS        1804     1951     aldolase A intron A (put.)
    IVS        2023     4278     aldolase A intron B (put.)
    IVS        2205     4278     aldolase A intron C (alt.)
    IVS        3256     4278     aldolase A intron C (alt.)
    IVS        4413     4492     aldolase A intron D
    IVS        4705     5124     aldolase A intron E
    IVS        5180     5277     aldolase A intron F
    IVS        5439     5726     aldolase A intron G
    IVS        5811     5907     aldolase A intron H
    IVS        6083     6221     aldolase A intron I (no splice consensus)
    IVS        6422     6518     aldolase A intron J
    signal     3032     3036     CAAT box
    signal     1673     1676     TATA box
    signal     2235     2240     TATA box
    signal     3089     3094     TATA box
    signal     3132     3137     TATA box
BASE COUNT     1676 a   1884 c   2061 g   1876 t    693 others
ORIGIN      
        1 gatccttgct ttttgaagcc ttagaatgaa gccagcattc ctggccttgg gagggcaggc
       61 acgggagact ccaaggcctg gggaaagcaa ctctagtcca aaccagtttc tcttgctggt
      121 tgtagtcttt tgggcaaacc actgagtttc tatctcatta ttttgtgatg agccccccac
      181 gagtgtgacc cccattcaag gtggctcaga agcagagtgc ttgccttgtg tttgtgacat
      241 cccaagttca attcatcact gaggaaaccc ctccctttaa gatttatctt atctctgaac
      301 gttttcccga ttgtatgact cgtatgtatc tgaggaagtc agaagaaatg tcagatcccc
      361 caggatcttg ggatctggag tcgtgatggc tgtgagtcac tgtatatatg tgctggagct
      421 gaactcaggt cctctggaac agccattgct cttaaccact gagccatggt ccggacacct
      481 ggcttagaca gggtcccttt ctgtcagtgg ttctcaacct gtgggttatg gccctttgtg
      541 ggggtggagg tgggtattaa cttatacagg gctgacctaa ggttataaaa acccagatat
      601 ttatgattca taacagcaaa attacaggtg taaagtagca acaaaaattc ttttttggtt
      661 gggagtacca caacatgggg aactgtatta aaaggtagca ttaggaaggt tgggcaccac
      721 tgctctcgta gccctggcta tcctagaact caaatagtag atcaggctgg tccaaactga
      781 cagagatcta tctctgccag cgtcagcact aggaagtgag taaattccat gatagccagg
      841 ccatacagtg aaaccctgtc tcaaaacagg acaagaggaa ccccagtact tagtaggttg
      901 aagtaaggat tgtcattttt tttgaggcca gcttgggttt catggctctt gactagtctg
      961 agctgtagag ggagagcctg tctcacgagg aagcttagga gggagatatt atagtttggt
     1021 ttatgccagc aagaaagtcc aaagtcccag aaattatctt catgaggatt gaaacatgtt
     1081 ttctggtcct gacttcctct aggttgcata gggctttgag agtatagtat acctactatg
     1141 tgcgcataca cacacgcgcg cgcgcgcgca cgcgacacac acaggaccca gtgggacaga
     1201 tactttatca ctgctgctgt tcagcatgga gggagcttct ttccagtgct ttgtctctcc
     1261 gtccactggg cctggtgggt gggtgctcct cagccctctg cttacccacc tctctcttct
     1321 cctttagggt tgggcccctc gatgccctgg cctgctgccc actgtgtgac tgtgcctgtg
     1381 cctgccagct cccagactgc cagagcctca actgcctctg tttcgagatc aagctcagat
     1441 gaaagatggg gctggggacg ttgttctttg gggagtggcc agtccccagg gccccctcta
     1501 tgatcctcag gacatcatta tactggagct atggatggca ggcccagcct aattacctgg
     1561 gttccttgag ttctctgaaa ggcaggattc tgagagccct tggaccgctg aaaagggcct
     1621 gatgctctgg ccagtgcccc tgcctttctt cctctccctt ccctgataaa ctattgtatg
     1681 tgaggtagga tcgagacatt gctcacccag gcaacagtgt gggaggtttc tgccaacctg
     1741 gactatcagg ataaagggat ggccagccac accctgcctt tagactcctg gttattttaa
     1801 gaggtgagta tcctgcctga ctctgctctc ctttggaaaa aaaaaaaaag ttcaaccacc
     1861 agcaggcacc agagtcaagg gaggagggaa ccagaggagg gcagtgggag gcaatatcta
     1921 gatgttttcc cttcttgttc tgccttaaca gatcctggac ctgagactga tttcttgact
     1981 aatttcactg tatttccaag gaagaggttc ctctaaagac cggtgagtga gcagtggcac
     2041 ctcctcctct caaggcaaac caaagctgcc tcttcttcac cccccacgca gggatgaatg
     2101 tcaggagcct caggtttccc taaatatagg tcccggccgc gggattcgtg gtggggaaag
     2161 ggcaggggtt accgagaagg tctgggacac tggtgcgggg gtgtgtaggg gaggggtggg
     2221 gagtaggagc tgccttaaaa cccagccctg gactgccggg ctcactctct gctgaccggg
     2281 ctctgcggct tctgtcactg cgccacaggt gggccgctat ccggattgca ggatgggaat
     2341 gggggttgcg gattgggacc tgaggaaact gactgctctg agagttacag ggtgacaaga
     2401 gagctccgag acggattttt ttattttgga gaaggaaatc aggttcggga aagacctgtc
     2461 tggcttgggc cagtccttgt cggtcatttc ctcaaactgg gtgtgtttag ctcgcgggtg
     2521 gtgcctcccg ccaatctgct aggcaacgcc aggcctggat acgccactca gttccgatgt
     2581 ggccggcaca ctagttctgg gaggttttgc ctgcgtacca tgtcactcgc cgtgctctgg
     2641 ccagggagag atggaatgng ccctgcattt tagtcaagcg acgaagcagg caggcaggga
     2701 ggctccgaag ctctgcgttc ttagcagtga cgtcaggctg caactacaca gccggaagcc
     2761 tgggtcttgg aggagaggcc agccaccatc tcactctgac cccctcccta ctcttcgcca
     2821 acccacattc cggctgagtc acatgttccg cgcgcgccag gcaggggttg gggggggggg
     2881 tgttgggggg ggggggtggt gacctgcggg atgtggctcg agtcacgtcc tagcggggcg
     2941 gaggagggat cgtgttctag ccgcttgtct cctccccagt gccgcctcct atcggagcat
     3001 cttggggcgg tctgcgcaca gtgcccacct tcaattgacg gttcccgtcc ctgcaaggga
     3061 aaaaacctgc agagggcgga gcggcgcctt taaatgtccg gggccccgcc tccggtcccc
     3121 cccaacccag ctgaataggc tgggttctct tggaacgcgc agcagaacca ggttctggtg
     3181 accctagccg ttcgctcctt agtcctttcg cctacccacc ggcgtaccag gcagacccac
     3241 cccgtcctgt gccaggtgag cgccatttac acgtgctcgg ggaagggtct atggggttag
     3301 gatcttgggc cggtggcggg cagtgcagag ccgtcttccc cacggcccct cacttctcct
     3361 ttttctaccc ccacgcttgc ccccagcccn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     3421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     3481 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     3541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     3601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     3661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     3721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     3781 nnnnnnnnnc tggttctctc ttaactctcg cctttgggtt gctatgtggc tttgagcaca
     3841 gatcatttct ttcttgggct ctttcagatg agggtattag gctcctgccc tattcgtgat
     3901 ccttaaattc taaaatatcc cggttcaatt ttgtttctag gcaaggtgac ccatggcaac
     3961 gcgcaggcca gatgggtcag cttcaacatg accgctgtcc tggctctggc ttcttcttcc
     4021 ccagttggcc agtgagcgaa cccactctga gctgggcaac acccagcaac agacagagtt
     4081 aggaaaggta caggaagagg caggtctagt atagggaagt cgggagtagg ggagagctct
     4141 gggacaggaa gtatcccagg accctcaggg agtggggcag gggaggtggg ggctagtgcc
     4201 ctggcctcca ggaagctttg taccggggag accatgggat ggtccaacta agcgctggtc
     4261 tctgcctccc tcacccagga aagcaactgc caccggcacc atgccccacc catacccagc
     4321 actgaccccg gagcagaaga aggagctgtc tgacatcgct caccgcattg tggctccggg
     4381 caagggcatc ctggctgcag atgagtccac cggtgcggta caggagaaga aagggaggag
     4441 gacccaggtt ggagctagca ggctgatccc ttatctccat catgactttt aggaagcatt
     4501 gccaagcgcc tgcagtccat tggcaccgag aacaccgagg agaacaggcg cttctaccgc
     4561 cagctgctgc tgactgcaga cgaccgtgtg aatccctgca ttgggggggt gatcctcttc
     4621 cacgagacac tgtaccagaa ggcagatgat ggacgtccct tcccccaagt tatcaagtcc
     4681 aagggtggtg ttgtgggcat taaggtaaga gggcagactc tggggggggg gtaagattag
     4741 aggaggatct cggagaaagg gattaatagg tagggagggg gtaatatggc tagcaggcct
     4801 agagactcag gtggatgtat cagcataatt ttttttcagt gtttggggtg aacttaggtc
     4861 cttgtgcatg tcggcaagcg cgctgttgcc aacttaatgg ttccctgtga tacaagaagg
     4921 tgatttcatg gtgaagaagt gaaaaggttt tctcagtgtg cagtagcacc aggtccctct
     4981 agtccagtta acattctctc aaatatacac atcttttctc ataaatatgt gcaagccatg
     5041 agaggctaca gtgaaaggtg aagtttgggc ctgggtagag gagacagggg ccataaagct
     5101 gactgctggt ctcctccctg gcaggtagat aagggtgtgg tgcccctggc aggaaccaat
     5161 ggcgagacaa ctacccaggg taagaatgat ctgcctgcct ccttcccttc tccaccagct
     5221 catcagagtt ccagagtgag tctgatcaaa agccttctct ttattcttcc ccttcagggc
     5281 tggatgggct gtctgaacgc tgtgcccagt ataagaagga tggagccgac tttgccaagt
     5341 ggcgctgtgt gctaaagatt ggggaacata ctccctcggc cctggccatc atggaaaatg
     5401 ccaatgttct ggcccgttat gccagcatct gccagcaggt gggattggac tacttcctaa
     5461 cacattgatg cagcgcgggc tagctttctg tctatctgcc aggatatctg cctcctcaga
     5521 gcagctgctc tcaatacccg ctgtggccag gtcttgagtg gaggtctgca atgtagaggt
     5581 ggcaacaggt gtacaggcag attgatagga ttgcttgtcc cctgtaaact gctgaggcct
     5641 ttgaagcctg ggtctctgtc atcaagttaa tggtgaggag gctcctagtc aggaggcctt
     5701 gcctcattac cctgtccctc ccacagaatg gcattgtacc cattgtggag cctgaaattc
     5761 tccctgatgg ggaccatgac ttgaagcgct gccagtatgt tactgagaag gtagtgccat
     5821 ctgctgtaga tagtgtgtgc tgcgcgtagt atcgtttcac ttctcgtctg cnnnnnnnnn
     5881 nnnnnnnnnn nccctgctgt cttccaggtc ctggcggctg tctacaaggc tctgagcgac
     5941 caccatgtct atctggaagg cacattgctg aagcccaaca tggtcacccc tggccatgct
     6001 tgcacccaga aattttccaa tgaggagatt gccatggcaa cggtcacagc acttcgtcgc
     6061 acagtgcccc ctgctgtcac tggtgaggcc actcctcatc ttggtggtga ggtggatgca
     6121 ccatcacatt tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     6181 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     6241 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     6301 nnngccatgg gccttgactt tctcctatgg tcgagccctg caggcctctg ctctaaaggc
     6361 ctggggtggg aagaaggaga acctgaaggc agcccaggag gagtacatca agcgcgccct
     6421 ggtaaggcag gcaggcaggc gtggaagtgt gaacaggtgc ctgggcgggg tggggaggga
     6481 ctcaagaaga gaattcctct gattcctctt ccttttaggc caacagcctc gcttgtcaag
     6541 gaaagtatac cccaagtggc cagtctggag ccgcagccag tgaatctctc ttcatctcta
     6601 accatgccta ctaaccagag ctgaactaag gctgctccat caacactcca ggcccctgcc
     6661 tacccacttg ctattgaaga ggggtcttca ggctctttcc catcactctt gctgccctcg
     6721 tgtgcggtgt tgtctgtgaa tgctaaatct gccatccctt ccagcccact gccaataaac
     6781 aactatttaa gggggagtct gttgttcatg tcttgtaggg tataggggag ggctgaggaa
     6841 agagctactt gggttcttct tcttggacag taaaaggaag gggttttttg accagagctt
     6901 tgagaaaggc atagtattat gggatgttct ttgcctacat ctaattgaag gtaactttta
     6961 cactaattaa tattcagttt aagccaacca agggcttatg aatacttggc aaggattgta
     7021 tcagggctaa cacatttatg cgttttgggg actatggagc tttggagacg agatctctct
     7081 gcagtgacat aggtatacag ctcactgcag aactcttggg ttccaggttg agaatggagc
     7141 ctcagagctg ctgatgttcc ctggtgatag aataagaagc acatcaaacc atgggccact
     7201 gtatcttgcc acattatatt gagtgtagtc ggtgtgctag tgcacacttt aatccagcac
     7261 tcaggaggca gaggcaggca ggaggcaact ggaactcaca aagtgagttc caggacagcc
     7321 agggctatac agagaaaccc tgtcttgaaa aaaaaaaatt ctggcctaaa tgaatggata
     7381 cagtgtatct gcctttggag gccaaaaggc gtgtatcaag tgctagcttc tggcaagata
     7441 agaaacctta aggagtaggg cttcgactat actcagtagc agagtcttgc atggtactca
     7501 tggttgtgag cacatgtggt gctaactgct gagtctctct cagtccatca tactctagta
     7561 tatagtcaga gactctagat actgacgact agactagact cgtcgtctnn nnnnnnnnnn
     7621 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     7681 nnnnnnnnnn nnnnnnnnnn nnnnnnnntt ccttcccaag catctttttc tttgacactt
     7741 tcgttttcag tgatctgcgt agaattgtct tactaggagt atcaaagcat agtctccact
     7801 gtcctaatat tcccatgtat tggccaatag tcaaagctat gcgcaggctg tggatagagc
     7861 ccagtggctg agtacccaaa gctctggttc cttccccagt gctgcaaggg aaaactcaaa
     7921 tccctatgct tccccaaact tcagcctccc attttactgc tcatcacgta cttgtagcct
     7981 tgctctctag aattctgtag cccacactgg ccttgaactc tcaagatctg ctttccaagt
     8041 actgggatga aaggcatgtg ctattctcct agcttctatg aggcgatcct ttttatttta
     8101 tatacattgg tattaactga atgtgtgtat gtgtgtagtg tgatccggta cgagctcgag
     8161 cgtatagtga gtcgatacat catgcgcgct
//
LOCUS       MUSCR2AA     2102 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Mouse complement receptor (Cr2) gene, 5' end.
ACCESSION   M36470
KEYWORDS    Cr2 gene; complement receptor.
SOURCE      Mouse (strain Balb/c) spleen, cDNA to mRNA, clone 31-1.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2102)
  AUTHORS   Kurtz,C.B., O'Toole,E., Christensen,S.M. and Weis,J.H.
  TITLE     The murine complement receptor gene family. IV. Alternative
            splicing of Cr2 gene transcripts predicts two distinct gene
            products that share homologous domains with both human Cr2 and Cr1
  JOURNAL   J. Immunol. 144, 3581-3591 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         67  >  2102     complement receptor (Cr2)
BASE COUNT      590 a    472 c    452 g    588 t
ORIGIN      
        1 ctcttcctct ccttgctaca ggctcacaac tcacagagcg caacctgcca ttggactgct
       61 gcacacatgg gatccttggg ttcgctctgg gttttcttca ctctcatcac tccaggagtt
      121 cttggtcagt gtaagttgct gccaaagtat tcttttgcta aaccttctat tgtgagtgat
      181 aaatctgagt ttgccattgg aacaacttgg gaatacaaat gtcgccctgg gtattttagg
      241 aagtcattta ttatcacctg cttagaaacc tccaagtggt cagatgctca gcagttctgt
      301 aaacgtaaac catgtatgaa tcctcaagaa cccctccatg gttctgtgca tataaacacg
      361 ggtatcgagt ttgggtcaac aattacgtat tcttgtaatc aaggatatcg actcattggt
      421 gactcgtctg ctacatgtat tgtatcagac aatactgtaa tgtgggataa tgatatgcct
      481 ctttgtgaat ctattccttg tgagtcacct ccagccatct ccaatggaga cttctacagc
      541 agcagcagag acagcttttt ctatgggatg gtagtaactt attattgcca taccggaaag
      601 aatagggaaa aactgtttga tctggtgggt gagaagtcaa tatattgtac cagcaaagac
      661 aatcaagttg gcatctggaa tagtccacct cctcagtgta ttcctagagt caagtgccca
      721 atgccagaaa ttgaaaatgg actagtggag tctggattta aacactcctt cttcttaaat
      781 gatacagtaa tatttaagtg caaatctggc tttaccatga aaggcagcag aatagcatgg
      841 tgccagccaa acagcaaatg gagccctcca ttgccaacat gcttcatggg atgtctacca
      901 cctcaaaata tcctccatgg tgattataac aaaaaggatg agttcttctc tgttggccag
      961 aaagtgtcat atacgtgtaa ccctggctat actctcattg gaactaacct cgtggagtgt
     1021 acatccttgg gaacctggag caatacagtc ccgacatgtg aagtgaaatc atgtgatgca
     1081 attccaaacc atcttctcca tggccgtgtg tttcttcccc ctaatctcca gcttggggca
     1141 gaggtttcct ttgtttgtga cttagggttc cagttaaaag gcaaaccttc tagtcagtgt
     1201 atcccagaag gagagacagt aatctggaat aataagtttc ctgtctgtga acagatttct
     1261 tgtgaccctc ctcctgaagt caaaaatgct cggaaaccct attattctct tcccatagtt
     1321 cctggaactg ttctgaggta cacttgttca cctagctacc gcctcattgg agaaaaggct
     1381 atcttttgta taagtgaaaa tcaagtgcat gccacctggg ataaagctcc tcctatatgt
     1441 gaatctgtga ataaaaccat ttcttgctca gatcccatag taccaggggg attcatgaat
     1501 aaaggatcta aggcaccatt cagacatggt gattctgtga catttacctg taaagccaac
     1561 ttcaccatga aaggaagcaa aactgtctgg tgccaggcaa atgaaatgtg gggaccaaca
     1621 gctctgccag tctgtgagag tgatttccct ctggagtgcc catcacttcc aacgattcat
     1681 aatggacacc acacaggaca gcatgttgac cagtttgttg cggggttgtc tgtgacatac
     1741 agttgtgaac ctggctattt gctcactgga aaaaagacaa ttaagtgctt atcttcagga
     1801 gactgggatg gtgtcatccc gacatgcaaa gaggcccagt gtgaacatcc aggaaagttt
     1861 cccaatgggc aggtaaagga acctctgagc cttcaggttg gcacaactgt gtacttctcc
     1921 tgtaatgaag ggtaccaatt acaaggacaa ccctctagtc agtgtgtaat tgttgaacag
     1981 aaagccatct ggactaagaa gccagtatgt aaagaaattc tctgcccacc acctccacct
     2041 gttcgtaatg gaagtcatac aggcagcttt tcagaaaatg taccatatgg aagcacagtt
     2101 ac
//
LOCUS       NEUALCA      1639 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  N.crassa allantoicase (alc) gene, complete cds.
ACCESSION   J02927
KEYWORDS    allantoicase.
SOURCE      N.crassa (strain Oak Ridge), clone pALC-1.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; 
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1  (bases 1 to 1639)
  AUTHORS   Lee,H., Fu,Y.-H. and Marzluf,G.A.
  TITLE     Nucleotide sequence and DNA recognition elements of alc, the
            structural gene which encodes allantoicase, a pirine catabolic
            enzyme of Neurospora crassa
  JOURNAL   Biochemistry (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.A.Marzluf, 12-JUL-1990.
FEATURES       from  to/span     description
    pept        250      340     allantoicase (alc), exon 1
                413     1386     allantoicase (alc), exon 2
    IVS         341      412     alc intron A
    signal      129      135     TATA box
BASE COUNT      383 a    441 c    443 g    372 t
ORIGIN      
        1 cgttgcagat cgaatacgac ggttaggtac gacgaagaag gaccacgatt gtcgttgctg
       61 ttacgtactt tgacctcctc aacgcactat cttgcttaag ctatcgctct tgtctgtcgc
      121 tgtggtgata taaattctgc gcctgctctt ggtttattcc gaggacgctc gttccatctc
      181 tgtttttttt ttctctctgt gacatcgagg actgaagtct cacttattca aatacacatt
      241 tccctcacca tgaccgacat cgattacaag ctcgaggctg ttccggccac tcggattgcc
      301 gccgatgata tcgacaagac tttccgttcc agcaccatcg gtccgtagca tccatctcac
      361 caaacatggc aacccaaacc tttcaactaa cggaagtcga gctgggatac agatcttatc
      421 tcaggggctc tcggtggcaa ggtttccggt ttctcggacg aatggttcgc cgaagcagcc
      481 aacctcctca ctcctacagc cccaatccgc cagccgggaa agatggttta caccggcgcc
      541 tggtatgacg gatgggagac aaggagacac aaccctgccg agttcgactg ggttgtgatc
      601 cgtctgggcg tcgcctcggg taccgtcgag ggtgtcgaga ttgacacggc tttcttcaac
      661 ggcaaccatg cgcccgccat ctcggtcgag ggttgcttca gccaaaacga cgatgaggtt
      721 ctgtcatgga agggcgagct gggtggatgg gagactattc ttggcgttca agagtgcggc
      781 ccttcgcaga gattctgctg gaaactcgag aaccctacca agaagcagta cacccatgtg
      841 cgactaaaca tgtaccccga cggcggcatt gccaggttcc gtctgtttgg acacgccgta
      901 ccggtcttcc ccgacaatac ggatgccatc tttgacttgg cggctgccca gaacggcgga
      961 gttgcgatct cctgcagtga ccagcacttt ggtaccaagg acaaccttat ccttccgggc
     1021 cgcggcaagg acatgggcga cggttgggag acagcacgct cgcgcaccaa gggccacgtc
     1081 gactggacca tcatcagact cggcgcgccc ggctacattc agaatttcat ggtcgacacg
     1141 gctcacttcc gcggtaacta cccccagcag gtcaagctgc aacgtatcga gtggaagagc
     1201 gagggcaggc cgggagcgga ttctgagggc tggacagagg ttgttgagcc catcaagtgc
     1261 ggtcccgatc aggaacaccc tgtcgagagc ttggtgaagg acaagccgtt cacccacgtc
     1321 aagctcatca ttgtgcctga cggcggagtg aaaagactgc gggtgtttgc gaagagggct
     1381 gtttaagaaa ttaccaagct atatatctga aggcaattat tcggtgagag cagcatttac
     1441 ggggagccat caacagcgag cgatccacat aaaaaggggg aggacctcat ttagtatgat
     1501 gggcaacgag tgcagtcatt tagccgcgaa gaatcgaaat ctctcagatc tttgattgtc
     1561 tgcgcttaag taacaaagtc taattctcaa tcagctttcg tcgtagagta aaattagaag
     1621 gatgcacggc tgcccacga
//
LOCUS       RATINHA      1561 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Rat inhibin alpha-subunit mRNA, complete cds.
ACCESSION   M36453
KEYWORDS    inhibin.
SOURCE      Rat female (strain Sprague-Dawley) ovary, cDNA to mRNA, clone
            rINA-13.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1561)
  AUTHORS   Woodruff,T.K., Meunier,H., Jones,P.B.C., Hsueh,A.J.W. and Mayo,K.E.
  TITLE     Rat inhibin: Molecular cloning of alpha- and beta-subunit
            complementary deoxyribonucleic acids and expression in the ovary
  JOURNAL   Mol. Endocrinol. 1, 561-568 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        256     1356     inhibin alpha-subunit precursor
    sigp        256      954     inhibin alpha-subunit signal peptide
    matp        955     1353     inhibin alpha-subunit
    mRNA    <     1     1561     inhibin alpha-subunit mRNA
BASE COUNT      308 a    465 c    440 g    348 t
ORIGIN      
        1 ggacactaga atgctgtgtt gttagaggag tggagagagg aagatgtgct aagtgtagca
       61 gtacacacct ataatcctag cacttgagag gttgaaggca ggaggatgag acattcaggt
      121 cattcttagc tacatgaaga gtttaaggcc agcacggatt acaggatatc tgtttctggg
      181 gaaaaaggag gggaagagag agaggaaagg gcaaagggca gagtgtgggc tccctgtcgt
      241 cagggcaaga gaactatggt gatccagccg tctctgctgc tccttttgct gttgactcta
      301 caggatgtgg acagctgcca ggggccagaa cttgtccggg agcttgtcct ggccaaagtg
      361 aaggcactat tcctagatgc cttggggccc ccagcaatgg atggggaagg tgggggtcct
      421 ggaataaggc ggctgcctcg aagacatgcc cttgggggct tcatgcacag gacctctgaa
      481 ccagaggagg aggatgtctc ccaggccatc cttttcccag ccacaggtgc cacctgtgag
      541 gatcaggcag ctgctggagg gcttgcccag gagcctgagg aaggtctctt cacttatgta
      601 ttccggccat cccaacacat acgcagccac caggtgactt cagcccagct gtggttccac
      661 acggggctcg acaggaagag cacagcagcc tccaatagct ctaggcccct gctagatctt
      721 ctggtgctgt catctggggg gcccatggct gtgcctgtgt ccttgggaca gagcccccca
      781 cgctgggctg tcctgcacct ggcggcctcc gctttccctc tgttgaccca ccccatcctc
      841 gtgttgctgc tgcggtgccc actctgttct tgctcaggcc ggcctgagac cactcctttc
      901 ctggtggccc acactagggc tcgagccccc agtgcggggg agagggctcg acgttcagct
      961 ccctcgatgc cttggccttg gtctcctgca gccttgcgtt tgctgcagag gcctccagag
     1021 gaaccctctg cccatgcctt ctgccatcga gctgccctca acatctcctt ccaggagctg
     1081 ggctgggacc gctggatcgt acaccctccc agcttcattt tccactactg ccatggtagc
     1141 tgcgggatgc ccacatctga tctgcccctg ccagtccctg gggctccccc taccccggct
     1201 cagcccctgt ttttggtgcc aggggccaag ccctgctgtg cagctctacc agggagcatg
     1261 aggtccctac gcgtccgaac cacctcagat ggaggctact ctttcaagta tgagatggta
     1321 ccgaacctca ttacacaaca ctgtgcttgt atctaaaagc acctcgtctc ctcctccaca
     1381 gccactggcc accatcacct caccatccca cggtcggtcg gtcggtcggt cgtcagctag
     1441 gaggaaggtg ggtgtggaaa gtagacagtt tccacttcct tttcccttca tctttctgtc
     1501 tgaggcttcc acaccccact ccacccaggt cctgtggata acaataaaga aggaagtgtg
     1561 t
//
LOCUS       RATINHB      1543 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Rat inhibin beta-A-subunit mRNA, complete cds.
ACCESSION   M37482
KEYWORDS    inhibin.
SOURCE      Rat female (strain Sprague-Dawley) granulosa cell, cDNA to mRNA,
            clone rINB-5.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1543)
  AUTHORS   Woodruff,T.K., Meunier,H., Jones,P.B.C., Hsueh,A.J.W. and Mayo,K.E.
  TITLE     Rat inhibin: Molecular cloning of alpha- and beta-subunit
            complementary deoxyribonucleic acids and expression in the ovary
  JOURNAL   Mol. Endocrinol. 1, 561-568 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        163     1437     inhibin beta-A-subunit precursor
    sigp        163     1086     inhibin beta-A-subunit signal peptide
    matp       1087     1434     inhibin beta-A-subunit
    mRNA    <     1     1543     inhibin beta-A subunit mRNA
BASE COUNT      435 a    356 c    454 g    298 t
ORIGIN      
        1 ctctgacctc atgagacaag agccggctgg caaaacagaa gggacccgaa agagaatttg
       61 ctgaagagga gaaggaaaaa agtccaaaaa acctgtacgt gaggggtggg gaggaaaagc
      121 agggccttta aagaaggcaa ccacacgact tttgctgcca ggatgccctt gctttggctg
      181 agaggatttc tgttggcaag ttgctggatt atagtgagga gttcccccac cccaggatcc
      241 gaggggcacg gcgcagcccc ggactgcccg tcctgtgcgc tggccaccct tccgaaggat
      301 ggacctaact ctcagccaga gatggtagag gctgtcaaga agcacatctt aaacatgctg
      361 cacttgaaga agagacccga tgtcacccag ccggtaccca aggcggcgct tctcaacgcg
      421 atcagaaagc ttcatgtggg taaagtgggg gaaaacgggt atgtggagat agaggacgac
      481 attggcagga gggccgaaat gaatgaactc atggagcaga cctcggagat catcaccttt
      541 gccgagtcag gcacagccag gaagacactg cattttgaga tttccaagga aggcagtgac
      601 ctgtcagtcg tggagcgtgc agaagtctgg ctcttcctga aagtccccaa ggccaacagg
      661 accaggacca aagtcaccat ccgtctgttt cagcagcaga agcatccaca gggcagcttg
      721 gacatggggg atgaggccga ggaaatgggc ttgaaggggg agaggagtga actgttgcta
      781 tcagagaaag tggtagatgc tcggaagagc acttggcaca tcttcccagt gtctagcagc
      841 atccagcgcc tgctggacca ggggaagagt tccctggatg tgcggattgc ttgtgaacag
      901 tgccaggaga gcggtgccag cctagtgctc ctgggcaaga agaagaagaa agaggtggat
      961 ggagacggga agaagaaaga cggaagtgac ggagggctgg aagaggaaaa agaacagtca
     1021 cacagacctt tcctcatgct gcaggctagg cagtctgaag accatcctca ccgcaggcgt
     1081 aggcggggct tggagtgtga tggcaaggtc aacatttgct gtaagaaaca gttctttgtc
     1141 agcttcaagg atattggctg gaatgactgg atcattgctc cctctggcta tcatgccaac
     1201 tattgtgagg gtgagtgccc aagccacata gcaggcacct ctgggtcctc actctccttc
     1261 cactcaacag tcattaacca ctaccgcatg aggggtcaca gcccctttgc caaccttaag
     1321 tcatgctgtg tgcccaccaa gctgagaccc atgtccatgc tgtattatga tgatggtcaa
     1381 aacattatca aaaaggacat tcagaacatg attgtggagg agtgtggctg ctcctagagt
     1441 tgccaggtcc cagagcaaat ggatctaggg tgtccaggaa aagacagtgg caaatgaaga
     1501 aaaatatata agatttctgc ctaaacaaga caaccagaaa aat
//
LOCUS       RSBMNP       1201 bp ss-RNA             VRL       15-AUG-1990
DEFINITION  Bovine syncytial virus major nucleocapsid protein (N) mRNA,
            complete cds.
ACCESSION   M35076
KEYWORDS    major nucleocapsid protein.
SOURCE      Bovine syncytial virus (strain A51908) MDBK cell, cDNA to mRNA.
  ORGANISM  Bovine syncytial virus
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Spumavirinae.
REFERENCE   1  (bases 1 to 1201)
  AUTHORS   Samal,S.K., Zamora,M., McPhillips,T.H. and Mohanty,S.B.
  TITLE     Molecular cloning and sequence analysis of bovine respiratory
            syncytial virus mRNA encoding the major nucleocapsid protein
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by S.K.Samal, 12-JUL-1990.
            
              Author address: S.K.Samal
              Univ Maryland at College Park
              Dept. Veterinary Medicine
              College Park, MD 20742
            
FEATURES       from  to/span     description
    pept         16     1191     major nucleocapsid protein
    mRNA          1     1200     major nucleocapsid protein mRNA
BASE COUNT      434 a    196 c    270 g    301 t
ORIGIN      
        1 ggggcaaata caaaaatggc tcttagcaag gtcaaactaa atgacacttt caacaaggat
       61 caactgttat caaccagcaa atatactatt caacgtagta caggtgacaa cattgatata
      121 cccaattatg atgtacaaaa acatctcaat aagttgtgtg gtatgctact aataacagaa
      181 gatgccaatc ataaatttac aggattgata ggtatattat atgctatgtc ccgattgggg
      241 agagaagata cccttaaaat actcaaagat gcaggctacc aagtaagggc caatggggtt
      301 gatgtgataa cacatcgaca ggatgtgaat ggaaaagaaa tgaaatttga agtgctaaca
      361 ttagtcagct taacatcaga agttcaaggc aatatagaaa tagagtcaag gaagtcttac
      421 aaaaagatgc taaaagagat gggagaggta gccccagaat acagacatga ctctcctgat
      481 tgtggtatga tagtgctatg tgttgctgct ttggttataa caaaattagc agcaggtgat
      541 agatcaggcc tcactgcagt cattaggaga gccaacaatg tactaaggaa tgaaatgaaa
      601 cgatacaaag gacttatccc gaaagatata gctaacagct tctatgaagt gattgaaaag
      661 taccctcatt acatagatgt attcgtacat tttggcattg ctcaatcctc aactagagga
      721 ggtagtaggg tagaaggaat ctttgcaggg ttattcatga atgcatatgg agcaggtcaa
      781 gtgatgttaa gatggggtgt attagccaaa tcagtcaaga acattatgct tggtcatgcc
      841 agcgtgcaag cagaaatgga acaggttgta gaggtctatg aatatgcaca aaagttaggt
      901 ggagaagctg gtttttatca catattgaac aaccctaaag catcactgtt atccttaaca
      961 caattcccca acttctctag tgtagtccta ggcaatgctg caggactagg tataatgggt
     1021 gagtatagag gtacaccaag aaaccaagac ttgtatgatg ctgccaaagc atatgcggaa
     1081 caattaaaag agaatggggt catcaattac agtgtattag atctgactac agaggaacta
     1141 gaggcaatca agaaccaatt gaatcccaaa gacaatgatg tggaactgtg agttaataaa
     1201 a
//
LOCUS       URELOCAB      558 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  U.urealyticum urease locus proteins A and B, complete cds.
ACCESSION   M36190
KEYWORDS    urease locus-encoded protein.
SOURCE      U.urealyticum (serotype 8) DNA.
  ORGANISM  Ureaplasma urealyticum
            Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; 
            Mycoplasmatales; Mycoplasmataceae.
REFERENCE   1  (bases 1 to 558)
  AUTHORS   Willoughby,J.J., Russell,W.C., Thirkell,D. and Burdon,M.G.
  TITLE     PCR primers that detect Ureaplasma species and a study of the
            urease locus by 'PCR walking'
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.J.Willoughby, 27-JUN-1990.
            
              Author address: J.J.Willoughby
              University of St. Andrews
              Biochemistry and Microbiology
              North Street
              St. Andrews, Fife KY16 9AL
              SCOTLAND
            
FEATURES       from  to/span     description
    pept         23      349     urease locus protein A
    pept        436      522     urease locus protein B
BASE COUNT      209 a     78 c    111 g    160 t
ORIGIN      
        1 tttataagga gataatgatt atatgtcagg atcatcaaat caattcactc caggtaaatt
       61 agtaccagga gcaattaact tcgctgaagg cgaaaatgtg atgaacgaag gtagagaagc
      121 aaaagtaatc agcattaaaa atactggtga ccgtcctatc caagttggat cacatttgca
      181 cttatttgaa acaaatagtg cattagtatt ctttgatgaa aaaggaaacg aagacaaaga
      241 acgtaaagtt gcttatggac gtcgtttcga tattctcagt actgctattc gttttgaacc
      301 aggagacaaa aaagaagttt cagttattga tttagtcgga acacgttgaa gtttgaggtg
      361 taaacggctt agttaacggc aaaaccttaa aaaataatct atttacaagt ttctatatag
      421 acgaagggga acattatgtt taaaatttca agaaaaaatt actcagatct atatggtatc
      481 acaactggtg atagcgttag attaggagac acaaatcttt gagttaaagt tgaaaaagac
      541 ttaactactt atggcgaa
//
LOCUS       YSCFUR1A     2123 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  S.cerevisiae uracil phosphoribosyltransferase (FUR1) gene,
            complete cds.
ACCESSION   M36485
KEYWORDS    uracil phosphoribosyltransferase.
SOURCE      S.cerevisiae (strain FL100, ATCC 28383) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 2123)
  AUTHORS   Kern,L., de Montigny,J., Jund,R. and Lacroute,F.
  TITLE     The FUR1 gene of Saccharomyces cerevisiae: Cloning, structure and
            expression of wild-type and mutant alleles
  JOURNAL   Gene 88, 149-157 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        895     1650     uracil phosphoribosyltransferase (FUR1)
    mRNA        886     1791     FUR1 mRNA (alt.)
    mRNA        888     1791     FUR1 mRNA (alt.)
    signal      841      848     TATA box
    signal     1896     1901     poly-A signal
BASE COUNT      659 a    427 c    392 g    645 t
ORIGIN      
        1 atcgataaaa gaactaatgt ttcccaaaga aataggaaaa agggaataaa gaataatagg
       61 ccccacaaag acataaacag cagtcctgac tggggcaact gcacagagga accgattggc
      121 agagcgaaaa agcaaacggc atgaacaggg ccaagaactc tcggaatttt accactaata
      181 ttaaattgca gcgacaacat tttggcgaag aaatacaagg tggccagcca gccttgtgat
      241 atctacaaat tcagatgctt cagataaatt gttaatgcta ttcaacctaa ctttgggagt
      301 aaaccaagaa aacttgaaaa atgttctgga aaacatttct caggtgcaga tagctcaaat
      361 tagggttaga gacctgcctt caggatctgc caccgctaag gtccgtctgg catatcctac
      421 aacacagtct ttggagaagg taagaaaact gttccatggc gctctagttg atggaaggcg
      481 catccaagtg gtgattgcat ctgatgaatc gtcccacttg tcgtattaga gtttgtcaac
      541 gacactcaca aggtatttaa tcagcaaaat ccccgccaca aactattttt ttgaagacat
      601 gctttctcat gactgcctaa taacaatacc tcattctact agtaatcgac ctatgtaatt
      661 atttcataaa ctataaagca ggtcattgca ataacagaaa ggccggtttt tctataagct
      721 tatctcatcg cataaaaaat cgacagttgt aattatctcc ggcggacttt tccctttccg
      781 tctttttttt caaaattttt ttttttttca cttcttcttt caaagctgcc tcaaaagaga
      841 tatatatatt ggtaagaatc ctcttccaat actagcttca tttcttcttg aaccatgaac
      901 ccgttattct ttttggcttc tccattcttg taccttacat atcttatata ttatccaaac
      961 aaagggtctt tcgttagcaa acctagaaat ctgcaaaaaa tgtcttcgga accatttaag
     1021 aacgtctact tgctacctca aacaaaccaa ttgctgggtt tgtacaccat catcagaaat
     1081 aagaatacaa ctagacctga tttcattttc tactccgata gaatcatcag attgttggtt
     1141 gaagaaggtt tgaaccatct acctgtgcaa aagcaaattg tggaaactga caccaacgaa
     1201 aacttcgaag gtgtctcatt catgggtaaa atctgtggtg tttccattgt cagagctggt
     1261 gaatcgatgg agcaaggatt aagagactgt tgtaggtctg tgcgtatcgg taaaatttta
     1321 attcaaaggg acgaggagac tgctttacca aagttattct acgaaaaatt accagaggat
     1381 atatctgaaa ggtatgtctt cctattagac ccaatgctgg ccaccggtgg tagtgctatc
     1441 atggctacag aagtcttgat taagagaggt gttaagccag agagaattta cttcttaaac
     1501 ctaatctgta gtaaggaagg gattgaaaaa taccatgccg ccttcccaga ggtcagaatt
     1561 gttactggtg ccctcgacag aggtctagat gaaaacaagt atctagttcc agggttgggt
     1621 gactttggtg acagatacta ctgtgtttaa ataaatcaca cccgaacacc atcttgaagg
     1681 ttcagaacgg ctgaagccat atcaactttg ggtttctact gttttaaatt tcctttctcg
     1741 ttttaaactt ttgttgccgt ctcttctact atcaattttt gttgttcatg catgtttaat
     1801 tacctttttt gtaaaaataa tataaacgta ccaatggtca tttataacaa atatgcttga
     1861 aaaatctaac gactctgttt cttacattag gttcgaataa acacggtaca tgtcctctag
     1921 ccaatctgac atttttggtc caaagtcttt gaaaggtaga taaccccgtt aaaatagaac
     1981 caccaatcca tgtagtatat tttctttctg aaggggctat aatctttatc taggatgttc
     2041 ctttggttaa tgcctccaaa tcccatagca ttcggtctcc aaagccttta agcgttgtag
     2101 ttccgccact taggattatc gat
//
LOCUS       YSCMET16A    1986 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  S.cerevisiae 3'-phosphoadenylyl sulfate reductase (MET16) gene,
            complete cds.
ACCESSION   J05591
KEYWORDS    3'-phosphoadenylyl sulfate reductase.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1986)
  AUTHORS   Thomas,D., Barbey,R. and Surdin-Kerjan,Y.
  TITLE     Gene-enzyme relationship in the sulfate assimilation pathway of
            Saccharomyces cerevisiae: Study of the 3'-phosphoadenylylsulfate
            (PAPS) reductase structural gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Surdin-Kerjan, 28-JUN-1990.
FEATURES       from  to/span     description
    pept        792     1562     3'-phosphoadenylyl sulfate reductase (MET16)
    signal      686      692     TATA-box
    site        643      648     cis-acting element in general control of AA
                                 synthesis
    site        612      618     UAS (methionine metabolism)
BASE COUNT      670 a    368 c    379 g    569 t
ORIGIN      
        1 atgcatcttg cctctttgat attggttgga tcttcttatg gcttccacga actctcttgt
       61 gtaaatatct ggatttctac cgtcctcaat gtattgaaca acttccaagg gaatgtccac
      121 cttagacaag ctggattgag gatcgttgct tctcacgttc agcttgtaca agcgatccac
      181 atttctttgc aagttggtga tcattccctt ggtggcttct ggagtaccag gaaaatcata
      241 tatcgagaca cctaattcaa cgaaggactc aataatcgaa gccacttggt cttgagtagt
      301 ggccagttct tgctgcaatt gttcattgtt agtgctgttt ccattcatct tatcggttta
      361 tttttctata tatttgcctc tttctcaaac aggagttagt agttaaaagt acgaagttct
      421 tgttctttaa tgcgcgctga caaaagaatt ggataaaaga gaatggtggg gggacaagaa
      481 ggaaatttgt cctagtttaa catgaatggc atcttgttac cgggtggaca tcacctattg
      541 attctaaata tctttacggt ttatcatact gttctttatt ccgtcgttat tctttttatt
      601 tttatcatca tttcacgtgg ctagtaaaag aaaagccaca acatgactca gcaaatctcg
      661 acaaagtaaa agctcataga gatagtatta tattgatata aaaaaagtat actgtactgt
      721 ttgtaacctt ttcaatgctt taagatcaaa actaaggcca gcaaaggtat caacccatag
      781 caactcataa aatgaagacc tatcatttga ataatgatat aattgtcaca caagaacagt
      841 tggatcattg gaatgaacaa ctaatcaagc tggaaacgcc acaggagatt attgcatggt
      901 ctatcgtaac gtttcctcac cttttccaaa ccactgcatt tggtttgact ggcttggtta
      961 ctatcgatat gttgtcaaag ctatctgaaa aatactacat gccagaacta ttatttatag
     1021 acactttgca ccatttccca caaactttaa cactaaaaaa cgagattgag aaaaaatact
     1081 accagcctaa aaatcaaacc attcacgtat ataagccgga tggatgtgaa tcggaggcag
     1141 attttgcctc gaaatacggg gatttcttat gggagaaaga tgatgacaag tacgattatc
     1201 tggccaaagt ggaacctgca catcgtgcct acaaagagct acatataagt gctgtgttta
     1261 ctggtagaag aaaatcacaa ggttctgccc gctcccaact gtcgattatt gaaatagacg
     1321 aacttaatgg aatcttaaaa ataaatccat tgatcaattg gacgttcgag caggttaaac
     1381 agtatataga tgcaaacaat gtaccataca acgaactttt ggaccttgga tatagatcca
     1441 ttggtgatta ccattccaca caacccgtca aggaaggtga agatgagaga gcaggaagat
     1501 ggaagggcaa ggcaagaccg agtgtggaat tcatgaagcc agccgattcg cgcaattttt
     1561 aaagcaagat gcctagatag atagagtacg atatataacc atatgtatgt gactaattat
     1621 ttattcctta ataacaccaa tgattacaac tttctaaagc tggcggagaa ttcgcgctgt
     1681 acgagaaaag agcgaaaaca gaggaatatt caaactaaga accaaactgc gataaagagg
     1741 attgaaagga aaaacgaaag aaaaggtaaa ctgacaaata tatacattaa ccgatgggta
     1801 atttcagatt tcctataaaa accaagctac caccagggtt tatcaatgct cgcatactta
     1861 gggataactt caaaagacaa caatttaaag agaatgaaat ccttgttaaa tctttgaaat
     1921 tcatcgctag aaatatgaac cttccaacaa aactgaggtt ggaggctcag ttaaaactaa
     1981 atgcat
//
LOCUS       CHT59KD      2429 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  C.trachomatis 59-kDa immunogenic protein (SK59) gene, complete cds.
ACCESSION   M31119
KEYWORDS    antigen; immunogenic protein.
SOURCE      C.trachomatis L2 (strain LGV-2 434BU) elementary body DNA, clone
            beta-1.
  ORGANISM  Chlamydia trachomatis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE   1  (bases 1 to 2429)
  AUTHORS   Kahane,S., Weinstein,Y. and Sarov,I.
  TITLE     Cloning, characterization and sequence of a novel 59-kDa protein of
            Chlamydia trachomatis
  JOURNAL   Gene 90, 61-67 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.Weinstein, 05-JAN-1990, for release after publication.
FEATURES       from  to/span     description
    pept        466     2043     59-kDa immunogenic protein
BASE COUNT      700 a    626 c    417 g    686 t
ORIGIN      
        1 ggatcccgaa ttgggtaact ctcagaccca cacataaggc catatgctcg agtacgtgag
       61 ccactccact agaatcttgc gggcaagtcc gaaaagaaat attgaacaca ttttcatcat
      121 catcattcac gatcatcatg atcgttgccc cggtcggagt atgttccact tcaattagct
      181 tgctctcgat ctcgggaaga tcctgactca acttgactac aaaatttcta taggtatccc
      241 agttttcata tcccactcaa tcttctataa tagagaagct tgttgcatct ccctattttc
      301 gattcaccta acatagaaga cagctactgt gagctcttat atccacacaa atattctttc
      361 tgaaggcttc tcttattaaa aaaaaagacg ggactcgatt gagtccccat actagactag
      421 cttcctaaaa tataaggcca ggactactcg tctgatttca agacgatgaa tcgcaccaca
      481 tctcccttga gaaaccataa ggagaacatt ctctcccttt cgagtttttc aaaacctgat
      541 ttaactcttc aacggaagcg acctcgctgc ctattcaccg ctaagataag ctgtccagga
      601 gcgacgcctg cagaagctgc aggcgagctg cctccacagc aactaccaga atcctcgggt
      661 atctgctgcc aatccgagtt tcttacaaat ttctggagta atgtaagtct cacggactcc
      721 catcttctgg caacgctgaa acgccatcct ctgttgggtc tgtgtaaccg tcacaggtat
      781 ctcgcttgtt ttcccttcac gacgattttt aaaataacac gagtccctgg catcattagg
      841 gaaatggcat tacgcaacgc actcaaagac tctacttctt tttccattgt aagccacaat
      901 gacatcttct tccagccccg ctttttctgc tggagaacct ttaacaacat ccgtcaccaa
      961 acgttccgta cacttttcca atttgtaaca agtagccaat tcagaatcta tcggttgcaa
     1021 ggtaactccc aaaaagcctc ttgttacctg cccatcacta atcaattgat caatgactcg
     1081 tttagccatc aagctaggaa tagcaaaccc tattccaata tatcccccgc taccactgac
     1141 aatggcagta ttaaccccga taacttgacc attgattgtt taacaatgga ccgccctgaa
     1201 ttcccaggat taatggcagc atctgttgta acaaatcttc gaaatctaca atatgtagat
     1261 gatttcttcc tttcagcact aacgaccccg atagtgaccg ttgcttgcaa tccaaaagga
     1321 tttccaatag caatagccca gtcacctatc tgcagtcgat cagaattccc aaaagtcaaa
     1381 aatggtaatt tctctgctgt aattttgatc acagcaagat ctgtttttgg atctaacccc
     1441 acgatcttag gctgtgtatt tttgtccatc gtggagagta acatgaattt ttcctgcatc
     1501 ctcgactaca tgatggttag taacaacata accattcgaa ttcagaaaca tagaacccag
     1561 ttcctcttac agcatcacgc cgctgctgcg gacgctgctg ctctctatcg aaggcaaccc
     1621 aaaaaatcga ttaaaaaatt cgtcattaaa ataatcaaaa acaaaagggt tctcttgaag
     1681 cctcttttgt ttcctggaga agcaatagcc tggttccctg ttttaggaaa attttcatat
     1741 atatcaactc caggacgttg ccttagacgc gcgacccgag taaaacctcg ggatacttct
     1801 ttaggagatc ttcttgtgaa acctcttgat ctccgtgagg atactgcaag acaaatatca
     1861 gccattaaga atctttcttt gacgcactat agcctagcat tggcgaagag aaaacgtgat
     1921 gtcgatagca acacacataa taataatctt ttcaatcatc ttttccttga taagcgatct
     1981 gcgtctagcc cggtttttca tttatgcacc ataacaagca gatatgcagc atacaaaatc
     2041 taatgatgca aatcaaggag actactctga tgattctcca atctaaaaaa ctaacgtggt
     2101 tttagaacgg atgcaaccgg cctctccaat cagtgcagga gattctacaa cggtaacccc
     2161 tgcctgtctc aaagcttctt gtttgctaaa agcatcccca cttttccctg aaataatagc
     2221 tcctgcatgt cccatacgtt tccctttggg gagccgtagc tcctgcaata aatgcaatca
     2281 caggcttact actatgttga cgtatccaat ctgcagcttc ctcttcagcg cttccaccaa
     2341 tctccaatca taagaacagc ttctgtttgg ctatcctttt caaactcttg gagaggcatc
     2401 gataaaagat gtgccacttt aaaggatcc
//
LOCUS       AFAAZU        810 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  A.denitrificans azurin (azu) gene, complete cds.
ACCESSION   M30388
KEYWORDS    azurin.
SOURCE      A.denitrificans (strain NCTC8582) DNA.
  ORGANISM  Alcaligenes denitrificans
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Aerobic rods and cocci; Alcaligenaceae.
REFERENCE   1  (bases 1 to 810)
  AUTHORS   Hoitink,C.W.G., Woudt,L.P., Turenhout,J.C.M., van De Kamp,M. and
            Canters,G.W.
  TITLE     Isolation and sequencing of the Alcaligenes denitrificans
            azurin-encoding gene: Comparison with the genes encoding blue
            copper proteins from Pseudomonas aeruginosa and Alcaligenes
            denitrificans
  JOURNAL   Gene 90, 15-20 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.W.Canters, 01-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept        307      756     azurin (azu) precursor
    sigp        307      366     azurin signal peptide
    matp        367      753     azurin
    site        190      203     fnr-box
    site        251      267     ntrA-box
    binding     296      299     ribosome binding site
    signal      778      800     terminator
BASE COUNT      160 a    238 c    245 g    167 t
ORIGIN      
        1 cccgccgctg tgctgccttg catgctcgaa ctctacttgt ttgcaattgt ttgcaggcat
       61 cctacgaaga tggaagaccc ttcgtattgc ggtttgtcaa tgggcacggt ttcggtgcgc
      121 cggatgggcc aataccccta tgcggcatgg ggatttcccc tgtttttggg catctgaacg
      181 gggtgggatt gatgtccgtc aatagcgcgc ttttttcgcc gtcttagact tgtgcgtggc
      241 ggcagcgacg caggcatgtg cctggcgcga gtcgaagaat ggccgccctg tttacggaga
      301 gtctccatgc tggcaaaagc caccctagct atcgttctgt ccgcagccag cctgcccgtg
      361 ctggctgctc aatgcgaagc aaccatcgaa agcaacgacg ccatgcagta caacctgaag
      421 gaaatggtcg ttgacaaaag ctgcaagcag ttcacggtgc acctcaagca cgtcggcaag
      481 atggccaagg tcgccatggg ccacaactgg gtgctgacca aggaagccga caagcagggc
      541 gtcgccactg acggcatgaa cgccggcctg gcgcaggact acgtgaaggc gggcgatacc
      601 cgtgtcatcg cgcacaccaa ggtcatcggc ggcggcgaat cggattcggt aacgttcgac
      661 gtgtccaagc tgaccccggg cgaagcctat gcctacttct gctcgttccc cggccactgg
      721 gccatgatga agggcacgct caagctgagc aactgacccc gccctagcgc gcagataccg
      781 gcccagggcc ggtttttttt gtcttggggc
//
LOCUS       PSEAZU       1287 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  P.aeruginosa azurin (azu) gene, complete cds.
ACCESSION   M30389
KEYWORDS    azurin.
SOURCE      P.aeruginosa (strain CIT135) DNA.
  ORGANISM  Pseudomonas aeruginosa
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 1287)
  AUTHORS   Hoitink,C.W.G., Woudt,L.P., Turenhout,J.C.M., Van De Kamp,M. and
            Canters,G.W.
  TITLE     Isolation and sequencing of the Alcaligenes denitrificans
            azurin-encoding gene: Comparison with the genes encoding blue
            copper proteins from Pseudomonas aeruginosa and Alcaligenes
            denitrificans
  JOURNAL   Gene 90, 15-20 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by G.W.Canters, 01-DEC-1989, for release after publication.
FEATURES       from  to/span     description
    pept        213  <     1 (c) ORF1
    pept        489      935     azurin (azu) precursor
    sigp        489      548     azurin signal peptide
    matp        549      932     azurin
    pept    >  1287      985 (c) ORF2 (AA at 1287)
    site        318      333     ntrA-box
    site        403      416     fnr-box
    signal      958      985     terminator (bidirectional azu and ORF2)
    binding     476      481     ribosome binding site
    binding     224      220 (c) ribosome binding site (ORF1)
BASE COUNT      229 a    423 c    428 g    207 t
ORIGIN      
        1 ctgcaggctc tgcgggatga tcccgatcac ttcgctgccg gcggccaatg cggcgtccgc
       61 cacggtgccc atcagaccga ccgcgccgcc accgtagacc agggtcaggc cgcgctcggc
      121 caggtgccgg ccgagggcca cggcggcttc ctggtagacc ggggaagcgc cggggctggc
      181 gccacagaat acgcagacgg aacgcaaggt catgatcgac tcctgtcggg ggtggaaaaa
      241 ggcgcacagg gtagcggctg ggagcgcttc gaccaagccg tgcgaagcgt tgccggacgt
      301 tgcgtcgcag gcgcgaagcg gcacatctgt gctaaaacag gagttccccg tagtaaacgc
      361 cgggcagatc ccgctcgatg ccccgccacg tccggttcgg gtttgacctg aatcagtgga
      421 actcggtgcc cgatcgggca gtctgctctt tcaggattca tcgcccaacc tgcctaggag
      481 gctgctccat gctacgtaaa ctcgctgccg tatccctgct gtccctgctc agtgcgccgc
      541 tgctggctgc cgagtgctcg gtggacatcc agggtaacga ccagatgcag ttcaacacca
      601 atgccatcac cgtcgacaag agctgcaagc agttcaccgt caacctgtcc caccccggca
      661 acctgccgaa gaacgtcatg ggccacaact gggtactgag caccgccgcc gacatgcagg
      721 gcgtggtcac cgacggcatg gcttccggcc tggacaagga ttacctgaag cccgacgaca
      781 gccgcgtcat cgcccacacc aagctgatcg gctcgggcga gaaggactcg gtgaccttcg
      841 acgtctccaa gctgaaggaa ggcgagcagt acatgttctt ctgcaccttc ccgggccact
      901 ccgcgctgat gaagggcacc ctgaccctga agtgatgcgc gagcgatccg ctgcatgaaa
      961 aagcccggcc gctgccgggc tttttcatgg gcgcgcgccg ggctcagcgc gcgtagctgc
     1021 cgccatcgcc tcgccggcca gttggtgcac gcgccgggtc ggatgccact cgtcccagaa
     1081 gtagtactgg tccgggttgg cgcaggccgg gcggacgctg ggctgggtcg gctggcaggg
     1141 cgcgtccagc tccaccaggc catagcgcgc cgggttgcgc cgcaagtggc ggctgaaggt
     1201 gagatggtcg aaccagctca gctccaggcc gcgggtcttg cgcagggcgg cgagctggat
     1261 cggcaggctg gcgttgactg cctgcag
//
LOCUS       MZEADH1CM    6167 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  Z.mays alcohol dehydrogenase (ADH-1 C-m allele) gene, complete cds.
ACCESSION   M32984
KEYWORDS    alcohol dehydrogenase.
SOURCE      Z.mays DNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 6167)
  AUTHORS   Osterman,J.C. and Dennis,E.S.
  TITLE     Molecular analysis of the ADH1-Cm allele of maize
  JOURNAL   Plant Mol. Biol. 13, 203-212 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by J.C.Osterman, 18-MAR-1990.
FEATURES       from  to/span     description
    pept       1217     1250     alcohol dehydrogenase, exon 1 (ADH-1) (EC
                                 1.1.1.1)
               1785     1921     alcohol dehydrogenase, exon 2
               2019     2065     alcohol dehydrogenase, exon 3
               2482     2807     alcohol dehydrogenase, exon 4
               2894     2976     alcohol dehydrogenase, exon 5
               3070     3145     alcohol dehydrogenase, exon 6
               3487     3548     alcohol dehydrogenase, exon 7
               3636     3731     alcohol dehydrogenase, exon 8
               3823     3984     alcohol dehydrogenase, exon 9
               4085     4201     alcohol dehydrogenase, exon 10
    pre-msg    1110  >  4201     ADH-1 mRNA and introns
    IVS        1251     1784     ADH-1 intron A
    IVS        1922     2018     ADH-1 intron B
    IVS        2066     2481     ADH-1 intron C
    IVS        2808     2893     ADH-1 intron D
    IVS        2977     3069     ADH-1 intron E
    IVS        3146     3486     ADH-1 intron F
    IVS        3549     3635     ADH-1 intron G
    IVS        3732     3822     ADH-1 intron H
    IVS        3985     4084     ADH-1 intron I
BASE COUNT     1574 a   1335 c   1378 g   1880 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccaata ggctagtcac ttttacttta gcttctgaga tccaaacagt cacttaggac
       61 atgtttggaa gcacaccagt ttttaaaaaa ctttttccta tcctcaattt ctagaaaatg
      121 gtttatgaaa aaaaatttgg gtgggatgtt tgtaacccag tttctagttt tttttataaa
      181 gagagtagct tcttggtttt agttagagga gagtagcttc ttggttttta agaaactggg
      241 aatccagttt ctataaactg gaacataaat aagtatattt ggaatcactt tagtttgtac
      301 aaaccgattt cttagaaatt ggatgcttat aaataggccc tcaatgtcct tgttgggttt
      361 atgaaattta catctattac cacattttta aaaatagagg aagagtatgc tagtagttat
      421 gtataaaaaa actagaaact gtttttttta aaaaaaaact gagttccagt ttcctttatc
      481 taattctttt ataagctatt ttttagaaaa ggatagaaac tgtttttaaa aaaactggtg
      541 tgcttctgtt taactcttcg taagaacagt gttacgtccc gtgtctatat tttgcttttg
      601 ttgaaagcca tcgtaagtac atgcttgcgt gggtgaaatg ccatcgcaat gctacaactt
      661 ttcggctccc tcctgcttcg gtgcttccac atgccctgca cggcgtctag aaaccctaat
      721 gattcagcag cacacctgtc cgcctagccg cctacgcgta cacagaaaac aaattttttg
      781 tccacacacg cgcgcgctcc gagccgcaga tccgagctag cgcggcgcat ccgacggcca
      841 cgacagcgcg gtgccgtcct ccgccgccac cgcttggcgc ttgtccgcac cccccaccag
      901 tccaccacct cccccacgag cgaaaaccac ggtccacgga ccacggctat gttccactcc
      961 aggtggaggc tgcagccccg gtttcgcaag ccgcgccgtg gtttgcttgc ccacaggcgg
     1021 ccaaaccgca ccctccttcc cgtcgtttcc catctcttcc tcctttagag ctaccactat
     1081 ataaatcagg gctcattttc tcgctcctca caggctcgtc tcgctttgga tcgattggtt
     1141 tcgtaagtgg tgagggactg agggtctcgg agtggattga tttgggattc tgttcgaaga
     1201 tttgcggagg ggggcaatgg cgaccgcggg gaaggtgatc aagtgcaaag gtccgccttg
     1261 tttctcctct gtctcttgat ctgactaatc ttggtttatg attcgttgag taattttggg
     1321 gaaagcttcg tccacagttt tttttttcga tgaacagtgc cgcagtggcg ctgatcttgt
     1381 atgctatcct gcaatcgtgg tgaacttatt tcttttatat ccttcactcc catgaaaggc
     1441 tagtaatctt tctcgatgta acatcgtcca gcactgctat taccgtgtgg tccatccgac
     1501 agtctggctg aacacatcat acgatattga gcaaagatct atcttccctg ttctttaatg
     1561 aaagacgtca ttttcatcag tatgatctaa gaatgttgca acttgcaagg aggcgtttct
     1621 ttctttgaat ttaactaact cgttgagtgg ccctgtttct cggacgtaag gcctttgctg
     1681 ctccacacat gtccattcga attttaccgt gtttagcaag ggcgaaaagt ttgcatcttg
     1741 atgatttagc ttgactatgc gattgctttc ctggacccgt gcagctgcgg tggcatggga
     1801 ggccggcaag ccactgtcga tcgaggaggt ggaggtagcg cctccgcagg ccatggaggt
     1861 gcgcgtcaag atcctcttca cctcgctctg ccacaccgac gtcgacttct gggaggccaa
     1921 ggtatctaat cagccatccc atttgtgatc tttgtcagta gatatgatac aacaactcgc
     1981 ggttgacttg cgccttcttg gcggcttatc tgtcttaggg gcagactccc gtgttccctc
     2041 ggatctttgg ccatgaggct ggagggtatg ttctattccc cgatttactt cactatgttg
     2101 ctgactatat atgtgctgtg tttatatttt gcatatttat tatgtttttg cgtctgaatt
     2161 tatgggtatg gttggtggtc tttgtttact gttttactag atgcatgtgg aagagtcaga
     2221 agaaatagtt tttgtttgaa atggtatacc aacggttgga tattatctgt gtggacatca
     2281 gatgttctgg gttactggca gtggactttt gacagattta tctatgattc tttcattagc
     2341 agtttcttcg gctaatttac tcttactatt ttttcagtat acaaaggcac gtacagcttg
     2401 gattgtgtag aatcatttta gatctgttat ctgaggcaaa tttgcttatt ctagccgcct
     2461 gaaaattctt gattttgcca gtatcataga gagtgttgga gagggtgtga ctgacgtagc
     2521 tccgggcgac catgtccttc ctgtgttcac tggggagtgc aaggagtgcg ctcactgcaa
     2581 gtcggcagag agcaacatgt gtgatctgct caggatcaac accgaccgcg gtgtgatgat
     2641 tgccgatggc aagtcgcggt tttcaatcaa tgggaagcct atctaccact ttgttgggac
     2701 ttccaccttc agcgagtaca ctgtcatgca tgtcggttgt gttgcaaaga tcaatcctca
     2761 ggctcccctt gataaagttt gcgtccttag ctgtggtatt tctaccggta agttcattta
     2821 ctacattttg gtgtggatgc tggagtacat ttatcttgag atgctgagtt acacaaattc
     2881 tttatctgtt taggtcttgg tgcatcaatt aatgttgcaa aacctccgaa gggttcgaca
     2941 gtggctgttt tcggtttagg agccgttggt cttgccgtaa gtgttgaaac gatttgcttg
     3001 ttctatgacc tttcaattgc aatgagaacg tgtgttgggt ttgcatctga ttaccctgcg
     3061 catggttagg ctgcagaagg tgcaaggatt gctggagcgt caaggatcat tggtgtcgac
     3121 ctgaacccca gcagattcga agaaggtaca gtacacacac atgtatatat gtatgatgta
     3181 tcccttcgat cgaaggcatg ccttggtata atcactgagt agtcatttta ttactttgtt
     3241 ttgacaagtc agtagttcat ccatttgtcc cattttttca gcttggaagt ttggttgcac
     3301 tggccttggt ctaataactg agtagtcatt ttattacgtt gtttcgacaa gtcagtagct
     3361 catccatctg tcccattttt tcagctagga agtttggttg cactggcctt ggactaataa
     3421 ctgattagtc attttattac attgtttcga caagtcagta gctcatccat ctgtcccatt
     3481 tttcagctag gaagttcggt tgcactgaat ttgtgaaccc aaaagaccac aacaagccgg
     3541 tgcaggaggt ctgtttcttt acccaaggca acaaaaggtt atcacagctt atgctgaact
     3601 tggccataac attcaataat tcctttatgg tctaggtact tgctgagatg accaacggag
     3661 gggtcgaccg cagcgtggaa tgcactggca acatcaatgc tatgatccaa gctttcgaat
     3721 gtgttcatga tgtaagtata tgtatacact ctcagctact ttcattctcc aggttccctt
     3781 catccagaca tgcatgttct aaccgccgcc ctcgtgatcc agggctgggg tgttgccgtg
     3841 ctggtgggtg tgccgcataa ggacgctgag ttcaagaccc acccgatgaa cttcctgaac
     3901 gaaaggaccc tgaaggggac cttctttggc aactataagc cacgcactga tctgccaaat
     3961 gtggtggagc tgtacatgaa aaaggtaaat tgcaaagtgc tgttccttcg gtttccttac
     4021 cagccgagct tttgctgaaa aactgttaag aatcgttcct gcaattctgc ttggctctgc
     4081 acaggagctg gaggtggaga agttcatcac gcacagcgtc ccgttcgccg agatcaacaa
     4141 ggcgttcgac ctgatggcca agggggaggg catccgctgc atcatccgca tggagaacta
     4201 gatttcgctg tctagtttgt gatctggctg ggcttggggt taataaagga ggcaatgcta
     4261 gcctgccctt tcgatgagga ggtacataca cgctggcgat ggaccgcgct tgtgtgtcgc
     4321 gttcagtttg gcttttgcca agcagtaggg tagcttcccg tgtcggtaat tatatggtat
     4381 gaaccatcac cttttggcgc aatacatggt atgaacgtaa gatacaaatt ccaactacct
     4441 ctagctcgct tgtgtgctat atgtatctct ctcgacggat gacacaagat cgcttctata
     4501 tccgaagtga aactaaaagg agaaggaaaa gaaggtaaca gaataggaac cggtttggtg
     4561 agaattggag aggattcatg aaagagaaaa tcccttttca ttaaatttta aatagcaagt
     4621 gatttactct ctcatgatct cctccagttt ccatttcatc aaaacaaacc ttattcattt
     4681 tcccctctaa tctctttctt gtcaccaccg gtggagcaag gtgattaaag agactaaatt
     4741 attattcaat gaatagtagg ggttttagcc cctcaattcc tccaatacct ttgctcccaa
     4801 ataagggggt gtttggtttc tagggactaa ttcctccaat acctttgctc ccaaataagg
     4861 gggtgtttgg tttctaggga ctaatgttta gtcccatcat ttttttttct attttagtct
     4921 ataaattgct aaatatagaa actaaaataa attaaaatat agttttagtt tctatatttg
     4981 acaattttag aactaaaatg gaataaaatg tagggactaa aaattagtct agaaatcaaa
     5041 caccccctaa atccctaaga gccgaggaag gggattaaaa aggataaaat cttctttgtg
     5101 ttcaatttta aataggactc gccgtatcgg taaggccttg ttcgtttaca ttggattgca
     5161 cctggaatcg ttccggctaa tcaaagttta tataaattag agaagcaatc cggatcggaa
     5221 tcgttccgac ccaccaatcc gacgcaaacg aacaaggcct aaggcttcgc ggcggggctc
     5281 gcagtccgga cgccggagag ggggagtgga gatggagaat gacaaggggg tgttctggaa
     5341 agtttccttt ccaagagtaa gggtggttgg tttcgtacac taatttttaa gagcgtttgg
     5401 ttaagaaaca gagaaaaatg gagtaactct attcttattt tttatgttta gttttcatta
     5461 aaaaaggagc agaataccac ttgaagttct tatatagaaa tttatcataa atagttaaaa
     5521 tgctctcact ccataaaaac aatcggatgc tagcgctctt cttcctatcc taccctctat
     5581 attcatatga ctctttaacc aaacagagaa cggagcggct ccgctctatt ttactcttca
     5641 accaaataaa aaaggagcaa ctctgtttgt catacgcgga atagaacgga tttatcctca
     5701 aaaactagaa tggagcccct ctattttagt cgattctcca accaaacgca tagtgtctcc
     5761 atttcattct attttagtct ctaaattgac aaatacataa actaaattat attttaagtt
     5821 ttcgtattta atcaatccct accaaccaaa cactccctaa tttcgcatat cagccccaaa
     5881 tcaagagtgg ttgacccatc gagacgttat cggcggatca aaggcatgcc ccgctaagca
     5941 ataagtgtct aaactaacgt gccgtcgatc tcattaaaca gcaccacgag ctaaacagaa
     6001 tgccaacctc aaaatcaaac atcacctgga tgctggatct gacatccgac ctaggtgcta
     6061 ggcaacgatt gtgcgtagtg ctgaccatat ttgagatttt cactttattt attaaaaaaa
     6121 agaggccagc agggtgggcc gctacccggc ctggtggccg agctaga
//
LOCUS       CFICMCASE    1828 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  C.uda endoglucanase gene, complete cds.
ACCESSION   M36503
KEYWORDS    endoglucanase.
SOURCE      C.uda CB4 DNA.
  ORGANISM  Cellulomonas uda
            Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods.
REFERENCE   1  (bases 1 to 1828)
  AUTHORS   Nakamura,K., Misawa,N. and Kitamura,K.
  TITLE     Sequence of a cellulase gene of Cellulomonas uda CB4
  JOURNAL   J. Biotechnol. 4, 247-254 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        105     1184     endoglucanase
BASE COUNT      348 a    542 c    557 g    381 t
ORIGIN      
        1 ctgcagagtc agggaggcag cgctcacgta atattgcagc gtgaccgcgt gttctctgtc
       61 tctgacgttc agtttcttta ctaccatcca taatgagtga atttatgccc ctgcgtgctt
      121 tagtggcggt gatagtgaca acggcagtaa tgctggtgcc ccgggcgtgg gcgcagacgg
      181 cctgggagcg ttataaggcc cgttttatga tgccggacgc gcgtatcatt gataccgcca
      241 atggcaatgt gtcgcatacg gaaggccagg gcttcgccat gctcctggcg gtggcgaata
      301 acgatcgccc ggcgttcgac aagctgtggc agtggacgga cagcaccctg cgcgacaagt
      361 ctaacgggct gttttactgg cgctataacc cggtggcgcc ggacccgatc gccgataaaa
      421 acaacgccac cgatggcgat accctgatcg cctgggcgct gctgcgcgcg caaaagcagt
      481 ggcaggacaa gcgctacgcc acggcctccg atgccatcac cgcctccctg ctgaaatata
      541 cggtggtgac tttcgccggt cgccaggtga tgctcccggg cgtgaagggg tttaaccgca
      601 acgaccacct gaaccttaac ccctcctatt tcatcttccc ggcctggcgg gcctttgcgg
      661 agcggacgca cctgaccgcc tggcggacat tgcagagtga cgggcaggcg ctgctggggc
      721 aaatgggctg ggggaaatcg catctgccca gcgactgggt ggcgctgcgg gcggatggca
      781 agatgctgcc ggccaaagag tggccgccgc ggatgagttt cgatgcgatc cgtatcccgc
      841 tgtatatctc gtgggtcgat ccgcacagcg ccttgctcgc accgtggaaa gcctggatgc
      901 agagttaccc gcgcctgcaa actccggcgt ggatcaacgt tagcaccaac gaggtcgccc
      961 cgtggaatat ggccggcggc ctgctggcgg tgcgtgattt aacgcttggc gaaccgctgg
     1021 aacgccgcag attgacgaca aggatgatta ttactccgcc agcctcaagc tgctggtctg
     1081 gctggcgaaa caggatcagc gctagcgctg tgatggcttt gcaggtttct cagcccgtat
     1141 gcctgcgggc tgagagaaaa gagcaggaac gtctcacgat gtaaggccgc cagaataggc
     1201 ggccttgtcg cttattgcgg ataaggcacc caactgccgc cattcagctg gacataaggc
     1261 ttgccctgat actggataac gatggcgttg gcgttttcgg acaccgccgc gctctgcggc
     1321 aggttggcga catactgctg ccagttgacg ctgtcttcgc tgaacatttt gccgtcgagg
     1381 gcgcgcgcac caccagctcc gacaccgcca ggtagctgct gggctgatcg atgataattg
     1441 gcgcgccttc atgtggcgcc ttcatgccga agaatttcac cgccgtcggg acgttagtga
     1501 tcgacgggct cgggatatcc cgcaggccag acacctgcat cttatcgccc ttcagcgcgc
     1561 cgccgtgttc cggcaccacc accaccatca ccttacgccc cgatttttcc agttcggtga
     1621 agaagttatc caggtcgtca aacagcttct gcgcccgcac tttgtagtcc gcggttttgc
     1681 tttgccccgg gaagtgattg ccgtcatgca gcggcagggt gttatagaac gtggcgctcc
     1741 gcggattgct gctggcctct tcggtcttca gccacgggtt gagaaccgcg agatcctcat
     1801 acactggcga accatcaaat gcctgcag
//
LOCUS       HUMTAPA1     1496 bp ss-mRNA            PRI       15-AUG-1990
DEFINITION  Human 26-kDa cell surface protein TAPA-1 mRNA, complete cds.
ACCESSION   M33680
KEYWORDS    26-kDa cell surface protein TAPA-1;
            target of antiproliferative antibody.
SOURCE      Human cell line OCI-LY8, cDNA to mRNA, clones 7-3 and 8-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1496)
  AUTHORS   Oren,R., Takahashi,S., Doss,C., Levy,R. and Levy,S.
  TITLE     TAPA-1, the target of an anti-proliferative antibody, defines a new
            family of transmembrane proteins
  JOURNAL   Mol. Cell. Biol. 10, 4007-4015 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by S.Levy, 10-APR-1990, for release after publication.
FEATURES       from  to/span     description
    pept        239      949     26-kDa cell surface protein TAPA-1
    signal     1455     1460     Poly-A signal
BASE COUNT      257 a    504 c    413 g    322 t
ORIGIN      
        1 ccattgtgct ggaaaggcgc gcaacggcgg cgacggcggc gaccccaccg cgcatcctgc
       61 caggcctccg cgcccagccg cccacgcgcc cccgcgcccc gcgccccgac cctttcttcg
      121 cgcccccgcc cctcggcccg ccaggccccc ttgccggcca cccgccaggc cccgcgccgg
      181 cccgcccgcc gcccaggacc ggcccgcgcc ccgcaggccg cccgccgccc gcgccgccat
      241 gggagtggag ggctgcacca agtgcatcaa gtacctgctc ttcgtcttca atttcgtctt
      301 ctggctggct ggaggcgtga tcctgggtgt ggccctgtgg ctccgccatg acccgcagac
      361 caccaacctc ctgtatctgg agctgggaga caagcccgcg cccaacacct tctatgtagg
      421 catctacatc ctcatcgctg tgggcgctgt catgatgttc gttggcttcc tgggctgcta
      481 cggggccatc caggaatccc agtgcctgct ggggacgttc ttcacctgcc tggtcatcct
      541 gtttgcctgt gaggtggccg ccggcatctg gggctttgtc aacaaggacc agatcgccaa
      601 ggatgtgaag cagttctatg accaggccct acagcaggcc gtggtggatg atgacgccaa
      661 caacgccaag gctgtggtga agaccttcca cgagacgctt gactgctgtg gctccagcac
      721 actgactgct ttgaccacct cagtgctcaa gaacaatttg tgtccctcgg gcagcaacat
      781 catcagcaac ctcttcaagg aggactgcca ccagaagatc gatgacctct tctccgggaa
      841 gctgtacctc atcggcattg ctgccatcgt ggtcgctgtg atcatgatct tcgagatgat
      901 cctgagcatg gtgctgtgct gtggcatccg gaacagctcc gtgtactgag gccccgcagc
      961 tctggccaca gggacctctg cagtgccccc taagtgaccc ggacacttcc gagggggcca
     1021 tcaccgcctg tgtatataac gtttccggta ttactctgct acacgtagcc tttttacttt
     1081 tggggttttg tttttgttct gaactttcct gttacctttt cagggctgat gtcacatgta
     1141 ggtggcgtgt atgagtggag acgggcctgg gtcttgggga ctggagggca ggggtccttc
     1201 tgcccctggg gtcccagggt gctctgcctg ctcagccagg cctctcctgg gagccactcg
     1261 cccagagact cagcttggcc aacttggggg gctgtgtcca cccagcccgc ccgtcctgtg
     1321 ggctgcacag ctcaccttgt tccctcctgc cccggttcga gagccgagtc tgtgggcact
     1381 ctctgccttc atgcacctgt cctttctaac acgtcgcctt caactgtaat cacaacatcc
     1441 tgactccgtc atttaataaa gaaggaacat caggcatgct aaaaaaaaaa aaaaaa
//
LOCUS       DROSYNCL     3727 bp ds-DNA             SYN       15-AUG-1990
DEFINITION  Synthetic cloning vector encoding heat-shock protein 82/neomycin
            phosphotransferase fusion protein (hsp82-neo) gene, complete cds.
ACCESSION   M32616
KEYWORDS    heat-shock protein 82; neomycin phosphotransferase.
SOURCE      Synthetic, D.pseudoobscura, D.melanogaster and bacterial DNA, clone
            pHS85.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 3727)
  AUTHORS   Sass,H.
  TITLE     P-transposable vectors expressing a constitutive and
            thermoinducible hsp82-neo fusion gene for Drosophila germline
            transformation tissue-culture transfection
  JOURNAL   Gene 89, 179-186 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.Saas 06-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept       2068     2925     heat-shock protein 82/neomcyn
                                 phosphotransferase fusion protein (hsp82-neo)
    IVS        1005     2067     hsp82 intron A
    pre-msg       6     3610     hsp82-neo fusion protein mRNA and intron
    site          6      868     D.pseudoobscura hsp82 gene 5' flank
    site        869     1004     D.pseudoobscura heat-shock protein 82, exon 1
    site       2068     2126     D.pseudoobscura hsp82 truncated exon 2
    site       2127     2142     coding linker
    site       2143     3269     neomycin phosphotransferase coding sequence
    site       3270     3610     non-coding 3'flank of D.melanogaster hsp82
                                 gene with Poly-A signal
    site       3620     3726     multiple cloning site (MCS)
BASE COUNT      950 a    866 c    882 g   1029 t
ORIGIN      23 on XR.
        1 ggatccgatg gatttttacc atattattat tatttctagc cacgttgcaa ctctatgtca
       61 gtaccggaaa tagcagccct ggagtctctt agcctctaga aacggctaga acattctacg
      121 cttgtggttg gttttcattg aaagcaggcg tcttttatat actttacggt atatagctac
      181 atgtatataa tggtatactt catcaatatc atcaatctat gaattttaat ttttaagagt
      241 acatatataa attaacatgg gggatatagt tctcaatacc caagtatttg aattttccat
      301 ctctcatcgg gggtaattca tgaaccggtt ccagccgaaa aatgaacgaa attcatgaga
      361 gattattttt tcgggattgc ttgccaatac atttcggaaa aacaaaatgt actacatttt
      421 tgtcatctca gggtgctcca attaattatg aatgctacga cactacaaag cagcttggaa
      481 atccgaattt taacaataat taaaggaaat agggtatagc gtatataggg tatcatagct
      541 gaaacgggta taccaacaat aatgacgcag cacttacgtt tcactccgta ctcacttacg
      601 atttatgctt ataatttttg ttcacctctt ttacttaaac ctcactttaa aaacaatcaa
      661 ataaatggga gtatttatgt atatttctaa gattacggcg gtattgttct gctgtctgcg
      721 gtcacactgg ttttcagcct cggtgcaact ctgtttcagt accggaaata gcagccctgg
      781 attctcgtag cctctagaaa cgtctagaaa attctacgct tggggttggt ttgctataaa
      841 agcaggcggg ccgactgttg ccggctcgag tcttgaaaaa tttttgtcca gtgaaggtgc
      901 gtttgcttag agcgcagtgc aacaaagtga atttattcta cacaaatcga agtgaaaata
      961 tatatatatt tttatctctg ctgttaaatt aaaacacata caaggtaagc gttaacaatg
     1021 aaagtgcatt tatttaacaa aatgtaaaga tctgctgtgg tgcaatgctt gctgcgcgtc
     1081 tgctgatgaa aagttcttga cccaaatgca gaaaatcaat agaatctgtg aaatcttcta
     1141 taatcttaaa attagattaa agttctattt ttttgcccga gtttgtaacc acgggcgata
     1201 aaaagtagct ttacgcctcg cacaccaata cacgaacaga aaaattatgc cggctgtaat
     1261 atgagctcgg cgcgaaattt ctagatgacc ggttcttaga acatcaacct tgcatgtcca
     1321 acaaatgctg gttaattaaa gacgtgcctt aacttaattt tcttggcaca cgtgcttatt
     1381 tgaattcagt cttttgcact tgccatgcac acagccacac atatgtgaat ttgcgaattt
     1441 gccactcatg catacactca tgtatgttcc atcatcgaga aaattcgaaa atcgtgaatc
     1501 aaacttcggc atgaatcaaa tttcaaagag gtctttgttt ccacctggtt ctagaagttt
     1561 cctttcgcgt gcttggatac ctatcttatg cataaacggt ttctgcacat gtaacttgaa
     1621 cacatacaca cttgcaaaca tatgtatgta catatgcata ccctgaccac aaaattttca
     1681 gcaaacttta gccgtacatc aaaccaccaa agagctgtgc tgttgtcaag gagaattttc
     1741 ttccagaaag cttcaattag attgtttatc tgggggtgat gtacgcattg gacaacccta
     1801 tgcgctctag aaacttccag taaatgttaa ctggatgtac aatgggtaca tccctaagcg
     1861 tgcgagtgta tgcgtgttcg ctaactgtaa tgtatgtgtg ttcgtgtgcg aaagagaaaa
     1921 ggatgagaag tctgccattt tgaaataaaa agattttgtg ctaggggggt ggggaaatat
     1981 gattatcgaa aatgggcagt gaacaatgca gctgcatatt taatgagttg tgactaattc
     2041 tcgtgtggta ttttcttgct cttccagatg cccgaagaag ctgagacttt cgcattccag
     2101 gctgagattg ctcagcttat gtcgttgatc cggccaagct tggatggatt gcacgcaggt
     2161 tctccggccg cttgggtgga gaggctattc ggctatgact gggcacaaca gacaatcggc
     2221 tgctctgatg ccgccgtgtt ccggctgtca gcgcaggggc gcccggttct ttttgtcaag
     2281 accgacctgt ccggtgccct gaatgaactg caggacgagg cagcgcggct atcgtggctg
     2341 gccacgacgg gcgttccttg cgcagctgtg ctcgacgttg tcactgaagc gggaagggac
     2401 tggctgctat tgggcgaagt gccggggcag gatctcctgt catctcacct tgctcctgcc
     2461 gagaaagtat ccatcatggc tgatgcaatg cggcggctgc atacgcttga tccggctacc
     2521 tgcccattcg accaccaagc gaaacatcgc atcgagcgag cacgtactcg gatggaagcc
     2581 ggtcttgtcg atcaggatga tctggacgaa gagcatcagg ggctcgcgcc agccgaactg
     2641 ttcgccaggc tcaaggcgcg catgcccgac ggcgaggatc tcgtcgtgac ccatggcgat
     2701 gcctgcttgc cgaatatcat ggtggaaaat ggccgctttt ctggattcat cgactgtggc
     2761 cggctgggtg tggcggaccg ctatcaggac atagcgttgg ctacccgtga tattgctgaa
     2821 gagcttggcg gcgaatgggc tgaccgcttc ctcgtgcttt acggtatcgc cgctcccgat
     2881 tcgcagcgca tcgccttcta tcgccttctt gacgagttct tctgagcggg actctggggt
     2941 tcgaaatgac cgaccaagcg acgcccaacc tgccatcacg agatttcgat tccaccgccg
     3001 ccttctatga aaggttgggc ttcggaatcg ttttccggga cgccggctgg atgatcctcc
     3061 agcgcgggga tctcatgctg gagttcttcg cccaccccgg gctcgatccc ctcgcgagtt
     3121 ggttcagctg ctgcctgagg ctggacgacc tcgcggagtt ctaccggcag tgcaaatccg
     3181 tcggcatcca ggaaaccagc agcggctatc cgcgcatcca tgcccccgaa ctgcaggagt
     3241 ggggaggcac gatggccgct ttggtcgatc gatgataaac ataaaaccaa ataaacaaca
     3301 agcaaatgtg ttttaaaaat ctaacttctg agcgagtatt tattgggggg aataaacaat
     3361 ctatgaatcg gattctttgc gcagcagctg ctcaatggcc tccaccgtgg acactccgtt
     3421 ggttatcatt attatcttgt ttcgcgatcg agatcccttg tccaaagaaa cgtcgctctt
     3481 tcgaagacct agaactttcg acagaaactt gaccagttcg gcgttagctt ctccctcgct
     3541 gggcggagcg gcgatttgga cgcccactcc ttcaaagcca attcctgtga ttccgttctg
     3601 cttagccccc ccggaattgg gtacccccac cgcggtggcg gccgctctag aactagtgga
     3661 tcccccgggc tgcaggaatt cgatatcaag cttatcgata ccgtcgacct cgaggggggg
     3721 cccggta
//
LOCUS       ECOARGD      1221 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  E.coli acetylornithine aminotransferase (argD) gene, complete cds.
ACCESSION   M32796
KEYWORDS    acetylornithine aminotransferase.
SOURCE      E.coli (K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1221)
  AUTHORS   Heimberg,H., Boyen,A., Crabeel,M. and Glansdorff,N.
  TITLE     Escherichia coli and Saccharomyces cerevisiae acetylornithine
            aminotransferases: Evolutionary relationship with ornithine
            aminotransferases
  JOURNAL   Gene 90, 69-78 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by A.H.T.Boyen 13-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept          1     1221     acetylornithine aminotransferase (argD) (EC
                                 2.6.1.11)
BASE COUNT      261 a    290 c    376 g    294 t
ORIGIN      73 minutes.
        1 atggcaattg aacaaacagc aattacacgc gcgactttcg atgaagtgat cctgccgatt
       61 tatgctccgg cagagtttat tccggtaaaa ggtcagggca gccgaatctg ggatcagcaa
      121 ggcaaggagt atgtcgattt cgcgggtggc attgcagtta cggcgttggg ccattgccat
      181 cctgcgctgg tgaacgcgtt aaaaacccag ggcgaaactc tgtggcatat cagtaacgtt
      241 ttcaccaatg aaccggcgct gcgtcttggg cgtaaactga ttgaggcaac gtttgccgaa
      301 cgcgtggtgt ttatgaactc cggcacggaa gctaacgaaa ccgcctttaa actggcacgc
      361 cattacgcct gtgtgcgtca tagcccgttc aaaaccaaaa ttattgcctt ccataacgct
      421 tttcatggtc gctcgctgtt taccgtttcg gtgggtgggc agccaaaata ttccgacggc
      481 tttgggccga aaccggcaga catcatccac gttcccttta acgatctcca tgcagtgaaa
      541 gcggtgatgg atgatcacac ctgtgcggtg gtggttgagc cgatccaggg cgagggcggt
      601 gtgacggcag cgacgccaga gtttttgcag ggcttgcgcg agctgtgcga tcaacatcag
      661 gcattattgg tgtttgatga agtgcagtgc gggatggggc ggaccggcga tttgtttgct
      721 tacatgcact acgcgttagc gccggatatt ctgacctctg cgaaagcgtt aggcggcggc
      781 ttcccgatta gcgccatgct gaccacggcg gaaattgctt ctgcgtttca tcctggttct
      841 cacggttcca cctacggcgg taatcctctg gcctgtgcag tagcgggggc ggcgtttgat
      901 atcatcaata cccctgaagt gctggaaggc attcaggcga aacgccagcg ttttgttgac
      961 catctgcaga agatcgatca gcagtacgat gtatttagcg atattcgcgg tatggggctg
     1021 ttgattggcg cagagctgaa accacagtac aaaggtcggg cgcgtgattt cctgtatgcg
     1081 ggcgcagagg ctggcgtaat ggtgctgaat gccggaccgg atgtgatgcg ttttgcaccg
     1141 tcgctggtgg tggaagatgc ggatatcgat gaagggatgc aacgtttcgc ccacgcggtg
     1201 gcgaaggtgg ttggggcgta a
//
LOCUS       YSCARG8      1272 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  S.cerevisiae acetylornithine aminotransferase (ARG8) gene, complete
            cds.
ACCESSION   M32795
KEYWORDS    acetylornithine aminotransferase.
SOURCE      S.cerevisiae FL100 DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1272)
  AUTHORS   Heimberg,H., Boyen,A., Crabeel,M. and Glansdorff,N.
  TITLE     Escherichia coli and Saccharomyces cerevisiae acetylornithine
            aminotransferases: Evolutionary relationship with ornithine
            aminotransferases
  JOURNAL   Gene 90, 69-78 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer readable sequence for [1] kindly submitted
            by A.H.T.Boyen 13-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept          1     1272     acetylornithine aminotransferase (ARG8) (EC
                                 2.6.1.11)
BASE COUNT      404 a    230 c    283 g    355 t
ORIGIN      
        1 atgtttaaaa gatatttatc cagtacgtca tcaagaagat ttacaagcat tttagaggaa
       61 aaggcctttc aagtgaccac ttactctaga cctgaagatc tatgtataac tagaggtaaa
      121 aatgcaaagc tgtatgatga cgtgaatggt aaagaatata tcgatttcac cgcaggtatt
      181 gcggtgaccg cattaggcca tgcaaatcct aaagtggcag aaattctgca ccatcaggct
      241 aacaaactgg ttcattcctc caacctttac ttcactaagg aatgtttgga tttaagtgaa
      301 aagattgttg aaaagaccaa gcaattcggt ggtcaacacg acgcctcaag agtattttta
      361 tgtaattctg gtacggaagc aaatgaagct gctttgaagt ttgcaaagaa acatggtata
      421 atgaaaaatc ctagcaagca aggcattgtt gcatttgaga actcttttca tggccgtact
      481 atgggcgctt tatctgtcac ttggaatagt aaatatagaa ctccttttgg ggatttggtt
      541 ccccatgtct cattcttaaa tttgaatgac gaaatgacca aactacaaag ttatatcgag
      601 accaaaaagg acgagattgc tggtttaatt gtcgagccca tacaaggtga aggtggggtt
      661 tttcccgtag aagttgaaaa gctaaccgga ttgaagaaaa tatgtcaaga taatgatgtg
      721 attgtcattc atgatgaaat tcaatgcggt ttgggccgtt caggtaaact atgggctcat
      781 gcttatttac caagtgaggc tcatccggat atttttacat ctgccaaagc attgggaaat
      841 ggcttcccca tcgctgccac catcgtcaat gaaaaagtta ataatgcttt gagagttggt
      901 gaccacggca ccacgtatgg tggtaatccg ctggcctgtt ctgtaagcaa ctatgttttg
      961 gataccatag cagacgaagc ttttttgaaa caagtctcta agaagagtga tatcttacaa
     1021 aagcgcttgc gcgaaattca agccaaatat ccaaatcaaa taaagactat cagaggaaaa
     1081 ggtttgatgc ttggtgctga gttcgtcgaa ccacccaccg aggtcatcaa aaaggccaga
     1141 gaattgggac ttttgatcat taccgctggt aagagtaccg ttagatttgt tcccgcatta
     1201 acgattgaag acgaactaat cgaagaaggg atggatgctt ttgaaaaggc tattgaagcg
     1261 gtttacgctt aa
//
LOCUS       MZEMTMINI    1445 bp ds-DNA             ORG       15-AUG-1990
DEFINITION  Maize mitochondrion 1.4 kb minicircle DNA open reading frame.
ACCESSION   M36398
KEYWORDS    .
SOURCE      Maize mitochondrion 1.4 kb minicircle DNA.
  ORGANISM  Mitochondrion Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae; Zea mexicana.
REFERENCE   1  (bases 1 to 1445)
  AUTHORS   Smith,A.G. and Pring,D.R.
  TITLE     Nucleotide sequence and molecular characterization of a maize
            mitochondrial plasmid-like DNA
  JOURNAL   Curr. Genet. 12, 617-623 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        120      353     ORF 1
    pept       1240     1356     ORF 2
    pept       1245     1403     ORF 3
BASE COUNT      375 a    327 c    320 g    423 t
ORIGIN      
        1 gaattccttc ctttggtcgg actactcttt ttaggttatt gccttcggtc aaccctaaat
       61 aagttgattg tcaaattgcg ctgtaactgc attcagttga atatgcggat attttatcaa
      121 tgaatctcga tatcctgttg ataaagattg gatttcttgc gattctgatc gttttatcaa
      181 tccaaatcat cgatgaatat ttccataaag tgatctgtga tcctttagtc tcaatatcag
      241 ttgtttcctg ccgggataac ttgggttatg ctagccacct acttctacaa acaggtgaga
      301 tccacctggg tgggttcgaa tcccatctgc tagatgcgtg gtcatggaat tgaaacctct
      361 atggctggcc caagggaacc ggtcttgtcg attgacctag cttaggaaga gcccagtgaa
      421 cctatccaca agtcaacccc cagggataat ggaaaacctc attcgcccat tggcaaacac
      481 ttaaatatga ggacattcct ctggcaagac aggttagaga cttgagagac taaagacaag
      541 aaggcacagg ttgtagtttt cttccaaggc caaaagcccc gcatggtgga agaagctact
      601 ggtaagtccg agggggggct taactgcgat agttgaccga cgcgacgcta taccggaaag
      661 gccttcgggg tgttgaaagt atggaacttt tattctcgca tagcttggga aagggtatcc
      721 ggtgaaactc cccttaaaag ggtttttccc ccgtaccccc ttttcccaaa aaatttttta
      781 aaaaaagtgg atcagtgaac ctatctttat ctgattaaat cagtggttag gttcactact
      841 atttatagat aacaacccta gccttggggg gacaccccct ccccccaatc ccccctgtct
      901 ggttttgttt taaaccaagt ttgcagggcg agcttgtttt gttatttata attagttatt
      961 tcatgtttga tccgagcttc gggataggga acctctcttg tcagaaaggc ttccctctcc
     1021 cttggtctct tgaaacagga cttttattca ctcagctatg cttcccggaa atccggatta
     1081 aagaataaag acttctatac ctttccggga agcagagcag agggaaacgg agccctcgcc
     1141 ccggagggga atcaattctc tggtttatcg ttcttatgct gttgcggtta taacgatagg
     1201 aattactaga taacatcctc taggaattac tagataacaa tggaatggtt gagcctacta
     1261 tctcaagtgt tggaaggctc aacctacttg cttgtccctc tccactatcg ttccggtctt
     1321 accttccctc gagtccgatc tcgggaaggc gcttaggcag gggccccaag actaagcagg
     1381 taatacaata cctatattta tagagggctt ttacctcgat aaatgagggc gcttcctata
     1441 atgtg
//
LOCUS       NGOTEM1A     1199 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  N.gonorrhoeae plasmid pFA7 beta-lactamase (TEM-1) gene, 3' end.
ACCESSION   M36543
KEYWORDS    beta-lactamase.
SOURCE      N.gonorrhoeae plasmid pFA7 DNA.
  ORGANISM  Neisseria gonorrhoeae
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae.
REFERENCE   1  (bases 1 to 1199)
  AUTHORS   Sanchez-Pescador,R., Stempien,M.S. and Urdea,M.S.
  TITLE     Rapid chemiluminescent nucleic acid assays for detection of TEM-1
            beta-lactamase-mediated penicillin resistance in Neisseria
            gonorrhoeae and other bacteria
  JOURNAL   J. Clin. Microbiol. 26, 1934-1938 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1       21     beta-lactamase (TEM-1) (AA at 1)
BASE COUNT      344 a    237 c    178 g    440 t
ORIGIN      
        1 tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat
       61 ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg
      121 accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccta tctataaact
      181 cttggcttgg ttctaatccc tctaaacgat tattatcaat agccgctcta accgcttttt
      241 ctcggcttaa tttttctgtc tctgttataa aattgcttat tcattcttgt tcttctttca
      301 aaaaaaagtt aagtaaaata cctacctaaa tttttactag ttcgcaatct acgagcttat
      361 aacctcgttt tttcaattca tttaaaaaat cagattttga gcctaatttg atctattgct
      421 atcgttaccc gctagaaata cccagtaatt acgcaaatct tcattggtaa ctttcgtaat
      481 atctgtgtaa tgatcttcga gtatttttaa gcaatctcta gcccataaac cgtactcgtg
      541 attgctcatc ttagggtttt gcttatcgag tttgacgaac ttcccatact tgtttttatg
      601 tggaaatact ggccgtttgc aacttcttca attttttgag ctgttcgttt tttactacca
      661 atcacaaaat ttaaagagtg aatagtacgc ccacgcttga tttgttcaac ctcaacgact
      721 aaatcagatt tctcgttaat ctcagttatt gcaggttcca aaacacgttg atttaatgaa
      781 ttaaatctag gtattattca acctgaagcc attctttagt tttctactgt aatttcacga
      841 ctaccaacag agcgatattg tgtaattagc tcataaattc gaattgaatg tacactgttg
      901 aaataagcga tatgtttgag ttgatattgc gtgaattgcc ctttaagttg cgttaggtat
      961 ggcataactt catcagtcat tgcaattcta aaacgcccct ctttctgaaa tatgttctag
     1021 aggaaaccca acgaaattca gttacacggt ctttatcttc agttttaaca cttcggtcat
     1081 aaatccgttt tatagccgcc tgaatttgct tataggcgtt atcttggctt atttctggaa
     1141 actcacggac aaaatcagcc accgtaaaat caaaaatttt ttgattagat ttcggatcc
//
LOCUS       FLANAX       1461 bp ss-RNA             VRL       15-AUG-1990
DEFINITION  Influenza A/Chile/1/83 (H1N1), neuraminidase (seg 6), cDNA to mRNA.
ACCESSION   M24783 M33023
KEYWORDS    neuraminidase.
SOURCE      Influenza virus type A, cDNA to viral RNA.
  ORGANISM  Influenza virus type A
            Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; 
            Orthomyxoviridae; Influenzavirus; Influenza A viruses.
REFERENCE   1  (bases 1 to 1461)
  AUTHORS   Schreier,E., Roeske,H., Driesel,G., Kuenkel,U., Petzold,D.R.,
            Berlinghoff,R. and Michel,S.
  TITLE     Complete nucleotide sequence of the neuraminidase gene of the human
            influenza virus A/Chile/1/83 (H1N1)
  JOURNAL   Arch. Virol. 99, 271-276 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         21     1433     neuraminidase
BASE COUNT      466 a    263 c    343 g    388 t      1 others
ORIGIN      
        1 agcaaaagca ggagtttaaa atgaatccaa atcagaaaat aataaccatt ggatcaatct
       61 gtatgacaat cggaataatt agtctaatat tgcaaatagg aaatattatt tcaatatggg
      121 ttagccactc aatccaaact ggaagtcaaa accacactgg aatatgcaac caaagaatca
      181 ttacttatga aaatagcacc tgggtaaatc aaacatatgt caatattaac aacactaacg
      241 ttgttgctgg aaaggacaca acttcagtga cattagccgg caattcatct ctttgtccta
      301 tccgtgggtg ggctatatac agcaaagaca acagcataag aattggttcc aaaggagatg
      361 tttttgtcat aagagaacct tttatatcat gttctcactt ggaatgcaga accttttttc
      421 tgacccaagg tgctctatta aatgacaagc attcaaatgg gaccgttaag gacagaagcc
      481 cttatagggc cttaatgagc tgtcctatag gtgaagctcc gtctccatac aattcaaggt
      541 ttgaatcagt tgcttggtca gcaagcgcat gtcatgatgg catgggctgg ctaacaatcg
      601 gaatttctgg tccagatgat ggagcagtgg ctgtactaaa atacaacggc ataataactg
      661 aaaccataaa aagttggagg aagcgaatat taagaacaca agagtctgaa tgtgtctgtg
      721 taaacggttc atgttttacc ataatgaccg atggcccgag taatggacct gcctcgtaca
      781 gaatcttcaa aatcgagaag gggaagatta ctaaatcaat adagttggat gcacccaatt
      841 ctcattacga ggaatgttcc tgttacccag acaccggcac agtgatgtgt gtgtgcagag
      901 acaattggca tggttcgaat cgaccttggg tgtcttttaa tcaaaacctg gattatcaaa
      961 taggatacat ctgcagtggg gttttcggtg acaatccgcg tcccaaagat ggaaaaggca
     1021 gctgtgatcc agtaactgtt gatggagcag acggagtaaa ggggttttca tacaggtatg
     1081 gtaatggtgt ttggatagga aggactaaaa gtaacagctc cagaaaggga tttgagatga
     1141 tttgggatcc taatggatgg acagataccg atagtaattt cttagtgaaa caggatgtag
     1201 tggcaatgac tgattggtca gggtacagcg gaagtttcgt tcaacatcct gagctaacag
     1261 gattggactg tatgaggcct tgcttctggg ttgaattaat cagaggacga cctagagaaa
     1321 agacaacaat ctggactagt gggagcagca tttctttttg tggcgtgaat agtgatactg
     1381 caaattggtc ttggccagac ggtgccgagt tgccattcac cattgacaag tagtccgttg
     1441 aaaaaactcc ttgtttctac t
//
LOCUS       YSPURA4      1764 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  S.pombe orotidine-5'-phosphate decarboxylase (ura4) gene.
ACCESSION   M36504
KEYWORDS    orotidine-5'-phosphate decarboxylase.
SOURCE      S.pombe DNA.
  ORGANISM  Schizosaccharomyces pombe
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1764)
  AUTHORS   Grimm,C., Kohli,J., Murray,J. and Maundrell,K.
  TITLE     Genetic engineering of Schizosaccharomyces pombe: A system for gene
            disruption and replacement using the ura4 gene as a selectable
            marker
  JOURNAL   Mol. Gen. Genet. 215, 81-86 (1988)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        534     1328     orotidine-5'-phosphate decarboxylase (ura4)
    mRNA        484  >  1328     ura4 mRNA
BASE COUNT      550 a    274 c    349 g    591 t
ORIGIN      
        1 aagcttagct acaaatccca ctggctatat gtatgcattt gtgttaaaaa agtttgtata
       61 gattatttaa tctactcagc attctttctc taaataggaa tttgttactt aatggagaaa
      121 aaaatgtttc gatttaccta gtgtatttgt ttgtatactc acgtttaatt tcaaacatcc
      181 attctatctt gtgtaatttt tggcatggtg aaaaagataa tcagccttat aatctttaca
      241 aaagtaagaa attctgtaaa taagccttaa tgcccttgct ttaaattaaa atggttcttt
      301 ttcatgataa tgtttgcact ttgtgaatat attttagata gttctgtgag gtataattaa
      361 gatgttttag agacttatac aattttgtct ttataaattc ttaattgatt ttaccatccc
      421 agtttaacta tgcttcgtcg gcatctctgc acatgtcgtg ttttcttacc gtattgtcct
      481 accaagaacc tcttttttgc ttggatcgaa attaaaggtt taaaagcaaa gttatggatg
      541 ctagagtatt tcaaagctat tcagctagag ctgaggggat gaaaaatccc attgccaagg
      601 aattgttggc tttgatggaa gaaaagcaaa gcaacttgtc agtcgcggtc gatttgacga
      661 agaaatccga aatcttagaa ttggtagata aaattggacc ctatgtctgt gttatcaaga
      721 cacatattga cgttgtcgag gatttcgacc aggatatggt agaaaaactg gtggccttag
      781 gtaaaaagca tcgttttctt atctttgagg atcgcaaatt cgcagacatt ggaaataccg
      841 tcaagctaca atatgcatct ggtgtgtaca aaattgcttc ttgggctcat atcacaaatt
      901 gccatacagt gccaggcgag ggtattatac aaggcctcaa agaagttggt ttacctttgg
      961 gacgtggtct cttgcttttg gctgaaatgt cttccaaagg ctctttggct actggttcct
     1021 acacagagaa aaccttagaa tggtttgaga agcataccga tttttgcttt ggctttatag
     1081 ctggtcgtcg atttcctaac cttcaaagcg actacataac tatgtcccct ggtatcggct
     1141 tggatgttaa aggagacggg ctgggacagc aatatcgtac tcctgaagaa gtgattgtaa
     1201 actgcggtag cgatatcatc attgttggtc gtggagtcta tggagctggt cgtaatcctg
     1261 ttgtcgaagc caagagatat agagaagctg gttggaaggc atatcagcaa agactttctc
     1321 agcattaaaa aaagactaat gtaaaatttt tttggttggt tattgaaaaa gtcgatgcct
     1381 tgtttgcgtt tgttttccta ggcgttttat gtcagaaggc atttagaatt agtatacaag
     1441 tactctttgg taaaatttta tgtagcgact aaaatattaa ctattataga taaacacctt
     1501 gggaataaaa agtaatttgc tatagtaatt tattaaacat gctcctacaa cattaccaca
     1561 atcttttctc ttggattgac attgaataag aaaagagtga atttttttag acttgtaatg
     1621 ataactatgt acaaagccaa tgaaagatgt atgtagatga atgtaaaata ccatgtagac
     1681 aaacaagata aaacttggtt ataaacattg gtgttggaac agaataaatt agatgtcaaa
     1741 aagtttcgtc aatatcacaa gctt
//
LOCUS       BMEGDH1      2834 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  B. megaterium glucose dehydrogenase gene and ORFs.
ACCESSION   D90043
KEYWORDS    glucose dehydrogenase.
SOURCE      Bacillus megaterium (strain IAM1030) DNA.
  ORGANISM  Bacillus megaterium
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2834)
  AUTHORS   Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
            Urabe,I. and Okada,H.
  TITLE     Active and silent isozyme genes of glucose dehydrogenase from
            Bacillus megaterium IAM1030
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Toshihide
            Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan.
FEATURES       from  to/span     description
    pept       1964     2749     glucose dehydrogenase (EC 1.1.1.47)
    ORF         181      867     ORF1
    ORF        1086     1946     ORF2
    signal       89       94     put. -35 region for ORF1
    signal      116      121     put. -10 region for ORF1
    signal      167      172     ORF2 ribosome binding site
    signal      893      928     termination signal
    signal      972      977     put. -35 region for ORF2
    signal      997     1003     put. -10 region for ORF2
    signal     1070     1076     ORF2 ribosome binding site
    signal     1949     1954     glucose dehydrogenase ribosome binding site
    signal     2751     2783     termination signal for glucose dehydrogenase
BASE COUNT      896 a    442 c    644 g    852 t
ORIGIN      
        1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga
       61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag
      121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa
      181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg
      241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt
      301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat
      361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca
      421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat
      481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag
      541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta
      601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat
      661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct
      721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt
      781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt
      841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg
      901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt
      961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt
     1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt
     1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt
     1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt
     1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg
     1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta
     1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca
     1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg
     1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag
     1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa
     1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat
     1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca
     1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg
     1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc
     1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga
     1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag
     1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga
     1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg
     2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa
     2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt
     2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt
     2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt
     2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga
     2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt
     2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa
     2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat
     2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg
     2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc
     2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc
     2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca
     2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga
     2821 ctggtgttga attc
//
LOCUS       BMEGDH2      1202 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene.
ACCESSION   D90044
KEYWORDS    glucose dehydrogenase; isozyme.
SOURCE      Bacillus megaterium (strain IAM1030) DNA.
  ORGANISM  Bacillus megaterium
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 1202)
  AUTHORS   Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S.,
            Urabe,I. and Okada,H.
  TITLE     Active and silent isozyme genes of glucose dehydrogenase from
            Bacillus megaterium IAM1030
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Toshihide
            Mitamura Department of Fermentation Technology Osaka University 2-1
            Yamada-oka Suita, Osaka 565 Japan Phone:  06-877-5111 x4373 Fax:
            06-876-9036
FEATURES       from  to/span     description
    pept        125      910     glucose dehydrogenase
    signal       27       32     put. -35 region
    signal       49       55     put. -10 region
    signal      111      116     SD sequence
    signal      928      959     termination signal
BASE COUNT      427 a    190 c    268 g    317 t
ORIGIN      
        1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa
       61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt
      121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt
      181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg
      241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc
      301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc
      361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt
      421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg
      481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa
      541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc
      601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa
      661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa
      721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg
      781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt
      841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg
      901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt
      961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg
     1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa
     1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg
     1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg
     1201 ag
//
LOCUS       BMOPTTHP1    1023 bp ss-mRNA            INV       15-AUG-1990
DEFINITION  B.mori PTTH mRNA.
ACCESSION   D90082
KEYWORDS    PTTH; preproPTTH.
SOURCE      B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to
            mRNA, clones P1, P2, C2, C9 and C19.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 1023)
  AUTHORS   Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
            Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
  TITLE     Molecular cloning of the Bombyx mori prothoracicotropic hormone
  JOURNAL   Science 247, 1333-1335 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hironori
            Ishizaki Department of Biology, School of Science, Nagoya
            University Chikusa-ku Nagoya 464-01 Japan Phone:  052-781-5111
            x2472 Fax:    052-783-0719 Telex:  SCUNAG J: 447-7323
FEATURES       from  to/span     description
    pept         34      708     preproPTTH
    matp        379      708     PTTH subunit
    signal      768      773     polyadenylation signal
    signal      826      831     polyadenylation signal
    signal      997     1002     polyadenylation signal
    signal     1004     1009     polyadenylation signal
BASE COUNT      358 a    179 c    186 g    300 t
ORIGIN      
        1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc
       61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg
      121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac
      181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg
      241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta
      301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa
      361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct
      421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc
      481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc
      541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct
      601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg
      661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt
      721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata
      781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc
      841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt
      901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca
      961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt
     1021 aaa
//
LOCUS       BMOPTTHP4     944 bp ss-mRNA            INV       15-AUG-1990
DEFINITION  B.mori preproPTTH mRNA.
ACCESSION   D90083
KEYWORDS    PTTH; preproPTTH.
SOURCE      B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to
            mRNA, clone P4 and C21.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 944)
  AUTHORS   Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M.,
            Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H.
  TITLE     Molecular cloning of the Bombyx mori prothoracicotropic hormone
  JOURNAL   Science 247, 1333-1335 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Hironori
            Ishizaki Department of Biology, School of Science, Nagoya
            University Chikusa-ku Nagoya 464-01 Japan Phone:  052-781-5111
            x2472 Fax:    052-783-0719 Telex:  SCUNAG J: 447-7323
FEATURES       from  to/span     description
    pept    <     1      631     preproPTTH
    matp        302      631     PTTH subunit
    signal      691      696     polyadenylation signal
    signal      749      754     polyadenylation signal
    signal      918      923     polyadenylation signal
    signal      925      930     polyadenylation signal
BASE COUNT      337 a    163 c    177 g    267 t
ORIGIN      
        1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt
       61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag
      121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga
      181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa
      241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag
      301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa
      361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg
      421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag
      481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga
      541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag
      601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt
      661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa
      721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag
      781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt
      841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata
      901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa
//
LOCUS       HUMMTSDHB     958 bp ss-mRNA            ORG       15-AUG-1990
DEFINITION  Human mitochondrial succinate-ubiquinone oxidoreductase
            (EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA.
ACCESSION   D90047
KEYWORDS    Ip; complex II; iron sulfur subunit; sdh B;
            succinate-ubiquinone oxidoreductase.
SOURCE      Human liver mitochondrion, cDNA to mRNA.
  ORGANISM  Mitochondrion Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; 
            Homo sapiens.
REFERENCE   1  (bases 1 to 958)
  AUTHORS   Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M.
  TITLE     Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning
            of iron sulfur(Ip) subunit of liver mitochondria
  JOURNAL   Biochem. Biophys. Res. Commun. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Kiyoshi
            Kita Department of Parasitology Juntendo University 2-1-1,Hongo
            Bunkyo-ku, Tokyo 113 Japan Phone:  03-813-3111 x3542 Fax:
            03-814-9300
FEATURES       from  to/span     description
    pept    <     1      789     succinate-ubiquinone oxidoreductase (sdh B)
                                 (AA at 1)
    site         74       95     iron-sulfur binding site I
    site        167      179     iron-sulfur binding site II
    site        224      236     iron-sulfur binding site III
BASE COUNT      319 a    212 c    204 g    223 t
ORIGIN      
        1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt
       61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg
      121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag
      181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt
      241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac
      301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat
      361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag
      421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag
      481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc
      541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg
      601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc
      661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat
      721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct
      781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt
      841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata
      901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//
LOCUS       HUMNCAW      2287 bp ss-mRNA            PRI       15-AUG-1990
DEFINITION  Human nonspecific cross-reacting antigen (NCA-W272) mRNA.
ACCESSION   D90064
KEYWORDS    CEA; CEA gene family; PI-anchored membrane protein.
SOURCE      Human white blood cells, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2287)
  AUTHORS   Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and
            Matsuoka,Y.
  TITLE     Characterization of a cDNA clone encoding a new species of the
            nonspecific cross-reacting antigen (NCA), a member of the CEA gene
            family
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Fumiko
            Arakawa First Department of Biochemistry School of Medicine Fukuoka
            University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone:
            092-801-1011  x2892 Fax:   092-801-3600
FEATURES       from  to/span     description
    ORF          87     1136     nonspecific cross-reacting antigen ORF
BASE COUNT      618 a    593 c    453 g    623 t
ORIGIN      
        1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag
       61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc
      121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca
      181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc
      241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa
      301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag
      361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg
      421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg
      481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca
      541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc
      601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc
      661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac
      721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga
      781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag
      841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt
      901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa
      961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca
     1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag
     1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc
     1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa
     1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact
     1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc
     1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa
     1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt
     1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc
     1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga
     1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga
     1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc
     1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt
     1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt
     1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa
     1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca
     1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat
     1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga
     2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt
     2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac
     2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc
     2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg
     2281 gctgatc
//
LOCUS       PIGDESTN     1666 bp ss-mRNA            MAM       15-AUG-1990
DEFINITION  Porcine destrin mRNA.
ACCESSION   D90053 J05290
KEYWORDS    actin-binding protein; cofilin; destrin.
SOURCE      Pig adult brain, cDNA to mRNA, clone PD2.
  ORGANISM  Sus scrofa
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Suiformes; Suidae.
REFERENCE   1  (bases 1 to 1666)
  AUTHORS   Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S.,
            Iida,K. and Yahara,I.
  TITLE     Destrin, a mammalian actin-depolymerizing protein, is closely
            related to cofilin: Cloning and expression of porcine brain destrin
            cDNA
  JOURNAL   J. Biol. Chem. 265, 5768-5773 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Kenji
            Moriyama Department of Biophysics and Biochemistry Faculty of
            Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan
            Phone: 03-821-2111  x4408
FEATURES       from  to/span     description
    pept         54      551     destrin
    mRNA    <     1     1666     destrin mRNA
    signal     1645     1660     polyadenylation signal
BASE COUNT      492 a    322 c    372 g    480 t
ORIGIN      
        1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt
       61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt
      121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag
      181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa
      241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg
      301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc
      361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg
      421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc
      481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat
      541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc
      601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg
      661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct
      721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga
      781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt
      841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt
      901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt
      961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac
     1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca
     1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca
     1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt
     1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta
     1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag
     1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc
     1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg
     1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga
     1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt
     1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta
     1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taactt
//
LOCUS       RATPMP70X    3324 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA.
ACCESSION   D90038 J05256
KEYWORDS    PMP70; peroxisomal membrane protein.
SOURCE      Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181,
            189,201].
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3324)
  AUTHORS   Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T.
  TITLE     The 70-kDa Peroxisomal Membrane Protein Is a Member of the
            Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Keiju
            Kamijo Department of Biochemistry Shinshu University School of
            Medicine Matsumoto 390 Japan Phone:  263-35-4600 x5182 Fax:
            263-33-6458
FEATURES       from  to/span     description
    pept         36     2015     peroxisomal membrane protein (PMP70)
    signal     3271     3276     polyadenylation signal (put.)
BASE COUNT      921 a    658 c    773 g    972 t
ORIGIN      
        1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga
       61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc
      121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg
      181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac
      241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta
      301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac
      361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact
      421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg
      481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag
      541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc
      601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta
      661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg
      721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa
      781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta
      841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa
      901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct
      961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg
     1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca
     1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc
     1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg
     1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta
     1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg
     1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa
     1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg
     1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat
     1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc
     1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa
     1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg
     1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg
     1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac
     1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca
     1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc
     1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa
     1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt
     2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta
     2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca
     2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat
     2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg
     2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga
     2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct
     2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt
     2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat
     2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt
     2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt
     2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg
     2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac
     2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg
     2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg
     2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca
     2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc
     3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg
     3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa
     3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt
     3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat
     3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag
     3301 attaaaaaaa aaaaaaagga attc
//
LOCUS       VACSANT      1525 bp ds-DNA             VRL       15-AUG-1990
DEFINITION  Vaccinia virus surface (S) antigen gene.
ACCESSION   D90076
KEYWORDS    S gene; surface antigen.
SOURCE      Vaccinia virus DNA.
  ORGANISM  Vaccinia virus
            Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus.
REFERENCE   1  (bases 1 to 1525)
  AUTHORS   Ueda,Y., Morikawa,S. and Matsuura,Y.
  TITLE     Identification and nucleotide sequence of the gene encoding a
            surface antigen induced by Vaccinia virus
  JOURNAL   Virology 177, 588-594 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Yoshiaki
            Ueda National Institute of Health Gakuen, Musashimurayama Tokyo
            190-12 Japan Phone:  0425-61-0771 Fax:    0425-65-3315
FEATURES       from  to/span     description
    pept        382     1437     surface antigen S
BASE COUNT      568 a    229 c    258 g    470 t
ORIGIN      
        1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata
       61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat
      121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa
      181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat
      241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac
      301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata
      361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca
      421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc
      481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt
      541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt
      601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa
      661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta
      721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca
      781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc
      841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga
      901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat
      961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta
     1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat
     1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa
     1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact
     1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga
     1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc
     1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt
     1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata
     1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt
     1501 gctatagaat gaatgaggat acgcg
//
LOCUS       YSCA1         881 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  S. cerevisiae acidic ribosomal protein A1 (YSCA1).
ACCESSION   D90072 X13682
KEYWORDS    acidic ribosomal protein; ribosomal protein.
SOURCE      S. cerevisiae (strain IFO-40028) DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 881)
  AUTHORS   Mitsui,K. and Tsurugi,K.
  TITLE     Identification of A1 protein as the fourth member of 13 kDa-type
            acidic ribosomal protein family in yeast Saccharomyces cerevisiae
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 315; 631 to 881)
  AUTHORS   Mitsui,K. and Tsurugi,K.
  TITLE     Identification of A1 protein as the fourth member of 13 kDa-type
            acidic ribosomal protein family in yeast Saccharomyces cerevisiae
  JOURNAL   Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989)
  STANDARD  full staff_entry
REFERENCE   3  (bases 277 to 742)
  AUTHORS   Tsurugi,K. and Mitsui,K.
  TITLE     cDNA and deduced amino acid sequence of acidic ribosomal protein A1
            from Saccharomyces cerevisiae
  JOURNAL   Nucleic Acids Res. 16, 3574-3574 (1988)
  STANDARD  simple automatic
COMMENT     These data kindly submitted in computer readable form by: Kazuhiro
            Mitsui Department of Biochemistry Yamanashi Medical college Tamaho,
            Nakakoma-gun Yamanashi 409-38 Japan Phone:  0552-73-1111 x2257
FEATURES       from  to/span     description
    pept        313      633     acidic ribosomal protein A1
    signal      125      135     UASrpg box1
    signal      182      192     UASrpg box2
    signal      716      721     poly(A) signal
    variant     303      303     a in [1]; g in [3]
    variant     684      685     tt in [1]; t in [3]
    variant     719      719     a in [1]; t in [3]
BASE COUNT      270 a    165 c    157 g    289 t
ORIGIN      
        1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt
       61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca
      121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa
      181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca
      241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa
      301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac
      361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca
      421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac
      481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct
      541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc
      601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg
      661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa
      721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg
      781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc
      841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c
//
LOCUS       YSCCDC23X    3107 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  S. cerevisiae CDC23 protein gene.
ACCESSION   D90081
KEYWORDS    CDC23 protein.
SOURCE      Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line
            D22) DNA, clone YX34.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 3107)
  AUTHORS   Doi,A. and Doi,K.
  TITLE     Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces
            cerevisiae
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Kenji Doi
            The Institute of Scientific and Industrial Research Osaka
            University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111
            Fax:    06-877-4977
FEATURES       from  to/span     description
    ORF         765     2645     ORF for CDC23
    site        153      161     calcium-binding site
BASE COUNT     1010 a    595 c    629 g    873 t
ORIGIN      
        1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact
       61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt
      121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc
      181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt
      241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg
      301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct
      361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt
      421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga
      481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag
      541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat
      601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact
      661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc
      721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc
      781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac
      841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag
      901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag
      961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt
     1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt
     1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc
     1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta
     1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc
     1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga
     1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag
     1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa
     1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact
     1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat
     1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta
     1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg
     1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc
     1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg
     1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt
     1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg
     1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact
     1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt
     2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca
     2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca
     2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt
     2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg
     2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca
     2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat
     2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg
     2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa
     2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta
     2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata
     2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat
     2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc
     2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact
     2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc
     2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc
     2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga
     3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga
     3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag
//
LOCUS       BSPRSDA      2996 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  Bacillus sp. raw-starch-digesting amylase gene.
ACCESSION   D90112
KEYWORDS    alpha amylase; raw-starch-digesting amylase.
SOURCE      Bacillus sp.(strain B1018) DNA.
  ORGANISM  Bacillus sp.
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 2996)
  AUTHORS   Itokor,P., Tsukagoshi,N. and Udaka,S.
  TITLE     Nucleotide sequence of the raw-starch-digesting amylase gene from
            Bacillus sp. B1018 and its strong homology to the cyclodextrin
            glucanotransferase genes
  JOURNAL   Biochem. Biophys. Res. Commun. 166, 630-636 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Shigezo
            Udaka Department of Food Science and Technology, Faculty of
            Agriculture Nagoya University Furo-cho, Chikusa-ku Nagoya 464 Japan
            Phone:  052-782-5111 x6356 Fax:    052-781-4447
FEATURES       from  to/span     description
    pept        313     2454     raw-starch-digesting amylase precursor (EC
                                 3.2.1.1)
    sigp        313      393     raw-starch-digesting amylase signal peptide
    matp        394     2454     raw-starch-digesting amylase mature peptide
    binding     302      306     ribosome binding site
    signal      101      106     -35 region
    signal      125      130     -10 region
    rpt        2567     2580     inverted repeat
    rpt        2585     2598     inverted repeat
BASE COUNT      764 a    852 c    774 g    606 t
ORIGIN      10 bp upstream of RsaI site.
        1 ttatttgagt acattttatg tattcccaca ttgcgcccga tatctacgct tagaaaaaaa
       61 tcgtcggaaa agcgccccaa aaaattttta ttgttattta ttgacagttg tattcgcttt
      121 catctacaat gatggaggaa cgcaatactc gatataattt aagggccatg cattccgtga
      181 ccgcacaccc ggtatggaac aaccccggta tctcgatgga gaagccgggg ttttttgtcg
      241 ccctttttta ggaggtgatc cggcgacagc ggatcaagcc tggaattcaa ataattacat
      301 aggaggtata acatgaagaa atttctgaaa atgacagccg cgttttccct gggattatcc
      361 ctggcgttcg ggcttttcag ccccgcccag gccgcgccgg atacctcggt atccaacaag
      421 caaaatttca gcaccgacgt catctatcaa attttcaccg acaggttttc ggacggcaat
      481 cccgccaaca atccgaccgg cgcggcgttt gacggaacct gcacgaacct ccggctgtat
      541 tgcggcggcg actggcaggg catcatcaac aaaatcaacg acggttacct gaccgggatg
      601 ggcgttaccg ccatctggat ctcccagccg gtcgaaaaca tctacagcat catcaattat
      661 tccggcgtca acaacacggc ctatcacggc tactgggccc gggacttcaa gaagacgaat
      721 ccggcctacg gcacgattgc ggacttccag aacctgatcg ccgccgcgca tgccaaaaac
      781 atcaaagtca ttatcgactt cgccccgaac catacgtcgc ccgcctcgtc cgaccagcct
      841 tcctttgcgg aaaacggccg gctgtacgat aacggcacgc tgctcggggg atacacgaac
      901 gatacgcaga acctgttcca ccataacggc ggcacggact tttccacgac cgaaaacggc
      961 atctacaaaa acctgtacga tctcgccgac ctgaaccata acaacagcac gtcggacgtc
     1021 tacttgaagg acgcgatcaa aatgtggctg gatctcggca tcgacggcat ccgcatggat
     1081 gcggtgaagc atatgccgtt cggctggcag aagagcttta tggctgccgt caacaactat
     1141 aagccggtct ttaccttcgg cgaatggttc ctgggcgtaa acgaagtagg cccggaaaac
     1201 cataagtttg ccaacgaatc cggcatgagc ctgcttgatt tccgttttgc ccaaaaggtg
     1261 cggcaggtgt tccgggacaa caccgacaat atgtacggcc tgaaggcgat gctggagggc
     1321 tccgcagccg attacgccca ggtggatgac caggtgacgt tcatcgacaa ccatgacatg
     1381 gagcgtttcc acgcaagcaa tgcaaaccgc cggaagctgg agcaagcgct ggcgttcacg
     1441 ctgatcctcg cgcgcgtccc cgccatttat tacggcaccg agcagtacat gtcgggtggg
     1501 accgatccgg acaaccgggc gcggatccct tccttctcca cgtcgacgac cgcctatcaa
     1561 gtcattcaaa agctggcgcc gctgcgcaag tccaacccgg ccatcgccta cggatcgacg
     1621 caggagcgct ggatcaacaa cgacgtgctc atttatgagc gcaaattcgg cagcaacgtt
     1681 gccgtcgttg ccgtcaaccg caatttgaac gcgccggctt ccatttcggg acttgtcact
     1741 tccctgccgc aaggcagcta caatgacgtc cttggcggcc ttctgaacgg caacacgtta
     1801 acggtaggct ccggcggagc cgcctccaat ttcacgcttg cggccggcgg cacggcggtg
     1861 tggcagtaca ccgcggcaac ggcgacgccg accatcgggc atgtcgggcc gatgatggcc
     1921 aagccgggcg tgacgatcac gatcgacggc cgcggcttcg gctctagcaa aggcaccgtc
     1981 tacttcggca cgacggcggt gagcggcgcc aacatcacgt cttgggaaga cacgcagatc
     2041 aaagtgaaaa ttccggccgt cgcaggcggc atctacaaca ttaaagtcgc aaacgccgcc
     2101 ggaacggcaa gcaacgtgta cgacaacttc gaggtattgt ccggagacca ggtcagcgtc
     2161 cgcttcgtgg tcaacaacgc gacaacggcc cttgggcaaa atctctacct gacgggcaat
     2221 gtcagcgagc tggggaactg ggacccggca aaagcgatcg ggccgatgta caaccaggtc
     2281 gtttaccaat atccgaactg gtattatgac gtcagcgttc cggccggcaa aacgatcgag
     2341 ttcaagtttt tgaaaaaaca aggctccacc gtcacgtggg aaggcggcag caaccacacc
     2401 ttcaccgcgc cgtccagcgg caccgcgacc attaacgtga attggcagcc ataaggcgtg
     2461 agggataggc ggctggcatt cattggaaaa ggcggactat atgacgtccg ttccgtgagc
     2521 aacgctcatc gctccgttca aaccgccaca aggctgatct tcagccaaaa aaagagggga
     2581 cctttcccct ctttttttat ttccgttgac taacggtatt cccaaaaatt acattggggg
     2641 ataagctccc tcccctctaa tagcaataac aagagcgtaa acccaaccag gtgatccata
     2701 gcgtgcggtc gcctttaatc ccggtatcaa aatgtatcct accttacaaa aatgatcgga
     2761 tcatacaaaa tagtgcgtac tactcaacga aatagaacct acatacagaa cgatcgatcc
     2821 agatttcaac gaacggcacg gtcgtttaaa aaaatggtgt gcggggtgcg agaatatgca
     2881 agaatatcaa ctgactttga aagataagcg gatcgtatgg gggaaggcga tcgaccttga
     2941 gcctctcatt ggcaaatatc ctggcgactc gattagacag ggcatgaacg aagctt
//
LOCUS       HUMALPL      3101 bp ds-DNA             PRI       15-AUG-1990
DEFINITION  Human alkaline phosphatase (EC 3.1.3.1) gene.
ACCESSION   D90054
KEYWORDS    alkaline phosphatase.
SOURCE      Human liver DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3101)
  AUTHORS   Matsuura,S., Kishi,F. and Kajii,T.
  TITLE     Characterization of a 5'-flanking region of the human liver/bone/
            kidney alkaline phosphatase gene: Two kinds of mRNA from a single
            gene
  JOURNAL   Biochem. Biophys. Res. Commun. 168, 993-1000 (1990)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Fumio
            Kishi Department of Pediatrics Yamaguchi University School of
            Medicine Ube, Yamaguchi 755 Japan Phone:  0836-22-2258 Fax:
            0836-22-2696
FEATURES       from  to/span     description
    pre-msg    2130     3101     alkaline phosphatase mRNA and intron
    IVS        2341     3101     alkaline phosphatase intron
    rpt         375      664     Alu sequence
    rpt        2631     2926     Alu sequence
BASE COUNT      775 a    752 c    692 g    882 t
ORIGIN      chromosome 1; map position p34-36.1.
        1 aagctttctc cagcgagtat gatggtttct gcaggttctt ggcataaagc ctttatcaga
       61 ttaaggaaat tcttttcaat acctggtttg ctgagggctt ctgtcacatc gttttctgtg
      121 accccattcc ctctccctag gtgagcacgt caagtttgat cagggtgtta aactgccacc
      181 cctgtgccta tgattcccaa atttatactc taacccagac ttctttttca aatgccagag
      241 ccaaatattc agctgcctcc ttagtgtctc cacttctaaa agacatctcc aactcaacat
      301 atccaaaaac aagttcctga ttgtctccac ctcatgcctc aaaagaccac cccaaacgcc
      361 gaaaggctga atgctttttt ctttttcttt tttttttttt tctgagatgg agtctcactc
      421 tgttgcccag gctggactgc agtgatgcga tctcagctca ctgcaaactc tgcttcctgg
      481 gttcaagtga ttctcctacc tcagcctctc aggtagctgg gactacaggt gcacaccacc
      541 atgcccagct aatttttgta gagagagttt caccatgttg gccaggctgg tctcaaacac
      601 ctgaccttaa gggatccacc cgcctcagcc tctcaaagtg ctgggattac aggtgtgagc
      661 catcgcactt ggctcggtag tatatggctc agaaacattg ccatttacaa tagttcccca
      721 aaaagcaaaa ttcttaggta taaatctgga ttcagagtcc agaatgctaa ccattacacg
      781 atggaacccg taggtataaa tctaagaaaa catatccaag atctacaggc tgaagactac
      841 agagtgctga taaaaccgaa gaactctgac tgaatgagtg gagagacgtg gtgtcttcat
      901 gactgggcaa ctccatgtgg tatagacgta aaccctccca cattgatctg tggatttaat
      961 accataccta tcaaaaacac agtggtggag gacagatcag ggatcgccag gtttagggat
     1021 ggggggattg tgtaactata aagaacgcaa gagagatttt tggggtggca gagctgttct
     1081 gggtcctgac ggtggcggtg gtggttacat aaatctatcc atgtgtcaaa cgtcagaaca
     1141 ctcattttac acttgggggc aacagaaatc cctccctctg gagggggtga ctgatggtaa
     1201 cctgattgct aattctggaa tcaggagccc tgtggtcagg tttctgctct gcaacttcct
     1261 gttggtaacc ttgggcaagt ctccgtccag agccttggtt ttctcatctg taaaaggaga
     1321 tgataggtcc ttttctgtcc actgcatagc tgattagtga aacatcatgg tgaaattctt
     1381 tatgaactat ggagtgcagc acatagactt gctttcattt tgtcagtatc ctttatagat
     1441 tgttcatgta agctcccaaa gagtagtatt tattttattg aaataaaatg cacgtagaga
     1501 aaaatgtgtg tatcatacat tgacagctga acccaccgtg taaccagcac ccacccaccc
     1561 agatcaatca taaaccgaac cgcaccagca ccccagcagc ccgttcccgt ttccgtaccc
     1621 tccacgtgga gcctccgttc tgtctcccaa cgccctgggt tagtttttat actttctgtc
     1681 atcggaatca cactgtaagt gctcttgggt ttagcttcct ttgctcaagc ttaccttgtg
     1741 cgattcattc atgttgttgt gaggagctgt ggatcatcca ttctccttgc tgtctgtggt
     1801 ggtttctgtg ttgtgaacac acacaatgta ttatccagcc tgccgtagat ggaggcagtt
     1861 ttgaagccat tataaacagg gctgatgtgc acattctgct ggagagaaac gggtcccagg
     1921 gtacaggtag gatgatcagc ttcggtagat cctgccggtt ttcccatgcg ctgtgcctgt
     1981 ctgcactcca ccaacggcga gcggaccttc cggtagttaa acatcttcac gaactcttgg
     2041 actttcctgc acacacagag aagataattt tggatggctc ttcccttccc cccacaacct
     2101 tccttagggc actggctttc aactgatgta aatatttact atgccaagca ctaggagggc
     2161 agagacaaac aagacaaagt cctcacactt agaaactccc ggtgtggcag ctgagatggc
     2221 ccaggaaaga actatattac cttcaaaaag agaggtacat gcgatgtttg aggtggcatg
     2281 aagctcagtg gtgttatatt ggaatgagtg agtgaccatc ctggagcctt cctgaaagag
     2341 gtgacttcat ttttaagtga ttttaaataa tagtttaatg aattagtatt tcgtattcag
     2401 ttaataacat ttttctgatt ttaggatttg ctatagaaat atttggaaac cgtaaagtag
     2461 aacaaaaaaa aaatgtagga atcatctgaa attccaaatt ctaccactca cagttaagtg
     2521 ttgttagatg ttagatgtgg gatattgcct tttaatttcc actctgcgcc gctaccccca
     2581 gcccctaccc cagagccgtc acttctggca ctggagcgca gcttgcgtgg tttttttttt
     2641 tttttttttt tttttgagac agagtcctgc ctgtcgccca ggctggagtg cagtggcgcg
     2701 atctcggctc actgcaactc cccctcccgg gttcacgcca ttctcctgcc tcagcctcct
     2761 gagtagctgg gactacaggc gcccgctacc tctcccggct aattttttgt atttttagta
     2821 gagacggggt ttcactgtgt tagccaggat ggtctcgatc tcctgacttc gtgatccgcc
     2881 cgcctcggcc tcccaaagtg ggcagatcac ctgaggtaga gagttcgaga ccagacctga
     2941 ccaacatgga ccccatctct actaaaaata caaaattggc cagggcatgg tggcgcaagc
     3001 tgctaatccc agccactcag ggaggctgag gctggaaaat tgcttgaacc cgacctgcag
     3061 gcatgcaagc ttggcgtaat catggtcata gctgttttcc t
//
LOCUS       RATCNRAA     2337 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Rat calcineurin A alpha mRNA, complete cds.
ACCESSION   D90035
KEYWORDS    calcineurin; calcineurin A alpha; calmodulin binding protein;
            calmodulin-dependent protein phosphatase; isoform.
SOURCE      Rat brain, cDNA to mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2337)
  AUTHORS   Ito,A., Hashimoto,T., Hirai,M., Takeda,T., Shuntoh,H., Kuno,T. and
            Tanaka,C.
  TITLE     The Complete Primary Structure of Calcineurin A, a Calmodulin
            Binding Protein Homologous with Protein Phosphatases 1 and 2A
  JOURNAL   Biochem. Biophys. Res. Commun. 163, 1492-1497 (1989)
  STANDARD  full staff_entry
COMMENT     These data kindly submitted in computer readable form by: Takayoshi
            Kuno Department of Pharmacology Kobe University School of Medicine
            7-5-1 Kusunoki-cho, Chuo-ku Kobe 650 Japan Phone:  078-341-7451
            x3273 Fax:    078-351-6531 Peptides, 78-329 and 391-414, seem to be
            putative catalytic domain and calmodulin binding domain,
            respectively.
FEATURES       from  to/span     description
    ORF         208     1773     calcineurin A alpha
    signal     1944     1950     polyadenylation signal
BASE COUNT      649 a    523 c    596 g    569 t
ORIGIN      
        1 cgggaggagg agtgaaggcg gcggcggcgg aggagggacg cgcggagccg gcagtaactt
       61 tcgagccagc ccagagcccg gagctccagc cgagcggttt gcagcgcggc ggcgcggcgc
      121 tgagtgtctg gcccgccggt gcggtcgggg tgtgcagtcg gacgggacca gcagcgcgtc
      181 gctgtccccc cctcccggtg actggagatg tccgagccca aggcgattga tcccaagttg
      241 tcgactacgg acagggtggt gaaagccgtt ccatttccgc caagtcaccg gctgacagca
      301 aaggaagtgt ttgataacga tgggaagcct cgtgtggata tcttaaaagc acatctcatg
      361 aaggaaggca ggctggaaga aagtgtcgcg ttgagaataa taacagaggg tgcttcgatt
      421 ctccgacagg aaaaaaactt gctggatatt gatgccccag tcacagtttg cggggacatc
      481 catggacaat tctttgactt gatgaagctc tttgaagtgg gaggatctcc tgccaacact
      541 cgctacctct tcttagggga ctatgttgac agagggtact tcagtatcga atgtgtgctg
      601 tatttgtggg ccttgaaaat tctttacccc aaaacactgt ttttacttcg tggaaaccat
      661 gaatgtaggc acctaacaga gtatttcacg tttaaacaag aatgtaaaat aaagtattca
      721 gaacgcgttt atgacgcctg tatggatgcc ttcgactgcc ttcccctggc tgcgctgatg
      781 aaccaacaat tcctgtgtgt acacggtggt ttgtctccag agattaacac tctagatgac
      841 atcagaaaat tagaccgatt caaagaacca cctgcttatg ggcctatgtg tgacatcttg
      901 tggtcagacc ccctggagga ctttggaaat gagaagactc aggaacattt cactcacaac
      961 acagtcaggg gttgttcgta cttctacagt tacccggctg tatgtgactt cctgcagcac
     1021 aataatttgt tgtccatact ccgagcccac gaagcccagg acgcagggta ccgcatgtac
     1081 aggaaaagcc aaacaactgg cttcccgtct ctaattacga tcttctcggc accaaattac
     1141 ttagatgtgt acaataataa agctgcagtg ttgaagtacg agaacaacgt gatgaacatc
     1201 aggcagttca actgctcccc ccatccgtac tggctcccaa atttcatgga tgttttcacc
     1261 tggtcgctgc catttgttgg ggagaaagtg actgagatgc tggtaaacgt cctgaacatc
     1321 tgctcagatg atgaactggg gtcagaagaa gatggatttg acggagccac ggctgcagcc
     1381 cggaaggagg tcatcaggaa caagatccga gcaataggca aaatggccag agtattctca
     1441 gttctcagag aagagagtga gagcgttcta actctgaagg gcctgacccc gactggcatg
     1501 ctccccagcg gagtgctctc tggcgggaaa caaactctgc aaagcgctac tgttgaggcc
     1561 attgaggctg atgaagccat caaaggattc tcaccacaac ataagattac cagcttcgag
     1621 gaggccaagg gcttagaccg aattaacgag aggatgccgc ctcgcagaga cgccatgcct
     1681 tccgacgcca accttaactc catcaacaag gctctcgcct cagagactaa cggcacagac
     1741 agcaacggca gtaatagcag caatattcag tgaccacttc ctgttcactt tttttttttg
     1801 agctgcaggg catgatgggt ttgctgcatc tcagcagttg gatgttcttg cctctgacgg
     1861 tagcttgttt gctctggggg ggccaggaat tggattcagt ttacactatc atgaaaaaaa
     1921 aaaagaggga gagagagaga gataataaaa ctatattttg gtgagggtgg tgattaaaca
     1981 cctcttttgg gtatgccttt aaaaatgctt ctaggaaaaa aaaagtttta aaaagaaagc
     2041 taatgctagt ctatacttca atgttagggg aatgaacacg ttttcctagc gcactgggga
     2101 cttttagata ggttaatgaa aggcctttta ttctgttact ggacacgaaa actttgtcta
     2161 atttcttata ctctattgta cgtttacagt cgcagcacta aaaatggatg acatcaaaca
     2221 tttttaaaca gaaaaaaaag atgtacaaac taaataagga ctatttattg ataatgtttt
     2281 gctactcttg tcagacaatg gctataaact gaattaggca gtcttaaaaa aaaaccg
//
LOCUS       PHALPO       5710 bp ds-DNA             PLN       15-AUG-1990
DEFINITION  P.chrysosporium lignin peroxidase genes, complete cds.
ACCESSION   M37701 M22720
KEYWORDS    lignin peroxidase.
SOURCE      P.chrysosporium (strain BKM-F-1767 (ATCC 24725)) DNA.
  ORGANISM  Phanerochaete chrysosporium
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes; 
            Agaricales; Corticiaceae.
REFERENCE   1  (bases 3402 to 5365)
  AUTHORS   Walther,I., Kaelin,M., Reiser,J., Suter,F., Fritsche,B.,
            Saloheimo,M., Leisola,M., Teeri,T., Knowles,J.K.C. and Fiechter,A.
  TITLE     Molecular analysis of a Phanerochaete chrysosporium lignin
            peroxidase gene
  JOURNAL   Gene 70, 127-137 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 3543; 5096 to 5710)
  AUTHORS   Huoponen,K., Ollikka,P., Kaelin,M., Walther,I., Maentsaelae,P. and
            Reiser,J.
  TITLE     Characterization of lignin peroxidase-encoding genes from
            lignin-degrading basidiomycetes
  JOURNAL   Gene 89, 145-150 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by J.Reiser, 22-FEB-1989.
FEATURES       from  to/span     description
    pept        652      712     lignin peroxidase lpoB, exon 1
                770      923     lignin peroxidase lpoB, exon 2
                976     1032     lignin peroxidase lpoB, exon 3
               1087     1301     lignin peroxidase lpoB, exon 4
               1354     1395     lignin peroxidase lpoB, exon 5
               1474     1552     lignin peroxidase lpoB, exon 6
               1603     2026     lignin peroxidase lpoB, exon 7
               2079     2143     lignin peroxidase lpoB, exon 8
               2197     2218     lignin peroxidase lpoB, exon 9
    pept       5098     5038 (c) lignin peroxidase lpoA, exon 1
               4977     4824 (c) lignin peroxidase lpoA, exon 2
               4770     4714 (c) lignin peroxidase lpoA, exon 3
               4659     4445 (c) lignin peroxidase lpoA, exon 4
               4391     4350 (c) lignin peroxidase lpoA, exon 5
               4296     4218 (c) lignin peroxidase lpoA, exon 6
               4167     3744 (c) lignin peroxidase lpoA, exon 7
               3689     3625 (c) lignin peroxidase lpoA, exon 8
               3562     3541 (c) lignin peroxidase lpoA, exon 9
    IVS         713      769     lpoB intron A
    IVS         924      975     lpoB intron B
    IVS        1033     1086     lpoB intron C
    IVS        1302     1353     lpoB intron D
    IVS        1396     1473     lpoB intron E
    IVS        1553     1602     lpoB intron F
    IVS        2027     2078     lpoB intron G
    IVS        2144     2196     lpoB intron H
    IVS        2219      769     lpoB intron I
    IVS        5037     4978 (c) lpoA intron A
    IVS        4823     4771 (c) lpoA intron B
    IVS        4713     4660 (c) lpoA intron C
    IVS        4444     4392 (c) lpoA intron D
    IVS        4349     4297 (c) lpoA intron E
    IVS        4217     4168 (c) lpoA intron F
    IVS        3743     3690 (c) lpoA intron G
    IVS        3624     3563 (c) lpoA intron H
BASE COUNT     1204 a   1613 c   1594 g   1299 t
ORIGIN      
        1 agctcacttt acctatacac atctgcattc agtccttcca gttctctgac cctaacatcc
       61 ggtaaatgta ccttcagtga tcgggacgga aggtatgggc ctttcgcata ggtgggtaat
      121 ctgcgactgt atgttttgta tggtaccctg agacagtcac ttactgtttc tgctcgctcc
      181 aggtaccatt gtcccgcctc tgcgtgattt ccgaggctgg actggcccat ctctgcccac
      241 cctgtcctca tctgccaaga gccatcggaa tgccaagccg tgaccactcc aaccggtccc
      301 gttctctcag ccactgcgca agtttcttac aggagggctg cttcgccgtt cattcgcggc
      361 ctccggatag ctagcgagct tcgatgctcg tggccaatta tggaagcagt cgttgatcgc
      421 accggtcccg tactgccttc gctcacaagc cgtgttgttg cgagactctc attcgctggc
      481 tcagggtatt gtgcctgttt gctgaggcac agtgcagtca atacacactt gtctcgtcag
      541 gacgcggttt gacattccgt ggtgcgtgaa acggtataaa agggatacgc gatttgcagc
      601 atatcctcag gccattcgtc ttctacagcc caagttccaa gtcaaacggt catggccttc
      661 aagcagctcg tcgcagcgat ttccctcgca ctctcgctca ccactgccaa tggtacgcac
      721 cgcttctgca tgctgtgata acgggccccg actaacgcct ccgctgcagc cgccgtggtc
      781 aaggagaagc gcgccacctg ctccaacggc gccaccgttg gcgacgcgtc ctgctgtgct
      841 tggttcgatg tcctcgacga catacagcag aacctgttcc aaggaggcca gtgcggcgct
      901 gaggcccacg agtctatccg tctgtaagtc aatacgctgg tgttgcgcca aggtcataga
      961 ttcactttgc tgcagcgtgt tccacgatgc tattgccatc tctcctgcta tggaggccca
     1021 gggcaagttc gggtatgtct ttccggcatg gcaatatttt acagcagaca ctgagatatt
     1081 gcgcagcggt ggtggtgctg acggctccat catgatcttc gacgacatcg agcccaactt
     1141 ccaccctaac attggcctcg acgagattat caacctccag aagccgttcg tccagaagca
     1201 cggtgtcacc cctggtgact tcatcgcctt cgccggtgct gtcgcgctca gcaactgccc
     1261 gggtgcccca cagatgaact tcttcactgg tcgtcgtcct ggtacgtctc ctctacgaat
     1321 cgatctcgac acctcattca tatcgcctta tagctaccca gcccgcaccc gatggtctcg
     1381 ttcccgagcc tttccgtgag tttgcagacc acttcatcgc atagttctta gctgacctct
     1441 tcatcgcata gttcttagct gacttcagca cagacaccgt cgaccagatc atcgctcgtg
     1501 ttaacgatgc cggcgagttc gacgagctcg agcttgtctg gatgctttcc gcgtaagtga
     1561 ctgccgcctc gaatttccat cccgacttac accccgattc agccactccg ttgctgcagt
     1621 caacgacgtg gacccgaccg tccagggcct gcccttcgac tccacccccg gaatcttcga
     1681 ctcgcagttc ttcgtcgaga ctcagttccg tggtatcctc ttccccggct ccggtggcaa
     1741 ccagggtgag gtcgagtccg gtatggctgg cgagatccgc atccagaccg accacactct
     1801 cgcccgcgac tcccgcaccg cttgcgagtg gcagtcgttc gtcaacaacc agtccaagct
     1861 cgtctccgac ttccagttca tcttccacgc cctcacccag ctcggccagg acccgaacgc
     1921 gatgaccgac tgctcggatg tcatcccgat ctcgaagccc atccccggca accttccgtt
     1981 ctcgttcttc ccccctggca agagcatgaa ggatgttgag caggctgtag tatccgattc
     2041 agtccttgtc gcagagctta tgctgacggc ttctgcagtg cgccgagacc cccttcccca
     2101 gcctcgtcac tctccccggc cccgcgacct ctgtcgctcg catgtgagta tctccgacgg
     2161 tctatgaagc ccccagctga catattcctc ttccagcccc ccgccgccgg gtgcttaagt
     2221 cattctatcg gtcatctttg gctgaaacgg agtatttgga atacggctca ctcgtaacgg
     2281 taacttgcgc tcaagtgttt agaaatgtct cctttgtatc tacgcgattg gtccgctttt
     2341 gacgatagat cgttactgtg ttcattgaaa ttctcgtccg cgcgccctgg agcgaaccgg
     2401 ttagcattgc cacacgagag ctcttccgtt gctccaactc gagctgtaat ggtccaacgc
     2461 tccacgctac atcaatttaa cctctcatgg gtacggtgta ttcggcaagt ttatctcaca
     2521 taataagagg cacgctatca ttcgacgata caagaacatg agccttcgct tcgtttatga
     2581 tattggttca ctgtcgagct aatttctgag ggttagcgct ctgacatgat cagctacagg
     2641 aacggaggcc gtaccttgaa tgtgcccata aacccgctgt cttattcttc tcaaattgat
     2701 tcttcatgtt tgaatcacgt ttgcaggtgc attcgtgtac ctgcggcgcg tacacgcggt
     2761 atgtattggt cgcaaatcgc atcatggtga gatcttgctc ttcactcttg aagttgctac
     2821 cgtataccac catgtgcagg aattctcgta catccctgtt tctcctcgaa tgtatgtgga
     2881 gccagggaaa ccctaacccc ggattctgct gagatgcgtc gatgcacgca gccgtagcgg
     2941 aggtccgtga ggtccgctcc ggccacgaag caggggccgt cctgaccggt cgaaggtcat
     3001 gtcgtgcgac atagtcggct tccaggagga cgatatcgac caatacgtcg aaaggaggag
     3061 actgcgggtc taggctggac gctgtttgcg agggcccggg ggagaacgag gccattggga
     3121 gtcagcgaga ttattgaata gtcgaagggt attcattgag tcactaaggg aaacacttct
     3181 gagccgctgg tagtacttgt gtatgcccgg gttctgcgcc tgataattag cctcgctcct
     3241 ccgttgacgt tgggttttgg caataggaca tcaccacttt caccacgcgg acgcaatgcg
     3301 aagggcacga gtggtatctc aatagctagt taccttccaa gaccctcaat catgatcgga
     3361 agaagaggat gtgcaccgat atttcataag cccacggcag atatcgtaag agagtagacg
     3421 aatgagattc gtagttaggt gcagagatac gatgaatgaa atctagtaaa gccgaagttc
     3481 cgtcacgagt tagccggcca ccgttacagt cggtttgagg agtattctgt atggcatcat
     3541 ttaagcaccc ggaggcggag ggctggagaa ggagcatgtc agcccagatt gcatttcctg
     3601 aaagatctca tggattgtac tcacatgcgc tggacggacg tctcggggcc cgggagagtg
     3661 gtgagagtcg ggaagggggt ctccgcacac tgtcatgcga tgttcagcag ccactctact
     3721 gcatggtggg gtgaaatacg caccgcctgc tcaacgtcct tgatggtctt gccagcgggg
     3781 aagaacgaga atgggaggtt gccagggatg ggcttggact gcgggataac atccgagcag
     3841 tcggtcatcg cgttcgggtc ctggccgagc tgggtgaggg cgaggaagat gaactggaag
     3901 tcatcgacga gcttggactg gttgttgacg aaggactgcc attcacacgc cgtgcgcgag
     3961 tcgcgggcga tagtgtggtc ggactggatg cgaatttcgc cagggagcgg cgactcgacc
     4021 tcgccttggt tgccaccaga gccggggaag gcggtaccac gaagctgagt ctcgacgaag
     4081 aactgggagt cgaagattcc gggggtcgag tcaaagggca gaccctggac ggtcgggtcg
     4141 acgtcgttca ccgctgcgac ggagtgcctg tcgaggtctc aggaagggag tgtcgaagtc
     4201 aacagtgagt gacttacgcg gagagcatcc agacaagctc gagctcatcg aactcgcctg
     4261 cgtcgttgac acggttgatg atttggtcga cagtgtctgc atgctagtca gtatagaccg
     4321 cacctaactg cttggataag accacttacg gaagggctcg gggacaaggc catcaggagc
     4381 gggctgggta gctaaagcag acagttagtt cgtaccatcc gcaaagcgag ttttgcaggt
     4441 ataccaggtg cacgaccagt gaagaagttc atctgcgggg caccagggca gttgctgagc
     4501 gcgacacgac cagcgaaggc gatgaagtca ccaggggtga caccgtgctt ctgaacgaat
     4561 ggcttctgga gcttgacgat ctcgtcgaga ccgatgttag ggtggaacgc agtctcgata
     4621 tcgtcgaaga tcatgatgga gccgtcagca ccaccgccgc tgcaaggagg gatcagcaaa
     4681 cgactaggtg gcgcaacgcg ggtggcaact tacccgaact tgccctgtgc ctccatggcg
     4741 ggcgaaattg cgatggagtc gtggaagacg ctgggcgggg tgttcaaaca tgcatagcag
     4801 gagatcgcga cgggatcact cacagacgaa tcgactcgtg cgcctcagcg ccgcactggc
     4861 cgccgtggaa caggttctgc tggatatcat ccaggacgtc gaaccaagcg cagcacgacg
     4921 catcgccgac ggtcttgccg ttggaacagg tggcgcgctt ctcgatcacc gcagccgctg
     4981 cacaagacga cgttcagcat gcagtccact ggtcaacgct aactgcgatg ggcataccgt
     5041 tcgcagccga gagcaagaga gcgagagaga tagctgcgaa gagctgcttg aaggccatgt
     5101 ccgctgtgtt gctggtgctg agtgggactg aagagactgg atgtctgagg gactgcggtg
     5161 gtcctgtcgc ccttttatac cctaggcgtg gtcgacgtcc tggtattgtt cgccgtagaa
     5221 cagtgtcgaa tcgacgtgac gcggtgcgcg gacatgcacg acactgcgcc agccaatgag
     5281 gacgctgcca aaacgcagcc tgtgagcgag ttggtgcggt gccggcaacc atcaccgact
     5341 cgtctcacat ttgggccact gcgtcgagcg cagttcgcgc cggcaccgct gttgaatagc
     5401 acgcgagctc tgcaagaaag aatagggcgg cccatgagaa cagaaatccg agtcagagga
     5461 attaactgcg cgtgccgatg agtcttgaca tgaggatgat ctaacgaaga gaccttgcat
     5521 tgagccgttt ccagtgctgc caggggtaat cagtcggcat tactgccaag tccggggatg
     5581 tactgctagc tcactcccat cgcaatatgt caccgagtat tgcctttgtg aacataccat
     5641 tgattcggtc ccgatcatgc acgaacgact cccgcaaagt ggggccagtg actatcacgt
     5701 ccgtgctcag
//
LOCUS       XANXCAA      2333 bp ds-DNA             BCT       15-AUG-1990
DEFINITION  X.campestris major extracellular endoglucanase (engXCA) gene,
            complete cds.
ACCESSION   M32700
KEYWORDS    major extracellular endoglucanase.
SOURCE      X.campestris DNA.
  ORGANISM  Xanthomonas campestris
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Pseudomonadaceae.
REFERENCE   1  (bases 1 to 2333)
  AUTHORS   Gough,C.L., Dow,J.M., Keen,J., Henrissat,B. and Daniels,M.J.
  TITLE     Nucleotide sequence of the engXCA gene encoding the major
            endoglucanase of Xanthomonas campestris pv. campestris
  JOURNAL   Gene 89, 53-59 (1990)
  STANDARD  simple staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.L.Gough, 09-MAR-1990.
FEATURES       from  to/span     description
    pept        383     1864     major extracellular endoglucanase (engXCA)
                                 precursor
    sigp        383      457     major extracellular endoglucanase signal
                                 peptide (put.)
    matp        458     1861     major extracellular endoglucanase (put.)
BASE COUNT      444 a    800 c    734 g    355 t
ORIGIN      
        1 gaattcccgg ggatcacaaa cgacgcgaac aagccgacct gcgggtccac gcctgcgacg
       61 aacgcaaagg cgatgacttc gggaatcagg gcgaacgtgg caacggcgcc agccatcagt
      121 tcgcgcgcag gcgaggcgcc attgcgccag ttcggtgcgc aggaaggaca tgggggacac
      181 tccagggaca agaacgacat gcctgcggac agcgcgcagg gggcactagt gtgcgggaaa
      241 cggccgctcc cgcagccgcg atgtgatcgg tgcggcaatg gtgttttctg tggggacgat
      301 cacaccacgc gacgcgcgca cagaccaaga tgcccgcctt accgcgctcg ggtgtcgagc
      361 ccggttctct agggagatca ccatgtccat attcaggacc gcaagcacgc tcgctttggc
      421 caccgccctc gcactggccg ccgggccggc cttcagctat tccatcaaca acagcaggca
      481 gatcgtcgac gacagcggca aggtcgtgca gctcaagggt gtgaacgtgt tcggcttcga
      541 aaccggcaac cacgtgatgc atggcctgtg ggcacgcaac tggaaggaca tgatcgtgca
      601 gatgcagggc ctgggcttca acgccgtgcg cctgccgttc tgcccggcca cgctgcgtag
      661 cgacaccatg ccggccagca tcgactacag ccgcaacgcc gacctgcagg gcctgacctc
      721 gctgcagatc ctcgacaagg tgatcgccga attcaatgcg cgcggcatgt atgtgctgct
      781 ggatcaccac acccccgatt gcgccggcat ttccgagctc tggtacaccg gctcctatac
      841 cgaggcacag tggctggccg acctgcgctt tgtggccaac cgctacaaga acgtgccgta
      901 tgtactcggc ctggatctga agaacgaacc gcacggcgcc gccacctggg gtaccggcaa
      961 cgccgccacc gattggaaca aggctgccga gcgcggctcg gccgcggtgt tggcggtcgc
     1021 gccgaagtgg ctgatcgcgg tggaaggcat caccgacaac ccggtgtgct ccaccaacgg
     1081 cggcatcttc tggggcggca acctgcagcc gctggcctgc accccgctca acatcccggc
     1141 caaccgcctg ctgctggccc cgcacgtgta cggcccggac gtgttcgtgc agtcgtactt
     1201 caacgacagc aacttcccca acaacatgcc cgccatctgg gaacgccatt tcggtcagtt
     1261 cgccggcacg catgcgctgt tgctgggcga gttcggtggc aagtacggcg aaggcgacgc
     1321 acgcgacaag acctggcagg acgcgctggt gaagtacctg cgcagcaagg gcatcaacca
     1381 gggcttctac tggtcgtgga atcccaacag cggcgacacc ggcggcatcc tgcgcgatga
     1441 ctggaccagc gtgcgccagg acaagatgac cctgctgcgc acgctgtggg gcaccgccgg
     1501 caataccacg ccgacgccga ctcccacacc tacgcccaca ccgacaccga cgcctacccc
     1561 cacgccgacg cccaccccgg gcaccagcac cttcagcacc aaggtgatcg cctcgccggt
     1621 ggtggggtcg gcagcgcgaa aactgccggc ggcatcgcgg ctggcttgcc attggccggc
     1681 cagcagcacg ggttggagag tctgggtcat cgcggcacct tcggttacgt ggaagcgccc
     1741 gcacgcagca cgggcgatcg aacggcggat gagggtaacg cgcctgcgac gtgccacccg
     1801 tttgaatcgt ggaccactac cggcaccggc ccatacaacg cagcacgcac cgcggctgcg
     1861 ctaaacaagg ccgcgcgacg gcggtggcgc gtgctcagtg caggctgggc gcggtggcga
     1921 tggcgtggtc gatcaccttc agcgctgcct cgcgctcggc accgtccacc accaggcgtg
     1981 gcgcacggac acgctcgctg cccaggccca ccttttcctg caccagtttg atcagctgca
     2041 cgaacttggg cacggtatcc aggcgcagca gcggcaggaa ccagtcgtac agttccttgg
     2101 cggcggggta accgccgtcg cgtgccagtt cgaacaggcg taccgactcc ttcggactac
     2161 tgtacttgac cagcccggcg atccacccct tggcgcccat gctcaggcct tcgacgatgg
     2221 cgtcgtccat gccgaccagc agcgccagac gatcgcccag caattcctgc agcgcggcga
     2281 agcggcgcac atcgccggaa gattccttta ctgcctgcag gattggggaa ttc
//
LOCUS       FLAHANENJ8   1458 bp ss-RNA             VRL       15-AUG-1990
DEFINITION  Influenza virus A/NJ/8/76 (H1N1) hemagglutinin/neuraminidase (seg
            4) gene, complete cds.
ACCESSION   M27970
KEYWORDS    hemagglutinin/neuraminidase.
SOURCE      Influenza virus A/NJ/8/76, cDNA to viral RNA, clones pNA[6,28],
            passed in embryonated eggs.
REFERENCE   1  (bases 1 to 1458)
  AUTHORS   Miki,T., Nishida,Y., Hisajima,H., Miyata,T., Kumahara,Y.,
            Nerome,K., Oya,A., Fukui,T., Ohtsuka,E., Ikehara,M. and Honjo,T.
  TITLE     The complete nucleotide sequence of the influenza virus
            neuraminidase gene of A/NJ/8/76 strain and its evolution by
            segmental duplication and deletion
  JOURNAL   Mol. Biol. Med. 1, 401-413 (1983)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept         21     1430     hemagglutinin/neuraminidase precursor
    sigp         21      125     hemagglutinin/neuraminidase signal peptide
    matp        126     1427     hemagglutinin/neuraminidase
BASE COUNT      462 a    257 c    343 g    396 t
ORIGIN      
        1 agcaaaagca ggagtttaaa atgaatacaa atcaaagaat aataaccatt gggacaatct
       61 gtctaatagt tggaataatt agtctattat tgcagatagg aaatataatc ttgttatgga
      121 tgagccattc aattcagact ggagaaaaaa gccatcctaa ggtatgcaac caaagtgtca
      181 ttacctatga aaacaacaca tgggtgaacc agacttatgt aaacattagc aataccaata
      241 ttgctgctgg acagggtgtg actccaataa tactagccgg caattcctct ctttgcccaa
      301 tcagtgggtg ggctatatac agcaaagaca atagcataag gattggttcc aaaggagaca
      361 tttttgtcat gagagagcca ttcatttcat gctctcactt ggaatgcaga accttttttc
      421 tgacccaagg cgctttgctg aatgacaggc attctaatgg aaccgtcaag gacaggagtc
      481 cttatagaac cttaatgagc tgccccatcg gtgaagctcc atctccgtac aattcaaggt
      541 tcgaatcagt tgcttggtca gcaagtgcat gccatgatgg aatgggatgg ctaacaatcg
      601 ggatttccgg tccagataat ggagcagtgg ctgttttaaa atacaatggt ataataacag
      661 atacaataaa aagttggaga aacaaaatat taagaacaca agagtctgaa tgtgtttgta
      721 taaacggttc gtgttttact ataatgactg acggcccaag caatgggcaa gcctcgtaca
      781 aattattcaa aatggagaaa gggaagatta ttaagtcaat tgagctggat gcacctaatt
      841 accactatga ggaatgctcc tgttaccctg atacaggcaa agtggtgtgt gtgtgcagag
      901 acaattggca tgcttcgaat cgaccatggg tctctttcga tcagaatctt gattatcaaa
      961 tagggtacat atgcagtggg gttttcggtg ataatccgcg ttctaatgat gggaaaggca
     1021 attgtggccc agtactttct aatggagcaa atggagtgaa ggggttttca tttagatatg
     1081 gcaatggtgt ttggatagga agaactaaaa gtatcagctc tagacgtgga tttgagatga
     1141 tttgggatcc taatggatgg acagaaactg atagtagttt ctctatgaag caagatatta
     1201 tagcattaac tgattggtcg ggatacagtg gaagttttgt ccaacatcct gaattaacag
     1261 gaatgaactg cataaggcct tgtttctggg tagagttaat cagagggcaa cccaaggaga
     1321 gcacaatctg gactagtgga agcagcattt ctttctgtgg cgtgaacagt ggcactgcaa
     1381 gctggtcatg gccagacgga gctgatctgc cattcaccat tgacaagtag tttatccaaa
     1441 aaactccttg tttctact
//
LOCUS       HUMHIS3PRM   1125 bp ds-DNA             PRI       15-AUG-1990
DEFINITION  Human histone H3 gene, complete cds.
ACCESSION   M26150
KEYWORDS    histone.
SOURCE      Human HeLa cell DNA, clone pST519.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1125)
  AUTHORS   Marashi,F., Helms,S., Shiels,A., Silverstein,S., Greenspan,D.S.,
            Stein,G. and Stein,J.
  TITLE     Enhancer-facilitated expression of prokaryotic and eukaryotic genes
            using human histone gene 5' regulatory sequences
  JOURNAL   Biochem. Cell Biol. 64, 277-289 (1986)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept        557      964     histone H3 /hgml_locus_uid="LV0006C"
                                 /nomgen="H3F2" /map="1q21"
    mRNA        520  >   964     histone H3 mRNA (5' end + / - 4 bp)
    signal      422      425     CAAT box
    signal      463      468     CAAT box
    signal      485      492     TATA box
BASE COUNT      298 a    283 c    267 g    277 t
ORIGIN      
        1 gcagcggcgt gataacagct cactgtaacc tcgaactcgg gctcaagcga tcctcatcga
       61 cagccttctg agtagctggg attacaggcg agagcgccac gcccgactaa gagcattttc
      121 taattgccca cacttcttat gcgacaccca gaaaaataca attttaaata aagcgcatat
      181 gcaaataacc ctaatcgtct ccaatattca ctgatttctt ttttatattt taactagaaa
      241 caattggagg tttccgcgtt gctttgtgtg gttgtaaatt ttaagacttc aggaaacttt
      301 tccagtacaa gacttgtcca acagtggata tagcagctaa ggggttaaca aaatgacgtc
      361 agagtagcta cggtaatggg caggagcctc tcttaatctg caaccaagca cagagatgga
      421 ccaatccagg aagggcgcgg ggatttttga atttacttgg gtccaatggt tggtggtctg
      481 actctataaa agaagagtag ctctttcctt tcctccacag acgtctctgc aggcaaagct
      541 tttctgtggt tttgccatgg ctcgtactaa acagacagct cggaaatcca ccggcggtaa
      601 agcgccacgc aagcagctgg ctaccaaggc tgctcgcaag agcgcgccgg ctaccggggg
      661 cgtgaaaaag cctcaccgtt accgcccggg cactgtggct ctgcgcgaga tccgccgcta
      721 ccaaaagtcg accgagttgc tgattcggaa gctgccgttc cagcgcttgg tgcgagaaat
      781 cgcccaagac ttcaagaccg atcttcgatt ccagagctcg gcggtgatgg cgctgcagga
      841 ggcttgtgag gcctacttgg tagggctctt tgaggacaca aacctttgcg ccatccatgc
      901 taagcgagtg actattatgc ccaaagacat ccagctcgct cgccgcattc gcggagaagc
      961 gtaaatgtaa agtcactttt tcatcagtct taaaacccaa aggctctttt cagagccacc
     1021 cacttattcc aacgaaagta gctgtgataa ttttttgttg tcttaacaga acaaatttct
     1081 aaggaccccc ccggaaagca ttagactatg gcttaaagtt gatac
//
LOCUS       MUSTUBMA1     786 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Mouse alpha-tubulin gene M-alpha-1, 3' end.
ACCESSION   M28729
KEYWORDS    alpha-tubulin.
SOURCE      Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 786)
  AUTHORS   Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
  TITLE     Five mouse tubulin isotypes and their regulated expression during
            development
  JOURNAL   J. Cell Biol. 101, 852-861 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      597     alpha-tubulin (AA at 1)
    signal      773      778     poly-A signal
BASE COUNT      186 a    187 c    210 g    203 t
ORIGIN      
        1 gaattccaga ccaacctggt accctaccct cgtatccact tccctctggc cacttatgcc
       61 cctgtcatct ctgctgagaa agcctaccac gagcagcttt ctgtagcaga gatcaccaat
      121 gcctgctttg agccagccaa ccagatggtg aaatgtgacc ctcgccatgg taaatacatg
      181 gcttgctgcc tgctgtaccg tggtgatgtg gttcccaaag atgtcaatgc tgccattgcc
      241 accatcaaga ccaagcgtac catccagttt gtggactggt gccccactgg cttcaaggtt
      301 ggcattaact accagcctcc cactgtggta cccggtggtg acctggccaa ggtgcagaga
      361 gctgtgtgca tgctgagcaa caccacagcc attgctgagg cctgggctcg cctagatcac
      421 aagtttgatc tgatgtatgc caagcgtgcc tttgtgcact ggtatgtggg tgagggcatg
      481 gaggagggtg agttctctga ggcccgtgag gacatggctg ccctagagaa ggattatgag
      541 gaggttggtg tggattctgt ggaaggcgag ggggaggaag aaggagagga atactaaatt
      601 aaatgtcaca aggtgctgct tccacaggga tgtttattgt gttccaacac agaaagttgt
      661 ggtctgatca gttaatttct atgtggcaat gtgtgctttc atacagttac tgacttatga
      721 atgattgatt ttgacagaga ccccaagctg cccatttcac ttatgggttt taaataaaat
      781 actccc
//
LOCUS       MUSTUBMA2    1198 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Mouse alpha-tubulin gene M-alpha-2, 3' end.
ACCESSION   M28727
KEYWORDS    alpha-tubulin.
SOURCE      Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1198)
  AUTHORS   Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
  TITLE     Five mouse tubulin isotypes and their regulated expression during
            development
  JOURNAL   J. Cell Biol. 101, 852-861 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1     1059     alpha-tubulin (AA at 1)
BASE COUNT      259 a    329 c    303 g    307 t
ORIGIN      
        1 gcaaataact atgcccgtgg ccactacacc attggcaagg agatcattga ccttgtcctg
       61 gacaggattc gcaagctggc tgaccagtgc acgggtctcc agggcttgtt cgttttccac
      121 agctttggcg ggggaactgg ctctggcttc acctccctgc tgatggagcg gctctctgtg
      181 gattacggaa agaagtccaa gctggagttc tccatttacc cagcccccca ggtttccact
      241 gctgtggttg agccctacaa ttccatcctc accacccaca ccaccctgga gcactctgat
      301 tgtgccttca tggtagacaa tgaggccatc tatgacatct gtcgtagaaa cctcgacatt
      361 gagcgcccaa cctacaccaa ccttaaccgc cttattagcc agattgtgtc ttccatcact
      421 gcttccctca gatttgatgg ggccctcaat gttgatctga cagaattcca gaccaacctg
      481 gtaccctacc ctcgcatcca cttccctctg gccacttatg cccctgtcat ctctgctgag
      541 aaagcctacc atgagcagct ttctgtagca gagatcacca atgcctgctt tgagccagcc
      601 aaccagatgg tgaaatgtga ccctcgccat ggtaaataca tggcttgctg cctgctatac
      661 cgtggtgatg tggttcccaa agatgtcaat gctgccattg ccaccatcaa gaccaagcgc
      721 acgatccagt ttgtagactg gtgccccact ggcttcaagg ttggcattaa ttaccagcct
      781 cccactgtgg tacccggtgg tgacctggcc aaggtgcaga gagctgtgtg catgctgagc
      841 aacaccacag ccattgctga ggcctgggct cgcctagatc acaagtttga tctgatgtat
      901 gccaagcgtg cctttgtgca ctggtatgtg ggtgagggca tggaggaggg tgagttctct
      961 gaggcccgtg aggacatggc tgccctagag aaggattatg aggaggttgg tgtggattct
     1021 gtggaaggcg agggggagga agaaggagag gagtactaag tccattcctt gagccccctg
     1081 tgtcgtcaaa tgctccagta ttagttgcag gcacctgatg cttctgtgct gtttccattc
     1141 tgtgatcatg tcttctccat gttgtacctc ttaagttttc catgatgtct caaactaa
//
LOCUS       MUSTUBMB2     488 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Mouse beta-tubulin gene M-beta-2, 3' end.
ACCESSION   M28739
KEYWORDS    alpha-tubulin.
SOURCE      Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 488)
  AUTHORS   Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
  TITLE     Five mouse tubulin isotypes and their regulated expression during
            development
  JOURNAL   J. Cell Biol. 101, 852-861 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      303     beta-tubulin (AA at 1)
    signal      472      477     poly-A signal
BASE COUNT      121 a    116 c    131 g    120 t
ORIGIN      
        1 cccaacaacg tcaagacggc cgtgtgtgac atccctcctc gtggcctcaa gatgtcagcc
       61 accttcattg gcaacagcac tgccatccag gagctgttca agcgcatctc ggagcagttc
      121 actgccatgt tccggcgcaa ggctttcctg cactggtaca cggctgaggg catggacgag
      181 atggagttca ccgaggcgga gagcaacatg aatgacctgg tgtctgagta ccagcagtac
      241 caggatgcca cggccgatga gcagggcgag ttcgaggagg aggagggtga agatgaggct
      301 tgagaacttc tcagatacag tgtgcaccct tagtgaactt ctgttgtcct ccagcattgg
      361 tctttctatt tgtaaattat ggtgctcagt ttgcctctgt cagaaattca ctgttgatgt
      421 aatagtgtga acctctttca agatcacagt attgtctcag aaatctatat gaataaaaaa
      481 gcatgtgg
//
LOCUS       MUSTUBMB4    1454 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Mouse beta-tubulin gene M-beta-4, 3' end.
ACCESSION   M28730
KEYWORDS    alpha-tubulin.
SOURCE      Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1454)
  AUTHORS   Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
  TITLE     Five mouse tubulin isotypes and their regulated expression during
            development
  JOURNAL   J. Cell Biol. 101, 852-861 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1     1140     beta-tubulin (AA at 1)
BASE COUNT      327 a    456 c    376 g    295 t
ORIGIN      
        1 gtcgacctgg aacccggcac catcgactct gtccgctccg gcccttttgg ccagatcttt
       61 cggccagaca actttgtatt tggtcaatcc ggagcaggca acaactgggc caagggtcac
      121 tacaccgagg gcgcgcagtt agtggatgcc gtcctggacg tggtgcgcaa agaggcggaa
      181 agctgcgact gtctccaggg cttccagctc acccactcgc tcggaggtgg caccggctca
      241 ggcatgggga ccttgctcat cagcaagatc cgagaggagt ttccagacag gatcatgaat
      301 acgttcagcg tggtgccatc acccaaggtg tctgacacgg tggtggagcc ctacaatgcc
      361 acactgtctg tgcatcagct ggtggagaac actgatgaga cctactgcat cgacaacgag
      421 gccctgtacg acatctgctt ccgtacgctc aagctgacca cgcccacgta cggggacctc
      481 aaccacctcg tgtcagccac catgagtgga gtcaccacct gcctacgttt cccgggccag
      541 ctcaatgcag acctacgcaa gctggctgtg aacatggtgc cattcccccg tctccacttc
      601 ttcatgccag gattagcacc cttgaccagc aggggcagcc agcagtaccg ggccctcacc
      661 gtccctgagc tgacccaaca ggtgttcgat gctaagaaca tgatggctgc gtctgacccg
      721 agacacggtc gctacctgac tgtggctgct gtcttccggg gacggatgtc catgaaggag
      781 gtagacgagc agatgttaag tgtgcagagc aagaacagca gttacttcgt tgagtggatc
      841 cccaacaatg tgaaggcagc cgtatgtgac atcccgcccc gcggcctgaa gatggcagcc
      901 accttcatcg gcaacagcac tgccatccag gagctgttca agcgcatctc ggagcagttc
      961 accgccatgt tcagacgcaa ggccttcctg cactggtaca cggccgaagg catggacgag
     1021 atggagttta cggaagcaga gagcaatatg aacgacctgg tgtccgagta ccagcagtac
     1081 caggatgcca ctgctgaaga gggcgagttc gaagaggagg ctgaagagga ggtggcttaa
     1141 gtctcctgcc atcactctgt ccctggggcc caccagcaaa gctttgaccc taagcatcac
     1201 acccctgcac ctagttgcct cattccctag gaccccatga gcatcttcac catgaggcca
     1261 agcccaggtt gcttctattt gcttcacctt taactcctaa accccactgt ctctccaacc
     1321 tgccagggaa gggctcttct agttcccatg agcgcccctc aacacatgta cacacgcaca
     1381 cacactccac cttcttagat cttgaaaatc ctttccttta tgccctgtcc cttccccagc
     1441 actcctgaac cgat
//
LOCUS       MUSTUBMB5     542 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Mouse beta-tubulin gene M-beta-5, 3' end.
ACCESSION   M28732
KEYWORDS    alpha-tubulin.
SOURCE      Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 542)
  AUTHORS   Lewis,S.A., Lee,M.G.-S. and Cowan,N.J.
  TITLE     Five mouse tubulin isotypes and their regulated expression during
            development
  JOURNAL   J. Cell Biol. 101, 852-861 (1985)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      363     beta-tubulin (AA at 1)
    signal      524      529     poly-A signal
BASE COUNT      126 a    136 c    149 g    131 t
ORIGIN      
        1 gaggtggatg agcagatgct caatgtgcag aacaagaata gcagctactt cgtggaatgg
       61 atccccaaca atgtcaagac agctgtctgt gacatcccac cgcgtggcct caagatggca
      121 gtcaccttca ttggaaacag cacagccatc caggagctgt tcaagcgcat ctctgagcag
      181 tttacggcta tgttccgccg gaaggctttc ctccactggt acacggctga gggcatggac
      241 gagatggagt tcaccgaggc tgagagcaac atgaacgacc tggtgtctga gtaccagcag
      301 taccaggatg ccaccgctga agaggaagag gatttcggag aggaggcaga agaggaggcc
      361 taacggcaga gagccctgca tcagctcagg ctgcttagac tccctcagcc tttctccaac
      421 tgccctttgt cctccagttt ctttctgctg cctctgtctt gtatttgttt tgcttctgtt
      481 ttctcattgg gggtaaatgg tgcctggcac atggcaggca ctcaataaat atttgtttgt
      541 gg
//
LOCUS       XELPAL        353 bp ss-mRNA            VRT       15-AUG-1990
DEFINITION  X.laevis parvalbumin mRNA, 3' end.
            protein.
ACCESSION   M28644
KEYWORDS    parvalbumin.
SOURCE      X.laevis tadpole, cDNA to mRNA, clone lambda-PV1.
  ORGANISM  Xenopus laevis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 353)
  AUTHORS   Kay,B.K., Shah,A.J. and Halstead,W.E.
  TITLE     Expression of the Ca2+ -binding protein, parvalbumin, during
            embryonic development of the frog, Xenopus laevis
  JOURNAL   J. Cell Biol. 104, 841-847 (1987)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    pept    <     1      339     parvalbumin (AA at 1)
BASE COUNT       92 a     86 c     90 g     85 t
ORIGIN      
        1 agatttacta tggcattcgg tggtatcctg agtgaggctg acatctctgc tgccctgcag
       61 aactgccaag ctgctgactc cttcaacttc aaaactttct ttgcccagtc tggtctgagc
      121 agcaagtccg cagatgatgt gaaaaacgtc tttgccatcc tcgaccagga caggagcggc
      181 ttcattgagg aagaggaact gaagttgttc ctccagaact tcagcgcaag tgccagggca
      241 ctgactgatg ctgaaaccaa ggccttcctg gcagctggtg actctgatgg tgatggcaaa
      301 attggagttg aagaattcca gtccctagtc aaaccttgaa gaagtaagac caa
//
LOCUS       RATMLVI4      100 bp ss-mRNA            ROD       15-AUG-1990
DEFINITION  Rat Moloney murine leukemia provirus Mlvi-4 mRNA, partial sequence.
ACCESSION   M36432
KEYWORDS    provirus.
SOURCE      Rat Moloney murine leukemia virus-induced T-cell lymphoma, cDNA to
            mRNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Tsichlis,P.N., Lee,J.S., Bear,S.E., Lazo,P.A., Patriotis,C.,
            Gustafson,E., Shinton,S., Jenkins,N.A., Copeland,N.G., Huebner,K.,
            Croce,C., Levan,G. and Hanson,C.
  TITLE     Activation of multiple genes by provirus integration in the Mlvi-4
            locus in T-cell lymphomas induced by Moloney murine leukemia virus
  JOURNAL   J. Virol. 64, 2236-2244 (1990)
  STANDARD  simple staff_entry
FEATURES       from  to/span     description
    mRNA    <     1  >   100     Mlvi-4 mRNA
    recomb       73       74     Rat DNA end/provirus DNA start
BASE COUNT       24 a     29 c     19 g     28 t
ORIGIN      
        1 ttactggaag ccctcctcat catgggattt catcacagta aacaacaatc tcacctctga
       61 ccaggctgtc caggattctc ctcatggttt gtcgaaggtc
//

GenBank-Updates@genbank.bio.net (08/16/90)

LOCUS       THARGA        635 bp ds-DNA             BCT       11-AUG-1990
DEFINITION  T.acidophilum 5S rRNA gene.
ACCESSION   M32297
KEYWORDS    5S ribosomal RNA.
SOURCE      T.acidophilum (strain 122-1B2) DNA, clone pT5PH-23.
  ORGANISM  Thermoplasma acidophilum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Thermoplasmales.
REFERENCE   1  (bases 1 to 635)
  AUTHORS   Ree,H.K. and Zimmerman,R.A.
  TITLE     Organization and expression of the 16S, 23S and 5S ribosomal RNA
            genes from the archaebacterium Thermoplasma acidophilum
  JOURNAL   Nucleic Acids Res. 18, 4471-4478 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.K.Ree 23-FEB-1990.
FEATURES       from  to/span     description
    rRNA        213      335     5S rRNA
    RNA         211      346     5S RNA ( 3' end +/- 4 bp)
    site        184      191     put. promoter
    site        333      354     put. terminator
BASE COUNT      175 a    140 c    140 g    180 t
ORIGIN      
        1 ctgcagcctt atcatgggaa aatgggcatc tgctggagac gactatgtac ttcatgcatc
       61 catattatct tatgatatta aaatgctaaa gcgcatgatc gcttcggaag gaatatcgac
      121 agggcgagat tcgctgtctc ataatcgccc ggatctgcgc tgaagccgta aaatcacgaa
      181 aatcttatat agatgtgttc tatatagtgt tcggcaacgg tcatagcagc agggaaacac
      241 cagatcccat tccgaactcg acggttaagc ctgctgcgta ttgcgttgta ctgtatgccg
      301 cgagggtacg ggaagcgcaa tatgctgtta ccacttttga aatgaaagtt ttttacatct
      361 atctttaatt cagtggcaat gattaatcct tggtcatcat ctgatttctt cgactatgag
      421 aggctaaaga gaggaattcg gcataggcga tatgaacgtt cccttcgatc attttctctt
      481 caggacttat acttggccag cgatccatag actatatcga ctacgcttta aagcatcaca
      541 tgaaattcaa cgttatgact ggcctcatgc cttcaggaga gatgcatctt ggcaacaaga
      601 gcgccataga ccaggtcatc ttcttccaga agctt
//
LOCUS       THARGG       4154 bp ds-DNA             BCT       11-AUG-1990
DEFINITION  T.acidophilum 23S ribosomal RNA gene.
ACCESSION   M32298
KEYWORDS    23S ribosomal RNA.
SOURCE      T.acidophilum (strain 122-1B2) DNA, clones pTH1-1, pL8 and pTH3-7.
  ORGANISM  Thermoplasma acidophilum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; 
            Thermoplasmales.
REFERENCE   1  (bases 1 to 4154)
  AUTHORS   Ree,H.K. and Zimmermann,R.A.
  TITLE     Organization and expression of the 16S, 23S and 5S ribosomal RNA
            genes from the archaebacterium Thermoplasma acidophilum
  JOURNAL   Nucleic Acids Res. 18, 4471-4478 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by H.K.Ree 23-FEB-1990.
FEATURES       from  to/span     description
    rRNA        521     3426     23S RNA
    RNA         429     3473     23S RNA gene (3' end +/- 5 bp)
BASE COUNT     1097 a    903 c   1177 g    977 t
ORIGIN      
        1 tggtgttaag gtggatgcac tatatctatg agagctaccg tgagatggcc ataagggaga
       61 tacgcgatat gatagagtag ctgcataagg caattcggaa taacagacgc ggcggttgtt
      121 catcgcatag gtagagtaaa tctcacagag gactccgttg ttatagcggt atctgcgcct
      181 catagaggac aggcgtttga cgcatgcaga tacataatcg accagataaa ggagagagcg
      241 cccatatgga aacgtgacat cgtgcaggga tctcctgcga gctggcactg aacccatttt
      301 ttacggtatc agcaagaaaa agttaattaa aatatatttc aaattctgaa aagtgagtat
      361 tataaaatta ttggataatt gggcttccgg atcaaaatgc ttatatccct cttaatgata
      421 tagtccatac acgcttacaa tgcacctttt aatgcatcac attaaaaggg aagatgtcga
      481 attttgatga gactagatga gagatggcac aatggattgt gctctgctaa tctgcctaga
      541 ggatggcttg gttcgggcgc cgaagaagga cgtgccaagc tgcgataagc cttggggagg
      601 cgcatggagc cttagatcca aggatctccg aatgggactt cctgccgtaa ggcactccga
      661 aaggagaggg aacccgggga attgaaacat cttagtaccc ggaggaaaag aaatcaattg
      721 agataccgtt agtaaaggcg atcgaaagcg gtagaaggca aaccgaatag cccttcgaaa
      781 gaagggaaag atgtggagtt tggtcttcct ctaatgcctc ctgaagcgag atgaatcttc
      841 tggaaagaag agccttagaa ggtgatagcc ctgtaatcga agcttcagaa gctacaaggg
      901 gaagtaacca gagtaccatg cgtcgttttt cgcgtgggaa tttgggtggc actaacatcc
      961 aaccttaaat acgtcccgag tccgatagcg aacaaagtac cgtgagggaa agctgaaaag
     1021 aaacccggaa gggtggtgaa aagagcctga aactaggcag agataaactt atagggcagt
     1081 taagaggtga agtcgttaac tacgatggat cgctgtccta ttgtccgtgt tgaagaacgg
     1141 gccagggagt tctgacgagt ggcaaggtta atcctgaaag gagtagccgt agcgaaagca
     1201 actacccgca cagcaatggg ggggtggcgt ggtaaacgcg tttagtcact cgtgagagac
     1261 ccgaagccgg tcgatctaca cctgagtagg ttgaagctca gtgaaagctg ggtggaggac
     1321 cgaacctatt ctgatgtgca aatcgtttgg atgacttggg tgtaggggtt aaaggccaat
     1381 ctaggccggc aatagcgggt tccccccgat actacccgca ggtagacctc gatggagatt
     1441 ctcggcgagg tagagcgacc gattggttgg taagcagtcg aaaggctgcg ccgacttgtc
     1501 aaactccgaa cttgtcgaga tcgtagaagt cgggtgctag ggggcaggga taagctttgc
     1561 ttccgtgatg ggaacaacca agacgagggt taaggtccct aagttctagt taagtgcact
     1621 aaataggttt gtggccaaag acagtgggga ggtaggctca gaagcagcca tccttcaaag
     1681 agtgcgtaac agctcactca ccgaggtcac atgcctagaa gatggaaggg gctaaaacta
     1741 gacaccgaga ccttcgagca ccgaaaggtg atctggtagg ggggcgtgcc atgtggatag
     1801 aagtctcccc gagaggaggg atggaccgca tggtatcgcg gatcctggtg aaagtagcag
     1861 agaagaaccg tgagaatcgg ttccgccgaa agggctaggg ttccttggca atgttcgtca
     1921 gccgagggtt agtcgatcct aaggccatac ctaacaggat atggtcgaag gggaagccgg
     1981 ttaatattcc ggcacactga acgttttgcc ctgtatgaga aggttcaggg taggggcggt
     2041 acgggtgcca acgtatttat gctcataagc ggatggagag tcgtaatgac gagaagttcg
     2101 cgaaagagcg tatgttcccg tttgggaatc gcctcgatcc ccggatccca tgaaaatcat
     2161 gcaggggtca ggttcagtat cgtaccaaga accgacactg gtgcccctag gtgagaagcc
     2221 taaggcgttt tgggataatg gacgcgaggg aaatcggcaa aatagctccg tatcttcggt
     2281 ataaggagtg cctattccgt gagggaatag gtcgcagtga cgaagggact ccgactgttt
     2341 accacaaaca cagatcgctg ctagtccgta aggatgtgta tagcggttga aacctgccca
     2401 gtgctggtac ctgaaagccc cgtacaaggg gaagaagggc cagtaaacgg cgggggtaac
     2461 tatgaccctc ttaaggtagc gtaataccta gccgcttaat tggcggcttg catgaaggtt
     2521 caacgtgggt cccactgtcc ccgcgttcag cccagtgaaa ttgatgtact ggtgcacaat
     2581 ccagtctctc ccacgtgaaa gcgaagtccc cgtggagctt tactgcagcc tgtagctgtg
     2641 gtgcgatcct gactgcgtag tgtaggaagg agccgtcgaa gctctggttt cggccggagt
     2701 ggaggcgcca atgaaacact tccctctcgg gattgcgtca ctaacctctt cggagggaca
     2761 actattggtg ggcagtttgg gtggggcgcc acgcccctaa caacgtaaca ggggccccca
     2821 aaggtcagct caggagggtc agaaatcctc cgtagagtgt aaaagcaaaa gctggcttga
     2881 ctgtgttgca gacaactagc aacgcagatg cgaaagcagg gtttagcgaa ccacccagtt
     2941 cctccttagt gggggcgggt gataagagag aagttacccc agggataact gagtcgtcct
     3001 cggcaagagt acacatcgac ccgagggttt gctacttcga tgtcgtctgt tcctatcctg
     3061 gtgctgcata aggtgccaag ggtggggctg ttcgcccatt aaaagggatc ctgagatggg
     3121 ttcactacgt cgcgagacag tagggttgct tctccgtggg agtgctcgat gtctgagggg
     3181 aaggggcctt tagtacgaga ggaacaaggg ctcgtgacct ctagtttacc ggttgtctgg
     3241 caaggcatcg ccgggtagcc acgtcatacg cggataagag ctgaaagcat ctaagctcga
     3301 agccgccccc gaaaatagac atcgtcatca gatcgcctct agaagagagg tttgatagag
     3361 ccgggatgta aggatcgagc ttcggcgaga ttttaagtcc acggctacta aagatcgaag
     3421 gcacaatcca tgctaaattc tcgtctagtt gaagtcataa gcgtgtttga tttttgtttt
     3481 ttattttttt gccgttgtag tgtgaactat gatgaaaaat tatattgata taaaagagaa
     3541 atagaaaaaa tgatcactgc tttatatttt ctatctgagc ctctttcagt atctgatcgg
     3601 ttatgctgtc gaaaatctca cgtatcttat catttataac gacggcaggt attccgttgt
     3661 caccgttttc tgctatttct ggtattagag gtatttttcc caggaacggg aggttgtatt
     3721 cctttgcgga tttctccccg ccgccgttct tgaatatgta tgtgagctta ccgcagtgcg
     3781 cacacaaaac cgctcatgtt ctctattatg ccaaggacag ggagcttcag ctgttttgca
     3841 aagtttattg ccttctttgc atcgagaagg gcgacatcct gcggcgttac gactatgaca
     3901 acgccatttg attctggtat cagctgggcc acgctcagtg caacgtctcc ggtgccaggc
     3961 ggcatgtcaa gcacaacaac atccgtatcc ttccagttta cgtcttccag aaactgctgt
     4021 attgccttgt gcatcagcga accacgccat attatcggcg tatcctctgt cggtatgagg
     4081 aaacccatgg acaccacgcg aacaccgtat cttgtctctg ccggtattat tccctcctcg
     4141 tcagcgtaaa gctt
//
LOCUS       HIVANT70      583 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral LTR region.
ACCESSION   M31171
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 1 (isolate ANT70) proviral DNA.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 583)
  AUTHORS   De Leys,R., Vanderborght,B., Haesevelde,M.V., Heyndrickx,L.,
            van Geel,A., Wauters,C., Bernaerts,R., Saman,E., Nijs,P.,
            Willems,B., Taelman,H., van der Groen,G., Piot,P., Tersmette,T.,
            Huisman,J.G. and van Heuverswyn,H.
  TITLE     Isolation and partial characterization of an unusual human
            immunodeficiency retrovirus from two persons of west-central
            African origin
  JOURNAL   J. Virol. 64, 1207-16 (1990)
  STANDARD  full staff_review
COMMENT     Kindly provided prior to publication in computer-readable form by
            R. De Leys, Innogenetics, Antwerp.
BASE COUNT      162 a    133 c    153 g    135 t
ORIGIN      5'-terminus of 5'LTR.
        1 tggaagggtt aatttactcc cataaaagag cagaaatcct ggatctttgg gtgtataaca
       61 ctcagggatt cttccctgat tggcagaact acacaccagg accaggaacc aggttcccac
      121 tgacatttgg atggttgttt aaactagtac cagtgtcaga agaagaggca gaaagactag
      181 gaaatacatg tgagagggct aatctcctgc atccagcatg tgcccatgga tttgaagata
      241 cacataaaga aatactgatg tggaagtttg atagatctct aggcaacacc catgttgcta
      301 tgataactca cccagagctc ttccagaagg actaaaaact gctgacctga agattgctga
      361 cactgtggaa ctttccagca aagactgctg acactgcggg gactttccag tgggagggac
      421 agggggcggt tcggggagtg gctaaccctc agaagctgca tataagcagc cgctttctgc
      481 ttgtaccggg tctcggttag aggaccaggt ctgagcccgg gagctccctg gcctctagct
      541 gaacccgctc gttaacgctc aataaagctt gccttgagtg aga
//
LOCUS       HIVBAN1        59 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral gag region.
ACCESSION   M27953
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Human immunodeficiency virus type 1 (isolate BAN) proviral DNA.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 59)
  AUTHORS   Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
            Wain-Hobson,S.
  TITLE     HIV-1 isolates are rapidly evolving quasispecies: Evidence for
            viral mixtures and preferred nucleotide substitutions
  JOURNAL   j acquired immuno defic syndr 2, 344-52 (1989)
  STANDARD  full staff_review
COMMENT     [1] kindly submitted in computer readable form by M. Goodenow.
            Sibling sequences are available (see 1989: I-A-172).
BASE COUNT       26 a      7 c     10 g     16 t
ORIGIN      
        1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacta
//
LOCUS       HIVBAN2       250 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral gag region.
ACCESSION   M27944
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Human immunodeficiency virus type 1 (isolate BAN) proviral DNA.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 250)
  AUTHORS   Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
            Wain-Hobson,S.
  TITLE     HIV-1 isolates are rapidly evolving quasispecies: Evidence for
            viral mixtures and preferred nucleotide substitutions
  JOURNAL   j acquired immuno defic syndr 2, 344-52 (1989)
  STANDARD  full staff_review
COMMENT     Sibling sequences are available (see 1989: I-A-173).
BASE COUNT       96 a     40 c     42 g     72 t
ORIGIN      
        1 aaattaaccc cactctgtgt tactttaaat tgcactaagt tgaagaatgt tacttatgcc
       61 aataatagta gtagggaaaa agaagaaatg aaaaactgct ctttcaccac aagaataggg
      121 aataaggtgc agaaagaata tgcacttttt tataaacttg atgtagtacc aatagataag
      181 gataatacaa gctttatatt gatacattgt aatacctcga ccattacaca ggcttgtcca
      241 aaggtatcct
//
LOCUS       HIVBRU       9229 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral complete
            genome.
ACCESSION   K02013
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 1 (HIV-1), isolate BRU proviral
            DNA clone lambda-J19.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9229)
  AUTHORS   Wain-Hobson,S., Sonigo,P., Danos,O., Cole,S. and Alizon,M.
  TITLE     Nucleotide sequence of the AIDS Virus, LAV
  JOURNAL   Cell 40, 9-17 (1985)
  STANDARD  full staff_review
REFERENCE   2  (bases 1713 to 1748; revision of [1])
  AUTHORS   Alizon,M., Wain-Hobson,S., Montagnier,L. and Sonigo,P.
  TITLE     Genetic variability of the AIDS virus: Nucleotide sequence analysis
            of two isolates from African patients
  JOURNAL   Cell 46, 63-74 (1986)
  STANDARD  full staff_review
COMMENT     The original LAV, sometimes called LAV-1 to distinguish it from
            HIV2 (LAV-2), is now referred to as HIV-1bru. An infectious clone
            of this virus has been constructed by Keith Peden, Molecular
            Biology and Genetics, Johns Hopkins University School of Medicine,
            Baltimore, MD 21205 (301) 955-3652. HIVNL43 is also an infectious
            clone having for its 3' half a clone of the BRU isolate. The tat
            sequence for the L20 clone is available (see 1989: I-A-54).
FEATURES       from  to/span     description
    pept        336     1874     gag polyprotein
    pept    <  1631     4678     pol polyprotein (NH2-terminus uncertain; AA at
                                 1631)
    pept       4623     5201     vif protein
    pept       5141     5431     vpr protein
    pept       5412     5626     tat protein, exon 2 (first expressed exon)
               7972     8017     tat protein, exon 3 (AA at 7973)
    pept       5551     5626     rev protein, exon 2 (first expressed exon)
               7972     8246     rev protein, exon 3 (AA at 7974)
    pept       5643     5888     vpu protein
    pept       5803     8388     envelope polyprotein
    pept       8390     9010     nef protein
    IVS         290     5358     tat, rev, nef subgenomic mRNA intron 1
    IVS        5627     7971     tat cds intron 2
    IVS        5627     7971     rev cds intron 2
    LTR     <     1      180     5' LTR
    LTR        8679  >  9229     3' LTR
    rpt     <     1       97     R repeat 5' copy
    rpt        9133     9229     R repeat 3' copy
    binding     182      199     primer (Lys-tRNA) binding site
    revision   1712     1749     atttcttcagagcagaccagagccaacagccccaccag in [2];
                                 ag in [1]
    signal     9205     9210     poly-A signal
BASE COUNT     3289 a   1656 c   2232 g   2052 t
ORIGIN      Cap site of genomic RNA.
        1 ggtctctctg gttagaccag atttgagcct gggagctctc tggctaacta gggaacccac
       61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt
      121 gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca
      181 gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag
      241 gactcggctt gctgaagcgc gcacggcaag aggcgagggg aggcgactgg tgagtacgcc
      301 aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa
      361 gcgggggaga attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat
      421 ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg
      481 gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc
      541 agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc
      601 atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa
      661 acaaaagtaa gaaaaaagca cagcaagcag cagctgacac aggacacagc agccaggtca
      721 gccaaaatta ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac
      781 ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga
      841 tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa
      901 acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag
      961 ctgcagaatg ggatagagtg catccagtgc atgcagggcc tattgcacca ggccagatga
     1021 gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat
     1081 ggatgacaaa taatccacct atcccagtag gagaaattta taaaagatgg ataatcctgg
     1141 gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac
     1201 caaaagaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag
     1261 cttcacagga ggtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcgaacccag
     1321 attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag
     1381 catgtcaggg agtgggagga cccggccata aggcaagagt tttggctgaa gcaatgagcc
     1441 aagtaacaaa ttcagctacc ataatgatgc aaagaggcaa ttttaggaac caaagaaaga
     1501 ttgttaagtg tttcaattgt ggcaaagaag ggcacatagc cagaaattgc agggccccta
     1561 ggaaaaaggg ctgttggaaa tgtggaaagg aaggacacca aatgaaagat tgtactgaga
     1621 gacaggctaa ttttttaggg aagatctggc cttcctacaa gggaaggcca gggaattttc
     1681 ttcagagcag accagagcca acagccccac catttcttca gagcagacca gagccaacag
     1741 ccccaccaga agagagcttc aggtctgggg tagagacaac aactccctct cagaagcagg
     1801 agccgataga caaggaactg tatcctttaa cttccctcag atcactcttt ggcaacgacc
     1861 cctcgtcaca ataaagatag gggggcaact aaaggaagct ctattagata caggagcaga
     1921 tgatacagta ttagaagaaa tgagtttgcc aggaagatgg aaaccaaaaa tgataggggg
     1981 aattggaggt tttatcaaag taagacagta tgatcagata ctcatagaaa tctgtggaca
     2041 taaagctata ggtacagtat tagtaggacc tacacctgtc aacataattg gaagaaatct
     2101 gttgactcag attggttgca ctttaaattt tcccattagt cctattgaaa ctgtaccagt
     2161 aaaattaaag ccaggaatgg atggcccaaa agttaaacaa tggccattga cagaagaaaa
     2221 aataaaagca ttagtagaaa tttgtacaga aatggaaaag gaagggaaaa tttcaaaaat
     2281 tgggcctgaa aatccataca atactccagt atttgccata aagaaaaaag acagtactaa
     2341 atggagaaaa ttagtagatt tcagagaact taataagaga actcaagact tctgggaagt
     2401 tcaattagga ataccacatc ccgcagggtt aaaaaagaaa aaatcagtaa cagtactgga
     2461 tgtgggtgat gcatattttt cagttccctt agatgaagac ttcaggaagt atactgcatt
     2521 taccatacct agtataaaca atgagacacc agggattaga tatcagtaca atgtgcttcc
     2581 acagggatgg aaaggatcac cagcaatatt ccaaagtagc atgacaaaaa tcttagagcc
     2641 ttttagaaaa caaaatccag acatagttat ctatcaatac atggatgatt tgtatgtagg
     2701 atctgactta gaaatagggc agcatagaac aaaaatagag gagctgagac aacatctgtt
     2761 gaggtgggga cttaccacac cagacaaaaa acatcagaaa gaacctccat tcctttggat
     2821 gggttatgaa ctccatcctg ataaatggac agtacagcct atagtgctgc cagaaaaaga
     2881 cagctggact gtcaatgaca tacagaagtt agtgggaaaa ttgaattggg caagtcagat
     2941 ttacccaggg attaaagtaa ggcaattatg taaactcctt agaggaacca aagcactaac
     3001 agaagtaata ccactaacag aagaagcaga gctagaactg gcagaaaaca gagagattct
     3061 aaaagaacca gtacatggag tgtattatga cccatcaaaa gacttaatag cagaaataca
     3121 gaagcagggg caaggccaat ggacatatca aatttatcaa gagccattta aaaatctgaa
     3181 aacaggaaaa tatgcaagaa cgaggggtgc ccacactaat gatgtaaaac aattaacaga
     3241 ggcagtgcaa aaaataacca cagaaagcat agtaatatgg ggaaagactc ctaaatttaa
     3301 actacccata caaaaggaaa catgggaaac atggtggaca gagtattggc aagccacctg
     3361 gattcctgag tgggagtttg tcaatacccc tcctttagtg aaattatggt accagttaga
     3421 gaaagaaccc atagtaggag cagaaacgtt ctatgtagat ggggcagcta gcagggagac
     3481 taaattagga aaagcaggat atgttactaa tagaggaaga caaaaagttg tcaccctaac
     3541 tgacacaaca aatcagaaga ctgagttaca agcaattcat ctagctttgc aggattcggg
     3601 attagaagta aatatagtaa cagactcaca atatgcatta ggaatcattc aagcacaacc
     3661 agataaaagt gaatcagagt tagtcaatca aataatagag cagttaataa aaaaggaaaa
     3721 ggtctatctg gcatgggtac cagcacacaa aggaattgga ggaaatgaac aagtagataa
     3781 attagtcagt gctggaatca ggaaagtact atttttagat ggaatagata aggcccaaga
     3841 tgaacatgag aaatatcaca gtaattggag agcaatggct agtgatttta acctgccacc
     3901 tgtagtagca aaagaaatag tagccagctg tgataaatgt cagctaaaag gagaagccat
     3961 gcatggacaa gtagactgta gtccaggaat atggcaacta gattgtacac atttagaagg
     4021 aaaagttatc ctggtagcag ttcatgtagc cagtggatat atagaagcag aagttattcc
     4081 agcagaaaca gggcaggaaa cagcatactt tcttttaaaa ttagcaggaa gatggccagt
     4141 aaaaacaata catacagaca atggcagcaa tttcaccagt actacggtta aggccgcctg
     4201 ttggtgggcg ggaatcaagc aggaatttgg aattccctac aatccccaaa gtcaaggagt
     4261 agtagaatct atgaataaag aattaaagaa aattataggc caggtaagag atcaggctga
     4321 acatcttaag acagcagtac aaatggcagt attcatccac aattttaaaa gaaaaggggg
     4381 gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag acatacaaac
     4441 taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt acagggacag
     4501 cagagatcca ctttggaaag gaccagcaaa gctcctctgg aaaggtgaag gggcagtagt
     4561 aatacaagat aatagtgaca taaaagtagt gccaagaaga aaagcaaaga tcattaggga
     4621 ttatggaaaa cagatggcag gtgatgattg tgtggcaagt agacaggatg aggattagaa
     4681 catggaaaag tttagtaaaa caccatatgt atgtttcagg gaaagctagg ggatggtttt
     4741 atagacatca ctatgaaagc cctcatccaa gaataagttc agaagtacac atcccactag
     4801 gggatgctag attggtaata acaacatatt ggggtctgca tacaggagaa agagactggc
     4861 atctgggtca gggagtctcc atagaatgga ggaaaaagag atatagcaca caagtagacc
     4921 ctgaactagc agaccaacta attcatctgt attactttga ctgtttttca gactctgcta
     4981 taagaaaggc cttattagga catatagtta gccctaggtg tgaatatcaa gcaggacata
     5041 acaaggtagg atctctacaa tacttggcac tagcagcatt aataacacca aaaaagataa
     5101 agccaccttt gcctagtgtt acgaaactga cagaggatag atggaacaag ccccagaaga
     5161 ccaagggcca cagagggagc cacacaatga atggacacta gagcttttag aggagcttaa
     5221 gaatgaagct gttagacatt ttcctaggat ttggctccat ggcttagggc aacatatcta
     5281 tgaaacttat ggggatactt gggcaggagt ggaagccata ataagaattc tgcaacaact
     5341 gctgtttatc catttcagaa ttgggtgtcg acatagcaga ataggcgtta ctcaacagag
     5401 gagagcaaga aatggagcca gtagatccta gactagagcc ctggaagcat ccaggaagtc
     5461 agcctaaaac tgcttgtacc acttgctatt gtaaaaagtg ttgctttcat tgccaagttt
     5521 gtttcacaac aaaagcctta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa
     5581 gacctcctca aggcagtcag actcatcaag tttctctatc aaagcagtaa gtagtacatg
     5641 taatgcaacc tatacaaata gcaatagcag cattagtagt agcaataata atagcaatag
     5701 ttgtgtggtc catagtaatc atagaatata ggaaaatatt aagacaaaga aaaatagaca
     5761 ggttaattga tagactaata gaaagagcag aagacagtgg caatgagagt gaaggagaaa
     5821 tatcagcact tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg
     5881 atctgtagtg ctacagaaaa attgtgggtc acagtctatt atggggtacc tgtgtggaag
     5941 gaagcaacca ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat
     6001 aatgtttggg ccacacatgc ctgtgtaccc acagacccca acccacaaga agtagtattg
     6061 gtaaatgtga cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcatgag
     6121 gatataatca gtttatggga tcaaagccta aagccatgtg taaaattaac cccactctgt
     6181 gttagtttaa agtgcactga tttggggaat gctactaata ccaatagtag taataccaat
     6241 agtagtagcg gggaaatgat gatggagaaa ggagagataa aaaactgctc tttcaatatc
     6301 agcacaagca taagaggtaa ggtgcagaaa gaatatgcat ttttttataa acttgatata
     6361 ataccaatag ataatgatac taccagctat acgttgacaa gttgtaacac ctcagtcatt
     6421 acacaggcct gtccaaaggt atcctttgag ccaattccca tacattattg tgccccggct
     6481 ggttttgcga ttctaaaatg taataataag acgttcaatg gaacaggacc atgtacaaat
     6541 gtcagcacag tacaatgtac acatggaatt aggccagtag tatcaactca actgctgttg
     6601 aatggcagtc tagcagaaga agaggtagta attagatctg ccaatttcac agacaatgct
     6661 aaaaccataa tagtacagct gaaccaatct gtagaaatta attgtacaag acccaacaac
     6721 aatacaagaa aaagtatccg tatccagagg ggaccaggga gagcatttgt tacaatagga
     6781 aaaataggaa atatgagaca agcacattgt aacattagta gagcaaaatg gaatgccact
     6841 ttaaaacaga tagctagcaa attaagagaa caatttggaa ataataaaac aataatcttt
     6901 aagcaatcct caggagggga cccagaaatt gtaacgcaca gttttaattg tggaggggaa
     6961 tttttctact gtaattcaac acaactgttt aatagtactt ggtttaatag tacttggagt
     7021 actgaagggt caaataacac tgaaggaagt gacacaatca cactcccatg cagaataaaa
     7081 caatttataa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagcgga
     7141 caaattagat gttcatcaaa tattacaggg ctgctattaa caagagatgg tggtaataac
     7201 aacaatgggt ccgagatctt cagacctgga ggaggagata tgagggacaa ttggagaagt
     7261 gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca
     7321 aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg
     7381 ttcttgggag cagcaggaag cactatgggc gcacggtcaa tgacgctgac ggtacaggcc
     7441 agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg
     7501 caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg
     7561 gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa
     7621 ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag
     7681 atttggaata acatgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta
     7741 atacattcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg
     7801 gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat
     7861 ataaaaatat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta
     7921 ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc
     7981 ccaaccccga ggggacccga caggcccgaa ggaatagaag aagaaggtgg agagagagac
     8041 agagacagat ccattcgatt agtgaacgga tccttagcac ttatctggga cgatctgcgg
     8101 agcctgtgcc tcttcagcta ccaccgcttg agagacttac tcttgattgt aacgaggatt
     8161 gtggaacttc tgggacgcag ggggtgggaa gccctcaaat attggtggaa tctcctacag
     8221 tattggagtc aggaactaaa gaatagtgct gttagcttgc tcaatgccac agccatagca
     8281 gtagctgagg ggacagatag ggttatagaa gtagtacaag gagcttgtag agctattcgc
     8341 cacataccta gaagaataag acagggcttg gaaaggattt tgctataaga tgggtggcaa
     8401 gtggtcaaaa agtagtgtgg ttggatggcc tactgtaagg gaaagaatga gacgagctga
     8461 gccagcagca gatggggtgg gagcagcatc tcgagacctg gaaaaacatg gagcaatcac
     8521 aagtagcaat acagcagcta ccaatgctgc ttgtgcctgg ctagaagcac aagaggagga
     8581 ggaggtgggt tttccagtca cacctcaggt acctttaaga ccaatgactt acaaggcagc
     8641 tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca
     8701 acgaagacaa gatatccttg atctgtggat ctaccacaca caaggctact tccctgattg
     8761 gcagaactac acaccagggc caggggtcag atatccactg acctttggat ggtgctacaa
     8821 gctagtacca gttgagccag ataaggtaga agaggccaat aaaggagaga acaccagctt
     8881 gttacaccct gtgagcctgc atggaatgga tgaccctgag agagaagtgt tagagtggag
     8941 gtttgacagc cgcctagcat ttcatcacgt ggcccgagag ctgcatccgg agtacttcaa
     9001 gaactgctga catcgagctt gctacaaggg actttccgct ggggactttc cagggaggcg
     9061 tggcctgggc gggactgggg agtggcgagc cctcagatgc tgcatataag cagctgcttt
     9121 ttgcctgtac tgggtctctc tggttagacc agatttgagc ctgggagctc tctggctaac
     9181 tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttca
//
LOCUS       HIVELI       9176 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral complete
            genome.
ACCESSION   K03454
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 1, isolate ELI, proviral DNA,
            clone E-H12.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9176)
  AUTHORS   Alizon,M., Wain-Hobson,S., Montagnier,L. and Sonigo,P.
  TITLE     Genetic variability of the AIDS virus: Nucleotide sequence analysis
            of two isolates from African patients
  JOURNAL   Cell 46, 63-74 (1986)
  STANDARD  full staff_review
COMMENT     Sibling sequences of the ELI isolate are available (see 1989:
            I-179)
FEATURES       from  to/span     description
    pept        336     1838     gag polyprotein
    pept    <  1634     4642     pol polyprotein (NH2-terminus uncertain; AA at
                                 1634)
    pept       4587     5165     vif protein
    pept       5105     5395     vpr protein
    pept       5376     5590     tat protein, exon 2 (first expressed exon)
               7915     7999     tat protein, exon 3 (AA at 7916)
    pept       5515     5590     rev protein, exon 2 (first expressed exon)
               7915     8195     rev protein, exon 3 (AA at 7917)
    pept       5607     5852     vpu protein
    pept       5770     8331     envelope polyprotein precursor
    pept       8333     8953     nef protein
    IVS        5591     7914     tat cds intron 2
    IVS        5591     7914     rev cds intron 2
    LTR     <     1      180     5' LTR
    LTR        8625  >  9176     3' LTR
    rpt     <     1       97     R repeat 5' copy
    rpt        9078     9175     R repeat 3' copy
    binding     182      199     primer (Lys-tRNA) binding site
    signal     9151     9156     mRNA polyadenylation signal
BASE COUNT     3333 a   1632 c   2179 g   2032 t
ORIGIN      Cap site of genomic RNA.
        1 ggtctctctg gttagaccag atttgagcct gggagctctc tggctagcta gggaacccac
       61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt
      121 gtgactctgg taactagaga tccctcagac ccctttagtc agagtggaaa atctctagca
      181 gtggcgcccg aacagggacc tgaaagcgaa agtagaacca gaggagctct ctcgacgcag
      241 gactcggctt gctgaagcgc gcacggcaag aggcgagggg cagcgactgg tgagtacgct
      301 aaaatttttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa
      361 gcgggggaaa attagataaa tgggaaaaaa ttcggttacg gccaggagga aagaaaaaat
      421 atagactaaa acatatagta tgggcaagca gggagctaga acgatatgca cttaatcctg
      481 gccttttaga aacatcagaa ggctgtaaac aaataatagg gcagctacaa ccagctattc
      541 agacaggaac agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtac
      601 ataaaggaat agatgtaaaa gacaccaagg aagctttaga aaagatggag gaagagcaaa
      661 acaaaagtaa gaaaaaggca cagcaagcag cagctgacac aggaaacaac agccaggtca
      721 gccaaaatta tcctatagtg cagaacctac aggggcaaat ggtacatcag gccatatcac
      781 ctagaacttt gaacgcatgg gtaaaagtaa tagaagaaaa ggctttcagc ccagaagtaa
      841 tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa
      901 acacagtggg gggacatcaa gcagccatgc aaatgctaaa agagaccatc aatgaagaag
      961 ctgcagaatg ggataggtta catccagtgc atgcagggcc tattgcacca ggccagatga
     1021 gagaaccaag gggaagtgat atagcaggaa ctactagtac ccttcaggaa caaatagcat
     1081 ggatgacaag taacccacct atcccagtag gagaaatcta taaaagatgg ataattgtgg
     1141 gattaaataa aatagtaaga atgtatagcc ctgtcagcat tttggacata agacagggac
     1201 caaaggaacc ttttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag
     1261 cttcacagga tgtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcaaacccag
     1321 attgcaagac tatcttaaaa gcattgggac cacaggctac actagaagaa atgatgacag
     1381 catgtcaggg agtggggggg cccagccata aagcaagagt tctggctgag gcaatgagcc
     1441 aagcaacaaa ttcagttact acagcaatga tgcagagagg caattttaag ggcccaagaa
     1501 aaattattaa gtgtttcaat tgtggcaaag aagggcacat agcaaaaaat tgcagggccc
     1561 ctaggaaaaa gggctgttgg agatgtggaa aggaaggaca ccaactaaaa gattgcactg
     1621 agagacaggc taatttttta gggagaattt ggccttccca caagggaagg ccggggaact
     1681 ttctccaaag cagaccagag ccaacagccc caccagcaga gagcttcggg tttggggaag
     1741 agataacccc ctctcaaaaa caggagcaga aagacaagga actgtatcct ttaacttccc
     1801 tcaaatcact ctttggcaac gaccccttgt cgcaataaaa atagggggac agctaaagga
     1861 agctctatta gatacaggag cagatgatac agtattagaa gaaatgaatt tgccaggaaa
     1921 atggaaacca aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca
     1981 aatacccata gaaatctgtg gacagaaagc tataggtaca gtattagtag gacctacgcc
     2041 tgtcaacata atcggaagaa atttgttgac ccagattggc tgcactttaa attttccaat
     2101 tagtcctatt gaaactgtac cagtaaaatt aaagccagga atggatggcc caaaagttaa
     2161 acaatggcca ttgacagaag aaaaaataaa agcattaaca gaaatttgta cagatatgga
     2221 aaaggaagga aaaatttcaa gaattgggcc tgaaaatcca tacaatactc caatatttgc
     2281 cataaagaaa aaagacagta ccaagtggag aaaattagta gatttcagag aacttaataa
     2341 gagaactcaa gatttctggg aagttcaatt aggaataccg catcctgcag ggctgaaaaa
     2401 gaaaaaatca gtaacagtac tggatgtggg tgatgcatat ttttcagttc ccttagatga
     2461 agattttagg aaatataccg cctttaccat atctagtata aacaatgaga caccagggat
     2521 tagatatcag tacaatgtgc ttccacaggg atggaaagga tcaccggcaa tattccaaag
     2581 tagcatgaca aaaatcttag agccctttag aaaacaaaat ccagaaatgg ttatctatca
     2641 atacatggat gatttgtatg taggatctga cttagaaata gggcagcata ggacaaaaat
     2701 agagaaatta agagaacatc tattgaggtg gggatttacc agaccagata aaaaacatca
     2761 gaaagaaccc ccatttcttt ggatgggtta tgaactccat cctgataaat ggacagtaca
     2821 gtctataaaa ctgccagaaa aggagagctg gactgtcaat gatatacaga acttagtgga
     2881 gagattaaac tgggcaagcc agatttatcc aggaattaaa gtaagacaat tatgtaaact
     2941 ccttagggga accaaagcac taacagaagt aataccacta acagaagaag cagaattaga
     3001 actggcagaa aacagggaaa ttttaaaaga accagtacat ggagtgtatt atgacccatc
     3061 aaaagactta atagcagaaa tacagaaaca agggcacggc caatggacat accaaattta
     3121 tcaagaacca tttaaaaatc tgaaaacagg aaagtatgca agaatgaggg gtgcccacac
     3181 taatgatgta aagcaattag cagaggcagt gcaaagaata tccacagaaa gcatagtgat
     3241 atggggaagg actcctaaat ttagactacc catacaaaag gaaacatggg aaacatggtg
     3301 ggcagagtat tggcaagcca cttggattcc tgagtgggaa tttgtcaata cccctccttt
     3361 agtaaaatta tggtaccagt tagagaagga acccataata ggagcagaaa ctttctatgt
     3421 agatggggca gctaatagag agactaaatt aggaaaagca ggatatgtta ctgacagagg
     3481 aagacagaaa gttgtccctt tgactgacac gacaaatcag aagactgagt tacaagcaat
     3541 taatctagcc ttgcaggatt cgggattaga agtaaacata gtaacagatt cacaatatgc
     3601 attaggaatc attcaagcac aaccagataa gagtgaatca gagttagtca atcaaataat
     3661 agagcagtta ataaaaaagg aaaaggttta cctggcatgg gtaccagcac acaaaggaat
     3721 tggaggaaat gaacaagtag ataaattagt cagtcaagga atcaggaaag tactattttt
     3781 ggatggaata gataaggctc aagaagaaca tgagaaatat cacaacaatt ggagagcaat
     3841 ggctagtgat tttaacctac cacccgtggt agcaaaagaa atagtagcta gctgtgataa
     3901 atgtcagcta aaaggagaag ccatgcatgg acaagtagac tgtagtccag gaatatggca
     3961 attagattgt acacacttag aaggaaaagt tatcctggta gcagttcatg tagccagtgg
     4021 ctatatagaa gcagaagtta ttccagcaga aacagggcag gaaacagcat attttctttt
     4081 aaaattagca ggaagatggc cagtaaaagt agtacataca gacaatggca gcaatttcac
     4141 cagtgctgca gttaaggccg cctgttggtg ggcaggtatc aaacaggaat ttggaattcc
     4201 ctacaatccc caaagtcaag gagtagtaga atctatgaat aaagaattaa agaaaattat
     4261 aggacaggta agagatcaag ctgaacatct taagacagca gtacaaatgg cagtattcat
     4321 ccacaatttt aaaagaagaa gggggattgg gggatacagt gcaggggaaa gaataataga
     4381 cataatagca acagacatac aaactaaaga attacaaaaa caaattataa aaattcaaaa
     4441 ttttcgggtt tattacagag acagcagaga tccaatttgg aaaggaccag caaagctcct
     4501 ctggaaaggt gaaggggcag tagtaataca agacaagagt gacataaagg tagtaccaag
     4561 aagaaaagta aagattatta gggattatgg aaaacagatg gcaggtgatg attgtgtggc
     4621 aagtagacag gatgaggatt aaaacatgga aaagtttagt aaaacaccat atgtatgttt
     4681 caaagaaagc taacagatgg ttttatagac atcactatga aagcccccac ccaaaaataa
     4741 gttcagaagt acacatccca ctaggagaag ctagactggt aataaaaaca tattggggtc
     4801 tgcatacagg agaaagagaa tggcatctgg gtcagggagt ctccatagaa tggaggaaaa
     4861 ggagatatag cacacaagta gaccctggcc tggcagacca actaattcat atgtattatt
     4921 ttgattgttt ttcagaatct gctataagaa aagccatatt aggagatata gttagtccta
     4981 ggtgtgagta tcaagcagga cataacaagg taggatccct acagtatttg gcactaacag
     5041 cattaatagc accaaaacag ataaagccac ctttgcctag tgttaggaag ctaacagaag
     5101 atagatggaa caagccccag cagaccaggg gccacagagg gagccataca atgaatgggc
     5161 attagagctt ttagaggagc ttaagagtga agctgttaga cattttccta ggatatggct
     5221 ccatagctta ggacaacata tttatgaaac ttatggggat acctgggtag gagttgaagc
     5281 tataataaga atactgcaac aattactgtt tattcatttc agaattgggt gtcaacatag
     5341 cagaataggc attattcgac agagaagagc aagaaatgga tccagtagat cctaacctag
     5401 agccctggaa ccatccagga agtcagccta ggactccttg taacaagtgt cattgtaaaa
     5461 agtgttgcta tcattgccca gtttgcttct taaacaaagg cttaggcatc tcctatggca
     5521 ggaagaagcg gagacagcga cgaggacctc ctcaaggcgg tcaggctcat caagttccta
     5581 taccaaagca gtaagtagta catgtaatgc aacctttagg gataatagca atagcagcat
     5641 tagtagtagc aataatacta gcaatagttg tgtggaccat agtattcata gaatatagaa
     5701 ggataaaaaa gcaaaggaga atagactgtt tacttgatag aataacagaa agagcagaag
     5761 acagtggcaa tgagagcgag ggggatagag agaaattgtc aaaactggtg gaaatggggc
     5821 atcatgctcc ttgggatatt gatgacctgt agtgctgcag acaatctgtg ggtcacagtt
     5881 tattatgggg tgcctgtatg gaaggaagca accaccactc tattttgtgc atcagatgct
     5941 aaatcatatg aaacagaggc acataatatc tgggccacac atgcctgtgt acccacggac
     6001 cccaacccac aagaaatagc actggaaaat gtgacagaaa actttaacat gtggaaaaat
     6061 aacatggtgg aacagatgca tgaggatata atcagtttat gggatcaaag cctaaaacca
     6121 tgtgtaaaat taaccccact ctgtgtcact ttaaactgta gtgatgaatt gaggaacaat
     6181 ggcactatgg ggaacaatgt cactacagag gagaaaggaa tgaaaaactg ctctttcaat
     6241 gtaaccacag tactaaaaga taagaagcag caagtatatg cactttttta tagacttgat
     6301 atagtaccaa tagacaatga tagtagtacc aatagtacca attataggtt aataaattgt
     6361 aatacctcag ccattacaca ggcttgtcca aaggtatcct ttgagccaat tcccatacat
     6421 tattgtgccc cagctggttt tgcgattcta aagtgtagag ataagaagtt caatggaaca
     6481 ggcccatgca caaatgtcag cacagtacaa tgtacacatg gaattaggcc agtggtgtca
     6541 actcaactgc tgttgaatgg cagtctagca gaagaagagg tcataattag atccgaaaat
     6601 ctcacaaaca atgctaaaaa cataatagca catcttaatg aatctgtaaa aattacctgt
     6661 gcaaggccct atcaaaatac aagacaaaga acacctatag gactagggca atcactctat
     6721 actacaagat caagatcaat aataggacaa gcacattgta atattagtag agcacaatgg
     6781 agtaaaactt tacaacaagt agctagaaaa ttaggaaccc ttcttaacaa aacaataata
     6841 aagtttaaac catcctcagg aggggaccca gaaattacaa cacacagttt taattgtgga
     6901 ggggaattct tctactgtaa tacatcagga ctgtttaata gtacatggaa tattagtgca
     6961 tggaataata ttacagagtc aaataatagc acaaacacaa acatcacact ccaatgcaga
     7021 ataaaacaaa ttataaagat ggtggcaggc aggaaagcaa tatatgcccc tcctatcgaa
     7081 agaaacattc tatgttcatc aaatattaca gggctactat tgacaagaga tggtggtata
     7141 aataatagta ctaacgagac ctttagacct ggaggaggag atatgaggga caattggaga
     7201 agtgaattat ataaatataa ggtagtacaa attgaaccac taggagtagc acccaccagg
     7261 gcaaagagaa gagtggtgga aagagaaaaa agagcaatag gattaggagc tatgttcctt
     7321 gggttcttgg gagcagcagg aagcacgatg ggcgcacggt cagtgacgct gacggtacag
     7381 gccagacaat taatgtctgg tatagtgcaa cagcaaaaca atttgctgag ggctatagag
     7441 gcgcaacagc atctgttgca actcacggtc tggggcatta aacagctcca ggcaagaatc
     7501 ctggctgtgg aaagatacct aaaggatcaa cagctcctag gaatttgggg ttgctctgga
     7561 aaacacattt gcaccactaa tgtgccctgg aactctagtt ggagtaatag atctctaaat
     7621 gagatttggc agaacatgac ctggatggag tgggaaagag aaattgacaa ttacacaggc
     7681 ttaatatata gcttaattga ggaatcgcag acccagcaag aaaagaatga aaaagaattg
     7741 ttggaattgg acaagtgggc aagtttgtgg aattggttta gcataacaca atggctgtgg
     7801 tatataaaaa tattcataat gataatagga ggcttgatag gtttaagaat agtttttgct
     7861 gtgctttctt tagtaaatag agttaggcag ggatactcac ctctgtcgtt tcagaccctc
     7921 ctcccagccc cgaggggacc cgacaggccc gaaggaacag aagaagaagg tggagagcga
     7981 ggcagagaca gatccgtgag attgctgaac ggattctcgg cacttatctg ggacgacctg
     8041 cggagcctgt gcctcttcag ctaccaccgc ttgagagact taatcttaat tgcagtgagg
     8101 attgtagaac ttctgggacg cagggggtgg gacatcctca aatatctgtg gaatctccta
     8161 cagtattgga gtcaggaact gaggaacagt gctagtagct tgtttgatgc catagcaata
     8221 gcagtagctg aggggacaga tagagttata gaaataatac aaagagcttg cagagctgtt
     8281 cttaacatac ccagaagaat aagacagggc ttagaaaggt ctttacttta aaatgggtgg
     8341 caaatggtca aaaagtagta tagtgggatg gcctgctata agggaaagaa taagaagaac
     8401 taatccagca gcagatgggg taggagcagt atctcgagac ctggaaaaac atggggcaat
     8461 cacaagtagc aatacagcaa gtactaatgc tgactgtgcc tggctagaag cacaagaaga
     8521 gagcgacgag gtgggctttc cagtcagacc ccaggtacct ttaagaccaa tgacttacaa
     8581 agaagctcta gatctcagcc actttttaaa agaaaagggg ggactggaag ggctaatttg
     8641 gtccaaaaag agacaagaga tccttgatct ttgggtctac aacacacaag gcatcttccc
     8701 tgattggcaa aactacacac cagggccagg gatcagatat ccactaacct ttggatggtg
     8761 ctacgagcta gtaccagttg atccacagga ggtagaagaa gacactgaag gagagaccaa
     8821 cagcttgtta caccctatat gccagcatgg aatggaggac ccggagagac aagtgttaaa
     8881 atggagattt aacagcagac tagcatttga gcacaaggcc cgagagatgc atccggagtt
     8941 ctacaaaaac tgatgacacc gagctttcta caagggactt tccgctgggg actttccagg
     9001 gaggcgtgga ctgggcggga ctggggagtg gctaaccctc agatgctgca tataagcagc
     9061 tgctttttgc ctgtactggg tctctctggt tagaccagat ttgagcctgg gagctctctg
     9121 gctagctagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaa
//
LOCUS       HIVHXB2      9718 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral complete
            genome.
ACCESSION   K03455
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 1 (isolate HXB2) proviral DNA.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 653)
  AUTHORS   Starcich,B., Ratner,L., Josephs,S.F., Okamato,T., Gallo,R.C. and
            Wong-Staal,F.
  TITLE     Characterization of long terminal repeat sequences of HTLV-III
  JOURNAL   Science 227, 538-540 (1985)
  STANDARD  full staff_review
REFERENCE   2  (bases 493 to 674; 9577 to 9718)
  AUTHORS   Ratner,L., Haseltine,W., Patarca,R., Livak,K.J., Starcich,B.,
            Josephs,S.J., Doran,E.R., Rafalski,J.A., Whitehorn,E.A.,
            Baumeister,K., Ivanoff,L., Petteway,S.R.Jr., Pearson,M.L.,
            Lautenberger,J.A., Papas,T.S., Ghrayeb,J., Chang,N.T.,
            Gallo,R.C. and Wong-Staal,F.
  TITLE     Complete nucleotide sequence of the AIDS virus, HTLV-III
  JOURNAL   Nature 313, 277-284 (1985)
  STANDARD  full staff_review
REFERENCE   3  (bases 1 to 9718)
  AUTHORS   Ratner,L., Fisher,A., Jagodzinski,L.L., Mitsuya,H., Liou,R.-S.,
            Gallo,R.C. and Wong-Staal,F.
  TITLE     Complete nucleotide sequences of functional clones of the AIDS
            virus
  JOURNAL   AIDS Res. Hum. Retroviruses 3, 57-69 (1987)
  STANDARD  full staff_review
REFERENCE   4  (bases 5611 to 5611; revises [3])
  AUTHORS   Ratner,L.
  JOURNAL   Unpublished (1987) Washington U. Med School, St. Louis, MO
  STANDARD  full staff_review
REFERENCE   5  (bases 6224 to 8794)
  AUTHORS   Reitz,M.S.
  JOURNAL   Unpublished (1988)
  STANDARD  full staff_review
COMMENT     Submitted in computer-readable form by L. Ratner.  A second
            envelope gene sequence for the HXB2D clone has been independently
            determined by Reitz and co-workers ([5]); the seven or so
            differences from the Ratner sequence ([3] and [4]) have not been
            resolved nor annotated herein.
            
            The vpU cds annotated below does not possess a start codon in the
            normal position (bases 6061 to 6063; "ACG").  As there are no
            nearby in-frame upstream or downstream start codons, HXB2 may not
            possess a functional vpU cds. This clone is infectious [3].
            
            Sibling sequences are available (see 1989: I-A-47).
FEATURES       from  to/span     description
    pept        789     2291     gag polyprotein
    pept    <  2084     5095     pol polyprotein (NH2-terminus uncertain; AA at
                                 2084)
    pept       5040     5618     vif protein
    pept       5558     5794     vpr protein
    pept       5830     6044     tat protein, exon 2 (first expressed exon)
               8378     8423     tat protein, exon 3 (AA at 8379)
    pept       5969     6044     rev protein, exon 2 (first expressed exon)
               8378     8652     rev protein, exon 3 (AA at 8380)
    pept    <  6061     6309     vpu protein (AA at 6061, put.)
    pept       6224     8794     envelope polyprotein
    pept       8796     9167     nef protein (premature termination)
    IVS         743     5776     tat, rev, nef mRNA intron 1
    IVS        6045     8377     tat cds intron 2
    IVS        6045     8377     rev cds intron 2
    IVS        6045     8377     tat, rev, nef mRNA intron 2
    LTR           1      634     5' LTR
    LTR        9085     9718     3' LTR
    rpt         454      551     R repeat 5' copy
    rpt        9538     9635     R repeat 3' copy
    binding     636      653     primer (Lys-tRNA) binding site
    signal     9611     9616     poly-A signal
BASE COUNT     3411 a   1773 c   2370 g   2164 t
ORIGIN      5' terminus of 5' LTR-U3.
        1 tggaagggct aattcactcc caacgaagac aagatatcct tgatctgtgg atctaccaca
       61 cacaaggcta cttccctgat tagcagaact acacaccagg gccagggatc agatatccac
      121 tgacctttgg atggtgctac aagctagtac cagttgagcc agagaagtta gaagaagcca
      181 acaaaggaga gaacaccagc ttgttacacc ctgtgagcct gcatggaatg gatgacccgg
      241 agagagaagt gttagagtgg aggtttgaca gccgcctagc atttcatcac atggcccgag
      301 agctgcatcc ggagtacttc aagaactgct gacatcgagc ttgctacaag ggactttccg
      361 ctggggactt tccagggagg cgtggcctgg gcgggactgg ggagtggcga gccctcagat
      421 cctgcatata agcagctgct ttttgcctgt actgggtctc tctggttaga ccagatctga
      481 gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct
      541 tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc
      601 agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacctgaaag
      661 cgaaagggaa accagagctc tctcgacgca ggactcggct tgctgaagcg cccgcacggc
      721 aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactagcg gaggctagaa
      781 ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat cgatgggaaa
      841 aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa
      901 gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta
      961 gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat
     1021 tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca
     1081 aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaaaaaa gcacagcaag
     1141 cagcagctga cacaggacac agcaatcagg tcagccaaaa ttaccctata gtgcagaaca
     1201 tccaggggca aatggtacat caggccatat cacctagaac tttaaatgca tgggtaaaag
     1261 tagtagaaga gaaggctttc agcccagaag tgatacccat gttttcagca ttatcagaag
     1321 gagccacccc acaagattta aacaccatgc taaacacagt ggggggacat caagcagcca
     1381 tgcaaatgtt aaaagagacc atcaatgagg aagctgcaga atgggataga gtgcatccag
     1441 tgcatgcagg gcctattgca ccaggccaga tgagagaacc aaggggaagt gacatagcag
     1501 gaactactag tacccttcag gaacaaatag gatggatgac aaataatcca cctatcccag
     1561 taggagaaat ttataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata
     1621 gccctaccag cattctggac ataagacaag gaccaaagga accctttaga gactatgtag
     1681 accggttcta taaaactcta agagccgagc aagcttcaca ggaggtaaaa aattggatga
     1741 cagaaacctt gttggtccaa aatgcgaacc cagattgtaa gactatttta aaagcattgg
     1801 gaccagcggc tacactagaa gaaatgatga cagcatgtca gggagtagga ggacccggcc
     1861 ataaggcaag agttttggct gaagcaatga gccaagtaac aaattcagct accataatga
     1921 tgcagagagg caattttagg aaccaaagaa agattgttaa gtgtttcaat tgtggcaaag
     1981 aagggcacac agccagaaat tgcagggccc ctaggaaaaa gggctgttgg aaatgtggaa
     2041 aggaaggaca ccaaatgaaa gattgtactg agagacaggc taatttttta gggaagatct
     2101 ggccttccta caagggaagg ccagggaatt ttcttcagag cagaccagag ccaacagccc
     2161 caccagaaga gagcttcagg tctggggtag agacaacaac tccccctcag aagcaggagc
     2221 cgatagacaa ggaactgtat cctttaactt ccctcaggtc actctttggc aacgacccct
     2281 cgtcacaata aagatagggg ggcaactaaa ggaagctcta ttagatacag gagcagatga
     2341 tacagtatta gaagaaatga gtttgccagg aagatggaaa ccaaaaatga tagggggaat
     2401 tggaggtttt atcaaagtaa gacagtatga tcagatactc atagaaatct gtggacataa
     2461 agctataggt acagtattag taggacctac acctgtcaac ataattggaa gaaatctgtt
     2521 gactcagatt ggttgcactt taaattttcc cattagccct attgagactg taccagtaaa
     2581 attaaagcca ggaatggatg gcccaaaagt taaacaatgg ccattgacag aagaaaaaat
     2641 aaaagcatta gtagaaattt gtacagagat ggaaaaggaa gggaaaattt caaaaattgg
     2701 gcctgaaaat ccatacaata ctccagtatt tgccataaag aaaaaagaca gtactaaatg
     2761 gagaaaatta gtagatttca gagaacttaa taagagaact caagacttct gggaagttca
     2821 attaggaata ccacatcccg cagggttaaa aaagaaaaaa tcagtaacag tactggatgt
     2881 gggtgatgca tatttttcag ttcccttaga tgaagacttc aggaagtata ctgcatttac
     2941 catacctagt ataaacaatg agacaccagg gattagatat cagtacaatg tgcttccaca
     3001 gggatggaaa ggatcaccag caatattcca aagtagcatg acaaaaatct tagagccttt
     3061 tagaaaacaa aatccagaca tagttatcta tcaatacatg gatgatttgt atgtaggatc
     3121 tgacttagaa atagggcagc atagaacaaa aatagaggag ctgagacaac atctgttgag
     3181 gtggggactt accacaccag acaaaaaaca tcagaaagaa cctccattcc tttggatggg
     3241 ttatgaactc catcctgata aatggacagt acagcctata gtgctgccag aaaaagacag
     3301 ctggactgtc aatgacatac agaagttagt ggggaaattg aattgggcaa gtcagattta
     3361 cccagggatt aaagtaaggc aattatgtaa actccttaga ggaaccaaag cactaacaga
     3421 agtaatacca ctaacagaag aagcagagct agaactggca gaaaacagag agattctaaa
     3481 agaaccagta catggagtgt attatgaccc atcaaaagac ttaatagcag aaatacagaa
     3541 gcaggggcaa ggccaatgga catatcaaat ttatcaagag ccatttaaaa atctgaaaac
     3601 aggaaaatat gcaagaatga ggggtgccca cactaatgat gtaaaacaat taacagaggc
     3661 agtgcaaaaa ataaccacag aaagcatagt aatatgggga aagactccta aatttaaact
     3721 gcccatacaa aaggaaacat gggaaacatg gtggacagag tattggcaag ccacctggat
     3781 tcctgagtgg gagtttgtta atacccctcc cttagtgaaa ttatggtacc agttagagaa
     3841 agaacccata gtaggagcag aaaccttcta tgtagatggg gcagctaaca gggagactaa
     3901 attaggaaaa gcaggatatg ttactaatag aggaagacaa aaagttgtca ccctaactga
     3961 cacaacaaat cagaagactg agttacaagc aatttatcta gctttgcagg attcgggatt
     4021 agaagtaaac atagtaacag actcacaata tgcattagga atcattcaag cacaaccaga
     4081 tcaaagtgaa tcagagttag tcaatcaaat aatagagcag ttaataaaaa aggaaaaggt
     4141 ctatctggca tgggtaccag cacacaaagg aattggagga aatgaacaag tagataaatt
     4201 agtcagtgct ggaatcagga aagtactatt tttagatgga atagataagg cccaagatga
     4261 acatgagaaa tatcacagta attggagagc aatggctagt gattttaacc tgccacctgt
     4321 agtagcaaaa gaaatagtag ccagctgtga taaatgtcag ctaaaaggag aagccatgca
     4381 tggacaagta gactgtagtc caggaatatg gcaactagat tgtacacatt tagaaggaaa
     4441 agttatcctg gtagcagttc atgtagccag tggatatata gaagcagaag ttattccagc
     4501 agaaacaggg caggaaacag catattttct tttaaaatta gcaggaagat ggccagtaaa
     4561 aacaatacat actgacaatg gcagcaattt caccggtgct acggttaggg ccgcctgttg
     4621 gtgggcggga atcaagcagg aatttggaat tccctacaat ccccaaagtc aaggagtagt
     4681 agaatctatg aataaagaat taaagaaaat tataggacag gtaagagatc aggctgaaca
     4741 tcttaagaca gcagtacaaa tggcagtatt catccacaat tttaaaagaa aaggggggat
     4801 tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca tacaaactaa
     4861 agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca gggacagcag
     4921 aaattcactt tggaaaggac cagcaaagct cctctggaaa ggtgaagggg cagtagtaat
     4981 acaagataat agtgacataa aagtagtgcc aagaagaaaa gcaaagatca ttagggatta
     5041 tggaaaacag atggcaggtg atgattgtgt ggcaagtaga caggatgagg attagaacat
     5101 ggaaaagttt agtaaaacac catatgtatg tttcagggaa agctagggga tggttttata
     5161 gacatcacta tgaaagccct catccaagaa taagttcaga agtacacatc ccactagggg
     5221 atgctagatt ggtaataaca acatattggg gtctgcatac aggagaaaga gactggcatt
     5281 tgggtcaggg agtctccata gaatggagga aaaagagata tagcacacaa gtagaccctg
     5341 aactagcaga ccaactaatt catctgtatt actttgactg tttttcagac tctgctataa
     5401 gaaaggcctt attaggacac atagttagcc ctaggtgtga atatcaagca ggacataaca
     5461 aggtaggatc tctacaatac ttggcactag cagcattaat aacaccaaaa aagataaagc
     5521 cacctttgcc tagtgttacg aaactgacag aggatagatg gaacaagccc cagaagacca
     5581 agggccacag agggagccac acaatgaatg gacactagag cttttagagg agcttaagaa
     5641 tgaagctgtt agacattttc ctaggatttg gctccatggc ttagggcaac atatctatga
     5701 aacttatggg gatacttggg caggagtgga agccataata agaattctgc aacaactgct
     5761 gtttatccat tttcagaatt gggtgtcgac atagcagaat aggcgttact cgacagagga
     5821 gagcaagaaa tggagccagt agatcctaga ctagagccct ggaagcatcc aggaagtcag
     5881 cctaaaactg cttgtaccaa ttgctattgt aaaaagtgtt gctttcattg ccaagtttgt
     5941 ttcataacaa aagccttagg catctcctat ggcaggaaga agcggagaca gcgacgaaga
     6001 gctcatcaga acagtcagac tcatcaagct tctctatcaa agcagtaagt agtacatgta
     6061 acgcaaccta taccaatagt agcaatagta gcattagtag tagcaataat aatagcaata
     6121 gttgtgtggt ccatagtaat catagaatat aggaaaatat taagacaaag aaaaatagac
     6181 aggttaattg atagactaat agaaagagca gaagacagtg gcaatgagag tgaaggagaa
     6241 atatcagcac ttgtggagat gggggtggag atggggcacc atgctccttg ggatgttgat
     6301 gatctgtagt gctacagaaa aattgtgggt cacagtctat tatggggtac ctgtgtggaa
     6361 ggaagcaacc accactctat tttgtgcatc agatgctaaa gcatatgata cagaggtaca
     6421 taatgtttgg gccacacatg cctgtgtacc cacagacccc aacccacaag aagtagtatt
     6481 ggtaaatgtg acagaaaatt ttgacatgtg gaaaaatgac atggtagaac agatgcatga
     6541 ggatataatc agtttatggg atcaaagcct aaagccatgt gtaaaattaa ccccactctg
     6601 tgttagttta aagtgcactg atttgaagaa tgatactaat accaatagta gtagcgggag
     6661 aatgataatg gagaaaggag agataaaaaa ctgctctttc aatatcagca caagcataag
     6721 aggtaaggtg cagaaagaat atgcattttt ttataaactt gatataatac caatagataa
     6781 tgatactacc agctatagct tgacaagttg taacacctca gtcattacac aggcctgtcc
     6841 aaaggtatcc tttgagccaa ttcccataca ttattgtgcc ccggctggtt ttgcgattct
     6901 aaaatgtaat aataagacgt tcaatggaac aggaccatgt acaaatgtca gcacagtaca
     6961 atgtacacat ggaattaggc cagtagtatc aactcaactg ctgttaaatg gcagtctagc
     7021 agaagaagag gtagtaatta gatctgtcaa tttcacggac aatgctaaaa ccataatagt
     7081 acagctgaac acatctgtag aaattaattg tacaagaccc aacaacaata caagaaaaag
     7141 aatccgtatc cagagaggac cagggagagc atttgttaca ataggaaaaa taggaaatat
     7201 gagacaagca cattgtaaca ttagtagagc aaaatggaat aacactttaa aacagataga
     7261 tagcaaatta agagaacaat tcggaaataa taaaacaata atctttaagc aatcctcagg
     7321 aggggaccca gaaattgtaa cgcacagttt taattgtgga ggggaatttt tctactgtaa
     7381 ttcaacacaa ctgtttaata gtacttggtt taatagtact tggagtactg aagggtcaaa
     7441 taacactgaa ggaagtgaca caatcaccct cccatgcaga ataaaacaaa ttataaacat
     7501 gtggcagaaa gtaggaaaag caatgtatgc ccctcccatc agtggacaaa ttagatgttc
     7561 atcaaatatt acagggctgc tattaacaag agatggtggt aatagcaaca atgagtccga
     7621 gatcttcaga cttggaggag gagatatgag ggacaattgg agaagtgaat tatataaata
     7681 taaagtagta aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt
     7741 gcagagagaa aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc
     7801 aggaagcact atgggcgcag cctcaatgac gctgacggta caggccagac aattattgtc
     7861 tggtatagtg cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt
     7921 gcaactcaca gtctggggca tcaagcagct ccaagcaaga atcctagctg tggaaagata
     7981 cctaaaggat caacagctcc tagggatttg gggttgctct ggaaaactca tttgcaccac
     8041 tgctgtgcct tggaatgcta gttggagtaa taaatctctg gaacagatct ggaatcacac
     8101 gacctggatg gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat
     8161 tgaagaatcg caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg
     8221 ggcaagtttg tggaattggt ttaacataac aaattggctg tggtatataa aattattcat
     8281 aatgatagta ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa
     8341 tagagttagg cagggatatt caccattatc gtttcagacc cacctcccaa tcccgagggg
     8401 acccgacagg cccgaaggaa tagaagaaga aggtggagag agagacagag acagatccat
     8461 tcgattagtg aacggatcct tggcacttat ctgggacgat ctgcggagcc tgtgcctctt
     8521 cagctaccac cgcttgagag acttactctt gattgtaacg aggattgtgg aacttctggg
     8581 acgcaggggg tgggaagccc tcaaatattg gtggaatctc ctacagtatt ggagtcagga
     8641 actaaagaat agtgctgtta gcttgctcaa tgccacagcc atagcagtag ctgaggggac
     8701 agatagggtt atagaagtag tacaaggagc ttgtagagct attcgccaca tacctagaag
     8761 aataagacag ggcttggaaa ggattttgct ataagatggg tggcaagtgg tcaaaaagta
     8821 gtgtgattgg atggcttact gtaagggaaa gaatgagacg agctgagcca gcagcagatg
     8881 gggtgggagc agcatctcga gacctggaaa aacatggagc aatcacaagt agcaacacag
     8941 cagctaccaa tgctgcttgt gcctggctag aagcacaaga ggaggaggag gtgggttttc
     9001 cagtcacacc tcaggtacct ttaagaccaa tgacttacaa ggcagctgta gatcttagcc
     9061 actttttaaa agaaaagggg ggactggaag ggctaattca ctcccaaaga agacaagata
     9121 tccttgatct gtggatctac cacacacaag gctacttccc tgattgacag aactacacac
     9181 cagggccagg ggtcagatat ccactgacct ttggatggtg ctacaagcta gtaccagttg
     9241 agccagataa gatagaagag gccaataaag gagagaacac cagcttgtta caccctgtga
     9301 gcctgcatgg gatggatgac ccggagagag aagtgttaga gtggaggttt gacagccgcc
     9361 tagcatttca tcacgtggcc cgagagctgc atccggagta cttcaagaac tgctgacatc
     9421 gagcttgcta caagggactt tccgctgggg actttccagg gaggcgtggc ctgggcggga
     9481 ctggggagtg gcgagccctc agatcctgca tataagcagc tgctttttgc ctgtactggg
     9541 tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg
     9601 cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt
     9661 gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagca
//
LOCUS       HIVMAB         59 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) gag region.
ACCESSION   M27955
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 1 (isolate MAB), clone 01.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 59)
  AUTHORS   Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
            Wain-Hobson,S.
  TITLE     HIV-1 isolates are rapidly evolving quasispecies: Evidence for
            viral mixtures and preferred nucleotide substitutions
  JOURNAL   j acquired immuno defic syndr 2, 344-52 (1989)
  STANDARD  full staff_review
COMMENT     [1] kindly submitted in computer readable form by S. Wain-Hobson.
            Sibling sequences are available (see 1989: I-A-184).
FEATURES       from  to/span     description
    pept    <     1  >    59     gag protein (AA at 2)
BASE COUNT       24 a      7 c     12 g     16 t
ORIGIN      Within gag cds.
        1 ctataaaaga tggataatcc tggggttaaa taaaatagta agaatgtata gccctgtca
//
LOCUS       HIVMN        9738 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral complete
            genome.
ACCESSION   M17449
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 1 (isolate MN), proviral DNA
            (non-infectious).
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9738)
  AUTHORS   Gurgo,C., Guo,H.-G., Franchini,G., Aldovini,A., Collalti,E.,
            Farrell,K., Wong-Staal,F., Gallo,R.C. and Reitz,M.S.Jr.
  TITLE     Envelope sequences of two new United States HIV-1 isolates
  JOURNAL   Virology 164, 531-536 (1988)
  STANDARD  full staff_review
COMMENT     Sequence kindly provided in computer-readable form by M. Reitz,
            N.C.I., Bethesda, MD. 20892 U.S.A.
            
            The MN isolate was taken from a pediatric AIDS patient in 1984.
            The pol coding sequence shows an in-frame stop codon at 3783; the
            nef protein is prematurely truncated at position 9357.  The vpU
            protein is prematurely truncated at position 6142.
FEATURES       from  to/span     description
    pept        787     2307     gag polyprotein
    pept    <  2091     3782     pol polyprotein (NH2-terminus uncertain; AA at
                                 2091; in-frame stop codon at 3783)
               3786     5111     pol polyprotein
    pept       5056     5634     vif protein
    pept       5574     5864     vpr protein
    pept       5845     6059     tat protein, exon 2 (first expressed exon)
               8396     8486     tat protein, exon 3 (AA at 8397)
    pept       5984     6059     rev protein, exon 2 (first expressed exon)
               8396     8667     rev protein, exon 3 (AA at 8398)
    pept       6076     6144     vpu protein (premature termination)
    pept       6239     8809     envelope polyprotein
    pept       8811     9359     nef protein (premature termination at 9357
                                 relative to other HIV-1 sequences)
    LTR           1      633     5' LTR
    LTR        9106     9738     3' LTR
    rpt         453      550     R repeat 5' copy
    rpt        9558     9655     R repeat 3' copy
    binding     635      652     primer (Lys-tRNA) binding site
    site       3783     3785     pol cds in-frame stop codon
    signal     9631     9636     poly-A signal
BASE COUNT     3463 a   1769 c   2344 g   2162 t
ORIGIN      Left end of viral genome.
        1 tggatgggtt aatttactcc caaaagagac aagacatcct tgatctgtgg gtctaccaca
       61 cacaaggcta cttccctgat tggcagaact acacaccagg gccagggatc agatatccac
      121 tgacctttgg atggtgcttc aagctagtac cagttgagcc agagaagata gaagaggcca
      181 ataaaggaga gaacaactgc ttgttacacc ctatgagcca gcatggatgg atgacccgga
      241 gagagaagtg ttagtgtgga agtctgacag ccacctagca tttcagcatt atgcccgaga
      301 gctgcatccg gagtactaca agaactgctg acatcgagct atctacaagg gactttccgc
      361 tggggacttt ccagggaggt gtggcctggg cgggaccggg gagtggcgag ccctcagatc
      421 gtgcatataa gcagctgctt tctgcctgta ctgggtctct ctggttagac cagatctgag
      481 cctgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa agcttgcctt
      541 gagtgcttca agtagtgtgt gcccgtctgt tatgtgactc tggtagctag agatccctca
      601 gatcctttta ggcagtgtgg aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc
      661 gaaagaaaaa ccagagctct ctcgacgcag gactcggctt gctgaagcgc gcacggcaag
      721 aggcgagggg cggcgactgg tgagtacgcc aaaaattctt gactagcgga ggctagaagg
      781 agagagatgg gtgcgagagc gtcggtatta agcgggggag aattagatcg atgggaaaac
      841 attcggttaa ggccaggggg aaagaaaaaa tataaattaa aacatgtagt atgggcaagc
      901 agggagctag aacgattcgc agtcaatcct ggcctgttag aaacatcaga aggctgtaga
      961 caaatactgg gacagctaca accatccctt cagacaggat cagaagaact taaatcatta
     1021 tataatacag tagcaaccct ctattgtgtg catcaaaaga tagagataaa agacaccaag
     1081 gaagctttag agaaaataga ggaagagcaa aacaaaagta agaaaaaagc acagcaagca
     1141 gcagctgaca caggaaacag aggaaacagc agccaagtca gccaaaatta ccccatagtg
     1201 cagaacatcg aggggcaaat ggtacatcag gccatatcac ctagaacttt aaatgcatgg
     1261 gtaaaagtag tagaagagaa ggctttcagc ccagaagtaa tacccatgtt ttcagcatta
     1321 tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg gggacatcaa
     1381 gcagccatgc aaatgttaaa agagaccatc aatgaggaag ctgcagaatg ggatagattg
     1441 catccagtgc atgcagggcc tattacacca ggccagatga gagaaccaag gggaagtgac
     1501 atagcaggaa ctactagtac ccttcaggaa caaataggat ggatgacaaa taatccacct
     1561 atcccagtag gagaaatcta taaaagatgg ataatcctgg gattaaataa aatagtaagg
     1621 atgtatagcc cttccagcat tctggacata agacaaggac caaaggaacc ctttagagac
     1681 tatgtagacc ggttctataa aactctaaga gccgagcaag cttcacagga ggtaaaaaac
     1741 cggacgacag aaaccttgtt ggtccaaaat gcgaacccag attgtaagac tattttaaaa
     1801 gcattgggac cagcagctac actagaagaa atgatgacag catgtcaggg agtgggagga
     1861 cctggtcata aagcaagagt tttggcggaa gcgatgagcc aagtaacaaa ttcagctacc
     1921 ataatgatgc agagaggcaa ttttaggaat caaagaaaga ttatcaagtg cttcaattgt
     1981 ggcaaagaag ggcacatagc caaaaattgc agggccccta ggaaaagggg ctgttggaaa
     2041 tgtggaaagg aaggacacca aatgaaagat tgtactgaga gacaggctaa ttttttaggg
     2101 aagatctggc cttcctgcaa gggaaggcgg aattttcctc agagcagaac agagccaaca
     2161 gccccaccag aagagagctt caggtttggg gaagagacaa caactcccta tcagaagcag
     2221 gagaagaagc aggagacgat agacaaggac ctgtatcctt tagcttccct caaatcactc
     2281 tttggcaacg acccattgtc acaataaaga taggggggca actaaaggaa gctctattag
     2341 atacaggagc agatgataca gtattaggag aaatgaattt gccaagaaga tggaaaccaa
     2401 aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcag ataaccatag
     2461 gaatctgtgg acataaagct ataggtacag tattagtagg acctacacct gtcaacataa
     2521 ttggaagaaa tctgttgact cagcttgggt gcactttaaa ttttcccatt agtcctattg
     2581 aaactgtacc agtaaaatta aagccaggaa tggatggccc aaaagttaaa caatggccat
     2641 tgacagaaga aaaaataaaa gcattaatag aaatttgtac agaaatggaa aaggaaggga
     2701 aaatttcaaa aattgggcct gaaaatccat acaatactcc agtatttgcc ataaagaaaa
     2761 aagacagtac taaatggaga aaattagtag atttcagaga acttaataag aaaactcaag
     2821 acttctggga agttcaatta ggaataccac atcctgcagg gttaaaaaag aaaaaatcag
     2881 taacagtact ggatgtgggt gatgcatatt tttcagttcc cttagataaa gacttcagga
     2941 agtatactgc atttaccata cctagtataa acaatgaaac accagggatt agatatcagt
     3001 acaatgtgct tccacaggga tggaaaggat caccagcaat attccaaagt agcatgacaa
     3061 aaatcttaga gccttttaga aaacaaaatc cagacatagt tatctatcaa tacatggatg
     3121 atttgtatgt aggatctgac ttagaaatag ggcagcatag agcaaaaata gaggaactga
     3181 gacgacatct gttgaggtgg ggatttacca caccagacaa aaaacatcag aaagaacctc
     3241 cattcctttg gatgggttat gaactccatc ctgataaatg gacagtacag cctatagtgc
     3301 taccagaaaa agacagctgg actgtcaatg acatacagaa gttagtggga aaattgaatt
     3361 gggcaagtca gatttacgca gggattaaag taaagcaatt atgtaaactc cttagaggaa
     3421 ccaaagcact aacagaagta ataccactaa cagaagaagc agagctagaa ctggcagaaa
     3481 acagggaaat tctaaaagaa ccagtacatg gagtgtatta tgacccatca aaagacttaa
     3541 tagcagaagt acagaagcag gggcaaggcc aatggacata tcaaatttat caagagccat
     3601 ttaaaaatct gaaaacaggc aaatatgcaa gaatgagggg tgcccacact aatgatgtaa
     3661 aacaattaac agaggcagtg caaaaaatag ccacagaaag catagtaata tggggaaaga
     3721 ctcctaaatt tagactaccc atacaaaaag aaacatggga aacatggtgg acagagtata
     3781 cgtaagccac ctggattcct gagtgggagg ttgtcaatac ccctccctta gtgaaattat
     3841 ggtaccagtt agagaaagaa cccatagtag gtgcagaaac tttctatgta gatggggcag
     3901 ctaacaggga gactaaaaaa ggaaaagcag gatatgttac taacagagga agacaaaagg
     3961 ttgtctccct aactgacaca acaaatcaga agactgagtt acaagcaatt catctagctt
     4021 tgcaagattc agggttagaa gtaaacatag taacagactc acaatatgca ttaggaatca
     4081 ttcaagcaca accagataaa agtgaatcag agttagtcag tcaaataata gagcagttaa
     4141 taaaaaagga aaaggtctat ctggcatggg taccagcaca caaaggaatt ggaggaaatg
     4201 aacaagtaga taaattagtc agtgctggaa tcaggaaagt actattttta gatggaatag
     4261 ataaggccca agaagaccat gagaaatatc acagtaattg gagagcaatg gctagtgact
     4321 ttaacctacc acctatagta gcaaaagaaa tagtagccag ctgtgataaa tgtcagctaa
     4381 aaggagaagc catgcatgga caagtagact gtagtccagg aatatggcaa ctagattgta
     4441 cacatttaga aggaaaagtt atcctggtag cagttcatgt agccagtgga tacatagaag
     4501 cagaagttat tccagcagag acagggcagg agacagcata ctttctctta aaattagcag
     4561 gaagatggcc agtaaaaaca atacatacag acaatggccc caatttcacc agtactacgg
     4621 ttaaggccgc ctgttggtgg acgggaatca agcaggaatt tggcattccc tacaatcccc
     4681 aaagtcaagg agtaatagaa tctatgaata aagaattaaa gaaaattata ggacaggtaa
     4741 gagatcaggc tgaacatctt aagagagcag tacaaatggc agtattcatc cacaatttta
     4801 aaagaaaagg ggggattggg gggtacagtg caggggaaag aatagtaggc ataatagcaa
     4861 cagacataca aactaaagaa ctacaaaaac aaattacaaa aattcaaaat tttcgggttt
     4921 attacaggga cagcagagat ccactttgga aaggaccagc aaagcttctc tggaaaggtg
     4981 aaggggcagt agtaatacaa gataataatg acataaaagt agtgccaaga agaaaagcaa
     5041 aggtcattag ggattatgga aaacagacgg caggtgatga ttgtgtggca agcagacagg
     5101 atgaggatta gaacatggaa aagtttagta aaacaccata tgtatatttc aaagaaagct
     5161 aaaggacggt tttatagaca tcactatgaa agcactcatc caagaataag ttcagaagta
     5221 cacatcccac taggggatgc tagattggta ataacaacat attggggtct gcatacagga
     5281 gaaagagact ggcatttagg tcagggagtc tccatagaat ggaggaaaaa gagatatagc
     5341 acacaagtag accctgacct agcagaccac ctaattcatc tgcattactt tgattgtttt
     5401 tcagactctg ccataagaaa ggccatatta ggacatagag ttagtcctat ttgtgaattt
     5461 caagcaggac ataacaaggt aggacctcta cagtacttgg cactaacagc attaataaca
     5521 ccaaaaaaga taaagccacc tttgcctagt gttaagaaac tgacagagga tagatggaac
     5581 aagccccaga agaccaaggg ccacagaggg agccatacaa tcaatgggca ctagagcttt
     5641 tagaggagct taagaatgaa gctgttagac attttcctag gatatggctc catggcttag
     5701 ggcaacatat ctatgaaact tatggggata cttgggcagg agtggaagcc ataataagaa
     5761 ttctacaaca actgctgttt attcatttca gaattgggtg tcgacatagc agaataggca
     5821 ttattcgaca gaggagagca agaaatggag ccagtagatc ctagactaga gccctggaag
     5881 catccaggaa gtcagcctaa gactgcttgt accacttgct attgtaaaaa gtgttgcttt
     5941 cattgccaag tttgtttcac aaaaaaagcc ttaggcatct cctatggcag gaagaagcgg
     6001 agacagcgac gaagagctcc tgaagacagt cagactcatc aagtttctct accaaagcag
     6061 taagtagtac atgtaatgca acctttagta atagcagcaa tagtagcatt agtagtagca
     6121 ggaataatag caatagttgt gtgatccata gtattcatag aatataggaa aataagaaga
     6181 caaagaaaaa tagacaggtt aattgataga ataagcgaaa gagcagaaga cagtggcaat
     6241 gagagtgaag gggatcagga ggaattatca gcactggtgg ggatggggca cgatgctcct
     6301 tgggttatta atgatctgta gtgctacaga aaaattgtgg gtcacagtct attatggggt
     6361 acctgtgtgg aaagaagcaa ccaccactct attttgtgca tcagatgcta aagcatatga
     6421 tacagaggta cataatgttt gggccacaca agcctgtgta cccacagacc ccaacccaca
     6481 agaagtagaa ttggtaaatg tgacagaaaa ttttaacatg tggaaaaata acatggtaga
     6541 acagatgcat gaggatataa tcagtttatg ggatcaaagc ctaaagccat gtgtaaaatt
     6601 aaccccactc tgtgttactt taaattgcac tgatttgagg aatactacta ataccaataa
     6661 tagtactgct aataacaata gtaatagcga gggaacaata aagggaggag aaatgaaaaa
     6721 ctgctctttc aatatcacca caagcataag agataagatg cagaaagaat atgcacttct
     6781 ttataaactt gatatagtat caatagataa tgatagtacc agctataggt tgataagttg
     6841 taatacctca gtcattacac aagcttgtcc aaagatatcc tttgagccaa ttcccataca
     6901 ctattgtgcc ccggctggtt ttgcgattct aaaatgtaac gataaaaagt tcagtggaaa
     6961 aggatcatgt aaaaatgtca gcacagtaca atgtacacat ggaattaggc cagtagtatc
     7021 aactcaactg ctgttaaatg gcagtctagc agaagaagag gtagtaatta gatctgagaa
     7081 tttcactgat aatgctaaaa ccatcatagt acatctgaat gaatctgtac aaattaattg
     7141 tacaagaccc aactacaata aaagaaaaag gatacatata ggaccaggga gagcatttta
     7201 tacaacaaaa aatataatag gaactataag acaagcacat tgtaacatta gtagagcaaa
     7261 atggaatgac actttaagac agatagttag caaattaaaa gaacaattta agaataaaac
     7321 aatagtcttt aatcaatcct caggagggga cccagaaatt gtaatgcaca gttttaattg
     7381 tggaggggaa tttttctact gtaatacatc accactgttt aatagtactt ggaatggtaa
     7441 taatacttgg aataatacta cagggtcaaa taacaatatc acacttcaat gcaaaataaa
     7501 acaaattata aacatgtggc aggaagtagg aaaagcaatg tatgcccctc ccattgaagg
     7561 acaaattaga tgttcatcaa atattacagg gctactatta acaagagatg gtggtaagga
     7621 cacggacacg aacgacaccg agatcttcag acctggagga ggagatatga gggacaattg
     7681 gagaagtgaa ttatataaat ataaagtagt aacaattgaa ccattaggag tagcacccac
     7741 caaggcaaag agaagagtgg tgcagagaga aaaaagagca gcgataggag ctctgttcct
     7801 tgggttctta ggagcagcag gaagcactat gggcgcagcg tcagtgacgc tgacggtaca
     7861 ggccagacta ttattgtctg gtatagtgca acagcagaac aatttgctga gggccattga
     7921 ggcgcaacag catatgttgc aactcacagt ctggggcatc aagcagctcc aggcaagagt
     7981 cctggctgtg gaaagatacc taaaggatca acagctcctg gggttttggg gttgctctgg
     8041 aaaactcatt tgcaccacta ctgtgccttg gaatgctagt tggagtaata aatctctgga
     8101 tgatatttgg aataacatga cctggatgca gtgggaaaga gaaattgaca attacacaag
     8161 cttaatatac tcattactag aaaaatcgca aacccaacaa gaaaagaatg aacaagaatt
     8221 attggaattg gataaatggg caagtttgtg gaattggttt gacataacaa attggctgtg
     8281 gtatataaaa atattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc
     8341 tgtactttct atagtgaata gagttaggca gggatactca ccattgtcgt tgcagacccg
     8401 ccccccagtt ccgaggggac ccgacaggcc cgaaggaatc gaagaagaag gtggagagag
     8461 agacagagac acatccggtc gattagtgca tggattctta gcaattatct gggtcgacct
     8521 gcggagcctg ttcctcttca gctaccacca cagagactta ctcttgattg cagcgaggat
     8581 tgtggaactt ctgggacgca gggggtggga agtcctcaaa tattggtgga atctcctaca
     8641 gtattggagt caggaactaa agagtagtgc tgttagcttg cttaatgcca cagctatagc
     8701 agtagctgag gggacagata gggttataga agtactgcaa agagctggta gagctattct
     8761 ccacatacct acaagaataa gacagggctt ggaaagggct ttgctataag atgggtggca
     8821 aatggtcaaa acgtgtgact ggatggccta ctgtaaggga aagaatgaga cgagctgaac
     8881 cagctgagct agcagcagat ggggtgggag cagcatcccg agacctggaa aaacatggag
     8941 cactcacaag tagcaataca gcagctacca atgctgattg tgcctggcta gaagcacaag
     9001 aggaggagga agtgggtttt ccagtcaaac ctcaggtacc tttaagacca atgacttaca
     9061 aagcagcttt agatcttagc cactttttaa aagaaaaggg gggactggat gggttaattt
     9121 actcccaaaa gagacaagac atccttgatc tgtgggtcta ccacacacaa ggctacttcc
     9181 ctgattggca gaactacaca ccagggccag ggatcagata tccactgacc tttggatggt
     9241 gcttcaagct agtaccagtt gagccagaga agatagaaga ggccaataaa ggagagaaca
     9301 actgcttgtt acaccctatg agccagcatg gatggatgac ccggagagag aagtgttagt
     9361 gtggaagtct gacagccacc tagcatttca gcattatgcc cgagagctgc atccggagta
     9421 ctacaagaac tgctgacatc gagctatcta caagggactt tccgctgggg actttccagg
     9481 gaggtgtggc ctgggcggga ccggggagtg gcgagccctc agatcgtgca tataagcagc
     9541 tgctttctgc ctgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg
     9601 gctaactagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag
     9661 tgtgtgcccg tctgttatgt gactctggta gctagagatc cctcagatcc ttttaggcag
     9721 tgtggaaaat ctctagca
//
LOCUS       HIVNDK       9143 bp ds-DNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral complete
            genome.
ACCESSION   M27323
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 1 (isolate NDK), proviral DNA
            infectious clone.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9143)
  AUTHORS   Spire,B., Sire,J., Zachar,V., Rey,F., Barre-Sinoussi,F.,
            Galibert,F., Hampe,A. and Chermann,J.-C.
  TITLE     Nucleotide sequence of HIV1-NDK a highly cytopathic strain of the
            human immunodeficiency virus HIV1
  JOURNAL   Gene 81, 275-84 (1989)
  STANDARD  full staff_review
COMMENT     Kindly provided prior to publication by J.-C. Chermann, Pasteur
            Institute, Marseille.
            
            This is an infectious molecular clone of an isolate estimated to be
            10,000 times more cytopathic in vitro than a prototypical HIV-1.
            [1] reports that only minor sequence differences appear to be
            responsible for the "acute biological effect".
FEATURES       from  to/span     description
    pept        334     1827     gag polyprotein
    pept    <  1623     4631     pol polyprotein (NH2-terminus uncertain)
    pept       4576     5154     vif protein
    pept       5094     5384     vpR protein
    pept       5365     5579     tat protein, exon 2 (first expressed exon)
               7883     7928     tat protein, exon 3 (AA at 7884)
    pept       5504     5579     rev protein, exon 2 (first expressed exon)
               7883     8157     rev protein, exon 3 (AA at 7885)
    pept       5596     5841     vpU protein
    pept       5759     8299     env polyprotein
    pept       8301     8924     nef protein
    pre-msg       1     9143     genomic mRNA
BASE COUNT     3307 a   1617 c   2193 g   2026 t
ORIGIN      5'-terminus of 5' R repeat (putative mRNA start).
        1 ggtctctctg gttagaccag atttgagcct gggagctctg gctaattagg gaacccactg
       61 cttaagcctc aataaagctt gccttgagcg cttcaagtag tgtgtgcccg tctgttgtgt
      121 gactctggta actagagatc cctcagatcc gtttagtccg agtgaaaaat ctctagcagt
      181 ggcgcccgaa cagggacctg aaagcgaaag tagaaccaga gaagatctct cgacgcagga
      241 ctcggcttgc tgaagcgcgc acagcaagag gcgaggggca gcgactggtg agtacgctga
      301 aatttttgac tagcggaggc tagaaggaga gagatgggtg cgagagcgtc agtattaagc
      361 gggggaaaat tagatacatg ggaaagaatt cggttacggc caggaggaaa gaaaaaatat
      421 gcactaaaac atttgatatg ggcaagcagg gagctagaac gatttacact taatcctggc
      481 cttttagaga catcagaagg ctgtaaacaa ataataggac agctacaacc atctattcaa
      541 acaggatcag aagaaattag atcattatat aatacagtag caaccctcta ttgtgtacat
      601 gaaaggatag aggtaaaaga caccaaagaa gctgtagaaa agatggagga agaacaaaac
      661 aaaagtaaga aaaagacaca gcaagcagca gctgatagca gccaggtcag ccaaaattac
      721 cctatagtgc agaacctaca ggggcaaatg gtacatcagg ccatatcacc tagaactttg
      781 aacgcatggg taaaagtaat agaagaaaag gccttcagcc cggaagtaat acccatgttt
      841 tcagcattat cagaaggagc caccccacaa gatttaaaca ccatgctaaa cacagtgggg
      901 ggacatcaag cagctatgca aatgctaaaa gagaccatca atgacgaagc tgcagaatgg
      961 gacagattac atccagtgca tgcagggcct gttgcaccag gccaaatgag agaaccaagg
     1021 ggaagtgata tagcaggaac tactagtacc cttcaggaac aaatagcatg gatgacaagc
     1081 aacccaccta tcccagtagg agaaatctat aaaagatgga taatcctggg attaaataaa
     1141 atagtaagaa tgtatagccc tgtcagcatt ttggacataa gacagggacc aaaggaacct
     1201 tttagagact atgtagaccg gttctataaa actctaagag ccgagcaagc ttcacaggat
     1261 gtaaaaaact ggatgacaga aaccttgttg gtccaaaatg caaacccaga ttgtaaaact
     1321 atcttaaaag cattgggacc acaggctaca ctagaagaaa tgatgacagc atgccaggga
     1381 gtgggggggc ccggccataa agcaagagtt ttggctgagg caatgagcca agtaacaggt
     1441 tcagctactg cagtaatgat gcagagaggc aattttaagg gcccaagaaa aagtattaag
     1501 tgtttcaact gtggcaagga agggcacaca gcaaaaaatt gcagggcccc tagaaaaaag
     1561 ggctgttgga aatgcggaag ggaaggacac caaatgaaag attgcactga aagacaggct
     1621 aattttttag ggaagatttg gccttcccac aagggaaggc cggggaattt tcttcagagc
     1681 agaccagagc caacagcccc accagcagag agcttcgggt ttggggagga gataaccccc
     1741 tctcagaaac aggagcagaa agacaaggaa ctgtatcctt tagcttccct caaatcactc
     1801 tttggcaacg acccctcgtc acaataaaga tagggggaca gctaaaggaa gctctattag
     1861 atacaggagc agatgataca gtattagaag aaataaattt gccaggaaaa tggaagccaa
     1921 aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcaa atactcatag
     1981 aaatctgtgg atataaagct atgggtacag tattagtagg acctacacct gtcaacataa
     2041 ttggaagaaa tttgttgacc cagattggct gcactttaaa ttttccaatt agtcctattg
     2101 aaactgtacc agtaaaatta aagccaggaa tggatggccc aaaagttaaa caatggccat
     2161 tgacagaaga aaaaataaaa gcattaacag aaatttgtac agaaatggaa aaggaaggaa
     2221 aaatttcaag aattgggcct gaaaatccat ataatactcc aatatttgcc ataaagaaaa
     2281 aagacagtac caagtggaga aaattagtag atttcagaga acttaataag agaactcaag
     2341 atttctggga ggttcaatta ggaataccgc atcctgcagg gctgaaaaag aaaaaatcag
     2401 taacagtact ggatgtgggt gatgcatatt tctcagttcc cttagatgaa gattttagga
     2461 aatataccgc atttaccata cctagtataa acaatgagac accagggatt agatatcagt
     2521 acaatgtgct cccacaggga tggaaaggat caccggcaat attccaaagt agcatgacaa
     2581 aaatcttaga gccctttaga aaacaaaatc cagaaatagt tatctatcaa tacatggatg
     2641 atttgtatgt aggatctgac ttagaaatag ggcagcatag aacaaaaata gaggaattaa
     2701 gagaacatct attgaggtgg ggatttacca caccagataa aaaacatcag aaagaacctc
     2761 catttctttg gatgggttat gaactccatc ctgataaatg gacagtacag cctataaacc
     2821 tgccagaaaa agaaagctgg actgtcaatg atatacagaa gttagtgggg aaattaaact
     2881 gggcaagcca gatttatgca ggaattaaag taaagcaatt atgtaaactc cttaggggaa
     2941 ccaaagcact aacagaagta gtaccactaa cagaagaagc agaattagaa ctggcagaaa
     3001 acagggaaat tctaaaagaa ccagtacatg gagtgtatta tgacccatca aaagacttaa
     3061 tagcagaact acagaaacaa ggggacggcc aatggacata ccaaatttat caagaaccat
     3121 ttaaaaatct aaaaacagga aagtatgcaa gaacgagggg tgcccacact aatgatgtaa
     3181 aacaattaac agaggcagtg caaaaaatag ccacagaaag catagtgata tggggaaaga
     3241 ctcctaaatt taaactaccc atacaaaagg aaacatggga aacatggtgg atagagtatt
     3301 ggcaagccac ctggattcct gagtgggaat ttgtcaatac ccctccttta gtaaaattat
     3361 ggtaccagtt agagaaggaa cccataatag gagcagaaac tttctatgta gatggggcag
     3421 ctaatagaga gactaaatta ggaaaagcag gatatgttac tgacagagga agacagaaag
     3481 ttgtcccttt cactgacacg acaaatcaga agactgagtt acaagcaatt aatctagctt
     3541 tacaggattc gggattagaa gtaaacatag taacagattc acaatatgca ctaggaatca
     3601 ttcaagcaca accagataag agtgaatcag agttagtcag tcaaataata gagcagctaa
     3661 taaaaaagga aaaggtttac ctggcatggg taccagcaca caaaggaatt ggaggaaatg
     3721 aacaagtaga taaattagtc agtcagggaa tcaggaaagt actatttttg gatggaatag
     3781 ataaggctca ggaagaacat gagaaatatc acaacaattg gagagcaatg gctagtgatt
     3841 ttaacctacc acctgtggta gcgaaagaaa tagtagctag ctgtgataaa tgtcagctaa
     3901 aaggagaagc catgcatgga caagtagact gtagtccagg aatatggcaa ttagattgta
     3961 cacatctgga aggaaaagtt atcctggtag cagttcatgt agccagtggc tatatagaag
     4021 cagaagttat tccagcagaa acggggcaag aaacagcata ctttctctta aaattagcag
     4081 gaagatggcc agtaaaagta gtacatacag ataatggcag caatttcacc agtgctacag
     4141 ttaaggccgc ctgttggtgg gcagggatca aacaggaatt tggaattccc tacaatcccc
     4201 aaagtcaagg agtagtagaa tctatgaata aagaattaaa gaaaattata ggacaggtaa
     4261 gagatcaagc tgaacatctt aagacagcag tacaaatggc agtatttatc cacaatttta
     4321 aaagaaaagg ggggattggg ggatacagtg caggggaaag aataatagac ataatagcaa
     4381 cagacataca aactagagaa ttacaaaaac aaatcataaa aattcaaaat tttcgggttt
     4441 attacaggga cagcagagat ccaatttgga aaggaccagc aaagcttctc tggaaaggtg
     4501 aaggggcagt agtaatacaa gacaatagtg acataaaggt agtaccaaga agaaaagtaa
     4561 agatcattag ggattatgga aaacagatgg caggtgatga ttgtgtggca agtagacagg
     4621 atgaggatta acacatggaa aagtttagta aaataccata tgtatgtttc aaagaaagct
     4681 aacagatggt tttatagaca tcactatgac agccaccacc caaaaataag ttcagaagta
     4741 cacatcccac taggagaagc tagactggta gtaacaacat attggggtct gcatacagga
     4801 gaaaaagaat ggcatctggg tcagggagtc tccatagaat ggaggaaaag gagatatagc
     4861 acacaagtag accctggcct ggcagaccaa ctaattcata tgtattattt tgattgtttt
     4921 gcagaatctg ctataagaaa agccatatta ggacatatag ttagtcctag ttgtgagtat
     4981 caagcaggac ataacaaggt aggatcctta cagtatttgg cactagcagc attaatagca
     5041 ccaaaaaaga taaagccacc tttgcctagt gttaggaagc taacagaaga tagatggaac
     5101 aagccccaga agaccaaggg ccgcagaggg agccatacaa tgaatggaca ttagagcttt
     5161 tagaggagct taagagtgaa gctgtcagac attttcctag gatatggctc catagcttag
     5221 gacaacatat ctatgaaact tatggggata cctgggcagg tgttgaagct ataataagaa
     5281 ttctgcaaca actactgttt attcatttca gaattgggtg tcaacatagc agaataagta
     5341 ttactcgaca gagaagagca agaaatggat ccagtagatc ctaatctaga gtcctggaac
     5401 catccaggaa gtcagcctag gactgcttgt aataagtgtc attgtaaaaa gtgttgctat
     5461 cattgccaag tttgcttcat aacgaaaggc ttaggcatct cctatggcag gaagaagcgg
     5521 agacagcgac gaaaacctcc tcaaggcgat caggctcatc aagttcctat accagagcag
     5581 taagtattcc atgtaatgca acctttagtg ataatagcaa tagcagcatt agtagtagca
     5641 ataataatag caatagttgt gtggaccata gtatacatag aatataggag gataaaaagg
     5701 caaagaaaaa tagactgttt aattgataga ataagagaaa gagcagaaga cagtggcaat
     5761 gagagcgagg gagaaagaga ggaattgtca aaacttgtgg aaatggggca tcatgctcct
     5821 tgggatgttg atgacctgta gtgctgcaga agatttgtgg gtcacagttt attatggggt
     5881 gcctatatgg aaggaagcaa ctaccactct attttgtgca tcagatgcta aagcatataa
     5941 aaaagaggca cataatatct gggccacaca tgcctgtgta cccacggacc ccaacccaca
     6001 agaaatagaa ttggaaaatg tgacagaaaa ctttaacatg tggaaaaata acatggtgga
     6061 acaaatgcat gaggatataa tcagtttatg ggatcaaagc ctaaaaccat gtgtaaaatt
     6121 aactccactc tgtgtcactt taaactgcac tgatgaattg aggaacagca agggcaatgg
     6181 gaaggtagaa gaggaggaaa aaaggaaaaa ctgctctttc aatgtaaggg ataagaggga
     6241 gcaagtatat gcactttttt ataaacttga tatagtgcca atagacaata ataataggac
     6301 caatagtact aattataggt taataaattg tgatacctca accattacac aggcttgtcc
     6361 aaagatatcc tttgaaccaa ttcccataca tttttgtgcc ccagctggtt ttgcaattct
     6421 aaagtgtaga gataagaagt tcaatgggac aggcccatgc tcaaatgtca gcacagtaca
     6481 atgtacacat ggaattagac cagtggtgtc aactcaactg ctgttgaatg gcagtctagc
     6541 agaagaagag atcataatta gatctgaaaa tctcacaaac aatgttaaaa ccataatagt
     6601 acagcttaat gcatctatag taattaattg tacaaggccc tacaaatata caaggcaaag
     6661 gacatcgata ggactaaggc aatcactcta tacaataaca ggaaaaaaga agaaaacagg
     6721 atacatagga caagcacatt gtaaaattag cagagcagaa tggaataaag ctttacaaca
     6781 ggtagctaca aaactaggaa accttcttaa caaaacaaca ataactttta agccatcctc
     6841 aggaggggac ccagaaatta catcacacat gcttaattgt ggaggggact tcttctactg
     6901 taatacatca agactgttta atagtacatg gaatcagact aatagtacag ggttcaataa
     6961 tggcacagtc acactcccat gcagaataaa acaaattgta aacttgtggc agagagtagg
     7021 aaaagcaatg tatgcccctc ccatcgaagg actaattaaa tgttcatcaa atattacagg
     7081 gctactatta acaagagatg gtggtgcaaa taatagttct catgagacca tcagacctgg
     7141 aggaggagat atgagggaca attggagaag tgaattatat aaatataaag tagtaaaaat
     7201 tgaaccaata ggagtagcac ccaccaaggc aaggagaaga gtggtggaaa gagaaaaaag
     7261 agcaatagga ctaggagctg tgttccttgg gttcttggga gcagcaggaa gcacgatggg
     7321 cgcagcgtca gtgacgctga cggtacaggc cagacagtta atgtctggta tagtgcacca
     7381 gcaaaacaat ttgctgaggg ctatagaggc gcaacaacat ctgttgcaac tcacggtctg
     7441 gggcattaaa cagctccagg caagagtcct ggctgtggaa agatacctaa gggatcaaca
     7501 gctcctagga atttggggtt gctctggaag gcacatttgc accactaatg taccctggaa
     7561 ctctagttgg agtaatagat ctctagatga gatttggcag aacatgacct ggatggagtg
     7621 ggaaagagaa attgacaatt acacaggctt aatatacagc ttaattgaag aatcgcagat
     7681 ccagcaagaa aagaatgaaa aagaattatt ggaattggac aaatgggcaa gtttgtggaa
     7741 ttggtttagc ataacaaaat ggctgtggta tataaaatta ttcataatga tagtaggagg
     7801 cctgataggt ttaagaatag tttttgctgt gctttctgta gtaaatagag ttaggcaggg
     7861 atactcacct ctgtcatttc agaccctcct cccagtcccg aggggacccg acaggcccga
     7921 agaaatagaa gaagaaggtg gagagcgagg cagagacaga tccattcgat tggtgaacgg
     7981 attatttgca cttttctggg acgacctgag gaacctgtgc ctcttcagct accaccgctt
     8041 gagagactca atcttaattg cagcgaggat tgtggaactt ctgggacgca gggggtggga
     8101 agccctcaaa tacctgtgga acctcctgca gtattggagt caggaactga ggaatagtgc
     8161 tagtagcttg cttgatacca tagcaatagc agtagctgag aggacagata gggttataga
     8221 agtagtacaa agagcttgca gagctattct taacgtaccc agaagaataa gacagggctt
     8281 ggaaaggctt ttgctataaa atgggtggca aatggtcaaa aagtagttta gttggatggc
     8341 ctgctataag ggaaagaata agaaaaactg atccagcagc agatggggta ggagcagtat
     8401 ctcgagacct ggaaaaacat ggggcaatca caagtagcaa tacagcaagt actaatgata
     8461 cctgtgcatg gctggaagca caagaggaga gcgaggaggt gggctttcca gttagacctc
     8521 aggtaccttt aagaccaatg acttacaaag aagctgttga tcttagccac tttttaaaag
     8581 aaaagggggg actggaaggg ctaatttggt ccaagaaaag acaagagatc cttgatcttt
     8641 gggtctacaa cacacaaggc atcttccccg attggcagaa ctacacacca gggccaggga
     8701 tcagataccc actgaccttt ggatggtgct tccagctagt accagttgat ccacaggagg
     8761 tagaagaggc cactgaaaga gaggacaact gcttgttaca ccctatgtgc cagcaaggaa
     8821 tggaggaccc ggagagacaa gtgttaatgt ggagatttaa cagcagacta gcacttgagc
     8881 acaaggcccg agagctgcat ccggagttct acaaagactg ctgacatcga gtttcctaca
     8941 agggactttc cgctggggac tttccaggga ggcgtggcct gggcgggact tgggagtggc
     9001 taaccctcag atgctgcata taagcagctg cttttcgcct gtactgggtc tctctggtta
     9061 gaccagattt gagcctggga gctctctggc taattaggga acccactgct taagcctcaa
     9121 taaagcttgc cttgagtgct tca
//
LOCUS       HIVOYI       9190 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral DNA, complete
            genome.
ACCESSION   M26727
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 1 (isolate OYI) proviral DNA.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9190)
  AUTHORS   Wain-Hobson,S. and Pasteur Institute,Paris.
  TITLE     ;
  JOURNAL   AIDS Res. Hum. Retroviruses (1990) In press
  STANDARD  full staff_review
COMMENT     Kindly provided in computer-readable form by Simon Wain-Hobson,
            Pasteur Institute, Paris.
            
            OYI, designated elsewhere as isolate 397, was taken from a healthy
            Gabonese presenting an atypical Western Blot. Tree analysis (see
            Part III) reveals that the sequence is closely related to the North
            American isolate SF2 and the European virus HAN(across the genome).
            This is the first report of a virus from Africa that clusters with
            North American rather than Zairean viruses: OYI and SF2 differ by
            approximately 7% in envelope. The single C -> S substitution at
            residue 22 of the OYI tat protein renders it inactive, but may not
            account for the avirulence of the virus. The pattern of env
            termination and nef start is not what is typically encountered.
            Sibling sequences for OYI(397) are available (see 1989:I-A-181).
FEATURES       from  to/span     description
    pept        336     1835     gag polyprotein
    pept    <  1628     4639     pol polyprotein
    pept       4584     5162     vif protein
    pept       5102     5392     vpR protein
    ORF        5599     5721     vpu ORF
    pept       5373     5587     tat protein, exon 2 (first expressed exon)
               7918     8008     tat protein, exon 3 (AA at 7919)
    pept       5512     5587     rev protein, exon 2 (first expressed exon)
               7918     8192     rev protein, exon 3 (AA at 7920)
    pept       5767     8334     env polyprotein
    pept       8336     8971     nef protein
    binding     182      199     primer (Lys-tRNA) binding site
    pre-msg       1     9190     genomic mRNA
    signal     9166     9171     poly-A signal
BASE COUNT     3304 a   1630 c   2202 g   2054 t
ORIGIN      
        1 ggtctctcta gctagaccag atctgagccc gggagctctc tggctaacta gggaacccac
       61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt
      121 gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca
      181 gtggcgcccg aacagggacc ttaaagtgaa agtggaacca gaggagctct ctcgacgcag
      241 gactcggctt gcttagcgcg cacggcaaga ggcgaggggc ggcgactggt gagtacgcca
      301 aaaatttttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa
      361 gcgggggaga attagataaa tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat
      421 atcaactaaa acatatagta tgggcaagca gggagctaga acgattcgca attaatcctg
      481 gcctgttaga aacatcagaa ggttgtagac aaatactggg acagctacaa ccatccctta
      541 agacaggatc agaagaaatt agatcattat ataatacagt agcaactctt tattgtgtgc
      601 atcaaaagat agaggtaaaa gacaccaagg aagctttaga taagatagag gaagagcaaa
      661 acaaaagtaa gaaaaaagca cagcaaacag cagctgacac aggaaacagc agccaggtca
      721 gccaaaatta ccctatagta cagaaccttc aggggcaaat ggtacatcag cccatatcac
      781 ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtaa
      841 tacccatgtt ttcagcatta gcagaaggag ccaccccaca agatctaaac accatgctaa
      901 acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag
      961 ctgcagaatg ggatagattg catccagtac atgcagggcc tattgcacca ggccagatga
     1021 gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat
     1081 ggatgacaaa taatccacct atcccagtag gagaaatcta taaaagatgg ataatcctgg
     1141 gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac
     1201 caaaggaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag
     1261 cttcacagga tgtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcaaacccag
     1321 attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag
     1381 catgtcaggg agtgggggga cccggccata aagcaagagt tttggctgaa gcaatgagcc
     1441 aagtaaattc agtcaccgta atgatgcaga aaggcaattt taagaaccaa agaaagactg
     1501 ttaagtgttt caattgtggc aaagaagggc acatagccaa aaattgcagg gctcctagga
     1561 aaaagggctg ttggaaatgt ggaagggaag gacaccaaat gaaagattgt actgagagac
     1621 aggctaattt tttagggaag atctggcctt cccacaaggg aaggccaggg aattttcttc
     1681 agaacagacc agagccaaca gccccaccag cagagagctt cgggtttggg gaagagacaa
     1741 caactccccc tcagaagcag gagccgatag acaagggact gtatccttta acctccctca
     1801 gatcactctt tggcaacgac ccatcgtcac aataaagata ggggggcaac taaaagaagc
     1861 tctattagat acaggagcag atgatacagt attagaagaa atgaatttgc caggaagatg
     1921 gaaaccaaaa atgatagggg gaattggagg ttttatcaaa gtaagacagt atgatcagat
     1981 actcatagaa atctgtggac ataaagctat aggtacagta ttagtaggac ctacacctgt
     2041 caacataatt ggaagaaatc tgttgactca gcttggttgt actttaaatt ttcccattag
     2101 tcctattgaa actgtaccag taaaattaaa gccaggaatg gatggcccaa aagttaaaca
     2161 atggccattg acagaagaga aaataaaagt attaatagaa atttgtacag aaatggaaaa
     2221 ggaagggaaa atttcaaaag ttgggcctga aaatccatac aatactccag tatttgccat
     2281 aaagaaaaaa gacagtacta aatggagaaa attagtagat ttcagagaac ttaataagag
     2341 aactcaggac ttctgggaag tccaattagg aataccacat ccagcagggt taaaaaagaa
     2401 aaaatcagta acagtactgg atgtgggtga tgcatacttt tcagttccct tagataaaga
     2461 cttcagaaag tatactgcat ttaccatacc tagtataaac aatgagacac cagggattag
     2521 atatcagtac aatgtgcttc cacagggatg gaaaggatca ccagcgatat tccaaagtag
     2581 tatgacaaaa atcttagagc cttttagaaa acaaaatcca gacatagtta tctatcaata
     2641 catggatgat ttgtatgtag gatctgactt agaaataggg cagcatagaa caaaaataga
     2701 ggaactgaga caacatctgt tgaggtgggg atttaccaca ccagacaaaa aacatcagaa
     2761 agaacctcca ttcctttgga tgggttatga actccatcct gataaatgga cagtacagcc
     2821 tataatgctg ccagaaaaag acagctggac tgtcaatgac atacagaagt tagtgggaaa
     2881 attaaattgg gcaagtcaga tttatgcagg gattaaagta aagaacttat gtaaactcct
     2941 taggggaacc aaagcactaa cagaagtaat accactaaca gaagaagcag agctagaact
     3001 ggcagaaaac agggagattc taaaagaacc agtacatgga gtgtattatg acccatcaaa
     3061 agacttagta gcagaattac agaaacaggg acaaggccaa tggacatatc aaatttatca
     3121 agagccattt aaaaatctga aaacaggaaa gtatgcaaga atgaggggtg cccacactaa
     3181 tgatgtaaaa cagttaacag aggcagtgca aaaaataacc caagaaagca tagtaatatg
     3241 gggaaagact cctaaattta aactacccat acaaaaagaa acatgggaag catggtggac
     3301 ggagtattgg caagccacct ggattcctga atgggagttt gtcaataccc ctcccttagt
     3361 aaaattatgg taccagttag agaaagaccc catagtagga gcagaaactt tctatgtaga
     3421 tggggcagct aatagggaga ctaaattagg aaaagcagga tatgttactg acagaggaag
     3481 acagaaagtt gtctccctaa ctgacacaac aaatcagaag actgaattac aagcaattca
     3541 tctagctctg caggattcgg gattagaagt aaacatagta acagactcac aatatgcatt
     3601 aggaatcatt caagcacaac cagataaaag tgaatcagag ttagtcagtc aaataataga
     3661 gcagttaata aaaaaggaaa aggtctacct ggcatgggta ccagcacaca aaggaattgg
     3721 aggaaatgaa caagtagata aattagtcag tgctggaatc aggaaagtac tatttttgga
     3781 tggaatagat aaggcccaag aggaacatga gaaatatcac agtaactgga gagcaatggc
     3841 tagtgatttt aacctgccac ctgtagtagc aaaagaaata gtagccagct gtgataaatg
     3901 ccagctaaaa ggagaagcca tgcatggaca agtagactgt agtccaggaa tatggcaact
     3961 agattgtaca catttagaag gaaaaattat cctggtagca gttcatgtag ccagtggata
     4021 tatagaagca gaagttattc cagcagagac agggcaggaa acagcatact ttatcttaaa
     4081 attagcagga agatggccag taaaaacaat acatacagac aatggcagca atttcaccag
     4141 tactacggtt aaggccgcct gttggtgggc agggatcaag caggaatttg gcattcccta
     4201 caatccccaa agtcaaggag tagtagaatc tatgaataat gaattaaaga aaattatagg
     4261 acaagtaaga gatcaggctg aacatcttaa gacagcagta caaatggcag tatttatcca
     4321 caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa tagtagatat
     4381 aatagctaca gacatacaaa ctaaagaact acaaaaacaa attacaaaaa ttcaaaattt
     4441 tcgggtttat tacagggaca gcagagaacc actttggaaa ggaccagcaa agcttctttg
     4501 gaaaggtgaa ggggcagtag taatacaaga taatagtgac ataaaagtag tgccaagaag
     4561 gaaagcaaag atcattaggg attatggaaa acagatggca ggtgatgatt gtgtggcaag
     4621 tagacaggat gaggattaga acatggaaaa gtttagtaaa acaccatatg tatgtttcaa
     4681 agaaagctaa gggatggttt tatagacatc actatgaaag cactcatcca agaataagtt
     4741 cagaagtaca catcccacta ggggatgcta ccttggtagt aacaacatat tggggtctgc
     4801 atacaggaga aagagaatgg catttgggcc agggagcctc tatagaatgg aggaaaaaga
     4861 gatatagcac acaagtagac cctggcctag cagaccaact aattcataca tattattttg
     4921 attgtttttc agaatctgct ataagaaatg ccatactagg aaatatagtt agtcctaggt
     4981 gtgaatatcc agcaggacat aacaaggtag gatctctaca atacttggca ctagcagcat
     5041 tgataaaacc aaaaaagata aagccacctt tgcctagtgt tacgaaacta acagaggata
     5101 gatggaacaa gccccagaag accaagggcc acagagggag ccatacaatg aatggacact
     5161 agagctttta gaggagctta agagtgaagc tgttagacat tttcctagga tatggctcca
     5221 tagcttagaa caatatatct atgaaactta tggggatact tgggaaggag tggaagccat
     5281 aataagaatt ctgcaacaac tgctgtttat tcatttcaga attgggtgtc aacatagcag
     5341 aataggcatt actcgacaga gaagagcaag aaatggagcc agtagatcct agactagagc
     5401 cctggaagca tccaggaagt cagcctaaga ctgctagtaa caattgctat tgtaaaaggt
     5461 gttgccttca ctgtcaagtt tgtttcacaa aaaaaggctt aggcatttcc tatggcagga
     5521 agaagcggag acagcgacga agagctcctc aagacagtaa gactcatcaa gtttctctat
     5581 caaagcagta agtaatacat gtaatatact ctttacaaat attagcaata gtagcattag
     5641 tagtagtaac aataatagca atagttgtgt ggaccatagt actcttagaa tataggaaaa
     5701 tattaagaca aagaaaaata gacaggttaa ttgatagaat aagagaaaga gcagaagaca
     5761 gtggcaatga cagcgagggg gaccaggaag aattatcagc gcttgtggag atggggcacc
     5821 atgctccttg ggatgttgat gatatgtagt gctgcagaaa atttgtgggt cacagtctat
     5881 tatggggtac ctgtgtggaa agaagcaacc accactctat tctgtgcatc agatgctaga
     5941 gcatatgcta cagaggtaca taatgtttgg gccacacatg cctgtgtacc cacagacccc
     6001 aacccacaag aagtagtatt gggaaatgtg acagaaaatt ttgacatgtg gaaaaataat
     6061 atggtagaac aaatgcaaga agatataatc agtttatggg atcaaagcct aaagccatgt
     6121 gtaaaattaa ccccactctg tgttacttta gattgcactg atgttaatac cactagtagt
     6181 agtttgagga atgctactaa taccacaagt agtagttggg aaacgatgga gaaaggagaa
     6241 ttaaaaaact gctctttcaa taccaccaca agcataagag ataagatgca ggaacaatat
     6301 gcactttttt ataaacttga tgtattacca atagataaga atgatactaa atttaggtta
     6361 atacattgta acacctcaac cattacacag gcctgtccaa agatatcctt tgagccaatt
     6421 cccatgcatt attgtactcc ggctggtttt gcgattctaa agtgtaatga taagaagttc
     6481 aatggaacag gaccatgtac aaatgtcagc acagtacaat gtacacatgg aattaagcca
     6541 gtagtgtcaa ctcaactgct gttaaatggc agcctagcag aagaagaggt aataattaga
     6601 tctagcaatt tcacaaacaa tgctaaaatc ataatagtac agctgaataa atctgtagaa
     6661 attaattgta caagacccaa caacaataca agaaacagga tatcaatagg accagggaga
     6721 gcatttcata caacaaaaca aataatagga gatataagac aagcacattg taaccttagt
     6781 agagcaacat gggagaaaac tttagaacag atagctacaa aattaagaaa acaatttagg
     6841 aataaaacaa tagcctttga tcgatcctca ggaggggatc cagaaattgt aatgcacagt
     6901 tttaattgtg gaggggaatt tttctactgt aatacatcac aactgtttaa tagtacttgg
     6961 aatgatacta caagggcaaa tagcactgaa gtaactatca cactcccatg tagaataaaa
     7021 caaattgtaa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagtgga
     7081 caaattagat gttcatcaaa gattacaggg ctgctattaa caagagatgg tggtaagaac
     7141 accacgaacg ggatcgaaat cttcagacct gcaggaggag acatgaggga caattggaga
     7201 agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag
     7261 gcaaggagaa gagtggtgca gagagaaaaa agagcagtgg gaatgctagg agctatgttc
     7321 cttgggttct tgggagcagc aggaagcact atgggcgcac ggtcgatgac gctgacggta
     7381 caggccagac aattattgtc tggtatagtg cagcagcaga acaatctgct gagggctatt
     7441 gaggcgcaac agcacctgtt gcaactcaca gtctggggca tcaagcagct ccaggcaaga
     7501 gtcctggctg tggaaagata cctaaaggat caacagctcc tagggatttg gggttgctct
     7561 ggaaagctca tttgcaccac tactgtgcct tggaatgcta gttggagtaa taaatctctg
     7621 aatgagattt gggataacat gacctggatg cagtgggaaa gagaaattga caattacaca
     7681 cacttaatat acaccttaat tgaagaatcg cagaaccaac aggaaaagaa tgaacaagaa
     7741 ttattggaat tggataagtg ggcaggtttg tggagttggt ttagcataac aaactggctg
     7801 tggtatataa gaatattcat aataatagta ggaggcttgg taggtttaag aatagttttt
     7861 gctgtacttt ctatagtgaa tagagttagg cagggatact caccattatc gtttcagacc
     7921 cgcctcccaa cccagagggg acccgacagg cccgaaggaa tcgaagaaga aggtggagag
     7981 agagacagag acagatccgg aagattagtg gatggattct tagcacttat ctgggacgat
     8041 ctgcggagcc tgtgcctctt cagctaccac cgcttgagag acttaatctt gattgtagcg
     8101 aggattgtgg aacttctggg acgcaggggg tgggaagtcc tgaaatattg gtggaatctc
     8161 ctgcagtatt ggagccagga actaaagaat agtgtcatta gcttgctcaa cgccacagct
     8221 atagcagtag ctgaggggac agatagggtt atagaaatag tacaaagagc ttatagagct
     8281 tttctcaata tacctagaag aataagacag ggcttggaaa gggctttgct ataagatggg
     8341 tggcaagtgg tcaaaatgta gtatgaaggg atggcctact ataagggaaa gaatgaagcg
     8401 agctgagcta cagccacctg agccagcagc agaaggggtg ggagcagcat ctcgagacct
     8461 ggaaaaacat ggagcaatca ctagtagcaa tacagcagct actaatgctg attgtgcctg
     8521 gctagaagca caagaggatg aagaggtggg ttttccagtc agacctcagg tacctttaag
     8581 accaatgact tacaagggag ctttagatct tagccacttt ttaaaagaaa aggggggact
     8641 ggaagggcta atttactccc agaaaagaca agatattctt gatctgtggg tctatcacac
     8701 acaaggctat ttccctgatt ggcagaacta cacaccaggg ccagggatca gatatccact
     8761 gtgctttgga tggtgcttca agttagtacc aatggatcca gaccaggtag aagaggccaa
     8821 cgaaggagag aacaacagct tgttacatcc tataagcctg catgggatgg atgacccaga
     8881 gaaagaagtg ttagtgtgga agtttgacag ccgcctagca tttcgtcaca tggcccgaga
     8941 ggtgcatccg gagtactaca aagactgctg acatcgagct ttctacaggg gactttccgc
     9001 tggggacttt ccagggaggc gtggcctggg cgggaccggg gagtggcgag ccctcagatg
     9061 ctgcatataa gcagctgctt tttgcctgta ctgggtctct ctagctagac cagatctgag
     9121 cccgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa agcttgcctt
     9181 gagtgcttca
//
LOCUS       HIVPHIL1      259 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1), envelope region.
ACCESSION   M27911
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 1 (isolate 799), clone B7.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 259)
  AUTHORS   Ou,C.-Y. and colleagues CDC Atlanta GA,
  JOURNAL   Unpublished (1989)
  STANDARD  full staff_review
COMMENT     [1] kindly submitted in computer readable form by C.-Y. Ou, CDC,
            Atlanta. The sequence below, designated 799-B7, appears to be a
            major form of the sibling sequences 799, taken from a hemophilia
            patient in Philadelphia.
FEATURES       from  to/span     description
    pept    <     1  >   259     env region (AA at 2)
BASE COUNT       99 a     45 c     56 g     59 t
ORIGIN      Approximately 1172 nt from env start (BRU coordinates).
        1 ttcaacacaa ctgtttaata gtacttggaa tgttactgaa gggttaaata acactgcaaa
       61 taacagtgaa ggaactatca cactcccatg cagaataaaa caaattataa acatgtggca
      121 ggaagtagga aaagcaatgt atgcccctcc catcagagga caaattagat gttcatcaaa
      181 tattacaggg ctgctattaa caagagatgg tggtaatgaa actaacggga ccgagatctt
      241 cagacctgga ggaggagat
//
LOCUS       HIVSWB881      59 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1), gag region.
ACCESSION   M27952
KEYWORDS    .
SEGMENT     1 of 3
SOURCE      Human immunodeficiency virus type 1 (isolate SWB88), clone 01.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; 
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 59)
  AUTHORS   Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
            Wain-Hobson,S.
  TITLE     HIV-1 isolates are rapidly evolving quasispecies: Evidence for
            viral mixtures and preferred nucleotide substitutions
  JOURNAL   j acquired immuno defic syndr 2, 344-52 (1989)
  STANDARD  full staff_review
COMMENT     [1] kindly submitted in computer readable form by S. Wain-Hobson.
            B88 is a so-called "slow virus" isolated in Sweden. Sibling
            sequences available (see 1989: I-A-174).
FEATURES       from  to/span     description
    pept    <     1  >    59     gag region (AA at 2)
BASE COUNT       26 a      8 c     10 g     15 t
ORIGIN      
        1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacca
//
LOCUS       HIVSWB882     314 bp ss-DNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1), tat first expressed
            exon of isolate L1.01.
ACCESSION   M31462
KEYWORDS    .
SEGMENT     2 of 3
SOURCE      Human immunodeficiency virus type 1 (isolate B88).
REFERENCE   1  (bases 1 to 314)
  AUTHORS   Meyerhans,A., Cheynier,R., Albert,J., Seth,M., Kwok,S., Sninsky,J.,
            Morfeldt-Manson,L., Asjo,B. and Wain-Hobson,S.
  TITLE     Temporal fluctuations in HIV quasispecies in vivo are not reflected
            by sequential HIV isolations
  JOURNAL   Cell 58, 901-910 (1989)
  STANDARD  full staff_review
COMMENT     Kindly submitted in computer-readable form by S. Wain-Hobson,
            Pasteur Institute, Paris.
            
            The B88 isolate is considered to be a "slow-low" viral form.
            Sibling sequences from pbl's and viruses propagated on pbl's are
            available. This sequence corresponds to the earliest sample major
            form, L1.01.
FEATURES       from  to/span     description
    pept         30  /   244     tat exon 2 (first expressed exon)
BASE COUNT      111 a     57 c     70 g     76 t
ORIGIN      
        1 cattattcaa cagaggagag caagaagaaa tggagccagt agatcctaga ctagagccct
       61 ggaagcatcc aggaagtcag cctaagactg cttgtaacaa ttgctattgt aaaaagtgtt
      121 gctttcattg ccaagtttgt ttcacaaaaa aaggcttagg catctcctat ggcaggaaga
      181 agcggagaca gcgacgaaga gctcctcagg acagtgagac tcatcaagtt tctctatcaa
      241 agcagtaagt agtacatgta atgcaatctt tagaaatatt agcaatagta gcattagtag
      301 tagcaataat acta
//
LOCUS       HIVSWB883     250 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1), envelope region.
ACCESSION   M27947
KEYWORDS    .
SEGMENT     3 of 3
SOURCE      Human immunodeficiency virus type 1 (isolate SWB88), clone 01.
REFERENCE   1  (bases 1 to 250)
  AUTHORS   Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
            Wain-Hobson,S.
  TITLE     HIV1 isolates are rapidly evolving quasispecies: Evidence for viral
            mixtures and preferred nucleotide substitutions
  JOURNAL   J. Acq. Immun. Defic. Syndrome. 2, 344-8 (1989)
  STANDARD  full staff_review
COMMENT     [1] kindly submitted in computer readable form by S. Wain-Hobson.
            This stretch comes from the first hypervariable region of env.
            Sibling sequences are available (see 1989: I-A-175).
FEATURES       from  to/span     description
    ORF     <     3      245     env ORF (AA at 3)
BASE COUNT       96 a     40 c     42 g     72 t
ORIGIN      
        1 aaattaaccc cactctgtgt tactttaaat tgcactaagt tgaagaatgt tacttatgcc
       61 aataatagta gtagggaaaa agaagaaatg aaaaactgct ctttcaccac aagaataggg
      121 aataaggtgc agaaagaata tgcacttttt tataaacttg atgtagtacc aatagataag
      181 gataatacaa gctttatatt gatacattgt aatacctcga ccattacaca ggcttgtcca
      241 aaggtatcct
//
LOCUS       ALFLBA        578 bp ss-mRNA            PLN       11-AUG-1990
DEFINITION  Alfalfa leghemoglobin mRNA, complete cds.
ACCESSION   M36100
KEYWORDS    leghemoglobin.
SOURCE      Alfalfa 15 day old nodule, cDNA to mRNA, clone pNL549.
  ORGANISM  Medicago sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 578)
  AUTHORS   Barker,D.G., Gallusci,P., Lullien,V., Khan,H., Gherardi,M. and
            Huguet,T.
  TITLE     Identification of two groups of leghemoglobin genes in alfalfa
            (Medicago sativa) and a study of their expression during root
            nodule development
  JOURNAL   Plant Mol. Biol. 11, 761-772 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          7      450     leghemoglobin
BASE COUNT      189 a     91 c    126 g    172 t
ORIGIN      
        1 agaaatatga gtttcaccga taagcaggag gctttagtca atagctcatg ggaggcattc
       61 aaacaaaacc ttcctcgtta tagtgttttt ttctacactg tcatattaga gaaagcacct
      121 gcagcaaaag gtttgttctc ttttcttaag aactcagctg aagtacaaga tagtcctcaa
      181 ctccaagctc acgctgaaaa ggtttttgga ctggtgcgtg actcagctgt tcaacttcga
      241 gcaacagggg gggtagtttt gggagatgct acgttgggtg ctatccacgt tcgaaaggga
      301 gttgttgatc ctcattttgt ggtggttaaa gaggctttgc tcaaaacaat aaaagaagca
      361 gctggagaca aatggagtga agagcttaac actgcttggg aagtagctta tgatgcactg
      421 gcaactgcaa ttaaaaaagc aatgagttaa acctgtgatg atctattatc ataaaaagat
      481 taataaataa aatatgtatt actaaaactt gttaaacaag ttcctataat aaattatgat
      541 aaaaatattt atgaaatatt gttagtgttt atgccggt
//
LOCUS       ALFLBB        521 bp ss-mRNA            PLN       11-AUG-1990
DEFINITION  Alfalfa leghemoglobin mRNA, complete cds.
ACCESSION   M36101
KEYWORDS    leghemoglobin.
SOURCE      Alfalfa 15 day old nodule, cDNA to mRNA, clone pNL154.
  ORGANISM  Medicago sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 521)
  AUTHORS   Barker,D.G., Gallusci,P., Lullien,V., Khan,H., Gherardi,M. and
            Huguet,T.
  TITLE     Identification of two groups of leghemoglobin genes in alfalfa
            (Medicago sativa) and a study of their expression during root
            nodule development
  JOURNAL   Plant Mol. Biol. 11, 761-772 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    >     1      396     leghemoglobin
BASE COUNT      175 a     77 c    112 g    157 t
ORIGIN      
        1 aaatcattta aacaaaaccc tggtaatagt gttttgttct acactataat attggagaaa
       61 gcccctgcag caaagggcat gttctctttt cttaaggact cagctggagt acaggatagc
      121 cctaaactcc aaagccatgc tgaaaaagtt tttggaatgg tgcgcgattc agctgctcaa
      181 ctccgagcaa caggaggagt agttttggga gatgctacat tgggtgcaat ccacattcag
      241 aaaggagttg ttgatcctca ttttgcggtg gttaaagaag ctctgctgaa aacaataaag
      301 gaagtatcag gagataaatg gagcgaagaa ttgaacactg cttgggaagt agcctatgat
      361 gcattggcaa ctgcaattaa gaaggcaatg gtttaaattt gggatgatct atagttatta
      421 ctcaatttaa taaataaaaa atgtattact aaaacttgtt taacaagttt ctataataaa
      481 tgttgttgaa atgactatta aatattattt gagtttatgt t
//
LOCUS       CLTPHYC      1077 bp ds-DNA             BCT       11-AUG-1990
DEFINITION  Calothrix sp. phycocyanin alpha (PC2-alpha) and phycocyanin beta
            (PC2-beta) genes, complete cds.
ACCESSION   M36276
KEYWORDS    phycocyanin.
SOURCE      Calothrix sp. (strain PCC 7601 (Fremyella diplosiphon UTEX 481))
            DNA.
  ORGANISM  Calothrix sp.
            Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; 
            Cyanobacteria; Nostocales; Rivulariaceae.
REFERENCE   1  (bases 1 to 1077)
  AUTHORS   Capuano,V., Mazel,D., Tandeau de Marsac,N. and Houmard,J.
  TITLE     Complete nucleotide sequence of the red-light specific set of
            phycocyanin genes from the cyanobacterium Calothrix PCC 7601
  JOURNAL   Nucleic Acids Res. 16, 1626-1626 (1988)
  STANDARD  full staff_review
FEATURES       from  to/span     description
    pept          1      519     phycocyanin beta (PC2-beta)
    pept        589     1077     phycocyanin alpha (PC2-alpha)
BASE COUNT      276 a    259 c    242 g    300 t
ORIGIN      
        1 atgcttgacg cttttactaa agtagtttcc caggctgata ctagaggcgc atacattagc
       61 gatgcagaaa tcgatgcatt gaagacaatg gttgctgctg gttctaagcg catggatgtt
      121 gttaaccgca ttaccggcaa tgcctccaca atcgttgcta atgcagctcg cgctttattt
      181 gaagaacaac cccagttgat tgcacctggt ggtaatgctt acaccaaccg tcgtatggct
      241 gcttgcttac gcgacatgga aattatctta cgctatgtaa cctacgcagt atttgcaggt
      301 gatgctagcg ttttagacga tcgctgcttg aacggtcttc gtgaaacata tcaagcatta
      361 ggcgttcctg gcgcttccgt atctactggc gttcaaaaga tgaaagaagc tgcgatcgct
      421 attgctaacg accccagcgg tgttacccgt ggtgactgta gctccttaat gtctgaattg
      481 ggtagctact tcgatcgcgc tgctgctgca gttggctagt aaatcgctag ttaattattt
      541 ctactttctg acgcataact ttagatattt gttgagagaa taatcagcat gaaaacacct
      601 ttaaccgaag ccgtagctac cgcagattcc caaggtcggt ttttgagctc cacagaactt
      661 caagttgctt ttggtcgttt ccgtcaagca tctgctagct tggatgcagc taaagcatta
      721 agcagcaaag ctaacagctt ggctcaaggt gccgttaacg cagtttacca aaagttcccc
      781 tacaccaccc aaatgcaagg taagaacttc gcttctgacc aacgcggtaa agacaagtgt
      841 gcacgcgaca tcggttacta cattcgcatc gttacctact gtctagttgc tggtggtaca
      901 ggccctctag atgactacct aattggtggt ttggcagaaa tcaaccgcac ctttgatctg
      961 tctcccagct ggtacgttga agctctcaaa tacatcaaag ctaatcatgg tttgagtggc
     1021 gatcctgctg ttgaagcaaa ctcctacatt gactacgcaa tcaacgccct aagctag
//
LOCUS       PINACT        745 bp ds-DNA             PLN       11-AUG-1990
DEFINITION  P.contorta (lodgepole pine) actin gene, last two exons.
ACCESSION   M36171
KEYWORDS    actin.
SOURCE      P.contorta (var. latifolia) seed DNA.
  ORGANISM  Pinus contorta
            Eukaryota; Plantae; Embryobionta; Pinophyta; Pinicae; Pinatae; 
            Pinaceae.
REFERENCE   1  (bases 1 to 745)
  AUTHORS   Kenny,J.R., Dancik,B.P., Florence,L.Z. and Nargang,F.E.
  TITLE     Nucleotide sequence of the carboxyl-terminal portion of a lodgepole
            pine actin gene
  JOURNAL   Can. J. For. Res. 18, 1595-1602 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      420     actin, exon X (AA at 1)
                555      620     actin, exon X + 1
    IVS         421      554     actin intron A
BASE COUNT      196 a    124 c    172 g    253 t
ORIGIN      
        1 aagcttgctt atgtagccct tgactatgag caagaacttg aaacatcgaa gagtagttct
       61 tcacaagaaa agaattatga gcttcctgat ggacaggtca ttaccgttgg tgctgagcgt
      121 ttcagatgtg ctgaggtttt gttccagcca tctctcattg gaatggaagc agcgggtatc
      181 catgagacta catacaattc cattatgaaa tgtgatgtgg atatcagaaa agatctctat
      241 ggaaacattg ttcttagtgg tggttctacc atgtttcctg gtatagctga tcgtatgagc
      301 aaagagatca ctgcacttgc tcccagtagc atgaaaatca aagttgtagc accccctgag
      361 aggaagtaca gtgtttggat cggaggttct atcttggcat ctcttagcac atttcagcag
      421 gttattctca actttctaca tgccggaact tgaggttctt gtttggtttg gtggtattta
      481 atgtgctttg caatatataa atattatttc tttgattgta tatttaacca tgccttgatt
      541 gtttttgtat gcagatgtgg attgccaaga ccgagtatga ggagtcaggt ccctcaattg
      601 tccacagaaa gtgtttttaa acagagtgct ctattgatta tgctttatga ttggccttta
      661 ggtatattgt gagggtggaa gacacgatta ccgagatttt gatttagtat catgctttag
      721 gatttgcatc tgtatagttg aattc
//
LOCUS       RATMHCI      1355 bp ss-mRNA            ROD       11-AUG-1990
DEFINITION  Rat MHC class I RT1.C/E mRNA, 3' end.
ACCESSION   X16979
KEYWORDS    cell surface antigen; cell surface glycoprotein; class I gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Rat liver, cDNA to mRNA, clone RT.BM1.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1355)
  AUTHORS   Parker,K.E., Carter,C.A. and Fabre,J.W.
  TITLE     A rat class I cDNA clone with an Alu-like sequence and mapping to
            two genes in RT1.C/E
  JOURNAL   Immunogenetics 31, 211-214 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      857     MHC RT1.C/E (AA at 3)
    mRNA    <     1     1355     MHC RT1.C/E mRNA
    rpt        1210     1349     type 2 Alu-like repeat
BASE COUNT      322 a    334 c    387 g    312 t
ORIGIN      
        1 attgggagcg ggagactcgg aaagccaggg acaccgggag gaacttcaaa gtgaatctga
       61 ggaccctgct ccgctactac aatcagagtg acgacgaatc tcacacgctg cagtggatgt
      121 acggctgcga cgtggggccc gatgggcacc tgctccgcgg ttattgtcag gaggcttatg
      181 acggccggga ttatatctcc ttgaacgagg acctgcgctc ctggaccgcg actgacatgg
      241 cctcacaggc ctctaagatc aagtcagagg aagttggtga ggcccatcac cagagggcct
      301 acctgcaagg tccttgcgtg gagtggctcc acacatacct acatctggga aaggagacgc
      361 tgctgcgctc agacccccca aaggcacatg tgacccttca ccccagacct gaaggtgatg
      421 tcaccctgag gtgctgggcc ctgggcttct accctgctga catcaccttg acctggcaat
      481 tgaatgggga ggacctgacc caggacatgg agcttgtgga gaccaggcct gcaggggatg
      541 gaaccttcca gaagtgggca gctgtggtag tgccttctgg ggaggagctg aagtacacat
      601 gccatgtgga gcatgaggga ctgcctgaac ctctcgccct gagatgggag ccttctccat
      661 tcaccgactc cagcatgcca gtgattgttg tccttggagc tgtggccatc attggagctg
      721 tggccatcat tggagctgtg gccatcattg gagctgttgt gaggaggagg aagagaaaca
      781 caggtgaaaa aggcagctat gctcatgttc taggcagcaa ggctttccag atctctgact
      841 ggcctcagaa ggcatgaaaa gacagttgcc tcgtggggct gagcgacaag ttctgactaa
      901 acatctgagg gagctgcttc ccctgaggac aatgtgaaga actggcaatc ccagaccttc
      961 ttgcctccct ggacacctga cttcacaaca gaaaataatc tgactctggg ctgggtgttt
     1021 cctattgctt ttatctgcac tgagtggaaa tgcggtttcc aacagtgtaa cgcagtaccg
     1081 ttgtgtggtg gtagtgacat tgcacacagg tttgtgagct gggggtgtga tgtgcttttg
     1141 ttactgtagg cagaacaagc tagaagtcca ccattcctct tatggtcatt gttttaaaac
     1201 aaaacaaatg ctggagagat ggctcagtgt taagagcact gacttctctt acagaggtcc
     1261 tggttcaaat cccagcaacc acatggtggc tcacaaccat atgtaatgag atctgatgct
     1321 ctcttctgat gtgtctgaac acagctatat atgat
//
LOCUS       SCOMTCOIII   1120 bp ds-DNA             ORG       11-AUG-1990
DEFINITION  S.commune mitochondrial cytochrome oxidase III (COIII) gene,
            complete cds.
ACCESSION   M36270
KEYWORDS    cytochrome oxidase III.
SOURCE      S.commune mitochondrial DNA.
  ORGANISM  Mitochondrion Schizophyllum commune
            Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes; 
            Agaricales; Schizophyllaceae; Schizophyllum commune.
REFERENCE   1  (bases 1 to 1120)
  AUTHORS   Phelps,L.G., Burke,J.M., Ullrich,R.C. and Novotny,C.P.
  TITLE     Nucleotide base sequence of the mitochondrial COIII gene of
            Schizophyllum commune
  JOURNAL   Curr. Genet. 14, 401-403 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        147      953     cytochrome oxidase III (COIII)
BASE COUNT      387 a    155 c    158 g    420 t
ORIGIN      
        1 ggatcttaca aagaatatta tctttcaact ttcttttaaa taataaaatt acttgttaat
       61 taattcaaat tacaagtgta aagccatagt ataaaacaaa atttataaat aaataaataa
      121 aaaataacaa ataataaata acaaaaatga caagtattaa attctatcaa tcatttagtg
      181 cacacttagt acaacattca ccttggccaa ttttagttag tttcagtctt tttaatttag
      241 ctattggtac agtattaact atgcatggat acagtcacag ttcaacaact tttgatttag
      301 gtttagcagt aactgttgga tcaattttat tatggactag agatattgta attgaaggat
      361 ctttcttagg agatcacaca aaacaagtac aagaaggatt aatcatagga tttatcttat
      421 ttattataag tgaagtattc gcttttattt cagtattctg ggcttatttc catagtgctt
      481 tatcaccagc tgtagaatta ggatcaacat ggccaccagt aggaataatt ccattagata
      541 ccttttcatt accattattt aatacaataa ttttattatc tagtggagct tttgttacct
      601 atggacacca tgctattttc tcaggaaaaa gattagattc aatcatagga ttatttttaa
      661 cagtagcctt agcattaatc tttagttact ttcaagcatt tgaatatatt catgctggat
      721 tttccatgtc cgatagtgta tttggaacag tattttttgc aagtacagga ctacatggga
      781 tacatgttat gcttgggact ctttttcttt tcgtatcttt tcttagacaa gttaactatc
      841 aaactacaaa agaacacaat attggacttg aaacaagtat tctatattgg cactttgttg
      901 atcttgtgtg gttattctta tttcttgttg tttacttttg gggtggtgca taattatttc
      961 ttttctttta ttgaaagtaa taactattca ttagaactta taaactatgt gcaatttcca
     1021 attttattcg gatttatcaa tacaggggga aaaacttctc ccttagaaaa tacaagaaat
     1081 tgaagacaaa gaattaaatg taataaataa aataattaca
//
LOCUS       STYHSQ        135 bp ds-DNA             SYN       11-AUG-1990
DEFINITION  S.potsdam and S.typhimurium hsdS recombinant DNA.
ACCESSION   M15940
KEYWORDS    DNA binding protein; type I restriction and modification system.
SOURCE      S.potsdam and S.typhimurium DNA, clone lambda-hsd SQ.
  ORGANISM  Artificial gene
            Artificial sequences; Genes.
REFERENCE   1  (bases 1 to 135)
  AUTHORS   Fuller-Pace,F.V. and Murray,N.E.
  TITLE     Two DNA recognition domains of the specificity polypeptides of a
            family of type I restriction enzymes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83, 9368-9372 (1986)
  STANDARD  full staff_review
COMMENT     The hsdS genes of the SB (S.typhimurium) and SP (S.potsdam) have a
            conserved sequence of around 100 bp flanked by two nonhomologous
            variable regions of around 500 bp.  Recombination between the hsdS
            genes of SB and SP generates a system (SQ) with a different
            recognition specificity.  The crossover point lies in the central
            conserved region.  Concomitant with the generation of a new
            combination of flanking variable regions is the recombination of
            minor differences in the central conserved region. A polypeptide
            domain encoded on the 5' side of the crossover dictates recognition
            of the trinucleotide component "aac" (positions 25-27) of the
            target sequence, and a second domain, encoded on the 3' side of the
            crossover (positions 50-54), governs recognition of the tetra- or
            penta-nucleotide component.  Parts of the variable regions are
            implicated in the determination of the specificity of interaction
            between protein and DNA.
FEATURES       from  to/span     description
    pept    <     1  >   135     hsdS specificity protein (AA at 1)
    recomb       32      101     SP DNA end/SB DNA start (uncertain within
                                 span)
BASE COUNT       46 a     39 c     24 g     26 t
ORIGIN      Unreported.
        1 ataccaatcc cgtcacttgc tgaacaaaaa atcatcgccg aaaaactcga tacgctgctg
       61 gcgcaggtag acagcaccaa agcacgtctt gagcaaatcc cacaaatcct gaaacgtttt
      121 cgccaatcag tgata
//
LOCUS       RATCYP2A3A  14315 bp ds-DNA             ROD       11-AUG-1990
DEFINITION  Rat cytochrome P450 II A3 (CYP2A3) gene, complete cds.
ACCESSION   M33190
KEYWORDS    coumarin 7-hydroxylase; cytochrome P450 II A3.
SOURCE      Rat (Sprague-Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 14315)
  AUTHORS   Ueno,T. and Gonzalez,F.
  TITLE     Complete sequence of the rat CYP2A3 gene specifically transcribed
            in lung
  JOURNAL   Nucleic Acids Res. 18, 4623-4623 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Gonzalez 23-MAR-1990, for release after publication.
FEATURES       from  to/span     description
    pept       3446     3625     cytochrome P450 II A3, exon 1
               3892     4054     cytochrome P450 II A3, exon 2
               4736     4885     cytochrome P450 II A3, exon 3
               5184     5344     cytochrome P450 II A3, exon 4
               6879     7055     cytochrome P450 II A3, exon 5
               8457     8598     cytochrome P450 II A3, exon 6
               9049     9236     cytochrome P450 II A3, exon 7
               9806     9947     cytochrome P450 II A3, exon 8
              11123    11304     cytochrome P450 II A3, exon 9
    pre-msg    3422    11497     cytochrome P450 II A3 mRNA and introns
    IVS        3626     3891     intron A
    IVS        4055     4735     intron B
    IVS        4886     5183     intron C
    IVS        5345     6878     intron D
    IVS        7056     8456     intron E
    IVS        8599     9048     intron F
    IVS        9237     9805     intron G
    IVS        9948    11122     intron H
    signal    11466    11471     poly-A signal
BASE COUNT     3819 a   3801 c   2926 g   3769 t
ORIGIN      Chromosome 7.
        1 gaattccaac agagacccca cttgtgcacc cgaaatgata cacccagaag aaggtacaag
       61 cagagctaag cgaggcagag gctcccatca taacctacta atcaaaacga tagcagcgtg
      121 aacatgttca caagcatgga ggacttgggg acaacagtgg ctcgctaagg aggagaaagc
      181 aaacatgagt cgatagtcta cagtgagaga tgatagacaa caaggcaaag tcctgtgcag
      241 gcttcagctc catccagagc aatctgaaca tctaacgctg atcagaaatt gcccaacagg
      301 gcaccaagcc tccaggataa acttcctact cacagatcct tgagtatggt ggaagagggc
      361 actgatgctt ggctttgatg aagaagcact ggggtgggag gcttctcaca caccaccagc
      421 ttctttaggg ttgtcctgga ggcatttcaa aatgagacag gccttgcttg cccaccgtaa
      481 ctgaatgtta ccaaagacac caggagcttt ggctacagga catagaaata tcaatctcaa
      541 actgaccagg aaattgtctc cttagtggtt ctgggagtat cattggtctg accaagtgtg
      601 gaccctctcg ctacaataaa aacaagaggc tcccaaaggc agattgaagt tcagaggaac
      661 atcaatccat atcctttttt cttgtcaagt aataagatgt gataataaca gatgaataaa
      721 atagctattt attgacttta gcattaactc aaccattact accaagaaga agattcacct
      781 ataaagttca gtttgttatt gaagtcagct cttcaaggac acacaacaga caagatctca
      841 gggacactga ttggattgcc caggagttgg tatctgccca ggttagggtt ccattctatg
      901 ctgtgccatt gctctcaatc acggaaccag aaagggagac tggaaccatc agagggagac
      961 ttcccaaggt tgctgcttat tcactctgga aacctgtctg agatgcttgc tgctgttccc
     1021 acatgtctct ggcatgtgac cgctgtcacc gatccttcat tcacgcccct gtgtctctcc
     1081 tgcataactc ctgttcctgc tctgtcattc atgtgcccgg aaaaaaaaat caataaggac
     1141 aaatacactt gatctctcct gaactctcca agctctgttc ccattctggc ccccgtgccg
     1201 aacatgagaa catgcctgac cactcagggt ccttctgggt gcctcctcca ctgagaaaca
     1261 gattctctaa attacccctt cctcctattc aagtggaaac aaggggagag gagaggtccg
     1321 tgctgggtat agtgcaggtg caggagagat tgatgcctgc caagcctagg ctaatcccat
     1381 tccccccccc gccccttcct cctccaaaat caacaaacaa cctttctctc tgctttcttc
     1441 tcggcccttc gtttcctccc caaccggtct cagagtccaa cagcctaaaa tgagatgagc
     1501 taattgccac tcatgagctc caacctcacc caatggtcta atgctgggat accatcatgg
     1561 gagggaccca tccatcaaag cctgccagca tccacccaag gtccctccca gggtctcaag
     1621 gaagccaggg tcagcaagca cctagtgttt atatcagagc ctgctggaac caaatccata
     1681 cccagcatgg tgccacactt atgccttcac tcaaacaatt ccagtgacca cctgccgccc
     1741 ctctcaaggt ctctctcagg gtaagaatat caccaatatc cttttggtgt caagctccgt
     1801 gctaactcct ttttcacctt agtgcttgcc tgggggctat ctcaatgcca accccatcct
     1861 cctccctcct gggagcccat cctgcagcta agcttaattc caggtgtctc cagggtgagc
     1921 tgagctggat aatgaggagg aaagagaatc aaatccctgc tcctttatga cccggtcatc
     1981 tcaatgaggt tggtgatgca ccaactgcac aatgactaag attccacaga tgggatacac
     2041 atcatcttcc tgggctacgg gttcatcaaa actgtccaga ttattaaaga agacttacaa
     2101 tgtgagagaa cagtagggtc attttccctt ctccatagcc accctcagca tcccagggtc
     2161 ctcttctgtt ccccatgctg gacgctcatc taaatctctc cacctcagcc cactgataac
     2221 atccaggcat cagcacacat acccaccccc caccaaatat gacctttatt tccccaatgc
     2281 tctcttctca ctggtgggaa agatgtctag acagggaaag ctaaaagtat gagcaagagt
     2341 agatgttaat gcctctgact ttcgggcttt ccatgtatgt gccccccatg ccacccctat
     2401 gttttgttca cgtgcaccca tagaaaacat gtgtaaatta cagcatcatt gagccagtga
     2461 ctctgatggc ctcctctaca aatctcgata ggaatccagt ggggctttcg taaattagaa
     2521 ctttgctacc atattggagt tgtgattatc atttctcaag acctagacag acccccagat
     2581 attggctcta gtactgtgcc aaattccagt gaaatatgac tctggtcttc cctacctttc
     2641 tcagatctta tgttccagca atccatccta gactcagaga cttctctggg ttctgagcat
     2701 taagagacag catgaagcca taaccaaagt gcatccacgc cggggcacag gtgttgactc
     2761 caagatcctc ctgcagtttg tgtccttcca ggcactcccc atccacagcc tttcttgact
     2821 caagatccat aacccatagc cctggagtcg ccaattctcg gttccttaca ttcaggaccc
     2881 acttcattga aagtctccca gggttggctg tggaagtgta ggtctgtaat cccagcactc
     2941 agagcagaca ggtggatctc tgtagtctat gtacagactc cggactaccc agaactacat
     3001 agtgaaaact tgtctcaaac aagaaagaaa agtgatcttt caacttcata gccattcatc
     3061 ctatttcgaa caaatgaatt atagggttct tctctcctaa gaccctaata tggctatcac
     3121 ttgtgccctt tctcaaagga agggtctcta gtatacatgt catgaaggat ccagttctgg
     3181 cttctctcta ctgtcctctg taatgcatag ttttggagca cttcctcctt agacaccaca
     3241 aactcagtct actccctgtt aatctgtcat ggagccctcc tcctccttga gtgttggcta
     3301 tgtcccaaac taggtgcatg acgggggtag tctggaggta aaattagaca gttgcataat
     3361 caagaccaaa ggccagcctt ctatctcaga atgtataaaa acaagctaat tcagccatca
     3421 ctgtgtctgt catcccatcg ccaccatgct ggcctcagga ctccttctgg tggcctcagt
     3481 ggccttcctc agtgtcctgg tcttgatgtc tgtctggaag cagaggaagc tctcagggaa
     3541 gctgcctcct ggacccaccc cattgccctt catcgggaac tacctccagc tgaacacaga
     3601 gaaaatgtac agctctctca tgaaggtgcc actgggcatg gagatgggtc gaatgggggc
     3661 gtggagggat gcccaatttt gattgaggtt ttgtggtagg tattgaccag atgggagtgt
     3721 aattttaggc caggggaatt gtgaagaaat cagcactttc taggccagtg gtatcacatc
     3781 ttgggacacc cagtctgata actattagaa ctctagagag atgaatccca gaccggatat
     3841 caaagaatga accatcccct cttaccttct ctgatcattc tcctccatca gatcagccaa
     3901 cgttacggtc ctgtattcac catccacctg ggacctcgcc gagttgtggt gctgtgcgga
     3961 caggaggcag tcaaggaggc tctggtggac caagctgagg aattcagtgg tcggggagag
     4021 caggccacct tcgactggct tttcaaaggc tatggtgagg ggaagaccca ggggaatgtg
     4081 gcagaggaca tttgagggca tcagtttccc cagatttctc cctgactgtc tatcccctcc
     4141 aagcaatctt ctccccacat ccctgtctca gactggtctc tctgccagtg tttcctgact
     4201 gccccacctc cacccatgcc tccttttcca cctagcaatt gctgtcggaa tcctgccgcc
     4261 tctaagggga tccgaagctt ccctgcaacc cacaccatca ttttcagctt ctgcgattgc
     4321 tttttctctt cctctcctct gctttccagt gtcagacatc acagaacgtt tgttcctagg
     4381 ttctatctac cagcaacgat ttcttctctt tccagactct ctttgtgtct ctctctttat
     4441 ttccctctct ttttttgctc aattacactt ctttcacaat tgtatccacc ctcagctctc
     4501 tgcagtctcc tgtctctcac ctctgtctct gcacctctgt ctccctcttc ttactgtcca
     4561 gtaacttgct cttttacctg tactttcgtg gtgccttctt gtgttttgca tttgcatttc
     4621 caaggatctg tgtgcatctt tttccttctt gaccgcctga aggggaacag cgcaagggtg
     4681 gtcaatggcc agccacgccc cctgacttct cctggcttct cattcacctt cccaggcgta
     4741 gccttcagca gcggggagcg agccaaacag ctaaggcgct tctccatcgc cacgctgcgg
     4801 gacttcggcg tgggcaagcg tggcatcgag gagcgtatcc aagaggaggc gggctttctc
     4861 atcgagtcat ttcgaaagac gaacggtaag cggagcaccg ttctccggac ctaccaaggg
     4921 aaagcagcac catgtcccag gatagaaact aggctaggga agtgtatgct ggtcccctac
     4981 tgtgggatct ggtctcagcg tttttggttc agcgctgcag ccagtgccct tcttgagttc
     5041 tcagattcaa agtatctgat gctaacaccc aggcgatgct ttttgcctag ttccaatgtc
     5101 tccctttctt tcctcctcct cctccccttc ctccccctcc tccccctcct cctcctccct
     5161 ctcctcctct tccccatcca caggtgccct cattgacccc accttctatc tgagccggac
     5221 agtctccaat gtcattagct caatagtctt cggggaccgc ttcgactatg aggacaaaga
     5281 gttcctgtca ctgcttcgaa tgatgctggg aagcttccag ttcacagcta cctccacggg
     5341 gcaggtaacc tgtttcaacc ttgtcctgtc atacctcctg accttgactg ccagctcact
     5401 cccaaactgc ttccagcagg caccttcaca ctagcccttg cagttttgca agaacaaata
     5461 aatgtcaata tgtagacatc ctcaacagct gcccctgaaa tcaaaagctg gcagtctatc
     5521 tccctttccc cctccctttt ccccttcccc tttccccctc cccttccccc tcccctctcc
     5581 ctcccctccc cttccctctc atgtacacac acacacacac acacacacac acaccacaca
     5641 cacacacaca gccactcagc tccttcagct gggcacatgt taccaccaca atttgtttta
     5701 atttctaaca caagattgcc ataggaaggg tccgctgaaa acaccacccc tcttcagttc
     5761 ctgaacatct ggcactgcag aaaacacaga gctagcccac tggagtatgt agatgagtgc
     5821 cccttactca gccacggaac taagcaaacg tctgaggctc caagcgaaat ttctgcacag
     5881 atgtccccca aacatccaag taagggtctt tcaacacagc tcattcacac tcctaagcac
     5941 gaccatctgt gaagctgttt gataacaaag caggtgaaga tatggatatc agcactactt
     6001 aaaatagaaa taataagaca attttggaca agcatacatt gtgtattgag ccttttcagc
     6061 cccatacccc ccaccttccc tgttctcaca cttcctcctc ttcctcctcc tcctcttcct
     6121 cttcctcctc ttcctcctcc tccttctgtt agtcctgttt gtccctgaga cagttttatt
     6181 tctacttcta agtgatacac atgtacatat atatgcatat gtatgtacaa cgtgatttga
     6241 attaatacca cagattgaaa gaacacatac acgggttctc taatactggt taaatttatt
     6301 tgatatgatc tcccattgtg tctgctttcc tgaaaatttc cataccattg tttttaatgg
     6361 ctgaaaaatt ttacacacac acacacacac acatacacat atacatatat acatatacat
     6421 atatatctca cattttctct ttcattacct tgtctttgga cagctagttt gagtgcataa
     6481 ttcagttact gtgactaccc aggactggtt ggtttaatac tggaaaacct gcgaaccttc
     6541 accatccaaa ccctgctagt atttgaacag ctgcctctag ctaatgtcac ccaaatacct
     6601 aagcatctac agatgaaact tcagcccaac aactgaatgt cattcaagcc agccatgcgc
     6661 aagaactcca aacctatata tgcgaccaaa tccacctcac ttaaaatctt ggacaatagc
     6721 cctctcactt aaactataag aaaactatat tgcctccttg ccctaccagg ttacctgaac
     6781 ctcaggacag gtgacagaaa agtcccttgc ctgaaaacag ccaccacacc tgttggattt
     6841 tatgcagtct tggtttctcc tcctttttcc accctcagct ctatgagatg ttctcttctg
     6901 tgatgaaaca cctgccaggc ccccagcaac aggcctttaa ggagctgcag gggctggagg
     6961 acttcataac caagaaggtg gaacagaatc agcgcacgct ggatcccaat tccccaaggg
     7021 acttcatcga ctctttcctc atccgaatgc tggaggtaaa cccagaagtt agggagcatg
     7081 gcgggaggtc caaagccaag caggatgaaa tcacgcttgg ggaatggggg taatggtgga
     7141 ccacttaaat cagtcctctc aatataaacc tcacatatta ccactcagca ccactgttcc
     7201 tttcacttct cagtctcagg aggatggaca gactgagatc acagccagga tgtggtgctg
     7261 ccatatgcca ggcaccattt taaaccattt taaaggctta gtgatatcca cttatttagt
     7321 attcacacct tggacgaggt tgtgttatga ctcccatttt acaaatgagt aggctgcagc
     7381 tctgagaata taggcaactc gtcctgcatc acagctgtct caggctgaca cacaacttaa
     7441 cgttgcctgt cctcattgtc attcacgtct gtggcacagg tgacctagct ttattgtgac
     7501 cttctctcac ctatatggaa cagaggcagg ggtgttttgt gataggctgt agccctagtt
     7561 ggcctgaaac ttgctctgca caccaggcaa gtctcaaacc tgccgtggtc ctcctgaggc
     7621 tgctttctga tggcttggat tatcaacatg tggtatccgg ctgagatgga acttcatgta
     7681 gtctgaatgc aaaggattga gggatgagat tcccctagaa aatattaagg ttcgtaatga
     7741 agagatggga tttattgcct agcattcatt gtctgagggt tggaaagatg ggtcagtggt
     7801 taagagcact ggctgctctt ccagagtgcc agtgggttca atcccagcac atgcatggca
     7861 gctcacaacc atgtcatttc agtcccagta gatccaatgc cctctttggg cctccatggt
     7921 tactgaacgc atgtagtgct ctgacacaca tgcaggcgaa acactcatac actcaaaata
     7981 atttttaaaa agctctgtat caggatgggg atttagctca gtggtagagc acttgcctag
     8041 gaagcgcaag gccctgggtt tggtccccag ctccgaaaaa aagaacaaaa aaaaagctct
     8101 gtatctacct ctgaaaaacc atctcaagtt ttccctgact tcctccaaca tgacgtcagt
     8161 accattgttt cttgggctct cttcagcctt cctttcctaa tttcgaccat taaataactg
     8221 acgactaagt tagtaacaat cagacaagca ctaagtcacg aatgacagaa gagcatttat
     8281 ggtcccaggt cacttccaaa tgtctgggcc ccggagagaa ctcagcatag ccagggttcc
     8341 tcttctaagc cttaggtctg gcccaaagag aagtgagttc aagttgggaa aaaaaagcta
     8401 gaagggtacc aagagtgtaa ggaatggatt tgacgtgaat atttccctcc ctgcaggaaa
     8461 agaagaaccc caatactgag ttctacatga agaacttggt gctgactacc ctaaatctct
     8521 tctttgccgg cacagagacc gtcagcacca ccctgcgtta cggctttctg ttgctcatga
     8581 agcacccgga tattgagggt aacgaaggct tcaagggaca ggaaagcagg ggttgagggg
     8641 agcataacct ggatatagat tctaaatttg catcttttat ctgccacaat gtcccctacc
     8701 tttctcaaga aaccggactc tcaaacagct agctaccccc tgctttgcct agacagagcc
     8761 acagctggtg ggcaccagtc tctagttgct taaccaactg gtgtgctgct cgccacactt
     8821 cagaggagtt acttccccgt ctcccgggtc tcctcgcaac tcccgtacaa ttatggtatg
     8881 atccatccag ctcacgggaa acttcttcac caatgcagga gttgcacata ttgaaaatga
     8941 tctacaatgc catacactgt tccatggcct ccatcccgtc ttagtttctg aactatcatc
     9001 tctactgtgg cgttcttgga ttcctaaacc ggtccctgtt cttctcagct aaggttcacg
     9061 aggagattga ccgggtgatt ggcaggaacc ggcaggccaa gtatgaggac cgaatgaaga
     9121 tgccctacac ggaggctgtg atccacgaga tccagagatt tgcagacatg atccccatgg
     9181 gcctggctcg cagggtcacc aaggacacca agtttcgaga gttcctcctc cccaaggtgc
     9241 tgattcacct agcctagagt gacctctaga cctcttcatg ggcctccagg acctcctgct
     9301 cattagaaac cacccggccc ctctcttggt ttcttagtga caggctttct ctgtcaccac
     9361 atcccactgc cctcctactt agagactcct atatcttgtc ttcaccccag actttcccac
     9421 tcaggaattt taactccatg cctcttaaga agtataaact taatttatct agatcttgac
     9481 tgtgttggag ataggagtct catgtctgct atatatcctg tttggggcta cacactttca
     9541 atacccccaa atcttgactt gagaggcaca atttgcccca gaagacaaag cctgtattac
     9601 tcatctgtcc tggctactag aacatgagtc ccacatcccc catatcttaa ggataacctc
     9661 acattcggca aagctcctcc ctcaaaggtc ccctaactca ctatctttac cttcacttgg
     9721 ataccctagc tcaccctttt aacccctgtg ggccagcagg acattcatcc aaacatacct
     9781 catggagtct cttcttcatc accagggtac tgaagtattt cctatgctgg gctctgtact
     9841 gaaagaccct aagttcttct ccaaccccaa cgacttcaac ccaaagcact tcctagatga
     9901 caagggacag tttaagaaga gtgatgcctt tgtgcccttt tccattggta agaacccact
     9961 gccgagtgcc agaccactgc tcacaccaac agataagctt ctccattgct tctgtctgat
    10021 gcagccaaat atttctctct attttggaat attctcttgc atctaactct cggctagtta
    10081 gctgtagctc ccataattcc caagcaccat gtatctgaac cctggagacg tgatggtcaa
    10141 actttagtga taggtgaagg gaatggaacc tcaaactaag ccagaggtct ttctcaggtc
    10201 acacaaccag ttcttaaaaa ggatgtggcc acatggcagc cataccattc cctatagctg
    10261 tagggaaagg tgtaaaagta ttttttaatc tcctgatgta aatgtctccg aggcttactg
    10321 cctccctctg ctaacctagg cctagccctg gaagcttcta gcctctgaac aatcttatct
    10381 agcctagaat gttttcagcc tctgagactt gctgctaaat aagctcacac cttcctagtt
    10441 ccctctgatc tctggatggc tggttcaact cagctgtttt ggctcaaact cctctaacag
    10501 attcagtctc tctctttgcc tctcctgagc tgctctactt ggctccatgc taactttggc
    10561 aatctgttct gatcttctgt ctccttctca ttctctggct tgttctgtct tcacctgggc
    10621 ctagctcgct ctctcagtaa cctgcctcta tataactgtc ccagttaaac tgcccccttt
    10681 ctccatgttt ctctctcttc actgttctct ttgtctgcca ctcaattaga cgtcattttc
    10741 aaacatgggt gctttcttct acaaactaac tttatcttca ttgtttggaa tttaaagtgt
    10801 gtaccaaggg tttgcattcc agccagaggg attaaaggtt gtgtgctaag gctgagccac
    10861 tccacaccta gaaacaggtt tttcagtaaa taacacaatc tcggggttca cagtgtgatc
    10921 gaatatcctg caacagaagg gaatccacat ttctcattgt tacagcctgg tatggatcaa
    10981 cccactacaa gtacaaatac aactgaggct caggagttag agcgttcctt aaacgtctca
    11041 gcacttcctc ccatgaagaa tgcagccagg gatttgctat cggcaaaaac tggcttcacc
    11101 ttcccctgtc ttgtctcttt aggaaaacgg tattgtttcg gggaaggact ggcaaggatg
    11161 gaactctttc tcttcctcac aaacatcatg cagaacttct gcttcaaatc cccacaggca
    11221 ccccaggaca tcgatgtgtc tcctagactt gtgggctttg ccacaatccc accaaactac
    11281 actatgagtt tcttgtcccg ttgagccagc attgcgtgag agtaaaggga agaatggggg
    11341 ttgaaactta gaaagggtcg gatgtactga agaaacagag gaaaaagagt agacctgctg
    11401 aagatatata tttcaaaggt agagccagag aagggggaaa tatctactat actgtgaata
    11461 ggaataataa aataataaaa gtagatatta tttatggcat aagtagtctt tgtcaagttt
    11521 atgctaaggg gtcttacaat ctcacttaat actcagaagt gcatgggaag gggaagagag
    11581 ttcatgctca ttttattagg gtcacattct tgatcttatg tttccctagt ctaagtttac
    11641 caaagtctca cagaatccaa gagctcacta atatttggac aaagatctgt gttcattggc
    11701 cttgggcaga ttttcaaata tcaccacctc tcagataaaa catgaaagct caagtctatt
    11761 gccacatcat cctgggcaga tctttgggtg ggcaacctgc caggctctac cacttgcctg
    11821 gccacactct gatctactta cagagaaaac tcagggcctg caccatgaca tgaatgtgtg
    11881 ctgatttatg gatgcccacg taagtgtggt gcagggtttc ctcaagttat gcacgagtat
    11941 attttctgtc atgagttggc ttccaaaggc caagtgtttg ctcacagata agagtcagtt
    12001 aaagcagttg aaaagatgtg ctgtggtaca ctggtgtaaa tctggtgagt agatgcctgt
    12061 gcgatgcagt ctcaacagca gtggacacag cagccagcac tttctaggct tcttactttt
    12121 atgatgtcag tattagtcat acccaggagg aagggacaga tataaggata tcaaggtgac
    12181 atctttgaga actgtattgc aagccctccc tggtagtaaa tttcaaccct gcaagctgca
    12241 taagcttcct gggataatga agagaagaag gagggaacag gggtgaggat ggctggggag
    12301 gggatggggt agagtaggga gaatgactca attagaaaag tgcttgccaa gcaataatga
    12361 ggacccgagt ttggatcctc agcacccctc aaaagcccag cgtaatggca caagcctgtc
    12421 accctgatgc tgtgaggtgc gtgaaaattg cctggtccct gaagatcact gggcagccca
    12481 gtccagtcaa atggtgaact cgggatttag agaccctgtc tcaaagaaca aagtggagac
    12541 aggaagatca acagagtcaa ctaacctggg ggctagctca ctagaccctt ggggctctca
    12601 aagactgagc caccaaccaa ggtgcacacg tggcctggac agaggccccc agcacatacg
    12661 tagcatatgt acatctcaat ctttatgtgg gttccccaac aactggagca ggagctatca
    12721 taaagctatt gcctgtcagt ggaatccatt cccctaactg ggctgccttg tctcacctcc
    12781 gtgggagagg gtgcacttag ccctacagag acttgatgca caaggaaggg aatataccca
    12841 ggtgggcccc atcctctcag aagagaaggg gatgggagat gggggagagg tttgtaagga
    12901 gggaacagga gttgggcatt gttcaggatg gaaaatgaat aaataaatta attaatgggg
    12961 aaaactggag agaaactaaa gaagacatgt actgttgacc tgtggcctct acacacccat
    13021 gtacacatgc acctgcatgc acatacaaaa atacacacac acacacacac acacacacac
    13081 acacacacac acagactagc tcatttttag caactggcaa gaaagccttt gaaagtgagg
    13141 gagggaaaaa aaactaacac attctttaat gctgaaaaac attaaaataa caaacaaatg
    13201 aaaagctcct aaagacgtga cctgggctgc tgagacaggc cacattcaac caacgatgtt
    13261 gcattacata gcagcagagc ttgcagcctt ctgaagggca tccgtgacct tcttcaggct
    13321 aagctgtatg ttagggttca aagtttccca tcactttatg aatattagcc taaccagagt
    13381 aactttgatg aacgagatgg ccagaaacca acagcaaagt caaaaccaca aaagccatgg
    13441 gtatgtggaa attaagagac gtgccatcaa aattgtttct tttcaaataa aacttaggca
    13501 tataccgagg tagtagagag acagttcaat ggttaaaagc acagaatgct catgcagagg
    13561 acttgagttg ggtccctggc actcatgtct ggcagcctaa agtcagctgt aactaactaa
    13621 ctaactaact aactccaggg gatccaatgc catcctctgg attccatggg gcctgcatgt
    13681 atatatgcat atgcacacac acacacacac acacacacac acacaaagac acacacaaag
    13741 atacacacaa agacacatga ctttaaaaat aagtcttttt aaaaatataa ctattaatga
    13801 aattaatgaa gtgcaactaa tgttttccat acatgtataa attatgcaat gactgaatca
    13861 tagcaacaaa aaatgtagac ttatacaaca gtggttcaaa gaaatattac atggtaaatc
    13921 acaaaacact taaataatgg aacaaaaata aatacacaca gacttacagg atgagaacag
    13981 ccacagcagg agatgcaggt aaactcatag catggttttt aaagaaatgt caaattcata
    14041 aacggtgttt taaattttta cccactagat tgaaagactc actaaactga tgtttttttt
    14101 tctctttttt gtgattatac atacttattt tactgggtat tttttattta catttcaaat
    14161 gtgattccct ttcgaggttt cccggccata tgcactctat cacaactccc tccccttctt
    14221 ctataagggt gttccctctc cccaaccacc ccaccttcct gccccccacc gacattcccc
    14281 tgcactgggg gtccagcctc agcagggatc aaggg
//
LOCUS       ACARRDX       119 bp ss-rRNA            RNA       11-AUG-1990
DEFINITION  A.castellani 5S ribosomal RNA.
ACCESSION   M28984
KEYWORDS    5S ribosomal RNA.
SOURCE      A.castellani ribosomal RNA from Viluy encephalomyelitis patient
            KPN..
  ORGANISM  Acanthamoeba castellanii
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; 
            Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina; 
            Acanthamoebidae.
REFERENCE   1  (bases 1 to 119)
  AUTHORS   Chumakov,K.M. and Karavanov,A.S.
  TITLE     Preliminary characterization of an organism isolated from a case of
            Viluy encephalomyelitis indicates a protozoal, rather than viral,
            aetiology
  JOURNAL   J. Gen. Microbiol. 132, 1127-1133 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA          1      119     5S ribosomal RNA
BASE COUNT       26 a     36 c     37 g     20 t
ORIGIN      
        1 ggatacggcc atactgcgca gaaagcaccg cttcccatcc gaacagcgaa gttaagctgc
       61 gcgaggcggt gttagtactg gggtgggcga ccacccggga atccaccgtg ccgtatcct
//
LOCUS       BMOCHORB      538 bp ss-mRNA            INV       11-AUG-1990
DEFINITION  B.mori early chorion protein mRNA, 3' end.
ACCESSION   M36291
KEYWORDS    early chorion protein; eggshell protein.
SOURCE      B.mori, cDNA to mRNA, clone m2G12.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 538)
  AUTHORS   Eickbush,T.H., Rodakis,G.C., Lecanidou,R. and Kafatos,F.C.
  TITLE     A complex set of early chorion DNA sequences from Bombyx mori
  JOURNAL   Dev. Biol. 112, 368-376 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      482     early chorion protein precursor (AA at 3)
    sigp    <     1       44     early chorion protein (AA at 1)
    matp         45      479     early chorion protein
    signal      518      523     poly-A signal
BASE COUNT      104 a    128 c    159 g    147 t
ORIGIN      
        1 tcctcatctg tgcttcggca ttattcgtcc agtctgctgt cagtcaatat ctgagtcgcg
       61 tcggctgtgg ttgtggaggg gttggcaatg gcttgggcta cggtggtatc ggctataacg
      121 gctatgtcgg taacgatatc ggcgctgctg gagctctagg agcttccaat ggagggtgtc
      181 tcaacgtcgt cagctcctct gccgctccta ctagtttagg tgtagcctct gagaactctt
      241 acgagggaac tgtcggtgtc tgcggtaatc tgcccctctt gggtactgca atcgtcaccg
      301 gcgagttctc tactggcggc cttggtggta ttaactacgg ctgcggtaat ggcgctgttg
      361 ggatcaccgc tgaagatcgc gccggtatcg gctacgccgg cggcctcggc tacggatcag
      421 gctacggact cggctacgga ggctacgccg gcaatggatg tggatgcggt ggcgcctatt
      481 gaatgctaga atatatataa aaaaacacgt tttttttaat aaattccttt aaaaaaat
//
LOCUS       BMODNA        586 bp ss-mRNA            INV       11-AUG-1990
DEFINITION  B.mori early chorion protein mRNA, 3' end.
ACCESSION   M36290
KEYWORDS    early chorion protein; eggshell protein.
SOURCE      B.mori, cDNA to mRNA, clone m6F6.
  ORGANISM  Bombyx mori
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; 
            Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; 
            Bombycoidea; Bombycidae.
REFERENCE   1  (bases 1 to 586)
  AUTHORS   Eickbush,T.H., Rodakis,G.C., Lecanidou,R. and Kafatos,F.C.
  TITLE     A complex set of early chorion DNA sequences from Bombyx mori
  JOURNAL   Dev. Biol. 112, 368-376 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      483     early chorion protein precursor (AA at 1)
    sigp    <     1       39     early chorion protein (AA at 1)
    matp         40      480     early chorion protein
    signal      575      580     poly-A signal
BASE COUNT      117 a    133 c    166 g    170 t
ORIGIN      
        1 ttgtctggat gtgctctcgt ggctcagttc acagttggcc agtacatcgg ccgcgtgaac
       61 aatggttgtg gatgcgggag tttcgacaac cgtggcctcg gttacaccgc tggctgtggt
      121 ctcaccgctg ctagttctct tgcagcctcc catggaggag ggttattcgt cgtcacctcc
      181 tctgccgcgc ctgctggtct cggcatagct tccgagaaca gatacgaagg cgctgtcgat
      241 gtgtgcggca acattccatt cttgggcacc gctgatgtcg caggcgagtt ccccactgcg
      301 ggcattggtg agatcaacta cggctgcggc gatggagcag tcgccattac cgctgaaggt
      361 ggtctcggct acgctggagg acttgactac actggtggac tcggctatgc tagtggactt
      421 ggctacggct taggctatgg agaatacgtt ggatgcagtt gtggttgtgg tgacatctac
      481 tagaatggat attgttatat taatcttgta caagtatgta cattcaggtt gtaattttta
      541 caagaattaa attttatttt tccgaataaa ctctggtata taattt
//
LOCUS       BTHINSECT     977 bp ds-DNA             BCT       11-AUG-1990
DEFINITION  B.thuringiensis 27.3 Kd cytolytic insecticidal protein gene,
            complete cds.
ACCESSION   M35968
KEYWORDS    27.3 Kd cytolytic insecticidal protein.
SOURCE      B.thuringiensis morrisoni (mosquitocidal isolate PG-14) DNA.
  ORGANISM  Bacillus thuringiensis
            Prokaryota; Bacteria; Firmicutes; 
            Endospore-forming rods and cocci; Bacillaceae.
REFERENCE   1  (bases 1 to 977)
  AUTHORS   Galjart,N.J., Sivasubramanian,N. and Federici,B.A.
  TITLE     Plasmid location, cloning, and sequence analysis of the gene
            encoding a 27.3-kilodalton cytolytic protein from Bacillus
            thuringiensis subsp. morrisoni (PG-14)
  JOURNAL   Curr. Microbiol. 16, 171-177 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         67      816     27.3 Kd cytolytic insecticidal protein
BASE COUNT      349 a    161 c    149 g    318 t
ORIGIN      
        1 aagtttaaaa tcataattta atgttgaaag gccactattc taattaactt aaggagttgt
       61 ttatttatgg aaaatttaaa tcattgtcca ttagaagata taaaggtaaa tccatggaaa
      121 acccctcaat caacagcaag ggttattaca ttacgtgttg aggatccaaa tgaaatcaat
      181 aatcttcttt ctattaacga aattgataat ccgaattata tattgcaagc aattatgtta
      241 gcaaatgcat ttcaaaatgc attagttccc acttctacag attttggtga tgccctacgc
      301 tttagtatgg caaaaggttt agaaatcgca aacacaatta caccgatggg tgctgtagtg
      361 agttatgttg atcaaaatgt aactcaaacg aataaccaag taagtgttat gattaataaa
      421 gtcttagaag tgttaaaaac tgtattagga gttgcattaa gtggatctgt aatagatcaa
      481 ttaactgcag cagttacaaa tacgtttaca aatttaaata ctcaaaaaaa tgaagcatgg
      541 attttctggg gcaaggaaac tgctaatcaa acaaattaca catacaatgt cctgtttgca
      601 atccaaaatg cccaaactgg tggcgttatg tattgtgtac cagttggttt tgaaattaaa
      661 gtatcagcag taaaggaaca agttttattt ttcacaattc aagattctgc gagctacaat
      721 gttaacatcc aatctttgaa atttgcacaa ccattagtta gctcaagtca gtatccaatt
      781 gcagatctta ctagcgctat taatggaacc ctctaatctt agtagctata tttattaaag
      841 atggtaatat cacaagtata aatacttgtg gtattaccta ccattcttaa attatatcca
      901 aaatcatgcg ttaatctaca ttcccctttc tctaaaattt gttcttcaca catccacatt
      961 tttcgataaa acaatct
//
LOCUS       ECOPYRI       133 bp ds-DNA             BCT       11-AUG-1990
DEFINITION  E.coli ATCase regulatory protein (pyrI) gene, 3' end.
ACCESSION   M28578
KEYWORDS    ATCase regulatory protein.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 133)
  AUTHORS   Cunin,R., Jacobs,A., Charlier,D., Crabeel,M., Herve,G.,
            Glansdorff,N. and Pierard,A.
  TITLE     Structure-function relationship in allosteric aspartate
            carbamoyltransferase from Escherichia coli: I. Primary structure of
            a pyrI gene encoding a modified regulatory subunit
  JOURNAL   J. Mol. Biol. 186, 707-713 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       81     ATCase regulatory protein (pyrI) (EC 2.1.3.2)
    signal      100      129     transcription termination signal
BASE COUNT       41 a     27 c     29 g     36 t
ORIGIN      
        1 cgaaaacgcg ccaatgatat cgcgctcaaa tgcaaatact gtgaaaaaga gttttcccat
       61 aatgtggtgc tggccaatta attgcggttg gtaataaaag ttggctccct ataatgagcc
      121 agacttttta ccg
//
LOCUS       HUMBHD       1269 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human placental 17-beta-hydroxysteroid dehydrogenase mRNA, complete
            cds.
ACCESSION   M36263
KEYWORDS    placental 17-beta-hydroxysteroid dehydrogenase.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 1269)
  AUTHORS   Peltoketo,H., Isomaa,V., Maeentausta,O. and Vihko,R.
  TITLE     Complete amino acid sequence of human placental 17-beta-
            hydroxysteroid dehydrogenase deduced from cDNA
  JOURNAL   FEBS Lett. 239, 73-77 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         10      996     placental 17-beta-hydroxysteroid dehydrogenase
                                 precursor (EC 1.1.1.62)
    matp         13      993     placental 17-beta-hydroxysteroid dehydrogenase
    mRNA    <     1     1269     placental 17-beta-hydroxysteroid dehydrogenase
                                 mRNA
BASE COUNT      220 a    378 c    437 g    234 t
ORIGIN      
        1 agtctcacca tggcccgcac cgtggtgctc atcaccggct gttcctcggg catcggcctg
       61 cacttggccg tacgtctggc ttcagatcca tcccagagct tcaaagtgta tgccacgttg
      121 agggacctga aaacacaggg ccggctgtgg gaggcggccc gggccctggc atgccctccg
      181 ggatccctgg agacgttgca gctggacgta agggactcaa aatccgtggc cgctgcccgg
      241 gaacgcgtga ctgagggccg cgtggacgtg ctggtgtgta acgcaggcct gggcctgctg
      301 gggccgctgg aggcgctggg ggaggacgcc gtggcctctg tgctggacgt gaatgtagta
      361 gggactgtgc ggatgctgca ggccttcctg ccagacatga agaggcgcgg ttcgggacgc
      421 gtgttggtga ccgggagcgt gggaggattg atggggctgc ctttcaatga cgtttattgc
      481 gccagcaagt tcgcgctcga aggcttatgc gagagtctgg cggttctgct gctgcccttt
      541 ggggtccact tgagcctgat cgagtgcggc ccagtgcaca ccgccttcat ggagaaggtg
      601 ttgggcagcc cagaggaggt gctggaccgc acggacatcc acaccttcca ccgcttctac
      661 caatacctcg cccacagcaa gcaagtcttt cgcgaggcgg cgcagaaccc tgaggaggtg
      721 gcggaggtct tcctcaccgc tttgcgcgcc ccgaagccga ccctgcgcta cttcaccacc
      781 gagcgcttcc tgcccctgct gcggatgcgc ctggacgacc ccagcggctc caactacgtc
      841 accgccatgc accgggaagt gttcggcgac gttccggcaa aggccgaggc tggggccgag
      901 gctgggggcg gggccgggcc tggggcagag gacgaggccg ggcgcagtgc ggtgggggac
      961 cctgagctcg gcgatcctcc ggccgccccg cagtaaaggc ttcctcagcc gctgtctccc
     1021 gcgcccttct ttgtcccctg ggtctgtgtg gtccctgggg atggggcggc ggtagcagct
     1081 gtgggtggct aattaagata gatcgcgtta gccagtttta ccagcgcagc taggcgcgat
     1141 ggcgtcgcct gtaatgccag cgctttggga ggcggaggca ggaggatcgc tcaagccccg
     1201 gagttggaga ccagccagag caacacagtg agacccccat ctctacaaaa ataaagaaaa
     1261 tttaaaaat
//
LOCUS       HUMC1Q         84 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human complement component 1, beta-chain (C1QB) mRNA, partial cds.
ACCESSION   M36278
KEYWORDS    complement component 1.
SOURCE      Human liver, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 84)
  AUTHORS   Reid,K.B.M., Bentley,D.R. and Wood,K.J.
  TITLE     Cloning and characterization of the complementary DNA for the B
            chain of normal human serum C1q
  JOURNAL   Philos. Trans. R. Soc. Lond., B, Biol. Sci. 306, 345-354 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >    84     complement 1-q subcomponent beta-chain (AA at
                                 1) /hgml_locus_uid="LR0054Y" /nomgen="C1QB"
                                 /map="1p"
BASE COUNT       18 a     25 c     22 g     19 t
ORIGIN      
        1 gccaccgaca agaactcact actgggcatg gagggtgcca acagcatctt ttccgggttc
       61 ctgctctttc cagatatgga ggcc
//
LOCUS       HUMGAPDA      197 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human glyceraldehyde-3-phosphate dehydrogenase mRNA, 3' flank.
ACCESSION   M36164
KEYWORDS    glyceraldehyde-3-phosphate dehydrogenase.
SOURCE      Human pancreatic adenocarcinoma cell line HPAF, cDNA to mRNA, clone
            pII5B.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 197)
  AUTHORS   Schek,N., Hall,B.L. and Finn,O.J.
  TITLE     Increased glyceraldehyde-3-phosphate dehydrogenase gene expression
            in human pancreatic adenocarcinoma
  JOURNAL   Cancer Res. 48, 6354-6359 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    signal      175      180     poly-A signal /hgml_locus_uid="LM0055R"
                                 /nomgen="GAPD" /map="12p13"
BASE COUNT       50 a     69 c     46 g     31 t      1 others
ORIGIN      
        1 ccctggacca ccagccccag caagagcaca agaggaagag agagaccctc actgctgggg
       61 agtccctgcc acactcagtc ccccaccaca ctgaatctcc cctcctcaca gttgccatgt
      121 agaccccttg aagaggggag gggcctaggg agccgcacct tgtcatgtac catcaataaa
      181 gtaccctgtg ctnaacc
//
LOCUS       HUMGYPAA      445 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human glycophorin A mRNA, partial cds.
ACCESSION   M36281
KEYWORDS    glycophorin A.
SOURCE      Human, cDNA to mRNA, clone pHGpA-3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 445)
  AUTHORS   Siebert,P.-D. and Fukuda,M.
  TITLE     Molecular biological study of the structure and expression of human
            glycophorin A
  JOURNAL   Rev. Fr. Transfus. Immunohematol. 24, 251-266 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   445     glycophorin A precursor (AA at 2)
                                 /hgml_locus_uid="LW0071X" /nomgen="GYPA"
                                 /map="4q28-q31"
    sigp    <     1       67     glycophorin A signal peptide (AA at 1)
    matp         68  >   445     glycophorin A
BASE COUNT      135 a    103 c     84 g    123 t
ORIGIN      
        1 tgatctcagg atgtatggaa aaataatctt tgtattacta ttgtcagcaa ttgtgagcat
       61 atcagcatca agtaccactg gtgtggcaat gcacacttca acctcttctt cagtcacaaa
      121 gagttacatc tcatcacaga caaatgatac gcacaaacgg gacacatatg cagccactcc
      181 tagagctcat gaagtttcag aaatttctgt tagaactgtt taccctccag aagaggaaac
      241 cggagaaagg gtacaacttg cccatcattt ctctgaacca gagataacac tcattatttt
      301 tggggtgatg gctggtgtta ttggaacgat cctcttaatt tcttacggta ttcgccgact
      361 gataaagaaa agcccatctg atgtaaaacc tctcccctca cctgacacag acgtgccttt
      421 aagttctgtt gaaattgaga acccc
//
LOCUS       HUMGYPCAA     915 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human glycophorin C mRNA, complete cds.
ACCESSION   M36284
KEYWORDS    glycophorin C; syaloglycoprotein.
SOURCE      Human reticulocytes and fetal liver, cDNA to mRNA, clones pGCW5 and
            pGCF22.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 915)
  AUTHORS   Cartron,J.-P., Colin,Y., Le Van Kim,C., Rahuel,C., Blanchard,D.,
            Bloy,C. and London,J.
  TITLE     Structure of human erythrocyte glycophorin C deduced from cDNA
            analysis
  JOURNAL   Rev. Fr. Transfus. Immunohematol. 24, 267-285 (1986)
  STANDARD  simple staff_review
COMMENT     The first 6 nucleotides at the 5' end were not present in the cDNA
            clones but where deduced from the known protein sequence.
FEATURES       from  to/span     description
    pept          1      387     glycophorin C /hgml_locus_uid="LM0084R"
                                 /nomgen="GYPC" /map="2q14-q21"
    mRNA    <     1      915     glycophorin C mRNA
BASE COUNT      228 a    249 c    262 g    176 t
ORIGIN      
        1 atgtggtcga cgagaagccc caacagcacg gcgtggcctc tcagcctcga gcctgatccg
       61 gggatggcct ctgcctccac cacaatgcat actaccacca ttgcagagcc tgatccaggg
      121 atgtctggat ggccggatgg cagaatggag acctccaccc ccaccataat ggacattgtc
      181 gtcattgcag gtgtgattgc tgctgtggcc atcgtcctag tctccctcct cttcgtcatg
      241 ctgcgctaca tgtaccggca caagggcacg taccacacca atgaggccaa gggcacggag
      301 tttgctgaga gtgcagatgc agccctgcag ggagaccctg ccctccaaga tgctggtgat
      361 agcagcagaa aggagtactt tatttgaggg acaacagact tcacttccct gaatgcctcc
      421 cccatctcca tcaggaaaaa tacaccccat cgcccagtat ccccgtcgat accaccagac
      481 agagagagag agtacacttg atttcttccc gagatagtta cctagaaaca ctaggtgcct
      541 gcccaaggag gaacggagga ggactcgcgc tacaagaggc cactcccagg gacccaggga
      601 ggcgatggcc accccagagg ccaccttttg ctccacggag gtgggagaga atctgggcac
      661 atggggcccc ctagggcagt gcaggacaac atcagctcac tggcaggaaa gtccttgttg
      721 agggtgaggg ggtgctgggg tacccggggg ctggggaagc aaggaaataa gtcatctgta
      781 tgctgactgg ggataatggc atcaatgtca gtccttgact ttggggggaa cagcaggtgc
      841 cagagctaaa aggtaccttt gtctgccatt gatccagcta agaacgattg gaaataaatt
      901 ggaaatgtaa ccgag
//
LOCUS       HUMIGDD       102 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION   M36090
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            AW.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 102)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   102     Ig H-chain VDJ-region (AA at 2)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       26 a     23 c     36 g     17 t
ORIGIN      
        1 gagagctgag gacacggctg tgtattactg tgcgaaagat cgagcagcaa cggacgggct
       61 actactacgg tatggacgtc tggggccaag ggaccacggt ca
//
LOCUS       HUMIGDE       100 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION   M36091
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            AW.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   100     Ig H-chain V-D-J region (AA at 1)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       22 a     24 c     30 g     24 t
ORIGIN      
        1 cctgcagctg aactctgtaa ctcccgagga cacggctgtg tattactgtg caagaggact
       61 gggtggcttt tgatatctgg ggccaaggga caatcgtcac
//
LOCUS       HUMIGDF       101 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION   M36092
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            AW.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 101)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   101     Ig heavy-chain V-D-J region (AA at 3)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       17 a     27 c     34 g     23 t
ORIGIN      
        1 ccgccgcgga cacggccgtg tattactgtg ccagggtagg atattgtact aatggtgttt
       61 ccgggatggt tcgacccctg gggccaggga accctggtca c
//
LOCUS       HUMIGDG       100 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION   M36093
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            TD.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   100     Ig heavy-chain V-D-J region (AA at 1)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       22 a     34 c     25 g     19 t
ORIGIN      
        1 ccatcacact cctacctgca tggagcacct aaggcctcgg acaccgccat gtattactgt
       61 gcgaccattg actactgggg ccagggaacc ctggtcacgg
//
LOCUS       HUMIGDH       100 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION   M36094
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            TD.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   100     Ig heavy-chain V-D-J region (AA at 1)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       23 a     30 c     24 g     23 t
ORIGIN      
        1 ccatcacact cctacctgca tggagcacct aaggcctcgg acaccgccat gtattactgt
       61 gcgagggggg atggttacta ctactactac atggacgtct
//
LOCUS       HUMIGDI       100 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION   M36095
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            TD.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept.ps <     1  >   100     Ig heavy-chain V-D-J region (AA at 3)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       25 a     22 c     30 g     23 t
ORIGIN      
        1 aaatgaacag cctgagagct gaggacacgg ctgtgtattg aacccacggt ggtagctgct
       61 actctgcttt tgatatctgg ggccaaggga caatggtcac
//
LOCUS       HUMIGDJ       100 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION   M36096
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            TD.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   100     Ig heavy-chain V-D-J region (AA at 1)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       24 a     23 c     30 g     23 t
ORIGIN      
        1 tctgagagcc gaggacacgg ctgtgtatta ctgtgcaaca gggccgtcac tggttattat
       61 aaactactac tacggtatgg acgtctgggg ccaagggcca
//
LOCUS       HUMIGDK       100 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION   M36097
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            TD.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   100     Ig heavy-chain V-D-J region (AA at 2)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       26 a     24 c     33 g     17 t
ORIGIN      
        1 ggcgtatctg caaatgaaca gcctgaaaac cgaggacacg gccgtgtatt actgtactag
       61 acgggacggt atggacgtct ggggccaagg gaccacggtc
//
LOCUS       HUMIGDL       100 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene V-D-J region, partial cds.
ACCESSION   M36098
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene; variable region.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            TD.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   100     Ig heavy-chain V-D-J region (AA at 1)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       21 a     18 c     33 g     28 t
ORIGIN      
        1 acggccgtgt attactgtgc gagagatagg ggatttctac gagtagtggt ggtagctgct
       61 actctgcttt tgatatctgg ggccaaggga caatggtcac
//
LOCUS       HUMIGDM       100 bp ds-DNA             PRI       11-AUG-1990
DEFINITION  Human Ig rearranged heavy-chain gene D-J region, partial cds.
ACCESSION   M36099
KEYWORDS    diversity region; immunoglobulin heavy-chain; joining exon;
            processed gene.
SOURCE      Human acute lymphoblastic leukemia B-cell line DNA, from patient
            TD.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 100)
  AUTHORS   Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J.
  TITLE     Continuing rearrangement but absence of somatic hypermutation in
            immunoglobulin genes of human B cell precursor leukemia
  JOURNAL   J. Exp. Med. 168, 229-245 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   100     Ig heavy-chain D-J region (AA at 1)
                                 /hgml_locus_uid="LR0220V" /nomgen="IGH@"
                                 /map="14q32.33"
BASE COUNT       24 a     15 c     29 g     32 t
ORIGIN      
        1 ggtaggttta gaatgaggtc tgtgtcactg tggtattacg atattttgac tggttattat
       61 aaactactac tacggtatgg acgtctgggg ccaagggcca
//
LOCUS       LAMPYRIB      135 bp ds-DNA             PHG       11-AUG-1990
DEFINITION  Bacteriophage lambda mutant truncated regulatory protein (pyrI)
            gene from E.coli, 3' end.
ACCESSION   M28579
KEYWORDS    ATCase regulatory protein.
SOURCE      Bacteriophage lambda (strain lambda OTC3) DNA.
  ORGANISM  Bacteriophage lambda
            Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
REFERENCE   1  (bases 1 to 135)
  AUTHORS   Cunin,R., Jacobs,A., Charlier,D., Crabeel,M., Herve,G.,
            Glansdorff,N. and Pierard,A.
  TITLE     Struture-function relationship in allosteric aspartate
            carbamoyltransferase from Escherichia coli: I. Primary structure of
            a pyrI gene encoding a modified regulatory subunit
  JOURNAL   J. Mol. Biol. 186, 707-713 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       75     ATCase regulatory protein
BASE COUNT       54 a     24 c     22 g     35 t
ORIGIN      
        1 cgaaaacgcg ccaatgatat cgcgctcaaa tgcaaatact gtgaaaaaga gttttatact
       61 aagttggcat tataaaaaag cattgcttat caatttgttg caagcaacag gtcactatca
      121 gtcaaaataa aatca
//
LOCUS       MUSBGALA      240 bp ss-mRNA            ROD       11-AUG-1990
DEFINITION  Mouse beta-1,4-galactosyltransferase mRNA, 5' end.
ACCESSION   M36289
KEYWORDS    beta-1,4-galactosyltransferase.
SOURCE      Mouse, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 240)
  AUTHORS   Shaper,J.H., Hollis,G.F. and Shaper,N.L.
  TITLE     Evidence for two forms of murine beta-1,4-galactosyltransferase
            based on cloning studies
  JOURNAL   Biochimie 70, 1683-1688 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         52  >   240     beta-1,4-galactosyltransferase (long form)
                                 precursor
    sigp         52      111     beta-1,4-galactosyltransferase (long form)
                                 signal peptide (put.)
    matp        112  >   240     beta-1,4-galactosyltransferase (long form)
    pept         91  >   240     beta-1,4-galactosyltransferase (short form)
BASE COUNT       27 a     91 c     77 g     45 t
ORIGIN      Chromosome 4.
        1 gggccgtcct ctcagccgta gcccaccccc tcttaaagcc gcggcgggaa gatgaggttt
       61 cgtgagcagt tcctgggcgg cagcgccgcg atgccgggcg cgaccctgca gcgggcctgc
      121 cgcctgctcg tggccgtctg cgcgctgcac ctcggcgtca ccctcgtcta ttacctctct
      181 ggccgcgatc tgagccgcct gccccagttg gtcggagtct cctctacact gcagggcggc
//
LOCUS       MUSIGHADN     348 bp ds-DNA             ROD       11-AUG-1990
DEFINITION  Mouse rearranged Ig heavy-chain gene V-D-J3 region, partial cds.
ACCESSION   M36103
KEYWORDS    immunoglobulin heavy-chain; joining exon; processed gene;
            variable region.
SOURCE      Mouse hybridoma GA733 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Caton,A.J.
  TITLE     Comparative sequences analysis of CO17-1A antigen-specific mono-
            clonal antibodies
  JOURNAL   Hybridoma 5, 11-16 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   348     Ig heavy-chain V-J region (AA at 1)
    recomb      294      303     V-region end/D-region start
    recomb      303      304     D-region end/J3-region start
BASE COUNT       91 a     83 c     91 g     83 t
ORIGIN      
        1 caggttcagc tacagcagtc tggagctgag gtgatgaggc ctggggcctc agtgaagata
       61 tcctgcaagg ctactggcta cacattcact aggtactaca tacaatgggg taaaaacaga
      121 gggcctggac atggccttga gtggattgga gagattttac ctggaactct tactaattac
      181 aatgagaaat tcaagggcaa ggccgcattc actgcagata gatcctccaa cacagcctac
      241 atgcaactca gcagccttac atctgaggac tctgccgtct attactgtgc aagagatggt
      301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS       MUSIGHADO     348 bp ds-DNA             ROD       11-AUG-1990
DEFINITION  Mouse rearranged Ig heavy-chain gene V-D-J3 region, partial cds.
ACCESSION   M36104
KEYWORDS    immunoglobulin heavy-chain; joining exon; processed gene;
            variable region.
SOURCE      Mouse hybridoma CO17A-1A DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 348)
  AUTHORS   Caton,A.J.
  TITLE     Comparative sequences analysis of CO17-1A antigen-specific mono-
            clonal antibodies
  JOURNAL   Hybridoma 5, 11-16 (1986)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   348     Ig heavy-chain V-J region (AA at 1)
    recomb      294      303     V-region end/D-region start
    recomb      303      304     D-region end/J3-region start
BASE COUNT       81 a     79 c    101 g     87 t
ORIGIN      
        1 caggtccagt tgcagcagtc tttagctgag ctggtaaggc ctgggacttc agtgaaggtg
       61 tcctgcaagg cttctggata cgccttcact aattacttga tagagtgggt aaagcagagg
      121 cctggacagg gccttgagtg gattggggtg attaatcctg gaagtggtgg tactaactac
      181 aatgagaagt tcaagggcaa ggcaacactg actgcagaca aatcctccag cactgcctac
      241 atgcagctca gcagcctgac atctgatgac tctgcggtct atttctgtgc aagagatggt
      301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca
//
LOCUS       RICGICR       524 bp ds-DNA             PLN       11-AUG-1990
DEFINITION  O.sativa Gly-tRNA gene.
ACCESSION   M35993
KEYWORDS    transfer RNA-Gly.
SOURCE      O.sativa DNA.
  ORGANISM  Oryza sativa
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; 
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 524)
  AUTHORS   Reddy,P.S. and Padayatty,J.D.
  TITLE     Effects of 5' flanking sequences and changes in the 5' internal
            control region on the transcription of rice Gly-tRNA-GCC gene
  JOURNAL   Plant Mol. Biol. 11, 575-583 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA        381      451     Gly-tRNA
    anticdn     413      415     Gly-tRNA anticodon cgg
BASE COUNT      166 a     72 c     98 g    187 t      1 others
ORIGIN      
        1 gcgcatgttt gtttcagctt aagattattg taatntagat tatttaaatc aagattactt
       61 taagtttgga ttataataag ctgatataaa ataagctaca agttgtttgt ttttctagat
      121 tattaaaggc atctaagggt agtgggtttt tagccacttc aattaatctg gaaaaagctc
      181 atatagaaaa gattattgga ttatagtaat ctggcttata gttataataa tctagcatca
      241 ataatctact tgtttgtttc agcttacttc taataattta gtttataata attctgaaac
      301 tgaattaaac agggtcttag acatatgagt tggatttgtg tagagctgaa attagtttac
      361 tcgtagaatc aagaacgaaa gcaccagtgg tctagtggta gaatagtacc ctgccacggt
      421 acagacccgg gttcgattcc cggctggtgc attattaatt tttggctctt ccattttaga
      481 taaagacaat tggcaacttg atgcatttat gaacaaatct gcgc
//
LOCUS       M27244       1413 bp ds-DNA             UNA       11-AUG-1990
DEFINITION  Figure 2. The sequence and predicted coding capacity of the
            HOX-2.5 h14 cDNA.
ACCESSION   M34857 M27244
REFERENCE   1  (bases 1 to 1413)
  AUTHORS   Bogarad,L.D., Utset,M.F., Awgulewitsch,A., Miki,T., Hart,C.P. and
            Ruddle,F.H.
  TITLE     the developmental expression pattern of a new murine homeo box
            gene: hox-2.5
  JOURNAL   Dev. Biol. 133, 537-549 (1989)
  STANDARD  unannotated staff_entry
COMMENT     
FEATURES       from  to/span     description
BASE COUNT      366 a    386 c    311 g    350 t
ORIGIN      
        1 gcgccggggg agctgctcaa acagggcacg cccgagtaca gtttggaaac ttcggcgggc
       61 agggaggctg tcctgtctaa tcaaagagct ggctacgggg acaataaaat ttgcgaagga
      121 agcgaggaca aagagaggcc gggtcaaacc aacccctctg ccaactggct gcacgcccgc
      181 tcttcccgga aaaagcgctg tccctacacc aaataccaga cgctggagct agagaaggag
      241 tttctgttca atatgtacct caccagggac cgcaggcatg aagtggccag actcctcaat
      301 ctgagtgaga gacaagtcaa aatctggttt cagaaccggc ggatgaaaat gaagaagatg
      361 aataacggag caggcaaaga gtaaagatga ccaccccccc cttccccagc tcactcttat
      421 tatttatgtg atggtcaaaa agccactgct gtctgggtgt acccaacgtg agtggggaag
      481 agtatctcct ctttaaaatc cctcatctgc acctggagcc tgtctcctct cctttgcgcc
      541 tctctctctc tctctctctc tctctctctc tctctctctc tctctctcca ctctcctggt
      601 gttaagagga aattttatta agttagaagc tagctatact tggttcctag aaaagtgatg
      661 agcctcgctc cgaggaaaga gaccctagtc gagctcctag catccctgct aatttttctg
      721 gaaaacccca gcccctctct tccagtctgc ctgttccctc cctgtaacct agccagacgc
      781 cacccaggta cagtttaact gcacacagcc cagtggatgc ccacagacat agagtttgga
      841 gtctatgcta atcatcgtgg agaaaggaca tcaagacgtt gttcatcaaa ccaagcaggg
      901 cagctctcag atcaggttca ttttggttgt gccctgatcc aaacacacac taggaagatg
      961 tctagagatg ccctcaatga ggcactccag tcccaaagtc tcaacatccc aatatgtatc
     1021 ctatggaatt gccagctaga cccatatctg gggaggcctg tgggagatgt ggccattcct
     1081 gtccccaagc tgttgtgagc tagaaagggc aagaaaaagc agggagtggt tttatgaagg
     1141 taccattagt ggagtcccct cgtattttcc ctgcaggaag tgtttaattc tgcatgtttc
     1201 ctttccttcc aactaaagga ggtcaaagtg ggttgtagag ccctgacagt gctgtttccc
     1261 cattcataca ttcctatcct catttgacag aatgtaactt ttcttgctga ccacagtctc
     1321 agatttctct ctatcctcca actttgaaat caactctcct ttccaccaaa cccatcttgc
     1381 accagcaaag acaagtctcc cattgtaaaa aaa
//
LOCUS       CHKBGLOA      168 bp ds-DNA             VRT       11-AUG-1990
DEFINITION  Chicken beta-3-globin gene, exon N.
ACCESSION   M33204
KEYWORDS    beta-3-globin.
SOURCE      Chicken DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 168)
  AUTHORS   Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D.
  TITLE     Chromosomal arrangement of the chicken beta-type globin genes
  JOURNAL   Cell 24, 669-677 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     7  /   168     beta-3-globin, exon N (AA at 7)
    IVS     <     1        6     beta-3-globin intron N (no splice consensus)
BASE COUNT       37 a     58 c     38 g     35 t
ORIGIN      
        1 tacaggctgc tgatcgtcta cccctggacc cagaggttct ttgcgtcctt tgggaacctc
       61 tccagcccca ctgccatcct tggcaacccc atggtccgcg cccacggcaa gaaagtgctc
      121 acctcctttg gggatgctgt gaagaaccta gacaacatca agaacacc
//
LOCUS       CHKBGLOB       90 bp ds-DNA             VRT       11-AUG-1990
DEFINITION  Chicken beta-1-globin gene, exon N.
ACCESSION   M33205
KEYWORDS    beta-1-globin.
SOURCE      Chicken DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 90)
  AUTHORS   Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D.
  TITLE     Chromosomal arrangement of the chicken beta-type globin genes
  JOURNAL   Cell 24, 669-677 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     1  /    90     beta-1-globin, exon N (AA at 1)
BASE COUNT       22 a     29 c     20 g     19 t
ORIGIN      
        1 aggttctttg ataacttcgg gaacctctcc agccccaccg ccatcattgg taaccccaag
       61 gtccgtgctc acggcaaaaa agtgctgagc
//
LOCUS       CHKBGLOC      234 bp ds-DNA             VRT       11-AUG-1990
DEFINITION  Chicken beta-2-globin gene, exon N.
ACCESSION   M33206
KEYWORDS    beta-2-globin.
SOURCE      Chicken DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 234)
  AUTHORS   Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D.
  TITLE     Chromosomal arrangement of the chicken beta-type globin genes
  JOURNAL   Cell 24, 669-677 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     7  /   228     beta-2-globin, exon N (AA at 7)
    IVS     <     1        6     beta-2-globin intron N (no splice consensus)
    IVS         229  >   234     beta-2-globin intron N+1
BASE COUNT       53 a     68 c     65 g     48 t
ORIGIN      
        1 tgcaggctgc tgatcgtcta cccctggacc cagaggttct ttgcgtcctt tgggaacctg
       61 tccagtgcca ccgccatcat tggcaacccc atggttcgcg cccacggcaa gaaagtgctg
      121 agctcctttg gggaggctgt aaagaacctg gacaacatca agaagagctt tgcccagctg
      181 agcaaactgc actgtgacaa gctgcacgtg gaccccgaga acttcagggt gagt
//
LOCUS       CHKBGLOD      121 bp ds-DNA             VRT       11-AUG-1990
DEFINITION  Chicken beta-4-globin gene, exon N.
ACCESSION   M33207
KEYWORDS    beta-4-globin.
SOURCE      Chicken DNA.
  ORGANISM  Gallus gallus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; 
            Neornithes; Neognathae; Galliformes; Phasianidae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D.
  TITLE     Chromosomal arrangement of the chicken beta-type globin genes
  JOURNAL   Cell 24, 669-677 (1981)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    /     1  /   121     beta-4-globin, exon N (AA at 1)
BASE COUNT       19 a     43 c     29 g     30 t
ORIGIN      
        1 ctccttgggg acatcctgat catcgtcctg gcctcccact ttcccaggga tttcactcct
       61 gcctgccaat ttgcctggca gaagctggtc aacgttgtgg cccatgccca ggcccgcaag
      121 t
//
LOCUS       ECOTRYA        87 bp ss-RNA             RNA       11-AUG-1990
DEFINITION  E.coli Tyr-tRNA.
ACCESSION   M35994
KEYWORDS    transfer RNA-Tyr.
SOURCE      E.coli.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 87)
  AUTHORS   Doctor,B.P., Loebel,J.E., Sodd,M.A. and Winter,D.B.
  TITLE     Nucleotide sequence of Escherichia coli tyrosine transfer
            ribonucleic acid
  JOURNAL   Science 163, 693-695 (1969)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA          1       87     Tyr-tRNA
    anticdn      36       38     Tyr-tRNA anticodon rta
    modified      8        8     p
    modified     18       18     gm
    modified     20       20     q
    modified     41       41     p
    modified     66       66     p
BASE COUNT       19 a     27 c     24 g     16 t      1 others
ORIGIN      
        1 gggtgggttc cccgagcggg caaagggagc agactrtata tctgccggtc acagacttcg
       61 aaggttcgaa tccttcccca cccacca
//
LOCUS       HUMMHDCBA     142 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human MHC class II HLA-DC-beta chain mRNA, 5' end.
ACCESSION   M35982
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human, cDNA to mRNA, clone pII-beta-5.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 142)
  AUTHORS   Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A.
  TITLE     Signal sequences distinguish class II histocompatibility antigen
            beta chains of different loci
  JOURNAL   Scand. J. Immunol. 19, 91-97 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   142     MHC HLA-DC-beta chain precursor (AA at 2)
    sigp    <     1       79     MHC HLA-DC-beta signal peptide
    matp         80  >   142     MHC HLA-DC-beta
BASE COUNT       27 a     40 c     44 g     31 t
ORIGIN      
        1 aggggggatc cccggagacc ttcgggtagc aactgtcacc ttgatgctgg cgatcctgag
       61 ctcctcactg gctgagggca gagactctcc cgaggatttc gtgtaccagt ttaagggcct
      121 gtgctacttc accaacggga cg
//
LOCUS       HUMMHDRAA     157 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human MHC class II HLA-DR-alpha chain mRNA, 5' end.
ACCESSION   M35979
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human, cDNA to mRNA, clone pII-alpha-4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 157)
  AUTHORS   Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A.
  TITLE     Signal sequences distinguish class II histocompatibility antigen
            beta chains of different loci
  JOURNAL   Scand. J. Immunol. 19, 91-97 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          8  >   157     HLA-DR-alpha chain precursor
    sigp          8       82     HLA-DR-alpha chain signal peptide
    matp         83  >   157     HLA-DR-alpha chain
BASE COUNT       43 a     31 c     40 g     43 t
ORIGIN      
        1 gaagaaaatg gccattagtg gagtccctgt gctaggattt ttcatcatag ctgtgctgat
       61 gagcgctcag gaatcatggg ctatcaaaga agaacatgtg atcatccagg ccgagttcta
      121 tctgaatcct gaccaatcag gcgagtttat gtttgac
//
LOCUS       HUMMHDRBB     157 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human MHC class II HLA-DR-beta chain mRNA, 5' end.
ACCESSION   M35980
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human, cDNA to mRNA, clone pII-beta-3.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 157)
  AUTHORS   Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A.
  TITLE     Signal sequences distinguish class II histocompatibility antigen
            beta chains of different loci
  JOURNAL   Scand. J. Immunol. 19, 91-97 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          8  >   157     MHC HLA-DR-beta chain precursor
    sigp          8       94     MHC HLA-DR-beta chain signal peptide
    matp         95  >   157     MHC HLA-DR-beta chain
BASE COUNT       26 a     42 c     46 g     43 t
ORIGIN      
        1 ctccagcatg gtgtgtctga ggctccctgg aggctcctgc atggcagttc tgacagtgac
       61 actgatggtg ctgagctccc cactggcttt ggctggggac accagaccac gtttcttgga
      121 gtactctacg tctgagtgtc atttcttcaa tgggacg
//
LOCUS       HUMMHDRBC     154 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human MHC class II HLA-DR-beta chain mRNA, 5' end.
ACCESSION   M35981
KEYWORDS    cell surface glycoprotein; class II gene;
            integral membrane protein; major histocompatibility complex.
SOURCE      Human, cDNA to mRNA, clone pII-beta-4.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 154)
  AUTHORS   Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A.
  TITLE     Signal sequences distinguish class II histocompatibility antigen
            beta chains of different loci
  JOURNAL   Scand. J. Immunol. 19, 91-97 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept          5  >   154     MHC HLA-DR-beta chain precursor
    sigp          5       91     MHC HLA-DR-beta chain signal peptide
    matp         92  >   154     MHC HLA-DR-beta chain
BASE COUNT       27 a     37 c     49 g     41 t
ORIGIN      
        1 cagcatggtg tgtctgaggc tccctggagg ctcctgcatg gcagttctga cagtgacact
       61 gatggtgctg agctccccac tggctttggc tggggacacc agaccacgtt tcttggagga
      121 ggttaagttt gagtgtcatt tcttcaacgg gacg
//
LOCUS       MUSIAPA       494 bp ss-mRNA            ROD       11-AUG-1990
DEFINITION  Mouse intracisternal A particle mRNA, 5 'end.
ACCESSION   M36322
KEYWORDS    intracisternal A particle.
SOURCE      Mouse endogenous A type retrovirus embryonal carcinoma cell line
            PCC3, cDNA to mRNA, clone EC26.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 494)
  AUTHORS   Morgan,R.A., Christy,R.J. and Huang,R.C.C.
  TITLE     Murine A type retroviruses promote high levels of gene expression
            in embryonal carcinoma cells
  JOURNAL   Development 102, 23-30 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        477  >   494     intracisternal A particle mRNA
BASE COUNT      111 a    124 c    125 g    134 t
ORIGIN      
        1 ctgcagctag aaaggctctg cggcatctct gagcctattc tagggagaca tgtcatcttt
       61 caagaaggtt gagtgtccaa gtgtccttct ctgcaggcaa aacgacaagg gagcaggtca
      121 gggttcctct gggtaaaagc ctgtgagcct aagagcctaa tcctgtacat ggctccttaa
      181 cctacacatt ggggatttga cctctatctc cactctcatt aatatgggtg gcctatttgc
      241 ctcttattaa aaggaaaggg ggagatgttg ggagccgccc ccacattcgc cgttacaaga
      301 tggcgctgac agctgtgttc taagtggtaa acaaataatc tgcgcatgtg ccgagggtgg
      361 ttctccactc catgtctctg ccttccccgt gacgtcaact cggccatggc tgcagccaat
      421 cagggagtga cacgtcctag gcgaaggata attctcctta atagggacgg gttttcgttt
      481 tttctctctc tctt
//
LOCUS       MUSIAPB       492 bp ss-mRNA            ROD       11-AUG-1990
DEFINITION  Mouse intracisternal A particle mRNA, 5 'end.
ACCESSION   M36323
KEYWORDS    intracisternal A particle.
SOURCE      Mouse endogenous A type retrovirus embryonal carcinoma cell line
            PCC3, cDNA to mRNA, clone EC32.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 492)
  AUTHORS   Morgan,R.A., Christy,R.J. and Huang,R.C.C.
  TITLE     Murine A type retroviruses promote high levels of gene expression
            in embryonal carcinoma cells
  JOURNAL   Development 102, 23-30 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    mRNA        475  >   492     intracisternal A particle mRNA
BASE COUNT      111 a    118 c    129 g    134 t
ORIGIN      
        1 ctgcagtaga aaggctctgc ggcatattga gcctattcta gggagacatg tcatctttca
       61 agaaggttga gtgtccaagt gtccttctct gcaggcaaaa cgacacggga gcaggtcagg
      121 gttgcctctg ggtaaaagcc tgtgagccta agagctaatc ctgtaatggc tccttaacct
      181 acacattggg gatttgacct cttctccact ctcattaata tgggtggcct atttgcctct
      241 tattaaaagg aaagggggag atgttgggag ccgcccccac attcgccgtt acaagatggc
      301 gtgacagctg tgttctaagt ggtaaacaaa taatctgcgc atgtgccgag ggtggttctc
      361 cactccatgt gctctgcctt cccggtgacg tcaactcggc catggctgca gccaatcagg
      421 gagtgacacg tcctaggcga aggataattc tccttaatag ggacgggttt tatcgttttt
      481 tctcgctctc tt
//
LOCUS       PT4SB         122 bp ds-DNA             PHG       11-AUG-1990
DEFINITION  Bacteriophage T4 mutant Ser-tRNA suppressor gene.
ACCESSION   M35988
KEYWORDS    serine transfer-RNA.
SOURCE      Bacteriophage T4 (strain R24) DNA.
  ORGANISM  Bacteriophage T4
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 122)
  AUTHORS   McClain,W.H.
  TITLE     Specific duplications fostered by a DNA structure containing
            adjacent inverted repeat sequences
  JOURNAL   J. Mol. Biol. 204, 27-40 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA    <     1  >   122     Ser-tRNA
BASE COUNT       25 a     35 c     36 g     26 t
ORIGIN      
        1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ccggagcgac
       61 tgccggtctt taaaaccggc agtcgctccg gcgactcata ggttcaaatc ctatcgcctc
      121 cg
//
LOCUS       PT4SC         121 bp ds-DNA             PHG       11-AUG-1990
DEFINITION  Bacteriophage T4 mutant Ser-tRNA suppressor gene.
ACCESSION   M35989
KEYWORDS    serine transfer-RNA.
SOURCE      Bacteriophage T4 (strain R66) DNA.
  ORGANISM  Bacteriophage T4
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   McClain,W.H.
  TITLE     Specific duplications fostered by a DNA structure containing
            adjacent inverted repeat sequences
  JOURNAL   J. Mol. Biol. 204, 27-40 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA    <     1  >   121     Ser-tRNA
BASE COUNT       24 a     36 c     34 g     27 t
ORIGIN      
        1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ctccggcgac
       61 tccggtcttt aaaaccggca gtcgctccgg cgactcatag gttcaaatcc tatcgcctcc
      121 g
//
LOCUS       PT4SEA        122 bp ds-DNA             PHG       11-AUG-1990
DEFINITION  Bacteriophage T4 mutant Ser-tRNA suppressor gene.
ACCESSION   M35987
KEYWORDS    serine transfer-RNA.
SOURCE      Bacteriophage T4 (strain R51) DNA.
  ORGANISM  Bacteriophage T4
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 122)
  AUTHORS   McClain,W.H.
  TITLE     Specific duplications fostered by a DNA structure containing
            adjacent inverted repeat sequences
  JOURNAL   J. Mol. Biol. 204, 27-40 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    tRNA    <     1  >   122     tRNA-Ser
BASE COUNT       24 a     35 c     36 g     27 t
ORIGIN      
        1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ctccggcgac
       61 tgccggtttt aaagaccggc agtcgctccg gcgactcata ggttcaaatc ctatcgcctc
      121 cg
//
LOCUS       VFARDNAA     1234 bp ds-DNA             PLN       11-AUG-1990
DEFINITION  V.faba ribosomal spacer DNA repeat.
ACCESSION   M36318
KEYWORDS    direct repeat; spacer DNA.
SOURCE      V.faba DNA, clone VER 1.
  ORGANISM  Vicia faba
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 1234)
  AUTHORS   Kato,A., Yakura,K. and Tanifuji,S.
  TITLE     Repeated DNA sequences found in the large spacer of Vicia faba rDNA
  JOURNAL   Biochim. Biophys. Acta 825, 411-415 (1985)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rpt          78      228     direct repeat copy A
    rpt        1064     1123     direct repeat copy A
BASE COUNT      320 a    329 c    304 g    281 t
ORIGIN      
        1 agacaagtcc atgcaaagga ctaggcaaga acgcattgcg taggccatgg tttgcgttgc
       61 ttccgcctca tggcccaccg acacgcgaaa agccgaaaaa catgaaggga tcgcccgaat
      121 ggtacgaaac cttgcattcc atctcaattt ggtgtcagat taatggatga aaaaattcag
      181 accggaactc attgcgtagc ccgacgttgg cttgctttcc gccccttgga gaaaccgcac
      241 acgcacgcgt tccggtcgga aattgaaagt agaccgaggg cttgtcgaaa cgctacgaaa
      301 ctccttggac actctcaatg tgttatttgg gaagtgcatg caaaaatcca tcgggagatt
      361 attcccctaa ggcgcgtttg cctccgctcc cgtcggaaaa accgcacggg ggcccgttcg
      421 ggccggaaat tcaaaattgt cgaaacgcta cgaaactttg cagatccctt ctatgtgtta
      481 tttgggactt ccatgcaaaa attcatcggg agattattcc cctaaggcgc gtttgcctcc
      541 gctcccgttc atggagaaac cgcacacgca cgcgttccgg tcggaaattg aaagtagacc
      601 gagggcttgt cgaaacgcta cgaaactcct tggacactct caatgtgtta tttgggaagt
      661 gcatgcaaaa atccatcggg agattattcc cctaaggcgc gtttgcctcc gctcccgtcg
      721 gaaaaaccgc acgggggccc gttcgggccg gaaattcaaa attgtcgaaa cgctacgaaa
      781 ctttgcagat cccttctatg tgttatttgg gacttccatg caaaaattga tcgggagatt
      841 attcccctaa ggcgcgtttg cctccgctcc cgttcatgga gaaaccgcac acgcacgcgt
      901 tccggtcgga aattgaaagt agaccgaggg cttgtcgaaa cgctacgaaa ctccttggac
      961 actctcaatg tgttatttgg gaagtgcatg caaaaatcca tcgggagatt attcccctaa
     1021 ggcgcgtttg cctccgctcc cgtcggaaaa accgcacggg ggcccgacac gcgaaaagcc
     1081 gaaatatatc gagggatcgc ccgaatggca tgaaacttgg catgccatct caatttggtg
     1141 gcagattaat gcatgaaaaa ttccagaccg gaactcattc cgtagcccga cgtttgtttc
     1201 cttccgcccc ttgaaaaaaa ctcaattgca cgcg
//
LOCUS       VFARDNAB      149 bp ds-DNA             PLN       11-AUG-1990
DEFINITION  V.faba ribosomal spacer DNA repeat.
ACCESSION   M36319
KEYWORDS    .
SOURCE      V.faba DNA, clone VER 6.
  ORGANISM  Vicia faba
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Rosidae; Rosales; Fabaceaea.
REFERENCE   1  (bases 1 to 149)
  AUTHORS   Kato,A., Yakura,K. and Tanifuji,S.
  TITLE     Repeated DNA sequences found in the large spacer of Vicia faba rDNA
  JOURNAL   Biochim. Biophys. Acta 825, 411-415 (1985)
  STANDARD  simple staff_review
BASE COUNT       39 a     40 c     35 g     35 t
ORIGIN      
        1 ccgacacgcg aaaagccgaa atatatcgag ggatcgcccg aatggcatga aacttggcat
       61 gccatctcaa tttggtggca gattaatgca tgaaaaattc cgaccggaac tcattccgta
      121 gcccgacgtt tgtttgcttc cgccccttg
//
LOCUS       XEBEBNA1      185 bp ds-DNA             VRT       11-AUG-1990
DEFINITION  X.borealis external transcribed spacer and 18S rRNA gene, 5' end.
ACCESSION   M36006
KEYWORDS    18S ribosomal RNA.
SEGMENT     1 of 2
SOURCE      X.borealis DNA, clone pXbr101.
  ORGANISM  Xenopus borealis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 185)
  AUTHORS   Furlong,J.C. and Maden,B.E.H.
  TITLE     Patterns of major divergence between the internal transcribed
            spacers of ribosomal DNA in Xenopus borealis and Xenopus laevis,
            and of minimal divergence within ribosomal coding regions
  JOURNAL   EMBO J. 2, 443-448 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA         23  >   185     18S ribosomal RNA
    site          1       22     external transcribed spacer
BASE COUNT       43 a     44 c     46 g     52 t
ORIGIN      
        1 ggttcccccc ggagccgagg gctacctggt tgatcctgcc agtagcatat gcttgtctca
       61 aagattaagc catgcacgtg taagtacgca cggccggtac agtgaaactg cgaatggctc
      121 attaaatcag ttatggttcc tttgatcgct ccatctgtta cttggataac tgtggtaatt
      181 ctaga
//
LOCUS       XEBEBNA2     1589 bp ds-DNA             VRT       11-AUG-1990
DEFINITION  X.borealis 18S rRNA gene, 3' end, 5.8S rRNA gene, complete,
            ITS2, and 28S ribosomal RNA gene, 5 ' end.
ACCESSION   M36008 M36010 M36012 M36014 M36016
KEYWORDS    18S ribosomal RNA; 28S ribosomal RNA; 5.8S ribosomal RNA.
SEGMENT     2 of 2
SOURCE      X.borealis DNA, clone pXbr101.
  ORGANISM  Xenopus borealis
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; 
            Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae.
REFERENCE   1  (bases 1 to 1589)
  AUTHORS   Furlong,J.C. and Maden,B.E.H.
  TITLE     Patterns of major divergence between the internal transcribed
            spacers of ribosomal DNA in Xenopus borealis and Xenopus laevis,
            and of minimal divergence within ribosomal coding regions
  JOURNAL   EMBO J. 2, 443-448 (1983)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    rRNA    <     1        1     18S ribosomal RNA
    site        401      954     internal transcribed spacer 1 (ITS1)
    rRNA        955     1116     5.8S ribosomal RNA
    site       1117     1452     internal transcribed spacer 2 (ITS2)
    rRNA       1453  >  1589     28S ribosomal RNA
BASE COUNT      287 a    560 c    546 g    196 t
ORIGIN      About 1500 bp after segment 1.
        1 gaattcccag taagtgcggg tcataagctc gcgttgatta agtccctgcc ctttgtacac
       61 accgcccgtc gctactaccg attggatggt ttagtgaggt cctcggatcg gccccgccgg
      121 ggtcggcaac ggccctggcg gagcgccgag aagacgatca aacttgacta tctagaggaa
      181 gtaaaagtcg taacaaggtt tccgtaggtg aacctgcgga aggatcatta accgcccgtc
      241 gctactaccg attggatggt ttagtgaggt cctcggatcg gccccgccgg ggtcggcaac
      301 ggccctggcg gagcgccgag aagacgatca aacttgacta tctagaggaa gtaaaagtcg
      361 taacaaggtt tccgtaggtg aacctgcgga aggatcatta acgagagagg gcgagaggct
      421 cacggtgcga gagcgcggct ctgccgcacc cccgccaccg tcccggctcc ccccgcggac
      481 gacgacaggt ccgggcgcgg gctcccgccc gccccgacac gaggagagag cggaaaagga
      541 acaacaacaa aaaaaacgag cccgggtacc tagccggggg ggggcgaggg cggggacggg
      601 gggcagaggg ccctccgacc ccccctccct cctcccgggc ggttcgaaga ccccggccca
      661 cggccggcgg cggcgacgag gggcgagccg tccggaggcg gggggcgggc gcggaggggg
      721 gacggggggg aggcggaggt cacggccgga cgcggccgcg tccctccgcc cgcccccccc
      781 ccgtctgtcc gcgccccgcc ccgcgcgccg gtcgcgtcag tccccgccgc cccgtcggcg
      841 gcgccccggg gcgcccgggg cccgggggcc cgccccgtcc agcttccacg gcgcccgcgc
      901 gcgccggtac ccctgccgcg aggcaacgag agagaaaaac aaacgaaaaa aatctcgcga
      961 ctcttagcgg tggatcactc ggctcgtgcg tcgatgaaga acgcagctag ctgcgagaat
     1021 tagtgtgaat tgcaggacac attgatcatc gacacttcga acgcaccttg cggccccggg
     1081 ttcctcccgg ggctacgcct gtctgagggt cgctccgacg tccatcgccc ccgccgggtc
     1141 cccgacccgg ggaaaacggg cggcgcggct ggggccgtcg caggggtcga gggacgcccc
     1201 cccaccgccc ttccggggga ggtgaggggg ccccgcgacc ccttcgtccc cccaaggcca
     1261 gaccccgacc atcccggcgg ccccccccag ggccccggcg cggctgtctg tgggaaaccg
     1321 atttcacggc tgccggcgcc gcgggcgctg gggacccgcc ggggcgggaa gagcccccgg
     1381 gcgggccggg gccggggaag gcgcgcttga acccgcgccc ccccgcgtcc cccgcccgcc
     1441 cccccgctcg actcagacct cagatcagac gtggcgaccc gctgaattta agcatattac
     1501 taagcggagg aaaagaaact aaccaggatt cccccagtaa cggcgagtga agagggaaga
     1561 gcccagcgcc gaatccccgc ccgcccggc
//
LOCUS       BOVTGFB      1117 bp ss-mRNA            MAM       11-AUG-1990
DEFINITION  Bovine transforming growth factor-beta-1 (TGF beta-1) mRNA,
            3' end.
ACCESSION   M36271
KEYWORDS    transforming growth factor-beta-1.
SOURCE      Bovine fibropapilloma, cDNA to mRNA, (library of Okayama and Berg).
  ORGANISM  Bos taurus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae.
REFERENCE   1  (bases 1 to 1117)
  AUTHORS   Van Obberghen-Schilling,E., Kondaiah,P., Ludwig,R.L.,
            Sporn,M.B. and Baker,C.C.
  TITLE     Complementary deoxyribonucleic acid cloning of bovine transforming
            growth factor-beta-1
  JOURNAL   Mol. Endocrinol. 1, 693-698 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1      950     transforming growth factor-beta-1 precursor
                                 (AA 75 at 3)
    matp        612      947     transforming growth factor-beta-1
BASE COUNT      244 a    379 c    301 g    193 t
ORIGIN      
        1 aggccatact ggccctttac aacagtaccc gcgaccgggt ggccggggaa agtgccgaaa
       61 cggagcctga gccagaggcg gactactacg ccaaggaggt cacccgcgtg ctaatggtgg
      121 aatacggcaa caaaatctat gacaaaatga agtctagctc gcacagcata tatatgttct
      181 tcaacacgtc cgagctccgg gaagcggtgc ccgaacctgt gttgctctct cgggcagacg
      241 tgcgcctgct gaggctcaag ttaaaagtgg agcagcacgt ggagctgtac cagaaatata
      301 gcaacaattc ctggcgctac ctcagcaacc ggctgctcgc ccccagcgac tcaccggagt
      361 ggctgtcctt tgacgtcact ggagttgtgc ggcagtggct gacccgcaga gaggaaatag
      421 agggctttcg cctcagtgcc cactgttcct gtgacagtaa agataacacg cttcaagtgg
      481 acattaacgg gttcagttcc ggccgccggg gtgacctcgc caccattcac ggcatgaacc
      541 ggcccttcct gctcctcatg gccacccctc tggagagggc ccagcacctg cacagctccc
      601 gccaccgccg agccctggac accaactact gcttcagctc cacagaaaag aactgctgtg
      661 ttcgtcagct ctacattgac ttccggaagg acctgggctg gaagtggatt catgaaccca
      721 aggggtacca cgccaatttc tgcctggggc cctgccctta catctggagc ctggatacac
      781 agtacagcaa ggtcctggcc ctgtacaacc agcacaaccc gggcgcttcg gcggcgccgt
      841 gctgcgtgcc tcaggcgctg gagcccctgc ccatcgtgta ctacgtgggc cgcaagccca
      901 aggtggagca gttgtccaac atgatcgtgc gctcctgcaa gtgcagctga ggccccgtcc
      961 caccccaaca gcccccgccc cgtagccccg ccacccggca gcccggcccc gcccccgccc
     1021 gcctcaccag gactgtattt aaggacacag caccaccccc ccccccccac tcccatcaag
     1081 cccacctggg gtccattaaa ggtggcgaga ggaaaaa
//
LOCUS       FSBPROLA     1148 bp ss-mRNA            VRT       11-AUG-1990
DEFINITION  O.keta (chum salmon) prolactin mRNA, complete cds.
ACCESSION   M36267
KEYWORDS    prolactin.
SOURCE      O.keta pituitary gland, cDNA to mRNA.
  ORGANISM  Oncorhynchus keta
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; 
            Actinopterygii; Salmoniformes; Salmonoidei; Salminidae.
REFERENCE   1  (bases 1 to 1148)
  AUTHORS   Song,S., Trinh,K.-Y., Hew,C.L., Hwang,S.-J., Belkhode,S. and
            Idler,D.R.
  TITLE     Molecular cloning and expression of salmon prolactin cDNA
  JOURNAL   Eur. J. Biochem. 172, 279-285 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         19      654     prolactin precursor
    sigp         19       87     prolactin signal peptide
    matp         88      651     prolactin
    mRNA          1     1148     prolactin mRNA
BASE COUNT      319 a    284 c    239 g    306 t
ORIGIN      
        1 gtccgaaaag aaggaaagat ggctcgccga tcccagggta ccaaactcca cttagcagtt
       61 ctgtgtctag ttgtgtcctg tcatgccatt ggccttagtg acctaatgga gagagcttcc
      121 cagcgatcag acaagcttca ctcactcagc acttccctca ccaaggacct ggactctcac
      181 ttcccaccaa tgggacgagt gatgatgcca cgcccgtcta tgtgtcacac ctcctcgctc
      241 cagacaccca aggacaagga gcaagcactc aaggtatcgg agaatgagct gatctccctg
      301 gctcgctacc tcctcctggc ctggaacgat cccctgctgc tgctctcctc agaggcgccg
      361 actctgcccc acacgccgtc caatggcgac atcagcagta agatcaggga actgcaggac
      421 tactccaaga gcctgggaga tggactggac ataatggtca acaagatggg accctcctcc
      481 cagtacattt cttcaatccc cttcaagggt ggagacctcg gcaatgacaa gacctcccgc
      541 ctcatcaact tccacttcct catgtcttgc ttccgcaggg actcccacaa aatcgacagt
      601 ttcctcaagg tccttaggtg ccgggctaca aatatgcgac cagaaacatg ttaggagaaa
      661 atggcaggat tttggttctg gattgttcca ttttcaaact gatagtcaaa atggggtagg
      721 aattcaagag agtgattgtc ttgttgtagt ttgattttgt gaaataacta atgctgccac
      781 ctacatcaca tttggactat tcatagatct atacagttag tagtcatcct ggttatctga
      841 taccgacatg tttacctata catatcttta taggtagttt atgttcggta atgcattatg
      901 catcagaaaa atgatgcaac ctaagtattg gtcacttata tgatcatatt caatagattc
      961 atatatagat aacaaaggat cctcctagtc attcttagta gatatacagg atggctgata
     1021 ggtagatgcc atgaaccgtc atgattaaca atctagaact tgaatatagg taactcagcc
     1081 tttgtaaact attgataatt gcttccaaga gttctgttta cacaataaat caagtaagat
     1141 tttgcaat
//
LOCUS       PETIRAB       183 bp ds-DNA             ORG       11-AUG-1990
DEFINITION  P.hybrida chloroplast rps19' and rp12 genes, partial cds.
ACCESSION   M35955
KEYWORDS    inverted repeat; rp12; rps19'.
SOURCE      Petunia hybrida chloroplast DNA.
  ORGANISM  Chloroplast Petunia hybrida
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae; Petunia hybrida.
REFERENCE   1  (bases 1 to 183)
  AUTHORS   Aldrich,J., Cherney,B.W., Williams,C. and Merlin,E.
  TITLE     Sequence analysis of the junction of the large single copy region
            and the large inverted repeat in the petunia chloroplast genome
  JOURNAL   Curr. Genet. 14, 487-492 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    >   183      143 (c) rp12 gene product (AA at 181)
    pept         91  <     1 (c) rps19' gene product (gtg start codon)
BASE COUNT       58 a     28 c     26 g     71 t
ORIGIN      
        1 cgcccatacc ataattcaga attcatcaaa tattagataa atttaataaa tgattggcta
       61 caaaggggtt tttttttagt gaacgtgtca caagcttact cctatttttt ttttttttgt
      121 aaagtttttg taaaacgaag aattaaattc gatttcctct cctatttact atggcgacga
      181 aga
//
LOCUS       PETIRABX      162 bp ds-DNA             ORG       11-AUG-1990
DEFINITION  P.hybrida chloroplast rps19 and rp12 genes, partial cds.
ACCESSION   M37322
KEYWORDS    inverted repeat; rp12; rps19.
SOURCE      Petunia hybrida chloroplast DNA.
  ORGANISM  Chloroplast Petunia hybrida
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Asteridae; Solanales; Solanaceae; Petunia hybrida.
REFERENCE   1  (bases 1 to 162)
  AUTHORS   Aldrich,J., Cherney,B.W., Williams,C. and Merlin,E.
  TITLE     Sequence analysis of the junction of the large single copy region
            and the large inverted repeat in the petunia chloroplast genome
  JOURNAL   Curr. Genet. 14, 487-492 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1       41     rp12 gene product (AA at 3)
    pept         92  >   162     rps 19 gene product (gtg start codon)
BASE COUNT       70 a     26 c     21 g     45 t
ORIGIN      
        1 tcttcgtcgc catagtaaat aggagaggaa atcgaattta attcttcgtt ttacaaaaac
       61 ttacaaaaaa aaaaaaaata ggagtaagct tgtgacacgt tcactaaaaa aaaacccctt
      121 tgtagccaat catttattaa ataaaattga taagcttaac ac
//
LOCUS       RATATROPO    1633 bp ds-DNA             ROD       11-AUG-1990
DEFINITION  Rat alpha-tropomyosin and N5 genes, 5' flank.
ACCESSION   J05467
KEYWORDS    alpha-tropomyosin.
SOURCE      Rat (Sprague-Dawley) adult liver DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1633)
  AUTHORS   Herrera,V.L.M. and Ruiz-Opazo,N.
  TITLE     Regulation of alpha-tropomyosin and N5 genes by a shared enhancer
  JOURNAL   J. Biol. Chem. 265, 9555-9562 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            N.Ruiz-Opazo, 04-JUN-1990.
FEATURES       from  to/span     description
    signal      569      772     enhancer
BASE COUNT      424 a    509 c    405 g    294 t      1 others
ORIGIN      1 bp upstream of HindIII site.
        1 aagcttaaag agatgggggg ggggggtgtc ctcctgcctt cagtgcagac ctggctggcc
       61 cctttccctc tgcagagttc accctacgga ttgaaccatt tccgtagcta atttgtagca
      121 tgttttcagc accagcaccc aagtcgctaa cagagaaaag aaggacnccc accccccacc
      181 cccaatagat ccgagcaagg aaaaaaatga aaagctagat ctgaagggat tttaggaatc
      241 cactaccctc tgaatttaga tatgaggaaa cctgaactaa caagatggat ggctcagcag
      301 gtaacggagc ttgctatcaa gctcaaagac ctgtgcacgg atcgaacacc cacatggtag
      361 aaacaactga ttcttaaaga gttgctctct gaccgccaca gcaccaccac gccacgccca
      421 cgcccctata agtaaacgac taaagtaagg gaagctacag catggaagct gtttctaatc
      481 acagctacac aatgtagctg ttagctagct aatgactgaa gaccagaccc ctcccactcc
      541 ccacgatgtt aactgagtct aaactcaaca cacacacaca cacacacaca cacacacaca
      601 cacacacctt ggcacactcc ttctaccctt tccaaagcac agcacaggca cataactcag
      661 caaaccacac acacacacac acacacacac acacacacac acacacacac acaccttggc
      721 acactccttc taccctttcc aaagcacagc acaggcacat aactcagcaa accgggccct
      781 aagttcctaa tgtattttac tgcccacctc cagcttgggt gaaacgattg ggagacttcc
      841 tgtgtgtatc attatgccca ccatcacggg cagagagaca gggcctcagg aatggccccc
      901 acaccttcct ccacctctat ttctctgcgt gggttgaaat ttccagactg tgagagccca
      961 ggactatgac tggtgggtgg actcagagct gctagaagga cgcagctagg tactgagaac
     1021 tcccaggccc agatcgaagg tgcggtggct gtgactggca gcccccggac cttacagtga
     1081 gcaccccagg acccggaggc caagtgagat ggtgcgcaca caccacctgc cctgttcccg
     1141 ggattccctg tgactcccga cagactcttc tggggaccct gaccgccagg cgacggctct
     1201 gggaggtggc tagagactcc tcgccagtac cgggctcctc tccagcgagg tccgggagct
     1261 ctttcccagg cgtgcaggtt tagaaattaa ggggagcctc gcccggccgc ggtctcggag
     1321 gctcgggtcc ccatatatag tcatatccac cgtcgactgg gagcggcagg cggcagcgaa
     1381 tggcgtgagg ccccgcggga ggagcgagga gggggcatgg ggcggaggca ggagactggg
     1441 aaagggggaa ggaggaacaa gcttttccaa aaaagtattg gctgacctaa ggaatgcggt
     1501 cgccccctgg gaaagtacat atcagggagc agcaggcacg tccgcgctcg caccttccgg
     1561 cctctgccac cctaccgcgc tcgcgtctcc tctggctcct ctcgctaagg gcccctcgcc
     1621 accgccacca tgg
//
LOCUS       STMXYLI      1201 bp ds-DNA             BCT       11-AUG-1990
DEFINITION  S.violaceoniger xylose isomerase gene, complete cds.
ACCESSION   M36269
KEYWORDS    glucose isomerase; xylose isomerase.
SOURCE      S.violaceoniger DNA.
  ORGANISM  Streptomyces violaceoniger
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 1201)
  AUTHORS   Drocourt,D., Bejar,S., Calmels,T., Reynes,J.P. and Tiraby,G.
  TITLE     Nucleotide sequence of the xylose isomerase gene from Streptomyces
            violaceoniger
  JOURNAL   Nucleic Acids Res. 16, 9337-9337 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept         24     1193     xylose isomerase
BASE COUNT      187 a    455 c    402 g    157 t
ORIGIN      
        1 cgtacggaac aaggagcagc gccatgagct tccagcccac ccccgaggac aagttcacct
       61 tcggtctgtg gaccgtcggc tggcagggaa gggacccgtt cggcgacgcc acccgccctg
      121 ccctcgaccc ggtcgagacc gtgcagcgcc tggccgagct gggcgcctac ggagtgacct
      181 tccacgacga cgacctgatc cccttcgggt cgtccgacac cgagcgcgag tcgcacatca
      241 agcggttccg ccaggccctg gacgccactg gcatgacggt gccgatggcc accacgaacc
      301 tcttcaccca ccccgtcttc aaggacggcg gtttcaccgc caacgaccgc gacgtgcgcc
      361 gctacgcgct gcgcaagacg atccgcaaca tcgacctggc ggccgagctg ggcgccaaga
      421 cgtacgtcgc ctggggcggc cgtgagggcg ccgagtccgg tggcgccaag gacgtgcgcg
      481 acgccctcga ccgcatgaag gaggcgttcg acctcctcgg cgagtacgtc accgcccagg
      541 gctacgacct ccgcttcgcc atcgagccca agcccaacga gccccgcggc gacatcctcc
      601 tgcccaccgt cggccacgcc ctggccttca tcgagcgcct ggagcgcccg gagctgtacg
      661 gcgtcaaccc ggaggtcggc cacgagcaga tggccggcct gaacttcccg cacggcatcg
      721 cgcaggccct gtgggcgggc aagctcttcc acatcgacct caacggccag tccggcatca
      781 agtacgacca ggacctgcgg ttcggcgccg gcgacctgcg ggcggcgttc tggctggtcg
      841 acctcctgga gagcgccggt tacgagggcc cgcggcactt cgacttcaag ccgccgcgga
      901 ccgaggactt cgacggcgtg tgggcctcgg ccgagggctg catgcgcaac tacctgatcc
      961 tcaaggagcg cgcggccgcc ttccgcgccg acccggaggt gcaggaggcc ctgcgcgccg
     1021 cgcgtctgga ccagctggcc cagccgaccg cggcggacgg cctggaggcc ctgctcgccg
     1081 accgcaccgc gttcgaggac ttcgacgtgg aggcggccgc cgcgcgcgcg gcatggccgt
     1141 tcgaacgcct cgaccagctg gcgatggacc acctgctggg cgcgcgcggc tgacggtcgc
     1201 g
//
LOCUS       BNAALS       3315 bp ds-DNA             PLN       11-AUG-1990
DEFINITION  B.napus chloroplast acetolactate synthase (ALS) gene, complete cds.
ACCESSION   M20498
KEYWORDS    acetolactate synthase.
SOURCE      B.napus (cv. Westar) 3 week old leaf DNA.
  ORGANISM  Brassica napus
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 3315)
  AUTHORS   Wiersma,P.A., Schmiemann,M.G., Condie,J.A., Crosby,W.L. and
            Moloney,M.M.
  TITLE     Isolation, expression and phylogenetic inheritance of an
            acetolactate synthase gene from Brassica napus
  JOURNAL   Mol. Gen. Genet. 219, 413-420 (1989)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by W.L.Crosby, 06-DEC-1988, for release after publication.
FEATURES       from  to/span     description
    pept        784     2697     acetolactate synthase (ALS) precursor (EC
                                 4.1.3.18)
    sigp        784     1011     acetolactate synthase (ALS) signal peptide
    matp       1012     2694     acetolactate synthase (ALS)
    signal     2111     2116     transcriptional stop sequence (put.)
    signal      446      459     promoter element (put.)
    signal      665      668     CAAT box
    signal      705      709     TATA box
    signal     2894     2899     polyA signal
BASE COUNT      954 a    712 c    702 g    947 t
ORIGIN      
        1 tctagatata aaagtatttt attacttaaa aagtatattt ttttgttaat taaagatatc
       61 ttttagatgg actattacta ttttgtgaac tttctttttt ttctgaaatc atattatata
      121 tacatatttt tttgttttta aattattttt taaaacttta taaactttcc ttttttattt
      181 atattgttat ttgaaaaatt ttaagaaagt aaagaataat atttaaatgt acttttttta
      241 tttcattagg gatatcatag taatcaacca ccgtgagagt taacgtgagt gcgacaccta
      301 aaaaagtgac ttctcaaata atattatgga gatgtatcta caagaaaaat aaacattgtt
      361 gtgtaaattt tgaataaatg actgagggca tccacatggt aaaatttact aagagtatct
      421 gaacaaaaat attaaaataa aaataaaaag aaagaaagag aaatcagaaa acaagagaag
      481 ccattggtta agaagagact cacacattct tttgacagag tttttctacg gatgacattt
      541 attgaatgag acaacaaatt ttacagcatt aagttttatt catctaatta acctattaat
      601 aattttatgt taagagacat ggataaagaa acttattgct gcagacgctc tgagatgggg
      661 ttggcaatat tagttacgag ccaacttgaa gtgtcacatg catgtataaa ttttagttct
      721 ctctcacaga gtttatttac tcattcaaat taagcaattt ctcgcaacac tccattttgc
      781 accatggctt cgttttcgtt cttcggcacc attccgtcgt ctcccacaaa agcttccgtc
      841 ttctccctgc cggtgtcggt aactacgctc ccgtccttcc cgcgccgccg tgctactcgt
      901 gtctccgttt ccgccaactc gaagaaagac caagaccgca cagcttcacg tcgagagaat
      961 ccgagcacat tcagctccaa atacgctccc aacgtgcccc gcagtggcgc agacatcctg
     1021 gtcgaagccc tggagcgtca aggagtggac gtagtcttcg cttacccagg aggcgcatca
     1081 atggagatcc atcaagccct aactcgctcc aacacaatcc gaaacgtcct tccccgtcac
     1141 gaacaaggag gtatcttcgc cgccgagggt tacgctcgtt cctccggtaa acccggaatc
     1201 tgcatcgcca cttccggtcc aggagctatg aatctcgtca gcggattagc cgacgccctg
     1261 tttgacagcg tacccctcat cgcaatcaca ggacaggtcc ctcgccggat gattggtacc
     1321 atggcgttcc aggagacacc cgttgttgag gtaacgagga ctataacgaa acataactat
     1381 cttgttatgg aagttgatga tatacctagg atcgttcgag aagctttctt tctagctact
     1441 tcggttagac cgggaccggt tcttatagac gtccccaaag atgttcagca acagtttgcg
     1501 attcctaact gggaacagcc tatgcgctta cctctttaca tgtctacgat gcctaaaccc
     1561 cccaaagttt ctcacttaga gcagattctt aggttggttt cggagtctaa gaggcctgtc
     1621 ttgtacgttg gaggtggttg tctgaactcg agtgaggaac tgcgcagatt tgtggaactt
     1681 actggcatcc ctgttgctag tacgttcatg ggacttggat cgtatccttg tgacgatgaa
     1741 gagttctctc tgcaaatgct aggaatgcat ggaacagtgt acgctaatta cgctgtcgag
     1801 tatagcgatc ttctgcttgc ttttggggtt aggtttgacg accgtgtgac cggaaagctt
     1861 gaggcctttg ctagccgggc caagatcgtg catattgata ttgattctac cgaaatcggg
     1921 aagaacaaga cacctcatgt gtcggtgtgt tgtgatgttc agctagcctt gcaagggatg
     1981 aacgaggttc ttgagaaccg acgagatgtg cttgacttcg gggaatggag atgtgaattg
     2041 aacgaacaga gactaaagtt ccctctccgc tacaagacgt ttggggaaga gattcctcca
     2101 cagtacgcca ttcaactact tgacgagcta accgacggga aggcaattat cactactggt
     2161 gtcgggcaac accagatgtg ggccgcccaa ttctacagat tcaagaaacc ccgccaatgg
     2221 ctgtcttcag gaggcctagg agccatgggg ttcggtcttc ctgcagccat gggagccgct
     2281 atagccaacc cgggagcagt ggttgtcgac attgatgggg atggtagctt catcatgaac
     2341 attcaagaac tggcaaccat cagggttgag aatcttccag tcaaggtttt gctgattaat
     2401 aatcagcacc tcggaatggt ccttcagtgg gaagaccact tctacgcagc taacagagcc
     2461 gattcttttc tgggagaccc ggcgaaccct gaggcggtat tcccggatat gctgttgttc
     2521 gccgcatcgt gcggtatacc agccgccagg gtcaccagaa gggaggacct ccgagaggca
     2581 atccagacaa tgctggacac acctggacca ttcttgttgg atgtggtctg tcctcaccag
     2641 gaccatgtgt taccactcat ccctagtggc ggcaccttca aggacattat tgtgtagtac
     2701 tactttaatg tttgattatt ctatgctctc ttctgttttg taataacaaa gtaaaaccag
     2761 tttgcaatat cagaagatct tctgattatc aatatcatgc tactctcata atgaaaaacg
     2821 caaacttcac tctttattta tatgagaata ataactgtag tgtttacata acaatttata
     2881 agtatgaaag actaataaaa aaaaagttat atacttgata aaaaggcaga agtaagttta
     2941 caaatgtagt aggccatcat aacgatctga taaggaattt ttgtataggg actaatacct
     3001 ctatgaagga gaacatgtat cgcaacgagt tgacatatgt gtgaacctca gatggactcg
     3061 atgttaagtt ctacgttcct gcatgaaaaa aaaaaaacat acaatttttt tcaagcatgg
     3121 gtaaccttat tgaatacgaa caatccagac gaaaatgggg caaaactacc ttgagatctg
     3181 cacctgaaca gaaaactcca ggaaccagac ctgtgatcat cacaactcta aaagctgaac
     3241 tatcctgctg tatggtctca aatgtgttct gcatctcctt atcaatagca tttttcggct
     3301 actggcctgt ctaga
//
LOCUS       HUM5AR       2102 bp ss-mRNA            PRI       11-AUG-1990
DEFINITION  Human steroid 5-alpha-reductase mRNA, complete cds.
ACCESSION   M32313
KEYWORDS    dihydrotestosterone; steroid 5-alpha-reductase.
SOURCE      Human adult prostate, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2102)
  AUTHORS   Andersson,S. and Russell,D.W.
  TITLE     Structural and biochemical properties of cloned and expressed human
            and rat steroid 5-alpha-reductases
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 3640-3644 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            D.W. Russell, 23-FEB-1990, for release after publication.
FEATURES       from  to/span     description
    pept         31      810     steroid 5-alpha-reductase (EC 1.3.99.5)
    mRNA    <     1     2102     steroid 5-alpha-reductase mRNA
BASE COUNT      470 a    482 c    486 g    664 t
ORIGIN      
        1 gggcatggag cacgctgccc agccctggcg atggcaacgg cgacgggggt ggcggaggag
       61 cgcctgctgg ccgcgctcgc ctacctgcag tgcgccgtgg gctgcgcggt cttcgcgcgg
      121 aatcgtcaga cgaactcagt gtacggccgc cacgcgctgc ccagccacag gctccgagtg
      181 ccggcgcggg ccgcctgggt ggtgcaggag ctgccctcgc tggccctgcc gctctaccag
      241 tacgccagcg agtccgcccc gcgtctccgc agcgcgccca actgcatcct cctggccatg
      301 ttcctcgtcc actacgggca tcggtgctta atttacccgt ttctgatgcg aggaggaaag
      361 cctatgccac tgttggcatg tacaatggcg attatgttct gtacctgtaa cggctatttg
      421 caaagcagat acttgagcca ttgtgcagtg tatgctgatg actgggtaac agatccccgt
      481 tttctaatag gttttggctt gtggttaaca ggcatgttga taaacatcca ttcagatcat
      541 atcctaagga atctcagaaa accaggagat actggataca aaataccaag gggaggctta
      601 tttgaatacg taactgcagc caactatttt ggagaaatca tggagtggtg tggctatgcc
      661 ctggccagct ggtctgtcca aggcgcggct tttgctttct tcacgttttg ttttttatct
      721 ggtagagcaa aagagcatca tgagtggtac ctccggaaat ttgaagagta tccaaagttc
      781 agaaaaatta taattccatt tttgttttaa gtgcgttttt catgaaatta tcttcaactt
      841 gaagctttcc aatggcgctt ctctatggac tttgtaaata agttatatct ttgtaatttt
      901 cctgctactt tatcattttc aagatgtcct ctaggaattt tttttctagt aattttgcaa
      961 tctacctaat aagtacctaa atacgctgaa atggaggttg aatatcctac tgtgtaacag
     1021 gtcagaattt caagctctgg gtaataactg ctgatatttt ttctaatttc aaatttacct
     1081 cttttggcta tgtcttgcca agtgtgtatg agactagact ttacaactgt ctttgatggc
     1141 attttcagaa caataaatgt cacaatccct tctatagccc cctacagtga tctcttcaag
     1201 gtcaactgca gtgttgcttc cctcccccta tagggctgga atctgtctag gagccctctc
     1261 tcggaggcca cagaggctgg gggtagccat tgtgcagtca tggcccgggg gaaacttgcc
     1321 aaccttcgtg tcaggtgctg tgtgtaagtg gagaacttgg ggatagagga ggaagctcct
     1381 cgtggccctt ccaaggtgag gcaaaggcat ctggacttgt tccagcccag cccaccgggt
     1441 gacatcaccg ggcagggagg ggtgctggtg gtggttcata cggagtaagc tgctctgcct
     1501 gtgtgagtgg ctcctgggcc ctaaacaggc acctttaggc catgggtcac tcaccgtgag
     1561 ccatcaatgt gctctggtct gacatggttt ctctctgtct tctagtctag acctagtttt
     1621 tttgttctgt tccccacgta tggatatagt agagattgtt gtctgtgaaa tttctctttt
     1681 gtagattttg agttttccct tgtagtgtaa agaatgatca ctttctgtaa caataacaag
     1741 accacttttt aagatttatc ctgtttgttc tttgttgatt gaaacataat aattgttaaa
     1801 attctctaca gccttctttt tcttccatag ctaatcttcc ttctaatagt ttttgctttc
     1861 tgttttgctg ttgttgcttt gcaaagcttt cccctcatag cctgtacctg ttatcaatat
     1921 aaaataatct tcctgttgaa tgcttcatga cttgaattct actttgataa aaacattgcc
     1981 atactgcttt ttatcttgat gaattcatct ggcattgctt tgccttatca tctcatctgg
     2041 agtttttaaa tgccatttgt ttcagttgtc tttaacaaca taataaatag actttgccat
     2101 tt
//
LOCUS       SPIBADH      1797 bp ss-mRNA            PLN       11-AUG-1990
DEFINITION  Spinach betaine-aldehyde dehydrogenase (BADH) mRNA, complete cds.
ACCESSION   M31480
KEYWORDS    betaine-aldehyde dehydrogenase.
SOURCE      Spinach salt-stressed leaf, cDNA to mRNA.
  ORGANISM  Spinacia oleracea
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; 
            Caryophyllidae; Caryophyllales; Chenopodiaceae.
REFERENCE   1  (bases 1 to 1797)
  AUTHORS   Weretilnyk,E.A. and Hanson,A.D.
  TITLE     Molecular cloning of a plant betaine-aldehyde dehydrogenase, an
            enzyme implicated in adaptation to salinity and drought
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 2745-2749 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            A.D.Hanson, 27-MAR-1990,, for release after publication.
FEATURES       from  to/span     description
    pept         68     1561     betaine-aldehyde dehydrogenase (BADH) (EC
                                 1.2.1.8)
BASE COUNT      504 a    324 c    443 g    526 t
ORIGIN      
        1 cgttgcgtgc tcgccttacc ctctcaactc aatttcttca acccaatttc ttcgcattta
       61 accaagaatg gcgttcccaa ttcctgctcg tcagctattc atcgacggag agtggagaga
      121 acccattaaa aaaaatcgca tacccgtcat caatccgtcc actgaagaaa tcatcggtga
      181 tattccggca gccacggctg aagatgtgga ggttgcggtg gtggcagctc gaagagcctt
      241 taggaggaac aattggtcag caacatctgg ggctcatcgt gccacatact tgcgtgctat
      301 tgctgctaag ataacagaaa aaaaagatca tttcgttaaa ctggaaacca ttgattctgg
      361 gaaacctttt gatgaagcag tgctggacat tgatgacgtt gcttcatgtt ttgaatattt
      421 tgccggacaa gcagaagctc ttgatggtaa acaaaaggct ccagtcaccc tgcctatgga
      481 aaggttcaaa agtcatgttc tcaggcagcc ccttggtgtt gttggattaa tatccccatg
      541 gaattaccca cttctaatgg ctacatggaa aattgctcca gcacttgctg ctgggtgtac
      601 agctgtactt aagccatccg agttggcatc tgtgacttgt ctagaattcg gtgaagtttg
      661 caacgaagtg ggacttcctc caggcgtgtt gaatatcttg acaggattag gtccagatgc
      721 tggtgcacca ttagtatcac accccgatgt tgacaagatt gcctttactg ggagtagtgc
      781 cactggaagc aaggttatgg cttctgctgc ccaattggtt aagcctgtta cattagaact
      841 tgggggtaaa agtcctattg tagtgtttga agatgttgat attgataaag ttgtggaatg
      901 gactattttt ggctgtttct ggacaaatgg tcaaatatgt agtgcaacgt ctagactgct
      961 tgtgcatgaa agtattgcag ctgagtttgt tgataagctt gtaaaatgga cgaaaaacat
     1021 taaaatttct gacccatttg aagaaggatg ccggcttggc cctgttatta gtaaaggaca
     1081 gtacgacaaa attatgaagt tcatatcaac agcaaagagt gagggggcaa ctattttgta
     1141 tggaggttcc cgtcctgagc atttgaagaa aggttattac attgaaccca ccattgtaac
     1201 tgatatctcc acatccatgc aaatatggaa agaggaagtt tttggccctg tcttgtgtgt
     1261 taaaacattt agttccgaag atgaagccat tgcattggca aatgatacag agtacggttt
     1321 agctgctgct gtgttttcta atgatcttga aagatgtgag aggataacga aggctctaga
     1381 agttggagct gtttgggtta attgctcaca accatgcttt gttcaagctc cttggggagg
     1441 catcaagcgt agtggttttg gacgtgaact tggagaatgg ggtatccaga attacttgaa
     1501 tatcaagcag gtgactcaag atatttctga tgaaccatgg ggatggtaca agtctccttg
     1561 aagctatgat caaatttgaa tgacggtgtt gtttttgtta agtgagcagc ggttggactg
     1621 taccttgaaa tggttcgcag agaaggtcga gtttacagta aaaatggatt gaataaaggg
     1681 ttggttgatg cagaagtcca acaagcatag cttaattttg ttgtatcatg taatagtgtg
     1741 tattatttca gacaattgag ttggatcaga aataagatga tacagagttt gacaatg
//
LOCUS       ECORTSA      1407 bp ds-DNA             BCT       11-AUG-1990
DEFINITION  E.coli rts gene, complete cds.
ACCESSION   M36321
KEYWORDS    rts gene.
SOURCE      E.coli DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; 
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 1407)
  AUTHORS   Flamm,J.A., Friesen,J.D. and Otsuka,A.J.
  TITLE     The nucleotide sequence of the Escherichia coli rts gene
  JOURNAL   Gene 74, 555-558 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        294     1304     rts gene product (gtg start codon)
BASE COUNT      383 a    316 c    298 g    410 t
ORIGIN      
        1 gcgcgcattc tatggagaca tgcgagttca tgcaactaaa aaattgcata atttgtttta
       61 ttggtcacat tttatgcgac acgatgaaga aacagccgaa cggagggcag attagcacac
      121 tttttcaaca tcattgtgct caacaatgcg ctcctgctaa accataattc tttttatcag
      181 atggaatatc tgtcacattg cttttcaacg atagcttcct ggcagagatt ttttcttatt
      241 attcctcccc atctggtgtt accctcctgc ccattaaccc attcaacaga actgtgacgc
      301 gccatggcaa atatcgcttt gccgatagag ctatgaccgc cagaaacatg cttatgagta
      361 taaaagagca aacgttaatg acgccttacc tacagtttga ccgcaaccag tgggcagctc
      421 tgcgtgattc cgtacctatg acgttatcgg aagatgagat cgcccgtctc aaaggtatta
      481 atgaagatct ctcgttagaa gaagttgccg agatctattt acctttgtca cgtttgctga
      541 acttctatat aagctcgaat ctgcgccgtc aggcagttct ggaacagttt cttggtacca
      601 acgggcaacg cattccttac attatcagta ttgctggcag tgtcgcggtg gggaaaagta
      661 caaccgcccg tgtattgcag gcgctattaa gccgttggcc ggaacatcgt cgtgttgaac
      721 tgatcactac agatggcttc cttcacccta atcaggttct gaaagaacgt ggtctgatga
      781 agaagaaagg cttcccggaa tcgtatgata tgcatcgcct ggtgaagttt gtttccgatc
      841 tcaaatccgg cgtgccaaac gttacagcac ctgtttactc acatcttatt tatgatgtga
      901 tcccggatgg agataaaacg gttgttcagc ctgatatttt aattcttgaa gggttaaatg
      961 tcttacagag cgggatggat tatccacacg atccacatca tgtatttgtt tctgattttg
     1021 tcgatttttc gatatatgtt gatgcaccgg aagacttact tcagacatgg tatatcaacc
     1081 gttttctgaa attccgcgaa ggggctttta ccgacccgga ttcctatttt cataactacg
     1141 cgaaattaac taaagaagaa gcgattaaga ctgccatgac attgtggaaa gagatcaact
     1201 ggctgaactt aaagcaaaat attctaccta ctcgtgagcg cgccagttta atcctgacga
     1261 aaagtgctaa tcatgcggta gaagaggtca gactacgcaa ataatttgca ggggagcgaa
     1321 tactcccctt tcttattttt ctgcactacg cagggatatt tcaccgccca tccagggttt
     1381 tattattcca tcctgctcaa gtaataa
//
LOCUS       MUSBCLQ      1056 bp ss-mRNA            ROD       11-AUG-1990
DEFINITION  Mouse complement component subunit Clq B-chain mRNA, complete cds.
ACCESSION   M36293
KEYWORDS    complement component C1q.
SOURCE      Mouse Englebreth-Holm-Swarm tumor, cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 1056)
  AUTHORS   Wood,L., Pulaski,S. and Vogeli,G.
  TITLE     cDNA clones coding for the complete murine B chain of complement
            Clq: Nucleotide and derived amino acid sequences
  JOURNAL   Immunol. Lett. 17, 59-62 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        157      918     complement component subunit Clq B-chain
                                 precursor
    sigp        157      231     complement component subunit Clq B-chain
                                 signal peptide
    matp        232      915     complement component subunit Clq B-chain
BASE COUNT      236 a    320 c    285 g    215 t
ORIGIN      
        1 acagcaaggg aaggttcttc cttcctctag ggacccagac ttccgctttc tgaggacagc
       61 agcaggctct gggctctggg aatccactgc tgtccggcct agaagcatca cagaacacca
      121 ggattccata cacaggaagc ccctgaggct gagctgatga agacacagtg gggtgaggtc
      181 tggacacacc tgttactgct gcttctaggt tttctccatg tgtcctgggc ccaaagcagc
      241 tgcaccgggc cccctggcat ccctggcatc cctggggtcc ctggggttcc tggctctgat
      301 ggccaacctg gcactccagg gattaaaggg gagaaagggc tccctggact ggctggagac
      361 cttggtgagt ttggagagaa aggggaccca gggatccctg ggactccagg caaagttggc
      421 cctaagggtc ccgtcggccc taagggtact ccaggcccct ctggaccccg cggtcccaaa
      481 ggcgattctg gggactacgg ggctacacag aaagtcgcct tctctgccct gaggaccatc
      541 aacagcccct tgcgaccgaa ccaggtcatt cgcttcgaaa aggtgatcac caacgcgaac
      601 gagaactatg agccacgcaa cggcaagttc acctgcaagg tgcctggcct ctactacttc
      661 acctatcatg ccagctcccg gggcaacctg tgtgtgaatc tcgttcgtgg ccgcgatcgg
      721 gacagcatgc agaaagtagt caccttctgt gactatgccc agaacacctt ccaggtgacc
      781 acaggtgggg tagtcttgaa gctagagcaa gaggaggttg ttcacctgca ggccacagac
      841 aagaactccc tcctgggcat tgagggtgcc aacagcatct tcactggctt tctgcttttc
      901 cctgacatgg atgcgtaatc acggggtcaa attacaccta tccaacacca tcttcctgct
      961 cctgcagcaa tcctccctgg acccctgaca tcaccccctt gactgcctga aacccagacc
     1021 agagccctgt agatgttaca gaacgaatgg gtcaat
//
LOCUS       MUSHOX23R    3542 bp ss-mRNA            ROD       11-AUG-1990
DEFINITION  Mouse Hox2.3 protein mRNA, complete cds.
ACCESSION   X06762 M36183
KEYWORDS    Hox2.3 protein; homeo box.
SOURCE      Mouse plasmacytoma cell line, cDNA to mRNA, clones pRO.8, pR1.2,
            and genomic clones L13 and L2.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; 
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3542)
  AUTHORS   Meijlink,F., Verrijzer,P., De Laaf,L., De Graaff,W. and
            Deschamps,J.
  TITLE     A mouse homeobox containing gene on chromosome 11: Sequence and
            tissue-specific expression
  JOURNAL   Nucleic Acids Res. 15, 6773-6786 (1987)
  STANDARD  simple staff_review
REFERENCE   2  (bases 1184 to 2206; revises [1])
  AUTHORS   Verrijzer,P., De Graaff,W., Deschamps,J. and Meijlink,F.
  TITLE     Nucleotide sequence of the Hox2.3 gene region
  JOURNAL   Nucleic Acids Res. 16, 2729-2729 (1988)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept       2187     2840     Hox2.3 polypeptide
    mRNA       2100     3373     Hox2.3 mRNA
    site       2595     2774     homeo box
    site       3371     3371     hox2.2 mRNA
BASE COUNT      738 a    924 c    951 g    890 t     39 others
ORIGIN      Chromosome 11.
        1 gagctcggcc ccgcgagcga cgcaggagct gggcctccca cagcagcgtc ccccgccgcg
       61 ccagtccccg ctagtggtag tatctcgtaa tagcttctgt gtgtgagcta ccgtggatct
      121 ccttcccttc tcttgggggt ccggggggaa aaaaagaaaa ggattttaag caaggactcc
      181 ctcgtcctgc gagggtgatc gactgcggcc tggcagaacc ccctcgcccc cgccccatgt
      241 aaaaaagcct ccttgtgcaa tggtctgttt cctttgaacg tgcttctttg taatgaccga
      301 ggtaccgatt tctgctaagt tttcccaaca acatgaaact gcctattcac gccgtaattc
      361 tttctgtctc ccgctcactt tctctctttc tctcgctctc tttctctcac cgcgtcccca
      421 tctttcctcg caaccccctc tccccgctgc cctccctagc tggctttctc tcttgcttct
      481 ctcttttcct cctgtccccc cacccccacc ccctttggtt tgacaatttt gtcttaagtg
      541 tttctcaaaa gagattactt tagttagcat gcgcgctgtg agcattgtta aaagtgttct
      601 taggtttact gtgaagagaa tgtatcctgt atctgtgaat tgctttatgg gggggaggga
      661 gggctaatta tatattttgt tgttcctcta tactttgttc tgttgtctgc gcctgaaaag
      721 ggcggaagag ttacaataaa gtttacaagc agagaacccg agnnnnnnnn nnnnnnnnnn
      781 nncccgggcc gccgctcctc attcgctcct aggcgccttg cagggctggg ggtggggggg
      841 agctggtcag caggctcctg ggctggccta ggctaggtcg ctgagaggag ggggcggggg
      901 cgggggctgg aagcaggtgg tgcgagtccc tgggcccagg ggcgcagggg gtgagggagg
      961 cggctgaacg tgattggagg agagaggatc gagggagggg agccaagaga aaccccctcc
     1021 ccttgcattc tgaggctgaa ggaccaggga gactccagcg cccaggccgc tcttgggaag
     1081 agatctaccc aggctggtgg ctagtgtccc ccgccgcttt tctctttgtt tccgtgtgtg
     1141 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tttggggnnn nnnnnnnnnn
     1201 nnnnnngggg tgagaaagat gcagcgcaag acttctgagt ttgtatttcc attttttcct
     1261 tggactcagg ttggggaaac aggagcagag ggaagcggtt cctccctacc tcccctctct
     1321 gggacgtcgt cactttctcc gacgtttcta ggcctcggct tgccgcagcc ttgcttcctt
     1381 cgttgcttct gccttcctgg cagccacgct ccagtgagtg aggcatccgc cttccggaac
     1441 cgggaaagca aagagccgga cccaagcctc cttcctccct tcctttttct cccagccccc
     1501 attccattct ttttaaattt tgtatatctt tttattgtca tcagaaatct gagcgtccaa
     1561 cgccttattg ggttggatct ctgccttagg gacgccttgg tcacatctag ttactacaac
     1621 tggggcacta agacaaccgg gaggccaggc tcgcctcctc ttgggagaag agcagcagct
     1681 cggatgaatt aacccaaatt aataaatatt cggccagcac ccaccgacca agttgcgaac
     1741 attcaatccc tgcgtctctc tggctctgta accggctggg ggaaatgggt gggggatgac
     1801 aacacggttc cctcagaggt tatttatttt ctcttccact caattccttc ttccccaaat
     1861 ctcgcctgca agctgcctcc agcccgcggg ggtcgacagc ggcccttaag cccccagccc
     1921 caatccgcag agctcggcct tcccattcat tattgatcat attttataaa tcgaacgcga
     1981 cacaattttt tccacattac tgggagcctc cgggaggccg tcatagcatt ggccgagggg
     2041 atatcacgtg ggccggggtc acgtggtcag aagaggaaaa agggggtcct tttggtgtaa
     2101 atctggactc taattctgta atatatcaag gaatctcgta aaaccgacac taaaacgtcc
     2161 ccgactacaa atcatccggc caaattatga gttcattgta ttatgcgaat gctttatttt
     2221 ctaaatatcc agccgcaagt tcggttttcg ctccaggagc cttccccgaa caaacttctt
     2281 gcgcctttgc ttccaacccc cagcgcccgg gctatggagc aggtccgggc gctcctttct
     2341 ccgcctcggt gcagggtctg tactccggcg ggggggccat ggcgggccag agcgcggctg
     2401 gcgtctatgc ggccggctac gggctcgaac cgagttcctt caacatgcac tgcgcgccct
     2461 ttgagcagaa cctctccggg gtgtgtccgg gcgacgccgc caaggccggt ggcgccaagg
     2521 agcagaggga ctcggacttg gcggccgaga gtaacttccg gatctacccc tggatgcgaa
     2581 gctcagggcc tgaccgaaag cgaggccgcc agacctacac gcgctaccag accctggagc
     2641 tggagaaaga atttcactac aatcgctacc tgactcggcg gaggcgcatc gagatcgcgc
     2701 acacgctctg cctcaccgaa agacagatca agatctggtt tcagaaccgg cgcatgaagt
     2761 ggaaaaagga gaacaaaacc tcaggaccgg gaaccaccgg ccaggacaag gcggaaggcg
     2821 aggaggagga ggaagagtga gggacagaga aagccaagag gaggaaagcg aagagaggga
     2881 gaacccaatt atgggaactg aagcatgaaa ctcaaataaa ggggcaaact atttaaatga
     2941 agaggtctaa aaataataag gaaagagaag ggtgaaattt gggtttctta acactgtaaa
     3001 aaaatatact acctatggga aagtgtgttg tctgtttttg tacagtatcg gaaggacatt
     3061 atctacctgt tctgtggctt tctggaatgt gcctcccctt ttctatgttg ctagtaaggt
     3121 ctttgtaaaa tcttgctgtt ttgtaagccc tctttgaagc tgtctttgtg aactgtggtt
     3181 ccagatgaac agattagtgc ggttccttac ctaccccata cttcccagta gcagtactag
     3241 aggggttgta tggagccctg aggatccact agcttctgcg tctggtgcat tttggctgct
     3301 gtttctaggt cctattcacg agcctctttc tgtatatctg aaggatggaa aataaaacag
     3361 gattaaatat caacagacac gctgtgtttt ctgtctccct ctggctgggg gaggtgtaag
     3421 ggttggtagt gggcgagtgt cccagagccc ctctgctggt ctttttctgt cggtgctggg
     3481 tttgctttta ttttccagat agcctctgct atagagactg ccagctagtg aatccccaac
     3541 at
//
LOCUS       POLVP1A       900 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Poliovirus type 3 VP1 gene, partial cds.
ACCESSION   M37321 M36161 M36162
KEYWORDS    .
SOURCE      Poliovirus type 3 (P3/Leon/1947) RNA.
  ORGANISM  Poliovirus type 3
            Unclassified.
REFERENCE   1  (bases 1 to 900)
  AUTHORS   Minor,P.D., Evans,D.M.A., Schild,G.C., Ferguson,M. and Almond,J.W.
  TITLE     Identification of an antigenic site in the neutralization of type 3
            poliovirus
  JOURNAL   Rev Infect Dis 6, 516-518 (1984)
  STANDARD  simple staff_review
REFERENCE   2  (bases 34 to 900)
  AUTHORS   Almond,J.W., Cann,A.J., Minor,P.D., Reeve,P., Schild,G.C.,
            Hauptmann,R. and Stanway,G.
  TITLE     Nucleotide sequence from neurovirulent and attenuated strains of
            type 3 poliovirus
  JOURNAL   Rev Infect Dis 6, 487-493 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept    <     1  >   900     VP1 (AA at 1)
BASE COUNT      238 a    250 c    206 g    206 t
ORIGIN      
        1 ggtattgaag atttgatttc tgaagttgca cagggcgccc taactttgtc actcccgaag
       61 caacaggata gcttacctga tactaaggcc agtggcccgg cgcattccaa ggaggtacct
      121 gcactcactg cagtcgagac tggagccacc aatcctctgg caccatccga cacagttcaa
      181 acgcgccacg tagtccaacg acgcagcagg tcagagtcca caatagaatc attcttcgca
      241 cgcggggcgt gcgtcgctat tattgaggtg gacaatgaac aaccaaccac ccgggcacag
      301 aaactatttg ccatgtggcg cattacatac aaagatacag tgcagttgcg ccgtaagttg
      361 gagtttttca catactctcg ttttgacatg gaattcacct tcgtggtaac cgccaacttc
      421 accaacgcta ataatgggca tgcactcaac caggtgtacc agataatgta catcccccca
      481 ggggcaccca caccaaagtc atgggacgac tacacttggc aaacatcttc caacccgtcc
      541 atattttaca cctatggggc tgccccggcg cgaatctcag tgccatacgt ggggttagcc
      601 aatgcttact cgcactttta cgacggcttc gccaaggtgc cattgaagac agatgccaat
      661 gaccagattg gtgattcctt gtacagcgcc atgacagttg atgactttgg tgtattggca
      721 gttcgtgttg tcaatgatca caaccccact aaagtaacct ccaaagtccg catttacatg
      781 aaacccaaac acgtacgtgt ctggtgccct agaccgccgc gcgcggtacc ttattatgga
      841 ccaggggtgg actataagaa caacttggac cccttatctg agaaaggttt gaccacatat
//
LOCUS       YSCATRP3A    1240 bp ds-DNA             PLN       11-AUG-1990
DEFINITION  S.cerevisiae glutamine amidotransferase (TRP3) gene, 3' end.
ACCESSION   M36300
KEYWORDS    TRP3 gene; glutamine amidotransferase.
SOURCE      S.cerevisiae DNA.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; 
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 1240)
  AUTHORS   Aebi,M., Furter,R., Prantl,F., Niederberger,P. and Huetter,R.
  TITLE     Structure and function of the TRP3 gene of Saccharomyces
            cerevisiae: Analysis of transcription, promoter sequence, and
            sequence coding for a glutamine amidotransferase
  JOURNAL   Curr. Genet. 8, 165-172 (1984)
  STANDARD  simple staff_review
FEATURES       from  to/span     description
    pept        401  >  1240     glutamine amidotransferase (TRP3)
    mRNA        309  >  1240     TRP3 mRNA (alt.)
    mRNA        314  >  1240     TRP3 mRNA (alt.)
    mRNA        320  >  1240     TRP3 mRNA (alt.)
BASE COUNT      352 a    267 c    259 g    362 t
ORIGIN      
        1 ttcacttttt ttgcttctta taggcctttt tgaactattt tctgttaggg gcttaaagaa
       61 agcgcagcaa atggctgatt ttaaatgctg ttttctcgaa agcaatgctg atgatgtacc
      121 acaaagtagt atttgcgctt agcttgtcga tataagtatg cctttgagcc agtattaaac
      181 ttcctactat ctcttctgga cctcaccgtg ctattcatat gactgacgac ggaaaatgcc
      241 tcttgtagta atcattgaat gatcatcaat ggctcgttga ctcattctaa tctatagcag
      301 atattaaggt acgcaaaaag ttcgacaagg agctggtcag tattaacaca ttaccaaatc
      361 tgtttggttc tacatagaac gccataaaag taagaaaaaa atgtctgtgc acgctgcaac
      421 aaacccaatc aataagcatg tggttctaat tgacaactac gattccttta cctggaacgt
      481 ttacgagtac ttgtcccagg agggcgccaa agtgagcgtc taccgtaacg atgcaattac
      541 agttccagaa attgccgcct tgaatcccga cacattgctt atctcgcttg gactaggcca
      601 cccaaagaca gattctggca tttcaagaga ctgtatccgg tattttactg ggaaaattcc
      661 tgtatttgga atctgtatgg gccagcaatg catgtttgac gtatttggtg gtgaagttgc
      721 ctacgctggt gagattgtcc acggtaaaac gtccccaatc tctcacgaca actgtggaat
      781 tttcaagaac gtgccgcaag gtattgctgt gacaagatac cattcattgg ccgggacaga
      841 atcgtcccta ccatcctgct tgaaggttac tgcgagtacc gaaaatggaa ttatcatggg
      901 tgtaagatac aagaagtaca ctgtagaagg tgtgcaattt catccggaat ccatcttgac
      961 cgaggaaggt catctgatga tcaggaacat tttaaatgtc agtggaggca cttgggagga
     1021 aaacaaatca tctccttcaa attctatttt ggaccgtatc tatgctcggc gtaaaataga
     1081 cgtcaatgag cagtctaaaa tcccaagttt cacctttcaa gacttacaat ctaactatga
     1141 tttaggtctt gccccaccgt tacaggattt ctacacggtg ttgtcatcat cccataaaag
     1201 agccgttgtt cttgctgaag tcaagcgtgc ctctccatcg
//